Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_017471 | Lactobacillus amylovorus GRL1118 plasmid p1, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
NC_017470 | Lactobacillus amylovorus GRL1118, complete sequence | 3 crisprs | cas14j,cas2,cas3,DEDDh,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,DinG,csa3 | 0 | 12 | 5 | 0 |
NC_017472 | Lactobacillus amylovorus GRL1118 plasmid p2, complete sequence | 1 crisprs | NA | 0 | 5 | 5 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_017470_1 | 540951-541134 | Orphan |
NA
Consensus repeat of NC_017470_1
|
2 spacers
spacers of NC_017470_1
>1.1|540975|60|NC_017470|PILER-CR CCTCCTTTGATTGGTCCATTGGAGCAGTGGTTTATCTCGCCTCCCTGTCACGGAGGAGAT >1.2|541059|56|NC_017470|PILER-CR GGACCGTAATATGGCGGAATTGGTGAAGGGGTTAACACACTGGTTTGTGGATCCAG |
CRISPR arrays and Neighbor proteins around NC_017470_1
The CRISPR arrays of NC_017470_1 >merge|NC_017470|1|540951-541134|PILER-CR CGCGCGTGGGTTCAAATCCCACATCCTCCTTTGATTGGTCCATTGGAGCAGTGGTTTATCTCGCCTCCCTGTCACGGAGGAGATCGTGGGTTCAAATCCCACATGGACCGTAATATGGCGGAATTGGTGAAGGGGTTAACACACTGGTTTGTGGATCCAGCATGCGTGGGTTCGAATCCCACAT >NC_017470|1|1|540951-541134|PILER-CR CGCGCGTGGGTTCAAATCCCACAT CCTCCTTTGATTGGTCCATTGGAGCAGTGGTTTATCTCGCCTCCCTGTCACGGAGGAGAT CGTGGGTTCAAATCCCACATGGAC CGTAATATGGCGGAATTGGTGAAGGGGTTAACACACTGGTTTGTGGATCCAGCATG CGTGGGTTCGAATCCCACAT
>NC_017470.1|WP_013437368.1|540291_540792_+|GNAT-family-N-acetyltransferase MIIKPLISEDEARQTSKLFQKCWQDVYKGILPAEFLDNIPENAWVKRFNESGRHNLIFEDDKNIVRAAVSYGRPRDTRMLGCGELMALYVEPDFQGYNVGKTLLNAAENELKKMGYGKIYLWCLDGNEKAQGFYEHFGWRNIATERFVEIVGKEYKYLLYQKNLRD >NC_017470.1|WP_014565643.1|538935_540282_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MKKNQIVDLEITDLSYEAMGVAHYEGLTVFVTNALPGEIVSAKILKVKKNFAFAKIEKIKKESPDRVKIKLNQWVQTGLASLAHIKYDKQLEFKRNQVVNLLKKAHLDDIEVGQTLPSPEQTGYRNKAQVPVREINGQLEIGFFRRHSHDLVPLTNFFTTDPEIDRVLVAVRDILRKYRVPAYDEINHKGEVRYLDVRRSKATGEIMVILVCLHNDFPQLLGVAAEVSQIPGVTGLILNHNPKKTNVILGKKDYLVLGNDQITDQIGDLKFRISPQSFFQINSLQTPRLYDLAIKKADLKPSDVVIDAYSGIGTIGLSVAKHVKAVRGIEVVRDAIKDAKDNAKLNDIDNAKYYLGKAEEIMPRWAKQGLKTDVVFVDPPRKGLTPEFIDATAKTGPEKVVYISCNPATMVRDLQLFQEQGYEFDRIDPVDMFPQTPHVEAVTVLTKK >NC_017470.1|WP_013437366.1|537189_538566_-|Na+/H+-antiporter-NhaC MEKKKVSFAESIIILIVLLAILGVSVIKFGLSPEVPVLFTVLLLTFWARLRGFSWQDVQNGIKEGIGVAIIPIFIFMLIGALIGVWIKAGIIPSIMVLGFNMISGSFFVPSVFIVCSIVGVAIGSGFTTISTVGIALFGIGSSMGANPALVAGAIISGAVFGDKMSPLSDSTNLSSAVAESELFSHIKNMMWSTIPSFVVSLILFWILGNSGHMDPTKIERTSQVLQNNFTISWWALLPIVLMLICAWRKIPAIPTLFMNIAITVVMIFIQSPHESAQSLNNLIMNGFVAKTSDASVNALLTRGGISSMMATVALIISTLSLGGMLMKFNIVQSAMEPLVKHLNKPGRLITVTILSGICINLFVGEQYLSVILPGRAFKPAFDKIGLSPLALSRVLEDGGSVINYLIPWGVAGSFAASALGVPVLQFLPFVFFSLLSPVFSIFSGVTGIGLKWAKKNK >NC_017470.1|WP_014565642.1|536790_537138_+|hypothetical-protein MDKKYTDIEVRGERSDHPDKSYAADEVRKLFFTENAKKKYDILTGSQKTFIDRELDDLRLSRSSSVSRKDNSELQQEIVFEEQNNQVIVTDILYDDYRNSKEYKKAQVRMYDMNN >NC_017470.1|WP_013437364.1|535939_536650_-|Bax-inhibitor-1/YccA-family-protein MDNMNFSSPERRQVHDVSEVNGFLSKMYSYMGLAVLVSAITAFLTMTVFRAAVMQMPTALMWIILIVPLGLSMGISFRATRNPVAGFVMLMILAVIYGFEFALLAGFYTGAQISTAFLSSAAVFGAMAIFGTFTKRDLNNLGSYMGAALIGLLVAMIVNIFLRNSVASFVFSIIGVIIFTGLTAYDAQKMKSIYNNYGSQVPTNGLAVLGALQLYLDFINIFLFLLQIFGMGNDRN >NC_017470.1|WP_013437363.1|535640_535889_+|hypothetical-protein MQRVGTMAGNPQLKLTEKERTLMTINVQVFVHSMIGLVEVLLNYGDILLPYDMRQSIMAFLHQSPVELMSSMVPNKEEKTEQ >NC_017470.1|WP_013437362.1|534465_535386_+|diacylglycerol-kinase-family-lipid-kinase MTSKARLIYNPVSGHEQMPKNVADILDVLEQAGYEASAFRTTPEQNSARNEATRAAKEGFDLIVAAGGDGTINEVVNGIAFLDKRPKMAIIPAGTTNDYARALAIPRDNIPDAAKVILKNKTRKMDIGKAVFGDQTQYFVNIAASGSLTELTYGVPSEVKSALGYAAYLIKGAEMLPHLTENEMRLTYDDGVYEGKLSMFLLGMTNSIGGFEQVMPDAQLSDGLFQLIVVKPSDPVSMMKLMALALNGKHVDDPNIIYTKTRSLKAELIGKNSGRDLPVNLDGEIGGYCPVEFHNLQQRIEFYVGK >NC_017470.1|WP_013437361.1|533009_534440_+|Asp-tRNA(Asn)/Glu-tRNA(Gln)-amidotransferase-subunit-GatB MNFKSTIGLEVHFELKTKSKIFSPSPVTYGAEQNTETNVIDWAMPGTLPMVNKNVYRLGIMVAIATHAHILPTTHFDRKNYFYPDNPKAYQITQFFQPLARDGYIEVEVRGKKKRIGIHEMHIEEDAGKNTHGTNGFSYVDLNRQGVPLLEVVSEPDMEDPEEAYAYLEKLRKIVQFTGASDVKMEEGSMRVDTNISIRPAGQKELGTKVEMKNLNSFDHVRRSLAYEEKRQEQVLLAGGHIQLSTRRFDEATGKTVLERVKEGDSDYRYFPEPDIAPDHISQEWIDQIAKELPKSPFDRYDDYVNKYGLKPYDANVLLQTKESSDFFDAAVAAGADPTLAANWMNTQVNGYLNDHRVGLNDIKLTPEHLAEMIQLIKDGTISSKIAKKVFAETIANGTDPKKYVEDNGMVQLSDTSVLAPMVKKVVDDNPQSVEDFKNGKDRAIGYLVGQIMKQTRGKANPKMVNKLLNQELQSR >NC_017470.1|WP_013641607.1|531565_533005_+|Asp-tRNA(Asn)/Glu-tRNA(Gln)-amidotransferase-subunit-GatA MNYLNEDIDSLNKKLASGDLSADKLAKDTVANIKDTDKKLNAWITVLDDAKPAENLDYSKSKLAGIPIAIKDNIITNGIKTTAASHILYNYMPMYDATVISKLKKAGVTLVGKTNMDEFAMGSSTEHSYYGATHNPWNLDKVPGGSSGGSAAAVAGGQVVAALGSDTGGSIRQPAAFNGIFGIKPTYGRVSRWGLIAFGSSLDQIGVMTKRVKDSAEVLNVIAGADEHDSTVSTREVPDFTKFIGQDVKGLRVAVPKEYMEAVSGEMREVIQKQIDTLKDAGAIINEVSLPHTKYVVPDYYIIASSEASSNLQRYDGIRYGYRAKDTKNLLDVYVKSRSEGFGTEVKRRIMLGSFALSAGSYDRFFRQAAKVRTLICNDFDKIFAENDVIVGPTTTEPAFGIGEEVSDPIKMYNNDILTISANLAGIPAASVPAGLVDGMPVGLQIMAKRFDEGNVFKTADFIERSNKFYEKTPTGMED >NC_017470.1|WP_013437359.1|531257_531566_+|Asp-tRNA(Asn)/Glu-tRNA(Gln)-amidotransferase-subunit-GatC MEITKDTIKHVATLSRLAFNEEELDKFTDQMGSIINMADQLSEVDTEGVDETVQVVDRDTVFREDKPEHWQGQTRETLMANVPEKANGYIKVPVIINKDEDE >NC_017470.1|WP_014565644.1|541291_542791_+|MFS-transporter MNKKQVTMVTIALMLGNVMSGLDGTIINTAIPAIVASLHGIQFMGWIVAIFLLGMSISIPIWTKVGEKITNKRAFEISLALFVIGSALQGMAPNIIFFLCSRFIMGVGAGGMGSLPYIIAGYVFKNIKTRTKVLGYLTASWNGAAILGPLIGGWLIDAFSWHWVFYINIPIGLIAFLICLIYYKPVTPKQTPVFDIPGASLLVIGLLPFLMGVQLVGLTASWIVISLIIVSLVFIVLFFIRENHAQNPIIPVSLFKNKDLDGDFLLFAFTWGAFIAVNTYLPMWAQALLGLSALLGGMTLIPNSIVEIIASQSVVAIQDHLTTFKLVFIGIFAMLISSAGMFFADLHMPIQLLAAIGAFSGIGVGFIFVALQLKVQIDAGLKNMATATSTSYLIRILAQTVMAAVYGVIMNLNLASGVSSHPGITITMMNKLSDAKSAKLLPQNLVPTMRNILHSGIKEIMLVSVILLVIALVLNFYFNFGKKTEETAIVNEKANSDWD >NC_017470.1|WP_013437370.1|543435_545208_+|oleate-hydratase MHYSNGNYEAFVNASKPKDVDQKSAYIVGSGLAALASAVFLIRDGHMKGERIHIFEELGLPGGSMDGIYNKQKESYIIRGGREMEPHFECLWDLFRSIPSPENKDESVLDEFYRLNRRDPSYAKTRVIVNRGEALPTDGQLLLTPKAVKEIVDLCLTPEKDLQNKKINEVFDKEFFQSNFWLYWSTMFAFEPWASAMEMRRYLMRFVQHVATLKNLSSLRFTKYNQYESLILPMVKYLKDHGVQFHYDTVVDNVFVNRSNGEKVAKQIILTENGEKKNIDLTENDLVFVTNGSITESTTYGDNLHPAPEEHELGASWQLWKNLAAQDEDFGHPEVFCKDIPKANWRMSATITFKNNDIVPFIEAVNKKDPHSGSIVTSGPTTIKDSNWLLGYSISRQPHFKAQKPNELIVWLYGLFSDTKGNYVEKTMPDCNGIELCEEWLYHMGVPEERIPEMAAAATTIPAHMPYITSYFMPRALGDRPKVVPDHSKNLAFIGNFAETPRDTVFTTEYSVRTAMEAVYTLLDIDRGVPEVFASAFDVRMLMNAMYYLNDQKKLKDLDLPMPEKLAIKGMLKKVKGTYVEELLKKYKLI >NC_017470.1|WP_081456847.1|545397_545874_+|GNAT-family-N-acetyltransferase MYMKAFPEWERFSMFSLLAMSLHRNVKFHAIYDDGKFCGITYYAENDNTVYLTYLAVSEKLRGQGYGSKILTMLEDNFPDKQIVIDIEPVTKKVKNYKQRVSRLKFYERNGFHRTDQKLKDPDGEFEALTTGERLDKNSFIKILRQMSFGFYQARVEK >NC_017470.1|WP_014565645.1|545963_546452_+|DUF3955-domain-containing-protein MEFSDQIKQLRKENNLSQVQYAKKLHVTRQAVSNWKNNRNLLDLEMLIEINRVFHISLDQLILGDDNMNKMTQKLIKDTDENRKAKYNMITTLIGGFLMIVGFVCFFIKANSVEYVDKQGFLHENFYLILVGYLFLFAGIIVLIAGGIVYLRNKHKHKKRAP >NC_017470.1|WP_014565646.1|546470_547286_-|PTS-system-mannose/fructose/sorbose-family-transporter-subunit-IID MTKANTKTNSGKLTKRDLFRANWRWLWGSQLSWNYERMMAPGYFYAVLPFLKRWYKDDELVEMMQMQTQFFNVNAYVGNFIIGVDLALEESQGIKSKDTVAGIKTGLMGPLAGIGDTIFSAIIPTICGSIGAYMGLRGNPLGSILWILVDLIILFLRFSFLPMGYYQGTKLIDSASGKLNAITDSAILLGVTVVGALIPTVIKAKVPYVFHTGKITLKMQTILNQIMPSLVPVLLVTLVYWLLGKKGVTSTKMIWFVLILGIILSYFHILG >NC_017470.1|WP_014565647.1|547260_548073_-|PTS-sugar-transporter-subunit-IIC MTIAWWQILLLTCLAFWVIIDQLTVSILNNPLAIGMVSGIIMGDITTGLAVGSTLQLMVLGVSTYGGASMPDFMTGAIVGTVYAVLSGKGIQFAIGLAVPVGLLMVQLDVLARFINTIFQHRMDKFIKENNPDAAARNALWGTFSWGLSRAIPVFILLIVGNDVVRMILHIIPTWLTNGLKVSGGILPVVGIAILLRYLPTKRFISYLAIGFIAASYMKIPMLGVALLGAALAYIHYQREVAKLEEKPATTNTNNTESEEYENDEGEYEN >NC_017470.1|WP_014565648.1|548083_548995_-|PTS-mannose/fructose/sorbose-transporter-subunit-IIAB MANFLLVSHGEYAKATKASVEMIAGEHKNVKAIAFKQTMNQDDLLEEITKAASEFDEAPTILVDIAGGTPANTAQRYQQKHPDVAVYSGLSMPLLLAVVMGTPIDEAIKQAIDNMAPVGLTKKKEEPKKTIKKEETPNKNVTLTPHTMQNVRIDERLIHGQVATMWTNALKLTRIMVVGDDIVKNDVLKTGLKTACPHGVHLSILTAHGAARRINSGKYVGQTVLLLVKNPGVLRQLVDFDVKLPEINVGNMSTKPHSRQVAKSVAVLDKDVEDFEYLDQKGCHIYHQMVPSEPKEDFMEMIK >NC_017470.1|WP_014565649.1|549339_549747_+|hypothetical-protein MGIKKVEVTSAIALALSAVALVGTQTNNKVQAASSNVENSSVVKESSNADIATIQKNYQVAQDQYKKANDAWNQIQQSENSKLNQAETNAEQAKANYDSQVKLNEQAKAENETAQNNLDQAQKVKEQAEKMLKRL >NC_017470.1|WP_014565650.1|549743_552128_+|SLAP-domain-containing-protein MNGGLDKANSARDAKQKEVDSAQSDLNKTQSDAKKKQEEVNKDQQDFDNKSKAVSDDTQKLNQANSELQTKTDAKNAADTAVDQANEEAKKNPDYKSASDQYESATDKLNEAQKNKDAADKALSEANEAVNTATSNQKEKQDAADGAKNGLTQAQKNKDAADKALNDANDGVKTTTATQKEKQTAVDEAQDALTQAQKGKETADKALSEANNGVKTTTATKKEKQTAVDEAQKSLTQAQKNKDVADKALSEANEAVNTATATQKEKQTAVDNAQKSLTQAQNSKDAADKALSDANEAVNTATATQKEKQTAIDEAQDALTQAQKGKETADKALSEANEAVNTATATQKEKQTAADEAQDALTQAQKAKDAADKALSAANDAVKTAIATRSAKQTAADKAKNEFDGITSQYNAAKDIFEKAQISEKNVEAALNKQKQATQELANAVKAQKQATDNLRKDTIAQQVLATAKTQQQTLDFAVKQKSDKLNNLQTELDQLKQTATALQKKVDDFTNATNALIIANQEAETAAQNAKDAQEKLQELDKINENAQSNLASVQKEYNDTIARAQSALKEAKSALDNAQAELTNAQLAEAQANKQNENNYSQTIGDENSSSSNNTNVNVANNNSDSQFVTDNNATNEHQGSILTPTTKTSNADKESKAVRVIRKAYVYTSNHKVAKKNGKKISLKKRSLIKVLDNAKVYRIKGGRFYRIGKNRFVKVGNVEKFTIQINMHATIAGRKNRKVHVSNSNGKHINKYVIAGHNYRFDRKKVIKGEVYYRIANKDQWIKENKLIFK >NC_017470.1|WP_014565651.1|552295_553132_+|aldo/keto-reductase MENFVTLNNGVKMPRLGFGVYQIDDLAQTQQVVEDGLEIGYRLVDTAQIYGNEQAVGDAIKRSNVPREDIFVTSKIWVNDYGYDNTLKAFDDSMKKLQLDYIDLYLIHKPYNDYYGTWRALERLYKEGRIRAIGVSSFWNERLADLITFNDVKPAVNQIETNVWNQEWKSQKYMEKEGVQPEAWAPFAEGANHIFTNPVLEEIAAKHHKTTAQVMLRWFLQRNYVVIPKSVHKKRLAENFDVFDFELDAEDMKKIKTLDQGHSILEDEMDPEIVESFR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_017470_2 | 1001645-1002709 | TypeI-E |
I-B,III-A,III-B
Consensus repeat of NC_017470_2
|
17 spacers
spacers of NC_017470_2
>2.1|1001673|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT CAAAAACAGCTTTAGCACCAGCACTATGGTAAG >2.2|1001734|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TACTCCCCGAGCTTTTAACCGACGTCGCTTTAA >2.3|1001795|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TCATCTATTCGTTGCTTAAAAATTTTTCGTTGT >2.4|1001856|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TATTCGGATCGTATGGTCAATTTGCGATTTATA >2.5|1001917|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TTATTCCGTCTGCGTAGTCATAGCCACCAACAA >2.6|1001978|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TAATCAGATTCTAGGAAGGAGGAAAACATGGCA >2.7|1002039|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TAATCAGATTCTAGGAAGGAGGAAAACATGGCA >2.8|1002100|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TATCAGTAACATAGTTGTCCGTGATAGCAGATT >2.9|1002161|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT CGTGGAACGCTTACGGTAACACCGTCAATCGAG >2.10|1002222|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TCGGCATTGTGGGATGCCAGCGCTGGGCTTTAT >2.11|1002283|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT CATGGACCACTTGGTTGAAGCCAGCACTAAGCT >2.12|1002344|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT CCAGTCCGACTACCACCAGCTCAAAACAGTGGG >2.13|1002405|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TGTTTGACCAAGTTTTACAGACTTTAATAATGG >2.14|1002466|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT CGTGCGGCGCCACTCGTTTGGCGTGCGGTAAAA >2.15|1002527|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TATCCTGAATATTTGCCTATTAATGGGGAATGG >2.16|1002588|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT CGCATTGCAACGCTTGTGGAGTGATACGGCAAC >2.17|1002649|33|NC_017470|CRISPRCasFinder,CRT CTATTCAATTATCAAGCATAACTAGTTGCTAAA |
cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,DEDDh |
CRISPR arrays and Neighbor proteins around NC_017470_2
The CRISPR arrays of NC_017470_2 >merge|NC_017470|2|1001645-1002709|PILER-CR,CRISPRCasFinder,CRT GTGTTCTCCACGTAAGTGGAGGTGATCCCAAAAACAGCTTTAGCACCAGCACTATGGTAAGGTGTTCTCCACGTAAGTGGAGGTGATCCTACTCCCCGAGCTTTTAACCGACGTCGCTTTAAGTGTTCTCCACGTAAGTGGAGGTGATCCTCATCTATTCGTTGCTTAAAAATTTTTCGTTGTGTGTTCTCCACGTAAGTGGAGGTGATCCTATTCGGATCGTATGGTCAATTTGCGATTTATAGTGTTCTCCACGTAAGTGGAGGTGATCCTTATTCCGTCTGCGTAGTCATAGCCACCAACAAGTGTTCTCCACGTAAGTGGAGGTGATCCTAATCAGATTCTAGGAAGGAGGAAAACATGGCAGTGTTCTCCACGTAAGTGGAGGTGATCCTAATCAGATTCTAGGAAGGAGGAAAACATGGCAGTGTTCTCCACGTAAGTGGAGGTGATCCTATCAGTAACATAGTTGTCCGTGATAGCAGATTGTGTTCTCCACGTAAGTGGAGGTGATCCCGTGGAACGCTTACGGTAACACCGTCAATCGAGGTGTTCTCCACGTAAGTGGAGGTGATCCTCGGCATTGTGGGATGCCAGCGCTGGGCTTTATGTGTTCTCCACGTAAGTGGAGGTGATCCCATGGACCACTTGGTTGAAGCCAGCACTAAGCTGTGTTCTCCACGTAAGTGGAGGTGATCCCCAGTCCGACTACCACCAGCTCAAAACAGTGGGGTGTTCTCCACGTAAGTGGAGGTGATCCTGTTTGACCAAGTTTTACAGACTTTAATAATGGGTGTTCTCCACGTAAGTGGAGGTGATCCCGTGCGGCGCCACTCGTTTGGCGTGCGGTAAAAGTGTTCTCCACGTAAGTGGAGGTGATCCTATCCTGAATATTTGCCTATTAATGGGGAATGGGTGTTCTCCACGTAAGTGGAGGTGATCCCGCATTGCAACGCTTGTGGAGTGATACGGCAACGTGTTCTCCACGTAAGTGGAGGTGATTCCTATTCAATTATCAAGCATAACTAGTTGCTAAAGTGTTCTCCATAAATGTGAAGGTATTAT >NC_017470|2|2|1001645-1002648|PILER-CR GTGTTCTCCACGTAAGTGGAGGTGATCC CAAAAACAGCTTTAGCACCAGCACTATGGTAAG GTGTTCTCCACGTAAGTGGAGGTGATCC TACTCCCCGAGCTTTTAACCGACGTCGCTTTAA GTGTTCTCCACGTAAGTGGAGGTGATCC TCATCTATTCGTTGCTTAAAAATTTTTCGTTGT GTGTTCTCCACGTAAGTGGAGGTGATCC TATTCGGATCGTATGGTCAATTTGCGATTTATA GTGTTCTCCACGTAAGTGGAGGTGATCC TTATTCCGTCTGCGTAGTCATAGCCACCAACAA GTGTTCTCCACGTAAGTGGAGGTGATCC TAATCAGATTCTAGGAAGGAGGAAAACATGGCA GTGTTCTCCACGTAAGTGGAGGTGATCC TAATCAGATTCTAGGAAGGAGGAAAACATGGCA GTGTTCTCCACGTAAGTGGAGGTGATCC TATCAGTAACATAGTTGTCCGTGATAGCAGATT GTGTTCTCCACGTAAGTGGAGGTGATCC CGTGGAACGCTTACGGTAACACCGTCAATCGAG GTGTTCTCCACGTAAGTGGAGGTGATCC TCGGCATTGTGGGATGCCAGCGCTGGGCTTTAT GTGTTCTCCACGTAAGTGGAGGTGATCC CATGGACCACTTGGTTGAAGCCAGCACTAAGCT GTGTTCTCCACGTAAGTGGAGGTGATCC CCAGTCCGACTACCACCAGCTCAAAACAGTGGG GTGTTCTCCACGTAAGTGGAGGTGATCC TGTTTGACCAAGTTTTACAGACTTTAATAATGG GTGTTCTCCACGTAAGTGGAGGTGATCC CGTGCGGCGCCACTCGTTTGGCGTGCGGTAAAA GTGTTCTCCACGTAAGTGGAGGTGATCC TATCCTGAATATTTGCCTATTAATGGGGAATGG GTGTTCTCCACGTAAGTGGAGGTGATCC CGCATTGCAACGCTTGTGGAGTGATACGGCAAC GTGTTCTCCACGTAAGTGGAGGTGATTC >NC_017470|2|1|1001645-1002709|CRISPRCasFinder GTGTTCTCCACGTAAGTGGAGGTGATCC CAAAAACAGCTTTAGCACCAGCACTATGGTAAG GTGTTCTCCACGTAAGTGGAGGTGATCC TACTCCCCGAGCTTTTAACCGACGTCGCTTTAA GTGTTCTCCACGTAAGTGGAGGTGATCC TCATCTATTCGTTGCTTAAAAATTTTTCGTTGT GTGTTCTCCACGTAAGTGGAGGTGATCC TATTCGGATCGTATGGTCAATTTGCGATTTATA GTGTTCTCCACGTAAGTGGAGGTGATCC TTATTCCGTCTGCGTAGTCATAGCCACCAACAA GTGTTCTCCACGTAAGTGGAGGTGATCC TAATCAGATTCTAGGAAGGAGGAAAACATGGCA GTGTTCTCCACGTAAGTGGAGGTGATCC TAATCAGATTCTAGGAAGGAGGAAAACATGGCA GTGTTCTCCACGTAAGTGGAGGTGATCC TATCAGTAACATAGTTGTCCGTGATAGCAGATT GTGTTCTCCACGTAAGTGGAGGTGATCC CGTGGAACGCTTACGGTAACACCGTCAATCGAG GTGTTCTCCACGTAAGTGGAGGTGATCC TCGGCATTGTGGGATGCCAGCGCTGGGCTTTAT GTGTTCTCCACGTAAGTGGAGGTGATCC CATGGACCACTTGGTTGAAGCCAGCACTAAGCT GTGTTCTCCACGTAAGTGGAGGTGATCC CCAGTCCGACTACCACCAGCTCAAAACAGTGGG GTGTTCTCCACGTAAGTGGAGGTGATCC TGTTTGACCAAGTTTTACAGACTTTAATAATGG GTGTTCTCCACGTAAGTGGAGGTGATCC CGTGCGGCGCCACTCGTTTGGCGTGCGGTAAAA GTGTTCTCCACGTAAGTGGAGGTGATCC TATCCTGAATATTTGCCTATTAATGGGGAATGG GTGTTCTCCACGTAAGTGGAGGTGATCC CGCATTGCAACGCTTGTGGAGTGATACGGCAAC GTGTTCTCCACGTAAGTGGAGGTGATTC CTATTCAATTATCAAGCATAACTAGTTGCTAAA GTGTTCTCCATAAATGTGAAGGTATTAT >NC_017470|2|1|1001645-1002709|CRT GTGTTCTCCACGTAAGTGGAGGTGATCC CAAAAACAGCTTTAGCACCAGCACTATGGTAAG GTGTTCTCCACGTAAGTGGAGGTGATCC TACTCCCCGAGCTTTTAACCGACGTCGCTTTAA GTGTTCTCCACGTAAGTGGAGGTGATCC TCATCTATTCGTTGCTTAAAAATTTTTCGTTGT GTGTTCTCCACGTAAGTGGAGGTGATCC TATTCGGATCGTATGGTCAATTTGCGATTTATA GTGTTCTCCACGTAAGTGGAGGTGATCC TTATTCCGTCTGCGTAGTCATAGCCACCAACAA GTGTTCTCCACGTAAGTGGAGGTGATCC TAATCAGATTCTAGGAAGGAGGAAAACATGGCA GTGTTCTCCACGTAAGTGGAGGTGATCC TAATCAGATTCTAGGAAGGAGGAAAACATGGCA GTGTTCTCCACGTAAGTGGAGGTGATCC TATCAGTAACATAGTTGTCCGTGATAGCAGATT GTGTTCTCCACGTAAGTGGAGGTGATCC CGTGGAACGCTTACGGTAACACCGTCAATCGAG GTGTTCTCCACGTAAGTGGAGGTGATCC TCGGCATTGTGGGATGCCAGCGCTGGGCTTTAT GTGTTCTCCACGTAAGTGGAGGTGATCC CATGGACCACTTGGTTGAAGCCAGCACTAAGCT GTGTTCTCCACGTAAGTGGAGGTGATCC CCAGTCCGACTACCACCAGCTCAAAACAGTGGG GTGTTCTCCACGTAAGTGGAGGTGATCC TGTTTGACCAAGTTTTACAGACTTTAATAATGG GTGTTCTCCACGTAAGTGGAGGTGATCC CGTGCGGCGCCACTCGTTTGGCGTGCGGTAAAA GTGTTCTCCACGTAAGTGGAGGTGATCC TATCCTGAATATTTGCCTATTAATGGGGAATGG GTGTTCTCCACGTAAGTGGAGGTGATCC CGCATTGCAACGCTTGTGGAGTGATACGGCAAC GTGTTCTCCACGTAAGTGGAGGTGATTC CTATTCAATTATCAAGCATAACTAGTTGCTAAA GTGTTCTCCATAAATGTGAAGGTATTAT
>NC_017470.1|WP_014565809.1|1001135_1001573_+|hypothetical-protein MLKDQKKFDELGEKLFMKGVLQNFEQKHGPIKGRMMVTEGKIPPEMLVKLQPELMKNPKFIVVEGSFDFSNYMIGMVIGLNPVRPLANGWLIPQLNHPGIKPTKNWQEFFMEKVMEKTDDNGKIDLPLYSWISDKSDITLSDKEK >NC_017470.1|WP_014565808.1|1000194_1001088_+|1-acyl-sn-glycerol-3-phosphate-acyltransferase MIFGFHRRQVINNIKKNVAKKQFDAKAELHDPVLNNKETNKIVSKYWQYTKTISYRLFNPLVRVVFNIASQILTGRCSIDGIENLPDSPTAFITGNHYNQFDVLLIGKLALKKRQRLFIVVEASNLAMPHLIGWAVRNFDSLPIDHDFHYLSRIFPKKLAQVLSKPGWILIYPEEELWFNYRKPRPLKKGAYYYAAKFNQPIISTFTEIQATSKRELFQRDFYKTKKILHILPTIYPNPDLKIRENMQRMAEIDYRQKKAAYEKYYQRKLTTDFSYEDIAGFSPKKHLLNKKIDDNQ >NC_017470.1|WP_014565807.1|999015_1000128_-|glycosyltransferase MRILIVIDDYFNQSNGMCISTQRFVHEYKKMGQEVRVLSTGEKADYPVPELKINIPFIHGLIAKQGFHFAKPIRKTLIKAVTWADIIQIETPFPVSWRAAKLAKKQGKPVIGTFHIYPQNVTASVPFLNNRLGNWCFMLFFREKSFKNCDALQVPTAKVAKWLKQHHFKQKLFVVSNGISDKFINNSHKDKVGHPFTILCIGRFSHEKKQETLFKAMQLTKHSSEIRLIFAGQGPLKKEYEKLANQLPQKPVMQYFPPVKLRQIMSQADLVVHCADVEIEGMACMEAFASGCVPVIADSPLSSTVSYALTPNNCFPAKNSEVLAQRIDYWFEHPQELIKMRQKYRKYSKTLSVARSAKTAIGNLEKLILR >NC_017470.1|WP_014565806.1|997392_998940_+|type-IV-secretory-system-conjugative-DNA-transfer-family-protein MQKIGFSNHSRSNKTESKAPWQNKYSRQATIFGKNTFLPLDLERALNDNTLVIGTSGTGKTYSFLEPNLLQTNSNYVIADAKGSILSEIGPSLKQMGYNLQVLNLVNLDHSMTFNPLANLHSDQDVVKFAEQVMTTDVAGRTNTGQKIDVFWKNAAEALFEAIIFFIRDELPEEEQTMATVNRLFKIVTLKPDRIDTAFSILNSKESDYYFDDYTPDSDDNRLIGDYLFDWVRENDPDSTSIRMWDQVRGMAGSPRTWSSVVGILGSDMAAYNLHDVENLLSGNQIQFAKLLEPKNALFVLYDDADSSKNFLSNILYAQLIKFLYHESRKYKHQALPEKVRFFLDDFKNVNIPGFEDILATARSRNISICMLLQDESQLQAKFGPATPSVIGNCSAYLLTGTTDLTMAQIASQRFDLSTTNIRRMARENFLLDVSGYTAMTKRYDYHDHPNYKGGYYDFEKELVTPQQQANNEGLEKILMYLPHEQNRVDDAENLFGNDYGSDDDLFTIIGNSDN >NC_017470.1|WP_014565805.1|995506_996952_+|amino-acid-permease MDSFDTTHKRKMISWPVLALMDFVTVIGFDDIIYNFKNQGLATISEWIIMLALYVVPYEMMVGQLGSTFSDTTGGLTSWIRHTSGDKMGYFMAWAGWVCALPYLVDVANSTVVSFGWLFAGNNSYEDKMNNWTFALLTAVVFIIFIFFQHRFANSLQILSVIGGGAMFIITVLYIIMTFAYLGKGGHIETQPFNWRSIFPTFDTKFFTSLGLFIFAMDGAEFVAPYVTEMKNGARDFPKAMIMLAVMTGFLTVFGSFALGVFFNAHHLPDDLKMNGSYYAFEAMGKDFGLGKFFLYLFIVTQALYMIAQLAMLVDGMSREFLSDTAKKYLPKGLTKKDKNGLPIHGYWLTALLCSFIMFSSATLPNINSIFNQLLNLNGIIDPFTTSFIFWAFIKIREDEKKYHAEYVYIKNRRMSLIMGWWCFLLTLVAAFGSIFQVDAPTGSTEYYQTIFLNVFESFVLLGLGLILPLIARWQREHDKA >NC_017470.1|WP_118027564.1|995094_995319_+|alpha-glucosidase-C-terminal-domain-containing-protein MVSKLFAYERYLENSDEKLLVFTNFYGKEHTVKLPEKYQGKEYQVLLNNYDAENGKLTDEITLAPYEALAIKIK >NC_017470.1|WP_014565803.1|994119_994473_+|hypothetical-protein MIDIYNEKLAKYADGERRIFTATFLRPDDRKGIFQNLTVNNEDNVVVKQIVLRMNKAFKELNLEKGDVVQFEAIVKQNSRGEYTVERPTGMERISSGQDEEDSGVHVVGDDWDWFEK >NC_017470.1|WP_014565802.1|992672_993878_+|L,D-transpeptidase-family-protein MNEDLRKRNKRNNLIILVVGIVIIIGIIAGFSIHNHRVATQTAAEKFARTHFNPNVKIDGVKVGKLTVKKATDKVNKNAKNVVALKDNKLVYSYSTTSQTIDEQETSELFKKQHTKTPSDKSYSYTTKDLATAKNKLNSLKKATINYKINGKSYKLKATELLNDVSYQNGKYKFGNTIKLTDKLNQIDKEVSTLHKSYKFTVPTGNKVKGKTITVKNKTWGWGVYVQKTRRLLLDAFAQGKTTFDGADAIYGLGYSTYAHGYGRSNHEIGNTYAVVSLKKQEVWLVRNGKLKVHLRDVVTGTMEGSKGDQTPRGVWYIHYKQRNATLRGSNDDGSSYASPVSYWMPFTLSGCGFHDASWRTDWSKTAYLKGGSHGCVNVKPSEIRSVWNNISKNEPVIIYE >NC_017470.1|WP_013437815.1|992039_992597_+|ECF-transporter-S-component MRKDINSLQSLIFTGLFAAIIYIGIWVLRIPVPAMVGRPFIHFGNTLTAVAILYLGYRNGMIAGIIGLGGFDLLNGYAATSWLTMLEVVVVATVLTAVYRGMNYRDSKKNIIILGIIAGVTKIFTTYCVSIVEALMVGTSLQVAYIGAFVSLPATVINSISTAICTPILYFALKDAVKAIMKKAN >NC_017470.1|WP_013641840.1|991208_992030_+|bifunctional-hydroxymethylpyrimidine-kinase/phosphomethylpyrimidine-kinase MINGGVLISQDLSCAGQVSSSVALPILGACGTRSTLLPTAILSTHTGFQGNTYLDLSSEMTKIVAHWQKINLNFDALYLGYLGQNALDFWLDKIEQIKRADQVVLIDPAMADHGKMYRGLDEGYVKKMRQLIPKATILTPNITEAAFLLGKDLTKVSLEKAQEFATELAKKFSIPNVVITGISITKEKIGEVGVTDGKNWSLIQKKLSGSFFGTGDMFASAFLAAVLHGNNLEKSCSIAADFIRLAIMNTKQNPLFGPNYAAGLPWLLDEIEK >NC_017470.1|WP_014565810.1|1002744_1005477_+|CRISPR-associated-helicase/endonuclease-Cas3 MKKLSRYAKNLWGKKATQDETELWLPLIAHMIDTKNVINWLYNHWLNQGQRNLFLQNMSDIDVQKLVRFLGYIHDIGKATPAFQTKESYNHDRDLDYDLLEHLLRNGFTNLDQLHLANARRTPHALAGEAILEREGLNTSVGAIIGGHHGKPQNDDSLRNVLEIYTSNFYQTDTPPNSKNHWLNVQKELINYGLNICGYDDIQSIPKVKQPQAVLLEGLVIMADWLASSEYLNDNFDKPMFTLIPLQEDFDNLDMKQRFRNALMTWYQNDVWQPDPVSDVAKEYQDRFNFTPRVVQKTMSEAIGNISDPGIVIVEAPMGIGKTEIALTAVEQIAGLTGRNGLFFGLPTQATTNAMFSRVDNWLTNIATSENTNIGIKLMHGKAQFNDEYRELPKAENVDTSGSVVINSWFSGKKTILEKFTIGTIDQLLLMGLKQKHLFLRHLGLSGKIVVIDEVHAYDIYMDSYLLKAIEWLGAYHVPVIALSATLSARLRKNLVRAYVRGKYSDPNKYQAEVGWQDNNSYPLLTFLDGQRLNQVDKFDNEGDNKAVVKVKRLQCDDEELINHIQDNIKDGGIAGVIVNTIKRAQDLAQLIPTDIPVLILHSAFLATDRSKLEQKLQSLIGKKAKRPDKLIVIGTQVLEQSLDIDFDVLYTDIAPMDLILQRIGRLHRHQIKRPLKLACPQVFIMGINSWGDYGDANEAIYDKYLLMKTDYFLPDQITLPIDISCLVQKVYSKENDSEIGGISQVKQNYLDKRKKLRKRASVFQIKPPLINFNIHGWLDNNQPGVSKNEERAQAAVRDTKETIEILLLKKTETGVCLLNGKSIEEDQVSSKEIARQIIRLPHAVTFNIDESIDKLETITSEKYPEWQNDIWLKSALALTLDENNNVEFNGWQLHYSKKIGLTYTKEAQS >NC_017470.1|WP_014565811.1|1005473_1007237_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MKQLSFNLITDPWIKVIDKNNNLQKVSLSTLFKNSQDYKQLAGEMKSQDLAVFRFLLAILTTVYSRFDASGKPYDGLQLDDKFQVIPEDPDDVEDEVDGADPSLGSSSFDETQALLKTWADLYHIGHFSSIVVEYLEKYKEKFDMFGDTPFYQVTAEIYDSLVPEKKKISTGSGTTAVKQINRTISESAHTPNIFAPRSDSFKNRIKIDELVRWIITYQNFTGVTDKTKVNANEKFSVSAGWLYGLNPVFAQGDNLFETLMLNLTFFDKDEDLKLVPIQRPIWEWEKFSDYISYRLKAELPDNISETYTMWSRVLHIEWNGNAPTIFSAGLPKVSSENAFIEPMTTWKTDKKELVYKPNTKWIKTVGESMWRNFGQYIRLSESNEQEKKTIHQPGIVTWLNLLENRKLLPANKFINLATVGLISDGNATSQSPAVEVWDEMKIKADVLFDSNEKVAIHWPVVIEDEIDLTKKVVNYYWSLVNNVGKLRELSDPNSFANNYSAELYNQLNNPFLNWLSSLKNTDDRNKQAFIWRTTLKQIVLNEAENFVHSASPRDIKGIIDKDKKTKNIFTEYKKFTILVLSKLKKG >NC_017470.1|WP_014565812.1|1007246_1007831_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MVAKTLSSINKIILSLYNDGNINKSALANLRASNSINSKHMTEVWPIFFKYIAKEDLSQNYKPSYTEIAVFTAVKCFAIYQQGSTECTYGKSYGDNAKGLTFFNALANLRKDAEEKEALDRRVQALLATSNVESVINGIIHTLQILKSHNKHLVIDFAKLGQDLYHFQFDSYSARETCLKWGEEYFAADANLKK >NC_017470.1|WP_193363688.1|1007838_1008927_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MIMNLFLDINVLQTVPSSNLNRDDTGAPKTAIYGGVMRSRVSSQSWKRAVRQAFRAESQDAQWLKSSRTLKAPLLLANEIQKMDSSVSDEEAMKKSTDIFSKASIKVDKKTNQTKALLLISDGQLKKLAKAILENEDIDKKVIKKIFKEDNSLDLALFGRMVADNPDLNVDAACQVAHAISTHEVTPEFDYFTAVDDEKEEGTAGSAMIGSLEYNSSTLYRYANINLNELIHNIGSKLSVEGIKLFIKNFILTMPTGKENTFANKTLPQYVLITLRDDTPVNLVSAFEEPVKSRDGYVKKSIERLEKEYIDTESIIDKPIYSVVLSKYDSTLSNQAENLTSMIESVSKVVDEKVEKNENHNN >NC_017470.1|WP_014565814.1|1008907_1009603_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MKTITIRLASPLQSFGNEATFSHRTTELYPTKSLIVGMLAASLGYRRDDSRINQLNNLQIAVRIDQPGKVLTDFQTVEFKPDTRKLTYRNYLQDGVFIVAISAHDKTIDKLKYALLHPKFQLYIGRRSNPIAGVLKINEFDDDALKVLKKLDWQASEWYQKKYKSEEYFAEIIADASLSKNNSGSLVKDAVGSFNQHSRFHDYRAVVNVHVSLKNKFYQEHSTKHDIFNAI >NC_017470.1|WP_014565815.1|1009615_1010266_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRVEIDQGNRQKLKDLTHLGAYHSWVENSFPEKFGQDRPRHLWRIDTLRHKRYLLVVSAEKPNLNLLDKYGVPGTAETKNYDPFLEKVKQNMIYNFRLTANPVHRVTQPGQKNGKLYPHITIEKQKEWLINRAKNCGFEIIKDESGIYQFDVVSRDWPLLFHKGTKRVRLSRVSFEGQLKVVDLKLFKQHLISGIGKEKAYGMGMLTIIPVRA >NC_017470.1|WP_014565816.1|1010268_1011213_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MKKKYGAKKPELEELGRVRDRISFLYLEHAKLNREDSAIKVLDDRGIVLVPVALISVLLLGPGVDITHRAMELIGDSGTAVVWVGENGVRQYAHGRALNHSSRLLEAQAKLVSNKRTRVEVARKMYQMRFPNEDVSKLSMEQLRGKEGARVRKVYRDQSLKTGVAWERREYDPDNFEASTPINKALTEAHQALYGLSYSVIVALGASPGLGFVHTGHDLAFVYDFADLYKAKYSIPVAFETVKKFGKVDISDNTRLAMRDAFSSGKLLLQMVADLKYLLNIKDDTDENFAVMHLWDDKQGLQKFGVQYHEMDED >NC_017470.1|WP_014565817.1|1011216_1012113_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MIVITLTKVPNSLRGDLTKWCQEIQTGVYVGSFSARIREMLWSRILKNIGTGEATLVYSTNNELGYTFRTTRRDKRVVDFDGVPLMMQMIETPPIRHGFSKAAKYHKSKKFSKKTNISNNMIKGAIKNDFIAVDLETTGLNTSRSKIISIGAIKENLKGEEEQFYRLIKINEAIPEKITELTGLTSKVLNERGVTLEQALNEFRKFVGSEMIVGYNLSFDNNFLLKAYLSIGQRALVNSMKDLMGIVKEKDIFLDNYDLETVLKEYGIKNDNRHNALSDARATFKLAKELNKKGYLQI >NC_017470.1|WP_193363678.1|1013601_1014570_+|homocysteine-S-methyltransferase MSAFLLLLKNEKEVGRMSLIEDAKSGIVLDGAMSDELEKQGVETDNKLWTATALVDQLNKVYNAHQDYFRAGAELVITDTYQANVQAFEESGYSKKEAEKFIRDAVKVAKKARDDYQKETGKYNYVAGTIGSYGAYLADGNEYRGDYNLSEKEYLDFHLPRLKLVLKERPDLIALETQPKITEPVAVLNWLETNYPDMPIYVSFTLKDSKHVSDGTSIEHATQEISKYKQVFAIGINCVSPKLVDQALKEFAKYTSKPLVVYPNLGATYDPKIKKWRSFKEKFDFAELTQKWYEDGAHLIGGCRTTGPKEIKEIRQSIDKLR >NC_017470.1|WP_014565820.1|1014582_1015977_+|amino-acid-permease MAHKTHLKRKMETRHIRMISLGGVIGTGLFLSSGYTIHEAGPLGTVIAYLVGALIVFAVMLCLGELSVAMPYTGAFHVYAKKYIGPSTGFVVAIIYWLTWTIALGSEFTAAGLIMQKWFPHVPVWIWSLACMILIFLSNFFSVKVFAESEFWFAAIKVFAIVAFIILGVLAITGILPVKGFNHAPGLVNFYKNGWFPNGFSGVFTTMLTVNFAFSGTELIGITAGEAEDPQKAIPSAIKTTLWRLVIFFIGSIVVMAALITYKVAGVTQSPFVYVLDLIHVPFAANIMNFVVLTAIISAANSGLYASTRMLWSLSNEGTIPKVFQKTGKNGVPTLALGVSMLDGIFALISSKVAASTVYLVLVSISGLAVVIVWMAIAWAELNFRKQFLKDGHHLSELKYRTPWYPVVPYFAFFASLFSCILIWFDPTQRVALYYTIPFVAICYLVQYLWRKFDKNLRLAEEGK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_017470_3 | 1012143-1013268 | TypeI-E |
I-B,III-A,III-B
Consensus repeat of NC_017470_3
|
18 spacers
spacers of NC_017470_3
>3.1|1012171|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT CACCGTACACCATATTAAACCGTTAAGACTTGA >3.2|1012232|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT CGAAGACGGCACTTATACGATTGATCTTTGGAA >3.3|1012293|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT CATTTTTCCGTCGCATTTACGAAAACCATGTAA >3.4|1012354|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TACATCGACAATAAAGACCCCAAGGCCCGTATT >3.5|1012415|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TAACTATGTAAATCAAACGTTATTAACTCGTAA >3.6|1012476|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT CGATTGACCGCCAGTAACACGATTTGCCATTCT >3.7|1012537|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TTAAACAATACATGCGAACAAATCATTTATTTT >3.8|1012598|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TCAATCTATTTAATCTATACTCATAAGCTTTAC >3.9|1012659|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TATAAAAAGAATGATCCAGACTACTATAGATGG >3.10|1012720|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TGCTGAAGTATTTGACCAGTCTGTACCAACTTT >3.11|1012781|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT CATTCCAGACGGTGTCGAAGCCTTTAAACTGTC >3.12|1012842|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT CGCTGACCAGTACGAAACGGCGTTAAGAAGTCG >3.13|1012903|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TTAGCACTAATTCGAGCCAGTAATCGAAGTTCT >3.14|1012964|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT ACAGCGCTACGAATTAGAGCAGAAACAAGAATT >3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TAAGAACTGAAAAAAAGAAAAAAAGCTTTAAAA >3.16|1013086|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TTGCCAAACATACATAAAAGCAAATTTTTCGCG >3.17|1013147|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT TGAAACATCATCAGATAACTTTGCTAATTCGTC >3.18|1013208|33|NC_017470|CRISPRCasFinder,CRT TAACTCCCGCATAAATCTAACCGTAATAGAGCG |
DEDDh,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around NC_017470_3
The CRISPR arrays of NC_017470_3 >merge|NC_017470|3|1012143-1013268|PILER-CR,CRISPRCasFinder,CRT GTGTTCTCCACGTATGTGGAGGTGATCCCACCGTACACCATATTAAACCGTTAAGACTTGAGTGTTCTCCACGTATGTGGAGGTGATCCCGAAGACGGCACTTATACGATTGATCTTTGGAAGTGTTCTCCACGTATGTGGAGGTGATCCCATTTTTCCGTCGCATTTACGAAAACCATGTAAGTGTTCTCCACGTATGTGGAGGTGATCCTACATCGACAATAAAGACCCCAAGGCCCGTATTGTGTTCTCCACGTATGTGGAGGTGATCCTAACTATGTAAATCAAACGTTATTAACTCGTAAGTGTTCTCCACGTATGTGGAGGTGATCCCGATTGACCGCCAGTAACACGATTTGCCATTCTGTGTTCTCCACGTATGTGGAGGTGATCCTTAAACAATACATGCGAACAAATCATTTATTTTGTGTTCTCCACGTATGTGGAGGTGATCCTCAATCTATTTAATCTATACTCATAAGCTTTACGTGTTCTCCACGTATGTGGAGGTGATCCTATAAAAAGAATGATCCAGACTACTATAGATGGGTGTTCTCCACGTATGTGGAGGTGATCCTGCTGAAGTATTTGACCAGTCTGTACCAACTTTGTGTTCTCCACGTATGTGGAGGTGATCCCATTCCAGACGGTGTCGAAGCCTTTAAACTGTCGTGTTCTCCACGTATGTGGAGGTGATCCCGCTGACCAGTACGAAACGGCGTTAAGAAGTCGGTGTTCTCCACGTATGTGGAGGTGATCCTTAGCACTAATTCGAGCCAGTAATCGAAGTTCTGTGTTCTCCACGTATGTGGAGGTGATCCACAGCGCTACGAATTAGAGCAGAAACAAGAATTGTGTTCTCCACGTATGTGGAGGTGATCCTAAGAACTGAAAAAAAGAAAAAAAGCTTTAAAAGTGTTCTCCACGTATGTGGAGGTGATCCTTGCCAAACATACATAAAAGCAAATTTTTCGCGGTGTTCTCCACGTATGTGGAGGTGATCCTGAAACATCATCAGATAACTTTGCTAATTCGTCGTGTTCTCCACGTATGTGGAGGTGATCCTAACTCCCGCATAAATCTAACCGTAATAGAGCGGTGTTCTCCACGTATGTGGAAGCAAGTT >NC_017470|3|3|1012143-1013207|PILER-CR GTGTTCTCCACGTATGTGGAGGTGATCC CACCGTACACCATATTAAACCGTTAAGACTTGA GTGTTCTCCACGTATGTGGAGGTGATCC CGAAGACGGCACTTATACGATTGATCTTTGGAA GTGTTCTCCACGTATGTGGAGGTGATCC CATTTTTCCGTCGCATTTACGAAAACCATGTAA GTGTTCTCCACGTATGTGGAGGTGATCC TACATCGACAATAAAGACCCCAAGGCCCGTATT GTGTTCTCCACGTATGTGGAGGTGATCC TAACTATGTAAATCAAACGTTATTAACTCGTAA GTGTTCTCCACGTATGTGGAGGTGATCC CGATTGACCGCCAGTAACACGATTTGCCATTCT GTGTTCTCCACGTATGTGGAGGTGATCC TTAAACAATACATGCGAACAAATCATTTATTTT GTGTTCTCCACGTATGTGGAGGTGATCC TCAATCTATTTAATCTATACTCATAAGCTTTAC GTGTTCTCCACGTATGTGGAGGTGATCC TATAAAAAGAATGATCCAGACTACTATAGATGG GTGTTCTCCACGTATGTGGAGGTGATCC TGCTGAAGTATTTGACCAGTCTGTACCAACTTT GTGTTCTCCACGTATGTGGAGGTGATCC CATTCCAGACGGTGTCGAAGCCTTTAAACTGTC GTGTTCTCCACGTATGTGGAGGTGATCC CGCTGACCAGTACGAAACGGCGTTAAGAAGTCG GTGTTCTCCACGTATGTGGAGGTGATCC TTAGCACTAATTCGAGCCAGTAATCGAAGTTCT GTGTTCTCCACGTATGTGGAGGTGATCC ACAGCGCTACGAATTAGAGCAGAAACAAGAATT GTGTTCTCCACGTATGTGGAGGTGATCC TAAGAACTGAAAAAAAGAAAAAAAGCTTTAAAA GTGTTCTCCACGTATGTGGAGGTGATCC TTGCCAAACATACATAAAAGCAAATTTTTCGCG GTGTTCTCCACGTATGTGGAGGTGATCC TGAAACATCATCAGATAACTTTGCTAATTCGTC GTGTTCTCCACGTATGTGGAGGTGATCC >NC_017470|3|2|1012143-1013268|CRISPRCasFinder GTGTTCTCCACGTATGTGGAGGTGATCC CACCGTACACCATATTAAACCGTTAAGACTTGA GTGTTCTCCACGTATGTGGAGGTGATCC CGAAGACGGCACTTATACGATTGATCTTTGGAA GTGTTCTCCACGTATGTGGAGGTGATCC CATTTTTCCGTCGCATTTACGAAAACCATGTAA GTGTTCTCCACGTATGTGGAGGTGATCC TACATCGACAATAAAGACCCCAAGGCCCGTATT GTGTTCTCCACGTATGTGGAGGTGATCC TAACTATGTAAATCAAACGTTATTAACTCGTAA GTGTTCTCCACGTATGTGGAGGTGATCC CGATTGACCGCCAGTAACACGATTTGCCATTCT GTGTTCTCCACGTATGTGGAGGTGATCC TTAAACAATACATGCGAACAAATCATTTATTTT GTGTTCTCCACGTATGTGGAGGTGATCC TCAATCTATTTAATCTATACTCATAAGCTTTAC GTGTTCTCCACGTATGTGGAGGTGATCC TATAAAAAGAATGATCCAGACTACTATAGATGG GTGTTCTCCACGTATGTGGAGGTGATCC TGCTGAAGTATTTGACCAGTCTGTACCAACTTT GTGTTCTCCACGTATGTGGAGGTGATCC CATTCCAGACGGTGTCGAAGCCTTTAAACTGTC GTGTTCTCCACGTATGTGGAGGTGATCC CGCTGACCAGTACGAAACGGCGTTAAGAAGTCG GTGTTCTCCACGTATGTGGAGGTGATCC TTAGCACTAATTCGAGCCAGTAATCGAAGTTCT GTGTTCTCCACGTATGTGGAGGTGATCC ACAGCGCTACGAATTAGAGCAGAAACAAGAATT GTGTTCTCCACGTATGTGGAGGTGATCC TAAGAACTGAAAAAAAGAAAAAAAGCTTTAAAA GTGTTCTCCACGTATGTGGAGGTGATCC TTGCCAAACATACATAAAAGCAAATTTTTCGCG GTGTTCTCCACGTATGTGGAGGTGATCC TGAAACATCATCAGATAACTTTGCTAATTCGTC GTGTTCTCCACGTATGTGGAGGTGATCC TAACTCCCGCATAAATCTAACCGTAATAGAGCG GTGTTCTCCACGTATGTGGAAGCAAGTT >NC_017470|3|2|1012143-1013268|CRT GTGTTCTCCACGTATGTGGAGGTGATCC CACCGTACACCATATTAAACCGTTAAGACTTGA GTGTTCTCCACGTATGTGGAGGTGATCC CGAAGACGGCACTTATACGATTGATCTTTGGAA GTGTTCTCCACGTATGTGGAGGTGATCC CATTTTTCCGTCGCATTTACGAAAACCATGTAA GTGTTCTCCACGTATGTGGAGGTGATCC TACATCGACAATAAAGACCCCAAGGCCCGTATT GTGTTCTCCACGTATGTGGAGGTGATCC TAACTATGTAAATCAAACGTTATTAACTCGTAA GTGTTCTCCACGTATGTGGAGGTGATCC CGATTGACCGCCAGTAACACGATTTGCCATTCT GTGTTCTCCACGTATGTGGAGGTGATCC TTAAACAATACATGCGAACAAATCATTTATTTT GTGTTCTCCACGTATGTGGAGGTGATCC TCAATCTATTTAATCTATACTCATAAGCTTTAC GTGTTCTCCACGTATGTGGAGGTGATCC TATAAAAAGAATGATCCAGACTACTATAGATGG GTGTTCTCCACGTATGTGGAGGTGATCC TGCTGAAGTATTTGACCAGTCTGTACCAACTTT GTGTTCTCCACGTATGTGGAGGTGATCC CATTCCAGACGGTGTCGAAGCCTTTAAACTGTC GTGTTCTCCACGTATGTGGAGGTGATCC CGCTGACCAGTACGAAACGGCGTTAAGAAGTCG GTGTTCTCCACGTATGTGGAGGTGATCC TTAGCACTAATTCGAGCCAGTAATCGAAGTTCT GTGTTCTCCACGTATGTGGAGGTGATCC ACAGCGCTACGAATTAGAGCAGAAACAAGAATT GTGTTCTCCACGTATGTGGAGGTGATCC TAAGAACTGAAAAAAAGAAAAAAAGCTTTAAAA GTGTTCTCCACGTATGTGGAGGTGATCC TTGCCAAACATACATAAAAGCAAATTTTTCGCG GTGTTCTCCACGTATGTGGAGGTGATCC TGAAACATCATCAGATAACTTTGCTAATTCGTC GTGTTCTCCACGTATGTGGAGGTGATCC TAACTCCCGCATAAATCTAACCGTAATAGAGCG GTGTTCTCCACGTATGTGGAAGCAAGTT
>NC_017470.1|WP_014565817.1|1011216_1012113_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MIVITLTKVPNSLRGDLTKWCQEIQTGVYVGSFSARIREMLWSRILKNIGTGEATLVYSTNNELGYTFRTTRRDKRVVDFDGVPLMMQMIETPPIRHGFSKAAKYHKSKKFSKKTNISNNMIKGAIKNDFIAVDLETTGLNTSRSKIISIGAIKENLKGEEEQFYRLIKINEAIPEKITELTGLTSKVLNERGVTLEQALNEFRKFVGSEMIVGYNLSFDNNFLLKAYLSIGQRALVNSMKDLMGIVKEKDIFLDNYDLETVLKEYGIKNDNRHNALSDARATFKLAKELNKKGYLQI >NC_017470.1|WP_014565816.1|1010268_1011213_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MKKKYGAKKPELEELGRVRDRISFLYLEHAKLNREDSAIKVLDDRGIVLVPVALISVLLLGPGVDITHRAMELIGDSGTAVVWVGENGVRQYAHGRALNHSSRLLEAQAKLVSNKRTRVEVARKMYQMRFPNEDVSKLSMEQLRGKEGARVRKVYRDQSLKTGVAWERREYDPDNFEASTPINKALTEAHQALYGLSYSVIVALGASPGLGFVHTGHDLAFVYDFADLYKAKYSIPVAFETVKKFGKVDISDNTRLAMRDAFSSGKLLLQMVADLKYLLNIKDDTDENFAVMHLWDDKQGLQKFGVQYHEMDED >NC_017470.1|WP_014565815.1|1009615_1010266_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRVEIDQGNRQKLKDLTHLGAYHSWVENSFPEKFGQDRPRHLWRIDTLRHKRYLLVVSAEKPNLNLLDKYGVPGTAETKNYDPFLEKVKQNMIYNFRLTANPVHRVTQPGQKNGKLYPHITIEKQKEWLINRAKNCGFEIIKDESGIYQFDVVSRDWPLLFHKGTKRVRLSRVSFEGQLKVVDLKLFKQHLISGIGKEKAYGMGMLTIIPVRA >NC_017470.1|WP_014565814.1|1008907_1009603_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MKTITIRLASPLQSFGNEATFSHRTTELYPTKSLIVGMLAASLGYRRDDSRINQLNNLQIAVRIDQPGKVLTDFQTVEFKPDTRKLTYRNYLQDGVFIVAISAHDKTIDKLKYALLHPKFQLYIGRRSNPIAGVLKINEFDDDALKVLKKLDWQASEWYQKKYKSEEYFAEIIADASLSKNNSGSLVKDAVGSFNQHSRFHDYRAVVNVHVSLKNKFYQEHSTKHDIFNAI >NC_017470.1|WP_193363688.1|1007838_1008927_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MIMNLFLDINVLQTVPSSNLNRDDTGAPKTAIYGGVMRSRVSSQSWKRAVRQAFRAESQDAQWLKSSRTLKAPLLLANEIQKMDSSVSDEEAMKKSTDIFSKASIKVDKKTNQTKALLLISDGQLKKLAKAILENEDIDKKVIKKIFKEDNSLDLALFGRMVADNPDLNVDAACQVAHAISTHEVTPEFDYFTAVDDEKEEGTAGSAMIGSLEYNSSTLYRYANINLNELIHNIGSKLSVEGIKLFIKNFILTMPTGKENTFANKTLPQYVLITLRDDTPVNLVSAFEEPVKSRDGYVKKSIERLEKEYIDTESIIDKPIYSVVLSKYDSTLSNQAENLTSMIESVSKVVDEKVEKNENHNN >NC_017470.1|WP_014565812.1|1007246_1007831_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MVAKTLSSINKIILSLYNDGNINKSALANLRASNSINSKHMTEVWPIFFKYIAKEDLSQNYKPSYTEIAVFTAVKCFAIYQQGSTECTYGKSYGDNAKGLTFFNALANLRKDAEEKEALDRRVQALLATSNVESVINGIIHTLQILKSHNKHLVIDFAKLGQDLYHFQFDSYSARETCLKWGEEYFAADANLKK >NC_017470.1|WP_014565811.1|1005473_1007237_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MKQLSFNLITDPWIKVIDKNNNLQKVSLSTLFKNSQDYKQLAGEMKSQDLAVFRFLLAILTTVYSRFDASGKPYDGLQLDDKFQVIPEDPDDVEDEVDGADPSLGSSSFDETQALLKTWADLYHIGHFSSIVVEYLEKYKEKFDMFGDTPFYQVTAEIYDSLVPEKKKISTGSGTTAVKQINRTISESAHTPNIFAPRSDSFKNRIKIDELVRWIITYQNFTGVTDKTKVNANEKFSVSAGWLYGLNPVFAQGDNLFETLMLNLTFFDKDEDLKLVPIQRPIWEWEKFSDYISYRLKAELPDNISETYTMWSRVLHIEWNGNAPTIFSAGLPKVSSENAFIEPMTTWKTDKKELVYKPNTKWIKTVGESMWRNFGQYIRLSESNEQEKKTIHQPGIVTWLNLLENRKLLPANKFINLATVGLISDGNATSQSPAVEVWDEMKIKADVLFDSNEKVAIHWPVVIEDEIDLTKKVVNYYWSLVNNVGKLRELSDPNSFANNYSAELYNQLNNPFLNWLSSLKNTDDRNKQAFIWRTTLKQIVLNEAENFVHSASPRDIKGIIDKDKKTKNIFTEYKKFTILVLSKLKKG >NC_017470.1|WP_014565810.1|1002744_1005477_+|CRISPR-associated-helicase/endonuclease-Cas3 MKKLSRYAKNLWGKKATQDETELWLPLIAHMIDTKNVINWLYNHWLNQGQRNLFLQNMSDIDVQKLVRFLGYIHDIGKATPAFQTKESYNHDRDLDYDLLEHLLRNGFTNLDQLHLANARRTPHALAGEAILEREGLNTSVGAIIGGHHGKPQNDDSLRNVLEIYTSNFYQTDTPPNSKNHWLNVQKELINYGLNICGYDDIQSIPKVKQPQAVLLEGLVIMADWLASSEYLNDNFDKPMFTLIPLQEDFDNLDMKQRFRNALMTWYQNDVWQPDPVSDVAKEYQDRFNFTPRVVQKTMSEAIGNISDPGIVIVEAPMGIGKTEIALTAVEQIAGLTGRNGLFFGLPTQATTNAMFSRVDNWLTNIATSENTNIGIKLMHGKAQFNDEYRELPKAENVDTSGSVVINSWFSGKKTILEKFTIGTIDQLLLMGLKQKHLFLRHLGLSGKIVVIDEVHAYDIYMDSYLLKAIEWLGAYHVPVIALSATLSARLRKNLVRAYVRGKYSDPNKYQAEVGWQDNNSYPLLTFLDGQRLNQVDKFDNEGDNKAVVKVKRLQCDDEELINHIQDNIKDGGIAGVIVNTIKRAQDLAQLIPTDIPVLILHSAFLATDRSKLEQKLQSLIGKKAKRPDKLIVIGTQVLEQSLDIDFDVLYTDIAPMDLILQRIGRLHRHQIKRPLKLACPQVFIMGINSWGDYGDANEAIYDKYLLMKTDYFLPDQITLPIDISCLVQKVYSKENDSEIGGISQVKQNYLDKRKKLRKRASVFQIKPPLINFNIHGWLDNNQPGVSKNEERAQAAVRDTKETIEILLLKKTETGVCLLNGKSIEEDQVSSKEIARQIIRLPHAVTFNIDESIDKLETITSEKYPEWQNDIWLKSALALTLDENNNVEFNGWQLHYSKKIGLTYTKEAQS >NC_017470.1|WP_014565809.1|1001135_1001573_+|hypothetical-protein MLKDQKKFDELGEKLFMKGVLQNFEQKHGPIKGRMMVTEGKIPPEMLVKLQPELMKNPKFIVVEGSFDFSNYMIGMVIGLNPVRPLANGWLIPQLNHPGIKPTKNWQEFFMEKVMEKTDDNGKIDLPLYSWISDKSDITLSDKEK >NC_017470.1|WP_014565808.1|1000194_1001088_+|1-acyl-sn-glycerol-3-phosphate-acyltransferase MIFGFHRRQVINNIKKNVAKKQFDAKAELHDPVLNNKETNKIVSKYWQYTKTISYRLFNPLVRVVFNIASQILTGRCSIDGIENLPDSPTAFITGNHYNQFDVLLIGKLALKKRQRLFIVVEASNLAMPHLIGWAVRNFDSLPIDHDFHYLSRIFPKKLAQVLSKPGWILIYPEEELWFNYRKPRPLKKGAYYYAAKFNQPIISTFTEIQATSKRELFQRDFYKTKKILHILPTIYPNPDLKIRENMQRMAEIDYRQKKAAYEKYYQRKLTTDFSYEDIAGFSPKKHLLNKKIDDNQ >NC_017470.1|WP_193363678.1|1013601_1014570_+|homocysteine-S-methyltransferase MSAFLLLLKNEKEVGRMSLIEDAKSGIVLDGAMSDELEKQGVETDNKLWTATALVDQLNKVYNAHQDYFRAGAELVITDTYQANVQAFEESGYSKKEAEKFIRDAVKVAKKARDDYQKETGKYNYVAGTIGSYGAYLADGNEYRGDYNLSEKEYLDFHLPRLKLVLKERPDLIALETQPKITEPVAVLNWLETNYPDMPIYVSFTLKDSKHVSDGTSIEHATQEISKYKQVFAIGINCVSPKLVDQALKEFAKYTSKPLVVYPNLGATYDPKIKKWRSFKEKFDFAELTQKWYEDGAHLIGGCRTTGPKEIKEIRQSIDKLR >NC_017470.1|WP_014565820.1|1014582_1015977_+|amino-acid-permease MAHKTHLKRKMETRHIRMISLGGVIGTGLFLSSGYTIHEAGPLGTVIAYLVGALIVFAVMLCLGELSVAMPYTGAFHVYAKKYIGPSTGFVVAIIYWLTWTIALGSEFTAAGLIMQKWFPHVPVWIWSLACMILIFLSNFFSVKVFAESEFWFAAIKVFAIVAFIILGVLAITGILPVKGFNHAPGLVNFYKNGWFPNGFSGVFTTMLTVNFAFSGTELIGITAGEAEDPQKAIPSAIKTTLWRLVIFFIGSIVVMAALITYKVAGVTQSPFVYVLDLIHVPFAANIMNFVVLTAIISAANSGLYASTRMLWSLSNEGTIPKVFQKTGKNGVPTLALGVSMLDGIFALISSKVAASTVYLVLVSISGLAVVIVWMAIAWAELNFRKQFLKDGHHLSELKYRTPWYPVVPYFAFFASLFSCILIWFDPTQRVALYYTIPFVAICYLVQYLWRKFDKNLRLAEEGK >NC_017470.1|WP_014565821.1|1017667_1018600_-|2-dehydropantoate-2-reductase MRIAIAGAGAMGSKFGWHLKKAGNDVTLIDTWDRNIAAIRENGVVARVKDEEIAEKMPIYSPEEIDEQHESVDLLIVFTKSMQLENMLNSLKPIISKDTYVLCLLNGLGHEDVLERFVTRDHIIMGVTMWASMMTAPGHITFANDNGNVEIQCLDPKGKDETQKIVKILTDAGLNASYSENVMYSIWRKACVNGVVNALCALLDADCKQFGHTKEADELTRNIVQEFADVAQYEGVNLDRKEVIEHVESLFDTPHYPSMYQDLVQNNRPTEIDYIDGAVWRKGLKHSVPTPYCAFITRLIHAKEDILKVK >NC_017470.1|WP_013437830.1|1018733_1019378_+|nitroreductase-family-protein MAIINNDFHDVLTGRHSVRRFDPSVKISREEMTEMLKETITAPSACNLQAWRFVVVDTDKGREKLHKYFMKFNFPQIDKSSAIVLFFGNTLAFKKYSKLWHSMYEAKKVTKEAMDAALNTFMPLYEKAPKEMLVADSMVDTSLAAMQFMLIAREHGYDTNAMAGYDSTKAAATMGLDPKQYVPVMAIAVGKHDPKAEPEIATTRYQISDLVDFE >NC_017470.1|WP_014565822.1|1019434_1019965_+|hypothetical-protein MRKRYLFLMSLVAFFSIFFVGMQSQNVYADSQYGIARKYTTPKATRGTWYYRETDKFSSDKKTIYTLKITAHTANKDKLYVPSQKFFKKNVYNVSSKKRNAFIKKVMKKNIYAAYNFKKGFNVNNWVNLAGDGVYYIPVTRTVKGKKVKALKIATGADQHASAYAFKTKALAKAAK >NC_017470.1|WP_014565823.1|1020058_1021102_+|hypothetical-protein MNWKVEDEMKKIGKISVILLAGLALAGCSQKPKQKTSSKGSATIKVTKNKKQPTKMGHLSDQDLSPQKTVAVVVAYAGDRYSGSWNKALLDGKQNGIEVDLKNQSNYSYMNEGSGVAYMVSADAGYTLKQVNGENIYYLFSNGKKLGSVTMKQMVDYLNKRDSDSLVNSLAQNAKVNDERSDSGDDSSDSAGKKSNLPGDDGLFNVPTEFQGTWYTYNDDKMSIIKISQNKINVDNYVQELHKVKAGFLDKYTYGDMSASYHKATKNWGMAGMGSRRVHGINYMNVRGWMQEAGDGDFYGLHTENGQSVLVLAQGAGPWVSGAAWKTPQLAQQYKHKKFKDLYYQDD >NC_017470.1|WP_014565824.1|1021106_1021682_+|ATP-binding-protein MKRIPLILMLGPQASGKSSFIKMNDLQNYTISADEIRIRLNGINSNNGHPQINFVNQTEKIVWQIFNQILQTRLQNGLPTIVDNTNLGGHGFNPINDILKRVPDNYQVYVIDCFKPLLDANDPLSEESLIHALKILDQRNRDREYSVNMDIIQRFVDYYAHFEIPNKVKVISSADLQKVQDLIDLILNFNR >NC_017470.1|WP_014565825.1|1021715_1022360_+|hypothetical-protein MECYSVLAFCYNKYKKYVNKEDKMKTLTLQYTISQKGWTDLRQNGKISIDEQSYLLNAIRTQDQYFKNIYLEQYLLQKIIKDTLPEAKKPILAKKIPISCTPFPEHVTEMPDAGEILLELVVPENEVVTVDYRTWLYLASEVNKTVEKYNSMKDMNTILKLPEKKLKIDKMMRVQLLDVLNPAKTMNFIPELKLDQVKKAYQSADGQLEELEDY >NC_017470.1|WP_014565826.1|1022328_1023813_+|hypothetical-protein MVNLKNLRIIKQKKVKTMKRRGYLIGAVAAGALLFSLNTNVQAATVPISDNATSYVRKGNQYRFYFKAPTRLTVATKAKYKILNTSNWECIPYKGKNKDTKVFYLRSGHYNLTTKSGKNVKIETSATRITKIRNKLETFSHKTYPLETRFTSAIPIKIGQTVTGMTDMYHTEKLNTMNRYKFTLDKDQKVTMNMSVQPVYENSRSNIFNNNDIQILQDTDYGYALNPWKTKGTLKNVKYSWNLEKGTYYLEKGSARGRFSFKLTSEDTNALPSTPKLTKVSSTEDGIKVDYTKADNATGYGIYGSSLRRYRSNDPLLDAASMIGHSNFTPDGNYPDVLTQTISKNRLINGETYDIAVRAVNDEEGRSFSPVSANQKFTYYIPLKGSHEKPKTPTLKVSYYNDHGSDEPYINIEWDVNPEADSYEIQYRLKGSSKWATFFSKTRSGDIVGDPTNDFGQDFKKGQVYEVRIRALHSNLISDWSGVKTTRVDVTPNR >NC_017470.1|WP_014565827.1|1026145_1026751_+|DUF1819-family-protein MSRSYNGGIASYAIWLPELTKFIELYQSGYSINDIKQMSDEENIFQMPTKARAKRCSRNLAVRVKALPESVLNIFSQLDTSNQKIISLLSVMLTSRILDEFIYEVYRPKVQMREDILQDYEVEAFINQKRIESPTIAAWSLNTYKRIKGALKTYMRDGGLMEIDPQNKKQDKFLFPLLDCQLVLAMKVAKLDYELAALGGM |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_017470_3 | 3.6|1012476|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1012476-1012508 | 33 | MN830256 | Lactobacillus phage JNU_P7, complete genome | 36784-36816 | 5 | 0.848 |
NC_017470_3 | 3.16|1013086|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013086-1013118 | 33 | MN856026 | Myoviridae sp. isolate 276, complete genome | 8930-8962 | 5 | 0.848 |
NC_017470_2 | 2.14|1002466|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1002466-1002498 | 33 | NZ_CP038855 | Pantoea vagans strain LMG 24199 plasmid unnamed2, complete sequence | 140401-140433 | 6 | 0.818 |
NC_017470_3 | 3.8|1012598|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1012598-1012630 | 33 | NZ_CP021156 | Photobacterium damselae subsp. damselae strain KC-Na-1 plasmid pPDD-Na-1-4, complete sequence | 22914-22946 | 7 | 0.788 |
NC_017470_3 | 3.8|1012598|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1012598-1012630 | 33 | NZ_CP035459 | Photobacterium damselae subsp. damselae strain KC-Na-NB1 plasmid pFPPDNB1-1, complete sequence | 41891-41923 | 7 | 0.788 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | KT997842 | Uncultured Mediterranean phage uvDeep-CGR1-KM17-C101, complete genome | 11810-11842 | 7 | 0.788 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | MN693762 | Marine virus AFVG_250M302, complete genome | 28116-28148 | 7 | 0.788 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | KT717083 | Streptococcus phage 73, complete genome | 25222-25254 | 7 | 0.788 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | MN694601 | Marine virus AFVG_250M301, complete genome | 28115-28147 | 7 | 0.788 |
NC_017470_2 | 2.16|1002588|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1002588-1002620 | 33 | MH622927 | Podoviridae sp. isolate ctdc_1, complete genome | 47899-47931 | 8 | 0.758 |
NC_017470_3 | 3.10|1012720|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1012720-1012752 | 33 | NZ_CP012741 | Vibrio vulnificus strain FORC_017 plasmid unnamed, complete sequence | 26984-27016 | 8 | 0.758 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | NZ_CP021438 | Bacillus thuringiensis strain C15 plasmid pBMB172, complete sequence | 4444-4476 | 8 | 0.758 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | MN694775 | Marine virus AFVG_250M170, complete genome | 13332-13364 | 8 | 0.758 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | MN694117 | Marine virus AFVG_250M169, complete genome | 40458-40490 | 8 | 0.758 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | MN693887 | Marine virus AFVG_250M171, complete genome | 40462-40494 | 8 | 0.758 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | KP027447 | Staphylococcus phage phiIPLA-C1C, complete genome | 115840-115872 | 8 | 0.758 |
NC_017470_3 | 3.2|1012232|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1012232-1012264 | 33 | LC168164 | Tenacibaculum phage pT24 DNA, complete genome | 171764-171796 | 9 | 0.727 |
NC_017470_3 | 3.8|1012598|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1012598-1012630 | 33 | NZ_CP049733 | Rhizobium leguminosarum strain A1 plasmid pRL10, complete sequence | 414090-414122 | 9 | 0.727 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | NZ_CP007510 | Pseudomonas stutzeri strain 19SMN4 plasmid pLIB119, complete plasmid | 2961-2993 | 9 | 0.727 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | NZ_CP046903 | Pseudomonas stutzeri strain PM101005 plasmid p1_PM101005, complete sequence | 64327-64359 | 9 | 0.727 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | NZ_CP043833 | Bacillus sp. BS98 plasmid unnamed3 | 1131-1163 | 9 | 0.727 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | NZ_CP032306 | Salmonella enterica subsp. enterica serovar Braenderup strain FORC93 plasmid unnamed2, complete sequence | 13090-13122 | 9 | 0.727 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | NZ_CP016184 | Escherichia coli strain EC2 plasmid pEC2-4, complete sequence | 214420-214452 | 9 | 0.727 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | NZ_CP016183 | Escherichia coli strain EC2_1 plasmid pEC2_1-4, complete sequence | 182669-182701 | 9 | 0.727 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | NZ_MH547560 | Pseudomonas aeruginosa strain PA34 plasmid pMKPA34-1, complete sequence | 74817-74849 | 9 | 0.727 |
NC_017470_3 | 3.16|1013086|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013086-1013118 | 33 | NZ_CP044103 | Streptococcus dysgalactiae strain FDAARGOS_654 plasmid unnamed1 | 3474-3506 | 9 | 0.727 |
NC_017470_3 | 3.17|1013147|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013147-1013179 | 33 | MF417868 | Uncultured Caudovirales phage clone 7AX_1, partial genome | 22277-22309 | 9 | 0.727 |
NC_017470_3 | 3.4|1012354|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1012354-1012386 | 33 | KY417925 | Ochrobactrum phage POI1126, complete genome | 5666-5698 | 10 | 0.697 |
NC_017470_3 | 3.8|1012598|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1012598-1012630 | 33 | NZ_CP053207 | Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1C, complete sequence | 443189-443221 | 10 | 0.697 |
NC_017470_3 | 3.11|1012781|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1012781-1012813 | 33 | NZ_CP039693 | Agrobacterium larrymoorei strain CFBP5473 plasmid pAlCFBP5473, complete sequence | 235840-235872 | 10 | 0.697 |
NC_017470_3 | 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT | 1013025-1013057 | 33 | NZ_AP017970 | Fusobacterium varium strain Fv113-g1 plasmid pFV113-g1-2, complete sequence | 29816-29848 | 10 | 0.697 |
NC_017470_2 | 2.17|1002649|33|NC_017470|CRISPRCasFinder,CRT | 1002649-1002681 | 33 | NZ_CP017108 | Lactobacillus salivarius strain CICC23174 plasmid pLS_1 sequence | 212-244 | 11 | 0.667 |
NC_017470_2 | 2.17|1002649|33|NC_017470|CRISPRCasFinder,CRT | 1002649-1002681 | 33 | NZ_CP017108 | Lactobacillus salivarius strain CICC23174 plasmid pLS_1 sequence | 248154-248186 | 11 | 0.667 |
1. spacer 3.6|1012476|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to MN830256 (Lactobacillus phage JNU_P7, complete genome) position: , mismatch: 5, identity: 0.848
cgattgaccgccagtaacacgatttgccattct CRISPR spacer actttgaccaccagtaacacgattagccattct Protospacer ******.************** ********
2. spacer 3.16|1013086|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to MN856026 (Myoviridae sp. isolate 276, complete genome) position: , mismatch: 5, identity: 0.848
ttgccaaacatacataaaagcaaatttttcgcg CRISPR spacer ttgccaaacatacataaaagcgaatttatcatc Protospacer *********************.***** **..
3. spacer 2.14|1002466|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP038855 (Pantoea vagans strain LMG 24199 plasmid unnamed2, complete sequence) position: , mismatch: 6, identity: 0.818
cgtgcggcgccactcgtttggcgtgcggtaaaa CRISPR spacer cgtccggcgccactcgcttggcgtgacgccaaa Protospacer *** ************.******** *. ***
4. spacer 3.8|1012598|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP021156 (Photobacterium damselae subsp. damselae strain KC-Na-1 plasmid pPDD-Na-1-4, complete sequence) position: , mismatch: 7, identity: 0.788
tcaatctatttaatctatactcataa-gctttac CRISPR spacer taaatttatttaatatatactcataataatata- Protospacer * ***.******** *********** . * **
5. spacer 3.8|1012598|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP035459 (Photobacterium damselae subsp. damselae strain KC-Na-NB1 plasmid pFPPDNB1-1, complete sequence) position: , mismatch: 7, identity: 0.788
tcaatctatttaatctatactcataa-gctttac CRISPR spacer taaatttatttaatatatactcataataatata- Protospacer * ***.******** *********** . * **
6. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to KT997842 (Uncultured Mediterranean phage uvDeep-CGR1-KM17-C101, complete genome) position: , mismatch: 7, identity: 0.788
taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer caagaactggaaaaaagaaaagaagcaattgaa Protospacer .********.***********.**** * .**
7. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to MN693762 (Marine virus AFVG_250M302, complete genome) position: , mismatch: 7, identity: 0.788
taagaactg----aaaaaaagaaaaaaagctttaaaa CRISPR spacer ----aactataacaaaaaaagaaataaagatttaaaa Protospacer ****. *********** **** *******
8. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to KT717083 (Streptococcus phage 73, complete genome) position: , mismatch: 7, identity: 0.788
taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer taaatatataaaaaaacaaaaaaaactttaaaa Protospacer ***. *. ******* *******.********
9. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to MN694601 (Marine virus AFVG_250M301, complete genome) position: , mismatch: 7, identity: 0.788
taagaactg----aaaaaaagaaaaaaagctttaaaa CRISPR spacer ----aactataacaaaaaaagaaataaagatttaaaa Protospacer ****. *********** **** *******
10. spacer 2.16|1002588|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to MH622927 (Podoviridae sp. isolate ctdc_1, complete genome) position: , mismatch: 8, identity: 0.758
cgcattgcaacgcttgtggagtgatacggcaac CRISPR spacer cggatttcaacgcttgtggagtgatcccagacg Protospacer ** *** ****************** * . *
11. spacer 3.10|1012720|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012741 (Vibrio vulnificus strain FORC_017 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.758
tgctgaa---gtatttgaccagtctgtaccaacttt CRISPR spacer ---ggaacctatatttgaccagtttttaccaacttc Protospacer *** .************.* *********.
12. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP021438 (Bacillus thuringiensis strain C15 plasmid pBMB172, complete sequence) position: , mismatch: 8, identity: 0.758
taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer caaaaactaaaaaaaagaaaaaaagtaatacta Protospacer .**.****.****************. ** *
13. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to MN694775 (Marine virus AFVG_250M170, complete genome) position: , mismatch: 8, identity: 0.758
-----taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer ttatctaa-----aaaaataagaaaaaaagctttacaa Protospacer *** .**** **************** **
14. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to MN694117 (Marine virus AFVG_250M169, complete genome) position: , mismatch: 8, identity: 0.758
-----taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer ttatctaa-----aaaaataagaaaaaaagctttacaa Protospacer *** .**** **************** **
15. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to MN693887 (Marine virus AFVG_250M171, complete genome) position: , mismatch: 8, identity: 0.758
-----taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer ttatctaa-----aaaaataagaaaaaaagctttacaa Protospacer *** .**** **************** **
16. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to KP027447 (Staphylococcus phage phiIPLA-C1C, complete genome) position: , mismatch: 8, identity: 0.758
---taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer ccttacaga---aaaaaaataaaaaaaactttaaaa Protospacer ** ..* ******* *******.********
17. spacer 3.2|1012232|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to LC168164 (Tenacibaculum phage pT24 DNA, complete genome) position: , mismatch: 9, identity: 0.727
cgaagacggcacttatacgattgatctttggaa CRISPR spacer tgaagatggcacttatatgattgattatacaga Protospacer .*****.**********.*******. * ..*
18. spacer 3.8|1012598|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP049733 (Rhizobium leguminosarum strain A1 plasmid pRL10, complete sequence) position: , mismatch: 9, identity: 0.727
tcaatctatttaatctatactcataagctttac CRISPR spacer ttaatctatttaatctatagacataattaattg Protospacer *.***************** ***** . *
19. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP007510 (Pseudomonas stutzeri strain 19SMN4 plasmid pLIB119, complete plasmid) position: , mismatch: 9, identity: 0.727
taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer acaagccataaaacaagaaaaaaagatttaaaa Protospacer *.. * **** *********** *******
20. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP046903 (Pseudomonas stutzeri strain PM101005 plasmid p1_PM101005, complete sequence) position: , mismatch: 9, identity: 0.727
taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer acaagccataaaacaagaaaaaaagatttaaaa Protospacer *.. * **** *********** *******
21. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP043833 (Bacillus sp. BS98 plasmid unnamed3) position: , mismatch: 9, identity: 0.727
taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer ttttgactgaaaaaaacaaataaagcttttcat Protospacer * .*********** *** ******** *
22. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032306 (Salmonella enterica subsp. enterica serovar Braenderup strain FORC93 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.727
taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer taagaaatgaaaaaaagaaaagaaatcagagta Protospacer ****** **************.**... *. *
23. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP016184 (Escherichia coli strain EC2 plasmid pEC2-4, complete sequence) position: , mismatch: 9, identity: 0.727
taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer caaagttagaaaaaaataaaaaaaggtttaaag Protospacer .**.. . ******** ******** ******.
24. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP016183 (Escherichia coli strain EC2_1 plasmid pEC2_1-4, complete sequence) position: , mismatch: 9, identity: 0.727
taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer caaagttagaaaaaaataaaaaaaggtttaaag Protospacer .**.. . ******** ******** ******.
25. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH547560 (Pseudomonas aeruginosa strain PA34 plasmid pMKPA34-1, complete sequence) position: , mismatch: 9, identity: 0.727
taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer acaagccataaaacaagaaaaaaagatttaaaa Protospacer *.. * **** *********** *******
26. spacer 3.16|1013086|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP044103 (Streptococcus dysgalactiae strain FDAARGOS_654 plasmid unnamed1) position: , mismatch: 9, identity: 0.727
ttgccaaacatacataaaagcaaatttttcgcg CRISPR spacer taaacgaagatacatacaagcaaatttttccat Protospacer * . *.** ******* *************
27. spacer 3.17|1013147|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to MF417868 (Uncultured Caudovirales phage clone 7AX_1, partial genome) position: , mismatch: 9, identity: 0.727
tgaaacatcatcagataactttgctaattcgtc CRISPR spacer ttttaaatcatcaaataacttttctaattctct Protospacer * * *******.******** ******* ..
28. spacer 3.4|1012354|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to KY417925 (Ochrobactrum phage POI1126, complete genome) position: , mismatch: 10, identity: 0.697
tacatcgacaataaagaccccaaggcccgtatt CRISPR spacer gccaacgacaagaaagaccccaaggctgcgaag Protospacer ** ****** **************. *
29. spacer 3.8|1012598|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP053207 (Rhizobium leguminosarum bv. trifolii TA1 plasmid pRltTA1C, complete sequence) position: , mismatch: 10, identity: 0.697
tcaatctatttaatctatactcataagctttac CRISPR spacer gtaatctatttaatctatagacataattaattg Protospacer .***************** ***** . *
30. spacer 3.11|1012781|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP039693 (Agrobacterium larrymoorei strain CFBP5473 plasmid pAlCFBP5473, complete sequence) position: , mismatch: 10, identity: 0.697
cattccagacggtgtcgaagcctttaaactgtc CRISPR spacer ctggtgcgacggtgtcgatgcctttgaactgga Protospacer * . *********** ******.*****
31. spacer 3.15|1013025|33|NC_017470|PILER-CR,CRISPRCasFinder,CRT matches to NZ_AP017970 (Fusobacterium varium strain Fv113-g1 plasmid pFV113-g1-2, complete sequence) position: , mismatch: 10, identity: 0.697
taagaactgaaaaaaagaaaaaaagctttaaaa CRISPR spacer attttatcaaagaaaagaaaaaaagcttttaaa Protospacer *...**.***************** ***
32. spacer 2.17|1002649|33|NC_017470|CRISPRCasFinder,CRT matches to NZ_CP017108 (Lactobacillus salivarius strain CICC23174 plasmid pLS_1 sequence) position: , mismatch: 11, identity: 0.667
ctattcaattatcaagcataactagttgctaaa CRISPR spacer aagcctgcttctctagcataactagttgctaat Protospacer ..... ** ** ******************
33. spacer 2.17|1002649|33|NC_017470|CRISPRCasFinder,CRT matches to NZ_CP017108 (Lactobacillus salivarius strain CICC23174 plasmid pLS_1 sequence) position: , mismatch: 11, identity: 0.667
ctattcaattatcaagcataactagttgctaaa CRISPR spacer aagcctgcttctctagcataactagttgctaat Protospacer ..... ** ** ******************
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1236994 : 1246662
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_017470|1236994:1246662|DBSCAN-SWA AATGAGTACAATTTATAATTCAGTTACAGAATTGATTGGTAACACACCATTATTGAAGTTAAACCGTGTCGTTCCAGAAGATGCGGCTGATGTTTATGTTAAGCTTGAATTTGAAACCCCTGCAGGCTCAACCAAAGATCGTATTGCTTTAGCAATGATTGAAAAAGCTGAACGTGATGGCAATTTAAAGCCAGGCGGTACTATTACCTCACGCAACACCGGTATTGGTTTAGCCATGGTAGCTACTGCTAAAGGCTATCACATTATCATCGTCATGCCTGAAACCATGAGTGTAGAATGTCGTAAGCTCATGAAAGGCTACGGTGCTGAACTTATTTTGACACCAGGTTCTGAAGGCATGAAGGGTTCTATTGCTAAGGCTGAAGAATTAGTTAAGGAAAAAAGCTATTATATGCCAATGCAATTCGATAATCCTGAAAATCCTAATATTCATGAGTTGACTACTGGTCCAGAAATTATTAGCGCAATGAACGGTATCGGCAAGTCTGTCGACGCTTTTGTTGCTGGCGTTGGTACCGGTGGTACTCTTTCAGGTATTGGCCGTGCATTAAAGAAGGAAAATCCTAATACTAAGGTTTACGCACTGGAGCCAAGCGAGTCACCTTTATTAAAGGATGGCAAGACTGGCAAACACGGCATAGCTGCTGGATTTATTCCAAAAAACTTGGACAAAGATGTCTATGACGGCATTGTAGAAGTTTCAACTGATGAAGCGATGAAGATAGCCTGTGAAGTTGCCAAAAACGAAGGTTTCTTACCTGGTATTTTCGCTGGTGCCAACATCCACGGTGCAATCGAATTAGCTAAGAAGCTTGGTAAAGGTCATGCTGCTGTCACTGTTAGTCCAGATGGCGGTGATCGTTACCTTTCAACTGCTTTATTTAACGAATAAACAAAATATTACTGAAATCATTGTCTAGGGATGAATTGATTTGATTCATCCTTTTTCTTTTACCAGAACATTGGGCTAGACCCTTTTAATTTTATCAACTATAATTGGTATATACATATTGACGAAAAGAGGTATATATAATGAGAATGAGATCTTTAGGTCTATCCATCTATCCAGACCACAGTAATTTTGAAGACAATGCTAAATACCTAGAACTTGGTCATAAATACGGCTTCTCTCGCATTTTTATGAGTATGCTTGAAGTCAAGTCAGTCCTGCAGAAACTCAAGCCAAGTACAAAAAATAATTGAATACGGCAACAAATTAGGTTACCAAACTTTCTTAGACGTTTCTCCTCAATTATTTGATCAACTTGGAATTAGTTACTCTGACTTAAAATTCTTCGCAGATACTGGTGCAGCTGGTATTAGATTAGACCCAGCCTTCGACGGTGCAACAGAAGCAATGCTTTCTTACAATCCATATGGTTTAATCATTGAATTAAACATGAGTAACAACGTCGACTACTTAAACAATATTATTTCCTATCAAGCAATGCATTTATTTGCCTCAGGCTTAATTGATGACGTAATCATTGGTAACGCCTATGCTAGCGAAGAAGAATTAAGGGCTTTATCAGAGGTAAATCGTTACCAGTTAATGCTTCACGTTGATTATGTAAAACAAATTAGTGATATAGAAAAACCTCAACACTTCCGCCGCGGCGACATGAATGAAATTGTCATTCGTTCAACTATGCCTCGAGTAACTTACAAAGACATCCCTAACCCACCACATGATAATGAAGAGGAATTTCAACGCGGTGATGTTTTAATCGGTAATGACAATTTTGGTATTTATAAAAATGAATTTCAAATTGTCTTAAAGCCACATAAAGAGCCAAGAAAGAACAAAATTGGTAGTATCGCCAAAGATGAATTATTTTTACTTGATTTTATCAAGCCATGGACCAAATTTAAGTTAACTGGCAAATAGAACAAAAGAAATACCCCTAAAGAAATTTTTCCTTCAGGGGTATTTTTTTATTTCTGTTGATTTCACGCTATTTCTAAGCGTTATACAAATTAGTATAACGATCTTTAATTACTAGTGCAACATTTTAAGACGATTTATTTTTTAGATTTGTTAATACCTTTCCAAAACCATCTCACACCAACATATGCGATCACAAAGCATGCTAAACTAAGGATCCAGTTTGGAAAGATTGAGGTCATCGTAATAGAACGTCCTGCCTCAGTATTTGCTATCAAGTTGACGCCAAAGATATTCGAAACAATTGGAATATTTACATTAAACCACAAATTCAAAGTTAAATAGATCATTATTAAGAGTAACCAAATTGCAATGTCTTTAATAATTCGTTTTATCCAGATATTCATTTTTATAATTACTCCTTACTTTAATTCCATCGTATCTTGTGAGAAAAGAACCGGATATTCATCAGACCATTTCTCAGTTATCGTTTTAACAGTTTCAAAACCAAACTTTTTATACAAAGCTTTAGCACGATTATTGTTGGTAAACACCTCAAGACTGACTGGTCGCTTTAATTCGTCTAGTGTTTTTTCCATTAACTTGGTAGCTATCCCTCTACCTTGTGCATTTGGATCAACATAGATGAATTCGATCTTACCTGGTTTAAAGCCTATAAAACCCATTAATTTTTCGTCCTCAAGTGCTACGTAAATCTTACAAGACAACAAATAATCTAAATATGGTGCATCCCTGAAAGCCACAAAAACTGGCTCCAAATTTTCAGCTTCTAGTTCTTGCATCCTCGCTTTATCCATCACTGCGCCTAGTTGCGCAAAATATTGTTTTTCGTATGATTTGATTTTCATGCTTCCCCCTCTTTTATTAACTAATAATTTTATAAAAATTTTGCAAAATAAAAAAGCCCCAATGCTTTCGCATTAAGGCTTTTTATTTACTATTTAAGCACCGTGATATTTTACTAATGAATAAATATCAACGTCTGGCAACTTCTCTCTACCCTTTAAGTCAGGTAATTCAATATAGAATGCAGCACCTACAAGCTTACCACCAAGGTTTTCGATTAATTCCTTAGAAGCTCTCAAAGTACCAGCAGTAGCCATCAAGTCGTCACATACAACTACTCTTTGACCTGGCTTAATAGCATCCTTGTGCATTTCCAAGCTGTTTGAACCGTATTCCAAAGCATATGAAGCACGTTCAACTTCTCTAGGCAACTTGTGAGGCTTTCTTGCTGGAACAAAGCCAATGCCCAATTCAGTTGCTACAGGACAGCCAACGATAAAACCACGTGCTTCTGGACCAACGATTACATCCGCATTTCTGCTCTTGGCGTATTCTGCTAATTCATGAGTAGCAGTACGGTATAAATCGCCATCTTGCAAGATTGGTGTAATGTCGCGGAACACAATTCCCTTGTTAGGAAAGTCCTTTACGCTTGCAATATACTTACTAAAATCAATTGCCATAAGAAGTAAAACCTCCAAAAATCTTTGCGTTTTTATTCAAACACTTTTAACTTGTATATTTTAACAGATTTTTGGTATAAAAACATCAAAATCTTGATATTTATCTAATTATTCAAAAAATTATTAACATATGTAATAATTTGCTGAGTTGGCATTGTTCTCAATTGATCGACAAACTTAATTTGTGACTGCGTTGCTCTCAAATACTTAGATGCTGTTAAAGGCTGCTTTTGAGGATGCTGCTCGCCTACTAGCTTGCCATCGTCCAATTTAACAAAGCCTAATTCAAAGAAGACACGTAAAATGAACAATACACTGTCATAATCTAAGCCAAGATATGGCGCTACGGTGCGATAATCATCTGGACTCAACGTTGGGTGAGCATAAATATATTTCAACACTTTGCCAAAATAGCTTTTTGCTGGAATATGCTCTACTGGCAATTGATCAAGCAAGAAACGTAAGTAAATTTGTTGGTAATTCTTTTCTAGTGCCGAATTTAATTCGATTTGATTGCGTGGTACATCAAGCAAAGCAACTGTTTCACCAGATTTGTCGTAATCTTTAACCAAACTAATCTTATCGTCACTTACTTGCAGTCTATTCCTAACAATTGGAATATTCTTTTTGTCAAATAACAAATAACGATCTGCAAAACCCATGACTAGCTTTTCTTGACGCAAATCAATAACTGGCGTAGGAACAGCTAATTTAGGTGAAGCAAATTCAATTCCTTCGATGATACCTTGCAACGATATCTTATTGCGATAAGTATTAAGCGACAATTGAACAAAGACCCTGGCAATAAATGGCAACAAATTATTGTTTAAAAAGTCTTTATTAAAACCAATTACAGCCAGGCTGCCACCCTTTTTAGCGACATTAAACTTGACGTGATTCTTCTCTTTACCAATCTTAAAAAATTGGGTAATCGTTGGGTTGGAGATACTGAAAATTGGCTGTGCATCCCCCGTTCCAAACGGTCCCACTTGATTAATCTGCGCTAAGGTCTGTGGTGACAATCCCTGCAAAGGCAGTTCCATATCATATTCCTTAATTTCAAGTCCACCTTCAACGTGAAAACCTTTCTCAAGCTTTTCTCTTAAAGCACCAATTTTATCTTCAGTCATTGATAAACCGCAGGCAAAATCATGACCACCAAATTTGGTAAAAAGCTCTTCTTTTAAAGGATTAAGGGCATCAAACAAATTAAAGCCAGTAATTGATCTACCTGAACCTTTGATTTCTCCAGCCTGATTCTTAGTTAAAACGATCGTTGGCTTATGCGTCTTCTCAACGATCTTATTGGCTACCAGGCCCAACACACCTTCATGAAAATCTGGATCATATAAAACAAGCGTATTTTGTTTTTGCCAGCTATTCTCACGGATTAACGCCATACATTTATCATAGACTTCAGTCGTCAATTCCTTACGCTGATTATTCAGCTCTTCAATTTGATCCGCAATCTTTTGTGCTTCAACATCGTCGTCGCTTAATAATAACTCAACTGCAAGGTTAGCATTGGCCAATCTACCTACCGCATTCAAACGAGGAGCAATATTAAAGGCGATATCTGTTTCGTTAATCGAGCCTAAAGTTAAGCCAGCATTCTTTATCAGGGCACGCAAACCTGGTCGCTCAGTTTGGTTAAGTACTTCCAGCCCTCGTTTAACAATTACGTGACCTTCTCCGGTTACTTTAACCATATCGCCAATCGTACCTATCATGGCTAAATCAAGTAGTTCTGGCATCGTATCCTGCATTAAGCCGCGGCAGATTGTGTAGGCAACACCTGCACCACAATAATCATCAAAAGGATATTTTTGGCCAGGATAATTACAGTGTACGATGGCATAGGCATCTGGCTTTTGTTCTTGAAAAGTGTGGTGGTCGGTTACGATTGTATCTACACCATGTTCTTTAGCATATTTAACTTCGTCAATACCAGTCACACCATTATCTACAGTAATAATTAATTTGGTACCGTCTGCAACAATATCATGATAGCGATCAATATTAGGGCCATAGCCGTCTTTAAAGCGATCTGGAATAAAGTAATGCACATCAGCACCTAAAATACTTAGAGTCTCTGTCATAATCGTTGTCGCAGTGATCCCGTCAGCATCATAGTCGCCATAAATCGTAATCTTTTCACCATTATCAATTGCTTGATTGATTCGATCAATGGCCTTATCCATATCATGCATAAGCGAAGGATCCGCTAAATTCTCTTCGGTCGCATTAAACCAGAAATCTAGTTTTTCATCGCTGTCAATGCCACGTAAGGCAAAAAGCTTTGCCGCAATTGGACTTAGCTGATACTTTTCTATCAAACTTGGATCTAGCTCTTCAGCAGTTCTTTGTTGCCATTTCATTACTATAAAACACCCTCATCAAAACTAAAAGGCACTCACAAGCGAGTGTCTTTCTTGCAAATCTATTAATAATTTTAACGCAAATACGATATCTGTATAAGTCAGACAATCTATTTTAAGTTAAAGACTTTCAAATTATTATCTGTTGCTTTTTCGGTTTTAATTAAATTACCACGAATTGCCCAGCGATTTTTACCACCATCAGCACAGGTAATTAAGGTGACGATGTTTTGTCGGGTATTGTTAACTAACCAAACTGCAGAAGGATCAACGATCTTCTTCATATAAATGCGATAAATATATACCTTCTTCAAGTCAGTTAAATAAATTAACTCGTCCTTTTTAACATCTTCTAGTGGTGAAAATAGAATACCCTTCGCGGTCATATAGTGACCAGCAAGAGGATAATTACCTTTACCCATCACTTGGTCGGCACGCATTGTTCCACCACCAGTAGACATCGCATCATCGGACATCCCTAACATAATAGGCAAATGCATATTAACTTCAGGGATTGCTATAGCACCAATCGCACCTGAGGTCTTCTTTATCTGCGACCTTGTGGCCTGGCCTATGCCAATTGACTTAACCTTCTTAAAGTCGTACATCCCCTTTTTCTTTTGGTTTTGTTCAACTTTCTTTCTAGTTAAGCTAGTCAAAGTTGACTGTTGGTTATGACTGATCATCTGATTACCAATCTGTTTATTGAAAATCAGGACCAAACCAACAATCAATAAAATGACGGCAAAAATTCTGATTAGAATTGTGCTAACAGAACTTTTTTGTTTATTTTTAGCCATAATTACTTGCCTTTGATATCATCATCATTCATCTTAAGAACGGCCATGAAGGCATCTTGAGGCACTTCTACCTTACCTACAGCCTTCATTCTCTTTTTACCACGCTTCTGCTTTTCAAGCAATTTAGCACGACGGTCAGGGTCACCGGTGTGAATCTTCCAAGTAACATCCTTACGGTATGGCTTGATCGTAGCACGAGAAATAATCTTGGCGCCGATCGCACCTTGAATATCAACTTCGAAGTTTTGACGTGGAATTAACTTCTTAAGCATTGAAGTCATTTGACGAGCACGATCTTGTGCTTCACTTCTGTGAGCGATAAAGCTAAGAGCATCGATTGGTTCCTTGTTAAGCAAGATATCGATCTTAACCAAGTCGGTTGCACGATAGCCAGTAATCTCGTAGTCAAGTGAAGCATAACCCTTAGTTGATGACTTCAAATCATCGAAGAAGTCAAAGATGATTTCAGCCAACGGCATATTGTAGATAACGTTAACGCGGTACTTATCAAGATAGTCCATCGTAACAAATTCGCCACGTTTTCTTTGACAAAGTTCCATTACAGGGCCAACAAAGTCATTTGGCACCATAATTTCTGCCTTAACATAAGGCTCTTGCACTTCCTTGTATTCACCAGCATCTGGCAAATCTGATGGGTTATCAATTACCTTAGTTGAGCCATCATTCATAATTGCATGATAGTCAACGGATGGTGCAGTCATAATTAAATCAAGATCAAATTCTTGTTCTAGTCGTTCTTGCACAACATCCATATGCAAAAGTCCTAAGAAACCACAACGGAACCCGAAGCCTAAAGCAGTAGAAGTTTCAGGTTCAAATTCTAAAGCTGCATCGTTTAATTGCAACTTTTGCAAAGCTTCCTTTAAGTCTTCATAATCACGGTTATCAACTGGATACATACCAGAGTAAACCATTGGTGGAATTTGACGGTAACCTGGAAGTGGTTCGGCAGTAGGATTATCTGCTTGGGTGATAGTATCACCAACACGAGTTTCACGTACGGACTTAATGTTGGCAGTAATATAACCTACATCCCCAGCAATCAAGATATCCTTCTTAATTGGATGTGGACTTGAAACACCTACTTCTGTAACTTCATATTCCTTACCAGTATTCATAATTTGAACTCGGTCACCAGGCTTAACTGTACCGTCTTCGATTTTGACTGACATTACGACACCACGATAGTCATCATATTTTGAGTCAAAAATCAAAGCCTTGAGTGGCGCAGTAATATCACCAGATGGAGCTGGAATGTCTTTTACTACTTTTTCCAGCATGTCCTTGATGCCTTGACCGGTTTTACCAGAAACTTCCGCAGCTTCAGAAGCATCAAGACCAAGCATCTCTTCAATTTCTTCCTTAGTCTTAGGAATATCAGCAGATGGCAAGTCGATCTTATTAATTACAGGTAAAATTGCCAAATCATCATCGATCGCTAAGTAAGTGTTAGCCAAAGTCTGAGCTTGCACACCTTGTGAAGCATCAACTACCATCAATGCACCTTCACAGGCAGCCAAAGAACGTGATACTTCATAAGAAAAGTCCACGTGCCCTGGTGTGTCGATCAAGTGGAAGATGTAGTCTTCGCCATCTTCGGCATGATACTTAACTTCAACTGAGTTCATTTTAATAGTAATACCACGTTGACGTTCAAGTGGCATATCATCTAACATCTGATTCTTAAGTTGACGTTGACTTACAGTATCAGTCAATTCAAGAATTCTGTCGGCAATAGTAGACTTACCATGGTCAATATGCGCTACGATTGCAAAGTTTCGAATATGTTTTTGATAGTCTTTTAATTTGTTTAAATCCATAAATAAATGCACCCTTTTATTTCTGTCTTATTAATTATAGCAAAGGTTACGGCGCATACAAAAGAAATAGCGAGCTGCATTTTACAGCTCGCTATTTTTTTATTCTCCGTTCAATTTATCTTTAAGCCTTTCGAAAAAGCCTTTTTCTTTTGGAGTAATATGTCCACCACCATCTTGAACAAAGTTAGTCAAATCTTGTTTTTGCTTTTCATTAATATGCTTTGGAATAACGATATCAACAGTAATAATTTGATCACCGTTGCCATCACCACGAAGATAAGGCACACCTTTACCTCGCAAGGTAAATTTCTTATTAGGTTGCGTACCTGCTGGAATGGTCAACTTTTCATCACCATGAACAGTCTTAACGTCAATTTCATCACCTAATGTAGCTTGCGCAAATGAAATTGGCACGTGAGTATAAATGTTTTGACCACGACGCTCAAAAATCTTGGATGGCTTTACTCGGTAAATGATGTACAAGTCACCATATGGTCCACCATTAATACCGGCATCACCTTGGCCTTGGTAACGCAATTGTTGACCATTATCAATACCTGCTGGAATATCAACCTCAATAGTATTCTTACGTTCAACAATACCTCTACCGTGACAAGTCTTACATGGGTGTTCAATGACCACACCACGACCATTACATTTATCACAAGTAGTTTGTTGACGAACCATACCAAAGACTGAACGTTGGGTAACTGTCATATAACCTGTACCATGACATTTATCACAGGTAATTGGGTGCGTACCTTTTTCGCAACCATTACCACCACAAGTCTCACAAGTTTCATCACGAGTATAGCTAACTTGAGACTTTTTACCATTAATGGCATCCATAAAGTCAATCGTTAAAGTATAGTCTAGGTCCTCACCACGTTGAGGTGCTGTAGGATCGACTCTTTGTTGACGGCCTTGACCAAAGATATCGCCAAAAATGTCGCCAAAATCGCCAAAGCCTGAAGCATCAAAACCTGCACCACCAAAGCCTTGACCGGCTCCGCCACCGAAGCCACCTTGGCCATTTACACCGGCAGAACCAAATTGATCATATTGTGCACGCTTTTGCTTATCATGCAAAACTTCATAAGCTTCGTTAACTTGCTTATACTTTTCTTCAGCTCCGGGTTCATGGTTCAAATCTGGGTGGTATTTCTTTGCTAGTTTACGATATGCCTTATTAATTTCCTGGTCACTGGCGTCACGATCTACGCCAAGCACTTTATAATAATCTTCTTGTGCCAT
Protein sequences of DBSCAN-SWA_1 >NC_017470|1236994:1246662|1242877_1243567_-|WP_014565934.1|DBSCAN-SWA MAKNKQKSSVSTILIRIFAVILLIVGLVLIFNKQIGNQMISHNQQSTLTSLTRKKVEQNQKKKGMYDFKKVKSIGIGQATRSQIKKTSGAIGAIAIPEVNMHLPIMLGMSDDAMSTGGGTMRADQVMGKGNYPLAGHYMTAKGILFSPLEDVKKDELIYLTDLKKVYIYRIYMKKIVDPSAVWLVNNTRQNIVTLITCADGGKNRWAIRGNLIKTEKATDNNLKVFNLK >NC_017470|1236994:1246662|1236994_1237906_+|WP_014565929.1|DBSCAN-SWA MSTIYNSVTELIGNTPLLKLNRVVPEDAADVYVKLEFETPAGSTKDRIALAMIEKAERDGNLKPGGTITSRNTGIGLAMVATAKGYHIIIVMPETMSVECRKLMKGYGAELILTPGSEGMKGSIAKAEELVKEKSYYMPMQFDNPENPNIHELTTGPEIISAMNGIGKSVDAFVAGVGTGGTLSGIGRALKKENPNTKVYALEPSESPLLKDGKTGKHGIAAGFIPKNLDKDVYDGIVEVSTDEAMKIACEVAKNEGFLPGIFAGANIHGAIELAKKLGKGHAAVTVSPDGGDRYLSTALFNE >NC_017470|1236994:1246662|1243569_1245408_-|WP_013438127.1|DBSCAN-SWA MDLNKLKDYQKHIRNFAIVAHIDHGKSTIADRILELTDTVSQRQLKNQMLDDMPLERQRGITIKMNSVEVKYHAEDGEDYIFHLIDTPGHVDFSYEVSRSLAACEGALMVVDASQGVQAQTLANTYLAIDDDLAILPVINKIDLPSADIPKTKEEIEEMLGLDASEAAEVSGKTGQGIKDMLEKVVKDIPAPSGDITAPLKALIFDSKYDDYRGVVMSVKIEDGTVKPGDRVQIMNTGKEYEVTEVGVSSPHPIKKDILIAGDVGYITANIKSVRETRVGDTITQADNPTAEPLPGYRQIPPMVYSGMYPVDNRDYEDLKEALQKLQLNDAALEFEPETSTALGFGFRCGFLGLLHMDVVQERLEQEFDLDLIMTAPSVDYHAIMNDGSTKVIDNPSDLPDAGEYKEVQEPYVKAEIMVPNDFVGPVMELCQRKRGEFVTMDYLDKYRVNVIYNMPLAEIIFDFFDDLKSSTKGYASLDYEITGYRATDLVKIDILLNKEPIDALSFIAHRSEAQDRARQMTSMLKKLIPRQNFEVDIQGAIGAKIISRATIKPYRKDVTWKIHTGDPDRRAKLLEKQKRGKKRMKAVGKVEVPQDAFMAVLKMNDDDIKGK >NC_017470|1236994:1246662|1239861_1240389_-|WP_013438124.1|DBSCAN-SWA MAIDFSKYIASVKDFPNKGIVFRDITPILQDGDLYRTATHELAEYAKSRNADVIVGPEARGFIVGCPVATELGIGFVPARKPHKLPREVERASYALEYGSNSLEMHKDAIKPGQRVVVCDDLMATAGTLRASKELIENLGGKLVGAAFYIELPDLKGREKLPDVDIYSLVKYHGA >NC_017470|1236994:1246662|1239321_1239768_-|WP_014565932.1|DBSCAN-SWA MKIKSYEKQYFAQLGAVMDKARMQELEAENLEPVFVAFRDAPYLDYLLSCKIYVALEDEKLMGFIGFKPGKIEFIYVDPNAQGRGIATKLMEKTLDELKRPVSLEVFTNNNRAKALYKKFGFETVKTITEKWSDEYPVLFSQDTMELK >NC_017470|1236994:1246662|1240493_1242767_-|WP_014565933.1|DBSCAN-SWA MKWQQRTAEELDPSLIEKYQLSPIAAKLFALRGIDSDEKLDFWFNATEENLADPSLMHDMDKAIDRINQAIDNGEKITIYGDYDADGITATTIMTETLSILGADVHYFIPDRFKDGYGPNIDRYHDIVADGTKLIITVDNGVTGIDEVKYAKEHGVDTIVTDHHTFQEQKPDAYAIVHCNYPGQKYPFDDYCGAGVAYTICRGLMQDTMPELLDLAMIGTIGDMVKVTGEGHVIVKRGLEVLNQTERPGLRALIKNAGLTLGSINETDIAFNIAPRLNAVGRLANANLAVELLLSDDDVEAQKIADQIEELNNQRKELTTEVYDKCMALIRENSWQKQNTLVLYDPDFHEGVLGLVANKIVEKTHKPTIVLTKNQAGEIKGSGRSITGFNLFDALNPLKEELFTKFGGHDFACGLSMTEDKIGALREKLEKGFHVEGGLEIKEYDMELPLQGLSPQTLAQINQVGPFGTGDAQPIFSISNPTITQFFKIGKEKNHVKFNVAKKGGSLAVIGFNKDFLNNNLLPFIARVFVQLSLNTYRNKISLQGIIEGIEFASPKLAVPTPVIDLRQEKLVMGFADRYLLFDKKNIPIVRNRLQVSDDKISLVKDYDKSGETVALLDVPRNQIELNSALEKNYQQIYLRFLLDQLPVEHIPAKSYFGKVLKYIYAHPTLSPDDYRTVAPYLGLDYDSVLFILRVFFELGFVKLDDGKLVGEQHPQKQPLTASKYLRATQSQIKFVDQLRTMPTQQIITYVNNFLNN >NC_017470|1236994:1246662|1245507_1246662_-|WP_013642124.1|DBSCAN-SWA MAQEDYYKVLGVDRDASDQEINKAYRKLAKKYHPDLNHEPGAEEKYKQVNEAYEVLHDKQKRAQYDQFGSAGVNGQGGFGGGAGQGFGGAGFDASGFGDFGDIFGDIFGQGRQQRVDPTAPQRGEDLDYTLTIDFMDAINGKKSQVSYTRDETCETCGGNGCEKGTHPITCDKCHGTGYMTVTQRSVFGMVRQQTTCDKCNGRGVVIEHPCKTCHGRGIVERKNTIEVDIPAGIDNGQQLRYQGQGDAGINGGPYGDLYIIYRVKPSKIFERRGQNIYTHVPISFAQATLGDEIDVKTVHGDEKLTIPAGTQPNKKFTLRGKGVPYLRGDGNGDQIITVDIVIPKHINEKQKQDLTNFVQDGGGHITPKEKGFFERLKDKLNGE |
7 | Streptococcus_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1286112 : 1298493
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_017470|1286112:1298493|DBSCAN-SWA TTTACCAAATTACAATTCTATCTTCTGGATCAAGCCACATGCCGTCATCCTTCTTAACGCCAAAGGCATCATAGAATTCATCTTGGCATTGAACCGGAATATTTACTCTAGTTGGTTGTGGTGCATGGACATCAGTTTGAACTTCAGTCTTAATTGCTTCAGGACGTTGCTTTTCCATCCAGCTTCTAGCATAGTTTTCAAACAAGTCCTTCAAGTCTGCACCATCATTTTTACCTGCTTGGACAGCACAAGCTAAACCAGCAAGGTCAGCAATATTTTCAGAAACAACTTGCTTACCGTTAATCTTAGCAGGGCCATATTGCAAACCATCAAAGATATCGATCATTTGACCAACACGCTTATTAAATTCAGCGAAGTCTTGCTTAGTCCACCAGTTGTTCATATTACCGTTCTCATCAAACTTAGCACCATTGTTGTCGAAAGCATGAGAAACTTCGTGACCAATAGTAGCACCGATACCACCGTAGTTAGCACCTCTTGATTGGTTAATATCGTAGAATGGCTTTTGCAAAATACCAGCTGGGAAAGTTAAATCGTTTCTTTGCGGGTCGTAACAAGCGTTGTTCAAGTTACCTGGCATGAGCCATACTGAACGATCAACCGGCTTAGTCAATTTCTCTAGTTCATATTCGGTACGAACTTTATCCATAGCAGCTTCATTTTCATATAAGCTCTTTTCAGGATCTACTTGTAACAAATCAAAAATCTTTTCAATTTTTTCTGGATAACCAATCTTAAGAACCAAAGCATTTAACTTAACAATTGCCTTTTTCTTAGTTTCATCAGATAGCCAAGTATTATGGTTGATTCTTTCTTCATAAACCTTAAGCATGTTGTGGATCATATCTTCAACATCGTGCTTGGCTTTTTCACCAAAGTATTTCTTACCATAGAAAATACCAACCACTTCATCAAAGGCACTGTTAGCAATACGATAAGCTTGCTTAATTTGTGATGGCATTTCTGGTGCGCCTGTAATTGCTTGATTAAATGGAAACGCAGCTTCTCTAAACTTTTGTGACAAGTATTTAGCTACACTATTAATGTATTTAACCAGCATCCAACCCTTGATTTCGTCAAAGTTTTCAGCATTGATTAATTCTTCTGCATGGTCAAGAAAGCGTGGTTCCATCACAATTACCCGCTTTTGTTTTTCAGGTAAAAGATCGTCTAAGAATTGTGCCATGTTAAATGACTTGAATTTAGCAATGAATTTGTCGTATGAAACTGGGTTGTAAATAGCTGCGTCGTCGGCCCATTCTTCAGTAGACTTAACTACTTTAGCTACCTTAGCATCAAAGGCCAATGCATTATCAACGTATACTCTTGCTTCGGTCTTGCCAATACCTGCCATCATTAGCAAGTTAATGCTTTGATCTTGCAAAATGTCCAACAACTTTTTAGCATCGTCTGTCTTGTAAGTTGTTGTATCTGGCAAGAAGGTACTTGGACCACCAAAGTATAAGACATTAACGTCGGTATTCTTCATATCGGCATCGACGCCATAGACGAATGGCAAGATATATGGACCCATGTATAAGTCCTTAGCTTTTTTACCAAAGTCAGCAAAATCAGTTAATGATAATAACTTTTGCAAGTCATTTTGAATAGGATGTGCTTCTTCGGCATCCCTTTTCTCAAAATTCTTAGCAATCTTATATAAAGCAATTGCCTTATCAAAATCCTTAATCTCTGGCATCTTTTCTTTGCCTGCAGCAATATCGGCAAAGTCTTTCATCACTCTATTTTCAATCTTAATGTCTAGTTCTGAGTTGACACCAGTTGAAGTACGATCAGCAGGGATTTCAGCTTTAGAAAGCCACTCTGAATTAACAGCTAAATATAAATTATCTTGTGGCTTAGCATTAACGTCAGGTTCCGCAACATCGCCTGCACCACCACGAACAGCGTAAAATCTTCTCATTTTTCCTCCTTGATTAATATTCATAATTATTTCATTCCTTTTAATCATTATAACAAAAAGACGTCTGCATTCGTGCGAAACGTCTTTTATTAACTATTCTTTATTTTCATTGTTTTGTAAGGTGTACAAGTTGTAGTAGTAACCCTTCTTAGCGAGCAATTCTTCATGCGTACCACGTTCAATAATTCTACCCTTGTCGAGCACAATAATCTGATCGGCATCGGCAATCGTTGACAAACGGTGCGCAATGGCCAAAGTAGTTCTACCCTTACGCAGTTGTTTCAAGCCTTGTTGAATCAAAGTTTCGGTTTCGGTATCCACGTTAGCCGTAGCTTCATCCAAAACTAAAACTTTAGGATTGGTTACCAAAGTTCTAGCAAATGAGATCAATTGGCGCTGTCCTTGACTCAGTTCTTCGCCACCTTCGATTACCTTGGCATGATATTTACCTGGCATCTTTTCAATAAATTGATCGGCTTGAACAGCCTTAGCCGCATCCCTAATCTGTTCATCTGTAATGTTCTTATTATACAAGCGAATATTTGAAGAAATATCGCCATAAAACATGAATGGTTCCTGCAAAACGAGGCCCAATTTTTTACGCAATTCTTCTTTAGGATATTTTCTGATATCTACATCATCAATTAAGATCTCGCCTTTACCCAATTCGTAAAAACGCATCATTACATTGATGATCGAGCTCTTACCAGATCCAGTATGACCAACAATACCTAAAGTTTCACCTGGATTTACTACGAATGAAATATCATGCAAGATTTCATTTTTACCATCATATGAGAAACTAACGTGCTTGAATTCAATCTTGCCGCGACTGATAGTCAATCCTTCTTGAGCATTTTGCTTTGGCTCATATTGTTCATCATCTAAAATCGCAAAGATTCTCTTACCGGCTACGATACCATCTTGGAAGAAGGTCATCTGGTCCATCATAGTTGAAATTGGATTAAACAATTGTGACAAGTATTGTGAGAAAGCATATACCACACCAGCTGGCACAAATGTTGCCTGCAATGGAAAACCGAAGTACATCAATGTCACAGCTAAAGCCAATGAATAAAGCAAACTAGTCAAAGGTGAAAGAAGCAATGAGTTAATACGAATCATGTTAAAACGCGTATTCATCAACCCACCATTTTCACCCTCAAATTGGTTGGTCATTCTCTTTTCTTGATTAAACTGCTGAATAATTGAGACACCTTCAATTGACTCATTCAAGTTAGCGTTAATTCTACTCAAACGTTCACGATAGTTACGATAAAGCCGTGAACTCTTTTGCGAATAGAGCCAAATCACAAAGAGCAAAATTGGCACAAAGGCTAAAACGATCCAGCCGGCAATCACGTTGGTAGTAAACATTGCCACCAATGCCGTAACTACAGAAAATGCACCAATTACTACTGTTGAAAGCACAGTTAAGAAGTTGCTCAAAGTCATCGTATCGTTAGTAACTCTTGACACAATTGAACCGGCAGGAGTTTGGTCAAAATAGCGCATCCCTAACTTATGCAATTTCCGGTAAAGGGCCCTTCTGATACTCTCCAAGGTTTTTTCAGATCCAAGGGCAAAGAAATATTGGTAGGTAAACTGAATGATCGCTTTTAGAATCGAACCTATTGCATAAAGTAAGCCTGCAAACAAAATAATCTGGACAGTTGCGCTTTGTTTAAGCAGAAAGTTGTCTAAGAAATACTGCAATCCACGCGGCAGCAAAATATTAATGATGCTGACTAAAAATGCCCCAATCAATGCAATGGTCATTTCAAATTTAAAGTGCTTAACAAAGCCCATTAAACGTTCAAAAATCGCAACCTGTTCTTTAAATGGGATTGCTTTAGACCACACCGATTCTTGCTTTTCTTCATTATTCATCTACATCTTCACCCACCTTTGCTTGCAATTCTTGTCTACGCCACATTTCGGCATACCAGCCATTTTCTTTGAGCAGATCTTGGTGATTGCCTCGTTCAATAATTTGACCGTCTTTTAAGACTAAGATCAAATCGGCATCCATGACCGATGTCAATCTATGTGCCGCAATCATCGTGGTCTTATCTTTTCTTTCATTTCTCAAAGAAGTTAGAATTTCGGTTTCAGTCTTTGCATCAACCGCTGATAAAGCATCATCCAAAATTAAAATTTGGCTATCTTTAAGCAAAGCACGGGCAATGGACATTCTTTGACGTTGACCACCAGAAAGTGAAAGTCCATTTTCACCAACTAAAGTCTCATAGCCATGTGGCATTTGTAAAACATCATCGTGCAAATCACTCTTCTTAGCAGCTGCCGCAATCTCATTTTTGCCTGCATCTGCCTCTGAGAAAGCAATATTTCTACCAATACTCGTTGAAAAGAGGAAGTTGTCCTGGGGAACATAAGATATTTGGCTAAGTAAAACTTTAAGCGGAATTTTCTTGATATTGATGCCATTTAAGGTGATTTCACCGTCATATTGGTCGAATTCTCTCAATAGCAGCTGAATGATTGTCGTTTTACCAGCACCTACACGACCTACTAAACCTAAAGTTTGACCAGGTTTTAAGGTGAAATCGATGTTCTTTAACACTGGAATATCCTTTTCATCAGGATAAGCAAACGATTTAATATCAAAATGCAAATCACCTTGTAAGTCTTGAGCCTTAATACTTTGATCAGCGTGGGCATCAGTAATTAACGGTTTTTCATAAAGTAACTTTTCTACACGGTCGTAGCTCGCACTACCTCTTTCCAAAATATTGAACAAATAACCAATCGCAAACATCGGCCAAACCATGTTGGCAATATAGGCAATAAAGGAAACGAGCTGACCAATCGATAGCACTTTATTTGTAACTAAAAGTCCACCATAAATAATGGTAATTACATAGGTTGCACCAATAACTGCCGTACCCAATGGATCAAACAAGGAGTCCCAAACAAACACTTTTTTATTGATGTTAATAGTATCATCAACCATCTTGTCAAAAGCTTGGGTATCTTCTTTTCCTTGACCAAAAGTTTTAAGTACCTTAATGCCTGAAACTGATTCTTGAGTCTTATTATTCAAGCGAGAAAAGGCCGCCTGTGACTTATCAAAAGCATCGTGTAAATGATCCCCAAGTTTCCAAGCACCTAATGCTAAAAATGGCAGTGGCAATAAGGCAACGATGGTTAGTCGCCAATCAACGAAAATGATCATGGCAATCATTGTTGATAAGCCCATTACAAGTGAGTCAACCAAGGTTAAAACACCATCACCGGCTACGTTTTGAATTGCAGTCACATCGTTAGTTGCATGGGCCATCAAGTCACCGGTTCGGTGGCGCTGATAAAATGTTCTGTCCATGATCATAAAGTGGTCGAACAGTTTAGAACGCATTTGTCGTTCTAGTTCTGCGGCCCCGCCCCAAATTTGCTTACGCCAAAAGTAGCGCAAAATATATAAAACGAAGGCGGCAGCCAAAACAGCTAGAATCAACATCCCATATTGCCCCCATGAGATGTGACCTTGATCCAGCTGATCCGCCATCAGTCCTAATACTCTTGGCGGAATTAAGTTGGCTAAAGAGGTTAAAGCAAGAAAAGTAATTCCGATAATATAACGTTTCCTTTCTTGTTTAAAAAACCAACCCAGTTTTTTAAATATGTTCATTTACTCACTCCATTCTTTAAAAAGAGCACCAAAAAAGACGAAGGAATAATCCTTCGCCTTAACACCACAAAATAATCTAATAATTACTTTTGTTGATGCTTCATCATGTTCATTACTTGATGAACCTTCTTGTTAGAAGGCTTTTGTCCCATTTGAGACATCATAGCAACGATCATATCTTCACTAATAGGAGGATTTTCCTTAAAGTATTTCTTCATGTATGCTCTTGCACCATAAAAACCTGCTGTTGCACCAACTAAAAGTGCAATAATAATCAAAAATATTGCTAAACCTAAATTCATATTATATCCTCCATCTAACTAACAATAATATCTTACATAATTTTTCATATTTTTTAAAGGCTTAATTAATCTTTTCTAAGACCTTTTCTTCTTTGAGCGCGTTTAGCTCTTTCAGAAGTAACTTCTTTACCATTCTTATCGATGATTACCATATCCTCGACATCTTGCTTAAAAGCAGCTCTGAAGTTTGCGATAAATTTCTTATGAAGTTCCTTACGCTCTTGTTCCTCTGCAGGTGTTAAGCCTTCTCTTTCCTTCTTGTGGTAAAGTTCATTAATGCGATCAGTTACCTTTTTTTCTTCTTCTTTACTCATAATATCTTTATTCATAATATCCACCTCACATTGACAATCATTAGTCTACCTTATTTTTAAACTAAAAAAAGGTATAATTCTTAGAACGCTTGTTTGAGATTATTTGATTTTTCTGTTAAAATTACTTATAAGAATAAATAACTAAAATTAACGGAGAAAATGAAAATGCCTAAGAAAAACAGTGAAACTAAACAATTAGAAATTTTACGCTATATTTATGACACTGTCGAACATCGCGGCTTCCCACCTACAGTGCGTGAGATTTGTACCGCAGTTAATCTATCTTCAACTTCAACTGTCCACGGACATTTAGCTCGTCTTGAGCGCAAGGGCTTATTAATTAAGGACGCTACTAAGCCACGTGCCTTGGAAATTACCCCTGAAGGTAAAGATGCCCTTGGCATTAAGCCAAAGGAAATTCCAGTTGTCGGCGTGGTTACCGCAGGTCAACCAATCCTTGCAGTACAAGACATTGATGAATACTTCCCACTTCCACCAGATTTGGAAAATGATGCTGGTGAACTCTTCATGCTTAAGGTTCACGGTGAAAGTATGATCAACGCTGGTATTCTTAATGGCGATAGCGTCATTGTCAGAAAACAAAATTCAGCTAACAACGGTGAAATTGTTGTTGCTATGACCGAAGAAAATGAAGCCACAGTTAAGCGTTTCTACAAGGAAAACGGTCATTACCGCCTCCAACCTGAAAATGATACTATGGATCCAATCATTTTGCCAAAGGTAAGCATTTTGGGTAAGGTTGTTAGCCTTTACCGTAACAATATTGATTAATAATATTCACGAAATAAGCTAAATTGAATTTAGCTTATTTTTTTAGCAAAAGATTGTTTAGCTGCTCTTCAAATAAAGGCATCCATTTCTCTAAATAGCCAGCACGCGTTGGATGGATTTCATCAGCCATATAAAGTGAATCTTGCTTTTTAAATTCGGAATTATGATAAAGATCCAGTACTTGAACTCCCCATTTTTTAGCAAGTTCAAGCGTACTATCTACCATTTGTTCATACAAATTACTTTCAAAGTATGGATTGGTATAAATCAAAATAGGACAATCCCACTTTTTGTGTGCTTCATTGATGATATACTCAATAGAACCAATAATGGTTTGCATATCAAAAGTATCATCTGAACTGATTTTACCCAGTTTTTGATTGGTATTGGCATCATTAGTTGACAGTTGCAAGACAAGCATGTCTGGCTTATCTTCTTTAAGCTCTTCCTTAAAACGGGCAACGTATGAGTCACCATGATAGCTTGTATCTTGATCTACCAAAGTCGTGCCATTTTCAGCATCTTTAATAGCGTCGATTCCGTCTTTCTTCCATAAATAATCTACAAATGATTCACCTAAGGCACCAAAGCCAAAAGTAACTGATGAGCCTAAGAACAAAATCTTCTTATGACGCAATGGACTTTCAATCGTTTTAACACTATTCAAACTATACTTGCGGCTGTTGCCAGGCAAATATGCAATTTCAGTATATTTACGTTGATTTAGGGGATGCTCATCCAAATCCATTCTCAACTCATTTAAGGCAACATCAGACAAATCATTATTTTGTCTAATAGGTATTAAAAAAGTTTCGTTACTCATTATTTTTCCTTTTCTAGATAAGGCTTAATTATATTCCAATCTTTAATAATATCATCACTTAGTTTGCGATTGTAAAAAACAGCAGCTGGATGATATTCAGGAAAGATCGTGTATTTTTGTTTTGACCAAACATAACCATCTTTTTGTGGATTGAGCCGTAAAATTGGCGTATTTTCAATTACTTTGCCATGTTCTTCCGATATTTTATGTCCTGGACCAAGTAAACGTTCCAATCCGGTATTGCCTAAAGCGACTATTAATTTAGGTTGCGCATAATCTATTTCATAGTCTAAAAAAGGTGCATGCGCTAAAACTTCCTTTTTGGTTGGCTTGCGATTGGGATACTTAGTAACTTCCTTATTCTCACGTTTACTAAAGACCTTTTTAATTGCATATGGTCTGCTTCTGACTGCACTAGTAATGTAGACATCATCTCTAGTTAAGCCGATCTGTGCTAAAGATTTATTTAATTCTTTGCCTGCATCCCCACTAAAAGGAATGTTATTAACTATTTCATTTCTGCCGGGTGCTTCACCAATAATCATCAACTTAGGATGCTTAGGTCCACTACCTGAATTGATTCCTTCAAGTCGCATACCTGCAGAACGTTTTTTAACTTCATCTATCAACTCTTGAGGATAGTTCAACCACTTTTCCTCCCTTCATTTTTTTCTAACATTTACATTATAAGTAATGCCGACCACTATTTCACTTTTTTGCTTTTTAGAGTATAATAAACGCAATTTAATACTAAACAAAAGCCGCCGGTGTGGATAACTGGTGGCTTTTGTTTTAATTCCACATAGAATGTTGATTTTTCAAAATTAACTTTCACAAGCCGTAAATCTCTTTATATCAGCAATTATAAGTATTATTTTTTCACTTTGTGATAGAATATGCATGTTACACAGACGTTTCACACTTAATTATTTTCACAAACTAATGTAAAAAAATAGTCGATTCTTACGAATCGACTATTTTTCATTTTATTAACGACGTGCTTCAGCAATTCTTGCTGACTTGCCGTGACGTTCACGTAAGTAGTAAAGCTTAGCGCGACGTACACGACCATGACGCTTAACTTCTACCTTAGCAACACGTGGGTCGTTAACTGGGAAAGTTCTTTCAACACCAACACCTGAAGCAATCTTACGAACAGTGTAAGTAGCACCGATGCCAGTACCCTTCTTCTTAATTACAACACCTTCGAATAACTGAATACGTTCGTGAGTACCTTCAACAACACGTACGTGAACAGTAACAGTATCACCTGCACGGAAAGCAGGAATATCGTCACGAATTTGTTCTTTAGTTAATTCTTGAATTAATGGATCCATAATATATTTTTCTCCTTCTTCGGTCTTCATCAACGCACCAGCGTCAGCGGAACACCCATAACTTATAAATTCCGTGGAAAATCCACGCTTACTAGTATAACAAGGTTAGACTAACTGTTCAAGTTAAAAATCAATTACTAACCTAGCTTTTTTTATCTTTGTAGTAAAATATTTTTTATAAATCAAGGAAGAGTAATGAATAAAATAGTAAATTTTTTGGGCAAGCCACGTACTATCAAAAATGTGATTAAAAACCGTAAAACTTCTTGGTTAGAATTATTCTATGATTTAATTTTTGCTGTAGTCTTTTCACGACTAACGGAAAGCTTACTAGAACATCAAACACTAACCGGATTTATTAATGCAATCCTGACATTTTTCTGGCTAATTTGAGGTTGGGGCGAGTTCAGTGGTTATTTTGATAACCATGGTAACGATTCAATTATCAACATTTTAATTATTAATATTGATATGATCTTAATTGGCTTAGCTAGTATTTTCATCCCTGAAGCAGTAAACGGTAATTTCCATCATATAACTTGGCTATATATCATCTTAGAACTGTTTATGGGTGCTGTCTGGATCGGTTTAGGAATCTTTGATAAAACACACGGACCAGCTGCCAGAGTTTGGGGTATAACTTGTTACAGCTGCAATCATTGCCTGCGGCCTCTTTTTTAGTAATCAGCTTTTATTCTGGTTCTCAGTAGTCGGAACTCTAATCAATATCTTTGTAGTTATGTTCGCTAGTTCTCCTCTTGAGCGAGAATATGAAAGAACCAACATGGTTCATATAATCAAAGATTCATTGATTGAACGCTACGGTCTAATGACAATGATCGCTTTAGGAGAAATCATTTCAAGTCTTTACGACTTTAGCAAGACGCCGATCAATTGGAACCGATTCATTCAATTCACTCTTTGTATAATTTTGGTTGCGCTATTGGCAGCTGTATATTATCAAGTTTTAGGTGAACTTCATATTCAACTAAATTCTTCGATTGCTACATCTCTTACAGGCTGGTTATTTTTATTAGTAATTCTTTTTATCTTCTTAACTGACGTTAGTCTTCACCTAGTAATCATTGATGGGAATCTTGAAAGCAAAATTTTATTTAGTTTTTCTTTGATCCTCATGCTGCTAATGATCCGAGTTTTGTTTTTAATCAGTATTCATTTCAAATTGTCAAAACTTCAGATCAAACTTAGCTGGATTCTCCTGATTGAAATGATCATCAATCTAGCAGCTGCCTTTTTACCAGCCATGGGGTGATCCTTTTAGATATTCTTGTTTTAATGAAAAACGCCCGAAAATCCCCGTATTTTAATGCGGGGATGGATAGGGCTTGCCGTCTCTAACGGCATTGTAGATCTGGTCATTAATGTACTTTTCTACTACTTCTTTAGACATATTCCCTACCGTGCCAAAGTAATAGCTGGGTGACCATAAATGTCCGCCCCACATCTTGTTTTGCTTAATTTCGGGATGATTCTTGAAGAATAAAAAAGCACTTCTTCCTTTGAGTGCTTTGACTACGCTTGAACCTGATTTTGATGGTCTAAAGCTGATAAGCAAATGAATATGTTCAGGCATTACCTCCATTTTTTCAATTTTGATTTGATTATCTTCAGCTATTTGCTTGAGCAGGTCTCGCATTTCTTGAGCCAGGTCTTGGTTAGTAAAGATTTGATTGCGGTATTTAGTACACCAAATTAGATGATAGTGTGCATTATAGACGTAGTGTTTTTCATAGCCTGCGTCTTTTATTTTGTCTCTCTTATTATTCATAGTCTAATATCCTTTCGTCTATTATGTATCTATTTACTTTTGATTATAAGATATACATAGGCTATAATCAAATTACAATAATACAAGGAGGTGAAATACATGAAAAGAATGAGCAGTTTAGCCTATCATTTTGGCGTTAAGCTTCGCTTTTATCCTAGTTCTAAGCAAAAGAAAATCATTAAACTAAACTATGATGCGCAGCGCTTTGTCTACAACTCTTATGTAGGACGCAATCGGACTAGCTACCATGCTAAACGTTATTTAGCTAATAGGCAAAATCGAGCAATGCCCTTTGCTTTTTCTGCTCTTAATAGCTATGAAGTCCAACTGGCAGAAACAGTAGTCATCAATAATGAATTATTGGCTAAGCCTAAGAATATTCGTGATGCTTATAGCTTTTTACGTGTTAAAGAAATTGATAGTCTCGCTCTTGCCAATGCGATTCAAAACTATCAGAAAGCTTGGAACAACTATCGCAAGATTGGTTATGGAATTCCGACTTTTCACAAAAAACGCAGCGACTGGTCATATCAGACTAACTGTCAATATCCTAAGCAGTCAGAAGCCTATCTTGATAACGGGACAGCTAGATTTATTGATGCTAAACATATTAAATTGCCTAAGCTGGGAATTGTCCGCATTGCTGGTTTTAGAAAACTGATTAAAGAGCGCTTGCTTAAGCAGATTCCAACTAGAATTGGCACAGTCACGATAAAAAAGACCGCTGATGACCAGTTCTACCTGTCTATGCAATTAGGCAGCGATACTGCTTTTGTTAAGGAATTACCTAAAACACAAAGTCAAATTGGCATTGACCTCAATTTAGATAACTTCCTAACAGCATCTAACGGAGCAATGGTCGCTAATCCACGATTTTATTGCAAAACCAAAAAGAAGCTGGTCCATGCCCAACGTGTCTTGTCTCGCAGGCAGCGCCGGGCAAAAAAAGAAGGACGCAATTTGTGGCTAGCAAAAAACTATCAAAAACAGCGTCTAATAGTCGCTAAACTGCACGATAAGATCAGAAGACAGCGTAATGACTTTTTACAAGTACTCTCAACTGCACTAATCAAAAACCACGATTTAGTAGTTGCCGAGGAATTAAGAAACAGAAACCTGTTAAAGAATCATGCCTTGTCGCAATCAATTTCTGATGTCGGCTGGCGTAGTTTCCTGAATATGCTGGCTTATAAGGCAGATCTATATGGTAAAGAATTTCTAACAATTGATCCTAAATATACTACTCAACGCTGTCATGCTTGTGGCAGTATTATGGGCCAAAATGGTTATAAGAAATTAACCCTTAAGGATCGGGAGTGGACTTGTCCAATTTGTCGAATGCCCCATATTCGTGATTGGAATGCGGCAGTGAACATCTTAGAAAAAGGATTAAGCAAGTGGCAAAATCCTAAAATAAAAAAAGCAGCCTAG
Protein sequences of DBSCAN-SWA_2 >NC_017470|1286112:1298493|1296597_1297062_-|WP_013641431.1|transposase|DBSCAN-SWA MNNKRDKIKDAGYEKHYVYNAHYHLIWCTKYRNQIFTNQDLAQEMRDLLKQIAEDNQIKIEKMEVMPEHIHLLISFRPSKSGSSVVKALKGRSAFLFFKNHPEIKQNKMWGGHLWSPSYYFGTVGNMSKEVVEKYINDQIYNAVRDGKPYPSPH >NC_017470|1286112:1298493|1295468_1295666_+|WP_014565955.1|DBSCAN-SWA MNKIVNFLGKPRTIKNVIKNRKTSWLELFYDLIFAVVFSRLTESLLEHQTLTGFINAILTFFWLI >NC_017470|1286112:1298493|1286112_1288056_-|WP_014565950.1|DBSCAN-SWA MRRFYAVRGGAGDVAEPDVNAKPQDNLYLAVNSEWLSKAEIPADRTSTGVNSELDIKIENRVMKDFADIAAGKEKMPEIKDFDKAIALYKIAKNFEKRDAEEAHPIQNDLQKLLSLTDFADFGKKAKDLYMGPYILPFVYGVDADMKNTDVNVLYFGGPSTFLPDTTTYKTDDAKKLLDILQDQSINLLMMAGIGKTEARVYVDNALAFDAKVAKVVKSTEEWADDAAIYNPVSYDKFIAKFKSFNMAQFLDDLLPEKQKRVIVMEPRFLDHAEELINAENFDEIKGWMLVKYINSVAKYLSQKFREAAFPFNQAITGAPEMPSQIKQAYRIANSAFDEVVGIFYGKKYFGEKAKHDVEDMIHNMLKVYEERINHNTWLSDETKKKAIVKLNALVLKIGYPEKIEKIFDLLQVDPEKSLYENEAAMDKVRTEYELEKLTKPVDRSVWLMPGNLNNACYDPQRNDLTFPAGILQKPFYDINQSRGANYGGIGATIGHEVSHAFDNNGAKFDENGNMNNWWTKQDFAEFNKRVGQMIDIFDGLQYGPAKINGKQVVSENIADLAGLACAVQAGKNDGADLKDLFENYARSWMEKQRPEAIKTEVQTDVHAPQPTRVNIPVQCQDEFYDAFGVKKDDGMWLDPEDRIVIW >NC_017470|1286112:1298493|1292486_1293113_+|WP_013438156.1|DBSCAN-SWA MPKKNSETKQLEILRYIYDTVEHRGFPPTVREICTAVNLSSTSTVHGHLARLERKGLLIKDATKPRALEITPEGKDALGIKPKEIPVVGVVTAGQPILAVQDIDEYFPLPPDLENDAGELFMLKVHGESMINAGILNGDSVIVRKQNSANNGEIVVAMTEENEATVKRFYKENGHYRLQPENDTMDPIILPKVSILGKVVSLYRNNID >NC_017470|1286112:1298493|1293935_1294583_-|WP_014565954.1|DBSCAN-SWA MNYPQELIDEVKKRSAGMRLEGINSGSGPKHPKLMIIGEAPGRNEIVNNIPFSGDAGKELNKSLAQIGLTRDDVYITSAVRSRPYAIKKVFSKRENKEVTKYPNRKPTKKEVLAHAPFLDYEIDYAQPKLIVALGNTGLERLLGPGHKISEEHGKVIENTPILRLNPQKDGYVWSKQKYTIFPEYHPAAVFYNRKLSDDIIKDWNIIKPYLEKEK >NC_017470|1286112:1298493|1288149_1289946_-|WP_014565951.1|DBSCAN-SWA MNNEEKQESVWSKAIPFKEQVAIFERLMGFVKHFKFEMTIALIGAFLVSIINILLPRGLQYFLDNFLLKQSATVQIILFAGLLYAIGSILKAIIQFTYQYFFALGSEKTLESIRRALYRKLHKLGMRYFDQTPAGSIVSRVTNDTMTLSNFLTVLSTVVIGAFSVVTALVAMFTTNVIAGWIVLAFVPILLFVIWLYSQKSSRLYRNYRERLSRINANLNESIEGVSIIQQFNQEKRMTNQFEGENGGLMNTRFNMIRINSLLLSPLTSLLYSLALAVTLMYFGFPLQATFVPAGVVYAFSQYLSQLFNPISTMMDQMTFFQDGIVAGKRIFAILDDEQYEPKQNAQEGLTISRGKIEFKHVSFSYDGKNEILHDISFVVNPGETLGIVGHTGSGKSSIINVMMRFYELGKGEILIDDVDIRKYPKEELRKKLGLVLQEPFMFYGDISSNIRLYNKNITDEQIRDAAKAVQADQFIEKMPGKYHAKVIEGGEELSQGQRQLISFARTLVTNPKVLVLDEATANVDTETETLIQQGLKQLRKGRTTLAIAHRLSTIADADQIIVLDKGRIIERGTHEELLAKKGYYYNLYTLQNNENKE >NC_017470|1286112:1298493|1294925_1295273_-|WP_003629071.1|DBSCAN-SWA MDPLIQELTKEQIRDDIPAFRAGDTVTVHVRVVEGTHERIQLFEGVVIKKKGTGIGATYTVRKIASGVGVERTFPVNDPRVAKVEVKRHGRVRRAKLYYLRERHGKSARIAEARR >NC_017470|1286112:1298493|1293147_1293936_-|WP_014565953.1|DBSCAN-SWA MSNETFLIPIRQNNDLSDVALNELRMDLDEHPLNQRKYTEIAYLPGNSRKYSLNSVKTIESPLRHKKILFLGSSVTFGFGALGESFVDYLWKKDGIDAIKDAENGTTLVDQDTSYHGDSYVARFKEELKEDKPDMLVLQLSTNDANTNQKLGKISSDDTFDMQTIIGSIEYIINEAHKKWDCPILIYTNPYFESNLYEQMVDSTLELAKKWGVQVLDLYHNSEFKKQDSLYMADEIHPTRAGYLEKWMPLFEEQLNNLLLKK >NC_017470|1286112:1298493|1297161_1298493_+|WP_014565958.1|transposase|DBSCAN-SWA MKRMSSLAYHFGVKLRFYPSSKQKKIIKLNYDAQRFVYNSYVGRNRTSYHAKRYLANRQNRAMPFAFSALNSYEVQLAETVVINNELLAKPKNIRDAYSFLRVKEIDSLALANAIQNYQKAWNNYRKIGYGIPTFHKKRSDWSYQTNCQYPKQSEAYLDNGTARFIDAKHIKLPKLGIVRIAGFRKLIKERLLKQIPTRIGTVTIKKTADDQFYLSMQLGSDTAFVKELPKTQSQIGIDLNLDNFLTASNGAMVANPRFYCKTKKKLVHAQRVLSRRQRRAKKEGRNLWLAKNYQKQRLIVAKLHDKIRRQRNDFLQVLSTALIKNHDLVVAEELRNRNLLKNHALSQSISDVGWRSFLNMLAYKADLYGKEFLTIDPKYTTQRCHACGSIMGQNGYKKLTLKDREWTCPICRMPHIRDWNAAVNILEKGLSKWQNPKIKKAA >NC_017470|1286112:1298493|1292072_1292336_-|WP_013438155.1|DBSCAN-SWA MNKDIMSKEEEKKVTDRINELYHKKEREGLTPAEEQERKELHKKFIANFRAAFKQDVEDMVIIDKNGKEVTSERAKRAQRRKGLRKD >NC_017470|1286112:1298493|1291788_1292007_-|WP_013438154.1|DBSCAN-SWA MNLGLAIFLIIIALLVGATAGFYGARAYMKKYFKENPPISEDMIVAMMSQMGQKPSNKKVHQVMNMMKHQQK >NC_017470|1286112:1298493|1289938_1291705_-|WP_014565952.1|DBSCAN-SWA MNIFKKLGWFFKQERKRYIIGITFLALTSLANLIPPRVLGLMADQLDQGHISWGQYGMLILAVLAAAFVLYILRYFWRKQIWGGAAELERQMRSKLFDHFMIMDRTFYQRHRTGDLMAHATNDVTAIQNVAGDGVLTLVDSLVMGLSTMIAMIIFVDWRLTIVALLPLPFLALGAWKLGDHLHDAFDKSQAAFSRLNNKTQESVSGIKVLKTFGQGKEDTQAFDKMVDDTININKKVFVWDSLFDPLGTAVIGATYVITIIYGGLLVTNKVLSIGQLVSFIAYIANMVWPMFAIGYLFNILERGSASYDRVEKLLYEKPLITDAHADQSIKAQDLQGDLHFDIKSFAYPDEKDIPVLKNIDFTLKPGQTLGLVGRVGAGKTTIIQLLLREFDQYDGEITLNGINIKKIPLKVLLSQISYVPQDNFLFSTSIGRNIAFSEADAGKNEIAAAAKKSDLHDDVLQMPHGYETLVGENGLSLSGGQRQRMSIARALLKDSQILILDDALSAVDAKTETEILTSLRNERKDKTTMIAAHRLTSVMDADLILVLKDGQIIERGNHQDLLKENGWYAEMWRRQELQAKVGEDVDE >NC_017470|1286112:1298493|1295931_1296546_+|WP_082231542.1|DBSCAN-SWA MACGLFFSNQLLFWFSVVGTLINIFVVMFASSPLEREYERTNMVHIIKDSLIERYGLMTMIALGEIISSLYDFSKTPINWNRFIQFTLCIILVALLAAVYYQVLGELHIQLNSSIATSLTGWLFLLVILFIFLTDVSLHLVIIDGNLESKILFSFSLILMLLMIRVLFLISIHFKLSKLQIKLSWILLIEMIINLAAAFLPAMG >NC_017470|1286112:1298493|1295714_1295954_+|WP_082231541.1|DBSCAN-SWA MINILIINIDMILIGLASIFIPEAVNGNFHHITWLYIILELFMGAVWIGLGIFDKTHGPAARVWGITCYSCNHCLRPLF |
14 | Bacillus_phage(25.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1425377 : 1483434
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_017470|1425377:1483434|DBSCAN-SWA TCTAGGCTGCTTTTTTTATTTTAGGATTTTGCCACTTGCTTAATCCTTTTTCTAAGATGTTCACTGCCGCATTCCAATCACGAATATGGGGCATTCGACAAATTGGACAAGTCCACTCCCGATCCTTAAGGGTTAATTTCTTATAACCATTTTGGCCCATAATACTGCCACAAGCATGACAGCGTTGAGTAGTATATTTAGGATCAATTGTTAGAAATTCTTTATCATATAGATCTGCCTTATAAGCCAGCATATTCAGGAAACTACGCCAGCCGACATCAGAAATTGATTGCGACAAGGCATGATTCTTTAACAGGTTTCTGTTTCTTAATTCCTCGGCAACTACTAAATCGTGGTTTTTGATTAGTGCAGTTGAGAGTACTTGTAAAAAGTCATTACGCTGTCTTCTGATCTTATCGTGCAGTTTAGCGACTATTAGACGCTGTTTTTGATAGTTTTTTGCTAGCCGCAAATTGCGTCCTTCTTTTTTAGCCCGGCGCTGCCTGCGAGACAAGACACGTTGGGCATGGGCCAGCTTCTTTTTGGTTTTGCGATAAAATCGTGGATTAGCGACCATTGCTCCGTTAGATGCTGTTAGGAAGTTATCTAAATTGAGGTCAATGCCAATTTGACTTTGTGTTTTAGGTAATTCCTTAACAAAAGCAGTATCGCTGCCTAATTGCATAGACAGGTAGAACTGGTCATCAGCGGTCTTTTTTATCGTGACTGTGCCAATTCTAGTTGGAATCTGCTTAAGCAAGCGCTCTTTAATCAGTTTTCTAAAACCAGCAATGCGGACAATTCCCAGCTTAGGCAATTTAATATGTTTAGCATCAATAAATCTAGCTGTCCCGTTATCAAGATAGGCTTCTGACTGCTTAGGATATTGACAGTTAGTCTGATATGACCAGTCGCTGCGTTTTTTGTGAAAAGTCGGAATTCCATAACCAATCTTGCGATAGTTGTTCCAAGCTTTCTGATAGTTTTGAATCGCATTGGCAAGAGCGAGACTATCAATTTCTTTAACACGTAAAAAGCTATAAGCATCACGAATATTCTTAGGCTTAGCCAATAATTCATTATTGATGACTACTGTTTCTGCCAGTTGGACTTCATAGCTATTAAGAGCAGAAAAAGCAAAGGGCATTGCTCGATTTTGCCTATTAGCTAAATAACGTTTAGCATGGTAGCTAGTCCGATTGCGTCCTACATAAGAGTTGTAGACAAAGCGCTGCGCATCATAGTTTAGTTTAATGATTTTCTTTTGCTTAGAACTAGGATAAAAGCGAAGCTTAACGCCAAAATGATAGGCTAAACTGCTCATTCTTTTCATGTATTTCACCTCCTTGTATTATTGTAATTTGATTATAGCCTATGTATATCTTATAATCAACAATAAATAGATACATAATAGACGAAAGGATATTAGACTATGAATAATAAGAGAGACAAAATAAAAGACGCAGGCTATGAAAAACACTACGTCTATAATGCACACTATCATCTAATTTGGTGTACTAAATACCGCAATCAAATCTTTACTAACCAAGACCTGGCTCAAGAAATGCGAGACCTGCTCAAGCAAATAGCTGAAGATAATCAAATCAAAATTGAAAAAATGGAGGTAATGCCTGAACATATTCATTTGCTTATCAGCTTTAGACCATCAAAATCAGGTTCAAGCGTAGTCAAAGCACTCAAAGGAAGAAGTGCTTTTTTATTCTTCAAGAATCATCCCGAAATTAAGCAAAACAAGATGTGGGGCGGACATTTATGGTCACCCAGCTATTACTTTGGCACGGTAGGGAATATGTCTAAAGAAGTAGTAGAAAAGTACATTAATGACCAGATCTACAATGCCGTTAGAGACGGCAAGCCCTATCCATCCCCGCATTAAAATACGGGGATTTTCGGGCGTTTTTCATTAAAAAGGTGTGTGACTTAGTTTCTGGTGCGAAGCTCTTGCCCCAGTGACCTGAAGTCTTGAAGTGTTTAAACCAACTAGTAGCACTTCCTGCACCAGGTCGTGGCGTCTTGTCTGCCTTGACAATACCTTCGTTATCATCTGATCTTACTTCAACAGTAACATGGGCACTGGCTTGGAAGTTGCCGTGCCTGAAAGTTCTCGTTGAACCGATCCAACATGCAGGTCCTCTACGTGTAAGTCTTTGGTAAATCCCGAAATTACCATTACCAATCATCTTTGTTTGGAAAGGACTCAATTCCTTAGCTGCAACTGATTGTGGTTTAATAACAAACAATGTTAGAAGACACGCTAAAATAAAAGAAATGCTCCATTTCTTTTCCATATCTATTTTCTACTCCCCTACTTTTAACTTGATAGCACGACTAACATTGTAATTCTTTATACTGATTTATGCTAAGCTTGTAAGCAGTAGCATTTTAATTTTTGTTAATATATTAATTTATATAAGATGCAAATAATAACACTATTAAGTTTTTAAAATATAAAAAAGACGCTACATTCGTAGCGCCCTTTCATTTTATTAATAAATTCTAGTTTTCAACGATCAAATTCTTAAGATCATCTGGTAAGTTAAGTGGAGCATACTTAGCACTAACAATCTTCAATGCTGATTGCTTTACGTATCTCTTCTTGCCAATGTAAACGTATCTAGTCTTACCGTTCTTAACAATCATTGATGACTTGTACTTCTTGCCATACTTAATAGTCTTCTTGCTTGCAGCGAAGGTAATCTTCTTTGACTTCTTACTCTTCTTGTAAGCGTAGTAGATCTTAGCACCCTTCTTTGACTTAGGGGTTACAGTAATGTATTGCAAGCTTCTTTTGTAAGCATCTTGAATGGCCTTTACATCGTTGTTTGCTTGTTCAACAAAGGCTGATTCAGTTGGAACATTGCCTGAAGTAGTTACATCAGTTGATGGAATGAAGACAAAGCTGTCTTTACCATTGATTGTTCTATTTTCAATGTAGTACAAAGTTACGGTCTTGTCGCCTTGCTTAGCTTCAGTTGACCATAAAACATTAATGGTTTGACCCTTAGTATAATTTGAATCAATTTGGTCTGGATTGGCTTGATCCTTAGCTACGACAACAAGCTTTGGATTATCACTATTTACAATAGCTGAAACAGTTCTATGTTTTACACTAACATTACTAGTCTTAGTATTTGAAGATTGAGTGGTGCCTGCGGCATGAACTGGTTGACCCATTGCCATGCCAGTACCAGCAAGACCTGCTGAAAGCATCACAGCAGCTGCTAATGAAGTAAATAATTTTTTAGTTTTCATTTATAAAATCTCCTAAAAACAAAACTCTTATAATTTAATTATAACATTATTTAGGAAATTGCATTAATTACTTCCCCGCAAAGAGCTGGCCGATTGTTTCCATATCTTCTGGGGCAATCGTAAAGTCAGTTTGCGGATTTTCAACGATATCATCTACTTGAACAATTGGCACAATTCTCTTTTGGGCAAAGTAGCGCAAAACTAATTCCATCGTGCTTGTACCATAATGGCCGGCGATTTCGGCCAATGCATCAATATTGCCTGCAACCGAAAGTTCCACTTGCATCTTGTTTTCTCTAGCTAATTTGATTAAAGATCCATCTTCGTAATCCAAATGAATTACGCTGGGCTTAATCTTAGGTGCTTTCAAAATAGCAGCCAAGGTATCACCATTAGCATTGGTTACACCAATATTCTTGAGCCAGCCTTGGATTTTAACTTGTTCTAATTCTTGCCAAAGTTGATTATTACGTTCGGCGTCATCACTTAAGCGAAGCATAGCCAAATCGGCATGTTTAGCAACTAGATTATTTCTGATTTCTTTCAAATTATTACGCATCTCTTCACGCGTATTAACTTTTTCAGAAACCTGAATTTCCACGTAAGTTTGTGGACTCAGATTTTTAAAATCGATTTTTTCATCCGCAGTGCAATTAAGCAAACGATAGCCTGCCTTCATTGCCCTTGCGACATCTTCATCACTGGTAACAGTGACGCCAACTTTTGGCATCAAGCTACCATCATTCAAAACAACTGCTTCATCTAATAATGACATTAAATCATTTCTCTCCTTAAGCTAATTCAACATAAATTTTGATAGCTTCTTTGACAGTTTGATAAACAGTTGAGACATGCAAAGTATGAGCTAAAAACTTCTCCTTCTCATCCTTTGACATTTCTGAGTCAAAGAAGACTTTTATGTTCATTTCAGTTACGACCGATTTGCCGTGACCTAGATTTTCAGTTTTAGCATTATTTTCTACACGAAAATTTTTCACGTCTAAACCATGTGACTTAGCGACCATTCCTGCGGACATTACGATGCAGGAATTTACGCTACCACATAGATATTCGACCGGGTTAGGGCCCTTATCATGTTCCTTATCGTTGTCATCACAGATAAATTCATGCTCACGGGCCTGGTTAGCAATTTGCCATTCTGTGTCTTCAAGCTTGGACTTTACTAAATATTCACTCATAATGTTGATCCTTTCTTCGTAAATTAATGGTGGTCAATTGCTTCGTTTAAACGTTTGCGCCAGATAAAGTAAACTAGCAGCATGAATGGAATCAATACTAGTTCTACTGGATAAAACCATTCAATTGGCAACACGCTATCAACTACTCCGCTTAGCATTGGGCCAAAGGCCATCGCAATATCAAGTCCTAAATAAAAGGTAGAACTAGCTAAACCTTGTTCCTCAATCGGAGCAAGTAAAAGTGAGGTTGATTGTAAAACTGAATAAATAATGCCATAGCCCATTGCCATTCCGGCTGCGGCTAAAGCCATTTGCCAATTATTGTTCATCATGGCTAATAAGCTGATATATGCGATGGTTGAGATTAAGCTAATCCAGAACCAAACACCGAAGCGAACAGTATCAAAATAGCGTTTCAACCCAATTCTGATGATCAGTAGTACTACCGCATAGATCAAGAAATATGATCCTACGGCAACAGACAAATGCTTTTGTTCAACATAGGTAACGAGATCTGCCTGCGTTACAAAATATGGTGTCGCAAACAAGGTAGTCAAAATCGCTACCGGTAAAACATTCATCTGAATAATCTTGAAATGCTTTTTAGGCATCTTGCTTTTATCTTTAACAATTGGCTTGGCATGATCACCCACGAATTGGATAGCTATTACCATTAATAATGCGGAAATAGCCGAGGCGATCAAGCTCTCACGATAACCGATTTTTTGATAAATATTAATCGATAATGCCGGTGCCAAAGCCATGGCTAAGGCATTCATCAAACCATAAAAGCCCATTGCTTCGCCCACGTGCTGCCGCGGCACCAAGAATGCCAGCCAAGTTGTCATACAAACGGTACACAAAACATAGCCGGTACCATTGATCAATCTGAATAAAAGCAGCCAAGCACTTGATGGCGTAAAGACATAGCCTAATACACCAATTAAAATCAAAACACCGCCAATAAAAGATAGACGATATTTTGAAAACTTATCTGTCAAATTGCCGGCAATAGGACGCAAAAACATTGCGGCAACGCTCATAATACCTACGATTACACCGGCAAAGGCGCTGCTTGCCCCTAAGTTCTTGGCATAGCCATTGATCAACGGATTAACGAACATCGTACTAAACATAAAGAAGAAAGATGCTGCCATCACTAATACAACATCTTTGGTATAAATAGATTTTTTCTTTGCCAAATTAATGCCCCATTTCTTTAGCTTAATTATACAGCTATTGGCAAAACCTATTTAGCAATCTTCAAAACTACGGCAAAGACGATCAGTGAACCGATCCCCAGTACCACGGCCACTCTAAATAACAGCCAGAAACGGTTGTAAATTAGCGTTTCGTTTAAAAATACGACCATTGTTTGACCAACGGAATTTTTGAATTATCTAATCACCTACTTAACTAATTCATAACTACCACCTTTTTCGCCTTCAATTTTAACTGTGCTACCGATTGACCGCAGTTTTTGGCGTAAATATGAAACATAAACCCAGACTACTTCTGGATTGGCATCTTCGTCGTTCTTCCAAACGCGATTAAAAATATCTTCGGTTGATAATTCTTTATTTTCATTTAGCAAAAAAATAATTCATCAATTGGGTCTCAGGGCCGCTTAATTGAATGGAATTATGACTGGCCAGGTTTTGTTGTTCAACGTTAAGTGTTACATCCCCCACAGTCAATAAATTAGGAGTGTAATCATCTTCACGCCGTTCTTTTGAACACAATCTCGCAAGGAGCTCTTTTAATGAAAATGGCTTAGTAATATAGTCATCGGCACCACTATCAAGGCCAGTTACGCGGTCGTCAACTTCGGCTTTAGCAGTAAGCATCAAAATATAAGTCTTATCGTCGCTTTCTCTGATCTTCTTCAGCGCAGTAATACCGTCCATTACTGACGAATCGTCTGCATGCTTACATCACAAGAAGCAAGCTCGTCAGCTACTCGTTTTAATTCTTTCCCGATCATCTCAGGATTCTTGATCTGATGTTTATATTTCATTTCATGTTCAAGACTTGCCCAAGTGTCCATGGCAATTGTTCTTAGCTGCACTTCAACAAAATAATGACCTGGAATATTGCCGTCGACATCTTCGTCGGGTACGGTAATTTCAAGAATCATATGATATGAACGATAACCATTAGGCTTAGCGTTAGTAATATAGTCTTTTTCGTTATAAACACTTACATCATCCCAACTCTTGATGTAGTCAATACAGGTATAAATGTCATCAATAAAGTTGCAGACAACACGCAAGCCGATACTGTCACGCACTTCACGCAAAGCTGAGTGTGCAGTCAATGGATAATGCTTTCTGGTACATTTTTCTTCCATACTAGCTTCACTTTTTACACGGCCATTTAAGTGTTCATATAAACGCTCATGATGAACTTTTTGGTAATTGTTGTTTAATTCATCAAAACGATTCATTAACTGGTTAAGTATCTTATCCAGTGTGGGGGGCGTATTTTCCGTAAATATTCAACATTAGTCCCTTCTTTCGTTTCTATTATTATAACAATGCTTTTTGACTTTTCTTTGAAGGTCATGGGATTATTAGCAAATCATAATTTTCTATAAAAATCTTTTAAAATTGCAAAAAAGACACCTACCAATGAATTTCATTGATAGATGTCTTTTATTCTTCCTTCAATTCAAAATGGCTGCCATCTTCATCGACAATCCCGACTACTTCAGCATTTTTATCTCCAACTTTTATTGTCCCCATTTCAATTTTACCGTAACCTTTGGCCTCATCAGTTACTTCAATATCATCAAAAGCATTGCGCAATAATTTGCGAGCAATCTTACTTTTAATCCGGATTTCTTTTGACTTCTTGAAAGCTGCTTCTAGGGTATAACCTGCATCAAGTTGATCTTTAATAAAAGCAATCAAATAAACTGTTGCCAAACTGTAATGCCGCACGCCAGAACCTTCATTGATTGGCTTGATATAGCCCTTTTTCTCCCAATATCTAAGTTGACGCTGCGATACGCCCAAGCTACTACTTACGCCGCCTATACCTATTTCTAGATTTTCAAAAAGTTGATGTAATTTCTTTTTCTTTACATTTTCCATATTTATCAGCCTCACTAATTATCACAATTATATTAATTTATGTTAGATTTTTCAACTTTAGTGTCATATTTTATATCACTTGTGTAAGATTATTTGACATTGATATTTTTTAGGCTTATATTAGTTACCGAACTTAATTTTCACAAAAAGGGGGATGAACATGAATAAACAACCTGTCGATATTCATGAAAAACAATACAATCGTAACTTATTAGTACTAGTATTGATCATCGGCTCATTTTGTACGGTATTGAATGGTACGCTTTTGTCGACTGCTTTACCTTCTATTATGAGAGACTTCAAGATTAGTACTGCTACTGCAGAATGGCTTTCTACCGCTTTCTTATTAGTAAATGGAGTAATGATTCCTATTTCTGCTTGGCTAATCAATCGCTTCGGCTCTAGAAAGATGTACCTTTCAGCAATGTCAACCTTCTTCATTGGAACGGTAATTGCTGCTATTGCACCTAACTTTCAGACACTTTTAGCAGGGAGAATTATTCAAGGTCTAGGTGTTGGGGTCACCATGCCACTGCTTCAGACAATCATGCTTTCAATCTTCCCAGCTAATAAACGTGGGGCCGCCATGGGGACAGTCGGCATCGTTATTGGCCTAGCTCCAGCTATTGGGCCAACATTATCTGGTTGGGTAGTCGACAATCTTTCCTGGCGTTACCTTTTCAGCATTATTGCTCCAATTGCCGGAATTGTTGTAATCCTTGCTGCCTTTTTAGTAAAAGACGTTTTGCCAACTAAGGATGAAAAAATCGATGTCTTTTCAGTTGCCACTTCAACTATCGGATTCGGTTCTCTACTATATGGTTTTTCAGAAGCTGGTAATAAAGGTTGGACAAATCCTGAAATTTTAGCCTTTATTTTTGTCGGTATCATCTTCGTAATTCTCTTTGGTATTCGCCAATTGAAGATGGATGATCCATTTCTTGATATTAGAGTATTCAAGCACTTCGAATTCTCACTTGCAGCTATTCTTTCAGGTGTTACTAACCTAGCCATGGTTGGAATCGAAATGGTTTTACCACTTTATATTCAAAACTTACGTGGTGAATCCGCATTTCACTCTGGCTTGATCTTACTTCCAGGTGCTTTGATGATTGGGATTATGTCACCAATTACTGGTCGAATCTTTGACCGTTATGGTGCAAGAAAAATGGCCATTACTGGTATGACACTTCTTACTTTGGGAACTATTCCATTCGTATTTTTAACTGAGAATAGTTCATTCTTAATGATTATTATTCTCTACGCAATCAGAATGGTTGGTGTTGCTCTGGTTATGATGAACGTTACTACTTCAGGGATGAACTCACTTCCATTAGATAAGATTTCTCACGGTACCGCCGTTAACAACACCTTTAGACAGGTATTATCATCAATTGGTACTGCCATCCTTGTTTCAGTATTGACTACAACTACTAACAACAATATGCCAGCCAAGTCAATGCTTAAGACGTTGCCACTTCAATATAAGAATGGTGCAATTAATGCGACACTTGATGGTTTCCATGCCTCATTTGCAATTAGTATTCTCTTTGCATTGATCGCCTTAGTTCTTTCATTCTTCTTGAAGAAAGGTAATCGTGCACGTGAACGTTCAGAGAAGGTGGACAGCTAATGATTTTAATTATCTTAATTCTGGCAGTTATTGCCTTTATCTATTTCAATGTAATTCCTAAAAAGGGTTATATGCCGCTTGCGATTATTTCACTACTCATCGCAGTATTGAGTATCGCTGGAATCGTAGCTCACGACTACAATCACTATGGTATGAAGACCCAAACTACCACTGTTAAAAAAGAATTGGTCTCTTCTGCATCCCCGCAATTGCCAGTTCTTTTATACCAACCACTTGGCAACGGAACAGAAAAGATCTATTTATATAAAACAAAAAATACAGATAAGAAGCCTACCCCAATTAAAACCGATAAAACTCACGCTAGTATGAAAACTGCAGCTAAGCCTTCAATGACAATCAAGACTGAACGCTACGTTTTCAGTAATGGCTGGAATCAATTCTTGTTTGGCTGGTTTGGTCACAATAATGAGTTGAAGCATCGTGAATATACTTTCAATGTGCCAAAGAATTGGAAGGTTTTATCAATTAAGCAAGCTAAGTCTTTGCAAAAGCAAATGGCTAAACGTGCTGCGATGATGAAGAAGATGCAGCAAATGAAAAAGTAAACAAAAAGCGCAATTCCTTAATTGGAACTGCGTTTTTTCTATACTAAAATCTAAACTCTGGGCGATAGCCAACATAATGATGCAAAATGTAATCCGAATCTTCTTTCAAATCCCAATAATCGATATTATTAGAAAACATCATAACCATTTCATTCGTCTTGTAATTGGCGATAAAGCACGAATTATAACCGGGGATGCTTCCATTGCCGGCGATCACATTATTATGGAAATATACGCCGCCAAAATAGGCCACCTCTTCATGCTGAGTCTGTTGAGAAAAAGTATTGATCATCTTAGGATTCTTTAGCACTGCCTGATTTATAAACTTCCAATAATCTTTTGGCGACATGAATAAGTTCCCTGCTCCAAAATCAGAAGACGTAGTTACTGTAACCTCATGCCAGCTAACATTATTTACAGGCTGAGGCACCTCATTTTTGGCAACCTCAGAAAAATCCTTAATTTGTCTAAGGTTAAGTGGCCCCGCAAAAGTCTGTTTAATATAGTCATTGTAGCTCAGATGCGTCTGCTTACTAATAATTGCTGCTAACAATTCATAGTCAATGTCTTGATAGTCCCAAGTGTGAAGGTGGTCGTACTTGATGTGCTTGAGCATGAAAGCAGTCTGCTCCTTTTCACCTTTTAGCGGCGAATCTGGCCGGGCATTATTGATTAAACCACTAGTGTGGTTGAGCAATTCACGGATCGTTATGCTCTTGCTACCCGGAATTTGAGGATAATACGTGGATAGCGACGTATTCCAATCAAGTTGGCCTGTTTGCTTTAATTTATAAATAGCAGTCCCAGTCATGATCTTTTGCAAAGAAGCAATTGGAAACAAGCGATTAGCTTCGACCACTTTTTCTTTATTCGAAGTTTCTTTGTTTTTCACCACAATGGGCTGGCCTTTTTTACCGCTAACCAGGGCTACACCATTAATGTGGTGCCACTTCATGTAGTCCTTAACCTGCGTGGGCAGATCCTTATCGCCGGTCTCTACCCTATCTACCGACCAAGTAAACATCGCTGTGACAACACCGATTGCCAAAACTCCACCTAGTAAAATTCTTCTCCAACGCATGATTTTTTCCCTTCCACTCAAGACTTTTTATATGTTTAGTATACATGTTACAGCAAAAAAGAGGCGTTAATTTTTACTATTCCTGCAAAAATCACCCTCGAAAAAGCCAAAGTCAATCTATAAATAAGCCAAAAATAATATTTTAAAAATCCTTAACCTATTCTAATGAAAAAATTCGATACAATATTAATTATGATTTTTCAATTTAGAAAGTAGGAAAGAAAGTGACCCAAAAAAGAACCATCACTCCTTGGGCGATTTTTATTGTTTGTTGTTTTATTTCAATGATTGGCTTTGGCCTGATTGTTAATACTATCGGTCTGTTCTTTGGGCCGATCAGCCAGGAATTTCATGTTGGCCGTGCCAGCGTAGCCTTGATGACTACCCTGCAGAACGCAGCTGCCGCTATTTCACTCCTTTTTGCAGGAAAAGTTATGAAGAAAGTAAACTTGCGCTGGTTACTAACTGCATGTTTCAGCATAATCGCCATCAGTATGTTAACGCTCGCTGCAGCTCACAGCTTAACGCATTTTTACATTGCTTGGATCATCATTGGTATTTGTCAGCCAATTGCAATTACTTTATCAATTCCCGTTCTTTTAAGTAATTGGTTTAACCAAAAACTCGGCACAGTAATGGGTATTTCTCTGGGCCTATCCGCATTCGGTGGAACCATTTTTAACCCGATTATCGCAGATATCATTACTAAATTCGGCTGGCGTGGCGGCTTTATCGCAGAAGGTCTTTTAATTGGCTTGATCTTAATGCCACTTGCTGCTTCAATTAGACCTAACCCTGATGAAAAGCACCCTGCTTACGGCACAACGACTAAGTCTGAATCAAATTCACTCTTAAACGGCATTACGCTTAAAGAAGCACTTCATCAACCAGTCTTTTATGCCTTAGCCTTTGCGATGTTAGCTTTGCAATTTGTGTCGGGAAGCGTTCAGCATATTTCAGGCCACATTACTAATTTAGGAATTTCACCAGTTTTAGCAGCTAGTGTGGTATCCGGTGTAATGATCGGTGCAGCTGTTGGTAAAATTTCCATTGGTTATTTCTTGGATAAATTAAGTCCAATTTTAGTATTGTTGGTCTACTCACTTTTTGGCATTTTAGGTTGGAGTGGCCAAATCTTCTTGACTAACAGTACCCTGCTGACTATCTCTGCTTTTATTCTTGGTTTAGGTCAAGGTGTCTGCCTAGTGGCTCTACCATACTTAATCCAAAAGCAATTTGGCGAAAAAGATTATAGTAATATTTTGTCCGTAATCAACATGCTGGGAGCATTCGCAATGTCGCTATCAGTATACTTGGTAGGTCTATTCTTCGACCAAACCCATTCTTACAACTTAGGATGGACAATTAACGTAATTGCCTATATTCTAAGTTTTATCGCAATCTTTATTACCTTAAGAAAAAAGGCATAAAGCAAAATAAAACCGGCTCTTTGTCAAGAAATCCTGTTGATGAAGAGTAAAAGATGAGGATTAAGCAATTAATAAATTGCTTGGTCCTTTTTCTTTTACCTTGTTTTCTTCCAGTCTGATTCTAGTTAAGAGATTGATAAAGTTAGAGTAGCCGTAAGCAGTTCTTTCGATTTGTTTTATCTTGCGGTTGAAGCCTTCAAGACAGCCATTGGAATAAGGCAGACTAGCACCGTTTAGAACGGCTTTTAGGTTATGCCTAAAGGTTAAAAGCGTCCTATGCATTAAAGGTCCAACCTTTTCTTTAGAGTAGATAATATCTTTCATAGCTTTAGTATCATGTTTATCAAGTGCCGTAAAGAAGGCCTGCAACAAGTTGTAGGAGTTAGTTAGAGTGTCGTCACATTCAATGCCTTCCATGACGATTTGTTCTTGAGTAAGCGAATCCTTGTAATGCCAGTTGTATCTGGGATGGATCTTTTCTAAATCATCAAACTTCTTAAGATATAGCTTCCAAGGAGATTTGAGCAGGCTGTATTCTCTGGTGCCTTTTTTAAACTTCTTCATAACAGCTACTCTTAGAGAATTTAGAGATCTGGTAAGCATCTGCACCATGTGGAAGCGATCAATAATAATCTGAGCATTAGGGAAACATGCTCTAATGATATCCTGGTAATAGAAGTTAAGATCCATGGTTACGGTCTTAACGTTGGCTCTAGCTTTAAGCGTAAACTTGCCAAAGTATTTTAATAGATTCTTCTTGAACCTGTTTCTGAGAATTTGGACTATTTCATGAGTTTCGCCATTTAAGCAGATGAAATGAAGCTTTCTGTCTACTCCCTTAAACTCATCAAAAGCAAGATGTTCAGGCAAATAGTCATAGCCGTCAATAAAGCGATGAGAGCAAGATGCCAGTATCCTTTGAACTGTATTAACTGAAGTATTGTGCTCGCGAGCAATACTAGTCATTGAACGATTCTCAGTTAAGGCACTAAGGATCTTTAGCTTTGAAGTGTTAGAGATACAGCAGTATTTATTTATCAAGCTGCTTTGAGCCATGGATCTTTTACTGCAGTCACGACATTTGATCCTTTGCTTAAACAGCCTAATTAAGACAGGGACACTAGCATTAGCAGTGATATAGCGTACGTGGATCTTAAGGTGTCCGTTATGAATCAGACTAACTGAGCCACAAAAAGGACAGGCTGGCTGAATGAGCTCAGCCTCATAAATTTTATATTTAGTACCATTGAGAATTTTATAAAAATAATCTTTAAAAACAATGTTGTGGTCTTCAATATCAAGAGCAAATCTAATATAATCATTTAAAGAGGACATAGTAATACCTTTGTTTATACAAATAAAATGTTTGGTGAAACGGATTTTTTATTGTAGAGAGGACTATGTCCTCCTTTTTTATGCAATAAAAAATCCTGTTAACAGCTTATCACAGAGATAAGTCATCAACAGGAAAAATTATAGAGCCGCAATTTCTATTGATTTTGCGTGCGTTTCTGGCATGCTTTAAATTATCTAAAAAAGAGAGGAAGATCCTAATGAGTAAGACTGAAACCAAATCAGAAGAAAATTTTTCGAAGCCGTTCCGCCTGAAGCCTCAAACAAGTCAATATCTAATCAAGGCGTACGTTATTGCGATAGAATGTTTGTCCTAGAAAAAGACTGGGAAAACCTATCTAACCAAAACATGAAGCAATGACTCTGCCTAATTCCAATTTTGGAAAAGCCTTTGCTTACTCTTTGAATCATGAAACAACATTCAAACATGTTTTATTAAATGGCCGACTGGCTTTATCCAACAATTTAGCTGAACGGGCAATCAAGACGCTAGTAATTGAGCGCAAGAACTGGTTATTCTCTCAGAGTTTTAACGGAGCCAAATCTTCTATCATTCTTAGTTTGATTGAAACTGCAAAACGCAATGATTTAGATCCAGAAAAATATCTAAAATATTTACTCTATAAATTACCCAATGAATCAACTCTGACAGACAAAGCAAGTGCAGGCAAGCTGCAGATAAATTTTGAATATAATTAACCCCATCTTAGCACAAGCCAAGGTGGGGTTTTAAGATACTTCAGTTTTTTACGCTTACAATCAAACTTTTTCAGAAATATAAATTTCTTGAACATAAAATAAAAAAACAATGCGATGTGCATTGTCTTTCAATAAGGTTATAGATTATGATCTTTTGAGGAAATAATATTGAGGAATAATTTTAAAAGTTGATTTGCTTACCGTTAACCAAAACTCGGTAAGTTAAATCTTGTAAGTTAGCAGTTAATTCTGCTTTATGATTTTGTTCTTTATTGATTCGAACAATACGGATTTCATTTTCATTAGGGGTAGAGATATCAATAGAACCATTTAAGTCAAATGCAGGTTCATTGTTTCTAAAAGTAAAGAGTTTCAACAAAGAGGCAACAACAGGTCGCTGAATTTCTTGTTCAACTTCAGCTTTGCTGTAATAATGACGATTAATATTTCTCCCCTCTTTAGTTTCTTCAAGGAGCTTAATATCATTCTTGCCAGCTAACATCCCAACATAGTAGACCTGCGGAATCCCAGGAGCAAAGACCTGCAATAATCTTGCCATGAAGTATCTTTTATCGTCATCGCCCAAAGCTGAATAAAAAGTAGTATTAATTTGATAAATATCCAAATTATGATATTCAGCACTTGAATACTTTTTCTTAACATTAGCACCAACTTTATAAAGTTCTTGGCTGGTGTAATCGATCTCTTCTGGAGAAAGAATATCACGCGCATCAACTACGCCAATACCATCATGAGTATCCAAGGTAGTGAATTGCTTCATTGGACATTTCTTTAGCCAAGCTGCCAAACGATTGGATTTACCTGAATAAAGCGAATACAGAGTAACCATTGGTAGAGCAAAGTCGTAGATAAAGTAGCCATGTTTTGAAATCTTGAACGGCATAGAATAATGCTCATGGATTTCTGGCAAAATCGTTGCTCCTTCATCAGCAATATCATCTTGTACCTGCTTAAGCAAATTCCAGATTTCTGGTTCTACAAAGAAGTCATTAGTATCTAGTTTTTTCACAGCATAAGCAAAAGCATCCAAACGAATGATATCTGCTCCGTGCTTAATTAGGCTAACCAACGTGTCCTTAATAAACTTTTGCGTAACCTTTTTGCGCACATCTAAATCGATTTGCTCAGGACCAAAAGTATTCCAAAGTTTCTCCTTGCTCCCATCCGCAAAAGTAATCTCTTGATATGGAGCACGATCTTTACGCTTATAGATTAGGTCCACATCTTCTTTAGTTGGACGACCTTTTGGCCAAAACTTATCCCAACTTAAGAACATATCTGCATATGAACTTTTGTCCTTATTCTTTTGAAAATCTTCGTAATACTTAGAATGCCTAGAAATATGATTGATCATAAAATCAAACATCAGGTAGTATTTTTCGCCCAACTTTTCAACATCTGACCAATTACCAAATTTAGGATCAACTGTCGTGTAATCAGTTGGTGCAAATCCGCGATCACCCGTTGAAGGAAAGAATGGCAGTAAATGAATGCCGCCAACCGCACCTTTTAAATCATTTTCTAAGACTTCGCTTAATTCTTGCAAGTTTTTACCTAAACTATCTGGATAGGTAATCAACATTACTTTATTTTTAATTGGCATCAAAACACCTCTAGTCTATTGCTTTAAAGTGATACATATAGGCTTTAAAATCACCATGTTCAGCAGGATCATAAAAGCCCATATTCATCAGTTCATCGCCACCATAAACTTCGTGAGTAGCTTGATCCTCGTATTGTTTATTTGGATCAAGACCAACTAATTTCGTCTTAGTTAAATATGGTTGAGCTGAAGCCATTACTTTAACTGTGCTTAATACGACTTCATTTTGATCGGCTGAAACAGTCTCCCAAGCACACTGATTGCTAGTTTGGGCTGACTTCAAACGATAGAACTTGCCAAACTGGGTCACTTTTCTAATCTTTTTGTATGCAGCAACCTGCTTAGCCACCGCCTTACGGTCTTCCTGACTTAACTTGGTTAAATCAAGTTCATAACCAAGATCGCCCCACATTGCTACTATGCCACGAGTATTGAATGGAGTAATTCGACCGTTTTGTTCATTAGGACTAACTGATACGTGTGAAGTCATCATTGATTGCGGATAAACCAAACTCGTACCATATTGGATGGTTAAACGATCAATGGCATCACTATCATCACTAGCCCAAATTTGTGGATCATAGTAAGCCATACCGGCATCAAAACGACCACCGCCACCAGAACAGCCTTCAATTAAGAGGTCTGGATGGGCATCAACAATTCTTTCTAATAAGTCATAAAGACCTAGAACATAGCGATGGTAAACTTCTCCTTGTCGATTTGCTGGTAAATCTGCTTCATAGATATCAGACAAGTGACGATTCATATCCCATTTGATGTAATCAATCTTGCCTGATCCTAAAATCTTTTCCATCTGATCAAAAATATTGTCACGTACTTCTTTACGACCTAAATCAAGCAAGTACTGATTTCTTGATGGACTAGGATTTCTACCAGGAACATGCATCAAATAATCAGGATGATTCTTGTAAAGATCAGAATCATATGAAATCATTTCAGGTTCAAACCACAAACCAAATTTCAAGCCTTGCTCATGAACATAGTCAGCAAAATGATCTAATCCTCTAGGGAACTTTTTTGCAAATACTTTCCAGTCACCTAATGAAGAATTGTCATCATCACGGTGACCGAACCAACCATCATCAAGAACGAACATCTCAATACCTAATTTCTTGGCATCATCAACAATGGTCTTTAATTTATCTTCATTGAAATCAAAGTAAGTCGCTTCCCAGTTGTTCACTACAATTGGACGAACTTCATTCTTATACTTGCTACGAATAATTCGATCATGAATCAAGCTGTGACAGGCTTGACTCATTTTATTTAAACCCTGATCTGAGTAAACCATTAAGACTTCAGGGGTTTGAAATGCATCCCCTGCATTCAACTGCCAATTAAAGTTAAAGTCGTTAATTCCAACGTTTACATGAGTTTGCGCAAATTGATCTTTTTCGACTTCAAACTTATGGTTGCCAGAATATACAAAGGCAAAACCATATGCATCCCCTTGGAATTCATTTGTATCAGGATCAACTAGAGCCAAGAATGGATTCATTTGATGACTTGAAGTACCACGATGACTTGAAAAGACATGGATACCTTGATGAATCTTACTACGATCCACTCTTCGTTCATGCGCATGTGCACCAGGCAAAGTAATTGATTCAAAATTACGATCTACAAAATCGATTTGCATAGAAGCAACTTTTTCAAGATTGACCGTTTCCTTTCCTGCATTCTTCACTTTTACTGAGCGAACAATAACTGGACGATCACGATAAATAGTGTAGAGCAAATCAAAGTTCAACTCGGTTTTCTTATCTTCAAGAGTAATAATTAGAGTTTGAGCTTCACTTTTATCTATTACCCATGAATGAGGTAATCCCTTTAAATCCGGCTTACCATCTTCAATACGATAATCTTTATAAGTTAAAAACAAAGCATTAGAGCCATCAGCTTGCCGTACTACAGCAGCTGGAGTGTGATAATCCATTTCACCAGCTGTACTATATTCTTTAGGCAAAGAATCTCTTGAAAAAGTTCTATCAAGTGATCCTGGCAAATTACCAGAGAAGCCGCGATCAAGACGTGGATAGCGCAATTGATTATGGTAATTATTCACTTTTTTGCCAAAATACAAATGGCAAAGCGTACCACCATCTTCTACTGAAAGAAGGTATGAAATTTGATCATTGTGTAGATGAAAAACTTTGTTTTTTTCATCAAAAGTGATTAATTCTTGTTTCATTTAACAAATTCCTTAATTTTCAACTGCCTTTTGTGTTTCAGGATCAAAGAAGTGAGATTTATTCATTTCAAAGCCCATTCTTACTTTGGCACCAGGCTTAGTAAAGTCACGAGCGTCTACCTTTGAAACTAATTCAGTATTGCCAACCTTGCAATAAAGTTGTGTCTCAGCACCCAAAAGTTCAGAAACAATAACTTTAGCTTCAACAACTTGCTCTGGGAAGCTTTCCAAGAAAACTTCCTCTGTGTGAACGTCTTCTGGACGAATACCAAAAATTAATTCTTTACCATTGTATCCTTTATCTTTCAAAACTTTTAATTTACCTTCTGGAACTGGAATTTCAAAGTCATGATTTTCCTTGTCGATAATCTTTCCATCCTTCAAGATAACATGAAAGAAGTTCATCGCAGGTGATCCAATAAAGCCAGCTACAAAGACATTTGCAGGCTTGTTATAAACTTCCATTGGCGTACCAACCTGTTGTTGAATACCATCTTTAATAATGACAATTCTATCGGCCAAAGTCATAGCTTCAGTTTGATCGTGAGTAACATAAATAGTAGTTGTCTTTAATTGTTGATGTAATTTAGCAATTTCTGCACGCATTGTAACACGTAATTTAGCGTCCAAGTTTGATAAAGGCTCATCCATTAGGAAGATAGGAGCGTCACGGACAATTGCACGTCCTAAGGCAACACGCTGACGTTGACCACCTGATAAAGCTGCAGGTTTACGTTTCAAATACTGAGAAAGGCCTAAAATTTTAGCAGCTTTTTGTACTTTTTTATCAATTTCTTCTTTAGGAACCTTTCTCAACTTTAACCCGAAGGCCATGTTGTCATAAACAGTCATGTGAGGATATAAAGCATAGTTTTGGAAGACCATCGCAATATCACGATTCTTAGGTGCAACGTTATTCATGACTTTTCCACCAATCTTCAATGTACCTTTGGAAATATCTTCCAAACCAGCGATCATTCTTAAAGTAGTTGACTTACCACAACCAGATGGACCGACAAAGACGATAAATTCTTCATCTTTGATATGCAAATTAAAGTCAGTTACCGAATAATTATCGTTACCTTCATATTTCTTATAAACATGATCTAAATCTACTTTAACCATTTTCTTCTCTACCTCATTTTTCTCAATTACTTAACAGCACCGTTACTCATACCGGCAATAATATTCTTTTGGAAGATCAAGTACACAATTGTAATTGTGATAATTCCAACTACGTATGATGCGAAACTTGGACCATAGTCATTGAAGTATTGACCAGTGTAGTTGTATTGGAACAATGGCAAAGTCCACATCTTTGAATCTCTGTTTAAAATCAACAATGGAAGCATAAAGTCGTTCCAGAACCACATAGCATTAATAATTAATGTGGTTGCGTGCATTGGCTTTAACAATGGGAAGATAATTTTAAAGTAAGCAGTAATCTTGTTAGCACCATCAATTTCAGCAGCTTCATCCAAACTTTCAGGAACACTTTGTTTAATATAGCTGACATAAAGGAATAATGTTTGAGGTACAGCGTAAGTTAAGTACAAGATGATTAGTCCCCACATGTTAGCCAAACCAAGTTTACTCATCATTACTGTAATCGGAATCATAATTACTTGGAAAGGAACAAAGATACCAATGATTAATAAGATGTACATCCAGTTATATGCTGTCTTTTTGGACATATTTCTTGCAATGGAATATGCGGCCATTGGTACAAAAATCATTACCAAAACAATAGATAAAACTGTAATGATAATTGAATTACCGAAATAATTCATTACTCCATCGGCAAATAATCTTTGGAAGTTTGCAGTGGTCCAAGGATTAGGCCATGCAAAGAAATGTTCCATGATTTGTTTAGTTGTCTTAAATGAACTTAAAAATGTGTAAAGCAATGGGATTAAAATTAAGATTCCACCGACAATCAGTAAAGCATAATCCCAAAATTTCTTTGATTTGTTTTCTTTTTCCATTTTGTATACTCCTCTTATTCAACGGCATACTTATTAGAAATTTTGATTTGGATAAATGAAACAATTGCAATCAAAAGGAATAATACAATCGCAATCGCGTTAGCATAACCAAATGAGTTATTGTTAAAGGCATAGTTGTAAACCAACAAACCTAATGAAGTAGTTGAGTTGTTTGGTCCACCAGCCGTCATTGCAAAGATTTGGTCGAAAGCAGTTAAACCTGATTTAAGAGCTAAGATAAAGACCATTGAAATACTTGGCAACATGTAAGGAAGTTCAATCTTCCAAAAGATTTGACGACTATTAGCACCATCGACTTCAGCAGCTTCTTTGATTTCATCTGGAATACTTTGTAAACCAGCTAAGAAAATAATGATCGGCATAGCTACACCTTGCCAGAGGAGGACGAAAATTGTGGCAATTATTGCTCCAGTGTCGGTACCTAATAAACTCGTTTGCAACCATGAAATATGAAGTGCATTACCAATTGCTGGTAAACCATAGTTGAAAACTTGTTTAAAAATCAAAGCGACTGTCAAACCTGACAAAACAGCTGGGAAGAAGAACCATGCTCTAAAAAAGGTTTGACCTTTAATTTTAGAATTTAAAGCACGAGCTACCAGAATTCCGATAACGATTTCACCCACAATAAGGCAAATTGTTAATATTAAAGTAAAGCCAATCGACTTAGCAAAATTCTGATCCATAAACAATAACTTGTAATTGTTCAAACCCACAAATTTATAATTGTAAGTTAAACCAGTCCAATTAGTAAAACTGTAAAACGCACCTTGAACTAATGGAAAGTAGAAGAAGATGAGTTGCAGAATAATTGGGACGATTAAAAATAGCCAACCCCAATATTTTTCAATAAAATTTTTCGATTTCATTTTTGCATCCCCCTAATTAGAAGCCTTCATTGGGTTAAAGAACGTATTCATATCATTAACCATTTGTTGCTCATTACCTGTCATTAAGTAACTTGCAGTTAAATTAAAGAAATCATTTTCACTAGTCCAATCTTGAGCAAGCCAGACCATATCGTGTTTTGTAAACGCCAAGCTTGATAAACCACCTAATTGAGAATCAAACCCCTTTTGTTTAACTCCTTTAACCGCAACAGGACTTCCATCAACGTCATAATATTTCTGCATAGCAGCAGGTGTTGTCATATAAGCAACAAATTTTTCAGCGGCTTTCTTATGCTTACTCTTTGAAGAAATTGATAAAGCTAAGTCACCTGAACCAACTGTCATTTCATGACCTGCTTTAGCAGCTGGAAAAGCAAAAGTTCTAACTTCAAATTTAGGTTTTTGTTGATTAATCATTGGTAAAGCCCAAGAACCATTTGGCATAATTAGACTTTGACCATTTGCAAATGAAACAACTGCGTCATTGTAACTAGCACCACGCCAGTTGTTTTGAGCATTTTCACGCAATAAATTTAAACGCGTAAAGTCTTTTTGAATATACGGATTATTGACCTTAATTCCATTTGGAGCTGAGAATCTGAGCAACTTGTTAGCTTGTTTTGCACCGCCAGTTACAGTTGCAAGAGATAATTGATGATAACCATTCAAAGTCCATGGTTCAGTTCCAGCTACTGCAAATGGTGCTTTGCCACTAGCTTTGATTTTCTTAACGGCTTGTTTGAATTGATACCAAGTTTTTGGTGGCTTAATACCTAACTTCTTAAATTCAGTAGCGTTATAGAAGAAGCCATAGACATTTGCACTAAGTGGTGCATTATAGATTTTGCCGTTAACTTTGAATGAATCCGCATAATTATTCTTAATGTTTTTAATATATGGTGCATGAGTCATATCTTCAAAGTAACCAGCTTTAGCCCACTCTTGAAAATCAATATTTTGTGGGTAAATATTAATAACATCAGGAACATCACCAGATAAAATTCTAGTTTTCAGTACCGTACCAGCATTTGGTACATCTACTTCTTTTACATGAATGTCGGGATTCTTTTTTTCGAAATCCTTAATAATTGATTTTAAAGTACTCGACATTTCCTTCTTTTGGTTGAAATACTCAATAGTAACTTTTTTACCAGAATTACCATCTTGCTTTCCACAAGCTGAAAGACTTACAGCTGTTACAATAGCTGCTCCGATAAGAGCAGCCTTTTTCAGCCATCCTTTCATAAAAATCACTCCTCAACTATTAAATTCTCTTCTGAGCCTTCTTTAAGCGCTTTCATTATTTACATTCATTATTATTGCCTATTTAAATACAACAAACCATAGTATTTTAATTAGCATTATTGTCATTTTGTTCAATAATCATTTTGACATGTTAAATATAATGATTATCATATTGTCGCCATTCTGTAAGAATGATAAAATACATGTTAACATGTAAAAATATGATATTTTTTTCATAGGGGAATAAAAATGAAGGGTGAATACAAAACCTTAAATGATATTAGCCTGGAAAGTAACGTCCTATTTTTCGGCAAGGAAAGTTGCTTACCTAATTACTACTTCACAGGAAATAATGTGCGAAAGAATTACGTTATTCATTATATACTTAAAGGGAAAGGCGTTTTTTCCTCTGCAAATCATGAAGCAGTTCAACTTAAAGCTGGAGACATATTTATTCTGCCTAAGGGGGTTCCCTGTTTTTATCAAGCAGACGGTAAGGAGCCCTGGACATACTTTTGGATTGGATTATCCGGCTTAAAAATTGCCACAATGCTATCAGGCTCTATTTTGTCTTCAAAGCATTACTTAAGGCAGGTCGAAGATTCTAATTTTTGCAAAAGCTTAAATAAACTATTTGAAGCAGTTCATAATCCCAATGTGCTAACTAATGACTTACTAACAGAATCCCTAATTTATCAAACTTTTTATTATCTTGACACAGAATATCCCGTTAAGAAGAAAAAGCAACATATTGCTAATTCAGAACAATTAAAAATAGCATCCAAATATCTGCATGATAATTATGATGATCATAGTTGTTCAATAGCATCTCTATGTAATAAACTCGATGTTTCTCGCAGCTACTTATATAATCTTTTCAAAAATGGTTTTAATACTTCACCGCAAAAATTTTTAATAAAAATTCGTATGGAAGAAGCTAAAAATAGATTAAAAGATAGTCCAAGCTCCATTCGACAAATTGCTGAAGCAGTAGGTTATATTGATGAATTTACTTTTTCTAAGGCTTTTAAGAAATATTCTGGTTTTAGTCCTAAAATTTTTCGCCAAATGAATACTAAATAAAGTAAAAAGATGATTGCCCAACAAAGCAATCATCTTTTTCATCTATTCAAATTTCCAATTAACCGTTCGCCTAAATTTTTCATTTGGCAATAGCGTAATTTGACCTAAATCATTGCCTTCAGCTGGTGGACACTGTGCTTCAAAGGTCAATCCATCATATTGACCAATATTATCAGGGATACCTGTATGATCAAAATGATTACCAGCATAAAGCACTAACGCTGGAGCATTTGTTGTCACAGTCAATTTATGCTTATTTGAACTCAACACTGCATTCGGCAGATTGCCATTTAAAATGAACGGATGGTCCAATCCATTGCGAAGTTTGATTTGAGCATCATTACTATGTAATGCATCCCCTACTCGCTTCTTTTGTCTAAAATCGAAAGCTGTACCTTCAACTTCTTCCATGCCACGATCAGGTAAACCTGTTTTATCAACGGGCAAATAATAATCTGCGTTCATTTGTAAATCCAAATTCTCTGCTCGATCGCCTAAATTGAAATAAGTATGGTTAACCGGATTAAAAATAGTCAGTTTATCACTAACAGCTTCTAAGGAATAACGCAAAGTATTTTCATTATCAAGCTCATAGCAAGCATGTAATTTTAAATTACCAGGATAACCATTATGCCCATCTGGATCAAATAAAGTTAAATCTACTTTAGCAGTTTTTCCATCACAAGATGGCCTAAAATTCCAGACTTGCATATCAGTTCCAATACCGCCATGAATATGATTTTCACCATCATTTAACGGCAATTGATGGATTTCTTGACCATGCTTCCACTGTCCTTCACGCACGCGTCCTGCAATTCTACCAACGGTACCGCCTAAGAAGTTGCGTTCTCTTGAATAATCCGCTGGTGACTTCAATGATAAAATCATATTTTCACCGTTAAGCAGTACTTTTTCAAGAGTGGCACCATAATTCAGAACATTGACGACCATATCATGATCATTTGTCAATGTGATTTCGCATAGATCTCTACTATCTTTCCTGCCATATTTTATAAAAGAAGTTTTCATTATTAACCTACATTTTCAATCAAAGCTGAGATAAATTTATTCCAGCCCTTGTCGCCATCTGCATTGTTCTTAAACACACCGGCATCTTGCAAAACTTGCTCGAAGACTTCAACCAAAGCTTGTTCCATTACTTGATCGACATTGACCTTAGTAATGTTTTGTTCTGACTTAATTTTGTCAGCCCATTCTTTATGACTATCTGCCATGTCATTATCTTCGCCAAGCCAATATTTCTTAACTTCTTCAAGTTCACTCTTCAGACGACCTGGTAAAATAGCACGTCCCATAACTTCAATTAAGCCAATATTTTCTTTCTTAATGTGCCACAACTGCTTATGTGGGTGGAAAATACCCAGTGGATAATCCTTATTAGTATTATTATCACGCAGTACCAAATCAAGCACAAAGCTCTTACCCTCACGGTGCATAATAGGAGTTACCGTATGATGACGCGTTTCTCCATCAAAGGCCTTAATATCACGATCAGCATCAGAATATTGATCCCAGAACTTAATAATTTTGCTACCAAGATCGATTAGATCAAGTGAATTATTACTAATAAGGCGCAAATCACTCATTGGCCAATCAACAATCCCAGCCTTAACATCTGGATATTCATTAAAGACAACAGATTTCTTAATCTTAGCCTTCATCATTGGGAAGATATGACGACCACCTTGATAGTGCTCATGAGCTAACATTGAGCCACCAACAATTGGTAAATCAGCATTTGAACCAACAAAGTAATGTGGGAAAGTCTTTTCAATTTCAACTAAGTTAATCAAAGTCTGCTGGTTAATAACCATGGGAATATGCTTTTGATCCAAGAAAATACAATGCTCATTGAAGTAAGCATATGGTGAATATTGGAAGCCCCATGGACGACCAGCAATATTCATGCGAATAATTCTCAAATTAGAACGAGCATTTTTACCATAGCCACCTAAGTAGCCTTCGTTTTCAAGGCAAAGAGCACATTGTGGGTATTTTTTGCCAGTTGCATGAGCCGCTGCTGCGATAGCCTTTGGATCTTTTTCTGGCTTGGACAAATTAATCGTAATTTCAAGGCCATGTCCTTTTGAACTAGTTCCTGAAAAGACAACATTTCTGGCAATAGCTTCTTTTTTAACGTAATTGTTGTTTACACAAAGCTTATAAAACCAATCAGTTGCTTTTTCAGAAGATTTTTCCATTTTTTGCCAAAAGATACTATTAGTCTCAGATGGAGTTGGGGTTGCTAAATCATACAGTTGATCATTCAGCACCTCCCGTGAAGTTACATCATCTGGTATTTTTTTATTCTTCACAGCCATATCTACTAATTGTTTAACAGCTGCTTGATCATTTTCTTCTTCATCATGATCACCAACCAGACCACGAATTTTATTAATTACATAAACACGATCAAGCGGCTTATAAGCACCACTACTAATTACTTTATCCGCAAACTTTTCAATAACCTTCATAACTTACCCTTCCATAAAAATTATATTCTCTTAGTACCATCAACGATTTCTGCATCATAGAAGCTGGCATCGTAACCAATCTTGTCACGGTAGATATCGCCAACATTTTTCTTAAAGTTTTCTGCTTCTGACTTCTTAACAATAGCAATTGCACTACCACCGAAGCCACCACCAATCATTCTGGCACCCAATACACCCGGTTGCTTCCATGAAGCTTCAGCCAACGTATCGAGTTCCTTACCAGTTACTTCATAATCATAATGAAGTGAAACGTGTGAAGCATCGATCAAGCGACCCAACTTCTCTAAGTCGCCATCTTTCATCGCCTTAGTTGCGCGTAAGGTTCTTTGGTTTTCACTAACGGCATGACGAGCACGCTTTAGCTCTGTCTCATTATCAATAAGGTAAGAATATTCATCGAAAGTGTCGTTATCTAACTCACCTAGAGCCTTAATATCAAGCTTTTGTTGTAATTTCTTAACTGCATCATGACATTCGCGAACACGGTCATTATATGCAGAATCAGCTAAAGTATGTGGCTTGTTAGTGGCCATGATAATAATTTCATATTCGCCTAAAGCAAGTGGCATGTATTCATATTTCAAAGTATTGCAATCGAGGAAAATGGCACTATCTTTTTTGCCCATAATGCAGGCAAATTGATCCATAATACCTGAGTTCAAGCCTACAAATTCATTTTCGGTTCTTTGACCCATCTTAGCTAATACAATCCGATCAACATCTAAGTTGAATTCATCTTTTAAGATAATCCCCATCAACATTTCGATGGCAGCACTTGATGATAAGCCTGAGCCTGATGGTAAGTTGGCTTTAATATACAGGTTAAAGCCATGATTGATGTTGTCGTATTTTTCACGTAAGTAAGTGATCATGCCTTTAAAGTAGTTAGCCCAGAAACGATCGTCTTTCTCTACAGTGGTATCATCAATATCAAATTCGACGATATCACCATCAACGTTGCCTGAGAAAAGACGTACTTTCTTGTCTTCTCTTGGACCATAAACGCCATAAACACCTAAGCTAATTGCGGCTGGGAAAACATGCCCACCATTATAGTCGGTGTGTTCACCAATAACATTAATTCTACCAGGTGAGAAGAATACATCTTGTCCCTTTTCACCAAAAGTTTTTTCGTATTCAGTAAGTAATTCTTCTTTATTCATTGTTTGTACCTCAAATTTGTTCTGTAATCGTTTTCAGCTACTAGTATATCTCTTTCACTTCTTTTAGTAAATAGTTTACTAGAATAAATAAAATATTTAATTGTGTAAACTAAAAAAGCCAGACGTTTGTCTGCAAAAAAATAGCCTTAGCTCACACTAAGACCATTTTGCTTATTAAAATTTTCTTTCACCATGTTGGTAAACTTGTTTATCAGCTTGTTCAACTGCAGATACGTCTTTTAACGGATTATCTTTCAGCACTAAGAAATCTGCATATTTTCCCACTTCTAAAGAACCGTAATCTTGATCAATCTTAAGCAATTCCGCAGATCCTAAACCAGCCGCGCGAAGTGCTTGATAATTGCTAGCACCAGCTTTAGTTAATTCGGTAAGTTCTTTAGCCGTATCTTTAAGTGGATTCATGAAAGTACCAGCATCAGTCCCCAAAGCTAACTTCACACCTGATTCAATAGCTTTTCCTACATTCTTATAAAGAATGCTGCAAGTCCAAATAGAACATCTGTCATGTGCTCTGTTCTGTCTCTAAAGAATCTGGTTGAGACTTTCCAACTGAAAATTACTACTAGAAGCAAAGCGAAAAAAGTTTAATATACCGTTAGAAACCAGAGCGATTAAGTTTGCTACAGTTTGCAGTCCATGTTCAAATCCACTTTGTTGCCAAGTGGCCCAAGATCTGGAAATTAAACGACCTAATTGAAGGCCTTGCTTACCGAAAATTAAACCATTCATTGGGTCTACGATAACATTTTGGATCATGATACCGAAACCGGCAATCATAGTCGTAGCTGCGACAAGTACAAATGTATCACGCAGCGTTAATAATAATCTGTTACCGGAAATTTTTCCGGCAATTCTGATAAAACCATTTTCGAATTTCTTTAAACCCGATGGCGTCGAATTACTACTCATAATTTCTCTTCCTCTTTTTGTAACCGTCTACATCTCACTTGCAAATATATGATATAACATATTTTGTTTCGCCAATATTGTTATAACAAAAAGAGAGAATTTTTATTATTCTCTCCTTTTAAATCTATAAGACAATTGCTTCTTTTATTGGAAATTTTGTTATGCGGTAGTAATTTTCGTTCCATTCAAACGCCCGGCCACCACTCAAATATGATTTTTGCAGCCATTTAAGTAGTGGTTCCTTTTCTTCAACTTGACCTTTTACTTCGCCTTTAGCAATCAAATCCTGATCAGCATGAACTAAGGAAATTTCGCGTGTTGCCTTAGCAGGCTTAACATTAAGTTTTTGTTGGATTAATGGATATATTGAAGATTCCATCATTTCCATATTCAGGCCTTGGATTACTTTGGCAGGAATACAAATTTTTTGAAAAGCCACAAGTTGCTTATTTGTTGCCATTGCTGCTCGTTTTATTGTATAGACGATATCATATGGCTGAACATTTAAGGCCTCTGTGATTTTGGGAATTACTTGCCCTACTTCAAAATCAAGCAATTTCAATTTGTACTTTTTAAGATTGATTAATTCCAATTGATTTGAATCTACTGTAGCATGCATTAAATTACGTTCCTTTACATAAGTGCCTGCACCTTGGATTCGATAGATTAGGCCTTGGCTTTCCAACTCTTTCATAGCATTACGAACCGTGATGCGACTAACATTGTATGATCTTGCTAGATCGAATTCTTTAGGAAGTTGATCATTAGGTTGATATTCCCCTGTTTCAATCTCATGTTTTAGCTTTTCTGCGATTGCTAAATATTTAGGCGTATCTTTCATATTCTCATCTCAATTCATGATTTGTTATATCATATTTGTTCTAACATGCTTGCTTTTTTAGTGCATGTATATCATACCGTATGATTTTTATTTTGGTTTACAAAAAAAGAGATAGGTATTCAACCTATCCCTAATCTTCGTTTTTTCTTATCTATTCATAATCAACCAATTCTGAAAGTGGATATCTAGTAGTTGTAATCTCATCTTCAGCCTTTGGATCGTGCTTACCAATCGCAATTGCCATTACTGGTACGTATTGCTTAGGATCAAGTCCCATCACAGCTGCAGCCTTTTTAGCATCATAACCTGCCATCGCATTAGTGTCATAACCGTGCTCACGTGCAATCAACATAAATTGCATGGCAGCAAGTGAAGAATCAACCATCGAATCCGCAACTAACATTTCTTGTGGTGCCTTTTCATATAGTGGCATGAATGTATTTAAAGCCGCTTCCATTGCTTCCTTAGTTACTTTCTTTTCTTCATACATGCTGTGCCATAATTGACTGTATTTCTTGAAAGCCAACGTGTTACCGAAGAATAGGACTATGGCAGAGCTATTATCAACTTGCGGGAAGTTAAAAGGCATAAAGTATTGGTGCAGCTTCTCTCTACCCTCTTTTGTATCAACAACTATGAATTTCCAGGCTTGCAAATTGCATGCGGAAGGAGCAGTGATAGTTTCTTCCAGCATTTCATTCATTTCTTCATGACTAATTTTTATGGATGGATCAAATCTACGTACTGAATGTCTTTCAGTCAAAATTTTATGAAAATCATTATTTACTATACTCATTTTTTAAGCCTCCATCGTTATTTCTTATCGCTTACATCACTAGTATAAAGCATATTAGTCATAATGTCGCTCTTATATTATTGAAAAAGGCGTAAACAAAAAGGAGAAATAACTCCCTTTATTATCTAGATATGTACTATAATAATATTAGATTTAGCTAATGAAAGTTAGCTATTTCGATGACTACATTTTCATCTCTTGATATAGTCACGTTTTTCCAAACATCCTAGTGCATTGGAAATAATGATGCACTTGTTCGCATATAAAAAGACCACTGATTTATTCCGTCAGTGGTCTTTTTATTTATACTATATTCACAGTTAAGTGATATTATTACACTTTAACCATTGTTTAAGCTCTATTCTTGAAATGTCATTCACCCTCTTGCCATCCAAGACAAGCGTGTCAGGCAACAATTTTCTCAACTCTTCCACGCCACGTTCTGGGGTACTCAAAAGGAAGTTAGAAATTTCGCCTAGCTTCCTTATTTATTGATCTAAATTCGAGGGAAAGAAATAGGATTTGTTACTTGGTTAAGACTTGGACATCCCATCCTTTCATTGAACTCTTAGCTTTTTCACCAGTTAAGATGTCTTCATAATCCGCAAATTTATTTGGCAATTCTCTTTCTTCATTGCTCATGTTAAGCACGAAGTAAAGTTCCTTGCCATCTGCTGTGACACGCTTAGTTACTTCAAGCTTATGGCTGTCGCAAACTAATGATTCAACTCCAGTTTCAAGCACAATGTGGTTGAATAATTGTGTCAAACCTTGGTGACCTAACTTGGTTCCAACGTACCAAGCCTTGCCTTTGCCATAATCATTTTCAGTAATCGCAGCAGTGCCAGTGTAAAACTCATCAGCGTAGCTTGCCAAAACTTTGGCCTTATTTGGATGAATCAAATCACACATTAAGTTAGTTTCGTATTCTTTGCCATCCATCGTGACTCTAACTTTTTGACCTGGAACCATTGCGTCGCTTTCTTCAACCCAGATGCCGGTGACATCTTTCAATGGACCAGGATAGCCGCCAAGGTAAACATTGTCAGTTGAATCAACCATACCTGACATGTAGGTAGTAACGAGGTGGCCGCCCTTTTCAACATAGCTGTTGATCTTTTCAGCTAAACCTGCTTTAACCATGTAAAGCACTGGTGCAACAACCAAGTCATAGTTACTAAAGTCATCATCAACACCAATAATATCGGTTGGAACGTTGCGTTCATAGAATTGACGGTAGTAGTCCAAAATTGAGTCAACATAATTCAAATCTTGAGTAATGCCATCAACATATTCGTATGACCAGAAGTTGCTCCAATCAAAGACAATTGCGACTTTAGCTTTAGTCTTTGAACCTAAGATTGTTGGACCAGCCTCCTTTAATTTTTGACCCAAATCAGCCAATTCACGGAAGGCTCTAGTATCAGTTCTTTGTGAATGAGCAATAATTGCACTGTGGAATTTTTCGGAACCACCCACTGCTTGCTTTAATTGGAAGAACTGAACAGTGTCAGCACCGTGAGCAACAGCTTGCAATTCAGTTGCGGCCATTTGACCTGGACGCTTTAGTGGACTGTAAGATTGCCAGTTAACTTGTGAAGGTGCAGATTCCATTAACATAAATGGTTGATGCTTAAGGCTTCTCATTAAGTCATACAAGAATGCTGGCTTATAAGCAGGTGCGTCGTAAGTTGGATAACTATCATAGGAAATAATATCTTGATCCTTGGCCCACTTTTGATAGTCAATCATCTTGTTTGGCAAACTGTGGAAGTTAGTAGTTACAGGCGTTTCAGGATCGTACTTCTTGATGACTGCCTTTTCCATCTTGAAGAGGTTTTGCAAACTTTCGGATTGAAAGCGCAAGTAGTCGATTGAAAGGCCAGCAACAATGGTTTCGCTACTTTCTGGACCCCATGCATCCCCCAACTCGTTTGGAACAACAATTTCATCCCAGTCATAAATCGTGTGACTCCAAACATTCATGTTCCAAGCCTTGTTGAGCGCACCCAAGGTCTTGTATTTGTTTCTCAACCAGTCACGGAAGGCGTTTTGACAATTTCCACAGTAGCAGTTGCCACCGTATTCATTGTTCACGTGCCAAACTACGATATGCGGATTGTCAGCGTAGTGCTGGGCCAATTTTTCAACTAACTCACTATCCAGTCTTTGATAATTCTTGCTGTTCGGACAGAAATTGTGCCGCTGACCAAAGACGTGACGTCTGCCTTGGTAATCAACTCTTGCGATATCCGGATACTTTTTAAACATCCAGGCTGGCATTGCGGCTGTTGCCGTACCCATCACAATATCAAAGTTTGCATCAGATAATTCTTGAACGATCTTGTCTAATTTAGAAAAGTCATAGACTCCTTCTCTAGGTTCAAGAACTGCCCAAGAGAAAATATTAATTGTAGCTGAATTTAAATCTACCTTTTTAAAGACCTTAATGTCTTCAGGCCATGTTTCTTCTGTCCATTGATCGGGATTGTAGTCTCCACCATAAAGAAAACGTGATAATGTTTTTGTCATTTTTAAACCTCTACTTAAACTGGACTTCTACTACTTCATCACCATGTTTTACAGTAGAACCTGCTTTAATTGGACTAGCTTTTTTAATTCTACCTGGTTGAGTGTAGAAGGTAATGATCGTATCCTTGTAGCCATTCTTAAGAGCCAAATCACGATCAAATTCTAGTAATAAGTTGCCCTTCTTGACTTCTTGTCCATCATCATAATACGTTTCAAAGCCTTCACCACGCAAGTTTACTGTTCCAAGACCAACGTGGACGACTACTTGCAAGCCATTATCTGAGACAATCTCAAAGGCATGTTTTGTACCAAAGGTAAATCTAATCTTACCATCGAATGGTGCATAGATCTTGCCGGCACTTGGTTTGATTGCGAATCCCTTTCCTGGGAATGGCTTACCATCTTCATCGACGACGGAGGACATTTGCATCAATTCACCATCGGCTGGTGCGTAAATCTTTTGCTGTCTAGGTCTTGCTTTAGTTGAATCGTCTTTTTCGATTTCACCTTCAGCAAGCTTAGTCTTCAATTCTTCAACAACTTCGGCGTGTTTCTTTTCGGTCAAAGTTACTTTGCTTAAGAACACGATGATTGAAATCACAGCTAAGATTAAAGGAATATATAAGGCCATTGTATTAAAAGTGCTGATATCGTGTGCTGTCATGTCGGCAGCAGTTGCGGCACCTGTCATACCAGCTGCAATAGCTACATATCCAACCAAAGCATTAGAAACAGCACCAGTGAATTTATCGATCATTGGACGAACTGCCAAAACGACGGCCTCGTTTCTTTGACCACTCTTAAGCTGACCATATTCAATCGCATCAGTTAAGGTCAAAACAGTTACTAATTGAGCAAAGTTGATGTTGAATAAAACCAACCCTAAGTCCATTAACACAACATTATCACGGCAGAAAATGAATAAAAGATAAGCACAGATCATGCAAGTTTGACCTGCAACAAACAACCATTTTCTTGGAATGTACTTATTTAAAACTGGGAATAATGGACTGACACAGAACCCTACAATCGTTGCAGCAAGCCCTACAACCCAGAATTCGCCTGGTTTACCAATGACAAACTTGTACAGGTAGAACAAAACCCCGTTAGTAATAGTGTTAGCAAGTGAAAAGACTAAGTAGGCTAAACTCGGCCAAAGAATTTGATCGTTATGGAAAATAGCACTGAAGACTTGAGGCAAAGTAGTTTTCTGCTTGGCTGAATTTCTAATTAAATTGTGCTTTTCTTTAGTACCTAAGCAAACAATAATGGCACAAACAATTGCTAAAGCTGAAATTACGGCCGCAAAGGCAAGCCAACCTGGTGCGCCTTCTTCATGTTTACCAGTAACTGCATAAGTTACCCCAGTTACTAAAGGAACAACGATAATTGCTAAACCGTTCCAACCAATAGCACCTGAAAAGGCACCCAATGAAGTATAAATACCACGCTCATGTGAATCTTCACTTAAAGCCGGAACCATACCCCAATATGAAACATCGGATAATGAATAGAAGATATCAAAACCGATGTAGATAATTACGAATAAGATAGCGAAAAGAATCCAATTGCTTTGCGCTAAGCCAAAGATTCCAGTAAATAAAATTAATAATAATACTGCACTAACTACAGTACCCAAGAAAATCCATGGCTTAAACTTTCCCCATCTTGTTTTGGTATTATCAACTATGTTGCCCAAGATTGGGTCAATAACCAACTCGACAATTCTAATCACTACCATTAAGCCAGTGATTAAACCAATCAATTTATCAGCTACTGATTGCTCTAGCCCACTGAACATTCCGCTAGTAATAAAAATAATAAAGTATGTACTCATTACACCATAAAAAGCTGCGTGTCCTAAGTTACCCAAACAGAATGATGCATAGGAAATTATTTGTTTCCCTGAATTTTTATGACCGTTTGTCATAATGCTTCCTCCTAAGTTGTTATCATATGTAATCGTTTACATATCTGAGTATACGCTTAGTATCAGCTTTGGTAAATTATTTATTTAATATAACTAAATTATTTACTTAGAGTAAAAATAAAAGAGAATGCCATGCTTCAGCATTCTCTTTTCAAAATATTCAAAATAAAAAATTTTTATTTACTAAAAAATTTACTCACCTTTAGGAATAAAGCTTTTACGTACCACCAAATTAGTGTTCATATTCACATCAATATGCGCACGATCAGGTTGAATAATTAAATTGGTCAGCATAGTTAAAGCTAAATCAATCATTTCCTGCTGGTTGATGTTGTAAGAAGAAAGTGGTGGTGATACGTACTTCACTACATTACTGTTATTGATACTCAAGATTGCGGTATCTTTTGGTACGTTAATATTTTCTTCGTTAAAGGCCTGTAAAACACCGACTGCTAATGTATCAGAAGCAATTAAAAATGCATCTGGTAAATTGTTCTTGTACTTCTTTATTACTTCTTTGCCAAGCTTATAACCATTTTCAACGCTGAATGGTCCTGAAACGAACATGTTGCTGTCATCTTTTCCGCGAACCTTCATATATTCCGCAAAGGCAACGGAACGTGGATCATTTTCTTGAATATGATCATGCTTAGGGCCAACACCACCAATAAAACCAATCGATTTATAGCCGTTTTTAATAAACAAATCAATAGCATTCTTAACTGTTAAAGTTAAGTTAGGTCTAATTGAATCAAACAATTCGGGAGCTGGATTAGTATCAACAAACACCCCATTTGGTAAAACTCCGTGCAATTTTACCAACTCAGCATTTTCGATTGGCGCTGCTCCTACTCCAATAAAACCTTGGAACAATGAGGCATTTTCAATTAAGTCAGCCGCATTATAAAAGGTCTTCATCTTCATTGAGTCTTGCTTAACGGTTTCTACCAAGGCTTGCTTTAAAGAAGTAAAATACTCATCTTGCAACTGTTCTTTATGGTTAGTGCGGTAAAGAAGCGCAATTGTAGGCTTAATTTCCTTTTCTTGGTGGTCTTTCCAATAACCAAGCTTATTAGCAATTTCTAAAATTTTATTTTTGGTATCTGCAGTAATCGATAAGTTAGGATCATTATTGAGCAACCGCGAAACAGTCGCAGAAGAATACCCCGACTTTTGTGCAATTTCTTTAATAGTTGTCATAATTATTTCTTTCCCATTTACTAATATTCCTGTCTTTATTATACAAAATATTTACTTAATTCCAATAAAGATCAATTTTATTCAAAAATATTTTTCTCGAGATTTGCCAATTAATTTATTTACTTTTAACTACTAAATTATTTATTTATTTGATAAATTATTTAATTTGGTGTTATAATAAAACCGTTTTCAAAATATATTTTTTATAGAAACTAGGTCTTATCATGAAAGCAAATATCAAATGGCTTGATGATCCTGAAGTCTTCAGGGTTAATCAACTGCCAGCTCACAGTGATCATCCCTTTTACAAAGATTACCGTGAATGGCAAAACCACAGCAGCAGTTTTAAACAAAGCCTCAACGGTGCCTGGCAGTTTCATTTTTCAAAAGATCCTCAAAGCCGGCCCATTGATTTTTACAAACGCAGCTTTGACTCATCTTCCTTTGACACCATCCCCGTTCCTAGTGAAATTGAATTAAACGGCTACGCCCAAAATCAATACACTAACATTCTGTATCCCTGGGAAAGCAAAATCTATCGTAAACCTGCCTACACCCTGGGGCGCGGCATCAAGGATGGCGATTTTAGTCAAGGAAAAGACAACACAGTCGGTTCATATCTTAAACACTTTGACTTAAATCCTGCTTTAGCCGGCCATGACATCCATATTCAATTCGAAGGCGTTGAACGGGCCATGTATGTCTATCTCAACGGTCACTTCATCGGTTACGCCGAAGATAGTTTCACACCTTCCGAATTCGATCTTACACCTTACATTCAAGCCAAAGACAACATCTTAGCTGTCGAAGTCTTCAAACACAGCACTGCTTCCTGGCTAGAAGATCAAGATATGTTTAGATTCTCCGGTATTTTTCGTTCAGTTGAACTCTTAGCCTTACCTCGGACTCATCTAATGGATCTAGATATCAAACCAACTGTTGTTAATGATTACCATGACGGTGTCTTCAACGCTAAATTGCACTTTATGGGTAAAACCAGTGGCAATGTCCACGTCTTAATTGAAGATATCGATGGCAAAACTTTGCTTAATAAAAAGTTGCCACTTAAATCAACCGTTGAAATTGAAAACGAAACTTTTGCCAATGTTCACCTATGGGACAACCATGATCCATACCTTTATCAATTAATCATTGAAGTTCATGACCAAGATGGCAAATTAGTTGAACTTATTCCTTACCAATTTGGTTTTAGAAAAATTGAAATTACCAAGGATCATGTGGTTTTGCTTAACGGCAAAAGATTAATCATCAACGGCGTTAATCGTCACGAATGGGATGCTAAGCGTGGCCGCAGTATTACCCTAGCCGACATGAAGCAAGACATTGCTACCTTTAAACACAACAACATCAATGCCGTTAGGACCTGCCACTATCCAAATCAAATTCCTTGGTACTATCTCTGCGATCAAAACGGAATTTACATGATGGCGGAAAACAATTTGGAGTCCCACGGCACCTGGCAAAAGTTAGGCCAAGTTGAAGCTACTTCCAATGTACCCGGCAGCATCCCCGAATGGCGTGAAGTCGTCGTCGACCGTGCCCGCAGCAATTATGAAACCTTCAAGAATCACACTGCAATCTTATTCTGGTCACTTGGCAACGAATCATATGCCGGCAGCAACATCGCCGCTATGAACAAACTCTACAAAGATCACGATTCTTCTCGCCTCACCCACTACGAAGGCGTCTTCCATGCACCAGAATTTAAGAAAGAAATCTCAGACCTTGAAAGCTGTATGTATTTGCCACCTAAAGAGGCAGAAGAATATCTCCAAAATCCTAAAAAGCCACTTGTCGAATGTGAATACATGCACGACATGGGCAACTCCGACGGTGGCATGGGCTCTTACATCAAGCTAATTGATAAATACCCTCAGTACATGGGCGGCTTTATCTGGGACTTCATCGATCAGGCTCTGCTTGTGCATGATCCAGTCAGCGGACAGGACGTATTGCGTTACGGTGGCGACTTTGATGATCGTCACTCCGATTATGAATTCTCTGGCGACGGTTTAATGTTCGCAGACCGCACACCCAAACCAGCAATGCAGGAGGTTAGATACTACTATGGCTTACACAAATAATTTACACGTCGTTTATGGCGAAGCTAGTTTAGGAGTCAATGGTCAAGATTTCGCCTATTTATTCAGCTACGAGCGTGGTGGCCTTGAATCTTTGAAAATCAAGGACAAAGAATGGCTTTACCGCACGCCTACACCAACTTTTTGGCGGGCGACCACCGATAACGATCGCGGTAGCGGCTTTAATCAAAAAGCAGCCCAATGGCTAGGAGCTGACATGTTCACTAAATGTGTGGGTATTCACGTTCAAGTTGACGATCACCAATTCGACGAATTGCCTGTCGCTCCAATCAATAATCAATTTAGCAATCAGGAATTTGCCCATGAAGTAAAAGTGGCTTTTGACTACGAAACTTTAACTACTCCTGCAACCAAAGTCAAAATCATTTATAATATTAATGATTTTGGTCACATGACGATTACCGTGCATTATTTTGGTAAAAAAGGCTTGCCGCCTTTGCCTGTTATCGGCATGAGATTCATTATGCCAACCAAGGCTAAAAGTTTTGACTATACTGGCTTGTCTGGTGAAACCTACCCTGACAGAATGGCTGGAGCAGAACGCGGAACTTTCCACATTGACGGCTTGCCAGTTACTAAGTATCTTGTACCACAGGAAAACGGGATGCACATGCAGACTAATGAATTAGTCATCACCCGTAATTCTACGCAAAACAATGCGGACAAAGATGGCGACTTTAGTTTAAAGATTACGCAAACTAAGCAGCCATTTAACTTCAGTTTACTGCCATACACTGCAGAAGAATTAGAAAATGCTACCCACATTGAAGAGTTGCCATTGGCTCGTAGAAGCGTATTGGTAATCGCGGGAGCTGTTCGCGGCGTAGGCGGCATCGACAGCTGGGGCTCCGATGTAGAAGAGCAATACCACATCGATCCTGAGCAAGATCATGAATTCTCATTTACGCTAAATTAATTTAAATAATTTATTTTAAATTAGTAAATAATTTACTAAATAATGTGTTATAATGTAATCGGTTTAAGTAAAGACTAATAACCAATTAAAAAAGGAGTAAACATGAAAGTTTTAGTTATCGGCGGTGCCGGCTATATCGGCTCACACGCTGTTCGTGAATTAGTTAAGGAAGGCAACGATGTCGTTGTTCTTGATGCTTTGTACACCGGTCACAGAAAAGCCGTTGATCCAAAGGCCAAGTTTTACCAAGGCGATATCGAAGATACCTTCTTAGTTTCAAAGATTTTACGTGATGAAAAGATCGATGCTGTTATGCACTTTGCTGCATATTCATTGGTACCTGAATCAGTTAAGAAACCACTTAAATACTACGACAACAACGTTGCTGGAATGATTTCACTTTTGAAAGCAATGAATGACGCTGGTACTAAGTACTTAGTCTTCTCAAGTTCAGCCGCTACCTACGGCATCCCTAAGAAGTTGCCAATCACAGAAGACACCCCACTTAACCCAATCAACCCATACGGTGAAACCAAGATGATGATGGAAAAGATCATGGCCTGGGCCGACAAGGCTGACGGCATTAAGTACACTGCTCTTCGCTACTTCAACGTTGCTGGTGCTTCAAGCGACGGTACAATCGGTGAAGACCACGCTCCAGAAACTCACCTTATTCCAAACATTTTGAAGAGTGCTATTTCTGGTGACGGCAAATTTACCATCTTCGGTGATGACTACAACACTAAAGACGGTACTAATGTCCGTGACTACGTTCAAGTTGAAGACTTAATTGACGCTCACATCTTAGCTTTAAAGCACATGATGAAGATCAACAAGTCAGATGTCTTTAACTTGGGTACTGCTCACGGCTACTCAAACCTTGAAATTTTGGAAAGTGCTAAGAAAGTTACCGGTATCGACATCCCTTACACTATGGGACCACGTCGTGGCGGTGACCCTGACTCACTTGTTGCCGACTCAACTAAGGCAAGAACCATTTTAGGCTGGAAGCCAAAGCACGAAAATGTCGACGATGTAATCGCAACCGCTTGGAAGTGGCACAAGAGTCACCCAAAGGGCTACGAAGACAAGTAATAAAGTAAATTAAAATAAAATCTAACCCTATCTTTTTAACTATTTCTATTTTCTATTATCAAAAAATGTCAGTCCGATCAACTGGCATTTTTCTTTGGCACAAATTCTGGTACAGACCATTATTAATTTGATATTGTATAATAGTTGATGAAACAAATGGCGGAATTTTATGACAAGTATTAGAGACATCGCTAAAATTGCAGGCGTATCCCCTGCTAGCGTTTCGCGAATCTTAAACAATGATCCCACCTTTCATATCAATGAAGCTGCACGTGGTCGTGTAATCGAGATTGCCAGAAAGCTAAATTACAACAAAGCCGACAAAAAGCGCGGACCAAAACAACCTGATTCTTCCCTTTCAATTGCTTTAGTTATGCGCTATGGCAACATGCGTGAATTCAACGACCCCTACTTTTTGAATATGCATAAAGGAATCAACGAAGAAGCTAAAAAATGGCACTTACGCGTTGAACAGCCTTGTCAACTACCCTAGACTGAAGTCTAGGGCTTAGTGGCTCGTTAGCTTGCGCTAACCAGAACACCAACTTGCTTAGATACGGACTTGACTTACCAGCAATAGTCTTTTGACTATTGCCGATAGAGGTCGTCTAATTACCAAAGCCTTTTAAGTCCGCGAGTTGCTAACGGACTGCAGCCTTTCTAGGCTGCTTTTTTTATTTTAGGATTTTGCCACTTGCTTAATCCTTTTTCTAAGATGTTCACTGCCGCATTCCAATCACGAATATGGGGCATTCGACAAATTGGACAAGTCCACTCCCGATCCTTAAGGGTTAATTTCTTATAACCATTTTGGCCCATAATACTGCCACAAGCATGACAGCGTTGAGTAGTATATTTAGGATCAATTGTTAGAAATTCTTTATCATATAGATCTGCCTTATAAGCCAGCATATTCAGGAAACTACGCCAGCCGACATCAGAAATTGATTGCGACAAGGCATGATTCTTTAACAGGTTTCTGTTTCTTAATTCCTCGGCAACTACTAAATCGTGGTTTTTGATTAGTGCAGTTGAGAGTACTTGTAAAAAGTCATTACGCTGTCTTCTGATCTTATCGTGCAGTTTAGCGACTATTAGACGCTGTTTTTGATAGTTTTTTGCTAGCCGCAAATTGCGTCCTTCTTTTTTAGCCCGGCGCTGCCTGCGAGACAAGACACGTTGGGCATGGGCCAGCTTCTTTTTGGTTTTGCGATAAAATCGTGGATTAGCGACCATTGCTCCGTTAGATGCTGTTAGGAAGTTATCTAAATTGAGGTCAATGCCAATTTGACTTTGTGTTTTAGGTAATTCCTTAACAAAAGCAGTATCGCTGCCTAATTGCATAGACAGGTAGAACTGGTCATCAGCGGTCTTTTTTATCGTGACTGTGCCAATTCTAGTTGGAATCTGCTTAAGCAAGCGCTCTTTAATCAGTTTTCTAAAACCAGCAATGCGGACAATTCCCAGCTTAGGCAATTTAATATGTTTAGCATCAATAAATCTAGCTGTCCCGTTATCAAGATAGGCTTCTGACTGCTTAGGATATTGACAGTTAGTCTGATATGACCAGTCGCTGCGTTTTTTGTGAAAAGTCGGAATTCCATGACCAATCTTGCGATAGTTGTTCCAAGCTTTCTGATAATTTTGAATGGCGTTGGCAATAGCGAGACTATCAATTTCTTTAACACGTAAAAAGCTATAAGCATCACGAATATTCTTAGGCTTAGCCAATAATTCATTATTGATGACTACTGTTTCTGCCCGTTGGACTTCATAGCTATTAAGAGCAGAAAAAGCAAAGGGCATTGCTCGATTTTGCCTATTAGCTAAATAACGTTTAGCATGGTAGCTAGTCCGATTGCGTCCTACATAAGAGTTGTAGACAAAGCGCTGCGCATCATAGTTTAGTTTAATGATTTTCTTTTGCTTAGAACTAGGATAAAAGCGAAGCTTAACGCCAAAATGATAGGCTAAACTGCTCATTCTTTTCATGTATTTCACCTCCTTGTATTATTGTAATTTGATTATAGCCTATGTATATCTTATAATCAACAATAAATAGATACATAGTAGACGAAAGGATATTAGACTATGAATAATAAGAGAGACAAAATAAAAGACGCAGGCTATGAAAAACACTACGTCTATAATGCACACTATCATCTAATTTGGTGTACTAAATACCGCAATCAAATCTTTACTAACCAAGACCTTGCACAAGAGATGCGAGACCTGCTCAAGCAAATAGCTGAAGATAATCAAATCACAATTGAAAAAATGGAGGTAATGCCTGAACATATTCATTTGCTTATCAGCTTTAGACCATCAAAATCAGGTTCAAGCGTAGTCAAAGCGCTCAAAGGAAGAAGTGCTTTCTTATTCTTCAAGAATCATCCCGAAATTAAGCAAAACAAGATGTGGGGCGGACATTTATGGTCACCCAGCTATTACTTTGGTAGCGTAGGGAATATGTCTAAAGAAGTAGTAGAAAAGTACATTAATGACCAGATCTACAATGCCGTTAGAGACGGTAAGCCCTATCCATCCCCGCATTAAAATACGGGGATTTTCGGGCGTTTTTCATTAAACTTGATGATCCAGATAAAAATTGGACCGATCTAGCAAATTACGGTGCAGTTATTATCGAAGGTGAAATGACTGCGGCCGCTATCGAGCAGATTCAAAACATCAATCCGAATGTGATTTTCCTAGACGTAAACACCAACATCCGCGGCTGTAATATCGTCCGCAATGACTTTATTTAAGCAACCACTAATATCTTAGATACGCTCTACGAAATGGGACATAGAAATATCGCCTATATTGGTGGTAAATCTGCCGTAGTTAACTTAGATGGCAAAATTGTTTTAAGAAAAGATGACTTACGTGAAGACGGCTATATCGCCTGGATGAAAATGCACAACTTAGATCAGTATTGTCACACCTTTACTGCTAACTGGTCAGCTGATGAAGCTTTAGAAGCTACCAATCAATTATTGCAGTTAAAAGATCGACCAACAGCGATCGTCGTTGCGAGTGATCCAATGGCCTTAGGCGTCTACAAGGCCTTAAATGATGCCAACGTTAATATTCCAAACGATATTTCAGTTGCCAGTTTCGACGATGTAGAAATTAATCGCTTTTTAACCCCTACTCTTTCAAGTATCGATATGAATAACGAAGGCAATGATAAAACCAATTCTAGCAGGCCACTTGGCACCAAAACTATCGTATATTCTACCAGCAAAAGCAGAAGTTACGGCGTTAATGACACCACCTGGAAGCATGATGATCCCAGTCAAAGCAACAGCTACAAGCAAACCATTTTGCAAGTATTGTGGCAAAAGATACATTGCTGACAAGATGATACCAAAATCAACCATGACCAAAATTGCACCCAAAGTGAAGTTGCGGTGTCTAAACACACGCATGTTTAAAACAGGTACTTCAAGTTTTAGTTGACGTCTTACATAGAAAATCAAGGCAATAACACCGACGATTAAACATGCAAAAACTGGTACTGAAAGCCAACCCATTTCGCTGGCAAAACTTGCACCCGCAATCAAACCTGAGAAGACAAAGATTGACAAAATGATGGACACAAAATCAACTTTTGGCTTAGTAATTTGACTAACATTCTTCAAACTAAATAAGCCGATTACAAAGGCAATGACTAAAAATACTACGAATGTAAAGAAGATGTCGCGCCATGAACCAGTAGCTAAAATTAAGCCCGTCAAAGTTGGACCAATTGCTGGAGCAAACATGATTACTAAAGCAAGAACACCATTAACTGTACCTAACTTATTTGGTGGGAAAATCAACATCGCAATAGTAAACATTAAAGGCAAAATGATTCCAGTTCCGATCCCCTGGATCATTCTACCAATTAAAACCATCGGAAAATTAACCCCGAATCCCGAAACTTTTGACATATTTTAAAACTTTTTGACGCAAAAAAAGACTGTTCGATTAATTGGCACTTCGAACAGTCCTCGTCAAAACTATAAAATTAGACCTAAAAAGAGGTCTAAACCACTTTTGTGTCAATCAAAACATAAAAAGAGGTTCACTCTTATAATAACATAATTACAAGCCCATATCATCGTCATCACCAATATCAGGCTTCTTTTTGCCGTCAGAATCATAACCATTTGTATGCAAGTGGGGTAAGAACTTGTGTTCTTTACTAGGTCACATTGTTTGCTGATGTGGCTCTTTTTTTGTGCCGTTCAAATATACAGTACGAAAATTCGTACTGTAGAAATTGCGGGCAAGACAATTTTTCTTTTAATTTTATTGAATATTATTGTTTTGTCACCGAAGTCCGATTTTGTCACCAAACTTGTCACCGAACTTTCGTGACTTCGGTGACAAAATTCAAATCAATTAAAACCCGTATAGAATAAAAATTGATCAAAATTATCGAAAAAGTGGCTCTTACAAATCCCGTCATATCAAGGTTTTGAAAATTAAGCATGATTTTAACGCTCATGGGACTCGAATTCCGAGCGCTTAAAAGCAAAAATTAAAAGTCCGTATAGAATAACTTGAAAATGGCACAAAAAAATAGTCACACATAATTGCGTGACTAAAATTTTTATTTTCGTTCGTATCCAACTCCGTGAAAGTTTTCCAAGAAGGTGTCAACTTCGTCTCCATTTGCTTGTGGAGAGCTGTATGTGATGGTATCACCATCTTCATCAACTCCATCAGAAGTTCAGCAAATTTATGCTAATGTAGCTAGATTTCGTGTTGAAAATGCTGAAAAGCAGGCTAAATCTGTTAAATCATTGCCACAACCAGGCGAAAGCAATGAGAAACTCGCTGTTCTCGGTTCACTTGTATCAGCAATCTCATTGGTAAGTTTAGGCTTATCTGAAATTGAACGGAATAAACGTCAGAATTAGTCTTATACTAATAGCGTCTTCCCATTTCGTATTTACACGAACAACAAAAAAGCTCCTAGAGTAAATTTCTAGGGGCTTTTTTATATATACTTCGATTTTAAGTCACCAAAAATAACTGGAGTAGTGACTTTTAGGGATAGAATGAGAGCTAATCTAAGCTCAAAAAGAATAATAGCAGAAATGAGAAACATTTTAGGTGTTTTCTGCAAATAATGTGACAATTATTCCATTTTGTCTGCTATTTGGTGACAAAATGCAAAAAGTGTTCATTGAATTAGCTAAAACTAATCATCCGTGCCAAATTCCTTGTCTACCAATTCTTGCAAAGTGCCGTTCTTAGAGGCTTCAGCACTGTCAGCTTGCTTAAAGTGATCAGTAGTAATGTCTTTTTTGATGCTGTCATCAAAGCCTAAGATGTAGTTTGTAGCAATTAGGTAGATAATTCTAGTTGGAGCTAAGCCAAAGACCTGATGGTTTAAAATATGGATCAATCTTTCGTGATCATTAGAGTAAAACTGCTTCATAACAGGGTTTGTATACAATCGTTTAACGATTTCAGTAATATATAAACCAGATTTCATGTACAGATCAGCAAATGTAGCATTAGGATCATCAAAGACGTTAGGATTTTCCTTTTCTAAGTCATCAACCATGTGTTTTACTACTTTTTTAGGCGTAAAGATTTGGTTTGTCTTTTGTGGTGGAATGTAATCAAAGATGTCTTCTTTATTGTTTTCATCAAAGTAGTTGCGAAGCTTGTATTTTAAGTCAAGGAATTGCTGAATGGAGTCATTAAAGACCACCTCATCAAACAAATGACCCTCAAAATGCTTCTTTTTGCCAGTTTCTTGATCAATGTAGTCACCACCATCACGTAAAAATCTAAATTGATCCTCAGTAATTCCAGTAACTTCCTTGAAAACGTCATCTTCCGTGTAATCATCAAAGTTTTGAAGTTTCAAGTTGCGATCACCATAAGCCATGATGAAACTAGGAATTGTACGAGCAAAGCCACGTAAATGTGCTCTAGCATCATCTTCAACTGAATGCTTTTCTTCCTCAGCTTTATGCTGTTCAACCTGTCTGATAGCTTCTTTTGGCGTATCCTGAATTGTTTGCTTTACATGATCAGTAACTTTTTGCTTAAAGTCATCGAAAATGTCAGACATCTTACTTTCATACTCATCTTGTGCTTGTTTTAAGTCATCGTCAGATTGAGCAAATTTTTGCTGTTCTTTCAATTCAGTTTCAGCCACTTTTTGTTGATCGTGGTATTGATCAGCAATTTGATCTAATTTCTTTTCACTTTCTTTATTGACTTGTTCTTGGAAGCGCTCAGTTTGCTTTTTAGTTAAACCATAATTGTCTTTAACCTTGCTCCAGACGTGGGTTTGCAGAGTATCAGTCAAACTGGTTTTCACTTGATCAATAACCTTTTGTGTTTCATCTTGATCAGTATTGTCAAAGCTATTATTAATTGCTTGATTCATTTCATCATTGGAATCTTGGTAAACTTTGTCACCAAATACTCCTTGTGTCTTACCAATGACAATTTCATCAGGAACATCAACGTTTCCCTCATCGTCAGTAGGCACATCATCTATGTCATCAATAGAATTGTCTTGTTTTTTGTTCTTCTCTTCCTTTGCTTTTACTAAATTGTCTAAAATGTCTTTAACCTCAGCAGGAGCACTGAAAATACGGCTAATGTTAGCAAATAAGAAGTTGCTCATAAAGCCATGCTTTACTACTTCATGTGACTTTAATTGACGTGGAATTGACATGATTTGCTTTGCATCCAGTTCAACCATTTTGCCGTCTTTGTCTTCGCCAATCACAGGGAAGAAGTTGAGCAGTTTTCTGATATTTTCAGCGTGTTCCTCAGCAGTACCACGTCCGCCAGCCGTTTTTGCATTTAAATTATTAGCAAAATCATCAAAGATTGTTAATGTTCTAGCAGGATCAAAGTCAAAGACATAGGCATTCTCTTTTTGGTAAATGTCCTTGCCACGTCTTACCTTATAAGGATTCTGTGCTCTAAAAGCCGATTGCATGTATTCGGCAGGACTCTTCATATTAGAAAGCATCAAGACACCCGTCCACGGCTTAACTGTCACGCCTGTTGTTAGCTGTCCTACGCTCAAAGTAATGGTTCGTGGGTGATCCTTAATTGCCTTGATAACACGGTCGTATGACTTCTCATTAGCTTGTGAGAGCTGATCGTCACTCAATTGGTCATCATCCAATTTTCCGTCCCCAGCTGCAACGATAATTTCATAATCCTTGAAAATAGGGTCTTTTTTAAGCTTCTTGGCTAAGGCTTTAGCACTGTCTACACGGTTTAAAAGCCAGAAGGTATGGGCTAATTCACCACGCAATTTAGGCGTTGAAAAGGGGTATTTTTCGTTATGTGTCAAGGCATACAAGAAGCGGTCAACATCTTCGTCATAAACAAATTTTCCACTAGCATTTACACGGAAAAATTCGTTTAAATCAAAGGCAGGATCAACCTGTTCATCATCAGACAAGTCAACTTTTTGCTCAGCCTTTTCTTGCATGATTTTAGACATTTGATAGGTGAACATGTTAAGACGTGGCATCACAGCATAAGGATTGCTACCATTATTATCGTCCCAATTTTCTTTGGCGGTTTGTTCATCAGCATATGACCAGTTAAAAATTTGATCACGTGCAAATTTGCCTTCTGCTAAGGCTTTAAACGGTGTGCCAGTAAGGTACAAGGTATAGTCACGATTAATCTTGTCAAAAGCCTTGTCAGTCTTGTAGGTTTCAACACCTTCATGAGCTTCATCAACTACCAGCAAGTCCCAATTAAGGTCTTTAATCCACTCTAATTTGTTATAATCGCCACCAAAATAGACAGAACCTTTCAAGCCTTGCAAGCTTTCAAAGGCTATCTGCTTGTAATCGCCATGATCCAACAGTTTGATAAACTCTTGACGTGAGAGGACATCTTTTTTCTTCAAAGAATCATTATCCGTAACAAATTTATAATCAGTTTGCCAGCCGATAAACTTTTTGAAGTCATCAAACCATGAATTAGCAATGCTAGGACGATTTGTCACAACAAGAACGTTTTGCAGGTTCATTCTACGTACTAAATCATAGGTAGACAGGGTTTTACCAAATCTCGGCTTAGCATTCCACAAGAACTCAGCGCCCTTGCCATGTTGTTTAAAGTAATTTTCTGTCATTTCTACGGCTTCATTTTGTTCTTTACGTAAAGTGTAAGTAGAAATATCGTCGTCTTTTGTTTGTACATCGCCATAATCACGACTTGCAAACTTGTTAAACAATTCATGTGATTGTGTTCCATCGGTATGAAACCATTCAGTCTTTGGCTTGCGTTCAATATGTGCCTTATCGGTCAAATAGCGGTGAAAGTCATGATCAGTAAAAGATTTGCCTGAGCCATCTTTGTAAATTGCATTATCTTGCCACAACAGTTTGATCTCAACATCAGCAGTATGAACTTGCTCATTGATACGGTCTTTTACTGTTTGTTTATCGGTATATCCAATTTTTGTCCAACCTTGATGGCGTTTAATTTCAGGAGTGGTATAAGCATAAATCATGGGAATAATCGGACGATAAGGTTCAATTTTCGTATTAGCCATGTTTATTTCATCTCCTTTACGTTCTTTTCAATAAAGGCAATTTCATCATCAGACAAACCGTATTTATCATAAAGTTGCTGATCAATTTCAGGAATTGATTTAGTCCAGTCAATGTCTGATTTATTTGTAAAATCTTGCATAGGAACGTTTTTCCACGTTCCTTTATTATTATCTTGAGTAACTTTTAGAGTTCCCAGCATTACTCGGCAGAATTTACTTTTAATATATTTAAATGCATTTTGGCAATCTACTTTTTGATTGAATTTTCCAATACCTATAAATGATTGAGTATATCCTACCAAAGGTGTACCAATTATGGGTGTACTTAAAGTTTCACCCAAGGCACCACTGCCATTAGATTTTGGAATTAATACTTTATAATAGTGTAATGTGCTATCGGGATCTAAATATTTTTCATCAATGAATCTATATTTTCGAACATTATTGTTAATAATTCCCCATATCTTTACAGTACTTGTAATATTTTCTTTATCTGTAAAAACAGGTAAAGAAAATATTGAAGTAGTTAAACGCTTTTCACGACCATTACTTCCTACTTGCTTTTTTAATTCTGGATGATCGGCATAAAGGGCAGTCAAATTAAATTTATTTTGTAAATGGATGATAGTAGTTATTGATTTGAATGCAATGGATTTGTTGTAAACGACTTTATGTAAAATGCTTCTAAGTTCTTTAAATGGTGAAAATGTACCAATAGGACCTAACGTTTTCGATATATCACGATATGTAACACAAACTCCACCCTTTATATCAGTATTTTCAAATACTTTACTGCTATCTTGCTCAAAAAATAGAACTTTTAAATGTGGGTCATTTAGCATCTTCTCATTCCATTTTTTCGGAGTTTTTCCAGCATTTGATAAGAATTTAGCAGGAGTAATTAATTCAACCTTTTCACCAATTTTATAGGCATCATCCATAAAATAAGGAAAGATTTGTTTATCACTTGTTCCTTTTAAAGTTTCTTGAAATGGTGGATTGCCGATAATCACGTTAAATTTACGGTGTTTTTCTTCTGTCATTTTGTCTTTTCTTCCTTATAAATCTTTGTTACAGGACAAATTGCATACTCAATAGTTGAAGATGGTTCATCTAAATTGAATAAATCTAATTGCTCATCAATTTGACCATCATCAGAGCCGTCAAACATTGATTTAAAGGTGAAAGTCTCACGTTTTACTTGTTTTGAATCAATTGGTTGCCAATCACTGAACTGGATTAATTGTTTAGCATGATTTTTGTAAGTTAGTGAATTGCCCTGCACAATGTTAGTTTTTATCACGAAATTAGTAGCTTTATAGAAGTCAGTACGATGACTTAGGTCTTTTTTTAAAACCCTCTTGTAATGCTTTGCTACTACTTCTATCATTCTACTACGTGCAACTATCAGGTTATCTTGTAAAAGTTCAATCCCATAGACGCTCATCAAAGCCCACAGTGCATTTACCGTCCAATTGGTTTTACTGGAAATCTTGTCTACGTAATCCAGCTTTTGGTTCAGGATTTCTACTAGAAAAGCCCCTTCACCAGCACTAGGTTCAAGAAAAGTAGCATGAAGATCATTGAGTTTTGCTTTAATTGACGGTTCAGCAAGCATCTTTTTAACCATCCAGTTAGGAGTAAATACTTCACCATGATGCTGAACACGAGCTTTTGATTTAATTAGATTTTCTGTCATCATTAACCTTCTTTTTAGCTTCAATTTCTTTGAACCGTCTTCTTTGTGTTGCAGTTGAAATACCCGTAGACTTCTCAATCATGCGATATGTGTATCCCTGTTGCCGTAGTTTGTAAGCCAGCTCCAATTGTTCAGGAGTAAATTTTCGAGGTCTTCCATCACGGTAGCTAGGATCATGCTTGCGTGCGTAAGCTTTGCCTTCTTGAGTACGTGTAACAATCAAATCACGTTCAAATTGAGCAAAGGACATAAAGACATTAAAAATCAATTTTCCGCTTGGCGTGTTGTCAATTCTGCCAAGATTTAGGACATCAATCGCCACCTGTCTTGAAAAAAGCGCATCAACCACATGGAGCGCTTCACCAAGATTTCTGGCGAGTCGGTCTAACTTGGTAACAATCAACGTATCGCCACTTTTTAAGATTTTCATTACTTCATCAAAGACGGGTCTTTCAGTAGTAGTACCAGTGTACTTTTCGCTGTAAATCTTTTCTGCTCCTGCTTGCTTCAATTGTTCAATCTGAGGTTTAAGATTTTGTTCAATTGTTGAAACACGAGCATAGCCAATTTTCAATTTTTATCACCTTTTTTCAATTTTACTTTTGACTAGCATTTATGATCAATTCTTATCCATTATAACATCGTGGGGTCAAAATCTTCGGACAAATGTGCCAAATCTTGAAATAAGTGAACCGATAATAAAGGCACCGAGGCCAAATAAAACTAATTTTTTGGTCGCAAACCACTTAGTGAGCAAACTTGAAAGTGGCAGGACTATCCCGATTACGAGCATATAGCCGGTAACCAGCCATTGAATTGAAGACTGACCAACTTGAAGAGCCTTCATCAATTGTGGTAAGGCAATGTTTAATGAAGTTTCACTAAACATCCCTACAAATGATCCTAGCATTAATGGGAGAATAGCAAGCCATGGACATTTAACGTTAACACGAGCTTGCACCCCTGTTTGAGCATTATCCATTAATTCTTCCTACTTTTATTGCAACGTTGATTACTTTACCATAAAGCTTTTTCTTCATAAGTAAAATTTTTCTCAAAAACAAAAAAGACGTCTTTCGACGCCTTTTTGCTTAATTATTTAATTTAGTACTTCAAATCGCTAAATTGCTTATTTGAGTTTTGTTTTGCCAAATCTTTACTCTTCCAGTAAACAGCATTCGTAGTCCCTTTAGAGTTAGCTGAAACGATCACATCTTGACCATTTTCGGTATGACGAGCTAAGTAAGTGCTACCTTGACCAGGTTCTTGACCCATATTAGTGAAACTGTAGAAGTTAATGCCATTCATGGTAGTTTGCTTCATGTAGCCTGATAATGAATCTGATGAACTATTGGTATTATCCTTCTTAACGTTCATGCTAAAGCCGGCTTCATCGTTGATCTTGTTGCCTGTAACAGTTAAAGTATCGATATTGCCATTACCAGTATCAGCAGTGTACCAAGTACCTTGAAGAGCTGCTGGAATTACGCTATTTACATTAGCAGTATCCTTAGTTGTAGCACCATCGCCTTCAAGTTTAGCATTTAATTCAGCGTTTTTAGCTAATTGGTTAACTAAATCTGCATCATTTTTATCATTTAGGTAGTTTACAATGTCAGACAAGCTAGCTGTGCCAACATTACGTTGCTTTACGAAAATAGTGATTTGCTTATTAGCACCATCGCCGCTAAGCATATAACGAGCGTTACTGCTCTTGCCTGTACCACTAACTTCGTAAACATAGCCGCTATTGTCTGAAGTAATACCAGTTTCAGCCTTGCTTCTAAAAGCTACGCTCAAATGATTTTGCTTAGCAGCATTTACTGCGATCTCCCAGGTATCACCATACTTTTGTGAAGCATAAACCGCAATTGCACTAGCGGTAGTCTTGTAGTCCATGTTTTCTGCAGAGAGCTTATCTGAATCACTGCTCTTCGAACTAGAGTCTGATGAAGTATTGCTGTTGTTTGAGCTCTTCTTAGAAACTTGACTGCTCTTAGATTGTTCTGATTGTGTATTTCCTTGTGTACTTTGTGAGCATCCTGCCAAAGTTAAAGCGGCTGAGATTGCCACAAGTACGGTTAATTTGCGTTTCAAAAGGATTCTTCCCTTCTATAAAATCTATCTCTTTGACATAATACTATTATAAAAAAAGTAATAAGCCAATCTTTTATCAAAGATATTATAGAAAATTTACGAATTAACTGAGGTTAGCAAATTTAGCGTCTACATTAGCTTTAGCTAGCTTAACAGACTTCCAGTAGCTACCGGTGCAACTGCCAGTATCAACGCTATATGTCGCTACAGCCTCATTTTTGCCTTTCTTTTGAACGGTATATAGCCAGCCAAAGTTTTGGGCATTCAAAGCCTGAACGTGATAACAATCAATCCCATTGATATTTTCCATTCTGGCTCTAGCCCATCTTCTAGTTTGTTCAAATGACGTTGGTGCAAAACTACCAGAGATCTTATGGATCTCCCCGCCATTAATGCTATGAGCATCGATCACCAGCTTTTTACCTCGACGGTTGTACCAGGTGCCGCGCAAGTGCTTTGGAACTGTAGCGAGACCATTATCGCCTTTAACACCATATTTATCAGAAGTAATTGACGCTCCCATAGTCGTATAAGCAGCCAGTTTTCTGACATTTTTTATCTCGTTTTTGTCGTTTAAATATGACACAAGCTCATTTAAACTTGCACTACCTATTTTTTTGCGGTTATAAAAATTAACGGTGCTGCCTTCCAAGATGTAAAAAGTATCCGGTTCTTTGCCATTACCGCTAACCTGATAGATATAACCCTTATCCTTAATGTATCTAAAAGCAGTAGCACTCTTTACGGCAACGCTTAAGCCATTTTTATCGGCAGCTTGATAAGTCTTCTGCCAGGCCTTTTTATATTTCAATGCACCATAAACCGTGATCGCACTTGCTTTTTCAGAATCAGATAAATCTTGGGCAACTAATTTTTTTGGAGCCTGTTTTTGAGTGCTAGAAACTCTGCCATCATTTTTATTATCAGTATGAGAATTTTGGCTACATGCCGTTAATCCTAAAACGGCAGCAGTTAAACACATGAATGTTAATATTCTTCTACTCATCAAAAAAGACTCCAAATTCTAATTGCAACACAAAAATATCCCTAAACTTATTAGGGATATTATTACCTTTATTTTTAAATATTATATAGCATATTTGCCCGTTTGACATTGCTCAAACTTTAAAAATTAATAAAAAATCCTCGCACAGAGCGAGGATTCTTACTTGATAATCTATTAAATTACCAATTCATATAACGTTGACGTTCCCAATCTGAAACAGATTGTGAGTACTTGGACCATTCTAATTCTTTTGATTCGATAAAGCTGTGAGTTAAGTGTTCACCCAAAGCGCTCTTAATTAAGTCGTCTTCTTTAAATGCCTTGATTGCATTGTGAAGAGTAGTTGGAAGTGGCTTAATGCCGTGTTCTGCTCTTTCTTCTTCAGTCATTTCGAAGATGTTTTCTTCAACTGGCTTCATTGGCATCTTTTGTTCCTTGATACCTTTCAAACCAGCAGTTAAACATGCAGCAAGAAGTAAGTATGGGTTAGCAGTTGGGTCAGCTGAACGCATTTCTAAACGAGTGTTGATTTCACCGGCACTTGGAATACGAACAAGTGGTGAACGGTTCTTAGCAGCCCATGCAATGTATACAGGAGCTTCAAAACCAGGAATCAAACGCTTGTATGAGTTAACAGTTGGGTTACCAATTGCAGTGATTGCACGTGCGTGTTCCAAAATACCGTTCAAGAAGTAAAGTGCAGTGTTTGAAAGGTGGAATTCACCATCTTTGTCGTAGAAGACATTGTGCTTGTTCTTGAAGAGTGACATGTTGTTGTGCATACCGTTTCCGGCTTGACCTTCAACAGGCTTAGCCATAAATGTAGCAAACAAACCATGCTTTCTAGCAATGTGACGAGCAACCATCTTAAAGGTTTGGCATCTGTCAGCAGTAGTTAAAGCATCATCGAATCTAAAGTCGATTTCTTGTTGACCATCACCAACTTCGTGGTGAGCAGCTTCAACTTCGAAACCAATTTCTTCCAAAGTTTCAACGATTTCACGACGGCAACGTGCACCTTCGTCATCTGAAGTCATGTCAAAGTATGATGCGTGGTCTGGAACTTCAGTAGTCCAGTTACCATTTTCATCTAACTTGAAGAGGTGGAATTCCATTTCAAAGCCGATATCAAATGTATCGAAACCAGCTTCTTTCATTTCACCAAGAACACGCTTCAAGTTGTTTCTCGGATCACCTGCAAATGGCTTACCATCAGTCATGTGAACTGAACAAATCAAACGACCAATCTTGCCACCGTGTTCGTCACCCCATGGCAATACTGACCAAGTTGAGAAGTCAGGGTATAAAACCATGTCACTTTCTTCAAGACGAACAAAGCCGTCAATTGAAGAACCGTCAAAACGAATATCATTAGTCAACACTTTATCCAATTGACTAGTTGGTACTTCAACAGCCTTTTCAGTACCGTTAATATCAGTGAAGCATAAACGTAAAAATCTTACGTCTTTGTCTGCTACTTCTTTTCTAATTTCTTCTGTAGTATATTGTTTACTCATAGCTTTTCCACCTATTAAAAACAAAAAAACATTAAATTGGTATGGTCCTATGATTTGATTTTTATCTCCTAGAACTGTTTAACACAATAGCAGATCAATTTAAAAAAGTCAAATTGGTCTAATAACATTTTTTAACTTTTACTATTTGTTAAAAAATGTATCGCGAACCGCATTAATGATTGCGACCTTGTCATGAGCATAAGTCAAACCGCCTTGCATATATACTGCGTAAGGTGGTCTAATCGGACCATCTGCTGAGAATTCAATTGTTGAACCTGAAACGAAGTTACCAGCAGCCATAATAATCTTGTCTTCGTAGCCTTCCATGTGGACAGCTTCTGGAGTTACGAAGGAGTCGATTGGTGAGTTCTTTTGTACTTCTTGAACGAACTTCACCATCTTGTCTGGGTCATTGAAGATGATTGTTTCGATAATGTCACTCCGCTTTTCATCCCAGGCAGGCGTTACATTCATGCCCATCTTATCAAACAATGCAGCCGCAAAAATCATACCCTTTTCGGCCATGCCAGTTACGTTAGGTGACAAGAAGAAGCCTTCATAGAAGTCATGCAAGTTGCCAATAGTAGCTCCTTCATCGGTACAGCCAGGTGCAGTCAAAGCGAGTTTAGCATTTTCTACTAAGTCCTTTTTACCCACGATGTAGCCACCGGTCTTGGCTAGACCACCGCCAGCATTCTTGATTAAGGAACCAGCCATTAAGTCGGCACCGTATTCAGTTGCTTCGTGCTTTTCAGAAAATTCACCGTAACAATTATCAATAAAGACAATACTCTTAGGACTTACCTTCTTAACAAAGGCAATCATTTCTTTAATTTGATCGATGGTGAAAGTCTTTCTAGTTGAATAGCCACGTGAACGTTGAATAGCGACAATCTTTGGTTGATCACGCTTCAAAATCTTTTCTGCTTGATCATAGTCAACAGTATTTTCATCATTTAGTGGCACGTATGAAAACTTCACGCCCTTTTCAGCCAAAGTACCACGCTTATCAACGGTTAAGCCGATTACTTTTTGCAAAGTATCATAAGGTTGACCAGTTAAATAAGTTAAAGTATCGCCTGACTTCAAATTACCGTTTAAAGCAACAAATAAAGTATGCGTACCTGAAACGAATTGTGGACGTACTAAAGCATCTTCAGTATCAAAAACTTGGGCATAAATCCGATCAAGCTTGTCACGGCCCATATCGTCATCACCATAGCCCGTAGAACCGCTTAAATCAGCTTCTGCGACGGCATTATCTTGGAAGGCTTTAAGTACCTTAGCTTGGTTAAAAACGATCTGATCTTCAATTTCTGCCAATTTAGGAGCGATTTGTTGATCAACTTCTTTGACAATCTTTTGTAGTTTTTCTGGTAAATTATTCATTTTTCCATTCCTCGACTTTTGCAATAATCTTTTGTTTACAATCAGGATCATTCAGCGGATCAAACCAAACGATGGGTAGTTGATTTCTAAAATAAGTTAATTGTCGTTTAGCATACCGCCTCGAAGCAGTTTTTAACTGCGTAATACAGCTGTCTATATCTTCCTTGCCTTCGAAGTAAGGAAAGAATTCCTTGTAAGCAATCGCCTGCAAAATTTGATGCTCTTTAGCACGATTCTCATAAATGAATTTAGCTTCAGCCAGCATCCCTTTTTGCATCATCTTGTCGACGCGCAAATTAATTCTGCGATAGATTTCCTGTCTATCAGAATTAAGGCCGATAATCAAATAATCATAGCGGGGGGCAATCTTCTTTTGTTGTTCCGAAAATTTCTTACCAGTACGATCAATTACGGTCAGTGCCCGCATCGTTCTACGTGAATTAGGCACCGGAATTTTCTTAGCTGCAGCCGGATCCTTTTCATTTAATACCTGCCACAGTTTTTCTGGGCCGTAGCGAGCGAGATAATCTTCCCATTTTTGCGACGTACCGCGATTTTCTTCTTGCTTTTCACCCAGCTGCATTTGGTTAAGCAAGGCATTGACATAAAAACCGGTACCTCCAGCTAAAATTGGCAATTTACCTTTAGCACTAATTTCTTTAATAGCTTTTTGGGCTTCGTCAACGAAATCTTTGACTGAAAATTCATCAAACACAGAGCGCGTATCTACTAGGTAATGTTTAACCTTGCTCTGTTCCTCTTTGGTTGCTTTGGCGGTACCAACTTCGACTTCTTGGTAAACTTGCATAGAGTCACCTGAGACGATCTCACCATTTAATTTCTGGGCTAAGGCAATAGCCAGATCGGTCTTGCCAATCGCGGTTGGGCCAACAATGGCTAATACTTTTTGCATATCATCACTTTCACAATCCATTTTTAAACGAAAAAAGCACATTTGAGCAGATTCTCCTCCCAAATGTGTTTTTCAAATTAATATTTAGACTTTTTAGTTCTGCCGTCCCAGGTCTCAAAACCACCCTTAAGCCAGTGGACTGAAACAAAGCCCTTCTTCTTCAAGAAACGGGTAGCTCTCAATGTGATTGTGTTTGAGTCTGAGTAAAGGTAAACCGGCAAATCACTTCTAAGTTCATTATATTGATACTTAAGCATCGTATATGGCAAGCTTCGCGCACCATCAATATGCTTGCGTTTAAAAGGCTCTTTTTCTCGCAGATCAATAATTTGTGCCTTACGCATGCCTTCTTTAAATTCTTCATTAGTCAATTCACCGCCGAGGGATTTGGCTTGAATCTTATTCCAGGCCCAAACACCACCGAAGACTAAAAGAATAACTAATAAAACAGTATCTAAGACAATTAAAAAACTCGACATAAAATCTAATCTGCCCTTTCATAAAAGTTTACAAATATAATGATACCGCTAAACTAATTTACTTGCTATTATTTTTCTTATCTTTCTTTTTTTCATTGAGTTCTTCAATGCGATATTCTCTACGTAAAATAAGTTTAGCCATCATATAATCATGTTGATCGATCAAACCAGCCTTATTAATATTGTCGAGTTCTAAGGCCATTAGTTCAATGTCCCAGATTCTTTTACCAATATGAACCAAAATGCCATATTTCTCTAAAAGCTGCTGTACATCATATAAAGTCTTCATTACTTTTCCTTAGAAGTTAATAATCATACCTGTTCTAACTGTCCAAACTATATAAAAAATTAGTACGGCACCAGCCAAAACACGCCACTTAGGATTGTAAGTCTTCATTACACGATCGCCTAAAATAACCGCTAATAAGAAACCGACAATCAAACCACCGAGGTGACCCCAAATATCGATGCCAGGGGCAAAAATATCAATACCCAGGTTAATTAAGGCTAAAACGAAGGCCTGTCTACCTAAGAAACTGATCATTGGGTTATGAATATTGCGAAGTCCAATTGCAGTCATCGCACCAAAAAGACCAAACAAAGCAGTGGAAGCACCAGCGCTCAAACCGCGGTCGGAACTAAAGGCTAAACTCATTAGATTACCGCCAATACCGGCTAAAAGATAAGTCACTAAAAATCTAGCGTGACCCATAATCGGTTCAATATATTGCCCCATGTAGTAAATAATTACGGCGTTAGAAACCAAGTGCATTACACCAATATGCAAAAATTGGGCAGTAAATAAACGCCACCATTGGTTGCCAACCACTACGGCGTAGTTGCTCATGGCACCCATTTTCATCAATACGTTGGTATTTTCTGAACCGCCAAGGAATACTTCAACCAAAAAGACGATCAATAATACTACTAAAATACCCATTGTCACAAATGCTTGCGACAGGTTGATTCTTCGATTCAT
Protein sequences of DBSCAN-SWA_3 >NC_017470|1425377:1483434|1476158_1477151_-|WP_014566057.1|DBSCAN-SWA MKRKLTVLVAISAALTLAGCSQSTQGNTQSEQSKSSQVSKKSSNNSNTSSDSSSKSSDSDKLSAENMDYKTTASAIAVYASQKYGDTWEIAVNAAKQNHLSVAFRSKAETGITSDNSGYVYEVSGTGKSSNARYMLSGDGANKQITIFVKQRNVGTASLSDIVNYLNDKNDADLVNQLAKNAELNAKLEGDGATTKDTANVNSVIPAALQGTWYTADTGNGNIDTLTVTGNKINDEAGFSMNVKKDNTNSSSDSLSGYMKQTTMNGINFYSFTNMGQEPGQGSTYLARHTENGQDVIVSANSKGTTNAVYWKSKDLAKQNSNKQFSDLKY >NC_017470|1425377:1483434|1429440_1429848_-|WP_013438340.1|DBSCAN-SWA MSEYLVKSKLEDTEWQIANQAREHEFICDDNDKEHDKGPNPVEYLCGSVNSCIVMSAGMVAKSHGLDVKNFRVENNAKTENLGHGKSVVTEMNIKVFFDSEMSKDEKEKFLAHTLHVSTVYQTVKEAIKIYVELA >NC_017470|1425377:1483434|1482753_1483434_-|WP_013438377.1|protease|DBSCAN-SWA MNRRINLSQAFVTMGILVVLLIVFLVEVFLGGSENTNVLMKMGAMSNYAVVVGNQWWRLFTAQFLHIGVMHLVSNAVIIYYMGQYIEPIMGHARFLVTYLLAGIGGNLMSLAFSSDRGLSAGASTALFGLFGAMTAIGLRNIHNPMISFLGRQAFVLALINLGIDIFAPGIDIWGHLGGLIVGFLLAVILGDRVMKTYNPKWRVLAGAVLIFYIVWTVRTGMIINF >NC_017470|1425377:1483434|1465467_1466799_-|WP_014565539.1|transposase|DBSCAN-SWA MKRMSSLAYHFGVKLRFYPSSKQKKIIKLNYDAQRFVYNSYVGRNRTSYHAKRYLANRQNRAMPFAFSALNSYEVQRAETVVINNELLAKPKNIRDAYSFLRVKEIDSLAIANAIQNYQKAWNNYRKIGHGIPTFHKKRSDWSYQTNCQYPKQSEAYLDNGTARFIDAKHIKLPKLGIVRIAGFRKLIKERLLKQIPTRIGTVTIKKTADDQFYLSMQLGSDTAFVKELPKTQSQIGIDLNLDNFLTASNGAMVANPRFYRKTKKKLAHAQRVLSRRQRRAKKEGRNLRLAKNYQKQRLIVAKLHDKIRRQRNDFLQVLSTALIKNHDLVVAEELRNRNLLKNHALSQSISDVGWRSFLNMLAYKADLYDKEFLTIDPKYTTQRCHACGSIMGQNGYKKLTLKDREWTCPICRMPHIRDWNAAVNILEKGLSKWQNPKIKKAA >NC_017470|1425377:1483434|1475034_1475628_-|WP_014566055.1|DBSCAN-SWA MKIGYARVSTIEQNLKPQIEQLKQAGAEKIYSEKYTGTTTERPVFDEVMKILKSGDTLIVTKLDRLARNLGEALHVVDALFSRQVAIDVLNLGRIDNTPSGKLIFNVFMSFAQFERDLIVTRTQEGKAYARKHDPSYRDGRPRKFTPEQLELAYKLRQQGYTYRMIEKSTGISTATQRRRFKEIEAKKKVNDDRKSN >NC_017470|1425377:1483434|1455536_1457543_-|WP_014566043.1|DBSCAN-SWA MTKTLSRFLYGGDYNPDQWTEETWPEDIKVFKKVDLNSATINIFSWAVLEPREGVYDFSKLDKIVQELSDANFDIVMGTATAAMPAWMFKKYPDIARVDYQGRRHVFGQRHNFCPNSKNYQRLDSELVEKLAQHYADNPHIVVWHVNNEYGGNCYCGNCQNAFRDWLRNKYKTLGALNKAWNMNVWSHTIYDWDEIVVPNELGDAWGPESSETIVAGLSIDYLRFQSESLQNLFKMEKAVIKKYDPETPVTTNFHSLPNKMIDYQKWAKDQDIISYDSYPTYDAPAYKPAFLYDLMRSLKHQPFMLMESAPSQVNWQSYSPLKRPGQMAATELQAVAHGADTVQFFQLKQAVGGSEKFHSAIIAHSQRTDTRAFRELADLGQKLKEAGPTILGSKTKAKVAIVFDWSNFWSYEYVDGITQDLNYVDSILDYYRQFYERNVPTDIIGVDDDFSNYDLVVAPVLYMVKAGLAEKINSYVEKGGHLVTTYMSGMVDSTDNVYLGGYPGPLKDVTGIWVEESDAMVPGQKVRVTMDGKEYETNLMCDLIHPNKAKVLASYADEFYTGTAAITENDYGKGKAWYVGTKLGHQGLTQLFNHIVLETGVESLVCDSHKLEVTKRVTADGKELYFVLNMSNEERELPNKFADYEDILTGEKAKSSMKGWDVQVLTK >NC_017470|1425377:1483434|1432502_1432943_-|WP_013438344.1|DBSCAN-SWA MENVKKKKLHQLFENLEIGIGGVSSSLGVSQRQLRYWEKKGYIKPINEGSGVRHYSLATVYLIAFIKDQLDAGYTLEAAFKKSKEIRIKSKIARKLLRNAFDDIEVTDEAKGYGKIEMGTIKVGDKNAEVVGIVDEDGSHFELKEE >NC_017470|1425377:1483434|1482513_1482744_-|WP_013438376.1|DBSCAN-SWA MKTLYDVQQLLEKYGILVHIGKRIWDIELMALELDNINKAGLIDQHDYMMAKLILRREYRIEELNEKKKDKKNNSK >NC_017470|1425377:1483434|1474394_1475054_-|WP_014566054.1|DBSCAN-SWA MTENLIKSKARVQHHGEVFTPNWMVKKMLAEPSIKAKLNDLHATFLEPSAGEGAFLVEILNQKLDYVDKISSKTNWTVNALWALMSVYGIELLQDNLIVARSRMIEVVAKHYKRVLKKDLSHRTDFYKATNFVIKTNIVQGNSLTYKNHAKQLIQFSDWQPIDSKQVKRETFTFKSMFDGSDDGQIDEQLDLFNLDEPSSTIEYAICPVTKIYKEEKTK >NC_017470|1425377:1483434|1449796_1451260_-|WP_014566041.1|DBSCAN-SWA MKVIEKFADKVISSGAYKPLDRVYVINKIRGLVGDHDEEENDQAAVKQLVDMAVKNKKIPDDVTSREVLNDQLYDLATPTPSETNSIFWQKMEKSSEKATDWFYKLCVNNNYVKKEAIARNVVFSGTSSKGHGLEITINLSKPEKDPKAIAAAAHATGKKYPQCALCLENEGYLGGYGKNARSNLRIIRMNIAGRPWGFQYSPYAYFNEHCIFLDQKHIPMVINQQTLINLVEIEKTFPHYFVGSNADLPIVGGSMLAHEHYQGGRHIFPMMKAKIKKSVVFNEYPDVKAGIVDWPMSDLRLISNNSLDLIDLGSKIIKFWDQYSDADRDIKAFDGETRHHTVTPIMHREGKSFVLDLVLRDNNTNKDYPLGIFHPHKQLWHIKKENIGLIEVMGRAILPGRLKSELEEVKKYWLGEDNDMADSHKEWADKIKSEQNITKVNVDQVMEQALVEVFEQVLQDAGVFKNNADGDKGWNKFISALIENVG >NC_017470|1425377:1483434|1464976_1465300_+|WP_013642269.1|DBSCAN-SWA MTSIRDIAKIAGVSPASVSRILNNDPTFHINEAARGRVIEIARKLNYNKADKKRGPKQPDSSLSIALVMRYGNMREFNDPYFLNMHKGINEEAKKWHLRVEQPCQLP >NC_017470|1425377:1483434|1478336_1479674_-|WP_013642275.1|DBSCAN-SWA MSKQYTTEEIRKEVADKDVRFLRLCFTDINGTEKAVEVPTSQLDKVLTNDIRFDGSSIDGFVRLEESDMVLYPDFSTWSVLPWGDEHGGKIGRLICSVHMTDGKPFAGDPRNNLKRVLGEMKEAGFDTFDIGFEMEFHLFKLDENGNWTTEVPDHASYFDMTSDDEGARCRREIVETLEEIGFEVEAAHHEVGDGQQEIDFRFDDALTTADRCQTFKMVARHIARKHGLFATFMAKPVEGQAGNGMHNNMSLFKNKHNVFYDKDGEFHLSNTALYFLNGILEHARAITAIGNPTVNSYKRLIPGFEAPVYIAWAAKNRSPLVRIPSAGEINTRLEMRSADPTANPYLLLAACLTAGLKGIKEQKMPMKPVEENIFEMTEEERAEHGIKPLPTTLHNAIKAFKEDDLIKSALGEHLTHSFIESKELEWSKYSQSVSDWERQRYMNW >NC_017470|1425377:1483434|1431754_1432294_-|WP_014566034.1|DBSCAN-SWA MNRFDELNNNYQKVHHERLYEHLNGRVKSEASMEEKCTRKHYPLTAHSALREVRDSIGLRVVCNFIDDIYTCIDYIKSWDDVSVYNEKDYITNAKPNGYRSYHMILEITVPDEDVDGNIPGHYFVEVQLRTIAMDTWASLEHEMKYKHQIKNPEMIGKELKRVADELASCDVSMQTIRQ >NC_017470|1425377:1483434|1469489_1469717_+|WP_166484902.1|DBSCAN-SWA MMVSPSSSTPSEVQQIYANVARFRVENAEKQAKSVKSLPQPGESNEKLAVLGSLVSAISLVSLGLSEIERNKRQN >NC_017470|1425377:1483434|1453498_1454215_-|WP_013642262.1|DBSCAN-SWA MKDTPKYLAIAEKLKHEIETGEYQPNDQLPKEFDLARSYNVSRITVRNAMKELESQGLIYRIQGAGTYVKERNLMHATVDSNQLELINLKKYKLKLLDFEVGQVIPKITEALNVQPYDIVYTIKRAAMATNKQLVAFQKICIPAKVIQGLNMEMMESSIYPLIQQKLNVKPAKATREISLVHADQDLIAKGEVKGQVEEKEPLLKWLQKSYLSGGRAFEWNENYYRITKFPIKEAIVL >NC_017470|1425377:1483434|1427238_1427637_-|WP_193363680.1|DBSCAN-SWA MFVIKPQSVAAKELSPFQTKMIGNGNFGIYQRLTRRGPACWIGSTRTFRHGNFQASAHVTVEVRSDDNEGIVKADKTPRPGAGSATSWFKHFKTSGHWGKSFAPETKSHTFLMKNARKSPYFNAGMDRACRL >NC_017470|1425377:1483434|1435185_1436223_-|WP_014566035.1|DBSCAN-SWA MRWRRILLGGVLAIGVVTAMFTWSVDRVETGDKDLPTQVKDYMKWHHINGVALVSGKKGQPIVVKNKETSNKEKVVEANRLFPIASLQKIMTGTAIYKLKQTGQLDWNTSLSTYYPQIPGSKSITIRELLNHTSGLINNARPDSPLKGEKEQTAFMLKHIKYDHLHTWDYQDIDYELLAAIISKQTHLSYNDYIKQTFAGPLNLRQIKDFSEVAKNEVPQPVNNVSWHEVTVTTSSDFGAGNLFMSPKDYWKFINQAVLKNPKMINTFSQQTQHEEVAYFGGVYFHNNVIAGNGSIPGYNSCFIANYKTNEMVMMFSNNIDYWDLKEDSDYILHHYVGYRPEFRF >NC_017470|1425377:1483434|1445540_1446416_-|WP_007126478.1|DBSCAN-SWA MKSKNFIEKYWGWLFLIVPIILQLIFFYFPLVQGAFYSFTNWTGLTYNYKFVGLNNYKLLFMDQNFAKSIGFTLILTICLIVGEIVIGILVARALNSKIKGQTFFRAWFFFPAVLSGLTVALIFKQVFNYGLPAIGNALHISWLQTSLLGTDTGAIIATIFVLLWQGVAMPIIIFLAGLQSIPDEIKEAAEVDGANSRQIFWKIELPYMLPSISMVFILALKSGLTAFDQIFAMTAGGPNNSTTSLGLLVYNYAFNNNSFGYANAIAIVLFLLIAIVSFIQIKISNKYAVE >NC_017470|1425377:1483434|1481055_1481976_-|WP_013642277.1|tRNA|DBSCAN-SWA MQKVLAIVGPTAIGKTDLAIALAQKLNGEIVSGDSMQVYQEVEVGTAKATKEEQSKVKHYLVDTRSVFDEFSVKDFVDEAQKAIKEISAKGKLPILAGGTGFYVNALLNQMQLGEKQEENRGTSQKWEDYLARYGPEKLWQVLNEKDPAAAKKIPVPNSRRTMRALTVIDRTGKKFSEQQKKIAPRYDYLIIGLNSDRQEIYRRINLRVDKMMQKGMLAEAKFIYENRAKEHQILQAIAYKEFFPYFEGKEDIDSCITQLKTASRRYAKRQLTYFRNQLPIVWFDPLNDPDCKQKIIAKVEEWKNE >NC_017470|1425377:1483434|1428716_1429424_-|WP_014566032.1|DBSCAN-SWA MSLLDEAVVLNDGSLMPKVGVTVTSDEDVARAMKAGYRLLNCTADEKIDFKNLSPQTYVEIQVSEKVNTREEMRNNLKEIRNNLVAKHADLAMLRLSDDAERNNQLWQELEQVKIQGWLKNIGVTNANGDTLAAILKAPKIKPSVIHLDYEDGSLIKLARENKMQVELSVAGNIDALAEIAGHYGTSTMELVLRYFAQKRIVPIVQVDDIVENPQTDFTIAPEDMETIGQLFAGK >NC_017470|1425377:1483434|1426808_1427273_+|WP_013641431.1|transposase|DBSCAN-SWA MNNKRDKIKDAGYEKHYVYNAHYHLIWCTKYRNQIFTNQDLAQEMRDLLKQIAEDNQIKIEKMEVMPEHIHLLISFRPSKSGSSVVKALKGRSAFLFFKNHPEIKQNKMWGGHLWSPSYYFGTVGNMSKEVVEKYINDQIYNAVRDGKPYPSPH >NC_017470|1425377:1483434|1425377_1426709_-|WP_014566029.1|transposase|DBSCAN-SWA MKRMSSLAYHFGVKLRFYPSSKQKKIIKLNYDAQRFVYNSYVGRNRTSYHAKRYLANRQNRAMPFAFSALNSYEVQLAETVVINNELLAKPKNIRDAYSFLRVKEIDSLALANAIQNYQKAWNNYRKIGYGIPTFHKKRSDWSYQTNCQYPKQSEAYLDNGTARFIDAKHIKLPKLGIVRIAGFRKLIKERLLKQIPTRIGTVTIKKTADDQFYLSMQLGSDTAFVKELPKTQSQIGIDLNLDNFLTASNGAMVANPRFYRKTKKKLAHAQRVLSRRQRRAKKEGRNLRLAKNYQKQRLIVAKLHDKIRRQRNDFLQVLSTALIKNHDLVVAEELRNRNLLKNHALSQSISDVGWRSFLNMLAYKADLYDKEFLTIDPKYTTQRCHACGSIMGQNGYKKLTLKDREWTCPICRMPHIRDWNAAVNILEKGLSKWQNPKIKKAA >NC_017470|1425377:1483434|1452987_1453374_-|WP_013642261.1|DBSCAN-SWA MSSNSTPSGLKKFENGFIRIAGKISGNRLLLTLRDTFVLVAATTMIAGFGIMIQNVIVDPMNGLIFGKQGLQLGRLISRSWATWQQSGFEHGLQTVANLIALVSNGILNFFRFASSSNFQLESLNQIL >NC_017470|1425377:1483434|1439886_1441329_-|WP_013642252.1|DBSCAN-SWA MPIKNKVMLITYPDSLGKNLQELSEVLENDLKGAVGGIHLLPFFPSTGDRGFAPTDYTTVDPKFGNWSDVEKLGEKYYLMFDFMINHISRHSKYYEDFQKNKDKSSYADMFLSWDKFWPKGRPTKEDVDLIYKRKDRAPYQEITFADGSKEKLWNTFGPEQIDLDVRKKVTQKFIKDTLVSLIKHGADIIRLDAFAYAVKKLDTNDFFVEPEIWNLLKQVQDDIADEGATILPEIHEHYSMPFKISKHGYFIYDFALPMVTLYSLYSGKSNRLAAWLKKCPMKQFTTLDTHDGIGVVDARDILSPEEIDYTSQELYKVGANVKKKYSSAEYHNLDIYQINTTFYSALGDDDKRYFMARLLQVFAPGIPQVYYVGMLAGKNDIKLLEETKEGRNINRHYYSKAEVEQEIQRPVVASLLKLFTFRNNEPAFDLNGSIDISTPNENEIRIVRINKEQNHKAELTANLQDLTYRVLVNGKQINF >NC_017470|1425377:1483434|1434575_1435142_+|WP_013438346.1|DBSCAN-SWA MILIILILAVIAFIYFNVIPKKGYMPLAIISLLIAVLSIAGIVAHDYNHYGMKTQTTTVKKELVSSASPQLPVLLYQPLGNGTEKIYLYKTKNTDKKPTPIKTDKTHASMKTAAKPSMTIKTERYVFSNGWNQFLFGWFGHNNELKHREYTFNVPKNWKVLSIKQAKSLQKQMAKRAAMMKKMQQMKK >NC_017470|1425377:1483434|1443553_1444666_-|WP_013642254.1|DBSCAN-SWA MVKVDLDHVYKKYEGNDNYSVTDFNLHIKDEEFIVFVGPSGCGKSTTLRMIAGLEDISKGTLKIGGKVMNNVAPKNRDIAMVFQNYALYPHMTVYDNMAFGLKLRKVPKEEIDKKVQKAAKILGLSQYLKRKPAALSGGQRQRVALGRAIVRDAPIFLMDEPLSNLDAKLRVTMRAEIAKLHQQLKTTTIYVTHDQTEAMTLADRIVIIKDGIQQQVGTPMEVYNKPANVFVAGFIGSPAMNFFHVILKDGKIIDKENHDFEIPVPEGKLKVLKDKGYNGKELIFGIRPEDVHTEEVFLESFPEQVVEAKVIVSELLGAETQLYCKVGNTELVSKVDARDFTKPGAKVRMGFEMNKSHFFDPETQKAVEN >NC_017470|1425377:1483434|1459665_1460673_-|WP_014566045.1|DBSCAN-SWA MTTIKEIAQKSGYSSATVSRLLNNDPNLSITADTKNKILEIANKLGYWKDHQEKEIKPTIALLYRTNHKEQLQDEYFTSLKQALVETVKQDSMKMKTFYNAADLIENASLFQGFIGVGAAPIENAELVKLHGVLPNGVFVDTNPAPELFDSIRPNLTLTVKNAIDLFIKNGYKSIGFIGGVGPKHDHIQENDPRSVAFAEYMKVRGKDDSNMFVSGPFSVENGYKLGKEVIKKYKNNLPDAFLIASDTLAVGVLQAFNEENINVPKDTAILSINNSNVVKYVSPPLSSYNINQQEMIDLALTMLTNLIIQPDRAHIDVNMNTNLVVRKSFIPKGE >NC_017470|1425377:1483434|1473357_1474398_-|WP_014566053.1|DBSCAN-SWA MTEEKHRKFNVIIGNPPFQETLKGTSDKQIFPYFMDDAYKIGEKVELITPAKFLSNAGKTPKKWNEKMLNDPHLKVLFFEQDSSKVFENTDIKGGVCVTYRDISKTLGPIGTFSPFKELRSILHKVVYNKSIAFKSITTIIHLQNKFNLTALYADHPELKKQVGSNGREKRLTTSIFSLPVFTDKENITSTVKIWGIINNNVRKYRFIDEKYLDPDSTLHYYKVLIPKSNGSGALGETLSTPIIGTPLVGYTQSFIGIGKFNQKVDCQNAFKYIKSKFCRVMLGTLKVTQDNNKGTWKNVPMQDFTNKSDIDWTKSIPEIDQQLYDKYGLSDDEIAFIEKNVKEMK >NC_017470|1425377:1483434|1433103_1434576_+|WP_013438345.1|DBSCAN-SWA MNKQPVDIHEKQYNRNLLVLVLIIGSFCTVLNGTLLSTALPSIMRDFKISTATAEWLSTAFLLVNGVMIPISAWLINRFGSRKMYLSAMSTFFIGTVIAAIAPNFQTLLAGRIIQGLGVGVTMPLLQTIMLSIFPANKRGAAMGTVGIVIGLAPAIGPTLSGWVVDNLSWRYLFSIIAPIAGIVVILAAFLVKDVLPTKDEKIDVFSVATSTIGFGSLLYGFSEAGNKGWTNPEILAFIFVGIIFVILFGIRQLKMDDPFLDIRVFKHFEFSLAAILSGVTNLAMVGIEMVLPLYIQNLRGESAFHSGLILLPGALMIGIMSPITGRIFDRYGARKMAITGMTLLTLGTIPFVFLTENSSFLMIIILYAIRMVGVALVMMNVTTSGMNSLPLDKISHGTAVNNTFRQVLSSIGTAILVSVLTTTTNNNMPAKSMLKTLPLQYKNGAINATLDGFHASFAISILFALIALVLSFFLKKGNRARERSEKVDS >NC_017470|1425377:1483434|1436447_1437653_+|WP_013642249.1|DBSCAN-SWA MTQKRTITPWAIFIVCCFISMIGFGLIVNTIGLFFGPISQEFHVGRASVALMTTLQNAAAAISLLFAGKVMKKVNLRWLLTACFSIIAISMLTLAAAHSLTHFYIAWIIIGICQPIAITLSIPVLLSNWFNQKLGTVMGISLGLSAFGGTIFNPIIADIITKFGWRGGFIAEGLLIGLILMPLAASIRPNPDEKHPAYGTTTKSESNSLLNGITLKEALHQPVFYALAFAMLALQFVSGSVQHISGHITNLGISPVLAASVVSGVMIGAAVGKISIGYFLDKLSPILVLLVYSLFGILGWSGQIFLTNSTLLTISAFILGLGQGVCLVALPYLIQKQFGEKDYSNILSVINMLGAFAMSLSVYLVGLFFDQTHSYNLGWTINVIAYILSFIAIFITLRKKA >NC_017470|1425377:1483434|1460897_1462778_+|WP_014566046.1|DBSCAN-SWA MKANIKWLDDPEVFRVNQLPAHSDHPFYKDYREWQNHSSSFKQSLNGAWQFHFSKDPQSRPIDFYKRSFDSSSFDTIPVPSEIELNGYAQNQYTNILYPWESKIYRKPAYTLGRGIKDGDFSQGKDNTVGSYLKHFDLNPALAGHDIHIQFEGVERAMYVYLNGHFIGYAEDSFTPSEFDLTPYIQAKDNILAVEVFKHSTASWLEDQDMFRFSGIFRSVELLALPRTHLMDLDIKPTVVNDYHDGVFNAKLHFMGKTSGNVHVLIEDIDGKTLLNKKLPLKSTVEIENETFANVHLWDNHDPYLYQLIIEVHDQDGKLVELIPYQFGFRKIEITKDHVVLLNGKRLIINGVNRHEWDAKRGRSITLADMKQDIATFKHNNINAVRTCHYPNQIPWYYLCDQNGIYMMAENNLESHGTWQKLGQVEATSNVPGSIPEWREVVVDRARSNYETFKNHTAILFWSLGNESYAGSNIAAMNKLYKDHDSSRLTHYEGVFHAPEFKKEISDLESCMYLPPKEAEEYLQNPKKPLVECEYMHDMGNSDGGMGSYIKLIDKYPQYMGGFIWDFIDQALLVHDPVSGQDVLRYGGDFDDRHSDYEFSGDGLMFADRTPKPAMQEVRYYYGLHK >NC_017470|1425377:1483434|1454366_1455011_-|WP_013642263.1|DBSCAN-SWA MSIVNNDFHKILTERHSVRRFDPSIKISHEEMNEMLEETITAPSACNLQAWKFIVVDTKEGREKLHQYFMPFNFPQVDNSSAIVLFFGNTLAFKKYSQLWHSMYEEKKVTKEAMEAALNTFMPLYEKAPQEMLVADSMVDSSLAAMQFMLIAREHGYDTNAMAGYDAKKAAAVMGLDPKQYVPVMAIAIGKHDPKAEDEITTTRYPLSELVDYE >NC_017470|1425377:1483434|1444692_1445526_-|WP_013438351.1|DBSCAN-SWA MEKENKSKKFWDYALLIVGGILILIPLLYTFLSSFKTTKQIMEHFFAWPNPWTTANFQRLFADGVMNYFGNSIIITVLSIVLVMIFVPMAAYSIARNMSKKTAYNWMYILLIIGIFVPFQVIMIPITVMMSKLGLANMWGLIILYLTYAVPQTLFLYVSYIKQSVPESLDEAAEIDGANKITAYFKIIFPLLKPMHATTLIINAMWFWNDFMLPLLILNRDSKMWTLPLFQYNYTGQYFNDYGPSFASYVVGIITITIVYLIFQKNIIAGMSNGAVK >NC_017470|1425377:1483434|1441339_1443541_-|WP_014566039.1|DBSCAN-SWA MKQELITFDEKNKVFHLHNDQISYLLSVEDGGTLCHLYFGKKVNNYHNQLRYPRLDRGFSGNLPGSLDRTFSRDSLPKEYSTAGEMDYHTPAAVVRQADGSNALFLTYKDYRIEDGKPDLKGLPHSWVIDKSEAQTLIITLEDKKTELNFDLLYTIYRDRPVIVRSVKVKNAGKETVNLEKVASMQIDFVDRNFESITLPGAHAHERRVDRSKIHQGIHVFSSHRGTSSHQMNPFLALVDPDTNEFQGDAYGFAFVYSGNHKFEVEKDQFAQTHVNVGINDFNFNWQLNAGDAFQTPEVLMVYSDQGLNKMSQACHSLIHDRIIRSKYKNEVRPIVVNNWEATYFDFNEDKLKTIVDDAKKLGIEMFVLDDGWFGHRDDDNSSLGDWKVFAKKFPRGLDHFADYVHEQGLKFGLWFEPEMISYDSDLYKNHPDYLMHVPGRNPSPSRNQYLLDLGRKEVRDNIFDQMEKILGSGKIDYIKWDMNRHLSDIYEADLPANRQGEVYHRYVLGLYDLLERIVDAHPDLLIEGCSGGGGRFDAGMAYYDPQIWASDDSDAIDRLTIQYGTSLVYPQSMMTSHVSVSPNEQNGRITPFNTRGIVAMWGDLGYELDLTKLSQEDRKAVAKQVAAYKKIRKVTQFGKFYRLKSAQTSNQCAWETVSADQNEVVLSTVKVMASAQPYLTKTKLVGLDPNKQYEDQATHEVYGGDELMNMGFYDPAEHGDFKAYMYHFKAID >NC_017470|1425377:1483434|1429871_1431050_-|WP_014566033.1|DBSCAN-SWA MAKKKSIYTKDVVLVMAASFFFMFSTMFVNPLINGYAKNLGASSAFAGVIVGIMSVAAMFLRPIAGNLTDKFSKYRLSFIGGVLILIGVLGYVFTPSSAWLLLFRLINGTGYVLCTVCMTTWLAFLVPRQHVGEAMGFYGLMNALAMALAPALSINIYQKIGYRESLIASAISALLMVIAIQFVGDHAKPIVKDKSKMPKKHFKIIQMNVLPVAILTTLFATPYFVTQADLVTYVEQKHLSVAVGSYFLIYAVVLLIIRIGLKRYFDTVRFGVWFWISLISTIAYISLLAMMNNNWQMALAAAGMAMGYGIIYSVLQSTSLLLAPIEEQGLASSTFYLGLDIAMAFGPMLSGVVDSVLPIEWFYPVELVLIPFMLLVYFIWRKRLNEAIDHH >NC_017470|1425377:1483434|1477254_1478133_-|WP_014566058.1|DBSCAN-SWA MCLTAAVLGLTACSQNSHTDNKNDGRVSSTQKQAPKKLVAQDLSDSEKASAITVYGALKYKKAWQKTYQAADKNGLSVAVKSATAFRYIKDKGYIYQVSGNGKEPDTFYILEGSTVNFYNRKKIGSASLNELVSYLNDKNEIKNVRKLAAYTTMGASITSDKYGVKGDNGLATVPKHLRGTWYNRRGKKLVIDAHSINGGEIHKISGSFAPTSFEQTRRWARARMENINGIDCYHVQALNAQNFGWLYTVQKKGKNEAVATYSVDTGSCTGSYWKSVKLAKANVDAKFANLS >NC_017470|1425377:1483434|1448807_1449794_-|WP_013438354.1|DBSCAN-SWA MKTSFIKYGRKDSRDLCEITLTNDHDMVVNVLNYGATLEKVLLNGENMILSLKSPADYSRERNFLGGTVGRIAGRVREGQWKHGQEIHQLPLNDGENHIHGGIGTDMQVWNFRPSCDGKTAKVDLTLFDPDGHNGYPGNLKLHACYELDNENTLRYSLEAVSDKLTIFNPVNHTYFNLGDRAENLDLQMNADYYLPVDKTGLPDRGMEEVEGTAFDFRQKKRVGDALHSNDAQIKLRNGLDHPFILNGNLPNAVLSSNKHKLTVTTNAPALVLYAGNHFDHTGIPDNIGQYDGLTFEAQCPPAEGNDLGQITLLPNEKFRRTVNWKFE >NC_017470|1425377:1483434|1467605_1468166_+|WP_014566049.1|DBSCAN-SWA MGHRNIAYIGGKSAVVNLDGKIVLRKDDLREDGYIAWMKMHNLDQYCHTFTANWSADEALEATNQLLQLKDRPTAIVVASDPMALGVYKALNDANVNIPNDISVASFDDVEINRFLTPTLSSIDMNNEGNDKTNSSRPLGTKTIVYSTSKSRSYGVNDTTWKHDDPSQSNSYKQTILQVLWQKIHC >NC_017470|1425377:1483434|1463814_1464807_+|WP_013438365.1|DBSCAN-SWA MKVLVIGGAGYIGSHAVRELVKEGNDVVVLDALYTGHRKAVDPKAKFYQGDIEDTFLVSKILRDEKIDAVMHFAAYSLVPESVKKPLKYYDNNVAGMISLLKAMNDAGTKYLVFSSSAATYGIPKKLPITEDTPLNPINPYGETKMMMEKIMAWADKADGIKYTALRYFNVAGASSDGTIGEDHAPETHLIPNILKSAISGDGKFTIFGDDYNTKDGTNVRDYVQVEDLIDAHILALKHMMKINKSDVFNLGTAHGYSNLEILESAKKVTGIDIPYTMGPRRGGDPDSLVADSTKARTILGWKPKHENVDDVIATAWKWHKSHPKGYEDK >NC_017470|1425377:1483434|1446428_1447682_-|WP_014566040.1|DBSCAN-SWA MKGWLKKAALIGAAIVTAVSLSACGKQDGNSGKKVTIEYFNQKKEMSSTLKSIIKDFEKKNPDIHVKEVDVPNAGTVLKTRILSGDVPDVINIYPQNIDFQEWAKAGYFEDMTHAPYIKNIKNNYADSFKVNGKIYNAPLSANVYGFFYNATEFKKLGIKPPKTWYQFKQAVKKIKASGKAPFAVAGTEPWTLNGYHQLSLATVTGGAKQANKLLRFSAPNGIKVNNPYIQKDFTRLNLLRENAQNNWRGASYNDAVVSFANGQSLIMPNGSWALPMINQQKPKFEVRTFAFPAAKAGHEMTVGSGDLALSISSKSKHKKAAEKFVAYMTTPAAMQKYYDVDGSPVAVKGVKQKGFDSQLGGLSSLAFTKHDMVWLAQDWTSENDFFNLTASYLMTGNEQQMVNDMNTFFNPMKASN >NC_017470|1425377:1483434|1462761_1463712_+|WP_014566047.1|DBSCAN-SWA MAYTNNLHVVYGEASLGVNGQDFAYLFSYERGGLESLKIKDKEWLYRTPTPTFWRATTDNDRGSGFNQKAAQWLGADMFTKCVGIHVQVDDHQFDELPVAPINNQFSNQEFAHEVKVAFDYETLTTPATKVKIIYNINDFGHMTITVHYFGKKGLPPLPVIGMRFIMPTKAKSFDYTGLSGETYPDRMAGAERGTFHIDGLPVTKYLVPQENGMHMQTNELVITRNSTQNNADKDGDFSLKITQTKQPFNFSLLPYTAEELENATHIEELPLARRSVLVIAGAVRGVGGIDSWGSDVEEQYHIDPEQDHEFSFTLN >NC_017470|1425377:1483434|1427893_1428649_-|WP_014566031.1|DBSCAN-SWA MKTKKLFTSLAAAVMLSAGLAGTGMAMGQPVHAAGTTQSSNTKTSNVSVKHRTVSAIVNSDNPKLVVVAKDQANPDQIDSNYTKGQTINVLWSTEAKQGDKTVTLYYIENRTINGKDSFVFIPSTDVTTSGNVPTESAFVEQANNDVKAIQDAYKRSLQYITVTPKSKKGAKIYYAYKKSKKSKKITFAASKKTIKYGKKYKSSMIVKNGKTRYVYIGKKRYVKQSALKIVSAKYAPLNLPDDLKNLIVEN >NC_017470|1425377:1483434|1437713_1438988_-|WP_014566036.1|transposase|DBSCAN-SWA MSSLNDYIRFALDIEDHNIVFKDYFYKILNGTKYKIYEAELIQPACPFCGSVSLIHNGHLKIHVRYITANASVPVLIRLFKQRIKCRDCSKRSMAQSSLINKYCCISNTSKLKILSALTENRSMTSIAREHNTSVNTVQRILASCSHRFIDGYDYLPEHLAFDEFKGVDRKLHFICLNGETHEIVQILRNRFKKNLLKYFGKFTLKARANVKTVTMDLNFYYQDIIRACFPNAQIIIDRFHMVQMLTRSLNSLRVAVMKKFKKGTREYSLLKSPWKLYLKKFDDLEKIHPRYNWHYKDSLTQEQIVMEGIECDDTLTNSYNLLQAFFTALDKHDTKAMKDIIYSKEKVGPLMHRTLLTFRHNLKAVLNGASLPYSNGCLEGFNRKIKQIERTAYGYSNFINLLTRIRLEENKVKEKGPSNLLIA >NC_017470|1425377:1483434|1447931_1448765_+|WP_013642256.1|DBSCAN-SWA MKGEYKTLNDISLESNVLFFGKESCLPNYYFTGNNVRKNYVIHYILKGKGVFSSANHEAVQLKAGDIFILPKGVPCFYQADGKEPWTYFWIGLSGLKIATMLSGSILSSKHYLRQVEDSNFCKSLNKLFEAVHNPNVLTNDLLTESLIYQTFYYLDTEYPVKKKKQHIANSEQLKIASKYLHDNYDDHSCSIASLCNKLDVSRSYLYNLFKNGFNTSPQKFLIKIRMEEAKNRLKDSPSSIRQIAEAVGYIDEFTFSKAFKKYSGFSPKIFRQMNTK >NC_017470|1425377:1483434|1479815_1481063_-|WP_013642276.1|DBSCAN-SWA MNNLPEKLQKIVKEVDQQIAPKLAEIEDQIVFNQAKVLKAFQDNAVAEADLSGSTGYGDDDMGRDKLDRIYAQVFDTEDALVRPQFVSGTHTLFVALNGNLKSGDTLTYLTGQPYDTLQKVIGLTVDKRGTLAEKGVKFSYVPLNDENTVDYDQAEKILKRDQPKIVAIQRSRGYSTRKTFTIDQIKEMIAFVKKVSPKSIVFIDNCYGEFSEKHEATEYGADLMAGSLIKNAGGGLAKTGGYIVGKKDLVENAKLALTAPGCTDEGATIGNLHDFYEGFFLSPNVTGMAEKGMIFAAALFDKMGMNVTPAWDEKRSDIIETIIFNDPDKMVKFVQEVQKNSPIDSFVTPEAVHMEGYEDKIIMAAGNFVSGSTIEFSADGPIRPPYAVYMQGGLTYAHDKVAIINAVRDTFFNK >NC_017470|1425377:1483434|1451280_1452444_-|WP_013642259.1|DBSCAN-SWA MNKEELLTEYEKTFGEKGQDVFFSPGRINVIGEHTDYNGGHVFPAAISLGVYGVYGPREDKKVRLFSGNVDGDIVEFDIDDTTVEKDDRFWANYFKGMITYLREKYDNINHGFNLYIKANLPSGSGLSSSAAIEMLMGIILKDEFNLDVDRIVLAKMGQRTENEFVGLNSGIMDQFACIMGKKDSAIFLDCNTLKYEYMPLALGEYEIIIMATNKPHTLADSAYNDRVRECHDAVKKLQQKLDIKALGELDNDTFDEYSYLIDNETELKRARHAVSENQRTLRATKAMKDGDLEKLGRLIDASHVSLHYDYEVTGKELDTLAEASWKQPGVLGARMIGGGFGGSAIAIVKKSEAENFKKNVGDIYRDKIGYDASFYDAEIVDGTKRI >NC_017470|1425377:1483434|1457553_1459473_-|WP_014566044.1|DBSCAN-SWA MTNGHKNSGKQIISYASFCLGNLGHAAFYGVMSTYFIIFITSGMFSGLEQSVADKLIGLITGLMVVIRIVELVIDPILGNIVDNTKTRWGKFKPWIFLGTVVSAVLLLILFTGIFGLAQSNWILFAILFVIIYIGFDIFYSLSDVSYWGMVPALSEDSHERGIYTSLGAFSGAIGWNGLAIIVVPLVTGVTYAVTGKHEEGAPGWLAFAAVISALAIVCAIIVCLGTKEKHNLIRNSAKQKTTLPQVFSAIFHNDQILWPSLAYLVFSLANTITNGVLFYLYKFVIGKPGEFWVVGLAATIVGFCVSPLFPVLNKYIPRKWLFVAGQTCMICAYLLFIFCRDNVVLMDLGLVLFNINFAQLVTVLTLTDAIEYGQLKSGQRNEAVVLAVRPMIDKFTGAVSNALVGYVAIAAGMTGAATAADMTAHDISTFNTMALYIPLILAVISIIVFLSKVTLTEKKHAEVVEELKTKLAEGEIEKDDSTKARPRQQKIYAPADGELMQMSSVVDEDGKPFPGKGFAIKPSAGKIYAPFDGKIRFTFGTKHAFEIVSDNGLQVVVHVGLGTVNLRGEGFETYYDDGQEVKKGNLLLEFDRDLALKNGYKDTIITFYTQPGRIKKASPIKAGSTVKHGDEVVEVQFK >NC_017470|1425377:1483434|1470001_1473355_-|WP_014566052.1|DBSCAN-SWA MANTKIEPYRPIIPMIYAYTTPEIKRHQGWTKIGYTDKQTVKDRINEQVHTADVEIKLLWQDNAIYKDGSGKSFTDHDFHRYLTDKAHIERKPKTEWFHTDGTQSHELFNKFASRDYGDVQTKDDDISTYTLRKEQNEAVEMTENYFKQHGKGAEFLWNAKPRFGKTLSTYDLVRRMNLQNVLVVTNRPSIANSWFDDFKKFIGWQTDYKFVTDNDSLKKKDVLSRQEFIKLLDHGDYKQIAFESLQGLKGSVYFGGDYNKLEWIKDLNWDLLVVDEAHEGVETYKTDKAFDKINRDYTLYLTGTPFKALAEGKFARDQIFNWSYADEQTAKENWDDNNGSNPYAVMPRLNMFTYQMSKIMQEKAEQKVDLSDDEQVDPAFDLNEFFRVNASGKFVYDEDVDRFLYALTHNEKYPFSTPKLRGELAHTFWLLNRVDSAKALAKKLKKDPIFKDYEIIVAAGDGKLDDDQLSDDQLSQANEKSYDRVIKAIKDHPRTITLSVGQLTTGVTVKPWTGVLMLSNMKSPAEYMQSAFRAQNPYKVRRGKDIYQKENAYVFDFDPARTLTIFDDFANNLNAKTAGGRGTAEEHAENIRKLLNFFPVIGEDKDGKMVELDAKQIMSIPRQLKSHEVVKHGFMSNFLFANISRIFSAPAEVKDILDNLVKAKEEKNKKQDNSIDDIDDVPTDDEGNVDVPDEIVIGKTQGVFGDKVYQDSNDEMNQAINNSFDNTDQDETQKVIDQVKTSLTDTLQTHVWSKVKDNYGLTKKQTERFQEQVNKESEKKLDQIADQYHDQQKVAETELKEQQKFAQSDDDLKQAQDEYESKMSDIFDDFKQKVTDHVKQTIQDTPKEAIRQVEQHKAEEEKHSVEDDARAHLRGFARTIPSFIMAYGDRNLKLQNFDDYTEDDVFKEVTGITEDQFRFLRDGGDYIDQETGKKKHFEGHLFDEVVFNDSIQQFLDLKYKLRNYFDENNKEDIFDYIPPQKTNQIFTPKKVVKHMVDDLEKENPNVFDDPNATFADLYMKSGLYITEIVKRLYTNPVMKQFYSNDHERLIHILNHQVFGLAPTRIIYLIATNYILGFDDSIKKDITTDHFKQADSAEASKNGTLQELVDKEFGTDD >NC_017470|1425377:1483434|1482053_1482455_-|WP_014566059.1|DBSCAN-SWA MSSFLIVLDTVLLVILLVFGGVWAWNKIQAKSLGGELTNEEFKEGMRKAQIIDLREKEPFKRKHIDGARSLPYTMLKYQYNELRSDLPVYLYSDSNTITLRATRFLKKKGFVSVHWLKGGFETWDGRTKKSKY >NC_017470|1425377:1483434|1466898_1467363_+|WP_014566048.1|transposase|DBSCAN-SWA MNNKRDKIKDAGYEKHYVYNAHYHLIWCTKYRNQIFTNQDLAQEMRDLLKQIAEDNQITIEKMEVMPEHIHLLISFRPSKSGSSVVKALKGRSAFLFFKNHPEIKQNKMWGGHLWSPSYYFGSVGNMSKEVVEKYINDQIYNAVRDGKPYPSPH |
50 | Enterococcus_phage(18.18%) | transposase,protease,tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1527181 : 1535949
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_017470|1527181:1535949|DBSCAN-SWA TTTAGTTTTCTTCCTTTTCCATTATTTGACTTAATACTTTTCTTAACGTAGCTGGGAATAATTTGTGTTCTGTCTCATGGACCCTTGCTTCGAGCGTATCAACCGTATCATCTGGATAAATTGGTACTGCCTGTTGAGCAATGATTGGTCCGTGATCTAAATGGGCATCAATAAAGTGGACTGTGACGCCAGTTTCTTTTATCTTGCCTTTTTTATAATCATCAAAAGCTCTTTCGATTGAATTAAGGCCCGGATATTTAGGCAGTAGTGCCGGATGCAAGTTGATGATTGAATTTGGATATTCGTTTAAAATCGTGGGACCAACCACTCTGAGATACCCTGACAAAACAATGAAATCAATTTGATAATCTTGCAAAACTTTTAGCAGACGCTTTTCATAGGCATCTTTGCCACCGCATTCTTTAACGGAAAAAGTTTCGTATGGCACGCCTAATCTTTCAGCTCTCTTAATTACCGGAGCATTCGGATGATTGCAAAACATTAAAGCTTCAATTCCAGGAATCTCGCCAGCTTGAAACTGTTTAGTTAAAGCTTCAAAGTTGGTTCCATTCCCAGAAGCTAAAATAGCAACTCTCATTTTTCTCCTCTATTTGATTACGATCTTTTCTTCCCCTTCGGGGCGTGCAACTAATTGACCAATTTCGTAGAATTTTTCATTCTCGCGATTTAGTTGTTCTTTGACTGCAGCAATGTTTTCTACTGGAACGGCTAAAACTAAACCAACGCCCATATTGAAAGTGTTGTAGCAATCTCGTTCATCTAAATCGCCTAGTTTTTTCAGATAATCAAAAATTGGTAAAACAGGCCACGAACCTTTATTAACCACTGCTTGCAAGTCATTGCCATAAATTCGAGGTAAGTTTTCGATAAAGCCGCCACCAGTAATATGGGCAATGCCGTGAACTAAATGCTTTTTTACTAGTGGCAAAACTGCTTTGACATAGATTCTGGTTGGCGTAAGGAGTGTTTCGCCTACGGCTTTGCCCGCTAATTCTGCTGGTTTGTCGCTTAATGAAACGTCGTGATCTTTGAACAAAATTTTTCTAATTAAACTGAAGCCGTTTGAATGAACGCCACTTGATGGCAAGGCAATTAAATGATCGCCTGCTTTGGCTAAAGAACTAGATAATATGTCCTCTTTTTCAGCGATCCCTGTTGAAAAACCAGCTAAATCGTATTCATGCGGTTCATACATGTCAGGCATTTCGGCAGTTTCACCACCAATTAAGGCAGATTCACTTTGCTTGCATCCCTCAGCCACGCCTTTAACCACATCAGCTAGTTTTTCAGGATCGTTATGTCCGCATGCAATGTAATCTAAGAAAAATAAAGGTTCAGCGCCTTGGGCTAAGACATCGTTAACACACATAGCCACGCAGTCGATACCGACTGTGTCGTTTTTATGCATCTTTTGCGCAATCATCAACTTAGTACCAACGCCATCGGTGCCTGAAACGAGGACTGGATGCTTATAATCTAATTCTTCTAAGTCGAACATCCCGCCAAAACTGCCGATGCCGCCCATCACGCCTTTACGCTTGGTTGCAGCTACCATCGGCTTAATTCGATTGACTAAATCGTAACCAGCAGTGACGTCAACACCTGCTTCTTTATAACGATTCATTTTATAAATCCTTTCTTTGTCGCCGTTCTTGAGCATTGAGAGAAGCTAAATAGCCAGCTTCGTAGTCATCCAATTTTGTTGGATATTTGCCATTAAAGTAAGCAACTGTCAGACCCGATGAGTCTCCTCGATCAGGCACATCGATTGCTTTGATCAGGCTATCAACGCTTAAAAAGCCAAGGCTATCAGCACCGATAATTTTCCGCATCTCTTCTACTGAATAATGAGCTGCCATTAATTCGGCTCTAGTTGAAATATCAATGCCGTAAAAACATGGAAATCTAAATGGAGGACTAGCGATTCTAAGGTGTACTTCTTTAGCACCAGCTTCTTTTAACATTTTGACAATTTGCTTAGAGGTGGTGCCTCTTACGATCGAATCATCAATAACAGCGATCTTTTTACCGGCAACGACACCACGAACAGCTGACAGTTTTAACTTAACACTCTTTTCACGCAAGGCTTGAGTTGGCTGAATGAAGGTTCTGGCAACGTATTGGTTTTTAACCAGACCCATTTCATAAGGCAACCCCAATTCTTCGGCGTAACCAGAGGCAGCTGACAAAGATGAGTTTGGTACGCCAATTACCATGTCGACATCGGCTGGAGCCTCACGAGCTAGCAGCCGCCCCATTCTTTTTCTAGCATTATGAACAGTCACACCATGTATGATTGAATCAGGACGAGCAAAATAGATGTATTCCATTGAGCAGATTGCTAAATGCGTATTTTTAGTAAAATGATCAATTTTCATTCCGTCTCGATCGATGATGATTAATTCACCAGGCTGCACATCACGAACGAACTTGGCACCGATGATGTCAAGTGAACAAGTTTCGCTAGAAACAACGTAGGCGCCATTGTCCAATTTACCAATACATAAGGGACGAATCCCATTAGGATCAAGTGCCGCAATCATGCGATCTTTTTGCAAAAGCAGGAAGGCAAAACCACCGTGAACTTCATTCAGGCTTTGCTTTAAAGCTGAAATAAAGCCATCTTTGATATGATTACGAATCAAGTGAATCAAGATTTCGGTATCAGATGAGGATTGAAAAATCGCACCTTGTTTTTCGAGCTTGTTTCGTAAAGACACGGCATTAACTAAATTGCCGTTGTGGGCTAAAGCAACATCACCGTCTAGAAAATGAAAGAGAAATGGTTGTACGTTTTGAATCGAATTACGACCCGTTGTACTGTAGCGAACGTGACCAATGGCACTGTCGCCGACCAATTTTTTTAAATCATTCGGATCAGCAAAAGCATCGGATAAAAGGCCACGATCGCGGTGTTGATACAAATGCTCACCATCGCTTGAGACGATGCCGGCCCCTTCTTGTCCGCGGTGCTGCAGGTTATGCAAACCTAGGTAAGTTAATTGACTGGCATCAGGTGCACCAAAGACGCCGAAGACGCCACATTCCTCATTTAGGCTTTTGATTTCATTAAACAAGGGATGCACTCCTTCCAAATCTTTTGCAATTCAGCTACATCTTCATTTACTTGATCGTTGGCCAGCGAAATGTTCAATTGCTGATCAGCAGTAACTTGACCAATTTCACTAACGGAATCGCCCATTTCTTGTTCAAATTTTGCCGCATTTGCAGGATCAACTGAAACGATCAAGCGACCTGGAGTCTCACTGAAAAGTAGATTTTTATCAAAATCAAGCTTTACCTTTGCACCAAAGTCTGTATCAAAAAGTGTTTCAGCTAAGGAAACGCCTAAGCCGCCTTCGCTTAGATCGTGGGCACTTTTGACTAAGCCGTTTGCCATTAATGCTTGCAATTTGTAAAGATATTGCTTTATCTCTGGCAAGTTTGGCGCATGTGGCAATCCGCTAATTTCACCAGTGATCATCTTTTGCAATTCTGAGCCGGCAAAATCATCGTCTGTTTTACCAACGAGATAAATCTTGTCGCCTGCTTTTTGCATGTGCATTGGAATGACGTGATCATAATCCTTGATTAAACCAACCATCCCAATCATTGGTGATGGATAAATTGCCTTGCCATTATTTTCGTTATAAAGAGAGACATTTCCTGAAACAACCGGTGTTTCTAAGATTTCACATGCATCGGCAATCCCTTGACAAGATTGATGCAATTCCCAGAAAATTTCTGGATCATTAGGGTCACCATAGTTAAGGCAGTCAGTAATTGCTAAAGGTTGGGCACCACTAGCCACGATATTTGTTGCACTTTCAAGGACAGTTCTTTGACCACCAACTTTTGGATCAAGATAGACAAAGCGTCCATTAGTATCAGTAGTCATCGCAATTGCCTTCTTAGTATGGCGCACTCGCAAGACGCCACTATCTGAACCAGGGCCTACAATCGTGTCCGTTCGAACCTGTGAATCATATTGTTGCGTTACAAATTGATCGTTAGCAATGGTTGGCTGGTTCAAAAGATCTTTTAAAGTTTGACCGGCACTTTCAATATTTGGTTGCCAATTTTCACTTTGCTCAGCATCAATGATTCTTTGTGGTTTCTTTTCTGCGCTTTTTTCTTCAAGGACTTTTTCAGTTAAAGTTACCACTGGAATATCACATACCACTTGATCATCGTGATGAAGCACGTATCTACCATCATCAGTAATGCGACCAATAGTTACAGCGTCAAGATTAAATTCATCGAAGATCTTTTTAACGTCCTCTTCATGGCCCTTCTTTACACAAAGGAGCATTCTTTCTTGCGATTCACTAAGCATAATCTCGTAAGCAGACATGTTAGGTTCACGTTGAGGTACCAAGTTAAGGTTAAGATCCATCCCGGATTTGCCTTCAGTAGCCATTTCAGCACTTGAAGAAACAATTCCGGCTGCACCCATGTCTTGGATACCAACAAGCCATTCACGGTGATGCAAAATCAATTCCAAGCAAGCTTCCATCAAAAGCTTTTCCATAAATGGATCACCGACTTGAACGGCTGAACGTTGGGTCGCATGTTCTTCAGAGAAGTCAGCTGAAGCAAAAGTAGCACCGTGGATACCGTCACGACCGGTTTTAGCACCAACGTACATTACGGCATTGCCAACACCGGTGGCATCCCCGTGTTCCATATCCTTGATATCCATAATGCCGACATTCATCGCATTTAATAAAATGTTGCCGTTGTAACAAGGATCAAATGTAGTTTCGCCACCTAAGTTAGGAATTCCCATGCAATTGCCGTAGTCGCCGACGCCTTTGATGGTTTCTTCCATCTTGTAGCGCATTGTTGGATTATCTTTTAATTCACCAAAGTGCAAAGAATCAAGGATTGCAACGGGGCGAGCGCCCATACTGAAAACGTCTCTTAAAATACCGCCAACACCAGTGGCTGCACCTTGGTAAGGTTCAACTGTAGTTGGGTGGTTGTGACTTTCAGCCTTAAATACAACTGCTTGACCATCATCAATATCAACTACACCAGCACCTTCACCTGGACCTTGAACAACGCGCTTGCCCTTGGTTGGGAAAAGCTTCAAAACTGGCTTAGATTTTTTGTAAGAACAGTGTTCACTCCACATTGCGGAGAACAAGCCGATTTCTGTGTAGTTGGGCAAGCGATGAAGCAAATGATCGCAAATATAGTCATATTCACGTTCTGATAAACTCCAATCAAGGTAAGGTTTCTTTTCTTTGATTTCTTCTGGTGTCATCGCTTGTTTCATGCTTGAACTCCTGCTTTCAAAAGTGACTTAAATAAAGGCAATCCATCTGTTCCGCCGAGAATTTCTTCAACGGCTCTTTCAGGGTGAGGCATCATGCCCAAGACATTGCCTTCTTTGTTACAGATACCAGCAATATCATGAAGACTACCGTTAGGATTTTCACCATGATATCTAAAGACAACTTGATGATTATTTTCTAATTCTTCAAGCACATCTTCGTCTGCGTAATAGCTGCCTTCACCGTGAGCAATTGGAATACGGATAAGTTCCTTGTCTTTATATTCGGTGGTAAATGGCGTATGTGTATTTTCAACTTCTAGCGTTACGGTTTTGCACACGAATTGAAGACTATCGTTTTTCTTAAGTGCTCCAGGTAGAAGTCCCATTTCTGTTAAAATTTGGAAACCATTGCAGACACCTAATACTAACTTTCCTTCATCTGCCATCTTTTTAACTGCTGGGATGATATTTGAAAAACGAGCGATTGCACCAGTTCTTAAGTAGTCGCCGTAAGAAAAGCCACCTGGTAAAACAACTGCGTCAAAGCCATCTAGGCTTTTTTCTTTGTAAGAAACGTATTCGACATCTGCTTTGCAAACAGTGCGAAAAGCTTCGTACATGTCGATATCGCAGTTAGAACCTGGAAAAACGATTACTGCAATTTTCATGCGTTTTCCTCCAAAACTTTGATCTTGTAAGTTTCCATGTTGAAGTTAACCAATAACGCTTCAGCGATATCCTTCACATCAGCCTTAACTTTATCTAAGTCATCTTCATCAACAGTTACATCGAAGAACTTACCAACAACGATTTTTTTGACTTCGTCATGACCAAGAGAATTAACTGAATCGGTGATTGCAGCTCCTTGTGGATCAAAAACTGAAGGTTTGTAGGTTACGTAAATACGGACAGTAGTCATTTAGATCTCCTTAAGTGCTTCTTGAATGCGGGCTAAGTCTTGTTCATAAACAGTGGTTAAATCGCCGATGTCTCTACGGTAGACATCTTTATCCATATGCTTCTTGGTCTTTTTATCCCAAAGACGGCAGTTATCTGGTGAAAATTCATCTGCAAGGATTATATTTCCATCTGCATCTTTACCAAATTCAAGTTTGAAATCGACTAATTCCATTCCAGCCTTATCAAAAAGCGGAATCAAAAGCTTGTTAACTTGACGAGACAATTCCCAGATCTTCTTTTGTTCTTCAGCGGTCGCAATATGTAATGCAATCGCATCAGATTCGTTCATGATTGGGTCGTCGAGTTCATCACTCTTATAAAAAAGTTCTTCGACTGGCGTATCGAATTTTTCACCTTCGCCCAAGCCATAACGACTTGAGAAATGGCCAGCTGCAATATTTCTAGTAACAACTTCAAGTGGGAACATGTCGCACTTTTTAACTAATTCTTCAGTGTCTGAGATTTTCTTGATGAAATGAGTTGGAACTCCGTTTTTGGACAAGTATTCAAAAATCAAAGTTGAAATTTCGTTGTTTAAATAGGCTTTGTTCTTAAAGTCATCCTTCTTTTCGCCGTTACCGGCAGTTGCCTGGTCTGTATAAACCACACGTAAAACTTCTGGGTCATCTGTAGACCACATTTCCTTAACTTTACCGGTATACAACAACTTTGCCATTTTATTTTTCTCCTTGGTAAAAACACGAATATTATTGAAGAGTGAATACTGATTTGTTCACTTTTATGTTAACAATATTATCAACTATTGCTTATAATGTCAATATATTATAAAGCACTAAAAATACTATCATGAAGTATTGTTCGTGTTTTAGTGAAAATGATAAATAAAAAGCAGAGCTTTAATAGCTCTGCTTAAACAAGATTAATTTACTTTTGTTTTCCAAATATGGGTGTCATTAATTTCTTGCAACGTTTTTTCAATATCGTCAGTAAGAATTGTAACGTGACCCATTTTACGATTATGACGGATTTCGGCCTTGCCATAGTAGTGAAAATGCCAGTTGCTCTTTTCAGGAATGACTTTTCTTACGCCGGCTACATGTTGACCCAAAACATTCACCATTACCACTTTGGACAACAGCTTAATCTTAGGCAACGGCCAATTGCAAATAGCACGATCATGGATATCAAACTGGTCAAAATTGCAGGCTTCAATCGAATAGTGACCTGAATTATGCGGACGTGGTGCTAGTTCATTGACATAAATATCACCGTTATCCAGAATAAATAGCTCCACACCCAAAATACCGCGCAAATGAATTGCTTCTGCAATTTGCTTAGCAATAGCCTGTGCTTTAGCATAAATTTCTTTTGAAACACGTGCTGGCACAATACTAATATGCAAAATTTCATCGGCATTGTAGTTTTCACTAACTGGGAAAACTGTCACTTCGCCTTTTGCATTTCTTGCGACCATGACAGAAGCTTCAACTTTGAATGGCACCCAGCCTTCAAGAATGCAGTCTCCCGTAGCTAAAATTTCTTTGATATGCGCATTATTTTTCAGATCGGCATCGCTCTTTAAAACTTCTTGGCCGTGGCCATCGTAGCCACCTTCACAGGTTTTAAGGACACAGTTATAGCCGATCTTTTTAATTGCCTCTTTTAGATCCGCCATATTTTTAACGGGCAAATATGGCGCAGTCATACAACTGGCACTACGCAGGAAGTTCTTTTCACGCAGTCGGTGTTTAGTGATATAAAGCAGATTAGTCCCCTGAGGAATTTTCACTTTATCAGCAACATCCTTTAAAGCTTGAAGGTCGACATTTTCAAATTCATACGTTAATACGTCGCTCTCTTTAGCCAGCTCTTCAATTGCCTTAACGTCTGAATATTCAGCCACAATCTGCTTGTCAGCCACCTGACCACAAGGACAGTCTGGCGTTGGATCCAAGGTGATGACTTTCATTCCACCATACTTGGCTGACAAAGCCATCATTTGACCTAGTTGTCCGCCTCCAATGATGCCAATAGTCTTTCCTTGTTCAATAAAATCTGGATCAATCAAGTTCCGCACTGCTTTCTTTTGCTTCATCATGCATCTTTTGGCGATAGTCTTTCAATTGCTGTCTTATTTTTTCGTCACTAATTCCCAAAATTTCTAGTGCAAGAAGTGCCGCGTTGCTTGCACCAGCATTACCAATGGCAGTGGTAGCTACAGGGATACCGGTTGGCATTTGAACAATCGATAAAAGAGAATCCATGCCACCTAATGCCTTAGTTTGACCGGGAACACCAATCACTGGAATCACGGTATTTGCGGCAGTCATGCCAGGCAAATGAGCTGCCATACCGGCACCCGCAATAATTACTTTAGTGCCATTTTTCTCAGCATTTTGGGCAAAATCATACATTTCTTTTGGCATTCTGTGTGCAGAAATAACGTGCTTATCATAACCAACACCAAATTGATCTAAGATTTCGCAAGCATGCTTCATCGTTGACCAATCTGACTTACTACCCATAATTACGCTTACTTCAGCCAT
Protein sequences of DBSCAN-SWA_4 >NC_017470|1527181:1535949|1535463_1535949_-|WP_014566083.1|DBSCAN-SWA MAEVSVIMGSKSDWSTMKHACEILDQFGVGYDKHVISAHRMPKEMYDFAQNAEKNGTKVIIAGAGMAAHLPGMTAANTVIPVIGVPGQTKALGGMDSLLSIVQMPTGIPVATTAIGNAGASNAALLALEILGISDEKIRQQLKDYRQKMHDEAKESSAELD >NC_017470|1527181:1535949|1527181_1527778_-|WP_014566079.1|DBSCAN-SWA MRVAILASGNGTNFEALTKQFQAGEIPGIEALMFCNHPNAPVIKRAERLGVPYETFSVKECGGKDAYEKRLLKVLQDYQIDFIVLSGYLRVVGPTILNEYPNSIINLHPALLPKYPGLNSIERAFDDYKKGKIKETGVTVHFIDAHLDHGPIIAQQAVPIYPDDTVDTLEARVHETEHKLFPATLRKVLSQIMEKEEN >NC_017470|1527181:1535949|1527787_1528825_-|WP_014566080.1|DBSCAN-SWA MNRYKEAGVDVTAGYDLVNRIKPMVAATKRKGVMGGIGSFGGMFDLEELDYKHPVLVSGTDGVGTKLMIAQKMHKNDTVGIDCVAMCVNDVLAQGAEPLFFLDYIACGHNDPEKLADVVKGVAEGCKQSESALIGGETAEMPDMYEPHEYDLAGFSTGIAEKEDILSSSLAKAGDHLIALPSSGVHSNGFSLIRKILFKDHDVSLSDKPAELAGKAVGETLLTPTRIYVKAVLPLVKKHLVHGIAHITGGGFIENLPRIYGNDLQAVVNKGSWPVLPIFDYLKKLGDLDERDCYNTFNMGVGLVLAVPVENIAAVKEQLNRENEKFYEIGQLVARPEGEEKIVIK >NC_017470|1527181:1535949|1533146_1533401_-|WP_013438446.1|DBSCAN-SWA MTTVRIYVTYKPSVFDPQGAAITDSVNSLGHDEVKKIVVGKFFDVTVDEDDLDKVKADVKDIAEALLVNFNMETYKIKVLEENA >NC_017470|1527181:1535949|1534322_1535498_-|WP_014566082.1|DBSCAN-SWA MKQKKAVRNLIDPDFIEQGKTIGIIGGGQLGQMMALSAKYGGMKVITLDPTPDCPCGQVADKQIVAEYSDVKAIEELAKESDVLTYEFENVDLQALKDVADKVKIPQGTNLLYITKHRLREKNFLRSASCMTAPYLPVKNMADLKEAIKKIGYNCVLKTCEGGYDGHGQEVLKSDADLKNNAHIKEILATGDCILEGWVPFKVEASVMVARNAKGEVTVFPVSENYNADEILHISIVPARVSKEIYAKAQAIAKQIAEAIHLRGILGVELFILDNGDIYVNELAPRPHNSGHYSIEACNFDQFDIHDRAICNWPLPKIKLLSKVVMVNVLGQHVAGVRKVIPEKSNWHFHYYGKAEIRHNRKMGHVTILTDDIEKTLQEINDTHIWKTKVN >NC_017470|1527181:1535949|1533401_1534118_-|WP_013642318.1|DBSCAN-SWA MAKLLYTGKVKEMWSTDDPEVLRVVYTDQATAGNGEKKDDFKNKAYLNNEISTLIFEYLSKNGVPTHFIKKISDTEELVKKCDMFPLEVVTRNIAAGHFSSRYGLGEGEKFDTPVEELFYKSDELDDPIMNESDAIALHIATAEEQKKIWELSRQVNKLLIPLFDKAGMELVDFKLEFGKDADGNIILADEFSPDNCRLWDKKTKKHMDKDVYRRDIGDLTTVYEQDLARIQEALKEI >NC_017470|1527181:1535949|1528826_1530278_-|WP_013438443.1|DBSCAN-SWA MFNEIKSLNEECGVFGVFGAPDASQLTYLGLHNLQHRGQEGAGIVSSDGEHLYQHRDRGLLSDAFADPNDLKKLVGDSAIGHVRYSTTGRNSIQNVQPFLFHFLDGDVALAHNGNLVNAVSLRNKLEKQGAIFQSSSDTEILIHLIRNHIKDGFISALKQSLNEVHGGFAFLLLQKDRMIAALDPNGIRPLCIGKLDNGAYVVSSETCSLDIIGAKFVRDVQPGELIIIDRDGMKIDHFTKNTHLAICSMEYIYFARPDSIIHGVTVHNARKRMGRLLAREAPADVDMVIGVPNSSLSAASGYAEELGLPYEMGLVKNQYVARTFIQPTQALREKSVKLKLSAVRGVVAGKKIAVIDDSIVRGTTSKQIVKMLKEAGAKEVHLRIASPPFRFPCFYGIDISTRAELMAAHYSVEEMRKIIGADSLGFLSVDSLIKAIDVPDRGDSSGLTVAYFNGKYPTKLDDYEAGYLASLNAQERRQRKDL >NC_017470|1527181:1535949|1530253_1532482_-|WP_013438444.1|DBSCAN-SWA MKQAMTPEEIKEKKPYLDWSLSEREYDYICDHLLHRLPNYTEIGLFSAMWSEHCSYKKSKPVLKLFPTKGKRVVQGPGEGAGVVDIDDGQAVVFKAESHNHPTTVEPYQGAATGVGGILRDVFSMGARPVAILDSLHFGELKDNPTMRYKMEETIKGVGDYGNCMGIPNLGGETTFDPCYNGNILLNAMNVGIMDIKDMEHGDATGVGNAVMYVGAKTGRDGIHGATFASADFSEEHATQRSAVQVGDPFMEKLLMEACLELILHHREWLVGIQDMGAAGIVSSSAEMATEGKSGMDLNLNLVPQREPNMSAYEIMLSESQERMLLCVKKGHEEDVKKIFDEFNLDAVTIGRITDDGRYVLHHDDQVVCDIPVVTLTEKVLEEKSAEKKPQRIIDAEQSENWQPNIESAGQTLKDLLNQPTIANDQFVTQQYDSQVRTDTIVGPGSDSGVLRVRHTKKAIAMTTDTNGRFVYLDPKVGGQRTVLESATNIVASGAQPLAITDCLNYGDPNDPEIFWELHQSCQGIADACEILETPVVSGNVSLYNENNGKAIYPSPMIGMVGLIKDYDHVIPMHMQKAGDKIYLVGKTDDDFAGSELQKMITGEISGLPHAPNLPEIKQYLYKLQALMANGLVKSAHDLSEGGLGVSLAETLFDTDFGAKVKLDFDKNLLFSETPGRLIVSVDPANAAKFEQEMGDSVSEIGQVTADQQLNISLANDQVNEDVAELQKIWKECIPCLMKSKA >NC_017470|1527181:1535949|1532478_1533150_-|WP_014566081.1|DBSCAN-SWA MKIAVIVFPGSNCDIDMYEAFRTVCKADVEYVSYKEKSLDGFDAVVLPGGFSYGDYLRTGAIARFSNIIPAVKKMADEGKLVLGVCNGFQILTEMGLLPGALKKNDSLQFVCKTVTLEVENTHTPFTTEYKDKELIRIPIAHGEGSYYADEDVLEELENNHQVVFRYHGENPNGSLHDIAGICNKEGNVLGMMPHPERAVEEILGGTDGLPLFKSLLKAGVQA |
9 | Prochlorococcus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1699710 : 1707102
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_017470|1699710:1707102|DBSCAN-SWA ACTAGCGATTCATGACATAGCTGGCACTATTAGCAGCAACTCTTACCTTAGGAAGACCTAACGCTGCTCTTAATTTATCAGACACTCTTTGTCTTTCTTGTCTACTTACTACTTCGATTTCCATAGTACCAAACTTAGGATTCTGAGTTTGCTTACTTACACCTTGAGCATGATCTTGTGACAAGTTATCTGTTGCATTTCTATAATCCATCCCAATCTTGTACATTTGATTCATAGTCAAGTCAGTTTGCATTTCACTTGAGATTGAATTCAAGAATTTAGTATTAAGTAGTGTAGTAGGTGAGATAGACTTCTTAAGAAGTGCCATGACAACACGGCGATCACGGTCTTGACGACCATAATCTTGACGTGGATCCCCATGTCTAAGTTGAGCAAAAGCTAATGCCTTCTTACCATCCATATGGTAGGTCTTACCTTCTTGGAAGTGGTAGCCCATATTGTCGAAAGTCAATGGTGAAGTTACATCAATACCACCAACTTGATCAATAGCCTTCTTAAGACCACCCATGTTGATCATAATGTAACCGTCAATTGGCACACTATAGTATTTATCCAGTGTCTTAACTGTTTCACCAACTCCGCCTAGTGTATAAGCTGAGTTAATCTTCGTTACTCCATATTGCGGGAAATCTGGGAAGATTGCATTTGAATCACGTGGAATAGATACAATTGACGTACTATTAGTCTTAGGATTAATTGCCATCATCATGATGGTATCAGTACGTCCTTTCCAGCTACGTCCCATTGCTCCAGTATCAGTACCGAGCAAAAGAATATTGATTGGTTTCTTCTGCGCTAGCAAATTATCAAGATTACGTCCCTTGTTGCTCTTAGTTGTCTTAGCAACGGGAGTATACATATTTTGCGTGGTATCGCGTAAGTTCTTATAAACCATACCACAAACAAAAAGTGCAATGACTACAATAACGCCGATCACAATCCAGAACCAACGCCAAAACTTACGATGGTGATGACGGTGATGTCTATGGTGGTGATGGCGACGTACATCATTATTTGGTTGATTATTTTCTGCCATTTTGATCCCCTTATTTATTTATACAACTTCCTTATTATATAACTGATACTAGCTTTTTTACAGGCTTGTAACGGCTTTCCACGTTAAATAATCCTGTTTTTTTCAAATATGTTGAACATGTAATACTAACTTCTTACATGCAAATGAAATACCGTATCCTCTCAGTTGATTTATTAGAATAGTCTTTCTCTTTTTAGACTTATTTATATCTACTCCTCTACTAAATAAATTATCTAGCGTCCAACCATCATCGGTTAGTTTAATCAAACCATTTTTTTGACCAACGGCATACATCTCAATTTTGTCAGAAAAGTTATCTAAAAATGGAACATCAATAGCAACTACATTATCATTCAGATTTTCAAAGCGAGTGTTTTGATTATACCATTCTATATGTACTTAATAATTCACATTTTTTCAATTAGTTGCCTATTGTAATACAATTAACTTATCTACACATTTAAAAAGAGGTGTTTACATGATTGATAATCAACCTAAAAAAACTAAAGCCTACATTGCAGGTGTGAACTTAAACGATCCTAACTTTGACTACTACATGACTGAACTGGCCAACTTAACTGAAGCTGACAACATGGAGGTCGTGGGACAAAGTTACCAAAACGCAGAATCAATCGTTGCCGGGACTTACTTCGGTGTCGGCAAGATTAACGAAATCAGAACAATGGCCCAAGGCTTGAAGGCCAAGGTCTTAGTTTTAAACGATGAATTAACCCCAGTGCAAATCCGTAACCTTGAGAAGTTAACCAAGATGCGGGTCATCGACAGAACTGAATTAATTTTGGAAATTTTTGCCAGTCGTGCTCGCACTAAGCAGGCCAAACTACAGGTGCAACTCGCCAGATTGCAATATGAGTTGCCACGTCTTCACCCTTCCGAGAACAACTTGGACCAGCAGCGTGGTGGTGGTTTCGCCAACCGTGGTGCCGGTGAAAGTAAGCTGGAAATGAACCGCCGGACAATCGGCAAGCAAATTTCCGCTATTAAGAAGGAACTTAAGGCCGTTGCGAGTCAAGAAGAGATCAAGTCGGCTCGCCGTAACCAAAGCCGCATCCCTAAAGTCGCCCTTGTTGGCTACACCAACGCCGGTAAGTCCACTACAATGAACGGTTTGCTCAAGGAATTTTCAAAAGAAGGCAGCGACAAGGAAGTCTTCGTCAAAAACATGCTGTTTGCAACGCTTGACACTAGCGTGCGGCGCATTGACCTAAAGGATAACTTCAGCTTTATCCTGTCCGACACCGTTGGTTTTATCTCAAAACTGCCTCACAACCTAGTTGAATCATTCAAGGCAACGTTACAAGAAACCAAAGATGCCGATCTCTTAATCAACGTGGTCGATGCTTCTGACCCGAACATGGTTCAGATGATCCGCACTACGCAGAACGTTTTGGATGAAATTGGCGTAAAGGGTATCCCAATGATCACCGCCTACAACAAAGCCGATAAGACTGATCGAAATTACCCCCAGATTGAAGGCAGCGACATTCTTTATTCCGCAATCGATCCTAAGTCGATCAAATTGCTGGCCGATTTGATTACAAAGCGCGTATTCTCCAACTACGAGAAGCTCAACCTGAACCTCCCATTAAGTGCAGGCAAGGAACTGGCTTATTTGCATGAGAATGCTCAAGTTTTGAGTGAAAATTACGAAGAAGACGGCGTCCACATTGAAGCCAACATTGCGCCTGATGATCAGGGCCGCTTTGAAAAATATTTGGTTAATAATAGGTAATATATAATGAAGAAACACAATTTGCTGGTGGCCATCGTTGCAGTGGTTGAATGCATTGCGTTCACTGCTCAGCCCGTTCAAGCAGCGAAGTATTCGAAGTCTGAGGCTAAGCAGGTTAAGTATTTCCAACGTAAATACAAGAATTTGGATAAAGCGCAATACAACCGCAACAGCATTTACCAGCAAACACCAAATTTTGCCAATCCTTTTTCACCAGGGGTCTTGAATCCGGCATACATTTCAACCACGATGGATTACGTTAATTACTACCGTGATTTGATGGGACTGCCTAGCGAAGCTAATCCTGATGACGCTAACCGTAGCGCCCAGATCGGTGCTGCATCCCTCGCCGCAGTCAACGCAAGTGCCAGCCTACAAGCGCACGGTTTAATTAACTACCTGCGTCCCAACTACATCAGCGAAAACGATTGGGCCATCGCCGAGAACGCTACGCTAGGCAACATCAACTTCTTGGACGATGCGCATAGCGCCTCAGCCGGCGAGATCGTAACGGACTTGATGCGCGAAGATAACAACATTGCGGGTGCCGGCAATATCGGCCACCGCGCTATAATTTTATCAACACGCGCAACGCGCATGGGAATCGGTGCAGCGTACGGCATGAGCACCGATATGCTTTATTCAGTTGAATATGGCCTGTTCGCCGACGACATCTTGCGTGCGCCAGTAAAGTCACGCATCGTTTATCCAGCTGCGAAGGTCTTCCCATATGAATTAGTCGGCAAGGACACGCCATGGTCATACTCGACCACGAAGCGGATATCGAGCAAGCCGAAGATTTATATTACGGATTTGACCAAGAACAAGAAGAAGCGTTACCGCGCAACGCAGGTGCGCAACTTCAAGACGCTGTTCTATGGCGAAGGCTACACCACGACGATCACGTATCGTCCGGGCAAGGTCAAGCTGGTCAACACGCACAAGTATAAGGTGCAGATTGGCAAGTATTACACATACACGTTTAGATTCTTTAGACAAAAAGGTTAATAGTATGCAAAAAGGCAGCCGAGATTTCGGTTGCCTTTTTGAGCTTTTATGACATATACCCCCAACATTTTGAGTTTACGGAGGTGAGAGTCATGATTAATTGGATATTTATAGTTATACAAATTTCTTGTAGATTTCGCAATTACTACAAGAATTTAGAAACCATACGAAAAAGCCATCCATTTTTGGATGGCTTCATTATCTATAGCACTCTCTTAGCAATCGTAGGATAGTAATAACTCGACATATTTTGCAAAATCGTACCTTGATCTGGCGTTGCGGAATTAACATATTGATTATTACCAATATAAATCGCAACGTGATATGGAGCCGTTTCAGAACCCCAGAAGACCAAATCGCCTGCTTGCAATTTATCTAATGGAACAGTCTGACCTACCTTAACTTGATCGTAAGTCGTACGTGGCAAGTTGATTCCGGCCGCATGTTGATAAACATATTGTACTAAGCCAGAGCAATCGAACTTATCTGGTCCAGTTGCACCCCAAACGTAAGGCTTACCCACTTGCTCTTTTGCAAGTGCAACAACAGAAGCGGCCTTGCTTGAGGCTTGGATTGATGAAGTATTGGTCTTCTTATTAGCCGTCGTTTTTTTAACCGTTTCGTTAACGATTTGGTTAGCTTTTTTAACTTTATCTTCCTTAGTCGTTTTCTTCTTTTGCGTAGCCTGTTTTACCTTGTTGGCCAAATTAGCAACTGGCGACTTCTTCTTAGTAGTCTTAGTCGCAGTTTTAACCGGTTCTTCCTCACTGACATCAACGGTGTAACGCGCTTCAATCCATTCACGCGTACCCACCTTGTACCATAGGCTGCCTTTACTATCGACCGCAGTTTCAGCTACGTTCCAAACGGAGCCGTCCTTTGCGCGGTAACCCATGAACTTACCATTTTCATAATTAGTCCAAATATTCAGACTCTTACCTGGCAAGTAGCTAATTTTGACGCGTTTAACAGCAGTAAAATTGGCTGCTCTAACTGTTTTAAGTGATGTTATTGCAGAAACGCCTGTAACGGACAGTGCTGCAGCCGCTCCCAACTTAAACAATTTATGTTTCGACATAATTCAATCCCCTTGGTAGCGAAAAATATCCCACTTTTACTACTTGATAACGTCTTCATTATCACATGTGTAATCAAAAATCGCAAATGTAACTATTTAAATAAGTTTATAGACGCAAAAAAAACGAACCACAAATCGTGTGATTCGCTTAATTCTTTTATTCAATTTTCGTTTAGTTCAAAATACGCTTAGCTGCGCTTGGGTAGAAGTAACCACTCAAACTTTGCTTGATAACGCCTTGACCCGGAGTCGCGGCGTGGATGTATTGATTGTTACCAACATAAATACCTACGTGGTATGGAGCACTTGCGGAGCCCCAGAAAAGCAAGTCACCTGGTTGTAAGTTCTTCATTGAAACTGAGGTACCTTGCTTAACTTGGTCGTAAGTAGTTCTACCGATGTTCACACCAGCAGCCTTTGAGTAAACATATGAGGTAAGACCTGAACAGTCAAAACCGTTAGCACCGGTACCGCCCCATACGTAATTCTTACCAAGTTGTGCTTCAGCTAAAGTAACAATAGCTGATACATCGCCGGTTGCTTGAACTGACTTCTTAGCCTTCTTTGGCTTAGCAGCTGTTGCAGTAGCTTGAGTTGTAGTATTCTTAGTTACTGTATTACCAGCTGGAGTGGTGTATTGAGCCATGATCCATTGGTTTTGACCAACTTGGTACCAAACGCGTCCCTTCTTGTCAGTCTTTTGATCAAGGACATTCCAAGTGGTGCCGTGTTGTGCACGTTGACCTGTGAAGTGACCACCGTTGTAGTTATCCCAGATATTTATACCATAACCTGGAACGTAGTTGATCTTTACTTTAGTAGTAGCAGCTTGAACTTTAGCAGTAGTTGATTCTGGCTTAACAGCACTTACTGTAGCTACACCGGTCAGAGTTAAAGCAGCTGCTGCAGTAACTTTTACAAAATTGCTCTTAATATTCATCAAATGCTCCTATATATGTCTGTTAGTGTTTACTTAAGTTGTCAGATCTTTTAGTTTTGGTAACGAAAATGTGTTTTTTCGGAAAATCTTTAAACTTCTTTAATTAATCTACAAGCTCTATAATAATGTCCTTATGTTACAGAACTGTTACAAAGGTAATACTAACTTAATAAAAAAAGCGTTTTCGAAAATTTACACAAAAAAAGAGTTGGATAAAAAATCCAACTCTTCTTTCTATAATTTTTTGAGAAATATATTTTTTATTTTGTTTTAGTCGATAACACGCTTAGCTGATGATGGCATGAATGAACCTAAAGTTTGGATCTTAACGTTTTGACCAGGAGCTGGAGCGTGAACATACTTACCATTACCAATGTAGATAGCTACGTGTGAGTTACCCCAGAATAACAAGTCACCCTTCTTAAGCTTCTTAGTTGAAAGTGATACTGCCTTACCTAAAGTGATTTGGCCGTAAGTAGTTCTTGGCAAAGTCTTGTTAATTGCGTTCTTGAAAACGTAAGTAGTTAAACCTGAGCAATCGAAACCTGAAGGACCAGTTGCACCCCAGATGTAAGGCTTGCCAACTTGCTTCTTAGCGAGCTTAACAACTGCGCCACGTTTTTGGTCAGCTGCTGAAGTCTTCTTAACTGAAGAATCGGCTGGAGTAGTTGTTTCGATATCAGAAAGAGAATCGTTAGTGTTTACAGTAGTAGTGTTAATGTCGTCAGCGTGAACAGTTGCTGCAGGAACATTAACAGTTGCAAGACCTGTAAAGAAAATTGATAAAGCTGCTGTGTATTTAACCAAAGTACGTTTAAAACTCAAAATTTATTTCTCCTTAAAAATTATTTTTTATTCTTAGTTTTTAAATTCGTTTATTTAGTGATCTCTTGGATCAACTGACGTTTTATATAATAACGCCCGAATGTTACATGAATGTTTCAAAAGAACTACCATCACGTTAAGAAACATTGCAAAAGGAGAAATGATCTGTAAAAAACGGAGTGCCTAGTCTTCCTCAAGCCTTGATATGATTGAGTTTAACGTTTCTTTAGTTTTGTTAAGATTATCGTTAACAATTACATGAGCGTATTGTTTCAAATCTGCTGGAAGCAAGTTTAATTCGCTACCACTAAGGCGTTCCTTAATCTTCTCAGGATCGTCGCCACGCTTGAGCAGCCGCTCTTTCAGCTCTTCTTTAGTCGAAGTTGTGACATATAAGAAATAAACTTTGTCACCTAATTGTTTAATGTAAGAATATACGCCTTGAATATCGACGATTAATGACACCAGATCGTGTTTTTCCCAAGCGAGGTTTAATGCTTCACGACTTGAACCATACTGATATGAACCGTATTTTACATGTTCAAAGAAGTGCAATTTCTTAAATGAATCATCCGTTTCAAAGTGATATGACACATTCTGCCTTTCGCCGACACGCATTGGCCGTGTGGTATGCGTTAAAACGCGCGGAATGTCATATTTTTCATATAAATAATCGGAAATGGTCGTCTTGCCAGCACCACTTGGTCCCGCAATTAAAATTATTTTTTTCAA
Protein sequences of DBSCAN-SWA_5 >NC_017470|1699710:1707102|1700868_1701060_-|WP_014566172.1|DBSCAN-SWA MYAVGQKNGLIKLTDDGWTLDNLFSRGVDINKSKKRKTILINQLRGYGISFACKKLVLHVQHI >NC_017470|1699710:1707102|1705815_1706370_-|WP_013438629.1|DBSCAN-SWA MSFKRTLVKYTAALSIFFTGLATVNVPAATVHADDINTTTVNTNDSLSDIETTTPADSSVKKTSAADQKRGAVVKLAKKQVGKPYIWGATGPSGFDCSGLTTYVFKNAINKTLPRTTYGQITLGKAVSLSTKKLKKGDLLFWGNSHVAIYIGNGKYVHAPAPGQNVKIQTLGSFMPSSAKRVID >NC_017470|1699710:1707102|1706553_1707102_-|WP_014566176.1|DBSCAN-SWA MKKIILIAGPSGAGKTTISDYLYEKYDIPRVLTHTTRPMRVGERQNVSYHFETDDSFKKLHFFEHVKYGSYQYGSSREALNLAWEKHDLVSLIVDIQGVYSYIKQLGDKVYFLYVTTSTKEELKERLLKRGDDPEKIKERLSGSELNLLPADLKQYAHVIVNDNLNKTKETLNSIISRLEED >NC_017470|1699710:1707102|1699710_1700766_-|WP_014566171.1|DBSCAN-SWA MAENNQPNNDVRRHHHHRHHRHHHRKFWRWFWIVIGVIVVIALFVCGMVYKNLRDTTQNMYTPVAKTTKSNKGRNLDNLLAQKKPINILLLGTDTGAMGRSWKGRTDTIMMMAINPKTNSTSIVSIPRDSNAIFPDFPQYGVTKINSAYTLGGVGETVKTLDKYYSVPIDGYIMINMGGLKKAIDQVGGIDVTSPLTFDNMGYHFQEGKTYHMDGKKALAFAQLRHGDPRQDYGRQDRDRRVVMALLKKSISPTTLLNTKFLNSISSEMQTDLTMNQMYKIGMDYRNATDNLSQDHAQGVSKQTQNPKFGTMEIEVVSRQERQRVSDKLRAALGLPKVRVAANSASYVMNR >NC_017470|1699710:1707102|1704777_1705545_-|WP_013642459.1|DBSCAN-SWA MNIKSNFVKVTAAAALTLTGVATVSAVKPESTTAKVQAATTKVKINYVPGYGINIWDNYNGGHFTGQRAQHGTTWNVLDQKTDKKGRVWYQVGQNQWIMAQYTTPAGNTVTKNTTTQATATAAKPKKAKKSVQATGDVSAIVTLAEAQLGKNYVWGGTGANGFDCSGLTSYVYSKAAGVNIGRTTYDQVKQGTSVSMKNLQPGDLLFWGSASAPYHVGIYVGNNQYIHAATPGQGVIKQSLSGYFYPSAAKRILN >NC_017470|1699710:1707102|1702525_1703527_+|WP_014566174.1|DBSCAN-SWA MKKHNLLVAIVAVVECIAFTAQPVQAAKYSKSEAKQVKYFQRKYKNLDKAQYNRNSIYQQTPNFANPFSPGVLNPAYISTTMDYVNYYRDLMGLPSEANPDDANRSAQIGAASLAAVNASASLQAHGLINYLRPNYISENDWAIAENATLGNINFLDDAHSASAGEIVTDLMREDNNIAGAGNIGHRAIILSTRATRMGIGAAYGMSTDMLYSVEYGLFADDILRAPVKSRIVYPAAKVFPYELVGKDTPWSYSTTKRISSKPKIYITDLTKNKKKRYRATQVRNFKTLFYGEGYTTTITYRPGKVKLVNTHKYKVQIGKYYTYTFRFFRQKG >NC_017470|1699710:1707102|1703729_1704605_-|WP_014566175.1|DBSCAN-SWA MSKHKLFKLGAAAALSVTGVSAITSLKTVRAANFTAVKRVKISYLPGKSLNIWTNYENGKFMGYRAKDGSVWNVAETAVDSKGSLWYKVGTREWIEARYTVDVSEEEPVKTATKTTKKKSPVANLANKVKQATQKKKTTKEDKVKKANQIVNETVKKTTANKKTNTSSIQASSKAASVVALAKEQVGKPYVWGATGPDKFDCSGLVQYVYQHAAGINLPRTTYDQVKVGQTVPLDKLQAGDLVFWGSETAPYHVAIYIGNNQYVNSATPDQGTILQNMSSYYYPTIAKRVL >NC_017470|1699710:1707102|1701244_1702519_+|WP_014566173.1|DBSCAN-SWA MIDNQPKKTKAYIAGVNLNDPNFDYYMTELANLTEADNMEVVGQSYQNAESIVAGTYFGVGKINEIRTMAQGLKAKVLVLNDELTPVQIRNLEKLTKMRVIDRTELILEIFASRARTKQAKLQVQLARLQYELPRLHPSENNLDQQRGGGFANRGAGESKLEMNRRTIGKQISAIKKELKAVASQEEIKSARRNQSRIPKVALVGYTNAGKSTTMNGLLKEFSKEGSDKEVFVKNMLFATLDTSVRRIDLKDNFSFILSDTVGFISKLPHNLVESFKATLQETKDADLLINVVDASDPNMVQMIRTTQNVLDEIGVKGIPMITAYNKADKTDRNYPQIEGSDILYSAIDPKSIKLLADLITKRVFSNYEKLNLNLPLSAGKELAYLHENAQVLSENYEEDGVHIEANIAPDDQGRFEKYLVNNR |
8 | Clostridioides_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_017472_1 | 40785-41023 | Orphan |
NA
Consensus repeat of NC_017472_1
|
3 spacers
spacers of NC_017472_1
>1.1|40821|30|NC_017472|CRISPRCasFinder,CRT CTTTAATTTTATCATCATTTGGAATATACT >1.2|40887|30|NC_017472|CRISPRCasFinder,CRT AAAATACTGATAAATCAATGCTTAGTTTAG >1.3|40953|31|NC_017472|CRISPRCasFinder,CRT TAAGCTCATTTCTGCATCTAATACCCGATTG >1.4|40891|26|NC_017472|PILER-CR TACTGATAAATCAATGCTTAGTTTAG >1.5|40957|27|NC_017472|PILER-CR CTCATTTCTGCATCTAATACCCGATTG |
CRISPR arrays and Neighbor proteins around NC_017472_1
The CRISPR arrays of NC_017472_1 >merge|NC_017472|1|40785-41023|CRISPRCasFinder,CRT,PILER-CR GTTTTCGGTGGTTGTCATTTCAAGCAGGTAAATACCCTTTAATTTTATCATCATTTGGAATATACTGTTTTCGGTGGTTGTCATTTCAAGCAGGTAAATACCAAAATACTGATAAATCAATGCTTAGTTTAGGTTTTCGGTGGTTGTCATTTCAAGCAGGTAAATACCTAAGCTCATTTCTGCATCTAATACCCGATTGGTTTTCGGTGGTTGTCATTTCAAGCAGGTAGATACCTAAA >NC_017472|1|1|40785-41019|CRISPRCasFinder GTTTTCGGTGGTTGTCATTTCAAGCAGGTAAATACC CTTTAATTTTATCATCATTTGGAATATACT GTTTTCGGTGGTTGTCATTTCAAGCAGGTAAATACC AAAATACTGATAAATCAATGCTTAGTTTAG GTTTTCGGTGGTTGTCATTTCAAGCAGGTAAATACC TAAGCTCATTTCTGCATCTAATACCCGATTG GTTTTCGGTGGTTGTCATTTCAAGCAGGTAGATACC >NC_017472|1|1|40785-41019|CRT GTTTTCGGTGGTTGTCATTTCAAGCAGGTAAATACC CTTTAATTTTATCATCATTTGGAATATACT GTTTTCGGTGGTTGTCATTTCAAGCAGGTAAATACC AAAATACTGATAAATCAATGCTTAGTTTAG GTTTTCGGTGGTTGTCATTTCAAGCAGGTAAATACC TAAGCTCATTTCTGCATCTAATACCCGATTG GTTTTCGGTGGTTGTCATTTCAAGCAGGTAGATACC >NC_017472|1|1|40851-41023|PILER-CR GTTTTCGGTGGTTGTCATTTCAAGCAGGTAAATACCAAAA TACTGATAAATCAATGCTTAGTTTAG GTTTTCGGTGGTTGTCATTTCAAGCAGGTAAATACCTAAG CTCATTTCTGCATCTAATACCCGATTG GTTTTCGGTGGTTGTCATTTCAAGCAGGTAGATACCTAAA
>NC_017472.1|WP_014566310.1|39970_40651_+|hypothetical-protein MAKTTNNGDEPTKSRKGLDLLYRLTHYDSYHSYRVRMAIVRSLLWLAVVLYIYKLFIPNYQTDIFVNDIIIFLLGSYLIAYYSVTAKASIMFKDSAYVMRKIAFSGVISNSWNIYDGILKDPEEGEVSVEITFFKEKAFLVNVFDLMRKNGKTLLLTKDDIENLKRYIEVLDGSQALRKKFEAYATLSDEMVKTLSDAGILLTRLDNHYRTKPYRWDYARTTDIKS >NC_017472.1|WP_014566309.1|37907_39761_-|hypothetical-protein MINFKNNFKTISITDDQFLLIKQIANLGFVTKPQLEMIYSIIKNKPTSISNHILNKLVNKDKVLNRIQSNESQNRIKQIAYVISRYGRNLLSAYHCFYRDPRSFGINFHNLQANEVVIQALYASNFKPTALGSNNSSLRFNDEEKTVSITNSFGSTFTLPTFDVPFYDKKVPKPVVSRLNEDYFIKNADLARLPELLTEGLLVGGITDKKLRLSLDSKAQSTFSSEKGNEDDDWFVQDLSFLKNPRLLRYISFFKPFLSKKFIEEMQSYQGLTKMSGLRGVGAHVKLVKISNFYQQLLKRSNCYYFLLNLYQNLLNIDNKKQSLATVGNYWQQLATGNNNYQELVTFTNSTLKKDTLDKNLAIIKPIGLLPFRVGSDFDDRIQRASLALRGFNYHLKLTEFDTRPFNAQLGITPTKKDNTAFEADTMITFKRNNKVQSVFIELDNRTEGSATQAQKILNYIEYANQHPNDNFLLAIVSADGSLPTNKLKQYTYPDQHLGVLVDKMLRIRVGEGQQTEDGRLVKATPYLIELYERCPNLKIIFAGLSEAPMRIAEFIVNANHNIDYISSAFVLARSISKETQWDVTFDPTIEVKEAIKNTPALVDTTYNQLHYNVSGL >NC_017472.1|WP_014566308.1|37165_37924_-|hypothetical-protein MFPDFNYSNKQTNLHVRQPVIAGDEYDLDTPYLVTEVTQSFGKHHVLKDRDSIPPMVIFPTRERLTHANVPPAIQKMCNWSPKFIAGQIYYYQPRLGIDNNPYLLRELRNLVIRHSKDIYQYYSKGFLSRKELAQGITYSDKDKKLPFAYADHILQKTRKYNELHRLTNYMNDKAFAEQIMLNEIPLKMLKSLIARTQGRAFSIPVIADLPYSIVGDDSLIVPDHISLSDCLTTPNTRYRRSDAKIDLKYSY >NC_017472.1|WP_014566307.1|36562_36703_+|hypothetical-protein MYTERGRVLDLPVISSKDEAEIMKNAKDLKKPHWLPVELVSIDENK >NC_017472.1|WP_014566304.1|35377_35599_-|hypothetical-protein MAKVYDHDKHEFLTGYRGLTIGWSDGNTFLPVNFALMSTKKKKNMIGSQPVTTDQRSIANRRRTQAQRPMNKS >NC_017472.1|WP_014566303.1|35120_35264_-|hypothetical-protein MLKQTKKVYYRYRGRLYDIKELYERLAASKMHQKADYLYSSVVEAKY >NC_017472.1|WP_166484917.1|34766_34952_-|hypothetical-protein MCKSALGITIGPSFILEKENAFVKPVPLEYRVKLSYGTASLNSNHKIQIKDFYNFFKLNLQ >NC_017472.1|WP_014566302.1|33897_34752_-|LysR-family-transcriptional-regulator METKQLAIFLDVCKTQSFSETSRNMYITRSAIVQHINKLEKYLGVKLFHRNSHGVKLTDAGKVLIPFAQNMVDTNDSIIQTMHNFSHTITIGTIYLQKPNLITKMLNDRPKYAKKIQIKFQELNNIKQINSQIDIIEYYEVTKYLDQSFNFLKLEEEPIFIALPPNHKLARKDSIDLKDLEGYTVAIEKSGVSVIGDKVKEKLEKYPQINLKSYGIYNSSFFATAQYNNYLICIARGMGIDTTPYVLRPLNVSEKALYGIYYRKKPNNLVKEFIKNFSEKKTVQ >NC_017472.1|WP_014566300.1|32288_32597_-|hypothetical-protein MPSSKKSGSVVGKPLSSKYGTSSNSQTDSNAVKPDGLLCISFCLSACKSNCRSECASNCGKACASACKAACRAECRSMCYGAGSDAPTTLNKQVKSVEDIIL >NC_017472.1|WP_014566299.1|30659_32258_-|ABC-transporter-ATP-binding-protein MFLKRIWNKYKFDYILLLGMNIINTCIETSNVYLEGMLINSLVYKADRVSFIRNIIVIIVLNLIRLFLSFFISKIQILKYRKINLDFNDSIIKELYSKDTLEVIKKDPVKTADRITEDTDEILTFLFHTINQVISILFSSIIIFVYIFKTKSRFFLLIMILLPAYICLYLFLKPKIFEINLKLKQAYNEYFSGFTEWLSRYIEIKGNNRENKESKRWSKTKKSLLNITKRDFLLNLNMSSSEIIFQLIFQLILFINGGLSVISGNMTVGSFSILFQYFNQLLGEVDEIFSVLFGLESFRVAKMRINKLLSIKNEVDGKKIISRIESIYVHDFDISLHRNSPLFVKKLNCTFSSPGLYIIKGKNGIGKSTFLRTLIGLYTPIKEGEVLINNENIDLINKKKLRENNISCLFQDVPLPSCTVAEYIRDKHTNSNSDQNEAFKKVFYSSQFNIKRILDRKMDELSTGELQLVKLYSAFLKEKVDCYLLDEPLANIYPELQYDTLNLLKQMAQTKLVIIISHDLQFEKIGKTIKVG >NC_017472.1|WP_014566311.1|41549_41873_+|hypothetical-protein MKNEDLQEMRKEYIQDITLEVSKMIAKSSKLSLEEAKKAFINSRTYNFLAYSDDPFVEEGPEDFYEMFKNDRKYGRMVTDIQIYLEKHPELYINPNEKDNVRKGNNK >NC_017472.1|WP_014566313.1|42910_43153_+|hypothetical-protein MSKESELKQIEDKLLIYISSDRRNWADTFKLTKRVRDEELYSGEYADYGDFFWLCIPKELLEVAQKYVAKGWGILLVTLQ >NC_017472.1|WP_014566314.1|43268_46115_-|methylase MTKVNKKKLKDFIDTWQNQGSEVADKVTYWNTLLELLGVPKEQIDNKTYIEYEKPIKLHENESFHGSIDAYIPSTHVLIEQKSNGVDLTKPENRPNGNHTEKITPFSQAKRYDDHLGSKEKANFLVLSNFNQIVVYDVRESIDTKPIIINIEDLEKDLYLLNFLVKPDDSKRLEKEKRVSFAAGTLVSQIYNELADIFAKYDQTADEQIKHSINTLCVRLVFCLYAEDAGLFPTKEQFYNYLEPVKPNKMGLALKALFKTLDTKDRKAEDPFWEDENPELAQFPYVNGGLFADEDIIIPPFTEKLKDIILNKASRGFDWSDISPTIFGAVFESTLNPDTRREGGMHYTSIENIHKVIDPLFLDDLKAKLEKIKQYKNQKTIHDKAVAFQEELANLTFFDPACGSGNFLTETFLSLRRLENEAIRLELGGESVLDVGQAKDWIKVSIQQFSGIEINDFAVSVAKTALWIAEDQMMKETQDLLYAPDWDFLPLKTYTRIHEGNALEMDWNKVIPNYACHYIIGNPPFSGLSALPAKNKKLKKQQTEDMNRVFKDLPKHGKLDYVTAWYEKAADMMQGTNIKASFVSTNSITQGEQVGILWKHLIEDKNLTIIFAYRSFVWNNEAKDTAKVHCVIVGFTCGKYKGEKTLFEGEKVKKVDHINGYLIDYDDIYVKSRKVVPPYNMPLMSQGSKPIDGGGLILKSDEYNKFITEYPELKDLVKPYMGASELIKGKRRYCFWLKDVDSKRFVNNKLIRERLKIVIEARRKSPTKSVHDHAEEAPYLFSQIRQPDVDYIAVPSPSSGNRKYIPMAILSKNIIASNRLYIIPSTSLWIFSVLMSSVHMAWVNVVTGRLKSDFSYSPAVYANFPWLDFTNEQKAQLNKSAQEILDAREKYPDDSLADLYDPLGMPPELIKAHKENNKLILKMYNLPADSSEADIVAHLFKMYEKLTK >NC_017472.1|WP_014566315.1|46289_46877_+|recombinase-family-protein MIYGYARVSTAQQDYATQIDDLKRAGATKIYKDKYTGTTANRPEFDKLMDKLQNGDTLIVTKLDRLARNTQDALSIVKQMNDEGVILRVLNIGTIDNSPSGRLIFTVFSAFAEFERDLIVSRTQEGKAWAKANNPNFHDGMPRKYDQEQINFAWKLHTQDHMSYSEISKKLGMSKATIYRRFRELRDSPNRKSRL >NC_017472.1|WP_014566319.1|50043_50667_+|NUDIX-hydrolase MEDKDLLIEWAKRLQSLAQAGLTYGKDDFDLDRYQEIRDISAEMMAYKSDLPLQKVKDLFCNEIGYQTPKLGTRAAIFKDNKILLVQENDGSWSLPGGWCEVNMSVKENCIKEAKEESGLDIEVERVIGIYDQNKHSEAIYPYNVVHVFFLCKPLGGEFKKNIETTTRKYFAYDQLPENLSTDRNSLDEIEACFKAYKDPGFQVECD >NC_017472.1|WP_014566320.1|50883_51609_+|coenzyme-F420-0:L-glutamate-ligase MFSNTISTKLWLDFPQITKKCDLAEVIIQFCEKKRDSLKDGDILCIASKIISKSQGLFVDLNTIKPSELALKIHHQVPRKDPRIIQLIINQTKDLSGKRLQISPNFIGGWLPNGLFLTSAGVDRIGEDTAIVLPNNCDEIAKQIAEKIYEKTGKRVAIVITDSDGRIDKKGATQIAVGLYGINGLRKTQSNGKINVETICDMLAASAGLLMGQRGNMVPIVTIRGFEYEFDRDATIKDAVN >NC_017472.1|WP_014566322.1|52136_52433_+|hypothetical-protein MIIDTQKLQKAVIENKKNHGFNTTDVKFELLLLYGEVNELFQAWLKDDRDSINEELADVAIFLLGISEMLGSDLGEDIVKKMKINAKRKYIDGKKIEG >NC_017472.1|WP_014566323.1|52711_54217_+|ABC-F-type-ribosomal-protection-protein MSNIRISNLSFRYDDSSENIFNKLNLNLDSTWKLGLVGRNGRGKTTFLNLLRRKLHGLGEIQTRLSFSYYPIKVEDQKNITLYELQKQVAFEEWELERELNLMNVNPNLLWQPFNTLSGGEQTKVLLALSFTDKDSFALIDEPTNHLDEDSRKEISNYLGKHEKGYIVVSHDRDFLNQVTDHILAIENMEIHLYQGNFAAYEDTKQKRDEFNREKNQKLKGEIRTLNESRLRLKGYSSKSENQKNAKAHSNEIHAYINKGFYSHKAAKVMQRSKNVERRMNDDIQAKQGLMTNIEDIPELTMNFQPNYHSTLLEAQHLDLQIENITLFKDLNLVVKNHGIVSLEGKNGSGKSTFLKMLLNKTFSVTYQGKYELANGLSISYLPQNFTEYHGTLHNFAYEHKISYEKLLNNLKKMGFPRAGFVTPIEEMSMGQQKRVALAKSLVEPADLYLWDEPANYLDVFNQDQLIELLKKVKPAMLLIEHDEYFIEQVTDHRVRLDIAE >NC_017472.1|WP_014566327.1|57155_59084_+|tetracycline-resistance-ribosomal-protection-protein-Tet(W) MKIINIGILAHVDAGKTTLTESLLYASGAISEPGSVEKGTTRTDTMFLERQRGITIQAAVTSFQWHRCKVNIVDTPGHMDFLAEVYRSLAVLDGAILVISAKDGVQAQTRILFHALRKMNIPTVIFINKIDQAGVDLQSVVQSVRDKLSADIIIKQTVSLSPEIVLEENTDIEAWDAVIENNDKLLEKYIAGEPISREKLVREEQRRVQDASLFPVYYGSAKKGLGIQPLMDAVTGLFQPIGEQGSAALCGSVFKVEYTDCGQRRVYLRLYSGTLRLRDTVALAGREKLKITEMRIPSKGEIVRTDTAYPGEIVILPSDSVRLNDVLGDPTRLPRKRWREDPLPMLRTSIAPKTAAQRERLLDALTQLADTDPLLRYEVDSITHEIILSFLGRVQLEVVSALLSEKYKLETVVKEPTVIYMERPLKAASHTIHIEVPPNPFWASIGLSVTPLPLGSGVQYESRVSLGYLNQSFQNAVRDGIRYGLEQGLFGWNVTDCKICFEYGLYYSPVSTPADFRSLAPIVLEQALKESGTQLLEPYLSFTLYAPREYLSRAYHDAPKYCATIETVQVKKDEVVFTGEIPARCIQAYRTDLAFYTNGQSVCLTELKGYQAAVGKPVIQPRRPNSRLDKVRYMFQKIRKSR >NC_017472.1|WP_014566328.1|59139_60090_+|SLAP-domain-containing-protein MYIKAANFSSKKTATTTDLGDGYETTMLHNAYIYNSKGKRVRGKKLLKNHDITYYGKVLMIKGKKYVQIGDNQYVRSSNVLLAYDGPISSNSNVNRHATNCSSNNDTSINSNNSTNNSKNNNVVNNTANSQNGSKSSKTNQTNNQSANILRNGNQNNQTNTDVATDTDFEALSLAIQKAEATKYYDATFARAQAYHQAKEAAEVLMVNHKHPYKYQPVITAAEVHAATANVEAAAANLDGDAEYDKMPNVKIERATDGDIKYDWTPAQKQLVLDIANEIHGSTDAHYFDNDRQIGLTDGNGMAHTFNTSYFLHETY |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | NC_017472 | Lactobacillus amylovorus GRL1118 plasmid p2, complete sequence | 40821-40850 | 0 | 1.0 |
NC_017472_1 | 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT | 40887-40916 | 30 | NC_017472 | Lactobacillus amylovorus GRL1118 plasmid p2, complete sequence | 40887-40916 | 0 | 1.0 |
NC_017472_1 | 1.3|40953|31|NC_017472|CRISPRCasFinder,CRT | 40953-40983 | 31 | NC_017472 | Lactobacillus amylovorus GRL1118 plasmid p2, complete sequence | 40953-40983 | 0 | 1.0 |
NC_017472_1 | 1.4|40891|26|NC_017472|PILER-CR | 40891-40916 | 26 | NC_017472 | Lactobacillus amylovorus GRL1118 plasmid p2, complete sequence | 40891-40916 | 0 | 1.0 |
NC_017472_1 | 1.5|40957|27|NC_017472|PILER-CR | 40957-40983 | 27 | NC_017472 | Lactobacillus amylovorus GRL1118 plasmid p2, complete sequence | 40957-40983 | 0 | 1.0 |
NC_017472_1 | 1.4|40891|26|NC_017472|PILER-CR | 40891-40916 | 26 | MN694558 | Marine virus AFVG_250M9, complete genome | 10121-10146 | 3 | 0.885 |
NC_017472_1 | 1.4|40891|26|NC_017472|PILER-CR | 40891-40916 | 26 | MN694640 | Marine virus AFVG_250M10, complete genome | 10126-10151 | 3 | 0.885 |
NC_017472_1 | 1.4|40891|26|NC_017472|PILER-CR | 40891-40916 | 26 | MN694392 | Marine virus AFVG_250M8, complete genome | 29464-29489 | 3 | 0.885 |
NC_017472_1 | 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT | 40887-40916 | 30 | NZ_CP017256 | Clostridium taeniosporum strain 1/k plasmid pCt3, complete sequence | 89603-89632 | 5 | 0.833 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | NZ_AP018564 | Staphylococcus argenteus strain 58113 plasmid p2, complete sequence | 15457-15486 | 6 | 0.8 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | MK250021 | Prevotella phage Lak-B2, complete genome | 296834-296863 | 6 | 0.8 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | MK250024 | Prevotella phage Lak-B5, complete genome | 290856-290885 | 6 | 0.8 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | MK250028 | Prevotella phage Lak-B9, complete genome | 295746-295775 | 6 | 0.8 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | MK250025 | Prevotella phage Lak-B6, complete genome | 294048-294077 | 6 | 0.8 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | MK250022 | Prevotella phage Lak-B3, complete genome | 294063-294092 | 6 | 0.8 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | MK250020 | Prevotella phage Lak-B1, complete genome | 295729-295758 | 6 | 0.8 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | AP014341 | Uncultured Mediterranean phage uvMED isolate uvMED-GF-U-MedDCM-OCT-S37-C76, *** SEQUENCING IN PROGRESS *** | 11025-11054 | 7 | 0.767 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | MN693625 | Marine virus AFVG_250M334, complete genome | 9846-9875 | 7 | 0.767 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | MN693057 | Marine virus AFVG_25M77, complete genome | 50720-50749 | 7 | 0.767 |
NC_017472_1 | 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT | 40887-40916 | 30 | MN694558 | Marine virus AFVG_250M9, complete genome | 10121-10150 | 7 | 0.767 |
NC_017472_1 | 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT | 40887-40916 | 30 | MN694392 | Marine virus AFVG_250M8, complete genome | 29460-29489 | 7 | 0.767 |
NC_017472_1 | 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT | 40887-40916 | 30 | MN694640 | Marine virus AFVG_250M10, complete genome | 10126-10155 | 7 | 0.767 |
NC_017472_1 | 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT | 40887-40916 | 30 | NZ_CP014152 | Clostridium botulinum strain BrDura plasmid pRSJ20_1, complete sequence | 52873-52902 | 7 | 0.767 |
NC_017472_1 | 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT | 40887-40916 | 30 | NZ_CP013710 | Clostridium botulinum strain F634 plasmid pRSJ2_3, complete sequence | 226483-226512 | 7 | 0.767 |
NC_017472_1 | 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT | 40887-40916 | 30 | NC_010379 | Clostridium botulinum B1 str. Okra plasmid pCLD, complete sequence | 20263-20292 | 7 | 0.767 |
NC_017472_1 | 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT | 40887-40916 | 30 | NC_010418 | Clostridium botulinum A3 str. Loch Maree plasmid pCLK, complete sequence | 153081-153110 | 7 | 0.767 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | NZ_CP010582 | Bacillus thuringiensis serovar morrisoni strain BGSC 4AA1 plasmid pBMB51, complete sequence | 38695-38724 | 8 | 0.733 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | MT074686 | Enterococcus faecium strain E1077 plasmid pE1077-217, complete sequence | 88154-88183 | 8 | 0.733 |
NC_017472_1 | 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT | 40821-40850 | 30 | NZ_CP045225 | Clostridioides difficile strain TW11 plasmid p_TW11, complete sequence | 5101-5130 | 9 | 0.7 |
NC_017472_1 | 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT | 40887-40916 | 30 | NC_005354 | Lactobacillus prophage Lj928, complete genome | 38323-38352 | 9 | 0.7 |
NC_017472_1 | 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT | 40887-40916 | 30 | AY459533 | Lactobacillus johnsonii prophage Lj928, complete genome | 38323-38352 | 9 | 0.7 |
1. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to NC_017472 (Lactobacillus amylovorus GRL1118 plasmid p2, complete sequence) position: , mismatch: 0, identity: 1.0
ctttaattttatcatcatttggaatatact CRISPR spacer ctttaattttatcatcatttggaatatact Protospacer ******************************
2. spacer 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT matches to NC_017472 (Lactobacillus amylovorus GRL1118 plasmid p2, complete sequence) position: , mismatch: 0, identity: 1.0
aaaatactgataaatcaatgcttagtttag CRISPR spacer aaaatactgataaatcaatgcttagtttag Protospacer ******************************
3. spacer 1.3|40953|31|NC_017472|CRISPRCasFinder,CRT matches to NC_017472 (Lactobacillus amylovorus GRL1118 plasmid p2, complete sequence) position: , mismatch: 0, identity: 1.0
taagctcatttctgcatctaatacccgattg CRISPR spacer taagctcatttctgcatctaatacccgattg Protospacer *******************************
4. spacer 1.4|40891|26|NC_017472|PILER-CR matches to NC_017472 (Lactobacillus amylovorus GRL1118 plasmid p2, complete sequence) position: , mismatch: 0, identity: 1.0
tactgataaatcaatgcttagtttag CRISPR spacer tactgataaatcaatgcttagtttag Protospacer **************************
5. spacer 1.5|40957|27|NC_017472|PILER-CR matches to NC_017472 (Lactobacillus amylovorus GRL1118 plasmid p2, complete sequence) position: , mismatch: 0, identity: 1.0
ctcatttctgcatctaatacccgattg CRISPR spacer ctcatttctgcatctaatacccgattg Protospacer ***************************
6. spacer 1.4|40891|26|NC_017472|PILER-CR matches to MN694558 (Marine virus AFVG_250M9, complete genome) position: , mismatch: 3, identity: 0.885
tactgataaatcaatgcttagtttag CRISPR spacer ttctgataaatcaattctttgtttag Protospacer * ************* *** ******
7. spacer 1.4|40891|26|NC_017472|PILER-CR matches to MN694640 (Marine virus AFVG_250M10, complete genome) position: , mismatch: 3, identity: 0.885
tactgataaatcaatgcttagtttag CRISPR spacer ttctgataaatcaattctttgtttag Protospacer * ************* *** ******
8. spacer 1.4|40891|26|NC_017472|PILER-CR matches to MN694392 (Marine virus AFVG_250M8, complete genome) position: , mismatch: 3, identity: 0.885
tactgataaatcaatgcttagtttag CRISPR spacer ttctgataaatcaattctttgtttag Protospacer * ************* *** ******
9. spacer 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT matches to NZ_CP017256 (Clostridium taeniosporum strain 1/k plasmid pCt3, complete sequence) position: , mismatch: 5, identity: 0.833
aaaatactgataaatcaatgcttagtttag CRISPR spacer aatatactcataaatcaatgcttatatttg Protospacer ** ***** *************** ** *
10. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to NZ_AP018564 (Staphylococcus argenteus strain 58113 plasmid p2, complete sequence) position: , mismatch: 6, identity: 0.8
ctttaattttatcatcatttggaatatact CRISPR spacer cacaatttttattatcatttggaatatatt Protospacer * . * ******.***************.*
11. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to MK250021 (Prevotella phage Lak-B2, complete genome) position: , mismatch: 6, identity: 0.8
ctttaattttatcatcatttggaatatact CRISPR spacer catctattttataatcatatggaatatatt Protospacer * *. ******* ***** *********.*
12. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to MK250024 (Prevotella phage Lak-B5, complete genome) position: , mismatch: 6, identity: 0.8
ctttaattttatcatcatttggaatatact CRISPR spacer catctattttataatcatatggaatatatt Protospacer * *. ******* ***** *********.*
13. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to MK250028 (Prevotella phage Lak-B9, complete genome) position: , mismatch: 6, identity: 0.8
ctttaattttatcatcatttggaatatact CRISPR spacer catctattttataatcatatggaatatatt Protospacer * *. ******* ***** *********.*
14. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to MK250025 (Prevotella phage Lak-B6, complete genome) position: , mismatch: 6, identity: 0.8
ctttaattttatcatcatttggaatatact CRISPR spacer catctattttataatcatatggaatatatt Protospacer * *. ******* ***** *********.*
15. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to MK250022 (Prevotella phage Lak-B3, complete genome) position: , mismatch: 6, identity: 0.8
ctttaattttatcatcatttggaatatact CRISPR spacer catctattttataatcatatggaatatatt Protospacer * *. ******* ***** *********.*
16. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to MK250020 (Prevotella phage Lak-B1, complete genome) position: , mismatch: 6, identity: 0.8
ctttaattttatcatcatttggaatatact CRISPR spacer catctattttataatcatatggaatatatt Protospacer * *. ******* ***** *********.*
17. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to AP014341 (Uncultured Mediterranean phage uvMED isolate uvMED-GF-U-MedDCM-OCT-S37-C76, *** SEQUENCING IN PROGRESS ***) position: , mismatch: 7, identity: 0.767
ctttaattttatcatcatttggaatatact CRISPR spacer gttttattttatcatcatttgaaatttttg Protospacer *** ****************.*** * .
18. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to MN693625 (Marine virus AFVG_250M334, complete genome) position: , mismatch: 7, identity: 0.767
ctttaattttatcatcatttggaatatact CRISPR spacer ttataatattatcatcatttgaaatatcac Protospacer .* **** *************.***** .
19. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to MN693057 (Marine virus AFVG_25M77, complete genome) position: , mismatch: 7, identity: 0.767
ctttaattttatcatcatttggaatatact CRISPR spacer gaataattttatcaacatttggtatagatt Protospacer *********** ******* *** *.*
20. spacer 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT matches to MN694558 (Marine virus AFVG_250M9, complete genome) position: , mismatch: 7, identity: 0.767
aaaatactgataaatcaatgcttagtttag CRISPR spacer ttctttctgataaatcaattctttgtttag Protospacer * ************* *** ******
21. spacer 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT matches to MN694392 (Marine virus AFVG_250M8, complete genome) position: , mismatch: 7, identity: 0.767
aaaatactgataaatcaatgcttagtttag CRISPR spacer ttctttctgataaatcaattctttgtttag Protospacer * ************* *** ******
22. spacer 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT matches to MN694640 (Marine virus AFVG_250M10, complete genome) position: , mismatch: 7, identity: 0.767
aaaatactgataaatcaatgcttagtttag CRISPR spacer ttctttctgataaatcaattctttgtttag Protospacer * ************* *** ******
23. spacer 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT matches to NZ_CP014152 (Clostridium botulinum strain BrDura plasmid pRSJ20_1, complete sequence) position: , mismatch: 7, identity: 0.767
aaaatactgataaatcaatgcttagtttag CRISPR spacer aaaatactcataaatcattgcttctgtaac Protospacer ******** ******** ***** * *
24. spacer 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT matches to NZ_CP013710 (Clostridium botulinum strain F634 plasmid pRSJ2_3, complete sequence) position: , mismatch: 7, identity: 0.767
aaaatactgataaatcaatgcttagtttag CRISPR spacer aaaatactcataaatcattgcttctgtaac Protospacer ******** ******** ***** * *
25. spacer 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT matches to NC_010379 (Clostridium botulinum B1 str. Okra plasmid pCLD, complete sequence) position: , mismatch: 7, identity: 0.767
aaaatactgataaatcaatgcttagtttag CRISPR spacer aaaatactcataaatcattgcttctgtaac Protospacer ******** ******** ***** * *
26. spacer 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT matches to NC_010418 (Clostridium botulinum A3 str. Loch Maree plasmid pCLK, complete sequence) position: , mismatch: 7, identity: 0.767
aaaatactgataaatcaatgcttagtttag CRISPR spacer aaaatactcataaatcattgcttctgtaac Protospacer ******** ******** ***** * *
27. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to NZ_CP010582 (Bacillus thuringiensis serovar morrisoni strain BGSC 4AA1 plasmid pBMB51, complete sequence) position: , mismatch: 8, identity: 0.733
ctttaattttatcatcatttggaatatact CRISPR spacer agaaaattttatcatcatttggtatattag Protospacer ****************** ****
28. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to MT074686 (Enterococcus faecium strain E1077 plasmid pE1077-217, complete sequence) position: , mismatch: 8, identity: 0.733
ctttaattttatcatcatttggaatatact CRISPR spacer tgattatattatcatcacttggaatatagc Protospacer . * ** *********.********** .
29. spacer 1.1|40821|30|NC_017472|CRISPRCasFinder,CRT matches to NZ_CP045225 (Clostridioides difficile strain TW11 plasmid p_TW11, complete sequence) position: , mismatch: 9, identity: 0.7
ctttaattttatcatcatttggaatatact CRISPR spacer aagacattttatcatcatttgcaataatat Protospacer **************** **** *
30. spacer 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT matches to NC_005354 (Lactobacillus prophage Lj928, complete genome) position: , mismatch: 9, identity: 0.7
aaaatactgataaatcaatgcttagtttag CRISPR spacer tgaatactgataaatcaatgtttcaagtgt Protospacer .******************.** . *.
31. spacer 1.2|40887|30|NC_017472|CRISPRCasFinder,CRT matches to AY459533 (Lactobacillus johnsonii prophage Lj928, complete genome) position: , mismatch: 9, identity: 0.7
aaaatactgataaatcaatgcttagtttag CRISPR spacer tgaatactgataaatcaatgtttcaagtgt Protospacer .******************.** . *.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 10035
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_017472|0:10035|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >NC_017472|0:10035|7947_10035_+|WP_014566284.1|DBSCAN-SWA MPTIDTTWEEKPVDITKEANFIWSIANKIRAAYMPDKYGDVIIPMTIIRRFECALEPTKDQVLAQYQEMPEFPAMAFYQITGYQFYNTSKFDLKELCNDPDNIAENFKAYISGFSKDVQEILKQLDMSGQIDKMNDNNCLYSVVKAFSEIDLSVEHFDSIKMGYIFENLIGRFYQNVDAGQFYTGRDIIKLCVSLLLAEGCDDITDKNKVITVIDQACGTGGMLSTAYTYLKHYNPTADVHLYGQEMMGQSYAVGLAEMLIKNQNIDNFKIADTLKEDCFPDRKMRFALENPPFGTPWGGKDAKDGQEDAVKEEYAKGKNSRWPAGLPASGDSQLLFLQSALAKLEDNGRAAIIENGSPLFTGNTASGESQIRRWLLENDYLEAIVAMPTDLFYNTGIATYIWILSKNKSEKRRGKVQLIDATNIYTKLRKPLGNKKNEFSPENRAEITKLYTDFSENDLSQIHANNEFIYREYTVKQPLQRDYGITEARIQQMLQSTSVKNFYDEAKVQELESSETKLKAKDAKKLAKYKKNEAVYKQMMSILKENISNKLWMSPEEFEPVLHNLLDGIVDKKLISKIMDGLSQMDKKAEIQHDRKGNIVYDKETADTEIVNIDEPIDDYMQKEVLPFVPDAKAFFDEDLGKKKPVIKTGAEIPFTRYFYKYQKPEDSEKLASEINKLEAAISEEMDSLFKMED >NC_017472|0:10035|6116_7946_+|WP_014566283.1|DBSCAN-SWA MKNSKIQKLAKALRSFSPQPARAILAGEAIIKLDNKDLTFTSKKELLECIRSHSDQSVLMNNLKKWIITQLNTVSEKDLFSILNLLANYSLKEVFNALNYFLISFPNTSKAYYNDGSATGLDLLTTELADVRKDDKVLDPSSGINGAWLELLKNNPNQNMTVQELNEIDAEFAYLNTKILGATNCIVYQGDTLSDPKYTQDGNLQLFDKIVTFPPINARISKDAIIENRFNRFRYGDITYTKGESAFISNAISSLNQTGKAVIVVSDGPLFQGGKVASFRKFLVDHDLIETVIALPSSLLSYSIIPINILIINKNKTDSKGQIQFINANQNEWYQTDKHGKRILSTLGIQKIVELYHSRASVEGKSAIFANTDYKGTLGIKQYILPSEVQLDNSTYHINRSALQNLNTVQLQELVNIKRGYNVTRRNEDKKGRYLTAKVTDITTDHHINDSNLTRINIKTNAESYLIENNDILISTRGTIGKVAFVNNIKQCTVPNANLAILRVKSSKLNTVNMIWLMLYLASPLGQFMIQQVATGTAISTISTKDLGKIPIPVLPLEAQNKAVQQFQTVQAKLNAEKAALQKKIEANQEELYSSMNVTKVLRKENTEN >NC_017472|0:10035|1509_2697_-|WP_014566279.1|DBSCAN-SWA MNKLVKYSAAALIAAGLFGVAGQSVNAATGYQRLTHNAYAYNYNGQRANRKLYRKGSKVRVIGSITLNDGKKYNIIQGNIYIKAVNFKKAKASNDGYKTSLLRNSYVYNSKGQRVRGMKLRKGHSVTYYGQPVKIKGKKYVMIGKNQYIRSANVLLAYNGSTDSNNTDKTNNSDSNNSSSNTTDSNNSANTNNSNSSTNINTSDSNKATNNTPNSNNKKSDNNTSNTKSDSNKNKDNTSTDANTEAKATKADYEALSDALVRSQDADKMYASYPRRKALEDAANKGYDYLTFHNSFDRTDFSAKEIQNAIEAINTAMNNLDANAERAKLPKVTDTDGKWNWTPDKIQQALNVANEVWGSTDAHIVKGIKGYSITKIVLTEPNGTVRSIPLDQYAK >NC_017472|0:10035|5459_5990_+|WP_014566282.1|DBSCAN-SWA MISIDDLKQKYNLNDDLIKDWHTALQTLGVSDFCVSELDADYPAEVSMLKHISQEKIITKTGLESIPKVYQWIFSYLVEKQHQLPYALIMQNDYLYWLYVQVPHTCHDTRGIPHALGNGVPRKDERYDNIVCAETALFDINEIKDKQYNDLIENSFRYYHVNKDGIWRVMEAGEPL >NC_017472|0:10035|2852_3659_-|WP_014566280.1|DBSCAN-SWA MFQDWKETKKEFYGEQNRWVDEEDGVIDFSPDGMKTVKGAENLSYDDYLDIQHDSLGHQKKSLMAIFVNYDMGYYKGEIVKITRKNVLFERLYIDAIRLDGTGYKSKESHVWMSKKAFSKFKLYDCVEFDAEVYRYLKTGHGKMIDYGLRNPTSVKKIQPYQLPSDQQLMAQTIDDILWETSLYHDTVDRINWPDVRNEEEYRLKFNQLFTIMTLNDVSWIGIKNVLQIRHRLWCSGSNKVSKDLVRRVTDAEWSYFQPGMIFLLSLY >NC_017472|0:10035|3800_3947_-|WP_014566281.1|DBSCAN-SWA MITLTYKGSIEYSKDDNMFLEEVLGLDHTFILYGGISMAEVKKDFENG |
6 | Liberibacter_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
15085 : 19607
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_017472|15085:19607|DBSCAN-SWA GCTATGAAGAAATGTATTGCTTGAAGAGTTCTTCTGCTGTCTGATATTGGAGGATTCGGCGATGTTTGTGGTTAATGGCATTTGTTGCCTGTTCAACATAAGCCTTTGTTTTGTCTTTCATACTCTTGCCCTTAGGGAAGAATTGACGCAAAAGACCATTGCAGTTCTCATTGGTGCCCCTTTCCCATGGAGAATATGGGTGGGCGAAGTAGATCTGGCAGCCTTTGATCTTGGTCATATCCGCAAACTCAGAGCCATTGTCAAACGTGATCGATTTAAACCAGGCAGGATGTCTGTCAATCTCATTTTGAAGCAGCCTTTGGCATGTGCTTGCCCGATAGTCAGGAATCTTGATAACTACTTCAAAGCGTGTGGTTCTTTCGGTCAAAGTCATTAAAGCAGGCTGATTCTTGCGTCTGACGCCTTTAACCAGATCTCCTTCCCAGTGCAGCGGCGTTTTTCTGTCTTCAATCTCTTTAGGACGCTGTTCAATGGAATTGCCAAGATTCTTCTTGTGCAAAGCATGACCGCCATGGCCGTGATGACGCTTGTTCCTGCGTCTTTTGAGCTTCATAGGCAGATCAATATTGCTTATGTCCAGCAAGCCCTGATCAATATAGCGATAGACGGTTGGTGTGCTGGGACAAGGGTAGCCTGGCATAGTCCTTTTGAATTCACCAACGAACTCATCGATGCTGGTGGCATCAAATTTAGCTTTAAAGCGTCTAGAGAGCATTCTCAAAAAGACAGCATAATGCTTCAATGGATTGCGCTGATAGCAGTTTTTGCGACGCTTCTCATAATAAATCTGTGCAGTATCAGCGTAATAGTGCTCATACAATAAATAGCTGCTGTCTCTTTGCAGGACGCTTCCGCGTTTGATTTCGCGACTGATTGTCGACTTATGGCAGCCAACTTCTTGCGCGATAACAGTACGGGAAGTTATGCCGGAATCCAGCATTGCTTGAATTTGTCCACGTTGTACGCTGGTTAATTGATGATAGTGCTTAGAAATGCTAGAATTTGAATTGGTCATGAAGATCTTCCTTTCTTGATTTTTCGTCACTTCAAGTTTAGGTCTTCATGGCCTTTTTGTTTAACACTTAGTGTTGCACTTCAATTTTAAATCCGGGAAAACAGAATAATTAAAAAATCGCTTAGATCAGTTACCAATTTACTCTAAGCGATTAAACTAAACAAAATTATTTTTTTAAATCTTGCTCAAGTGCGGTTGCCCAAGCTCTGATTAACTCACTAATTGAATTATACCCGTATTCTTGACTAGCCTTTTTTAAAATACGTTTATCACTTGGACTAACATAAATTGTCATATATGTCTTTTTTTCTTTTTTACGTTTCTTACGTGGAACTGAACTTACTAAATCGTTTGTAGATTCTTTCTCTTGTTCTAAACGGTCTTTTTTCTCATTTTCTTCAATTGATTGTTGAATTTTTGCATCACGAGTAAATGCACTTGATTTAGGAAAGTTTTCTGGAAAAGCCATTTTATCCTCCTTATGTTATTCTCAAACTATTATTTTACGCATAGAAATGCAATTGCATTTCTATGTGTATTTGCATGTGCAATTACACATATTTTGTAATCGTATCAAAAATATTGTCAATTTGCTTATAATTCTTGGGATTTTGATTAGCAAAATAATTGCGCATCTTCATGAAATCCTTGTTGTTCTTATAGTCATCTGGAATTTCAGATAATGGTGTGTTTGCCATTTGTTCCATCTCGCAAATAGGTACCTGATAGTATGTTGAGCGATTAAAAATTTCACGTTCAGGTATTTCTGCTATCCATTCGACTTCCTTATCTTCTTTAATACTCTTTAAGAAGTCCTTGCTGATTCCTGTATTATGCTTAATCATATTTGCAATATAATACACCTTAGCTCTTATCAAGGTGTTACCTGACATAACATCTACAGTTTCCTTTTTCAATTCCTTCATTCGTGTTTCTAATTCAGTGATTGAATCATAACTAAATTTCGAAGGAGTTACTGGACTTAGGACAGCATCAGAAACGGCAATCGCATTTTTAGTTGCAATACCAATATCTGGGTGACAGTCAATAATCATATAATCATAATTAGCAACAATATCGTCATAGTGTTGATGAAGCCACATAAACAGAATTAAGTTTTTATTATTATGCGTTTCTAGGCGGCTTTGCACTTTATCTAATCTTGTTGTACCAGAAATTAAATCGACATTCTTATTTACATGATGTATCTTAACTTCTTTGCGATTTTCATCGTCATTGTAAACGTTAAAAATTTCTTCTACAGTACCTTCTTGATCTGTAATACCGTAAGTTCTACTTAACGAACTTTGTTGGTCTAAATCCATGAGCAGAACTTTATGCCCTTTACTAGCTAAATACTCGCCGTAATTATATGAGAGGGTTGTCTTACCTATTCCACCCTTAATCGTCGCAAAAGTGATGACTTCCATTATTTTCTCCTTCATTGAAAAATGCTTATATTATTTATTATAATTAGTTACACATGCAATTGCAATTCAAACTAAAATTGCATTTTTAATTGCATGTGCAAATACATGTGTAAATATGATAAAAATATAGATACTACCTACTTAAATAAACTACACTAAGATTGTATTTTTTCTTTTTTAAGTACTTAAAAAAGAAAATTGCTTGAAAAAATTATGGAGGGACATTAATGGCTGTTCAGGAAACACCGATTAAGAAATTAACTCATAGCAACGCTGTTGATGCAGAAATTGTTGAACATAATGGACGTTTCGTTATTGTAAATGGTAAAAATGGTGAGATCATTGAGAATTGCAATGGATGGGGATATAAAACACGAGAAAAGGCTTTAAAATTTCTTCAAGCTAATTTTGACTACAAGCCTAAAAAAGCTGAATTAATTAAAGATAATGTCGATTCAGCTGTAAAGGAATGGAAAAATGGAACCATAGAAGTAAAACAAGTTATCCATTCGCAAAAAGTTACTGCTAGACAAAAACCAATACACTTAGCTAATATTACTAAAAAATATGAATTCGAAGATTTAGCTTTGGCTAAGGTCAAAATGAAAAAAGGTGATCGATACATTATTGTCGATCGCTCAAATAAGGTAGTCGATAATTGTAATGGTTATGGTTATCGTTCTAAAAATAAAGCCATTGCTTATTTAACTCGCTTATACAAAACTACTGAGAAAAAAGGTAAGCGTAAAAAGGCTAACAAAGTTTTTAACACGCAACTAAGAAGTGCTTTATCAGAAAGTCAAGAGGAGATTGATTGGTCAAAAGTTAAGCTAAATTCTGAGCAACACAGGGCAATTGAATTAATCGAAAAAGGTGAAAATGTCTTTTTAACTGGTTCAGCTGGTACAGGTAAGAGCTTCTTACTTAAGTACATTATCCATAAATTTAACGATGATTATACAAGAGTTTGCGCACCAACGGGACGAGCCGCTGTTAATGTAGGTGGTACAACTATTCATCGTTTATTACATTTAAAGCTGAGTACAGATACAATTAATGACCATCCTACTACAATGCCAAAATCTCTTAGGGACACTCATAGAGTAATCTTAGATGAAATTTCTATGGTGAGAGCAGATGTATTTAAGTGGCTTAGTGAATGCCTTAGATTAGCAGAAAGAGAAAATAATACACCTATCCAACTTATTGTTGTAGGTGACTTTTATCAATTGCCCCCAATTGTAAGTACAGATTATGAGCGACAGTACTTTGCTGGCGGAAAAGAATATGCATTTAATTCTGATGAATGGGCTAGTTGGCACTTTAAGCCCGTAATTTTCCGTAACATTGTACGTCAAGATAATCCAGATTTTATTGAAGCCTTGAACAAGATTCGTGTAGGGGATGCCAGCGGTATTGAGTTTTTTAATAAAAACGCTGCTACAAACGAAATAGATCAAGCCATTACGTTAACTAGTAGAAATGCAACAGCGGAGAAAATTAATCGAGAAAAATTGAGTCAAATCAAAGAGCCAGTACATACTTTCATTAGTGAATCGGAAGGAAAAATCAGTGATTCTGAAAAGCCAGTACCTGATAAAATCGAGCTTAAGATCGGTACTCAGGTTGTCGCTACCGCAAATGGTGAAAATTATAATAATGGCGACATTGGTATTGTTACGGGATTTGGTAACAACGGCTCTGTGGAAGTTAAGTTTGCACCAAATAGACCAGCTATTCACATTAAGCCAAAGGAATGGAAAGTTTATGATTATTTAGCTAGTGACAAAGGTGCTTATCTTAAAACCATGATAGGATCTTATGTCCAAATCCCGCTCAAATTAGGCTATGCGATTACAATTCACAAGTCTCAGGGACAGACTTATTCACGAGTTAATGTGCAACCTGCGGGATGGAGTAATGGCTTATTGTACGTATCTCTTTCTAGGTGTAGAAAGGTTGACTCTATGTACCTGTCATCATATTTATCAAGGAATATGGTTAAGACTAGTCCGTATGTTAATGATTTTTATTCAAGTCTAGAAAAATTGAATTAG
Protein sequences of DBSCAN-SWA_2 >NC_017472|15085:19607|17777_19607_+|WP_014566290.1|DBSCAN-SWA MAVQETPIKKLTHSNAVDAEIVEHNGRFVIVNGKNGEIIENCNGWGYKTREKALKFLQANFDYKPKKAELIKDNVDSAVKEWKNGTIEVKQVIHSQKVTARQKPIHLANITKKYEFEDLALAKVKMKKGDRYIIVDRSNKVVDNCNGYGYRSKNKAIAYLTRLYKTTEKKGKRKKANKVFNTQLRSALSESQEEIDWSKVKLNSEQHRAIELIEKGENVFLTGSAGTGKSFLLKYIIHKFNDDYTRVCAPTGRAAVNVGGTTIHRLLHLKLSTDTINDHPTTMPKSLRDTHRVILDEISMVRADVFKWLSECLRLAERENNTPIQLIVVGDFYQLPPIVSTDYERQYFAGGKEYAFNSDEWASWHFKPVIFRNIVRQDNPDFIEALNKIRVGDASGIEFFNKNAATNEIDQAITLTSRNATAEKINREKLSQIKEPVHTFISESEGKISDSEKPVPDKIELKIGTQVVATANGENYNNGDIGIVTGFGNNGSVEVKFAPNRPAIHIKPKEWKVYDYLASDKGAYLKTMIGSYVQIPLKLGYAITIHKSQGQTYSRVNVQPAGWSNGLLYVSLSRCRKVDSMYLSSYLSRNMVKTSPYVNDFYSSLEKLN >NC_017472|15085:19607|16671_17550_-|WP_014566289.1|DBSCAN-SWA MEVITFATIKGGIGKTTLSYNYGEYLASKGHKVLLMDLDQQSSLSRTYGITDQEGTVEEIFNVYNDDENRKEVKIHHVNKNVDLISGTTRLDKVQSRLETHNNKNLILFMWLHQHYDDIVANYDYMIIDCHPDIGIATKNAIAVSDAVLSPVTPSKFSYDSITELETRMKELKKETVDVMSGNTLIRAKVYYIANMIKHNTGISKDFLKSIKEDKEVEWIAEIPEREIFNRSTYYQVPICEMEQMANTPLSEIPDDYKNNKDFMKMRNYFANQNPKNYKQIDNIFDTITKYV >NC_017472|15085:19607|15085_16120_-|WP_013437050.1|transposase|DBSCAN-SWA MTNSNSSISKHYHQLTSVQRGQIQAMLDSGITSRTVIAQEVGCHKSTISREIKRGSVLQRDSSYLLYEHYYADTAQIYYEKRRKNCYQRNPLKHYAVFLRMLSRRFKAKFDATSIDEFVGEFKRTMPGYPCPSTPTVYRYIDQGLLDISNIDLPMKLKRRRNKRHHGHGGHALHKKNLGNSIEQRPKEIEDRKTPLHWEGDLVKGVRRKNQPALMTLTERTTRFEVVIKIPDYRASTCQRLLQNEIDRHPAWFKSITFDNGSEFADMTKIKGCQIYFAHPYSPWERGTNENCNGLLRQFFPKGKSMKDKTKAYVEQATNAINHKHRRILQYQTAEELFKQYISS >NC_017472|15085:19607|16286_16589_-|WP_014566288.1|DBSCAN-SWA MAFPENFPKSSAFTRDAKIQQSIEENEKKDRLEQEKESTNDLVSSVPRKKRKKEKKTYMTIYVSPSDKRILKKASQEYGYNSISELIRAWATALEQDLKK |
4 | Staphylococcus_prophage(33.33%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
43268 : 46877
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_017472|43268:46877|DBSCAN-SWA ATTATTTTGTTAGCTTTTCATACATTTTGAATAGGTGAGCTACAATGTCAGCTTCGCTGCTATCCGCAGGCAAGTTATACATTTTTAGAATCAACTTATTGTTTTCTTTGTGTGCTTTTATTAATTCTGGTGGCATTCCCAACGGATCATATAGATCCGCTAATGAATCATCTGGATATTTTTCTCGCGCATCTAAAATTTCTTGTGCTGACTTATTTAATTGTGCTTTTTGCTCGTTTGTAAAGTCAAGCCAAGGGAAATTAGCATATACGGCCGGTGAATAACTAAAATCACTTTTTAATCGGCCTGTTACTACGTTAACCCAAGCCATGTGAACACTAGACATTAATACGCTAAAAATCCACAAACTTGTACTAGGAATTATATAAAGTCTATTGCTTGCAATAATATTTTTGCTAAGTATTGCCATAGGAATGTACTTTCTATTACCTGATGATGGACTTGGGACTGCAATATAATCTACATCAGGTTGACGTATTTGACTAAATAAGTATGGAGCTTCCTCTGCATGGTCGTGCACTGATTTAGTAGGGCTCTTTCTTCTAGCTTCTATGACAATTTTTAGTCGTTCTCTGATTAATTTATTATTAACAAATCGCTTAGAATCAACGTCTTTCAACCAGAAACAATATCGCCTTTTACCTTTTATTAATTCACTGGCTCCCATATATGGTTTTACTAAATCTTTTAATTCGGGATATTCAGTAATAAATTTGTTGTATTCATCACTTTTTAATATTAATCCACCACCATCTATAGGTTTGCTTCCTTGACTCATTAAAGGCATGTTATAAGGCGGTACGACTTTTCTGGATTTTACATAAATATCATCATAATCAATCAAATATCCATTAATATGATCTACTTTCTTTACTTTTTCACCTTCAAATAGTGTTTTCTCACCTTTATATTTTCCGCAAGTGAAGCCTACAATCACACAATGAACCTTTGCAGTATCTTTCGCCTCGTTGTTCCAGACGAAGCTCCGATAAGCAAATATAATAGTTAGATTCTTATCTTCTATTAAATGCTTCCATAAAATGCCAACTTGTTCCCCTTGTGTAATTGAATTAGTAGACACGAATGAAGCTTTTATATTAGTACCTTGCATCATATCGGCAGCTTTTTCATACCAAGCTGTTACGTAGTCTAATTTACCATGCTTAGGTAAATCCTTAAATACACGATTCATGTCTTCGGTTTGTTGTTTTTTTAATTTTTTATTCTTGGCTGGTAAAGCTGATAAGCCAGAGAATGGAGGGTTTCCAATAATATAATGGCAGGCATAATTAGGAATAACTTTATTCCAATCCATTTCTAGGGCATTACCTTCATGAATCCGAGTATAGGTCTTAAGTGGCAAGAAATCCCAGTCTGGAGCGTATAACAAATCCTGCGTTTCTTTCATCATTTGATCTTCTGCAATCCATAAAGCCGTTTTAGCTACAGATACAGCAAAATCATTTATCTCAATACCTGAAAATTGTTGAATACTTACTTTTATCCAATCTTTAGCCTGTCCAACATCTAAAACACTTTCACCGCCTAATTCTAGTCGGATTGCTTCGTTTTCTAAGCGTCTTAATGAAAGAAAGGTTTCAGTTAGGAAGTTTCCGCTACCACATGCAGGATCAAAGAAAGTAAGATTGGCCAGTTCTTCTTGAAAAGCAACAGCTTTATCATGAATTGTTTTTTGATTCTTATATTGTTTGATCTTCTCAAGTTTGGCTTTTAAATCGTCTAAGAACAGTGGATCTATAACTTTATGAATGTTTTCAATTGAAGTATAGTGCATACCGCCTTCACGTCTAGTATCAGGATTCAGTGTACTTTCGAAGACCGCCCCGAAAATAGTCGGAGAAATGTCACTCCAATCAAAGCCACGTGAAGCTTTATTTAGAATTATGTCTTTAAGCTTTTCCGTAAAAGGTGGAATAATGATATCTTCATCGGCAAAAAGACCACCGTTAACATAAGGAAATTGAGCTAACTCAGGGTTTTCATCTTCCCAAAACGGATCTTCAGCTTTTCTATCTTTCGTATCTAATGTTTTAAATAAAGCTTTTAAGGCTAATCCCATCTTGTTTGGCTTTACTGGCTCCAAATAGTTATAAAATTGTTCTTTAGTAGGGAAAAGCCCAGCGTCTTCTGCATATAGACAAAAAACAAGGCGAACACAGAGAGTGTTAATACTGTGTTTGATTTGTTCATCGGCTGTTTGATCGTATTTGGCAAAGATATCAGCAAGTTCATTGTAGATCTGACTTACAAGTGTTCCTGCGGCAAATGAAACTCTTTTTTCTTTTTCAAGTCTTTTAGAATCGTCAGGCTTTACTAAAAAGTTAAGTAAATAAAGATCTTTCTCTAAATCTTCTATATTAATGATTATGGGCTTTGTATCAATAGATTCACGCACATCGTAAACAACAATCTGATTGAAATTAGATAAAACTAAAAAGTTAGCTTTTTCTTTACTGCCCAAATGATCATCATACCGTTTAGCTTGTGAAAATGGAGTGATTTTCTCTGTATGATTGCCATTCGGTCTATTCTCAGGTTTTGTTAAATCAACCCCATTGCTTTTTTGTTCAATCAACACATGCGTTGAAGGAATATATGCATCAATGGATCCATGAAATGATTCATTTTCATGCAATTTGATCGGCTTTTCGTATTCAATGTAAGTTTTATTATCAATTTGTTCTTTGGGTACACCGAGCAATTCTAGAAGCGTATTCCAATACGTAACTTTATCAGCTACTTCGCTTCCTTGGTTTTGCCATGTATCGATAAAATCTTTTAGTTTTTTTTTGTTAACTTTAGTCATAAGCTATGTCCTTAATTGAAATTTATCTAATATCTTAATTTTATAATAAATACAAAATAAATAAACTGTATTCAAGAGTTGAAAGTCTTGAATACAAATTCGAAGACTTGTAAAATTAATGCATCAGCTGTATTCAAGACTTAGTGTCATAGGATGAAAGGGAGATACTGCAAAATGATCTACGGTTATGCTAGAGTTTCAACCGCTCAACAAGATTACGCTACTCAGATAGACGATTTAAAGCGCGCAGGTGCTACGAAAATATATAAAGATAAATACACTGGTACTACTGCCAACCGTCCTGAATTCGACAAGTTGATGGACAAACTTCAAAATGGTGATACCTTGATCGTCACTAAATTGGATCGATTGGCTAGGAATACGCAAGATGCGTTAAGCATCGTAAAGCAAATGAATGATGAGGGCGTCATTCTACGCGTCTTAAATATTGGAACCATTGATAATTCACCAAGTGGACGTCTGATCTTCACTGTGTTTAGTGCCTTTGCAGAATTTGAGCGAGATTTGATTGTTAGTCGAACCCAAGAGGGCAAAGCATGGGCTAAAGCTAATAATCCTAATTTTCATGATGGTATGCCAAGAAAATATGATCAAGAACAAATTAATTTTGCTTGGAAGTTGCATACCCAAGACCACATGAGTTATTCTGAAATCAGCAAGAAATTGGGCATGTCTAAAGCGACTATTTATCGGCGTTTTAGAGAATTAAGGGATTCGCCCAACAGAAAAAGCCGCCTCTAG
Protein sequences of DBSCAN-SWA_3 >NC_017472|43268:46877|46289_46877_+|WP_014566315.1|DBSCAN-SWA MIYGYARVSTAQQDYATQIDDLKRAGATKIYKDKYTGTTANRPEFDKLMDKLQNGDTLIVTKLDRLARNTQDALSIVKQMNDEGVILRVLNIGTIDNSPSGRLIFTVFSAFAEFERDLIVSRTQEGKAWAKANNPNFHDGMPRKYDQEQINFAWKLHTQDHMSYSEISKKLGMSKATIYRRFRELRDSPNRKSRL >NC_017472|43268:46877|43268_46115_-|WP_014566314.1|DBSCAN-SWA MTKVNKKKLKDFIDTWQNQGSEVADKVTYWNTLLELLGVPKEQIDNKTYIEYEKPIKLHENESFHGSIDAYIPSTHVLIEQKSNGVDLTKPENRPNGNHTEKITPFSQAKRYDDHLGSKEKANFLVLSNFNQIVVYDVRESIDTKPIIINIEDLEKDLYLLNFLVKPDDSKRLEKEKRVSFAAGTLVSQIYNELADIFAKYDQTADEQIKHSINTLCVRLVFCLYAEDAGLFPTKEQFYNYLEPVKPNKMGLALKALFKTLDTKDRKAEDPFWEDENPELAQFPYVNGGLFADEDIIIPPFTEKLKDIILNKASRGFDWSDISPTIFGAVFESTLNPDTRREGGMHYTSIENIHKVIDPLFLDDLKAKLEKIKQYKNQKTIHDKAVAFQEELANLTFFDPACGSGNFLTETFLSLRRLENEAIRLELGGESVLDVGQAKDWIKVSIQQFSGIEINDFAVSVAKTALWIAEDQMMKETQDLLYAPDWDFLPLKTYTRIHEGNALEMDWNKVIPNYACHYIIGNPPFSGLSALPAKNKKLKKQQTEDMNRVFKDLPKHGKLDYVTAWYEKAADMMQGTNIKASFVSTNSITQGEQVGILWKHLIEDKNLTIIFAYRSFVWNNEAKDTAKVHCVIVGFTCGKYKGEKTLFEGEKVKKVDHINGYLIDYDDIYVKSRKVVPPYNMPLMSQGSKPIDGGGLILKSDEYNKFITEYPELKDLVKPYMGASELIKGKRRYCFWLKDVDSKRFVNNKLIRERLKIVIEARRKSPTKSVHDHAEEAPYLFSQIRQPDVDYIAVPSPSSGNRKYIPMAILSKNIIASNRLYIIPSTSLWIFSVLMSSVHMAWVNVVTGRLKSDFSYSPAVYANFPWLDFTNEQKAQLNKSAQEILDAREKYPDDSLADLYDPLGMPPELIKAHKENNKLILKMYNLPADSSEADIVAHLFKMYEKLTK |
2 | Leptospira_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
52711 : 66271
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_017472|52711:66271|DBSCAN-SWA TATGAGCAACATTAGAATTTCAAATTTATCTTTTAGATATGATGACAGCAGTGAAAATATTTTTAATAAATTGAACTTAAACTTAGATAGTACTTGGAAACTAGGACTTGTGGGCAGAAATGGACGTGGTAAGACAACATTTTTGAACTTACTGCGTAGAAAATTACATGGTCTAGGCGAGATTCAAACTCGGCTAAGTTTTTCATATTATCCAATTAAAGTTGAGGATCAAAAGAATATTACCTTATATGAGTTACAGAAGCAGGTCGCTTTCGAAGAGTGGGAATTAGAACGTGAGTTGAATTTAATGAATGTCAACCCAAACTTATTATGGCAGCCGTTTAATACTTTAAGTGGTGGTGAGCAAACTAAAGTTTTACTGGCTTTATCTTTTACAGATAAAGATTCTTTTGCTTTAATTGATGAACCTACAAATCATTTAGATGAAGACAGTAGAAAAGAGATCAGCAATTATCTAGGTAAGCATGAAAAGGGCTATATCGTGGTTAGTCACGATCGTGATTTTTTGAATCAAGTAACCGATCATATTTTGGCGATTGAAAATATGGAGATTCATTTGTATCAAGGAAACTTTGCTGCGTATGAAGATACGAAGCAGAAGAGGGATGAGTTTAACCGCGAGAAGAATCAGAAGCTGAAAGGCGAGATTCGTACATTGAATGAGAGTCGTTTACGTCTAAAGGGATATTCAAGTAAATCTGAAAATCAAAAGAACGCAAAGGCTCACTCTAATGAAATTCATGCGTATATTAATAAGGGCTTTTACAGTCATAAAGCTGCTAAAGTAATGCAGCGATCCAAAAATGTTGAACGAAGAATGAACGATGATATTCAAGCAAAACAAGGCTTGATGACTAACATTGAAGACATTCCAGAACTGACAATGAATTTTCAACCTAATTACCATTCGACTTTACTTGAAGCGCAACATCTTGATTTGCAAATAGAAAATATAACTCTATTTAAAGATCTAAATTTAGTGGTTAAAAATCACGGTATTGTTTCTCTTGAAGGAAAGAACGGTTCTGGCAAATCGACTTTCTTGAAAATGTTGTTAAATAAAACATTTTCAGTCACTTATCAAGGTAAGTATGAACTAGCAAATGGTTTATCGATTTCATATTTGCCACAAAATTTTACAGAATATCATGGCACTTTACACAATTTTGCATATGAACATAAAATTTCTTATGAGAAATTACTCAATAATTTGAAAAAGATGGGCTTTCCACGAGCAGGTTTTGTGACACCGATTGAGGAGATGAGTATGGGACAGCAAAAACGAGTAGCACTGGCTAAATCTCTAGTTGAACCAGCCGACTTGTATTTATGGGATGAACCAGCAAATTATCTAGACGTTTTCAATCAAGATCAACTGATTGAATTATTGAAAAAGGTCAAACCTGCCATGCTGTTGATTGAACATGACGAATACTTCATTGAGCAGGTGACGGATCACAGAGTGAGATTGGATATAGCAGAATAGAAAGAAGCACGTTACTGGTTAGTTAGTAACGTGCTTTTGTTCAGATCTTTTGGACTAATGATTCTAAATATTGTTTTCTTCTTTGAGCAGAACTTTTATCAATATTGCCAAAGCTTAACCATTTGATATGTTTCATCCCTAATTGTTTCAAGGTTGAATTGATGAAAACACTTTTAATGGCACTACCGCAGAACATTTTCAAATACCAGGTGGGTGAGGTAGAGGTCGTCATCACTTCAACACGCTGAATATTCTTTAAATGCCCAATGACACCAGTAGCACCAACATTATATGCAAATTGTTTTTTCATAACTTTATCGATGAAGCCCTTAATAATTGCAGAAGTATCATTCCACCAAACTGGGAAGATAAGGCGATTAGCTTCTTTCAAAAGCTTTTGATATTTTTTAACGTTAGGATCACTGGTTTTACCAGCTTTAAATAAGGAAAGTTCACGCGCATCATATGCTGGGTTAAAGTTATCCCAAGTAAAAGGTGATCAGGATTGAAGTTGAAGGAACTTAGACGGACAATTTTACGATAAACAATAGCGGCTTCTTGAACGTTAGAGAGATTGATTATTGTTTGATCTTTACTGGTCAAATTGAATAATTGGGCTAACTTTTTAAAATCGTTATTGCTGTATGAAGTGATCGTTATTTGTTTTTTATCGACTGACCAGTATTGGTTAAAACATGCTGGAAAGGTAATGCCGAGTCCTAAAATCAAGACGATGGCGAAGACCCACCAAGGTTGATAGCCAAATCCAGTTACAAATAATGCTAGAAAAACAACAAAGATAGTTCCTAAAACTAAACTTAAAATATGTGGCTTGTTTCCGACTGTTTGCTTTTTAGTAGGTGAAAAAACAGGGGAGATAGCCTTCATCTCATTCATAGCGTCTTTCATGATCTGCACACTTTCAACACTAGTTAAATCATTTGCCACCTTAAGTTTACCCCCCCTTAAGAGAAATACAACTTAAAAGTACGTATCATTTGTAAATGCTTACATGGGGCGAACTAAAAAACAGAAGCACACGCTGCCTGACTACAAGCAGCATGCAGATGTGTTGGAAAATATGATCATCAAGCTTTACTCTAAAGGCATAACCACCAGAGAAATAGCCGACTTGATTGAAAAGATGTATGGCAGTCATTACAGCCCAGCTCAGGTATCTAACATCTCCAAGCAGATGATTCCAAAAGTCGAGGCCTATCATCAGCGCAAATTCAGTGACAAGTTCTTCTGCGTTTATTTGGACGCTACCTACATTCCTTTGCGTCGCATAACCTTTGATCGTGAAGCAGTTTATATTGCCATTGGCATCAAGCCTAATGGCCACAAGGAAGTCATCGATTACCGCATTGCCCCCAGCGAAAACGTCGAGAATTGGACGGAAATGCTTCAAGACATGAAGTCTCGCGGTCTAGAACAGGTTGAGCTCTTTCTTTCAGATGGCGTTGTAGGCATGAAGACAGTGCTGGAGCAGACTTATCCGAAAGCTCATTTTCAACGCTGTCTAGTTCATGTCATGCGCAATATTTGCGCTAAAGTACGCGTAGATGATCGCGAAACGATCATGAATGAATTCAAGCAAATTCATCAACAGCCAAATAAAGAAGCTGCGATCAAAGTCTTGCACGCCTTCTATGACAAGTGGAACAGAGCGTACAACCATGTCATCAGAAATCTCAAAGAGATTGAACCTGATCTGCTGGTCTTTTACAGCTATCCTAAGCAGATCAGGGCTTCAATTTACTCTACCAATATGATCGAATCCTTTAACAACGTTATCAAGCGTAAAGCTAAGCCTAAAGCAGAGTTTCCAACTGAACAGTCGCTTGATACTTTTATTGGCATTCAGGCAATGAGCTACAACGAACGTTATTTCAATCGAATCCACAAAGGCTTTGGTCAGGTTCAGGATACCTTAGAATCCTACTTTGATTAAATAAAAGATTAAAAAATCAATCTACGAGAAAAATCTATTTACACAAAAAATTTGACAGTTTCAAAAAGAGAGTTTTGTACCATTTCTTTAGTACAAAACTCTCTTTTTGTTTAGTTAATAATTTAAAAATTATGGATTATTTCTATGATTTTCGTATCAGAATCAGGCTATTATTAGCGCCTCTATCTTTTTATAATCCACATTATAGATACGTTCACCATCCGACAACTGTATTCGATCGTGCCGCTCAATTAAATTCTCTAACTGCTTGGTAGTTGCTGAAATTGGTAAATATAGTTTTATTTGCTTACCACCTTGCAATAATGCTCATGTACGGTAAAGAAGCAAGCAACCGAAAACAATAGATAGACCGCCAGCACTACACTATTCCGAACCAAAGACCAAAAGATAAGCATTTTGAGAAAATCTAAACTTTTGGATTTTCCTACAATGCCGACTACGGCGGAATCCCTCCCACTCCTTATATATTTCTTTCTGTATACATTGAATTTGTATTTAGTAAAATGCAGACAACACCACGGATCGGCTTTTGGCTGGACAATTCCAACCAAACACCGCAGCAGACAGTAGAAACCATTCTGAACGTTAGGAAGCCGGTATGATTGTTACATATAAGGGGAAGAAAAATTTCTTTTAGGTACTTGCTTTCCTAAAACTGATGTGATACAATGATTTAAACCAGAAAAGGAGTAAAAAATATGCGGCAAGGTATTCTTAAATAAAACTATAATCAAATAGTGGGAACAAAGGATTATGATAGCTCCTTTTGTAGGGGCTTAGTTTTTTGTACCCAATTTAAGAATACTTTTGCCTTATCAATTTTGACATATCCCCAAAAACAGCAATCACAAACAGGTGTATGCTGTATATGTGTATGTCCGCAACTTATAATCCCCAGTGGTAAAAGTATTTTACTGCTGGGGATTTTTATGCCCTTCGGGGCAGTAAAGGGAGGACAATCACATGAAAATAATCAATATTGGAATTCTTGCCCATGTAGACGCTGGAAAGACGACCTTGACGGAGAGCCTGCTATATGCCAGCGGAGCCATTTCAGAACCGGGGAGCGTCGAAAAAGGGACAACGAGGACGGACACCATGTTTTTGGAGCGGCAGCGTGGGATTACCATTCAAGCGGCAGTCACTTCCTTCCAGTGGCACAGATGTAAAGTCAACATTGTGGATACGCCCGGCCACATGGATTTTTTGGCGGAGGTGTACCGCTCTTTGGCTGTTTTAGATGGGGCCATCTTGGTGATCTCCGCTAAAGATGGCGTGCAGGCCCAGACCCGTATTCTGTTCCATGCCCTGCGGAAAATGAACATTCCCACCGTTATCTTTATCAACAAGATCGACCAGGCTGGCGTTGATTTGCAGAGCGTGGTTCAGTCTGTTCGGGATAAGCTCTCCGCCGATATTATCATCAAGCAGACGGTGTCGCTGTCCCCGGAAATAGTCCTGGAGGAAAATACCGACATAGAAGCATGGGATGCGGTCATCGAAAATAACGATAAATTATTGGAAAAGTATATCGCAGGAGAACCAATCAGCCGGGAAAAACTTGTGCGGGAGGAACAGCGGCGGGTTCAAGACGCCTCCCTGTTCCCGGTCTATTATGGCAGCGCCAAAAAGGGCCTTGGCATTCAACCGTTGATGGATGCGGTGACAGGGCTGTTCCAACCGATTGGGGAACAGGGGAGCGCCGCCCTATGCGGCAGCGTTTTCAAGGTGGAGTATACAGATTGCGGCCAGCGGCGTGTCTATCTACGGCTATACAGCGGAACGCTGCGCCTGCGGGATACGGTGGCCCTGGCCGGGAGAGAAAAGCTGAAAATCACAGAGATGCGTATTCCATCCAAAGGGGAAATTGTTCGGACAGACACCGCTTATCCGGGTGAAATTGTTATCCTTCCCAGCGACAGCGTGAGGTTAAACGATGTATTAGGGGACCCAACCCGGCTCCCTCGTAAAAGGTGGCGTGAGGACCCCCTCCCCATGCTGCGGACGTCGATTGCGCCGAAAACGGCAGCGCAAAGAGAACGGCTGCTGGACGCTCTTACGCAACTTGCGGATACTGACCCGCTTTTGCGCTACGAGGTGGATTCCATCACCCATGAGATCATTCTTTCTTTTTTGGGCCGGGTGCAGTTGGAGGTTGTTTCCGCTTTGCTGTCGGAAAAATACAAGCTTGAAACAGTGGTAAAGGAACCCACCGTCATTTATATGGAGCGGCCGCTCAAAGCAGCCAGCCACACCATCCATATCGAGGTGCCGCCCAACCCGTTTTGGGCATCCATCGGACTGTCTGTTACACCACTCCCGCTTGGCTCCGGTGTACAATACGAGAGCCGGGTTTCGCTGGGATACTTGAACCAGAGTTTTCAAAACGCTGTCAGGGATGGTATCCGTTACGGGCTGGAGCAGGGCTTGTTCGGCTGGAACGTAACGGACTGTAAGATTTGCTTTGAATACGGGCTTTATTACAGTCCGGTCAGCACGCCGGCGGACTTCCGCTCATTGGCCCCGATTGTATTGGAACAGGCATTGAAGGAATCAGGGACGCAACTGCTGGAACCTTATCTCTCCTTCACCCTCTATGCGCCCCGGGAATATCTTTCCAGGGCTTATCATGATGCACCGAAATACTGTGCCACCATCGAAACGGTCCAGGTAAAAAAGGATGAAGTTGTCTTTACTGGCGAGATTCCCGCCCGCTGTATACAGGCATACCGTACTGATCTGGCCTTTTACACCAACGGGCAGAGCGTATGCCTTACAGAACTGAAAGGGTATCAGGCCGCTGTCGGCAAGCCAGTCATCCAGCCCCGCCGTCCAAACAGCCGCCTGGACAAGGTGCGCTATATGTTTCAGAAGATAAGGAAATCGCGTTAAAGTTATTGGATCAATAGAATTAAACGGAAAAAAGTACAACATTATTGCTGGCAATTTGTACATCAAAGCTGCTAACTTTAGTAGTAAAAAGACTGCTACAACAACAGATTTAGGTGATGGCTATGAAACTACAATGCTTCACAATGCGTACATTTACAATAGTAAAGGCAAGCGTGTAAGAGGCAAGAAGCTATTGAAGAACCATGATATTACTTACTACGGCAAGGTTTTGATGATTAAAGGCAAGAAATACGTTCAAATTGGTGACAATCAATACGTACGTTCAAGCAATGTATTATTAGCTTACGATGGTCCAATTAGTTCTAACAGCAATGTTAACCGTCATGCTACTAATTGCAGTTCTAACAACGATACCTCAATTAATAGTAATAATTCTACTAATAACTCAAAGAATAATAATGTTGTGAACAACACAGCTAATAGCCAAAATGGTTCCAAGAGTAGTAAAACGAATCAAACTAACAATCAATCTGCAAATATACTGCGAAATGGTAATCAAAACAATCAAACTAACACTGATGTAGCCACAGATACAGACTTTGAAGCATTAAGCTTAGCTATTCAAAAAGCAGAAGCTACCAAATATTACGATGCTACTTTTGCTAGAGCACAAGCTTACCATCAAGCAAAGGAAGCAGCTGAAGTACTCATGGTAAACCATAAGCATCCATACAAATATCAGCCTGTAATAACTGCAGCAGAAGTTCACGCTGCAACAGCAAATGTAGAAGCAGCCGCAGCAAACCTAGATGGCGATGCTGAATATGACAAAATGCCTAATGTGAAAATTGAAAGAGCAACAGATGGAGACATTAAATACGATTGGACTCCTGCACAAAAACAATTAGTCTTAGACATTGCAAACGAAATCCATGGTTCAACCGATGCACATTATTTTGATAATGATCGTCAAATTGGTTTGACCGATGGCAATGGGATGGCACATACATTCAATACTAGTTACTTTTTACATGAGACTTATTAATGAAAAGATGCTATACAATGTAGCATCTTTTTTGTATCTAAAATATTAGGCACTACTTAGTGCTGTAATACTTAAAAATTTTTATTAGTCTTCTTTTGATTTTTAACCATTATTGTAGTAATTAACGTATTCTCATGTACGCTTAATTATATTGAGCTTTCACATACTGCTGAGAGGGCAATTCTAAAGCTATTTTTATTTTAAAACACTAATAGTTAATCAATTATTTTCTAGAATCTTCCTTATTATATATCCTTACCAAAGATAGCTTCTTCTACATCGTTTGCTGCAGGGTTTGTTGCGGCATCAATGTTGTGAGCGTAGATTTCTGTAGTGCCAATGTTGCGGTGACGCAACAATTCTTGCGTTTGCTGAAGTGTAGCACCGTTAAGTAAACTCAAGGTTGCAGCGGTATGTCTTGTAGAGTGAGCAGTTAGGCGAGGGCTGTCGTATCCAGCAGATATAAAAGCAGTCTTAACGATACGCCTGATAGAACGAGTAGTCATGCGGCCATTGGCATTGTGGTTACTAGTACTGACAAATAATGGCTTGCTTAGATCATTTGCTTTTCTAACGCTTAAATAGTCTCTGATTGCTGACTCCACGTGTTGCGGCATTCTAATTAAGTCATCTTTTTCTTCATGCCCTTTACCTTGAACGTACAGTACAGTCATGTTGCCTTTAGTACGAATGTCATCAATATCAGCACGTGAAACCTCAATAGTTCTAAGTCCCATTGTTAACATCGTGACCAACATAGCGTAATCACGTTTACCTTTGATGGTAGAACGATCAATCTTATCTAAAATCTGTCTAGCTTGAGATCCAGTTAGGTAATCCTTTTTAAAGTTTTTGCTAAGATGGCCACTCTTAATATATTTGGCAATGTTAGGGTAGAAGCCAGCTTCTTCAGTCCATTCGAAGAATCTTTTAACGGCAATAATATAGTTTTTCACTGTCGTTGGCTTTTTGCCACTGTTTTGTAGTTTCTGACGATATTGCCGCACGGTCTCTGAATCAGGGTGACCAATCTGATTTTGTCTCAAATAAAGAAACCATTGCTTTAATGATCCACGATAAGTTCGAACTGTATTTGGAGTGGCATCAATAAAAATAATAAATTGTTGAAAAAGGTCTTCCAGGTTATTTTGCTGAAAAATTAATTGCTTATTCGAAGTGTTTCTTTGAAAATTATTTTGCGGATTTTCCAATTCATTTTTTGTCATCATCGATCACCAAATTTCGTCTAAATTTTTAACTGTTTTAAGTCTATTTTACACTAATTTTCAAAGTCCGCTAACGCTAGTTTGTGGACTTTCAAATAGTATCACCATTTTTTAGATTTTTAAAGCATTAAAAAAGCTAAATTTCTTTTGTCTTGAACATTAAGCTATCTTATGTTCATAATAAGATTAATAAGTAAAGAAGGAGAAAATACAGTCATACGAAAACCTAAAATAACTATAACTGTGCGTAAAACTACTAAGAAGGAAATGATTCAAGGTACAATAGTCTGGGGAGTTTTAATCGCAGTATTGGGATTATTTTTGCTCCAAAAATAAAAATATCAAATTAACCACAATCTATTGGAAATAATAGCTTATTAGAGTAATTGCATATTTTGTTAAGTTTTCAAAGAAATATGTGATTGCCTTTTTACTTTAGAAACCATAGAGATATTAATCAAATGCAAAAAACTTTCTGATAAATGGAAAACCATATATCAAAGAGTTTTTCAAACAGGCCTATTTAGAAGAATTCTATGTTTTATTTCACTAAATTTTAGAGCAAAAACTTAAAAGTATTGGCTAAGGCTAGCGACATTTATGATGTCTTTTAAAATGTTCTAAGTGTGTGGGGGGTTAGATTCTTCGTTGCTTTGTTTTCTACTTTGATAAAAAAATCATTGCTGGACATACGTACTACCATATTAAGGGGTATCAAGACTTGGGTAAGATCTGGTAATGTCAAACTTATTAAGCACAACAAAAAGTAAATAAATTTAATTCTTAAAAAAGCACTAGTTCAGTCAGTAATGCTGAATTAGTGCTTTTTTGATATTTTATCAACTTTCAAGATTTTAATATTGATACATGTACAGATAAAAATGCATGTATTATTACTTGTACTACTATCATAGTAAAATTGACATATTCATGAATTATTCAGTAGAATTAAAGCAGGCTTTCAGCAACGGTTAGCTACCGCGAAAGCTTAGTAACTTAGTGCTGACTCCACACCCCTCACAAATCCTGATGTTACGGAGGTGATGAGTCATGCCAAATTGGATAGACTGGAGAGTGCGAAAGCCGCGTGATAGCTTGGTCTATTGCAATAAATATAACGCTAGGCGCAGTTGGCAAATTTGTCATCAATCTAGCAAAAGCTTATGCGATAATAAAAAAAGCTAACCACAAGGGTTAGCACGAACCGTCACCTGAGTTACGATTGGAGTTAGCAACTTGCACTTGCTAGCTCCTTTTCTTTTATTATTATATGAAATAAAACCAAAAACTGCAATCAATTAAAATTTGGTGTAAGTGTCAATGTTTTTGCGGTAGCCTAAAAACCATTAGTCTGACCTTAATTCCTTAATAATTTATGTGGTTTAACATTCGGTTGAATTTACTTAAATAACTTCCGCCAGCAACTTCTGTTGTTAACGCTCCATGTTGAATTCCAATGTGCGTTTCATGACGCCGCTTTAAGCTATCTCTAGCTGCTGCCTGTAATTTCTTGCGTTCTAAACTGATCAACTTACCTGTCTCAAAGTCACTATTGAGCCATTGCTCTAAGCCGTGACAGCACTTTTTGGAATGCACGAAGCTATTAGCTACTTCCTTGTAGGAATACTGGATGAAAACAATGAAGAACTACAGAAAAGTGTAGCAGATGCACATTATTATGCTACTCAAACAGAACTTGATGCTGAAAGTATTGATGATAAACTTGCTTTAACCGTAGTCAAGTTACTGTTATTAGCTAAAAACACACATACTTATTTTGCTACTGAGCTTGCTAAACGTAGCTTTGAAGTTGAAATGCTGATACAAGCATTATGTGTTGTAATTATGAATAACGTTGAAAAAATTGCATAAGCACAATGAAAAAGTTACATTGATTAGTAACTTTTTTATTGTTTTGTGTTGATTTAAATACTATAAACTATATTAGTATCTATAAAGCTTGCGAGGATAATTGGCTCATAACTTGAAAGATCAGCATTTTCAGGGCGGGTTACTACTTTTCAATTTATGAATACAAAATATGTTAATCTTTCCCACGGTAATATCTTTTAAGTTCTTGATATAAAGAACTATTTTCTAGCAAGTATTCATTATAGTGAAAAATTCCTAGTATTTTCTGGCAGAAAGGTTGCATTATTGTGACTACAATTGTAAACTAATAAATGTTAGGAGTCTACAAACGTAAACTTAGAATTCAATTAATGGAGGAAGCCAATTGAAGGATGCAAAGAATGTAACCATCACCGATTCTGAATGGATGGTTATGAGAGTAATTTGGACAATGGGACATGCGACTAGTCGTGAACTAATTGATGCTATGAACGAATTAGAAGGCTGGTCAGCCTCAACAACCAAAACGCTACTTCACAGGTTGATTCAAAAACAGGCGGTTGCGCAGCATGGCGGCAGTCGACCATTCACGTATAAGCCGGTTGTTGGCGAGAAGAAATCAATGGCGGCAGCGGCAGATGATTTGTTTGACCATATGTGTGCGATGCGGGTTGGTTCAACAATTGCCGGCGTTATTCAGTCAAGGGAACTCTCACGGGCGGATATTGCGAACTTACAGGCGATTCTAGCTGAAAAGGCCAAAACAGCGCCCGAGGAAGTTCAGTGTAACTGCTTGCCAGGTGATCAAAAGTGTTAAGACTGTGGTGTTCTGTTTACTTTTAGGATTGCTGCACAAAAATCCCGAGCCCGGGGATTTCCACACAAGGAACTTATATTTAGGCTTCTAACCGCTTTGCTACGCACACTTATTTTTAGGAGGGTCGATAATCATGGAAAATAAAGAGGAACATATGAACATGAAAAGTATGAGTGCTAAGAATATGGAAAATAATGAATCAAATATGAGTCATATGGATCACGATATGACCGAAACGGATCACAGCCAAATGAACATGAATCACGGTGATATGGATATGGCTGGGACCGACATGATGATGCACGGTGGCTCAATGATGCATATGGGGAATTTGAAAGTTAAATTTTGGGTCTCCGTTGTCTTAGCGATTCCAGTTTTACTGTTAGCACCAATCATGGGTTTAAACGTTTCCATCCTTAGTTTCAGTTCACCGCTAATTGTTGGCATTATCATCGTTTTGTTTGATACGGCACTTTACTTTTATGGCGGAATGCCCTTTTTAAAGGGGGCTAAAGCGGAAATTCAGAATAAATCTCCTGAAATGATGACGCTAGTAACCCTTGGCATTTCCGTTTCGTATTTCTACAGTTTGTACGCATTTATTGCCAACAACTTCTTGAACCCGGCAAATCCTGTAATGGATTTTTCGTTCGAACTTGCAACCCTGATTTTAATTATGCTTCTAGGACACTGGATTGAAATGAATGCATTGATGGGGGCTGGGAATGCATTACAAAAGATGGCGGCTCTGTTGCCTAAGACGGCCCATCTAGTTACAGATAATGGTGAAACAAAAGAAGTGCCAGTATCTGATTTAAAAGTTGGTCAAGCTTTCCAAGTGCGTTCAGGTGAGAGTATTCCAGCCGATGGTGTTATTACGGCTGGGGAGTCGACCGTGAATGAAGCACTGGTAACCGGTGAATCTGCTGCCGTTACCAAGAACGTTGGCGATAAGGTCATTGGTGGTACAATCAACAATAACGGGACGCTAACGGTTAAAATTAGTGGTACTGGTGACTCCGGCTATCTTTCTCAAGTAATGAAAATGGTTCGAAATGCCCAGCAAGCTAAATCTAAAGCAGAAGATAAAGCTGATTTAGTTGCCAAGTATCTATTTTACGCGGCACTTAGTGTTGGGATTATTGCTTTCTTTGCCTGGTTACCCCAGGGATTGGCGACTGCAATGACGATCATGGTGACCGTCTTCGTGATTGCTTGCCCGCATGCATTAGGATTAGCGATTCCATTAGTGGTTTCTCGTTCTACTACGATTGGCGCTCAAAATGGGCTATTAGTTCGAAATCGCCAAGCCATTGAAGCAAGTCAACATGTTAGCCACGTTCTCTTGGATAAAACTGGCACGTTAACAGAAGGTAAATTTACGGTGAATGCATTGATTCCAAATGATGGGATTGACGAAACAACGTTATTAAGCCGACTGGCCGCCCTTGAAAATAATTCGACTCATCCGCTGGCCCAAGCAATCATTGCTGAAGCCCAAGCGAAGGACATTGAAGTCGTTGCGGCTGAAAAGTCTCAAAATATTCCAGGCGTTGGTATTTCCGGTAATATTGATGGCACTGACTATATGATTGTTAATGGTAACTATTTAACGAAGCAAGGGATCAGGTTTGACAAAGCCGCTGCTGATAAATGGGCTGCTAAGGGTAATTCCGTCAGCTTCCTATTGCAGGGCACCCAAGTTCAAGGAATGGTTGCTGAAGGCGACACCATCAAAGCGAGTGCTAAGGAATTAATTAGTGGTCTTCAGAGGCGAGGAATTACCCCCGTAATGCTCACTGGCGATAATCCAAAAGCAGCGGAACACGTTGCTAACTTACTAGGATTGACTGAATTCCATGCAGGCCTATTACCAGATGATAAGCAAAAGATTATTGCTGATTATCAAGCAAAGGGCAATCACGTTATCATGGTTGGTGACGGCGTAAATGACGCACCAAGTCTTTCCGCGGCCGATATTGGAATTGCAATTGGTGCCGGAACCGATGTTGCCATTGATTCCGCTGATGTTGTGTTGGTTAAATCAGAACCCAGCGATATTTTACATTTTCTTGATTTGGCTAAAATCACAAATCGGAAAATGGTTCAAAATCTCTGGTGGGGAGCAGGCTACAATATTGTCGCAATTCCACTCGCTGCCGGTGTGTTATCATTTATGGGAATCACTCTAGACCCAGCCGTTGGTGCTGTGGCCATGGCGATGTCGTCAATTATCGTGGCAATTAATGCGATGGGATTGACCGGTAAAAAGATAAAAAACGTATAA
Protein sequences of DBSCAN-SWA_4 >NC_017472|52711:66271|62888_63188_+|WP_118027589.1|DBSCAN-SWA MTALFGMHEAISYFLVGILDENNEELQKSVADAHYYATQTELDAESIDDKLALTVVKLLLLAKNTHTYFATELAKRSFEVEMLIQALCVVIMNNVEKIA >NC_017472|52711:66271|64117_66271_+|WP_014566334.1|DBSCAN-SWA MENKEEHMNMKSMSAKNMENNESNMSHMDHDMTETDHSQMNMNHGDMDMAGTDMMMHGGSMMHMGNLKVKFWVSVVLAIPVLLLAPIMGLNVSILSFSSPLIVGIIIVLFDTALYFYGGMPFLKGAKAEIQNKSPEMMTLVTLGISVSYFYSLYAFIANNFLNPANPVMDFSFELATLILIMLLGHWIEMNALMGAGNALQKMAALLPKTAHLVTDNGETKEVPVSDLKVGQAFQVRSGESIPADGVITAGESTVNEALVTGESAAVTKNVGDKVIGGTINNNGTLTVKISGTGDSGYLSQVMKMVRNAQQAKSKAEDKADLVAKYLFYAALSVGIIAFFAWLPQGLATAMTIMVTVFVIACPHALGLAIPLVVSRSTTIGAQNGLLVRNRQAIEASQHVSHVLLDKTGTLTEGKFTVNALIPNDGIDETTLLSRLAALENNSTHPLAQAIIAEAQAKDIEVVAAEKSQNIPGVGISGNIDGTDYMIVNGNYLTKQGIRFDKAAADKWAAKGNSVSFLLQGTQVQGMVAEGDTIKASAKELISGLQRRGITPVMLTGDNPKAAEHVANLLGLTEFHAGLLPDDKQKIIADYQAKGNHVIMVGDGVNDAPSLSAADIGIAIGAGTDVAIDSADVVLVKSEPSDILHFLDLAKITNRKMVQNLWWGAGYNIVAIPLAAGVLSFMGITLDPAVGAVAMAMSSIIVAINAMGLTGKKIKNV >NC_017472|52711:66271|59139_60090_+|WP_014566328.1|DBSCAN-SWA MYIKAANFSSKKTATTTDLGDGYETTMLHNAYIYNSKGKRVRGKKLLKNHDITYYGKVLMIKGKKYVQIGDNQYVRSSNVLLAYDGPISSNSNVNRHATNCSSNNDTSINSNNSTNNSKNNNVVNNTANSQNGSKSSKTNQTNNQSANILRNGNQNNQTNTDVATDTDFEALSLAIQKAEATKYYDATFARAQAYHQAKEAAEVLMVNHKHPYKYQPVITAAEVHAATANVEAAAANLDGDAEYDKMPNVKIERATDGDIKYDWTPAQKQLVLDIANEIHGSTDAHYFDNDRQIGLTDGNGMAHTFNTSYFLHETY >NC_017472|52711:66271|57155_59084_+|WP_014566327.1|DBSCAN-SWA MKIINIGILAHVDAGKTTLTESLLYASGAISEPGSVEKGTTRTDTMFLERQRGITIQAAVTSFQWHRCKVNIVDTPGHMDFLAEVYRSLAVLDGAILVISAKDGVQAQTRILFHALRKMNIPTVIFINKIDQAGVDLQSVVQSVRDKLSADIIIKQTVSLSPEIVLEENTDIEAWDAVIENNDKLLEKYIAGEPISREKLVREEQRRVQDASLFPVYYGSAKKGLGIQPLMDAVTGLFQPIGEQGSAALCGSVFKVEYTDCGQRRVYLRLYSGTLRLRDTVALAGREKLKITEMRIPSKGEIVRTDTAYPGEIVILPSDSVRLNDVLGDPTRLPRKRWREDPLPMLRTSIAPKTAAQRERLLDALTQLADTDPLLRYEVDSITHEIILSFLGRVQLEVVSALLSEKYKLETVVKEPTVIYMERPLKAASHTIHIEVPPNPFWASIGLSVTPLPLGSGVQYESRVSLGYLNQSFQNAVRDGIRYGLEQGLFGWNVTDCKICFEYGLYYSPVSTPADFRSLAPIVLEQALKESGTQLLEPYLSFTLYAPREYLSRAYHDAPKYCATIETVQVKKDEVVFTGEIPARCIQAYRTDLAFYTNGQSVCLTELKGYQAAVGKPVIQPRRPNSRLDKVRYMFQKIRKSR >NC_017472|52711:66271|60335_61316_-|WP_014566329.1|integrase|DBSCAN-SWA MTKNELENPQNNFQRNTSNKQLIFQQNNLEDLFQQFIIFIDATPNTVRTYRGSLKQWFLYLRQNQIGHPDSETVRQYRQKLQNSGKKPTTVKNYIIAVKRFFEWTEEAGFYPNIAKYIKSGHLSKNFKKDYLTGSQARQILDKIDRSTIKGKRDYAMLVTMLTMGLRTIEVSRADIDDIRTKGNMTVLYVQGKGHEEKDDLIRMPQHVESAIRDYLSVRKANDLSKPLFVSTSNHNANGRMTTRSIRRIVKTAFISAGYDSPRLTAHSTRHTAATLSLLNGATLQQTQELLRHRNIGTTEIYAHNIDAATNPAANDVEEAIFGKDI >NC_017472|52711:66271|63552_63984_+|WP_014566333.1|DBSCAN-SWA MKDAKNVTITDSEWMVMRVIWTMGHATSRELIDAMNELEGWSASTTKTLLHRLIQKQAVAQHGGSRPFTYKPVVGEKKSMAAAADDLFDHMCAMRVGSTIAGVIQSRELSRADIANLQAILAEKAKTAPEEVQCNCLPGDQKC >NC_017472|52711:66271|52711_54217_+|WP_014566323.1|DBSCAN-SWA MSNIRISNLSFRYDDSSENIFNKLNLNLDSTWKLGLVGRNGRGKTTFLNLLRRKLHGLGEIQTRLSFSYYPIKVEDQKNITLYELQKQVAFEEWELERELNLMNVNPNLLWQPFNTLSGGEQTKVLLALSFTDKDSFALIDEPTNHLDEDSRKEISNYLGKHEKGYIVVSHDRDFLNQVTDHILAIENMEIHLYQGNFAAYEDTKQKRDEFNREKNQKLKGEIRTLNESRLRLKGYSSKSENQKNAKAHSNEIHAYINKGFYSHKAAKVMQRSKNVERRMNDDIQAKQGLMTNIEDIPELTMNFQPNYHSTLLEAQHLDLQIENITLFKDLNLVVKNHGIVSLEGKNGSGKSTFLKMLLNKTFSVTYQGKYELANGLSISYLPQNFTEYHGTLHNFAYEHKISYEKLLNNLKKMGFPRAGFVTPIEEMSMGQQKRVALAKSLVEPADLYLWDEPANYLDVFNQDQLIELLKKVKPAMLLIEHDEYFIEQVTDHRVRLDIAE |
7 | Streptococcus_phage(75.0%) | integrase | attL 48323:48338|attR 75906:75921 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
69753 : 70095
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_017472|69753:70095|DBSCAN-SWA TATGGTAAAAATTCCTACAAGTGGAAACATTATCTATATTAATTTTGATCCATCTACGGGTGCCGAAATTCAAAAGCGTAGACCGGCAGTAGTAGTCAGCAATGATATACTAATGAAAACTTCACCCTTTGTGTGGGTAGTACCAATTTCTCATGGCTCGTTTAATGGTGAAAGCTATCCGCTTCATGTTCATTTAGATTCACGGACAAAAAGTGATGGAACAATATATGTAGAGCAGCTAAAATCCTTTGATTTTAGCAGAAGAAAATGGGAATATATTGAACAATTACCGCCTGATTTATTAGATGAGGTTCGTCAAAAAATTAAGCTGGTAGTTTCTTAG
Protein sequences of DBSCAN-SWA_5 >NC_017472|69753:70095|69753_70095_+|WP_014566338.1|DBSCAN-SWA MVKIPTSGNIIYINFDPSTGAEIQKRRPAVVVSNDILMKTSPFVWVVPISHGSFNGESYPLHVHLDSRTKSDGTIYVEQLKSFDFSRRKWEYIEQLPPDLLDEVRQKIKLVVS |
1 | Streptococcus_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|