Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_004565 | Clostridium tetani E88 plasmid pE88, complete sequence | 0 crisprs | Cas14u_CAS-V | 0 | 0 | 0 | 0 |
NC_004557 | Clostridium tetani E88, complete sequence | 8 crisprs | csa3,cas3HD,WYL,DEDDh,cas6,cas8b2,cas7,cas5,cas3,cas4,cas1,cas2,cas7b,cas8b1,cas14j,RT | 9 | 37 | 7 | 1 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_004557_1 | 1217308-1219530 | Unclear |
NA
Consensus repeat of NC_004557_1
|
33 spacers
spacers of NC_004557_1
>1.1|1217338|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT TTAAATGAAGGTACTAAATTTAAGGTAAGAATGGTG >1.2|1217404|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT AGCATTCCTCTATCTCCATTAACTACTGAAAAAGGA >1.3|1217470|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT GCTATAAGAATAATTCTAATTTTATCCAAGGAACTG >1.4|1217536|38|NC_004557|PILER-CR,CRISPRCasFinder,CRT TTCCAATTTTACTAGCTGCTACCCCAACGCCTAATAAT >1.5|1217604|37|NC_004557|PILER-CR,CRISPRCasFinder,CRT TAAATATATTACTTCCTTCTTGCACTGTAGGTTTTTC >1.6|1217671|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT TTGTTCGTAACTGTTAAAGCTATCTTTCTTTATGC >1.7|1217736|37|NC_004557|PILER-CR,CRISPRCasFinder,CRT AGAGCCTAGTTTCCTAAGCCCTTATAACCAACTTACC >1.8|1217803|39|NC_004557|PILER-CR,CRISPRCasFinder,CRT ATACAATGCTCCATGGAAAGGACTCCACTTAGATATATA >1.9|1217872|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT CCCACATCATTAAAGGATATAAAATTACCACCTTCC >1.10|1217938|37|NC_004557|PILER-CR,CRISPRCasFinder,CRT GTTTTAATATTAATATCGGCAAGTGCTAATTCATATG >1.11|1218005|37|NC_004557|PILER-CR,CRISPRCasFinder,CRT GTTTTAATATTAATATCGGCAAGTGCTAATTCATATG >1.12|1218072|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT AACTGCACAGTATCACCGCTAGCTTTTAATTCTTTA >1.13|1218138|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT TTTTGAAGTATATTATAAAGGCACAGTAACACGCCC >1.14|1218204|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT GCTTTAACTCTTAAAAAAGATAAAGTTCTAAATTC >1.15|1218269|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT AAGATGCAGCCAACGCACTTGGATATATGGCTTTGG >1.16|1218335|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT GGTAATGTAAGTAATTCTACAACCAATAATAGCAAT >1.17|1218401|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT ATACAGAATACAAGATTATAGTTAGTGGATATAGAA >1.18|1218467|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT AATGAAGTCAAATAATAACATACCATTTTGTGCTC >1.19|1218532|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT TTTAAATCTGGTTTATTTTTTACATTCTTCCAATCC >1.20|1218598|37|NC_004557|PILER-CR,CRISPRCasFinder,CRT AAGTGCGTTATTTACGCCTTCTATATGTCCGAATACC >1.21|1218665|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT TTTTCATTAGCAATTCCTTTGTACTGCCATCTTCCG >1.22|1218731|38|NC_004557|PILER-CR,CRISPRCasFinder,CRT CCTAGTACGCCCAGCATACCCAAAAAAGAACTACTTAA >1.23|1218799|38|NC_004557|PILER-CR,CRISPRCasFinder,CRT CTGGAAAGAGGCAATAAAGCATTAGGAATAATAAAATG >1.24|1218867|38|NC_004557|PILER-CR,CRISPRCasFinder,CRT CTTACTAACACTTTCAGACCTAGTATTAAAATAATTTT >1.25|1218935|37|NC_004557|PILER-CR,CRISPRCasFinder,CRT GTGTTACATCTCCCAATTTCTCCTCATAATACTTTAA >1.26|1219002|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT GCTATAGCTAGTATAGTAGATACGTTGCGAGAATGG >1.27|1219068|37|NC_004557|PILER-CR,CRISPRCasFinder,CRT AATTAATATTGGCAGTATATGCTATACCATCTATAGC >1.28|1219135|38|NC_004557|PILER-CR,CRISPRCasFinder,CRT TATCTAACTCAATATTTTCTTCTTTTACATCCTGTTTA >1.29|1219203|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT AAATTAATAAAGATAGTAGGTTAAAGGGTATATTAG >1.30|1219269|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT TTATAAAATTCTATTTCTAGTTCTTCTTGAGTATAT >1.31|1219335|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT GTCGACCCATTGGAGTTAGACAGATGGGATTTTCA >1.32|1219400|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT TTAGCATCTATAATATTTACTTCTTTAATAGTTCTT >1.33|1219466|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT ATATGTGATGATGAATTAGAGAAAGTGCTTGAAAG |
cas2,cas1,cas4,cas3,cas5,cas7,cas8b2,cas6 |
CRISPR arrays and Neighbor proteins around NC_004557_1
The CRISPR arrays of NC_004557_1 >merge|NC_004557|1|1217308-1219530|PILER-CR,CRISPRCasFinder,CRT GTATTAGTAGCACCATATTGGAATGTAAATTTAAATGAAGGTACTAAATTTAAGGTAAGAATGGTGGTATTAGTAGCACCATATTGGAATGTAAATAGCATTCCTCTATCTCCATTAACTACTGAAAAAGGAGTATTAGTAGCACCATATTGGAATGTAAATGCTATAAGAATAATTCTAATTTTATCCAAGGAACTGGTATTAGTAGCACCATATTGGAATGTAAATTTCCAATTTTACTAGCTGCTACCCCAACGCCTAATAATGTATTAGTAGCACCATATTGGAATGTAAATTAAATATATTACTTCCTTCTTGCACTGTAGGTTTTTCGTATTAGTAGCACCATATTGGAATGTAAATTTGTTCGTAACTGTTAAAGCTATCTTTCTTTATGCGTATTAGTAGCACCATATTGGAATGTAAATAGAGCCTAGTTTCCTAAGCCCTTATAACCAACTTACCGTATTAGTAGCACCATATTGGAATGTAAATATACAATGCTCCATGGAAAGGACTCCACTTAGATATATAGTATTAGTAGCACCATATTGGAATGTAAATCCCACATCATTAAAGGATATAAAATTACCACCTTCCGTATTAGTAGCACCATATTGGAATGTAAATGTTTTAATATTAATATCGGCAAGTGCTAATTCATATGGTATTAGTAGCACCATATTGGAATGTAAATGTTTTAATATTAATATCGGCAAGTGCTAATTCATATGGTATTAGTAGCACCATATTGGAATGTAAATAACTGCACAGTATCACCGCTAGCTTTTAATTCTTTAGTATTAGTAGCACCATATTGGAATGTAAATTTTTGAAGTATATTATAAAGGCACAGTAACACGCCCGTATTAGTAGCACCATATTGGAATGTAAATGCTTTAACTCTTAAAAAAGATAAAGTTCTAAATTCGTATTAGTAGCACCATATTGGAATGTAAATAAGATGCAGCCAACGCACTTGGATATATGGCTTTGGGTATTAGTAGCACCATATTGGAATGTAAATGGTAATGTAAGTAATTCTACAACCAATAATAGCAATGTATTAGTAGCACCATATTGGAATGTAAATATACAGAATACAAGATTATAGTTAGTGGATATAGAAGTATTAGTAGCACCATATTGGAATGTAAATAATGAAGTCAAATAATAACATACCATTTTGTGCTCGTATTAGTAGCACCATATTGGAATGTAAATTTTAAATCTGGTTTATTTTTTACATTCTTCCAATCCGTATTAGTAGCACCATATTGGAATGTAAATAAGTGCGTTATTTACGCCTTCTATATGTCCGAATACCGTATTAGTAGCACCATATTGGAATGTAAATTTTTCATTAGCAATTCCTTTGTACTGCCATCTTCCGGTATTAGTAGCACCATATTGGAATGTAAATCCTAGTACGCCCAGCATACCCAAAAAAGAACTACTTAAGTATTAGTAGCACCATATTGGAATGTAAATCTGGAAAGAGGCAATAAAGCATTAGGAATAATAAAATGGTATTAGTAGCACCATATTGGAATGTAAATCTTACTAACACTTTCAGACCTAGTATTAAAATAATTTTGTATTAGTAGCACCATATTGGAATGTAAATGTGTTACATCTCCCAATTTCTCCTCATAATACTTTAAGTATTAGTAGCACCATATTGGAATGTAAATGCTATAGCTAGTATAGTAGATACGTTGCGAGAATGGGTATTAGTAGCACCATATTGGAATGTAAATAATTAATATTGGCAGTATATGCTATACCATCTATAGCGTATTAGTAGCACCATATTGGAATGTAAATTATCTAACTCAATATTTTCTTCTTTTACATCCTGTTTAGTATTAGTAGCACCATATTGGAATGTAAATAAATTAATAAAGATAGTAGGTTAAAGGGTATATTAGGTATTAGTAGCACCATATTGGAATGTAAATTTATAAAATTCTATTTCTAGTTCTTCTTGAGTATATGTATTAGTAGCACCATATTGGAATGTAAATGTCGACCCATTGGAGTTAGACAGATGGGATTTTCAGTATTAGTAGCACCATATTGGAATGTAAATTTAGCATCTATAATATTTACTTCTTTAATAGTTCTTGTATTAGTAGCACCATATTGGAATGTAAATATATGTGATGATGAATTAGAGAAAGTGCTTGAAAGGTATTAGTAGCACCATATTGGAATATAAAT >NC_004557|1|1|1217308-1219530|PILER-CR GTATTAGTAGCACCATATTGGAATGTAAAT TTAAATGAAGGTACTAAATTTAAGGTAAGAATGGTG GTATTAGTAGCACCATATTGGAATGTAAAT AGCATTCCTCTATCTCCATTAACTACTGAAAAAGGA GTATTAGTAGCACCATATTGGAATGTAAAT GCTATAAGAATAATTCTAATTTTATCCAAGGAACTG GTATTAGTAGCACCATATTGGAATGTAAAT TTCCAATTTTACTAGCTGCTACCCCAACGCCTAATAAT GTATTAGTAGCACCATATTGGAATGTAAAT TAAATATATTACTTCCTTCTTGCACTGTAGGTTTTTC GTATTAGTAGCACCATATTGGAATGTAAAT TTGTTCGTAACTGTTAAAGCTATCTTTCTTTATGC GTATTAGTAGCACCATATTGGAATGTAAAT AGAGCCTAGTTTCCTAAGCCCTTATAACCAACTTACC GTATTAGTAGCACCATATTGGAATGTAAAT ATACAATGCTCCATGGAAAGGACTCCACTTAGATATATA GTATTAGTAGCACCATATTGGAATGTAAAT CCCACATCATTAAAGGATATAAAATTACCACCTTCC GTATTAGTAGCACCATATTGGAATGTAAAT GTTTTAATATTAATATCGGCAAGTGCTAATTCATATG GTATTAGTAGCACCATATTGGAATGTAAAT GTTTTAATATTAATATCGGCAAGTGCTAATTCATATG GTATTAGTAGCACCATATTGGAATGTAAAT AACTGCACAGTATCACCGCTAGCTTTTAATTCTTTA GTATTAGTAGCACCATATTGGAATGTAAAT TTTTGAAGTATATTATAAAGGCACAGTAACACGCCC GTATTAGTAGCACCATATTGGAATGTAAAT GCTTTAACTCTTAAAAAAGATAAAGTTCTAAATTC GTATTAGTAGCACCATATTGGAATGTAAAT AAGATGCAGCCAACGCACTTGGATATATGGCTTTGG GTATTAGTAGCACCATATTGGAATGTAAAT GGTAATGTAAGTAATTCTACAACCAATAATAGCAAT GTATTAGTAGCACCATATTGGAATGTAAAT ATACAGAATACAAGATTATAGTTAGTGGATATAGAA GTATTAGTAGCACCATATTGGAATGTAAAT AATGAAGTCAAATAATAACATACCATTTTGTGCTC GTATTAGTAGCACCATATTGGAATGTAAAT TTTAAATCTGGTTTATTTTTTACATTCTTCCAATCC GTATTAGTAGCACCATATTGGAATGTAAAT AAGTGCGTTATTTACGCCTTCTATATGTCCGAATACC GTATTAGTAGCACCATATTGGAATGTAAAT TTTTCATTAGCAATTCCTTTGTACTGCCATCTTCCG GTATTAGTAGCACCATATTGGAATGTAAAT CCTAGTACGCCCAGCATACCCAAAAAAGAACTACTTAA GTATTAGTAGCACCATATTGGAATGTAAAT CTGGAAAGAGGCAATAAAGCATTAGGAATAATAAAATG GTATTAGTAGCACCATATTGGAATGTAAAT CTTACTAACACTTTCAGACCTAGTATTAAAATAATTTT GTATTAGTAGCACCATATTGGAATGTAAAT GTGTTACATCTCCCAATTTCTCCTCATAATACTTTAA GTATTAGTAGCACCATATTGGAATGTAAAT GCTATAGCTAGTATAGTAGATACGTTGCGAGAATGG GTATTAGTAGCACCATATTGGAATGTAAAT AATTAATATTGGCAGTATATGCTATACCATCTATAGC GTATTAGTAGCACCATATTGGAATGTAAAT TATCTAACTCAATATTTTCTTCTTTTACATCCTGTTTA GTATTAGTAGCACCATATTGGAATGTAAAT AAATTAATAAAGATAGTAGGTTAAAGGGTATATTAG GTATTAGTAGCACCATATTGGAATGTAAAT TTATAAAATTCTATTTCTAGTTCTTCTTGAGTATAT GTATTAGTAGCACCATATTGGAATGTAAAT GTCGACCCATTGGAGTTAGACAGATGGGATTTTCA GTATTAGTAGCACCATATTGGAATGTAAAT TTAGCATCTATAATATTTACTTCTTTAATAGTTCTT GTATTAGTAGCACCATATTGGAATGTAAAT ATATGTGATGATGAATTAGAGAAAGTGCTTGAAAG GTATTAGTAGCACCATATTGGAATATAAAT >NC_004557|1|1|1217308-1219530|CRISPRCasFinder GTATTAGTAGCACCATATTGGAATGTAAAT TTAAATGAAGGTACTAAATTTAAGGTAAGAATGGTG GTATTAGTAGCACCATATTGGAATGTAAAT AGCATTCCTCTATCTCCATTAACTACTGAAAAAGGA GTATTAGTAGCACCATATTGGAATGTAAAT GCTATAAGAATAATTCTAATTTTATCCAAGGAACTG GTATTAGTAGCACCATATTGGAATGTAAAT TTCCAATTTTACTAGCTGCTACCCCAACGCCTAATAAT GTATTAGTAGCACCATATTGGAATGTAAAT TAAATATATTACTTCCTTCTTGCACTGTAGGTTTTTC GTATTAGTAGCACCATATTGGAATGTAAAT TTGTTCGTAACTGTTAAAGCTATCTTTCTTTATGC GTATTAGTAGCACCATATTGGAATGTAAAT AGAGCCTAGTTTCCTAAGCCCTTATAACCAACTTACC GTATTAGTAGCACCATATTGGAATGTAAAT ATACAATGCTCCATGGAAAGGACTCCACTTAGATATATA GTATTAGTAGCACCATATTGGAATGTAAAT CCCACATCATTAAAGGATATAAAATTACCACCTTCC GTATTAGTAGCACCATATTGGAATGTAAAT GTTTTAATATTAATATCGGCAAGTGCTAATTCATATG GTATTAGTAGCACCATATTGGAATGTAAAT GTTTTAATATTAATATCGGCAAGTGCTAATTCATATG GTATTAGTAGCACCATATTGGAATGTAAAT AACTGCACAGTATCACCGCTAGCTTTTAATTCTTTA GTATTAGTAGCACCATATTGGAATGTAAAT TTTTGAAGTATATTATAAAGGCACAGTAACACGCCC GTATTAGTAGCACCATATTGGAATGTAAAT GCTTTAACTCTTAAAAAAGATAAAGTTCTAAATTC GTATTAGTAGCACCATATTGGAATGTAAAT AAGATGCAGCCAACGCACTTGGATATATGGCTTTGG GTATTAGTAGCACCATATTGGAATGTAAAT GGTAATGTAAGTAATTCTACAACCAATAATAGCAAT GTATTAGTAGCACCATATTGGAATGTAAAT ATACAGAATACAAGATTATAGTTAGTGGATATAGAA GTATTAGTAGCACCATATTGGAATGTAAAT AATGAAGTCAAATAATAACATACCATTTTGTGCTC GTATTAGTAGCACCATATTGGAATGTAAAT TTTAAATCTGGTTTATTTTTTACATTCTTCCAATCC GTATTAGTAGCACCATATTGGAATGTAAAT AAGTGCGTTATTTACGCCTTCTATATGTCCGAATACC GTATTAGTAGCACCATATTGGAATGTAAAT TTTTCATTAGCAATTCCTTTGTACTGCCATCTTCCG GTATTAGTAGCACCATATTGGAATGTAAAT CCTAGTACGCCCAGCATACCCAAAAAAGAACTACTTAA GTATTAGTAGCACCATATTGGAATGTAAAT CTGGAAAGAGGCAATAAAGCATTAGGAATAATAAAATG GTATTAGTAGCACCATATTGGAATGTAAAT CTTACTAACACTTTCAGACCTAGTATTAAAATAATTTT GTATTAGTAGCACCATATTGGAATGTAAAT GTGTTACATCTCCCAATTTCTCCTCATAATACTTTAA GTATTAGTAGCACCATATTGGAATGTAAAT GCTATAGCTAGTATAGTAGATACGTTGCGAGAATGG GTATTAGTAGCACCATATTGGAATGTAAAT AATTAATATTGGCAGTATATGCTATACCATCTATAGC GTATTAGTAGCACCATATTGGAATGTAAAT TATCTAACTCAATATTTTCTTCTTTTACATCCTGTTTA GTATTAGTAGCACCATATTGGAATGTAAAT AAATTAATAAAGATAGTAGGTTAAAGGGTATATTAG GTATTAGTAGCACCATATTGGAATGTAAAT TTATAAAATTCTATTTCTAGTTCTTCTTGAGTATAT GTATTAGTAGCACCATATTGGAATGTAAAT GTCGACCCATTGGAGTTAGACAGATGGGATTTTCA GTATTAGTAGCACCATATTGGAATGTAAAT TTAGCATCTATAATATTTACTTCTTTAATAGTTCTT GTATTAGTAGCACCATATTGGAATGTAAAT ATATGTGATGATGAATTAGAGAAAGTGCTTGAAAG GTATTAGTAGCACCATATTGGAATATAAAT >NC_004557|1|1|1217308-1219530|CRT GTATTAGTAGCACCATATTGGAATGTAAAT TTAAATGAAGGTACTAAATTTAAGGTAAGAATGGTG GTATTAGTAGCACCATATTGGAATGTAAAT AGCATTCCTCTATCTCCATTAACTACTGAAAAAGGA GTATTAGTAGCACCATATTGGAATGTAAAT GCTATAAGAATAATTCTAATTTTATCCAAGGAACTG GTATTAGTAGCACCATATTGGAATGTAAAT TTCCAATTTTACTAGCTGCTACCCCAACGCCTAATAAT GTATTAGTAGCACCATATTGGAATGTAAAT TAAATATATTACTTCCTTCTTGCACTGTAGGTTTTTC GTATTAGTAGCACCATATTGGAATGTAAAT TTGTTCGTAACTGTTAAAGCTATCTTTCTTTATGC GTATTAGTAGCACCATATTGGAATGTAAAT AGAGCCTAGTTTCCTAAGCCCTTATAACCAACTTACC GTATTAGTAGCACCATATTGGAATGTAAAT ATACAATGCTCCATGGAAAGGACTCCACTTAGATATATA GTATTAGTAGCACCATATTGGAATGTAAAT CCCACATCATTAAAGGATATAAAATTACCACCTTCC GTATTAGTAGCACCATATTGGAATGTAAAT GTTTTAATATTAATATCGGCAAGTGCTAATTCATATG GTATTAGTAGCACCATATTGGAATGTAAAT GTTTTAATATTAATATCGGCAAGTGCTAATTCATATG GTATTAGTAGCACCATATTGGAATGTAAAT AACTGCACAGTATCACCGCTAGCTTTTAATTCTTTA GTATTAGTAGCACCATATTGGAATGTAAAT TTTTGAAGTATATTATAAAGGCACAGTAACACGCCC GTATTAGTAGCACCATATTGGAATGTAAAT GCTTTAACTCTTAAAAAAGATAAAGTTCTAAATTC GTATTAGTAGCACCATATTGGAATGTAAAT AAGATGCAGCCAACGCACTTGGATATATGGCTTTGG GTATTAGTAGCACCATATTGGAATGTAAAT GGTAATGTAAGTAATTCTACAACCAATAATAGCAAT GTATTAGTAGCACCATATTGGAATGTAAAT ATACAGAATACAAGATTATAGTTAGTGGATATAGAA GTATTAGTAGCACCATATTGGAATGTAAAT AATGAAGTCAAATAATAACATACCATTTTGTGCTC GTATTAGTAGCACCATATTGGAATGTAAAT TTTAAATCTGGTTTATTTTTTACATTCTTCCAATCC GTATTAGTAGCACCATATTGGAATGTAAAT AAGTGCGTTATTTACGCCTTCTATATGTCCGAATACC GTATTAGTAGCACCATATTGGAATGTAAAT TTTTCATTAGCAATTCCTTTGTACTGCCATCTTCCG GTATTAGTAGCACCATATTGGAATGTAAAT CCTAGTACGCCCAGCATACCCAAAAAAGAACTACTTAA GTATTAGTAGCACCATATTGGAATGTAAAT CTGGAAAGAGGCAATAAAGCATTAGGAATAATAAAATG GTATTAGTAGCACCATATTGGAATGTAAAT CTTACTAACACTTTCAGACCTAGTATTAAAATAATTTT GTATTAGTAGCACCATATTGGAATGTAAAT GTGTTACATCTCCCAATTTCTCCTCATAATACTTTAA GTATTAGTAGCACCATATTGGAATGTAAAT GCTATAGCTAGTATAGTAGATACGTTGCGAGAATGG GTATTAGTAGCACCATATTGGAATGTAAAT AATTAATATTGGCAGTATATGCTATACCATCTATAGC GTATTAGTAGCACCATATTGGAATGTAAAT TATCTAACTCAATATTTTCTTCTTTTACATCCTGTTTA GTATTAGTAGCACCATATTGGAATGTAAAT AAATTAATAAAGATAGTAGGTTAAAGGGTATATTAG GTATTAGTAGCACCATATTGGAATGTAAAT TTATAAAATTCTATTTCTAGTTCTTCTTGAGTATAT GTATTAGTAGCACCATATTGGAATGTAAAT GTCGACCCATTGGAGTTAGACAGATGGGATTTTCA GTATTAGTAGCACCATATTGGAATGTAAAT TTAGCATCTATAATATTTACTTCTTTAATAGTTCTT GTATTAGTAGCACCATATTGGAATGTAAAT ATATGTGATGATGAATTAGAGAAAGTGCTTGAAAG GTATTAGTAGCACCATATTGGAATATAAAT
>NC_004557.1|WP_035109977.1|1216850_1217129_+|CRISPR-associated-endonuclease-Cas2 MYVILVYDIKSGEEGQRVLNRTFKVCKKYLSHIQNSVFEGELAESQIIKLKYELDDIIRKDKDSVILFKSRNKRWLTKDMWGKKEDRTSNFI >NC_004557.1|WP_011099384.1|1215846_1216839_+|type-I-B-CRISPR-associated-endonuclease-Cas1 MKRSYYIYNNGILKRKDNSMAFIDELGERRYIPIETANEIYVMSEMDFNTSLINYLSQYDVIIHFFNYYSFYTGSFQPRKKLVSGNLLVNQVNHYSDNSKRLEIAKKFVDGASYNIYRNLRYYNGRGKDVQIYMDKIEALRKQIYVSTNINELMGYEGNIRKIYYEAWNIIIDQKIDFTKRVKNPPDNMINTLISFVNTLIYTKVVGAIYHTQLNPTVSYLHEPGVRRFSLSLDIAEIFKPILADRLIFSLLNKKQITKKSFTKELNYLHLTKDASKIIVGELDQKIQTTIKHKDLNKNVSYEYLMRLECYKLIKHLLGEKEYEPFKIWW >NC_004557.1|WP_035110461.1|1215334_1215829_+|CRISPR-associated-protein-Cas4 MKKEITGVMIYYYKVCKRKLWYFYNEIQMEQGNESVEIGKAIDEETYRRDKKHINIDNIINIDFIRSKGILHEVKKSNKIEEASILQVKYYLYFLNKRGIENIKGKIDYPLLKQNIDVELTREDVTIIEGILDDIQNIVKASNPPNLEKKRICKSCAYYDLCFI >NC_004557.1|WP_052040366.1|1213026_1215306_+|CRISPR-associated-helicase/endonuclease-Cas3 MENKLNILDDDILKLIEEKKAKPDKTIKEHTLELIEVLNLLRELGYIKNDKIYNLVEKACIYHDLGKLNKEFQKRVNGKNVKFNETKEVVHNILSLYFINSKNFESKENYLKVAHSVLNHHNYCNNFDEISEKEELIKSLIEGFKTYKVKRSTISKLKSIVSDIDSIKVKGYLHKCDYSASSGNKAEYPNNFLENGLNNLLIKWKKGTKEATWNELQNFCIENKDENIIAIAQTGMGKTEAGLLWIGNTKGFFVLPIRTAINAIYDRVRKDILNNKGIDEKIAILHSSSLEYYIRNITGDTNEKEEIDLMNYHKIGKQLSIPINISTMDQIFDFVYKYPGYELKLTTLSYSKIVIDEIQAYGPDLLAYLICGLEKIAELGGRIAILTATLPPFIKDLLQKNIKFIENSTAFTNDMKRHNLKIIDERINSEDIYNKYVENKKLNKNNKILVVCNTIKEAQKLYEELKILINNEELHILHSKFIRKDRLKKESEIIEFGKTYDENKNIDKKNGIWISTSIVEASLDIDFDCLFTELQDLNSLFQRLGRCNRKGKKDSSNYNCYIYTEIDTANLINGDKGFIDKRLFDLSKKAIISCDGQISERDKINLIDSYLTTENLKGSDYMRKYKEIYNFIKDIPSYEFDLNQIDLRNILSEEIIPSPVYEEFLEEIKEIECKLANENISYYEKIILKDEIRKYTVSVHPNDIRNYDRAKQKGAAINYNKILLSKYKNEYIKVIECKYDEAGYKRIKYGETTRSSNIW >NC_004557.1|WP_011099381.1|1211893_1212976_+|CRISPR-associated-protein-Cas5 MKALRIVLTQSSANYKKEETIDNKMTYPLPPISTIIGAIHDACGYKDYHPIDISVQGKFESMHKEPYTDYCFLNSVMDDRGILIKMKNESLLSNAFDKVASAKKSQGNSFRKGITIQVYNEELLKEYRDLKDLNDKIAHYKKNEFKEKLDSIKAAKTKLAEDKKKLDKKSKEFEDIIKREKEVKLKEKNFKQKVKEFELEKYTKPISKFRSLTTSLKYYEILNNVELVIHIRSDEKTLNEIEENIYNLKSIGRSEDFVNIIEAKIVTLTESDDYEIKSNYSAYLNYDDVKNEKVWFENTKADRKVSGTKYYINKNYIIKDDKRFFEKKKVIYGSQYFIEETSENIFIDNEENKEYIVNFI >NC_004557.1|WP_035109979.1|1210966_1211866_+|type-I-B-CRISPR-associated-protein-Cas7/Cst2/DevR MKDKKALTLTVVANMTSNYSEGLGNIASVQKVFKNRKVYTIRSRESLKNAIMVQSGMYDDLQTEVDGATQKLANKELNASNCRALEGGYMSTKGTTNIRKSSFYLTDAISCESFVNETRFHNNLYLANNAAQAKNINLQEKSSEAGLMPYQYEYDKSLKIYSITIDLEMIGKDENFQQEEDYKEADNKEKADRVNSILNAIENLSLTVKGNLDNAEPVFVVGGLSNRKTHYFENVVKVEEEKLIISEDLKDKIEKGYHVGLLEGKTLQNEKEIKEQLNPISITKFFDMIRHEVNTYFGI >NC_004557.1|WP_011099379.1|1209607_1210951_+|type-I-CRISPR-associated-protein-Cas8a1/Csx8 MKTSIQNEKYDTMLEPSDWRFSATIVGLLQYLNYHDLDYKLEEDYILYNSSGINEERYLDFVEYKYGEELHHRLVENILSNEEITEEQLKLINEKLVANTIMKKTFGKIKFDNTNKKEILDIINKNRYELIRETFRRKSNMYANYGNTNQLFNDSQDHCRLLGYCIDTGKKGKSTGFNFMMSTFVGSDIKEFDFIPFAFEGSREAFFINDNYTIQRLKISNEILSKKIEDDLEGENKRKDARQTLFKAIMETSDFIKRDVEVILKDISKEYFETLYIRKESIDIFKEFINEKIEYKSFCFSHKVTDKYYINIQKKVTESILNNVLLDELIEIFLKEKNRSYLVLQLIKINVLIRRDKTMKDRLKGAFACAKQVSKAIESNKLDSYKQKLTSSIIFKDYDRVCQILLQLSNYSGIEFGFVYDLYDDFEENKDLAYTFINALSKKSENN >NC_004557.1|WP_011099378.1|1208849_1209593_+|CRISPR-associated-endoribonuclease-Cas6 MRFCLTLHLKEKIFLIEYRKVILSYIKNAISKCNNGKYYECFFKDTKQKDYCFSVILPNPTFTKNEIILNGNEIKVLFSTNNNSKIGFILFSAFIAQKNKPYPLPNNNSMILKNINNKKQEEIFNSKAIFKTTLGSGLCVRDHDKEENKDTYYVYTDEKFREKLKVVLIKQILKAGFTEEEANDIKVNPIQCKKVVVKHYRRYIDTTTGLFEIQANNKILQHFYDVGIGSRKSMGFGMIDLVTQDLL >NC_004557.1|WP_035109981.1|1207826_1208132_-|transposase MSKVKFKRTFTEEDRISYVKEVLECGSNILVAKKYDINQVQLSTWVNNYRRYSQTLTPKKPKDVDIIPNYKKEYKKVVEQLKEKELEIAILKDLLKKKNRL >NC_004557.1|WP_011099377.1|1206918_1207515_-|DDE-type-integrase/transposase/recombinase MYRLMSSLNLLGDSTKYRKPRISRICESIRVAGSNQLWQMDIKYCFITGTRKTAYITSIIDVFDRSIVSQSIDLSATGNVAKSVLLKRLYCRGLKDSPNGLIIRTDNGSQFISGVFEKACLREEVIHERIPVRSPNYNAYIESFHRYLQDECLTGKIYMTLEDLKIDVEDYVYRYNHERIHSSIGYYSPHDYYIKNVS >NC_004557.1|WP_035124903.1|1219628_1221578_+|HIRAN-domain-containing-protein MDFNETIYNRILKFVKENPDSVYLPQDFDEAGKKDAYFIFNAKCGTEKFEIENSNNLIKLISNYLNDEAQYNDLIEYIHEFPIIVYYFEFCRILEMQIKESLLSRKKVIEVGKKFVTESNDNEQIKLGIALLGLSADIQTKTILETIALHNEFTFYVVVSMKHWNHYNSFVFELAQKTKGYGKLHCVKNLEPINDEMKTWFIEESCNNTVFKSLSAIMCVDKVDMSWYLKTRKITKIEFSNISRLIYYIFSVDENDIYELEDSLETVEFYLKYAEKYAENFRDLCAIVYIKRWMRPYWEQFNVDIEKKNGWTSNIESKVGDICKNLLKDKKWIPVLKSAIYNAEEDVEIYTRIAESIGFDLTFNMLDSVLKKDKFNIEVFYFLYTKDDEGDIKNVIDYAKNTLPYQVIFSGSEEINEDNLTVENKPDICFLYILKYLNNCNYIEFELPTMALQARFQKCREEAIKYLRNNKEHWNEKIVCKIREAIEVEVNDKLLRKLKRLIGEEVIDKKKQRKYVDISKQRLKPHIKDIYLFSTYVAGVYYRDTSVVEDYIGVNDILFLKEEPENPYDKNAILVTNENGYVLGYLPKSVNKIPKNLLAGGKFLYTIIEEYSLESNTISIDVYLSYKDVIDLVEELMKISESKVNYYKQ >NC_004557.1|WP_011099386.1|1221804_1222617_+|alpha/beta-hydrolase MGYYVRVEPNVKIYVEDLNPTGDKTIVFLHGWPGSHKLFEYQFNELPKRGYRCIGVDQRGFGQSDKPWRGYDYNRLADDVRCIVETLKLQDFILAGHSTGGAIAIRYMARHNGYKVSKLALFAAAAPSLIKRPNFPYGLDKETVMKIIEGTYTDRPKMLRDFGDIFFFQHITEPFNYWFLQLGLQAAGWATADIAKTWLREELFCDLGTITVPTLIMHGIHDKVVPFELGKIQKQGIKNSKLIPFEYSGHGLFYDQREKFNGELRSFIEE >NC_004557.1|WP_011099387.1|1222805_1223213_+|DUF1259-domain-containing-protein MRDFCRTCNEFARILGAEILSTANNVCTVMFMRDIDAEILGRRTNSPLALMAMFSFESPDNQGRTLNLGETVILQDEINDFISILRENGILVTALHNHWLFEDPRLMYIHFESIDRPLDFARKVAEALRVLRDNC >NC_004557.1|WP_035109974.1|1223406_1223634_+|PepSY-domain-containing-protein MTTPQYVLWDRYWRSYRIDSESAIQIALQQIPGEVIKVELDTENGVLVYEVTIRNNTGIYEISIDANTGQIVEFD >NC_004557.1|WP_011099388.1|1223806_1224727_-|homoserine-O-succinyltransferase MPIVIPKDLPATETLENENIFVITEHRAIHQDIRPLKIAIVNLMPKKIETETQLLRLLGNIPIQVSIDLIHPKTHHSKNISEKHLLSFYKTIDDIKNEKFDGMIITGAPVEQIAFEDVDYFQELKTIMDFSVTNVFSTLHICWGAQAALYYHYNINKNILPKKVFGVFSHHININKGTVKLLRGFDDKFYVPHSRHTEVKKEDIEKVPELEIFAESNEVGPYIIASKNGRQIFITGHPEYDANTLKSEYYRDINLGKHIEIPKNYFKNNNPREELIANWRGHANLLFSNWLNYYVYQETPYSYISI >NC_004557.1|WP_035109971.1|1224745_1226023_-|O-acetylhomoserine-aminocarboxypropyltransferase/cysteine-synthase MNNSWGKGTICIQGGYNPKPGEPRVLPIFQSTTYKYEDPDHVAKLFDLTEEGHIYSRISNPTVSAYEEKVNCLEGGAGALAVSSGQSATTLALLNICKSGDHIISASTIYGGTFTLLSSTLKKFGIEVSFINPDSSKEDILKEFKSNTKAIFAETIGNPGLNILDFDKFSDIAQKTEVPFIVDNTLASPYLCSPLELGANIVIHSSTKYIDGHATSIGGIIVDGGNFNWDNGKFPDLVKEDPTYHGIRYTKTFGKSAYIVKCRVQLLRDLGTCLSPFNAFLNNLGLETLHLRMERHCSNTLKLAKFLENHKKVNWVNYPGLYNNSNYQLANKYLSKGSGAILTFGVKGCKDAGSKFIRNLKLAALVVHLGDARTSVLQPATTTHRQLNEKEQISSGVYPDLIRVSVGIEDPEDLINDFNQALLNI >NC_004557.1|WP_011099390.1|1226391_1226925_-|hypothetical-protein MKIQQHSNFVNIFNLHNNPQKTKADQIKIDIARKENPQLDRALYLNDINKVFEEKAKVEQIAKKIARGKQLTREEKELISRTDPEMLRKAEMAKQENDALKRSLKSAKSKQHAQRILAQACIKAQLVSEVDPQYADLLMDTIQELHKDINKGNNPYDKANQYTQNKKSPYEMLNLKR >NC_004557.1|WP_035110459.1|1227275_1228748_+|CZB-domain-containing-protein MFEKKPCYEAECIIKYVEERLEGNKTLEPKVEYPIHVKLLKNYKKLFSNEGIMSSSAKTLLDINASLSDFDVQMSSISYELIDFAKEMSELSESNLAVVEEITASMNQVNHTIEDTSKTLEDLSISSKELIEENHKSLAEIEDINHLKEEVMNNANIMSSQIEELVEMANKVSDIVEGVGAIADQTNLLALNASIEAARAGEHGKGFAVVAQEIRKLADDTKGSLQNMRNFVNNIQNTSREGKKSMDNTISSTEKMSKKIDAITYTTKSNVDMLEDSVRSIYTINESMGGINLAATEINKAMDTSTQDAEKLSLMTNTIHDNALKSADYAKKISNIDDLLSEVLKHMMGGLQGTINAISNEEFLEYMEKAKKAHKNWLENLKNIVNEMRIYPLQTNGAKCAFGHFYNSIQATHPSILEEWKGIDNIHKEFHNLGDKVLKSVKENNKHKAQEYYDSAEKISKEIFMSMDKIIIESKKQMEKGVQLFQQIKN >NC_004557.1|WP_011099392.1|1228883_1231022_-|anaerobic-carbon-monoxide-dehydrogenase-catalytic-subunit MSNEDIKKSYEKSANRMSGDNTTFGSKLTPEDFNDPNINTNAFNKKKVDYNDFEKSPISMDEVHKWQRQHISKKDQPKEGYPLNVIIDPAMREMYQIVNKAGMTNVFDRFSQQQPIQCKFCIEGLSCQLCANGPCRISPSAPRGTCGVDAHTMVARNFMYRHVTIGTSANIFHAHQAARTLRAAGEHPESGLKIRDSEKLKNFADMAGLDANKSINELAVDFANWVINDIHSEQHIPSKTVEAFAPTKRKDLWRKLGLFPGGGYSEIAYSQTSCMTNFRSDPVEFLLNSVRLGIANEYQGLFLLNIIQEILMGTQEIEMKKQNMGLLNENRINIITNGHMPLLAHVAIDLASTDEWQQKAKNVGADGIQILGHVCEGQQLINYSGTHNQKAYAGQEGEWLSEEYLLATGVIDLFMFDYNCTIPTLPLYAEKFGTKLLSTHPVIKLQGTETLDFVPEKMKEQAEKALNMALEAFKERKKSNKEIYIPPHVSECMVGFSTESVKGALGGSFKPLIEQIVNGNIRGIATIVGCTTARYGQGGSNIFKITKGLIENNILVLSGGCTSSVMEYTGLTHPNAADEAGEGLKAVCKQLGIPPVLTYGACVDIGKMSQTAKEIADELDVDTNKLPLVIGAPEYLEQKAVADACTAVALGWLVHIAPVPSITGSDLVVKTLTETTESLGLGKVVVEMDAEKTIEIYKNHIEGKRKELGLDS >NC_004557.1|WP_023438026.1|1231664_1231820_+|hypothetical-protein MAKDSQPDNKKARMEKCNYQIPITSEEGKNQNRTLKKHSVKRKGFQSQHIN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_004557_2 | 1570766-1571383 | TypeI-B |
III-B
Consensus repeat of NC_004557_2
|
9 spacers
spacers of NC_004557_2
>2.1|1570796|37|NC_004557|CRISPRCasFinder,CRT GTTTTTGCTGCTTCTTCTGGTTGGTATTCATTATCTT >2.2|1570863|36|NC_004557|CRISPRCasFinder,CRT AGATAAATGAATTGTTTAGTTACTTAGAGGAAGGCA >2.3|1570929|34|NC_004557|CRISPRCasFinder,CRT GCTTAGGCTAGGAGCTACCTCTTTTTTTATTTTT >2.4|1570993|34|NC_004557|CRISPRCasFinder,CRT GTTTTGCAGAGGTTCGAGAAAAACTTAAATATTA >2.5|1571057|36|NC_004557|CRISPRCasFinder,CRT TGGTGCTAAATTAACAACTGTTAATCCAAATATAAA >2.6|1571123|35|NC_004557|CRISPRCasFinder,CRT TTTCTTGCAACCATAGCACATAGTTGCAGCATAAC >2.7|1571188|36|NC_004557|CRISPRCasFinder,CRT AGGTTGGGACTGTTGGGGAAATGAAGTAAATCTTAA >2.8|1571254|34|NC_004557|CRISPRCasFinder,CRT ACCCTAATTGTAGAACTACAATAGTTCCGTATTT >2.9|1571318|36|NC_004557|CRISPRCasFinder,CRT ACAGTAACATGAATACACTCATGTTACTGTTTTTCA >2.10|1570798|37|NC_004557|PILER-CR GTTTTTGCTGCTTCTTCTGGTTGGTATTCATTATCTT >2.11|1570865|36|NC_004557|PILER-CR AGATAAATGAATTGTTTAGTTACTTAGAGGAAGGCA >2.12|1570931|34|NC_004557|PILER-CR GCTTAGGCTAGGAGCTACCTCTTTTTTTATTTTT >2.13|1570995|34|NC_004557|PILER-CR GTTTTGCAGAGGTTCGAGAAAAACTTAAATATTA >2.14|1571059|36|NC_004557|PILER-CR TGGTGCTAAATTAACAACTGTTAATCCAAATATAAA >2.15|1571125|35|NC_004557|PILER-CR TTTCTTGCAACCATAGCACATAGTTGCAGCATAAC >2.16|1571190|36|NC_004557|PILER-CR AGGTTGGGACTGTTGGGGAAATGAAGTAAATCTTAA >2.17|1571256|34|NC_004557|PILER-CR ACCCTAATTGTAGAACTACAATAGTTCCGTATTT >2.18|1571320|36|NC_004557|PILER-CR ACAGTAACATGAATACACTCATGTTACTGTTTTTCA |
cas2,cas1,cas4 |
CRISPR arrays and Neighbor proteins around NC_004557_2
The CRISPR arrays of NC_004557_2 >merge|NC_004557|2|1570766-1571383|CRISPRCasFinder,CRT,PILER-CR TATTAAATACAACTCTTGTTATTGTTCAACGTTTTTGCTGCTTCTTCTGGTTGGTATTCATTATCTTATTTAAATACAACTCTTGTTATTGTTCAACAGATAAATGAATTGTTTAGTTACTTAGAGGAAGGCAATTTAAATACAACTCTTGTTATTGTTCAACGCTTAGGCTAGGAGCTACCTCTTTTTTTATTTTTATTTAAATACAACTCTTGTTATTGTTCAACGTTTTGCAGAGGTTCGAGAAAAACTTAAATATTAATTTAAATACAACTCTTGTTATTGTTCAACTGGTGCTAAATTAACAACTGTTAATCCAAATATAAAATTTAAATACAACTCTTGTTATTGTTCAACTTTCTTGCAACCATAGCACATAGTTGCAGCATAACATTTAAATACAACTCTTGTTATTGTTCAACAGGTTGGGACTGTTGGGGAAATGAAGTAAATCTTAAATTTAAATACAACTCTTGTTATTGTTCAACACCCTAATTGTAGAACTACAATAGTTCCGTATTTATTTAAATACAACTCTTGTTATTGTTCAACACAGTAACATGAATACACTCATGTTACTGTTTTTCAATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|2|2|1570766-1571383|CRISPRCasFinder TATTAAATACAACTCTTGTTATTGTTCAAC GTTTTTGCTGCTTCTTCTGGTTGGTATTCATTATCTT ATTTAAATACAACTCTTGTTATTGTTCAAC AGATAAATGAATTGTTTAGTTACTTAGAGGAAGGCA ATTTAAATACAACTCTTGTTATTGTTCAAC GCTTAGGCTAGGAGCTACCTCTTTTTTTATTTTT ATTTAAATACAACTCTTGTTATTGTTCAAC GTTTTGCAGAGGTTCGAGAAAAACTTAAATATTA ATTTAAATACAACTCTTGTTATTGTTCAAC TGGTGCTAAATTAACAACTGTTAATCCAAATATAAA ATTTAAATACAACTCTTGTTATTGTTCAAC TTTCTTGCAACCATAGCACATAGTTGCAGCATAAC ATTTAAATACAACTCTTGTTATTGTTCAAC AGGTTGGGACTGTTGGGGAAATGAAGTAAATCTTAA ATTTAAATACAACTCTTGTTATTGTTCAAC ACCCTAATTGTAGAACTACAATAGTTCCGTATTT ATTTAAATACAACTCTTGTTATTGTTCAAC ACAGTAACATGAATACACTCATGTTACTGTTTTTCA ATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|2|2|1570766-1571383|CRT TATTAAATACAACTCTTGTTATTGTTCAAC GTTTTTGCTGCTTCTTCTGGTTGGTATTCATTATCTT ATTTAAATACAACTCTTGTTATTGTTCAAC AGATAAATGAATTGTTTAGTTACTTAGAGGAAGGCA ATTTAAATACAACTCTTGTTATTGTTCAAC GCTTAGGCTAGGAGCTACCTCTTTTTTTATTTTT ATTTAAATACAACTCTTGTTATTGTTCAAC GTTTTGCAGAGGTTCGAGAAAAACTTAAATATTA ATTTAAATACAACTCTTGTTATTGTTCAAC TGGTGCTAAATTAACAACTGTTAATCCAAATATAAA ATTTAAATACAACTCTTGTTATTGTTCAAC TTTCTTGCAACCATAGCACATAGTTGCAGCATAAC ATTTAAATACAACTCTTGTTATTGTTCAAC AGGTTGGGACTGTTGGGGAAATGAAGTAAATCTTAA ATTTAAATACAACTCTTGTTATTGTTCAAC ACCCTAATTGTAGAACTACAATAGTTCCGTATTT ATTTAAATACAACTCTTGTTATTGTTCAAC ACAGTAACATGAATACACTCATGTTACTGTTTTTCA ATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|2|2|1570768-1571383|PILER-CR TTAAATACAACTCTTGTTATTGTTCAACGT TTTTGCTGCTTCTTCTGGTTGGTATTCATTATCTTAT TTAAATACAACTCTTGTTATTGTTCAACAG ATAAATGAATTGTTTAGTTACTTAGAGGAAGGCAAT TTAAATACAACTCTTGTTATTGTTCAACGC TTAGGCTAGGAGCTACCTCTTTTTTTATTTTTAT TTAAATACAACTCTTGTTATTGTTCAACGT TTTGCAGAGGTTCGAGAAAAACTTAAATATTAAT TTAAATACAACTCTTGTTATTGTTCAACTG GTGCTAAATTAACAACTGTTAATCCAAATATAAAAT TTAAATACAACTCTTGTTATTGTTCAACTT TCTTGCAACCATAGCACATAGTTGCAGCATAACAT TTAAATACAACTCTTGTTATTGTTCAACAG GTTGGGACTGTTGGGGAAATGAAGTAAATCTTAAAT TTAAATACAACTCTTGTTATTGTTCAACAC CCTAATTGTAGAACTACAATAGTTCCGTATTTAT TTAAATACAACTCTTGTTATTGTTCAACAC AGTAACATGAATACACTCATGTTACTGTTTTTCAAT TTAAATACAACTCTTGTTATTGTTCAAC
>NC_004557.1|WP_023438394.1|1569373_1570516_-|iron-containing-alcohol-dehydrogenase MKEFSINTDVYFGEGSLDRLNEIKNKRVLIVCDKFMETSGMVTKVQQKLTDCEVTIYSDIVPDPSVEVIASGIQKLQSCNAQIIIALGGGSSIDGAKAIKEYSKKVTGKTINIEEFYAIPTTSGTGSEVTEYAVITNKQEGLKYAITDKSLLPTVAILDPQLVKSVPKAITADTGMDVITHALEAYVSKNATDFSDALAEKAFTLAFRFLPQAYADGEDIIAREKLHNASCLAGMAFNAAGLGITHSLAHAVGGKLHISHGRSNAIILPYVVEYNANLNKESFNAEYSIAAKKYQRLAKLLKLHAPNVTIGVNNLIKSIVKLQNTLMIPQTLKQQREDINLDETSKEEIINAALRDVCTTSNPRETKKEDFLKILDKVLG >NC_004557.1|WP_023438393.1|1568460_1569291_-|MerR-family-transcriptional-regulator MKEELYSIGKVGEICKITKKALRYYDKMDILSPDKVSDESGYRYYSKKTLLSVPMIKYYKQSGFKLEEMKVFLEGETYDFFHKSFRNKIDELKELEKEINLKIRSVKDWDDLIVEAQNVIENNVCDVAIKYIDNKTLTFLDQEFKYDYMDSIINIEFTNYIDSIENAITGPVIIRFPCHEDKMNGKCTKMRIMQETILKCKEELSVEFGGWMAAACYHIGPHETISDTYKKIKEWTKEHGYICFEECYERYVTDYWTTKNTDKFVTEILIKISRER >NC_004557.1|WP_011099669.1|1566901_1568017_-|membrane-protein MSENSKTVSLEAIAAKKKLSSDFFKKGISLALFSGLAYGLYTAFLTMGMTKGVWGDWYGDNTAGLSVFVIAYLLAALGNAINDTCSAIWSLLYAVVKGKFGDFLRCINTKPGRIMILAALIGGPIASTAYVIALQMAGSIVVPISALCPAIAAILGKVLYKQELNKRMAFGIVICVCASFLIGSTGFTSDGISRNTLLGLLIAIIAALGWGFEGCVAGYGTAMIDPEIGICIRQVTAGIADLCILLPVLGMMAGGINISVDLTMQAFTSAPAMIWFTLSGLLTFMTFMTWYAGNSMCGAGLGTACNGTYSFFGPLFCLLVLGVYGGMDGWALPTVAWIGAVVMIIGILIISMNPLDLFKRKKMEVDVDETA >NC_004557.1|WP_023438392.1|1566648_1566915_-|hypothetical-protein MKPLNYAILKHFTKVPEACVDDVIEALKGEYGHFKALNRKAVTNALMTAEANGLIEEVRFDLDENKQLRVYYHAHKEGADTINKYIPD >NC_004557.1|WP_023438391.1|1566245_1566542_-|BMC-domain-containing-protein MRYYGDEALGLVETIGLVPALEAADKMLKAANVELISYENIGSTLVTIMVKGDVAAVKASVEAGAKAAAAIGKLTAHNVMPRPIREVGDIVSVHDIDL >NC_004557.1|WP_035125179.1|1565915_1566224_-|BMC-domain-containing-protein MARYRALGLIETFGLVFALEAADAMCKAANVELIGYENVASGYISVLVSGDVGACRSAVDAGVAAVNGMEGGNLYSSIVIPSPHEELEKIIKRYSITTLIPE >NC_004557.1|WP_011099667.1|1563236_1565777_-|choline-trimethylamine-lyase MDIREFSNMLMEATKNMSDEERNGLMNMFQSISKEIKKEEKVTSNVVFNNNGEIPDGMTERLIKLKENYMKQVPSITTHRARAITKIAKENPGVPKSVLRGKCFKYCCETAPLVIQDNELIVGAPNGKPRAGAFSPDIAWRWMEDEIDTIANRPQDPFYISEEDKKIMREELFPYWKGKSVDEYCEDQYREAGVWELSGESFVSDCSYHAVNGGGDSNPGYDVILMKKGMLDIKREAEEKLASLSYERPEDIEKIYFYKSIIDTAEGVMIYAKRMSDYAAELAAKETDPKRKAELQKISKVNARVPAHKPSTFWEAIQAVWTIESLLVVEENQTGMSIGRVDQYMYPFYKSDIESGRMTDFEAFELAGCMLIKMSEMMWITSEGGSKFFAGYQPFVNMCVGGVTREGRDATNELTYLLMDAVRHVKIYQPSLACRIHKGSPQKYLKKIVDVIRAGMGFPACHFDDVHIKMMLAKGVSIEDARDYCLMGCVEPQKSGRLYQWTSTGYTQWPICIELVLNHGVPLWYGKQVCPDMGDLSQFKTYEQFEGAVREQIKYITKWTAVATTISQRVHRELAPKPLMSMMYEGCMEKGRGVEAGGAMYNFGPGVVWSGLATYTDSMAAIKKLVFEEKKYTLEELSEALKADFVGYERLRKDCLEAPKYGNDDDYADYIAADLVNFTEQEHRKYKTLYSVLSHGTLSISNNTPFGQMTGATANGRRAWMPLSDGISPSQGSDFKGPTSIIKSVSKISCEDMNIGMVHNFKLMSGLLDTPEGEQGIIALLRSACALQLGEIQFNYLDNETLIEAQKHPEQYRDLIVRVAGYSAFFVELCKDVQDEIISRTMLTHF >NC_004557.1|WP_023438388.1|1562261_1563215_-|choline-TMA-lyase-activating-enzyme MSNGNLGVIKEKATVFNIQKYSIYDGDGIRTLVFFQGCPLRCKWCSNPEGLIKKHRVMFKSNLCVNCGACVSVCPVSIHTLSNETLKHEINRNIDCIGCGKCKDACLKSAISIVGEEKTISELLKIVEEDRVFYEMSGGGVTLGGGEVLMQPKAASSLLMACKQEGINTAIETCGYTNLETILKVAESVDLFLFDIKHINPDRHFELTGVRNEQILENLQELLRRKYNVKIRMPLLKDINNSKEEIEATMEFLTPYKDYKNFKGIDLLPYHKMGVNKYNQLGIEYPIKGDPSLNDEELDRIEEWIKKYDLHVKVIRH >NC_004557.1|WP_011099665.1|1561868_1562222_-|BMC-domain-containing-protein MGDFENRQIQRVIQESVPGKQITIAHVIASPMADIYERLGIDECGAIGILTLSPFETAIIAADIATKASDVEIGFLDRFTGSVVISGDVQSVETALNAVNNTLKNMLGFTPALITRT >NC_004557.1|WP_011099664.1|1561428_1561872_-|lysine-sensitive-aspartokinase-III MIKKRIMVIGSSGSGKTTIVNALNDYNGPLRRTPDLIYGKNTIDVPGAYIENPWMYKHIIALAQNSASCIVILVDQSNCTEVYPHGFAKSFRCPVIGVVTKCDLMPENKEKCLGQLKDIGVVEPYFHISLKTGIDALKKYLLKKCKE >NC_004557.1|WP_035125177.1|1571587_1572091_-|hypothetical-protein MNKTKKLPIIILLAVIVMFSGVNIYRRIDANRLKSKKTSISCIERIKDEKFNDNNVSFSFKKLNGVWQLLLLDSKKDDEITIINNSKIDEGKFYIGVLNSENEIIAFDKEKQDKITFVTPEEGCYLVRILAKNSSGKCDVKVDSKKGIDLNYNSINGHNMGLLEKNN >NC_004557.1|WP_011099673.1|1572395_1572800_+|RDD-family-protein MVLIIINFNRTVLYRIIASFIDDSALLLLYMFFTNIINKNNSSFVYVLLLLVSFISIEICFFIKSTSLGKFIMGLKVIDKTSSLELGFIKMLIRETFGKVLSNILFIGNIYILFNDSNQGFHDKLVNSIVIEND >NC_004557.1|WP_035125175.1|1573867_1574935_-|tRNA-2-thiouridine(34)-synthase-MnmA MKKKVLVGMSGGVDSSVAAYLLKEQGYEVIGATMQIWQDDKEFIEREGGCCSLSAVADARRVANKIGIPFYVMNFKDAFKKNVIDYFVDEYMEGRTPNPCVACNKFIKFSSFLDKAMTLGIDYVATGHYAIIEKQNNRYIVRKSEDDKKDQTYALYNLTQFQLERTLMPCGRYKKSEIREIAKKIGLRVHNKKDSQEICFIPDNDHGKYIKNRFPSKVRQGNFVDKSGNVLGTHKGIVYYTIGQRKGLDIALGKPMYVVDINPFRNEVVLGNLDDLLNTELIAKDVNYIPFDNLKEPMEVEAKIRYSQIPSKAVITPMENDKVKVNFTEKQRAITKGQSVVFYKGDLLIGGGIIE >NC_004557.1|WP_011099676.1|1574977_1576114_-|cysteine-desulfurase MKKNIYMDYAATTYIKKEVIEAMMPYLTEYYANPSSVYNMSNNLKIVIDEAKEEIADFIGATPEEVFFTSGGTEGNNWAIKGIAYANEEKGKHIITSSIEHPAVLNSCKYLKEKGFEITFLPVDSYGKVDLEKLEKSIRNDTILVSIMAANNEIGTIQHIKSIGEICKRHKVLFHTDAVQALGHIPINAEEMDIDLMTIAAHKIYGPKGIGALYIKKGTKIENILHGGSQERGKRPGTENTAAIVGFKKAVSLLKENGLEESKRIEKLRDKFIKGLLQIENTKINGAMGKERLKGNVNVSFKNIDGELLLMLLDREGIYASAGSACSAGSIDASHVLVALGLEDEFLKGTIRFTLGARNTEEEVDFVLEKLNQLIKKI >NC_004557.1|WP_023438397.1|1576489_1577302_+|hypothetical-protein MFKSLSKKVIISVVILIVVLLGVLGMKNYNKSKDYKNLVTTANEYMNEKDYDKAMDKFKESLDYKKDQKAEEKLEECKNELINLSKEALKNKEYEKADNYLNVLLKHDGKNEEAIKMKNTIKDEIQKSKEEEEIKKAIEKEKREQELKKQMDKEEQAKKKTGITEQNKKKEVKENNKITKEKAESLVQPLKNKNEEIRYLGTRQVPEIPAKSTPYKKFPKEIENKKVYIFDIAVVYNSDSKATIGRYYVDFSGNIYKDTYPSNLECVKVK >NC_004557.1|WP_035109706.1|1577339_1577690_-|SdpI-family-protein MNILTNCIIGFIFIVIGLVLRAYPPQHINNSLGYRTPFSIKNKDTWYEGNRFCGTILLISSIIFIPFSILIKYLYSNNLNLSMGISSLSLLIIIIIGIVYTEIHLRMMFDKNGTRK >NC_004557.1|WP_011099679.1|1578401_1579985_-|AAA-family-ATPase MKKRFNVTGTCIPERHYMVDISNKLDSILKLVNNEEYFIINRPRQYGKTTTLYMLERYLNKFKDYLVISISFEGIGDLIFQDEKVFSKEFLQIMSDSLLLNSQALSECLEEQKPHVENFIDLSRVITKFIVKAKRKVVLMIDEVDKSSNNQLFLSFLGLLRNKYLLRNVGKDYTFHNVILAGVHDVKSLKLKIRPDEEHKYNSPWNIASDFDVDMSFSIDEIKTMLDDYVENKEVNLDKEYFAEKIYFYTSGYPFLVSKLCKIVDEKIMVKDELKWEKEYLQIAVKELLKESNTNFDSLIKNIENNKDLQELVRKIILDGYEITYNEDNPLITMGVTYGIFKNSHGKVKIHNRIYEQRIYNYMISLIETKINLGFYTERERYLKPNGDLDIKKVLKKFQEFMKHEYSQKREGFLEEDGRLVFLAFLSPIINGAGFAFKEVQGGEEKRFDIVITYNKKMYILELKIWRGEEYHKKGLKQLVEYLNQYGLEEGYLLIFDFRKATNLIGQVEETHINAEDNIKKIIGVYC >NC_004557.1|WP_023438399.1|1580562_1580853_-|CRISPR-associated-endonuclease-Cas2 MSKNFNYNYAFVFYDVNEKRVNRVFKTCKKYLSHFQKSVFRGELTPANFILLKKDLNKVINEDEDFICIIKLMNNKVYDEEILGNPHSCTGEDLIL >NC_004557.1|WP_011099680.1|1580853_1581852_-|type-I-B-CRISPR-associated-endonuclease-Cas1 MGSTRYITSVGELKRKDNSLCFRKNNKNVYIPVENTKEIYCMSEVNINSKLLDFLSQNNIIMHFFNYYEGYSGTFYPREHYNSGKLLVKQVETYENRRLEVAKSIVEAIGDNIYELLYHYYKHDKKEVKETLDWIKNHSKINLKKANDIKQIMQVEGETWQRFYGEFKNILPEEFVMNKRVKRPPDNPINALISFGNTLLYGKTITAIYNTHLDQRISFLHEPSEGRFSLSLDISEAFKPVIVFKTIFDLVNNKRIQVSKHFDKKLNYCLLNDEGRNIFITAFEERMESIFLNEKLKRKISYKTAIKLDCYKLIKFILENKEFKPFSLKERM >NC_004557.1|WP_011099681.1|1581861_1582353_-|CRISPR-associated-protein-Cas4 MKVNGTLVNYYFHCKRQCWLHGNRINLEDNSQDVKIGKAIHEVKKEKGKQTEISIDNIKIDKITKDYLTEVKKSDSDIEAAKWQLLLYLKVLKDKGIERKGKLEFIEKNKSKSTIIIELDENNLSELEDVIKNIENLLIQENPPEVINESKCKKCAYFEYCYI |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_004557_3 | 1573235-1573655 | TypeI-B |
III-B
Consensus repeat of NC_004557_3
|
6 spacers
spacers of NC_004557_3
>3.1|1573265|34|NC_004557|CRISPRCasFinder,CRT TTAATCCAGATAAAATATATTCTCTTACAGCAAT >3.2|1573329|34|NC_004557|CRISPRCasFinder,CRT,PILER-CR GTGTGTTCCGTATCAATCTAGGTCGGCAAACTTC >3.3|1573393|35|NC_004557|CRISPRCasFinder,CRT,PILER-CR AGATGTTTTAACAACGATAATGAATGCTTACAAAA >3.4|1573458|37|NC_004557|CRISPRCasFinder,CRT,PILER-CR TATTAATTCTCCTTTTGAGCTATGCTCATATATATTT >3.5|1573525|35|NC_004557|CRISPRCasFinder,CRT,PILER-CR GAGCTACAAGATAAATACAAAGATGTGGATTTAGT >3.6|1573590|36|NC_004557|CRISPRCasFinder,CRT,PILER-CR ATATGCAATAGCCATATTTCAAAGATATTCAAAGGA |
cas2,cas1,cas4,cas3,cas5 |
CRISPR arrays and Neighbor proteins around NC_004557_3
The CRISPR arrays of NC_004557_3 >merge|NC_004557|3|1573235-1573655|CRISPRCasFinder,CRT,PILER-CR ATTTAAAAACATCCTATGTTATTGTTCAACTTAATCCAGATAAAATATATTCTCTTACAGCAATATTTAAATACAACTCTTGTTATTGTTCAACGTGTGTTCCGTATCAATCTAGGTCGGCAAACTTCATTTAAATACAACTCTTGTTATTGTTCAACAGATGTTTTAACAACGATAATGAATGCTTACAAAAATTTAAATACAACTCTTGTTATTGTTCAACTATTAATTCTCCTTTTGAGCTATGCTCATATATATTTATTTAAATACAACTCTTGTTATTGTTCAACGAGCTACAAGATAAATACAAAGATGTGGATTTAGTATTTAAATACAACTCTTGTTATTGTTCAACATATGCAATAGCCATATTTCAAAGATATTCAAAGGAATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|3|3|1573235-1573655|CRISPRCasFinder ATTTAAAAACATCCTATGTTATTGTTCAAC TTAATCCAGATAAAATATATTCTCTTACAGCAAT ATTTAAATACAACTCTTGTTATTGTTCAAC GTGTGTTCCGTATCAATCTAGGTCGGCAAACTTC ATTTAAATACAACTCTTGTTATTGTTCAAC AGATGTTTTAACAACGATAATGAATGCTTACAAAA ATTTAAATACAACTCTTGTTATTGTTCAAC TATTAATTCTCCTTTTGAGCTATGCTCATATATATTT ATTTAAATACAACTCTTGTTATTGTTCAAC GAGCTACAAGATAAATACAAAGATGTGGATTTAGT ATTTAAATACAACTCTTGTTATTGTTCAAC ATATGCAATAGCCATATTTCAAAGATATTCAAAGGA ATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|3|3|1573235-1573655|CRT ATTTAAAAACATCCTATGTTATTGTTCAAC TTAATCCAGATAAAATATATTCTCTTACAGCAAT ATTTAAATACAACTCTTGTTATTGTTCAAC GTGTGTTCCGTATCAATCTAGGTCGGCAAACTTC ATTTAAATACAACTCTTGTTATTGTTCAAC AGATGTTTTAACAACGATAATGAATGCTTACAAAA ATTTAAATACAACTCTTGTTATTGTTCAAC TATTAATTCTCCTTTTGAGCTATGCTCATATATATTT ATTTAAATACAACTCTTGTTATTGTTCAAC GAGCTACAAGATAAATACAAAGATGTGGATTTAGT ATTTAAATACAACTCTTGTTATTGTTCAAC ATATGCAATAGCCATATTTCAAAGATATTCAAAGGA ATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|3|3|1573299-1573655|PILER-CR ATTTAAATACAACTCTTGTTATTGTTCAAC GTGTGTTCCGTATCAATCTAGGTCGGCAAACTTC ATTTAAATACAACTCTTGTTATTGTTCAAC AGATGTTTTAACAACGATAATGAATGCTTACAAAA ATTTAAATACAACTCTTGTTATTGTTCAAC TATTAATTCTCCTTTTGAGCTATGCTCATATATATTT ATTTAAATACAACTCTTGTTATTGTTCAAC GAGCTACAAGATAAATACAAAGATGTGGATTTAGT ATTTAAATACAACTCTTGTTATTGTTCAAC ATATGCAATAGCCATATTTCAAAGATATTCAAAGGA ATTTAAATACAACTCTTGTTATTGTTCAAC
>NC_004557.1|WP_011099673.1|1572395_1572800_+|RDD-family-protein MVLIIINFNRTVLYRIIASFIDDSALLLLYMFFTNIINKNNSSFVYVLLLLVSFISIEICFFIKSTSLGKFIMGLKVIDKTSSLELGFIKMLIRETFGKVLSNILFIGNIYILFNDSNQGFHDKLVNSIVIEND >NC_004557.1|WP_035125177.1|1571587_1572091_-|hypothetical-protein MNKTKKLPIIILLAVIVMFSGVNIYRRIDANRLKSKKTSISCIERIKDEKFNDNNVSFSFKKLNGVWQLLLLDSKKDDEITIINNSKIDEGKFYIGVLNSENEIIAFDKEKQDKITFVTPEEGCYLVRILAKNSSGKCDVKVDSKKGIDLNYNSINGHNMGLLEKNN >NC_004557.1|WP_023438394.1|1569373_1570516_-|iron-containing-alcohol-dehydrogenase MKEFSINTDVYFGEGSLDRLNEIKNKRVLIVCDKFMETSGMVTKVQQKLTDCEVTIYSDIVPDPSVEVIASGIQKLQSCNAQIIIALGGGSSIDGAKAIKEYSKKVTGKTINIEEFYAIPTTSGTGSEVTEYAVITNKQEGLKYAITDKSLLPTVAILDPQLVKSVPKAITADTGMDVITHALEAYVSKNATDFSDALAEKAFTLAFRFLPQAYADGEDIIAREKLHNASCLAGMAFNAAGLGITHSLAHAVGGKLHISHGRSNAIILPYVVEYNANLNKESFNAEYSIAAKKYQRLAKLLKLHAPNVTIGVNNLIKSIVKLQNTLMIPQTLKQQREDINLDETSKEEIINAALRDVCTTSNPRETKKEDFLKILDKVLG >NC_004557.1|WP_023438393.1|1568460_1569291_-|MerR-family-transcriptional-regulator MKEELYSIGKVGEICKITKKALRYYDKMDILSPDKVSDESGYRYYSKKTLLSVPMIKYYKQSGFKLEEMKVFLEGETYDFFHKSFRNKIDELKELEKEINLKIRSVKDWDDLIVEAQNVIENNVCDVAIKYIDNKTLTFLDQEFKYDYMDSIINIEFTNYIDSIENAITGPVIIRFPCHEDKMNGKCTKMRIMQETILKCKEELSVEFGGWMAAACYHIGPHETISDTYKKIKEWTKEHGYICFEECYERYVTDYWTTKNTDKFVTEILIKISRER >NC_004557.1|WP_011099669.1|1566901_1568017_-|membrane-protein MSENSKTVSLEAIAAKKKLSSDFFKKGISLALFSGLAYGLYTAFLTMGMTKGVWGDWYGDNTAGLSVFVIAYLLAALGNAINDTCSAIWSLLYAVVKGKFGDFLRCINTKPGRIMILAALIGGPIASTAYVIALQMAGSIVVPISALCPAIAAILGKVLYKQELNKRMAFGIVICVCASFLIGSTGFTSDGISRNTLLGLLIAIIAALGWGFEGCVAGYGTAMIDPEIGICIRQVTAGIADLCILLPVLGMMAGGINISVDLTMQAFTSAPAMIWFTLSGLLTFMTFMTWYAGNSMCGAGLGTACNGTYSFFGPLFCLLVLGVYGGMDGWALPTVAWIGAVVMIIGILIISMNPLDLFKRKKMEVDVDETA >NC_004557.1|WP_023438392.1|1566648_1566915_-|hypothetical-protein MKPLNYAILKHFTKVPEACVDDVIEALKGEYGHFKALNRKAVTNALMTAEANGLIEEVRFDLDENKQLRVYYHAHKEGADTINKYIPD >NC_004557.1|WP_023438391.1|1566245_1566542_-|BMC-domain-containing-protein MRYYGDEALGLVETIGLVPALEAADKMLKAANVELISYENIGSTLVTIMVKGDVAAVKASVEAGAKAAAAIGKLTAHNVMPRPIREVGDIVSVHDIDL >NC_004557.1|WP_035125179.1|1565915_1566224_-|BMC-domain-containing-protein MARYRALGLIETFGLVFALEAADAMCKAANVELIGYENVASGYISVLVSGDVGACRSAVDAGVAAVNGMEGGNLYSSIVIPSPHEELEKIIKRYSITTLIPE >NC_004557.1|WP_011099667.1|1563236_1565777_-|choline-trimethylamine-lyase MDIREFSNMLMEATKNMSDEERNGLMNMFQSISKEIKKEEKVTSNVVFNNNGEIPDGMTERLIKLKENYMKQVPSITTHRARAITKIAKENPGVPKSVLRGKCFKYCCETAPLVIQDNELIVGAPNGKPRAGAFSPDIAWRWMEDEIDTIANRPQDPFYISEEDKKIMREELFPYWKGKSVDEYCEDQYREAGVWELSGESFVSDCSYHAVNGGGDSNPGYDVILMKKGMLDIKREAEEKLASLSYERPEDIEKIYFYKSIIDTAEGVMIYAKRMSDYAAELAAKETDPKRKAELQKISKVNARVPAHKPSTFWEAIQAVWTIESLLVVEENQTGMSIGRVDQYMYPFYKSDIESGRMTDFEAFELAGCMLIKMSEMMWITSEGGSKFFAGYQPFVNMCVGGVTREGRDATNELTYLLMDAVRHVKIYQPSLACRIHKGSPQKYLKKIVDVIRAGMGFPACHFDDVHIKMMLAKGVSIEDARDYCLMGCVEPQKSGRLYQWTSTGYTQWPICIELVLNHGVPLWYGKQVCPDMGDLSQFKTYEQFEGAVREQIKYITKWTAVATTISQRVHRELAPKPLMSMMYEGCMEKGRGVEAGGAMYNFGPGVVWSGLATYTDSMAAIKKLVFEEKKYTLEELSEALKADFVGYERLRKDCLEAPKYGNDDDYADYIAADLVNFTEQEHRKYKTLYSVLSHGTLSISNNTPFGQMTGATANGRRAWMPLSDGISPSQGSDFKGPTSIIKSVSKISCEDMNIGMVHNFKLMSGLLDTPEGEQGIIALLRSACALQLGEIQFNYLDNETLIEAQKHPEQYRDLIVRVAGYSAFFVELCKDVQDEIISRTMLTHF >NC_004557.1|WP_023438388.1|1562261_1563215_-|choline-TMA-lyase-activating-enzyme MSNGNLGVIKEKATVFNIQKYSIYDGDGIRTLVFFQGCPLRCKWCSNPEGLIKKHRVMFKSNLCVNCGACVSVCPVSIHTLSNETLKHEINRNIDCIGCGKCKDACLKSAISIVGEEKTISELLKIVEEDRVFYEMSGGGVTLGGGEVLMQPKAASSLLMACKQEGINTAIETCGYTNLETILKVAESVDLFLFDIKHINPDRHFELTGVRNEQILENLQELLRRKYNVKIRMPLLKDINNSKEEIEATMEFLTPYKDYKNFKGIDLLPYHKMGVNKYNQLGIEYPIKGDPSLNDEELDRIEEWIKKYDLHVKVIRH >NC_004557.1|WP_035125175.1|1573867_1574935_-|tRNA-2-thiouridine(34)-synthase-MnmA MKKKVLVGMSGGVDSSVAAYLLKEQGYEVIGATMQIWQDDKEFIEREGGCCSLSAVADARRVANKIGIPFYVMNFKDAFKKNVIDYFVDEYMEGRTPNPCVACNKFIKFSSFLDKAMTLGIDYVATGHYAIIEKQNNRYIVRKSEDDKKDQTYALYNLTQFQLERTLMPCGRYKKSEIREIAKKIGLRVHNKKDSQEICFIPDNDHGKYIKNRFPSKVRQGNFVDKSGNVLGTHKGIVYYTIGQRKGLDIALGKPMYVVDINPFRNEVVLGNLDDLLNTELIAKDVNYIPFDNLKEPMEVEAKIRYSQIPSKAVITPMENDKVKVNFTEKQRAITKGQSVVFYKGDLLIGGGIIE >NC_004557.1|WP_011099676.1|1574977_1576114_-|cysteine-desulfurase MKKNIYMDYAATTYIKKEVIEAMMPYLTEYYANPSSVYNMSNNLKIVIDEAKEEIADFIGATPEEVFFTSGGTEGNNWAIKGIAYANEEKGKHIITSSIEHPAVLNSCKYLKEKGFEITFLPVDSYGKVDLEKLEKSIRNDTILVSIMAANNEIGTIQHIKSIGEICKRHKVLFHTDAVQALGHIPINAEEMDIDLMTIAAHKIYGPKGIGALYIKKGTKIENILHGGSQERGKRPGTENTAAIVGFKKAVSLLKENGLEESKRIEKLRDKFIKGLLQIENTKINGAMGKERLKGNVNVSFKNIDGELLLMLLDREGIYASAGSACSAGSIDASHVLVALGLEDEFLKGTIRFTLGARNTEEEVDFVLEKLNQLIKKI >NC_004557.1|WP_023438397.1|1576489_1577302_+|hypothetical-protein MFKSLSKKVIISVVILIVVLLGVLGMKNYNKSKDYKNLVTTANEYMNEKDYDKAMDKFKESLDYKKDQKAEEKLEECKNELINLSKEALKNKEYEKADNYLNVLLKHDGKNEEAIKMKNTIKDEIQKSKEEEEIKKAIEKEKREQELKKQMDKEEQAKKKTGITEQNKKKEVKENNKITKEKAESLVQPLKNKNEEIRYLGTRQVPEIPAKSTPYKKFPKEIENKKVYIFDIAVVYNSDSKATIGRYYVDFSGNIYKDTYPSNLECVKVK >NC_004557.1|WP_035109706.1|1577339_1577690_-|SdpI-family-protein MNILTNCIIGFIFIVIGLVLRAYPPQHINNSLGYRTPFSIKNKDTWYEGNRFCGTILLISSIIFIPFSILIKYLYSNNLNLSMGISSLSLLIIIIIGIVYTEIHLRMMFDKNGTRK >NC_004557.1|WP_011099679.1|1578401_1579985_-|AAA-family-ATPase MKKRFNVTGTCIPERHYMVDISNKLDSILKLVNNEEYFIINRPRQYGKTTTLYMLERYLNKFKDYLVISISFEGIGDLIFQDEKVFSKEFLQIMSDSLLLNSQALSECLEEQKPHVENFIDLSRVITKFIVKAKRKVVLMIDEVDKSSNNQLFLSFLGLLRNKYLLRNVGKDYTFHNVILAGVHDVKSLKLKIRPDEEHKYNSPWNIASDFDVDMSFSIDEIKTMLDDYVENKEVNLDKEYFAEKIYFYTSGYPFLVSKLCKIVDEKIMVKDELKWEKEYLQIAVKELLKESNTNFDSLIKNIENNKDLQELVRKIILDGYEITYNEDNPLITMGVTYGIFKNSHGKVKIHNRIYEQRIYNYMISLIETKINLGFYTERERYLKPNGDLDIKKVLKKFQEFMKHEYSQKREGFLEEDGRLVFLAFLSPIINGAGFAFKEVQGGEEKRFDIVITYNKKMYILELKIWRGEEYHKKGLKQLVEYLNQYGLEEGYLLIFDFRKATNLIGQVEETHINAEDNIKKIIGVYC >NC_004557.1|WP_023438399.1|1580562_1580853_-|CRISPR-associated-endonuclease-Cas2 MSKNFNYNYAFVFYDVNEKRVNRVFKTCKKYLSHFQKSVFRGELTPANFILLKKDLNKVINEDEDFICIIKLMNNKVYDEEILGNPHSCTGEDLIL >NC_004557.1|WP_011099680.1|1580853_1581852_-|type-I-B-CRISPR-associated-endonuclease-Cas1 MGSTRYITSVGELKRKDNSLCFRKNNKNVYIPVENTKEIYCMSEVNINSKLLDFLSQNNIIMHFFNYYEGYSGTFYPREHYNSGKLLVKQVETYENRRLEVAKSIVEAIGDNIYELLYHYYKHDKKEVKETLDWIKNHSKINLKKANDIKQIMQVEGETWQRFYGEFKNILPEEFVMNKRVKRPPDNPINALISFGNTLLYGKTITAIYNTHLDQRISFLHEPSEGRFSLSLDISEAFKPVIVFKTIFDLVNNKRIQVSKHFDKKLNYCLLNDEGRNIFITAFEERMESIFLNEKLKRKISYKTAIKLDCYKLIKFILENKEFKPFSLKERM >NC_004557.1|WP_011099681.1|1581861_1582353_-|CRISPR-associated-protein-Cas4 MKVNGTLVNYYFHCKRQCWLHGNRINLEDNSQDVKIGKAIHEVKKEKGKQTEISIDNIKIDKITKDYLTEVKKSDSDIEAAKWQLLLYLKVLKDKGIERKGKLEFIEKNKSKSTIIIELDENNLSELEDVIKNIENLLIQENPPEVINESKCKKCAYFEYCYI >NC_004557.1|WP_011099682.1|1582361_1584962_-|CRISPR-associated-helicase/endonuclease-Cas3 MYFNNIEKVNLENIIENNDKIYAHIHNGRKETLKEHSDLALKYLYKISERKSLDNVFLKIENNFLEKCSNEEKMVYRKMLLNTIYMHDLGKINCNFQRKKMANKIFKEEKMSSTNHSMLSSIIYINHFLKEIASIENGEHIKLLIAFLLLNSYVISKHHGAFNSVNKFKEKLVYDGEEGKDLYTKYMYIFDKVYKEEIIINESLIKEDLFDMYKSTIQEKTEENKDFPVELYIYERFLASLLLSCDYYSTSEFKNQKEVEEFGEIKNIEKFYKSFKSTEVYNWIRKYEKNDYGKTDDFSNIDDINVLRNELFLDAEKTMVSNIDKDIFYLEAPTGSGKSNVSFNLSFKMVERFKEINKIFYVYPFNTLVEQNIKTLEKIFKNNEIMKDIAIINSVVPIKIKSSKDNKIKEIDTNEESDILNEDYERALLDRQFLHYPIVLTTHVSIFNYLFGTSKDNLFPLCQIANSIIVLDEIQSYKNRIWKEIITFLACYSRLLNIKIIIMSATLPNLNKLVDGEIKTVNLIENRKKYFENPIFKNRVMVDFSLLEEKENIKEVLFNNVIKNTKAPNKNILVEFITKESAMDFYEKLKDYNKYLQESEKREIELITGDDNRVERNRIIDKIKSQKNIILVATQVIEAGVDIDMDIGYKDISMLDSEEQFLGRINRSCKNDEQGIVYFFDLDLASHVYKRDIRKQKNINLTCPKIREILINKNFQEFYDYVIKELNKKAGEYNNSSFQTFFLDKVKMLNFKEIEERMKLIDELYENNVFLNRNITLENEEELCGEDVWNEYIAILKNNKLDYAEKKIKLSQVTAKLNYFIYQISSDDFIYEDRVGDIYYIGDGEKYFEDGKFDRKKFKSIVADII >NC_004557.1|WP_011099683.1|1585018_1585789_-|type-I-B-CRISPR-associated-protein-Cas5 MDALKFSLSGRTAFFKKPDVNSFFYFTYGNVHKVALLGILGAICGYGGYNSQCLNKEQIYPEFYEKLKDINIGVVPKNEKGYIDKKIQVFNNSVGYASKELGGNLIVKEQWLENPKWAIYILMDENVPKDLKDRLLNFKFKYIPYLGKNDHMANITDVEYLENIEKLDNTNKLDSIFIKDKYEIQKESKNFNDLKNIIKKSSSKIQEFKYEEMLPISLEETTNKYNLETFIYTNSNLKPLADTKTYKCGDKNIFFF |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_004557_4 | 1577804-1577966 | TypeI-B |
III-B
Consensus repeat of NC_004557_4
|
2 spacers
spacers of NC_004557_4
>4.1|1577836|32|NC_004557|PILER-CR TTAAAGCTTCTACTAATTCTTTTTTATTCATT >4.2|1577900|36|NC_004557|PILER-CR GTACAAAACTTACCTCAAAACCATCTACCAGATTTA >4.3|1577835|34|NC_004557|CRISPRCasFinder TTAAAGCTTCTACTAATTCTTTTTTATTCATTGT >4.4|1577899|38|NC_004557|CRISPRCasFinder GTACAAAACTTACCTCAAAACCATCTACCAGATTTAGA |
cas2,cas1,cas4,cas3,cas5,cas7b,cas8b1,cas6 |
CRISPR arrays and Neighbor proteins around NC_004557_4
The CRISPR arrays of NC_004557_4 >merge|NC_004557|4|1577804-1577966|PILER-CR,CRISPRCasFinder AATTTAAATACATCCTATGTTAAGGTTCAACTTAAAGCTTCTACTAATTCTTTTTTATTCATTGTATTTAAATACATCCTATGTTAAGGTTCAACGTACAAAACTTACCTCAAAACCATCTACCAGATTTAGAATTTAAATACATCCTATGTTAAGGTTCAAC >NC_004557|4|4|1577804-1577966|PILER-CR AATTTAAATACATCCTATGTTAAGGTTCAACT TAAAGCTTCTACTAATTCTTTTTTATTCATTG TATTTAAATACATCCTATGTTAAGGTTCAACG TACAAAACTTACCTCAAAACCATCTACCAGATTTAG AATTTAAATACATCCTATGTTAAGGTTCAAC >NC_004557|4|4|1577805-1577966|CRISPRCasFinder ATTTAAATACATCCTATGTTAAGGTTCAAC TTAAAGCTTCTACTAATTCTTTTTTATTCATTGT ATTTAAATACATCCTATGTTAAGGTTCAAC GTACAAAACTTACCTCAAAACCATCTACCAGATTTAGA ATTTAAATACATCCTATGTTAAGGTTCAAC
>NC_004557.1|WP_035109706.1|1577339_1577690_-|SdpI-family-protein MNILTNCIIGFIFIVIGLVLRAYPPQHINNSLGYRTPFSIKNKDTWYEGNRFCGTILLISSIIFIPFSILIKYLYSNNLNLSMGISSLSLLIIIIIGIVYTEIHLRMMFDKNGTRK >NC_004557.1|WP_023438397.1|1576489_1577302_+|hypothetical-protein MFKSLSKKVIISVVILIVVLLGVLGMKNYNKSKDYKNLVTTANEYMNEKDYDKAMDKFKESLDYKKDQKAEEKLEECKNELINLSKEALKNKEYEKADNYLNVLLKHDGKNEEAIKMKNTIKDEIQKSKEEEEIKKAIEKEKREQELKKQMDKEEQAKKKTGITEQNKKKEVKENNKITKEKAESLVQPLKNKNEEIRYLGTRQVPEIPAKSTPYKKFPKEIENKKVYIFDIAVVYNSDSKATIGRYYVDFSGNIYKDTYPSNLECVKVK >NC_004557.1|WP_011099676.1|1574977_1576114_-|cysteine-desulfurase MKKNIYMDYAATTYIKKEVIEAMMPYLTEYYANPSSVYNMSNNLKIVIDEAKEEIADFIGATPEEVFFTSGGTEGNNWAIKGIAYANEEKGKHIITSSIEHPAVLNSCKYLKEKGFEITFLPVDSYGKVDLEKLEKSIRNDTILVSIMAANNEIGTIQHIKSIGEICKRHKVLFHTDAVQALGHIPINAEEMDIDLMTIAAHKIYGPKGIGALYIKKGTKIENILHGGSQERGKRPGTENTAAIVGFKKAVSLLKENGLEESKRIEKLRDKFIKGLLQIENTKINGAMGKERLKGNVNVSFKNIDGELLLMLLDREGIYASAGSACSAGSIDASHVLVALGLEDEFLKGTIRFTLGARNTEEEVDFVLEKLNQLIKKI >NC_004557.1|WP_035125175.1|1573867_1574935_-|tRNA-2-thiouridine(34)-synthase-MnmA MKKKVLVGMSGGVDSSVAAYLLKEQGYEVIGATMQIWQDDKEFIEREGGCCSLSAVADARRVANKIGIPFYVMNFKDAFKKNVIDYFVDEYMEGRTPNPCVACNKFIKFSSFLDKAMTLGIDYVATGHYAIIEKQNNRYIVRKSEDDKKDQTYALYNLTQFQLERTLMPCGRYKKSEIREIAKKIGLRVHNKKDSQEICFIPDNDHGKYIKNRFPSKVRQGNFVDKSGNVLGTHKGIVYYTIGQRKGLDIALGKPMYVVDINPFRNEVVLGNLDDLLNTELIAKDVNYIPFDNLKEPMEVEAKIRYSQIPSKAVITPMENDKVKVNFTEKQRAITKGQSVVFYKGDLLIGGGIIE >NC_004557.1|WP_011099673.1|1572395_1572800_+|RDD-family-protein MVLIIINFNRTVLYRIIASFIDDSALLLLYMFFTNIINKNNSSFVYVLLLLVSFISIEICFFIKSTSLGKFIMGLKVIDKTSSLELGFIKMLIRETFGKVLSNILFIGNIYILFNDSNQGFHDKLVNSIVIEND >NC_004557.1|WP_035125177.1|1571587_1572091_-|hypothetical-protein MNKTKKLPIIILLAVIVMFSGVNIYRRIDANRLKSKKTSISCIERIKDEKFNDNNVSFSFKKLNGVWQLLLLDSKKDDEITIINNSKIDEGKFYIGVLNSENEIIAFDKEKQDKITFVTPEEGCYLVRILAKNSSGKCDVKVDSKKGIDLNYNSINGHNMGLLEKNN >NC_004557.1|WP_023438394.1|1569373_1570516_-|iron-containing-alcohol-dehydrogenase MKEFSINTDVYFGEGSLDRLNEIKNKRVLIVCDKFMETSGMVTKVQQKLTDCEVTIYSDIVPDPSVEVIASGIQKLQSCNAQIIIALGGGSSIDGAKAIKEYSKKVTGKTINIEEFYAIPTTSGTGSEVTEYAVITNKQEGLKYAITDKSLLPTVAILDPQLVKSVPKAITADTGMDVITHALEAYVSKNATDFSDALAEKAFTLAFRFLPQAYADGEDIIAREKLHNASCLAGMAFNAAGLGITHSLAHAVGGKLHISHGRSNAIILPYVVEYNANLNKESFNAEYSIAAKKYQRLAKLLKLHAPNVTIGVNNLIKSIVKLQNTLMIPQTLKQQREDINLDETSKEEIINAALRDVCTTSNPRETKKEDFLKILDKVLG >NC_004557.1|WP_023438393.1|1568460_1569291_-|MerR-family-transcriptional-regulator MKEELYSIGKVGEICKITKKALRYYDKMDILSPDKVSDESGYRYYSKKTLLSVPMIKYYKQSGFKLEEMKVFLEGETYDFFHKSFRNKIDELKELEKEINLKIRSVKDWDDLIVEAQNVIENNVCDVAIKYIDNKTLTFLDQEFKYDYMDSIINIEFTNYIDSIENAITGPVIIRFPCHEDKMNGKCTKMRIMQETILKCKEELSVEFGGWMAAACYHIGPHETISDTYKKIKEWTKEHGYICFEECYERYVTDYWTTKNTDKFVTEILIKISRER >NC_004557.1|WP_011099669.1|1566901_1568017_-|membrane-protein MSENSKTVSLEAIAAKKKLSSDFFKKGISLALFSGLAYGLYTAFLTMGMTKGVWGDWYGDNTAGLSVFVIAYLLAALGNAINDTCSAIWSLLYAVVKGKFGDFLRCINTKPGRIMILAALIGGPIASTAYVIALQMAGSIVVPISALCPAIAAILGKVLYKQELNKRMAFGIVICVCASFLIGSTGFTSDGISRNTLLGLLIAIIAALGWGFEGCVAGYGTAMIDPEIGICIRQVTAGIADLCILLPVLGMMAGGINISVDLTMQAFTSAPAMIWFTLSGLLTFMTFMTWYAGNSMCGAGLGTACNGTYSFFGPLFCLLVLGVYGGMDGWALPTVAWIGAVVMIIGILIISMNPLDLFKRKKMEVDVDETA >NC_004557.1|WP_023438392.1|1566648_1566915_-|hypothetical-protein MKPLNYAILKHFTKVPEACVDDVIEALKGEYGHFKALNRKAVTNALMTAEANGLIEEVRFDLDENKQLRVYYHAHKEGADTINKYIPD >NC_004557.1|WP_011099679.1|1578401_1579985_-|AAA-family-ATPase MKKRFNVTGTCIPERHYMVDISNKLDSILKLVNNEEYFIINRPRQYGKTTTLYMLERYLNKFKDYLVISISFEGIGDLIFQDEKVFSKEFLQIMSDSLLLNSQALSECLEEQKPHVENFIDLSRVITKFIVKAKRKVVLMIDEVDKSSNNQLFLSFLGLLRNKYLLRNVGKDYTFHNVILAGVHDVKSLKLKIRPDEEHKYNSPWNIASDFDVDMSFSIDEIKTMLDDYVENKEVNLDKEYFAEKIYFYTSGYPFLVSKLCKIVDEKIMVKDELKWEKEYLQIAVKELLKESNTNFDSLIKNIENNKDLQELVRKIILDGYEITYNEDNPLITMGVTYGIFKNSHGKVKIHNRIYEQRIYNYMISLIETKINLGFYTERERYLKPNGDLDIKKVLKKFQEFMKHEYSQKREGFLEEDGRLVFLAFLSPIINGAGFAFKEVQGGEEKRFDIVITYNKKMYILELKIWRGEEYHKKGLKQLVEYLNQYGLEEGYLLIFDFRKATNLIGQVEETHINAEDNIKKIIGVYC >NC_004557.1|WP_023438399.1|1580562_1580853_-|CRISPR-associated-endonuclease-Cas2 MSKNFNYNYAFVFYDVNEKRVNRVFKTCKKYLSHFQKSVFRGELTPANFILLKKDLNKVINEDEDFICIIKLMNNKVYDEEILGNPHSCTGEDLIL >NC_004557.1|WP_011099680.1|1580853_1581852_-|type-I-B-CRISPR-associated-endonuclease-Cas1 MGSTRYITSVGELKRKDNSLCFRKNNKNVYIPVENTKEIYCMSEVNINSKLLDFLSQNNIIMHFFNYYEGYSGTFYPREHYNSGKLLVKQVETYENRRLEVAKSIVEAIGDNIYELLYHYYKHDKKEVKETLDWIKNHSKINLKKANDIKQIMQVEGETWQRFYGEFKNILPEEFVMNKRVKRPPDNPINALISFGNTLLYGKTITAIYNTHLDQRISFLHEPSEGRFSLSLDISEAFKPVIVFKTIFDLVNNKRIQVSKHFDKKLNYCLLNDEGRNIFITAFEERMESIFLNEKLKRKISYKTAIKLDCYKLIKFILENKEFKPFSLKERM >NC_004557.1|WP_011099681.1|1581861_1582353_-|CRISPR-associated-protein-Cas4 MKVNGTLVNYYFHCKRQCWLHGNRINLEDNSQDVKIGKAIHEVKKEKGKQTEISIDNIKIDKITKDYLTEVKKSDSDIEAAKWQLLLYLKVLKDKGIERKGKLEFIEKNKSKSTIIIELDENNLSELEDVIKNIENLLIQENPPEVINESKCKKCAYFEYCYI >NC_004557.1|WP_011099682.1|1582361_1584962_-|CRISPR-associated-helicase/endonuclease-Cas3 MYFNNIEKVNLENIIENNDKIYAHIHNGRKETLKEHSDLALKYLYKISERKSLDNVFLKIENNFLEKCSNEEKMVYRKMLLNTIYMHDLGKINCNFQRKKMANKIFKEEKMSSTNHSMLSSIIYINHFLKEIASIENGEHIKLLIAFLLLNSYVISKHHGAFNSVNKFKEKLVYDGEEGKDLYTKYMYIFDKVYKEEIIINESLIKEDLFDMYKSTIQEKTEENKDFPVELYIYERFLASLLLSCDYYSTSEFKNQKEVEEFGEIKNIEKFYKSFKSTEVYNWIRKYEKNDYGKTDDFSNIDDINVLRNELFLDAEKTMVSNIDKDIFYLEAPTGSGKSNVSFNLSFKMVERFKEINKIFYVYPFNTLVEQNIKTLEKIFKNNEIMKDIAIINSVVPIKIKSSKDNKIKEIDTNEESDILNEDYERALLDRQFLHYPIVLTTHVSIFNYLFGTSKDNLFPLCQIANSIIVLDEIQSYKNRIWKEIITFLACYSRLLNIKIIIMSATLPNLNKLVDGEIKTVNLIENRKKYFENPIFKNRVMVDFSLLEEKENIKEVLFNNVIKNTKAPNKNILVEFITKESAMDFYEKLKDYNKYLQESEKREIELITGDDNRVERNRIIDKIKSQKNIILVATQVIEAGVDIDMDIGYKDISMLDSEEQFLGRINRSCKNDEQGIVYFFDLDLASHVYKRDIRKQKNINLTCPKIREILINKNFQEFYDYVIKELNKKAGEYNNSSFQTFFLDKVKMLNFKEIEERMKLIDELYENNVFLNRNITLENEEELCGEDVWNEYIAILKNNKLDYAEKKIKLSQVTAKLNYFIYQISSDDFIYEDRVGDIYYIGDGEKYFEDGKFDRKKFKSIVADII >NC_004557.1|WP_011099683.1|1585018_1585789_-|type-I-B-CRISPR-associated-protein-Cas5 MDALKFSLSGRTAFFKKPDVNSFFYFTYGNVHKVALLGILGAICGYGGYNSQCLNKEQIYPEFYEKLKDINIGVVPKNEKGYIDKKIQVFNNSVGYASKELGGNLIVKEQWLENPKWAIYILMDENVPKDLKDRLLNFKFKYIPYLGKNDHMANITDVEYLENIEKLDNTNKLDSIFIKDKYEIQKESKNFNDLKNIIKKSSSKIQEFKYEEMLPISLEETTNKYNLETFIYTNSNLKPLADTKTYKCGDKNIFFF >NC_004557.1|WP_011099684.1|1585792_1586752_-|type-I-CRISPR-associated-protein-Cas7 MGMNKRVYGVLGIVSRMSNWNADFTGYPKTTSSGDVFGSDKAFKYPMKKMWENGGEKVLYIKSIKFQENKKKERELIPRTLKERYEYIFDVEDLKKNKDSEEVLKNLFTAVDVKNFGATFAEEGNNISITGAVQIGQGFNKYKETYAEEQQILSPFRDPNQKEKSKDGEEAKSSTLGTKIVSNEAHYFYPLTVNPSAYSQFEEIGVTNGYTEEDYEKFKETSMIAATSFNTNSKIGCENEFALFVETKEDLYLPDLSQYVDFEKVEDKNIIILSCSELLNSFENEIENIEIYYNSYTTEIKSDEIKKAKKFNIFTKKEV >NC_004557.1|WP_011099685.1|1586757_1588497_-|type-I-B-CRISPR-associated-protein-Cas8b/Csh1 MLKDVISIFKREYEKIGDRYVTESYIPSDGEYIIVDTFENDFKILDKVIIKKDRKTQKIDDSNQYFPFIREADYLSRLLDMNKPIDHKKIIHSNNYLSFFIKKENVNNGKLSDEIIDRYYEILKDPLIKYKNTKAEKLYEEVEEEHGKVNEKLIDEIKNWIKEKIHDFVDKGSKEKEYLKIFFKYDLDKYRKESEKYISPNLYNSNDYNVKIKEEIYGLPNDNMGLNSKKPYLENKTRKSKVPYLISKEEVLIQKKFFDYLMNQVAIGKSNIYINEKGIKGISNKETLGEDFTGYYLRIQKGKEVEIHNFDTIVNYRAKIEPFKLENVLELEKSELNYNVFIYEIGKLKDLIDNVFFYKFLSGNFFTKAEDLNINDATLKRSILLSRDTLFTWFYKGVDNNTWNNLNISSLNLIKGSINKRYLLKAGEQFNLRCSLKNYFEGGISMADVLLEVKNSLREKINKTVKENKNHEDVTLDNDREYYFAVGQLAYYLISLSKSKNKSHSLVNPIINAKTNERIKDEIRRLYTRYNYRIEFGSKRVERLYSMISSYVPKGKINGDLIIAGFLKNNLIYEKSEEE >NC_004557.1|WP_011099686.1|1588509_1589202_-|CRISPR-associated-endoribonuclease-Cas6 MKIYELTLKVFLLKDIKSDESLEKISNLIDKSLSKDGKLLDFHERNTYKNYTFNSLYPIEKDKIYNEGKIYSVQIRTVDESLIQYFKKNLTNEYTEYIKALTLECRVIPQRYIEKIYSITPVIIKTEKGYWKGNLSLGEFEERIKNNLIKKYNSFFNTKIDERFTLFRTINLINNKPISCSYKDINILGDKITLIIDENEMAQKLACFSLGSGVGEMNARGYGFVNYKWL >NC_004557.1|WP_155274218.1|1589321_1589468_-|hypothetical-protein MKKKLKFSISATYEDLKEKERIEIDDIIYIIELVSVITLILKIFQFIN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_004557_5 | 1580088-1580377 | TypeI-B |
III-B
Consensus repeat of NC_004557_5
|
4 spacers
spacers of NC_004557_5
>5.1|1580118|36|NC_004557|CRISPRCasFinder,CRT GGATGCACTTTCTTTAAATATAAATAAAAAATCTAA >5.2|1580184|36|NC_004557|CRISPRCasFinder,CRT AACATCAAATTCCTACTTCACAATAATTTCATGTTG >5.3|1580250|34|NC_004557|CRISPRCasFinder,CRT AAGAGTTGCACTTTTTTATATAATCTCTTTTAGG >5.4|1580314|34|NC_004557|CRISPRCasFinder,CRT TTGGAGATTTAAAGGAAGCTTATAAATATTTCTA >5.5|1580119|36|NC_004557|PILER-CR GGATGCACTTTCTTTAAATATAAATAAAAAATCTAA >5.6|1580185|36|NC_004557|PILER-CR AACATCAAATTCCTACTTCACAATAATTTCATGTTG >5.7|1580251|34|NC_004557|PILER-CR AAGAGTTGCACTTTTTTATATAATCTCTTTTAGG >5.8|1580315|34|NC_004557|PILER-CR TTGGAGATTTAAAGGAAGCTTATAAATATTTCTA |
cas2,cas1,cas4,cas3,cas5,cas7b,cas8b1,cas6 |
CRISPR arrays and Neighbor proteins around NC_004557_5
The CRISPR arrays of NC_004557_5 >merge|NC_004557|5|1580088-1580377|CRISPRCasFinder,CRT,PILER-CR TTTTAAATACAACTCTTGTTATTGTTCAACGGATGCACTTTCTTTAAATATAAATAAAAAATCTAAATTTAAATACAACTCTTGTTATTGTTCAACAACATCAAATTCCTACTTCACAATAATTTCATGTTGATTTAAATACAACTCTTGTTATTGTTCAACAAGAGTTGCACTTTTTTATATAATCTCTTTTAGGATTTAAATACAACTCTTGTTATTGTTCAACTTGGAGATTTAAAGGAAGCTTATAAATATTTCTAATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|5|5|1580088-1580377|CRISPRCasFinder TTTTAAATACAACTCTTGTTATTGTTCAAC GGATGCACTTTCTTTAAATATAAATAAAAAATCTAA ATTTAAATACAACTCTTGTTATTGTTCAAC AACATCAAATTCCTACTTCACAATAATTTCATGTTG ATTTAAATACAACTCTTGTTATTGTTCAAC AAGAGTTGCACTTTTTTATATAATCTCTTTTAGG ATTTAAATACAACTCTTGTTATTGTTCAAC TTGGAGATTTAAAGGAAGCTTATAAATATTTCTA ATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|5|4|1580088-1580377|CRT TTTTAAATACAACTCTTGTTATTGTTCAAC GGATGCACTTTCTTTAAATATAAATAAAAAATCTAA ATTTAAATACAACTCTTGTTATTGTTCAAC AACATCAAATTCCTACTTCACAATAATTTCATGTTG ATTTAAATACAACTCTTGTTATTGTTCAAC AAGAGTTGCACTTTTTTATATAATCTCTTTTAGG ATTTAAATACAACTCTTGTTATTGTTCAAC TTGGAGATTTAAAGGAAGCTTATAAATATTTCTA ATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|5|5|1580089-1580377|PILER-CR TTTAAATACAACTCTTGTTATTGTTCAACG GATGCACTTTCTTTAAATATAAATAAAAAATCTAAA TTTAAATACAACTCTTGTTATTGTTCAACA ACATCAAATTCCTACTTCACAATAATTTCATGTTGA TTTAAATACAACTCTTGTTATTGTTCAACA AGAGTTGCACTTTTTTATATAATCTCTTTTAGGA TTTAAATACAACTCTTGTTATTGTTCAACT TGGAGATTTAAAGGAAGCTTATAAATATTTCTAA TTTAAATACAACTCTTGTTATTGTTCAAC
>NC_004557.1|WP_011099679.1|1578401_1579985_-|AAA-family-ATPase MKKRFNVTGTCIPERHYMVDISNKLDSILKLVNNEEYFIINRPRQYGKTTTLYMLERYLNKFKDYLVISISFEGIGDLIFQDEKVFSKEFLQIMSDSLLLNSQALSECLEEQKPHVENFIDLSRVITKFIVKAKRKVVLMIDEVDKSSNNQLFLSFLGLLRNKYLLRNVGKDYTFHNVILAGVHDVKSLKLKIRPDEEHKYNSPWNIASDFDVDMSFSIDEIKTMLDDYVENKEVNLDKEYFAEKIYFYTSGYPFLVSKLCKIVDEKIMVKDELKWEKEYLQIAVKELLKESNTNFDSLIKNIENNKDLQELVRKIILDGYEITYNEDNPLITMGVTYGIFKNSHGKVKIHNRIYEQRIYNYMISLIETKINLGFYTERERYLKPNGDLDIKKVLKKFQEFMKHEYSQKREGFLEEDGRLVFLAFLSPIINGAGFAFKEVQGGEEKRFDIVITYNKKMYILELKIWRGEEYHKKGLKQLVEYLNQYGLEEGYLLIFDFRKATNLIGQVEETHINAEDNIKKIIGVYC >NC_004557.1|WP_035109706.1|1577339_1577690_-|SdpI-family-protein MNILTNCIIGFIFIVIGLVLRAYPPQHINNSLGYRTPFSIKNKDTWYEGNRFCGTILLISSIIFIPFSILIKYLYSNNLNLSMGISSLSLLIIIIIGIVYTEIHLRMMFDKNGTRK >NC_004557.1|WP_023438397.1|1576489_1577302_+|hypothetical-protein MFKSLSKKVIISVVILIVVLLGVLGMKNYNKSKDYKNLVTTANEYMNEKDYDKAMDKFKESLDYKKDQKAEEKLEECKNELINLSKEALKNKEYEKADNYLNVLLKHDGKNEEAIKMKNTIKDEIQKSKEEEEIKKAIEKEKREQELKKQMDKEEQAKKKTGITEQNKKKEVKENNKITKEKAESLVQPLKNKNEEIRYLGTRQVPEIPAKSTPYKKFPKEIENKKVYIFDIAVVYNSDSKATIGRYYVDFSGNIYKDTYPSNLECVKVK >NC_004557.1|WP_011099676.1|1574977_1576114_-|cysteine-desulfurase MKKNIYMDYAATTYIKKEVIEAMMPYLTEYYANPSSVYNMSNNLKIVIDEAKEEIADFIGATPEEVFFTSGGTEGNNWAIKGIAYANEEKGKHIITSSIEHPAVLNSCKYLKEKGFEITFLPVDSYGKVDLEKLEKSIRNDTILVSIMAANNEIGTIQHIKSIGEICKRHKVLFHTDAVQALGHIPINAEEMDIDLMTIAAHKIYGPKGIGALYIKKGTKIENILHGGSQERGKRPGTENTAAIVGFKKAVSLLKENGLEESKRIEKLRDKFIKGLLQIENTKINGAMGKERLKGNVNVSFKNIDGELLLMLLDREGIYASAGSACSAGSIDASHVLVALGLEDEFLKGTIRFTLGARNTEEEVDFVLEKLNQLIKKI >NC_004557.1|WP_035125175.1|1573867_1574935_-|tRNA-2-thiouridine(34)-synthase-MnmA MKKKVLVGMSGGVDSSVAAYLLKEQGYEVIGATMQIWQDDKEFIEREGGCCSLSAVADARRVANKIGIPFYVMNFKDAFKKNVIDYFVDEYMEGRTPNPCVACNKFIKFSSFLDKAMTLGIDYVATGHYAIIEKQNNRYIVRKSEDDKKDQTYALYNLTQFQLERTLMPCGRYKKSEIREIAKKIGLRVHNKKDSQEICFIPDNDHGKYIKNRFPSKVRQGNFVDKSGNVLGTHKGIVYYTIGQRKGLDIALGKPMYVVDINPFRNEVVLGNLDDLLNTELIAKDVNYIPFDNLKEPMEVEAKIRYSQIPSKAVITPMENDKVKVNFTEKQRAITKGQSVVFYKGDLLIGGGIIE >NC_004557.1|WP_011099673.1|1572395_1572800_+|RDD-family-protein MVLIIINFNRTVLYRIIASFIDDSALLLLYMFFTNIINKNNSSFVYVLLLLVSFISIEICFFIKSTSLGKFIMGLKVIDKTSSLELGFIKMLIRETFGKVLSNILFIGNIYILFNDSNQGFHDKLVNSIVIEND >NC_004557.1|WP_035125177.1|1571587_1572091_-|hypothetical-protein MNKTKKLPIIILLAVIVMFSGVNIYRRIDANRLKSKKTSISCIERIKDEKFNDNNVSFSFKKLNGVWQLLLLDSKKDDEITIINNSKIDEGKFYIGVLNSENEIIAFDKEKQDKITFVTPEEGCYLVRILAKNSSGKCDVKVDSKKGIDLNYNSINGHNMGLLEKNN >NC_004557.1|WP_023438394.1|1569373_1570516_-|iron-containing-alcohol-dehydrogenase MKEFSINTDVYFGEGSLDRLNEIKNKRVLIVCDKFMETSGMVTKVQQKLTDCEVTIYSDIVPDPSVEVIASGIQKLQSCNAQIIIALGGGSSIDGAKAIKEYSKKVTGKTINIEEFYAIPTTSGTGSEVTEYAVITNKQEGLKYAITDKSLLPTVAILDPQLVKSVPKAITADTGMDVITHALEAYVSKNATDFSDALAEKAFTLAFRFLPQAYADGEDIIAREKLHNASCLAGMAFNAAGLGITHSLAHAVGGKLHISHGRSNAIILPYVVEYNANLNKESFNAEYSIAAKKYQRLAKLLKLHAPNVTIGVNNLIKSIVKLQNTLMIPQTLKQQREDINLDETSKEEIINAALRDVCTTSNPRETKKEDFLKILDKVLG >NC_004557.1|WP_023438393.1|1568460_1569291_-|MerR-family-transcriptional-regulator MKEELYSIGKVGEICKITKKALRYYDKMDILSPDKVSDESGYRYYSKKTLLSVPMIKYYKQSGFKLEEMKVFLEGETYDFFHKSFRNKIDELKELEKEINLKIRSVKDWDDLIVEAQNVIENNVCDVAIKYIDNKTLTFLDQEFKYDYMDSIINIEFTNYIDSIENAITGPVIIRFPCHEDKMNGKCTKMRIMQETILKCKEELSVEFGGWMAAACYHIGPHETISDTYKKIKEWTKEHGYICFEECYERYVTDYWTTKNTDKFVTEILIKISRER >NC_004557.1|WP_011099669.1|1566901_1568017_-|membrane-protein MSENSKTVSLEAIAAKKKLSSDFFKKGISLALFSGLAYGLYTAFLTMGMTKGVWGDWYGDNTAGLSVFVIAYLLAALGNAINDTCSAIWSLLYAVVKGKFGDFLRCINTKPGRIMILAALIGGPIASTAYVIALQMAGSIVVPISALCPAIAAILGKVLYKQELNKRMAFGIVICVCASFLIGSTGFTSDGISRNTLLGLLIAIIAALGWGFEGCVAGYGTAMIDPEIGICIRQVTAGIADLCILLPVLGMMAGGINISVDLTMQAFTSAPAMIWFTLSGLLTFMTFMTWYAGNSMCGAGLGTACNGTYSFFGPLFCLLVLGVYGGMDGWALPTVAWIGAVVMIIGILIISMNPLDLFKRKKMEVDVDETA >NC_004557.1|WP_023438399.1|1580562_1580853_-|CRISPR-associated-endonuclease-Cas2 MSKNFNYNYAFVFYDVNEKRVNRVFKTCKKYLSHFQKSVFRGELTPANFILLKKDLNKVINEDEDFICIIKLMNNKVYDEEILGNPHSCTGEDLIL >NC_004557.1|WP_011099680.1|1580853_1581852_-|type-I-B-CRISPR-associated-endonuclease-Cas1 MGSTRYITSVGELKRKDNSLCFRKNNKNVYIPVENTKEIYCMSEVNINSKLLDFLSQNNIIMHFFNYYEGYSGTFYPREHYNSGKLLVKQVETYENRRLEVAKSIVEAIGDNIYELLYHYYKHDKKEVKETLDWIKNHSKINLKKANDIKQIMQVEGETWQRFYGEFKNILPEEFVMNKRVKRPPDNPINALISFGNTLLYGKTITAIYNTHLDQRISFLHEPSEGRFSLSLDISEAFKPVIVFKTIFDLVNNKRIQVSKHFDKKLNYCLLNDEGRNIFITAFEERMESIFLNEKLKRKISYKTAIKLDCYKLIKFILENKEFKPFSLKERM >NC_004557.1|WP_011099681.1|1581861_1582353_-|CRISPR-associated-protein-Cas4 MKVNGTLVNYYFHCKRQCWLHGNRINLEDNSQDVKIGKAIHEVKKEKGKQTEISIDNIKIDKITKDYLTEVKKSDSDIEAAKWQLLLYLKVLKDKGIERKGKLEFIEKNKSKSTIIIELDENNLSELEDVIKNIENLLIQENPPEVINESKCKKCAYFEYCYI >NC_004557.1|WP_011099682.1|1582361_1584962_-|CRISPR-associated-helicase/endonuclease-Cas3 MYFNNIEKVNLENIIENNDKIYAHIHNGRKETLKEHSDLALKYLYKISERKSLDNVFLKIENNFLEKCSNEEKMVYRKMLLNTIYMHDLGKINCNFQRKKMANKIFKEEKMSSTNHSMLSSIIYINHFLKEIASIENGEHIKLLIAFLLLNSYVISKHHGAFNSVNKFKEKLVYDGEEGKDLYTKYMYIFDKVYKEEIIINESLIKEDLFDMYKSTIQEKTEENKDFPVELYIYERFLASLLLSCDYYSTSEFKNQKEVEEFGEIKNIEKFYKSFKSTEVYNWIRKYEKNDYGKTDDFSNIDDINVLRNELFLDAEKTMVSNIDKDIFYLEAPTGSGKSNVSFNLSFKMVERFKEINKIFYVYPFNTLVEQNIKTLEKIFKNNEIMKDIAIINSVVPIKIKSSKDNKIKEIDTNEESDILNEDYERALLDRQFLHYPIVLTTHVSIFNYLFGTSKDNLFPLCQIANSIIVLDEIQSYKNRIWKEIITFLACYSRLLNIKIIIMSATLPNLNKLVDGEIKTVNLIENRKKYFENPIFKNRVMVDFSLLEEKENIKEVLFNNVIKNTKAPNKNILVEFITKESAMDFYEKLKDYNKYLQESEKREIELITGDDNRVERNRIIDKIKSQKNIILVATQVIEAGVDIDMDIGYKDISMLDSEEQFLGRINRSCKNDEQGIVYFFDLDLASHVYKRDIRKQKNINLTCPKIREILINKNFQEFYDYVIKELNKKAGEYNNSSFQTFFLDKVKMLNFKEIEERMKLIDELYENNVFLNRNITLENEEELCGEDVWNEYIAILKNNKLDYAEKKIKLSQVTAKLNYFIYQISSDDFIYEDRVGDIYYIGDGEKYFEDGKFDRKKFKSIVADII >NC_004557.1|WP_011099683.1|1585018_1585789_-|type-I-B-CRISPR-associated-protein-Cas5 MDALKFSLSGRTAFFKKPDVNSFFYFTYGNVHKVALLGILGAICGYGGYNSQCLNKEQIYPEFYEKLKDINIGVVPKNEKGYIDKKIQVFNNSVGYASKELGGNLIVKEQWLENPKWAIYILMDENVPKDLKDRLLNFKFKYIPYLGKNDHMANITDVEYLENIEKLDNTNKLDSIFIKDKYEIQKESKNFNDLKNIIKKSSSKIQEFKYEEMLPISLEETTNKYNLETFIYTNSNLKPLADTKTYKCGDKNIFFF >NC_004557.1|WP_011099684.1|1585792_1586752_-|type-I-CRISPR-associated-protein-Cas7 MGMNKRVYGVLGIVSRMSNWNADFTGYPKTTSSGDVFGSDKAFKYPMKKMWENGGEKVLYIKSIKFQENKKKERELIPRTLKERYEYIFDVEDLKKNKDSEEVLKNLFTAVDVKNFGATFAEEGNNISITGAVQIGQGFNKYKETYAEEQQILSPFRDPNQKEKSKDGEEAKSSTLGTKIVSNEAHYFYPLTVNPSAYSQFEEIGVTNGYTEEDYEKFKETSMIAATSFNTNSKIGCENEFALFVETKEDLYLPDLSQYVDFEKVEDKNIIILSCSELLNSFENEIENIEIYYNSYTTEIKSDEIKKAKKFNIFTKKEV >NC_004557.1|WP_011099685.1|1586757_1588497_-|type-I-B-CRISPR-associated-protein-Cas8b/Csh1 MLKDVISIFKREYEKIGDRYVTESYIPSDGEYIIVDTFENDFKILDKVIIKKDRKTQKIDDSNQYFPFIREADYLSRLLDMNKPIDHKKIIHSNNYLSFFIKKENVNNGKLSDEIIDRYYEILKDPLIKYKNTKAEKLYEEVEEEHGKVNEKLIDEIKNWIKEKIHDFVDKGSKEKEYLKIFFKYDLDKYRKESEKYISPNLYNSNDYNVKIKEEIYGLPNDNMGLNSKKPYLENKTRKSKVPYLISKEEVLIQKKFFDYLMNQVAIGKSNIYINEKGIKGISNKETLGEDFTGYYLRIQKGKEVEIHNFDTIVNYRAKIEPFKLENVLELEKSELNYNVFIYEIGKLKDLIDNVFFYKFLSGNFFTKAEDLNINDATLKRSILLSRDTLFTWFYKGVDNNTWNNLNISSLNLIKGSINKRYLLKAGEQFNLRCSLKNYFEGGISMADVLLEVKNSLREKINKTVKENKNHEDVTLDNDREYYFAVGQLAYYLISLSKSKNKSHSLVNPIINAKTNERIKDEIRRLYTRYNYRIEFGSKRVERLYSMISSYVPKGKINGDLIIAGFLKNNLIYEKSEEE >NC_004557.1|WP_011099686.1|1588509_1589202_-|CRISPR-associated-endoribonuclease-Cas6 MKIYELTLKVFLLKDIKSDESLEKISNLIDKSLSKDGKLLDFHERNTYKNYTFNSLYPIEKDKIYNEGKIYSVQIRTVDESLIQYFKKNLTNEYTEYIKALTLECRVIPQRYIEKIYSITPVIIKTEKGYWKGNLSLGEFEERIKNNLIKKYNSFFNTKIDERFTLFRTINLINNKPISCSYKDINILGDKITLIIDENEMAQKLACFSLGSGVGEMNARGYGFVNYKWL >NC_004557.1|WP_155274218.1|1589321_1589468_-|hypothetical-protein MKKKLKFSISATYEDLKEKERIEIDDIIYIIELVSVITLILKIFQFIN >NC_004557.1|WP_011099687.1|1590319_1591906_-|AAA-family-ATPase MKKRFNVTGTCIPERHYMVDISNKLDSILKLVNNEEYFIINRPRQYGKTTTLYLLEKRLNHMKEYLPIKISFEAIDTEGYSKVEKFLSSIMMQIVNYFRFSTNKEMYKFIKNCENQITNMNDFNSFITDLVEFSEKKVVLIIDEVDKSSNNQLFLDFLGMLRSKYLLRNEGKDYTFHSVILAGVHDVKTLKLKIRPDEEHKYNSPWNIASDFDVDMSFSIDEIKTMLDDYVENKKVNLDKEYFAKKIYFYTSGYPFLVSKLCKIIDEKIMVEDELKWEKEYLELAVKELLKESNTNFDSLIKNIENNKELSQIIDNILIKGTRINFNIHNPDINLGYLYGIFKNNKGNLEINNRIYEQLIYEYRISKIQTASNFLNYNLKENFIKCNGDLDITKVLIKFQEFMKHEYSQKRDAFLEEDGRLVFLAFLSPIINGAGFAFKEVQGGEEKRFDIVITYNEKMYILELKIWRGEEYHKKGLKQLGEYLNQYGLEEGYLLIFDFRKATNLIGKTEETHVNAEDNIKKIIEVYC |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_004557_6 | 1589679-1590034 | TypeI-B |
III-B
Consensus repeat of NC_004557_6
|
5 spacers
spacers of NC_004557_6
>6.1|1589709|37|NC_004557|CRISPRCasFinder GATATCTCCTATTATAACTAGTATTGCCAATAATCCT >6.2|1589776|36|NC_004557|CRISPRCasFinder ATTAACCACAAATAACCAAAGATTTTACCTTATTTG >6.3|1589842|35|NC_004557|CRISPRCasFinder ATAAGTGGAAATGAAGGTAGAGTATGGGTTAACAC >6.4|1589907|34|NC_004557|CRISPRCasFinder CTTTCTCTGTTATTTCTTCATCTTCATATTTTAA >6.5|1589971|34|NC_004557|CRISPRCasFinder CTTTCTCTGTTATTTCTTCATCTTCATATTTTAA |
cas6,cas8b1,cas7b,cas5,cas3,cas4,cas1,cas2 |
CRISPR arrays and Neighbor proteins around NC_004557_6
The CRISPR arrays of NC_004557_6 >merge|NC_004557|6|1589679-1590034|CRISPRCasFinder TTTTAAATACACATAATGTTAAGGTTTAACGATATCTCCTATTATAACTAGTATTGCCAATAATCCTATTTAAATACATCCTATGTTAAGGTTCAACATTAACCACAAATAACCAAAGATTTTACCTTATTTGATTTAAATACATCCTATGTTAAGGTTCAACATAAGTGGAAATGAAGGTAGAGTATGGGTTAACACATTTAAATACATCCTATGTTAAGGTTCAACCTTTCTCTGTTATTTCTTCATCTTCATATTTTAAATTTAAATACATCCTATGTTAAGGTTCAACCTTTCTCTGTTATTTCTTCATCTTCATATTTTAAATTTAAATACATCCTATGTTAAGGTTCAAC >NC_004557|6|6|1589679-1590034|CRISPRCasFinder TTTTAAATACACATAATGTTAAGGTTTAAC GATATCTCCTATTATAACTAGTATTGCCAATAATCCT ATTTAAATACATCCTATGTTAAGGTTCAAC ATTAACCACAAATAACCAAAGATTTTACCTTATTTG ATTTAAATACATCCTATGTTAAGGTTCAAC ATAAGTGGAAATGAAGGTAGAGTATGGGTTAACAC ATTTAAATACATCCTATGTTAAGGTTCAAC CTTTCTCTGTTATTTCTTCATCTTCATATTTTAA ATTTAAATACATCCTATGTTAAGGTTCAAC CTTTCTCTGTTATTTCTTCATCTTCATATTTTAA ATTTAAATACATCCTATGTTAAGGTTCAAC
>NC_004557.1|WP_155274218.1|1589321_1589468_-|hypothetical-protein MKKKLKFSISATYEDLKEKERIEIDDIIYIIELVSVITLILKIFQFIN >NC_004557.1|WP_011099686.1|1588509_1589202_-|CRISPR-associated-endoribonuclease-Cas6 MKIYELTLKVFLLKDIKSDESLEKISNLIDKSLSKDGKLLDFHERNTYKNYTFNSLYPIEKDKIYNEGKIYSVQIRTVDESLIQYFKKNLTNEYTEYIKALTLECRVIPQRYIEKIYSITPVIIKTEKGYWKGNLSLGEFEERIKNNLIKKYNSFFNTKIDERFTLFRTINLINNKPISCSYKDINILGDKITLIIDENEMAQKLACFSLGSGVGEMNARGYGFVNYKWL >NC_004557.1|WP_011099685.1|1586757_1588497_-|type-I-B-CRISPR-associated-protein-Cas8b/Csh1 MLKDVISIFKREYEKIGDRYVTESYIPSDGEYIIVDTFENDFKILDKVIIKKDRKTQKIDDSNQYFPFIREADYLSRLLDMNKPIDHKKIIHSNNYLSFFIKKENVNNGKLSDEIIDRYYEILKDPLIKYKNTKAEKLYEEVEEEHGKVNEKLIDEIKNWIKEKIHDFVDKGSKEKEYLKIFFKYDLDKYRKESEKYISPNLYNSNDYNVKIKEEIYGLPNDNMGLNSKKPYLENKTRKSKVPYLISKEEVLIQKKFFDYLMNQVAIGKSNIYINEKGIKGISNKETLGEDFTGYYLRIQKGKEVEIHNFDTIVNYRAKIEPFKLENVLELEKSELNYNVFIYEIGKLKDLIDNVFFYKFLSGNFFTKAEDLNINDATLKRSILLSRDTLFTWFYKGVDNNTWNNLNISSLNLIKGSINKRYLLKAGEQFNLRCSLKNYFEGGISMADVLLEVKNSLREKINKTVKENKNHEDVTLDNDREYYFAVGQLAYYLISLSKSKNKSHSLVNPIINAKTNERIKDEIRRLYTRYNYRIEFGSKRVERLYSMISSYVPKGKINGDLIIAGFLKNNLIYEKSEEE >NC_004557.1|WP_011099684.1|1585792_1586752_-|type-I-CRISPR-associated-protein-Cas7 MGMNKRVYGVLGIVSRMSNWNADFTGYPKTTSSGDVFGSDKAFKYPMKKMWENGGEKVLYIKSIKFQENKKKERELIPRTLKERYEYIFDVEDLKKNKDSEEVLKNLFTAVDVKNFGATFAEEGNNISITGAVQIGQGFNKYKETYAEEQQILSPFRDPNQKEKSKDGEEAKSSTLGTKIVSNEAHYFYPLTVNPSAYSQFEEIGVTNGYTEEDYEKFKETSMIAATSFNTNSKIGCENEFALFVETKEDLYLPDLSQYVDFEKVEDKNIIILSCSELLNSFENEIENIEIYYNSYTTEIKSDEIKKAKKFNIFTKKEV >NC_004557.1|WP_011099683.1|1585018_1585789_-|type-I-B-CRISPR-associated-protein-Cas5 MDALKFSLSGRTAFFKKPDVNSFFYFTYGNVHKVALLGILGAICGYGGYNSQCLNKEQIYPEFYEKLKDINIGVVPKNEKGYIDKKIQVFNNSVGYASKELGGNLIVKEQWLENPKWAIYILMDENVPKDLKDRLLNFKFKYIPYLGKNDHMANITDVEYLENIEKLDNTNKLDSIFIKDKYEIQKESKNFNDLKNIIKKSSSKIQEFKYEEMLPISLEETTNKYNLETFIYTNSNLKPLADTKTYKCGDKNIFFF >NC_004557.1|WP_011099682.1|1582361_1584962_-|CRISPR-associated-helicase/endonuclease-Cas3 MYFNNIEKVNLENIIENNDKIYAHIHNGRKETLKEHSDLALKYLYKISERKSLDNVFLKIENNFLEKCSNEEKMVYRKMLLNTIYMHDLGKINCNFQRKKMANKIFKEEKMSSTNHSMLSSIIYINHFLKEIASIENGEHIKLLIAFLLLNSYVISKHHGAFNSVNKFKEKLVYDGEEGKDLYTKYMYIFDKVYKEEIIINESLIKEDLFDMYKSTIQEKTEENKDFPVELYIYERFLASLLLSCDYYSTSEFKNQKEVEEFGEIKNIEKFYKSFKSTEVYNWIRKYEKNDYGKTDDFSNIDDINVLRNELFLDAEKTMVSNIDKDIFYLEAPTGSGKSNVSFNLSFKMVERFKEINKIFYVYPFNTLVEQNIKTLEKIFKNNEIMKDIAIINSVVPIKIKSSKDNKIKEIDTNEESDILNEDYERALLDRQFLHYPIVLTTHVSIFNYLFGTSKDNLFPLCQIANSIIVLDEIQSYKNRIWKEIITFLACYSRLLNIKIIIMSATLPNLNKLVDGEIKTVNLIENRKKYFENPIFKNRVMVDFSLLEEKENIKEVLFNNVIKNTKAPNKNILVEFITKESAMDFYEKLKDYNKYLQESEKREIELITGDDNRVERNRIIDKIKSQKNIILVATQVIEAGVDIDMDIGYKDISMLDSEEQFLGRINRSCKNDEQGIVYFFDLDLASHVYKRDIRKQKNINLTCPKIREILINKNFQEFYDYVIKELNKKAGEYNNSSFQTFFLDKVKMLNFKEIEERMKLIDELYENNVFLNRNITLENEEELCGEDVWNEYIAILKNNKLDYAEKKIKLSQVTAKLNYFIYQISSDDFIYEDRVGDIYYIGDGEKYFEDGKFDRKKFKSIVADII >NC_004557.1|WP_011099681.1|1581861_1582353_-|CRISPR-associated-protein-Cas4 MKVNGTLVNYYFHCKRQCWLHGNRINLEDNSQDVKIGKAIHEVKKEKGKQTEISIDNIKIDKITKDYLTEVKKSDSDIEAAKWQLLLYLKVLKDKGIERKGKLEFIEKNKSKSTIIIELDENNLSELEDVIKNIENLLIQENPPEVINESKCKKCAYFEYCYI >NC_004557.1|WP_011099680.1|1580853_1581852_-|type-I-B-CRISPR-associated-endonuclease-Cas1 MGSTRYITSVGELKRKDNSLCFRKNNKNVYIPVENTKEIYCMSEVNINSKLLDFLSQNNIIMHFFNYYEGYSGTFYPREHYNSGKLLVKQVETYENRRLEVAKSIVEAIGDNIYELLYHYYKHDKKEVKETLDWIKNHSKINLKKANDIKQIMQVEGETWQRFYGEFKNILPEEFVMNKRVKRPPDNPINALISFGNTLLYGKTITAIYNTHLDQRISFLHEPSEGRFSLSLDISEAFKPVIVFKTIFDLVNNKRIQVSKHFDKKLNYCLLNDEGRNIFITAFEERMESIFLNEKLKRKISYKTAIKLDCYKLIKFILENKEFKPFSLKERM >NC_004557.1|WP_023438399.1|1580562_1580853_-|CRISPR-associated-endonuclease-Cas2 MSKNFNYNYAFVFYDVNEKRVNRVFKTCKKYLSHFQKSVFRGELTPANFILLKKDLNKVINEDEDFICIIKLMNNKVYDEEILGNPHSCTGEDLIL >NC_004557.1|WP_011099679.1|1578401_1579985_-|AAA-family-ATPase MKKRFNVTGTCIPERHYMVDISNKLDSILKLVNNEEYFIINRPRQYGKTTTLYMLERYLNKFKDYLVISISFEGIGDLIFQDEKVFSKEFLQIMSDSLLLNSQALSECLEEQKPHVENFIDLSRVITKFIVKAKRKVVLMIDEVDKSSNNQLFLSFLGLLRNKYLLRNVGKDYTFHNVILAGVHDVKSLKLKIRPDEEHKYNSPWNIASDFDVDMSFSIDEIKTMLDDYVENKEVNLDKEYFAEKIYFYTSGYPFLVSKLCKIVDEKIMVKDELKWEKEYLQIAVKELLKESNTNFDSLIKNIENNKDLQELVRKIILDGYEITYNEDNPLITMGVTYGIFKNSHGKVKIHNRIYEQRIYNYMISLIETKINLGFYTERERYLKPNGDLDIKKVLKKFQEFMKHEYSQKREGFLEEDGRLVFLAFLSPIINGAGFAFKEVQGGEEKRFDIVITYNKKMYILELKIWRGEEYHKKGLKQLVEYLNQYGLEEGYLLIFDFRKATNLIGQVEETHINAEDNIKKIIGVYC >NC_004557.1|WP_011099687.1|1590319_1591906_-|AAA-family-ATPase MKKRFNVTGTCIPERHYMVDISNKLDSILKLVNNEEYFIINRPRQYGKTTTLYLLEKRLNHMKEYLPIKISFEAIDTEGYSKVEKFLSSIMMQIVNYFRFSTNKEMYKFIKNCENQITNMNDFNSFITDLVEFSEKKVVLIIDEVDKSSNNQLFLDFLGMLRSKYLLRNEGKDYTFHSVILAGVHDVKTLKLKIRPDEEHKYNSPWNIASDFDVDMSFSIDEIKTMLDDYVENKKVNLDKEYFAKKIYFYTSGYPFLVSKLCKIIDEKIMVEDELKWEKEYLELAVKELLKESNTNFDSLIKNIENNKELSQIIDNILIKGTRINFNIHNPDINLGYLYGIFKNNKGNLEINNRIYEQLIYEYRISKIQTASNFLNYNLKENFIKCNGDLDITKVLIKFQEFMKHEYSQKRDAFLEEDGRLVFLAFLSPIINGAGFAFKEVQGGEEKRFDIVITYNEKMYILELKIWRGEEYHKKGLKQLGEYLNQYGLEEGYLLIFDFRKATNLIGKTEETHVNAEDNIKKIIEVYC >NC_004557.1|WP_023438413.1|1593443_1594592_-|polysaccharide-deacetylase MRRKNKFGVILFLMIIITALSIILSDSRGSRQFIKSQKSENMASKNEKNKKKIKSEKYNGEMEFYNGEIEHLFFHPLILDKKAAFTGPKWHTDNMDNWLVTVEESKKVINSLYSKGYILIDPNSLYEEYESDGKKLFRKKPLKVPKGKKPLILSIDDLSYNEGMRKATALKLILDENGNLATYRKDQNGKEIIGYDEIVMILDRFVKEHPDFSLNGAKGVIALTGYEGVFGYRTNLDSKNREEEAKKAKVIANKLKENGWKFASHSYGHLDNAKIPFQTLKRDADKWEQLVKPIIGHTSIYVYPHGTAIKTNSEKFKYLQSKGFKIFYSVDSYLGERISENDLVVEGGRMPIDGLSMRNRREAFLKFFDAKEVLDLESRPKR >NC_004557.1|WP_041744712.1|1595394_1595637_-|hypothetical-protein MQLNPTQVVCSSINALEGNVVDIEKELNAKANQEYFHKLVKLDKHADSIIKFISELNCTESLSNDATYMLMNDLVERIKL >NC_004557.1|WP_035125170.1|1595605_1595929_-|hypothetical-protein MFNTADSEISEIITAAIHDSSGIGSEDTSSLKFMILEGDWNNKEIPQCFEGIKSAGESGKIQVLSRGKNLFTKDSIYYLNVVTGKTIRNPSKFVSSSIAVKPNTSCM >NC_004557.1|WP_011099690.1|1596261_1596732_-|DUF4829-domain-containing-protein MKKSYMMIIMISILFGVKLIYSNSAESIIKEYYKVIDSQQDVGKYNKLVIEDERLKNLEGIPDIVEKRDILELKKLNVNEHPLLEKELNYKYADEKDNVRYYMIKYDIKFKENVATPVDSGIYYEVITVVKRKNKWLVTTDIRKASFHNDKLTIDS >NC_004557.1|WP_011099691.1|1596866_1597955_-|amidase-domain-containing-protein MKKKLMKLKSKLDYGSLKNAYIIKGDEITNEDINALIDNYFNWIYENLINNTIGELQNIVGNNKLAEFKKSKLKWLINWYGKKDEEIKDYKIYTEINDVDINGNIIYINVIYGEDLILKSSSDIVQKIRNQEHKILAKNVGSKLVIIHDYYNDELADEMFLVSDREFKTNKKVKSINKKLEKKTLEINKNIKKIDKLVKQYKRNLHNTLQINNIQERKYPGYDGIAAAKYAVKYAINYNPEYQDYNGRGGDCTNFISQCIYAGGIPTDNVWYKDSHAWIRVVELRSWLLKKGYARELTVQDNAKKGDLIQLRNSGGYWYHSLIVTYKNSSNGELFVSCHTGDYVNRALSTYTTDRRYLILTS >NC_004557.1|WP_011099692.1|1598119_1598677_+|hypothetical-protein MIKTKKKVFANVSFTVLLLVLINTSVFAAYPSKATMDNTWGFEKGDSTEKQLIYHQHGDLDWKGHVDFAMNEININPADISCYYGTSEDLANIVVTSNYWPDATWSGSTYAPIGLEPKTIELNSSAELTDWQRDAVTTHEFVHIWGINDSRNKNSISYGFTPVNYRTITDDVTTLLKNRYNEEVK >NC_004557.1|WP_011099693.1|1598677_1599271_+|hypothetical-protein MIKKISVILALTGVINTNSALSTPPSKVNEPIKPKASASYMEIDGLKELKAKSDIIVEVEGTDKFELIDYKGIKMRKTTVKILDVMKGNPTLKEITVVQTEGLESEEPPMKNEKLLMFLRKGIDITDSYVPIGGNQGIYKIITKKTKKNSMTPKKLPHLNAPKDDAIKIVTPTSLINNKILRDLNGNYDDIKKKLIE >NC_004557.1|WP_035125168.1|1599281_1600583_-|S41-family-peptidase MKRFKKITILAVVLIILILSKSFIGKAYYKKNAPEHIKNFSKKEALEDYDYMWNVLERNYPCFNVIERKHGVTIKDIKNGYRKRIENRENVDFKYFNMILNKSINKFSNVGHLYVMDFNFYIMLRGTFDAIGKNEIGGIVKNNFEMAINKKTEETYKHIYNISYGKKILKNLNFTNISNKLYDNKNLSFKEIDKDTAYIKINNFYHYNIANDKDKLINFYRKNSDKKNLVIDLTENRGGADSYWMTSIVAPNIDKELKLYNQYALYKNGDIVNDQWVKKHGNNEYREITKDFSEVLKLSKIRKEDLKDLKYLEISKSIPYNVKPSSKEKLFKGKIYVLVSEQVQSSGEDFVEYCKNTKFATLIGTTTGGNSPAMSPVYDVLPNSGLMLSYQIDYKLNPDGTCNTEFGLPPDIVSKENEEPLDTFKRVILEKKL >NC_004557.1|WP_023438420.1|1600700_1600883_+|hypothetical-protein MILLLISMLSSKIFDIIFFNLLGETTGTLGSIIGFVLPYSIALEIILKKLFFEPSSKDSK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_004557_7 | 1592009-1592427 | TypeI-B |
III-B
Consensus repeat of NC_004557_7
|
6 spacers
spacers of NC_004557_7
>7.1|1592039|35|NC_004557|CRISPRCasFinder,CRT GTGCTGCACTTCTAGAACTTAAATTACATTCCTTT >7.2|1592104|34|NC_004557|CRISPRCasFinder,CRT ATATAGAGAAATCACTTAAAATAATAGAATTTGC >7.3|1592168|34|NC_004557|CRISPRCasFinder,CRT AAAATAAAAGGAAGTGGTAATATAGTGAAAACAT >7.4|1592232|34|NC_004557|CRISPRCasFinder,CRT ATCGACTAAAGATTATATTTGGGGGTAAGATAAA >7.5|1592296|36|NC_004557|CRISPRCasFinder,CRT GAATGCTACACAATTTGCTAACAATGCTATGAATGA >7.6|1592362|36|NC_004557|CRISPRCasFinder,CRT GCATCAAACATAGTTACAGCAATTGTAGTTACAAAG >7.7|1592040|35|NC_004557|PILER-CR GTGCTGCACTTCTAGAACTTAAATTACATTCCTTT >7.8|1592105|34|NC_004557|PILER-CR ATATAGAGAAATCACTTAAAATAATAGAATTTGC >7.9|1592169|34|NC_004557|PILER-CR AAAATAAAAGGAAGTGGTAATATAGTGAAAACAT >7.10|1592233|34|NC_004557|PILER-CR ATCGACTAAAGATTATATTTGGGGGTAAGATAAA >7.11|1592297|36|NC_004557|PILER-CR GAATGCTACACAATTTGCTAACAATGCTATGAATGA >7.12|1592363|36|NC_004557|PILER-CR GCATCAAACATAGTTACAGCAATTGTAGTTACAAAG |
cas6,cas8b1,cas7b,cas5,cas3,cas4,cas1,cas2 |
CRISPR arrays and Neighbor proteins around NC_004557_7
The CRISPR arrays of NC_004557_7 >merge|NC_004557|7|1592009-1592427|CRISPRCasFinder,CRT,PILER-CR TTTTAAATACAACTCTTGTTATTGTTCAACGTGCTGCACTTCTAGAACTTAAATTACATTCCTTTATTTAAATACAACTCTTGTTATTGTTCAACATATAGAGAAATCACTTAAAATAATAGAATTTGCATTTAAATACAACTCTTGTTATTGTTCAACAAAATAAAAGGAAGTGGTAATATAGTGAAAACATATTTAAATACAACTCTTGTTATTGTTCAACATCGACTAAAGATTATATTTGGGGGTAAGATAAAATTTAAATACAACTCTTGTTATTGTTCAACGAATGCTACACAATTTGCTAACAATGCTATGAATGAATTTAAATACAACTCTTGTTATTGTTCAACGCATCAAACATAGTTACAGCAATTGTAGTTACAAAGATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|7|7|1592009-1592427|CRISPRCasFinder TTTTAAATACAACTCTTGTTATTGTTCAAC GTGCTGCACTTCTAGAACTTAAATTACATTCCTTT ATTTAAATACAACTCTTGTTATTGTTCAAC ATATAGAGAAATCACTTAAAATAATAGAATTTGC ATTTAAATACAACTCTTGTTATTGTTCAAC AAAATAAAAGGAAGTGGTAATATAGTGAAAACAT ATTTAAATACAACTCTTGTTATTGTTCAAC ATCGACTAAAGATTATATTTGGGGGTAAGATAAA ATTTAAATACAACTCTTGTTATTGTTCAAC GAATGCTACACAATTTGCTAACAATGCTATGAATGA ATTTAAATACAACTCTTGTTATTGTTCAAC GCATCAAACATAGTTACAGCAATTGTAGTTACAAAG ATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|7|5|1592009-1592427|CRT TTTTAAATACAACTCTTGTTATTGTTCAAC GTGCTGCACTTCTAGAACTTAAATTACATTCCTTT ATTTAAATACAACTCTTGTTATTGTTCAAC ATATAGAGAAATCACTTAAAATAATAGAATTTGC ATTTAAATACAACTCTTGTTATTGTTCAAC AAAATAAAAGGAAGTGGTAATATAGTGAAAACAT ATTTAAATACAACTCTTGTTATTGTTCAAC ATCGACTAAAGATTATATTTGGGGGTAAGATAAA ATTTAAATACAACTCTTGTTATTGTTCAAC GAATGCTACACAATTTGCTAACAATGCTATGAATGA ATTTAAATACAACTCTTGTTATTGTTCAAC GCATCAAACATAGTTACAGCAATTGTAGTTACAAAG ATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|7|6|1592010-1592427|PILER-CR TTTAAATACAACTCTTGTTATTGTTCAACG TGCTGCACTTCTAGAACTTAAATTACATTCCTTTA TTTAAATACAACTCTTGTTATTGTTCAACA TATAGAGAAATCACTTAAAATAATAGAATTTGCA TTTAAATACAACTCTTGTTATTGTTCAACA AAATAAAAGGAAGTGGTAATATAGTGAAAACATA TTTAAATACAACTCTTGTTATTGTTCAACA TCGACTAAAGATTATATTTGGGGGTAAGATAAAA TTTAAATACAACTCTTGTTATTGTTCAACG AATGCTACACAATTTGCTAACAATGCTATGAATGAA TTTAAATACAACTCTTGTTATTGTTCAACG CATCAAACATAGTTACAGCAATTGTAGTTACAAAGA TTTAAATACAACTCTTGTTATTGTTCAAC
>NC_004557.1|WP_011099687.1|1590319_1591906_-|AAA-family-ATPase MKKRFNVTGTCIPERHYMVDISNKLDSILKLVNNEEYFIINRPRQYGKTTTLYLLEKRLNHMKEYLPIKISFEAIDTEGYSKVEKFLSSIMMQIVNYFRFSTNKEMYKFIKNCENQITNMNDFNSFITDLVEFSEKKVVLIIDEVDKSSNNQLFLDFLGMLRSKYLLRNEGKDYTFHSVILAGVHDVKTLKLKIRPDEEHKYNSPWNIASDFDVDMSFSIDEIKTMLDDYVENKKVNLDKEYFAKKIYFYTSGYPFLVSKLCKIIDEKIMVEDELKWEKEYLELAVKELLKESNTNFDSLIKNIENNKELSQIIDNILIKGTRINFNIHNPDINLGYLYGIFKNNKGNLEINNRIYEQLIYEYRISKIQTASNFLNYNLKENFIKCNGDLDITKVLIKFQEFMKHEYSQKRDAFLEEDGRLVFLAFLSPIINGAGFAFKEVQGGEEKRFDIVITYNEKMYILELKIWRGEEYHKKGLKQLGEYLNQYGLEEGYLLIFDFRKATNLIGKTEETHVNAEDNIKKIIEVYC >NC_004557.1|WP_155274218.1|1589321_1589468_-|hypothetical-protein MKKKLKFSISATYEDLKEKERIEIDDIIYIIELVSVITLILKIFQFIN >NC_004557.1|WP_011099686.1|1588509_1589202_-|CRISPR-associated-endoribonuclease-Cas6 MKIYELTLKVFLLKDIKSDESLEKISNLIDKSLSKDGKLLDFHERNTYKNYTFNSLYPIEKDKIYNEGKIYSVQIRTVDESLIQYFKKNLTNEYTEYIKALTLECRVIPQRYIEKIYSITPVIIKTEKGYWKGNLSLGEFEERIKNNLIKKYNSFFNTKIDERFTLFRTINLINNKPISCSYKDINILGDKITLIIDENEMAQKLACFSLGSGVGEMNARGYGFVNYKWL >NC_004557.1|WP_011099685.1|1586757_1588497_-|type-I-B-CRISPR-associated-protein-Cas8b/Csh1 MLKDVISIFKREYEKIGDRYVTESYIPSDGEYIIVDTFENDFKILDKVIIKKDRKTQKIDDSNQYFPFIREADYLSRLLDMNKPIDHKKIIHSNNYLSFFIKKENVNNGKLSDEIIDRYYEILKDPLIKYKNTKAEKLYEEVEEEHGKVNEKLIDEIKNWIKEKIHDFVDKGSKEKEYLKIFFKYDLDKYRKESEKYISPNLYNSNDYNVKIKEEIYGLPNDNMGLNSKKPYLENKTRKSKVPYLISKEEVLIQKKFFDYLMNQVAIGKSNIYINEKGIKGISNKETLGEDFTGYYLRIQKGKEVEIHNFDTIVNYRAKIEPFKLENVLELEKSELNYNVFIYEIGKLKDLIDNVFFYKFLSGNFFTKAEDLNINDATLKRSILLSRDTLFTWFYKGVDNNTWNNLNISSLNLIKGSINKRYLLKAGEQFNLRCSLKNYFEGGISMADVLLEVKNSLREKINKTVKENKNHEDVTLDNDREYYFAVGQLAYYLISLSKSKNKSHSLVNPIINAKTNERIKDEIRRLYTRYNYRIEFGSKRVERLYSMISSYVPKGKINGDLIIAGFLKNNLIYEKSEEE >NC_004557.1|WP_011099684.1|1585792_1586752_-|type-I-CRISPR-associated-protein-Cas7 MGMNKRVYGVLGIVSRMSNWNADFTGYPKTTSSGDVFGSDKAFKYPMKKMWENGGEKVLYIKSIKFQENKKKERELIPRTLKERYEYIFDVEDLKKNKDSEEVLKNLFTAVDVKNFGATFAEEGNNISITGAVQIGQGFNKYKETYAEEQQILSPFRDPNQKEKSKDGEEAKSSTLGTKIVSNEAHYFYPLTVNPSAYSQFEEIGVTNGYTEEDYEKFKETSMIAATSFNTNSKIGCENEFALFVETKEDLYLPDLSQYVDFEKVEDKNIIILSCSELLNSFENEIENIEIYYNSYTTEIKSDEIKKAKKFNIFTKKEV >NC_004557.1|WP_011099683.1|1585018_1585789_-|type-I-B-CRISPR-associated-protein-Cas5 MDALKFSLSGRTAFFKKPDVNSFFYFTYGNVHKVALLGILGAICGYGGYNSQCLNKEQIYPEFYEKLKDINIGVVPKNEKGYIDKKIQVFNNSVGYASKELGGNLIVKEQWLENPKWAIYILMDENVPKDLKDRLLNFKFKYIPYLGKNDHMANITDVEYLENIEKLDNTNKLDSIFIKDKYEIQKESKNFNDLKNIIKKSSSKIQEFKYEEMLPISLEETTNKYNLETFIYTNSNLKPLADTKTYKCGDKNIFFF >NC_004557.1|WP_011099682.1|1582361_1584962_-|CRISPR-associated-helicase/endonuclease-Cas3 MYFNNIEKVNLENIIENNDKIYAHIHNGRKETLKEHSDLALKYLYKISERKSLDNVFLKIENNFLEKCSNEEKMVYRKMLLNTIYMHDLGKINCNFQRKKMANKIFKEEKMSSTNHSMLSSIIYINHFLKEIASIENGEHIKLLIAFLLLNSYVISKHHGAFNSVNKFKEKLVYDGEEGKDLYTKYMYIFDKVYKEEIIINESLIKEDLFDMYKSTIQEKTEENKDFPVELYIYERFLASLLLSCDYYSTSEFKNQKEVEEFGEIKNIEKFYKSFKSTEVYNWIRKYEKNDYGKTDDFSNIDDINVLRNELFLDAEKTMVSNIDKDIFYLEAPTGSGKSNVSFNLSFKMVERFKEINKIFYVYPFNTLVEQNIKTLEKIFKNNEIMKDIAIINSVVPIKIKSSKDNKIKEIDTNEESDILNEDYERALLDRQFLHYPIVLTTHVSIFNYLFGTSKDNLFPLCQIANSIIVLDEIQSYKNRIWKEIITFLACYSRLLNIKIIIMSATLPNLNKLVDGEIKTVNLIENRKKYFENPIFKNRVMVDFSLLEEKENIKEVLFNNVIKNTKAPNKNILVEFITKESAMDFYEKLKDYNKYLQESEKREIELITGDDNRVERNRIIDKIKSQKNIILVATQVIEAGVDIDMDIGYKDISMLDSEEQFLGRINRSCKNDEQGIVYFFDLDLASHVYKRDIRKQKNINLTCPKIREILINKNFQEFYDYVIKELNKKAGEYNNSSFQTFFLDKVKMLNFKEIEERMKLIDELYENNVFLNRNITLENEEELCGEDVWNEYIAILKNNKLDYAEKKIKLSQVTAKLNYFIYQISSDDFIYEDRVGDIYYIGDGEKYFEDGKFDRKKFKSIVADII >NC_004557.1|WP_011099681.1|1581861_1582353_-|CRISPR-associated-protein-Cas4 MKVNGTLVNYYFHCKRQCWLHGNRINLEDNSQDVKIGKAIHEVKKEKGKQTEISIDNIKIDKITKDYLTEVKKSDSDIEAAKWQLLLYLKVLKDKGIERKGKLEFIEKNKSKSTIIIELDENNLSELEDVIKNIENLLIQENPPEVINESKCKKCAYFEYCYI >NC_004557.1|WP_011099680.1|1580853_1581852_-|type-I-B-CRISPR-associated-endonuclease-Cas1 MGSTRYITSVGELKRKDNSLCFRKNNKNVYIPVENTKEIYCMSEVNINSKLLDFLSQNNIIMHFFNYYEGYSGTFYPREHYNSGKLLVKQVETYENRRLEVAKSIVEAIGDNIYELLYHYYKHDKKEVKETLDWIKNHSKINLKKANDIKQIMQVEGETWQRFYGEFKNILPEEFVMNKRVKRPPDNPINALISFGNTLLYGKTITAIYNTHLDQRISFLHEPSEGRFSLSLDISEAFKPVIVFKTIFDLVNNKRIQVSKHFDKKLNYCLLNDEGRNIFITAFEERMESIFLNEKLKRKISYKTAIKLDCYKLIKFILENKEFKPFSLKERM >NC_004557.1|WP_023438399.1|1580562_1580853_-|CRISPR-associated-endonuclease-Cas2 MSKNFNYNYAFVFYDVNEKRVNRVFKTCKKYLSHFQKSVFRGELTPANFILLKKDLNKVINEDEDFICIIKLMNNKVYDEEILGNPHSCTGEDLIL >NC_004557.1|WP_023438413.1|1593443_1594592_-|polysaccharide-deacetylase MRRKNKFGVILFLMIIITALSIILSDSRGSRQFIKSQKSENMASKNEKNKKKIKSEKYNGEMEFYNGEIEHLFFHPLILDKKAAFTGPKWHTDNMDNWLVTVEESKKVINSLYSKGYILIDPNSLYEEYESDGKKLFRKKPLKVPKGKKPLILSIDDLSYNEGMRKATALKLILDENGNLATYRKDQNGKEIIGYDEIVMILDRFVKEHPDFSLNGAKGVIALTGYEGVFGYRTNLDSKNREEEAKKAKVIANKLKENGWKFASHSYGHLDNAKIPFQTLKRDADKWEQLVKPIIGHTSIYVYPHGTAIKTNSEKFKYLQSKGFKIFYSVDSYLGERISENDLVVEGGRMPIDGLSMRNRREAFLKFFDAKEVLDLESRPKR >NC_004557.1|WP_041744712.1|1595394_1595637_-|hypothetical-protein MQLNPTQVVCSSINALEGNVVDIEKELNAKANQEYFHKLVKLDKHADSIIKFISELNCTESLSNDATYMLMNDLVERIKL >NC_004557.1|WP_035125170.1|1595605_1595929_-|hypothetical-protein MFNTADSEISEIITAAIHDSSGIGSEDTSSLKFMILEGDWNNKEIPQCFEGIKSAGESGKIQVLSRGKNLFTKDSIYYLNVVTGKTIRNPSKFVSSSIAVKPNTSCM >NC_004557.1|WP_011099690.1|1596261_1596732_-|DUF4829-domain-containing-protein MKKSYMMIIMISILFGVKLIYSNSAESIIKEYYKVIDSQQDVGKYNKLVIEDERLKNLEGIPDIVEKRDILELKKLNVNEHPLLEKELNYKYADEKDNVRYYMIKYDIKFKENVATPVDSGIYYEVITVVKRKNKWLVTTDIRKASFHNDKLTIDS >NC_004557.1|WP_011099691.1|1596866_1597955_-|amidase-domain-containing-protein MKKKLMKLKSKLDYGSLKNAYIIKGDEITNEDINALIDNYFNWIYENLINNTIGELQNIVGNNKLAEFKKSKLKWLINWYGKKDEEIKDYKIYTEINDVDINGNIIYINVIYGEDLILKSSSDIVQKIRNQEHKILAKNVGSKLVIIHDYYNDELADEMFLVSDREFKTNKKVKSINKKLEKKTLEINKNIKKIDKLVKQYKRNLHNTLQINNIQERKYPGYDGIAAAKYAVKYAINYNPEYQDYNGRGGDCTNFISQCIYAGGIPTDNVWYKDSHAWIRVVELRSWLLKKGYARELTVQDNAKKGDLIQLRNSGGYWYHSLIVTYKNSSNGELFVSCHTGDYVNRALSTYTTDRRYLILTS >NC_004557.1|WP_011099692.1|1598119_1598677_+|hypothetical-protein MIKTKKKVFANVSFTVLLLVLINTSVFAAYPSKATMDNTWGFEKGDSTEKQLIYHQHGDLDWKGHVDFAMNEININPADISCYYGTSEDLANIVVTSNYWPDATWSGSTYAPIGLEPKTIELNSSAELTDWQRDAVTTHEFVHIWGINDSRNKNSISYGFTPVNYRTITDDVTTLLKNRYNEEVK >NC_004557.1|WP_011099693.1|1598677_1599271_+|hypothetical-protein MIKKISVILALTGVINTNSALSTPPSKVNEPIKPKASASYMEIDGLKELKAKSDIIVEVEGTDKFELIDYKGIKMRKTTVKILDVMKGNPTLKEITVVQTEGLESEEPPMKNEKLLMFLRKGIDITDSYVPIGGNQGIYKIITKKTKKNSMTPKKLPHLNAPKDDAIKIVTPTSLINNKILRDLNGNYDDIKKKLIE >NC_004557.1|WP_035125168.1|1599281_1600583_-|S41-family-peptidase MKRFKKITILAVVLIILILSKSFIGKAYYKKNAPEHIKNFSKKEALEDYDYMWNVLERNYPCFNVIERKHGVTIKDIKNGYRKRIENRENVDFKYFNMILNKSINKFSNVGHLYVMDFNFYIMLRGTFDAIGKNEIGGIVKNNFEMAINKKTEETYKHIYNISYGKKILKNLNFTNISNKLYDNKNLSFKEIDKDTAYIKINNFYHYNIANDKDKLINFYRKNSDKKNLVIDLTENRGGADSYWMTSIVAPNIDKELKLYNQYALYKNGDIVNDQWVKKHGNNEYREITKDFSEVLKLSKIRKEDLKDLKYLEISKSIPYNVKPSSKEKLFKGKIYVLVSEQVQSSGEDFVEYCKNTKFATLIGTTTGGNSPAMSPVYDVLPNSGLMLSYQIDYKLNPDGTCNTEFGLPPDIVSKENEEPLDTFKRVILEKKL >NC_004557.1|WP_023438420.1|1600700_1600883_+|hypothetical-protein MILLLISMLSSKIFDIIFFNLLGETTGTLGSIIGFVLPYSIALEIILKKLFFEPSSKDSK >NC_004557.1|WP_128993785.1|1601055_1602771_+|histidine-decarboxylase MTQPTKDPNTVYPKVPGIDYDKFKLSEDKMTSKQINDALEELHNYISNQQINFLGYQINQSFNYMKDLKEYLNVHMNNIGDPFVSGNFTVNTKFLERAVLDYFASLWNAQWPHESKGDSNTNDWKNSYWGYVVSMGSTEANFFGIWNARDYLSGKALLLDTSTHKRAKSASINGNPQSVEPRVLNYQAKSLEDNPNMYTPIAFYSQDTHYSIIKGMRILNFTTFNEAGSGKFECPLKYPEDYPKGFSINYLDENGWPFEVPSNNDGSVFIPALKKLVEAFASKGYPIFVNFNYGTTFKGSYDNVEKAIDELVPILKKYNLYEREIIFDKNNKNSDTRTGFWFHVDGALGAAYMPFLEMTTDNEDFPVFDFRLKDVHSISMSGHKWIGVPWPCGIYMSKIKYQLLPPDNPNYIGSPDSTFAGSRNAFSSLILWYYIATHSYEDCKNMILDCQDTAKYTVEKLNELSKKLGIDLWVEYSSKSLTIRFKEANPDIVFKYSLSGEILYVNGEKRAYSHIYIMPHVTKDLIDKFIKDLSKPGAFPEQVSHLEKDGVNFNSNSHKGIYVPQIGRGFK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_004557_8 | 1595950-1596044 | TypeI-B |
III-B
Consensus repeat of NC_004557_8
|
1 spacers
spacers of NC_004557_8
>8.1|1595980|35|NC_004557|CRISPRCasFinder GTACCTGTGCCAAGACTATTAAATTTTTTTGCTAA |
cas6,cas8b1,cas7b,cas5,cas3 |
CRISPR arrays and Neighbor proteins around NC_004557_8
The CRISPR arrays of NC_004557_8 >merge|NC_004557|8|1595950-1596044|CRISPRCasFinder ATTTAAATACAACTCTTGTTATTGTTCAACGTACCTGTGCCAAGACTATTAAATTTTTTTGCTAAATTTAAATACAACTCTTGTTATTGTTCAAC >NC_004557|8|8|1595950-1596044|CRISPRCasFinder ATTTAAATACAACTCTTGTTATTGTTCAAC GTACCTGTGCCAAGACTATTAAATTTTTTTGCTAA ATTTAAATACAACTCTTGTTATTGTTCAAC
>NC_004557.1|WP_035125170.1|1595605_1595929_-|hypothetical-protein MFNTADSEISEIITAAIHDSSGIGSEDTSSLKFMILEGDWNNKEIPQCFEGIKSAGESGKIQVLSRGKNLFTKDSIYYLNVVTGKTIRNPSKFVSSSIAVKPNTSCM >NC_004557.1|WP_041744712.1|1595394_1595637_-|hypothetical-protein MQLNPTQVVCSSINALEGNVVDIEKELNAKANQEYFHKLVKLDKHADSIIKFISELNCTESLSNDATYMLMNDLVERIKL >NC_004557.1|WP_023438413.1|1593443_1594592_-|polysaccharide-deacetylase MRRKNKFGVILFLMIIITALSIILSDSRGSRQFIKSQKSENMASKNEKNKKKIKSEKYNGEMEFYNGEIEHLFFHPLILDKKAAFTGPKWHTDNMDNWLVTVEESKKVINSLYSKGYILIDPNSLYEEYESDGKKLFRKKPLKVPKGKKPLILSIDDLSYNEGMRKATALKLILDENGNLATYRKDQNGKEIIGYDEIVMILDRFVKEHPDFSLNGAKGVIALTGYEGVFGYRTNLDSKNREEEAKKAKVIANKLKENGWKFASHSYGHLDNAKIPFQTLKRDADKWEQLVKPIIGHTSIYVYPHGTAIKTNSEKFKYLQSKGFKIFYSVDSYLGERISENDLVVEGGRMPIDGLSMRNRREAFLKFFDAKEVLDLESRPKR >NC_004557.1|WP_011099687.1|1590319_1591906_-|AAA-family-ATPase MKKRFNVTGTCIPERHYMVDISNKLDSILKLVNNEEYFIINRPRQYGKTTTLYLLEKRLNHMKEYLPIKISFEAIDTEGYSKVEKFLSSIMMQIVNYFRFSTNKEMYKFIKNCENQITNMNDFNSFITDLVEFSEKKVVLIIDEVDKSSNNQLFLDFLGMLRSKYLLRNEGKDYTFHSVILAGVHDVKTLKLKIRPDEEHKYNSPWNIASDFDVDMSFSIDEIKTMLDDYVENKKVNLDKEYFAKKIYFYTSGYPFLVSKLCKIIDEKIMVEDELKWEKEYLELAVKELLKESNTNFDSLIKNIENNKELSQIIDNILIKGTRINFNIHNPDINLGYLYGIFKNNKGNLEINNRIYEQLIYEYRISKIQTASNFLNYNLKENFIKCNGDLDITKVLIKFQEFMKHEYSQKRDAFLEEDGRLVFLAFLSPIINGAGFAFKEVQGGEEKRFDIVITYNEKMYILELKIWRGEEYHKKGLKQLGEYLNQYGLEEGYLLIFDFRKATNLIGKTEETHVNAEDNIKKIIEVYC >NC_004557.1|WP_155274218.1|1589321_1589468_-|hypothetical-protein MKKKLKFSISATYEDLKEKERIEIDDIIYIIELVSVITLILKIFQFIN >NC_004557.1|WP_011099686.1|1588509_1589202_-|CRISPR-associated-endoribonuclease-Cas6 MKIYELTLKVFLLKDIKSDESLEKISNLIDKSLSKDGKLLDFHERNTYKNYTFNSLYPIEKDKIYNEGKIYSVQIRTVDESLIQYFKKNLTNEYTEYIKALTLECRVIPQRYIEKIYSITPVIIKTEKGYWKGNLSLGEFEERIKNNLIKKYNSFFNTKIDERFTLFRTINLINNKPISCSYKDINILGDKITLIIDENEMAQKLACFSLGSGVGEMNARGYGFVNYKWL >NC_004557.1|WP_011099685.1|1586757_1588497_-|type-I-B-CRISPR-associated-protein-Cas8b/Csh1 MLKDVISIFKREYEKIGDRYVTESYIPSDGEYIIVDTFENDFKILDKVIIKKDRKTQKIDDSNQYFPFIREADYLSRLLDMNKPIDHKKIIHSNNYLSFFIKKENVNNGKLSDEIIDRYYEILKDPLIKYKNTKAEKLYEEVEEEHGKVNEKLIDEIKNWIKEKIHDFVDKGSKEKEYLKIFFKYDLDKYRKESEKYISPNLYNSNDYNVKIKEEIYGLPNDNMGLNSKKPYLENKTRKSKVPYLISKEEVLIQKKFFDYLMNQVAIGKSNIYINEKGIKGISNKETLGEDFTGYYLRIQKGKEVEIHNFDTIVNYRAKIEPFKLENVLELEKSELNYNVFIYEIGKLKDLIDNVFFYKFLSGNFFTKAEDLNINDATLKRSILLSRDTLFTWFYKGVDNNTWNNLNISSLNLIKGSINKRYLLKAGEQFNLRCSLKNYFEGGISMADVLLEVKNSLREKINKTVKENKNHEDVTLDNDREYYFAVGQLAYYLISLSKSKNKSHSLVNPIINAKTNERIKDEIRRLYTRYNYRIEFGSKRVERLYSMISSYVPKGKINGDLIIAGFLKNNLIYEKSEEE >NC_004557.1|WP_011099684.1|1585792_1586752_-|type-I-CRISPR-associated-protein-Cas7 MGMNKRVYGVLGIVSRMSNWNADFTGYPKTTSSGDVFGSDKAFKYPMKKMWENGGEKVLYIKSIKFQENKKKERELIPRTLKERYEYIFDVEDLKKNKDSEEVLKNLFTAVDVKNFGATFAEEGNNISITGAVQIGQGFNKYKETYAEEQQILSPFRDPNQKEKSKDGEEAKSSTLGTKIVSNEAHYFYPLTVNPSAYSQFEEIGVTNGYTEEDYEKFKETSMIAATSFNTNSKIGCENEFALFVETKEDLYLPDLSQYVDFEKVEDKNIIILSCSELLNSFENEIENIEIYYNSYTTEIKSDEIKKAKKFNIFTKKEV >NC_004557.1|WP_011099683.1|1585018_1585789_-|type-I-B-CRISPR-associated-protein-Cas5 MDALKFSLSGRTAFFKKPDVNSFFYFTYGNVHKVALLGILGAICGYGGYNSQCLNKEQIYPEFYEKLKDINIGVVPKNEKGYIDKKIQVFNNSVGYASKELGGNLIVKEQWLENPKWAIYILMDENVPKDLKDRLLNFKFKYIPYLGKNDHMANITDVEYLENIEKLDNTNKLDSIFIKDKYEIQKESKNFNDLKNIIKKSSSKIQEFKYEEMLPISLEETTNKYNLETFIYTNSNLKPLADTKTYKCGDKNIFFF >NC_004557.1|WP_011099682.1|1582361_1584962_-|CRISPR-associated-helicase/endonuclease-Cas3 MYFNNIEKVNLENIIENNDKIYAHIHNGRKETLKEHSDLALKYLYKISERKSLDNVFLKIENNFLEKCSNEEKMVYRKMLLNTIYMHDLGKINCNFQRKKMANKIFKEEKMSSTNHSMLSSIIYINHFLKEIASIENGEHIKLLIAFLLLNSYVISKHHGAFNSVNKFKEKLVYDGEEGKDLYTKYMYIFDKVYKEEIIINESLIKEDLFDMYKSTIQEKTEENKDFPVELYIYERFLASLLLSCDYYSTSEFKNQKEVEEFGEIKNIEKFYKSFKSTEVYNWIRKYEKNDYGKTDDFSNIDDINVLRNELFLDAEKTMVSNIDKDIFYLEAPTGSGKSNVSFNLSFKMVERFKEINKIFYVYPFNTLVEQNIKTLEKIFKNNEIMKDIAIINSVVPIKIKSSKDNKIKEIDTNEESDILNEDYERALLDRQFLHYPIVLTTHVSIFNYLFGTSKDNLFPLCQIANSIIVLDEIQSYKNRIWKEIITFLACYSRLLNIKIIIMSATLPNLNKLVDGEIKTVNLIENRKKYFENPIFKNRVMVDFSLLEEKENIKEVLFNNVIKNTKAPNKNILVEFITKESAMDFYEKLKDYNKYLQESEKREIELITGDDNRVERNRIIDKIKSQKNIILVATQVIEAGVDIDMDIGYKDISMLDSEEQFLGRINRSCKNDEQGIVYFFDLDLASHVYKRDIRKQKNINLTCPKIREILINKNFQEFYDYVIKELNKKAGEYNNSSFQTFFLDKVKMLNFKEIEERMKLIDELYENNVFLNRNITLENEEELCGEDVWNEYIAILKNNKLDYAEKKIKLSQVTAKLNYFIYQISSDDFIYEDRVGDIYYIGDGEKYFEDGKFDRKKFKSIVADII >NC_004557.1|WP_011099690.1|1596261_1596732_-|DUF4829-domain-containing-protein MKKSYMMIIMISILFGVKLIYSNSAESIIKEYYKVIDSQQDVGKYNKLVIEDERLKNLEGIPDIVEKRDILELKKLNVNEHPLLEKELNYKYADEKDNVRYYMIKYDIKFKENVATPVDSGIYYEVITVVKRKNKWLVTTDIRKASFHNDKLTIDS >NC_004557.1|WP_011099691.1|1596866_1597955_-|amidase-domain-containing-protein MKKKLMKLKSKLDYGSLKNAYIIKGDEITNEDINALIDNYFNWIYENLINNTIGELQNIVGNNKLAEFKKSKLKWLINWYGKKDEEIKDYKIYTEINDVDINGNIIYINVIYGEDLILKSSSDIVQKIRNQEHKILAKNVGSKLVIIHDYYNDELADEMFLVSDREFKTNKKVKSINKKLEKKTLEINKNIKKIDKLVKQYKRNLHNTLQINNIQERKYPGYDGIAAAKYAVKYAINYNPEYQDYNGRGGDCTNFISQCIYAGGIPTDNVWYKDSHAWIRVVELRSWLLKKGYARELTVQDNAKKGDLIQLRNSGGYWYHSLIVTYKNSSNGELFVSCHTGDYVNRALSTYTTDRRYLILTS >NC_004557.1|WP_011099692.1|1598119_1598677_+|hypothetical-protein MIKTKKKVFANVSFTVLLLVLINTSVFAAYPSKATMDNTWGFEKGDSTEKQLIYHQHGDLDWKGHVDFAMNEININPADISCYYGTSEDLANIVVTSNYWPDATWSGSTYAPIGLEPKTIELNSSAELTDWQRDAVTTHEFVHIWGINDSRNKNSISYGFTPVNYRTITDDVTTLLKNRYNEEVK >NC_004557.1|WP_011099693.1|1598677_1599271_+|hypothetical-protein MIKKISVILALTGVINTNSALSTPPSKVNEPIKPKASASYMEIDGLKELKAKSDIIVEVEGTDKFELIDYKGIKMRKTTVKILDVMKGNPTLKEITVVQTEGLESEEPPMKNEKLLMFLRKGIDITDSYVPIGGNQGIYKIITKKTKKNSMTPKKLPHLNAPKDDAIKIVTPTSLINNKILRDLNGNYDDIKKKLIE >NC_004557.1|WP_035125168.1|1599281_1600583_-|S41-family-peptidase MKRFKKITILAVVLIILILSKSFIGKAYYKKNAPEHIKNFSKKEALEDYDYMWNVLERNYPCFNVIERKHGVTIKDIKNGYRKRIENRENVDFKYFNMILNKSINKFSNVGHLYVMDFNFYIMLRGTFDAIGKNEIGGIVKNNFEMAINKKTEETYKHIYNISYGKKILKNLNFTNISNKLYDNKNLSFKEIDKDTAYIKINNFYHYNIANDKDKLINFYRKNSDKKNLVIDLTENRGGADSYWMTSIVAPNIDKELKLYNQYALYKNGDIVNDQWVKKHGNNEYREITKDFSEVLKLSKIRKEDLKDLKYLEISKSIPYNVKPSSKEKLFKGKIYVLVSEQVQSSGEDFVEYCKNTKFATLIGTTTGGNSPAMSPVYDVLPNSGLMLSYQIDYKLNPDGTCNTEFGLPPDIVSKENEEPLDTFKRVILEKKL >NC_004557.1|WP_023438420.1|1600700_1600883_+|hypothetical-protein MILLLISMLSSKIFDIIFFNLLGETTGTLGSIIGFVLPYSIALEIILKKLFFEPSSKDSK >NC_004557.1|WP_128993785.1|1601055_1602771_+|histidine-decarboxylase MTQPTKDPNTVYPKVPGIDYDKFKLSEDKMTSKQINDALEELHNYISNQQINFLGYQINQSFNYMKDLKEYLNVHMNNIGDPFVSGNFTVNTKFLERAVLDYFASLWNAQWPHESKGDSNTNDWKNSYWGYVVSMGSTEANFFGIWNARDYLSGKALLLDTSTHKRAKSASINGNPQSVEPRVLNYQAKSLEDNPNMYTPIAFYSQDTHYSIIKGMRILNFTTFNEAGSGKFECPLKYPEDYPKGFSINYLDENGWPFEVPSNNDGSVFIPALKKLVEAFASKGYPIFVNFNYGTTFKGSYDNVEKAIDELVPILKKYNLYEREIIFDKNNKNSDTRTGFWFHVDGALGAAYMPFLEMTTDNEDFPVFDFRLKDVHSISMSGHKWIGVPWPCGIYMSKIKYQLLPPDNPNYIGSPDSTFAGSRNAFSSLILWYYIATHSYEDCKNMILDCQDTAKYTVEKLNELSKKLGIDLWVEYSSKSLTIRFKEANPDIVFKYSLSGEILYVNGEKRAYSHIYIMPHVTKDLIDKFIKDLSKPGAFPEQVSHLEKDGVNFNSNSHKGIYVPQIGRGFK >NC_004557.1|WP_011099696.1|1602894_1603476_-|TlpA-family-protein-disulfide-reductase MKKRMKKIILLAVFVITIISLVGCSSDKKDNTSQNSKSIQSTSNAKVFPKFQGEDFEGNTVDEKVFSKHPVTVVNLWFAGCKACVDEMPDLEKMSAEFQKKNVKMLGIDIDSTDDKEEVKKLLKAKGVTYQNLMLKSDKEIDEFLSKISAFPTTFLINSKGEIVGEAIEGVINSPKRIEEINRKIDEIIGQDK >NC_004557.1|WP_035125165.1|1603540_1604416_-|4Fe-4S-binding-protein MDRKRNIIQAFSTFITNIHFPNFLKGVLYNGQIKRVCVPGLNCYSCPAATGACPIGSFQAVVGSSKFSFSYYITGILILFGVLLGRFICGFFCPFGWFQDLLYKIPFKKFSTKKLKLLTYLKYLMLFVGVGLLPILITNNVGMGSPFFCKYVCPQGILEGGIPLSIANKGIRSSLGALFALKSIILVMVILLSIMFYRPFCKWICPLGAFYSFFNKISLYSYDFNKDNCVNCGKCRRVCKMDVDITKSTTHNECIRCGECIKVCPTKAISTFWGYEKRNSSTKIIGKYKNI >NC_004557.1|WP_011099698.1|1604648_1605866_-|HAMP-domain-containing-protein MERDISLMNKNNYENKSYDGSFSKKIKNSIVVKLMVTIIIIFIGMMTLSNFMINICVNNYFQEFDVEIENIFQTANHDDLYLLINEAQVKSTLEFRLYIILIMLFTVLIGCFFLYFIISHMMKPLKSLAEQVSEIDIHNIEDLNQEIVAIKGGYEIEDLAHTFNVTLKKLYLDYESQKKFSSNVAHELRTPLAVLYSKIDVFGKKSERNIEEYEELITSLKFNIERLADLVSKILLLTKKSNNIKLINVCLKDIVEEIVFDLEGIAEEKSVTATITGDNISMCTDDGLIQRVLFNLIENAIKYNVNNGKVNINLSKNDTDTIIEIADTGIGITDEHKEKVFDIFYRVEQSRNRALGGYGIGLALVESIVKVLGGKIFIRDNKPQGTIFVLSFQNIDSKMYSGSLN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NC_004557_1 | 1.1|1217338|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1217338-1217373 | 36 | NC_004557.1 | 1205308-1205343 | 0 | 1.0 |
NC_004557_1 | 1.2|1217404|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1217404-1217439 | 36 | NC_004557.1 | 1191961-1191996 | 0 | 1.0 |
NC_004557_1 | 1.8|1217803|39|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1217803-1217841 | 39 | NC_004557.1 | 1099343-1099381 | 0 | 1.0 |
NC_004557_1 | 1.9|1217872|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1217872-1217907 | 36 | NC_004557.1 | 1399450-1399485 | 0 | 1.0 |
NC_004557_2 | 2.9|1571318|36|NC_004557|CRISPRCasFinder,CRT | 1571318-1571353 | 36 | NC_004557.1 | 345159-345194 | 0 | 1.0 |
NC_004557_2 | 2.18|1571320|36|NC_004557|PILER-CR | 1571320-1571355 | 36 | NC_004557.1 | 345159-345194 | 0 | 1.0 |
NC_004557_3 | 3.1|1573265|34|NC_004557|CRISPRCasFinder,CRT | 1573265-1573298 | 34 | NC_004557.1 | 2280755-2280788 | 0 | 1.0 |
NC_004557_7 | 7.6|1592362|36|NC_004557|CRISPRCasFinder,CRT | 1592362-1592397 | 36 | NC_004557.1 | 462335-462370 | 0 | 1.0 |
NC_004557_7 | 7.12|1592363|36|NC_004557|PILER-CR | 1592363-1592398 | 36 | NC_004557.1 | 462335-462370 | 0 | 1.0 |
1. spacer 1.1|1217338|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to position: 1205308-1205343, mismatch: 0, identity: 1.0
ttaaatgaaggtactaaatttaaggtaagaatggtg CRISPR spacer ttaaatgaaggtactaaatttaaggtaagaatggtg Protospacer ************************************
2. spacer 1.2|1217404|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to position: 1191961-1191996, mismatch: 0, identity: 1.0
agcattcctctatctccattaactactgaaaaagga CRISPR spacer agcattcctctatctccattaactactgaaaaagga Protospacer ************************************
3. spacer 1.8|1217803|39|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to position: 1099343-1099381, mismatch: 0, identity: 1.0
atacaatgctccatggaaaggactccacttagatatata CRISPR spacer atacaatgctccatggaaaggactccacttagatatata Protospacer ***************************************
4. spacer 1.9|1217872|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to position: 1399450-1399485, mismatch: 0, identity: 1.0
cccacatcattaaaggatataaaattaccaccttcc CRISPR spacer cccacatcattaaaggatataaaattaccaccttcc Protospacer ************************************
5. spacer 2.9|1571318|36|NC_004557|CRISPRCasFinder,CRT matches to position: 345159-345194, mismatch: 0, identity: 1.0
acagtaacatgaatacactcatgttactgtttttca CRISPR spacer acagtaacatgaatacactcatgttactgtttttca Protospacer ************************************
6. spacer 2.18|1571320|36|NC_004557|PILER-CR matches to position: 345159-345194, mismatch: 0, identity: 1.0
acagtaacatgaatacactcatgttactgtttttca CRISPR spacer acagtaacatgaatacactcatgttactgtttttca Protospacer ************************************
7. spacer 3.1|1573265|34|NC_004557|CRISPRCasFinder,CRT matches to position: 2280755-2280788, mismatch: 0, identity: 1.0
ttaatccagataaaatatattctcttacagcaat CRISPR spacer ttaatccagataaaatatattctcttacagcaat Protospacer **********************************
8. spacer 7.6|1592362|36|NC_004557|CRISPRCasFinder,CRT matches to position: 462335-462370, mismatch: 0, identity: 1.0
gcatcaaacatagttacagcaattgtagttacaaag CRISPR spacer gcatcaaacatagttacagcaattgtagttacaaag Protospacer ************************************
9. spacer 7.12|1592363|36|NC_004557|PILER-CR matches to position: 462335-462370, mismatch: 0, identity: 1.0
gcatcaaacatagttacagcaattgtagttacaaag CRISPR spacer gcatcaaacatagttacagcaattgtagttacaaag Protospacer ************************************
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_004557_1 | 1.6|1217671|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1217671-1217705 | 35 | KM983328 | Clostridium phage phiCT453B, complete genome | 7040-7074 | 0 | 1.0 |
NC_004557_3 | 3.1|1573265|34|NC_004557|CRISPRCasFinder,CRT | 1573265-1573298 | 34 | KM983328 | Clostridium phage phiCT453B, complete genome | 594-627 | 0 | 1.0 |
NC_004557_3 | 3.1|1573265|34|NC_004557|CRISPRCasFinder,CRT | 1573265-1573298 | 34 | KM983334 | Clostridium phage phiCTC2B, complete genome | 594-627 | 0 | 1.0 |
NC_004557_3 | 3.1|1573265|34|NC_004557|CRISPRCasFinder,CRT | 1573265-1573298 | 34 | KM983331 | Clostridium phage phiCT19406B, complete genome | 594-627 | 0 | 1.0 |
NC_004557_3 | 3.3|1573393|35|NC_004557|CRISPRCasFinder,CRT,PILER-CR | 1573393-1573427 | 35 | KM983328 | Clostridium phage phiCT453B, complete genome | 24672-24706 | 0 | 1.0 |
NC_004557_3 | 3.5|1573525|35|NC_004557|CRISPRCasFinder,CRT,PILER-CR | 1573525-1573559 | 35 | KM983327 | Clostridium phage phiCT453A, complete genome | 21916-21950 | 0 | 1.0 |
NC_004557_7 | 7.2|1592104|34|NC_004557|CRISPRCasFinder,CRT | 1592104-1592137 | 34 | KM983328 | Clostridium phage phiCT453B, complete genome | 14660-14693 | 0 | 1.0 |
NC_004557_7 | 7.8|1592105|34|NC_004557|PILER-CR | 1592105-1592138 | 34 | KM983328 | Clostridium phage phiCT453B, complete genome | 14660-14693 | 0 | 1.0 |
NC_004557_3 | 3.6|1573590|36|NC_004557|CRISPRCasFinder,CRT,PILER-CR | 1573590-1573625 | 36 | KM983327 | Clostridium phage phiCT453A, complete genome | 5886-5921 | 1 | 0.972 |
NC_004557_4 | 4.2|1577900|36|NC_004557|PILER-CR | 1577900-1577935 | 36 | KM983327 | Clostridium phage phiCT453A, complete genome | 9576-9611 | 1 | 0.972 |
NC_004557_4 | 4.4|1577899|38|NC_004557|CRISPRCasFinder | 1577899-1577936 | 38 | KM983327 | Clostridium phage phiCT453A, complete genome | 9576-9613 | 1 | 0.974 |
NC_004557_1 | 1.6|1217671|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1217671-1217705 | 35 | KM983329 | Clostridium phage phiCT9441A, complete genome | 8919-8953 | 2 | 0.943 |
NC_004557_2 | 2.8|1571254|34|NC_004557|CRISPRCasFinder,CRT | 1571254-1571287 | 34 | KM983327 | Clostridium phage phiCT453A, complete genome | 19576-19609 | 2 | 0.941 |
NC_004557_2 | 2.17|1571256|34|NC_004557|PILER-CR | 1571256-1571289 | 34 | KM983327 | Clostridium phage phiCT453A, complete genome | 19576-19609 | 2 | 0.941 |
NC_004557_1 | 1.14|1218204|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1218204-1218238 | 35 | NZ_CP013848 | Clostridium botulinum strain Af650 plasmid pRSJ14_1, complete sequence | 10611-10645 | 3 | 0.914 |
NC_004557_2 | 2.4|1570993|34|NC_004557|CRISPRCasFinder,CRT | 1570993-1571026 | 34 | NC_010418 | Clostridium botulinum A3 str. Loch Maree plasmid pCLK, complete sequence | 196160-196193 | 3 | 0.912 |
NC_004557_2 | 2.13|1570995|34|NC_004557|PILER-CR | 1570995-1571028 | 34 | NC_010418 | Clostridium botulinum A3 str. Loch Maree plasmid pCLK, complete sequence | 196160-196193 | 3 | 0.912 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | CP002037 | Lactobacillus salivarius CECT 5713 plasmid pHN3, complete sequence | 56952-56983 | 5 | 0.844 |
NC_004557_2 | 2.7|1571188|36|NC_004557|CRISPRCasFinder,CRT | 1571188-1571223 | 36 | NZ_CP013615 | Clostridium perfringens strain JP838 plasmid pJFP838A, complete sequence | 313612-313647 | 6 | 0.833 |
NC_004557_2 | 2.16|1571190|36|NC_004557|PILER-CR | 1571190-1571225 | 36 | NZ_CP013615 | Clostridium perfringens strain JP838 plasmid pJFP838A, complete sequence | 313612-313647 | 6 | 0.833 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | NZ_CP045273 | Bacillus megaterium strain FDU301 plasmid pFDU301A, complete sequence | 353608-353639 | 6 | 0.812 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | NZ_CP015592 | Bacillus cereus strain AR156 plasmid pAR460, complete sequence | 412221-412252 | 6 | 0.812 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | NZ_CP009368 | Bacillus cereus strain FM1 plasmid unnamed, complete sequence | 364324-364355 | 6 | 0.812 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | MT774380 | CrAssphage cr1_1, complete genome | 3245-3276 | 6 | 0.812 |
NC_004557_4 | 4.3|1577835|34|NC_004557|CRISPRCasFinder | 1577835-1577868 | 34 | NZ_CP045273 | Bacillus megaterium strain FDU301 plasmid pFDU301A, complete sequence | 353606-353639 | 6 | 0.824 |
NC_004557_2 | 2.7|1571188|36|NC_004557|CRISPRCasFinder,CRT | 1571188-1571223 | 36 | MN693162 | Marine virus AFVG_25M233, complete genome | 7269-7304 | 7 | 0.806 |
NC_004557_2 | 2.16|1571190|36|NC_004557|PILER-CR | 1571190-1571225 | 36 | MN693162 | Marine virus AFVG_25M233, complete genome | 7269-7304 | 7 | 0.806 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | NZ_CP033050 | Virgibacillus halodenitrificans strain Bac324 plasmid unnamed, complete sequence | 309797-309828 | 7 | 0.781 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | NZ_CP043831 | Bacillus sp. BS98 plasmid unnamed1 | 186464-186495 | 7 | 0.781 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | NZ_LR214986 | Mycoplasma cynos strain NCTC10142 plasmid 13 | 840082-840113 | 7 | 0.781 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | NZ_AP017969 | Fusobacterium varium strain Fv113-g1 plasmid pFV113-g1-1, complete sequence | 69076-69107 | 7 | 0.781 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | NC_007021 | Staphylococcus phage Twort, complete genome | 113971-114002 | 7 | 0.781 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | MT151386 | Staphylococcus virus Twort, complete genome | 25794-25825 | 7 | 0.781 |
NC_004557_4 | 4.3|1577835|34|NC_004557|CRISPRCasFinder | 1577835-1577868 | 34 | CP002037 | Lactobacillus salivarius CECT 5713 plasmid pHN3, complete sequence | 56952-56985 | 7 | 0.794 |
NC_004557_4 | 4.3|1577835|34|NC_004557|CRISPRCasFinder | 1577835-1577868 | 34 | NZ_CP015592 | Bacillus cereus strain AR156 plasmid pAR460, complete sequence | 412221-412254 | 7 | 0.794 |
NC_004557_4 | 4.3|1577835|34|NC_004557|CRISPRCasFinder | 1577835-1577868 | 34 | NZ_CP009368 | Bacillus cereus strain FM1 plasmid unnamed, complete sequence | 364324-364357 | 7 | 0.794 |
NC_004557_4 | 4.3|1577835|34|NC_004557|CRISPRCasFinder | 1577835-1577868 | 34 | MT774380 | CrAssphage cr1_1, complete genome | 3243-3276 | 7 | 0.794 |
NC_004557_7 | 7.2|1592104|34|NC_004557|CRISPRCasFinder,CRT | 1592104-1592137 | 34 | NZ_CP024873 | Leptospira mayottensis 200901116 plasmid p1_L200901116, complete sequence | 73565-73598 | 7 | 0.794 |
NC_004557_7 | 7.8|1592105|34|NC_004557|PILER-CR | 1592105-1592138 | 34 | NZ_CP024873 | Leptospira mayottensis 200901116 plasmid p1_L200901116, complete sequence | 73565-73598 | 7 | 0.794 |
NC_004557_1 | 1.25|1218935|37|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1218935-1218971 | 37 | NZ_CP014152 | Clostridium botulinum strain BrDura plasmid pRSJ20_1, complete sequence | 146195-146231 | 8 | 0.784 |
NC_004557_1 | 1.25|1218935|37|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1218935-1218971 | 37 | NZ_CP013710 | Clostridium botulinum strain F634 plasmid pRSJ2_3, complete sequence | 133154-133190 | 8 | 0.784 |
NC_004557_2 | 2.3|1570929|34|NC_004557|CRISPRCasFinder,CRT | 1570929-1570962 | 34 | NC_028838 | Clostridium phage phiCD506, complete genome | 19424-19457 | 8 | 0.765 |
NC_004557_2 | 2.6|1571123|35|NC_004557|CRISPRCasFinder,CRT | 1571123-1571157 | 35 | NZ_CP039845 | Acetobacter pasteurianus strain CICC 22518 plasmid pAP22518-1, complete sequence | 44968-45002 | 8 | 0.771 |
NC_004557_2 | 2.7|1571188|36|NC_004557|CRISPRCasFinder,CRT | 1571188-1571223 | 36 | MN693344 | Marine virus AFVG_25M232, complete genome | 2321-2356 | 8 | 0.778 |
NC_004557_2 | 2.7|1571188|36|NC_004557|CRISPRCasFinder,CRT | 1571188-1571223 | 36 | AP013454 | Uncultured Mediterranean phage uvMED DNA, complete genome, group G17, isolate: uvMED-CGR-U-MedDCM-OCT-S33-C36 | 19997-20032 | 8 | 0.778 |
NC_004557_2 | 2.12|1570931|34|NC_004557|PILER-CR | 1570931-1570964 | 34 | NC_028838 | Clostridium phage phiCD506, complete genome | 19424-19457 | 8 | 0.765 |
NC_004557_2 | 2.15|1571125|35|NC_004557|PILER-CR | 1571125-1571159 | 35 | NZ_CP039845 | Acetobacter pasteurianus strain CICC 22518 plasmid pAP22518-1, complete sequence | 44968-45002 | 8 | 0.771 |
NC_004557_2 | 2.16|1571190|36|NC_004557|PILER-CR | 1571190-1571225 | 36 | MN693344 | Marine virus AFVG_25M232, complete genome | 2321-2356 | 8 | 0.778 |
NC_004557_2 | 2.16|1571190|36|NC_004557|PILER-CR | 1571190-1571225 | 36 | AP013454 | Uncultured Mediterranean phage uvMED DNA, complete genome, group G17, isolate: uvMED-CGR-U-MedDCM-OCT-S33-C36 | 19997-20032 | 8 | 0.778 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | MF186604 | Methanosarcina spherical virus, complete genome | 971-1002 | 8 | 0.75 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | AP014322 | Uncultured Mediterranean phage uvMED isolate uvMED-GF-U-MedDCM-OCT-S33-C37, *** SEQUENCING IN PROGRESS *** | 11492-11523 | 8 | 0.75 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | NC_019749 | Stanieria cyanosphaera PCC 7437 plasmid pSTA7437.02, complete sequence | 38582-38613 | 8 | 0.75 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | MK614706 | Gammaproteobacteria virus GOV_bin_2604, complete genome | 55000-55031 | 8 | 0.75 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | MT325768 | Psychrobacillus phage Perkons, complete genome | 7524-7555 | 8 | 0.75 |
NC_004557_4 | 4.3|1577835|34|NC_004557|CRISPRCasFinder | 1577835-1577868 | 34 | NZ_CP043831 | Bacillus sp. BS98 plasmid unnamed1 | 186464-186497 | 8 | 0.765 |
NC_004557_5 | 5.1|1580118|36|NC_004557|CRISPRCasFinder,CRT | 1580118-1580153 | 36 | MN693129 | Marine virus AFVG_25M62, complete genome | 46972-47007 | 8 | 0.778 |
NC_004557_5 | 5.5|1580119|36|NC_004557|PILER-CR | 1580119-1580154 | 36 | MN693129 | Marine virus AFVG_25M62, complete genome | 46972-47007 | 8 | 0.778 |
NC_004557_6 | 6.4|1589907|34|NC_004557|CRISPRCasFinder | 1589907-1589940 | 34 | NZ_CP017256 | Clostridium taeniosporum strain 1/k plasmid pCt3, complete sequence | 158090-158123 | 8 | 0.765 |
NC_004557_6 | 6.4|1589907|34|NC_004557|CRISPRCasFinder | 1589907-1589940 | 34 | NC_011732 | Gloeothece citriformis PCC 7424 plasmid pP742404, complete sequence | 17051-17084 | 8 | 0.765 |
NC_004557_6 | 6.5|1589971|34|NC_004557|CRISPRCasFinder | 1589971-1590004 | 34 | NZ_CP017256 | Clostridium taeniosporum strain 1/k plasmid pCt3, complete sequence | 158090-158123 | 8 | 0.765 |
NC_004557_6 | 6.5|1589971|34|NC_004557|CRISPRCasFinder | 1589971-1590004 | 34 | NC_011732 | Gloeothece citriformis PCC 7424 plasmid pP742404, complete sequence | 17051-17084 | 8 | 0.765 |
NC_004557_7 | 7.1|1592039|35|NC_004557|CRISPRCasFinder,CRT | 1592039-1592073 | 35 | NZ_CP026601 | Clostridiaceae bacterium 14S0207 plasmid unnamed1, complete sequence | 34147-34181 | 8 | 0.771 |
NC_004557_7 | 7.7|1592040|35|NC_004557|PILER-CR | 1592040-1592074 | 35 | NZ_CP026601 | Clostridiaceae bacterium 14S0207 plasmid unnamed1, complete sequence | 34147-34181 | 8 | 0.771 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | MH617682 | Microviridae sp. isolate ctcb14, complete genome | 11-42 | 9 | 0.719 |
NC_004557_4 | 4.3|1577835|34|NC_004557|CRISPRCasFinder | 1577835-1577868 | 34 | NZ_CP033050 | Virgibacillus halodenitrificans strain Bac324 plasmid unnamed, complete sequence | 309795-309828 | 9 | 0.735 |
NC_004557_4 | 4.3|1577835|34|NC_004557|CRISPRCasFinder | 1577835-1577868 | 34 | NZ_LR214986 | Mycoplasma cynos strain NCTC10142 plasmid 13 | 840082-840115 | 9 | 0.735 |
NC_004557_4 | 4.3|1577835|34|NC_004557|CRISPRCasFinder | 1577835-1577868 | 34 | AP014322 | Uncultured Mediterranean phage uvMED isolate uvMED-GF-U-MedDCM-OCT-S33-C37, *** SEQUENCING IN PROGRESS *** | 11490-11523 | 9 | 0.735 |
NC_004557_5 | 5.3|1580250|34|NC_004557|CRISPRCasFinder,CRT | 1580250-1580283 | 34 | NZ_CP015331 | Borrelia hermsii HS1 isolate Browne Mountain plasmid lpN31, complete sequence | 6668-6701 | 9 | 0.735 |
NC_004557_5 | 5.3|1580250|34|NC_004557|CRISPRCasFinder,CRT | 1580250-1580283 | 34 | NZ_CP039041 | Piscirickettsia salmonis strain Psal-072 plasmid unnamed1, complete sequence | 53993-54026 | 9 | 0.735 |
NC_004557_5 | 5.3|1580250|34|NC_004557|CRISPRCasFinder,CRT | 1580250-1580283 | 34 | NZ_CP039048 | Piscirickettsia salmonis strain Psal-073 plasmid unnamed2, complete sequence | 95035-95068 | 9 | 0.735 |
NC_004557_5 | 5.7|1580251|34|NC_004557|PILER-CR | 1580251-1580284 | 34 | NZ_CP015331 | Borrelia hermsii HS1 isolate Browne Mountain plasmid lpN31, complete sequence | 6668-6701 | 9 | 0.735 |
NC_004557_5 | 5.7|1580251|34|NC_004557|PILER-CR | 1580251-1580284 | 34 | NZ_CP039041 | Piscirickettsia salmonis strain Psal-072 plasmid unnamed1, complete sequence | 53993-54026 | 9 | 0.735 |
NC_004557_5 | 5.7|1580251|34|NC_004557|PILER-CR | 1580251-1580284 | 34 | NZ_CP039048 | Piscirickettsia salmonis strain Psal-073 plasmid unnamed2, complete sequence | 95035-95068 | 9 | 0.735 |
NC_004557_6 | 6.4|1589907|34|NC_004557|CRISPRCasFinder | 1589907-1589940 | 34 | NZ_KT897276 | Clostridium botulinum strain INGR16-02E1 plasmid pINGR16-02E1, complete sequence | 127266-127299 | 9 | 0.735 |
NC_004557_6 | 6.4|1589907|34|NC_004557|CRISPRCasFinder | 1589907-1589940 | 34 | NZ_KT897280 | Clostridium botulinum strain FI1111E1 plasmid pFI1111E1, complete sequence | 136997-137030 | 9 | 0.735 |
NC_004557_6 | 6.4|1589907|34|NC_004557|CRISPRCasFinder | 1589907-1589940 | 34 | NZ_KT897275 | Clostridium botulinum strain IFR 12/29 plasmid p12/29, complete sequence | 132152-132185 | 9 | 0.735 |
NC_004557_6 | 6.4|1589907|34|NC_004557|CRISPRCasFinder | 1589907-1589940 | 34 | NZ_KT897277 | Clostridium botulinum strain ST0210E1 plasmid pST0210E1, complete sequence | 127266-127299 | 9 | 0.735 |
NC_004557_6 | 6.4|1589907|34|NC_004557|CRISPRCasFinder | 1589907-1589940 | 34 | NZ_KT897278 | Clostridium botulinum strain FWSKR40E1 plasmid pFWSKR40E1, complete sequence | 134422-134455 | 9 | 0.735 |
NC_004557_6 | 6.4|1589907|34|NC_004557|CRISPRCasFinder | 1589907-1589940 | 34 | NZ_KT897279 | Clostridium botulinum strain SWKR38E2 plasmid pSWKR38E2, complete sequence | 135558-135591 | 9 | 0.735 |
NC_004557_6 | 6.5|1589971|34|NC_004557|CRISPRCasFinder | 1589971-1590004 | 34 | NZ_KT897276 | Clostridium botulinum strain INGR16-02E1 plasmid pINGR16-02E1, complete sequence | 127266-127299 | 9 | 0.735 |
NC_004557_6 | 6.5|1589971|34|NC_004557|CRISPRCasFinder | 1589971-1590004 | 34 | NZ_KT897280 | Clostridium botulinum strain FI1111E1 plasmid pFI1111E1, complete sequence | 136997-137030 | 9 | 0.735 |
NC_004557_6 | 6.5|1589971|34|NC_004557|CRISPRCasFinder | 1589971-1590004 | 34 | NZ_KT897275 | Clostridium botulinum strain IFR 12/29 plasmid p12/29, complete sequence | 132152-132185 | 9 | 0.735 |
NC_004557_6 | 6.5|1589971|34|NC_004557|CRISPRCasFinder | 1589971-1590004 | 34 | NZ_KT897277 | Clostridium botulinum strain ST0210E1 plasmid pST0210E1, complete sequence | 127266-127299 | 9 | 0.735 |
NC_004557_6 | 6.5|1589971|34|NC_004557|CRISPRCasFinder | 1589971-1590004 | 34 | NZ_KT897278 | Clostridium botulinum strain FWSKR40E1 plasmid pFWSKR40E1, complete sequence | 134422-134455 | 9 | 0.735 |
NC_004557_6 | 6.5|1589971|34|NC_004557|CRISPRCasFinder | 1589971-1590004 | 34 | NZ_KT897279 | Clostridium botulinum strain SWKR38E2 plasmid pSWKR38E2, complete sequence | 135558-135591 | 9 | 0.735 |
NC_004557_1 | 1.33|1219466|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1219466-1219500 | 35 | MT457553 | Shewanella phage Thanatos-2, complete genome | 40254-40288 | 10 | 0.714 |
NC_004557_1 | 1.33|1219466|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1219466-1219500 | 35 | MT457552 | Shewanella phage Thanatos-1, complete genome | 36870-36904 | 10 | 0.714 |
NC_004557_2 | 2.7|1571188|36|NC_004557|CRISPRCasFinder,CRT | 1571188-1571223 | 36 | MN693242 | Marine virus AFVG_25M170, complete genome | 29744-29779 | 10 | 0.722 |
NC_004557_2 | 2.16|1571190|36|NC_004557|PILER-CR | 1571190-1571225 | 36 | MN693242 | Marine virus AFVG_25M170, complete genome | 29744-29779 | 10 | 0.722 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | MF001361 | Enterococcus phage EF5, partial genome | 94857-94888 | 10 | 0.688 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | MN693045 | Marine virus AFVG_25M135, complete genome | 24068-24099 | 10 | 0.688 |
NC_004557_4 | 4.1|1577836|32|NC_004557|PILER-CR | 1577836-1577867 | 32 | MF001358 | Enterococcus phage EF1, partial genome | 22458-22489 | 10 | 0.688 |
NC_004557_4 | 4.3|1577835|34|NC_004557|CRISPRCasFinder | 1577835-1577868 | 34 | MF186604 | Methanosarcina spherical virus, complete genome | 969-1002 | 10 | 0.706 |
NC_004557_5 | 5.1|1580118|36|NC_004557|CRISPRCasFinder,CRT | 1580118-1580153 | 36 | NZ_CP009967 | Bacillus cereus E33L plasmid pBCO_1, complete sequence | 449487-449522 | 10 | 0.722 |
NC_004557_5 | 5.1|1580118|36|NC_004557|CRISPRCasFinder,CRT | 1580118-1580153 | 36 | NZ_CP053657 | Bacillus cereus strain CTMA_1571 plasmid p.1, complete sequence | 85812-85847 | 10 | 0.722 |
NC_004557_5 | 5.1|1580118|36|NC_004557|CRISPRCasFinder,CRT | 1580118-1580153 | 36 | NC_007103 | Bacillus cereus E33L plasmid pE33L466, complete sequence | 338896-338931 | 10 | 0.722 |
NC_004557_5 | 5.1|1580118|36|NC_004557|CRISPRCasFinder,CRT | 1580118-1580153 | 36 | CP024685 | Bacillus wiedmannii bv. thuringiensis strain FCC41 plasmid pFCC41-1-490K, complete sequence | 33508-33543 | 10 | 0.722 |
NC_004557_5 | 5.4|1580314|34|NC_004557|CRISPRCasFinder,CRT | 1580314-1580347 | 34 | NZ_AP018284 | Chondrocystis sp. NIES-4102 plasmid plasmid3 DNA, complete genome | 83426-83459 | 10 | 0.706 |
NC_004557_5 | 5.5|1580119|36|NC_004557|PILER-CR | 1580119-1580154 | 36 | NZ_CP009967 | Bacillus cereus E33L plasmid pBCO_1, complete sequence | 449487-449522 | 10 | 0.722 |
NC_004557_5 | 5.5|1580119|36|NC_004557|PILER-CR | 1580119-1580154 | 36 | NZ_CP053657 | Bacillus cereus strain CTMA_1571 plasmid p.1, complete sequence | 85812-85847 | 10 | 0.722 |
NC_004557_5 | 5.5|1580119|36|NC_004557|PILER-CR | 1580119-1580154 | 36 | NC_007103 | Bacillus cereus E33L plasmid pE33L466, complete sequence | 338896-338931 | 10 | 0.722 |
NC_004557_5 | 5.5|1580119|36|NC_004557|PILER-CR | 1580119-1580154 | 36 | CP024685 | Bacillus wiedmannii bv. thuringiensis strain FCC41 plasmid pFCC41-1-490K, complete sequence | 33508-33543 | 10 | 0.722 |
NC_004557_5 | 5.8|1580315|34|NC_004557|PILER-CR | 1580315-1580348 | 34 | NZ_AP018284 | Chondrocystis sp. NIES-4102 plasmid plasmid3 DNA, complete genome | 83426-83459 | 10 | 0.706 |
NC_004557_6 | 6.4|1589907|34|NC_004557|CRISPRCasFinder | 1589907-1589940 | 34 | NZ_CP022140 | Salmonella enterica subsp. salamae serovar 55:k:z39 str. 1315K plasmid unnamed1, complete sequence | 55234-55267 | 10 | 0.706 |
NC_004557_6 | 6.4|1589907|34|NC_004557|CRISPRCasFinder | 1589907-1589940 | 34 | MH617769 | Siphoviridae sp. isolate ctjc_2, complete genome | 42776-42809 | 10 | 0.706 |
NC_004557_6 | 6.4|1589907|34|NC_004557|CRISPRCasFinder | 1589907-1589940 | 34 | LR588166 | Pseudomonas phage vB_PaeM_MIJ3 | 50432-50465 | 10 | 0.706 |
NC_004557_6 | 6.5|1589971|34|NC_004557|CRISPRCasFinder | 1589971-1590004 | 34 | NZ_CP022140 | Salmonella enterica subsp. salamae serovar 55:k:z39 str. 1315K plasmid unnamed1, complete sequence | 55234-55267 | 10 | 0.706 |
NC_004557_6 | 6.5|1589971|34|NC_004557|CRISPRCasFinder | 1589971-1590004 | 34 | MH617769 | Siphoviridae sp. isolate ctjc_2, complete genome | 42776-42809 | 10 | 0.706 |
NC_004557_6 | 6.5|1589971|34|NC_004557|CRISPRCasFinder | 1589971-1590004 | 34 | LR588166 | Pseudomonas phage vB_PaeM_MIJ3 | 50432-50465 | 10 | 0.706 |
NC_004557_8 | 8.1|1595980|35|NC_004557|CRISPRCasFinder | 1595980-1596014 | 35 | MN694169 | Marine virus AFVG_250M458, complete genome | 620-654 | 10 | 0.714 |
NC_004557_1 | 1.19|1218532|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1218532-1218567 | 36 | NZ_CP010123 | Escherichia coli strain C5 plasmid A, complete genome | 146371-146406 | 11 | 0.694 |
NC_004557_1 | 1.28|1219135|38|NC_004557|PILER-CR,CRISPRCasFinder,CRT | 1219135-1219172 | 38 | NC_011737 | Gloeothece citriformis PCC 7424 plasmid pP742402, complete sequence | 86193-86230 | 11 | 0.711 |
1. spacer 1.6|1217671|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to KM983328 (Clostridium phage phiCT453B, complete genome) position: , mismatch: 0, identity: 1.0
ttgttcgtaactgttaaagctatctttctttatgc CRISPR spacer ttgttcgtaactgttaaagctatctttctttatgc Protospacer ***********************************
2. spacer 3.1|1573265|34|NC_004557|CRISPRCasFinder,CRT matches to KM983328 (Clostridium phage phiCT453B, complete genome) position: , mismatch: 0, identity: 1.0
ttaatccagataaaatatattctcttacagcaat CRISPR spacer ttaatccagataaaatatattctcttacagcaat Protospacer **********************************
3. spacer 3.1|1573265|34|NC_004557|CRISPRCasFinder,CRT matches to KM983334 (Clostridium phage phiCTC2B, complete genome) position: , mismatch: 0, identity: 1.0
ttaatccagataaaatatattctcttacagcaat CRISPR spacer ttaatccagataaaatatattctcttacagcaat Protospacer **********************************
4. spacer 3.1|1573265|34|NC_004557|CRISPRCasFinder,CRT matches to KM983331 (Clostridium phage phiCT19406B, complete genome) position: , mismatch: 0, identity: 1.0
ttaatccagataaaatatattctcttacagcaat CRISPR spacer ttaatccagataaaatatattctcttacagcaat Protospacer **********************************
5. spacer 3.3|1573393|35|NC_004557|CRISPRCasFinder,CRT,PILER-CR matches to KM983328 (Clostridium phage phiCT453B, complete genome) position: , mismatch: 0, identity: 1.0
agatgttttaacaacgataatgaatgcttacaaaa CRISPR spacer agatgttttaacaacgataatgaatgcttacaaaa Protospacer ***********************************
6. spacer 3.5|1573525|35|NC_004557|CRISPRCasFinder,CRT,PILER-CR matches to KM983327 (Clostridium phage phiCT453A, complete genome) position: , mismatch: 0, identity: 1.0
gagctacaagataaatacaaagatgtggatttagt CRISPR spacer gagctacaagataaatacaaagatgtggatttagt Protospacer ***********************************
7. spacer 7.2|1592104|34|NC_004557|CRISPRCasFinder,CRT matches to KM983328 (Clostridium phage phiCT453B, complete genome) position: , mismatch: 0, identity: 1.0
atatagagaaatcacttaaaataatagaatttgc CRISPR spacer atatagagaaatcacttaaaataatagaatttgc Protospacer **********************************
8. spacer 7.8|1592105|34|NC_004557|PILER-CR matches to KM983328 (Clostridium phage phiCT453B, complete genome) position: , mismatch: 0, identity: 1.0
atatagagaaatcacttaaaataatagaatttgc CRISPR spacer atatagagaaatcacttaaaataatagaatttgc Protospacer **********************************
9. spacer 3.6|1573590|36|NC_004557|CRISPRCasFinder,CRT,PILER-CR matches to KM983327 (Clostridium phage phiCT453A, complete genome) position: , mismatch: 1, identity: 0.972
atatgcaatagccatatttcaaagatattcaaagga CRISPR spacer atatgcaatagccctatttcaaagatattcaaagga Protospacer ************* **********************
10. spacer 4.2|1577900|36|NC_004557|PILER-CR matches to KM983327 (Clostridium phage phiCT453A, complete genome) position: , mismatch: 1, identity: 0.972
gtacaaaacttacctcaaaaccatctaccagattta CRISPR spacer gtacaaaacttacctcaaaaccatttaccagattta Protospacer ************************.***********
11. spacer 4.4|1577899|38|NC_004557|CRISPRCasFinder matches to KM983327 (Clostridium phage phiCT453A, complete genome) position: , mismatch: 1, identity: 0.974
gtacaaaacttacctcaaaaccatctaccagatttaga CRISPR spacer gtacaaaacttacctcaaaaccatttaccagatttaga Protospacer ************************.*************
12. spacer 1.6|1217671|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to KM983329 (Clostridium phage phiCT9441A, complete genome) position: , mismatch: 2, identity: 0.943
ttgttcgtaactgttaaagctatctttctttatgc CRISPR spacer ttgttcgtaactgttaaaattatctttctttatgc Protospacer ******************..***************
13. spacer 2.8|1571254|34|NC_004557|CRISPRCasFinder,CRT matches to KM983327 (Clostridium phage phiCT453A, complete genome) position: , mismatch: 2, identity: 0.941
accctaattgtagaactacaatagttccgtattt CRISPR spacer atcctaattgtagaactgcaatagttccgtattt Protospacer *.***************.****************
14. spacer 2.17|1571256|34|NC_004557|PILER-CR matches to KM983327 (Clostridium phage phiCT453A, complete genome) position: , mismatch: 2, identity: 0.941
accctaattgtagaactacaatagttccgtattt CRISPR spacer atcctaattgtagaactgcaatagttccgtattt Protospacer *.***************.****************
15. spacer 1.14|1218204|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP013848 (Clostridium botulinum strain Af650 plasmid pRSJ14_1, complete sequence) position: , mismatch: 3, identity: 0.914
gctttaactcttaaaaaagataaagttctaaattc CRISPR spacer gatttaactcttaaaaaagatagagttttaaattc Protospacer * ********************.****.*******
16. spacer 2.4|1570993|34|NC_004557|CRISPRCasFinder,CRT matches to NC_010418 (Clostridium botulinum A3 str. Loch Maree plasmid pCLK, complete sequence) position: , mismatch: 3, identity: 0.912
gttttgcagaggttcgagaaaaacttaaatatta CRISPR spacer ggtttgcagaggttagagaaaaactaaaatatta Protospacer * ************ ********** ********
17. spacer 2.13|1570995|34|NC_004557|PILER-CR matches to NC_010418 (Clostridium botulinum A3 str. Loch Maree plasmid pCLK, complete sequence) position: , mismatch: 3, identity: 0.912
gttttgcagaggttcgagaaaaacttaaatatta CRISPR spacer ggtttgcagaggttagagaaaaactaaaatatta Protospacer * ************ ********** ********
18. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to CP002037 (Lactobacillus salivarius CECT 5713 plasmid pHN3, complete sequence) position: , mismatch: 5, identity: 0.844
ttaaagcttctactaattcttttttattcatt CRISPR spacer cgatagcttctactaattcgttcttattcatt Protospacer . * *************** **.*********
19. spacer 2.7|1571188|36|NC_004557|CRISPRCasFinder,CRT matches to NZ_CP013615 (Clostridium perfringens strain JP838 plasmid pJFP838A, complete sequence) position: , mismatch: 6, identity: 0.833
aggttgggactgttggggaaatgaagtaaatcttaa-- CRISPR spacer aggttgggattgttggggaaatgaagt--tttttagat Protospacer *********.***************** *.***.
20. spacer 2.16|1571190|36|NC_004557|PILER-CR matches to NZ_CP013615 (Clostridium perfringens strain JP838 plasmid pJFP838A, complete sequence) position: , mismatch: 6, identity: 0.833
aggttgggactgttggggaaatgaagtaaatcttaa-- CRISPR spacer aggttgggattgttggggaaatgaagt--tttttagat Protospacer *********.***************** *.***.
21. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to NZ_CP045273 (Bacillus megaterium strain FDU301 plasmid pFDU301A, complete sequence) position: , mismatch: 6, identity: 0.812
ttaaagcttctactaattcttttttattcatt CRISPR spacer ctacttcttcaactaattctttttcattcatt Protospacer .** **** *************.*******
22. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to NZ_CP015592 (Bacillus cereus strain AR156 plasmid pAR460, complete sequence) position: , mismatch: 6, identity: 0.812
ttaaagcttctactaattcttttttattcatt CRISPR spacer ctacactttttactaattctgttttattcatt Protospacer .** * .**.********** ***********
23. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to NZ_CP009368 (Bacillus cereus strain FM1 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.812
ttaaagcttctactaattcttttttattcatt CRISPR spacer ctacactttttactaattctgttttattcatt Protospacer .** * .**.********** ***********
24. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to MT774380 (CrAssphage cr1_1, complete genome) position: , mismatch: 6, identity: 0.812
ttaaagcttctactaattcttttttat-tcatt CRISPR spacer caaaagcttctactaattattatttatatcaa- Protospacer . **************** ** ***** ***
25. spacer 4.3|1577835|34|NC_004557|CRISPRCasFinder matches to NZ_CP045273 (Bacillus megaterium strain FDU301 plasmid pFDU301A, complete sequence) position: , mismatch: 6, identity: 0.824
ttaaagcttctactaattcttttttattcattgt CRISPR spacer ctacttcttcaactaattctttttcattcattgt Protospacer .** **** *************.*********
26. spacer 2.7|1571188|36|NC_004557|CRISPRCasFinder,CRT matches to MN693162 (Marine virus AFVG_25M233, complete genome) position: , mismatch: 7, identity: 0.806
aggttgggactgttggggaaatgaagta----aatcttaa CRISPR spacer gggttgggattgttggggaaatgaggtataagaatc---- Protospacer .********.**************.*** ****
27. spacer 2.16|1571190|36|NC_004557|PILER-CR matches to MN693162 (Marine virus AFVG_25M233, complete genome) position: , mismatch: 7, identity: 0.806
aggttgggactgttggggaaatgaagta----aatcttaa CRISPR spacer gggttgggattgttggggaaatgaggtataagaatc---- Protospacer .********.**************.*** ****
28. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to NZ_CP033050 (Virgibacillus halodenitrificans strain Bac324 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.781
ttaaagcttctactaattcttttttattcatt CRISPR spacer gcaaaccttcaactaattcttttttatatagt Protospacer .*** **** **************** .* *
29. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to NZ_CP043831 (Bacillus sp. BS98 plasmid unnamed1) position: , mismatch: 7, identity: 0.781
ttaaagcttctactaattcttttttattcatt CRISPR spacer ttaaatcttctactaattcttgtttctcggtc Protospacer ***** *************** *** *. .*.
30. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to NZ_LR214986 (Mycoplasma cynos strain NCTC10142 plasmid 13) position: , mismatch: 7, identity: 0.781
ttaaagcttctactaattcttttttattcatt CRISPR spacer tagataattctaataattcttttttattaatt Protospacer * .* . ***** *************** ***
31. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to NZ_AP017969 (Fusobacterium varium strain Fv113-g1 plasmid pFV113-g1-1, complete sequence) position: , mismatch: 7, identity: 0.781
ttaaagcttctactaattcttttttattcatt- CRISPR spacer ataaagcttttagtaattc-tttttatgaactt Protospacer ********.** ****** ******* *.*
32. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to NC_007021 (Staphylococcus phage Twort, complete genome) position: , mismatch: 7, identity: 0.781
ttaaagcttctactaattcttttttattcatt CRISPR spacer tacaagcttctactaattcttgcttaataagt Protospacer * ****************** .*** * * *
33. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to MT151386 (Staphylococcus virus Twort, complete genome) position: , mismatch: 7, identity: 0.781
ttaaagcttctactaattcttttttattcatt CRISPR spacer tacaagcttctactaattcttgcttaataagt Protospacer * ****************** .*** * * *
34. spacer 4.3|1577835|34|NC_004557|CRISPRCasFinder matches to CP002037 (Lactobacillus salivarius CECT 5713 plasmid pHN3, complete sequence) position: , mismatch: 7, identity: 0.794
ttaaagcttctactaattcttttttattcattgt CRISPR spacer cgatagcttctactaattcgttcttattcattaa Protospacer . * *************** **.*********.
35. spacer 4.3|1577835|34|NC_004557|CRISPRCasFinder matches to NZ_CP015592 (Bacillus cereus strain AR156 plasmid pAR460, complete sequence) position: , mismatch: 7, identity: 0.794
ttaaagcttctactaattcttttttattcattgt CRISPR spacer ctacactttttactaattctgttttattcattat Protospacer .** * .**.********** ***********.*
36. spacer 4.3|1577835|34|NC_004557|CRISPRCasFinder matches to NZ_CP009368 (Bacillus cereus strain FM1 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.794
ttaaagcttctactaattcttttttattcattgt CRISPR spacer ctacactttttactaattctgttttattcattat Protospacer .** * .**.********** ***********.*
37. spacer 4.3|1577835|34|NC_004557|CRISPRCasFinder matches to MT774380 (CrAssphage cr1_1, complete genome) position: , mismatch: 7, identity: 0.794
ttaaagcttctactaattcttttttat-tcattgt CRISPR spacer caaaagcttctactaattattatttatatcaagg- Protospacer . **************** ** ***** *** *
38. spacer 7.2|1592104|34|NC_004557|CRISPRCasFinder,CRT matches to NZ_CP024873 (Leptospira mayottensis 200901116 plasmid p1_L200901116, complete sequence) position: , mismatch: 7, identity: 0.794
atatagagaaatcacttaaaataatagaatttgc CRISPR spacer atttctaaatatcacttaaaataactgaatttgc Protospacer ** * *.* **************. ********
39. spacer 7.8|1592105|34|NC_004557|PILER-CR matches to NZ_CP024873 (Leptospira mayottensis 200901116 plasmid p1_L200901116, complete sequence) position: , mismatch: 7, identity: 0.794
atatagagaaatcacttaaaataatagaatttgc CRISPR spacer atttctaaatatcacttaaaataactgaatttgc Protospacer ** * *.* **************. ********
40. spacer 1.25|1218935|37|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP014152 (Clostridium botulinum strain BrDura plasmid pRSJ20_1, complete sequence) position: , mismatch: 8, identity: 0.784
gtgttacatctcccaatttctcctcataatactttaa CRISPR spacer catttgaatttcctaatttctcctcataatactttag Protospacer **. **.***.**********************.
41. spacer 1.25|1218935|37|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP013710 (Clostridium botulinum strain F634 plasmid pRSJ2_3, complete sequence) position: , mismatch: 8, identity: 0.784
gtgttacatctcccaatttctcctcataatactttaa CRISPR spacer catttgaatttcctaatttctcctcataatactttag Protospacer **. **.***.**********************.
42. spacer 2.3|1570929|34|NC_004557|CRISPRCasFinder,CRT matches to NC_028838 (Clostridium phage phiCD506, complete genome) position: , mismatch: 8, identity: 0.765
gcttaggctaggagctacctctttttttattttt CRISPR spacer taatttgctagttgctacctctttttttatttta Protospacer * ***** ********************
43. spacer 2.6|1571123|35|NC_004557|CRISPRCasFinder,CRT matches to NZ_CP039845 (Acetobacter pasteurianus strain CICC 22518 plasmid pAP22518-1, complete sequence) position: , mismatch: 8, identity: 0.771
tttcttgcaaccatagcacatagttgcagcataac- CRISPR spacer ctatttgcaaccatagaacatggttgcag-gtatcg Protospacer .* .************ ****.******* .** *
44. spacer 2.7|1571188|36|NC_004557|CRISPRCasFinder,CRT matches to MN693344 (Marine virus AFVG_25M232, complete genome) position: , mismatch: 8, identity: 0.778
aggttgggactgttggggaaatgaagtaaatcttaa----- CRISPR spacer gggttgggattgttggggaaatgaggta-----taagaata Protospacer .********.**************.*** ***
45. spacer 2.7|1571188|36|NC_004557|CRISPRCasFinder,CRT matches to AP013454 (Uncultured Mediterranean phage uvMED DNA, complete genome, group G17, isolate: uvMED-CGR-U-MedDCM-OCT-S33-C36) position: , mismatch: 8, identity: 0.778
aggttgggactgttggggaaatgaagtaaatcttaa----- CRISPR spacer aggttgggattgttggggtaatgaagt-----ttgattgtt Protospacer *********.******** ******** **.*
46. spacer 2.12|1570931|34|NC_004557|PILER-CR matches to NC_028838 (Clostridium phage phiCD506, complete genome) position: , mismatch: 8, identity: 0.765
gcttaggctaggagctacctctttttttattttt CRISPR spacer taatttgctagttgctacctctttttttatttta Protospacer * ***** ********************
47. spacer 2.15|1571125|35|NC_004557|PILER-CR matches to NZ_CP039845 (Acetobacter pasteurianus strain CICC 22518 plasmid pAP22518-1, complete sequence) position: , mismatch: 8, identity: 0.771
tttcttgcaaccatagcacatagttgcagcataac- CRISPR spacer ctatttgcaaccatagaacatggttgcag-gtatcg Protospacer .* .************ ****.******* .** *
48. spacer 2.16|1571190|36|NC_004557|PILER-CR matches to MN693344 (Marine virus AFVG_25M232, complete genome) position: , mismatch: 8, identity: 0.778
aggttgggactgttggggaaatgaagtaaatcttaa----- CRISPR spacer gggttgggattgttggggaaatgaggta-----taagaata Protospacer .********.**************.*** ***
49. spacer 2.16|1571190|36|NC_004557|PILER-CR matches to AP013454 (Uncultured Mediterranean phage uvMED DNA, complete genome, group G17, isolate: uvMED-CGR-U-MedDCM-OCT-S33-C36) position: , mismatch: 8, identity: 0.778
aggttgggactgttggggaaatgaagtaaatcttaa----- CRISPR spacer aggttgggattgttggggtaatgaagt-----ttgattgtt Protospacer *********.******** ******** **.*
50. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to MF186604 (Methanosarcina spherical virus, complete genome) position: , mismatch: 8, identity: 0.75
ttaaagcttctactaattcttttttattcatt CRISPR spacer gtgtcctttctactaattcttttttagtcatg Protospacer *. .******************* ****
51. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to AP014322 (Uncultured Mediterranean phage uvMED isolate uvMED-GF-U-MedDCM-OCT-S33-C37, *** SEQUENCING IN PROGRESS ***) position: , mismatch: 8, identity: 0.75
ttaaagcttctactaattcttttttattcatt CRISPR spacer gtcttacatctactaataattttttattcatt Protospacer * .* ********* *************
52. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to NC_019749 (Stanieria cyanosphaera PCC 7437 plasmid pSTA7437.02, complete sequence) position: , mismatch: 8, identity: 0.75
ttaaagcttctactaattcttttttattcatt CRISPR spacer tttttccttctactaactctttattattcggt Protospacer ** **********.***** ******. *
53. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to MK614706 (Gammaproteobacteria virus GOV_bin_2604, complete genome) position: , mismatch: 8, identity: 0.75
---ttaaagcttctactaattcttttttattcatt CRISPR spacer tgttccaa---tctactatttcttttttagtcata Protospacer *. ** ******* ********** ****
54. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to MT325768 (Psychrobacillus phage Perkons, complete genome) position: , mismatch: 8, identity: 0.75
ttaaagcttctactaattcttttttattcatt CRISPR spacer gaagtgtttctagtaattcttttatattcaat Protospacer *. *.***** ********** ****** *
55. spacer 4.3|1577835|34|NC_004557|CRISPRCasFinder matches to NZ_CP043831 (Bacillus sp. BS98 plasmid unnamed1) position: , mismatch: 8, identity: 0.765
ttaaagcttctactaattcttttttattcattgt CRISPR spacer ttaaatcttctactaattcttgtttctcggtctt Protospacer ***** *************** *** *. .*. *
56. spacer 5.1|1580118|36|NC_004557|CRISPRCasFinder,CRT matches to MN693129 (Marine virus AFVG_25M62, complete genome) position: , mismatch: 8, identity: 0.778
ggatgcactttctttaaatataaataaaaaatctaa CRISPR spacer ttttacattttctttaaacataaataaaaaatcttt Protospacer *.**.**********.***************
57. spacer 5.5|1580119|36|NC_004557|PILER-CR matches to MN693129 (Marine virus AFVG_25M62, complete genome) position: , mismatch: 8, identity: 0.778
ggatgcactttctttaaatataaataaaaaatctaa CRISPR spacer ttttacattttctttaaacataaataaaaaatcttt Protospacer *.**.**********.***************
58. spacer 6.4|1589907|34|NC_004557|CRISPRCasFinder matches to NZ_CP017256 (Clostridium taeniosporum strain 1/k plasmid pCt3, complete sequence) position: , mismatch: 8, identity: 0.765
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer cattttctgtgatttcttcatcttcaaaaaatac Protospacer * **.***** *************** * **
59. spacer 6.4|1589907|34|NC_004557|CRISPRCasFinder matches to NC_011732 (Gloeothece citriformis PCC 7424 plasmid pP742404, complete sequence) position: , mismatch: 8, identity: 0.765
ctttctctgttatttcttcatcttcatattttaa------ CRISPR spacer ctttatctgttctttcttcatc------ttttaaaactcc Protospacer **** ****** ********** ******
60. spacer 6.5|1589971|34|NC_004557|CRISPRCasFinder matches to NZ_CP017256 (Clostridium taeniosporum strain 1/k plasmid pCt3, complete sequence) position: , mismatch: 8, identity: 0.765
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer cattttctgtgatttcttcatcttcaaaaaatac Protospacer * **.***** *************** * **
61. spacer 6.5|1589971|34|NC_004557|CRISPRCasFinder matches to NC_011732 (Gloeothece citriformis PCC 7424 plasmid pP742404, complete sequence) position: , mismatch: 8, identity: 0.765
ctttctctgttatttcttcatcttcatattttaa------ CRISPR spacer ctttatctgttctttcttcatc------ttttaaaactcc Protospacer **** ****** ********** ******
62. spacer 7.1|1592039|35|NC_004557|CRISPRCasFinder,CRT matches to NZ_CP026601 (Clostridiaceae bacterium 14S0207 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.771
gtgctgcacttctagaacttaaattacattccttt CRISPR spacer gtgctgcacttcttgaacttaatttatttttaatc Protospacer ************* ******** ***. **. *.
63. spacer 7.7|1592040|35|NC_004557|PILER-CR matches to NZ_CP026601 (Clostridiaceae bacterium 14S0207 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.771
gtgctgcacttctagaacttaaattacattccttt CRISPR spacer gtgctgcacttcttgaacttaatttatttttaatc Protospacer ************* ******** ***. **. *.
64. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to MH617682 (Microviridae sp. isolate ctcb14, complete genome) position: , mismatch: 9, identity: 0.719
ttaaagcttctactaattcttttttattcatt CRISPR spacer ataatgcttttactaattctttttcggacgct Protospacer *** ****.**************.. *..*
65. spacer 4.3|1577835|34|NC_004557|CRISPRCasFinder matches to NZ_CP033050 (Virgibacillus halodenitrificans strain Bac324 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.735
ttaaagcttctactaattcttttttattcattgt CRISPR spacer gcaaaccttcaactaattcttttttatatagttc Protospacer .*** **** **************** .* * .
66. spacer 4.3|1577835|34|NC_004557|CRISPRCasFinder matches to NZ_LR214986 (Mycoplasma cynos strain NCTC10142 plasmid 13) position: , mismatch: 9, identity: 0.735
ttaaagcttctactaattcttttttattcattgt CRISPR spacer tagataattctaataattcttttttattaatttc Protospacer * .* . ***** *************** *** .
67. spacer 4.3|1577835|34|NC_004557|CRISPRCasFinder matches to AP014322 (Uncultured Mediterranean phage uvMED isolate uvMED-GF-U-MedDCM-OCT-S33-C37, *** SEQUENCING IN PROGRESS ***) position: , mismatch: 9, identity: 0.735
ttaaagcttctactaattcttttttattcattgt CRISPR spacer gtcttacatctactaataattttttattcatttt Protospacer * .* ********* ************* *
68. spacer 5.3|1580250|34|NC_004557|CRISPRCasFinder,CRT matches to NZ_CP015331 (Borrelia hermsii HS1 isolate Browne Mountain plasmid lpN31, complete sequence) position: , mismatch: 9, identity: 0.735
aagagttgcacttttttatataatctcttttagg CRISPR spacer aaaatactgtcttttttatgtcatctcttttagg Protospacer **.* . *********.* ************
69. spacer 5.3|1580250|34|NC_004557|CRISPRCasFinder,CRT matches to NZ_CP039041 (Piscirickettsia salmonis strain Psal-072 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.735
aagagttgcacttttttatataatctcttttagg CRISPR spacer aaagatgtcacttttttaaataatgtcttttact Protospacer **...* ********** ***** *******
70. spacer 5.3|1580250|34|NC_004557|CRISPRCasFinder,CRT matches to NZ_CP039048 (Piscirickettsia salmonis strain Psal-073 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.735
aagagttgcacttttttatataatctcttttagg CRISPR spacer aaagatgtcacttttttaaataatgtcttttact Protospacer **...* ********** ***** *******
71. spacer 5.7|1580251|34|NC_004557|PILER-CR matches to NZ_CP015331 (Borrelia hermsii HS1 isolate Browne Mountain plasmid lpN31, complete sequence) position: , mismatch: 9, identity: 0.735
aagagttgcacttttttatataatctcttttagg CRISPR spacer aaaatactgtcttttttatgtcatctcttttagg Protospacer **.* . *********.* ************
72. spacer 5.7|1580251|34|NC_004557|PILER-CR matches to NZ_CP039041 (Piscirickettsia salmonis strain Psal-072 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.735
aagagttgcacttttttatataatctcttttagg CRISPR spacer aaagatgtcacttttttaaataatgtcttttact Protospacer **...* ********** ***** *******
73. spacer 5.7|1580251|34|NC_004557|PILER-CR matches to NZ_CP039048 (Piscirickettsia salmonis strain Psal-073 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.735
aagagttgcacttttttatataatctcttttagg CRISPR spacer aaagatgtcacttttttaaataatgtcttttact Protospacer **...* ********** ***** *******
74. spacer 6.4|1589907|34|NC_004557|CRISPRCasFinder matches to NZ_KT897276 (Clostridium botulinum strain INGR16-02E1 plasmid pINGR16-02E1, complete sequence) position: , mismatch: 9, identity: 0.735
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atgaggctgttatttctttaccttcatatttaat Protospacer * ************.*.********** *
75. spacer 6.4|1589907|34|NC_004557|CRISPRCasFinder matches to NZ_KT897280 (Clostridium botulinum strain FI1111E1 plasmid pFI1111E1, complete sequence) position: , mismatch: 9, identity: 0.735
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atgaggctgttatttctttaccttcatatttaat Protospacer * ************.*.********** *
76. spacer 6.4|1589907|34|NC_004557|CRISPRCasFinder matches to NZ_KT897275 (Clostridium botulinum strain IFR 12/29 plasmid p12/29, complete sequence) position: , mismatch: 9, identity: 0.735
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atgatgctgttatttctttaccttcatatttaat Protospacer * . ************.*.********** *
77. spacer 6.4|1589907|34|NC_004557|CRISPRCasFinder matches to NZ_KT897277 (Clostridium botulinum strain ST0210E1 plasmid pST0210E1, complete sequence) position: , mismatch: 9, identity: 0.735
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atgaggctgttatttctttaccttcatatttaat Protospacer * ************.*.********** *
78. spacer 6.4|1589907|34|NC_004557|CRISPRCasFinder matches to NZ_KT897278 (Clostridium botulinum strain FWSKR40E1 plasmid pFWSKR40E1, complete sequence) position: , mismatch: 9, identity: 0.735
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atgaggctgttatttctttaccttcatatttaat Protospacer * ************.*.********** *
79. spacer 6.4|1589907|34|NC_004557|CRISPRCasFinder matches to NZ_KT897279 (Clostridium botulinum strain SWKR38E2 plasmid pSWKR38E2, complete sequence) position: , mismatch: 9, identity: 0.735
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atgaggctgttatttctttaccttcatatttaat Protospacer * ************.*.********** *
80. spacer 6.5|1589971|34|NC_004557|CRISPRCasFinder matches to NZ_KT897276 (Clostridium botulinum strain INGR16-02E1 plasmid pINGR16-02E1, complete sequence) position: , mismatch: 9, identity: 0.735
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atgaggctgttatttctttaccttcatatttaat Protospacer * ************.*.********** *
81. spacer 6.5|1589971|34|NC_004557|CRISPRCasFinder matches to NZ_KT897280 (Clostridium botulinum strain FI1111E1 plasmid pFI1111E1, complete sequence) position: , mismatch: 9, identity: 0.735
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atgaggctgttatttctttaccttcatatttaat Protospacer * ************.*.********** *
82. spacer 6.5|1589971|34|NC_004557|CRISPRCasFinder matches to NZ_KT897275 (Clostridium botulinum strain IFR 12/29 plasmid p12/29, complete sequence) position: , mismatch: 9, identity: 0.735
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atgatgctgttatttctttaccttcatatttaat Protospacer * . ************.*.********** *
83. spacer 6.5|1589971|34|NC_004557|CRISPRCasFinder matches to NZ_KT897277 (Clostridium botulinum strain ST0210E1 plasmid pST0210E1, complete sequence) position: , mismatch: 9, identity: 0.735
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atgaggctgttatttctttaccttcatatttaat Protospacer * ************.*.********** *
84. spacer 6.5|1589971|34|NC_004557|CRISPRCasFinder matches to NZ_KT897278 (Clostridium botulinum strain FWSKR40E1 plasmid pFWSKR40E1, complete sequence) position: , mismatch: 9, identity: 0.735
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atgaggctgttatttctttaccttcatatttaat Protospacer * ************.*.********** *
85. spacer 6.5|1589971|34|NC_004557|CRISPRCasFinder matches to NZ_KT897279 (Clostridium botulinum strain SWKR38E2 plasmid pSWKR38E2, complete sequence) position: , mismatch: 9, identity: 0.735
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atgaggctgttatttctttaccttcatatttaat Protospacer * ************.*.********** *
86. spacer 1.33|1219466|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to MT457553 (Shewanella phage Thanatos-2, complete genome) position: , mismatch: 10, identity: 0.714
atatgtgatgatgaattagagaaagtgcttgaaag CRISPR spacer tctagtgatgatgaattagataaattgctaaatgg Protospacer . **************** *** **** .* .*
87. spacer 1.33|1219466|35|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to MT457552 (Shewanella phage Thanatos-1, complete genome) position: , mismatch: 10, identity: 0.714
atatgtgatgatgaattagagaaagtgcttgaaag CRISPR spacer tctggtgatgatgaattagataaattgctaaatgg Protospacer . **************** *** **** .* .*
88. spacer 2.7|1571188|36|NC_004557|CRISPRCasFinder,CRT matches to MN693242 (Marine virus AFVG_25M170, complete genome) position: , mismatch: 10, identity: 0.722
aggttgggactgttggggaaatgaagtaaatcttaa CRISPR spacer tggttgggattgttggggtaatgaagtatgacaagt Protospacer ********.******** ********* . * .
89. spacer 2.16|1571190|36|NC_004557|PILER-CR matches to MN693242 (Marine virus AFVG_25M170, complete genome) position: , mismatch: 10, identity: 0.722
aggttgggactgttggggaaatgaagtaaatcttaa CRISPR spacer tggttgggattgttggggtaatgaagtatgacaagt Protospacer ********.******** ********* . * .
90. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to MF001361 (Enterococcus phage EF5, partial genome) position: , mismatch: 10, identity: 0.688
ttaaagcttctactaattcttttttattcatt CRISPR spacer cagttttttctactagtccttttttattcaat Protospacer . . .********.*.************ *
91. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to MN693045 (Marine virus AFVG_25M135, complete genome) position: , mismatch: 10, identity: 0.688
ttaaagcttctactaattcttttttattcatt CRISPR spacer ctaaagcctttactaattctttttctaagtct Protospacer .******.*.**************. .*
92. spacer 4.1|1577836|32|NC_004557|PILER-CR matches to MF001358 (Enterococcus phage EF1, partial genome) position: , mismatch: 10, identity: 0.688
ttaaagcttctactaattcttttttattcatt CRISPR spacer cagttttttctactagtccttttttattcaat Protospacer . . .********.*.************ *
93. spacer 4.3|1577835|34|NC_004557|CRISPRCasFinder matches to MF186604 (Methanosarcina spherical virus, complete genome) position: , mismatch: 10, identity: 0.706
ttaaagcttctactaattcttttttattcattgt CRISPR spacer gtgtcctttctactaattcttttttagtcatgca Protospacer *. .******************* ****
94. spacer 5.1|1580118|36|NC_004557|CRISPRCasFinder,CRT matches to NZ_CP009967 (Bacillus cereus E33L plasmid pBCO_1, complete sequence) position: , mismatch: 10, identity: 0.722
ggatgcactttctttaaatataaataaaaaatctaa CRISPR spacer caaaaaattttcattaaatataaaaaaaaaatcttg Protospacer .* . *.**** *********** ********* .
95. spacer 5.1|1580118|36|NC_004557|CRISPRCasFinder,CRT matches to NZ_CP053657 (Bacillus cereus strain CTMA_1571 plasmid p.1, complete sequence) position: , mismatch: 10, identity: 0.722
ggatgcactttctttaaatataaataaaaaatctaa CRISPR spacer caaaaaattttcattaaatataaaaaaaaaatcttg Protospacer .* . *.**** *********** ********* .
96. spacer 5.1|1580118|36|NC_004557|CRISPRCasFinder,CRT matches to NC_007103 (Bacillus cereus E33L plasmid pE33L466, complete sequence) position: , mismatch: 10, identity: 0.722
ggatgcactttctttaaatataaataaaaaatctaa CRISPR spacer caaaaaattttcattaaatataaaaaaaaaatcttg Protospacer .* . *.**** *********** ********* .
97. spacer 5.1|1580118|36|NC_004557|CRISPRCasFinder,CRT matches to CP024685 (Bacillus wiedmannii bv. thuringiensis strain FCC41 plasmid pFCC41-1-490K, complete sequence) position: , mismatch: 10, identity: 0.722
ggatgcactttctttaaatataaataaaaaatctaa CRISPR spacer caaaaaattttcattaaatataaaaaaaaaatcttg Protospacer .* . *.**** *********** ********* .
98. spacer 5.4|1580314|34|NC_004557|CRISPRCasFinder,CRT matches to NZ_AP018284 (Chondrocystis sp. NIES-4102 plasmid plasmid3 DNA, complete genome) position: , mismatch: 10, identity: 0.706
ttggagatttaaaggaagcttataaatatttcta CRISPR spacer tcagtattttaaaataagcttataaatatttaat Protospacer *..* . ******. ****************
99. spacer 5.5|1580119|36|NC_004557|PILER-CR matches to NZ_CP009967 (Bacillus cereus E33L plasmid pBCO_1, complete sequence) position: , mismatch: 10, identity: 0.722
ggatgcactttctttaaatataaataaaaaatctaa CRISPR spacer caaaaaattttcattaaatataaaaaaaaaatcttg Protospacer .* . *.**** *********** ********* .
100. spacer 5.5|1580119|36|NC_004557|PILER-CR matches to NZ_CP053657 (Bacillus cereus strain CTMA_1571 plasmid p.1, complete sequence) position: , mismatch: 10, identity: 0.722
ggatgcactttctttaaatataaataaaaaatctaa CRISPR spacer caaaaaattttcattaaatataaaaaaaaaatcttg Protospacer .* . *.**** *********** ********* .
101. spacer 5.5|1580119|36|NC_004557|PILER-CR matches to NC_007103 (Bacillus cereus E33L plasmid pE33L466, complete sequence) position: , mismatch: 10, identity: 0.722
ggatgcactttctttaaatataaataaaaaatctaa CRISPR spacer caaaaaattttcattaaatataaaaaaaaaatcttg Protospacer .* . *.**** *********** ********* .
102. spacer 5.5|1580119|36|NC_004557|PILER-CR matches to CP024685 (Bacillus wiedmannii bv. thuringiensis strain FCC41 plasmid pFCC41-1-490K, complete sequence) position: , mismatch: 10, identity: 0.722
ggatgcactttctttaaatataaataaaaaatctaa CRISPR spacer caaaaaattttcattaaatataaaaaaaaaatcttg Protospacer .* . *.**** *********** ********* .
103. spacer 5.8|1580315|34|NC_004557|PILER-CR matches to NZ_AP018284 (Chondrocystis sp. NIES-4102 plasmid plasmid3 DNA, complete genome) position: , mismatch: 10, identity: 0.706
ttggagatttaaaggaagcttataaatatttcta CRISPR spacer tcagtattttaaaataagcttataaatatttaat Protospacer *..* . ******. ****************
104. spacer 6.4|1589907|34|NC_004557|CRISPRCasFinder matches to NZ_CP022140 (Salmonella enterica subsp. salamae serovar 55:k:z39 str. 1315K plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.706
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer cgaaatctgttattttttcatctttatatacgag Protospacer * **********.********.**** . *.
105. spacer 6.4|1589907|34|NC_004557|CRISPRCasFinder matches to MH617769 (Siphoviridae sp. isolate ctjc_2, complete genome) position: , mismatch: 10, identity: 0.706
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atactgatattattttttcattttcatattttat Protospacer * .. *.******.*****.***********
106. spacer 6.4|1589907|34|NC_004557|CRISPRCasFinder matches to LR588166 (Pseudomonas phage vB_PaeM_MIJ3) position: , mismatch: 10, identity: 0.706
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer aaatatcttttatttctttatcttcatatccgta Protospacer * *** *********.**********.. *
107. spacer 6.5|1589971|34|NC_004557|CRISPRCasFinder matches to NZ_CP022140 (Salmonella enterica subsp. salamae serovar 55:k:z39 str. 1315K plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.706
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer cgaaatctgttattttttcatctttatatacgag Protospacer * **********.********.**** . *.
108. spacer 6.5|1589971|34|NC_004557|CRISPRCasFinder matches to MH617769 (Siphoviridae sp. isolate ctjc_2, complete genome) position: , mismatch: 10, identity: 0.706
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer atactgatattattttttcattttcatattttat Protospacer * .. *.******.*****.***********
109. spacer 6.5|1589971|34|NC_004557|CRISPRCasFinder matches to LR588166 (Pseudomonas phage vB_PaeM_MIJ3) position: , mismatch: 10, identity: 0.706
ctttctctgttatttcttcatcttcatattttaa CRISPR spacer aaatatcttttatttctttatcttcatatccgta Protospacer * *** *********.**********.. *
110. spacer 8.1|1595980|35|NC_004557|CRISPRCasFinder matches to MN694169 (Marine virus AFVG_250M458, complete genome) position: , mismatch: 10, identity: 0.714
gtacctgtgccaagactattaaatttttttgctaa CRISPR spacer aattttgtgcctagactatgaaatttttttccaat Protospacer . ..****** ******* ********** * *
111. spacer 1.19|1218532|36|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP010123 (Escherichia coli strain C5 plasmid A, complete genome) position: , mismatch: 11, identity: 0.694
tttaaatctggtttattttttacattcttccaatcc CRISPR spacer tttaaagctggtttatttttaacattaaggagttta Protospacer ****** ************* ***** . *.
112. spacer 1.28|1219135|38|NC_004557|PILER-CR,CRISPRCasFinder,CRT matches to NC_011737 (Gloeothece citriformis PCC 7424 plasmid pP742402, complete sequence) position: , mismatch: 11, identity: 0.711
tatctaactcaatattttcttcttttacatcctgttta CRISPR spacer tttctaactgaatatcttcttcttttacaactccccat Protospacer * ******* *****.************* *.. ..
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
688595 : 695591
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_004557|688595:695591|DBSCAN-SWA TTTGGATGAAATTTATATGAAAAGAGCAATTGAATTAGCTCATTTGGGTGAGGGATATGTAAATCCTAATCCTTTAGTAGGAGCAGTTATAGTAAAGGATGGAAGGATAATAGGAGAAGGATATCATAAAAAATTTGGAGAAGCTCATGCTGAAATAGAAGCCTTCAAAAGCTGTAAAGAGGATCCAAAAGGTGGTACTTTATATGTTAATTTAGAACCTTGTTCCCACTATGGAAAGACTCCTCCTTGTGTAGATGTCATAATTAAAAAGGGTATAAAAAAAGTAATAATAGGTATGAAAGATCCTAATCCTTTAGTAGCAGGAAGAGGTTTAGAAATATTAAAAAAAGCTAACATAGAAGTAAGGGTTGGAACTTTAGAAGATGAATGTAGAAATTTAAATGAAATATTTATAAAATATATAACTTATAAAAAACCATTTTGTATATTAAAATGGGCATCAACATTAGATGGAAAGATATGTTCGTCCATAGGAGATTCTAGATGGATAACAGGAGAAGATTCAAGAGAATATGTGCATTTAATTAGAAATAAAGTAAGCTCTATAATGGTTGGAGTAAATACAATTCTAAGAGATAATCCTTCTTTAACCACAAGATTAAAAGATAGAAAAGGTGTAAATCCTACAAGAATTATTGTAGATAGCAAAGGAAGAACACCCTTAGATGCAAGGATTTTTAAAGAAGAAGGGGACACTTTTATAGCAACTACTTCTCAAATTGAAGATAAGAAAATAAAAGAATTTGAAAAAGTAGGGGCTAAAATAATTATCATACCAGAAAAAGGTAGAAAAGTGGATTTGCAATATCTTGTTAACTATTTAACAAAACTTAATATAGATAGCATACTTTTAGAAGGCGGTGGAACATTAAATTACTCAGCCTTAAAAGAAGGCATTGTAGATAAAGTTCTCATGTTCCTAGCTCCTAAAATATTAGGTGGAGAAAATAGCAAAACACCTATAGAAGGGGAGGGAATAAAGTATATTAAAGATTGCATAGAACTAAAGGATTTATCTATTAAAAATTTTAAAGAAGATATTTTGATAGAAGGCTTTATTTAGATAATAACTAATAAAAGGTGGTGTTTTATTGTTTACGGGTATAGTAGAGGAATTAGGAAGTATAGAGAATATAGAAAAAAAGGGAAACGCCTATAGCATAAAAATAAAGGCTAAAAAGGTTTTGGAGGATGTAAATCTAGGAGATAGTATATGCACTAATGGAGTTTGTCTTACAGTTACTAACTTTTCAAAGGATAGTTTTACTGTGGATGTAATGCCAGAAACTATAAGACAAAGTAATCTAAAAAATATTAAAAAAGGCAGTTTAGTAAATTTAGAAAGGGCTCTAAAGGCTACAGGAAGATTAGGTGGACATATAGTTAGCGGACACATAGATGGAGAAGGGATTATAAAGGAATATAAAAAAGAAGGAAATGCTTGGTGGATTTCAGTAGAACCTGAAAAAGGACTTCTAAAGTATGTAATAGAAAGAGGTTCTATAGCCTTAGATGGTGTAAGCCTAACCGTAGCTTATGTAGATGAAAAGCTATTTAAAGTTTCTGTAATCCCTAATACCAGTGAAAAAACTACCCTATTAAAAAAAGGAGTAGGGGACATTTTAAACATAGAATGCGATTTAATAGGAAAATATGTAGAGAAAATCTTAAATTTCAAAAATCATAAAGAAGAGCAAAGTAAAATAGACATGGACTTTTTGAAAAACAACGGATTTGTTTAGCTAATAACTAATAGTGAATAGTGAATAACTAAGGAGGATTTTCTTCCGCTTTGCTACAGAAAATCTTTAATTTAAAGGTAGTTATACTATGGAAAACTCCTTCAGCTTTGCTGAACATCGAAGATATTTTAGGTTTAGTTCATTTATAAAGGTTTAGCATTAGCTAACCGCAATTATTCATTCTTCATTCTTCATTAAATAAAGAGCTCGTTTGCTTCGCTAAACATCGTTGTTTTAGTTAAGAAGCACTTAATATATAAATCATACAATTAAAATTTTTTTGTAATGATAATGGAGAAAAAATAACCACAATTGTTCATTCTTCATTGTTCATTATTCATTAAATAAAGGGGTGGTAAAAATGTTTAAGTTTAATAGTATTGAAGAGGCTATTGCTGATATAAAAGAAGGAAAAATGGTTATTGTGGTAGATGATGAGGATAGGGAAAATGAGGGGGATCTTTTAATGGCAGCAGAAAAAGTAACTCCTGAAAATATTAATTTTATGATTAAATATGGAAGAGGATTAGTGTGCATGCCTATAATTGGAGAAAGATTAAAGGAATTAAACCTTAATCAAATGGTAGATATAAATACAGATACAAATGGAACAGCTTTTACAGTCTCCATAGACTTCATAGATACTACTACAGGTATTTCAGCTTATGAAAGGGCTCATACCATATCAAAAGTATTAGATAGCAGTGTAAAGGGAGAGGACTTTAAAAGACCAGGACATGTTTTTCCATTAGAAGCAAAAGAAGGTGGGGTATTAAAAAGAGCAGGACATACTGAAGCTTCTGTAGATCTAGCAAGGCTTGCAGGATTTTATCCAGCAGGAGTAATTTGTGAGATTGTAGGGGAAGATGGAAAGATGGCAAGACTCCCTCAATTAATGGAGTATTCTAAGGAACATAATTTAAAAATTATAAATATAGCAGATTTAATAGCTTATAGAAGAAAGAAAGAAACTCTAGTAAAAAGAGTTGTAGAAGCTAAAATGCCAACAAGATGGGGAGAATTTAAAATAATAGGCTATGAAAACAAAATAAATGGTGAGCACCATGTAGCCCTTGTAATGGGAGATATAGAAAATGGAGAAGATGTATTAGTTAGAATGCATTCAGAATGTCTTACAGGAGATGCTCTAGGCTCTGTAAGATGTGATTGTGGGTATCAATATGAAGCAGCTATGAAAGCTATAAGTGAAGAGAGAAGAGGAGTACTAGTATACATGCGCCAAGAAGGAAGAGGTATAGGGCTTATTAACAAATTAAAGGCTTATAATCTTCAAGACAAGGGCATGGATACTGTGGAAGCTAATATTGCCTTAGGTTTTCCACCTGATTTAAGAGATTATGGAATAGGAGCTCAAATATTAAATGATTTAGGAATTAAAAAGATAAATCTTATGACAAACAATCCTAAAAAAATAACTTCATTATCTGGTTATGGAATAAAAATAGTAAAAAGAGTTCCACTAGAAATTCATGAAAATGAGGAAAGTGAATTTTACCTAAAAACTAAAAAAGAAAAAATGGGTCATTTATTACATTTTTAATATATAAAATTGGGAGGAATATAAAATGAATATAATAGAAGGAAAATTAATAGGACAGAGTTTGAAGTTTGGTATAACTATAGGAAGATTTAATGAATTTATAGGAGGTAAACTTTTAGATGGAGCTGTGGATGCACTTATAAGACATGGTGTAGATGAAAAAGACATAGAAATAGCTTGGGTGCCAGGAGCTTTTGAAATACCATTAATAGCTAAAAAGATGGCAAAAAGTAAAAAATATGATGGGATAATATGCTTAGGTGCTGTTATAAGAGGAGCCACTACTCATTATGACTATGTGGCAAGTGAAGTATCTAAGGGTATAGCTAAAATTACTTTAGATGAAGAAGTGCCAGTAATATTTGGAGTTTTAACTACAGAAAATATAGAACAAGCCATAGAAAGGGCAGGAACAAAAGCTGGTAATAAAGGATATGAGGCAGCTTGTACTGCAATAGAAATGGCAAATATTATAAATATAATATAAATCAAAAAATTATAAAAATATAAAATTTGAACAAACCTATTCTTTTTTTATACAATGCATAACAAATCTTAAAATCAGGTTGACAAGCAAATTGCTAATATATTAACATATGTATATATTAACATATATATTATGAAGATTTTGAGGAGGTTAGCTATGTAAAGATAGAATAGGTTTATTATTTATGTTAATTTGAACTATATTATTAATAAACTTACTATAAAAAATAATTGAATGATATATAATAAAGTGAATTTTTAGAATTATTATTCTTAAGCATAATTATATTAAATAAGTAGTTAGGAGGTAATTAAAAAAATGAAAAATAGTTTAAAAGGAATAATAACATTATGTTTAATAGCAGCCATTTGTGGAGGAATCTTAGGCATTACATATGATGCAACTAAAGATACCATAGCAGCCATTGAGAAGAAAGAAAGCTTGCAATTAGATGTAATATTGCCAGGATTAAATGCAGATGAACCAAAAGAAATGGATGTTAAAATAGAAGAAGAAGGACCAATAAGTTCAGCATATGAAGTTTATGCAGCAGGTGAGTTAGTTGGACATGCTATTATAGCAAATTCCAAGGGTATGGGACCTTTAAAAATGACTGTTGGAATAACTAAAGATGGAAAAATAGGTGGTCTTAAAATAGTTTCCCATGCAGAAACACCAGGTATTGGAGATATAGTAGAAAAAGAAAGTTTTATGGGAAGATATAAAGAAAAATCAGTAAAAGAGGAATTAAAAACAGTAAAAACGTCCCCATCAGCGGATAATGAAGTAGAAGGAATAACAGGAGCTACAATTACATCTACAGGTGTAACTAAAGCTGTAAACGAAGCTATAAAATTCTATAAGGAAAATGTATTAGGAGAAGAAGTTAAGGAAGAAAAAGAAAAGCCATTAGAAGCTAAGGATATAATACCTGAAGCGGATAGTATGAAAGATGTAGCAGTAGAATTAACTGAAAATGTTAAAGAAGTTAAAGGTATATACAAAGGGGAACAACTTTTAGGATATGCTATAACTGGTTTAGGTACAGGAATGGAAGAAATTCAAACTATGGTAGGTATATCTAATGAAGGCAAAGTTGTATTTGTAAAAGTAGTGGCAGATAGTGAAACAGAAGGAATTGGAGACGTAATACACGAAAAAGATTTTATAAATAAATTTTTAAATAAATCTGTGGATAAAAAACTAGAAGTAGTTAAAAATCCACCATCAAAAGATTATGAGATAGAGGCTGTGTCAGGAGCTACTATAAGTACAGAAGGAGTTACTGGTGGAGTAAATAATGCTATTAAGTTCTATAAGGAAAAACTTAAAAAATAAATGTTTTTACGATTATTTTTATTATAAGAGTTAATTTAACTTTCAATGCTACGATATTATAATAGCATCAAAAGTTAAAGGAGAGAATACATTTTGTTTAAATTGAAGAAAGCAAAAATAGAAAAAGACATTTTAGAACACCTTAGTAGAAATGGATATTCAGTTACAGAGGAAAATAGGAGTTTATATGTAATAAAATCTAAATTCTTAAACCTTCACTATACTTATCCAACATTAGAAATTATGTATTTTGGGAAGGATGAAATTTTAATAATTGCCATTAGTTGCTTCAAAGGTGTTATGTTAAATAAGGTAAAGACCATTCCAATGTCAGAAGTAGAAAATATTACTTTATATAAAAGGTTCCTATATAATAAACTTACCATGAAGGTTAATGGTAAAAAGAAAAAATATAACATACCAAAAGGTTTAAAAATTCCTAAATGGCATAAAGATAATTTCTCAATTTTATGTAATGAGTTAAGATAAAAAATATCCTGTAGAGTATACAATTAAAAGGTTTACTTTATGGGATATTTTTAGGTTTTATGGGATTTATTTATTAAGTATAGTAAAAATAGTATAATAAAAATAGTACATATGGATAAAAATATAGATAAATAGAGAAGATGAGAAAGGAGATTTTATGGAAGAAAAACACACCATATTTAAAAATATAGAAAAACACTTACTACAAGATGAAAATCCTTCCGAATATTTGAATAAATTAAGTGAAGAGGGGTTTTTAAAAGAATATCCTTTTAACCTTTTAGAAAATTTAAAAAAGACCGAACAAAATTTAACCCATCACCCAGAAGGAAGTGTTTGGAATCATACTATGATGGTGCTAGATAGAGCAGCGAAAAATAAAGAATTTAGTGAGGATAGTAAAGCTTTTATGTGGGCAGCACTATTACATGATATAGGAAAAGGTACTACTACTAAAATAAGAAGGGGAAAGATAACTTCCTATAATCATGATAAAGAAGGAGAATTTTTATCTATAAAATTTTTAGAAGAATTTATAAATGATAAAGATTTTATAAAAAAAGTAGCAGCTTTAGTAAGATGGCATATGCAACCTTTATTTGTAGCTAAAAAAATGAGTTTTGCTGATATAGATGGCATGAAAAAAGAATGTTCCCCAGAGGAAATAGCTCTTTTGTCAAAATGTGATAGATTGGGTAGAGGAGATATGAATGAAGATAAAATAAAAAAAGAGGAAGAAGATATAGAAGTTTTTTTAAATACTGCAAAAAAAGAAAGTACTAAAAAATAATAAAAAATGCTATAATTAACAAGATAACTGATGACAAAGGAGAGGTTTAAATGAAAAATCCTATAATTACTATAACAATGGAAAATGGAGATGTTATGAAAGGAGAACTATATCCTGAAATAGCTCCAAATACTGTTAGAAACTTTATAAGCTTAATAGATAAAGGCTTTTATGATGGGTTAATTTTTCACAGGGTAATACCTGGATTTGTAATACAAGGTGGATGCCCAGAAGGAACAGGAGTTGGAGGACCTGGATACTCTATTAAAGGAGAATTTTCAGCTAACGGATTTCCAAATTCACTTAAACATGAAGAAGGAGTTTTATCCATGGCAAGAGCTATGAGTCCTGATTCAGCAGGAAGCCAATTTTTTATAATGGTGGGAGAATCACCTCATTTAGATGGACAATATGCTGGTTTTGGAAAGATAACAGAAGGATTGGACGTAGCATTTAAAATAGTAGAACAACCAACAGATTTTATGGATAAACCTTTAGAAGATCAAAAAATAAAGGAAATAACTGTAGATACCTTTGGAGAAAAATACGAAGAACCTGAGGAAGCATAA
Protein sequences of DBSCAN-SWA_1 >NC_004557|688595:695591|692723_693743_+|WP_011098946.1|DBSCAN-SWA MKNSLKGIITLCLIAAICGGILGITYDATKDTIAAIEKKESLQLDVILPGLNADEPKEMDVKIEEEGPISSAYEVYAAGELVGHAIIANSKGMGPLKMTVGITKDGKIGGLKIVSHAETPGIGDIVEKESFMGRYKEKSVKEELKTVKTSPSADNEVEGITGATITSTGVTKAVNEAIKFYKENVLGEEVKEEKEKPLEAKDIIPEADSMKDVAVELTENVKEVKGIYKGEQLLGYAITGLGTGMEEIQTMVGISNEGKVVFVKVVADSETEGIGDVIHEKDFINKFLNKSVDKKLEVVKNPPSKDYEIEAVSGATISTEGVTGGVNNAIKFYKEKLKK >NC_004557|688595:695591|689706_690357_+|WP_011098943.1|DBSCAN-SWA MFTGIVEELGSIENIEKKGNAYSIKIKAKKVLEDVNLGDSICTNGVCLTVTNFSKDSFTVDVMPETIRQSNLKNIKKGSLVNLERALKATGRLGGHIVSGHIDGEGIIKEYKKEGNAWWISVEPEKGLLKYVIERGSIALDGVSLTVAYVDEKLFKVSVIPNTSEKTTLLKKGVGDILNIECDLIGKYVEKILNFKNHKEEQSKIDMDFLKNNGFV >NC_004557|688595:695591|688595_689678_+|WP_035110796.1|DBSCAN-SWA MDEIYMKRAIELAHLGEGYVNPNPLVGAVIVKDGRIIGEGYHKKFGEAHAEIEAFKSCKEDPKGGTLYVNLEPCSHYGKTPPCVDVIIKKGIKKVIIGMKDPNPLVAGRGLEILKKANIEVRVGTLEDECRNLNEIFIKYITYKKPFCILKWASTLDGKICSSIGDSRWITGEDSREYVHLIRNKVSSIMVGVNTILRDNPSLTTRLKDRKGVNPTRIIVDSKGRTPLDARIFKEEGDTFIATTSQIEDKKIKEFEKVGAKIIIIPEKGRKVDLQYLVNYLTKLNIDSILLEGGGTLNYSALKEGIVDKVLMFLAPKILGGENSKTPIEGEGIKYIKDCIELKDLSIKNFKEDILIEGFI >NC_004557|688595:695591|695072_695591_+|WP_011098949.1|DBSCAN-SWA MKNPIITITMENGDVMKGELYPEIAPNTVRNFISLIDKGFYDGLIFHRVIPGFVIQGGCPEGTGVGGPGYSIKGEFSANGFPNSLKHEEGVLSMARAMSPDSAGSQFFIMVGESPHLDGQYAGFGKITEGLDVAFKIVEQPTDFMDKPLEDQKIKEITVDTFGEKYEEPEEA >NC_004557|688595:695591|690718_691918_+|WP_035110798.1|DBSCAN-SWA MFKFNSIEEAIADIKEGKMVIVVDDEDRENEGDLLMAAEKVTPENINFMIKYGRGLVCMPIIGERLKELNLNQMVDINTDTNGTAFTVSIDFIDTTTGISAYERAHTISKVLDSSVKGEDFKRPGHVFPLEAKEGGVLKRAGHTEASVDLARLAGFYPAGVICEIVGEDGKMARLPQLMEYSKEHNLKIINIADLIAYRRKKETLVKRVVEAKMPTRWGEFKIIGYENKINGEHHVALVMGDIENGEDVLVRMHSECLTGDALGSVRCDCGYQYEAAMKAISEERRGVLVYMRQEGRGIGLINKLKAYNLQDKGMDTVEANIALGFPPDLRDYGIGAQILNDLGIKKINLMTNNPKKITSLSGYGIKIVKRVPLEIHENEESEFYLKTKKEKMGHLLHF >NC_004557|688595:695591|693836_694232_+|WP_011098947.1|DBSCAN-SWA MFKLKKAKIEKDILEHLSRNGYSVTEENRSLYVIKSKFLNLHYTYPTLEIMYFGKDEILIIAISCFKGVMLNKVKTIPMSEVENITLYKRFLYNKLTMKVNGKKKKYNIPKGLKIPKWHKDNFSILCNELR >NC_004557|688595:695591|691943_692405_+|WP_011098945.1|DBSCAN-SWA MNIIEGKLIGQSLKFGITIGRFNEFIGGKLLDGAVDALIRHGVDEKDIEIAWVPGAFEIPLIAKKMAKSKKYDGIICLGAVIRGATTHYDYVASEVSKGIAKITLDEEVPVIFGVLTTENIEQAIERAGTKAGNKGYEAACTAIEMANIINII >NC_004557|688595:695591|694389_695022_+|WP_011098948.1|DBSCAN-SWA MEEKHTIFKNIEKHLLQDENPSEYLNKLSEEGFLKEYPFNLLENLKKTEQNLTHHPEGSVWNHTMMVLDRAAKNKEFSEDSKAFMWAALLHDIGKGTTTKIRRGKITSYNHDKEGEFLSIKFLEEFINDKDFIKKVAALVRWHMQPLFVAKKMSFADIDGMKKECSPEEIALLSKCDRLGRGDMNEDKIKKEEEDIEVFLNTAKKESTKK |
8 | Staphylococcus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1133918 : 1184898
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_004557|1133918:1184898|DBSCAN-SWA CATGAATAAACCTGAAATATTAGCACCAGCAGGCAATTTAGAAAAGCTAAAAACAGCTATATTATTTGGTGCAGATGCAGTTTATTTAGGTGGAAGCAAGTTAAATCTAAGAGCTTTTGCTGATAATTTTACCGATGAAGAGCTAATTGAAGGAATAGAGTTTGCTCATAGTAGAGGAAAAAGGGTATATGTAACAGTTAATGTTTTTCCTCACAATGAGGATTTAGAAGGGTTAGAACCCTATTTAAAGGATTTGGAAAACATGAAAGTAGATGCCATTATAGTATCAGACCCTGGAATAATAATGACAGCTAGGGAAGTGGCACCTAAGCTAGAAATTCATTTAAGCACTCAAGCTAATAATGTTAACTGGAAATCAGCTATATTTTGGCATAAACAAGGGGTTAAAAGAATTGTTTTAGCAAGAGAATTATCCTTTGAAGAAATAAAAGGAATAAGAGAAAAATTGCCAGAAGATTGTGAGTTAGAAGCTTTTGTACATGGATCTATGTGTATGTCCTACTCAGGAAGATGCCTTCTTTCCAACTATATGACTGGAAGAGATGCTAATAGAGGAGAATGTGCTCAACCATGTAGATATAAATATTATTTAATGGAAGAAAAGAGAGAAGGACAATACTTCCCAGTATTTCAAGATGAGAGAGGGACTTACATATTAAATTCTAAAGATTTATGTATGATAAAACATATACCTGAGTTGGTTAGATGTGGAATAAATTCTTTTAAAATTGAAGGAAGAATGAAAAGTTCTTATTATGTAGCTTCAGTAGTAAAATCCTATAGACAGGCATTAGATGCCTATATAGAGGATACTAAAAATTATAAATTCAATGAAGATTGGATGAATAATTTATTGAAAACAAGCCACAGAGATTATTATACTGGTTTTTATCTTGGAGATAAAGATAGTCAAATATATGAAACTTCATCATATATAAGAAACTATGATATAGTTGGAATAGTTAGAGACTATAATAAGGAAACCAATGAAGCAACTATAGAACAGAGAAATAAATTGTTTGAAGGAGATTCAGTGGAGGTTCTAAGACCTGTTGGAGATAGCTTCCAAGTAAAGCTTGAAAACATTCGTAATGAAAAAGGTGAAAAAATAGAATCAACTCCTGTAGCTCAAATGATATATAAAGCTAGGGTTAACATAGAATTAAAGAAAAATGATATGCTTATTAAAGCTAAGTAAAGTTTATTTGAGGGGGGTAGAAGATTTACTATGAGTAAAAGACCGATATTAATAGGTATAACTGGTGGAACGGGCTCAGGAAAGAGCACTGTAAGTAAAGAAATTTGTAGAAGATTTGATAAAGAGCTTATAGTTATGATAGAGCAAGATTCCTATTACAAGGACCAAAGTCATTTATCCATTGAAGAAAGAGTTAAAACTAATTATGATCACCCTAATGCTTTTGATACAGAGCTTTTAGTAAAGCATTTAAAAGAGCTTTCATATTGGAGTAAAGTGGAAAAACCTATATATGATTTTGAATTACATAATAGAAAAAATGAAACAGAAATTGTAGAACCTACTGAAATTATAATAGTAGAAGGAATATTAGTATTAGAAGAAAAAGAAATAAGAGATCTATTAGATATAAAGATTTATGTGGATACAGATGCAGATGTAAGAATAATAAGAAGACTTGTTAGAGATATAAAGGAAAGAGGAAGGTCTTTAGATTCCGTAATAAATCAATATTTAAATGTTGTAAGACCTATGCATATGCAATTTATAGAACCAAGTAAAAGATATGCAGATATAATAATCCCTGAGGGAGGACATAATAAAGTCGCCATAGATATAATCGTAGGAAATATAAAACAGATGGTTCAAAAATCAGAATAAAAGGATAAAAGTACCTCATTTTGACAATAATAAAGTTAAAGTGAGGTGTTTTTTATTCATAAGGAAAGATGTTATATAATACTTTTGTTTTTTTTAATATCATTAGGATTTTTAAATTTAAGATTATACAATCTACAATACAAAGATAGAGAAAGATATAATGTTATGGCAAATATGCAGCATTCTTATGTAGAAACTACAAGGGACTTAAATTATAAATTATTGGATTGTAATGGAGATAATTTATTAAAGTACAAAGAAGAATATAAAGCTGTTATAATACCTAAAGTATTTATAAAGCAAAATGAAGAAGAAAACTATAAAGAATATTTAAAAATAATAGAGGAATTAGAACTTTACAATGAAGGCTATAATAATTTAGATATAACCTCAAATGCTAAGCATTACTTTACTATAGACTTACCTACATATGAAAAAATAAAGGACATAAAAAATATTAAAGGATTTTATACTTTTAAAAATTTAAAGGTAGACAGAAGTGAAGCTTGGAGCCTAGAAAATATGATAACTACTACGAAAAATATTAGTAGTGGAAAAGAAAAATCATTAGGTTCATTGGAAATGACTATAAGAAATAAGCTTAAAAATAATAAAAATTATACTATAGAGTTTGAAAAAGATGTAGATAATAATATATTAAATATAGAAGAGAATATACCTAAAAATAACTTAAATGTAAAGTTAACTTTGGATAGGAATATTCAAGAAGACATAAAGGAATTATTAAAAAGTAAATATGGAGACTATGAACAAATAGGAGTCATTTTAATGGAAAGTAATAGCGGGAATATAAAGGCTATGGTACAAAAGGACGATACTTTACCCAATATAAATATAGGGGCAGAAACTTTAAATGGATTTTTTCCAGGTTCTATTTATAAGACCTTAGTTATGGAAACTATTTTGGAAAAGGATAATGATAATGTAAATAAAAGTTTTTCTTGCAAAGGATTATACGAAGATAATAACAAAGGCATAAATCATGGTAGTTTAAATATAAATGATGCTTTTGTGGTGTCTTGTAATCATATATTTTCACAATTAGGTATAGAATCAGGTTTTCAGAGTATGCATTCCCTATCTAAGTCTCAGGGACTTTTAGATAAGGTACTAAATTTAGATAGAGAGCAAAGTGGTAAGTTTGAAGTAGAAGAACCTAAAATGGAAAATGGGAGTTTGGGCCTTACTGCTATAGGTCAAAATATGAGAATTACTCCTATAGAAGCAATATCTATTGCAAATACTGTAATAAATGAAGGAATATATATAAAACCAAATATAATAAGTGCATATGTAGATAACGCTAATAAGGTAGTAGAGAAAACAGAATATCAAGGTGAAAGATGCATATCAAAAGAAACGGCTTTAAATGTGAAAAAGGCTATGAATGGAGTAGTTAATAAGGGAACTGCTAGAGAAGCTTACTTAGAAGATATAGATATAGGTGGGAAAACTGGTTCTACAGAAAGATTAGAGATAGTAAAGGATAGGAATGGTAGAAGTAAAAGAATTAAATATTCTGATGGATGGTTTATTGGATTTTTTAATAAAGAAGATAAATATTATTCTATGGTGGTATTTGTAAAAAATATAAATAAAGATTCTGAAAGTGGTGGAAATACTGCAGCACCTATCTTTAAAGATGTAGTTAAGATATTTCTAGATGAAGAACATTAATACAAGTAAAAGTTATTTTGTTTGATACTTAAAGAAAATTTATATAAAATAAGAACTAAAATTAGTCTCTCTGAATATTATAATAATAAGCACTATTTGGAGGGACTAAAAGTGTATATGTTTGATTCAATAATAAACATATTAGGTAGCAATATACTATTAACAGCCTATGTTACAGGTAATAATTCTTTTCCAAAACCATTAAAGGCAGAAGAAGAAAAATATTATTTAAAACAATTGGAAAAAGGGGATATATCAGCAAAAGGAGTACTTATAGAAAGAAATTTAAGACTTGTAGCCCACATAGTAAAAAAATATTCATTTCCCGGAAAGGATATGGATGATTTAATATCTATAGGAACTATAGGTCTTATAAAAGCCATAGATTCATTTAAAATAGACAGAGGCACAAGACTCGCTACTTATGCAGCAAAGTGCATTGAAAATGAAATATTAATGCTTATTAGAAATAACAAAAAAACAAAAGGAGAGGTATATCTTCAAGATCCAATAGGAATAGATAAAGAAGGCAATGAAATACCCTGGACAAGTTATCAGTCTAGGGCTTAAGTTTAGGAAATATATCTATTCCAAAGGTATCTCCACGTTTATTCTTAGTATACTCAACCTTATTTACTAACTTTTTCATCAACTCATTTTTCAATTTAATATCATCTGTATTTTTATAACCATCTAGTAATTTTTGAAATTTAATTACATCTTCTTCTTTTACTACTTTTTTCTCTTTATTTATTATATCATTTATTTTTTCTATTCCAGAAGAAGTTTTAGTTATTCTTTCCTCTATATTTTTAGACCTTTCTAAAAAAGTATTTTCATCATATATACCTCTTTCTAAAAAATCAAACAGTTTTAATTTTTGTTCATTCAGTGCAGCCAATTCTTTTTCTAATATATTTACCTGACGTTCGTATGGCTTTATATCAGAGGTTTTATTCTGATTAGATATATTTATCCTATACTCCTTTAAGTAGCGTTCTAAGGCTTGGATAATAGCTTTTTCTGTAGAATCATATCTATTACTGATATTATCACATTTATTATTTCTACATAAAAGCCTATCTACTCCCTTTAATTTTCTCATAACCATTTTAAACTTACATTTACTACAAATAACTATACCCGCAAGAGGATTAGCAGGACCATTTACAAGTTGGTATGGAATATGGTATTTATTATTTAATATTTCCTGTGCTTTATTCCACATCTTCATACTTATCATAGGTTCATGCTTGCCATCTACAACTATCCATTCTGATTTATCTCTTGTTCTAGTATCTTTTACTTTATTAGGATTTTTAGATTTTTTAATTTCTTTCTTCTTCCAAGTTACTTTACCTATATAAATAGGGTTTTTAAGAATAAATAAAACAGAACTTCTAGAAAAATTATTATTAAATTTTGTTTTATATCCTAAGTTATTGAGATGTTCTGCTATAGATCCAGCTCCATTACCCTCTGTATATAATTTAAAAATTAATTTTATTATTTCACATTCATGTGGGTTAATTTTTAATGTTCTAGATTTTTTTATCCAATGTATATCATATCCTAGAGGTGGATTAGTTGCTATATAATTACCATCTTCTACACTTCTAACCCTACCACCTTGCATTCTACGATTTATCATCTTTAATTCTTTTCTAGACATGAAGGCTTCAAATTCAGTATACTCTTCATCAAAATCATCAGATAAGTCATAGGTTTTCATAGGTGTAATTATTTTAGTATTAGATTTTTTAAAAGTTTCTAATATAATGCCTTGGTCTTGCATATTACCTCTGCCAAGTCTTTGCATGTCCATAACTAATACACCAGTATATTGTTTGTTTTCAACTTCTTTTAAGAGCTCTAACATTTTAGGTCTAAAAAATAAACTTTCACCAGATACTATTTCTTCTTTTATTTCTACTATGTTTAATTTTTTTTCTTTAGCAAACTTTAATAATGCTTTTCTATGTTTGGATAAGGTTTCACCTTCACCTAAAGTTTTTTCAAGTTCTTCATCAGCACGTGATTTTCTTAAATAAATACAAATTTTATTCAAAACATCACCTCATTAAATATAATTATTTAATCCTTTTTATAATTACTTCTTTATGTTTGGCAACTTCATCTTCAATTCTATTAAGTTTCTCTGAATTTTGTTTATAACCATCAAATAAAGCTTCAAGTTTTTTACCATGGTCATTTTCAATTCTTATTACAGTTTTCTTAACTTCTGATATTTCACTTTTTAATTCAGTGCATATTTTTTCTTGACCTTCTTTTAAATCTGCATACATTTTTTCTTGACTTCCCTTTAGGTCTATATACATTTTCTCTATTAGTTCAAATATTTTTTCATTTTCCATAAGTATCATCCTTTCTATTATCTATTAGTAGTTACTTTACGTTGAAGATTAAATAAAACTTTCTCATTTTGACTTTCTTTATGAGTTAGATATTCAACATCATCTTTAATTTGATTAAGATTTTCAATGGATTCTGTACGAAATTCAGTAAGGTCAGCAGTTTGATTATATACAGCATTAACCTTTTCTTTAATTTCTATTATATCAGATTGCATTTTTTCCTGGGTTTCCTTAATTCCAGAAACTTCACCTTTAATACCAGCAACTTGGTCTTTGATTCCTGTTATTTCACTTTGCATTTTTTCTTGGGTACTTTTAATTCCCTTAATCTCACTTAATATTTCTTTTAATAAGTTTTCCATAAATAAAACACTCCTTATATAATATTTTAAAATATTTTCATAACCCCTAAAGTAGGTTCAAAATATATTAAATAGTTATCTATTTCGTAATATAAACCGTACTTAGTTTTATAATAGTTTATTGCTTCTTCTAAGAATTCTTCTGTAACATTTAAATAATCTGCCAAAGTATATTTATCCCTAGCACCATTATTATAAGCATTAACTAAATCTATGATTCCTATAAGTTTTTCATAACTCCATCTTCTAGCTCTGACCTCTTGTTTAAGATTTACAATCTTTGATTGATCAGTAATATCACCTACAGTTGTATAAAAATGCCCAAGTTCTTCAGCTAGTATACAACGTTTCTCTCTATTAGTTGTTAGATTCTTATTAATGGCTATTCTATTTCCATAACATAAACCATCTTTAGTTTTTAAATCTATTTCTTTAACCATTATATCGTTGTTAAAAGCTTCATCTAGTAAAGTATCATAGCTCATATAGTCACCTCACAGTTATTTCCATAGTTCATCATCTTTCATAAGATTTAAGTCATGTTCCATATCTTCTTTTGAGAAATTACCCTCCTTATCATGTGCCGCAATTGGCATTAAGTGATCTTTTTCTTCGTGAGTATACTTGTCTATTTGAGCAAGTTCTTCAACTCTTTTAATAGCTTCAGATTTTCCAGTATCATTCAGTTTATTGAAGTTTGTTAATAATGTTGTTTCTTTTTTAAAATTAATATTTTCCATTTCAGTTTCTGTTTTATCATCAAAATCATCAAGTGTACATCCTAAGACCCTAGATAATGATTTAAGGGTTTCTAGTTTAGGATCTTTAGTAGTACCGTTTAAAATTTTGTCCAACGTTCCTTTAGGTACACCAGACTTCTCGGATAATTGTTTTGAAGTAAATCCTTTTTCTTTTTTTAATTTTTCTATTATTTCCAGTCCCATAATACTCACCTCTTATCTATAGGATACTACTGTAAAAAAGTACTGTCAATAAAAAAATACCGTTAAAGGATAAAAAGTTTCAAAAAAAGTATTGACTTTTTCCGTTGAGGGGTATAATATGTAATCATAATATCCGTTGACGGAGAAAGAGGTGTGAAATATGTACCGTGAATTATTAGGTGAACTTGTTAAGAAAGGATTAACTAAAAAAGATTTAGCCAAAAAGATTGGTGTATCTGAAAAAACTATATTTAATAAATTAAATGGAAAAAGTGATTTTACACTTAGTGAAATAAAAAAAATAAGAGATCTAGTTTGCCCAGGTGCTTCATTAGAAAAATTATTTGAAAAATCAGAAATGAAAAAATTAAATTAAAAGGAGATGAAAGAAATGAATAGTCTACGAGTTATAGACCAAAGAGAAGTATTAAATAAAAACTTTAAAATCTATGGAAATATTGAAAATCCACTATTTTTAGCAAAAGATGTAGCTGAATGCATTGAACATAGTAAACCTTCAGTGATGTTAGAAGGGATAGATACCCAGGAAAAGCTAAAGGAAACAATCTTTACCTCAGGTCAGAATAGAGAAATGTGGTTCTTAACAGAAGATGGGCTTTATGAAGTTCTAATGCAAAGTAGAAAACCTATTGCTAAGCAATTTAAAAAGAAAGTAAAAGAAATTTTAAAAGATATAAGAAAACATGGAATGTATGCAAAAGATGAATTGCTAGATAATCCAGATTTATTAATACAAGTTGCTACAAAATTAAAAGAAGAAAAAGCTAAAAATAAAATGCTTGAATTACAAAATAAACAGAAAGAGCAAATCATAGGAGAGTTGAAACCTAGAGCAGACTATACAGATAGAATTTTAAAGAATAAAGGACTTGTAACAATAACACAAATTGCTAAAGACTACGGCATGACTGGAACAGGATTAAATAAGTTACTACATGAATTAAAAGTACAGTATAAACAAAATGATCAATGGCTTTTATATAAAGAACATAGTGGTAAGGGTTATACACATTCTGAAACTATAGATATTGTTAGAAGTGATGGCAGACCAGATGTAAAAATGAATACTAAGTGGACACAAAAGGGTAGATTGTTTTTATACAACTTACTTAGAGATAATGGGATATTACCAACGATAGAACAAGAGGCTGAAAGAGAATTTGCTTGTAATTAAAAGGAGCTAAAAAAAGACACCTACTCAAGTGTCTAAAGTAAATGGTTATTTCTAATTTTTTTAAGAATTTTAATATCCATTTCATGGGAGCCGGTAGTTTCTTTAATAACATCAATAGCATCTACAACATCATTAACAGATTTAGAAGTATTGGAAATAGCAAGTTCAATAATGTCTAATCTTTCACCTAGAGTTTTACCATCTTTATCTTTAGCTCTATCTAGTTGTTCAGAAAAGGATTGTTGAACTTCAGCTAATGTTTTTATATCAGTTTGAATTTTTTCAAGCATTAAAGAGTTTGTATCTACTTGAGATTTTATTTTAGAGGTAGTTTTTTCTAAGCCTTCAAAACGTTGATTAAAATTTACATATATTTTCTCAAGTAAATCATAAATTTTATTTTCCATTGTATCAGCTCCTAAAAAATTAATCACATTACCATTATACACTACTTAGGGAATTTTATGAACTTGGAAAATTAAAGTTATTAAAGCCACATTAAATTAATAGATATATAAGGAGGTGAGTATTATGGCAGCGATAGTAAGAATTATAGAGCCAGACATCTCAGATGAAGAAAACGAGAGAAACCTACAGGAAGTAATTGAAGTACTAGAAAGAATTGCAGATAGCATAAGCATAAGTAAAAGCACAAAGGATGAATAAGGCTTAATAGAGCCTTTTACCACACTTCTAGAAAAATACAAACACTTCACACTATATTTATATGCACATCTATTTATTATATGCACATCAATATCACCGCAATGGTGAAGAAAATAACTAAAGTAAAAAGATATAAATGTTGGAGGTAAATAAAGTGTTAAAACAAATGTTAGAGGGGAGAGTGGGGCATTTAAGTAATGACGAATTTAAAGAAGTTATGGATATAGTTACTGATGATATTAAGTTTAATCGCATTAATTTTGGTAAAAGAACAAATAAGATTGAATTAATAGAAATAGCTGAAAGATCACTACATGCACTTAGAAGAATGGAACTTGAATATGACAGATATGGAAGAGCGAAATATAATCCGTTCATTCATAGAAATACTGGAAAGCCATGGTCTAAGACAGATTTAAATTATTTAATTAATTGGTGTGACATTATAGGTCCAGATGAAATGAGTTTTGCATTAGAAAGAACTATAGCAACAGTTATGAATAAGGTTTACATCCTTAGAAAGAAAGGAGTTATGAATAAACATAAACGTATTAGAAATTGCAAAAGAGTTAGAAGTATGCATTAAAAAAAGAGCTGCAAGAGCAGCCAAATAAAAAACATAATTAAAAAACCTAACCTAAAGTATAAGAGAAAACGGAGGATTTGTAAAGATGATAAAAGATATAACATTAGCACTTTTAACAACAACTATAAATAGATATTCTAGTTTAGGTGACAGTATCAGAGCAATACACAAAGAAGCAGTGATTAACGATTTAACAGGTATTTTAGATTATGTAACAGATTTAAAAGAAGAGAATAATCAGCCTATAACAGTAGTTTTAAATTCTAATGGAGAAGTTGAATTTCTTAAGAAAAGGATTAGAGAACTTGAGGAAAGCTGTGAAACTGATAATGAAGTTATAGAAAAGCAACATAGAAAAATTAAGGAGTTACAGAATAGCAATTATAGATGGAATACACTTTGTGTTCAACTTAAAGCAAAAAATAAAAGGTTTGAAGCTGAAAATAAAGATTTAAAAAGTAGATTTGGAGTAGATAAATTAGAAATTATGAAAAGGGGGGGAAGTGGATTGGATAAACTCAAATGGCTTCAAGAAAGACAAAAAGGAATTGGTGGAAGTGATGCTGGGGCAATACTTGGAATAAATAAATGGAAAACACCATTCCAGATATATTTAGAAAAAACAGAACCTATAACAGAAATTAATGAACAAAGTGAAGCAGCATATTGGGGAGATCAATTTGAAGAAGTAGTTGCTAAAGAGTTTGAAAAAAGGACAGGTAAAAAAGTAAGAAGAGATAGAAGGCATTTTAAACATGAAAAGTATCCATTTATGGTTGCAAATATAGACAGAAGAGTTATAGGAGAAAATGCAGTATTAGAATGTAAAACAGCTAATCAATTCTTAGCTAAAGAATGGGAAGGAGAAGAAATACCAGCTAGCTATTTAGTACAAGTGCAGCATTATTTAGAAGTAACAGGAGCAGAAAAAGGATATATAGCAGTATTAATAGGTGGACAAAAATTTATTTGGAAAGAAGTAGAACGTGATGAAGAACTTATAGAAATTATAATCAATACTGAAAAAGAATTTTGGGAAAATCATGTGCTTAAGAAAATTCCTCCAGCATTGGATGGAAGTTCAGCAGCAGAAAAATACTTAAATGAAAAATATAAAAAATCAAATTCAAATATAAGTATTGATTTAAAATCAGAATACATGGATAAAATAGACGAGCTAATGCAACTTAAAGAAACTATTAAAAATTTAGAAGGACAAGCTAAAGAAATTGAAAATAACATAAAAAATGAACTTAAAGAAGCTGAAATAGGATATGCCCAAGGGTATGAAGTCAATTGGAAAAAAGTTATTTCAAATAGAGTAGATAGTAAACTTCTAAAGGAAAAATATTCAGAGATTTATAAAAAAGTATGCAAGGAAAGTGTATTTAGAAGATTTAATATTAAAAACTTAAAGGAGGAAAATTAAGATGGCAACAAATGAAAGTTTAAAAAATCAACTAAGCACTAAGAAAGAGACAGGGTTAGGGAGTGCAGGAAATACAATAAAAGGTTTAATGAACAGCCCAGCGATAAAGAAAAGATTTGAAGAGGTTTTAAAGCAAAGAGCACCTCAATACATGAGCAGTATAGTTAATTTGGTTAATAGTGATATAAACTTAAAGAAATGCGACCAAATGAGTGTGGTTGCAAGTTGTATGGTAGCAGCAACTTTAGATTTACCAGTAGATAAAAATTTGGGGTATGCATGGGTAGTACCTTATGGTAATAAAGCTCAATTTCAACTAGGATATAAGGGTTATGTGCAACTAGCATTAAGAACAGGGCAATATAAATCTATAAATGTTATAGAGATCCATGAGGGTGAATTAATAGATTGGAATCCACTTACTGAAGAATTAAAGATAGATTTCTCTAAAAAAGAATCAGATGCAGTAATTGGATATGCAGGATATTTTGAACTACTTAATGGATTTAAAAAATCAACTTACTGGACCAAAGAACAAATAACTAAACATAAAAATAAATTTAGTAAATCAGACTTTGGCTGGAAAAAAGATTTTGATGCAATGGCTAGGAAGACAGTATTAAGAAATATGTTAAGTAAATGGGGAATATTGAGTATAGAAATGCAAAATGCTTATACTGCTGACCAAGGAATAATAAAAAATGAAATAATCGAAACTGGTGAAGTAAAAGAAAATATAGAATATATAGAAGCTGATTTTGAAAGTTATGAAGATAATTCAATAGAAGAAGGTGGGGCAAATGAATAATATACCAGAGTGTATGTATGACTATAGATATGAATTTGAAAAGATGCAAATAATAGATAACTGTTGTAATTGCGACTGCAATATTTGTGAAGGAGAAGAATACTACGATATTGATGGAACTATACTTTGTGAAGAATGTATTAGAGATTACAAGCATACAGCAGAACTCTAATAACTAAGGAGGAAACTATGGCGGAAGGAAAGGAAAAAGGATGGATAAGTTTATACAGGGATATTCAAGAGCATTGGATATGGGAAGATGCAGAAAAGTTAAAAGCATGGCTGGATCTTCTCCTTCTTGCCAATCATCAGAGTAGAAAAATTTTATTAGGAAATGAACTTATAGAGATAAAAAGAGGTTCTACACATGCATCAGAATTAAAGCTTATGGATAGATGGAATTGGTCTAAGAAAAAGGTTAGAAACTTTCTTCAACTACTAGAAAAAGATAATATGATAATTTGTGAAAAATCTAAAAAGGGAACCACTATAACAATAATAAACTACGAGGTTTATCAAGGTTCAAGGAACCATAGAGAAACCATAAAAGAACCACAGGGGAACCATAAGGAAACCATAAAGGAACTATTAGGAGACCATAAGGGATACACTAACAATAATGATAATAATTATAATAATATAAATAATGATAATAACAGTAGTGGCAGCACTCCAAATTATATAGAATTTTTTAATAGTAATTTTCATCTGATAAATTCTTATGAAATAAATATATTAAATAGTTTTGTAAAAGATGGATTAAGTGAGGAAGTAATATTATTAGCTTTAAAAAAAGCAGTAGAAAATAATGTTAGGACTATAAAGTATGTTAAAAGCATATTGCAAAACTGGTTAGAAAACAATATTAAAACTGTTGAAGGAGTAAAAGCAGAGGAGGAAAGATTCAAGAGGGACATTGAACATAAAAAAGCAAAGGGCAATGTAGAAAGAAAAGACAATGGAGCTAAAGTAGACAGTTTTAACGGCTATCAGCAAAGAACATACGACGGCAGTGATGGTGGAATGACATTTGATGATTTAGAAAAGAAACTTTTAGGTTGGAAATAAGGAGGGAAAACAATGCGAAAGATATGGAGAAATGCTGAATTGGACTTTATAAGGAAAAATAAAGGAAAATTAAGTTACAAAGAAATGTCTAAACATCTAAGAAGGACACCTGCGGCAATAGAACAAAAATGGCAATGCTTTATCGATAGGCAAGAGAGAGTGGACAAGATTGTAAATAAGATGGATGTTGCAGAAAAGAAGTTTAGATTTAATAAAGGGCAAAAGGTAAAAACTAAGAGGATGGAATGGACAACAGGATGGGATACTTTAATTACAAAAGGTAAGGTTATTGCAGACAATAAATACTTTGTAGTGCTGGATAATGGAGTGTATAAAGAATGTGTAAATAAAGTGGATCTATATACTAAGAGTGTGGTTTTAGTATAAGAATAACATCATATAGGAGTATAATTGTAAAACTATGTTCTCATATGACAATCATTTAATAACAAACACAGGAGGATAACAATGGCTAGAAGTAAATATGGAGCTAAAAAGATAACTATAGATGGAAATGCTTTTGATAGTAAAGATGAAGGTAAATATTATGAGTACCTTAAGAAGCTTAAATTTCAAGAGAAGATATTAAATTTTGAATTACAACCAAGATATGAATTAAGGCCAGCCTTTGAAAAAATGGGTAAAAAATATAGAAAGGCTGAATATGTAGCAGATTTCTTAATATATCATCTAGATGGCACAGAGGAAGTAATAGACGTAAAAGGTATGGCAACAGAAACTGCTAAATTAAAAAGAAAACTGTTTGATGAAAAATATAGGAATTTAAAACTTACGTGGATAGTTAGAAGTCTTAAATACAGTGAAACAGGCTGGATTGAATATGATGAACTTCAAAAGGTTAGAAGAGAAGAAAAGAAAGGAGCTAAAAATAAATGAGTAAAATAAGGGTTATAGCAGACACAGGAGAATTGATACATGAAATAGAGTGTAGCCATTATAACATTCAATATGTAACATCAGAAAGTGATGGGAAGATACAAAGGATAATTAATCTTAATAATGGCAAATATGGACCTAAGCACTGGATAAAAAATGATATTTATCAGCCTATAGCTAGAAAAATTAAAAAGAATCTTATAAATCAAGTTCCAGAGTTAAGTTGTGTAAATGTAGATAAAATACTGTTTATTGAGGATATAGACTATGTATCTGATGAAATAAATAGAAATACTGATTGGGTGATGAAAATAAAGAAAGCCCCATCACAATTAACTGAATTTACAGGATATAAATTTATTATAGAGAGCAGAGAATTCTGGATGGAAAGATGTTCTCATGAACAGATAGTAGCTCATATCTATAGCTGCTTAAAACAAATAGATAAGGACAAGCTTAGGGAACCAGACGTTAAAGGTTGGAAAGAGGTAATAGGTAACTTAGGATTAGGTTGGGAAACTACTATAAGCCCAATACCTAATCTTATGAATGGATTTGATGTGGAAGATTTTAAAATGCTTAAGAAAGCAGACAAGCAAATGAAATTTGATTTAAGAGCAAAATAACAGAGGTGATGCAATGGCAGTAGATACAGTAGCAATAGCACAGGAAATATATACAGTTGCTAAGATTACAAAAGATTGGATAGGCTGGGAATATTGTAAAGTAATAAGTTGAAATTAGCTATTTAATATAGGGGGATATCTATATTATTAGATACCCCTGTTAGTAAACTATTGTTATTTACCAAGGAATGATTTTAGCATCCAAACATGTTTTTCAAGTGCAGTGTGAATAGCAAGAAGCATATCGCTAGTAGTTTCATCATTTTCTTTTTGGGCTATTTCCATTCCGGTTTTAAGCTCACTAACTATAGTGTTATAATCACTAATGATTGTTGCAACCATATCTTCTGCAACTTCATTTCCAGTAGCTTCTTTTATAGATGAAATTTCTAAGCATTCTTTCATAGTTGCCACTGGATTACCTCCAATTGAAAGCTGTCTTTCAGCCAGTTCATCAATATGAAGTGCAGCTTCGTTATAAAGTTCTTCAAATTTTAGATGTAAAGTGAAAAATTGTTGACCTTTAACGAACCAGTGGAAGTTATGAAGTTTAATGTACAATACACTCCAATTAGCGATTTGCTTATTAAGAACATCAGTTAGTTTACATGACATAATTAACATGCCTCCTTAAACGATTTATAAAACTAGTATTATTAAAAATGAATTATTATATTCATACTTATATTAAATATAATTTTATAATTATTTAGAGATATACCTATTGGAGAGAAAATAGAGTATATAAGTTGTGGTAAGAAGTATAAAGGTGTTGTGCAAGGTTATGCCTATAATGGTAAATATATTGAGTGCACTACAAATCAAAAAGGTTATAGTGGAGTTCTATTGCATATTGAAAATAAAGGTAGAAATTGGTTTACAAAATAGTACGTAATTCATGTTAATAAAGTCATATGAGGACATAGTTATATAAATATATTCTTATATGACAATCAATTGAAAAGATAGTTACATGAAAATAATAATAATGAATGTAAAAAATAATACCAGGTATTTTGGCAAAGTTGGAAAAGTATATAAAGTACAAGCAGTAAGCAAAAATGATGCTGAAATTATAGAGAGATAGGAGGAAAATCATATGCCAAAAGAAGGTCAAATTAATATATTGGCTATTCCTTATAAGAATTTTAAACACAAAATAAGGCTTACAAAGCGATTTTTAAATAGATATATGATACAGGACTTAGGTGGGGTTTTATACATGGAAAGGAGAGAGGAATATGAGAATAAAAATAAATAGTGTGAAGGATATCCTTAATAACTCAAAGTATATCCCAGCAGAAGTAATTCAAGACATAGACAAAAGAATATCTGATTGGTTAGCTTCTGGGGGTAAGAAAGATGATTCTTATATAAAACAACAATTTAGATACGCTGAAAGAGTTGCAAATATTATATCAGGAAACATGGAGGGATAGTAATGGAAATAATATTAATATCTATGCTTATATGTATAGCTATAGCATCAGTAAGTGTAAAATATAGCATTAAAAAAGTACAAGCTATAAAACAGAAAGAACTAGATAGACCAGACTACATAAGAATGGAGGAATGGCAGTAATGTTTAAAGGTTGTGAAAGACTAGAAGAAATATATTGTACAAGGGGGAGTTGTAGACGTATAGATTGTAAATATAACCAGGAACATGCATTAAAACAAAGTACTCAATCTGGAATAGAGTTTACTGTTAAAGCTTTTTATGGAACAGATAAATACCTGTTTAGGAGGAGATACAAAAAAATTGGAGAAAAGATATTCTAAGGGTTATTTTAGATAGTGTAAATAGGAGTGAGTAACTATGGAAAAAGTTAAACTATGGACACAAGAAGAGGATAAGTATATTTTAGAAAATCAAGGGAAAATGACATATAGAGAAATGGCTAAAAAAATAGGAAGAACTGAATCGGCTGTTAAAAATAGAATTAGTTATGAAAGAATGAAAAGCCATATGAACAAAGATAAAGCTAAGAAATGGCTAGAAGAACAACTTCCTGGAATGACACCGCAACAAGGTAAGAAACAAATTATGAAAAAGTTAAACATAGATGAAAAGAAGGCAGAGAAAGTGTACAACAAATGGAGAAAGAATTACATTAAGGCAATGATGTAAGGAGGACTTGAATGATAGATAAAGAAACTTTAGCGATGGATGAAAAAACTTTTAAGAAAACAGAAGGAGTATTATATAATTATAAAGATTTAGAAGTAGAAATAAAAGCTGTAGAGTTAGCAATAAAAGAATTAGAGATAGATTATAAAGGAGGTAGAGGTATAGGTTACGAAGAGAGGACAGGAGAAACTTACAGGATAAATAGACCTATAGAAGATGAAATAATTTATAAAGAAAAATATATAGAAAAACTTGAATTAGAGATAGAGAAGAAAAAAATATTAAAAGAAAAGATAGAGAATGCAATAAGAATATTAGACGATAGAGAAAGTGAAATAATTAAGCTTAGATATTTTATAAAACCTAAAAAAAGTTGGGTAGCTATTGGGATGGAAGTTAAAATGGACAAGGACTATTGTTCACTTATATGTAAGGAACAAATAATACCTAAACTAGCAAATATCTTATGGAACTATTAAAAAACCTTAAAAATACCGTACTTATTACTGAAACATTACAGATTTAGGTAGAAAAGATATGATATTCTATTAACATAAACAAATATAATTAATTAAACAAAACAACTGAATCGTTACAGCAGGGATTACTGTAACAGGCAAGTAGTGCGTACTTGCCACATTTATTAACACAAAAGGCACTTACTTAAATAAAAGTAGGTGTCTTTTTCTATTATGATTAAAAGGGAGTGAGAGGTGTGAAGATAGAAAAGATAATAAAAGCACAGCAACCAGATATACATAAGAGATTAAAGCAACAGAATAGAAAGAAGAAGTCTAGGAGGGAGGGAGAGCACATCTCTTTTAGTGATGTAATGGAGCTTATGAAGCATGATAGTTACAAAAGGCATAGAGGAGCTATAAGGCAGAGATAAATAGATTCGTTAGTAACGACAAGCTTGTCGAGATTGGAGATACATTGCAAATAAATTAACTAGTAATTAGTAGACAGCTAACTAATGATAATCCACAACATATTGTTAATAGTGTGGATTATAAATACTATATATTGTATGTGTGTCGCAAACAATCGGAAATAGACAAGTAAATGATGGAGGATTACCTCTTTTTATGTAGAAGAAATTACATGGAAGGAGGAATGTAGAATGTGGTTATTTAAATCATATGTAGATATAGAAGGTATTAATCCTTTTGCTAAATCAGGTAATGGTTATTGGACAGTGGGTACTGATAGAACTCCAATAAATAAAATAGCAGCAGTAGTTAAATATGGACATAGTACACTTAATTTCACTCCATCAATAAGTATTAGTTCAAATCCTTTAGGATTTTCATTTAATACCACTACAGTAATATCTGGAACACAATCAGCTACACTTAAATTCTAAAATTAATCAAAGAAAGGATAATCTATATGCGGTTATTTATACTATCCATTATTAGTGCTTTTATTGGTCTTTTAATTGGAGGTTTTATATTACCACCTATTATTCCAGGATTAAGTGAGGCTTATTATATAGTTTACATATGTGCATTTAGTGGGTTCTTTATTCCTAGTTTTTATGTATTAGAAAAACTATATAAAAGTCATAAAAAAGGAAACGATAATTCGTAGTTATTTAATGATTTGCAATATGTAAATTTTTTATAACTTCATAAAATAATAAATATATAATGGGAAAATATAAGTGAATAACTTAGGAAAGAAGATATTAGGCACTCAATAATGGGTATCTTTTTCTATTCCCAAAACAAACAATAGCAATTAACAATGAGGAGGTGAGTCTATGGCTAAATCTAAATATGAAACTAATGTAAAAGATAAACTTATATTAGTTGAAGGGTGGGCAAGAAATGGGCTCACTGATGAACAAATAGCAAAGGGATATATCTATTAAATTGAATGTTTTTTATATAACAACAATTGTAATTATGGTAGAAGGAATTAAGTAAGCTATATAGAATGACTTTATTAGGTGATATAAATGATACTAAAAGAAAACTTATTATATCCATTAGTACATTATACAGATATGTATGGCAATTTCATAGGATTTCAGAAAGATAAAAATTCACAAGTAGTTTTATGTTCATGTATGAGAAAAGCAGTAGAAAACTGTATAAAACTATTTTTAAAATATCCTTCAAGTTTGCTTAATCCTCCAGAGTGGATATTGTTAAAACAACTGGATATGCCAGAAAGTATAAATGATGTCATAAGAAAGAAAAATCCACCAGTAGGGCTAGAATGGCTAAATTATATTAAATTTAAAAAGGATGTATGCCATAGATGTAATATAGAAAAACCTAATAAAGAATATTGTACTCCTATGTATGGAACTAAATTTAAAAGAACCTTTGGATGGTATATAAATATAAATTACTTTAATAAGGGAATAAACCCCAGTACACATGATGGAATATATTATTTAAAAGAAGATGGTCCTATTGAAATTAGATTAATATTAGATCCAACAGATAAAGATTTACTTGAAGATATAAGAAGATATAAACTACTAGATACATTTGAAGGCAAAGAAATATTGGATATGCTTAAAAAGATAGAGCATAGAGAAGAAGCAATATTATTAAGATATTTTCATATAGAAGATAAAGAGTTACAATATCAAAAATTATATTATGCAATTGAGTATGTAATGAAACAAAGGTTAAAGGAAGTACATAAAGTAATTGAAAATGAAGTAAGAGACTGGTTTAAATCTAAAAGAGTTGGTGAGAAATGGGAGAATGAAACTAACTTATATAAGATAATAAGAAAACTATATCCAGAGTTAACTATGTATAGACATTTTAGGCCACCATTTTTAGATGGACTAGAATTAGATATTTATATAGAAACTTTAGATATAGGGATAGAATATCAAGGAGAACAACACTTCAAACCCTTTGAACATTGGGGAGGAGAAGAAGCTTTTGAGAAAAGACAAGAATTAGATAAAAAGAAAAGGGAATTATGCAATAAAAATAATATAAAACTCATTTATTTTAATTATGATGAGGAGATAAATGAGGAACATATAAGAAAGAAGTTATTAAAAGAGCTTAATATATAGGCTCTTTTTTATTTATGGAAAAGCAGAGGTGGTGACAATGTAGATATGGCTAAAACAAGGAGTCCCGATTGGGAAAATATAAAGAAGGAATACATAGAATTAAATGGCGATGTAAAGTTAAAAGAATTTGCAGAGGAACATGGAATTAAATATTCTACTCTAAGAAGTAGAAAAAATAGAGAGAATTGGGATAGTGAAATAAATAAAGATGTTGCAACAAAGAGTGCAACGCAACAAAAGAATGTTGCAACAGAAAATAAGACTAAGAATAATGATAAGGAGCCTATTGATAAGGAAGTAAAAGAGGTATTAGAAAATACTGAACTTACTGATAAGCAAAGGCTCTTTTGTATTTACTATGTTAAAAGCTTTAATCAGACGATGGCAGCTATAAAAGCAGGATATTCGCAAGAAAGAGCTCATGTAACAGGAAGTGAATTAGTAAGAAATAGTAAGGTAAAAGCCTATATTAAAGAGCTTAAAGGGAAAATGATAGGAGAAATATTTATAGATGCTATGGATGTATTAAATAAGTATATAAAGATAGCATTTGCAGATATAACTGATTACCTAACATTTGGACAACGAGAAGTACCTGTTATGGGACCATTTGGACCTATAGTTGATAAAAAGACTAAAAAGGAAATCACTAAAATAATTAACTATGTAGACTTTAAAGAAAGTAATGTAGTAGATGGAACAATAATAAGTGAAGTAAAACAAGGCAAAGATGGAGTATCTATTAAATTTGAGGATAGAATGAAGGCTTTAGATAAACTATCTCAATACTTTGATTTATTCCCAGATAACTTTAAAAGAAAGATAGAAGAAGAAAGATCCAAACAAGCTAGAGAAAAACTAGAATTAGAGAAATCTAAAGTAACAGGTAATGATGATGAAGTTCAAGATGATGGATTTATTGAAGCATTAGAAGGTAAAGTAGAGGAAGTATGGAAAGATGAAAAATAAAGTTGTACCTTTTAAGTTTAAACCTTTCTCTAATAAACAATTAAAAGTTTTAACATGGTGGATGAAAGACTCACCAGTAGCTAATAAAGATATACTTATTGCAGATGGTTCAGTAAGGGCAGGTAAAACTGTAGCAATGTCATTATCTTTTGTAATGTGGGCCAATGAAACTTTTGATGGTGAAAACTTTGCTTTATGTGGTAAAACAATAGGCTCATTAAGAAGAAATGTTATAAAACCACTGCTTAAAATGTTAAAAAGTAGAGGGTGCAAGTATAAAGAACATAGAACAGATAATTATATAACTATTTCTAAGGGTAAAGTTAGTAATGACTTTTATTTATTTGGTGGTAAAGATGAAGCATCTCAAGACTTAATTCAAGGTATTACTTTAGCTGGAGTACTATTTGATGAAGTTGCATTAATGCCCCAAAGTTTTGTCAATCAAGCAACCGCTAGATGTTCAGTAGATGGGGCTAAGATGTGGTTTAACTGTAACCCAGAAGGACCATATCATTGGTTTAAAGTAGAGTATCTTGATAATTTAGAATATAAAAATGGAATACACCTTCATTTTACTATGAATGACAACTTATCTTTAAGTGAAAAGGTAAAAGAAAGATATAAAAGAATGTACTCAGGTATATTCTATAAGCGTTATATTTTAGGTTTATGGTGTTTAGCAGAGGGTGTTATCTATGACATGTTCAACGAAGATTTCCATAAAGTTAAGACAGTTCATAGAAAGTATGAAAAATATTATGTATCTATAGACTATGGTACTCAAAATGCTACTGTATTTCTTCTATGGGGACTATGCGAAGGCAAGTGGTATATTGTAAAAGAATACTATTATAGTGGTAGAGATAAGAGTTTACAAAAGACAGATATTCAATATTCAAAGGACTTAAAAGGTTTTTTAGGAGATATAGTGCCTGTAAAAATAATTATAGATCCTTCAGCAGCAAGTTTTATAGCTCAATTGAGAAGTGATGGATTTGAACAAATAAGAAAAGCAAAAAATGATGTTTTGGATGGAATAAGAACAGTTGCAAGTGCATTGTCATTAGATATGTTTAGAGTAAATGATTGTTGCAAAGAAACAATTAAAGAATTTGTATCATATGTATGGGATACTAAAAAGATTTCTATAGGAATCGAGGAACCTTTAAAAGATAAAGATCATTGTATGGATGCTATGAGGTATTTTATATATACAATATTAAAACATAATATTGATGTTAAATATGACAAATCAGTTTATAACAAAGGTAGAGGACTTAAACAAAATGTTCTTAAAAAGTATGGAAAGAAAGGAGGTACAGTATTCTAATGAATATAAAAGAAACATTGCTTAATCTAAATGAAAGAGAAAAAAAGGAAAGAAAAAAAGCACTAAAAGATTTTATATTTTATTTAGGTGAATGTGAAAATATAGATGCTGCGAAATTAAACCAGGATCTATTAGGACAAAATTGGATTACTTTAGATGTTTTAGACTATATACCAAGTCAGATAATAGATAATAAGGTTAAACCACTTATAAACAAACAAGCACGTTTTATGTTTGGTAAGGAACCAGATATACTATTCAAGCCACTAGATAAACAGAACAAAGAAACATGTGAAGAATTAAGGCAGTATATAGATGCAATACTTAATGCTAGTAAGTTTTGGAGTAACACTATGAAAGCTTTTAGGCTTGCAACAGTAACTAAAAGGGTAATGTTAAGATTAGAAGCTAATCCAGGTCAACCTATAAGACTTTATTACCATGATATAAATGACTTTAGTTATGAAGTAGATCCTAATGATATAACTAAATTGAATAAAGTAATACTAGTAAGACAAGACGCTGAAACCGCAAATAAAGAGGAAAAAGACCAGATATGGTGCAGATACACTTACTATATGAATAAAATAAGTGACAAGGAATCTACTTGTTATTTAAGAATAGAAACTTTTAAAGGTAATAATCTGGAAGCACCAATAGAAATAAAAGAACAGGACACAGGACTATCTAAAATACCATGTTGGGTAATTTGCAATGAACAAAGCATTATTAACCCTTATGGACAAAGTGATATAAAAGATTTAAAACCATTGCAAGATAGCTACAATAGGAGACTATCAGATTTTAACGACAGTTTAAGATTTCTTATGTTTGGACAAACAGCTGTAATAGATGCAACAGAAGATACGGTTAATGCTTGTAACATTGCACCTAATTCACTTATGGCTTTAAAAAGTATTGATGATACAGAAGGTAATAAACAAGCTAAAGTACAGCGTGTAGAGAGTAATTTTACTAATGCAGATCCAGTACTTAAATATTTAAAAACATTAGAAGATAGTATGTATGAAAAATTAGGAATACCTAAGCTAGAAAGTCTTCAACAAGTTCCAAGTGCTAAGAGTATTAAATATATGTACACAGAGCTTGTAGCACGTTGTGAGGAAAAATGGCATGATTGGGAACCAATTATAAGGCAAATGATTAGATTAATAGTTGAAGCTTGTGGCAAATTTAAATGCTATGAAGAGTGGAAAGACGAATGGAATGACTTACTTTATAATATAGTTTTAAATAAAAATTATCCTATTCCAGAAGATGAAGAAGATAAGAAAAGACTTGCAATGGAAGAAGTAAGAACTAATGTAAGAAGTCATAGAAGTTACATAAAGGACTTTACAGATGATGAAAATGTAGATGATATTTTAAAAGAAATATGCGAGGATATAACTTCTATTACAGCAGCAGAGCAAGAACAATTTTTAAGGGAAATATAGTACTAATCTTCCCCTTATATATGGTATAATTTACATGAAGATAAGGGGGAAGAGTTATGGAGTTGGTAATTATATTTTATATACTATTTATAATATTTTTTTCCGCATTAATTATAGCAGCTTTAACAGGAAGTAAAATACTTAAAAATAAAAAAGATGCAATTGGTGTATATTCAATAGGAACTGTAATTTATGGAGTTTTATTCTTCCTAACTTTAAGATTAGAACTTAGTAGTTATAATTATATATACATGGAGCCAGTGAGTAAAACATCTATTAATAGAAAAGTTAGCAGTAAAGATGATGGAAATAACTTTAGAGCAGAAACTCCTAAAGAACAATTGTATGTAGATGAAAATGAAAAAGGAAAAGGTCTTATAAAAGGAAATACAAGTAAAAAGACAGGAGAAAAAATATATCATACACCAGGATCTAGATATTATAATAGTACAAAGATAGAAGATACTGAAAGATGGTTTAAAACTATAGAAGAAGCAGAGAAAGCTGGATACCGAGCACCTAAAAAATAATAGGTGCTATTTTTATGCTTAAAATGAGGTGGTGGTGTGAACGAATATTTAAAATTAGTAGCTAAAGCACAAGAACAACGAATAATGCTTACTAAGAAGCAAATAAAAAATATAAGGGATTTATACAGAGATGTAGCAAAAGACCTAGGAAAAAGGTCAAAGAAAGCGAATAAGGATAGTTTAAGCGAAATATGGTTATTAGATTATCAAAAACAATTCAAGAAAGATATAAAGGAATTAAATAAGATTCTTAAAAAAGATATAGAGTATTCTATATTGGAAAGTGCAAAGTATGCAACAAATATTCAAACTGACTTTTTTAATTTAATGGATGTAAAATATAAATTAAATTCAAAAGAAACATTTTCTAATATGTTTTCTAGGATACCGCAGCAGGCATTAGAAGAACTTATAAGTGGAGATTTTTATAAAGATGGTAAAGGGCTTTCAGAAAGGCTATGGTTCCATGAGAAAGAAGCTAATGCAAACTTTGATTATATAATACAAAAAGGATTATTAGAAAAGAAGAGTACTTATGAGTTAGCAAAAGATTTATCAGATTATGTTAATCCAGAAGTTAAAAAAGATTGGGATTTCAAAAGGATATATCCAGGAGTAGGAAATAAAAAGATAGAATATAACAGTTTTAGGTTAGCAGTAACTTCTATAAGTCATGCTTATCAATTATCTATGCAGAGAAGTTGTAAAGCTAATCCATTTGTAGAAGGAATAGAGTGGCATACAAGTAATTCTCATAGAGGGCCATGTTCCATATGCAGAGAAAGAGAAGGAAAAACATATAAAACAGATGAATTACCTTTAGAACATCCAAATGGAGTATGTTATTTTACACCTGTTATAACTAAATCTTTAGATGAAGTGGGCATGGAACTACATGGTTGGTTATATGGGGGAAGTAATAATAAATTAGATGATTGGTATAAGGAATATGGAAGAGAATTTGTTGGAGAAAGTAATTTATTTAGAAATATTAGTAAAAAAGATAACAATAATGATATAATAAAAGAAAAGCCTATTAAAGATTTTAAGTATCCAGATATAAAAACTATAAAGGAAGCTGAAAAATGGGCTATAAACAATCTTAATCTTAATAAAATTAGTTATAAAGATATTGATATAGGTGTAGCTAACTATGTAAATAAATCAATGAGTGAAATTTATCAGGAATATCCTTTATTAAATGGATTTATTCAAGAAATAAAAACAGATGGGAGAGCTTCAGCACCAGCAAGTGCGAGTATAAGTTTTAAAGATGGGAAATTAAATACAAAACTTATTCTATCAAAGAAAGATTTAGCAGATTTAAAGTCCATTGATGATATGATTAAAGACTGCGTTGATTATAAGTGGTGGACACCTAAAGATGGAGTAAAGGGAATAATAAAACATGAAATGGGACATATGATAGAATATGCTACAACTTTAAAAAAATATGGTGTAATAAATAAGAATAATGAATTAAGTGACTTAAACAACTTAGGTTTAGCTTTTAGTAGAATAAAAAACGGAGAATTATCTAAAGAAATAAAAATGAAGGCATTAAACAATCTTAACATAGTAAATACTAAGAAAAATATTAAAGAGAATTTAAGTAATTATTCTAATCGCAGTACACTTGAGTTTTTAGCTGAAGCCGTTTCAGAAGATAATCCTAGGACTCTTGCCAAAGAAGTAGTTAAATTATTAAAAGAAAAGATAAAGGAGGTATGGAAATGATAGCCTTACCCAAAGAACTCATGGGAATAACTGATTATGATGAAAATGACAATTGGATTTTAAAAGATGGAGCAACAGAAGAACAAAAGAAAATATTTGAAGAGTTTAAAAGAGATTTAGAATCTGCTAAATTATCAGATGTAGAATTGTTTATAGATGGGAGAAATATAATAACAGGTGAGCCCGAGAGATATTAAGCACTTACTAGGTAAAACAGTAGGTGCTTTTATTATGTAAAAAATTAAGGAGGAATGATGTAAATGGAATTTAGAAAAGCTTATGAGTTATTAAAACAAGGTAAGCATGTTAAGAGAAAGCATTGGGGTGGATATTGGAATTGGGAGAACAATACTATAATGATGTATTGTAAAGATGGGAAAGTATTAGATATAAGAGATACAAAAGATGTTGATTTTACAATGTCTAATATGTTAGAAGAAGATTGGGAGGTTGTAGAGTAATGCCAAAATTAAGTGAAATATTAGGAGAACATTTTAAACAGATACCAGAGGATATTCAAAAGAAGTATAAGGACATAGACTTAGTGGACAGCTCTAACTATATAGAGAAAAAAGAGCTAGATACTGCAAATGAAACAATTAAACAGTATAAGAAAGACATAGCAAAGAGGGACAAGGATTTAGTAGATTTGCAAGGCAAGATTAAAGATAATGAGGAGCTAAATGCAGAGATAGAAAATTTAAAAGCTGCTAATAAAAAAGCAAGTGAGGATTATGAATCCAAGCTCAATCAAATAACATTTGAAACTAAGCTAGAGAAAAAGCTAGGAGAGTTTAAGCCTAAGAATTTAGGAATACTTAAGAAAGCTTTAGATATAGAAAAAATAAGTCTTGATGGAGATAACTTCTTAGGACTAGAGGATCAGATTAAGAATTTAAAAGAATCTGATCCTTATTTATTCGCTGAGGAAACTCCAGGAGGTACTGGCAATATAGGAGGTGGTCAATCCTCAATAATTGACGATAACAAAGATTCTAAAAGTATAGGTGAAGTTTTAGGGAAACAACAAGCTGACCAATTTAAAATAAATGAGACTATAGACAGTTTCTTTAAATAAGGGAGGAATGAAAAATGAGACAAAGTACAACTAAAATATTAGGAACTCAAAAGAACATTTTAGCATTAGCTGGAGCATTATTTCAAAACACTAACATTAAGGTAAGTAAGACAGTAGCAACATTAAAAGAAGGAATACTTGAAGCTGGAACAATAGTAGACAAAACAGGTAAAAAAGTAACTGATGGTACTGCTTTTGGAATTGTTTATGAGGATGTAGATTTTAATAATTCCAGTGGTACAGAAGTAGTTTCAGTGACCATTTTTGGTTTTATAAAAGAAAATGTATTACCACAAAAACCAGCTACAGAAGTAAAAGCAGCATTAAAAATGATTCAATTTTTATAATTAGGGGAGGAATAACAATATGAATTTACAAGATTTTATAAACGCAAATGAAATAGCATTATATATTAAAAATTTACCACTACAAGTTACACTAGATAAGGCTTTATTTCCAAATGATAAACAACTAGGAATGGAACTAGAGGTTGCTAAGGGTGCAAAACAAAGACCAGTAGCTTTAAGAATGAGCACATTTGATGTAGCAGTTAAACCAAGAACTTTAAAAGCTGACATAAACATTGAAAAGAAAGAAATGCCTTTCTTCAAAGAATCAGTTTTAATAAAAGAAAAGGATAGACAACAAATGCTTTTAGCAATGAAAGCTAATAACCAAGAACTAGTCAATCAAATATTAAATCAAATATTCGGTAATTATAAAGCTTTAGTTGATGGTGCAGAAGTGCAAGCAACTAGAATGAGAGCTCAACTACTTCAAAGTGGAGAAATAAAGATAATTACTGATGATGGTGATGTAGTTGTAGATTATGGAATTCCATCTGACCATAAAGAAGTGCTTGCTGGCACTGCTAAATGGAGTGATAAAACAGCTAATATAGTAGGAGACATTGAAAGATGGCAAAATACTTTAGTGGCAGAAGGTTATGCAAAACCTAATAGAATGTTAATGACACAAAAAACTTTCGGATATATAAGAGCAAATGCTGCTATAGAAAGTGAAATTAAAGGCATAAGCAATGTAGTAGTTACTGATAAGCTAGTAAAGAATTACTTAAAAGAAAAATTAGAAATAGAAGTGGGTATATTAAATGGTGTATTTACAGCCGAAGATGGTAGCACAATGAACCTTTACGAGGATAATAAAGTTACTTTAATTCCTCAAGGTGCACTAGGCAAAACAATGTATGGTACTACACCAGAAGAGGCTGACAAAATGTTTGGTTCATCTAAATTAGATACTCAAATAGTTAATACTGGTGTAGCAATAACTACTATGGCAAAAGAGGATCCTGTTACAGTAGAAACTAAGGTATCACAGTTAGTTCTACCTTCATTTGAAAGGGCAGATGAATGTTTTTTTGCTACAGTAGCTGAATAAGAGGGAAGGTGAAATACCTTCTCTTTTTAAATTAAAAATAGAAAGGGTGGGTAATATGGCTAAGAAAGCAGATGAAAAATTGAAGAAAGTAAAAGCTTTAGTAAATATAAAATATGATGAAAGCTGTTTTAAAATAGGTGATGAATTTGAGGTTAGATCAGAGGATAGTGAGGAAATGACCAAGAGAGGTTATGTTGAGTCTTTAGAGAAAATAGAAAAAGGAGAAACTAAGGAAGGAGAATAAGCATGACTATACCTTTGGAAATATTAAAATTTAATCTTCAGGAGCGAGAATATCCATATTTTGAAGATAATGACCTGAGTTTGTTATTGCAATCCAATGATAATAATATTAATAAGGCTAGTTATAAAGGTTGTTTATTAAAAGCTAACGCAGATGATAGGTTAGAGGTAGCAGGTGTAAAGTTAAGTTCTAATAGGCAATATTGGTTAGGATTAGCAGAGGAATATAAAAAAGCATATGAAGAAAGTTTGCAAGGGACTATAACAGGATATAAAACATCAATGAAAAGAGTTGATGGACAATGAGTAAAATTAATAGAGGAAAAATAAGCAGGCAGATATATAAGCAATTAGAGAAAAAAGGGTTATTAAAGGAAATTAAGATATTAAGAAATGGTAAAAATGCTTATGGAGAGAAATTGGAGGATTTATATGTAACAACTATTAAGGGTTATTACTACAGGGAAAAAAGTAAGATAAATATCAGTACAGATACAGGAGCAGCCATAAATACTGATTATCGTGAGAAACTATTAGTTACCTACAATGAAGAAAGTAAAAAGATAAAGCAAGACGATTATTTAACATCAGATGGCATTAAATACAAAGTAATTGATACAGGGAATGTAGAAAATATAGTCTTCGATATGTATTTGGATAGGATGTGATATTATGAGTTTTAAGTTTGATATAGAAAGTGTTATTAATGGATTATCTGAATTTGAAGTGAAAAGCAAAGCAGCCATAGGAGTTTACGCTGATACTGCAGGTAAAAAATTAGAAGAACATGCAAAGAAAAATGCATCCTGGACTGATAGAACTGGTTTAAGTAGAAAAACTATAGAAGGTGGAAAACAGTGGGAAGGAGATAAATGTAATGTTTATGTAGCTGGTAACACAGAACATTTTCCATACTTAGAATTGGCAATGGATAAGCAGTATAGCATTCTCAACCCTACTGTAAATAAACTAGGTGCTGAAATACTAAGTGGTATGAATAATTTGTTAGGAAAGTGATTAAATGAGTAGTTTTAATTATAAAGTTCCAGGAGATAAGTTACAGGAGGATTTAATAAATAGTGCAATCCCCCAAACGTTATGGCAGAAAATATATTTACACTTAAAAAAACTAGGATATAAAGTGTATTCTCCTGGACAAAAAAGAGATAAATGCACAGAACCATATCTAGTAATAAAAGAAAATGGAACCTATGCATTGAATAGCAATGTTAATGGTTATAAGCTATTTGATATTATTATTTATCATCCTATGTCTAATTATTCTACTATGGAGTTTTATGTGGAGAATATAAAACAAGCTATGAAAGATATAACAGAATTAAGACCTACAGGTAATGAAACACCAGCAATTATTGATGAAAAAATAGAAGCTTATACTGCTAGCATAGAGTACCAACAATTCAAGAGTTTAAGGAGGTAATAATATGGATGGTAAAACATTAGTTAATGTAATTAAGGTAAATTTTATTGATGAGGTTACAAATATAAAACATACAATTGAAACTTCTGATGAAATAGATATAGAACCTATTAAATCTGACGGTAAAAGAGATATATTAAGAGTTAAAAATACTATTTATGGGATTAATGAAACAGAAGATATTGTAATTGGTTATAAATTAAAACTTAAAGACAACTTATTTAATATGAAAACTATGTCCCTTATTGATGGTGGTAGTATAGAAGGTAATAAATATTGTGGTGCAGAAGCAGGAAAGGTAGCAGAAAGACATCCATTTACAATGGAAATATTTACAGAAGAAAAAGATTATAGTAGAACTACAGGTTATGCTAAATTTACTTACAAGCATTGTAAAGGTAAACCTGCTAAATATAAAGTTAAAGATGGTGAATTCTTAGTTCCAGAATATGAAGCAGAATCAATTCCATTTAGAGGAGAGAAACCAGTTGAAATAGAATTCTTGGATACTTTACCTAGCAAACCAGAAAAACCAGATCCACCAGTTGAAGATATAGGAGTTGAAGGTGGAACTGTAAAAGATACAAATACAGATGTTGGAGTAGAAATAACTAATAGGATAGTATGGACATTTAAAGAACAAATAAACCAAGATTATGTAACTAAGACTAATTTTAGCACAAGGAGAAAATCAGATAATAGCTTGGTTCAAGGTAATGTAACTATAGATGGAACTAAGAAAATAGTAACATTTGTACCTACAAGTTTAGCTGTGGACGTAGTGTATATAGCACAAGCTAAGCCAGTTGACAAGCTTAATGGTAGTGGTAAGACTGCAGCATTATCTACAGAGTTTAAGACAATAAAAATAAAATAGTAAGGGGAATGAATGATGAGTATACAAGTAACAAGTATAGAAGATTTAAAAAAGATGGCTGAATATGATTTAATTGAATTACCAAGGTTTAAAGCAGAGATACCTTTTGTAGCTAAGGTTAAGAGGGCATCCCTTTTAAATTTAGTTCGAAAGGGTGTAATACCTAATAAATTACTTAGTGCAGCAGAGGAATTATTCTATGGCAAAAGCTCCAATAAGGGTAATGTAGATATGAAAGCGTTGACAGATGTTATGTTTATTATGGCAGAAAATGCACTTATAGAACCTTCAGCTAGGGATTTGAAAGAAGTAGAGCTTGAATTAACAGATGAACAGATAGTAGCTTTATTTAATTATACACAAGAGGGGTTAAATGAGATAGAAAAATTTCCTGAAAAGCCAGAGAATACTGTCGGTGATAACAATGAGCAAACGGTATCAAATAAGACCAAGCCAGATAATAGCATTAACAAATGATTATGAAGCGTTTTGTTTTGATGAAGCTTGTACATATATACTAGCTGAAATGAGTAAAGAAAATGGCACAGAACCTAAATTCGAAGATAATAAGAACGTAAACAAACAAAACAATAGTGATGTAATAGATTGGTTGAATTCCAATAATAAATAGGGGGAATTAATTATGGCAACTTTTAAAAATCCAGTAGTAACTTGTGATAAATGCGGAAAAGATTTTAAATTAAAACAAAATAGATTAAAGAGTGAGATAGTAAAAGAAAATATAGAAAGAACTTATTTTAAGTGTCCAAAATGTAAGCATAAATTTATAGTTATGTATAAAGATCAAGAGATTAAAGAAAATCTAAAGAAAATGGACAACATAAAAGTGCAGATACAAAAATCAATAAATGAAAAGAAAAATACAAAAAAGTTAATAGGGAGATATGAAGAGTTATATTATAACAATTTAGAAATAAGTGAGAAATACAAAAGTTTATATGGAAGATAAAACTTAAAGGCATAGAAATATGTCTTTTTTATTCCCTAAAATGGGAGGTGAGAACACATGGCAATAAACGTAGGTACAGCTGTTGCTTACCTTACACTTGATAGAAGTGGATTTAAAAACAGTTTAAAAAGTGCAGGGGCAAATCTTAAAAACTTTGTTGCTGGTACAGGTGGAGCAGAGGATAGAGTTAAATCTTTAGGAAATGCTTTAACTAATACTGGTAAAAGTTTAGCCAAACCTTCTATTGCAGCAGGTGGATTTTTAGGAATGGCAACTAAGACGGCAATGGGTTTTGAAGAGCAGATGAGTAAAGTTCAAGCAATTTCAGGTGCAAATGCTAAAGAGATGGGAAAATTAACTGCATTAAGTAGAGAATGGGGTGCAAAATCTAAATTTAGTGCAATGGAATCTGGACAAGCTCTTGAATATATGGCTATGGCAGGATGGAAAACGCAAGAAATGATGGATGGTTTACCTGGTATTCTTAATCTTGCGGCAGCTTCAGGAGAAGAGCTTGGAACTACATCTGATATAGTAACTGATGCTTTGACTGCTTTTGGACTACAAGCTAAAGATAGTGCTTACCTTGCTGATTTATTAGCAAGTACTTCAAGTAACAGTAATACTAATGTGTCTATGCTTGGAGAATCATTTAAATATGTAGCACCAGTAGCAGGAGCATTAGGTATATCTGCTAAAGATACATCCTTTGCATTAGGATTAATGGCTAATGCAGGTATAAAAGGAAGTTCAGCAGGTACATCTTTAAGATATTCTTTAACTAACTTAGCTAATCCTTCTAAGAATATGAGAAAAGAGATGGAAAGGTTAGGAATTAGTTTAACAGATAGTAATGGAAAAGTTAAATCTGGAAAAGTATTATTTGACGAACTAAGACAAAAATTTAGTAAACTAACAGATGCACAAAAAGCACAATCTGCAAGTATAATTTTTGGTAAAGAAGCTATGTCTGGGATGTTGGCAGTAATCAATGCTAGTGATAAAGATTATAACAAACTTTATGGTAATTTGAATAAGGCAGAAGGTTCAGCTAAAAAGATGGCTGATACTATGCAAAACAATTTAAAAGGACAGTTAACGACATTAAAAAGTGCATTTGAAGAATTGCAAATATCTTTAGCCCAAGCAGTAGTGCCAATATTAAGCAAAGTAGTTAAATTAGTAACTAAACTAGTTAATGGATTCAATGGATTACCACAGCCGGTTAAATCTGCAATAGGAATAATAGTAGGTGCAATAGCATTATTAAGTCCCGTATTCTTAATACTTGGGAAACTTGTAAAAACAATAGGTAGTGTAATTGGGATGTTTGGGAAACTTAAAACAGCAACAGGTGTTTTTAAATTATTACCGACATTAATTACTCCGCACACACTTATTATTTTAGGTGCTATAGCAGCAATAGGATTTATAGTATACGAGGTTATAAAGCATTGGGATAAGCTAGTTGCAGCAGGAAAGAAAATGGGGAAAATTTTAGAGAAAATATTTAAAGGTATAGGAGATACAATTAAATGGGTAATAGGTGGATGGAAATTATTAATAGGACATTTCATTAATTGGGGAAAACAAAAAGTAAAAAATATAGTAGATGGATTTATAGGTGGAATTAATAAAGTAAAAGGATTATTTAAAGGTAAAGGCAAGGAACTTGGAAAAGGTGTAAAAGAAGGATTTGAGGAAGAACTACAAATACATTCGCCAAGTAGAGTTTTTGATGGATATGGTGGTTTTATAGGCGAAGGATTAATTCAAGGTATAGATGGACAAGAAGATAACATAGATACTAAATTTAGAGGATTAGGAAATAGGATTAAATCTTTAGGCAATATCAGACCAGAATTTAATGGATTAAATAATATGGCACTTAGTGGAGCGTATAGTGCTGATAGTGGGTTTAATTCTATAAGCAATAGTAATAAACAATTAAATTTTAATCCCACAATAAATATGTATATAACTGTAGCAGATATTGAGAATAAAGGAATAGAACAACTAACACAAGAAGTAAAATCAATGGGAGAAACAGCAATAAAGAATAGTATGGTAGATTTATTTATGAAAGATGCAATTAGAAATTAGGAGGTGTTATAGCTGGATTTTAATTTACAACTTAAATACGATAATGAAACAGATACAGGAGCAGTAATAACTAACTATAAGCCTCCTGTTCCTGTTACATTAAGAAAGGGTAATAAAAGTTTAACAGGTTATACAGTATTTCAAGATGCAGTTAAATCAGATTCATCAATAAAATTTTCCGTAGCATTTGAAACACATACACCAGAGCAAATATCAAAATTTAAAGAGTTTAGAAATAGATACAATGAAAGATTTATATTTATTGATGAATTTGGCACAGAATACAGAGGATATTTCCAAGGAAATTTTGATATAGATACTCCTATTGAAGGTGATATATACTACATGAGTTGTGAATTAATATGTCCTTGTGGAATTGAAGGTTGGAATGGTGATAGCAATGAATTATAGGGTAGAAATAATTGATAAAGGTGAAAAAAAAACAACCTTAAAAAGAATTGTATTAAATATAGTTATAGATAGAAGAATAGATATGCAAACTGCAAGTGCTACAGTAGTAGTGCAAGATGTGAAGAGTAAAGTACCTAGCTTATATAATTTTGGATATACAGATGGAATAATTGCCAATGGAAATCATATTAGAATTTATATAAGTGATAAAATACAATTTACAGGGATAATAAGGAACTTTGAAATAGATGATGAAGCAAAAACTATAAATATAAATTGTCATGATGTAGGTTGCAAAATATTAAGACCCATTGATGGAGGAGTTCCTTACCATGTATTTAATAATATTAATGCTACACAATTAATAAGCAATATGGCAATTAAAGCAGGTTTAGGAACTCCTATATTCAATATAATATCAAGCAACAATTACGCGATTAAAAATTTAAAAATGCAATATGATGTACAAATGAGTGATATTATAGATGAAGTTTTAAAGACATTAGAAGCCAGAGCTAGAGTATTAAAAGATGGTACATACAAAGTAGAAAAATTATACCCAGATTACAAGGCAAGTAATGTACAAAATAAAATAAACTATGATTTTTATTACGAGGATTTTATTATAATAGGTAAGGCAAATAGAAAAAGAGGTAGTGAAACCTTATATAATAGAGTTTTAGTTAGACATGCAAATGATAAATATAATGTATTTGAAGATCCCAGTATGGTTGCTTATTTAGGATATAAAAACTTTAAAGAAATTGAAAGTCCTTTAGGTGATACAGTAGATAAAAGACAAAAGGTAGCTAGTAGATTTTTCTTAGATTGTTGGAGAGAAAACTCTAATGTAGATATAGTAGCAACCAAAGGAAATCCTGATTTAGATTTAGGTAAAATAGTAAGAATAAAATTAAATGAAATGATTGGACATTATATCGTTACAGGAATAAGAACAGAATTAACACCAGATGGAAATTACATAGACCAAATATCACTTGATGGTATGAGAGAAGTAACTAATATAGCTAAATTAGGTAAAGGTAATTACACCTTAAAAGAAGGTGATAACTAATGAAACAGAATGATTTTAGAAAGCCAGTTGTTTATATTTTAGACCAAGAGCTTAGAAGAAGGGATTTAAAAAATAAAATAAAGTTTGAAGGTGGAGAAGAGTATAAAGGTGAGTTGCCTGAATATCCAGTAAGATTAGTAAGAGATACAAATAAAAAAGTGATAAAATGTATATATGCAGAAAATACAGAGTTAGAATGGTCAGAGGAACTTATAAGAAATACAGAGGGAAAGGTATATAAAATAAAGACTATATATCCAAATAAATCAGAGAAAGCAATAGAATTATTCAAAAATCCAGATAATAGAGTAGAATTAATAGATTATGTATAAGGGGATGATGATATGGGGCTACCAAGTTATGTGATTAACTGGGATGAATTATCCGATTTAATATCAGATTATTTAAAGAATGGTGTAAATGCAGATATAGGTAATATAACATTTAATACAGAAGATATAGAAAGATTGTTAAGTGAAATTAAAGATAAAATACAAGGCGTAGACTACAATGACTTAATTCAAGCCTTAAATGATTTAGGTGCAAAATTAGATGGATTAAGTGGTAATTTAGGAATATTAGGAACACAAAAGATACATGGAAAAATGTTAGAAGTACCAGCAATAGTAGGACAGCATACAATAGAATTTACTGCACCAGCTAACAGTAAGTTAACGGGTATAACTTATTCTCAATCTAGTTGGAGGTTTGAAGATAGTTGGGATTTAGTTGTAGAAGAAGAAAAACTATTCACAGAAACAAGAACTAAAGAATATGGTGAGCATAAGTATTTTAATGTATTTTATCCTGTAGATGGTACTATAAAATTTATTTATAACAATAATAGTGGATCTTCTAAAGTGGTATGGGTAGATTTTTGTATTCTTGAAGGTGGTTCAGAATGAGTTTATCAAAATATATAATTAATTTTGAGGAATTAGCAGATGAATTAAAAAAACGAATATTAAAGTTGATTGATGATGAATTGAGGAGCAAATATTCAAAGTTAAATACAAATGATATAGAAGGACTATTAGAGGATTTAAAAGAACTATTACCAAGTCTACAGTACAATATGTTAAAGAAAAAGATAGAGAATTTTATTTATTACAATTATAATGGTATGCAAAAAGTGGATAGTAAACTATTAGATATTCCTCCAATTATACAAGAAAATAAAGAGGATTTTATATTTGATAAAGATGTATATTTGACAGGATTACATTTTAATCAAACTGGATGGAAGAAAGAAGATACTTATACCTTAGAAATAAATAAAAATAGAATAATAAATAATGCAACTATAAAAGAAATAGGAGAGCATAAATACTTTAATACATTTTATAAAGTTAATGCTAATACACCTATTTCTTTTATTTTGCATAATAACAGTGGTAATAGTAGACAAACAATGGTAGATTTAGAATATTTAATAGGAGAAATAGTAACACCACCAGAACCACCAATTGAAGAACCGGAAACACCAACAATTGATGATATTTTAAATGACTGGGATATTGCGGTAGTAATGCAATGGGAAGCTAATACAATGGCAGATATAGATTTGCATGCAGTTATTGGAGATAAATACATCTATTTTGGAAATAAAGAGGAGTGGAATTTTTTCTTGAATTTTGATTTTAGGCAACATATGACTAATCAAAATCCTGAAATATTAAGTGTTAAAGGATATAAGAATAAAAAATTAGAGATATACATACACAACTTTGATGGGATTGGACTAAATGAACCTGTAAATTTAAAGATATATGAAAAAAGACCATATGGAAATAACTTGATAAAGGAACTAAATATAGATATAAGCCCAAGTAGGGATGTAATTAGAGAAACTATACAAATAAATCTAAACACTTTAGAAATAAAAGAAATAAATAGAGATATTAACTTAGAAAAATTTTTAGGAGGTAGATAAATATGGCAGAAGAAAAATTTTATTATGCAGAAGGAAGTTGCCAGGTTAAGGATTTAGTAAAAAATCTAGTAACGGAAATAACAAAAAATGCTGGAATATACAAATGGAATTTGGTTTATCCAAAAACACTAGATGAAATAGGAACATCAGGTGAAAATAAAGAAATAAATCTTATTACAGATGATTCTACAACAGATAAGGTGGATACTAAGTTTACAGTAACAGAACAAAATGATACCTGCATTATTAAGGCTACAACAACATATGGTAAAGAATTTTATGTAAAAATAGATAGAGAAAAAGCAGATTTAACAAAAGAGGAAAAGAAAGCATTAGTTGATTTTAAAAGTTTACATAGATATTGTATACGGGAATATTGTTATGATAGAACAGATGCGCAAGTATTAGAAATAATGGCAGGTGTAAATGATGAATCAAGTAAAAGTGGAGATTACAATGCTTACGTAAGTGCCATGACAAAATCTAATTCAATTAATAATATAAGATTACAAATATCAGATAAATTAAATAAAGAAGGTAATGATTTAGATATACCTAAAAATATGCAAAAGGAATATAATTATAGACTAGCTTGGTATAGAAAGTTACAACCAGAAATTAAGGACTTTTTACCAGTTCAATATTGGATCAATGTAACTAAAGATAGTATAAATTTAGTTTTGCGTGGAGATCCTTCGGCAGATGTGCATCCATATGAAAACTATTTAACATCCTATGCTTATATGGGAGCATTAAAACCAGTTGAAGATAGTGCTACAACAGATGATAAATATAATTTTGGTATAACTGTATCTTCTGATATAGAACCTAATTATACAAAGCTCTATGGAGAAAGAACAGCTACTGGAGTAACAGATATGTGTATGATAGCTAATAAAATAGGTATGCCTTATCAGCCACATTACCCTGCTTTCTATGCTACTAATCCATTTATGGACAAGTGTAATGTAGAAGGTTCGAGATGGAATCATAAGAAACATCAATTTAGCGATATTACTTTAGTACATCCAGTAGATATGGAAAGAGGAAAGATGATCAATGTACTTGCTGGAGATGCTAGTGCAATATATGATATGGATAAATTAGCATATAAGAAAGATACAGATGAAGAAGAATATTACAAGAAATTTAAACTCACAGCACCTTTTAATTTTCTTAATAATAGTGCAAATATTAATTATTGTATAGCTATAAGGTGTTATAAAACAACAGAATAGGAGGTTCTTTATATGCCCCTACATAAAATACCCCTATGTAGTCTTAAATATGTAGGGGATACTCATTCTGGTGCAACTTTTAAATATGATATTACAAGCAAAGTGTTAAAACATAAAGAAAAACTATTTTATAAGGATAAGATAGTTGAAATAGATAAGACTAGAGGAGATAAACTCTTATATAAGTATTTTTATAATATAGAAAAGAAGAATAATGAATATTTATATAAAGAAAATCCCATTATGGTAAAAGAATTTGAAAAAGAGCTAAAGTTGCATGACAGAGAAATAAATAAAGATAAAAGTATGCCTTTAGAAGATACAAACATTATAGAATTTAATAAAAAGAAAAGTATGCAACTAGGGCAAAAGGAATGTATAAATATAAACATAGAAGTAGATACAGAACTCCAACCTAGAGATAGTATAGATATACATGTAGAAACAGACAAGAATTTACTAGAGTTTAGAAATATACAATTAAATAAAGATAATAAGTGTATAAATCTAAATATAGATAGAGAAAATTTACAATTAGATAAATTTGAAAGTATATATACAAATAAAATAACAGAAAAAGGATTATCTAAATATCAATTTATACAAAGTTTAAAATTAGAGAAAAACACAGGCATGGAAAAACATGATTATAGATTTTTAAATAGGATTTATTTTAAGGAAATTGACATGTATAAATTAAAGTTTTTTGAAAAGTATAGGGATAAGTATATAGATAAAGATAAATATAATTTTGTAGACAGAGTGAATTTTAAAAAAATAGATACTATAGAAAATCCAGATATTTTAAAAAGAATTGGTATAACTAAAATATTAAAATCTAATATTGTAAATAATTTAGATAGTATAACCATAAGAAGAATTAGTAAAGATTACAATATTAGATTAATGCATAAGATACTACTTAAGAATATAGAAAAATACAAATATATAGATTGTTTAGATAGAATAAATATAAAATCTATAGATAAAGATAAGAATAAAAAGTATTTCTATAGAGAAGGGTTAAAGACTATAGATAAATATTATAATAGATATTTAGATAGGGAGGCTATAATATCTATACATAGAGATAATGATAAATATTTAGATTATATGCCTTTAATAAATGTCTATAAACAAATAGAAAAAGATTTATTAGATTTAACTATTTGGGATATATATAAAGAACATGATGAGCAATTACAAGGTACACTCATGAAAAACATATATAAAGTAGGCAATAATAATAAATTTATTGAGGTGGCCAAAAGATGGTGGTGGCTAAATCCCACAGATCCAATGGATAAATTAATTATTCCTAATAAAGACTATGAAAAAATGAAAGAGCTATTAGAAAATCCTAACTATGAATATCTAAGATATAATGACAATCCTATTGAATGGGGTAAATATTGGGGAATAGATTATAATATCCCACCTATGCCAGTATCAATAGAAATAATGGTAGATTTAATAAATATATTAATAATGGTATGGCATAAAAATGTACAAGCTTGGATGTGTTGTAGTGGTAAAGAAGCTGTACAATTTATTATGGAATTATTATATGATTGGTATTCTTTAGATACATCCAAACCTAATAAAGACTATATTAGAGCTTACAGGTGGGTTAGATGGGAAGCTGAAAAAGTATATTTCTTAAACGCTGAAAATGGATTACAAGCTATAGGAATATTAATAGCTAACTTAATAGATTATATGAAATACCATCATTTTGATTTAGTCCCATTATGGAGAAATCCAAAAGCTATGGATATAGAAAGAAACTTTAATAGGATAGTACAAAATGGAGATATTATGAAAGATTTAGACAAACTAAAGGGCAGTAGACACTATTGCATTGAAACGCAAAATTTTGAAAGAAAAAATATATTTGGAGAGTGATAATATGTTAGAAAGCACAATAGATTTTAAAAAACCAAGACAGAAGATGTGGGGAATATTAAAAGATAAACAACTAATATCTTTACCATATGGACATGAAACTAATGAAGAAGGAAAAGAATTAACCAGTTATGCAACTAATTGTTATGAAGATGCATTAGCAGAAGCACACACATTATTAGCACAGGGTACAGGAACCAAAAATATACAAGTGGTTGAATTTGTACCATATGATTATATTATGCAGCCCAGAGTTTAATTGAGGTGGTAGTTTATGAAATTAATAAAAGTGAAGGATGGAGTATTAGAGGTAGATAATTTTTATCTTACCTCTTTTTTTAATGACTTTGCAGGGAATTCAAATATAACTAGAGATATTAAAACAGGTAAGATAAAACTAATGAGTAATAATAAGATGGAAAGAAAATTCGATTATAATGAATTTGTTATAGAGCTAGAGAAAGAAAATTTCAAGCACATGAATGACTATGATTACTCTATGATTTACTTAGGAAATAATGAGTATACCTTTGGTATTAAAGATGAAGAATTAGATCAACAGAATAAATATTGGAAAATATTGAAGCAAGATAATTATATACAGGCTTATTCGAGTACAGATGGTATGAATTATAAAAATATAGGTGGTATGAAATTCGATGAAGCTATGACTAAACAAGGATTTATGAAATATAGTGCTGAAGATTTTATATTAAATAAATATAGTGTTTATGATAATCCTTATATAACTATTCAAAATTTCCCTGAAAATACTATTTGTGAATTGTATGATTTAAAAGGGAATTTATTAAAGACAAGGATATTTGATAAGAACTTAGAATGTAAAGTATTTATAGATGGTAATATGGTAGGATATTTTATATTTAAAGATCAAGATAATAATGAGATTTACACTACAAGTCATTTATCTTTACAGTATGGGGATGTTTATATATGTAGTCCTTACAATTTTGAAATTATATATCATGGTAATGTGGTTACTAATGTGAATCCTGCTACACTCCAAGATTTAGAGGAGCTTATAACTATTAAAAACATTGGAGATAAAGATTATACAAACATCAACATAGGCACTGAAACAAGCAGTAATGACCTGATACAATTGTCTTTTGATGGAGAAAATTATAAAGAATGTTTGGTTGTATATAGTATTGAACAAGGTGAAAGCAAAGATATATTTGTAAGAATAACTAAAAATGCAGAAAATCACAACTTTAATGTTAGGGATTTTCAATTAGTTATCAATGAATAGAGGTGATGATGTGAGCAAATTTTTTAATGTAACACATGACAAGGATATAGTTTTAGATGACAGTATAATATCCAATAAGACTGCATGGACAAGTAAAAAAATACATAAAGAGATAGTAGATAAAAGAATTACTAAATTTGAAGAACTTGAAGATGTAGATGTAGTAAACAAGCAGGACAAACAACTAGTAGCTTATTCAAAAGATACAGGTAAATTTACTACTATAAATGGAACAGAAGCAGGAGAAATTACAGGTGCAAGCATGAAGCAAATATCTAAGATGGGTGTTATAGGAAGTCCTGAAGAACCTAGAATAGTAAATATATCTGTTAATACAATAGATTTTAAAGTACCCCGTGTAAATGTGCTTAAATATGATTTAGGCACACAGAATATTATATTAACAAAGAATGAATTTACTAATGGTGAATCTAACGATTTTGAAAAAGATCCAATGATGGTGTTTGATGGTAAGGCACATCTAAAAACAGAACACACATCTAATTTCGCGTTCAATAGAGAAATGGATATTAAATCAGAGTATGTTGTTACTTTCAAAAAGTCCAATTTTAAAAAAGTAGAAAGTTTCAAAATTGGGGAAGATGGGGTTATAAAAACTTTAGTTACTACAGCTATACCCCATGATAGATTATTAATTCCAAAGTGTGATATGAATTTAAGTAATGTAGAAAATATAGACTATTTTAATCTTACTGCTACAGGTGAAAATATAAGGATAGTTTGTTCAGTTGATAGTGGACAAACATGGAAAGTATTTAAAAATGAGAAATGGGTAGATATTAATTTAGATGTAGAGGACATAAGAACTAACGGGATGACTATTGACTTATTTAACTCTATTAATGATGTATTTTGGAATGAGTTAGTAACTACTAAAAAGATAAGATTTGCCTATTTATTATCCATGAACAATATAGATAATATAGAAGAATTAAAAAACCTTGACCTTCAATATGACGGTGAGGGCAAATGGATACAAGCTAAAGAAGATACATTTGATGTTGTATATGCAAGTAATACATTATTACAAGTACATGTGAAGTTTAATGGTGATATAAAAATCAATTATTGAGTAGCCGATTTATTCGGTTGCTCTTTTCTTTAAAAGATAGATTCTAAATAAGATTTTATAGAAAGGAATGATAGCATGGATAAAAACTTATGTGTAAACGGTACAGTTATTGCATCGAATGATTATCAACCCTCATATCCTGAAAGAAAGAAGGAATGTGCATTTGATGGAAATTTAGATACAAATTGGGGTGGCATTACTGGACAAACAATAGATAAATCTTGGATAGGGTATGATTTTTTAAAACCAATAACAATAATAAAAATAAAACTGCATCAATCTAATAATAGTAATTCTTCTATAAATATAGTGTCAGTGCAAGGTTTCGATGATGAGATGCAATGGAAAGAAATTAAACAGTTTTCATTGATTCCTGGGTTAAATATATTAGATTTATCAACAAATAATAAAGCATTTAATAAATATAGATTAGTGGCAAAGACAGATGTAGGTATACCAACGTGGGCGTGGTTAGTAAAAGAAATAGAAATGTACGGGAATCTATATAAATTTTTACTTAAACAAAACTCTAACTATCATTCGATTAAAAACAATTTTTATAAACTTGGACAACCTAATGATAATGCACAATTAGAGCAATGGTATGATAAATATGGAACAAAAAACATTAATACTATACTAGAACCATTAGATTTTAAAGACGTTCCAATGAGTTTAGAGGAATCTACGGGAATATGGAAAACAGATTTTGAATTAGATGCTAATAAGATTCTAGGCAACATTAAGATGATAGAAGAAAGTATAAAAAACAAAGGAAATAAAGTAATTAGATATGAATGTGAACAATATAAAATTTATGATAAATTAGATAATGAATTTGAAATAATGATGGATGTGTAAAGAAAAACTTGATTAATAAACAAATTTTATTACGAAAGGATGATATAGTTTGAATAAAAAAAAATACTCTTTGAATTTTAAAAGTGATGGTTATATTTTTTTCGATAACTTTCCAAAGTTTGGAGATAACTACACCTTGCAAATTTGGATTAATAGCAATTCGAATTCTTTTAACGCTTGGTCAGGGTTATTTATGCGTGGTGACTTAAATACTTCTCAAGGATTATATTGGTACGGTAAAGACTTGTGTTTTACTAATGCTGTAAGTGGAAGAGTAGTAATAGATACTGTAGACAATTTTCCAATAGGAAGTTGGCACAATATAACAATTGCCCATGAGAAACAATTCATAAAATATTATAGAGATGGAAACTTTATTAGAAAATTTCAAGTTGGTAAATCTATTAATAGTTTAGATGATTTGTATATAGGTGTTTGGGACAATTATTCAAGATATAATGGAAAAGTATCCGAAGCTAGAGTATGGAGTAAAGCACTAAATGACCAAGAAGTATTATTTACTTATAATAAAGAACTAACAGGAAACAATAGACATTTAGTTGCTCATTGGGGAATAAACGAGGGTAATCTTACGGAGATATATGATTATTCTAATAATGGATATAACGGAAGGGTTTACGGTGGAGAATGGGTTGAAGATTCTCCAAGTATTTATTGTTATAAATATCTGATTAAACAAGATTCAAATTATTACTCAATTAAACCTGAATATATCCAAGAAGAAGTGTTTAAGTCTTTAATTTTAGATGGTGGTAAAAAACCTAATAGAGATGATTTTAATAAATATGGATTTAATAATTTAAATGATTTATTAATAGAATATAATTTAGAAAAGCTGTTTAAACCTGCAGATATATTAAAGAAGATCAATAGTGGCAAGTTTCATATAGTTATGATGGAGATGTAATAAATTAATCAATAAAAGTAGATTTAAAAAGAATACTAGACTAGAAAGAGTCTTTTTTCAATGCAAAAATAGGAGCAATGCTACGTTGCCATTGCTCTTTTATCTAGGAAAGGAATTTTAATATGAAACTATGACTATTAATAGTATAACCATAATATTACTTTTATACAATTATTTATTTTAATAAAAGGAGCAGTGTATTATATAGCACTGCTCTTTTTGAGTGGATTGAGGATAATTATTATCTATATGTTCGTGTTAAATGAAAGAGTTTATAGAAGAAAAACGGGAAGGAATTCTAGAAAGATAATTAATGTATAAAATCCACATTTTTCATTTTAGATTGACAGGTTTTCATATATCATCTATACTATATAAATGATGCAAATACTAAAAAGAAGGGTTTAAGGGGTTAGAACAGATATTAATTTTGAAGAATATAAAAATTTAATTGGAAAACTAAGTAGTCTAAAAGTAAGAGAGTGGTATATTTATCATGATAAAAACATTGTTAATAAAATAGATAAATCATTAGAAATAAAAGAGCAAGCAATAAAAGCTCATTTGCTAAGAAACAAATATAGAATGCAAGCCAGAAAACTAATGAAAGATAGAGAGTTAGCAGCATATTTGGATATTAATAATTCTAATTTACCATTTGAGTATTATGAAAATAAATATTTAAAACAAGGATACACTGGTAATTTACTTTATAGAAAAATATTAGAAGCTTCAAATAGAACAAATAAAGAAGTAAATAAACAATTAGGAATAATATAATAAGAACTAGAAGGCACTCAATAAGGTGTCTTTTTTATTGCAATAAAATAAAGAGTAATGATTGCCAATCACTACTCTTTTAAAATTATAAAGGAGATTTAATGTTTAAAAAAACTATTAGTATTATAGACACATCTTAAATTTAATATACAAGAAAAGTAATTAAAAATTTAAATAAAAATAAGAGCAATAATAACATAATATTGCTCTTATTTAAGTGAAGGAAAATACAGTATGAAACACATTCATTAAGAGTATAGACTAAATTTAATTTTTATATACAGTTATTGATTTTAATAAAAAAGAGCAATGTACTGAAACACTGCTCTTTTAATTGAAAGGCGGTAATGATTATGAATATGTTGATATTAGGATTATAGACAGATTAAAAAAATATTATACAATTTGGAGGTGCAATATGAATGAAGAATTAATAAGGGATAAACTTGGAACACATGATAAAAGACTTAATGATCATGCAGGGAGGCTAGATAAACTAGAACAAAATCAAAGCAGAGTAGATGTGAAAATAGAAAATTTATGTGATCAAATTAAACAATTAGTAAGTGTTATGAAATGGTATATTGGGTTGACAGTAGGAGCTTTAGTTAGCTTCTTTTTTTATGCAATTCAACACAATATTTTTAAATAGGAGGATGCTTTATGGAAATGAACATAATGGAATTTATTACAGAGCAGGCATTTATATTGGTACCTGCTTTATACGTTTTGGGACTAATGTTAAAGGGAACAGAAAAGATTAAAGACTGGACAATACCATGGATACTATTAATTATAGGTATATTAGGTTCTATAGCTTTAATTGGACTCAATGCAAATGCAGTAATACAGGGAATACTTACAGCAGGTGTTGCAGTATTTGGAAATCAATTAGTAAAACAAACTACAGTAAAAAGTAAGGAGGAGAAATAGTTGAGCTATACAACTAACTTTATAAATAGTGTTAAAGATGGGGCTATAGCTTCACAGAAGAAATATGGGGTATTAGCTTCAATTACTATTGCACAGGCAATATTAGAGAGTGGTTGGGGCAAAAGTAGTTTATCAAGAGATTGTAAAAATCTATTTGGAGTAAAAGCCATAGGTGGATGGAGAGGATGTAAAAAAAGTTATCCTACATATGAATATTATAATGGTAAGAAGACACTTATAAATGATTACTTTAGAGTTTATAATAGCTACGCAGAAAGTATAGAGGATCATGCTTTATTTCTAGTAAATAATTCTAGATATAAACAGCATGGTTTTTTTAATGAAAAAGATTATGTAGGACAGGCTAACGCTTTACAGAGAGCAGGATATGCAACATCTCCTATATATGCACAGCAATTAATCAATCTAATAAGGCAACACAATTTAAATGAGTATGATAATATAAATAATTCCTACATTAATATAGATGGTGGGGGATACGCCAGTTATCAAGGTGGTGCTCCAGGAATAAATCTAATAATTAGAGATTACAGCAAAGATATTAATAGAATATTCGCTTGGGTGGATAACGATAAAGGGGCAAGTTGGGCATTTGATTTAACACCACCTAATAGTAATTACACTAAGTTATTTAAAAACACGAGTAAGGTAATAACCAAGAGAAATGGCGGTTATACTTTTTCTAAGGGAAGTATTTATAAAATAAATGTTAAAGGATATGATAAGAAAGGTAAAGTTGTAGCAGAAAATCAAATAGTTCTTAAAGTACCGAAGTGTTAATTTATAGATAGTTCTTAGGCTAAGCTAAGAACTATCTATTTTTTTATTTTACATATAACTAATTTTATCTAATATTGTCGAATAGCTTGTACCAAATGTAGAAAAGTTTTACAATAAAGAAGAAACCACCGTGAATGGAACTAAAATGGTTTCTTAAAAATGATGGTATTTCATTGTTTATACTTCTATTATACTAGTATATTCTCTGAAATACAATATAATGGAGGATGTTATGAATAGAGAACTATATAAAAATTTAATCAATAATACAATTGAATATTATTTACCGGAATATAATTTGCAACAAATAATAAATAATTTAACAAAACAAAAAATAGCAAGTTATAAAGAAAAAGAAAATAGTGTTTTAATTAATACATCTGAATCAAAAAGCCAAAGGATTAAGAAATGTATTGATAATAATTCAGATACGTTTTACATAATAGAAAATATGCTTATAAATAAAGCATACAGATATGCTATTTACTATAAATGTGATAATTTTAACCTTAATGATAATATAGAAAATTATATTGATATTGAAAGTTTAAAATGTGAAAAATTTAATTTGATTGAAGAATTTGATAAACCTATATTTGTACAGACTGACGATTACATTGCATTTAAATTTTGTAAATCTATAAATCCTTTTATTATTAAATTAGATCAAACAAAAGATGTTTTTTATTCTACATTGTGGGTTTATCATAAAAAGATGAAAATATTAGAACATAGATTTGATATGATAGGGTTTAAATCTGATGATACTTTTTATGAGACTACATTTAAACCACAATTGTACAAAATATGTACAGATTTTAATTGTACTATTAATGAATTTAGAACAAGTAAAATAATAAAATGTATTGTACAGAATAAAAAAGATGAAGTAAATGAAATTTCACAATGTATGGGTTTAAAAGCTGATAGTTTGGCAAAACTAAAAGCAGGTAAAACTTTAGTAATGCCATTCATAGGAGATTTAGAAATTATTATGGAAGAGAGTAAAGAGCTATTTGACAAAACTAATGAAACAAAGTTAATAAAAAAAATGTTAGAGAAATATATTAACAATATAAAAGAAAATGCCAAATATAAATCAAGATTACTGTCTTGGTGTAAAGGAGAATATAATTCTCTTTGTGTAAATATATTATTTAGTTATAGAGATAAAAACTATGATTTGTTTAATTTCCAGGATCCTAAAAAAATTAATATGGAGTTGATGAATTATGCCATCGAATATATATGGGGGATTAGAGGGGATATTGAAAATATTAGATGACAAACAGATTGAAAGTTTGAAAAGCTTTTTAGAATATCAATATAAGAGTCATGTTTCTTGTAGCGATATAGCTAATAGACTTGATATTACATATGATAAAGCAAAAGAATTAATTAAAATTTTATTGAGAAATAATGTTTTGGAAATGAATTTTAAAATTTTTTGTGAAAATGAATTAGATACCAATATACAAGATACTTATGAAAATATAGAAGATATACCTGAAGAGCCATGTAGCAATTGCGAAAAAAAATGTTCTATTTTAAAAAATGTGATAGTTATATATAAAGTTATAAGTAGGGATATATATGAGTGAAGAAAAATATAAATTCCAAGAATGTGAGTACCTATTAGATAAATTTGAGGATTTTAATAAAGATGCTCCTGATATACTAGCAAAGGTTTTACAGAAAAGGAGTGGAATAAATTCTGAGGAATTTAACGAATTATATAAATTGCTTATAAATATGCACTCAAATGAAGAAAAATATAAAGGAGTAAAGGCAAAGACAAAAGGAAATATATTAGAAAAAATAATAGAGCTTATAACTATAAAAACAAAATTATTTAAATTGTTTACTAATGTAAGTAATAATTCAAATGAATATGATATTATAATTACTCCAAGTGAATTAGCTACAATGTCATATAATGCATTACCTGAGATCATATATCAGCCAATAATATGTGAGTGTAAAAATTATAAAAAACCTGTTGATGTAACTTGGATAGGAAAGTTTTATATGCTCTTATCTATTTCAAATATAAAATGTGGAATTATTTTCTCTTATGAAGGTATTACTAGTGGAAAAGGTGCAGGCAAAGATAAGGGACAAAATGATTCTGAAGAATGGAATAATGCTAAAGGATTGATAAAAAAGATTTTTCTTAAAGATGGGATAGTTATTATAGATATATCAAAAAAAGAATTGGATTCAATTAAAAATGGTAAAAGATTATATGACATAATACAATCGAAGTATGCAGATTTAATGTTTATGACTAATATTGAAAAATATAAAATAGAGCATATATCAAGTGAAAAAATTAAGGAAATAATTGAAGATGTTAATGAACAAGTAGAATTCTATGAAAGTAAAAAATTTAAAAAGTAGTAGGTAGATTAAATATAATTATCTTAAATCAAGCTACACTAGAGGACTTGGAAGAGTTAAAGCAACTAATAAAAGAAAGAGAAGAAAATTTAAATAGTAATAACATTTAGGTAGCTAGCTGCTACCTCTTTTTTTATTGTTGTAAAATGTCTAAATATGTAGTAAATATTTACATACAATCTCCTATTGTACTATAATGGAGTTAATTACAATAGGGGGGGGTATTGGAATGTCGGAGCTTAAATAAGCAAATAAAAAATATTTTATTTGTTTAATAACTTTTATTTTAACATTTGTAGTCGGATTAAAATCAGCTATATTTTTTTTAATATCATTAGGATTTTTAATAGCAATGTTTATTTATGCGTTTAAAATAAGTACTATTAAGAAAAATATAAAAGATCAATCTTCAAATAATGAAGTAGTTAATGAAAAAACAGAAGATAATCTCTTAAAGCAAACCGATTATTTAAGTAATATTCACGAATTAGAAGATGATAAAGGGAGAGAAGAAACATCAGACTTGTCTAGCGATGAATATTATAAAAAACTTTGTGGAAGAGACAAACATATTAATTATTCTAAAAGTGTTTTTGATGAATTTGTTGTAATAGATTTAGAAACTACTGGGCTATATCCAGTTACAGATAAAATTATAGAAATTACGGCAATTAAATATAAAAGTGGACAAATAGTAGAAAAGTATAATACTTTGATTAATCCTAAAATCAATATACCTAAAAGGGCTACGGAAATTAATAATATAACTAATAATATGGTTAAAGATAGCCCTGTAATAGAAGATGTCTTACCAGAGTTATTAAAATTTATAAAAGAGTACCCATTAGTTGCACATAACGCTAGCTTTGATATAAAATTTTTGAATGCAAATTTAGCTTTAATAAATAGAGAAATAAAAAATGTTGCAATTGATACCCTTCAATTAAGTAGAGCTATGTATACATTTTTACCGAATCATAAATTAACCACGATTAAAGAACATTTATGTATATCGGAAGACAATTCACATAGAGCTTTACCAGACTGTATTACAACTGCTGAAATATATTTAGATTATTGCTATAAAGCAAATAATAATTTAAAAGAATTTAATGAATTAGAAAAAGCATGTTTTAAGGTAATAAAAGATATGTTAAATAAAAATAATAGAGATACTGAATTTTTAAAACTAAAACACACAGGGAATTATACAGATATAGCATATTTTTATCCATTAGTTAGAATAAAAGTTGGTGATAGAAAACAATATTTTCTTACTAACATTCCACTTGAAGAATTAAAAGAACATGAAACAGAAAAACCTTCTAAATCTGAAAATTTTAAATCAAGAATACATTTAAAATCAGAAGAGGATTTAAAGCCATTTGAGGAATATATTATACAAATCTTTGATGAAAAAAAGAAAAGTTTTGAATGGTATAAAAACAACATTAAAAGTTCAGAAATTGAAATAGTTAAATATCTTGCAAATTAA
Protein sequences of DBSCAN-SWA_2 >NC_004557|1133918:1184898|1163402_1163822_+|WP_011099339.1|DBSCAN-SWA MSSFNYKVPGDKLQEDLINSAIPQTLWQKIYLHLKKLGYKVYSPGQKRDKCTEPYLVIKENGTYALNSNVNGYKLFDIIIYHPMSNYSTMEFYVENIKQAMKDITELRPTGNETPAIIDEKIEAYTASIEYQQFKSLRR >NC_004557|1133918:1184898|1163050_1163398_+|WP_115604774.1|DBSCAN-SWA MMSFKFDIESVINGLSEFEVKSKAAIGVYADTAGKKLEEHAKKNASWTDRTGLSRKTIEGGKQWEGDKCNVYVAGNTEHFPYLELAMDKQYSILNPTVNKLGAEILSGMNNLLGK >NC_004557|1133918:1184898|1172441_1174334_+|WP_011099349.1|DBSCAN-SWA MPLHKIPLCSLKYVGDTHSGATFKYDITSKVLKHKEKLFYKDKIVEIDKTRGDKLLYKYFYNIEKKNNEYLYKENPIMVKEFEKELKLHDREINKDKSMPLEDTNIIEFNKKKSMQLGQKECININIEVDTELQPRDSIDIHVETDKNLLEFRNIQLNKDNKCINLNIDRENLQLDKFESIYTNKITEKGLSKYQFIQSLKLEKNTGMEKHDYRFLNRIYFKEIDMYKLKFFEKYRDKYIDKDKYNFVDRVNFKKIDTIENPDILKRIGITKILKSNIVNNLDSITIRRISKDYNIRLMHKILLKNIEKYKYIDCLDRINIKSIDKDKNKKYFYREGLKTIDKYYNRYLDREAIISIHRDNDKYLDYMPLINVYKQIEKDLLDLTIWDIYKEHDEQLQGTLMKNIYKVGNNNKFIEVAKRWWWLNPTDPMDKLIIPNKDYEKMKELLENPNYEYLRYNDNPIEWGKYWGIDYNIPPMPVSIEIMVDLINILIMVWHKNVQAWMCCSGKEAVQFIMELLYDWYSLDTSKPNKDYIRAYRWVRWEAEKVYFLNAENGLQAIGILIANLIDYMKYHHFDLVPLWRNPKAMDIERNFNRIVQNGDIMKDLDKLKGSRHYCIETQNFERKNIFGE >NC_004557|1133918:1184898|1161103_1162138_+|WP_011099336.1|capsid|DBSCAN-SWA MNLQDFINANEIALYIKNLPLQVTLDKALFPNDKQLGMELEVAKGAKQRPVALRMSTFDVAVKPRTLKADINIEKKEMPFFKESVLIKEKDRQQMLLAMKANNQELVNQILNQIFGNYKALVDGAEVQATRMRAQLLQSGEIKIITDDGDVVVDYGIPSDHKEVLAGTAKWSDKTANIVGDIERWQNTLVAEGYAKPNRMLMTQKTFGYIRANAAIESEIKGISNVVVTDKLVKNYLKEKLEIEVGILNGVFTAEDGSTMNLYEDNKVTLIPQGALGKTMYGTTPEEADKMFGSSKLDTQIVNTGVAITTMAKEDPVTVETKVSQLVLPSFERADECFFATVAE >NC_004557|1133918:1184898|1145851_1146031_+|WP_035124629.1|DBSCAN-SWA MNNIPECMYDYRYEFEKMQIIDNCCNCDCNICEGEEYYDIDGTILCEECIRDYKHTAEL >NC_004557|1133918:1184898|1135961_1137494_+|WP_078688072.1|DBSCAN-SWA MANMQHSYVETTRDLNYKLLDCNGDNLLKYKEEYKAVIIPKVFIKQNEEENYKEYLKIIEELELYNEGYNNLDITSNAKHYFTIDLPTYEKIKDIKNIKGFYTFKNLKVDRSEAWSLENMITTTKNISSGKEKSLGSLEMTIRNKLKNNKNYTIEFEKDVDNNILNIEENIPKNNLNVKLTLDRNIQEDIKELLKSKYGDYEQIGVILMESNSGNIKAMVQKDDTLPNINIGAETLNGFFPGSIYKTLVMETILEKDNDNVNKSFSCKGLYEDNNKGINHGSLNINDAFVVSCNHIFSQLGIESGFQSMHSLSKSQGLLDKVLNLDREQSGKFEVEEPKMENGSLGLTAIGQNMRITPIEAISIANTVINEGIYIKPNIISAYVDNANKVVEKTEYQGERCISKETALNVKKAMNGVVNKGTAREAYLEDIDIGGKTGSTERLEIVKDRNGRSKRIKYSDGWFIGFFNKEDKYYSMVVFVKNINKDSESGGNTAAPIFKDVVKIFLDEEH >NC_004557|1133918:1184898|1182596_1183403_+|WP_011099356.1|DBSCAN-SWA MSEEKYKFQECEYLLDKFEDFNKDAPDILAKVLQKRSGINSEEFNELYKLLINMHSNEEKYKGVKAKTKGNILEKIIELITIKTKLFKLFTNVSNNSNEYDIIITPSELATMSYNALPEIIYQPIICECKNYKKPVDVTWIGKFYMLLSISNIKCGIIFSYEGITSGKGAGKDKGQNDSEEWNNAKGLIKKIFLKDGIVIIDISKKELDSIKNGKRLYDIIQSKYADLMFMTNIEKYKIEHISSEKIKEIIEDVNEQVEFYESKKFKK >NC_004557|1133918:1184898|1141557_1142370_+|WP_011099316.1|DBSCAN-SWA MKEMNSLRVIDQREVLNKNFKIYGNIENPLFLAKDVAECIEHSKPSVMLEGIDTQEKLKETIFTSGQNREMWFLTEDGLYEVLMQSRKPIAKQFKKKVKEILKDIRKHGMYAKDELLDNPDLLIQVATKLKEEKAKNKMLELQNKQKEQIIGELKPRADYTDRILKNKGLVTITQIAKDYGMTGTGLNKLLHELKVQYKQNDQWLLYKEHSGKGYTHSETIDIVRSDGRPDVKMNTKWTQKGRLFLYNLLRDNGILPTIEQEAEREFACN >NC_004557|1133918:1184898|1137611_1138064_+|WP_128993679.1|DBSCAN-SWA MFDSIINILGSNILLTAYVTGNNSFPKPLKAEEEKYYLKQLEKGDISAKGVLIERNLRLVAHIVKKYSFPGKDMDDLISIGTIGLIKAIDSFKIDRGTRLATYAAKCIENEILMLIRNNKKTKGEVYLQDPIGIDKEGNEIPWTSYQSRA >NC_004557|1133918:1184898|1171193_1172429_+|WP_011099348.1|DBSCAN-SWA MAEEKFYYAEGSCQVKDLVKNLVTEITKNAGIYKWNLVYPKTLDEIGTSGENKEINLITDDSTTDKVDTKFTVTEQNDTCIIKATTTYGKEFYVKIDREKADLTKEEKKALVDFKSLHRYCIREYCYDRTDAQVLEIMAGVNDESSKSGDYNAYVSAMTKSNSINNIRLQISDKLNKEGNDLDIPKNMQKEYNYRLAWYRKLQPEIKDFLPVQYWINVTKDSINLVLRGDPSADVHPYENYLTSYAYMGALKPVEDSATTDDKYNFGITVSSDIEPNYTKLYGERTATGVTDMCMIANKIGMPYQPHYPAFYATNPFMDKCNVEGSRWNHKKHQFSDITLVHPVDMERGKMINVLAGDASAIYDMDKLAYKKDTDEEEYYKKFKLTAPFNFLNNSANINYCIAIRCYKTTE >NC_004557|1133918:1184898|1146939_1147314_+|WP_011099322.1|DBSCAN-SWA MRKIWRNAELDFIRKNKGKLSYKEMSKHLRRTPAAIEQKWQCFIDRQERVDKIVNKMDVAEKKFRFNKGQKVKTKRMEWTTGWDTLITKGKVIADNKYFVVLDNGVYKECVNKVDLYTKSVVLV >NC_004557|1133918:1184898|1139584_1139869_-|WP_035125317.1|DBSCAN-SWA MENEKIFELIEKMYIDLKGSQEKMYADLKEGQEKICTELKSEISEVKKTVIRIENDHGKKLEALFDGYKQNSEKLNRIEDEVAKHKEVIIKRIK >NC_004557|1133918:1184898|1177610_1178489_+|WP_011099353.1|DBSCAN-SWA MNKKKYSLNFKSDGYIFFDNFPKFGDNYTLQIWINSNSNSFNAWSGLFMRGDLNTSQGLYWYGKDLCFTNAVSGRVVIDTVDNFPIGSWHNITIAHEKQFIKYYRDGNFIRKFQVGKSINSLDDLYIGVWDNYSRYNGKVSEARVWSKALNDQEVLFTYNKELTGNNRHLVAHWGINEGNLTEIYDYSNNGYNGRVYGGEWVEDSPSIYCYKYLIKQDSNYYSIKPEYIQEEVFKSLILDGGKKPNRDDFNKYGFNNLNDLLIEYNLEKLFKPADILKKINSGKFHIVMMEM >NC_004557|1133918:1184898|1179938_1180202_+|WP_035110473.1|holin|DBSCAN-SWA MNIMEFITEQAFILVPALYVLGLMLKGTEKIKDWTIPWILLIIGILGSIALIGLNANAVIQGILTAGVAVFGNQLVKQTTVKSKEEK >NC_004557|1133918:1184898|1183755_1184898_+|WP_011099357.1|DBSCAN-SWA MFIYAFKISTIKKNIKDQSSNNEVVNEKTEDNLLKQTDYLSNIHELEDDKGREETSDLSSDEYYKKLCGRDKHINYSKSVFDEFVVIDLETTGLYPVTDKIIEITAIKYKSGQIVEKYNTLINPKINIPKRATEINNITNNMVKDSPVIEDVLPELLKFIKEYPLVAHNASFDIKFLNANLALINREIKNVAIDTLQLSRAMYTFLPNHKLTTIKEHLCISEDNSHRALPDCITTAEIYLDYCYKANNNLKEFNELEKACFKVIKDMLNKNNRDTEFLKLKHTGNYTDIAYFYPLVRIKVGDRKQYFLTNIPLEELKEHETEKPSKSENFKSRIHLKSEEDLKPFEEYIIQIFDEKKKSFEWYKNNIKSSEIEIVKYLAN >NC_004557|1133918:1184898|1147820_1148450_+|WP_011099324.1|DBSCAN-SWA MSKIRVIADTGELIHEIECSHYNIQYVTSESDGKIQRIINLNNGKYGPKHWIKNDIYQPIARKIKKNLINQVPELSCVNVDKILFIEDIDYVSDEINRNTDWVMKIKKAPSQLTEFTGYKFIIESREFWMERCSHEQIVAHIYSCLKQIDKDKLREPDVKGWKEVIGNLGLGWETTISPIPNLMNGFDVEDFKMLKKADKQMKFDLRAK >NC_004557|1133918:1184898|1162683_1163049_+|WP_011099337.1|DBSCAN-SWA MSKINRGKISRQIYKQLEKKGLLKEIKILRNGKNAYGEKLEDLYVTTIKGYYYREKSKINISTDTGAAINTDYREKLLVTYNEESKKIKQDDYLTSDGIKYKVIDTGNVENIVFDMYLDRM >NC_004557|1133918:1184898|1144109_1145051_+|WP_041744719.1|DBSCAN-SWA MKRGGSGLDKLKWLQERQKGIGGSDAGAILGINKWKTPFQIYLEKTEPITEINEQSEAAYWGDQFEEVVAKEFEKRTGKKVRRDRRHFKHEKYPFMVANIDRRVIGENAVLECKTANQFLAKEWEGEEIPASYLVQVQHYLEVTGAEKGYIAVLIGGQKFIWKEVERDEELIEIIINTEKEFWENHVLKKIPPALDGSSAAEKYLNEKYKKSNSNISIDLKSEYMDKIDELMQLKETIKNLEGQAKEIENNIKNELKEAEIGYAQGYEVNWKKVISNRVDSKLLKEKYSEIYKKVCKESVFRRFNIKNLKEEN >NC_004557|1133918:1184898|1146048_1146927_+|WP_011099321.1|DBSCAN-SWA MAEGKEKGWISLYRDIQEHWIWEDAEKLKAWLDLLLLANHQSRKILLGNELIEIKRGSTHASELKLMDRWNWSKKKVRNFLQLLEKDNMIICEKSKKGTTITIINYEVYQGSRNHRETIKEPQGNHKETIKELLGDHKGYTNNNDNNYNNINNDNNSSGSTPNYIEFFNSNFHLINSYEINILNSFVKDGLSEEVILLALKKAVENNVRTIKYVKSILQNWLENNIKTVEGVKAEEERFKRDIEHKKAKGNVERKDNGAKVDSFNGYQQRTYDGSDGGMTFDDLEKKLLGWK >NC_004557|1133918:1184898|1154591_1155935_+|WP_011099330.1|terminase|DBSCAN-SWA MKNKVVPFKFKPFSNKQLKVLTWWMKDSPVANKDILIADGSVRAGKTVAMSLSFVMWANETFDGENFALCGKTIGSLRRNVIKPLLKMLKSRGCKYKEHRTDNYITISKGKVSNDFYLFGGKDEASQDLIQGITLAGVLFDEVALMPQSFVNQATARCSVDGAKMWFNCNPEGPYHWFKVEYLDNLEYKNGIHLHFTMNDNLSLSEKVKERYKRMYSGIFYKRYILGLWCLAEGVIYDMFNEDFHKVKTVHRKYEKYYVSIDYGTQNATVFLLWGLCEGKWYIVKEYYYSGRDKSLQKTDIQYSKDLKGFLGDIVPVKIIIDPSAASFIAQLRSDGFEQIRKAKNDVLDGIRTVASALSLDMFRVNDCCKETIKEFVSYVWDTKKISIGIEEPLKDKDHCMDAMRYFIYTILKHNIDVKYDKSVYNKGRGLKQNVLKKYGKKGGTVF >NC_004557|1133918:1184898|1145052_1145859_+|WP_011099320.1|DBSCAN-SWA MATNESLKNQLSTKKETGLGSAGNTIKGLMNSPAIKKRFEEVLKQRAPQYMSSIVNLVNSDINLKKCDQMSVVASCMVAATLDLPVDKNLGYAWVVPYGNKAQFQLGYKGYVQLALRTGQYKSINVIEIHEGELIDWNPLTEELKIDFSKKESDAVIGYAGYFELLNGFKKSTYWTKEQITKHKNKFSKSDFGWKKDFDAMARKTVLRNMLSKWGILSIEMQNAYTADQGIIKNEIIETGEVKENIEYIEADFESYEDNSIEEGGANE >NC_004557|1133918:1184898|1165342_1165669_+|WP_011099342.1|DBSCAN-SWA MATFKNPVVTCDKCGKDFKLKQNRLKSEIVKENIERTYFKCPKCKHKFIVMYKDQEIKENLKKMDNIKVQIQKSINEKKNTKKLIGRYEELYYNNLEISEKYKSLYGR >NC_004557|1133918:1184898|1143364_1143622_+|WP_035125388.1|DBSCAN-SWA MELEYDRYGRAKYNPFIHRNTGKPWSKTDLNYLINWCDIIGPDEMSFALERTIATVMNKVYILRKKGVMNKHKRIRNCKRVRSMH >NC_004557|1133918:1184898|1160119_1160737_+|WP_035124914.1|DBSCAN-SWA MPKLSEILGEHFKQIPEDIQKKYKDIDLVDSSNYIEKKELDTANETIKQYKKDIAKRDKDLVDLQGKIKDNEELNAEIENLKAANKKASEDYESKLNQITFETKLEKKLGEFKPKNLGILKKALDIEKISLDGDNFLGLEDQIKNLKESDPYLFAEETPGGTGNIGGGQSSIIDDNKDSKSIGEVLGKQQADQFKINETIDSFFK >NC_004557|1133918:1184898|1175617_1176700_+|WP_035124965.1|DBSCAN-SWA MSKFFNVTHDKDIVLDDSIISNKTAWTSKKIHKEIVDKRITKFEELEDVDVVNKQDKQLVAYSKDTGKFTTINGTEAGEITGASMKQISKMGVIGSPEEPRIVNISVNTIDFKVPRVNVLKYDLGTQNIILTKNEFTNGESNDFEKDPMMVFDGKAHLKTEHTSNFAFNREMDIKSEYVVTFKKSNFKKVESFKIGEDGVIKTLVTTAIPHDRLLIPKCDMNLSNVENIDYFNLTATGENIRIVCSVDSGQTWKVFKNEKWVDINLDVEDIRTNGMTIDLFNSINDVFWNELVTTKKIRFAYLLSMNNIDNIEELKNLDLQYDGEGKWIQAKEDTFDVVYASNTLLQVHVKFNGDIKINY >NC_004557|1133918:1184898|1152544_1153633_+|WP_011099328.1|DBSCAN-SWA MILKENLLYPLVHYTDMYGNFIGFQKDKNSQVVLCSCMRKAVENCIKLFLKYPSSLLNPPEWILLKQLDMPESINDVIRKKNPPVGLEWLNYIKFKKDVCHRCNIEKPNKEYCTPMYGTKFKRTFGWYININYFNKGINPSTHDGIYYLKEDGPIEIRLILDPTDKDLLEDIRRYKLLDTFEGKEILDMLKKIEHREEAILLRYFHIEDKELQYQKLYYAIEYVMKQRLKEVHKVIENEVRDWFKSKRVGEKWENETNLYKIIRKLYPELTMYRHFRPPFLDGLELDIYIETLDIGIEYQGEQHFKPFEHWGGEEAFEKRQELDKKKRELCNKNNIKLIYFNYDEEINEEHIRKKLLKELNI >NC_004557|1133918:1184898|1149549_1149711_+|WP_035110040.1|DBSCAN-SWA MPKEGQINILAIPYKNFKHKIRLTKRFLNRYMIQDLGGVLYMERREEYENKNK >NC_004557|1133918:1184898|1151296_1151473_+|WP_162827854.1|DBSCAN-SWA MKIEKIIKAQQPDIHKRLKQQNRKKKSRREGEHISFSDVMELMKHDSYKRHRGAIRQR >NC_004557|1133918:1184898|1133918_1135136_+|WP_011099308.1|DBSCAN-SWA MNKPEILAPAGNLEKLKTAILFGADAVYLGGSKLNLRAFADNFTDEELIEGIEFAHSRGKRVYVTVNVFPHNEDLEGLEPYLKDLENMKVDAIIVSDPGIIMTAREVAPKLEIHLSTQANNVNWKSAIFWHKQGVKRIVLARELSFEEIKGIREKLPEDCELEAFVHGSMCMSYSGRCLLSNYMTGRDANRGECAQPCRYKYYLMEEKREGQYFPVFQDERGTYILNSKDLCMIKHIPELVRCGINSFKIEGRMKSSYYVASVVKSYRQALDAYIEDTKNYKFNEDWMNNLLKTSHRDYYTGFYLGDKDSQIYETSSYIRNYDIVGIVRDYNKETNEATIEQRNKLFEGDSVEVLRPVGDSFQVKLENIRNEKGEKIESTPVAQMIYKARVNIELKKNDMLIKAK >NC_004557|1133918:1184898|1135166_1135796_+|WP_011099309.1|DBSCAN-SWA MSKRPILIGITGGTGSGKSTVSKEICRRFDKELIVMIEQDSYYKDQSHLSIEERVKTNYDHPNAFDTELLVKHLKELSYWSKVEKPIYDFELHNRKNETEIVEPTEIIIVEGILVLEEKEIRDLLDIKIYVDTDADVRIIRRLVRDIKERGRSLDSVINQYLNVVRPMHMQFIEPSKRYADIIIPEGGHNKVAIDIIVGNIKQMVQKSE >NC_004557|1133918:1184898|1165123_1165330_+|WP_078688091.1|DBSCAN-SWA MSKRYQIRPSQIIALTNDYEAFCFDEACTYILAEMSKENGTEPKFEDNKNVNKQNNSDVIDWLNSNNK >NC_004557|1133918:1184898|1159919_1160120_+|WP_035124916.1|DBSCAN-SWA MEFRKAYELLKQGKHVKRKHWGGYWNWENNTIMMYCKDGKVLDIRDTKDVDFTMSNMLEEDWEVVE >NC_004557|1133918:1184898|1147395_1147824_+|WP_011099323.1|DBSCAN-SWA MARSKYGAKKITIDGNAFDSKDEGKYYEYLKKLKFQEKILNFELQPRYELRPAFEKMGKKYRKAEYVADFLIYHLDGTEEVIDVKGMATETAKLKRKLFDEKYRNLKLTWIVRSLKYSETGWIEYDELQKVRREEKKGAKNK >NC_004557|1133918:1184898|1170159_1171191_+|WP_011099347.1|DBSCAN-SWA MSLSKYIINFEELADELKKRILKLIDDELRSKYSKLNTNDIEGLLEDLKELLPSLQYNMLKKKIENFIYYNYNGMQKVDSKLLDIPPIIQENKEDFIFDKDVYLTGLHFNQTGWKKEDTYTLEINKNRIINNATIKEIGEHKYFNTFYKVNANTPISFILHNNSGNSRQTMVDLEYLIGEIVTPPEPPIEEPETPTIDDILNDWDIAVVMQWEANTMADIDLHAVIGDKYIYFGNKEEWNFFLNFDFRQHMTNQNPEILSVKGYKNKKLEIYIHNFDGIGLNEPVNLKIYEKRPYGNNLIKELNIDISPSRDVIRETIQINLNTLEIKEINRDINLEKFLGGR >NC_004557|1133918:1184898|1179687_1179921_+|WP_035109991.1|DBSCAN-SWA MNEELIRDKLGTHDKRLNDHAGRLDKLEQNQSRVDVKIENLCDQIKQLVSVMKWYIGLTVGALVSFFFYAIQHNIFK >NC_004557|1133918:1184898|1149891_1150032_+|WP_155274221.1|DBSCAN-SWA MEIILISMLICIAIASVSVKYSIKKVQAIKQKELDRPDYIRMEEWQ >NC_004557|1133918:1184898|1149691_1149889_+|WP_035110039.1|DBSCAN-SWA MRIKINSVKDILNNSKYIPAEVIQDIDKRISDWLASGGKKDDSYIKQQFRYAERVANIISGNMEG >NC_004557|1133918:1184898|1148624_1149065_-|WP_035110041.1|DBSCAN-SWA MSCKLTDVLNKQIANWSVLYIKLHNFHWFVKGQQFFTLHLKFEELYNEAALHIDELAERQLSIGGNPVATMKECLEISSIKEATGNEVAEDMVATIISDYNTIVSELKTGMEIAQKENDETTSDMLLAIHTALEKHVWMLKSFLGK >NC_004557|1133918:1184898|1140257_1140716_-|WP_011099314.1|DBSCAN-SWA MSYDTLLDEAFNNDIMVKEIDLKTKDGLCYGNRIAINKNLTTNREKRCILAEELGHFYTTVGDITDQSKIVNLKQEVRARRWSYEKLIGIIDLVNAYNNGARDKYTLADYLNVTEEFLEEAINYYKTKYGLYYEIDNYLIYFEPTLGVMKIF >NC_004557|1133918:1184898|1176775_1177561_+|WP_011099352.1|DBSCAN-SWA MDKNLCVNGTVIASNDYQPSYPERKKECAFDGNLDTNWGGITGQTIDKSWIGYDFLKPITIIKIKLHQSNNSNSSINIVSVQGFDDEMQWKEIKQFSLIPGLNILDLSTNNKAFNKYRLVAKTDVGIPTWAWLVKEIEMYGNLYKFLLKQNSNYHSIKNNFYKLGQPNDNAQLEQWYDKYGTKNINTILEPLDFKDVPMSLEESTGIWKTDFELDANKILGNIKMIEESIKNKGNKVIRYECEQYKIYDKLDNEFEIMMDV >NC_004557|1133918:1184898|1174608_1175607_+|WP_011099350.1|DBSCAN-SWA MKLIKVKDGVLEVDNFYLTSFFNDFAGNSNITRDIKTGKIKLMSNNKMERKFDYNEFVIELEKENFKHMNDYDYSMIYLGNNEYTFGIKDEELDQQNKYWKILKQDNYIQAYSSTDGMNYKNIGGMKFDEAMTKQGFMKYSAEDFILNKYSVYDNPYITIQNFPENTICELYDLKGNLLKTRIFDKNLECKVFIDGNMVGYFIFKDQDNNEIYTTSHLSLQYGDVYICSPYNFEIIYHGNVVTNVNPATLQDLEELITIKNIGDKDYTNINIGTETSSNDLIQLSFDGENYKECLVVYSIEQGESKDIFVRITKNAENHNFNVRDFQLVINE >NC_004557|1133918:1184898|1169602_1170163_+|WP_011099346.1|DBSCAN-SWA MGLPSYVINWDELSDLISDYLKNGVNADIGNITFNTEDIERLLSEIKDKIQGVDYNDLIQALNDLGAKLDGLSGNLGILGTQKIHGKMLEVPAIVGQHTIEFTAPANSKLTGITYSQSSWRFEDSWDLVVEEEKLFTETRTKEYGEHKYFNVFYPVDGTIKFIYNNNSGSSKVVWVDFCILEGGSE >NC_004557|1133918:1184898|1182271_1182604_+|WP_035109987.1|DBSCAN-SWA MKILDDKQIESLKSFLEYQYKSHVSCSDIANRLDITYDKAKELIKILLRNNVLEMNFKIFCENELDTNIQDTYENIEDIPEEPCSNCEKKCSILKNVIVIYKVISRDIYE >NC_004557|1133918:1184898|1165726_1167772_+|WP_011099343.1|tail|DBSCAN-SWA MAINVGTAVAYLTLDRSGFKNSLKSAGANLKNFVAGTGGAEDRVKSLGNALTNTGKSLAKPSIAAGGFLGMATKTAMGFEEQMSKVQAISGANAKEMGKLTALSREWGAKSKFSAMESGQALEYMAMAGWKTQEMMDGLPGILNLAAASGEELGTTSDIVTDALTAFGLQAKDSAYLADLLASTSSNSNTNVSMLGESFKYVAPVAGALGISAKDTSFALGLMANAGIKGSSAGTSLRYSLTNLANPSKNMRKEMERLGISLTDSNGKVKSGKVLFDELRQKFSKLTDAQKAQSASIIFGKEAMSGMLAVINASDKDYNKLYGNLNKAEGSAKKMADTMQNNLKGQLTTLKSAFEELQISLAQAVVPILSKVVKLVTKLVNGFNGLPQPVKSAIGIIVGAIALLSPVFLILGKLVKTIGSVIGMFGKLKTATGVFKLLPTLITPHTLIILGAIAAIGFIVYEVIKHWDKLVAAGKKMGKILEKIFKGIGDTIKWVIGGWKLLIGHFINWGKQKVKNIVDGFIGGINKVKGLFKGKGKELGKGVKEGFEEELQIHSPSRVFDGYGGFIGEGLIQGIDGQEDNIDTKFRGLGNRIKSLGNIRPEFNGLNNMALSGAYSADSGFNSISNSNKQLNFNPTINMYITVADIENKGIEQLTQEVKSMGETAIKNSMVDLFMKDAIRN >NC_004557|1133918:1184898|1153678_1154602_+|WP_011099329.1|terminase|DBSCAN-SWA MAKTRSPDWENIKKEYIELNGDVKLKEFAEEHGIKYSTLRSRKNRENWDSEINKDVATKSATQQKNVATENKTKNNDKEPIDKEVKEVLENTELTDKQRLFCIYYVKSFNQTMAAIKAGYSQERAHVTGSELVRNSKVKAYIKELKGKMIGEIFIDAMDVLNKYIKIAFADITDYLTFGQREVPVMGPFGPIVDKKTKKEITKIINYVDFKESNVVDGTIISEVKQGKDGVSIKFEDRMKALDKLSQYFDLFPDNFKRKIEEERSKQAREKLELEKSKVTGNDDEVQDDGFIEALEGKVEEVWKDEK >NC_004557|1133918:1184898|1157958_1159659_+|WP_011099333.1|DBSCAN-SWA MNEYLKLVAKAQEQRIMLTKKQIKNIRDLYRDVAKDLGKRSKKANKDSLSEIWLLDYQKQFKKDIKELNKILKKDIEYSILESAKYATNIQTDFFNLMDVKYKLNSKETFSNMFSRIPQQALEELISGDFYKDGKGLSERLWFHEKEANANFDYIIQKGLLEKKSTYELAKDLSDYVNPEVKKDWDFKRIYPGVGNKKIEYNSFRLAVTSISHAYQLSMQRSCKANPFVEGIEWHTSNSHRGPCSICREREGKTYKTDELPLEHPNGVCYFTPVITKSLDEVGMELHGWLYGGSNNKLDDWYKEYGREFVGESNLFRNISKKDNNNDIIKEKPIKDFKYPDIKTIKEAEKWAINNLNLNKISYKDIDIGVANYVNKSMSEIYQEYPLLNGFIQEIKTDGRASAPASASISFKDGKLNTKLILSKKDLADLKSIDDMIKDCVDYKWWTPKDGVKGIIKHEMGHMIEYATTLKKYGVINKNNELSDLNNLGLAFSRIKNGELSKEIKMKALNNLNIVNTKKNIKENLSNYSNRSTLEFLAEAVSEDNPRTLAKEVVKLLKEKIKEVWK >NC_004557|1133918:1184898|1157448_1157922_+|WP_011099332.1|DBSCAN-SWA MELVIIFYILFIIFFSALIIAALTGSKILKNKKDAIGVYSIGTVIYGVLFFLTLRLELSSYNYIYMEPVSKTSINRKVSSKDDGNNFRAETPKEQLYVDENEKGKGLIKGNTSKKTGEKIYHTPGSRYYNSTKIEDTERWFKTIEEAEKAGYRAPKK >NC_004557|1133918:1184898|1163826_1164699_+|WP_011099340.1|DBSCAN-SWA MDGKTLVNVIKVNFIDEVTNIKHTIETSDEIDIEPIKSDGKRDILRVKNTIYGINETEDIVIGYKLKLKDNLFNMKTMSLIDGGSIEGNKYCGAEAGKVAERHPFTMEIFTEEKDYSRTTGYAKFTYKHCKGKPAKYKVKDGEFLVPEYEAESIPFRGEKPVEIEFLDTLPSKPEKPDPPVEDIGVEGGTVKDTNTDVGVEITNRIVWTFKEQINQDYVTKTNFSTRRKSDNSLVQGNVTIDGTKKIVTFVPTSLAVDVVYIAQAKPVDKLNGSGKTAALSTEFKTIKIK >NC_004557|1133918:1184898|1142402_1142804_-|WP_128993678.1|DBSCAN-SWA MINFLGADTMENKIYDLLEKIYVNFNQRFEGLEKTTSKIKSQVDTNSLMLEKIQTDIKTLAEVQQSFSEQLDRAKDKDGKTLGERLDIIELAISNTSKSVNDVVDAIDVIKETTGSHEMDIKILKKIRNNHLL >NC_004557|1133918:1184898|1141335_1141551_+|WP_035125315.1|DBSCAN-SWA MYRELLGELVKKGLTKKDLAKKIGVSEKTIFNKLNGKSDFTLSEIKKIRDLVCPGASLEKLFEKSEMKKLN >NC_004557|1133918:1184898|1162384_1162687_+|WP_035124909.1|DBSCAN-SWA MTIPLEILKFNLQEREYPYFEDNDLSLLLQSNDNNINKASYKGCLLKANADDRLEVAGVKLSSNRQYWLGLAEEYKKAYEESLQGTITGYKTSMKRVDGQ >NC_004557|1133918:1184898|1168160_1169258_+|WP_035124907.1|DBSCAN-SWA MVIAMNYRVEIIDKGEKKTTLKRIVLNIVIDRRIDMQTASATVVVQDVKSKVPSLYNFGYTDGIIANGNHIRIYISDKIQFTGIIRNFEIDDEAKTININCHDVGCKILRPIDGGVPYHVFNNINATQLISNMAIKAGLGTPIFNIISSNNYAIKNLKMQYDVQMSDIIDEVLKTLEARARVLKDGTYKVEKLYPDYKASNVQNKINYDFYYEDFIIIGKANRKRGSETLYNRVLVRHANDKYNVFEDPSMVAYLGYKNFKEIESPLGDTVDKRQKVASRFFLDCWRENSNVDIVATKGNPDLDLGKIVRIKLNEMIGHYIVTGIRTELTPDGNYIDQISLDGMREVTNIAKLGKGNYTLKEGDN >NC_004557|1133918:1184898|1150269_1150581_+|WP_011099326.1|DBSCAN-SWA MEKVKLWTQEEDKYILENQGKMTYREMAKKIGRTESAVKNRISYERMKSHMNKDKAKKWLEEQLPGMTPQQGKKQIMKKLNIDEKKAEKVYNKWRKNYIKAMM >NC_004557|1133918:1184898|1169257_1169590_+|WP_011099345.1|DBSCAN-SWA MKQNDFRKPVVYILDQELRRRDLKNKIKFEGGEEYKGELPEYPVRLVRDTNKKVIKCIYAENTELEWSEELIRNTEGKVYKIKTIYPNKSEKAIELFKNPDNRVELIDYV >NC_004557|1133918:1184898|1139886_1140231_-|WP_011099313.1|DBSCAN-SWA MENLLKEILSEIKGIKSTQEKMQSEITGIKDQVAGIKGEVSGIKETQEKMQSDIIEIKEKVNAVYNQTADLTEFRTESIENLNQIKDDVEYLTHKESQNEKVLFNLQRKVTTNR >NC_004557|1133918:1184898|1179074_1179269_+|WP_012047659.1|DBSCAN-SWA MQARKLMKDRELAAYLDINNSNLPFEYYENKYLKQGYTGNLLYRKILEASNRTNKEVNKQLGII >NC_004557|1133918:1184898|1174338_1174593_+|WP_023437986.1|DBSCAN-SWA MLESTIDFKKPRQKMWGILKDKQLISLPYGHETNEEGKELTSYATNCYEDALAEAHTLLAQGTGTKNIQVVEFVPYDYIMQPRV >NC_004557|1133918:1184898|1155934_1157392_+|WP_011099331.1|portal|DBSCAN-SWA MNIKETLLNLNEREKKERKKALKDFIFYLGECENIDAAKLNQDLLGQNWITLDVLDYIPSQIIDNKVKPLINKQARFMFGKEPDILFKPLDKQNKETCEELRQYIDAILNASKFWSNTMKAFRLATVTKRVMLRLEANPGQPIRLYYHDINDFSYEVDPNDITKLNKVILVRQDAETANKEEKDQIWCRYTYYMNKISDKESTCYLRIETFKGNNLEAPIEIKEQDTGLSKIPCWVICNEQSIINPYGQSDIKDLKPLQDSYNRRLSDFNDSLRFLMFGQTAVIDATEDTVNACNIAPNSLMALKSIDDTEGNKQAKVQRVESNFTNADPVLKYLKTLEDSMYEKLGIPKLESLQQVPSAKSIKYMYTELVARCEEKWHDWEPIIRQMIRLIVEACGKFKCYEEWKDEWNDLLYNIVLNKNYPIPEDEEDKKRLAMEEVRTNVRSHRSYIKDFTDDENVDDILKEICEDITSITAAEQEQFLREI >NC_004557|1133918:1184898|1151704_1151947_+|WP_035110030.1|DBSCAN-SWA MWLFKSYVDIEGINPFAKSGNGYWTVGTDRTPINKIAAVVKYGHSTLNFTPSISISSNPLGFSFNTTTVISGTQSATLKF >NC_004557|1133918:1184898|1181235_1182288_+|WP_035109989.1|DBSCAN-SWA MNRELYKNLINNTIEYYLPEYNLQQIINNLTKQKIASYKEKENSVLINTSESKSQRIKKCIDNNSDTFYIIENMLINKAYRYAIYYKCDNFNLNDNIENYIDIESLKCEKFNLIEEFDKPIFVQTDDYIAFKFCKSINPFIIKLDQTKDVFYSTLWVYHKKMKILEHRFDMIGFKSDDTFYETTFKPQLYKICTDFNCTINEFRTSKIIKCIVQNKKDEVNEISQCMGLKADSLAKLKAGKTLVMPFIGDLEIIMEESKELFDKTNETKLIKKMLEKYINNIKENAKYKSRLLSWCKGEYNSLCVNILFSYRDKNYDLFNFQDPKKINMELMNYAIEYIWGIRGDIENIR >NC_004557|1133918:1184898|1162193_1162382_+|WP_035124911.1|DBSCAN-SWA MAKKADEKLKKVKALVNIKYDESCFKIGDEFEVRSEDSEEMTKRGYVESLEKIEKGETKEGE >NC_004557|1133918:1184898|1140731_1141175_-|WP_052042385.1|DBSCAN-SWA MGLEIIEKLKKEKGFTSKQLSEKSGVPKGTLDKILNGTTKDPKLETLKSLSRVLGCTLDDFDDKTETEMENINFKKETTLLTNFNKLNDTGKSEAIKRVEELAQIDKYTHEEKDHLMPIAAHDKEGNFSKEDMEHDLNLMKDDELWK >NC_004557|1133918:1184898|1164714_1165176_+|WP_052042375.1|DBSCAN-SWA MSIQVTSIEDLKKMAEYDLIELPRFKAEIPFVAKVKRASLLNLVRKGVIPNKLLSAAEELFYGKSSNKGNVDMKALTDVMFIMAENALIEPSARDLKEVELELTDEQIVALFNYTQEGLNEIEKFPEKPENTVGDNNEQTVSNKTKPDNSINK >NC_004557|1133918:1184898|1138053_1139562_-|WP_011099312.1|DBSCAN-SWA MNKICIYLRKSRADEELEKTLGEGETLSKHRKALLKFAKEKKLNIVEIKEEIVSGESLFFRPKMLELLKEVENKQYTGVLVMDMQRLGRGNMQDQGIILETFKKSNTKIITPMKTYDLSDDFDEEYTEFEAFMSRKELKMINRRMQGGRVRSVEDGNYIATNPPLGYDIHWIKKSRTLKINPHECEIIKLIFKLYTEGNGAGSIAEHLNNLGYKTKFNNNFSRSSVLFILKNPIYIGKVTWKKKEIKKSKNPNKVKDTRTRDKSEWIVVDGKHEPMISMKMWNKAQEILNNKYHIPYQLVNGPANPLAGIVICSKCKFKMVMRKLKGVDRLLCRNNKCDNISNRYDSTEKAIIQALERYLKEYRINISNQNKTSDIKPYERQVNILEKELAALNEQKLKLFDFLERGIYDENTFLERSKNIEERITKTSSGIEKINDIINKEKKVVKEEDVIKFQKLLDGYKNTDDIKLKNELMKKLVNKVEYTKNKRGDTFGIDIFPKLKP >NC_004557|1133918:1184898|1160751_1161084_+|WP_011099335.1|DBSCAN-SWA MRQSTTKILGTQKNILALAGALFQNTNIKVSKTVATLKEGILEAGTIVDKTGKKVTDGTAFGIVYEDVDFNNSSGTEVVSVTIFGFIKENVLPQKPATEVKAALKMIQFL >NC_004557|1133918:1184898|1159655_1159856_+|WP_035124918.1|DBSCAN-SWA MIALPKELMGITDYDENDNWILKDGATEEQKKIFEEFKRDLESAKLSDVELFIDGRNIITGEPERY >NC_004557|1133918:1184898|1150031_1150232_+|WP_035124920.1|DBSCAN-SWA MFKGCERLEEIYCTRGSCRRIDCKYNQEHALKQSTQSGIEFTVKAFYGTDKYLFRRRYKKIGEKIF >NC_004557|1133918:1184898|1180202_1181003_+|WP_011099354.1|DBSCAN-SWA MSYTTNFINSVKDGAIASQKKYGVLASITIAQAILESGWGKSSLSRDCKNLFGVKAIGGWRGCKKSYPTYEYYNGKKTLINDYFRVYNSYAESIEDHALFLVNNSRYKQHGFFNEKDYVGQANALQRAGYATSPIYAQQLINLIRQHNLNEYDNINNSYINIDGGGYASYQGGAPGINLIIRDYSKDINRIFAWVDNDKGASWAFDLTPPNSNYTKLFKNTSKVITKRNGGYTFSKGSIYKINVKGYDKKGKVVAENQIVLKVPKC >NC_004557|1133918:1184898|1150592_1151060_+|WP_011099327.1|DBSCAN-SWA MIDKETLAMDEKTFKKTEGVLYNYKDLEVEIKAVELAIKELEIDYKGGRGIGYEERTGETYRINRPIEDEIIYKEKYIEKLELEIEKKKILKEKIENAIRILDDRESEIIKLRYFIKPKKSWVAIGMEVKMDKDYCSLICKEQIIPKLANILWNY |
69 | Clostridium_phage(95.59%) | capsid,holin,terminase,tail,portal | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1651758 : 1658683
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_004557|1651758:1658683|DBSCAN-SWA TTTATTTTTTAAAAATGCTTTTAAATGGGAACTTTTTAATTATAGGTGCTTTATAACTATTTTTTGCTACAAGACTTTCGGTAAATATAAGTTTGTCTTCTTTAAACACATTTATACTTCCAACTTTATCTCCTTCATATATATTGTGATTTATTTCATTTGGCACATTAATTTTTACACTATAGTCGGTTCCTTTTTCTGTTGGAATTATAATGTCTTCTTTGCACACAAGGTCTATATTTTTTTTATCCTTTGGAGCCTTAGCTACAACCTCTCCTTTAGAATGTAATTTTTTATATTCATAATTTTTTTTAACATAATTGTTTACTTTTGTTGTCTCCTTCCATCTTCCAGGACAATCTAAAACTACTATTATTACTTCTCTTCCTGAAATATCTACAGAACTTACAAGACACTTTCCCGCTCCTCCTGTATAACCCGTTTTAACTCCTGTAGCATTTGGAAGTTGCCAAAGTATTTTATTTATATTATGATAATCCCTTGAGAAACTGTATTTATCTTTAGTAATATCCTTTGTACTAACTATCTCATGAAATAATGGATGTTTTTTACTTTCTGCTGTAAGTATAGCCAAGTCATATACTGTTGAGTAATGTTCTGCACTATCTAAACCATGAGGTGAACAAAAATGACTATCTATAAGACCTAAACTTGCTGCATATTCATTCATAAGCCTAACAAATTCTTCTTCACTACCAGCTACTCCTTCCGCTATTGCTATTGCTGCATCATTACCAGATCTAAGCATTAATCCGAATAGAAGCTCCTTTAGTGAGATTTCCTCTCCTTTTTTGTATCCTGCAACAGATCCTCTTATTGAAGACGCTTTAGGAGATATTTCTACCATTTTATCTAAATCACCATATTTTAAAGTTACGAGAGCTGTAAGTATTTTAGTGGTACTAGCCATAGGCATAATAACCTCTGAATTTCTTCCTGCCAACACTACTTTAGAATTAGCATCCAAAGCTATATAAGCTCTAGCATTCACATTTAATTTATTTTCTTTTTTATTTAAATGTTGATTTGCTAAAACACTATACGTATTTATATTTATAGTTATTATAAAGCATATAAAATATAAAAGTATTTTTTTTATCTTCACCTACAAATCACCTCTTCTTCTTATTATTTTCTATATAAATTTTATTAATCATGAAATTACTTCCATAGGATGTTTGTATATTTTTTTATTATTAGGACATAATTTCTAATATTACATGGAGGATGAATTTTATGTATATAACTCTTATATTACTATTATCTTTAATTATTCTTTTATTTATCCCTATTCCTATAAAAATTTATATAATATATAACGAAAACACATTATCTTTAAAATTAGGTAATAAAGTATTTTCTTTAGATAGATTAAAATTTGAAAATATTAATAAAACTTCTCCCATTGAAAAAATAAGTAAAAAAGAAAATAATCAAAAGCTTTCCTTTAAAAGTGTATTGGGGTATTTAAAAATCACTCCTTCAAAGCCTAAATTAAAAATTAGCATATACCTAAACTATGGCTTTGAAGATGCCTATATAACTGCTATTTCCTATGGAATATTCTATTCTATTTATCCCTTTATTTTAGGGCTAATAAACTCATGTTTTACTATAAAAAAAGAAAGGCTTCACATAGAACCTCATTTTAATAAAAATATATTTAATTTTGAAATAAAGAGTATATTTTTCATGAGTATAGCTAAAATTATGTATATATATATTTATTTACTTATAAAGTAAAAAGGAGGTTATAACTCGTGGAAAATCATCCAATAGAAAATTTAATGCAAAGTACAATGGAAAACATAAAAAATATGGTAGATGTAAACACCATTGTAGGCAATACAATACTAACCCCTGATGGAGCTTCCATTATACCTATTTCTAAAGTTTCTTTTGGCTTTGCTTCTGGTGGAAGTGAATATAATTCTTCTATTAGTTCAGATTTAGGCAAATATCCTTTTGGTGGAGGTTCAGGAGCAGGTGTGTCATTAAAACCTGTGGCATTTTTATTTATTAAATCTGATTCTGTAAGACTACTACCTGTAGATCAAACCACTCCTTACGATAGAGTAATAGATACAGTACCGCAACTTGTAGATATGTTTAAAGATATGAGTAAAAGTAAAACTTCTTCAAATAATGCTAATAAATCTTCTACTGTCAATGAATAAATTCCCTCATATTTTAAAATATAGCTTTAGGGAATGTTCTCTAAAGCTATATTTTTTATATAATAAATACGATGTTAAACTTCAGCTTAATGAGCCTTCAGAAGAAAATTAAAATTTTTCAGAGCAAAGCAAGTAAAAATGTCATTAATTGTTCATTATTTAAAGGTCTTCCTCTTGAAATTCTTCTAGAAGTTCTTCTAAACTTGGAAGTTCTTTTATACTTTCCAAGCCAAAATGTTTTAAAAATTCTTCTGTGGTACTATATAATATAGGTCTGCCTGGCACATCTAGCCTACCTGTTTCTTTAATTAAACTTTTTTGCAATAAGGTTTGTAGTGCTCTATCACTTTTAACACCTCTGATTTCATCTACATCTACTCTTGTTACAGGTTGCTTATAGGCTACAATGGACAAAGTTTCTAAGGCCGCCTGAGAAAGAGATTGACTACTGTTGGTTTTTAAAAGTTGTTCTATAAAGCAGCTGTTTTCTTCCTTTGTAACTAATTGATATTTTTTTTCTATTTCTATTAGTTTTATACCTCTTTTTTCTATTTCATATTCCCTAATCATTTTCTGTAAAATTTCCTTAGTATCTAAGGTATTTAATTCTATTATTTCACTTATGTCTTTTAAGCCTAAAGGTTCTCCACTAGCAAATAAAAGTGACTCTATTATTGAGAAAAATCTTAGTTCATTCTCTTCATTAATAATATAGTTCATCTGCTTCATTCTCTTCCATCCTCTCTAAGTAAATTTCTTTAAAATTACCTTCTTGAACCACTTTAACTATTTTTATTCTTATTAATTCTAATAGAGCTAAAAAAGTAACTATAACTTCTCCTTTACAACTACATTCTTTACTAATTTGGGAAAAAGAAAGATTTTTATTTTCTATTATTCTGCTTTTTAAATAATTCATTTTGTCTCCTATTTTAAATTCTTCTAACTCTATTTCCTCTGGTATTGTGTTGCTTAGATTTATTTTATCCTTATATTTAGTTATTAGCTCATTATATAAGTTATATAGTTCTAATATGGTTACATCTATAAAAATATCTTTATCCTCTTCCTCTTTTTTTTCAATTATTTCTGGCTTCTTAGAAAATACCGTGAAGCCTTTTTCTAATCTTCCTTTTAAATAACTAGCTACATTTTTAAACTTCTTATATTCTATCAGCTTGTCCAATAATATTTTTTCTGGATTTTCTTCTGATATTTCTTCCTCTTCTTCATCTTGTATCTTAGGAAGTAATTCTCTAGACTTTATTTGAAGGAGTGTGGAAGCTATAACTATAAATTCAGAGGTTACTTCTAAATCCATTTCCTTTAAACTTTCTATGTACTCCATATACTGACCAGTAATTTCTGATATATTTATATTATATATGTCCAATTGATTTTTCTTAATTAAATGAAGTAATAAATCAAAAGGACCCTGAAAGTTTTGTATTTTAATCTGTAATTCCATATAATCGCTCCTGTGTTTTTCTCTTTTAAGTTTTAAAAAATTATATGATAGCCTATTAAAAGACTATCATATTATAAGCTAAACTGCATTATCGAAAATATTTCTAAAATTATGCAAGATATTATCTAATAAACCGGATTTTTTTATGTCTCTATCGGAATATAGTTTAACCTTTCCTATTAATTTATCTCCCACATAAACTTCGCAATGGCCTACTACTTCTCCTTTATTATAGTATTTTTTATCTAAGTTTACTATAGGTTTTTTAGTAATTTTATCTTTAGAACCTCTTTCTACTACAACATCTAGATCTTCTAGTGCCTTAGCTATAAAAAATTTATCTTCACTTTTACCCAACTTCACCTTTTCTACTTCACTATCTCTTTTTATAATATTTTTATTCTCGAATTTGGAGAAACCATAATTCATAAGGTTGGAAGCCTCTTTATTTCTTATTTTATAGTTAGGTGCTCCCATAATTACTGTAAGCACTCTAACTCCATCTCTTACCGCAGTTGCTGATATACAATACTTAGCCTCACTTGTGTAACCTGTTTTTAAGCCATCACAACCTTTAAAAAATCTAACTAATTTGTTGTGATTAACTAATCCTATAGGGCTTTTTCTACCTTCTGAAATAGTTTCCATATATGTTCCAGTATATTTTAATATTGTAGGATGTTTTAAAAGTTCTTTGGACATTAAAGCTATATCATAAGCTGTAGTTATATGACCTTCTTCTGGAAGACCAGAGCAATTTTTAAAATTAGTATTTTTCATACCCAAGGCTTTTGCTCTATCATTCATGGTTTTTACAAAAGCCTCTTCGCTTCCATCTAAGTACTCTGCCATAGCAGTAGCCGCATCATTACCTGAAGCTATAGCTATACCTTTTAATATTTCTTCTACAGTTCTAATTTCCCCTGTATCTAGTAGCATACTACTATTGCCACCTTGACCATTTTTTTTAGCATTTTCACTTACAGTAATTTTATCTGTTAGTTTTATTTTCCCTGAATCTACAGCTTCCATAGCTATTAACATAGTCATAATTTTTGTTACAGAAGCTGGTGGTAACTTCTCATTAGCATTTTTTTCATATATTATCTTACCAGAATTAGGTTCCATTAGTAAAGCTGATGATGCTTCTACTTGTATTTCACCTTTATCTTCTCCCTCTGCTTTTACCCTTATTGGTACTACAGTACACTGGAATATAAAAATTAAAAGAATTGTTAATATAAAAGTCTTTTTATTTTTTATCACGGTTATTCTCTCCTTTCTTCTAAAAATATTTTTCCCCAGCCTTTAAAATTTATCCATTAAAACTAAAAAAATAGATTATCTCATAAAAATAAGATAATCTATTTGGGTAATTAATAATAGCGATGTTTAGCTCAGCAAACGAGTCTTCTTATAAAACAGCAATGTTTAGCGTAGCAAACGAGCCTTCAAAACAAAATTAAAATTTTTCAGAGCAAAACAAAGGAAAATCTCCTTAATTGTTCATTAATCAAACTCTTTCTATTATGCTTCCTACAAGATTTATAAAGGTATTTTTAACTTTATTTGAAGTTTCTATAACTTCTTTATGATTTAAAGGTTGATCTAAAATTCCCGCTGCCATATTAGTTATACAAGATATACCTAAAACCTTCATACCACAATGTTTTGCTGTTATTGCATCTGGTACTGTAGACATACCTACAGCATCTCCACCTAATATTCTCACCATTTTAACTTCTGCTGGAGTTTCATAGGTTGGTCCTGTCATCATACAGTAAACTCCTTTTTCTATGTTCACATTTATATCCTTTGCAATATTTTCCGCTAGTTCAACTAAATCCTTATCATATACATTACTCATATCAGGAAATCTTGGTCCTATTTTTTCGTTGTTTTCACCTATTAATGGATTATTGAAAGCTAGATTTATATGATCTGTTATAATCATAAGATCCCCCGGATTAAAACTCTCATTCACACCACCTGCTGCATTTGTTACCAATAAAGTCTTTGCTCCTAAGTATTTCATTATATGAATAGGCAAAGATAAAATCCCCATAGGATGTCCTTCATAATAGTGAATTCTTCCTTGCATCATTACCACATTTTTTCCTTTATATTTTCCAAAAACATATTGCCCCTTGTGACCTGATACTGTTGATTTTGGCATATTAGGTATATCTTCATATTTTACAACTACTTTATCTTGGATTTCTTCAGCTAAATCCCCTAATCCTGAACCTAATATTACACCTATTTCTGGTACAATATTTATTTTACTTTTTACAAACTCCACACTTTCTAATATTTTTTTTAAATTCATGTTATTTCCTCCTTTGTTATAACTAGTATACTTAAGCTCTAGGATGAGTTTTTTTATAAACCTCAGCTATCTTACTTTTATTAGATATAGTTGAATAGATTTGAGTGGCTGCTAAATCACTATGTCCTAAAAGTTCCTGCACTGATTTCATATCTGCTCCATTTTGCAAAAGATGTACTGCAAAAGAATGTCTTAAAGTATACGAGTTTATGGATTTTTTTATACCTGATATTTTAGCATATTTTTTAACTATTTTCCAAAAACCTTGTCTTGTCATTTGAGTACCTTTTAAATTTAAAAACAGATAATCTAAATTATATATATTAATGTTTGGTCTAACACTTAAATAATCATCTAAACAGTTCACTGCATATGCCCCAATAGGAATAATTCTTTCTTTATTTTTGGATCCTCTGCACTTTATATAAGAGAGTTTTAAATTAACATCAAATACTGTCATATTTAAAAGCTCAGTAACTTTAACCCCTGTAGCATACATAACCTCTAGCATTGCCTTATCTCTTATGCCTTTTTCTTCAGAAATATTTGGAGCATTTAATAAAATATCTACCTCTTCTACTGTTAGTACCTTAGGTATATTTCTTTTAACTTTGGGCAGTTCATAATTTATAACTGGGTCCTCATCTATATAACCTTTTTTGTATAAAAATTTATAGAAATTTCTAATAGACACTACATTTCTCACAATTGAGGAGTTAGCTTTTCTACTCTTTTGAAGAGACTGAACATATGCCATTATTGTAACTTCTTCTACATCTAATATGTCTTCTTCTCTTTCTTTTAAAAAGTCTAAAAATCTATTTACATCTCTTATATAAGCATCTAAAGTATTTTTGCTTAAGCCTTTATCCATAAGGCGGCTTACATACTTGTTAATTAGTTCATCTCTAGTTCTTATATTCAATTTCAT
Protein sequences of DBSCAN-SWA_3 >NC_004557|1651758:1658683|1657783_1658683_-|WP_035109633.1|DBSCAN-SWA MKLNIRTRDELINKYVSRLMDKGLSKNTLDAYIRDVNRFLDFLKEREEDILDVEEVTIMAYVQSLQKSRKANSSIVRNVVSIRNFYKFLYKKGYIDEDPVINYELPKVKRNIPKVLTVEEVDILLNAPNISEEKGIRDKAMLEVMYATGVKVTELLNMTVFDVNLKLSYIKCRGSKNKERIIPIGAYAVNCLDDYLSVRPNINIYNLDYLFLNLKGTQMTRQGFWKIVKKYAKISGIKKSINSYTLRHSFAVHLLQNGADMKSVQELLGHSDLAATQIYSTISNKSKIAEVYKKTHPRA >NC_004557|1651758:1658683|1656936_1657752_-|WP_011099742.1|DBSCAN-SWA MNLKKILESVEFVKSKINIVPEIGVILGSGLGDLAEEIQDKVVVKYEDIPNMPKSTVSGHKGQYVFGKYKGKNVVMMQGRIHYYEGHPMGILSLPIHIMKYLGAKTLLVTNAAGGVNESFNPGDLMIITDHINLAFNNPLIGENNEKIGPRFPDMSNVYDKDLVELAENIAKDINVNIEKGVYCMMTGPTYETPAEVKMVRILGGDAVGMSTVPDAITAKHCGMKVLGISCITNMAAGILDQPLNHKEVIETSNKVKNTFINLVGSIIERV >NC_004557|1651758:1658683|1651758_1652883_-|WP_011099736.1|DBSCAN-SWA MKIKKILLYFICFIITININTYSVLANQHLNKKENKLNVNARAYIALDANSKVVLAGRNSEVIMPMASTTKILTALVTLKYGDLDKMVEISPKASSIRGSVAGYKKGEEISLKELLFGLMLRSGNDAAIAIAEGVAGSEEEFVRLMNEYAASLGLIDSHFCSPHGLDSAEHYSTVYDLAILTAESKKHPLFHEIVSTKDITKDKYSFSRDYHNINKILWQLPNATGVKTGYTGGAGKCLVSSVDISGREVIIVVLDCPGRWKETTKVNNYVKKNYEYKKLHSKGEVVAKAPKDKKNIDLVCKEDIIIPTEKGTDYSVKINVPNEINHNIYEGDKVGSINVFKEDKLIFTESLVAKNSYKAPIIKKFPFKSIFKK >NC_004557|1651758:1658683|1653538_1653955_+|WP_011099738.1|DBSCAN-SWA MENHPIENLMQSTMENIKNMVDVNTIVGNTILTPDGASIIPISKVSFGFASGGSEYNSSISSDLGKYPFGGGSGAGVSLKPVAFLFIKSDSVRLLPVDQTTPYDRVIDTVPQLVDMFKDMSKSKTSSNNANKSSTVNE >NC_004557|1651758:1658683|1654114_1654684_-|WP_011099739.1|DBSCAN-SWA MKQMNYIINEENELRFFSIIESLLFASGEPLGLKDISEIIELNTLDTKEILQKMIREYEIEKRGIKLIEIEKKYQLVTKEENSCFIEQLLKTNSSQSLSQAALETLSIVAYKQPVTRVDVDEIRGVKSDRALQTLLQKSLIKETGRLDVPGRPILYSTTEEFLKHFGLESIKELPSLEELLEEFQEEDL >NC_004557|1651758:1658683|1654658_1655423_-|WP_011099740.1|DBSCAN-SWA MELQIKIQNFQGPFDLLLHLIKKNQLDIYNINISEITGQYMEYIESLKEMDLEVTSEFIVIASTLLQIKSRELLPKIQDEEEEEISEENPEKILLDKLIEYKKFKNVASYLKGRLEKGFTVFSKKPEIIEKKEEEDKDIFIDVTILELYNLYNELITKYKDKINLSNTIPEEIELEEFKIGDKMNYLKSRIIENKNLSFSQISKECSCKGEVIVTFLALLELIRIKIVKVVQEGNFKEIYLERMEENEADELYY >NC_004557|1651758:1658683|1655501_1656656_-|WP_115638824.1|DBSCAN-SWA MLLIFIFQCTVVPIRVKAEGEDKGEIQVEASSALLMEPNSGKIIYEKNANEKLPPASVTKIMTMLIAMEAVDSGKIKLTDKITVSENAKKNGQGGNSSMLLDTGEIRTVEEILKGIAIASGNDAATAMAEYLDGSEEAFVKTMNDRAKALGMKNTNFKNCSGLPEEGHITTAYDIALMSKELLKHPTILKYTGTYMETISEGRKSPIGLVNHNKLVRFFKGCDGLKTGYTSEAKYCISATAVRDGVRVLTVIMGAPNYKIRNKEASNLMNYGFSKFENKNIIKRDSEVEKVKLGKSEDKFFIAKALEDLDVVVERGSKDKITKKPIVNLDKKYYNKGEVVGHCEVYVGDKLIGKVKLYSDRDIKKSGLLDNILHNFRNIFDNAV >NC_004557|1651758:1658683|1653014_1653521_+|WP_035109636.1|DBSCAN-SWA MYITLILLLSLIILLFIPIPIKIYIIYNENTLSLKLGNKVFSLDRLKFENINKTSPIEKISKKENNQKLSFKSVLGYLKITPSKPKLKISIYLNYGFEDAYITAISYGIFYSIYPFILGLINSCFTIKKERLHIEPHFNKNIFNFEIKSIFFMSIAKIMYIYIYLLIK |
8 | uncultured_Mediterranean_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1663820 : 1682779
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_004557|1663820:1682779|DBSCAN-SWA ATTACTCTTTTATAAATCCATTAAATCCTTTATTCTTTAATTCATTTCTTAATTTATCTGCATTATCCTTATCCTTAAAAGCACCTACCTGTACCCTATATAATTTATCATCTTCGATGTCTAAAGTTTTTAATACTATCTCATTTTTAGTTACAACCTTACCATTTTTATACCCTTCAACTCTTACTTTATAATTTGTATTAGGAGAAAATGTATAACCTTCATTTCTTTTATATATTGCCTTAGAACAATTCTTTTCAAGTTTTACATAGTTATCATTAAGAGGATTAATGTCAAAAGCATAACTAGCTTTTTTATCATTATCAATGTAACCGAATATCCTATCTATATCACTTGAAAAATCTCTTATTATTAAGTTGATTGCAGATTTATTATTTAATATAGTTCCTCCACCATCTACAGATATACTTGCTTTTGATGCAGGTTTATATGGGTTAGGTATTACATTTATAGCCTTGCCTATTTCTCTACCTAGTTTAACCCCTCCTACCCAGTTAAAATTATTAACATCTGCCTTGCTATCTATAAAGAACGGCTCTAGTATAATTGCAGGACAACCCACTTTACCATTCATTTCTGCCAAGCCTCTTGTATCGGGCTTAGCACATCTATTAATAAATCCATTCTTCTTTAATACATTGCAAACGTCTAAAGCATAATTCCTTGAAACACTATCTCCTGGTGGGAATATAACTTCTCCACCCATGGCACTATCGGTTATTTTAAAAGCATTTGCATGGCAAGAGATAAATAAATCTGCTCCAAACTCCCTAGCTTTATTAATACTATATGATAAGGAATCTGCTAAACTTCTATTTGCTTCTGGTGGAGTTACGTCTAGTACTTCATATCCCTTTGATTTTAGACTTTCTATAATATATGGCATATATTGCTCTACAGCTTCTATTTCTTTAACAAAACCTTCTGCTCCTAGATCCTTTCCTGTCCTTTGATGACCTCTTCTTATTGCTATTTTAATACTCATTTTACAACACTCCTCTTTATTTTATTTCTCTTTTTACTGTGGTTTGTTTTATCAACTGATTTCCAAATACCGCAACACCTGCTGTAAGTATTCCTTGTATTACTGCATTTGCATTGAGTCCAATTAAAGCTATAGAACCTAATATGCCTACAACTAATAATATCCATGGTATTGTCCAATCTTTAATCTGTTTAGTTTGTTTTAACATTAGTCCCAAAACATATAAAGCAGGTACCAATATAAATGCCTGCTCTGTAATAAATTCCATTATGTTTATTTCCATAAAGCATCCTCCTATTTCAAAATATTGTGTTGAATTACATAAAAAAAGAAGCTTACAAAGCTCCCTATAATTAGTCCCATAAACCATCTCATAGTTGTGGTTAAACCTTTTATATCTTTACATAAATTTTTTATTTCAATTTTAAACTCTCTTCCATCTTGTTCTATCTTATCTAGCCTTTTATCATGTTCTTTTATAATTTCTGTGTGTTCATTAATTTTTATTTCCATAGTCTCGCTGTTCATATCCATCACCTCATATACCATTTTTCAACATTATAAACATTGTTTACAATCACATATTCACATTACTATAAATTAACATATATTACAATTAGGTGGTATTTTATAGGGCTTTAAATTTCATAAAACTAAAGTCATAGGTATTCACTACTTTTATAAAATTGAATGGATAAAAAAAGACTAGAGTAACATTACTCCAGCCTTTTTTTATCTTATATTTCATTATCTTTAGGTTAATATATACTCTTTTACCTATAGCTTTTATTAATTCAATTTTACTTATTTTTAATCTTGTCCCACACTTCATTAGAGTCTTTATCTACTGGAATATCTAAAGCTTCTTTTGCTTCCTTTATATGTTCTTTGTAATAGTCTAAAGCATCTTGTTTATTTTCTGTTTGATAATATAATAATTTTAATCCTGCTAATGCTATATATCCTACTGTATCTCTTTCTTTATCTGTCACTTCTCTATTTTGTATTTCGCAAATATCTGTTACGTCCTCTGAATATTCTTCGTACTTATCACCATTTGCCTCTGTTTTATTCTTAAAACATCTCTCAGTATAAGCCAACATCTCTACAGACCTTTTATATATCTCTTCATCCATATCTTTTGGAATATTACTTGTGGCTACACTTCCGCAACCAATAAAACTCATGATAAAAACAAATAATAAAAAAATAATACCTTTTTTAGCCAATTACTCCATCCCCTTTAATTTTGATTATTAGCTATATATTACCATATTTAATGATTATTTGTCAATTTTATCTATTTTTCTACGTACAGATTCTAGATACACATTCAAACTTGATTATTTCTTTAAGTTCTTCTCAAATCTTTCTGCTTTATTTTTATATACTGCAAAATTATGATTAACTGGTTTAGCCAATAAACACATCTTGAATTGCTCTTGTTTTTTGCAATAAAAAAAAGACTTCTTAAAAGTCTTGGCTACTTCGCATTATTTTTCTATTGTGTTACTATTTTATAAACACCAATGCTACCCCATAACCCTTACTATCACTGTATTCTGTTGTAACTTCCATAACCCTCCATCTTGTATAGGTAGGATTATCATCTTTAGTAGCTCTACCTAAATTATCAGATTTTATAAAATCCCCTGTTACAACTGTTTCATCTATTCTTACATAAACTTGTCCCGTAAGCCCTACCACATTCCACTCTTCTCTATCTTCTCTAGCTATATATTCTTCTTTCATGTCATGCTCTAAATTTTCTTTGAACACTTTTATATATTGTTTAGTTTCTTCGTCATATTTCTTCTCGTACACCAATCCACCAAATTCATTTTTTAAATATCTATCTTGCCAGTGAAAGCCACATGCATTAAGTAACACTCCAGCAGTTTCGGAAATCACTCCTAACATTTCTTCTCCTTGCCTACAAGGTCTTATTTTATCCCCATCTAATGTTACTATAATTCCAGTAGGGATCGCCTTACCATTAGTACTTTCAAAATACTCAGCAAAGTCTGCGAAAGTGCTGTTTCCAGTAATTGCACCTGTCGCCTTTATTGTACCAGAGTAAGAAGATATCTCCCACTTTTTATTTTTGGTAGAAGGTTCGCCATATCCATACCCTCCAACGATTGTTTGCAGTCCATCATCCTTTACGCACTCACTTGCAACTATTATACTTTTGCCATTTGTTATACATTTACTGCTTGATATTATTGCACTTGCCTCAAAATCCGTCGAGCAGCTTGAACTGTTCACGATAAAACTTCGATTACGATTTGCCGATGATGCTATACTACTAGAAATGACAGTGTGAGAACCAGCGCTTTTACATCTGTATGAACAATTTACCTGCCTATCGTGAGGTGTTGTGACCTCGCCTGTATCGCTTTTAGCAGATATAGTTCCTTTAGTATTTTCTAATTCCTTATCTGTAAGAGGCTTGTTCTTTATTCCTGACCATTCTACACTGTCTGCAACTTCTGCTCTATCCACTATACCATTATTATTTTTGTCGTAAACTTCTTTAAGCATATCCCCTGTTCCATCACCATCTTTACCTTTTTTGGCAATACATATCCAATGTTCTGTATTAGTGGGGTCTATTCCTTTAGAATCTTTTAAAGATTCATACGAACTACCATTGAATGTTACTCTATTGTACTTTTTATATTCTTTTTTAGGGTTATATTCCTCACATATATCAAAACTATCTATAGTATTTTTCATATTAGTATATCCTTGTTGCCTAGCGACTTCTTGTGTCTCTCTTTTATTTTCAGATTCTATTCTTTTAGTTTCATTCTTTTCTCTTGCTTTTTCATTTTCAAGTCTAGTTTTCTCATTAACCTGTCTGATTTTTTCAGCTTCATCTCTATTGGTTTCATTAGATTTTCTAATACTTTCAGTATCAGCCCTTATATTTTCAGATTCCACTCTTTTGTTTTCATTCTCTTGTCTTATTTTTTCAGTTTTAACTCTCTTATTCTCTTCTATTTCTCTAGCTATCTCATTTTCTTTTCTTATATCTTCATTCTTTTCTCTTGCTTTTTCTTGCCTTATTCTCTTTTCTTCTTCTGTATATACAGATTGTATTAAATCTAAATCTTCTGGTTCTATATAATCATTTCTGTAAATACTAGCATCTACAAACATATTAAAAGTTACAGAAGTTATTTTTCTATTATCCTTATCAAAAATACTTAAATCTGCTTTTACTTCTCCAACTGTTTCTAATACTTTAGTAAGTATATTTACTTTTATTTCTCCCTCTGTTGGATTAATTACATCAACCATTTGGAGAAATAACTTATTATCTGGTCTTTTATAATTTATCCTTACAGTTAAACCAGTTAAGTCAAAGGGAATACTGTTCTGTACCAATGTAATATTTAAAACAGAATTATTATCCCCTTCCTTTAAACCCCTTATAGTATTAAAACCAGTTCTTTTCGTATCTATAAGTAAATCAAACTTCTTCTCCATTCATTTCACCTCTTTCTTTATGTATTATTTCTTTAGCATTTATATCAATTATTTGCTTTATCTCATTTATCATGCTTTGCATTGCTACAACTGGCAATCCTACTTCATTTAAAGTATTAATTATTTTTTCTTTAGTATCTAAGTAAACTAAATTAAAAGACTTAGGCTTCACCTTCTAATTTCACCCTCCTTCCTTTAAAATAGATACCACTTGGTGTTATAGAAGCTAATATACTTCCTCTATTATAAAAGTCAAAATCCCCACTAATTTTTAGATAGTTATTAGAGGAATTTCTTATTCTTATGTCTCCACCACGGCAATCTATATCTTGTCCATAAGGAAATTCTATAATTCCACTATGTGGCTTAATCTCTATTCTATCGGCAATCATACTTATAGCCGAATAATCTTCGCTTATGGCTAATGCTATTTTTGCAGCTCTAATATCTCCTCTACCATCAACTACCATTTCTATTTTATCCGCCTCTTGGGAAATCCAAGAACTTAAATCTCTATAGTTATCTTGCACTTTACTTTCTATCTTATCAGCTGTTATTTCTATCTTGCTATTAAGCTCTTTTTCTGTATCTGATACCATAAGTTTTATTTCATCTGCGGTTTGAACTATTTCAGATTGAAGCTTATTGTCAACATCATTTACTTCTAGCCTTATCTCTTTTGCGGTTTGGGTGATTCTTGAGTTCATATTTTCATTCTCGTCAACAACATATGTCATTATCTTATCAGCGGTCTGTACTATCTCTGAATGTAATTTATTGTCAACGTCGTTTACTTCGGTTCGAATTTCTCTAGCGGTTTGTGTGATGTAACTATTTAAATATTCATCCCCAACCTTCATTTCATATCGTATATCTTTAGCAGTTTCTTTTATAGTATTATTAGTATCAACAAAATTATTGTAGGTATCTTCTCTAGTATACTGTAGTTCCTCTTCTAATGTATCGGCTTTTTTAGTTACTTTAGTTAAGGTCATTTCTATAGATTTCTGTATCATACCTAATTCTAATTCTATATATTCTTCATTTAAAGCATCCCATATATACTTAACAACTCTTTGTTCTGTGTCAATCCCTAATTCCTTATGCGTAACTTTAAGTTTATCCCCTATATGAAGCTCTTCCAATTGTTTGTAGTGCTTATATTCTTCTGTTTTGCTTAACTCTATAAAATCTACAGTGTAATTAATAATAGGTTTATCTAACCCCATTTTGAACATGTGGTCTACTTTATCTCTTAATAATTGGTAAACCTCATCTTTAGTCAACACTGCTCCTTGTGTCCTATCTGAATTGACTTTATACTCTCCCATATGAATATGTTTTATCTTTGGATGTGGATATTCATTGATTAACGGGCTATCAATATATTTCTCTGGTAATAACAGTGAACTATCACTTTCGGTTAATCCTGTTGGCATAACTCTAGTAGCAACTTCTTCCTCGTCAACATGAGCCTCTAAACCAGTTAAATTCTTACCATATTGTATATGAACCCCTGTGTCTTGTCCTATTTTCTCTAACATTTTTACAATATAGTTATCTCTATATAATTCTCCTCCCCACCTACTGTTAAATGAGGTTTCTGAATCACCTAATAGTATTTCAATTGGATTTTTTCTAATATAATATGCAGTAGATTCTGTGTTTAAATTTGAACTAGCCTTAAACTTTGTAGGATACTGCATATTACTGAATATATATTTTATAGCTTCATCTGCTCCCATGTTTTTAGGTCTACAATCTTCTAAAAAATTATCTTTTAAATCATAAAATATATGCTTAGCATATACTTCTATATTAGATAAAGTTTTTACTACTCTATCTATTCTAAATAATTGAAGCCCTTGGGGTGTAGATACTTTTAGTATATTATCTTTTTTTAAGTGTTTACTTTTACTATTATAGGGGTGTATTAGTTCTAGTCCATAATATCCATTTAATTCTTCTTCTACATCACATCTTATAGTGTTATTTAATATGGCTAATCCATTTGAACTAAACTCTGTTGCTAACTTATCATAAACTTTAAGCATGTATACACCTCCTATTTATATTTCCATTGTACTTCTACTGTTCCATTAGATTCTAGTTTTATAATGTTTTTACCTCTCTTAAACACAGGATAATCACCTGTCATGTAGTTATTAAGATTTTTATTTTCTTCATAACACTCAAACATAACACTATCTATTGTTACATATTCAGATACATTATCTACTTTAATTGCTTTATTATTGATTACTACAGACACTGTTCCTGTGCCATATACTGTTAATATAGGTTGTGATTCTAGCCTAGAGGGATTGATAATTTCATGAGTTCCTTTGGTTAAATTAATAGTTTCTAATGGTAAATATCTAAAAGGCTCACATATAAACACAACTTCAAAATGTCCTGTACTTCCATCAATGTCTATACCCTCAAATTGAACATCTTTAACTTTATATTCATATTGTGGCAAATGGGACATGATTAACGTTCCAGAACCTTCTAGCCACTCATTTATAAGCTCACTATTGTCTATTATACTGTTGGTATTATTGCAATCACAACACGTATCAGAGGGCAGAATGGTACAAGTATAGGGAAGGGGAATGTCAAAATAACCCTTCCTATCTATGGTTAGACTTCCGTTTCTACCCACTACCTCCACGTGTTCAATATCTTTTTTAGGTTTAATTAAAGAAGGTTTCTTATTGATAACTATATTAAAGTCTTTACTATTATATCCTACGCCATCTGATTGAATAAATACAAAATTATGCATATACTACCTCCTTCTTCTATATTCCATTTCTGTACCTAAGAATGGAACTGTAGCTTGGGCTATTACTCTACCATCTACATTAATAATGACATCCTGCTTCATGTTTTCCATTTCTTCTCTAATCATACCTTTAAGCTTTTCAATAGGTAATACCGCTTCCATAGCACTGCCTCGACCATTGTTCCTATCTCCAACACCTATACCATTAGGGAATACTGTTGGCTTAGTGAATATACCTCCCTCATGCATCCATTTAACGCCAAGCTTAGGCACTGTCATTTCCTTTAAACTAAACTTACCTTCTAAAGTAAATACTGGTAGCTTAGGTCTTGGTATTTTAATCTCTGGTAATTTAAGATTCTTGAAGAAACCAAATATCTTATCTAAAGCGGTCTTAACTGCGTCTTTTGCTACTGTCATTGGCTTAGTTATCTTTTCTTTAATGTTATTCCATATAGTTCCTACTTTAGTTGCTAGAGAAGTAAATATATTACCTGTAGTTTCTTTTATTCCATTCCATACGCCTAATACCTTCTCTTTAACTCCATTAACCACATTACTGATAACGGTCTTAATTCCATTCCATATATTAGTTACAACGCCTTTAATAGCGTTAAATACGGTACTAGTCACATTCTTAATAACATTCCATGTAGTTGTTATAATAGCTTTAACTCCATTGACTACGCCTGTAATAACCCCTTTTATGGCGTTCCATACGGTTGTTACTACTCCCTTTATAGCATTAAATACAGTAGTGGTTACAAGCTTAATACTCTCCCATACTGTAGTAATAACATTCTTAATTCCTGTAACTATAGTAGTTATAAATCCACTAATAAAGTTCCATACAGTTACTATAATATTCTTAATCTTGTTCCAAGAAGAATCTTGGCGATCAACTTGTTCACTTATGATATTAGTTATCCAACCAAATATACTAGAGAATATACCTTTAATCCAATTAAAAGCTCCACTTATACCGCTACTAATACCATTCCATATGGCGGAAATAATTCCTTTGATGATATTAAGTGGTACTTGTATTATAGCTATTAATATGGTCATGGCTGATTCTATTAAAAATCTCATACCTAATAGGAATCCTTTAAAGATATTCATGATAAATTCACCTATTGGAGCTAAGAAGTTAGCTATACTTTCCATTAAGTTGCTGAACCATTCAGAAATCGCAGTACCTATATTAGTACACCATTGAATAAAGCTATTATACCCATTAGTAAATAAGTTACTTATATATTCCCATTCATTAGCAAACCATGCGGAAATCCTATCCCAATTCTTATAGATTAGAAATGCACCCATAGCTATACCTGCAATAGCTACTGTAACCATACCTGCTGGTGATAATAAGAAAGTAAATGCTTTACCTACACCGCTCATTATACTAGGAATACCTTTTAATAGTTTAGTACCCTTCATGAACGCCATGACGCCCTTCTCTGCTATCTTAGCCTTACTACTGACTAATAAGAAGCCTGTAGCCATTGTTCCTAGACCTATAGCCAATGTATCTATTATAGCCTTATGATCTCTATAAAACTGTGTTAATTCTTGTATTTTATTACTTATATATTCTATTTCATTAGGTATTCCTTGTAATACTTTTGCAAGTCCATTAAAGGTATCAACTAATAGCCCGTCTTGCATCTTACCCAGCTTAATCAGCAAACCTTCCCATGCTGAACTAACACTTGCCAATGCTCCACCTAGACCGCCATCCATAGTATCTGCCATTTTCTGTGTTATACCATTAGAATTATTAATAGATTCAGTTAAGTCATCAAATTTCTCTTGACTTGCATTAAGAACTGCGGTCATACCTGCTACTGCCTCTGTACCAAATATAGTAGATATATATTGTGCCTTTTGAGCGTCACCTAATGTACTGAACTTATCTCTTAAATGTACCAATAATTCATTAAATGGCTTAACTTTACCAGATGCATCTGTCATAGATATACCTAATTTCTTTATATCCTTAGCACCTTGACCAACTGGATTATTAAGTCTTAATATTGCATTTTTTAATGTGGTACCAGCTTGAGACCCTACAATACTCTTATTAGCCATTAGTCCCATAGCTACTGATAAATCTTCTATCTTATATCCTAATCCACCCGCTAATGCACCAGAATATTTTAGAGTTTCCCCTAATAATCCTATATCTGTGTTACTAGCTGATGCAGTAGCAGATAATACATCCGTAAATCTACCTGCTTCCTCTGCTTTCATACCAAACATAGATAGTCCATTAGCAACTATACTTGAAGCATTACCTAAATCTATACTATTAGCCGTAGCAAAGGTTAGAGTTTCCCCTATACCTGCTAATACTTCATCTGTTTTCCAACCTGCTCTAGCCAAGGCTTCCATTCCTTCTCCTGCTTCTGTAGCACTAAAACGAGTGGATGACCCCAATGCTTCTGCTTTTGATTGTAACTTAGCATATTCTTCACCTGTTGCTCCCGTAACCGCTTGTATCTTACGCATTTGATCATCATAGGTACTATATACGCCTATAATCTTCTTTTCAAAATTTACCATTGCCTTAACAGTAAATGCACCTGCAACAACTTTACCTAAAGTTCCAAATTTATTACCTAACCCCTTAGCTACTTCTTCGCCTTTTTTACCTACATCCTCCAATGGCTTTGTGACTTTATCGTCTAGTCCAATAGAACAAAATAATTCAAATAGTTTAATCGCTCGCCACCTCACTTCTAGGGTTAGTGTTATTTAGTATCATTTTAGCTATATTAAAACCTTTTTTGCCTCTATCCTTATTATGGATAGTTGATTTTGAACCTGCAGTAAAACTAAATTCATTGGGTACTAATTTCTTATAGTAATCATTAAACGTAGGTGGTTTTTCTTTACTACCCATATACATAGGTAATATAGCTAAATATTGCTTAAATAGATTGTCCCTAACCTCTCCATTTATACATTCTTCTACCAAATTAATAAAAGAGGTACTGGGTAAATTCTCTACATAAGATAGATTACCTCCATACCTCTTTAACAAGTTGTCCATAATAAAAGTCTCATTTATTTGGTAGTTTTCATAAAACCCTGTATCTGTTCATTATTTATTATTTCTTCTATATCCTTTACTAGATCTTCTATAGGTTGTTCAGCTATTTCTTTAGCCTTAACTCCCTTAATGTCAGCTATGAGTTCTGTAAGTTCCTTTTCATACTCTCCCATTAATTCCATAACTGCTTTTATTATTACTATTCCTTTAGCCTGTGCATCCATAGTTTTGTCTTTACCTACTTTTATTAATTCTTCCATATCTATATTCTTTCCTATTCTTGCTAATGCAAATAAATGTTTCCCTTTAATATCCATTAATAAACACTCCCTTTATAATTATATTTTTTTACTAGAATCGGTCTTATTTTTCTTCTGGTGGCGGTTTTGGTAATTCACAAGTATCGTTTATATCTAATTTAGGATAGTATATAGCGAATGGTGGATTATCCATATTATCAGGGTCGTACGAACCATGAAACTTTAATTCTGCTACTGCGTTATCCTTATCAACTACATCTATAGATAAACCTTCTTCATTTAATACATTAAACACTTGGATAATTATAGGTCTACTAGAACCTGTTATATTACCCATGAATGTTATATTATCTAAATAATCACAGTGAGTTATTCTATTCTTACCTCTAATCTTAGTATAATCAAAGTCCTCAATCTTTATTTCTTCACTATCTGTTGCACCAAAAGCATATCTAAATGTATCTGGCTTAAACTCTACTACGTTAGCCGTCATGGATACGTCCCAACCGTCTAATACAGTAGAACCCTTAGTCTTACCTTTAACACCATCTATCTCTATGTCTCTAAATTCTGGTACTGCTTCGAAGTTTCCTCCTCCACTAGTTGCTCCTATACATTTCTTCATAGCCGTATCTAAAGTGTCATTTTCTATATCATAATTAATAAAGAACGCACCTGCGTCAAATATCAATCTTTTATTTGTATCCTTGCTTAATCCGCTCATTGCCATAAATTAACACTCCCTTTGTTATATTATTTGTACTATATAATTTAGCTTTATGCGTCTTATTCGTCTATCTGGGTCTTTAATATTGTTCCTATATACCACATCTCTATCTATTACATATAACCTTCCATCACAATTCCATTGTCTTTTTAATAATTGTTTATCTATTAACTTAACCATATCCTCCATAGGTATAATTTGGTCATTTCTCCCCCACACATTAATAACTAACGATATATGCTCTTGAAATCCATGTGCGTCCATAGAACCACTTTCTAACTCATAGGATAAGTTTATATCATCAATATTAATAACTCCCTCTGTGTCCCGCTCTAAGTCTTCAAAATATACGTTATCACAACAAAAACTTAGCAACTTGCTAACATAAGTCATTAACTCTACCATATTTATTTGCTCACATTACTAAAGTTTTTCTTTATGATGTTCACTACCGAGGATTGAGTTTGCTTAATTCCAAGGCTCATAAAGGGGTTAGACTTCTGTCTACTAGTTCCTTCGTGTACACTTATAGCATAAGGAACATTAGTACCTATATATACCTTTAAATCCTGACTATCTACTTTATGGGTTATACTCCGTCTTAATTTACCTGTATTAACTGGTGTAACCGCTTGGGTGTTTGTAGTAGCAGTTATACCTAATTCATTTAAACTACCTTCCATGGCTTTATTAATAGCTTGTTTGACTTGCTTAATATTATTAATGTATTTCATTTATAACCACTTCCTTATAATTAAAGTCTGGGTCTTCTTCCTCTTCCCATTCTACAACCTTAACTATCTCATAAGTATTGCCATTTCTCTTGAGTACCTGTTCATTAACTATCAAAAAAGGCACTAAGTCCATATATATAACATGACTACATGCCTGTGTTATACCATATATCCCTAGTGCTTTTTCCTGTGTTAGGGGTTGTATGTCGCATTTAAAGTTAGTTACTAATTCTGTTTCTTTATAGTGTACTTGCTTAATCTTATCTAAATACTTCTTTCCCTTATTCCAAAGGGACATAGTATGCTTATAAAACACTCCTAATCCCTCCCTTAATACATTTTTACGTATGGCATGGGAAGTAAATTAGCAACGCTATCAGTTATTGTAAAAGCAGTATTATCTGCATATGTTATACTTCTAGCCCCTTGAGTTTTAGATTTAATATTTTTATTATCTTTAAAGTTATAAGCACTAACAACTATATCTATTATTGCATCCTTAAAATTAGCTTGAATATAATCACTTTTAAAGTTGTAATTGTTAAGATAGTTTTGTATTAGAGTTGTAGCTTTTCTTATGTAAATATTTAAAATAGAATCTTTAGAATTATCATTTATATTTAATAATATTTTTATATCTTTTAACATTTATCATTCCCCCTCATTATTAAAATAAGAGGGGTTGAGTTACCCCTCAATTATTAAGCACTTGGTGCATCTAAGGTGTAATATAAAACTGCCATAGCATCTTCTCTAAGAACCACACCATCATATACAGCTAATCCTCTAATACCATCAGAGAAAGCATTTTGTAATCTCATAGCTTCTATTTCAGAAATTTGTTTATCATATCCTAAAGATGACTTGTGCAATGCTAATACGGTGTTAACTGGTAATTCTTCTGTTTCCACAACCGTCATACCATTTATTTTAGCATTAGCAACCACACCATTAGCCAAAACTTCTGGATTTCTAGTAAATCTATCATCTTTAGATAATAGTCCTAAAATTTCAGAATTAACTACTACAAATCTGTCTGTTTTTGGTGCTTTATTCTTAGATAAAATTGTACCTAAATCAACTATATAATCATATATATTTGTTTTATTTACATTCTTTTTAGTAGTAGAAGAACCTATTTTATTACCTGTTTTAACACCCTTAACAGCTTTGTCTAAGATATAATTATCTACTGTTTCAGCTAATACAGAAGAATGTTCCTCTGTTGTAGATTTCATTATATCAGCTACTAATTGCACCTTGTCTACATCTTCTAATGCAAAAGCAAAATATTTCTTTTGTGTAAATGTCATTTCTATAGGAGTTGTAGATATTTCGTCCCAGCTAATAGAACCTGTATAATCTTTTACAGAACCTGCTCCTACCCTATTAAATATTATTTTGTTACCATCTTTATTTGTTGGTTTAGTTGTTATCACATCAGCTATACTTACATTGTGAAAATTGTGTAGTAAAGCCCCTTCCCATAATGTTTTCTTAAAGTTTTCTATTGCCATAATTCATTACCTCTCTTTATATTTTTCTTATTCTTAATTTGTTCTTTAACGTCTGCAAATTCCAAAAAGACATAAAAAATAAACTAGAATTATTTATCTAGTTTTGAAAATTGTTCTGCTATTTGTTCTGGTGTCATATTATCAGCATTTTTAAGTAAATTATCATATGTGTCTATATTATTGCCATTATTTTTAGGTGGTGTATAAGAAGAACCTTTTAATCTCTCTTGTACTACTGTTTCTAATTGCTTATTAAAAACATCTTCAAATTTATTTAAATTATTAAGTGTAACTTCTTCATCTTTACCTATAAAATAATCTACTATTTCTGTTGGCAACTTTTTTTCAGTAGCTTGTTTTAAGGCTATATTTTTTAATTTTTCATATTGTTTTTCTTCCTCCATAGCTTCAATTTTTCTTTGTAATTCTTCTAAAGCTATATCTTTTTCATCCCTAGCAGGATTTCTTTTTCTAATTTCTTCATTTATTATTTTTTCTAAATTGTTTTGTTTCCATGTTTCTAAAGCTTTACCATAGTGTTTATCTTTTTCACTATCCAAAAATCCTTTAAAATCTTTATTAGTTGCAACTAAATTTTTAAAATTATCTATACTTAGCATAGACTTTGCAAAATCTGTGCCTTTTAATATTTCGTCAATGCTTTCATTGTCATCTATATCTTTAATTAACTCTAATAAATCTTTCTTTAACATATTATCAACTCTCCTCTTTCCCTTAGACGATTAAATATCCCCTAAGACAAATTTATATTTATAATAAAAACAAGACATCTACTAAATGTCTTGTTTTTTTGCCCATTCGTTATAATTTATAAATGGTACATATTCTTTTGTTATATTATCCTTCCTAACTTTAGGTTTCCAATTATCAGAAGGTATATTTATTAAAACTGACCTGCACAACGGGTGCAATGGTGGTATTGGTTTCTTCGTGTCGTCTACATTAAATTCCTGTCCATCATAACTTCTACAGATATTAGAAGTTCGCTTGTCCAAAGTAGCTATAAACATTTGTTTTTCTATCCCGTACTCTTTGGCAAAGACATCATTTACAGTACTCTGGCATCTACACACCTCGGTTTCAATTAATCTTTTAGTATTATAGGCATTTTGGTTAAACTTATTTTTTATATTTTTTTCTATCTTATTAACATTTGTTTTTCCTTTTAGAAAGGTACCAACTTCTTTCTTTAACTCTTTCTCTAATTTCTTTTTATTCTTCCATAACCTATTACTCCAAATTTCATCTTTAATTTTGTCATTAACTATTTTTTCAATTTGTTTATCAGTTAATTTTTTTAAATTAAAGTCTATACCAATATTAAGTAAATATCCATCTTCACAATATTTATCTTTAGAAACTATAGATAGAATATCTTTAGTTATACTTTTTTCAATTTTGTATTGATCAGTACAAATACCGCTTATAATACTATTAAATTCTTTAGCTAGAGTTTTCTTGTCTTTATCTCCAATAGACAAAACTTCTTCTACCACAGTATAAGTTAAAATCACTCTAGCTATTTTATTTAATAATTCTTCTTTGTCTTTCTTCTGCTGTTTATAAATATCTTTAGATTTTTTATCGGCTAGATTATAACTTTCTTGTTCAAATTGTAAATATAAATCCTTATATAACTTATTCATATATATCAGCTTCTTCTAATAGATTTTCACCTATGTTATTTTCTTCGCTTTCAGCTTTTAATTTTTCTAATTCATTTTTAGGATTTTCAATGAAGCTTAGTAAGCTTAATGCTGTTTCAGTACTTAACTTATCACCCAATTGTGCTATTACTTGAGCTATCATCAAATCATCTTGTGGAATGTTTGGTGTAAATTTAATCTTTATATCTCTAAAATCATACTTTATATTTTTAAGCACTCTAAGATATACAAATAAAAATTTAAGTCTAGTTTTTATACAATCAGCGATTGATTTTTGATTTAATTTGCACTTTTCCTCCAAAGATATTAATCTACTTCTTAAAGCTAAGCTAGACGTATTGCTCTGCATTTTTTCATTGTGATTTATATGGCTTGTTATTTGATAAATTTTATCTTCTATTGTTTCAAGTGTATTTTGGATAAATGTATCATTAATATTTTTTATAAGCCATTCAATTTTACTGTTTTTATCCTTAGCTTGTAATACACCTAATTCTTTCATTTTAGGAATATCTTCTTCTGGAATTTGCACCCCAGTTAATACTAAATATGCATTTCTAAAATCAGATATTTCATTACTTATATCAGATAAGTTTGTTTCGTAAGCGTCTTGTAATCCTTTTATATCCTTGTAAATAGTGTTATCCTTACCTTCTTCACTTAATTGTGCTAATCCAACTGGAACTTGTCCAAATATATGTGGTGTAGGTTTATTAATTTCCTTAAATTCCTCATCAAAATGATAAATAAGGTTACTATTATATACATCAATATGTATTTCATCTTTAAATTTTAACTTGAACATATGTAGAAAATATTGTATATTACCAAAATCATCAATAACAGCATATCCATTTTCAGGATTTATTACTTTACTTGAAAATTGTCCCTCTCTATCTACATAATATAATTCATAAGCAAGTGAATATATAAGCATATTTTTAGCTAAACTACTGTCATGTCCTTCACTCCAATGATCTAAATAATAATCTATGTCATTAACTATGTTTTCATTTCCACTTTTGGATATGTAGGTTACATCATTTCCAACACTATAACTAACTTCTTCATTTATCAACTTTTTAATGTAGTTTACATTAATCTTATTATTACTTCTTTCAGTTACCATTTTATAATTTTTAATAGCATCGGTATTGCCCTTATAATATTCATACATTTTGTTATATGTAGTTTTATTTAATTTATAATCCTCATAGCACTTTTTTAATAATTCTATATCCATCTTTTCACCTCTTTCTTAGAAAAATAATCTTCTATCTAATATTTGTAAATTTGATACGACTTCAATAGTATCTAATCTATTACTAGCTTCTGCAACACAATCCACAAAGTCATCATGTGGTGTGTATAACTGTCCCTGAAACTCCATTAATTGATTTAATGCCTCTGGCATAACTCTTTCTTTATCAAATATAATTCTTCCATTATTAATACTATCTATAATAGTAGAAATTTTTTCATCTTTATTTTTCCTTTGTAATTCGTTTATAAATTCAAATTGTCTATTACATAACTCTTCATCACTAGCAATAAACTCTTTTAGCTTATCTAAATCTAATCCCATATATGTATTTTTTTCTATAGCTATATGTGTTATATCCTCAAATTTCTTTAATGTATTACATATATGCTTTATGTATTCTTCAAAAGACACAAAGTGTAATATTTCACCCTTACGGACATATTTGAAATCATTATCTGCTAAAGATTCAACTACAAAGGCAAAATAGTCCCCACGTTTTTTATTTTTTATACCAGCAGGGTCAACTGTAAGTATAGTTTTAATAAAATTATGGTCTTCTATTTCTTCTGTAGTTTGTACCACATTACTCTTAAACCACTTTTCTCCTATACTTGTAGCGTTATTCATCATTTCCGACATAAAAGCTTGTCTATTTTCCCAATATGCTATTGCTATATCTTCAAAACAATCCCATTTTTCTTCCCAAAGTACATGAAATTTCATTTCTTCATAATTTTGTTTATAAAACTCTTTTGCTTCTTCTTTTGGATTATCAATCTTATCATTAAAATATATTTTTTTACATTCAAGCCATAAATCAGATTCAAGTATATCTTCTACTGTTTGTCCTTTTTCTAATATTATTGCTTGTCTTAAATATGTATAATAATCTTTATTTTGTGCTAATCTAGAAATTAAACATTGTGAGTGTAATACAGTTCCAATAGATATTATTTTAGTAGCACTTTTAACCTTTTTTCCATTACGATAAACTGCTGTATCTCCTACCTTTTCTATTTCTTTACACCATTTATCATATTTTTTCTCTCTAGCATCTTCTGTTAATATGTCTTTCTCGTCCTGATAATCGTCAGCAATTACTACTGTTGGTCTTATTCCTTTAAAGTTAGCACCTCTTACAGAACTACTAGACCCTACTGCTCTTATATACATTCCATTAGTAAATTCAATTTCATTTGCATTAACTTTAAATTTTCTATTATTTATTAATTCCCCAAAGTTATCTATTATAAATTTATTTTCTTTAAATACCTTTTTAATGGAATCAACAAACTGGGTAGCATCATCGTCTTTTTTAGCACCTATAAGAGTAAATTTAGATTTTCTATAGCATATTAACCAAATAGAAATAGCTAAATCAAATATAGTTGTTTTTGCAAAACCCCTGGGACAAATAATATTAACTTTATCAAATATATCATCTGTAAAAGTTTTATTTGCAATATCCCAAAGGTGGTAGTGGTCTTTAGATAAATTTCTAGCTTCGTTTGTATCTTTAACAACAAATATATCCTGTAAGAAATACAAACAGAAAAATGCTATATCTTTTTCACCTAGAGCTATACATATTTGGTCAAGTTTCTTCTGATTTTGTTTAATAATCTTAGTTGCTATATCTTTAGAATAATGTTTAGATAAATATTTATATAATATGTAGGTATTATATTCGGCTTGTGTAAAATCTTTATTATCGTAATATATTATTTGTACCACCTCCTAATTGACTTGAAGTTAAAAAATATAATAAAATTTTATAGAGCTTAACAACGCCACTTTCCTATTTCTAAATTTAGAATAGGGGCGGGTATAATAAAAAAGAAGATAGATATTATTTATCTACCTTCTAAATCTCTTTTAATTCTATATATAGTTGTTCTTCCTAGTCCTGTCTTATCTACTATATCTTTAATCTTATAACCTTCTCTTAACATTAACTCTACTACATCAGCCTTTTTATTTCTCTTGTTAGGTCTTCCACCTTCTCGACCTCTAGCCTTAGCAGAAGCTAAACCTTCTTTAGTTCTTTGACTTATTAAGTCTCTTTCCAATTGGCTTAATCCCGCCATGACAGTTAGTAAGAAACTATTATAAGGATTATCTGTAGTAGTATCTAACCATGTGTCCTTAATAGATTTAATACTAGCACCTTTATCTTTTATCTTTTCTACTATTTCTAATAAGTCTTTTGTACTTCTACTAATTCTTGTTAAATCAGCTACTATTACAATATCATTTTCTTTTAACTCATTTAACATTTTATTTAATTGTTCTCTATCTCTTTTAGTACCTGTTATCTTTTCTTTATATATACTTCTTTCATCTACTCCATATTTAACTAAAGCGTCTATTTGTCTATTTAAACTTTGTTCCTCTGTACTTACTCTACAATATCCTATTAACAT
Protein sequences of DBSCAN-SWA_4 >NC_004557|1663820:1682779|1668303_1668492_-|WP_035125149.1|DBSCAN-SWA MKPKSFNLVYLDTKEKIINTLNEVGLPVVAMQSMINEIKQIIDINAKEIIHKERGEMNGEEV >NC_004557|1663820:1682779|1666337_1668320_-|WP_011099751.1|plate|DBSCAN-SWA MEKKFDLLIDTKRTGFNTIRGLKEGDNNSVLNITLVQNSIPFDLTGLTVRINYKRPDNKLFLQMVDVINPTEGEIKVNILTKVLETVGEVKADLSIFDKDNRKITSVTFNMFVDASIYRNDYIEPEDLDLIQSVYTEEEKRIRQEKAREKNEDIRKENEIAREIEENKRVKTEKIRQENENKRVESENIRADTESIRKSNETNRDEAEKIRQVNEKTRLENEKAREKNETKRIESENKRETQEVARQQGYTNMKNTIDSFDICEEYNPKKEYKKYNRVTFNGSSYESLKDSKGIDPTNTEHWICIAKKGKDGDGTGDMLKEVYDKNNNGIVDRAEVADSVEWSGIKNKPLTDKELENTKGTISAKSDTGEVTTPHDRQVNCSYRCKSAGSHTVISSSIASSANRNRSFIVNSSSCSTDFEASAIISSSKCITNGKSIIVASECVKDDGLQTIVGGYGYGEPSTKNKKWEISSYSGTIKATGAITGNSTFADFAEYFESTNGKAIPTGIIVTLDGDKIRPCRQGEEMLGVISETAGVLLNACGFHWQDRYLKNEFGGLVYEKKYDEETKQYIKVFKENLEHDMKEEYIAREDREEWNVVGLTGQVYVRIDETVVTGDFIKSDNLGRATKDDNPTYTRWRVMEVTTEYSDSKGYGVALVFIK >NC_004557|1663820:1682779|1673830_1674166_-|WP_011099756.1|DBSCAN-SWA MDNLLKRYGGNLSYVENLPSTSFINLVEECINGEVRDNLFKQYLAILPMYMGSKEKPPTFNDYYKKLVPNEFSFTAGSKSTIHNKDRGKKGFNIAKMILNNTNPRSEVASD >NC_004557|1663820:1682779|1675174_1675558_-|WP_035125143.1|DBSCAN-SWA MVELMTYVSKLLSFCCDNVYFEDLERDTEGVINIDDINLSYELESGSMDAHGFQEHISLVINVWGRNDQIIPMEDMVKLIDKQLLKRQWNCDGRLYVIDRDVVYRNNIKDPDRRIRRIKLNYIVQII >NC_004557|1663820:1682779|1668481_1670374_-|WP_011099752.1|tail|DBSCAN-SWA MLKVYDKLATEFSSNGLAILNNTIRCDVEEELNGYYGLELIHPYNSKSKHLKKDNILKVSTPQGLQLFRIDRVVKTLSNIEVYAKHIFYDLKDNFLEDCRPKNMGADEAIKYIFSNMQYPTKFKASSNLNTESTAYYIRKNPIEILLGDSETSFNSRWGGELYRDNYIVKMLEKIGQDTGVHIQYGKNLTGLEAHVDEEEVATRVMPTGLTESDSSLLLPEKYIDSPLINEYPHPKIKHIHMGEYKVNSDRTQGAVLTKDEVYQLLRDKVDHMFKMGLDKPIINYTVDFIELSKTEEYKHYKQLEELHIGDKLKVTHKELGIDTEQRVVKYIWDALNEEYIELELGMIQKSIEMTLTKVTKKADTLEEELQYTREDTYNNFVDTNNTIKETAKDIRYEMKVGDEYLNSYITQTAREIRTEVNDVDNKLHSEIVQTADKIMTYVVDENENMNSRITQTAKEIRLEVNDVDNKLQSEIVQTADEIKLMVSDTEKELNSKIEITADKIESKVQDNYRDLSSWISQEADKIEMVVDGRGDIRAAKIALAISEDYSAISMIADRIEIKPHSGIIEFPYGQDIDCRGGDIRIRNSSNNYLKISGDFDFYNRGSILASITPSGIYFKGRRVKLEGEA >NC_004557|1663820:1682779|1664841_1665102_-|WP_035125262.1|holin|DBSCAN-SWA MNIMEFITEQAFILVPALYVLGLMLKQTKQIKDWTIPWILLVVGILGSIALIGLNANAVIQGILTAGVAVFGNQLIKQTTVKREIK >NC_004557|1663820:1682779|1670385_1671108_-|WP_011099753.1|DBSCAN-SWA MHNFVFIQSDGVGYNSKDFNIVINKKPSLIKPKKDIEHVEVVGRNGSLTIDRKGYFDIPLPYTCTILPSDTCCDCNNTNSIIDNSELINEWLEGSGTLIMSHLPQYEYKVKDVQFEGIDIDGSTGHFEVVFICEPFRYLPLETINLTKGTHEIINPSRLESQPILTVYGTGTVSVVINNKAIKVDNVSEYVTIDSVMFECYEENKNLNNYMTGDYPVFKRGKNIIKLESNGTVEVQWKYK >NC_004557|1663820:1682779|1674529_1675156_-|WP_011099758.1|DBSCAN-SWA MAMSGLSKDTNKRLIFDAGAFFINYDIENDTLDTAMKKCIGATSGGGNFEAVPEFRDIEIDGVKGKTKGSTVLDGWDVSMTANVVEFKPDTFRYAFGATDSEEIKIEDFDYTKIRGKNRITHCDYLDNITFMGNITGSSRPIIIQVFNVLNEEGLSIDVVDKDNAVAELKFHGSYDPDNMDNPPFAIYYPKLDINDTCELPKPPPEEK >NC_004557|1663820:1682779|1682209_1682779_-|WP_035125138.1|DBSCAN-SWA MLIGYCRVSTEEQSLNRQIDALVKYGVDERSIYKEKITGTKRDREQLNKMLNELKENDIVIVADLTRISRSTKDLLEIVEKIKDKGASIKSIKDTWLDTTTDNPYNSFLLTVMAGLSQLERDLISQRTKEGLASAKARGREGGRPNKRNKKADVVELMLREGYKIKDIVDKTGLGRTTIYRIKRDLEGR >NC_004557|1663820:1682779|1671111_1673850_-|WP_011099754.1|tail|DBSCAN-SWA MRWRAIKLFELFCSIGLDDKVTKPLEDVGKKGEEVAKGLGNKFGTLGKVVAGAFTVKAMVNFEKKIIGVYSTYDDQMRKIQAVTGATGEEYAKLQSKAEALGSSTRFSATEAGEGMEALARAGWKTDEVLAGIGETLTFATANSIDLGNASSIVANGLSMFGMKAEEAGRFTDVLSATASASNTDIGLLGETLKYSGALAGGLGYKIEDLSVAMGLMANKSIVGSQAGTTLKNAILRLNNPVGQGAKDIKKLGISMTDASGKVKPFNELLVHLRDKFSTLGDAQKAQYISTIFGTEAVAGMTAVLNASQEKFDDLTESINNSNGITQKMADTMDGGLGGALASVSSAWEGLLIKLGKMQDGLLVDTFNGLAKVLQGIPNEIEYISNKIQELTQFYRDHKAIIDTLAIGLGTMATGFLLVSSKAKIAEKGVMAFMKGTKLLKGIPSIMSGVGKAFTFLLSPAGMVTVAIAGIAMGAFLIYKNWDRISAWFANEWEYISNLFTNGYNSFIQWCTNIGTAISEWFSNLMESIANFLAPIGEFIMNIFKGFLLGMRFLIESAMTILIAIIQVPLNIIKGIISAIWNGISSGISGAFNWIKGIFSSIFGWITNIISEQVDRQDSSWNKIKNIIVTVWNFISGFITTIVTGIKNVITTVWESIKLVTTTVFNAIKGVVTTVWNAIKGVITGVVNGVKAIITTTWNVIKNVTSTVFNAIKGVVTNIWNGIKTVISNVVNGVKEKVLGVWNGIKETTGNIFTSLATKVGTIWNNIKEKITKPMTVAKDAVKTALDKIFGFFKNLKLPEIKIPRPKLPVFTLEGKFSLKEMTVPKLGVKWMHEGGIFTKPTVFPNGIGVGDRNNGRGSAMEAVLPIEKLKGMIREEMENMKQDVIINVDGRVIAQATVPFLGTEMEYRRRR >NC_004557|1663820:1682779|1680350_1682081_-|WP_035125243.1|DBSCAN-SWA MIYYDNKDFTQAEYNTYILYKYLSKHYSKDIATKIIKQNQKKLDQICIALGEKDIAFFCLYFLQDIFVVKDTNEARNLSKDHYHLWDIANKTFTDDIFDKVNIICPRGFAKTTIFDLAISIWLICYRKSKFTLIGAKKDDDATQFVDSIKKVFKENKFIIDNFGELINNRKFKVNANEIEFTNGMYIRAVGSSSSVRGANFKGIRPTVVIADDYQDEKDILTEDAREKKYDKWCKEIEKVGDTAVYRNGKKVKSATKIISIGTVLHSQCLISRLAQNKDYYTYLRQAIILEKGQTVEDILESDLWLECKKIYFNDKIDNPKEEAKEFYKQNYEEMKFHVLWEEKWDCFEDIAIAYWENRQAFMSEMMNNATSIGEKWFKSNVVQTTEEIEDHNFIKTILTVDPAGIKNKKRGDYFAFVVESLADNDFKYVRKGEILHFVSFEEYIKHICNTLKKFEDITHIAIEKNTYMGLDLDKLKEFIASDEELCNRQFEFINELQRKNKDEKISTIIDSINNGRIIFDKERVMPEALNQLMEFQGQLYTPHDDFVDCVAEASNRLDTIEVVSNLQILDRRLFF >NC_004557|1663820:1682779|1674180_1674483_-|WP_035125146.1|DBSCAN-SWA MDIKGKHLFALARIGKNIDMEELIKVGKDKTMDAQAKGIVIIKAVMELMGEYEKELTELIADIKGVKAKEIAEQPIEDLVKDIEEIINNEQIQGFMKTTK >NC_004557|1663820:1682779|1676588_1677404_-|WP_011099761.1|DBSCAN-SWA MAIENFKKTLWEGALLHNFHNVSIADVITTKPTNKDGNKIIFNRVGAGSVKDYTGSISWDEISTTPIEMTFTQKKYFAFALEDVDKVQLVADIMKSTTEEHSSVLAETVDNYILDKAVKGVKTGNKIGSSTTKKNVNKTNIYDYIVDLGTILSKNKAPKTDRFVVVNSEILGLLSKDDRFTRNPEVLANGVVANAKINGMTVVETEELPVNTVLALHKSSLGYDKQISEIEAMRLQNAFSDGIRGLAVYDGVVLREDAMAVLYYTLDAPSA >NC_004557|1663820:1682779|1665119_1665353_-|WP_035125153.1|DBSCAN-SWA MNSETMEIKINEHTEIIKEHDKRLDKIEQDGREFKIEIKNLCKDIKGLTTTMRWFMGLIIGSFVSFFFYVIQHNILK >NC_004557|1663820:1682779|1677493_1678117_-|WP_011099762.1|DBSCAN-SWA MLKKDLLELIKDIDDNESIDEILKGTDFAKSMLSIDNFKNLVATNKDFKGFLDSEKDKHYGKALETWKQNNLEKIINEEIRKRNPARDEKDIALEELQRKIEAMEEEKQYEKLKNIALKQATEKKLPTEIVDYFIGKDEEVTLNNLNKFEDVFNKQLETVVQERLKGSSYTPPKNNGNNIDTYDNLLKNADNMTPEQIAEQFSKLDK >NC_004557|1663820:1682779|1675873_1676203_-|WP_011099759.1|DBSCAN-SWA MFYKHTMSLWNKGKKYLDKIKQVHYKETELVTNFKCDIQPLTQEKALGIYGITQACSHVIYMDLVPFLIVNEQVLKRNGNTYEIVKVVEWEEEEDPDFNYKEVVINEIH >NC_004557|1663820:1682779|1679063_1680335_-|WP_011099764.1|portal|DBSCAN-SWA MDIELLKKCYEDYKLNKTTYNKMYEYYKGNTDAIKNYKMVTERSNNKINVNYIKKLINEEVSYSVGNDVTYISKSGNENIVNDIDYYLDHWSEGHDSSLAKNMLIYSLAYELYYVDREGQFSSKVINPENGYAVIDDFGNIQYFLHMFKLKFKDEIHIDVYNSNLIYHFDEEFKEINKPTPHIFGQVPVGLAQLSEEGKDNTIYKDIKGLQDAYETNLSDISNEISDFRNAYLVLTGVQIPEEDIPKMKELGVLQAKDKNSKIEWLIKNINDTFIQNTLETIEDKIYQITSHINHNEKMQSNTSSLALRSRLISLEEKCKLNQKSIADCIKTRLKFLFVYLRVLKNIKYDFRDIKIKFTPNIPQDDLMIAQVIAQLGDKLSTETALSLLSFIENPKNELEKLKAESEENNIGENLLEEADIYE >NC_004557|1663820:1682779|1663820_1664825_-|WP_011099748.1|DBSCAN-SWA MSIKIAIRRGHQRTGKDLGAEGFVKEIEAVEQYMPYIIESLKSKGYEVLDVTPPEANRSLADSLSYSINKAREFGADLFISCHANAFKITDSAMGGEVIFPPGDSVSRNYALDVCNVLKKNGFINRCAKPDTRGLAEMNGKVGCPAIILEPFFIDSKADVNNFNWVGGVKLGREIGKAINVIPNPYKPASKASISVDGGGTILNNKSAINLIIRDFSSDIDRIFGYIDNDKKASYAFDINPLNDNYVKLEKNCSKAIYKRNEGYTFSPNTNYKVRVEGYKNGKVVTKNEIVLKTLDIEDDKLYRVQVGAFKDKDNADKLRNELKNKGFNGFIKE >NC_004557|1663820:1682779|1676217_1676535_-|WP_011099760.1|head,tail|DBSCAN-SWA MLKDIKILLNINDNSKDSILNIYIRKATTLIQNYLNNYNFKSDYIQANFKDAIIDIVVSAYNFKDNKNIKSKTQGARSITYADNTAFTITDSVANLLPMPYVKMY >NC_004557|1663820:1682779|1678198_1679071_-|WP_011099763.1|capsid|DBSCAN-SWA MNKLYKDLYLQFEQESYNLADKKSKDIYKQQKKDKEELLNKIARVILTYTVVEEVLSIGDKDKKTLAKEFNSIISGICTDQYKIEKSITKDILSIVSKDKYCEDGYLLNIGIDFNLKKLTDKQIEKIVNDKIKDEIWSNRLWKNKKKLEKELKKEVGTFLKGKTNVNKIEKNIKNKFNQNAYNTKRLIETEVCRCQSTVNDVFAKEYGIEKQMFIATLDKRTSNICRSYDGQEFNVDDTKKPIPPLHPLCRSVLINIPSDNWKPKVRKDNITKEYVPFINYNEWAKKQDI >NC_004557|1663820:1682779|1675560_1675887_-|WP_035125141.1|DBSCAN-SWA MKYINNIKQVKQAINKAMEGSLNELGITATTNTQAVTPVNTGKLRRSITHKVDSQDLKVYIGTNVPYAISVHEGTSRQKSNPFMSLGIKQTQSSVVNIIKKNFSNVSK >NC_004557|1663820:1682779|1665625_1666054_-|WP_011099750.1|DBSCAN-SWA MAKKGIIFLLFVFIMSFIGCGSVATSNIPKDMDEEIYKRSVEMLAYTERCFKNKTEANGDKYEEYSEDVTDICEIQNREVTDKERDTVGYIALAGLKLLYYQTENKQDALDYYKEHIKEAKEALDIPVDKDSNEVWDKIKNK |
22 | Clostridium_phage(50.0%) | capsid,holin,plate,head,tail,portal | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2053688 : 2060217
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_004557|2053688:2060217|DBSCAN-SWA TTTATTCATCAACCTCTATAAATTCAAACACTTCTTCTACTGATTTTCCAAAAACTTTGGCAATATCCATACCCAGTTTTAAAGAGGGATTATATCGTTCATTCTCTAAGTGTACTATAGTTTCCCTTCTTACACCTACTAATTTTGCCAATTCTTCTTGTTTCATATTAAACTCTCTGCGTAATTCACGAATTTTAGTCTTTAACTTTTGCACCCTAAGCACCAACCTTTTCATAAACAACAAAGAAAATGAATTCAAGAAAATTAACTCCACCTAATACAAACGGTAATATTATTTTTAAATTAATTACCCATTCACCTTTAAAAATAGCTATTAATGTACAAATAGAAATTCCTAATGATAGAAACCCATATACTGTTGATTTAGCTCTATTCTTATTATACTCTGCCATTTCATCTTCCGGCTCTGATTTTATAAAATATGGGATAAACATTGAAACTGAACCAACTAATAAAATAATAGCTGCTATTATTAATAAAATTTTATTGGCTTTAAAAACCTGTATAAACCCCATGCCAATCCAAACCACTCCAAAGAATCCATTATTCAACATTGATTTTTCTCTTAAAGTCAGTACCTTTTTCATGTAACCACTCCCTAGTATATAATTCTAATATTTGTTATGTATTTATAACTTATGTTAGAATTATATAACATTTGTATTTATTTATCAATAAAAATGTTATATATACTTCACATAATGTAAATATATTACTGAAAATAGCATTCCTTCATCATACATTCTTGCCACCACCCTTAAATAGCAATAATTCCATTTGGTCATCTTCCTTATCATTAATTACAGCATAAGAAAAACCACCCTTTAGATGGCTTAAATTCTACTTTTCTATTGCTGTGTATCTTGGATAATCATAACTTTCTGAATCTACTAAAATGCTTTTCTTAGTTTTTGTATTTTTAACTCTTATGCATCGTAATTCTCCTTTTGAATTGCATCCCCCATCTTCTTTTGTTATCCAAGGTTGATCTCTGCAAAAGTCCTTTGCAAATTTTTTAAATTCCTCATCACTTAGTTTAACTTCTTTTGTAACTTCGTAAAGTTCTCCTTTCATGCCATCCTTTTTAGCTTCTTTTGTAAGGTATTTAAGTTCCTTTAAATTTAAAACCTTTCTTCCAAATAATGCTTTCATTTTTTTAACCTCCATGTGTGTTTTCGTTACATACATATATCACTCTAAAGCACACATATATCAAGGGTTTTATTCGCTTTTTTTAATATCTTTATAAGAAATTTTTTCACCATTTCTTATAAGAAATACCGCCTCATCACTATCTACTTGTCCTATATATCTCTTAACAATAACATCAGCATACTTTTCATCAAGTTCTATCGTATAGCAAATTCTATCTGTCTGCTCACAAGCTATAAGAGTTGAACCACTTCCTCCAAAAGGATCTAGAACAATAGAATTTGTTAAACTTGAATTTGTAATTGGGTAAGCAACTAAAGCTATTGGCTTCATTGTTGGATGATATTTTGATTTTGTAGGTCTATCAAAATTCCAAGTGGTCCTTTGCTTTCTGTCTCCGTAAAATTTATGTCCTGCAGTAGGTTTCCATCCTACAAGTATAGGCTCATGATTATATTGATAATCACAACGCCCTAGTACTGGAGAATTCTTTATCCATATACAAGTTTGGTGACAAAAGAATCCTGCATCTTTAAAAGCTACTCTAAAATTAACTGTTTCTTTATCGGCATGAAACACATAAATAGAACCACCATCTGCAAGACTTTCATACATTCCTTTATAGGAATTTAAAAGAAACTCATAAAACTTTTTATCTCCCATGTTATCATTTTGAATTTTTCCTGCTGTTCCTTCATAGGAGACATTGTACGGAGGGTCTGTTACAACTAAATTTGCTTTCTTTCCTTCCATAAGTTTTTCATATGTTTCAAATTTTGTGCTATCACCGCATATCAAGCGATGTCTTCCAAGTAACCAAACATCACCATTCTTTATGATTGGAACTTCTGGTGGTGCTTCATCAAAACCATCCTCTTTCACACCTTTAGGATGAAGTTCATTAAAAAGTTCATCAATTTCTGGCGGCTCAAAACCTGTAAACTCTACATCGTAATCAATTTCCTGTAGATCCTTTATTAAATCAGCAAGTAATTCTTTGTTCCATTCACCAGTAATTTTATTAAGAGCAATATTTAATGCCTTTTCTTTGGTTTTATCAATATCTATTACAATGCAATCTATATCTTCATAACCTAATGTTTTTAAAACTGAAACCCTTTGATACCCTCCTATAATTGTCATATCAGAATTTACAATAATAGGCTCAACATATCCAAATTCATTTATGCTGCCTTTAATTTTCTCAAATTCTTTATCTCCTGGTTTTAACTTTTTTCTAGGATTATATGCAGCAGGAACTAAATCTGAAATTTTTAATTTCTTAAATTCCATTTTCCTCATCCCTCCAAAATCTATCTCTAATGTAGCAGTTGTGGCTACAATATTTTCTATTTTTATTTCCATAAGCACTAAATTCTTTTCCACAGTAAGCACAGGTATATTTATAAAATGCTGTCTCTTTTCTATTTCTTTTATCTTGATTTTCATTCCACCATTTTCTTCTACACTCATCAGAGCAAAATCTTCTAGTTCTGCCTTTACCATTTTGCTTTAAGGGTTTATTACAATAAGCGCAAAGTAAATTTCTTTTCATTTTTTCTTCAAAATTTAGAGCAACAACACATGAATTTCCTTCTAACCCATTACACTTACAAAACCCTCTAACACTATCTCTAGTTAGATTTAAAAAGTTAGCAATTTTTTTATATCCCATACCTTTTAATCTTAGTTCTCTTATTTTTTCCTTTTCATCACTTGTCATATGAGTTGCTCCTTTCATTAAACTTTTACGCAATAAAAAAAGCACCCACAACGCTTGTAGCTACTTAATTTAAACTTTATATTATGCGATTTTTGCCTATCCCCCTTGTTTAATTCTGCGAAAATTCACAGGAGAGGGGGCGGCGGTCCTTGGCATGTCCTCTTTAGAGGTTTGCCCCCCCTCCCCATGAAAATTTTATTAAACAATTTCTATTAACTATTATTACTACAACTAATAATCCAATCTATAATATAGTCATCTTCATACTTACCTGATGCTATGCCCAATCCAATATCTACAATATCTTCATCACTACATTCCAATTGAATTCCGTTAATTTCTAAAAACACCATCATTGCTAGCATTCCTATTCTTTTGTTTCCATCTAAGAAGGGATGGTTCTTAATAATACTAAATCCTAATCTTGCTGCTTTAGCTTGTATAGATGGATATAGTTCTTCACCTGCAAATGACTGAAATGGAGAATTTAAAGCTGAATCTAACAATCCTTCATCTCTTATCCCATCTAATCCGCCGGTCTTTTTAACTGCCATAGAATGTAGATACATCATTTGTTCCTTACTTAGGTGTTTCATTTAGCCAACTCCTCAAATGCTTTTAGATGTTTATTTAAAATATTATTTGCAGCTTCTTCTACTGTAGCATCATCTGCAATAGTATCTTGCTGAAATTTACTATAGTCTATTAAGACATATCTTGGTGCATTATTCTTTAATATAATAGCTGCACCATTTTCATCTACCATTCTTGCAACTCTAGAAAAATTTTGATTAGCCTCTGATATGGATACTAAATTATTAATATTTACTTGCATAAAAAGCACCTCCTCTATAGTTATATTATACACAATTTTAGGATAAATTAAACCTAAAATTATTTTTTATTATATTTAAATTCTTGATACCTATCTTCTGTCATTGTCTTCTTATCATGACAGCTCTTACAAAGAGGTTGCCAATTGCTCTCATCCCAAAACAATTTATAATCTCCTCTATGAGGAATAATATGATCAACTACTGTAGCATTTGTTATCCTTCCTTGTTTTTTACACTCAACACATAGAGGATTATTTTTTAGAAATAGTAGTAAGTTAAGTGATCTTACTTATGTTAATGTTGCAGGTAAATGGCATTATATATGTTTATTAATTAACCTCTTTAATAGAGAAATCGTAGGTTATTCAGCAGATCCTAAAAAGGATGCTGAATTAGTTTATGAAGCATTTATGAGCACTAATATAAATTTAACAAAAGTTGAAATATTCCACACTGACAGGGGTAATGAATTTAAAAATAAGGTTATAGATGAAGTTCTAAAAGTTTTTGAAATAAAACGTTCCTTAAGTAAAAAAGGTTATCCATACGATACTGCAGTTGCTGAAGCTGGATACAAAATTATAAAAACAGAATTCGTATTTAATAGAATTTTTAAAAATTTAGAAGAACTAAAAAGAGAGTTGAAAAATTACGTATTGTGGTATAACTATAGACGTATTCTTAGTGCTCTTAATTATATGGCACCAATAGAATATCGTCTAGCAAAAGTGACCGAATAATATTTATATTAAAAAGCGTTGACAATCCAAGTCAAGCTGTTGCTGATAATGTTTGCTTATCAAAAGATGATGCTATAAATATTATTCTTTGTTTAGAATTAAAAGTTTTTATTGCATTATTCTTTTTAAGAAAATCATTTAAAATTCTTGATGCTTTTTCATAAGCAGTTAACTCTCCAAGCTCAAACTCATTTTTATATTTTTCAATGTATGATTCAAATATTTTATCAACTAAATCATCAGTAGTTTTGATTTTAGCGTAACTTGTAGCATACCTTATAGCTTGAAATTCAAAAGCCTCTTTCCTTTGTATAATACCTTCAACATCTCTTTTAATTTCAATAAGTACAAGATTACCATTTTCATCTACTGCTGTTAAATCACTTCTTCCATTTTCTTTATTTACAACTTGCTTGCCAACCACAAGTAAAGTTTCATCTTCAAATATAACTTCTATGTTCTTTTTTAAAAATTCTTCTATATGTTCTTCTCTTAGGTTTAAGGATTTAAAGTTTGTATTTTTTATTTCAATATCTTCAATTTCTATAGAATCTTTTATTTTTACTGGAATTAACATATGATTTACCTGCCTAATTGTTTTTTATATCTATGTTTATACCTATTTTATATTTTTAGCTTTCTTTTGTTCATAATTTGTTAACAACCCCCGGGGGTCTTTGCATGTTAAATATCTTGCTATAGAATTTTTAGAAAGTTTATATGTTTCTCCTATTTCTTCATGGGTTCTTAACTTTTTCCCCAATGGGGAAAAAGTTTCATCCTCCTTGATTTCATTGGATTTAGAAAGTCTTTCTATCTCATTTAATAGGTCATTTCTAATTCCTTGATTAGCCATAGCTTTATGTCTTGTATAAATAACAGCAGCTCTTTCTGAATGAAGTAAATCTGAAAATGACCTTTGCATTGTATTTGTTTCAGTAACAATTAAAGTAGCTTCATCTTCTGTAAGTCCTTCCTTAATAATTGCTGGCACTTTAAGTATTCCTACAAGTTTTGCAGCATTAACTCTATTATGACCAGATAGAATCTGATAGCTTGTACCTATAGTCCTTACAACTATAGGTACTATAACTCCAAACTCCTTAATACTTTCTACCATGTCATTTAGTCTTTCTCCTTCATATAGCTTAAAAGGATGATTACTAAAAGACACCAGTTTTTCTATTTCAATTTCTGTTATTCCTGATTTTGTTTCTTTTTTAGCTTGAGTATATCCAAACATATCATCTAAACTCATTAATCTTTTTTCACTCATAGTGCTGCTACCTCCTTTGCAAATTCCATATAAGCTATGGATACTTTATTCTTAGGATCATATTCTATGGTGCTTTTACCTCTCATATTTGCTTCTCCAACCTTTACCGATGTTGGTATCTTGCTTTCAAAGATGTGAATATGACTTCCATAAGCTTCTTGAATAATTGATAGCACTTCCTTTGAAAGTTTCATTCTTTCTGCGTACATTGTAAGCAGTATGCCATCAACTTCAATACTAGGATTTATTCTCTTTTTAACTCTTATAATATTCCTAAGAAGTAGCTCCAAGCCTTTAGCTGAAAGATATTGAGGAGTAACAGGAATAATTACACTATCACAAGCAGCTAGTGCATTTATAGTAAGCATTCCAAGTGATGGTGAACAATCAATAATAACGTAGTCATAATCAGTTTTTACTTTATCAACTATAGATTTTAAAACCAGTTCCCTGCTCATAACATTAACAAGTGCTATTTCTACTGCTGATAATTCTAAGCTACAAGGAATAATATCAATATTTCCTGCTGAAAGAATATATTCTTCTTTTTTAGGTAAACTTTTCTCTTCAATAGCCAGTGCCATAAGATTATATATTGTTGTTTTTATACTATCCGTATTATCATAGCCAAAGCATACTGTTAAACTACTCTGAGGATCAAAATCTATAAGTAGTACCTTCTTTCCCATTTCTGCTAATGCATATCCTAAATTTAAAGTTGTTACAGTTTTAGCTACTCCTCCTTTCTGATTAACAATAGAAATAACTTTATTTTTACACAT
Protein sequences of DBSCAN-SWA_5 >NC_004557|2053688:2060217|2057634_2058159_+|WP_078688053.1|integrase,transposase|DBSCAN-SWA MLPCFLHSTHRGLFFRNSSKLSDLTYVNVAGKWHYICLLINLFNREIVGYSADPKKDAELVYEAFMSTNINLTKVEIFHTDRGNEFKNKVIDEVLKVFEIKRSLSKKGYPYDTAVAEAGYKIIKTEFVFNRIFKNLEELKRELKNYVLWYNYRRILSALNYMAPIEYRLAKVTE >NC_004557|2053688:2060217|2054929_2056153_-|WP_035109258.1|DBSCAN-SWA MEFKKLKISDLVPAAYNPRKKLKPGDKEFEKIKGSINEFGYVEPIIVNSDMTIIGGYQRVSVLKTLGYEDIDCIVIDIDKTKEKALNIALNKITGEWNKELLADLIKDLQEIDYDVEFTGFEPPEIDELFNELHPKGVKEDGFDEAPPEVPIIKNGDVWLLGRHRLICGDSTKFETYEKLMEGKKANLVVTDPPYNVSYEGTAGKIQNDNMGDKKFYEFLLNSYKGMYESLADGGSIYVFHADKETVNFRVAFKDAGFFCHQTCIWIKNSPVLGRCDYQYNHEPILVGWKPTAGHKFYGDRKQRTTWNFDRPTKSKYHPTMKPIALVAYPITNSSLTNSIVLDPFGGSGSTLIACEQTDRICYTIELDEKYADVIVKRYIGQVDSDEAVFLIRNGEKISYKDIKKSE >NC_004557|2053688:2060217|2053688_2053901_-|WP_035109261.1|DBSCAN-SWA MQKLKTKIRELRREFNMKQEELAKLVGVRRETIVHLENERYNPSLKLGMDIAKVFGKSVEEVFEFIEVDE >NC_004557|2053688:2060217|2058778_2059435_-|WP_011100106.1|DBSCAN-SWA MSEKRLMSLDDMFGYTQAKKETKSGITEIEIEKLVSFSNHPFKLYEGERLNDMVESIKEFGVIVPIVVRTIGTSYQILSGHNRVNAAKLVGILKVPAIIKEGLTEDEATLIVTETNTMQRSFSDLLHSERAAVIYTRHKAMANQGIRNDLLNEIERLSKSNEIKEDETFSPLGKKLRTHEEIGETYKLSKNSIARYLTCKDPRGLLTNYEQKKAKNIK >NC_004557|2053688:2060217|2053902_2054295_-|WP_011100099.1|DBSCAN-SWA MKKVLTLREKSMLNNGFFGVVWIGMGFIQVFKANKILLIIAAIILLVGSVSMFIPYFIKSEPEDEMAEYNKNRAKSTVYGFLSLGISICTLIAIFKGEWVINLKIILPFVLGGVNFLEFIFFVVYEKVGA >NC_004557|2053688:2060217|2058190_2058736_-|WP_011100105.1|DBSCAN-SWA MLIPVKIKDSIEIEDIEIKNTNFKSLNLREEHIEEFLKKNIEVIFEDETLLVVGKQVVNKENGRSDLTAVDENGNLVLIEIKRDVEGIIQRKEAFEFQAIRYATSYAKIKTTDDLVDKIFESYIEKYKNEFELGELTAYEKASRILNDFLKKNNAIKTFNSKQRIIFIASSFDKQTLSATA >NC_004557|2053688:2060217|2056795_2057179_-|WP_011100103.1|DBSCAN-SWA MKHLSKEQMMYLHSMAVKKTGGLDGIRDEGLLDSALNSPFQSFAGEELYPSIQAKAARLGFSIIKNHPFLDGNKRIGMLAMMVFLEINGIQLECSDEDIVDIGLGIASGKYEDDYIIDWIISCSNNS >NC_004557|2053688:2060217|2056142_2056583_-|WP_035109255.1|DBSCAN-SWA MTSDEKEKIRELRLKGMGYKKIANFLNLTRDSVRGFCKCNGLEGNSCVVALNFEEKMKRNLLCAYCNKPLKQNGKGRTRRFCSDECRRKWWNENQDKRNRKETAFYKYTCAYCGKEFSAYGNKNRKYCSHNCYIRDRFWRDEENGI >NC_004557|2053688:2060217|2054548_2054860_-|WP_035109259.1|DBSCAN-SWA MKALFGRKVLNLKELKYLTKEAKKDGMKGELYEVTKEVKLSDEEFKKFAKDFCRDQPWITKEDGGCNSKGELRCIRVKNTKTKKSILVDSESYDYPRYTAIEK >NC_004557|2053688:2060217|2059431_2060217_-|WP_035109250.1|DBSCAN-SWA MCKNKVISIVNQKGGVAKTVTTLNLGYALAEMGKKVLLIDFDPQSSLTVCFGYDNTDSIKTTIYNLMALAIEEKSLPKKEEYILSAGNIDIIPCSLELSAVEIALVNVMSRELVLKSIVDKVKTDYDYVIIDCSPSLGMLTINALAACDSVIIPVTPQYLSAKGLELLLRNIIRVKKRINPSIEVDGILLTMYAERMKLSKEVLSIIQEAYGSHIHIFESKIPTSVKVGEANMRGKSTIEYDPKNKVSIAYMEFAKEVAAL >NC_004557|2053688:2060217|2057175_2057418_-|WP_035109252.1|DBSCAN-SWA MQVNINNLVSISEANQNFSRVARMVDENGAAIILKNNAPRYVLIDYSKFQQDTIADDATVEEAANNILNKHLKAFEELAK |
11 | Streptococcus_phage(44.44%) | integrase,transposase | attL 2049646:2049662|attR 2064981:2064997 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2237703 : 2286330
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NC_004557|2237703:2286330|DBSCAN-SWA ATCACTTAGCATCATACCAATTTTCTCCTTCATTTATATCAACTTCAAGTGGTACACTAAGTGGTAATACATTTTCCATAGAGTCTTTGACTATACTTTTAATTTCTTCTAATTCATCTTTATAAACATTTAATATAAGCTCATCGTGAACTTGTAATATTATTTTACTTTTTAAATCTCTTTCTTTAAGTACATTAAAGACTTTTACCATAGCCATTTTTATTATATCAGCGGCACTTCCTTGAATAGGTGTATTCATGGCTAATCTTTCCCCTAAAGCCTTTACTATTTTGTTTTTATTCCCTATTTCAGGTATGAATCTTCTTCTGTTTAAAATTGTTGTAACATACATATCTCTTTTAGCTTTATCTATAGTTTCATCCATATATTTTTTAACATTAGGATATCTTTCAAAATAAGTATCTATATACCCCTTTGCTTCTTTTCTAGATATTTTTAAGTCTTTAGCTAAACTAAAATCTCCTATTCCATAAACTATTCCAAAGTTTACTGCCTTAGCATTTCCTCTCATGTTAACTGTTACTTCATCTATTGGTACTTTAAAAACTTCTGATGCAGTTTTTGTATGTATATCTGCATGATGCTTAAATGCATTTATTAAGTTTTCATCATCTGCTATGTGTGCAAGAACTCTTAATTCTATTTGAGAATAATCCGCTGATAGTATAATACTATTTTCATAATTTGGTACAAATACCTTTCTTACTTCTCTTCCCATTTCATATTTTATAGGTATATTTTGAAGATTAGGCTCTGTGCTTGAAAGTCTCCCTGTAGTTGTTACAGTTTGATTAAAGCTTGAATGTATTCTTCCATCTTCATCTATAACATTTTTTAATCCTTCTACATAAGTTGAATATAGTTTTGTAAGTTGTCTATAGTAACCTATTTTACTTATAATAGGATGTTTATCCTCTAAGACTTCTAAGACTTGTGCATTAGTGGAATATCCTGTTTTTGTCTTCTTTATAACAGGTAAATCTAATTTTTCAAATAAAATTTTTCCTAATTGTTTTGGTGAATTTACATTAAATTCTTCATCAGCTAGTTCATATATTTCCTTTTGAACTTTTTCTATCTCCCTCTGAAGCTTTTTGCCTATATTTTCTAGCATTTCTCCATCTACTTTAAAACCTTCTAATTCCATATTGGCTAAAACAAAAGCTAGCGGTAGCTCTACTTCATAATAAAGTTTTTCCATTTCCAAATTAATAATTTGCTCTTTTAAGTTTGCACATAATTTTTCTAACAATGAAGTTTCTCTTATTTCTCTTTCTTCATCATCTTTTACTATTTTACTTAAACATTCTTGTATTAATGTGGACAAGTCATAATCTGACTTTGAAGAATCTATAAGATATGCTGCTATTTTCAAATCAAATAAAGCTTTAGTAAATTCAACATCTAATTTTTTAAGAGCCATAGACAGAACTTTTACATCATGACTTACTACTCTTACGGCTTCTTTTTCAAAAATAGTTTTTAATATTTCTATGGTTTTATCTTTATTTTCTTCAATTCTTTCTTCTATATTAACTACATAGCACTTATTTTTAAATAATATATATAGTTTTTCTAAAGATATTTTAGAATATAAACTTGTATCAGTGCTTTTAAAATTTACATATATTTTTTCATCTTTATTTCCTATTTCAGTACACAATTTAATTAAATCTTCTATAGTATGAATTTCTTCATATACTAGCTCTTCTTCCTCTTCTACTTCTTTATGATTTTCACCTTTAGGTATTTTATCTATTAAAGACTTAAATTGAAGTTTTATAAATATTTCTCTTACTGCTTCTATATCATATTCTTCTTTAGATTTAATTTTTTCCAAATCTATGTCTATTGGAACATTTCTCATAATAGTAGCTAGTTTTTTACTAAATATAGCCTGTTCACTATTTTCTATTAAATTTTCTTTTAATTTTTTACCACTTAAATTTTCTATATTATTTAATACATTTTCTACGCTACTATATTCTTTTATGAGCTTTAGTGCAGTTTTTTCTCCTATACCAGGAACTCCAGGTATATTATCAGATGGATCTCCCATAAGTCCTTTTACATCTATAAACTCAGTGGGAGTAACACCCATTTCATCTACTACCTTTTGTCTGTCATATATTTCTCTTTCAGTTATACCTCTCTTTGTAAATAATATTTTTGTAGTATCTGATGCCAATTGAAGAGCATCTTTATCTCCAGTTACTATGTAAACTTCTATATCTTTTTCTTCTGCAAATTTAGATAAAGTACCTATTAAATCATCAGCTTCAAAGCCTTCTATTTCAAAAATTTCTATTGCCAATTTAGAAAGCAAACCCTTTACTATAGGAAACTGCTCCCTTAGTTCCTCTGGCATTTTCTTTCTTCCTGCTTTATAGTCTTTATAAGCCTGATGTCTAAATGTAGGTGCACTTTTATCAAAGGTACACACTATATAATCTGGTTTTATATCATCTTTAATTTTTAAAAGCATGTTTGTAAATCCATATACTGCATTTGTATGAACTCCCTCATGGTTAGTTAATGGTGGTAAAGCATAAAAAGCTCTATTCATTAAACTATTACCATCTAATATTACTAATTTATCCCTTTTAGGCATTAATATCTCTCCTATCAAAATCTATCTTAAATATATGGTTCCTCTATAATGTGATATATTTCTCTTTAACTCATAATATTCGTCTGAAGGAAATTTATTTAATATTTCCTTTAATATTTCAAAAGCAACTATATTTTCTAAAACAATAGATGCTGGCACAACAGCAGATACATCGGATCTTTCGTATCTTGTAGTAGTTTCTTTTCTATTTCTTAAATTTACAGTTCTAATATTTTTTTTAATAGAAGGTATAGGTTTTATATACACTTTCATTTCTATATTTTCCCCATTTGAAACTCCTGACTCTATGCCACCACAATTATTTGTAGGTCTTTTAAATTTTCCTTTTTCATATAAAATTTCGTCATTAAAAGTGCTTCCTCTTAAATTTAAATTCATACCATTTCCAAATTCTATAGCTTTAACTCCTTGTAGTGACATGATTGCATAACTAAGAAGAGCATCTAACTTTCTATCCCATTGAGAATAGCTGCCAAGTCCTATCGGCACACCTCTTACAGATAAAAATACTGTTCCTCCTATAGTTTCTCCTTGTTCTCTGCAAATATCTATTTTTTTTACAAAAGATTTTTCTACCTCTTCATTATAGCATCTAAGTAAACTGTTATCTATTTTTTTGTATTTGCATTGGTCAAATAAATCTACCTTTTCATCAAAAAGATTTCCTATACTATAAACCTTACTTCTCACTTCTATACCAATACCTTTTAGAATTTGTTTGCAAAGTGCTCCAACGGCAGTTCTTATACTGGTTTCTCTTGCAGATGTTCTTTCTATGCTATCTCTAATATCTCCTGTACCATATTTAAAATATCCTACTAAATCTCCATGTCCTGGTCTTGGAATGGTTATTTTTTCATCTTTTTTTACTTCTCCTGATAATATTTTTTCCCAATTAGGGGAATCATTATTATATATAGCCATGGTAATTGGATTTCCTGTTGTCATAGTTCCTCTTAATCCTGATAAAAATTGTATTTTATCTTTTTCTATCTTCATTCTGCCTCCTCTACCATAGCACTTTTGTCTTCTTTTTAATTCATTATTTATAAAATCTATATCCACTTTAAAGTTAGATGGAATTCCATCTATTATAGACATCATAGCTTTACCATGGGATTCTCCAGCGTCTAAAAATCTAAGCATTGTGTATCTCTCCATTCAAATAAATATATTAACTAATCTTATCTAAGTACTATTTTAATATATATTTTGCCTTTATTCATTCATAAATTACTTTTTTTTTCAAATTTCATTTTATTTTCTATAGAACCTCATTAGTTTATTTTATTTAAATTATGCCTTTCTACCTTTAATTTATATTTATTTTTTATTATATATACATATTGTAATTTACATTTTATTGAATTAAATTTATATTATTAAATTTTTTGGAGGTGTAATATGCATAAATTAGAATACGAATTTGTAAACAATACTCCTATTTTTAAATGCAATTATTGTGGACATTGCAGCAAAGAAATAGAGGCAACATCATTTACTTCAGTAAAAAATAGAGGTTGCTGTTGGTATTTTCCAAAGTATACCTTATTAAACATAAAAAATATTCTAAATATTGGTAAAGAAAATTTCATAATTTCTCTTCTAAACAATAAAAATTCAAATATATCCAGTTATTTCATAGAAGTTAAAGGTTCCTTTGAAGAAGAAGAATATTACAAATTTATGAGAGAAAATGAATATACTGAAAGTTCTTTTGACTATAAATTATTTTTTAGAAAATGTTCTTTTGTTACAGATAAAGGATGTTCTTTAGATTTTTCTCTAAGACCTCATCCTTGTAATCTGTATCTATGTAGAAATATTATTAACACCTGTGATAAAGACTACTCATCTTTTTCAAGAGAACGAAAGGATTATTTTTCATATTGTAATTATTACAATGAATACTTAAAATATGCCCTTATGGATAAAAATCTAGATTTAATTTCAGCTCCACTAGCTACTTTAGAATTTTTAAAAACCATCTCTATACCTAATTTTGAGCCTAGTGAAATTAAGTCTATAGTTTTCAATCCCTATGGAGATGTAGCAAGCTAAATATTATGAAAAATGAGAGGGCTTCCCTCTCATTTAAGTTAGCAATATTTTTTATCAAATTCTTCTAAGTTCTTTATCTCTTTTTTCTTTTCAACTATCATAGCATTTAATGCCTTGTATGTACTTGTATTCATATCCTCTTCTCGTTGAACAATTCTACCTTGAATATATATACAAAGTGCTAAAATATCATTTTGAAACTCTAGACAAAATTTGATAAGACCTTTAATATTAAAAATATCTGGTGCTAAAAGCTTAGTACTAAATTCATCTATTAAAAAAGATATTTTATCATATACAATAAAATCAATTTCTTCCATTTTACTTTTCAACATTTGCTCTTTTAATTCTTTATGAAATTCTATACTTCTATCTAAATTTCTTATCATTACCTTTATAAGTATATATATTCTCATATCATTTTTTGCCTTATCTAATCCAGCTCCACAAATATCTCTCTTTTTCTCCTCTACATAAATTATTTTATCTAATAAATCTAAAGCTGTATATGCCAATATTTTTCAACTCCCAAAAATATAATTATAATTTTAATGAAATAATTATATCCTAGAAACCCTTTATTTGTCCATTTATAAAATATACTTGACTATTATGGGTCTATATGATATTATTTTTTTGTAACTTGCCGAAGTGGTGGAACTGGCAGACGCGCTGGATTCAAAATCCAGTGGGGCTTAAACCTCGTGCGGGTTCGATTCCCGCCTTCGGCACCAGAACTGTTCGCACAAAAGTGTTGTCTTAAGCGAATATAAATAAACCCCAGGATTAAACCACCTGGGGTTTATTTATATTTTATATTACTTTTAAATGTTTTAATATTTTTATACTATCTCTTAAAGCCCTATCATAACATTCTTTCAATCTTAAACATATCAACTCATTTTGAATTTCCTTTAAACTGTTTAATATTACCCTATCTCCTTCTTGAAGCTTCTCATTTAGTTTTTTATACATATTATCATATTTTGCTTTTTCTATAATATAATCTTCACTTAAAAATTCTCTTTCTATAGCAGAATTAAAAAATAAATCCTCAAATACTTCTTCTATATAATCTTTTTCCATATAATTTTGTTCCATATGTATCTCTCCTTACAGCTAAATTATTATTTCCAATTAATTAGTATTATTATACCAAACGTACGTTCGTAATTCAAGTTACTCAATTTTGTTTTTACAGATAATTATCTTTAAATTTATAATAACACGGAATCGTATCGGGTGCCCCGATAGTTTCGACAATTCTGATAAAATTTACGACAAAAAACGACAAAGCACCCTACTTTTTAGTAGAATGCTTCTTTAATAAATCTTCTATTGCTTCATCTAATAATCGACTCATTGGGATCCTTGTTTTTTCACTAAGTTTTTTTAATTTGTCATATAGCTTTTTATCTATCGAGTTACTTATTCTTGTTCTATGTACTAACATAAATCCACCTCATTTAAATACATTTTATTTAAATTAAATTGTATAAGGTGCATAAAGTTAAAACAAGAAACATAAAGTGCAATAACCTACCATAAGTTACCTGTATACAAAGCATATATAGCCCGTTTATAATTAGATTCGGTGATAGAAGGTGGTAGCAATGTACAAACTAGCAATTAGAGAATATAGAATTCTAAATAAATTAACTCAAAAAGATTTAGCCTACAGAATTGGAATAAGCCAAAATTATCTAAGTGAAATCGAAAAAGGCAAGTATGATATTAGAGTATCTTTCTTACTTAGTATAAGCAATGCTCTAGATGTATGTCCTGGATATTTAATAAGATGTAATATGTGTAAATGTAGGAGAAATAGAAAAAGAAGATCTAAGGCTTAAACTTTAGATCTTCTTTTATTATAAAGATGTTTTGAATATAGGAAGTATTTATACATTATACTCTGTTTTTAAGTTTGTTACTGCTGCCTCTTCTTTAAATAAATCATTATGATATTTTTCATTACCTTCTATTAATTCTGTAGCAGTACAAACCTTAGATAATATTTTATGTGTATTCTTAAAATGATTAAATTGATTTTTAATAAAAGTTAGTTCCTTAGGATTGATCTTTCTTCCTTTGCACAATACTTTTGATTTTAAAATTTTCTCCTTTGGAATAACATATAAAGCATCTAATTTTATGAAAGAAGGTCTAACAAAAGGTGGTTTATATTTAGTAATTTCAATATTAGACTCCATTCCTAACTTCCACTCTTTGCCCTTTATACTAGAAACATTAAGCAACCAAAATTGTTGATCTTGATTATCTATTACTAAAAATGTCCTCTCTGCTTTGCATATTCCCCCATCTGCAAAATTAATTCTTAGTAAAAGACCATTGCCAGGTTGTATCATTTAATAGACAACTAATCCTTGTTCTTCATCTATATATAATGAATACGCTTCATCATAACTAGGGAATTCTTCCAACTGTTGCTTTATAGCTTCATTTATACATAGTTCTTCCGGATTATAATAAAATTCAGTATTATTAATTACAATCTTTTCTTCCTTAATATCATAACTTATTTCCAATGCATCAACCATCTTTTTTATCAATAATAAATCATCTTTATAAACATTTTTTATAACACCGATTGGCATTCTAGCTGATTCTTTATTGTATTTGTAACCTTTAGCCTCTTCAATAGATTTGTTATAATAATGTTTCCAACATTTATGTTGATGTGTTAATTCTGATAAATCTTCTGGACTTACATTTAAAAATAATTCCTTAGTTGATTCTAATATATCTATTTCATCTTCATTAAAGTTTTTATCATATTCTTTAGCTAACTTAACAAAATCATCATAATTGTTACAATAATCTTTTCTTATTTTTTCTACAACAACTCCATTTTCAAATGCATATAGCTCTTCCTCAAATAATGGTTTATTTCTTTTAATTAAACTTATCATTTGTGCAAAATATAATAGTTTATTTAACTTTAAATTTCCACTTTTATCGTTGGTAGGTTTAAAATCATTATTAATAAACCATCCCGCCACGCTTATCGCTTTGCACATTAAACTCCCCCCTTAAATTATGTATATAATTATAATATCATAATTTGTTTTATTAAATGTAATTATTTTATTAAATCACTTATATAATTATATATACAATAATGCCATATGTAAATGTTTACTACATATTTAGACATTTTACAACATAAAAAAGCCATGGACTAGGAATTAACCTAACCCATGGCTTTAAAATTACTTTAACGGTACCTTAAGAACTATTTGATTTTCTGATATGACCTGTCCACACTTATTATATCCTTTTATTTTTAGCTTATACAAAGAACCTTTAGAAAAAGTATAACCGCCATTTCTCTTGGTTATTACCTTGCTGGTGTTTTTAAATAACTTAGTGTAATTACTATTAGGTGGTGTTAAATCAAATGCCCAGCTTGCTCCTTTATCTTTATCTACCCAAGCGAAAACCCTTACTATATCTTTAGAACAATCTCTAATTATTAGATTTATTCCTGGAGCTCCGCCTTTATATGAAGCATATGCTCCGCCGTCTATATTAATGTAGGAATTATTTATATTATCATACTCATTTAAATTGTATTGCTTTATTAGATTAATTAACTGCTGTGCATATATAGGAGATGTTGCATATCCTGCTCTCTGTAAAGCATTAGCCTGTCCTACATAATCTTTTGCATTAAAAAAACCATGCTGTCTATATCTAGAATTATTTACTAGAAATAAAGCGTGATCCTCTATAGATTCAGAATAAGTATTATATACCCTAAAGTAATCATTTATTAAAACTTTTTTACCATTGTAATATTCGCATGTTGGGTAACTTTTCTTTTGTCCTCTCCACCCACCTATAGCTTTTACACCAAATAGATTTTTACACTCCCTTGATAAACTACTTTTACCCCAACCTGATTCAAGTATGGCTTGTGCAATAGTAATTGAAGCTAATACCCCATATTTCTTCTGCGAAGCTATAGCCCCATCTTTAACACTATTTATAAAGTTAGTTGTATAGCTCAACTATTTCTCCTCCTTACTTTTTACTGTAGTTTGTTTTACTAATTGATTTCCAAATACCGCAACACCTGCTGTAAGTATTCCCTGTATTACTGCATTTACATTAAGTCCAATTAAAGCTATAGAACCTAATATGCCTACAACTAATAATATCCATGGTATTGTCCAATCTTTAATCTGTTTAGTTTGTTTTAACATTAGTCCCAAAATATATAAAGTGGGTACCAATATAAATGCCTGTTCTGTAATAAATTCCATTATGTTTATTTCCATAAAGCATCCTCCTATTTAAAAATATTGTGTTGAATTGCATAAAAAAAGAAGCTTACAAAGCTCCCTATAATTAGCCCCATAAACCATCTCATAGTTGTGGTTAAACCTTTTATATCTTTACATAAATTTTTTATTTCTATTTTAAATTCTCTTCCGTCTTGCTCTATCTTATCTAGCCTTTTATCATGTTCTTTTAATGTTTCTGTATGTTCACTTATTTTTATTCCCATAGTTTCGCTATTCATTTATATTTATCACCTCATTTGTTAACTTTTTACAAAAAAGAATCTACTGATTAGTAGATTCTTTAAAATATCCTATTTATTTAAATAAAACTTTTCTTCGTGCATTGATCCAGTACCTCTAATTAGTTTACCATTTTTATCATATTTATTACAGCTGCTTCTTATTCGATCTTCCAATACTAAATTATCAAATGCTTTCCCTATTAAGTATTCTTGTGACTCATCTATAGATCCTCCGCCAACTGATAGCACTATATTTGCTTTACTTACACCATTTTTGTCATTAACTAACCCCATTGACTCCATATATTTTATTATTAAACCTTTAGCTTCTTCTATATTATTCATAAACAAATCCCTCCCTTTACTGTATTTTACCACAAAGTGAGAACAAGAAGGGCTTTTTTATTTACTTTCTATAACCCCAACATCTTTAAGCACCTCTCTAACAGCTTCATTAGTTATCTTTCCCACCAGTTTAAATCTTTTTTTATTGGAACCTTTGCCATCAATCTTACTTTATAAATATTATCTTCTAAATTTACAGAAGACCCATTTATAACTATTAGATCACCTGTTGTAGTAATAAGACCTCCAATAGTATTTATCTTATCCCTAACTATTGTGCTCCAAATCTGTTTAAAATCACTGTTATAATAATATAAAGTATTATCATAACTTCTTAAATATCCATTTTTTAATTGTGCAAAACCATCTGCATAATACTTATATACTATATTGTTCTTTGTTACAGTGCATAGTTTTTTAGTTTTTTGATCTATTAATAATACTTCATCTCTTAAAAAAGAATTTCTAATAAATTTAAGTTCTTTATAATCATCAAATGTTTCGATCGTATTAATATGCTGTGCTAAATTATTGTATACATATATTTTTTTCATGCTCGTATCGATTATATATATGTATGTATTGTCTGTTTCAATACAACAATGAGGATCTTCTGAATCAGATATCATTGAATACGCAGGCAATGTACTTTGTGCTTTTATATATTTTTTATCACCATCTATAATTAATATAAAATCATAAGGATCATCACCATGACCTTCTGGTCTTCTCATCAAGACTATAAATTCTTTATATTTATTCATTCTAACATCAAAAATATATGCATTGTTTTTGGGTGGAACTCCTGAAATACTTACTTTATCTTTAAAATAGCCTTTAGCATCAAAACTAGAAATATATATTTGGTCATTTTCTAAGGTTACTTGTAATATTTCATTATTATAAAAAATCATATTAGCTCTTAAATTTAAATCAGTTGCATTGTGTGAAATTGATGTTATTTTACCATTTTCATCTACGTTATATAAAGATGGATATGCGTGTACTTGAGTAAATATAAAATTATTATTAATGTAACCATACCTAGATATGTTACCAACTCTGGAAACATTTGAATAAGGTGCAGCTTCTATTCCTAGATCACTAAATAAAACAACTTCACCATCCATAAACCCACCTACTGAGCCAGTATTAGTAAAACCTAATGCCATAATTACACCTCCCTTAATAAATGTATTTCACATTTTAAATCTTTTTCTGGTTTTTTCTTAGAATACAATATAACCTTTGCAGGTTCAGTCTTAGTAACTGGATATAAAAATCTTGAATTGAATAAACTATCTTTTTCAAAAACAACAAAAGCTACCATTTTATCAGTTATTGAGCTATCTAAAAATCTATATTCATATTCGTTGCCTATTTGCTTCCAATTATCCTTATTTATAGTTATATTTTTTATACTTGTGAAAATACTAGATACTAATTCAGGGACCTTTTTATCTAAAATTTCGTACCCTTGCATCATCGTTACTATATCATAAGCATCATCATCTTCATGAATTGGTATATTAAAATTCGAAGTATACTTCATTATTTTATCACCTCATTCTCATTTATTTCATTAATTCTATATTCTTTCATTTGTTTATGTGTAATTCCTTCTATTTTAGGATCGCTGTGTATTACTTTTAAATTATTTTTTAGCATCCCATGTGTAATTAATATAAATTCAAAAGTATAACCAAGATGTGCAGGTTTTATTGTTTCTAGCATATCTTTAAAATCCCCCATATTTTTAGGAATCCCATATGCTCCTATAAATTTTACTGTAAAAGAATAACTTTCAGGATTTTCTATTATTTCTACTTCTCCACCACTAAAAGCTTCTGCTGTCTCTTTTATTAATTTTTTAGTTACAGTTCCATGACCACGTTTCTTAGCTTTAAGAATTTCTCTACGTTCTTCATAACTCTTGTTTAAATCTGTTTCTATTCCATATTCATTTTCCCATAAACTTAACCCCCATGTAGCTGTATCTATAAAGCATTGCTTAATTAGATCTTCTCTTTTCCAGTTTAATAATGCTAGTTCATATCCTACAACATCATCCCATACTTTAAATTCTTTTATCTGTCGTAAAACAGGAGGTAAATAAGCTAACAAATCTGGTCTATATAATTCTATTTCTTTATTTTTAGTTTCCAATTCTTGTCCATATTGATTTGCACCATAGTTTATAGATCCATACAATTATTACCCCTCCCTTAAAACTCCGTTAATTTTAGTCCAAACTTTATCTATTTCCCTAAGTTGACCATCAACTTTAACCCAACCATTTTCATAGTCACATTTTTTTTTATCAACTTTAATACTTAAATTTATATTGATTGGAACTTCATATTCAATTGTAATATATGGTTTTTTTAAATGATTAGAGGAATAAAATATTTTTCCATTATCATTATTTTTGTAATCGGTGCATAATATTATTACATTTTGACCGGATTGCAACGATGAAACAAGATGTTTTGCATTTAGTTTCTTCCATCCAATGCCTTTTACAGTGCGTGTTCCCCAAGAACTACTTGGTCTTCTTTTCCACTCTGGAGTATTAGACCATGTAGCTGTTTTTTCAAAATCACTATTATAAATAGAAGTATTACCAAGCTTAATGCCATAATCAACATTGCTATCTTCTTCATACCACAAGTATATGTTTAAGTTAGCACTTATTATATTTGCATTTAATGGTAAATCTTTTAAATTAAATCTGATGTAAGAAACAGCTTGGCCTATTGAATTATTACATACTTCTAAATTATTTTCGTTATGATAATTTGTATTCGGTGAATGACTTAAAATATATGTGCTATTTGTTGCATAGATTTTTTTTATATTCATAATATCACCTCTATTTATACTTAATCCATATCTCACCATCATTCATAGCATTAACATTTGCATCGTTAGGTGATAAAATTATATTTCTAATTTGAGATACTGTATATGAGGTATTAGGGTAGGCTTTAGCAATTCCAGTAAAGGTATCTCCTGTTTTACTTATTTTTTTGTTTAGATCCGGTTTATTTTCTATATTCATCCAATCTACACTCTCTGCCACCTCTGCAATATCAACTTTGCCATTTTGGTTTTTATCATATTTTTTTGTATGCATATCTCCTTGTCCAGTATTGCTTAAATCATCTTGTGTTATATATTTTTTATCATTTTCAAGTTGTGAAATCTTAGTTGGTATACTGTTTATATCTGCTTTATCATTCCAGTTATCAATTAATCCTTGTGTTATCTTATCTAATATAGATTTATTGCTATGATCATGCCTTTTATTATAAGCTTCATTAAAGTTAGTTTTTTCAGTGTCATTTATAAATCTTCTATTTTTATTTTCAATTATCATATCAGCAGAATGACTAGCTGGATGTATATATTTATTAGCATTTTCCTCTATTCCTGCAAGTTTTGCTTTTTCAGCACTTGTATAGTTCTCTGTAGAAAGTCCTTTACCTTCAATCTTATCAACTTTGTTTGCAAGTTCCTTAGTTATGGTTCCAGCAAAGTCAGGATCATTGTTTAAAGCATTAGCAATTTCAGATAAAGTATCTAGTGCCTCTGGAGCAGTACCTATTATATTTTTTATTTTTTCTAATACTTCTTCTTTTGTAAAAACTTGCTCTTTTAAATATCTTTTATTTAGCTCAGTATCTACATATGTTTTATCAGATTTTTTATTTTCTAAATTTCCTGCTCTAACTTTTAAATTACATATTTCTAAATTATTACTAGCCTTATATTTTTCTACTTCTATCTGAGTATTTACTATGCTTTCTTGTACTTTATTAATATCATCTGCTTCTACTGTGTCACCTGGGGTTTCATAAGTTATATAGCACTTATCTACTTTAGCAAAGATTTTTATTTCCTTTTTCCATGGAGTTAAACTTGGAGTAGATAAAATGAAGTTTTCTATCTTATTTCCTGTTAGTTTTGGGCCAGTATAAACCCTTACGCTAGTGTTGCTTATATTATCATGCTGCAATTCTCCTTCATAAACTCCATCTTTAAGCTCTACTTCTTCCTCAATTACATAAGTGTTCCCATCAATCTTATTTAGTTTTTCTGTAAATTTATCTATACTTATCACACCTCCAATTCCACATTGTCCAATATCGGTATTTCTTCATCTTCTAAGCTTATATTTGCTGTAGAATTATTTATTTTTAAATTACTATAGTCTAGTACTCCTGGAGTATTTAGGATAATATTACCTATCCTTGCTATACTAATGTAAGAAATTTCAAACCCTATTTCTTTTAAATAATTATCTAATAAATTGCTTAACTCACTTTGTATACTTCCTATATTAAAACCATTTGCCAATGTTATCTTTGCAATTATATTTATAGACTTTTCTTTAGCACTAACAACTGTAACAGTTGCTCCAATTGGTCTTTGTTCTTCTATATACTTAAATACTTTATCTATTAATTCTTTACTTGCACTATGCTTGTTTGAATCTATAATTACTACTTTTACTGTTCCATTACCATCCCACAAAGGAAATACTTTTGCATCTCCAACTCCTGGAACTTCTAATGCCCACAATTTATAATGATACTTATTTCCAGAAGTGGCTGGTGTAGTTACTTTTATCTTATATCTTTTATATAGATCCTCTATACTTTCAATGTTTATTCCACTATTTATATCACTTTTATTTGTAACTTTAGTAATATTAATTAACTGTATAGGCATTTCAGTTATTGTATTAGCTTTTACATTATATCTACTTCCAATTTCAATAGCTTCAATTTTTACATCAACTTCACCAATTTCTCTAATAGTAGCATTTTCAATAGTTTTAAATTGCAACTGATTTTTTGTTTGTATTAGTGTTCCTTTTTGTATTATTGTATCTTTAGCACCGTAAAAAGTTGATGTACCAGTAGCTTTTGTACCTTCTTTTCTATAGATGCCTATTTCAGCAGCTCTTTTTATAACTTCTTCTTCAAAACCATTCATAAAAGCACTATCTATAAAAACTTTGTTTAATATCTCTTCTAAATCTTCTCTAGTATCCTCCAATTCGTATGCAGTAGGTGCTAAAGTATCATGTAGTAAAGTACCTTCCATTGTAGATACATCATTCTCAACATAGTTTTTCATACGCTTCAAAATATCATCTGCATTTTCTTTATACACTAGCTTCCACCTCCCCATATATTGTTTTCGTTTTTACATTTATAGTTAATTTACTACCTTCAAAATCTACATTTATATTTTCTATACTTTTAATATAAGGATTAGCTAATAAACATTCTTCTAAATATCTACTAACTTCACTATCTACAAGTTTTTTAGAATAGCCTTTCCCTATAAGGGTTTCTAATTCATTTCCATAGTTATTACTATATATTTTATATTTATTTTTACTAGTCTTTAATACCTTCCATATCCATATTTTTAGTGCTTCATTTTTTTCAACTATTATATTTTCTCCATGGTCATTAGTAATAAGTTGTCTTGAAGAAAAATTTATTTTATATTCTCGGAAAACAGGTAATTCTTCTTTTATATTTTCTATTTCTCTTTCTTCTTCAATATTTATATCAGCATTGGCTGGGAATATACTCAAATCTACACCACCTTTGCTAAAACAATATATGTTTGATTCTTTTCTATCGGTATAGTTGCCACTAGATCATTTATATTTAATTCTTTTATATCTACATTTACCAGCATATTTTCTCTATGAAGTATTAAATCCTTAACCTTAATTTTCAAAGGGTTAGATGATAAAACAACTCCCATTCCTAATGGTGGGGGATTATTTTCAGCACCTTCTTTTTGCATTATTCGAATTAATTCTGCATATGGATTATTTCCTATTTTTCTTCACCTTCTTTTTAGCTAACCTTTGCTTTTTCTTAGCTTCTCTTATAGCTTGTTTTTTCATTTTAGTATCAAATTTTAATGTATCATCTGATTCTTTAGTATCCATTTTGTTAATATAAGATAACTCTAATTGTGTTGTAAATGTACCAGTTTCCATATCCCATGTATGTGTATCAGAAGTAATATACATAAGTGCATTTTTCAATGTACTTATGTATGGTATTTGTACATTAACTGCATATCCTGTTCTATAATTATAATCTCCTAAAACCCCTACACTAACATCTTTATCTATTCCATGTAACATATTTCTAGCAACTATATTCGGGTCCTTATCCTGTTCTTTAACATAGTTATCTTGTAATATTCCATAAGTTTTTATATCTGTTGTATTTTCAATTTTAGATAAGTATTTGTTTTTATCATCATACACTTTAACTCTGTTTATCATGTTGCCCATAGAATCTTTATATTCAAAAGACAGTAAATTCCCATCTCCAGTAAATACATCCCTAGCTGGCTTGATTATTTTACTAGTTAATTTAGAACCCATTTCTATTACATTAACTTTAGTATTAGACATTACAATAAAATATTGTTTCCCAGTTTGTTTACTCACTTGTGTATATAATTCTTGGATTGCATCATATACACTTTTTTGTGCTATTAACCATCTTAATTTTATATTGCTACTAGCGATATTACCCGTTTGAACCCCTATTTCTCCTAGTATTTGTTTAACTGCTCTGTCTGCTGTTGTATTAGAAAAATTATAAGTAACTTTATTCTTAGTTAAATAGAAAGCATAATCAAATGCAGTAAAATTTAAAGTATCATCACTATTTATGCTTCTGTCTATTAAAATTCCCTTAAAAATCTCTTTATTATCTAATGTTGCCAATACTTTTGTCGCTACTCCAATTTGTTGTTTGATTTGATATGGATCATTTAATGGGTACATAATAGTACATTCTAATTTTCTACTAACTTCTGCTATGCTTCTACTAATTGTTATATTCTTACAAAATTCTGTTATTTCTGTTGTTTTACCATTATAATTTTTAAATATTCTAATCAAAGTACCACCAACTTTTTTTGCCCTACGACTTTATGCTCCTTAAAACTTATACTAAAGTAAATATCTCCACTTCCATCCTTTTCTCCCCATGTAAATTCTTCTATTGTGCAAGCACTATTAAGTCTTGTGCCAGTAGCTATAAATCTGCATACCTGTTTACTATATTTAATACTATCTATTATCTTGCAATAATCATATGGATTAGACTTCCTAGAACACTGTAAAAAAGAATAATCTTTAGAAGGGAAAAAACCTGATACACTCCATTCTCTTAATCTATTCTCACCTGGCAATGATAACTCTCCGAAATTAAGCAAATTAACTGTTTTAGTCATAGTTCCGAAACTTACCTCGAATTCTGAAAAAGGTACTGGAAATTGAAAGGTCTCTATTTTATTTTTAAGCCAAAACTCCATATAAAACCTCCTAATAATAAAAGGCACTTAATTTAATAAGTACCTTTATGCCATATTTAGACTTTGTATTTTTAATTTCTGTACTATAGCATTTGCTATTGCGTCTATGTCAGACTCATTTCTTACATGCACATCTCCAAAAGTTATTGTTACACTTCCACCATTACCAACCATTCTCATACTATCTTTATTATCGTGTACTTTACTACCAGATGGCATTTCCACAATCTCTGGTCCATGTTCTCCTACTACAGATAGTCCGCCACCCCAATAAGGAGTACCAAGTGCATTTTTACCTTTTACACCACTTAAATCTCCATTTTTAATTAAAGATATAGTCCCTTGTATTGGATGCTTCATCCATTCTTTTATGCTACCCCATGTTTCTTTAATAGCTTGTGCAAACTCTGCACATTTCTCTTTTACACTATTAAATGTTTCTTTTATGCTATCTATTTTTCCTTTTACATAATCCACTACAGCTTGTGTCTTATCCTGTATTCCACCCCAGTTCTTTACCCAAGCCTTATGAAGTAAAATAGCTACCCCTATTATAGCTCCTATAGCTAACACAATCCAACCAACTGGTGTTGCAACAAAGGCTGCATTTAATGCCCATTGTGCCGCAGTTAATAACTTAGTAACTGCGGTTAATCCAATTACAGCTGCTTTATGTGCTATAAGTTTAATAGTATGTGCTACTACAACTCCTTTTTGGATTATCCATGCAGAAGTTTGTGCTACAATAGCTCCAACAGCTTTCCAACCTTTTGCAGCATAAGAAACTATAGCTATTATTAAATCACCTGTTATTTTTGCAGCAGTTATAATTACTTGAGTTCCTAAATCAACAAGTTTGCTAATTAAGTTTATTGTTATTTTTGCTCCTGTTTTTACTGCTTCTACTCCTAATTTTATAATAGATCCTATTAAAGTTATAAGAGATTCTGCTCCCGTTATAAGTGCATTTATTCCAAGTTCTATAATTTTAGGTATTAAACTACCAGCTATTTTTATTCCAGCTTTAACTGCTTGTATACCAGTTTTTATTAATGCAGGTGCAAATATAACTCCTAAAATTTTTGCTACACTTTTTATAGCAGGTTCAGCTTTTTTTAATGCTTCTTTGAATTTCTCAATTTTTTTCTTAGCAACTTCAAAAGATTTATGAACCTTATCTAAAGCATTATTTTTTAATTCGATTAGTTTTTCTTTAACTTCTTTAGCAGTTTTACATATTTTATCCCAATTTTTAATAACTAAAGCTGTAGTCGTTACTATGGCAATTAATGCTATTACCACTAAATGTCCTGGGCTAGTTAACCAAGACATTATACCTCCAGCTTTTTTAATACTACTAGATAATTTATTAATTCCAGTAATAGTTTTACTTATACTTGTTGTTATTTTTCCTATTACAAAAATAACAGGTCCTATAGCGATAGCTAACATAGCAAATTTAATTATAGATTCCTTTTGAGCTTTAGATAACTTATCAAATTTATTCACTGCTTTAGTTATTATATCACTTGCCTGTGTAACTATTGGTAACAATACAACCCCTAATTCTGTTAAACTAACTCTAAGATTATTTATAGCTACATTAAATTTGTCTTTTGTTGTGTCACTAACTTTTTTTAATGCCTTATCTGTTGCTCCTGTTGCATTTTTCATTTCTTTTGTTTTTCTTATTAAATCAGAGTAGTTTGCACCAGTTAATGCCGTTAGTGCAGTTATAGCTTCTGTACTGGAAAACAATTTAGCCATCTTATCAGATTGTCCACCAGTTTCTTTCTTTAGAATTGCTAATGTTCCAGCTAATCCTTTCGCTTTTATCATAGCTTCACCATTTTTAAAACCATATTTTTTTAATAGTTCGGTCATATCTTTAGTTGGTCTCATTAAATTGCTAAAGATAGCTTTCATTTGAGTGCTTACTTCACTTGTATTTCCTGTAACTCCCGTTAAGGTTGCCATTGTTCCAAATAACTCCTCATAGCTAAAATTTAAGTTTTTAGCAAGGGGAAAAAGAGGTTGCATTGATTTAGCCATTTCTGGAAAAGTGGTTACCCCTAATTTTGCAGTTTGGAATGCCAAATCTGATATTTTTTTAGCTGTTGCATCATTTATAGAACCATAACCCTTCATTGCAGCACTTATAAGTGCAACTGAATCATTAACTTCTGCTCTTCCAGCTTTAGCTGAATTAGCCATAGTTTTAAATATAGCTTGTGTTTCTTTTCCACCATCACCTAAAGAAGATACCGCAGTATACATACCCTCACTCACTGTTTTTAAATTCATGCCCGTTGATTTAGATATATTTAATACTTGTTGTTCGTATCCTTTTAAATGTGAATGATTATCTAAAAGAGTGTCTATATCAGCTATGCTTTCTTTAAAATCCATAGCTGATTTGGTTGCTAAAGTACCAATAGCCGCAAGTGGTGCAGTAACATGAGTAATAAAATTTTTCCCTATATTTTCTGTACTCTTGCCAACCTTCCTCATATCATTGGCAATATATTTAGATCTCCTCTGAAAATCATTTAAGCTTTGATTAATATTTCTTAACTGTCCAGAGAATTGATCTCTAAGCATTAAAACTGCATCTATAACGTGAGCCATGTATTACCTCCTTTCTTAGATAATAAGAAAAAGCACCTACAATAAGTAAGTGCTTAGCTTTCCCTATTCCATTTTCCGTTATTATAAGTATATTTAACAAAATCATGCGGTTGTTCTTGTATTATAGTTAAATACATATCTTTATATTTATTTACGAATTCTTGTTCAACTAATTTAGTAATAGCATTAACTACGCCTTCTCTTAACATTTCATCTTTATATGCTATATCTATTTTTAGAGTTGCATTATCATCTTTAACTGTCTCTGTGCATTTAGCATAATCCAAATCTAAAGAAGGCATATTATGCTTTAATTTCTTAGTTAACTTAGATGAATCTATTTCAGTGACCGTTTCCTCGCTATCTCCTCCGCAACCAATTAATGCAAATATAAATACACAACTTAATAATATACTTAAAAATTTCTTCATGTATAATTACCTCCCCTTCTAAGGTATATTATACAAAACTTTCTTATTCTTTTCTATCTTCTATTTCCTGCATTGCAAAAGCATGTAGTATTTTCTTTTCTCCAATTCCCATAGAATAAAAAGAAGATGGTGTAATATTCTTATGTTTAAATAAATAATACATAAGACTTACTTCTCCATCTTCTTTTATAAGTTTTTTATTTCTTCTTCCTCTTCTTCATCCTGTTCATAGCCATTTAATTCATTTATTGTGTTATATAACTCGTCTATTTCTCCAGCTAATAGTAATTTATTAATTAATTCTTTTGGAGTAGGAGCATTAAAATGTTCTAGTAAATCTTTATTTCTAAATGTGTCCGAGCAACCTTCCATAACTGTAAGTACTTTATTCTTCCCCATGTTGGTACCTTGGATATTTCCCTTTTTAGATATTTCTATTAAGTTATCCTGTATCTTGGCCATCTTTTCTGGACTTACTGCTTTACATAAAAATGTAACTTTTGCTCCTCCTAATTTACCTAGCTTTAATGTAACTTCCTTATTAGGCATTTCTAATCTTCCAGCATCTAAAGCTAATAATTTTTCTACTGTATTCATATTATTTTTCCTCCATTATACTAAATACTTTTCTAGTTAAATCACTTTTTACTTCCGTAGGTACTCTATTGTCTGTTAATAATGATTTTATACAATTAATTAAACCTAAAAATTTATCTGTATCTGTTAAACTTATTTCAATCCCTGTTTTCATATTATTACCTCACTTTATTATTTTTTATATTGAATCTAACACTTCGTAATCAGAAAAACTAAATGGAACACTTTCTTCTCCATTTTTCTTCCCTTCCCAATCTGCCAATGTCAGCTCTTCAAATGTTGCATCCTTAATACATACTCTTTCAACTCCAAGTGCATCTGGATCAGATAATTTACTTATTATAGTAACCTTAGTTTGTTTGCCTTTTTTAATGTTATCAGCCATTAATTTAATAAACATACTATCTACTTTATTAAGTTTTAAAGTTCCTTTTCCCTCATATCCAATTACTTTAGTATGCTTTGCTAATGTCCCTAACATATTTACATCTGCAGTTTTTAAAGAAATTTTCGCCTGAAATGCTGTAGCGTTAGCTAGTAACTCCCCATTTATCCAAAGCTCCGCCCAGGTACCATTTATAACATTTTCCTCTGCGTAATGTCCAGCCATTTAATCACTTCCTTTACATTTACATTTCAAATTGTATTTCTAAATCTTCCATAGCATCTAAAATTCTTATTTTCCCTTTTAGAAATACATTAGATCCAGTACTTAATTCTTCTAATTCTTTTTTATTTAATTTAGATACGTCTACTTGTTTTTTAGTCCAATATTTTTTTATAGCATCTACATTTATAGATATGTTATTTTGTCTTTTTGCTAAAATACCTTCCTGCTCTAATCCTTCTAAATAGCCATTAACCGCCACTACGAAATTAACTTGATTATCATATCCATTGCCATATTTACTCACATAATCATCTTCAAAAGTTTTTCTAATATCCCTATCTATTAAATCCATAGTAGATACGATTTTTATTTTTTTATATTCTTCTCCCTTACCTTCTATAGTAGAAGTTAGACTGTTTACCGCCCTTCCTATCTTGCATTTTTCACCATCATTAATAAGCACTAATTCTCCATTATTTATCTTCTTATCAATCTCACTTTTTCTTAAATGCGGTACGTCTGTAACTTCATTTAAGACATAATAAGTTGCACTTATGTTTAATGGTGTTCCAGCAAAAATACCTGCCAATCTAGAACAGTATTGAGTATTTGTATACTCTTTTTCTCCTACTTGTATTGGTGTAGTTGTAAAGTTAATTATTCCTTCATTATCTGCTTTTCCATTTGGCAATACTGCTTGAACTTTTATATTATCTTCTCTAAGTCCTTTTATCCAAGTTGCTACCTTATCTACCTCCCCTTCTTGGATACATGGTATAGCTATATAATCCCATACCTCTGCTTCCATAACATTTAAAGCATCTTCTAATACTCCTTCTTCTCCAAGGCAATAAAAAATGACCTTCTTAGGTGTTTTATACCCACCCTTAAAAGCCAGTTCTATTTGCTCTTTATTTTCTGTACTTAATTCCTTTGGTGTATCTTTTACACTTTCTAATGTAATTACTTTATCTATAACTTTCTTATCTTTAAGTACTAATGCTACTGTACCTTTTGAACCACGTTCTATTGCACTTTTAGCTAAAGATTTAAAGACAATATCTATATTAGGTAATCCCATGTAATCACTCCATTTCTAATTTCAATTTTGTTGCTTTTGTTGTATTTTCTTTTCTAGGTAATGCATCTAACCAGCTAAGGTTGAACCTAAAACTTAGTACACCATCTATATCCTCGTTACGGACATTCCTAGGCACTAGTTTTCTATCCTTAATATTTATATAAGGAAATACTTTTTTCTTTAGTATTTCAGCCATTTTAAGGTTGTCAATTTGCTTTTGGCTTTTAGAAAAATAATTTATTTCTATCATATAACTATTTTCTAAAATGCTACCTTTTAATAAATCTGTTCCCCCTATAGGCAGAATCTGCACAAAAAAACAAGGTCTTTTAAAACCTTCTTTGTATCCATTATCAACTGTTTTAATTCCTGTATCTTTTAATAGATCGTTTATAGTTTCCTTTATATCTAACATATCAATCAATTGTATCACCTTACTTCCAAAGCATAAAAGATTCATCTCTAAGTCTTCAATTCAGTTCTTCAATTCTTACTTATAATATTTTTTAAAGTTGATAATTGACCATCTAAAGTTAAATTAGCAACTTTTCTTGCTTCATCAATATACTTTTTTGTATACTCAAAATCCTCTTCTAAGCCATCGTAACCAACACATTTATATGAGCTACATAAAATATTTTGTAGCAATATCGGTAGATCACAATAAATTAACTTCATAGTTTCTCTGTAATTATTATTTTGATAATCTTCCATATTCTCACCTACCTTAAAATTTTCTTAACCATCTTTTCTAGCATTTTAGGGAATTCTTCTTCCATTTCTTCCATAGATCTTTCTACCATGTGCTTACCTTCATACCACCCTTTTTCTATGCCCCCAGGTGTTACTATTCTATGTCCCTTTTCCTCAAGATGGAAATGTGGGGCTGTATTTGTCATCTTAATGTTCATTCCATTTCTTTCATACTCTATAGGTAAAACTTTATAACTATTCCTTATATGTTTCTTTCCTTTATCATTATAAGATGTTTTTTCTTTAGATAATTTCTTAGTTTTATTCATACCTTTTTTTAAAGCTTTTTCACTTAATTCAGGTACCCTTTTTTCAGCTTCTTTTAATTCGTTCATTGTTTCTTCTAATCCATCAAATTTAAAATCAACTGTATGTTCACCACTCATTTTCTTCACCTTCATAAGTTTCGTTTTCGGATACTTTTTCTTCTGCAAATAGAGTAAGATATTGACCATTCATATCCTTGCCTAATATATCCTTAATATTAAAAATAGTATCTTTATATTTAACAAACATACTTTGGTCTATATCTTTTATATACCTAATAATAATCTTATAAATTAATCTTTGCTGTAATTTTTTATTTTCTAAGTATTCTCCACCTTTGTCTAAGCTAGTTATAAATGCCCATACTTTTTTAGTAGGTACTAGCTTTAATTCTATGTCTCCTACTTCATTTTCTACATCTTTATATTCCATAATACTAACTCTTTTATTTAATGCTCCTGCATCCATATTATCACCTACCTACAATACTGAATTTGGGTTAATAAACTTCTTACTGTATATCTAGCCTTTTCACTTGTTTTCATGTTCATAGAACGATTATCGTACCAATCACTTATTAGAACTAAGGCTAATAATTTAGCTTTTTCTTTAGTTCTATCGCTCATTTCATCTATAGATTTACTAGCATCTTCTATATATAATTCTGCATTTCTTATTAATAGTTTTAATGTATTATCTTCTTCTTCTGTATCAATTCTTACATATTCTTTAACTTCTTCTAATGTTAAAATCATAATATCACCTACTTTTTAGTAGACTTTTCTGCGTAACCAAAATTAATAAGTTTTTCAGATAATTCCTTTGGTAACTCTCGAATTTCATCTTTTTTAAAATCTTCATATCCTATCCCAGTACAATTTATTAGTGCTATTACTTTCATTTTATCACCCCTATAATAAAAAATAAGAGGGAATAAATCCTCTTATTTTATTGTTAATTGACCATAAACAAACGCTTCACCATCTCTTGTTTTAACTTCTTCTCTTTCTATAGCTCTAAATAATGTTACATCTGTTAAGTAAGCATCTCCAGCAACATCACTCGCCATAACGGATAATGTTTCTCTGTCAAACATTACAACTGCTTCTTTTAAATCTCCAATTATAACTGGTGCTTTAGTACCATCTGTTGTGTCAGATGGTAAATCTTTATTTGAAATGACGATAATTGGCACACCAAATAATTGTTTACCACTTGGAGCGGTAATTGAAGGTTGTAATAAGTAATCGCCGTCAGTATCTTTTAATTTATCTAAGTAATTATATCCATCTTGATTAGTTACTATTGAGCTTGTATATCTAAATGCAGGATCTAATTGAACATTTAAAACATCTTTAATATCATCAATTTTAGCTATTGCAGTTTTAGGCTTTTTATTTAATTCAGCTAATATTAATTTGTTTCTTGTAACTCTACTTTCATCACCAATCCATTTTATTAGTGCACCTCTTATAGCTTGGTCGTTATCTTTTAGCAATTCATTTGTAACCTTAAAAAATCCAGCATATTTTTTAACTGAATAAGGTAATTGAGTAAATTTAGGAGTTGCGTTCTCCTTTATCTCTCCACTTTCAGCAACCTCAGCAAAACCTGTCTGTTGACTTCTTGACTTGAATACTCTTGAACCACTTAGAGTTTGAACTGGCTCAACTGTAATTAAGTTTTGAAGTGCATCTTTAGACTGTCTTAATTCATTAATAGCCGTGCTTATATCTTGTGGTACTATATAACCACCATCTCCGGCACTTCCTTCACTCATAGAGTTTTTGAACTTAGTTCTTAAACCATTAATAAATTCTTTAGTATAATCATTTTCTACTGGGGTTGGTTCTTTAGGTACTTTATTCTCCACTTTTTCCTTTTCTTCTTGCTCTAGTTTTTCTTGCATTACTATTTTAGCCTGTAAAATTTCAATTGCTTCATTTTTAGCTTTTATATCCTCTAATTTAGCTTCCTTATTTTCTAATAAAGCTTTTGCTTCATTCTTTAAAATTTCTAATTCTTGTTTCATTTCTATACTTTTTAACATATTATAAATTCCTCCCTTAAAATAAAAATAACTACATATTTAAATGTAGTTGTAATTTTGCTTTTTCTAATTCTAAATCATTGTTTATATTTTCTTGTGGTAACTCATTTTTTAATTTATCTCGCATTTTATTAATAACTTCTTGTGGTAACATTCCACTATTAATACTAGCCACAAGTTTAACCCCATCTGTAAACATAATTTCATCTACAAATTTATTATCTAATGCATTTTGAGGTGTTAACCACGTTTCTTTATTCATCATATCAAGTAATTCTTCACTTGATAAGCCTGTTTTCAACTTATAGGCATTTGATATTGTAATGTTAACATTTTTCAATATCTCTGAACCTTTTTCCATATTCCTATAATCTCCACTAAAACAACCACTAGCATTATGAATCATAATTTGCCCAGTTGGTGAAATTAAAACTTTATCCCCAGCCATAGCAATTACTGAAGCTGCACTTGCTGCTAAACCTACTATTTTTACAGTTACATTACCTTTATAATCTTTTAGCAAAGTATAAATTTCACTTCCAGCAAAAACCGAACCCCCGCCAGAGTTGATTTCAACTTCTAAGCCCTCATTATTTTTTGAATTAGATAATTCATTTTGTACTTTTTTAGGACTTGTAGCATCTATCCCAAACCAATCATATATCCATTGGTCATCACTATCTATAATAGGACCTTTAACATTTATTTTCATACTGTGTTACCTCCTTTCTTGTATTGTTCTCCAGCCATTTCTATAGGCATGTAGTTACCATTTACAAGTAATGCATCACCTCCCTGGGCGGTTGGCATATCCAAATAGTTTCTTGCTTCATTAGGAGTATATATACCATTATTCACGGCTTTTGCTAAACTTTCCATTTGAGTTTTCATATCAGCTCTTAAAATTGCATTTACGTTAAATTTAAAAAAGTAACCATTGTTAATTAACTGGCTACTTAATATCTTATAGGTTATTTCTTCTTCGTAATGTTTTAAAATATATAATAATGTATCTGTGTAAAATGCTAAGTTCTGACTTTCTGCACTCGCATAACTAGACTTCTCAAAGTCGTTTATCTGTGATGGCATAATACCAAATGCACTTGCTATTTGTAAAGCACTATATTTTTTCAATTCAAAAAACTGACTATCGGTTAGCTTTAGGTCTAATGGCACAAGTCTCATTCCAAGTGGTACTGGTATAATCTTGCCGGCATTAGTTGAGCCAGTAGCATATCCTTCAAAACCTTTTACAAGTCTAGTTTTAGCAGTATCATCTAAATCACCCGTGTACTCCAACACCGCTTTAGCAGTTAAGCCATTCTTATACAAGTTATTCATAAAATTTTGACTTTCTAAGCTTCCTTCAATAGTAGTAGCTAATATTTCTCTTACTGATTTTCCAATAATCCCATCAAAAGTTGTACTAGTTTTAAAGTGCATAACTTCTTTATTATTAAAAGTATATGTTTTACCATTTTTTCTATCTTGATATTTATACCATATACTATCCTTAGTACCTAAAACACCTTTATTATCTATAACTACAATAACATCTGAACTAGGCATTATCCATAAATCTTGTAGTTTAGGTCCTGAATATCTACACCAAACATAAGCATTACCATAATGTAACCTATTCATTTCAACCGTGCTCCAAAATGTTGCAGCAGTCATGTATGGATTAGGTCTACTTTTTAAGACATTATATAAGTTTGTTTTATCTGATTTTACTATGCCTTTATTAGTATTTTGATACATCTTTAATGTAAGTTTACCAACACTTTCACTTAACTTTTTTATACACGCAAAATAAGTAGCATCTGATAATTTATCTTTATTTGTTAAAGGATCTATACCTAACCACTCTAACAATTTTGGATTATTTAAATTTACAGTTTCATTTTTAAATTTAGATTTTACCCAATTGAATAATTTCAAATTCTCACCTCCTACCACCCCATCATATTTAGATAATCCTCTGTAACTTCATTTATATCAATTTTATTATCTAATAAATACAGTTGACTATAACAAAATGTACTAGCAACTAATAAGTCAATTCTTTGCTTATTTTTATTTTCTTTATCAAGCATAATGTCTTCTGCTTTACCTTTATTAGTAGTAGCACAACTTACACACCAATCAAGTAATTTATTTTTTTGGTATACTATCTTACCTTCATATACATCATCACGAAAACTCTTTGTAGGCGCACTTAAATTAGTATAAGTTTGTTTTAACAGTATAACTTCATAGTCACTTGATAGGCTCTCCATCATCTGTAAAGCATTATATGGGTCACTAACAATACATTTAATCTTACAGTTATAAGTGCTTTCAATACTTCTTATATATTCCTCTATCTTAGTATAGTTGATTATATATCCATCATGTATTTCACAATAACCTAACCTTTCATATAATCTATAATCTATTTTTTCACGCCTTTTACTTAAAGTTTCTGACGGTAAAAAACCTTTGGAATGTAAATAATACTTTCCTTCTTCCTTACACATTATAGATACTGCAGTTAAGTCTGTCGTAATAGACAAGTCCACTCCAACTACAACTTCTTTACCATTAAAATCTATTTTATCAACTTCACATTTCTTCCATTCTTCAATATTAAGATATTTTTCTTCACTATTTTCTTGTGTAAATACATTACAAGTCTTAGTTATAAATTCTTCTTTAAGATTATCCTGTACTAGTGCCTTTGCTCTATCTTCTCTCATTATTTTATAATTTTCTTCTAATTGTAAAGGATTCGCTTGATATAATCCTTTATCATTCCAAATGTTTTCTTTATCTGCATAATATATAAGAGCAAACATTCTTTCATTATCTACTACACAGTTATAAACTTTTCTAATATAGTCTAAATCCTCTTCCATAATTGAATTATTTATAGCATAAGCTGTTGTAGTTCTAAAAACTAAAGGATTTATTACGTTCTTTTGACCAGACTTCATAGCGTTAAAGTTATCAGCATTTTGGAAATTACCATGTTCATCTGATACAAATGCACTCGGTCTAATAGAGTTGTTTTTTCCTGCTTCTGCAACTCTAGGCTCAAAAAAACTATTGGTTAACTTACAAGTAATTCTTCCTGTTTTAGTAGTTGATATGTTAAAATGCTTTTTAATAAGTAGACTAGCATTAATAATTTGCTCCATGATTTTCTTAATCTCTGCTGCAAGTTCTTTAGTTAAACATATTGAATAAAATTCACTATATTGTTGCTCTGTAAGTAGCAATAGTATAAATACTAATCCTATTAATGCTGTTTTACCATTCTTTCTAGCAATAAATAAAGTTACATCATTGTATCTAAACTTAGCTTTATTATTTTTATGTCGCCATCCAAAAATATTGCAAATAAAAAAGCACTGGAACGAGTCCAAATGCTCTAACACTTCATTATTCGCTAAATACCCAGTAGCAAAATTCATTAATTTTAATAAATCATTTATCTTAGATAATTCCTCAATATCTATATAGAATTCAAAACTTTCCTCATGTTGCCTTTTATAATAATCCTGAATAAAAATAGCACACTGTTTTTTTACTTCCCATGTAGTAATCCCCCTACCTTCTACTACATCAGTAGCGTATTTAATAGCACGATCCAAAAGTATCACTTACTACCACCACGCAATACCTGGAGTAATACATCTTCTCGCTGTTCTTTAGCTTGTAGATTGATGTTACCTAATTTTGCACGCGATTGAGGACTTAACGACAACTCGTTGCAACACCTGAAAAAGTCTTTAGTGTATTTATCTTTTGCACTCATTAAATCCTTATCTAATAACCTTTCAATATCCTTATTAATTAACTTTTCAATTTCCTGTAATCTATCTATCGATATAGAACAAGTTCCTAAAATATAAATGTCTAAATTCCCAAGTATTCCACTTACTTCAAGCTCATGTACTATATATTTAAATATTTTTTTTTGCCTAACGTTTAAGTGTGAAGGTGGGGAAATTTTATCAACTCCACCTTTTAATCTTTCTTCTGTTTCTAGTCTACTTTCTTTTTCTTCTTTTGTTAGATTTTTACTCATTGTTTTAACTGATTTTGAAGGTCTCGCCATTTTCCTCACCTCCTATAATTTTCATTTAGGGAATTTTGTTTAACCGAGAGTGCACCAGGGACTTCCTTCTGTTACTTAAAACTTTCTAGATACCCCCCTCTCAATTAACTTTTCCAACTCCTTCTGTATATCCAATTTATTATTTTTATATTCCTCATGTACCTTCTGGTGACAACTTTCACATAAAGAAATGAGATTGCCAGGGTCTAATCTCTTATCCCAACAATCTTTCAACTCTTCTATATGATGTACTGTATCCATGTAAGTTATATTACATTTGCTCAAACACAACTTACATAAACCTCTATCTCTTTGCTTTACTGTATCCCTTATTATTGTCCATTCCTTACTAACATAAAATCTCTGTTCTTTCTTATCTTTCCTATTCTTTTTATAATGTCTATATCTTTCTCTATTTTGTTCTTCAAATTTCTTACTACAATCATTGCAATATTTCTGTGTGTAATCTAGTACTTTTCCACATCTGCATAACTTGTATAGCAATCAATCACCTTCTTTCATTTAAATAGAAATATAAAAATTAAAATTATTTCTAACATTCTTCCTTAATTACTTGACACGTATATATAGCACGTGTTATAATATAATTGTAAGGAGGAAGGGGATGAATTCAAAGGAAATAATTAAAATTATTACAAAAGATGGTTGGTTTGAAGTAAGACAACGAGGCTCACACAAACAATTTAAACATCAAATTAAACAAGGTACAGTAACAATACCTTATCACAATAAGGACTTGGACATAAAAACTTTAAACTCAATCCTAAAACAAGCAGGGCTTAAATAGCCCTCCTTGTTACAAGGATTTAATATATAAAATAAAAATTAAAAGGAGGTTTCTATATTGGATAAATATATTTTCTCTGCTATATTTGAACCTGGTGAAACTAAAGGTTATTGTGTAACCTTCCCTGATTTACCTGGATGTATAACAGAAGGTGATACTTTAGAAGAATCTTTGTTAATGGCAAAAGAAGCATTAGAATTACATCTATACGGACTTGAGGAGGATAATGGCGATATTCCTCTAGCAACTTTACCAGAAAAAATAAATTCTCCTGATAGTTCTTTTATAGTACCTATAGAGGTTTATATGCCTTTAGTAAGGAATGAAATGTCTAATAAGGCAATAAAAAAGACCTTAACCATTCCTTACTGGCTTAATAAAATTGCAGAAGATAAAAAAGTTAACTTCTCCCAAACACTCCAAGTAGCTTTAAAAGAACAACTCGGTGTACAGGATTATAAATAATTATTAATATAATTTAAAAGAGGTAAATTTATTTTACCTCTTTTGTTTTTGTCTAATAACTTCATTTACTTTTTTATAACTTCTTTCTTGCCCACCCTTCAACTAGTATAAGTTTATCTTTTACATTAGTTTCATATTTAGATTTTGCCATGAGCTCACCTCCTCATTGTTAATTGCTATTGTTTCTTCTTATAACAACTATTTTCAAACGGTTTGTTTTGCAATGTATCTCCAATCTCGACAATTTATCTCTGCCTTATAGCTCCTCTATGCCTTTTGTAACTATTATGCTTCATAAGCTCCATTACATCACTAAAAGAGATGTGCTCTCCCTCTCTCCTAGACTTCTTCTTTCTATTCTGTTGCTTTAATCTTTTATGTATATCTGGTTGCTGTGCCTTTATTATATTTTCTATCTTCACACCTCTCACCTTCTTTACATTTACTTATACAACTAACTAAGTTTATTGTAGATTCTGTATAATTAACGCAAAATCCGTCTGTTCCTTTTTCAAAGTGTTTACAATGCCCATTTAATCTCTTTTTCTTTTCCCTTTTAACTTTAGGTTTGATTTCCATGGTAAATATATATTTTTCTAAAAAATCTTTTAATAGCATTTTTATTTCCTCCAATAAAATAGCACCTGGAATACAGATTAAAAGCCTGCTCCAAGTGCTTTAATGTTATTACTGCGGTGCTTGTAGGCAAAGGCTTACCCTGCAACAACATTAGTATATTTCACTCTTCTTATGTTGCCTACAGATACATAAGCTTCATTGCTACTAAAATAAGCAGTTTATACACTTGCTTAGGTGTTGTGTTAATAAATGTGGCAAGTACGCACTACCTGCCTGTTACAGTAATCCCTGCTGTAACGATTCAGTTGTTTAATTTAATTAATTATATTTGTTTATGCTAATATATTATCACATCTATTCAGTAAATTCCTTGCAGTTTTGTTGCAATTTTGTTGCACGTTTTTATTTTTCATCTTTAAAATAAGTTTCAAAACCATCTTCAAGTGTATTTAAATATTTTTCTAGTTCTTCATCTGTCATGCTATCGTAATTGGGTTTATATCCAGATAGTGTTTTATATTTACTTAATTTTTTTAATAACTCTTTTCTATTCATTCTTATCACTCCTATAGATTTAACACTTTACTTATATCTTCAACTATTTTTACTCTTACTCTACTAACAGTACTCTTGTCCATATTTAAATTGAATCCTATTTTTTCTAAACTTAATTTATCCTTATATTTCATTTCTGCTATTAATCTATATTGAGTACCTAAAAGTCCAATTATATATTCCATCTCTGCATTTTCATTTTTAGTTTTCCTTATCCTTGCATGAAGTCTTAATATCAGTTTCCTGGTATATTTCCATTCTTCCATTAGTTTATCTATTTGTTTAATGGTTTCTTGTTCTGCATAACTTACTCCCAATGAAGAACTTTGAACCTTTTCTGAATAGCTTATTCCCATATTTAACTCTGTTTCTATATCTACATTGTTGTTTCTTATATCTTGCCTTAATTGTTCTTTAGTCTTTTCTAATACCATGCATCTATATTCTAGTTTTTCTATGTTCTTTAAATTATCATAATATCTATAGATCCTTCCCTCTGTTTTTCTAAAAGTTTCTTTATCTATCATTCAAGTCCTCCTCAACCTATTTAATTAATACATTCTCATACTCCAAATAATCTCAATATCGCTATACTTTTGAAATAACTTTTTAACTATATCAACTGTTAGTCCTTTTATTGGTTCGCTATTCATAAACCATCCTCTATGCATATGCGTATGCATATAGGACATACTTGCCTTTTCTCCATCTAATTTTAAAGTTAGTTTAGGACAAAATAACATTTTATTGATTAATTGCCTTTGTGTAACCTTCATTTCCATTCCTCCATTCTTATGTAATCTGGTCTATCTAGTTCTTTCTGTTTTATAGCTTGTACTTTTTTAACACTATATTTTAAACTTACTGCTGCTATAGCTATACATGTAAGCATAGATATTAATATTATTTCCATTACTACTCCTCCAATAACTCTGGGTTTTCGTATATGTTTCCTATAACTTCGCAATTTCTAACTGTGCCAATATTTAAGCAAGGAAAACTTTTAATATCTCCCTTTATAACTTTACATCTAAAACAAGCAATACTATCTTGATATATAATTTCATAGAGGTTGCTTGACCTATTAGATTTTTTAATAATATCTCCCTCATAAATCTCTTTTTCATTTTTATCTTTTAATCCTGTGTATTGCATGAACACATATCCATAATCATCTTGATTTAAAATAATATTGATTGTATCTGCTGGGTTAGTATTATATTCAATGTGTCCTGTCTGCTCTTTAGTGTATACCATCATATTCAATTCTTTATCCCAAGCTCTAAATTTAACCTCTCTATTCATTAAATTTTCCTCCCTTGTTTAATTCTCTGTTCCATCTTTAAATCTATATATCCAACACCACTCTGGAACGGACTTTTTTATATCTTCTTCAAATTTCTTACCTGGATTAGCCATAAAGTCTACCTCCCATGCAGGACCTTATAATATCCTCTTGTTTATATTTATCTACATGCTTATTTATAGATTTTGTTATTTTCCTTACTGCTACATTACCTATATCTAAAATCTCTTTTATTTCTTTTTCTGTATATTCTTCTTTAGTAAATAATTTAAATAGTTTCTTCTGGAACTGTAGCTTGTACCTTATGTCTATTTGCCTACTTTTATGTGGGCTATTGTTCCCTCTGTGATGTTCTGAACATAAATACATGATATTATATTGGTAGTGTTCTAAAGCCTTCTGTTGGCTTCTAAACACAATATGATGTTTTTCTGAATTTGGCTTTCCACATACTTTACATATCTTCATATTTTCCTCCTATGTTTTATTAATTGAGTGTCATATGAGAATATAGCTTTATAACTATACTCCCATATGACCTTATTTTTGTATTATGCTATTATTTGAATGTTTCCTAATCCTTTAAGTTGTTCTTCTAAGTAAGCTTTTATTTTTATCATAGCTTCATTTCTCCATGCTCCTCCATCTGCTTCAAATAGTGCCGCCCTTGGACCACTTTGCATTCTAAATATAAACTTTGATTCTGGTTGTTCTATTTCTGGAAAAGTTCTAAATGGTGCCAATACTACTGGATTAGGTACTTTTACTTCATTTACACTTGCCACTCCTGTTTTTATTGTTGCTGCTTGACTTACCCCATCATCACCAACTTCTTTTACTGCACTATCTTTAACACAACCTGTTACTTTAAGTAATAAACCTCTATCTTTATTTTCTGCAAATGATGATTGAAGCATTATATTGAACTGTTCTGTATCTAAAAATCTATCAAATACTATATTATTAGGTGTTAATGCTTCACATTTAATATAGTTTTCTCTATCCCTATCTCCTCTTAACTCTGAACAAAGTTCTACACAACTTGGACTTTTTACATGGACCAATAATTTTTTACTGCTCTTTGCATCAAAATTAGACTTCAAGTAATCTACTAATGCCGTTAATGTAGTTGCATTTAATTCTGATGGTTTTGGATCTCTTACTCTGTATAACTCTTTCGTTGAATATTTTTGACCATCTATTTCTATTACCTTTGTTTCTCCTAAACCTACTAAATACTCTAAAGCTTCTTTGTTTTCACTATACATAATTCATTCCTCCTAAAATTTTATTATTTAATTACTTGTAATCCAGATAAATCTTCTTTATTTTCTTCCAGCACTTCTCCAGTTTCACTATCAACTTTTAAAACCTGTTGTCCTGGTAGCTGCTTTTTAAATTCAGTTCCCAATACTTCTCCTTCTAAAGTTCTATCTATAATAATTTTTGTACTTACTGATGACTTTGGAGCCAATTTGGTCTTTGCAATAATTTCAACTTCTGTAAGTTCTCTATCTTCTCCAGTTATGAACTTCATTTCTAGAGTTAATTTTCTTTTAGTTTTATAATCTGTATTTGGATCTGCTATATTCTCTAACACCTCCTTTAAAGCTTGGTCCATTCTTTCTGCTAAAGCTCCATTAGCGAAAGTTTCTAAGTTAATCATTTTAGCCATTACTGTTCCTCCCCTATATTTTTAATATTAATACTTTCTCTTTTCTTAACCTGTTCTTTTAAGTACATCCGCCAATTTCCACCTATGGAATATATTTTAAGTACTTTCTTAATTATTTCCTCGTTCTTCATAGCTTTTATTCCTCTTCAATTCTTTCCTCTAATCTTTTGATTTTTAAATCTTTTTCAAATTTAACTTTGTCTTTTATATCAAAAGCTAATTCCATTTGCTCTAGCATTATTTTTACATCTGCTATTTCTTCAGCTATGTTAGATATATTACTTTTACCTCTTTTAAACTTGCATAGCTCTTTCTGAAGCTCTGACATTTCTTCAAACACCATGTCGATTTGAGCATATAAGCCATACTTTGATATTGCTTTTTTATAAATTTCTTTATTTGAATCTACATTTTTTATTATTTTTATTGCAGCATTTAAGGCTTTTACATCCTTTTCCCATATTGGATCACTGTCCTCGGTTATAGAATATTCACTATTTTCTTTTAAGCTTTCAAGTTGTTCTATTATTTGTTCCTTATTCATATTTACACTCTCCTATTTAAATATTCTTATTCCCCTAAATGTCATAGCCATAAATTCTTTAGATATATGTTCCTTTAATTCTTCTTTAAGTTCCTTCTTTATGTCTGCAATATCATAGCCAAGCTTTTCTGCGTCTTTTCCTATTTGTACTGTTATAGATAGTTCTGTATGCTCTATATCTTTCTTTAGCTTGTCTATCTCCTTTAAAACCATAATTCCTACTGCTACTGTAATTGTTATGTTTGTTATTAATAAAGTTGTTAACATGGTTTATTCCTCCTTATTTCCAACCTAAAAGTTTCTTTTCTAAATCATCAAATGTCATTCCTCCATCTGATCCGTCATATGTTCTTTGCTGATAGCCGTTAAAACTGTCTATCTTAGCTCCATTGTCTTTTCTTTCTACATTGCCCTTTGCTTTTTTATGTTCAATGTCCCTCTTGAATCTTTCCTCCTCTGCTTTTACTCCTTCAACAGTTTTAATATTGTTTTCTAACCAGTTTTGCAATATGCTTTTAACATACTTTATAGTCCTAACATTATTTTCTACTGCTTTTTTTAAAGCTAATAATATTACTTCCTCACTTAATCCATCTTTTTCATAACTTCTTAATACTTCTAATTCATAGCTGCTAATCATATGGAAGTTACTATTAAAAAATTCTATATAATTTGGAGTGCTGCCACTACTGTTATTATCATTATTTATATTATTATAATTATTATCATTATTGTTTGTTTCTGCTGACGTTCTGCTATCATTCTGGTTGCATTCTGTTTGCGTTCTGTTGTCATTCTGCATATTCTCGCTATCCGTTGGTATATCTGCGTTAAATCCGTTCTGTTTTTGATACCTGTCATAGTTAAGTATTATTATGGTAGTTTTCTTTTTATCTGGTTGAAACTTTATCATTCCATCTGAATCAAGCAATTTTAAGAATGTCCTTGTTTTTTCAGAACCCCATCCCCATCTTTCCATTAATTTTTTTTGTGATGTTATAAAACTGCCTCTTTCAACATTTATTAGTTCATTTCCTAATAAAATTTTTCTACTCTGATGATTGGCAAGAAGGAGAAGATCCAGCCATGCTTTTAACTTTTCTGCATCTTCCCATATCCAATGCTCTTGAATATCCCTGTATAAACTTATCCATCCTTTTTCCTTTCCTTCCGCCATAGTTTCCTCCTTAGTTATTAGAGTTCTGCTGTATGCTTATAATCTCTAATACATTCTTCACAAAGTATAGTTCCATCAATATCGTAGTATTCTTCTCCTTCACAAATATTGCAGTCGCAATTACAACAGTTATCTATTATTTGCATCTTTTCAAATTCATATCTATAGTCATACATACACTCTGGTATATTATTCATTTGCCCCACCTTCTTCTATTGAATTATCTTCATAACTTTCAAAATCAGCTTCTATATATTCTATATTTTCTTTTACTTCACCAGTTTCCATTATTTCATTTTTTATTATTCCTTGGTCAGCAGTATAAGCATTTTGCATTTCTATACTCAATATTCCCCATTTACTTAACATATTTCTTAATACTGTCTTCCTAGCCATTGCATCAAAATCTTTTTTCCAGCCAAAGTCTGATTTACTAAATTTATTTTTATGTTTAGTTATTTGTTCTTTGGTCCAGTAAGTTGATTTTTTAAATCCATTAAGTAGTTCAAAGTATCCTGCATATCCAATTACTGCATCTGATTCTTTTTTAGAGAAATCTATCTTTAATTCTTCAGTAAGTGGATTCCAATCTATTAATTCACCCTCATGGATCTCTATAACATTTATAGATTTATATTGCCCTGTTCTTAATGCTAGTTGCACATAACCCTTATATCCTAGTTGAAATTGAGCTTTATTACCATAAGGTACTACCCATGCATATCCCAAATTTTTATCTACTGGTAAATCTAAAGTTGCTGCTACCATGCAACTTGCAACTACACTCATTTGGTCGCATTTCTTTAAGTTTATATCACTATTAACCAAATTAACTATACTGCTCATGTATTGAGGAGCTCTTTGCTTTAAAACCTCTTCAAATCTTTTCTTTATTGCTGGGCTGTTCATTAAACCTTTTATTGTATTTCCTGCACTCCCTAACCCTGTCTCTTTCTTAGTGGTTAGTTGATTTTTTAAACTTTCATTTGTTGCCATATTAATTTTCCTCCTTTAAGTTTTTAATATTAAATCTTCTAAATACACTTTCCTTGCATACTTTTTTATAAATCTCTAAATATTTTTCCTTTAGAAGTTTACTATCTACTCTATTTGAAATAACTTTTTTCCAATTGACTTCATACCCTGGGGCATATCCTATTTCAGCTTCTTTAAGTTCATTTTTTATGTTATTTTCAATTTCTTTAGCTTGTCCTTCTAAATTTTTAATAGTTTCTTTAAGTTGCATTAGCTCGTCTATTTTATCCATGTATTCTGATTTTAAATCAATACTTATATTTGAATTTGATTTTTTATATTTTTCATTTAAGTATTTTTCTGCTGCTGAACTTCCATCCAATGTTGGAGGAATTTTCTTAATTACATGCTCCTGCCAAAATTCTTTTTCAGTATTAATTATAATTTTTATAAGTTCTTCATCACGCTCTACTTCTTTCCAAATAAATTTTTGTCCACCTATTAATACTGCTATATATCCCTTTTCTGCTCCTGTTACTTCTAAATAATGCTGCACTTGTACTAAATAGCTAGCTGGTATTTCTTCTCCTTCCCATTCTTTAGCTAAGAATTGATTAGCTGTTTTACATTCTAATACTGCATTTTCTCCTATAACTCTTCTGTCTATATTTGCAACCATAAATGGATACTTTTCATGTTTAAAATGCCTTCTATCTCTTCTTACTTTTTTACCTGTCCTTTTTTCAAACTCTTTAGCAACTACTTCTTCAAATTGATCTCCCCAATATGCTGCTTCACTTTGTTCATTAATTTCTGTTATAGGTTCTGTTTTTTCTAAATATATCTGGAATGGTGTTTTCCATTTATTTATTCCAAGTATTGCCCCAGCATCACTTCCACCAATTCCTTTTTGTCTTTCTTGAAACCATTTGAGTTTATCCAATCCACTTTCCCCCCCTTTTCATAATTTCTAATTTATCTACTCCAAATCTACTTTTTAAATCTTTATTTTCAGTTTCAAACCTTTTATTTTTTGCTTTAAGTTGAACACAAAGTGTATTCCATCTATAATTGCTATTCTGTAACTCCTTAATTTTTCTATGTTGCTTTTCTATAACTTCATTATCAGTTTCACAGCTTTCCTCAAGTTCTCTAATCCTTTTCTTAAGAAATTCAACTTCTCCATTAGAATTTAAAACTACTGTTATAGGCTGATTATTCTCTTCTTTTAAATCTGTTACATAATCTAAAATGCCTGTTAAATCGTTAATCACTGCTTCTTTGTGTATTGCTCTGATACTGTCACCTAAACTAGAATATCTATTTATAGTTGTTGTTAAAAGTGCTAATGTTATATCTTTTATCATCTTTACAATTCCTCCGTTTTCTTATAAACTGTAATTGGTTTTTTATTTAAGTATTTTAATTTGGCTGCTCTTGCAGCTCTTTTTTTATACTTCTTTAAACATTCTTTTATATCTATTAAATTCACCTCTCTTTTTAAGTCCACAAACCTTTGACATACATGTTCCTTCTGTCCTTCCAAGTGCTAATGATATATCTTTTTTCTTTGTGCTTTCCCACATACCACAAAGATAGGCAATATCCTTAAGTGTCCATGCTTTATTATGTTTAAAATGAAATTCTGGATTGTATATCATGCGGTGGTTTGAACTTGTATATTCAATTCCGTCTTTTACATAGGTTTTTACCATTCTTCTCCCTCCAATAAACTTTTTATATGTTCTGTTACCATTTCTCTAATGTCTACTATGGTTTGTCCATCTACTCCGTATAATTCCGTTCCGTCTAAATCACATTTGCTAAGAATATCAGAACTGGTATCAACATTGCTAATTCCCAACCTTCATCACCTGCCTTTATTATTCTTTTAATATTTATTGTTATTAGAATTATTACTGTTAATAAGCTGTATAGACTTAAAATTAATTGTCCTGTAGGTGTCATGTTTTAATCACCCCTATATTCAAATTTTCTTTCATACCCTCTAACTGGGTTACTAACTTTATATTTATCTCTTGTATTTTTTATAAAATCAATAACATAGTTTATAAATTCTTTAGCTTCATTACATTCTTTCGTATCTAGATTTAATACAGAACTTTTACGAAATGCTTTTCCAATTATGCAACATATAGAAGATCTTGTTTGAGCAACTCTTGGATTATTTTCATTAGTCCCAAATGCTTCAAGTAATCTTTTATCTAGATTTATTTCTTCGCAAAGTTCATACCATGGTTTAATAATTTTTTTCTGTTCAAGTTTCTTTGATAATTCTAGAACTTGTCTTTCTAAAAGGTCTATTCTTTCTTTATCTGTCATACCATTACCTCCTATCAAATTGAAAATACGGTAACCTTCCACCTGTTTCATGATGTAAAGTCATTACAACTACAAGTTCATCTATAGATTGTCCAATATCAACTTTCTTTTGTATAAACTTCTTAAACGCTTCTCTGTATTCTTCTGTGTAAACATAATCAGGTAAGTCATTCCATGTTTCTAAACATTCATCAAACTTTTTCATTGTTTTATTCATCTTTTATAATCTCCTTTCATAGAATAAACTAGGGAAGTATAAACCTTTTTAGCTTAGTAAAATTTTTTATAAAGGCTTTTAGCCTTTAGTTAGTGCCTACCTTATGCTTTATAGCCATCTGGCTTACTATAGTTGTATATATCTCTATAAGCTTTTTATCTTGTGCTATAACATCTAAATAGTTAAGTTCATCAATTTTACTCTTAGACACACCTTGTAATACTTGCCTTACCTTCATATTCTTCAATCTAATCTTTAAATCACAACCCGCTCTTTCTTCTAGTGCTTTGTAAGCTTCTTCTTTTGGTTTTTGGTAATCTTTAAGTTTAAAACAAATCTTAGTCATTAATCTGTTTGTTTCTCCTCTCCAACTGTTAGAAGGTCTTATTTCTATAACCTCTCTTACTGTTTTTAATTCCTCTTTAGTTTCTAACATCTTATTATTAACTTGGTTAAGCTGTTGTTTAACATCTTTCATTTCCTGTAAGCTCTGTATTAAAACATCTTCAATGCAAGTTGGCTTGTGCTGTTTTTTAATTTTTTTCTCACATTCAATAAAATAATTTCTATATTCATGTGATTTTTCAGTTCTAGTCATCATTGCTATATGTTTTGCAAATTCTAAAGTTATAGCAAAATCCATTGTTTCATTACCTTCAACATCTTGTTGAACCCCTATCCAATCAATGTTTTCTTTAAAAAACTCATTCTTCTCTATATTAGTTACATGCCATCTTGACCATACTGCTTTATTAAGACCTAGCCCTAAATATAATTCCTTTGCACTTACTAGTTGTTGTCCATCTTTATTAGAAATTTTAATTAAGTTACTCATTTTATCCTCTCCTTTTTATTTTGTTATTTACAATCTTAAACTATATTAGCTACTTTGTTTTTCACATTTTCTTCGCTTTTTCTTTACTTATTATTCGCCATTGCGGCACTACTGATATGCTTTTGTTAAATATGTTTAAATTGGATATGATTAAGCTTGTACTATTTTATAGAGTTGTTTTTAGTTTCATAGTTTTATCAGCTAATATATTACTACCTAAATATTCAACTACTGGTTCATAAAAGCTCTCTTCATTTATTTCTATTTTTTCTATTATTTTTTCTTCTATTATTTCTTCAGTTTTTTTATTAAATCTAGTCCTTACAACTATTGCTTGCATATAACCCCTCCTATTTACCACTATTAGTTGTATTGAAAAGTAAAAAAATATCTTCTAATGTGCAATCAAACTGCTTGGACATTTTTATTGCCAATTGAGGACTTGGTTTTTTGTATCCACCCTCCATTTGATACATCATACTATTACTTATATCCAAGCTTTTAGCTGCTTCTTTTGCTGTATCAAACCCAGCCTTTTTTCTTAAATTTGTAATATGATTTGCCATAATTTATCACCTCAATTCCTCGTTTGTTCTTTACTATGATTGTATTTTATAACTAATTGTGATAATAAACAAGTCCTATTATGACTAATAGTGAGAATTTAAATGCTATTTTCTTCAAATTGCTCCATATAGCTTTATTTATCACTATTAGTTGTATTGTTATTATTATCACTATGGGTTATAATAATAACGTATTTAAATATACCCATATTAATGTTAGGTAAGAGGTGATTGATTTGTTAGGTAAAAAAATTAAATCCTTAAGAAAAGATAACAAAATTACTCAAGAAGAATTAGCTATAAAAATTGGCGTTAGTACATCTATGGTAGGTATGTATGAAACAGATGCACGTAAACCAAGTTATGAAGTATTAATTAAAATCGCTGATTATTTCAAAGTTTCACTAGATTATCTGCTTCGTGAAACAGAGTATAAAACTTATATTGGTACAAAAGAAAATTGTATAAAGTTTAAAACTGTTGAAGAGGCAATGCAATTTATTTTAAAACAACCTGTTGTTATAAATTTTTGTGAATTTAATGTAGATAAAATGACTAATAGAGATTTAATAGAATTTGCAAATGAACTTTTAAATCAACTAAAATTAATATCTTATAAATATAAGAATAACATTTAATTTAAAAATGGAGGATGATTTTATGAGGGCGGTAGCATATGCTAGATTTAGTTCTGATAATCAAAGAGAAGAATCCATTGAAGCTCAAATTATGGACATAAAAAAATATGCACTTAAAAACAATATTACAGTGCTTAGAGAATACGTTGATGAAGCTATATCTGGCAGAACTTTTGAAAGAAAATCTTTTAAAAGAATGATTGAAGATGCTAAAAAAAATATGTTTGATCTGATACTAGTCCATAAAGTAGATAGATTTGCAAGAAATAGATATGACGCTGCTATCTACAAATCTATACTTAAAAAACATAATATAAAAATAAAATATGTCATGCAGCCAATTGATGATTCTCCTGAAGGAAATCTTATGGAGGGTATTCTTGAAAGTTTTGCTGAATATTATTCTGAAAATCTAGCTAATGAGGTTATGAAAGGATTAAAAATCAATGCAAAAAAAGCTCAATTTAACGGTGGATACCCACCACTAGGGTATGATATAGCAGAAGACAAGACATATATAATAAATGAAAGGGAAGCTAGAATAGTACGCGAAATATTTGATTTATATTTAGATGGTATTGGTTATAAAAAAATTGCTGATATTTTGAATAATAAAGGTTATAAAAATAAAAGAGGAAAACCTTTCGTTTTTAACTCAATTCCAACTATCCTAAAAAATGATAAATATTGTGGAATATATACATATAATAAAACTAGTAGAAAATACAAAAATGGTAGACGAAACCTTAAAAAATATAACAAAGATGAAGATATAATTAGAGTTGAAGATGGTATACCTAAAATAATTTCAAAAGAAAAATTTAATACCGCACAACAAGAAATAAAAAAGCGTACAAAGTCTAGAGGTAAAAAAATTGCTGTAAGAGAATATATTTTATCTGGATTAATAAAATGTGAATGTGAAAGAAAAATGAGTGGGTACGCACAAAAACGTTCCAAAGAAAGTAATCGATACTTTTATTATAGATGCACTGGATGTAATAACAGTATTAGAGCTGAAAAAATAGAAACTATAGCTACTAATTTTATAAAAGAACAGGTATTCAGAGATATAGATAATTTAATTATAAAAATACATAAATATATAGCTGATCAAGAAGCAGAATCCCCATCTGAACTTAAATATTTAAAAAATGAATTATCTAATTCTAATAACCAAATCAATAACATTGTAAAAATGATTAGTAACGGAGTAACATCCATGCACTTAGCAAAAAAACTAGAGGAACTTGAAACATATATTGATGGAATACAACAACGAATTGGTGAAATAAATAGAATGTCTGTTATACCAGAAGATGAAATAAAAAACTGGTTACTAGAACTTAAAAATTCTTTTGATAATGGTAAAAATATAAAAAAGATAATCTCTGTTTTCATAAAAAACATTGAAATAACGAAAGAAGATATTAATATAGATTTTTTCGTAAAAGCACCCTATAAGGGTGCGAACAGTCTAAGTGTCGCTCCTTCGGCACCATATTGAAAAGCAAAACCCCTAACCTATTGAAAACATTGTGTTTCTAGGTTAGGGGGGTTTTTAGGCTATGTGCAAATATTCATAATAATATTAATTTCCTATTCCTAAAGCACCCTTAGCTAATGAATCAGCTAAATCATTGTATTTATCTCCTGAATGTCCTTTAACTTTAATAAAATTAATATCAACTAATTTAATAGCTTCATCATAATATAGCTTATAGGCTTGTGTACCAGTTTTTTTAGCTTGCCAAACCCCAGTACACCATTTTTCTATTCCTTCATAGTCATAAATTATATCTAAACTTTTAATACCCTTATCTATACAAAATTGCATTGCACGTTTTGCACCTTCAATTTCTCCAGCTACATTTCTCATAGAAACCATATCTGGATTTGAAAATTTTTCAGCAAAGTGTTCCTCGCAACCATTATAAAACATAACAACTCCATAAGAATATTGTTTCTTAGTATTTTCATAACTTCCATCTACATATGCAACTGCTTCTGATTTATAGAATATATCTGATATGGTTTTATCTTTATCTATATTTGAATTTTCCATTCCCATAAAATTTTTAGCTTCTTCTTCTGTTGAAAATCCTTTATAAATAGCTCCAGAAACTCCATTTACATTTTCTTTGCATTCCTTCCATGTTGTATATATACCAGGTTTAATACCTTTTTTTACTGCATAAAATTTCTTTGCCATTAATTTTTAGTCCTCCTTTTTAAATAAATCCCTCAATGGTTTCTGCCCTTTTATATGCACCTTCTGTTACTTCCTTTGGATAGCCAATATAATCTAAAAGCCTTATTGCATTTCTAGTCTGAGAAACACCAACTTTAAGTTTATAATCAAAACTCAATCCATTTTCACTATCAACATCTTCACTGAAATAATAGAATTCGTAACCACCCTTTAAAATATCTACAAGTTCCCTATCATGGGTAGCTACAATAGGAATACTTCTTCCATTATTTATATAAGTTAATATCTCTGCTGACATGGCTATTCTCTCTATAGGATTAGTTCCTCTAAATATTTCATATATGGGGCAAAATACAGGTAACTCTTTTTTAAAGGCATTGACAATTCTAAGTATTCCTTCTGCTTCAGCCATGTAATAGCTTTTTCCTTTTGATAAATCATCATTAGGACTAATTGATGAAACTATATTAAAAAATGAAGCTTTATATTCTTTAGCTAATGCAAAATAAAAAGTCTGTGATAAAAGTATATTTATACCTAACATTCTTAAGAAAGTTGATTTTCCAGACATATTTGTACCAGTAAGTACTATACCCTTATCTTTAATAAAAATAGAATTTGCCACAGGATTATCTAAAAGTGGGTGAACCCCCTCTATAATATTCAGTGAAACTTCCTTTGTAAAATTTGGTTTTACATAATTCTCTTCTAAACTTTTCTGGTAACTTGATATTGAAAGCATTGCATCTATCTCACCTAAAGCGTAAAATATATTCATTATATATTTCTTTTTCTCTTTTAAAATCGCTGATATTCTATAGTAAGCACACTCTTCTACGAGAAACAATGTAGATATTACTTCAAAAAATCCGTGCCACATATTTATTATTCCTATTAATAATGTACTTCTATCAATATCTTTAATTTCTTTAAGATTAATTGTTATCTTTTCTATGTACTCTTTAATATCATCATTTTTCATATTAGATATTTTTTTAGAAGCCTTTATAATGTCCCTTAAATAAATTATCCCACGTGATTTAACAGTATTTTTTTCCGTATAATTTATAAACATATTTAATGAGCTAGAAACCATTATCATTAAGGCATACTTCTCACCAAACAATATAATTAAAATGATAGATATTATAGGAACAACTTTACCTAATAGTGTATATAGATAATATTTAAATTTATTTACTACAAGCTCACTTTCTATCATATCTAAAAAAGTATTTTTTGCATCTGTACTCAATTCAAGAAATATACGCAAAAGCTTTTCTCTTAAATCTGAGTCATTTTTAAATGCCTCAATTAATTTATCCCTTTTCTTTAATTTTTCTTCATCCTTTAGCGGATTCCTTAACATATAGTATAGAACTGCCTCTCCTGGAGTACTATAGGTTCTATCTAACTTTTCATAAACCCTATTCATATCCATATCACTCCAGGTATCATCATCAATTGTATATTCTTGTTTTTCACTTGTATCAAAAAAACCTTTTATATTTTTAAATTTCCTATTCTTTTCAATATCCTCTCCATATTGTTTTCTTACAATGCTTAAAGCAAATTTTTTATCCAATTGTTTTCTCCCTTCAGCTTCAAAATTTATATATATATTTTACCATAAATTCTATTTATCCAACAAACTAAACTCTATAGTAACCTTAAAAAGATCTCCATCAATAACTATATCCATGTTTCCTCCTTGCAGTTCCACAATACTTTTTGCAATAGCAAGTCCTAATCCTGAACCTTCTGATTTTCTTGCTTTATCCGATCTTTTAAATCTTTCAAATATTTCTTCTACGTCAAAATCCATTTCGTAGGCTGAAATATTTTTCATAGTTACTATGGCTTTATTTCCTTCTGTTTCTAAGTCAATATATACTCTGGAATTATTCATGGAGTATTTTAGAATGTTTCCTATTAAGTTTTCAAATACTCTCCAGGTTTTTTTACCATCTAAATTTGCATATATTCCGTCATTTTCTATATTAACTTTAAAGTTAAGAGAAGAACCTTCTATTTTTTCATAGAATTCTCCTAAAGCTTGTCTTAAAATTTCTGATATATTTACTTTTTCAATATTCAATTCTATAGATCCACTTGAAATTTTTGATGCTTCAAATAAATCCTCTATAAGTACTTTAAGTCTTTTAGATTTTTTATCTAAAATTTGAATATAAACATTCATATCTTCTTTAGATAAGTCCTCTGATTTTAATAAATCTACATAGTTTATTATAGATGCTAATGGAGTTTTTAAATCATGAGAAACATTTGTTATTAACTCAGATTTAAGTCTTTCGCTTTTAACCTGACTCTCAACAGATTTTTTTAATCCTTTCTTCATATTATTTATATTGTGAGCTAACTTTCCTAAAAGTCCTTTATCATTTTCATCTATGCTATAATCTAAATTACCAGCTACCATTTCTTCTGTACCCTTTACTATTTTGTTTATCATAGCAGTCTTTTTGAATATATATCGTGGAACTATATATGCATAGATTCCTATATAAGTCATAATTAATAAGAATATCAATCTATTGTATTTAAGCATAATCATACCAATTATAAATACGAATCCAATGGAAATAGTAGTAATATAAATTAAAGTTACTTTTCTAATTGTACTCTTTATTTTAAAGCTTTTCCTTATATCTTCTTTCATTTTATAAACAATGCTGTTTCTAAGTATTTTTGTTCCATTATTTTTATTACTTATAAAATTTAATCCACATATTAAAAGATAAAAAATATATAGTGCTACAAAACTTAATTTAATAAATCTTTCAATTTTAATAGGCATATCAAATAAATAAATTCTTCTTTGATATTCAGACATTATTTTTAAAAATATAAATAATAAAATTACCCGTAATTCTATTGGTATTTTATTGTACCTTTCTCTTGTTCTTTCTATAAACTGCAATTCATCTCTTTTATGTTTTAACAAGTATAAAGTTATAACTGTTCCAATGCCAAATGATATAATTCCTATAATAATTTCTTTTATAAGTTTACTTTTAATAGAGTTGTAATAATCATAATCTCTATAAAGTTGAGTAGATGTTTCACCTTTTTTTAAAACTATAATGTTACCTATGAAATTATTTCGTTCAACTTCTGTCCTAATAAAATTAAATCTATTATCTTCTATTTTTGAAAGATTTAAATTGTATAATGCCTTATCTTCTATATATTTATTTAAATTAGATAATCCACTTATATTTGTATATACTTTTCCTGTATTCTGATTTTTAATATAATAACTAAAAGCTTTATTATTATTTATAACTTTTTTTAATTTCACACTCTCTAAATCTTCCTGTCCTTTTTTATTTGAATCTGATTTATCTGCTCTTATTAGAAGCTGCACATTCTCAAAATAATTTATAACTTGGTTTTTAAAAAATTCGCTATTAATATAAGCGTCCTTTTTTAAATAATGCCTATGATTTAAAACATCTATACATGCTATAGATGCTATAGCTATTAAATATATACCTACTATAAAGCATATTAGTATTGATTTTTTGTTTTTCAATTTTATATCCAATTCCCCAAACCACCTTTAAGTATTCTGGTTCCCTTGGATTAATTTCTATTTTTTCTCTTATCCTCCTTATGTGAACCGCTACAGTATTTTCCCCATTATAAAAAGGTTCCTGCCAAACGCTTTCATATATATGTTCTATAGAAAATACCCGTCCCTTGTTTTCCATTAATAATTTTAATATTTTATATTCCGTTAATGTAAGGCTTACTTCTTTTCCATCTACCATAGCAGTCTTTGCATCCTTATTTAAAGTAAGTCCCCTTACTACTATTGCATTTTTATTTTCTTTATAATTCCCAAATCTTATATATCTTCTCAACTGAGATTTTACTCTAGCTATTAGTTCTAAAGGATTAAAAGGTTTTGTTACATAATCATCCGCTCCTATATTAAGTCCCAATATTTTATCCATATCTTCAGATTTTGCTGAAAGCATTATTATAGGTATTTTTTTACTTTCCCTTATTTTAAAAGTAGCTTTAATTCCATCCATTCTAGGCATCATAATGTCCATTATAATAAGATGAACTTCTTCTTCTTCTAAAACCATCAAAGCCTCTATGCCATCTTTTGCTTTAAGAACTTTTATTCCCTCATTTTTTAAATATATACCTATGGCATCTCTTATTTCATCTTCATCATCTACTACAAGCACACTATATTCCTCCAT
Protein sequences of DBSCAN-SWA_6 >NC_004557|2237703:2286330|2254723_2257306_-|WP_011100275.1|tail|DBSCAN-SWA MAHVIDAVLMLRDQFSGQLRNINQSLNDFQRRSKYIANDMRKVGKSTENIGKNFITHVTAPLAAIGTLATKSAMDFKESIADIDTLLDNHSHLKGYEQQVLNISKSTGMNLKTVSEGMYTAVSSLGDGGKETQAIFKTMANSAKAGRAEVNDSVALISAAMKGYGSINDATAKKISDLAFQTAKLGVTTFPEMAKSMQPLFPLAKNLNFSYEELFGTMATLTGVTGNTSEVSTQMKAIFSNLMRPTKDMTELLKKYGFKNGEAMIKAKGLAGTLAILKKETGGQSDKMAKLFSSTEAITALTALTGANYSDLIRKTKEMKNATGATDKALKKVSDTTKDKFNVAINNLRVSLTELGVVLLPIVTQASDIITKAVNKFDKLSKAQKESIIKFAMLAIAIGPVIFVIGKITTSISKTITGINKLSSSIKKAGGIMSWLTSPGHLVVIALIAIVTTTALVIKNWDKICKTAKEVKEKLIELKNNALDKVHKSFEVAKKKIEKFKEALKKAEPAIKSVAKILGVIFAPALIKTGIQAVKAGIKIAGSLIPKIIELGINALITGAESLITLIGSIIKLGVEAVKTGAKITINLISKLVDLGTQVIITAAKITGDLIIAIVSYAAKGWKAVGAIVAQTSAWIIQKGVVVAHTIKLIAHKAAVIGLTAVTKLLTAAQWALNAAFVATPVGWIVLAIGAIIGVAILLHKAWVKNWGGIQDKTQAVVDYVKGKIDSIKETFNSVKEKCAEFAQAIKETWGSIKEWMKHPIQGTISLIKNGDLSGVKGKNALGTPYWGGGLSVVGEHGPEIVEMPSGSKVHDNKDSMRMVGNGGSVTITFGDVHVRNESDIDAIANAIVQKLKIQSLNMA >NC_004557|2237703:2286330|2272201_2272585_-|WP_011100294.1|DBSCAN-SWA MAKMINLETFANGALAERMDQALKEVLENIADPNTDYKTKRKLTLEMKFITGEDRELTEVEIIAKTKLAPKSSVSTKIIIDRTLEGEVLGTEFKKQLPGQQVLKVDSETGEVLEENKEDLSGLQVIK >NC_004557|2237703:2286330|2278842_2279016_-|WP_155274208.1|DBSCAN-SWA MQAIVVRTRFNKKTEEIIEEKIIEKIEINEESFYEPVVEYLGSNILADKTMKLKTTL >NC_004557|2237703:2286330|2246447_2246711_-|WP_035124655.1|holin|DBSCAN-SWA MNIMEFITEQAFILVPTLYILGLMLKQTKQIKDWTIPWILLVVGILGSIALIGLNVNAVIQGILTAGVAVFGNQLVKQTTVKSKEEK >NC_004557|2237703:2286330|2267822_2268005_+|WP_035124618.1|DBSCAN-SWA MNSKEIIKIITKDGWFEVRQRGSHKQFKHQIKQGTVTIPYHNKDLDIKTLNSILKQAGLK >NC_004557|2237703:2286330|2241759_2242416_+|WP_011100261.1|DBSCAN-SWA MHKLEYEFVNNTPIFKCNYCGHCSKEIEATSFTSVKNRGCCWYFPKYTLLNIKNILNIGKENFIISLLNNKNSNISSYFIEVKGSFEEEEYYKFMRENEYTESSFDYKLFFRKCSFVTDKGCSLDFSLRPHPCNLYLCRNIINTCDKDYSSFSRERKDYFSYCNYYNEYLKYALMDKNLDLISAPLATLEFLKTISIPNFEPSEIKSIVFNPYGDVAS >NC_004557|2237703:2286330|2276989_2277145_-|WP_155274207.1|DBSCAN-SWA MGISNVDTSSDILSKCDLDGTELYGVDGQTIVDIREMVTEHIKSLLEGEEW >NC_004557|2237703:2286330|2252926_2253142_-|WP_035124599.1|DBSCAN-SWA MQKEGAENNPPPLGMGVVLSSNPLKIKVKDLILHRENMLVNVDIKELNINDLVATIPIEKNQTYIVLAKVV >NC_004557|2237703:2286330|2249524_2250172_-|WP_035124597.1|DBSCAN-SWA MNIKKIYATNSTYILSHSPNTNYHNENNLEVCNNSIGQAVSYIRFNLKDLPLNANIISANLNIYLWYEEDSNVDYGIKLGNTSIYNSDFEKTATWSNTPEWKRRPSSSWGTRTVKGIGWKKLNAKHLVSSLQSGQNVIILCTDYKNNDNGKIFYSSNHLKKPYITIEYEVPININLSIKVDKKKCDYENGWVKVDGQLREIDKVWTKINGVLREG >NC_004557|2237703:2286330|2274495_2275302_-|WP_011100297.1|DBSCAN-SWA MATNESLKNQLTTKKETGLGSAGNTIKGLMNSPAIKKRFEEVLKQRAPQYMSSIVNLVNSDINLKKCDQMSVVASCMVAATLDLPVDKNLGYAWVVPYGNKAQFQLGYKGYVQLALRTGQYKSINVIEIHEGELIDWNPLTEELKIDFSKKESDAVIGYAGYFELLNGFKKSTYWTKEQITKHKNKFSKSDFGWKKDFDAMARKTVLRNMLSKWGILSIEMQNAYTADQGIIKNEIMETGEVKENIEYIEADFESYEDNSIEEGGANE >NC_004557|2237703:2286330|2261153_2261513_-|WP_011100282.1|head|DBSCAN-SWA MDAGALNKRVSIMEYKDVENEVGDIELKLVPTKKVWAFITSLDKGGEYLENKKLQQRLIYKIIIRYIKDIDQSMFVKYKDTIFNIKDILGKDMNGQYLTLFAEEKVSENETYEGEENEW >NC_004557|2237703:2286330|2250182_2251433_-|WP_011100270.1|DBSCAN-SWA MISIDKFTEKLNKIDGNTYVIEEEVELKDGVYEGELQHDNISNTSVRVYTGPKLTGNKIENFILSTPSLTPWKKEIKIFAKVDKCYITYETPGDTVEADDINKVQESIVNTQIEVEKYKASNNLEICNLKVRAGNLENKKSDKTYVDTELNKRYLKEQVFTKEEVLEKIKNIIGTAPEALDTLSEIANALNNDPDFAGTITKELANKVDKIEGKGLSTENYTSAEKAKLAGIEENANKYIHPASHSADMIIENKNRRFINDTEKTNFNEAYNKRHDHSNKSILDKITQGLIDNWNDKADINSIPTKISQLENDKKYITQDDLSNTGQGDMHTKKYDKNQNGKVDIAEVAESVDWMNIENKPDLNKKISKTGDTFTGIAKAYPNTSYTVSQIRNIILSPNDANVNAMNDGEIWIKYK >NC_004557|2237703:2286330|2257923_2258334_-|WP_011100277.1|DBSCAN-SWA MNTVEKLLALDAGRLEMPNKEVTLKLGKLGGAKVTFLCKAVSPEKMAKIQDNLIEISKKGNIQGTNMGKNKVLTVMEGCSDTFRNKDLLEHFNAPTPKELINKLLLAGEIDELYNTINELNGYEQDEEEEEEIKNL >NC_004557|2237703:2286330|2242454_2242931_-|WP_011100262.1|DBSCAN-SWA MAYTALDLLDKIIYVEEKKRDICGAGLDKAKNDMRIYILIKVMIRNLDRSIEFHKELKEQMLKSKMEEIDFIVYDKISFLIDEFSTKLLAPDIFNIKGLIKFCLEFQNDILALCIYIQGRIVQREEDMNTSTYKALNAMIVEKKKEIKNLEEFDKKYC >NC_004557|2237703:2286330|2266735_2267197_-|WP_035124616.1|terminase|DBSCAN-SWA MARPSKSVKTMSKNLTKEEKESRLETEERLKGGVDKISPPSHLNVRQKKIFKYIVHELEVSGILGNLDIYILGTCSISIDRLQEIEKLINKDIERLLDKDLMSAKDKYTKDFFRCCNELSLSPQSRAKLGNINLQAKEQREDVLLQVLRGGSK >NC_004557|2237703:2286330|2248484_2248862_-|WP_035124595.1|DBSCAN-SWA MKYTSNFNIPIHEDDDAYDIVTMMQGYEILDKKVPELVSSIFTSIKNITINKDNWKQIGNEYEYRFLDSSITDKMVAFVVFEKDSLFNSRFLYPVTKTEPAKVILYSKKKPEKDLKCEIHLLREV >NC_004557|2237703:2286330|2277252_2277624_-|WP_035124357.1|DBSCAN-SWA MTDKERIDLLERQVLELSKKLEQKKIIKPWYELCEEINLDKRLLEAFGTNENNPRVAQTRSSICCIIGKAFRKSSVLNLDTKECNEAKEFINYVIDFIKNTRDKYKVSNPVRGYERKFEYRGD >NC_004557|2237703:2286330|2260508_2260739_-|WP_035124607.1|DBSCAN-SWA MEDYQNNNYRETMKLIYCDLPILLQNILCSSYKCVGYDGLEEDFEYTKKYIDEARKVANLTLDGQLSTLKNIISKN >NC_004557|2237703:2286330|2247034_2247310_-|WP_035124585.1|DBSCAN-SWA MNNIEEAKGLIIKYMESMGLVNDKNGVSKANIVLSVGGGSIDESQEYLIGKAFDNLVLEDRIRSSCNKYDKNGKLIRGTGSMHEEKFYLNK >NC_004557|2237703:2286330|2279026_2279242_-|WP_035124361.1|DBSCAN-SWA MANHITNLRKKAGFDTAKEAAKSLDISNSMMYQMEGGYKKPSPQLAIKMSKQFDCTLEDIFLLFNTTNSGK >NC_004557|2237703:2286330|2265047_2266739_-|WP_011100286.1|terminase|DBSCAN-SWA MILLDRAIKYATDVVEGRGITTWEVKKQCAIFIQDYYKRQHEESFEFYIDIEELSKINDLLKLMNFATGYLANNEVLEHLDSFQCFFICNIFGWRHKNNKAKFRYNDVTLFIARKNGKTALIGLVFILLLLTEQQYSEFYSICLTKELAAEIKKIMEQIINASLLIKKHFNISTTKTGRITCKLTNSFFEPRVAEAGKNNSIRPSAFVSDEHGNFQNADNFNAMKSGQKNVINPLVFRTTTAYAINNSIMEEDLDYIRKVYNCVVDNERMFALIYYADKENIWNDKGLYQANPLQLEENYKIMREDRAKALVQDNLKEEFITKTCNVFTQENSEEKYLNIEEWKKCEVDKIDFNGKEVVVGVDLSITTDLTAVSIMCKEEGKYYLHSKGFLPSETLSKRREKIDYRLYERLGYCEIHDGYIINYTKIEEYIRSIESTYNCKIKCIVSDPYNALQMMESLSSDYEVILLKQTYTNLSAPTKSFRDDVYEGKIVYQKNKLLDWCVSCATTNKGKAEDIMLDKENKNKQRIDLLVASTFCYSQLYLLDNKIDINEVTEDYLNMMGW >NC_004557|2237703:2286330|2252483_2252924_-|WP_011100272.1|DBSCAN-SWA MSIFPANADINIEEEREIENIKEELPVFREYKINFSSRQLITNDHGENIIVEKNEALKIWIWKVLKTSKNKYKIYSNNYGNELETLIGKGYSKKLVDSEVSRYLEECLLANPYIKSIENINVDFEGSKLTINVKTKTIYGEVEASV >NC_004557|2237703:2286330|2269451_2269604_-|WP_155274225.1|DBSCAN-SWA MNRKELLKKLSKYKTLSGYKPNYDSMTDEELEKYLNTLEDGFETYFKDEK >NC_004557|2237703:2286330|2270519_2270909_-|WP_011100291.1|DBSCAN-SWA MNREVKFRAWDKELNMMVYTKEQTGHIEYNTNPADTINIILNQDDYGYVFMQYTGLKDKNEKEIYEGDIIKKSNRSSNLYEIIYQDSIACFRCKVIKGDIKSFPCLNIGTVRNCEVIGNIYENPELLEE >NC_004557|2237703:2286330|2245646_2246447_-|WP_011100265.1|DBSCAN-SWA MSYTTNFINSVKDGAIASQKKYGVLASITIAQAILESGWGKSSLSRECKNLFGVKAIGGWRGQKKSYPTCEYYNGKKVLINDYFRVYNTYSESIEDHALFLVNNSRYRQHGFFNAKDYVGQANALQRAGYATSPIYAQQLINLIKQYNLNEYDNINNSYINIDGGAYASYKGGAPGINLIIRDCSKDIVRVFAWVDKDKGASWAFDLTPPNSNYTKLFKNTSKVITKRNGGYTFSKGSLYKLKIKGYNKCGQVISENQIVLKVPLK >NC_004557|2237703:2286330|2254258_2254678_-|WP_035124602.1|DBSCAN-SWA MEFWLKNKIETFQFPVPFSEFEVSFGTMTKTVNLLNFGELSLPGENRLREWSVSGFFPSKDYSFLQCSRKSNPYDYCKIIDSIKYSKQVCRFIATGTRLNSACTIEEFTWGEKDGSGDIYFSISFKEHKVVGQKKLVVL >NC_004557|2237703:2286330|2268845_2269088_-|WP_052042366.1|DBSCAN-SWA MLLKDFLEKYIFTMEIKPKVKREKKKRLNGHCKHFEKGTDGFCVNYTESTINLVSCISKCKEGERCEDRKYNKGTATRYT >NC_004557|2237703:2286330|2271461_2272178_-|WP_011100293.1|DBSCAN-SWA MYSENKEALEYLVGLGETKVIEIDGQKYSTKELYRVRDPKPSELNATTLTALVDYLKSNFDAKSSKKLLVHVKSPSCVELCSELRGDRDRENYIKCEALTPNNIVFDRFLDTEQFNIMLQSSFAENKDRGLLLKVTGCVKDSAVKEVGDDGVSQAATIKTGVASVNEVKVPNPVVLAPFRTFPEIEQPESKFIFRMQSGPRAALFEADGGAWRNEAMIKIKAYLEEQLKGLGNIQIIA >NC_004557|2237703:2286330|2253167_2254262_-|WP_011100273.1|DBSCAN-SWA MIRIFKNYNGKTTEITEFCKNITISRSIAEVSRKLECTIMYPLNDPYQIKQQIGVATKVLATLDNKEIFKGILIDRSINSDDTLNFTAFDYAFYLTKNKVTYNFSNTTADRAVKQILGEIGVQTGNIASSNIKLRWLIAQKSVYDAIQELYTQVSKQTGKQYFIVMSNTKVNVIEMGSKLTSKIIKPARDVFTGDGNLLSFEYKDSMGNMINRVKVYDDKNKYLSKIENTTDIKTYGILQDNYVKEQDKDPNIVARNMLHGIDKDVSVGVLGDYNYRTGYAVNVQIPYISTLKNALMYITSDTHTWDMETGTFTTQLELSYINKMDTKESDDTLKFDTKMKKQAIREAKKKQRLAKKKVKKNRK >NC_004557|2237703:2286330|2269615_2270131_-|WP_011100290.1|DBSCAN-SWA MIDKETFRKTEGRIYRYYDNLKNIEKLEYRCMVLEKTKEQLRQDIRNNNVDIETELNMGISYSEKVQSSSLGVSYAEQETIKQIDKLMEEWKYTRKLILRLHARIRKTKNENAEMEYIIGLLGTQYRLIAEMKYKDKLSLEKIGFNLNMDKSTVSRVRVKIVEDISKVLNL >NC_004557|2237703:2286330|2243229_2243517_-|WP_035124561.1|DBSCAN-SWA MEQNYMEKDYIEEVFEDLFFNSAIEREFLSEDYIIEKAKYDNMYKKLNEKLQEGDRVILNSLKEIQNELICLRLKECYDRALRDSIKILKHLKVI >NC_004557|2237703:2286330|2275303_2276224_-|WP_011100298.1|DBSCAN-SWA MDKLKWFQERQKGIGGSDAGAILGINKWKTPFQIYLEKTEPITEINEQSEAAYWGDQFEEVVAKEFEKRTGKKVRRDRRHFKHEKYPFMVANIDRRVIGENAVLECKTANQFLAKEWEGEEIPASYLVQVQHYLEVTGAEKGYIAVLIGGQKFIWKEVERDEELIKIIINTEKEFWQEHVIKKIPPTLDGSSAAEKYLNEKYKKSNSNISIDLKSEYMDKIDELMQLKETIKNLEGQAKEIENNIKNELKEAEIGYAPGYEVNWKKVISNRVDSKLLKEKYLEIYKKVCKESVFRRFNIKNLKEEN >NC_004557|2237703:2286330|2258963_2260025_-|WP_011100279.1|tail|DBSCAN-SWA MGLPNIDIVFKSLAKSAIERGSKGTVALVLKDKKVIDKVITLESVKDTPKELSTENKEQIELAFKGGYKTPKKVIFYCLGEEGVLEDALNVMEAEVWDYIAIPCIQEGEVDKVATWIKGLREDNIKVQAVLPNGKADNEGIINFTTTPIQVGEKEYTNTQYCSRLAGIFAGTPLNISATYYVLNEVTDVPHLRKSEIDKKINNGELVLINDGEKCKIGRAVNSLTSTIEGKGEEYKKIKIVSTMDLIDRDIRKTFEDDYVSKYGNGYDNQVNFVVAVNGYLEGLEQEGILAKRQNNISINVDAIKKYWTKKQVDVSKLNKKELEELSTGSNVFLKGKIRILDAMEDLEIQFEM >NC_004557|2237703:2286330|2276216_2276648_-|WP_011100299.1|DBSCAN-SWA MIKDITLALLTTTINRYSSLGDSIRAIHKEAVINDLTGILDYVTDLKEENNQPITVVLNSNGEVEFLKKRIRELEESCETDNEVIEKQHRKIKELQNSNYRWNTLCVQLKAKNKRFETENKDLKSRFGVDKLEIMKRGGKWIG >NC_004557|2237703:2286330|2273141_2273396_-|WP_035124627.1|DBSCAN-SWA MLTTLLITNITITVAVGIMVLKEIDKLKKDIEHTELSITVQIGKDAEKLGYDIADIKKELKEELKEHISKEFMAMTFRGIRIFK >NC_004557|2237703:2286330|2246728_2246962_-|WP_035124565.1|DBSCAN-SWA MNSETMGIKISEHTETLKEHDKRLDKIEQDGREFKIEIKNLCKDIKGLTTTMRWFMGLIIGSFVSFFFYAIQHNIFK >NC_004557|2237703:2286330|2261521_2261800_-|WP_035124611.1|head,tail|DBSCAN-SWA MILTLEEVKEYVRIDTEEEDNTLKLLIRNAELYIEDASKSIDEMSDRTKEKAKLLALVLISDWYDNRSMNMKTSEKARYTVRSLLTQIQYCR >NC_004557|2237703:2286330|2243716_2243869_-|WP_035124563.1|DBSCAN-SWA MLVHRTRISNSIDKKLYDKLKKLSEKTRIPMSRLLDEAIEDLLKKHSTKK >NC_004557|2237703:2286330|2263126_2263807_-|WP_011100284.1|protease|DBSCAN-SWA MKINVKGPIIDSDDQWIYDWFGIDATSPKKVQNELSNSKNNEGLEVEINSGGGSVFAGSEIYTLLKDYKGNVTVKIVGLAASAASVIAMAGDKVLISPTGQIMIHNASGCFSGDYRNMEKGSEILKNVNITISNAYKLKTGLSSEELLDMMNKETWLTPQNALDNKFVDEIMFTDGVKLVASINSGMLPQEVINKMRDKLKNELPQENINNDLELEKAKLQLHLNM >NC_004557|2237703:2286330|2244314_2244782_-|WP_011100263.1|DBSCAN-SWA MIQPGNGLLLRINFADGGICKAERTFLVIDNQDQQFWLLNVSSIKGKEWKLGMESNIEITKYKPPFVRPSFIKLDALYVIPKEKILKSKVLCKGRKINPKELTFIKNQFNHFKNTHKILSKVCTATELIEGNEKYHNDLFKEEAAVTNLKTEYNV >NC_004557|2237703:2286330|2279478_2279880_+|WP_035124368.1|DBSCAN-SWA MLGKKIKSLRKDNKITQEELAIKIGVSTSMVGMYETDARKPSYEVLIKIADYFKVSLDYLLRETEYKTYIGTKENCIKFKTVEEAMQFILKQPVVINFCEFNVDKMTNRDLIEFANELLNQLKLISYKYKNNI >NC_004557|2237703:2286330|2283728_2285648_-|WP_155274209.1|DBSCAN-SWA MKNKKSILICFIVGIYLIAIASIACIDVLNHRHYLKKDAYINSEFFKNQVINYFENVQLLIRADKSDSNKKGQEDLESVKLKKVINNNKAFSYYIKNQNTGKVYTNISGLSNLNKYIEDKALYNLNLSKIEDNRFNFIRTEVERNNFIGNIIVLKKGETSTQLYRDYDYYNSIKSKLIKEIIIGIISFGIGTVITLYLLKHKRDELQFIERTRERYNKIPIELRVILLFIFLKIMSEYQRRIYLFDMPIKIERFIKLSFVALYIFYLLICGLNFISNKNNGTKILRNSIVYKMKEDIRKSFKIKSTIRKVTLIYITTISIGFVFIIGMIMLKYNRLIFLLIMTYIGIYAYIVPRYIFKKTAMINKIVKGTEEMVAGNLDYSIDENDKGLLGKLAHNINNMKKGLKKSVESQVKSERLKSELITNVSHDLKTPLASIINYVDLLKSEDLSKEDMNVYIQILDKKSKRLKVLIEDLFEASKISSGSIELNIEKVNISEILRQALGEFYEKIEGSSLNFKVNIENDGIYANLDGKKTWRVFENLIGNILKYSMNNSRVYIDLETEGNKAIVTMKNISAYEMDFDVEEIFERFKRSDKARKSEGSGLGLAIAKSIVELQGGNMDIVIDGDLFKVTIEFSLLDK >NC_004557|2237703:2286330|2277926_2278676_-|WP_011100301.1|DBSCAN-SWA MSNLIKISNKDGQQLVSAKELYLGLGLNKAVWSRWHVTNIEKNEFFKENIDWIGVQQDVEGNETMDFAITLEFAKHIAMMTRTEKSHEYRNYFIECEKKIKKQHKPTCIEDVLIQSLQEMKDVKQQLNQVNNKMLETKEELKTVREVIEIRPSNSWRGETNRLMTKICFKLKDYQKPKEEAYKALEERAGCDLKIRLKNMKVRQVLQGVSKSKIDELNYLDVIAQDKKLIEIYTTIVSQMAIKHKVGTN >NC_004557|2237703:2286330|2257359_2257737_-|WP_011100276.1|DBSCAN-SWA MKKFLSILLSCVFIFALIGCGGDSEETVTEIDSSKLTKKLKHNMPSLDLDYAKCTETVKDDNATLKIDIAYKDEMLREGVVNAITKLVEQEFVNKYKDMYLTIIQEQPHDFVKYTYNNGKWNRES >NC_004557|2237703:2286330|2261808_2261946_-|WP_162827859.1|DBSCAN-SWA MKVIALINCTGIGYEDFKKDEIRELPKELSEKLINFGYAEKSTKK >NC_004557|2237703:2286330|2240355_2241501_-|WP_035109083.1|DBSCAN-SWA MLRFLDAGESHGKAMMSIIDGIPSNFKVDIDFINNELKRRQKCYGRGGRMKIEKDKIQFLSGLRGTMTTGNPITMAIYNNDSPNWEKILSGEVKKDEKITIPRPGHGDLVGYFKYGTGDIRDSIERTSARETSIRTAVGALCKQILKGIGIEVRSKVYSIGNLFDEKVDLFDQCKYKKIDNSLLRCYNEEVEKSFVKKIDICREQGETIGGTVFLSVRGVPIGLGSYSQWDRKLDALLSYAIMSLQGVKAIEFGNGMNLNLRGSTFNDEILYEKGKFKRPTNNCGGIESGVSNGENIEMKVYIKPIPSIKKNIRTVNLRNRKETTTRYERSDVSAVVPASIVLENIVAFEILKEILNKFPSDEYYELKRNISHYRGTIYLR >NC_004557|2237703:2286330|2273409_2274306_-|WP_011100296.1|DBSCAN-SWA MAEGKEKGWISLYRDIQEHWIWEDAEKLKAWLDLLLLANHQSRKILLGNELINVERGSFITSQKKLMERWGWGSEKTRTFLKLLDSDGMIKFQPDKKKTTIIILNYDRYQKQNGFNADIPTDSENMQNDNRTQTECNQNDSRTSAETNNNDNNYNNINNDNNSSGSTPNYIEFFNSNFHMISSYELEVLRSYEKDGLSEEVILLALKKAVENNVRTIKYVKSILQNWLENNIKTVEGVKAEEERFKRDIEHKKAKGNVERKDNGAKIDSFNGYQQRTYDGSDGGMTFDDLEKKLLGWK >NC_004557|2237703:2286330|2268059_2268470_+|WP_035124620.1|DBSCAN-SWA MLDKYIFSAIFEPGETKGYCVTFPDLPGCITEGDTLEESLLMAKEALELHLYGLEEDNGDIPLATLPEKINSPDSSFIVPIEVYMPLVRNEMSNKAIKKTLTIPYWLNKIAEDKKVNFSQTLQVALKEQLGVQDYK >NC_004557|2237703:2286330|2263803_2265036_-|WP_035124613.1|portal|DBSCAN-SWA MKLFNWVKSKFKNETVNLNNPKLLEWLGIDPLTNKDKLSDATYFACIKKLSESVGKLTLKMYQNTNKGIVKSDKTNLYNVLKSRPNPYMTAATFWSTVEMNRLHYGNAYVWCRYSGPKLQDLWIMPSSDVIVVIDNKGVLGTKDSIWYKYQDRKNGKTYTFNNKEVMHFKTSTTFDGIIGKSVREILATTIEGSLESQNFMNNLYKNGLTAKAVLEYTGDLDDTAKTRLVKGFEGYATGSTNAGKIIPVPLGMRLVPLDLKLTDSQFFELKKYSALQIASAFGIMPSQINDFEKSSYASAESQNLAFYTDTLLYILKHYEEEITYKILSSQLINNGYFFKFNVNAILRADMKTQMESLAKAVNNGIYTPNEARNYLDMPTAQGGDALLVNGNYMPIEMAGEQYKKGGNTV >NC_004557|2237703:2286330|2237703_2240334_-|WP_035109085.1|DBSCAN-SWA MPKRDKLVILDGNSLMNRAFYALPPLTNHEGVHTNAVYGFTNMLLKIKDDIKPDYIVCTFDKSAPTFRHQAYKDYKAGRKKMPEELREQFPIVKGLLSKLAIEIFEIEGFEADDLIGTLSKFAEEKDIEVYIVTGDKDALQLASDTTKILFTKRGITEREIYDRQKVVDEMGVTPTEFIDVKGLMGDPSDNIPGVPGIGEKTALKLIKEYSSVENVLNNIENLSGKKLKENLIENSEQAIFSKKLATIMRNVPIDIDLEKIKSKEEYDIEAVREIFIKLQFKSLIDKIPKGENHKEVEEEEELVYEEIHTIEDLIKLCTEIGNKDEKIYVNFKSTDTSLYSKISLEKLYILFKNKCYVVNIEERIEENKDKTIEILKTIFEKEAVRVVSHDVKVLSMALKKLDVEFTKALFDLKIAAYLIDSSKSDYDLSTLIQECLSKIVKDDEEREIRETSLLEKLCANLKEQIINLEMEKLYYEVELPLAFVLANMELEGFKVDGEMLENIGKKLQREIEKVQKEIYELADEEFNVNSPKQLGKILFEKLDLPVIKKTKTGYSTNAQVLEVLEDKHPIISKIGYYRQLTKLYSTYVEGLKNVIDEDGRIHSSFNQTVTTTGRLSSTEPNLQNIPIKYEMGREVRKVFVPNYENSIILSADYSQIELRVLAHIADDENLINAFKHHADIHTKTASEVFKVPIDEVTVNMRGNAKAVNFGIVYGIGDFSLAKDLKISRKEAKGYIDTYFERYPNVKKYMDETIDKAKRDMYVTTILNRRRFIPEIGNKNKIVKALGERLAMNTPIQGSAADIIKMAMVKVFNVLKERDLKSKIILQVHDELILNVYKDELEEIKSIVKDSMENVLPLSVPLEVDINEGENWYDAK >NC_004557|2237703:2286330|2272721_2273003_-|WP_052042367.1|DBSCAN-SWA MKNVDSNKEIYKKAISKYGLYAQIDMVFEEMSELQKELCKFKRGKSNISNIAEEIADVKIMLEQMELAFDIKDKVKFEKDLKIKRLEERIEEE >NC_004557|2237703:2286330|2270155_2270380_-|WP_035124623.1|DBSCAN-SWA MKVTQRQLINKMLFCPKLTLKLDGEKASMSYMHTHMHRGWFMNSEPIKGLTVDIVKKLFQKYSDIEIIWSMRMY >NC_004557|2237703:2286330|2277628_2277841_-|WP_035124359.1|DBSCAN-SWA MNKTMKKFDECLETWNDLPDYVYTEEYREAFKKFIQKKVDIGQSIDELVVVMTLHHETGGRLPYFQFDRR >NC_004557|2237703:2286330|2281471_2282092_-|WP_011100304.1|DBSCAN-SWA MAKKFYAVKKGIKPGIYTTWKECKENVNGVSGAIYKGFSTEEEAKNFMGMENSNIDKDKTISDIFYKSEAVAYVDGSYENTKKQYSYGVVMFYNGCEEHFAEKFSNPDMVSMRNVAGEIEGAKRAMQFCIDKGIKSLDIIYDYEGIEKWCTGVWQAKKTGTQAYKLYYDEAIKLVDINFIKVKGHSGDKYNDLADSLAKGALGIGN >NC_004557|2237703:2286330|2247423_2248482_-|WP_011100267.1|DBSCAN-SWA MALGFTNTGSVGGFMDGEVVLFSDLGIEAAPYSNVSRVGNISRYGYINNNFIFTQVHAYPSLYNVDENGKITSISHNATDLNLRANMIFYNNEILQVTLENDQIYISSFDAKGYFKDKVSISGVPPKNNAYIFDVRMNKYKEFIVLMRRPEGHGDDPYDFILIIDGDKKYIKAQSTLPAYSMISDSEDPHCCIETDNTYIYIIDTSMKKIYVYNNLAQHINTIETFDDYKELKFIRNSFLRDEVLLIDQKTKKLCTVTKNNIVYKYYADGFAQLKNGYLRSYDNTLYYYNSDFKQIWSTIVRDKINTIGGLITTTGDLIVINGSSVNLEDNIYKVRLMAKVPIKKDLNWWER >NC_004557|2237703:2286330|2248861_2249521_-|WP_011100268.1|DBSCAN-SWA MYGSINYGANQYGQELETKNKEIELYRPDLLAYLPPVLRQIKEFKVWDDVVGYELALLNWKREDLIKQCFIDTATWGLSLWENEYGIETDLNKSYEERREILKAKKRGHGTVTKKLIKETAEAFSGGEVEIIENPESYSFTVKFIGAYGIPKNMGDFKDMLETIKPAHLGYTFEFILITHGMLKNNLKVIHSDPKIEGITHKQMKEYRINEINENEVIK >NC_004557|2237703:2286330|2244782_2245454_-|WP_011100264.1|DBSCAN-SWA MCKAISVAGWFINNDFKPTNDKSGNLKLNKLLYFAQMISLIKRNKPLFEEELYAFENGVVVEKIRKDYCNNYDDFVKLAKEYDKNFNEDEIDILESTKELFLNVSPEDLSELTHQHKCWKHYYNKSIEEAKGYKYNKESARMPIGVIKNVYKDDLLLIKKMVDALEISYDIKEEKIVINNTEFYYNPEELCINEAIKQQLEEFPSYDEAYSLYIDEEQGLVVY >NC_004557|2237703:2286330|2261988_2263095_-|WP_011100283.1|capsid|DBSCAN-SWA MLKSIEMKQELEILKNEAKALLENKEAKLEDIKAKNEAIEILQAKIVMQEKLEQEEKEKVENKVPKEPTPVENDYTKEFINGLRTKFKNSMSEGSAGDGGYIVPQDISTAINELRQSKDALQNLITVEPVQTLSGSRVFKSRSQQTGFAEVAESGEIKENATPKFTQLPYSVKKYAGFFKVTNELLKDNDQAIRGALIKWIGDESRVTRNKLILAELNKKPKTAIAKIDDIKDVLNVQLDPAFRYTSSIVTNQDGYNYLDKLKDTDGDYLLQPSITAPSGKQLFGVPIIVISNKDLPSDTTDGTKAPVIIGDLKEAVVMFDRETLSVMASDVAGDAYLTDVTLFRAIEREEVKTRDGEAFVYGQLTIK >NC_004557|2237703:2286330|2276732_2276996_-|WP_035124355.1|DBSCAN-SWA MVKTYVKDGIEYTSSNHRMIYNPEFHFKHNKAWTLKDIAYLCGMWESTKKKDISLALGRTEGTCMSKVCGLKKRGEFNRYKRMFKEV >NC_004557|2237703:2286330|2267272_2267701_-|WP_011100288.1|DBSCAN-SWA MLYKLCRCGKVLDYTQKYCNDCSKKFEEQNRERYRHYKKNRKDKKEQRFYVSKEWTIIRDTVKQRDRGLCKLCLSKCNITYMDTVHHIEELKDCWDKRLDPGNLISLCESCHQKVHEEYKNNKLDIQKELEKLIERGVSRKF >NC_004557|2237703:2286330|2258335_2258488_-|WP_155274217.1|DBSCAN-SWA MKTGIEISLTDTDKFLGLINCIKSLLTDNRVPTEVKSDLTRKVFSIMEEK >NC_004557|2237703:2286330|2244029_2244266_+|WP_052042365.1|DBSCAN-SWA MYKLAIREYRILNKLTQKDLAYRIGISQNYLSEIEKGKYDIRVSFLLSISNALDVCPGYLIRCNMCKCRRNRKRRSKA >NC_004557|2237703:2286330|2279902_2281387_+|WP_052040968.1|DBSCAN-SWA MRAVAYARFSSDNQREESIEAQIMDIKKYALKNNITVLREYVDEAISGRTFERKSFKRMIEDAKKNMFDLILVHKVDRFARNRYDAAIYKSILKKHNIKIKYVMQPIDDSPEGNLMEGILESFAEYYSENLANEVMKGLKINAKKAQFNGGYPPLGYDIAEDKTYIINEREARIVREIFDLYLDGIGYKKIADILNNKGYKNKRGKPFVFNSIPTILKNDKYCGIYTYNKTSRKYKNGRRNLKKYNKDEDIIRVEDGIPKIISKEKFNTAQQEIKKRTKSRGKKIAVREYILSGLIKCECERKMSGYAQKRSKESNRYFYYRCTGCNNSIRAEKIETIATNFIKEQVFRDIDNLIIKIHKYIADQEAESPSELKYLKNELSNSNNQINNIVKMISNGVTSMHLAKKLEELETYIDGIQQRIGEINRMSVIPEDEIKNWLLELKNSFDNGKNIKKIISVFIKNIEITKEDINIDFFVKAPYKGANSLSVAPSAPY >NC_004557|2237703:2286330|2260747_2261179_-|WP_011100281.1|DBSCAN-SWA MKVKKMSGEHTVDFKFDGLEETMNELKEAEKRVPELSEKALKKGMNKTKKLSKEKTSYNDKGKKHIRNSYKVLPIEYERNGMNIKMTNTAPHFHLEEKGHRIVTPGGIEKGWYEGKHMVERSMEEMEEEFPKMLEKMVKKILR >NC_004557|2237703:2286330|2251429_2252491_-|WP_129052575.1|plate|DBSCAN-SWA MYKENADDILKRMKNYVENDVSTMEGTLLHDTLAPTAYELEDTREDLEEILNKVFIDSAFMNGFEEEVIKRAAEIGIYRKEGTKATGTSTFYGAKDTIIQKGTLIQTKNQLQFKTIENATIREIGEVDVKIEAIEIGSRYNVKANTITEMPIQLINITKVTNKSDINSGINIESIEDLYKRYKIKVTTPATSGNKYHYKLWALEVPGVGDAKVFPLWDGNGTVKVVIIDSNKHSASKELIDKVFKYIEEQRPIGATVTVVSAKEKSINIIAKITLANGFNIGSIQSELSNLLDNYLKEIGFEISYISIARIGNIILNTPGVLDYSNLKINNSTANISLEDEEIPILDNVELEV >NC_004557|2237703:2286330|2285553_2286330_-|WP_035124369.1|DBSCAN-SWA MEEYSVLVVDDEDEIRDAIGIYLKNEGIKVLKAKDGIEALMVLEEEEVHLIIMDIMMPRMDGIKATFKIRESKKIPIIMLSAKSEDMDKILGLNIGADDYVTKPFNPLELIARVKSQLRRYIRFGNYKENKNAIVVRGLTLNKDAKTAMVDGKEVSLTLTEYKILKLLMENKGRVFSIEHIYESVWQEPFYNGENTVAVHIRRIREKIEINPREPEYLKVVWGIGYKIEKQKINTNMLYSRYIFNSYSIYSMYRCFKS >NC_004557|2237703:2286330|2270376_2270517_-|WP_155274223.1|DBSCAN-SWA MEIILISMLTCIAIAAVSLKYSVKKVQAIKQKELDRPDYIRMEEWK >NC_004557|2237703:2286330|2282111_2283677_-|WP_011100305.1|DBSCAN-SWA MDKKFALSIVRKQYGEDIEKNRKFKNIKGFFDTSEKQEYTIDDDTWSDMDMNRVYEKLDRTYSTPGEAVLYYMLRNPLKDEEKLKKRDKLIEAFKNDSDLREKLLRIFLELSTDAKNTFLDMIESELVVNKFKYYLYTLLGKVVPIISIILIILFGEKYALMIMVSSSLNMFINYTEKNTVKSRGIIYLRDIIKASKKISNMKNDDIKEYIEKITINLKEIKDIDRSTLLIGIINMWHGFFEVISTLFLVEECAYYRISAILKEKKKYIMNIFYALGEIDAMLSISSYQKSLEENYVKPNFTKEVSLNIIEGVHPLLDNPVANSIFIKDKGIVLTGTNMSGKSTFLRMLGINILLSQTFYFALAKEYKASFFNIVSSISPNDDLSKGKSYYMAEAEGILRIVNAFKKELPVFCPIYEIFRGTNPIERIAMSAEILTYINNGRSIPIVATHDRELVDILKGGYEFYYFSEDVDSENGLSFDYKLKVGVSQTRNAIRLLDYIGYPKEVTEGAYKRAETIEGFI >NC_004557|2237703:2286330|2274323_2274503_-|WP_035124629.1|DBSCAN-SWA MNNIPECMYDYRYEFEKMQIIDNCCNCDCNICEGEEYYDIDGTILCEECIRDYKHTAEL >NC_004557|2237703:2286330|2271015_2271378_-|WP_011100292.1|DBSCAN-SWA MKICKVCGKPNSEKHHIVFRSQQKALEHYQYNIMYLCSEHHRGNNSPHKSRQIDIRYKLQFQKKLFKLFTKEEYTEKEIKEILDIGNVAVRKITKSINKHVDKYKQEDIIRSCMGGRLYG >NC_004557|2237703:2286330|2258512_2258944_-|WP_011100278.1|tail|DBSCAN-SWA MAGHYAEENVINGTWAELWINGELLANATAFQAKISLKTADVNMLGTLAKHTKVIGYEGKGTLKLNKVDSMFIKLMADNIKKGKQTKVTIISKLSDPDALGVERVCIKDATFEELTLADWEGKKNGEESVPFSFSDYEVLDSI >NC_004557|2237703:2286330|2260029_2260440_-|WP_129028538.1|DBSCAN-SWA MLDIKETINDLLKDTGIKTVDNGYKEGFKRPCFFVQILPIGGTDLLKGSILENSYMIEINYFSKSQKQIDNLKMAEILKKKVFPYINIKDRKLVPRNVRNEDIDGVLSFRFNLSWLDALPRKENTTKATKLKLEME |
72 | Clostridium_phage(91.43%) | protease,capsid,holin,plate,head,terminase,tail,portal | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2348328 : 2360631
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NC_004557|2348328:2360631|DBSCAN-SWA TCTAATCAGTTTCTATTCCATAATTATAACATAAAGCTTTAAGTCCACCCTTAAACCCACTTCCTATACAATTAATTTTCCATTCACCTTTATATCTATATATTTCAGCAACTACAATAGCAGTTTCCTCTGATAACTGGTCATCAAATTTATAATTTAAAATTTCCTTTTTTGTATATGTATCTATTAATCTAAAATATGCATTTGAAATATTTTCAAATGTTTCGCCTCTATCCTCAGCTTCATATATTGTAACTGTAAAAGCTAACTTTTCAATATTCAAAGGAATTTTAGTTAAGTCTAATTGTACAACTTCATCGTATCCATCTTTTAGATTAGAATTATGATCTGACTTTAAAACTATTCCTCCTTGTGCACTTTTTAAGTTACTATAAAATATGAAATCTTCTTCCTGTGTAGTATTATTTTTATCCACCATAAATATAGATGTATCTAATTCCAATTTTTTATTAGAAGTATTCTTTATCTCCCATTCTAAAGCTACCATAAGTTTTGATATATTTGGTGATTTTTGATTCAAGCTTATTTTTTGTCCTTTTAAAATTTTCATACTTGCATTTGTATTAACCAATGATTTTATTTTTATATTCTTAGTCTTTTCTTTCACTATATTATTATCCTGATTAAAAGGCTTTATATTAGTGCTATTTGAACTTATAGTATTTTTAATATCTAATAAACTAATAACTGGACTACTGTTTTTTACACTCCTATCTATAGTTAAATTTCCATATCGATTTTTTTGTGAAGAATTTATATTCCTATTTCCTCTATCTATAATTAATTTCACTTTTTTATCCCCTTTATAAAATAAAAGTTTAGTTTTTAATCAATAGCAATATCTAATCCAGATATTCTTTCGGATGGTTGAACTATAACAACTTCACCTGGCTTATTAAATTCTAGGCAATAGGATTCTCCTGAAGTTTGTCCAAAAAAACTTTTCCAATTTACATCTACTTTAAAATTAGGATCTGCCCCTGTCCAACAAATTACGGCATCAGGGTCAACTACACATGGAGTTTCTATAATGATAGGATTTCCATCTGTAGTTATAGTCACATATCCATTATTCCCTAAGCCTGTTATTTTTGAAGTAAACAATCCCTTTTGTGATATAACTCCAGAACCTATAAACCTAACTCCGTATTTACAATCCTCTGTAAAAGCTAATAGATTTTCTCCTTCCACACCTATGCTCTGTCCTTGTTCAAGTTTAATAACACTCACATGCTTAGATGCATTTGCCATATAATAGCTTCCAGTTCCTTTAACTTTCATTAATGGTATGTTTTCACCTGTAATTTTTCTTGATGCAAAATTTATTACAGAACCTAAAAGACTTTTATTTGTATTTGTATCTAATAAAACTTTTTCCGAAGTAAATTCTCCTTTATAACAAACCATAGCCCCTACTCTAGCAAAAAAATGACCGCTTCCTTCTGCTAAACAAGTTAATTCTTCATGCACTCTAAAATTAAACATAAATTTTTATCTCCTTTGTTTGTAAAAATATCGATAATTCATTATAATCATAGTATATCATAATTATTTTTATACTTTAATGTCATAATTTATCATATTATTAAAAATAATTTTTATATAATTTCAAGTTTTTTAAATTACTTATATTTTATAAACTATCATTTAAAAAGGATGTATGTTCTATGGTTAATACTAATAAAATTATAAACTCAAAAATCATCAAAACAAATAATGTTTTTGATGATATATTACTGCCATTAAACAAAAGAAATGGTTTTGCAAAATCTAACCCTCCTATTGTACCTATCTATTTTTATAGATATATAGGTATTACTAAAAATCAAGATGAATATTATAATAATTTATACAAATTAGATAAAAACTTATCTGATTTAAACAATCTATATTTAAAATTTATTTCAAATATCCCTTTTAATACGGATTATGAACTCATAAAAAAAACAGCAAATTTATGGGATAATTTAAATCCTTTTAATGATAAAAAGAAAATATTTTTATTTTCTTCTATAAAATTAGTAAAAGCTTTACCTTTATTCCCTAATGAATTAACACAAACTTCTATAAATGAAGGATTTAACTATATATTTAATTTATATATTAAAAACACCAAAAACATTAATGCTACTAAAATCAAAAATTTCACATTAAAAATACTTACTTGGATAAATAAATTTATCCCTAATTTATTTGAAAATGTCGATTATAACAAAAATGAAACTAATGGTATATTTAATCCAAAAGTTATTTATTATGGTAACATTAAAAATCATGAAATTTATTTTTTAATTTTTTTATCTAAAATTGGATGTGATGTTTTATACATAAATTCCTATTCAGATGGAGCTTTTTTAGAAATTGATAAAGAAAATACATATTCAAATATTATTGAGCTTAACGCCAAAAGTCCTTTGAAAGATTTTCCAAAACGAGAGGTTATTTCAAGAACTGAAACTATAGCTTTTAAAGCTTCAAATGAAATAGAAGAAATAATTTATGATGAAGAAAGTGGACTTTATAAACCTTGGCAATTTGAAAATTATAATGTTGAACCAGTAACTTTAAAAACCACTTATGATGAATTAAAAATTTTATGGAATGAAGAAGCAAGAATGAGAACTGGTTTCAAAATAGAAAATGGTACTATATATATACCAAATATTTTCGCAAAAATAAGTGGAGTAACTGAAGATATAAATATATATTGGAAGGATTTATCTTATTTTAAATTAACAGGTGCAAATCTTTTTATATCCAATTTACCTTTTACAGAAATTAATTATTCTAAACATGATTTATATAGATTATCTTATTTAATTAATAATGATGGATATTTAAATGAAGATTCTTTAATTAAAAGTGAATTTTATAAGTTTTCTTATTTGAAAACTTCATTACAAAAAACTATAATTAAAAAAATAAATGATATATTTGAATTGCCCTTATTTAAAAAAGAAGTTAATAAAGAATTTAAATTAAAAATATTAATGACAATATTAAATATAGATAAAAATTTAATAGAATTACTCCAAAGCTTTGATTATCCATTTAAAATACCTAAATTGATTATTTATTCCAAAAATGAAAATAGTTTTAGTGATGAGGATTCTATAATTATTACTTTTTTAAATCTGATAGGCTTTGATATATTGATAATTACTCCAACAGGTTATAACAACATAGAAAATCAAATTCCTAGAAAATATTATGATATTCATAAATTCGAAAAGATTAATTTAAATTTAGAACTTGTGGATTTAGATAATATTAATAAAAAGAAATTTTCATTTTTGTCAAAATTCTTTAAAATTTAACTAATAAATTATTCCTAAAAAATAAGGAGGACTTTTTCATGGTAAACAATGAAATTAATACTATTAATGATGTTCCAGATTTCAATTTAGAAAAAACAACTAATGAGGTTACATTTAAACTACAGAATTCACCAGAAATTTTATCTTTATCAAAAGAAATTGATTTAAAAAACGTTGATACGATAATGAATTTTGGTCAAAGTGCATCTCAGGAAATCTCAACTTTTGCAGATAAAATACTTCACTCAATAGAAACAACTTCTGTGGAGGATTCTGGAGAGCTTTTAACTCAACTTAATAAAATAATGAATAAATTTGATGTACAAGATTTTAAAGATAAAAAGCCTGGCTTCTTTGAAAAATTATTTAAAAAGACAAAAGATTCTATAGATGCTCTACTTAAAAAATATCATACAATGGGTGGAGAAGTAGATAAAGTTTATATTCAATTAAAAAAATATGAAGAAGAAATAAATACTTCAAACAATACTTTAAATGAAATGTTTAATAAAAATATGGACTACTATGAACAATTAGAAAAATATATTTTAGCAGGTAATTTGGTAGTTGATAATTTTAAAAATAAAGTGCTTCCTGAATTACAACAAAAAGCACAAAATTCTATGGAACAAATAGATCAAATAAATTTATCTAATGGTCAACAATTACTTGAAATGCTAGAACAAAGGGTTTATGATTTGGAACTTGCAAAAAATGTTTCATTACAAACAATGCCACAAATTAAATTAATACAAAGGGGAAATTATAACTTAGTTAGAAAAATTAACTCTGCTTTCATAATTACAATACCTATATTTAAACAATCTTTAACTCAAGCCATAACTTTAAAAAGACAAGCAGTTCAAGCTGAAGCCATGTCTGCTTTAGATGAAAAAACTAATGAATTGTTACTTAGAAATGCACAAAATACTGCTATGCAATCTAAAATTACAGCTAAACTTGCCTCTGATAGTTCTATAAAAATAGAAACTCTGGAAAAAACTTGGGAAACTATTATTCGAGGTATTGATGAAACAAAACAAATTCAAGATGATGTAAAACAAAAACGTATTGAAGGAACACAAAAATTGCATCAAATACAAAATCAGTTTAAAAGTAGAATAGAAAAATAAAAGAAAATGTAGTGCTCAAGATTACTTTTGCACTACATTTTCTTTTATAATGTTTAAAATTATTACTACATATATTTCTGTATCATATCATTTATTCCTGTAGCCATACTTCCTTCTCCAACAGCTGCAAACTTCCATTCACTTCCGTGTCTATATATTTCACCTACAAATAATCCTGTTTTATTTGAATATTCATCTGTTAAATTAAAACGAAGCATTTCTTTATTATCAGTCATATTTACTATTCTGATAAAAGCATTTTTTATCATACCAAAATGTTGATTACGTTTTACACAATTATATATGTTTACTATAAAAACTAATTTATGAATATTAGAAGGCACTCTAGTAAGATCAACCATTATTTGCTCATCATCACCATCGCCATCTCCTGTAAGGTTATCACCCATATGTCTAATGCTTCCATCTTTACTATTTAAATTACCAAAATAAATTATACTTTTTTTATCGCTAACTTTATCATTTTCATCTAACATAATTACAGAGGCATCACAATCTATATCAATTTGGTTTCCTTTACCGAACAAACTTCCTAATAATCCTTTACCATTTGATTGTTCTACGGCATCCCAACCTAATCCAACCATTACTTTAGACAATTTATTGTTTTCTTTTGTCAAACTTATCCTTTGTCCTTTTTTTAAACTAATAGACATTTTACCAGCTCCTATATCTTTATACTTACACTTGTAAACTGTAATTCTTACAAAGTGCTTCAAGACCTCCTGAAAATCCACTTCCTATTGCACTAAATTTCCAATCTCCATTGTTACGATATAATTCACAAAATACTAATGCTGTTTCAATTGAAAAATCTTCTGATAAATCATATCTTAAAATTTCTACATCAGTTTCTTCATTTACTATTCTCACAAAAGCATTTGAAACTTGCCCAAAATTTTGCCCTCTTTCTTCTGCTTCATGAATAGTTACTGTAATAGCTATTTTATCAATATCTGATGGAACTTTACTAAAATCTATGATGATTTGTTCATCATCACCATCTCCATCACCAGTTCTATTATCCCCTGTATGCACTACTGCTCCACTTGGATCTTGTAAATTGTTGTAAAAAACAAAATTTAAATCATCATTTACTTTTCCATTATTGCCAACAAGAAATGCTGATGCATCTAAGTCAAACTCATATCCTCCAGAATACTTATTTGTATCCCAACCAAGTCCTATAATAGCTTTATTTAATCCTGGATTATCTTTAGTTAGGTTAATTTTTTGACCTTTTTTTAAATTAATAGACATTTTAAATTCTCCTTTACTCCACTTTGCTACTAAATTATAACTTATCCTACATTTATTCCAAAATTATTACAAAGTGCAGCTAGTCCACCTTCAAATCCACTACCTATTGCATTAAACTTCCATTCTGCTCCATGACGATATAATTCTCCAACAACTACTGCCGTTTCAATACTATAATCTTCTGATAAATCATATTTTATTAATTCTTCATTTGTTTCTTCATTAAATATTCTAATAAATGCATTTGAAACCTGTCCAAAATTTTGAGCTCTTTCTTGTGCTTCATGAATAGTTACTGTAAAATCAATTTTATGTATGCTTTGTGGTATATTTTGTAAATCTATGCTAATTTGCTCATCATCGCCGTCGCCTTCACCAGTTTTATTATCTCCTAAGTGTACTATTGAACCTGAATCATCTTTTAAATTATTATAAAAAATAAAATCATTGTCTGACATCACTTTACCATTTTCTCCTAGTAAAAAAGCTGCTGCATCTAAATCAAAATCATTTCCACCATCATATTTATTAGTATCCCATCCAAGTCCTACTATAACTTTTTTCAATCCTGGATTTGTTTTAGTTAAATCTACCTTTTGACCTTTTTTTAAAGATATCGCCATAAAAATACCCTCTCTCTTTTTAAATAATCTTTGAAATACTTGTAACTTAACCAAATTTTAATTAAAATTTACAAGAAATTTCATATTTTATGTAGTTATATATTATTATATATATTGAATTGTCAAAAATCAATATATCTGATACTACATGCTACAATTAACAATAGCATTAACAAAATCCAATATTTCCTCTGAAACCTCAATTCCACTTTTCTTGCTCATAACAATATGTTTTTGTTGTATTTTATTTTTATATATTAAATCTTTTAATTCTCCATGTAAGGCTGCAATTGAATAATTTGCATTAATTAATATTGGATAATCTAAATAAGAATCCCCTGCTGAAATGACTCTTTGTTTCTTTTCTATATCTTTTATATAAGATATTGCATTCCATTTATTAACCTCGTTGGGAACAATGTATAATTTTTTTCCCTGCAGTGATATATGCCATTTATTTCGTATTATCCAATTTTCAAAGTTTTTTAATTCCTCTTTAGAATTATTGTTTATTTCATCAAAGTGAACACTAAAAAACAAATCATCTCTAAAAACCATTTTATTAATCCATGATAAATCTGAAAAACTTTTTAAGAACTGTTTTTCTACTATACTAGCTTTTGTACTATTTACAAGATTTGTTTTTATAACTTTACTCCATTCTATACTAATTTTTCCATTCTCTAATATGACTCCCCCATTACTAGTAACAGCATATTTAGGATTTATTTCTTCTTTTATCCCAAAAATGCGATTATATTGTTCTGTAGTTCTAGTAGTTACTGGAACAAATAATATTTTATTATTTATTAACTTTAATTTTTCAATAGTATTTTTAGTCATAAAACTAAGTTCTTTTCCCAAATATTTTTCCACTAAAACAATAGATTTAACGTAGGCATCATCAATTAATTTTTTAGTATAAATTAATGTTCTATCTAAATCTGATGCAAAAATCATTAAATATTCTCCTTCGTTTTTGATTTAATAACTCCACAGCAAGAATAACTCATATCTTTATAAACCTCTATAGGTATACATTTATCTTTAGCTAAAAGCTTAATATGCTTTAAATTAGGATTGTCTAAGTTATTTATTAATATTTTCCATGGAACTCGTCTAAGTAATACCCTGGTAGTTTCTCCTACACCTGGCTTAATCAAATTAATATCCGAAATTTTATACATATTTTTTATTTCATTAACTTCCTTAATTCCTGAATCATAAATTTTGAAATCATGTTCTTTTAATTCTTTTTCACTAATTGAAATTTTAGAAAAATTCTTAGTTATAGTATCAATAAAAAGATTAGATACATCTTTATCTTTCCATTCACTGTAAAACTTTCCACCATGAAAATCATTTTTTCCTATAATATCATCCCTTAAAACAGTTCTACTTATAAGCCCAGATACAGTTGAGTTAAGACAAGCTGATGGTATCAAAAAATCTTCTCTTGTACCATACAATGAAACACACTTTGATGGATCTGATAAAACTGCCAATTCACTATCTAACCTTATATTGTATTTTTCATAAAAATCTTCAACTGCTTTTTTTAGTACCCTATTGATAGCACCTTTACCTGTCCATCCATCCACAAATTGTATATATTCCTCATTGTGATTATTTAATATATATTTTATAGCATTCTCATCAATTCCTCTTCCCCTTATAATTGATATACTATAATGTTTAAATGTTTTTTTATATTTTTCTTCTAGGTATCTTTTTATTAAAATACCTATTGGTGTTCCCGCTCTTGCTAAAGAAACAATTACTATATTTTCACCTTTTTTTCTATATATAAGTTCTGAAACAATTGCTACAGCCCTGGCTACTTTTTCTTTTGTTTCCTCTAATGTTTTATAAAATACATCCATATACTCTTCTGTTGGAAAATATTCAATAGGTAACATTTCTGAATAATGTCCGCCATTTTGCATTATATATTCTCTCTCTTCATTGGATTTCTCTTCAATCATATTAGAAATATCTTTTAAAACAAATGTTACATCCTTTTGAGAATAACTTCCGATAATCCTTGGTTCCTTAATTTTTTCAAATCTCATATACTCACTTCCTTTATATATCTTATAATAATTTTGTATCACAACTATCTATTCTAAAAATATATAAATCTAACTTTATTCACCTCTAATTTTTGAAATAATTGAATTATTTCCTCTCTACTTTCTTTATCTATTTCTCTTTCTGTAACAAATAGTATTTCTTTATAAATGCCTTTCTCTACATTATATAAATAATTTATAATAGATTTATCAAAAGGATTTTTAAAAGTTTCTGCACATTTTATTCCATAATCTATTTCTTTTTTGGGATACACTGGACTTCTTGTTGTTGATTGATATTTAGCATTAGGAATTTTGGAAGCTAAAATCATTGGTAAATACATGAATTCTCCTGTACCTAAAACAAGTATATCCTTTTCAAATTCCCCCATATTCTTTACTTCTTTTTCTATGATTTTTTCAATAGTTTTAACATCTTCAGAATTTATTCCAAACCTTCCAGTATCCATTAAATACTGATACTGTTTTTCATCTCCACAATTTAATATTCTAGTAAATTTTTTATACCTATTGTCTAACCTAATATGAAAATCTAATACTTCTATTTCTTTAGAAAATTCATGAACTTTATTAGAATTTTGATATATATTTTTTTCTTCAAGTAAATAATTATTTTCACAACTAACTTGCCCCTGTATAAGAGAGATTGTTTCAATTTTTATATTTAATTGTTTCTTTAACTGTTCAAATTTGTTAATACACTCTTTACTTCTCCAATCTAAGATTGATATTACTACATATTCTTTTCCTGGATATTTTGAGTTAATAGATTTTATTAAATTTAATACTGTATTTCCTGTAGTAATTTCGTCATCTACCAAAACAATTCGTTCAAATTCATTTATAAAATTCTCATAGCCAATAGGGTAACAAAAATGCTCTACTGCATGTGAATGCTCCTCATCAAAAGAAAAAACAGATCTACATTCCTTTAACATATCTCTTGTTGTATGTATATATTTAATATTTTCACCTAAAAAGGTAGCAAATACGCTATTACCAAGAGCCGTAGCTGTTTCAGCAAATCCAATAAAAAGTGTTGATTTATTACACATAATAGGATTATCCTTTATATATTCCCATGCTTTTTGTATATATTTATCATCTTTTAACGCTTGGACAATTAATTTAGTATCTAAATTTCCATTGTCTATATTATTAACCATCTTATTTGCTAATAATGCACCTATTAATATAGATTTTTTAGGATTAACAGGTATATGTTTACCCAAAACTGTACTTACAAATAGAAAAACCCTTTTAGGATTTTTTCTTGCAGCCATTAAAAACAATTTGTTTAATAAAAAATCATATTCATTATGTATAACTTCCACAGTAACATTTATTTTATCTAAGATTTTAAATTTATTTTTCATTTTTCTCCTCAACTTTAACAATATTTATCCGATGACTACCCGCTCTAATACTTCTATCCTTTTGAAAATAGAAGTAAAGAGCGGCTACGTCCCTGGATAACGAGTTCTAACCTTTAGCGGGAGTAGAAACTCCCTATGAAGCCAAGAACTCTGTTTATAATAACTTTAAATAAGTAAATTCTTCTTTAATAACCCCATAGACGTATGCTCTTTTCATAATTTTATTTGCCCAATTAAAATGTGGTTTTGATTCATTCATTTTATTATTATATTTACTTTTATTTACCCCACCTTTATCGTTAATCTGCAAAATTTCCAAAGCATCTTCATATTCTTCCTGTGTAACAGTATATAATGCATTTACAAGCTTTATATGTGTAGGATGAATAACTGTTTTGCCAACTAATCCATTAGCTTTATCTAAAATAACTTCTCTTATAAGTCCATCCACATATTTTTCTAAATAATGCTTTCTTTTAATAATACCCTCTTGTCCTTCATTTTCTACAAAAGGAGTTTTTCTTAACATTGGTTTTAATACTCTATTTTGATCATTTGAATTAAAATATTCCCATACAGGTCCTGATATAACATACTCATTATCAGCTCTTGAAAATATATTTATAATATCATAAATACAATCACTAACTACACCTATATCATATATACTATTATCAATCCCTCTGCGTAAAGAATATAACCCTTGATAATCTGTAGCACCAATCCTAACATTTAAAATATATTTTTTATATTTATCCAGTATGGATTTTATTTTCAACAATTCATCTATTCTTGATTCCTTATAAATCACTCTTTTAGATTCTAAAATGGGCATTCCATATAATATATTAGTTCTCTTATCATTAAATTTATCCAATATATTTAAATATTCTTCTCCATTTGTAGAATCAAACTTAGGGAATACGAAACCTGTTATTGGAATAAAAAAACCTTTTTTATTTAATATTTTTTTAAATTGTTCTACGCTTCTTACCCTTAAAAAAATTAAAGGTAAATTATTAGGATTAATTTTTTTAGTTTCTATACAATATGTTATTTTTTTACATATATCTATAACATTAGTCTCTGCCTTCTCTACGTCTTTTTCCGAAATAGCATCTTCTAAACATATCACCATAGATTGCATACTATTATATTTATTCTTATAAATAATTTCAAAAATATCATCTCTTACACCTGGCATGTACAATGTAGCTCCTAATGCATACTGCATATATGTCTTATCTGTTTTAGATGTAATATTCTGTGGTCTTTTGTAAAAAATATCTTTTATTTCTTCTTCATTTAAGTTACTAAAAGCTCTCATATAATTCCTCCATTGCTACAAATTTATACTAGAATAGTCAATTTAACTTTCAAAAAAATCATAAATATAAAATTCTATATTTATGATTTAATGAAGGAAATCTCTTATATCCACTTTTTAACTGTAATCGTAAAAACTTATAAAAAAATATTTAATGCAACACCTATAATATATATAATAAACACTATATAAACTGCCAACATTACAAATGCATCTTTTTTAGTAACTCCCTTTTTCTTCAATAATAACAAAGCAGTAGTTAACATAGATACAATAAGAAATATTAAAACTCCCACATTTTCACTAAAATTTGCTGGTAAAGTTTTTCCTGCTATAATTATAGGAACACCTAAACATATACAAATATCAAATATATTTGAACCTACAGCATTAGCTACTGCCCCTTCTACATCTCCATCTTGTGATGATTTTATTGATAAGAGAGTATCTGGTATTGAAGTACATGCAGCTATTATTACTACCGATACAATATATTGTGGAATATTAAAAAATGTAGAAATAACAGTTGTAGACTGAATAATTCCATCAATACTTACCCAAATTAAAGCAATTGTTATTGCCATAATAATAAATATTTTCAAATAAGACATATCTTCTTCTGTATCTAATTCTGCAGCTATTTCTTCATAGGCTTTATTTTCTTCTGATATATTTTTTCTGTAATTTTTAGTTTCACCATAAAGAACAAAAATATATCCTATATAAATCAAAACTAGTACTATACCTGCAAACACAGTATATGATCCCAAATATGTAAAAAACACCAAAGCTCCTATAGCCATTGTATAAAAAATCATATCTCTATAAACAACCTTTTTATCAACTTTTATTAATAAATCTTTTCCCCTATAAGCAAAAATTGTCGCCATAGGAATAATAAGAATATTAAAAACACCTGAACCAGCTATAGTTGGAACACCTACATCTGTGAATCTTTTATAAACTATAACTGCAATCATCGCTGTAGAAAATTCCGGGAATGATGAACTAATTGCATCAAAAGTTGCACCTCTAACTGATGTTGGTATTTTTAATTTAACACCTAATATATGTAGAGCATCTCCAAGCTTATCTGAAGCCTTGCTTATAATCCACGACATAACTATTAACAT
Protein sequences of DBSCAN-SWA_7 >NC_004557|2348328:2360631|2355661_2356774_-|WP_035124381.1|protease|DBSCAN-SWA MRFEKIKEPRIIGSYSQKDVTFVLKDISNMIEEKSNEEREYIMQNGGHYSEMLPIEYFPTEEYMDVFYKTLEETKEKVARAVAIVSELIYRKKGENIVIVSLARAGTPIGILIKRYLEEKYKKTFKHYSISIIRGRGIDENAIKYILNNHNEEYIQFVDGWTGKGAINRVLKKAVEDFYEKYNIRLDSELAVLSDPSKCVSLYGTREDFLIPSACLNSTVSGLISRTVLRDDIIGKNDFHGGKFYSEWKDKDVSNLFIDTITKNFSKISISEKELKEHDFKIYDSGIKEVNEIKNMYKISDINLIKPGVGETTRVLLRRVPWKILINNLDNPNLKHIKLLAKDKCIPIEVYKDMSYSCCGVIKSKTKENI >NC_004557|2348328:2360631|2353500_2354079_-|WP_011100376.1|DBSCAN-SWA MSINLKKGQKINLTKDNPGLNKAIIGLGWDTNKYSGGYEFDLDASAFLVGNNGKVNDDLNFVFYNNLQDPSGAVVHTGDNRTGDGDGDDEQIIIDFSKVPSDIDKIAITVTIHEAEERGQNFGQVSNAFVRIVNEETDVEILRYDLSEDFSIETALVFCELYRNNGDWKFSAIGSGFSGGLEALCKNYSLQV >NC_004557|2348328:2360631|2359635_2360631_-|WP_115606241.1|DBSCAN-SWA MLIVMSWIISKASDKLGDALHILGVKLKIPTSVRGATFDAISSSFPEFSTAMIAVIVYKRFTDVGVPTIAGSGVFNILIIPMATIFAYRGKDLLIKVDKKVVYRDMIFYTMAIGALVFFTYLGSYTVFAGIVLVLIYIGYIFVLYGETKNYRKNISEENKAYEEIAAELDTEEDMSYLKIFIIMAITIALIWVSIDGIIQSTTVISTFFNIPQYIVSVVIIAACTSIPDTLLSIKSSQDGDVEGAVANAVGSNIFDICICLGVPIIIAGKTLPANFSENVGVLIFLIVSMLTTALLLLKKKGVTKKDAFVMLAVYIVFIIYIIGVALNIFL >NC_004557|2348328:2360631|2350015_2351665_+|WP_011100373.1|DBSCAN-SWA MVNTNKIINSKIIKTNNVFDDILLPLNKRNGFAKSNPPIVPIYFYRYIGITKNQDEYYNNLYKLDKNLSDLNNLYLKFISNIPFNTDYELIKKTANLWDNLNPFNDKKKIFLFSSIKLVKALPLFPNELTQTSINEGFNYIFNLYIKNTKNINATKIKNFTLKILTWINKFIPNLFENVDYNKNETNGIFNPKVIYYGNIKNHEIYFLIFLSKIGCDVLYINSYSDGAFLEIDKENTYSNIIELNAKSPLKDFPKREVISRTETIAFKASNEIEEIIYDEESGLYKPWQFENYNVEPVTLKTTYDELKILWNEEARMRTGFKIENGTIYIPNIFAKISGVTEDINIYWKDLSYFKLTGANLFISNLPFTEINYSKHDLYRLSYLINNDGYLNEDSLIKSEFYKFSYLKTSLQKTIIKKINDIFELPLFKKEVNKEFKLKILMTILNIDKNLIELLQSFDYPFKIPKLIIYSKNENSFSDEDSIIITFLNLIGFDILIITPTGYNNIENQIPRKYYDIHKFEKINLNLELVDLDNINKKKFSFLSKFFKI >NC_004557|2348328:2360631|2354120_2354702_-|WP_035109019.1|DBSCAN-SWA MAISLKKGQKVDLTKTNPGLKKVIVGLGWDTNKYDGGNDFDLDAAAFLLGENGKVMSDNDFIFYNNLKDDSGSIVHLGDNKTGEGDGDDEQISIDLQNIPQSIHKIDFTVTIHEAQERAQNFGQVSNAFIRIFNEETNEELIKYDLSEDYSIETAVVVGELYRHGAEWKFNAIGSGFEGGLAALCNNFGINVG >NC_004557|2348328:2360631|2351703_2352798_+|WP_011100374.1|DBSCAN-SWA MVNNEINTINDVPDFNLEKTTNEVTFKLQNSPEILSLSKEIDLKNVDTIMNFGQSASQEISTFADKILHSIETTSVEDSGELLTQLNKIMNKFDVQDFKDKKPGFFEKLFKKTKDSIDALLKKYHTMGGEVDKVYIQLKKYEEEINTSNNTLNEMFNKNMDYYEQLEKYILAGNLVVDNFKNKVLPELQQKAQNSMEQIDQINLSNGQQLLEMLEQRVYDLELAKNVSLQTMPQIKLIQRGNYNLVRKINSAFIITIPIFKQSLTQAITLKRQAVQAEAMSALDEKTNELLLRNAQNTAMQSKITAKLASDSSIKIETLEKTWETIIRGIDETKQIQDDVKQKRIEGTQKLHQIQNQFKSRIEK >NC_004557|2348328:2360631|2354846_2355662_-|WP_011100378.1|DBSCAN-SWA MIFASDLDRTLIYTKKLIDDAYVKSIVLVEKYLGKELSFMTKNTIEKLKLINNKILFVPVTTRTTEQYNRIFGIKEEINPKYAVTSNGGVILENGKISIEWSKVIKTNLVNSTKASIVEKQFLKSFSDLSWINKMVFRDDLFFSVHFDEINNNSKEELKNFENWIIRNKWHISLQGKKLYIVPNEVNKWNAISYIKDIEKKQRVISAGDSYLDYPILINANYSIAALHGELKDLIYKNKIQQKHIVMSKKSGIEVSEEILDFVNAIVNCSM >NC_004557|2348328:2360631|2356827_2358171_-|WP_035109014.1|DBSCAN-SWA MKNKFKILDKINVTVEVIHNEYDFLLNKLFLMAARKNPKRVFLFVSTVLGKHIPVNPKKSILIGALLANKMVNNIDNGNLDTKLIVQALKDDKYIQKAWEYIKDNPIMCNKSTLFIGFAETATALGNSVFATFLGENIKYIHTTRDMLKECRSVFSFDEEHSHAVEHFCYPIGYENFINEFERIVLVDDEITTGNTVLNLIKSINSKYPGKEYVVISILDWRSKECINKFEQLKKQLNIKIETISLIQGQVSCENNYLLEEKNIYQNSNKVHEFSKEIEVLDFHIRLDNRYKKFTRILNCGDEKQYQYLMDTGRFGINSEDVKTIEKIIEKEVKNMGEFEKDILVLGTGEFMYLPMILASKIPNAKYQSTTRSPVYPKKEIDYGIKCAETFKNPFDKSIINYLYNVEKGIYKEILFVTEREIDKESREEIIQLFQKLEVNKVRFIYF >NC_004557|2348328:2360631|2348328_2349138_-|WP_011100371.1|DBSCAN-SWA MKLIIDRGNRNINSSQKNRYGNLTIDRSVKNSSPVISLLDIKNTISSNSTNIKPFNQDNNIVKEKTKNIKIKSLVNTNASMKILKGQKISLNQKSPNISKLMVALEWEIKNTSNKKLELDTSIFMVDKNNTTQEEDFIFYSNLKSAQGGIVLKSDHNSNLKDGYDEVVQLDLTKIPLNIEKLAFTVTIYEAEDRGETFENISNAYFRLIDTYTKKEILNYKFDDQLSEETAIVVAEIYRYKGEWKINCIGSGFKGGLKALCYNYGIETD >NC_004557|2348328:2360631|2349173_2349833_-|WP_011100372.1|DBSCAN-SWA MFNFRVHEELTCLAEGSGHFFARVGAMVCYKGEFTSEKVLLDTNTNKSLLGSVINFASRKITGENIPLMKVKGTGSYYMANASKHVSVIKLEQGQSIGVEGENLLAFTEDCKYGVRFIGSGVISQKGLFTSKITGLGNNGYVTITTDGNPIIIETPCVVDPDAVICWTGADPNFKVDVNWKSFFGQTSGESYCLEFNKPGEVVIVQPSERISGLDIAID >NC_004557|2348328:2360631|2358325_2359498_-|WP_011100381.1|DBSCAN-SWA MRAFSNLNEEEIKDIFYKRPQNITSKTDKTYMQYALGATLYMPGVRDDIFEIIYKNKYNSMQSMVICLEDAISEKDVEKAETNVIDICKKITYCIETKKINPNNLPLIFLRVRSVEQFKKILNKKGFFIPITGFVFPKFDSTNGEEYLNILDKFNDKRTNILYGMPILESKRVIYKESRIDELLKIKSILDKYKKYILNVRIGATDYQGLYSLRRGIDNSIYDIGVVSDCIYDIINIFSRADNEYVISGPVWEYFNSNDQNRVLKPMLRKTPFVENEGQEGIIKRKHYLEKYVDGLIREVILDKANGLVGKTVIHPTHIKLVNALYTVTQEEYEDALEILQINDKGGVNKSKYNNKMNESKPHFNWANKIMKRAYVYGVIKEEFTYLKLL >NC_004557|2348328:2360631|2352863_2353475_-|WP_035110326.1|DBSCAN-SWA MSISLKKGQRISLTKENNKLSKVMVGLGWDAVEQSNGKGLLGSLFGKGNQIDIDCDASVIMLDENDKVSDKKSIIYFGNLNSKDGSIRHMGDNLTGDGDGDDEQIMVDLTRVPSNIHKLVFIVNIYNCVKRNQHFGMIKNAFIRIVNMTDNKEMLRFNLTDEYSNKTGLFVGEIYRHGSEWKFAAVGEGSMATGINDMIQKYM |
12 | Caulobacter_phage(37.5%) | protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage | ||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NC_004557.1|WP_155274208.1|2278842_2279016_-|hypothetical-protein |
2278842_2279016_-
Protein sequences of NC_004557.1|WP_155274208.1|2278842_2279016_-|hypothetical-protein>NC_004557.1|WP_155274208.1|2278842_2279016_-|hypothetical-protein MQAIVVRTRFNKKTEEIIEEKIIEKIEINEESFYEPVVEYLGSNILADKTMKLKTTL |
57 aa aa | NA |
HTH_XRE
HTH_XRE HTH domain information
|
NA | 2237703-2286330 |
yes
Self-targetings in the prophage
1. spacer 3.1|1573265|34|NC_004557|CRISPRCasFinder,CRT matches to NC_004557 position: 2280788-2280755, mismatch: 0 ttaatccagataaaatatattctcttacagcaat CRISPR spacer ttaatccagataaaatatattctcttacagcaat Protospacer ********************************** |