Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP043434 | Salmonella enterica subsp. enterica serovar Enteritidis strain PT1 plasmid pPT1-1, complete sequence | 0 crisprs | cas14j | 0 | 0 | 7 | 0 |
NZ_CP043433 | Salmonella enterica subsp. enterica serovar Enteritidis strain PT1 chromosome, complete genome | 2 crisprs | PD-DExK,WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,csa3,DEDDh,DinG | 0 | 12 | 8 | 0 |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 1905
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP043434|0:1905|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >NZ_CP043434|0:1905|1518_1905_-|WP_000751876.1|DBSCAN-SWA MKKTLITLIITTLSFSSLARQTDIVSSVEQPGYVQGGFTGPAPTQTSVSQAKKQWDDAWVVLEGNIIRQVGHELYEFRDSSGTVYVDIDNKYWMGQTASPADKVHIEGEVDRDWDGIKIDVKNIRVMK >NZ_CP043434|0:1905|1304_1466_-|WP_015059604.1|DBSCAN-SWA MRALSEKTQARRECPPKDEVRLVPLTDISYVRQIESWMITPVPAAQILRILPS |
2 | Stx_converting_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
11473 : 12031
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP043434|11473:12031|DBSCAN-SWA CATGAAAAAAATCGTTCTGTCCTCACTGCTGCTGTCCGCTGCCGGGCTGGCTACCGTACCGGTGGCACAGGCTGACACCCATTCCGTGTCGGTGGGATATGCCCAGAGCCGGATAGAGCATTTTAAGGATATCCGTGGGGTGAACCTGAAATACCGCTATGAGGCTCAGACGCCGCTGGGACTGATGGCGTCGTTCAGCTGGCAGTCAGGTAAGCGCGGAGAGTCAGGTGGCATTCCTGGCGGAATGAGCTGGCGCGATGATGTGAAGGCAACGTACTGGTCGCTGATGGCGGGTCCCGCTGTCCGTGTGAACGAGCTGGTATCTCTGTATGTCCTGGCCGGTGCCGGTACCGGCAGGGCTGAAGTGAAGGAGCATATCAGCATGCCGGGATACAACGGACGGTTCACGGGTTCGGAGCGCAGAACGGGGTTTGCCTGGGGAGCCGGCGTGCAGTTTAATCCGGTGGAAAATGTGGTCATCGACCTGGGCTATGAGGGAAGTAAAGTTGGCGCAGCGAAACTGAACGGCGTTAACGTTGGTGTCGGTTACCGGTTCTGA
Protein sequences of DBSCAN-SWA_2 >NZ_CP043434|11473:12031|11473_12031_+|WP_000725064.1|DBSCAN-SWA MKKIVLSSLLLSAAGLATVPVAQADTHSVSVGYAQSRIEHFKDIRGVNLKYRYEAQTPLGLMASFSWQSGKRGESGGIPGGMSWRDDVKATYWSLMAGPAVRVNELVSLYVLAGAGTGRAEVKEHISMPGYNGRFTGSERRTGFAWGAGVQFNPVENVVIDLGYEGSKVGAAKLNGVNVGVGYRF |
1 | Enterobacteria_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
29215 : 36123
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP043434|29215:36123|DBSCAN-SWA CATGTTACTGCTTGTAGCACCCGAACAGGAGCCGGTACAGTCCACTGCCCCGCTGTTCACCGAACGTTGCCCGGCGGGGTTTCCGTCGCCGGCCGCCGATTACACGGAGGAGGAACTGGATCTGAATGCCTACTGTATACGGCGTCCGGCTGCAACATTCTTCGTCAGGGCTATCGGGGATTCGATGAAAGAAATGGGTCTGCATTCCGGCGATTTAATGGTTGTCGACAAAGCTGAGAACCCCATGCAGGGGGATATTGTCATCGCGGAAACAGATGGCGAGTTCACCGTTAAGCGCCTGCAGTTGAAGCCCCGTATCGCCCTGCTGCCGATGAATCCGGCCTACCCCACGCTTTATCCGGAAGAACTGCAGATTTTCGGGGTGGTGACGGCTTTTATACACAAAACCCGGAGCACAGACTGATGTTCGCTTTGGCTGACGTCAACTCGTTCTATGCCAGTTGCGAAAAGGTCTTTCGCCCCGATTTGCGTGACAGGTCTGTGGTGGTCCTGAGTAACAACGATGGTTGTGTAATCGCGCGCAGCGCTGAAGCAAAAAAACTGGGCATAAAAATGGGGGTGCCGTGGTTTCAACTGAGGTCGGCGAAATTTCCGGAGCCGGTTATCGCGTTCTCCAGTAATTACGCCCTCTACGCATCAATGTCGAACAGGGTCATGGTTCACCTGGAAGAGCTGGCGCCACGCGTTGAGCAGTACTCCATTGACGAGATGTTTCTGGATATTCGCGGTATAGACAGTTGCATCGATTTCGAGGATTTCGGCCGACAGCTGCGTGAGCACGTTCGTTCCGGCACAGGGCTGACCATCGGCGTCGGAATGGGACCGACCAAAACGCTGGCTAAAAGCGCGCAATGGGCATCGAAAGAATGGTCACAGTTCGGAGGCGTGCTTGCCCTGACGTTACACAATCAGAAACGAACGGAGAAACTGCTGTCACTGCAACCGGTGGAGGAAATCTGGGGTGTGGGCCGCAGGATCTCAAAGAAGCTGAACACAATGGGGATCACCACAGCACTGCAGCTGGCCCGCGCGAATCCAACGTTCATCAGGAAAAACTTTAATGTGGTTCTGGAGAGAACGGTCCGGGAACTCAATGGAGAAAGCTGTATTTCCCTGGAGGAAGCACCGCCCCCCAAACAGCAGATAGTCTGCAGCAGAAGCTTTGGAGAACGAGTCACGACTTATGAAGCGATGCGGCAGGCAGTCTGTCAGCACGCTGAACGTGCCGCCGAAAAGCTGCGTGGGGAACGGCAGTTCTGCAGGCATATTGCCGTGTTTGTGAAAACATCGCCGTTCGCCGTAACTGAGCCCTACTACGGTAATCTGGCCAGTGAAAAACTGCTCATCCCCACGCAGGATACACGGGACATCATTGCCGCTGCCGTCAGAGCTCTGGACAGGATCTGGGTGGATGGTCACCGTTACGCAAAGGCAGGCTGTATGCTGAACGATTTCACGCAAACCGGGGTATCGCAGCTTAATTTATTTGATGAAGTACAGCCGCGGGAACGGAGCGAGCAGCTGATGCAAGTCCTGGATGGGATTAACCATCCGGGCAAGGGGAAAATATGGTTCGCCGGTCGGGGGATTGCCCCCGAGTGGCAGATGAAAAGGGAACTACTTTCGCCAGCTTATACGACACGTTGGGCAGATATTCCTGCAGCGAAATTGACTTGAAATTGTGGCGATTTCACCTTGAAATTCTCAATATAAATTACTTAATTACAATAAGTTAGATGTGAAATCGCCGTGGCGAAATTATTGACTCAAATTTCTCTCAAGGACCTCGTTGATAGCTTTGTCCAGTTCATCCTGAACCACTTTTGACATACGGCTAAACTCATAAGTCAGCGTACGCCCTTTCACACGCTTGCGCGCGAACTTGTCCTTTTCTTCAAAGGTCCAGAGTGCGGAAATTACTGACTTTTCCTTAGGAGGCGGAGCTATCAATGCCTGAGAGGCCTGAGAAATCAATTTCAGTATACTGGCTTTCTGTTCGTCGTCTGGTCGCTCGTAGTCACTCAAAATTGCATCATGCTGATCTGAGACAGACTGAATGAGCCCTTCAGACGTTAAACCCTTTTCACTGAGTTTTTCATTAACCTCTAACAGAATTTTATAGTCACTAAACGACAGCTCAGACTGAACCGGGAACAGAGAAATCAATTCTTGCGGTACTGCTGCCGCCTGCAGTGCCCTGGTGACCTTAGCCTGGGATAACCCCTCCAGTTCAGCAATTTCCTTTTGATTGAATCCAGACTCTTTCAGAGCCATCAATCTTAAGCCGATCTCCCGAAGGTTATGCTCTTTAGCTGTCTGGACATCTTTAGCCAGATGGCGAGCTTCATCAGCTGATAAACGTTCATTGGTAACCATTACATCAAGGCCTGTACGGATATAAATCGCTGAGGCTCGCCGTCTTGAACCATCCAGTATCTCAATCCTTTCACCTTGCTGAATGCCAATACAGGGGAAGAACTGCTGAAACTTAATCGTCTGGATAATAGATTTCAGGGACTCCCTGGTCAGAGCAAGTTGATCTCTTCCGTTGGTTTCCTGGTTTACGAATGTCTTACTTTCAACGTCCGATGCAGGTACCCGAACAAACCTGAACGTTATCTTCCTGCCGGTTTTTAGGGTAAATACCTGACTTCTTTGGGTGTCAGTCATCTCAACCATTGACGCCTGGGTATTTAACTGTCTGCCAATAGTTTTTCTTCTCTCATTCGACATCACATACCTCCGTTAGTACGAATGAACTCAATGCGATCAAATACGGCCTTCGCAAAGTCTTCCGCTGCTGATTTTGCGCTTTTAAGCGCCTCGGTGCTTCCATCATAGGTTGCTGGATTTGCGGAAATAACTGTGTCAAAAGTCTCCCCGCATCGCTCGAAGCCATCCAGCCTCGGCAGAACCATGTCGAGCATATCGGCACCGAATACTTCTTTGGCCTGACTATGGCAAATTTTGTGATCAGACTTGTTCAGGATTTTGGACATAAAACCTACATTTCCGATGAGATTGCAGGTGTGTCCATCCTGTTCGATCGAATCAATAAGCGCAGGAAGGCTGGCTACAAACTTCAGGGAAGAATGAAAATCAACCGTTGCCGGAGGAAGTGGAGTCAACATCAGGTCTGCCGCTCCGATGCAGTTTTTCAGGAACGCATCGAGGTGAGGACCACTATCAAGGAAGATAAAGTCGTAGTCATACCGGAGCTTGTCGATAATGTTCTCTTTAAGCACTGCATGAATATTTTGCCCAGGCAGATGTTCCTCACACAATCCCTTCCAGCCTTCAGCAAGGAATGCATCGTCAATGGAAGCGGGAATAACATCAACGCCCGGGATAATTGACGACACGATGAAATCGGAAAGTAATTCTTCACGAGACACGTTCTGGAGCATGGCCTGAGCAGCCGTATTTTCAACCAGGCCCACTGAATTCTCATGGCTCAGGAACATTGTCAGAGATGCCTGCGGATCGAAATCGATGGCCAGGATACGCAGGTCTTCGAACAGAAGCTGAGGATGAGCCCGAAAGGCATGGGATAATGAAGCGGTAGAAACTGTCTTGGAACCGCCACCTTTAAGGTTACATACAAATATAGTGAACGCTTTATCGAAGCGATCTCTGTACTTGGGCACTTTACGATGGAAATACAGATCGATGACGTTCTGAATCGTCATCGCGTACTTCATCGTATTGCCGGAAGGTTTCTTTTTAAAAATATAGCCACTGGCTTCCATTTCGGCCACAGCATACTCAACACTCCCTTTGCTTAGCTTAGGCAGCTTATATAGGGCGGCTTTGCTGTATTCCTGATAGAACTCGGTAAGCTTCAATTCCTCTTTTTGCTGGCGGATACTTTCGCTCAAGGATGTCAGAAGTTTACCTGCACGTGTGGCGACTTTCCGTAACTGCTCAATATTTTCCATTTATCACCTCATGATGTTTTTTGTCATTTTTAACAAAAAGAATCATGTTGCAAGAAATAAATCATAATATTTTGATTTTGCTCTCACAGGGGGCACCGAAGATTAAATGGAGAAGAGGGCACTACATCCACAATCATTTTGTTGACCCACAGCGCAAACCGTGAGTCTGTTCAGATCTTGCATCTGAGCAGGCGAAGGGTTGGACAAGCCCAAAGGGCGCGTCAGCCTGTCACATTGGGGTATAGTTCAGCAGTACCTGTTTCTGCCAGGACATGGATATCTCCGGGCAAGCAAGCTGATGATTAAAAAAGAAAGAGGGGACAACCCCTGAAGGAATTGCAATTCCGTTCCTCGCAATGTCAGCATTTTTTGCCTGTGATACTATGACCGGGTCATTTTTCAGAGCATGGACTGATGAAAAAGAAAAACACTACGCCGACTCCCCACGACGCGACATTCCGGCAGTTTCTGACACAACCTGACATTGCCCGGGATTTTATGGAGCTGCATCTGCCGGCAGAGCTGCGTGCCATCTGCGATCTCAGTACACTGAAACTGGAATCAGGCTCGTTTGTTGAGGATGACCTCCGCCAGTATTTCAGCGACGTCCTCTACAGCCTGAAAACCACAGCCGGCGACGGATATATTCATGTCCTGGTTGAACACCAGTCAACACCTGACAAACATATGGCTTTCCGCCTGATACGCTATGCGGTGGCCGCCATGCAGCGCCACCTGGAGGCAGGGCATAAAAAATTGCCACTGGTGATACCAGTGCTGTTCTATACGGGGAAACGCAGCCCGTATCCGTACTCCACCCGCTGGCTGGACGAATTTGACGATACGGCGCTGGCAGGCAAACTCTACAGCAGCGCTTTCCCGCTGGTAGACGTTACGGTCATTCCGGATGATGAAATCGCCGGCCACCGTAGCATGGCCGCCCTGACTTTATTGCAGAAACATATTCATCAGCGGGACCTGGCAGAACTGGTTGACCGGCTGGCGCCCATTTTGCTGGCCGGATATCTGTCTTCATCGCAGGTAATATCGCTGGTACACTATATAGTGCAGGCAGGCGAAACATCCGACGCCGAAGCCTTTGTACGCGAACTGGCACAGCGTGTGCCGCAACACGGAGACGCACTTATGACCATCGCACAACAGCTCGAACAGAAGGGCATCGAGAAGGGGATTCAGCTCGGTGAACAACGTGGTATTGAAAAAGGCCGTTCAGAAGGGGAGCGCGAAGCGACTCTGAAAATAGCCCGTACCATGCTCCAGAACGGCATTGACCGCAATACCGTCATGAAAATGACCGGTCTGACTGAAGACGACCTGGCGCAGATCCGCCACTAATCCCCCTTTTCTGGCACTCCCTGCCCGGCCATCAGCGAATACTGGTGGGGCAGGGAGGCATCAGGCTCTGCCAGCCAGCCACAGCGATCAAACAGCTTCATTACCCCCGCTACGTTCCCTGTCAAACGTGCTGTCAGCGAATTTATCTGTCGGCCGCTGTCGTTAAAACCGACATCGAGTTTGGCCTGTGAGAGATAAATGCCGTTCACGCCTGCACTGAGCGCCACAGCATTCCGGCCGGACCATTCCCGATCGCTCAGCAGACGGTAGTTCCAGTGCATATCTTTCGCTGTTTCCAGAGCATACGTCAGCCGGAACAGGTCGTTCTGCTCGGACAACTCACCGGTGCATGAACGCCAGAGGTAATTCAGCTTGCTGGCCAGCTTGGCGCTGACGCCCATCTGACGTCCCTGTTTCAAAAGCCACTCGATATCGGGCGTCACATCACGAGAAAAACGCCGTTGCTTCAGCGCAGTCGCCAGCCAGCGGGTGAGAAAGAGGTTTTCCTGTGCCGGCGAGAGAACGCCACCATCCTGCCTGGCCAGCGCAAGCGCCACCAGGGCACACCACGCCAGATGACCTGATTTTTCTGTCAATGTCACGGCTTCTCCCGCAAGTCAGAAAAATGATGTAATTCATCAGGCATAATCAGCACACGTCAGCCCGTCTTACGGAGAAACATTCGCCGCATCTCACGCGTCAGCTGCTTCTTCTCCCATCCGTAAGGGAAAATAATGCTGTAGTCCCTCCTCTCAGGGAGGATATCAGGGCCCGCGACGATTCGCGCCACCTCCTTATTCAGGCGTTCTCCTCGCCAGCCATAAGCCGCCAGTCGCTTACGGATACGTGCAACGGGAAAACGTTTATCGGAGTGACGGGGGAAGAGTAACGTCCCGTCCTCATTTCGTTTGACACCAAACATATCCAGAGACTGTTCGATCACCACTGTTCGAACGGGGAAATGCTCACGCATAGTTTTCAACATATTCTCCATTTCAGACTCCTGAATTAATCCCTGCAACTGTCTGACCAGCCAGAGAGCAGCGTTATGCTGGCTCACTCCCTGCGTCATCATTCATTTCACACTGGAGTTCATGCCAGTATCCGGCAATATTGTTTACCACCTGTGCTGTCATAACGGCGTCAGTGACGGCGGAATGAGCCACCCCGATCCAGCTCAGGCCAGCCTGACTGACAGCACCGGAAAGCGATATGGTTCCGTAACGATTTGTACTGCAGCTGCTGCCGGATCGATGCCGACTGTGGGTTTCAGGCGGGTCTCAAATATCCTCTCACCGCGGGCATTCACCAGACCGATTTCCAGCGCCTGCGCGCCGGCATCAAGTCCGGTCGTTTCGGTATCGAGGAAGACGGGGCCCAGTGCTAACCAGGTATGTGCCAGCATGGCGAACCTGCCACGCTCACTCTTCATCCGGGCCTGCAGGCCCAGACGTGTGCTGGCCAGCCGCTGTTTTTCTGTCTGAGGGCGTTTCGCCCGCATCGGGACGCAGTCAGCCAGCCGCCATACGCCGTATTTCCCGCCGTAAGGACTTTTGTACATTCTGACCGGTTCCGCACCAGGGGCTGGCTTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP043434|29215:36123|29637_30912_+|WP_000457542.1|DBSCAN-SWA MFALADVNSFYASCEKVFRPDLRDRSVVVLSNNDGCVIARSAEAKKLGIKMGVPWFQLRSAKFPEPVIAFSSNYALYASMSNRVMVHLEELAPRVEQYSIDEMFLDIRGIDSCIDFEDFGRQLREHVRSGTGLTIGVGMGPTKTLAKSAQWASKEWSQFGGVLALTLHNQKRTEKLLSLQPVEEIWGVGRRISKKLNTMGITTALQLARANPTFIRKNFNVVLERTVRELNGESCISLEEAPPPKQQIVCSRSFGERVTTYEAMRQAVCQHAERAAEKLRGERQFCRHIAVFVKTSPFAVTEPYYGNLASEKLLIPTQDTRDIIAAAVRALDRIWVDGHRYAKAGCMLNDFTQTGVSQLNLFDEVQPRERSEQLMQVLDGINHPGKGKIWFAGRGIAPEWQMKRELLSPAYTTRWADIPAAKLT >NZ_CP043434|29215:36123|29215_29638_+|WP_000925628.1|DBSCAN-SWA MLLLVAPEQEPVQSTAPLFTERCPAGFPSPAADYTEEELDLNAYCIRRPAATFFVRAIGDSMKEMGLHSGDLMVVDKAENPMQGDIVIAETDGEFTVKRLQLKPRIALLPMNPAYPTLYPEELQIFGVVTAFIHKTRSTD >NZ_CP043434|29215:36123|34525_35131_-|WP_000176303.1|DBSCAN-SWA MTLTEKSGHLAWCALVALALARQDGGVLSPAQENLFLTRWLATALKQRRFSRDVTPDIEWLLKQGRQMGVSAKLASKLNYLWRSCTGELSEQNDLFRLTYALETAKDMHWNYRLLSDREWSGRNAVALSAGVNGIYLSQAKLDVGFNDSGRQINSLTARLTGNVAGVMKLFDRCGWLAEPDASLPHQYSLMAGQGVPEKGD >NZ_CP043434|29215:36123|35187_35523_-|WP_001527010.1|DBSCAN-SWA MENMLKTMREHFPVRTVVIEQSLDMFGVKRNEDGTLLFPRHSDKRFPVARIRKRLAAYGWRGERLNKEVARIVAGPDILPERRDYSIIFPYGWEKKQLTREMRRMFLRKTG >NZ_CP043434|29215:36123|30993_31971_-|WP_077681951.1|DBSCAN-SWA MMSNERRKTIGRQLNTQASMVEMTDTQRSQVFTLKTGRKITFRFVRVPASDVESKTFVNQETNGRDQLALTRESLKSIIQTIKFQQFFPCIGIQQGERIEILDGSRRRASAIYIRTGLDVMVTNERLSADEARHLAKDVQTAKEHNLREIGLRLMALKESGFNQKEIAELEGLSQAKVTRALQAAAVPQELISLFPVQSELSFSDYKILLEVNEKLSEKGLTSEGLIQSVSDQHDAILSDYERPDDEQKASILKLISQASQALIAPPPKEKSVISALWTFEEKDKFARKRVKGRTLTYEFSRMSKVVQDELDKAINEVLERNLSQ >NZ_CP043434|29215:36123|35706_36123_-|WP_001541564.1|DBSCAN-SWA MKPAPGAEPVRMYKSPYGGKYGVWRLADCVPMRAKRPQTEKQRLASTRLGLQARMKSERGRFAMLAHTWLALGPVFLDTETTGLDAGAQALEIGLVNARGERIFETRLKPTVGIDPAAAAVQIVTEPYRFPVLSVRLA >NZ_CP043434|29215:36123|33587_34529_+|WP_000728919.1|transposase|DBSCAN-SWA MKKKNTTPTPHDATFRQFLTQPDIARDFMELHLPAELRAICDLSTLKLESGSFVEDDLRQYFSDVLYSLKTTAGDGYIHVLVEHQSTPDKHMAFRLIRYAVAAMQRHLEAGHKKLPLVIPVLFYTGKRSPYPYSTRWLDEFDDTALAGKLYSSAFPLVDVTVIPDDEIAGHRSMAALTLLQKHIHQRDLAELVDRLAPILLAGYLSSSQVISLVHYIVQAGETSDAEAFVRELAQRVPQHGDALMTIAQQLEQKGIEKGIQLGEQRGIEKGRSEGEREATLKIARTMLQNGIDRNTVMKMTGLTEDDLAQIRH >NZ_CP043434|29215:36123|31967_33173_-|WP_000427676.1|DBSCAN-SWA MENIEQLRKVATRAGKLLTSLSESIRQQKEELKLTEFYQEYSKAALYKLPKLSKGSVEYAVAEMEASGYIFKKKPSGNTMKYAMTIQNVIDLYFHRKVPKYRDRFDKAFTIFVCNLKGGGSKTVSTASLSHAFRAHPQLLFEDLRILAIDFDPQASLTMFLSHENSVGLVENTAAQAMLQNVSREELLSDFIVSSIIPGVDVIPASIDDAFLAEGWKGLCEEHLPGQNIHAVLKENIIDKLRYDYDFIFLDSGPHLDAFLKNCIGAADLMLTPLPPATVDFHSSLKFVASLPALIDSIEQDGHTCNLIGNVGFMSKILNKSDHKICHSQAKEVFGADMLDMVLPRLDGFERCGETFDTVISANPATYDGSTEALKSAKSAAEDFAKAVFDRIEFIRTNGGM |
8 | Escherichia_phage(33.33%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
39671 : 40232
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP043434|39671:40232|DBSCAN-SWA TATGGCACTTTACGGCTATGCCCGCGTTTCAACCAGCGATCAGGATCTTACTCTGCAGACACAAATTCTCCGCGCTGCTGGTTGTGAAATTATTCGTGCAGAAAAAGCCAGCGGAAGCGGCCGGACCGGAAGGAGCGAGTTGCAGCTGCTGCTGGAGTTCCTGCGCCCTGGTGATACGTTGATGGTGACACGCGTGGATCGCCTGGCCCGCAGCATTAAGGACCTGCAGGACATTGTGTATGCCCTGAATCAACAGGGCGTAACGCTCAGGGCAACAGAACAGCCAGTGGACACGCGTTCAGCTGCAGGCAAAGCCTTCCTCGATATGCTGGGTGTTTTCGCTGAGTTTGAAACCAATCTGAGACGTGAACGCCAGATGGAAGGCATTGCCGCGGCGAAAGCCAGGGGCGTATACCGGGGAAGGAAACCGTCCATAGATCCTGCTGAGGTATATCGTCTGTATACCATTGAGAAGATGGGAGCCACAGCCATCGCCCGCCAGCTCGGGATTGGGAGGGCGTCAGTCTACCGGGCGCTGGAAAATTATGAGCAGCCGGCGTAG
Protein sequences of DBSCAN-SWA_4 >NZ_CP043434|39671:40232|39671_40232_+|WP_001240330.1|DBSCAN-SWA MALYGYARVSTSDQDLTLQTQILRAAGCEIIRAEKASGSGRTGRSELQLLLEFLRPGDTLMVTRVDRLARSIKDLQDIVYALNQQGVTLRATEQPVDTRSAAGKAFLDMLGVFAEFETNLRRERQMEGIAAAKARGVYRGRKPSIDPAEVYRLYTIEKMGATAIARQLGIGRASVYRALENYEQPA |
1 | Ralstonia_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
45738 : 45903
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP043434|45738:45903|DBSCAN-SWA CATGGGCATTAACGCGCTGGCTCACGCCACTTTACTGAAGAAACTGAATAACGGTGACTATGACGGCGCAGCGAATGAATTCCTGAAATGGGACCACGCCAGCGGTCAGGTTGTTCCCGGCCTGACCCGACGCCGGAGCGCTGAACGTTGTTTATTCCTGAGTTAA
Protein sequences of DBSCAN-SWA_5 >NZ_CP043434|45738:45903|45738_45903_+|WP_001576629.1|DBSCAN-SWA MGINALAHATLLKKLNNGDYDGAANEFLKWDHASGQVVPGLTRRRSAERCLFLS |
1 | Salmonella_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
50845 : 51678
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP043434|50845:51678|DBSCAN-SWA GCTACTCCGGCGTATTCTGTCTGAGCACGTATGCTTTAAGCGTTTCTATCGTGGCACCACCGGCGCTACAGGCAAAGTAAGAACGAGACCACAGCAGCCCGGTTTTGCTCTGCATCCGCAAGTGGGTATTCTGCTGACGGAGAAGCCGCGACGATACCGACTTTAAATTATTTACCATCACGCTGACCCCCAGTTTTAGCGGATACGCTATCAGCAGATGGACGTGATCTTGTTCACCATCCATCTCGATAATTTCGCACTCCAACTTTGCCGCAGCCGAACCAAAAGCATCACGTAACTGAGCGATAATCTGCCCGTCAAACAGCTTGCAGCGGTATTTTGTCGTAAAGATTAAATGCACAACCAGCTTACTTACACTATGCCGTTTACGAAGAAAGCCTTCCAGTGATTCGTGGTGATTACTCAATTGAATTTTTCACTTAACATGTTAAAATAAATACAATATATTAATGAGCGCTGAATATGTTAAGAGCCACGAAAGTATGCATATATCCGACACCGGAACAGGCGGAGCACCTTAACGCCCAGTTCGGTGCAGTCCGTTTTGTGTACAGCAAATCTCTGCATATCAAGAAACACGCTTATCAACGACACGGCGTAAGTTTAACCCCGCGTAAAGACATTAAACCGCTTCTGGCTGTAGCGAAAAAATTCCGTAAATTCCGTAAATACGCATGGCTAAAGGAATATGACTCTATTGCGTTGCAACAGGCGGTGATCAATCTCGATGTTGCCTTTTCCAACTGCTTCAATCCGAAGCTAAAAGCCCGCTTCCCTATGTTCAAGCGCAAACACGGCAAGCTGTTGGGGTAA
Protein sequences of DBSCAN-SWA_6 >NZ_CP043434|50845:51678|51327_51678_+|WP_001541541.1|DBSCAN-SWA MLRATKVCIYPTPEQAEHLNAQFGAVRFVYSKSLHIKKHAYQRHGVSLTPRKDIKPLLAVAKKFRKFRKYAWLKEYDSIALQQAVINLDVAFSNCFNPKLKARFPMFKRKHGKLLG >NZ_CP043434|50845:51678|50845_51271_-|WP_000064919.1|transposase|DBSCAN-SWA MSNHHESLEGFLRKRHSVSKLVVHLIFTTKYRCKLFDGQIIAQLRDAFGSAAAKLECEIIEMDGEQDHVHLLIAYPLKLGVSVMVNNLKSVSSRLLRQQNTHLRMQSKTGLLWSRSYFACSAGGATIETLKAYVLRQNTPE |
2 | Helicobacter_phage(50.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
54738 : 55521
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP043434|54738:55521|DBSCAN-SWA TCTATGAACCTCCTTTGAGCATAGCCACTGCATCAGCACCCGGCATCTGAAACTGCACCCGGTGTCGTGCGGCAACATCAAGCGCAAACACTTTCGTGTACACTTCCGTCGAGCTCACCGATTTGTGTCCCATCAGCGCCTGCAGCACCTTCAGCGGTATGCCGGCGTACAGCATGTGCATCGCATAGGAGTGGCGGAATGTATGGGGTGTCACCGGGACTGAGAACGTCACGTCGTCAGCCGCTGCGGCCTCAACCGCCTCCCCAATCCAGGTACGGACCGTCCGGTCGGTGATCTCCCAGAGGCGCGCCTTTTCCGTTCTGCCGGTACGTCTGTTACGACGCTCCAGCGGGATTTTCAGCGTGGCCACCATCATCTGCAGTTCGCTGACATACTGGTTATCAGAAAGCGGCACCAGGCGGTGGGGCTGGCTGCCGGACGGCATCCGTCCTGCCGTTCTGGCCGCCTTTTCCGCCCGCTGTTTCAGAGTGGCCAGCTGCACAAACGGATAAGGTGGTGCCAGCGAAAAGTCTCCCCGTGTCAGGGCCAGCGCCTCGTTAATCCGTGCGCCGGTATTCCATAGAGTGGCCAGCAGCATCCTGCGATGCAGATCAGGGACGTAGTGGAGCAGGGCACTTACCTCCGGTGCCAGCAGATATTTCGGGTAGTCGTCATGCTGCATTGCCATCTGGCGCAGTGCGAGGGCCGCCGGGTAATCAATGGCCACCGGCAGCAGGGCGGACGCTGCCTGCGTACAGACAGCAGGTAACGGTGGCTGACTCAT
Protein sequences of DBSCAN-SWA_7 >NZ_CP043434|54738:55521|54738_55521_-|WP_000082169.1|integrase|DBSCAN-SWA MSQPPLPAVCTQAASALLPVAIDYPAALALRQMAMQHDDYPKYLLAPEVSALLHYVPDLHRRMLLATLWNTGARINEALALTRGDFSLAPPYPFVQLATLKQRAEKAARTAGRMPSGSQPHRLVPLSDNQYVSELQMMVATLKIPLERRNRRTGRTEKARLWEITDRTVRTWIGEAVEAAAADDVTFSVPVTPHTFRHSYAMHMLYAGIPLKVLQALMGHKSVSSTEVYTKVFALDVAARHRVQFQMPGADAVAMLKGGS |
1 | Macacine_betaherpesvirus(100.0%) | integrase | attL 51013:51024|attR 57721:57732 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043433_1 | 940894-941533 | TypeI-E |
I-E
Consensus repeat of NZ_CP043433_1
|
10 spacers
spacers of NZ_CP043433_1
>1.1|940921|34|NZ_CP043433|PILER-CR,CRT TGAGGATTTCGCCGCGCTGTTGGCCTCCAGATTC >1.2|940982|34|NZ_CP043433|PILER-CR,CRT TGCCGTGCCTGTCCAGGACAAATTGCCGATTATT >1.3|941043|34|NZ_CP043433|PILER-CR,CRT TGCTGCCGGTCTGTGCTGTTGTCGTCAATAATCA >1.4|941104|34|NZ_CP043433|PILER-CR,CRT CGGCATGTGACAGTCTGATTTTTATAGCGCATGA >1.5|941165|34|NZ_CP043433|PILER-CR,CRT CCAATTTAGGGGCCGGAACTCCGGGAAAGGCAGC >1.6|941226|35|NZ_CP043433|PILER-CR,CRT CGCAAACACCAAGCTCTTCCGCCGCGCGTTCCTGA >1.7|941288|34|NZ_CP043433|PILER-CR,CRT CGTTGCTCTCATTAAAGGGGTTTCCATGTTTGAT >1.8|941349|34|NZ_CP043433|PILER-CR,CRT CGCGGTCGCAGCCTGGCCTGTTGCCGTAGAATCG >1.9|941410|34|NZ_CP043433|PILER-CR,CRT CGAGTTACTGATGCAGACTGCGGATCTTAATCGG >1.10|940923|32|NZ_CP043433|CRISPRCasFinder AGGATTTCGCCGCGCTGTTGGCCTCCAGATTC >1.11|940984|32|NZ_CP043433|CRISPRCasFinder CCGTGCCTGTCCAGGACAAATTGCCGATTATT >1.12|941045|32|NZ_CP043433|CRISPRCasFinder CTGCCGGTCTGTGCTGTTGTCGTCAATAATCA >1.13|941106|32|NZ_CP043433|CRISPRCasFinder GCATGTGACAGTCTGATTTTTATAGCGCATGA >1.14|941167|32|NZ_CP043433|CRISPRCasFinder AATTTAGGGGCCGGAACTCCGGGAAAGGCAGC >1.15|941228|33|NZ_CP043433|CRISPRCasFinder CAAACACCAAGCTCTTCCGCCGCGCGTTCCTGA >1.16|941290|32|NZ_CP043433|CRISPRCasFinder TTGCTCTCATTAAAGGGGTTTCCATGTTTGAT >1.17|941351|32|NZ_CP043433|CRISPRCasFinder CGGTCGCAGCCTGGCCTGTTGCCGTAGAATCG >1.18|941412|32|NZ_CP043433|CRISPRCasFinder AGTTACTGATGCAGACTGCGGATCTTAATCGG >1.19|941473|32|NZ_CP043433|CRISPRCasFinder TGCGCCAACGACTGGAATTTTTGCGTGTAGCC >1.20|941471|34|NZ_CP043433|CRT CGTGCGCCAACGACTGGAATTTTTGCGTGTAGCC |
cas3,cas8e,cse2gr11 |
CRISPR arrays and Neighbor proteins around NZ_CP043433_1
The CRISPR arrays of NZ_CP043433_1 >merge|NZ_CP043433|1|940894-941533|PILER-CR,CRISPRCasFinder,CRT GTGTTCCCCGCGCCAGCGGGGATAAACTGAGGATTTCGCCGCGCTGTTGGCCTCCAGATTCGTGTTCCCCGCGCCAGCGGGGATAAACTGCCGTGCCTGTCCAGGACAAATTGCCGATTATTGTGTTCCCCGCGCCAGCGGGGATAAACTGCTGCCGGTCTGTGCTGTTGTCGTCAATAATCAGTGTTCCCCGCGCCAGCGGGGATAAACCGGCATGTGACAGTCTGATTTTTATAGCGCATGAGTGTTCCCCGCGCCAGCGGGGATAAACCCAATTTAGGGGCCGGAACTCCGGGAAAGGCAGCGTGTTCCCCGCGCCAGCGGGGATAAACCGCAAACACCAAGCTCTTCCGCCGCGCGTTCCTGAGTGTTCCCCGCGCCAGCGGGGATAAACCGTTGCTCTCATTAAAGGGGTTTCCATGTTTGATGTGTTCCCCGCGCCAGCGGGGATAAACCGCGGTCGCAGCCTGGCCTGTTGCCGTAGAATCGGTGTTCCCCTCGCCAGCGGGGATAAACCGAGTTACTGATGCAGACTGCGGATCTTAATCGGGTGTTCCCCGCGCCAGCGGGGATAAACCGTGCGCCAACGACTGGAATTTTTGCGTGTAGCCGTGTTCCCCGCGCCAACAAGGATAGCCGT >NZ_CP043433|1|1|940894-941470|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAAC TGAGGATTTCGCCGCGCTGTTGGCCTCCAGATTC GTGTTCCCCGCGCCAGCGGGGATAAAC TGCCGTGCCTGTCCAGGACAAATTGCCGATTATT GTGTTCCCCGCGCCAGCGGGGATAAAC TGCTGCCGGTCTGTGCTGTTGTCGTCAATAATCA GTGTTCCCCGCGCCAGCGGGGATAAAC CGGCATGTGACAGTCTGATTTTTATAGCGCATGA GTGTTCCCCGCGCCAGCGGGGATAAAC CCAATTTAGGGGCCGGAACTCCGGGAAAGGCAGC GTGTTCCCCGCGCCAGCGGGGATAAAC CGCAAACACCAAGCTCTTCCGCCGCGCGTTCCTGA GTGTTCCCCGCGCCAGCGGGGATAAAC CGTTGCTCTCATTAAAGGGGTTTCCATGTTTGAT GTGTTCCCCGCGCCAGCGGGGATAAAC CGCGGTCGCAGCCTGGCCTGTTGCCGTAGAATCG GTGTTCCCCTCGCCAGCGGGGATAAAC CGAGTTACTGATGCAGACTGCGGATCTTAATCGG GTGTTCCCCGCGCCAGCGGGGATAAAC >NZ_CP043433|1|1|940894-941533|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACTG AGGATTTCGCCGCGCTGTTGGCCTCCAGATTC GTGTTCCCCGCGCCAGCGGGGATAAACTG CCGTGCCTGTCCAGGACAAATTGCCGATTATT GTGTTCCCCGCGCCAGCGGGGATAAACTG CTGCCGGTCTGTGCTGTTGTCGTCAATAATCA GTGTTCCCCGCGCCAGCGGGGATAAACCG GCATGTGACAGTCTGATTTTTATAGCGCATGA GTGTTCCCCGCGCCAGCGGGGATAAACCC AATTTAGGGGCCGGAACTCCGGGAAAGGCAGC GTGTTCCCCGCGCCAGCGGGGATAAACCG CAAACACCAAGCTCTTCCGCCGCGCGTTCCTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCTCTCATTAAAGGGGTTTCCATGTTTGAT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGGTCGCAGCCTGGCCTGTTGCCGTAGAATCG GTGTTCCCCTCGCCAGCGGGGATAAACCG AGTTACTGATGCAGACTGCGGATCTTAATCGG GTGTTCCCCGCGCCAGCGGGGATAAACCG TGCGCCAACGACTGGAATTTTTGCGTGTAGCC GTGTTCCCCGCGCCAACAAGGATAGCCGT >NZ_CP043433|1|1|940894-941531|CRT GTGTTCCCCGCGCCAGCGGGGATAAAC TGAGGATTTCGCCGCGCTGTTGGCCTCCAGATTC GTGTTCCCCGCGCCAGCGGGGATAAAC TGCCGTGCCTGTCCAGGACAAATTGCCGATTATT GTGTTCCCCGCGCCAGCGGGGATAAAC TGCTGCCGGTCTGTGCTGTTGTCGTCAATAATCA GTGTTCCCCGCGCCAGCGGGGATAAAC CGGCATGTGACAGTCTGATTTTTATAGCGCATGA GTGTTCCCCGCGCCAGCGGGGATAAAC CCAATTTAGGGGCCGGAACTCCGGGAAAGGCAGC GTGTTCCCCGCGCCAGCGGGGATAAAC CGCAAACACCAAGCTCTTCCGCCGCGCGTTCCTGA GTGTTCCCCGCGCCAGCGGGGATAAAC CGTTGCTCTCATTAAAGGGGTTTCCATGTTTGAT GTGTTCCCCGCGCCAGCGGGGATAAAC CGCGGTCGCAGCCTGGCCTGTTGCCGTAGAATCG GTGTTCCCCTCGCCAGCGGGGATAAAC CGAGTTACTGATGCAGACTGCGGATCTTAATCGG GTGTTCCCCGCGCCAGCGGGGATAAAC CGTGCGCCAACGACTGGAATTTTTGCGTGTAGCC GTGTTCCCCGCGCCAACAAGGATAGCC
>NZ_CP043433.1|WP_001199961.1|939925_940597_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWDKLSDREVSLFSILAKTKESDKWGAASSEDLLAVINRQGYTARHVVITGGEPCIHDLMPLTDLLEKSGFSCQIETSGTHEVRCTPNTWVTVSPKVNMRGGYDVLSQALERANEIKHPVGRVRDIEALDELLATLSDDKPRVIALQPISQKEDATRLCIETCIARNWRLSMQTHKYLNIA >NZ_CP043433.1|WP_000036734.1|938491_939790_+|phosphopyruvate-hydratase MSKIVKVIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVGAVNGPIAQAILGKDAKDQAGIDKIMIDLDGTENKSNFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKGKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >NZ_CP043433.1|WP_000210863.1|936771_938409_+|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQLAVDIGREHALFMHLTLVPYLAAAGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISMKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIYEEANPAGEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVTVNIKLIDSQDVETRGVEILKDLDAILIPGGFGYRGVEGKIATARYARENNIPYLGICLGMQVALIEFARNVAGMDNANSTEFVPDCKYPVVALITEWRDEDGNVEVRSEKSDLGGTMRLGAQQCQLSDDSLVRQLYGASTIVERHRHRYEVNNMLLKQIEAAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAANEHQKRQAK >NZ_CP043433.1|WP_000210451.1|935743_936544_+|nucleoside-triphosphate-pyrophosphohydrolase MTTNHQIDRLLTLMQRLRDPENGCPWDKEQTFASIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFGELSADNSEEALVRWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCSNVGFDWTTLGPVVDKVYEEIDEVMFEARQAVVDQAKLEEEMGDLLFATVNMARHLGTKAELALQKANDKFERRFREVERIVAARGLEMTGVDLETMEEVWQEVKRQEIDL >NZ_CP043433.1|WP_000842512.1|934548_935136_-|fimbrial-protein MKSSHFCKLAVTASLVMGIVSGAQAAGSNTAKVTFLGNIVDSPCSVTLDTEDQTVNMGSSIGNGTLSNGKTTINNARTFHIDLEGCTWATEKNMNVVFTTGSGTTAATGATDNLALMKTDGTGAISNVSLAIGDAGKNNIKLGDTYTQAIADLDGDTILDEKQSLNFTAWLVGAATGTVGTGEFSSAANVTISYL >NZ_CP043433.1|WP_000981797.1|931769_934469_-|fimbrial-biogenesis-outer-membrane-usher-protein MMNNTWKSVLCPIACGVGMLLSLSPYSASGKDIEFNTDFLDVKNRDNVNIAQFSRKGFILPGVYLLQIKINGQTLPQEFPVNWVIPEHDPQGSEVCAEPELVTQLGIKPELAEKLVWITHGERQCLAPDSLKGMDFQADLGHSTLLVNLPQAYMEYSDVDWDPPARWDNGIPGIILDYNINNQLRHDQESGSEEQSISGNGTLGANLGAWRLRADWQASYDHRDDDENTSTLHDQSWSRYYAYRALPTLGAKLTLGESYLQSDVFDSFNYIGASVVSDDQMLPPKLRGYAPEIVGIARSNAKVKVSWQGRVLYETQVPAGPFRIQDLNQSVSGTLHVTVEEQNGQTQEFDVNTASVPFLTRPGMVRYKMALGRPQDWDHHPITGTFASAEASWGVTNGWSLYGGAIGESNYQAVALGSGKDLGVVGAVAVDITHSIAHMPQDDGFDGETLQGNSYRISYSRDFDEIDSRLTFAGYRFSEKNFMSMSDYLDAKTYHHLNAGHEKERYTVTYNQNFREQGMSAYFSYSRSTFWDSPDQSNYNLSLSWYFDLGSIKNLSASLNGYRSEYNGDKDDGVYISLSVPWGNDSISYNGTFNGSQHRNQLGYSGHSQNGDNWQLHVGQDEQGAQADGYYSHQGALTDIDLSADYEEGSYRSLGMSLRGGMTLTTQGGALHRGSLAGSTRLLVDTDGIADVPVSGNGSPTSTNIFGKAVIADVGSYSRSLARIDLNKLPEKAEATKSVVQITLTEGAIGYRHFDVVSGEKMMAVFRLADGDFPPFGAEVKNERQQQLGLVADDGNAWLAGVKAGETLKVFWDGAAQCEASLPSTFTPELLANALLLPCKMLEGQPPTAPQKSSPLPAQPLIQEHTQTDGQPAAPVATTTQTPPIPLADNHAVNRKDME >NZ_CP043433.1|WP_001044459.1|930983_931757_-|fimbria/pilus-periplasmic-chaperone MNKTNHFKRQALIASVLLAAPLVSHSAIVPDRTRVIFNGNENSITVTLKNGNATLPYLAQAWLEDDKFAKDTRYFTALPPLQRIEPKSDGQVKVQPLPAAASLPQDRESLFYFNVREIPPKSDKPNTLQLALQTRIKFFYRPVAVARQVDKTHPWQTKLTLTYQGDGVIFDNPTPFYLVISNAGSKENETASGFKNLLIAPREKVTSPIKGASLGSSPVVGYVDDYGGHRLLVFTCSGNTCKVNEEKTRDAEKKANK >NZ_CP043433.1|WP_000178270.1|930457_930964_-|fimbrial-protein MTMLTRWKMLVLLCGGFVTGTEAAGTKTVQLELHLVVTQPPPCTVGGASVEFGDVLTTKVGDASQTKPVGYSLNCDGRASDYLKLQIQGTTTTISGEQVLQTSVQGLGIRIQQAGNKQLVPVGITDWLNFTLSGSNGPELEAVPVKEPTTQLAGGDFNASATLVVDYQ >NZ_CP043433.1|WP_000832393.1|929972_930443_-|fimbrial-protein MKRVLILTLLITQFACADNLTFHGKLINPPACTINNGEMLEVSFGSVIIDNIDGVNYLTEIPWTLTCDSSFRDDALTFTLSYLGTATPYSAKALTTSVPELGIELQQNGTVFPPGTSLTINESSLPTLKAVPVKQPGKEPAEGDFEAFATLQVDYQ >NZ_CP043433.1|WP_001079646.1|929439_929976_-|fimbrial-protein MNRIFQTAGHLIGGVMLWAVCNTLPAATPNVHYSGKLVAGACNLVVDNDTMATVDFHTIGSDNFDASGQTTPVPFTLSLQDCKTALANGVLVTFQGVEDSTLPGLLALEPSSEASGFAIGVETAAQQPVSINATVGTAFVLKEGITTINLQARLQKYAGEEVMPGEFSGSATVSFEYQ >NZ_CP043433.1|WP_001207998.1|941631_942429_+|MBL-fold-metallo-hydrolase MALRIRVLLENHKGAGADKSLKARPGLSLLVEDESTSILFDTGPDGSFMQNALAMGIDLSDVSAVVLSHGHYDHCGGVPWLPDNSRIICHPDIARERYAAMTFLGITRKIKKLSCEVDYSRYRMMYTRDPLPIGKNFIWSGEIPVVAPEAYGIFGGHDAEPDSILDEGVLIYQSTKGLVIITGCGHRGIANIVRHCQNITGIKRIYALVGGFHLRCASPFTLWRVRRFLQEQKPEKLCGCHCTGAWGRLWLPEITAPATGDVLRF >NZ_CP043433.1|WP_000108313.1|942516_942879_-|6-carboxytetrahydropterin-synthase-QueD MSTTLYKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIMDFADLKAAFKPTYDRLDHYYLNDIPGLSNPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCVYRGE >NZ_CP043433.1|WP_000210932.1|943302_945102_+|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTPAPLTGLLPLNPEQLARLQAATTDLTPEQLAWVSGYFWGVLNPRSGVVAVTPVPERKMPGVTLISASQTGNARRVAEALRDDLLAANLNVTLVNAGDYKFKQIASEKLLVIVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDTSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDVLKSRAPVAAPSQSVATGAVNDIHTSPYTKDAPLIATLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVDGKTLPLAEALEWHFELTVNTANIVENYATLTRSESLLPLVGDKAQLQHYAATTPIVDMVRFSPAQLDAEALIGLLRPLTPRLYSIASAQAEVESEVHVTVGVVRYDIEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPQTPVIMIGPGTGIAPFRAFMQQRAADGAEGKNWLFFGNPHFTEDFLYQVEWQRYVKEGVLSRIDLAWSRDQKEKIYVQDKLREQGAELWCWINDGAHIYVCGDARRMAADVEKALLEVIAEFGGMDLESADEYLSELRVERRYQRDVY >NZ_CP043433.1|WP_001290670.1|945101_946814_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLSDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTTQWQAIDKFAADNTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVAITDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGLETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDNNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHQGEFRITANQNLIIASVPESQKAKIETLARDHGLMNAVSAQRENSMACVSFPTCPLAMAEAERFLPSFTDKVEAILEKHGIPDEHIVMRVTGCPNGCGRAMLAEIGLVGKAPGRYNLHLGGNRIGTRIPRMYQENITEPDILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDFWE >NZ_CP043433.1|WP_071524068.1|946770_946989_-|hypothetical-protein MVSANASTRSTFGSSFRAFRSSLDISSSRYTLPDGASLIRPTRAVGRIRHLCRHPAIQVIPRNHARDRAPGE >NZ_CP043433.1|WP_000039870.1|946915_947650_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRVLALAETNAQLETLTAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDELTDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >NZ_CP043433.1|WP_001145541.1|947737_948691_-|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDSLTEDEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHTYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT >NZ_CP043433.1|WP_000029737.1|949134_951798_+|CRISPR-associated-helicase/endonuclease-Cas3 MSIYHYWGKSRRGETDGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYMLWNSWLSECPELFPPSSLSVRKSKRVMALWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPYSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERCVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDSVVTPYASGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >NZ_CP043433.1|WP_000368579.1|951809_953366_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPIWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIRGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFTNAKDARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRCYFDDHVFTNPYESSDLERIMKARKKYFTSSAEKQSAKAAKAKKQEAAE >NZ_CP043433.1|WP_000117945.1|953362_953917_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MSVVTKDDKATLRQWHDELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDTDNEQD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP043433_2 | 957685-958201 | TypeI-E |
I-E
Consensus repeat of NZ_CP043433_2
|
8 spacers
spacers of NZ_CP043433_2
>2.1|957714|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT ACGTTGGCTGAAAACGGTTTTTCGGTCCGCCT >2.2|957775|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT AATATATGGCGCTCACGCGCATGAGCATTCTC >2.3|957836|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT CAGGGCAAATTCATCCGCCGCTGACCACTGGT >2.4|957897|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT GACGCTTACATCTCACCGAGAGATTTTGAGGC >2.5|957958|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT GGAACTGGTTTAGCTATCGCTGCCGGGGCTAT >2.6|958019|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT GATTGCTCAGATTGGGAATTTGACCAGCGGCC >2.7|958080|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT TCACGAGGGCCCCCTTATTGGGTCGGGCAGGT >2.8|958141|32|NZ_CP043433|CRISPRCasFinder,CRT GTTGGGTTGCATAGATGACACGCTTATAAATA |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around NZ_CP043433_2
The CRISPR arrays of NZ_CP043433_2 >merge|NZ_CP043433|2|957685-958201|PILER-CR,CRISPRCasFinder,CRT GTGTTCCCCGCGCCAGCGGGGATAAACCGACGTTGGCTGAAAACGGTTTTTCGGTCCGCCTGTGTTCCCCGCGCCAGCGGGGATAAACCGAATATATGGCGCTCACGCGCATGAGCATTCTCGTGTTCCCCGCGCCAGCGGGGATAAACCGCAGGGCAAATTCATCCGCCGCTGACCACTGGTGTGTTCCCCGCGCCAGCGGGGATAAACCGGACGCTTACATCTCACCGAGAGATTTTGAGGCGTGTTCCCCGCGCCAGCGGGGATAAACCGGGAACTGGTTTAGCTATCGCTGCCGGGGCTATGTGTTCCCCGCGCCAGCGGGGATAAACCGGATTGCTCAGATTGGGAATTTGACCAGCGGCCGTGTTCCCCGCGCCAGCGGGGATAAACCGTCACGAGGGCCCCCTTATTGGGTCGGGCAGGTGTGTTCCCCGCGCCAGCGGGGATAAACCGGTTGGGTTGCATAGATGACACGCTTATAAATAGTGTTCCCCGCGTCAGCGGGGATAAACAC >NZ_CP043433|2|2|957685-958140|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGTTGGCTGAAAACGGTTTTTCGGTCCGCCT GTGTTCCCCGCGCCAGCGGGGATAAACCG AATATATGGCGCTCACGCGCATGAGCATTCTC GTGTTCCCCGCGCCAGCGGGGATAAACCG CAGGGCAAATTCATCCGCCGCTGACCACTGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GACGCTTACATCTCACCGAGAGATTTTGAGGC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGAACTGGTTTAGCTATCGCTGCCGGGGCTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GATTGCTCAGATTGGGAATTTGACCAGCGGCC GTGTTCCCCGCGCCAGCGGGGATAAACCG TCACGAGGGCCCCCTTATTGGGTCGGGCAGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG >NZ_CP043433|2|2|957685-958201|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGTTGGCTGAAAACGGTTTTTCGGTCCGCCT GTGTTCCCCGCGCCAGCGGGGATAAACCG AATATATGGCGCTCACGCGCATGAGCATTCTC GTGTTCCCCGCGCCAGCGGGGATAAACCG CAGGGCAAATTCATCCGCCGCTGACCACTGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GACGCTTACATCTCACCGAGAGATTTTGAGGC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGAACTGGTTTAGCTATCGCTGCCGGGGCTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GATTGCTCAGATTGGGAATTTGACCAGCGGCC GTGTTCCCCGCGCCAGCGGGGATAAACCG TCACGAGGGCCCCCTTATTGGGTCGGGCAGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTGGGTTGCATAGATGACACGCTTATAAATA GTGTTCCCCGCGTCAGCGGGGATAAACAC >NZ_CP043433|2|2|957685-958201|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGTTGGCTGAAAACGGTTTTTCGGTCCGCCT GTGTTCCCCGCGCCAGCGGGGATAAACCG AATATATGGCGCTCACGCGCATGAGCATTCTC GTGTTCCCCGCGCCAGCGGGGATAAACCG CAGGGCAAATTCATCCGCCGCTGACCACTGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GACGCTTACATCTCACCGAGAGATTTTGAGGC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGAACTGGTTTAGCTATCGCTGCCGGGGCTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GATTGCTCAGATTGGGAATTTGACCAGCGGCC GTGTTCCCCGCGCCAGCGGGGATAAACCG TCACGAGGGCCCCCTTATTGGGTCGGGCAGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTGGGTTGCATAGATGACACGCTTATAAATA GTGTTCCCCGCGTCAGCGGGGATAAACAC
>NZ_CP043433.1|WP_001518648.1|957294_957588_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAVWLLEVRAGVYVGDTSKRIREMIWQQITQLGGVGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVENQ >NZ_CP043433.1|WP_000144830.1|956374_957295_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLNPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLASTVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALDDDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRATYALLAKQYGVKWHGRNYDPKDWEKGDVVNRCISAATSCLYGISEAAILAAGYAPAIGFIHSGKPLSFVYDIADIIKFESVVPKAFEIAARHPAEPDKEVRLACRDIFRSSKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPESLGDSGHRGHG >NZ_CP043433.1|WP_000281483.1|955727_956378_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTSELSPAQLLHLVERGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQEQPAASTIFDVQTRPFAPMLSAGQTLRFNLRANPTICKNGKRHDLLMEAKRQRKTQGDSQDIWSYQQQAALEWLARQGEQNGFTLREASVDAYRQQQIRREKSRQMIQFSSVDYTGVLVINEPALFLQRLAQGYGKSRAFGCGMMMIKPGDDA >NZ_CP043433.1|WP_000085115.1|954999_955746_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLVFQLHGPMASWGVDAPGEVRHSHELPSRSALLGLLAAALGIRRDEEERLNTFNRHYQFLLCASGNPRWARDYHTVQMPKEVRKARYFSRREELQDPELLSALISRRDYYTDAWWMIAVSATPDAPYTLAQLQAALQHPVFPLYLGRKSHPLALPLAPQLLEGNAADVLREAYRWYQDQFNALKLTLPGLQNECWWEGEHDGLTANKILRRRDMPLSRQQWLFGERSVNQGPWLRKEDACISQE >NZ_CP043433.1|WP_000206417.1|953930_954989_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPATEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIIEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICIDKDLLVKNLNGNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNVAVKRITALHENMNEVYAQETAFKNFNVMNQQGSMKDVLDFICA >NZ_CP043433.1|WP_000117945.1|953362_953917_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MSVVTKDDKATLRQWHDELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDTDNEQD >NZ_CP043433.1|WP_000368579.1|951809_953366_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPIWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIRGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFTNAKDARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRCYFDDHVFTNPYESSDLERIMKARKKYFTSSAEKQSAKAAKAKKQEAAE >NZ_CP043433.1|WP_000029737.1|949134_951798_+|CRISPR-associated-helicase/endonuclease-Cas3 MSIYHYWGKSRRGETDGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYMLWNSWLSECPELFPPSSLSVRKSKRVMALWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPYSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERCVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDSVVTPYASGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >NZ_CP043433.1|WP_001145541.1|947737_948691_-|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDSLTEDEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHTYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT >NZ_CP043433.1|WP_000039870.1|946915_947650_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRVLALAETNAQLETLTAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDELTDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >NZ_CP043433.1|WP_000490481.1|958215_959262_-|aminopeptidase MFSATRRFAVILALGVGFILPAQAASPGPGEIANTQARHIATFFPGRMTGSPAEMLSADYLRQQFTQMGYQSDIRTFNSRFIYTTKDNRKNWHNVTGSTVIAAHEGRVPQQIIIMAHLDTYAPQSDADVDANLGGLTLQGMDDNAAGLGVMLELAARLKDIPTHYGIRFIATSGEEEGKLGAENLLKRMSDAEKKNTLLVINLDNLIVGDKLYFNSGKNTPEAVRTLTRDRALAIARRYGIAANTNPGRNPSYPKGTGCCNDAEVFDKAGISVLSVEATNWNLGKKDGYQQRVKNASFPNGNSWHDVRLDNQQHIDKALPGRIERRSRDVVRIMLPLVKELAKAEKTS >NZ_CP043433.1|WP_000372384.1|959512_960421_+|sulfate-adenylyltransferase-subunit-CysD MDQKRLTHLRQLEAESIHIIREVAAEFANPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYAFRDRTANAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWRNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMVDDDRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESHAQTLPEIIEEMLVSTTSERQGRMIDRDQAGSMELKKRQGYF >NZ_CP043433.1|WP_001092251.1|960430_961870_+|sulfate-adenylyltransferase-subunit-CysN MNTILAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTLQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCDLAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYCEETFARIREDYLTFAEQLPGDLDIRFVPLSALEGDNVAAQSANMRWYSGPTLLEVLETVDIQRAVDRQPMRFPVQYVNRPNLDFRGYAGTLASGSVKVGERIKVLPSGVESSVARIVTFDGDKEEACAGEAITLVLNDDIDISRGDLLLAANETLAPARHAAIDVVWMAEQPLAPGQSYDVKLAGKKTRARIEAIRYQIDINNLTQRDVESLPLNGIGLVEMTFDEPLALDIYQQNPVTGGLIFIDRLSNVTVGAGMVRELDERGATPPVEYSAFELELNALVRRHFPHWDARDLLGDKHGAA >NZ_CP043433.1|WP_001173663.1|961856_962462_+|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVAAREQLHGHRGVVLWFTGLSGSGKSTVAGALEEALHQRGVSTYLLDGDNVRHGLCRDLGFSDADRQENIRRVGEVASLMADAGLIVLTAFISPHRAERQLVKERVGHDRFIEIYVNTPLAICEQRDPKGLYKKARAGELRNFTGIDAIYEAPDSPQVHLNGEQLVTNLVSQLLDLLRRRDIIRS >NZ_CP043433.1|WP_001118109.1|962479_962836_+|DUF3561-family-protein MPGMVKVTGFNMRNSHNITFTRSDAFMVDDDATSAFPGAVVGFVSWLLALGIPFLLYGPNTLFFFLYTWPFFLALMPVSVIIGIALHLLVKGKILFSIMFTLLAVGALFGALFIWLLG >NZ_CP043433.1|WP_000517480.1|963026_963338_+|cell-division-protein-FtsB MGKLTLLLLALLVWLQYSLWFGKNGIHDYSRVNDDVVAQQATNAKLKARNDQLFAEIDDLNGGQEAIEERARNELSMTKPGETFYRLVPDASKRAATAGQTHR >NZ_CP043433.1|WP_000741653.1|963356_964067_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MAATLLDVCAVVPAAGFGRRMQTECPKQYLSIGNKTILEHSVHALLAHPRVTRVVIAISPGDHRFAQLPLANHPQITVVDGGNERADSVLAGLQAVAKAQWVLVHDAARPCLHQDDLARLLTISENSRVGGILASPVRDTMKRGEPGKNAIAHTVERADLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPALVEGRADNIKVTRPEDLALAEFYLTRTIHQEKA >NZ_CP043433.1|WP_001219253.1|964066_964546_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRISYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLMKAAK >NZ_CP043433.1|WP_000134246.1|964542_965592_+|tRNA-pseudouridine(13)-synthase-TruD MTEFDNLTWLHGKPQGSGLLKANPEDFVVVEDLGFTPDGEGEHILLRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDFSAFQLEGCKVLEYARHKRKLRLGALKGNAFTLVLREISDRRDVETRLQAIRDGGVPNYFGAQRFGIGGSNLQGALRWAQSNAPVRDRNKRSFWLSAARSALFNQIVHQRLKKPDFNQVVDGDALQLAGRGSWFVATSEELPELQRRVDEKELMITASLPGSGEWGTQRAALAFEQDAIAQETVLQSLLLREKVEASRRAMLLYPQQLSWNWWDDVTVELRFWLPAGSFATSVVRELINTMGDYAHIAE >NZ_CP043433.1|WP_001221538.1|965572_966334_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFDNGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLNGYQHYDTAAAVTCALLRGLSREPLRTGRILNVNVPDLPLAQVKGIRVTRCGSRHPADKVIPQEDPRGNTLYWIGPPGDKYDAGPDTDFAAVDEGYVSVTPLHVDLTAHSAHDVVSDWLDSVGVGTQW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP043433_1 | 1.15|941228|33|NZ_CP043433|CRISPRCasFinder | 941228-941260 | 33 | NZ_CP032236 | Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence | 63403-63435 | 4 | 0.879 |
NZ_CP043433_1 | 1.15|941228|33|NZ_CP043433|CRISPRCasFinder | 941228-941260 | 33 | NZ_LN681230 | Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence | 91355-91387 | 4 | 0.879 |
NZ_CP043433_1 | 1.5|941165|34|NZ_CP043433|PILER-CR,CRT | 941165-941198 | 34 | NZ_CP044178 | Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1 | 18967-19000 | 5 | 0.853 |
NZ_CP043433_1 | 1.5|941165|34|NZ_CP043433|PILER-CR,CRT | 941165-941198 | 34 | CP053324 | Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence | 25136-25169 | 5 | 0.853 |
NZ_CP043433_1 | 1.6|941226|35|NZ_CP043433|PILER-CR,CRT | 941226-941260 | 35 | NZ_CP032236 | Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence | 63403-63437 | 5 | 0.857 |
NZ_CP043433_1 | 1.6|941226|35|NZ_CP043433|PILER-CR,CRT | 941226-941260 | 35 | NZ_LN681230 | Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence | 91353-91387 | 5 | 0.857 |
NZ_CP043433_1 | 1.5|941165|34|NZ_CP043433|PILER-CR,CRT | 941165-941198 | 34 | NZ_LN890526 | Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence | 31716-31749 | 6 | 0.824 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP044178 | Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1 | 18969-19000 | 6 | 0.812 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | CP053324 | Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence | 25138-25169 | 6 | 0.812 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_LN890526 | Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence | 31718-31749 | 7 | 0.781 |
NZ_CP043433_1 | 1.8|941349|34|NZ_CP043433|PILER-CR,CRT | 941349-941382 | 34 | MN694003 | Marine virus AFVG_250M677, complete genome | 17627-17660 | 8 | 0.765 |
NZ_CP043433_1 | 1.12|941045|32|NZ_CP043433|CRISPRCasFinder | 941045-941076 | 32 | MG592432 | Vibrio phage 1.050.O._10N.286.48.A6, partial genome | 21687-21718 | 8 | 0.75 |
NZ_CP043433_1 | 1.12|941045|32|NZ_CP043433|CRISPRCasFinder | 941045-941076 | 32 | MG592431 | Vibrio phage 1.049.O._10N.286.54.B5, partial genome | 21426-21457 | 8 | 0.75 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP053022 | Sphingobium yanoikuyae strain YC-XJ2 plasmid p-A-Sy, complete sequence | 329022-329053 | 8 | 0.75 |
NZ_CP043433_1 | 1.17|941351|32|NZ_CP043433|CRISPRCasFinder | 941351-941382 | 32 | MN694003 | Marine virus AFVG_250M677, complete genome | 17629-17660 | 8 | 0.75 |
NZ_CP043433_1 | 1.10|940923|32|NZ_CP043433|CRISPRCasFinder | 940923-940954 | 32 | CP006879 | Rhizobium gallicum bv. gallicum R602 plasmid pRgalR602b, complete sequence | 405613-405644 | 9 | 0.719 |
NZ_CP043433_1 | 1.12|941045|32|NZ_CP043433|CRISPRCasFinder | 941045-941076 | 32 | NC_047790 | Pseudoalteromonas phage C5a, complete genome | 34441-34472 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP048340 | Escherichia coli strain 142 plasmid p142_C, complete sequence | 2410-2441 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_LR130559 | Escherichia coli strain MS14385 isolate MS14385 plasmid 5 | 41882-41913 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP020518 | Escherichia coli strain 222 plasmid unnamed2, complete sequence | 13450-13481 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP020497 | Escherichia coli strain 103 plasmid unnamed2, complete sequence | 37140-37171 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP040921 | Escherichia coli strain FC853_EC plasmid p853EC2, complete sequence | 32060-32091 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | CP053252 | Escherichia coli strain SCU-204 plasmid pSCU-204-5, complete sequence | 19381-19412 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP042622 | Escherichia coli strain NCYU-26-73 plasmid pNCYU-26-73-7, complete sequence | 2614-2645 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_LT985302 | Escherichia coli strain ECOR 39 genome assembly, plasmid: RCS82_pI | 11943-11974 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP028194 | Escherichia coli strain CFSAN018748 plasmid pGMI14-004_3, complete sequence | 15383-15414 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP024865 | Escherichia coli strain AR_0015 plasmid unitig_3_pilon, complete sequence | 22646-22677 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | AP019710 | Escherichia coli O145:H28 122715 plasmid pO145_122715_2 DNA, complete genome | 4361-4392 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP024829 | Escherichia coli strain CREC-544 plasmid pCREC-544_3, complete sequence | 2221-2252 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP009861 | Escherichia coli strain ECONIH1 plasmid pECO-b75, complete sequence | 2868-2899 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | CP025877 | Escherichia coli strain 503458 plasmid p503458_49, complete sequence | 18343-18374 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP023368 | Escherichia coli strain 1428 plasmid p48, complete sequence | 4914-4945 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP032259 | Escherichia coli strain AR_0067 plasmid unnamed2, complete sequence | 23402-23433 | 9 | 0.719 |
NZ_CP043433_1 | 1.14|941167|32|NZ_CP043433|CRISPRCasFinder | 941167-941198 | 32 | NZ_CP037450 | Escherichia coli strain ATCC 25922 plasmid unnamed, complete sequence | 15851-15882 | 9 | 0.719 |
NZ_CP043433_1 | 1.15|941228|33|NZ_CP043433|CRISPRCasFinder | 941228-941260 | 33 | NZ_CP031947 | Ruegeria sp. AD91A plasmid unnamed1, complete sequence | 143751-143783 | 9 | 0.727 |
NZ_CP043433_2 | 2.2|957775|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT | 957775-957806 | 32 | KY006853 | Erythrobacter phage vB_EliS_R6L, complete genome | 41418-41449 | 9 | 0.719 |
NZ_CP043433_2 | 2.6|958019|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT | 958019-958050 | 32 | MK449011 | Streptococcus phage Javan92, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP043433_2 | 2.6|958019|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT | 958019-958050 | 32 | MK448835 | Streptococcus phage Javan93, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP043433_2 | 2.6|958019|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT | 958019-958050 | 32 | MK448836 | Streptococcus phage Javan95, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP043433_2 | 2.6|958019|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT | 958019-958050 | 32 | MK448825 | Streptococcus phage Javan639, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP043433_2 | 2.8|958141|32|NZ_CP043433|CRISPRCasFinder,CRT | 958141-958172 | 32 | NZ_MG266000 | Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence | 5501-5532 | 9 | 0.719 |
NZ_CP043433_1 | 1.10|940923|32|NZ_CP043433|CRISPRCasFinder | 940923-940954 | 32 | NZ_CP049244 | Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence | 699963-699994 | 10 | 0.688 |
NZ_CP043433_1 | 1.16|941290|32|NZ_CP043433|CRISPRCasFinder | 941290-941321 | 32 | NZ_LR134399 | Listeria monocytogenes strain NCTC7974 plasmid 2, complete sequence | 103231-103262 | 10 | 0.688 |
NZ_CP043433_1 | 1.6|941226|35|NZ_CP043433|PILER-CR,CRT | 941226-941260 | 35 | NZ_CP031947 | Ruegeria sp. AD91A plasmid unnamed1, complete sequence | 143751-143785 | 11 | 0.686 |
1. spacer 1.15|941228|33|NZ_CP043433|CRISPRCasFinder matches to NZ_CP032236 (Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence) position: , mismatch: 4, identity: 0.879
caaacaccaagctcttccgccgcgcgttcctga CRISPR spacer catttaccaagctcttccgctgcgcgttcctga Protospacer ** .***************.************
2. spacer 1.15|941228|33|NZ_CP043433|CRISPRCasFinder matches to NZ_LN681230 (Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence) position: , mismatch: 4, identity: 0.879
caaacaccaagctcttccgccgcgcgttcctga CRISPR spacer catttaccaagctcttccgctgcgcgttcctga Protospacer ** .***************.************
3. spacer 1.5|941165|34|NZ_CP043433|PILER-CR,CRT matches to NZ_CP044178 (Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1) position: , mismatch: 5, identity: 0.853
ccaattt----aggggccggaactccgggaaaggcagc CRISPR spacer ----tttgagaaggggccggaactccgggaaaggcacc Protospacer *** ************************* *
4. spacer 1.5|941165|34|NZ_CP043433|PILER-CR,CRT matches to CP053324 (Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.853
ccaattt----aggggccggaactccgggaaaggcagc CRISPR spacer ----tttgagaaggggccggaactccgggaaaggcacc Protospacer *** ************************* *
5. spacer 1.6|941226|35|NZ_CP043433|PILER-CR,CRT matches to NZ_CP032236 (Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence) position: , mismatch: 5, identity: 0.857
cgcaaacaccaagctcttccgccgcgcgttcctga CRISPR spacer ggcatttaccaagctcttccgctgcgcgttcctga Protospacer *** .***************.************
6. spacer 1.6|941226|35|NZ_CP043433|PILER-CR,CRT matches to NZ_LN681230 (Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence) position: , mismatch: 5, identity: 0.857
cgcaaacaccaagctcttccgccgcgcgttcctga CRISPR spacer ggcatttaccaagctcttccgctgcgcgttcctga Protospacer *** .***************.************
7. spacer 1.5|941165|34|NZ_CP043433|PILER-CR,CRT matches to NZ_LN890526 (Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence) position: , mismatch: 6, identity: 0.824
ccaattt----aggggccggaactccgggaaaggcagc CRISPR spacer ----tttgagaaggggccggaactccggaaaaggcacc Protospacer *** *****************.******* *
8. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP044178 (Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1) position: , mismatch: 6, identity: 0.812
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccgggaaaggcacc Protospacer . ************************* *
9. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to CP053324 (Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.812
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccgggaaaggcacc Protospacer . ************************* *
10. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_LN890526 (Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence) position: , mismatch: 7, identity: 0.781
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggaaaaggcacc Protospacer . *****************.******* *
11. spacer 1.8|941349|34|NZ_CP043433|PILER-CR,CRT matches to MN694003 (Marine virus AFVG_250M677, complete genome) position: , mismatch: 8, identity: 0.765
cgcggtcgcagcctggcctgttgccgtagaatcg CRISPR spacer cgcgatcgcagcctgggctgttgccgtgctcgcc Protospacer ****.*********** **********. *
12. spacer 1.12|941045|32|NZ_CP043433|CRISPRCasFinder matches to MG592432 (Vibrio phage 1.050.O._10N.286.48.A6, partial genome) position: , mismatch: 8, identity: 0.75
ctgccggtctgtgctgttgtcgtcaataatca CRISPR spacer aatctgctctgtgctgttgtagtcaattataa Protospacer *.* ************* ****** ** *
13. spacer 1.12|941045|32|NZ_CP043433|CRISPRCasFinder matches to MG592431 (Vibrio phage 1.049.O._10N.286.54.B5, partial genome) position: , mismatch: 8, identity: 0.75
ctgccggtctgtgctgttgtcgtcaataatca CRISPR spacer aatctgctctgtgctgttgtagtcaattataa Protospacer *.* ************* ****** ** *
14. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP053022 (Sphingobium yanoikuyae strain YC-XJ2 plasmid p-A-Sy, complete sequence) position: , mismatch: 8, identity: 0.75
---aatttaggggccggaactccgggaaaggcagc CRISPR spacer gtcagtt---gggccggaaagccgggaaaggcata Protospacer *.** ********* ************
15. spacer 1.17|941351|32|NZ_CP043433|CRISPRCasFinder matches to MN694003 (Marine virus AFVG_250M677, complete genome) position: , mismatch: 8, identity: 0.75
cggtcgcagcctggcctgttgccgtagaatcg CRISPR spacer cgatcgcagcctgggctgttgccgtgctcgcc Protospacer **.*********** **********. *
16. spacer 1.10|940923|32|NZ_CP043433|CRISPRCasFinder matches to CP006879 (Rhizobium gallicum bv. gallicum R602 plasmid pRgalR602b, complete sequence) position: , mismatch: 9, identity: 0.719
aggatttcgccgcgctgttggcctccagattc CRISPR spacer cagggtcgaccgcgctgtcgccctccagattc Protospacer .*. *. .*********.* ***********
17. spacer 1.12|941045|32|NZ_CP043433|CRISPRCasFinder matches to NC_047790 (Pseudoalteromonas phage C5a, complete genome) position: , mismatch: 9, identity: 0.719
ctgccggtctgtgctgttgtcgtcaataatca CRISPR spacer tttggtgtctgtgccgttttcgtcaataagct Protospacer .* ********.*** ********** *
18. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP048340 (Escherichia coli strain 142 plasmid p142_C, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
19. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_LR130559 (Escherichia coli strain MS14385 isolate MS14385 plasmid 5) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
20. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP020518 (Escherichia coli strain 222 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
21. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP020497 (Escherichia coli strain 103 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
22. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP040921 (Escherichia coli strain FC853_EC plasmid p853EC2, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
23. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to CP053252 (Escherichia coli strain SCU-204 plasmid pSCU-204-5, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
24. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP042622 (Escherichia coli strain NCYU-26-73 plasmid pNCYU-26-73-7, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
25. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_LT985302 (Escherichia coli strain ECOR 39 genome assembly, plasmid: RCS82_pI) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
26. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP028194 (Escherichia coli strain CFSAN018748 plasmid pGMI14-004_3, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
27. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP024865 (Escherichia coli strain AR_0015 plasmid unitig_3_pilon, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
28. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to AP019710 (Escherichia coli O145:H28 122715 plasmid pO145_122715_2 DNA, complete genome) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
29. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP024829 (Escherichia coli strain CREC-544 plasmid pCREC-544_3, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
30. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP009861 (Escherichia coli strain ECONIH1 plasmid pECO-b75, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
31. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to CP025877 (Escherichia coli strain 503458 plasmid p503458_49, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
32. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP023368 (Escherichia coli strain 1428 plasmid p48, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
33. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP032259 (Escherichia coli strain AR_0067 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
34. spacer 1.14|941167|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP037450 (Escherichia coli strain ATCC 25922 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
35. spacer 1.15|941228|33|NZ_CP043433|CRISPRCasFinder matches to NZ_CP031947 (Ruegeria sp. AD91A plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
caaacaccaagctcttccgccgcgcgttcctga CRISPR spacer gaaacaccaaggtcttccgccgcacgggcaaag Protospacer ********** ***********.** * ..
36. spacer 2.2|957775|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT matches to KY006853 (Erythrobacter phage vB_EliS_R6L, complete genome) position: , mismatch: 9, identity: 0.719
aatatatggcgctcacgcgcatgagcattctc CRISPR spacer acgcaatggcgctgacgcgcatgatcatttcg Protospacer * ******** ********** ****..
37. spacer 2.6|958019|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT matches to MK449011 (Streptococcus phage Javan92, complete genome) position: , mismatch: 9, identity: 0.719
gattgctcagattgggaatttgaccagcggcc CRISPR spacer agtcgctcagattgggaatttgtcaagatgta Protospacer ..*.****************** * ** *.
38. spacer 2.6|958019|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT matches to MK448835 (Streptococcus phage Javan93, complete genome) position: , mismatch: 9, identity: 0.719
gattgctcagattgggaatttgaccagcggcc CRISPR spacer agtcgctcagattgggaatttgtcaagatgta Protospacer ..*.****************** * ** *.
39. spacer 2.6|958019|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT matches to MK448836 (Streptococcus phage Javan95, complete genome) position: , mismatch: 9, identity: 0.719
gattgctcagattgggaatttgaccagcggcc CRISPR spacer agtcgctcagattgggaatttgtcaagatgta Protospacer ..*.****************** * ** *.
40. spacer 2.6|958019|32|NZ_CP043433|PILER-CR,CRISPRCasFinder,CRT matches to MK448825 (Streptococcus phage Javan639, complete genome) position: , mismatch: 9, identity: 0.719
gattgctcagattgggaatttgaccagcggcc CRISPR spacer agtcgctcagattgggaatttgtcaagatgta Protospacer ..*.****************** * ** *.
41. spacer 2.8|958141|32|NZ_CP043433|CRISPRCasFinder,CRT matches to NZ_MG266000 (Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence) position: , mismatch: 9, identity: 0.719
gttgggttgcatagatgacacgcttataaata CRISPR spacer gaattatggcatagatgacatgattataaatt Protospacer * .* ************.* ********
42. spacer 1.10|940923|32|NZ_CP043433|CRISPRCasFinder matches to NZ_CP049244 (Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence) position: , mismatch: 10, identity: 0.688
aggatttcgccgcgctgttggcctccagattc CRISPR spacer gggatatcgccgcgctgatggcctatgaccac Protospacer .**** *********** ****** ... . *
43. spacer 1.16|941290|32|NZ_CP043433|CRISPRCasFinder matches to NZ_LR134399 (Listeria monocytogenes strain NCTC7974 plasmid 2, complete sequence) position: , mismatch: 10, identity: 0.688
ttgctctcattaaaggggtttccatgtttgat CRISPR spacer gattcgtcattacagtggtttccatgttttag Protospacer .. ****** ** ************* *
44. spacer 1.6|941226|35|NZ_CP043433|PILER-CR,CRT matches to NZ_CP031947 (Ruegeria sp. AD91A plasmid unnamed1, complete sequence) position: , mismatch: 11, identity: 0.686
cgcaaacaccaagctcttccgccgcgcgttcctga CRISPR spacer gagaaacaccaaggtcttccgccgcacgggcaaag Protospacer . ********** ***********.** * ..
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1414656 : 1420709
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP043433|1414656:1420709|DBSCAN-SWA CATATACTTTTTGTGGGACTGCTTCAACCTTTGGGCAGATATCGGAAATGAAAAAGATAGACCAGGCAACTATTCGCTATCAGAATATCCGGTACACCAACTACCAACTACCAACTACCAACTACCAACAAATCATTTAGTCGATGGTCTCGTTGCCATTGGTTCATAGAGTGTTGGTCTTGGTATGGATGGCTGGGGAAGTTATGTATCGAACATTCTTATGCAAGATTGCGCAGGGTCTGGTGATCTATGGTACACATATGGGAAGGCATTCACATATATTTCTGTAATCGATACTAAAACTTTAACACTAACTAATTGTTTGTAGAAAGTGGTTGCATTATTAATGGTTTGAGACTCATTGACATAAAACAAACACCATCTGGTAATCTGTCAGACCCCGCATCCTTAATAGTTAACTATAAAGATTGTATGGTAGTTGAGATGTCGTTAGTTCTAATATCGTGTCAGATAAATTTATAAAAGATTACTCATGTTTTGTGTCACATGCAAAAATAATTCGGCTTCGTTAAGGTCTTTAGGGGAAATACCTAATGGATAATTAGTTAGATTAACGTTAACAACACTTTGAACGTGTAATGAATATGGGGGTAAAATATAAGTATTGGGAGATTGTAATTAAAAATTATGTAATTGTCTGATTATTATATATTCACTCCAGCAAAGGAGAAAGGCAATTATGGACGAAAAGAAACTCACAGCTCTTGCGGCTGAACTGGCTAAAGGTCTTAAAACTGAAGCCGACCTCAGTCAGTTTTCCCATATGCTGACGACATTAACCGTCGAAACGGCGCTCAATGCTGAGCTGGCTGACCATCTCGGGCATGAGAAAAATGCTCCTAAAACGAGCTCAAACTCCCACAATGGCTACTCATTAAAAACGGTGCTGTATGATGGCGGCGAGATAGTGCTGAACATGCTGCGTGACCGTGAAAATACCTTTGAACTGCAGCTGATTAAGGAGCATCAGACGCGTATTACGAGATGGACAGTCAGATTTATCCCTTTATGCCAAAGGCATGACATGACTACCCGCGAAATCATCGCCACCTTCAAAGAGATGTACAACGCTGATGTGTCGCCCATGCTGATACTTAAGATCACCGCCGCAGTAAAAGAGTTGGTCAAAGGGGCGTCAAAAAATGGAGTATGCCGATCCAGGACTGGTCGCTGGCAATGAGTCGCTTTATTATCGAGTTCGGTGACCGCCTGAGCGATCACCGTTAATATAGTGGCAGTTACACAGTATGACTGGCAGGTTCTATATTGGATTTATATGTAATAGTAACAGCCTATCTATTTGATTATTTTATTTCCGATTTTATTAAAAAGGAAATCAGAACCAGGCTTTGTTAAATGTCCCCAATCAACGGCAGTGATAAAATCAGGACCATTACCAACTCTTGTAAGACATCCACTTTCGTTACATAATGCTTTGTATGCTGATATATATTCAATTCCCATTTTTGGAACATTGTTACTAAAGTAAGAGTCCCACTCGCTTATTTCACTATTTAATCCATATGTCATATACAATGGTGGGGTTTTTTTAAACTCACTCAGGTAGTTAGATATTATTTTAACTAAATTTGCATTCCATTCCGGGACTGGTCCAATGAAAATAATCCTTGAGTCAGGGGATGCCTCTTTAATTTTTTTTATGGTTAATGATAACGTATCAATTGCTAACTTTTTATCATGTACTCCATTTGTTCCTCGAACTGACCATGTCAGCAGAACCACCTCAGGCTGAACACGTTTAATTTCATTAATTCTATTATTGTTTAGAGTGATGACACTTCTCTGTAAATCATCTTTACCGTCAACAAATAGAGGAGGAGCGTTACCATCTGTCATTTGGCTTATTATATAATCAGAACCTTTATTATCTATATAATGAGAAAGTCCATTGAAAAGAGCCGCCGCATAAGAATCACCAATGATAAATATATTATGCTTGCCATTTTTTATACATCCATTGGATATGGCAGCAGTAAGTTGTACTGAGTGGCATATCCCTCCACGGAGTAGTTCTCCATATTTATAATAATTGTACACGTCAGTAACAGAAGCATATTCACCTGCTGATTTATTGATTTCCCTGTCTTTAACTCCATTTATATGAAAAATAAATGCGCCAATTAAACCTGTCCCAAATACTGATAATGCTAATAATATTGCTGTGATATATTTATTTCTGGCATTTCTCAGTGGTTTTTCAATTAAATAATAAGTTAATATCGCCAAAAAGAACGATGATAATAAAAGAAGAATTAATTCATGGTAGTCTGGTGAACCAGCAAATATTGAACGATAGAATGAATAAATAGGCCAATGCCACAAATAAAGAGGATAGCTAATAAGACCAAAGAAAACAACAGGCCTAACACTAAGCAATTTCGACACAACTAAATCATTACCATTAGATGCTATTATAAGAGAGGCGCCAAGTACTGGGATTATTGCTATATATCCAGGAAATGACATCTTTTCATCTATCATGGTTATTGATAATGCGATTAGTATAATTCCTAACAGGGACATTAATTTTGATAACGAAGTGTTTATTCCTATAAATCTCAATGTGGATATAATCGCTCCAGCCATTAACTCCCAAAATCTTGATGCGGGAGAGTAGTAATTAGCTCCGCCATCAGATGCCATTGTAAAAATGCTAATCGCATAGCTAATTATAAATATAGTTGCGCATGATAATACTATGTTTCTGTTATGGTTTTTGCTTCTAAAGCATAGCAATATAACTACTGGCCATATTATATAAAATTGCTCTTCAATTCCCAGTGACCACAAATGTAGTAAAGGTTTAAGGTATGACTTTGAATCAAAATAGCCAGACTCACTCCAAAGAGTAAAGTTTGATATAAAGAATGAGCCGCTAAAAACATGCTTACCAAGTAATTTGTAATCATCCTGGAATAAATAAACCCAGCCAACAATAAGACATGATACAAGAACTATGGATAATGCTGGAAATATTCTAAGCACTCTCCTTTTATAGAAATCAAGGTATGAAAATGATTTGTTTGATGCAGATTTTAATATTATTGATGTTATAAGGTATCCAGATATCACAAAGAATATATCTACTCCAACAAACCCACCCGGCAATAATGATGGGAAATAATGGAATATTACCACAGATAAAACCGCTATTGCGCGTAATCCATCTATATCAGGTCTGTATTTTAAGTGTTCCAACTTAAATTACCTCAATTTTAAAAAAAGATTAATAAAATGGTTGTGCATCTTGCATCATTCCCGAAGTTTCGTGTAAACGAAAACGGAATGACGAGTGGATCAGATACGGCTATTATTTTAATTATTGACTCTGTCACATCTTTACTTCCGTCATTAATAAAAATAATCTCAACTTTATATTTTTCAAGTTCATTAAACTCATGTACCGTTCTATAGAAAATCGGTATCGTGTCTTCTTCGTTAAAAACTAGGACGACAAGAGAGATTTTCATTTTTATCCCTGAAGATAAAGAATCTGGAATAGATAAAGCCGCATACCAGGCTAATTGCCGAGAAAGTGATGAGGGTAACCAATGGTGGCAAAGAACATTGGTCAGCCATCCATCCAACAACAGCGCTCAGTGCTCCCATGAATCCCATATACATCATGTAGCGAATTGCAGTAGTGCTGGCATTAAAGGTGAAGCGCGCATTAGCATAGAAGCTGAACGATACAGCGATAACAAAATCGGAAAAGTTCGTCAGCGCCTGATGCGTATGCATCCCATACATACAGAAAGCAAATACTCTCCAATGAATGAGCATGTTAAGAACACCGATCGATGTGTACTTAGCGAATAACTTCAACATTATGAAAATTATCAGATTCAGAAAGGTCTGGAGTGTAGCACTACAAATTGGTTTGATCGATATAAGCGATCAATAATTGTATTTTTAATAGTTTTAAACTATTGAGTTTTAATATATTGATCGATGTTATCGATCAATTGGTATTGCTGATTGCCAAGCGTCTTGGAATAAAAACGGGACATGTAAAGCTTTGCATCGTCTTACAAGGCTTTGCATTTTTTTTCAGGGAGAGGTACTTGAAAGGGTGGAAGTGCTGGGGGGAGGGGGAGCGTTAAAAATTCTGTATAATTTTAGTAACATAAAATAAAAAAGAATGGCACATGTCCCATCCCTTCGATTTCGACAAAGCACTTAAAGCCCTTCAGGTCCGGCCAGGCATTAACGGGCAAAGATGGCATCTTAACGCCATTAATCAAGTATTTAACCGAGTCTACCCTGTCTGCTGAACTTGATTCCCATCTGGCTCAGGATGTTGAGGCAAACCGTAAAAATGGTTCCGGCAAAAAGCCATTAAAGCCCCAACAGGCAGTTTTGAACTGGCAACTCCGCGCAATCGTAACGGCACTTTTGAGCCATAACTGGTGAAGAAGCTTCAGACCCCCTGTACGACGAGATCGAGCGCAATATCATTCGACTGTTTGCGCTGGAGATGAGTTATCAGGACATCGGCCGGGAGAGTGAAGATCTTTATGCCTTCAGCGTTTCAACCGCCACCGTCAGTGCAGTACCGATAAAGTTATCCCTGAACTAAAACAGTGGCAACAGCGCCCGCTGGAGAAGGTTTATCCCGTCGTCTGGCTGGACGCTATTCATTATAAAAACCGTGAGGATGGCCGTTATCAGAGCAAGGCGGTTTATACCGTTCTGGCACTGAATCTGGAAGGCAAAAAAGAAGTTCTGGGCCTATATCTGTCGGAAAGTGAAGGTGCTAACTTTTGGTTAAGTTCTAACAGCGAGAGGGTACTTTAAAGGGATGCTTTTCGTTATGTTTATAGGCACTATTCGCTGGAAATCATAAGACATCAAAAACGCTGCAACGCCTTGTGTGGTGTGGGGTTGCTGAGATTTGTGAGAGGTGGGTAAAAGAGGTCATGGTGTCCCCTGCAGGAATCGAACCTGCAACTAGCCCTTAGGAGGGGCTCGTTATATCCATTTAACTAAGGGGACAACGCGGCGCCAGTATAGCGTTTTTTATTCGCCGGAGTAAGTGTAGCGCCGCCTGACTGGTTAAACCGTCGCCACTCAGCGCTGTTTTTCCGCTTTTTTCCGCTCCCGTTCCAGGCGCTCGCCGCGTAGCCTCGCTTCTTCCTTACGCTTATTGCTCATATCGTTGCGGATCTGCGCGTGGCTCATCAATGCGAAAATAAAAGTGCCGCCGCAGATATTTCCGGCAAGTGTGGGAAGGGCGAAGGGCCAGAGAAAGTCGCTCCAGGGCAGCGTGCCGTTGAAAACCAAATACAAAATTTCAACGGAACCGACGACAATATGGGTGGTATCGCCCAGCGCGATAAGCCAGGTCATCAAAATAATGACCACAATCTTTGCCCCGCCTGCTGCAGGAAACATCCATACCATTGTGGCGATGATCCAGCCAGAGATAATCGCGTTGGCAAACATCTCCGTTGGGCTATTTTTCATGACCTCCATACCAATTTTGACAAAGGCGTCGCGGGTCTCTTCATCAAATATAGGCATATATTCAAATGCCCACGCCGCAACCCCGGTGCCAATAAGGTTGCCCAATAAGACTACGCCCCACAAGCGCATCAGCAGGCCAACGTTACTCAGAGTGGGATTTTGCATTACCGGCAACACGGCGGTAACGGTATTTTCAGTAAATAATTGCTGGCGGGCCATGATGACAATGATAAAACCAAAGGTATAGCCGAGATTTTCCAGTAAAAAGCCGCCGGGAACGCCTTCAAGCTGCACGTGGAAAATCCCTTTCGCCAGGAGTGATGCCCCCATAGAAAGTCCTGCGGCAATGGCTGACCAGAGCAAAGCCATCGCATCGCGTTCCATCTCTTTTTCACCATCCTGGCGAATATGTTCATGAATCGCCATGGCGCGGGAAGGAAGACGATCTTCATCCACTTCAATCTCTTTACCGCTTTGTTTTTCTTCGCTTTCAACTTCCAGGTCACTGCTTTGCCGGTTAATTTTATCGTCGTTAAGGCTATCCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP043433|1414656:1420709|1414656_1414824_+|WP_105789229.1|DBSCAN-SWA MYFLWDCFNLWADIGNEKDRPGNYSLSEYPVHQLPTTNYQLPTNHLVDGLVAIGS >NZ_CP043433|1414656:1420709|1418135_1418525_-|WP_001576268.1|DBSCAN-SWA MLKLFAKYTSIGVLNMLIHWRVFAFCMYGMHTHQALTNFSDFVIAVSFSFYANARFTFNASTTAIRYMMYMGFMGALSAVVGWMADQCSLPPLVTLITFSAISLVCGFIYSRFFIFRDKNENLSCRPSF >NZ_CP043433|1414656:1420709|1414839_1414983_+|WP_105789228.1|DBSCAN-SWA MDGWGSYVSNILMQDCAGSGDLWYTYGKAFTYISVIDTKTLTLTNCL >NZ_CP043433|1414656:1420709|1415972_1417895_-|WP_000400616.1|DBSCAN-SWA MEHLKYRPDIDGLRAIAVLSVVIFHYFPSLLPGGFVGVDIFFVISGYLITSIILKSASNKSFSYLDFYKRRVLRIFPALSIVLVSCLIVGWVYLFQDDYKLLGKHVFSGSFFISNFTLWSESGYFDSKSYLKPLLHLWSLGIEEQFYIIWPVVILLCFRSKNHNRNIVLSCATIFIISYAISIFTMASDGGANYYSPASRFWELMAGAIISTLRFIGINTSLSKLMSLLGIILIALSITMIDEKMSFPGYIAIIPVLGASLIIASNGNDLVVSKLLSVRPVVFFGLISYPLYLWHWPIYSFYRSIFAGSPDYHELILLLLSSFFLAILTYYLIEKPLRNARNKYITAILLALSVFGTGLIGAFIFHINGVKDREINKSAGEYASVTDVYNYYKYGELLRGGICHSVQLTAAISNGCIKNGKHNIFIIGDSYAAALFNGLSHYIDNKGSDYIISQMTDGNAPPLFVDGKDDLQRSVITLNNNRINEIKRVQPEVVLLTWSVRGTNGVHDKKLAIDTLSLTIKKIKEASPDSRIIFIGPVPEWNANLVKIISNYLSEFKKTPPLYMTYGLNSEISEWDSYFSNNVPKMGIEYISAYKALCNESGCLTRVGNGPDFITAVDWGHLTKPGSDFLFNKIGNKIIK >NZ_CP043433|1414656:1420709|1419767_1420709_-|WP_000377779.1|DBSCAN-SWA MDSLNDDKINRQSSDLEVESEEKQSGKEIEVDEDRLPSRAMAIHEHIRQDGEKEMERDAMALLWSAIAAGLSMGASLLAKGIFHVQLEGVPGGFLLENLGYTFGFIIVIMARQQLFTENTVTAVLPVMQNPTLSNVGLLMRLWGVVLLGNLIGTGVAAWAFEYMPIFDEETRDAFVKIGMEVMKNSPTEMFANAIISGWIIATMVWMFPAAGGAKIVVIILMTWLIALGDTTHIVVGSVEILYLVFNGTLPWSDFLWPFALPTLAGNICGGTFIFALMSHAQIRNDMSNKRKEEARLRGERLERERKKAEKQR >NZ_CP043433|1414656:1420709|1417912_1418167_-|WP_000703599.1|DBSCAN-SWA MKISLVVLVFNEEDTIPIFYRTVHEFNELEKYKVEIIFINDGSKDVTESIIKIIAVSDPLVIPFSFTRNFGNDARCTTILLIFF |
6 | Salmonella_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1657142 : 1666313
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP043433|1657142:1666313|DBSCAN-SWA GATGATTGAATTTAACCATGTCAGTAAAACCTTCGGCGATCAACAGGCTGTTAGCGACCTCAATTTGCACTTTAGCGAAGGCAGCTTTTCGGTGTTAATTGGCACCTCCGGTTCGGGAAAATCGACCACTCTGAAGATGATTAACCGGCTGGTAGAGCATGATAGCGGAACGATCCGTTTTGCCGGGGAAGAGATCCGCAGCCTGCCGGTGCTTGAACTGCGCCGTCGCATGGGCTATGCCATTCAGTCTATCGGTCTTTTTCCCCACTGGACGGTGGCGCAAAATATCGCCACCGTACCGCAACTACAAAAGTGGTCGCGTGCGCGGATTAACGATCGTATTGATGAACTGATGGCATTATTGGGTCTGGAAAGCGCGCTGCGCGATCGTTATCCGCATCAGCTTTCCGGCGGGCAACAGCAGCGGGTCGGCGTTGCGCGGGCGCTGGCTGCCGATCCGCAGGTATTGCTGATGGACGAGCCTTTCGGCGCGCTTGATCCGGTAACGCGCGGCGCATTGCAGCAGGAGATGACCCGCATTCATCAGCTACTGGGGCGCACCATCGTACTGGTGACGCACGACATCGACGAGGCGCTACGCCTCGCCGACCATCTGGTGCTGATGGACGGGGGCAATGTTATCCAACAGGGATCGCCACTTTCTATGCTGACCTCGCCGGAAAATGATTTCGTGCAGGCGTTTTTTGGCCGCAGCGAGCTGGGCGTAAGGCTGCTTTCGTTACGTAGCGTAGGCGATTATGTACGTCGGCATGAACAGCTCAGCGGCGACGCGCTGGTGGAAGAGATGACGCTACGCGATGCGCTATCGATGTTTGTCGCCCGTCGGTGCGACGTCCTGCCGGTGGCGAATCAGCAGGGCGAGCCCTGCGGTACGCTCCATTTCCGCGATCTACTTTCGGAGACGTCCCCCCGTGAAACGACTGTGTGATCCGCTTCTCTGGCTTATTGTTCTGTTCTTGCTTCTGCTGTTTGGATTGCCTTATAGCCAGCCGTTCTTCGCCGCGCTGTTTCCCGATTTACCGCGCCCGGTCTACCAACAGGAGAGTTTTGCCGCCCTCGCGCTCGCCCATTTCTGGTTGGTGGGCATCTCAAGTCTGTTTGCCGTCGTGGTGGGCGTCGGCGCAGGGATTGCGGTCACGCGAGAAAGTGGGAAGGAGTTTCGTCCCCTGGTGGAGACTATCGCCGCCGTCGGGCAGACCTTTCCCCCGGTCGCGGTACTGGCGATCGCGGTACCCGTCATGGGTTTTGGTCAGCAACCAGCCATTATCGCCTTGATCCTGTATGGAGTGTTGCCCATCCTGCAGGCGACCCTGGCCGGGCTGGGCGCGGTGCCTGCCAGCGTGATGAGCGTTGCCAGCGGTATGGGAATGAGCCGTCGCCAACAGTTGTATCAGGTTGAACTGCCGCTGGCCGCGCCGGTGATTCTGGCGGGCATCCGAACCTCGGTGATTATCAATATTGGTACGGCGACCATCGCTTCAACAGTGGGGGCCAGTACGTTAGGCACGCCGATCATTATCGGGCTTAGCGGCTTTAATACGGCCTATGTTATCCAGGGGGCGCTGCTGGTGGCGCTGGCGGCGATCATTATCGATCGCCTGTTTGAAAGGCTGACGCGCGCGCTTACCCGGCACGCAAAATAAAACTGTAACCTGCCAGCATCACGCCGCCGATACCGCCAATAGCCATCAGCAGGAAAAGGGCGATCACCCCGATTTTCGCTACGCGCATTATGTACTCCTTATGTTAATAAAAGGAGTATACATTAAAGCGAATTTGTTAGCTGCTGTTTAAACGCCAAGGGGATGAATGTCGCGTCCCTGGGCGCGCCATGCCAGGAGTTGCTGCTGCTGCGCCAGCGTCTGGTTTTCTCCGCACCATACCAGTAACGTCTTGCCGTCAAACAGTTCCGGGCGGAACTGGCTAAGCGAATGCGCCAGCACGTCGATTCGCCATCCCTGTTGGCTGGCGACCCAACCTTCCAGCCACAGGCGGGTGGTATCATGGATATTCCAGCCGATCACCAACGCATCTTTTCCCTGTTTCTTACGCGCAGACGCCAGGCAGAGCGCAATATAGTTGATCAGGATACCGTCAAGAATGCCGAGCAGCGCCTGAAGGGCAGGTTGTTGGCACTGTAATCGTCGCCGCAGCGGGACGAACAGGTTAGTGGTCAATGTTTGGGCTGGATAATCCTGACAGCGTTCTTTGACCCATAACCGTAAACTGTGCAGATTACTGCTTTGCAGGTAGTGCAGCAGGATCTCCTGCTGTTCGCGCCAGCCGTTAGGTTGTTCGCTACTGTCGCTACTGAGCAGCACTTTGACTTTGCTGACCTGGACGCCGTTATTTATCCAGCGCTTGATTTCGCGGATTCTGTCGATATCGGCATCGTTAAACAGACGATGACCGCCATCCGTTCGCTGTGGTTTTAAAAGTCCATAACGTCTCTGCCACGCGCGCAACGTGACAGGATTGATATCACAAAGCAAAGCCACTTCACCAATTGTGTAAAGCGCCATCGTTTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCTCTTTTACGAACCAGGAAGTTTTGCCTGTTTTTTATGCATTAAAACGCGAAGTAGCGGGTTGCGGCGGGGTGTTTAAGTGATCGTATTCACGAATTCATATTTTTATGCAACAGTTCAAAGAAAGTTAATCGTACTCAATGTATGTTACGCGCTTTTAATTGAAGTGTGGTTTGCGGGTATGTACGAGTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTGTTTCTGGTCATTGCGTGGCTAATGAGTAAAACGCGTCTGTTCATCCCGCTTATGCAGGTCACGGTTCGTCTGCCGCACAAGCTTCTGTGTTACGTCACGTTTTCTATCTTCTGCATCATGGGCACTTATTTTGGGCTGCATATCGAAGATTCGATTGCCAATACCCGCGCAATTGGCGCGGTGATGGGCGGCCTACTCGGCGGGCCGGTCGTCGGCGGGCTGGTCGGTCTGACCGGTGGGTTACATCGGTATTCTATGGGCGGCATGACGGCGCTGAGCTGTATGATTTCCACCATCGTCGAAGGGCTGCTGGGCGGGTTGGTACACAGCGTTCTCATACGTCGCGGACGCCCGGACAAAGTGTTTAGCCCGCTGACGGCGGGAGCAATTACGTGTATTGCCGAACTGGTGCAGATGCTGATCATTTTACTGATAGCCAGGCCGTTTGACGATGCCTTGCATCTGGTCAGTAATATTGCCGCGCCGATGATGGTGACGAATACCGTTGGCGCCGCGCTGTTCATGCGTATTTTGCTCGATAAGCGCGCCATGTTCGAAAAATATACTTCGGCATTTTCTGCTACCGCGCTGAAGGTCGCCGCGTCAACGGAGGGGATTCTGCGTCAGGGATTTAACGAAGTGAACAGTATGAAGGTGGCGCAGGTGTTATATCAGGAGCTGGATATTGGCGCCGTCGCCATCACCGATCGCGAAAAACTGCTGGCTTTTACTGGTATTGGCGACGATCACCATCTACCGGGCAAACCCATTTCATCAGGTTATACGCTGAAAGCAATTGAAACCGGAGAGGTGGTTTATGCCGATGGCAACGAAGTGCCGTATCGCTGTTCGCTACACCCGCAGTGTAAACTCGGCTCGACGCTGGTGATCCCGCTGCGTGGCGAAAATCAGCGAGTCATGGGCACCATTAAATTGTACGAAGCGAAAAACCGGCTGTTTAGCTCAATTAACCGCACCCTGGGAGAGGGTATTGCGCAGCTTTTATCCGCGCAGATCCTGGCCGGGCAGTATGAACGGCAGAAGGCGTTGCTGACGCAGTCAGAGATCAAGCTGTTGCACGCGCAGGTGAACCCGCATTTTCTGTTTAACGCGCTCAATACCATTAAAGCGGTGATTCGCCGCGACAGCGAACAGGCCAGCCAACTGGTGCAGTACTTGTCGACCTTTTTTCGCAAAAATTTAAAACGCCCGTCGGAAATCGTCACGCTGGCGGATGAAATTGAACACGTAAACGCTTATCTGCAAATTGAAAAAGCGCGTTTTCAGTCGCGTCTGCAGGTACAGCTTGATGTTCCATCGACGCTTTCACGTCAGAAATTGCCTGCGTTTACATTACAGCCGATTGTTGAGAACGCCATTAAACATGGCACGTCGCAACTGCTTGATACCGGCAACGTCGCTATTCGCGCCCGGCGCGAAGGGCAGCATTTGATGTTAGATATTGAGGATAATGCGGGACTGTATCAGCCTTCCGCCGGCAGTAGCGGGCTGGGGATGAGTCTGGTTGATAAACGTCTGCGCGAACACTTTGGCGATGATTATGGTATTAGCGTGGCCTGCGAGCCGGACTGTTTTACCCGAATTACATTACGACTTCCACTGGAGGAGGACGCATGATTAAAGTGCTGATTGTGGATGATGAGCCGTTAGCGCGGGAAAATCTGCGGATTTTGCTCCAGGGGCAGGATGACATTGAGATTGTGGGAGAGTGCGCGAACGCGGTAGAAGCGATTGGCGCGGTACATAAGTTGCGACCTGATGTGCTGTTTCTGGATATTCAGATGCCGCGTATCAGTGGACTGGAGATGGTAGGAATGCTTGATCCGGAACACCGCCCGTATATCGTTTTTTTAACCGCGTTTGACGAATACGCCATCAAAGCCTTTGAAGAACACGCTTTTGATTATCTGCTCAAGCCGATAGAGGAGAAACGGCTGGAAAAAACGTTACATCGTCTGCGTCAGGAGCGCAGTAAACAGGATGTTTCGTTGTTGCCGGAAAACCAGCAGGCGCTTAAATTCATTCCCTGTACCGGACACAGCCGGATCTATTTGTTGCAAATGGATGATGTCGCCTTTGTCAGTAGCCGTATGAGCGGCGTTTATGTGACCAGCAGTGAAGGGAAAGAGGGGTTTACCGAGCTGACGCTGCGCACGCTGGAAAGCCGGACGCCGCTACTGCGTTGTCATCGTCAGTTTCTGGTGAATATGGCCCATTTGCAGGAAATTCGGCTGGAGGATAATGGGCAGGCAGAGCTGATTTTACGCAACGGCCTGACGGTGCCGGTAAGCCGTCGCTATCTGAAAAGTTTAAAAGAGGCGATTGGCCTGTAAAAGACTGTTAGAATATCGTTTTGCCATAGAAACGACCGAAGGCCTCATGCTGAGTAACGATATTCTGCGTAGCGTGCGCTACATTTTAAAAGCTAATAATACCGATCTGGCGCGTATCCTGGCGCTGGGTAACGTTGATGCTACGCCGGAGCAGATAGCAATCTGGTTGCGCAAAGAAGAGGAAGAGGGGTTTCAGCGTTGCCCGGATATCGTGTTGTCCTCATTTCTCAATGGCCTCATTTATGAAAAACGCGGCAAAGATGAGGCGGCGCCTGCATTGACGGCGGAACGTCGTATCAACAACAATATTGTGCTGAAAAAGCTGCGTATTGCCTTTTCGCTAAAAACAGATGATATCCTGGCGATACTTACCGGTCAGTTGTTTCGTGTCTCAATGTCAGAGATCACCGCGATGATGCGCGCGCCGGACCATAAGAACTTCCGCGAATGCGGCGATCAGTTTATGCGTTATTTTCTGCGCGGTCTGGCGGCCCGTGAACACGCGGCGAAGTAATTCTGCGGTATTGTTCCCGGCAGCGTCCTGTCTGACCGGGAAAACGCATTATTATACTAATTGATTCTATGATACCCGCTCTCTTCCAACAGTTTCTGCGAGCGAATCATTGACAGATAGTACGCGGAACAGTTGTCAATTGATGATCCTGGCAATTTACAGAGGTCGCTTATTTTTGCCTGGGTAAAATCAATATCCACATATTCCGTAGCATAGCTATCATAATAGTCGATTCGTTCAGTCAAACCCGGCATACCCTGATAAGCTTCGCCGACTTGACTCAGCATTTTTTGTGCTTCTTCTTTATTATTGGCTTTCAGGGTCTTATAAAGTAGTTTATGTTCAGATATTTGACGTAAAACGATATCCCCTTTGTAGTAATAGGTTAACTTGATTTCAATACCGTTGAGATTACCTACATAGCGTTGTGTTTCCTCTGACTCTTTGCTAGCGGCTATCTTCTTGATAAACGCTGTCATGTTATTTTGCTTTCCCTGGAGAGTATCGTTTTTCTGATCGCAGCCAGTTATGCTAACCGATAGAGAGAGCGCGAATAGTGGCAGTGCCATAAGACGTAGAACCTGCATAACAATTCCTTGTCGTTAAGTATTGGTGTGGCCAGGAATTCAGGGATTATAGGCTTTGGCGAGGGGACTTACAGCGAGGCTGTCTTTTTTCGGAATTCATAAAGAAAAGACGCTGCCGAAGCAGCGCCCTGAGCGACTTTACCAGTCGATGCAATACATTATGCCTGCCAGTTATTTCGCTTCTTTAAAACCAGCAGCTTCCAGCAGCGTCTGGGTTTGTTTCATGCTGATACCTTTGCTGGTATCGCCGGACACCATCGTCCCTGAGATTTGCTGTAACGCTTTAAAGTCCACTTTTTCCATATCCACAGAGACGTTTTCCTGGGCATAGGTATCTTCATAGGTTAATTTTTCTTCCACTCCGGCGATATTTTTATATTTCGCGCTCAGCGGATCGAGAATTTTGGCGGCATCTTCTTTCGTTTTAGCGCCTACAGTGGCATAGCTGATTTTACTTTCAGACGTCTGCTTAATGATTTTGTCACCTTTATAGGTGTAAGTAATTGAAATTTCTGTCCCCGCCAGGTTTGCGTTAAAGGTCTTTGATTCTTCTTTATCGCCACAGCCAGCAAGAGAGAACACCAGTACGGAAGCCAAAGCCGTGGACAATAACTTGCCAGAAATTTTCATCTAAAACTCCATTTTATATAATAATTGGGCTTTTAAAATAATTTCAATGAATTAATTTAACCCAGTAATAGCAATGTATCAGGGAGAGATAGAATATGACTTTTAGCCGTTATTTAGCAGTCCGGATATGGAGTCTTAGCGCTATTGCTTATTAAGGAAAAAGTTAAAACGTGCGGAGGAGGCGATATGCCAGTCAGGATTAAGCGGTTAAAAAAGCCGGAGCATGCTCCGGCTTGTTGCTTATTTCACCTGTTGGCCAGGCTTCGCGCCGTCATCAGGGCTTAACAGGAAGATATCTTTCCCGCCAGGGCCTGCGGCCATCACCATTCCCTCGGAGACGCCAAAGCGCATTTTGCGCGGCGCGAGGTTGGCGACCATTACCGTCTGGCGGCCGATCAGCGCCTGCGGGTCCGGGTAGGCGGAACGAATGCCGGAGAAGACGTTACGCTTCTCGCCGCCCAGATCCAGCGTCAGACGCAGCAATTTGTCGGAGCCTTCCACGAACTCAGCGTTTTCAATCAATGCTACGCGCAGGTCAATTTTGGCGAAATCGTCAAAGGTGATGGTTTCCTGAATCGGGAAGTCGGCTAACGGGCCGGTAACCGGTGCGGCTGCGGCTTTCACCTCTTCTTTAGACGCTTCAACCAGCGCTTCAACTTGCTTCATGTCGATGCGATTGTAGAGCGCCTTAAAGGTGTTGACCTTGTGACTGAGCAGCGGCTGTTCGATGGCATCCCAGTTCAGTTCGCTGTTCAGGAAGGCTTCAACGCGTTCAGAAAGCGTCGGCAGTACCGGTTTCAGATACGTCATCAGCACGCGGAACAGGTTGATGCCCATCGAGCAAATGGCCTGCAGGTCAGCGTCGCGGCCTTCCTGTTTAGCCACCACCCACGGCGCTTGCTCGTCAACATAACGGTTAGCGACGTCGGCCAGCGCCATAATCTCACGGATAGCTTTGCCGAATTCACGGCTTTCCCATGCTTCGCCAATCACCGCAGCGGCGTCAGTAAAGGTTTTGTACAATTGCGGATCGGCCAGTTCAGCCGCCAGCACGCCGTCGAAACGCTTATTGATAAAACCGGCGTTACGGGATGCCAGGTTGACTACTTTATTGACGATATCGGCATTGACGCGCTGGACAAAGTCTTCCAGGTTCAGGTCGATGTCATCAATGCGTGAAGAAAGCTTCGCGGTGTAGTAGTAGCGCAGGCTGTCGGCGTCAAAGTGTTTCAGCCAGGTGCTGGCCTTAATAAAGGTGCCGCGAGACTTAGACATCTTCGCGCCGTTCACCGTCACGTAACCGTGAACGAACAGGTTGGTCGGCTTACGGAAGTGGCTGCCTTCCAGCATGGCAGGCCAGAACAGGCTGTGGAAATAGACGATGTCTTTGCCGATAAAGTGATACAGCTCGGCGTCGGAGTCTTTTTTCCAGTACTCATCAAAACTGGTCGTGTCACCGCGCTTATCGCACAGATTTTTGAAGGAGCCCATATAGCCAATCGGCGCGTCCAGCCAGACGTAGAAATATTTGCCCGGCGCGTTCGGGATTTCGAAACCAAAATACGGCGCGTCGCGGGAAATGTCCCACTGTTGCAGGCCGGATTCAAACCACTCCTGCATTTTGTTCGCCACCTGCTCCTGCAGCGCGCCGCTGCGGGTCCACGCCTGCAGCATTTCGCTGAATGACGGCAGATCAAAGAAAAAGTGCTCGGAGTCACGCATTACCGGCGTCGCGCCGGACACCACGGATTTCGGCTCGATAAGTTCGGTCGGGCTGTAAGTTGCGCCGCAGACTTCACAGTTATCGCCGTACTGGTCCGCGGATTTACATTTCGGGCAGGTGCCTTTCACAAATCGGTCCGGCAGGAACATGCCTTTTTCCGGATCGTAGAGTTGAGAGATAGTGCGGTTCTTAATAAAACCGTTCTCTTTCAGGCGCGTATAAATCAGCTCGGACAGCTCGCGATTCTCGTCGCTGTGCGTTGAGTGGTAGTTGTCGTAGCTAATATTAAAACCGGCGAAATCGGTCTGGTGCTCCTGGCTCATTTCACCGATCATTTGCTCCGGCGTAATACCAAGCTGCTGCGCTTTCAGCATGATCGGCGTGCCATGAGCGTCATCGGCACAGATGAAGTTAACCTCATGGCCGCGCATTCGCTGGTAACGGACCCAGACATCAGCCTGGATGTGCTCCAGCATATGGCCGAGGTGGATAGAGCCGTTGGCGTACGGCAGCGCGCACGTTACCAGAATTTTCTTCGCGACTTGAGTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP043433|1657142:1666313|1661588_1662308_+|WP_000598637.1|DBSCAN-SWA MIKVLIVDDEPLARENLRILLQGQDDIEIVGECANAVEAIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIEEKRLEKTLHRLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMDDVAFVSSRMSGVYVTSSEGKEGFTELTLRTLESRTPLLRCHRQFLVNMAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >NZ_CP043433|1657142:1666313|1663580_1664039_-|WP_000703145.1|DBSCAN-SWA MKISGKLLSTALASVLVFSLAGCGDKEESKTFNANLAGTEISITYTYKGDKIIKQTSESKISYATVGAKTKEDAAKILDPLSAKYKNIAGVEEKLTYEDTYAQENVSVDMEKVDFKALQQISGTMVSGDTSKGISMKQTQTLLEAAGFKEAK >NZ_CP043433|1657142:1666313|1658952_1659684_-|WP_001240420.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWINNGVQVSKVKVLLSSDSSEQPNGWREQQEILLHYLQSSNLHSLRLWVKERCQDYPAQTLTTNLFVPLRRRLQCQQPALQALLGILDGILINYIALCLASARKKQGKDALVIGWNIHDTTRLWLEGWVASQQGWRIDVLAHSLSQFRPELFDGKTLLVWCGENQTLAQQQQLLAWRAQGRDIHPLGV >NZ_CP043433|1657142:1666313|1662354_1662822_+|WP_000950414.1|DBSCAN-SWA MLSNDILRSVRYILKANNTDLARILALGNVDATPEQIAIWLRKEEEEGFQRCPDIVLSSFLNGLIYEKRGKDEAAPALTAERRINNNIVLKKLRIAFSLKTDDILAILTGQLFRVSMSEITAMMRAPDHKNFRECGDQFMRYFLRGLAAREHAAK >NZ_CP043433|1657142:1666313|1664279_1666313_-|WP_000195340.1|tRNA|DBSCAN-SWA MTQVAKKILVTCALPYANGSIHLGHMLEHIQADVWVRYQRMRGHEVNFICADDAHGTPIMLKAQQLGITPEQMIGEMSQEHQTDFAGFNISYDNYHSTHSDENRELSELIYTRLKENGFIKNRTISQLYDPEKGMFLPDRFVKGTCPKCKSADQYGDNCEVCGATYSPTELIEPKSVVSGATPVMRDSEHFFFDLPSFSEMLQAWTRSGALQEQVANKMQEWFESGLQQWDISRDAPYFGFEIPNAPGKYFYVWLDAPIGYMGSFKNLCDKRGDTTSFDEYWKKDSDAELYHFIGKDIVYFHSLFWPAMLEGSHFRKPTNLFVHGYVTVNGAKMSKSRGTFIKASTWLKHFDADSLRYYYTAKLSSRIDDIDLNLEDFVQRVNADIVNKVVNLASRNAGFINKRFDGVLAAELADPQLYKTFTDAAAVIGEAWESREFGKAIREIMALADVANRYVDEQAPWVVAKQEGRDADLQAICSMGINLFRVLMTYLKPVLPTLSERVEAFLNSELNWDAIEQPLLSHKVNTFKALYNRIDMKQVEALVEASKEEVKAAAAPVTGPLADFPIQETITFDDFAKIDLRVALIENAEFVEGSDKLLRLTLDLGGEKRNVFSGIRSAYPDPQALIGRQTVMVANLAPRKMRFGVSEGMVMAAGPGGKDIFLLSPDDGAKPGQQVK >NZ_CP043433|1657142:1666313|1657142_1658090_+|WP_000569168.1|DBSCAN-SWA MIEFNHVSKTFGDQQAVSDLNLHFSEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGTIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWTVAQNIATVPQLQKWSRARINDRIDELMALLGLESALRDRYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHQLLGRTIVLVTHDIDEALRLADHLVLMDGGNVIQQGSPLSMLTSPENDFVQAFFGRSELGVRLLSLRSVGDYVRRHEQLSGDALVEEMTLRDALSMFVARRCDVLPVANQQGEPCGTLHFRDLLSETSPRETTV >NZ_CP043433|1657142:1666313|1659906_1661592_+|WP_000272845.1|DBSCAN-SWA MYEFNLVLLLLQQMCVFLVIAWLMSKTRLFIPLMQVTVRLPHKLLCYVTFSIFCIMGTYFGLHIEDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSVLIRRGRPDKVFSPLTAGAITCIAELVQMLIILLIARPFDDALHLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSGYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEIVTLADEIEHVNAYLQIEKARFQSRLQVQLDVPSTLSRQKLPAFTLQPIVENAIKHGTSQLLDTGNVAIRARREGQHLMLDIEDNAGLYQPSAGSSGLGMSLVDKRLREHFGDDYGISVACEPDCFTRITLRLPLEEDA >NZ_CP043433|1657142:1666313|1662878_1663409_-|WP_001197951.1|DBSCAN-SWA MQVLRLMALPLFALSLSVSITGCDQKNDTLQGKQNNMTAFIKKIAASKESEETQRYVGNLNGIEIKLTYYYKGDIVLRQISEHKLLYKTLKANNKEEAQKMLSQVGEAYQGMPGLTERIDYYDSYATEYVDIDFTQAKISDLCKLPGSSIDNCSAYYLSMIRSQKLLEESGYHRIN >NZ_CP043433|1657142:1666313|1658073_1658805_+|WP_000824854.1|DBSCAN-SWA MKRLCDPLLWLIVLFLLLLFGLPYSQPFFAALFPDLPRPVYQQESFAALALAHFWLVGISSLFAVVVGVGAGIAVTRESGKEFRPLVETIAAVGQTFPPVAVLAIAVPVMGFGQQPAIIALILYGVLPILQATLAGLGAVPASVMSVASGMGMSRRQQLYQVELPLAAPVILAGIRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIIDRLFERLTRALTRHAK >NZ_CP043433|1657142:1666313|1658785_1658893_-|WP_001261696.1|DBSCAN-SWA MRVAKIGVIALFLLMAIGGIGGVMLAGYSFILRAG |
10 | Enterobacteria_phage(66.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1733511 : 1744018
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP043433|1733511:1744018|DBSCAN-SWA TATGCCCGTGAATAAGTTCTCCCGACGTACCCTCCTGACGGCAGGTTCCGCGCTTGCTGTTCTTCCTTTTCTGCGCGCCTTGCCGGTACAGGCGCGTGAACCTCGCGAGACCGTCGATATTAAGGATTATCCGGCGGATGACGGTATCGCCTCGTTCAAACAGGCCTTCGCCGACGGACAGACCGTGGTCTTACCGCCAGGATGGGTGTGTGAAAATATCAATGCGGCGATAACGATTCCGGCGGGAAAAACGCTGCGGGTACAGGGCGCGGTGCGTGGGAATGGCCGGGGACGGTTTATTTTGCAGGACGGGTGTCAGGTGGTGGGGGAGCAGGGCGGCAGTCTGCACAATGTGACGCTGGATGTTCGCGGGTCGGACTGTGTGATTAAAGGCGTGACGATGAGCGGCTTTGGCCCCGTCGCGCAAATTTTCATCGGCGGTAAGGAACCGCAGGTGATGCGTAATCTCATTATCGATGACATCACCGTTACCCACGCCAACTACGCCATTCTCCGCCAGGGATTTCATAACCAAATGGACGGCGCGCGGATTACGCATAGCCGCTTTAGCGATTTGCAGGGGGACGCCATTGAGTGGAATGTCGCGATTCACGACCGCGACATCCTGATTTCCGATCATGTCATCGAACGCATTGATTGTACCAATGGCAAAATCAACTGGGGGATCGGCATCGGGCTGGCGGGTAGCACCTATGACAACAGTTATCCTGAAGATCAGGCAGTAAAAAACTTTGTGGTGGCCAATATTACCGGATCTGATTGCCGACAGCTGGTGCACGTAGAAAATGGCAAACATTTCGTCATTCGCAATGTCAAAGCCAAAAACATCACGCCCGATTTCAGTAAAAATGCGGGTATTGATAACGCAACGATCGCCATTTATGGCTGTGATAATTTCGTCATTGATAATATTGATATGACGAATAGTGCTGGGATGCTCATCGGCTATGGCGTCGTTAAAGGAAAATACCTGTCAATTCCGCAAAACTTTAAATTAAACGCTATTCGGTTGGATAATCGCCAGGTTGCTTATAAATTACGCGGCATTCAAATTTCCTCCGGCAACATCCCCTCTTTTGTCGCCATCACCAATGTACGGATGACGCGTGCTACGCTGGAACTGCATAATCAACCGCAGCACCTCTTTCTGCGTAATATCAACGTGATGCAAACTTCAGCGATTGGCCCGGCGTTAAAAATGCATTTCGATTTGCGTAAAGATGTCCGTGGTCAATTTATGGCCCGCCAGGACACGCTGCTTTCCCTCGCTAATGTTCATGCCATCAATGAAAACGGGCAGAGTTCCGTGGATATCGACAGGATTAATCACCAAACCGTGAATGTCGAAGCAGTGAATTTTTCGCTGCCGAAGCGGGGAGGGTAAGTACCGCTATTTTTACGAAAATTCCTGGGAAAAAGTTGTTCATACTTAATGTTATGGTGCCGACTAAGACGTAATGTAAAGCGTGCCATCATTATCCCTGGCAGCAGAGTAATTCATGCTGGCGAAAACAAGCTAAAGAGCTATAATTCAGCAACCATTTTACAGGTGGAAGAAACAATGATGAATTTGAAAGCAGTTATACCGGTAGCGGGTTTGGGTATGCATATGTTGCCTGCCACCAAGGCAATCCCAAAAGAGATGCTACCGATCGTCGACAAGCCAATGATTCAGTACATTGTCGATGAGATTGTGGCTGCAGGGATCAAAGAAATCGTACTGGTGACTCACGCGTCTAAAAACGCCGTTGAGAACCACTTCGACACCTCTTATGAACTTGAATCACTTCTTGAGCAGCGCGTTAAGCGCCAGCTTTTGGCGGAAGTGCAATCTATCTGCCCACCGGGCGTGACGATTATGAACGTTCGCCAGGCGCAGCCGTTAGGGCTGGGGCACTCTATTCTGTGCGCGCGTCCGGTCGTGGGCGATAACCCTTTCATTGTGGTACTCCCGGATATTATTATCGATGATGCTACCGCCGATCCGCTGCGCTATAACCTTGCGGCGATGGTGGCGCGTTTCAATGAAACGGGTCGCAGCCAGGTGCTGGCGAAGCGCATGAAAGGTGATTTATCGGAGTATTCCGTTATCCAGACGAAAGAACCTCTGGATAATGAAGGCAAAGTCAGCCGGATTGTGGAGTTTATCGAAAAACCGGATCAGCCGCAGACGCTGGATTCCGATTTGATGGCGGTAGGCCGTTATGTGCTTTCAGCCGACATCTGGGCGGAACTGGAAAGAACCGAACCGGGCGCCTGGGGCCGTATCCAGCTCACCGATGCCATTGCAGAACTGGCGAAAAAACAGTCGGTTGACGCGATGCTAATGACGGGTGACAGCTATGACTGCGGTAAAAAAATGGGCTACATGCAGGCATTTGTGAAGTACGGGCTGCGCAACCTGAAAGAAGGAGCGAAGTTCCGTAAGAGCATAGAGCAGCTTTTGCATGAATAAGTATTAACAACCGTGATAAATGGTTGGTGATAAACATAATAACGGCAGTGAACATTCGAAGCGGCAAGTTGGCTGAAGCGAGTGTTGACTGCCGTTTTAGTTTTGTATAAAGGGCTTAAGTAACAAGGGGTTATCTGGAGCATTTTAATGCTGATTTTATAAGATTAATCCTTGTTTCCGGATGCAATTAATAAGACAATTAGCGTTTAAGTTTTAGTGAGCTTTGCCCTGCTGGGCGAGGTTTGTAACAAGTCGATATGTACGCAGTGCACTGGTAGCTGATGAGCCAGGGGCGGTAGCGTGTGTAACGACTTGAGCAATTAATTTTTATTGGCAAATTAAATACCACATTAAATACGCCTTATGGAATAGAAAAGTGAAGATACTTATTACTGGCGGGGCAGGTTTTATTGGATCAGCTGTTGTCCGCCATATTATTAAGAATACACAGGACACTGTAGTTAATATTGATAAATTAACCTACGCCGGTAATCTTGAATCCCTTTCTGATATTTCTGAGAGTAATCGCTACAATTTTGAACACGCGGATATTTGTGATTCCGCTGAAATAACGCGTATTTTTGAGCAGTACCAGCCGGACGCGGTGATGCATTTGGCTGCGGAAAGTCATGTGGACCGTTCGATTACCGGGCCAGCAGCATTTATTGAAACCAATATCGTCGGCACCTATGTACTTCTTGAAGTTGCGCGTAAATACTGGTCTGCGCTTGGCGAAGATAAAAAAAATAATTTTCGTTTTCATCATATTTCCACTGATGAAGTTTACGGCGATTTACCGCATCCTGATGAAGTTGAAAACAGCGTTACGCTGCCGTTATTTACTGAAACGACGGCATATGCGCCAAGTAGCCCCTATTCTGCGTCAAAAGCATCCAGCGATCATTTAGTCCGTGCCTGGCGGCGTACCTATGGTCTACCAACGATCGTTACCAATTGTTCTAATAACTATGGCCCTTATCACTTCCCTGAAAAACTGATTCCGTTGGTCATTTTGAACGCACTGGAAGGAAAGCCTTTGCCAATTTATGGCAAAGGGGATCAGATTCGCGATTGGCTATATGTAGAAGATCATGCTCGCGCGCTTCATATGGTAGTGACTGAAGGCAAGGCGGGGGAGACTTATAACATTGGTGGCCACAATGAGAAGAAAAATCTCGATGTGGTATTTACCATCTGTGATCTGCTGGACGAGATTGTACCCAAAGCGACTTCTTATCGTGAACAAATCACTTATGTCGCGGATCGTCCGGGCCATGATCGTCGTTATGCCATTGATGCAGGTAAAATTAGCCGCGAATTAGGCTGGAAACCGCTGGAGACCTTTGAAAGCGGTATTCGTAAAACAGTGGAATGGTACCTTGCAAATACTCAATGGGTAAACAATGTTAAAAGTGGGGCGTATCAGAGTTGGATAGAACAGAACTATGAAGGACGCCAGTAATGAATATCTTACTTTTTGGTAAGACAGGGCAAGTAGGCTGGGAGTTGCAACGTTCTCTGGCACCAGTAGGGAATCTGATTGCCCTGGATGTCCATTCAAAAGAGTTTTGCGGTGATTTTAGTAATCCGAAAGGCGTTGCCGAAACCGTTCGTAAGCTTCGTCCCGATGTGATTGTTAACGCAGCAGCACATACTGCAGTAGATAAAGCAGAGTCTGAACCAGAACTGGCGCAGTTACTTAACGCCACCAGTGTGGAAGCCATCGCTAAAGCAGCCAACGAAACTGGCGCATGGGTAGTGCATTATTCAACCGATTATGTATTTCCTGGTACCGGCGATATCCCATGGCAGGAAACGGACGCTACGTCGCCGCTGAATGTCTATGGCAAGACCAAACTGGCGGGAGAAAAGGCCCTGCAGGATAACTGCCCTAAGCATCTTATCTTCCGCACCAGTTGGGTTTATGCAGGTAAGGGCAATAATTTCGCAAAGACAATGCTTCGTCTGGCGAAAGAGCGTCAGACACTTTCAGTCATCAACGATCAGTACGGTGCGCCAACCGGTGCAGAATTACTGGCTGACTGCACGGCTCATGCGATCCGTGTGGCGTTAAAGAAACCAGAAGTCGCAGGTCTTTACCATCTGGTTGCCGGGGGAACCACAACCTGGCATGACTACGCGGCCTTAGTCTTTGACGAGGCGCGCAAAGCAGGGATAACGCTTGCGCTGACTGAGCTTAATGCTGTGCCGACCAGCGCCTACCCGACGCCGGCGAGCAGACCAGGCAATTCGCGTCTCAATACTGAAAAGTTTCAGCGTAATTTTGACCTTATTCTGCCGCAATGGGAATTAGGAGTTAAGCGCATGCTGACTGAAATGTTTACGACGACAACCATCTGATAAATTTAAATGCCCATCAGGGCATTTTCTATGAATGAGAAATGGAAATGAAAACGCGTAAGGGCATTATTTTAGCGGGGGGCTCCGGCACCCGTCTTTATCCGGTGACCATGGCGGTAAGTAAGCAATTGCTACCAATTTATGATAAACCGATGATTTACTATCCCCTTTCCACGCTTATGCTGGCAGGCATTCGGGATATCCTGATCATCAGTACGCCACAGGACACGCCGCGTTTTCAACAACTGCTGGGAGACGGCAGCCAGTGGGGGCTGAATCTTCAATATAAAGTACAGCCAAGCCCGGATGGCTTAGCACAGGCGTTTATTATTGGTGAAGAGTTCATTGGTAATGATGATTGTGCATTAGTACTGGGTGACAATATCTTCTATGGTCATGATTTACCAAAGTTAATGGAAGCTGCCGTTAATAAAGAAAGTGGTGCTACCGTCTTTGCTTATCATGTAAACGATCCGGAGCGCTACGGTGTGGTTGAGTTTGACCAAAGTGGCACAGCCGTTAGTCTGGAGGAAAAACCGTTACAACCGAAGAGTAATTACGCGGTAACGGGGCTGTATTTTTATGATAATAGCGTGGTGGAGATGGCGAAAAATCTTAAGCCTTCCGCTCGCGGTGAGTTAGAAATCACGGATATTAACCGTATCTATATGGATCAGGGAAGATTGTCTGTCGCCATGATGGGGCGCGGTTATGCCTGGCTGGATACAGGGACGCATCAGAGTTTGATAGAGGCCAGTAATTTTATTGCAACCATCGAAGAACGCCAGGGGCTAAAAGTGTCCTGCCCGGAAGAGATCGCATTTCGTAAAAATTTTATAAATGCACAACAGGTTATAGAACTGGCCGGGCCATTATCAAAAAATGATTATGGCAAATATTTGCTGAAGATGGTGAAAGGTTTATAAGTGATGATTGTGATTAAAACAGCAATACCAGATGTCTTGATCTTAGAGCCTAAAGTTTTTGGCGATGAGAGGGGATTCTTTTTTGAAAGTTATAACCAGCAGACCTTTGAAGAGTTGATTGGACGTAAAGTTACATTTGTTCAAGATAATCATTCAAAATCCAAAAAGAACGTACTCAGAGGGCTACATTTTCAGAGAGGAGAAAATGCACAGGGGAAGTTAGTTCGTTGTGCTGTCGGTGAGGTTTTTGATGTTGCGGTCGATATCCGAAAAGAATCGCCTACTTTTGGTCAATGGGTTGGCGTAAATCTATCTGCTGAGAATAAGCGACAGCTTTGGATTCCAGAAGGTTTTGCTCATGGTTTTGTTACTCTTAGTGAGTATGCAGAGTTTCTGTACAAAGCAACTAATTATTACTCACCTTCATCGGAAGGTAGCATTTTATGGAATGATGAGACAATAGGTATTGAATGGCCTTTTTCTCAGCTGCCTGAGCTTTCAGCAAAAGATGCTGCAGCACCTTTACTGCATCAAGCCTTGTTAACAGAGTAAGCATCGTGTCTCATATTATTAAGATTTTTCCATCAAATATTGAATTTTCCGGTAGAGAGGATGAATCAATCCTCGATGCTGCGCTATCGGCTGGCATCCATCTTGAACATAGCTGCAAAGCGGGTGATTGTGGTATCTGTGAGTCCGATTTGTTGGCGGGAGAAGTTGTTGACTCCAAAGGTAATATTTTTGGACAGGGTGATAAAATACTAACCTGCTGCTGTAAACCTAAAACCGCCCTTGAGCTAAATGCGCATTTTTTTCCTGAACTAGCTGGACAGACAAAAAAAATTGTCCCATGCAAGGTAAATAGTGCTGTACTGGTTTCAGGCGATGTTATGACTTTGAAGTTACGCACACCACCAACAGCAAAAATTGGCTTCCTTCCAGGGCAGTATATCAATTTACATTATAAAGGTGTAACTCGCAGTTATTCTATCGCTAATAGTGATGAGTCGAATGGTATTGAGTTGCATGTAAGGAATGTTCCCAATGGTCAGATGAGCTCTCTCATTTTTGGGGAGTTACAAGAAAATACTCTTATGCGCATTGAAGGACCTTGCGGAACATTTTTTATTCGTGAAAGTGACAGACCTATAATCTTCCTTGCAGGCGGTACTGGATTCGCTCCAGTTAAATCAATGGTTGAGCATCTCATTCAGGGAAAATGTCGTCGTGAGATCTACATCTACTGGGGAATGCAAGATAGTAAAGATTTTTACTCTGCATTACCGCAGCAGTGGAGTGAACAGCACGACAACGTTCATTATATCCCTGTTGTTTCTGGTGATGACGCCGAATGGGGGGGAAGAAAGGGATTTGTCCATCATGCTGTGATGGATGATTTTGATTCTCTAGAGTTCTTCGATATATATGCATGTGGTTCACCTGTGATGATCGATGCCAGTAAAAAGGACTTTATGATGAAAAATCTCTCTGTAGAACATTTCTATTCTGATGCATTTACCGCATCTAAATAATATTGAGGATAATTTATGAAAGCGGTCATCCTGGCTGGTGGACTTGGTACCAGACTAAGTGAAGAAACAATTGTAAAACCAAAACCGATGGTAGAAATTGGTGGCAAGCCTATTCTTTGGCACATTATGAAAATGTATTCTGTGCATGGTATCAAGGATTTTATTATCTGCTGTGGTTATAAAGGATATGTGATTAAAGAATATTTTGCGAACTACTTCCTTCACATGTCAGATGTAACATTCCATATGGCTGAAAATCGTATGGAAGTTCACCATAAACGTGTTGAACCATGGAATGTCACATTGGTTGATACGGGTGATTCTTCAATGACTGGTGGTCGTCTGAAACGTGTTGCTGAATACGTAAAAGATGACGAGGCTTTCCTGTTTACTTATGGTGATGGCGTTGCCGACCTTGATATCAAAGCGACTATCGATTTCCATAAGGCTCACGGTAAGAAAGCGACTTTAACAGCTACTTTTCCACCAGGACGTTTTGGCGCATTAGATATCCAAGCTGGTCAGGTCCGGTCATTCCAGGAAAAACCGAAAGGCGATGGGGCAATGATCAATGGTGGTTTCTTTGTGTTGAATCCATCGGTTATCGATCTCATCGATAACGATGCAACAACCTGGGAACAAGAGCCATTAATGACATTGGCACAACAGGGGGAGTTAATGGCTTTTGAACACCCAGGTTTCTGGCAGCCGATGGATACCCTACGTGATAAAGTTTACCTTGAAGGGCTGTGGGAAAAAGGTAAAGCTCCGTGGAAAACCTGGGAGTAAGTAGATGATTGATAAAAATTTTTGGCAAGGTAAACGTGTATTCGTTACCGGCCATACTGGCTTTAAAGGAAGCTGGCTTTCGCTATGGCTGACTGAAATGGGTGCAATTGTAAAAGGCTATGCACTTGATGCGCCAACTGTTCCAAGTTTATTTGAGATAGTGCGTCTTAGTGATCTTATGGAATCTCATATTGGCGACATTCGTGATTTTGAAAAGCTGCGCAATTCTATTGCAGAATTTAAGCCAGAAATTGTTTTCCATATGGCAGCCCAGCCTTTAGTGCGCCTATCTTATGAACAGCCAATCGAAACATACTCAACAAATGTTATGGGTACTGTCCATTTGCTTGAAGCAGTTAAGCAAGTAGGTAACATAAAGGCAGTCGTAAATATCACCAGTGATAAGTGCTACGACAATCGTGAGTGGGTGTGGGGCTATCGTGAGAACGAACCCATGGGAGGGTACGATCCATACTCTAATAGTAAAGGTTGTGCAGAATTAGTCGCGTCTGCATTCCGGAACTCATTCTTCAATCCTGCAAATTATGAGCAACATGGCGTTGGTTTGGCGTCTGTGAGGGCTGGTAATGTCATAGGCGGAGGCGATTGGGCTAAAGACCGTTTAATTCCCGATATTCTGCGCTCATTTGAAAATAACCAGCAGGTTATTATTCGAAACCCATATTCTATCCGTCCCTGGCAGCATGTACTGGAGCCTCTTTCTGGTTACATTGTGGTGGCGCAACGCTTATATACAGAAGGTGCTAAGTTTTCTGAAGGATGGAATTTCGGCCCGCGTGATGAAGATGCGAAGACGGTCGAATTTATTGTTGACAAGATGGTCACGCTTTGGGGTGATGATGCAAGCTGGTTACTGGATGGTGAGAATCATCCTCATGAGGCACATTACCTGAAACTGGATTGCTCTAAAGCAAATATGCAATTAGGATGGCATCCGCGTTGGGGATTGACTGAAACACTTGGTCGCATCGTAAAATGGCATAAAGCATGGATTCGCGGCGAAGATATGTTGATTTGTTCAAAGCGTGAAATCAGCGACTATATGTCTGCAACTACTCGTTAAGAAAATAAGTTTAAGGAATCAAAGTAATGACAGCAAATAACCTGCGTGAGCAAATCTCTCAGCTTGTCGCTCAGTATGCGAATGAGGCATTGAGCCCGAAACCTTTTGTTGCAGGTACAAGCGTTGTGCCTCCTTCCGGGAAGGTTATTGGTGCCAAAGAGTTACAATTGATGGTTGAGGCGTCTCTTGATGGATGGCTAACTACTGGTCGTTTCAATGATGCCTTTGAGAAAAAACTTGGGGAATTTATTGGGGTTCCTCATGTTTTAACGACTACATCTGGCTCTTCGGCAAACTTGCTGGCACTGACTGCGCTGACTTCCCCAAAATTAGGCGAGCGTGCTCTCAAACCTGGTGATGAGGTTATTACTGTCGCTGCTGGCTTCCCGACTACAGTTAACCCGGCGATCCAGAATGGTTTAATACCGGTATTCGTGGATGTTGATATCCCGACATATAATATCGATGCCTCTCTCATTGAAGCTGCAGTTACTGAGAAATCAAAAGCGATAATGATCGCTCATACACTCGGTAATGCATTTAACCTGAGTGAAGTTCGTCGGATTGCCGATAAATATAACTTATGGTTGATTGAAGACTGCTGTGATGCCCTTGGGACGACTTATGAAGGCCAGATGGTAGGTACCTTTGGTGACATCGGAACCGTTAGTTTTTATCCGGCTCACCATATCACAATGGGTGAAGGCGGTGCTGTATTCACCAAGTCAGGTGAACTGAAGAAAATTATTGAGTCGTTCCGTGACTGGGGCCGGGATTGTTATTGTGCGCCAGGATGCGATAACACCTGCGGTAAACGTTTTGGTCAGCAATTGGGATCACTTCCTCAAGGCTATGATCACAAATATACTTATTCCCACCTCGGATATAATCTCAAAATCACGGACATGCAGGCAGCATGTGGTCTGGCTCAGTTGGAGCGCGTAGAAGAGTTTGTAGAGCAGCGTAAAGCTAACTTTTCCTATCTGAAACAGGGCTTGCAATCTTGCACTGAATTCCTCGAATTACCAGAAGCAACAGAGAAATCAGACCCATCCTGGTTTGGCTTCCCTATCACCCTGAAAGAAACTAGCGGTGTTAACCGTGTCGAACTGGTGAAATTCCTTGATGAAGCAAAAATCGGTACACGTTTACTGTTTGCTGGAAATCTGATTCGCCAACCGTATTTTGCTAATGTGAAATATCGTGTAGTGGGTGAGTTGACAAATACCGACCGTATAATGAATCAAACGTTCTGGATTGGTATTTATCCTGGCTTGACTACAGAGCATTTAGATTATGTAGTTAGCAAATTTGAAGAGTTTTTTGGTTTAAATTTCTAA
Protein sequences of DBSCAN-SWA_3 >NZ_CP043433|1733511:1744018|1739273_1739825_+|WP_000973709.1|DBSCAN-SWA MMIVIKTAIPDVLILEPKVFGDERGFFFESYNQQTFEELIGRKVTFVQDNHSKSKKNVLRGLHFQRGENAQGKLVRCAVGEVFDVAVDIRKESPTFGQWVGVNLSAENKRQLWIPEGFAHGFVTLSEYAEFLYKATNYYSPSSEGSILWNDETIGIEWPFSQLPELSAKDAAAPLLHQALLTE >NZ_CP043433|1733511:1744018|1741598_1742678_+|WP_000565913.1|DBSCAN-SWA MIDKNFWQGKRVFVTGHTGFKGSWLSLWLTEMGAIVKGYALDAPTVPSLFEIVRLSDLMESHIGDIRDFEKLRNSIAEFKPEIVFHMAAQPLVRLSYEQPIETYSTNVMGTVHLLEAVKQVGNIKAVVNITSDKCYDNREWVWGYRENEPMGGYDPYSNSKGCAELVASAFRNSFFNPANYEQHGVGLASVRAGNVIGGGDWAKDRLIPDILRSFENNQQVIIRNPYSIRPWQHVLEPLSGYIVVAQRLYTEGAKFSEGWNFGPRDEDAKTVEFIVDKMVTLWGDDASWLLDGENHPHEAHYLKLDCSKANMQLGWHPRWGLTETLGRIVKWHKAWIRGEDMLICSKREISDYMSATTR >NZ_CP043433|1733511:1744018|1742704_1744018_+|WP_000126349.1|DBSCAN-SWA MTANNLREQISQLVAQYANEALSPKPFVAGTSVVPPSGKVIGAKELQLMVEASLDGWLTTGRFNDAFEKKLGEFIGVPHVLTTTSGSSANLLALTALTSPKLGERALKPGDEVITVAAGFPTTVNPAIQNGLIPVFVDVDIPTYNIDASLIEAAVTEKSKAIMIAHTLGNAFNLSEVRRIADKYNLWLIEDCCDALGTTYEGQMVGTFGDIGTVSFYPAHHITMGEGGAVFTKSGELKKIIESFRDWGRDCYCAPGCDNTCGKRFGQQLGSLPQGYDHKYTYSHLGYNLKITDMQAACGLAQLERVEEFVEQRKANFSYLKQGLQSCTEFLELPEATEKSDPSWFGFPITLKETSGVNRVELVKFLDEAKIGTRLLFAGNLIRQPYFANVKYRVVGELTNTDRIMNQTFWIGIYPGLTTEHLDYVVSKFEEFFGLNF >NZ_CP043433|1733511:1744018|1738394_1739273_+|WP_000857535.1|DBSCAN-SWA MKTRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGNDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDQSGTAVSLEEKPLQPKSNYAVTGLYFYDNSVVEMAKNLKPSARGELEITDINRIYMDQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAFRKNFINAQQVIELAGPLSKNDYGKYLLKMVKGL >NZ_CP043433|1733511:1744018|1735092_1735986_+|WP_000981469.1|DBSCAN-SWA MMNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQAQPLGLGHSILCARPVVGDNPFIVVLPDIIIDDATADPLRYNLAAMVARFNETGRSQVLAKRMKGDLSEYSVIQTKEPLDNEGKVSRIVEFIEKPDQPQTLDSDLMAVGRYVLSADIWAELERTEPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKSIEQLLHE >NZ_CP043433|1733511:1744018|1740820_1741594_+|WP_000648783.1|DBSCAN-SWA MKAVILAGGLGTRLSEETIVKPKPMVEIGGKPILWHIMKMYSVHGIKDFIICCGYKGYVIKEYFANYFLHMSDVTFHMAENRMEVHHKRVEPWNVTLVDTGDSSMTGGRLKRVAEYVKDDEAFLFTYGDGVADLDIKATIDFHKAHGKKATLTATFPPGRFGALDIQAGQVRSFQEKPKGDGAMINGGFFVLNPSVIDLIDNDATTWEQEPLMTLAQQGELMAFEHPGFWQPMDTLRDKVYLEGLWEKGKAPWKTWE >NZ_CP043433|1733511:1744018|1736362_1737448_+|WP_000697846.1|DBSCAN-SWA MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLTYAGNLESLSDISESNRYNFEHADICDSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEVARKYWSALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDHLVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYVEDHARALHMVVTEGKAGETYNIGGHNEKKNLDVVFTICDLLDEIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLANTQWVNNVKSGAYQSWIEQNYEGRQ >NZ_CP043433|1733511:1744018|1733511_1734915_+|WP_001144948.1|DBSCAN-SWA MPVNKFSRRTLLTAGSALAVLPFLRALPVQAREPRETVDIKDYPADDGIASFKQAFADGQTVVLPPGWVCENINAAITIPAGKTLRVQGAVRGNGRGRFILQDGCQVVGEQGGSLHNVTLDVRGSDCVIKGVTMSGFGPVAQIFIGGKEPQVMRNLIIDDITVTHANYAILRQGFHNQMDGARITHSRFSDLQGDAIEWNVAIHDRDILISDHVIERIDCTNGKINWGIGIGLAGSTYDNSYPEDQAVKNFVVANITGSDCRQLVHVENGKHFVIRNVKAKNITPDFSKNAGIDNATIAIYGCDNFVIDNIDMTNSAGMLIGYGVVKGKYLSIPQNFKLNAIRLDNRQVAYKLRGIQISSGNIPSFVAITNVRMTRATLELHNQPQHLFLRNINVMQTSAIGPALKMHFDLRKDVRGQFMARQDTLLSLANVHAINENGQSSVDIDRINHQTVNVEAVNFSLPKRGG >NZ_CP043433|1733511:1744018|1737447_1738347_+|WP_001023658.1|DBSCAN-SWA MNILLFGKTGQVGWELQRSLAPVGNLIALDVHSKEFCGDFSNPKGVAETVRKLRPDVIVNAAAHTAVDKAESEPELAQLLNATSVEAIAKAANETGAWVVHYSTDYVFPGTGDIPWQETDATSPLNVYGKTKLAGEKALQDNCPKHLIFRTSWVYAGKGNNFAKTMLRLAKERQTLSVINDQYGAPTGAELLADCTAHAIRVALKKPEVAGLYHLVAGGTTTWHDYAALVFDEARKAGITLALTELNAVPTSAYPTPASRPGNSRLNTEKFQRNFDLILPQWELGVKRMLTEMFTTTTI >NZ_CP043433|1733511:1744018|1739830_1740805_+|WP_000018223.1|DBSCAN-SWA MSHIIKIFPSNIEFSGREDESILDAALSAGIHLEHSCKAGDCGICESDLLAGEVVDSKGNIFGQGDKILTCCCKPKTALELNAHFFPELAGQTKKIVPCKVNSAVLVSGDVMTLKLRTPPTAKIGFLPGQYINLHYKGVTRSYSIANSDESNGIELHVRNVPNGQMSSLIFGELQENTLMRIEGPCGTFFIRESDRPIIFLAGGTGFAPVKSMVEHLIQGKCRREIYIYWGMQDSKDFYSALPQQWSEQHDNVHYIPVVSGDDAEWGGRKGFVHHAVMDDFDSLEFFDIYACGSPVMIDASKKDFMMKNLSVEHFYSDAFTASK |
10 | Enterobacteria_phage(37.5%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1856941 : 1904437
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP043433|1856941:1904437|DBSCAN-SWA ATTATTGTTTCTGAGCAAACTCAAACGGCCTGACCATCTCATACCGATTCTCATCAAGAAAATCAGCCCACCACTGGAGCATCAGGCGGCGTTGGTCAAGATGCTTGGCCTTATGGGTATAGGCTGCGCGAACGCTGTTACTTTCCTTATGGCTCATTTGAAGCTCTACAGCATCTTCTGACCATAGACCAGACTCAATTAAGGCACTACACGCCAGTGTGCGGAAACCATGACCGCAGATGTCCTGTGTGGTGTCATAGCCCATCTTACGTAGCGCCTTGTTAATAGTGTTTTCACTCATGGGTTTGAATGAATCATAACAGCCAGTGAAAATTAATTCTGCTTCGTTACCTTCTTCATAGGTAAGCTGGCGGATCTCTTTCAGTATCTTGAGAGCCTGCCTGCAAAGGGGAACGAAGTGCTGACGTTTCATTTTAGCCCCACGAGTCGAGTGCTTGACGTTTTCAATCGCTTCCCGCTGTTCGGGGATCACCCATAACTTACTTTTGAAGTCGATTTCCGACCACCGGGCGAAACGAAGTTCGCTGGAACGAATGAAGATCAGCAGATTGAGTTTAATCGCTAGCGTAGTCAGTCCACGACCTTTGTAGGCATCAATACGTTCAAGCAGTAGCGGGATCTCTTCCAGCTCCAGTGCAGGGCGGTGTTCCGTCTCTGGTTTCTGGACAGCGCCTTCCATATCATAGGCCGGATTATGACGCATAAGCTTTTGCTGGACGGCATGACGTAGGATCGCGGTGATGTATTGCTTAATCCGCATGGCAATTTCAAGGTAGCCGAGTGTTTCCGCTTTTTTGACCGGGACAAGCAGATCACCCGTATCCAGTTCTGAAACGTTTCTGTCGCCTATATCCGGGAAGACATAGGTTTCAAGGCGCTTCCAGACAGTATCGGCGTAATCTTCTGACCATTTTGTTTTGGTGGCAAACCAGCTTTTGGCGACGACACGGAACGAGCGGGTTTTATCCCGCTTCTCCTGAAGGACTTTTTCATCAGCCTGTTTTTTAGCGTTCGGGTCAATCCCCTGAGTCAGCAGCCTTTTGGCCTCGTCCCGGCGTTGTCTGGCATCGGCAAGTGAAACCGCAGGGTAAACCCCAATGGAAAACACCTTCTGTTTGCCATCAAAGCGATAGCCTAACTGCCAGTATTTTGAACCGTTAGGATGCACCAGCAGATAGAGGCCAAACCCGTCAGTGAGCTTGACGGCCTTTTCCGATGGTCTGGTATTTTTTACTTTGGTATCAGTAAGTGACATGACGGTTCCCTCCGCGTGCTGGTAAAACACAAATCGAACCAGCTTTACCAGCATTTTTACCAGCAAAAGGGTATGGCTTCGAGTGGTTTTTAGTGAACGAGAGTGAACCTGAAGAAGGGAATAACCAGTTGATATAAATGCAGAAAGCAGACGTCAGTGAACGTCTGCTTTCCTTAATTTGGCTCCTCTGACTGGACTCGAACCAGTGACATACGGATTAACAGTCCGCCGTTCTACCGACTGAACTACAGAGGAATCGTGTGAACGGGGCGCATAGTAACGATGTGCGATCGGCTTGTCAAAGGGGGAAATAAGGTTGCGCGTTTGTTTGCTGACAAAAACAACAAAGCGTTGAAGTTTTGATCTAATTTCTACTTTGCCCCGGCATGGCGCAACTTTGTCTGTAATTGCACAAGTCAAATGCTGTGACCTTACCGCAATGGCTATGTGCCGGCGTCTGATGAAACGTGAAAAACTGGCAGGCACTTGGCAAATAATTCTGAGACATAACGCCGTAGAGATTAAGGGCAGGGGGCAGAATGAACTTTAGACGTGAAATATTTTGTAAAAATGGTTGATACAGGCAGTCTGACGCCGGTAGCGGAAATGGCAGATAAATTTCTGGTGCAGGCGAAAAGATTTCCGTCAATATCATAGGCAGAATTATGGCGCATCAGCTTTTGGCGACGACACGGGACGAGCGGGTTTTATCGCGCTTTTCCTGAAGGATTTTTTCATCAGCCTGTTTTTTGCGTTCGGGTCAATCCCCTGAGCCAGCAGCCTTTTGACCTCGTCACGGCGTTGTCTGGCATCAGTAAGAGAAACCGCAAGGTAAACCCCAATAGAAAACACCTTCTGTTTGCCATTGAAGCGATAGCCTGTCTGCCAGTATTTTGAATCGATAGAGGCCAAACCCGTCAGTGAGCTTGACGGCCTTTTCCGCTGGTCTGGCATTTTTTACTTTAGTATCAGTAAGTGACATGACGGTTCCCCCCGCGTGCTGGTAAAACGCAAATCGAACCAGCTTTACCAGCAAAAGGGTATGGCTTCGAGTGGTTTTTAGTGAACGAGGGTGAACCTGAAGAAGGGCATAACCAGTTGATATAAATGCAGAAAGCAGACGTCACTGAACGTCTGCTTCCCTAAATTTGGCTCCTCTGACTGGACTCGAACCAGTGACATACGGATTAACAGTCCGCCGTTCTACCGACTGAACTACAGAGGAATTGTTTCAACGAGGCGCATAATACTGGGCCGGCTATAACGTGTCAACAGTAAAATTAACGCGCCAATTCAATTGGTTAATTAACCCACAAAGTGAGGAATTAATGTTCGGATTCCTCGCCGTAGTCGCCTTCGGCAGCGCTTACCCGTAAGCGCTGAGAAGGATCCTGGCGATAGAACTGGCAAAAACGCTGCCATAGTGCCGGGAAACGTGGAGCAAACAGTTCTGGCGCGCTGAAAAAATACTCTGACAACACGGCAAAACATTCTGCAGGGTCGGTGGCGGCATAGGCATCTATACTGGCAGCGCTTTCGCCAACAAGATCGATTTCATCCTGAATATTATTCATTGCCGCGTGGAGATCGTGTTCCCAGCCAGCCACATCGCGTAACGGGATGAAAGGGATGCCGCTGGCGCGATCGCCATTACGCATATCCAGTTTGTGCGCGACTTCATGAATAATGAGGTTGAAACCCGAAGCATCGAACGAGTCCTGGATATCCAGCCAGTTCAGAATGATGGGCCCTTGTTGCCAGCTTTGCCCCGACTGTACGACACGCTGGCTATGCACCAGACCTATGTCATCTTCCCATTCATCATCTACCACAAAGGGCGCGGGATAAATGAGCACTTCATGAAAACCATCAAGCCACTCAATACCGAGCTCCAGGATCGGTAAGCAAAAAATTAACGCAATACGTGCACTTTTTAACGAGTCGAGCTCAAATCCCTGTAGCGCTACCAGTCTTTTCTGCTGCAAAAAACGTTCGGCTAGCGCAATAAGCCGAGCCTGTTCTTGCGCGGTGAGGTTTACCAGAAGAGGTATAGCCAGCGCATCATCCCACGGCCAGTCTTCGTTCTGGGTTATTTCTTGTGCTTTCCAGGGCCACTTAATCATCGTTTTGCTCGTAAACTCGTCACTTGAACAAAATTACCCGAATAGGGTCTGTTAAAATGCCAAATTACCTGGCATCATTGCAATATACGGAGAGATGCCGGAGCGGCTGAACGGACCGGTCTCGAAAACCGTTGCGGGGGTAACTCCGCCGAGGGTTCGAATCCCTCTCTCTCCGCCACTATTCAAGCACTTACGTGATTTTCTTATAGTGATGAAAATCACGTTGAGAAAAAATGAGAAAATTCGGTGAGAAAAAAACGCCAGAATTTTAACTGGCGCACATCGAAAAGCTCAACGCTTCCTGTCCAGGGTTGGGCTCATTTTCACCTTGCGATCGTAAACAAGAACCTGGGATTCGGTTTTGTGGCCACTGTACTTCTGCTTGTCTTTTGCCGTTCCCTCATAGTCTGAAATCCCCTTTGCCTTTAGATCGTGGAAAGTGCAGTCAAGAGGACGTCCCAGATCATCCCCCGCAGCCTTTCGCGCCTTTCTCCACGCCTCGTTAAATCCTTTATAAGAATAACGCTCGCCATACATAGTCCTGATAACAGGGCCTTCCTCTCCCCATTCACGACATATTTCAACGGCATCACGTAAGCGATCTGTCCAGGATTTAATTTGTTTAACTCCGGTTTTTCCTTGCTGAATAAAAATTCCTTTCTCCAGTATTTGATTCCAGTTCATTTTCAATACATCAGAAACTCTGGCAGCACATAAATAAGCTATTTCCATTGCAGCCCTGACGGCTGGCGTTGCGTTATTATATATCGCTCTGTACTCTTCATCGGTAATATATCGATCGCGTTGAGGCTTAGGAAACTTATCCACACCAACGCAAGGATTACCAGGAACATAACCACGTTGATAACTCCAACGAAATACGCGCGACATAGAGCTGTGTTCATGATTCGCCTGGACACGGCTTTTTTGCCCACGGGCATCCATATAACGCCGGATATGTTCTGGCTTTATTGCTTTAGCTTCGGCATCACCAAATACGGCAAGTATATATTTCTCATGTGCCAGATAATCTTTCTGCGTTCTTGGGGCCAGATCAGCATAATCGGCACTGGCAAGAAATTTTCGCCATAATTGTGTGAATGTAATTCTGTTTTTTCTACCCTCAACTTTTTTTTCGTAAGCCACCCAGACCTCAGCTTTAGTTGCATCAGCTGGAGCTATATTTTCTGTTGATCCTCCCGGTTTCCAGTAGTAACCAGAAGGGCGAAAGAATACACCCTTTGGCATCCACTCATTACCGGGCGCACGTTTGCGACCCATGTTAACTCTCTATAGCGTCAAAATTCATGCCTGGTACAGGCTGATACCCTGCTGGTGGAAGAAGGCGCGAAACTGGGTGATTTATATGATACCAGGTTGTTCTGATTGAACCGTCCCGGCGTTCTATAAAATAAATACCGTTCAGCGTTAATACTTCTTTCTGGAGTGACTTCTGGCTTGCTCCTGTAGCATCTTCCAGTTCCTCCTCAGTCAGGAAGCGATCGCTCATGAGTTGCTTCTCCATCTAACCGGCTGCACCCGGTTTAAATCAGATATTGTTGCTGGTGGGCGGGATCAGTTTCTGCCAGATAGCTGAAACGTATTTCGCCTGATGCCTGGCATCAGCCAGCGCGTTGTGCATATCGCCTTCGAACGGCATGTCACGTTTGGGATCGAAGCCGATTGCACGACCCAAAGTAACCATCGTGCGGACGTCATGATCGTTCCGGTAATTCCACAGGCAGGGGAGGCTGGCACGTTCGAAAGCGCCACGCAGGATAACGTTATCGAAATTGGCACCGTTACCCCATACCTTCAGGTATTTCAAATCGTCGGCGTGACAGGTGATAAAGTCATTGAACTCAATAAGCGCGGTCGTAACAGATACTGCATCAGCGCAAATAGCCGCTCGCGCTTCCGGGCTTTGTCTTAACCACCATAGAATGGTATCGCCATCAGGAACGGCACCTTGTTCCATTGCGCTTTCAAGGCTAACGGCGGTATAGAACTCAGGTCCAATTTCACCACTTTGCGGATCGAAGAACACAGCACCGATGGAGACAACAGGCGCGTTAGGTTTTTTACCCATCGTTTCAAGGTCGACCATTAAGTTGTTCATTCGTTATTTATCTCCGGTTTTGGTGCTGCTGGCACTGGCATCCAGTGTGTAACAAAGATGTGCTCAATACAGTTCATCTGATTGCCGCCTCGCATGTCAAAAAATAGTCCAGAATGCTCATCAAAATATGAAACATAACGGTATCCCAACTTGTTATGAACAATTACTTCTTGCTCGTCTTCTGGCATCCGCTCACTACAGCTTATCCAACCATCCGGAATTACCGGAGAGTTGCCAGCATTTGGATGCAGAGCGCGAATGCCTTCGGCCAATTCTTCCAGGGTGGATGAATATCCGCGTTGGGCATCATTGCCGAACTCGAACGCTCCGGTATCAGGATCAGACCATCCATGCTCGCTGTCGTATGCCTCACGCTGCTGATCTATCCATTTGGCGGCGGCTTCAACGCCATCGCGATAGAAAGTGACTACCGGCGCTGACTGTTCTTTGCCAACCAGACGCGTTATTTCTGATTCCAGAAGTGAACCTAAAACTGTACTCGTGCAGTGCTCAGCCCATTCGTTGTTTTCCAACAGGCTGATGATATTTAGCACATCATTGTAAACGCCAGTTTCCTGTACTGGCTGGGCGGTGTAAAAATACGGTCTGATAGTCCACTTTTTATTCCAGAAATCTCGCGTTTTCTCGGCTTCCTCAAGCGTTGCAACACTACAGCCAACCTTTCCGCACTCTTTGATTACGTGATATCCGGCTGGCTCAGCTTCCAGCGATGCCAGCGCACGCTTCAGCACAATAAGAACTTTGGCGTCGTCATCGCTCAGGCCAAACGGAATATCGTCGCGAGTGTTTTCAAATTCAGCGATAGTTTGCTGTAGCCATTCTTTGGTAATAGTGGTCATGGTGTACTCCAGTTATCTTCGATAGCCACACTAAGTCGGTGCAGCCAGTCAGCTAATTTCAGCATTGCTTCGCGTTCGCTAAGCCCTTCCGGAAAATCTTCAAGTTCAACGAAAGGCTTAAACCGTCCGAAGGCGTCGTTCTTAACTGTCAGCTTCTGTTCAAGCGTGGTCTTCTTAACCTTGCTGTGATGCCGTAGAAGGTAAACAGACTGAGATTTTTCGGTTTCCGGATCGTATTCGTAGGCAGTTAGAATCATCTGGCTGCCGCCGCGATTTAATCCTCTCCACATAGTCACTTCCCCTTGCCGATGCCAGCGGCGTTTAATTTCTGCCGTAATTCTACTAATTTGGTTTGTACCGCTCTTTGCCTGTGCCATTGCAGGACGAGCATTTCGGACTACCGTTGTGGTCGTAATAACCACTGCCGTTACACGCCGTGCAGGGACGCAGCTTCCAGCCAAAAACGAAACGTTGGTAATATTCAGTTCGACGAACTTTGCGTTCGTGGAAGTTACACATCTCACCCCCTCTCAATACGCAGCGCCTGTTTATCGATGTTGCTCATTGGGCTGTCTCCTGTGGATAACAAATATCGTCGAAATATTTTTCTGCTACGCGCATGTTGAAGTGATCGAGATTCATCTCCTTCACCTGGAGTTTTGCCCCAACAATGCCTGTGCATTGATTGACGTAATCCCGGTTTTCTGGGGATTCCGTTACCCACTCCATAAGGTTTTCGGTGACACTTTTTAAGCAACGTAAAGCGCAGTCCAAATCAGTAAAATGCTGAGCATCAGTTATGCAGGAGACGACATAATACGTGGTGACTTTTGGCCCATCAGCGCGTCGTTTAAGCTCTCTTTCGATAGCGTTTTTCAGATCAACCAGTTCATGGTCATTGAGTTTGTCGATGTTGCTCATTGGGCAGCCTCCATTAGTTGCATTACGTGTCGCTTGTGCTCTTCGCTTTGCGGAACACCTGTAAAGTTCACCGCCATGAAATAAGCCAGGCGATCCTTGTACGTCACCTCATCCAGCACAACTGCTGGGAGTGGACGACGCCCAAATGCCAATTGCTCCGCACGAGTCATATCTCGCCAGTAAAGCGGACCATCAGCCAAGATGATTGGAATCTCATTGGTGATGAGTTTTTTCAGGGTTGTGAGACGCTGCTTGCCGTCAACAACTTCTATGTAAGGAAGTTCACGCGAACACCAGTCAGGTGCCTTTGCCAGCGCCACTGAGCCGATAGGAAAACCAGAAATAACTGCGTTTAAGAATGCCTGCTGCTCTTCATGCCCCCAGACATACCCGCGCTGATAATTGGCATCAAAATCAAGTTCACCACCAATGATCCAGCGAATGTACATATCAACCGGGTACTCACCGGTGCGCGCGTCGAATACCTGAGCATTGCGAATTCGGTTGCTCATTGAGCTGCTCCTTCAGCTTTCTTTTCATCAACGCTCCACGCCGTAGCCAGCGCACTTGTTACCTGAAGAAACGAGTGTTTAACCCTCACGGTGAAAGTTTCTCCGGTGGCTGAAATAGTCTCGATGACTGTTAACTCGCCGCCGCTTTTGTAGTCTGGATAGAACTGTGTTACCAGATTGCTATCAACAATCACCGATCCGTCCAGGGTGTACATTTCCAGTTTCATGCTGCACCGCCTTCAACGCGCTCCCACAAAAGTCTTGATCTGATTGCCTTCACTACAGACTCTTTATCTTTTATGCAGCACATTGGCGTAGCTCCATCAGTTCATTAAAGCGGGCCATAAACAGGCCGAAAGCCTGACCGGGGCGAAGTGGGTAGATTTCGAATAAATCTGTCGGGGGGATACCTTCCAGTATTACCCAGGGAATACTGTCATCAATATCCAGATCACGGCGTTCAGTTGCCAGCATGGTCAGATCTGCATACTTCACTACGCTGGCTTCTTCCAGTGGCAAGCCAAACTTAAAGCGGATCAGTTGATCGGTACGTTTCTCAATCTCGCGATAATCAGGCAGTAACGCTTTTAATGGGGCAGGGATATCCTGGCAATACGCTTCGGCTGCGTCGTGCATCAGGGCTTCAAAGGCAAACTCCGGTGATACAAGCTGGCTGCACAGTACGGAATGCTGCGCCACGCTATAAAATTCAGGAAGATGTCCGGAGAAGCGGCAAATATTGGAAAGCGCCACGGCGATATCTTCAATATCAATGTCGTCAATAGTTGCGCTGAGATAATCAAATTGTTTACCTGAAAGTGTTTGAATAAAACTCATCGTTGGTTCTCCTTATAATTTATTTCGCGCTGCACCGCGTGAATTTTGAGTACAGCAACCCAACCCACGATGTGGGGTTAATTGCCGCTATGAGTTATCGCTTGGCTTCGCCGCCGAGGGCAGCCGTTAAATCAGAAATAAGATTACTGAGTTCGCCTGTCATCAGCGTAATGTCAGCATCAAATCGCTGCACGACATCTTCGCTGTCGATATCGTCGTTCTGGCTGATTAACTGATCTGCAAATTTTATGCGTTTCAATATACCGTCACAAGAAAGGGTGAAACTGATACGCTGCTGCCATTCCATTGAAATCTGGGTAACTACCTTCCCAGCTTCGATATGGGTAAGAATTTCATCGCAGGCAAGATCCTGCTTTTTAAACCGGCCTGTGCCACCATCTTCGAGAATAGCTTTAAGGACCGCTTCATCGCCGATGGAGAACCCGGAAGGAGCCGCTTCGCTACGAACCCACTCAGTTAGCGTGAGCTCGATAGGGTTTTCCATCGTCAGCGGCACGACTGGCAAGGAACCTAGGGTTTTACGAAGCAGGGCGAGAGAATCTTCTGCGCGCTTGATGCTGGATGTATCAACAACGATAAACCCGGCTGCAGTGTTTATCCAAATACGAACCAGACTGTTTTTAGTAAACGCCCTGGGTAACAGGGAATGAAGAACCTCATCACGAATAGAGTCTTTCTCAGTTTTCTTAAGACGGCGGCCTTGCTCTCGCTCAAGCGTGGAAACCTTCTTATTAATCTCATCGGCGATCGTTTGTTTAGGTATGATTTTTTCTTCACGACGAATAACCAAAAGTAACTGGTTATTGACTGCATGATATAGCACATCTGAATACTGGACTAATGGTGAAAACCATCCGCTTTTTGCCATATCCTGGCTTCCGCATGGTGAGAAGCGAAACAGCTCAAGTTTCTTATCAAGAGAGTCTATGTCGATGTTAAAGTCGCGGCTGAAGCGATATATCAGCATATTTTTAAAAAATGGGTTATGCATTTTGTTTCCTTAACGCCTCTGCACTGGCGTTTTACGTTGGTTTCTCCACAAAACAGAAAAGAGCACCTGCTGTAACAGCTTTCCGGGTGGATTGGGTAATGAGCCCGTCGCGCGGAGATGCTCTTTTCTGTTGTGTAAAAAGGTCGGCGTCACGGCAGAACACTGTCGCCTTCCTCCTGTTGTTGGAAGAGCCGGACGCCGACAAGACTTCACACAGCAATAACGTTGTGGTGGGGCTGTCACTCAGGCGCATGGTCAACCTGACAACCCGGTGTCCTACTGGGTACAAATGGAGAAAAACCCGCCATACTTACCGCCGCGCCATTTCGCGGATTACCACAACGAAGAGAGCACTGCCGGTGTCCGAATTGAACGGACCTTTTCTCTGCCCAACCCTCCTGACTAAACAGGACTGTCTGGAATCGAACCAGCACTTATGCCTTGCTCGTCAATGCTCTCATCGTTGTGTGCCTGTCTTTTCACCACATCAGGCTCGGTGGACCTTGCTATTCCCCAACAGTAAGGATTCGGGTAATCTTTTTAATTCCCCAACAACATAAGGGCTTAACATGTCTCAGAAGGATGATATTCCTGTCTTTCCCGTAACCGGCTGGCAGGCTGGACCGCTTCCTGGTTACGACGCTCTGGTAGTGAAATTCCAGTTTCTCTCATCACCGATGCAACCAATTGAGTCTGCTCAGGAAACGCAATTTTTAGTACTTACTCCTGAGATGGCTGAGAGCCTGGCTTCAGACTTGCAAAGGCATATTCAGGATTTGCGAAATTCCGACGTTCACAGCCCACAAGAAGGCAAGCACTAATAAGGAACACCTGAACTACTTCATTTCCCTTAAAGCGCCGTTGGGTGATGGCGCTTTTCTTTGCATTAACCAGCATCATTCCCCCTTCGTGACGTTCAGTTTTACTGGCTTTATCACGGCTGCGTAGTTGATAAGAATGTTTACGCATGAAACAACGCACTCGGAACAGATAGCTGGCTCGTCCTTACTTCCTTTAGCGATGAGCTTTTTTGCATCCAGCTCACTGACCCCGCAGAAGGAGCATGTGTAGTAGTTATTCATCTGAACTCCTGTGTAATGCATCATTGCGAATCATCCGGTCATTCGTATGCCACCGGCGGCTACTTCGTGGGCGTCCTGCCTGTTCGCTGCTCTATGAGTGCAAATTACATTTAAATTGCACATTGCGCAAGTATAAAATTGCGATATATGCAATTTTGAGTCAAAAAAAAAGCCACTATAATGGTGGCCTTGTCGACGCTTTCTATTAATTGTGTCGTTTGAGTGACTGCGTCTGGCTTATCAGAACCTTGCCAAAAACACCGAACCTGCACTCGTTGTCTTTGGTAATACTCCATTCCCTGTAGTTAGTGTTATCAGATATCACCAGTAATTTATCGGGGATCATCTGTAGCCTTTTTACGTATATTTTATCATCAAAGCCAAAGACATAGATGCCATCACCATCGAACTGGTTGATGCTTATATCGACAAAAATAAGATCTCCGGGTTCAATTGTTGGTGCCATGCTGTCACCACGCACGTTAATCACTTTAAGCTCAGCGGCAGGACGCCCGCCGAACATAGCTAGTGCTTTGTCCTTGTTATATTCGATAGCATGGATTACATCGATAACATCACCGCCCTGAATGAGTCCATTACCGGCGCTTGCACTGACATCCAGTATCTCGATACGGAACAAATCCTTCACGTTAGCTGAATCCTTCCTCATATCACTGTGTTTACATACAGTATTACCTTTTGGGTCTGAGGTAAAGAGTTCTGCTATATCAACACTTAAGCAGTCAGCCAGCCTAGAAAGTGTTTGTTCGGTAAATTGCTTTTGCTTGCCAGTCTCCAGACGAGAGATGTTTGCGGCATCCACGCCGATCGCTTCTGCTAGCTCAGCAATTTTCATGTTCTTCGCGCGGCGAAGTTGTCTGACACGGTTTCCTATATTCATGCGTTCATTACATTAATTTTTTGCGCATTGTGCAAATCAACTTGCGCAAGTTTGCTGTATGAAATAACATGCGACATACGCAAAAGAAGGAGGTTTTATGCAATCACCATTGAGAAAATTGCGGAAATCGCATGGTTATACGTTACAGCACGTCGCTAAAGGGGTTCAGGTTGATCCTGCAACATTAAGCCGGGTTGAAAGATGCGAGCAGGCTCCTTCAACAGAGCTTGCTGAGCGCCTGGCTCAATTTTACGCCGGAGAAATTAGCGAGATGCAAATTTTGTATCCAAACAGATATCAGCTTAGTGATTCGGCGATTTGACCGCCACCACAGCAGAAGGAGTAGATCCGTGGGACATGAACCTGAATGGAAAGTTGAAAAGCAGCCCCGCTGGCTGGTGGCTGCGATTAAAAAGACGATTTCCAGTCTGCATGGCGGTTATGAAGAAGCTGCGGAATGGCTGGATGTCACCAAAGATGCTCTGTTTAACCGCCTGCGTACTGGTGGTGATCAGATCTTCCCGATTGGGTGGGCGCTGGTACTGCAACGTGCCGGAGGAACCTATCACCTGGCACATTCAGTAGCCAGGGCATCAGGTGGCGTTTTTGTTCCGCTGGCAGATATGGAAGAAGTGGATAACGCAGATATTAATCATCGCCTGCTGGAAGCGATTGAGCAGATCACCAGTTATTCCCAGCAAATCAGGGTGGCTATCGAAGATGGCGTTATTGAGCCACATGAAAAAGCCGTGATTGATGAGGAGTTGTATCAGGCGATCGCAAAGCTGCAACAGCATTCGACACTGGTATACAGAGTTTTTTGCGTGCCAGAAAAGGGTGACGCCCGCGAGTGTGCAGCTCCGGGCGCCGTGGCGTCAAATTTTATGGAGAAAACCAACGCATGAACAGTTTAACGGTAAATAACCGTTTGTCGCAACAACCGGGGATGTATGAGTACCGGCCGTTGCGTCATGAATGCAGATTATCAAATAGCCTGGTCGTGCGTAACCACAGGGAACACAGCCTGACCGTGGGGGATGAATCGTGCAGGAACTTAACCGCTGGTTTCGGGATGGAAGGGGACTTTATGTCCATGTCATTCGCTGGGAACCAGAAACTGAGCGCGTTATCTATCTGCGCAAGGGCTATCCGCATGAGTGTTTTAGCCCTTTGTGGAAATTCAGGCGTGATTTTGTTGAGTGTGAAGCGCCAGGAACACATTGATTCTGCAATTCCGGGACGTTACACTGTTCAGGCACCTCATAAAGCGGGTGCCGGGCGTGGAAACCCGGAATTCAATATAGAGCACAACCGCGCTCATGCGGTTTTTTCTTGTCATGAGCATTGTTACGCCCAAATTATGGTGGGGCGTGCAGGGCCAGTTTCGGCTGGGCCGGGTTCTATGTTGACCGGTATTTCCACCCCTGTACGTCTCACCACCTATAAGGTCGTGGAAAGCCTTGGTGGTGAGTTCATTGAATTCAACATAGAGGCTGCCACTATGGCTACTGTCCCAACCCTCGCTCAACCTGAAATTAGAATTATTAACGGCCAAGCCGTTACTTCCTCCCTGGCTGTTGCCGACTACTTCATCAAGCGTCACGCTGATGTTATCCGTAAAATAGAATCTCTCGAATGTTCCACTCTATTTCGTAAACGCAATTTTGCGTTTACATCGATTTCAATAAATCAGCCCAACGGCGGTACTCGCAAACTCCCATGCTATCAAATCACACGCGATGGTTTTGCGTTTTTGGCAATGGGTTTCACTGGTAAACGTGCTGCTCAGTTTAAAGAGGCATACATCGATGCCTTTAACCAGATGGAGAAACAGCTTTCAACTCCATCGGTGCTGAGCGATGCAGCACATAATGCCAGCGTTCTTTATTCCTACATTTCATCCATTCATCAGGTTTGGTTACAGCAGCTTTATCCCATGCTGGAAAAAGCGGAATCTCCGCTGGCTGTAAGCCTGCACGATCGCATCAATGACGCTGCGGCGCTTGCGAGCCTTATCAATATGACACTGAACCGTTCAGAGGTAAGGGGGCGCAAATGATCCGGAATATTTTTAAACGGTTCACCAGCCAACGTTTTCATTGCCCTCGTCCAGGACAGTGGTACAGCACACCAGAAGGGTACGTTCTGCGTATTAGCCTGGTCGATCGCGAATGTCAGAAGGTTGTCTGTGAGCCTCTTGGGCGTAATTACCGCGTCAACATGCCGCTTATTGCCTTTCGTTCCGGCAAAAACATGAAGCATCTCGGAGGTGCTGCATGAGTTCCCTTATTCAATTACTCGATCGCCCCATCGCCTACAACCCTGCTTTTGCAAAACTGAAAGCCGGGAAGGTAAAAGCTGGCCCGGTTGCGGCAGTATTCCTGTCCCAGCTTGTTTACTGGCATAACCGGATGGATGGCGGCTGGATGTACAAAACACAGGCTGATATTGCCAGTGAAACGGCGCTAACCCGCGACGAACAGGAAACAGCACGTAAACGTCTGGTAGCACTTGGTGTACTGGAAGAAGCCCGTCGCGGTGTACCTGCCACCATGCACTACCGCATCAACACCGCACGGCTTGAAGCGCTGTTGCTGGAAACGGCGAAGCCAGTGAAAAAGGGCGCTCAGGAGAAAACCAGATTGCGGGACTTCCAGAATGTGGAAACCCCGCAATCTGGATTGGTGCAACCCCGCAAACCAGATTGCGGTGATGCCGCAAACAAGAATGTGGAAACCCCGCAAACAAGTACGGGGCAACCCAACGAACAAGCATGTGGCGATCCCACAATCTTTCCTACAGGAGATTACACAGAGACTACTCAGGAGATTACACAGGAGAGTAAAACCCCTTTTTGTCCGGTTGCTGAGCAACCCGACCCCGAAGTGACGCTCACCGATCAGGCGATTGAGGTTTTAACCCACCTGAACCAGGTAAGTGGCTCCCGGTATCAGAAGTCAAAAACCTCCCTGGAAAACATCCGTGCCCGACTGCGTGAGGGGTACAGCGTTGCTGATCTGCAACTGGTTATCGACCTGAAGCATGAGCACTGGCACGAGAACGACGAGCAGTACCAGTACATGCGCCCGGAAACGCTGTTCGGTCCGAAGAAATTCGAGAGCTATCTGCAAAGCGCTACCCGCTGGGATCAGAAGGGGCGGCCTAAACGCGCTGACTGGGGTGCGAAGAAGCGCGATGTGATGGCTTTTGGTCCGGTTGATACAACGATTCCGGAGGGATTCAGAGGATGAGTCTGTTAGCAAAAGTGCAGGCGTTTATCGAGCTTAATCCGGGGCTGACATCAAATGAGATTGCCGATGCTTTTCCTGAATACGCACGCTTTGATGTGCAGCGTTCAGCGAGCAAGTTGTATCGGTGTAAGCGTGTTAACCGCCGCCTGGATGGAGATGTATTTCGCTATTACGCGGGTAAAGACGAGGCAGTGATTTTGACGTTACGACAGAAAAGGTCAGGTCATACAGGTTCGGGTGATCCGATGGTGATTGCAAAGCTGGTAAGCCGCGCTGAAGAACTGGAATCCAGAGGGTTATTTAATCGTGCATCGATAGTGTGGCTGGAGGCATTTAGCGAAAGCCAGTTTATCTACGAACGCGAGGAATTTTTACGCCGCCGTCAGAAGTGTCTGAACCGCATCAAAAAGAGAATCAGACCCGTAGAGCAGGTTTATCTGGCAGGGCGATTTGTGGGGAACGTGGAATGACCAGTGAATCCGTTTGTATTGAAAGCAGTGATGTAACGATATCTGTTGATGAATCCGCTTCGCGCACCTGGCGTCGCCCGTTCCTGAAATGGGCAGGCGGTAAATATTCCATGTTACCCGATCTTTACCAGGTCATTCCGGCAGGTATGCGCCTGATTGAACCGTTTGTCGGCGGTGGTTCGGTGTTTCTCAACTCAGACAAACACGCCTGCTTCCTGCTGGCCGATGTGAATACCGACCTTATCAATCTGTATCAGATGCTGGCTGTTGTACCTGGTGCGGTGATAAGACATGCTAGGGTAATGTTTGACCGTCTCAATGACGCTGAAAGCTATATGGCGCTACGGGAAGAGTTCAATGCTCAGGTGATGGACGCTCCGGAACGCGCCGCCGCTTTCCTTTTCCTTAATCGTCACTGCTTCAATGGCCTGATCCGGTACAACCGCAACAACCAGTTTAACGTTGGCTGGGGCAAATACCCGTCGCCTTATTTCCCGGAAGAAGAAATCAGGGCATTTACCGAAATGGCGCACAACTGCGTATTCATGGCGGCAGGATTTCGCCGGACGCTGGCACTTGCGGGAGAGGGTGACGTTGTGTACTGCGATCCACCCTACGAACCGATGCCCGGCAAGGATGGTTTTACTCACTACGCCGCTGGTGGCTTTACCTGGGATGATCATATCGCGCTGGCGGAATGTTGTGTTGCTGCTCATCAGCGAGGTGCCAGAGTCGTGATCGGCAATTCCACATCTCCGCGTGTTATCGACCTGTACTCGCAGCACGGCTTTGAAATCCGCTATATCAGCGCCCGCCGCTCAATATCAAGTAAGGGCAGTACCCGCGAGAAAGCGAAAGATCTCGTGGCGATTCTGTAGGGGGCGGCATGAAACTGACATTGCCATTTCCACCCAGCGTTAACACCTACTGGCGGGCTCCGAATAAGGGACCGCTTAAAGGTCGTCACATGGTCAGCGCCAGCGGCCGGAAGTATCAGAGCGAGGCGTGCGCGGCAGTGATTGAGCAGTTACGCCGTCTGCCAAAACCTTCAACAGCCCCGGCAGCGGTGGAAATCACCCTGTATCCGCCAGACAAGCGGATCAGGGATCTGGACAACTACAACAAGGCGCTGTTTGACGCCCTGACCCACGCGGGTGTGTGGGAAGACGACAGCCAGGTGAAAAGAATGCTGGTGGAGTGGGGACCAGTTTTCCCGAAGGGGAAGGTAGAAATCACGATCACGAAATTTGAAACAGGGGCGGGTGCAGCTGCCTGAACATGGAGAAAGAAGCATGAATAATTTAATGGTCATTGATGGTATCGAAGTTCGCCGCGACGTTCATGGGCGCTATTGTCTTAACGATTTGCACCGGGCTGCGGGTGGAGAGCAGAAATACCGTCCGAAGTACTGGCTTGATAATAAGCAAACCCGTGAGCTGATTGAGCAACTTTTCACCGAGGGCGGAATTCCACCCTCGGAACAAAATCAATCTGTTAGCTTTTTTCAGGGCGGTAGTGATACCCGAAGTTTGGCACGTGCTCCAGTAAATACTGTTCGCGGTGGTGCTGAACAAGGTACATACGTATGCAAAGAACTGGTATTTGCTTATGCAATGTGGATCAGTCCGTCTTTCCATCTCAAGGTGATCCGCACGTTCGATCGGATTACCAGTGCGCCACAAATATCTTCTGGTATGGCTGCCGATAAGATGCAGGCGGGGGTGATTCTGCTGGGTTTTATGCGCAAAGAGTTAAACCTGTCCAATTCATCGGTACTGGGCGCGTGCCAGAAACTCCAGGAGGCAGTGGGACTACCTAACCTGGCGCCACAATATGCCATTGATGCTCCGGCTGGCGCGCCGGATGGTTCAAGCCGCCCGACGCTTGCACTGAGCGCGCTGTTAAAACAGCATGGTATCCGGATGACGGCTAATCAGGCGTATCAGCAGTTAGCAAAGCTGGGTGTTGTTGAACATCGTGAGCGTTACAGTCGCTCCGCGATTAACGGCATTAAAAAATTTTGGTCGCTGACGGCGAAAGGCTGCATGTTCGGCAAAAACATCACCAGCCCGGCAAACCCTCGCGAGACGCAGCCGCATTTCTTCGAATCCAAATTCCCTGAGCTGCTGAAGCTGCTCGATACCGTTCATTGAGGTGATCGTGAGAGCGTTACTGACCCCTGAAATTGCTCCTCGTATGGGCGTTGTATTGTTCAGGCCGGGATCGGAACTGATGCCCCTGTTTATGCAGGGGCGTGTTCTGCTTGAACCAGAGCCGGAACAATATTCATCTTTCGCCTGCGGCGCGGTCCCGGCGGTATCACAGCCGCTGGCGGATGATCCTGCTGTTCGTGATGTGTTCCGTAATGAGTCGGTTATCTATCGTGCTGGTGGTCTCGATAGTCTGGAAAGCTGGCTACTCCGGGGGAATGGCTGTCAGTGGCCGCATTCAGTCTGGCACAGCGAACAGATGACAACCATGCGCCACGCACCGGGGGCAATCCGACTGTGCTGGCACTGCGATAACCTGCTGCGCGAACAGTTTACGGAACGGCTGGAATCAATAGCTGTGGAGAACACGACAAAATGGGTTTTATCGGTTGTTTGTCGTGATCTGGGTTTTGACGATATGCACGCAGTCACGCTCCCGGAACTGTGCTGGTGGATGGTACGCAATGACCTGGCAGAAGTCTTACCGGAGAGCGCTGCGAGAAAAGCATTAAGGATGCCGAAGGCAATTGTCCAGTCAGCTACCCGTGAAAGTGAAATTGTCCCCTCGGTGCCGGCCACCAGCATTGTACAGGATAAGGCGAAAAAGGTACTGGCACTCAGGGTTGATCCGGAATCGCCGGAAAGCTTCATGTTACGTCCGAAACGCCGTCGATGGATCAATGAAAGATATACCCGCTGGGTTAAATCCCAGCCGTGCGCGTGCTGCGGGAAGCAGGCGGATGATCCGCACCACCTGACAGGCCACGGTCAGGGAGGGATGGGAACAAAGGCGCATGACCTCTTTGTGCTGCCGTTGTGCAGAACGCATCACAATGAGTTACATGCGGACACCGTGGCATTCGAAGAGAAATACGGCTCTCAACTGGAGTTGATATTTCGTTTTATCGATCGCGCGCTGGCGATCGGCGTGCTGGCGTAAATGGAGAACACGCATGAACCTTGAAGCCTTACCAAAATATTACTCACCAAAATCTCCAAAATTGAGCGATGACGCTCCGGCGACAACCTCCGAATCTTTGACGATTACGGATGTAATGGCGGCGCAGGGGATGGTGCAATCGAAAGCACCACTGGGGTTTGCTTTATTCCTGGCAAAAGTTGGTATTCAGAATCCTGACTTCGCGATTGAAGGGCTGATTCATTACGCGGTGGCACTGGATAACCCGACACTGAATAAATTGAGTGAAGAAACTCGGTTACAGATTGTTCCTTACCTCGTGAATTTTGCATTTGCTGATTATTCCAGATCTGCTGCAAGCAAGGCTCGCTGTGAGCATTGTGCTGGTACGGGATTTCATCATGTATTACGTGAAGTGGTGAAACACTCCAGAAATGGTGAACCCGTCATCAAAGAGGAGTGGGAGAAGGAACTATGTCAGCATTGTCATGGTAAGGGAGAAGTCAGCACGGTGTGCAGAGGGTGTAAGGGTAAAGGTATTGTCTTGGATGAAAAAAGAACTCGGTTTCATGGCGCGCCTGTTTATAAGATTTGTGGGCGTTGCAATGGAAACCGGTTTAGTCGTTTACCAACCACACTGGCGCGGCACCATGTCCAGAAACTGGTACCGGATCTGACGGATTATCAGTGGTACAAAGGATATGCAGACGTCATTGATAAACTGGTTACAAAATGCTGGCAGGAAGAAGCATATGCAGAAGCGCAATTAAGAAAGGTGACAAGATGAAAGATTTTCAACGAAGATAGCGACATGATGCTTGCATATTTCAAAAAATATGGATAAGATTCTCCCAACGATGGGCTTTGTATGTCTATCGTTGATAAGTCTCAAGAACCCGCCTCCGAGTGGGTTTTTTATTTGTGATCACTTTATTTTTTGTCTTGCTAAGTTATTGTATGGACAAGAACTAAAATTAAGTGGTGACATTGTGCTCTCTAATAACGAACGTTGGGTTTCCTTTTTTGACTTTGCTTTTACGCCTACACACGCAGCGGCGCCGAGTATTCCCATTGAAGACATACTCAAGAAATTGAAGGTACTGGTGAGCTCAGGGAGTGCTGTAAAGTTATACAATCATAGGTCTAGAGCGCTTAGGATTTCGGAGATGAAATATTCTATTGGGGATAGCCAGGCGACTCTACTTATCCAGCTTTGTGATAAAAATGGTTCTGACCCTGTTTTTGGTGAGTTAACAACAGGCAACCTTAGAGTAGAACCTAAGCTTGCCGGTGAAGGTATCGCAGTTTCTTGTCACATTGTAATATCCACAGATGTTGTCAAAAACACTGCCGATCACCACAAAACTCTCGTTGAATCTGTCCCCGGTATCAGTAAGTCAGTTCTTGAGCCATTTTTAAATGCTATGCTCAGAGAAGCCTTCGCTGGATGTGAGTTTAAAAATCCTGCAACTAAAGGTATGTGCCAGCACCGCCCAAAGCTGGAAATCTATTCTCATGGTTCACAAACGCTGATGGATGCATTAAAAGGTGCAAAGATTCATAACGTTAAACTTGTGAGTACAAGAAGGAAAGGTGGATTGGACCAAACGGCGTACACTGAGCTCTCAGAAAGGTCCGTAAAGTATAAAATCATTAGACAGCCGCCATTGAAAGATAAAGAAAGGTTGTTAGAGATTTTAAGAAAGAAAGGGCAGCAGTCTGGATATACCAAGGTTTCAATTAGTTACTCAAAAGATGGCAAGCAAGCCAGTTTGGATCTTGACCGTAACGAAGATGCTGCCACAAAACTGTTCACTAAAAGTGAGAGGGTAATATTAGGTAACCTCATCAACCAATGTGAAAGCACAGTACATCTGCAGCTTGAAACAAAAATGATAGGGTTGCTCTAACGGGAGTTTCATATGAAACTTTTTTCACCGCTGAGTTATCTCCGCATCAAGCATGAGGAAAAGGACTGGTATGATTACAAAATACCAGCTGCAGTGTCTCTAATCGTCACTATTGTTTATTATTTTCACGCTAGCAAAATTTCTTTAATCGAGACTAACGGACTCCTGCTTCAGGTTAATGGGTTACTTCAAGTCTTGATTGGTTTTTATATCGCAGCACTGGCTGCGGTTTCTACTTTTTCTAGCTCTTCGATCGACGAAGTAATGGCGGGCGTACCTCCGACTCTAGTAGAGAAATTCCGAGGGCAGAAGCTTACTGTAGAACTGACGCGCAGGCGCTTTGTTTGTTACCTTTTTGGTTATCTAGCTCTTGTGAGCTTTATGTTATTTTGCTTAGGGATGATTTCTATTCTGATTGGGAAGCCTTTCCATTTGTGGCTGCTCACATTCTGTTCTCCTGATGCAATCTTGTGGCTTAAAACGGTATTTGTTGGCGTTTATATATTCATCTTAATGAATATCATAACAACAACTTTGCTGGGACTTTACTTCCTTGCAGTTCGGTTCCACCAATCATCGCTGTAAAAAATCTAAATACTTTTAGGCTGCCTTCGGGCGGCCTTTTTTATTTCCCCTCATAACTGAGAGGACCCACATAACCAGAGGGGGATGAATGTCCGAACCTGTATCCAGTGCGACAGTGTTGGCTGGTGGATTAATGGGGGCCAGTGTATTCGGTCTGGCAACCGGAACTGATTATGGTGTGGTATTCGGCGCTTTTGCCGGCGCGGTGTTTTATGTCGCCACGGCAACCAACATCGGACGCATCAGGCTGGTCGCTTATTTTATTACATCATTTATTGTGGGAGTGCTTGGTGCCGGGCTGATAGGTACTAAGCTTGCGGCAATAACGCATTATGAAAAACCACTGGATGCACTTGGCGCAGTGATTATTTCTGCAATGTGTATAAAGTTTCTCACTTTTCTCAACAGTCAGGATCTGAACACCCTGTTCAGTATTCTCTCTCGTATCAGGGGAGGGGGATCAGATGGTAGCAAATGACCCTTCTGCAGCTCTGAATGCCGTAATTTGTGGGGTGATAGTCATCGTTCTGATGTTTTACCGACGCGGTGATGCGACACACCGCCCCCTGATTTCGTTACTGGCCTATGTCATGGTGCTGGTATATGCCAGCGTCCCTTTCCGGTTTGTTTTTGGTTTATATGAATCATCCCACTGGCTGGTGGTGATGGTGAATATCCTTATCTGCGCCGCTGTGCTGTGGGCTCGCGGTAATGTGGCGCGTCTGGTCGATGCACTGAGGCACTGATGAATCAACAACAATTTCAGCAGGCGGCTGGTATTAGCGCCGGGCTTTCTGCACGCTGGTTTCCGCACATTGATGCGGCAATGAAAGAGTTTGGTATTACAGCAGTTAATGATCAGGCCATGTTTATTGCACAAACGGGACATGAATCAGCAGGATTTACTGTTCTGAAGGAAAGCTTCAATTATTCGGTGGAGGCGCTGAAAAAGACGTTTGGTAAACGCCTGACGCCGTATCAGTGTGAAATGCTGGGGCGTATTGATGGTCGCCAGGTTGCCCACCAGCCGCAAATAGCCAATCTGGTTTACGGTGGCCGCATGGGTAACAAAGACGCCGGAGATGGCTGGAAGTATCGCGGGCGTGGTCTGCTTCAAATCACCGGCCGCGAGAACTACGTCAAATGCGGAGCTGCGCTGAAGCTTGATCTGATCAGCACACCAGAGTTGCTGGCACAGGAGAAGCATGCAGCCCGTTCTGCTGCATGGTTTTTCACATTACGTGGTTGCCTGATGTATTCAGGTGATGTTGTCCGTGTAACGCAGATCATCAACGGTGGCCAGAATGGACTGGCTGACAGAAATAGTCGTTATAACAAAGCGCGGGCGGCGTTGCTGGTATGACAGCGGTCTTTGCTTTCGTTAAGGCGCGGTGGAAAACAATCATTGTTTTGCTGATGTTGGCTGGTGCATTTCTTGCCGGGATCATCTGGAGTGATCGGGGCTGGCAAAAGAAGTGGGCTGACCGCAATAGCATGGAATCTTCACAGGAAGCGAACGCGCAGACTGCCGCACGCTGGATTGAACAAGGGCGCATAATTGCCCGTGATGAGGCTGTAAAAGATGCACAAGCACAAGCCGCTAAATCTGCTGCCACTGCTGCTGGCCTGTCTGCCACTGTTAGCCAGCTGCGTACCGAAGCAACAAAGCTTGCCGCCCGCCTGGACGCCGCAAAGCACACCTCAGATCTTGCCGCTGCCGTCAGAAGCAAAACAGCCGGAGCCGACGCCGCAGTGCTCGCCGACATGCTCGGACGCCTTGCAGAAGAAGCTCGATATTATGCTGAGCGATCTGACGAAAGCTACCGCGCAGGAATGACGTGTGAGCGTATTTACAACTCGGTGAGAGAGTCAACCAACAATCCCATAGCCCCGCACTAGCGGTGCTTTTTACCGGAGTTTATATGCCACCAAGAACACCTAAATCCTGTCGTGTTCGCGGCTGTCGCAGTACAACAACAGATCCATCCGGATATTGTGAAAGTCACAGAAGCGAGGGCTGGAAACAATACAAGCCAGGACAATCCCGTCATCAACGCGGTTATGGTTCGAAGTGGGACGTTATCCGTGAGCGCATACTGAAGCGTGATAAAGGTTTATGCCAGTTATGTCTGCGTGCCGGTGTGGTGCGTGAGGCGAAAACTGTTGACCACATTATCCCTAAAGCGCATGGCGGAACTGATGCCGACAGTAATCTGCAGAGTCTGTGCTGGCCCTGCCATAAGGCGAAGACGGCTCGTGAACGGCTGAAATAAAAACCAGTTTCCACAGCCAGAGGGGAGGGGCGGGGTAAATCCCTGTGGCCTGACGTCTTCCGGACTGCCCGCCTCATCAAATTTTTACGCGCCAAAAATAAGAAACTTTTTTCCGGAAGGTTCAACCTATTGAACTGGAGGTTTTGATGGGTGCTGTTGTGAGATCTTCCGGTGGTGGCCGTAAGCGCAATTTGCCTTCGGGCCAGAAAAGCAAGCTGACCAGGATCGCACCGCCGGAAGAGTTAATGAGTGATATCGCGATCCGCATCTGGAAAACGCAGAGCAAAATTTTAATTGAGCGGGGCGTTTTTGATCTTGAAGACGCGCCGCTACTCCTGGCGTACTGCAATGCGTTTCACTTGATGATTGAGGCCGAAAAAGTCATCGCGGAAGAAGGCCTGACCGTATCAAGTGAAATGGGTGGTGAGAAAAAACACCCTGCAGTCAATGTCCGTAATGACTCCGTTTCGCAGCTCGCCCGTCTGGGTTCACTTCTCGGGTTAGACCCGCTCAGCCGCATAAGAATGACCAGCGGAAAAAATGATCCGGACGATGAAGGGAATGAATTTGATGAGTTTGACTGATGGCTACATATCCGAACGTCAATGCGGCGAACCAGTATGCGCGGGACGTCGTGAACGGGAAGATACTGGCCTGCCGGTTAACCATGCTTGCCTGTCAGCGACATCTTGACGACCTGGAACGTGCCAAAGATCCGCATTGGCCTTACCGCTTCGATAAAAATAAAGCAGAACGTTTCCTTCGCTTTTCCCAGAAAATGCCGCACACCTCCGGAGAGTGGGCTCGCCGGAAGTTGCGGATAGAATTTGAACCCTGGCAAAAATTTGCGCTGGGCGTGCCGTTTGGCTGGGTGCGCAAGGATACCGGTTTTCGCCGCTTCACTGAGATTTACATCGAGGTACCGCGTAAAAATGGGAAATCGGCGATTGCGGCCGCCGTCGGTAACTATATGTTCTGTGCAGATGGCGAGTACGCAGCGGAAGTTTACTGTGGTGCCACAACGGAAAAACAAGCCTGGAAAGTTTTTGCGCCTGCACTGGCGATGGTGAAAAAGCTGCCGGCGTTGCGTCAGAAGTTCTGTATCAAACCCTGGGCAAAGAAAATGACTCGCCCGGATGGTTCCCTGTTCGCGCCAATTATCGGTGACCCTGGAGATGGCGACTCACCATCATGTGCGATCATCGATGAGTACCACGAGCATGATACTGACGCGCTATACACCACAATGACTACCGGGATGGGGGCGAGGGAGCAGCCCATCACGCTGATCATCACCACGGCAGGCTTTGATATTGCCTCGCCTTGCTATGAAAAACGTACTCAGGTGGTCGAGATACTGGAGCGCATCCGGGAGGGTGGTGAAAACGAGGCAATTTTCGGGATCATCTATACCCTGGATGATGACGATGACTGGACACAGCCGGAAGCTCTGATCAAAGCCAACCCGAATTACAACATTTCGGTGAAAGAGGGATTCCTCAAGGCTAAACAGTTGCTGGCGATGTCCACGCCAGGCCAGACCAATAAAATACTCACCAAGCATTTCAACAAATGGGTGAGTTCTAAAGCAGCTTACTACAACCTGCAGAAGTGGATGACCGCAGCAGACAAAACGCTCAGACTGTCCGATTTTGCAGGTGAGGAGTGTTATCCCGGCATCGACCTGGCATCAAAACTTGACCTTAATGCAGTGGTGCCGGTATTCCGCCGTGAAATAGACGGCCTGAGTCATTATTACTGCGTTTCGCCTATGTTCTGGGTACCGGAAGACACCGTCTACGCCACGGACCCGGCGTTGAAAACTATTGCAGACCGTTACCAGTCTTTTGTTAATCAGGGCGTGCTGGTTCCGTCAGACGGTGCAGAAGTGGATTACCGCCTTATCCTGGAAGCGATCCTGAAATTACGGGAAACGGTGAAGATAGCCGCGAGTCCGATTGACCCCTACGGTGCAACAGGCTTATCTCATATGCTGCAGGATGAAGGGCTTGAACCTGTCACCATTACCCAGAACTACACGAACATGAGCGACCCGATGCGTGAGATTGAGGCTGCGATCGCTGCTGGCCGATTCCATCATGACGGTAATCCCTTGATGACCTGGTGTATTTCGAACGTGGTTGGCAAGTACCTGCCTGGTAGCGACGATGTTGTTCGCCCGGTGAAAGAAGGCGCAGGCAACAAAATTGATGGTGCAGTTGGCCTGATGATGGGGGTTGGCCGCGCAATGCTGAACGAGCCGAAAGACTTTCTTTCTAACCTCGATCCTGATGAGGAACTGTTATTCCTGTGAAATCACTAATTATCGATGTGGCCGGGGTGGCAGGCTTCGGCGCGCTGGTGGGAGGTATTTACCTCAAATTTGGCGCGGCGGTTGCTCTTATGGCTGGTGGTAGTGGCCTGCTGCTGTGGGCACTGCTGGCGGCCAGGAGAATAAAAACATGCTGATTGATGCCATTTTCAGAAGCAACTCGCTGGAAAACCCAGCTGTTCCGGTCACCGTTGAAGCGGTCGAAAACGACGGGATCTTTAATGGTGATGTGATTGTTAACCCCCGGACGGCAATGAAACTGGCGGCGGTGTATGCATGTATCTACGTTATTTCATCCAACGTTGCGCAGATGCCCCTGCACGTCATGCGGCGAACCGGGAAGAAGGTTGAAACTGCCCGCGACCATCCTGCCTTTTACCTGGTTCATGACGAACCCAATTCCTGGCAGACCAGCTATAAATGGCGCGAGCTCAAACAACGTCACATTCTGGGCTGGGGTAACGGATATACCAGAGTTCTCCGTCACCGCCGAACCGGTGAAGTCACTGGCCTTGAAGCCTGTATGCCGTGGGAAACAACGCTGCTGAACACCGGCGGGCGCTATACCTACGGCGTGTATAACGAAGAAGGTTCCTTTGCCATTAATCCTGATGACATGATCCACGTCAGGGCGTTGGGTAACGATCAGAAAATGGGGCTCAGTCCGGTTCTTCAGCACGCCGAAACCATCGGTATGGGTATGAGCGGGCAGAAATACACGGAAAGTTTTTTCAGCGGTAACGCCAGACCAGCGGGCATAGTTTCAGTAAAAGGAGAATTGAATGACGGCTCCTGGAAAAGGCTGAAAGAGATGTGGCAAAAAGCCACGGCGATGCTGCGCAGCCAGGAAAACAGGACAATGTTGCTCCCGGCTGAACTGGATTATAAAGCGCTGACGGTTTCCCCAGTCGATGCCCAGCTCATCGACATGATGAAGCTCAACCGTTCCATGATTGCCGGGATTTTCAACGTGCCGGCACACATGATCAACGACCTCGAAAAAGCCACCTTCTCCAATATTTCCGAACAGGCGATTCAGTTTGTTCGCTACACAATGATGCCGTGGGTGACGAACTGGGAGCAGGAGCTTAACCGTCGGTTGTTCACCCGCGCCGAACGGGAAGCCGGGTATTACGTGCGCTTTAACCTGGCGGGTTTATTGCGCGGTACTGCCAAAGAGCGCGCGGAGTTCTATCACTTCGCTATCACCGATGGCTGGATGAGCCGCAACGAAGCACGCGCGTTTGAGGATATGAATCCGAAAGACGGCCTTGATGAAATGCTGGTCAGCGTTAACGCCTCCCGGCCAGCCAAATCCACAACCCAGGAGAACACTCAAGATGAGTGAACGTGAAATTCGCTGTTACAGCGGCGAGGTGCGCGCAGAAACGCACGACAGCGAGCCCAGCCGGATCATCGGGTATGGTTCGGTCTTTGACAGCCGTTCTGAACTGATTTTCGGTTCGTTTCGCGAAATCATCCGGCCCGGTGCGTTTGATGAAGTGCTGAATGACGATGTACGGGCGTTATTCAACCATGACCCCAATTTTATCCTGGGTCGCAGAAGTGCGGGCACGCTGGCACTGACGGTTGATGAGCGGGGTCTGCGTTATGACATCACCGCGCCAGAAACTCAGACAATCCGTGATCTGGTGCTGGCACCAATGCAGCGCGGGGATATCAACCAGTCCTCTTTTGCATTTCGCGTCGCCCGCGACGGAGAGGAATGGTACCAGGACGAGGATGGTGTGGTGATTCGTGAGATTACCCGTTTTTCCCGTCTGCTGGATGTCAGCCCTGTGACATATCCGGCGTATCAGGAGGCAGATTCCGCCGTCCGCTCTATGAAAGCCTGGCAGGAGGCGCGCGATAGTAGCGCACTGCAGAAAGCCATTAACCAACGAATGGCGCGTGAGCGCGTCCTGACCCTTCTTAACGCGTAAGGAAAAACCATGAAATTGCATGAACTGAAACAAAAACGTAACACCATCGCGACCGACATGCGCGCGCTGAACGAAAAAATCGGCGATAACCCATGGACGGATGAGCAGCGTACCGAATGGAACAAGGCAAAATCTGAACTGGAAGCACTCGACGAGCGCATCGCCCGCGAAGAAGAGCTGCGCCGCCAGGACCAGACCTACGTTGATGAAAACGAGGAAGAGCAGCGCAATAATCAGGATCCTGATAAAGACCCGCAGCAGGACGAAAAACGCGGCCAGATTTTTGATAAATGGATGCGTCACGGCGCCAGCGAACTGAGTTCCGAAGAGCGCAAAGCCTTACGCGAACTGCGTGCGCAGGGTGTGGCGCCGGATGAAAAGGGCGGCTATACCGTGCCTGATACCTTCCTGGCGAAAGTGGTCGAACAGATGAAATCCTACGGTGGTATTGCCAGCGTGGCGCAGATCCTCGCTACATCCGATGGGCGCACTATGGAATGGGCCACTGCTGATGGTACCGCTGAAGTGGGTGTGCTGCTGGGTGAAAACGAAGAAGCGGGTGAAGAAGATACCGAATTCGGTATGGATAGTCTGGGCGCGCTGAAAATGACATCCAAAATTATCCGCGTATCCAACGAGCTGCTACAGGACAGTGCGATCGACATGGAAGCCTATCTCGCCCGCCGTATTGCGGAGCGCATTGGCCGCGGTGAAGCGCGTTACCTTATTCAGGGGACCGGCACCGGTACGCCAAAACAGCCTAAGGGTCTGAAAGCATCCGTAACCGGCACTACGCAGACGGCCGCTGCCGGAGCTGTTAAATGGCAAGAGATTCTGGCGCTGAAACACAGTATTGATCCGGCGTACCGCCGCGGGCCGAAGTTCCGCCTGGCGTTCAATGACAATACGCTGAAACTCATCAGCGAGATGGAAGACGGTCAGGGCCGTCCACTCTGGCTGCCTGATATCGTCGGCGTGGCGCCAGCATCAGTGCTGAATGTTCCGTACGTTATTGATCAGGAGATCGATGATATTGGCGCGGGCAAAAAATTCATGTTCTGTGGCGACTTCGACCGCTTCATTATCCGCCGTGTTCGCTACATGATCCTGAAGCGCCTGGTGGAGCGTTACGCGGAATTCGACCAGACCGGCTTCCTGGCGTTCCATCGCTTTGACTGTATTCTCGAAGATACCTCTGCGATTAAAGCGCTGGTGGGCAAAGGCTCGGCAAGCAGCTAATCCCTCTCACCTCTGAACAAACCATGCCGCGTTAAGCGGTTTTTTTGTGCCCGCCACCCGGCGGGCGCAGGAGGATCCTATGTTGCTTTCTCCTGAGGAGATCAAGTTGCAGCTCAGGCTGGATGAGGATTACGCCGATGAAGATAAATTTCTTGAGCTGTTGGGGCGGGCGGTTCAGGCCAGGACAGAAAATTTTCTGAACCGGAGACTTTATACGGCGGAGGCGGGGGTGCCAGCCGACGATCCGGAGGGGCTTATTCTCTCGGATGACATCAGGATGGGGATGCTGCTTCTGGTGACGCACTTCTACGAGAATCGTTCTACCGTCACCGAAGTGGAGAAAGTCGAACTGCCGATGAGCTTTAACTGGCTCGTCGGTCCATACAGGTACATCCCGCTATGAAACTCAGGCAGGCGCAGGCCAGCGCCACATACCTTTTGCCCGACCCAGGCGAACTTGACCAGCGCATTGTTATCCGGCGGCGTGTCGATGTTCCGGCTGATGACTTTGGCGTAACGCCGACGTACCCGGAGCAGATCCGGGCGTGGGCCAAAAAAGCGCAACCCGGCGCGGCAGCTTATCAGGGGGCTGTGCAGATAGAAAACAGGGTGACGCACTATTTCACCATCCGTTTTCGCCGCGGTATCACCGCCGATCATGAAGTGCTCCACGACGATATTTCTTATCGGGTTAAACGGGTCCGTGATCTGAACAGTAAACGCCGCTTTCTGTTGCTCGAGTGCGAAGAGCTGGGTACCGATAACGGGAGTGACTATGCCGCAGAAAGCATATTTACACGTTGATTTTGAACAGCCGGAAACGCTTGTTTTTAACCGGGCGCGTATGCGCCGGGCGTTTGTCAGTATCGGGCAGGTACATATGCGCGATGCCCGCCGCCTGGTCATGAAGCGGGGGCGTTCCGGACCCGGCGATAATCCTTCATACAGAACGGGAAAACTGGCACGCTCCATCGGGTATTACGTTCCGCGGGCATCCAGTCGCCGTCCTGGATTGATGGTGAAAATTGCCCCTAATCAGAAGAACGGGGAAGGGAACCGCCCGATCTCAGGCGCATTTTACCCTGCCTTTCTGTTCTACGGTGTTCGCCGTGGGGCGAAGCGTAAGAAAGGCCATCATCGAGGCGCATCAGGCGGCAGCGGCTGGCGTGTGGCACCACGTAACAACTACATGACTGAGGTTCTGGATAAACGCCGCAGCTGGACACGTTATGTGCTCTCCCGCGAATTGCGAAAATCACTCCGTCCTCAGCGAAGGAAGAAAAAATGAAATTAACCCCGATTATTGCGGCACTTCGCAGCCGTTGCCCTCGGTTTGAAAACCGTGTGGGTGGCGCAGCGCAGTTTAAAGCGATACCGGAGGCCGGAAAGCTCAGGCTACCAGCCGCGTATGTTGTGCCAGCCGAAGACGTCACGGGTGAGCAGAAATCGCAGACCGACTACTGGCAGGATTTGACGGAGGGTTTTTCCGTCATCGTGGTACTCAGCAACGAACGGGATGAAAAAGGGCAGTGGGCTTCTTACGACGCAGTTCACGACGTCAGGCAGGAAATCTGGAAGGCGCTGCTGGGGTGGGAACCGGACCCGCAGGCGCATGAAATTCAGTATGCGGGTGGGATGCTTCTCGATCTGAACCGCCACGAACTGTATTACCAGTTCGACTTCACGGTGAAGTATGAAATTACCGAAACAGACACCCGCCAGCAGGATGATCTGGACGGCCTGCCCGACCTTAAAACGCTCAGTATTGATGTTGATTTTATCGAACCCGGTACCGGGCCAGATGGCGACATCGAGCACCACACCGAAATTACATTTCAGGAATAAACCATGTTTGTGAAACCCGCAAAAGGGCGATCGGTTCCCGATCCGGCCCGTGGCGACCTTTTACCTGAAGGAGGTCGAAATGTTGATGAGAATAACTACTGGCTGCGCCGCGAGGCCGCTGGTGATGTCCGGCGCACGAATAAAAAGGTGAAAACAAATGGCGATTAGTTTTAATTCCATCCCGTCAGATACGCGGGTTCCGCTGTTTTATGCCGAGATGGATAACTCGGCGGCAAATACCGCCCGGGACAGCGGGGCATCACTGCTGATTGGTCACGCCAGCAATGATGCGTCAATTGCCGTCAACAGTCTTGTTCTGGTGTCATCGGTTGATTATGCCCGTCAGATTTGCGGTGCAGGAAGCCAGCTGGCCCGTATGGTCGGGGCGTACCGTAAGACCGATCCATTTGGCGAACTGTATGTCATTGCCGTACCTGAATCCACAGGCGCGGCAGCAACCGTCGCTTTGACGGTAACTGGCGAAGCGACGGAAACCGGAACGGTGAATGTCTATACCGGCCGAACCCGCGTTCAGGCTCCCGTGACCAGCGGTGATGACGCTGCGGCGGTGGCTGTGAGCATTAAGGATGCGGTCAATGCAAACCCTGATCTTCCCTTTACGGCAACATCAGAAGCGGGGGTGGTGACACTGACTGCGCGCCACAAGGGGTTATATGGAAATGAAATTCCGGTCACTCTCAATTATTACGGCTTTGGCGGTGGGGAGGTGTTACCGGCAGGTGTGAATATTACGGTTGCCAGCGGCGTGAAGGGGGCTGGTGCGCCAGCTCTTAACGACGCGGTGGCAGCGATGGGAGATGAGCCGTTCGATTATATCGGCCTTCCGTTTAACGACACGGCATCGGTGAACACGATGGCAACTGAAATGAATGATTCCAGCGGTCGCTGGAGTTATATCCGGCAGTTGTATGGTCACGTTTATACGGCGAAGACGGGGACGCTGTCGGAGCTTGTGGCCGCGGGTGACCAGTTTAACCTGCAGCACATCACCCTGGCGGGCTATGAGAAAGACACCCAGACGCCTGCTGATGAACTGGCTGCAAGCCGTACTGCCCGTGCTGCGGTTTTTATCCGTAACGATCCGGCGCGCCCGACCCAGACCGGGGAACTGGTGGACATGCTGCCGGCACCGAAAGGCAAACGCTTCACGACGACTGAACAGCAGACGTTACTTTCCCACGGTGTGGCAACGGCGTATGTGGAAAGCGGCGTGCTGCGTATTCAGCGGGATATCACGACGTACAGGAAAAATGCGTATGGTGTGGCGGATAACAGCTACCTTGACAGCGAGACGCTGCATACCAGTGCTTATGTGTTGCGCCGTCTGAAATCTGTTATTACCAGTAAATACGGGCGCCATAAACTTGCTAATGATGGTACGCGTTTCGGGCCTGGTCAGGCCATTGTCACGCCTGCCGTTATCCGTGGTGAGCTGGGATCAACATATCGCCAGCTGGAGCGGGAAGGCATCGTGGAAAACTTCGATCTGTTCCAGCAACATCTGATAGTTGAGCGTAACGCGAACGATTCGAACCGCCTTGATGTGCTGTTTCCGCCTGATTATGTCAATCAGTTACGTGTGTTTGCGGTGCTTAACCAGTTCCGTCTGCAGTACAGCGAGGAGGCTGCATAATGGGAAAAATTGCGGGAACAACATATTTCAAAATCGACGGACAGCAACTGTCGGTAACCGGAGGGATTGAAGTCCCCATGAACACCAAAGTTCGTGACGACGTGATTGGCCTGGATGGTTCCGTTGACTACAAGGAAACCAGCCGGGCACCGTATACGAAGGTGACCGCCAAAGTGCCGAAAAACTTCCCGGTCGATAAAATTACGTCTTCTGATGTCATGACCATCACATCAGAGCTGGCAAATGGTCAGGTGTATGTTCTCTCAAACGCCTGGCTGCACGGCGAAGCCAACCATAACCCGGAAGAGGGCACCGTGGATCTTGAGTTCCACGGTGAGGAGGGATTTTACCAGTGATAAAAGAACTTGTGCTCAAAAAGCCGATTATGGCGCATAACGAAAAGCTTCATGTGCTGGAGCTGCGCGAACCGTCCTACGATGAAATCGAAGCCATTGGTTTTCCGTTCACCGTTTCCGGTGATGGCGGCGTCCGGCTGGACAGTTCGGTTGCTCTGAAATATATCCCTGTGCTGGCAGGTATTCCACGCTCCTCGGCAGCGCAACTGGCAAAACTGGATATTTTCAAAGCCTGTATGTTGATCCTCAATTTTTTTACCCGGTCGGAGACGGAGGAGGACTCAGAAAGCGGGTCTACAACACCGCATACTTCTGGCGAATAAATCCCCTGGAGCTCCGGCGGGCGGCGATATCCGATTTTCTGGAGCTGGAGTCGGAGGCTGTCCGTATCAATGAGGAAATGAAGCATGGCTGACAGTTTCCAGTTAAAGGCCATTATCACTGCCGTTGACCAGTTATCGGGTCCGCTGAAAGGGATGCAGCGGGAACTGAAGGGATTTCAGAAAGAAATGGCCGGGCTGGCGATCGGTGCTGCCGCTGCCGGGACCGCTGTTCTTGGGGCGCTGGCGCTGCCCGTGAATGCTGCGATCGGCTTTGAGTCAAAAATGGCTGACATCCGGAAGGTGGTTGACGGCCTGGATGATAAAAAAGCATTCGCGCAGATGAGTGACGATATCCTGACGCTGTCCACACAGTTACCGATGGCGGCGGAGGGAATTGCAGAGATCGTGGCGGCGGGCGGGCAGGCAGGCATTGCCCGCGGCGATTTGATGCAGTTTGCGAACGACGCAGTGAAAATGGGTGTGGCGTTTGATACCACTGCCGAAGAGTCCGGTCAGATGATGGCGCAGTGGCGGACAGCGTTCAGACTGACGCAGGAAGACGTGGTTGTCCTGGCCGATAAAATCAACTATCTGGGGAATACCGGCCCGGCAAATGCGAAGAAAATTTCTGATATCGTGACGCGGATTGGTCCGCTGGGCGGTGTTGCCGGAGTGGCATCCGGCGAAATTGCCGCGATGGGCGCCACCATTGCCGGGATGGGGGTTGAATCAGAAATTGCCTCCACCGGCATCAAAAACTTCATGCTGTCGTTAACCGCAGGTAATTCGGCAACCAAAGCCCAGAAACAGGCTATGGCTTTCCTGAAGCTGAATCCCCGGAAACTCGCTGAGGATATGCAAAAGGATTCGCGCGGGGCCATGCTGAAGGTGCTGGACTCGCTCGCGAAAGTGCCAAAAGCTAAACAGGCCGCCGTCATGAATGCGCTGTTTGGCAAGGAGTCACTTAGCGCGATTGCCCCGCTGCTGACCAACCTGGATTTGTTACGCACCAATTTTGATCGTGTGGCTGATGCCCAGGAATATGGCGGCTCGATGCAGAAGGAATACGCATCCCGCGCGTCCACAACAGAAAACCAGCTGGTTCTGCTGAAAAACAGCGTCAATGCGATTTCGGTAACGCTGGGCGATACCTTCCTGCCCGCCATTAACGAAGCTGCAGAAGCGGTCATGCCTTACCTGGAGCAGCTCCGGACATTCGTTCGCGCGAATCCTGAACTGGTTCAGTCTGCGGCGAAGTTCGGCGCGGCGCTGCTGGCTGTTGGCGTATCCATTGGCAGCCTGTCCCGGGCTGTCAAAATCCTGAACAGTGTCATTAACCTCTCTCCGGCGAAAGTCGCCATTGCGGCGCTGGTGGCCGGCGCTATGCTGATCATTGAGAACTGGGACGATGTTGCTCCGGTGATTAAGGCGGTATGGCAGGAGGTCGATAACGTTGCGCAGGAGATGGGCGGATGGGAGACGGTGATTGAAGGGGTTGGTCTGGTTATGGCTGGTTCTTTTACCGTCAGGACCATTGGTGCCCTGCAGCAGTCCGTCCTGCTGGCCGGACGGCTTTCCGGTCTGCTGGGTAAAATTGGCCGGATGGGGGCCATGACGCTGACAATTGGCGTGGCGGTGTCACTCTTTAAAGAGCTTAAGGATCTGGAGCAGGGGGCAAAGGATGCGGGTATGGATGCTGGCGCATTCGCTGTACAGAAGCTGCAAACGAAGGAGCGTGAACGCGGGTATAACGGTTTTATTCCCAGACTCAAAGAGCTTCTTGGTATGGACACCCCGATTCCGCAGGGGCGTTATCAACCTTATGTGCCACTGACCCGGCGTTCTGGCGTACTCGGGCGAGCTGTCCCGCCATCAACGCAGCGCAGCGAACTCAAAGTGACATTTGAGAATGCACCACAAGGTATGCGTGTGACTGATATACCGAAATCCGGTAATCCATTGATGAACATCAGCCATGATGTGGGTTACTCACCCTTTCGTACATCACGATAAACCTGCTCCGGCAGGTTTTCTTATGGGGTAAATATGGCTTTTTTCTCCTCAACTGGCTGGCGCGGGCGCCTGCGTGATGCATCATTTCGTGGAGTGCCTTTCTCCGTTGAAGATGATGAAAGCACGTTTGGACGCCGCGTACAGGTACATGAATATCCGAACAGGGATAAGCCCTGGACGGAGGATTTAGGTCGCGCCACGCGCCGCCTGACGATAAATGCTTATCTTGTCGGTGATGATTACGCAGACAGGCGGGATCGTCTTATTGGTGCCATTGAAACCGCAGGCCCTGGTACGCTGGTCCATCCGCAGTATGGCGAAATGCAGGGCAGCATTGACGGACAGGTCAGGATCACTCACAGCAGTACAGAAGGGCGCATGTGTCGTGTCTCCTTTCAGTTTGTGGAAAGTGGTGAACTTTCTTTTCCTGTGGCGGGAATGGCAACGGCGAAGCGCCTGGAAACATCAGGCGGGCTTTTCGACGATGCGATTGACAGTATGTTTTCCACATTCTCGTTGTCAGGTATTTCTGATTTTATCCAGAACGATGTCATTGCCGATGCTGCCTCCATGCTGGGCGATGTTGCCGATGCTTTCAGGATGGTTGACTCCGGCGTGTCTGCCGCAATGCGGCTGTTACAGGGGGATTTGTCTGTCATTCTGATGCCACCGGGCGCCGCAAGTGATTTCGTTAACGCACTGCAAAAAGCCTGGCGCTCAGGTGACAGGCTCAGAGGCAGTACATCGGATCTGGTCACGATGATAAAAACGATGTCAGGTATCACCCTTGATCCCGGTCTTTCCCCCCGTGGCACCTGGCCCACTGACTCCGGATCTGCTGCGAAACAGAAAATGCAACGCAATATGATCGCAGCCGCCATCAGGACAACAGCCATCAGCACAGCCGTCCACGCCGTGACAACACTGAAGCAGCCGCGTGATGTACCTGATGTCCGGGGCGTAAATCAGCCTGCAGGAACAGGCCGTGACTCAGACATTATCACTGTCATGCACCCGGCGCTGGATGGTGTACAGACAGTCAGTAATGGCAGCTTTCCACCGAATTATGAAGATCTGAAAGCTATCCGGACCGCGCTCAATGCTGCGATTGACCAGGAGCAGTTGCGTATCCGGGATGATGTGCTTTTCCAGCAAATTTCCGTTATGCGGACGGATCTCAATCGCGATATTTCTGCACGACTGGCACAGGTTGAACGTACTGCATTGCGAACGCCTGATGATGTTCTGCCTGCACTGGTACTGGCTGCAGCCTGGTATGACGACGCCGGGCGGGAATCTGATATCCTCACTCGTAATCCCGTTCCCCATCCGGGATTTATCCCGGTTGAGCCGCTGAGGGTTCCGGTACGATGAATAATACGGTTTTTTTACGCGTCAACGGGCGTGACTGGGGAGGATGGACGTCAGTACGGATAAGTGCGGGCATTGACCGTATTGCCCGGGACTTTAATGTCTCGATCACCCGGCAGTGGCCTGGTGGAGAAGACGTACCGCCAGTAAAAAATGGTGACGCTGTAGAGGTACTCATTGGCGATGATTTAGTTATTACCGGCTGGGTTGAGGCGTTACCGCTACGTTATGATGCGCAGACCATTATGACGGGCATTGTCGGGCGCAGCAAAACGGCAGATCTTATCGACTGTTCTGCATCGCCTGCACAGCATAACGGGAAAAATTTATTCCTGATCGCCAGCGCACTTGCCCGGCCATTCGGTGTGGACGTTGTTGATGCAGGCGCGCCGGCAGCCGCCGTTATTGAGGCTCAGCCGGAACATGGTGAAACGGTTGTGGACTGTCTGAACAGGCTGCTTGGACAGGCTCAGGCGCTGGCATATGACGACGAACGGGGACGGCTGGTTCTCGGCAGGCCGGGCAGTATGAAAGCAGCCACGGCACTGGTACTTGGCGAAAATATTCTTTCCTGTGATACCGAGCGTAGTGTTCGTGAGCGTTTCTCCAGTTATCTGGTTACGGGGCAGCGTCCTGGTACGGATGACGATTTCGGCGAGGCAACCATTGCTGCTATCCGGCAGAGTACTGGTGATGCAGGCGTCACGCGGTATCGTCCCCACACCATTCAGCAGTCAGGAACTGCCACAACTGACAGCTGCAAATCCCGCTGTGAATTTGAAGCCCGTCAGCGTGCGGCGAAAACGCTGGAAACCACCTATACCGTACAGGGATGGAGACAGGGGAATGGCGAATTGTGGAAACCGAATCAGGCCGTGGTGGTGTATGACCCGCTGAACGGTTTTGACAATGAAACGCTGGTGATCGCCGAAGTGACGTACAGCCAGGACAATAACGGCACCCTGACCGAAATCCGGGTGGGGCCTGCGGATGCCTATCTTCCTGAACCATTCAGGCCGAAAGCGAAGAAAAAAGTCAGTGAGGAGGCAGATTTCTGATGGCTAACCATCCTCTTCAGAACATGATAACGCGCGCAGTCATTACCGCGATTGATACCGTCAGAAAATGCCAGACTGCCGGACTGAAACTTATTGCCGGTGAAAAAAAAGAGAATGTGGAGCATCTTGAACCTTACGGTTTCACCTCTGCAGCACAGAATGGCGCAGAAGCGGTGGTATTGTTTCCCGGCGGTGGCCGTTCGCACGGAGTGGCTGTGGTTGTGGCTGACCGCCGCTTCAGACTGAAAGGGCTGGCGCGCGGGGAAGTCGCGCTATATGACGATCAGGGGCAGTCGGTCACATTAACCCGAGCCGGAATAGTGGTAAATGGCGGCGGAAAGCCAGTTATTTTCACGAATGCCACTAAAGCACGTTTTGAAATGCCGATCGAATCCACTGGCGATATCAGGGACAACTGTGACAGCAGTGGAAAAACGATGGCTGAAATGCGCACGACCTATAACGGTCATACCCATAAAGAAAATGGCGATGGCGGCGGTATAACCGATAAGCCTGGCCAACCCATGAGCTGACACCATGATCCTTTATGTTAATGGAATCCGTAAGGATGCCACGGCTTCGCTCGACTTTCTGACGCGGGCAGTGGTGATTTCTCTTTTTACCTGGCGCCGGGCGGAGCGGGATGACAGGACCCCACAGCCATACGGCTGGTGGGGGGACACCTGGCCTGCTGTTCAGAATGACCGCATCGGTTCCCGCCTCTACCTGCTGAAACGCCGCAAACTCACCAATAAAACGCCGCAGGATGCCCGCGAATACATGCAGCAGGCGCTGGCGTGGATGACAGACGATGGCGTGGCGGCACGTATTGATGTGACATCTGAACGCACAGGAACAGATACCCTGGCAGCTGGCGTGACGATATATCAGCGGGACGGGGTAATTCACAATATTACATTCGATGATATATGGAGCAAACTTAATGGCTGACAGTCAATTTGCACGTCCTGAACTTCCTCAGTTGATTGCTACCATTCGCAGCGATTTACTGACCCGTTTTCAGCAGGATGTTGTGTTACGTCGCATGGATGCCGAGGTTTACAGCCGGGTACAGGCTGCTGCCGTACATACGCTGTATGGTTATATCGATTATCTGGCCCGGAATATGCTGCCTGATATGTGTGATGAGGACTGGCTTTACCGTCACGCGAGGATTAAGCGTTGTCCCAGGAAAAATGCCGTATCTGCGAAGGGATTTGCACGCTGGGATGGTATTGCCGGAACGCCGGAGATCCCCGCGGGTACACAGATTCAGCGGGATGATCAGGTTACATTCACGACCCTGCAGACGGTGAAAGCTTCCGGCGGCCTGTTACGTGTGCCGGTTATTGCTGATGTGGCGGGAACTGCCGGTAATACTGACGATGGTACGGCGTTACGCCTTGGCACGCCGATTACTGGTATTCCTTCTACAGGTTACGCTGACACTCTGACCGGGGGGGCTGATACAGAGGAGCCTGAAACGTGGCGCGCGCGCGTCATGGAACGCTATTACTGGATACCACAGGGGGGCGCTGATCCTGATTACGTCATCTGGGCAAAGGAAATCGCGGGAATAACCCGTGCGTGGACATTCCGCCATTATAAGGGGACCGGCACCGTTGGTGTGATGGTGGCTACCAGTAACCCGGTGAATCCGGCTCCTGGCGACGATCTCGTTAAGGCTGTACGTGACCATATTTTGCCGCTGGCACCTGTTGCTGGCGGCGGACTCTTTGTTTTCGCTGCCACTGAAAAAAGCATTCCGGTAACAGTCGCACTGGCCAAAGATACCCCGGAAATTCGTACTGCCATTATTGCGGAGCTAAATGCGCTGATGCTGCGTGATGGCGCGCCGTCCGGAAAAATTTATGTTTCGCGAATCAGCGAGGCGATAAGTCTGGCGACCGGGGAAGTGGCACATCAGCTGCGTGTGCCGGCGGCAGATGTGGTGCTGGGAAAAACTGAACTTCCTGTCCTGGGGAATATAACCTGGGCCACCTATACCGGGGAGAACGGATAACTATGGCATTACAGGACGAATATACGCAGTTACTTTATCACCTTCTGCCGGAAGGGCCTGCCTGGGACGGAGAAAACCCACTGATTGAAGGGCTGGCGCCGTCGCTGAACCGGGTACATCAGAGAGCGGATGAACTGATGGCTGAAATTGATCCGGCCAGAACCACAGAACTGATAGACCGTTATGAACAGCTGTATGGCCTGCCTGATTCCTGTGCACCGGAAGGCGTTCAGACATTACAGCAGCGCCAGCAACGGCTGGATGCAAAGGCAAATGTTGCTGGCGGTATAAACGAGAGGTTTTATCGGGAACAGCTTGATGCGTTGGGGTATACCGCTGCCACCATTGAGCAGTTTCAGAATCTCGACAGCACACCCGATCCTGAATGGGGGGAATTCTGGCGTTACTACTGGCGTGTGAATATTCCGGCTGATGCGAACATCAGCTGGCAGACCTGTACAAGCACCTGCGACTCTGCGATCAGAACGTGGGGCGATACTGTTGCTGAATGTGTGATTGATAAGCTTTGTCCGTCACATACGGTTGTTGTTTTTGCTTATCCGGAAGGAAAAGAGAATGCACAGAATTGATACGCCCACCGCGCAAAAAGATAAATTTGGTCAGGGAAAAAACGGATTTACGAATGGTGATCCCGCCACGGGCCGCCGCGCAACGGATCTCAACAGTGATATGTGGGATGCAGTCCAGGAAGAGGTCTGTACTGTTATTGAAGCCGCCGGCATACCACTCAGTAAAGGCGAACATACGCAGCTTCACGCCGCCATTGGCAGGCTGATCGATGAACAGGTTAAAACCCGTCTTGAAAAAAATCAGAATGGCGCGGACATCCCGAATAAGCCGCTGTTTCTCCAGAACGTCGGTTTAGGAGAAACGATAAATCTCGCTGCAGGGGCCCTGCAAAAATCGCAGAACGGCGGCGATATTCCTGACAAAAAACAATTTGCGAGAACCATCGGTGCGGTAACGTCAACCACCATTACACTTGGCGAATCAGGCTGGTTCAAAATCGCCACGGTTGTAATGCCGCAGGCTACATCAACTGCGGTGATTAAACTGTACGGTGGGGCGGGGTTTAACGCTGGTTCACCTGAACAGGCGGCAATCAGCGAACTGGTATTGCGTGCCGGTAATGGTTCACCTGTTGGAATAACCGCCACATTATGGAGGCGTTCACCTTCTGCTGCTAACGAGGTCGCATGGGTTAATACATCAGGCGACACCTACGATATTTATATTAATATCGGCCAGTATGCGTACTGGTTAATTGCGCAATATGATTACACCGGTAATGCAAATGTCACGCTGCACAGTACGCCTGAATATTCATCAGTTCAGCCGGGAAACTCAACCAGCGGTCAGACATATACACTGTTTAATAGTCTGATGAAACCCACAGCCGGTGACGTTGAGGCACTGTCAGTTAATGGAGGGAGGCTAAACGGTCCGTTAGGCATTGGTACTGATAATGCGCTGGGTGGTAATTCGATTGTATTCGGAGATAACGATACAGGGTTTAAGTGGCACAGTGACGGCGTTCTGGGGATTTATGCCAATAATGCTCTGGTTGGTTATATCGACAATTCCGGGCTGCACATGTCAGTAGATGTTCTCACTAATGGTGCCGTACGCGCAGGCAACGCAAAAAAACTGTCACTGACGAGCAATAATAATTCGACAATGACAGCCACGTTTAATTTATGGGGCGACGCAAACAGGCCAACAGTTATTGAACTGGACGACGATCAGGGATGGCATCTGTACAGCCAGCGAAATCCTGACGGTTCGATTGTCTTTACGGTCAATGGAGATATCACCGCTAACACGCTTCGTGCAAGCGGGGCTATCTATCAGAATAACGGCGACATCTTTGGTTCGCTATGGGGAAATGGCTGGTTAAGTACCTGGATTAATAATAATCTCGTCTTAGATGTTCAGTTAGGGGCTGGCACATCAGTGACTACCTGGAACAATGCAGGTTCCTGGCCTAACACTCCCGGATATGTAGTTACCTCCGTCTGGAAAGATTATCAGGGCGAAAATATTGATGGTATTAATTATGCGCCTTTGCAAAAACGAGTCGGGAGTCAGTGGTATACCGTACAAGGGGGAACGGTATAATGAAAAAATATCAGAATATCAAAAATTTCAGACTTATTGACGCGCCCGTAAACAGGGATAAAACTCAGGCTGAAATAAATATAGGTGCATATTTTCTGGAGTCGGACGATGGACAGGACTGGTATGAGTGTCAGTCATTATTTTCTGATGATACTGCAAAAATAATGTACGACCATGAGGGGGTTATCTGGGGTGTTGTTAATAAGCCAGTCCCGCAACGAGGAAACACATATTCTGTATCAATGTTGTGGCCGGTTAATATGTCTGTTGCGGAAATAGACGCTGCTGACTGCCCTGATGATTGCCGTGGTGATGGTACGTGGTTATATCAGGACGGTAAAGTCGTTCAACGGGGTTATTCTCCGGAAGAGCTGCGTAAAAAGGCGGAGGCTGAAAAAGTTCGCCGCCTTGCTGAGGCTGAATCAGCCATCGCACCACTGGCACGGGCAGTAAAACTAAAAATTGCCACAGATGAAGAGATTAAACGGCTGGAAGCATGGGAACTTTATAGCGTAATGGTAAACAGGGTGGATACATCTGCGCCTGACTGGCCGGATATACCACGCTAAATATTCAGGCGGGTTTATTACCCGCCTTTTCTTTTTCCTGTCGTTGTGCCATCAACCTGACAGCCGGTACAAATAGCCCCCTCTTGTGTACTGACCTGAAAATATACTCACCCCTTAACCACGGAGTTAACCGGATGAGTGATTTTCACCACGGCACGCAGGTCATCCAATTTAATGGAGGTACGCGCGTCACATCCACGATATCGATCGAAATCGTCAGTATGGTCTTTACGTTTAGCTTTGCCGAAACATTGCTCGCGTTTATATCATACGTTTGCAAATAATTATCTCCAGGAGCTGATAGTCAAACTGCTGGTATCCATAAAAATATGTTCATGCTTAAATTCTATAAAATTACAAAACTCTTCATAATTGTGCCGTATAGAACAATTAAAATAATGCCTTAGCCCACCGCAGACACCATCAATGTATGGATCGCCATAAGGTTTACTGTGCATAATTTCAAGTCCTTTTTTCAATGCCGGATGCTCGCTGCGGTTAACGGCGATAATCCCATTTTCAAGACTCATGCTATTACCTTTACGACTTACATGAACAGCAATACCATCAGGTAAATACAAAGTGCCGAGTTTACCTGTAAGTAACATATCAGCATCAAGATATATGCAGCCACCTCCAGGTTGCAGGTGATGGCAGCCATGTTTGCCCGCCTCCAGAAAAGCATTACTTCCTTTTAATAAAAAAAGATTTCTGTAAAAATCAAACCGTACATGTCCCAAGCGTTTATCATGGGACGAAACAAGGGACTCCTCTGGATTGTTCTTTAAAACTTCATTTAAACTCTTTTTTATCTCACCAAGCAGATATTCATCTCTGACATTTGCTGGTTGAGCTTCAATTTTAGCGATGTTTTCTAAATAAATATCTGATAGTTTCTTGTCATACATGCTATAATCCAGGTCGGAATTATAGATAACCTTTATATTTTCATATTGTTTTTCCAGCTTTGCTAATGCTTTCTTTTGTCCAGCACTAAAATTCCCATCAACTAAAACCCCGATAGTTCTCTCTTTTTCTATTATAGCGGCGTTGATAATATTATTGAGATAGGGGTTTTGTTGAGTATTAATAATTGGGATCTGGTTTTCCCCAAACCTGCTTGGGTTTCGTTCAAACCATTGAAAAAGTAGGGGGGTGTGCTGATCTAATGGCAATAAAGGATATTCAACTCCGGCAAAGTTTGCACTACCTGATGAAGGCAGAGTAATAGCTGGAGTTGCAGTATGAGAATAGTTCTGGCATGAAAGAAAACCTCTGACTCGAGAAAACATTTTTCAACCCTTACGCTATTAATATAACATACCATGATTTGAGATTGATAAGATATCGGGTTTTAACCTTAGTTTAATAAATTGGAAGATTTTTGTATGAAGATTTGCTACCGTATTTTCGTGCCATAGCTATCTGAAACGATAGTTTTTACTTGGTTAGGGGGGGCTTAAAATTACATTTTTGAATAAGTATTTTATACTTCTTATACGATATGTTTTTATTGTATTGCAGGGGGACAGGGGAGATTGGCGGGCGAAGACCAAAGTTATCACCTGAGCAATGGGCGCAGGCCGGGCGTCTGATTGGGGCAGGAATACCGCGACAGCAGGTAGCGATTATTTATGATGTGGGGCTGTCGATACACGGGCTGTACTGGATTCTGTTATCGGCTAACCGGAAGGGAGTTAAAAAACGGGATTACCGCGCTAACTAGAAGCATGAGGGAAGGTTGTGGCATCCGGTGAACCTGTAGGAATCCCATAACTGATTAAGGACGGAAACCACCAGATGCCACGAAGATGTTATGGCGCATGTTGTTGTGGGAAGTCAATAAAGGAGATGGTTTTGTTTAAAAAATAGTCTATCATGGTGAGAAAATGATTTTGATAAGTAGGCTAACTTTCTGAAAATACTGCGTACAAAAATGCTACTTTTTTCTCATGTTATTAGATAAGTTAATGTTAAATAAGAATATTGGGACGGTCTCGAAAACCGGAGTAGGGGCAACTCTACCGGGGGTTCAAATCCCCCTCTCTCCGCCAATCATTCAACAAAATCAATCACTGACAAAGCGTTTTTGATTTCGTCATACATAAATCCCTGCATTAAAATTTCTTTCACTCGCTCGATTTTTCTTACTACTGATGCTGTTTTTTACCATTTTGCTGCGCGTCGGGCATCCACTTTTATCTTCTTTAGCTCACAAGCTCACACTCTTTTGCAACGCCCATCTCACATCTCTGGTGACAAAACAGTTTAACTGCCGTTGCTTCACACGGATCCTGTGAGGGTATCTTAAAGGAAATGGATTGCTGGAGGATGGAGGAAAAGAGGCGAAATCTTTTTCAGTGGTGATACCGCCTCCCCTGGACATGAGCTTGCCAGAACCTACGCGGTAAATCCTACGACACGAAGCCACGCGCAATTTTATGCGAAAACGCTGGCAATAGCCTTGCTGACTGATGCAAGCTATCTAAAATCGCTTGAAGATGACATCATGCAAAACAATCGCTTAAAAGACTCACCTGTTGATAAGCATGTTGAGGACGAAAAGAAGAAGAAAAATGCAGCCAGGTAAAAAGGGTTACGTTCGCCGTAAGCGAACGTGATCGCATCCAGCCTTAACGAACCTTTTTCCGTTCATCTCCCTTCATGTTTTCAAAAAACTCTTTTATGAATGCCCCTAAAACGTCATACGCGAATTCGCTATTATTGTTCCCGTCGCGACAACATAGGGATTACGTAAAAAAACAGAGGGCAGACATTGAAAATATTGGAAATAACGTAAAACGATTTGTCCGTGTTTTTATCTGTCATTTAAACTTACAGAAATGGCTGTACTGCTATCAGTATAAAGATAATTTCAGTTGTCCAGATAAATCATTTTTAAAACTATTTGTATTAACAGGGTTTATTCATTTATTGAATGGATGCGTGCTCGTTTGGTCTTGTCATCTTGACGCTTTGAGATCACATATTCTATTCTATCATAGACAGCAGGGAAAATTAGCTGCTGGGATCGATTTTTGATTAATAAAATATTGGATGGAACCACGATGCAATTACCAGAACAGGATGAGTTTTCTGATTTTTTTGCTGCCAATGATGATGAACAAGCCTCTTTAAGGCGTAAGTTTTTTTTGGAGAAACATAAAGAACCGTGTCTGTCTGAGTCTGCATTAGAGGACTACCAGGCGCTGTTTATGAGTATCTACGGAATTAATATTGACTGGAAAGAGGGGACTTTTAGCCTGCTTGAGGCACTTTCAGATAATCAGGGAGGGAAGCCTGTCACGGTCAAATTCGATTATGACAGTGAGATTGAAACAGCAACGATAAATTTGGTTGATACGCAGTATGTGTTTCATCACTACCCAATGGGAAGTGATGGTTTTGATACAGAACTGGTGCGCATTGAGCATATATTGGCTAATAGTGGATATAGTTTGCGGGTATATCAGAACAGCACTTTTAGTGATACATTATCGTTCTTACTCATTCCGTCAGATGAGTGGAAACGTGTTGAACAGCATTATAGCCCAGAGCATATTTCTGAATACTTCGTTCCGTATGGAAAACAACTTGTTATTCCTGAGGTTACTGCTCCAGTCGTAAATTATGTGCCATCCGTTAAACAAGAAGCATCAAATGTCCCAGCGTTGTTTAATGCTCGGGGTATCCGTATTTGTTTTTTAAGTATAATGCTAATCGCATTTGCAATTTATATTTTGTGGAATATCCTGACAAAAATAGAGCCTTTATCATCAGGCCAACCTGCTGGCTGTGAAAATCTACAAAATTTATACTCAAAATTACGTCCAGAAGTAGCCGAGCCATTAAAAGAAAAAATGCGTAAGAGTTTGGGCTGTAAATGATAAATTTTCAGTTTAAGAAAATTACACTAATGGATAGGTAATAGAGATATATTTAATTTGATATTGTGTTATCGAAATAGTAATAACAGATAATATACATATTCTTGCGGTGGTGGATGAAGCGATTTTTATCTTATTTGTACAGGAAAAGAAAAAGAGTTTTTTTAGCTACCATGACTCTTATTAATTTTGTTGCCGCAAATCAAAAATACTACGCTTTTAATTTGTTGTCAGTGTTTATCATGGTGTTCTGGTCAGTTCAGTACCTCCTGCAGCATCTCCTTGTCCAGACTCAGGTCAGCCACCAGCTTCTTCAGCCGCTGATTCTCATCCTCCAGTTGCCGCAGACGCCGCAGTTCCGTCACGCCCGGCCCGGCAAATTTTTTCTTCCAGTTAATGGGATGGACTACTTCCTCCCGCTGCGGTTAACTGTACGAAATGTGCTCACCAGGAGAATCACCATGAATATCGTATTTCTGGGTATTGATCTGGCTAAAAATGTTTTTCAGCTCTGCGGGTTAAACCAGGCCGGCAAACCGGTTTATACGAAACGCACTGGCCGAAAAGAATTGCTCCAGACGCTGGCAAATATTCCTGCATGTCTGATTGGGATCGAAGCGTCCACCGGGGCATTTTACTGGCAGCGTGAGTTTGAGAAACTGGGGCACAAAGTAAAGGTCATCAGTCCTCAGTATGTAAGACCCTTTGTCCGCGGGCAAAAAAATGATGGTAATGATGCACAGGCCATCGCAGTGGCTCTGATGCAACCGACAATGCAGTTCGTGCCGCCAAAAAGCCCCGAACAGCAGGATATCCAGGCTTTACACCGGGCAAGGCAGCGTATTGTCAATCACCGCACTGCTACAGTCTGTCAAATAAGGGGGCTGTTACTTGACCGGGGGATCCCCATTGGCAGTGCTGTCTCCAGAGCTCGCCGTGCTATTCCTCTTATCCTTGAAGATGCAGAAAACGGTCTAAGTTCCCGTATGCGCAGAACAATTGCCGAACTCTATGATCTCTTTAACGATCTCGGGCGTCGGATCCATTTTTTTGATAAGGAAATTGAAACAGTATTCAGGCAATCAGAAGCCTGTCCGCGTATCGCCAAAGTTAAAGGCATTGGTCCTAAAACGGCCACGGCCGTTGTTGCTGCTATTGGCAAAGGAACTGAATTTAAGAATGGTCGCCACTTTGCTGCATGGCTGGGTCTGGTTCCACGCCAGCATTCGAGTGGCGACAGGCAGGTGCTGATGAATATGACGAAAAAAGGCGACAAGCATCTGCGGACACTTTTTATTCATGGTGCCCGCGCTGTCGTCAGGGTTGCCACGAATAACAATGATGGTCATATGAATCAGTGGGTTAACCAGTTAAAGGAACGGCGCGGATTTAATAAAACGACCGTGGCGGTCGCTAACAAAAACGCGAGAATAATCTGGTCGATGCTGAGAAATGATACCGGGTATCAGGTAGTGTGTAATTAA
Protein sequences of DBSCAN-SWA_4 >NZ_CP043433|1856941:1904437|1901155_1901338_+|WP_001752481.1|DBSCAN-SWA MISSYINPCIKISFTRSIFLTTDAVFYHFAARRASTFIFFSSQAHTLLQRPSHISGDKTV >NZ_CP043433|1856941:1904437|1901730_1902108_+|WP_001576155.1|DBSCAN-SWA MFSKNSFMNAPKTSYANSLLLFPSRQHRDYVKKQRADIENIGNNVKRFVRVFICHLNLQKWLYCYQYKDNFSCPDKSFLKLFVLTGFIHLLNGCVLVWSCHLDALRSHILFYHRQQGKLAAGIDF >NZ_CP043433|1856941:1904437|1862454_1863318_-|WP_000208076.1|DBSCAN-SWA MTTITKEWLQQTIAEFENTRDDIPFGLSDDDAKVLIVLKRALASLEAEPAGYHVIKECGKVGCSVATLEEAEKTRDFWNKKWTIRPYFYTAQPVQETGVYNDVLNIISLLENNEWAEHCTSTVLGSLLESEITRLVGKEQSAPVVTFYRDGVEAAAKWIDQQREAYDSEHGWSDPDTGAFEFGNDAQRGYSSTLEELAEGIRALHPNAGNSPVIPDGWISCSERMPEDEQEVIVHNKLGYRYVSYFDEHSGLFFDMRGGNQMNCIEHIFVTHWMPVPAAPKPEINNE >NZ_CP043433|1856941:1904437|1902134_1902953_+|WP_001176778.1|DBSCAN-SWA MQLPEQDEFSDFFAANDDEQASLRRKFFLEKHKEPCLSESALEDYQALFMSIYGINIDWKEGTFSLLEALSDNQGGKPVTVKFDYDSEIETATINLVDTQYVFHHYPMGSDGFDTELVRIEHILANSGYSLRVYQNSTFSDTLSFLLIPSDEWKRVEQHYSPEHISEYFVPYGKQLVIPEVTAPVVNYVPSVKQEASNVPALFNARGIRICFLSIMLIAFAIYILWNILTKIEPLSSGQPAGCENLQNLYSKLRPEVAEPLKEKMRKSLGCK >NZ_CP043433|1856941:1904437|1896360_1896948_+|WP_001207832.1|DBSCAN-SWA MALQDEYTQLLYHLLPEGPAWDGENPLIEGLAPSLNRVHQRADELMAEIDPARTTELIDRYEQLYGLPDSCAPEGVQTLQQRQQRLDAKANVAGGINERFYREQLDALGYTAATIEQFQNLDSTPDPEWGEFWRYYWRVNIPADANISWQTCTSTCDSAIRTWGDTVAECVIDKLCPSHTVVVFAYPEGKENAQN >NZ_CP043433|1856941:1904437|1891939_1893280_+|WP_000863817.1|DBSCAN-SWA MAFFSSTGWRGRLRDASFRGVPFSVEDDESTFGRRVQVHEYPNRDKPWTEDLGRATRRLTINAYLVGDDYADRRDRLIGAIETAGPGTLVHPQYGEMQGSIDGQVRITHSSTEGRMCRVSFQFVESGELSFPVAGMATAKRLETSGGLFDDAIDSMFSTFSLSGISDFIQNDVIADAASMLGDVADAFRMVDSGVSAAMRLLQGDLSVILMPPGAASDFVNALQKAWRSGDRLRGSTSDLVTMIKTMSGITLDPGLSPRGTWPTDSGSAAKQKMQRNMIAAAIRTTAISTAVHAVTTLKQPRDVPDVRGVNQPAGTGRDSDIITVMHPALDGVQTVSNGSFPPNYEDLKAIRTALNAAIDQEQLRIRDDVLFQQISVMRTDLNRDISARLAQVERTALRTPDDVLPALVLAAAWYDDAGRESDILTRNPVPHPGFIPVEPLRVPVR >NZ_CP043433|1856941:1904437|1884485_1885715_+|WP_000766103.1|capsid|DBSCAN-SWA MKLHELKQKRNTIATDMRALNEKIGDNPWTDEQRTEWNKAKSELEALDERIAREEELRRQDQTYVDENEEEQRNNQDPDKDPQQDEKRGQIFDKWMRHGASELSSEERKALRELRAQGVAPDEKGGYTVPDTFLAKVVEQMKSYGGIASVAQILATSDGRTMEWATADGTAEVGVLLGENEEAGEEDTEFGMDSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGTGTPKQPKGLKASVTGTTQTAAAGAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERYAEFDQTGFLAFHRFDCILEDTSAIKALVGKGSASS >NZ_CP043433|1856941:1904437|1886114_1886519_+|WP_000776844.1|head|DBSCAN-SWA MKLRQAQASATYLLPDPGELDQRIVIRRRVDVPADDFGVTPTYPEQIRAWAKKAQPGAAAYQGAVQIENRVTHYFTIRFRRGITADHEVLHDDISYRVKRVRDLNSKRRFLLLECEELGTDNGSDYAAESIFTR >NZ_CP043433|1856941:1904437|1865048_1865588_-|WP_000008351.1|DBSCAN-SWA MSFIQTLSGKQFDYLSATIDDIDIEDIAVALSNICRFSGHLPEFYSVAQHSVLCSQLVSPEFAFEALMHDAAEAYCQDIPAPLKALLPDYREIEKRTDQLIRFKFGLPLEEASVVKYADLTMLATERRDLDIDDSIPWVILEGIPPTDLFEIYPLRPGQAFGLFMARFNELMELRQCAA >NZ_CP043433|1856941:1904437|1870859_1871834_+|WP_000096529.1|DBSCAN-SWA MSSLIQLLDRPIAYNPAFAKLKAGKVKAGPVAAVFLSQLVYWHNRMDGGWMYKTQADIASETALTRDEQETARKRLVALGVLEEARRGVPATMHYRINTARLEALLLETAKPVKKGAQEKTRLRDFQNVETPQSGLVQPRKPDCGDAANKNVETPQTSTGQPNEQACGDPTIFPTGDYTETTQEITQESKTPFCPVAEQPDPEVTLTDQAIEVLTHLNQVSGSRYQKSKTSLENIRARLREGYSVADLQLVIDLKHEHWHENDEQYQYMRPETLFGPKKFESYLQSATRWDQKGRPKRADWGAKKRDVMAFGPVDTTIPEGFRG >NZ_CP043433|1856941:1904437|1867169_1867421_+|WP_000078504.1|DBSCAN-SWA MSQKDDIPVFPVTGWQAGPLPGYDALVVKFQFLSSPMQPIESAQETQFLVLTPEMAESLASDLQRHIQDLRNSDVHSPQEGKH >NZ_CP043433|1856941:1904437|1872300_1873182_+|WP_000200166.1|DBSCAN-SWA MTSESVCIESSDVTISVDESASRTWRRPFLKWAGGKYSMLPDLYQVIPAGMRLIEPFVGGGSVFLNSDKHACFLLADVNTDLINLYQMLAVVPGAVIRHARVMFDRLNDAESYMALREEFNAQVMDAPERAAAFLFLNRHCFNGLIRYNRNNQFNVGWGKYPSPYFPEEEIRAFTEMAHNCVFMAAGFRRTLALAGEGDVVYCDPPYEPMPGKDGFTHYAAGGFTWDDHIALAECCVAAHQRGARVVIGNSTSPRVIDLYSQHGFEIRYISARRSISSKGSTREKAKDLVAIL >NZ_CP043433|1856941:1904437|1864747_1864978_-|WP_000764235.1|DBSCAN-SWA MKLEMYTLDGSVIVDSNLVTQFYPDYKSGGELTVIETISATGETFTVRVKHSFLQVTSALATAWSVDEKKAEGAAQ >NZ_CP043433|1856941:1904437|1880347_1880785_+|WP_000501481.1|terminase|DBSCAN-SWA MGAVVRSSGGGRKRNLPSGQKSKLTRIAPPEELMSDIAIRIWKTQSKILIERGVFDLEDAPLLLAYCNAFHLMIEAEKVIAEEGLTVSSEMGGEKKHPAVNVRNDSVSQLARLGSLLGLDPLSRIRMTSGKNDPDDEGNEFDEFD >NZ_CP043433|1856941:1904437|1896934_1898497_+|WP_000554738.1|DBSCAN-SWA MHRIDTPTAQKDKFGQGKNGFTNGDPATGRRATDLNSDMWDAVQEEVCTVIEAAGIPLSKGEHTQLHAAIGRLIDEQVKTRLEKNQNGADIPNKPLFLQNVGLGETINLAAGALQKSQNGGDIPDKKQFARTIGAVTSTTITLGESGWFKIATVVMPQATSTAVIKLYGGAGFNAGSPEQAAISELVLRAGNGSPVGITATLWRRSPSAANEVAWVNTSGDTYDIYINIGQYAYWLIAQYDYTGNANVTLHSTPEYSSVQPGNSTSGQTYTLFNSLMKPTAGDVEALSVNGGRLNGPLGIGTDNALGGNSIVFGDNDTGFKWHSDGVLGIYANNALVGYIDNSGLHMSVDVLTNGAVRAGNAKKLSLTSNNNSTMTATFNLWGDANRPTVIELDDDQGWHLYSQRNPDGSIVFTVNGDITANTLRASGAIYQNNGDIFGSLWGNGWLSTWINNNLVLDVQLGAGTSVTTWNNAGSWPNTPGYVVTSVWKDYQGENIDGINYAPLQKRVGSQWYTVQGGTV >NZ_CP043433|1856941:1904437|1878673_1879291_+|WP_001075993.1|DBSCAN-SWA MNQQQFQQAAGISAGLSARWFPHIDAAMKEFGITAVNDQAMFIAQTGHESAGFTVLKESFNYSVEALKKTFGKRLTPYQCEMLGRIDGRQVAHQPQIANLVYGGRMGNKDAGDGWKYRGRGLLQITGRENYVKCGAALKLDLISTPELLAQEKHAARSAAWFFTLRGCLMYSGDVVRVTQIINGGQNGLADRNSRYNKARAALLV >NZ_CP043433|1856941:1904437|1889566_1889893_+|WP_000588852.1|tail|DBSCAN-SWA MIKELVLKKPIMAHNEKLHVLELREPSYDEIEAIGFPFTVSGDGGVRLDSSVALKYIPVLAGIPRSSAAQLAKLDIFKACMLILNFFTRSETEEDSESGSTTPHTSGE >NZ_CP043433|1856941:1904437|1878016_1878406_+|WP_001294874.1|DBSCAN-SWA MSEPVSSATVLAGGLMGASVFGLATGTDYGVVFGAFAGAVFYVATATNIGRIRLVAYFITSFIVGVLGAGLIGTKLAAITHYEKPLDALGAVIISAMCIKFLTFLNSQDLNTLFSILSRIRGGGSDGSK >NZ_CP043433|1856941:1904437|1859541_1860339_-|WP_000598920.1|DBSCAN-SWA MIKWPWKAQEITQNEDWPWDDALAIPLLVNLTAQEQARLIALAERFLQQKRLVALQGFELDSLKSARIALIFCLPILELGIEWLDGFHEVLIYPAPFVVDDEWEDDIGLVHSQRVVQSGQSWQQGPIILNWLDIQDSFDASGFNLIIHEVAHKLDMRNGDRASGIPFIPLRDVAGWEHDLHAAMNNIQDEIDLVGESAASIDAYAATDPAECFAVLSEYFFSAPELFAPRFPALWQRFCQFYRQDPSQRLRVSAAEGDYGEESEH >NZ_CP043433|1856941:1904437|1894334_1894868_+|WP_001273650.1|plate|DBSCAN-SWA MANHPLQNMITRAVITAIDTVRKCQTAGLKLIAGEKKENVEHLEPYGFTSAAQNGAEAVVLFPGGGRSHGVAVVVADRRFRLKGLARGEVALYDDQGQSVTLTRAGIVVNGGGKPVIFTNATKARFEMPIESTGDIRDNCDSSGKTMAEMRTTYNGHTHKENGDGGGITDKPGQPMS >NZ_CP043433|1856941:1904437|1868680_1868905_+|WP_001191666.1|DBSCAN-SWA MQSPLRKLRKSHGYTLQHVAKGVQVDPATLSRVERCEQAPSTELAERLAQFYAGEISEMQILYPNRYQLSDSAI >NZ_CP043433|1856941:1904437|1889977_1891906_+|WP_000785387.1|tail|DBSCAN-SWA MADSFQLKAIITAVDQLSGPLKGMQRELKGFQKEMAGLAIGAAAAGTAVLGALALPVNAAIGFESKMADIRKVVDGLDDKKAFAQMSDDILTLSTQLPMAAEGIAEIVAAGGQAGIARGDLMQFANDAVKMGVAFDTTAEESGQMMAQWRTAFRLTQEDVVVLADKINYLGNTGPANAKKISDIVTRIGPLGGVAGVASGEIAAMGATIAGMGVESEIASTGIKNFMLSLTAGNSATKAQKQAMAFLKLNPRKLAEDMQKDSRGAMLKVLDSLAKVPKAKQAAVMNALFGKESLSAIAPLLTNLDLLRTNFDRVADAQEYGGSMQKEYASRASTTENQLVLLKNSVNAISVTLGDTFLPAINEAAEAVMPYLEQLRTFVRANPELVQSAAKFGAALLAVGVSIGSLSRAVKILNSVINLSPAKVAIAALVAGAMLIIENWDDVAPVIKAVWQEVDNVAQEMGGWETVIEGVGLVMAGSFTVRTIGALQQSVLLAGRLSGLLGKIGRMGAMTLTIGVAVSLFKELKDLEQGAKDAGMDAGAFAVQKLQTKERERGYNGFIPRLKELLGMDTPIPQGRYQPYVPLTRRSGVLGRAVPPSTQRSELKVTFENAPQGMRVTDIPKSGNPLMNISHDVGYSPFRTSR >NZ_CP043433|1856941:1904437|1893276_1894335_+|WP_001066630.1|plate|DBSCAN-SWA MNNTVFLRVNGRDWGGWTSVRISAGIDRIARDFNVSITRQWPGGEDVPPVKNGDAVEVLIGDDLVITGWVEALPLRYDAQTIMTGIVGRSKTADLIDCSASPAQHNGKNLFLIASALARPFGVDVVDAGAPAAAVIEAQPEHGETVVDCLNRLLGQAQALAYDDERGRLVLGRPGSMKAATALVLGENILSCDTERSVRERFSSYLVTGQRPGTDDDFGEATIAAIRQSTGDAGVTRYRPHTIQQSGTATTDSCKSRCEFEARQRAAKTLETTYTVQGWRQGNGELWKPNQAVVVYDPLNGFDNETLVIAEVTYSQDNNGTLTEIRVGPADAYLPEPFRPKAKKKVSEEADF >NZ_CP043433|1856941:1904437|1879287_1879827_+|WP_000127618.1|DBSCAN-SWA MTAVFAFVKARWKTIIVLLMLAGAFLAGIIWSDRGWQKKWADRNSMESSQEANAQTAARWIEQGRIIARDEAVKDAQAQAAKSAATAAGLSATVSQLRTEATKLAARLDAAKHTSDLAAAVRSKTAGADAAVLADMLGRLAEEARYYAERSDESYRAGMTCERIYNSVRESTNNPIAPH >NZ_CP043433|1856941:1904437|1866631_1866853_-|WP_100228258.1|DBSCAN-SWA MRLSDSPTTTLLLCEVLSASGSSNNRRKATVFCRDADLFTQQKRASPRDGLITQSTRKAVTAGALFCFVEKPT >NZ_CP043433|1856941:1904437|1880784_1882515_+|WP_000257219.1|terminase|DBSCAN-SWA MATYPNVNAANQYARDVVNGKILACRLTMLACQRHLDDLERAKDPHWPYRFDKNKAERFLRFSQKMPHTSGEWARRKLRIEFEPWQKFALGVPFGWVRKDTGFRRFTEIYIEVPRKNGKSAIAAAVGNYMFCADGEYAAEVYCGATTEKQAWKVFAPALAMVKKLPALRQKFCIKPWAKKMTRPDGSLFAPIIGDPGDGDSPSCAIIDEYHEHDTDALYTTMTTGMGAREQPITLIITTAGFDIASPCYEKRTQVVEILERIREGGENEAIFGIIYTLDDDDDWTQPEALIKANPNYNISVKEGFLKAKQLLAMSTPGQTNKILTKHFNKWVSSKAAYYNLQKWMTAADKTLRLSDFAGEECYPGIDLASKLDLNAVVPVFRREIDGLSHYYCVSPMFWVPEDTVYATDPALKTIADRYQSFVNQGVLVPSDGAEVDYRLILEAILKLRETVKIAASPIDPYGATGLSHMLQDEGLEPVTITQNYTNMSDPMREIEAAIAAGRFHHDGNPLMTWCISNVVGKYLPGSDDVVRPVKEGAGNKIDGAVGLMMGVGRAMLNEPKDFLSNLDPDEELLFL >NZ_CP043433|1856941:1904437|1868556_1868742_+|WP_071529734.1|DBSCAN-SWA MSDTVSYIHAFITLIFCALCKSTCASLLYEITCDIRKRRRFYAITIEKIAEIAWLYVTARR >NZ_CP043433|1856941:1904437|1873596_1874457_+|WP_001061459.1|DBSCAN-SWA MNNLMVIDGIEVRRDVHGRYCLNDLHRAAGGEQKYRPKYWLDNKQTRELIEQLFTEGGIPPSEQNQSVSFFQGGSDTRSLARAPVNTVRGGAEQGTYVCKELVFAYAMWISPSFHLKVIRTFDRITSAPQISSGMAADKMQAGVILLGFMRKELNLSNSSVLGACQKLQEAVGLPNLAPQYAIDAPAGAPDGSSRPTLALSALLKQHGIRMTANQAYQQLAKLGVVEHRERYSRSAINGIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESKFPELLKLLDTVH >NZ_CP043433|1856941:1904437|1877355_1877928_+|WP_000765639.1|DBSCAN-SWA MKLFSPLSYLRIKHEEKDWYDYKIPAAVSLIVTIVYYFHASKISLIETNGLLLQVNGLLQVLIGFYIAALAAVSTFSSSSIDEVMAGVPPTLVEKFRGQKLTVELTRRRFVCYLFGYLALVSFMLFCLGMISILIGKPFHLWLLTFCSPDAILWLKTVFVGVYIFILMNIITTTLLGLYFLAVRFHQSSL >NZ_CP043433|1856941:1904437|1864235_1864751_-|WP_000071068.1|DBSCAN-SWA MSNRIRNAQVFDARTGEYPVDMYIRWIIGGELDFDANYQRGYVWGHEEQQAFLNAVISGFPIGSVALAKAPDWCSRELPYIEVVDGKQRLTTLKKLITNEIPIILADGPLYWRDMTRAEQLAFGRRPLPAVVLDEVTYKDRLAYFMAVNFTGVPQSEEHKRHVMQLMEAAQ >NZ_CP043433|1856941:1904437|1900570_1900792_+|WP_001526483.1|DBSCAN-SWA MFLLYCRGTGEIGGRRPKLSPEQWAQAGRLIGAGIPRQQVAIIYDVGLSIHGLYWILLSANRKGVKKRDYRAN >NZ_CP043433|1856941:1904437|1887717_1889214_+|WP_001007993.1|tail|DBSCAN-SWA MAISFNSIPSDTRVPLFYAEMDNSAANTARDSGASLLIGHASNDASIAVNSLVLVSSVDYARQICGAGSQLARMVGAYRKTDPFGELYVIAVPESTGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVNANPDLPFTATSEAGVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFDYIGLPFNDTASVNTMATEMNDSSGRWSYIRQLYGHVYTAKTGTLSELVAAGDQFNLQHITLAGYEKDTQTPADELAASRTARAAVFIRNDPARPTQTGELVDMLPAPKGKRFTTTEQQTLLSHGVATAYVESGVLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIRGELGSTYRQLEREGIVENFDLFQQHLIVERNANDSNRLDVLFPPDYVNQLRVFAVLNQFRLQYSEEAA >NZ_CP043433|1856941:1904437|1867496_1867682_-|WP_001067433.1|DBSCAN-SWA MNNYYTCSFCGVSELDAKKLIAKGSKDEPAICSECVVSCVNILINYAAVIKPVKLNVTKGE >NZ_CP043433|1856941:1904437|1899350_1900358_-|WP_000492926.1|DBSCAN-SWA MFSRVRGFLSCQNYSHTATPAITLPSSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNIINAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGEIKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYLPDGIAVHVSRKGNSMSLENGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHEHIFMDTSSLTISSWR >NZ_CP043433|1856941:1904437|1883873_1884476_+|WP_000003793.1|head,protease|DBSCAN-SWA MSEREIRCYSGEVRAETHDSEPSRIIGYGSVFDSRSELIFGSFREIIRPGAFDEVLNDDVRALFNHDPNFILGRRSAGTLALTVDERGLRYDITAPETQTIRDLVLAPMQRGDINQSSFAFRVARDGEEWYQDEDGVVIREITRFSRLLDVSPVTYPAYQEADSAVRSMKAWQEARDSSALQKAINQRMARERVLTLLNA >NZ_CP043433|1856941:1904437|1876269_1877343_+|WP_000357930.1|DBSCAN-SWA MDKILPTMGFVCLSLISLKNPPPSGFFICDHFIFCLAKLLYGQELKLSGDIVLSNNERWVSFFDFAFTPTHAAAPSIPIEDILKKLKVLVSSGSAVKLYNHRSRALRISEMKYSIGDSQATLLIQLCDKNGSDPVFGELTTGNLRVEPKLAGEGIAVSCHIVISTDVVKNTADHHKTLVESVPGISKSVLEPFLNAMLREAFAGCEFKNPATKGMCQHRPKLEIYSHGSQTLMDALKGAKIHNVKLVSTRRKGGLDQTAYTELSERSVKYKIIRQPPLKDKERLLEILRKKGQQSGYTKVSISYSKDGKQASLDLDRNEDAATKLFTKSERVILGNLINQCESTVHLQLETKMIGLL >NZ_CP043433|1856941:1904437|1885794_1886118_+|WP_000927251.1|head,tail|DBSCAN-SWA MLLSPEEIKLQLRLDEDYADEDKFLELLGRAVQARTENFLNRRLYTAEAGVPADDPEGLILSDDIRMGMLLLVTHFYENRSTVTEVEKVELPMSFNWLVGPYRYIPL >NZ_CP043433|1856941:1904437|1886999_1887560_+|WP_000779215.1|DBSCAN-SWA MKLTPIIAALRSRCPRFENRVGGAAQFKAIPEAGKLRLPAAYVVPAEDVTGEQKSQTDYWQDLTEGFSVIVVLSNERDEKGQWASYDAVHDVRQEIWKALLGWEPDPQAHEIQYAGGMLLDLNRHELYYQFDFTVKYEITETDTRQQDDLDGLPDLKTLSIDVDFIEPGTGPDGDIEHHTEITFQE >NZ_CP043433|1856941:1904437|1895278_1896358_+|WP_000785580.1|plate|DBSCAN-SWA MADSQFARPELPQLIATIRSDLLTRFQQDVVLRRMDAEVYSRVQAAAVHTLYGYIDYLARNMLPDMCDEDWLYRHARIKRCPRKNAVSAKGFARWDGIAGTPEIPAGTQIQRDDQVTFTTLQTVKASGGLLRVPVIADVAGTAGNTDDGTALRLGTPITGIPSTGYADTLTGGADTEEPETWRARVMERYYWIPQGGADPDYVIWAKEIAGITRAWTFRHYKGTGTVGVMVATSNPVNPAPGDDLVKAVRDHILPLAPVAGGGLFVFAATEKSIPVTVALAKDTPEIRTAIIAELNALMLRDGAPSGKIYVSRISEAISLATGEVAHQLRVPAADVVLGKTELPVLGNITWATYTGENG >NZ_CP043433|1856941:1904437|1878392_1878674_+|WP_000226304.1|holin|DBSCAN-SWA MVANDPSAALNAVICGVIVIVLMFYRRGDATHRPLISLLAYVMVLVYASVPFRFVFGLYESSHWLVVMVNILICAAVLWARGNVARLVDALRH >NZ_CP043433|1856941:1904437|1861621_1861864_-|WP_000414876.1|DBSCAN-SWA MEKQLMSDRFLTEEELEDATGASQKSLQKEVLTLNGIYFIERRDGSIRTTWYHINHPVSRLLPPAGYQPVPGMNFDAIES >NZ_CP043433|1856941:1904437|1865682_1866600_-|WP_000551790.1|DBSCAN-SWA MHNPFFKNMLIYRFSRDFNIDIDSLDKKLELFRFSPCGSQDMAKSGWFSPLVQYSDVLYHAVNNQLLLVIRREEKIIPKQTIADEINKKVSTLEREQGRRLKKTEKDSIRDEVLHSLLPRAFTKNSLVRIWINTAAGFIVVDTSSIKRAEDSLALLRKTLGSLPVVPLTMENPIELTLTEWVRSEAAPSGFSIGDEAVLKAILEDGGTGRFKKQDLACDEILTHIEAGKVVTQISMEWQQRISFTLSCDGILKRIKFADQLISQNDDIDSEDVVQRFDADITLMTGELSNLISDLTAALGGEAKR >NZ_CP043433|1856941:1904437|1874464_1875454_+|WP_012543375.1|DBSCAN-SWA MRALLTPEIAPRMGVVLFRPGSELMPLFMQGRVLLEPEPEQYSSFACGAVPAVSQPLADDPAVRDVFRNESVIYRAGGLDSLESWLLRGNGCQWPHSVWHSEQMTTMRHAPGAIRLCWHCDNLLREQFTERLESIAVENTTKWVLSVVCRDLGFDDMHAVTLPELCWWMVRNDLAEVLPESAARKALRMPKAIVQSATRESEIVPSVPATSIVQDKAKKVLALRVDPESPESFMLRPKRRRWINERYTRWVKSQPCACCGKQADDPHHLTGHGQGGMGTKAHDLFVLPLCRTHHNELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >NZ_CP043433|1856941:1904437|1873190_1873580_+|WP_000779149.1|DBSCAN-SWA MKLTLPFPPSVNTYWRAPNKGPLKGRHMVSASGRKYQSEACAAVIEQLRRLPKPSTAPAAVEITLYPPDKRIRDLDNYNKALFDALTHAGVWEDDSQVKRMLVEWGPVFPKGKVEITITKFETGAGAAA >NZ_CP043433|1856941:1904437|1875467_1876220_+|WP_001047141.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATTSESLTITDVMAAQGMVQSKAPLGFALFLAKVGIQNPDFAIEGLIHYAVALDNPTLNKLSEETRLQIVPYLVNFAFADYSRSAASKARCEHCAGTGFHHVLREVVKHSRNGEPVIKEEWEKELCQHCHGKGEVSTVCRGCKGKGIVLDEKRTRFHGAPVYKICGRCNGNRFSRLPTTLARHHVQKLVPDLTDYQWYKGYADVIDKLVTKCWQEEAYAEAQLRKVTR >NZ_CP043433|1856941:1904437|1868933_1869488_+|WP_000509728.1|DBSCAN-SWA MGHEPEWKVEKQPRWLVAAIKKTISSLHGGYEEAAEWLDVTKDALFNRLRTGGDQIFPIGWALVLQRAGGTYHLAHSVARASGGVFVPLADMEEVDNADINHRLLEAIEQITSYSQQIRVAIEDGVIEPHEKAVIDEELYQAIAKLQQHSTLVYRVFCVPEKGDARECAAPGAVASNFMEKTNA >NZ_CP043433|1856941:1904437|1861888_1862458_-|WP_001061370.1|DBSCAN-SWA MNNLMVDLETMGKKPNAPVVSIGAVFFDPQSGEIGPEFYTAVSLESAMEQGAVPDGDTILWWLRQSPEARAAICADAVSVTTALIEFNDFITCHADDLKYLKVWGNGANFDNVILRGAFERASLPCLWNYRNDHDVRTMVTLGRAIGFDPKRDMPFEGDMHNALADARHQAKYVSAIWQKLIPPTSNNI >NZ_CP043433|1856941:1904437|1870638_1870863_+|WP_000620702.1|DBSCAN-SWA MIRNIFKRFTSQRFHCPRPGQWYSTPEGYVLRISLVDRECQKVVCEPLGRNYRVNMPLIAFRSGKNMKHLGGAA >NZ_CP043433|1856941:1904437|1863314_1863608_-|WP_000267991.1|DBSCAN-SWA MWRGLNRGGSQMILTAYEYDPETEKSQSVYLLRHHSKVKKTTLEQKLTVKNDAFGRFKPFVELEDFPEGLSEREAMLKLADWLHRLSVAIEDNWSTP >NZ_CP043433|1856941:1904437|1889213_1889570_+|WP_000515952.1|tail|DBSCAN-SWA MGKIAGTTYFKIDGQQLSVTGGIEVPMNTKVRDDVIGLDGSVDYKETSRAPYTKVTAKVPKNFPVDKITSSDVMTITSELANGQVYVLSNAWLHGEANHNPEEGTVDLEFHGEEGFYQ >NZ_CP043433|1856941:1904437|1860630_1861620_-|WP_000532847.1|integrase|DBSCAN-SWA MGRKRAPGNEWMPKGVFFRPSGYYWKPGGSTENIAPADATKAEVWVAYEKKVEGRKNRITFTQLWRKFLASADYADLAPRTQKDYLAHEKYILAVFGDAEAKAIKPEHIRRYMDARGQKSRVQANHEHSSMSRVFRWSYQRGYVPGNPCVGVDKFPKPQRDRYITDEEYRAIYNNATPAVRAAMEIAYLCAARVSDVLKMNWNQILEKGIFIQQGKTGVKQIKSWTDRLRDAVEICREWGEEGPVIRTMYGERYSYKGFNEAWRKARKAAGDDLGRPLDCTFHDLKAKGISDYEGTAKDKQKYSGHKTESQVLVYDRKVKMSPTLDRKR >NZ_CP043433|1856941:1904437|1882786_1883881_+|WP_077905357.1|portal|DBSCAN-SWA MKLAAVYACIYVISSNVAQMPLHVMRRTGKKVETARDHPAFYLVHDEPNSWQTSYKWRELKQRHILGWGNGYTRVLRHRRTGEVTGLEACMPWETTLLNTGGRYTYGVYNEEGSFAINPDDMIHVRALGNDQKMGLSPVLQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKGELNDGSWKRLKEMWQKATAMLRSQENRTMLLPAELDYKALTVSPVDAQLIDMMKLNRSMIAGIFNVPAHMINDLEKATFSNISEQAIQFVRYTMMPWVTNWEQELNRRLFTRAEREAGYYVRFNLAGLLRGTAKERAEFYHFAITDGWMSRNEARAFEDMNPKDGLDEMLVSVNASRPAKSTTQENTQDE >NZ_CP043433|1856941:1904437|1856941_1858216_-|WP_001680077.1|integrase|DBSCAN-SWA MSLTDTKVKNTRPSEKAVKLTDGFGLYLLVHPNGSKYWQLGYRFDGKQKVFSIGVYPAVSLADARQRRDEAKRLLTQGIDPNAKKQADEKVLQEKRDKTRSFRVVAKSWFATKTKWSEDYADTVWKRLETYVFPDIGDRNVSELDTGDLLVPVKKAETLGYLEIAMRIKQYITAILRHAVQQKLMRHNPAYDMEGAVQKPETEHRPALELEEIPLLLERIDAYKGRGLTTLAIKLNLLIFIRSSELRFARWSEIDFKSKLWVIPEQREAIENVKHSTRGAKMKRQHFVPLCRQALKILKEIRQLTYEEGNEAELIFTGCYDSFKPMSENTINKALRKMGYDTTQDICGHGFRTLACSALIESGLWSEDAVELQMSHKESNSVRAAYTHKAKHLDQRRLMLQWWADFLDENRYEMVRPFEFAQKQ >NZ_CP043433|1856941:1904437|1879850_1880201_+|WP_001135228.1|DBSCAN-SWA MPPRTPKSCRVRGCRSTTTDPSGYCESHRSEGWKQYKPGQSRHQRGYGSKWDVIRERILKRDKGLCQLCLRAGVVREAKTVDHIIPKAHGGTDADSNLQSLCWPCHKAKTARERLK >NZ_CP043433|1856941:1904437|1867887_1868583_-|WP_001020644.1|DBSCAN-SWA MNIGNRVRQLRRAKNMKIAELAEAIGVDAANISRLETGKQKQFTEQTLSRLADCLSVDIAELFTSDPKGNTVCKHSDMRKDSANVKDLFRIEILDVSASAGNGLIQGGDVIDVIHAIEYNKDKALAMFGGRPAAELKVINVRGDSMAPTIEPGDLIFVDISINQFDGDGIYVFGFDDKIYVKRLQMIPDKLLVISDNTNYREWSITKDNECRFGVFGKVLISQTQSLKRHN >NZ_CP043433|1856941:1904437|1871830_1872304_+|WP_000054227.1|DBSCAN-SWA MSLLAKVQAFIELNPGLTSNEIADAFPEYARFDVQRSASKLYRCKRVNRRLDGDVFRYYAGKDEAVILTLRQKRSGHTGSGDPMVIAKLVSRAEELESRGLFNRASIVWLEAFSESQFIYEREEFLRRRQKCLNRIKKRIRPVEQVYLAGRFVGNVE >NZ_CP043433|1856941:1904437|1887563_1887728_+|WP_000497739.1|DBSCAN-SWA MFVKPAKGRSVPDPARGDLLPEGGRNVDENNYWLRREAAGDVRRTNKKVKTNGD >NZ_CP043433|1856941:1904437|1903414_1904437_+|WP_001028172.1|transposase|DBSCAN-SWA MNIVFLGIDLAKNVFQLCGLNQAGKPVYTKRTGRKELLQTLANIPACLIGIEASTGAFYWQREFEKLGHKVKVISPQYVRPFVRGQKNDGNDAQAIAVALMQPTMQFVPPKSPEQQDIQALHRARQRIVNHRTATVCQIRGLLLDRGIPIGSAVSRARRAIPLILEDAENGLSSRMRRTIAELYDLFNDLGRRIHFFDKEIETVFRQSEACPRIAKVKGIGPKTATAVVAAIGKGTEFKNGRHFAAWLGLVPRQHSSGDRQVLMNMTKKGDKHLRTLFIHGARAVVRVATNNNDGHMNQWVNQLKERRGFNKTTVAVANKNARIIWSMLRNDTGYQVVCN >NZ_CP043433|1856941:1904437|1898496_1899066_+|WP_000760554.1|tail|DBSCAN-SWA MKKYQNIKNFRLIDAPVNRDKTQAEINIGAYFLESDDGQDWYECQSLFSDDTAKIMYDHEGVIWGVVNKPVPQRGNTYSVSMLWPVNMSVAEIDAADCPDDCRGDGTWLYQDGKVVQRGYSPEELRKKAEAEKVRRLAEAESAIAPLARAVKLKIATDEEIKRLEAWELYSVMVNRVDTSAPDWPDIPR >NZ_CP043433|1856941:1904437|1894872_1895286_+|WP_000605050.1|DBSCAN-SWA MILYVNGIRKDATASLDFLTRAVVISLFTWRRAERDDRTPQPYGWWGDTWPAVQNDRIGSRLYLLKRRKLTNKTPQDAREYMQQALAWMTDDGVAARIDVTSERTGTDTLAAGVTIYQRDGVIHNITFDDIWSKLNG >NZ_CP043433|1856941:1904437|1886490_1887003_+|WP_001135695.1|DBSCAN-SWA MPQKAYLHVDFEQPETLVFNRARMRRAFVSIGQVHMRDARRLVMKRGRSGPGDNPSYRTGKLARSIGYYVPRASSRRPGLMVKIAPNQKNGEGNRPISGAFYPAFLFYGVRRGAKRKKGHHRGASGGSGWRVAPRNNYMTEVLDKRRSWTRYVLSRELRKSLRPQRRKKK >NZ_CP043433|1856941:1904437|1858879_1859170_-|WP_001675175.1|DBSCAN-SWA MPDQRKRPSSSLTGLASIDSKYWQTGYRFNGKQKVFSIGVYLAVSLTDARQRRDEVKRLLAQGIDPNAKNRLMKKSFRKSAIKPARPVSSPKADAP >NZ_CP043433|1856941:1904437|1869484_1870642_+|WP_001087406.1|DBSCAN-SWA MNSLTVNNRLSQQPGMYEYRPLRHECRLSNSLVVRNHREHSLTVGDESCRNLTAGFGMEGDFMSMSFAGNQKLSALSICARAIRMSVLALCGNSGVILLSVKRQEHIDSAIPGRYTVQAPHKAGAGRGNPEFNIEHNRAHAVFSCHEHCYAQIMVGRAGPVSAGPGSMLTGISTPVRLTTYKVVESLGGEFIEFNIEAATMATVPTLAQPEIRIINGQAVTSSLAVADYFIKRHADVIRKIESLECSTLFRKRNFAFTSISINQPNGGTRKLPCYQITRDGFAFLAMGFTGKRAAQFKEAYIDAFNQMEKQLSTPSVLSDAAHNASVLYSYISSIHQVWLQQLYPMLEKAESPLAVSLHDRINDAAALASLINMTLNRSEVRGRK >NZ_CP043433|1856941:1904437|1867320_1867569_-|WP_071590080.1|DBSCAN-SWA MRKHSYQLRSRDKASKTERHEGGMMLVNAKKSAITQRRFKGNEVVQVFLISACLLVGCERRNFANPEYAFASLKPGSQPSQE >NZ_CP043433|1856941:1904437|1863879_1864239_-|WP_000065085.1|DBSCAN-SWA MSNIDKLNDHELVDLKNAIERELKRRADGPKVTTYYVVSCITDAQHFTDLDCALRCLKSVTENLMEWVTESPENRDYVNQCTGIVGAKLQVKEMNLDHFNMRVAEKYFDDICYPQETAQ |
65 | Salmonella_phage(78.18%) | plate,tail,terminase,head,protease,capsid,transposase,portal,integrase,holin | attL 1859842:1859856|attR 1908032:1908046 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2436624 : 2452568
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP043433|2436624:2452568|DBSCAN-SWA TATGAACAGAACGATTCTTGTACCCATCGATATTTCAGATTCAGAATTAACTCAACGCGTAATTTCGCATGTTGAAGCTGAAGCAAAGATTGACGACGCTAAAGTGCACTTTTTGACTGTAATCCCGTCTTTGCCCTATTACGCTTCACTGGGACTGGCTTATTCAGCAGAGCTTCCCGCAATGGACGATTTGAAAGCCGAAGCCAAATCTCAACTGGAAGCGATTATCAAGAAATTCAACCTTCCTGCGGACCGCGTGCAGGCTCACGTTGCAGAAGGCTCTCCTAAAGATAAGATTCTGGAAATGGCAAAAAAATTACCGGCCGATATGGTGATTATCGCCTCGCATCGCCCGGATATTACTACCTATCTGTTGGGTTCCAACGCCGCAGCCGTTGTGCGTCATGCGGAATGCTCCGTACTGGTGGTACGCTAAATACCCGAGCCCGCATAAGAGAGTGCGGGCTCAAATCAATCATCATCGCCTAGTGACTGACACTTTTTGAAAACAGATTGATTACGATTACGCCGCAGATGATAAGCAACATGCCAATAATAGCCGGCATATCCAGTTTTTGGCCGAGAAATAACCATCCTATCAATCCAATAAGAACAATCCCTACCCCAGACCAAATAGCATAAATGATACCCGCAGGGATGGTTCGCATTGGGATGGTAAGACACCAGAACGCAATACAATATCCGATGATAGTAACGAGGCTCGGTACCAGACGCGTAAAACTATCTGATAATTTTAATGAGATCGTGGCGATAACTTCTACCACGATGGCGATAAATAGAAAGATTACAGCTTCTTTAGTCATACATCTCTTCGTTCCAATATTTTTCGGGGCGTGATGATATCTTAAGTAATACGGATGACAGAATAAAGACGCTTTTGAACCATTGCGTATCTTGCCATAAAAACGTGCTGGCATTCGTCGCCGTAATCCGTACCATACGCGACATGCTTTTGCCTGCTGGCTGTTGACGGCAGGAGCGAACCCGGCATTTATCGCCAGCCAAATGGGGCATGAAACTGCGCAGATGGTGTATGAAATTTACGGTATGTGGATTGATGACATGAACGACGAACAAGTAGCGATGTTGAATGCACGGTTATCGTAGTTGCAAAGTTTGCCCCCAATTTGCCCCATTTAGTACCAGAGAACTGAAATAATGCAAGAAATTCAAAAGAATACAAAGAAAGAGCAATACAACCTCAACAAGTAAGGGCAAAAATCACAACCATCTGATGCATAATAGTTTTATTTATTTTTCAATGCGTTAAGCATTATCAGCAACAACAATAAGCTACGATAATCCACTCTTTTGTTGCCCCATTTTTGCCCCTTTTACAGCATTTTGCCCCATTTTTGCCACCGAAAAAATTCCAAAACTTCTCAACCTCAGCACGTTCTTGAATATGGCATATGACAATTTCCTTTACGATTTATATGCTGTTATAAAAATCCCCAGTTGTCAGCAACAACTGGGGATTTACTTTTCAGGCCCAAAAGACGTTCACTACGACTCTGCCCGGCAGCTTCGAATATCTGGCGCGCCTTATCCAGGCTGGCGCACCCCACCAGTAAAAAAGGCACCAGTATCGCTACCAGTGCCCATTTCGCCGCCATTCGCGGCATTCTGTGTGTCCAGTGTTTTCGGTTCATATCAATACGCGCTCTTTCATCCAGCCATAGACAAACGACTCGTCAGCCTCCCGCTTTTCCGCCAGCTCCAGATAGCGCTCCCCCTGCGTGCAGTTCAGCGCCACCAGCAGTACACGCTCGCCATCCTTACCGCGCTTTTCCAGATAAACACGTAACGCGTTAAGGGTTCGCGGCCCGATGCGCCCGTCCGTATCCATATCCGGATACAGCCTCCCGCCCTGGTTGAACACGTTCAGCCAGCGCTGCAACATTTTCGCTGCCACCGACGGCCCCATGTTCACGCCCGTGTCACACAGTTCGGCAGCAACATCCGGCGAGGCCTTCGCCACCCGGTCAAAGCGGGGACCGTACCAGTAGTCGGTCTCCAGAATTTCCAGCGCCTGTCCACGGGTTAAATTGCGCATATCACCACGGTATCCGTGGGCGCGGGCAACTTTTTCCGTAATACCCCATTTTGTCGGCCCGCCTTTATCGTCCGGATGGTTGACGTAGCCGCCTTCCTTACCCAGAATTTCATCAAAAATTTCGTCCTTCGGTTTCATCGCAGCCTCAGCAGTGAAAGTATTTTCGACACATTACCCCGCGCCCACAGCACCAGCCCGCAGAACAGCGCGTTCATCAGCACCACAGGCCAGGATGAAGACGGGTAATGTCCGGCCAGAAAACGGAACGGAACCAGCGCATAGCCCAGCATCAGCAGGTACGACAGCCAGGCTATTCCCGGTTTGTATCTGGCGCCGTTCCGGCGGTAGAGAAAAAGGGCCAGCACTGTCACCAGGCACAGGCCCGCATTCAGTGTGCCGGTCAGATTACTTTCCATGACCACTCCCTCCTCCCCGCATCCGTGAAAAAACCCCGGTCAGCGATGAAATGTCCTGGTCATGAATGAAGGTCAGAAATTTCACTGACAGCACCGACATCACCACTGCACACAGCGCATCCAGCGGTTTACCGTTGTAGTCAGCAAGGCGGGACAGGAACGAGGCCAGTACAGGCACAGCACATCAACCGCTTCTGCTGTTCCTGCCACGTCAGCCAGCTCACGCAGTGCCTGAGCATTCGCGCTACGCGGTGACGCCTCATAGCGGGCGCGGATATTGGCACTCATCTGGGCAATCTCGCCGGAGAACGGCGCAAAGGCAGGAAGTGCCGCGATGGTTTTCTCCAGCACTGCAATTTCTTTCGCGTCACAGGTGCCATCGGCGTATGCAATGGAATATGCGCCCCAGACGGTCGCCTCCACCGCATCCCGGTTCTCCATCTTCTTCACTTCAACAATAGCCTTGCGGGTTTTCTTTCTGAAAATACCTAACATCGTGACTTTTCCTTTTAGTGGGTGAGCCTGCGCCCGGGGGTGACCAGCCCACAGAGAAAGTCACACTGACCATCCCGTAAGCTCACCCCTGAAAGGCTCCGTGGTTAGTTATGAATGTGCGCCGGGCGTGGCGCGGGGAATGAAATAAGCCTTACCGGAAAATAAGGTTGTTTCCGGGGTTCCGGTCTTATTTTGTTATGAGTAAAATAATGGCGACCTCCCGAAAGGGATAGCCTGAAATTTTTCAACTATCTATCACGGACATTGTCCCGCGGCTTAAATCCGACAGCCGCGCTTCTTTTTTAATTCATTATTTCCCGCCCCGGATACTTCCCGCACCCTGGTCTGAAAACGTTAACTTTATTATATGGCGGGCAGAACTTTTTCCCGGAATAAAAAAAACCGCCTCACGGGAGGCGGTGCTCAATATCTGAAGCATGTTTTATAGTTATCGACTGGAAAAACAGTAGAGGTGTCGGGTGCCTCCCGAAAACACATATTAGCCCGGTATGTGTCTGTGACCATCGGCAGAACATTTTTCCCGGTCACCCCCGCACTGGGGAACACCTCAATAAATAATGCAGTGCCGGAATTATCTCCGGAGGGCCGTTGGTCAGTCACCATGACCCGCAGACGAAAATAAACGCACCTCATATCCTGTTTATGCCCCACTTTATGTGGGTTTCACTGCATTCAGATGGCGTACCACAAATCCCGTATGTGTTACAAGAGCATCAGACAGTGGCGCAAGCAATGACCGTTTTTTTGCAGTACGCCATCTGAATGCAGTAAAAAAATTGCGGTGCCTGATTCACTATCAGGAGGTTAAATTAAAGGAACATTAATGTAACCACCCCATATCCATATGCCTGCACCACAAAGAGTAAGAACGATACCTATATCAATGGAATCTTAATTTCACCTTAATGCCAAATCGAGTGATGATTCTTTAATCAGACGACTCAGGAGGTATAAAGAATCAACTCTCTTATTCGACGGAGTGGAGAAATCTGCCAGTCACCTCCGCCAAACTGTCGACAATAATAAATCATAAAAATTTCATTTCAACAAGCAGTCGCGTCAAGAAATGTAAATTTATCATTAGATGTTTATTTATTCCAAATGTTTTGCATTAATTCAAAAAAGACAAAATACAATCAGGGCATATTCTGATGACTCCATCTTATTTCTGCAATTTGCGGGAATAAAAAAACCACCTCATGAGAGAGAGGTGGTAATCATAACATGGAGTTATTATTATAGTTTTATTGATGGCGATAAAAAACCTGAGGCGCCGGGGACTCCCGAAAAATATTGCTGTGTCAGGTGTTGTATTGACAAGAATTTTCCGTATGTTCTGCCTCTGTGATATGAAGATTTTTAGTAATAAATGCAGTGCCGGGATGCCCCGGAGAGTCTTTAGTCAGCCACTATGACCCGCTACAAATTGATCAAACGCAGACAATGCTGTTTAGCTCTGCTCAGAGCAGATTTCACTGCATTCAGATAGCGCACTGAGAACCCCGCATTACCAACACACTGACATTCTCATTTATCCGATTCACAGTACGCTATCTGAATGCAATAAAAAATTGTGGCACTGAGTCCATCGGGGCTTCTCCGCCCCCACAAACCGGATGCTGTATCAGGATATACATCATGCACTCATCATAAACGCACTATCCACACTTTACGTGCCAGCACCACCAACGGAGTAATTACGATAGTTGTATCGAAAAAAACGCCTCTTCTTCTCTGGTTGAGAGCCGGTTCCTTCACGTGAATTTTCCAGGATGCACAAAAAAACCGGCTCTCTGATCCTCGGGGGCAGAAGAAGGTTATCCATCATCCCCTTCGGACCGATGAAAATAATACGAGTCAGAACAGGCGTTTCAACAATTCATTGTGTCAATAAACGTAAAGTTATTATTACATTTTTATTTTATCAAAACATTTCACATTAATTTAATGTACATAAAACATATTTAAAACGCACCATGTTTAACAACCCGGATTACACTGGCGAATATTTTTCTGTATTGCAGAAACGACAAAACCCGCACGATGGCGGGTTTCAAAATGCGTTCATGTCTGTCATTAGCCTCGCGATACAGCTTTGCGAAGCGTACTGGAATTGAAGCAGTTTGTGGCTCATTTTGCAAATGATTTTTTAAGCATAATCGAACGCTTTTCTCATAGGTGAATACAAAATGAACTCAGCGATACTCAACCACTGCTCAACACGGCGTCGGCATGTAATCAACGCCCACTCTGGGTACTGTTCGTTTAGTAACTCGGCCATCCTTCTCTTACTCATCCCCCGCCCCACGTAACGCTGGTTCAGGATACCAAGCAGCCCCGGATGATCCGTCAGTACCTCGCCTACAACCCGATCGATAATCAGCGCTTCCGTATCCGTACAGTGTGCCAGCCAGCTTTTTTGCTTTCCGTTGATCATCTCTCTCAGGAACACTTCCAGTTCAGGTTTCTCCAGCCCCGCTTTTTTCATACGGCGCAAAGCATCGTTAATGGCCGTCTTCGTCAGTTTTTTGGACGCCAGCAACTGGTTAAACATATTCCCTGTTTTGCCGCCACCAATGTATGACCAGCGCCCCCACATACGCAATTTCCCCTGAATCCACACACTTTCCAGCGTGCTGAGACGAAGGTGTTCTCCGCTTTTTCCGGTGTTTGTTGGGTAAATCATAAAATGCCTTTCTCTCTCCAGATTTCCTGCGTGCGGAAAACGCCCTCTGCGTGCATCAGGCGAATTTCTGTTTTCGTGAAGTCGTCGATTTTTACCCGCCCATCGACAATATCGTGGCACGAGCTACAGGCAATTGCTGCCTGCATATCGTTTGGTTTTGTCGCCGTTCCGCACGTACCCGCCAGCCGGTAATGTGCCAGTACGGACGTTTCAGGATTATGGTTGCAATGGCCGGGAATTCTTACCGTACACATCAGGCCCCGCGCCGCTTTACGTAAATCCGCCATTACGCAAACTCCAGCAGTTGCGCTGCGACATTTTCGACCTCTTCCGGAGAGGAGAATTTACGAAACAGAATCCAGTTCCACAGGACGTTCAGTACGGCCTTATAAACCTGCTGAAACTCGGTTTCGTCCATATTCGCAAAAGCGATGGATTTCGCCCGGCGCCCACGGCTGCCATCCGGATAATAATGCTCAGTATAAAACCCGGCCTGAATGGTTACCCACTCGCGGAAAGCGTCGAAGGACTTGAGCAGCGCAACATCGAGCGTTCGGTTGACAGCAACCTTATGCAGGTACTGTTCCGCCGCGTCACTGAGTGCGGGACTGTGGCCCTGCGCTGCTGACTCACACAGGAAATCGACAAAGCCGGTAATCAGCTCCTGTTCCTGAGGCAGGATCGCCCCGCCGATCGGAGTCCAGTAATCGAAACCAAGCTGAAGGAGCTTGAAAAAACGTTTGTGGAACGCGTAGTTACGTACTCGCTTAAAATCGGCGTGTATCCACGCACCGATTTTTACTGAGCGCAGGAAGTCCTCACTCTCCGGCGTTGCCGGGAGCAGAAGCCCTGATGAGGTTTGTTTAACAAGTTGTAGATGCGCCATCGTTCTCTCCGGTGGCGCTGTAGGTTGCTGATTGTTCAGGTCAGCCGTAACATATTAAAACATTAATAACTGACAGTGAAACCCAGTCTTATCAGATAATCAATAAACGCTTCAACAGACAGAATCAGATGGTCGTCAGGAATTAGCGTACAGAATGAGATTTCACCATTTTTTACACGTACTGCATAAAGCCCGTCTTCATCTAACTCTAATAAATCCTTGAGTTTTTTCACGTTACCTCCAGACAACTAAGGAAAAATGAAAAGGTGCGATTTCAACGCGATTTCTGTTGAGGCGGGAAATATAAACACTGCGACTATTTATTTCATTATATAAATTTGCTTATTTTATGTTCACCAGCAAGGACATTTTTCACTTGTTGCGCAACCAATCTGAAAGTTGATCATTTTTATGAATTTTTATTTTACGGGTAACAAAAAACCCGCCGAAGCGGGTTAAGTGTGGGTGCGTTGAGGATGCCTGACACGTCAGAGGTGGCGGGGATTTCTCCCCGCCAGGTCTCTTACTCCTCAGGTTCGTAAGCTGTGAAGACAGCGACCTCCGTCTGGCCGGTTCGGATTCGTACCTCGCAGAGGTCTTTCCTCGTTACCAGTGCCGTCACTATGACGGTTAAACAGATGACGATCAGGGCGATTAACATCGCCTTTTGCTGCTTCATAGCCTGCTTCTCCTTGCCTTTCGGCACGTAAGAGGCTAACCTACATGTGCAAAGCATGAAATTGGCCTCAGATTAATGTTAAGCGTCTTGCCGGACGCGTAATGTTAACTGGGGCTTTTCTCTATCTGCCTTTTGGTGTTCATGCCTGAGGCAGATAGCCTCAAGCACCCGCAACAATTCTACTTAACTCTCCTTTTCCCGCAAACCGTTTTTATCCCCAACGCAAATTTTACCAATACCCCTTAATACATCTCCCTTGCCCTGACGATACATCCCTCTTTACACAGACCAAAATTTATGTATTATCGCTTGAAAACAATCATTTAAGAGCTATCGGTGGGTGAATTCGCCCTGCGGTAGCTTTTCCTTTATGCATTGCATACATTTATGTTCTAGTATATTCCTGTATACTCAAAAGGATTTTTCATGCACAGCGTTAATTTCTATTCATTCCGCGTATTGACCCATAAAGGCAGTCGAGCCAGCAAAAAACTTAATGACTTAGGTTTAAGTAATAAAAAAACGGCATATGAACTTTTTGTTGATTATTTTACTCTTTATAAAAACACCCCCATCGAGTTCGGCGTTTCAAAAACTAAAATATCTCTGGAACAACATACTAAACTTCACTTTGATAACACAAAAAAAATCATATATGGTTATATAAAAGTTGGAAAATATGGAGAAAGCAGTGAAATAAAAGATGTAAAACTCAAAAAAGTCCATTACAGAACAACTGCTTATGATGTAACACTCAAAGAGCGTTATATTTTAATATATCTACCAGATAATCTTGAAGAAGGAATTATTGCATTCCATTCATGCGATAATATTTCTGCTCGAGGTGTTCTTTCTGATTCTATCACTGAATATCTAAAAAAACAATTTCAACTAGAAGCAAGAATCAATCCATTACATCATAAGAAAATCCCTCAATACATTCTCAATTCTGAATTGAAACAAATTAAAGCTCAAGGATATAAAGCACCAGAAGATATTGCTGATTCCTTTGGTAAAAACAAAACAAACATCAAGACAGACTTAATAATAAAAGCAAACGATGGCATATTCGGAAGTTTCAGGGATTTAAGAAACAAGAATATAGGAAACATCATTGAGATTATTGAAGATAAATGTGATGCAATAAAAGTAAGCTTACAGCTCGGCAGTCGGACTGTCGTTTTCAATTATGATACCATACTAAAAAAAGGAATTTCAGCAGAGTTAGATGATAATGATCTAAAAATCAACCCATTAACAGGTATACCTGATCTAACAGCACTTCATGACACGATAAAAAACCTTTCCAATGATATATTGGAAGAACTGCACTGTGGAAATAAAGGGGTGATTATATGAATAAAATAAATGTGCTGGGTGTAATAATAAAACACTACAAAACAATGTCAGATCAGCGTGGAACAATGTTGATGAGCGACATTACCGTACATTTTATAGTTCCTCTATCTCTTTCTTTCGTTCTGTGCTGGACATACGGAATAATGAAACCGGCAATTGCTTCCGTCTTCGTTAACTTCGGGGCTATTACAACAGCACTATTAATGAGTGCAGTAATAATGATTTATGAACAAAAACAAAAAACCATTACTAAGATATCAGATATAATTGAAGGAAACAAATCAAGAGACAAATTGATATCATTAAACACTAACAAAACCATATATGAGCAGTTATGCCACAACGTCGCTTATGCAATATTAACTTCAATAGTATTGGTTATATTTTCAGTAATAATATATTTCCTGCCTGACAATGCAGTGGATTTAATGAAATGGTATTTTCGCGCACCCGCATATATCGTTAGCTTTTTAGCCTATACATCCTTTTTTATCACTGTCATAACTTTCTTAATGGTAATAAAAAGATTTAGCACGATTTTAGACAATTAAGCAATGGAGCTACCGCCCTTTCGGGCGGTCTCCTGGTGTTCTGAGCGTGCAGGAATCCATCCGGTTAAGGATTAAAGTTTATTTACATCACTAAATTTAATTATTCATATTTGGATTATGCTTTCTCTTTCACTTCACGCAGTTCCGATTGTTAATTTGGCTCACAACAGCACCTCCTGAAAGTTTCCCCGATAAAACGCCAGTACCCGCTGCATGTACTCGCTTTTACGACACTCACGACAAATTACGTTGTGGTGCCTGTCGTAACGACGTACCTCACCGTCAGGCAGCTTCCAAATCAGGTCAGCGTCCACTTTCGCCTGTTTCTTCCAGGCACGATAAGCCTCTTCTGACGGGAAAATCCCCCCTCTTCTGCCCGCCTGGTACACATCCCCACAACGTTCTGCCTTGTCCAGGTAGTGGCGGGTCGAATAAATGGTTAACCCCGTCATCCTCCTCAGCTCCGTAAGTGTTATCAGGCCGGAGTTCATGAGGTGTAACCTCGAAATTCGTAGCCTCTGCGATACGCAATACCTTTTCGGGGCTTAACTGACTACGGCCAGTAGTAACAAGGCTAATCATCGATTGCGAACAACCAGCCAGCGTGGCCAAACAAGACTGTCGTACACGATTTTTTTTCAAATATTCATCTAACGTCATAAAGGTCACCTTAGTAATTCTCGCTAAATATTAACCATACTAATTTAAATGATCAATACTTATATCAGTTTGAGTTTATGAGTCACATTCATAAGATGAGCGCATGAGAAAAAAACGTGAAGAAATAGCTCCACCAGAAGCTACCCAGCGCTTACGCGCCATCTGGGACGCCAAAAAGCGAGACCTCAAACTTACTCAGGAGATCGCCGCTGATCTTATGGGCTTTGAGACACAATCTACCGTCAGTCACTATTTGAACGGTAAGGCACCTCTCAACACTGATGCGGCCTTAAAATTTTCTGTTCTATTGAGAGTTAAACCCGAAGAGTTACGACCGGATTTGGCCGATCTAATGAACTACGTTCGTTCTTCTGGTACTTACGATGACAACTTCGAAGGTGGTGGCTGGCGAATGGTGAGCAGACAACAGGCTGATTTACTAAACCTTTTTGATATACTCCCTGAATCTGAGAAAGAAAAACTCATTGACCGGCTTAAAGGTCAGAATGAGCTATATAAAGAGGCTTTCCAGAACATGCTCGCTGCACAAAAGCGCCTAAAAAATCAGTGACGAACCAACCACATCAAACCGCCTCTCCCGGCGGTTTTCTTTTGGCTAAAGCTCCTCCCTCCCCCTCTTGCATCAGATAAAACACATTTATTTTCATTAGGATAGATCATATTTATGATATATATGTATATTTATCTTGATCAACTATATGAATATCGCTAATATAATCTCAGAAACAGCACGGCGCTGTAGGTTTTAGTTCCGCCACCCGGCGTTAAGGGGAGAGGGAAAGATGGGAAGGAATGAAGTGATTCAGTATTTGATGGATAGTTGCAACGTCAGCTTTAGCGCAGCTCTCCAAGCATTGCGCGACAATGGATGGGATATGTTTTTGGCTCAATGCGAGCTACAGGAACAGTATTATCCGGGGTGATAATGGACAAGCTGCAAAAAATCCACCTCGGCAATAACGAATCCCTGGTGTGTGGCGTGTTCCCCAACCAGGATGGAACGTTCACCGCCATGACGTACACCAGAAGCAGGACGTTTAAAACTGAAACAGGCGCACGTCGCTGGTTAGAAAGAAACTCAGGTGAGTGATATGGATTTCGACACAATCATGGAAAAGGCTTACGAAGAATACTTTGAAAGCCTTGACGAAGGAGAAGAAGCACTCAGTTTCAGTGAGTTTCTGCTGGCGCTTTCAGCTAACGGCTAATATCGAACCGTTTTGTGCAGGGATTCCAGGTGGAAAGCATTAACGACTCATACTCGGGTTCGATTTTGAACATCACAGAAACACCGTCGATACACGAACCAGCCGGACGAACGATACCTGCCTTAACCAGAGCATCTGCATCAGGGTTATCCCAGGACGCCCGGAAAGTGGGAGAATGAGTTTTAAGAAAGGGTTCAAGTAAAAACAGCTGTTCAGTACTCAATGAATTAAGCGTTTTTAACATGCGTTTTGCCCTGCCCCTACGCTGGCACAACGGCAGCGCCGACACGAAAAAATAACCAGTGACTTTGAGGATCAGCACTATAAGGTAAGCCACAGCAAAACTAAAAATCTGAACGGCATATGGTAAATCGCTACGCGCCTCAATAAGCTCCGTAATATCTCTAGGAATGAAGATGATAATTAAGAAAAATAAAGCCACTGTGAGCATAAATTGTCCGACGGATTTACCAATTAGGAATTTTACGATAACCAGAGCGGTTTCTGACATATCAATAACTCTCAACTGTAAGGGTATTGAAATGTTAACACAGGTTCTCGCTGTAGGGGTATAGCCGAGACCACCGAAGCCCGGAGGTGGTTAAATAAAGCCGGGCACAACACGAAGGCGCATTTCTGATGTTTTCTGAGTCGGTCTTGTCTGTAAATCCAAATAGTGGAAGTGCGCCTCCGGTTGTAGTTGCCACTGCGACAATAATGCTGTGTGTAGTACTTGGCGGCATCAGTTTTTCTTAGTCCTTTCTGATGTCCGCCCTTTTTAAAGTGAATTTTGTGATGCGGTGAATGCGGCTAAGCGCACGCGGCACAGTTAAAACTCCCTGAATCAGTATGGGTGGTTTAAGTCGGCATTAATTGTTAACTGGTTAATGTCACCTGGAGGCACCAGGCACCGCACCACAAAATTTATTTACCAGAAATGGAGGGGCTATGATTGCTCATCACTTCGGAACGGATGAAATACCGCGTCAGTGTATTACGCCGGGAGATTATGTTATCCATGATGGTCGTACTTATATCGCCTCAGCGAATAACATTAAAAAACGCCGTTTATATATCCGTGATTTAACAACGCAAAGATGTATTACCGATTGCATGGTAAAAGTCTGGCTGAACAGAAATGGTCTGCCTGCCAAAGCTGAATCATGGTAACAGAACAGTAATCGTTTAAACCATCCTGTTTTTAAATATGCCTGCAATGGCAGGAATTCACTCAACCTGAAAAAAGGAATCTATATGAAAAATGTACCTGAATCAGTAATTGCAGAGCTCCGCCAGCTTTCAGGAAAAATTCGTACGTTATGTATCGAAAACAATATGCCCTGTGTTGTTTCTTATGCCCGGGACTGTGACGATGAAAGTGTTTCCAGAACTCTTGTTGCATATACAGACAGTGAAACAGGCGCATATGACAGGTCAATAACAGCCGCAATAATGCTGTTAAAAATGAACGAAGCTCCTCCCGAAAATTTTATTTCATTGTTGAAACTGATGGAGTGTAAAGAGCTCGTCACGAACGCATTCTGCTCAATGAAAAATGAAAGTCTTCATTAAGTATGATTTGTAATAAGGGTAATGATAATGAGCGACAATAAAACAGAATACTCATATTATATTAAGGTTAAAAATGAAAGCGCCCGGAAACGTCTCGGCTTCCCTTTTGCTTTCTGGTGGAAAACTGAAAGCAGCGAAGCCGCTGCCACAGCACGCCTTGCCGTATCAATGCTTGACGCCGGATTCGAACCGACAGATTTTGCAAAACCGGTTCGCGTTAATTCCCCCGCTGTTAACGAACTTCCGCCGGAGGGAAGTTTTGATACCACCTTCTGTCAGAAATATGAGCTGGGCGGCGAAGATGGCAAAACATTTATGCTCATCCCCGGCACGCCCGCTACTGACGCCCACGACGAAAAAACGGAGGAATGCGCCGACGACGCTGGCACCGAAGAAAGCGGGACAGACACCAGTGACAACGACGAATGTCAGGACTGCGAAGTTTCCGTCGCCACCCTGCCGTTCCCCCAGCGCGTGTTGCACATTTTTACTTACGCTGCCACAGACAAAAAATATTTGCATCACGCCACCCGCGCTCAACGCAGGCATATTACCGTTCTCGAAATGGAACAGGAAAACAGCTATATCCAGAACCTGTTAATGGTATTGCGGAAGTCTGAACAGGTTCATGCCCAGGATGAGTAAAGGAAAACAGAAACCTTATTACTCGGTAAGCAGTTTAGGGGCAAGGTGGAATGCGGCAGTAAAACGTGCTGGTATTCGCCGCCGTAATCCGTACCATACGCGACATACTTTTGCCTGCTGGCTGTTGACGGCAGGAGCGAACCCGGCATTTATCGCCAGCCAAATGGGGCATGAAACTGCGCAGATGGTGTATGAAATTTACGGTATGTGGATTGATGACATGAACGACGAACAAGTAGCGATGTTGAATGCACGGTTATCGTAGTTGCAAAGTTTGCCCCCAATTTGCCCCATTTAGTACCAGAGAACTGAAATAATGCAAGAAATTCAAAAGAATACAAAGAAAGAACAATACAACCTCAACAAGTTGCAAAAGCGCCTGCGCCGTAACGTTGGCGAAGCGATTGCCGATTTTAATATGATTGAAGAAGGCGATCGCATTATGGTTTGCCTTTCTGGCGGCAAAGATAGCTATACGATGCTGGAAATTTTACGTAATTTGCAGCAAAGCGCCCCGATCAATTTTTCACTGGTCGCCGTCAACCTCGATCAAAAGCAGCCAGGTTTTCCGGAACATATCCTGCCAGCCTACCTTGAGCAGCTGGGCGTAGAATATAAAATCGTCGAAGAAAACACCTACGGCATTGTGAAAGAAAAGATTCCGGAAGGAAAAACCACCTGCTCGCTGTGCTCGCGTTTGCGTCGGGGTATCCTGTATCGTACGGCGACTGAACTGGGCGCGACCAAAATCGCCCTGGGCCACCATCGCGACGATATTCTGCAAACCCTGTTTCTGAATATGTTCTATGGCGGAAAAATGAAAGGGATGCCGCCGAAACTGATGAGCGATGACGGCAAACATATCGTGATCCGCCCGCTGGCTTACTGCCGCGAGAAAGATATTGTCCGTTTTGCTGAGGCCAAAGCCTTCCCTATCATTCCTTGTAATCTGTGCGGTTCGCAACCAAACCTGCAACGCCAGGTGATTGCCGACATGCTACGCGACTGGGATAAGCGCTATCCTGGACGGATCGAGACGATGTTTAGCGCCATGCAGAATGTCGTGCCGTCTCACCTTTGTGACACTAACCTGTTCGATTTCAAAGGAATCACTCACGGTTCCGAGGTCGTCGACGGCGGCGATTTAGCGTTCGATCGTGAAGAGATTCCCTTGCAGCCCGCTGGCTGGCAGCCGGAAGAAGATGACACCGCCTTAGAGGCGTTGCGGCTTGATGTTATCGAAGTGAAATAATCTGCAGGCGTCTCAGCACTCCGCTGAGACGCCATGCTATCGATCATTTTAATAGCCGTACCCGGCATGACTTGCCTTTGATCTTCCCGTTTTGCAACTGCTTCCAGGCTTTTTGCGCTACTGCTTGACGTACGGCGACGTAAACGTGCATTGGATGCACGTTAATTTTGCCAATATCCGCCCCGTCTAATCCAATATCGCCGGTCAGCGCGCCCAAAATATCTCCCGGACGCATTTTCGCTTTTTTGCCGCCGTCAATGCATAGGGTAGCCATCTCTGCGGCCAGAGGGAGTGACGGCTGCCGGGCGGGCGCATTCAGCCAGTTCAGCTTGAGTTGCAGCATTTCTGAAAGAATATTCGCCCGCTGCGCCTCTTCCGGCGCGCAGAAACTGATCGCCAGGCCGCTGCTTCCCGCGCGCGCCGTACGGCCAATACGATGGACATGCACCTCCGGGTCCCAGGCCAGTTCATAGTTAACCACCAGTTCGAGCGATTTAATGTCTAATCCTCGCGCGGCAACGTCGGTGGCAACCAGAATGCGCGCGCTACCGTTTGCAAAACGCACCAACGTCTGGTCGCGGTCGCGTTGTTCCAGATCGCCGTGGAGCGCCAACGCGCTTTGTCCTACCGCATTAAGCGCATCACAAACGGCCTGACAATCTTTTTTGGTATTGCAAAATACCACGCAGGACGCTGGCTGATGCTGGCTAAGCAACGTTTGTAGCAGCGAAATTTTTTCATGCGCAGACGTTTCGAAGAACTGTTGTTCGATAGCCGGTAGCGCATCTACCGTATCGATTTCAATACGTATTGGCTGCTGCTGTACACGACCGCTAATCGCCGCGATGGCCTCAGGCCAGGTTGCTGAAAACAATAACGTCTGGCGCGTCGCAGGCGCAAAGCGGATCACCTCATCAATGGCGTCACTGAATCCCATGTCCAGCATTCGGTCTGCTTCATCCATTACCAGAATATGCAGCGCATCCAGCGATACGGTTTCTTTTTGTAAATGATCCAGCAGGCGCCCCGGCGTCGCGACAATGATATGCGGAGCGTGCTGAAGCGAGTCGCGCTGTGCGCCAAAGGGTTGCCCGCCACACAAGGTCAGAATTTTGGTATTTGGCAGAAAACGGGCCAGGCGACGTAACTCTCCGGCAACCTGATCCGCCAGCTCCCGCGTCGGGCACAGCACTAATGCCTGTGTCTGGAACAGAGTGACGTCAATTCGATGCAAGAGCCCAAGACCAAACGCCGCCGTTTTGCCGCTACCGGTCCTGGCCTGCACACGCACATCATTACCCGCCAGAATGACGGGTAATGCTGCGGCCTGAACAGGCGTCATCTCAAGATAGCCCAGCTCAGTAAGGTTATTGAGCTGGGCGGCGGGCAAAACATTCAGGGTTGAAAAAGCGGTCAC
Protein sequences of DBSCAN-SWA_5 >NZ_CP043433|2436624:2452568|2442604_2443204_-|WP_000940751.1|DBSCAN-SWA MAHLQLVKQTSSGLLLPATPESEDFLRSVKIGAWIHADFKRVRNYAFHKRFFKLLQLGFDYWTPIGGAILPQEQELITGFVDFLCESAAQGHSPALSDAAEQYLHKVAVNRTLDVALLKSFDAFREWVTIQAGFYTEHYYPDGSRGRRAKSIAFANMDETEFQQVYKAVLNVLWNWILFRKFSSPEEVENVAAQLLEFA >NZ_CP043433|2436624:2452568|2443727_2443940_-|WP_000882662.1|DBSCAN-SWA MLCTCRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >NZ_CP043433|2436624:2452568|2437439_2437592_-|WP_000089155.1|DBSCAN-SWA MSRMVRITATNASTFLWQDTQWFKSVFILSSVLLKISSRPEKYWNEEMYD >NZ_CP043433|2436624:2452568|2444309_2445242_+|WP_000556390.1|DBSCAN-SWA MHSVNFYSFRVLTHKGSRASKKLNDLGLSNKKTAYELFVDYFTLYKNTPIEFGVSKTKISLEQHTKLHFDNTKKIIYGYIKVGKYGESSEIKDVKLKKVHYRTTAYDVTLKERYILIYLPDNLEEGIIAFHSCDNISARGVLSDSITEYLKKQFQLEARINPLHHKKIPQYILNSELKQIKAQGYKAPEDIADSFGKNKTNIKTDLIIKANDGIFGSFRDLRNKNIGNIIEIIEDKCDAIKVSLQLGSRTVVFNYDTILKKGISAELDDNDLKINPLTGIPDLTALHDTIKNLSNDILEELHCGNKGVII >NZ_CP043433|2436624:2452568|2438834_2439116_-|WP_000445513.1|holin|DBSCAN-SWA MESNLTGTLNAGLCLVTVLALFLYRRNGARYKPGIAWLSYLLMLGYALVPFRFLAGHYPSSSWPVVLMNALFCGLVLWARGNVSKILSLLRLR >NZ_CP043433|2436624:2452568|2437108_2437447_-|WP_000159240.1|DBSCAN-SWA MTKEAVIFLFIAIVVEVIATISLKLSDSFTRLVPSLVTIIGYCIAFWCLTIPMRTIPAGIIYAIWSGVGIVLIGLIGWLFLGQKLDMPAIIGMLLIICGVIVINLFSKSVSH >NZ_CP043433|2436624:2452568|2447671_2448193_-|WP_000004762.1|DBSCAN-SWA MSETALVIVKFLIGKSVGQFMLTVALFFLIIIFIPRDITELIEARSDLPYAVQIFSFAVAYLIVLILKVTGYFFVSALPLCQRRGRAKRMLKTLNSLSTEQLFLLEPFLKTHSPTFRASWDNPDADALVKAGIVRPAGSCIDGVSVMFKIEPEYESLMLSTWNPCTKRFDISR >NZ_CP043433|2436624:2452568|2451194_2452568_-|WP_000123686.1|DBSCAN-SWA MTAFSTLNVLPAAQLNNLTELGYLEMTPVQAAALPVILAGNDVRVQARTGSGKTAAFGLGLLHRIDVTLFQTQALVLCPTRELADQVAGELRRLARFLPNTKILTLCGGQPFGAQRDSLQHAPHIIVATPGRLLDHLQKETVSLDALHILVMDEADRMLDMGFSDAIDEVIRFAPATRQTLLFSATWPEAIAAISGRVQQQPIRIEIDTVDALPAIEQQFFETSAHEKISLLQTLLSQHQPASCVVFCNTKKDCQAVCDALNAVGQSALALHGDLEQRDRDQTLVRFANGSARILVATDVAARGLDIKSLELVVNYELAWDPEVHVHRIGRTARAGSSGLAISFCAPEEAQRANILSEMLQLKLNWLNAPARQPSLPLAAEMATLCIDGGKKAKMRPGDILGALTGDIGLDGADIGKINVHPMHVYVAVRQAVAQKAWKQLQNGKIKGKSCRVRLLK >NZ_CP043433|2436624:2452568|2438292_2438838_-|WP_000802786.1|DBSCAN-SWA MKPKDEIFDEILGKEGGYVNHPDDKGGPTKWGITEKVARAHGYRGDMRNLTRGQALEILETDYWYGPRFDRVAKASPDVAAELCDTGVNMGPSVAAKMLQRWLNVFNQGGRLYPDMDTDGRIGPRTLNALRVYLEKRGKDGERVLLVALNCTQGERYLELAEKREADESFVYGWMKERVLI >NZ_CP043433|2436624:2452568|2441781_2442318_-|WP_000640113.1|DBSCAN-SWA MIYPTNTGKSGEHLRLSTLESVWIQGKLRMWGRWSYIGGGKTGNMFNQLLASKKLTKTAINDALRRMKKAGLEKPELEVFLREMINGKQKSWLAHCTDTEALIIDRVVGEVLTDHPGLLGILNQRYVGRGMSKRRMAELLNEQYPEWALITCRRRVEQWLSIAEFILYSPMRKAFDYA >NZ_CP043433|2436624:2452568|2436624_2437059_+|WP_001082296.1|DBSCAN-SWA MNRTILVPIDISDSELTQRVISHVEAEAKIDDAKVHFLTVIPSLPYYASLGLAYSAELPAMDDLKAEAKSQLEAIIKKFNLPADRVQAHVAEGSPKDKILEMAKKLPADMVIIASHRPDITTYLLGSNAAAVVRHAECSVLVVR >NZ_CP043433|2436624:2452568|2445238_2445793_+|WP_001033796.1|DBSCAN-SWA MNKINVLGVIIKHYKTMSDQRGTMLMSDITVHFIVPLSLSFVLCWTYGIMKPAIASVFVNFGAITTALLMSAVIMIYEQKQKTITKISDIIEGNKSRDKLISLNTNKTIYEQLCHNVAYAILTSIVLVIFSVIIYFLPDNAVDLMKWYFRAPAYIVSFLAYTSFFITVITFLMVIKRFSTILDN >NZ_CP043433|2436624:2452568|2445954_2446284_-|WP_001676916.1|DBSCAN-SWA MNSGLITLTELRRMTGLTIYSTRHYLDKAERCGDVYQAGRRGGIFPSEEAYRAWKKQAKVDADLIWKLPDGEVRRYDRHHNVICRECRKSEYMQRVLAFYRGNFQEVLL >NZ_CP043433|2436624:2452568|2439215_2439611_-|WP_000900605.1|DBSCAN-SWA MLGIFRKKTRKAIVEVKKMENRDAVEATVWGAYSIAYADGTCDAKEIAVLEKTIAALPAFAPFSGEIAQMSANIRARYEASPRSANAQALRELADVAGTAEAVDVLCLYWPRSCPALLTTTVNRWMRCVQW >NZ_CP043433|2436624:2452568|2450215_2451151_+|WP_001156217.1|tRNA|DBSCAN-SWA MQEIQKNTKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGFPEHILPAYLEQLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIVRFAEAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIETMFSAMQNVVPSHLCDTNLFDFKGITHGSEVVDGGDLAFDREEIPLQPAGWQPEEDDTALEALRLDVIEVK >NZ_CP043433|2436624:2452568|2442314_2442605_-|WP_000774470.1|DBSCAN-SWA MADLRKAARGLMCTVRIPGHCNHNPETSVLAHYRLAGTCGTATKPNDMQAAIACSSCHDIVDGRVKIDDFTKTEIRLMHAEGVFRTQEIWREKGIL >NZ_CP043433|2436624:2452568|2449281_2449899_+|WP_001676915.1|DBSCAN-SWA MSDNKTEYSYYIKVKNESARKRLGFPFAFWWKTESSEAAATARLAVSMLDAGFEPTDFAKPVRVNSPAVNELPPEGSFDTTFCQKYELGGEDGKTFMLIPGTPATDAHDEKTEECADDAGTEESGTDTSDNDECQDCEVSVATLPFPQRVLHIFTYAATDKKYLHHATRAQRRHITVLEMEQENSYIQNLLMVLRKSEQVHAQDE >NZ_CP043433|2436624:2452568|2439105_2439294_-|WP_001688615.1|DBSCAN-SWA MPVLASFLSRLADYNGKPLDALCAVVMSVLSVKFLTFIHDQDISSLTGVFSRMRGGGSGHGK >NZ_CP043433|2436624:2452568|2446556_2447024_+|WP_001227859.1|DBSCAN-SWA MRKKREEIAPPEATQRLRAIWDAKKRDLKLTQEIAADLMGFETQSTVSHYLNGKAPLNTDAALKFSVLLRVKPEELRPDLADLMNYVRSSGTYDDNFEGGGWRMVSRQQADLLNLFDILPESEKEKLIDRLKGQNELYKEAFQNMLAAQKRLKNQ >NZ_CP043433|2436624:2452568|2448936_2449254_+|WP_000800272.1|DBSCAN-SWA MKNVPESVIAELRQLSGKIRTLCIENNMPCVVSYARDCDDESVSRTLVAYTDSETGAYDRSITAAIMLLKMNEAPPENFISLLKLMECKELVTNAFCSMKNESLH >NZ_CP043433|2436624:2452568|2447408_2447564_+|WP_085981757.1|DBSCAN-SWA MQKIHLGNNESLVCGVFPNQDGTFTAMTYTRSRTFKTETGARRWLERNSGE >NZ_CP043433|2436624:2452568|2448630_2448852_+|WP_000560208.1|DBSCAN-SWA MIAHHFGTDEIPRQCITPGDYVIHDGRTYIASANNIKKRRLYIRDLTTQRCITDCMVKVWLNRNGLPAKAESW |
22 | Escherichia_phage(62.5%) | tRNA,holin | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2676993 : 2693108
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP043433|2676993:2693108|DBSCAN-SWA GTCAGTTTGTGAAGTTGCTCCCCGACCGGGAAACCATCACCAGCGGCCAGACGGAAGCAGACGTGGTGTACTGCCCACGGACCCTCAGAGAGACGCTGATATCCACGACAGGTGAAGTGGTGTAGACCGAAAAGACGACGGTCTGATACATGGCCGGAATCCCTGCGGTATACGGCATAACCTCCGCCGTTTTCACCTGGCCGTTAATATTTATCGTGACGGTGATGGCACCGGTGCCACCGTTACGCTCACAGTTAGCCATCACCGTGATGGTTTTCCCTATCTGATAGGTGGCGCTGTCGGTATACCGTGTTGAGGTGCTGCGTTCGTCGTTCGTCGCCCTGATGCTCACGCCCTGCATGACTTTTGAGCCGCAGATATCACCGACAAACTCTCTTGCTTCTATCACGCCAGAAAACTTACCGGAGGTGGCATTGATTTCTCCCGTAAACGAGCCAGATACAGCGTTGATATGGCCGCTGATATCCGCATTTTTCGCAGTCAGCTTTCCATCCGGCGTCAGGGAAAATGCCGGAGGATTCCCGCCACTGGTAATGGTCGGCGCGCTCAGGTATTTCAGGAACGCCTCATTCATGATTATCTGGTCGCCCTGCATGACGAATCCGGGCGTCTCGTTTCCGTTTGCCGGGTTAATATAAGCAATGCGATCCGCCGCCACCAGGAACTGGCTTATCTTCCCGTCAGGCGTGTCTTCCATGCTCAGTCCAAGTCCGGCCACATAATATTTGCCGTCTTTGGTCTGCTCTATTTTGACGCCCCACGTGGCGCTCCATTTATCGTTAGCGTCCTGCCACTCCTTCGAAAATTGTTGCAGTTTGCTGGCGTTATCCTCCGTCAGGTCAATTTTTTTTCAGCAACTCCGTACCAAGATACGTCTCCGTAATCAGTCCCTTAAAAAAATCCAGATACCCTTTCGCGTCATCCCCCGGACGTCCGGATGCTTCCGCAAATACTGATTTTCCAGCCAGATTTACACTGCGCACGTAAAACCAGGCATCATGCAGTGGTTTCAGTCCATCCTTTATCCAGAATGACCCGACGCCCAGATACTGTGCTTTTGACTGAATATCGGCGGCAGTCGCCAGTTGCGTGGCGGAGTACCAGAACTCATACTGCACACTGGCATCGTAGACAGTCTGGTGCGGCGTCACCGTTATCTGAAAATAACCCGGCGTCATCTCAATCGTGGATGGCGCTTCCGGTGCCTGAATACTGAATGCCACGGACGCCGGTTCACCCTGCTGCCCGTAACCGTTTATTGCCCTGACTGTCAGCGTGTAGTCACCCAGTGGCAGTTCGTGGAAGGCGTACTCCGTTTCGCTGGTCGTCGCCGTTGTCACCAGACGAACCGGATCGCCCTCGTTCCCACTGCCTGTGGTCAGCCTCACCACAAAACGCACATCTTTTACCACCCGCGGCGTGCCCCACTTCGCTTTGGCCTGATACAGGGTACTGTCGTTATCCGTGCTGACTGTCAGATGCTGCACAGCGGGCGGAATAATGCTGTTGGTGGTCCCCGGTAACGGGTCAAAGTGCGCCCCGTTGTCCACGATGGACTCTTTTTCCGGAACGTGCTGCAAGGCAGTGATGGCGTATGTGCCGTCGTCATTCTCCTTAATACGCACGCAACGGAAAAGGCGGCGCTTCAGGGAGGGCAGTTTCAGCCCCCAGATACTGTATGGCTGCACGGTTTCCGGCAGGACTTTCGTTACCACCCGATCCGGTGCGGGCTGCGACTGAATCTCCGTACTGAACGGCTTACCGTCAGGCCCGACAATATTCAGCGTGGTGGCGCCGCTTTCCGGTAGTGTTATTTCCCGGTCAAGCGTCAGCGTGCGGGTGGAAATATCCAGGTCAGTGATACGCCCACCGACCGACGCCCCGGCGTAATCGTTGTCGCAGACCTCAATAATATCGCCCGGTGTATGACGCAGACCTTCCGCACCGACAGAAAAATCCACGGTCTGCGTTTCCAGCAGCTCCGTCATCATCACCCACAACCCCGTCCGGTGCGCCTGTCCACGTGAGGTACAGCCGAACGCGTCCATTTTCAGCAGATTGCGTCCATAACGGGCCTGTGAGGCATGGTCTTCCACCAGCTCCGTGGAGGTTTGCCAGCCATTCAGCGGATCGGTGTATCTCACTTCTATCGCGTTATGGCGGTCTTTCAGGGCACTGAAGCTGTATTTAAAGCGCCCGCCCACCACGTTACCGTTGGTGTAGGTCCATGCTTTATCGGAGGGGCGGTCCTGGATGAAGGTCATTTTGCGGCCATTCCATACCGGCATACAACGCATCACCGAGCAGAAATCCGCCAGAACGTCATACGCCTTACGCTGGGTGGTAATATACGCATTAAGCGTCATGCGGGGTTCCGTGCCGCCAAATCCGTCCGGCACCGGTTGATCGCAGTACTGCGCGATGGCGTACAGCGCCCATTTATCCACATCCGCCCCCCCGATACGCCTGCCCAGCCCGTAACGGGGGTGGGTCAGTTTATCCATCGTGCACCACGCCGGGTTATTCGTGTACGCCGGTTTAAACGCCCCGTCCCACAGGCCGGTATATGTGCGGGTATCCGGGTCATAGTTTGAGGGGACCTGAAAAATACGTCCGCGCAGGTGGTAGTTACGCGTGACCTGCTGGCTGCCGAACTGTTCCGCATCCACCAGCAGACCGGCAACCGCTGTGCCAGGATAACCCTGCCGGATATCGATGATTTCCGTATACGACGACCACAGCGTTTTGTTCTGAAGCCTGTCGGTGGTGCTGTCCGGTGTCACCCTGACCATGCGGACACTGAACGGGCGCGGCGGTAAATTATCAGCCACTACCGATGCCAGATATTGTGTTGTGATCTTGCCGTTAATAGTGATATCAAATTCTGTGTTCCAGATCCCGCTACGCTGAAACTGTATCAGCAGATTCACGGAGGACGGGTTACGGTCCCCCTTGTCCATGGTCTCCTGCAGCATCTGTACACCAAAGGTGAAGCGTAGCCGGTCGACATTCTCTGAGACAACAGTACGGGTAACGGGATTATCGTGTTTCACTTCCACACCCAGCACCGTTTCCGCGCCGGAAGCCTCAAAACCTTCCAGCGGTGCCTGTGGTGTCTCCCCCACCTGATATACCACGGTCACGCCGTGAATATTACTGTTACCGTCCGCGTCCACCACCGGCGTGTTATTAATCAGCACGCTCTGCAGACCGTTCCGCTTTGTATCTGGAGGGTACGATCCGCACTGCTGAGACAGGGCTTACCGTCAGGATGGCTGTGTACCAGCGCCACGATGTCGCCGCGGTTCCGGGCATTCAGGTAATCCTCCGGGGATATACGAAAATACATCGTGGGTTCAGCAGACAGATTTTCACACGGAAAATACCGCTCTCCCTGTGCTGTTCTGACCACATAACCGCACGATTCCGCAGGCGCACACTGTCGGGCATGTGCCAGAATGTCATCGTTAATCATGGGAACCTGTTAAGACAGTTTGTTGATGGAAGCGAAAAATCCGGCATTCACCAGATTGTTACGCATTTCACAGCCTTTCATGCAGTGGCTGCATTTATCCTTTTTCGGGTCTGAGGTGGGCTTATCGAACTCATCGGCCACGGGCGGGCCGTCGTATCCGCAGTTTTCATCCCGGTAATCCCACGGACAGGAGTCCGCCAGCATGGTACGCCCCGGCACCACAGAACCGTCGGTTTCTGCCGGTGATGCCAGAATAATGGTAGCAGTTGATGAATCCAGTTCTGACAACTGCTCCACGTTATAGCGCGCTACCGCCTCCTGCTCCGGGTCAGCGCCCGGATTGCCGTTACTGAAATTCACCGCATCAAGAAACTTGCTGTAAACCTGATGCCTTACCACTGACGCGCCGACGAGACTTTGCAAATCCTCCGCCATCCCCGTGACCAGACCAAAGAGATTGGCAACAACGAGGTTCGGGCGGGGAGATGCGCCTTTCCCGTTCATCTCAAAATCCTGTACCTGTATCGGGTACGGTTCGTACTGCCTCCCCTGCCAGGTTAACGGCTCGCCTTTTTCGTTCGGTTCGTTACAGAAGAAAAAGCGCTCACCGCCAATCGCGGTTAAATCAAATTCCCACAAATCCACCTTCGCGGACTGCTCCGCTTTGGTGGTCTCGCTCAAGGTTTCCTGTGGTATATCCTGCATATATGAGAGATCCTTTATTATTTATCTTGCAAAAATATACCTGCTTTTATTAATGGTATTTACGATACAACCAAAAAACGAGGTAACTAATGAAATACACAATATTGTCGCTGGTAGCTGGTGCGCTCATCAGTTGTTCAGCAATGGCAGAGAATACCCTGACTGTAAAGATGAACGATGCCCTGTCCAGCGGAACAGGAGAAAACATAGGTGAAATCACAGTTTCAGAGACACCTTACGGTCTGCTTTTCACTCCTCACCTAAATGGTCTTACGCCAGGAATTCACGGCTTCCATGTCCACACAAACCCAAGTTGTATGCCGGGAATGAAAGACGGTAAAGAGGTTCCGGCGCTCATGGCCGGAGGACATCTTGACCCCGAAAAAACCGGGAAACATCTTGGCCCATATAATGACAAAGGGCATTTGGGGGATCTGCCTGGACTGGTTGTCAATGCAGATGGTACAGCCACGTATCCGTTACTGGCACCACGCCTTAAATCACTGTCAGAACTGAAAGGTCACTCATTGATGATCCATAAAGGCGGTGACAATTACTCCGATAAACCTGCTCCACTGGGTGGTGGCGGTGCACGTTTTGCCTGTGGTGTCATTGAGAAATAACAGCAACATAGCCATATCGTCATAATTTCGTTTTACCCATAAAAAAGCCCTCTCACTGGAGGGCATTAAATCTGTATCGATGTTAAAGGTCAGAAGCTGTAACCTACGCCAAGCACCTAGGTTCCAGCTTTGACGTCACTGTCAGCATCAGTGGAAAAACTTGTATGCTCATAAGACGCATTAACGGCAATATTTTCAACCGGGTTAAGCTGAATACCTGCCCCATAAGCAAAGGCGGTTTTATTGTCAGAATTTCCCCAGTTATCCTTAATATGTCCGTTTGCTGCACCAATCATCACGTAAGCATTCAGATAGTCGTTAAAACGGTATGAAGGACCAACAAGAAGGGAGGTATAATCAGCATCACCTACCTTAAACGCTCAGCACCTCGCTGGCCGGGATCAGAGTTCTCATCTCCTTTTCCAGCTCAATACGCGTCATTTCCGACTGAAACCATGCCCGTCGATCTGAGGGTTTCATCTTATTGGGGTCATTCTCCCCGGTAACAGCCGGCATCGTCATCATGGCGGTCAGAATATCCACCAGCCGGTAGATTTTCAGGTTACTACCGTTCCCACCTGAGGTTTTTACGCCCTTCAGCCTGCTGGCGATGGTCTGTCGGTGTGCACCAGTGATGGCCGAAAGCTGTGTGATGTTTAATTCGAGGCTTTTAATTTCCTGATCCACAGTCGTGCTCTTTTCCTGTATACGGTGAAAATGGCGTTCAGTGTCGAACAAAAAACGTACCACCTCGACACTGAAAACAGTAAATGTATTGATTTTTAAGGTTATTTTTCAGTGCTGACAGAGACTAAAAAATCAAAAATCAGCCGATTCCCGCGAGCCCGAAGCCACCCGTGGCGCCCCCTGCCCGGGAGTACCTTTTTAATACAGTCACCATTGGTTACTAGTTTTTCCTGCATTACCGTGGCATTGGGTGCGAATGTACGCCTGCGCCCCTTCCAGTTGCTTTTGCATCGTCTTCACTCGCTCTTTGAGGGCGAAATAATCCCGTTGAGCGGAGTCTGCCAGTCTGGGGCTGGCTGCATTATCCATGCGGGCGGTGGAGGTGGATTTACCTGTCGGCACTGCGGGGCATGTTGCGTTGACGTACAGGCGACGGCGGCCAGCGGCAACATCATCGCGCAAAGCATCATTCTCAGCTTTCGCATCGGCTAATTCCTTCGTGTATTTTTCATCGAGGGCGGCAACGTCACGCTGGCGCTTAGTCATGTCGGTAATTGTCGCGTTCGCCAGCGTCAGCCTATGAGTAACGGTATCGCGCTGCTCTTTGTAGGTGATGGCGTTATTTCGGTAGTGATTTGCCAGCCGACCGGCAACAATTAGCGAGACAAGCAACAGGCCAACAAACATCGTTTTCCAGTTGAACATCATGACAGGAACAGAGCACGCTCCGCCTCACGCCGACGGGTAAGCCCGTTCAGTACTTTGCCACCAGCCTTATTCCAGCGCAGGAACTCATCAGCGGCGCCAGCGTAATCACCAGCGTTTAGCTTCCGCAGCAGAGTTGATGAGGATAATGTCCGGGCGCCGAGGTTGTACGCGAACGACACCAGCGCATCAAACTGGCCTTGCGTCAACTTGACCTTAACCAGTCTGGACACATCATTTTCATAACCGACTAAACCAGTGTTAAGCAAGCGCTCGGCAGTAGCCTCGTCAATCATCATTCCGGGCTTAACTGGCTTACCGTCAACAGAGTGGGTCCAGCCATAACCAATCGTCCAGGGATCTCCCCCCGTTCCCGGGTCCGGATAAGCTGTCAGGCTACAACCTTCAAACTCTTTGATTAGGGTAATGCCTTTTTCACTGATTCTCATCATTAACCCCTGCACGTTTTTTGAGTGCGCTAATTGCGATTTCGCGCAGCTTGTCCACACCGACAAAGCCAATAATTCCGCCAACGAAAGGCGAAATGGAAACCGGCAGGCCTACCACATCAAGCGCACTGGTGACACATAAGGAAAGAGCGCCACACAGGACGCCCTCAAGCCATTTATTTTTACGGGTGGCGCCGTCGTATATCAGTCGGCCGTAGGCAATGAGTCCGGCCATTAACGCCCCAAGTATCTGGGGCCACGCATTTTTGAGTCCGGTCAAAACCGCAGCCCAGAATTCAGGAGTCTTGTCATTCATTTTCATAAGCCTCACCTCCGATGATTTCGGATGGTAACTAGAGTGAGTGAAATGGTTGGGTTGCAGGGTTTAATATCTTGTAAAACAGGATTGCCTGTGGTTGCAGAATCTGAAAGTAAAATCACGCAGAGTACAATTTTAATGGAGGTGAGGCACAAATACTGCAAATTTAGCTTTTAGTTTAATTGATTGCGTGCTGAGTGAATTCTGTTTGACAAAAACATGCTATTTATAGAATGTTAATTCCATGTAATAAAAAGGATGTGTAACTCATCATGCCAACGGGAATTAAACCAATATTTATCAATAATATGATGTCAACATATGGATTATCCCATCCTCATGACAGCAAGGTATTTCCAGACCTTCCAGAACACCAAGATAATCCTTCGCAATTACGCCTCCAACATGATGGTCTTGCTACCGATGATAAAGCCAGGCTGGAACCAATGTGTCTTGCTGAATACCTTATCTCTGGACCAGGAGGAATGGATCCTGATATCGAAATTGATGATGATACCTATGATGAATGCCGTGAGGTGCTATCACGCATACTTGAAGATGCATACACTCAAAGCGGGACATTCCGCAGACTGATGAATTATGCCTACGATCAGGAATTGCGTGATGTAGAACAACGCTGGTTGCTGGGAGCCGGAGAAAACTTTGGTACTACCGTAACTGATGAGGACCTGGAGAGTTCAGAAGGCAGAAAAGTGATTGCCCTCAACCTGGATGATACAGACGATGATTCAATACCAGAGTACTATGAAAGTAATGATGGCCCACAACAATTTGATACAACACGCTCATTTATTCACGAAGTTGTACACGCGTTGACTCACCTTCAGGACAAAGAAGACAGTAATCCAAGAGGCCCGGTAGTCGAGTATACCAATATCATTTTAAAAGAGATGGGTCACACATCACCACCAAGAATCGCCTACGAATTTAGTAATTGACACTCATCAAAAAATGCAAAATCCCACGATGCTACAACACAGTAACCAGTTCAGGTCTGAGCTAATACAGGTCAGCAGTCCATAGACACTGGCTCCTGTCAGGATGCCACCTGCTAACCCAGTACCAGAAATCGATTCGGACATTCATCCCCCTCTGGTTGTGTGGGGCCTCTCAGTTATGAGGGGAAATAATAAATATCCTCCGGCATAGCCGGAGGATATTTATTCATAAAGAACACAATTAAGAATAATACCGATTTAATTAAAATAACTTGATCTCACAGTTGAAGAATGAATAATAGCGAGCCCTGCCAAGGCAGGGCATAGAAATAACCAACGAGAAGAAATAGGTAGGAACTAATGAAAAACACCGCTCTGGGTAAGTTCATTTTTATCGTCGGCACCGCGTTACTGCTCGGTGGCTGTAGTGGCATGGTCATGCCTCCCTATGCCACCCACGGTACATCGGTCGGAATCATTGCGCCAGCGGGAGGCTATAGCGAGTGGCACACGGATAGCCGCAACCACACCACAGGAGACAGTCACAGCCAGTCACAGGGAAACTGCACCCAAAGTGAAGATAGCCAGCTCAGCGAAAATAGTCTCACACGGACACACCAAAGCAACTGTAACACCCGTAGTCAAACCCACAGCAGTAGCACCAGCAAAACCCGCTCCAGCAGCGTCGGTTTCAGCGTCGGGGGGCCTGTTGGTGCTAGCATAGGGTTGATCAAGCAGATGGAGTCGATGAACCGTGCGCCAGCCAACGATATGAGTAGTAATGAGATGTTCAAGAATTTCGGTTTCTAGCACATAACGCCACCTGGTACCGTTGTGGTGTCTGGCCCGGCGGCTATCTGTAACGACTCACAATCGAAAAAAGTCAGACTCGCAATCAGCGCAAAAATAGATTGTGAGTCCATTGAATGGGGATCGTTGTGCATTTTCATAAGCCTCGCCCCCGATAGCTTGGATGGCGCTGTATTTGTAAGGGTGAGAGGCCCTCGGGCGGGGTTTTAACAACGAAGCGTGTAGATGATGATTTCCGAGGGCTGAATAAAAAAACCGGCGAAAAGCCAGGAAGATGTAAATAAGGCCATTTCGACTCTGTGGACGAAGATACCCTAACATTAGTTTGATGTGTGGCAATTCTTACCGAGGGTGTTGAGCAATCCTCTCTAAACTATTTCTGCGAGGCTATATAAAGTTCATGAGTATCAGCTAACTGCACAAATTTGTACTTAATAGCCCCCAATAAAGTACCTAATCTTGTATGACCCTCCATAAGGTGTAAGCCGCTCTCTCCAGGAATAATAAGCGAACGCTCAATAAACATCGGTGGTTCAGCCCATGTACCGAATTTAAGCCAATGGTTTGCAACCTCTTCACGGGCATCAATGCAAAACTTGCTGCCGCAGGCATTAAAGTCTTCTGAAATCTCGAGCATGTAATCAGGATATGTGGCATTTCTGCCAAACTTTGTAAACTCTGCTGTTTTCAATCTGACCAAATCCCACTTCAGTGATTTAAGATTTAGATGCCCATACAAGGTTTGAAATTCAGAATTATTAGATAACCCACAATAAATTTGCTTAAAAATTTGTTCTGGAGCTTCGATCCCATATTGCTCACGAAGGATGGCAATTCCTTCTTCTTCCTTATACAACGGGTCGGGACCAAAAACTTGAAATAAATCACGATAAAACATCATCAATTCCTATTGCTGATTAATACAAAAATCCCGTTCCTCAGCAGGCTTGCATATTTTAGGCATGATATCAAATTTACATGAAATATATGTATTTCAGTTCGGTTTTGCAAGACTTATATCCAAATTTGTCGCCTTTTGTTGTGAACGCGATCGTGTTACGGAGATAAGCGCATCACTATCAAGCCGTTTAAAGCTGTTACGCATTACTAACCAATGAGACATGTACGTCTCTGTCCAGGTAGATTTTGCCACGCCCATCAGTTCCGCCAACGTTTGGTACTCATACGTCTCACGTCTGGCCAGTTCCGCCTTCACATCCTGCGCAGCAAGCCAGATGAGCTGGCGCAACCGTGCTAAAGTCTTCTTCGCAACTCGCTTCCCTTCCAACTGCTGGCTGAATTTTTCCCAAGCCCATCGAGTTACAGTGACCTGATATTCCCAGCAGGTGTTTTCACTGTAATTCCACAATAGCCAGGCCTTATTATGTTCTTCGAGTGACAAAAGCGCACGGCGCCATGAGGAAGTGGAGTATTCTACGGGCTGCACAAGCGGAATTGCGCTTCCTTTCGCCAGTGATTGCTTACCAGCAATCGGTGGATTATTCAGCGTTATCATTTTTCCGGTCACTTCATCCCTGATACGCTGTTTTTTTCGGGGATAGTTTTTCGTATAGAGCTGGGCGTTCTCCAGCCATGCCTGCAACTGACCTTTCGTGGCGCCACTCAGATCTGCAGTCGCCACGATGAGCTGCTGTCGCACATATTCCAGATATTGTGTATTCATACGGTACCGCCCGTTATCTTCACGTAGTTTTTCAAAATCCGGTAATCGATCAGGATGGAACCCGGAAATGGTATAAGCACAACTGCTACCAGCGAGCACGGAGATGATCGGCAAAGTAGGATTCGAATGTCATGCCGCCTCCAGCTTTTTTAGCGCACGCAGATCCGCCAGTGCAGCGATCCTGATTTCTTTCAGCTCCTCAACCGTCCAACGTTGCGGAGTATTATTGCTCTCAAGTTCCAGCACCTCCGCCTCACCGTAACGCTCAACCTTCACGCCACGGCGCCGGTATCCCGCCACCAGCTCATCCGCCTCTTCGGTGGTACACACCGGATGCTGAAACCATGTCATTTTCATGCGAACTCCAGCAGATGCGCGGCCACATTTTCAACTTCTTCCAGAGAGGAAAATTTACGAAACAGAATCCAGTTCCACAGGACGTTCAGCACAGCTTTATAGACCTGTTGAAACTCGGTTTCGTCCATACTGGCGAATGCTATGGATTTCGCCCGGCGCCCGCGACTGCCATCCGGATAAAAATGCTCGGTATAAAACCCGGCCTGAACGGTTACCCATTCCCGGAAGGCATCGAAAGACTTAAGAAGGGCGACGTCCCCGGTTCGCAGGGTAGCTACGTTATGGAGGTACTGTTCCGCCGCCTCGTTAAGGGCCGGGGTATATTCCTGGCCTGCGGAGTCGCAAAGAAAATTAACGAACCCGGAGATAAGTTTCTGTTCCCGCGATGTGACCGTGCCGCCCGTTGGCATCCAGTAGTCGAAACCAAGCTGAAGGAGTTTAAAAAATCGTTTATGAAAGGCGTAGTTGCGGACACGTTTAAAATCGGCGTGTATCCACTCACCGATTTTTACTGAGCGCAGGAAATCCCCACTCTCCGGCGTCGCCGGGAGCAGAAGCCCTGATGAGGTTTGCTTGACCAGTTGTAAATGCGCCATCGTTCTCTCCGTTGGCGCAGTAGATTGGGAGTTCAGCCCGCAGACGAGTATAACAAAGGATGATTATTCATGATAACCGGCCCTGATAGTCAGCTCATTAATCAGGGTATCGCTCCCCATGATGTCATTTTGCAACAACGGCAGAAACCGGACATAGCGGCCATCCCGATACATCAATGACCTGTTGCAGTCAGGAAAAAAATCCATTTCAGCAATTACTGTCATGTCATCACGGCGAATAACAGCATATTTACAAGTGAATGTTTTATTTAAATTTTTCACGGTGTCTCCATAGATAACGAACTTGAGCATTTTTAAAGCATCTTCATTCTCAACATGAATATATAGGAGACTATTAATTATCATCATCAATAAATATATCTATTTTTTGACCATATGCAATGACATTTTCTCTGTGTTCTATTTATAATCTTATAACTGGTTATTTTTTGACATGCTCATTTCCCGGACATTAAAAAAACCGCCGGCGCAGGTATTAAGTGCGGGTACATTGAGGTTGTCTGACACATCACAGGTGATGGAGATTCATCCCCCAAGGTCTCTTACTTAGCAATGAAGACAACTACCTCCTCTCTGTCTGGCCGGTTCGATCGCAGTCTCTCCTCGTTACTGGTGCAGTCACTGTGACAGTGATGCAGATGATAATCAGGACGATTAACATCGCTGCGGTTGACTTATCCGGCAAAATTATGCTGCCATGATGCCAGTTAACCATACTGGCATCATGGCCAACCGGCATCGAAAAGCATGTTGGCCAGACTCGCAGGCCATTGAATCACGCAACAACCAGTTACTGTCATCTGATGAAAAAGGCTGTGCATAACAGAATCAGAACTGACTGGTATCAGGGCCATGTTCTTCAGCAGCAAATACATAAGATGAAGCAAGATATAAAGAATGAAGGGAAAATAGAGTATAAAAAACGTACAGAATTGTCTGAAGTAACTTCCCTGCAGCATTGACGCCGCAGGGAATCTATTTATGGTGTAACTATATTGAACCAGAACTCAAACTTGTCCATATAGCCCAGCATCTCATCCAGTTTCGCAGCATTACCGGTAACGTTGACTTCTCCTTTATCTTGAGCCTGCTTCAGAGTTTCTTCCTTCAGGATAATTTTATTCAGCGTGTCACGGTTCAGAGTAATCGTGGCATCAGCATCTTTCGCTTCAGCATTAGCCGTGTGGTTCAGCACGCCATTTTCCAGCTCAAGCTTGTACTTTCCGCCGTCGCTGCCAAGGTCAATATTAAATACCGCCCGGGCATTACCCGCTTTTTCACCGTTGATATGTACAGCCAGAAAGTCGAAGAACATTTCAGGGGTCATCGCCCGAACGGTATCCGGACTTGCTGTATTTGGCGTCGGACCTTTAACCACACCGTTACGCAGCTCCTGCGCACCGGTCAGGTAGAAGTTACGCCATGGACCAGATTCAGCCTGATACCCCAATTGCTCCAGCGCATCGGCTTCAAGGTTACGTGCATTCTGGTTATTTGGATCGGCAAACACGACCTTACTCACCACCTGAGCAACCCAACGGTAGTTCCCCTGGTCAAAGTCTGCTTTAGCTTTCTGAAGAATCGCATCGGCACCGCCCATGTATTCAACAAATTTCTTGGCCGCTTCTTCGGGTGGCAGCTCATCAAGGGTTGCCGGATTGCCATCGAACCAACCGAGATACAGCACATACGTTGCTTTTACGTCATGGCTGATGGAGCCGTAATAGCCGCGGTTGGCCCAGGTTTTTGCCAGGCTATCCGGTAGTTTAAAGTTGGCCGCTATTTCGTCGCGAGTCAGACCTTCATTGGCCATGCGCAGAGTCTGGTCATTGATATAACGATACAGGTCTCGCTGGCTTTTCAGCAGACCAACAACATTCTCGTTACCCCAGGTCGGCCAGTGGTGCTGGGCCATAATAATTTCAGCTTTGTCACCCCAACGCACTATAGCTTCGTTGATATATTTCGACCACGGCAACGGCTCACGAATTTTTGCGCCACGTAGCGAGTAAGTGTTATGCAGGGTGTGAGTGACGTCCTCTGCGGCTTCGATGAGTTTCTTCTCTTCGATGAACCACAGCATTTCCGAAGGGGCTTCCGAACCAGGGGCCAGCATAAAGTCGTAAGTCAGGCCATCAATCACTTCTTTCTGGCCGTCTTTATCGATGATATTAGTGGGCGCAATCAGTGTCACCGTCCCCGCAGAGGTGGTCGTCCCCAGTCCGGCGCCAACCTGGCCGGAGGCATCTGGTTTCAGGAGGTTGCCATACATATAGCTGGCACGGCGGCTCATCACGTTGCCGGCCATAATATTCTCGGCTACTGCTGCCTCCATAAAGCCAGCAGGCGCATACACTTTCACCTTGCCGGATTTCACGTCCGCTTCATCGACAACGCCACGCACACCGCCATAGTGGTCAACATGGCTATGAGTATAAATGATGGCGACAACAGGCTTATTGCCACGGTTTTTGAAATACAAATCCATACCGGCTTTGGCTGTTTCCGCAGAAACCAGCGGATCGACAACCGTAATCCCCTCTTTACCTTCGATAATCGTCATGTTGGATAAATCAAGGTTACGAATCTGGTAGACGCCGTCTGTGACTTCAAACAAGCCACTGATATTGATTAGCTGGGACTGACGCCACAGACTAGGGTTAACAGTGTCAGGAGATTTTTCCCCTTCTTTTATGAAAGCGTACTGCTGTGGATTCCAGATGACATTCCCTTGCTCTCCCTTAATCACCTCTTCAGGTAAACCAGCGATAAAGCCTTTATGGGCATTCGTGAAATCGGTGTTATCAGAGAAAGGAAGTTGGTTATAAAGCGCATCGTTAGCTTGCTTGGTTGAAGCAGTGGCACCTTTTGGGGCTTCCTGTGCAAATAAAGGTGTCAGCGCAGTGGAAGAGAGTAGCCCCGCCAGCGCAAAACTTTTAACGATCAACTTAAGTCTCATTTGTACCCCTCATGTAAAAATATTCTGTATCACTCAGTCTGGTAGATTAATTATCTGTTAATTCAAACAATTAAAGTTATTGCTGACCATTTTCTCTCTTTTAAATATAACCAAAACGTTACATTTCGCTATTTATGGATACAAATAAATCGTGTTTTACGTCAGCCAGTTCCATCCTCTTTTAGTAAGTGGGGTAAGCTCGCTTCCCGTTTCCGGGACCCTGCCTGACTGAAGAGCAGGCTGACAGGATACGCGCCGCGCATTGGGCAGGATTTACAGGACGGCGCGCCGGAGATGTAAAAGTTTTACCGGGCGGCGGTCAGCAGTCCTTTAAATTTTACCCTGATCATTGATGTTCAACCCTGACCGACCGCCACACCGTATAGTTGGCGGCGGTCATGAAGTAAAGAGACATGACTATGAGCTTTGTGAGACTTGAAACCTGGGGTGAATTAAATTATCCCGATGATCCACCACCTCTCACAACACTAAGACGATGGGCGCGAAACGGAAATATTTACCCGACTCCAGTATTACATGGCAGGACGTATCGGGTTGATCCGGACGCGTTTTATATCAAGCCGAATAAAGTGGGACTTGTGCTTGAACAGCACCACCCAAACGGGCGCACCGGAAAACCGAGTGCATTGCTGGAGAAGTTGATCAGTGAGTCGAAAAAAGTACGATGCTAACCTTCCGAGGAACCTCACCTACCGTAAGGCCAGTAAATCTTTTTTCTGGCGTAACCCGCTAACTGACAAGGAATTTCCGCTCGGTCAGATCGCCCGCAGGGACGCTATCACACAGGCCATAGAGGCAAACAACTTCATAGCGCAAAACCACACACCAGTGGCGCTTATTGAAAAGCTAAAAGGAACTGACTCATTCACTGTGTCCGCATGGATTGATCGCTATGAGGTTTTATTACAGCGCCGGAGTCTGTCGGTTAATACCTACAAGATTCGCGGTAATCAATTAGCGACCGTACGCGAAAAAATGGGGGAAATAATACTGGCAGAAGTAACAACCAGGCACATTGCCAAGTTTCTTGAGTCGTGGATAACCGAGGGAAAAAACACTATGGCGGGAGCAATGAGATCAGTTCTATCTGACATGTTCAGAGAGGCTATTGTCGAAGGGCATATTGTGAAAAACCCGGTGGAAGCAACCCGGATACCAGAGATTAAGGTGGCCAGGGAACGCCTGCAACTGGAAACGTATAACGCCACACGAGCGGCAGCAGAGCATATGCCTGCATGGTTCCCTCTCGCGATGGATTTAGCGCTCGTTACTGGTCAACGTAGGGAGGATATCGTAAATATGAAATTTAGTGATGTTTTTGACAACCGCTTATACGTAACTCAGATTAAAACCGGAATGAAAATAGCCATTCCCCTCTCCCTGACACTTCGGGCGACGGGGTTACGTCTGGGAACGGTAATCGATCGCTGCCGACTGGTAAGCCGCACTGATTTCATGATCAGTGCCGGAATCAGGAAAAATAGCCCAACCGGGAATATTCATCCGGATGGATTGACAAAGACATTTGTAAAAGCAAGAAAAGCCTCCGGTGTTAACTTCAGCAATAATCCACCGACATTTCACGAGATCCGAAGTCTGGCCGGGCGGCTGTACAAAAACGAGCACGGCGAGGTGTTCGCCCAAAAACTCCTGGGCCACACATCAGCGAACACCACGAAACTCTATCTCGATGAGCGTGATGATAAAGCTTATATGATGCTCTAA
Protein sequences of DBSCAN-SWA_6 >NZ_CP043433|2676993:2693108|2683207_2683660_-|WP_000984586.1|DBSCAN-SWA MMRISEKGITLIKEFEGCSLTAYPDPGTGGDPWTIGYGWTHSVDGKPVKPGMMIDEATAERLLNTGLVGYENDVSRLVKVKLTQGQFDALVSFAYNLGARTLSSSTLLRKLNAGDYAGAADEFLRWNKAGGKVLNGLTRRREAERALFLS >NZ_CP043433|2676993:2693108|2691775_2692054_+|WP_001575998.1|DBSCAN-SWA MTMSFVRLETWGELNYPDDPPPLTTLRRWARNGNIYPTPVLHGRTYRVDPDAFYIKPNKVGLVLEQHHPNGRTGKPSALLEKLISESKKVRC >NZ_CP043433|2676993:2693108|2680497_2681193_-|WP_001152416.1|tail|DBSCAN-SWA MQDIPQETLSETTKAEQSAKVDLWEFDLTAIGGERFFFCNEPNEKGEPLTWQGRQYEPYPIQVQDFEMNGKGASPRPNLVVANLFGLVTGMAEDLQSLVGASVVRHQVYSKFLDAVNFSNGNPGADPEQEAVARYNVEQLSELDSSTATIILASPAETDGSVVPGRTMLADSCPWDYRDENCGYDGPPVADEFDKPTSDPKKDKCSHCMKGCEMRNNLVNAGFFASINKLS >NZ_CP043433|2676993:2693108|2683643_2683973_-|WP_001574216.1|holin|DBSCAN-SWA MNDKTPEFWAAVLTGLKNAWPQILGALMAGLIAYGRLIYDGATRKNKWLEGVLCGALSLCVTSALDVVGLPVSISPFVGGIIGFVGVDKLREIAISALKKRAGVNDENQ >NZ_CP043433|2676993:2693108|2687782_2688382_-|WP_000940753.1|DBSCAN-SWA MAHLQLVKQTSSGLLLPATPESGDFLRSVKIGEWIHADFKRVRNYAFHKRFFKLLQLGFDYWMPTGGTVTSREQKLISGFVNFLCDSAGQEYTPALNEAAEQYLHNVATLRTGDVALLKSFDAFREWVTVQAGFYTEHFYPDGSRGRRAKSIAFASMDETEFQQVYKAVLNVLWNWILFRKFSSLEEVENVAAHLLEFA >NZ_CP043433|2676993:2693108|2689382_2691362_-|WP_001237395.1|DBSCAN-SWA MRLKLIVKSFALAGLLSSTALTPLFAQEAPKGATASTKQANDALYNQLPFSDNTDFTNAHKGFIAGLPEEVIKGEQGNVIWNPQQYAFIKEGEKSPDTVNPSLWRQSQLINISGLFEVTDGVYQIRNLDLSNMTIIEGKEGITVVDPLVSAETAKAGMDLYFKNRGNKPVVAIIYTHSHVDHYGGVRGVVDEADVKSGKVKVYAPAGFMEAAVAENIMAGNVMSRRASYMYGNLLKPDASGQVGAGLGTTTSAGTVTLIAPTNIIDKDGQKEVIDGLTYDFMLAPGSEAPSEMLWFIEEKKLIEAAEDVTHTLHNTYSLRGAKIREPLPWSKYINEAIVRWGDKAEIIMAQHHWPTWGNENVVGLLKSQRDLYRYINDQTLRMANEGLTRDEIAANFKLPDSLAKTWANRGYYGSISHDVKATYVLYLGWFDGNPATLDELPPEEAAKKFVEYMGGADAILQKAKADFDQGNYRWVAQVVSKVVFADPNNQNARNLEADALEQLGYQAESGPWRNFYLTGAQELRNGVVKGPTPNTASPDTVRAMTPEMFFDFLAVHINGEKAGNARAVFNIDLGSDGGKYKLELENGVLNHTANAEAKDADATITLNRDTLNKIILKEETLKQAQDKGEVNVTGNAAKLDEMLGYMDKFEFWFNIVTP >NZ_CP043433|2676993:2693108|2686118_2686643_-|WP_001574213.1|DBSCAN-SWA MFYRDLFQVFGPDPLYKEEEGIAILREQYGIEAPEQIFKQIYCGLSNNSEFQTLYGHLNLKSLKWDLVRLKTAEFTKFGRNATYPDYMLEISEDFNACGSKFCIDAREEVANHWLKFGTWAEPPMFIERSLIIPGESGLHLMEGHTRLGTLLGAIKYKFVQLADTHELYIASQK >NZ_CP043433|2676993:2693108|2686739_2687429_-|WP_001097218.1|DBSCAN-SWA MNTQYLEYVRQQLIVATADLSGATKGQLQAWLENAQLYTKNYPRKKQRIRDEVTGKMITLNNPPIAGKQSLAKGSAIPLVQPVEYSTSSWRRALLSLEEHNKAWLLWNYSENTCWEYQVTVTRWAWEKFSQQLEGKRVAKKTLARLRQLIWLAAQDVKAELARRETYEYQTLAELMGVAKSTWTETYMSHWLVMRNSFKRLDSDALISVTRSRSQQKATNLDISLAKPN >NZ_CP043433|2676993:2693108|2688445_2688751_-|WP_000972675.1|DBSCAN-SWA MMIINSLLYIHVENEDALKMLKFVIYGDTVKNLNKTFTCKYAVIRRDDMTVIAEMDFFPDCNRSLMYRDGRYVRFLPLLQNDIMGSDTLINELTIRAGYHE >NZ_CP043433|2676993:2693108|2682710_2683190_-|WP_001541990.1|lysis|DBSCAN-SWA MFVGLLLVSLIVAGRLANHYRNNAITYKEQRDTVTHRLTLANATITDMTKRQRDVAALDEKYTKELADAKAENDALRDDVAAGRRRLYVNATCPAVPTGKSTSTARMDNAASPRLADSAQRDYFALKERVKTMQKQLEGAQAYIRTQCHGNAGKTSNQW >NZ_CP043433|2676993:2693108|2676993_2677857_-|WP_072100756.1|DBSCAN-SWA MDLTEDNASKLQQFSKEWQDANDKWSATWGVKIEQTKDGKYYVAGLGLSMEDTPDGKISQFLVAADRIAYINPANGNETPGFVMQGDQIIMNEAFLKYLSAPTITSGGNPPAFSLTPDGKLTAKNADISGHINAVSGSFTGEINATSGKFSGVIEAREFVGDICGSKVMQGVSIRATNDERSTSTRYTDSATYQIGKTITVMANCERNGGTGAITVTININGQVKTAEVMPYTAGIPAMYQTVVFSVYTTSPVVDISVSLRVRGQYTTSASVWPLVMVSRSGSNFTN >NZ_CP043433|2676993:2693108|2689173_2689365_+|WP_001676972.1|DBSCAN-SWA MNHATTSYCHLMKKAVHNRIRTDWYQGHVLQQQIHKMKQDIKNEGKIEYKKRTELSEVTSLQH >NZ_CP043433|2676993:2693108|2692028_2693108_+|WP_000087636.1|integrase|DBSCAN-SWA MSRKKYDANLPRNLTYRKASKSFFWRNPLTDKEFPLGQIARRDAITQAIEANNFIAQNHTPVALIEKLKGTDSFTVSAWIDRYEVLLQRRSLSVNTYKIRGNQLATVREKMGEIILAEVTTRHIAKFLESWITEGKNTMAGAMRSVLSDMFREAIVEGHIVKNPVEATRIPEIKVARERLQLETYNATRAAAEHMPAWFPLAMDLALVTGQRREDIVNMKFSDVFDNRLYVTQIKTGMKIAIPLSLTLRATGLRLGTVIDRCRLVSRTDFMISAGIRKNSPTGNIHPDGLTKTFVKARKASGVNFSNNPPTFHEIRSLAGRLYKNEHGEVFAQKLLGHTSANTTKLYLDERDDKAYMML >NZ_CP043433|2676993:2693108|2685295_2685745_+|WP_000798708.1|DBSCAN-SWA MKNTALGKFIFIVGTALLLGGCSGMVMPPYATHGTSVGIIAPAGGYSEWHTDSRNHTTGDSHSQSQGNCTQSEDSQLSENSLTRTHQSNCNTRSQTHSSSTSKTRSSSVGFSVGGPVGASIGLIKQMESMNRAPANDMSSNEMFKNFGF >NZ_CP043433|2676993:2693108|2687558_2687786_-|WP_000784710.1|DBSCAN-SWA MKMTWFQHPVCTTEEADELVAGYRRRGVKVERYGEAEVLELESNNTPQRWTVEELKEIRIAALADLRALKKLEAA >NZ_CP043433|2676993:2693108|2684248_2684935_+|WP_001574215.1|DBSCAN-SWA MPTGIKPIFINNMMSTYGLSHPHDSKVFPDLPEHQDNPSQLRLQHDGLATDDKARLEPMCLAEYLISGPGGMDPDIEIDDDTYDECREVLSRILEDAYTQSGTFRRLMNYAYDQELRDVEQRWLLGAGENFGTTVTDEDLESSEGRKVIALNLDDTDDDSIPEYYESNDGPQQFDTTRSFIHEVVHALTHLQDKEDSNPRGPVVEYTNIILKEMGHTSPPRIAYEFSN >NZ_CP043433|2676993:2693108|2681282_2681816_+|WP_000877926.1|DBSCAN-SWA MKYTILSLVAGALISCSAMAENTLTVKMNDALSSGTGENIGEITVSETPYGLLFTPHLNGLTPGIHGFHVHTNPSCMPGMKDGKEVPALMAGGHLDPEKTGKHLGPYNDKGHLGDLPGLVVNADGTATYPLLAPRLKSLSELKGHSLMIHKGGDNYSDKPAPLGGGGARFACGVIEK |
17 | Salmonella_phage(30.77%) | tail,integrase,lysis,holin | attL 2673623:2673652|attR 2693244:2693273 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2865500 : 2906198
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP043433|2865500:2906198|DBSCAN-SWA TATGCTTCCGGTCACCTACAGATTAATACCTCAAAGCGGAGTATCCACATATGGATTAAATACCGCAGATACACCTGTTTTCCCCGATATTCCCGAACATGCACCAAACCCCTCACGGCTACGCCTTGCTCATGACAGCCTTGCCATAAACAGTGAATTCCGTCTGGAGCCAGAGTGTGTGGTGGAATACCTTATCTCAGGCGCGGGTGGAATAGACCCTGATACAGAAATTGATGACGACACTTATGACGAATGCTACGATGAACTATCCTCCGTACTTCAAAATGCGTATACCCAAAGCGAAACATTCCGCAGACTGATGAGTTACGCATATGAAAAAGAACTACATGATGTGGAGCAGCGCTGGCTACTGGGGGCAGGCGAAGCCTTTGAAACTACCGTGGCTCAGGAACACTTCAAACTTTCAGAAGGCAGGAAAGTTATTTGTCTCAATCTGGACGATTCTGATGATTCATATACCGAACATTATGAAAGTAACGAAGGAAGACAACTTTTTGACACAAAACGTTCATTTATTCATGAAGTTGTACATGCACTGACCCATCTTCAGGATAAAGAAGAAAATCATCCAAGAGGCCCTGTTGTCGAATATACCAACATTATTCTGAAAGAGATGGGGCATCCTTCACCTCCCAGAATGGTCTACATCTTCAATAAATAGACACATCAGGAAACGAAAAGAAACTAAAAACCCACGTAGTCCGTTTTTTCGGGAAATGTTCTAGCAGTATTTTCTAACTATATTCTAAGCACCCAAAAAACAAAGGGGTTACCTTTCGGTAACCCCTTGTTTAATCTGGCGGAAGCGCAGAGATTCGAACTCTGGAACCCTTTCGGGTCGCCGGTTTTCAAGACCGGTGCCTTCAACCGCTCGGCCACACTTCCGGAATGAGGCGCACTATAAACATCCCGGTGCGTCATGTAAAGACCGAATGTGTTCGTTTGCGTGAAAAACAGGCAAATTTTCGTTAATTGCCTGAAATAGCGGCACATTGACCATTTCTTCAACACAAAAACGGGCTTAATACGTGTTGATCGTCCCCACTGTTTTACCGCTGTGATTATCAATTTTTTATCGGTAACGCGTGGCTGGCGCAGCCGCCTGTTTTATGGCGTAGCGCCAGGCATGAGCAGACCGTTTTAACGACGGTAATCACCTGAAATCTCTAAAGGAAACTAAACGTAAACGAATGCGTAAAGATGGGTAAAACCACTGGGTGACAGTGACTGTTTTTATTATTCTCCTTTATTAATTACATCTGTCATAAGAGAGTGACTAATGGATCGTATCATTACATCATCGCGTGATCGTAGCTCGCTACTGAGTACGCACAAAGTACTGCGCAACACCTATTTTCTGTTGAGCCTGACGCTGGCTTTCTCCGCAATTACCGCGACAGCCAGTACGGTACTGATGCTGCCCTCCCCCGGCCTGATTCTGACGCTGGTCGGTATGTATGGGCTGATGTTCCTGACCTATAAAACCGCCAATAAGCCGGTCGGTATCCTGTCTGCTTTTGCGTTTACCGGCTTCCTCGGGTATATCCTGGGCCCGATTCTTAACGCCTATCTGTCAGCAGGCATGGGCGATGTCATTGGCCTGGCGCTGGGTGGTACCGCGTTAGTATTTTTCTGCTGCTCCGCTTACGTTCTGACCACCCGTAAGGATATGTCTTTCCTGGGCGGTATGTTAATGGCCGGGATCGTCGTCGTGCTGATTGGTATGGTGGCGAATATTTTCCTGCAACTGCCTGCATTGCACTTGGCGATTAGCGCGGTGTTTATCCTGATTTCCTCGGGCGCTATCCTGTATGAAACCAGTAACATCATTCACGGCGGTGAAACCAACTATATTCGTGCGACCGTCAGTCTGTACGTTTCGCTGTACAACATCTTTGTCAGCCTGCTCAGCATTCTGGGCTTCGCCAGCCGCGACTAATCGACATTCCCCTCTTCCAGCCCTGCTTATGCAGGGCTTTATTTTTTGCTACACTGCCGTCCGTGTTGATTACGCAGTGAAACGTTATGTTGATCTTTGAAGGTAAAGAAATCAGTACTGATAGCGAAGGCTATCTGAAAGAGACGACGCAGTGGAGTGAAACGCTGGCAGTCGCTATCGCCGCCAATGAAGGCATTGAGCTCTCTGCGGAACACTGGGAAGTCGTGCGCTTCGTGCGCGAATTTTACCTGGAGTTTAATACCTCTCCCGCCATCCGAATGCTGGTAAAAGCGATGGCGAATAAGTTCGGCGAAGAAAAAGGCAACAGCCGCTATCTGTATCGTCTGTTCCCGAAAGGCCCGGCAAAGCAAGCGACCAAAATCGCCGGTCTGCCCAAGCCGGTAAAATGTATCTAATACCGAATACTGAAGCCTGTTAACGTCTCGCGAGGGCTGTGCGGTTCAGTGAGGATTTTGTCCACACGAGCAGAGCGCGGCCCGCCCTCTTTCAGCCACTTGATGAGCTTTTCTACCTGCGCCGCGTCGCCACAGGCCACCACTTCTACGCTTCCGTCATCCATATTCTTCGCATAGCCGGTTAACCCCAGCCGCTGCGCCTCATGCTGCGTGGTATAGCGAAACCCGACGCCCTGAACGCGACCATAAACCCAGGCGATAATGCAGACGTTCGACATGATGGTTCTCCTTTCATAATCAACGGTGAAGACGCATACAGGACATACTTAACGTACCTGACTAGGCAGACACCCATAATGTTGGCGGAATCTGGCCGTAAATCGCGACCCCGACAGGTAGCCGCACCGTAGCGCTATTTCGCCGATCGGCAGCGTCGTCGATTGCAGTTGAGAAAGCGCGCAGGACATGCGTACCTCTTCAATAATTTGCCGATAACTCTGCGATTCGCGCTGCAAACGCCGACGCAGCGTCGATGTTCCCATAGCAAGACGGCGGGCTATCTCCTGGGCTGTCCATAGCTTAGCAGGTGAGAGCATAATCAGTTGCCGCACCTGCTCCGTGAGGGTGTAACGCCGTTCAATAAGCAGCGGTCCCGCCGCGCCATCATGCAAAAGCGCTAACAGTAACCCCATCGCCTGATGTTCCTGTAGCCCGACAGGTAAGCCCTGGCGCACAGCATCCAACACGTTCTCCCACATAAATGTCAGGCTACGGCTCATCGGGGTACACAGCGATGTCAGGTTTGCCGGTGGATAATCCTGAACGTACATCGTTTTAAAGCGAGCAATAATCTCCGGTGAAAGTAACAACAGGTCGGAGCGAAAACCGTTCTGCGCAGGCTGATTAATAATCTCCAGCGGCGTGTTCGCCGGAATAATAATCAACTCGCCCGGCCCGGCGACAAGGCGGCTATCATCCTGAATGATGACTTTACTGCCCTGCGTAATATGGCAGATAGCGGCGGAAAAAAGCGTAACACGATGCAAACGGTGCAGATGACTGGAACGCACCTCTGCCGTCGTAAGATCCTGATGTCGAATATGCATCTCCTGCACCGTGCCGTCCTCTGTTTTATCTTCCGTATGTGGCCGTGAATTTTGCTTTCGCAATCACATGCGCGTTAAGCATAAACCCTACCAACGCGCCACTCGACTCACTGTCGAGCGGCAGTGTTGCCGTATTCAGCGCCCACACAGTAAACTGATAGCGATGAGGCTTATCGCCCGGCGGCGGACAGGCGCCGCCAAAACCAGCATAACCGAAATCATTCCGTCCCTGCACCACTCCAGCGGGTAACGTTTTTTTATCAGCCCCCGTCGGCAGGTCGTGAATCTGCGCAGGAATATTCACCATCGTCCAGTGCCACCAGCCGCTTCCGGTGGGCGCATCAGGATCGAAGACGGTAATAGCGTAGCTTTTGGTTCCGGCAGGCGGATTGCGCCAGCTCAGCTGCGGCGAAATATTTTCTCCGCTACAACCAAATCCTTTAAAGACGTGCTGTTGCGTTAAACGAAAATCCGCCGGAATATCCGCGCTACTGAGGCTAAACGCCGACGCGGCAGATACGCCGCCACTCAATGCCATCATTGCGGCTGCGGTCAAAGAGAAGACTGTTTTCATTGTATTTCCTTCTTCATGTAAAGAAATAACACTATAAAAAGCGTCCGCTGAAGAAACTGTGCCTTTACGATTCTATTTTGTGCAGAAACGCTCACCACCATACGCTGGTGATTGCCCCTCTCATCTGTGCTACCCTACGCGCCATTTCGTTTTCACGGTCCCAGGCTGGAACTATCATCCGGCCCGCGTCACTGTTTGTGAGAGGTTACTCCGCATGACTGAATCTACATTTCCGCAATATCCCCGTTTAGTCCTCAGCAAAGGGCGAGAAAAATCTTTACTCCGCCGCCACCCATGGGTCTTTTCCGGCGCCGTATCCCGTCTGGAAGGCAAAGCCAACCTCGGTGAAACTATCGATATCGTCGACCATCAGGGAAAGTGGTTAGCACGCGGCGCCTGGTCACCAGCCTCCCAGATCCGCGCGCGCGTCTGGACATTTGATAAAGCAGAATCCATTGATATTGCGTTTTTCACCCGCCGCCTGCGCCAGGCGCAGCAGTGGCGCGACTGGCTGGCGAAAAAAGACGGCCTGGATAGCTATCGTCTGATCGCCGGTGAGTCCGATGGCCTGCCTGGCGTCACTATCGATCGTTTTGGTCATTTTTTGGTGCTGCAACTGCTCAGCGCCGGGGCCGAATATCAACGCGCCGCATTAATTAGTGCGCTGCAAACATGCGATCCGGATTGCGCTATTTACGATCGCAGCGACGTCGCCGTGCGGAAAAAAGAAGGGATGGCGCTGACGCAAGGTCCGGTCACTGGCGAACTGCCGCCTGCGCTTTTGCCAATTGAAGAACACGGTATGAAATTGCTGGTCGATATCCAGGGCGGCCACAAAACCGGTTATTATCTTGATCAGCGCGACAGTCGTCTGGCGACGCGCCGCTACGTGGAAAATCAGCGGGTACTGAACTGCTTCTCTTATACCGGCGGTTTTGCCGTGTCGGCGTTAATGGGCGGTTGTCGCCAGGTTGTCAGCGTGGATACCTCACAGGATGCGCTGGATATCGCCAGGCAAAACGTTGAACTGAACCAACTGGACTTGAGCAAAGCCGAATTCGTGCGCGACGACGTGTTTAAGTTGCTGCGCGCTTACCGTGAACACGGCGAAAAATTCGACGTCATCATCATGGACCCGCCCAAATTCGTTGAAAATAAAAGCCAGTTAATGGGCGCCTGCCGGGGCTATAAAGACATTAACATGTTAGCGATTCAACTGCTCAATCCGGGCGGCATACTGCTGACATTCTCTTGCTCCGGACTGATGACCAGCGATTTATTTCAGAAAATCATTGCCGATGCCGCAATAGATGCCGGTCGTGATGTACAATTTATAGAGCAGTTCCGTCAGGCCGCCGATCACCCGGTGATCGCCACCTACCCGGAAGGGCTGTATCTGAAAGGGTTTGCCTGTCGCGTCATGTAACTTGAAAAGTGGAATAGTATCCTCATATAAAGGGTATCTATTTCCCGGGAGGTGACTATGATAGCCAGCAAATTCGGTATCGGCCAACAGGTCCGCCATTCCCTGTTAGGTTACCTCGGAGTGGTCGTCGATATCGACCCGGAATATTCGCTTGATGAGCCGTCGCCTGATGAACTGGCGGTTAACGACGAACTTCGCGCCGCTCCGTGGTACCACGTGGTAATGGAAGATGATGATGGTCAGCCAGTGCATACTTATCTGGCCGAGGCCCAGTTGCGAAGCGAAATGCGGGACGAGCATCCAGAACAGCCATCGATGGATGAACTGGCGCGTACCATTCGCAAGCAGCTTCAGGCGCCGCGACTACGTAACTGATTGTATGTAAAAGGCCGGAGAGCGATATCCGGCCATTTAAACTTTTATTTCGCCAGCCCCAGGCGAGGGATCTCAATCGCCGGGCAGCGATCCATCACCACGGTCATGCCTGCATCCCGCGCCAGCACCGCCGCCTGTTCGTTGATCACGCCAAGCTGTAGCCATAGCGTTTTTGCGCCGATGGCGATAGCTTCCTGCGCGACGCCCCATGCAGCTTCAGAGTTGCGAAAAACATCCACCATATCGACTTTTTCGGGAATGTCCGCCAGCGTGTCATACCCCTGTTGCCCCAGCAATGTCTTACCCGCGACTTTTGGCGCAACCGGAATCACATGGTAGCCCTGTTCAAGCAGGTATTTCATTACCCGATAACTTGGACGATCGGGTTTATCGCTCGCACCTACCAGCGCGATAGTGCGAGTGGACGTCAAAACATCAGCAATATCGGTCTCTTTCATCATCTTTCTCCTGGCTGTTTTGCAAAGTGTACGACAAACCTGACATAGCAACCATTCATCCCATACCGATGTATGAGAGCAAAACGCCAAAATATGTCTATATGTTAGTAAACATGATCTAAGAAAAATTTCGATACATTATTAGCCAGACGGCCTTTGGACAGGAGAACCTTATGAAAACCGGCGCGCTAGCCACCTTCCTTGCTCTCTGTTTGCCGGTGACTGTTTTTGCCACAACGCTCCGTTTGTCTAATGAGGTTGATCTGCTGGTGCTGGACGGCAAAAAGGTGTCCAGCTCTTTATTACGCGGCGCAGAGAGCATTGAACTGGAAAACGGGCCGCATCAACTCGTTTTCCGGGTGGAAAAAACCATCCGCCTGCCCGGTAATGAAGAGCGGCTTTATATTTCTCCGCCGCTGGTGATAAGTTTCGATACCCAACTGATAAGTCAGGTCAATTTCCAGCTCCCGCGGCTGGAAAACGAGCGCGAAGCCTCGCATTTTAACGCGGCGCCGCGTCTGGCGCTTTTGGACGGCGACGCCATGCCCATTCCGGTAAAACTCGATATTCTGGCCATTACTTCTACGGCTAAAGTGGTCGATTATGAAATAGAGACCGAGCGTTATAATAAATCGGCGAAACGCGCCTCGCTTCCTCAGTTCGCCACGATGATGGCAGACGACAGCACATTACTTTCCGACGTCTCAGAGCTTGATACCGTACCGCCGCAATCACAAACTCTGACAGAACAGCGGCTGAAATATTGGTTCCGACTGGCCGACCCGCAGACACGCCATCATTTTCTGCAATGGGCGGAAAAACAGCCGCCCTCTTGATATGAGTTGTCCCAGCGCGCAATTTTTTTCTCTTTCTCTTGCAGCATCAAGCGCTTCCAGTAATCTGTAGACAGGTTAACTACGGAATCGAATTATGGAACTGACGACTCGCACCTTGCCGACGCGCAAACATATCGCGTTGGTTGCTCACGACCACTGCAAACAGATGTTAATGAACTGGGTGGAACGCCATCAGCCGTTGCTGGAAAAACACGTTCTTTATGCAACCGGCACCACGGGGAATCTGATCCAGCGCGCAACCGGTATGGACGTTAATGCGATGCTGAGCGGCCCGATGGGCGGCGACCAGCAGGTTGGCGCACTCATTTCAGAAGGGAAAATCGACGTGTTGATTTTCTTCTGGGACCCGCTTAACGCGGTACCGCACGACCCCGATGTCAAAGCGCTACTGCGTCTGGCCACCGTATGGAATATCCCTGTCGCCACTAACGTCTCAACGGCGGACTTTATCATCCAGTCCCCGCATTTTAATGACGCTGTAGATATTCTTATTCCGGATTATGCGCGTTATCTGGCCGAGCGCCTGAAATAACCGCTACGCGGGCGGGATGCTGTACCGCCCGCGCTTTAGGGCTTCCTCGCCACCGGCACATCCAGCTGTTTAAGCGCCTCCACAAAGCGCGAGGGATTATCTTTATTAAAAAGCAGCCAGACCCGCGCACGCGCGCGCGTCAACGCCACATATAACAAACGCCGCTCTTCAGCGTCCGGAAAATCCTCAACCTGAGGAAGAAGCGCACTTTCCATAATGGATTCTCGCGCCGGGGCGGGGAAACCGTCGTTACCCTCCTGCAATCCGACAAGAATCACGTAATCGGCCTGTTGGCCTTTGCTGGCATGGATAGTCATGAAGTCTATCTGTAACTTCGGCCAGCGAGTCGCCGCTTTTTGTAGACTCGCGGGTTTCAGATGGTGATAGCGCGCCAGCACCAGGATACGTTCATCTTCCTTCGCATAGCCGGATAATTTATCCAGTAACGCGTCCAACTGGCTTTCATCCAATAACGTCACCGCTTTTTTATCGCCTGGGGTCAGACTATTTAACGGTTTTTTAAGCTGGTGCGGATTCTGCTGTACAAAGCGATTGGCAATATCGCCAATCCGACTGTTAAAGCGGTACGTGGTGTCCAGGTGGCAATGCTCGCCCTCGCCGAACGTCTGATGAAACGCCGTCGTTAAGGAGAGCTGCGCCCCGCTAAAACGGTAAATCGCCTGCCAGTCATCGCCAACGGCAAACAGCGTAGTCTGGCTATTCTGCTTGCGCAGCGCCTCTAACAGCGCCGCCCGTTGCGGGGAAATATCCTGAAATTCATCGACCAGAATATGCTTCCACGGGCTGATAAAACGCCCTTTTTCGAGGATCACCATTGCCTGATGGATCAGCCCGGAGAAATCAACGGCATTTTCCGCTTTCAACGCGCTTTTCCAGGCCTTTAGCAGCGGCGCCATCAGCTTAATGCGTTTGCCAAACAGCTCGCGGCACTCCTCTGGCGCGCCAGCGATCATTTCTGCCTGCGCGCCGCCGTGCATACGCATCAAACTGACCCAACGATCCAGGCGAGGGGCCAGACGCCGCTGCAATGTCTCATCGTCCCAGAAATTACCTTCCGGCACTACCCACTGCATCTCCTCTTCCAGCCACTGACGCCAGCCTTTGGCCTGCGCTTTTTTCTCGCTACACTGCTGACGCCAGGTGCGCAGAAATAGCTGATGCCGCGCGGTGGCGTCACTTTCCAGCTTACTGACAACCGGCGCTTTTTTACTGCCTTGCTGAATAATATACAGCGCCAGCGAATGGAACGTACGGGCAGTAATCTCTTCCGTATGTAAGCGCTCGCGGATACGCTCATCCATCTCTTCTGCCGCCTTGCGACCAAACGCCAGTAAAAGAATTTGCCCCGCATCGGCTTGTCCGCGCGCCAACAGCCAACCCGCGCGCGCCACCAGCACCGAGGTCTTACCGCTTCCCGCGCCCGCCAGTACCAGTAACGATGATTCGCCGTTAACCACCGCCCTGGCCTGGGAAGGATTAAGCGGAGAGGATTCTATCTGCGTAAAAAAGTCGGCATGCGCTTCCAGCATGGCATCAGCATACGCCTGATTATGCTGCTGCCGACTTCCCTCGCTATCCTGTAACCATGCCAGACACTTACGCCATATCTCGCGACAGTGAGCAAACTCCTCCAGACGTGAGACCGGCAGCGGCAAGGCGGCAAACGTCTGGCGAATTTCGTGTTCCAGCCCCCGAACGCGTTCACGGGTAAGCCATTGGTTTCCGCCCGTTCGCTCACTAATACGCGCCCATTGCTCCTGTAATGCCTGCGCGGCGACATCACTCATCTCCTGGCTCCAGCGACGCCAGTGCGCGTCCAGATAGCGATGGAATTGCTGGGTTTCCGACCACTCGGTACCGTGCAAACGCACGACTTTATCCTCCGGCAGGACAAACTCCAGCTCGCCCCATACCAGCCCACGCTTACAGTGAATCGCCAATAACTGATTGAATGGAATAAGATATTCATGCCTGTCGCCAGACACTTTCACCCCAGCGTTGAGGATCTCGGCACGATCATACGGGTGTTGCGCCAGACGTTTTCCAAGAGAAGTTGCTTTCAGTTCCATGACTCAGCGAATCCAGACGTAAAGATGTGCTTATCAGTGTAACCGCCAGAGAAAACGTGCTCCAGTCTAAAAAAACGTTACAATTGCCCGCAGTGATAGAAAACCGGAATGAACTGAGGGTTTATGCGTACCGTTCTGAATATTTTAAACTTTGTACTGGGCGGCTTTGCCACTACGCTGGCCTGGCTGCTGGCGACGCTTGTCAGTATTGTGCTTATTTTTACCCTGCCGTTGACCCGCTCCTGCTGGGAGATAACCAAACTGTCCCTGTTCCCTTACGGTAATGAAGCCATTCACGTTGACGAACTTAATCCGGCGGCGAAAAGCGTATTAATGAATACTGGCGGTACCTTGCTGAATATTTTCTGGTTACTTTTTTTCGGCTGGTGGCTATGCCTGATGCACATTGCCTCCGGTATCGCTCAGTGTGTCACTATCATCGGGATACCTGTCGGTATTGCGAACTTTAAAATTGCCGCGATTGCGCTTTGGCCTGTCGGTCGCCGCGTCGTCTCTGTAGAAACCGCTCGCGCCGCGCGAGAAGCTAACGCGCGCCGCCGTTTTGAATGATCGGGACAAATAGCCTTTATGTTAAGTCCGTTGATTCGCCGTTATACCTGGAACAGTACCTGGCTGTATTACATCCGCATTTTTATCGCTCTGTGCGGCACCACCGCCCTGCCCTGGTGGCTGGGCGACGTCAAACTGACCATCCCGCTCACGCTCGGTATGGTTGCCGCGGCGCTAACCGATCTCGACGATCGCCTTGCCGGACGCTTGCGTAATTTAATCATTACCTTAATTTGCTTTTTTATCGCGTCGGCTTCTGTGGAGCTGCTCTTTCCCTGGCCGTGGCTATTTGCGTTGGGCTTAACGTTATCCACCAGCGGGTTTATTCTGTTGGGAGGACTGGGGCAACGCTATGCCACCATCGCGTTCGGCGCGTTACTCATTGCCATCTATACGATGCTGGGTACCTCTTTATACGATCACTGGTATCAGCAACCACTGCTCCTGCTGGCAGGCGCAGTATGGTATAACCTACTGACGCTAACCGGGCATCTGCTATTTCCGATCCGTCCGTTGCAGGATAATCTGGCACGCAGTTACGAACAGTTAGCGCACTATCTGGAACTGAAATCACGTCTGTTTGATCCTGATATTGAAGATGAAAGCCAGGCGCCGCTCTATGATTTAGCGTTAGCGAACGGGCAGTTAATGGCGACGCTGAACCAAACGAAAGTGTCGTTATTGAGTCGCCTGCGCGGCGATCGCGGTCAACGCGGTACGCGCCGCACCCTCCATTACTATTTTGTGGCGCAGGATATTCATGAACGCGCCAGTTCTTCGCATATTCAATACCAGACACTGCGCGATTATTTTCGCCATAGCGACGTCATGTTCCGCTTTCAGCGTCTGATGTCGATGCAGGCGCAGGCCTGTACGCAGCTGGCGCGCTGTATCTTACTGCGTACGCCGTACCAGCATGATCCGCGTTTTGAACGCGTCTTTACCCACATTGACGCCGCGCTTGAACGTATGCGCGCCAGCGGCGCTTCTTTAGAGCTGCTGAATACGCTTGGATTCTTATTAACCAACCTACGCGCCATTGATGCGCAACTGGCGACGATCGAGTCGGAGCAGGCCCAGGCAATGCCGCGCAATGAGTCAGAAAACCAGTTGGCTGATGATAGCCTGCACGGGTTTAGCGACATCTGGCTGCGTCTGAGCCGTAATTTTACCCCGGAGTCCGCTCTCTTTCGCCATGCGGTACGCATGTCGCTGGTATTGTGCATCGGTTATGCTCTCATCCAAATTACCGGGATGCGCCACGGGTACTGGATATTGCTCACCAGCCTGTTTGTTTGCCAGCCTAACTATAACGCGACCCGCCATCGCCTTGCGCTCAGGATTATCGGCACGTTGGTAGGCGTTGCTATCGGCCTGCCGATTTTATGGTTTGTTCCTTCGCTTGAAGGACAGTTAGTTCTGCTGGTGATTACCGGCGTGCTTTTCTTCGCATTCCGTAATGTGCAGTATGCCCATGCAACGATGTTCATCACCCTGCTGGTATTACTCTGCTTTAACCTCCTGGGCGAAGGCTTCGAGGTAGCGTTACCGCGCGTCGTCGACACGTTAATTGGCTGCGCTATCGCCTGGGCTGCGGTCAGCTTTATTTGGCCGGACTGGCGCTTTCGCAACCTTCCCCGGGTACTCCAGCGCGCCACCGATGCTAATTGCCGCTACCTTGATGCGATCCTTGAGCAATATCACCAGGGACGGGATAACCGCCTGGCCTATCGCATTGCCAGACGCGACGCGCACAACCGCGATGCAGAACTGGCATCCGTGGTTTCTAATATGTCGAGCGAACCCGACGTCACGGCTGAAACCCGGGAGGCGGCGTTCAGGCTGCTTTGCCTCAATCACACTTTTACCAGCTATATCTCCGCCCTCGGCGCGCACCGCGAAAAGCTCAGTAATCCGGACGTGTTGGGGCTTCTGGATGACGCGGTCTGCTACGTTGATGATGCGCTTCATCATCAACCTGAAGACGAACAGCGCGTACACCAGGCGCTGGAGGGTTTAAAGCAGCGAGTTCAGTCACTGGAAACACGTCCGGACAGCAAAGAACCTCTGGTTGTTCAACAAATTGGTTTGCTCATTGCCTTACTGCCCGAGATTGGACGCTTGCAACGGCAAATTTCACCGCCGACTTCTACATTAATTACCCAGCCGTAAGCGAATGAGCCCAGTCGGCAAGCTCCTGGCGACGACTCGCCGGGAGCGCCGCTTCATGTACGCCGACAATTGCGCCTTCCAGGGCATACAAAACTTTCACCGTCAGCAAGGGATTGCTTTGTCGCAACCTTAGCCAGCACATTTTCGCGCCCAGTATGCGTAACATATTTTCGTCCTTTATCCCTGACTCATTGAGCAATGTTTCCAGATGGAAGGTCATATTGGGAAGATCTTTGAGTCTGTGCTGTAAAATACGGCTGTGTTTTTCCTTCATTGCCGCGTCAAGAGAATACTTCGATAAACGTACCAGCTGCTGCTGATCGCGCCACAGGCTTTCATCCACCCGGTAATAGTTGAGCATAACGGGACGTCCACATTTCATAAACATGAGCCAGGCGGGGGGATGCTTCACACAGTAGGGTACGCTTTCTTCACAGGCGCGGAGATAAAGCTCGCCATTAGCCACCATCGCAAAGACGGTATCCTCCACGGTCAGACTATAGCTACCGAATAAAGATCGATACTGGATCGTCCCCAAAGAGGCCAAATATTCTTGCGATTTATAGATCCTGTCATAAGAGAGTGCTCTCATAAAATTCCTTTTAAATCATAAAGTAAAAGAATGATTTGCAGTAACGGATCCGTTAATGACGAAAATAGGCAACTTATACTCCGCGAGCAAGATGATTTTTATTTTTGACGCCACTAAGAATAAAAATTGCGAGACAGTTTCCGAAAATAGAGTTGATCTTTCATCGCCACAGGGGTACTGTATGAATATACAGTAACTCACAGGGCTGGATTGATTATGTACACTTCAGGTTATGCAAATCGTTCTTCGTCATTTCCTACCACTACCCACAACGCTGCGCGCACCGCTACGGAAAATGCCGCGGCAGGACTGGTCAGTGAAGTTGTCTACCACGAAGACCAGCCCATGATGGCGCAACTCCTGCTTTTGCCTTTACTCCGTCAGTTAGGCCAACAATCACGCTGGCAGCTCTGGCTCACGCCGCAGCAAAAGCTCAGCCGTGAATGGGTACAGTCTTCAGGTTTGCCATTAACGAAAGTGATGCAAATTAGCCAGCTTGCGCCTCGTCATACGCTGGAGTCGATGATCCGCGCTTTGCGTACAGGAAATTACAGCGTGGTAATTGGTTGGATGACTGAAGAACTGACAGAAGAAGAACATGCCAGCCTGGTTGAAGCAGCGAAGGTAGGTAATGCGGTAGGGTTTATCATGCGCCCTGTACGTGCGCACGCTTTATCCAGGAGACAGCATTCCGGGCTAAAAATTCACTCTAATTTGTATCATTAAGTAAAATTAGGATTTATCCTGGACTTTTTTTTACGCGAACGTATCTCCTTTGAGTGCTAACGTTTTTTTTGCGAGAACGCTTGTCAGAAGCGGTTTCCGCAATTTTTGCTGTACGATTTATCATCTGAAACTGTTAAATGATGTGTATATCCGTCATGTTTTTTTCACATGTCTGACGGAGTTCACACTTGTAAGTTTCCAACTACGTTGTAGACTTTACATCGCCAGGGGTGCTCAGCATAAGCCGTAGATATCGGTAGAGTAACTATTGAGCAGATCCCCCGGTGAAGGATTTAACCGTGTTATCTCGTTGGAGATATTCATGGCGTATTTTGGATGATAACGAGGCGCAAAAAATGAAAAAGACAGCTATCGCGATTGCAGTGGCACTGGCTGGTTTCGCTACCGTAGCGCAGGCCGCTCCGAAAGATAACACCTGGTACGCTGGTGCTAAACTGGGCTGGTCTCAGTACCATGACACCGGCTTCATTCACAATGATGGCCCGACTCATGAAAACCAACTGGGCGCAGGTGCTTTTGGTGGTTACCAGGTTAACCCGTATGTTGGCTTTGAAATGGGCTACGACTGGTTAGGCCGTATGCCGTACAAAGGCGACAACATCAATGGCGCTTATAAAGCTCAGGGCGTTCAGTTGACCGCTAAACTGGGTTATCCAATCACTGACGATCTGGACGTTTATACCCGTCTGGGTGGTATGGTATGGCGTGCAGACACCAAGTCTAACGTCCCTGGCGGCCCGTCTACTAAAGACCACGACACCGGCGTTTCCCCGGTATTCGCGGGCGGTATCGAGTATGCCATCACCCCTGAAATCGCAACCCGTCTGGAATACCAGTGGACTAACAACATCGGTGATGCCAACACCATCGGCACCCGTCCGGACAACGGCCTGCTGAGCGTAGGTGTTTCCTACCGTTTCGGCCAGCAAGAAGCTGCTCCGGTAGTAGCTCCGGCACCGGCTCCGGCTCCGGAAGTACAGACCAAGCACTTCACTCTGAAGTCTGACGTACTGTTCAACTTCAACAAATCTACCCTGAAGCCGGAAGGCCAGCAGGCTCTGGATCAGCTGTACAGCCAGCTGAGCAACCTGGATCCGAAAGACGGTTCCGTTGTCGTTCTGGGCTTCACTGACCGTATCGGTTCTGACGCTTACAACCAGGGTCTGTCCGAGAAACGTGCTCAGTCTGTTGTTGATTACCTGATCTCCAAAGGTATTCCGTCTGACAAAATCTCCGCACGTGGTATGGGCGAATCTAACCCGGTTACCGGCAACACCTGTGACAACGTGAAACCTCGCGCTGCCCTGATCGATTGCCTGGCTCCGGATCGTCGCGTAGAGATCGAAGTTAAAGGCGTTAAAGACGTGGTAACTCAGCCGCAGGCTTAAGTTTCCGTCTGATAAAAAACCCCGCGTCGCGGGGTTTTTTGCTCTGGTCTGGGTGACAACGCCTTTCAGCGTTACTTCTTGCCTAATAACGCCTGTAAATCCTGCTTTAACGTGGTCATTTGCGTGGCATATTTCTCTTTATGCTCCGCGTCTTCTATCAGTTGCACTATCGTTTCGGATAATGTTTTACCGCGTCGCTGCGCAAGGCCCGCCAGACGCTGCCAGACCATAAACTCTAAATCGATAGATTTTTTGCGCGTATGCTGATGTTCCGCATTGAAGTGTCGTTTGCGTCTTGCCCGAATGGTTTGTTTCATCCGATTAAGCAAGGCAGGATTCATATGCCTGTCTATCCAGACATTTACCCGCACCGGTTCATTTTCGAGGGCCAACAACAAATTGACGGCTTCCTGGGCAGCACTGGCTTCCACGTAGCGGGTGATCAACTCCCCTTCGCGGTGCTTTTTCACCAGATATTTCCATTTCCAACCGCTTTCAAGATTTTCAAGTTGTTGATATTTCATTGCGATCCCAACGTTACCGTGTAACTGTTATCAGAATATCAGTTTTTTAGCCATCTGAAGAAAAGAATCTCGAACCTGTGAATGGTTACACGTCTTCATATTACTTTGCATTCGCTTTCCGCGTGTGCAGCGTGACGGGTTAGGCTATAATCCCCCCTTTTACAACAGACTAAAAAACCTCAACTTTGACCATTACGAAACTTGCATGGCGTGATCTGGTTCCGGATAGCGAAAGCTATCAGGAGATATTTGCACAGCCACACGCGACTGACGAAAACGACACCTTACTCAGTGATACTCAGCCACGACTGCAATTTGCGCTTGAGCAACTTATACAGCCGTGGGCATCATCCTCTTTTATGCTGACTAAAGCGCCTGAAGAGCAAGAGTATCTCACTTTACTTTCAGATGCCGTCCGCGCTCTGCAAACCGATGCCGGACAATTAACCGGCGGACATTATGACGTTTCCGGGCATACTGTTCATTACCGCGCCGCGCAGAATGCGCAAGACAACTTTGCCACCGTCACACAAGTCGTCAGCGCGGACTGGGTCGAAGCCGAACAGCTCTTTGGTTGCCTGCGGCAGTATAACGGCGACATTATCCTGCAGCCGGGACTGGTTCATCAGGCGAACGGCGGCGTGCTGATTATTTCCTTACGAACCCTTCTGGCGCAGCCGTTACTGTGGATGCGTCTGAAAGCCATCGTTAGCCGCGAGCGTTTTGACTGGGTGGCCTTTGACGAGTCGCGTCCATTACCGGTCTCCGTGCCATCAATGCCGCTCAAACTGAAGGTGATTCTGGTTGGCGAACGTGAATCACTGGCTGATTTTCAGGAGATGGAACCGGAGCTCGCGGAACAGGCTATCTACAGTGAATTTGAAGACAATTTACAGATAGCGGACGCAGAAGCTATGACCCTGTGGTGTCAATGGGTGACGCGTATCGCTTTACGCGATAATTTGCCCCCTCCGGCACCGGACGCCTGGCCCGTCCTGATACGCGAGGCTGTGCGCTATACCGGCGAACAGGATACGCTGCCTCTTTGCCCACTGTGGATAGCCCGCCAGTTTAAGGAGGCGTCGCCTTTATGCGAAGGCGATACCTGCGGCGCAGAAGCGCTCAGCCTGATGCTTGCCCGACGCGAATGGCGAGAAGGCTTTCTGGCGGAGCGGATGCAGGATGAGATTCTGCAAGAGCAGATCCTGATTGAAACCGAAGGCGAACGCGTTGGACAAATCAATGCGCTTTCCGTCATTGAGTTTCCCGGGCATCCGCGCGCCTTTGGCGAACCGTCGCGAATTAGCTGTGTTGTGCATATCGGCGATGGCGAATTTAACGATATTGAGCGCAAGGCCGAACTTGGCGGGAATATCCACGCTAAGGGAATGATGATTATGCAGGCCTTCCTGATGTCGGAGTTGCAGCTGGAGCAACAAATTCCCTTCTCTGCCTCGTTAACCTTTGAGCAGTCCTACAGCGAAGTGGATGGCGATAGCGCCTCAATGGCGGAATTATGTGCGCTCATCAGCGCGCTGGCCAATGTGCCGGTGAATCAAAACATTGCGATTACCGGCTCGGTCGATCAGTTTGGTCGCGCGCAACCGGTGGGTGGGCTAAACGAAAAAATTGAAGGTTTCTTCGCCATCTGCGAGCAGCGGGAATTAAACGGTAAACAGGGCGTGATTATCCCTGCAGCCAATGTCCGCCATCTCAGTCTTAAATCTGAACTGCTGCAAGCGGTTAAAGAAGAGAAGTTCACTATCTGGGCGGTAGACGACGTGACCGACGCCTTACCGCTACTGTTAAATCTGGTGTGGGATGGCGAAGGTCAAACGACGTTGATGCAGACTATCCAGGAGCGTATCGCGCAGGCGACGCAACAGGAAGGCCGTCATCGTTTCCCGTGGCCATTACGTTGGCTGAACGCTTTTATTCCGAACTGATCGGACTTGTTCAGCGTACACGTGTTAGCTATCCTGCGTGCTTCAATAAAATAAGGTTTACATAAAACATGGTAGATAAACGCGAATCCTATACAAAAGAAGACCTTCTTGCCTCTGGTCGTGGTGAACTGTTTGGCGCTAAAGGGCCGCAACTCCCTGCGCCGAACATGCTGATGATGGACCGCGTCGTTAAGATGACCGAAACGGGCGGCAATTTCGACAAAGGCTATGTCGAAGCCGAGCTGGATATCAATCCGGATCTATGGTTCTTCGGATGCCACTTTATCGGCGATCCGGTGATGCCCGGTTGTCTGGGTCTGGATGCTATGTGGCAATTGGTGGGATTCTACCTGGGCTGGCTGGGCGGCGAAGGCAAAGGCCGCGCTCTGGGCGTGGGCGAAGTGAAATTTACCGGCCAAGTTCTGCCGACAGCCAGGAAAGTCACCTATCGTATTCATTTCAAACGTATCGTAAACCGTCGCCTGATCATGGGCCTGGCGGACGGTGAGGTTCTGGTGGATGGTCGTCTGATCTATACCGCACACGATTTGAAAGTCGGTTTGTTCCAGGATACTTCCGCGTTCTAAGTGTTGAATTGATATTCCGCGCTATCGGTATCATGCTTGCAAAGCCCATAAAGGCGAAACCTCCGCACTGCGGAGGTTTCTTTTTCTAAAGAGACAGAATCAGGCCATTACCGCCCTGTCCTCCATGGCTTGTCGCCAACCTCCCAACCAGTATGACCGTTGATTCAGCGTCTGATAGGGACACATTTCTTTTGATCGTCCGGCGATGCCGGCCTGATATCCACGTTGATGTGCCCGTTCCAGGCGATCTCGTTTTTGTCTCTTCATGCCTCGTTTCCCTCATATTTGTATCTGGTGGAAAAGAAAACAGTGATTACAAAATGTGCAATCACACTACACGAATACCTCTAAATGGATCGCCCGTCAATGCGCAAAATTCACACCAATGTCATATTTGTGAGCTATACGGTAAATCATTTGTACAAAAAATGTGAGCCATAACAACTATTTTTCCGGCGTCAAAAATAAAAAAACCGCCTCAGGCGCACGACCTGAAGCGGTTAAATTTACATTACTTTTATCTCAGGGAAGGCGCTTTATCTCATCTGCGATAGCGGCTGCTTCCTGACTCCATGCGCTTGCCAGTACTTTGACCATCTCATCATAGCCATCTTTCTGCTGACTGGCTTCGATATGGAAAGGCCGCTTAATCAGCTGTCCATTGTGGTTTAATAACCACTCCCCGCTGACGATAACCTTACCATCGTAGCGTCCATGGAATCCGGTTACGGTAACGTTTAACGTATCCTGGGTGGTTCCCAACGGCTGGGAAGCGACGACCCAGCCAGGAAGCCGGGCGCTAAGATTCGCCACCAGCGTGTTACGTAATTGCTGGTCCAGCGGGCTGGCCCACAGATTGTTATTGGCAATCACATACTGAACATCACTGGTTTGATACACCACGCCGTTACCTGCCAGATAGTCAGGTACGGAAACCTGCTCTACCCATAAGAGACGGTTGCCCTGGCTTGCGGTGCTTTGCACACCGCTTTGCGCTATCGGGAGCTGATAATAGCTTTTATTCTCTCCGCCGGAGCTACATGACGCCAGCCAGAACGCCATTATCACGACTAGCCATTTTTTCATTGTTTCGCCCTCTTAGGCTCAGGATCTTTTTTATCCTTCGCTTCAAATACCAGCGCGTTGCTCTTCTCGTTCAGAGTTTTCAACACCGGTTGTAACTCACGAAGCACCTGATCGAGACGCTGCATATCCGCCACCATTTTGTTATACGCCGCCGATCCAGGCTGGAAGCCCTGCATACTGCGGTTAAGTTCGCGTAACGTCGTTTGCATATCCGCCGGAAGCTGCTGCATCGACTGACTGGAGGTAATCTTGTTCATATTATCCAGCGTGGTTTGCAGCCGACGCATAGTACGCTGGCTTTCAGACAGCGTATTGGTCGCTTGTTCAATCATCGGATTCAGCGGCAGGTTGTTGATCTTATCCAACGTTTCCACCAGTCGCTGTTGAATTTGCGCCAGGCCGCTGCTGACGGTGGGAATAATTTCATAACCATCAAATTCGCGTAGCCCGGTAATCGGCGGCTCCTTAGGATAAAAGTCCAGATCGACATACAGCGCCCCGGTCACCAGGTTACCGGTTTTAAGCGAAGCGCGTAAACCGCGCTTAAGCAGTTCCGTCAAATGCGCGCCAACATCCGCATTTTCTCCCAATTGCGCTTTCAGACGCTCCGGTTCAATGCGCACCAGCACAGGAATACGGTAATCGTCGTTAAATACCTGGCGCATTTTAGACGCAAAGAAAGGCACTTTGCTTACCGTCCCCAGACGAATACCGCGGAACTCCAGCGGCGCGCCGGGTTGTAATCCGCGCACCGAATCTTTAAAGAACATCAGGTAATCGATATGATCGGTATACAGCGAATCCTGAATACTTTTTTGATCGTCGTAGAGATTAAACGCCGTTTTTTCGGCGACGGGTTGCCCCTGCTCAAGTCCTTCCGGCACATCAAAACTCACGCCGCCGCCAAACAACGTCGTCAACGATCCCATTTCCACGCGCATTCCCGCCGATGTCAGATCCACAGCGATACCGCTATCTTTCCAGAATCGGACATTACTGGTGACCAACCGATCGTTTGGCGCCTTAATGAACAACTGATAACTCATTGTCCGTTTTTGCGGATCGAAAGAGCTGGTTTCAACGGACCCTACCCGATAGCCCCGGAACAGAACGGGATCGCCAGGACTGAGCTGACCGGCCTTTTTGCTGTCCAGAATCACACGAATACCTTTGGCATCGGGCGGCGCCAACGGCGGTGAGTCAAGAAGCTGATAGCTTTCCGGCTGACTGCCCTTACTTCCTGGTTGTAGTTCAATATACGCACCTGATAGCAGCGTCCCAAGCCCGCTGATGCCTTCACGCCCCACCTGCGGTTTTACCACCCAAAATACCGAGTCTTTATGCAGCAACTTTTCCATACCGGAATGGAGCCGCGCTTTGATTTGTACGTGGGTCAGATCGTCAGTCAGCGTCGCGCTTTCAACCACGCCAACATCCACGCTGCGACTTTTGATCGTCGTTTTTCCGCCTTCAATGCCTTCCGCATTGGTGGTGATTAAGGTAACTTCCGGTCCCTGGTGGCTGTAGTGATAAAACAGAATCCAGGCTCCGATAAGCGCAGTAACAATCGGGAATATCCAGACAGGCGACCAATTTTTCACCTTTTGTACTTTGGCTTCCCCTTTTTTAGGTTCCATGCTATCAGGACTCCTCATGGCCTGGTTCGTACTCACGATCCCACGACAGACGCGGATCAAAGGTCATCGCAGAAAACATTGTCATTATGACGACTAAAGCAAACATCAACGCACCCATCGCAGGATAAATATTCATTAACCCTCCCATACGCACCAGCGCAGAGAGTACGGCAATGACAAAGACATCAATCATTGACCAGCGCCCCACAAACTCCACTACTTCATAAATAAAATGCATCCGTTCACTGTCGCGCTTACCGTGGCCTTTCGCATCCCAACAAAGCCAGGCAATGGCGATCATTTTTAGCGTCGGCACCATAATACTGGCGAGAAAGATAACCGCCGCCACCGGATAAGACCCCTCGCTCCACAGCAAAATCACGCCAGCCAGAATGGTCGACGGCATTTTCGAGCCCAGTAAGTCGGTAATCATAATCGGCAAAATATTGGCGGGCAGATAAAGCATGATCGAGGTAAATAATAACGCCAGCGTCCACTGCAAACTGTTTTTACGCCGGACATAGCCTTTCGTATGGCAACGCGGACAGACGAGACTCTCCGCAGGCAGGATCGCCGTACAGCAGGCGCAAGAACGCAAAGACTGCCGGATACCGGTAATCCCCGGCGTTAACGGCTGCGCCAGTGCAGGCTGCGGGGCGATATCATCCCACAACCAGCGGCGATCGACACACTGGAACGCCCGCAGTTGAACGAGACAAAATAAGCACCAGGGAATAAAGCTGCTGCCGATGCCGATATCGCCGTAGGCCATCAGCTTAACAAAACTGACCAACACCCCGGCGAGAAATATTTCCGCCATTCCCCACGATTTGAGGAGGAAAAAGATCCTTGCCAGCGTTTTTTTTACCGACAACGGCAGACTGGCGCGGTTGACAAGTAATAGAATAGTCACCAGGCAAAATGCCGGCACCAGTTGGACAAATAATAAAAAGAACGTGCCGAGGCTGGCGTAATCTTCAGAAAACATGACGCCGGGGATTTCCAGAAGCGTGACTTCGCTGGTGACGCCCGCGACATTCATATTCACGAAGGGAAAAAGGTTAGAAAGCAGCAACATAAATAGTGCCGCTAACGCACAGGCGGTAGGACGCTGCCGGGGCGCGTCCCACTCGGTCGTTAACGTTGCGCCGCATCGTGGACATGCTGCTTTTTGCCCATGCGAGAGGCGGGGTAAAGCCACCAGCATGTCACATTGCGGGCACAATATGTGCTTCGCAGCATGGTGATGTTCACACATAGGCGCTCCTTTCGTTATGCCCCGTTTTTCAGGCCTTCAAGATACTCCCAACGTTCGAAGGCTTGCTCAAGTTCTTGCTCCGCCTGACTTAAATCGGCCAGAACTTTTTGCGTTTGCTCATGGGGTTGGCTAAAAAAGGCCGCATCCGCAACCTGCGCCTGTAGCGCTTCCAGCTTCGCTTCCAGGTCTTCAAGCTGACCGGGTAACTGCTCCAGCTCGCGCTGCAGTTTATAGCTTAGTTTGCTACTGCCACGTTTTACAATTTCTGCTTTAGGGGCAATAACTTCCTCATTTTTTTTCGCCATCGGCTGTTTCGTCGCCAGATGCTGCTCTTGCTGCGCACGCGCATCATGGTAGCCGCCGATATAACGTCCGATTTTGCCGCCGCCCTCGAAAATCCAGCATTCCGTCACGGTATTATCGACAAATTGCCGATCGTGGCTGACCAGTAGTACCGTGCCCTGATAGCCATCAATTAATTCTTCTAATAGTTCCAGCGTTTCGACGTCAAGATCGTTCGTTGGTTCATCGAGAATTAAAAGATTGCTCGGCTTGAGGAACAGTCGCGCCAGCAGCAGACGGTTACGTTCGCCGCCGGAAAGCGCGCGGACGGGCGTCATCGCCCGTTTGGGGTGAAACAGGAAGTCCTGCAAATAGCCCAGTACATGGCGCGGCTTACCGTTTACCATCACCTCCTGCTTGCCTTCCGCCAGGTTATCCATCACGGTTTTTTCCGGGTCCAGTTCGGCGCGATGCTGATCGAAGTAGGCGACTTCCAGCTTCGTTCCTACGTGGATGCGCCCGCTGTCAGCCTGAAGCTGTCCCAGCATCAGTTTCAGTAGCGTGGTTTTACCGCAGCCGTTCGGGCCAATTAACGCAATCTTGTCACCGCGCTGTACCTGAGCGGAGAAATCTTTTACCAGTTGTTTTCCTTCTACCTGGTAATCGACGTTTTCCATCTCAAAAACGATTTTACCGGAGCGCGTCGCCTCTTCGACCTGCATCTTCGCCGTGCCCATCACTTCCCGGCGCTCGCTGCGTTCACGACGCATCGCTTTTAACGCCCGCACGCGCCCTTCATTACGGGTGCGGCGCGCCTTGATCCCCTGGCGAATCCAGACTTCTTCCTGCGCCAGTTTGCGATCAAATTCCGCGTTTTGTAACTCTTCTACGCGCAGCGCTTCTTCTTTCTCCAGCAGGTATTGATCGTAATTTCCCGGATAGGTGACCAGTTTGCCACGATCGAGATCGACAATGCGGGTCGCCATATTGCGAATAAACGAACGATCGTGAGAGATAAAAATAATCGTTCCGTTAAAGGTTTTCAGAAACCCTTCCAGCCAGTCGATAGTTTCGATATCCAGATGGTTAGTCGGTTCATCCAGTAACAATACGCGCGGATTGCTGACCAGCGCCCGACCCAGCGCGGCTTTACGCAGCCAGCCGCCGGAGAGCGACGACAGCGCGGCGTTAGGATCGAGTCCAAGCTGCGCCAGCACTTCATTAATACGGTTTTCCAGCTGCCACAGATTATGATGATCGAGCTGTTCCTGTACGCGCGCCATTTCATTGAGGTTTTTCTCGCTGGGATCGGTCATCACCAGACGGGAAATCTCATGATAGCGCTTGAGGTATTCCGCTTGTTCTTCGATACCTTCGGCGACAAAGTCATAGACGCTGCCCGCAATATTACGGGGCGGGTCCTGTTGCAGACGCGCGACGATCAGATCCTGCTCATAAATAATACGCCCATCGTCCAGACCCTGTTCGCGGTTGAGGATCTTCATCAAGGTGGATTTCCCGGCGCCGTTACGCCCCACCAGACAGACGCGTTCGTTATCTTCGATATGCAACTCTGCGTTATCGAGAAGCGGCGCGTCGCTGAACGACAGCCATGCGCCATGCATACTGATTAATGACATTTACTTTTCCTTTCAGGCGGCGCGGATCAGCCAGCAGTTATGGATCTGACGGTTACGGGCAAAATCCGGGGAAAGCGTTTTTTGCGTAATTTCTTGTGCGGTAAGCCCCAGCTCAGCCAGCCCTTCCAGATCCATACGGAATCCGCGCTTATTATTTGAGAACATGATGGTGCCGCCTTTACGCAGCAGACGTTTTAAATCTTTCATTAACGCGACATGATCGCGCTGAACATCAAACGACTCTTCCATACGTTTTGAGTTAGAGAACGTCGGCGGATCGATAAAGATCAAATCGAACTGTTCATTCGCCTCGCGCAGCCAGCCCAGGCAGTCGGCCTGAATCAGGCGATGCGCGCGGCCGCTCAGTCCGTTCAGACGCAAATTACGTTCGGCCCACTCCAGATAGGTGCGGGACATATCCACCGTTGTGGTGCTGCGCGCGCCGCCCAGACCCGCATGTACGCTGGCGCTGCCGGTATAAGAAAAGAGATTCAGGAAATCTTTGCCTTTGCTCATTTCTCCCAGCATCCTGCGGGCAATACGGTGATCGAGGAACAGACCGGTGTCGAGATAATCCGTCAGATTTACCCATAAGCGCGCATTATATTCGCTGACTTCAAGGAACTCGCCCTTCTCGCTCATTTTCTGATACTGGTTTTTTCCTTTTTGCCGTTCACGGGTTTTTAACACCAGTTTATTCGGCGGAATACCGAGCACTGACAAGGTTGCCGCAATAATATCGAACAGGCGCTGCCGCGCTTTTTGCGCATCCACCGTTTTCGGCGGCGCATATTCCTGAATCACCGCCCAGTCGCCGTAACGGTCTACCGCCACGTTATATTCCGGCAGGTCGGCATCATACAAGCGATAGCATTCAATCCCTTCCTGGCGCGCCCATTTCTCCAGCTTTTTAAGATTTTTACGCAGGCGGTTAGCGTAATCTTCCGCCACCGTCGCCGGTTTACTGTCCGCCGTGGTTTCCGCAATATGATAGTTTTTCTGCACGCAGTCCAGCGGGCCATTCTTGGCTTTAAACTGTTTGTCGGCACGTAATTGCAGGCTGCCCAGCAGATCGGGCGAAGCGCTGAACAGCGACAGGTTCCAGCCGCCAAACTGATTTTTCATGGTACGGCCCAGCAGACTGTGCAACGCAATCAGCGCCGGTTCGCTGTCCAGACGTTCGCCGTAAGGCGGGTTACTGATCACCGTACCATACGGGCCTTTCGGCAATGGATTACTCAGTTGCGCCACATCTTTCACTTCAAAGGTGATAAGCTCCCCGATACCGGCGCGACGGGCGTTGCTGCGCGCCCGCTCAATGACGCGCGCATCGCTGTCGGAACCGTAGAAATGAGAGGAATACTCCGCCAGCCCCTTACGCGCCCGGGTCTGCGCTTCGGCTTTCACTTCCTGCCAGATAGTTTCGTCATGCTGCGCCCAGCCGCTAAATCCCCAGTGACCACGGTGCAGTCCCGGCGCGCGATCGGTCGCCCACATCGCGGCCTCAATCAACAGTGTCCCCGAACCACACATAGGGTCGAGCAGCGGCGTACCTGGTTGCCAGCCGGAACGCATAACAATCGCTGCCGCCAGCGTCTCTTTAATTGGCGCCAGCCCGGTGCGATCGCGATAACCGCGCAGGTGCAGGCCATCACCACTGAGATCCAGCGCAATGCTGGCAGTTTCTTTATTCAGCCAGACGTTAATACGGAGGTCCGGCGATTCGCGGTCCACATTTGGACGCGGAAGATTTTTCCGCGTAAACGCATCGACAATCGCGTCTTTAACTTTCATCGCGCCATACTGACTATTACGGATGGTGTCGTTCAGGCCGCTGAAATGCACCGCAAACGTCGCGCCAGGATTAAAAATCTCTGTCCAGTTTATCGCCTGAACGCCGAGGTAAAGATCGAGATCGCTGTAGACCTTGCACTCACCCATCGGCAGGATAATACGCGAGGCCAGGCGGCTCCACATCAGGCTCTGGTAAATAAGCCGCGTGTCGCCCTGAAAATGGACCCCACCCTGAACAACCTGACACCCTACGGCGCCCAGTTTTTCCAGTTCAGTTTTTAACAGCTCTTCCAGCCCGCGGGCCGTACTGGCAAACAGAGAATTCATATTGTCACTTTTACGCTAAGAAAATTGTTGCGCATTATAGCTAATCTCACGCCCATGTCATAAAGTTGAAGGCTTATTTTCATTTGAGGGACTGTACGGTGGCGACGTTATCACGGCTCTTTATTCATCCGGTCAAATCCATGCGCGGCATTGGCCTGACTCATGCGCTGGCAGATATCAGCGGCCTGGCTTTTGATCGCATCTTCATGGTGACCGAGCCTGACGGCACATTTATTACCGCCCGCCAGTTTCCACAAATGGTACGTTTTACCCCTTCTCCCTTACACGACGGCCTCCATTTGACCGCGCCAGACGGCAGTAGCGCGCTGGTTCGCTTTACGGATTTCACCCCGCAGGATGCGCCGACCGAGGTCTGGGGAAACCATTTTACCGCTCGCGTCGCCCCGACGGCGATTAATCAATGGTTGAGCGGCTTTTTCTCCCGCGATGTCCAGTTGCGCTGGGTTGGGCCGCAGTTGACGCGCCGGGTCAAACGACATAACGCGGTGCCGCTGGGATTTGCCGATGGCTACCCGTATTTATTGACCAACGAAGCCTCGCTGCGCGATCTGCAACAGCGTTGTCCGGCAGGCGTACAAATGGAACAATTTCGCCCAAACTTAGTGGTTTCCGGCGTAGCGGCCTGGGAGGAAGATAGCTGGAAAGTGCTTCGCATTGGCGATGTGATTTTTGACGTCGTGAAGCCCTGTAGCCGCTGTATTTTTACAACCGTCAGCCCTGAAAAGGGGCAAAAACATCCTTCCGGAGAACCGCTGGCGACACTGCAAGCTTTTCGTACCGCCCAGGACAATGGCGATGTGGATTTCGGTCAGAATCTGATTGCCCGCAATAGCGGCGTCGTTCGCGTCGGCGATGAAGTGGAGATTCTGGCGACAGCTCCGGCAAAAGCTTATGGCGCCACAACGGTCGACGACAGCGTTACGCCAGATAAACACCCGGACGCGAGTGTAACCATCGACTGGCAGGGGCAAACCTTCTGCGGGAATAATCAACAGGTACTACTGGAACAGTTGGAGAATCAGGGGATTCGTATTCCGTATTCTTGCCGGGCTGGTATCTGTGGTTGCTGCCGGATACGTTTGCTGGAAGGCGAAGTAAGTCCGCTGAAAAAATCGGCTATGGGTGACGATGGTACGATTCTGAGCTGTAGCTGCGTGCCTAAGACCGCGTTACGACTGGAGAATTAAACCGCTTGTTCAAGGCTGAAACTGTGACAGTGAACCTGCGGTTTCAGCCTGTCGTTCATAATTTTAATCGCATCGCCAAGCTGCATGGTTCGGCCTGCTATCACCACGCCCGGCTGGGCCAGCAGACACAACGCGGCATTTTCTCCCGGCTCAACGACCAGCAAATTAACCTGTTCCGCCGTATCGCTTAACCAGACGCTTGCGGCATCGCCCGTTCCTGGGGTCCACATTTCGCCATGCGCTACAAAATGCCAGCTTTTTGGCATCTGCGGTTTGAGATAGCGAATCGCCACCAGCGCATTCAACACCAGCTCCGCGCGTTGCTCTTTGGTTAATTCGAAATCCCGGCATTTTTCTTCAAAGGAAAAATAGAGCGCGGCATCATCCACACAAAAACCGGTCGGGCAAAACGCGTCCGGCGTAAGCATTTTACGAGAGAAGCGCGAGCGAAAAAGTATACCATTGGCGAGATCGAGCATCATACGATCGTGCTCTTCATCATAATACCAGCGCCAGTTATCGTCAGGTTTAATTCGCATTACCTCTCTCCCGCTTTTAAGCAACTCGTAACAGCATTGTCCTTTTGCCGCTTCGTTTATTACCCCAATAATAAAAGCCTAAAATGTCTAAATAAGCAACAGTGGAGAAATATAAAACAACCAGCGCCGGAAATAAAGCCCTGGTTGTCTGATAAAGGCTAAAGTTTAGATGTGCGTTACGATTTCTTTAATCAATGGCGGGCCTTTAAAAATAAAGCCGGAATAAATTTGTACCAGCGTAGCTCCTGCCGCTATCTTCTCGCGCGCGGCGATAACTGAGTCAATGCCGCCGACGCCGATAATAGGCAATTGTCCCTTTAACTCCTGGGATAAACGGCGAATAATTTCTGTGCTTTTTAATTGTAATGGCCGGCCACTTAATCCCCCCGTTTGCTGGCAATTTTTCATTCCTTGTACCAGAGAACGATCGAGGGTGGTATTTGTCGCAATCACCCCATCAATATTATGACGAAGCAGGCTATCGGCAACCTGGATCAATTCTTCTTCACAAAGATCCGGCGCGATCTTTACTGCCACCGGCACATATTTATGGTGGATCGCCTGAAGATCGTTTTGCTTATTTTTAATGGCAGTTAACAGATCGTCCAGCGCATCGCCATACTGGAGCGTACGTAGCCCTGGCGTATTCGGCGAAGAAATATTAATGGCGATATAACCCGCATAAGCATAGACTTTTTCCATACAAATCAGGTAGTCATCTTTGCCATTTTCGACAGGCGTATCTTTATTTTTACCGATGTTAATTCCCAGAATACCATCAAAATGGGCTTTTTTAACATTCTCGACCAGGTTATCGACGCCCAGATTATTAAAGCCCATCCGATTGATCAGACCTTCAGCATCCACCAGACGAAAAAGACGCGGCTTATCGTTACCCGGCTGTGGGCGCGGCGTCACGGTGCCGATTTCCAGGGAGCCAAACCCCATCGCGCCTAACGCGTCGATGCACTCCCCGTCTTTATCCAGACCGGCAGCCAGCCCCAGTGGATTTTTAAAGGTAAGTCCCATGCAGGTAACCGGCTTTGTCGGTACTTTCTGGCGCACCAGCGCTTCCAGCGGCGTACCTGTAATGCGGCGTAATTGTTGAAATGTAAATTCATGAGCGCGCTCTGGATCGAGCTGGAAAAGGGCTTTACGAACGAAGGGATAGTACATGAACTCTCCTGGATTCCCGGTGTGCAAACCGGGGGCGTATTATGGGCGATAACAAGGCAAAAGGGAATTGACCTACGGCAATAAATAGCAATCGTTTTCCTTCATTCTCACTTCACGTTTCGCCAGCGAGCGTCATCGTCCGGCAAAAAAATGCCCGGCAGCGCTAACGCTTACCGGGCATGTCTTACCTCTTTATAACGGATACTGTCAGGCTAACGCTTTAGTTATCTTCTCGTACAGATCGCCGGAAAGATTCTCCAGTCCTTTTAACTGCTCCAGCGCCGCACGCATTTTCTCCTGACGCTTTTCATCGTAACGTTTCAGACGAATCAGCGGTTCAATGAGGCGAGATGCTACCTGCGGGTTACGGCTATTCAGATCGGTCAGCATCTCGACCAGGAACTGGTATCCGCTACCGTCTTGCGCATGGAACGCCGCCGGGTTGCTGCCAGCAAACGCGCCAATTAATGAACGGACGCGGTTCGGGTTGCTCATACTGAAAGAACGGTGTTTGAGCAGGCCGCGTACGGTTTCCAGTACATTTTCCGCCGGGCTTGTGGATTGCAGGATAAACCATTTATCCATCACCAGGCCGTCCTGATGCCACTTATCGTCATACTCCTGCATCAGCGTATCGCGGCACGGCAACTGCGCCGCCACCGCAGCAGACAGGGCCGCCAGCGCATCGGTCATATTATTGGCGTCGCGATACTGTTTGCTGACCAGCGTATTAGCCAGCTCCGTCTCGCCGAACGCCAGGAAGCGCAGGCAAGCATTGCGCAGCGTGCGCTTACCGATATCGCCGTGATCAACACGATACTCATCCAGATGATTGGCGTTATAGATAGCCAGGAACTCATCCGCCAGTTCTGCCGCCAGCGTACGCGTTAGCGCTTCACGAACTTGCGCAATGGCGATCGGGTCAATGACCTCAAACAGCTCCGCAATTTCATTGGCCGAAGGCAGCGTTAAAATTTCTGCGGCCAACGCCGGATCGATTTTCTCATCCAACAGTACTGCACGGAACGCATCAGCGACATGCACCGGAAGCGATAGCGGTTGCCCCTGCTGATGACGCGCCACATTCAGTTTAATGTATGTGGCCAGCAGGCTTTGCGCCGCATCCCAACGGGAGAAATCATTGCGCGCATGGCGCATCAGGAACGTCAACTGCTGATCGCTCCATTTATATTCCAGTTTCACCGGCGCTGAAAACTCGCACAGCAAGGCCGGAACAGGCTGGAAGTAAACATTATCGAAGGTAAATGTCTGCTCCGCCTGCGTGACGTTCAGCACGGCGTTGACCGGGTGACCGCCTTTTTGCAACGGAATGACGTTGCCTTCGTTATCGTACAGTTCGATGGCGAATGGAATATGCAGCGGCTGCTTCTCCGCCTGATCCGCCGTCGCCGGAGTGCGCTGGCTGATGGTCAACGTGTACTGCTCGGTTTCCGGATTATAATCATCTTTTACCGTTACAATCGGCGTGCCGGACTGACTGTACCAGCGGCGGAAATGGGACAAATCGACATTAGAAGCATCTTCCATCGCCTGTACGAAGTCATCACACGTCGCGGCGCTGCCGTCATGGCGCTCAAAATAAAGCTGCATCCCCTTCTGGAAATTTTCCTCACCCAGCAACGTGTGGATCATGCGAATGACTTCCGCGCCCTTTTCATAAACGGTGAGGGTGTAGAAGTTATTCATTTCGATTACTTTATCCGGGCGGATAGGATGCGCCATCGGGCTGGCGTCTTCCGCGAATTGTAAACCGCGCATGGTACGCACGTTACTGATGCGGTTCACCGCGCGTGACCCCAAATCAGAGCTAAACTCCTGATCGCGGAACACGGTTAGCCCCTCTTTAAGGCTCAACTGGAACCAGTCGCGGCAGGTGACGCGGTTGCCGGTCCAGTTGTGGAAATACTCATGGCCTATCACGCGCTCAATATCGAGATAATCTTTATCCGTCGCGGTATCGGTTCGCGCCAGCACGTATTTGGAGTTAAAGATATTGAGACCTTTATTCTCCATCGCGCCCATATTAAAGAAATCCACCGCGACAATCATATAGATGTCGAGGTCATATTCGAGCCCAAAACGCGCTTCATCCCATTTCATGGAATTTTTCAGCGAGGTCATTGCCCACGGCGCGCGATCCAGATTGCCACGGTCAACGTACAGTTCTAATGCGACGTCACGCCCGGAGCGGGTGGTAAAGGTATCGCGCAGCACGTCAAAATCACCGGCCACCAGCGCAAACAGATAACACGGTTTCGGGAACGGATCTTGCCACTGAACCCAGTGACGGCCATTCTCCAGCTCGCCCTGTGCAACACGGTTGCCATTGGAGAGCAGGAACGGATATTTGCTTTTATCGGCAATAATTTTGGTGGTAAATCGCGCCAGTACGTCCGGGCGGTCAAGATACCAGGTAATATGGCGGAAGCCCTCCGCTTCACACTGGGTACAGAGCGCATCGCCGGACTGGTACAATCCTTCCAGCGCCGTATTCGCCGCCGGACTTATCTCGTTGACAATGCGTAACGTAAAACGCTCTGGCAGGTCGCTGATGATAAGCGCGCCCTCTTCTTCCTTATATGCTGTCCACGGCGCATCGTTGACGTGGATAGATACCAGCGTTAAATCTTCCCCATCAAGGCGAAGAGGCGCATCAGGCGCGCTATGACGAACAGCCTGGCTTATTGCGGTGACCACGGTTTTTTCGGCATCGAGGTCAAAGGTCAAGTCAATATCAGTAATCTGGTAATCCGGCGCGCGATAGTCATGGCGGTATTTGGCTTGTGGCTGTTGTGTCATAAAAAACCTTTCGCATCTTTGTGTAGAGTGTCGACTCCAGTCTATTCCTGTTGCGCAAATCGCGCTACGCAGAATGTTCATCTTTTCAGGCACAAACGGCCTATTTGCTACATTTTTATAATATGTACTCATAGTTTTTAAAATCGATAAAGATCGTCCAGGAGCGCTTTAAACAAGGGATGAGCGAGATCGAGATGAAGCTTGATTAACGAACTTTAACGAACTTCACAGACCACTTTTGCCCCATCCATGCCCCACACATAATTTTAGTCAACGCCAGACCCCATCAGCCCTGCAGGGAAAATCGATAACAAACACGCCCGCAGTATGTTCCCCAAAATTTAAAAACAAGTGGTTCATGCCTTGACAGACAAAACCGCCAAAGCAAAAATACTGTATATACAAACAGTTATTAGCGAAACACGTATGTTCGTAGAACTGGTTTATGACAAGCGTAATGTTGAAGGACTCGAAGGGGCCAGCGAGATCATTCTGGCCGAACTGACGAAGCAGGTGCACCAGATTTTCCCTGATGCCGAAGTGAGGGTGAAGCCGATGCAGGCAAACTGCTTGAATAGTGATACCAACAAAAGCGATCGCGAAAATTTGAACAGATAGCTTATTAAAAATAGATTTATCTTAAACCACGTCATTTACATTTAGCCACCTCCCCAAAATCCGGATTCAGCTTAAGAAAAATGCGACAATACAATAAAAACATATCATATAAGCCCCCTCAACAAATGTAATTTTAAGGCCAACAAACACCTCTAACTTATTCACTTTCAATTAATTTCATAAATAATAATTAACAACAAAAGAATTGTATTAATATCCACACTGTAGTATATAATTACATTAACAAAATTACTATTCGGCGAGTATATTATGTTAAGACACATTCAAAATAGTTTAGGCAGCGTTTACAGAAGTAATACAGCAACTCCTCAGGGTCAGATTATTCACCATCGTAACTTTCAAAGCCAGTTTGATACCACAGGCAACACCCTCTACAATAATTGCTGGGTTTGCTCATTAAATGTTATCAAATCCAGAGATGGCAATAATTATAGTGCATTAGAGGACATCACTTCTGATAATCAAGCGTTTAATAATATATTAGAGGGTATTGATATAATAGAATGTGAGAATTTATTAAAAGAAATGAATGTGCAAAAAATACCTGAATCCTCTCTTTTTACAAACATTAAAGAAGCTTTACAGGCAGAAGTTTTCAATAGTACTGTAGAAGATGACTTTGAGAGTTTTATTTCTTACGAATTACAAAACCATGGACCACTGATGTTGATCAGGCCTTCACTTGGCTCGGAATGTCTACATGCAGAGTGCATTGTAGGCTATGATAGTGAAGTGAAAAAAGTATTAATTTATGATTCAATGAATACCTCACCTGAATGGCAATCAAATATTGATGTCTATGACAAGCTTACCTTAGCATTCAATGATAAATATAAAAATGAAGATTGCAGTATTTGTGGTCTTTACTATGACGGTGTTTATGAGCCAAAACCTTTACACTCCTCCTCCTGGAAAGACTGGTGTACCATTTTATGATAGTTAACCTTTACCAAGATAATTATTCAGGCTACCGCCAACATGGGGGGTCGGGGGTCGGAGGTTCAAATCCTCTCGTGCCGACCAAAATTCCCCTTAAAAACCAGCCTGTCAGGGCTGTTTTTTTTATGGCTCAATTTCCTACGGGGAAGCTATGGGGTGAAACTGGGGAATAAAGCCGTCGAGATTCGACGCAATTTGCGATTGATTCATCAGTTTGCTCACTGCTCAGTTTCGGAATTCATCAATACACAAATTTTCATTTCGAATTACTGTATAAATTCCCTGTAAATCATTACCGGAGCGCGCCACATTTTCCCCCTGCCCTATACTTTCAGTCTGACGACTGGAGGTTTCATATGTGTGGACGCTTTGCACAAGCACAGACCCGTGAAGAATATCTGGCATATCTGGCCGACGAAGCCGATCGTAATATTGCTTATGACCCTCAGCCTATAGGCCGGTATAACGTGGCGCCCGGGACTAAAGTCCTGCTATTGAGCGAACGCGACGAGCAATTACATCTCGACCCGGTGATTTGGGGTTACGCTCCCGGATGGTGGGATAAAGCTCCACTTATTAACGCCCGTGTCGCGACAGCGGCCTCCAGCAGAATGTTTAAGCCACTATGGCAGCATGGCCGGGCTATCTGTTTTGCCGATAGATGGTTCGAGTGGAAGAAGGAAGGCGACAAAAAACAGCCGTATTTCATTCACAGAAAGGACGGGAAGCCGATATTCATGGCTGCCATTGGCAGTACGCCGTTTGAGCGCGGTGATGAAGCAGAGGGATTCCTGATTGTTACCTCCGCAGCCGATAAAGGTCTGGTAGACATTCACGATCGTCGCCCGCTGGCACTGACACCGGAAACTGCTCGGGTATGGATGCGCCAGTTCCTGGAACCACATTCTAAGTCAATAACATACCGCGTCATACCTGCGCTCACACGTCCCATGATGCGAAAAGATACCAATCCATGCCAATAGTTAAAAACGGATGACTGTCCCGAATCCGTCCCACCTGCCCTATCCCACAAACCGGCGTTCAGATGCTCATTAAGAAAACCACCTCACCCTCATAACTCAGTAAGCGTCCCGTTTAGGACGTAGCGTAAGGATTATTTTACGGTTTCGAGGTTCCAGGGCAGCAGTTCGTGCACCCGGTTCGATGACCAGTCGCTGATTTTCCACAGCACGTCGCGTAACCATGCCTCGGACTCTACGCCGTTTAGTTTGCACGTACCCAGCAGGCTGTAGATGATCGCCGCTGCCTCGCCGCTCCTGTCTGAGCCGAAGAACAGATAGTTACGTCGGCCCAGCGCCACGCACCGTAAGGCGTTTTCACAGATGTTGTTGTCGATCTCCACCCGACCGTCGCTGCAGAGGACACGCTCAACGCATCACACTGCTTCAGCATGTAACCGAACTCCTTCGCCATCTCCGCATGCACCGACAACGTTTTCAACTGCGCCTGTATCCAGTCGTACAGCGACTGGCTTTTCTCTTTCCTGACCGACAGCCGTGTTTGCGCCGGGCTGTGGACTCTGTCATCTGTGATAGTGTCCATCAAAATTTAAGTGGACACTATCATCGCCGGATTGACAGGGTTCTGACAGACGTCCTCCACGGTGCGCTTACATTTTACCTATTAAGGAATATTTTTGCTTTTTAAAGGTATTAAACCATCTCGGTGATGTAACAAAAACTTTCCCTGCCATAGATTCTGATTCTAATTCTCGTGGTAATGCATCATAGGCATTAGCTGCTATACTTGAATTACTAAAATCAGTATAATAAATAAGTTTCCTTCTTGTTGCCATTCTATATTTACACACCCATTCATCTGCTGAAAGAATTAATGGGCCATTCAGATTACTCATACCTCTATTTTCAAACTGATGGGCTGAAACATCAAACACATAGTCTTTCCCTTCTTTATTTCCAACCACTGCAAAATGATTTGTTGGTATTTCCTCTGTTGGTTTATCCCAGATAAATATACCTCGATAACGAATATTATCGAACCCTTTTTCATTCATAAAATTGCTTACAGGAGTCATTAATGACTCACACTGCCCTACCGGATTCATTATTTTATTATTTATAATTGGATTCTGTTTCAATTCCTCCAGATAGGCCGCAGCATCAATATCACTGGTTAGGTTGTAAGTTATATCTGTGCGTTCCACTCCCGGTTCTGTTGCCATGGTTAAGTGATGTGTTTCACTGTACCCCTGGCAATTTACGGTATAGTTCCCGACGTCATCCAGGGTGACTGATAATATCTCCTGACTGTCTTCATCCAGAATACAGAAGTAGTTTTCCCCGTGCAGGCCGGAATGAATGTTTTCCTCCCATCCGTCATACGCGAGCGTCCTGAGCAGTTCAAATCTGCTGACCACATCCTCCCGCGTCGTTCCGGCCGGCGGGTGACAAATCGTCCAGATGCACTCCAGCGCTTCAGTCTGGTGCGTTGAGCAAAAAAATTCCTTCATTTTTTCCCAGGAACTCATTTCAGGGGGGGTATCAGACCAGGCAATACGATAAATGCGGCGGTTACTGATGATGGCGGGAAGACATCCGCTTCCAATATGAAAGGGCATAACAAAAAACCTTTATAAATTTACATATAGTATCTGTCCGACAGACATCATCTCTTCCTGGTCTTAATTTCACAATAAGGTTATCGGCGGATTCATGGTCGTCCTGCCATGGCGGGCTTCAGAAGGTGCAGAAGAAAAATCCGTTATGATGACCGGATGGCGGGACTGTCATTTTACAGCTAAAGTGTCGATTTTTTCAGGGTCGCTTTCCACGATGACCAGATCATCCGGCATCAGCGCCCGGGCGTTCAATTTTGGCGGGCAGAAGTCACCCGGCAGAAATTACTTAACGATGCAGATAATGCCATTAAGGACTGGCGCACAGAATTAACGTTGGGAATTATCAGTGATGAAAATAAAGCAGCTTTGATTCTGCCGATGAATTATATCAATGTTCTTAAATCGCTGGACTTAACAGGTGTTTCAGATGAGGCCACCTTCACAGCAATCAGGTGGCCTGCATTACCACAGTAACGCCTACTGGCTGGCTGGTCTTTCCGGCCAGTCAGGGGCGGCTGTATCCATCCGGTTTACCAGCACCCTATATTTTTTCCATTCGTCGAGCCGCGCTTTCTCATCATCTGTTGCGATTCCAAGATCAACTGCATCCTGCAATGGCGCGATTTTTTCAGATGCCATTTGCAAAAGACGGCTTTTGGTTTCTTCCGCCTGACGAAGCTGCGCTGCTTTTTCAGCCGCTTCGTCTTTTACCCAGACCTTAGCCTTACCATCCCATTTCTGGTATTCACCACCTGGTGAAACTGATGTGACATTTTCGGGCAACGGACCAGGAGCGGAGATATAAACCTGATTGCCGGTTGTTGTGTCGTAAACCATCTCGCCGCGGTGATCCTCATGCAAACTCCATGTCTGGGTTTCAGCGTCAAATATAGCAATATGACTGGCGGGAATATCAGGAGGGGCGATATCAGTACAGTTTGCCGGTAGTCCTGTGTGCGGCGGAATATACGCATCACCTGCACCAATAAATTCGTTAGTATCTGAACGCAGATTGAAAATTTTAATTGTCTGCGCCTGTTCGCTCATTTTAAAAGTCATTATGCCAGCCTCACTATGTAGTTAAATGCAATGTTTTTAACCGTGGTTTCCGCATTACCGTCTGCGTCCACAATAACGACGTGTCCGTGTGGACCTATATACATGGTGTGCTCATGTCCTCCGATATAAACTGTATGTGCATGGTCGCCAGCGGCCTGTGTCCATGCACCACCTCCTGGCTGAAATGAGGTGTGATTGGAATCTCCCCAGTATGAATTGATATAACCGCCGAACTGGTGAGTATGATTGCCCGTGGTATTGGTCGATTTCGTGCCGTAATCAAAGGATGAGGTAGATTTTGTCCCTAAGTCAGTAACCTGCGCCCGCGCGGTGTGCGAGTGCGATTTATTGCCGTCCATTTCTTGCGACAATACGGCACGTCCACTGATGGGCTTACCCTTTATTGTCCAGCCTCTCATGTCAGGGATAACGCCGGACGGATACGCTATAGCCAGTAACGGGTAAGCAGATTTATCGAAGGACTGCCCCTGCATCAGAGCGTAACCTGCCGGAGTAGCATCAGATGGCCATGCAATCGCCGCCCCTACTGGATGCGAATCCGGAGGTGGGTTTAGTGTGGTGTAAAGCATTGCCCATTCGGACCACTCAGCATCGGCGGTATCTCGATGGCTGCGAATATATGCGGGCGCTGGCGCACCATTTGTCCCGCTCCAGCCAATGAGGATTTCCCCATCACCGGTTCCGGTCAGACGTAAAATATTTCCGTATTGCGTCGGATAGCCATTGTTGTAAACCTCGCCCATTATCAGGCCACTATCGCTGCCTCTTGTCGTACCAGTCAGTGCCGGAAGCGCGCCGCGTGATGCCAGTCTGTTCGCTGCAACAGCCGTACCTGATGCAGGGAGCGCTCCGATATTTTGTACAAACAGCGGCTTATTCGGGATATCCGCACCACACTGGCTTTTAGCCATGTAGTTTTCATCACTTTCGGTTTTGCTGTAGACCTCAAGACTGGAGCGACCTTTGGCCTTATCCGGTACGTCGGACAGGTTCTGGTCCTTCTGCAAATACCGTGATCCCAAATAAATCTCCAGGGGATTAAGCACAGAAAAATAGGTTTTTGTATTATCCAGAACGCATAAGACAGGAATATCTTTAATAATATCATTGGCCGATAACTCTGCTTTATTCCCCTTGTATAGTGGGAATATGCCAAGCACACGTCCTCCCATCGTCAGTTGCAGAGTGCTGGCTCCGGTATTGTTTAGCGCCGGAATAACCACAAGTGGAGTGCGCAATGTCCAGTCAACTCCACCATTGACGAAATAAGTTGCTGGTAACTCCAGCGTCAGATTATTTTCTGTACCTCCGGCCACACCAGCGACATAATGCCCACTCTGGAGCTCTTCAATTTGTACAAACTGATTTTCAGATCCTCGCGTCGCAAAATTCGCTATAACGTCATTCAGTGACCATCCCTTCGCTGTTGTACCTTCCTGACCGCGAATAACCGTCAGCATGTCATTATTAACTGCTGTCAGATGGCATACCTCAAAAACTGTTTCTTTTGCGTCTGTCAGTGTAATTTTGGCGTAAGTTTTAAGAGGGTTTGAGCTGTTTGCATAATCGCTGGTCAGCAAATTAGCAAACATCGCTCCCACACCAGGCATCACCTGAATGGTCGTCTGGCTGGCGGTAATATCAGCCGCCAGTGAGGAGACGACATTATTTCCGAATCCAATAATCATTGCTCAACCACCGTTACCGAATAGGTATAAATAAAAGGGAGTTTCACCAGCGACTGGTCAATTGCATCTTTAAGAAAGTGTCCGACACCATCGCCATAGTCAGGAATGGAGACAAAAAAAATGCCCTTATCGGGCATTACACTAATATCAAAAGTGGACTGTACAGGTGGGTCTATTCCGTTAGCTCCATGTATAAAGCGTGCAAGCCGTCGTTTGAACCAGTTGATACAGAAGTGCGAACCATCGCCTTTATAAAAATTCCATGTCAGTATCCGTTTAAAATAGTCGTCCGGAACATATGACGCTGAGCCGGGAACATAATTTCTCAGTTTTGCATACGCGACATTATTGTACTCAATAGTGTTATACGCCCCACGAGCAATGGCATCCTCGGAGATTTGAAGCAAGGGGCGTGATTCCCCATAAATACCCGCCGCAATCCAGTCCAGCAACTCACCGGTAATCGCCGGGGAGGTCCAGCAAGGTAAATTCAGGTTGTTAAAGTAATCAAGATACCCCTGTGCCAGTTTGTTATAAGCATCAAAAAAGGCAACTATATCCGGATCGTCATTATATTGCGTATAGGGGTAGGCCGGAATAATGCTTTCAAGAAGAGCTGCCATATTGCTTAACCTGAATTTGTGAAGATGAAGTGGAAAAATAGGCGTAAGTATCACCATAAACCAGGCTGGAGTCGGTTGCAGGTGGGACAATTTTTCCGTTTATTCCAACCTGAATATCAATCATTGATACAAGGTTTGAAGATACAAGCCCCTTAACCTGATTAAGAAAAATATCCCGAATCAGGAAAATGTTTATTGGTTCACCCGTTGCAATTCCGTTAATGTAATCAGCAATGCTTTGCTGCACTGCTTTTTCAATCCCGGTTGGATCGATATAGCTGGTTGAGGCTGTATTCCAGGTGATTAAAAGCGTAACGTTTTGTGATGATGGCACTACAAACGGCACGTGATACGTATCCGGATACACAATGATCGGTATCGTTTTTTTATCCACCGCAGCGCCTGATGGATTCACTACATCATTCGTCAGTACGGAGATATCTGGCACGGCTTTATAGATAGCGTAAGCCACTTCATAAGGATCGCCGCCACCAGCAATCGCTACCCATGCCCCCAGCGATGCCTGTCGGTATGAGATCAGATTCTCCTGTACACCATAAACATTTTTCAGTTCAATCCGGTAACAGTCAGGCGTTCCCTGTACACCGTACATACAGGCATAAGGGGAAAATCATGGAAAACACAAATATTGTTACCACTGAGCAGCAGGCACCAAACACCATTTCTGCCAGTAACGCAATTTTTAACGTTCAGGCACTGGGTCAGTTAACAGCTTTCGCTAACCTGATGGCAGACTCACAGGTGACGGTACCGGCACACCTTGCAGGGAAACCAGCCGACTGTATGGCTATCGTCATGCAGGCTATGCAATGGGGCATGAACCCTTATGCATGCTGGTCATTATCAGCTGGTGCTAACCCAAACTTTATAGCAACGCAGATGGGGCATACCGATGCACAGATGGTTTACAAGGTGTATGGAAAGTGGATGTCAGAGAAGAGCGCAGAACAGGTTTCTCTGCTCAACCAGGCACTTTCCCGCTATGCCCCATCACTGCCCCAAAGCATGGTAGCAGCGCAGTAG
Protein sequences of DBSCAN-SWA_7 >NZ_CP043433|2865500:2906198|2894973_2897586_-|WP_000193790.1|DBSCAN-SWA MTQQPQAKYRHDYRAPDYQITDIDLTFDLDAEKTVVTAISQAVRHSAPDAPLRLDGEDLTLVSIHVNDAPWTAYKEEEGALIISDLPERFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLLSNGNRVAQGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGRDVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATADQAEKQPLHIPFAIELYDNEGNVIPLQKGGHPVNAVLNVTQAEQTFTFDNVYFQPVPALLCEFSAPVKLEYKWSDQQLTFLMRHARNDFSRWDAAQSLLATYIKLNVARHQQGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAELFEVIDPIAIAQVREALTRTLAAELADEFLAIYNANHLDEYRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRDANNMTDALAALSAAVAAQLPCRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRVRSLIGAFAGSNPAAFHAQDGSGYQFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDEKRQEKMRAALEQLKGLENLSGDLYEKITKALA >NZ_CP043433|2865500:2906198|2886546_2887800_-|WP_000333139.1|DBSCAN-SWA MCEHHHAAKHILCPQCDMLVALPRLSHGQKAACPRCGATLTTEWDAPRQRPTACALAALFMLLLSNLFPFVNMNVAGVTSEVTLLEIPGVMFSEDYASLGTFFLLFVQLVPAFCLVTILLLVNRASLPLSVKKTLARIFFLLKSWGMAEIFLAGVLVSFVKLMAYGDIGIGSSFIPWCLFCLVQLRAFQCVDRRWLWDDIAPQPALAQPLTPGITGIRQSLRSCACCTAILPAESLVCPRCHTKGYVRRKNSLQWTLALLFTSIMLYLPANILPIMITDLLGSKMPSTILAGVILLWSEGSYPVAAVIFLASIMVPTLKMIAIAWLCWDAKGHGKRDSERMHFIYEVVEFVGRWSMIDVFVIAVLSALVRMGGLMNIYPAMGALMFALVVIMTMFSAMTFDPRLSWDREYEPGHEES >NZ_CP043433|2865500:2906198|2878021_2878627_-|WP_001202375.1|DBSCAN-SWA MRALSYDRIYKSQEYLASLGTIQYRSLFGSYSLTVEDTVFAMVANGELYLRACEESVPYCVKHPPAWLMFMKCGRPVMLNYYRVDESLWRDQQQLVRLSKYSLDAAMKEKHSRILQHRLKDLPNMTFHLETLLNESGIKDENMLRILGAKMCWLRLRQSNPLLTVKVLYALEGAIVGVHEAALPASRRQELADWAHSLTAG >NZ_CP043433|2865500:2906198|2889734_2891843_-|WP_001086485.1|DBSCAN-SWA MNSLFASTARGLEELLKTELEKLGAVGCQVVQGGVHFQGDTRLIYQSLMWSRLASRIILPMGECKVYSDLDLYLGVQAINWTEIFNPGATFAVHFSGLNDTIRNSQYGAMKVKDAIVDAFTRKNLPRPNVDRESPDLRINVWLNKETASIALDLSGDGLHLRGYRDRTGLAPIKETLAAAIVMRSGWQPGTPLLDPMCGSGTLLIEAAMWATDRAPGLHRGHWGFSGWAQHDETIWQEVKAEAQTRARKGLAEYSSHFYGSDSDARVIERARSNARRAGIGELITFEVKDVAQLSNPLPKGPYGTVISNPPYGERLDSEPALIALHSLLGRTMKNQFGGWNLSLFSASPDLLGSLQLRADKQFKAKNGPLDCVQKNYHIAETTADSKPATVAEDYANRLRKNLKKLEKWARQEGIECYRLYDADLPEYNVAVDRYGDWAVIQEYAPPKTVDAQKARQRLFDIIAATLSVLGIPPNKLVLKTRERQKGKNQYQKMSEKGEFLEVSEYNARLWVNLTDYLDTGLFLDHRIARRMLGEMSKGKDFLNLFSYTGSASVHAGLGGARSTTTVDMSRTYLEWAERNLRLNGLSGRAHRLIQADCLGWLREANEQFDLIFIDPPTFSNSKRMEESFDVQRDHVALMKDLKRLLRKGGTIMFSNNKRGFRMDLEGLAELGLTAQEITQKTLSPDFARNRQIHNCWLIRAA >NZ_CP043433|2865500:2906198|2905787_2906198_+|WP_001676370.1|DBSCAN-SWA MENTNIVTTEQQAPNTISASNAIFNVQALGQLTAFANLMADSQVTVPAHLAGKPADCMAIVMQAMQWGMNPYACWSLSAGANPNFIATQMGHTDAQMVYKVYGKWMSEKSAEQVSLLNQALSRYAPSLPQSMVAAQ >NZ_CP043433|2865500:2906198|2904527_2905154_-|WP_000729406.1|DBSCAN-SWA MAALLESIIPAYPYTQYNDDPDIVAFFDAYNKLAQGYLDYFNNLNLPCWTSPAITGELLDWIAAGIYGESRPLLQISEDAIARGAYNTIEYNNVAYAKLRNYVPGSASYVPDDYFKRILTWNFYKGDGSHFCINWFKRRLARFIHGANGIDPPVQSTFDISVMPDKGIFFVSIPDYGDGVGHFLKDAIDQSLVKLPFIYTYSVTVVEQ >NZ_CP043433|2865500:2906198|2875416_2875863_+|WP_001261222.1|DBSCAN-SWA MRTVLNILNFVLGGFATTLAWLLATLVSIVLIFTLPLTRSCWEITKLSLFPYGNEAIHVDELNPAAKSVLMNTGGTLLNIFWLLFFGWWLCLMHIASGIAQCVTIIGIPVGIANFKIAAIALWPVGRRVVSVETARAAREANARRRFE >NZ_CP043433|2865500:2906198|2893047_2893590_-|WP_001574119.1|DBSCAN-SWA MRIKPDDNWRWYYDEEHDRMMLDLANGILFRSRFSRKMLTPDAFCPTGFCVDDAALYFSFEEKCRDFELTKEQRAELVLNALVAIRYLKPQMPKSWHFVAHGEMWTPGTGDAASVWLSDTAEQVNLLVVEPGENAALCLLAQPGVVIAGRTMQLGDAIKIMNDRLKPQVHCHSFSLEQAV >NZ_CP043433|2865500:2906198|2887814_2889722_-|WP_000053044.1|DBSCAN-SWA MSLISMHGAWLSFSDAPLLDNAELHIEDNERVCLVGRNGAGKSTLMKILNREQGLDDGRIIYEQDLIVARLQQDPPRNIAGSVYDFVAEGIEEQAEYLKRYHEISRLVMTDPSEKNLNEMARVQEQLDHHNLWQLENRINEVLAQLGLDPNAALSSLSGGWLRKAALGRALVSNPRVLLLDEPTNHLDIETIDWLEGFLKTFNGTIIFISHDRSFIRNMATRIVDLDRGKLVTYPGNYDQYLLEKEEALRVEELQNAEFDRKLAQEEVWIRQGIKARRTRNEGRVRALKAMRRERSERREVMGTAKMQVEEATRSGKIVFEMENVDYQVEGKQLVKDFSAQVQRGDKIALIGPNGCGKTTLLKLMLGQLQADSGRIHVGTKLEVAYFDQHRAELDPEKTVMDNLAEGKQEVMVNGKPRHVLGYLQDFLFHPKRAMTPVRALSGGERNRLLLARLFLKPSNLLILDEPTNDLDVETLELLEELIDGYQGTVLLVSHDRQFVDNTVTECWIFEGGGKIGRYIGGYHDARAQQEQHLATKQPMAKKNEEVIAPKAEIVKRGSSKLSYKLQRELEQLPGQLEDLEAKLEALQAQVADAAFFSQPHEQTQKVLADLSQAEQELEQAFERWEYLEGLKNGA >NZ_CP043433|2865500:2906198|2869769_2870981_+|WP_000140478.1|DBSCAN-SWA MTESTFPQYPRLVLSKGREKSLLRRHPWVFSGAVSRLEGKANLGETIDIVDHQGKWLARGAWSPASQIRARVWTFDKAESIDIAFFTRRLRQAQQWRDWLAKKDGLDSYRLIAGESDGLPGVTIDRFGHFLVLQLLSAGAEYQRAALISALQTCDPDCAIYDRSDVAVRKKEGMALTQGPVTGELPPALLPIEEHGMKLLVDIQGGHKTGYYLDQRDSRLATRRYVENQRVLNCFSYTGGFAVSALMGGCRQVVSVDTSQDALDIARQNVELNQLDLSKAEFVRDDVFKLLRAYREHGEKFDVIIMDPPKFVENKSQLMGACRGYKDINMLAIQLLNPGGILLTFSCSGLMTSDLFQKIIADAAIDAGRDVQFIEQFRQAADHPVIATYPEGLYLKGFACRVM >NZ_CP043433|2865500:2906198|2868201_2868981_-|WP_000548080.1|DBSCAN-SWA MHIRHQDLTTAEVRSSHLHRLHRVTLFSAAICHITQGSKVIIQDDSRLVAGPGELIIIPANTPLEIINQPAQNGFRSDLLLLSPEIIARFKTMYVQDYPPANLTSLCTPMSRSLTFMWENVLDAVRQGLPVGLQEHQAMGLLLALLHDGAAGPLLIERRYTLTEQVRQLIMLSPAKLWTAQEIARRLAMGTSTLRRRLQRESQSYRQIIEEVRMSCALSQLQSTTLPIGEIALRCGYLSGSRFTARFRQHYGCLPSQVR >NZ_CP043433|2865500:2906198|2883300_2883819_+|WP_000227928.1|DBSCAN-SWA MVDKRESYTKEDLLASGRGELFGAKGPQLPAPNMLMMDRVVKMTETGGNFDKGYVEAELDINPDLWFFGCHFIGDPVMPGCLGLDAMWQLVGFYLGWLGGEGKGRALGVGEVKFTGQVLPTARKVTYRIHFKRIVNRRLIMGLADGEVLVDGRLIYTAHDLKVGLFQDTSAF >NZ_CP043433|2865500:2906198|2902240_2902822_-|WP_000143167.1|tail|DBSCAN-SWA MTFKMSEQAQTIKIFNLRSDTNEFIGAGDAYIPPHTGLPANCTDIAPPDIPASHIAIFDAETQTWSLHEDHRGEMVYDTTTGNQVYISAPGPLPENVTSVSPGGEYQKWDGKAKVWVKDEAAEKAAQLRQAEETKSRLLQMASEKIAPLQDAVDLGIATDDEKARLDEWKKYRVLVNRMDTAAPDWPERPASQ >NZ_CP043433|2865500:2906198|2891941_2893051_+|WP_000224079.1|DBSCAN-SWA MATLSRLFIHPVKSMRGIGLTHALADISGLAFDRIFMVTEPDGTFITARQFPQMVRFTPSPLHDGLHLTAPDGSSALVRFTDFTPQDAPTEVWGNHFTARVAPTAINQWLSGFFSRDVQLRWVGPQLTRRVKRHNAVPLGFADGYPYLLTNEASLRDLQQRCPAGVQMEQFRPNLVVSGVAAWEEDSWKVLRIGDVIFDVVKPCSRCIFTTVSPEKGQKHPSGEPLATLQAFRTAQDNGDVDFGQNLIARNSGVVRVGDEVEILATAPAKAYGATTVDDSVTPDKHPDASVTIDWQGQTFCGNNQQVLLEQLENQGIRIPYSCRAGICGCCRIRLLEGEVSPLKKSAMGDDGTILSCSCVPKTALRLEN >NZ_CP043433|2865500:2906198|2871038_2871356_+|WP_000561983.1|DBSCAN-SWA MIASKFGIGQQVRHSLLGYLGVVVDIDPEYSLDEPSPDELAVNDELRAAPWYHVVMEDDDGQPVHTYLAEAQLRSEMRDEHPEQPSMDELARTIRKQLQAPRLRN >NZ_CP043433|2865500:2906198|2875881_2878035_+|WP_000950876.1|DBSCAN-SWA MLSPLIRRYTWNSTWLYYIRIFIALCGTTALPWWLGDVKLTIPLTLGMVAAALTDLDDRLAGRLRNLIITLICFFIASASVELLFPWPWLFALGLTLSTSGFILLGGLGQRYATIAFGALLIAIYTMLGTSLYDHWYQQPLLLLAGAVWYNLLTLTGHLLFPIRPLQDNLARSYEQLAHYLELKSRLFDPDIEDESQAPLYDLALANGQLMATLNQTKVSLLSRLRGDRGQRGTRRTLHYYFVAQDIHERASSSHIQYQTLRDYFRHSDVMFRFQRLMSMQAQACTQLARCILLRTPYQHDPRFERVFTHIDAALERMRASGASLELLNTLGFLLTNLRAIDAQLATIESEQAQAMPRNESENQLADDSLHGFSDIWLRLSRNFTPESALFRHAVRMSLVLCIGYALIQITGMRHGYWILLTSLFVCQPNYNATRHRLALRIIGTLVGVAIGLPILWFVPSLEGQLVLLVITGVLFFAFRNVQYAHATMFITLLVLLCFNLLGEGFEVALPRVVDTLIGCAIAWAAVSFIWPDWRFRNLPRVLQRATDANCRYLDAILEQYHQGRDNRLAYRIARRDAHNRDAELASVVSNMSSEPDVTAETREAAFRLLCLNHTFTSYISALGAHREKLSNPDVLGLLDDAVCYVDDALHHQPEDEQRVHQALEGLKQRVQSLETRPDSKEPLVVQQIGLLIALLPEIGRLQRQISPPTSTLITQP >NZ_CP043433|2865500:2906198|2880833_2881286_-|WP_000877172.1|DBSCAN-SWA MKYQQLENLESGWKWKYLVKKHREGELITRYVEASAAQEAVNLLLALENEPVRVNVWIDRHMNPALLNRMKQTIRARRKRHFNAEHQHTRKKSIDLEFMVWQRLAGLAQRRGKTLSETIVQLIEDAEHKEKYATQMTTLKQDLQALLGKK >NZ_CP043433|2865500:2906198|2872744_2873203_+|WP_000424187.1|DBSCAN-SWA MELTTRTLPTRKHIALVAHDHCKQMLMNWVERHQPLLEKHVLYATGTTGNLIQRATGMDVNAMLSGPMGGDQQVGALISEGKIDVLIFFWDPLNAVPHDPDVKALLRLATVWNIPVATNVSTADFIIQSPHFNDAVDILIPDYARYLAERLK >NZ_CP043433|2865500:2906198|2899520_2900147_+|WP_000334547.1|DBSCAN-SWA MCGRFAQAQTREEYLAYLADEADRNIAYDPQPIGRYNVAPGTKVLLLSERDEQLHLDPVIWGYAPGWWDKAPLINARVATAASSRMFKPLWQHGRAICFADRWFEWKKEGDKKQPYFIHRKDGKPIFMAAIGSTPFERGDEAEGFLIVTSAADKGLVDIHDRRPLALTPETARVWMRQFLEPHSKSITYRVIPALTRPMMRKDTNPCQ >NZ_CP043433|2865500:2906198|2871987_2872650_+|WP_000847719.1|DBSCAN-SWA MKTGALATFLALCLPVTVFATTLRLSNEVDLLVLDGKKVSSSLLRGAESIELENGPHQLVFRVEKTIRLPGNEERLYISPPLVISFDTQLISQVNFQLPRLENEREASHFNAAPRLALLDGDAMPIPVKLDILAITSTAKVVDYEIETERYNKSAKRASLPQFATMMADDSTLLSDVSELDTVPPQSQTLTEQRLKYWFRLADPQTRHHFLQWAEKQPPS >NZ_CP043433|2865500:2906198|2878843_2879353_+|WP_000288733.1|DBSCAN-SWA MYTSGYANRSSSFPTTTHNAARTATENAAAGLVSEVVYHEDQPMMAQLLLLPLLRQLGQQSRWQLWLTPQQKLSREWVQSSGLPLTKVMQISQLAPRHTLESMIRALRTGNYSVVIGWMTEELTEEEHASLVEAAKVGNAVGFIMRPVRAHALSRRQHSGLKIHSNLYH >NZ_CP043433|2865500:2906198|2873238_2875293_-|WP_000420505.1|DBSCAN-SWA MELKATSLGKRLAQHPYDRAEILNAGVKVSGDRHEYLIPFNQLLAIHCKRGLVWGELEFVLPEDKVVRLHGTEWSETQQFHRYLDAHWRRWSQEMSDVAAQALQEQWARISERTGGNQWLTRERVRGLEHEIRQTFAALPLPVSRLEEFAHCREIWRKCLAWLQDSEGSRQQHNQAYADAMLEAHADFFTQIESSPLNPSQARAVVNGESSLLVLAGAGSGKTSVLVARAGWLLARGQADAGQILLLAFGRKAAEEMDERIRERLHTEEITARTFHSLALYIIQQGSKKAPVVSKLESDATARHQLFLRTWRQQCSEKKAQAKGWRQWLEEEMQWVVPEGNFWDDETLQRRLAPRLDRWVSLMRMHGGAQAEMIAGAPEECRELFGKRIKLMAPLLKAWKSALKAENAVDFSGLIHQAMVILEKGRFISPWKHILVDEFQDISPQRAALLEALRKQNSQTTLFAVGDDWQAIYRFSGAQLSLTTAFHQTFGEGEHCHLDTTYRFNSRIGDIANRFVQQNPHQLKKPLNSLTPGDKKAVTLLDESQLDALLDKLSGYAKEDERILVLARYHHLKPASLQKAATRWPKLQIDFMTIHASKGQQADYVILVGLQEGNDGFPAPARESIMESALLPQVEDFPDAEERRLLYVALTRARARVWLLFNKDNPSRFVEALKQLDVPVARKP >NZ_CP043433|2865500:2906198|2866799_2867459_+|WP_000374046.1|protease|DBSCAN-SWA MDRIITSSRDRSSLLSTHKVLRNTYFLLSLTLAFSAITATASTVLMLPSPGLILTLVGMYGLMFLTYKTANKPVGILSAFAFTGFLGYILGPILNAYLSAGMGDVIGLALGGTALVFFCCSAYVLTTRKDMSFLGGMLMAGIVVVLIGMVANIFLQLPALHLAISAVFILISSGAILYETSNIIHGGETNYIRATVSLYVSLYNIFVSLLSILGFASRD >NZ_CP043433|2865500:2906198|2884341_2884905_-|WP_000759136.1|DBSCAN-SWA MKKWLVVIMAFWLASCSSGGENKSYYQLPIAQSGVQSTASQGNRLLWVEQVSVPDYLAGNGVVYQTSDVQYVIANNNLWASPLDQQLRNTLVANLSARLPGWVVASQPLGTTQDTLNVTVTGFHGRYDGKVIVSGEWLLNHNGQLIKRPFHIEASQQKDGYDEMVKVLASAWSQEAAAIADEIKRLP >NZ_CP043433|2865500:2906198|2867545_2867875_+|WP_000904449.1|DBSCAN-SWA MLIFEGKEISTDSEGYLKETTQWSETLAVAIAANEGIELSAEHWEVVRFVREFYLEFNTSPAIRMLVKAMANKFGEEKGNSRYLYRLFPKGPAKQATKIAGLPKPVKCI >NZ_CP043433|2865500:2906198|2883918_2884086_-|WP_001537784.1|DBSCAN-SWA MKRQKRDRLERAHQRGYQAGIAGRSKEMCPYQTLNQRSYWLGGWRQAMEDRAVMA >NZ_CP043433|2865500:2906198|2869006_2869555_-|WP_000859416.1|DBSCAN-SWA MKTVFSLTAAAMMALSGGVSAASAFSLSSADIPADFRLTQQHVFKGFGCSGENISPQLSWRNPPAGTKSYAITVFDPDAPTGSGWWHWTMVNIPAQIHDLPTGADKKTLPAGVVQGRNDFGYAGFGGACPPPGDKPHRYQFTVWALNTATLPLDSESSGALVGFMLNAHVIAKAKFTATYGR >NZ_CP043433|2865500:2906198|2871400_2871817_-|WP_000975204.1|DBSCAN-SWA MMKETDIADVLTSTRTIALVGASDKPDRPSYRVMKYLLEQGYHVIPVAPKVAGKTLLGQQGYDTLADIPEKVDMVDVFRNSEAAWGVAQEAIAIGAKTLWLQLGVINEQAAVLARDAGMTVVMDRCPAIEIPRLGLAK >NZ_CP043433|2865500:2906198|2893755_2894766_-|WP_000291723.1|DBSCAN-SWA MYYPFVRKALFQLDPERAHEFTFQQLRRITGTPLEALVRQKVPTKPVTCMGLTFKNPLGLAAGLDKDGECIDALGAMGFGSLEIGTVTPRPQPGNDKPRLFRLVDAEGLINRMGFNNLGVDNLVENVKKAHFDGILGINIGKNKDTPVENGKDDYLICMEKVYAYAGYIAINISSPNTPGLRTLQYGDALDDLLTAIKNKQNDLQAIHHKYVPVAVKIAPDLCEEELIQVADSLLRHNIDGVIATNTTLDRSLVQGMKNCQQTGGLSGRPLQLKSTEIIRRLSQELKGQLPIIGVGGIDSVIAAREKIAAGATLVQIYSGFIFKGPPLIKEIVTHI >NZ_CP043433|2865500:2906198|2884901_2886542_-|WP_000433414.1|DBSCAN-SWA MEPKKGEAKVQKVKNWSPVWIFPIVTALIGAWILFYHYSHQGPEVTLITTNAEGIEGGKTTIKSRSVDVGVVESATLTDDLTHVQIKARLHSGMEKLLHKDSVFWVVKPQVGREGISGLGTLLSGAYIELQPGSKGSQPESYQLLDSPPLAPPDAKGIRVILDSKKAGQLSPGDPVLFRGYRVGSVETSSFDPQKRTMSYQLFIKAPNDRLVTSNVRFWKDSGIAVDLTSAGMRVEMGSLTTLFGGGVSFDVPEGLEQGQPVAEKTAFNLYDDQKSIQDSLYTDHIDYLMFFKDSVRGLQPGAPLEFRGIRLGTVSKVPFFASKMRQVFNDDYRIPVLVRIEPERLKAQLGENADVGAHLTELLKRGLRASLKTGNLVTGALYVDLDFYPKEPPITGLREFDGYEIIPTVSSGLAQIQQRLVETLDKINNLPLNPMIEQATNTLSESQRTMRRLQTTLDNMNKITSSQSMQQLPADMQTTLRELNRSMQGFQPGSAAYNKMVADMQRLDQVLRELQPVLKTLNEKSNALVFEAKDKKDPEPKRAKQ >NZ_CP043433|2865500:2906198|2902821_2904531_-|WP_000583382.1|tail|DBSCAN-SWA MIIGFGNNVVSSLAADITASQTTIQVMPGVGAMFANLLTSDYANSSNPLKTYAKITLTDAKETVFEVCHLTAVNNDMLTVIRGQEGTTAKGWSLNDVIANFATRGSENQFVQIEELQSGHYVAGVAGGTENNLTLELPATYFVNGGVDWTLRTPLVVIPALNNTGASTLQLTMGGRVLGIFPLYKGNKAELSANDIIKDIPVLCVLDNTKTYFSVLNPLEIYLGSRYLQKDQNLSDVPDKAKGRSSLEVYSKTESDENYMAKSQCGADIPNKPLFVQNIGALPASGTAVAANRLASRGALPALTGTTRGSDSGLIMGEVYNNGYPTQYGNILRLTGTGDGEILIGWSGTNGAPAPAYIRSHRDTADAEWSEWAMLYTTLNPPPDSHPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGVIPDMRGWTIKGKPISGRAVLSQEMDGNKSHSHTARAQVTDLGTKSTSSFDYGTKSTNTTGNHTHQFGGYINSYWGDSNHTSFQPGGGAWTQAAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA >NZ_CP043433|2865500:2906198|2900794_2901763_-|WP_001674638.1|DBSCAN-SWA MPFHIGSGCLPAIISNRRIYRIAWSDTPPEMSSWEKMKEFFCSTHQTEALECIWTICHPPAGTTREDVVSRFELLRTLAYDGWEENIHSGLHGENYFCILDEDSQEILSVTLDDVGNYTVNCQGYSETHHLTMATEPGVERTDITYNLTSDIDAAAYLEELKQNPIINNKIMNPVGQCESLMTPVSNFMNEKGFDNIRYRGIFIWDKPTEEIPTNHFAVVGNKEGKDYVFDVSAHQFENRGMSNLNGPLILSADEWVCKYRMATRRKLIYYTDFSNSSIAANAYDALPRELESESMAGKVFVTSPRWFNTFKKQKYSLIGKM >NZ_CP043433|2865500:2906198|2865500_2866181_+|WP_000938186.1|protease|DBSCAN-SWA MLPVTYRLIPQSGVSTYGLNTADTPVFPDIPEHAPNPSRLRLAHDSLAINSEFRLEPECVVEYLISGAGGIDPDTEIDDDTYDECYDELSSVLQNAYTQSETFRRLMSYAYEKELHDVEQRWLLGAGEAFETTVAQEHFKLSEGRKVICLNLDDSDDSYTEHYESNEGRQLFDTKRSFIHEVVHALTHLQDKEENHPRGPVVEYTNIILKEMGHPSPPRMVYIFNK >NZ_CP043433|2865500:2906198|2898012_2898204_+|WP_000497441.1|DBSCAN-SWA MFVELVYDKRNVEGLEGASEIILAELTKQVHQIFPDAEVRVKPMQANCLNSDTNKSDRENLNR >NZ_CP043433|2865500:2906198|2867871_2868153_-|WP_000072884.1|DBSCAN-SWA MSNVCIIAWVYGRVQGVGFRYTTQHEAQRLGLTGYAKNMDDGSVEVVACGDAAQVEKLIKWLKEGGPRSARVDKILTEPHSPRETLTGFSIRY >NZ_CP043433|2865500:2906198|2881471_2883232_+|WP_000156448.1|protease|DBSCAN-SWA MTITKLAWRDLVPDSESYQEIFAQPHATDENDTLLSDTQPRLQFALEQLIQPWASSSFMLTKAPEEQEYLTLLSDAVRALQTDAGQLTGGHYDVSGHTVHYRAAQNAQDNFATVTQVVSADWVEAEQLFGCLRQYNGDIILQPGLVHQANGGVLIISLRTLLAQPLLWMRLKAIVSRERFDWVAFDESRPLPVSVPSMPLKLKVILVGERESLADFQEMEPELAEQAIYSEFEDNLQIADAEAMTLWCQWVTRIALRDNLPPPAPDAWPVLIREAVRYTGEQDTLPLCPLWIARQFKEASPLCEGDTCGAEALSLMLARREWREGFLAERMQDEILQEQILIETEGERVGQINALSVIEFPGHPRAFGEPSRISCVVHIGDGEFNDIERKAELGGNIHAKGMMIMQAFLMSELQLEQQIPFSASLTFEQSYSEVDGDSASMAELCALISALANVPVNQNIAITGSVDQFGRAQPVGGLNEKIEGFFAICEQRELNGKQGVIIPAANVRHLSLKSELLQAVKEEKFTIWAVDDVTDALPLLLNLVWDGEGQTTLMQTIQERIAQATQQEGRHRFPWPLRWLNAFIPN >NZ_CP043433|2865500:2906198|2905137_2905767_-|WP_000274547.1|DBSCAN-SWA MYGVQGTPDCYRIELKNVYGVQENLISYRQASLGAWVAIAGGGDPYEVAYAIYKAVPDISVLTNDVVNPSGAAVDKKTIPIIVYPDTYHVPFVVPSSQNVTLLITWNTASTSYIDPTGIEKAVQQSIADYINGIATGEPINIFLIRDIFLNQVKGLVSSNLVSMIDIQVGINGKIVPPATDSSLVYGDTYAYFSTSSSQIQVKQYGSSS >NZ_CP043433|2865500:2906198|2898474_2899161_+|WP_001525490.1|DBSCAN-SWA MLRHIQNSLGSVYRSNTATPQGQIIHHRNFQSQFDTTGNTLYNNCWVCSLNVIKSRDGNNYSALEDITSDNQAFNNILEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDDFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPKPLHSSSWKDWCTIL >NZ_CP043433|2865500:2906198|2879709_2880762_+|WP_001674965.1|DBSCAN-SWA MKKTAIAIAVALAGFATVAQAAPKDNTWYAGAKLGWSQYHDTGFIHNDGPTHENQLGAGAFGGYQVNPYVGFEMGYDWLGRMPYKGDNINGAYKAQGVQLTAKLGYPITDDLDVYTRLGGMVWRADTKSNVPGGPSTKDHDTGVSPVFAGGIEYAITPEIATRLEYQWTNNIGDANTIGTRPDNGLLSVGVSYRFGQQEAAPVVAPAPAPAPEVQTKHFTLKSDVLFNFNKSTLKPEGQQALDQLYSQLSNLDPKDGSVVVLGFTDRIGSDAYNQGLSEKRAQSVVDYLISKGIPSDKISARGMGESNPVTGNTCDNVKPRAALIDCLAPDRRVEIEVKGVKDVVTQPQA >NZ_CP043433|2865500:2906198|2901988_2902237_+|WP_072100753.1|tail|DBSCAN-SWA MRHQRPGVQFWRAEVTRQKLLNDADNAIKDWRTELTLGIISDENKAALILPMNYINVLKSLDLTGVSDEATFTAIRWPALPQ |
40 | Salmonella_phage(28.57%) | tail,protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
2977477 : 2984790
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP043433|2977477:2984790|DBSCAN-SWA TTTGCATAAGTATCTCTCGGTAGTAAAAAAGCACCGAGTTCCTCTGTCTGATGCTGCTGTTGATTTGTTAAAAGATTTACCACGATTAAAAGATAACAATCATGTATTCCCTGCCCCTCGCGCTGAAACACTTTCTGATATGTCGTTATTGGCTGTATTGAAACGAATGGGATATATCGACTTAACGCAACATGGCTTCCGTTCTACTTTCCGTGAGTGGGCTGGTGAAGCAACGGATTATCAACGTGAGGTTATTGAACATGCGTTGGCGCACCAGTTGGCAGATAAGGCTGAAGCAGCGTATCAGCGTGGGACGTTATGGCCTAAACGGGTGGCGTTGATGGATGATTGGACGGGGTATAGCACTGCCAACAGCTAAGCTACCTGTACGAAAGCATTATCGTTGATAACAACGTAGAAAGTGTGATGCTAATAGCATTCGCTTTCGAAAATGTGATAAGTAATAATTTCATACTGAACTATTTCTTATATAATTATTATCATAATTTGCAAATTACATAACCCACTCAAGGAGAGGTTATGCCCGGACTGATAGGCTACTGGAAGCAACTTCCCACCAAAGATGAATATATTAAAAAACACAATATGAGCAAAATATCCTGCTACAGTTGTGGTCACGAGAAATTCAGCGATGTTGGTTTGATACAGGTATGGGATAATCACAGAAGAATTCTTTGTGCTAAGTGTAAGACTACTCTTTTCAGAGAAGAGGATTAGTTTTTTTGGCATTGGTAACAGCGGCTTCAGCATCCCTTTTCACGCAGCGGGTCGGGCTTTTTTTTCGCATTTGACCCGTCGATTACCGGATGATGACGCAATTTACAAGCGCCTTGTCCGCCTACCGCGAGCACAACGCCATCAGGCTAACTATTAGCCGGCGTAAAAAAACCGGGCGCTAAGGCCCGGTTTGTACGGCAGTGAAACGAAGATTAATGCGCGGCTTCCGGCTTGTGCTTTTGCGCACTCTGGAAGCCATACGTCAACGCATTTTTCTCTTTATCCAGCGCGACGGTGACCTGTCCGCCATCAACCAGCGATCCAAACAGCAACTCATTGGCCAGCGGTTTTTTCAGGTTATCCTGAATCACACGCGCCATCGGTCGTGCGCCCATCGCCCGGTCATAGCCCTTTTCCGCCAGCCAGTCGCGCGCTTCCTGACTGACTTCCAGAGAGACGCCTTTCTGATCCAACTGAGCCTGCAACTCGACGATAAACTTATCGACAACCTGATGAATCACCTCGCCAGACAGATGATCGAACCAAATAATGTTGTCGAGACGGTTACGGAACTCCGGCGTAAACACTTTCTTGATCTCGCCCATCGCATCGGTACTGTTGTCCTGATGAATAAGACCAATAGATTTACGTTCGGTTTCTCGCACGCCGGCGTTGGTGGTCATCACCAGCACCACGTTGCGGAAATCCGCCTTACGGCCATTGTTATCGGTCAGCGTACCGTTATCCATCACCTGCAGCAGCAGGTTAAAGACATCCGGGTGCGCTTTTTCGATCTCATCCAGCAACAGCACCGCATGAGGATGCTTAATCACCGCATCCGTCAGCAGCCCGCCCTGGTCGAAACCGACGTATCCCGGAGGCGCGCCGATCAAACGGCTCACCGTATGACGCTCCATATATTCGGACATATCGAAGCGCAACAGCTCAATACCCAGCGCTTTTGAAAGCTGTACCGTAACTTCAGTTTTCCCTACGCCAGTTGGCCCGGCGAACAAGAATGAGCCGACAGGTTTATGCTCATGGCCCAGACCGGCACGACTCATCTTAATAGCTTCGGTCAGCGCCTCAATCGCGTTATCCTGGCCGAAGACCAGCATTTTCAGACGATCGCCCAGGTTCTTCAGCGTATCGCGATCGCTCTGCGAGACGCTCTTTTCAGGAATTCGCGCAATTCGCGCCACTACGGACTCAATATCCGCCACGTTGACCGTTTTCTTACGTTTGCTCACCGGCATCAGACGCGCCCGAGCGCCCGCTTCGTCAATCACGTCAATGGCTTTATCCGGCAGATGGCGGTCATTGATATATTTTACCGCCAACTCGACCGCCGCACGCACCGCTTTCGCGGTATAACGCACGTCGTGGTGCGCTTCGTACTTAGGTTTCAAGCCGTTGATAATTTGCACCGTCTCTTCCACCGAAGGCTCGGTAATATCAATTTTCTGGAAACGGCGCGCTAATGCACGGTCTTTCTCAAAAATATTGCTGAATTCCTGATAGGTCGTTGAGCCGATCACCCGGATCTTGCCGCTGGAAAGCAGCGGTTTAATCAGATTTGCCGCATCCACCTGTCCGCCCGACGCCGCGCCAGCGCCGATAATGGTATGGATTTCATCGATAAACAGGATGCTGTTGGTATCCTGCTCAAGCTGTTTCAGCAACGCCTTAAACCGTTTTTCAAAATCGCCGCGGTATTTGGTGCCCGCCAGCAGCGAACCGATATCCAGAGAGTAAATGGTGCAATCGGCCATCACTTCCGGCACATCGCCCTGCACGATACGCCAGGCCAGCCCTTCGGCAATCGCCGTTTTGCCGACGCCGGATTCCCCTACCAGCAACGGGTTATTTTTACGGCGACGACACAAGACCTGGATCGCGCGTTCAAGTTCTTTTTCACGACCAATCAGCGGATCGATGCCGCCCACGCGAGCAAGTTGGTTAAGATTCGTCGTGAAGTTTTCCATACGTTCCTCCCCGCCAGCTTGTTCGTCGCCAGTTGGCTGATTGCCGAGATCGGAAGATTGGCTCGGTTCGTCTTTTCGCGTCCCGTGAGAAATAAAGTTCACGATATCCAGACGGCTCACTTCATGCTTGCGCAGCAGATAAGCCGCCTGTGATTCCTGTTCGCTAAAGATAGCCACCAGCACATTCGCGCCAGTCACTTCACTACGCCCGGAAGACTGAACATGGAAGACGGCACGCTGCAGGACACGCTGGAAACTTAACGTCGGCTGCGTATCACGCTCTTCTTCACTGGCAGGCAGTACGGGTGTGGTTTGTTCAATGAAGGCTTCGAGTTCCTGACGGAGCGCCACCAGATCCACGGAGCATGCTTCCAGCGCTTCGCGAGCCGATGGGTTGCTGAGCAGCGCCAGCAACAGATGCTCGACGGTCATAAACTCATGACGGTGCTCGCGCGCTCTGGCGAAAGCCATGTTTAAACTGAGTTCCAGTTCTTGATTGAGCATAGGCACCTCCCCCAATTTTTATACCTGCATTCAGGCTTTTTCCAGCGTACACAGCAACGGATGCTCGTTCTCCCTTGCATACTTGTTCACCATCGCCACTTTGGTTTCCGCCACCTCGGCGGTGAACACGCCGCAGATGGCTTTGCCTTGATAGTGAACTGCAAGCATCAATTGCGTTGCACGTTCTACATCATAAGAAAAGAATTTTTGTAACACGTCAATAACAAACTCCATCGGAGTGTAATCATCATTGACTAATATCACTTTATACATAGATGGCGGTTTTAGCGCGTCGCGCACGCTATCTTCCACCAACTGGTCAAAATCCAGCCAATCGTTCGTCTTACCCATTGTCAGTCGTCATTATCGGTTACGGTTGTCGGCAGGAAAATCTGCCGCTGACCAGAGTCTATGCACACAATCAATCTACCTCAATTGATAGATAACTAACATCTATCAGTACCATCCGCGACATCTGTCACATTCCCGGCAATAGCGTTAACTGCTTCAAATTTTTGATTCATTTTTACCCGATCCCCCCTGCCTGATGCTTGACGCCTCGCCTGATTTCTCTAAATTGTAATGTCGAGAGTTGGTGAGGTTTTGAACAGCCCCCACTCCGTCACCGGTTCATTCCATCTTACTTATATAAGATTTACGAAGGATGTCGAAGCATGGAAACGGGTACTGTAAAGTGGTTCAACAATGCCAAAGGGTTTGGTTTCATCTGCCCTGAAGGCGGCGGCGAGGATATTTTCGCCCATTATTCCACCATTCAAATGGATGGTTACAGAACGCTTAAAGCCGGACAGTCTGTCCGGTTTGATGTCCACCAGGGGCCAAAAGGCAATCACGCCAGCGTCATCGTGCCCATCGAAGCAGAGGCCGTTGCATAGCTCCTCTGTCTCATTGTGTACATCCAGGAGGCAAAATGCCAGCCCGATCGGCTGGCATTTTTATTTAACGCCAGTGCCTGATAGCGACACTGTTGCATCTTATCAGGCCGACAAATGACGTCAGCGAGATTACTCCCTTGCCAGCGCATCCACCGGGTCCAGTCGCGCCGCGTTTCTCGCCGGTAGCCAGCCAAACAGTATCCCGGTAAACGTCGAACATAAAAACGCGCTCGCCAGCGCAGTCAGTGAAAAACCGATCTCCCAGCCGGGCAGGAAAAGCTGTAGCATAAATGCGATGAACATCGACAAGCTAATCCCCAGCGCTCCCCCAACCAGGCAAACCAGCACCGCTTCAATAAGAAACTGCTGTAGCACATCGCTGGCGCGCGCGCCTACCGCCATACGGATGCCGATTTCACGCGTTCGCTCGGTGACGGAAACCAGCATAATATTCATAACGCCAATGCCGCCGACAACCAGCGAAATGACGGCCACCAGCGTCAGAAATAACTGAAGAGTATAGGTGGTTTTTTCAGCCGTTTTCAGAACGCTGTCCATATTCCAGGTGAAGAAGTCTTTTTTACCGTGGCGTAAGGTGAGCAGGCGGGTAAGCTGCTGTTCAGCCTGATCGCTATCAACGCCATCTTTCACACGAACGGTGATCGAGTTAAGCCATGACTGACCCATTATGCGATCTGACATCGTGCTATAGGGCAACCAAACTTGCAACAGATTGCTATTGCCGTACATGGACGGTTTCTCTTCCGCCACGCCAATAACAATAACCGGCATATTACCCACCAGCACCACTTCCCCTACGACATTCGCTTTATTTGGAAATAGCTGGCGTCGCGTGTTGGCATCCAGCACCACCACCTGCGCGCGATCCTGTTGCTGTACAGAATTGAAGGTGTTCCCCTCCCTAAAGGACATGCCGTAAACGTTAAAATAATCGCCACTGACGCCATTAGCATTTACGGCAATATCAATATTGCCATAGCGAAGACGTAAGCTCTTTGAAACGCTGGGCGTCGCAGAGTTAACCCACGGCTGTTTCTGAATAGCGACCAGATCGTCATATTTCAGCGCCTGTCGATACTGCGGATTGTCGTCGCCAAAATCTTTGCCTGGATGAATATCAATCGTGTTAGTGCCCATAGCGCGGATATCCGCCAGTACCATCTGTTTTGCGGCGTCGCCGACCACCACAATCGACACCACCGACGCAATACCGATAATAATTCCCAGCATGGTCAGTAAAGTACGCATTTTGTTAGCGGCCATCGCTAACCACGCCATTGACAGCGCTTCGCGAAAGCTGCTGGCAAATTGCCGCCAGCCGGGAGCCGTATTAACTACGGCAGCGTCAACGCCCTGTTCGCGTTTCTTTTCCTCCGCGGGCGGATTATGGACAATCTTGCCATCGTGAATTTCAATAATCCGCTCCGCCTGGGCGGCAATCAGCGGATCGTGCGTCACAATGATCACCGTATGTCCGCGATCGCGCAGTTGGCGCAAAATCGCCATCACCTCTTCGCCGGAATGGCTATCCAGCGCGCCGGTCGGCTCATCTGCCAGAATCACCTGTCCACCGTTCATCAGAGCGCGGGCAATACTGACACGCTGCTGCTGTCCGCCAGAAAGCTGTGAAGGTGGGTAATCGACGCGATCGCTTAATCCCAGCCGCAAAAGCAACTCTCTGGCGCGCGCCTGGCGTTTTTTGCGTTCAATGCCGGCGTAGACGGCGGGGATTTCAACATTTTGCGCTGCCGTTAAATGCGACAACAGATGGTAGCGCTGAAAGATAAAGCCAAAATGCTCACGCCGCAGCTGCGCCAGCGCGTCCGGGTCCAGCGTCGAGACGTCCCGCCCCGCCACCCGATAAGTGCCGCTGGTCGGTTTATCCAGGCACCCGAGGATATTCATCAGCGTTGATTTTCCAGAACCGGAAACGCCGACGATCGCCACCATCTCCCCGGCGTGGATTTGCAGGGAGATATCTTTCAACACCGCCACCTGCTCTTCTCCGGAGGGGTAGCTACGACTCACATTGCACAGTTCAAGCAATGCCGTCATGGCGTCGCTCCTGGCCTGCTTTCGCCGATGATCACCTCATCGCCCGCTTCCAGACCTTTAACCACTTCCACGTCTGTATCGTTACGCTCGCCAATGACCACTTCGCGCTCACGTTTTTCGCCGTTACGCAACAGCGCCACTTTATAACGATTGCCGCCCACCGGTTCGCCAAGCGCGGCGAGAGGAATAATCAGCACATTTTTGACATCCATGAGTTGAATATAAACCTGTGCGGTCATATCAAGACGCAAGATTCTTTTGGGATTCGGCACTTCAAACCGGGCGTAATAAAAAATAGCGTCGTTGATCTTTTCCGGCGTCGGCAGAATATCTTTTAAAACGCCTTCATAGCGCGTTTGCGGATCGCCTGCAATGGTGAACCATGCTTTCTGCCCCGCCCGAAGATGGATCACGTCCGCTTCCGAGACCTGCGCTTTTACCAGCATGGTGCTCATATCCGCCAGCGTCAGAATATTGGGCGCCTGCTGAGCTGCAATCACCGTTTGTCCTTGCAGGGTAGTGATTTGCGTCACTTCCCCCGCCATGGGGGCGACAATACGGGTATATTCCAGGTTGGTTTTCGCGGTGTCCAACGAGGCCCGATTACGTTTGATCTGGGCATCTATGGTGCCAATACGCGCCTGTTTAACCGCCATCTCCGTCGCCGCGGTATCCAGATCCTGTTGCGATACCGCCTGAGTCTTAGCTAACTGCTGCTGGCGCGCCAGCGTAACCCGCGCCAGCTTTAACTCAGCGGCTGCCTGCTGACGCTCCGCGTTCAGCTCCATCAAGGTGGCCTCGACCTCTTTTATCTGGTTCTCCGCCTGATCTGGGTCAATCACGCCGAGTAGCTGATCTTTTTTAACGTTATCGCCAATGGAGACCAGCAGCGTTTTCAACTGGCCGCTCACCTGCGCGCCGACATCCACTTTACGCAACGCGTCCAGTTTTCCAGTCGCCAGTACACTCTGTTCAAGATCGCCTGGCCGCACGATTAATGTCTGATAAGTTGGCAGCGGGGCATTTATCATTCGCCAGCCAGCCATCCCCCCCACTAAAAGAATTAAAATAATGACCAGATAACGCTTTTTAAATTTCTTTCCCTTAGCACGCAT
Protein sequences of DBSCAN-SWA_8 >NZ_CP043433|2977477:2984790|2978016_2978214_+|WP_001117984.1|DBSCAN-SWA MPGLIGYWKQLPTKDEYIKKHNMSKISCYSCGHEKFSDVGLIQVWDNHRRILCAKCKTTLFREED >NZ_CP043433|2977477:2984790|2981728_2983675_-|WP_000125875.1|DBSCAN-SWA MTALLELCNVSRSYPSGEEQVAVLKDISLQIHAGEMVAIVGVSGSGKSTLMNILGCLDKPTSGTYRVAGRDVSTLDPDALAQLRREHFGFIFQRYHLLSHLTAAQNVEIPAVYAGIERKKRQARARELLLRLGLSDRVDYPPSQLSGGQQQRVSIARALMNGGQVILADEPTGALDSHSGEEVMAILRQLRDRGHTVIIVTHDPLIAAQAERIIEIHDGKIVHNPPAEEKKREQGVDAAVVNTAPGWRQFASSFREALSMAWLAMAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMVLADIRAMGTNTIDIHPGKDFGDDNPQYRQALKYDDLVAIQKQPWVNSATPSVSKSLRLRYGNIDIAVNANGVSGDYFNVYGMSFREGNTFNSVQQQDRAQVVVLDANTRRQLFPNKANVVGEVVLVGNMPVIVIGVAEEKPSMYGNSNLLQVWLPYSTMSDRIMGQSWLNSITVRVKDGVDSDQAEQQLTRLLTLRHGKKDFFTWNMDSVLKTAEKTTYTLQLFLTLVAVISLVVGGIGVMNIMLVSVTERTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGALGISLSMFIAFMLQLFLPGWEIGFSLTALASAFLCSTFTGILFGWLPARNAARLDPVDALARE >NZ_CP043433|2977477:2984790|2980733_2981054_-|WP_000520789.1|protease|DBSCAN-SWA MGKTNDWLDFDQLVEDSVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKFFSYDVERATQLMLAVHYQGKAICGVFTAEVAETKVAMVNKYARENEHPLLCTLEKA >NZ_CP043433|2977477:2984790|2983671_2984790_-|WP_001201751.1|DBSCAN-SWA MRAKGKKFKKRYLVIILILLVGGMAGWRMINAPLPTYQTLIVRPGDLEQSVLATGKLDALRKVDVGAQVSGQLKTLLVSIGDNVKKDQLLGVIDPDQAENQIKEVEATLMELNAERQQAAAELKLARVTLARQQQLAKTQAVSQQDLDTAATEMAVKQARIGTIDAQIKRNRASLDTAKTNLEYTRIVAPMAGEVTQITTLQGQTVIAAQQAPNILTLADMSTMLVKAQVSEADVIHLRAGQKAWFTIAGDPQTRYEGVLKDILPTPEKINDAIFYYARFEVPNPKRILRLDMTAQVYIQLMDVKNVLIIPLAALGEPVGGNRYKVALLRNGEKREREVVIGERNDTDVEVVKGLEAGDEVIIGESRPGATP >NZ_CP043433|2977477:2984790|2977477_2977855_+|WP_001539594.1|integrase|DBSCAN-SWA MHKYLSVVKKHRVPLSDAAVDLLKDLPRLKDNNHVFPAPRAETLSDMSLLAVLKRMGYIDLTQHGFRSTFREWAGEATDYQREVIEHALAHQLADKAEAAYQRGTLWPKRVALMDDWTGYSTANS >NZ_CP043433|2977477:2984790|2978426_2980703_-|WP_000934064.1|protease|DBSCAN-SWA MLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACSVDLVALRQELEAFIEQTTPVLPASEEERDTQPTLSFQRVLQRAVFHVQSSGRSEVTGANVLVAIFSEQESQAAYLLRKHEVSRLDIVNFISHGTRKDEPSQSSDLGNQPTGDEQAGGEERMENFTTNLNQLARVGGIDPLIGREKELERAIQVLCRRRKNNPLLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKALLKQLEQDTNSILFIDEIHTIIGAGAASGGQVDAANLIKPLLSSGKIRVIGSTTYQEFSNIFEKDRALARRFQKIDITEPSVEETVQIINGLKPKYEAHHDVRYTAKAVRAAVELAVKYINDRHLPDKAIDVIDEAGARARLMPVSKRKKTVNVADIESVVARIARIPEKSVSQSDRDTLKNLGDRLKMLVFGQDNAIEALTEAIKMSRAGLGHEHKPVGSFLFAGPTGVGKTEVTVQLSKALGIELLRFDMSEYMERHTVSRLIGAPPGYVGFDQGGLLTDAVIKHPHAVLLLDEIEKAHPDVFNLLLQVMDNGTLTDNNGRKADFRNVVLVMTTNAGVRETERKSIGLIHQDNSTDAMGEIKKVFTPEFRNRLDNIIWFDHLSGEVIHQVVDKFIVELQAQLDQKGVSLEVSQEARDWLAEKGYDRAMGARPMARVIQDNLKKPLANELLFGSLVDGGQVTVALDKEKNALTYGFQSAQKHKPEAAH >NZ_CP043433|2977477:2984790|2981377_2981599_+|WP_000447499.1|DBSCAN-SWA METGTVKWFNNAKGFGFICPEGGGEDIFAHYSTIQMDGYRTLKAGQSVRFDVHQGPKGNHASVIVPIEAEAVA |
7 | Ralstonia_phage(16.67%) | integrase,protease | attL 2972274:2972288|attR 2983526:2983540 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|