Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP014049 | Vibrio vulnificus strain ATL 6-1306 chromosome 2, complete sequence | 0 crisprs | DinG,csa3,DEDDh,cas3,csx1 | 0 | 0 | 3 | 0 |
CP014048 | Vibrio vulnificus strain ATL 6-1306 chromosome 1, complete sequence | 0 crisprs | DEDDh,csa3,WYL,cas3 | 0 | 0 | 0 | 0 |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
201509 : 208565
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP014049|201509:208565|DBSCAN-SWA AATGAAACTGCCTATTTATCTAGACTATTCGGCAACTTGCCCGGTAGATCCACGTGTCGCTGAGAAAATGGTTCAGTACATGACGATGGATGGTACCTTTGGTAACCCAGCGTCACGTTCTCATCGCTACGGTTGGCAGGCAGAAGAAGCGGTTGATACCGCGCGTGAGCAAATCGCAGAATTGTTGAATGCCGACCCTCGTGAAATTGTGTTTACATCTGGTGCAACAGAATCAGATAACCTAGCGATCAAAGGCGCAGCTCACTTCTACTCAAAACAAGGTAAGCACGTCATTACTAGCAAAACTGAGCATAAAGCCGTACTGGATACTTGCCGTCAGCTTGAGCGTGAAGGTTTTGAAGTGACTTATCTCGAGCCTGAATCAAATGGTCTTATCAGCCTAAGCAAGCTTGAAGCCGCAATGCGTGATGACACCGTTCTTGTGTCTATTATGCACGTGAACAATGAAATTGGTGTGATTCAAGATATTGAAGCCATTGGCGAACTTTGCCGTTCTCGTAAAATCATCTTCCACGTCGATGCAGCGCAGTCTGCGGGCAAAGTGGCGATTGATGTACAGAAGCTAAAAGTAGACTTGATCTCGCTTTCTGCACATAAGATTTACGGTCCGAAAGGTATCGGCGCGCTTTATGTACGTCGTAAGCCACGTATTCGTCTTGAGGCTCAAATGCATGGTGGTGGTCATGAGCGTGGTTTCCGTTCAGGTACGCTACCGACTCACCAAATCGTGGGCATGGGTGAAGCTTTCCGCATTGCCAAGCTTGATATGGAAAAAGATTACCAACATGCGTTAGCGCTTCGTAATCGTCTGTTAGACGGCGTGAAAGACATGGAAGCGGTGACCATCAACGGCGATCTTGACCAGCGTGTACCGCATAACTTGAACATCAGCTTTGCTTTTGTTGAAGGAGAGTCGCTACTGATGTCTCTAAAAGATCTTGCAGTATCGTCAGGCAGTGCTTGTACTTCAGCGAGTCTAGAGCCTTCTTATGTACTGCGTGCTCTCGGTCTAAATGATGAGTTAGCACATAGCTCGATTCGTTTCTCTTTTGGTCGTTTTACCACGGAAGAAGAGATCGACTACGCAATTGAACAAATCCGTGTAGCGGTTGAAAAGCTGCGCGACATGTCTCCTCTATGGGATATGTACAAGGAAGGAATTGATCTCAACACGGTTGAGTGGGCTCACCACTAAGATCAATTCATAGACAAGTTATAGAGGATTCGAGGTAAATATCATGGCATACAGCGAAAAAGTAATTGATCACTATGAGAACCCACGTAACGTTGGTTCGTTTGACAAAGAAGATCCATCGGTAGGTAGCGGCATGGTTGGTGCTCCTGCATGTGGTGACGTAATGAAACTGCAAATCAAAGTAACGCCAGAAGGCATCATTGAAGACGCAAAGTTCAAGACCTACGGTTGTGGTAGTGCGATTGCATCAAGCTCATTGGTTACTGAATGGGTAAAAGGCAAGAGCATCGAAGAAGCGGCTTCGATCAAAAACTCGGAAATTGCAGAAGAGCTCGAGTTGCCTCCAGTGAAAGTGCACTGTTCAATTCTTGCAGAAGATGCAATTAAAGCCGCAGTGGCTGACTACAAGAAAAAACACCAACTTTAAGGCGTTTTTTGAGGTTATAATGGGAGCCCGATGCTCCCATTCCATTGGTAAACTATTTAACTAAGGTGCAGTATGGCCGTTACATTAACAGATTCCGCAGCAAATCGAGTAAGAGCATTCCTAGATAACCGAGGCAAAGGCATTGGTTTACGTTTAGGTGTGAAGACGACTGGTTGTTCGGGTATGGCCTATGTCCTTGAGTTTGTTGACGAACTGAATGAAGAAGACGAAGTGTTTGAACATTCAGGCGTTAAGGTGATTATCGATAAAAAGAGCTTGGTGTAC
CTCGACGGCACGCAACTTGACTACGTAAAAGAAGGGTTGAATGAAGGCTTTGAATTCAACAACCCAAATGCGAAAAGTGAATGTGGTTGCGGTGAAAGCTTCAACGTCTAATGACTGAAATTCTGAGCCGTTACCGTCGGCTCAGAAGATTTATTCAAGGACCATAGATCAATGAATCACTTTGAATTATTTGGGCTACCACCTCAGTTTTCGCTGGATGGTAGCCTTCTTTCTTCTCAGTTCCGAGAATTGCAAAAATGCTTCCATCCGGACAATTTTGCGACCGCTTCTGAACGCGATCGTTTGTTAGCGGTACAAAAAGCGGCGCAAATTAATGACGCTTACCAGGTGCTTAAGAATCCCATTTCTCGTGCGGAATACCTTTTATCGCAAAACGGTTTAGAAATCCGAGGTGAGCAGCAAACCATGCAAGATCCCATGTTTCTGATGGAACAGATGGAATTGCGTGAAGAGTTGGAAGAGATCCCTCATGGCTCAGATGCTGAGAGTGCTTTGGCTGCGTTTGATGCCAGAGTGAGTAAAATGTACAAACAACATCTCGCAACGATTGAGCAAGAGCTGAATGACGCCCAGTGGCTACAAGCCGCCGATCGCGTACGTAAGCTGAAATTTATTGCCAAACTAAAGAATGAAATAGAGCTAGTGGAAGAGAAACTCTTCGGCTAGTTTGAAAAACAAGGATCCATCATGGCATTACTTCAAATTGCAGAACCGGGCCAAAGCTCGGCACCTCATGAGCACAAAAGAGCAGCCGGTATCGATCTTGGTACCACTAACTCTTTGGTCGCTTCAGTCAGAAGCGGCACCGCAGACACTTTGAAAGATGCTCAAGGTCGTAGCCTTCTTCCATCGATTGTTAATTATGCCAATGAAGAAGCGATTGTCGGTTACGAGGCGAAAGCGCTGTCTGAATCTCAGCCACAAGATACCATTATCTCGGTAAAACGCTTATTGGGTCGTTCGTTAACGGACATCCAGACTCGTTATCCTTCTTTGCCTTATCGTTTTAAAACCAGTGAGAATGGTTTGCCTGTTCTCCAGACTACTCAAGGGGACAAAAACCCAATTGAGGTGTCTGCCGATATCCTAAAGGTGCTAGCGAAACGCGCAGAAGAGAGCCTCGGTGGTGAGCTTTCTGGTGTGGTCATTACCGTTCCTGCCTACTTTGATGATGCGCAGCGAGCTGGCACTAAAGATGCGGCGAAACTTGCTGGCCTGCATGTGCTTCGCTTACTGAACGAACCGACAGCCGCCGCCATTGCCTATGGCTTAGACTCTGGCCAAGAAGGGGTCATTGCGGTCTACGACTTGGGTGGTGGTACGTTTGATATCTCAATTTTGCGCCTATCTAAAGGGGTGTTTGAGGTATTGGCCACCGGCGGTGATTCTGCATTAGGTGGTGATGATTTTGACCATCTGTTGGCCGATTTTCTTGCCGAGCAAGCGGGCTTAGAGACGCCTTTGAGCGCAGAAAAAAACCGCACCCTATTGAACATTGCGACGGCCACTAAGATTGCTTTCTCTGAGCAAGACTCTGTTGAAGTAGAGGTCTTTGGTTGGAAAGGCGTAGTCACGCGTGAGCAATTTGAAGAGCTGATTCGTCCGCTAGTGAAAAAAACGCTGATGTCCTGTCGTCGAGCGTTGAAAGACGCAGATGTGGAGGCTGACGAAGTACTTGAAGTGGTCATGGTCGGTGGATCTACTCGCACTTTATTGGTTCGCGAAATGGTGGGTGAGTTCTTTGGTCGCACACCATTAACTAACATCAACCCTGATGAAGTGGTTGCTATTGGTGCCGGTATTCAAGCGGATATTCTGGCGGGCAATAAGCCTGATTCTGAAATGCTATTGTTGGATGTGATCCCTCTTTCGTTGGGGATTGAAACCATGGGTGGTTTGGTTGAGAAAATCATTCCTCGTAATACCACCATTCCGGTCGCACGCGCTCAAGAGTTCACTACCTTTAAAGATGGCCAAAC
CGCTATGAGTGTTCATATTGTTCAAGGTGAACGTGAGATGGTGGATGATTGTCGTTCGCTCGCTCGCTTTTCACTCAAAGGCATTCCGCCAATGGCCGCGGGTGCTGCTCATATTCGTGTTACCTACCAAGTGGATGCCGATGGTCTGCTGTCAGTGACCGCGATGGAGAAAAGCACGGGTGTTCAATCCGAAATCCAAGTGAAGCCATCTTATGGTTTGAGTGATGACGAAGTGGCGAACATGCTTCGCGATTCCATGACGTATGCCAAAGAAGACATGCAAGCTCGCGCTTTAGCTGAACAACGTGTAGAAGCGGATCGAGTGATTGAAGGTCTTATTGCCGCGATGCAAGCTGACGGGGATGAGCTACTCAGCGAAGCTGAAAAAGCCACCTTACTGCAAGCGATTGAATCCTTGATTGAACTGCGCAATGGCAACGAAGCGAATGCGATTGAGCAAGGTATTAAAGACACTGACAAAGCGAGCCAAGATTTTGCCTCTCGTCGAATGGATAAATCCATTCGTGCAGCACTGGCTGGCCAGTCAATTGACACTATTTAAGAGTAATAACTATGCCTAAGATTATTGTATTGCCTCATGAAGATTTGTGCCCAGAAGGTGCAGTGCTAGAAGCAAACAGTGGTGACACAGTATTAGATGTGGCACTGAAAAATGGGATTGCTATTGAGCACGCGTGTGAGAAGTCTTGCGCTTGTACTACGTGTCACGTCATCATTCGTGAGGGCTTTGATTCACTTGAAGAAAGCGATGAGTTAGAAGATGACATGCTCGACAAAGCATGGGGGCTAGAGCCTGAATCGCGTTTAGGCTGTCAAGCGAAAGTGGCGGATGAAGACCTAGTGGTTGAAATTCCAAAATACACGCTAAACCACGCATCGGAAGACCATTAATCGATGTGTCGTGCAGTTTGCTGGGTCTTTTGACTTAACGAACTGGTACTGAAGTGACGAATGTTTCAAAGGAAGATGAGTATGAAATGGACAGATTCGCGTGATATCGCCATTGAGCTTTGCGAACGCTTTCCTGAGATGGATCCAAAAACGGTGCGTTTTACCGATCTGCATCAGTGGATTTTAGAGATTGAAGATTTTGATGACGAGCCCAATCATTCCAATGAAAAGATCCTCGAAGCAGTGATTCTTTGCTGGTTGGATGAATGGGAATAAATCCAGCCGAGTTAACAAAATTCATCAAAAAAAGCGGACCTTTTGGTCCGTTTTTTTATCAATTTGACATTTCTACCCTACATAATGCTAACATCCGTGAGTTATTCAAATAGAAAGGCGTAGGTAGCGCCGTAATAAGGCAAGGAGAAACCATGTCTACACAGATGTCTGTATTTTTAAGTCACGAAGTCGCCGCTCCACAGTGGGGCGAGAAAGCACTGGTATCATTCAATGAGCAGGGCGCTCTTATCCATACAGGCGAAAGTACGGATTTAACTAAAATCCAACGCGCAGCGCGCAAGTTTGATGTTCAAGGCATTAAGAGTGTATTTCTTACAGGTGAAGGTTGGGATGTGGAAGCGATTTGGGCTTTCCACCAAGGTTATCGCAATCCGAAAAAATACAGCAAACTTGAGTGGGTTGCGCTAGAAGAGAAAGCACAAGCAGAACTGGAAGCGCGCATTAAAGCCACTGAGTTCACTCGCGATATCATTAACAAACCAGCGGAAGAAGTCGCCCCTCGCCAACTTGCGACGATGGCGGCGGAATTTATTCGATCTGTGGCTCCAGAAGGCACAGTGACAGCGCGTATCGTCAAAGATAAAGATCTGCTTGCTGAAGGCTGGGAAGGCATTTACGCCGTAGGCCGTGGTTCTGATCGCACTTCAGCGATGCTTCAACTCGACTTTAACCCTACTGGTGATGAGAATGCGCCTGTTTGGGCCTGTCTAGTCGGTAAAGGTATTACCTTCGATTCTGGCGGTTACAGCATCAAAGCGTCCAATTTTATGGACTCAATGAAAGCGGACATG
GGCGGTTCAGGTACGATTACTGGTGGTCTAGGCCTCGCAATCATGCGCGGACTGAACAAGCGCGTAAAACTGATCCTATGCTGTGCGGAAAATATGATTTCTGGCCGAGCGTTGAAATTGGGTGATGTGATTACCTACAAAAACGGTAAGACAGTTGAAATCATGAACACCGACGCGGAAGGTCGCCTAGTACTTGCAGATGGCCTGATTTACGCCTCGGAACAAAATCCAGAGTTGATCATTGATTGCGCGACCTTAACGGGCGCGGCGAAAAATGCGCTAGGTAACGATTACCACGCGCTACTGACGTTTGATCAAGCACTTGCGCAAGAAGCGCTAAAGTCAGCAGCAGAAGAAAAAGAAGGTTTGTGGCCTCTGCCACTAGCAGAATTCCATCGTGAAATGCTGCCATCAAACTTTGCTGATATGTCTAACATCGGTGGCGGTGACTACTCTCCTGGAGCCAGCACGGCAGCGGCATTTCTTTCTTACTTTGTGCAAGATTACCAAACAGGTTGGTTGCACTTCGATTGTTCGGGAACTTACCGTAAGTCAGCCAGCGACAAATGGTCTGCGGGTGCGACAGGTATGGGGGTTTGTACTCTGGCCAACCTGTTAGTGGCACAAGCGAACAAGTAATACTTCACAGCAGCCCAATTGGGCTGCTGGACTATAACAACGACATGAAATAAAGGGACACCTTATGGCTCTAGAAAGAACATTTTCGATCATTAAGCCTGACGCTGTAGAACGCAACTTAATCGGTGAAATTTACCACCGCATTGAGAAGGCGGGATTACGCATCATCGCGGCAAAAATGGTGCATCTTAATGATGAACAAGCAAGTGGCTTTTACGCTGAGCACGAAGGAAAAGAATTTTTTCCAGCATTGAAAGAGTTCATGACTTCAGGACCCATCATGGTTCAGGTTCTAGAAGGCGAAAATGCCATTGCACGTTACCGTGAGTTGATGGGTAAAACCAACCCAGAAGAAGCGGCTTGTGGCACCATTCGCGCCGACTATGCATTGAGTATGCGTCACAACTCCGTGCACGGTAGTGACAGCCCTGCATCAGCAGCGCGTGAAATTGCGTTTTTCTTCCCAGAGTCGGAAATCTGCCCTCGTTAA
Protein sequences of DBSCAN-SWA_1 >CP014049|201509:208565|201509_202724_+|AMG11328.1|DBSCAN-SWA MKLPIYLDYSATCPVDPRVAEKMVQYMTMDGTFGNPASRSHRYGWQAEEAVDTAREQIAELLNADPREIVFTSGATESDNLAIKGAAHFYSKQGKHVITSKTEHKAVLDTCRQLEREGFEVTYLEPESNGLISLSKLEAAMRDDTVLVSIMHVNNEIGVIQDIEAIGELCRSRKIIFHVDAAQSAGKVAIDVQKLKVDLISLSAHKIYGPKGIGALYVRRKPRIRLEAQMHGGGHERGFRSGTLPTHQIVGMGEAFRIAKLDMEKDYQHALALRNRLLDGVKDMEAVTINGDLDQRVPHNLNISFAFVEGESLLMSLKDLAVSSGSACTSASLEPSYVLRALGLNDELAHSSIRFSFGRFTTEEEIDYAIEQIRVAVEKLRDMSPLWDMYKEGIDLNTVEWAHH >CP014049|201509:208565|206429_206624_+|AMG11334.1|DBSCAN-SWA MKWTDSRDIAIELCERFPEMDPKTVRFTDLHQWILEIEDFDDEPNHSNEKILEAVILCWLDEWE >CP014049|201509:208565|203223_203547_+|AMG11330.1|DBSCAN-SWA MAVTLTDSAANRVRAFLDNRGKGIGLRLGVKTTGCSGMAYVLEFVDELNEEDEVFEHSGVKVIIDKKSLVYLDGTQLDYVKEGLNEGFEFNNPNAKSECGCGESFNV >CP014049|201509:208565|203607_204123_+|AMG11331.1|DBSCAN-SWA MNHFELFGLPPQFSLDGSLLSSQFRELQKCFHPDNFATASERDRLLAVQKAAQINDAYQVLKNPISRAEYLLSQNGLEIRGEQQTMQDPMFLMEQMELREELEEIPHGSDAESALAAFDARVSKMYKQHLATIEQELNDAQWLQAADRVRKLKFIAKLKNEIELVEEKLFG >CP014049|201509:208565|206009_206348_+|AMG11333.1|DBSCAN-SWA MPKIIVLPHEDLCPEGAVLEANSGDTVLDVALKNGIAIEHACEKSCACTTCHVIIREGFDSLEESDELEDDMLDKAWGLEPESRLGCQAKVADEDLVVEIPKYTLNHASEDH >CP014049|201509:208565|204144_205998_+|AMG11332.1|DBSCAN-SWA MALLQIAEPGQSSAPHEHKRAAGIDLGTTNSLVASVRSGTADTLKDAQGRSLLPSIVNYANEEAIVGYEAKALSESQPQDTIISVKRLLGRSLTDIQTRYPSLPYRFKTSENGLPVLQTTQGDKNPIEVSADILKVLAKRAEESLGGELSGVVITVPAYFDDAQRAGTKDAAKLAGLHVLRLLNEPTAAAIAYGLDSGQEGVIAVYDLGGGTFDISILRLSKGVFEVLATGGDSALGGDDFDHLLADFLAEQAGLETPLSAEKNRTLLNIATATKIAFSEQDSVEVEVFGWKGVVTREQFEELIRPLVKKTLMSCRRALKDADVEADEVLEVVMVGGSTRTLLVREMVGEFFGRTPLTNINPDEVVAIGAGIQADILAGNKPDSEMLLLDVIPLSLGIETMGGLVEKIIPRNTTIPVARAQEFTTFKDGQTAMSVHIVQGEREMVDDCRSLARFSLKGIPPMAAGAAHIRVTYQVDADGLLSVTAMEKSTGVQSEIQVKPSYGLSDDEVANMLRDSMTYAKEDMQARALAEQRVEADRVIEGLIAAMQADGDELLSEAEKATLLQAIESLIELRNGNEANAIEQGIKDTDKASQDFASRRMDKSIRAALAGQSIDTI >CP014049|201509:208565|208139_208565_+|AMG11336.1|DBSCAN-SWA 
MALERTFSIIKPDAVERNLIGEIYHRIEKAGLRIIAAKMVHLNDEQASGFYAEHEGKEFFPALKEFMTSGPIMVQVLEGENAIARYRELMGKTNPEEAACGTIRADYALSMRHNSVHGSDSPASAAREIAFFFPESEICPR >CP014049|201509:208565|206776_208075_+|AMG11335.1|DBSCAN-SWA MSTQMSVFLSHEVAAPQWGEKALVSFNEQGALIHTGESTDLTKIQRAARKFDVQGIKSVFLTGEGWDVEAIWAFHQGYRNPKKYSKLEWVALEEKAQAELEARIKATEFTRDIINKPAEEVAPRQLATMAAEFIRSVAPEGTVTARIVKDKDLLAEGWEGIYAVGRGSDRTSAMLQLDFNPTGDENAPVWACLVGKGITFDSGGYSIKASNFMDSMKADMGGSGTITGGLGLAIMRGLNKRVKLILCCAENMISGRALKLGDVITYKNGKTVEIMNTDAEGRLVLADGLIYASEQNPELIIDCATLTGAAKNALGNDYHALLTFDQALAQEALKSAAEEKEGLWPLPLAEFHREMLPSNFADMSNIGGGDYSPGASTAAAFLSYFVQDYQTGWLHFDCSGTYRKSASDKWSAGATGMGVCTLANLLVAQANK >CP014049|201509:208565|202767_203151_+|AMG11329.1|DBSCAN-SWA MAYSEKVIDHYENPRNVGSFDKEDPSVGSGMVGAPACGDVMKLQIKVTPEGIIEDAKFKTYGCGSAIASSSLVTEWVKGKSIEEAASIKNSEIAEELELPPVKVHCSILAEDAIKAAVADYKKKHQL |
9 | Faustovirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a specific phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
285058 : 291766
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP014049|285058:291766|DBSCAN-SWA CATGACAACGAATCAACAAAGCGTCGCAGCTTCTCAACCAAAAACTATCGTTGTGAAGCTTGGAACCAGTGTATTGACCGGTGGTACATTAGCGTTAGATCGCGCTCATATGGTTGAGCTTGCGCGCCAATGTGCAGAGTTAAAAAAACAAGGCCACTCTGTGGTGGTCGTCTCGTCAGGCGCGATTGCCGCTGGACGTGAGCACTTAGGTTACCCCGCGCTGCCCAATGCGATGGCAAGTAAACAGTTGCTTGCTGCGGTTGGGCAAAGTCAGTTGATTCAAACTTGGGAATCGTTATTTGCACTGTATGGCATCAAAATTGGCCAAATGTTGCTCACTCGAGCGGATCTCGAAGATCGCGAACGTTTTCTCAATGCTCGAGATACGATTAACGCGTTAGTGGATAACGGCATTATCCCTGTCGTTAATGAAAATGACGCCGTCGCCACGAGCGAAATCAAAGTGGGTGACAACGACAACCTCTCTGCGTTGGTGGGCATTTTATGTGGTGCAGATAAACTACTGTTGCTGACAGACCAAAAAGGCTTGTATACCGCCGATCCGCGTAAAGATCCCAATGCTGAGCTGATCAAAGAAGTTAAAGTCATTGACGATACTTTGCGTAAAATTGCTGGTGGCAGCGGTACAACACTAGGTACTGGTGGAATGGCAACTAAGCTCCAAGCTGCGGACATTGCGCGTCGTGCTGGCATTGAAGTCATCATTGCGGCAGGCCGTGGTCAAAATGTCATTTTTGATGCGTTGAGCCCAGCCCCTCAAGGAACTCGTTTCTTGCCTTGTGAAGAAGCGTTGGAAAATCGTAAGCGTTGGATTCTCGCTGGCCCTGCGGCCTCTGGTGATATTGTGATTGACCAAGGTGCGGTGAAAGCCGTGGTCGAAAAAGGGAGCAGTTTGTTGGCGAAAGGGGTGACCAAAGTACTTGGCGAGTTTTCTCGTGGCGAAGTGGTTCGTGTAACGGATGCACAAGGCCATTTGGTTGCGCGAGGTATTGCGAGTTATTCCAACCAAGATATGGCGAAAATTGCCGGTAAGCACAGCAAAGACATTATTTCGATTCTGGGCTATGACTATGGTTCAGAAGTGATTCACCGTGACGACATGGTTGTCATTCAAGAATAGTCGCTCACCAGAATCAAAGAGGACAACAAAGTGGATTTAATCACATTAGGTAAAGCGGCAAAAGACGCGGCTTTTCAGCTTGCCACCGCTTCTACGGCGCAAAAGAACAAAGCATTGGCCATCATTGCAGATGAACTTGAAGCGAATGTGGCGGATATTTTGGCGGCAAACAGCAAAGATATCGAACTCGGTCGACAAGCGGGTTTGAGCGAAGCGATGCTGGATCGTTTACTTTTGAATGAATCTCGCTTGAATGGCATCGCCAACGATGTACGTAACGTGATCAGCCTTACGGATCCGGTCGGTAGCGAAATCGACAGTAAAGTGTTGGAAAATGGCATGCAGCTGTCACGCCGTCGTGTGCCGCTTGGCGTGGTTGGTGTGATTTACGAAGCGCGCCCAAATGTCACTATCGACATCGCGGCATTGTGCCTAAAAACCGGCAATGCGAGCATCCTACGTGGCGGAAAAGAGACCTTCTTCTCCAACATGGAGTTGGTGAAGGTGATTCAATCGGCGTTAGCGAAAGCTGGCTTGCCAGCGGCATCGGTGCAGTACATTGAAAAACCGGATCGTGAGCTGGTCACTCAATTGCTGAAACTGGACGATTACGTTGACATGATCATCCCTCGTGGTGGCGCGGGGCTGCACAAAATGTGTAAAGAGAACAGCACCATCCCTGTGATCATTGGCGGTTTCGGTATCAGCCATATTTTTGTTGATGAAACGGCTGATCTCGCGAAAAGCGTGGATGTGGTGGAGAATGCAAAAGTTCAGCGTCCTTCCG
CTTGTAATGCGCTGGATACCTTACTTGTTCATGAGCGTATCGCAGAGCAGTTTCTTCCGATGCTGGTGGCAAAACTCAATGGCAAAGTAACTTTTGTTGTTGAGCCAAAAGCCAAAGCGTACATGACTAAAGCTGAACAGGTGCGTGATGCGAGTGATGGTGATTTTGATACCGAATGGTTGAGTTACACGCTGGGGGTCAAAGTGGTTGCGGATGTGCAAGAAGCGATCGAGCACATGCGTGAACATAACGCGAGTCACTCTGATGCCATCATGACGAACCATCTGCAGAATGCTGAGTTGTTCATCAACTCGGCGGGCTCTGCGGCGGTGTATGTCAACGCATCGACACGCTTTACCGATGGCGCACAATTTGGTTTGGGCGCAGAAGTGGCGGTGTCTACTCAGAAATTGCACGCACGTGGTCCAATGGGATTAGAAGAGTTAACCAGTTACAAATGGGTTGGCAAGGCGAATTACCTTTCACGCGCGTAAGCGTAATCGAGTTTGAGTTAACATATCCGGTCTAAACCGTTTTGAAAAAGGGGCATCACGCCCCTTTTTGTTTTATTCGGCCTAGCAATTCCGCTAAACTAGCCGCAAGTTCTGCACCTGCTTTGCTACCGTGAATCGTTGTATTAGCACGGATATTCGCAATAGCGTGCAGATGGATCTATGCATCATTGGGAGGTGATATGCATTGTCCTTTCTGTTCAGAAAATGACACCAAAGTAATTGATTCACGTTTGGTGGCCGATGGTCATCAAGTACGTCGCCGTCGTCAGTGCCTCGCGTGTAACGAGCGTTTTACCACGTTTGAAACCGCGGAGCTGTTAATGCCAAAGGTGATCAAATCCAATGGCAACCGTGAGCCGTTCAATGAAGACAAGATGGTTGGTGGTATTCAGCGTGCTTTAGAGAAGCGCCCTGTCAGTGCCGATGCGATTGAACTGGCGATCAGCATGATCAAATCCAAATTACGTGCCACTGGCGAGCGTGAAGTACCGAGCAAAATGATCGGTAACTTGGTGATGGAACAGCTCAAACATTTAGACAAAGTGGCCTATATTCGCTTTGCTTCCGTATACCGTAGTTTTGAAGATATTCGTGAATTTGGTGAAGAAATCGCCAGATTGGAAGATTAACGCAAGGTAACGGTTCACCATGTCGGAATTTACTGCAATCGATAGACAAATGATGCTGCGCGCCATTGCTTTGGCGAAACGCGGCCTCTATACCACCGCACCTAATCCTAATGTTGGCTGTGTATTACTGCGCGACGGCGAAATTGTCGGAGAAGGTTTTCACTTTCGTGCTGGCGAGCCTCATGCGGAAGTGCACGCGATGCGCATGGCTGGTGACAAAGCCAAAGGTGCGACCGCTTATGTCACGCTAGAGCCTTGTTCACACTATGGTCGAACGCCCCCGTGTGCGGAAGGCTTAATTAAAGCTGGGGTAAGCCGAGTGGTGTGTGCCATGGAAGACCCAAATCCTCAAGTCGCGGGGAGAGGTTTTGCCATGTTGCGAGAGGCAGGGATCGAGGTATCGGTGGGTCTACTTCAAACAGAAGCAGAAGCACTGAATCCTGCTTTTATTAAGCGAATGAAAACGGGCATGCCGTTTGTTCAACTCAAAATGGCCGCGTCCCTCGACGGTCAAACAGCGTTGGCCAATGGCAAGAGTCAGTGGATCACCTCACCACAAGCTCGCCGAGATGTGCAACGTTTTCGGGCTCAGTCTGGGGCGATTTTGTCCACCAGTAAAACGGTGATTGCTGACAATGCGTCTCTCAATGTTCGCTGGTCAGAGTTACCTTCGTCTGTGCAACGCGCTTTGCCGCAAGAGCAGCTGCGCCAGCCAACTCGAGTTGTGTTAGACAGACAAGCCGAGCTTTCGCCGGAGCTAAAACTTTACCAAACCGCGGGTGAGCGACTCATTGTTGGCCCCAAAGGCGATCTCCCTGCACCTTTGGATGAGCACGGACAGATCGATTTGCCGCAA
CTGTTCGCGCAACTGAGTCAAATGCAGTCCATTCACCATCTTTGGGTTGAAGCGGGAGCAACGCTGGCGGCGAAACTGATCAAACACCAGTTGGTGGATGAGCTGATTGTTTACCTTGCCCCAAAACTGATGGGCAGCGATGGACGTGGATTGATGGGTGCGCTGGGTTTGCAAGCCATGTCGCAAGTGATCGATTTAGACATTAAAGATGTGCGAATGGTTGGGCCGGATATCCGCATCATTGCACACATCAAAGCAAAAGAAAGTTGATATGTTCACAGGAATTATCGAAGCAGTTGGAACCTTGGCTGCGATTACCCCAAGAGGGGAAGACATTACCGTAACGGTTAACACAGGCAAGCTCGATATGTCGGACGTTAAGTTGGGCGACAGTATTGCCACGAATGGTGTTTGCCTGACGGTGATTGATTTTTCAGACCGTCATTACAGTGCCGATCTCTCGTTAGAGTCTTTAAAGAAAACGGGCTTTGCCAATTATCAAGTGGGTGACAAGGTCAATCTCGAGAAAGCGATGCTGCCTACCACCCGTTTTGGCGGGCATATTGTTTCTGGCCATGTGGATGGCGTTGGTGAAATTGTTGAGCGTAATCAGGTTGGTCGTGCTATCGAGTTTTGGGTCGCAATGCCTGAGAGCTTGAGTAAATACGTTGCGGAAAAAGGCTCAATCACCGTTGATGGGATCAGTTTGACCGTGAATGATCTACGCAAGAATGCGTTTAAACTCACCATCGTTCCTCATACGGGGGACGAAACCACGATTAATGATTTCCAAGTGGGGCGCAAAGTTAACTTGGAAGTGGATGTTCTTGCTCGTTACATGGAACGTTTGCTACTTGGTCAACAAGCACAATCCAATGACGAGTCCCGCATCACCATGGAATTCTTGCAACAGAATGGTTTTGCTTAGTTCACAGAATGTTAGCCCACAGGTTGACACTCGGTAACGATGAATCAGACTAGTGTCTATCAGCGTTTTAGCGCTTAAATTGACAGGAAATATCGCTATGCCAATCAGTACACCCCAAGAGATCATTGAAGACATTCGTTTGGGCAAAATGGTCATCCTAATGGACGATGAAGACCGCGAAAACGAAGGTGATTTGATTATGGCGGCGGAGCACATTACGCCAGAAGCCATCAACTTTATGGCCAAATATGGTCGCGGTTTGATCTGCTTGACCATGACCAAAGAGCGTTGTCAGCGTCTTGGATTACCACCAATGGTGCAAGATAACAACGCGCAATACACCACCAACTTCACTGTTTCCATCGAAGCGGCGGAAGGCGTGACAACAGGCATTTCAGCAGCGGATCGTGCTCGTACCGTGCAAGCGGCAGTCGCAAAAGAAGCTAAAGCGGCAGACTTGGTGCAACCGGGGCATATTTTTCCATTAGCCGCGCAAGATGGTGGCGTATTGACTCGCGCGGGTCATACCGAGGCGGGCTGCGACCTTGCTCGTTTGGCGGGTTTTGAACCAGCTTCCGTTATCGTTGAAATTCTGAATGACGATGGCACTATGGCGCGTCGCCCTGATTTGGAAGTGTTTGCTGAGCAACATGGTCTCAAACTCGGCACCATTGCCGATTTGATTGAGTATCGAAACAATACCGAAACCACGATTGAGCGCGTTGCTGAGTGCAAGCTACCCACCGAGTTTGGTGAGTTTGATCTGGTGACTTACCGCGACACCATTGATAATCAAATTCACTTTGCTTTGCGTAAAGGCACTGTGGGTGATCAAACACCACTGGTTCGTGTTCATTTACAAGACACCTTTACCGATTTACTGCGCAGCGATCGCAATGCTGAGCGCAGTTGGACGCTTGATAAAGCGATGAAGCGTATTGGCCAAGAAGGGGGCGTTTTGGTGATTCTGGGGAATGAAGAGTCGCCTGAGCTGCTGATCCATCGCGTCAAAATGTTTGAAGCTCAAGACAAAGAAGAGGCACCCAAATTGGCGAAGAAGCAGGGCACTTCA
CGTCGTGTTGGTGTCGGCTCGCAAATTCTTGCAGACTTAGGTGTGCACGATATGCGTCTACTTTCTTCCAGCAACAAGAAATACCATGCCCTTGGCGGTTTTGGTCTGAACGTTGTTGAGTATGTGTGTGAATAACACCGCTTCTAATAAACGCAGCGAAAATAACGCTGCGTTTATTTTTCTCCCCTAATTCGACTCATCTCTTCGCTGTAAGGCAGTATTTCGTTGCGAAATACCATTAGATATTGCTCACAAAATTGTGCTAGAATCCGGCGATTCTCACTTGATGAACATAGTTAAAGGAAGGCTTATGAAAGTGATCGAGGGTGGCTTCCCTGCGCCAAACGCAAAAATTGCTATCGTTATTTCTCGTTTCAACAGTTTTATTAACGAAAGTCTATTGTCTGGTGCCATCGACACTTTGAAGCGTCACGGTCAAGTCAGTGAAGACAACATCACCGTTGTACGTTGTCCTGGTGCTGTTGAACTACCTCTAGTGGCTCAGCGCGTAGCAAAAACAGGTAAGTTCGACGCGATTGTATCTCTTGGCTCAGTGATCCGTGGTGGTACACCACACTTTGACTATGTTTGTAGTGAAATGAATAAAGGTCTCGCACAAGTGTCTCTTGAATTTAGCATTCCAGTGGCGTTTGGTGTTTTGACCGTTGATACTATCGATCAAGCTATTGAACGCGCAGGAACCAAGGCTGGTAACAAAGGTGCAGAAGCTGCACTGAGCGCACTTGAGATGATAAACGTTCTTTCTGAAATTGATTCCTAA
Protein sequences of DBSCAN-SWA_2 >CP014049|285058:291766|285058_286198_+|AMG11400.1|DBSCAN-SWA MTTNQQSVAASQPKTIVVKLGTSVLTGGTLALDRAHMVELARQCAELKKQGHSVVVVSSGAIAAGREHLGYPALPNAMASKQLLAAVGQSQLIQTWESLFALYGIKIGQMLLTRADLEDRERFLNARDTINALVDNGIIPVVNENDAVATSEIKVGDNDNLSALVGILCGADKLLLLTDQKGLYTADPRKDPNAELIKEVKVIDDTLRKIAGGSGTTLGTGGMATKLQAADIARRAGIEVIIAAGRGQNVIFDALSPAPQGTRFLPCEEALENRKRWILAGPAASGDIVIDQGAVKAVVEKGSSLLAKGVTKVLGEFSRGEVVRVTDAQGHLVARGIASYSNQDMAKIAGKHSKDIISILGYDYGSEVIHRDDMVVIQE >CP014049|285058:291766|287679_288129_+|AMG11402.1|DBSCAN-SWA MHCPFCSENDTKVIDSRLVADGHQVRRRRQCLACNERFTTFETAELLMPKVIKSNGNREPFNEDKMVGGIQRALEKRPVSADAIELAISMIKSKLRATGEREVPSKMIGNLVMEQLKHLDKVAYIRFASVYRSFEDIREFGEEIARLED >CP014049|285058:291766|288148_289255_+|AMG11403.1|DBSCAN-SWA MSEFTAIDRQMMLRAIALAKRGLYTTAPNPNVGCVLLRDGEIVGEGFHFRAGEPHAEVHAMRMAGDKAKGATAYVTLEPCSHYGRTPPCAEGLIKAGVSRVVCAMEDPNPQVAGRGFAMLREAGIEVSVGLLQTEAEALNPAFIKRMKTGMPFVQLKMAASLDGQTALANGKSQWITSPQARRDVQRFRAQSGAILSTSKTVIADNASLNVRWSELPSSVQRALPQEQLRQPTRVVLDRQAELSPELKLYQTAGERLIVGPKGDLPAPLDEHGQIDLPQLFAQLSQMQSIHHLWVEAGATLAAKLIKHQLVDELIVYLAPKLMGSDGRGLMGALGLQAMSQVIDLDIKDVRMVGPDIRIIAHIKAKES >CP014049|285058:291766|290010_291120_+|AMG11405.1|DBSCAN-SWA MPISTPQEIIEDIRLGKMVILMDDEDRENEGDLIMAAEHITPEAINFMAKYGRGLICLTMTKERCQRLGLPPMVQDNNAQYTTNFTVSIEAAEGVTTGISAADRARTVQAAVAKEAKAADLVQPGHIFPLAAQDGGVLTRAGHTEAGCDLARLAGFEPASVIVEILNDDGTMARRPDLEVFAEQHGLKLGTIADLIEYRNNTETTIERVAECKLPTEFGEFDLVTYRDTIDNQIHFALRKGTVGDQTPLVRVHLQDTFTDLLRSDRNAERSWTLDKAMKRIGQEGGVLVILGNEESPELLIHRVKMFEAQDKEEAPKLAKKQGTSRRVGVGSQILADLGVHDMRLLSSSNKKYHALGGFGLNVVEYVCE >CP014049|285058:291766|291295_291766_+|AMG11406.1|DBSCAN-SWA MKVIEGGFPAPNAKIAIVISRFNSFINESLLSGAIDTLKRHGQVSEDNITVVRCPGAVELPLVAQRVAKTGKFDAIVSLGSVIRGGTPHFDYVCSEMNKGLAQVSLEFSIPVAFGVLTVDTIDQAIERAGTKAGNKGAEAALSALEMINVLSEIDS >CP014049|285058:291766|289256_289913_+|AMG11404.1|DBSCAN-SWA 
MFTGIIEAVGTLAAITPRGEDITVTVNTGKLDMSDVKLGDSIATNGVCLTVIDFSDRHYSADLSLESLKKTGFANYQVGDKVNLEKAMLPTTRFGGHIVSGHVDGVGEIVERNQVGRAIEFWVAMPESLSKYVAEKGSITVDGISLTVNDLRKNAFKLTIVPHTGDETTINDFQVGRKVNLEVDVLARYMERLLLGQQAQSNDESRITMEFLQQNGFA >CP014049|285058:291766|286210_287479_+|AMG11401.2|DBSCAN-SWA MKEDNKVDLITLGKAAKDAAFQLATASTAQKNKALAIIADELEANVADILAANSKDIELGRQAGLSEAMLDRLLLNESRLNGIANDVRNVISLTDPVGSEIDSKVLENGMQLSRRRVPLGVVGVIYEARPNVTIDIAALCLKTGNASILRGGKETFFSNMELVKVIQSALAKAGLPAASVQYIEKPDRELVTQLLKLDDYVDMIIPRGGAGLHKMCKENSTIPVIIGGFGISHIFVDETADLAKSVDVVENAKVQRPSACNALDTLLVHERIAEQFLPMLVAKLNGKVTFVVEPKAKAYMTKAEQVRDASDGDFDTEWLSYTLGVKVVADVQEAIEHMREHNASHSDAIMTNHLQNAELFINSAGSAAVYVNASTRFTDGAQFGLGAEVAVSTQKLHARGPMGLEELTSYKWVGKANYLSRA |
7 | Staphylococcus_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a specific phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_3 |
2283483 : 2298726
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP014049|2283483:2298726|DBSCAN-SWA ATTAAAGACGTTCAGCGATCCACGCGTCGACAGATGCCAGAGCTGAAGGTAGAGCCGAGCTATCGGTACCACCAGCCTGTGCCATATCTGGGCGGCCACCACCTTTACCACCCACTTGCTGCGCAACCATGTTAACGAGCTCGCCTGCTTTCACTTTTGAAGTCAAATCTTGGGTCACACCCGCGATAAGGCCCACTTTGTCATCAGCAACGTTACCCAACATAATGATGCCGCTGCCCATTTGGTTTTTCAGCTCATCCACCATGCCGCGCAGCGCTTTGTTATCAGCACCGTCAAGCTTCGCTACTAAGACTTTCACACCCGCAATTTCTTTTACTTGATCAATCAAGTTTGCGCTGGCTTGTGAAGCCAACTTGTCTTTAAGCTGTTGAATTTCTTTCTCTAGCGACTTCGCTTTATTCGCTGCATCTTGAAGTTTCTCTTCAAACTTACGAGCTTGAGCATCAATCGCCTCCAACGCAGCTTCGCCAGTTACCGCTTCAATACGGCGAATACCCGCCGCAATACCACCTTCTGAAGTGATCTTGAACAAGCCAATATCACCGGTATTTGATGCGTGAATACCACCACAAAGTTCGGTAGAGAAATCACCCATAGACAACACACGAACTTCGTCATCGTATTTCTCACCGAACAGCGCCATAGCCCCTTTCTGCTTCGCCGATTCGATGTCCATGATGTTGGTTTCAATGGTATGGTTACGACGAATCTGAGCATTCACAAGACGCTCAACTTCTTTCAGCTCTGCTGCCGTAACCGCTTCTAGATGTGAGAAGTCAAAGCGTAAGTTCTCTGCCTTCACCAATGAGCCTTTTTGCGCTACGTGCTCACCAAGCACTTGGCGTAGAGCGGCGTGAAGCAAGTGTGTTGCAGAGTGGTTAAGTGAAATGGCAGCGCGGCGCTCAGCATCCACGATCGCAGAAACTTGATCACCTTTTGCTAGTACACCTTCAACTAGAACGCCGTGGTGCGCAATTGCGTTACCTAATTTCTGGGTGTCTTCCACTTGGAATAGACCTGATTCTGTTTTCAACACGCCGCTATCGCCACATTGACCACCCGATTCAGCGTAGAAAGGCGTTTCTTGCAGAATGACGATCGCTTTGTCGCCTGCAGAAAGCGTTGCGGTCTCCTCACCTTCAACAAACAGAGCAACCACATCGCTACGCTCTTGTGTTGAGGTGTAACCACAAAATTCAGATTGTGCGTCTACTTTGATTTTCGCGTTGTAATCGGTGCCAAATTGGCCAGCTTCACGAGCACGCTGACGCTGCTCTTCCATCGCTTTCTCGAAACCAGCTTCGTCGATAGTGAACTCTCGTTCACGAGCAACATCGTTCGTTAAGTCAGCAGGGAAACCGTAAGTGTCATAAAGCTTGAATACCGTCTCACCGTCAAGTTCTTTACCTTCAAGTGCATCGAGTGCTTCATTGAGGATAACCATGCCGCGCTCGAGTGTGCGGCCAAAGTTTTCTTCTTCGATGCGAAGTACTTTTTCAACAACCGCTTGTTGTTTCTTCAGTTCGACCGCGGCACTGCCCATCACTTCAGCCAACACGCCGACAAGTTTGTGGAAGAATACGCCTTGTGCACCTAGCTTGTTACCGTGACGAACGGCACGACGAATGATGCGACGTAGCACGTAACCACGACCTTCGTTTGAAGGCATAACGCCATCCGCGATTAGGAATGAACAAGAGCGGATGTGGTCTGCGATAACGCGTAGCGACTGGTTTGATAGGTCTTCACAACCGACGATTTCAGCTGTCGCTTTGATTAGCTTTTGGAACACATCAATTTCATAGTTTGAATGTACGCCCTGCATGATGGCAGAAATACGCTCAATGCCCATACCGGTATCAACAGACGGCTTAGGAAGCGGCTCCATCGTACCATCGGCATGGC
GGTTGAACTGCATGAAAACGTTGTTCCAGATTTCGATGAAACGGTCACCATCCTCTTCAGGAGTGCCAGGACGACCGCCCCAAATGTGTTCACCGTGGTCATAAAAGATCTCAGTACAAGGACCACACGGGCCCGTGTCACCCATAGTCCAGAAGTTGTCTGACTCGTACTGCTTGCCACCTTTTTTATCGCCGATGCGAATGATGCGATCTGCTGGAACACCGACTTTTTGGTTCCAGATATCAAAAGCTTCATCATCCGTTTCGTAAACCGTGACCAAAAGGCGATCTTTTGGCAATTTTAGCGTCTCAGTTAGGAATTCCCAAGCAAAAGCAATCGCGTCTTCTTTGAAGTAGTCACCAAAGCTGAAGTTACCTAGCATTTCAAAGAAAGTATGGTGACGAGCGGTAAAACCTACGTTTTCTAGGTCATTGTGTTTACCACCTGCACGTACGCAACGTTGAGCCGTAGTTGCTCGAGTGTAGGCACGCTTTTCGGCGCCTAAGAAGCAATCTTTAAATTGGTTCATACCTGCGTTAGTAAATAGCAGGGTTGGATCGTTATGTGGAACCAACGATGAACTGTCTACGATTTGGTGTCCTTTGCTCTCAAAGAACTTGAGGAACGCGTTACGAACCTCATCAGTGCTCATGTACATGCAGCTCTTCCTGAAAATAGTCGAGTTAGAATTTTGCCGTATTGTAGATCATGCTTAAAGCTTCGACTAGTTTTCTTATGGAAAAGAGCGGATGAAATGAAAAAATCGTTTAATCGCGGGAAATCATGATTTCTTCACGAATGTGAGGAGAGAAAATTGGCAGGGAACTGACGGAAACATTATGCTTTATCGTTCAACGCATAGCGTATCTGTTCAAATGAAAAACCACGATATTGGAGAAAGCGAACTTGTTTTGCATACTCTTTTTGTTCTTTGGCCTTCACTCCGTTGAACTTTTTCATCGCAACGGATTTGGCTAACTCAAACCAATCTTGTGGCTCTTCCGCAAATGCATTTTCAACGACAGAATCAGACACTTGTTTAAGCGATAACTCCTGCCGTATGCGTCTTTCGCCATGCCCTTTTCCAACGTGTTGTCGAATTTGGCTTTTGGCATAACGAAGATCATCGAGATAACCATGTTCCAAGCAAAAATGCATCGCACTTTCAATATCTTGCCTAGCATACCCTTTAACTGTGAGCTTTTGCTTCAACTCATACTGACCATGATCTCGTCGACTGAGCAGTTGAATCGCCGTGTCTTTGCAATTCATCATTGGCGGCGTAAAGCGATGATGCATCAGAACTCCTTAAATCGGGGGATAAGTAATAACAATGAATCTGAAATGGAAAAGAGAAGAAAAAAGCCCTGCATTGCAGGGCTTAATAAAATAGAGGCTGAAATTAAAACTCTTCTTGCTCTGGCATTTCTTCGACTAACTCTGCCGATTCATCGTTGATGTTTGCTGGAGAAAGCAACATTTCACGCAATTTCGTATCCAGTACTTTTGCTACATCTACATTTTCTTTCAGGTATTTACAAGCGTTCGCTTTACCTTGGCCAATTTTGTCGCCGTTATAGCTATACCAAGCGCCTGATTTTTCAATCAGCTTACATTTCACGCCTAGGTCAATCAGTTCACCTTCGCGGTTAAAGCCTTGGCCGTACATAATTTGAGTGTTGGCTTCTTTAAACGGCGCAGCGATCTTATTCTTCACCACTTTGATGCGCGTTTCGTTACCCACGACCTCATCACCTTCTTTGATCGCACCAGTACGGCGAATATCAAGACGTACAGAAGCGTAGAATTTCAGAGCGTTACCACCCGTTGTGGTTTCTGGGTTACCAAACATCACACCGATCTTCATACGGATCTGGTTGATGAAGATACACATACAGTTAGACTGCTTTAGGTTACCCGTTAACTTACGCATCGCTTGAGATAGCATACGAGCTTGAAGACCCATGTGCGAGTCGCCCATCTCACCTTCGATTTCTGCC
TTTGGTGTCAATGCTGCAACAGAGTCGACAACAATAACGTCAACCGCACCTGAGCGAGCAAGAGCATCACAGATTTCCAATGCTTGTTCACCGGTGTCAGGCTGAGATACCAACAACTGGTCGATATTAACGCCAAGCTTCTTCGCATACACAGGATCCAACGCGTGCTCGGCATCGATAAACGCACAAGTTTTGCCTTCACGTTGAGCCGCAGCGATCAGCTCAAGGGTCAACGTGGTTTTACCTGAAGATTCTGGACCAAAAATTTCAACGATACGGCCCATTGGTAAGCCACCAGCACCCAGCGCAATATCCAGAGATAGTGAACCTGTCGAGATGGTTTCAACATCCATCGCACGGTTGTCACCTAGGCGCATGATTGAACCTTTACCGAACTGCTTTTCAATTTGACCTAGTGCGGCGGCCAGTGCCTTCTGTTTGTTCTCGTCCATTACTCTCTCCAGATAGTCACTCTCAGGTGATAGGTAATTATGTCGAAATAGGGTCGACAATGTTTGTCATGTTGGGGCTCATTATACTGTTGATTTGTACAGTGTCCACCCCTGTATAAAAAAAATTTCGCCTGTTATTGACGTTCACTGCCATTTTTCAGTAGCGCATCACACAAGACTTTTAAACTGTATTCAATCGCTTGTATGCGAACTTTTGCTCTATCGCCAGCAAAATAGCAGGTCTCACACCGTAGCCATCCATGCTTATCCGCAAAACCAAAACAAACGGTGCCAACAGGCTTTTCCGGGCTACCGCCGCTCGGCCCTGCAATGCCGCTAATGGAAACCGCGATAGTTGCATTCGAATGGGCTAAAGCCCCCAGTACCATTTCTTTGACCACCGCCTCCGAGACCGCACCAAAGTCGGCCAAGGTTTTTTCTGCCACCCCCAACATTTCTTGTTTTGCTTCATTGCTGTAAGTAACAAATGCACGATCAAACCAAGCCGAGCTACCCGCGACTTCCGTGACCATGTTTGCAACACCACCGCCAGTACACGATTCGGCGGTAGCCAACACTTCACCTTGTTGCAAAAGACGCTCACCCAGTTGTTCTGATAATTGTATTAGTGATTGCATGCCGATTTCCCTTTTCTCTCTTTTACACTATCAGTGATTCACGTATCCTAAGCCGCAAACAGTCCAAACTAAAGATAAAAACCGTGAAAGCTGAACAACAACATACCCCAATGATGCAGCAATACCTCAGATTGAAGGCAGAAAATCCCGATATTTTGCTGTTTTATCGCATGGGCGACTTCTACGAACTTTTTTACGATGATGCAAAGAAGGCGTCGCAATTGCTGGATATTTCTCTCACCAAGCGCGGCGCTTCGGCAGGAGAACCCATTCCGATGGCGGGTGTGCCATTTCATGCCGTTGAAGGGTATTTAGCCAAATTGGTTCAGCTTGGGGAGTCGGTGGCGATCTGCGAACAAGTTGGCGATCCTGCCACCAGTAAAGGCCCAGTAGAGCGCAAAGTCGTTCGTATTGTCACGCCGGGTACGGTAACGGATGAAGCTTTACTGTCTGAACGTTTGGATAACTTAATTGCCGCGATTTATCACCACAATGGTAAATTTGGTTACGCCACCTTGGATGTCACCTCTGGTCGTTTCCAATTGGTTGAACCTCAGTCAGAAGAGGCAATGGCAGCTGAGCTACAACGCACCTCTCCGCGTGAGTTACTCTTCCCAGAAGATTTTGAGCCCGTTCATTTGATGACAGGTCGTAACGGCAACCGTCGTCGTCCAGTTTGGGAGTTCGAACTCGAAACGGCCAAACAACAGCTCAACCAGCAATTTGGCACCAAAGACTTGGTCGGTTTTGGCGTAGAAAATGCAGTTTTAGGATTGTGCGCCGCAGGTTGCTTGATCCAGTATGTCAAAGATACTCAACGTACAGCACTTCCTCATATCCGCGCGCTTACTTATGATCGCCAAGACGACTCGGTTATCCTTGATGCCGCGACCAGACGCAATCTCGAACT
GACTCAAAACCTTGCTGGCGGAAGTGACAACACGCTTGCTGCGGTTTTGGATCGTTGTGCAACGCCGATGGGAAGCCGGATGCTGAAACGTTGGATCCATCAACCAATGCGCTGTATTACCACGCGAGAGCATCGCCTAGACGCCATCGCCGAACTGAAAGAACAAGCTCTATTTAGCGATATTCATCCTGTGGTGAAACAAATCGGCGATATTGAACGTATTTTGGCTCGCTTAGCACTCCGCTCTGCTCGTCCACGCGATCTCGCGCGATTACGCCATGCGATGCAACAGCTACCCGAATTGGCTCAAACGCTGTCTTCACTGGGCAATAGCCATCTCAAATCACTGGCCACGGCAGCCGCTCCAATGGATGATGTGTGTGAATTGCTCGAGCGTGCCATTAAAGAAAACCCGCCGGTTGTGATTCGCGATGGTGGGGTCATTGCCGAAGGGTACAGCGCTGATTTGGATGAATGGCGCGATCTTGCAGACGGTGCCACGGGCTACTTGGAAAAACTCGAAGAGGAAGAGCGTGATCGCCACGGTATCGATACACTGAAAGTGGGATACAACAATGTCCACGGTTTCTACATCCAAGTAAGCCGCGGCCAAAGCCATTTGGTTCCACCACACTATGTTCGCCGTCAAACGCTGAAAAACGCTGAACGTTACATCATTCCTGAACTGAAAGAGCACGAAGACAAAGTTCTCAACTCAAAATCAAAAGCATTAGCCATTGAAAAGCAACTGTGGGAAGAGCTCTTTGATTTATTGCTACCTCACCTAGCCCGTTTGCAAGAGTTGGCAGCAGCGGTTGCACAATTGGATGTATTGCAAAATTTGGCGGAGCGTGCTGATACGCTGGATTATTGCCGCCCAAATTTGACCAAAGATCCCGTCGTTCACATTACCGCGGGTCGTCACCCTGTGGTAGAACAAGTCACTTCCGATCCCTTTATTGCCAACCCAATTGAACTGAACAGCCAACGTAAAATGTTGATCATCACCGGTCCAAACATGGGGGGTAAGTCCACCTACATGCGCCAAACCGCATTGATTGCTTTAATGGCGCACATTGGTTCTTACGTGCCTGCAGAATCGGCCACCATTGGCTCAATTGATCGCATCTTTACTCGAATTGGTGCATCGGATGATCTCGCGTCAGGTCGTTCAACCTTCATGGTAGAAATGACAGAAACAGCCAATATCTTGCACAACGCGACAGCAAATAGCTTAGTTTTGATGGATGAAATTGGCCGTGGTACCAGTACCTATGATGGTCTTTCCCTAGCGTGGGCAAGCGCACACTGGCTTGCGACTCAGATTGGGGCAATGACGCTATTTGCGACGCATTACTTTGAACTGACAGAACTGCCAAATCAACTTCCTCACTTGGCCAACGTGCATCTCGATGCGGTTGAGCATGGCGACAGCATCGCCTTTATGCACGCCGTACAAGAGGGGGCGGCAAGCAAATCCTACGGTTTAGCGGTTGCGGGGTTAGCGGGCGTTCCAAAAACGGTGATTAAAAACGCCCGTCAAAAATTGTCTCAACTTGAGCTGCTCAGCGCAGAAGGTTCTCAGCCAAAAGCAAGAACGGTGGATATCGCTAACCAATTGAGCCTCATTCCAGAGCCAAGTGAAGTAGAACAAACTTTAGCCAGCATCGATCCAGATGATCTGACCCCACGCCAAGCGTTAGAAGCCCTATATCGTTTAAAGAAAATGCTCTAAAGTTTTTAAACGAAGCTCACTAAAATGACAAAAGGCTATGATTCCATAGCCTTTTTTCTATTGGAGAATGCAAATTCAATCCATATCTATATCAAACAAATTTTCCATATTCAGACCTTGTTTGATCAGAACTTCACGGAGGCGACGTAACCCTTCAACCTGAATCTGGCGTACTCGCTCGCGAGTTAAACCAATCTCTTGCCCAACCTCTTCCAGCGTTGAAGGCTCATAACCCAACAAACCAAAGCGACGTGCCAACACTTCT
TTTTGTTTCGGATTGAGCTCTTCCAACCAATGAATCAAGGAAACACGCATGTCTTCATCTTGCGTTGAAACCTCTGGGTCAGAGTTGTTTGCATCAGGAATAATATCCAATAACGCCTTCTCTCCATCTCCACCAATCGGTGTATCGACAGAACTAATACGCTCGTTCAAACGCAGCATTTTGCTCACATCGTCAACCGGAATCTCCAGTTGTGCAGCGATCTCTTCAGCGGTTGGTTCATGATCGAGTTTTTGAGAAAGCTCACGCGCCGTACGTAGATAAATGTTCAGCTCTTTAACAACATGGATAGGTAAACGGATGGTTCGAGTCTGATTCATTAAAGCACGTTCGATGGTTTGGCGAATCCACCATGTTGCGTAAGTCGAGAAGCGGAAACCACGCTCTGGATCGAATTTCTCTACCGCACGGATCAAACCAAGGTTACCCTCTTCAATCAAATCAAGCAGTGCAAGACCGCGATTACTGTAACGGCGGGAAATCTTAACCACTAGACGTAAGTTACTTTCAATCATACGTTTACGTGCGGCTTCATCGCCTCGTAATGCACGTCGAGCATAAAGAACTTCTTCTTCTGCGGTTAATAGTGGAGAAAAACCAATCTCTCCTAGATAGAGCTGTGTCGCGTCTAAGCTCTTGTTGCTGGCGTCAAACTCCTCGCGAAGTGTGACTTTTTCTTCGGTAAGATCTTCTTTTTCATTAATCTCGATATCGTCGTTAAAATCATCATCCATCTGATTAACATCGAAATCTTTAACTTTGGTGACTGTGTTGCTGATACTCATAACGCCTCCCCATGGCGAGTTAGCAAGACATTACAACTTTAAATGTCGCTAGTTGTGTCGCAAGTTGTTTCAAAGTTGATTTACGGTAAATAGCGCTTTGGATTCACTGATTTACCTTGGTAACGAATTTCGAAGTGCAACATCACTGTTTTGGCTCCAGAACTGCCCATTGTGGCGATTTGTTGCCCAGCTTTAACACTTTGCCCTTCCGATACCAACAACCGATCGTTATGCGCATACGCACTGAGGTAATTATCGTTATGTTTAACAATGATTAAATTGCCATAACCTCTTAGCGCGTTACCCGAATAAACCACGGTACCTCCTGCCGTTGAAACGATAGGCTGACCTCGCTGTCCTGCGATATCAATGCCTTTATTTCCTTGTTCTCCTGCAGAGAAGTTTTTGATTACTCTCCCTTTTGTTGGCCACTGCCACTTGGCTATTTTTTGGTTATTTGCCGGTGGTGGTGAAGTGACAGGTTTAACATTTTCTTTACCTTTTGAACCAACATACTCCTTTGGTTTGGATTGATCAATACCCTTAACTGGATCTTTTTTAACCACTTGAGGCGCAGGTTTTGCGGGTTTCTGAGTAGAGTTATTATTCGATTTTTTTGGCGTTGTCGGTGTTGGTTTGGGAGCCACCACCGCGACAGTTGACGCAGCAACCACAGTGGTAACAGGGGCTTCTACCGTTTGGCCATATTTGGGTGCGACATAAGCAGGTGCCCATAACTTGAGTTTCTGGCCCGGATAAATCGTGTAAGGCTCTGTCAAATTATTGAAACGGATTAACTCATTTACATCTTTATCTGTGACGTAGGATATAAAATACAGCGTATCGCCCTTTTCAACTTCGTAGTAGCTACCACGATAGCTACCACGATCAACCGCGTTATAGTCTTTACTCAATCCCGTTACCGGCGCAGGAGTATGCGCAGCACAGCCTACCAGTAAAGCTGTACTGAGAATCATTGACCCACGTAAATAAAGACGCTTACTCACACCTAATCTCAAATTATGCCAAGTCTCCGGCGATCAGCGGCACAAAGTTCACCGCTTCAATCACTTGGGAAACAAACTGATCTTTTTGACGCTCAATCAAGATCAACTGCTGCTCATCCGTACCAACAGGGAGTACCATGCGGCCACCATCATTCAATTGTTGAAGTAACACTGGTGGAACACTTTCCGC
TGCGGCGGTCACAATGATCGCGTCAAAAGGTCCTTTATTTTCCCAACCAAGCCAGCCGTCGCCATGCTTCGTCGAGACATTGTAAATATCCAACTGTTTTAAGCGACGTTTTGCATCCCACTGTAAAGATTTGATTCGCTCAACAGAATATACATGCTCAACCAGAAGCGCTAATACTGCGGTTTGATAACCCGATCCAGTGCCAATTTCCAATACTTTACTGTCTCTTTTGAGACGCAAAAGTTCCGTCATTTTGGCCACAATATAGGGCTGAGAGATCGTCTGACCTTGACCAATAGGCAATGCGTTGTTGTCATAAGCCTGATGCATCATAGCTTGTGACAAAAAACACTCTCTCGGTACACGAGCGATGGCAGATAGCACTTCGCTATCACGGATGCCATTGACCGCTAAGAAATGAACCAATCTTTCTGCTTGAGGATTGCTCATGATTACTTTTCCTTTAGCCAAGTATCCATCGCTCGGAGCGATTCATGGGCAGTTAAATCAACTTGCAGTGGTGTAATGGACACATAGCCTTGTTCAATCGCATGAAAATCGGTTCCCTCCCCCGCATCTTGCTCTTTACCCGGAGGGCCTAACCAATAGATTTCATGACCACGGGGATCAAGCTGCTTGATCATATTCTCTGCATGATGGCGCGCGCCTAAACGAGTCACTCGTATCTCTTCTAGCTGTTCGAGCGGCAGATCCGGAATATTGATATTCAATAAACGATTGGTCGGAATCGGTTTCGCTAAATGTTGCTCAACGATCCGCTTGGCAATGGTTGCTGCGGTTTTAAAGTGGGTTTTGCCAACCAAAGAAAAAGCGATCGACTGCACGCCAAGAAAATGCCCTTCCATCGCTGCGGCGACCGTGCCCGAATAAAGCACGTCATCCCCCAGATTGGCGCCATGATTGATGCCACTTAACACTAAGTCTGGGAGATCATTTTTCAGTAGTTCGTTTAAAGCAAAATGGACACAATCTGTCGGCGTGCCTTGCACTGAATACACATTTTCTTCAACACAGGTCACTCGCAGCGGCTGTTCCAGTGTTAACGAGTTCGACGCTCCTGAGCGATTGCGATCCGGAGCAACGATGATCACTTCTGCCAATGTTCTTAGCTCACTGGCTAACGTGCGAATACCTTCTGCGAAAACGCCATCGTCATTACTCAGCAAAATGCGCAGCGGTTTAGCTTGCTTATCTTCCATGATAATTAGAATTCTCTTTCAAACGCGATTTCTTGTACTAGCTCGCGGATAATAGACGTAGCAAAAGAGCCAGCATCGAGCGAGAAAGTCAGGATGATGTTATTGCCATCCACTTGCCAACTGAGATTGGCTGGCTTCAATGCGATCGCGCGGCGATCGTGTCTCATGCGATTGCCGCGAATCAGTGCCATAAGATCGGGTTCAGCATCTAAGAACACTTGCTCTAGCGCCAACGCGTCATCTTGAGTCGGCAAGGCATTATCGCCAGCCAAGGCACCACTGATCACCAAGTCGCCAGCATCATATTGAGACTGGAGATCCGCATGATTTTCTGCACTCACCAATAGCTGTTCATCCCCCTTAAACAGGATGTCGCCAAGCAAAACCTTATCAAACAGTGATTGCTCAAGTCGCGCCGAGACAATTAAGTTGAAGATCCACGAACGCGCAGCAGACAAGTAGAGGCTACGTTGATTTTGATTGCGGCTACGAACATTGTCTCTACCCCAACGTTTTGCTTCTTGCAGGTTATTGCCATTGTTACCAAAACGCTGATTACCGAAGTAATTTGGAACACCGAGTTTTGCAACCGTCTCTAGACGTTTGAGCACATCGTCCACATCCGAGACTTCGGATAGCGTCACAACAAAATCGTTCCCAACAAGGTCTCCAGGACGTAATTTCTTGTTATGACGAGCGGTCGTGAGAATTTCAATCGATGGGTACTGTGCGAGGAAAGCGGAAAAATCCGGTGTTTCCGCTTTAGGCAAATGGACACTA
AGCCATTGTTCCGTCACCGCGTGGCGATCTTTCAAACCAGCCCAGCTGACGTCTTTTGATTTCACGCCACATGCTTTCGCCAATTCATTGGCAACAAAGGAGGTGTTCTCGCCAGTTTTTCGAATGCGCACCATCAAGTGCTCACCAGAACCAGTGAACTCAAAACCTAAGTCTTCTCTTACCTGAAAATGTTCTGGTTTTGCTTTGATTTTTGCTTGTGCGGTTGGTTTACCCGCTAAATAGGCCAGAGAAGCCAAAGTGTCTGTCATAGAACTATCCACTGCCAGTTTTGCTAATGCTAATATTATTGCTTAATCAGTAATACAACGGCTTCCGTTGCAATACCTTCTTTGCGCCCTGTAAAACCCAAACGCTCGGTTGTTGTCGCTTTGACATTAATGTTACGAATATCGGTTTCCAATTCGTGAGCAATGGCTTCACGCATGGCATCGATATAAGGGGCCATTTTGGGTGCTTGCGCAATGATGGTCACATCCGCATTCCCCAGTTTGTAACCTTGTTCTTTGACTCGGCGATAAACGTCTTTCAACAATTCTCGACTGTCTGCCCCTTTCCATTTGTCATCCGTATCTGGGAAATGGCGACCAATATCACCCGCGGCAATCGCCCCCAGCAGAGCATCGCTTAACGCATGTAAAGCAACATCACCATCGGAGTGAGCAATAAGTCCTTGTTCATAAGGAACCGCAACACCACCGATGATAACCGGGCCTTCGCCACCAAATTTGTGTACATCGAAACCGTGACCAATTCTAATCATTTTCTATCCTTTTCTCGGCTCAAATAAAATTCCGCTAAAGCAAGATCTTCAGGTTGAGTGACTTTGATGTTACTCGAACACCCTTGAACTAAAGCTGGCAGCTCTCCTCGCCACTCTAGCGCAGAGGCTTCATCGGTTATCGCCACACCTTGCGCCAACGCATCGCTTAGTGCGTCTGTCAGTACTTCGGCTTTGAACATTTGCGGCGTTAAGGCATGCCAAAGTGCATTGCGATCCACTGTATGGTCGATCATTTGTTGTGCGTTGGCGCGCTTCATGGTGTCACGTACGGGCGTCGCCAAAATCCCCCCCGTTTCATGCGAGGAACAGCGTTCAATCAAGGCATCAATATCTTGATGAGCAACACAAGGTCTTGCAGCATCATGCACTAGAACCCAGTCTGCTTTTTGTGGCTGCAGCGAAAGGAAACGTAACGCCGATAACACCGAATCTGCTCGCTCTTTCCCACCAGCAACACGAATGATATCGGGATGCTGCGCAATGGCGAGCTCTGAGTAATAGGGGTCGTCTTCACTCACGGCAACGACGACTTGTGTAATGGCCGGGTGAGAGAGCAACCTTTCGATGGTATGTTCTAAAATGGTTTTGCCGTGAATCTGCAAATACTGCTTTGGTCGGTCTGCCTTCATTCGACTGCCGACACCTGCGGCTGGTACCACTGCGACCAATGACGTCATACTACGCGCCATTATTGATTCTCCTCACCGACAATGCGATAGAACGTCTCCCCTTCTTTCAGCATACCCAGCTCGTGACGAGCACGCTCTTCAATGGCATCAAGACCTTGCTTCAAGTCATCTATCTCTGCGTACATTTCGCTGTTTCTTGCCTGCAACTTGGTGTTCACCAATTGCTGCGCTTCAATATCACTTTCAATCGTGTAGTAATCGGAAACCCCATTTTTACCAAACCACAGGGTGTACTGCAGCCAACCAAACAGTAAGGTTAAAACGAGAATAAACAGTCGCATGGCAATTTACTTATGACTTATTAGAAGGAAAGATTGATGAAGCCCTGTTTATAGCACAATTAGCTCATTGGCTCTAGAGAGGGTTAGACAGAAGCAAGTGAAGAGAATGACAGAGTCTGATGCACGGTCACTTTTTTATTTATCGAAGATAGGGGGACTCGCAGAAATATTGGCGATAAAAAAACACCCCGCAAACGCGAGGTGTTTTGAAAGTTTCAAACTAAGCGA
TTAATTCAATTAAGAATTAAGCTTGGCCTTTCACTTCTTTAAGACCGTTGAAAGGAGCACGACCAGCTAGAGCTTCCTCGATACGGATTAGTTGGTTGTACTTAGCAACACGGTCAGAACGGCTCATAGAACCAGTCTTGATTTGACCTGCTGCAGTACCTACCGCTAGGTCAGCGATAGTTGCATCTTCAGTTTCGCCAGAACGGTGAGAGATTACTGCTGTGTAACCTGCGTCTTTAGCCATCTTGATTGCAGCTAGAGTCTCAGTTAGAGAACCGATTTGGTTGAACTTGATAAGGATAGAGTTAGCGATGCCTTTCTCGATACCTTCAGCTAGGATCTTAGTGTTTGTAACGAATAGATCGTCACCAACTAGTTGGATCTTGTCGCCTAGAAGTTGAGTTTGGTGTGCGAAACCAGCCCAATCAGACTCGTCTAGACCGTCTTCGATAGATACGATCGGGAATTGCTCAACTAGACCAGCTAGGTAGTGGTTGAACTCTTCAGAAGTGAACGTTTTGCCTTCACCTTTCATGTTGTAGATGCCAGCTTCTTTGTCGAAGAACTCAGATGCTGCACAGTCCATAGCTAGAGTAACGTCTTTACCTAGTACGTAACCCGCTGCTGCAACTGCTTCTGCGATAACTTCTAGCGCTTCAGCGTTAGACTTAAGGTTAGGAGCGAAACCACCTTCGTCACCAACAGCAGTGTTGTAGCCTTTAGACTTAAGAACTTTAGCTAGGTTGTGGAATACTTCCGCACCCATGCGAACTGCTTCTTTCAGAGTTTTTGCGCCAACTGGTTGGATCATGAACTCTTGGATGTCAACGTTGTTGTCAGCGTGCTCACCACCGTTGATGATGTTCATCATTGGTAGAGGCATAGAGAACTGACCAGCAGTACCGTTTAGCTCAGCGATGTGCTCGTATAGAGGCATGCCTTTAGAAGCAGCCGCTGCTTTAGCGTTAGCTAGAGAAACAGCTAGGATTGCGTTCGCACCAAACTTAGATTTGTTTTCAGTGCCGTCTAGCTCGATCATGATAGCGTCGATCGCTGCTTGATCTTTCGCATCTTTACCAACTAGCGCGTCAGCGATTGCACCGTTTACCGCTTCAATCGCTTTAAGAACACCTTTACCTAGGAAACGTGCTTTGTCACCGTCACGTAGCTCAAGCGCTTCGCGAGAACCAGTAGATGCGCCAGATGGAGCTGCCGCCATACCTACGAAACCGCCTTCTAGGTGTACTTCAGCTTCAACAGTTGGGTTACCACGTGAGTCGATGATTTCACGACCTAGAACTTTAACGATCTTAGACAT
Protein sequences of DBSCAN-SWA_3 >CP014049|2283483:2298726|2297424_2298726_-|AMG13109.1|DBSCAN-SWA MSKIVKVLGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKARFLGKGVLKAIEAVNGAIADALVGKDAKDQAAIDAIMIELDGTENKSKFGANAILAVSLANAKAAAASKGMPLYEHIAELNGTAGQFSMPLPMMNIINGGEHADNNVDIQEFMIQPVGAKTLKEAVRMGAEVFHNLAKVLKSKGYNTAVGDEGGFAPNLKSNAEALEVIAEAVAAAGYVLGKDVTLAMDCAASEFFDKEAGIYNMKGEGKTFTSEEFNHYLAGLVEQFPIVSIEDGLDESDWAGFAHQTQLLGDKIQLVGDDLFVTNTKILAEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALAGRAPFNGLKEVKGQA >CP014049|2283483:2298726|2283483_2286066_-|AMG13096.1|tRNA|DBSCAN-SWA MYMSTDEVRNAFLKFFESKGHQIVDSSSLVPHNDPTLLFTNAGMNQFKDCFLGAEKRAYTRATTAQRCVRAGGKHNDLENVGFTARHHTFFEMLGNFSFGDYFKEDAIAFAWEFLTETLKLPKDRLLVTVYETDDEAFDIWNQKVGVPADRIIRIGDKKGGKQYESDNFWTMGDTGPCGPCTEIFYDHGEHIWGGRPGTPEEDGDRFIEIWNNVFMQFNRHADGTMEPLPKPSVDTGMGIERISAIMQGVHSNYEIDVFQKLIKATAEIVGCEDLSNQSLRVIADHIRSCSFLIADGVMPSNEGRGYVLRRIIRRAVRHGNKLGAQGVFFHKLVGVLAEVMGSAAVELKKQQAVVEKVLRIEEENFGRTLERGMVILNEALDALEGKELDGETVFKLYDTYGFPADLTNDVAREREFTIDEAGFEKAMEEQRQRAREAGQFGTDYNAKIKVDAQSEFCGYTSTQERSDVVALFVEGEETATLSAGDKAIVILQETPFYAESGGQCGDSGVLKTESGLFQVEDTQKLGNAIAHHGVLVEGVLAKGDQVSAIVDAERRAAISLNHSATHLLHAALRQVLGEHVAQKGSLVKAENLRFDFSHLEAVTAAELKEVERLVNAQIRRNHTIETNIMDIESAKQKGAMALFGEKYDDEVRVLSMGDFSTELCGGIHASNTGDIGLFKITSEGGIAAGIRRIEAVTGEAALEAIDAQARKFEEKLQDAANKAKSLEKEIQQLKDKLASQASANLIDQVKEIAGVKVLVAKLDGADNKALRGMVDELKNQMGSGIIMLGNVADDKVGLIAGVTQDLTSKVKAGELVNMVAQQVGGKGGGRPDMAQAGGTDSSALPSALASVDAWIAERL >CP014049|2283483:2298726|2295712_2296189_-|AMG13106.1|DBSCAN-SWA MIRIGHGFDVHKFGGEGPVIIGGVAVPYEQGLIAHSDGDVALHALSDALLGAIAAGDIGRHFPDTDDKWKGADSRELLKDVYRRVKEQGYKLGNADVTIIAQAPKMAPYIDAMREAIAHELETDIRNINVKATTTERLGFTGRKEGIATEAVVLLIKQ >CP014049|2283483:2298726|2296185_2296899_-|AMG13107.1|DBSCAN-SWA MARSMTSLVAVVPAAGVGSRMKADRPKQYLQIHGKTILEHTIERLLSHPAITQVVVAVSEDDPYYSELAIAQHPDIIRVAGGKERADSVLSALRFLSLQPQKADWVLVHDAARPCVAHQDIDALIERCSSHETGGILATPVRDTMKRANAQQMIDHTVDRNALWHALTPQMFKAEVLTDALSDALAQGVAITDEASALEWRGELPALVQGCSSNIKVTQPEDLALAEFYLSREKDRK 
>CP014049|2283483:2298726|2296898_2297180_-|AMG13108.1|DBSCAN-SWA MRLFILVLTLLFGWLQYTLWFGKNGVSDYYTIESDIEAQQLVNTKLQARNSEMYAEIDDLKQGLDAIEERARHELGMLKEGETFYRIVGEENQ >CP014049|2283483:2298726|2286245_2286707_-|AMG13097.1|DBSCAN-SWA MHHRFTPPMMNCKDTAIQLLSRRDHGQYELKQKLTVKGYARQDIESAMHFCLEHGYLDDLRYAKSQIRQHVGKGHGERRIRQELSLKQVSDSVVENAFAEEPQDWFELAKSVAMKKFNGVKAKEQKEYAKQVRFLQYRGFSFEQIRYALNDKA >CP014049|2283483:2298726|2287994_2288498_-|AMG13099.1|DBSCAN-SWA MQSLIQLSEQLGERLLQQGEVLATAESCTGGGVANMVTEVAGSSAWFDRAFVTYSNEAKQEMLGVAEKTLADFGAVSEAVVKEMVLGALAHSNATIAVSISGIAGPSGGSPEKPVGTVCFGFADKHGWLRCETCYFAGDRAKVRIQAIEYSLKVLCDALLKNGSERQ >CP014049|2283483:2298726|2294633_2295677_-|AMG13105.1|tRNA|DBSCAN-SWA MTDTLASLAYLAGKPTAQAKIKAKPEHFQVREDLGFEFTGSGEHLMVRIRKTGENTSFVANELAKACGVKSKDVSWAGLKDRHAVTEQWLSVHLPKAETPDFSAFLAQYPSIEILTTARHNKKLRPGDLVGNDFVVTLSEVSDVDDVLKRLETVAKLGVPNYFGNQRFGNNGNNLQEAKRWGRDNVRSRNQNQRSLYLSAARSWIFNLIVSARLEQSLFDKVLLGDILFKGDEQLLVSAENHADLQSQYDAGDLVISGALAGDNALPTQDDALALEQVFLDAEPDLMALIRGNRMRHDRRAIALKPANLSWQVDGNNIILTFSLDAGSFATSIIRELVQEIAFEREF >CP014049|2283483:2298726|2292291_2293188_-|AMG13102.1|DBSCAN-SWA MILSTALLVGCAAHTPAPVTGLSKDYNAVDRGSYRGSYYEVEKGDTLYFISYVTDKDVNELIRFNNLTEPYTIYPGQKLKLWAPAYVAPKYGQTVEAPVTTVVAASTVAVVAPKPTPTTPKKSNNNSTQKPAKPAPQVVKKDPVKGIDQSKPKEYVGSKGKENVKPVTSPPPANNQKIAKWQWPTKGRVIKNFSAGEQGNKGIDIAGQRGQPIVSTAGGTVVYSGNALRGYGNLIIVKHNDNYLSAYAHNDRLLVSEGQSVKAGQQIATMGSSGAKTVMLHFEIRYQGKSVNPKRYLP >CP014049|2283483:2298726|2293231_2293858_-|AMG13103.1|DBSCAN-SWA MSNPQAERLVHFLAVNGIRDSEVLSAIARVPRECFLSQAMMHQAYDNNALPIGQGQTISQPYIVAKMTELLRLKRDSKVLEIGTGSGYQTAVLALLVEHVYSVERIKSLQWDAKRRLKQLDIYNVSTKHGDGWLGWENKGPFDAIIVTAAAESVPPVLLQQLNDGGRMVLPVGTDEQQLILIERQKDQFVSQVIEAVNFVPLIAGDLA >CP014049|2283483:2298726|2291218_2292211_-|AMG13101.1|DBSCAN-SWA 
MSISNTVTKVKDFDVNQMDDDFNDDIEINEKEDLTEEKVTLREEFDASNKSLDATQLYLGEIGFSPLLTAEEEVLYARRALRGDEAARKRMIESNLRLVVKISRRYSNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERALMNQTRTIRLPIHVVKELNIYLRTARELSQKLDHEPTAEEIAAQLEIPVDDVSKMLRLNERISSVDTPIGGDGEKALLDIIPDANNSDPEVSTQDEDMRVSLIHWLEELNPKQKEVLARRFGLLGYEPSTLEEVGQEIGLTRERVRQIQVEGLRRLREVLIKQGLNMENLFDIDMD >CP014049|2283483:2298726|2288581_2291143_+|AMG13100.1|DBSCAN-SWA MKAEQQHTPMMQQYLRLKAENPDILLFYRMGDFYELFYDDAKKASQLLDISLTKRGASAGEPIPMAGVPFHAVEGYLAKLVQLGESVAICEQVGDPATSKGPVERKVVRIVTPGTVTDEALLSERLDNLIAAIYHHNGKFGYATLDVTSGRFQLVEPQSEEAMAAELQRTSPRELLFPEDFEPVHLMTGRNGNRRRPVWEFELETAKQQLNQQFGTKDLVGFGVENAVLGLCAAGCLIQYVKDTQRTALPHIRALTYDRQDDSVILDAATRRNLELTQNLAGGSDNTLAAVLDRCATPMGSRMLKRWIHQPMRCITTREHRLDAIAELKEQALFSDIHPVVKQIGDIERILARLALRSARPRDLARLRHAMQQLPELAQTLSSLGNSHLKSLATAAAPMDDVCELLERAIKENPPVVIRDGGVIAEGYSADLDEWRDLADGATGYLEKLEEEERDRHGIDTLKVGYNNVHGFYIQVSRGQSHLVPPHYVRRQTLKNAERYIIPELKEHEDKVLNSKSKALAIEKQLWEELFDLLLPHLARLQELAAAVAQLDVLQNLAERADTLDYCRPNLTKDPVVHITAGRHPVVEQVTSDPFIANPIELNSQRKMLIITGPNMGGKSTYMRQTALIALMAHIGSYVPAESATIGSIDRIFTRIGASDDLASGRSTFMVEMTETANILHNATANSLVLMDEIGRGTSTYDGLSLAWASAHWLATQIGAMTLFATHYFELTELPNQLPHLANVHLDAVEHGDSIAFMHAVQEGAASKSYGLAVAGLAGVPKTVIKNARQKLSQLELLSAEGSQPKARTVDIANQLSLIPEPSEVEQTLASIDPDDLTPRQALEALYRLKKML >CP014049|2283483:2298726|2286810_2287860_-|AMG13098.1|DBSCAN-SWA MDENKQKALAAALGQIEKQFGKGSIMRLGDNRAMDVETISTGSLSLDIALGAGGLPMGRIVEIFGPESSGKTTLTLELIAAAQREGKTCAFIDAEHALDPVYAKKLGVNIDQLLVSQPDTGEQALEICDALARSGAVDVIVVDSVAALTPKAEIEGEMGDSHMGLQARMLSQAMRKLTGNLKQSNCMCIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRTGAIKEGDEVVGNETRIKVVKNKIAAPFKEANTQIMYGQGFNREGELIDLGVKCKLIEKSGAWYSYNGDKIGQGKANACKYLKENVDVAKVLDTKLREMLLSPANINDESAELVEEMPEQEEF >CP014049|2283483:2298726|2293860_2294628_-|AMG13104.1|DBSCAN-SWA MEDKQAKPLRILLSNDDGVFAEGIRTLASELRTLAEVIIVAPDRNRSGASNSLTLEQPLRVTCVEENVYSVQGTPTDCVHFALNELLKNDLPDLVLSGINHGANLGDDVLYSGTVAAAMEGHFLGVQSIAFSLVGKTHFKTAATIAKRIVEQHLAKPIPTNRLLNINIPDLPLEQLEEIRVTRLGARHHAENMIKQLDPRGHEIYWLGPPGKEQDAGEGTDFHAIEQGYVSITPLQVDLTAHESLRAMDTWLKEK |
14 | uncultured_Mediterranean_phage(22.22%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
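The protein headers listed for each prophage region appear to follow a pipe-delimited layout: contig, region coordinates, gene location with strand, protein accession, an optional keyword tag (for example `tRNA`), and a trailing `DBSCAN-SWA` tag. The sketch below parses one such header and reports whether it carries one of the phage-related keywords named in the legend. The field layout is inferred from the headers on this page rather than from any DBSCAN-SWA documentation, and the names `ProteinHeader`, `parse_header`, and `is_phage_flagged` are hypothetical.

```python
from typing import NamedTuple, Optional

# Keywords taken from the legend on this page.
PHAGE_KEYWORDS = {
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
}


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    start: int
    end: int
    strand: str
    protein_id: str
    keyword: Optional[str]  # None when the header has no keyword field


def parse_header(header: str) -> ProteinHeader:
    """Parse a header such as
    '>CP014049|2283483:2298726|2283483_2286066_-|AMG13096.1|tRNA|DBSCAN-SWA'.

    Assumed layout (inferred from this page, not from a specification):
    contig | region_start:region_end | start_end_strand | protein_id
    | [keyword] | tool tag
    """
    fields = header.strip().lstrip(">").split("|")
    contig, region, location, protein_id = fields[:4]
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = location.split("_")
    # Anything between the accession and the final tool tag is treated
    # as the keyword annotation.
    extra = fields[4:-1]
    keyword = extra[0] if extra else None
    return ProteinHeader(contig, region_start, region_end,
                         int(start), int(end), strand, protein_id, keyword)


def is_phage_flagged(record: ProteinHeader) -> bool:
    """True if the header's keyword is one of the legend's keywords."""
    return record.keyword is not None and record.keyword.lower() in PHAGE_KEYWORDS


if __name__ == "__main__":
    tagged = parse_header(
        ">CP014049|2283483:2298726|2283483_2286066_-|AMG13096.1|tRNA|DBSCAN-SWA")
    plain = parse_header(
        ">CP014049|2283483:2298726|2295712_2296189_-|AMG13106.1|DBSCAN-SWA")
    print(tagged.protein_id, is_phage_flagged(tagged))
    print(plain.protein_id, is_phage_flagged(plain))
```

Of the two example headers, both copied from the DBSCAN-SWA_3 listing, only the first carries a keyword field, so only it is reported as flagged.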
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|