Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP033398 | Escherichia coli strain WCHEC020031 plasmid p2_020031, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP033401 | Escherichia coli strain WCHEC020031 chromosome, complete genome | 9 crisprs | DEDDh,c2c9_V-U4,DinG,cas3,RT,csa3,PD-DExK,cas5,cas6e,cas1,cas2 | 0 | 19 | 9 | 0 |
CP033397 | Escherichia coli strain WCHEC020031 plasmid p1_020031, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP033400 | Escherichia coli strain WCHEC020031 plasmid pOXA1_020031, complete sequence | 0 crisprs | NA | 0 | 0 | 2 | 0 |
CP033399 | Escherichia coli strain WCHEC020031 plasmid pNDM5_020031, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 | 13932 : 53228 | | | | | |

Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP033400|13932:53228|DBSCAN-SWA CATGAAAACCGCCACTGCGCCGTTACCACCGCTGCGTTCGGTCAAGGTTCTGGACCAGTTGCGTGAGCGCATACGCTACTTGCATTACAGTTTACGAACCGAACAGGCTTATGTCCACTGGGTTCGTGCCTTCATCCGTTTCCACGGTGTGCGTCACCCGGCAACCTTGGGCAGCAGCGAAGTCGAGGCATTTCTGTCCTGGCTGGCGAACGAGCGCAAGGTTTCGGTCTCCACGCATCGTCAGGCATTGGCGGCCTTGCTGTTCTTCTACGGCAAGGTGCTGTGCACGGATCTGCCCTGGCTTCAGGAGATCGGAAGACCTCGGCCGTCGCGGCGCTTGCCGGTGGTGCTGACCCCGGATGAAGTGGTTCGCATCCTCGGTTTTCTGGAAGGCGAGCATCGTTTGTTCGCCCAGCTTCTGTATGGAACGGGCATGCGGATCAGTGAGGGTTTGCAACTGCGGGTCAAGGATCTGGATTTCGATCACGGCACGATCATCGTGCGGGAGGGCAAGGGCTCCAAGGATCGGGCCTTGATGTTACCCGAGAGCTTGGCACCCAGCCTGCGCGAGCAGCTGTCGCGTGCACGGGCATGGTGGCTGAAGGACCAGGCCGAGGGCCGCAGCGGCGTTGCGCTTCCCGACGCCCTTGAGCGGAAGTATCCGCGCGCCGGGCATTCCTGGCCGTGGTTCTGGGTTTTTGCGCAGCACACGCATTCGACCGATCCACGGAGCGGTGTCGTGCGTCGCCATCACATGTATGACCAGACCTTTCAGCGCGCCTTCAAACGTGCCGTAGAACAAGCAGGCATCACGAAGCCCGCCACACCGCACACCCTCCGCCACTCGTTCGCGACGGCCTTGCTCCGCAGCGGTTACGACATTCGAACCGTGCAGGATCTGCTCGGCCATTCCGACGTCTCTACGACGATGATTTACACGCATGTGCTGAAAGTTGGCGGTGCCGGAGTGCGCTCACCGCTTGATGCGCTGCCGCCCCTCACTAGTGAGAGGTAGGGCAGCGCAAGTCAATCCTGGCGGATTCACTACCCCTGCGCGAAGGCCATCGGTGCCGCATCGAACGGCCGGTTGCGGAAAGTCCTCCCTGCGTCCGCTGATGGCCGGCAGCAGCCCGTCGTTGCCTGATGGATCCAACCCCTCCGCTGCTATAGTGCAGTCGGCTTCTGACGTTCAGTGCAGCCGTCTTCTGAAAACGACAATGGAGGTGGTAGCCGAGGGTGTGGAAACACCCGACTGCCTTGCGTGGTTGCGGCAGGCGGGTTGCGACACGGTGCAGGGTTTCCTGTTCGCCAGGCCGATGCCGGCGGCGGCCTTCGTCGGCTTCGTCAACCAATGGAGGAACACCGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTAC
CAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCCTCAATACTCGTGTGGGCTCTGTTGCAAAAATCGTGAAGCTTGAGCATGCTTGGCGGAGATTGGACGGACGGAACGATGACGGATTTCAAGTGGCGCCATTTCCAGGGTGATGTGATCCTGTGGGCGGTGCGCTGGTATTGTCGCTATCCGATCAGCTATCGCGACCTTGAGGAAATGCTGGCGGAACGCGGCATTTCGGTCGACCATACGACGATCTATCGCTGGGTCCAGTGCTACGCCCCGGAGATGGAGAAGCGGCTGCGCTGGTTCTGGCGGCGTGGCTTTGATCCGAGCTGGCGCCTGGATGAAACCTACGTCAAGGTGCGGGGCAAGTGGACCTACCTGTACCGGGCAGTCGACAAGCGGGGCGACACGATCGATTTCTACCTGTCGCCGACCCGCAGCGCCAAGGCAGCGAAGCGGTTCCTGGGCAAGGCCCTGCGAGGCCTGAAGCACTGGGAAAAGCCTGCCACGCTCAATACCGACAAAGCGCCGAGCTATGGTGCAGCGATCACCGAATTGAAGCGCGAAGGAAAGCTGGACCGGGAGACGGCCCACCGGCAGGTGAAGTATCTCAATAACGTGATCGAGGCCGATCACGGAAAGCTCAAGATACTGATCAAGCCGGTGCGCGGTTTCAAATCGATCCCCACGGCCTATGCCACGATCAAGGGATTCGAAGTCATGCGAGCCCTGCGCAAAGGACAGGCTCGCCCCTGGTGCCTGCAGCCCGGCATCAGGGGCGAGGTGCGCCTTGTGGAGAGAGCTTTTGGCATTGGGCCCTCGGCGCTGACGGAGGCCATGGGCATGCTCAACCACCATTTCGCAGCAGCCGCCTGATCGGCGCAGAGCGACAGCCTACCTCTGACTGCCGCCAATCTTTGCAACAGAGCCCGCCGTGCTAGTCTGCTCGGTGATGGTGGAGTGAAGCCAACCCGCAATCGGGTTATGAATCTGCATCGCGATTCGCAATCAGCTGTCTCTTGAGCATGTCGAACTCCTGCGATGTTATCTCGCCGTGCTCGTGCTGCTACACCAACTTGTCCAACTCATCCGCCACACCGCTCCCCACCACCTGCGATGGCCAGGTTGTTTGCTGGGCGCATTGGCGGAGTATCGTCTCGACCCGGGACACCCACACCGGCACAACCTTACGGGAGTGATTCACTGTCAAAGAATCGGCCCGGTGCTCTGACGCAAGTATCGGGATGGTCACCATTTGTAAGCCGTAGACCTGAGTGGTGATCAAGACTTCGATACCACCGACCGTACCGGTACTAATCGACGACGGTCGTGTTCGTCGCCTGCCGCAGGGACTCTGCACACCTCCGTTTACGCATGTGCCTGGAGGAGTTGGAAATCGTCGTGTTCGGGAAACATTAAACACAGGATGGCAGCGATCTGAGCCAGCACATGATCAGCTAGCTCACCATCCGGATCGACGGCCCACTGCATCGTCGCGCCAGCGATGACCGAGTGCAGGAGCAACTCAGCTGCCGCAGGAGCACCTGGGGGCAGTCGCTTGCGGATCCCCTCCACCACCGCGCGGTTCCGCTGGATCGCAAGCGTGCGTAGCTCCGGCACCTGGAGCTCGTACCAGGAGATGAGATAGTTCACCGAGAAGTCGTTGCGAGTGTTCATGCTCCGAACGAGCACCTGCAAAAATTCCCAGAGCCCTTGCGGCCCTGCGCCTATCGGTATCGCATTCAGGTAATGCCGCACCTGCTCGACGCCGCGCTCCATCA
TCCTCACCAGCAGCGTATCGCGGTTGGTGAAGCGCTGGATTAACGCTGCGCGGGAGAGCCCCACCTCCTTTGCTACTCCGCTGAGCGTGAACTCTATGGGACCGCAACGCTTCAGCACTACGGTGGCGGCCTCGAGTACCTCGTCATCGGACTTGAGCTTGGGGCGGGGCATCAGTGTTCACCTTCTGTATGGGTTGGGGCGGAGGCTGTGGCTGCCGCCGCCATTGTAGCAAATTGAAGACGGAGCGAGAGTAGAGCCACGAGCCCCGCAAACACGGCCGATACAACGAGGCCAGGGAGCGGACCAGCAAGGTCGACAAACGCGCCGGCCGCAAGCATAACCATGGGCGAGGCTGACAGCATCACCGCCGAGACCGTGCCGAGTACCCGGCCGAGAAGTTCTGGCGGCGTGCGGTTGTAGATGGCAGCGTTGAGAATGGGAGAGACTGAGCCGGTCAGCAGTCCCACGAGCGCGCCCAACAACATCAGCACCGGCACGCCTGGCAACTGTGAAAGCAGAAGCGAGCCCACCGCAGAGCCACAAAATGCCACCGCCAGCCAGTTCTGCGCTGATATCCGGGCGCCGACCGACGCATGAATGGCAATGCCAAGGAGACCACCAGCCCCCATCATTGAGGAGAACAGCCCGAGCTCTGCTACTTGGCGTCCTGCATCTACAAACAGCGCAGGCATGATGACGCTGCCGTTGGCGCCAACGATGCCCACGAAGATCATCACTATACCAAAGAGAGGGCGCAGCAGGGGTTCGCTCCAGAGAAAAGCGACGCCGGCGCGCATGGAGAGAGTCGCCGTCGTGGTCATCGTCCGAGCGGCACGCGCGGGAAGCACCCACGCGCCGAGCAGACCTGCAAGGACGGAGCAGAACGCCGTCAGCCCGAGCGTTGGCGCAGCGCCAAGCAGGCCGATTGCGGCCCCCCCAAGGGCCGGGCCACCTAGAATCGCGACGTTCCCGATCACCGCTTTCAGTGACGAGACGCGCTCAACGGAGAGCCCGGCGACGTGGCCGAGTTTGGGCAGCTCACTGTCCTGCGCGGCCATACCGGGTGCGTCGAACGCGGCACCGAGCACCACGCAAGCGATCAGCCCAGTGTTCGAGAGGGCGCCAACGGCATCGAGCAGTGGGATGCTCGCCATGGCCACGCCGCCCACCACACCCGAGATCAATGCGACGGGCGCGCGCCCGAACCGATCGACGAGGCCACCACCAACCCACGCGCCGATGATGGTCGCGATGACGCTGCTAGCGGCCGTGGCGCCCGCCCAGGCCGCGCTCTTTGTATGAGACAGGACGAACCATGGAAGCGCGAGGGCCGCCACCGCGTTGCCGATCCGGAAGAGAAAGGTCGCCGCGAACAGCGTCGCGAGCGGGCTATATCGACGTTCGCTCATTCCGCTGCGGCGAGCTGCGCCTTCGCCGCAGCGAGGTACTCTTCGTTACCCGAGTCGAGGGCGAAGAGTGCGTAGGTGACCGCCCCGAACGCAAGGCGCTCCGCGATGTGGTGGGCGAGCCGCGGCCACACCCGGCCACCGGCCGCTTCATACGTGAGGAGGAGCTTCGCGAGCCCCTCTTCACCAAAGACCATAAGGTGCGCGGCCATGTCGATGGCAGGGTCATCAACGCGGGCCTCGCTCCAGTCGATCATCCCGCTGACGCGCTCCGTGTTGTCGATGAGCACATGGCCCACGTAGAGATCGCCATGCACCACCACGGAGAAATCTGGCCACGACGAATCGTCGTCGAGCCAGCGCTGCCACCGGTGGAGGCGCTTGTCGTTCACCACGAACTCGCGTCGGACGCGGTCAACGTCGTCGGCCACCTTCTGACGGGCCTGCGTCGGTGTACGGATGAGCATCCCCGCATCCACGGCGGCGGAAATGGGGACGGCATGCAGGGCGGCGAGCGCGGTCGCGAAGCTCTCCGCGAAGACCTCCGAGTCCTGCGGCACGACCCAGTCGGGCGTGGACGAACCAGGCTGGAT
GACCATCGCAGTCGAGTCTTCGAGCATGGGATAGGCAACGAGCTCGGCGTTGGCCACGCGCCAGTCCGGCACCGCGAACGGCAGGCGATTCTTGAGCATTGCCAGCACCCGCGCCTCTGGTTCGACCTTCGCGCTTACCTCGGCTCGGCGCGGGATGCGCAGCACCCACCGACGTCCATCGTCGACGGTGGCGATCACGATCCTATAGTCGAGCCCAAGCTCATTGACAGTCAGCGGGCCATGGAGCTTGAGCCCATGTCGGGCTGCAAGTGCGTACAGTTGGGAGGTATCGGCGGTCGTGACTACGGTCATGATTCACTCCTGAGGGCTTGACGGGTTTAGCCACCTAAATGTAACAGTCACGTCGGTTATATTCAATCCGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCCGTCGGCTTGAGATGCACATGTTGGCTGAGGTCGCCGGGTAGCGTGAACAGGCGCGGACCGTCCCGAAACTGGAACACGCGATACAGATGGAATTGATCGCCCGCCTCCTTGGAGAATTCGAGTTCGTTGTGGCTGACCAAGAAAGACGAGCCTACCCCGCCATTGGTGGTTTTCACCTCGATGAAGCGCTCATGGGCGTCCTCTTCGAACGACAGGATGTCGAACCCCGCACCGTCTCCCTGGGTGTCGGACACCCAATCCAGCCGCTGAAAAAGCTCTGGGTGGCCGAGCTCGGTCAGGCGTTGCTGTTCGTAGCCAATCACCCACTGCTCCCCTGCCCGGCCCAGCTTGCGGTTGGCTTCATCGCGAGCGGCATAATCGAACTTTCGCGGTAGGCGTTGCCGTAGAGATGCCGGGGTACGCACAAGCACTTCACGGGCGGGTGGTTCTACCAAAGCCGCTCGGTAGGTTTTGTCACCCGGAAGTTTTACCTCCTCCAGGGCATCGACAAGAGCGCCGACCGTCTGCTGATGTTCCAGAACGTAGGCGTGTACGGATTTACGCAGCAGCAGTTGGCTGTTGCCGCGTGGCTTGTAGCCGTTGATATAGGGCAGGCCCAGGGCATCGAGTACGGCGCTAATGTTCTGGTGCTTGAGCTCGACTGAAGACTTGCTGCGACCGTTCAGCAGTTGGCGCAGTGCCTGGTTGTGCTCGGACTTGTTGTACGGCTCCCCAGCCGCCTCGGCACGCAGCATGTCGAAATAGTCTTCGACCGTGGCCAGGACCTCTTCTTCGGAC
CAGTCTTCGCCGATGCGAATGATGCGAAACCCGAGCCGCGTCAGCGCCGGAACGACGGTCGCCTCGCCACCGGAGAAGCTGTCAGCAGTGAGCGGGCCCTGCTCGGGAAATTGCTTGCCGAAGGCCACACCGGCGATGGCCTTGGAATCGCAATCGGTGCCGGTCTTCGGATCACGTACCAGGAAGTCGCGGGACTTGCCGTAGCCGTGGCGCGCCAGGAATTTCGTGCGGCCCAGTTGCACGAACTCATCGATGGCAGCCTGCACGGCGGCGGGGCTTCGAAGCTGGGAGAGTTGAGACACAGGGTCCTTCCTTACTGTCATGGTGTGCCGGGAACCGCCGAGCCACGAGATTATGAGTAGCCCCTGAACAGAAACGTCACGATAAAAGCCGTGAACGCCACCAGGCCCATTAAATCCCTTGCGTATTTGCAGCCCGTGCTGTCCAAACCTGTACCAGGTCCGATCAACACGCTCCAACCATTGAGGTACGAAAACACCGCCTGACCGAACAAAAAATGCGTGATAGCCGCCGCAGCCATGACGCCGGAATCGTCAGACAGGCTGTAGCTGTTGACCATTGCCCTGTACGCCTGCATCTGAAATGGCTATTGCCCTGATGGGAGCCGGCTTTTCAGCCACCGACACCAGCGATGCCGTCAATATTCTTTACCCGATAACCATGACCGTACAAGCTAACAAGGCCTGGCAAGCCAGCGGACTTAAAAAGTCTTTTTTTCTACCATCCCACAAAAAAGTTCGTGGTGGAGAAAAGATAAGCTATGCAAGGCTTTAGGAGACGTGGTTTTTCAGGATGACGAAGAACGATTCGGCGCTAGGTGCAACATAGGTGCATCGCACGAGCGCTAGGAACGGCGAAAAAAGGCGGACGTGGCGAAATCGGTAGACGCAGCAGACTTTAAAATTGGAGTGCCCGCGGGGAAATCCGCGGAGTAGAACCGCTCAAAGTCGGGGAACGCTAACGGGCAATACCCTAAGCCAATCCCGAGCCAAGCCCCTTCGGGGGAAGGTGTAGAGACTGGACGGGCGGCGCCTAAAGCCTTCGGGCAATGGCGAAGGGACAGTCCAGACCACGAACGTCATCAGACGGCGGCGAAAGTCGAGGTGGTACGAAAATCTGCTTCTCTGTGAGAGTACGGGTTCGAGTCCCGTCGTCCGCACCACAAAGCCAAACATCCCTGCGATGATCGACCTCTGGGCGTTTGGTCGTGAGCCCGCCACCCTCGCGCTACGCTTTGCGCAGGCGTCGAAGGCGATCAGGTGCGCCCATCGATTCCGTCGAGCACCATCGCTGCGAGTTCATCGCTGGTGTGTGCACCACTATCGAGAACACGGGTTGTGCATGGAAGCCGTTCCCTTGCGGCCAAGCATCGGGCGACATTCGCTAATCGCCACTCTCGAATCTCCGCATTTCGATTCGGGTCAGGATGCATGGTCTGGTTCGCGATCCGGTGACGCAATAGGTCCTCGTTGAGCGTCAGAAAGATGTGCAGCAGCTGATCGTCGATCCGCCTTACCCCGTCGAGTATCTCAGTCAGATAGTCCGGGTGCACGAGCGTCATTGGGATGATGATGTCCTGCGAGTAATTCCTTCGAATCTCCCTGACCGCCGCGATCGTAAGTCCCCTCCACAAGGGGAGATCCTGATAGTCTCCGCTCGCTGGCATGGGGACCGTTTCTTTCACCACGAACCCGATTTCCTCGGGGTCAAAGATCAGCGATTTGGAACGCCGATCGCGCAGCCGCTTAGCGAGCGTCGTCTTTCCGGCGCCGAAAGGTCCGTTGATCCAGATTATCATTGTCGACGGCCTCTAACCTGAAGGCTCGCAAGAGCGCTCGACGGCCTCGTGCGGAGGCACGATCGGAGTGGTTCCGAAATGCTTCTCAAGATAGGTGACGCCGAACGTCACGATGTCCTGCGCGTCGAACAGGTAGCACTGAGCAAAGCCCACGACACCTTCTCGATGGCGACCGAGCTTC
ACGTAAGCATTTGCTATAGTTTCAACCGCATCCGGCTTTCCTTCGATAGCAAAGCAATCGAGAATGCCGTTTGAATCGTAATCCGATGCCGTTTTCCAGGCGACTTCACCGTCTCTTCCAAGCATCGGCATCTCATACGTCACCCACCGTTTGTTGGGGATATCGGCAACCGCCTCGGCGTAGTGCAATGCGGTAACGGAGTTTAGCGGCGCACCCAACAGCAGGGCCTTCCCGCCAAGGCGAACGAACCGCTCGACGGGCGATCCTTCCCCCAAGGCGTGACCGAGTTCGTGAGGCTCCGTCAGCGTTTCAGCCAGCGGACCAACCGCGACCATCGATGCATCGGGGTGCGCGCTGCGCCGCGCGCCGGGGGCTTGAACCAGAAATTGATTCAGCAGGCCGAACCCACGGTAAGTCCCGGCTGTTGCGGGATCGAACGGCAGCCAGGTACGGCGGGCTTCGTCATCCAGCCGAGCGCCATTCAGAGTCTCCTCGTAGGGTGATCGGTCCCACGACGCGTATCCCATCACAGTGCCAGTCGGCCCAACCGCGGAGCGTAACGCGGCAACGACCGTCTCCGCTCCTCCTTCGACCGGACCAATCGCTTTAAGTGAGGCATGCACCATCAAGAGGTCACCGGTTTGGACTCCGAGTTTTTGAAGCGCCTCCGTTATTGCCTTCCGCGTATGCATCGCGATATCTCCTCTAAACTGCAAAACACTATACGCTGATGAATCCCCTAATGATTTTGGTAAAAATCATTAAGTTAAGGTGGATACACATCTTGTCATATGATCAAATGGTTTCGCGAAAAATCAATAATCAGACAACAAGATGTGCGAACTCGATATTTTACACGACTCTCTTTACCAATTCTGCCCCGAATTACACTTAAAACGACTCAACAGCTTAACGTTGGCTTGCCACGCATTACTTGACTGTAAAACTCTCACTCTTACCGAACTTGGCCGTAACCTGCCAACCAAAGCGAGAACAAAACATAACATCAAACGAATCGACCGATTGTTAGGTAATCGTCACCTCCACAAAGAGCGACTCGCTGTATACCGTTGGCATGCTAGCTTTATCTGTTCGGGCAATACGATGCCCATTGTACTTGTTGACTGGTCTGATATCCGTGAGCAAAAACGGCTTATGGTATTGCGAGCTTCAGTCGCACTACACGGTCGTTCTGTTACTCTTTATGAGAAAGCGTTCCCGCTTTCAGAGCAATGTTCAAAGAAAGCTCATGACCAATTTCTAGCCGACCTTGCGAGCATTCTACCGAGTAACACCACACCGCTCATTGTCAGTGATGCTGGCTTTAAAGTGCCATGGTATAAATCCGTTGAGAAGCTGGGTTGGTACTGGTTAAGTCGAGTAAGAGGAAAAGTACAATATGCAGACCTAGGAGCGGAAAACTGGAAACCTATCAGCAACTTACATGATATGTCATCTAGTCACTCAAAGACTTTAGGCTATAAGAGGCTGACTAAAAGCAATCCAATCTCATGCCAAATTCTATTGTATAAATCTCGCTCTAAAGGCCGAAAAAATCAGCGCTCGACACGGACTCATTGTCACCACCCGTCACCTAAAATCTACTCAGCGTCGGCAAAGGAGCCATGGGTTCTAGCAACTAACTTACCTGTTGAAATTCGAACACCCAAACAACTTGTTAATATCTATTCGAAGCGAATGCAGATTGAAGAAACCTTCCGAGACTTGAAAAGTCCTGCCTACGGACTAGGCCTACGCCATAGCCGAACGAGCAGCTCAGAGCGTTTTGATATCATGCTCTGATGAATCCCCTAATGATTTTGGTAAAAATCATTAAGTTAAGGTGGATACACATCTTGTCATATGATCAAATGGTTTCGCGAAAAATCAATAATCAGACAACAAGATGTGCGAACTCGATATTTTACACGACTCTCTTTACCAATTCTGCCCCGAATTACACTTAAAACGACTCAACAGCTTAACGTTGGCTTGCCACG
CATTACTTGACTGTAAAACTCTCACTCTTACCGAACTTGGCCGTAACCTGCCAACCAAAGCGAGAACAAAACATAACATCAAACGAATCGACCGATTGTTAGGTAATCGTCACCTCCACAAAGAGCGACTCGCTGTATACCGTTGGCATGCTAGCTTTATCTGTTCGGGCAATACGATGCCCATTGTACTTGTTGACTGGTCTGATATCCGTGAGCAAAAACGGCTTATGGTATTGCGAGCTTCAGTCGCACTACACGGTCGTTCTGTTACTCTTTATGAGAAAGCGTTCCCGCTTTCAGAGCAATGTTCAAAGAAAGCTCATGACCAATTTCTAGCCGACCTTGCGAGCATTCTACCGAGTAACACCACACCGCTCATTGTCAGTGATGCTGGCTTTAAAGTGCCATGGTATAAATCCGTTGAGAAGCTGGGTTGGTACTGGTTAAGTCGAGTAAGAGGAAAAGTACAATATGCAGACCTAGGAGCGGAAAACTGGAAACCTATCAGCAACTTACATGATATGTCATCTAGTCACTCAAAGACTTTAGGCTATAAGAGGCTGACTAAAAGCAATCCAATCTCATGCCAAATTCTATTGTATAAATCTCGCTCTAAAGGCCGAAAAAATCAGCGCTCGACACGGACTCATTGTCACCACCCGTCACCTAAAATCTACTCAGCGTCGGCAAAGGAGCCATGGGTTCTAGCAACTAACTTACCTGTTGAAATTCGAACACCCAAACAACTTGTTAAGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGTTGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGTGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCCTAGAACGCACGAATGAGGGCCGACAGGAAGCAAAGCTGAAAGGAATCAAATTTGGCCGCAGGCGTACCGTGGACAGGAACGTCGTGCTGACGCTTCATCAGAAGGGCACTGGTGCAACGGAAATTGCTCATCAGCTCAGTATTGCCCGCTCCACGGTTTATAAAATTCTTGAAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGGTAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTTCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTT
GCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGTGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCTGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGCAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCTTATGGCAGAGCAGGGAAAGGAATTGCCGGGCTATGTGCAACGGGAATTTGAAGAATTTCTCCAATGCGGGCGGCTGGAGCATGGCTTTCTACGGGTTCGCTGCGAGTCTTGCCACGCCGAGCACCTGGTCGCTTTCAGCTGTAAGCGTCGCGGTTTCTGCCCGAGCTGTGGGGCGCGGCGGATGGCCGAAAGTGCCGCCTTGCTGGTTGATGAAGTACTGCCTGAACAACCCATGCGTCAGTGGGTGTTGAGCTTCCCGTTTCAGCTGCGTTTCCTGTTTGGGGTCGTTTGCGGGAAGGGGCGGAATCCTACGCTAAGGCTTTGGCCAGCGATATTCTCCGGTGAGATTGATGTGTTCCCAGGGGATAGGAGAAGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTAT
TGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCCTTGATATCTAGTATGACGTCTGTCGCACCTGCTTGATCGCGGCCGCGATAGCTAGATCGCGTTGCTCCTCTTCTCCATCCGCGTTCCAAGCTGCGGAAAGGCACCCATAAGCGTACGCCTGGTCGAGCAGGCGACGCGGATCGACGTCCAGCGCACGAGAGAATGCGTCCGCCATCTGTGCAATGCGTCTAGGATCGAGACAAAGGTCGTCTCTGTCAGCCGGATCGTAGAACATATTGGCGGCGCCAAAGCCCACTTCACCGACCAGACCGACGGGATCTATCACCAGCCAGCCGCGACTGGAGAACATGATGTTTTCATGATGCAGATCGCCATGTAGCCCACGCAGTTCCGAGGCATTGCTCATCATTTGATCGGCTATAATCGCCGCGTGGACGTAGTCAGTTTGACAACCTGCGTTTTGATCATCGCGCGCCCGCTGAAACAAAGCTGCAAAGCGATCCCGGATCGGGAGAAGGGCAGAAGGCAGGGGTTCCTCAGATGCGGCATACAGCTTCGCCATTAGTTCCGCTGCAATTTCGGTCGCCTGGTAGTCGCCGTGCTCGGCAACGATGTGAGAGAGCATTCGCTCCCCGGCATATTCGAGCAACATCAGATTGTTCTCACGACCGAGCAACCGGACTGCTCCCCTCCCATTGCGCCATACCAGATAGTCGGCCCCGCGCAGTTCATCAGCAATGTCTTCTATAGGTTTCAATCCCTTGACGATTGCAGGAGTCCCGTCTGGCAATGAAACTTTCCAAACGAGGCTGGAAAAGGTGTCCGCAATGAGAACAGGTTGCGAAACGTGCCAATGAGCAGGAAAAACAGGCGGCATGAACATCAACCCCAAGTCAGAGGGTCCAATCGCAGATAGAAGGCAAGGCGTTCGCGGTCGGGGGCTTCGATCCCCAATACATTGAATAGGACAGCGAAGGCGCGCTCTGCTTCATCTGGCGCTGCCCAGTTCTCTTCGGCGTTAGCAATCATGAGTGCCAAATCGGCATAGCGATCTGCTGTTCCGAGCCGCCCAAGGTCGATCAGACCCGTGCATTGAAGAGTTTTAGGGTCCACCATGAAGTTCGGCATGCAGGGATCACCATGGCAAACAACCATATCGGTGCGCTCTTGGTCGAGCCGCACCGGTAGCTCTCGTTCGACACGAGCCAAAAGATCGAGCTGCGGCGTACTCTTGTCCTCGTCCGGTAAGAAGTCGGGATTGACGGCATTGCGGGACACCACATCAACGGCGCGTCCGAACATTCGCGACAGCCTGCGCTCAAACGGACATTGATCAACCGATAGGCTGTGAACAGCGCCAAGTTGCTGCCCCATTGACGGCCACGCTTTGAGCAAATCCGCTCCAGACAGATCAGCCGCCGGTACTCCCGGAATTGCCGTTATCACCAAGCATGCACCCTCCTGTTCCTCCTGCCAGTTGATGACCTCGGGGCAAGCCACACCTCGACCTTTGAGCCAAATGAGGCGGTCACGCTCTCCAGCGAGCTCACCGCGGCGGGAAGCAGGTGCGATTTTCGCGAAGGCATGCCCGTCACCACGTCGAAAAACAAAATCACCAGATTCTCCGCCTCTGACAGGCAACCAGTCAGAATGCGATTCACCAAAAAAAATATTAGTTCGATTCAATGGAGGTTCCTTCAGTTTTCTGATGAAGCGCGAATATAGAGAAATATCCCGAATGTGCAGTTAACGAATTCTTGCGGTTTCTTTTAGCGCCGCCAATACCGCCAGCCCGTCGCGCAAGGGGCGCGGCTCGTGTGTGCGGATGAAGTCAGCTCCACCTGCGGCGGCGGCAAGCTCTGCAGCGAGTGTCGCGGCCCCGACAT
CCCCCGGACCACGGCCTGTGAGCGCGCGCAGAAAGGATTTGCGCGAAACAGACAGAAGCACCGGCAAATCGAAGCGCAGCCGCAATTCATCGAACCGCGCCAGCACCGAGAGCGAGGTTTCGGGAGCAGCCCCCAGAAAAAACCCCATGCCGGGATCAAGGACAAGGCGGTTGCGTTTGATACCGGCACCCGTCAGCGCCGCGATGCGCGCGTCAAAGAACGCCGCAATGTGATCCATGATGTCGCCAGCGGGTGCCTCGCGCCGATCTGCCTGCCCGTCTTGCACCGAATGCATAACGACGAGTTTGGCAGATGATTTCGCCAATTGCGGATAGAACGCAGCGTCTGGAAAACCGCGAATATCATTGAGATAGGCCACACCACGCGACAAGGCATAGGCTTGCGTCGCGGGTTGATAACTGTCGAGCGAGACGGGAATGCCATCTGCCTTGAGCGCGTCCAGCACCGGCGCGATACGCTCGATTTCTGTGTCGGACGAAACAGGCGCGGCGTCGGGGTTGCTGGATGCCGGACCGAGGTCGATCACATCTGCCCCCTCGGCCATCAGCTTACGCGCCTGCGCAATGGCTGCGTCTGGCGCCAGATACCGGCCTCCATCGGAGAAACTGTCCGAGGTTATGTTGACGATGCCGAAAATGATGAGCGATTTATTCATGGGGGCTTCTATAATAATAATAATCGAGCATGAGTCTCATACGGATGCTCGGGTCGAAAGGGAATCCCCAGGCGAGTAACCTGTTTGCGGTGATCCATTAGCTGCAGGAGCAGAATAGCATACATCTGGAAGCAAAGCCAGGAAAGCGGCCTATGGAGCTGTGCGGCAGCGCTCAGTAGGCAATTTTTCAAAATATTGTTAAGCCTTTTCTGAGCATGGTATTTTTCATGGTATTACCAATTAGCAGGAAAATAAGCCATTGAATATAAAAGATAAAAATGTCTTGTTTACAATAGAGTGGGGGGGGTCAGCCTGCCGCCTTGGGCCGGGTGATGTCGTACTTGCCCGCCGCGAACTCGGTTACCGTCCAGCCCAGCGCGACCAGCTCCGGCAACGCCTCGCGCACCCGCTGGCGGCGCTTGCGCATGGTCGAACCACTGGCCTCTGACGGCCAGACATAGCCGCACAAGGTATCTATGGAAGCCTTGCCGGTTTTGCCGGGGTCGATCCAGCCACACAGCCGCTGGTGCAGCAGGCGGGCGGTTTCGCTGTCCAGCGCCCGCACCTCGTCCATGCTGATGCGCACATGCTGGCCGCCACCCATGACGGCCTGCGCGATCAAGGGGTTCAGGGCCACGTACAGGCGCCCGTCCGCCTCGTCGCTGGCGTACTCCGACAGCAGCCGAAACCCCTGCCGCTTGCGGCCATTCTGGGCGATGATGGATACCTTCCAAAGGCGCTCGATGCAGTCCTGTATGTGCTTGAGCGCCCCACCACTATCGACCTCTGCCCCGATTTCCTTTGCCAGCGCCCGATAGCTACCTTTGACCACCATGGCATCAGCGGTGACGGCCTCCCACTTGGGTTCCAGGAACAGCCGGAGCTGCCGTCCGCCTTCGGTCTTGGGTTCCGGGCCAAGCACTAGGCCATTAGGCCCAGCCATGGCCACCAGCCCTTGCAGGATGCGCAGATCATCAGCGCCCAGCGGCTCCGGGCCGCTGAACTCGATCCGCTTGCCGTCGCCGTAGTCATACGTCACGTCCAGCTTGCTGCGCTTGCGCTCGCCCCGCTTGAGGGCACGGAACAGGCCGGGGGCCAGACAGTGCGCCGGGTCGTGCCGGACGTGGCTGAGGCTGTGCTTGTTCTTAGGCTTCACCACGGGGCACCCCCTTGCTCTTGCGCTGCCTCTCCAGCACGGCGGGCTTGAGCACCCCGCCGTCATGCCGCCTGAACCACCGATCAGCGAACGGTGCGCCATAGTTGGCCTTGCTCACACCGAAGCGGACGAAGAACCGGCGCTGGTCGTCGTCCACACCCCATTCCT
CGGCCTCGGCGCTGGTCATGCTCGACAGGTAGGACTGCCAGCGGATGTTATCGACCAGTACCGAGCTGCCCCGGCTGGCCTGCTGCTGGTCGCCTGCGCCCATCATGGCCGCGCCCTTGCTGGCATGGTGCAGGAACACGATAGAGCACCCGGTATCGGCGGCGATGGCCTCCATGCGACCGATGACCTGGGCCATGGGGCCGCTGGCGTTTTCTTCCTCGATGTGGAACCGGCGCAGCGTGTCCAGCACCATCAGGCGGCGGCCCTCGGCGGCGCGCTTGAGGCCGTCGAACCACTCCGGGGCCATGATGTTGGGCAGGCTGCCGATCAGCGGCTGGATCAGCAGGCCGTCAGCCACGGCTTGCCGTTCCTCGGCGCTGAGGTGCGCCCCAAGGGCGTGCAGGCGGTGATGAATGGCGGTGGGCGGGTCTTCGGCGGGCAGGTAGATCACCGGGCCGGTGGGCAGTTCGCCCACCTCCAGCAGATCCGGCCCGCCTGCAATCTGTGCGGCCAGTTGCAGGGCCAGCATGGATTTACCGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCCTTCAACGCATGAAAAAAGAGACTGCACAGTTATATCTGCTGTTTTTTGCTTTTCATCGGTTTCAGCAAATTAATGACAACTTAATTGAAGCATTGCTCCATTGGGTCGATCAATACGAGAAACAGGCCAAGCGTGCCGCTGAAGAAGCAATGAATAATGCGGTTACCAATGCAGCGAAAAATTTACAGGCTGCGGGTCATGTATTGAGCCTGTTTACGGATGACACCATCACCGATGACACACCTTTTTCCATTATTAAAGAAAAAGCCTATGCATTGCTTGAACAAGAGAGATTCCCATTAGTTGCTGATTACTTACGCAATATTGCTTTCGACAAAACGGCATTTGAATGGTCACATTACACAAAATTATCCGCCACATTCAAACGTAACTTAAGGCAACTTTTTACTGATCTGGATTTTGCCGGACGTGTAGAAGACTCTCCTTTGCTTGAAGCTATCGCGTTTTTACAAAACTTATTGCGCACAGAAAAATCACCAAGGCAAACTGACCCTAATTCATTTCCGACTGAGATTATTCCTAAAGGTTTACGCCGATATTTGTTTAGTAAAGAGGGCAAAACATTTAAAACGCTTGATGTAGATCGCTATGAGTTTTTGGTCTATCGCCTACTACGCAAC
TCACTGGAAGCGGGTGATGTGTACGTTAAACCCATATAATGCGCCTACTTATCAGCCTTTGGGACGTTGGGACGGTATTTTTCACTATATACCCCCAAGTTTTTAGACATGAAAAAAGCTCGATATATTATAAATATATCGAGCCTTTAAGTTACTGTTTTTCAATATCGTTCAAAATCTTGATAATTACAGTATACGGTTTTATGAAGTGGTATATTTGCAAATGCTTATTTTGCAAGGTCTATTTGCTTGATTTATCCTACAATCGTAATGCAAAACCCATGACCGGAAGCAACGAACTTAGGTTCAATCTCACTCAAATCCGAGGATAGCTTTTGTCCGGATTCTTGATTGGGCAATGTATCTGCGCCGCTTGATTTTACCGCTGCGTCCCAACTGCTTTCGATAACTTGTCCGTGATTCTCACCCATGCCGAACATGAGGTCGCTGCTAAAGAAAATACCACGTTTCTTTTCAAAAAATAAAAGTCCTTCCCACATATGCATTTCAGATGGGTAACTGATAGTTTGAAATTCAAAATCATCTCCGGCAAATATTTCATTCGGCTTTTTAATAAGCACATTGTTAGTAATACCAAACCCCATCAGTTGTCTTGCTGTGGTTTCGGAACAAACCGCGACAGCTTCGGGATGTTCTTTGAGAACCAAAGCAAGTCCGCCACATTCGTCTGATTCAAAATGAGAAATTAGAATGTATTTTATTTTGCGTTCACCGAGCAACTCTTGCAACTTAGGAATGGTAGTTTGCGCTTGTGATACGGCTCCGGTCTGAATGAGAACAGGCTCATTTGTCATCAATAAGTATTGGTGCATTGAAAGCTTAATCGGCTCCATTACCTCTGTAAATTGGTATAAATCTTTGATAATCTCTGTCATTGTCAAAACACCTTTTTCTATTTATAGTCTAACCTAAATTCCACGTGTGTTTTTTATTAGCTTCAAAAATCACTATTTCACGAAGAATTTAGACTGCTTCTCACACATTGTAACATTATTTACAACCACCTTTCAATCATTTTTGATAAATCATTGATTTCATCTTTGCTGCAATGATACTTAATAAACTCTGCAAGTTATCCACAGAGCAACACTCAATTTTATTGATGATATTCTTATTATACCAGACATTTTTCATACACTCCCTTGTACGGATAGTTTTCCGACAACTTCATGATTACATATCTTGCGGTTTTGATTATTTTTGCTGCAAGAAATACATACTTCAAACGAAAGGTCTTTATTTGCTGTCTGTATTCTGAAGAGTCCAAGGAATCAAACTTGAACAACAAAAATAGGTTATATGAAAGCATCATCATTTGAAACACGGCTTCATTCGCCCAAAATGACTTTAGCAAGAGATGACCCACCGCCATGTCGTATTTGGCTTCTTTGATATAGTTTTCAGCATTACCACGCTTTTCATAGTATATAACTACTTAGGGAATTCCATGACTGGACAGCGCATTGGGTATATCAGGGTCAGCACCTTCGACCAGAACCCGGAACGGCAACTGGAAGGCGTCAAGGTTGATCGCGCTTTTAGCGACAAGGCATCCGGCAAGGATGTCAAGCGTCCGCAACTGGAAGCGCTGATAAGCTTCGCCCGCACCGGCGACACCGTGGTGGTGCATAGCATGGATCGCCTGGCGCGCAATCTCGATGATTTGCGCCGGATCGTGCAAACGCTGACACAACGCGGCGTGCATATCGAATTCGTCAAGGAACACCTCAGTTTTACTGGCGAAGACTCTCCGATGGCGAACCTGATGCTCTCGGTGATGGGCGCGTTCGCCGAGTTCGAGCGCGCCCTGATCCGCGAGCGTCAGCGCGAGGGTATTGCGCTCGCCAAGCAACGCGGGGCTTACCGTGGCAGGAAGAAATCCCTGTCGTCTGAGCGTATTGCCGAACTGCGCCAACGTGTCGAGGCTGGCGAGCAAAAGACCAAGCTTGCTCGTGAATTCGGAATCAGTCGCGAAACCC
TGTATCAATACTTGAGAACGGATCAGTAAATATGCCACGTCGTTCCATCCTGTCCGCCGCCGAGCGGGAAAGCCTGCTGGCGTTGCCGGACTCCAAGGACGACCTGATCCGACATTACACATTCAACGATACCGACCTCTCGATCATCCGACAGCGGCGCGGGCCAGCCAATCGGCTGGGCTTCGCGGTGCAGCTCTGTTACCTGCGCTTTCCCGGCGTCATCCTGGGCGTCGATGAACTACCGTTCCCGCCCTTGTTGAAGCTGGTCGCCGACCAGCTCAAGGTCGGCGTCGAAAGCTGGAACGAGTACGGCCAGCGGGAGCAGACCCGGCGCGAGCACCTGAGCGAGCTGCAAACCGTGTTCGGTTTCCGGCCCTTCACCATGAGCCATTACCGGCAGGCCGTCCAGATGCTGACCGAGCTGGCGATGCAAACCGACAAAGGCATCGTGCTGGCCAGCGCCTTGATCGGGCACCTGCGGCGGCAGTCGGTCATTCTGCCCGCCCTCAACGCCGTCGAGCGGGCGAGTGCCGAGGCGATCACCCGTGCTAACCGGCGCATCTACGACGCCTTGGCCGAACCACTGGCGGACGCGCATCGCCGCCGCCTCGACGATCTGCTCAAGCGCCGGGACAACGGCAAGACGACCTGGTTGGCTTGGTTGCGCCAGTCTCCGGCCAAGCCAAATTCGCGGCATATGCTGGAACACATCGAACGCCTCAAGGCATGGCAGGCACTCGATCTGCCTACCGGCATCGAGCGGCTGGTTCACCAGAACCGCCTGCTCAAGATTGCCCGCGAGGGCGGCCAGATGACACCCGCCGACCTGGCCAAATTCGAGCCGCAACGGCGCTACGCCACTCTCGTGGCGCTGGCCACCGAGGGCATGGCCACCGTCACCGACGAAATCATCGACCTGCACGACCGCATCCTGGGTAAGCTGTTTAACGCTGCCAAGAATAAGCATCAGCAGCAGTTCCAGGCGTCAGGCAAGGCCATCAACGCCAAGGTACGTCTGTACGGGCGCATCGGTCAGGCGCTGATCGACGCCAAGCAATCAGGCCGCGATGCGTTTGCCGCCATCGAGGCCGTCATGTCCTGGGATTCCTTTGCCGAGAGCGTCACCGAGGCGCAGAAGCTCGCGCAACCCGATGACTTCGATTTCCTGCATCGCATCGGCGAGAGCTACGCCACCCTGCGCCGCTATGCACCGGAATTCCTTGCCGTGCTCAAGCTGCGGGCCGCGCCCGCCGCCAAAAACGTGCTTGATGCCATTGAGGTGCTGCGCGGCATGAACACCGACAACGCCCGCAAGCTGCCAGCCGATGCACCGACCGGCTTCATCAAGCCGCGCTGGCAGAAACTGGTGATGACCGACGCCGGCATCGACCGGCGCTACTACGAACTGTGCGCGCTGTCCGAGTTGAAGAACTCCCTGCGCTCGGGCGACATCTGGGTGCAGGGTTCACGCCAGTTCAAGGACTTCGAGGACTACCTGGTACCGCCCGAGAAGTTCACCAGCCTCAAGCAGTCCAGCGAATTGCCGCTGGCCGTGGCCACCGACTGCGAACAATATCTGCATGAGCGGCTGACGCTGCTGGAAGCACAACTTGCCACCGTCAACCGCATGGCGGCAGCCAACGACCTGCCGGATGCCATCATCACCGAGTCGGGCTTGAAGATCACGCCGCTGGATGCGGCGGTGCCCGACACCGCGCAGGCGCTGATAGACCAGACAGCCATGGTCCTGCCGCACGTCAAGATCACCGAACTGCTGCTCGAAGTCGATGAGTGGACGGGCTTCACCCGGCACTTCACGCACTTGAAATCGGGCGATCTGGCCAAGGACAAGAACCTGTTGTTGACCACGATCCTGGCCGACGCGATCAACCTGGGCCTGACCAAGATGGCCGAGTCCTGCCCCGGCACGACCTACGCGAAGCTCGCTTGGCTGCAAGCCTGGCATACCCGCGACGAAACGTACTCGACAGCGTTGGCTG
AACTGGTCAACGCTCAGTTTCGGCATCCCTTTGCCGGGCACTGGGGCGATGGCACCACATCATCATCGGACGGACAGAATTTCCGAACCGCTAGCAAGGCAAAGAGCACGGGGCACATCAACCCAAAATATGGCAGCAGCCCAGGACGGACTTTCTACACCCACATCTCCGACCAATACGCGCCATTCCACACCAAGGTGGTCAATGTCGGCCTGCGCGACTCAACCTACGTGCTCGACGGCCTGCTGTACCACGAATCCGACCTGCGGATCGAGGAGCACTACACCGACACGGCGGGCTTCACCGATCACGTCTTCGCCCTGATGCACCTCTTGGGCTTCCGCTTCGCGCCGCGCATCCGCGACCTGGGCGACACCAAGCTCTACATCCCGAAGGGCGATGCCGCCTATGACGCGCTCAAGCCGATGATCGGCGGCACGCTCAACATCAAGCACGTCCGCGCCCATTGGGACGAAATCCTGCGGCTGGCCACCTCGATCAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCAGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCACCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGAAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTACCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTTCAAGAATTTTATAAACCGTGGAGCGGGCAATACTGAGCTGATGAGCAATTTCCGTTGCACCAGTGCCCTTCTGATGAAGCGTCAGCACGACGTTCCTGTCCACGGTACGCCTGCGGCCAAATTTGATTCCTTTCAGCTTTGCTTCCTGTCGGCCCTCATTCGTGCGTTCTAGGATCCTCCGGCGTTCAGCCTGTGCCACAGCCGACAGGATGGTGACCACCATTTGCCCCATATCACCGTCGGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTT
TTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCCCACATCTTTTGTCACCAACGAGCGGCTGCCTATCACCGCACCGTGCCCGATCTTGATTCCGGGCATGACCATTGCCTCAGAGCCGATCCAAACGTCATTGCCAATGACAGTATTACCTGCTTTTTGGAAGGCATCGAGTGCGCTTGAGAATGCAGGTTCTTCCTGCATATAAAAGAACGGGAAAGATGATGCCCAGTCGTACCGATGCCCCTGATTGCCAGCCATGATAAAGGAAGCCCCACTCCCGATAGAGCAGAAACTACCGATGATCAACTTATCAACGTCATCACGGTCCGGAAACAGATACCGTGCGCAGTCATCGAATGAGTGCCCATGATAGTAGCCAGAGTAATAGCTGTACCGCCCAACTTTGATATTGGGGTTCTTCACTTGCTCAGAAAGCAGCTTGCCTTTGAAGGGGCTATCAAAGTAGTTGGTCATAAGAGATCCCGCGGTCTGTGACTTTGCCGTCTAACGTTTGAAATAAGGGGCGCCGAGCGCCAGCGAGGGGAGCCAAAAGCTTGCTTTTGGCCGTCCCGACTTGATTGAAGGGTTGGGCGATTTTGCCATTAGATTTTTTATAAATTTAGTGTGTTTAGAATGGTGATCGCATTTTTCTTGGCTTTTATGCTTGATGTTAAATTCGACCCCAAGTTTCCTGTAAGTGCGGACACAAAAACATATTTATGTCCTGATTTGCTTATAATAAACCCTTCAAACCATCCGTTTTGTAAGGTTCTATTTGCTGTGAATCCTGCACCAGTTTTCCCATACAGTTTTGTACTATTATCCAGATCTTGTAGATACATGTTCTCTATGGTGTTTTCTATGGCTGAGTTTTTAACTGGGAGATTGTGATTAATAATTTTACGCAGGAATTGAATTTGTTCTTCTGGTGAAATTTTTAAGCTACTTTCGAGCCATGCTTCTGTTAATCCGTTGTTTCTTTCTTTATCTCCAGAGAAGTCTTGATTTCCATAATCAAAATCTTTGAGATAATTCTTGATTTTATTTAATCCAATTTTTTGGGTTATTTCTTGCGAAACCCAAACAACAGAAAATTGCATCCACGTCTTTGGTGTATGATTGCTGTTCCAGATCTCCATTCCTTTGGGGGTTTTATCCCATTTGAATATGGTTTTCTGATCTATTATTTCCGCATCAAATGCCATAAGTGATAATGCGATCTTGAAAGTTGAATCTGGTGCCATTTGCGTTGCACAC
TTTGCTTTATTGAATTGAGCAATTTCAGCGTTTGTGGATGCATCGTAAAGTAAAAAACAACCTTCAGTTCCTTCAAATAATGGAGATGCAACAGTAGAGATATCTGTTGATGCACTGGCGCTGCTGTAGATAATATTTGCAATTATTAAAAAAATAGCGAAGTTGATATGTATTGTGTTTTTCATAATAAGTATTGGTTTGGTAAAGGGCTTAATTTTAACGGCTAACAATTAATGAGGCTCCGGGTTCGCCCAACGTTTGACATGAGGGGCGGCCAAGGGCGCCAGCCCTTGGACGTCCCCCTCGATGGAAGGGTTAGGCATCACTGCGTGTTCGCTCGAATGCCTGGCGTGTTTGAACCATGTACACGGCTGGACCATATGGGGTGGTTACGGTACCTTGCCTCTCAAACCCCGCTTTCTCGTAGCATCGGATCGCTCGCAAGTTGCTCGGCGACGGGTCCGTTTGGATCTTGGTGACCTCGGGATCATTGAACAGCAACTCAACCAGAGCTCGAACCAGCTTGGTTCCCAAGCCTTTGCCCAGTTGTGATGCATTCGCCAGTAACTGGTCTATTCCGCGTACTCCTGGATCGGTTTCTTCTTCCCACCGTCCGTCCCCGCTTCCAAGAGCAACGTACGACTGGGCATACCCAATCGGCTCTCCATTCAGCATTGCAATGTATGGAGTGACGGACTCTTGCGCTAAAACGCTTGGCAAGTACTGTTCCTGTACGTCAGCAAGTGTCGGGCGTGCTTCTTCTCCGCCCCACCACTCGACGATATGAGATCGATTTAGCCACTCATAGAGCATCGCAAGGTCATGCTCAGTCATGAGGCGCAGTGTGACGGAATCGTTGCTGTTGGTCACGATGCTGTACTTTGTGATGCCTAACTTTGTTTTTGCGTTGCTCATGATGTCTAACTCCCAATTTGTGTAGGGCTTATTATGCACGCTTAAAGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCTAACAGCGGATGTTCGCGATCACCTGGACAAACTTCTGAGTGAAATGCTCGCCGGCAATATCAGTCGTTTCATCTGGCTTCGCAACTTCGAGGTTGGTAACAACTCGGCTGCTGCTAACCGTTTGCTCGACAGGCTCGAATTTCTGCGTACCCTGAATATCAATCATAGTGCTTTGGCCAGCATACCTGCCCATCGCATTGCCCGGCTGCGTCGGCAGG
GTGAACGCTACTTCACCGACGGTTTGCGTGACATCACTTCGGACCGCCGCTGGGCGATCCTTGCCGTCTGTGTTGTGGAGTGGGAAGCGGCGATTGCTGATGCCATAGTCGAAACCCATGACAGGATCGTAGGAAAAACCTGGCGGGAAGCGAAGCGCCAGCATGACGAAACAATTTCCGGCTCTAAAGCCACACTCACGGATACGATCCGTACCTTCACCGCGCTGGGAGCTTCGTTGCTTGAGGCCCGCAGTGACGGAACCCCGCTGGAGATGGCTGTCGCCAGTTCGGTTGCATGGGACCGGCTCGCTCAACTGGTAGCGACAGGGACTCAACTCAGCAACACGCTAGCCGATGAGCCTCTTGCATATGTCGGGCAGGGATACCATCGCTTTCGTCGTTATGCGCCCCGCATGTTGCGCTGTCTGAAGCTCGAAGCCGCGCCGGTCGCCGGACCATTGGTAGCAGCAGCTTTGTCGATCGGAGAGATGAAAGGTGTTGCATCGCCAGAAAGGCGTTTCCTGCGGCCCAGCTCCAAATGGAACCGTCATTTACGAGCTCAGGAAAAAGGAGATACCCGTCTTTGGGAAGTGGCGGTACTCTTTCACCTCCGGGATGCTTTTCGTTCCGGAGATGTCTGGCTCGCTCATTCGCGCCGCTATGGTGACCTCAAGCAGGTACTGGTGCCGATGATCGCGGCGCAGGAAAATGCAAAACTGGCCGTGCCTTCCAACCCACAGGATTGGCTGGCAGACAGAAAGGCGCGACTCACGATCGCTCTTAAGCGGCTGGCCCGGGCTGCCCGTAACGGCACTATTCCGCACGGTAGCATAGAAGATGGAACGTTGCGGATCGACAGGTTGACAGCAGACGTGCCGGATGGTGCCGAGGCACTCATACTGGATCTGTATCGCCGAATGCCGTCCGTTCGGATTACCGACATGCTGCTTGAAGTTGATGCAGCCCTTGGTTTCACAGATGCGTTTACCCATCTGAGAACCGGGGCTCCATGTCGCGACCGGATCGGTCTGCTCAACGTCCTGCTCGCTGAAGGGCTCAATCTGGGCCTGCGTAAGATGGCGGAAGCTACAAACACGCATGATTACTGGCAGCTCTCACGCCTTGCCCGCTGGCATGTTGAAAGCGAAGCCATGAACCAGGCATTGGCAATTGTGGTGGCCGCGCAGGGTAAACTGCCGATGTCACGCGTCTGGGGGATGGGCACGTCAGCATCGAGCGATGGTCAGTTTTTCCCGACAGCGCGGCATGGCGAAGCCATGAACATGGTCAATGCCAAATATGGTTCTGTTCCCGGCCTCAAAGCGTATACTCACGTAAGCGACCAGTTCGCGCCATTCGCTTGTCAGTCGATCCCGGCGACCGTGAGCGAGGCACCGTATATTCTCGATGGACTACTGATGAACGAGGTCGGTCGCCATGTTCGCGAACAGTATGCCGATACAGCAGGATTCACCGACCATTTGTTCGGAGCCAGTAGCCTGCTCGGCTACAATCTCGTTCTGCGAATCAGGGATCTGCCATCGAAGCGGTTGTACGTATTTAATCCCGATACGACCCCCAGGGAGTTACGCAAGTTGGTAGGTGGAAAAGCCCGGGAGGATCTTATCGTTGCGAACTGGCCTGATATTTTCCGTTGTGCCGCGACGATGACCGCTGGCAAAATCAGGCCCAGCCAACTCCTGCGCAAGCTCGCTTCTTACCCACGACAAAACAACCTTGCAGTTGCGCTTCGTGAAGTTGGTCGTATTGAACGGACCCTTTTCATTATTGAGTGGATCCTGGATACGGACATGCAGCGGCGTGCTCAGATCGGTCTTAACAAGGGAGAGGCCCACCATGCGCTCAAAAATGCGCTCCGTATCGGGAGGCAGGGGGAAATTCGCGATCGCACGACAGAGGGGCAGCACTACCGAATCGCTGGGCTCAATTTATTGACTGCGGTGATCATTTACTGGAATACCGTCCATCTTGGTCATGCC
GTCACGGAGCGGCGGAACGAAGGGTTGGATGTTCCCCCTGAATTTCTTCCCCACATATCCCCATTGGGCTGGGCGCACATTCTACTGACTGGCGAATATCTTTGGCCCAAGGAACCGAAAGCTTAGGGTGTCATTTCGCCCTCAGCCGGAACCGACCCCTTTTAGCCAAATAACGTTTGGGAATCACCAGAATGGTGGGACAACAGCGGTTTTAGTGCCCTAAATCGTACGTTTTCATCCAGTTGCCCCTCAAACCCCATGTTCAAGTCAGAATAGTGGACAGGCGGCCAAGAACTTCGTTCATGATAGTCTCCGGAACCCGTTCGAGTCGTTTTCCGCCCCGTGCTTTCATATCAATTGTCCGGGGTTGATCGCAACGTACAACACCTGTGGTACGTATGCCAACACCATCCAACGACACCGCAAAGCCGGCAGTGCGGGCAAAATTGCCTCCGCTGGTTACGGGCACAACAACAGGCAGGCGGGTCACGCGATTAAAGGCCGCCGGTGTGACAATCAGCACCGGCCGCGTTCCCTGCTGCTCATGACCTGCGGTAGGATCAAGCGAGACAAGCCAGATTTCCCCTCTTTCCATGTCAGATTTCCTCCTGACCAGTCGCCGGTGCATCCAGCCATTCTCGTTCTTCAGCTGATATTTCAGCATTCGGATCACACTGTGCCAGTAGCTCAGCCAGTGAATATTGCGGGCGTCTGTACGGCTCAACAATCAGCCGGCCATTATCAATGACCATGCCAACTTCATTATCTGTGCCCAGAGACAGCGCATTCAGCAGTGCCGGTGGGACGGTCAGCATAACTGAGCCGCCAACCCTCTTCAGTCGGGTGGTATGCATTCTTCACCTCCATAAAAGTTATATTTAAATATAACATCCACTAAAAAAACACACCAGGCTTTAACGCACAATGTTTAATAAAAATATAACTTTCAACCAAACAGTAAACCCAGCGTGGTTGCTTCCATATGCAGCAATACCGGCAGCAGCAGGCCACCACTTCTGATCCTGGCCACTGATGTAATCAACCCCACCAGGAACAGTTCTGCCAGTGTCAGCAGGTTCTGATACTGGCTGTGCGCGGCGACGAACAACAACGACGTTATCAGCGCCCCCAGCCACATCGTCCAGCAGTACCGTGAACGGAAGACGTTCAGCATAATCCCCCGGAACAGCGTTTCCTCATTCAACGGGGCAAGGATAAAGATGGTCAGCAACGTCAGGATCACGTCAGGTATGGACTTATCGGCAAAAAGTTTCGTCATAAATGGCTCAGCAGGCAGAGCCAGCGCCTTACCGAGCAGAAATACACCGACATACACCACGGCCATCGCACCGACCAGCCACGGTACGCCAACGTTGCGCAGCTGACCAACGACCGGTAGCGGAGCTATCCAACGGCGGTATACCAGGAAAACACACAGCAGGTACATCAGAACAGTACCATGACTGAAGAACAAATAGTTTTTTCCTGATCCATAAAGCAGAACGGCCTGCTCCATGACAAATCTGGCTCCCCAACTAATGCCCCATGCAGCCAGCATAACCAGCATAAACTGCAGATATTGATTACGTGTTTGAATCATTGCATCGCCTGTAAATTTTTAACTTGTCCTATTTTTGTCATTACCACGTATATACACATGTATAACAATTCAGATATCGTTACCAGGATATGCCGCATCAGCGGCATGGAAGGCGGCACTCTGTTGTTTCATATGATACAGGAGTAAAACCGCCGAAGCCCGGCGTAAGCCGGTACTGATTGATAGATTTCACCTTACCCATCCCCAGCCCTGCCAGACCATACCCGCTTTCAGCCATGAGAGAGCTTCTGTGCGCGGTCGGAGTGGTCCCGACGAGGGTTTACCCGAAGTCGGGGCGTATCTCCGCGTTAGCGGGCCGTGAGGGCCGCTTACGAGCGTGTACTGAGAACTTCCAGCGAGAAGACTGACAGCGATGAAGATGTAGTTACAACATTCATAA
TTAAAAGCGACTCTGTTCCGGCCCGAAGGGCCGGGGCGGGGCCGCTTTTCAGTTATGAGGGAGGGGCTTTGTGGTTTCAGTTCTGCGCTGGTTCGGGGTTTTTCTGGAGGTTGGTTTTGTGTGTTGTAACTAAAGTGGCTCCGGTTGGGGCCCGCCGTTTACGGTGGGAGGTGCATATCTGTCTGTCCACAGGACAAGCAGTGAATAGGTTTTCTTTTTAAATGAATGTAATTAAGTAGTTTAAAGGAGATATAAACAGGTGTTTAAAAGATACATTGCACCCTGTAGGGCTGACGGCTGGCGCTTTATGACATTAACGATTGTAACCTTATGGGGAAGTCCCTTGCAGTTTAATGTGGATAAGCAAAATTACCCGTCTGTGAGGCGTGTTTTGTATCAAAAACAAGGGGGACCGGATGCACCTGAAGGTGGATGATGAGGTTGTTTTTTTGTATGTGGTGCTGATTTTTTGTGCACTGGCGGGCTTCAGGCGTGCGAATGCCTCCGGCGCGTGCCGAATTATTCAGAGGAGGTCACTTTCAGGGGGAAGCTGTGGCCAGGCGGCTGTAATTGCGGTTACGTGACAGAATCATGCGCTCCTTCACACGACGCTCCACTTCGCGTTTTACCGCCTCACGATTGGCAGTGAAGCGCCCTTCCGAGATTTCACGCGTCAGCTGTCGTTTCACCAGGGTGACGATATCCTGACGTTTCCTGTTCGCATCACGACGCGCACGGGCACGCTTTATTCCCCGGGACTTAAGCTCTGTTTGGTAACTGCGGAAACGCTCACGCACAAAACGCCAGGCTTTCGCTATCAGTTCATCCATACCCAGGGTATCCAGCCCCTGCTTTTTGCGCTGTTTGTTTTCCCATACCACACGGCTGCGGCGCGCAGCTGCCACTGCATCCTCAGACACATCAAGGGCGGCAAACAGCGCCAGTGTGAACGTGATATCGGTCGGAATGTAGCACCCGATAAGCGGGTCATATTCCGTCTGGTAAGTAATCAGTCCCAGCTCTGACAGGAACGTCAGGGCCCGGGTGGCCCGGGTGATGGAGAGTTTTCCTGCACCGGACTCTGTCGCCAGTCCGCACTCAATGGCCAGCGTGGTGATGGAACACTGGACGCGGTTGGCCAGCGGGTCATAATGGAAACACAGCCCCTGCAGCAGCGCATCAATAGCCCGTCGACGCAGCACCGGTGGCATGCGTCGACGCAGACCACGCGAACGGGCATGCGCCACATGAATAGCGAAATCAAAACGGGAGCTGAAGCCCACCGCTTTTTCCATCAGTTTTTCGCAGAACTTCAACGTTCCGGCACCTTCACGGGGTGTGAACACCGGATTCGGGTTCTTTACCTGGCGGTAATACGTTTGGTGAAGATCAGTCACACCATCCTGCACTTATGTTGCACAGAAGGAGTGAGCACAGAAAGAAGTCTTGAACTTTTCCGGTCATATAACTATACTCCCCGCATAGAGCAACAGCTTCTATGCAGTTTCTTGTTAGCCCCGGTAATCTTCTCTTAGTCGCCAAACCTGGTGAAGATTATCGGGGTTTTTGCTTTTCTGGCTCCTGTAGATCCACATCAGAACCAGTTCCCTGCCACCTTACGGCGTGGCCAGCTGCGTATTTTCATGAAAGGAGATCACTCAATAACTTCCATCGAGATCGGGTAATAACATTTGAACAGATCGCTGAATAACATCGATGGAGATCACTTTTGACTCATTTTGTTATTCAGTGATCTCCATCAATGTTATTGGAACTTCACAGGTGTGTTGATCTGTATCTTTTGCCATTCCGGTAAAGGATACCTATGCCAACAGTTCCAATTTCTATGAGAAAACTTAAAGAAATTCTTAGGCTTAAATACGGTGTTGGACTCAGCCATCGACAAATTGGTCGTAGTCTTGCAATCTCCCCTTCCGTTGTATCCAGATATGCTAATCGGGCGGCTCAACTTGGCATAAAGCAGTGGCCCTTACCTACAG
GATGGGATGATACAAAACTAAAACATGCGTTCCTTCAGACCCAGGTTAAGATGAAGAAGCACTCTCTGCCTGACTGGGCTACAGTACACCGGGAACTGCGTAATAAATGCGTGACGCTGCAGCTACTCTGGGAAGAATACTGTGAGCGTAATCCAGGCGGTTTTTACAGCTATAACCATTACTGCCGGATGTACCGTGAATGGCTCAAAACCACTTCACCATCAATGCGTCAGGTACATAAAGCTGGCGAAAAACTTTTCGTTGATTACTGTGGACCTACCGTTGGCGTTACCGACCCTGAGACCGGAGAAATAAGAACTGCTCAGGTCATCGTAGCTGTTCTCGGGGCATCAAGTTACACATGGGCAGAGGCCACCTGGTCTCAGCAGCTTGAAGACTGGGTGATGAGTCATGTTCGCTGCTTCCAGTGGTTGGGTGGCGTTCCTGAACTTGTTGTTCCGGACAATCTGAAAAGCGCCACATCCAGGGCATGTAAGTATGATCCTGACGTTAACCCTACCTACCAGCAGATGCTTGAGCATTATAATGTCGCAGTTTTGCCTGCGCGGCCACGTAAACCGAAAGATAAAGCCAAAGCTGAAGTTGGCGTTCAGGTTGTTGAACGCTGGATCATGGCCCGAATCAGGCATGAGATCTTCTACAGCCTTGCATCGCTTAATCAGCGCATTCGGGAGTTGCTGGAAAGACTGAATAACAAAATAATGCAGAAGTTGGGTTATTCACGTGCAGAACTCTTCATCCAGCTTGATAAACCCGCACTGAAGCCTCTTCCTGAAGCCAGTTACAGTTACACCCTGGTGAAGAAAGTCAGAGTTCATGCCGATTACCACGTGGAAATCGACAAACATTACTACTCGGTTCCATGTTCGCTGTTAGGCCAGCAACTGGAAGCATGGATCTCCGGAGAACTGGTAAGACTCTTCAATCAGGGGCAGGAGGTTGCTGTGCACCCGCGCAAGCGTACTTATGGCTACAGTACCCGCAACGAGCACATGCCTGAAGCTCATCGACAGCATGCCACCTGGACGCCAGAGCGTCTTCTGGAATGGGCGGGGCACATAGGCAGTGAAACTCATAGTTATGTGCTTCATATACTGAACTCTCGTCCACATCCGGAACAAAGCTATCGCTTCTGCCTTGGACTCCTGAACCTTCATAAAAAATACAGTAAAGCCAGACTTAATGCAGCATGTGCAAGAGCTCTGAAAACAAAGGTATGGCGTCTGTCAGGTATTAAATCGATCCTGGAAAAAGGTCTGGATAAACAACCTGTTCAGGATCCAAAACCAGATCTGTTATCCACGATGGAACACGAAAACGTACGCGGCAGTGAGTATTACCACTGA
Protein sequences of DBSCAN-SWA_1 >CP033400|13932:53228|18032_19271_-|AYP99992.1|DBSCAN-SWA MSERRYSPLATLFAATFLFRIGNAVAALALPWFVLSHTKSAAWAGATAASSVIATIIGAWVGGGLVDRFGRAPVALISGVVGGVAMASIPLLDAVGALSNTGLIACVVLGAAFDAPGMAAQDSELPKLGHVAGLSVERVSSLKAVIGNVAILGGPALGGAAIGLLGAAPTLGLTAFCSVLAGLLGAWVLPARAARTMTTTATLSMRAGVAFLWSEPLLRPLFGIVMIFVGIVGANGSVIMPALFVDAGRQVAELGLFSSMMGAGGLLGIAIHASVGARISAQNWLAVAFCGSAVGSLLLSQLPGVPVLMLLGALVGLLTGSVSPILNAAIYNRTPPELLGRVLGTVSAVMLSASPMVMLAAGAFVDLAGPLPGLVVSAVFAGLVALLSLRLQFATMAAAATASAPTHTEGEH >CP033400|13932:53228|51408_51546_+|AYQ00020.1|DBSCAN-SWA MKIIGVFAFLAPVDPHQNQFPATLRRGQLRIFMKGDHSITSIEIG >CP033400|13932:53228|36115_36757_-|AYQ00011.1|DBSCAN-SWA MTEIIKDLYQFTEVMEPIKLSMHQYLLMTNEPVLIQTGAVSQAQTTIPKLQELLGERKIKYILISHFESDECGGLALVLKEHPEAVAVCSETTARQLMGFGITNNVLIKKPNEIFAGDDFEFQTISYPSEMHMWEGLLFFEKKRGIFFSSDLMFGMGENHGQVIESSWDAAVKSSGADTLPNQESGQKLSSDLSEIEPKFVASGHGFCITIVG >CP033400|13932:53228|43215_44046_-|AYQ00059.1|DBSCAN-SWA MKNTIHINFAIFLIIANIIYSSASASTDISTVASPLFEGTEGCFLLYDASTNAEIAQFNKAKCATQMAPDSTFKIALSLMAFDAEIIDQKTIFKWDKTPKGMEIWNSNHTPKTWMQFSVVWVSQEITQKIGLNKIKNYLKDFDYGNQDFSGDKERNNGLTEAWLESSLKISPEEQIQFLRKIINHNLPVKNSAIENTIENMYLQDLDNSTKLYGKTGAGFTANRTLQNGWFEGFIISKSGHKYVFVSALTGNLGSNLTSSIKAKKNAITILNTLNL >CP033400|13932:53228|51686_53228_+|AYQ00021.1|transposase|DBSCAN-SWA MPTVPISMRKLKEILRLKYGVGLSHRQIGRSLAISPSVVSRYANRAAQLGIKQWPLPTGWDDTKLKHAFLQTQVKMKKHSLPDWATVHRELRNKCVTLQLLWEEYCERNPGGFYSYNHYCRMYREWLKTTSPSMRQVHKAGEKLFVDYCGPTVGVTDPETGEIRTAQVIVAVLGASSYTWAEATWSQQLEDWVMSHVRCFQWLGGVPELVVPDNLKSATSRACKYDPDVNPTYQQMLEHYNVAVLPARPRKPKDKAKAEVGVQVVERWIMARIRHEIFYSLASLNQRIRELLERLNNKIMQKLGYSRAELFIQLDKPALKPLPEASYSYTLVKKVRVHADYHVEIDKHYYSVPCSLLGQQLEAWISGELVRLFNQGQEVAVHPRKRTYGYSTRNEHMPEAHRQHATWTPERLLEWAGHIGSETHSYVLHILNSRPHPEQSYRFCLGLLNLHKKYSKARLNAACARALKTKVWRLSGIKSILEKGLDKQPVQDPKPDLLSTMEHENVRGSEYYH >CP033400|13932:53228|44874_45579_-|AYQ00015.1|transposase|DBSCAN-SWA 
MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP033400|13932:53228|13932_14946_+|AYP99986.1|integrase|DBSCAN-SWA MKTATAPLPPLRSVKVLDQLRERIRYLHYSLRTEQAYVHWVRAFIRFHGVRHPATLGSSEVEAFLSWLANERKVSVSTHRQALAALLFFYGKVLCTDLPWLQEIGRPRPSRRLPVVLTPDEVVRILGFLEGEHRLFAQLLYGTGMRISEGLQLRVKDLDFDHGTIIVREGKGSKDRALMLPESLAPSLREQLSRARAWWLKDQAEGRSGVALPDALERKYPRAGHSWPWFWVFAQHTHSTDPRSGVVRRHHMYDQTFQRAFKRAVEQAGITKPATPHTLRHSFATALLRSGYDIRTVQDLLGHSDVSTTMIYTHVLKVGGAGVRSPLDALPPLTSER >CP033400|13932:53228|22464_22656_+|AYP99997.1|DBSCAN-SWA MAIALMGAGFSATDTSDAVNILYPITMTVQANKAWQASGLKKSFFLPSHKKVRGGEKISYARL >CP033400|13932:53228|48123_48456_-|AYQ00016.1|DBSCAN-SWA MERGEIWLVSLDPTAGHEQQGTRPVLIVTPAAFNRVTRLPVVVPVTSGGNFARTAGFAVSLDGVGIRTTGVVRCDQPRTIDMKARGGKRLERVPETIMNEVLGRLSTILT >CP033400|13932:53228|37329_37890_+|AYQ00012.1|DBSCAN-SWA MTGQRIGYIRVSTFDQNPERQLEGVKVDRAFSDKASGKDVKRPQLEALISFARTGDTVVVHSMDRLARNLDDLRRIVQTLTQRGVHIEFVKEHLSFTGEDSPMANLMLSVMGAFAEFERALIRERQREGIALAKQRGAYRGRKKSLSSERIAELRQRVEAGEQKTKLAREFGISRETLYQYLRTDQ >CP033400|13932:53228|22213_22459_-|AYP99996.1|DBSCAN-SWA MQAYRAMVNSYSLSDDSGVMAAAAITHFLFGQAVFSYLNGWSVLIGPGTGLDSTGCKYARDLMGLVAFTAFIVTFLFRGYS >CP033400|13932:53228|15347_16052_-|AYP99988.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP033400|13932:53228|27809_28670_+|AYQ00003.1|DBSCAN-SWA MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLSRVDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW 
>CP033400|13932:53228|41879_42584_+|AYQ00014.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP033400|13932:53228|19267_20173_-|AYP99993.1|DBSCAN-SWA MTVVTTADTSQLYALAARHGLKLHGPLTVNELGLDYRIVIATVDDGRRWVLRIPRRAEVSAKVEPEARVLAMLKNRLPFAVPDWRVANAELVAYPMLEDSTAMVIQPGSSTPDWVVPQDSEVFAESFATALAALHAVPISAAVDAGMLIRTPTQARQKVADDVDRVRREFVVNDKRLHRWQRWLDDDSSWPDFSVVVHGDLYVGHVLIDNTERVSGMIDWSEARVDDPAIDMAAHLMVFGEEGLAKLLLTYEAAGGRVWPRLAHHIAERLAFGAVTYALFALDSGNEEYLAAAKAQLAAAE >CP033400|13932:53228|30857_31661_-|AYQ00007.1|DBSCAN-SWA MNRTNIFFGESHSDWLPVRGGESGDFVFRRGDGHAFAKIAPASRRGELAGERDRLIWLKGRGVACPEVINWQEEQEGACLVITAIPGVPAADLSGADLLKAWPSMGQQLGAVHSLSVDQCPFERRLSRMFGRAVDVVSRNAVNPDFLPDEDKSTPQLDLLARVERELPVRLDQERTDMVVCHGDPCMPNFMVDPKTLQCTGLIDLGRLGTADRYADLALMIANAEENWAAPDEAERAFAVLFNVLGIEAPDRERLAFYLRLDPLTWG >CP033400|13932:53228|48807_49461_-|AYQ00018.1|protease|DBSCAN-SWA MIQTRNQYLQFMLVMLAAWGISWGARFVMEQAVLLYGSGKNYLFFSHGTVLMYLLCVFLVYRRWIAPLPVVGQLRNVGVPWLVGAMAVVYVGVFLLGKALALPAEPFMTKLFADKSIPDVILTLLTIFILAPLNEETLFRGIMLNVFRSRYCWTMWLGALITSLLFVAAHSQYQNLLTLAELFLVGLITSVARIRSGGLLLPVLLHMEATTLGLLFG >CP033400|13932:53228|48457_48715_-|AYQ00017.1|DBSCAN-SWA MHTTRLKRVGGSVMLTVPPALLNALSLGTDNEVGMVIDNGRLIVEPYRRPQYSLAELLAQCDPNAEISAEEREWLDAPATGQEEI >CP033400|13932:53228|23692_24553_-|AYP99999.1|DBSCAN-SWA MHTRKAITEALQKLGVQTGDLLMVHASLKAIGPVEGGAETVVAALRSAVGPTGTVMGYASWDRSPYEETLNGARLDDEARRTWLPFDPATAGTYRGFGLLNQFLVQAPGARRSAHPDASMVAVGPLAETLTEPHELGHALGEGSPVERFVRLGGKALLLGAPLNSVTALHYAEAVADIPNKRWVTYEMPMLGRDGEVAWKTASDYDSNGILDCFAIEGKPDAVETIANAYVKLGRHREGVVGFAQCYLFDAQDIVTFGVTYLEKHFGTTPIVPPHEAVERSCEPSG >CP033400|13932:53228|25767_26634_+|AYQ00001.1|transposase|DBSCAN-SWA 
MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWVLATNLPVEIRTPKQLVKALLQS >CP033400|13932:53228|26667_27372_-|AYQ00002.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIINATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP033400|13932:53228|28819_29245_+|AYQ00004.1|transposase|DBSCAN-SWA MAEQGKELPGYVQREFEEFLQCGRLEHGFLRVRCESCHAEHLVAFSCKRRGFCPSCGARRMAESAALLVDEVLPEQPMRQWVLSFPFQLRFLFGVVCGKGRNPTLRLWPAIFSGEIDVFPGDRRRALLQIVGGDKLIIPFC >CP033400|13932:53228|29256_29961_+|AYQ00005.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP033400|13932:53228|31721_32537_-|AYQ00008.1|DBSCAN-SWA MNKSLIIFGIVNITSDSFSDGGRYLAPDAAIAQARKLMAEGADVIDLGPASSNPDAAPVSSDTEIERIAPVLDALKADGIPVSLDSYQPATQAYALSRGVAYLNDIRGFPDAAFYPQLAKSSAKLVVMHSVQDGQADRREAPAGDIMDHIAAFFDARIAALTGAGIKRNRLVLDPGMGFFLGAAPETSLSVLARFDELRLRFDLPVLLSVSRKSFLRALTGRGPGDVGAATLAAELAAAAGGADFIRTHEPRPLRDGLAVLAALKETARIR >CP033400|13932:53228|15192_15402_+|AYP99987.1|DBSCAN-SWA MVAAGGLRHGAGFPVRQADAGGGLRRLRQPMEEHRHCCKVSDEAAFCLIQRPYISKTLLTRRISPRGSP >CP033400|13932:53228|32844_33696_-|AYQ00009.1|DBSCAN-SWA MVKPKNKHSLSHVRHDPAHCLAPGLFRALKRGERKRSKLDVTYDYGDGKRIEFSGPEPLGADDLRILQGLVAMAGPNGLVLGPEPKTEGGRQLRLFLEPKWEAVTADAMVVKGSYRALAKEIGAEVDSGGALKHIQDCIERLWKVSIIAQNGRKRQGFRLLSEYASDEADGRLYVALNPLIAQAVMGGGQHVRISMDEVRALDSETARLLHQRLCGWIDPGKTGKASIDTLCGYVWPSEASGSTMRKRRQRVREALPELVALGWTVTEFAAGKYDITRPKAAG >CP033400|13932:53228|50400_51258_-|AYQ00019.1|DBSCAN-SWA 
MTDLHQTYYRQVKNPNPVFTPREGAGTLKFCEKLMEKAVGFSSRFDFAIHVAHARSRGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITTLAIECGLATESGAGKLSITRATRALTFLSELGLITYQTEYDPLIGCYIPTDITFTLALFAALDVSEDAVAAARRSRVVWENKQRKKQGLDTLGMDELIAKAWRFVRERFRSYQTELKSRGIKRARARRDANRKRQDIVTLVKRQLTREISEGRFTANREAVKREVERRVKERMILSRNRNYSRLATASP >CP033400|13932:53228|23137_23680_-|AYP99998.1|DBSCAN-SWA MIIWINGPFGAGKTTLAKRLRDRRSKSLIFDPEEIGFVVKETVPMPASGDYQDLPLWRGLTIAAVREIRRNYSQDIIIPMTLVHPDYLTEILDGVRRIDDQLLHIFLTLNEDLLRHRIANQTMHPDPNRNAEIREWRLANVARCLAARERLPCTTRVLDSGAHTSDELAAMVLDGIDGRT >CP033400|13932:53228|16191_16956_+|AYP99989.1|transposase|DBSCAN-SWA MTDFKWRHFQGDVILWAVRWYCRYPISYRDLEEMLAERGISVDHTTIYRWVQCYAPEMEKRLRWFWRRGFDPSWRLDETYVKVRGKWTYLYRAVDKRGDTIDFYLSPTRSAKAAKRFLGKALRGLKHWEKPATLNTDKAPSYGAAITELKREGKLDRETAHRQVKYLNNVIEADHGKLKILIKPVRGFKSIPTAYATIKGFEVMRALRKGQARPWCLQPGIRGEVRLVERAFGIGPSALTEAMGMLNHHFAAAA >CP033400|13932:53228|40510_41371_-|AYQ00013.1|DBSCAN-SWA MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLSRVDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW >CP033400|13932:53228|30021_30858_-|AYQ00006.1|DBSCAN-SWA MFMPPVFPAHWHVSQPVLIADTFSSLVWKVSLPDGTPAIVKGLKPIEDIADELRGADYLVWRNGRGAVRLLGRENNLMLLEYAGERMLSHIVAEHGDYQATEIAAELMAKLYAASEEPLPSALLPIRDRFAALFQRARDDQNAGCQTDYVHAAIIADQMMSNASELRGLHGDLHHENIMFSSRGWLVIDPVGLVGEVGFGAANMFYDPADRDDLCLDPRRIAQMADAFSRALDVDPRRLLDQAYAYGCLSAAWNADGEEEQRDLAIAAAIKQVRQTSY >CP033400|13932:53228|20294_20999_-|AYP99994.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP033400|13932:53228|44176_44731_-|AYQ00060.1|DBSCAN-SWA 
MTNSNDSVTLRLMTEHDLAMLYEWLNRSHIVEWWGGEEARPTLADVQEQYLPSVLAQESVTPYIAMLNGEPIGYAQSYVALGSGDGRWEEETDPGVRGIDQLLANASQLGKGLGTKLVRALVELLFNDPEVTKIQTDPSPSNLRAIRCYEKAGFERQGTVTTPYGPAVYMVQTRQAFERTRSDA >CP033400|13932:53228|51250_51325_-|AYQ00061.1|DBSCAN-SWA MTGKVQDFFLCSLLLCNISAGWCD >CP033400|13932:53228|34451_35156_-|AYQ00010.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP033400|13932:53228|21035_22163_-|AYP99995.1|DBSCAN-SWA MSQLSQLRSPAAVQAAIDEFVQLGRTKFLARHGYGKSRDFLVRDPKTGTDCDSKAIAGVAFGKQFPEQGPLTADSFSGGEATVVPALTRLGFRIIRIGEDWSEEEVLATVEDYFDMLRAEAAGEPYNKSEHNQALRQLLNGRSKSSVELKHQNISAVLDALGLPYINGYKPRGNSQLLLRKSVHAYVLEHQQTVGALVDALEEVKLPGDKTYRAALVEPPAREVLVRTPASLRQRLPRKFDYAARDEANRKLGRAGEQWVIGYEQQRLTELGHPELFQRLDWVSDTQGDGAGFDILSFEEDAHERFIEVKTTNGGVGSSFLVSHNELEFSKEAGDQFHLYRVFQFRDGPRLFTLPGDLSQHVHLKPTGTVANSRW >CP033400|13932:53228|17448_18033_-|AYP99991.1|DBSCAN-SWA MPRPKLKSDDEVLEAATVVLKRCGPIEFTLSGVAKEVGLSRAALIQRFTNRDTLLVRMMERGVEQVRHYLNAIPIGAGPQGLWEFLQVLVRSMNTRNDFSVNYLISWYELQVPELRTLAIQRNRAVVEGIRKRLPPGAPAAAELLLHSVIAGATMQWAVDPDGELADHVLAQIAAILCLMFPEHDDFQLLQAHA >CP033400|13932:53228|17146_17503_-|AYP99990.1|DBSCAN-SWA MFNVSRTRRFPTPPGTCVNGGVQSPCGRRRTRPSSISTGTVGGIEVLITTQVYGLQMVTIPILASEHRADSLTVNHSRKVVPVWVSRVETILRQCAQQTTWPSQVVGSGVADELDKLV |
38 | Escherichia_phage(53.33%) | protease,integrase,transposase | attL 9584:9643|attR 27116:27675 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
83090 : 90520
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP033400|83090:90520|DBSCAN-SWA GCTATATTTTTCCGCCTACGCCTTTAAACTTCTCAATAAACGAGACGATTTTCTGGAAAACTGCCTGTTTTTTCGTTTTATATTGCGGGTTTAACGGACTAAGTTTTGGTAATGTCTCGTTTAATTCTGTGCCATTTTCGGTGGCGTATTCGCGTTTTAAAGACGTGCGAATATAGCGTTTCGCCGCCTCTTCATTGAGATTTTCTTCTTTAATCAACGCTTCTGCTTCACGTAGCTGTTCGTGTTGAGCAAAGGTAAAGAACGCGTCAATGATGCTGGCTTTGTCTGGTAAATCATCCAGGTTCGTTTGCTGAATAAAATCGACCACCAGGCCCTCTTTCGCCCGGTTCCCCAGGCTTGAACGAATTAAGCGTTTGACCTCTTCGATCATTTCGCCCTTGCCTTTATTTTGTCTGTTATGTTCGAAAATCAGTCCAAGGATATAATCCAGGTTTATTTCCTGAGACTTCAGCAAATCGACCTCAAAAACCACGTCATCCCAGTCAGTGGTTGATTTCTCTTTTTTCTCAGCTTCTTTCTCACGGCGCTGCCAGTCGCGAATATCGTTATAGGCAGAACGATAATCCTGAATCTTGCGATCAGCAGGGAGACGAATTGTTTGCAATTCAGCGAGCTTTTCATCATCCACATAATGTTCTGCTTTGAATTTTTCTACTGCAACAGGATCGCTAAGATCGATTTGTTGCAGGGCTTTCAGCGTGGCAAATTCATCATAGTTTTGCAGGATGTTCTCGGTACGCAGGTATTCACCAAACAGTTTTACGAAGTCTTTCTTCTCTTTTTCACTTTCAATACTGGCAGGGTCAGGGAACCGTTGTTCCAGTTCTGCAACTACTGCCATAAAGCCGCGCTTAGCTTCACCAGTAGCAGCATCAGTAAAGCCTTCCATATACTCTGCATAACTCTTTTCTAACACTACATTTTTGGTGTTTTTGTCACCAAACAGCGTTATGGCATCAATAGTTGAGCGTTCCAGATCCCGGAAAGTGACGATATTACCGAAGGTTTTAGTGGCGTCATAAATGCGGTTGGTGCGGGAAAATGCCTGCATCAGGCCGTGAAAACGTAAGTTTTTATCGACGAATAGCGTGTTCAATGTTGGAGCATCGAAGCCGGTTAAGAACATCCCCACGACAATTAGCAGATCGATATCCTGATTTTTAACCCGCTGGGCTAAATCACGATAGTAGTTCTGAAAACCGTTACTGTCGGTGCTAAAGTTAGTTTTAAAATGGCTGTTATATTCACGAATTGCAGCGTCCAGAAACTCTTTAGCACTGCTGTCCATTGCGCTGGTATCAAAAGTTTCATCGGAAATCTCACCAATGGCATTTTGTTCTTCATTGGCGGCAAAGGAGAAGATTGTCGCAATACGCAGCGGTTTATAGGTAGCCGATTTATTAGCGGCTTCTTCTTGTAACCGTTTAAACGTCGCATAATAGGCTTTCGCGGCATCCACGCTGCTCACTGCCAACATAGCATTAAAACCTTTTGAGCCAGGGAAGGTACGGTGAGTTTTCTGGCGGAAATTATTCAGAATATATTGCGTAATTTCCTGTATACGCATGGGATGAAGAAACGCCTGCTGATTTTCAGCCGCACTCAGTTTTTTCTCGTCGGTTTCTGTCTCTAAAGACTTAAACTGTGGCCGCACATCGTTGTAGTCCACCTTGAATTTAAGCACTTTTTCATCTCGAATCGCATCGGTAATAACATATGAATGCAATTCACGACCAAATACGCTGGCGGTTGTTTCTGAGCCTAAGGCGTTTTCCGGGAAAATAGGGGTACCGGTAAAACCAAACTGATAATAGCGTTTGAATTTCTTCTTCAGGTTTTTCTGTGCTTCTCCAAACTGGCTGCGGTGGCATTCATCAAATATAAACACCACTTGCTGATTGTATACAGG
CAGGTCGCTTTCTGCTTTCATCAGGTTATTGAGTTTCTGAATAGTGGTGACGATAATTTTGTTATCATCCTTATCCAGATTTCGTTTAAGACCTGCGGTATTTTCCGATCCATTGACACTGTCTGGCGAAAAACGCTGATATTCCTTCATGGTCTGGTAATCGAGGTCTTTCCTGTCGACCACAAAGAAGACTTTATCAATAAAGTCCAGTTCTGTTGCCAGACGCGCGGCTTTAAAGCTGGTCAGGGTTTTACCAGAACCGGTAGTGTGCCAGATAAAGCCACCACTTTCGGGGGTAGACCAGTTTTTCGCTTTATAGGAGCTGTTGATTTTCCATAAGATACGCTCAGTGGCGGCAATCTGGTACGGTCGCATCACCAGTAGCGTCTGGCTACTGTCAAAAACGCTGTAGTTCACCAGAACATTCAGCAGAGTATGTTTCTGGAAAAAGGTAGCGGTAAAGTCTTTGAGGTCTTTAATCAGCGTGTTGTCTGATTTTGCCCAATTCATGGTGAAGTCAAAACTGTTTTTATCGCGCTTTGTCGTGTTGGCAAAATAATGGGTATCGGTGCCGTTAGAAATGACAAACAGTTGCAGATACTTAAACAGGGAATTTTCGCTGTTAAAACTCTCTTTACTGTAACGATGTATCTGGTTGAAAGCCTCACGAATCGCCACCCCGCGTTTTTTTAGTTCGATTTGCACCAGCGGTAAACCATTAACCAGGATCGTGACGTCATAACGGTTAGCATGAGAACCCGTCTGTTCAAACTGCTGGATAATCTGCACCTTATTGCGCATGAGATTCTTTTTATCTATCAAATAGATGTTCTCAAGACGCTCGTCATCAAAAATAAAGTCGCAAATATAGTCGATATGGATTTTACGGGTCTTATCCAGAATGCCATCGCTCGGGTTATCCAGATACTGCTCCGTGAAACGCCGCCACTCGCTGTCATTAAACACCACACCATTGAGGTTCTGAAGCTGTTCCCGAACATTGGCCAGCATCGCCGACTGTGATTTTACGGATATAAATTCATAGCCCTGATTCCGCAGGTCCTGAATCAGTTCTCGTTCCAGGTCCGATTCGCTCTGGTAGCTGTCGCCTGTTGGCTCAGCTTTGATGTACTTATCAAGGACGATAAAGTTATTGGATTCAGCAATGGTGTGTGTCTGATGAGTCATAGCGCATCCTTTGTGCCGTCTGGCAAGGGCCGGAAGGGAGTTAAGGGTGACTTCCGGCACGTAAAAAATAGTCTATATACAGACCGGATGTTAAGGTGGCCCGGTCGGTAGCAACGGTCAATTAATTACTGACAGTTTCAGGTTTTGGGAAACTGAACAGTAAATCACGGTAGTATTCGTATTGTTTCTGGCGCAACTCGATTTCACGCGGAAGACCTTCGGTGATGGAGTTAGTCAGTGTGTCGAATTTGTCGAGTATTTCGACAATGCGAGCTTGTTCCTTAAGTGATTTTTCGTGATCTTTAGGATATGGAACCGGAATCATAATTTTTGAAAAACCATTAATGAGTAGTGTATTAACTTTTGTTCTGGCTACATACTTTGCTTTTTCAGAAATAAACGAATCGGTTTGCATGTAATAGGAAATAAATTTTGGATTCAAAGAATGTCGAAAAGCATAACAGTGATCATGAATAGCGATATCGTCATCCCCAAGCCATGCCACTGCTTTACCAACGTCTTCTACAGTCTCCCCCACGTCAGTTATCACGACATCTCCATGTTTGGCATAGCGTAATGACGCTGCCATATCAGCTCTAACCTGTGATAACGAATGAGTTGTGTAAACACCATATCGTGTATATATCTCACCATAATGGATTACACTGATACCACCATCTTCTACATAATCTGCTTTAGTAAAACGTTTTCCACGAATAAACTCACCAATTTCCCCCAAAGCTTTCCACTCAACCTCACCTTCTTTAAAAGTCAGCAATTGGTCGCGGTAGTAGTTGTACTGTTTTTTACGC
ATGTTAAGCTCAGCGGTAAGCTCAGCGGTAAGCTCAGCGGTAAGTGCAGTAAACTTATCCAGAATCCGAACGATTTCAGACTGGATGGCAAGGGACTTTTCCGGATTATCCGGGCAGGGGATGGGGATCTTAATATTTTTGACAATCTGCGCATTTATGTTTGTCTGGGACCCTGTTCCAAGGGATTTGATATATGTGTATTGGCTACACAAGAAGTGAAATACATATCTATAATGAGCAACTTCTTCATTAAGTTGAATATTTGCGCACGCTTGATTTGTTGTCATTGGAATTTTGTTTATGCCGATTTTCCCCACAGTTGCCCCATACATAGCAACAATGACACAATTCTTTGGTATCCATTTTGCACTAGAGTTTTTAACTCCAGACTCAGTTATTTTTACCTCGGTATCCCATATATCACAAAAGTTTACTTCTTGAGTTCTCAACCAAGGAATGTCGCCATCATAAAATTCTGATACGCCAGTTTTAGGGGTTCCTCCAGATGATATCTTTATAGAAATATCCTCAAGGGTTTTCCACTCAACCTCAACCCCATCCAGCAATTTTTCCAGATAACTCAACTCGCTCATTTCTGCACCTCGCAGCCTTCAATTTCAGCCACAATCGCATCAATATCTTTACGCAACTGGTCGATTTTGCTGACCGTAGTTTTAAGCTCTGCATTTAGCTCAGCAATATTGATAATTTCGCGGTTATCTTTCGCTTCTACATAGCAGCTCACCGACAGGTTATAGTCATTAGCGACAACGGTCTCAAACGCGACAGATTTCGCCAGATGAGCAACATCTTCCTTGCTGGCAAATACCTGCATAATCTGTTCGATATGGGCATCGGTCAGGATATTGTTGTTGGTCTCTTTTTTGAATAGTTCGCTGGCATCAATAAACTGAACTTTGGTATCCGTTTTATGTTTAGACAACACCAGAATATTGACGGCAATGGTGGTGCCAAAGAACAGGTTCGGTGCGAGTGAAATCACGGTTTCGACATAGTTATTGTCAACCAGATACTGACGGATTTTCTGCTCCGCGCCGCCACGGTAAAAAATGCCCGGGAAGCAGACAATCGCAGCACGACCTTTGGCAGAAAGATAGTTCAGCGCATGTAATACAAACGCAAAGTCAGCTTTGGATTTGGGGGCCAGAACGCCAGCCGGGGCAAAACGTTCATCGTTAATCAGCGTCGGGTCATCGCTGCCAATCCATTTCACCGAATACGGCGGGTTAGAAACGATGGCATCAAACGGTTTTTCATCTCTGAAGTGCGGTTCAGTCAGCGTATTGCCCAGCTTGATATCAAACTTGTCGTAGTTGATGTTGTGCAAAAACATGTTCATACGCGCCAGGTTATAGGTCGTATGGTTGATTTCCTGACCAAAAAAACCTTCTTCGATGATATGGTTATCAAACTGTTTTTTCGCCTGCAACAACAGTGAGCCGGAACCCGCTGCCGGGTCGTAGATTTTGTTAACGCTGGTCTGCCCGTGCATAGCCAGTTGTGCAATCAGCCTGGAGACGTGCTGCGGTGTAAAGAACTCACCGCCTGACTTACCGGCATTTGCCGCATAGTTAGAAATCAGGAACTCATAGGCATCACCGAACAGGTCAATCTGATGTTCGTTGAAGTCACCAAGTTTTAACCCTTCAACCCCTTTCAGAACCGCAGCCAGGCGGGCATTTTTATCTTTAACGGTGTTACCCAGGCGGTTACTGGTGGTATCGAAATCAGCAAACAAACCTTTGATGTCAGCTTCTGAAGGATAACCGTAAGCAGAACTTTCGATAGCAACGAAGATGCTGTTTAAATCTGCATTCAATCTGTCATTGGTATTTGCTTTCGCAGCTACGTTGCAGAAAAGCTGGCTTGGGTAGATGAAGTAGCCTTTGGTTTTGATGGCATCGTCTTTAATGTCATCAGTAATTACGCTGTCATCCAGTTTCGCATAACAGATACTGTCATCACCGGCTTCAATATAA
CTGGAAAAATTTTCGCTGATAAAACGGTAGAAAAGTGCGCCCAGAACGTATTGCTTAAAATCCCATCCATCGACCGAACCCCTGACATCGTTAGCAATTTGCCAGATTTGACGATGAAGCTCTGCACGTTGTTGAATACTTGTCATTTTCATCCACTTATTTCAGGCTTATGTAATTGGCGGTGATTCTACAGCAACTTGGATGCTTTAGCAGTTCGGACATTAGGCTACGAATGACCTGCCTAGAGGTTTGTTAAGCCGCAAAGTGCTGGTGCTTTATGCCTGTGAAGTTTATAATTGTGTACACATAACGAGTACACGAGGTGTTTATGCAATCCATTAACTTCCGTACCGCGCGCGGCAACCTTTCTGAAGTGCTCAACAATGTTGAGGCCGGGGAAGAGGTTGAAATCACCCGCAGAGGCCGTGAGCCAGCAGTAATTGTCAGCAAGGCTACTTTCGAAGCCTACAAAAAAGCGGCGCTGGATGCTGAATTTGCATCCCTGTTTGACACCCTGGACTCCACCAACAAGGAACTGGTTAACCGATAATGAGGCATATATCACCGGAAGAACTTATTGCGCTTCATGATGCGAATATAAACCGCTACGGCGGCCTGCCGGGAATGTCTGATCCGGGCAGGGCAGAGGCCATTATCGGGAGAGTTCAGGCCAGAGTTGCCTACGAAGAGATCACCGACCTTTTCGAAGTCTCCGCCACCTACCTAGTGGCTACAGCGAGAGGGCATATATTCAATGATGCCAATAAGCGTACCGCGCTAAACAGTGCGCTGTTATTTCTACGCCGTAACGGGGTGCAGGTATTTGATTCACCTGAACTGGCAGACCTTACCGTAGGGGCTGCGACCGGAGAGATATCTGTATCTTCTGTCGCCGACACGTTACGTAGATTGTATGGTTCTGCGGAGTAGATTAATGGCACGCAAATACAACAAATTGTCCCGTGAAGCGTTAAAGATGCTTCTTGATGGCGTGAGTCGCCGCGAGGTAAAGCAATACCTGGCTGGTAAGCAAATTGGTGCCAGGACCGCTATTGCTGTGTTATGCCGTCAGGAAATGGTTGTGCTTAAACAGAGAATGCCGGGCAGCAGATAAAGCCCAATCAGTGATGAAAGGTGTGATGTGAAAGCCGTAATTACTCCCTTTGTACAAAAAGAGCTTGGCGTCGCCACATTCAAAGTGGATCAGGAAGTCAGAAAGCTGGTGGAGGCTGGCCGTAAATTTATTATGGAGCCGGTGCCGCGTGAGTTAATCGAGCACATGGACGACGGCCTCGTTGTTTCCGAGCAAACTATGGCAACAAATGAGGCGTTGCAGCCGTTTTTTAACAGCGATGAACTGTTTCGCCGTATTGGTGGAATTGACTCGCTGGTAGCGTGGTTGCGCAGGAAAGAGGGGCAGGTCAGGCTGGGCCATGTCGTTCTCGCCGGACAGCAGGCTTACCATGAAAGCACTGGAAATGGCATGGGAAACCCGTGGTAA
Protein sequences of DBSCAN-SWA_2 >CP033400|83090:90520|89347_89569_+|AYQ00053.1|DBSCAN-SWA MQSINFRTARGNLSEVLNNVEAGEEVEITRRGREPAVIVSKATFEAYKKAALDAEFASLFDTLDSTNKELVNR >CP033400|83090:90520|86328_87612_-|AYQ00052.1|DBSCAN-SWA MSELSYLEKLLDGVEVEWKTLEDISIKISSGGTPKTGVSEFYDGDIPWLRTQEVNFCDIWDTEVKITESGVKNSSAKWIPKNCVIVAMYGATVGKIGINKIPMTTNQACANIQLNEEVAHYRYVFHFLCSQYTYIKSLGTGSQTNINAQIVKNIKIPIPCPDNPEKSLAIQSEIVRILDKFTALTAELTAELTAELNMRKKQYNYYRDQLLTFKEGEVEWKALGEIGEFIRGKRFTKADYVEDGGISVIHYGEIYTRYGVYTTHSLSQVRADMAASLRYAKHGDVVITDVGETVEDVGKAVAWLGDDDIAIHDHCYAFRHSLNPKFISYYMQTDSFISEKAKYVARTKVNTLLINGFSKIMIPVPYPKDHEKSLKEQARIVEILDKFDTLTNSITEGLPREIELRQKQYEYYRDLLFSFPKPETVSN >CP033400|83090:90520|89953_90133_+|AYQ00055.1|DBSCAN-SWA MARKYNKLSREALKMLLDGVSRREVKQYLAGKQIGARTAIAVLCRQEMVVLKQRMPGSR >CP033400|83090:90520|87608_89165_-|AYQ00065.1|DBSCAN-SWA MTSIQQRAELHRQIWQIANDVRGSVDGWDFKQYVLGALFYRFISENFSSYIEAGDDSICYAKLDDSVITDDIKDDAIKTKGYFIYPSQLFCNVAAKANTNDRLNADLNSIFVAIESSAYGYPSEADIKGLFADFDTTSNRLGNTVKDKNARLAAVLKGVEGLKLGDFNEHQIDLFGDAYEFLISNYAANAGKSGGEFFTPQHVSRLIAQLAMHGQTSVNKIYDPAAGSGSLLLQAKKQFDNHIIEEGFFGQEINHTTYNLARMNMFLHNINYDKFDIKLGNTLTEPHFRDEKPFDAIVSNPPYSVKWIGSDDPTLINDERFAPAGVLAPKSKADFAFVLHALNYLSAKGRAAIVCFPGIFYRGGAEQKIRQYLVDNNYVETVISLAPNLFFGTTIAVNILVLSKHKTDTKVQFIDASELFKKETNNNILTDAHIEQIMQVFASKEDVAHLAKSVAFETVVANDYNLSVSCYVEAKDNREIINIAELNAELKTTVSKIDQLRKDIDAIVAEIEGCEVQK >CP033400|83090:90520|90160_90520_+|AYQ00056.1|DBSCAN-SWA MKAVITPFVQKELGVATFKVDQEVRKLVEAGRKFIMEPVPRELIEHMDDGLVVSEQTMATNEALQPFFNSDELFRRIGGIDSLVAWLRRKEGQVRLGHVVLAGQQAYHESTGNGMGNPW >CP033400|83090:90520|89568_89949_+|AYQ00054.1|DBSCAN-SWA MRHISPEELIALHDANINRYGGLPGMSDPGRAEAIIGRVQARVAYEEITDLFEVSATYLVATARGHIFNDANKRTALNSALLFLRRNGVQVFDSPELADLTVGAATGEISVSSVADTLRRLYGSAE >CP033400|83090:90520|83090_86207_-|AYQ00051.1|DBSCAN-SWA 
MTHQTHTIAESNNFIVLDKYIKAEPTGDSYQSESDLERELIQDLRNQGYEFISVKSQSAMLANVREQLQNLNGVVFNDSEWRRFTEQYLDNPSDGILDKTRKIHIDYICDFIFDDERLENIYLIDKKNLMRNKVQIIQQFEQTGSHANRYDVTILVNGLPLVQIELKKRGVAIREAFNQIHRYSKESFNSENSLFKYLQLFVISNGTDTHYFANTTKRDKNSFDFTMNWAKSDNTLIKDLKDFTATFFQKHTLLNVLVNYSVFDSSQTLLVMRPYQIAATERILWKINSSYKAKNWSTPESGGFIWHTTGSGKTLTSFKAARLATELDFIDKVFFVVDRKDLDYQTMKEYQRFSPDSVNGSENTAGLKRNLDKDDNKIIVTTIQKLNNLMKAESDLPVYNQQVVFIFDECHRSQFGEAQKNLKKKFKRYYQFGFTGTPIFPENALGSETTASVFGRELHSYVITDAIRDEKVLKFKVDYNDVRPQFKSLETETDEKKLSAAENQQAFLHPMRIQEITQYILNNFRQKTHRTFPGSKGFNAMLAVSSVDAAKAYYATFKRLQEEAANKSATYKPLRIATIFSFAANEEQNAIGEISDETFDTSAMDSSAKEFLDAAIREYNSHFKTNFSTDSNGFQNYYRDLAQRVKNQDIDLLIVVGMFLTGFDAPTLNTLFVDKNLRFHGLMQAFSRTNRIYDATKTFGNIVTFRDLERSTIDAITLFGDKNTKNVVLEKSYAEYMEGFTDAATGEAKRGFMAVVAELEQRFPDPASIESEKEKKDFVKLFGEYLRTENILQNYDEFATLKALQQIDLSDPVAVEKFKAEHYVDDEKLAELQTIRLPADRKIQDYRSAYNDIRDWQRREKEAEKKEKSTTDWDDVVFEVDLLKSQEINLDYILGLIFEHNRQNKGKGEMIEEVKRLIRSSLGNRAKEGLVVDFIQQTNLDDLPDKASIIDAFFTFAQHEQLREAEALIKEENLNEEAAKRYIRTSLKREYATENGTELNETLPKLSPLNPQYKTKKQAVFQKIVSFIEKFKGVGGKI |
7 | Escherichia_phage(57.14%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
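
The keyword rule described for the prophage regions can be illustrated with a minimal sketch. It assumes the protein FASTA headers follow the pipe-separated layout seen in this report (contig, region, `start_end_strand`, accession, an optional annotation, then the source tag) and that matching is a case-insensitive substring test. The function names `parse_header` and `matched_keywords` are hypothetical, and the report does not state the tool's actual matching rules.

```python
# Keyword list copied from the report; the matching rule itself is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def parse_header(header):
    """Split one protein FASTA header into named fields (hypothetical helper).

    Assumed layout: >contig|region|start_end_strand|accession[|annotation]|source
    """
    fields = header.lstrip(">").strip().split("|")
    contig, region, coords, accession = fields[:4]
    # The annotation field is only present when the header has six fields.
    annotation = fields[4] if len(fields) == 6 else None
    start, end, strand = coords.split("_")
    return {
        "contig": contig,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "accession": accession,
        "annotation": annotation,
    }


def matched_keywords(annotation):
    """Return the phage-related keywords found in an annotation string."""
    if annotation is None:
        return []
    lowered = annotation.lower()
    # Substring matching is an assumption and may over-match short keywords.
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


if __name__ == "__main__":
    examples = [
        ">CP033400|13932:53228|51686_53228_+|AYQ00021.1|transposase|DBSCAN-SWA",
        ">CP033400|13932:53228|18032_19271_-|AYP99992.1|DBSCAN-SWA",
    ]
    for line in examples:
        record = parse_header(line)
        print(record["accession"], matched_keywords(record["annotation"]))
```

Headers without an annotation field, such as the second example, produce an empty match list and would therefore not be colored under this reading of the rule.
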
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP033401_1 | 371135-371252 | Orphan |
NA
Consensus repeat of CP033401_1
|
1 spacer
Spacers of CP033401_1
>1.1|371166|56|CP033401|CRISPRCasFinder
TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA |
CRISPR arrays and Neighbor proteins around CP033401_1
The CRISPR arrays of CP033401_1
>merge|CP033401|1|371135-371252|CRISPRCasFinder
CCGAGCCGTAGGCCGGATAAGGCGTTCACGCTGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAACCGAGCCGTAGGCCGGATAAGGCGTTTACGC
>CP033401|1|1|371135-371252|CRISPRCasFinder
CCGAGCCGTAGGCCGGATAAGGCGTTCACGC TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA CCGAGCCGTAGGCCGGATAAGGCGTTTACGC
>CP033401.1|AYQ04222.1|369907_371038_-|ribonucleoside-diphosphate-reductase-1-subunit-beta MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDSEVDTDDLSNFQL >CP033401.1|AYQ00379.1|369653_369908_-|ferredoxin MARVTLRITGTQLLCQDEHPSLLAALESHNVAVEYQCREGYCGSCRTRLVAGQVDWIAEPLAFIQPGEILPCCCRAKGDIEIEM >CP033401.1|AYQ00378.1|368949_369600_+|lipopolysaccharide-kinase-InaA MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIYVKTEGKAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM >CP033401.1|AYQ00377.1|366215_366530_-|hypothetical-protein MTNKLGGELIDIADKKLAPLINDSFSYTRDFFAYSKQENNIFTFDNSKFVDPKEKEGLMIQHSNGQLVITGKYCPEGVQTAFTQEQYDKLIRYINIFFTFPKCE >CP033401.1|AYQ00376.1|364920_365997_+|glycerophosphodiester-phosphodiesterase MKLKLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDHLVVLHDHYLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMDLNLVQLIAYTDWNETQQKQPDGSWVNYSYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTTDVNQLYDVLYNKAGVNGLFTDFPDKAVKFLNKE >CP033401.1|AYQ00375.1|363557_364916_+|glycerol-3-phosphate-transporter 
MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSKFIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELTAKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFLYEYAGIPGTLLCGWMSDKVFRGNRGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAASAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQKRNGG >CP033401.1|AYQ00374.1|361656_363285_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-A MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQDPAEVTLRKVISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL >CP033401.1|AYQ00373.1|360407_361667_-|glycerol-3-phosphate-dehydrogenase-subunit-GlpB MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSHLPDGQPVADIHSGLESLRQQAPAHPYSLLGPQRVLDLACQAQALIAESGAQLQGSVELAHQRITPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDFQAHLAAASLRELDLSVETAEIELPELDVLRNNATEFRAVNIARFLDNEENWPLLLDALIPVANTCEMILMPACFGLADDKLWRWLNEKLPCSLMLLPTLPPSVLGIRLQNQLQRQFVRQGGVWMPGDEVKKVTCKNGVMNEIWTRNHADIPLRPRFAVLASGSFFSGGLVAERNGIREPILGLDVLQTATRGEWYKGDFFAPQPWQQFGVTTDETLRPSQAGQTIENLFAIGSVLGGFDPIAQGCGGGVCAVSALHAAQQIAQRAGGQQ >CP033401.1|AYQ00372.1|359220_360411_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-C 
MNDTSFENCIKCTVCTTACPVSRVNPGYPGPKQAGPDGERLRLKDGALYDEALKYCINCKRCEVACPSDVKIGDIIQRARAKYDTTRPSLRNFVLSHTDLMGSVSTPFAPIVNTATSLKPVRQLLDAALKIDHRRTLPKYSFGTFRRWYRSVAAQQAQYKDQVAFFHGCFVNYNHPQLGKDLIKVLNAMGTGVQLLSKEKCCGVPLIANGFTAKARKQAITNVESIREAVGVKGIPVIATSSTCTFALRDEYPEVLNVDNKGLRDHIELATRWLWRKLDEGKTLPLKPLPLKVVYHTPCHMEKMGWTLYTLELLRKIPGLELTVLDSQCCGIAGTYGFKKENYPTSQAIGAPLFRQIEESGADLVVTDCETCKWQIEMSTSLRCEHPITLLAQALA >CP033401.1|AYQ00371.1|358128_359028_-|ISNCY-family-transposase MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYSDVLWSVETSDGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPLLFYHGETSPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQKHIRDRDLIGMVDRITTLLVKGFTNDSQLQTLFNYLLQCGDTSRFTRFIEEIAKRSPLQKERLMTIAERLRQEGHQIGWQEGMHEQAIKIALRMLEQGFEREIVLATTQLTDADIPNCH >CP033401.1|AYQ00380.1|371271_373557_-|ribonucleoside-diphosphate-reductase-1-subunit-alpha MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEELAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI >CP033401.1|AYQ00381.1|374252_378005_+|AIDA-I-family-autotransporter-YfaL 
MRIIFLRKEYLSLLPSMIASLFSANGVAAVTDSCQGYDVKASCQASRQSLSGITQDWSIADGQWLVFSDMTNNASGGAVFLQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNAMFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYGGAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVDSIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRSNSLMNVGDTHCQDDPQDCYSLTIGSIDQYQNQAELNVGSTQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQSMALTGDIVVDDGAVLSLEGDAADLTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQVSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAGVDTQWGALMADSSGQHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDIQSIDAISSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSGAVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKMVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAKVGGYLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSGDLFSGRWGTDGEWMLGIVGGYSDNQGDSRSNMTGTRADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHSSGIIASLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW >CP033401.1|AYQ00382.1|378132_378855_-|bifunctional-2-polyprenyl-6-hydroxyphenol-methylase/3-demethylubiquinol-3-O-methyltransferase-UbiG MNAEKSPENHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGATVTGLDMGFEPLQVAKLHALESGIQVDYVQETVEKHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTLNRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNSFKLGPGVDVNYMLHTQNK >CP033401.1|AYQ00383.1|379001_381629_+|DNA-topoisomerase-(ATP-hydrolyzing)-subunit-A 
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDLAVYNTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE >CP033401.1|AYQ00384.1|381777_383466_+|DUF2138-domain-containing-protein MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNNLQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLGIEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETVPVYQLRYNGNNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIAGDLLSGKKRWQASFGLEERTAEKTPVRQRIVVSARWLGFGYQRLMPSFAGVRFEMGNDGWHSFVALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYINPQGIAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGAAWQWLPITWQPL >CP033401.1|AYQ00385.1|383462_384086_+|DUF1175-domain-containing-protein MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFLAR >CP033401.1|AYQ04223.1|384229_388624_+|alpha-2-macroglobulin-family-protein 
MRLEAPGRDYRRYQMEEYGGVDVRLYRIPDPMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSSQSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVEQFRYPLWQAKPFEPQQGVKLEGASSNFISPQPGNIYIPLGQQEPGLYLVEAMVGGYRATTVVFVSDTVALSKVSGNELLVWTAGKKQGEAKPGSEILWTDGLGVMTRGVTDDSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTDRPLYRAGDRVDVKVMGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVDVTLDARNGGQGSFRLPENAVAGGYELRLAYRNQVYSSSFRVANYIKPHFEIGLALAKKEFKTGEAVSGKLQLLYPDGEPVKNARVQLSLRAQQLSMVGNDLRYAGRFPVSLEGSETVSDASGHVALNLPAADKPSRYLLTVSASDGAAYRVTTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWLRLEDRTSHSGELPSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSGKGSTAHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQSLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQNAGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVDEMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGATNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWRITARGMNGDGLVGQGRAYLRSEKNLYMKWSMPTVYRVGDKPAAGLFIFSQQDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNGQVQDSISTKLSFVDNSWPVEQQKNVMLGGGDNALMLPEQASNIRLQSSETPQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQMIQVNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQAIGVTQQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAIARRGTKTEDFSEEDTRDINDSLILDTPESPLADAVANVLTMTLLKKAQLKSTVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQAAAILSGLTAEQSTIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEYWRWVGQGVPDILSFGDELSPQNVQVRWREPAKTAQQSNIPVTVERQLYRLITGEEEMSFTLQPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTWGISVNKPNAAKQQGQLLEIARNEMGELAYMVPVKELTGTVTFRHLLRFSQKGQFVLPPARYMRSYAPAQQSVAAGSEWTRMQVK >CP033401.1|AYQ00386.1|388624_390274_+|DUF2300-domain-containing-protein 
MNWRRIVWLLALVTLPTLAEEPPLQLALRGAQHDQLYKLSSSGVTNVSTLPDTLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESITRDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSVTVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWFADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQVASGQCVEVELFARYPLKKITAEKSTTAVKPGVLNGRYRVTFTNGNHITFVSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMAAWTQDLIYAGDPVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRSTCQLLPKAKAWLAKKMPQWRRILQAETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >CP033401.1|AYQ00387.1|390278_391055_+|DUF2135-domain-containing-protein MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPVEGEDASFSQSINYPASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDGSFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWDTDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPVHGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGELTLVKSFDW >CP033401.1|AYQ00388.1|391128_392313_-|acetyl-CoA-C-acetyltransferase MKNCVIVSAVRTAIGSFNGSLASTSAIDLGATVIKAAIERAKIDSQHVDEVIMGNVLQAGLGQNPARQALLKSGLAETVCGFTVNKVCGSGLKSVALAAQAIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLMCATHGYHMGITAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEIVPVNVVTRKKTFVFSQDEFPKANSTAEALGALRPAFDKAGTVTAGNASGINDGAAALVIMEESAALAAGLTPLARIKSYASGGVPPALMGMGPVPATQKALQLAGLQLADIDLIEANEAFAAQFLAVGKTLGFDPEKVNVNGGAIALGHPIGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN |
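The array record for CP033401_1 is given twice: once as a merged sequence and once split into repeat, spacer, repeat. The following sketch checks that the two views agree with each other and with the reported coordinates. It assumes that coordinates are 1-based and inclusive and that the spacer header's second field is the spacer start; neither is stated in the report. The function name `check_array` is hypothetical.

```python
def check_array(start, end, repeat_left, spacer, repeat_right, merged, spacer_start):
    """Cross-check a repeat-spacer-repeat split against the merged array.

    Assumes 1-based, inclusive genome coordinates (an assumption, not
    something the report states).
    """
    rebuilt = repeat_left + spacer + repeat_right
    return {
        "sequence_matches": rebuilt == merged,
        "length_matches": len(merged) == end - start + 1,
        "spacer_start_matches": spacer_start == start + len(repeat_left),
    }


if __name__ == "__main__":
    # Values copied from the CP033401_1 record in this report.
    result = check_array(
        start=371135,
        end=371252,
        repeat_left="CCGAGCCGTAGGCCGGATAAGGCGTTCACGC",
        spacer="TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA",
        repeat_right="CCGAGCCGTAGGCCGGATAAGGCGTTTACGC",
        merged=(
            "CCGAGCCGTAGGCCGGATAAGGCGTTCACGC"
            "TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA"
            "CCGAGCCGTAGGCCGGATAAGGCGTTTACGC"
        ),
        spacer_start=371166,
    )
    for check, ok in result.items():
        print(check, ok)
```

The same check can be applied to the other arrays in the report by substituting their coordinates and sequences.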
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP033401_2 | 976017-976140 | Orphan |
NA
Consensus repeat of CP033401_2
|
1 spacer
Spacers of CP033401_2
>2.1|976060|38|CP033401|CRISPRCasFinder
CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around CP033401_2
The CRISPR arrays of CP033401_2
>merge|CP033401|2|976017-976140|CRISPRCasFinder
CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>CP033401|2|2|976017-976140|CRISPRCasFinder
CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>CP033401.1|AYQ00911.1|975576_975882_-|monooxygenase MATLLQLHFAFNGPFGDAMAEQLKPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKSALAYLEKHTARLKNLGVEEVVAKVFDVNEPLSQINQAKLA >CP033401.1|AYQ00910.1|973846_975451_-|FAD-NAD(P)-binding-protein MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENSKMMLANIASIEIPPINCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRILLGEYFRDQFLRLVDQARQQKFAVAVYESCQVTDLQITNAGVMLATNQDLPSETFDLVVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTSLSGLDAAMAVAIQHGSFIEDDKQHVVFNRDNASEKLNITLMSRTGILPEADFYCPIPYEPLHIVTDQALNAEIQKGEEGLLDRVFRLIVEEIKFADPDWSQRIALESLNVDSFAQAWFAERKQRDPFDWAEKNLQEVERNKREKHTVPWRYVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLALREAGIIHILALGEDYEMEINESRTVLKTEDNSYSFDVFIDARGQRPLKVKDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEDIRGRVAFGALPWLMHDQPFVQGLTACAEIGEAMARAVVKPASRARRRLSFD >CP033401.1|AYQ00909.1|973022_973835_+|hypothetical-protein MIITRADLREWRIGAVMYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPGEIPFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLAQIASFGANARIANSGDNVHIIASGENSTVVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGENNIRAGVRYRLNEQHQFVEC >CP033401.1|AYQ00908.1|972233_973019_+|thiosulfate-reductase-cytochrome-B-subunit MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALLRARGVKKSVTDYGEKIYLYCKAVRLWHWSNALLFVLLLASGLINHFALVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGGNGHHYRIRRQGWLERAAKQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHETFKSMVDGYHRH >CP033401.1|AYQ00907.1|971568_972237_+|4Fe-4S-dicluster-domain-containing-protein MSFTRRKFVLGMGTVIFFTGSASSLLANTRQEKEVRYAMIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDNDNETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLAKGFPPICVSACPEHALIFGREDSPEIQAWLQQNKYYQYQLPGAGKPHLYRRFGQHLIKKENV >CP033401.1|AYQ00906.1|970857_971505_+|YdhW-family-putative-oxidoreductase-system-protein 
MGEMNHRDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYSVAEEELSVLLESIKQNGDYADIACMTGSQDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQIEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP >CP033401.1|AYQ00905.1|968751_970854_+|aldehyde-ferredoxin-oxidoreductase MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITSLSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLKIKDDKVSLEKADFLWGKGTRATTEEICRLTSPETCVAAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTPQSWAEYSDPKSRWTARKELFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNIPRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVLPAEEYAEIHWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQVGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNWVWPMTVSPLKSRNYRGDLALEAKFFKAITGEEMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQIPVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRETLQRLGLEDIAADLAAHNLLPV >CP033401.1|AYQ00904.1|968104_968731_+|ferredoxin-like-protein MNPVDRPLLDIGLTRLEFLRISGKGLAGLTIAPALLSLLGCKQEDIDSGTVGLINTPKGVLVTQRARCTGCHRCEISCTNFNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCSACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV >CP033401.1|AYQ00903.1|967439_967649_+|fumarate-hydratase-FumD MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK >CP033401.1|AYQ00902.1|965471_966884_-|pyruvate-kinase-I MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKTAAILLDTKGPEIRTMKLEGGNDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNLPGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGIMVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKYPLEAVSIMATICERTDRVMNSRLEFNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARAVRKYFPDATILALTTNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAHKGDVVVMVSGALVPSGTTNTASVHVL >CP033401.1|AYQ00912.1|976454_977711_+|hypothetical-protein 
MGSDAKNLMSDGNVQIVKTGEVIGATQLTEGELIVEAGGRAENTVVTGAGWLKVATGGIAKCTQYGNNGTLSVSDGAIATDIVQSEGGAISLSTLATVNGRHPEGEFSVDQGYACGLLLENGGNLRVLEGHRAEKIILDQEGGLLVNGTTSAVVVDEGGELLVYPGGEASNCEINQGGVFMLAGKASDTLLAGGTMNNLGGEDSDTIVENGSIYRLGTDGLQLYSSGKTQNLSVNVGGRAEVHAGTLENAVIQGGTVILLSPTSADENFVVEEDRAPVELTGSVALLDGASMIIGYGADLQQSTITVQQGGVLILDGSTVKGDGVTFIVGNINLNGGKLWLITGAATHVQLKVKRLRGEGAICLQTSAKEISPDFINVKGEVTGDIHVEITDASRQTLCNALKLQPDEDGIGATLQPA >CP033401.1|AYQ00913.1|977751_979125_-|multidrug-resistance-protein-MdtK MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPGMVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLMVGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGLPSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRLPSVIILQRASR >CP033401.1|AYQ00914.1|979339_979981_+|riboflavin-synthase MFTGIVQGTVKLVSIDEKPNFRTHVVELPDHMLDGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA >CP033401.1|AYQ00915.1|980020_981169_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIASARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMICEKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEHVGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSEPHFVMEDWHNFGADYDTTLMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR >CP033401.1|AYQ00916.1|981459_982671_-|Bcr/CflA-family-multidrug-efflux-MFS-transporter 
MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTIFALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFATIMPLVGLSPALAPLLGSWLLVHFSWQAIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVVAQALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVGCQNHGNAEVAHSESH >CP033401.1|AYQ00917.1|982783_983716_+|LysR-family-transcriptional-regulator MWSEYSLEVVDAVARNGSFSAAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTAAGAWFLKEGRSVVKKMQITRQQCQQIANGWRGQLAIAVDNIVRPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATRAIPVGGRYAFRDMGMLSWSCVVASHHPLALMDGPFSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRVVVPDWESSATCISAGLCIGMVPTHFAKPWLNEGKWVVLELENPFPDSACCLTWQQNDMSPALTWLLEYLGDSETLNKEWLREPEETPATGD >CP033401.1|AYQ00918.1|983712_984738_-|PurR-family-transcriptional-regulator MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR >CP033401.1|AYQ00919.1|985036_985126_+|YnhF-family-membrane-protein MSTDLKFSLVTTIIVLGLIVAVGLTAALH >CP033401.1|AYQ00920.1|985291_986461_+|MFS-transporter MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIFTLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRMSFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVTAMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPETVCVANS >CP033401.1|AYQ00921.1|986606_987188_-|superoxide-dismutase-[Fe] 
MSFELPALPYAKDALAPHISAETIEYHYGKHHQTYVTNLNNLIKGTAFEGKSLEEIIRSSEGGVFNNAAQVWNHTFYWNCLAPNAGGEPTGKVAEAIAASFGSFADFKAQFTDAAIKNFGSGWTWLVKNSDGKLAIVSTSNAGTPLTTDATPLLTVDVWEHAYYIDYRNARPGYLEHFWALVNWEFVAKNLAA |
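The neighbor-protein records above use headers of the form `>contig|protein_id|start_end_strand|product`. A minimal parsing sketch follows; the field names are inferred from the visible headers rather than from any documented specification, and `NeighborProtein` and `parse_neighbor_header` are hypothetical names. It does not cover the differently ordered DBSCAN-SWA prophage headers.

```python
from typing import NamedTuple


class NeighborProtein(NamedTuple):
    """Fields inferred from the header layout seen in this report."""
    contig: str
    protein_id: str
    start: int
    end: int
    strand: str
    product: str


def parse_neighbor_header(header: str) -> NeighborProtein:
    """Parse a header such as '>CP033401.1|AYQ00911.1|975576_975882_-|monooxygenase'."""
    # Split at most three times so a '|' inside the product name would survive.
    contig, protein_id, location, product = header.lstrip(">").split("|", 3)
    start, end, strand = location.split("_")
    return NeighborProtein(contig, protein_id, int(start), int(end), strand, product)


if __name__ == "__main__":
    record = parse_neighbor_header(
        ">CP033401.1|AYQ00911.1|975576_975882_-|monooxygenase"
    )
    print(record)
```

Limiting the split to three separators keeps the product description in one piece, which matters for long hyphenated names such as the UbiG annotation.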
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP033401_4 | 1986833-1986977 | Orphan |
NA
Consensus repeat of CP033401_4
|
1 spacer
Spacers of CP033401_4
>4.1|1986885|41|CP033401|CRISPRCasFinder
TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC |
CRISPR arrays and Neighbor proteins around CP033401_4
The CRISPR arrays of CP033401_4
>merge|CP033401|4|1986833-1986977|CRISPRCasFinder
GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGCTGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTCGTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC
>CP033401|4|4|1986833-1986977|CRISPRCasFinder
GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC
>CP033401.1|AYQ01808.1|1985483_1986767_+|acyl-CoA-thioesterase MNTFSVSRLALALAFGVTLTACSSTPPDQRPSDQTAPGTSSRPILSAKEAQNFDAQHYFASLTPGAAAWNPSPITLPAQPDFVVGPAGTQGVTHTTIQAAVDAAIIKRTNKRQYIAVMPGEYQGTVYVPAAPGGITLYGTGEKPIDVKIGLSLDGGMSPADWRHDVNPRGKYMPGKPAWYMYDSCQSKRSDSIGVLCSAVFWSQNNGLQLQNLTIENTLGDSVDAGNHPAVALRTDGDQVQINNVNILGRQNTFFVTNSGVQNRLETNRQPRTLVTNSYIEGDVDIVSGRGAVVFDNTEFRVVNSRTQQEAYVFAPATLSNIYYGFLAVNSRFNAFGDGVAQLGRSLDVDANTNGQVVIRDSAINEGFNTAKPWADAVISNRPFAGNTGSVDDNDEIQRNLNDTNYNRMWEYNNRGVGSKVVAEAKK >CP033401.1|AYQ01807.1|1984278_1985349_+|integrase MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP033401.1|AYQ01806.1|1984082_1984301_+|excisionase MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >CP033401.1|AYQ01805.1|1983875_1984043_+|hypothetical-protein MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >CP033401.1|AYQ01804.1|1983757_1983943_-|hypothetical-protein MFSASITLLNGSPFHSPVTRKCIYHLHKTKPAVASSDKRNPRQCEDAVHCCYTLFCSQRKR >CP033401.1|AYQ01803.1|1983030_1983633_-|hypothetical-protein MSYFLRKKWMVNLSGSGKILWALNMKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >CP033401.1|AYQ01802.1|1982598_1982820_+|TraR/DksA-family-transcriptional-regulator MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >CP033401.1|AYQ01801.1|1982218_1982500_+|cell-division-protein-ZapA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >CP033401.1|AYQ01800.1|1982016_1982208_+|DUF1382-family-protein MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV 
>CP033401.1|AYQ01799.1|1981861_1982044_+|DUF1317-family-protein MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >CP033401.1|AYQ01809.1|1987000_1989262_-|hydratase MIKLSEKGVFLASNNEIIAEEHFTGEIKKEEAQKGTIAWSILSSHNTSGNMDKLKIKFDSLASHDITFVGIVQTAKASGMERFPLPYVLTNCHNSLCAVGGTINGDDHVFGLSAAQRYGGIFVPPHIAVIHQYMREMMAGGGKMILGSDSHTRYGALGTMAVGEGGGELVKQLLNDTWDIDYPGVVAVHLTGKPAPYVGPQDVALAIIGAVFKNGYVKNKVMEFVGPGVSALSTDFRNSVDVMTTETTCLSSVWQTDEEVHNWLALHGRGQDYCQLNPQPMAYYDGCISVDLSAIKPMIALPFHPSNVYKIDTLNQNLTDILREIEIESERVAHGKAKLSLLDKVENGRLKVQQGIIAGCSGGNYENVIAAANALRGQSCGNDTFSLAVYPSSQPVFMDLAQKGVVADLIGAGAIIRTAFCGPCFGAGDTPINNGLSIRHTTRNFPNREGSKPANGQMSAVALMDARSIAATAANGGYLTSASELDCWDNVPEYAFDVTPYKNRVYQGFVKGATQQPLIYGPNIKDWPELGALTDNIVLKVCSKILDEVTTTDELIPSGETSSYRSNPIGLAEFTLSRRDPGYVGRSKATAELENQRLAGNVSELTEVFARIKQIAGQEHIDPLQTEIGSMVYAVKPGDGSAREQAASCQRVIGGLANIAEEYATKRYRSNVINWGMLPLQMAEVPTFEVGDYIYIPGIKAALDNPGTTFKGYVIHEDAPVTEITLYMGSLTAEEREIIKAGSLINFNKNRQM >CP033401.1|AYQ01810.1|1989444_1990878_-|anion-permease MNKKSLWKLILILAIPCIIGFMPAPAGLSELAWVLFGIYLAAIVGLVIKPFPEPVVLLIAVAASMVVVGNLSDGAFKTTAVLSGYSSGTTWLVFSAFTLSAAFVTTGLGKRIAYLLIGKIGNTTLGLGYVTVFLDLVLAPATPSNTARAGGIVLPIINSVAVALGSEPEKSPRRVGHYLMMSIYMVTKTTSYMFFTAMAGNILALKMINDILHLQISWGGWALAAGLPGIIMLLVTPLVIYTMYPPEIKKVDNKTIAKAGLAELGPMKIREKMLLGVFVLALLGWIFSKSLGVDESTVAIVVMATMLLLGIVTWEDVVKNKGGWNTLIWYGGIIGLSSLLSKVKFFEWLAEVFKNNLAFDGHGNVAFFVIIFLSIIVRYFFASGSAYIVAMLPVFAMLANVSGAPLMLTALALLFSNSYGGMVTHYGGAAGPVIFGVGYNDIKSWWLVGAVLTILTFLVHITLGVWWWNMLIGWNML >CP033401.1|AYQ01811.1|1990953_1992006_-|4-oxalomesaconate-tautomerase MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGIGGGNPLTSKVAIISRSSDLRADVDYLFAQVIVHEQRVDTTPNCGNMLSGVGAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARIDGVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVIIPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGLGDVSNMVIPKPVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIVPSVGYGNINIEHPSGALDVHLSNEGQDATTLRASVIRTTRKIFSGEVYLP >CP033401.1|AYQ01812.1|1992189_1993143_+|LysR-family-transcriptional-regulator 
MKHELSSMKAFVILAESSSFNNAAKLLNITQPALTRRIKKMEEDLHIQLFERTTRKVTLTKAGKRLLPEARELIKKFDETLFNIRDMNAYHRGMVTLACIPTAVFYFLPLAIGKFNELYPNIKVRILEQGTNNCMESVLCNESDFGINMNNVTNSSIDFTPLVNEPFVLACRRDHPLAKKQLVEWQELVGYKMIGVRSSSGNRLLIEQQLADKPWKLDWFYEVRHLSTSLGLVEAGLGISALPGLAMPHAPYSSIIGIPLVEPVIRRTLGIIRRKDAVLSPAAERFFALLINLWTDDKDNLWTNIVERQRHALQEIG >CP033401.1|AYQ01813.1|1993183_1994179_-|6-phosphogluconolactonase MKQTVYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSPDKRYLYVGVRPEFRVLAYSIAPDDGALTFAAESALPGSPTHISTDHQGQFVFVGSYNAGNVSVTRLEDGLPVGVVDVVEGLDGCHSANISPDNRTLWVPALKQDRICLFTVSDDGHLVAQDPAEVTTVEGAGPRHMVFHPNEQYAYCVNELNSSVDVWELKDPHGNIECVQTLDMMPENFSDTRWAADIHITPDGRHLYACDRTASLITVFSVSEDGSVLSKEGFQPTETQPRGFNVDHSGKYLIAAGQKSHHISVYEIVGEQGLLHEKGRYAVGQGPMWVVVNAH >CP033401.1|AYQ01814.1|1994333_1995152_+|pyridoxal-phosphatase MTTRVIALDLDGTLLTPKKTLLPSSIEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYHAKTVLEADPMPVNKALQLIEMLNEHHIHGLMYVDDAMVYEHPTGHVIRTSNWAQTLPPEQRPTFTQVASLAETAQQVNAVWKFALTHDDLPQLQHFGKHVEHELGLECEWSWHDQVDIARGGNSKGKRLTKWVEAQGWSMENVVAFGDNFNDISMLEAAGTGVAMGNADDAVKARANIVIGDNTTDSIAQFIYSHLI >CP033401.1|AYQ01815.1|1995152_1996211_-|molybdenum-import-ATP-binding-protein-ModC MLELNFSQTLGNHCLTINETLPANGITAIFGVSGAGKTSLINAISGLTRPQKGRIVLNGRVLNDAEKGICLTPEKRRVGYVFQDARLFPHYKVRGNLRYGMSKSMVDQFDKLVALLGIEPLLDRLPGSLSGGEKQRVAIGRALLTAPELLLLDEPLASLDIPRKRELLPYLQRLTREINIPMLYVSHSLDEILHLADRVMVLENGQVKAFGALEEVWGSSVMNPWLPKEQQSSILKVTVLEHHPHYAMTALALGDQHLWVNKLDEPLQAALRIRIQASDVSLVLQPPQQTSIRNVLRAKVVNSYDDNGQVEVELEVGGKTLWARISPWARDELAIKPGLWLYAQIKSVSITA >CP033401.1|AYQ01816.1|1996213_1996903_-|molybdenum-ABC-transporter-permease MILTDPEWQAVLLSLKVSSLAVLFSLPFGIFFAWLLVRCTFPGKALLDSVLHLPLVLPPVVVGYLLLVSMGRRGFIGERLYDWFGITFAFSWRGAVLAAAVMSFPLMVRAIRLALEGVDVKLEQAARTLGAGRWRVFFTITLPLTLPGIIVGTVLAFARSLGEFGATITFVSNIPGETRTIPSAMYTLIQTPGGESGAARLCIISIALAMISLLISEWLARISRERAGR >CP033401.1|AYQ01817.1|1996902_1997676_-|molybdate-ABC-transporter-substrate-binding-protein 
MARKWLNLFAGAALSFAVAGNALADEGKITVFAAASLTNAMQDIATQYKKEKGVDVVSSFASSSTLARQIEAGAPADLFISADQKWMDYAVDKKAIDTATRQTLLGNSLVVVAPKASEQKDFTIDSKTNWTSLLNGGRLAVGDPEHVPAGIYAKEALQKLGAWDTLSPKLAPAEDVRGALALVERNEAPLGIVYGSDAVASKGVKVVAIFPEDSHKKVEYPVAVVEGHNNATVKAFYDYLKGPQAAEIFKRYGFTTK >CP033401.1|AYQ01818.1|1997842_1997992_-|multidrug-efflux-pump-accessory-protein-AcrZ MLELLKSLVFAVIMVPVVMAIILGLIYGLGEVFNIFSGVGKKDQPGQNH |
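The two repeats flanking the CP033401_4 spacer are nearly, but not exactly, identical. A position-wise comparison makes such differences explicit. The sketch below is a plain equal-length comparison and not the method any particular tool uses; `repeat_mismatches` is a hypothetical helper, and positions are reported 1-based by choice.

```python
def repeat_mismatches(repeat_a: str, repeat_b: str) -> list[tuple[int, str, str]]:
    """List (1-based position, base in a, base in b) where two repeats differ."""
    if len(repeat_a) != len(repeat_b):
        raise ValueError("repeats must have equal length for a position-wise comparison")
    return [
        (i + 1, a, b)
        for i, (a, b) in enumerate(zip(repeat_a, repeat_b))
        if a != b
    ]


if __name__ == "__main__":
    # Flanking repeats copied from the CP033401_4 array record in this report.
    left = "GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC"
    right = "GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC"
    for position, base_left, base_right in repeat_mismatches(left, right):
        print(f"position {position}: {base_left} -> {base_right}")
```

Repeats of unequal length would need an alignment step first, which is outside the scope of this sketch.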
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP033401_5 | 2497646-2497799 | Orphan |
NA
Consensus repeat of CP033401_5
|
1 spacer
Spacers of CP033401_5
>5.1|2497699|48|CP033401|CRISPRCasFinder
TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA |
CRISPR arrays and Neighbor proteins around CP033401_5
The CRISPR arrays of CP033401_5
>merge|CP033401|5|2497646-2497799|CRISPRCasFinder
CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCGTCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAACGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG
>CP033401|5|5|2497646-2497799|CRISPRCasFinder
CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCG TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA CGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG
>CP033401.1|AYQ02247.1|2495771_2497511_+|flagellar-type-III-secretion-system-protein-FlhA MLSRSDLLTLLTINFIVVTKGAERISEVSARFTLDAMPGKQMAIDADLNAGLINQAQAQTRRKDVASEADFYGAMDGASKFVRGDAIAGMMILAINLIGGVCIGIFKYNLSADAAFQQYVLMTIGDGLVAQIPSLLLSTAAAIIVTRISDNGDITHDVRHQLLASPSVLYTATGIMFVLAVVPGMPHLPFLLFSALLGFTGWRMSKRPQAAEAEEKSLETLTRTITETSEQQVSWETIPLIEPISLSLGYKLVALVDKAQGNPLTQRIRGVRQVISDGNGVLLPEIRIRENFRLKPSQYAIFINGIKADEADIPADKLMALPSSETYGEIDGVLGNDPAYGMPVTWIQPAQKAKALNMGYQVIDSASVIATHVNKIVRSYIPDLFSYDDITQLHNRLSSMAPRLAEDLSAALNYSQLLKVYRALLTEGVSLRDIVTIATVLVASSAVTKDHILLAADVRLALRRSITHPFVRKQELTVYTLNNELENLLTNVVNQAQQGGKVMLDSVPVDPNMLNQFQSTMPQVKEQMKAAGKDPVLLVPPQLRPLLARYARLFAPGLHVLSYNEVPDELELKIMGALM >CP033401.1|AYQ04308.1|2495041_2495812_-|putative-lateral-flagellar-export/assembly-protein-LafU MIVNSVSKSERESIIAALHGQSIFSGGGLSPLNKISPSHPPKPATVAVPEETEKKARDVNEKTALLKKKSATELGELATSINTIARDAHMEANLEMEIVPQGLRVLIKDDQNRNMFECGSAQIMPFFKTLLVELAPVFDSLDNKIIITGHTDAMAYKNNIYNNWNLSGDRALSARRVLEEAGMPEDKVMQVSAMADQMLLDAKNPQSAGNRRIEIMVLTKSASDTLYQYFGQHGDKVVQPLVQKLDKQQVLSQRMR >CP033401.1|AYQ02246.1|2493915_2494971_-|DNA-polymerase-IV MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARKFGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPLSLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELHLTASAGVAPVKFLAKIASDMNKPNGQFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTCGDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMAEDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQEHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLGL >CP033401.1|AYQ02245.1|2493466_2493919_-|GNAT-family-N-acetyltransferase MNNIQIRNYQPGDFQQLCAIFIRAVMMTASQHYSPQQIAAWAQIDESRWKEKLAKSQVRVAVINAQPVGFISRIERHIDMLFVDPEYTRRGVASALLKPLIKSESELTVDASITAKPFFERYGFQIVKQQHVECRGAWFTNFYMRYKPQH >CP033401.1|AYQ02244.1|2492893_2493160_-|hypothetical-protein MEWYMGKYIRPLSDAVFTIASDDLWIESLAIQQLHTTANLPNMQRVVGMPDLHPGRGYPIGAAFFSVGRFYPARRRGNGAGNRNGPLL >CP033401.1|AYQ02243.1|2491079_2492537_+|cytosol-nonspecific-dipeptidase 
MSELSQLSPQPLWDIFAKICSIPHPSYHEEQLAEYIVGWAKEKGFHVERDQVGNILIRKPATAGMENRKPVVLQAHLDMVPQKNNDTVHDFTKDPIQPYIDGEWVKARGTTLGADNGIGMASALAVLADENVVHGPLEVLLTMTEEAGMDGAFGLQSNWLQADILINTDSEEEGEIYMGCAGGIDFTSNLHLDREAVPAGFETFKLTLKGLKGGHSGGEIHVGLGNANKLLVRFLAGHAEELDLRLIDFNGGTLRNAIPREAFATIAVAADKVDALKSLVNTYQDILKNELAEKEKNLALLLDSVANDKAALIAKSRDTFIRLLNATPNGVIRNSDVAKGVVETSLNVGVVTMTDNNVEIHCLIRSLIDSGKDYVVSMLDSLGKLAGAKTEAKGAYPGWQPDANSPVMHLVRETYQRLFNKTPNIQIIHAGLECGLFKKPYPEMDMVSIGPTITGPHSPDEQVHIKSVGHYWTLLTELLKEIPAK >CP033401.1|AYQ02242.1|2490360_2490819_-|xanthine-phosphoribosyltransferase MSEKYIVTWDMLQIHARKLASRLMPSEQWKGIIAVSRGGLVPGALLARELGIRHVDTVCISSYDHDNQRELKVLKRAEGDGEGFIVIDDLVDTGGTAVAIREMYPKAHFVTIFAKPAGRPLVDNYVVDIPQDTWIEQPWDMGVVFVPPISGR >CP033401.1|AYQ02241.1|2489024_2490269_-|esterase MTQANLSETLFKPRFKHPETSTLVRRFNHGAQPPVQSALDGKTIPHWYRMINRLMWIWRGIDPREILDVQARIVMSDAERTDDDLYDTVIGYRGGNWIYEWATQAMVWQQKACAEEDPQLSGRHWLHAATLYNIAAYPHLKGDDLAEQAQALSNRAYEEAAQRLPGTMRQMEFTVPGGAPITGFLHMPKGDGPFPTVLMCGGLDAMQTDYYSLYERYFAPRGIAMLTIDMPSVGFSSKWKLTQDSSLLHQHVLKALPNVPWVDHTRVAAFGFRFGANVAVRLAYLESPRLKAVACLGPVVHTLLSDFKCQQQVPEMYLDVLASRLGMHDASDDALRVELNRYSLKVQGLLGRRCPTPMLSGYWKNDPFSPEEDSRLITSSSADGKLLEIPFNPVYRNFDKGLQEITGWIEKRLC >CP033401.1|AYQ02240.1|2488565_2488967_-|sigma-factor-binding-protein-Crl MTLPSGHPKSRLIKKFTALGPYIREGKCEDNRFFFDCLAVCVNVKPAPEVREFWGWWMELEAQESRFTYSYQFGLFDKAGDWKSVPVKDTEVVERLEHTLREFHEKLRELLTTLNLKLEPADDFRDEPVKLTA >CP033401.1|AYQ02239.1|2487471_2488527_+|phosphoporin-PhoE MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNDSKDGDQSYIRFGFKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNENRDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSRGTGKRAEAWATGLKYDANNIYLATFYSETRKMTPITGGFANKTQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDSDNKLNINNDDIVAVGMTYQF >CP033401.1|AYQ02248.1|2497828_2498326_-|transposase 
MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRHAIIKVKRDRPFEINAWVVLPEHMHCIWTLPEGDDDFSSRWREIKKQFTHACGLKNIWQPRFWEHAIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVARGLYPIDWAGDVTDINAGERIIL >CP033401.1|AYQ02249.1|2498501_2499260_-|peptidoglycan-endopeptidase MSFMSSFLLGRFLHPGVFSLCVLLPLFASATTSHISFSYAARQRMQNRARLLKQYQTHLKKQASYIVEGNAESRRALRQHNREQIKQHPEWFPAPLKASDRRWQALAENNHFLSSDHLHNITEVAIHRLEQQLGKPYVWGGTRPDQGFDCSGLVFYAYNKILEAKLPRTANEMYHYHRATIVANNDLRRGDLLFFHIHSREIADHMGVYLGDGQFIESPRTGENIRVSRLAEPFWQDHFLGARRILTEETIL >CP033401.1|AYQ02250.1|2499551_2500292_+|transpeptidase MRKIALILAMLLIPCVSFAGLLGSSSSTTPVSKEYKQQLMGSPVYIQIFKEERTLDLYVKMGEQYQLLDSYKICKYSGGLGPKQRQGDFKSPEGFYSVQRNQLKPDSRYYKAINIGFPNAYDRAHGYEGKYLMIHGDCVSIGCYAMTNQGIDEIFQFVTGALVFGQPSVQVSIYPFRMTDANMKRHKYSNFKDFWEQLKPGYDYFEQTRKPPTVSVVNGRYVVSKPLSHEVVQPQLASNYTLPEAK >CP033401.1|AYQ02251.1|2500262_2501030_-|class-II-glutamine-amidotransferase MCELLGMSANVPTDICFSFTGLVQRGGGTGPHKDGWGITFYEGKGCRTFKDPQPSFNSPIAKLVQDYPIKSCSVVAHIRQANRGEVALENTHPFTRELWGRNWTYAHNGQLTGYKSLETGNFRPVGETDSEKAFCWLLHKLTQRYPRTPGNMAAVFKYIASLADELRQKGVFNMLLSDGRYVMAYCSTNLHWITRRAPFGVATLLDQDVEIDFSSQTTPNDVVTVIATQPLTGNETWQKIMPGEWRLFCLGERVV >CP033401.1|AYQ02252.1|2501235_2501814_-|D-sedoheptulose-7-phosphate-isomerase MYQDLIRNELNEAAETLANFLKDDANIHAIQRAAVLLADSFKAGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPAIAISDVSHISCVGNDFGFNDIFSRYVEAVGREGDVLLGISTSGNSANVIKAIAAAREKGMKVITLTGKDGGKMAGTADIEIRVPHFGYADRIQEIHIKVIHILIQLIEKEIVK >CP033401.1|AYQ02253.1|2502053_2504498_+|acyl-CoA-dehydrogenase 
MMILSILATVVLLGALFYHRVSLFISSLILLAWTAALGVAGLWSAWVLVPLAIILVPFNFAPMRKSMISAPVFRGFRKVMPPMSRTEKEAIDAGTTWWEGDLFQGKPDWKKLHNYPQPRLTAEEQAFLDGPVEEACRMANDFQITHELADLPPELWAYLKEHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAITVGVPNSLGPGELLQHYGTDEQKNHYLPRLARGQEIPCFALTSPEAGSDAGAIPDTGIVCMGEWQGQQVLGMRLTWNKRYITLAPIATVLGLAFKLSDPEKLLGGAEDLGITCALIPTTTPGVEIGRRHFPLNVPFQNGPTRGKDVFVPIDYIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYAHIRRQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAIVKYHCTHRGQQSIIDAMDITGGKGIMLGQSNFLARAYQGAPIAITVEGANILTRSMMIFGQGAIRCHPYVLEEMEAAKNNDVNAFDKLLFKHIGHVGSNKVRSFWLGLTRGLTSSTPTGDATKRYYQHLNRLSANLALLSDVSMAVLGGSLKRRERISARLGDILSQLYLASAVLKRYDDEGRNEADLPLVHWGVQDALYQAEQAMDDLLQNFPNRVVAGLLNVVIFPTGRHYLAPSDKLDHKVAKILQVPNATRSRIGRGQYLTPSEHNPVGLLEEALVDVIAADPIHQRICKELGKNLPFTRLDELAHNALAKGLIDKDEAAILVKAEESRLCSINVDDFDPEELATKPVKLPEKVRKVEAA >CP033401.1|AYQ02254.1|2504540_2505014_-|inhibitor-of-vertebrate-lysozyme MGRISSGGMMFKAITTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYTPAQTVTLGDETYQVMSACKPHDCGSQRIAVMWSEKSNQMTGLFSTIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENHPDGFNFK >CP033401.1|AYQ02255.1|2505167_2505938_+|amidohydrolase MPGLKITLLQQPLVWMDGPANLRHFDRQLEGITGRDVIVLPEMFTSGFAMEAAASSLAQNDVVNWMTAKAQQCNALIAGSVALQTESGSVNRFLLVEPGGTVHFYDKRHLFRMADEHLHYKAGNARVIVEWRGWRILPLVCYDLRFPVWSRNLNDYDLAIYVANWPAPRSLHWQALLTARAIENQAYVAGCNRVGSDGNGCHYRGDSRVINPQGEIIATADAHQATRIDAELSMVALREYREKFPAWQDADEFRLR >CP033401.1|AYQ02256.1|2507376_2507826_-|hypothetical-protein MMKYLMVLLSLFSGSVLGMGRVNELCGIDSVKTIEIINLPSYVTTLVPLSKEGLNEIYRYKVVVNEISDLYAGKIIDLLQMKYFRKEKYNNIRWGVSIISKGNNKCEIYFDAFGECGSVNGINVCFEKNEMIGWIKKEIPLLSQKIGGL >CP033401.1|AYQ04309.1|2507837_2510897_-|RHS-repeat-protein 
MTSPLNSEGRYTEGEGGLKRVVKKEHADGSITRSEYDEAGRLKAQTDAAGRRTEYSLHMASGAVTAVTGPDGRTVRYGYNSQRQVTSVTYPDGLRSSREYDEKGRLTAETSRSGETTRYSYDDPASELPTGIQDATGSTKQMAWSRYGQLLAFTDCSGYTTRYEYDRYGQQIAVHREEGISTYSSYNPRGQLVSQKDAQGREIRYEYSAAGDLTATISPDGKRSTIEYDKRGRPVSVTEGGLTRSMGYDAAGRITVLTNENGSQSTFRYDPVDRLTEQRGFDGRTQRYHYDLTGKLTQSEDEGLITLWHYDASDRITHRTVNGDPAEQWQYDEHGWLTTLSHTCEGHRVSVHYGYDDKGRLTGERQTVENPETGEMLWEHETGHAYSEQGLATRQEPDGLPPVEWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETARSFGGAGSTAGYEQATAYTLTGQLQSRHLNLPQLDCDYTWNDNGQLVRISGPQECREYRYSGTGRLTGVHTTAANLDIDIPYATDPAGNRLPDPELHPDSTLTAWPDNRIAEDAHYVYRYDEYGRLAEKTDRIPEGVIRMHDERTHHYHYDSQHRLVFYTRIQHGEPQVESRYLYDPLGRRTGKRVWRRERDLTGWMSLSRKPEETWYGWDGDRLTTVQTQQTRIQTVYQPGSFTPLLRIETENGEQAKARHRSLAEVLQEDTGVTLPAELAVMLGRLERELRQGSVSEESQQWLAQCGLTAEQMAAQLEAEYIPERKLHLYHCDHRGLPLALISPEGETAWQGEYDEWGNLLGEESAQHLQQSLRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLRGEWNLYKYPLNPVRFIDSLGLKFHVNGDPSDFNQAVEYLKQDSQMKETIDFLSSSEETINIEYIEGTNVRFNSNNMAIYWNSRASLFCSTELNSKSQSPALGLGHEFAHAQYYLLDKENFMALLSRTDKKYENKEEARVITIIESRAAKTLGECTRGAHSGLPFYRVDGPLQTMKITGTPE |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP033401_6 | 2711093-2711225 | Orphan |
NA
Consensus repeat of CP033401_6
|
2 spacers
spacers of CP033401_6
>6.1|2711110|42|CP033401|PILER-CR TGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTC >6.2|2711169|40|CP033401|PILER-CR CATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGG |
CRISPR arrays and Neighbor proteins around CP033401_6
The CRISPR arrays of CP033401_6 >merge|CP033401|6|2711093-2711225|PILER-CR ATCACCAATATTGAAAATGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTCCTCACCAATATTGAAAACATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGGATCACCAATATTGAAAG >CP033401|6|1|2711093-2711225|PILER-CR ATCACCAATATTGAAAA TGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTC CTCACCAATATTGAAAA CATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGG ATCACCAATATTGAAAG
>CP033401.1|AYQ02422.1|2710231_2711002_-|protein-FixA MKIITCYKCVPDEQDIAVNNADGSLDFSKADAKISQYDLNAIEAACQLKQQAAEAQVTALSVGGKALTNAKGRKDVLSRGPDELIVVIDDQFEQALPQQTASALAAAAQKAGFDLILCGDGSSDLYAQQVGLLVGEILNIPAVNGVSKIISLTADTLTVERELEDETETLSIPLPAVVAVSTDINSPQIPSMKAILGAAKKPVQVWSAADIGFNAEAAWSEQQVAAPKQRERQRIVIEGDGEEQIAAFAENLRKVI >CP033401.1|AYQ02421.1|2709275_2710217_-|protein-FixB MNTFSQVWVFSDTPSRLPELMNGAQALANQINTFVLNDADGAQAIQLGANHVWKLNGKPDDRMIEDYAGVMADTIRQHGADGLVLLPNTRRGKLLAAKLGYRLKAAVSNDASTVSVQDGKATVKHMVYGGLAIGEERIATPYAVLTISSGTFDAAQPDASRTGETHTVEWQAPAVAITRTATQARQSNSVDLDKARLVVSVGRGIGSKENIALAEQLCKAIGAELACSRPVAENEKWMEHERYVGISNLMLKPELYLAVGISGQIQHMVGANASQTIFAINKDKNAPIFQYADYGIVGDAVKILPALTAALAR >CP033401.1|AYQ02420.1|2707938_2709225_-|FAD-dependent-oxidoreductase MSEDIFDAIIVGAGLAGSVAALVLAREGAQVLVIERGNSAGAKNVTGGRLYAHSLEHIIPGFADSAPVERLITHEKLAFMTEKSAMTMDYCNGDETSPSQRSYSVLRSKFDAWLMEQAEEAGAQLITGIRVDNLVQRDGKVVGVEADGDVIEAKTVILADGVNSILAEKLGMAKRVKPTDVAVGVKELIELPKSVIEDRFQLQGNQGAACLFAGSPTDGLMGGGFLYTNENTLSLGLVCGLHHLHDAKKSVPQMLEDFKQHPAVAPLIAGGKLVEYSAHVVPEAGINMLPELVGDGVLIAGDAAGMCMNLGFTIRGMDLAIAAGEAAAKTVLSAMKSDDFSKQKLAEYRQHLESGPLRDMRMYQKLPAFLDNPRMFSGYPELAVGVARDLFTIDGSAPELMRKKILRHGKKVGFINLIKDGMKGVTVL >CP033401.1|AYQ02419.1|2707654_2707942_-|ferredoxin-like-protein-FixX MTSPVNVDVKLGVNKFNVDEEHPHIVVKADADKQVLELLVKACPAGLYKKQDDGSVRFDYAGCLECGTCRILGLGSALEQWEYPRGTFGVEFRYG >CP033401.1|AYQ02418.1|2706265_2707597_-|MFS-transporter MQPSRNFDDLKFSSIHRRILLWGSGGPFLDGYVLVMIGVALEQLTPALKLDADWIGLLGAGTLAGLFVGTSLFGYISDKVGRRKMFLIDIIAIGVISVATMFVSSPVELLVMRVLIGIVIGADYPIATSMITEFSSTRQRAFSISFIAAMWYVGATCADLVGYWLYDVEGGWRWMLGSAAIPCLLILIGRFELPESPRWLLRKGRVKECEEMMIKLFGEPVAFDEEQPQQTRFRDLFNRRHFPFVLFVAAIWTCQVIPMFAIYTFGPQIVGLLGLGVGKNAALGNVVISLFFMLGCIPPMLWLNTAGRRPLLIGSFAMMTLALAVLGLIPDMGIWLVVMAFAVYAFFSGGPGNLQWLYPNELFPTDIRASAVGVIMSLSRIGTIVSTWALPIFINNYGISNTMLMGAGISLFGLLISVAFAPETRGMSLAQTSNMTIRGQRMG >CP033401.1|AYQ02417.1|2705627_2706158_-|glutathione-regulated-potassium-efflux-system-ancillary-protein-KefF 
MILIIYAHPYPHHSHANKRMLEQARTLEGVEIRSLYQLYPDFNIDIAAEQEALSRADLIVWQHPMQWYSIPPLLKLWIDKVFSHGWAYGHGGTALHGKHLLWAVTTGGGESHFEIGAHPGFDVLSQPLQATAIYCGLNWLPPFAMHCTFICDDETLEGQARHYKQRLLEWQEAHHG >CP033401.1|AYQ02416.1|2703772_2705635_-|glutathione-regulated-potassium-efflux-system-protein-KefC MDSHTLIQALIYLGSAALIVPIAVRLGLGSVLGYLIAGCIIGPWGLRLVTDAESILHFAEIGVVLMLFIIGLELDPQRLWKLRAAVFGCGALQMVICGGLLGLFCMLLGLRWQVAELIGMTLALSSTAIAMQAMNERNLMVTQMGRSAFAVLLFQDIAAIPLVAMIPLLATSSASTTMGAFALSALKVAGALVLVVLLGRYVTRPALRFVARSGLREVFSAVALFLVFGFGLLLEEVGLSMAMGAFLAGVLLASSEYRHALESDIEPFKGLLLGLFFIGVGMSIDFGTLLENPLRIVILLLGFLIIKIAMLWLIARPLQVPNKQRRWFAVLLGQGSEFAFVVFGAAQMANVLEPEWAKSLTLAVALSMAATPILLVILNRLEQSSTEEAREADEIDEEQPRVIIAGFGRFGQITGRLLLSSGVKMVVLDHDPDHIETLRKFGMKVFYGDATRMDLLESAGAAKAEVLINAIDDPQTNLQLTEMVKEHFPHLQIIARARDVDHYIRLRQAGVEKPERETFEGALKTGRLALESLGLGPYEARERADVFRRFNIQMVEEMAMVENDTKARAAVYKRTSAMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKPSS >CP033401.1|AYQ04316.1|2703101_2703581_-|type-3-dihydrofolate-reductase MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLNKPVIMGRHTWESIGRPLPGRKNIILSSQPGTDDRVTWVKSVDEAIAACGDVPEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVEGDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR >CP033401.1|AYQ02415.1|2702181_2703024_+|diadenosine-tetraphosphatase MATYLIGDVHGCYDELIALLHKVEFTPGKDTLWLTGDLVARGPGSLDVLRYVKSLGDSVRLVLGNHDLHLLAVFAGISRNKPKDRLTPLLEAPDADELLNWLRRQPLLQIDEEKKLVMAHAGITPQWDLQTAKECARDVEAVLSSDSYPFFLDAMYGDMPNNWSPELRGLGRLRFITNAFTRMRFCFPNGQLDMYSKESPEEAPAPLKPWFAIPGPVAEEYSIAFGHWASLEGKGTPEGIYALDTGCCWGGTLTCLRWEDKQYFVQPSNRHKDLGEAAAS >CP033401.1|AYQ02414.1|2701797_2702175_+|Co2+/Mg2+-efflux-protein-ApaG MINSPRVCIQVQSVYIEAQSSPDNERYVFAYTVTIRNLGRAPVQLLGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMIDENGVPFSIDIPVFRLAVPTLIH >CP033401.1|AYQ02423.1|2711475_2712990_+|L-carnitine/gamma-butyrobetaine-antiporter 
MKNEKRKTGIEPKVFFPPLIIVGILCWLTVRDLDAANVVINAVFSYVTNVWGWAFEWYMVVMLFGWFWLVFGPYAKKRLGNEPPEFSTASWIFMMFASCTSAAVLFWGSIEIYYYISTPPFGLEPNSTGAKELGLAYSLFHWGPLPWATYSFLSVAFAYFFFVRKMEVIRPSSTLVPLVGEKHAKGLFGTIVDNFYLVALIFAMGTSLGLATPLVTECMQWLFGIPHTLQLDAIIITCWIILNAICVACGLQKGVRIASDVRSYLSFLMLGWVFIVSGASFIMNYFTDSVGMLLMYLPRMLFYTDPIAKGGFPQGWTVFYWAWWVIYAIQMSIFLARISRGRTVRELCFGMVLGLTASTWILWTVLGSNTLLLIDKNIINIPNLIEQYGVARAIIETWAALPLSTATMWGFFILCFIATVTLVNACSYTLAMSTCREVRDGEEPPLLVRIGWSILVGIIGIVLLALGGLKPIQTAIIAGGCPLFFVNIMVTLSFIKDAKQNWKD >CP033401.1|AYQ02424.1|2713020_2714163_+|crotonobetainyl-CoA-dehydrogenase MDFNLNDEQELFVAGIRELMASENWEAYFAECDRDSVYPERFVKALADMGIDSLLIPEEHGGLDAGFVTLAAVWMELGRLGAPTYVLYQLPGGFNTFLREGTQEQIDKIMAFRGTGKQMWNSAITEPGAGSDVGSLKTTYTRRNGKIYLNGSKCFITSSAYTPYIVVMARDGASPDKPVYTEWFVDMSKPGIKVTKLEKLGLRMDSCCEITFDDVELDEKDMFGREGNGFNRVKEEFDHERFLVALTNYGTAMCAFEDAAHYANQRVQFGEAIGRFQLIQEKFAHMAIKLNSMKNMLYEAAWKADNGTITSGDAAMCKYFCANAAFEVVDSAMQVLGGVGIAGNHRISRFWRDLRVDRVSGGSDEMQILTLGRAVLKQYR >CP033401.1|AYQ02425.1|2714291_2715509_+|L-carnitine-CoA-transferase MDHLPMPKFGPLAGLRVVFSGIEIAGPFAGQMFAEWGAEVIWIENVAWADTIRVQPNYPQLSRRNLHALSLNIFKDEGREAFLKLMETTDIFIEASKGPAFARRGITDEVLWQHNPKLVIAHLSGFGQYGTEEYTNLPAYNTIAQAFSGYLIQNGDVDQPMPAFPYTADYFSGLTATTAALAALHKARETGKGESIDIAMYEVMLRMGQYFMMDYFNGGEMCPRMSKGKDPYYAGCGLYKCADGYIVMELVGITQIEECFKDIGLAHLLSTPEIPEGTQLIHRIECPYGPLVEEKLDAWLAAHTIAEVKERFAELNIACAKVLTVPELESNPQYVARESITQWQTMDGRTCKGPNIMPKFKNNPGQIWRGMPSHGMDTAAILKNIGYSENDIQELVSKGLAKVED >CP033401.1|AYQ02426.1|2715582_2717136_+|ATP-dependent-acyl-CoA-ligase 
MDIIGGQHLRQMWDDLADVYGHKTALICESSGGVVNRYSYLELNQEINRTANLFYTLGIRKGDKVALHLDNCPEFIFCWFGLAKIGAIMVPINARLLREESAWILQNSQACLLVTSAQFYPMYQQIQQEDATQLRHICLTDVALPADDGVSSFTQLKNQQPATLCYAPPLLTDDTAEILFTSGTTSRPKGVVITHYNLRFAGYYSAWQCALRDDDVYLTVMPAFHIDCQCTAAMAAFSAGATFVLVEKYSARAFWGQVQKYRATITECIPMMIRTLMVQPPSANDRQHRLREVMFYLNLSEQEKDAFCERFGVRLLTSYGMTETIVGIIGDRPGDKRRWPSIGRAGFCYEAEIRDDHNRPLPAGEIGEICIKGVPGKTIFKEYFLNPKATAKVLEADGWLHTGDTGYCDEEGFFYFVDRRCNMIKRGGENVSCVELENIIATHPKIQDIVVVGIKDSIRDEAIKAFVVLNEGETLSEEEFFRFCEQNMAKFKVPSYLEIRKDLPRNCSGKIIRKNLK >CP033401.1|AYQ02427.1|2717244_2718030_+|crotonobetainyl-CoA-hydratase MSESLHLTRNGSILEITLDRPKANAIDAKTSFEMGEVFLNFRDDPQLRVAIITGAGEKFFSAGWDLKAAAEGEAPDADFGPGGFAGLTEIFNLDKPVIAAVNGYAFGGGFELALAADFIVCADNASFALPEAKLGIVPDSGGVLRLPKILPPAIVNEMVMTGRRMGAEEALRWGIVNRVVNQAELMDNARELAQQLVNSAPLAIAALKEIYRTTSEMPVEEAYRYIRSGVLKHYPSVLHSEDAIEGPLAFAEKRDPVWKGR >CP033401.1|AYQ02428.1|2718035_2718626_+|carnitine-operon-protein-CaiE MSYYAFEGLIPVVHPTAFVHPSAVLIGDVIVGAGVYIGPLASLRGDYGRLIVQAGANIQDGCIMHGYCNTDTIVGENGHIGHGAILHGCVIGRDALVGMNSVIMDGAVIGEESIVAAMSFIKAGFRGEKRQLLMGTPARAVRSVSDDELHWKRLNTKEYQDLVGRCHASLHETQPLRQMEENRPRLQGTTDVTPKR >CP033401.1|AYQ02429.1|2718744_2719140_-|transcriptional-activatory-protein-CaiF MCEGYVEKPLYLLIAEWMMAENRWVIAREISIHFDIEHSKAVNTLTYILSEVTEISCEVKMIPNKLEGRGCQCQRLVKVVDIDEQIYARLRNNSREKLVGVRKTPRIPAVPLTELNREQKWQMMLSKSMRR >CP033401.1|AYQ02430.1|2719400_2722622_-|carbamoyl-phosphate-synthase-large-subunit 
MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEMADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCALELERQGVLEEFGVTMIGATADAIDKAEDRRRFDVAMKKIGLETARSGIAHTMEEALAVAADVGFPCIIRPSFTMGGSGGGIAYNREEFEEICARGLDLSPTKELLIDESLIGWKEYEMEVVRDKNDNCIIVCSIENFDAMGIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIGVETGGSNVQFAVNPKNGRLIVIEMNPRVSRSSALASKATGFPIAKVAAKLAVGYTLDELMNDITGGRTPASFEPSIDYVVTKIPRFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQESLQKALRGLEVGATGFDPKVSLDDPEALTKIRRELKDAGAERIWYIADAFRAGLSVDGVFNLTNIDRWFLVQIEELVRLEEKVAEVGITGLNAEFLRQLKRKGFADARLAKLAGVREAEIRKLRDQYDLHPVYKRVDTCAAEFATDTAYMYSTYEEECEANPSTDREKIMVLGGGPNRIGQGIEFDYCCVHASLALREDGYETIMVNCNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVPVIGTSPDAIDRAEDRERFQHAVERLKLKQPANATVTAIEMAVEKAKEIGYPLVVRPSYVLGGRAMEIVYDEADLRRYFQTAVSVSNDAPVLLDHFLDDAVEVDVDAICDGEMVLIGGIMEHIEQAGVHSGDSACSLPAYTLSQEIQDVMRQQVQKLAFELQVRGLMNVQFAVKNNEVYLIEVNPRAARTVPFVSKATGVPLAKVAARVMAGKSLAEQGVTKEVIPPYYSVKEVVLPFNKFPGVDPLLGPEMRSTGEVMGVGRTFAEAFAKAQLGSNSTMKKHGRALLSVREGDKERVVDLAAKLLKQGFELDATHGTAIVLGEAGINPRLVNKVHEGRPHIQDRIKNGEYTYIINTTSGRRAIEDSRVIRRSALQYKVHYDTTLNGGFATAMALNADATEKVISVQEMHAQIK >CP033401.1|AYQ02431.1|2722639_2723788_-|carbamoyl-phosphate-synthase-small-subunit MIKSALLVLEDGTQFHGRAIGATGSAVGEVVFNTSMTGYQEILTDPSYSRQIVTLTYPHIGNVGTNDADEESSQVHAQGLVIRDLPLIASNFRNTEDLSSYLKRHNIVAIADIDTRKLTRLLREKGAQNGCIIAGDNPDAALALEKARAFPGLNGMDLAKEVTTAEAYSWTQGSWTLTGGLPEAKKEDELPFHVVAYDFGAKRNILRMLVDRGCRLTIVPAQTSAEDVLKMNPDGIFLSNGPGDPAPCDYAITAIQKFLETDIPVFGICLGHQLLALASGAKTVKMKFGHHGGNHPVKDVEKNVVMITAQNHGFAVDEATLPANLRVTHKSLFDGTLQGIHRTDKPAFSFQGHPEASPGPHDAAPLFDHFIELIEQYRKTAK >CP033401.1|AYQ02432.1|2724243_2725065_-|4-hydroxy-tetrahydrodipicolinate-reductase MHDANIRVAIAGAGGRMGRQLIQAALALEGVQLGAALEREGSSLLGSDAGELAGAGKTGVTVQSSLDAVKDDFDVFIDFTRPEGTLNHLAFCRQHGKGMVIGTTGFDEAGKQAIRDAAADIAIVFAANFSVGVNVMLKLLEKAAKVMGDYTDIEIIEAHHRHKVDAPSGTALAMGEAIAHALDKDLKDCAVYSREGHTGERVPGTIGFATVRAGDIVGEHTAMFADIGERLEITHKASSRMTFANGAVRSALWLSGKEGGLFDMRDVLDLNSL |
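The spacer records listed for CP033401_6 appear to follow a header layout of `>array.spacer|start|length|contig|tool`, followed by the spacer sequence. That reading is an assumption inferred from the values on this page (the third field matches the sequence length, and the last field names PILER-CR or CRISPRCasFinder); it is not a documented format. Under that assumption, a minimal Python sketch for pulling the records out of the flattened text could look like this. The names `Spacer` and `parse_spacers` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Spacer:
    spacer_id: str   # e.g. "6.1" (array number, spacer number)
    start: int       # assumed: start coordinate on the contig
    length: int      # assumed: declared spacer length in bp
    contig: str      # e.g. "CP033401"
    tool: str        # e.g. "PILER-CR" or "CRISPRCasFinder"
    sequence: str


def parse_spacers(text):
    """Parse flattened '>id|start|length|contig|tool SEQUENCE' records.

    Tokens that are neither a header nor a nucleotide string (for example
    a trailing table-cell '|') are skipped.
    """
    spacers = []
    header = None
    for token in text.split():
        if token.startswith(">"):
            header = token[1:].split("|")
        elif header is not None and set(token) <= set("ACGTN"):
            # Raises ValueError if the header does not have five fields.
            spacer_id, start, length, contig, tool = header
            spacers.append(
                Spacer(spacer_id, int(start), int(length), contig, tool, token)
            )
            header = None
    return spacers


if __name__ == "__main__":
    line = (
        ">6.1|2711110|42|CP033401|PILER-CR "
        "TGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTC "
        ">6.2|2711169|40|CP033401|PILER-CR "
        "CATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGG |"
    )
    for s in parse_spacers(line):
        # Report whether the declared length agrees with the sequence.
        print(s.spacer_id, s.start, s.length, len(s.sequence) == s.length)
```

Comparing the declared length with the actual sequence length is a cheap way to catch records that were truncated during extraction.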
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP033401_7 | 4174124-4174263 | Orphan |
NA
Consensus repeat of CP033401_7
|
1 spacer
spacers of CP033401_7
>7.1|4174173|42|CP033401|CRISPRCasFinder ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT |
CRISPR arrays and Neighbor proteins around CP033401_7
The CRISPR arrays of CP033401_7 >merge|CP033401|7|4174124-4174263|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGTTTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA >CP033401|7|6|4174124-4174263|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT TTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA
>CP033401.1|AYQ03698.1|4173031_4174072_+|permease MTGQSSSQAATPIQWWKPALFFLVVIAGLWYVKWEPYYGKAFTAAETHSIGKSILAQADANPWQAALDYAMIYFLAVWKAAVLGVILGSLIQVLIPRDWLLRTLGQSRFRGTLLGTLFSLPGMMCTCCAAPVAAGMRRQQVSMGGALAFWMGNPVLNPATLVFMGFVLSWGFAAIRLVAGLVMVLLIATLVQKWVRETPQTQAPVEIDIPEAQGGFFSRWGRALWTLFWSTIPVYILAVLVLGAARVWLFPHADGTVDNSLMWVVAMAVAGCLFVIPTAAEIPIVQTMMLAGMGTAPALALLMTLPAVSLPSLIMLRKAFPAKALWLTGAMVAVSGVIVGGLALLF >CP033401.1|AYQ03697.1|4172323_4172959_+|NAD-dependent-epimerase/dehydratase-family-protein MSQVLITGATGLVGGHLLRMLINEPKVNAIAAPTRRPLGDMPGVFNPHDPQLTDALAQVTDPIDIVFCCLGTTRREAGSKEAFIHADYTLVVDTALTGRRLGAQHMLVVSAMGANAHSPFFYNRVKGEMEEALIAQNWPKLTIARPSMLLGDRSKQRMNETLFAPLFRLLPGNWKSIDARDVARVMLAESMRPEHEGVTILSSSELRKRAE >CP033401.1|AYQ03696.1|4171677_4172196_-|glutamine-amidotransferase MSKKIAVLITDEFEDSEFTSPADEFRKAGHEVITIEKQAGKTVKGKKGEASVTIDKSIDEVTPAEFDALLLPGGHSPDYLRGDNRFVTFTRDFVNSGKPVFAICHGPQLLISADVIRGRKLTAVKPIIIDVKNAGAEFYDQEVVVDKDQLVTSRTPDDLPAFNREALRLLGA >CP033401.1|AYQ03695.1|4171254_4171698_+|hypothetical-protein METLIAISRWLAKQHVVTWCVQQEGELWCANAFYLFDAQKVAFYILTEEKTRHAQMSGPQAAVAGTVNGQPKTVALIRGVQFKGEIRRLEGEESDLARKAYNRRFPVARMLSAPVWEIRLDEIKFTDNTLGFGKKMIWLRDSGTEQA >CP033401.1|AYQ03694.1|4170901_4171204_-|GIY-YIG-nuclease-family-protein MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAFSAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD >CP033401.1|AYQ03693.1|4170411_4170915_+|N-acetyltransferase MLIRVEIPIDAPGIDALLRRSFESDAEAKLVHDLREDGFLTLGLVATDDEGQVIGYVAFSPVDVQGEDLQWVGMAPLAVDEKYRGQGLARQLVYEGLDSLNEFGYAAVVTLGDPALYSRFGFELAAHHDLRCRWPGTESAFQVHRLADDALNGVTGLVEYHEHFNRF >CP033401.1|AYQ03692.1|4169893_4170418_+|SCP2-domain-containing-protein MLDKLRSRIVHLGPSLLSVPVKLTPFALKRQVLEQVLSWQFRQALDDGELEFLEGRWLSIHVRDIDLQWFTSVVNGKLVVSQNAQADVSFSADASDLLMIAARKQDPDTLFFQRRLVIEGDTELGLYVKNLMDAIELEQMPKALRMMLLQLADFVEAGMKNAPETKQTSVGEPC >CP033401.1|AYQ03691.1|4168689_4169685_-|collagenase 
MELLCPAGNLPALKAAIENGADAVYIGLKDDTNARHFAGLNFTEKKLQEAVSFVHQHRRKLHIAINTFAHPDGYARWQRAVDMAAQLGADALILADLAMLEYAAERYPHIERHVSVQASATNEEAINFYHRHFDVARVVLPRVLSIHQVKQLARVTPVPLEVFAFGSLCIMSEGRCYLSSYLTGESPNTVGACSPARFVRWQQTPQGLESRLNEVLIDRYQDGENAGYPTLCKGRYLVDGERYHALEEPTSLNTLELLPELMAANIASVKIEGRQRSPAYVSQVAKVWRQAIDRCKADPQNFVPQSAWMETLGSMSEGTQTTLGAYHRKWQ >CP033401.1|AYQ03690.1|4167802_4168681_-|U32-family-peptidase MKYSLGPVLWYWPKETLEEFYQQAATSSADVIYLGEAVCSKRRATKVGDWLEMAKSLAGSGKQIVLSTLALVQASSELGELKRYVENGEFLIEASDLGVVNMCAERKLPFVAGHALNCYNAVTLKILLKQGMMRWCMPVELSRDWLVNLLNQCDELGIRNQFEVEVLSYGHLPLAYSARCFTARSEDRPKDECETCCIKYPNGRNVLSQENQQVFVLNGIQTMSGYVYNLGNELASMQGLVDVVRLSPQGTDTFAMLDAFRANENGAAPLPLTANSDCNGYWRRLAGLELQA >CP033401.1|AYQ03689.1|4166589_4167597_-|LLM-class-flavin-dependent-oxidoreductase MTDKTIAFSLLDLAPIPEGSSAREAFSHSLDLARLAEKRGYHRYWLAEHHNMTGIASAATSVLIGYLAANTTTLHLGSGGVMLPNHSPLVIAEQFGTLNTLYPGRIDLGLGRAPGSDQRTMMALRRHMSGDIDNFPRDVAELVDWFDARDPNPNVRPVPGYGEKIPVWLLGSSLYSAQLAAQLGLPFAFASHFAPDMLFQALHLYRSNFKPSARLEKPYAMVCINIIAADSNRDAEFLFTSMQQAFVKLRRGETGQLPPPIQNMDQFWSPSEQYGVQQALSMSLVGDKAKVRHGLQSILRETDADEIMVNGQIFDHQARLHSFELAMDVKEELLG >CP033401.1|AYQ03699.1|4174276_4174852_-|osmotically-inducible-protein-OsmY MKALSPIAVLISALLLQGCVAAAVVGTAAVGTKAATDPRSVGTQVDDGTLEVRVNSALSKDEQIKKEARINVTAYQGKVLLVGQSPNAELSARAKQIAMGVDGANEVYNEIRQGQPIGLGEASNDTWITTKVRSQLLTSDLVKSSNVKVTTENGEVFLMGLVTEREAKAAADIASRVSGVKRVTTAFTFIK >CP033401.1|AYQ03700.1|4174861_4175452_-|DnaA-initiator-associating-protein-DiaA MQERIKACFTESIQTQIAAAEALPDAISRAAMTLVQSLLNGNKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIANDRLHDEVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVALTGYDGGELAGLLGPQDVEIRIPSHRSARIQEMHMLTVNCLCDLIDNTLFPHQDD >CP033401.1|AYQ03701.1|4175471_4175867_-|YraN-family-protein MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS >CP033401.1|AYQ03702.1|4175824_4177861_-|penicillin-binding-protein-activator 
MVPSTFSRLKAARCLPVVLAALIFAGCGTHTPDQSTAYMQGTAQADSAFYLQQMQQSSDDTRINWQLLAIRALVKEGKTGQAVELFNQLPQELNDSQRREKTLLAVEIKLAQKDFAGAQNLLAKITPADLEQNQQARYWQAKIDASQGRPSIDLLRALIAQEPLLGAKEKQQNIDATWQALSSMTQEQANTLVINADENILQGWLDLQRVWFDNRNDPDMMKAGIADWQKRYPNNPGAKMLPTQLVNVKAFKPASTNKIALLLPLNGQAAVFGRTIQQGFEAAKNIGTQPVAAQVAAAPAADVAEQPQPQTVDGVASPAQASVSDLTGEQPAAQPVPVSAPATSTAAVSAPANPSAELKIYDTSSQPLSQILSQVQQDGASIVVGPLLKNNVEELLKSNTPLNVLALNQPENIENRVNICYFALSPEDEARDAARHIRDQGKQAPLVLIPRSSLGDRVANAFAQEWQKLGGGTVLQQKFGSTSELRAGVNGGSGIALTGSPITPRATTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGEIAFIKPMIAMRNGSQSGATLYASSRSAQGTAGPDFRLEMEGLQYSEIPMLAGGNLPLMQQALSAVNNDYSLARMYAMGVDAWSLANHFSQMRQVQGFEINGNTGSLTANPDCVINRKLSWLQYQQGQVVPAS >CP033401.1|AYQ03703.1|4177925_4178786_+|rRNA-(cytidine-2'-O-)-methyltransferase MKQHQSADNSQGQLYIVPTPIGNLADITQRALEVLQAVDLIAAEDTRHTGLLLQHFGINARLFALHDHNEQQKAETLLAKLQEGQNIALVSDAGTPLINDPGYHLVRTCREAGIRVVPLPGPCAAITALSAAGLPSDRFCYEGFLPAKSKGRRDALKAIEAEPRTLIFYESTHRLLDSLEDIVAVLGESRYVVLARELTKTWETIHGAPVGELLAWVKEDENRRKGEMVLIVEGHKAQEEDLPADALRTLALLQAELPLKKAAALAAEIHGVKKNALYKYALEQQG >CP033401.1|AYQ03704.1|4178828_4179920_-|fimbrial-protein MKRAPLITGLLLISTSCAYASSGGCGADSTSGATNYSSVVDDVTVNQTDNVTGREFTSATLSSTNWQYACSCSAGKAVKLVYMVSPVLTTTGHQAGYYKLNDSLDIKTTLKANDIPGLVTDQTVSVNTRFTQIKSNTVYSAATQTGVCQGDTSRYGPVNIGANTTFTLYVTKPFLGSMTIPKTDIAVIKGAWVDGMGSPSTGDFHDLVKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQKFTTRNAMPDGFTPVDFDITYDCGDTSKIKNSLQMRIDGTTGVVDQYNLVARRRSSDNAPDVGIRIENLGGGVANIPFQNGILPVDPSGHGTVNMRAWPVNLVGGELETGKFQGTATITVIVR >CP033401.1|AYQ03705.1|4179930_4182303_-|fimbrial-biogenesis-outer-membrane-usher-protein 
MLETTKSGMQTTDLSRFSKKYAQLPGTYQVDIWLNKKKVSQKKITFTANAEQLLQPQFTVEQLRELGIKVDEIPALAEKDDDSVINSLEQIIPGTAAEFDFNHQRLNLSIPQIALYRDARGYVSPSRWDDGIPTLFTNYSFTGSDNRYRQGNRSQRQYLNMQNGANFGPWRLRNYSTWTRNDQTSSWNTISSYLQRDIKALKSQLLLGESATSGSIFSSYTFTGVQLASDDNMLPNSQRGFAPTVRGIANSSAIVTIRQNGYVIYQSNVPAGAFEINDLYPSSNSGDLEVTIEESDGTQRRFIQPYSSLPMMQRPGHLKYSATAGRYRADANSDSKEPEFAEATAIYGLNNTFTLYSGLLGSEDYYALGIGIGGTLGALGALSMDINRADTQFDNQHSFHGYQWRTQYIKDIPETNTNIAVSYYRYTNDGYFSFDEANTRNWDYNSRQKSEIQFNISQTIFDGVSLYASGSQQDYWGNNEKNRNISVGVSGQQWGIGYSLNYQYSRYTDQNNDRALSLNLSIPLERWLPRSRVSYQMTSQKDRPTQHEMRLDGSLLDDGRLSYSLEQSLDDDNNHNSSVNASYRSPYGTFSAGYSYGNDSSQYNYGVTGGVVIHPHGVTLSQYLGNAFALIDANGASGVRIQNYPGIATDPFGYAVVPYLTTYQENRLSVDTTQLPDNVDLEQTTQFVVPNRGAMVAARFNANIGYRVLVTVSDRNGKPLPFGALASNDETGQQSIVDEGGILYLSGISSKSQSWTVRWGNQADQQCQFAFSTPDSEPTTSVLQGTAQCH >CP033401.1|AYQ03706.1|4183452_4184208_-|galactosamine-6-phosphate-isomerase MERGTASGGASLLKEFHPVQTLQQVENYTALSERASEYLLAVIRSKPDAVICLATGATPLLTYHYLVEKIHQQQVDVSQLTFVKLDEWVDLPLTMPGTCETFLQQHIVQPLGLREDQLISFRSEEINETECERVTNLIARKGGLDLCVLGLGKNGHLGLNEPGESLQPACHISQLDARTQQHEMLKTAGRPVTRGITLGLKDILNAREVLLLVTGEGKQDATERFLTAKVSTAIPASFLWLHSNFICLINT >CP033401.1|AYQ03707.1|4184208_4185000_-|PTS-N-acetylgalactosamine-transporter-subunit-IID MGSEISKKDITRLGFRSSLLQASFNYERMQAGGFTWAMLPILKKIYKDDKPGLSAAMKDNLEFINTHPNLVGFLMGLLISMEEKGENRDTIKGLKVALFGPIAGIGDAIFWFTLLPIMAGICSSFASQGNLLGPILFFAVYLLIFFLRVGWTHVGYSVGVKAIDKVRENSQMIARSATILGITVIGGLIASYVHINVVTSFAIDSTHSVALQQDFFDKVFPNILPMAYTLLMYYFLRVKKAHPVLLIGVTFVLSIVCSAFGIL >CP033401.1|AYQ03708.1|4184989_4185793_-|N-acetylgalactosamine-permease-IIC-component-1 MHEITLLQGLSLAALVFVLGIDFWLEALFLFRPIIVCTLTGAILGDIQTGLITGGLTELAFAGLTPAGGVQPPNPIMAGLMTTVIAWSTGVDAKTAIGLGLPFSLLMQYVILFFYSAFSLFMTKADKCAKEADTAAFSRLNWTTMLIVASAYAVIAFLCTYLAQGAMQALVKAMPAWLTHGFEVAGGILPAVGFGLLLRVMFKAQYIPYLIAGFLFVCYIQVSNLLPVAVLGAGFAVYEFFNAKSRQQAQPQPVASKNEEEDYSNGI |
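Each "CRISPR arrays of ..." entry on this page gives the array twice: once as a single `>merge|...` sequence and once split into alternating repeat and spacer segments. A quick consistency check is to confirm that the segments concatenate back to the merged sequence and that its length matches the stated coordinate range. The sketch below assumes the coordinates are 1-based and inclusive and that the fourth header field is `start-end`; both are inferred from the entries shown here rather than from any format specification, and `check_array` is a hypothetical helper name.

```python
def check_array(line):
    """Check one flattened CRISPR array entry for internal consistency.

    Expects text containing exactly two FASTA-style headers:
    '>merge|contig|n|start-end|tool MERGED' followed by
    '>contig|n|k|start-end|tool REPEAT SPACER REPEAT ...'.
    Any words before the first header are ignored.
    """
    tokens = line.split()
    header_idx = [i for i, t in enumerate(tokens) if t.startswith(">")]
    if len(header_idx) != 2:
        raise ValueError("expected a merged record and a segmented record")

    first, second = header_idx
    merged = "".join(tokens[first + 1:second])
    segments = tokens[second + 1:]

    # Assumed header layout: the fourth '|' field is 'start-end'.
    start, end = map(int, tokens[first].split("|")[3].split("-"))

    segments_match = "".join(segments) == merged
    length_match = len(merged) == end - start + 1  # inclusive coordinates
    return segments_match and length_match


if __name__ == "__main__":
    # Toy entry in the same shape as the records on this page.
    toy = (
        "The CRISPR arrays of X_1 "
        ">merge|X|1|10-19|Tool AAACCCGGAA "
        ">X|1|1|10-19|Tool AAA CCCG GAA"
    )
    print(check_array(toy))
```

A `False` result would suggest that the entry was damaged during extraction or that the coordinate assumption does not hold for that record.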
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP033401_8 | 4218662-4218779 | Orphan |
NA
Consensus repeat of CP033401_8
|
1 spacer
spacers of CP033401_8
>8.1|4218702|38|CP033401|CRISPRCasFinder GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA |
CRISPR arrays and Neighbor proteins around CP033401_8
The CRISPR arrays of CP033401_8 >merge|CP033401|8|4218662-4218779|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGGGTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGATGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG >CP033401|8|7|4218662-4218779|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGG GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA TGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG
>CP033401.1|AYQ03737.1|4217330_4218641_+|serine-dehydratase-subunit-alpha-family-protein MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPGTGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFSRAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTQQACVAEGEQESPLTVLSRTTLAEILKFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVIRTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAIYIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR >CP033401.1|AYQ03736.1|4215126_4216143_+|IS5-like-element-IS5-family-transposase MFVIWSHGTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMVEVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMRLFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQGTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSGLTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVDWLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRAKVEHPFRIIKRQFGFVKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH >CP033401.1|AYQ03735.1|4214772_4215117_+|transporter MEIASNKGVIADASTPAGRAGMSESEWREAIKFDSTDTGWVIMSIGMAIGAGIVFLPVQVGLMGLWVFLLSSVIGYPAMYLFQRLFINTLAESPECKDYPSVISGYLGKVRTSP >CP033401.1|AYQ03734.1|4213133_4214498_+|L-serine-ammonia-lyase MISAFDIFKIGIGPSSSHTVGPMNAGKSFIDRLESSGLLTATSHIVVDLYGSLSLTGKGHATDVAIIMGLAGNSPQDVVIDEIPAFIELVTRSGRLPVASGAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEALLSKTYYSVGGGFIVEEEHFGLSHDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALRSKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRAVALRRQLVSSDNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYYDKFRRPVNERSIARYFLAAGAIGALYKMNASISGAEVGCQGEIGVACSMAAAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINAVKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIKVVCG >CP033401.1|AYQ03733.1|4212672_4213062_+|enamine/imine-deaminase MKKIIETQRAPGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINEVYKQFFDEHQATYPTRSYVQVARLPKDVKLEIEAIAVRSA >CP033401.1|AYQ03732.1|4210364_4212659_+|PFL-like-enzyme-TdcE 
MKVDIDTSDKLYADAWLGFKGTDWKNEINVRDFIQHNYTPYEGDESFLAEATPATTELWEKVMEGIRIENATHAPVDFDTNIATTITAHDAGYINQPLEKIVGLQTDAPLKRALHPFGGINMIKSSFHAYGREMDSEFEYLFTDLRKTHNQGVFDVYSPDMLRCRKSGVLTGLPDGYGRGRIIGDYRRVALYGISYLVRERELQFADLQSRLEKGEDLEATIRLREELAEHRHALLQIQEMAAKYGFDISRPAQNAQEAVQWLYFAYLAAVKSQNGGAMSLGRTASFLDIYIERDFKAGVLNEQQAQELIDHFIMKIRMVRFLRTPEFDSLFSGDPIWATEVIGGMGLDGRTLVTKNSFRYLHTLHTMGPAPEPNLTILWSEELPIAFKKYAAQVSIVTSSLQYENDDLMRTDFNSDDYAIACCVSPMVIGKQMQFFGARANLAKTLLYAINGGVDEKLKIQVGPKTAPLMDDVLDYDKVMDSLDHFMDWLAVQYISALNIIHYMHDKYSYEASLMALHDRDVYRTMACGIAGLSVATDSLSAIKYARVKPIRDENGLAVDFEIDGEYPQYGNNDERVDSIACDLVERFMKKIKALPTYRNAVPTQSILTITSNVVYGQKTGNTPDGRRAGTPFAPGANPMHGRDRKGAVASLTSVAKLPFTYAKDGISYTFSIVPAALGKEDPVRKTNLVGLLDGYFHHEADVEGGQHLNVNVMNREMLLDAIEHPEKYPNLTIRVSGYAVRFNALTREQQQDVISRTFTQAL >CP033401.1|AYQ03731.1|4209122_4210331_+|propionate-kinase MNEFPVVLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVNGGEPAPLAHHSYEGALKAIAFELEKRNLNDSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLHNYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTSHRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERAQLAIKTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRSNSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGKVNAPAEFA >CP033401.1|AYQ03730.1|4207765_4209097_+|threonine/serine-transporter-TdcC MSTSDSIVSSQTKQSSWRKSDTTWTLGLFGTAIGAGVLFFPIRAGFGGLIPILLMLVLAYPIAFYCHRALARLCLSGSNPSGNITETVEEHFGKTGGVVITFLYFFAICPLLWIYGVTITNTFMTFWENQLGFAPLNRGFVALFLLLLMAFVIWFGKDLMVKVMSYLVWPFIASLVLISLSLIPYWNSAVIDQVDLGSLSLTGHDGILITVWLGISIMVFSFNFSPIVSSFVVSKREEYEKDFGRDFTERKCSQIISRASMLMVAVVMFFAFSCLFTLSPANMAEAKAQNIPVLSYLANHFASMTGTKTTFAITLEYAASIIALVAIFKSFFGHYLGTLEGLNGLILKFGYKGDKTKVSLGKLNTISMIFIMGSTWVVAYANPNILDLIEAMGAPIIASLLCLLPMYAIRKAPSLAKYRGRLDNVFVTVIGLLTILNIVYKLF >CP033401.1|AYQ03729.1|4206754_4207744_+|serine/threonine-dehydratase 
MHITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTDAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRTTGTLADGCDVSRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTVSIISGGNIDLSRVSQITGFVDA >CP033401.1|AYQ03728.1|4205717_4206656_+|transcriptional-regulator MSTILLPKTQHLVVFQEVIRSGSIGSAAKELGLTQPAVSKIINDIEDYFGVELVVRKNTGVTLTPAGQLLLSRSESITREMKNMVNEISGMSSEAVVEVSFGFPSLIGFTFMSGMINKFKEVFPKAQVSMYEAQLSSFLPAIRDGRLDFAIGTLSAEMKLQDLHVEPLFESEFVLVASKSRTCTGTTTLESLKNEQWVLPQTNMGYYSELLTTLQRNGISIENIVKTDSVVTIYNLVLNADFLTVIPCDMTSPFGSNQFITIPVEETLPVAQYAAVWSKNYRIKKAASVLVELAKEYSSYNGCRRRQLIEVG >CP033401.1|AYQ03738.1|4218852_4219017_-|hypothetical-protein MSKKSAKKRQPVKPVVAKEPARTAKNFGYEEMLSELEAIVADAETRLAEDEATA >CP033401.1|AYQ03739.1|4219039_4219741_-|pirin-like-protein-YhaK MITTRTARQCGQADYGWLQARYTFSFGHYFDPKLLGYASLRVLNQEVLAPGAAFQPRTYPKVDILNVILDGEAEYRDSEGNHVQASAGEALLLSTQPGVSYSEHNLSKDKPLTRMQLWLDACPQRENPLIQKLALNMGKQQLIASPEGTMGSLQLRQQVWLHHIVLDKGESANFQLHGPRAYLQSIHGKFHALTHHEEKAALTCGDGAFIRDEANITLVADSPLRALLIDLPV >CP033401.1|AYQ03740.1|4219845_4220742_+|LysR-family-transcriptional-regulator MAKERALTLEALRVMDAIDRRGSFAAAADELGRVPSALSYTMQKLEEELDVVLFDRSGHRTKFTNVGRMLLERGRVLLEAADKLTTDAEALARGWETHLTIVTEALVPTPAFFPLIDKLAAKANTQLAIITEVLAGAWERLEQGRADIVIAPDMHFRSSSEINSRKLYTLMNVYVAAPDHPIHQEPEPLSEVTRVKYRGIAVADTARERPVLTVQLLDKQPRLTVSTIEDKRQALLAGLGVATMPYPMVEKDIAEGRLRVVSPESTSEIDIIMAWRRDSMGEAKSWCLREIPKLFSGK >CP033401.1|AYQ03741.1|4220792_4221149_-|DUF805-domain-containing-protein MQWYLAVLKNYVGFSGRARRKEYWMFTLINAIVGAIINVIQLILGLEFPFLSLIYLAATIIPVIALCVRRLHDTDRSGAWALLYLVPIIGWLVLFVFACLEGNSGSNRYGNDPKFGSN >CP033401.1|AYQ03742.1|4221390_4221756_-|DUF805-domain-containing-protein MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAGGEGILTTIYGILVFLPWWAVQFRRLHDTDRSAWWALLFLIPFIGWLIIIVFNCQAGTPGENRFGPDPKLEP >CP033401.1|AYQ03743.1|4222048_4223035_-|glutathione-S-transferase-family-protein 
MGQLIDGVWHDTWYDTKSTGGKFQRSASAFRNWLTADGAPGPTGTGGFIAEKDRYHLYVSLACPWAHRTLIMRKLKGLEPFISVSVVNPLMLENGWTFDDSFPGATGDTLYQHEFLYQLYLHADPHYSGRVTVPVLWDKKNHTIVSNESAEIIRMFNTAFDALGAKAGDYYPPALQTKIDELNGWIYDTVNNGVYKAGFATSQQAYDEAVAKVFESLARLEQILGQHRYLTGNQLTEADIRLWTTLVRFDPVYVTHFKCDKHRISNYLNLYGFLRDIYQMPGIAETVNFDHIRNHYFRSHKTINPTGIISIGPWQDLDEPHGRDVRFG >CP033401.1|AYQ03744.1|4223104_4223587_-|DoxX-family-protein MILSIDSNDANTAPLHKKTISSLSGAVESMMKKLEDVGVLVARILMPILFITAGWGKITGYAGTQQYMEAMGVPGFMLPLVILLEFGGGLAILFGFLTRTTALFTAGFTLLTAFLFHSNFAEGVNSLMFMKNLTISGGFLLLAITGPGAYSIDRLLNKKW >CP033401.1|AYQ03745.1|4223682_4223982_-|hypothetical-protein MSSKVERERRKAQLLSQIQQQRLDLSASRREWLEATGAYDRRWNMLLSLRSWALVGSSVMAIWTIRHPNMLVRWARRGFGVWSAWRLVKTTLKQQQLRG >CP033401.1|AYQ03746.1|4223971_4224376_-|hypothetical-protein MADTHHAQGPGKSVLGIGQRIVSIMVEMVETRLRLAVVELEEEKANLFQLLLMLGLTMLFAAFGLMSLMVLIIWAVDPQYRLNAMIATTVVLLLLALIGGIWTLRKSRKSTLLRHTRHELANDRQLLEEESREQ >CP033401.1|AYQ03747.1|4224378_4224684_-|DUF883-domain-containing-protein MSKEHTTEHLRAELKSLSDTLEEVLSSSGEKSKEELSKIRSKAEQALKQSRYRLGETGDAIAKQTRVAAARADEYVRENPWTGVGIGAAIGVVLGVLLSRR |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP033401_9 | 4592112-4592567 | Orphan |
I-E
Consensus repeat of CP033401_9
|
7 spacers
spacers of CP033401_9
>9.1|4592141|32|CP033401|PILER-CR,CRISPRCasFinder,CRT TCCACGCTGTAACGGCCATCATTAAGTTTAGT >9.2|4592202|32|CP033401|PILER-CR,CRISPRCasFinder,CRT GAAGTAGGCCTGACAGTGATTGAACGCATACT >9.3|4592263|32|CP033401|PILER-CR,CRISPRCasFinder,CRT AGTTGGGGCGGCGCAATAACGAGACGATACGC >9.4|4592324|32|CP033401|PILER-CR,CRISPRCasFinder,CRT GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT >9.5|4592385|32|CP033401|PILER-CR,CRISPRCasFinder,CRT TCAACGCGCTCAGACGTTGCGTGAGTGAACCA >9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT AAATATCCAGGGCTGGGCTGGAGGCAGACGGC >9.7|4592507|32|CP033401|PILER-CR,CRISPRCasFinder,CRT CCCGGAATGCATTCTGAAGGTTTGCTGTATAT |
CRISPR arrays and Neighbor proteins around CP033401_9
The CRISPR arrays of CP033401_9 >merge|CP033401|9|4592112-4592567|PILER-CR,CRISPRCasFinder,CRT GAGTTCCCCGCGCCAGCGGGGATAAACCGTCCACGCTGTAACGGCCATCATTAAGTTTAGTGAGTTCCCCGCGCCAGCGGGGATAAACCGGAAGTAGGCCTGACAGTGATTGAACGCATACTGAGTTCCCCGCGCCAGCGGGGATAAACCGAGTTGGGGCGGCGCAATAACGAGACGATACGCGAGTTCCCCGCGCCAGCGGGGATAAACCGGGGAGTGGCACTTCTGGGGTAGCGGCGGCCCTGAGTTCCCCGCGCCAGCGGGGATAAACCGTCAACGCGCTCAGACGTTGCGTGAGTGAACCAGAGTTCCCCGCGCCAGCGGGGATAAACCGAAATATCCAGGGCTGGGCTGGAGGCAGACGGCGAGTTCCCCGCGCCAGCGGGGATAAACCGCCCGGAATGCATTCTGAAGGTTTGCTGTATATGAGTTCCCCGCGCCAGCGGGGATAAACCA >CP033401|9|2|4592112-4592567|PILER-CR GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >CP033401|9|8|4592112-4592567|CRISPRCasFinder GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >CP033401|9|1|4592112-4592567|CRT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC 
GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA
>CP033401.1|AYQ04067.1|4591100_4591772_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVISRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >CP033401.1|AYQ04066.1|4590821_4590962_-|hypothetical-protein MSEENKENGFNHVKTFTKIIFIFSVLVFNDNESKITDAAVNLFIQI >CP033401.1|AYQ04065.1|4589935_4590808_-|YgcG-family-protein MRYFILMFTFVCSFVAAQPTIVPQLQQQVTDLTSSLNSQEKKELTHKLESIFNNTQVQIAVLIVPTTKDETIEQYATRVFDNWRLGDAKRNDGILIIVAWSDRTVRIKVGYGLEEKVTDALAGDIIRSNMIPAFKQQKLAQGLELAINALNNQLTSQHQYPTNPSESESASSSDHYYFAIFWVFAVMFFPFWFFHQCSNFCRACKSGVCISAIYLLDLFLFSDKIFSIAVFSFFFTFTIFMVFTCLCVLQKRASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASGRW >CP033401.1|AYQ04064.1|4588577_4589876_+|enolase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >CP033401.1|AYQ04063.1|4586852_4588490_+|CTP-synthetase MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >CP033401.1|AYQ04062.1|4585833_4586625_+|nucleoside-triphosphate-pyrophosphohydrolase 
MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >CP033401.1|AYQ04061.1|4585427_4585763_+|mRNA-interferase-MazF MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >CP033401.1|AYQ04060.1|4585179_4585428_+|MazF-MazE-toxin-antitoxin-system-antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >CP033401.1|AYQ04059.1|4582867_4585102_+|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >CP033401.1|AYQ04058.1|4581518_4582820_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHDVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILAPQLEALLPKVRACLGSLQAMRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK >CP033401.1|AYQ04068.1|4593204_4594683_-|sugar-kinase 
MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDAKAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNYFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDKMVRVKDIFMPVESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNADSIQSWSNA >CP033401.1|AYQ04069.1|4594709_4595987_-|MFS-transporter MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSIIGLLALTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFIFLLFQKIRTADSAPAMASSK >CP033401.1|AYQ04070.1|4596305_4597091_+|SDR-family-oxidoreductase MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVDITAEGAPQKIIAASCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASPASNYVNGHLLVVDGGYLVR >CP033401.1|AYQ04071.1|4597160_4598615_+|FAD-binding-oxidoreductase MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREVMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKATGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >CP033401.1|AYQ04072.1|4598708_4600046_+|MFS-transporter 
MNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >CP033401.1|AYQ04073.1|4600023_4600803_+|electron-transfer-flavoprotein-subunit-beta/FixA-family-protein MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWRDYLRQRMQP >CP033401.1|AYQ04074.1|4600799_4601660_+|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLIIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >CP033401.1|AYQ04075.1|4601807_4602383_-|glycerol-3-phosphate-responsive-antiterminator MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >CP033401.1|AYQ04076.1|4602399_4602660_-|ferredoxin-family-protein MSVARNLWRVADAPHIVPADSVERQTAERLISACPAGLFSLTPEGDLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >CP033401.1|AYQ04077.1|4602650_4603922_-|FAD-dependent-oxidoreductase 
MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP033401_10 | 4614952-4615530 | Unclear |
I-E
Consensus repeat of CP033401_10
|
9 spacers
spacers of CP033401_10
>10.1|4614982|31|CP033401|CRISPRCasFinder TTGCCCGCGCAATTCCGGGAGCATCCGCAAT >10.2|4615043|31|CP033401|CRISPRCasFinder ACGGACAAAATATATATTGATTTGCGAATTA >10.3|4615104|31|CP033401|CRISPRCasFinder GTAAAGAAACTGCCGACAAATCCCTGTTCGT >10.4|4615165|31|CP033401|CRISPRCasFinder CCCGTCACCGACGCGCAGTGGCGCTACCGTG >10.5|4615226|31|CP033401|CRISPRCasFinder GGATCTAACGCGCTGTAAAAATTCCGTGCTT >10.6|4615287|31|CP033401|CRISPRCasFinder TGCGGATTACCGGCAAAACATGGGAGCAAAC >10.7|4615348|31|CP033401|CRISPRCasFinder CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT >10.8|4615409|31|CP033401|CRISPRCasFinder GTTTACCGCCCCGCAGAGGCGCTGGCAGATC >10.9|4615470|31|CP033401|CRISPRCasFinder GGATGACCTGTCGCTAAAACTCGCCGCGTAC >10.10|4614982|32|CP033401|PILER-CR,CRT TTGCCCGCGCAATTCCGGGAGCATCCGCAATT >10.11|4615043|32|CP033401|PILER-CR,CRT ACGGACAAAATATATATTGATTTGCGAATTAT >10.12|4615104|32|CP033401|PILER-CR,CRT GTAAAGAAACTGCCGACAAATCCCTGTTCGTT >10.13|4615165|32|CP033401|PILER-CR,CRT CCCGTCACCGACGCGCAGTGGCGCTACCGTGA >10.14|4615226|32|CP033401|PILER-CR,CRT GGATCTAACGCGCTGTAAAAATTCCGTGCTTT >10.15|4615287|32|CP033401|PILER-CR,CRT TGCGGATTACCGGCAAAACATGGGAGCAAACC >10.16|4615348|32|CP033401|PILER-CR,CRT CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA >10.17|4615409|32|CP033401|PILER-CR,CRT GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC >10.18|4615470|32|CP033401|PILER-CR,CRT GGATGACCTGTCGCTAAAACTCGCCGCGTACA |
cas2,cas1,cas6e,cas5 |
CRISPR arrays and Neighbor proteins around CP033401_10
The CRISPR arrays of CP033401_10 >merge|CP033401|10|4614952-4615530|CRISPRCasFinder,PILER-CR,CRT TGTGTTCCCCGCGCCAGCGGGGATAAACCGTTGCCCGCGCAATTCCGGGAGCATCCGCAATTGTGTTCCCCGCGCCAGCGGGGATAAACCGACGGACAAAATATATATTGATTTGCGAATTATGTGTTCCCCGCGCCAGCGGGGATAAACCGGTAAAGAAACTGCCGACAAATCCCTGTTCGTTGTGTTCCCCGCGCCAGCGGGGATAAACCGCCCGTCACCGACGCGCAGTGGCGCTACCGTGAGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATCTAACGCGCTGTAAAAATTCCGTGCTTTGTGTTCCCCGCGCCAGCGGGGATAAACCATGCGGATTACCGGCAAAACATGGGAGCAAACCGTGTTCCCCGCGCCAGCGGGGATAAACCGCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTAGTGTTCCCCGCGCCAGCGGGGATAAACCGGTTTACCGCCCCGCAGAGGCGCTGGCAGATCCGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATGACCTGTCGCTAAAACTCGCCGCGTACAGTGTTCCCCGCGCCAGCGGGGATAAACCG >CP033401|10|9|4614952-4615530|CRISPRCasFinder TGTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAAT TGTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTA TGTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGT TGTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTG AGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTT TGTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAAC CGTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT AGTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATC CGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTAC AGTGTTCCCCGCGCCAGCGGGGATAAACCG >CP033401|10|3|4614953-4615530|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG >CP033401|10|2|4614953-4615530|CRT 
GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG
>CP033401.1|AYQ04086.1|4614562_4614856_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQITQLAGCGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ >CP033401.1|AYQ04085.1|4613642_4614566_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLAATVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALTEDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRQTYALLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGYAPAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLACRDIFRSTKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPETLGDSGHRGRGG >CP033401.1|AYQ04084.1|4612995_4613646_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQERPAESDTFTIECRSFAPELRTGQQLCFNLRANPTICKSGKRHDLLMEAKRQVRGQAEGSDVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFSSVDYTGMLTVTDPGLFLQRLSQGYGKSRAFGCGLMLIKPGAEA >CP033401.1|AYQ04083.1|4612267_4613014_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLIFQLHGPMASWGVDAPGEVRHTHELPSRSALLGLLAAGVGIRRDDTERLNAFNRHYSLVVCASRNPRWARDYHTIQMPKEVRKARYFSRREELSDPDLLSAIISRRDYYTDAWWMVAVATTADAPYSLEQLQDGLRHPVFPLYLGRKSHPLALPLAPLLLEGNACDALCNAYQQYQDHFHKLKVSLPKLQDECWWEGEHDGLVASKILRRRDVPLNRQQWLFGERTINQGPWLSKEEPCTSQE >CP033401.1|AYQ04082.1|4609264_4609417_+|type-I-toxin-antitoxin-system-Hok-family-toxin MLTKYALVAIIVLCCTVLGFTLMVGDSLCELSIRERGMEFKAVLAYESKK >CP033401.1|AYQ04081.1|4608265_4609000_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIHPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMLEEETRFFGLKRECGLHEG >CP033401.1|AYQ04080.1|4606479_4608192_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit 
MSEKHPGPLVVEGKLTDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD >CP033401.1|AYQ04079.1|4604680_4606480_+|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTQVPPSALLPLNPEQLVRLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAEMPGITIISASQTGNARRVAEALRDDLLAAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGAVNEIHTSPYSKDAPLVASLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFELTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQAEVENEVHVTVGVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAPGKNWLFFGNPHFTEDFLYQVEWQRYVKDGVLTRIDLAWSRDQKEKVYVQDKLREQGAELWRWINDGAHIYVCGDANRMAKDVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY >CP033401.1|AYQ04078.1|4603999_4604365_-|6-carboxytetrahydropterin-synthase-QueD MMSTTLFKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIIDFAELKAAFKPTYERLDHHYLNDIPGLENPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCIYRGE >CP033401.1|AYQ04077.1|4602650_4603922_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL 
>CP033401.1|AYQ04087.1|4615611_4616649_-|aminopeptidase MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEIFDKAGIAVLSVEATNWNLGNKDGYQQRAKTAAFPAGNSWHDVRLDNQQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >CP033401.1|AYQ04088.1|4616900_4617809_+|sulfate-adenylyltransferase-subunit-2 MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >CP033401.1|AYQ04089.1|4617810_4619238_+|sulfate-adenylyltransferase MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEKTFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMAWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >CP033401.1|AYQ04090.1|4619237_4619843_+|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >CP033401.1|AYQ04091.1|4619892_4620216_+|DUF3561-family-protein MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >CP033401.1|AYQ04092.1|4620409_4620721_+|cell-division-protein-FtsB MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR 
>CP033401.1|AYQ04093.1|4620739_4621450_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >CP033401.1|AYQ04094.1|4621449_4621929_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYERGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >CP033401.1|AYQ04095.1|4621925_4622975_+|tRNA-pseudouridine(13)-synthase-TruD MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >CP033401.1|AYQ04096.1|4622955_4623717_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP033401_3 | 3.1|1680692|40|CP033401|CRISPRCasFinder | 1680692-1680731 | 40 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 47951-47990 | 0 | 1.0 |
CP033401_6 | 6.1|2711110|42|CP033401|PILER-CR | 2711110-2711151 | 42 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 141085-141126 | 0 | 1.0 |
CP033401_6 | 6.2|2711169|40|CP033401|PILER-CR | 2711169-2711208 | 40 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 141028-141067 | 1 | 0.975 |
CP033401_2 | 2.1|976060|38|CP033401|CRISPRCasFinder | 976060-976097 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 2 | 0.947 |
CP033401_5 | 5.1|2497699|48|CP033401|CRISPRCasFinder | 2497699-2497746 | 48 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 4089-4136 | 3 | 0.938 |
CP033401_5 | 5.1|2497699|48|CP033401|CRISPRCasFinder | 2497699-2497746 | 48 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 4088-4135 | 3 | 0.938 |
CP033401_5 | 5.1|2497699|48|CP033401|CRISPRCasFinder | 2497699-2497746 | 48 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 4088-4135 | 3 | 0.938 |
CP033401_5 | 5.1|2497699|48|CP033401|CRISPRCasFinder | 2497699-2497746 | 48 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 4088-4135 | 3 | 0.938 |
CP033401_7 | 7.1|4174173|42|CP033401|CRISPRCasFinder | 4174173-4174214 | 42 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 30214-30255 | 7 | 0.833 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_MG299151 | Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_KY471628 | Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence | 45716-45747 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_MG299131 | Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_KY471629 | Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence | 45716-45747 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_MG299133 | Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_MG299128 | Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_MG299147 | Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NC_018995 | Escherichia coli plasmid pHUSEC41-1, complete sequence | 29015-29046 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_CP053235 | Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence | 78292-78323 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_CP005999 | Escherichia coli B7A plasmid pEB1, complete sequence | 39563-39594 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | KU932021 | Escherichia coli plasmid pEC3I, complete sequence | 51902-51933 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_CP024154 | Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence | 18560-18591 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NC_011754 | Escherichia coli ED1a plasmid pECOED, complete sequence | 49240-49271 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_CP015141 | Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence | 81434-81465 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_LR213460 | Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3 | 28916-28947 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_MH287044 | Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence | 36182-36213 | 7 | 0.781 |
CP033401_9 | 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592446-4592477 | 32 | NZ_MH618673 | Escherichia coli strain 838B plasmid p838B-R, complete sequence | 32230-32261 | 7 | 0.781 |
CP033401_10 | 10.1|4614982|31|CP033401|CRISPRCasFinder | 4614982-4615012 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62712 | 7 | 0.774 |
CP033401_10 | 10.1|4614982|31|CP033401|CRISPRCasFinder | 4614982-4615012 | 31 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222136 | 7 | 0.774 |
CP033401_10 | 10.1|4614982|31|CP033401|CRISPRCasFinder | 4614982-4615012 | 31 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467672-2467702 | 7 | 0.774 |
CP033401_10 | 10.4|4615165|31|CP033401|CRISPRCasFinder | 4615165-4615195 | 31 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18007 | 7 | 0.774 |
CP033401_10 | 10.7|4615348|31|CP033401|CRISPRCasFinder | 4615348-4615378 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530641-530671 | 7 | 0.774 |
CP033401_7 | 7.1|4174173|42|CP033401|CRISPRCasFinder | 4174173-4174214 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24899-24940 | 8 | 0.81 |
CP033401_9 | 9.5|4592385|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592385-4592416 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1417960-1417991 | 8 | 0.75 |
CP033401_10 | 10.4|4615165|31|CP033401|CRISPRCasFinder | 4615165-4615195 | 31 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97498-97528 | 8 | 0.742 |
CP033401_10 | 10.7|4615348|31|CP033401|CRISPRCasFinder | 4615348-4615378 | 31 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14983 | 8 | 0.742 |
CP033401_10 | 10.7|4615348|31|CP033401|CRISPRCasFinder | 4615348-4615378 | 31 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15013 | 8 | 0.742 |
CP033401_10 | 10.7|4615348|31|CP033401|CRISPRCasFinder | 4615348-4615378 | 31 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3484 | 8 | 0.742 |
CP033401_10 | 10.7|4615348|31|CP033401|CRISPRCasFinder | 4615348-4615378 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148992-149022 | 8 | 0.742 |
CP033401_10 | 10.10|4614982|32|CP033401|PILER-CR,CRT | 4614982-4615013 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62713 | 8 | 0.75 |
CP033401_10 | 10.10|4614982|32|CP033401|PILER-CR,CRT | 4614982-4615013 | 32 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222137 | 8 | 0.75 |
CP033401_10 | 10.10|4614982|32|CP033401|PILER-CR,CRT | 4614982-4615013 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467671-2467702 | 8 | 0.75 |
CP033401_10 | 10.10|4614982|32|CP033401|PILER-CR,CRT | 4614982-4615013 | 32 | NC_008759 | Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence | 12670-12701 | 8 | 0.75 |
CP033401_10 | 10.13|4615165|32|CP033401|PILER-CR,CRT | 4615165-4615196 | 32 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18008 | 8 | 0.75 |
CP033401_10 | 10.13|4615165|32|CP033401|PILER-CR,CRT | 4615165-4615196 | 32 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97497-97528 | 8 | 0.75 |
CP033401_10 | 10.16|4615348|32|CP033401|PILER-CR,CRT | 4615348-4615379 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148991-149022 | 8 | 0.75 |
CP033401_10 | 10.16|4615348|32|CP033401|PILER-CR,CRT | 4615348-4615379 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530640-530671 | 8 | 0.75 |
CP033401_10 | 10.17|4615409|32|CP033401|PILER-CR,CRT | 4615409-4615440 | 32 | NZ_CP006991 | Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence | 532343-532374 | 8 | 0.75 |
CP033401_7 | 7.1|4174173|42|CP033401|CRISPRCasFinder | 4174173-4174214 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24786-24827 | 9 | 0.786 |
CP033401_10 | 10.1|4614982|31|CP033401|CRISPRCasFinder | 4614982-4615012 | 31 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86182-86212 | 9 | 0.71 |
CP033401_10 | 10.2|4615043|31|CP033401|CRISPRCasFinder | 4615043-4615073 | 31 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244716 | 9 | 0.71 |
CP033401_10 | 10.2|4615043|31|CP033401|CRISPRCasFinder | 4615043-4615073 | 31 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78566 | 9 | 0.71 |
CP033401_10 | 10.4|4615165|31|CP033401|CRISPRCasFinder | 4615165-4615195 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405905 | 9 | 0.71 |
CP033401_10 | 10.4|4615165|31|CP033401|CRISPRCasFinder | 4615165-4615195 | 31 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248363-2248393 | 9 | 0.71 |
CP033401_10 | 10.8|4615409|31|CP033401|CRISPRCasFinder | 4615409-4615439 | 31 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35770 | 9 | 0.71 |
CP033401_10 | 10.13|4615165|32|CP033401|PILER-CR,CRT | 4615165-4615196 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405906 | 9 | 0.719 |
CP033401_10 | 10.16|4615348|32|CP033401|PILER-CR,CRT | 4615348-4615379 | 32 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14984 | 9 | 0.719 |
CP033401_10 | 10.16|4615348|32|CP033401|PILER-CR,CRT | 4615348-4615379 | 32 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15014 | 9 | 0.719 |
CP033401_10 | 10.16|4615348|32|CP033401|PILER-CR,CRT | 4615348-4615379 | 32 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3485 | 9 | 0.719 |
CP033401_10 | 10.17|4615409|32|CP033401|PILER-CR,CRT | 4615409-4615440 | 32 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35771 | 9 | 0.719 |
CP033401_9 | 9.1|4592141|32|CP033401|PILER-CR,CRISPRCasFinder,CRT | 4592141-4592172 | 32 | NZ_CP030933 | Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence | 51062-51093 | 10 | 0.688 |
CP033401_10 | 10.10|4614982|32|CP033401|PILER-CR,CRT | 4614982-4615013 | 32 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86181-86212 | 10 | 0.688 |
CP033401_10 | 10.11|4615043|32|CP033401|PILER-CR,CRT | 4615043-4615074 | 32 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244717 | 10 | 0.688 |
CP033401_10 | 10.11|4615043|32|CP033401|PILER-CR,CRT | 4615043-4615074 | 32 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78567 | 10 | 0.688 |
CP033401_10 | 10.13|4615165|32|CP033401|PILER-CR,CRT | 4615165-4615196 | 32 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248362-2248393 | 10 | 0.688 |
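
The columns in the rows above appear to be related by simple arithmetic: `Spacer_region` runs from the start coordinate in `Spacer_Info` to start + length - 1, and `Identity` matches (length - mismatch) / length rounded to three decimals (for example, 7 mismatches over 32 nt gives 0.781). The sketch below checks this. The field layout of `Spacer_Info` (spacer id, start, length, contig, detection tools) is inferred from the rows themselves, and the names `SpacerInfo`, `parse_spacer_info`, `spacer_region`, and `identity` are hypothetical helpers, not part of any tool that produced this report.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class SpacerInfo:
    """One Spacer_Info cell, split on '|' (layout inferred from the table)."""
    spacer_id: str          # e.g. "9.6"; assumed to mean array 9, spacer 6
    start: int              # 1-based start coordinate on the contig
    length: int             # spacer length in nucleotides
    contig: str             # contig accession, e.g. "CP033401"
    tools: Tuple[str, ...]  # detection tools that reported the spacer


def parse_spacer_info(field: str) -> SpacerInfo:
    """Parse a cell such as '9.6|4592446|32|CP033401|PILER-CR,CRT'."""
    spacer_id, start, length, contig, tools = field.strip().split("|")
    return SpacerInfo(spacer_id, int(start), int(length), contig,
                      tuple(tools.split(",")))


def spacer_region(info: SpacerInfo) -> str:
    """Inclusive coordinate range, formatted like the Spacer_region column."""
    return f"{info.start}-{info.start + info.length - 1}"


def identity(length: int, mismatch: int) -> float:
    """Fraction of spacer positions that match the protospacer."""
    if length <= 0 or not 0 <= mismatch <= length:
        raise ValueError("need length > 0 and 0 <= mismatch <= length")
    return (length - mismatch) / length
```

Comparing `spacer_region(parse_spacer_info(cell))` with the `Spacer_region` column, and `round(identity(length, mismatch), 3)` with the `Identity` column, is a quick consistency check when re-importing these rows.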
1. spacer 3.1|1680692|40|CP033401|CRISPRCasFinder matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: , mismatch: 0, identity: 1.0
gcgctgcgggtcattcttgaaattacccccgctgtgctgt CRISPR spacer gcgctgcgggtcattcttgaaattacccccgctgtgctgt Protospacer ****************************************
2. spacer 6.1|2711110|42|CP033401|PILER-CR matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 0, identity: 1.0
tgtcacacgcagataaatccaactttcaatattgttaagttc CRISPR spacer tgtcacacgcagataaatccaactttcaatattgttaagttc Protospacer ******************************************
3. spacer 6.2|2711169|40|CP033401|PILER-CR matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 1, identity: 0.975
catggcgtagcaaaaagaaattttcaatattgctttatgg CRISPR spacer catggcgtagaaaaaagaaattttcaatattgctttatgg Protospacer ********** *****************************
4. spacer 2.1|976060|38|CP033401|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 2, identity: 0.947
cggacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *.*******.****************************
5. spacer 5.1|2497699|48|CP033401|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
6. spacer 5.1|2497699|48|CP033401|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
7. spacer 5.1|2497699|48|CP033401|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
8. spacer 5.1|2497699|48|CP033401|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
9. spacer 7.1|4174173|42|CP033401|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 7, identity: 0.833
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer acaaatgccggatgcggcgtaaacgccttatctggcctacgc Protospacer ***. *.****************.*********.******.
10. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299151 (Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
11. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY471628 (Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
12. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299131 (Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
13. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY471629 (Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
14. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299133 (Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
15. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299128 (Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
16. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299147 (Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
17. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NC_018995 (Escherichia coli plasmid pHUSEC41-1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
18. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP053235 (Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
19. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP005999 (Escherichia coli B7A plasmid pEB1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
20. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to KU932021 (Escherichia coli plasmid pEC3I, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
21. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024154 (Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
22. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NC_011754 (Escherichia coli ED1a plasmid pECOED, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
23. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015141 (Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
24. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LR213460 (Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
25. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH287044 (Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
26. spacer 9.6|4592446|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH618673 (Escherichia coli strain 838B plasmid p838B-R, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
27. spacer 10.1|4614982|31|CP033401|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer tccctatcgcaatgccggcagcatccgcaat Protospacer *. *. ****** **** ************
28. spacer 10.1|4614982|31|CP033401|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
29. spacer 10.1|4614982|31|CP033401|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
30. spacer 10.4|4615165|31|CP033401|CRISPRCasFinder matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.774
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer agcgtcaccgacgcgcagggccgctaccaac Protospacer **************** * *******.
31. spacer 10.7|4615348|31|CP033401|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ccgaacaggtggcgaagcaggtgatgggcca Protospacer ******.* **************.. ***
32. spacer 7.1|4174173|42|CP033401|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 8, identity: 0.81
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer attgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer *. * ******************.*******.*******.
33. spacer 9.5|4592385|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
tcaacgcgctcagacgttgcgtgagtgaacca CRISPR spacer acaacgcggtcggacgttgcgtgattaccccg Protospacer ******* **.************ *. **.
34. spacer 10.4|4615165|31|CP033401|CRISPRCasFinder matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttca Protospacer ***************** ***** *. ..
35. spacer 10.7|4615348|31|CP033401|CRISPRCasFinder matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
36. spacer 10.7|4615348|31|CP033401|CRISPRCasFinder matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
37. spacer 10.7|4615348|31|CP033401|CRISPRCasFinder matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ttgcgcagctggcgcagcaggtggctgccga Protospacer ..* .*.******* ************ **
38. spacer 10.7|4615348|31|CP033401|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer gggtacggctggcgaaggaggcggctgcgga Protospacer * ************* ***.***** *
39. spacer 10.10|4614982|32|CP033401|PILER-CR,CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer tccctatcgcaatgccggcagcatccgcaatc Protospacer *. *. ****** **** ************.
40. spacer 10.10|4614982|32|CP033401|PILER-CR,CRT matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
41. spacer 10.10|4614982|32|CP033401|PILER-CR,CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
42. spacer 10.10|4614982|32|CP033401|PILER-CR,CRT matches to NC_008759 (Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcg-----caattccgggagcatccgcaatt CRISPR spacer -----cgtgaaactcatttccgggagcatccgcattt Protospacer **.* ** ***************** **
43. spacer 10.13|4615165|32|CP033401|PILER-CR,CRT matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer agcgtcaccgacgcgcagggccgctaccaact Protospacer **************** * *******.
44. spacer 10.13|4615165|32|CP033401|PILER-CR,CRT matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttcaa Protospacer ***************** ***** *. ..*
45. spacer 10.16|4615348|32|CP033401|PILER-CR,CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer gggtacggctggcgaaggaggcggctgcggaa Protospacer * ************* ***.***** * *
46. spacer 10.16|4615348|32|CP033401|PILER-CR,CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ccgaacaggtggcgaagcaggtgatgggccag Protospacer ******.* **************.. *** .
47. spacer 10.17|4615409|32|CP033401|PILER-CR,CRT matches to NZ_CP006991 (Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence) position: , mismatch: 8, identity: 0.75
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer catcatcctcccgcagatgcgctggccgatcc Protospacer *.*.* .******** ******** *****
48. spacer 7.1|4174173|42|CP033401|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 9, identity: 0.786
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer gttgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer .. * ******************.*******.*******.
49. spacer 10.1|4614982|31|CP033401|CRISPRCasFinder matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 9, identity: 0.71
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer gctaccgcgcaattcgaggagcatccgctgg Protospacer . *********** .*********** .
50. spacer 10.2|4615043|31|CP033401|CRISPRCasFinder matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer tgaggcaaaatatagattgatttccgaaaat Protospacer .*.********* ******** ****
51. spacer 10.2|4615043|31|CP033401|CRISPRCasFinder matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer acggaaaaattatatattgattttacttctg Protospacer ***** *** ************* .*.
52. spacer 10.4|4615165|31|CP033401|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttca Protospacer ******.********** ***** *. ..
53. spacer 10.4|4615165|31|CP033401|CRISPRCasFinder matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 9, identity: 0.71
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacatcaccgacgcccagtggcgcgacgtcc Protospacer *.********** ********* ** .
54. spacer 10.8|4615409|31|CP033401|CRISPRCasFinder matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.71
gtttaccgccccgcagaggcgctggcagatc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcga Protospacer ******.*** *************
55. spacer 10.13|4615165|32|CP033401|PILER-CR,CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttcaa Protospacer ******.********** ***** *. ..*
56. spacer 10.16|4615348|32|CP033401|PILER-CR,CRT matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
57. spacer 10.16|4615348|32|CP033401|PILER-CR,CRT matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
58. spacer 10.16|4615348|32|CP033401|PILER-CR,CRT matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ttgcgcagctggcgcagcaggtggctgccgag Protospacer ..* .*.******* ************ ** .
59. spacer 10.17|4615409|32|CP033401|PILER-CR,CRT matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.719
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcgac Protospacer ******.*** ************* *
60. spacer 9.1|4592141|32|CP033401|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030933 (Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence) position: , mismatch: 10, identity: 0.688
tccacgctgtaacggccatcattaagtttagt CRISPR spacer ccgctgctgtgacgcccatcattaagttactc Protospacer .* .*****.*** ************* .
61. spacer 10.10|4614982|32|CP033401|PILER-CR,CRT matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 10, identity: 0.688
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer gctaccgcgcaattcgaggagcatccgctggg Protospacer . *********** .*********** .
62. spacer 10.11|4615043|32|CP033401|PILER-CR,CRT matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer tgaggcaaaatatagattgatttccgaaaata Protospacer .*.********* ******** ****
63. spacer 10.11|4615043|32|CP033401|PILER-CR,CRT matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer acggaaaaattatatattgattttacttctgg Protospacer ***** *** ************* .*.
64. spacer 10.13|4615165|32|CP033401|PILER-CR,CRT matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 10, identity: 0.688
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacatcaccgacgcccagtggcgcgacgtccc Protospacer *.********** ********* ** .
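
In the alignment entries above, the third string of each entry is a per-column match line whose internal spacing has been collapsed. Judging from the listed pairs, `*` marks an identical base, `.` marks a transition (a/g or c/t), and a blank marks any other difference, including gaps. The reported mismatch count also seems to equal the ungapped spacer length minus the number of `*` columns. The sketch below rebuilds the match line under those inferred rules. The function names are hypothetical and the symbol convention is an assumption read off the data, not a documented format.

```python
PURINES = frozenset("ag")
PYRIMIDINES = frozenset("ct")


def match_symbol(a: str, b: str) -> str:
    """Symbol for one alignment column (convention inferred from the report)."""
    a, b = a.lower(), b.lower()
    if a == b and a != "-":
        return "*"                      # identical bases
    if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
        return "."                      # transition: a<->g or c<->t
    return " "                          # transversion or gap


def match_line(spacer: str, protospacer: str) -> str:
    """Match line for two aligned strings of equal length ('-' marks a gap)."""
    if len(spacer) != len(protospacer):
        raise ValueError("aligned sequences must have equal length")
    return "".join(match_symbol(a, b) for a, b in zip(spacer, protospacer))


def mismatches(spacer: str, protospacer: str) -> int:
    """Ungapped spacer length minus identical columns (assumed definition)."""
    spacer_len = sum(1 for c in spacer if c != "-")
    return spacer_len - match_line(spacer, protospacer).count("*")
```

Applied to entry 4 (spacer 2.1), this yields two `.` columns and a mismatch count of 2. Applied to the gapped spacer 9.6 alignments, it yields 7, in line with the values listed.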
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
499447 : 508889
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP033401|499447:508889|DBSCAN-SWA AATGATTGAATTTAGCCATGTCAGCAAACTGTTTGGCGCACAAAAAGCCGTTAACGATCTCAATCTCAATTTTCAGGAAGGGAGTTTTTCGGTGCTGATTGGCACATCTGGCTCCGGCAAATCCACCACCCTGAAAATGATTAACCGCCTAGTGGAACATGACAGCGGCGTGATCCGCTTTGCCGGAGAAGAAATTCGCTCGCTGCCAGTACTGGAGTTGCGCCGCCGGATGGGCTATGCCATTCAATCTATTGGCCTGTTCCCCCACTGGAGCGTGGCGCAAAACATCGCTACCGTGCCGCAATTGCAAAAATGGTCGCGGGCGCGGATTGACGATCGTATCGACGAATTAATGGCGCTACTGGGGCTGGAGTCAAATTTGCGTGAGCGTTATCCGCATCAGCTTTCCGGTGGTCAGCAGCAACGTGTGGGAGTGGCGCGCGCACTGGCTGCCGATCCGCAAGTCTTACTGATGGATGAACCTTTTGGCGCACTGGACCCGGTAACGCGCGGCGCGTTGCAACAAGAGATGACGCGCATTCACCGTTTGCTGGGGCGCACTATTGTGCTGGTCACTCATGATATTGATGAGGCGCTAAGGCTGGCAGAACATCTGGTATTGATGGATCACGGTGAAGTGGTGCAGCAGGGGAATCCGCTGACGATGCTGACTCGTCCGGCGAATGATTTTGTCCGCCAGTTTTTTGGACGTAGTGAACTGGGTGTGCGCCTGCTTTCGTTACGTAGTGTGGCGGATTACGTGCGTCGCGAAGAACGGGCAGAAGGTGAGGCACTGGCAGAAGAGATGACGCTACGCGATGCGCTCTCCCTGTTTGTCGCGCGGGGATGCGAGGTGCTGCCGGTGGTGAACACGCAGGGCCAGCCTTGCGGCACGCTGCATTTTCAGGATCTGCTGGAGGAGGCGTAAGCGTATGAAGATGTTGCGCGATCCGCTGTTCTGGCTCATTGCTTTGTTTGTGGCACTGATTTTCTGGCTGCCTTACAGCCAGCCGCTGTTTGCTGCCTTGTTCCCACAACTGCCACGACCCGTTTATCAGCAAGAAAGTTTTGCAGCTCTGGCACTGGCTCATTTCTGGCTGGTGGGAATTTCGAGTTTGTTTGCGGTGATCATTGGCACTGGTGCCGGAATTGCTGTCACTCGCCCGTGGGGCGCGGAATTTCGCCCACTGGTGGAAACTATTGCCGCCGTTGGACAGACTTTTCCGCCTGTCGCAGTGCTGGCGATTGCCGTTCCGGTGATCGGCTTTGGTCTGCAACCAGCGATTATCGCCTTGATCCTTTACGGTGTGCTGCCCGTCCTGCAGGCGACACTTGCCGGGCTGGGAGCGATTGATGCCAGCGTGACAGAAGTTGCGAAAGGTATGGGAATGAGTCGTGGTCAGCGACTGCGTAAGGTCGAGCTACCGCTGGCGGCTCCGGTGATTCTGGCGGGCGTGCGAACTTCGGTGATTATCAACATTGGTACGGCGACGATCGCCTCAACGGTAGGGGCCAGCACGCTGGGTACGCCCATCATCATCGGGCTTAGCGGATTTAATACCGCGTATGTGATCCAGGGGGCGTTACTGGTGGCACTGGCGGCGATCATCGCAGACCGCCTGTTTGAAAGGCTGGTGCAGGCGCTTAGCCAGCACGCAAAATAAAGGTATAACCTGCGAGCATGACGCCACCAATTCCGCCTAACGCCATAAACAGGAACAGGGCGATGACCCCAATTTTAGCTATGCGCATATTGCACTCCTTATGTTAACGAAAGGATTGTACAGTAAAGCGCATTTGTTAACGAATCATTAAATGCCGAGTGGGAAAATATCATGGCCTTGTTCTTGCCAACTGGTGAGTTGCTGCTGTTGGGCGGAGGTTCGATTTTCACCGCACCACACCAGCAATGTACGGCCTTCGAATAG
TTCAGGGCGTAGTTGATTGAGCGAGTGGGCGAGGACATCAATGCGCCATCCTTGTTGACTGGCAATCCAGCCCTCCAGCCACAGACGGGTGGTATCCTGAATATTCCAGCCAACCACCAGCGCATCTTTACCCTGTTTTTTACGTGCCGAAGCCAGACAAATGGCGATGTAGTTGATCAGTACGCCGTCGAGGATCGCCAGCAGCGCCTGGAGAGTCGGTTGTTGGCATTGAAGTCGTCGGCGCAGAGGAATAAACAGATGTGTGGTGAGTGTCTGGGCGGGGTAATCCTGACCGCGCTCTTTGATCCACGTTCGCAGGCTATGCAGATTGCCGCTTTGCAGGTAGGTCAGTAATGTTTCTTGCTGATCGCGCCAGCCGTTCTGCACATCAACATTTTCATTACTGAGCAGCATTTTAACTTTGCTGACCTGCACGCCGTTGTCGATCCAGCGTTTGATCTCGCGGATACGGTCAATATCGGCATCGTTGAACAGTCGATGACCGCCGTCTGTCCGTTGCGGTTTCAGCAACCCGTAACGCCTCTGCCACGCGCGTAACGTGACAGGGTTAATATCACAAAGCAACGCCACTTCACCAATTGTGTAAAGCGCCATCGTCTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCAGTTTTGCGAACCAGGTAGTTTTGCCCGTTTTTTGTGCATCTATAGGGTGATTTTATTTTTGCCAGGCGATTTTGAGTGATCGTACTCACGAATTCTCATTTTTCTGCAAGAGTTCAAAGAAAGTTAAACGCAGGCAATGTATGTTACGCGTTTTAAAGGGAAGTGTGGTTTGCGGGTATGTACGATTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTTTTTTTAGTCATTGCGTGGTTAATGAGTAAAACGCCATTATTCATACCGTTAATGCAGGTCACGGTTCGTCTGCCGCATAAATTTCTCTGCTACATCGTCTTTTCCATCTTCTGCATCATGGGCACCTGGTTTGGGTTGCACATTGACGATTCTATTGCCAATACCCGTGCGATAGGCGCGGTAATGGGCGGCTTACTCGGCGGTCCGGTCGTCGGTGGGCTGGTTGGTCTGACCGGCGGCTTACATCGATATTCGATGGGGGGCATGACCGCGCTAAGTTGCATGATCTCAACCATCGTTGAAGGATTGCTCGGCGGCCTGGTACACAGCATCCTGATCCGTCGCGGGCGCACTGATAAAGTCTTTAACCCCATTACCGCCGGTGCCGTCACGTTCGTCGCTGAAATGGTGCAAATGCTGATCATCCTTGCGATCGCCCGACCTTATGAAGATGCGGTGCGTCTGGTGAGTAATATTGCTGCACCAATGATGGTCACCAATACCGTCGGCGCGGCGCTGTTTATGCGTATATTGCTCGATAAACGCGCGATGTTTGAAAAATACACTTCTGCTTTTTCTGCCACTGCGCTGAAAGTGGCAGCCTCGACGGAAGGCATTTTGCGACAGGGGTTTAACGAAGTGAACAGCATGAAAGTGGCTCAGGTGCTGTATCAGGAGCTGGATATTGGTGCAGTCGCGATTACCGATCGAGAGAAATTGCTGGCCTTTACCGGAATTGGTGACGACCACCATTTACCCGGCAAACCGATTTCTTCGACTTACACCTTAAAAGCGATTGAAACCGGTGAAGTGGTCTACGCTGATGGCAACGAAGTACCTTATCGTTGCTCTTTGCATCCGCAATGCAAACTGGGGTCGACGCTGGTAATTCCGTTGCGTGGCGAAAATCAGCGCGTGATGGGCACCATCAAATTGTATGAAGCCAAAAACCGTTTATTCAGTTCAATAAACCGCACGCTGGGCGAGGGGATTGCGCAACTGCTTTCGGCGCAGATTCTTGCCGGGCAATATGAGCGGCAAAAAGCGATGCTCACCCAGTCAGAAATCAAACTGCTTCACGCCCAGGTGAATCCTCATTTTTTGTTTAATGCGCTT
AACACCATTAAAGCGGTGATCCGCCGCGACAGCGAACAGGCCAGCCAGCTGGTGCAGTATCTTTCCACTTTTTTCCGCAAAAACTTAAAGCGGCCTTCGGAGTTTGTTACTCTCGCCGACGAAATTGAACATGTGAACGCTTATCTGCAAATTGAAAAGGCGCGCTTCCAGTCGCGGTTGCAGGTCAACATTGCTATTCCGCAAGAATTATCCCAGCAGCAATTGCCCGCGTTTACCCTGCAACCGATAGTGGAAAACGCCATTAAACATGGGACATCACAACTGCTGGATACAGGGCGAGTGGCAATCAGCGCCCGACGTGAGGGGCAACATTTGATGCTGGAGATCGAAGACAATGCCGGTTTGTATCAACCGGTAACCAATGCCAGTGGGCTGGGGATGAATCTGGTGGATAAGCGTTTACGTGAACGGTTTGGCGATGACTATGGAATAAGCGTCGCCTGTGAGCCTGATAGTTACACCCGAATAACGTTACGACTACCATGGAGGGACGAGGCATGATTAAAGTCTTAATTGTCGATGATGAACCGTTAGCACGGGAGAACCTGCGCGTATTTTTGCAGGAGCAGAGCGATATTGAAATCGTTGGAGAGTGTTCAAACGCCGTGGAAGGGATCGGCGCGGTGCATAAACTGCGCCCGGATGTGCTGTTTCTCGATATCCAGATGCCGCGCATCAGTGGTCTGGAAATGGTGGGGATGCTCGACCCGGAACATCGTCCGTATATTGTTTTTCTCACCGCGTTTGACGAATACGCTATTAAAGCCTTTGAAGAACATGCCTTTGATTATCTGCTGAAGCCAATTGATGAAGCGCGACTGGAGAAAACGCTGGCGCGTTTGCGTCAGGAACGCAGCAAGCAGGATGTTTCGCTGTTACCGGAAAATCAACAGGCGCTGAAATTTATCCCTTGTACGGGGCATAGTCGGATTTATTTGCTGCAAATGAAAGATGTGGCATTTGTCAGCAGTCGGATGAGCGGTGTCTACGTTACCAGCCACGAAGGGAAAGAGGGCTTTACCGAATTGACATTACGTACCCTGGAAAGTCGTACACCACTACTGCGCTGCCATCGTCAGTATCTGGTTAACCTCGCGCATTTACAGGAGATTCGTCTGGAAGATAACGGCCAGGCCGAGTTGATTTTGCGTAATGGCTTAACCGTGCCGGTCAGCCGCCGTTATCTGAAAAGCTTAAAAGAGGCGATTGGCCTGTAAAAGACTGCTAAAATGGCTTTTTGCCTCATCAACACCTGAAGGCCTCATGCTAAGTAACGATATTCTGCGCAGCGTGCGCTACATTTTGAAAGCCAATAATAATGACCTGGTGCGTATTCTGGCGCTGGGTAATGTCGAAGCCACCGCGGAACAGATCGCCGTCTGGCTACGTAAAGAAGACGAAGAGGGTTTTCAGCGTTGTCCGGACATTGTTTTGTCGTCATTCCTCAATGGCCTGATTTATGAAAAACGCGGCAAGGATGAGTCTGCTCCGGCACTGGAGCCGGAACGTCGCATTAATAACAACATCGTGCTGAAAAAATTACGCATCGCGTTTTCGCTGAAAACCGATGACATTCTGGCTATCCTCACCGAACAGCAGTTCCGCGTTTCGATGCCGGAAATTACAGCGATGATGCGCGCACCGGATCATAAAAACTTCCGCGAATGCGGCGATCAATTTTTACGTTATTTTCTGCGTGGACTGGCAGCGCGCCAGCATGTGAAGAAAAGCTAAGACGGGTATGGCGGCCATGCGAAACATGGCCGCCGACAGATTATTTCACTTCTTTAAAACCAGCGGCTTTCATCACCAGTTCCATTTGCGCCATAGTGATACCTTTTTTGGCATCTTCAGCAGAAACGTTGATTCCTGAAATACCCTGCAGGGCTTTAAAATCCACTTTTTCCATATCGATAGTCACGTTTTCCTGCGCGTAGGTATCTGTATAGGTTAATTTTTCTTCAACACCCGCGATGTTT
TTGTATTTGGCGCTTAACGGCTCAAGTGTCTTGGCAGCGTCTTCTTTGGTGGTTGCACCAATAGAAGCAAATTGAATTTTGGTTTCAGAAGATTGCTTAAGCACCTTGTCACCTTTGTAGACATAGGTAATGGCAATTTCAGTGCCGTTCAGATTGGCGCTGAATTTCTTCGATTCTTCTTTGTCACCGCAGCCAGCAAGAGAGAAAACCAGAACAGATGCAACAACGAGGGAAAACAGCTTATTGAAAGCCTTCATGTAAAACTCCATTTTATTTAATCAAGAAACTGGTGACTCTCACCAGGGGCTATATAGAATATGCCTAATACCGTGACGTGAGCAGTCCGGAACTGGAGTAGAACTCTTAGTAAAAAGCACTATTTCATCCTTGTTGCTGAAGCATGGGGAATAATTGTTCGCAAAGTAAAACACCGTTATTCATTGCTTCTACCCGTGCCTCGCTTTCTGTATTACGAAATTGTCCCAACACATGTGCCAGCCGATAAAAACCGACCGCGGAGAGGTCATTCGCCAGCAACTCTGCCTGACTAATAGCGCTCTGTTCCTGATAGCGCCAGCCGTTATGGAGCAGTTGAATAAGTAACGCCTGGCAGCGCATCAGCAACTGATGAGCGGTAGACGGCACAGGCAAAACGCTGGCAGAAGGTAGCGGTGCCACAGGCGCAGTTTCTGCGTCCAGCGCCCAGGCGCGGGTTTTTGTCATCATCACCCGTGGTTCCAGTGTCAATTGCCCATCAACAAAACTGACAAAGCCAGAAACCAGACTCACGGGGTCGTCTGTTTGTTGCAAAAGCGCCGCCATGCGTTCAACGGCAAAAGGAGAACAGGCAGATGCCGGAAGGGATAACGTCAGGAGATTATCTTCACCTTCGCCGCTGATTACCTGCGCATCCAGCGTCTGGCGGCTGCTATCCCAACCGAGCGAAATACACTCAGCGACCGGCAGAATAAATAAGTTATCGACCTGATTAAGAGGCCGTATGCAGGCGGGGGGACGCTGGCGTAAATATTCCCGCAAAGCCACAATGCCCGGCTGGCGTAACGGCGCGCTCAACATTTGCCAGGCATCAGGCGACAGCGGCACAACGCTGCTTAAGCGGTTGCGGGTAGCTAACAGCAGCTCGCCATCGGCACTGCGTTTTGCTGCTTGTGAAACAATTTGCCCACCCGCCAGTGCGCCAGCCTGAAAACTAAACAGCCGACGCGTAGCTGCCGGTGAGTTTTCCTGTTCACTTCGCGGCCAACTGCGCGAAAGGTGCAAAATACTGCCGGTGTCGGGATCGGTAAACCAGATGCGTAAACCATAATGCTCAATATCCTGCCAGCAACGCATACCTAAAGACACCAGCCGCAGATGATCAAGCTTTGCTTCTCCGGCAATGCCAGAGCCAACGACCGTGCGCCACGGCACAGGAGGAACTTCACCAACACTGTCGCGCCGGGCCATCTCTTGTGCGCAATTTAATCGACTGTTTAATGCCGCAAGCTGACGTAAGCATTCTCCGGCATGATAGTGGCTGGCGCGGACGTGGAAGGCATCAACGCTTGCCCGTAGCTGTCGTAGTGATTCACTCACCCATCGCCAGTTGCAGCGTTCCGCCGCCTGCTGCGCGCGACTGAAAGCGGCCTCGTAGTGAATAAGCGGCTGGCTGATGCCGCCCAGCCATAATGCCTGGCTTAATTGCTGAACATATTGACGACACGCGTTACCCTCGTCGTTGGCAAACGGATCGTCAGATGATGTGACGTGTTCGCTGCGCATCTGCCAGATTAAATGAGTAAATTCCGCTTGCTGAGTTTTGGCCTCGACGAAGGCCTGCACAGCCAGTACGACATGTTCGCAAAGTGTGCCTTCAATACAATCACAACGGGCGAAACGAATACTGCTGCGGGAATAAAAACGCACATCGCTCATCGGTAAGCGGGCAGAGGGAATTTCGCCCGGCGTACAGAACAACTCAATGGTGATGCCTTTACCGA
CCAACGCCTGTGCGCGTTTGCGGGTGGCATCGGGCAGGGTAGCCAGTTCTTCCAGCCAGATTGCCGGATCCCACGCTTCTTCTTTTTCCGTAGGCTGGGCGGTAGTACAAAGTCGTTGATAACTTAACACAAGCATCACGCGATGACGGCACATACCGCTGGCCCCGCAGCTGCACTGAGCCTCTTTCAGTGCCTGGCCGTTCGCCAGCTGGGTACGGACACCGTCACTGAAGGTGGCGATTAAAGCGCTGTTCTCATGGCTGATTTCCGGGACGTTGCCATTTTCCAGTTCCTTAAGACTGCGCTTAACAAAACCGGCATTGCTTAACGCCGTCAGGGCCTGCGGTGTCAGTTCTAATAATTCCGGACGTAGTGAATTCATGACTGAAGATTCTCCGCAAGCCATGATGCCAGCTCGCCCGGTGTCATGGCGGCTATTTGTGCGCCGACATTTACCAGCGCCTGGGCCGTATCGTGGTCATAGCAAGGTGTTGCTGTGCTATCGAGCGCTGCCAGTCCCAGCACTTTGATGCCGCTCTGGACACACTTTTTCACCTGATGCGTCAGTAATGATGATGAACCCCCTTCGTAAAAATCGCTCACGAGGATAATGACGCTTTTCGCTGGTTGTTCAATAAGTTGCCGACCATACTCCACGGCACTGGCGATATTGGTCCCGCCGCCCAACTGTACTTTCATTAATAACTCTACCGGATCGGCAACGTCTGCCGTGAGATCAACGACGCTTGTGTCAAACGCCACCAGATGGGTACGAATGCCGGGTAACTGCCACAAACAGGCCGCCATCACCGCAGAGTGGATCACCGAGTCCACCATCGATCCGCTTTGATCAACCAGTAAGACCAGTTGCCATTGTTCGCTTTGGCGTTTAATGCGGCTGTTAAAGCGGGGGGATTCGATATACAACTTGCCGTGTTGCGGGTGCCAGTGTTGCAGGTTGGCGCGCAGAGTACTTTTGAAATCAAAGTTTCGCGCCAGTGGAATTAATGAGCGGCGACGGCGATCGCGGACACCAGAAAAAGCCTGACGAACTTCCTTTGCCAGTCGCGCCATAATTTCTTCAACAACCTGGCGCACTATCTGGCGGGCGGCAGCCAGTACTTCGGGATTCATCAGATGTTTGGTGTGCAAAACGGCGCGTAGCAGGCTTTCCGAAGGCTGCATACGTTCCAGCACGTCGAGATTTGTCACCACATCTTCAATGCCGTAGCGCAGTACGGCATCGCTTTCCAGCCGCTCAATCACCTGTTGCGGAAACAGCGTGTGGATACTGTTGATCCACTCAGGAGTGGTGAGATTTGAGCCACCTAATCCACCAGAGCGTTCACCACGCTGGAGCCGTTCAGGATCGCGCCCATACAGCCACTCCAGCGCGTGATCTATCTGCCGGGCGTTGTCATCCAGCCCACAAAGCGTCGTTTCTGCCGCTTCGCCAAGAATTAATCGCCAGCGTTGTAGCTCACGGGTGGTCAGAAGATCGTTCAGTTCAGACAT
Protein sequences of DBSCAN-SWA_1 >CP033401|499447:508889|502210_503896_+|AYQ00484.1|DBSCAN-SWA MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >CP033401|499447:508889|505169_505631_-|AYQ00487.1|DBSCAN-SWA MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLTYTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK >CP033401|499447:508889|507752_508889_-|AYQ00489.1|DBSCAN-SWA MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSLIPLARNFDFKSTLRANLQHWHPQHGKLYIESPRFNSRIKRQSEQWQLVLLVDQSGSMVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDHDTAQALVNVGAQIAAMTPGELASWLAENLQS >CP033401|499447:508889|501257_501989_-|AYQ00483.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI >CP033401|499447:508889|499447_500374_+|AYQ00480.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGVIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMALLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSELGVRLLSLRSVADYVRREERAEGEALAEEMTLRDALSLFVARGCEVLPVVNTQGQPCGTLHFQDLLEEA 
>CP033401|499447:508889|500378_501110_+|AYQ00481.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRLRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK >CP033401|499447:508889|504658_505129_+|AYQ00486.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKS >CP033401|499447:508889|505755_507756_-|AYQ00488.1|DBSCAN-SWA MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENSALIATFSDGVRTQLANGQALKEAQCSCGASGMCRHRVMLVLSYQRLCTTAQPTEKEEAWDPAIWLEELATLPDATRKRAQALVGKGITIELFCTPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKTQQAEFTHLIWQMRSEHVTSSDDPFANDEGNACRQYVQQLSQALWLGGISQPLIHYEAAFSRAQQAAERCNWRWVSESLRQLRASVDAFHVRASHYHAGECLRQLAALNSRLNCAQEMARRDSVGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTDPDTGSILHLSRSWPRSEQENSPAATRRLFSFQAGALAGGQIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWQMLSAPLRQPGIVALREYLRQRPPACIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGEDNLLTLSLPASACSPFAVERMAALLQQTDDPVSLVSGFVSFVDGQLTLEPRVMMTKTRAWALDAETAPVAPLPSASVLPVPSTAHQLLMRCQALLIQLLHNGWRYQEQSAISQAELLANDLSAVGFYRLAHVLGQFRNTESEARVEAMNNGVLLCEQLFPMLQQQG >CP033401|499447:508889|501090_501198_-|AYQ00482.1|DBSCAN-SWA MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG >CP033401|499447:508889|503892_504612_+|AYQ00485.1|DBSCAN-SWA MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL |
10 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
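The keyword rule in the legend above can be illustrated with a short sketch. The keyword list is copied from the legend. Everything else is an assumption made for illustration and is not confirmed by the source: the helper names `matched_keywords` and `is_highlighted`, and the choice of case-insensitive substring matching.

```python
# Keywords copied from the legend; the matching rule below is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation: str) -> list[str]:
    """Return the phage-related keywords found in an annotation string.

    Hypothetical helper: case-insensitive substring matching is assumed
    here; the tool that produced this report may match differently.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def is_highlighted(annotation: str) -> bool:
    """True if the annotation contains at least one phage-related keyword."""
    return bool(matched_keywords(annotation))


if __name__ == "__main__":
    # Example annotations; only "integrase" appears in the report itself.
    for label in ("integrase", "tail fiber protein", "hypothetical protein"):
        print(label, "->", matched_keywords(label))
```

Plain substring matching is the simplest reading of the legend, but it can over-match (for example, 'head' or 'coat' inside an unrelated word), so a stricter word-boundary match may be preferable in practice.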
DBSCAN-SWA_2 |
1470551 : 1481329
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP033401|1470551:1481329|DBSCAN-SWA GTCATTTTGCTAATAACACTTTTCTGGCTTCATCGACTTTATTTGGAGCATCGAACCAATATTCGGCGAGCAAAGGGCAGATCTCTGTATCAACGACTTGTTCATACCAAGCCTGGGCATCGTTGATTTTTTGGCCGATAGCTGGGGTGACATAGCTGTGACCGATACAAAACTGTGGTCCTAAGGTAACATCTTTTGCCAGCATATCGTTAAGTACCGTCAGTCGAGATTTGATGAATGCTAACATGTCAAAATCAATTGCGTAATTATAATTTACCCAGTTTCTCCACGCGTCATTAAAAGCTGGCTTTAGATCGATAAACGCGAAACGACGGCGCAGCGCTAGGTCAAGTAGTGCGAGTGAGCGGTCTGCAATATTCATCGTACCGATGATATATAGATTCTCAGGAATGTATATTTTTTCATCATCATTTTTAGGATAGGAAAGAGATAATGCCTCAGTCGGCGTGCGTTTATCTGCTTCCATCAACGTGAGTGTTTCACCGAAGATTTGCGCCGGATTGCCACGATTAATCTCTTCAATTATCACCACATATTTCGAGGTAGGATTATTGACTGCAGTTTTGATTGCATTTACAAAAGGTCCATCAATTAGCGTCAATTGCCCTTCTTTACCTGGACGCCAGCCGCGAATAAAGTCTTCGTAAGAGAGGTTCGGGTGAAATTGCACCGCGCTAATACGCTCAGGTGCTTTTTCTCCCATCAAGCAGTACGCCAGACGTCGCGCTAACCAGGTTTTTCCAGTTCCGGGTGGTCCTTGTAATATCAGGTTTTTCTTGTCGATCAGGCGCTGAAGTGTGAGTTGGATCTTAGCCTCTTCCAAGAAACAGCCATCCTGCACCAGATGACTGATGTCATAAGGAACGTGAGTGAGTTTTGGCAACGGCGCACTCTCTTCGACAGTTTCCTCAGTTATCGTCTGGGATTCATTTTTCTCAAAATTCAGGTATTCATAATTTCCGGACTCAAGGGACTTTAACTCATCATCATTGCATAGTTTCTGTAGATAAAAGTAGATTGAGCTTGTAACTGTTTTGCTCTCAGGATGCTCTACCTTGAATACGTCCAAGTAGTTTTCTTCCAACTCCGAAGCAGTGAAATAAGGGCCATTTTTTTTCAGGCATAACGCCTTAATTTTATTCAGTAAAGAGGCTTTCCACGTTCTATTTGCCACCTCATCTTTAGACTGGCTAAGATCTGTATTCCACGCTGATAAAGAAAGTTCTGGGAAAGAATGAACCGGATAGTTTGGTTGGGTAAAGACCTCGTTCAGCGCCCGCATAAGACTCAGGTAACTTTGTCCACTACAGCGACCTTTTGCCCCGTTTTTAATGATCTTAATGTTTAACACTGTCTGAATGTAATACTGCGACTGGCTATCTAAAGTAGGGTAGAACCAAGGACGGGTCCAGTACAACCCCATGGTGAGATTCCAACCGACATTCATTACTGTAGAAGCAATGTCATATGCGGCAGTGAAGTCTGCAGAGTTAGTATTCTGGTTATCCGCAAAGGTCATTGCCTGCGAGAACATTTCCCACAAGCATTCAATGTCATTAGGGTCGCGTGACTTTTCATAACCAAAGAACCAAGATTTTTGGTTATTCAACAGCGGGATTCCGGCAAAGGAGTCAGGAATCGGTTCGTTCACGCCCAACAAATTCGCTAGCTTGGCAGCAATAATTTTGCGATTGCTGTCGGTCAAGTTACGATTGAACAAGCCCATAGTAGTAAACGGACAAATGTCTTTTAAGGGAAAGATCTCTCCCATAATAGATTTGTCCTGCAGATGGGACATTCCTTCCACACCTGAGGCAATTAGATGAATACCTTTGACTAATTCATCTCTACGATTTCGCCAAGTCAGTAACGCGTTGGCAAAAGCCTCATAAAAACTAGCCCAAG
CAAATTTGCCATCATGTTCTGCTGTATCCACGGGAACTCCATTATACTTGTTGAGCAATGATAATTTATCTGTGTGAGTTTTAACATATTACTATCAGTTATAGAAAAATTTAACTACCCGATACAGAGAGCGGCATGCTGAATTTGACCTGACTTGCTTCCAACTAATTAAAATCAACTTATTTATCAATTGGTTATTTTGGCGCATAGCGGTCATCAAGGGAATATCGCGTTGTCATAAGGTGTCGAGGCTCGGAGGTTCAAATCCTCTCATGCAAAAAATAAATAAAATTAATGACGGTTGGAAATTATTCAATACATACACTCTCGAAAGTGCATCAGCCAACCGCAGCACGTCTTGCATACGGCGTGTCTGCAGTTTTATATAATCCTGGCTGGAAACCTCTTATACAAAGTAGATACACCAATATCATAGATGATCGCCACCTTCTGACGCGGGACTTTTGATGCAATTAATCGCCCGGCCTGCGCCCATTGTTCTGATGTAAGTTTAGGACGACGTCCACCAATTCCTCCCTGTGCGCGAGCATCTTCCAGTCCAACTTTTGTCAGTCATCAATCAGTTCACGCTTCATTTCAACCAGGGCTCCATCACATGGAAAAATAAAAAAAACCATCGATGTTGACGTGTCAATGTTACCTGTCAGGCTACGGAAATCCCCCCCTTACCCGCAGCTACTCGGTAAGTATAATAAGATGTTTCCTGCTCCTTCCTAACCTGTCCAGTTTCCAGACAATTAAAGTATCACCTTATATAAGAGACTGCAGAGCGCACTTTAACCTAGGTCGTTCTGAAATAGTTGCAGATTCGATTAATCAACGATTATACTCCCCGGCACTCCAGAGGATCTGGTAAACATAAAGTCAGAGCTTGGGTATACTGGCACACAAATGGCAGATCTTGCAGGTGCAGCCAGTCATAGCCAGTGGCGAAAATACACGAGTGGTTCTGAGCCCCGCGCCATGTCATCACATATCTTGTTTTTTTATTGCTACCAATCTGACTTTGAGTACTAATGAGCTAGATAGAATTGTTGGTAATGACTCCAACTTACTGATAGTGTTTTATGTTCAGATAATGCCCGATGACCTTGTCATGCAGCTCCACCGATTTTGAGAACGACAGTGACTTCCGTCCCAGCCTTGCCAGATGTTGTCTCAGATTCAGGTTATGTCGCTCAATGCGCTGAGTGTAACGCTTGCTGATAACGTGCAGCTTTCCCTTCAGGCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATACCACGACCTCAAAGGCCGACAGCAGGCTCAGAAGACACTCCAGTGTGGCCAGAGTGCGTTCACCGAAGACGTGCGCCACAACCGTCCTCCGTATCCTGTCATACGCGTAAAACAGCCAGCGCTGACGTGATTTAGCACCGACGTAGCCCCACTGTTCGTCCATTTCAGCGCAGACAATCACATCACTGCCCGGTTGTATGCGCGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAACCGTGTTGAGGCCAACGCCCATAATGCGTGCACTGGCGCGACATCCGACGCCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGCTGAGAGGCGGTGTAAGTGAACTGTAGTTGCCATGTTTTACGGCAATGAGAGCAGAGATAGCGCTGATGCCCGGCAGTGCTTTTGCCGTTACGCACCACGCCTTCAGTAGCGGAGCAGGAAGGACATCTGATGGAAATGGAAGCCACGCAAGCACCTTAAAATCACCATCATACACTAAATCAGTAAGTTGGCAGCATTACCTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAACGCCTTTACCTGATTTAGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCCGATTCGGGTAATGTTGACCATTCACTGACCACATTATTGATGCCGATATTCCG
CCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTTAATTCGGAAAGTTGCTCGTTGCTAACCCAATCCAGTTTTTTATTAAAGCCATATGTTTTGCGCATGACAGCCAGAAAGACCAGAAGCTGGTGCTGTGTTAATCCGGCCAGCATCACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCCGCTCCTTTTGTGCCACATCCGGCACAGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCCGCAGACCGACTTTCGCCATTTTTGAACCTGTCATATTGCCCCCAGCATGGTGGTGACCATCGCCATCAATGGACCAGCCAGATCCGGGTCCACTCGAAACATCGACACAATGCCTTCACTCATCTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACCGCCTGCTTTGCCTCACAGAGTTCCTTTTCCATTTCAGCCAGCCGAGCCATGAAGCTATCCTGCTCAACCAGGTGGCCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATAGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAGGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTAGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACCGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATTGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTGAAACAGAAAGCCTCTGAGCAGAAGGTGGCTGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGAGATGATGCGTGGAACAAATTACGACTCGGCGTCATCACGGCTTCAGAAGTTCACAATGTGATAGCAAAACCCCGCTCCGGTAAAAAGTGGCCTGACATGAAAATGTCCTACTTTCACACCCTGCTGGCTGAGATTTGCACCGGTGTGGCTCCGGAAGTTAACGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAATGACGCCAGAGCCCTGTTTGAGTTTACTTCCGGCGTGAATGTTACTGAATCCCCGATCATCTATCGCGACGAAAGTATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAGCTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAGTTCCGGCTCGGTGGTTTCGAGGCCATAAAGTCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACACGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGTATGAAGCGTGAAGGACTGCATTATGTCGTGGTTGAGCGGGA
TGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGCGCGATCACTTTCGTCTACTCCATTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCACTGAAAGCACAGCGGCTGGCTGAGAAGATAAATAATAAACAGGAGGATATATGAGTCAGGTTGGTAATCATTCATTCGAATTTCCGGCATCGCAAGGTGTACAGGGTGGTACTGTTACACTCTTCCTTACCATACCAGGAAGATCGCTGGCTCGTTTCCTCGCTTCAGATAATTACGGCCATACACTGGAACGCTCTCAGCGAGAAATTAATCCAAATCGAGTACGAAAATTTTTAAATTATCTCACTAACGCAGACTCAAGAAATGAGTCTTTTATCATTCCCCCTCTCGTAGGTAACTGTGATTCGAATATAGAATTTGTACCGTTTGGCAACACAAATGTTGGTATAGCCAGAATTCCCCTCGACGCCGAAATAAAACTTTTTGATGGCCAACATCGTGCAGCTGGCATTGAGATATTTTGCCGAAGTTCCCCATCAACGCTCATGGTTCCCATGATGCTTACAATGAATCTGCCGCTAAAAACCCGGCAGCAGTTCTTTTCGGACATAAATAACAACGTTTCTAAGCCATCAGCGACCATCAATATGGCGTATAACGGCCGGGATGATATTGCTCAGGGAATGATATCCTTCCTGACCCAACATACTGTATTTGCCGATATAACCGATTTTGAACACAACGTAGTGCCATTAAAAAGTAATATGTGGGTGAGTTTCAAGGCACTCACTGATGCAACGTCAAAGTTTGCTAGGAACGGCAATCAACAACTTGAAATGGGATATATAGAATCTGTCTGGGAGGCATGGATTACACTAACTCAGATTGACTCAATCCGACATGGTGTACACCACGCTACGTACAAGCGCGATTATATTCAGTTCCATGGAGTAATGATTAACGCTTTCGGTTTTGCGGTTCAACAGATGATGGTTAATCATTCCATCGCAGAAATAACTTCTATGATCGAAAAACTCTGTGCAACTACCAGCTCTGCAGAAAGAGAGGATTTTTTTCTGATGGATAACTGGGCGGGGATCTGCACGAAAGCCAGCCAGGAAAAACTATCGGTTATTGCCAATGTGGCAGCGCAGAAAGCAGCAGCAAACAGACTGATACAAGCTTTTACCAAAGGAAGTCTGGAAACAACTTAATGAATCAACATTGTCTCATATCAGCATGCTGTACGGCGTCTTTAAGGAACGGTGAGCATGAAAAACAAAATCATCATGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATCATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGCTAGATGAACGAAGACGCCTGGCCGTTCAGGGTTGTCGGACTTGTGCAAGTTGCCAGGAGGAGATCGAACTTAAGAACAAACAATGGGGACTGTGATGGCCTCAAAGCAGCAAATTTCAACATCGTCCAACTGAGGTGTAAAAATGTTCAGAATCATTTTTCCTAACACCTGGTACGTCGACCACCACGGCACTCCCTGCAAAATCCTGCGTTCTACCCACAACAAAGTTCACTACATCCGAAAAGGCAGAACATGTATCGCCAGCATGTTCCGCTTTAATCATGACTTTGAACCTGTGAATAAAGCTGATGCAGATCGGATAGCAGAAGAGATCGAAACGGCAGAACACATTAAGAAGTTACGTGCCATACGCAGGAAATAGAAAAATTGATAAATTCAAT
ACTGCATTTCTCAGCATTAAATTTATCTCTATGACCAGTCAAGAGATGTACCTGCCATGAGCTTAATATCATGTCAGATATATCGGTCACAAACTCCCTCAGCAGCTAAGAGGAGGACAAATGTCTCGACTAATCACTTTACAGGACTGGGCTAAAGAAGAATTTGGGGACTTAGCACCAAGTGAGCGAGTTCTGAAAAAATACGCGCAAGGGAAAATGATGGCCCCACCCGCTATAAAAGTTGGTCGCTACTGGATGATTGACCGAAATTCCCGTTTTGTAGGAACGCTGGCAGAACCGCAACTCCCAATAAACGCAAACCCAAAACTCCAACGGATAATCGCTGATGGCTGCTAGACCCCGATCTCACAAAATCTCTATACCCAATTTATATTGCAAATTAGATAAGCGAACCGGAAAGGTATATTGGCAATACAAACATCCACTATCCGGTCGTTTTCATAGCTTAGGAACTGATGAGAATGAAGCAAAACAAGTTGCTACTGAAGCAAATACCATTATTGCTGAACAACGTACCAGACAAATATTAAGCGTCAATGAGCGTCTGGAAAGAATGAAAGGCAGGCGCTCAGACATTACGGTGACAGAATGGCTTGATAAATATATTTCTATCCAGGAGGACAGGCTGCAACATAATGAACTAAGACCCAACTCCTATCGGCAAAAAGGCAAACCCATTCGTCTTTTCCGTGAGCATTGTGGAATGCAACACCTCAAGGATATTACCGCACTTGATATTGCCGAAATAATTGATGCTGTAAAGGCTGAAGGTCATAACAGGATGGCGCAAGTCGTGAGAATGGTGTTGATCGACGTCTTCAAAGAAGCACAACACGCAGGACATGTTCCGCCAGGATTTAACCCAGCGCAGGCAACAAAACAACCGCGAAATCGAGTAAACCGCCAAAGATTGTCACTGCCCGAATGGCAGGCAATATTTGAAAGCGTAAGCAGACGGCAGCCCTATTTAAAATGCGGCATGCTACTTGCTCTTGTTACTGGACAACGTTTAGGCGATATCTGCAATTTGAAATTCTCTGATATATGGGACGACATGTTGCACATTACTCAGGAAAAAACCGGTTCAAAACTTGCTATTCCGCTTAACCTGAAATGCGATGCTCTGAATATTACCCTTCGTGAAGTTATATCTCAGTGCAGGGATGCTGTTGTTAGTAAATATCTGGTCCATTACCGTCACACTACCTCTCAAGCAAACAGAGGAGACCAGGTTTCTGCAAATACTCTGACAACGGCTTTTAAAAAGGCCAGGGAAAAATGTGGCATAAAATGGGAGCAAGGAACTGCGCCCACATTTCATGAGCAGCGATCTCTGTCAGAACGGTTATATCGGGAACAGGGTCTGGATACGCAAAAGTTGTTAGGCCATAAATCCAGAAAAATGACCGACCGATACAATGATGATCGTGGTAAAGACTGGGTTATCGTAGATATCAAAACAGCATAGAAAATAGCCAGTTTTGGGGAAGGGTTTTGGGGAAAGTTTTGGGGAAGATTTTACATCATCATAAAACAACGGGCGTATAACACGCCCGTTTCAATATTTAACACATGTAGAGATTACATGTTCTTGATGATCGCATCACCAAACTCTGAACATTTCAACAGTTTAGCGCCTTCCATCAGACGTTCGAAGTCATAGGTTACGGTCTTCGCATTGATTGCGCCTTCCATACCTTTAACAATCAGGTCTGCGGCTTCGGTCCAACCCATGTGGCGTAACATCATCTCAGCAGAGAGAATAATAGAGCCTGGGTTTACTTTGTCCTGACCGGCATATTTCGGCGCAGTACCGTGGGTGGCTTCAAACAGGGCGCATTCGTCACCGATGTTTGCACCTGGGGCGATACCGATACCGCCAACCTGCGCTGCCAGGGCGTCAGAAATGTAGTCACCGTTCAGGTTCATACAGGCGATAACATCATATTCAGCCGGACGCAGCAGGATCTGTTGCAGGA
ATGCATCAGCAATCACGTCTTTAATGACGATCTCTTTGCCGGTGTTCGGGTTTTTAACTTTCAGCCACGGGCCGCCGTCGATCAGTTCACCGCCAAACTCTTCACGCGCCAGCTGGTAGCCCCAGTCTTTAAACGCTCCTTCGGTGAACTTCATGATGTTGCCTTTGTGCACCAGAGTCACAGAGTCACGATCGTTAGCAATTGCGTATTCGATCGCTGCACGAACCAGACGTTTGGTGCCTTCTTCCGAACACGGCTTAATACCGATACCACAATGTTCCGGGAAGCGAATTTTCTTCACCCCCATCTCTTCACGCAGGAATTTAATCACTTTCTCGGCGTCGGCAGAGTCTGCTTTCCATTCGATACCCGCATAAATGTCTTCCGAGTTTTCACGGAAGATAACCATATCGGTCAGTTCAGGGTGTTTAACCGGGCTTGGAGTGCCCTGATAGTAACGTACCGGACGCAGGCAGATGTAGAGATCCAGTTCCTGGCGCAGGGCAACGTTCAGAGAGCGAATACCGCCACCAACAGGAGTGGTCAGCGGACCTTTAATGGCAACGCGATATTCACGAATCAGATCAAGGGTTTCAGCAGGCAGCCAGACATCCTGACCATAAACCTGTGTGGATTTTTCACCGGTGTAAATTTCCATCCAGGAGATTTTACGCTCGCCTTTATAGGCTTTCTCGACTGCAGCGTCGACCACTTTCAGCATGGCTGGGGTTACATCTACACCGATTCCATCACCTTCAATGTAAGGGATAATCGGATTTTCAGGAACGTTGAGTTTGCCGTTTTGCAGGGTGATCTTCTTGCCTTGTGCCGGAACAACTACTTTACTTTCCAT
Protein sequences of DBSCAN-SWA_2 >CP033401|1470551:1481329|1474871_1475411_-|AYQ01340.1|DBSCAN-SWA MQLLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGHLVEQDSFMARLAEMEKELCEAKQAVILNAPRHQKLKEMSEGIVSMFRVDPDLAGPLMAMVTTMLGAI >CP033401|1470551:1481329|1475901_1476582_+|AYQ01342.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWNKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEICTGVAPEVNAKALAWGKQYENDARALFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP033401|1470551:1481329|1478822_1479965_+|AYQ01348.1|integrase|DBSCAN-SWA MAARPRSHKISIPNLYCKLDKRTGKVYWQYKHPLSGRFHSLGTDENEAKQVATEANTIIAEQRTRQILSVNERLERMKGRRSDITVTEWLDKYISIQEDRLQHNELRPNSYRQKGKPIRLFREHCGMQHLKDITALDIAEIIDAVKAEGHNRMAQVVRMVLIDVFKEAQHAGHVPPGFNPAQATKQPRNRVNRQRLSLPEWQAIFESVSRRQPYLKCGMLLALVTGQRLGDICNLKFSDIWDDMLHITQEKTGSKLAIPLNLKCDALNITLREVISQCRDAVVSKYLVHYRHTTSQANRGDQVSANTLTTAFKKAREKCGIKWEQGTAPTFHEQRSLSERLYREQGLDTQKLLGHKSRKMTDRYNDDRGKDWVIVDIKTA >CP033401|1470551:1481329|1478217_1478457_+|AYQ01346.1|DBSCAN-SWA MFRIIFPNTWYVDHHGTPCKILRSTHNKVHYIRKGRTCIASMFRFNHDFEPVNKADADRIAEEIETAEHIKKLRAIRRK >CP033401|1470551:1481329|1476578_1476737_+|AYQ01343.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSITKRGWVFPGLSVIRNPLKAQRLAEKINNKQEDI >CP033401|1470551:1481329|1470551_1472507_-|AYQ01339.1|DBSCAN-SWA 
MDTAEHDGKFAWASFYEAFANALLTWRNRRDELVKGIHLIASGVEGMSHLQDKSIMGEIFPLKDICPFTTMGLFNRNLTDSNRKIIAAKLANLLGVNEPIPDSFAGIPLLNNQKSWFFGYEKSRDPNDIECLWEMFSQAMTFADNQNTNSADFTAAYDIASTVMNVGWNLTMGLYWTRPWFYPTLDSQSQYYIQTVLNIKIIKNGAKGRCSGQSYLSLMRALNEVFTQPNYPVHSFPELSLSAWNTDLSQSKDEVANRTWKASLLNKIKALCLKKNGPYFTASELEENYLDVFKVEHPESKTVTSSIYFYLQKLCNDDELKSLESGNYEYLNFEKNESQTITEETVEESAPLPKLTHVPYDISHLVQDGCFLEEAKIQLTLQRLIDKKNLILQGPPGTGKTWLARRLAYCLMGEKAPERISAVQFHPNLSYEDFIRGWRPGKEGQLTLIDGPFVNAIKTAVNNPTSKYVVIIEEINRGNPAQIFGETLTLMEADKRTPTEALSLSYPKNDDEKIYIPENLYIIGTMNIADRSLALLDLALRRRFAFIDLKPAFNDAWRNWVNYNYAIDFDMLAFIKSRLTVLNDMLAKDVTLGPQFCIGHSYVTPAIGQKINDAQAWYEQVVDTEICPLLAEYWFDAPNKVDEARKVLLAK >CP033401|1470551:1481329|1478596_1478833_+|AYQ01347.1|DBSCAN-SWA MSRLITLQDWAKEEFGDLAPSERVLKKYAQGKMMAPPAIKVGRYWMIDRNSRFVGTLAEPQLPINANPKLQRIIADGC >CP033401|1470551:1481329|1480078_1481329_-|AYQ01349.1|DBSCAN-SWA MESKVVVPAQGKKITLQNGKLNVPENPIIPYIEGDGIGVDVTPAMLKVVDAAVEKAYKGERKISWMEIYTGEKSTQVYGQDVWLPAETLDLIREYRVAIKGPLTTPVGGGIRSLNVALRQELDLYICLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIYAGIEWKADSADAEKVIKFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAIANDRDSVTLVHKGNIMKFTEGAFKDWGYQLAREEFGGELIDGGPWLKVKNPNTGKEIVIKDVIADAFLQQILLRPAEYDVIACMNLNGDYISDALAAQVGGIGIAPGANIGDECALFEATHGTAPKYAGQDKVNPGSIILSAEMMLRHMGWTEAADLIVKGMEGAINAKTVTYDFERLMEGAKLLKCSEFGDAIIKNM >CP033401|1470551:1481329|1477951_1478170_+|AYQ01345.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPLDERRRLAVQGCRTCASCQEEIELKNKQWGL >CP033401|1470551:1481329|1475593_1475905_+|AYQ01341.1|DBSCAN-SWA MPLLCCEATYIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKASEQKVAA >CP033401|1470551:1481329|1476733_1477798_+|AYQ01344.1|DBSCAN-SWA 
MSQVGNHSFEFPASQGVQGGTVTLFLTIPGRSLARFLASDNYGHTLERSQREINPNRVRKFLNYLTNADSRNESFIIPPLVGNCDSNIEFVPFGNTNVGIARIPLDAEIKLFDGQHRAAGIEIFCRSSPSTLMVPMMLTMNLPLKTRQQFFSDINNNVSKPSATINMAYNGRDDIAQGMISFLTQHTVFADITDFEHNVVPLKSNMWVSFKALTDATSKFARNGNQQLEMGYIESVWEAWITLTQIDSIRHGVHHATYKRDYIQFHGVMINAFGFAVQQMMVNHSIAEITSMIEKLCATTSSAEREDFFLMDNWAGICTKASQEKLSVIANVAAQKAAANRLIQAFTKGSLETT |
11 | Enterobacteria_phage(40.0%) | integrase | attL 1468524:1468547|attR 1480032:1480055 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_3 |
1610136 : 1654789
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP033401|1610136:1654789|DBSCAN-SWA GTTACTTACGGTCCGTAAACGGGCTGCCCGGACAGGGAATCGATAACTGCTCTCCCATTTTATCCTCTTCAAGCTGGTGCTTTATGTAATCCTGTATCTTCGCCGTGTTCTTACCCACTGTATCGACGTAGTCCCCCCTGCACCAGAACTCCCTGTTCCTGTATTTGAATTTCAAATCACCAAACTGCTCGTAAAGCATCAGACTGCTTTTCCCTTTCAGATATCCCATAAAGCCGGATACGCTCATTTTGGGCGGGATCTCCACAAGCATATGGATATGATCTGCACAGCATTCAGCTTCCAGAATCCGTACACTTTTCCACTCACACAGCTTTCTCAAAATACTGCCTGTTGCTCTACGCTTCTCTCTGTAGAACACCTGTCTTCGGTATTTTGGCGCAAAAACTATGTGATATTTACAGTTCCATCGGGTGTGCGCTAAGCTCTTTTCGTTCCCCATTTGAACCCCTTTTGATTTCTTGTTTGACTCTTGCAGTTGCCAGACCGCAAGGTGTTTTAACAAATCCGAGGATCTTAGTATGAATATGGAAGAAATTGTGGCCCTTAGTGTAAAGCATAACGTCTCGGATCTACACCTGTGCAGCGCCTGGCCCGCACGATGGCGTATTCGCGGGAGAATGGAAGCTGCGCCGTTTGACGCGCCGGACGTCGAAGAGCTACTGCGGGAGTGGCTGGATGACGATCAGCGGGCAATATTGCTGGAGAATGGTCAGCTGGATTTTGCTGTGTCGCTGGCGGAAAACCAGCGATTGCGCGGCAGTGCGTTCGCACAACGGCAAGGTATTTCTCTGGCGTTACGGCTGTTACCTTCGCACTGCCCGCAGCTCGAACAGCTTGGCGCACCACCGGTATTGCCGGAATTACTCAAGAGCGAGAATGGCCTGATTCTGGTGACGGGGGCGACGGGGAGCGGCAAATCTACCACGCTGGCGGCGATGGTTGGCTATCTCAATCAACATGCCGATGCGCATATTCTGACGCTGGAAGATCCTGTGGAATATCTCTATTCGGGGAGCTACACGCGACAACCAGGAATGCAGCCGTAACTGCAGCAACGACGGGCAAAATGCGCATGGGATTTTCCTTGCTGTATTTTTGTTAAGTGTAGATGACAACAGGAAAAAAAGAGAAAGAAAGGAGGCCCAATATCCTGGGCCTCATCGTCAGTTATTGCAGCTTTTCAAGAATGCGCCAGGCCGCCTCGACACGGACAGGGTTAGGATAGCTTTTGTTTGCCAGCATCACGATGCCAAGGTTTTTTTCTGGAACGAAGGCTACGTAGCTGCCAAATCCACCAGTGGAGCCCGTTTTATGCACCCATGAGGCTTTCACTGCGGGGGCGGGCGGGTTTACCTCAACGGCGGGAAGCGCTGCCAATGCCACTTTGCTGTCGCTGCCGTTGATGATCGAATCAGCTTTCAGCGGCCAGTTCAGCATCTCCCAGCCTAATCCCTGGTACATATCGCCAATACGCCAGTAGCGAGACTGCGCAAGCGCAATGCCCTGCTGGAGCGTTTTCTCCTGAACGTGGCTGGCATCCATGTTGGCCTGAACCCAGCGGGCCATATCAATAACGCTGGATTTCACGCCATAGGCTTCGGCGTCAAGTTGTCCCGGAGAAACGTGTACGGGCTTCCCTTCGCGATAGCCCCAGGCATAATCTTTTTGTTCGTTCTGCGGAACCGTAATCCAGGTATGCGCCAGTTTTAATGGTTGCAGGACGCGTCTGGTCATTGCCTCTTCGTAACTCATTCCTGAGGGTTTCACCGCCAGCGCGCCAAACAGACCAATGCTGGAGTTAGCGTAAAGTCGCTTAGCGCCCGGAGTCCATTGCGGCTGCCAGTTTTGATAAAAATGCAGTAATGCGGCTTTATCCCTAACGTCATCGGGGATCTGCAGCGGTAG
GCCGCCTGCCGTATAGGTGGCTAAGTGCAGCAGGCGGATACCCTGCCACTGTTTGCCTGTCAGTTCTGGCCAGTATTTCGTGACCGGATCGCTGAGCTTAATTTCGCCGCGGGCGATAGCATCGCCGCCCAACACGCCGTTAAACGTCTTACTAACCGATCCTAGCTCAAACAGCGTTTGCTGCGTGACTGGGTGGTTATTGGCGATATCGGCTTTACCCCAGGTGAAATAATAGGGTTTTCCCTGGTAGATAACGGCAACGGCCATACCCGGAATAGCCTGCTCCTGCATCAACGGGGTGATGGTGCGATTAACGATATCGGCAATCTGTTGTTCTGTTTTTGCGGCAGCAAATGTGGAGAAAGAGGCTGTCAGCAGCAGAGCGCAGCATAACGATTTTTTCATCATGAAATCAGTTCCGTAATTAAAAGCAAAAAGGTGTCCGGGCCCGTCAGACGCAATCAGTGTGTTTGATTTGCACCGTGTTGACAAACGGTTAAATTTAGCAGCAGATATAAGTTTTTCCTAAATTCCACGTGTGTTTTTTATTAGCTTCAAAAATCACTATTTCACGAAGAATTTAGACTGCTTCTCACACATCGTAACATTATTTACAACCACCTTTCAATCATTTTTGATAAATCATTGATTTCATCTTTGCTGCAATGATACTTAATAAACTCTGCAAGTTATCCACAGAGCAACACTCAATTTTATTGATGATATTCTTATTATACCAGACATTTTTCATACACTCCCTTGTACGGATAGTTTTCCGACAACTTCATGATTACATATCTTGCGGTTTTGATTATTTTTGCTGCAAGAAATACATACTTCAAACGAAAGGTCTTTATTTGCTGTCTGTATTCTGAAGAGTCCAAGGAATCAAACTTGAACAACAAAAATAGGTTATATGAAAGCATCATCATTTGAAACACGGCTTCATTCGCCCAAAATGACTTTAGCAAGAGATGACCCACCGCCATGTCGTATTTGGCTTCTTTGATATAGTTTTCAGCATTACCACGCTTTTCATAGTATATAACTACTTTTTCAGAAAGCAAGGTAGTATTTGTTACAAAGAAAAAGTAGTCGTATTCGGAACCTTCTAAAAGTGATAATTGTGCTCTTTCTTTTTCTGGTTTCAGTACGCGAGATACGACAAATCTTCTGTCTTTTTCCCATTTAACTAATTTTGTATACAGTTCTGTAGTTTCTCTACCTTCTTCTCCTTTAACGAATACAATTGATGAATTCGTTGCTTGTGAGGTGAGTGTAGAATAACTTTTGGCTTTAATTAAATATTTGCATCCAAGAGATTCTATCGTTTCGATAATTTTTTCATCAAAGTAGCCACTATCCATTCGAAATAAAATTTCTAAATCGTCTGATTTGATGTTAGCAACAATTTCTTTGATCATTTCCGCAGCACCGTTTGCAGTGTAAGTATTGCCACTTCTTACAAATCCGGTAACATATGCTTTTAATTCGTCGCAAAATGCAAATTGGATATTGTAGCATCGGTTTCCCAGTTTCTTAGGATTATATCCTTTTGACGCACCTTCTTGATGACCTTCTACGTTAATTACACTACTATCAATATCAATCGTAATGGATGTCAATTTACTTTTAGTGAGCAGTTTTTTAAAGACTTTAAAATTAATGTCTCTAAACATTTGGGTTGTCTTGAAGTTGAAGTTTCCTAGAAACCGTGACACTGTTTCAGGTTCTTTTACGGAAATATCAAACTCGTTGACGAGGGGATCATTTTGAAGTAGCTTTAGACGTTCTAACTTATCAATGCCAATGAAGTGACCGCAGAGCATGGTCTTTATATGATTCATCTTGATTTTATTTGTTGAGTCATTATCAAATACGAGGTCATTTTCAATAAAATCAAAAATCCCATTGCTTTTTGCATTCTCAAGGAGCAGAAAAAGACCTGCATTTGATGTTAGATTCTTAGCTTTGAAATCAATTTTATTAATCATAATTAG
AACCCCTTTTTACTACTTTTCTTACTATTATTTTACCATATATCGAGTCATAAAAGCTGATAATTTAACATATTTTTGAGCACTTTTCTTTCACCCAATGGGTGAAAGCTGAATTTCGAAGGAATGCATATTTATCAAGGCTTTGATTATGCTTTTTGAAGTACTGACGTAGAATCTAGGCAAATCAAAAGGGGTTTTAATAACTGGCTCAAAGCTGAAAGCTTTCCGGAACCCCCAGCCTAGCTGTAATGCCAGTCAGTTAAGCAACTGACTGGCTCTTTTTCGGGGCTGTGGGGTATTTCCAGGGCCTCTCCTTTACCACTCTCGGGAAGGCCCTTTCCCTTCTTGTCGGTAATTTCACAAGTTGTCCCATACTTGCAAGATCGCGCATCAGCTCCGGTATACGTCCCGGTGAAGCGCCCTGCAATGTCATCAGCATTCTCATCACCATTCCACATGATTCTGAGAAACTCAGTTGATTCGGCCAGTAACCTTTCAGATGTTCCGCCATTTTAATCATCTGATATCTCACCAGATTATAAGCCAGTAAGACACCCCACAGCTCTTGCTCCACAAGCTCCGGTTTTTTACTTCTCAGCGTCAGCCTGCTCAGTTGCATCGTCTGTTTTATCTCCCTGTATCCCAGTTCGATTTCCCAGCGATGACTGTACAGATCCGCCATTTCTCCTCCGGGGAAGCGCATGGCGTCCGTCATCGACGTCAGCAGATGGCAGACTTTTCCTTTGCGCGTCACGGTCAGCAGGCGGGCTGTCACTTCATTTCCCAGTCCCGGCCACTTTTTTCGTGCCTGCGGGCTGGTTTTCAGCTTCACCAGATGATCGCCTTTACCCAGTTTTCTGATCTCTTCATATTGCGCTCCCTTTCTGAGAGGTATCATCCAGTGGCGGTGTTCTCCCGCCAGGCTCCAGGCATTTAACAGTCCCAGTGAGTAATAACCTTTATCCATTAACGTCAGAGTGTTATCGCCGGTTTGTTCTATAAGTTGCTCAGCAAGCTCATTTTCGCTGTTCTTCATCGTGCCGAAGGCTGCAGCCGTCAGCAGATGGCTGGTCAGTTCCATCTGGCAGACCATTTTGACCTGCGGGTAGAGCGCCGGGTTCCCGGCATGTGTCTGGCGGGGGAAGGCTGCATCGTTCTCTGGTGTATCCGGTGTGCGCCAGAACACACCATCGATGGCCAGCAGGGTCAGGCCGCACCAGTGCGGATGCGGCGTGGCGTTATGCCAGAGCTGCGCTGTTTTCGTGAACACGCGGCGGACAGCCTCACTTCCCAGGCGCTGGCGGGCCTGAATAACGGCACTGGGGGCAACGAAGGGGCGATTGCCCGGCAGCATGATGTCCAGGCGATTCACAATCTGGTGAAGAGGTTCTTTACGCTCAAGCGCCATGCCAACAATACACCAGACCATCATTTCGAGGGGAAGACGGCGCTTGCGTAGCGTTACAGTACCTGATTCGGCAAGGCAACGAGAGATGAGTTCGGGGTCGAGGTAATCCCCCAGAGAAGTCAGTGGGTTACGCAGAGAATCGTAACGGGATACCAGATCAAGAGCCTGTCCAATGTGCATAAAAAAATCCGGAAACAAGTGAGCGTTTCCGGATTCTTACACAGCCACTGGATCGGTCAACTGATCCTTAACTGATCGGCATTACAGCCTAGCTGGGGGTTTTCTGTGCACAAAAAACCCCTGTAAAAAACTTACAGGGGTATAAGGCTTAGCCTAACTTGCGTCTGTTGCATGGTGCCGGGTGCCTCCCGGTGAATTCAGTCGGTGTCACTGAACCCGCGTAGGCTTCGCTCATAACATAATAAATGCTATGTACACCAGTCGCCCCACCGCACAGGGGGATTCACCACACAGCGCACTTTTTAACAAATATCCCTCCGGCCAGACAATAATAAACATAATGAATTGTGATCTTCTTAACGGTTTTAAAGTGTTACAGATAATATGCCAAGTAATTGCTTGTTTTTTCATC
AATAGGAAGACCACACCAACAACCCCAGCTATCAGCCAGAACCGACATTATCCGGCTGATAAAATCTACCATCACTCCAACACCAATCACCGCCTGTGCCAGATCGCGTTTCTCAAACTTTTTAATCTCTGTTGCCACTCTGCGGGTTTTCTTTTTGAATTTTGAAAATACCAAATATCGTGACGTTTCTTTGGGGGATGAGCTATCAAGCGGGAACGATCTGCCTACAGAGAAAGTCAGCCAGACCACTCTGCAAATTATCCCCGAAAGGCTCTGTGGCTGATATGCGCCGGGCATGGCGCAATGGGCCAGTGGTGTCAACGACGGATGAAAAGTGATCCACTTATATCTCCACCAACGGCCCAATATTGATCCACCGTTTTACTCAGGATTAGCTTCTGCTATAACCCCGGCCTTTCGTTTCTGTCTGAGTCGATAGCTTTCTCCTTTGATTTGAACGACATGTGAGTGGTGTAAGATACGGTCCAGCATCGCTGAGGTCAGTGCTGCATCACCGGCGAACGTTTGATCCCACTGCCCGAACGGCAGATTGGATGTCAGGATCATTGCGCTCTTTTCGTAACGTTTAGCGATGACCTGGAAGAACAGCTTTGCTTCTTCCTGACTGAACGGCAGATAGCCTATTTCATCAATGATGAGCAGACGGGGGGCCATTACTCCACGCTGAAGCGTCGTTTTATAACGGCCCTGACGTTGTGCCGTAGATAACTGAAGTAACAGATCTGCTGCTGTTGTGAAGCGAACTTTGATACCTGCACGGACTGCTTCATAGCCCATCGCTATTGCCAGATGGGTTTTCCCCACACCTGATGGCCCCAGTAATACGATATTTTCATTACGTTCTATGAAGCTGAGTGAGCGTAACGACTGGAGTTGCTTCTGCGGTGCTCCGGTGGCGAATGTGAAGTCATACTCTTCGAACGTTTTCACCGCCGGGAAGGCTGCCATTCGGGTATACATCACCTGTTTACGTTGATGACGTGCCAGTTTTTCTTCATGAAGCAGATGCTCCAGGAAGTCCATATAACTCCATTCCTGGTCTACTGCCTGTTGTGACAGCGCAGGCGCTGCGCTTATAAGGCTTTCCAGTTGCAACTGCCCGGCGAGCGCCATCAGTCGTTGATGTTGCAGTTCCATCATCACGCCACTCCTCTGCAGAATGAGTCGTAGATGGAGAGTGGATGATGCAGGGGGTGTTTGTCGAAGTTCACCAGATTTTCATCAAGATGCACGTCATACTCTTTTTTCTCCGGAGGCAGTGCCAGCATGGACTGCTGCTCTTCGAGCCAGCGATCGCAGGGACGTGCCTGGATTGTTTCATGCTTTCGTTGGTTAGCGACATCGTGCAGCCAGCGCAGACCGTGGCGGTTGGCTGTTTCAACATCGACAGTGATCCCCATCGGGCGCAGGCGAGTCATTAGTGGGATGTAAAAACTGTTACGGGTGTACTGCACCATCCGTTCCACCTTACCTTTAGTCTGTGCCCTGAAGGGGCGACACAGTCGGGGAGAGAAGCCCATCTCCTTGCCGAACTGCCACAGCGAAGGATGGAACCGGTGCTGACCGGTCTGATATGCGTCACGTTGCAGAACCACAGTTTTCATATTGTCATACAACACTTAGCGCGGCACACCACCAAAGAAGCGGAACGCATTACGATGGCAGGTCTCCAGCGTGTCATAACGCATATTGTCAGTGAATTCGATGTACAGTATTCCGCTGTATCCGAGAACAGCAACGAACACGTGAAGCGGTGAGCGACCATTACGCATAGTGCCCCAGTCAACCTGCATCTGTCGTCCGGGTTCAGTTTCGAACCGAACGGCAGGCTCCTGCTCCTGAGGAACCGAGAGAGAACGAATGAATGCCCTGAGAATGGTCATTCCGCCACGATATCCCTGGTCTCTGATCTCGCGAGCGATTACCGTTGCCGGGATTTTGTAAGGATGAGCATCGGCGATGTGTTGACGAATATAATC
CCGGTATTCATCCAGGAGTGAAGCAACAGCAGGTCGCGGCGTATATTTTGGCGGCTCAGATTTTGCCTGCAAATAACGTTTAACGGTATTGCGGGAGATCCCCAGTTCTCTGGCAATCGCCCGGCTACTCATTCCCTGCTTGTGCAGGATTTTAATTTCCATAACTGTCTCAAAAGTGACCATAAGCTCTCCTGAATCAGGAGAGCAGATTACCCCCTGGATCTGATTTCAGGCGTTGGGTGTGGATCACTATTGCACCGTTCGTGACAACAGAGAAAGTCAGCCAGACCACTCTGCAAATTATCCCCGAAAGGCTCTGTGACTGATATGTGCCGGGCATGGCGCAATGGGCCAGTGGCAGTGTGTGATGGTGGCCCTTACTGGATTTGAACCAGCGACCAAGCGATTATGAGTCCTCTTACCACTGAGCTAAAGGGTTGGAGAACGCAATATCACCTGCCTTATATGATTACACCCAGAATTTCCCGGACTGTCTTGTCAAAACATTCAGTCTCCAATTCCCACCAATAGCAAGACGGTCACTATGACAGCATCTCCGGAATGAGCTCAGGGCACGGGGGTTTACCGAGGTTACTTTTCCAACGAAGTTTCTCAAACGCAGGCGTGATAACATTCAAACTTAGGATCTCAGTTGCTATCTTTTCCCATGTCTTACGGATTAATGTCATCGGTTGCATACAAATCGCTACAAGCTGCATCAAGGCACATGCTTCATATGTACAGTACACCAAGATGGATAGGTGTTTCACCTAATCCTAGCTCATTTCAGGCAGGCTCAACAGGCTGCAGCCCGCATGTTTAAGGGCGCAGGTTATCAGCAGGGTATAAGCTGACCGTAAGCGTACAGCGAGGGCCGTATTGACGGGGATGTGTTATTCAGCTGGCAGTGCTATGCGCCACGGAAGCAGTTCGCTGACCCGGTTGACCGGCCAGTCTGCTATGACGCCAAGCACATGGCGAAGGTAGCTTTCTGGATCCACGTCATTCAGTTTGCACGTCCCGATCAGGCTGTACAGTAGCGCTCCCCGCTCACCACCATGGTCAGAGCCGAAGAACAGGAAGTTTTTACGACCCAGACTGACCGCCCGCAGGGCATTTTCAGCGATGTTGTTGTCGATTTCCACCCAGCCATCGTTCGCATAGTACGTCAGTGCCGACCACTGGTTAAGTGCGTACGCGAACGCCTTCGCCAACTCTGAGTGTCGCGACAGGGTCTTCATCTTTTCACGCAACCAGCTTTCCAGGGATTTCAACAGCGGTTTCGTTTTTCGCTGACGTTCAGCAAGCCGCTGCTCTGCCGGTATTCCCCTTATATCCGCCTCTATGGCGTACAACTGACCGATCTGTTCCAGGGCTTCTTCCGTCAGTGCTGACGGGATGCGGACGTGCACATCGTGGATCTTTCGGCGGGCATGAGCCCAGCAGGCAGCTTCCGTTATCCCACCATTGCGATACAGCTCGTTGAACCCGGCGTACGCATCCGCTTGCAGCACACCGCTGAAGCAAGCAAGATGAGTCTGCGGATGGATGCCTTTTCTGTCCGGGCTGTAAGCGAACCACACTGCAGGTGCCAACGCTGACCCGGCATTGCGGTCATCACGAACATACGCCCACAACCGCCCGGTCTTCGTCTTCTTATTACCCGGCAGCAGTACCTGGACCGGGGTATCATCGGCATGGAGTTTGCCGTCAGTCATGACATAGCCATGAAGCGCCTCTTCCAGCGGAGACAGCAGCCGGCAGCATGCATCCACCCAGCCCGACAGCAGTGAACGGCTCAGCTCCACACCTTGCCGGCCGTATATTTCTGACTGGCAATACAGCGGGGTGTGCTCTGCATACTTCGAGGTCAGCACGCGGGCCAGCAGCCCCGGTCCGGCGATACCCCGCTCGATGGGCCGCGAAGGTGCAGGTGCCTGCACGATGGCATCGCACTGAGTACAGGCATGTTTTTCCCGTACCGTCCGGATAACCCGGAA
TGCGCTACGCATCAACTCCAGCTGTTCGGCGGTATCCTCGCCCAGATAGCTCAGTGAACCGCCGCAGTTCGGGCAGCACGGCGCCGCAGGCAACAGTCGCTTTTCGTCACGGGGTAGTGATTCAGGGAACGGCTTACGGGTGCGGGTCTGACGCAACGGACGCTGTACTGCCGGGTCATACACCCTACCAGTCAGCGTATCGCTCTCTTTCTGAAGCCGGTTCAGATCGGCTTCCATTTGTGCGATACGGCGGGAGACTTTTTCGGAACGACTGCCGAAGTTCATCCGGCGGAGTTTATCCAGCTGCGCCTGCAGATGGTCTATTTCGCGCTCCCGGTTGCTCAGCTTTTCCTGCAGGGCGTGGATCAGCGCTTCCTGTTCGGCCAGGCGCTGTTTCAGCAGGAAGATGTCGTCAGAAGAGATGTCGTTCATAAGCCCGTATTTTACCGGGCTTATTCTGTGACAACCAGGATAAAGAGATTTACAGCATGGTCAGGGAGGTCAGCAGCCGCTTAGGCTGTCGCCAGTCGATACCTTCCAGCAGCATCGCCAGCTGCGCCTGTGTAAGGAACACTTTGCCATCACTGGCTGACGGCCAGGCGAAGCGCCCACGCTCCAGCCGTTTGGTCAGGAGGCACAGTCCGTCACCGGTGGACCACAGCAGTTTAACCTGACTGCCGCTGCGGCCCCGGAAAATGAAAACATGGCCGGACATGGGATCGTCTTTCAGCGCCGTCTGTACTTTCGCAGCCAGGCCGTTGAAGCCATTTCTCATATCGGTGATACCGGCAACCAGCCAAATTTTGGTCCCGGAAGGTAACGGGATCATCGCTTCAGTTCCTGTATCAGCAGAGTCAGGAGCTTTTCGCTGACATTGCCATTGAAGCGGAGCGTCCCGTGCCGGAACGTTACCTCACAGCTGATACTGAGGGTTTCCGGGTCCTCTGCGAGCGATTCTGGCTGTTCGGCAGCTGCATCGAGAGTCACAGGAAGTAGCTGGGGGCTCTCTGAAGAAGGTAATAGCAGCTTTCCCTCGCGCCATTGTTGTCGCCATTTGAACAACAGATTGGCGTTAATGCCATTTTCAAGAGCAAGTTTTGAGATGGATATCCCGGGTTCACAGGAGGCAGCAACGAGCTGCTGTTTAAATTCGGGAGAATAATTAGGGCAGCCTTTTCGCCTGCCGGGAGTCACATTTTTCTGCATATCTGACACTTTGGTTCCCACTACTTATTTGGTGGACACCACTTTGTCTAATTCGTCAGATTCTGACCAGACGGTTCAGGCTGTACGCTTACAGCTGACCGGATACATTCCGGCAAAGAAAAACCTTGCAGCGATGAAAGCCGTTTCGGTAATGCAATGACGATAAAGCTGTCCTGTATATATGTGCTTCGCCTCAAACGCTTGCCGTTTTGGTATTGTGCACATGCCGTCTGAACGATGTGGAGCCAGAAAAATGGATGCGTTATGTCATTGAGCATATCTAGGACTGGCCGGCAAACCGGGTACACGATCTGTTGCTCTAAATAGTTGAGCTGGCCTCTCAGTAAATATCAATACGGTTTTGGTGAGCCGCTTACCACTGAAGCATCACTTCGGCAGGTAGATTTCTGCAGGCAGAATGGCGCTTACCTTAGCGATAAGCGCGTAAATGTGGTTATTCAATACCTGTGGTGACTGTAAAAGTGCGCGTTTGCTGCGGTGCAACCTGAATCAGCGTGCCATTACGTTGCGCGGCAAGATACCCCTCAGGTCGACAGGTTGCTGGTAATGCAAAGGCGGCTACCTGTTGCTCGCCGTTATAAAGGATCCAGCGTGTCACATAATTTAGTTCAGCACTGGAGAAACGAGTAACAAACGTAGTGCCATCGGGAGCGATCATGCGAAACTCTGGCTGATCTGTGTAAGCGTCCAGTTTATCTGCAAAGAAGACAATTTCTGGATCATAAAATTCCGGTTGATTCAGCGTCGACAGAGAGGATTCCCCCTGCATAATC
CGTTGATTAAACGCCAGCCACTGAGGGGTGGGATTAACATGCGAAGGTACTGATTCACGCAACCTTAATATTTCGTCCGGTATATTCTGGCTGAACGTAGCATTTGGTATATATGCATAATTCATGTGGCACATATATTGTAGTGGCATATCTACAGAAGCCAGATTGGTTACGGCCATCTTAATATCGAACAGTGTAGAGGATTTGTGAAGGACCACTGTTGGCTGAGCCAGATAATGATGCCCGAACCCCATTACATACTCGTAACGCCCGGTAAGGCGTAACATATCTCCCTCTAATTCCATCCATGCTTCATCCATCGCGGCACAGGCCATTTCACCGTGTAGCAGATGAGTATCTTCCGCAGATGGGCAGCCATTAGCCAGCAAACCTGAATGAAAGGCAAAACAGCCATAGGTCTCTATCACCTCTGTCGCCGGTTTAGGCTGGCGAAACATATTGCACATGGTGAGACTGTGTCCATCAAATTGCGCATCCCAAATCATCTGCCCCATCCAGGGAAGAATAATTAAATGTCCACGACTGTTTGCAATTTTAAGCCCCTCGACACCACTGTCATAGCGAAAAGACGTGACAGTAAAATCACTATTTTCCAGCAAGATACGAGGTTTTTCGCCAAAAAGCGCCCGCCACAAATAAATACGCGTACTCATAACGATTCTCCTCAGGACTCTGTGACTTCAGCCAGTGCATTACGTACTTTGCTTTCACGCCAGAAGTAGACACCGACATAGACAAAACAGAGCATAGAAACCAGGAATGAAAGCTGTAGTGAGTGGAACATATCTGCAATATACCCCTGAATTGCCGGAACCACCGCTGCGCCAACAATAGCCATAACAATGACTGCTCCTGCCATTTCTGTATGTTCGTTATCAACTGTATCCAGTGTTCCTGCATAGATCGTCGCCCAGCAAGGGCCAAACAAAACACTTACCAGGACGGCGACATAGACCGCGCTGAAACTTGGAGCCAGTGCAACATATGCCAGAAACAGCGCCCCTATAACGGAATAGAGAATTAGGACTTTTTCCGGATTAAAACGTGTCATAAGGATGTTGGCTATAAACTTGCCAATAAAGAAGCAGGCAAAGCTGTAGACCATGAAGTTTGAAGCATCACGTTCGTTGATATCGCCCAACTCCAGTGCCAGACGGATGGTAAATGACCATACTGCGACCTGCATACCCACATAAAGGAATTGAGCCACAATACCACGACGAAAGCGCGAATTTTTAGCCAGATAGCGCAGCGTATCCATTGCTGACGGGCGTTTATGGTGACTTGTCTGTGCCACTTTACAGGTTGGGAAGCGGGTTAAAAGGAACAACACCATGACCACAACCAGAATCATAATCATATACTTATACGGTTCAAGGGTGTTCTCTAACATCAGCACCTTAAAGTTGTGAATTTGCTCGGCGTTCATTCCGGACATCTGCTTCTCAAGGCTTTCCCCCTCGGAGAAAACCAGATATTTGCCCAATAAAATACCAGACGCAGCACCAATCGGATAAAAGGTCTGGCTGATATTGAGCCGCAATGTGGCATAGGCTTTTGGACCGATCATTGAACTGTATGTGTTCGCTGCAGTTTCAAGGAAACTCAGGCCAATCGCAATCGCAAAAATAGCTGCAAGGAACATAGTGTAGGTTGCCATATGCGAGGCAGGGAAAAAAAGTGTACAACCACCAATATACAGCGTCAGGCCAATTAAAATTGCCACCTTATAACTGGTCTTTTTAATCACAAGGGATGCTGGTATTGCAATTAAAAAATAACCTCCATAAAATGCGCTCTGCACCAATGCTGAAGCAAAGTTACTTAGCGAAAATACACTTTTGAATTGAGTGATTAATATGTCATTTAATGCAGCTGCGCATCCCCATAGCGGGAATAAACACGATAACAAAATAAACTGGAACAAGGGAGTCTTATTCAGATACCCATCAGGCATCTGAATGATGTTTTTATCGTTC
ATAGTGCTACCTTTAACTGTGCAGGATGATTATTCGTTTAAGGTTAAAAATTCATTAAATTGTTCAATACTCGGATAAGATGATTGCGTACCTTTCCCTGTGACGCTGAAAGCGGCAAAGAGAGCGGCTTTTTTCAAAGCGGCTTCAACATCACCGCTTTGAACATAATAATGGGAAAAGCAACCAATAAATGCGTCACCAGCGCCACTAGTATCAACAGCATTTACTTTGAATGCAGGAACATGGACTTCCTGATCGCGGGTCATCCATAATGCGCCTTTTTCGCTCATGGTAACAATAATATTGTTCAGCCCTTTATCAACTAACGAACGTGCGGCCAAACGAATATGATCATAAGTATCAACCGACATACCGGTTAATATTTCCAGTTCTGTTTCATTCGGAATAAAGAAATCACATTTGCAGGCATAAGACATATCTAACTCACGCAATGCCGGAGCCGGATTTAATAACACTTCAATACCATTTTTCTTACCAAACTCAATCGCGTGGTAAACTGTTTCCAGTTGAACTTCCAGTTGTAAAACGATCAATTTGCATTTTTTCAGATCTTCTGCAGCTCGATCGATATCTTCCGGGGAAAGAAATTTATTCGCTCCCTTAATTATTAATATACTATTGCTCGAGTTGGCATTAACAAAGATCGGTGCAACACCACTGCTGGTACAGGGGACCTTCTCAACATAAGTGGTATTAATTCCCCATGATTCGAGATTACGAATAGTATTATCCGCAAAAATATCATCACCTACTTTAGTCAGCATCAGGACTTTTGAATTCAATTTAGCCGCCGCCACCGCTTGATTAGCACCTTTCCCACCACATCCGATTTTGAAGGCAGGTGCTTCCAGAGTTTCTCCTTCTTTAGGCATCTGATTAGTGTAAGTAATGAGATCCACCATATTGGAACCAATAACTGCAATGTCCATTTCACTACCTCTTATAAACTTTCGCATAACAATGGTATTTAAATAACATTAGCATGTTACTTTTGCATCATTTGTGACTGAGATCGCGATTAGCACATCAACCCGATGTTTATTTAATAGACTTCCAGTCTCATCACTCAGGCCAACACTATCTAATCATAAGCAACCTAACAAGATTAGTGCCCAAAACTCAGCAGCCTATACCCTTTTCATTTCAAAGGGGCGGTCGTATAGTATGGTAATGAAAACAATGTTTACTAACGCCAAAATGTTATTTTTATAACATTCTTACGGAGAGAGAGTTGATGGAAACGAAGCAAAAAGAGCGTATCCGACGTTTGATGGAACTGCTTAAGAAAACCGACAGAATCCATTTGAAAGACGCAGCGCGAATGCTGGAAGTTTCTGTAATGACTATTCGTCGCGATCTCCATCAGGAAGATGAACCTCTGCCACTGACCCTACTGGGTGGCTATATTGTAATGGTGAATAAACCCGCGCCATCCATGCCAGTAATCCATGACGTTCCAAAAAATCATCGTGATGACTTACCTATTGCAATTCTGGCTGCCGGAATGGTTAATGAAAATGATCTGATCTTCTTTGATAATGGCCAGGAGATACCACTCGTTATAAGCATGATCCCGGATGCAATCACCTTCACCGGTATCTGTTACTCACATCGCGTCTTTGTTGCGTTGAATGAAAAGCCTAATGTAACAGCAATACTTTGTGGTGGTACGTATCGTGCCAGAAGTGATGCTTTTTACGATGCCAGTAACTCTTCGCCATTAGACTCTCTCAATCCGCGAAAAATATTTATTTCCGCCAGCGGTGTGCATAATCACTTTGGCGTCAGCTGGTTTAACCCTGAAGATCTTGCCACTAAGCGTAAAGCGATGAACCGTGGACTACGGAAAATTTTGCTCGCCCGCCACGCGTTGTTCGATGAAGTGGCCTCTGCCAGCCTCGCACCGATCTCTGCATTTGACGTTCTGATTAGCGATCGTCCGTTACCGGCAGATTATGTTACGCACTGCCAGAATGGTT
CTGTAAAGATCATTACACCTGATTCAGAAGACGAATGACTTACTGAAAAAACACCACAATCTTGTTAAACATCGTCGGATTGGACTGATTACGTTGCACTTTCACCACATATTCCAGCTTATCTATTTGGCTTATCACCTACTCCAGACGCTGGTCATCCTTGACCAGTAGCCAGATATGGCTTTTGTCGCAGTCCTGAATCGGCAGGCAGAGAATGTCTTCAACGTTAAGAGCGCGACGGGCAAAAGCCCACAAAGGTGAGTCATTACACCCGGATGGTAAACAGGCACTTCCTTGTTGTAGAAGCCGTTATATTGTCCATAACGTCTGGGTTTTTAATGCATACTGGGTAACAACCGGACGAATCTTCGAATATCACTGGCGCTGCCAGCCGGTTTGAATTCATCTCACAACCCTGCATAGAGCGAATCTCCTGCCTGCACGTCACTCCACTCCATGGTATCAACTTCACACTCTTTATCTGCGGCTAGTTTCAGCCACCAGATAAGCATCGACTATGAAAGATGAGCCATGACAACATAACGTTGGTAACGCTCTGACGCCTTAATGGAAGATGCCTGCCACCATAGGGAATGTAAACGACTGAAGTGTGGCCTTTAATGCCGTGAACGGCTCATGGTCTCCTGGCACGGTTGCCGCCCCAACCTGTAACAACATTCCACAGTACAATGTCTGTCAGAGTCAGAGCCTCCCATGCTTGTTGTAGTAACTCTACCAGTGGATTTGCCCCTATATTTCCAGACGCCTGTTATCACTTAACCCATTACTGGCTTGCTGCCGTAGATATTCCCGTGGCGAGCGATAACCCAGTGCACTATGCGGATGCCATTCGTTATAATGCTCGAACGCCTCTGCAAGGTTCTTTGCTGCCGTTAACCCGTCTGGTTTGGGCATGACACTGATGTAGTCACGCTTTATCGTTTTCACAAAGCTCTCTGCTATTCCGTTACTCTCCGGACTCCGCACCGCCGTGCTCTTCGGTTCAAGCCCCAACATCCGGGCAAACTGCCGTGTTTCATTAGCCCGGTAGCATGAACCATTATCCGTCAGCCACTCTACTGGAGACGCCGGAAGCTCGTTGCCGAAGCGGCGTTCCACCGCTCCCAGCATGACGTCCTGTACTGTTTCACTGTTGAAGCCGCCCGTAGTGACCGCCCAGTGCAGTGCCTCACGGTCACAGCAGTCCAGCGCGAACGTGACTCGCAGTTTTTCTCCGTTATCACAGCGGAACTCGAACCCGTCAGAGCACCATCGCTGATTACTTTCTTTCACAGCCACTCTGCCGGTATGTGCCCGTTTCGATGGCGGTACAGCAGGTTTTCGCTCAAGCAACAGCGCATTCTGGCGCATGATCCGGTAAACACGTTTGGCATTGATCGCAGGCATACCATCAAGTTCTGCCTGTCTGCGAAGCAGCGTCCATACCCGACGATAACCATACGTGGGCAGCTCTCCGATAACATGGTGTATACGGAGAAGCACATCCGTATCATCAGTGTGACGACTGCGGCGGCCATCCATCCAGTCATCGGTTCGTCTGAGAATGACGTGCAACTGCGCACGCGACACCCGGAGACAACGGCTGACTAAGCTTACTCCCCATCCCCGGGCAATAAGGGCGCGTGCGCTATCCACTTTTTTGCCCGTCCATATTCAACGGCTTCTTTGAGGAGTTCATTTTCCATCGTTTTCTTGCCGAGCAGGCGCTGGAGTTCTTTAATCTGCTTCATGGCGGCAGCAAGTTCAGAGGCAGGAACAACCTGTTCTCCGGCGGCGACAGCAGTAAGACTTCCTTCCTGGTATTGCTTACGCCAGAGAAATAACTGGCTGGCTGCTACACCATGTTGCCGGGCAACGAGGGAGACCGTCATCCCCGGTTCAAAGCTCTGCTGAACAATTGCGATCTTTTCCTGTGTGGTACGCCGTCTGCGTTTCTCCGGTCCTAAGACATCAATCATCTGCTCTCCAATGACTAGTCT
AAAAACTAGTATTAAGACTATCACTTAAATAAGTGATACTGGTTGTCTGGAGATTCAGGGGGCCAGTCTAACCAGTTACGAACATCCTTCCTCAAAATTGTTGTCATATCTCGCATGGAAGAAAAGATCCTGGCTAAGGAGCAACAAACAACGTATTGCGGAACTTGCATATTTTTCCTGTAACTAGTGTATTACCACATATGGTAATAGCTACCTGTGTGGTTTCGCTGGATAGCAAGGGGATTTATTCGCAAGTAAAATGCCTGATAAAATACACGAATCTAGTAATCATCAATATTTACTCTGGTCGAATGACGCGTGAAGTGGACTGCCAGCAGACGCGGCCAGTGGTCCACCGCCTGCTGAACAAAACGCCAGATATCTCTCGGCTCTGAAAGTAACGCTTCGGTTATTTGCACGGAATACTACTCCTTCAGACTCTGTTAAGTTTTGTTTGTTAAACCGGTGCAGACCTGCAGGAAAGCATGCCAGCACCGGCACTGTACGATATAAACATCCGGTACCGGGGATACGAATGGAATGACGAATACGCCAGAAAAGGGATAACAACCTTCCTCATAATGGTGAAATCATTCGCTATCGGTTACACGGTACGCGATGTCGCCAAAGGCAGCTGGATCGACGAATCCACGGTCACGCTACCGAAAGCGCCGCCGCTTAACACCCTGCCTCGGGCGACCAAAGTGCCGGAGCCGCAGCAGCCGCAGGAAGATTACACCTTTGAAGGTTACCGCAACGCCGACGGCAGCGTGGGCACCAAAAACCTGTTGGGTATTACCACCAGCGTGCACTGCATGGCAGACGTTGAGGACTACGTGGTTAAAATTATCGAACGCGACCTGCTGCCGAAATACCTGAGCATCGACGGCGTGGTCGACTTGAACCACCTCTACGGCTGTGGCGTAGCGATTAATGTACCGGCCGCCGTGGTGCCAATTCGCACCATCCATAATATTGCGCTGAACCCAAACTTCGGTGGCGAAGTGATGGTGGTGGGCATGCAGTGCGGTGGCAGCGACGCGTTCTCCGGCGTTACCACTAACCCCGCTGTCGGCTACGACTCTGACCTGCTGGTGCGCTGCGGCGCAACGGTGATGTTCTCCGAAGTCACTGAAGTACGCGACGCCATTCATCTGTTAACGCCACGCGCCATCAATGAAGAAGTGGGCAGGCGTCTGCTCGAAGAGATGGCCTGATACGATAACTATCCCGATATGGGCAAAACCGACCGCAGCGCCAACCCTTCGCCGGGCAACTAAAAGGGCGGCCTCGCCAACGTGGTAAAGAAAGCACTCGGCTCCATTGCTAAATCGGGTAAAACCGCAATTGTTGAAGTGCTGTCGCCCGGTCAACACCCGACTAAACGCGAATTAATTTACGCCGCGACGCCAGCCAGAGATTTTGTCTGTGGCACGCAACAGGTGGCTTCGGGTATCACCGTGCAAGTGTTTACGACCGGCCGTGGTACGCCGTACGGCCTGATGGAGGTACCCGTCATTAAAATGGCGACCCGCACCGGGCTGGCGAACCACTGGTTTGATTTAATGGATATTAACGCAGGCACTATCGCTACCGGCGAAGAAACCATTGAAGAGGTGGGCTGGAAGTTGTTCCACTTTATTCTCGACGTCGCCAGCGGGAAGAAGAAAACCCTCTCGGATCAATGGGGATTGCATAACCAACTGGCAGTGTTTAACCCGGCACCGGTGGCCTGATATTCTCTTCATACATTAAGTTGTATTATGCCCGATAACGCTTGTTTATCGGGCATAGTGAATCACAGCGAAGACGCGAGCTCCCCGACCAGAATCACTTCAACCCCAGCCTTTCGCAAGCCTTCCAGACTATCCGCAGGAATGCCTTCATCAACAATGATCATGTCGATACGTTGAGTATCAATGATCTTATGTAAACTGGAACGATTGAACTTACTGGAATCGGTGACCACGATGATCCGTTCCGCAACTTCGCACATCCGG
CGGTTTAAACGGGCTTCATCTTCATTATGCGTGCTGACGCCGCGCTCCAGATCGATCGCATCTACACCAAGAAACAGCATATCGAAGTGGTAATTTTGCAGCGATTGCTCAGCCTGATCGCCGTAAAAAGATTGCGACTGACGGCGCAAATGCCCGCCGGTCATCAGCAGCTCAACGCCTTCCGCTTCCAGCAACGCATTAGCCACGTTCATACCGTTGGTCATCGCAATTACGTCAGTGTGCTTGCGCATCAGACGAGCAATCTCAAAAGTGGTGGTCCCGGAATCGAGGAGCACCCGATGACCTGGCTGAATCAACTCAACGGCAGCTTTCGCAACGCTGCGTTTCATCGCGGTGTTCAGTGCGCTTTTGTCTTCCACTGATGGCTCGACTGACGGCGTCGTGCTATCGCAGATCAACGCGCCACCATAGGCACGCACAGCGATTCCCTGCTTTTCCAGAAACGCCAGATCGTTGCGGATCGTCACAGTAGATACGCCATACAATGCCGACAGATCGTTAACCTGCACACTCCCTTGCTGTCGCAGACGCTGAATGATCTGTTCTCGTCGCTCGCTGGTGCCTGTCACTCGCTTCTCACCTGAAGCGTCGGTATTACTCATAGTAAGTCCTTTCGTAAAACTTTCGTTTCATTTCGTTTTGCCTATTAACGCCTTTCTATTAAGCAAATGCAAGCCCACCTTGCCCATTGGCGCAAGCTACTCTCGTTTCACTGACTTTCATTATGTTTCTTTTGTGAATCAGATCAGAAAACTATTATCTTTCGTTTTATTTTTATCTCACCATGACGCAGTATCAACTGAAACAAAACGAAAGATTAATATCGCAGCAATCTGAACTGGAGAGGAAAGTGAAACATCTGACAGAAATGGTGAGGCAGCACAAAGCGGGCAAAACAAATGGAATTTATGCCGTTTGTTTTGCCCGCTTTGTGCTGCATGGGGCGAGCGATGTGCCGGATGAGTATGTTCGTCGCACCATTGGGCCAGGCGTCTGCAAAGTCAACGTTGCAACCGAGTTGAAGATCGCCTTCTCTGACGCTATCAAAGCTTGGTTTGCTGAAAATCAGCAGAGCAACGATCCGCGCTTTTACATGCGGGTTGGCATGGACGCCATGAAAGAGGTGGTCAGAAGCAAAATCGCCGTCTGCGGCTCGGCAAATCGATTACGGCTACCGGCGGAGGCCTGATCCAACAGCGTATTACCTCAATATTTCAAAATAATTATAAGTCCCACAAATATGAAGGCGCGTCCTTAAACCGGGTAGTGCCTTCCATTATCCTAAAATTCGAGGAGCCCTATATGACACAAAAAAAATCTTTTAAATCAAAATTATGGGAGTTTTTACAAAGTCTGGGGAAAACCTTTATGTTCCCGGTTTCGCTTCTTGCCTTTATGGGATTGCTGCTGGGTATCGGTAGTTCAGTCACCAGCCCTTCCACCATTACTAGTTTTCCCTTTCTGGGCGGCGAATTTACCCAGTTGACCTTTGGCTTTATCGCTATGGTCGGTGGCTTTGCTTTTACCTATCTGCCGCTGATGTTTGCCATGGCGATCCCCATGGGGCTTGCCAAGCGCAACAAAGCGGTCGCTGCCTTTGCCGGGTTCGTTGGCTACATGCTGATGAACATGAGCATTAATTATTACCTGACGGCTACCCACCAGCTTGCCGACCCCGCCACCATGAAACAGGTAGGACAATCGATCGTGCTGGGCATTCAAACCCTGGAGATGGGGGTATTAGGTGGCATTGTGGTTGGGGTTATCACCTATTTTCTGCATGACCGTTTTCAGGACACGGTTCTGCATGACGCCTTCGCCTTCTTTAGCGGCATTCGTTTCGTGCCGATTATTACCGCGCTCACCCTGTCGCTGGTGGGTCTGTTCATTCCCATGCTGTGGGAATACGTCGCGCTGGGCATCGCGGGCATTGGGCATATCATCCAGAGCACCAGCGTTTTCGGCCCCTTCCTCTACGG
CGTAGGCGTGCTGCTGCTTAAACCTTTTGGTCTGCACCACATCCTGCTGGCGATGGTGCGTTTTACCCCAGCAGGCGGCATTGAAATGGTAAATGGCCATGAGGTCGCCGGGGCGCTGAATATCTTCTACGCCGAGCTCAAAGCCGGCCTGCCGTTTAGCCCGCACGTTACCGCGTTTCTGTCACAAGGGTTTATGCCGACCTTTATCTTCGGTTTACCCGCCGTGGCTTACGCCATCTACCGCACCGCGCGTCCGGAAAATCGGCCGGTCATTAAGGGGTTGCTGCTTTCCGGCGTGCTGGTTTCCGTCGTCACCGGTATTTCAGAGCCGATTGAGTTCCTGTTCCTGTTTATCGCCCCCGCGCTTTACGCCTTCCATATCGTCATGTCTGGCCTGGCGCTGATGGTAATGGCCCTGCTGGGAGTGACCATCGGCAATACCGACGGCGGCATTCTGGATCTGCTGATTTTCGGCGTGATGCAGGGAATGTCGACCAAATGGTATCTGCTGTTCCCGGTTGGTATTGCCTGGTTTGCCATCTACTTCTTTGTCTTCCGCTGGTACATCCTCAAACACAACATCAAAACGCCGGGCCGCGAGGTGGATGTTCAGGGGGCACAGCAAGCCGTCGAGGCGAACACCCGCGCGCGCGGAAAATCAAAATACGATCACGAGCTTATCCTACGTGCGCTCGGAGGTAAAGAGAACATTGAGTCGCTTGATAACTGTATTACCCGCTTGCGTCTGGTGGTGAAAGATATGGGCCTTATCGATCAGCAGGCGCTGAAAGCGGCAGGCGCGTTGTCAGTGGTGATGCTTGATGCGCATAGCGTGCAGGTGATCATCGGACCGCAGGTACAGAGCGTCAAAACCGGCATTGAAGCCTTAATTTAACAGGAGGAGTGATGTTTGATTTCGACAAAATCATTGAGCGTCAAAATGATAAGTGCCGTAAATGGGACCATACCTTTGTTTGCTCGCGTTTCGGTGACGTCCCGGAGTCCTTTATCCCCCTATGGATAGCCGATATGGATTTCACCTCACCACCTGCGGTGATTGACGGTTTCCGGCGCATCGTGGAGCACGGCACCTTTGGTTATACCTGGTGCTTTGACGAATTCTACGACGCGGTCATTGCCTTCCAGCGCAAACGTCATCAGGTTGAGGTGGAAAAGTCGTGGATCACGTTGACCTACGGCACCGTATCCACGCTGCACTACACGGTTCAGGCATTCTGCAAACCGGGTGACAGCGTGATGATGAACACGCCGGTCTACGATCCTTTTGCGATGGCGGCACAGCGCCAGGGCGTGCAGGTACTGGCTAACCCGCTGCGCGTGGAGGAAAACCGCTATCAGCTTGATTTTAATCTGATAGAAGAACAGCTCAAAACCCACCGTCCAACGCTGTGGTTCTTCTGCTCGCCACATAACCCGTCCGGCAGGATCTGGCGCGAGGAAGAAATACGCCAGGTGTCCGATCTCTGTCAACGCTACGGCACGATTCTGGTGGTCGATGAGGTTCACGCTGAACACATTCTGGATGGCAAATTCGCCAGTTGTCTCACCTCTGGCTGTGCCGCCCAGGACAACCTGATCGTGCTCACATCGCCCAACAAAGCGTTCAATTTGGGCGGGCTGAAAACCTCCTACTCCATGATTCCAGACGACTCGCTGCGCCAGCGCTTCCGCCAGCAGCTCGAGAAGAACTCCATTACCTCGCCCAATTTGTTCGGGGTATGGGGAATCATTCTGGCCTATCAACACGGTCTGCCCTGGCTCGACGCGCTGAACGGTTATCTGCAAGGCAACGCCCGGTATCTGGCGGATGCCCTCCAGACCCACTTCCCGGCGTGGAAGATGATGAACCCGGAATCGTCGTATCTGGCGTGGATAGACGTAAGCGCGGATGAGCGTAGCGCAACGCAGCTAACCCAACATTTCGCACGGCAGGCAGGCGTGGTCATAGAAGACGGCAGCCACTATGTACAAAACGGCGA
AAACTACCTGCGGATTAATTTTGGCACCCAGCGCTACTGGCTGGAGCAGTCCATTAACCGAATGCTGAAAAATGACAAATAAGGATCTTACCCCGATGAAGAAAGTGCTCACTCTCTCACTGCTGGCTCTCTGCGTTTCTCATGGTGCAGCGGCAGCAAACTACGCGCTCAATAACGACAATATTGCCCTCTTGTTTGATGATACAAACTCAACGGTCGTGGTGAAGGACAACAAGGCTAACCATCCGCTCACGCCGCAGGAGTTGTTCTTTCTGACGCTGCCGGATGAGAGTAAAATCCACACCGCGGATTTCAAAATCAAGCACGTCGAAAAGCAGGATAACGCGATTGTCATCGACTTTACGCACCCGGATTTTAACGTCACGGTGAAGCTGAACCTGGTGAAGGGAAAATACGCCAACATCGGCTACACCATTGCCGCCGTGGGGCAGCCGCGCGACGTCGCTAAAATCACCTTCTTCCCGACCCAAAAACAGTCTCAGGCCCCTTACGTAGACGGCGCAATCAATAGCTCTCCGATCGTTGCGGACTCGTTCTTTATCCTGCCGGATAAACCGATCGTGAATACCTACGCCTATGAAGCCACCACCAATCTCAACGTAGAGCTGAAAACGCCGATTCAGCCAGAGGCGCCGGTCAGCTTTACTACCTGGTTCGGCACTTTCCCGGAAACCAGCCAGCTGCGCCGCAGCGTGAACCAGTTTATTAATGACGTACGTCCACGCCCATACAAGCCTTATCTGCACTACAACAGCTGGATGGATATCGGCTTTTTCACTCCCTACACTGAACAGGATGTGCTGGGGCGTATGGACGAATGGAACAAGGAGTTCATTACGGGCCGCGGCGTGGCGCTGGACGCCTTCCTGCTGGATGATGGCTGGGACGATCTGACCGGACGCTGGCTATTTGGCACGGCATTCAGAAACGGTTTTAGCAAAGTACGGGAGAAAGCCGACAGCCTGCACAGCTCCGTTGGGCTATGGCTTTCACCGTGGGGTGGCTACAACAAACCGCGCGACGTTCGCGTTTCGCATGCAAAAGAGTATGGGTTCGAAACCGTGGACGGCAAACTGGCGCTGTCGGGAGCGAACTACTTTAAAAACTTCAATGAGCGGATCATCAAGCTTATCAAAAACGAGCACATCACCTCGTTTAAACTCGACGGGATGGGTAACGCCAGTTCGCATATCAAAGGCAGCTCGTTCGCCTCAGATTTCGATGCATCAATCGCCCTGCTGCACAATATGCGCAGCGCAACCCCGAATCTGTTTATCAACCTGACCACCGGCACCGACGCCAGCCCGTCCTGGCTGTTCTACGCTGATTCTATCTGGCGTCAGGGAGATGACATCAACCTGTATGGTTCCGGTACGCCGGTGCAGCAGTGGATGACCTACCGCGATGCCGAGACGTACCGCTCCATTGTCCGTAAAGGCCCTCTGTTCCCGCTGAACTCGCTGATGTACCACGGGATAGTCAGCGCCGAGAATGCCTATTACGGGTTAGAGAAGGTGCAAACGGACAGCGACTTTGCCGATCAGGTCTGGAGCTACTTCGCGACCGGCACCCAGCTGCAGGAGCTGTATATTACCCCGTCCATGCTGAACAAGGTGAAGTGGGATACGCTGGCGAAGGCTGCAAAATGGTCGAAGGAAAATGCCAGCGTGCTGGTTGATACCCACTGGATTGGCGGCGACCCAACGGCGCTTGCCGTGTACGGCTGGGCATCCTGGAGCAAAGACAAAGCCATTCTCGGTTTGCGCAACCCATCGGATAAGCCACAGGCCTACTATCTGGATTTGGCTAAGGATTTCGAAATACCGACAGGAGACGTGGCGCAGTTTAGTCTGAAAGCGGTATACGGCAGCAATAAAACCGTGCCCGTTGAGTATAAAAACGCGACGGTGATTACGTTGCAGCCGCTGGAAACGCTGGTGTTTGAGGCGGTGCCCGTTAACTAAACGCTTGTCCCAATG
AGCAGACCGGGTAAGGCGCAAGCGCCACCCGGCAAAACCGGCAGCAGGGGCTTATTCCCCCTGCTGTTCCAGCGCATACTTATACAACGCATTCTTCTTCACTCCGTGGATTTCTGCCGCCAACGCCGCCGCTTGCTTCAACGGCAGCTCAGCCTACAACAGCGCCAGCGTACGCAGCGCATCGGCGGGCAGTTCGTCATCCTGGGCTTTATGGCCTTCAATAATCAGCACCATCTCGCCTTTGCGAGGGTTTTCATCTTCTTTGATCCACGCCAGCAGTTCGCCGACCGACGTGCCGTGGATGGTTCCCAGGCGGGTGTTCATCACATAACGGTATAGGGCATCATTCAGCGGAGCCATTAACAGGCAGCACGCGTCAACCCAGTTGGAGAGTAACGCCGTCATTGGGTTATTTCTTTCCATAAAAATACGCAAACATGCCCCATACCTCCTGTTCCAACAGCTCAACCTTTCTTAATCATTGCCGAAATTGCGCCACAGTCATGAAACTTATCAAAACCAGTGGTGCCTTTGATATATTCAGTAACCAGAGTACCAATAATCTTAACCGTTTCTTTGTTCGTTGTGATAAAGCCCAGTGGCATCGTTACAGGCAATATTTCGGGTTGAGAAATATGATAGATAACGGCAGATAACTCATCCTCAAGATGAAGGATCTCAACATTACCATCTAACAGCCCCTCAACTAGTCCTTTAGCGTACTCATGGAATTGCCCAAGTTCATACTTCCACTGTCTGGCAACCCGAGCATCAGCTGCATCCCGGACAGGTACGTACAGCACCGGTTTATACAGCTGCTTATCCTTCGTAGCCGACTTGCTAAGAAAATTGCTCAGGCTTGAGATAAATCGAATGCTTTTTGACCAGTAGGGATCTGATGGTGCCACCCAGATGAACGGGCTACCTCCATAATCAGTTTTATCTCTGATACGCTGATTGAGCGCTGGTGCCAGAGCTTTCTGCAGCTTTTCCCAGTTGTTTTTAGAATAAACCGGACCTAAAAAAGTCGTTCTGCCTGAGGACGAGTATTCTTCCAGTGCTTGCATATCTTTGATAAACAACGGATTAAGTCGATTCGTTTTAGAGTCAAACAGCTTAAGCGTAAGGTCATCGTTAATGGATTGCATGGCCTTAAAAATACTCATGGTATTGTTATGCCGTGATAACCTGGCTATCTCAACAGCCATTAACTCATGAACACATTCCGAACTATTTAACTGTTCATAGTCTTCACGCTCGACAGCCTTACCATCGCTATCCATTTTAAATTCCGTTGTTAAACGGAGTGGTATTTGCCGCAGCTTATTAACAGAATTAACGGTATAGATAGAAGGTTGTTCTTTTCCGACCTGAACCTGCAGGTGAGACAGGTAACCTTTATCCAGGATTTGATTAAACTGGTATTGAAAGTGCTTCTCAACCAGTTTCTCACTTTGAGAAATGTTGTTGTCATCAACCTTTAATGTTACCCCTCCCCTCCAGTGGTCCTGAAAAGCCTGGTTGCTGAAGCAACTAAATGTGGCTAAATCAAAGGATAAGGTACAGCCACTGTCCTGAATAGTTGAAAGATGCGGTATCTCGCCAACCTCTGAAAAATAGCCATTTGATTTGTCTGTTAAGGCCAGTCTGCCGTCTGAGGAGAGCGTCAATTCGCCACGCTGCACAAGATCGGAAATCGCCTCCTCTGTCTCACGGTGGGTCAGGCCGAAATAGGTTGCTATTTGTGCTTTGCTCATCGAAGCTACATGAACAAGTCTCAGCACAAACTCCCGGATAAATGGCAGCCCCTTCTGGGAAACATAGGAAAACTGAATGTTAAACCTCTGTGCTGGCAGCAGGAAGTCAACCTCATGATAGGTAACTTTGTTATCAGATATCATTTCTTTTTGCCTCCCTGCTGAGCAGAGAGGAACCTGTATCCAGCTTCCTGACCTCGCTCAGCCATATAACTTACAACGTAGCCTAATGGCAATTCTTTA
TTGTTTCCCTTCCAGATATCGGCATTACCCACAATCAACAGTCGATCCATTGCGCGTGACATCGCGACATTTATACGGTTTGGAACGCGTAAGAAACCTGGGCTATGCTGCTTATCCGAACGCGTCAGTGACAGGATAATGATCCGATTTTCCTTTCCCTGATAACTGTCAACAGTGTCAATTTTAACAATGTCCTTAAATCCCTCGCTCCAGATTTCCTGATTGAATTTCTGACGGAGTAACCGCTTTTGTTCGGCATACATACATATCACGCCGATAGCGGCTTCATCTTTGCTAACAAGTTTTGAAAGCTTAGCGACAAATTCTTCATTCTCTGACACCTGTTTAAGAACAGAAATAATCTCGTCAGCTTCACATCGGTTGTAAATGCTTGTTCCGCGATCTTCAAGATGATGTGCTCGGTGGCCCTGATTAGCAGTATCAAGCCAGGTTACAACGCTACGTAACGCTTCCGGAGCTTGCTGATAGACATCCGGAATTGCCCGTACTCCATTCAGAAGCTTCCCGTCATAAAACGTCTTCGATACGAGATTACCAATCGGTGGAGCCATACGATACTGGGTCATCAAAGCTGCACTCGTCTGCGCACCATAAGCAGAGTTGAAGGCTCGGGCAAAGTCACTTCGTAATACCTCGTCAATTTCAGTGCGGGAGTTATTGATACCCAGCTTCCTCGCTAATGCCGCCTTGTGGGCATCTGAGTACAATGGAGGAAGCTGCATGTGGTCACCCACCAACAGGACACGCCGGGCTGACTGCATTGCAATGGCCAGCTCACTTGAAATCGAGCGCGCCGCCTCATCAATAATCACCCAGTCATAGATATTCTCCTGAATGCCAATGTGCCCTTGTCCAATACCAACGCATGTGCCAGCCACTAACTGCCTGGAGCGCGAATAAAACTCGTCCAGGTTCACTCGCTCTCCCGACATAGCATCCTGCATATCACGGGATATTTTAGCTAATGCTTTTACTCGTCTTGCCTCATCAGGTCTGACACCATACTCTGTACATAGCTTGGAAATCAGAATGTCTTTTGCTGCTGAGACCTTCACGCCATTGTCCAAGTTAATCCCATACTCCTGACTCAGCTTAGAGCGGATGGAAAAATCGAGTTCGACTGCAATATCTTTCAGTTCATTGCTCTCATTTGAATCCGTTAAATTGTTAACCTGATAGAGCAATTTCTCAAGGTGATCGATTTGTCTGAACAAATTGAGCTCTGCAAGAACAACACCCGAAATAAATCCCGGCTCCAGACCAATAGCTTCACTCAATGCTTCAACACGGTACTTAATTTCAGCATTGAAGAGTTCGCGCTTCTCTGTTGTGATTGCGTGTGAGTACACATCTTTTAAGCCAGGGGAAACGGCTCCCTCTCGGTTGCTGAACCTGACAACGTCCAACTCTGTACCAAGCCGGGAACAATGCTTTCTGATACGCTCGGCCGCTGTATTCACAGCCTCGTGTGACTGGCTGACCAGTAAAATGCGTTTGGTATTCTGTTTCTCGATCAGGTAGTGAACAAAGGCCGCGATGAACTCGGTTTTACCGGTCCCCGGTGGCCCCTGAAGCAGGGAGAGAGGGCCATTATTGACCAGTTTGTTAAACGCCTTTCTTTGCTGTTCATTCAGACTGATCTTGTTTCCGTGCTGATCTTCACGGTCGTATCTGGCAAAATCGGTATCGCTGAGAGTGATACCATAATTTTGAGCCGCCTGTTTGCAGGATGGATCGAACAAGTCAATTAAGTCAGGCAGCACACTTTCTCGATCTAGGAGACGTTCCAGTGCGCGTTTACGTTTCTGATAAGATGCACGAGTCGGCCTTGTACGGAAGAAGACAATATCAGAATCCTTCAGCTTGAAGGCTGCTGAACTAACTTTGACAAGACGAATCTCTTTGAGCTCTGACTTCTTAAGTGACACTTCACCAATAAATCGCTCAACACCTTCCTGATCGACCTGTAAGGCTTCGAC
TTCATCACTACTTCTGAAAGCACCGAGTGGATCAACATCAGCAGAGTAAGGGAGAAGAAGCTCTCCATGAGCATCTGCAACCGGGACCACTTCACCGCTGATTTCAATGTTTGGATAAGATTCTGTTTCAGTATCCAGAATGGCGCGCCACAGTTTCACTGTTGGGATTTCCAGCACTTCTCTTAAAGAAGGTTCAAGCGTCTGCTTATCAAGCCTTGCAAAGGTATCTTTTAGCTGAAGCGTAAGTGGCTCTTGTACCTGAACATCTTCAGTTGCGGCAATCAACTCGATTGCCCGCGCAAAAGATTCTTCTTCATTAAGCAATACTGTCAACGCCGACAGATCCTGCGGACTGCCGGGAATAATCTTGATGCCAGTATCAATTTCGAACTGGCTTTCATCAATATCTTGCTTACGAATTGTGACGCGAGCCCGTGGCCTGAAGCCATGAACCAGCGTCTTCTGATCTTTGTTGAACACCGCAGTAAAACTGCCACCAATCCCAGAGAAAGTCACATTTACTTCAGCTGGCGCTTTAGGATTTGACTTAACTTTGACATACAGATGCCCGTTATCAGGCAGAATAGAGATTATTTCATCGGCATTTCCTGCAGTAATCTCTATCAGGTCTTGTTCTGGCACCAGGTCATTACTATCTATCGCTTTTTTAAAGCGGCCTAAATCCTTAAACCCAAAAACCGGATCTTCCAGCTCTGCTCGGATAGCATTCGCGATTGTCGGATAAATGTCTGATTCCAGCCCCCATGACATGCCTAGCAACTCGCAGGACATCTTCATCACTGCATAGTTGTCACGTTCAAAAGAAGTACAATTATCAATGTATTCAGGGCTGTAACTGTGATTTTTAGGTTCGTCACCGGACGGTGAAAAATCGGGGATATCGATGAGAAAAAGCAAACGGCTCTGTGTCTCAAATATCACATTGCCAGGGTGGATGTCCCCGTGAGAAACACCCAGACCATGTAAGTGCTCAACGGCGGCGACAAACTTACCAATGAGGTCGATTTTTTCATCATCGGGGACCGCTATTTTATCCCAGGTTTCTCCCTGTACCTGATCCGTGACCATATATAGGCTTGATGATTTTGATGCGATACCGAATTCACGAATTTGAGGTAGATACGTTGTTTTAACTGAAGAAAGCCGCTCAACCTGCTTTAAAAACTTCAGAACCTGGAAATTAATTGACGGATCGTATCCCTGTCCCCCAACATTCAGCCAAGCCTTCACGAGCCTTCCTTTCGAGATATAGACTTCTTTGTCGACTGTTTCGACCTGAAACTGAAAGCCATCGTCTTCCGGGTATTGACGAGCGTGATTGATAGCATGTCGGTAAGGGTCAAGCTCAGTATCATCAAATGTTGGAATATCCTTGCCAGCAGGTTCGGCTTGTTTCAGGGCATCAAAGAATTCTGTTGCAGAAGTGAATTTTGCGGCAACGGCATCCCGTAATACAGATGAATACCAGTGCTGGCTGTTTAGCATGTTGTCCTGTACTTTCTCCAGACTCTTTGGCGACATGCGCATACCGCTAAATAAGTGCCAGGCGACAAGGCCCAGAGTATGAACATCTTGCTGAAAAGGCGTAAGCTCGCCCTTATCAAGCATGTCTTTTACGTGGACTGCACCTACGGATAAAAGCTTACGGTAATCGCCAACCGTTCCGGCTGGCTGGTGGTAAGCCGAAATAAAGTTCGAAAGAGCAACTTCTTTTGAAGGTGAAATCCACAGACTGTGATCGGCGACATCCCTGTGCGCAATTTTCATCTCATGGAGATCACTAAACTTTGCAATGAGCAGTTTCACCACATTCAAGCGGTCCATATCAGAAAAGTTCTTACCATATTTTCCGATAAACTCATTAAACCGGACATGACCCGGCGGAACTTCGTAGACTTCGCTATACTCGGCCGTCACCTCGTCTTTCTGAAAACTCGTCAAAGACCTGAGACAGTGGTTATACAGGTCACGATTCTGGT
GATTGATGTGCTGTAACACTTCGCGCTCACGTGAAACAATCTGAGCTCGTCCTTCCGGGGTATTCGCTTTCGTTCCCGTTATATTTCTAAAATTCCATACTCTGAGTAGCGCTTCGCTGTTCGTTGATATCTCAGATTTTGCCAGATATTCTCGATACACCTTTTTAGGGTGTTCGAAAATCATATCATTAGCTTCATAGCCATTAACTCTCAAGGCTTTTGGTGCCGTCTGAGGCCCTAGGAACAAATCATCGAAAAGATGGAAATCCTTGTTAAGAACCTTGGTAGCTGGGTGCGGTTTGAAATAGTTATTGAAGCTCCCGCGATCGGCAAACTTCAGGAAATCTTTTAACGAGATCGTATGGCGCCGTTGTTCCTCCGGCAGCGCGCTGAAATCTGCATTGCCGGTCATCACAACAAAAAAATGAACAATCGGGATATAGCCTTTGTTCGTAAAACGGTCTACCAGACGCTTGAGTTTTTTGTCCAGCATGAATTTTTTGCTACGAGTCACGCTTACTGGCGAGCGCCCCATGTTCTTATCGCCTTTAAACCAAGTATCTCCGCGTGCGGTTACAGGCTGATGGTTCCAGTCTTTCAGTTCAACAATGATCACGTTGCAGTGTGTTACAATAACTAAATCGAATTCCCCCTCTTTTTTTGCCTCAACAAATCGAAAACCTGCATAACCTTTCCAAGGGAACATTTCATTGCCGATAAAACCATAGCTTTTTAGCTGCTCACTAATTGAACCGCTACGGAAAGGCTTGTCAGGCTTAGATACGTTGACTGAAAAAGCAGCTTTTATTTTCTCGATAGCCAATACTTCCTGTTCTTGTAAGCCACCATCCCACATTTCTACTTCCAACGGTGTTCTCCTTAAATTTCAGTAAAAATAAGGTCATTTATTCTGGTCAATATGGCAGGTTTCGATGTTTATAGTTCTGAATACTATGGTGTTTTACACGTTCATTCTGGTTATTATACGTAGAAAAGAATTTATCGGGCAACGCCGAGATGACGCGGCCCCGTTAAACCGTGTAGTGGTAGATGATGAGATCAGATACAGGGATCAACTGACGGTGAGCGCAGTCCCCACATTTTATCCTGAGTTTACCGCTATTCCCGGCTGCCATTCGTTAGCACAGGCCCGCAAGTACCCTGATTTGCCGCTGGTTTTGCTTTCCCATCGAAGGGCCCATACATCATCACGCCCGCGAAACAGTCGACAAAATAACGCAACTTTCTCATCCGTGGATAATACGGAAACGCACTGCACAGGACTCTGCGGTTTACGTCACCATTCAATCCCATAAGCTTCAAGTTATGTTTGCCCTCAGTGCAGCTAATTCATCACTGTCAGATTTATGAACCATATTCGTTACCTGTATCCGATCCACTGAGGTACCACGGCAGAAATTCGGGTCCCCTACACTGATGACAAAGCGCCCCGCCCTCTTCTGCCTATTAACTCAAGCACCTCCCAACAACTTACAGAAACTTAATTCTAATACCTAATACAACTGTAATTCAGTTATGTCGTGGTCGGCCCCGGTTGCTTTTTTCGGGAAGCCTGCTACGGCCCAGAATAATACCGTGTTCTTTCTGATCCTGTTTAGCCAGGTAGCTAAGGGCGTAACGTAAACCGTCTACCGCAGACTTATCACTATAGTGAATCACGTGATCGATACGCACCGGGTATTTGTCTTTAGTACTGCACAGGTGAAAATAACCCTCCCCTTCCGTGATCCGACTCCAGATATCTCCCAGTTGCCTGGATATCCGATAAAATTTCTTGTGACGTTGCCCATCCAGGTAGCCGATAAAATGAATATGAAGCCCTTTGTTTGGTGTATATTCCATAACCCAGTAATATCCCGCCAACATCGTCTGGGTTTCGCTAAGTAAACGGTATATTTCCATACACATACTGTGCTTACAGGAATGCCCGAAACTGGGCGTGTCTTTCCTGTAGGCAAAATCAATTCTGAAAGGTAACA
GTTTAGAAAAACGTTGAAACATACCATCCATGTGTTCATTCACGTCTTTCAGAATCATGAAGTCCATTTCGTAATTGGGATTAGCATTATACATTTTATAGTCCTTACTTTATAAAAGTTACAGGAAGGTGGAATTACATAAATACTGAGAGTACAGAAACGCCGTACCGCGGCATTAATAAACGAGGTAACCAGCAGCACAGACAACCTGATAAAACTGTTTCGTATTACCTCATTACTCCAAAGAGGTAATATTTCAGAGTACAGTAAGACTCACTTAAACTGAGGACCAGTACGAATTACAATGGTAAAAACACACCTCATAGAATTGAGTTTAAGGAAGTACCATTCAATCACTTACAGGAAGTAATCTGACATTCACAAAAGCCAGTATCGGGTAGCCACAGGCAACACCGCTATTAATACTTCTGTACCATTAAGTAAAAACAGCGACCCAGCTTAAGGCCTCTACAGATATTTAACTTCACCATAAAAATGATGGTATACAGACCATACCCCACACACCATAAGAATATTATTTCATACAGGAAAGATTATATTTTTTCATACCTATAAACTGTAACAGAGCCATAACATACTCTAATAGTGTTCCGGTACGTAACCAGTGTTAAGTGCGTACAAGACAACATAACAGATCACATAACTTCCACTAATAATACTAATTAACCTAAATTATTTATTGAACACGAAAACAATGAAGGCCCCACCTCACCTCAGACTAAAGCCGACAATATCCTGAGCAACACTCTATCCAATAGTAAATAGTGGTTACACCAGTTATATTTACTGACTGACCTGTACCAGAGTTTTCTAATACAACGCTGACATGGTAAAGACATTAGTCTCCTCCGCAAGTAGCAGAGGAGACTATATGTACTCAACGATTAAGTCGTTCAGAGGTCAAAGTTAGGTGCGACACACAACGTTTTTCCCTCAATAACCGGTACAACTCTGTTTTGTTCATACAGTAAATCCAGTAACCAGTTGATTTTATCCTTTTTCCGGAAACGGTTCGGACCACGCTGTAAAATATCATTTTTTTTCATACAGAGGATCCCCTTCTCAATACAATAGCTTTTTATCCAGTTGAAAAGTTCAAGCTCCTCTGGAATGAGTCGCACAGGTACGGTCAGGGCAGAGTTGTCAAAAGTTAACGGATTAGACAACCGCACATACTCATTACCGTACCATATTGCTAATTCTCTCGCCATTTCTGCAGTGTAAGGGGAAATCTCCCCCTCTTCACCGCTCGAATGGTAAATAAGTCCTGCCAGTCTTGCCATATACTCTGCATTTTTAGCAGCATATTCCCGGCAATGTCTTAAAGGCCCCAATCCCCCCAGCTTCGATTCCACATCATTGTAATAATCCGTCCAGATTCTGGCAGCCTGAGGAGAAAAGTGAAGGCAACGTCGTTCACCACTCATCGCCAGACTTTCATCAATAAGCTCATTGATCCTCTCTTCAAACAAATCCTGATACTGTGATGAATAATTATCTCCGGTTATTATCCTTGTCCCCTGCGTTGATGTTGGTTGACACATCAAAAACCTTGCATGATGTCCTGACGTTTTCACAATTTCTTTTTTTCGCGTACAAAAACCTTTGTGGTAAACATCAGGCTGAATCATCACCGATATCGTCAGTCTTGGCTCCTTCAAATTAATTCCGGGAGATGATTTCCTGTCGATGAAAAGAGAACCTCCATCCCACAAAGTGTTAATAATTCCCAGTTTACTCATGGCCCGGCTGTCAAAAATTACCCCCCCTTCACTGGATACAAGAGCAAAAGAGCGATTGCTATCGGAGTAATATTTTAACATTCCCTCTATCGTTGTCTCATTAAAAATTGTTCGACGTATCTGCGGCGGAACAGGAGGTTTATTCAGATGCGTTTCAAGCTCTGATTCTGTTGCCTTGTAATCTTTACCGGCACGAATCTCTTTATGAAATTTTGATTCCAGCGCTTTTTGTTTTTG
CTCCCATATTTCCTTTTCTGTACTGTAATTCTCAACCAGTTTCGCGTATTCATCCGCCAGGGCTTCATCCCTGAGATAAAATGCTTTCATAAACACTTTATCCACGGTCGTTTTCCTTTCACCGGAATCAGCCAGAATCAGAGAGTAAAGATTAACAGGCCCATGTAAATTTCCAGGTCTGCACACGTCAATCTGATTCTGACAGGCAATTGAGATCGCTGTTAATGCGGATGTTGCCACCATAGCCAAAGGTGCCTGTGTATTTTTTTGAGTTTCAATTATTGCATTTCTCACCAGCGGTGGTAGTGCATATATCGGATAAGGATTTTCTGGTGCAAGTAAGCACATAAAAGCCTCTCTTTTCATTAATGGGTTAGTAAAATAGCAGCAAATCCCGCTACAATAGCGAAAATAGCTGCTATTCAGACCTGATGCTCTGAGAAATACAGAGCATTTGTAGATCTGATGTTTAATAAACCGATAAGGTTAATTCAGTGAAGACTGTATTATTCACCACGAAATATCATTCATATTTCTCATTCTTAATACCTCCGGAAATTATTTCACTAATACCTTTCCCCCGGTCTTTTCCCTTTCTCCAGAACCATAGTGTAAATATTAACGGCCCCACGTAAATTTCTGGGACTGCATACGTCAACCTGATTCTGACAGGCAATTGACATCGCTGTTAATACTGACATTGCAACCAGAGTCAGAGGTACTTGTTTATTGTATTGAGCTTTAATTATTGCCTTCCCTATCATTGATGGTCATGCATATACTGAATACGGCATATCCGGTTTGATGTAAGGCATAAATGCTTCCTTTTTCGTAATACGTTTCGGTTATAGCAGCAATTACCGCTACAATAGCGACAATAGCGACAATAGCGACAATAGCTGCTCTTTCAACAATGAATGCTGGAAATTCTGTTTATCAATTAAACTTCTGGCTAAATTGTTCTTATGCTTCTTTTTCCTCCTGAGGTGATCACGTGATGACAAAAACTTATCTGCTCATGGTCATATCTGAAGAAAGCGTATGTCTCACTTCGTCATTAACACCTTTGACTGCTGATCTGGCAGGTACAGTATGTAAAGTAAGCCATTCGTCAATAGCTGATGAACGCCAGCCCACAGAACCACTACCAAGACGTACCGGCCTGGGGAAGGTTGCATCGTAGTATTTCGATAACGGATTCATTTTTTCGTAAATTGTCGAACGAGATATACCTAACAATTGGCTCAGTTCAGGCATCCGTAAGATGCGTGAAGGAGTCCTGCAGTGTCCGGATTGATCTGGCATTTTATCCCCCATAGTGTGGTGAACTGTCGTTACAGTTTGTATTCCCCGGGTAAATTTGTCTGCGTAAAAAAATGGGTTGAGAAAAAAGACAATAAATAATTAAAAATCAATATGATAGAAAACGACAGCATTAGTGTTATTTTACTTTTCCCTTATCGCTGTCGATTCTCATTATCATTCATGGCAAAAAAACAGCAAAAGAGGCCCTGGTAATGACTCCTGTCAGTAGTACATGTGTTTTCGATAAATGAACCAATGGCTTTTCGTGAATTTCCAGGGAGCGTAAAAAGCAAATCGTTTTCCGGGCAGGTCGTTTGATGTGAGTACGAAGTGTCAGATTGTTGCGCTCAATGCGCCGGGTAAATATCTTGCCGACAAGATGCTTATCCTGTGGCATTTCTCTGGTATAACTGCTCCGGTTGTCTCTGGTTATCATGCCTGCGGAGAATGGCTTCAGGAATTCCGGCAACTCACGGCATGTTTCATCAGTGCGAGGACCAAAAGTGTAAGCCAGCACACCGTCAGCTTTGGTCTTATACGCGTACCAGTGCCACTGCTGACGAGCTTTGTTTTCGACAAAACTCCATTGTTCATCAAGTTCACAGATAAGTGCCACATCGACACTGGCGGCTGGAGATATGGTTACTTTACGTGGTGAGCGTTTTTTAAAGTGAGAATGACGGTGTTGATATCCA
CCTTCAGCGTTCTGGAGCTATCGCGAACCCCGGCTCCGTTATGAACCATTTCAACAATTTGCTCTTTAATGCCTGGTTTTCGGGTTTCGTAGATGTAATTCAGTTGAAATACACGTTTGCAGGATTAGCACTGGAAACGCTCATGATCTGAGGTGCTGTGATCATAACGGTACACTTTATAGGAATTACAGCGGGGAGAATATACTGTAACTGTTGCCATATGGTCTCCAGATACCAATAGAATACAACATTAATCTATCGTCAGAAGGCATCACCGGGGCCTTCTGACATAATCTGTTAATACGTGTACTATTACCCTGTTCAGAAAATATTGATTCAAAAATGAAAACCAGTTAACAGAAAAGCAAAATGATATAATGTTAAAATTTTATATAGTGCAATAAAAGGAGAATGTTATGTATAATTTTATCACTATAATGTATGATGTCTTTTCATGTTTTGGTGTTCTGGCTAAAAACCAGAATAGCCGTGACATCCGAAATATTAAAAATTTTTCCTCACATCAACATTCACTGGGCGACATGTTTGATGAATTAATAAACATTATTGATAAAGAACAAGTATTGAGTAAAGAACAACGAAAAGTTATATTTAGGAGATATGAAGATCTCTATGTTAAGCTAATGCACTATTCTGTTTTTACAGACAAAACACATCAAATAATAAAACAAAAATATTTTAATGACATTGTACCAATGATTCTCGCACTCGACATCAGGAACACATATCGCCCGGATAATGAGATGGCATTTTACTATCATATTCATTCTTTTCTCACTCAGATACCGGATAATGAGGATGATATATATCATGCTGCAAGGACATATCTGCGAAATTACGTTAAGTTATGTTTATCCGGATACACGCCAGCGAATGCGCATTTCAAAGATATCTTTGATGGCGTATATGAATTCATTCGTAATATTCGCAAAAACAGTACACCAGGAAAAACAAAACTTATCGCAACTATCAACACATGCAAAGAAACCTGTAAACATCTGCTTTATTTAAGTAATGAAGACAAGGAAAAAATAATTTCTGACTTAGATAAAGTTCAGGTTGCATGTTATTATCTCACTATATTACTGGCTTTCGAAAGACGAACTTCATTAACAAGCACCCTGACAACTTTATATAAAATGCTGATAAGCGAAAGAGAAGTTTCAGAATATGAATGCCAGTTATTATATTTAACCAACCCAATAGATGTAATGAATATACTGAACAAATACATATATTACTTTCCTAATGAGAACTCACCATTTTATACACTGAAAATTGACAGTGCATTATCGTGGGATGCCATTGACGCAATACGAGACTATAGTATTTCTGATATTTATCTTTATCCTGAACAAAAAACAATAAATTGTGTCGTTGAGATTGAAAACATTGTCTTTGGCGGTTACATTTATACATTGAACAACGGCGTCACATTACAAAACATAGAAAACTCTTTAAAAGATTCTTCATGCCATTATGTCTTAAATGGCTATACAGAATTTGTTAACTGTTTGAGACAACTTACTTCAGGAAAGACTGAAAGTGTTCATCGCACCATCAATAAACTGAACTATGAGAAATTACCTTTTGGATTTATCATTGCCGCGTTTGCTATACTAAAGATAGCATTTAAAATAAAATTCAGTAAAAATCATGTAAATATCCGAGCATTATTAAATGACATCAATTATTTTATGACTTATCAGGGCGAGTCCATTAACCTTATTTCACTGGATCACGAATACCCAGAGTCCTGTCTTCAAAATGACACAAACACATATTTATTAGGAAGAGTAATATTTCTGTATAACTCAATGATTTATAAGTTCATAAACTGTCAGGAACATGAAACCAATAACATTCACTCAGCTATGATAAATAACCTATTACAGGAAGTTGATATAGCCCTTGGTAAAATAAATGACATTATAGACAGCAGAAACATATCAGCCCCCCATGAACTGGCAAATATTCTT
ACCCGCGAAAAAATACTTACAACACGGGAAAAAAAAGGAAACCTGATAAGCCTGTTTGATGGATTCACTTTATTCCATTGTGTTGGAATGATAACCTTTCTTATCCATTATCTCAGAACACCTGAAGAAAAAGTTGAAAATATATTTATGTTATATGGTGCAGATAAAAACAATAAACTACGCAGAAGACTGATTTATGACGCACTAGGAATAATTCAGTCTCAGCAGGAGTGAAGAGTTAAACAGGCAAATTATTTTTTATAAAAAGGGGATTGTTAAGTAATCCCCTGTTAATCAAAATACGCTTTCCTGACGATCTGATAATTATCAGCTCAATACATTTGACATAAAAGCGGTTTCTTAATTACTACTGCTACGATCACATATCAAATCATGATCGTACAATATACATTTTCACTATTATTTATTCACAATTTGAGACATCACATAATCCGCCCATGCCTGCATCATCGGTACACGCTGCTCCAGTAGATCAGTACGATGATATGCCGCCTCAACCTTATTTTTCAGCGTATGAGCGAGCGCCCTTTCCGCCAGATCCCGCGAATACCCCTGTTCGCTACACCAGTCCCTGAATGTTGAGCGAAAACCATGTGCCGTGGCAACTCGCCCCGGAATGTCACTGACGGCTTTCTTTTTACGTAGAAAACTTGTCAACACCATATCGGAAAGGATCTGCTGCTTTCTGGGTGAAGGGAACACCAGTTCATCATGCAGGCCACGTATATTTTCCAGAATGTAAATAGCCTGCCGGGATAAAGGAACACGATGCTGTAGCCTAGCTTTCATTCTTTCTGCAGGTATAGTCCATACCCGCTTATGAAAATCAATTTCAGCCCAGCGCATTCCCCTGGCTTCGCCCGAGCGAGTTGCTGTAAGTATCACCATTAATAACAGTGCGCGGGTAACATTATAAGGTTCATCGGTATACACACTGGTCGCCACAAAAAGCGGTAACTGCCTCCAGGGCATTGCGGGTTGGTGTTCATCACGTCCTCTTGTCTGCTGAGGAAGCAAATGGTCAACCACATCAACAGGATTTGCTACACAAAAACCGTGCGCCCATCCCCACTGCATAACAACATGAATGCGCTGTTTAACCCGGCTTGCCGTTTCTGACAAGGTTAACCAGACTGGACGCAGTGTTTCTGCCACATCCGCAGCCGTAATCGAATCCAGCGTTTTTGCTCCCAGTTGAGGAAACGCGTAATTCTCAAGCGTCGATAACCACTGCCTTACATGCTTTGGATTTTCCCATCCAGGAGACAGTTCTGCATGTACACGCCTGGCTGCATCGGCAAATGTTGGGATAGCGACTTTCTCAGATTCAGCCTTTTTAATCTCCAGAGGATCATCACCTGCAGCAAGTTGCTCTCGCATTATCCGGGCAGTACGTGCAGCTTCAGCAATACTGACCTCTGGGTAAGTTCCCAATCCAGCATTACGTCTTTTTTGTGTCACCGGACTTACATAACGAAAAACCCATTTCCCCCGCCCCTTTACTGAAGAAGGATGAAGGGTCAGTCCGGTAATTCCCCCATGGGGCAATGGTTTGTCATCAGGTTTGATATGTCTTGCTTTCGTATCCGTCAATACTGCCATACGCTAATTCCCGTCACTCTGGTATGCCATCCAGTATGCCATTAGTCTCTCGCTTCACTAAGAATACATTAGATAATATCGGACAATAAAAAGTGTAATACATTGTTTATAATGAAATTATATAACACCCCTGGATTAAGTCAGATTTATTTCAGGCGGTCCCCCTCACCGCCATATTTAAAGAAGAGCCCGTACGAAAGTACGGGCTTTTTTTTCGTATATTGCACACACCGGGTGCCCCCGCCCACGTCTGTTAGGGCGAGGGAAAATTGTGCACATCAACTGTCTGGTTACTCAGTCAACAATCAAAATGCGCAAGATAGTAGAGAACATATATGCGTTATCTGGTCACTTCCTGTAACTTCCTCACTTATTAA
AATGCCCCCAACGGAAATACATGTCACATGCCATCATAAAATCGTTATGCCAGAGGCATCGACAATGGAAAAAATAGTAAATTAAGCACGGTACATAGCAGAGTAAAAAATAGGCTTCAAACCAAATCCGCCATTCCACAAGCGTTGCCAATACCGACACCAGCGTAATCCACATCATTTCAATAACCAGCAATATGAATAACCGAAGCCAAAGCGGCTTGGCAAAAAAGGTCGACAACGCATTTTCACTGGAGAAAAACAGATCGATTTCCTGCAAAAAAAGCCTGCCCAGAAAAGCGTACCGCTTGGAGAAGCGCAGAAAAAACAGATTAATCGCCAACACTACGCCACAAAGAATGAGCAATGCAATTATCGATTTAAACGTTAACCACTGCCCCTCTGTCACAAAAGAATCACTGTTACGTAACGGGCTGATAGTAAGCAGGTTAACAACCGTTGAGATGCACACCGAAAGCAGCACATAGCGAGTCAGACTTTTATATTTTACTTTTTTCGCTACTGAAATAGCGACCTGCTTGACATAACTGGTGCGGATCATCCGAAACAGCAGAAAGATGACCAACGGCCCAAGTAACGAAATCAGGAACAGCGCATATAACGAGGGTAGAAGCTGGTATAGCCCCCAACTCATTGCACTCAACATGATTAACGACAGAAATCCTGCCACCAGAGGTTGTGTAAGAAATAATTCTATTTTGTGGTTCTGCTTTTTCTTAAACCACGAAAAATCGATATTCTTCGTGACGCATTTATAAGCCCAGTACGCCTTGAGATCAAAAATAAAATTATACAGCACCAGGGCCAGAATCCCACTAACGACACCAATATCCTCTGATGGAAAAGTAAATCCTGCATTTTTCCAATAAAAGCCTATCAGAATAAGTCCAATAGCATAAATGTATATCAGATTTCGTTTATCCTGAAAGCGAATAAAAGTTAAATAATCAGGAATCATTTGAATACCTTTGCATCACTAAATGCTTTACGACAATGCATAACTCGTTGCTGATCCACGCCAACAAAATCATTTAACATATTCCCCAAATAGGGCTTATTACTTATTTCAAGGAAAGCTACCGGGGTTGGCATATTCAGAAATGTATCGAGAGATTCAGTCCATTTCACCGTATGGGTTAAGTGTAATGAGAGATTTTCTTTGATAATTGACGGAGCGACCTCGCTTTTTGCTGTTACGTTCATCAGAACCTGATGCTCTGGCGAAGCGATATCCAGTCCGGCAAGATAATCGCGCATTGCCTGAACGCCGTCTTCCATCAAACGCGTGTGCCAGGCACCGCTCACACCGAGTTTAACCGGTTCGTATCCAGCGGCCATCAGCAGCGTGGCAAATTCATTCAACGAGGCCTGCGTTCCCCCAATAACCTGCTGGCGCGGCGTGTTATCACAGCTAATATCCAGCGCAATGCCTGATTCCGTAATCATCGTCTGCAGTTGTTCGCGGTTGATGCCTTTAACCGCCTGATCTTACCCAGCAATAGTGGACACGCGGCTAAGTGAGTAAACTCTCAGTCAGAGGTGACTCACATGACAAAAACAGTATCAACCAGTAAAAAACCCCGTAAACAGCATTCGCCTGAATTTCGCAGTGAAGCCCTGAAGCTTGCTGAACGCATCGGTGTTACTGCCGCAGCCCGTGAACTCAGCCTGTATGAATCACAACTCTACAACTGGCGCAGTAAACAGCAAAATCAGCAGACGTCTTCTGAACGTGAACTGGAGATGTCTACCGAGATTGCACGTCTCAAACGCCAGCTGGCAGAACGGGATGAAGAGCTGGCTATCCTCCAAAAGGCCGCGACATACTTCGCGAAGCGCCTGAAATGAAGTATGTCTTTATTGAAAAACATCAGGCTGAGTTCAGCATCAAAGCAATGTGCCGCGTGCTCCGGGTGGCCCGCAGCGGCTGGTATACGTGGTGTCAGCGGCGGACAAGGATAAGCACGCGTCAGCAGTTCCGC
CAACACTGCGACAGCGTTGTCCTCGCGGCTTTTACCCGGTCAAAACAGCGTTACGGTGCCCCACGCCTGACGGATGAACTGCGTGCTCAGGGTTACCCCTTTAACGTAAAAACCGTGGCGGCAAGCCTGCGCCGTCAGGGACTGAGGGCAAAGGCCTCCCGGAAGTTCAGCCCGGTCAGCTACCGCGCACACGGCCTGCCTGTGTCAGAAAATCTGTTGGAGCAGGATTTTTACGCCAGTGGCCCGAACCAGAAGTGGGCAGGAGACATCACGTACTTACGTACAGATGAAGGCTGGCTGTATCTGGCAGTGGTCATTGACCTGTGGTCACGTGCCGTTATTGGCTGGTCAATGTCGCCACGCATGACGGCGCAACTGGCCTGCGATGCCCTGCAGATGGCGCTGTGGCGGCGTAAGAGGCCCCGGAACGTTATCGTTCACACGGACCGTGGAGGCCAGTACTGTTCAGCAGATTATCAGGCGCAACTGAAGCGGCATAATCTGCGTGGAAGTATGAGCGCAAAAGGTTGCTGCTACGATAATGCCTGCGTGGAAAGCTTCTTTCATTCGCTGAAAGTGGAATGTATCCATGGAGAACACTTTATCAGCCGGGAAATAATGCGGGCAACGGTGTTTAATTATATCGAATGTGATTACAATCGGTGGCGGCGGCACAGTTGGTGTGGCGGCCTCAGTCCGGAACAATTTGAAAACAAGAACCTCGCTTA
Protein sequences of DBSCAN-SWA_3 >CP033401|1610136:1654789|1626811_1628040_-|AYQ01488.1|transposase|DBSCAN-SWA MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEAVEYGRGKKVDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVWTLLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFRCDNGEKLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNELPASPVEWLTDNGSCYRANETRQFARMLGLEPKSTAVRSPESNGIAESFVKTIKRDYISVMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQQASNGLSDNRRLEI >CP033401|1610136:1654789|1631384_1632956_+|AYQ01493.1|DBSCAN-SWA MTQKKSFKSKLWEFLQSLGKTFMFPVSLLAFMGLLLGIGSSVTSPSTITSFPFLGGEFTQLTFGFIAMVGGFAFTYLPLMFAMAIPMGLAKRNKAVAAFAGFVGYMLMNMSINYYLTATHQLADPATMKQVGQSIVLGIQTLEMGVLGGIVVGVITYFLHDRFQDTVLHDAFAFFSGIRFVPIITALTLSLVGLFIPMLWEYVALGIAGIGHIIQSTSVFGPFLYGVGVLLLKPFGLHHILLAMVRFTPAGGIEMVNGHEVAGALNIFYAELKAGLPFSPHVTAFLSQGFMPTFIFGLPAVAYAIYRTARPENRPVIKGLLLSGVLVSVVTGISEPIEFLFLFIAPALYAFHIVMSGLALMVMALLGVTIGNTDGGILDLLIFGVMQGMSTKWYLLFPVGIAWFAIYFFVFRWYILKHNIKTPGREVDVQGAQQAVEANTRARGKSKYDHELILRALGGKENIESLDNCITRLRLVVKDMGLIDQQALKAAGALSVVMLDAHSVQVIIGPQVQSVKTGIEALI >CP033401|1610136:1654789|1611323_1612469_-|AYQ01474.1|DBSCAN-SWA MMKKSLCCALLLTASFSTFAAAKTEQQIADIVNRTITPLMQEQAIPGMAVAVIYQGKPYYFTWGKADIANNHPVTQQTLFELGSVSKTFNGVLGGDAIARGEIKLSDPVTKYWPELTGKQWQGIRLLHLATYTAGGLPLQIPDDVRDKAALLHFYQNWQPQWTPGAKRLYANSSIGLFGALAVKPSGMSYEEAMTRRVLQPLKLAHTWITVPQNEQKDYAWGYREGKPVHVSPGQLDAEAYGVKSSVIDMARWVQANMDASHVQEKTLQQGIALAQSRYWRIGDMYQGLGWEMLNWPLKADSIINGSDSKVALAALPAVEVNPPAPAVKASWVHKTGSTGGFGSYVAFVPEKNLGIVMLANKSYPNPVRVEAAWRILEKLQ >CP033401|1610136:1654789|1634156_1636046_+|AYQ01495.1|DBSCAN-SWA 
MKKVLTLSLLALCVSHGAAAANYALNNDNIALLFDDTNSTVVVKDNKANHPLTPQELFFLTLPDESKIHTADFKIKHVEKQDNAIVIDFTHPDFNVTVKLNLVKGKYANIGYTIAAVGQPRDVAKITFFPTQKQSQAPYVDGAINSSPIVADSFFILPDKPIVNTYAYEATTNLNVELKTPIQPEAPVSFTTWFGTFPETSQLRRSVNQFINDVRPRPYKPYLHYNSWMDIGFFTPYTEQDVLGRMDEWNKEFITGRGVALDAFLLDDGWDDLTGRWLFGTAFRNGFSKVREKADSLHSSVGLWLSPWGGYNKPRDVRVSHAKEYGFETVDGKLALSGANYFKNFNERIIKLIKNEHITSFKLDGMGNASSHIKGSSFASDFDASIALLHNMRSATPNLFINLTTGTDASPSWLFYADSIWRQGDDINLYGSGTPVQQWMTYRDAETYRSIVRKGPLFPLNSLMYHGIVSAENAYYGLEKVQTDSDFADQVWSYFATGTQLQELYITPSMLNKVKWDTLAKAAKWSKENASVLVDTHWIGGDPTALAVYGWASWSKDKAILGLRNPSDKPQAYYLDLAKDFEIPTGDVAQFSLKAVYGSNKTVPVEYKNATVITLQPLETLVFEAVPVN >CP033401|1610136:1654789|1643591_1644155_-|AYQ01497.1|DBSCAN-SWA MYNANPNYEMDFMILKDVNEHMDGMFQRFSKLLPFRIDFAYRKDTPSFGHSCKHSMCMEIYRLLSETQTMLAGYYWVMEYTPNKGLHIHFIGYLDGQRHKKFYRISRQLGDIWSRITEGEGYFHLCSTKDKYPVRIDHVIHYSDKSAVDGLRYALSYLAKQDQKEHGIILGRSRLPEKSNRGRPRHN >CP033401|1610136:1654789|1630932_1631271_+|AYQ04275.1|DBSCAN-SWA MKHLTEMVRQHKAGKTNGIYAVCFARFVLHGASDVPDEYVRRTIGPGVCKVNVATELKIAFSDAIKAWFAENQQSNDPRFYMRVGMDAMKEVVRSKIAVCGSANRLRLPAEA >CP033401|1610136:1654789|1622746_1624063_-|AYQ01484.1|DBSCAN-SWA MNDKNIIQMPDGYLNKTPLFQFILLSCLFPLWGCAAALNDILITQFKSVFSLSNFASALVQSAFYGGYFLIAIPASLVIKKTSYKVAILIGLTLYIGGCTLFFPASHMATYTMFLAAIFAIAIGLSFLETAANTYSSMIGPKAYATLRLNISQTFYPIGAASGILLGKYLVFSEGESLEKQMSGMNAEQIHNFKVLMLENTLEPYKYMIMILVVVMVLFLLTRFPTCKVAQTSHHKRPSAMDTLRYLAKNSRFRRGIVAQFLYVGMQVAVWSFTIRLALELGDINERDASNFMVYSFACFFIGKFIANILMTRFNPEKVLILYSVIGALFLAYVALAPSFSAVYVAVLVSVLFGPCWATIYAGTLDTVDNEHTEMAGAVIVMAIVGAAVVPAIQGYIADMFHSLQLSFLVSMLCFVYVGVYFWRESKVRNALAEVTES >CP033401|1610136:1654789|1629359_1629797_+|AYQ01491.1|DBSCAN-SWA MVKKALGSIAKSGKTAIVEVLSPGQHPTKRELIYAATPARDFVCGTQQVASGITVQVFTTGRGTPYGLMEVPVIKMATRTGLANHWFDLMDINAGTIATGEETIEEVGWKLFHFILDVASGKKKTLSDQWGLHNQLAVFNPAPVA >CP033401|1610136:1654789|1653627_1654789_+|AYQ01505.1|transposase|DBSCAN-SWA 
MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWRSKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILPKGRDILREAPEMKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCDSVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAVVIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQYCSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISREIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENKNLA >CP033401|1610136:1654789|1632967_1634143_+|AYQ01494.1|DBSCAN-SWA MFDFDKIIERQNDKCRKWDHTFVCSRFGDVPESFIPLWIADMDFTSPPAVIDGFRRIVEHGTFGYTWCFDEFYDAVIAFQRKRHQVEVEKSWITLTYGTVSTLHYTVQAFCKPGDSVMMNTPVYDPFAMAAQRQGVQVLANPLRVEENRYQLDFNLIEEQLKTHRPTLWFFCSPHNPSGRIWREEEIRQVSDLCQRYGTILVVDEVHAEHILDGKFASCLTSGCAAQDNLIVLTSPNKAFNLGGLKTSYSMIPDDSLRQRFRQQLEKNSITSPNLFGVWGIILAYQHGLPWLDALNGYLQGNARYLADALQTHFPAWKMMNPESSYLAWIDVSADERSATQLTQHFARQAGVVIEDGSHYVQNGENYLRINFGTQRYWLEQSINRMLKNDK >CP033401|1610136:1654789|1653033_1653525_-|AYQ01504.1|DBSCAN-SWA MITESGIALDISCDNTPRQQVIGGTQASLNEFATLLMAAGYEPVKLGVSGAWHTRLMEDGVQAMRDYLAGLDIASPEHQVLMNVTAKSEVAPSIIKENLSLHLTHTVKWTESLDTFLNMPTPVAFLEISNKPYLGNMLNDFVGVDQQRVMHCRKAFSDAKVFK >CP033401|1610136:1654789|1620542_1620890_-|AYQ01482.1|DBSCAN-SWA MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWPSASDGKVFLTQAQLAMLLEGIDWRQPKRLLTSLTML >CP033401|1610136:1654789|1628642_1629278_+|AYQ01490.1|DBSCAN-SWA MVKSFAIGYTVRDVAKGSWIDESTVTLPKAPPLNTLPRATKVPEPQQPQEDYTFEGYRNADGSVGTKNLLGITTSVHCMADVEDYVVKIIERDLLPKYLSIDGVVDLNHLYGCGVAINVPAAVVPIRTIHNIALNPNFGGEVMVVGMQCGGSDAFSGVTTNPAVGYDSDLLVRCGATVMFSEVTEVRDAIHLLTPRAINEEVGRRLLEEMA >CP033401|1610136:1654789|1621721_1622735_-|AYQ01483.1|DBSCAN-SWA MSTRIYLWRALFGEKPRILLENSDFTVTSFRYDSGVEGLKIANSRGHLIILPWMGQMIWDAQFDGHSLTMCNMFRQPKPATEVIETYGCFAFHSGLLANGCPSAEDTHLLHGEMACAAMDEAWMELEGDMLRLTGRYEYVMGFGHHYLAQPTVVLHKSSTLFDIKMAVTNLASVDMPLQYMCHMNYAYIPNATFSQNIPDEILRLRESVPSHVNPTPQWLAFNQRIMQGESSLSTLNQPEFYDPEIVFFADKLDAYTDQPEFRMIAPDGTTFVTRFSSAELNYVTRWILYNGEQQVAAFALPATCRPEGYLAAQRNGTLIQVAPQQTRTFTVTTGIE 
>CP033401|1610136:1654789|1624090_1625011_-|AYQ01485.1|DBSCAN-SWA MDIAVIGSNMVDLITYTNQMPKEGETLEAPAFKIGCGGKGANQAVAAAKLNSKVLMLTKVGDDIFADNTIRNLESWGINTTYVEKVPCTSSGVAPIFVNANSSNSILIIKGANKFLSPEDIDRAAEDLKKCKLIVLQLEVQLETVYHAIEFGKKNGIEVLLNPAPALRELDMSYACKCDFFIPNETELEILTGMSVDTYDHIRLAARSLVDKGLNNIIVTMSEKGALWMTRDQEVHVPAFKVNAVDTSGAGDAFIGCFSHYYVQSGDVEAALKKAALFAAFSVTGKGTQSSYPSIEQFNEFLTLNE >CP033401|1610136:1654789|1637958_1642917_-|AYQ04277.1|DBSCAN-SWA MWDGGLQEQEVLAIEKIKAAFSVNVSKPDKPFRSGSISEQLKSYGFIGNEMFPWKGYAGFRFVEAKKEGEFDLVIVTHCNVIIVELKDWNHQPVTARGDTWFKGDKNMGRSPVSVTRSKKFMLDKKLKRLVDRFTNKGYIPIVHFFVVMTGNADFSALPEEQRRHTISLKDFLKFADRGSFNNYFKPHPATKVLNKDFHLFDDLFLGPQTAPKALRVNGYEANDMIFEHPKKVYREYLAKSEISTNSEALLRVWNFRNITGTKANTPEGRAQIVSREREVLQHINHQNRDLYNHCLRSLTSFQKDEVTAEYSEVYEVPPGHVRFNEFIGKYGKNFSDMDRLNVVKLLIAKFSDLHEMKIAHRDVADHSLWISPSKEVALSNFISAYHQPAGTVGDYRKLLSVGAVHVKDMLDKGELTPFQQDVHTLGLVAWHLFSGMRMSPKSLEKVQDNMLNSQHWYSSVLRDAVAAKFTSATEFFDALKQAEPAGKDIPTFDDTELDPYRHAINHARQYPEDDGFQFQVETVDKEVYISKGRLVKAWLNVGGQGYDPSINFQVLKFLKQVERLSSVKTTYLPQIREFGIASKSSSLYMVTDQVQGETWDKIAVPDDEKIDLIGKFVAAVEHLHGLGVSHGDIHPGNVIFETQSRLLFLIDIPDFSPSGDEPKNHSYSPEYIDNCTSFERDNYAVMKMSCELLGMSWGLESDIYPTIANAIRAELEDPVFGFKDLGRFKKAIDSNDLVPEQDLIEITAGNADEIISILPDNGHLYVKVKSNPKAPAEVNVTFSGIGGSFTAVFNKDQKTLVHGFRPRARVTIRKQDIDESQFEIDTGIKIIPGSPQDLSALTVLLNEEESFARAIELIAATEDVQVQEPLTLQLKDTFARLDKQTLEPSLREVLEIPTVKLWRAILDTETESYPNIEISGEVVPVADAHGELLLPYSADVDPLGAFRSSDEVEALQVDQEGVERFIGEVSLKKSELKEIRLVKVSSAAFKLKDSDIVFFRTRPTRASYQKRKRALERLLDRESVLPDLIDLFDPSCKQAAQNYGITLSDTDFARYDREDQHGNKISLNEQQRKAFNKLVNNGPLSLLQGPPGTGKTEFIAAFVHYLIEKQNTKRILLVSQSHEAVNTAAERIRKHCSRLGTELDVVRFSNREGAVSPGLKDVYSHAITTEKRELFNAEIKYRVEALSEAIGLEPGFISGVVLAELNLFRQIDHLEKLLYQVNNLTDSNESNELKDIAVELDFSIRSKLSQEYGINLDNGVKVSAAKDILISKLCTEYGVRPDEARRVKALAKISRDMQDAMSGERVNLDEFYSRSRQLVAGTCVGIGQGHIGIQENIYDWVIIDEAARSISSELAIAMQSARRVLLVGDHMQLPPLYSDAHKAALARKLGINNSRTEIDEVLRSDFARAFNSAYGAQTSAALMTQYRMAPPIGNLVSKTFYDGKLLNGVRAIPDVYQQAPEALRSVVTWLDTANQGHRAHHLEDRGTSIYNRCEADEIISVLKQVSENEEFVAKLSKLVSKDEAAIGVICMYAEQKRLLRQKFNQEIWSEGFKDIVKIDT
VDSYQGKENRIIILSLTRSDKQHSPGFLRVPNRINVAMSRAMDRLLIVGNADIWKGNNKELPLGYVVSYMAERGQEAGYRFLSAQQGGKKK >CP033401|1610136:1654789|1647069_1647366_-|AYQ01500.1|DBSCAN-SWA MPDQSGHCRTPSRILRMPELSQLLGISRSTIYEKMNPLSKYYDATFPRPVRLGSGSVGWRSSAIDEWLTLHTVPARSAVKGVNDEVRHTLSSDMTMSR >CP033401|1610136:1654789|1618954_1620493_-|AYQ01481.1|transposase|DBSCAN-SWA MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRMNFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRKPFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYCQSEIYGRQGVELSRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTPVQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTHLACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALTEEALEQIGQLYAIEADIRGIPAEQRLAERQRKTKPLLKSLESWLREKMKTLSRHSELAKAFAYALNQWSALTYYANDGWVEIDNNIAENALRAVSLGRKNFLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVSELLPWRIALPAE >CP033401|1610136:1654789|1612792_1614055_-|AYQ01475.1|transposase|DBSCAN-SWA MINKIDFKAKNLTSNAGLFLLLENAKSNGIFDFIENDLVFDNDSTNKIKMNHIKTMLCGHFIGIDKLERLKLLQNDPLVNEFDISVKEPETVSRFLGNFNFKTTQMFRDINFKVFKKLLTKSKLTSITIDIDSSVINVEGHQEGASKGYNPKKLGNRCYNIQFAFCDELKAYVTGFVRSGNTYTANGAAEMIKEIVANIKSDDLEILFRMDSGYFDEKIIETIESLGCKYLIKAKSYSTLTSQATNSSIVFVKGEEGRETTELYTKLVKWEKDRRFVVSRVLKPEKERAQLSLLEGSEYDYFFFVTNTTLLSEKVVIYYEKRGNAENYIKEAKYDMAVGHLLLKSFWANEAVFQMMMLSYNLFLLFKFDSLDSSEYRQQIKTFRLKYVFLAAKIIKTARYVIMKLSENYPYKGVYEKCLV >CP033401|1610136:1654789|1614320_1615649_-|AYQ01476.1|transposase|DBSCAN-SWA MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEEIRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >CP033401|1610136:1654789|1626326_1626527_-|AYQ04274.1|DBSCAN-SWA 
MIPWSGVTCRQEIRSMQGCEMNSNRLAAPVIFEDSSGCYPVCIKNPDVMDNITASTTRKCLFTIRV >CP033401|1610136:1654789|1646627_1646825_-|AYQ01499.1|DBSCAN-SWA MIGKAIIKAQYNKQVPLTLVAMSVLTAMSIACQNQVDVCSPRNLRGAVNIYTMVLEKGKRPGERY >CP033401|1610136:1654789|1648477_1650295_+|AYQ01501.1|DBSCAN-SWA MYNFITIMYDVFSCFGVLAKNQNSRDIRNIKNFSSHQHSLGDMFDELINIIDKEQVLSKEQRKVIFRRYEDLYVKLMHYSVFTDKTHQIIKQKYFNDIVPMILALDIRNTYRPDNEMAFYYHIHSFLTQIPDNEDDIYHAARTYLRNYVKLCLSGYTPANAHFKDIFDGVYEFIRNIRKNSTPGKTKLIATINTCKETCKHLLYLSNEDKEKIISDLDKVQVACYYLTILLAFERRTSLTSTLTTLYKMLISEREVSEYECQLLYLTNPIDVMNILNKYIYYFPNENSPFYTLKIDSALSWDAIDAIRDYSISDIYLYPEQKTINCVVEIENIVFGGYIYTLNNGVTLQNIENSLKDSSCHYVLNGYTEFVNCLRQLTSGKTESVHRTINKLNYEKLPFGFIIAAFAILKIAFKIKFSKNHVNIRALLNDINYFMTYQGESINLISLDHEYPESCLQNDTNTYLLGRVIFLYNSMIYKFINCQEHETNNIHSAMINNLLQEVDIALGKINDIIDSRNISAPHELANILTREKILTTREKKGNLISLFDGFTLFHCVGMITFLIHYLRTPEEKVENIFMLYGADKNNKLRRRLIYDALGIIQSQQE >CP033401|1610136:1654789|1652050_1653037_-|AYQ01503.1|DBSCAN-SWA MIPDYLTFIRFQDKRNLIYIYAIGLILIGFYWKNAGFTFPSEDIGVVSGILALVLYNFIFDLKAYWAYKCVTKNIDFSWFKKKQNHKIELFLTQPLVAGFLSLIMLSAMSWGLYQLLPSLYALFLISLLGPLVIFLLFRMIRTSYVKQVAISVAKKVKYKSLTRYVLLSVCISTVVNLLTISPLRNSDSFVTEGQWLTFKSIIALLILCGVVLAINLFFLRFSKRYAFLGRLFLQEIDLFFSSENALSTFFAKPLWLRLFILLVIEMMWITLVSVLATLVEWRIWFEAYFLLCYVPCLIYYFFHCRCLWHNDFMMACDMYFRWGHFNK >CP033401|1610136:1654789|1636525_1637962_-|AYQ01496.1|DBSCAN-SWA MISDNKVTYHEVDFLLPAQRFNIQFSYVSQKGLPFIREFVLRLVHVASMSKAQIATYFGLTHRETEEAISDLVQRGELTLSSDGRLALTDKSNGYFSEVGEIPHLSTIQDSGCTLSFDLATFSCFSNQAFQDHWRGGVTLKVDDNNISQSEKLVEKHFQYQFNQILDKGYLSHLQVQVGKEQPSIYTVNSVNKLRQIPLRLTTEFKMDSDGKAVEREDYEQLNSSECVHELMAVEIARLSRHNNTMSIFKAMQSINDDLTLKLFDSKTNRLNPLFIKDMQALEEYSSSGRTTFLGPVYSKNNWEKLQKALAPALNQRIRDKTDYGGSPFIWVAPSDPYWSKSIRFISSLSNFLSKSATKDKQLYKPVLYVPVRDAADARVARQWKYELGQFHEYAKGLVEGLLDGNVEILHLEDELSAVIYHISQPEILPVTMPLGFITTNKETVKIIGTLVTEYIKGTTGFDKFHDCGAISAMIKKG >CP033401|1610136:1654789|1615880_1616060_-|AYQ01477.1|DBSCAN-SWA MKKQAITWHIICNTLKPLRRSQFIMFIIVWPEGYLLKSALCGESPCAVGRLVYIAFIML 
>CP033401|1610136:1654789|1644975_1646409_-|AYQ01498.1|DBSCAN-SWA MCLLAPENPYPIYALPPLVRNAIIETQKNTQAPLAMVATSALTAISIACQNQIDVCRPGNLHGPVNLYSLILADSGERKTTVDKVFMKAFYLRDEALADEYAKLVENYSTEKEIWEQKQKALESKFHKEIRAGKDYKATESELETHLNKPPVPPQIRRTIFNETTIEGMLKYYSDSNRSFALVSSEGGVIFDSRAMSKLGIINTLWDGGSLFIDRKSSPGINLKEPRLTISVMIQPDVYHKGFCTRKKEIVKTSGHHARFLMCQPTSTQGTRIITGDNYSSQYQDLFEERINELIDESLAMSGERRCLHFSPQAARIWTDYYNDVESKLGGLGPLRHCREYAAKNAEYMARLAGLIYHSSGEEGEISPYTAEMARELAIWYGNEYVRLSNPLTFDNSALTVPVRLIPEELELFNWIKSYCIEKGILCMKKNDILQRGPNRFRKKDKINWLLDLLYEQNRVVPVIEGKTLCVAPNFDL >CP033401|1610136:1654789|1616442_1617225_-|AYQ01479.1|DBSCAN-SWA MMMELQHQRLMALAGQLQLESLISAAPALSQQAVDQEWSYMDFLEHLLHEEKLARHQRKQVMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRFTTAADLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQVIAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLDRILHHSHVVQIKGESYRLRQKRKAGVIAEANPE >CP033401|1610136:1654789|1620886_1621267_-|AYQ04273.1|DBSCAN-SWA MQKNVTPGRRKGCPNYSPEFKQQLVAASCEPGISISKLALENGINANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETLSISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR >CP033401|1610136:1654789|1626100_1626199_-|AYQ01487.1|DBSCAN-SWA MISQIDKLEYVVKVQRNQSNPTMFNKIVVFFQ >CP033401|1610136:1654789|1629859_1630684_-|AYQ01492.1|DBSCAN-SWA MSNTDASGEKRVTGTSERREQIIQRLRQQGSVQVNDLSALYGVSTVTIRNDLAFLEKQGIAVRAYGGALICDSTTPSVEPSVEDKSALNTAMKRSVAKAAVELIQPGHRVLLDSGTTTFEIARLMRKHTDVIAMTNGMNVANALLEAEGVELLMTGGHLRRQSQSFYGDQAEQSLQNYHFDMLFLGVDAIDLERGVSTHNEDEARLNRRMCEVAERIIVVTDSSKFNRSSLHKIIDTQRIDMIIVDEGIPADSLEGLRKAGVEVILVGELASSL >CP033401|1610136:1654789|1628216_1628456_+|AYQ01489.1|DBSCAN-SWA MRNLHIFPVTSVLPHMVIATCVVSLDSKGIYSQVKCLIKYTNLVIINIYSGRMTREVDCQQTRPVVHRLLNKTPDISRL >CP033401|1610136:1654789|1650481_1651684_-|AYQ01502.1|DBSCAN-SWA 
MAVLTDTKARHIKPDDKPLPHGGITGLTLHPSSVKGRGKWVFRYVSPVTQKRRNAGLGTYPEVSIAEAARTARIMREQLAAGDDPLEIKKAESEKVAIPTFADAARRVHAELSPGWENPKHVRQWLSTLENYAFPQLGAKTLDSITAADVAETLRPVWLTLSETASRVKQRIHVVMQWGWAHGFCVANPVDVVDHLLPQQTRGRDEHQPAMPWRQLPLFVATSVYTDEPYNVTRALLLMVILTATRSGEARGMRWAEIDFHKRVWTIPAERMKARLQHRVPLSRQAIYILENIRGLHDELVFPSPRKQQILSDMVLTSFLRKKKAVSDIPGRVATAHGFRSTFRDWCSEQGYSRDLAERALAHTLKNKVEAAYHRTDLLEQRVPMMQAWADYVMSQIVNK >CP033401|1610136:1654789|1618257_1618401_-|AYQ01480.1|DBSCAN-SWA MPGTYQSQSLSGIICRVVWLTFSVVTNGAIVIHTQRLKSDPGGNLLS >CP033401|1610136:1654789|1636214_1636421_-|AYQ04276.1|DBSCAN-SWA MAPLNDALYRYVMNTRLGTIHGTSVGELLAWIKEDENPRKGEMVLIIEGHKAQDDELPADALRTLALL >CP033401|1610136:1654789|1616022_1616199_-|AYQ01478.1|DBSCAN-SWA MATEIKKFEKRDLAQAVIGVGVMVDFISRIMSVLADSWGCWCGLPIDEKTSNYLAYYL >CP033401|1610136:1654789|1625316_1626099_+|AYQ01486.1|DBSCAN-SWA METKQKERIRRLMELLKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIVMVNKPAPSMPVIHDVPKNHRDDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITFTGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASNSSPLDSLNPRKIFISASGVHNHFGVSWFNPEDLATKRKAMNRGLRKILLARHALFDEVASASLAPISAFDVLISDRPLPADYVTHCQNGSVKIITPDSEDE >CP033401|1610136:1654789|1610136_1610595_-|AYQ01473.1|transposase|DBSCAN-SWA MGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREKRRATGSILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKGKSSLMLYEQFGDLKFKYRNREFWCRGDYVDTVGKNTAKIQDYIKHQLEEDKMGEQLSIPCPGSPFTDRK |
38 | Stx2-converting_phage(33.33%) | transposase | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those that match phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
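The keyword rule in the legend above can be illustrated with a minimal Python sketch. It assumes the pipe-delimited header layout seen in the protein records of this report (contig, region, `start_end_strand`, protein ID, an optional annotation, then the `DBSCAN-SWA` tag) and a simple case-insensitive substring match. The function names `parse_header` and `matched_keywords` are hypothetical, and the matching rule actually used by the tool may differ.

```python
# Keywords copied from the legend; the matching rule below is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def parse_header(header):
    """Split one protein header from this report into named fields.

    Assumed layout (inferred from the records in this report):
    >contig|region_start:region_end|start_end_strand|protein_id[|annotation]|DBSCAN-SWA
    """
    fields = header.lstrip(">").strip().split("|")
    contig, region, location, protein_id = fields[:4]
    # Anything between the protein ID and the trailing source tag is
    # treated as the annotation; it is empty for unannotated proteins.
    annotation = "|".join(fields[4:-1])
    start, end, strand = location.split("_")
    return {
        "contig": contig,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "protein_id": protein_id,
        "annotation": annotation,
    }


def matched_keywords(annotation):
    """Return the legend keywords found in an annotation (case-insensitive)."""
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


if __name__ == "__main__":
    examples = [
        ">CP033401|1868468:1878562|1877509_1878562_+|AYQ01693.1|integrase|DBSCAN-SWA",
        ">CP033401|1868468:1878562|1872286_1872520_-|AYQ01685.1|DBSCAN-SWA",
    ]
    for header in examples:
        record = parse_header(header)
        print(record["protein_id"], matched_keywords(record["annotation"]))
```

Substring matching is the simplest choice but can over-match (for example, 'head' inside an unrelated word); a word-boundary match would be stricter.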
DBSCAN-SWA_4 |
1868468 : 1878562
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP033401|1868468:1878562|DBSCAN-SWA TTTAACTGGAAACTTCCGTTGACCGGCTGTATTCATGGCTGCGAATTTTCGCCATCAGCTCGTCTGTCAGTTCGGACACCCACTGGATAGCCAGCCGCTTTTCTTCGTCGCTGCACTCACTAGCCGCTACAAGCTTGATAAAAAAATCAATACGCTGAAGCTTCAATGACTCCAAAAGATAGTCCTGCATCTTCCCTCCTATCATTACACGGATACACAAAAACTGTATATACACCCACTGTTTATATAAACAGTATAATAGGAACAGAAAAATGTAAAACTGTTTTTTGTCAGTTAATTGGATGTACTGATGTCGGTCAATAAAGCACAAAATGTTAAACAGCAGCCTTAGTACCATTGACGCCATTTGTCATCTTCCTGCAGCCTCTGGTTACGGTAAAAAATACGTAAACCGGCACCGGATGGAATGCTGCCGCCACGCAGAAGCAAATCAATCTCAGATGCACTACCTTCAAACCCCCTGGTTGTCAGTTCTGCCTCAAGCTGCAGGCGCTGCTGCTCCAAAATACTCTGTATGTATGCTTTTTTCCGCTTCGGTTTTACCAGTCTCAACCTGGCTGTCAGCTCCCGCCGTTCCTTCTGGCCCATGTTGTGGAGATATTCCTGCAGCTCCTTCTCATCCATGGTTTTAATATCGGGTAAATCACCCCCTGATTTGTTCAGATTTTCAACAGGGGGACAGTTATTGCCACGAGTCCAAGGGGCGCAAGCGCCCTGGTCGGCTGCCGCCTCCTGAACGTCAACGGCTTTACGAACCATTTTCCACTTCACTGCATGAGTGCAGATCTTGCCCTCTGCAATGGGTGACCAGATGCCATAAATACGAATGCCATGATCGCCATAGGCGGTCGGCTCTTCGTTGATTTCATAAGCGGTTCTGATGAGGTGATATTTACGGGGAACCAGTACGCCGCCCTGCTTCATGATGTAGGTGGCAAAACAACCAGCATCAGCAGCAGCCAGAATGGCATCAAGACGCGGGTTATCCAGTACCGGCGCACCTGCTTTTTTGTCACCCTGTTGCCTTGCCGCCTGACCAGCCAGCAAGCGAAGTTCACGGTAAGCCTGACGCCCCGGAATACCAAAGAAGCGGAATTGCTGAACACGATGCAGAGACGCCCAGGCATTCACGTATTCAGCGTTATCACGCAGAGATTTACCCGTTTCCTTGCTGATCTCGCCAGCCAGACCACGACCGTCAATGTTCTTACTGATATATTTCGCGATGTAGCTTGTCGGCGTTCCTTTGCGCGGGTTAATCAACTCAGACTTAAAGCGCGGCCCAGTGTTATTGCCCAGCTCCTCGCGGTCTTCACGGATGGCAAACTTACGCAGTAATGCAGTGATGGCACGGCGGTCTTTTTTGCGCATGAAACACAACAGGTGCCAGTGAACTGTGCCATCATGATGCGGCTCAGCCACCCGCACGCCATACCAGCGCAACCCGGCTTTGTGCATCGCCTTACGAAATGCAGCAAACATACCGACCAGATAATCGCTGCTTTGTCTTACCGTCGCGTTTGTCCAGGTCGGGTTTGGTCTGCCGTTATTTAGCGTGGAATGGAAACGCGACGGACAGGTGATAGTGTAGAAAACGGCGCAGTCACCGCGCATTTCCGCGATAAGCTCCAGACCTTTAACACAGGCCATCATCTCATTGCGGCGATGCGCAGGGTTGCTGCTGCTGGCGTTTACCACATCCTCCATGTCCAGCGTGTCGCCGTCTTCGTTCACCAGTTCATGAGAACGGAAAAACTCCAGCGACTTACGACGCTGCTCACGTTTATGCATCACGGCTTCATAGCTGACATAAGGAGATGCTTTTTTGCTGACCAGGCAAACAGCGCGCAACTGCTCTTCCCGCCATTCGCAACGCATCTTCCATAATTTCCGATACCACCAGTC
GGCGCACAACATACGCGCCAGCGAACCCGGAATGAGTTCATAGGGCACGGGTTTACGGCGGTTTCTTTTCCGACGGAGTTGCTCAAACGCAGGTGGGATGACATCCAGACGCAGGGTTTCCGCTGCCACCTTTTCCCATGTCTTGCGGATTTCTTCTGGCTTAACGTCATCGGTGGCATACAAATCACCACAAGCTGCATCAAGGCACATACTCATATGCGCAGCTACCAGGGTGGAAAGGCGTTTCACCTGATCCTGACTCATTTCAGGCAGGATCAGCAGGCCGTCCAGCCCTTCATGGCTTGCCATAAAGCGAAAAGATGCAGATAGCTGACTGTCGCGTACATGCTCCAGTCGTTCCAGACATGGCTTAATCGTCTCACGTAAATAGCGGGAATAAGCCTTTGGCCTGCCCAGGCTGCTGAAGTATTCAATACGTTGCATCAGCGGCTTGCTGATATGGGAAGGCTGGGCGTTGACGTCCGCCAGAATGACCATGTCTGAATTAAAACGCTGCTGCTCATGCGCCAGCTTTGCCCGGCTAATGAGCTTATCCTGCTCCATTTCGCGCTGGACAGGATCACGTGATTCATTAAAGAAATAACGCTCCCAGACCTGATCACTCAGTGCCTCGCGGCGCAACTGTTCCTGCTCGTTATCGGCAGCGTACAGAGTGATCAGGTTTGAAAGCGTAGAAACCGGCGCAACTTCCGCCGGGTCCAGATAAGGGTTAATGGCCTTTTTCGGGCTGTTCCATGAGAATGCTGCGGCAGCCTCGTTAAAGCCGCAGCAGTTGTTCATATCGGCATGACTCATGCACGTACTCCGTACACGGCAGAACTATCCACGCCACGCGAATAATCAAATCCCATCCAGCAGCGCGGCCCGGAAACAGCAATGATTTCTGTTGCTGATTTACCCTCGCCAGCTGCCACACCGATGCTGCGTTTTACCTTGATATAGTGGTGAGTAAAATTGCGATACAGCGAACGGATCAGGGATGTGTCACTGTTAGAAACAATGACCGGATGTCCTTCTGATGACCGATGTTCAAGAACGGATGCCAGGTGATACTGGTCATCTTCAGTGAAACCATCAGTGTGATAGCCGGAAAACGTACCGTCATATGGCGGATCGCAATACACCACATCTCCCGCCTTCAACATCGCCAGCGTTTCATCAAAGCTGGCGCAGATAAACGTCGCTCGCTGGGCTTTTTCTGCAAATGTGCGAAGTTCTTTTTCAGGGAAATACGGATTTTTATAATTACCGTAGGGAATGTTGAAATGCCCGCTCTTGTTATAGCGACATAAACCACGGTAACCGTGACGATTGAGATACAGGAAATATACCGCTTTCATGAAATCAGTAATTTCAGTTGAGCAGTTAAACTCCTGCCTTATGTTGTAATAAGCCACCTCCCTGTTTGCGATCTCAAATAAAACTCTGGCGCGAGATATAAACGATTCACAATCAGCGGCAACCTTTTTATAGAGGTTGATTAAATCAGGATTAATATCCGCAACCAGATAGCTGGGATAATCCGTTTCCATCATCACAGCACAGGAACCCGCGAAAGGTTCAACCAGTCGCGGGCCAGCAGGAAGGTATTTTTTCAGTTCTGGCATAATGGCGGTTTTATTTCCCGCCCATTTCAGGATGGTGCTCATACAGCACCTCCGTTGTAATGTTTGCCTTTCAGTTCTGCGATTTCCTGACAGGTAATGCAAAGCTGCACTCCCGGAATGGCGCGGCGTCGTGCTGGCGGAATTGGCGCTTCACATTCAATGCAAAGTACGCGTGACACGCCCGGTGATTTGGCACGGGCTGCACGGATATGGCGCTGGCGTTCTTCTTCAACGCGCTGCTGTACGAGATCCATTGCATCAGCCATTAGTGGATCTCCTGCGCTTCGTTCTGGATTGCTTCAGCAGTCACGCGCAGCAGTTCTGCCGCTTCCACGTGGTTTAGCTGGCGGGATGAGATATGACACGCCAGGC
TATCAAGGCGAGCAGCCATTGCTTCAGCCCTTGCCCGGCGTTCTTCCAGACGAGCCTCTGTCAGTAAAATATTAAGCCCTGCGTCATCCGGTCCGGTTTTAGTCGTGAGGGTTTCAATATTACGCATAATCAATTCTCCTGAATTTAGATAAAGGGATGCCCGGCGGGTTTACGCCATTAATTTCATTAGTTGGTTAATTCGGCATGGTTAGCCGTCTGGGAAATAAGCTCACCACTGCACGAAAATGATTCATTGCTTTAATCAGCTCCCGCTTTTCGTCAGTGGTCAGCTCATTAATGCTGATGCTATGACGTTCAGCTGGAATTTTTGCCATAAAGAATATGGCAGCCAGTGCCCGTTTATTTTGTTCATTATTGATATCCCGTGGATCACGCATATCTTTAATAAACCGCTCAAGCTCTGACTCAATATTCAGGCCAAAAACTTTCGCCCTTAACTCCGCAATGTGATTAAGTCCATTCAGGCGTTCACCGGGGCTTAATGGAACAGTTGCTGCAGCGCCATTAATTGCCATACTTCATATCCCCCAAACGCAGCTATCGTTCTTTGTTCTTACGGTAACGCTCAAGAGGAGATACATTTTTTCGTATCGTCTCTTTAACCTGCTCTCCCCGTAAAAACGTCCCATCCTTTAACGTGAAAAAGTAACTGCCATCGCCCGACAATGACGGATAGCAACAGAGCAAATCATCTTCAGGTACTGAATAACTCTCCCCTCTGTAACGAAACTGATAAACCACTTCACTTTCTGCCGCATACATTTGGACTTTCTCCGTTTCCTCGTGGTCAATTCAGACAGCAATTCATCTTGTGAATGACATGGATGCCAGCGTTTTCCATCCTCACCCGTGATCCAGCCGTGACCGTAGTGCATTGCCGGGCTTTGTTTTACCAGCAGCGATGCAAATGATGGTTCTTTCGTCAGCATAAACACCTCACAGCAAACCGAATGAAGCACCAAGGCCAGTCATGGTATCAACTGCACTCGCCATCGCAGGATTAGCCTGTAAACGGGCTTGCAATGAAACAGCAGCCAGCGCCATCAGTCGTGTTACAGAGTTAATGCTGCTGATAGCATCACGACGACCTGCACTGGTTTTTACATCGCCAGATACCGCACCTGCAGCAACACGCCCGATCTCTGCGGTTGCACTCATGACGTAATGCGGTAGTTTCTCTTTTGCCACCTCATTAATCGGAACACATGGCAGACAATGAATCTGTGCCAGAAAACCATCTACCAGCGTTGAATCTTCAGTCAGATCGGTAAGCAACCAAATTTCTGGTGCGGTTAATAAATGAGGTTGAGCTGGGTTCAGTTTGTTCCGCAGAATCTGCACATTCATGCCTGCACGTTCTGCCAGTTGCACCAGATTGTGGCGCAGTGCGAATGCACGACAGGCTTCATCAAAATGTGGATGTTTGGAAACTTGGTAATCAAACATGGTCGACACCTCTGATGTATCCCAAAATGGAACTAGTTGAATACAACATTGCAATCAGTAAGTGCATCAACGGTAAGAGCAGCAAGGTTGATCATTACCTTTTCTCTTTTCTTGTCTTTCCGAAGGCGATGCCGAGGGATGCGACCGTCAGCCAGCATATCGTTAATTGTGTCGATTGAAAGACCAGTAAGTTCGCTATAACGCTCAATTGTGACATGTGGCGTATTCAGGGTTATTGAAATGTTAGGGGTCATGATGCAACATCTCCTATTGGCTTGTGGTGAGTCAGTTTTAATCGTGGCTTTAACTTCACATTTCGGAGAATAGGATCACAAATCGGTTATGTCAACACATGAAATCACATTTCGCCATGTGGACGAAAAAAAGAAATCCTTAATCATGCAGAATCGCGGAGGGCAATCGGTTATAGATCGGATACTGAAAGCCTATGGTTTTTCTTCCCGACAAGCATTCTGTAATCACCTAGGTATATCGCAAAGTACAATGGCGAACAGGTATGCCCGTGACACTTT
CCCTGCTGATTGGGTTGTTATCTGTAGCATGGAGACTGGAGTGCCGGTCGAGTGGTTGGCATTTGGCACTGATACCGAGAAGGGAAGCATTACAAATAATGCAGAAAAAAGTCACAACAATTGTGACAGCAAGCATCAACATCTCAATAGAGAACAAGACATCCAAAATGAGAACTCTTTTACTATTAACCAAGGTGGAAAAGCAGCAATAGAGCGAATCGTTTTGGCTTATGGATTTAAGACAAGACAAGCTTTAGCTGATCATATTGGTGTATCAAAAAGTACATTAGCCAATCGTTACATGAGAGATACCTTTCCTGCTGACTGGATTATTCAATGCTCACTGGAAACCGGTGCTTCATTAACATGGCTAACCACTGGTAACGGGGCAATGTTTGAAAAGCCTCGAAACGATACTATCACTATCCCATATCATAAAATAATTGATGGATCTCTTGCTCAAGAAACCTTCTTGACTTTTGACTCTAAGTTGTTAGAAGGAACCTTTCTGCAACCTTTAGCAGTATTCATTGATGAGGAAATATATATTGTAGAATCAAAATTTAATGAAGTTACTGATGGCAAGTGGCTTGTGAATATTGAAGGGAAAATAAGTATCAAAGATTTGACTCGCATACCCGTTGGTATGGTTAAAGTTGTAGGCACTAACGCAAGTTTTGAATGCTTACTTACTGACATTATCGTTTTGGCAAAATGTAAAAGAGTTTTTACTAAAAATGTATAAAGAGAAACATCATGACTGAACCAACCAATAAAGATAGCGAAATAAAAAAACACCTATTAGAATTTCTTGATTCACAGTCTGAAAATATAGCAAAACACTTCTACTCTCATATAAAAGACTTAATAGAAGCAGGAGAGCTTTCTGAAGCTCATAATAACCTAGCGCTAATTGAAAAATACATAACTAGGCCACCGATGGATGAAGAACCCAATATAAATGAAAATAAAGCCAATAAAAGAAAAAATGTAAAATCACTTGAACCTAATAATTATGTAGAACATATAATACAATTAGAAGAACGAAACAGCATATTAACTCTACAGTTAGAGCATTATACTCAGGATCTTAATAGAAAAAACGCAATAATCGAAAACAACGTAAAACAAATTAATTCATTGATTAGTGAAAATAAGGAACTCCGTAGCCAAGTACAGCAACAAAGAATCGATGATAAAATCCCCACCTATGTTAACGATGTTAAATCAGATCTTGGTAGTGATGACAAACATTTTATATTGATGTCTATTATCTGGTCTATTGCAGGGGTATTTTTTGGCTTCCTTGCAGTAGTATCTGCTTTTTTTACATTATACATGAACTTAGATTTAAAAAATCTCACTAACCTTCAGTTAATATATATCTTCACGCGAGGATTAGTTGGAATCGCCATTCTTTCATGGCTATCATATATCTGCCTTAGTAACTCAAAAAAGTACACACATGAATCGATCAGGCGAAAAGATCGTCGACATGCTTTGATGTTTGGTCAAGTTTTTTTGCAGATATACGGTTCTACAGCAACTAAAGAGGATGCAATAGAAGTCTTTAAGGATTGGAATATTTCAGGTGACTCTGCATTTTCAGGTCAGACAGAGCAACCACCGAGTTTTGCGTCATTTTTGAATACAATCAAAGACAAAGTTAAAGTAACTGGAAGTGATAAAGAAACAGATTAATCATGAACATGTATGCTACTAAGTAAAAAATACATTGAATACTGTTGTTATATGGGTCTTCCCTTGTTGTGGTGGCTGAAGGCATGATAATGGTGTATTTAATCGCCAGAGGTCACCGCCATGGACGAAAAGTCCCTCTACGCTCATATTCTCAACCTGTCCGATCCGTGGCAGGTAAAGTCCCTTTCTCTCGATGAAAATGCCGGTTCTGTTACTGTCACTATTGAGATCGCTGAAAACACCCGGCTAGCCTGTCCGACCTGCGGTAAATCCTGTTCTGTTCACGATCACCGTCATCG
TAAATGGCGCCATCTTGATACCTGCCAGTTCACCACTATTGTTGAAGCCGATGTTCCACGAATTATGTGTCCGGAGCATGGCTGCCTGACGTTGCCTGTTCCGTGGGCTGGCCCCGGAAGCCGGTATACGTTGCTATTCGAATCGTTCGTTCTCTCATGGCTGAAAATCAGCACCGTTGATGCTGTCAGGAAGCAACTTAAGCTCAGTTGGAATGCGGTTGACGGCATTATGACCCGGGCAGTTAAGCGAGGTCTTGCCCGGATAAAAAAGCCATTATCCGCCCGTCATATGAATGTGGATGAGGTCGCCTTTAAAAAAGGACATCGTTACATAACGGTGATCTCCGATCGCGATGGTCGGGCGCTGGCCTTAACGGATGATCGCGGCACAGAGAGTCTTGCCGGCTATCTTCGCACGCTCACTGATGGGCAGTTGCTGGCTATCAAAACGCTCTCAATGGACATGAACGCGGGCTATATAAGAGCAGCGCGTATCCACTTACCCAGTGCGGTTGAGAAAATCGCCTTTGACCGCTTCCATGTGGCGAAGCAACTGGGCGAGGTAGTTGATAAAACCCGTCAGAATGAACATCCGCACCTCCCTGTTGAAAGCCGACACCAGGCAAAAGGAACCCGCTTCCTGTGGCAGTACAGCGATAAGTGGATGACCGAATCCCGGCAGGAAAAGCTGATGTGGCTGCGTGCACAGATGAAGCTGACGAGCCAGTGCTGGGCGCTGAAAGAGCTGGCAAAGGATATCTGGAACAGGCCATGGAGCGAGGAAAGACGGAGTGACTGGCAGAGATGGTTGGCGCTGGCGGCTAACAGTGACGTTCCCATGATGAAAAATGCCGCGAAAACGATAGGAAAAAGGCTGTACGGGATCCTGAATGCGATGCGACACAGTGTCTCAAACGGAAATGCGGAGGCACTTAACAGCAAGATCAGGCTGCTGAGGATAAAAGCCAGGGGATACCGAAACCGGGAGCGCTTTAAACTGGGGGTGATGTTCCACTACGGAAAGCTGAATATGGCGTTCTGAGCCTTCCCACCATGATCGGGGAAGACCCTGTTATATACAGTTAAATTTAGCCCTCTGATATGAGGGCATTTTTTATGGCAGTACGAAAACTCACCACAGGAAAATGGCTTTGCGAATGTTACCCCGCCGGACGTAGCGGACGCCGTGTGCGTAAACAATTCGCCACCAAAGGCGAAGCACTGGCCTTCGAGCGATACACCATGGGGGAAATAGAAGCAAAACCCTGGCTGGGCGAATCAGTGGATCGTCGGACACTGAAAGATATGGTTGAGCTATGGTTCAAATTACATGGCAAATCTCTTACTGCCGGACAGCATGTCTACAACAAGCTGCTGTTGATGGTTGACGCCTTGGGAAATCCCCTTGCAACTGATCTCACCTCAAAAATGTTTGCTCACTATCGAGATAAACGCCTGACAGGCGAGATCTACTTCAGCGAGAAATGGAAGAAAGGAGCAAGCCCGGTCACCATTAACCTGGAGCAAAGCTATCTAAGTAGTGTTTTTAGCGAACTATCCCGTCTGGGCGAATGGTCGTATCCGAACCCACTGGAGAACATGCGAAAATTCACCATCGCAGAAAAAGAGATGGCATGGCTTACCCATGAGCAGATTGTTGAATTGCTGGCTGATTGCAAACGTCAGGACCCAATTCTGGCACTGGTAGTTAAGATATGCTTAAGCACAGGCGCACGCTGGCGTGAAGCCGTAAATCTTACCCGCTCACAGGTGACCAAATACCGAATTACCTTTGTCAGAACGAAGGGGAAGAAAAACAGAAGCATCCCTATCAGTAAAGAGCTTTACGAAGAGATCATGGCGCTCGATGGGTTCAATTTCTTCACAGACTGCTATTTTCAATTTTTATCCGTGATGGAAAAAACGTCTATCGTGCTCCCTCGCGGTCAACTCACACACGTTCTGCGCCATACGTTTGCAGCGCACTTCATGATGTCGGG
TGGAAACATTCTGGCCTTACAAAAAATTCTCGGACACCACGATATAAAAATGACTATGCGTTACGCACATCTGGCACCGGATCATCTGGAAACGGCGCTCCGTTTCAATCCTCTGGCAACGCTGCCAAGTGGCGACAAAGTGGCGGCAGCGGTTGGCATTACCCCGTAA
Protein sequences of DBSCAN-SWA_4 >CP033401|1868468:1878562|1872286_1872520_-|AYQ01685.1|DBSCAN-SWA MRNIETLTTKTGPDDAGLNILLTEARLEERRARAEAMAARLDSLACHISSRQLNHVEAAELLRVTAEAIQNEAQEIH >CP033401|1868468:1878562|1868815_1871209_-|AYQ01682.1|DBSCAN-SWA MSHADMNNCCGFNEAAAAFSWNSPKKAINPYLDPAEVAPVSTLSNLITLYAADNEQEQLRREALSDQVWERYFFNESRDPVQREMEQDKLISRAKLAHEQQRFNSDMVILADVNAQPSHISKPLMQRIEYFSSLGRPKAYSRYLRETIKPCLERLEHVRDSQLSASFRFMASHEGLDGLLILPEMSQDQVKRLSTLVAAHMSMCLDAACGDLYATDDVKPEEIRKTWEKVAAETLRLDVIPPAFEQLRRKRNRRKPVPYELIPGSLARMLCADWWYRKLWKMRCEWREEQLRAVCLVSKKASPYVSYEAVMHKREQRRKSLEFFRSHELVNEDGDTLDMEDVVNASSSNPAHRRNEMMACVKGLELIAEMRGDCAVFYTITCPSRFHSTLNNGRPNPTWTNATVRQSSDYLVGMFAAFRKAMHKAGLRWYGVRVAEPHHDGTVHWHLLCFMRKKDRRAITALLRKFAIREDREELGNNTGPRFKSELINPRKGTPTSYIAKYISKNIDGRGLAGEISKETGKSLRDNAEYVNAWASLHRVQQFRFFGIPGRQAYRELRLLAGQAARQQGDKKAGAPVLDNPRLDAILAAADAGCFATYIMKQGGVLVPRKYHLIRTAYEINEEPTAYGDHGIRIYGIWSPIAEGKICTHAVKWKMVRKAVDVQEAAADQGACAPWTRGNNCPPVENLNKSGGDLPDIKTMDEKELQEYLHNMGQKERRELTARLRLVKPKRKKAYIQSILEQQRLQLEAELTTRGFEGSASEIDLLLRGGSIPSGAGLRIFYRNQRLQEDDKWRQWY >CP033401|1868468:1878562|1868468_1868657_-|AYQ04285.1|DBSCAN-SWA MQDYLLESLKLQRIDFFIKLVAASECSDEEKRLAIQWVSELTDELMAKIRSHEYSRSTEVSS >CP033401|1868468:1878562|1872587_1872929_-|AYQ01686.1|DBSCAN-SWA MAINGAAATVPLSPGERLNGLNHIAELRAKVFGLNIESELERFIKDMRDPRDINNEQNKRALAAIFFMAKIPAERHSISINELTTDEKRELIKAMNHFRAVVSLFPRRLTMPN >CP033401|1868468:1878562|1873046_1873343_-|AYQ01687.1|DBSCAN-SWA MLTKEPSFASLLVKQSPAMHYGHGWITGEDGKRWHPCHSQDELLSELTTRKRRKSKCMRQKVKWFISFVTEGRVIQYLKMICSVAIRHCRAMAVTFSR >CP033401|1868468:1878562|1874259_1875138_+|AYQ01690.1|DBSCAN-SWA MQNRGGQSVIDRILKAYGFSSRQAFCNHLGISQSTMANRYARDTFPADWVVICSMETGVPVEWLAFGTDTEKGSITNNAEKSHNNCDSKHQHLNREQDIQNENSFTINQGGKAAIERIVLAYGFKTRQALADHIGVSKSTLANRYMRDTFPADWIIQCSLETGASLTWLTTGNGAMFEKPRNDTITIPYHKIIDGSLAQETFLTFDSKLLEGTFLQPLAVFIDEEIYIVESKFNEVTDGKWLVNIEGKISIKDLTRIPVGMVKVVGTNASFECLLTDIIVLAKCKRVFTKNV >CP033401|1868468:1878562|1873350_1873860_-|AYQ01688.1|DBSCAN-SWA 
MFDYQVSKHPHFDEACRAFALRHNLVQLAERAGMNVQILRNKLNPAQPHLLTAPEIWLLTDLTEDSTLVDGFLAQIHCLPCVPINEVAKEKLPHYVMSATAEIGRVAAGAVSGDVKTSAGRRDAISSINSVTRLMALAAVSLQARLQANPAMASAVDTMTGLGASFGLL >CP033401|1868468:1878562|1873892_1874114_-|AYQ01689.1|DBSCAN-SWA MTPNISITLNTPHVTIERYSELTGLSIDTINDMLADGRIPRHRLRKDKKREKVMINLAALTVDALTDCNVVFN >CP033401|1868468:1878562|1877509_1878562_+|AYQ01693.1|integrase|DBSCAN-SWA MAVRKLTTGKWLCECYPAGRSGRRVRKQFATKGEALAFERYTMGEIEAKPWLGESVDRRTLKDMVELWFKLHGKSLTAGQHVYNKLLLMVDALGNPLATDLTSKMFAHYRDKRLTGEIYFSEKWKKGASPVTINLEQSYLSSVFSELSRLGEWSYPNPLENMRKFTIAEKEMAWLTHEQIVELLADCKRQDPILALVVKICLSTGARWREAVNLTRSQVTKYRITFVRTKGKKNRSIPISKELYEEIMALDGFNFFTDCYFQFLSVMEKTSIVLPRGQLTHVLRHTFAAHFMMSGGNILALQKILGHHDIKMTMRYAHLAPDHLETALRFNPLATLPSGDKVAAAVGITP >CP033401|1868468:1878562|1876214_1877435_+|AYQ01692.1|transposase|DBSCAN-SWA MDEKSLYAHILNLSDPWQVKSLSLDENAGSVTVTIEIAENTRLACPTCGKSCSVHDHRHRKWRHLDTCQFTTIVEADVPRIMCPEHGCLTLPVPWAGPGSRYTLLFESFVLSWLKISTVDAVRKQLKLSWNAVDGIMTRAVKRGLARIKKPLSARHMNVDEVAFKKGHRYITVISDRDGRALALTDDRGTESLAGYLRTLTDGQLLAIKTLSMDMNAGYIRAARIHLPSAVEKIAFDRFHVAKQLGEVVDKTRQNEHPHLPVESRHQAKGTRFLWQYSDKWMTESRQEKLMWLRAQMKLTSQCWALKELAKDIWNRPWSEERRSDWQRWLALAANSDVPMMKNAAKTIGKRLYGILNAMRHSVSNGNAEALNSKIRLLRIKARGYRNRERFKLGVMFHYGKLNMAF >CP033401|1868468:1878562|1875149_1876094_+|AYQ01691.1|DBSCAN-SWA MTEPTNKDSEIKKHLLEFLDSQSENIAKHFYSHIKDLIEAGELSEAHNNLALIEKYITRPPMDEEPNINENKANKRKNVKSLEPNNYVEHIIQLEERNSILTLQLEHYTQDLNRKNAIIENNVKQINSLISENKELRSQVQQQRIDDKIPTYVNDVKSDLGSDDKHFILMSIIWSIAGVFFGFLAVVSAFFTLYMNLDLKNLTNLQLIYIFTRGLVGIAILSWLSYICLSNSKKYTHESIRRKDRRHALMFGQVFLQIYGSTATKEDAIEVFKDWNISGDSAFSGQTEQPPSFASFLNTIKDKVKVTGSDKETD >CP033401|1868468:1878562|1871205_1872063_-|AYQ01683.1|DBSCAN-SWA MSTILKWAGNKTAIMPELKKYLPAGPRLVEPFAGSCAVMMETDYPSYLVADINPDLINLYKKVAADCESFISRARVLFEIANREVAYYNIRQEFNCSTEITDFMKAVYFLYLNRHGYRGLCRYNKSGHFNIPYGNYKNPYFPEKELRTFAEKAQRATFICASFDETLAMLKAGDVVYCDPPYDGTFSGYHTDGFTEDDQYHLASVLEHRSSEGHPVIVSNSDTSLIRSLYRNFTHHYIKVKRSIGVAAGEGKSATEIIAVSGPRCWMGFDYSRGVDSSAVYGVRA 
>CP033401|1868468:1878562|1872059_1872287_-|AYQ01684.1|DBSCAN-SWA MADAMDLVQQRVEEERQRHIRAARAKSPGVSRVLCIECEAPIPPARRRAIPGVQLCITCQEIAELKGKHYNGGAV |
13 | Salmonella_phage(90.0%) | transposase,integrase | attL 1868138:1868151|attR 1878604:1878617 |
Homologous phage analysis in the prophage region: bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
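The protein records listed for each region carry this keyword annotation in their FASTA headers, so the flagged proteins can be recovered from the text alone. The sketch below is a minimal illustration, not part of the original tool: the field layout (`contig|region_start:region_end|start_end_strand|protein_id|[keyword|]DBSCAN-SWA`) is inferred from the headers on this page, and the names `HEADER_RE` and `parse_header` are hypothetical.

```python
import re

# Header layout inferred from the records on this page (an assumption):
#   >contig|region_start:region_end|start_end_strand|protein_id|[keyword|]DBSCAN-SWA
# The keyword field (e.g. "integrase", "terminase") is optional.
HEADER_RE = re.compile(
    r"^>?(?P<contig>[^|]+)\|"
    r"(?P<region_start>\d+):(?P<region_end>\d+)\|"
    r"(?P<start>\d+)_(?P<end>\d+)_(?P<strand>[+-])\|"
    r"(?P<protein_id>[^|]+)\|"
    r"(?:(?P<keyword>[^|]+)\|)?DBSCAN-SWA$"
)

def parse_header(header):
    """Split one protein header into its fields.

    Returns a dict; 'keyword' is None when the record carries no
    phage-related keyword annotation.
    """
    match = HEADER_RE.match(header.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {header!r}")
    record = match.groupdict()
    # Coordinates are stored as text in the header; convert them to integers.
    for key in ("region_start", "region_end", "start", "end"):
        record[key] = int(record[key])
    return record

if __name__ == "__main__":
    examples = [
        ">CP033401|1958145:1985349|1984278_1985349_+|AYQ01807.1|integrase|DBSCAN-SWA",
        ">CP033401|1958145:1985349|1971666_1971837_-|AYQ01783.1|DBSCAN-SWA",
    ]
    for line in examples:
        rec = parse_header(line)
        print(rec["protein_id"], rec["strand"], rec["keyword"])
```

Treating the mere presence of the keyword field as the "colored" flag avoids re-implementing the keyword matching itself, whose exact rules are not stated on this page.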
DBSCAN-SWA_5 |
1958145 : 1985349
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP033401|1958145:1985349|DBSCAN-SWA TATGACAACGGACGATCTTGCCTTTGACCAACGCCATATCTGGCACCCATACACATCCATGACCTCCCCTCTGCCGGTTTATCCGGTGGTGAGCGCCGAAGGTTGCGAGCTGATTTTGTCTGACGGCAAACGCCTGGTTGACGGTATGTCGTCCTGGTGGGCGGCGATCCACGGCTACAATCACCCGCAGCTTAATGCGGCGATGAAGTCGCAAATTGATGCCATGTCGCATGTGATGTTTGGCGGTATCACCCATGCGCCAGCCATTGAGCTGTGCCGCAAACTGGTGGCGATGACGCCGCAACCGCTGGAGTGCGTTTTTCTCGCGGACTCCGGTTCCGTAGCGGTGGAAGTGGCGATGAAAATGGCGTTGCAGTACTGGCAAGCCAAAGGCGAAGCGCGCCAGCGTTTTCTGACCTTCCGCAATGGTTATCATGGCGATACCTTTGGCGCGATGTCGGTGTGCGATCCGGATAACTCAATGCACAGTCTGTGGAAAGGCTACCTGCCAGAAAACCTGTTTGCTCCCGCCCCGCAAAGCCGCATGGATGGCGAATGGGATGAGCGCGATATGGTGGGCTTTGCCCGCCTGATGGCGGCGCATCGTCATGAAATCGCGGCGGTGATCATTGAGCCGATTGTCCAGGGCGCAGGCGGGATGCGCATGTACCATCCGGAATGGTTAAAACGAATCCGCAAAATGTGCGATCGCGAAGGTATCTTGCTGATTGCCGACGAGATCGCCACCGGATTTGGTCGTACCGGCAAACTGTTTGCCTGTGAATATGCAGAAATCGCGCCGGACATTTTGTGCCTCGGTAAAGCCTTAACCGGCGGCACAATGACCCTTTCCGCCACACTTACCACGCGCGAGGTTGCAGAAACCATCAGTAACGGCGAAGCCGGCTGCTTTATGCATGGGCCAACTTTTATGGGCAATCCGCTGGCCTGCGCGGCAGCAAACGCCAGCCTGGCGATTATCGAATCCGGCGAATGGCAGCAGCAGGTGGCGGCTATTGAAGTGCAGCTGCGCGAGCAACTGGCACCAGCCCGTGATGCCGAAATGGTTGCCGATGTGCGCGTACTGGGGGCAATCGGTGTGGTCGAAACCACTCGTCCGGTGAATATGGCGGCGCTGCAAAAATTCTTTGTCGAACAGGGTGTCTGGATCCGGCCTTTTGGCAAACTGATTTACCTGATGCCGCCCTATATTATTCTCCCGCAACAGTTGCAGCGTCTGACCGCAGCGGTTAACCGCGCGGTACAGGATGAAACATTTTTTTGCCAATAACGGGAAGTCCGCGTGAGGGTTTCTGGCTACACTTTCTGCAAACAAGAAAGGAGGGTTCATGAAACTCATCAGTAACGATCTGCGCGATGGCGATAAATTGCCGCATCGTCATGTCTTTAACGGCATGGGTTACGATGGCGATAATATTTCACCGCATCTGGCGTGGGATGATGTTCCTGCGGGAACGAAAAGTTTTGTTGTCACCTGCTACGACCCGGATGCGCCAACCGGCTCCGGCTGGTGGCACTGGGTAGTTGTTAACTTACCCGCTGATACCCGCGTATTACCGCAAGGGTTTGGCTCTGGTCTGGTAGCAATGCCAGACGGCGTTTTGCAGACGCGTACCGACTTTGGTAAAACCGGGTACGATGGCGCAGCACCGCCGAAAGGCGAAACTCATCGCTACATTTTTACCGTTCACGCGCTGGATATAGAACGTATTGATGTCGATGAAGGTGCCAGCGGCGCGATGGTCGGGTTTAACGTTCATTTCCACTCTCTGGCAAGCGCCTCGATTACTGCGATGTTTAGTTAATCACTCTGCCAGATGGCGCAATGCCATCTGGTATCACTTAAAGGTATTAAAAACAACTTTTTGTCTTTTTACCTTCCCGTTTCGCTCAAGTTAGTATAAA
AAAGCTGAATGCGAAACATTAAAAAACATTAATATCAATGTGTTACAATATCATTGGTCTAAAAAATAGACTACATGATGCTACAAAACACAACATATCCAGTCACTATGAATCAACTACTTAGATAGTATTAGTGACCTGAGACAGAGCATTAGCGCAAGGTGATTTTTGTCTTCTTGCGCTAATTTTTTGTTATCAAACATGTCGCACTCCAGAGAAGCACAAAGCCTTGCAATCCAGTGCAAAGATTTGTGTGCCTCAGTTTTGTCTAAGTGTTCTACTGAAAACATAGTAAAATCGGCAACAGCTGGAAATCATTCAATACTCGCACTATCGAAAGTTCACCAGCCAACCGCAGCACGTTCTTGCATACGACGTGCTGCGGTTTTCTTTATGATTTATGCACAATGGACAATTTGAAATTATTGATGATTGTATGGTGCATCGTTTTCTGAACCTACACTGATTTTTTGGTATAGCCTTGCCTAGTCAGTCTTACCGGATCAAACTCTTCGCTATTGCAATACTAACCAAAATCATCAATTTGACAGCGATTAACCAGAATAATAGTATACTATCACCAGTAAGAAATTATCGTTATTTGTAGCGATACATATTATATATATATCATTCTCAGGTGCGTACATGATTATCAACCAGGTACCTATAAAAATAAAAATCTTTATCTTTTTATTTTCATGCATCTCTATTATATTTTTGTTACTGCATGCAAATAATGGAATATACATAACACAAACAACACAAATAAGTTATAGTGTTTTCATTATTGGGCTTTTTTTCATAAACCTGATGATTTTTATTTTTCTATTGCTTTACTATGTTTCTAATCAGAGACAAAGTTATCTCTTAATTCTTTCATTCGCGTTTTTGAGCAACACGTATTATTTATTAGAAGTGGCTATTATTTCTTTATCTCCGTTAGGTAACGATTTATCTACAATCTATCAGAAATCAAATGATATCGCAATATATTATCTATTCCGTCAGTTCAGCTTTATATCTATAATCTTTCTGGCTGTTTATTCCACCAATGTTAAAAATAAAAGTGTTTTAGAAGATAAAAGAAACATAATAATTGTTGTTTTGTCAATATTAATTCTTTTTATTACTCCGTTTGTAGCAAAAAATCTAAGCAGTGACAATATAAAATATAGTCTTAATATTATACAATACTCGCTGAATCGTCATTTGCCGACGTGGAATATCGTGTACACCAAAATAATATCAGTATTTTGGCTTGTATTACTTATCAGCTCATGCATCAGCATACGTAATTACTCAAAAATATGGTTGTGTATAATACTTATTAGTATAGTGTCAGTATGCAATAATCTAATTTTATTGTATTTTATTGATAAATCCCATCCTGCATGGTATATGACAAAATTTCTTGAATTGATATCAATGATTTATATCATTTCAACACTCATGTATTATGTTTTCAGGAAATTAAATCATGCTAATCATATGGCAATTCATGATCCACTAACGAATACATACAATAGAAGATACTTTATTGACTCATTGAAGAATATATCAAAACACCATGATTTCTCAGTAATAATGTTAGATATTGACAGTTTCAAAAGCATCAATGACAAATGGGGGCATCATATGGGTGATCAAGTCATAGTAATGGTTACCAGAATAATAAAAAAATCCATCAGGAAAGAGGATGTATTAGGGCGCTTAGGCGGTGAGGAGTTCGGTATTATCATTAAAGGTAATACTCAAAAGCTCTTGCTATCAATTGCAGAGCGAATCAGAAAAAACATTGAAGAGCAATGCTCGGAAAAATTATTATCGCATGGACCTGAGAAAATAACTGTCAGTATTGGTTGCTTTACTTCAAAAGAGAATAATCTCAGCCCATCTGAAATGTTAGTCAATGCCGATAAAGCGTTATATCAAGCCAAAAGAACCGGAAAAAACAAGGTGATAATTCACTCAAAATAAACACCTTTTTAAAATACAGCCCC
AATAAACTGCAGAATATTATCCCATATAATATCCTGCAGTTCGTAATGCACTATTCGATAATGGGTACTGTTGGCCATTCAATATCCGGTGCAGTTGTTGTATTAACACGGTTCAGCAACACCCGATACTTCTTCCAGGCTTCCAGCAACGAGTTTTCTTCCTCCGTTGCGATCTCCAGATCTACAGCATCCTGAAGTGGCGCAATATGCTCACTGAATTCCTGGATGTAGAACTATGTGGTGACGGTCTTCCAGCCATTCGGCTCCTGCTGTATCGAAGCATACCAGGCTATTTCAATATCGCTATGCTGCGGCAGCATTTAACCCCTTGTAATTCATCACCATAATTGATTTAATTCACAAACAAAACTATAACATGGTGAAATTAATGAAAAAAAACACAGATGATGGGGCTAAAATTTACACACCACTTACCCTAAAGCTTTATGACTGGTGGGTTTTGGGAGTATCAAATCGGCTTGCATGGGGATGTCCTACAAAGGAACACCTTCTTCCACACTTTCTGGAACATTTAGGTAACAACCATCTGGATATTGGTGTTGGAACTGGGTTTTACCTTACTCACGTACCTGAGAGTAGTCTGATATCTTTAATGGATTTGAACGAAGCTAGCCTGAACGCGGCATCTACAAGGGCTGGGGAATCAAAAATTAAACATAAAATTAGCCATGATGTTTTTGAACCTTATCCCGCGGCGTTACATGGTCAATTTGATTCCATTTCCATGTTTTACCTTCTTCACTGCCTGCCTGGAAATATATCTACAAAAAGCTGTGTAATACGCAATGCGGCGCAGGCCTTAACTGACGATGGAACTCTATACGGAGCCACAATTCTTGGCGATGGAGTTGTGCACAATAGCTTCGGTCAAAAACTGATGCGCATTTACAATCAGAAAGGCATCTTTTCAAACACAAAAGATTCCGAAGAAGGCTTAACACATATACTCTCAGAGCATTTCGAGAATGTTAAAACCAAGGTTCAAGGTACTGTAGTAATGTTTTCCGCTTCAGGGAAAAAATAGCATCCAACCGCAGCTCGCTCTTGCTTAAGACGTGCTGCGGCATAATCCCAATGATTACTCCCTGACAGGGTTCGTAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTCTTCCAGACTTCCAGCAACAAGGTTTCTTCCTCCGTTGCGATTTCCAGCTCAACAACAGTCTGAACGTACCAGGAACAGCCTCCTTCAGGGCTTGAAGGATATCAATGTTCGCTTCCTGTTAACTGCCCGACAAGTGCAACCAGTTCGCTTACCTGATTTTCCAGAGTGGTGATCCGGGTGCCTGCTGTTGCCAGATTTTCACGTAACGTTGTATTTTCCTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAACCCCGTCACGGCGGAGTAGTCAACATTAAGATAGCGAGTTTCTTCGCGTAGCTCGTTGCCGTCAACGGTCGGACCTTGCAACTCTTCACCATAATGAGTAAACGATCCCACAGCTTCAGGTATCGCCTCCATTACTTCCTGTGCAATAACGCCAGCATAAGGCATTCCGTTTTCCTTGAGCGTGTAGGTGTACCCGTTCATTTTACGGATTGCTTTCGTCGCGTCGCTGATAACGAGAATATCGTCTTTAAGGTCGCGGTCTGATGACTGATTCAGCGTTGTGCAATTAATAGCGCCATTTACATCAAACAACTGGCCTGCTGACGTTTTTTGCGCATAAAACAGATACGCAGCAGACGTTCCAACCTCAAAAACGTTTTGTCGAGTACTGGAACCCCACACCCTGACAGAAAACGGTAGTTCTGCATTACCTGAGTTCTGTAAAACAAAACGATTGCCAGTCCCTGATTGTTTTGTAAGGGTTAAATCAACTGTTGAGTTAACCTCATCCTTGTTGATAGTGAGCGCCTGCGCTTTATCAGTTCAT
CCAGCGCGGCTGCTTTGTTCATGGCTTTGATGATATCCCGTTTCAGGAAATCAACATGTCGGTTTTCCAGTTCCGGAAAACGCCGCTGCACCGACAGGGGGATCCCGTCGAGAATACTGGCAATTTCACCTGCGATCCGCGACAGCACGAAAGTACAGAATGCGGTTTCCACCACTTCAGCGGAGTCTCTGGCATTTTTCAGCTCCTGTGCGTCGGCCTGCGCACGCGTAAGTCGATGGCGTTCGTACTCAATAGTCCCTGGCTGGAGATCTGTCTCGCTGGCCTGCCGCAGTTCTTCAACTTCCCGGCGCAGCTTTTCGTTCTCAATTTCAGCATCCCTTTCGGCATACCATTTTATGACGGCGGCAGAATCATAAAGCACCTCATTACCCTTCCCACCACCCCGCAGAACGGGCATTCCCTGCTCCTGCCAGTTCTGAATGGTACGGATACTCGCGCCGAAAATGTCAGCCAGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAAAGCCCGTTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGGGGGTGTTTTAAATAAAAACATTAAGTTACGACGAAGAAGAACGGAAATGCCTTAAACCGGAAAATTTTCATAAATAGCGAAAACCCGAGAGGTCGCCGCCCCGTAACCTGTCGGATCGCCGGAAAGGACCCGCAAAATGATAATAATTATCATCTGCATGTCACAACGTGCATCTACGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATCATATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGACACTAAATACGGGACAACCTCATGTCAACGAAGAACAGAACCCGCAGAACAACAATCCGCAACATCCGCTTTCCTAACCAAATGATTGAACAAATTAACATCGCTCTTGATCAAAAAGGGTCCGGGAATTTCTCAGCCTGGGTCATTGAAGCCTGCCGCCGGAGACTGTGCTCAGAAAAAAGAGTTTCTCCTGAAGCAAACAAAGAAAAGAGTGACATTACTGAATTGCTCAGAAAACAGATCAGACCAGATTGAAGCAATTTAGATAATCGTGCAGACTACGCCCCTCATATCACATGGAAGGTACTACAATGGCTCAGGTTGCCATTTTTAAACAAATATTCGATAAAGTGCGAAATAATTTAAACTATCACTGGTTTTATTCTGAACTAAAACGTCACAATGTCTCACATTACATTTACTATTTAGCTACAGAGAATATTCATCTTGTTCTTGAAAACGATAATACGGTTTTAATAAAAGGACAGGGTAAGGTTGTAAATGTAAGATTTTCAAAAAATAAATGCCTTATAGAAGCCACCTTAAAAGGATTCAAATCAGGAGAGTTATCATTTTACGAATACAGGAAAAATCTTGCTACAGCAGGGGTTTTCAGATGGATTACAAATATCCACGAAAACAAAAGGTATTACTATACCTTTGATAATTCATTACTCTTTACTGAGAACATTCAGAACACTACACAAATATTTCCGCACTAAATCATAACGTCCGGTTTCTTCCGTGCCAGAACCGGACTCGCTGGCATGATGAAATATGTGTACCCGGTAACCCCGGTGTGCATCGTTTTTGATTATTCCCGCACACTCGCGCAGAAGGAGTTCCCCGTCGGGCTACGGTCTCTGTTAATACGGGAATACGGCGACGATACAGCGCATGATGTGTCAGGCTTGAATACCTTTATCCTTTAAAAGGGATATCAGTTAAGTTATCCCGTGTAGGGTATAAACCATTATCAAAGCCACTCTGTAGGAAGTGGCTTTTGTAATGGCAATAAAAAGCCCCGCGAATGCGAGGCTAAATCCTGGTATTTGTAATGACTGGTTCTTATCTCAACGCAGCCCCTTACCGCGCGCAAAATGCTCAATATCAAGCATCAGCAATGAGATGTTTAAT
CTGGATTCACTCCAGAAGTGAGCACCACCCTGTCTACAGAGCCAGATGTGAAGGATGATGAGTAAAATTATCGCTATCATCGAAGGCATTGCGTCCTGATGTATTCCTGAAGCGTTCTCAGTGCTGTTTGGTCGCGGATAATTCCGTCCCGGATACCGAGAACGTTTCGTCCAGCAACTGGAGAGAGTTCGACGGTGGCATCATTGCCCATGCCGGAGGCGCTGGAGGTTTCGGCTGAGGATGGCACAGGGCATTTTCCTTTGACGAGCACCCGACCACCATTATCAAGCTTGCGCCGAAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTATCATCGAGTGCATCAGCATCACGCTGGCGCTGCTGCATGTCAGTAATGGTGGCGTTCGCCTTCTCCAGTTCACTGGCCTTGTTATCGCGCTGTTCTTTGTAGGCGATTGCATTATCACGGTAATGATTAACAGCCCATGACAGGCAGACGATGATGCAGATAACCAGAGCGGAGATAATCGCGGTTACCCTGCTCATTGTTGCCCCCACAAACAGACCTCACGCTCAATCTCACGACGAGTCATCAGGCCTTTCCATTGCTTACCGCCAGCGTATGCCCAGCGACGTAGCTGGTCACATGCGCCCTTGATATCGCCCTGGTTTATTTTGCGAAGAAGAGCGGATGTTCTGAAATTGCCTGCGCCCACGTTATAGACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCTACCGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCATTCTGCTTCGGTATACGTTTTACCGGGAATGATGTCTTTTCCTGTATGCCCGTGACATACAGTCCATACGCCAACGATATCTTTGTATGGTATGTAGCTGACACCTTCCAGACCATCGTTACCACTTGGGCCAGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACGGCTTTTCGTAATGATGGAGGCATTATTCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAAGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGCGTGACTTTATCGAGCAGCTGTAAAAACCAGTAGCCAGCACTGCCTGCGGAGGTGCCGTAGGCAATGCCCGTTGTTAACTTATCCATGGATTTCATAGCCTCACCTCCGCAAATAACGGATGGTGTACACGGTTCGGAACGAAGAGGAAAGGTATAGAAGTTACATTAGCGTAAGGCTTGAACATCTATTCAAAAAGAAAAACGCCGCGATTATTCTGGCGTAGCTGAAAGCATCATACAATTATCAAATACGAAAATTACAAAATCATTAAAACGCATCACGTTACATCATGTCTTTTTCTAAAAAAAATCTTGATGAATATTGATGGGGAGGAACACCAAAATACCTTCTGAAAACACTTACAAAATATGACGTGTTTTCATAACCGCATATCTCAGCAACTTTTCCAACAGAATATAAATTGTAGCTTAATAACCTTTCCGCCATCACCATTCGCTCTTCAAGAATTAACTTACTAAATGATAAGCCTTCGTGCTTTAATTTTCTTTTTAACAGACTTTCACTCAGATACAGTCTTGAAGATATATCACAAAGTCTCCATGCTGCAGATATATCCGTGTGAATAATAGCCTTAACTTTACTTCCTAAACTATTAAGACATCCAAATAAAAAACTTTGCACTATTTTCTCTGAAGATAAGATAGCAAGACATGCAAGTGATATTTGATTTCTAACAAAATCCACAGTTCTGCCATCACAATTCAAGCATGCAATCAAGTTCTTTAACAATGAAAAATCTTCACATTCCACCATCAAGTATGCCGGATAAAACCTTCTTACAGAAAAAGGTGAGAGTGTGTTGCTTTTAAAGAAATCATTAA
CTGTTTTTTCTTCAACATCTACGATCATTACATGATCTATATTTGATGAAAAAAATCTTTTAAATTGTAATCAATGAGAACAGCACTTCCTTTTTTAAACAAAATATCTTCTTTACCAATTCGGACATCAAACGAATTCAACACCAAAATGATAGAACATATGTATGGCATATTATCCACCTGATATCATTGGGGTTACACCAGGTAAGTGTAGGTGGAAAATCAATATTCGCCAGTTCAACAATAAGGAAAATTTCATTACATCACAAGTATAAAATTATGTATTTAACTCACAAAGACAAATTATTAAACCAATCTGTTATATTATATATAGCTGCGTGGAATCATAATATCATATATTTTGACTGGCATGTTTACCAACTTTAAGTTGCATCTCAATTGTTTCTTCAGCGTAAACAGAGTTTTTATACAAACTGACACTCTGGGTATCATAGTGTAGTTTTTACGATTGTAAATATCCTGCATGCAGGAACTCATCCTTTTGGATGATATCGCATACAATTAATTTACCATCAGTCTTAGAGCCAGTTCGTCCGGATAGGGATCGAAGTAATTTTGTGTAAGCAAGTAATCATTAGGATACTCACCCAGATAATGCTTCAGCAGAGTCAACGGCGCAAGAAGAGGTAATGTGCCAGAACGATAGTTAAGTATAACCTCGCTCAACTCTTTACGCTGGCGTGTACTTAAGTAGTTACTAAAATACCCCTGTATATGCATCAGCACATTCGTGTGATTTTTACGTGATGCAGGTTTTCTGAGAATCGCCATCAGCTTATCACGATACACCTCAAAGTATGATTCAAGGTCCGCCCACTCGTGTATTGCAGCCACAAATGGTCCCATATCTTTATAGCCTGCCTGACTATGCGCCAACAACTGAAGCTTATAACGACTATGAAAAGCTAATAACTCTCTTCTTGATAATTTCTCCTTGTAAAGGTGATTGAGCTCATGCAAAGCAAAAACTCTTTCAACAAAATTCTCACGAAGCACTGGATCATGTAATCGCCCATCCTCTTCAACCGGTAGCCAGGAAAACTTTTCCATCAAAGTGCTCGTAAATAGTCCCACTCCATCTTTACGACCTCGATTACCATTTTCATCATAGACACGCACGCGCTCCATGCCACAGCTGGGAGATTTAGCACAAACCACAAACCCCGATACATCCTTTAATTTGTCCATATAAGAACGACTAAACTCTGTCATTCTCTCTGTCACATCCTCATTCTGGTCGTGGCTGAAACACATCCGTATATTTCCTTGCATCGAGCGCACAAGACGTAGAGCAGGACGCGGAACTGGCAGCCCTATAGCCATTTCCGGACATACTGGTCTGAATGTTACCCATTCCACTAATTTGTCCATTAAAAAGTCAGCTCTTTTGTGACCACCATCAAAACGAACAGCAGAACCGGCCAAACAACCGCTGATTCCAATCACAGGTTTTTTTATCATATTCTCCCCCTTGACTAATTCATTAACACATAAACTGTGTAGTGCACGGAATAAATTGCCTTTCTGGCGTCATCACTGACAATTTTTCTGTTATGGACTATTCCTAATATAGTATGAAAGTTCTTTAAGTGATCGGTCGTAATCATCTATCTTTCATACTTACTCTCAACTATCAAAAGTACAGGATTTATTATGAAGTTATGGCCTGTGTTGACTGGCATTGCACTCTCTTTCACTCTTATAGCATGTAAGGCCCCGACACCACCTAAAGGTGTGCAGCCGATTACAAATTTTGACGCCAACCGCTACCTCGGAAAATGGTATGAAATAGCTCGCCTCGAGAACCGGTTCGAACGTGGTCTGGAACAGGTCAGCGCTACTTATGGAAAACGGAACGACGGAGGGATTCGTGTACTTAACCGTGGATACGATCCAACGAAAAATAAATGGAGCGAGAGCGAAGGTAAAGCATACTTTACTGGAGATACTAAAACTGCAGCATTGAAGGTTTCGTTTTT
TGGCCCCTTCTATGGTGGCTATAATGTAATCAAACTGGATGATGAGTATAAGTATGCTCTTGTCAGTGGTCCGAACAGAGAATACCTATGGATTCTGGCAAGGACCCCAACTATTCCAGATAAAGTAAAAGCAGACTATGTGCGAACCGCTCAAAAGTTGGGATTCAATGTCAATGAATTATTATGGGTTAAACAATAAAATCCCTACCCGAAATAATACTTATTAGAAAAAAACCAGCCTTTGGGGAGGCTGGCTAAATCAGGAAACAAGCTGTTATATGATAATAACTACGTTGTGATTCCAACATTTAAAATGTTAGACTAATGACAATCAGACAGCAACTTTTCCTTTAATTATTTCGAACAATCAGCATCCATCTCCAATCGGAGATCCAACACCATCAGCATACCCTCCACTACGCCCTCAGCTTTCTGAAGCATCCTGCCAACCCAACAATCAGATCGCCCATGCTTACGTGCAAGCGCCATAAAAGTCATGCCGCCGACATAATAGTCCACCAATAAATCATGCAAATCGCTGTTGTTCTTTTTCAGACGGGCCATGCACCCGCAAATGATCATCGCATCATCGTCACAACATTGCGGGCGAGATTTTACTTTTGAAGGAATTAATCCCTTAAAACCGGCGGCAATGGACGACCAAGTCACATCTTCATGATTATTAGCCGCCCACGCTCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTACTCCACAAAAATCAGACCAGAACGCCAATCACAAGCAAAAATCAACAAAACAGTATTAGTTGATTGTTATCTCTGACTTCATACTCCTGCTCCTGTCAGGGTTTTGGCGTAATTCTTCAGTATTCGGTAATCGGTCAAAACAGAACCGGGGAAACGATATAAGCGCAGATGCCCCCAGCGGTGGCGAAGAAGTTCTGCCATATAAAACTCAAACATCATTCATTCCCCATTTCGGTGATGGTCAGTTCCAGCCTCCCACCTTTGGTAACAGGCATCTTCACAACGCGGTAATCAACGACCTGAGCATCATCCAGCCAGAAACCTGCTTTAGTGAGTGCGTCAAAAGCGGCTTTTTGCAGATTATCCAGGTCACGGCGACGGCGATCCGGCATGTGGCACTCAATGCGGATTTTCACAGGCATAGCCAGGCCGATATCCAGCATTGCGTTTTTAATGATTCGGGCGACGTTATCGCGGTATGCCTGCCCCTCTGCGCTGACGTGCGTGCGCCCGCGATTATGGCGGTAATAGCGATTATTGCTCGGAGGCCAGGGTAATGTGATGTTGTAGGTATTCACGCCTTGATTACCCCCTCTTTCATCCAGATAACCTGCGTTCTCGCCATACCTTCCAGCGCGCATTCTTTTGCATATGCGGCATCGACAAAATGTGTGCGGCGGTCGATTTCGTCGTGGCAGGCAGAACATGCAATGGTGGCAATCAGGTCTGGCGGTTTGGTACCGGTGCCGCACAATCCAGTCAGCCGGATATGTGCCAGTACAGACGTTTCAGGGTTGCCATTACATACGCCAGGGGTTCTTACCTGGCATTCCCGACCACGCGCTGCTTTTCTCAAATCAGCCATGATTCCTCCTTGCTGCCAGTCGCAACCATTTTTTATCAACCAGGCTGGCGGTATATCCGAGCAGTGTTGGTATTTCGGATGGCTTCAGCTCAGGTTTACGCTTACGACGATTTGGTACTCTGTAGATGTGTCCGTTCATGACACGAATAAGCGGTGTAGCCATTACGCCTCCTGCTTGTCGCGCAGCAGCTGGAACTCGCAGCTCTGCGGAATAGTCAGGTGGCAGCCAATATTCATCGCCCAGGCTTCAACCTTACACAGGAAGACATACATCTCTCCGGCATCAAGATCGGAGGTATGGCGTAACGACTGGATAGTGGTGATTTCACCGGTTACGACATCAACCAGTTCTTTGGTTTCATAACCGAGGTATGTGTGTTTGAGAGCTTCTTTTAC
CCATGCTGCGGTAGCGAACGATTTCCCCCTGCTGATAAGGTATTCACTGATTTCGCTGTACCACATGTGGCTGAGTGCATTCTGGGAAAGACTGCGTTTCTCACGCCACGGTTTAAGCACCATGCGAAAGCATTTTCCGTCCTCCAGATAAGGCTGGATCTGCTGGCCGATAGCGGTGAAGTTGCCGCGATGTAATTTGATGCCATCTTGTGGTAGGTTCACGCTTCACCTCCGCAGAGGTCAAACGCAGGATGCAAAAAATCGCAGGTGCATTTCTGCATCTGTGAAGGGAGAAGAGAGTTTGGATTGTATGTGCGCATAAACGTCCCCGTTTAGCGCAGAAGTCACCGGAGTTGTTCAAGCTCCGATGACTTTATTATTACGAATTGATTTTACAAAATCAAAAGGTATGTTAGTGACGCGGGTCTGTTATTATGCGAGAAGGGTTTCCGTATAAAACAAGGACCTTACTTCCTTGAGTAAATAACGGATCTTTGCCTTGAACAATGGTCATTAAATTCCCATTCTCAGTTTCGACAACATATTCCATGCCTGTTTGTTTTGTTGCTGAAGATTCGATTGCTGCCCCGGCAATACCACCAATGACTGCACCACCAACGGCACCAACGATATTAGAACGAACTCCCCCACCAAGCGCAGAACCAGCGGTTGCCCCCACGGCAGCCCCAGCAGTCCCGCCTAACGCGGAAGTCCCACTGATATCAACCCCCCTGGCACTAATAACTGTACCAGCGATAGTTCGATTAACCATGCCCACAGAGCCAACAGAATAACTATTTGGCGATATATTTTGTGCGCATCCAACCAACACTAAGAGTGGAGCAATTACGAATAATCGCTTCATTTAGCTACCCTAACAGGAAACATTGGACGAGAAAGATCAACACTTTCTAATGCTTGCAAGAACTGCGTTATGTTGTTTTGCACCGCGCGATTAACAGATTCGCGTGCTCGAACAATACCGTAGAATGCGTAACTGGCTGGAACAGTACCGGTAGACTCAATATCCTGCGTATATATAATATCACCATTCGCACGGTTGATTATTTCATACCTTGCAATTGCTTTAGTTGTCATTGAAACACCAAAAGCAGGAACGTCAAGAGCCAACACTTTAACATTTAAGCTAACCGTATTTGGTGAACTATCACGAAAAATAGTCATTCGGTCGAGTGCTTCCTGCAAAGATTCACGCCAAATTGGAGTTATAGCCTCCATACCAGCAGTGATATCCCCTTTCTGCTCATCTGGACGAGCAAGTGATACCGTTAATGACTTAATTTCAGCATCTATTTTTTTCTGGCTAACTCCCACGTTAGGTGTTGAAAAATTCAATGGTGGCACACTAGCGCAACCTGTTAAAGAACCAATAATCATGGCTAATAATATTATCTTCTTCATAAATTTACCTTATTGTTATAACCAAAGGAATTATAAAGTAAAAAAGTTCACTATCACTAGCCATTAACGACATCAATTTCAGAGAAACATGGTACTCATTTCCACAAATTTGACACAAGTCATTTTCATCTACATATTCCATCATACTTGATGCATATGTTATTGAAGCCTCTATCCTATCCGTTCATAATAGCAATAGTTACCCGGGTGATAGTACCTCTATGATTACTCGTCTTTCTGATTGATTGGATTAAATATGCGCGCCAAAATTTATCAACTTTCGTTATGGATATTTATTTCGTTTCTAGCGATCTATGCCTTTATTATCTATAAAGGTTCTTATATTGGAGTAGCATTGCATCAAATTGCTTGGATCATCATTATTGCCTCTGGCTTGATTGCTAGGCTAACTAAACCAAAGCAAAAACCAATTTCGTCCAATAATTAGACATGTATTAAAAAATGATATTTTTATGTACATAGTCTATTGAAAATTGCCGCGATAAAATGCCAACACCCGCTTCATCGCGGCACTCTGGCGACACTCCTTGAAAATCAGATTCGTGCTCACCTTTCCT
TCCCGTTCTTCCCTGGTAGCGAACCGGTAATACACCGTTCGCCAGACCTTACCATCAATAACTAAGATTCCTGTCCGCGCCATTTTAGCCGCAGCCTGGTTTATGCTGGTTACTGTTGCGCCTGTTACCGCAGCAACGTCCTGCGCACAGAAGCTCTTATGCGTCCCCAGGTAATGAATAATTGCTTCTTTTCCCGTCATACACTGGCTCCTTTCAGTCCGAACTTAGCTTTGATTTCTGCAATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCCCATGACAGGAAGTTGTTTTACTGGTTCAGGGATCGCCTCACCACGGTTAATTCTCGCAGTCATATGGACAAGCTCATCTGCGGCCTTACGGCGTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTAACCAGCCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACAGGCCTCGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACGTTTGGCAGTCCGGCGATAACGGATGCTTCTTCCCGGCACCATGCAACAAACTGCCCGGGTGATGGCAGGAATGGTCGATTCTGCCGACGGGCTACGCGCATTCCAGCGTTAACCTGTTCCATTGTGGTGATCCCGTTTTCCCGAAAAGCCAGCACCCACTGGCGGCGGATTTCGTTCAGTTCGTTCTGGTCCCGGTTAGCCAGGCTCGCTGGGAAAGTTGCCAGTAACTGGCTGAATACACCGTTGATTATCTGCGCTACCTGCTGTACCTGCGGCTTTTCGTCGTACTGTTCCGGCATGTTATTGGCGATCCGGCACATCTGCTCACGGTCAAAGTTAACCATCTGTGCGGCGATGTTTTTCATAAATCCACCCCGTAAATCCAGTCAGTGTTCGTCAGGTCGAGTTTTGGTTTGCCGGCTGTCACGCCAGCCTGTTGCTTGTTTCGGTTGATTTCGAGCTGGGTCCACTTGTCGCGGAGTTTGGCCGGACTCAGCACGTTACCGGACCAGAAGTTGTCCTGGCAGGCCCAGCGGAACAGTACACACATGTCGCGGTGGTTACGTCCATCACGTTCACGCATCAGACGGATATCGTTAGCCCACCCTGCAAAATTCGGTTTTCTGGCTGATGGCGCGATGGTCTTCACCATGTCAAACATCCACTCTGCGGCGGTCAGGTCTTCTGCTGTCCCCCACTTGCTGCCGTTCTGAATCGCAGCATCCGCTTTCACCACAGGAAGGTCGTTTTCTGGCAGGTCAGAGGATTCGCCAGAATTCTCGGACGAATAAGGTTTTATATTGTCTTTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAGTGCCTTTACCTGATTTGGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCTGATTCGGGTAATGTTGACCATTCACTGACCACATTATTAATGCCTATATTCCGCCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTCAACTCGGAAAGTTGCTCGTTGCTCACCCAATCCAGTTTTTTATTAAAGCCATATGTTTTGCGCATGACAGCCAGGAAGACCAGAAGCTGGTGCTGTGTTAATCCGGCCAGCATTACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCGGCTCCTTTTGTGCCGCATCCGGCACTGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCAGCAGACCGGCTTTCGCCATTTCTGAACCTGTCATATCGCCCCCAGCATGGTAGTAACCATCGCCATCAATGGACCAGCCAGATCTGGGTCCACACGAAACATCGACACAATACCTTCACT
AATTTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACAGCCTGTTTTGCCTCACTGAGTTCCTTTTCCATTTCAGCCAACCTAGCCATGAAGCTATCCTGCTCAACCAGGTAACCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATGGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAAGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTGGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATAGACGGCCTGCTGTGAAACTTCGCAAGCAGCGCCCAGTTTCTTTTGTGAACCAACGATATTGATCGCTGTTTTGATAGCTGGGTTCATAACAACCTCCGTGGTTAATTTGAATCAAGATTAAAACTATGGTTGTTTTTAGTCAACAACCATTTTCGTTTGATGGAATAAAACCTTGGTTGTACATTTGGACTATGAAAACAACACTCTCAGAAAGACTTAAAGAAGCCAGATTAGCGCGAGGCCTTACACAAAAGGCGCTTGGGGATTTGGTCGGGGTTAGCCAAGCTGCTATTCAGAAAATCGAAACAGGGAAAGCTAATCAAACAACTAAAATCGTGGAGATCGCGAACGCTTTGGGTGTGCGCGCAGAATGGTTATCTTCTGGCGTTGGAAATATGTCAGACAGTACAGTGCAACCAATACAATCAACTGTCAGCCATTCCAAATACTTCAAGATTGACGTTCTTGATATAGAAGTCAGTGCTGGGCCGGGAGTCATCAACCGTGAGTTTGTAGAAGTTCTACGCTCGGTTGAGTACTCGTTTGACGATGCTCGTCACATGTTCGATGGTAGGAAGGCGGAAAATATCCGCATCATTAACGTGCGTGGTGACAGCATGTCAGGAACGATCGAACCAGGTGATCTGCTGTTCGTTGATATCACAGTTAAATCTTTCGACGGTGATGGTATCTATGCGTTTCTGTACGACGACACAGCCCATGTAAAGCGCCTGCAAATGATGAAGGATAAGCTGCTGGTCATCTCTGATAACAAAAGCTACTCACCGTGGGACCCGATCGAGAAAGACGAGATGAACCGGGTGTTCATCTTCGGTAAGGTTATTGGGAGCATGCCGCAGACATATAGGAAGCATGGTTAAAGTGAGGCTAAAAAACAGTTACAGCAATAGGCCTGTTGTTTTTCTTTAAACACGCAGTGTTAAACCGCTCTTTGAGATGCGGAGTAATGAGATGGAAGACTTGAATCACATAAGGGTTAGTGATGGAGTGCGTAGCGAGCAGCAATAGTGCAATACCTAATGTCGTTGAAGTAATACGTCGCATCAATGAAGGTTCCACTCAGCCATTTCTTTGCAAATGTGATGATGGGCAGTTGTATGTTTTGAAGTCAAAACCATCAATGCCCCCGAAAAATCTCTTAGCTGAGTTCATTTCGGCGTGTTTGGCTAATGATATCGGCCTTCCTTTACCTGACTTTAAAATCGTATTTGTGCCAGAGGAACTTATAGAGTACTCACCTGATCTGCAGCAACAAATTTGTACAGGATATGCCTTTGCTTCATTGTTCATTGACGGTGCAATAGCGTTAACGTTTA
CGCAGTCAAGAAACGAAACGATCATCCCAGTCGAACAGCAAAAATTAATCTATGTTTTTGATAAATGGATATTAAATGCAGACAGAACGCTTACTGACAAAGGTGGAAACGTTAACATCCTTTATGACATCAGTAACGATAAGTATTATCTGATTGACCATAATCTCTCATTTGATCAGAATGCTGGACCTGAAGATTTTTCTGTGCACGTGTACGGCCCTGGTAACCGCAAATGGCAATATGATTTAGTGGATCGCGTAGAGTACCGCCAGAGGGTCGTTAACAGTTTACACAAGCTTCCTGCTATCCTTGACGAAATTCCAGAAGAGTGGATAGTAGATGAGGAGTTTTTACCTTTTGTCTGCACTACGCTAGACAAAGGTGATTGTGATGAATTTTGGAGCGCAATAGAATGACAACTCCATGCCTATATAGCATCGTTCGCTATGCGCCTTATGCGGAGACTGAAGAATTCGCAAACATAGGCGTACTTCTGTGCGCGCCAAAAGAAAATTACTTTGATTTCCAGCTCACAAAGCGAAATGACTCTCGTGTAAAGAATTTTTTCCATGATGATTGTATTTTCCCTGTAGCAAAAGACTCAATACAAAGAGAACTACAGTTCGCAAAAATGCATGCGACCCAGATTGTTGGACATCAACAACTTGCACAATTCTTCAGATATTTTACAAACAAAAAAGAATCAATTTTTCAGTTCAGTTCTACGAGAGTGATTCTCAGCGAAAACCCAAAAGAAGAGCTGGCCCGCATTTACAATAAATATGTAAACCACTCTGACTACACAAAAGAGCGCCGTGAAGATGTTCTAGCCAGAGAGCTAAAACGAAGTATCGATAGAATAGATGGATTGAAGAACGTCTTCAAACAAGCAACCATTGATGGGTATTTCGCAAAGTTCTCAATGCCATTGGTCGCCAAGAAGCATGACAGGATCCAATGTGCCATCAAACCTCTGGCATTCACTCAAGCTGAACCAGGAAAAATGATGGAGCATAGTGATACTTGGGTGATGAGAATAACTCGAGCAGCAGAAGAAAACCTGCTTTCACTTGATGACATTTTATTCACAATTGAAACTCCTGAATCACCAAACTCAGGCCAAAGCAAAGTTATTGACATCATAAAGAGAACTATGGATGCTAAGAAAATAAATCATATACCTGCATCCAACCACAAAGAAACTATTGATTTTGCAAAAAAAATACTTCCCCAAGTTTAAAATTTATTTTTGTATGTGATATTCCTTATTAATAACCCGGCCACCGTGCCGGGTTTTCTTTTGCCTCCCCTCATCACACAAACCGCTCAAAAAACCACCATAACCTCGCTTCAGTTATCGCTATGCGATTCAAGTCACAAAATAAATCCATCCTAAATACAACCAGTTATATCTAAAACAACCAATAAAACAACTTTTGTTGTTGACGGTAAAACAACTATAGTTTTAAATAGGTTCATCGCAACAACACAACGATACGGCAACCACCTGATTCACCGTTGCGATGACCGCTTAGATCCGCAGTTTGAATTTCAGCAGGCTTCGGGGAGTGCGAGGGGTGAAACGGACGCGTGAACGTCGGTGTGACCAGCTGAAATCAACTCAACATTTCATACCTTAGTCGCTTCAACGAGGCGGCTTAGTTATGACAACCGGCGACCATCCACCGCCTGAATACGCGCAGAAGTCTCTATATGTTCAGCAGCCCAGCTTACGGGCAGGAGTTTTTATGGTTCATCAACATTACGGAACGCAGACCGTTAATCGCGGCGCGGTCATGCCAGGAATGCTGGTCAAACACAAAGATGGTACCTGGACTGCATCAGCTAATTTACGCGGACGGCTTTATCTGCATCGCGGCATCGAGCGCACTTATACCCGTGATTTGCTCGTGGAAGTTTTTCTCGACGGACGCGGTAACGGCCTGAATCACTAATCCCCTTTCCTGTTTTCCTAATCAGCCTGGCATTTCGCGGGCGAT
ATTTTCACAGCCATTTTCAGGAGGTCAGCCATGAACGCTTATTACATTCAGGATCGTCTTGAGGCTCAGAGCTGGGCGCGTCACTACCAGCAGATCGCCCGTGAAGAGAAAGAGGCAGAACTGGCAGACGACATGGAAAAAGGCCTGCCCCAGCACCTGTTTGAATCGCTATGCATCGATCATTTGCAACGCCACGGGGCCAGCAAAAAAGCCATTACCCGTGCGTTTGATGACGATGTTGAGTTTCAGGAGCGCATGGCAGAACACATCCGGTACATGGTTGAAACCATTGCTCACCACCAGGTTGATATTGATTCAGAGGTATAAAACGGATGAGTACAGCACTCGCAACGCTGGCTGGGAAGCTGGCTGAACGTGTCGGCATGGATTCTGTCGACCCACAGGAACTGATCACCACTCTTCGCCAGACGGCATTTAAAGGTGATGCCAGCGATGCGCAGTTCATCGCATTGCTGATCGTCGCCAACCAGTACGGCCTTAATCCGTGGACGAAAGAAATTTACGCCTTCCCTGATAAGCAGAACGGCATTGTTCCGGTGGTGGGCGTTGATGGCTGGTCCCGCATCATCAATGAAAACCAGCAGTTTGATGGCATGGACTTTGAGCAGGACAATGAATCCTGTACATGCCGGATTTACCGCAAGGACCGTAATCATCCGATCTGCGTTACCGAATGGATGGATGAATGCCGCCGCGAACCATTCAAAACCCGCGAAGGCAGAGAAATCACGGGGCCGTGGCAGTCGCATCCCAAACGGATGTTACGGCATAAAGCCATGATTCAGTGTGCCCGTCTGGCCTTCGGATTTGCTGGTATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACTGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATCGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTTAAACAGAAAGCCACTGAGCAGAAGGTGGCAGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGGGATGATGCATGGCACAAATTACGGCTCGGCGTCATCACCGCTTCAGAAGTTCACAACGTGATAGCAAAGCCCCGCTCAGGAAAGAAGTGGCCTGACATGAAAATGTCCTACTTCCACACCCTGCTGGCTGAGGTTTGCACCGGTGTGGCTCCGGAAGTTAATGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAACGACGCCAGAACCCTGTTTGAATTCACTTCCAGCGTGAATATTACTGAATCCCCGATCATCTATCGCGACGAAAATATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAACTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAATTCCGGCTCGGTGGTTTCGAGGCAATAAAATCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACGCGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGCATGAAGCGTGAAGGCCTGCATTATGTCGTGGTTGAGCGGGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGTGCGATCACTTTCGTCTACTCCGTTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCCCTGAAAGCACAGCGGCTGGCTGAGGAGATAAATAATAAACGGGGAGCTGTATGCACAAAGCATCTCCCGTTGAGTTAAGAACGAGTATCGAGATGGCACATAGC
CTCGCTCAAATTGGAGTCAGGTTTGTGCCAATACCAGTAGAAACAGACGAAGAATTTCATACGTTAGCCGCATTCCTTTCACAAAAGCTGGAAATGATGGTGGCGAAAGCAGAAGCAGATGAGAGAGACCAGGTATGACAACCACTGAATGCATTTTTCTGGCAGCGGGCTTCATATTCTGTGTGCTTATGCTTGCCGACATGGGGCTTGTTCAATGACACCTCAGCAAGAAAACGCCCTTCGCAGCATTGCCCGTCAGGCTAATTCTGAAATCAAAAAAGCCAGACAGCAGTTTCCGGATAAAAACGTCGATGACATTTGCCGTAGCGTACTAAAGAAGCACCGCGAAACGGTAACGCTGATGGGATTCACACCGACTCATTTAAGCCTGGCGATCGGCATGTTGAACGGCGTCTTTAAGGAACGGTGAACATGAAAAGCAAAATCATCAGGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATAATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGATAGATGAACGAAGACGACTGGCCGTTCAGGGTTGTCGGACTTGTGCCAGTTGCCAGCAAGATCTGGAGCTTATCAGTAAACAGAGAGGTTCGAAGTGAGCGAAATTAACTAGAAGCCAAAGATAAAATCATCGCTGAGCAGGAGAAAATCGCTAACGGAGAAAAGACAGTAAGTCAGTATATGAAAACCGCATGATATCATCAGATAAAAATCGGTCGTAAAGCGAAATATTAATACCAGAACAAACGAGTCGAGGTAAATTATATTACCTCTATAAATTAACTAAAACTTGCCCGCTATATACTATATCATTCAGTATCATCACGCGCGGTCTGTGCATATGTCACTACCGCACCTAATATATTAATTTTCTTTTCAACATAGATAATATTATCGTACTCATAATTGCCATACGGATAGCAAATGCGAATATTCTCATGTAGATCGGGGTCATCCACCTCAGCTCCAGAACAACTTTTTGAACTACCGGAAGTATACCGATACGGTGCAACATAAGACGATGTCTCTCCAGGCAAAAAATAAGTTAGTGTCGTAAGGGGTATAATCAGAAAAAATCCAGCAAATATGCACATCCCTGCATAAACCTTAAGGTATGCTGACAGACTCTTCCAGCCGCTTTGTTTTACTATCCCCTTCTTAACCCAAAACAGAGATAACAGAAAAGCTATTCCCATGCTAAACAGAATGTAATAGTGGGATATACTCTGATTAAGAAACGTGACCCTGTAGATATCTGCCCGCCACCAGAAGAAAAGGAAAATAAAGATCAGCCCTGAAACTGTCATGCAAATCAAATAAGGATACGAATCTTTTTTCATGTTTAGCGCCCATAAAATTTTTCCTGACCCGGACAAATTTACCATCCATTTTTTGCGCAGAAAATAGCTCATTACTTACTGCACAATAATACACAAAATTGCGTAAATTTTTTGCATGGATTTTAGCTCTTTCAGCCGACATTTAAGGGGTAAATAGCATTTCCTAAAAGCAACTGCACCAACCCAACAGAATGGGCTACCGCTTACGTTGAGAGCAAAAAAGTGTATAGCAGCAATGAACAGCATCCTCGCACTGACGAGGATTTCTTTTATCTGAACTCGCTACGGCGGGTTTTGTTTTATGGAGATGATAAATGCACTTCCGAGTCACAGGTGAATGGAATGGAGAACCATTCAACAGAGTTATCGAAGCCGAGAACATCAGCGACTGCTATGACCACTGGATGCTGTGGGCGCAGATAGCACATGCAGACGTAACCAATATTCGAATTGAAGAACTGAAAGAACACCAAGCCGCCTGATGGCGGTTTTTTCTTGCGTGTAATTGC
GGAGACTTTGCGATGTACTTGACACTTCAGGAGTGGAACGCTCGCCAGCGACGCCCAAGAAGCCTTGAAACAGTTCGTCGATGGGTGCGCGAATGCAGGATATTCCCTCCTCCGGTTAAGGATGGAAGAGAATATCTGTTCCACGAATCAGCGGTAAAGGTTGACTTAAATCGACCAGTAACAGGTAGCCTTTTGAAGAGGATCAGAAATGGGAAGAAGGCGAAGTCATGAGCGCCGGGATTTACCCCCTAACCTTTATATAAGAAACAATGGATATTACTGCTACAGGGACCCAAGGACGGGTAAAGAGTTTGGATTAGGCCGAGACAGGCGAATCGCAATCACTGAAGCTATACAGGCCAACATTGAGTTATTTTCAGGACACAAACACAAGCCTCTGACAGCGAGAATCAACAGTGATAATTCCGTTACGTTACATTCATGGCTTGATCGCTACGAAAAAATCCTGGCCAGCAGAGGAATCAAGCAGAAGACACTCATAAATTACATGAGCAAAATTAAAGCAATAAGGAGGGGTCTGCCTGATGCTCCACTTGAAGACATCACCACAAAAGAAATTGCGGCAATGCTCAATGGATACATAGACGAGGGCAAGGCGGCGTCAGCCAAGTTAATCAGATCAACACTGAGCGATGCATTCCGAGAGGCAATAGCTGAAGGCCATATAACAACAAACCCTGTCGCTGCCACTCGCGCAGCAAAATCAGAGGTAAGGAGATCAAGACTTACGGCTGACGAATACCTGAAAATTTATCAAGCAGCAGAATCATCACCATGTTGGCTCAGACTTGCAATGGAACTGGCTGTTGTTACCGGGCAACGAGTTGGTGATTTATGCGAAATGAAGTGGTCTGATATCGTAGATGGATATCTTTATGTCGAGCAAAGCAAAACAGGCGTAAAAATTGCCATCCCAACAGTATTGCATGTTGATGCTCTCGGAATATCAATGAAGGAAACACTTGATAAATGCAAAGAGATTCTTGGCGGAGAAACCATAATTGCATCTACTCGTCGCGAACCGCTTTCATCCGGCACAGTATCAAGGTATTTTATGCGCGCACGAAAAGCATCAGGTCTTTCCTTCGAAGGGGATCCGCCTACCTTTCACGAGTTGCGCAGTTTGTCTGCAAGACTCTATGAGAAGCAGATAAGCGATAAGTTTGCTCAACATCTTCTCGGGCATAAGTCGGACACCATGGCATCACAGTATCGTGATGACAGAGGCAGGGAGTGGGACAAAATTGAAATCAAATAA
Protein sequences of DBSCAN-SWA_5 >CP033401|1958145:1985349|1971666_1971837_-|AYQ01783.1|DBSCAN-SWA MATPLIRVMNGHIYRVPNRRKRKPELKPSEIPTLLGYTASLVDKKWLRLAARRNHG >CP033401|1958145:1985349|1967107_1967323_-|AYQ01774.1|lysis|DBSCAN-SWA MKSMDKLTTGIAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >CP033401|1958145:1985349|1968592_1969552_-|AYQ01777.1|DBSCAN-SWA MIKKPVIGISGCLAGSAVRFDGGHKRADFLMDKLVEWVTFRPVCPEMAIGLPVPRPALRLVRSMQGNIRMCFSHDQNEDVTERMTEFSRSYMDKLKDVSGFVVCAKSPSCGMERVRVYDENGNRGRKDGVGLFTSTLMEKFSWLPVEEDGRLHDPVLRENFVERVFALHELNHLYKEKLSRRELLAFHSRYKLQLLAHSQAGYKDMGPFVAAIHEWADLESYFEVYRDKLMAILRKPASRKNHTNVLMHIQGYFSNYLSTRQRKELSEVILNYRSGTLPLLAPLTLLKHYLGEYPNDYLLTQNYFDPYPDELALRLMVN >CP033401|1958145:1985349|1959493_1959970_+|AYQ01763.1|DBSCAN-SWA MKLISNDLRDGDKLPHRHVFNGMGYDGDNISPHLAWDDVPAGTKSFVVTCYDPDAPTGSGWWHWVVVNLPADTRVLPQGFGSGLVAMPDGVLQTRTDFGKTGYDGAAPPKGETHRYIFTVHALDIERIDVDEGASGAMVGFNVHFHSLASASITAMFS >CP033401|1958145:1985349|1974964_1975894_-|AYQ04291.1|DBSCAN-SWA MANTAEIFNFPVPDAAQKEPRVADLDDGYTRIANELLEAVMLAGLTQHQLLVFLAVMRKTYGFNKKLDWVSNEQLSELTGILPHKCSAAKSVLVKRGIFIQSGRNIGINNVVSEWSTLPESGKKNKVYLKEVNLPESGKKSLPKSGKGTYPNQVNTKDKLTKDNIKPYSSENSGESSDLPENDLPVVKADAAIQNGSKWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIRLMRERDGRNHRDMCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQQAGVTAGKPKLDLTNTDWIYGVDL >CP033401|1958145:1985349|1963059_1963197_-|AYQ01767.1|capsid|DBSCAN-SWA MAYEPCQGVIIGIMPQHVLSKSELRLDAIFSLKRKTLLQYLEPWF >CP033401|1958145:1985349|1972288_1972390_-|AYQ01785.1|DBSCAN-SWA MRTYNPNSLLPSQMQKCTCDFLHPAFDLCGGEA >CP033401|1958145:1985349|1975980_1976520_-|AYQ01791.1|DBSCAN-SWA MQPLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGYLVEQDSFMARLAEMEKELSEAKQAVILNAPRHQKLKEISEGIVSMFRVDPDLAGPLMAMVTTMLGAI >CP033401|1958145:1985349|1983030_1983633_-|AYQ01803.1|DBSCAN-SWA 
MSYFLRKKWMVNLSGSGKILWALNMKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >CP033401|1958145:1985349|1973748_1973940_+|AYQ01788.1|DBSCAN-SWA MRAKIYQLSLWIFISFLAIYAFIIYKGSYIGVALHQIAWIIIIASGLIARLTKPKQKPISSNN >CP033401|1958145:1985349|1971024_1971387_-|AYQ01781.1|DBSCAN-SWA MNTYNITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE >CP033401|1958145:1985349|1970424_1970802_-|AYQ01779.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVGGMTFMALARKHGRSDCWVGRMLQKAEGVVEGMLMVLDLRLEMDADCSK >CP033401|1958145:1985349|1973976_1974270_-|AYQ01789.1|DBSCAN-SWA MTGKEAIIHYLGTHKSFCAQDVAAVTGATVTSINQAAAKMARTGILVIDGKVWRTVYYRFATREEREGKVSTNLIFKECRQSAAMKRVLAFYRGNFQ >CP033401|1958145:1985349|1981861_1982044_+|AYQ01799.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >CP033401|1958145:1985349|1976924_1977614_+|AYQ04292.1|DBSCAN-SWA MKTTLSERLKEARLARGLTQKALGDLVGVSQAAIQKIETGKANQTTKIVEIANALGVRAEWLSSGVGNMSDSTVQPIQSTVSHSKYFKIDVLDIEVSAGPGVINREFVEVLRSVEYSFDDARHMFDGRKAENIRIINVRGDSMSGTIEPGDLLFVDITVKSFDGDGIYAFLYDDTAHVKRLQMMKDKLLVISDNKSYSPWDPIEKDEMNRVFIFGKVIGSMPQTYRKHG >CP033401|1958145:1985349|1978482_1979310_+|AYQ01794.1|DBSCAN-SWA MTTPCLYSIVRYAPYAETEEFANIGVLLCAPKENYFDFQLTKRNDSRVKNFFHDDCIFPVAKDSIQRELQFAKMHATQIVGHQQLAQFFRYFTNKKESIFQFSSTRVILSENPKEELARIYNKYVNHSDYTKERREDVLARELKRSIDRIDGLKNVFKQATIDGYFAKFSMPLVAKKHDRIQCAIKPLAFTQAEPGKMMEHSDTWVMRITRAAEENLLSLDDILFTIETPESPNSGQSKVIDIIKRTMDAKKINHIPASNHKETIDFAKKILPQV >CP033401|1958145:1985349|1980100_1980397_+|AYQ01796.1|DBSCAN-SWA MNAYYIQDRLEAQSWARHYQQIAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >CP033401|1958145:1985349|1977736_1978486_+|AYQ01793.1|DBSCAN-SWA 
MECVASSNSAIPNVVEVIRRINEGSTQPFLCKCDDGQLYVLKSKPSMPPKNLLAEFISACLANDIGLPLPDFKIVFVPEELIEYSPDLQQQICTGYAFASLFIDGAIALTFTQSRNETIIPVEQQKLIYVFDKWILNADRTLTDKGGNVNILYDISNDKYYLIDHNLSFDQNAGPEDFSVHVYGPGNRKWQYDLVDRVEYRQRVVNSLHKLPAILDEIPEEWIVDEEFLPFVCTTLDKGDCDEFWSAIE >CP033401|1958145:1985349|1972931_1973492_-|AYQ01787.1|DBSCAN-SWA MKKIILLAMIIGSLTGCASVPPLNFSTPNVGVSQKKIDAEIKSLTVSLARPDEQKGDITAGMEAITPIWRESLQEALDRMTIFRDSSPNTVSLNVKVLALDVPAFGVSMTTKAIARYEIINRANGDIIYTQDIESTGTVPASYAFYGIVRARESVNRAVQNNITQFLQALESVDLSRPMFPVRVAK >CP033401|1958145:1985349|1958145_1959435_+|AYQ01762.1|DBSCAN-SWA MTTDDLAFDQRHIWHPYTSMTSPLPVYPVVSAEGCELILSDGKRLVDGMSSWWAAIHGYNHPQLNAAMKSQIDAMSHVMFGGITHAPAIELCRKLVAMTPQPLECVFLADSGSVAVEVAMKMALQYWQAKGEARQRFLTFRNGYHGDTFGAMSVCDPDNSMHSLWKGYLPENLFAPAPQSRMDGEWDERDMVGFARLMAAHRHEIAAVIIEPIVQGAGGMRMYHPEWLKRIRKMCDREGILLIADEIATGFGRTGKLFACEYAEIAPDILCLGKALTGGTMTLSATLTTREVAETISNGEAGCFMHGPTFMGNPLACAAANASLAIIESGEWQQQVAAIEVQLREQLAPARDAEMVADVRVLGAIGVVETTRPVNMAALQKFFVEQGVWIRPFGKLIYLMPPYIILPQQLQRLTAAVNRAVQDETFFCQ >CP033401|1958145:1985349|1964954_1965188_+|AYQ01769.1|DBSCAN-SWA MSTKNRTRRTTIRNIRFPNQMIEQINIALDQKGSGNFSAWVIEACRRRLCSEKRVSPEANKEKSDITELLRKQIRPD >CP033401|1958145:1985349|1984082_1984301_+|AYQ01806.1|DBSCAN-SWA MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >CP033401|1958145:1985349|1982016_1982208_+|AYQ01800.1|DBSCAN-SWA MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >CP033401|1958145:1985349|1966006_1966159_-|AYQ01771.1|DBSCAN-SWA MPSMIAIILLIILHIWLCRQGGAHFWSESRLNISLLMLDIEHFARGKGLR >CP033401|1958145:1985349|1983757_1983943_-|AYQ01804.1|DBSCAN-SWA MFSASITLLNGSPFHSPVTRKCIYHLHKTKPAVASSDKRNPRQCEDAVHCCYTLFCSQRKR >CP033401|1958145:1985349|1971383_1971674_-|AYQ01782.1|DBSCAN-SWA MADLRKAARGRECQVRTPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDRRTHFVDAAYAKECALEGMARTQVIWMKEGVIKA >CP033401|1958145:1985349|1983875_1984043_+|AYQ01805.1|DBSCAN-SWA MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA 
>CP033401|1958145:1985349|1971836_1972292_-|AYQ01784.1|DBSCAN-SWA MNLPQDGIKLHRGNFTAIGQQIQPYLEDGKCFRMVLKPWREKRSLSQNALSHMWYSEISEYLISRGKSFATAAWVKEALKHTYLGYETKELVDVVTGEITTIQSLRHTSDLDAGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQEA >CP033401|1958145:1985349|1962446_1963115_+|AYQ01766.1|DBSCAN-SWA MVKLMKKNTDDGAKIYTPLTLKLYDWWVLGVSNRLAWGCPTKEHLLPHFLEHLGNNHLDIGVGTGFYLTHVPESSLISLMDLNEASLNAASTRAGESKIKHKISHDVFEPYPAALHGQFDSISMFYLLHCLPGNISTKSCVIRNAAQALTDDGTLYGATILGDGVVHNSFGQKLMRIYNQKGIFSNTKDSEEGLTHILSEHFENVKTKVQGTVVMFSASGKK >CP033401|1958145:1985349|1984278_1985349_+|AYQ01807.1|integrase|DBSCAN-SWA MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP033401|1958145:1985349|1968106_1968241_-|AYQ01776.1|DBSCAN-SWA MPYICSIILVLNSFDVRIGKEDILFKKGSAVLIDYNLKDFFHQI >CP033401|1958145:1985349|1962120_1962297_-|AYQ01765.1|tail|DBSCAN-SWA MQEFSEHIAPLQDAVDLEIATEEENSLLEAWKKYRVLLNRVNTTTAPDIEWPTVPIIE >CP033401|1958145:1985349|1966187_1966394_-|AYQ01772.1|DBSCAN-SWA MRKLKMMLFGASLIMVVGCSSKENALCHPQPKPPAPPAWAMMPPSNSLQLLDETFSVSGTELSATKQH >CP033401|1958145:1985349|1982218_1982500_+|AYQ01801.1|DBSCAN-SWA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >CP033401|1958145:1985349|1966610_1967108_-|AYQ01773.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIIPGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSALLRKINQGDIKGACDQLRRWAYAGGKQWKGLMTRREIEREVCLWGQQ >CP033401|1958145:1985349|1982598_1982820_+|AYQ01802.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >CP033401|1958145:1985349|1980402_1981188_+|AYQ01797.1|DBSCAN-SWA 
MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKATEQKVAA >CP033401|1958145:1985349|1964005_1964566_-|AYQ01768.1|terminase|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIKRRRSLSTRMRLTQQLI >CP033401|1958145:1985349|1981184_1981865_+|AYQ01798.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSSVNITESPIIYRDENMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP033401|1958145:1985349|1974266_1974968_-|AYQ01790.1|DBSCAN-SWA MKNIAAQMVNFDREQMCRIANNMPEQYDEKPQVQQVAQIINGVFSQLLATFPASLANRDQNELNEIRRQWVLAFRENGITTMEQVNAGMRVARRQNRPFLPSPGQFVAWCREEASVIAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKSNAHYWLVTNLYQNMRANALTDAELRRKAADELVHMTARINRGEAIPEPVKQLPVMGGRPLNRAQALAKIAEIKAKFGLKGASV >CP033401|1958145:1985349|1960715_1962047_+|AYQ01764.1|DBSCAN-SWA MIINQVPIKIKIFIFLFSCISIIFLLLHANNGIYITQTTQISYSVFIIGLFFINLMIFIFLLLYYVSNQRQSYLLILSFAFLSNTYYLLEVAIISLSPLGNDLSTIYQKSNDIAIYYLFRQFSFISIIFLAVYSTNVKNKSVLEDKRNIIIVVLSILILFITPFVAKNLSSDNIKYSLNIIQYSLNRHLPTWNIVYTKIISVFWLVLLISSCISIRNYSKIWLCIILISIVSVCNNLILLYFIDKSHPAWYMTKFLELISMIYIISTLMYYVFRKLNHANHMAIHDPLTNTYNRRYFIDSLKNISKHHDFSVIMLDIDSFKSINDKWGHHMGDQVIVMVTRIIKKSIRKEDVLGRLGGEEFGIIIKGNTQKLLLSIAERIRKNIEEQCSEKLLSHGPEKITVSIGCFTSKENNLSPSEMLVNADKALYQAKRTGKNKVIIHSK >CP033401|1958145:1985349|1969744_1970269_+|AYQ01778.1|DBSCAN-SWA MKLWPVLTGIALSFTLIACKAPTPPKGVQPITNFDANRYLGKWYEIARLENRFERGLEQVSATYGKRNDGGIRVLNRGYDPTKNKWSESEGKAYFTGDTKTAALKVSFFGPFYGGYNVIKLDDEYKYALVSGPNREYLWILARTPTIPDKVKADYVRTAQKLGFNVNELLWVKQ >CP033401|1958145:1985349|1970887_1971028_-|AYQ01780.1|DBSCAN-SWA MMFEFYMAELLRHRWGHLRLYRFPGSVLTDYRILKNYAKTLTGAGV 
>CP033401|1958145:1985349|1967510_1968098_-|AYQ01775.1|DBSCAN-SWA MIVDVEEKTVNDFFKSNTLSPFSVRRFYPAYLMVECEDFSLLKNLIACLNCDGRTVDFVRNQISLACLAILSSEKIVQSFLFGCLNSLGSKVKAIIHTDISAAWRLCDISSRLYLSESLLKRKLKHEGLSFSKLILEERMVMAERLLSYNLYSVGKVAEICGYENTSYFVSVFRRYFGVPPHQYSSRFFLEKDMM >CP033401|1958145:1985349|1972482_1972935_-|AYQ01786.1|DBSCAN-SWA MKRLFVIAPLLVLVGCAQNISPNSYSVGSVGMVNRTIAGTVISARGVDISGTSALGGTAGAAVGATAGSALGGGVRSNIVGAVGGAVIGGIAGAAIESSATKQTGMEYVVETENGNLMTIVQGKDPLFTQGSKVLVLYGNPSRIITDPRH >CP033401|1958145:1985349|1976589_1976820_-|AYQ01792.1|DBSCAN-SWA MNPAIKTAINIVGSQKKLGAACEVSQQAVYKWLHNKAKVSPEHVGSIVTATGGVVKAYQIRPDLPKLFPHTEKNAA >CP033401|1958145:1985349|1965244_1965655_+|AYQ01770.1|DBSCAN-SWA MAQVAIFKQIFDKVRNNLNYHWFYSELKRHNVSHYIYYLATENIHLVLENDNTVLIKGQGKVVNVRFSKNKCLIEATLKGFKSGELSFYEYRKNLATAGVFRWITNIHENKRYYYTFDNSLLFTENIQNTTQIFPH >CP033401|1958145:1985349|1979818_1980025_+|AYQ01795.1|DBSCAN-SWA MVHQHYGTQTVNRGAVMPGMLVKHKDGTWTASANLRGRLYLHRGIERTYTRDLLVEVFLDGRGNGLNH |
48 | Enterobacteria_phage(47.06%) | lysis,tail,integrase,capsid,terminase | attL 1960061:1960075|attR 1985423:1985437 |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2472839 : 2488527
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP033401|2472839:2488527|DBSCAN-SWA AATGGATATTACTGAGTTTCCTTCTGGAGTAATTGAACACCTTGGCTGGTATGTATACCGATTGATTGATCCGAGGGACGGAAGCACCTTCTATGTAGGGAAAGGCAAAGGTAACCGCGTATTTGCCCATATGCGCGGTGAAGTGGCAGCGACTGATGATGACGAGTTACTGAGCAACAAGCTAAAGCAAATTAGAGAAATAAGGTTAGCAGGACTTGAAGTTATCCATGTCATCCATCGACACGGAATGACTGATGAAAAGACGGCGTACGAAGTTGAAGCAGCACTTATTGATGCCTACCCTGGGTTAACGAATATCATGAATGGTGCTGGCAGCAATGAATTCGGCGCCGCGCATGTCAAAGAGTTGATAGCAACATATCAGCCCGAAACCATAACATTTCATCATAAAGCATTAATGATATCCGTTAACAGAAGTGCAAAGGATTCAGAGCTTTATGATGCGGTTCGATTTAGCTGGCGCATTAATGTCTCTCGCGCCAGCCAAGCAGAAATCATTCTTGCTACTGTAAGGGGGATCGTTCGAGGGGTTTTCATTGCTGATAAATGGCTCAAATCAACACGTGAAAATTTCCCTTCGTTGAAATACTGGGACGAGGATCCTGACTTTGAGGCAACACAAAGTTCGCGCTATGGTTTTGAAGGTCGAGAAGCCCCACCTGAAATAGCAAATCTTTATCTTGGAAAAAAAATACCAGATGAATTAAGAAAAAAAGGAGCTATGTCCCCGGTCCGTTACTCACCTAATTTTTGAGTCTTTAAGTGATAAGCATAAACCGCAGCACGTCATGCATACGTCGTGTCTGCGGTTTTTCTTTTTTGCTTACACGGTGTCTGGTTCTTCTGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTTTTCCATGCGGCCAGTAACGCTTTCTCTTCCTCCGTTGCGATTTCCAGCTCAACAACAGTCTGAACGTACCGGGAACAGCCTCCTTCAGAGCTTGAAGGATATCAATGTTCGCTTCCTGTTAACTGCCCGACAAGTGCAACCAGTTCGCTTACCTGATTTTCCAGAGTGGTGATCCGGGTGCCTGCTGTTGCCAGATTTTGACGTAGCGTTGTGTTTTCCTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAAGCCCGTCACGGCGGCGTAGTCAACATTCAGATAACGCGTTTCTTCACGTAATTCATTGCCGTCAACCGTTGGTCCCTGCAACTCTTTACCGTAATGAGTGAATGAGCCTACCGCTTCCGGTATTGCCTCCATTGCTTCCTGTGCAATAACACCAGCGTAAGGCAGTCCGTTTTCCTTAAGTGTGTAGGTGTACCCGTTCATTTTACGGATAGCTTCGGTTGCGTCACTGATAATCTGAATATTGTCTTTCAGCTCGCGGTCTGATAACTGATTCAGTGTTGTACAGTTAATAGCACCGTTTACATCAAACAGCTGACCTGACGATGTTTTTTGAGCATAAAACAGATACGCGGCAGTCGTTCCAACCTCAAAAACGTTTTGTCGATCACCTGAACCCCACACCTTGACAGCAAATGGTAGTTCTGTATTACCTAAGTTCTGTAAAACAAAACGATTGCCAGTCCCTGTTTGTTTTGTAAGGGTTAAATCAACAGTTGAGTTAACCTCATCCTTGTTGATAGTGAGCGCCTGCGCTGTAGCCCCGTTAACAGCACCTGTTTTGAGTTGAACCGCGCCGTCATTACCATTTAGCAGTATCTCAGCTCCGCTAAAGAAATTTTTTAGCGATAGCATCTTACTTACGCCGACTGATGAACCCAACGCCCACGCGAGAGAATTACCGGTGCTATCAAACCCACGTACAAAGCAATCCATTTTGCTATAGTCTG
ACGTGCTTCCAAGGACATCAATCCGCCCTCCGCCAGATTTTATCGGGTTAGATGTGGTTAATGACCTGACAGCAATATCGGTAGATGAATTGAGATCGTCTACTGTTAGTAATTTCTTCCATTCCTGCGTCGTTCCATTTTCAATTGTTCTTCCCCAAAAACCGGAATTGCGGCCTCCGAACTGCACAGCATAATTTTTACTAAATTGAACATGAATGCCGCCAAGAACCATAGAGCCTGCCGGGCCGTTTGTACTGCCTGCAATTGATCTAAATTTGTCGACGTTATCAGTATGCTGAGCGTTCCAGTCATGACCTGTTAATGTTGCAATTCCTGAGTTAGCTGTAATTAACCCAGACGCTTTAAGATTCTGCACATCAATTCTGTCGTTTGCGAAATTATATGCAATTGCGTTACTAACGCTTCCGGCATCATCATAATCGCGTTTCTGAATAAACCAGTCGCCAGCATTAGCAATGAGAGTATAAGTAGGCGTGTTAGCCGGGCGATCTGTTTCATTAAATCTTATTGCTGGGTTAGCGCTCCTTATTTGTAAAGGTTTTTCAACCGTTGATTTCAGTATCGCACTATCTACAATTAAATTACCTTCTGAATTAAGGTTTAGATATTTTGAAGCACCAGTTGAACCATCGTATGTAAACCTTATTGTTAATTGACCTGGCTCATTTGATAAAGTTTCAACATACATATCAGCACCAAGACGCACAGTGCTGTCAGTTGCTAACAGTCTTGTATGGAGAATTCCACTAGATGTGTATGTATTGGTCGAATCACTATATCTATCAAGAAATAGACTGTTAAGTTGAGGGCTGTTGTTCCTACCTAAGCCAAGATTCGTTCTGGCACCATCAACAGTTGCCGCATTAGTGCCTCCTTGTCCAATAGGTAATGGAACCCATGACGATCCGTTATGGCAACCCCACAATCCAGATGTTGACACCTGAAGTCGTGGCGCTGATGGTGAGTAATTCGAATAAACGTAAGTAGTCGATTCGCCTTCTTCAATTCTTTCTGTTAATACTATTTTTTTCCATACACTCCAACCTTGTGTGCTGGTATATATCCTTCTATAAAGAATTGATGAATTGTTATATACAAAATAGCTCTGGATACAACCATCACTACCATTAGCACCAGTTCTTTGCACTAACAATGCACCAGCAAGCTGTATCGGGTAATTTAGTTCTGGCTTTGCGTTGGCAGACATTGGTTGGTAATAAAAACCGGCTGTAGTACCTTTTATATCGTTTAAGTTCGTATTCGCATCAAGGCCAGTTTTTGCTTCAAACATGACTTCAAGTTTAGAGCGCGCTGTACTTGCATCATTCGCCCCTGTGCCACCTTGCGCAACTGCGAGCGGTTGCCATCTACCAGCTTTAGGGTTAAATGCGCCCCACTGACCGTCAGCATCAACCTGTAAGTAACAGCCGTCCGCCTGAACGTCAGTAGATAAAATGATGGTTCTTGTGTCGCTTGAACGCTGAAATCGGATAACCTCATCTTTTCTTGCGTGGCGTGTCCATTGCGGGCCTATTGTCGGATTCCAGCGATAAGTATAAAGCGAGCGCGTAGCCCATCCCTGAAATACACCTGTATACGCTGGTGTTCCATCTACCTGACTAATAAACCCCGTAAGACTGCTTTCACCGGATGCAATCGATGGAAATCCTTTTGCATTACTCATAATGCGCATAAATCCGATATACCCTGATGGGTTGCCGGAAATATCAGGACAATCACGCGGGGCTGAGCCAAGACCGATATTATCGCCAACAATTAATTGCGCCTGGTTGCGATAATTAAGTGCGTCCGCCGCAGATTTCGCCGCGTTTGTTTCACTGGATTTAGCATTAGTTTCGCTGGCTTTAGCGTTTGTTTCGCTGCTCTTTGCTGCTGCCTCGCTATTTTTCGCGTTGGTTTCTGATTTTTTGGCTGCTGTCGCGGAGTTTGCCGATGCAGTTTGTGAGTCTGCTGCCGCC
TGTGCGCTGTTACCCTGGCGACGCGCCTTACGCGTTAACCAGCCTTCTGCTTCCAGCCGTGCGATAGCCGTTCTGACGGTACTCATCCCCGCGCCAATCTGACGGGCAATGGTTTCAATTGATGGCCAGCACACACCTTCGTCATTACTGAAATCAGCCAGGCGGGCCATAATTGCCACGCTGGATAACTTCATGCCTGATGCAGCGCAACCATCCCATACATAGCCGGTTAATTTAGTGCTCATGACCGACCTCTATTTCCCTGAATTTACGACGAAACTGTTCGAGCGGGCTGAAGCACTCATGCTCATAGCCTTCGCGGAGGTAGATAACCCGTTGTGTTTCCGGTTCCCAACGAATGACTCTGACGGGCACTCCGTAGTGATCTTTGAACCAGCGGTTAACTTGTCGCAAAGGACTGTCTCCTTCTGCCGGTTGAAATCACCCACAGCCCACTCTGCAAAGCTGTGGGTTACAATTTCCCTGTCACCTGGTACATTTACTGCATAGCAATACTCCACCTTCGCTTTTCCACCCGGTACAGGAAGCGCAATCAGTTGCGAGCGACGGTAGTGTGTTGTTAAACTGTTCATGCGTTAGTTTCTCCACAACCAGAAGCAATCGACGCCACGACGCCCGGAGCTGCACACTCGCGGGCGTCATTACTTTCTGAAATGCAAAAAATTTTGTAGACAAGTGCTGCATGCTCCTGCAGCTTCGAAATTGAGAGATACAGCTCGTCGTTAATTGCTGTCTTCTCATGCGGTTCCACTACACCGTCTTCGATTGCTGAACGAATCTGTTTTGAATAACTGCCGATCTGTTCAATGACTTCCAGCAGACGCTGGTTAATATCGGCGTTGTCCACATCCTCGACGTCAGGAAGAGACACAAAGACGCCATTTGCAGACTGCGCCACAGCGTCAGCAATGAAGTGAGTTCCACCAGCACGTTGCAAAATCATTGCCCATCCCAGCGGGAAAATCTGATCGCCATCGGCACGAAGGCGGTTAAATAATGCGTTCTCTGTTACATCCAGCCAGTCAGCTGCTTCAGCGTAACCCCCCGGCAACGCTGCGATAGTTTTTCTGACAGCTTTCACGTACCACTCAGGCTGTTTTTCTACTTTCCAGTGATGCTTACCCACGGTTAGCCTCATCGTTCTGTGGTTTCTGTTAATCGATTTATCCATTAGATTTTTCATAAAGCTCAGGTTTAAATGGCAACCGTCCGCAAGTTCTATATGCAGCTTCTGCTGCACGTCCTTTTGGAATTAACTGGCCCGGACGGTTTCGCCACTGATAAACGGCTTCAGTTGTTATGCCGAAAAAAGCAGCAACTTTCTCAATACTGCCGAAGTAGCTTTCGATATCGTCAGTTGTCATACGCCCTCCAAACTAAGTTTTATTAGATGCTAATTACAAATCTATCTTTGGTCAATAAAAACTAAGATTACTTAGCAATTAAAGAAATGGTGCTCCTATGGAAACGGTTGGTCAGCGTATAAAAGCTCTGAGAAGAGTTACCAGAACGTCCCAGAAAGAATTGGGTAAATTTTGTGGAGTAAGCGACGTTGCTGTGGGGTACTGGGAGAAAGACATCAATACCCCTGGTGGGGAGGCACTTTCGAAATTAGCGAAGTTCTTCAATACGTCAATAGATTACATTCTTTATGGTGCTGAGTTTGAAGGCAAACTCGTCACAAACATGCGCAGAGTTCCTGTAATATCGTGGGTTCAGGCTGGGCAGTTTACTGAGTGCAGGGCAGCAGAAGTGTTTAGTGAAGTGGACAAGTGGGTAGATACATCATTAAAGGTTGGTGATAACTCATTTGCATTAGAGGTTAAAGGTGACTCCATGACTAACCCTAATGGCCTCCCAACAATACCAGAAGGCGCAACAGTGATTGTAGATCCAGATGCAGAACCTCGTCATGGAAAAATAGTCATCGCTCGACTTGATGGAACAAACGAAGCTACAGTAAAAAAATTAGTCATCGA
TGGCCCTCAAAAGTTTTTAGTACCATTAAATCCTCGGTATCCCAACATCCCTATCAATGGTAATTGCCTTATCATTGGTGTAGTCAAAGGAGTTCAATACGAACTCTAAGGCCTCTCTTCTCTAACTAAGGCACCGAACTAAGAAAAGTTTGGTGTTTTCTCTTGCCATGACAACTAAGTTAAGTTAGATTTTATATCAAAGATAACGAACAGGCAGGACGCCCACGAAGTAGCCGCCTGGGGCATATGAAGTCCAGGATGATTCGTTAGCAACAAAAAAGCGCCCTACAGGACGCTTAGCTCTTTAACAATCTGGGTATCATCCAACCAATGCAAGATTTAAGGAATCCAAGGCGAATTCAGATCTCGCCCCAACTCACGTAATGATCTTGGTCGTTCGTACATCGGATTTTTTTCCATAAGAAATTTATTTTCACAGTGAAGGCAACGGCTTGTAAGAAAATGAGAAGTTTTACCTACTGGAAGCGGATGAAGTATCGATTTTATATTTTTGGAATAACAAAGTGGGCACAGATGCACAGTTATTTCTTTTCCACTCACAATTTCATTTTTAGAGTAAACAAAAGCACCAGAGTCAAGCTGATCAAGGACATATCCTTCTACCTTGGCACAAAAATCTTCAAACTCTGCAATTTTTGCTTTGAGATGCATCACCTCTTCATCACGAAGGCGGATCGCATCGCCAAGAGAGAAGCATTCTGCCTGAAGCGTGATTAGTTTGTTCTGGAGTTCAATGGTTGCAGCTTTAACTTCTGCATCCGTTTTCGCGTCATTAATAACCTTAGCAAGACCGGCAGTCTCCTTTATAGCGGCCATAGCCGCAGACAGTTCAGCTATCACGTTGAATACTCAGCTAGTTGTTGGGGATATCCAGATTAACCAAATCCTTGTTGTTGGGGAATAACTAGGTCCACCTCGCCTGATGTGGCTAAAAGCAGGCACATAACAGCTAAGTATTTTCAACCAGAGAGAATCCTTAGCGTTGTGGTGAATGCGGCTCAGCGCACGCGGGTTAAGGTTGAGGCTGACAGTCGACCTTCTGTGGATACCCACCCGCCTGGTGTGCAACCTTCGCCAGGCACCGGGAGGCACCCGGCACCACAACTTTATGCTGTGTGTAGTCCTCGCGGTACCAGTTTGTACACTTGCTTCCGGCTGGTACCGCTCTTTTTACAAAACAGAGAAGAACATCACCGGACGACGGGCTCATAACCCAATCCATCCGGGCGGCTGCCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATTAGGAGCAGAAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATTGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCAGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCACTCAACCTGGAAATCGAACCATTTGATGAGAACCGTGTGAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAAGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCATTAATATCGTTCTTAATTAACCGATTATTTATCTCATCACTGAATATCTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACGCAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTACTCTAAAGATCTTGCAGATGAAGGCACCCGCTG
CTTTATCGATGCCGATAATATGCGAGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACACTCAAACTGAAAAAGACAGCACCGTTCTCTGCTCTGTTGTCTATTAACGGCGAGCGTAACTCCCAGAAGTCACTAGCAGAATGGATTGAAGACTGGGCAGACTATCTTGTGGGCTTTGATGCTAATGGTGACGCTATTCAGGCAACAAAAGCGGCTGCGGCTGTCCGTAAAATCACGATTGAAGCAAACCAGACCGCTGATTTTGAAGATAATGACTTCAGCGGCAAACGCTCCCTGATGGAGTCTGTCGAAGCGAAGACCAAAGATATTATGCCTGTGGCATTTGAATTTAAATGCGTTCCGTTTGAAGGCCTGAAAGAACGTCCGTTTAAATTACGCCTCAGCATTATCACTGGCGATCGTCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCAGTGCAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAGAAATTCAAGGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCATTTATGGAAACGTAATTGACTCAATAATCGCCGGATGGTGAGGGCTTCCTTTTACCAGAATTCAGCGCGGTGCAGCGCATATACGTGGAGAACAAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAGGATAAATAAAGACGACATCGTTATTAACGATATCGCGGTTTCTCTTTCAAATATCTGTCGCTTTGCAGGGCATCTTTCACATTTCTACAGCGTTGCCCAACATGCGGTGCTTTGCAGCCAACTGGTACCGCAGGAATTTGCTTTTGAAGCGTTAATGCATGATGCAACAGAAGCGTATTGCCAGGACATCCCGGCGCCACTGAAACGCCTTCTTCCTGACTATAAACGGATGGAAGAAAAAATAGATGCAGTAATCCGTGAGAAATACGAGTTGCCCCCGGTTATGAGCACGCCCGTGAAATATGCCGATCTCATCATGCTGGCAACCGAACGCCGTGATCTCGGGCTTGATGATGGCTCTTTATGGCCTGTACTGGAAGGTATCCCGGCAACAGAGATGTTCAAAGTTATTCCACTGGCACCGGGCCATGCCTACGGGATGTTTATGGAACGCTTCAACGAGTTATCGGAATTACGCAAATGTGCATAACTCATGTAGTTAGTTTTTCTGGCGGGAGAACATCCGCATATCTTGTTCACCTGATGGAAGAACAAAGAAAGGCTGGCAATAACGTCTGCTACATCTTTATGGATACCGGTTGCGAACATCCGCTGACATACCGCTTTATCCGGGAGGTTGTGAAGTTCTGGGACATACCACTAACTGTGTTACAGGTCGATATAAATCCTGAGCTTGGGCAGCCAAATGGTTATACAGAATGGGAGCCAAAGGATATTCAGACACGAATGCCGGTGCTTAAACCGTTTATGGACATGGTTAAAAAGTACGGCACGCCATACATCGGCGGCGCGCTCTGTACTGATAGGCTAAAACTCATCCCTTTCACAAAATACTGCGATAACCATTTCGGGCGAGGTAATTACATCACATGGCTGGGTATTCGTGCAGACGAACCCCGTAGGCTGAAACCGAAATCGGGCGTCCGGTATCTTGCCGAGCTGTCAGATTTTGATAAGTCGGATGTTATCCGGTGGTGGCGAAAACAACCTTTTGATTTGCAAATCCCGGAGCATCTCGGGAACTGTGTTTTCTGCATCAAAAAGTCAACGCAAAAGCTGGGGCTTGCATGTAAAGACGAACCAGGTCTGATGCGAGTTTTTAATGAGCTGGTTACAGGCAAACACGTCAGGGATGGTCATCGCAGAACAGGTAAAGACATTATGTACCGTGGTCACCTGACGCTTGACGGAATTGCCAGAATGTCTGCCAACAGCGACTACA
GAAATTTGTATCAGGCGATGGTACAGGCCAGGCGATTCGATACCGGCTCGTGTTCAGAGTCATGTGAAATCTGGGGTGATCAATTGGAATTGGAATTCAAAGAGGTAGGGGTATGACAACCGAAATTAACTACCATGCACTGCTTGAGCGCGCACGGAATAAAGTGCAGAGCATTGAGTTCGCCTTAACACAGAGTGCATTCGCTGAGATTCGCGCTGAGCTTGAAAATGATTTAGAACTGGCACGGATTGCACTGGCATCTCTGGAAGTTGAGCCAGATGAACGCGCAGCCTATGAATTATTTATGGAAAAGCGTTTCGGTAAAACAGTCGATCGTCGGAGAGCAAAAAACGGCGATAACGAATACATGGCATGGGATATGACTCTCGGTTGGATCGTCTGGCAGCAACGAGCTGGTATCCATTTTTCAACAATGTCACAGCAAGAGGTGAAATAATGGAGCCATACAGCCTCACACTCGATGAGGCCTGTCATTTTCTCAAGATATCCAGACCGACTGCCATTAACTGGATACGCACAGGGCGTCTTCAGGCAACACGCAAAGATCCCACTAAGAATAAATCTCCTTACCTCACAACACGACAAGCCTGCATTGCGGCTCTTCAGTCTCCGCTGCATACTGTCCAGGTGAGCGCGGGTGATGGCATAACAGAGGAAAGAAAATGTCACTCTTCCGCAGAGGTGAAATATGGTACGCCAGTTTCACATTGCCGAACGGTAAAAGATTTAAACAGTCTCTTGGAACAAAGGACAAAAGGCAGGCGACAGAACTCCATGACAAGCTAAAGGCTGAAGCATGGCGGGTCAGCAAACTTGGTGAAATACCTGATATAACGTTCGAGGAAGCGTGTGTCAGGTGGCTTGAAGAGAAAGCACATAAAAAATCACTGGACGATGACAAAAGCCGGATCGGATTCTGGCTTCAACATTTCGCAGGAATGCAACTAAGAGACATTACTGAATCAAAAATTTATTCAGCAATGCAGAAAATGACGAACCGGCGTCATGAGGAAAACTGGAAACTCAGGGCAGAAGCATGCAGAAAAAAAGGGAAACCTGTTCCAGAATACACGCCAAAACCAGCGTCCGTTGCAACGAAGGCTACGCATCTTTCATTTATAAAGGCCCTACTAAGAGCCGCAGAGCGTGAATGGAAAATGCTGGATAAGGCACCAATTATTAAAGTGCCTCAACCAAAGAATAAACGGATCCGCTGGCTGGAGCCCCATGAAGCACAAAGGCTGATTGATGAATGTCCGGAGCCATTAAAGTCTGTTGTTGAATTTGCACTGGCAACAGGCTTAAGACGCTCGAACATCATCAACCTTGAATGGCAACAAATAGATATGCAGCGCCGGGTGGCATGGATAAACCCAGAAGAGAGTAAATCAAACCGCGCAATCGGCGTTGCGCTGAATGATACTGCATGTCGCGTTTTGAAAAAACAAATCGGGAATCATCACCGTTGGGTATTTGTGTACAAGGAAAGCTGTACCAAACCAGACGGAACGAAAGCGCCAACAGTAAGGAAGATGCGGTATGACGCAAACACAGCCTGGAAAGCGGCGCTGAGACGGGCTGGTATTGATGATTTCAGATTTCACGACTTGAGACACACCTGGGCAAGTTGGCTGGTTCAAGCCGGAGTCCCGTTGTCAGTGTTACAGGAAATGGGAGGCTGGGAGTCTATCGAAATGGTTCGTCGATATGCTCACCTTGCACCTAATCACCTTACCGAACACGCACGGCAAATAGACTCGATCCTGAACCCATCGGTCCCAAATTTGTCCCAGTCAAAAAATAAGGAAGGTACTAATGATGTGTAACTTATTGATTTAAATGGTGCCGATAATAGGAGTCGAACCTACGACCTTCGCATTACGAATGCGCTGCTCTACCAACTGAGCTATATCGGCCCTGAAAGGACATGTTCACGAACGTGAATCACGGTGGACAAGGTTAAAACTAACCGGGCGATG
CGTCAATGGCCTTGTGAATCAAATGGCTACTTTTGCATCACCCGGTTTTATTTACGCACGAATGGTGTAATCACCAATACCGATCCACTTGTAAGTGGTCAGTGCTTCCAGCCCCATTGGGCCACGCGCGTGGAGTTTTTGTGTGCTTACCGCCACTTCCGCACCTAGTCCAAACTGGCCGCCGTCGGTAAAACGCGTAGAGGCGTTAACGTAAACAGCGGACGAATCCACTTCGTTAACAAAACGCTGGGCGTTGCGCATATCGCGGGTCAGGATCGCATCGGAGTGTTGTGTGCCGTGTTCACGAATATGGGCGATGGCATCGTCAAGATCACTGACGATTTTGACGTTCAAATCTAATGACAGAAACTCATCGTCATACTCTTCCGCTTTAACAGCCACCACCTTCGCGGGGCCTGTCTGCAACTGCGCCAGCGCAGCTGCATCTGCGTGTAATGCCACGCCGCTTTCCTCCATTTGTTTGCTTAATGCGGGCAGGAAGCTATCGGCGATGTTTTTATTCACCAGCAACGTTTCTACCGTATTACATGTGCTCGGACGCTGAGTTTTCGCGTTGACGATCACTTTTAATGCTTCAGCAATCTCTACACTTTCATCAACATAAATATGGCATACGCCTATACCACCTGTGATCACCGGGATCGTCGACTGTTCGCGGCACAGTTTATGCAAACCAGCGCCACCACGCGGGATCAGCATGTCGATGTATTTATCCATACGCAGCATTTCACTGACCAGCGCACGGTCAGGATTATCAATCGCCTGCACGGCACCCACCGGTAAGCCACAGGATTTCAGGGCGTCCTGAATCACCGCCACCGTTGCCGCGTTAGTGCGACAGGTTTCTTTACCGCCACGCAGAATCACTGCGTTACCGGTTTTCAGGCACAGCGAAGCGACATCAACCGTCACGTTCGGGCGCGCTTCATAAATCACGCCAATAACCCCCAGCGGTACGCGACGACGCTCAAGACGCAGGCCGCTGTCCAGTACGCTGCCATCGATTACCTGCCCCACCGGATCGGCGAGGTTACACACCTGGCGCACATCATCGGCAATGCCTTTCAGCCGTGCGGGCGTCAGTGCCAGACGGTCAAGCATCGCTTCGCCAAGGCCATTGGCACGCGCGTCAGCAACATCCTGGGCGTTAGCGTTGAGGATGATTTCGCTTTGTGCTTCCAGTTCATCGGCGATTTTTTCCAGCACGCGATTTTTTTCGCGGCTGGAGAGTTGCGCTAATTTATACGAGGCTTGCTTCGCGGCAATGCCCATTTGTTCCAGCATCAGCCTGCTCCTTAACGGGTAATCATGTCATCACGGTGAACGGCAACCGGGCCGTATTCATATCCCAGTATTGCATCAATTTCTTGCGAGTGGTGCCCGGCAATACGGCGTAATGCATCGCTGTTGTAACGACTGACGCCGTGGGCGATATCGCGACCTTCGAGGTTGCAAATGCGGATGACTTCACCACGCGAGAAATTGCCAGTCACGCTTTTAATGCCTTTCGGCAACAGGGAGCTGCCGCGTTCAAGAATGGCGGCAGTTGCCCCTTCATCTACCGTGATTTCACCCGCCGGCGGCGCACCGAAAATCCAGCGTTTACGGTTTTCAAGCGGAGTCGCCTGGGCATGGAACAGCGTACCGACGGAAATGCCTTCCATCACATCACCAATAACGCCCGGCTTGCTGCCCGCGGCAATAATGGTGTCGATACCCGCACGGCAAGCCACGTCAGCGGCCTGCAATTTGGTACTCATGCCGCCAGTTCCGAGGCCTGAAACGCTGTCACCGGCAATCGCGCGCAGTGCGTCATCAATGCCGTAAACATCTTTAATCAGTTCTGCCTGCGGATTGCTGCGCGGATCAGCGGTATACAAACCTTTTTGATCGGTCAGCAGCAACAGTTTATCGGCACCCGCCAGAATCGCCGCCAGCGCAGAAAGGTTATCGTTATCGCCGACCTTAATCTCTGCCGT
AGCGACAGCATCGTTCTCATTGATTACCGGAACGATATTGTTATCGAGCAACGCACGCAGGGTGTCGCGGGCGTTCAGGAAGCGTTCACGGTCTTCCATATCAGCACGGGTCAGCAGCATCTGCCCGACGTGAATGCCATAAATCGAAAACAGCTGTTCCCACAGTTGAATCAGTCGACTCTGCCCTACCGCCGCCAGCAGTTGTTTCGAGGCGATAGTCGCTGGCAGTTCCGGGTACCCCAGGTGCTCACGTCCGGCGGCGATCGCGCCCGACGTCACAATAACAATCCGATGCCCGGCGGCATGTAACTGCGCGCACTGGCGAACAAGTTCAACGATATGGGCACGGTTCAGACGGCGCGATCCGCCTGTTAGCACACTGGTGCCGAGTTTTACCACCAGCGTCTGGCTGTCACTCATGATTCTCTGCCATTCAATTTTAGGAAAAATGATATCAAACGAACGTTTTAGCAGGACTGTCGTCGGTTGCCAACCATCTGCAAGCAAAGCATGGCGTTTTGTTGCGCGGGATCAGCAAGCCTAGCGGCAGTTGTTTACGCTTTTATTACAGATTTAATAAATTACCACATTTTAAGAATATTATTAATCTGTAATATATCTTTAACAATCTCAGGTTAAAAACTTTCCTGTTTTCAACGGGGCTCTCCCGCTGAATATTCGCGCGTTAATTAAAATCAGGAATGAAAATGAAAAAGAGCACTCTGGCATTAGTGGTGATGGGCATTGTGGCATCTGCATCCGTACAGGCCGCAGAAATATATAACAAAGACGGTAATAAACTGGATGTCTATGGCAAAGTTAAAGCCATGCATTATATGAGTGATAACGACAGTAAAGATGGCGACCAGAGTTATATCCGTTTTGGTTTTAAAGGCGAAACACAAATTAACGATCAACTGACTGGCTATGGCCGTTGGGAAGCGGAGTTTGCCGGAAATAAAGCGGAGAGTGATACTGCACAGCAAAAAACGCGTCTCGCTTTTGCCGGATTGAAGTATAAAGATTTGGGTTCTTTCGACTATGGCCGTAACCTGGGCGCGTTGTATGACGTGGAAGCCTGGACCGATATGTTCCCGGAATTTGGTGGCGACTCCTCGGCGCAGACCGACAACTTTATGACCAAACGCGCCAGCGGTCTGGCGACGTATCGGAACACCGACTTCTTCGGCGTTATCGATGGCCTGAACTTAACCCTGCAATATCAAGGGAAAAACGAAAACCGCGACGTTAAAAAGCAAAACGGCGATGGCTTCGGCACGTCATTGACATATGACTTTGGCGGCAGCGATTTCGCCATTAGTGGTGCCTATACCAACTCAGATCGCACCAACGAGCAGAACCTGCAAAGCCGTGGCACTGGCAAGCGTGCAGAAGCATGGGCAACAGGTCTGAAATACGATGCCAATAATATTTATCTGGCAACTTTTTATTCTGAAACACGCAAAATGACGCCAATAACTGGCGGCTTTGCCAATAAGACACAGAACTTTGAAGCGGTCGCTCAATACCAGTTTGACTTTGGTCTGCGTCCATCGCTGGGTTATGTCTTATCGAAAGGGAAAGATATTGAAGGTATCGGTGATGAAGATCTGGTCAATTATATCGATGTCGGGGCTACATATTATTTCAACAAAAATATGTCAGCGTTTGTTGATTATAAAATCAACCAACTGGATAGCGATAACAAATTGAATATTAATAATGATGATATTGTCGCGGTTGGCATGACCTATCAGTTTTAA
Protein sequences of DBSCAN-SWA_6 >CP033401|2472839:2488527|2478246_2478873_+|AYQ02230.1|DBSCAN-SWA METVGQRIKALRRVTRTSQKELGKFCGVSDVAVGYWEKDINTPGGEALSKLAKFFNTSIDYILYGAEFEGKLVTNMRRVPVISWVQAGQFTECRAAEVFSEVDKWVDTSLKVGDNSFALEVKGDSMTNPNGLPTIPEGATVIVDPDAEPRHGKIVIARLDGTNEATVKKLVIDGPQKFLVPLNPRYPNIPINGNCLIIGVVKGVQYEL >CP033401|2472839:2488527|2473865_2477009_-|AYQ02227.1|DBSCAN-SWA MSTKLTGYVWDGCAASGMKLSSVAIMARLADFSNDEGVCWPSIETIARQIGAGMSTVRTAIARLEAEGWLTRKARRQGNSAQAAADSQTASANSATAAKKSETNAKNSEAAAKSSETNAKASETNAKSSETNAAKSAADALNYRNQAQLIVGDNIGLGSAPRDCPDISGNPSGYIGFMRIMSNAKGFPSIASGESSLTGFISQVDGTPAYTGVFQGWATRSLYTYRWNPTIGPQWTRHARKDEVIRFQRSSDTRTIILSTDVQADGCYLQVDADGQWGAFNPKAGRWQPLAVAQGGTGANDASTARSKLEVMFEAKTGLDANTNLNDIKGTTAGFYYQPMSANAKPELNYPIQLAGALLVQRTGANGSDGCIQSYFVYNNSSILYRRIYTSTQGWSVWKKIVLTERIEEGESTTYVYSNYSPSAPRLQVSTSGLWGCHNGSSWVPLPIGQGGTNAATVDGARTNLGLGRNNSPQLNSLFLDRYSDSTNTYTSSGILHTRLLATDSTVRLGADMYVETLSNEPGQLTIRFTYDGSTGASKYLNLNSEGNLIVDSAILKSTVEKPLQIRSANPAIRFNETDRPANTPTYTLIANAGDWFIQKRDYDDAGSVSNAIAYNFANDRIDVQNLKASGLITANSGIATLTGHDWNAQHTDNVDKFRSIAGSTNGPAGSMVLGGIHVQFSKNYAVQFGGRNSGFWGRTIENGTTQEWKKLLTVDDLNSSTDIAVRSLTTSNPIKSGGGRIDVLGSTSDYSKMDCFVRGFDSTGNSLAWALGSSVGVSKMLSLKNFFSGAEILLNGNDGAVQLKTGAVNGATAQALTINKDEVNSTVDLTLTKQTGTGNRFVLQNLGNTELPFAVKVWGSGDRQNVFEVGTTAAYLFYAQKTSSGQLFDVNGAINCTTLNQLSDRELKDNIQIISDATEAIRKMNGYTYTLKENGLPYAGVIAQEAMEAIPEAVGSFTHYGKELQGPTVDGNELREETRYLNVDYAAVTGLLVQVARETDDRVTALEEENTTLRQNLATAGTRITTLENQVSELVALVGQLTGSEH >CP033401|2472839:2488527|2487471_2488527_+|AYQ02239.1|DBSCAN-SWA MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNDSKDGDQSYIRFGFKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNENRDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSRGTGKRAEAWATGLKYDANNIYLATFYSETRKMTPITGGFANKTQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDSDNKLNINNDDIVAVGMTYQF >CP033401|2472839:2488527|2480522_2481347_+|AYQ02232.1|DBSCAN-SWA 
MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSINGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAVRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >CP033401|2472839:2488527|2480094_2480457_+|AYQ02231.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGH >CP033401|2472839:2488527|2484815_2486069_-|AYQ02237.1|DBSCAN-SWA MLEQMGIAAKQASYKLAQLSSREKNRVLEKIADELEAQSEIILNANAQDVADARANGLGEAMLDRLALTPARLKGIADDVRQVCNLADPVGQVIDGSVLDSGLRLERRRVPLGVIGVIYEARPNVTVDVASLCLKTGNAVILRGGKETCRTNAATVAVIQDALKSCGLPVGAVQAIDNPDRALVSEMLRMDKYIDMLIPRGGAGLHKLCREQSTIPVITGGIGVCHIYVDESVEIAEALKVIVNAKTQRPSTCNTVETLLVNKNIADSFLPALSKQMEESGVALHADAAALAQLQTGPAKVVAVKAEEYDDEFLSLDLNVKIVSDLDDAIAHIREHGTQHSDAILTRDMRNAQRFVNEVDSSAVYVNASTRFTDGGQFGLGAEVAVSTQKLHARGPMGLEALTTYKWIGIGDYTIRA >CP033401|2472839:2488527|2479103_2479601_-|AYQ04307.1|DBSCAN-SWA MAAIKETAGLAKVINDAKTDAEVKAATIELQNKLITLQAECFSLGDAIRLRDEEVMHLKAKIAEFEDFCAKVEGYVLDQLDSGAFVYSKNEIVSGKEITVHLCPLCYSKNIKSILHPLPVGKTSHFLTSRCLHCENKFLMEKNPMYERPRSLRELGRDLNSPWIP >CP033401|2472839:2488527|2476998_2477178_-|AYQ02228.1|DBSCAN-SWA MRQVNRWFKDHYGVPVRVIRWEPETQRVIYLREGYEHECFSPLEQFRRKFREIEVGHEH >CP033401|2472839:2488527|2472839_2473613_+|AYQ02226.1|DBSCAN-SWA MDITEFPSGVIEHLGWYVYRLIDPRDGSTFYVGKGKGNRVFAHMRGEVAATDDDELLSNKLKQIREIRLAGLEVIHVIHRHGMTDEKTAYEVEAALIDAYPGLTNIMNGAGSNEFGAAHVKELIATYQPETITFHHKALMISVNRSAKDSELYDAVRFSWRINVSRASQAEIILATVRGIVRGVFIADKWLKSTRENFPSLKYWDEDPDFEATQSSRYGFEGREAPPEIANLYLGKKIPDELRKKGAMSPVRYSPNF >CP033401|2472839:2488527|2482001_2482880_+|AYQ02234.1|DBSCAN-SWA MCITHVVSFSGGRTSAYLVHLMEEQRKAGNNVCYIFMDTGCEHPLTYRFIREVVKFWDIPLTVLQVDINPELGQPNGYTEWEPKDIQTRMPVLKPFMDMVKKYGTPYIGGALCTDRLKLIPFTKYCDNHFGRGNYITWLGIRADEPRRLKPKSGVRYLAELSDFDKSDVIRWWRKQPFDLQIPEHLGNCVFCIKKSTQKLGLACKDEPGLMRVFNELVTGKHVRDGHRRTGKDIMYRGHLTLDGIARMSANSDYRNLYQAMVQARRFDTGSCSESCEIWGDQLELEFKEVGV 
>CP033401|2472839:2488527|2477353_2477911_-|AYQ04306.1|DBSCAN-SWA MGKHHWKVEKQPEWYVKAVRKTIAALPGGYAEAADWLDVTENALFNRLRADGDQIFPLGWAMILQRAGGTHFIADAVAQSANGVFVSLPDVEDVDNADINQRLLEVIEQIGSYSKQIRSAIEDGVVEPHEKTAINDELYLSISKLQEHAALVYKIFCISESNDARECAAPGVVASIASGCGETNA >CP033401|2472839:2488527|2473682_2473811_-|AYQ04305.1|tail|DBSCAN-SWA MEIATEEEKALLAAWKKYRVLLNRVDTSTAPDIEWPEEPDTV >CP033401|2472839:2488527|2481474_2482011_+|AYQ02233.1|DBSCAN-SWA MSFIKTFSGKHFYYDRINKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYELPPVMSTPVKYADLIMLATERRDLGLDDGSLWPVLEGIPATEMFKVIPLAPGHAYGMFMERFNELSELRKCA >CP033401|2472839:2488527|2477948_2478149_-|AYQ02229.1|DBSCAN-SWA MTTDDIESYFGSIEKVAAFFGITTEAVYQWRNRPGQLIPKGRAAEAAYRTCGRLPFKPELYEKSNG >CP033401|2472839:2488527|2483447_2484611_+|AYQ02236.1|integrase|DBSCAN-SWA MSLFRRGEIWYASFTLPNGKRFKQSLGTKDKRQATELHDKLKAEAWRVSKLGEIPDITFEEACVRWLEEKAHKKSLDDDKSRIGFWLQHFAGMQLRDITESKIYSAMQKMTNRRHEENWKLRAEACRKKGKPVPEYTPKPASVATKATHLSFIKALLRAAEREWKMLDKAPIIKVPQPKNKRIRWLEPHEAQRLIDECPEPLKSVVEFALATGLRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVALNDTACRVLKKQIGNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWKAALRRAGIDDFRFHDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHARQIDSILNPSVPNLSQSKNKEGTNDV >CP033401|2472839:2488527|2482876_2483221_+|AYQ02235.1|DBSCAN-SWA MTTEINYHALLERARNKVQSIEFALTQSAFAEIRAELENDLELARIALASLEVEPDERAAYELFMEKRFGKTVDRRRAKNGDNEYMAWDMTLGWIVWQQRAGIHFSTMSQQEVK >CP033401|2472839:2488527|2486080_2487184_-|AYQ02238.1|DBSCAN-SWA MSDSQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQLHAAGHRIVIVTSGAIAAGREHLGYPELPATIASKQLLAAVGQSRLIQLWEQLFSIYGIHVGQMLLTRADMEDRERFLNARDTLRALLDNNIVPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLTDQKGLYTADPRSNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMSTKLQAADVACRAGIDTIIAAGSKPGVIGDVMEGISVGTLFHAQATPLENRKRWIFGAPPAGEITVDEGATAAILERGSSLLPKGIKSVTGNFSRGEVIRICNLEGRDIAHGVSRYNSDALRRIAGHHSQEIDAILGYEYGPVAVHRDDMITR |
17 | Shigella_phage(33.33%) | tail,integrase | attL 2469943:2470002|attR 2484612:2484671 |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2874406 : 2880965
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >CP033401|2874406:2880965|DBSCAN-SWA GATGAAAATTGCGCTGGTTATTTTCATCACCCTTGCCCTGGCGGGCTGTGCGCTGTTATCACTCCATATGGGAGTGATCCCCGTGCCGTGGCGCGCGCTGCTGACCGACTGGCAGGCCGGACGCGAGCATTATTATGTATTGATGGAGTACCGACTGCCGCGCTTGCTGCTGGCACTGTTTGTCGGTGCAGCCCTCGCCGTGGCGGGCGTGCTGATACAGGGGATTGTGCGCAACCCTCTGGCATCACCGGATATTCTCGGCGTTAACCATGCCGCCAGCCTGGCCTCTGTGGGGGCTCTACTTCTTATGCCGTCACTGCCCGTGATGGTGCTGCCGCTGCTGGCCTTTGCGGGCGGCATGGCGGGGTTGATATTACTGAAGATGCTGGCAAAGACCCACCAGCCGATGAAGCTGGCGCTCACCGGCGTGGCGCTTTCTGCATGCTGGGCCAGCCTGACGGATTATCTGATGCTCTCGCGCCCACAGGATGTGAACAACGCCCTGCTGTGGCTGACCGGCAGCTTATGGGGCCGTGACTGGAGCTTTGTGAAGATTGCCATCCCGCTGATGATTTTATTTCTGCCGCTGAGCCTGAGTTTTTGCCGCGATCTCGACCTCCTTGCACTCGGCGATGCGCGCGCCACCACGCTCGGTGTGTCGGTGCCCCATACCCGATTCTGGGCTTTGTTACTAGCTGTCGCCATGACATCTACCGGCGTGGCCGCCTGCGGCCCGATTAGCTTTATTGGTCTCGTGGTGCCGCATATGATGCGTAGCATCACCGGTGGACGTCACCGCAGACTGCTGCCTGTTTCAGCCCTGACAGGTGCGTTGCTGTTGGTGGTTGCCGATCTGCTGGCGAGAATTATTCATCCCCCACTGGAGCTCCCGGTTGGCGTGCTGACCGCCATTATCGGTGCGCCGTGGTTTGTCTGGTTGCTTGTGAGAATGCGATAAATGACTTTACGAACTGAAAATCTGACGGTCAGTTACGGGACAGACAAGGTACTTAACGACGTTTCACTCTCACTGCCAACGGGGAAGATCACCGCCCTGATCGGTCCTAACGGTTGCGGGAAATCGACGCTGTTAAACTGTTTTTCGCGGCTTTTAATGCCGCAGTCTAGCACCGTATTTCTCGGCGATAATCCCATAAATATGCTCTCATCGCGCCAGTTGGCCCGCAGGCTTTCGCTGCTGCCTCAGCACCATTTAACGCCAGAGGGGATCACAGTCCAGGAGCTGGTTTCGTATGGTCGTAATCCCTGGCTGTCACTCTGGGGGCGTCTCTCCGCTGAAGACAATGCACGAGTTAATGTCGCCATGAACCAGACCCGGATCAATCATCTTGCCGTTCGTCGGTTAACCGAGCTTTCCGGCGGTCAGCGCCAGCGCGCATTTCTGGCGATGGTCCTGGCCCAGAATACGCCCGTTGTATTACTTGATGAGCCAACCACCTATCTTGATATCAATCACCAGGTGGACCTGATGCGGTTGATGGGCGAACTCCGGACTCAGGGGAAAACGGTGGTCGCTGTGCTGCACGACCTTAATCAGGCTAGCCGGTACTGCGATCAACTGGTGGTAATGGCAAACGGACATGTTATGGCGCAAGGCACACCAGAAGAGGTGATGACCCCAGGATTGCTGAGAACAGTATTCAGCGTGGAAGCGGAAATACACCCCGAGCCGGTATCTGGCAGGCCGATGTGCCTAATGAGGTAGATTGCACAGGCCGTAAGAACCAAACCACGACTGAATGAAACTGGACTGGCGCCAGCAAGCCTGTTCAGACTGGGGCTGAACTTTTCCGGACTCTGAAAGATTACCAATACTCATCGTCCATCCGCTTGCTTTAGGCTGACAGGTTCATAATCAACGCAAACCAGAGCTGTACAGGCTTGGGCGCGGCTTTCAAACCAGTC
GTGATCACGGCAATCAATTTTGAACTCTGCTTAACGGACATTTCTGTATAACCCTTACGGCAACGAAAAACGCGAAGTTAAAATTTTAGAAACCCAAAAACGTGACATGACTAAGTTTAGATTTCAGGGGGGGAGATCAAAAAATTTCGCTCTGTGCCAGAGCGGACATTCACGGAGCTGGTTCATTACCAATGAGGTTGGGCTTTTGAGGATAAATCAATGATCAGACGCCAACGTAAATCAAAAGCACCCCTGGAACGGAATTGCTAATCCAGTTTCTGACCATCGATTTTTCTAAAAAGTGTGCGTTTGCTACTACTTAGGTAGGTGCAGCTTTCTTAATCACCGGCAGCCACGTTATACAGGCCAGTTGATGGATCGATTGTTATCAATGATATCTTTATGAGTCGGTGTCTCACCCAGCTTACCGAAGCTGGCATAAAGTGAAGGCAGACGGGCCCGTCCTTCTCCCTTTTTCGCCAGAGGGAAAGCGCGAAGCATGGTGGCAGCCTCCCAGTCACAACAATAGGATGGTGTGCACGGCTGCTGACGCCATGATTCAGCGATAGAGCCGGAAAATACGGGGTCAAAGCCGGTATCATTAACCAGAGTCATCGTGACCTGTACTGCGGCTGGATGATCTCCTGCTACTGCAACTGCTAGGCGACCGCGGCTCCCTTCAGGTAGTCTGTTGTGGTCAACTAAAACTGGCCTCCGCGTTAGAGTTTTTCCAGTATCGGTTTTCTGATTCGTTTGGTGGTAACCCACCATTATATTCGTGCGGTCTTAGTGCGCTGTAATATCCAACGATATAGTCCGTTATTGCGTGAGCTGCATCGCTGAAGCTTACATAGCCCGTCGCTGGCACCCATTCGTTCTTCAGACTCCTGAAGAAGCGCTCCATTGGGCTGTTATCCCAGCAGTTTCCACGCCGACTCATACTCTGCCTGATCCGGTATCGCCACAGTAACTGCCGGAACTGCCTGCTCGTATAATGACTGCCTTGATCGCCTGGAACATCACCCCGACGGGCTTACCACGGGTTTCCCATGCCATTTCCAGTGCTTTCATGGTAAGCCTGCTGTCCGGCGAGAACGACATGGCCCAGCCCACTGGTTTTCTTGCGAACAGGTCGAGAACAACGGCGAGGTACGCCCAGCGCTTACCCGTCCAGATATAGGTCACATCACCGCACCACACCTGATTTGGTTCCGTTACGGCGAACTGTCGCTCAAGATGATTCGGGATAGCAACGTGCTCATGACCGCCACGCTTATACCGGTGAGTCGGCTGCTGGCAACTGACCAGCCCCAGCTCTTTCATGAGTCTGCCAGCAAGCCAGTGCCCCATCTGGTAGCCTCTCCGGGTTGCCATTGTGGCGATGCTTCTTGCTCCGGCAGAGCCGTGGCTGATGCCATGCAGTTCAAGTACCTGGCTGCGTAATACAGCCCGTCTGCCGTCTGGTTTTTCAGGACGGTTTTTCCAGTATCTGTAGCTGCTGCGATGAACCCCGAACACATGGCAGAGTGTGACCACAGGATAATGCGCTCTGAGTTTCCTGATTATCGAGAACTGTTCAGGGAGTCTGACATCAAGAGCGCGGTTGTAGATTCAATTGGTCAACGCAACAGTTATGTGAAAACATGGGGTTGCGGAGGTTTTTTGAATGAGACGAACATTTACAGCAGAGGAAAAAGCCTCTGTTTTTGAACTATGGAAGAACGGAACAGGCTTCAGTGAAATAGCGAATATCCTGGGTTCAAAACCCGGAACGATCTTCACTATGTTAAGGGATACTGGCGGCATAAAACCCCATGAGCATAAGCGGGCTGTAGCTCACCTGACACTGTCTGAGCGCGAGGAGATACGAGCTGGTTTGTCAGCCAAAATGAGCATTCGTGCGATAGCTACTGCGCTGAATCGCAGTCCTTCGACGATCTCACGTGAAGTTCAGCGTAATCGGGGCAGACGCTATTACAAAGCTGTTGATGCTAATAACC
GAGCCAACAGAATGGCGAAAAGGCCAAAACCGTGCTTACTGGATCAAAATTTACCATTGCGAAAGCTTGTTCTGGAAAAGCTGGAGATGAAATGGTCTCCAGAGCAAATATCAGGATGGTTAAGGCGAACAAAACCACGTCAAAAAACGCTGCGAATATCACCTGAGACAATTTATAAAACGCTGTACTTTCGTAGCCGTGAAGCGCTACACCACCTGAATATACAGCATCTGCGACGGTCGCATAGCCTTCGCCATGGCAGGCGTCATACCCGCAAAGGCGAAAGAGGTACGATTAACATAGTGAACGGAACACCAATTCACGAACGTTCCCGAAATATCGATAACAGACGCTCTCTGGGGCATTGGGAGGGCGATTTAGTCTCAGGTACAAAAAACTCTCATATAGCCACACTTGTAGACCGAAAATCACGTTATACGATCATCCTTAGACTCAGGGGCAAAGATTCTGTCTCAGTAAATCAGGCTCTTACCGACAAATTCCTGAGTTTACCGTCAGAACTCAGAAAATCACTGACATGGGACAGAGGAATGGAACTGGCCAGACATCTAGAATTTACTGTCAGCACCGGCGTTAAAGTTTACTTCTGCGATCCTCAGAGTCCTTGGCAGCGGGGAACAAATGAGAACACAAATGGGCTAATTCGGCAGTACTTTCCTAAAAAGACATGTCTTGCCCAATATACTCAACATGAACTAGATCTGGTTGCTGCTCAGCTAAACAACAGACCGAGAAAGACACTGAAGTTCAAAACACCGAAAGAGATAATTGAAAGGGGTGTTGCATTGACAGATTGAATCTACAGTAGCCTTTTTTAATATTTCATTCTCCATTTCAATGCGTTGTAGCTTTTTCCTCAGCTTACGTATTTCGATTTGTTCTGGTGTTATCGGAGAGGCTTTTGGTGTTTTGCCCTGACGCTCATCACGCAGTTGTTTGACCCATCTTGTCATTGTGGAAAGGCCAACATCCATAGCTTTGGCGGCATCTGCCACCGTGTATGTCTGGTCAACAACCAGTTGAGCGGATTCGCGTTTAAACTCTGCGCTAAAATTTCTTTTTTTCATTGGAGCACCTGTGTTGTTCTGAGGTGAGCATATCACCTCTGTTCAGGTGGCCAAATTCAGTAAACCACTTCACCACTTCAGAGCAGACGAGACAAAAAGAATGGTTGACGCATCAGCTCAACCCCATATAGTTGTAACACTTGAGCCAAACCCTTGGGCCGCTTTTTACTTTGATATTAACATTGCTAATACAGGGAACGCACCTGCCTATAATGTTGAGGTTGTGTTTGATCCTCCACTAGTAAATGCGGAGCATAGAGAAAAAAGTGAGATTCCGTTTAGTAAGGTAAGCGTGTTAAAAAATGGGCAATCACTTACCAGCAATCTCTGTAAGTATGAACAAATCAAAGATCAAATTTATAATATTAATATAAGCTGGGCAAGCAAACCTAAATCAAACGATAGAGAAACAAATGAATATGTGTATGACATGGCGACATTTGAAGGAATAAGTTATCTAGGAGCGAGAAGCCCATTGACGCAAATTGCAGAACAAATTAAAGGTATAAGAGAGGATTGGAAACCTATTGCACAAGGAGCTAAAAAAGTAAAAGCAGACGTATATACTTCAAGCGATAGAAACGAAGAACGCACGTATCTGCAAGAGCAACACGATTTGGCAATAAAAAGGAGAGATGAGAAAAGAGAAAAAAGATTAGAGTCTGGTGAATAATTTTAAAGGGAGTGGGTAACTAACCCACTCGTAACTATAAACCTGTAATTAATCACTTATTTTTGACAACAGATAATTACTGAACGCACTGCAAGTGACTAACATAAATTTAGCTTCAGCTAAGGTAGGATTTACATCTTCTTCTGTTAGCGCATGACGTATTCCACCCTGATCACTTGTATAACCATAGAGCTGACTAAAAGCGCCTTTCATTGCAGAGTGTATATATCCTTTTTCCT
CTATAGCTTTAAGACAAGCCCCCAAGGTTCCTTTATCATTGCCCGTGATTTTCCTGCATAAAGATTCAATTGCAGAGATAGACTCTTTAATCGAGTTTCTGTAGTCTGGCTGCTCTCTATCCGTCATTAGTTGTAACGCCCTTTCGAAATGGCTACGCGATGAATCAGTGCCATTATCAACTGCGTTCTGAACACTTTCAATTTCGTTATCATTTGAAATAGGAGTAATACAACCATTTATTATGGTATAACCAACGCCATGCTTTTTAAAGATGGAATTGAGATGCTTCGATAGATTAATATATGAATTAGTTCTCTCAATGATGAACTCAATTAAATCATATACCAAATACCATGCTTCCCCATATATATAATCTCGGATAGCAGTCAGCAACGTCTTATCACTTTTGTATCCACTTTCATAACGAGGAATATTATCCGCAGGTTGATTTAGATAATATATCCACACAGACTGCGCACATTTTGTTGCTGTAGCAGTTTGACGATTGTTAGTCCAAAGGAAAAGATATAAGCAATTCCACAATGCCATGCGTGTATCAGAATTAAGATCATTCAGCTGGACATGCTCTCTAACGTCAACATGACCATACCTCACAGAAAATGGCTTTATCAT
Protein sequences of DBSCAN-SWA_7 >CP033401|2874406:2880965|2877997_2879149_+|AYQ02566.1|transposase|DBSCAN-SWA MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPHEHKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRGRRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISGWLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRRHTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATLVDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELARHLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHELDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD >CP033401|2874406:2880965|2874406_2875363_+|AYQ02563.1|DBSCAN-SWA MKIALVIFITLALAGCALLSLHMGVIPVPWRALLTDWQAGREHYYVLMEYRLPRLLLALFVGAALAVAGVLIQGIVRNPLASPDILGVNHAASLASVGALLLMPSLPVMVLPLLAFAGGMAGLILLKMLAKTHQPMKLALTGVALSACWASLTDYLMLSRPQDVNNALLWLTGSLWGRDWSFVKIAIPLMILFLPLSLSFCRDLDLLALGDARATTLGVSVPHTRFWALLLAVAMTSTGVAACGPISFIGLVVPHMMRSITGGRHRRLLPVSALTGALLLVVADLLARIIHPPLELPVGVLTAIIGAPWFVWLLVRMR >CP033401|2874406:2880965|2876688_2876946_-|AYQ02565.1|DBSCAN-SWA MTLVNDTGFDPVFSGSIAESWRQQPCTPSYCCDWEAATMLRAFPLAKKGEGRARLPSLYASFGKLGETPTHKDIIDNNRSINWPV >CP033401|2874406:2880965|2875363_2876131_+|AYQ02564.1|DBSCAN-SWA MTLRTENLTVSYGTDKVLNDVSLSLPTGKITALIGPNGCGKSTLLNCFSRLLMPQSSTVFLGDNPINMLSSRQLARRLSLLPQHHLTPEGITVQELVSYGRNPWLSLWGRLSAEDNARVNVAMNQTRINHLAVRRLTELSGGQRQRAFLAMVLAQNTPVVLLDEPTTYLDINHQVDLMRLMGELRTQGKTVVAVLHDLNQASRYCDQLVVMANGHVMAQGTPEEVMTPGLLRTVFSVEAEIHPEPVSGRPMCLMR >CP033401|2874406:2880965|2879519_2880092_+|AYQ02568.1|DBSCAN-SWA MVDASAQPHIVVTLEPNPWAAFYFDINIANTGNAPAYNVEVVFDPPLVNAEHREKSEIPFSKVSVLKNGQSLTSNLCKYEQIKDQIYNINISWASKPKSNDRETNEYVYDMATFEGISYLGARSPLTQIAEQIKGIREDWKPIAQGAKKVKADVYTSSDRNEERTYLQEQHDLAIKRRDEKREKRLESGE >CP033401|2874406:2880965|2880140_2880965_-|AYQ02569.1|DBSCAN-SWA MIKPFSVRYGHVDVREHVQLNDLNSDTRMALWNCLYLFLWTNNRQTATATKCAQSVWIYYLNQPADNIPRYESGYKSDKTLLTAIRDYIYGEAWYLVYDLIEFIIERTNSYINLSKHLNSIFKKHGVGYTIINGCITPISNDNEIESVQNAVDNGTDSSRSHFERALQLMTDREQPDYRNSIKESISAIESLCRKITGNDKGTLGACLKAIEEKGYIHSAMKGAFSQLYGYTSDQGGIRHALTEEDVNPTLAEAKFMLVTCSAFSNYLLSKISD 
>CP033401|2874406:2880965|2879068_2879419_-|AYQ02567.1|transposase|DBSCAN-SWA MKKRNFSAEFKRESAQLVVDQTYTVADAAKAMDVGLSTMTRWVKQLRDERQGKTPKASPITPEQIEIRKLRKKLQRIEMENEILKKATVDSICQCNTPFNYLFRCFELQCLSRSVV |
7 | uncultured_Caudovirales_phage(16.67%) | transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
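The keyword rule in the legend above can be sketched as a simple lookup. This is a minimal illustration, not the tool's actual implementation: the keyword list is copied from the legend, while the function names `phage_keywords_in` and `is_highlighted` and the case-insensitive substring matching are assumptions made for this example.

```python
# Keywords copied from the legend; the matching rule below is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keywords_in(annotation):
    """Return the legend keywords found in a protein annotation.

    Matching is a case-insensitive substring test (hypothetical rule),
    and results follow the order of PHAGE_KEYWORDS.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def is_highlighted(annotation):
    """True if the annotation contains at least one phage-related keyword."""
    return bool(phage_keywords_in(annotation))


if __name__ == "__main__":
    for annotation in ("transposase", "hypothetical protein"):
        print(annotation, "->", phage_keywords_in(annotation))
```

Substring matching is deliberately crude: a short keyword such as 'head' would also match unrelated words that contain it, so a stricter variant might match whole words only.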
DBSCAN-SWA_8 |
3297668 : 3316187
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >CP033401|3297668:3316187|DBSCAN-SWA ATCACATCCACATAATTTGCTGCCCTGACGGCAACGGGTGCGGCCTCACGGCGTGGACTTCTCCCGGCTTCACGATGTATCTCTGTACCGACTCATAAGTGATGAACGTGGCGCTGCAATTCACGTTCTGGCACTGGTGATAACGCTCTTTTGTCGTGTCAGTGATATAGCGGCTTGTACGCGCATGTGCGGCATGCTGGCATAAAGGACAATGAAACATCGCGAGCACCTCTTCCGGTTTTGTTGATAGTGCCATTTTAGTTAAATTATCATTATAAAACAAAAAGATAAACAAAAGACATCACTCATAATCTTCTGTTTCGTACTCCACATCAGAAACCGTCGCGGCGAATCAGCGTAGTGACGCCTGACTCGTTAAGCAGGTCAGCATCGGTGCCGGACTCCTGCAAATCCCAGAATACAGATGCGCTGATGCCGGTAACACCGTTCACCCCGACATTGGACAGCGTTTTATGCCAGCCCTGCTCCTGGTCGATTTTAGCGCGCAGCCCCAGCGCACGGGCGGTGGCATACGCGGTGGCGGTGGTACTGGTGACCGTATCCCATGCGAGGAAATCCGGCCAGATGACCATCAGCTCACGCTGGCTGAAATTCTGGCGGTAGGCTTTCACCTCGGAAATGGTTTTACAGCCCCATGCGCTGATATACCCGAAAGCGCGCAGCTTCTGACAGACTGATGCCAGTGCAACAGCCACCTCTTTGGTATCCAGCCCCGGCACGCCGAGAATACGCGGTTTAACACCGGTTACCGACTCCGCCGCCAGCAGGGCTTTCAGTCCGGTGTACTGACCGTTTTCGTCGGTGGTGCCGATGATATTGGAAACGGTCTGCGCAAGTTTCGCTTCCTCGTCGTCGCCGGTGCCGTCTTCCACACGCACGACAACGGTGACCGGTTTTGACTGGTCGGCGATGGCCTGCAACGATGCCGCCAGCGTGCCTTTTTTACCGGCCTTTGCAATTGCGCTCTGCACATTGGTAATCAGCACTGGTTTATTGAGGGGGAAGATTTCCGCATCCGCATCGCTGGCCGTGCAGACCATGCCAACAATGGCGGTGGATACGGTGGAAATGACGCGGGTGCCGTCGTTAATCTCCAGCACCTGCACGCCGTGATGATAGTCACTCATCCGTTTAACTCCGTGGTTAATGGGTGCAACTATTTTCTGTTGGGCAGTGCATGAGACGCTATTTGACCTGGCTGGTCAGTGGATGAAACAACAGATAAAGAAAAGGCAGGCAATTCGCCCGCCTGTCCTGATTTGTACTCACTCATTTTCCGACTGACAATTTACATAGCCAAAACGCTATCAAATCTGACAGTCTGCTTTGAGCGAGGAGCAGAGGTTAGTTTTAGTTAACCAAAATGATAAAAAGCAGTAGAAAAATCCGCTCATTACGTTATGGTTATAAGCCGCACATAATCATCCGAGCCAAATCCTCTTGATTTCAAAGAATTAGCACTTGCTCTTCACTAGAAACTATGGTTCTGACTTCACGCTCAAATATTGGATCCATTATCATTTTTCTATTTGTTGGAATAGTAAATTTACCGAGTACCATGACAATGCTACTCAACGGAAATAATTTTGGAGTAACATTTCCTAATTGAATCGTAGGATATATATTAAAGCCCCCTTTTATCCTCATTGGCGAACCTTCAATGAAGTGTCGATAATTAAATTTCGCCATAGGGATTCTCGCCACATGCACTTTCATAATTACAGCAACCTGAGCTATTTTTACTTTCTGGAGGTAAGTTGACAATCGTTCTAACTGCGTTGTTTCATAGAAAAAATCAGACCACGTTACAGTACTTGTTCCAGTAACAATTTTTATGCTGTTTCTCTTGTTGGGGTGAGCTTTTCCATATTCATAGATCTCCCGTAAGCTATCCATGTTCTTCAC
ATAAGGGGCTTTCCGACCTCTCTTGACATAAACGCGGTCTGTAGTATCAGAAGGTGGATTTGCCTGCATAGCTGCAGCTTTTCTGCTTAATTTAGAGATATCATCTTCATCGAGTATATGTATACGGAATATATTATTACCTGTGGCAAGCGCACGGCTGATATCATGACTGGAATCACTTGCATAAATTGTATTCAATCCTGAAGTTCCATACTGGCATAATTTGTCATGCGTGAAATTTTTTTCCAGTCTAAAGAAGGCGGGGATATGCAAGGTTTTATTAAAACTCCGACGAATATGGCTTTTGACATACACCAAATGAGCAGAGCAATTTGGGTCAGGACATTTGAGTGGATAGGGATTAACACTATAATTAAAAGTATTTACATTTACTAAAATCTGATTGTTATCAATAGCCTGAGTTATACGAATACCACTCATTACTCCTCCATAAAGATAGGCGCAAGCATTTAGTAATAAACCATTAACATGCCGAGATCATACCCGATCATTGAAATGTAAACACTCCCCTAAGACGGCTGCTGGCATAATATTTTATGACATCAGCAATGCCCACCTCTGGCACAGAGTGGACTGTCAGATTAGGCTTTACTCTGTGCCATAGATATGTAAGCCCACACTAGAGCTCATACAACTTATTGCGGCATTTCCGGCCATTCAGGATTTGCAGGATCCACACGACTGACCAGAACACTATAGCGTTCCCATGCCTCCAGTCGACTACGCTCCTCATCTGTTGCCATATTCAGCCTGACCGCGCGCTCCAGCGGCAAAATCACGGATTCAGCTTCGGAAAGCAAAGCTGCCTTATGTAATTCCGCCAGTTTCTGCTGTTCGTCTGCCGTATAAATCCGCTTAATCACGGCACCATCCTTAAACATCCATTTACCTGAGTCGTCAGCACGTCGGTTGGAGGTAATATCAGGAACCTCAACAACGCTAAAACCTTCAGGATTAAGCGTTGAAGCATCTCTGGTGATAGCGACAATAATATTATTTTCATCGTAAACAATCTTTATTGTGTCTGGCTGAAAGTTCTTCACTTCCTCATACCAGTTTTTTCCGTCCTCAGAGTAAAGCCAGATAACTCCGTGTTTCTTTGTTAACTCATACTGTTCCAGTGTTTTAGCGTTACCCGCTTTTATGTTCTTTAAGTGCATCATATTAAACGCTCGCTACATTATACCAGGTGCCATTTATATACTTTTGAACGGGTCTGTAATAAACGCCCGCTATATTATCGGCAGAGTTGGACCCTGTATCCTGAACATTAATACCAGACAATACATGACCTGACGGGCACTGGAAATTCCATGTTTGCCAGTTGTTCACTCCATAATATTGCTGTGACCCAAGTCGAACATCTTTCACATATCTGGAATCAAAATTGCCATAGTTGCCAGGAATAACTTGCGAGCCACAAAGCCAGTTACCGTTATTATCCATGTACGCCTGACCATCGGTGCCATTGGCTGTCCTTGAGTTATTAATCATGTAGATGCCAAATTGCTTATTTCCCAGACCGCCAATCATAAATTTGCGGTCGGCATGGTCCTGACGGAGCAAAGCCTGAGCACCATCAGTGGATACCGCATTACGTCCAAAAATAACATTCTGGTCACGCATATGAATCCACATGCCTGTGCTGCTGTTAATTGCAAGGCGGTTTGCGTACACCCATGCGTTAGTTGTTATATCTCCTGTAACATCCAGAGCGTGCCCCATAGTTATGCGACCAGTTCTGAGATTCAACGAAAAGGGGCGTAGTGGCCCGATATCACCATTTTCCCCCTCATTCTCTCGTGTGGGGATAATATGCAGGCATTCTTCAGAACGGCGAAAAATAGCACCAAAGGATGAATTAAAGATTCTCAGTGCATTGACTGTCGATATTTTTACTTCACTGCTGAAAAGGGCTTTAACAAGGACAGACAAAGCATCCCATTTAAGATTCATCAGGTCTTTTGTTGTGGTGCCCT
GACGGCTTCTCCATTTGAAATATTCATTGCCGTTGTCGCCTGTTTCAAACCACATGTATGAATCAGTGTCACCATCGGCATCATTTTTAAATCCAATCTTCGCCCAGTCAGTATTTCGAATCCAGGCAAGGATTGAGTCGTTTTCAAAAGTAAGTCCACCGGACAAGGTATCGCCATTTTTTTGCACGGCGTTCCCGGCTCGGTTTACCGTTTCCTGTAAACCGAGATATTCGATAACGGCGGCAACGGTCGATTTAGCCAGAATACCGTGAGCGCGCTGGTTTCAGGTTCATACTCAATCACCGCCCCGTCAGGGAAACGGATATGCAGGGCATCCGCCGACGCAGACGGCGCGGGGTTATCGCCGGAATAAATCCCCGGCAGAACGAACGCCGTGTCGAGTTCACCACCCACGGCCAGAATCAGCACCTGTTCCCCCACGGAAGGTGCCCACCATGTGCGCGAACGCCCGGCGCGATGGGTCAGCCACTGAAGCCAGTCGGTGCACATGCCGCCGGTCTGCACACGGCAGCGACCGGCATTAAGGTCGGTTTCGACGATAATGCCGGTGCGGATCATGTTGCGCAGTGCGCGCGCGAGTTCCTGAATATTTGCGAGAGTGTTCATGCGTGTGAGATTGCACAATATATAAAAGTTATGCTATCTGGATTCATTTGTAGAACGACCATACAACATTCGAGGAGAGCGTAATGTTCAGTGATAATGTGACTAATGCGTGGTGGTTTATCTCTTTGTATCTATTTTTATTAATAGCATTAACATTTGTTACCTTTGGTAAAAGTAATCTTATGAGGTTTATTGCACATCATTTCAATCTTGAGTATTCAGACAGAAAGTTAAAAATGCTCGACAAAAAATGGCGCGACATTCAACTATTTAAAATAATTAACGGAATCAATGTATCAGGCATCGAAGATGTGAGAATGATACAGCAGGGGCTGATTGATGGAAAACTAAAAACATCGTATTTTTTCCTTACTCGCTTCTGGGGTGACATAACAAAACCACCACACATAATTAAAACAACAATTGTAATTCTGGCCAGTATTATTTATATTCTCTTCGCATGTTATATACACAATGAACAATCCGCTATAGTAAGGGATGCCATAGGTATACCATATAAAAATATGATGTACTATGTTTATAGTGACAAAGTTCTTTTATCCTTCAAAAATAAAACGGTTGAATTTAATAAAACTTATAGCCTTGCCGATTGCAAGAGTCTGCAAAACGTATTTATAAAAGACACACTTCCCGAGATCGCCTGCAATAAGCTCTTACAGCTAAACAAGGAGGACTCCGAATGGTTAAGTCAGGAGATTAAAGATAATAACAGCTACAGAAAAACATTATTAATAATGTCCCTAACCTATTTCATTTCAGGTCTGCTTATATTCCTGTCATATACAAAATTCCTTTACGCCAATAAGAAGGTTTTAGAATACAAAGCATCAAATAAAAACCACTCATAAACCTCTAAATATTGAGCGACCAGCACGGCCGCTCAATGCTTAATTGCGCATCAGCCTCTGCCTGGATAAAACTAACGCTCAAGGTGAGCCAGGATAATCTCTTCAATCATCTGCACATCCTCACCGGTAAAGCCGAGCAGAGGACGCGCCGGATAATCAATTTTCTTACCGTCTTTCCGGTTTTCTTCCGACAGACCGAACTGATGCACACTGGCGATTTTCGGTGACTTCCCGCCGTAAAACTCCATTGATGCCTGTTCCGGGCTGGCGCGGATATGCAAAAAACGACTGGTGATAAGTTTCGCAAACATTTTTCGCTTAACACGACCGGTCTTTTTTCTGGCGCTCTGCTGCTGGCGTGGCGCGTAGGGTGTGCCGTCCGGAGTTTTCTGTGCCATCACCCGACGCTGCTGACTCTGCCGCAGGCGTTTCGCCAGTTCGGCACTCAGTCGCCGACGCCCTGACGGTGACAGCGATTCAATCAGTCCGGTCAGCCGGTCTTCA
AAACGCTTAAACTCATTCATCCCACTTGCTCACCAGTTCGCCATTGATATACAGCTCCATCGGGCGGGTGACCGGCTCCGGCGGCGGAGGTTCCGGGATATTCTTCACATGCAGCGCGCCGTCCACCTCACTGACCAGCGTGCGCTCGGTCAGCATCAGGCTGATACTGATATCAAAGCTGCTGTCATTGTTGATGTCCGCATAAAACGTGAAGCCCTTTTTCTGGCCTGCGTCGGTGGTCATGATGTCGGGCTGATTTTCCCGCAGCCACGCTAGCACCGGCACGATGAGCAGGTCAAAATCACCGGTAAAGTCGGTCACAATGACATTGAGCGTGTAACGCTTTTCAAATGACAGCGACCTCGCCAGTGTGGAGGCAATACTCCCGTTATCCACGAATATCCGCAGCATCTCGGGACTGGTTTTCAGCACCGTGACAGCATCAGTCAGCGCCCTGCGCAGGCTGTCGGGTTTGAGCATCGTTTTCGTCCTGACAGTGTTTAATCATTTTTACCTGGCTGGCACAGCGTGCCAGCGCGTTCTCAAGCTGCCGGATATCAGCACTTAAATCGCCGTTCGTCTCCGGGTCACTGCCCGGCATCGGGCAAAGGCTCACTTTCGGGCAGGCGTTGTGGACAATCACTGGCGTCGGCGCAGGCCGGGCGCTGGTGCAACCGGCGCACAGCATCAGGCAGGTCAGCGCCGTACCAGCGGCGAAAATCTTCGTTTTCATTAAGTAACCTCGTGATGGTTTTCTCGCGCTGTGCTTCACGCTTCGCGGCGTTCTCCAGCTCCTGACGCAGTGCCACCTGCGCCAGCTCGTTTTTGTCTGCCCTGGTAAGGGCAACATGAAGCTGATTTTTCAGCATGGTGATGGTCGTCTGCTGTTCACTGGCGACGTTGTTCGCCCTGTCCAGCGAGGCGCGCAGGCTGGCATTTTTGTGTTTCACCAGAAACAGACCGGCCACCGCCAGTGATAACAACACGACCAGCACAATCATCAGCTTTGACATGGTTCCCGCCCCTCAAAACGCTGACAGCAGGCCGTACGTATCAGCCGGAAGAACACCGATGCCACGAGATAAATCAGCGCGGTAAAAATCCCCCCGGCAGCGACCAGCGAGATAAACGTCGCCACCATCACTACCAGAGCCACTGACCGCCTGCGCCACGGCACCGGCTGCAAAAACAGCGCCGTGACAATCTTCACGGCCAGCGATTCCGGCGGCAGCTCCCGCCCGTAACGTTCCAGTACATACTCAGTGGCATACACGCCGACACCACCGGCAACCACACAGATAACCGTCGCCAGAATCGCCCAGGCGGCGACAAAACTGACGGCCACGCTCTGCGGGTAAATCAGGGACAGTGCCAGCATCAGCGCCAGCGACACGTTCAGCATCAGTGAAAGGGATAATTTCTTCATGGTGTTTACTCCGTTTACCGGTCTGGCAAAAAGCCTGCGTGCTGCCGTGCATCACAGCTCACCGATTTACGTCAAACGTAACATTCTGGCCTCAACGTTTATCCCACACCCGTGGCTTTCTCAGCAGGATTTCAGCCGCTTTGTGCTGGATTTTCTGGTATTCGGTAATGCGTTTCTGGAAAAGCGTTACAGCACCACCGGTAAGGTCATCAGACTGGAAACCTCACCGGCAAAATATACCCGCCGTGGCGTGGAGGAGGATGTTTACTGGTGGGTGCCGTCCTTCAACGAGCCGACACCTTTCGCGCCCGGCTCCGTGTTTCACCTGCTGGAGCCGGATATTAATCAGGAGCTGTACGGTCTGCCGGAATATCTCAGCGCCCTTAACTCTGCCTGGCTGAATGAGTCGGCCACGCTGTTCCGCCGCAAGTATTACGAAAACGGCGCTCATGCCGGATATATCATGTACGTCACTGATGCCGTGCAGGATCGCAACGATATCGAAATGCTTCGCGAAAACATGGTGAAGTCGAAAGGCCGCAACAACTTTAAAAACCTGTTTCTCTATGCCCCG
CAGGGGAAAGCTGACGGCATTAAAATTATCCCGCTCAGTGAAGTGGCAACGAAGGACGATTTTTTTAATATCAAAAAAGCCAGCGCCGCTGACCTGCTGGACGCGCACCGCATCCCCTTTCAGTTGATGGGCGGCAAGCCGGAGAACGTCGGGTCGCTGGGTGATATTGAGAAAGTAGCAAAGGTCTTTGTCCGCAATGAGCTTATCCCGTTACAGGACAGGATCCGCGAGATAAACGGCTGGCTCGGTCAGGAGGTCATCCGATTTAAAAACTACTCACTGGACACTGACAACGGCTGAACATCGCCGCCTGCGGGCGGCTTTTTTACACCCCGTCATCACGCCCTCACACGTTCGCCACTGTACAAAACACCCCGCAGACACACCAACGCCCCGGCAGGCCGACTAAACGCCATCACGACGCGCTCAGACGCTGAAAAAATAAAATCAGCACCACCGCCAGCGCGCAGTGCTTTCCCCGCCTCGCCCGCCCGCTTCATGGGTCGGTTTTGATGCAATTCCAAAAGCCGTCCAAACTCTCTTAGGCTAAATGTCCAACGAGAAAATAGTTCTTTGAATGTGAATGCATTTTAATGCAGAGTTATGCCCAGCATTTTTGTACACTTCGATGTATCAAATGCGCTGCAAACGATCAAATATGGATGTTTTATCAAGCATCCCCCAAAAGATATTTACATCATCCCATGAGGTTAAGATGGATAACAAAATCGTAGAAATTGAGACAAATAAGCTTGATTTTGACCCTAAAAACCCACGTTTCTTTCGTCTCAATGATGCCAGTAACGCTGCAACAGTCATTGAGGAAATGTTAGATGACGAAAGTGTCCACGATCTAATGCTATCAATCGGTCAGCAAGGTTACTTTCCTGGAGAACCTTTATTGGCAGTAAAAAGCAATGGAAACTACATCGTGGTTGAGGGAAACAGACGCTTAGCTGCTGTAAAGTTGCTCAATGGAGATCTGCTTCCTCCAAAAAGAAAACTTAAAGGTGTGCAAGAAATCATTGATGATACTACCAATAAACCTAAGAAGCTTCCCTGCATCATTTATGAAAACCGAGAGGATGTACTGAGATATATCGGTTATCGTCATATAACTGGGGTCAAAGAATGGGACTCATTATCTAAAGCCAAATACCTTAAAGAGTTATGTGATACTTTTTATTCACATGAGCCTAAAGAGATAGTATTAAAAAATCTGGCTCGTGAGATTGGGAGTAAACCACATTATGTTGCAACACTTCTCACTGCACTGAACTTATATGAAGTCGCGCATGACCATGAGTTTTTTAATTTACCCATGAAGGCTTCTGACGTGGAATTTTCATATATAACCACAGCTTTGGGATATTCAAAAATCACAAACTGGTTAGGTCTACAGGATAAAAAGGATTTTTTAGACCCAAATTTAAATGAAGAAAACCTTAAGCGTTTATTCTCTTGGTTTTTTGTGCCTGACCAACAAGGTAGAACCATCATCGGTGAGTCTCGAAGAATAAAAGATATTGCAGCAGTGGTTGAGAAACCCGAAGCAATTGAAATTCTCATGAAAAGTTCAAACTTGGATGAAGCATATCTATATACCAGCGGAGAAAGAGAAGCATTAGATAAAGCACTAAACGCAGCTAGTGTTAAATTAAGAGTAGTTTGGGATATGCTACTTAAAGCTAAAGAATTAACATTAGAGCATGAAGAGGCTGCATCTGAAATTTTTGAGATGTCAAAAAATATTAGAAATCAGATCAGAAGCAAAAGGGAGGATGATTGAGATTATGATTACAAATCTTGATTCAATGCCTTCTAATGAGCCTTATTTATGGGCTGATTATATTGAGATATTGGCCTTAACTAATATCGACAGGTCATTCAGTCGAGGAGACCTATATAGCACACTGCAAGCTCAACCCGAAGCAGTACTAGCTGAAACAGATGAAGCAGAAGAAGAGGGCGTTTATGATGTTGATGATGAAAATGATACG
CCTGTACGCAAGAGAACAAAACGAAGTGTTAGTCGAGCATATACTGACAGAAAGTGGAGCTATGCGATAGGCTTCATACGACAACGCATTGATTTATTTGGGGATAGTTACCCTTTTACTTTATCAGAAGACAACGATACTGTAGAGTTACGTGATATATCAGAAAAGCCACTGGAACATTTAGAAAGACTATATTTAGCTTTACTAATCTGTGCTAACATAAAATATGTCAACATAATGAGCAGAAGAGAGATAACGCGCAGTTTTGAACTAATTAGTTTACCTATTTTTGAAAGCCTAATGCCTAGCGGTAGCATAATAAAAGCATGCTGGGCTTCTGGTGGTCAAGCGGCCCCTTACACTGGAACTCTATATAATAAATTTAAGAGTATTGCTTCCGATATCCGTTGCACAGCGAACTTCAAAGAACGAGATTTCAGTCGAGGAAATAGTGGTGACGGAGGCCTTGACATAATTGCCTGGCATCCAATGGGAGATCAACGAGATGCCATCCCTATTTCTTTTGTTCAGTGTGGCTGTTCTCAAGAAGAGTGGGAAGCGAAGCAGCTTGAGGCCTCACCTGCGATGCTCTACAGTAAATTCCCCGTAGCTCACCGATGGGCAACTTATTATTTCTTACCTCAAGATCTACGATGGATAGATGGTGAGTGGGCGCATAAAAATAAGTTAGGCGATGCTATTTTTGTTGATCGCCTAAGATTAATCAATTTAACCAGAGCATCTGATAATATTGATCACAGTCAAAATATTAGCTATCTAGATATCATCCTTGATCCTTCCAGCGCGATCGCTGCTTAATCCCATAAATCTGGAAGGTTTCTAGCAACCGCCTCAAACAATGGAGGCGGTACTGCATTACCTACCACAGTATATTTCATATTAATAGAAGCCCGTTCAGTTTCTGGGAAAATTAAATCCCCAAAACCTTGTAAAATAGCAGCCTCTCGATAGCTAAACCTACGAGCTGGTGCATCCGAAGTAAATTGCCACTTATCAGGTCCCAATTTTTCTAATGTTGGACTTATTGGATGTAGAGGCATATGTCTAGGATTTGCAACAATTGTTTTAGATATCTGATCCCAATCTTGCCTACGGTTTCGCGATAGATAATACCAATGAAAATCGGCGTCATAAAACTCGCCAACAGGCCAAACAGGCATATGCCCAATAGCATCACGAATTGTGGAATATGGTGTCAAACCATCACCATGTGTTGGTTTTGGAAATTTGTATGTAATACCGTAGTCCTTTCGTATTCCTACGATAAAGATTCGCTTCCTATCTTGGGATACCCCATAATGGGACGCATTCAGAATTTGCGAGCTTACTGTATAACCTGCTTCTTCGAAAACTTTGAATTGATCCTTTAATAAATGCTCAAAGTTACGCCTTACCATACCAGAGACATTCTCTACAATGAATGCTTTTGGCTTAATTTTACTCAAAGCACGGGCAAACTCTAAATATAGTGTATTAATCTTTCTATCTGCCTTCCTTGCCCCACCTTGACTAAATCCTTGGCAAGGATAGCATCCGATGAGCAACTCAGCAGAAGGGAACGACTGGAGCCCTGAGATATCGCCCAAAATGTAGTCAGTTTCAGGATGGTTTTCTAAGTAAACGTCCCTTGCGTAAGGCAAAATATCATTTGCCATAAGCACATTGAAACCTGCGTTCAAAACTCCCGCATCAGAACCACCACATCCAGAAAAAAGTGAAACTACAGTTGGCATTGACCCCTCCTAAAAACCGACCGCGTATTATAGCGAAACACCCCGTTGGGAAAAGCTAGATTTTGCCAAGTCTTGATATTCTCACGTTTTAGTAGTTGTGGCCATCTTTAACGAGAAAAAGATAAAATTGACTTCTCATTAATTTTCAATAGGTTTAATTGTAAGCTCAAACTAACGCCTCGCGACACTCGTTATTCAACCCCGCCAGCCCTGAAAACAAGTTTCACGACTGGCGGCGTTCTCTAT
CGTCTGCGTTGTGGTGGCGCAACTCTGGACTGACCGATATGGTTAAACCGCCCGTAATTATCCCGGACTATTTCGGCACACCCGACCAGCTCATCGGGCGTCAGATTTTCGTTGACCATAATCCGCTGTAAACGCTGAACAATAGCCATCAGCTTGATATTTTTAGTTTTATGGTGCGGTATCTCGCCTGGTATTCTGTGCATTATCCAAGCCACCCGTTTTGCTGTGCACGCTCCATCTGTTCATCTGAATAGTTCCATGCTCCATCCGTGGCAACCATTGCCCCGCCAGACATCCCCGTCTCTGGTTCATACATAACAGCAAGGCCGAGCTGATGCATAATTTCATGATTAATTCTGAATACCAGACCACGCTCACTAAGTTCTTTCCAGTTCACAATCTCACATGCGCCTGTATTAAGCCGCTCAATACTTAGCAAGACATAATCTTCCAGCCAGTCTGACAGGTCAGTAACATCTGTTATCCGGGCTTCAACCTTTCGCCCCGTATACACACCCTGCACCCATTCATGCAAAATCAACGTGTCCCCGCGCTCATAATTACGGTCATTTTTCCGAAACTCTGCGCGTTTCTTTCCTTCCAGCACAAGGTCAAAATATTTTGCGTGCAGCTTTACCTCGTGAATTTTTGCCATTATGTCCACTCCATTACTGTTGAGAATCCCGGCCACTCATCAGCGACCGGATACGTGAATTTTTTCCCGTCATAATTTACGGTCGCTCCACGCGCCAGCGCCTCAAGCTCCCATCGCTGAGGCCTGATACCGTTCTGAGCAAGGTCAACGCGGATACGGGTGATTTGCATTCGTTCCGACCGGGTCAGTCTGGCCGACGGTGCAATTTCATGTGTTTTTAACGGGCTTCCGTTTCTTTGTTGACGATTTGGTGTTTTCAGGCCGTGTTTTAATGCTCCCCTGAGCGCCCTCACGACCTCCGGGTCATTCCATTCGATAACACCGTCATCAACCAGATTTAGCACTGCTGCGGCGTGCTCAGAAGGTGTGGGAGCCGGTAACGAAGCATCACTACCGGTGAGCTTTGTGATGTGGCGAATCATGGCGTCAAGATGAGAAGAAAAGCGCGTTGCTGCGTCGGCCTGTGCTTCGGTTCTGACCTGTTGCAGCAGTAATGCGTATTTACCGCACTGATTTTCAGAAACTGTATGCATGACTTTCTCCAGGCAAAAAGAAGCCCCGCACAATTAAGTGCGTTAAAAACTCTGGTTAATTACTTAATGCAGATATTGCTCTGGTTTTACCGACGTCAGGATTGTCGGTGCATACTCAAACAGGCTGAATAATTCACGTAATGCACGGAATAAGGCATCACGCCAGTAACATGATTCTTCATTAATTCGCCAGTATGGCTGGTTGAATTCTTTTTCAGTCAATCCGGCATGCATAAATAAAGTACGGCGCTGACTGACAGTTAAAAAGCTAATATATGCATACTCACTTGCACCGACCTGACGGCGTTTTGAGAATGCCCCACGCAATTCATCAATTGCACATACCAGTCGTTCACGTTCGACGTCGTTCATTTCTTCAAAACGCATCGTTGCGTGACGCTGTTTTAACTGTGCATGAAAGCAAACCGTTAGCCGTTCGCGCTCCATCATCTGATTATAATAATCACATGTATCCTGCCAGCGAGGGACGGCAAGATGCTTGCCAATTATCCGGCGCATAGCTGCTGGCTGTTTTTCAACGAGATTGAGCGTCATCACTGTCATTTCCAGCCCCTCCGGCTTTTCAGAAAGGTCAGAGCCTTTTTTAACGGACTCTGTTTTTTGGTGCGGATAATGATTCCCTTACGCCCCTTACCGTGGGTGATGGTGAAGTCAATCGCCCTGGGGCTTTCGTTACGCAGTAACTGAGCAATACAACGCGGTTCACTCATAATCACAACCCCATCCACAAAAGCCATGCATCACGCTGTTCAACTGGTCGGTTATAAAACGCCTCTCGTACAGCGCGATTA
AACTCTGGAATGAAAACCCACTTCTCACCGACACGAGCGTTCGGCTTACTTGGATCACGAAGCTCAATAACTGGCAACTTATTCTCTTTTACCATCTTGACTACAGCCGTTTCTGGCTTACCAAGTAACTCTGCAAACTTAACCGTATGTACCGCATCAATCGGGTACTGAATCACATAGTCATTGACTTCCATTGATTAGCCCTTTTTGCTTTCGTGTTACCCTTATTAGATCCAGTCCCTTCTAGGTCGCACCTGTCCTTTCTAGGGACTGGCTAACACACTCAAAAGGTCACCAATACACAACCTTTTGACGGGAATATAAGTCACCAATAGGTTACTGTCAAATGCAGACATTCGAAAAACTGAAAGCGATTAGGAAAGCAGAAGGCTTAACACAGGCGAAATTCAGCGAAATTAGCGGGATAGCTCTAGGAACAGTCAAAAATTACGAAAGTGGGCATAAAGACCCTGGTCTCAGCATCGTTATGCGAGTCACAAATACGCCTTTATTTAAAAAATATACGCTCTGGTTAATGACTGGTGATACGTCACCACAAGCTGGTCAGATCGCGCCGGCTCTCGCACACATTGGGCAAAAACCAACAGAATCAGACCACTCCGAAAAACAGACTGGTTAACACTCTATAAACATTACATTTTCACCATTTGTTACCAAGATGGTGAATACAGCGTCAGAGGGCTTTCTTATGTCAATTAAGAAGCTCGATGATGGACGCTATGAAGTGGACATTAGACCTCGCGGTCGCGACGGAAAACGCATCCGCAGGAAATTTGAAAGAAAAGCTGAGGCTGTAGCATTTGAGCGATACACAATCGCCTACGCCAGCCAGAAAGAATGGGCAGGTCAGCGAGCAGATCGCAGAACTTTGAGTGAGTTGCTGAACATCTGGTGGAAATATCACGGGCAAAACCACGAGCATGGAACAAAAGAGTTTAATCATCTGCTCAAAACCATCAGCGGCATAGGTGATATACCAGTGAGCCGGATGAGCAAAAGAGCTTTGATGGATTATCGTTCCATGCGACTACGTGATGGTATCAGTGCCGCAACGATAAACCGTGACATGTACCGATTATCCGGCATGTTCACAAAATTAATTCAATTGGATGAATTTTCCGGGCAACACCCAATTCACGGACTGCCGCCACTGGCGGAGGCCAACCCTGAAATGACGTTCCTGGAAAAAGCAGAAATCGAAAAACTGTTAAATGTTTTGGATGGTGATGACTTACTTGTCGCACTTTTATGTCTGAGCACTGGAGGAAGATGGACGGAAGTTGCCACGCTAAAACCAGCACAGATTACAAATTGCAGGGTTACCTTCCTGAAAACCAAAAACGGTAAAAAGCGAACCGTGCCGATTTCTGAGGAACTGGAGAAAAAAGTTAAAGAGGAGGCCAGCGCTAAATTATTCAAAGTTGATTATGAGAAGTTTTGCGGGATTTTACGCAGAGTGAAGCCAGATATACCACCCAATCAGGCAACCCACATCCTGCGGCATACATTCGCAAGCCATTTCATGATGAATGGGGGCAATATAATCGCACTGCAACAGATTCTGGGACATGCGAGCATTCAGCAGACGATGGCCTATGCGCACCTTGCGCCTGACTACCTGCAAAATGCCGTCGCGCTGAATCCTCTAAAAGGCGGAGTGACGTTATAAATTTCCCTTCTGAGTGTCCACATAGTGTCCACACTCTCAGAACTTTGTAGCCCTTCCAGTCCCTTATAGGTTTTCTTAAGTTACTGTTTTCTTACGGAAACCGATGTAAGTGATTGATAAAAAAAACCCCCACATCATGTGGGGGAAGACAGGGATGGTGTCTATGGCAAGGAAAACAGGGTTTACTACTGGGAACGTGAGTTGCTACTACTCAATAGCTTCAACGATGAACTTTTTTGCCATTGCGTCACGTCGCGCAACTGCTCCATTCGTTGTTGATGTTTCTCGTTTAAAACCGCTTGCTGCTCCG
GCGTTAACAGGCGATACATTTGGTTGCGGACTTTTGCCATCTCAACCTGACGAGCAATTTGCTCATTCGCCATTTTTTCTGCCTGTGCGCGCACAGCGTTTTCATCAAAATTTTCTGCGGTGACAAGGCGATGCATTGTCTCCAGTTCGCTAACATTAACAGGAGGCTGTTCGTGCCGGGCCTGTTGCATAAGATCTCGCATCTGCTGACGCTGATGTTCGGTTAAACTTATGCCGTCGAACATATGGCTCTGCGTACTGCGCTGCGTAAGTTCTTCACCCGGATGCCAGTTATCGCCTGAACCGACTTCAGCAGCGTGGCTTAATGAACTGACTGCCAGCGTTGAGGCCATGACGGCAGCGGTAACTATGCGCATCATTTGCTCCCAAAATCTTTCTGTCGCGATTCAACGATAGAGAGTTTACGATTCAGGCTGCAAACATGCGTCAGGGGGTGTAAAACAACGTAAAGTCATGGATTAGCGACGTCTGATGACGTAATTTCTGCCTCGGAGGTATTTAAACAATGAATAAAATCCTGTTAGTTGATGATGACCGAGAGCTGACTTCCCTATTAAAGGAGCTGCTCGAGATGGAAGGCTTCAACGTGATTGTTGCCCACGATGGGGAACAGGCGCTTGATCTTCTGGACGACAGCATTGATTTACTTTTGCTTGATGTAATGATGCCGAAGAAAAATGGTATCGACACATTAAAAGCACTTCGCCAGACACACCAGACGCCTGTCATTATGTTGACGGCGCGCGGCAGCGAACTTGATCGCGTTCTCGGCCTTGAGCTGGGCGCAGATGACTATCTCCCGAAACCGTTTAATGATCGTGAGCTGGTGGCACGTATTCGCGCGATCCTGCGCCGTTCGCACTGGAGCGAGCAACAGCAAAACAACGACAACGGTTCACCGACACTGGAAGTTGATGCCTTAGTGCTGAATCCAGGCCGTCAGGAAGCCAGCTTCGACGGGCAAACGCTGGAGTTAACCGGCACTGAGTTTACCCTGCTCTATTTGCTGGCACAGCATCTGGGTCAGGTGGTTTCCCGTGAACATTTAAGCCAGGAAGTGCTGGGCAAACGCCTGACGCCTTTTGACCGCGCTATCGATATGCACATTTCCAACCTGCGTCGTAAACTGCCGGATCGTAAAGATGGTCACCCGTGGTTTAAAACCTTGCGTGGTCGCGGCTATCTGATGGTTTCTGCTTCATGATAGGCAGCTTAACCGCGCGCATCTTCGCCATCTTCTGGCTGACGCTGGCGCTGGTGTTGATGTTGGTTTTGATGTTACCCAAGCTCGATTCACGCCAGATGACCGAGCTTCTGGATAGCGAACAGCGTCAGGGGCTGATGATTGAGCAGCATGTTGAAGCGGAACTGGCGAACGATCCGCCCAACGATTTAATGTGGTGGCGGCGTCTGTTTCGGGCGATTGATAAGTGGGCACCGCCAGGACAGCGTTTGTTATTGGTGACCACCGAAGGCCGCGTGATCGGCGCTGAACGCAGCGAAATGCAGATCATTCGTAACTTTATTGGTCAGGCCGATAACGCCGATCATCCGCAGAAGAAAAAGTATGGCCGCGTGGAACTGGTCGGTCCGTTCTCCGTGCGTGATGGCGAAGATAATTACCAACTTTATCTGATTCGTCCGGCCAGCAGTTCTCAATCCGATTTCATTAACTTACTGTTTGACCGCCCGTTATTACTGCTGATTGTCACCATGTTGGTCAGTACGCCGCTGCTGTTGTGGTTGGCCTGGAGTCTGGCAAAACCGGCGCGTAAGCTGAAAAACGCTGCCGATGAAGTTGCCCAGGGAAACTTACGCCAGCACCCGGAACTGGAAGCGGGGCCACAGGAATTCCTTGCCGCAGGTGCCAGTTTTAACCAGATGGTCACCGCGCTGGAGCGTATGATGACCTCCCAGCAGCGTCTGCTTTCTGATATCTCTCACGAGCTGCGCACCCCACTGACGCGTCTGCAACTGGGT
ACGGCGTTACTGCGCCGTCGTAGCGGTGAAAGCAAGGAACTGGAGCGTATTGAAACCGAAGCGCAACGTCTGGACAGCATGATCAACGATCTGTTGGTGATGTCACGTAATCAGCAAAAAAACGCGCTGGTTAGCGAAACCATCAAAGCCAACCAGTTGTGGAGTGAAGTGCTGGATAACGCGGCGTTCGAAGCCGAGCAAATGGGCAAGTCGTTGACGGTTAACTTCCCGCCTGGGCCGTGGCCGCTGTACGGCAACCCAAACGCCCTGGAAAGTGCGCTGGAAAACATTGTTCGTAATGCTCTGCGTTATTCCCATACGAAGATTGAAGTGGGCTTTGCGGTAGATAAAGACGGTATCACCATTACGGTGGACGACGATGGTCCTGGCGTTAGCCCGGAAGATCGCGAACAGATTTTCCGTCCGTTCTATCGGACCGATGAAGCACGCGATCGTGAATCTGGCGGTACAGGTTTGGGGCTGGCGATTGTTGAAACCGCCATTCAGCAGCATCGTGGCTGGGTGAAGGCAGAAGACAGCCCGCTGGGCGGTTTACGGCTGGTGATTTGGTTGCCGCTGTATAAGCGGAGTTAA
Protein sequences of DBSCAN-SWA_8 >CP033401|3297668:3316187|3307374_3308412_+|AYQ02930.1|DBSCAN-SWA MIEIMITNLDSMPSNEPYLWADYIEILALTNIDRSFSRGDLYSTLQAQPEAVLAETDEAEEEGVYDVDDENDTPVRKRTKRSVSRAYTDRKWSYAIGFIRQRIDLFGDSYPFTLSEDNDTVELRDISEKPLEHLERLYLALLICANIKYVNIMSRREITRSFELISLPIFESLMPSGSIIKACWASGGQAAPYTGTLYNKFKSIASDIRCTANFKERDFSRGNSGDGGLDIIAWHPMGDQRDAIPISFVQCGCSQEEWEAKQLEASPAMLYSKFPVAHRWATYYFLPQDLRWIDGEWAHKNKLGDAIFVDRLRLINLTRASDNIDHSQNISYLDIILDPSSAIAA >CP033401|3297668:3316187|3313468_3313969_-|AYQ04336.1|DBSCAN-SWA MRIVTAAVMASTLAVSSLSHAAEVGSGDNWHPGEELTQRSTQSHMFDGISLTEHQRQQMRDLMQQARHEQPPVNVSELETMHRLVTAENFDENAVRAQAEKMANEQIARQVEMAKVRNQMYRLLTPEQQAVLNEKHQQRMEQLRDVTQWQKSSSLKLLSSSNSRSQ >CP033401|3297668:3316187|3302303_3303089_+|AYQ02923.1|DBSCAN-SWA MFSDNVTNAWWFISLYLFLLIALTFVTFGKSNLMRFIAHHFNLEYSDRKLKMLDKKWRDIQLFKIINGINVSGIEDVRMIQQGLIDGKLKTSYFFLTRFWGDITKPPHIIKTTIVILASIIYILFACYIHNEQSAIVRDAIGIPYKNMMYYVYSDKVLLSFKNKTVEFNKTYSLADCKSLQNVFIKDTLPEIACNKLLQLNKEDSEWLSQEIKDNNSYRKTLLIMSLTYFISGLLIFLSYTKFLYANKKVLEYKASNKNHS >CP033401|3297668:3316187|3306308_3307382_+|AYQ02929.1|DBSCAN-SWA MDNKIVEIETNKLDFDPKNPRFFRLNDASNAATVIEEMLDDESVHDLMLSIGQQGYFPGEPLLAVKSNGNYIVVEGNRRLAAVKLLNGDLLPPKRKLKGVQEIIDDTTNKPKKLPCIIYENREDVLRYIGYRHITGVKEWDSLSKAKYLKELCDTFYSHEPKEIVLKNLAREIGSKPHYVATLLTALNLYEVAHDHEFFNLPMKASDVEFSYITTALGYSKITNWLGLQDKKDFLDPNLNEENLKRLFSWFFVPDQQGRTIIGESRRIKDIAAVVEKPEAIEILMKSSNLDEAYLYTSGEREALDKALNAASVKLRVVWDMLLKAKELTLEHEEAASEIFEMSKNIRNQIRSKREDD >CP033401|3297668:3316187|3303160_3303613_-|AYQ02924.1|DBSCAN-SWA MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKTPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGEDVQMIEEIILAHLER >CP033401|3297668:3316187|3310844_3311345_-|AYQ02934.1|DBSCAN-SWA MTVMTLNLVEKQPAAMRRIIGKHLAVPRWQDTCDYYNQMMERERLTVCFHAQLKQRHATMRFEEMNDVERERLVCAIDELRGAFSKRRQVGASEYAYISFLTVSQRRTLFMHAGLTEKEFNQPYWRINEESCYWRDALFRALRELFSLFEYAPTILTSVKPEQYLH >CP033401|3297668:3316187|3303605_3304073_-|AYQ02925.1|tail|DBSCAN-SWA 
MLKPDSLRRALTDAVTVLKTSPEMLRIFVDNGSIASTLARSLSFEKRYTLNVIVTDFTGDFDLLIVPVLAWLRENQPDIMTTDAGQKKGFTFYADINNDSSFDISISLMLTERTLVSEVDGALHVKNIPEPPPPEPVTRPMELYINGELVSKWDE >CP033401|3297668:3316187|3311939_3312233_+|AYQ02936.1|DBSCAN-SWA MQTFEKLKAIRKAEGLTQAKFSEISGIALGTVKNYESGHKDPGLSIVMRVTNTPLFKKYTLWLMTGDTSPQAGQIAPALAHIGQKPTESDHSEKQTG >CP033401|3297668:3316187|3304593_3305019_-|AYQ02928.1|DBSCAN-SWA MKKLSLSLMLNVSLALMLALSLIYPQSVAVSFVAAWAILATVICVVAGGVGVYATEYVLERYGRELPPESLAVKIVTALFLQPVPWRRRSVALVVMVATFISLVAAGGIFTALIYLVASVFFRLIRTACCQRFEGREPCQS >CP033401|3297668:3316187|3304035_3304209_-|AYQ02926.1|lysis|DBSCAN-SWA MSLCPMPGSDPETNGDLSADIRQLENALARCASQVKMIKHCQDENDAQTRQPAQGAD >CP033401|3297668:3316187|3299134_3300034_-|AYQ02920.1|DBSCAN-SWA MSGIRITQAIDNNQILVNVNTFNYSVNPYPLKCPDPNCSAHLVYVKSHIRRSFNKTLHIPAFFRLEKNFTHDKLCQYGTSGLNTIYASDSSHDISRALATGNNIFRIHILDEDDISKLSRKAAAMQANPPSDTTDRVYVKRGRKAPYVKNMDSLREIYEYGKAHPNKRNSIKIVTGTSTVTWSDFFYETTQLERLSTYLQKVKIAQVAVIMKVHVARIPMAKFNYRHFIEGSPMRIKGGFNIYPTIQLGNVTPKLFPLSSIVMVLGKFTIPTNRKMIMDPIFEREVRTIVSSEEQVLIL >CP033401|3297668:3316187|3300249_3300777_-|AYQ02921.1|tail|DBSCAN-SWA MMHLKNIKAGNAKTLEQYELTKKHGVIWLYSEDGKNWYEEVKNFQPDTIKIVYDENNIIVAITRDASTLNPEGFSVVEVPDITSNRRADDSGKWMFKDGAVIKRIYTADEQQKLAELHKAALLSEAESVILPLERAVRLNMATDEERSRLEAWERYSVLVSRVDPANPEWPEMPQ >CP033401|3297668:3316187|3311514_3311787_-|AYQ02935.1|DBSCAN-SWA MEVNDYVIQYPIDAVHTVKFAELLGKPETAVVKMVKENKLPVIELRDPSKPNARVGEKWVFIPEFNRAVREAFYNRPVEQRDAWLLWMGL >CP033401|3297668:3316187|3314813_3316187_+|AYQ02939.1|DBSCAN-SWA MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELANDPPNDLMWWRRLFRAIDKWAPPGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQKKKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLLFDRPLLLLIVTMLVSTPLLLWLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQRLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQKNALVSETIKANQLWSEVLDNAAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRNALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIFRPFYRTDEARDRESGGTGLGLAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS 
>CP033401|3297668:3316187|3308408_3309347_-|AYQ02931.1|DBSCAN-SWA MPTVVSLFSGCGGSDAGVLNAGFNVLMANDILPYARDVYLENHPETDYILGDISGLQSFPSAELLIGCYPCQGFSQGGARKADRKINTLYLEFARALSKIKPKAFIVENVSGMVRRNFEHLLKDQFKVFEEAGYTVSSQILNASHYGVSQDRKRIFIVGIRKDYGITYKFPKPTHGDGLTPYSTIRDAIGHMPVWPVGEFYDADFHWYYLSRNRRQDWDQISKTIVANPRHMPLHPISPTLEKLGPDKWQFTSDAPARRFSYREAAILQGFGDLIFPETERASINMKYTVVGNAVPPPLFEAVARNLPDLWD >CP033401|3297668:3316187|3309589_3309796_-|AYQ02932.1|DBSCAN-SWA MHRIPGEIPHHKTKNIKLMAIVQRLQRIMVNENLTPDELVGCAEIVRDNYGRFNHIGQSRVAPPQRRR >CP033401|3297668:3316187|3297668_3297923_-|AYQ04335.1|DBSCAN-SWA MALSTKPEEVLAMFHCPLCQHAAHARTSRYITDTTKERYHQCQNVNCSATFITYESVQRYIVKPGEVHAVRPHPLPSGQQIMWM >CP033401|3297668:3316187|3309795_3310248_-|AYQ02933.1|DBSCAN-SWA MAKIHEVKLHAKYFDLVLEGKKRAEFRKNDRNYERGDTLILHEWVQGVYTGRKVEARITDVTDLSDWLEDYVLLSIERLNTGACEIVNWKELSERGLVFRINHEIMHQLGLAVMYEPETGMSGGAMVATDGAWNYSDEQMERAQQNGWLG >CP033401|3297668:3316187|3300778_3301780_-|AYQ02922.1|tail|DBSCAN-SWA MQKNGDTLSGGLTFENDSILAWIRNTDWAKIGFKNDADGDTDSYMWFETGDNGNEYFKWRSRQGTTTKDLMNLKWDALSVLVKALFSSEVKISTVNALRIFNSSFGAIFRRSEECLHIIPTRENEGENGDIGPLRPFSLNLRTGRITMGHALDVTGDITTNAWVYANRLAINSSTGMWIHMRDQNVIFGRNAVSTDGAQALLRQDHADRKFMIGGLGNKQFGIYMINNSRTANGTDGQAYMDNNGNWLCGSQVIPGNYGNFDSRYVKDVRLGSQQYYGVNNWQTWNFQCPSGHVLSGINVQDTGSNSADNIAGVYYRPVQKYINGTWYNVASV >CP033401|3297668:3316187|3314118_3314817_+|AYQ02938.1|DBSCAN-SWA MNKILLVDDDRELTSLLKELLEMEGFNVIVAHDGEQALDLLDDSIDLLLLDVMMPKKNGIDTLKALRQTHQTPVIMLTARGSELDRVLGLELGADDYLPKPFNDRELVARIRAILRRSHWSEQQQNNDNGSPTLEVDALVLNPGRQEASFDGQTLELTGTEFTLLYLLAQHLGQVVSREHLSQEVLGKRLTPFDRAIDMHISNLRRKLPDRKDGHPWFKTLRGRGYLMVSAS >CP033401|3297668:3316187|3304180_3304606_-|AYQ02927.1|lysis|DBSCAN-SWA MSKLMIVLVVLLSLAVAGLFLVKHKNASLRASLDRANNVASEQQTTITMLKNQLHVALTRADKNELAQVALRQELENAAKREAQREKTITRLLNENEDFRRWYGADLPDAVRRLHQRPACADASDCPQRLPESEPLPDAGQ >CP033401|3297668:3316187|3312302_3313283_+|AYQ02937.1|integrase|DBSCAN-SWA 
MSIKKLDDGRYEVDIRPRGRDGKRIRRKFERKAEAVAFERYTIAYASQKEWAGQRADRRTLSELLNIWWKYHGQNHEHGTKEFNHLLKTISGIGDIPVSRMSKRALMDYRSMRLRDGISAATINRDMYRLSGMFTKLIQLDEFSGQHPIHGLPPLAEANPEMTFLEKAEIEKLLNVLDGDDLLVALLCLSTGGRWTEVATLKPAQITNCRVTFLKTKNGKKRTVPISEELEKKVKEEASAKLFKVDYEKFCGILRRVKPDIPPNQATHILRHTFASHFMMNGGNIIALQQILGHASIQQTMAYAHLAPDYLQNAVALNPLKGGVTL |
22 | Escherichia_virus(30.0%) | tail,lysis,integrase | attL 3297511:3297557|attR 3313399:3313445 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
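The protein records above share a pipe-delimited header, for example `>CP033401|3297668:3316187|3312302_3313283_+|AYQ02937.1|integrase|DBSCAN-SWA`. The sketch below splits such a header into its parts. The field meanings (contig, region span, gene span with strand, protein accession, optional keyword) are inferred from the records in this report rather than taken from a format specification, and `ProteinHeader` and `parse_header` are hypothetical names.

```python
from typing import NamedTuple, Optional


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    protein_id: str
    keyword: Optional[str]  # e.g. 'integrase'; None when the field is absent


def parse_header(header):
    """Parse one protein header line (assumed layout, see the note above)."""
    fields = header.strip().lstrip(">").split("|")
    # Five fields without a keyword, six with one; the last is the tool tag.
    if len(fields) not in (5, 6) or fields[-1] != "DBSCAN-SWA":
        raise ValueError(f"unexpected header layout: {header!r}")
    contig, region, location, protein_id = fields[:4]
    keyword = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = location.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(gene_start), int(gene_end), strand,
                         protein_id, keyword)


if __name__ == "__main__":
    h = parse_header(
        ">CP033401|3297668:3316187|3312302_3313283_+|AYQ02937.1|integrase|DBSCAN-SWA"
    )
    print(h.protein_id, h.keyword, h.strand, h.gene_end - h.gene_start)
```

Because the records in this report run together on one line, a caller would first need to split the text at each `>` before passing headers to this function.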
DBSCAN-SWA_9 |
4622955 : 4636138
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >CP033401|4622955:4636138|DBSCAN-SWA TATGCGCATATTGCTGAGTAATGATGACGGGGTACATGCACCCGGTATACAAACGCTGGCGAAAGCCTTGCGTGAGTTTGCTGACGTTCAGGTGGTCGCCCCCGATCGTAACCGCAGCGGCGCTTCAAATTCTCTGACACTGGAATCCTCCCTGCGCACGTTTACCTTTGAAAATGGTGATATTGCTGTGCAAATGGGAACCCCGACCGATTGCGTCTATCTTGGCGTGAATGCTCTGATGCGTCCGCGCCCGGACATTGTTGTGTCCGGAATTAACGCCGGGCCGAATCTGGGGGATGATGTTATTTATTCCGGTACGGTAGCCGCCGCGATGGAAGGCCGTCATTTAGGTTTTCCGGCGCTTGCCGTCTCGCTTGACGGGCATAAACATTACGACACTGCCGCGGCGGTAATCTGTTCAATTTTGCGCGCACTGTGTAAAGAGCCGCTGCGCACCGGGCGTATTCTTAATATTAACGTTCCGGATTTACCCTTGGATCAAATCAAAGGTATTCGCGTGACGCGCTGCGGTACACGACATCCGGCAGATCAGGTGATCCCGCAGCAAGATCCGCGCGGCAATACGCTGTACTGGATTGGCCCGCCGGGCGGTAAATGTGATGCTGGTCCGGGGACCGATTTTGCTGCGGTAGATGAGGGCTATGTCTCCATCACGCCGCTGCATGTGGATTTAACTGCGCATAGCGCGCAAGATGTGGTTTCAGACTGGTTAAACAGCGTGGGAGTTGGCACGCAATGGTAAGCAGACGCGTACAAGCACTTCTGGATCAATTACGTGCGCAAGGTATTCAGGATGAGCAGGTGCTGAATGCACTTGCCGCCGTGCCGCGTGAAAAATTCGTTGATGAAGCGTTTGAACAAAAAGCCTGGGACAATATCGCTTTGCCGATAGGTCAGGGGCAGACAATTTCGCAGCCATATATGGTGGCGCGAATGACCGAATTACTCGAGCTGACGCCGCAGTCGCGGGTGCTGGAAATTGGCACCGGTTCGGGATATCAAACGGCAATCCTGGCGCATCTTGTCCAGCATGTTTGCTCGGTTGAACGGATTAAAGGCTTGCAGTGGCAGGCACGTCGCCGCCTGAAAAATCTTGATTTACATAATGTTTCAACCCGTCATGGCGATGGATGGCAAGGTTGGCAGGCACGTGCGCCGTTTGACGCTATCATTGTTACGGCGGCACCGCCGGAAATTCCAACTGCGCTAATGACGCAGCTGGACGAAGGCGGGATTCTCGTCTTACCCGTAGGGGAGGAGCACCAGTATTTGAAACGGGTGCGTCGTCGGGGAGGCGAATTTATTATCGATACCGTGGAGGCCGTGCGCTTTGTCCCTTTAGTGAAGGGTGAGCTGGCTTAAAACGTGAGGAAATACCTGGATTTTTCCTGGTTATTTTGCCGCAGGTCAGCGTATCGTGAACATCTTTTCCAGTGTTCAGTAGGGTGCCTTGCACGGTAATTATGTCACTGGTTATTAACCAATTTTTCCTGGGGGATAAATGAGCGCGGGAAGCCCAAAATTCACCGTTCGCCGCATTGCGGCTTTGTCACTGGTTTCGCTATGGCTGGCAGGCTGTTCTGACACTTCAAATCCACCGGCACCGGTCAGCTCCGTTAATGGCAATGCGCCTGCAAATACTAATTCTGGTATGTTGATTACGCCGCCGCCGAAAATGGGGACGACGTCTACAGCGCAGCAACCGCAAATTCAGCCGGTGCAGCAGCCACAAATTCAGGCTACTCAACAACCGCAAATCCAGCCAGTGCAGCCAGTAGCTCAGCAGCCGGTACAGATGGAAAACGGACGCATCGTCTATAACCGTCAGTATGGGAACATTCCGAAAGGCAGTTATAGCGGCAGTACCTATACCGTGAAAAAAGGCGACACACTTTTCTATATCGC
CTGGATTACTGGCAACGATTTCCGTGACCTTGCTCAGCGCAACAATATTCAGGCACCATACGCGCTGAACGTTGGTCAGACCTTGCAGGTGGGTAATGCTTCCGGTACGCCAATCACTGGCGGAAATGCCATTACCCAGGCCGACGCAGCAGAGCAAGGAGTTGTGATCAAGCCTGCACAAAATTCCACCGTTGCTGTTGCGTCGCAACCGACAATTACGTATTCTGAGTCTTCGGGTGAACAGAGTGCTAACAAAATGTTGCCGAACAACAAGCCAACTGCGACCACGGTCACAGCGCCTGTAACGGTACCAACAGCAAGCACAACCGAGCCGACTGTCAGCAGTACATCAACCAGTACGCCTATCTCCACCTGGCGCTGGCCGACTGAGGGCAAAGTGATCGAAACCTTTGGCGCTTCTGAGGGGGGCAACAAGGGGATTGATATCGCAGGCAGCAAAGGACAGGCAATTATCGCGACCGCAGATGGCCGCGTTGTTTATGCTGGTAACGCGCTGCGCGGCTACGGTAATCTGATTATCATCAAACATAATGATGATTACCTGAGTGCCTACGCCCATAACGACACAATGCTGGTCCGGGAACAACAAGAAGTTAAGGCGGGGCAAAAAATAGCGACCATGGGTAGCACCGGAACCAGTTCAACACGCTTGCATTTTGAAATTCGTTACAAGGGGAAATCCGTAAACCCGCTGCGTTATTTGCCGCAGCGATAAATCGACGGAACCAGGCTTTTGCTTGAATGTTCCGTCAAGGGATCACGGGTAGGAGCCACCTTATGAGTCAGAATACGCTGAAAGTTCATGATTTAAATGAAGATGCGGAATTTGATGAGAACGGAGTTGAGGTTTTTGACGAAAAGGCCTTAGTAGAAGAGGAACCCAGTGATAACGATTTGGCCGAAGAGGAACTGTTATCGCAGGGAGCCACACAGCGTGTGTTGGACGCGACTCAGCTTTACCTTGGTGAGATTGGTTATTCACCACTGTTAACGGCCGAAGAAGAAGTTTATTTTGCGCGTCGCGCACTGCGTGGAGATGTCGCCTCTCGCCGCCGGATGATCGAGAGTAACTTGCGTCTGGTGGTAAAAATTGCCCGCCGTTATGGCAATCGTGGTCTGGCGTTGCTGGACCTTATCGAAGAGGGCAACCTGGGGCTGATCCGCGCGGTAGAGAAGTTTGACCCGGAACGTGGTTTCCGCTTCTCAACATACGCAACCTGGTGGATTCGCCAGACGATTGAACGGGCGATTATGAACCAAACCCGTACTATTCGTTTGCCGATTCACATCGTAAAGGAGCTGAACGTTTACCTGCGAACCGCACGTGAGTTGTCCCATAAGCTGGACCATGAACCAAGTGCGGAAGAGATCGCAGAGCAACTGGATAAGCCAGTTGATGACGTCAGCCGTATGCTTCGTCTTAACGAGCGCATTACCTCGGTAGACACCCCGCTGGGTGGTGATTCCGAAAAAGCGTTGCTGGACATCCTGGCCGATGAAAAAGAGAACGGTCCGGAAGATACCACGCAAGATGACGATATGAAGCAGAGCATCGTCAAATGGCTGTTCGAGCTGAACGCCAAACAGCGTGAAGTGCTGGCACGTCGATTCGGTTTGCTGGGGTACGAAGCGGCAACACTGGAAGATGTAGGTCGTGAAATTGGCCTCACCCGTGAACGTGTTCGCCAGATTCAGGTTGAAGGCCTGCGCCGTTTGCGTGAAATCCTGCAAACGCAGGGGCTGAATATCGAAGCGCTGTTCCGCGAGTAAGTAAGCATCTGTCAGAAAGGCCAGTCTCAAGCGAGGCTGGCCTTTTCTGTGCACAATAAAAGGTCCGATGCCCATCGGACCTTTTTATTAAGGTCAAATTACCGCCCATACGCACCAGGTAATTAAGAATCCGGTAAAACCGAGAATGGTCGTTAACACTGTCCAGGTTTTCAGACCGTCTGCTACCGACAACCCCAGATATTTGGTCA
CAATCCAGAACCCTGAGTCATTAATATGTGACGCGCCAAGCCCACCAAAGCAGGCTGCCAGCGTCACCAATACGCACTGAATCGGATTCAATCCCATCACCGCTTCTGAGAGTAACCCGCCGGTTGTCAGGATTGCTACGGTTGCTGACCCCTGCGATGCACGCAGAGCCAGTGAAATAATAAATGCGGCTGGTAACAGAGGCAGGTCAATCATTTGTAGCATATTGGCAAGGGCTTTGCCGACGCCCGATTCCACCAGCACTTTGCCAAATACCCCTCCAGCACCAGTAACCAAAATCACTACCGCCGCAGTGGGAAGGGCTGAGCCCATAATGTCGCTGGTGTGTTGTAAGCTCCAGCCGCGACGTAAAGCCAATAACCAGAATGCCAGCACCAGCGCAATCATTAGAGCTACCATTGGTGAGCCGATCAGCTGTAGCGTACCAAGCAGGGGATGCGAAGGCGGCATCAGTGTTGCGGAAACCGTACCCGCCATGATAATCGCGATAGGAATAACAATTAGCGAGGTGACCAGCGCGACGCCCGGTGGATTTATTTTATCGCTTAATTTTGTCGCGCCTTCCTCACTGGCCGGAGCCAGTTGCATCTGTTCCAGTACTTCCACCGACATCGCATATTGGTGCTTATTGATTATTTTCGCTGCAAAGTAGCCAACAATCCCTACGGGAATAGAAATCGCAATACCGATGATGGTTAGCCAGCCGATGTCTGCGTGGAGTAACCCCGCTGCGGCGACAGGGCCTGGATGCGGCGGTACCGCCACATGTACAGTTAGCATGATCCCAGCGACAGGCAGGCCAAATTTGAGTGGCGATATTTTGGCAACCTTGGCAAAACCGTAAATGATTGGCGCAAGAATAATAAAGCCGACATCAAAGAAGACGGGAATACCGAGGAAGAACGCTGCCAGAGTCAGCGCAGCGATAGTTCGTTTGTCACCTAACTTGCGACTGAAATAATTAGCCAGTGACTCTGCACCACCAGAGTGTTCGATCATACGCCCCAGCATAGCGCCCAGACCAATAATAATAGTGACGGAACCAAGCACACCGCCCATCCCGGCGATCATCACTTTACCCACTTCGCCCGCCGGTATACCCGCCGCAAGTGCGACTAACAGGCTGACGAGGAGCAGAGCAACGAATGGTTGTACCTTTGCCTTGATGACCAGCAGCAACAGCATGATTACGCCAGTTAACGCAATGCATAACAATGTAATTGTGGACATGGGAAACCCTGTCTGAAAGTTATAGTTAACCTACCCCATCCGTAGATGGGGGGATGTATGGGTACGTTGTAATTAGGGATTTAACGAATTAGCGCCAGGCGTCAAACCAGCCAAGCCCTTCTTCGGTGAGGCCACGAGGTTTATATTCACAACCGATCCAGCCCTGATATCCCACCTCATCGAACAGGCGGAACAGCCACGGATAGTTGATTTCTCCATCGTCCGGTTCATGTCGATCAGGTAGTCCGGCAATTTGTACGTGCGCATATTTCCCGGCGTAGTCGCGGATTAAATGCGTCAGGTTGCCATCTACTTTTTGCGCATGAAAAGTATCTAGTTGAATAAACACGTTATCTCGCGCAACCTCTTCAACAATAGCCAGTGCCTGATACTGGCTGGAGAAGAGATAATGAGGCTTAACGTCGGGGCTGAGTGCTTCAACTAATATTCGCTTGCCGTGTAGCGCAAAGCGGTCGGCAGCGTAGCGGAGATTATCGATAAATACTGCCCGGTACCGTTCAGCGTCTTCGCCAGCGGGCACGACGCCTGCCATCACATGGACTTGTTCACAATTGAGCGCCAATGCATATTCCAGTGCCAGGTCGATGTCTGCGTGTGCTTCGTGCTCACGTCCGGGAAGGGCGGATAATCCCCATTCCCCCGCATTAATATCTCCGGGAGCGGTATTGAACAGCGCCAGTGTCAGATGGTTTTGCTCCAGTTGCTTTTGGATTTGCAGGGTGGAGTAGTC
ATAGGGAAACAGAAATTCCACAGCATCGAACCCGGCTTTTCGCGCTGCGGCGAAGCGTTCAATAAAAGGCACTTCGGTGAACATCATGGATAAATTAGCTGCAAAACGAGGCATTGCATTAACTCCTTAATTCCGCAATTTCACCTGCGGTCAGATAACGGATCGGGCGGTCACCGAGAATAAAAATCAGCTTTGCCGTTTCCTCCAGCTCTTCCATATTGTTGGCGGCTTCTTGCAGGCTTTCACCGCAAACCACTGGGCCATGATTTGCCAGTAAAAAAGCCTGATTGTCTGCTGCCAGTTCCGCCAGATCCTGTGCGATGCGTTTATCGCCCGGTCGGTAATAAGGCACCAGCGGGACATTTCCCATCCGCATCACCACGTATGGTGTGAACGGACGAATAACGTTGCTGCTGTCCAGCCCTTGCAGGCAGGAAAGCGCCGTCGACCATGTGCTGTGCAAATGCACCACCGCTTTACAGCGCGGATTGTTGCGATACAGCGCCAGATGAAAGAGCACCTCTTTCGAGGGTTTGTCACCACTTAACCATTCGCCATCCGCGGCGACTTTGGAAAGCCGCTGCGGATCGAGATTGCCCAGGCATGAACCTGTCGGTGTCGCCAGTAAATTCCCGTCAGGTAAAAGCAGCGACAGATTGCCAGCCGAACCGGTTGCATAGCCGCGCTGAAAGAATGAACTGGCAATCCGCGTCATCTCCTCTCGCAAAGACTGCTCTACTTTTGCGAAATCGCTCATGATAAAAACTCTCTTTGGGCTCGTGAAAAAAAGGCTTCATCACCGAAGTTGCCAGATTTAAGGGCGAGTGAGACAGGCTTATCCAGTGCGTTTACCCACGGCACGCCGGGGGAAATGGTTGGGCCAATATGAAACCCTTTTATGCCCAGGCTCTGTGTGACTACGCCGGAGGTCTCACCGCCTGCGACAATAAAGCGTGTCACGCCTTCCGCTGCTAACCGCGCCGCTAGTTGAGAAAACAGAGTTTCTACTGCCTGACTGGCTTTTTGTGCACCGTATTGCTGTTGAATTGCTGCCAATGCGTCAGTGCTGGCGGTGGCAAAAACCAGTGGAGCAAGTACACTTTCCTGGCCCAGAACCCACTCTGCCAGTTCGTGTGCATAAGCGGCCAGAGTTTCGGTTGAGAGGCAGCGTGCCACATCAACTTCACGGGCTGGTGCAATTTGACGGTAATGTGCCACCTGGCGGTTGGTCATTTGAGAGCATGAACCGGAGAGCACTACGCCGCGCCCAGCGAGCGGATGCCCTGCTTCGCGAGCCTGGTTACCGTTTTCTTGCGCCCACTGCCGGGCCAGGCCAATCGCCAGACCAGAACCGCCCGTTACCAGTGGGGCATCGCGCAAGGCTTCTCCCTGAATTTCCAGATGGTGTTCGGTCAGCGCATCAAGCACCGCGTAGCGGTAGCCCTCTTGCTGTAAGCGAGCCAGCTCTTGACGAACGGCATCCACACCTTGTTCGAAAACATGTGCCGAAACGACGCCGCAGCGCCCTGTGGATTGCGCTTCAACCAGACGGGGAAGATAGCTGTCGGTCATGGGATTTACCGGGTGATGGCGCATCCCGGATTCGGCCAGCAGTTGATTCATTACGAACAAATACCCCTGATAAACCGTACGTCCGTTGACCGGCAGGGCCGGAGAGAAGACCGTAAACGGCGTGTCGAGAGCATCCATTAATGCATCGGTAACCGGGCCGATATTACCTTTCGCCGTACTGTCGAAAGTAGAGCAGTATTTGAAATAGATCTGTTTGCAACCTTGCTGTTGCAACCAGCTCAGAGCCGCCAGCGATTGCTGTGTGGCTTCAACCACCGGACAGGAGCGCGTTTTCAGGCTGATCACCAGTGCGTCGATTGCTTCCGGCATTTTACCTGTTGGAACACCGTTAATTTGTACCGTTGGTAGACCGTTTTCCACCAGAAAACTGGCGATATCCGTCGCGCCGGTAAAATCATCGGCGATAA
CGCCAATCTTGATCATGATTTCGCTCCCGGTAGAGTGATGCCAGAGAAAATCTTGATAACTGCGCTATCGTCTTCTTTCCCGTAACCTGCGTTACTGGCGCTGGTGAACATATTCAATGCTGTTGAGGCCAATGGCAGCGGGAAGTGCAGGGCTTTGGCTGTATCGGCAACCAGACCAAGATCCTTAACAAAAATATCGACGGCTGAATGCGGGGTGTAATCGCCATCCACCACATGACGCATCCGGTTTTCGAACATCCAGGAATTTCCGGCGGCATTGGTCACGACGTCATACATCACATCCAGCGGGATCCCCGCACGGGCTGCAAGTGCCATCGCTTCGGCTCCGGCAGCAATATGTACGCCCGCTAACAACTGGTGAATAATTTTTACGGTCGAACCTAGTCCCGGTTCTGCACCTATGCGATAAACTTTTCCGGCAACGGCTTCCAGCACGGGTGCCAGTCGTTCAAAGGCAATATCGCTACCGGAGGCCATGACAGTCATTTCACCGTTAGCGGCTTTTACTGCACCACCAGAAACTGGCGCATCCAGCATTTCCAGATCGAATCCAGCCAGAGCGGTAGCAATTTCTTGCGCATCAGCACTAGCGATAGTGGAAGAAACCATTACTGCCGTACCGGGTTTCAGATGTTGTGCAACGCCTGTTTCACCAAACAGCACCTGTTTAACCTGGGCCGCATTGACCACCAGCACCAGCAGAGCGTCCAGTTTTTCGGCAAACGTCGCGGCGTTATCAGAAACCCCGCAAGCACCTGCCTCTTTCAACGTAGCGCAGGCATTGCTGTTCAGGTCTGCGCCCCAGGTAGAAAGACCTGCGCGGACACATGACAGTGCTGCTCCCATTCCCATTGACCCTAAGCCAACGATACCGACATGAAACTCAGATCCCGTTTTCATCTGCTCTCCTTGTTAATTTAAGTGATATTTTGTTTGATATTGTGAATATAAGCGCTGGAAGATAACGATATGGTGAGCTGATTCACATAAATTAACATTGTGTGTTATTTTATGTGAACTAAGCGTTAGTTGCGCCGCGCACGTTTCGCAGGCAAATAGCGTAGAATGTCAGCAGGACAAAGGAAGGAGCAAAAGTTGATACCCGTAGAGCGTCGCCAAATCATCCTTGAGATGGTAGCTGAAAAAGGCATTGTCAGTATTGCTGAACTAACGGACAGAATGAATGTGTCACATATGACCATTCGTCGGGATTTACAAAAACTGGAGCAGCAGGGAGCCGTTGTGCTGGTGTCCGGAGGCGTCCAGTCTCCGGGACGCGTGGCGCATGAACCTTCTCATCAGGTAAAAACTGCGCTGGCAATGACGCAAAAAGCGGCTATTGGCAAGCTGGCGGCAAGTCTTGTTCAGCCGGGAAGCTGTATCTATCTGGATGCGGGAACGACCACGTTAGCGATAGCACAGCATCTGATTCACATGGAGTCACTGACTGTGGTCACAAACGATTTCGTTATTGCGAACTACTTGCTCGACAACAGTAATTGCACAATTATTCACACTGGCGGTGCAGTGTGTCGGGAAAACCGTTCCTGTGTCGGGGAAGCCGCTGCGACCATGCTGCGCAGCCTGATGATTGATCAGGCTTTTATTTCTGCATCGTCATGGAGTGTGCGGGGGATTTCTACGCCAGCGGAAGATAAAGTCACGGTGAAACGGGCGATTGCCAGTGCCAGCCGCCAGCGAGTTTTGGTCTGTGATGCGACGAAATATGGTCAGGTGGCGACATGGCTGGCGTTACCATTAAGCGAGTTTGATCAGATTATCACAGACGACGGTTTGCCGGAGAGTGCCAGTCGCGCGCTGGCGAAGCAGGATCTCTCTTTGCTGGTAGCGAAAAATGAATAATGGCCTGCAATAACATTTGGTTACTCATGCTTCACAGAAGAAGCATGAGACTACTTTATTTTATAAAATGACAGCCGCCCGCTTTTCGGCGTGCCGGTATCAATATAAATCTGGTT
AGCGAACGTCTGAATGTTATCAAACATCATATGTCCAAATATAAAATAATCAGCGCCGTTTATTTGTTGTAACTCGCCATTAAGCGATTTCTGCACACGATCAACAGGCCAGAGTAATTCGCTCTCCGCTATTTCTTTACCAAAGAGATATTCACTCCCCGGATAATCTGCATGTGCGATGACATATTTTATGTTGTCGTTAGTGATTTCAATAATATGTGGAAGGTGATGGAATTTCAGCAACAGATCTATTGCCTCTTGTTGCTCTGAATCATTTAAATCGAAAAACCAGTCACCACCGCTGGCAAGCCACATATTGCCATCGCCAGTTTCGAATGCATCCAACGCCATCGCTTCGTGGTTGCCTTTAACCGACGTAAACCAGGGTTGGTTTAGCAGGCGCAGCACGTTAAGACTCTTCGGCCCACGATCAATATTATCGCCTGTGGAAATAAGTAAGTCGGTTTCAGGGTAAAAAGAGAGTTGATGTAAACGGGATTGTAATAACTGATATTCACCATGAATATCACCAACGACCCATATATGGCGATAGTGATGGGCATTGATTTTTTGATAGCGTGTAGATGGCATGGTTTTACCCTGTAAAATAAGCTTTCCTATTATACAGGGTATTTTTATTTGATTCGTCAGTTGTCGTTAATATTCCCGATAGCAAAAGACTATCGGGAATTGTTATTACACCAGACTCTTCAAGCGATAAATCCACTCCAGCGCCTGACGCGGAGTCAGTGAATCCGGGTCGAGATTTTCCAGTGCTTCGACCGCAGGCGAAGTTTCTTCTGGCACTGACAGCAAAGACATCTGCGTACCATCCACTTGCGTCGCGGCGGCGTTCGGCGAAATGCTTTCCAGCTCACGCAGTTTTTGCCGTGCGCGCTTAATCACCTCTTTTGGCACGCCCGCCAGAGCTGCAACCGCCAGGCCGTAGCTTTTGCTCGCCGCGCCATCCTGCACGCTATGCATAAAGGCAATGGTGTCGCCGTGCTCCAGCGCATCGAGATGCACGTTGGCGACGCCTTCCATTTTCTCCGGTAACTGGGTCAGCTCGAAATAGTGGGTAGCAAATAACGTCAATGCCTTAATCTTATTCGCCAGATTTTCCGCGCACGCCCACGCCAGCGACAGACCATCGTAGGTGGACGTTCCACGCCCGATCTCATCCATTAACACCAGACTGTATTCGGTGGCGTTATGTAAAATATTGGCGGTTTCAGTCATCTCCACCATAAAGGTTGAGCGCCCGGAAGCCAGGTCATCTGCCGCGCCTACGCGGGTAAAGATGCGATCGATAGGTCCAATCTCGACTTTTTGTGCCGGTACATAGCTGCCGATGTAGGCCATCAGCGCAATCAGTGCGGTCTGGCGCATATAGGTACTTTTACCGCCCATGTTCGGGCCGGTAATAATCAACATACGGCGCTGCGGCGACAGATTCAGCGGGTTAGCGATAAATGGCTCATTCAGTACTTGTTCAACTACCGGATGGCGACCTTCGGTAATGCGAATGCCCGGTTTATCAATGAAGGTCGGGCAGGTGTAGTTCAGGGTATAGGCCCGTTCCGCCAGGTTAACCAGCACGTCGAGTTCCGCCAGCGCGCTCGCGCTCTGTTGCAACGCTTCCAGATGCGGCAACAGCAGGTCGAACAGCTCTTCATAAAGCTGTTTTTCCAGTGCCAGTGCTTTGCCTTTTGAGGTGAGAACTTTATCTTCGTACTCTTTTAGCTCTGGAATGATGTAGCGCTCGGCGTTTTTCAGCGTCTGGCGACGCATGTAGTTGATGGGTGCCAGATGGCTTTGCCCACGGCTGATTTGAATGTAGTAGCCGTGCACCGCATTAAAGCCAACTTTCAGCGTGTCCAGGCCGGTACGTTCACGCTCGCGGACTTCCAGACGCTCCAGATAATCGGTCGCGCCGTCAGCCAGCGCGCGCCACTCATCCAGCTCTTCGTTATAGCCCGATGCGATAACACCACCGTCGC
GTACCAGCACCGGCGGTGTGTCGATGATTGCTCGCTCCAGCAGATCGCGCAGCTCGGCAAACTCGCCCATCTTCTCACGTAGCGCCTGTACCGGTGCACTATCGACAGTTTCTAACTGCGCACGCAGCTCCGGCAGTTGCTGGAAAGCGTGGCGCATACGGGCCAGATCGCGTGGGCGAGCAGTTCGTAAAGCCAGACGTGCCAGAATACGTTCCAGGTCGCCGACCTGACGCAGTACCGGCTGCAACTCGGCGGTGAAATCCTGCAATGCGCCAATAGTTTGCTGGCGCTCAAGCAACACGCGGGTATCGCGCACTGGCATATGCAGCCAGCGTTTCAGCATACGACTACCCATCGGCGTGACGGTGCAGTCGAGCACAGAAGCCAGCGTATTTTCCGCACCGCCCGCCAGGTTCTGAGTAATTTCCAGGTTACGACGTGTCGCGGCATCCATAATGATGCTGTCCTGCTCACGTTCCATGGTGATAGAACGAATATGCGGCAGGGTCGTGCGTTGGGTATCTTTCGCATACTGCAACAGACAACCGGCAGCACAAAGTCCGCGCGGCGCGTTCTCGACGCCAAAACCGACCAGATCGCGGGTGCCAAATTGCAGATTCAACTGCTGGCGCGCGGTGTCGATTTCAAACTCCCACAGCGGGCGACGGCGCAGGCCGCGACGGCCTTCAATCAGCGACATCTCGGCGAAATCTTCTGCATACAGCAGTTCCGCCGGATTAGTGCGTTGCAGCTCTGCCGCCATCGTTTCGCGGTCGGCCGGTTCGCTCAGACGAAAACGCCCGGAGCTGATATCCAGCGTCGCGTAGCCGAAACCTTTGCTGTCCTGCCAGATAGCCGCCAGCAGGTTGTCCTGACGCTCCTGTAACAGGGCTTCATCGCTGATGGTGCCTGGCGTAACGATACGCACAACTTTGCGCTCAACCGGACCTTTGCTGGTCGCCGGATCGCCAATTTGTTCGCAGATGGCAACGGACTCTCCCTGATTCACCAGTTTGGCGAGATAGTTTTCCACCGCATGGTAGGGAATCCCCGCCATCGGGATCGGCTCTCCCGCCGAAGCACCGCGTTTGGTCAGTGAAATATCCAGCAGTTGCGACGCGCGTTTTGCGTCGTCATAAAACAGTTCATAAAAATCACCCATCCGGTAAAACAGCAGGATCTCGGGATGCTGGGCTTTCAGCCTGAGATACTGCTGCATCATGGGCGTATGGGCGTCGAAATTTTCTATTGCACTCAT
Protein sequences of DBSCAN-SWA_9 >CP033401|4622955:4636138|4626764_4628129_-|AYQ04100.1|DBSCAN-SWA MSTITLLCIALTGVIMLLLLVIKAKVQPFVALLLVSLLVALAAGIPAGEVGKVMIAGMGGVLGSVTIIIGLGAMLGRMIEHSGGAESLANYFSRKLGDKRTIAALTLAAFFLGIPVFFDVGFIILAPIIYGFAKVAKISPLKFGLPVAGIMLTVHVAVPPHPGPVAAAGLLHADIGWLTIIGIAISIPVGIVGYFAAKIINKHQYAMSVEVLEQMQLAPASEEGATKLSDKINPPGVALVTSLIVIPIAIIMAGTVSATLMPPSHPLLGTLQLIGSPMVALMIALVLAFWLLALRRGWSLQHTSDIMGSALPTAAVVILVTGAGGVFGKVLVESGVGKALANMLQMIDLPLLPAAFIISLALRASQGSATVAILTTGGLLSEAVMGLNPIQCVLVTLAACFGGLGASHINDSGFWIVTKYLGLSVADGLKTWTVLTTILGFTGFLITWCVWAVI >CP033401|4622955:4636138|4631996_4632764_+|AYQ04105.1|DBSCAN-SWA MIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGSCIYLDAGTTTLAIAQHLIHMESLTVVTNDFVIANYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALAKQDLSLLVAKNE >CP033401|4622955:4636138|4624476_4625616_+|AYQ04098.1|DBSCAN-SWA MSAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMGTTSTAQQPQIQPVQQPQIQATQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSGSTYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVGQTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPTASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRVVYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR >CP033401|4622955:4636138|4633576_4636138_-|AYQ04107.1|DBSCAN-SWA 
MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >CP033401|4622955:4636138|4628217_4628994_-|AYQ04101.1|DBSCAN-SWA MPRFAANLSMMFTEVPFIERFAAARKAGFDAVEFLFPYDYSTLQIQKQLEQNHLTLALFNTAPGDINAGEWGLSALPGREHEAHADIDLALEYALALNCEQVHVMAGVVPAGEDAERYRAVFIDNLRYAADRFALHGKRILVEALSPDVKPHYLFSSQYQALAIVEEVARDNVFIQLDTFHAQKVDGNLTHLIRDYAGKYAHVQIAGLPDRHEPDDGEINYPWLFRLFDEVGYQGWIGCEYKPRGLTEEGLGWFDAWR >CP033401|4622955:4636138|4622955_4623717_+|AYQ04096.1|DBSCAN-SWA MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW >CP033401|4622955:4636138|4628998_4629637_-|AYQ04102.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS >CP033401|4622955:4636138|4625678_4626671_+|AYQ04099.1|DBSCAN-SWA 
MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILADEKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQGLNIEALFRE >CP033401|4622955:4636138|4629633_4630896_-|AYQ04103.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQAREAGHPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSTETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPGVPWVNALDKPVSLALKSGNFGDEAFFSRAQREFLS >CP033401|4622955:4636138|4632814_4633471_-|AYQ04106.1|DBSCAN-SWA MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFYPETDLLISTGDNIDRGPKSLNVLRLLNQPWFTSVKGNHEAMALDAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDNIKYVIAHADYPGSEYLFGKEIAESELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGTPKSGRLSFYKIK >CP033401|4622955:4636138|4630892_4631801_-|AYQ04104.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNAAQVKQVLFGETGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGAEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS >CP033401|4622955:4636138|4623710_4624337_+|AYQ04097.1|DBSCAN-SWA MVSRRVQALLDQLRAQGIQDEQVLNALAAVPREKFVDEAFEQKAWDNIALPIGQGQTISQPYMVARMTELLELTPQSRVLEIGTGSGYQTAILAHLVQHVCSVERIKGLQWQARRRLKNLDLHNVSTRHGDGWQGWQARAPFDAIIVTAAPPEIPTALMTQLDEGGILVLPVGEEHQYLKRVRRRGGEFIIDTVEAVRFVPLVKGELA |
12 | Escherichia_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
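The keyword rule described above can be illustrated with a minimal sketch. The keyword list is taken from the note; everything else is an assumption, since the page does not say how the match is performed. In particular, the case-insensitive substring match and the function names `matched_keywords` and `is_phage_related` are hypothetical and are not part of DBSCAN-SWA or any other tool named here.

```python
# Keywords quoted in the note above, lowercased for case-insensitive matching.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def matched_keywords(annotation: str) -> list[str]:
    """Return the phage-related keywords found in a protein annotation.

    Assumption: a keyword counts as present if it occurs anywhere in the
    annotation text, ignoring case (so 'endolysin' matches 'lysin').
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


def is_phage_related(annotation: str) -> bool:
    """True if the annotation would be highlighted under the assumed rule."""
    return bool(matched_keywords(annotation))


if __name__ == "__main__":
    for desc in ("Phage tail fiber protein",
                 "DNA mismatch repair protein MutS",
                 "tRNA-Arg"):
        print(desc, "->", matched_keywords(desc))
```

A substring match is the simplest reading of the note, but it will also flag unrelated annotations that happen to contain a keyword, so a whole-word match may be closer to what the page actually does.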
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|