Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP032485 | Neokomagataea tanensis strain AH13 = NBRC 106556 chromosome, complete genome | 0 crisprs | DinG,csa3 | 0 | 0 | 3 | 0 |
NZ_CP032486 | Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence | 1 crisprs | cas5f,cas7f,cas1,cas3f | 0 | 9 | 9 | 0 |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1375106 : 1396207
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP032485|1375106:1396207|DBSCAN-SWA CTCAGTTGGAGTTGGCCCGTGGGTGGGCCTTCCGCCAGACGTCCAGCAGGTGTTTTTCGTCGGCTAAAGTGTACGCTTGAGTTGTTGAGAGGCTGGCATGCCCCATGAGTTCCTGAATGGCACGAAGGTCAGCGCCACCTTCCAGCAGATGTGTTGCGAAGGAATGGCGCAAAGCATGTGGGGTAGCGGAATCAGGAAGGCCCTCTGCTTTGCGCCATTCGCGCATGGAGCGTTGTGCAACGCCGGGGTTTAGGCGTCCGCCACGAATGCCGCAGAAAAGTGGTGCGTCTGGCGATGGGGAGGGATGTGCGCTGCGCCATTTTTCAAGGGCTGTTTGTACGGCCGGGAGGAGAGGGACAAGGCGTTCTTTGCCGCCTTTGCCTCGGACCATAATAGTATCCGCCCCGGTACGACCAATATCATTTATATTGAGCGAGAGAGCTTCACCGATCCGCAATCCAGTACCGTAGAGGAGGGTAAAGAGCGCTTGATCTCTCAGAGCGGCTTGATGGGTTGGTGCGTCATCCGCAATGGCAGTAGTGGCGGTAAGTGCCTGTTCTTGCCCAAGAGGACGTGGGAGACGTTTTTTACGGCGAGGTGTGGCGAGGAGGCTGGCGTTTGGGTTGGTAACGCCATGCCGCAAGCTCAAATATTTGAAAAAAGACCGTAGAGCGGAAAGACGACGCGCACGGGAGCGGGCGCGGCCATCGGCATTGCTTGCTCGCCTTGTGGGTTTTTCGCTGCGTTTGGTTTCAAAGGCAAGCCACGCTCGGAGGTCGGCCAGAGAGAGCGTATTGATGTGAGAGAGTGTTAAGTCTCCGCCCAAATGTTCCCTCAGGAATGCAAAGGCAAGGCTCATATCGTGTTGATAAGCTTCAACGGTTCGCGGGCTGGACCGTCGTTCATGGGTGAGCCATTCAAGCCAGTCGGAGAGGACAGTTTCGGCAGATTTGTAGTTGGTCATCTCTCATAGCCTTTCAGAGAGAAGCAGGTTACACAGAGATTATGGCACCAGAAAGCCTTTCTCCACAGAAACAATCGGCCGAAACTTCTACGCATGATGGGCGTGTACCGGTATTATTGCCGCTCCCTTTTGCTGGGGCGTTTGAATATCGCAGCGCTATTGCGCTCCAACCCGGGGATATTGTGGCGGTCCCTCTAGGGCGCAAAGACATTTTTGGGTGCGTGTGGGACCGGGAGAGTACGGTGCCCGCTTATATGGCGCCGCCTCCCGTTCCACCTGTTGATGAGCGGCGTTTAAAAGCTGTTATTCGGCGCCTAGAGGTTAGTCCTTTACCGCAAGAGTTGCGGCATTTTATTGAATGGGTGGCAGCTTATACGTTGAACCCACCCGGTCTTGTGTTGGCCACGGCGCTGAAGCAGCACACACAGTTGCCACCGAAAGCTGTTCTGGGTTGGGTGCGAACCGAGAAGCCGTTAGACGATGTGCGTCTGACGCCGGCACGGCAGTCTGTGTTGCAGGCATTAGGTGATGAGCCCAAATCATCGGCGTTTTTAGCGCATGAAAGTGGCGCGAGTACTGCTGTCATCCGTGGTTTGGCTGGTACTGGTTTAATAAAAGAAGCACCGCTTCTTGTGGGGGCGAGTTTTGCTCAGCCCCAGCCCGATTACGCTCAGCCTGCTCTCTCAGAAGAGCAAGAGGATGTTGCGCAAGGCCTTCGATCAATGGTCGCGGCCGAGCGTTTTGCAGTGGCGCTGCTAGAAGGCGTGACGGGCTCGGGTAAAACAGAAGTCTATTTTGAAGCGGTCGCTGCGTGTTTAAGGCGTGGGCAGCAAGTATTGGTTTTGTTGCCTGAGATTGCACTTTCGACGCAATGGACGGAGCGTTTTAGGCGTCGTTTTGGAGTTGAGCCTGCTCTTTGGCATTCAGATTTAGGACAAAAACGACGGCGTGAAACATGGTTCGCGGTTGCGGACGGCTCAGCCAAAGTTATTGTCGGGGCGCGTTCTGCACTTTTTCTACCCTTCGAAACGATCGGCTTGATTATCGTCGATGAAGAGCACGAAACATCGTTCAAGCAAGAAGAAGGTGTGACTTATCACGGGCGTGATATGGCCGTAGTGCGGGGAAGGCTCGCAAAAGCTCCTGTAATTCTCGTATCTGCCACGCCGTCTTTGGAGACATTGGCAAATGCGGACGCTGGGCGTTACCAACATTTTTTACTGACATCGCGCCATGGTGGAGCGCGTTTGCCTGATGTGACATTGTTGGACCTGCGGGAAGATGGCCCTGCACGGGGCCAGTTTTTGGCTCCCCGTTTATGTGCTGCGGTGAAAGAAACGTTGGCTGCCGGAGAGCAGGCAATGTTGTTTTTAAATCGCAGGGGGTACGCGCCTTTAACGCTCTGCCGGACTTGCGGACACCGTATGCAATGCCCGAATTGCACGACTTGGTTGGTGGAGCACCGTAACAGGGGTATTTTGACCTGCCACCATTGCGGGCACACAGAAGGCATCCCTAAAGCCTGCCCAGAATGTGACCAAGAGGAAACGCTTGTCCCAATTGGGCCGGGTATTGAGCGGGTTACTGAAGAGGCGCGCGAGTGCTTCCCCGAGGCCCGCATTTTAGTGATGTCGTCTGATACGCTGGGATCGGCCGCGGCGACAGCCGAGGCTGTCAGGCAGATCACCGACGGGGAAGTCGATCTTATTATTGGCACGCAAATCGTTGCAAAAGGCTGGCATTTTCCTCGGCTCACGCTGGTGGGCGTTGTTGACTCAGATTTGGGGCTTGGCGGTGGGGATTTGCGTGCTGCAGAGCGGACAGTGCAGCTGTTGCATCAGGTGGCGGGGCGTGCAGGGCGGGCGGAACGACCGGGGCGTGTGATGTTGCAGAGTTATGTGACCGAGCACCCTGTCATGGAAGCATTGGCATCCCACGATTTTGAGGCTTTTATCGAGCAGGAGACTGCACAAAGAAAACCCGGATTTTGGCCGCCTTATGGTCGGTTGGCCGCGCTTATCATCAGTGCGGATAGTGCTGATGTCGCTGATAGTCTGGCGGCGGAGATTGCGCGAGTGGCACCCGTACAAGATGGGATACAGGTACTCGGCCCTGCGCCAGCCCCGCTATCGGTGTTGCGTGGTCGGCATCGTAGGCGTTTATTGCTTAAAACGGTACGAGGGTTGGCAGTGCAGCCAATAGTCAGGGACTGGCTTTCGCATGTAAAAGCCAAAGGAAGCGCACGTATTGATGTAGATGTTGATCCAATATCTTTTCTATGAAAAGTGGGTAGTGATTTTCATTCAAGAAGCAGGATAATTCGTGGCAGAAGAAAATTTCTCAATATTATATGATGATGAATATTTAAGAGTAATATTCCTCGAGCAACAAAACGATAGTGAAGAACTGATCATCACGTTCGGTGACATGTTGCTGGAAGCAGAGGGCTATTGTTTTTTTGCGGACCAGCCCCTGAGAAAATTAAAATTACCAGCTATTGGTTTCGTCGCGAAAGAAGCACACTGGTACCAAAGGTCCAGTATGCTTGCCGCGTACAAAGTTCTGATGCCTATTTTAGGCCGATTTTCGAGGCGGATATTATATGGCGGCTCGATGGGGGGCTATGCTGCAATTAAATTTTCAAAGTTATTCGCTGCCACGCATGTTGTAGCTTTATGCCCGCAATGGTCGATTGATCCCGCAGAGTGGCCCGGAGAGCGGGCGGGTTGGGAAGGAAATTTCAGAGACTACATGCAGGGTATGAGCATAAAGGCGGGAGACGTAAGCGGAGATATTTACGTTTTTGCGGATAAGTTTGACGATAAAGACTACAGGCATTTTCTTAAGATTAAAAACAGTATTGAGCATGTAAATTTTGTGAACGTGCCTTTCGTTGGGCATCATGTCACGGGAGTTTTTGCGGGGACATCAAAGCTGCAGGAACTCTTTTACGCATGTGTGCATAATGACATGCGTGGATTATATCTTTTCTCCAGAAATACGAGAAAATCTTTCGGGTTTTACGCGCAAACTGTGCAGGATTTGGCGCTGGACAGATTTCCCAATCTTTTCGCAAAGAGAGTCATAGCTGCACAATCTCCTCCTGGAGATGTGAGAATTATGCCGAGAATTGTCTCCACTCTGAAAGACAATAATCGTGCTGGGGATGCCAAGGCGCTTGTTGATCAATGCTTTGCACGCCTTCATATTGGGGACAGTAGCGGCTCTATTCTTTTCATGTGCTGTGTCATGTTGGGCGAACAGGTTGTGCTCGTTGGGCATCACGGAAAGGTTTTGGCCGTTGATGCGGCGTCGATGCAGCTCATCCAAGTGGATAGGAATTACCGGCCGTGGCAATTTCCGATTTGTGTAAATCCGCGTACGACCAGCCATATTTGGACTGTGCAGCAAGAGCATAATATCTTTATCGAAGCATGTGTAGATAATCGTGTCTCGCTTGGTTATGGGACTACACGAGACACGATGGAAGTTGAGCTGGTACCGTGTGGTCGTGGCCGCTTTTACGTCAAAAATAACGGGCGTTATTTATGCGCTGAGCCTGATGGCAAGGTGATGCTGAACAGGCCTGATGCTCAGGCATGGGAAGAATTCCATTTTGAAGTTTTGTGAGTGAAAAGACTTCAAAATCGAAAGACCAAGTTGGATTGAAAGAAGCTGCCGGATTGCGCGCCGGCATTACGCAGACTTTTCGATGCTATAAAGCGGCTCCCGATCAGCTGCCACGAGAGGTGGCGTTGTAACTGGATGAGCAAGAATGCTTGCGGTTCAATACCTACAAAACTCCCCCGATAGGGGCGGCTGAATGTATAGGGGCCCGACGAATTCCAAATGGCCCCATTCGTGTTCTGGCGCCACAGCAAAGGAGCTTTGAATCTTAGTGAGGCATAACCGTGGAAAGGGGTGACGCTTAGAATCGGCGATAGAGAAATGAGGTTAGATGGTTGCAGGTAGGTCGTCGGGTCTGAGTAAGTGGTTTGTGGGTTGTAGGGGGCCATGTAGGTATGCAGGGTAGCGTTGCGGCCGTCTGCACCGCCGGAGTAAATATCGAACTGCGCACCGATGAAAGGATGCAGCGGGGATGGCGTGTGTCGGTAGCCTGCAATGGTGTTCACTGCCCATGCGGAGACGTCGCTTGAGGAAGCGTTGTTTGTGTATTGGAACGACCCGTTTTGATATAATCCGCCGAACGAAAACTCGAAATCGCGTGCAGTGCCGTACCACCTGATGCCGATGTTCCCGCGGGATGAAGTCCCTTTGAGAGTGGTGCTGCCGCTGGGGACGACCGAGAGGCTGGATGAATAGCGGTAGCGTAGATAAAAAAGATCGAAAAAAGAGTGTACCTGCTGTGAGCCGATGACAAAATCGGGGACAACTGCCGTTCCGTCGACGCCGTAAAGTCGGGTTCCCCAATCAATCGTGTCATGAAACATTTTGTTCGGGTTGGTATTGGTTTGCGCAAAGCTGAAGGTATCTAGGCGAAACTTCGGCCAGATGGCGTATATCCGCGTTCCATTCCATGAGAGAGGCACGTTTGGCGTTTCACGATTGTAGAGAATGTAAGAAGGTGCATCGAGAAATTGTTGTCGGCCAACCATTAAGCCTGTCCGTGCCCCTGCGATTTGGCCTTTGAGTTCAATGAAAGCCTGTTGCGCGTCCAGCCTTTTGCGGAATGTTTGATTATAGCCAAAGCCTGCCCAGCCAGCAGCATTTGCGTTAATAAGTTGTCCGAAGAGCCGTACATGCTCACCGAGATGTAAATCTGCCCCGAGAAGGTTTCTCACAGTAAAACGACCAGAGGGAGCGTTACCGCGCCGCCCTAGAAATGGCGCTTCTTCAGACCAGTTTCTGAGGCGTGTTTCGCCGGAGAGCGTGAGCCATATGGTTTTTTGCTGGTTTAGCGCAATAAATTTGAGCGGATCTAGGAGGTCGTCTGATTTTTTAGGGTTGCGTAAATCGCTCCAGTCTTCCGCCCATGGTGCGGGTCCGTACCGCCCGACGGGGCCAAAACCCGCAGCTTCCCCCGTTCCATAGTTAAATGCGCCCCACGGGCCTTGATGCCCCTGTGTTGGTCTGTTGAGGCTGGGCAGAATGCGGCGCGGTTGGCCGACGAAGCCAGCATGCGGGTAGTTCTGAATTGGCGGTCGGTCGGTTGTGGTGTTGGGGATCCACGTCGGATTTTTGGGCGTCCGTGCGTCCGGTGTTGCTTCTGGTGCGGCTGAAAGCGGCGTTGTCGTAAAGCTGTACAGGCCTGCAAACGCGAAAAAGAGCGCTGGGAAACCACTATTTAAGCGCGCAGGCACCATGGAATTGCTTTCTGGAAATTTACGGATAGAAAGAATAAAAGGATTAGATACGATTATAGAATGTTGTTAATATTTCAATACGAAATTCATTCTACATACAGGAAAAAATATAACGAATTGTCTCGGTTATTAATCGTCGGTTTCTATGGTGGTTTGCCCGCGTGTCATATCTCTGATTTCGTTAATAATCTGCGTGCTTGATGCCTCGGGGACCATAAGAGAGAGCTGCACACCGTCGGCATCAAAATGTGAGTCTGATTTTTGTGCGCCCCAAACTGGAAAACGCGCTTCGAGGCGTGAAAAAATAGAAAATGGGCAGTGAAAATGTAGCGCCTGCATTGCTATGCGCTCTGATTTTGGGGCGTCGCGTAAGCATGCGGCAGCTGCGCCACCATAAGCGCGGACCAGCCCTCCGGCACCAAGTTTGACGCCGCCAAACCATCTGATGACCACGACCATAACATTGTCAAAATCCTGCCCATCAATAGCGGCCAGTATAGGTCTGCCCGCAGTGCCTGACGGCTCCCCGTCATCGTTGGAACGGAAACGTTGGCCAATGCGGAAGGCCCAGCAGTTATGTGTTGCGTCTGGGACGGCTACATTTTGGAGAAACGTCATTGCTGCAGCTTCATCGGCAACAGGGGCGGCGTAAGTTTGAAAAACGCTGTTGCGAATAACGGAGCGGTGCTCGGCAGGGGCGTCTAGGGTCCAAGTCATGGCGAGAGTGTGCCTGAACTGAGGTTGGATGTCAGGAGGGCGCATCAAGTTGGGGTAGGGAACCCGTCAAAAACCTGTTATGGGGAACGTATGTTTTCCGAACTGATCGGTCGGCTTAACGCCGCCTGCGCCCGCCATGCCGTTACGGTTGTTGCTTTGTTCGTTCTGCTCGTTGCGGGTAGTGCCGGGCTAAGTGTCTTACGCTTGAGTGTGACGACTGACACCGGCAAAATGTTTGCAGATTCCTTGCCGTGGAAGCAGCGTGTGGCAGAGATGGATCGTCTCTTTCCGCAAGATTCGGACCAGCTCGTTGCTATTTTGGATAGCCGTATCCCGGAAGCCGGGCGTGAAGCAGCCCGCCAGTTGGCGCAGCTTTTGAAGCAGGACCATGCTCATTTCCGGACTGTGACCTTGCCGGAAGACAATGCTTTTTATCAGCAAAATGGTTTGCTCTTCTTAGAGCGCAAAGATTTGGAGCCGTTGCTGGATTCTGTTGTTTCTGCACAGCCGTTCTTAGGCACTCTAGCGGCTGATCCGTCTGCTCGCGGTTTGTACGGAGCTTTGGGTCTTGTGGGAGATGGCATTAAAGCAGGGCAGGGCGTTCCGGCTGGTTTTAACGCGGCGCTTGATGGTTTTGCGTCAGCTCTTGAGCAGGGAGCCCAAGGACACCCACAAGATCTGTCTTGGCAAAATCTCTTACTCGGCCAGTTGTCCAATTTAGGGACGGGGCATGAGTTCGTCGTTACGCAGCCGGTTATGGACTACAATGCTTTTGAGCCGGGCGAGGCCGCGACGACGGCTATGCGGCAGGCAATTGACAGTCTACCCCTCGTAAAAGCCGGGCAGGTTACCGGTTTGATTACGGGTGAAGTGAAACTCGGTGATGAAGAGTTCTCGACAGTCGCCCATGGGATGATTACCGGGCTGGTTATCTCTTTGACCTTAGTTGCCGTCTGGCTAATCCTCGCGGTTCGTTCCCCGCGGTATATCGTGTCTATTCTGCTGACATTGGTTGTCGGTCTGGCCTTAACGACAGGCCTGGCTGCGTTGGCCGTTGGTGAATTGAACATGATTTCCGTTGCGTTTGCCGTTCTGTTTGTCGGTATCGCGGTCGATTTTGCCATCCAATACTGCGTCAGGCTGCGTGGCCAGCGAGGGGAACAGGGACAAGTACTCAATTTGGGCGATGCTATTCGTCTGACAGGAGAGGAAAGCGGAGCGCAAATACTTGTTGCGTCGCTTGCGACGGCTGCTGGATTCCTTGCTTTTACGCCCACGCATTTCGTAGGCGTTGCGCAGTTGGGGCTCATTGCCGGTCTGGGAATGCTGATTGCATTTCTGTGTACTATGACGCTGCTTCCCGCATTATTAAGTCTGTTTCGTGCGCGGCTTGGACATGGAGAGCCGGGCATTGTTGCGCTGCGGCCTGCGGATGCTTTTCTAAGGCACAAGCGCGTGCGGGTTATGAGTGTTTTTGGGTTGTTAGGCGTTGTCGGTGTAGCGTTGATGCCGCTGCTGAAGTTTGATGCTGACCCGTTGCACACAAAAAACCCAAATACTGAGGGGATGAGAGCCCTTCATGTGCTTGAGGCAAATCCACTGACAACGCCTTATGGTGCACAAACGCTGGCAGCAAACGTTACGCAAGCCGCAAAAATGGCTGATGCGTTTTCCAAGCTTTCCAGTGTGCACGATGTGTTGTGGTTGGGAGCTTTGGTGCCTGAAGATCAGGAAACGAAGCGGGGCATGATTGCTGATACCGCCTCCATCTTGCTGCCGACACTAGACGTCAAGCCGATGGCCGCCCCCGATGCGCAAGCCTTGCGTGACGCTGCCGCTCAAGCCGCAGTAAAACTAGATGCGGTACAAAGTAAATTGTCGCCTGCATTAGAGCGAGTAAGGCAAGCCTTAAAGCGTCTTGCAACCGCGCCGGATGCTGTTGTCTTGGGGACGAGCCACGCTCTCACGCGGTTCTTGCCGGACCAGCTTGAAACGTTGAAAACTATTTTGCATCCTTCGGTCGTAACGATGGCATCAATCCCGGATGATATCCGTCGGGATTATGTTTTGCCTGATGGCCGGGCACGCCTCACAATTCACCCAAACGGCCACATGTCAGAAACGGCAGTATTACACCGTTTCGTTGAACAGCTGACAACAGTCACACCGAATCTTGCCGGCCCAGCAATAGAAATTACGGGTAGTGCACAAACGATCGTTACGGCGTTTCTGGTAGCTGCGGTGTCGGCGCTGGTGATGATTGCGGGTATTTTGCTGGTAGTGCTTCGTCGCGTTTTGGATGCGGCGCTGGTGATGGCACCTCTGCTGATGTCCGCCTTACTGACAGTTATTGTGATTGTTACGGTACCCGAAACATTGAATTATGCGAACATTATCGCTCTGCCGCTGCTGCTGGGCGTTGGTGTCTCGTTTAATATCTATTTCGTGATGAATTGGCGGAGGGGATTAAAGCGGCCTCTGTCATCACCAACAGCGCGTGCGGTGTTGTTCTCTGCGCTGACAACGGCAACGGCCTTTGGTTCTTTGGCGGCGTCAGAGCATCCCGGAACGGCTAGTATGGGCCGACTTTTATTAATGTCCTTAGCGTGCACCTTGGCATGCACGCTCGTGCTGGTTCCCGCTTTGCTGCCGAAACGTTCGGAAGATTCTGTCTAGAACGTGGCGCGTACTTAAAAGAAATGTGACAATTACGTCACATTTTAAAATATGGTTTCAAACTCTGTTACAATTGATATTGCGCAACCGAGACAAATACCCTCACTTTGAAACGTAATTTAGACTTAACGTTACAAAGTGAGGGGCCTTCATGGGCGCAGCATCGTTTCTGATAGGTACAAAATCAGTCAAAGGGAGCAATAAACGCGCCCAGTCGATTCTTAAAGTTATGCGGCCCTGTTGCAAAGTTGAATATCCTGTTGGAAACCGCTGCTGGTGTTAGCGTAGGGCTTGGTAGATGAGTGATTTCAAGGGGCGTCATTTTGGTGGTGAGGTGATCTTATGGGCAGTGCGTTGGTATTGCCGCTATGGGATTAGTTACCGGGATTTGGAAACCATGTTGGCCGAGCGGGGCGTGAGCGTTGATCATTCAACGATCTACCGCTGGGTTCAGAGGTATGCACCCGAGATGGAAAAACGCTTACGCTGGTATTGGAAACGCCCGGGGTTTTCCAGCAGCTGGCGAGTTGATGAGACCTACATTAAGGTCAAAGGGTCCTGTTGCAAAGTTCCGTTTGAGTCGCGTGGGCCCCGTTTGTTTCGATTTTCAAACGGTTTAGAAGCTGAATTGTCTTTCGATGAGACGCACTTCTCCCATGATACCCTCTTGGAACTGGAAGAGTTCAGCTTGTCCCTTTTTCAAGGCGCGCATAACCTCGAAGCCTTTAATGGTTGCATAAGCGGTTTTCAGGCTTTTGAAGCCACGGACCGGCCTGATCAGTTGTTTGAGTTTGCCGTGATCGGCCTCAATCACGTTGTTTAGATACTTCACCTGACGGTGTTTCACCGTGTCGGGAAGCTTACCGTTCTTCTTGAGTTCGTTGATGGCCCGACCATAGGTCGGCGCCTTGTCTGTGTTGATTGTTTCGGGCTTCTCCCACCTTCTCAGTCCGTTTAAAGCTTTGCTCAAAAAGCGTTTGGCCGCCTTGGCGCTCCGGGTCGGCGAGAGGAAGAAATCGATCGTATCACCGCCTTTGCCGATAGCACGGTACAGATAGGTCCATTTTCCTTTGACCTTAATGTAGGTCTCATCAACTCGCCAGCTGCTGGAAAACCCCGGGCGTTTCCAATACCAGCGTAAGCGTTTTTCCATCTCGGGTGCATACCTCTGAACCCAGCGGTAGATCGTTGAATGATCAACGCTCACGCCCCGCTCGGCCAACATGGTTTCCAAATCCCGGTAACTAATCCCATAGCGGCAATACCAACGCACTGCCCATAAGATCACCTCACCACCAAAATGACGCCCCTTGAAATCACTCATCTACCAAGCCCTACGCTAACACCAGCAGCGGTTTCCAACAGGATATTCAACTTTGCAACAGGGCCAGCGGTCGTCCGTTCTGATCGGTGACCGCATGGAGTTTGGTATTCATACCGCCTTTGGTGCGACCAATCAGGCGGCCTGGATCCCCTTTTTTAGCCGCAGGCTCGAAGCCGTGCGGTGCACCTTGAGATATGTCGCATCAATCATAATCGTCTGAGGCTCGGCTTTCGCAGCAGACAGGCCATCCATCATCCGCATGAAAATGCCCATGTCACCCCAACGTTTCCAGCGGTTGTAGAGCGTTTTGTGCGGACCGTATTCCCGGGGCGCATCACGCCAGCGCATACCATTGCGGTTCACAAAAATGATGCCGCTCAGCACACGGCGGTCATCAACGCGAGGTTTGCCGTGGCTCTTGGGAAAGAACGGCCGCAGACGCTCCATCTGTTCGTCCGTCAGCCAAAACAGGTCGCTCATCTTCAGTCTCCTCACAGAGCCTGAATCAGATTTCCGCAATCAAATCAATGGGTCCTGAGCCTAAGGCGACCTTTGCGGGGATGGATCCTACATGAGCCAGGAATACACGGATGACACCCGAGAATGGGTCGTGTGAGAGCGTTAGCGACGAGTAGAACGAAGAGATTACGGGCGGCAGAAAAGACAGAGACGCACCTATGACATCCCCGGGAGAGCGGAATTTCTGCATGAGTAATTTCAAGGAGCGCCATTTATACATTTCCTGGTCGATGATGGATAGATTGAGCTTCTATAAATCATAAATAGCATATAAAATGATTGATATAACCTCGGAGTATCAATCATGCGTTGGAATTGGCAGCGTCCAGACTGGCCGTATTTTCAGTTTGAGAAAAACTGCTTACGCGACGCTGAAACGAAATTCCTAAGAGGTTCAGGCGTCATCGTGGGTGCGATGCGGCATCTGGACACAGGTGCTAATCAACAGCTTGTCGTTCAGCTTATGTCGGAAGAAATGGTCGAAAGTTCCGCCATTGAAGGCGAAGTGCTTGATCGGGCGAGTGTACAATCATCGATTGCTCGACACTTAGGGTTCGCTGCAGACAGGCGCCGTTCTACTCCCGCTGAAGCGGGGGCCGCTGAATTGATGGTCGATCTCTATCAGCATTATGCCGCACCGCTAACAGACCAAAGCCTCTTTGTTTGGCATACCATGCTTATGAACGGTCGGCGTGATTTAGATGTTATTGGGGGATATAGGGTGCATGCTGAGGCAATGCAGATTATCTCAGGCCCTATCCATGAACCGCACATCCATTTTGAGGCTCCACCCTCTCAGGCTGTGCCTTCTGAGATGGCACAGCTTATTCTTTGGTTTAATAGAACTGCCCCAAGCGGGTCTGAACCTTTGCCCGCTATAGAGCGCGCTGCAATTGCCCATCTTTGGTTTGAAACCATCCATCCTTTTGAAGATGGCAATGGCCGTATCGGACGGGCTATTGCTGAAAAAGCATTAACGCAAAGCCTTGAAGGACCGACGCTGACCGCTTTGGCAGCAACCATCAATACGCACAAAAAAGCATATTATCTCGAACTGAACCGAGCCAGTAAAACAAATCAGATAGAGGATTGGATGATCTGGTTTTCTCAGATTGTTCTGGAAGCACAGTCTCAAACGCTTCAGAAAATCCAGTTTTTGATTGAGAAGACGCGTTTTCTGGACCGCTGGCGGGGGAAGCTCAATGCTCGACAGGAAAAAGCCATTCTTCGCATGTTGGCAGAAGGTTCTGATGGATTCCAGGGTGGTCTGAGTGCTCAAAATTACCGGTCCATTACAGGAGCGACATCAGCCACTGCCACGCGTGATCTGGCAAATCTCGTGTCACTTGGTGCTCTTCATCGAACGGGAGACAATAAATATGCTCGGTATTCGCTTTGTCTCGGATAACTGAGGATTTCATCCCTCGCAAATGTACCAAGCGCGACATAAAGTTTTTTAGTTGTCCGGTTGAAACGAGGTAACGACACGAAAGTGCCTTTCATGCCCTGCCATAGGTTGTGGGCATCTGTCCGCAAACGGACGGTAGTGTTGAAAAACTCATGGTGAAGAGTGAAAGTGTTGCCAAGCCACTGCGACATAAGTGCAGCGTTCTTGCCTCAGGTGGATAAGGCAGGTGCTCCGGGAGGTATCAGTTTCGCCATTTTGCGGAGATTTTGGGCAGCTGCGGCGAGATGGAACTCATCACGGGCGCCGTTTGGTCCTCTGAGCCTCAACCGATCGATCTTCAAAATGCGCTTGAGGTGAGCAAACAGCATTTCGACCTTCTTTCGTTCTCGTCTGGAGATAATATAGGCGTCGGTTAAAGCAATGTCGCGAGCCATATCACGAGCGCCTTCATGAATAGAGCGCAGAACCTTACGACTGGGCTGATTGGGGCAGCATTTTGGTTTGAAAGTGCATACGTCGCACGCCCGTTTCGACGAACGGTATCGAAGTAGGTTGTCGGGAGGGGCATTTGGTTGATCCGAGTTGATCTTGCGCCACTGCTGCTTCAGTTCCTGTCCTCCAGGACAGATGTAAAGATCGTGCACGTGGTTATATGTGAAATCTCGACGTTCGAAAGTCCCGTCCTGCCGGGCAGATTTGTCGAAGACCGGAATGTGAGGTTCGATGCCACGCTCATGAACCAGCCACGCAAGATTTTCAGCGGATCCATAAGCTGTATCCGCAGCTAGCCTTTCGGGCCATATTCCAAATGTTTCCTGCGTACGCTCTATCATTCTGCGTTGCGCCGTAACCTCGGCCTGCCGGATGGCTGTCGTGGTTTCTACATCCATGATGACAGCCGATTTAAGGTCAATGAGGTAATTTGTGCAGTAGGCATAATAAGCAAGGCCACCGCTTGCAGCGTTCCAACGTGCCGCAGGATCGACCGGAGAGATATATTTTGGTTGCACGGGCGTAGCAGACCCAAATGCAGCATCGTCCAGCACCGAGAAATACTCCCGCACAGCACGTTGGGCTGCCTCGATCGGCAGTTTATCTGAGCTCGGAACACTGCGCTGCCGATTGGCGTCGGCCTTGATCGTGCTGGCATCGACCGCAAAGCCTTCACCGCCCACCAGTCCCTTGGCGATGCACTGCCGGACGGTCATTTCGAACATTTGGCGAAGCAGATCACTCTCCCGGAAGCGTCCGTGCCGGTTTTTCGAGAATGTCGAATGATCAGGCACAGGGCCGTTGAGACCAAGGCCACAGAACCACCGGTAGGCCAGGTTAAGATGGACTTCTTCACATAGCCGTCGCTCGGATCGAATCCCCATCACGTAGCCAACGATTAACATCCTGATGATCAGCTCGGGATCGATTGAAGGCCGCCCCGTACCGCTGTAAAATGGGCGCAGATGCTCACGCAGGCCGTCCAGATCGACAAAGCGATCAATTGAACGCAAAAGGTGGCCAGTTGGCACATGATCTTCAAGACGGAACTCGTAGAACAGTGCCTCCTGTATTTTCGTTCGATCACCCATCATTTGCAGGTTCCTCCTGCGAAGAGTGAATCAGTGCTTCACTCTCAAAGCAAGCGCGACTTTTTCAACACTATCGGGGGGGAGCGGAAAGTCGGCTTTGTGTTGAAAAATATGATAAAGCGGCCATTACTTTCTCCTTACGAAGCGGTACTTCAAATAATCAGAGGCGTATTATTGCGGTCCAAGCGTAGAGGATGGATAGAGTTTGCAGGCAACACGCTCCATGGTTAGCCCCTGCACAATAATCGCAAACACCACTACGCCGTAACATACAAGCAACAGCAGTTCCCGCAAGTCTCCTCAGTAGCTAGAGTTAGCGAGCATGCTTATTGTCAAATACATCATTTGATACGCCAAATCGCGCCGGATTTCTTAAATCTAATATTACAGTATTTCGGAAAAAAGCGGCGTTTCCTACCAAGCCATCGGCTCCACGATTAGCGATTATAGCTGTTAATGTATCAGAGCTCTGATAAACCACCGGCCTACGCAAAGTGAGAGGGCCAATAGTAACACCGTCTTTCATGATTTTGCTTATTATTACAGAGCTTTTACCGTCGTATGTTCTACCTGGAATGGCTGGTGCGTCTGTGGTGTCGCCATCCATATTATCCCAAATAACAGGCGGTATTTGGATCGTAAAAAGGCTAGATCCCGTATCGAACAACAGGTGGTCGAATTTTTTACTTCCCACACTTCCGGTTATGTAAAATGTATTATTGTGTATTGAAGCGTTATACCAAGAGAAATTCTTAATATCGTCTTTATTATTTAAAACACATACACTCTTATGAGGGTAGTCTATGACTACAATCTGACCTATAAATGTATTCAACCCCATATGACCAATTATTTTATTGCCGTCAATTTTTTTCTTACGATCGATACTTAAAGAGAGGTGACTCCAACACATGTTCCCAGCACAAAAAGAATTTGGCACCAAACTATTTTGGCTTTCAGTGGTCAGATGCAGTTGATCAGAAATTTCTCCATACAACGCAGTGGTAGGCGCGCCTGTATCAAACTGCAAAAACCCGTTCTGCCCGTTGACTTCCACAGGTACCATCATACCTACCTTCTTTCGAGGTAATTCAGGTGAGCCAACCCAAATAAAATCGAAGCATCCAGAGTGCGGATTAGATGCGGGCTCCGCACAAAATGCCGAACCAAAGAAGAAAAGACAACAAAAAGCGCTTAATGCTCTTAGAGAATTTCTTTTACGTGTCGTATAATCGCACATAAAAAACACTCCGTATATGACAATGCCTAAAAAAATCGGTAAGTCAGACCCTGTGTATACCATACGTTGCGACGGAGACGATCGGCCTTCGATATCAGGTGTCGAACATGACGGCTGCACTCTTCATAGTAACTACTATGACTGAATATCGAAAAGCTACAAACGACTTGGATATGCGGATCGAAACGAACGTCGACATAAAAATGCTGACACGATTATATTAGAGCCCTTTTGGAAATTGAGCGTGAGGGGTTTTCAGGGCGTGTTGCGTGTGATTCACACTCAGGATGTGGACGCCAGAACAACGAGGCCGAATGGCCGACATTACGCGCAAGACGAAGCGTTATCCGTCTGATCTGACCGATGAGGAATGGGAGCGCATAGCGCCCCTGATGCAGCCTTTGAACCGGCGTGGCCAGAAACGAGCAACCGATTTTCGTGAGATCATCAATGCGCTGCGTTATCTTGTCCGCTCAGGCTGTGGCTGGGAAATGTTGCCTGTTCATTTTGGTCCATGACAGACCGTCTATTGGTGGTTCCGCAGGCTAATGCGCCGTTTCCTGTTTCAGGCCATCCATGATGTCTGCCTGATGCTCGATCGTGAAGCCGCGGGACGCGAGGCGGTGATGTCACGTTAAGTTTTTTGCTTTGAGTATGGTTGTATTTGAGTGATTTCAAGCTGCGAGAATAGTCGCGGTATTCCACTGGGCGAAGGCTTGGAGACGATGAATGTGTCGTGAGAGGGCATTGGCGTTAGCGGCGGGTGGAACGAAGAAATTACGGACGGCCGAGAAGACGGTTGATAACCTCTGACATCCGCCCGGAGATCGGAACCTTTGCATGGTTCGTTCCCGTTTTCGTAGCGGTAAATGGCTGTTTTCCGCTCGATTGTTCAGCCCTTTGCGGGAAAGGTGTTGGACAGACAAACGGAGTTCGCACTTTGCGGCCCCGTAAGAGCTCAGCTTATCCGTGATCATCCGCCGGGGGCGCACGCCCTGCTTGCGCAAAAGGCGGGTCAGGAGACGCTTGGCGGCCTTGGTATTGCGCCGTTTTTGTAAGATTTCATCAAGGATATAGCCGTCTTGATCGACAGCGCGCCAGAGCCAATGAATGTGGCCGCTAATCACAACCCTGACTTCGTCCAAATGCCATATATCGTCCCGTTTGGCGCGCTTGCGGCGCAGACTGCGAACATACTCCGCACCAAACTTCAAGCTCCAGCGACGGATCGTCTCATACGATACCACAATCCCACGTTCGAGCAGCATTTCCTCAACCAAACGAAAGCTCAATGGAAAACGGAAATACAACCATACCGCATGGGCTATCAATTCACGCGGGAAACGATGCCGTTTGTAACTCACTGGTATTTTACTCATCGGCGTCATCTAGTAGGCCAAACTTAACGTGACATCACCGCGCGCCTTGAAAAAGGGACAAGCTGAACTCTTCCAGTTCCAAGAGGGTATCATGGGAGAAGTGCGTCTCATCGAAAGACAATTCAGCTTCTAAACCGTTTGAAAATCGAAACAAACGGGGCCCACGCGACTCAAGCGGAACTTTGCAACAGAGCCCGAGGAAGTCGGTGATCAGATCTCGCGTCGAGTGCGTCTTTGTCGATCAAAAATCACAGACAAGACTGTTCGTCCGAACCGTAGGCTTAACGCGGGCCACTATGAGGATCAATCTGGCCAATATCGTCCATAATATACGCCGCTTTCTCCTCCTGGAGCGGATTAACGCCGCTCCGTAGCAATCCAGAGTATGAAACCCTCTCTTTGCTCATAGCGCAGGGTGAAAACCAGGACCAGGAACCAGAAATCAAGCGCAAGGGCAATCAATCAAGGGCTCTTCGAACTCTCCAATATCAATCAGAAATATTATGTAAATTAATCATACTAAAAGAGGAATCTTTTTGTATTTTACTTACATATTCCTTTATAAAATCCAAATTTGATTTATTGTTAAAATTTTTATTTATAGAAATCCACCCTTCTACATCCCAAATAAATTCACTATTAGGGTTTTTAATTAAAGAAGACCACCCTTCTACAGGGTCACTTGGTATTGTATTTTTCTTTTGATCCTTTGAAGGTTCATTCGATAATGACGATGGCGTCATGATTAAAATACCAATTAACTCACCTTCACGTGAAAATATGTGGCCTACAAAGGTGCCAGTACACGCTCTTGGTTGACATGTTGTAAAAAATACGAGGTTGGAAATTTCCTGATTTCCATTTGGAACAGTATTTGATTCAGCCGTTTTGTACGTAGATATAATATCATCCTTAGTATTTTTAGATAATTTATTAAAATATTTTGTCTTTAACAAACTAAATTCGCAATCTTCGAAATTATTAAATTCTCTACCTGAAAGAGACATCAGATAATCATTAAAATCTATATCTGAAAACCTTGTATTCGAATTATACCTTCTAGGTATTTTTGCTATTATTTCTTTTAAATAGATAGGAAGCCTTTTGTAATTACAAAAATTATTTATTGGAATATCTTTTTCTTTGGAAAAAGATATTCCAATATTTATTATATTGAATATTATCATAAATATAATTAAGGAAATTTTAATCATTCTACTTCTCCAATAAGTCTCATATTATCTAATGATCGGGAATCTTTAATTCCAAAAATTTGTTTTATATTATCTGATGCTTTTTTATAGTTTTCTGTACGGAAACTAGTCCATAATTGGTGCTTTGCTACAGCTGCGTTATCCTTACTAGAAACATAACCAAATCCTGACCCCATAACATATGACAAATCTATAATTGCAGTTTGTGCCTTAGGCTCAGGACTCATAGCCAGAAGATGACGATTGCGGCGAGGCAGACGGCTGAGAAGAAGACGGTCGGGCATCTGTCGTAGCGGGTTGCGACCCGCCGCCAGTCCTTGAGCCTTCCGAACATAATCTCGATGCGATTGCGGCGTTTATATTTTCGCTTGTCGTATTTGACCGGTTTTCCGCGAGATTTCCGTCCTGGAATGCAGGGCTTTATACCTTTCTCTTCCAGGGCGTCCCTGAACCAGTCGGCGTCATACCCGCGATCTCCCAGCATCCATTGTGCTGCAGGAAGACTGTCCAGCAGGGCAGCGGCACCGGTATAATCGCTGATCTGCCGCGCAGTCATGAAAAAGCTCAGCGGTCGTCCGTTCTGATCGGTGACCGCATGGAGTTTGGTATTCATACCGCCTTTGGTGCGACCAATCAGGCGGCCTGGATCCCCTTTTTTAGCCGCAGGCTCGAAGCCGTGCGGTGCACCTTGAGATATGTCGCATCAATCATAATCGTCTGAGGCTCGGCTTTCGCAGCAGACAGGCCATCCATCATCCGCATGAAAATGCCCATGTCACCCCAACGTTTCCAGCGGTTGTAGAGCGTTTTGTGCGGACCGTATTCCCGGGGCGCATCACGCCAGCGCATACCATTGCGGTTCACAAAAATGATGCCGCTCAGCACACGGCGGTCATCAACGCGAGGTTTGCCGTGGCTCTTGGGAAAGAACGGCCGCAGACGCTCCATCTGTTCGTCCGTCAGCCAAAACAGGTCGCTCATCTTCAGTCTCCTCACAGAGCCTGAATCAGATTTCCGCAATCAAATCAATGGGTCCTGAGCCTAGTGCCGTGGAGGGTTTTTCCCGCTGGGATAGCTTTCGGTCCGCATTTGCTAAAAGTCTTCCATATTATCAAACCGGCTCTGTTGCAAAGTTAATTTTTCGGCTGATTTCAGGTCGACACAAAAATCCAAACACCTATAAAATCAGTAAGTTATTTTTATCGGAATCAACACGAAAACGGCAATTTTAGAACTTTGCAACAGAGCCAGTGCGTCTCATCGAGAGACAATTCGGAGTTTACACCGTTTGAAAATCAAAACACAATGGATCCACGCGACCCAAATCTAACTTTGCAACAGGGCCTTTTGGAGGCGGTTAACCAGAAAGGTATTAAATATGAAAATAAAAGAAAGTTATATTTTAAAAAACTCTTGGATAATTATTGTATCTGGAGGTATTTTCATTATTTCTGGAGAGCAGATAGTATCCGGAAAAATAAATGTGATTTTTTCTATTATTCTTATGTCTTCTTCTATTTTAATGTCATTGGGAGTACTGATATCCGCCACAAAGAGGCGAGGTTAAAGATCGGCCCTGTTGCAAAGTTCTAAAATTGCCGTTTTCGTGTTGATTCCGATAAAAATAACTTACTGATTTTATAGGTGTTTAGATTTTTGTGTCGACCTGAAATCAGCCAAAAATTCAACTTTGCAACAGAGCCGCGCCAGATTATGACCGATGCCTTTCTGGCGATTTTACCGGACTTAGAGGCCATCTATTCAGATTAATCTCGCATCGCGAATGTCCGATATTGGATAACTATGAACTGATAGCCGACCTTCGGCTTAACCCCCCTATGCGGACGTAGTATAGACCAAACGGATCGAAAAGCTAACGTAACGATAATAAGTACTTCGCCATAAGTGATTGCTTCCGTTCACTGCAACTGCGCTAGGCTCAGGACCTATTGATTTGATTGCGGAAATCTGATTCAGGCTCTGTGAGGAGACTCAAGATGAGCGACCTGTTTTGGCTGACGGACGAACAGATGGAGCGTCTGCGGCCGTTCTTTCCCAAGAGCCACGGCAAACCTCGCGTTGATGACCGCCGTGTGCTGAGCGGCATCATTTTTGTGAACCGCAATGGTATGCGCTGGCGTGATGCGCCCCGGGAATACGGTCCGCACAAAACGCTCTACAACCGCTGGAAACGTTGGGGTGACATGGGCATTTTCATGCGGATGATGGATGGCCTGTCTGCTGCGAAAGCCGAGCCTCAGACGATTATGATTGATGCGACATATCTCAAGGTGCACCGCACGGCTTCGAGCCTGCGGCTAAAAAAGGGGATCCAGGCCGCCTGATTGGTCGCACCAAAGGCGGTATGAATACCAAACTCCATGCGGTCACCGATCAGAACGGACGACCGCTGAGCTTTTTCATGACTGCGCGGCAGATCAGCGATTATACCGGTGCCGCTGCCCTGCTGGACAGTCTTCCTGCAGCACAATGGATGCTGGGAGATCGCGGGTATGACGCCGACTGGTTCAGGGACGCCCTGGAAGAGAAAGGTATAAAGCCCTGCATTCCAGGACGGAAATCTCGCGGAAAACCGGTCAAATACGACAAGCGAAAATATAAACGCCGCAATCGCATCGAGATTATGTTCGGAAGGCTCAAGGACTGGCGGCGGGTCGCAACCCGCTACGACAGATGCCCGACCGTCTTCTTCTCAGCCGTCTGCCTCGCCGCAATCGTCATCTTCTGGCTATG
Protein sequences of DBSCAN-SWA_1 >NZ_CP032485|1375106:1396207|1385402_1386110_-|WP_141492761.1|transposase|DBSCAN-SWA MSDFKGRHFGGEVILWAVRWYCRYGISYRDLETMLAERGVSVDHSTIYRWVQRYAPEMEKRLRWYWKRPGFSSSWRVDETYIKVKGKWTYLYRAIGKGGDTIDFFLSPTRSAKAAKRFLSKALNGLRRWEKPETINTDKAPTYGRAINELKKNGKLPDTVKHRQVKYLNNVIEADHGKLKQLIRPVRGFKSLKTAYATIKGFEVMRALKKGQAELFQFQEGIMGEVRLIERQFSF >NZ_CP032485|1375106:1396207|1382200_1384786_+|WP_141492760.1|DBSCAN-SWA MFSELIGRLNAACARHAVTVVALFVLLVAGSAGLSVLRLSVTTDTGKMFADSLPWKQRVAEMDRLFPQDSDQLVAILDSRIPEAGREAARQLAQLLKQDHAHFRTVTLPEDNAFYQQNGLLFLERKDLEPLLDSVVSAQPFLGTLAADPSARGLYGALGLVGDGIKAGQGVPAGFNAALDGFASALEQGAQGHPQDLSWQNLLLGQLSNLGTGHEFVVTQPVMDYNAFEPGEAATTAMRQAIDSLPLVKAGQVTGLITGEVKLGDEEFSTVAHGMITGLVISLTLVAVWLILAVRSPRYIVSILLTLVVGLALTTGLAALAVGELNMISVAFAVLFVGIAVDFAIQYCVRLRGQRGEQGQVLNLGDAIRLTGEESGAQILVASLATAAGFLAFTPTHFVGVAQLGLIAGLGMLIAFLCTMTLLPALLSLFRARLGHGEPGIVALRPADAFLRHKRVRVMSVFGLLGVVGVALMPLLKFDADPLHTKNPNTEGMRALHVLEANPLTTPYGAQTLAANVTQAAKMADAFSKLSSVHDVLWLGALVPEDQETKRGMIADTASILLPTLDVKPMAAPDAQALRDAAAQAAVKLDAVQSKLSPALERVRQALKRLATAPDAVVLGTSHALTRFLPDQLETLKTILHPSVVTMASIPDDIRRDYVLPDGRARLTIHPNGHMSETAVLHRFVEQLTTVTPNLAGPAIEITGSAQTIVTAFLVAAVSALVMIAGILLVVLRRVLDAALVMAPLLMSALLTVIVIVTVPETLNYANIIALPLLLGVGVSFNIYFVMNWRRGLKRPLSSPTARAVLFSALTTATAFGSLAASEHPGTASMGRLLLMSLACTLACTLVLVPALLPKRSEDSV >NZ_CP032485|1375106:1396207|1389933_1390860_-|WP_141492764.1|DBSCAN-SWA MCDYTTRKRNSLRALSAFCCLFFFGSAFCAEPASNPHSGCFDFIWVGSPELPRKKVGMMVPVEVNGQNGFLQFDTGAPTTALYGEISDQLHLTTESQNSLVPNSFCAGNMCWSHLSLSIDRKKKIDGNKIIGHMGLNTFIGQIVVIDYPHKSVCVLNNKDDIKNFSWYNASIHNNTFYITGSVGSKKFDHLLFDTGSSLFTIQIPPVIWDNMDGDTTDAPAIPGRTYDGKSSVIISKIMKDGVTIGPLTLRRPVVYQSSDTLTAIIANRGADGLVGNAAFFRNTVILDLRNPARFGVSNDVFDNKHAR >NZ_CP032485|1375106:1396207|1392727_1393453_-|WP_141492766.1|DBSCAN-SWA MIKISLIIFMIIFNIINIGISFSKEKDIPINNFCNYKRLPIYLKEIIAKIPRRYNSNTRFSDIDFNDYLMSLSGREFNNFEDCEFSLLKTKYFNKLSKNTKDDIISTYKTAESNTVPNGNQEISNLVFFTTCQPRACTGTFVGHIFSREGELIGILIMTPSSLSNEPSKDQKKNTIPSDPVEGWSSLIKNPNSEFIWDVEGWISINKNFNNKSNLDFIKEYVSKIQKDSSFSMINLHNISD >NZ_CP032485|1375106:1396207|1376110_1378351_+|WP_141492756.1|DBSCAN-SWA MAPESLSPQKQSAETSTHDGRVPVLLPLPFAGAFEYRSAIALQPGDIVAVPLGRKDIFGCVWDRESTVPAYMAPPPVPPVDERRLKAVIRRLEVSPLPQELRHFIEWVAAYTLNPPGLVLATALKQHTQLPPKAVLGWVRTEKPLDDVRLTPARQSVLQALGDEPKSSAFLAHESGASTAVIRGLAGTGLIKEAPLLVGASFAQPQPDYAQPALSEEQEDVAQGLRSMVAAERFAVALLEGVTGSGKTEVYFEAVAACLRRGQQVLVLLPEIALSTQWTERFRRRFGVEPALWHSDLGQKRRRETWFAVADGSAKVIVGARSALFLPFETIGLIIVDEEHETSFKQEEGVTYHGRDMAVVRGRLAKAPVILVSATPSLETLANADAGRYQHFLLTSRHGGARLPDVTLLDLREDGPARGQFLAPRLCAAVKETLAAGEQAMLFLNRRGYAPLTLCRTCGHRMQCPNCTTWLVEHRNRGILTCHHCGHTEGIPKACPECDQEETLVPIGPGIERVTEEARECFPEARILVMSSDTLGSAAATAEAVRQITDGEVDLIIGTQIVAKGWHFPRLTLVGVVDSDLGLGGGDLRAAERTVQLLHQVAGRAGRAERPGRVMLQSYVTEHPVMEALASHDFEAFIEQETAQRKPGFWPPYGRLAALIISADSADVADSLAAEIARVAPVQDGIQVLGPAPAPLSVLRGRHRRRLLLKTVRGLAVQPIVRDWLSHVKAKGSARIDVDVDPISFL >NZ_CP032485|1375106:1396207|1393449_1393680_-|WP_141492767.1|DBSCAN-SWA MSPEPKAQTAIIDLSYVMGSGFGYVSSKDNAAVAKHQLWTSFRTENYKKASDNIKQIFGIKDSRSLDNMRLIGEVE >NZ_CP032485|1375106:1396207|1395450_1396207_+|WP_141492768.1|transposase|DBSCAN-SWA MSDLFWLTDEQMERLRPFFPKSHGKPRVDDRRVLSGIIFVNRNGMRWRDAPREYGPHKTLYNRWKRWGDMGIFMRMMDGLSAAKAEPQTIMIDATYLKVHRTASSLRPKKGDPGRLIGRTKGGMNTKLHAVTDQNGRPLSFFMTARQISDYTGAAALLDSLPAAQWMLGDRGYDADWFRDALEEKGIKPCIPGRKSRGKPVKYDKRKYKRRNRIEIMFGRLKDWRRVATRYDRCPTVFFSAVCLAAIVIFWL >NZ_CP032485|1375106:1396207|1375106_1376069_-|WP_141492755.1|integrase|DBSCAN-SWA MTNYKSAETVLSDWLEWLTHERRSSPRTVEAYQHDMSLAFAFLREHLGGDLTLSHINTLSLADLRAWLAFETKRSEKPTRRASNADGRARSRARRLSALRSFFKYLSLRHGVTNPNASLLATPRRKKRLPRPLGQEQALTATTAIADDAPTHQAALRDQALFTLLYGTGLRIGEALSLNINDIGRTGADTIMVRGKGGKERLVPLLPAVQTALEKWRSAHPSPSPDAPLFCGIRGGRLNPGVAQRSMREWRKAEGLPDSATPHALRHSFATHLLEGGADLRAIQELMGHASLSTTQAYTLADEKHLLDVWRKAHPRANSN >NZ_CP032485|1375106:1396207|1393676_1394434_-|WP_141492768.1|transposase|DBSCAN-SWA MSDLFWLTDEQMERLRPFFPKSHGKPRVDDRRVLSGIIFVNRNGMRWRDAPREYGPHKTLYNRWKRWGDMGIFMRMMDGLSAAKAEPQTIMIDATYLKVHRTASSLRPKKGDPGRLIGRTKGGMNTKLHAVTDQNGRPLSFFMTARQISDYTGAAALLDSLPAAQWMLGDRGYDADWFRDALEEKGIKPCIPGRKSRGKPVKYDKRKYKRRNRIEIMFGRLKDWRRVATRYDRCPTVFFSAVCLAAIVIFWL >NZ_CP032485|1375106:1396207|1388246_1389623_-|WP_141492763.1|transposase|DBSCAN-SWA MMGDRTKIQEALFYEFRLEDHVPTGHLLRSIDRFVDLDGLREHLRPFYSGTGRPSIDPELIIRMLIVGYVMGIRSERRLCEEVHLNLAYRWFCGLGLNGPVPDHSTFSKNRHGRFRESDLLRQMFEMTVRQCIAKGLVGGEGFAVDASTIKADANRQRSVPSSDKLPIEAAQRAVREYFSVLDDAAFGSATPVQPKYISPVDPAARWNAASGGLAYYAYCTNYLIDLKSAVIMDVETTTAIRQAEVTAQRRMIERTQETFGIWPERLAADTAYGSAENLAWLVHERGIEPHIPVFDKSARQDGTFERRDFTYNHVHDLYICPGGQELKQQWRKINSDQPNAPPDNLLRYRSSKRACDVCTFKPKCCPNQPSRKVLRSIHEGARDMARDIALTDAYIISRRERKKVEMLFAHLKRILKIDRLRLRGPNGARDEFHLAAAAQNLRKMAKLIPPGAPALST >NZ_CP032485|1375106:1396207|1378391_1379699_+|WP_141492757.1|DBSCAN-SWA MAEENFSILYDDEYLRVIFLEQQNDSEELIITFGDMLLEAEGYCFFADQPLRKLKLPAIGFVAKEAHWYQRSSMLAAYKVLMPILGRFSRRILYGGSMGGYAAIKFSKLFAATHVVALCPQWSIDPAEWPGERAGWEGNFRDYMQGMSIKAGDVSGDIYVFADKFDDKDYRHFLKIKNSIEHVNFVNVPFVGHHVTGVFAGTSKLQELFYACVHNDMRGLYLFSRNTRKSFGFYAQTVQDLALDRFPNLFAKRVIAAQSPPGDVRIMPRIVSTLKDNNRAGDAKALVDQCFARLHIGDSSGSILFMCCVMLGEQVVLVGHHGKVLAVDAASMQLIQVDRNYRPWQFPICVNPRTTSHIWTVQQEHNIFIEACVDNRVSLGYGTTRDTMEVELVPCGRGRFYVKNNGRYLCAEPDGKVMLNRPDAQAWEEFHFEVL >NZ_CP032485|1375106:1396207|1381522_1382110_-|WP_141492759.1|DBSCAN-SWA MTWTLDAPAEHRSVIRNSVFQTYAAPVADEAAAMTFLQNVAVPDATHNCWAFRIGQRFRSNDDGEPSGTAGRPILAAIDGQDFDNVMVVVIRWFGGVKLGAGGLVRAYGGAAAACLRDAPKSERIAMQALHFHCPFSIFSRLEARFPVWGAQKSDSHFDADGVQLSLMVPEASSTQIINEIRDMTRGQTTIETDD >NZ_CP032485|1375106:1396207|1391534_1392248_-|WP_170211057.1|transposase|DBSCAN-SWA MTPMSKIPVSYKRHRFPRELIAHAVWLYFRFPLSFRLVEEMLLERGIVVSYETIRRWSLKFGAEYVRSLRRKRAKRDDIWHLDEVRVVISGHIHWLWRAVDQDGYILDEILQKRRNTKAAKRLLTRLLRKQGVRPRRMITDKLSSYGAAKCELRLSVQHLSRKGLNNRAENSHLPLRKRERTMQRFRSPGGCQRLSTVFSAVRNFFVPPAANANALSRHIHRLQAFAQWNTATILAA >NZ_CP032485|1375106:1396207|1386933_1388037_+|WP_141492762.1|DBSCAN-SWA MRWNWQRPDWPYFQFEKNCLRDAETKFLRGSGVIVGAMRHLDTGANQQLVVQLMSEEMVESSAIEGEVLDRASVQSSIARHLGFAADRRRSTPAEAGAAELMVDLYQHYAAPLTDQSLFVWHTMLMNGRRDLDVIGGYRVHAEAMQIISGPIHEPHIHFEAPPSQAVPSEMAQLILWFNRTAPSGSEPLPAIERAAIAHLWFETIHPFEDGNGRIGRAIAEKALTQSLEGPTLTALAATINTHKKAYYLELNRASKTNQIEDWMIWFSQIVLEAQSQTLQKIQFLIEKTRFLDRWRGKLNARQEKAILRMLAEGSDGFQGGLSAQNYRSITGATSATATRDLANLVSLGALHRTGDNKYARYSLCLG >NZ_CP032485|1375106:1396207|1379710_1381393_-|WP_141492758.1|DBSCAN-SWA MVPARLNSGFPALFFAFAGLYSFTTTPLSAAPEATPDARTPKNPTWIPNTTTDRPPIQNYPHAGFVGQPRRILPSLNRPTQGHQGPWGAFNYGTGEAAGFGPVGRYGPAPWAEDWSDLRNPKKSDDLLDPLKFIALNQQKTIWLTLSGETRLRNWSEEAPFLGRRGNAPSGRFTVRNLLGADLHLGEHVRLFGQLINANAAGWAGFGYNQTFRKRLDAQQAFIELKGQIAGARTGLMVGRQQFLDAPSYILYNRETPNVPLSWNGTRIYAIWPKFRLDTFSFAQTNTNPNKMFHDTIDWGTRLYGVDGTAVVPDFVIGSQQVHSFFDLFYLRYRYSSSLSVVPSGSTTLKGTSSRGNIGIRWYGTARDFEFSFGGLYQNGSFQYTNNASSSDVSAWAVNTIAGYRHTPSPLHPFIGAQFDIYSGGADGRNATLHTYMAPYNPQTTYSDPTTYLQPSNLISLSPILSVTPFHGYASLRFKAPLLWRQNTNGAIWNSSGPYTFSRPYRGSFVGIEPQAFLLIQLQRHLSWQLIGSRFIASKSLRNAGAQSGSFFQSNLVFRF |
15 | Leptospira_phage(33.33%) | integrase,transposase | attL 1371354:1371367|attR 1389621:1389634 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
2364867 : 2375633
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP032485|2364867:2375633|DBSCAN-SWA ATTAATAATGAGCCGGGACATAACCCAAAACCTCCATGAGTTGTGTCCTATCGGAAAGATACGCGGTCATCTCTTCCGACAAAGGCTTATCAAGGGTGAACACCATATCAACCGTCGGTACGCCGGCATACAAACTTGCTCCCCCTGAAATGAGCACCAGTCTGAGGTCAAAGCGGATCGAAATATCCGAAATGAACGCCGTCGATGCTTCATCCCCTACGAGCAGCGCTCTTACCACACTTAAATGCTTCTCAAGGGGCTTGTTCGTTAGATTGCTTTGTAGGGTGTGGGGCAGAGCAGGGCGGGTTTCGTTCAATAGGGCCTGAACAGCGGGCGGGTTGTTGGAGCGCGTTAAGAGATCAAGTATCGCGCCGTTGTGAACGAGTTCTCCCTGATCTAGCACCAATACTCTTTGAGCAAAGCGGCGAATGACGTCCATCTCGTGCGTAATGAGGATAATGGTCAAACCCAGTTTGCGATTTATATCGCTCAGAAGGTCCAGTACCGCCGTCGTGCTCTGCGGGTCCAGTGCTGAAGTAGCTTCGTCACACAAAAGCAGGGACGGCGCGTTCGCTAGCGCCCTCGCTATACCGACCCGCTGCTTTTGCCCTCCCGATAATTGTGCGGGATATTTTTCGCTGTGCTCGCCAAGACCAACCAGTTCCAGCAGTTCTTTAACGCGGGCTTTTATGTCGTTTGACGACCAACCCGCTATTTTTAGGGGTAGGGCTACATTTTGAGCTACGGTCTTTGTTTGCAGCAAATTAAAATGCTGAAAGACCAAACCTATATTCTGTCTAGCCTTCCGCAAATCCCGCTCTGCCAGCGTTGTAATGTCTTGACCTTGGAGTACTACGCTCCCTTGGTCAGGCTTTTCCAAGCCACACAAGGCACGCAGTAATGTCGATTTTCCTGCGCCAGATGGGCCAATAAGACCAACAATCTCGCCTTTTTCGACCTCAAACGAAATATCTTTTAAAACAGGGCTTTTCGCAAAACTATGCCAAAGCCCCGCAACACTAAGAACACTCATGGCCATGCTGGCAGTGTCGAGCCATGGTAGACATCTAGCAGAACTTTTTTAACATTTGGCTGCTGATAGGCATCGACTAGCTTTTTGACCCAAGGAGCCTTCGCGTCTTCTTCGTTAACAGCGATGAAGTTCGTGTACGGATTATTTTGCAGAGCTTCTTGTGCAATGCGCTGATGCACAACATCAATTCCTGACTTATGTGCCCAATCCGTATTAACCACTGCCGCGTCAAGATCGGACAATGCGCGACCCACAACCCCAGCGTCTAACTCTTTCACGTTTAAATTGCGTGGATTATCGGTAATATCTAGTACAGTTGGCAAAAGGCCCGCGCTGTCCGAGACTTTTATCAAACCTTGAGTTTGTAAAAGTAACAATGCACGACCTTCATTGCTTGGGTCGTTTGGCACACCAATACTTGCACCTTTAGGTAACTCGGAGACACTGTGCCACTTGGTGGAATATAGCCCAATCGGTGAAACATACATGTTTCCGACCGGCACAATATGGAAACCATGGGCTGCAATTTGCGCCTTCAAAAAAGGCCCATGCTGGAAGGCATTTGCCTGGACGTCGTGCTCTGCCAGTGCCTCATCAGGCAGATTGTACTCTGAGAAGGTAACAACCTCCACCTTCAAGCCATTTTTAAGAGCATTCTGTGACACAACACGCCAGACGTCCTCATCCTCGCCAGACATGATGCCGACACGTAGAGGAGTAGGGGTGCCTTCTTCAGCGTTGATGTGTCCAAAGGATGCGGCCCCCATGAACGCCGCAAAGGCCAAAAGAGTATGACGACGAGAAAGCATGCAAGCATCCTTTAACACAAAACAAGACCACAAAATAAACGATTTGACGAACAGATAATGTCATTAACGACAAAAGTCTATCCGAAACACCGATTTATCAACGTGACAAGATAAGATCCAACGCACGAAGAGCGCTAGCTTGCCTTATTGTGTGCCTGTCACCTTTGAAGAAACAACTTTCCGACTGGGTCTCAAGCCCTCGTCGAGCAATGGAAAAACAGACGGTTCCTACAGGCTTATAGGCAGACCCTCCACTTGGCCCAGCTATACCCGTAACCGCGACAGACACCGTTGCACTTTGCGTCCGTGTCAAAGCACCTTCCGCCATTTGTCTGGCTGTCTGCTCGCTTACGGCTCCATGGGCTTCCAGCGTTTCAAGGCTTACATCTGTCAGTTGGCGTTTGAGAAGATTGGAGTATGTGACAAGTCCGCCTTCAACCACATCTGACGACCCCGGCACTTCGGTCAAGGATGCAACAATCAAGCCGCCTGTGCAGCTTTCCACGGTGACGAGTTTTTCATCGTAGCTTCTTAAAAATTCAAGAACCTCTTCTGCTTTGGCAATGAGGGGGTCAGTTATCATGAAAAGCGCTTTCAACGGCGTAAAGCGTAATATTCGCTTCGATATTTAAAACATAAGACCGCGTCTCCCTGTAGGGCAGACGCTCTATCCAATCGAGCAATGTTTCATCGTTGATATCTGCGACAGGCGGGTTCGCCTTTAACCATGTATCCACCCTGTGCGGGCCAGCATTATAAGAGGCTAAAGCGTACGGAATTACGTGGTCAAAACGCTCCAGAAGCTGGCGAATATAAGCACTGCCGACTGTCAGGTTTGTCTGCGGATCTTTCAATCCCTCCGCTGAAACATTGAAACCTTTGAGGTGAGCGCGCCGCACAACGTCTCGTGCTGCACCCGGTAAAAGTTGCAGCAAACCGACAGCATGCGCTGCGCTCAATGCGTCAGGATTAAATGCACTTTCTTGCCGGGCAACGCCTAGTATAATGCCTTCAGGCAAATCTGCATCGACTGGCCATGGGTTTGGGTAACCTTGCGGATACAAACTGTACCCCAACCGCGCGAGGCTATGTGACGCAGCCACAGCACCCGCGGGGACGTTCAACTGTAAGGACAAATCCGCAACAGCCTTCTGCCCTGATGGGGAGGGATTTACCGTTTGCAGCAGCAACAAGAAAAGCCGCGAATGATCATAATCTCCGCTGTTGGCCAAGCTTCGTGCAGCATCAACCAGGTCTAGGCGCTGCAAAGGTTGTGTTTGGGGCTGGACCTGCTGGGTGAGCAGCTTATGCATTTCTGCCAGAAACGCATCGCTTTTGGTTTCGCCATTCAGTAGCGCTGTCTTATCCGTTAATTCAGCAAGGGCAAGCTGACCATGAATCATGGTTGGGTAGTATGATGCTTCTTTAAACGCCTCTTGAGCGCGATCGTCATCTTCCAGCAGAGCATAGCCCCGGCCAGCCCAATACCAACCCGCAGCACGGAAGCGAAGCGGCTGAGCCTGCTGCAAGGCTGCAAAATGCGGCAATGCCATTTCCGCATTTTTAAGGGCTCTGAGTGCTATGTATCCAGTTAATTCTTCAGCTTCTAAGCGTGCCGTAGTGTCTAGGGAAAGCGTTTGATCGTCTGAGAATTTCAACGCATCTTCGATGCGGTTGAATAGTAAAAAAGTCCGCGCCAATGTAAGGCGCTCGTTAGTCCATGCAGTCGTAGGACTTTGTTTTTGAAGAGCGAATATCCGCTCATTCCATAGCGCTTGAAGATCATCCAAACGATCAGCCCGTTTTAGATATCGGGAATAATACCGCAAAATGATCGGATCATTTTTCTGCGCCTCAGATAAAGACAAGAATGCACTTTCCGCATCAGGTAATGCGTAGTGCATGGATAAGCGCGCCGAGGCCAAGGCAGCATCTTGTGGCGCCAAACGGGAGATTTGCCTACGTGCCGAGGTAAGCTTGCCTGCGCCATCTAAAGTCTCAAAGCGCTGCCATTGATCCTTGGAGGTAAAGCCCGAGCCAAAACGCTGCTCAAAAGCAGCCTCTTGAGCACCATCAACCGCACCACTTATCCATAAGCTACGCGCTCCAGCGATGGCGTGCGGCAGATAAGCCACGCATCTCAAGAAAGCCTCGGGCAAGCTGAGCGGTAACGCTGGACAAAGCGCTTGCAGTTGGCCCGGGTCATTTTCCTGCGAAAGTGCCTTCTGGTAACGCCACATTATGCGGCTCTGCTCAGGCCAGACCGGCCCACTTGTCAAGAAACTCCCGTACCGTGAGGCGCTAAATCCGGCCCCGTCGGCACTTGTCAGAAGCAGCCATTCTTGCAGGCGCGCAGATAGCGGCGCCGCCTGAGCGGTGTCTAGGGCAGTGGCCCCTATGATGATGGAGGCAATGACAGACTTTGCACGTAACCGCATAGGAACTGCTCTATGATTAATAAACAACATGCTGCCAGACTTCACCGCCACCTTTGACGATAAGCTCCACAGCGACGAATAGAACAATCGCAAGGCCGACCCAGGCAATCCACTTATAACGCTCAAGGAGTCGGGCAATAAACGAGGCAGCAAGCGCCATCATCAAAACAGAAGCTACGAGGCCAAAAACCAGAACCCAAGGGTGGCCAACCGCGGCGCCGGCAACAGCCAGCACATTATCGACGCTCATAGACAGATCAGCTGCAATGATCCTGACAATGGCTGTCTTTAAGCTGGAAGGCGCATCTATTGTTGGGCCGCTCTGCTCATGCCGAAACTCGTGGAACATACGGGCACAAACCCATAGCAGCAGCAATCCGCCAGCCAAGGTCAAGCCGACGATAGATAATAGCTGCACCGCTACCAGAGCCAAAACTATACGCAGAACTGCGGATAAAAGTGTGCCTGCAAAAATCGCTCTTGTCCGGTCCCGACCAGTCAACTGGCGTACTGCGAGGCCGATTACGACGGCATTGTCTCCCGCAAGAGTGACGTCAATCAACGTGACTTGTGCGAGAGCAAAAAGGGTATGAATCATAGAATCGGGTGACAAAGGTTGTGCTTTCCTAAAAGCGATCCGTGCAATGTTCTCGTATTTGGTCTAAGCAACCCCCGTAAAGACCTTAAGGTCGTGACCCACACGCTTTTCGGAGTTCGCATGGTCCCCCGCTATAGCCGCCCCCAGATGACCGCAATCTGGTCCCCCGCCAATCGCTACCGCATCTGGTTCGAAATCGAAGCCCTCGCATGCGAAGCTATGGCTGAGCAAGGCGCCATTCCCGTCGAAGCCGCCCGCATCATCCGCGAAAAAGGCGACATCGCACTTGCTGCCTTCTCCGATGCTGATCTGGCACGCATTGATGAAATCGAGGCTGAAACGCGCCATGACGTCATTGCGTTCCTTACATGGCTAGCAGAAAAAGTCGGCCCTGAAAGCCGCTTTGTTCACTTGGGCATGACGTCTTCGGACGTTCTGGACACATGCCTTTCTGTCCAATTAACACAAGCAGCAGACCTTCTGCTTGAAGATATGGACCGAGCTTTGGAAGCGCTAAAAACACGCGCTTACGAGCACAAAAACACTGTTACCATTGGTCGTAGCCACGCTATCCATGCTGAGCCGACCAGCTTTGGCCTAAAGCTTGCAGGGCACTACGCAGAGTTTGCCCGCGGTCGTGAGCGGCTGGTTCAAGCGCGTAAAGAAATTGCGGTATGTGCAATTTCTGGTGCGGTTGGCACCTACGCACATATTGACCCGGCTATTGAGGAATACGTCGCAGCCAAGCTCGGCCTCGAAGTCGAGAGCGTATCAACGCAGGTCATCCCACGTGACCGCCATGCTGCATTTTTCTGTGCGCTCGGCGTCATTTCCAGCGGTATTGAGCGTTTGGCTGTAGAAGTCCGTCATCTGCAACGCTCAGAAGTGCGCGAAGCTGAAGAGTTCTTCCATAAGGGGCAAAAAGGCAGCTCTGCCATGCCGCACAAGCGTAACCCAGTTCTGTCTGAAAACCTGACAGGTCTGGCACGTTTGATTCGCTCACACGTTGTTCCCGCGCTTGAGAACGTCGCTTTGTGGCACGAGCGGGATATCAGCCACTCCGCTGTTGAGCGCAACATCTGCCCTGACGCGACGATCGGCCTTGATTTCGCACTGATCCGCATGGCCAGCATGATGGAAAAGCTGGTTGTCTATCCAGATCAAATGATCGCAAACATGGAAAGCCTTGGTGGCGTCGTCCATTCTGGCGAGGTTCTGCTGGCATTAGCTCGCGCCGGTATCTCCCGCGAAGACGCTTACAAGATTGTTCAGCGCAATGCGATGGCAACTTGGACACGCTTGGGCCAACCCGATGGCCGTAGCTTCCGCGAAAATCTGGACGCTGATCCAGACGTCAAAGACCGCATTGATGCATCCGTGCTCGACAAAGCCTTTGACCCTGCACAGCACCTTGCCAAAGTTGACCGTATTTTCACGCGCGTTTTCGGCTAAAAAACCACAGGGCGGATGCTTCTATCATCCGCCCCTCGCAGTTCACGCGCTGTAATGCTATGGGAGCGGAATACTCCGCTTGACACATCAGAGGCTGCTTCGTTCGTCAAGACTGTTTGCAGCCCTTGATACATTGCAACAAGGATAGCCAATGGCCCGCCGCCGCCAGATCTACGAAGGCAAAGCCAAGGTCCTGTTCGAAGGCCCCGAACCGGGTACACTCGTCCAGTATTTTAAAGATGACGTCACCGCCGGAAACGGTGAGAAGAACGGCATCATTACGGGCAAAGGTGTGCTTAACAACCGCATCAGCGAATATCTGATGCAACACCTGCACGACATCAACATCCCGACTCACTTTTTGCGCCGCCTCAATATGCGTGAGCAGTTGATCCGCGAAGTGGAAATTATTCCACTCGAAGTCGTCGTGCGCAACGTCGCAGCAGGCTCTATCGCCAAGCGCCTGGGCCTTGAAGAAGGTACGCGCCTGCCGCGGACCATCATTGAGTACTACTACAAAAACGACTCACTCAATGATCCGATGGTCTCAGAAGAACACATTGCCGCGTTCAATTGGGCAGCTCCACAAGACCTTGACGAAATGAATGCGCTTGCACTGCGCACAAACGATTTTCTGATGGGGATGTTTGCCGCCGCCGGGATCACATTGGTTGATTTTAAGCTGGAATTTGGGCGCGTTTGGGAAGGCGAGGAAATGCGCATCATCCTCGCTGATGAAATTTCCCCCGACAATTGCCGTCTTTGGGACACAAAGACCAGTGAGCGCATGGACAAAGACCGCTTCCGCCGTGACCTCGGCAAAGTAGAAGAAGCGTATCAAGAAGTAGCACGTCGCCTCGGCATTCTGCCTGAGGCTGGTAACGGTGACCTCAAGGGTCCGGAGGCAGTTCAATGAAAATCCGTGTTTTCGTATCCCTGAAGGAAGGCGTTTTGGACCCACAGGGCAAAGCCATTGGCCACGCCCTAGACACGTTGGGCTTCAAAGGCCTCGGCGAAGTCCGCGTAAGCCGCGTTGTTGATCTTGACGTTCCTGCGACCGACAAGGAAACAGCTCTGGAGCAGGGCGCTGCGATGGCTAAGGCACTGCTGGCAAATGAAGTTATCGAAGACTTCTTTGTAGAGGTGGCTGCATGAAGAAGCTGCTGGCACTCACTGTAGCAGCATCGCTCGCAGCATCTGGCTCGGTTGCACACGCACAGCGCGTATCAAAAGTAAGCGGGCGCATGATCGGCTCTATGTGCAGTAATGCACGTTCAGCTGGCTTGTGCGACGCATACATCGCAGGCGTAACAGACAGCGAAGTCTGGTCGAAAAAATTCGACGAGATCTCCAACGATGCGAACGCCCCGGTTGCATTCTGCGTCCCCGCAAGCGAGACAACTGCACGCCTGCGTGAGAGCGTTGTGTCCTGGCTGCACCGCCATGATGATGCCCTAACACAGCCCGCTGCGAAGGGTATCTATCGAGCCCTGCACGAAGCTTATCCTTGTCACAGCGCGACGGAGGACAAAAAATGAAAGCGGGTATCGTCGTATTCCCCGGTACAAACCGGGAGCGCGACATGGCACAGGCTCTCAAGCTGATTTCTGGCCATGCTCCGCGTATGATCTGGCACCATGAAACGTCCCTTGGTGACCTTGACCTCGTAGTCCTACCCGGTGGGTTCAGCTTCGGTGACTACCTCCGCAGTGGTGCTATGGCAGCACACTCACCAATCATGAGCGCTGTGAAAGCTTTTGCTGAGCGCGGCGGCCACGTTTTGGGTGTCTGTAACGGCTTCCAGATTTTGACAGAATCGCATCTTTTGCCGGGCGCCTTACTTCGCAATGCTGGCCTGCGCTTTCTGTCGCAAGACTGCCACTTGAAAGTTGAAAACGCCTCCTCACCATTTACCCGCGGCTGGAAAACGGGAGATGTATTCCGCTCTCCCATGGCGCATGGTGACGGCAACTACACTGCGTCCCCTGAAACGCTAGATCGGCTCGAAGGTGAAGGCCGCGTCGCTTTCCGCTATTCCACAGCAAATGGAACAGTGGCTGCAGATGACGTTACAGCCAACCCTAACGGTAGCGATCGCGCAATTGCCGGTATCCTTAGCGAAAATGGTCGCGTTTGCGGCCTAATGCCGCACCCTGAAAACCTGACTGACCCTGCAATCGGTGGTACGGATGGCGTGCCCCTGTTCCGTGGTCTTGTGGAGGCACTTGTCGGATGAGCGTGAAGGTTGATGCCACTCTCGCCCAGTCCTTCGGGCTGACCAGCGTAGAATACGATAAAGTTCTCACCATTATGGGCCGTACGCCGAGCTTTACAGAGCTCGGTGTGTTCTCGGTCATGTGGTCCGAGCACTGCTCTTACAAATCATCGCGTCTTCACCTGAAGACACTGCCGACTAAAGCACCTTGGGTCATTCATGGCCCCGGTGAGAATGCTGGTGTGGTAGATATTGGTGAAGGCCTGGCGGCCGTCTTCAAGATGGAAAGCCATAACCATCCATCCTTTATCGAGCCTTACCAAGGTGCGGCTACGGGCGTTGGCGGCATTTTGCGTGACGTCTTCACCATGGGCGCGCGTCCTGTTGCCAACCTGAACGCGCTGCGTTTTGGTGACCCAAAGAACGCTGGCACGCGCCGCATTGTTGATGGCGTCGTACGCGGCATCGGTGGCTATGGTAACTGCGTTGGCGTTCCAACCGTTGGCGGCGAAGTCAACTTCCATAAAGCATATGACGGCAACCCACTCGTCAATGCGATGACAGTCGGTATCGCTAAAAAAGACAAGATATTCCTGTCTGCGGCAGCTGGCGTCGGTAACCCGGTTGTTTATGTCGGCTCAAAAACCGGACGCGATGGCATCCATGGTGCTACCATGTCTTCCGCTGAGTTTGACGATGAAGCTGCATCAAAGCGTCCAACCGTCCAAGTCGGTGACCCGTTCATCGAAAAGCTTCTCATCGAAGCCTGCCTTGAACTTATGGCGACAGACGCAATTGTTGCTATTCAGGACATGGGCGCTGCGGGTCTCACGTCTTCCTCAGTCGAAATGGCCGGGAAGGGCGGAGTTGGCATTGAGTTGAACCTTGATGCCGTACCTCAGCGCGAGCCTAACATGACAGCGTATGAAATGATGCTGTCTGAGAGCCAAGAGCGTATGCTCATGGTGCTCAAGCCAGAGCGCACCGAAGAGGCACGTGCTATCTTCGAAAAGTGGGAGCTGGACTTTGCTGTTATTGGTCACCTGACCGATACGGGCAACATTACAGTTAAGCACAACGGCGCTGTAGAGGCCGATATTCCGCTAGCGCCGCTGGCTGAAGAAGCACCAATCTACGACCGCCCGCGCGCCCCGCTGGAAGCACCTCGCCACATTCAGCCGCCAGCTGACCCGGTTGGTATTGAACACGCGCTGCTTACCCTCATTGCATCGCCAGATCTTGCTTCACGTGCATGGATCTGGAACCAGTATGACAGCCTCGTTGGCGGTCAAACGGTTCAGCGCCCCGGTGGCGCAGATGCTGCTATCGTACGCGTAGAAGACACCAAGTTAGGCTTGGCACTTACGACCGATTGTACGCCGCGTTATTGCCAAGCTGACCCACATGCAGGTGGTGCACAGGCCGTTGTCGAGGCATGGCGTAATATTACGGCAACGGGTGCAACACCTCTCGCCGTGACGGACAACCTAAATTTCGGCAACCCAGAACGCCCCGAAATCATGGCTCAGTTCGCTGAAGCGATTAAGGGCATGGGTGAAGCATGCCGCGCGCTCGACTTCCCCGTCGTGAGCGGCAACGTGTCTCTGTATAACGAAACACGTTCCCCAACCGGCTTACCACAAGCGATTTTACCGACTCCTGCTATTGGTGGCTTGGGCGTGTTCCAAGACGTTTCCAAGTCTGTCGGTCTAGCCATGCCTGAAAAGCAAGAGCTGGTGCTGGTGGGTGAAATTCGTGGTGAGCTTGGTCAATCACTGTGGCTGCGTGAAATTTGCCACCGTGAGGAAGGTGCTCCGCCGCGTATCGATTTGGTCGCCGAGCGCCGCAACGGCGACTTTGTTCGCAGCCAGATTCAAAGCGGCACAGTTTCTGCTTGCCATGACATCGCTGATGGCGGCTTGCTCATCACGCTAGCAGAAATGGTCATGGCAAGCGGTGTAGGCTGCTCCTTGTTAGAGCATCCAGCTTCCCTGCCACATCACGCATTCTGGTTCGGTGAAGATCAAGCGTGCTATGTCGTGGCAACGACAGATGCTTCATCCTTCATCGCGGCGGCTGAAAAAGCAGGTGTACACGCCCGCAATCTCGGCCGTACGGGTGGAACCGCATTGAAAATGGCAGACGGCATCAGCGTAGAAGCGTCGCGTCTACGTGACATTAACGCAGCATTCTTGCCGGCACTTATGGGGCAGGATTCGTAA
Protein sequences of DBSCAN-SWA_2 >NZ_CP032485|2364867:2375633|2372730_2373432_+|WP_141493637.1|DBSCAN-SWA MKAGIVVFPGTNRERDMAQALKLISGHAPRMIWHHETSLGDLDLVVLPGGFSFGDYLRSGAMAAHSPIMSAVKAFAERGGHVLGVCNGFQILTESHLLPGALLRNAGLRFLSQDCHLKVENASSPFTRGWKTGDVFRSPMAHGDGNYTASPETLDRLEGEGRVAFRYSTANGTVAADDVTANPNGSDRAIAGILSENGRVCGLMPHPENLTDPAIGGTDGVPLFRGLVEALVG >NZ_CP032485|2364867:2375633|2373428_2375633_+|WP_141493638.1|DBSCAN-SWA MSVKVDATLAQSFGLTSVEYDKVLTIMGRTPSFTELGVFSVMWSEHCSYKSSRLHLKTLPTKAPWVIHGPGENAGVVDIGEGLAAVFKMESHNHPSFIEPYQGAATGVGGILRDVFTMGARPVANLNALRFGDPKNAGTRRIVDGVVRGIGGYGNCVGVPTVGGEVNFHKAYDGNPLVNAMTVGIAKKDKIFLSAAAGVGNPVVYVGSKTGRDGIHGATMSSAEFDDEAASKRPTVQVGDPFIEKLLIEACLELMATDAIVAIQDMGAAGLTSSSVEMAGKGGVGIELNLDAVPQREPNMTAYEMMLSESQERMLMVLKPERTEEARAIFEKWELDFAVIGHLTDTGNITVKHNGAVEADIPLAPLAEEAPIYDRPRAPLEAPRHIQPPADPVGIEHALLTLIASPDLASRAWIWNQYDSLVGGQTVQRPGGADAAIVRVEDTKLGLALTTDCTPRYCQADPHAGGAQAVVEAWRNITATGATPLAVTDNLNFGNPERPEIMAQFAEAIKGMGEACRALDFPVVSGNVSLYNETRSPTGLPQAILPTPAIGGLGVFQDVSKSVGLAMPEKQELVLVGEIRGELGQSLWLREICHREEGAPPRIDLVAERRNGDFVRSQIQSGTVSACHDIADGGLLITLAEMVMASGVGCSLLEHPASLPHHAFWFGEDQACYVVATTDASSFIAAAEKAGVHARNLGRTGGTALKMADGISVEASRLRDINAAFLPALMGQDS >NZ_CP032485|2364867:2375633|2371347_2372112_+|WP_141493634.1|DBSCAN-SWA MARRRQIYEGKAKVLFEGPEPGTLVQYFKDDVTAGNGEKNGIITGKGVLNNRISEYLMQHLHDINIPTHFLRRLNMREQLIREVEIIPLEVVVRNVAAGSIAKRLGLEEGTRLPRTIIEYYYKNDSLNDPMVSEEHIAAFNWAAPQDLDEMNALALRTNDFLMGMFAAAGITLVDFKLEFGRVWEGEEMRIILADEISPDNCRLWDTKTSERMDKDRFRRDLGKVEEAYQEVARRLGILPEAGNGDLKGPEAVQ >NZ_CP032485|2364867:2375633|2365895_2366666_-|WP_141493955.1|DBSCAN-SWA MGAASFGHINAEEGTPTPLRVGIMSGEDEDVWRVVSQNALKNGLKVEVVTFSEYNLPDEALAEHDVQANAFQHGPFLKAQIAAHGFHIVPVGNMYVSPIGLYSTKWHSVSELPKGASIGVPNDPSNEGRALLLLQTQGLIKVSDSAGLLPTVLDITDNPRNLNVKELDAGVVGRALSDLDAAVVNTDWAHKSGIDVVHQRIAQEALQNNPYTNFIAVNEEDAKAPWVKKLVDAYQQPNVKKVLLDVYHGSTLPAWP >NZ_CP032485|2364867:2375633|2367280_2369146_-|WP_141493632.1|DBSCAN-SWA MRLRAKSVIASIIIGATALDTAQAAPLSARLQEWLLLTSADGAGFSASRYGSFLTSGPVWPEQSRIMWRYQKALSQENDPGQLQALCPALPLSLPEAFLRCVAYLPHAIAGARSLWISGAVDGAQEAAFEQRFGSGFTSKDQWQRFETLDGAGKLTSARRQISRLAPQDAALASARLSMHYALPDAESAFLSLSEAQKNDPIILRYYSRYLKRADRLDDLQALWNERIFALQKQSPTTAWTNERLTLARTFLLFNRIEDALKFSDDQTLSLDTTARLEAEELTGYIALRALKNAEMALPHFAALQQAQPLRFRAAGWYWAGRGYALLEDDDRAQEAFKEASYYPTMIHGQLALAELTDKTALLNGETKSDAFLAEMHKLLTQQVQPQTQPLQRLDLVDAARSLANSGDYDHSRLFLLLLQTVNPSPSGQKAVADLSLQLNVPAGAVAASHSLARLGYSLYPQGYPNPWPVDADLPEGIILGVARQESAFNPDALSAAHAVGLLQLLPGAARDVVRRAHLKGFNVSAEGLKDPQTNLTVGSAYIRQLLERFDHVIPYALASYNAGPHRVDTWLKANPPVADINDETLLDWIERLPYRETRSYVLNIEANITLYAVESAFHDN >NZ_CP032485|2364867:2375633|2372347_2372734_+|WP_141493636.1|DBSCAN-SWA MKKLLALTVAASLAASGSVAHAQRVSKVSGRMIGSMCSNARSAGLCDAYIAGVTDSEVWSKKFDEISNDANAPVAFCVPASETTARLRESVVSWLHRHDDALTQPAAKGIYRALHEAYPCHSATEDKK >NZ_CP032485|2364867:2375633|2364867_2365899_-|WP_141493630.1|DBSCAN-SWA MSVLSVAGLWHSFAKSPVLKDISFEVEKGEIVGLIGPSGAGKSTLLRALCGLEKPDQGSVVLQGQDITTLAERDLRKARQNIGLVFQHFNLLQTKTVAQNVALPLKIAGWSSNDIKARVKELLELVGLGEHSEKYPAQLSGGQKQRVGIARALANAPSLLLCDEATSALDPQSTTAVLDLLSDINRKLGLTIILITHEMDVIRRFAQRVLVLDQGELVHNGAILDLLTRSNNPPAVQALLNETRPALPHTLQSNLTNKPLEKHLSVVRALLVGDEASTAFISDISIRFDLRLVLISGGASLYAGVPTVDMVFTLDKPLSEEMTAYLSDRTQLMEVLGYVPAHY >NZ_CP032485|2364867:2375633|2369162_2369744_-|WP_141493956.1|DBSCAN-SWA MIHTLFALAQVTLIDVTLAGDNAVVIGLAVRQLTGRDRTRAIFAGTLLSAVLRIVLALVAVQLLSIVGLTLAGGLLLLWVCARMFHEFRHEQSGPTIDAPSSLKTAIVRIIAADLSMSVDNVLAVAGAAVGHPWVLVFGLVASVLMMALAASFIARLLERYKWIAWVGLAIVLFVAVELIVKGGGEVWQHVVY >NZ_CP032485|2364867:2375633|2366805_2367291_-|WP_141493631.1|DBSCAN-SWA MITDPLIAKAEEVLEFLRSYDEKLVTVESCTGGLIVASLTEVPGSSDVVEGGLVTYSNLLKRQLTDVSLETLEAHGAVSEQTARQMAEGALTRTQSATVSVAVTGIAGPSGGSAYKPVGTVCFSIARRGLETQSESCFFKGDRHTIRQASALRALDLILSR >NZ_CP032485|2364867:2375633|2369864_2371196_+|WP_141493633.1|DBSCAN-SWA MVPRYSRPQMTAIWSPANRYRIWFEIEALACEAMAEQGAIPVEAARIIREKGDIALAAFSDADLARIDEIEAETRHDVIAFLTWLAEKVGPESRFVHLGMTSSDVLDTCLSVQLTQAADLLLEDMDRALEALKTRAYEHKNTVTIGRSHAIHAEPTSFGLKLAGHYAEFARGRERLVQARKEIAVCAISGAVGTYAHIDPAIEEYVAAKLGLEVESVSTQVIPRDRHAAFFCALGVISSGIERLAVEVRHLQRSEVREAEEFFHKGQKGSSAMPHKRNPVLSENLTGLARLIRSHVVPALENVALWHERDISHSAVERNICPDATIGLDFALIRMASMMEKLVVYPDQMIANMESLGGVVHSGEVLLALARAGISREDAYKIVQRNAMATWTRLGQPDGRSFRENLDADPDVKDRIDASVLDKAFDPAQHLAKVDRIFTRVFG >NZ_CP032485|2364867:2375633|2372108_2372351_+|WP_141493635.1|DBSCAN-SWA MKIRVFVSLKEGVLDPQGKAIGHALDTLGFKGLGEVRVSRVVDLDVPATDKETALEQGAAMAKALLANEVIEDFFVEVAA |
11 | Pseudomonas_phage(42.86%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
2394719 : 2405786
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP032485|2394719:2405786|DBSCAN-SWA CATGTCGCAGAAAACAGTGCACCAAGCGGAAGCTCAGATTGAAAGCTGGATCGCATTGCAGCGCGATCTCGGTAAAGCACCCAATACTGTTATTGCTTATGGAAGAGGGGTGCGCGATTTTGCCGCATTCTGCTGCATGGAAAGTTTCGACCTCCTTGAAGCGCGGAAAGGGACCATCGCTGCGTATTTGCATCACCTGCGTTCCCGCCCCCCGAGGGGCGGGAACGCAGGTGATGCAGGTCATGCATGCGCCACGTCTCCTATAATGGCAAACGCCTCATTGCGACACAGACTAACGATTATCCGACTGTTCTATGACCACCTTGTAGAAGAGGGGCTGCGCGGCACAAACCCCGTACCTCGCGGCCTCCGAGGAGCTATTGATGGATCCCGCCATCAGGGCGCACGTGGCCTAGTACCAGTCCACCGATCCCTTCCATGGATTCCTTCTGAAGAGCAGTGGCTAACAATCTTAGAGGCAGCACGAAGCGAGCCGATCCGCAACCGTCTCATGTTAGCATTCGCCTATGACTGCGCACTAAGGCGTGAGGAGCTATGTGCGTTAGAAACTTCAGACATAGATCCTGCGCAGAAATTGATAACGATCCGCGCTGAAACAACAAAAACAAAAGCTGGCCGCGTCGTCCCTTACTCTGCTGTTACAGGGGAGTTACTTACCGTATACCTTCGCGAACGTCGTGCCCTGAGTCGATCAAGAGGGCCACTCTTCCTATCGACCTCTCCACGAAATCTAACGATGCCAATCACCAAATGGACTTGGTCGAAAGTCGTGCGTGCCTTGGGAGTGAGGGCCGGCGAGCCGAAGCTCAGCACCCACACGATGCGGCATCTGTGTCTGACGGATCTGGCTCGGGTAGGATGGGACATACATGAGATCGCTCGATTTGCTGGCCATAGAAGTGTACAGACCACACTGCTTTATATTCACCTCAGCGCGCGCGACCTTTCGGCTAAGTTTGCTGCAACCGCTGCGGAACTACACGAAGCGCGACTTGCCTCAATCATGGAGGTACCACAATGAGTATGTGCACTGATAAGTCCGCACCGTCTAGCAACGACTTCTGGGAAGCTACGCTCCCAACTGAACGATCGCCTGATCTGTCAATGCCTGAACACAATGCTTTGACCATTCTGGAGGAGCTGTTTGCTAAGCACCACACTCAACGACCGAAGAAGATCGCCTCCGATCTACAGCGACTTTTATTCCCTCTTGAAGCCGCCCTTGACCGCATAAAGACCGCAGCCGCTCCTAGAAACTCTGCATTGCGGGTAATGATGCAAGAGCTCATCATACGGCGGCGGGCATACTGGGCATGGTCTCAAAATGATTGGAAGGACGTGTTGGGCGCAACCGAAGGCGCGTTTCATAAGCGCAATGGCTGCCACGGTAATTGCCGACAGTATGTTATGATGATCGCCTATCGCCTAGGAGGCTTCGACCGCCTAGAAGACGTTGGAACCTTTTTCCAGTATCGATTAGCTTTGAAGGTGTTCGGTCGCAAAGATGTCGATGCGGTAGACGAGCGAGTACGGAATGAGATGCGTTCAATGGGCTTTCAGTCCAAAACGCCTCGTGGTCTGAGGTCTGCACTTTACTCCGCGTTGCTTTATCAAAGGTCGAGCCGTCTAGAGGATGTAACGCTCAAAACTTTGCGTCGCGTAGCGTCAAGCGATGTAAAAAGTATTCGCACGGGTGCGGCCTCTCTTTCGCGAGTATTGGAACGGCTAGGAACGATCGAACGAGGTTTTTGTCTTCGACAGGAAGAACGCCGACGCCCACCAGACGAACGTAAAGCCATCGAGGGCGTCCCTGACGAATGGCTGAGCTGGTGTGATCGATGGCGTGTAACTACTACCGCTGCTCCATCTTCGGTTACGAGTATTTATTATGGCATTCTCAAATGTGGTCGATGGCTTGGAGATCAGCATCCGGATATCCAATCACCAGCGCAATGGGATCGAGCACTTGCGCTCGAATATGCTGCCGTCGTCATGAAAATGCGGATAGGTGATTGGGCGACGCCGGCCGGTGGCTCTACGTGCAGAATCGGTGAGCCTATGAAACCATCAGCAATTGCCGCCAACCTCCGTTGCATAAGAGCCTTCTTTAACGACTTGCAGGGATGGGAATGGATTGCGCGCCGCTTCACTCCAGCACAAACGCTTGTGCCGTCTCGCAGCATATTGGCAAAAATTGGTCCGGCGCCTCGGGTTATTGCCGATGATGTCTGGGCAAAACTGATATGGGCTGCTTTGAACTTAACGAAAACGGATCTTCAAGATCGGTCAAAGCGTGTGGGATGGGCAGACCAGCCACCTCGTTATCCTCTCGCCATGGTACGTGCAGTGGCAACTTTATGGCTGTTCGGCGGTCTAAGGCGCGACGAGATCTACCGCATGCGAGTGGGTGGAATACGATGGTCACCCTCTGCGGACGGCGAGAAGACTTCTCAGACCTGTCTCTTAGATGTGCCAGTCAACAAAACTAGCCGTGCCTTCACCAAACCCGTGGACCCTATCGTCGGAACGGCAATCGAGAAATGGGAGGCCGAACGACTACCGCACCCATACTCTGTCGATGATAAAACCGGCGAGCGGGTCAACTGGCTTTTCGTTTATAAAGGACGCCGTATGGGTGCCGCCTACATCAATCGTGCCCTCATTCCACTCTTGTGCGCCAAGGGTGGTGTGCCTGAAAGAGACGCTCGCGGACGGATCACCAGCCACCGGGCTCGGGCAACAATCGCAACGCAACTCTTTAATGCGAAAGAGCCGCTGTCGCTATTCGAGCTCCAAGCTTGGCTCGGCCACACCTCGCCGCAAGCAACGCAGTATTACGCGGCCGTTACGCCTACGAAGCTAGCCCGGTCGTTCGAACAAGCTGGTTACTTCGAGCGCAACATACGCACGATCGAAGTGCTTATCGATCCGGCGGCGCGACAGACGAGCCCCGAAGGCTCTGCAGAACCTTGGCAGTATTATGACCTAGGGCACGGCTATTGCACTTATGATTTCTTTGATCAGTGCGCGCACCGTATGGCGTGTGCCAAATGCTCATTTTACGAACCAAAAGAGTCGGCGCGCATGCAGGCTCTGGAAGCGCAAGGTAACCTCAAAAAAATGTCTCAGAATATCCCCATGACCCCAACGGAACTAGAAGCTGTGAATGAGGGCGAGCGGTTGATGGCAGCCTTGGTCGCTGGCCTGGAACATGTGCAAACCCCAGACGGAAGGACGCGTCAGCAGATCGAAGCAAAGAACTCAGGAGTGCCCTGAGACGAAGACTCAATATGGACCCCTATTACAAAGTTAGGTTCAGGGCGGAAATATCCCAGTTGATTGATGTTTCAGACGTTATAAATTCCGAATTGTCTTTCGATTAAACGCACGTCCCCCATGATGCCGCCTTGGAACAAGAAGAATTTGGCCTGTCTTTTTTTAGGGCTCGCATGACCTCGAAACCTTTCACGGTCGCATAAGCAGTTTTCAGGCTTTTGAAGCCCCGAACCGGTTTGATCAGTTGTTAAGTCTGCCGCGATTAGCTTCGATCACGCTGTTTAGATATTTTGAACTGCCCCGGGCCTTCTGGAGACGTTTTTATCGGAATTAAGCGGCTAAAGCATGCGTTAAATTCTATTGGTAGAGGCGTGCTTCAGCCTCTGCGGGTGTGATATTACCGATGGACGAGAGAATACGGCGATTATTGAACCAATCGACCCATTTGAGGGTTGCAAGTTCAACGTCGCGTTTTCCTTTCCATGGCCCTTGCCGATAGATCAGCTCGGTTTTGTAGAGACCGTTAATCGTCTCTGCGAGCGCATTGTCATAGGAATCCCCGACACTGCCTACGGATGCGACAAGCCCTGCTTCGGCAAGGCGTTGTGTGTAGCGCAGACTGGCATATTGACTGCCGCGATCGGAGTGATAGGTGATTGAGAACCTACGGGATTTCGCGTGGCTAATACTTGTTCCAAAGCATCAAGAACAAACGCGGTTTCAGCACTGGACGAGACGCGCCAGCCAACGATCACACGGGCAAACACATCAATGATAACAGCAACATAAACAAAGTCTTGCCGTGTAGAGACGTAGGCAAAGTTAGACACCCATAATTGGTTTGGGGCAGGAGCCCAAAACTGGCGGTGTACCAGATCCTCTGGGCAGAGGTCATGCGGCGTCAGGCCGCGTGGTTTTGAACTATTTTCCGCGCACAGCCCCCTTGAGGTCCATGCGCTTCATTAAACAACGTGCCACCGTGATTTTCTCACGTTTGAGTTGATGCCAGACTTTACGAGCCCCATACACACAGAAATTCTCGGTCCAAATTTGTCGAATATGCGTGCACAGTGTTTCATCCCTTTGAGCGGTGTGTTGTTATTGCCGTTACGAGATCAGCGATCGCGACCTTGAAAGCATGTTAGCTGAGCGCGGCGTGAGTGTTGATCATTCAAGGATTTACCGCCGGGTCCAGCGCTATGCGCCGGAGATGGAAAAGCGCCTTCAGTGGTATTGGAAACGTTCAGGATTTTCCAGCAACTGGCGGATGGATGAGACTTACATCAAGGTCAAAGGACAATGGGCGTATCTATCCCGTGCTGTCGGTAAAAACGATGATACGATTGATTTCTTATTCTCACCAACCCAAAACGCCAAGGCGGCCAAACGCTTTTCGGGCAAAGCTTTGAACGGTCTGAGGGGGGATAAAAGCCCGAAAACCGCTTATGCAACCATTAAAGACTTCGAGGTCATGAAAACTCTGAAAAAGGGACAAGCTAAACTCTTCCAGTTCCAAGAGGCTATCATGGGAGAAGTGCGTCTCATCGAGAGCTAATTCAGCTTCTAAAACGTTTTAATATCAAAACAAATGAGATTCACTCAACCCAAACCCAACTTTGCAACAAAGTCCTCTGACATGCTGGACCGCAGGCCGCAGTCGCTTTTCTTCAGACGGGCATAAGAACTCTTTCGTCTCCCTAAAGAGAACGACATCGCTCAATTACTTAGTATACAGCCTAGATTAGTGCGTCGATCTCTGAGAGCCGGAAAACTCGAATCACGGTGGGAGACGATGACACAAAGCAACTTCACGAAAAGATGACCGTTCTCTTGCTGTAAGTCCCGCGTGTAGTAAGGCGGCGCTGTAAGAGGAGAAACCTTTGGGAACGATCGACGTCTCAAACGCTACGCAGAATAATCTGCGCGGGGTCTCGATTTCCATTCCTGTAGGAAAAATCACGGCATTCACGGGCGTCTCGGGTTCGGGAAAATCCTCACTCGTTTTTGGCGTGCTGGCTGCCGAATCCCAGCGGCAGCTGAATTTCACTTATCCCACTTATGTCCGAAATCGGCTGCCCCATGGAGGCACACCTCAGGTCGATCATATCTCTGGCCTGTCGACCGCCATCATTATCGACCAAAAACCTCTTGGCACGAACCGTCGCTCGACCGTCGGTACCGCGACGGACGCGGCACCGCTGCTGCGCCTAATCGTTTCGCGCATGGGGTCACCGTTCGTCGGCTATTCCCACGTCTTTTCCTTCAACGACAAGGCCGGAATGTGCCCACGATGTGAGGGGTTAGGTGAGACAATTGATCTGGACGAAAGCGAACTAATCGATTTTAACAAGTCGATCAATGAGGGTGCGTTTACTTTCAGCGGCTATGCAGTCGGCACTTGGTACTGGAAATGGTACAAACGCACAGGCTTGTTTGATGCCGATCGTAAGTTATGCGACTATGCCGCGGAAGATCTCGACCTGCTGCTGTATGCCGAGCCAAAGCCACTGAAAAACCCGCCTCCTGACTGGTATGCGACCGCCAAATATGAAGGCGTGGTGCATCGTTTCCGTCGCATGTATCTTGGTAGTAGGCAAACCACGCATAAAGGTCAGATCGCTCGCGATCTGGAACGCATTATTCATCGCACGACATGTCCGGTGTGCAAGGGCGCTAGGCTAAACGCGGCGGCGCTGAGTTGCAAGATCGATGGGCGGAATATCGACGATCTCAGTCGCTTGCCGATTTCGGAGCTGTTGGAGATCGTGGAGGCTTGGCGGGTGCCCGAACTATCGCCCGCCATCGAAAACCTGATCAGTCAGCTGCGCACTCTGGTGGAACTAGGCTTAGGATACCTACAGCTTGGCCGTACGACGCCGACCATCTCGGGCGGAGAAGGGCAGCGGATCAAGATGGTCCGCCATCTCGGTAGCTCGCTGACACGATTTACCTATATTCTCGACGAACCGTCAACCGGTCTGCATCCGCGCGACGTGCGGCACCTTGCTATGATTATTCGCCGCCTACGCGACAAGGGCAATACGGTTTTACTGGTCGAACATGATCCCGACCTGATCGATATCGCCGACTTTATCGTGGACATGGGGCCCGGTCCAGGCGACGAGGGTGGTAACGTGCTGTTCGCCGGGAGACTTGAGGACCTAAAAGCCTCCACGACGCCAACCGGTCTGTATCTGCGCGCATCCCGGAAATTCTCTGCCCCACGCAGACCGCATGGCGACCTGATCCGGGTGCGCAACGCGCGCCGCCATAATTTATGTGGCGTCGATGTGGATATTCCTCGTGGCCTGCTGACAGCCCTGACCGGTGTCGCGGGTTCGGGAAAGAGCTCGCTGGTACATGAGATTTTGGCAGCAGCGCCATCCGCTCAGCTGATCGATCAAAGCCCTCTCCGCGGGAGCATTGCGTCAAGTATCGCGACATACACGGCTGCCATGGATCGTATCCGTGATCTCATGGCGCGGGCCAATCGGGTATCGCGGGGTTGGTTCACCAGTGCTGGTAAAGGGGCGTGCCCGGTGTGCAAGGGGCGCGGCGTGATCGTTACCGAACTTGCATTTCTCGATAGTACCGAGACCGATTGCGAGGCCTGTAGCGGCAGTGGTTTCAACCAGACGGCACTTGGCTTTCACTTTGGGGGCTATAACATAGCCGATATTCTAAGCATGTCAGCCAAGCGAGCGGCTGACTTTCTTGCTCCGCACGCTCCTGATGCTGCAATAGTCCTCGGCCGATTGACGCGAGTAGGCCTCGGACATCTTGCAATCGGACGCTCGACGGCGAGCCTTTCGGGAGGCGAGAGACAGCGGCTGAAACTTTCTTCCTTACTTGATGACGACATCGACACACTTATTCTAGACGAACCAACGACCGGGCTGCATGGTTCGGACGTCACACGTTTACTGGCTCTGTTCCAGGGCCTCGTGGGGCAGGGAAAAACGGTGGTGATGGTTGAACATAATTTGGATGCTATGCTTGCCGCAGACTGGATCGTGGATTTAGGGCCAGAGGCCGGCAGCGGTGGTGGTCAGGTTATGTATACAGGCCCTGCGTCAAAAATATTGAAAGCAAAGCGCACCGCCACCGGAGATGAGCTGCGCCGTTATATCGACATTGATGCGCAGCGCAGGAAAACCGCTTAAAAAAAAGCAGCCAGATTTATTTGAAATCGCGGGGCAACGGCACGCGCACGTCTGTGTCATCGAAGAGCAGTACAGATACTGCCCCCCCAAACCTTAGCGTGCGTTTTCGAACGAAATCTCACCGTTCCCCCAAATGGCCTGCCGCTCGATTTCGCGCAACACAGAACGCAGATTATACGGCTTGCTGACAAAAAAGGCCCGTTCCGGGAGCGATGAATCGTCAGGCTTTAGCATCCCAGAGGCAATCACTAGGACTGTCAGAGGACAGCAATTGCGGACGTGATGGGCAAGCCTAATTCCATCCATCGGGCCAGGCAAATGTACGTCAGTCAAGACCGCTTGAATGATCCCATTGGTCGCTAGTATCTGGATGGCCTCGTCGGCGTTTTCAGCTTCGAAAACGTGAAAACCGCGATCCCTAAAAAAGTCTACAAGGTCGAAGCGAATAAACGGTTCGTCTTCGACGATGAGAATCGTGCGTAATGTCAACGTCGGTATCATTTCAACTGGAAGTTACTGCGCCGGCATCGCCAAGTGGCGCGTCGAGGCGGAATTCAACGCCGCTAGGCGGATAGATCAGCTCCACCTTACCACGAAAATAGGACCGCAATGCGCGTTCGATCATGCGAGAGCCGAACCCTTTCCGCGTCGGGGGCGACACAACTGGTCCATTCAGCTCGCGCCATAGCAGCGAAAAGTGATAACCGCCCTCTGAGTCCGTCAACTGCCAGTTTAATATCACCGAGCCTATCTCCTGACTTAATGCTCCGTATTTGTAGGCATTCGTCGTCAATTCATGAAGCGCAAGTGTTAACGCCATCGCGGCTTGCGGGTTCAGGTCGATCCCCGGCCCTGCGACCGAAATACGCCCTCTAACGCCGGGAAAAGCAGATAGACCAGCCTCAACAATGTTGTGCATGCTGGCCTTATGCCAGAAACATGCCGTCAGCGCATCGGTGGCGCGACCGAGTGATACGAGCCGTGATGAGAAGGCGGCTGAGGCTTCTTCCACGCTGTCGGCGTCGCGCAACGTCTGGTTGGCGATCGCCTGGACCAGCGTCAGCACATTCTTGAGCCGATGGCTCAGTTCGCCGTTGAGTAATCGTTGTTGTTCTCGAGCCTGCTCCAGTGCCGTATGATCCCGTGAAACGGACAGGATACGCGCGACCTCGCCATCCGGGCCTGGGATCGGTGACACAGAAATGGACCAGTGTTTGGCCGTACCTAGAAAGGTGTCGGCGGGTGCTTCGAAATGTGTCGTTTCCCCGCGTTTCGCAGCCTCAATTGCCTCGCGGGCATAAGCAGGCCCTTCCATTCTGAGGAGTTGCGGCCAAGGGCACCCTCGCACAGCGTTGAAATCGCCAATCTCCATTACCTTCATGCCACCGTCACTCATGAAAGATAATGTGCCGTCCAGCTCCACCACCTTAATGCAATCGGTCGAGGCGGCGAGCACGCTCTGCAAAAAGACGGCGGTTTCGCGCGCCTCGGCCTCGACACGTCGGCGCTCTAGGGACGCCCTTGTTCGGTTGGCGGCATCGATAACGAATTCCACTTCCCCGGGCGTCCACTTACGTGGCCGGTCATCGTTAACGTAGAGAACCGCGACTGTCCGTCCCTTTTCCATAATTGGCCGATTGATCAAACTGCGCACCGCGACACTTTCCAAAGATTCAGTATCGGCAAACGTGCGTGCATCCTGGCGAATATCGTGAATGACGACGGTACGGCCTTGCCGAAGATCTTCGGCGTAGCCGCCATACTCGTCCATGCGGTACGTGCCTGCGAGAGTTGGATAGCCTTCAACCGTCCAATCACTAGAAACCGTAAGGGTTTCGCCATCGGCCCCGATCGTACCGTAACCGACACGCCTCACCGCAAGCGTCCGCCCCACGATTGCCGCGACCGCCCGCAACACGTCTTCCGGCTTGTCTAGAGCCTCTAGAGCGCCACCGAGTTCCACGAGCGCTTCACGACGTTGTTCTGAGCGAACGCGATTAGTCACCTCAAGAAAGATGACCGTAAACCTGTCGCCACCGGCGGGCTGACATACGCCATCATACCAGCGTTGGAGATTACCAACTTGACGTGTGAAGCGGATTGCCTCACCGGTCTCCACGACGCGGGCGAACTCGGACACCCACTCGTCCTCGATACTAGTGAAAATCTCCCGGATCGTCCGACCGACAGCACAGCCCTGCGCAACATCCATCAGATCGTACCAGGCATTGTTCACTTCCTGATGGCGCCAGTCGACGATGCGCCCGCACTCAGCGCGAACGACCTCACCAAGAATGAATCCCTCTTCAAGAGTCTCGAATAGGCCCCGCCATTTTGCCTCGTTGGCGGCAAGTTCGGCAGCCACGGCGTCGCGCTCACGGTACGCCCGGACGCGAGCCGTGGTGTCGGCCGTAATGTTGAGAAGGCCGACCACCTCACCGTCGTCGCCACGAACCGGCGAATAGGCGAAGTCATACCACGTCTCTTCGACGAAACCGTTGCGCGTCATGGCGAGCGACTGATCGTGAAAAGCGATGCTCTTTCCCGCGAAAGTGCGATCAACCAAGGGTTTGATGTCATCCCACAGCTCCGGCCATACCTCGGCCGTAACCTTCCCAAGCGCGCTGGGGTGGCGCGCACCGAGGATGGGAGCGTACCCGTCATTGTAAAGTAAAATGCGTTTAGGCCCCCAGATCAGGCACATTGGATGCAGGGAGGCAAGCATCGTATCGACCGTGATCCGAAGTGTGTCAGACCAGGCTATGCGCGCGCCCAACGGGCTTGTGCTCCAATCGTATTCGGCAATCAGATCGAACATATTAGGCACGACGTCAGCATCGCTCCCAGCCAAATTAAGACTTCTTTCCATACGAACTTGTCGCTTCTTCGACCTCGACGTTGAACCTTTCCGCCCTTTGGCGACTTGAAACGTTTACATAACAAAATTGCCCGATAATCCGAAATGTAGCCCGCTGCCAATACAACATTTGTTGTTAATAGCGGCGGACAGCTTTTTGGCGAAGCCGTCCTTCAGGATGCCGTTATACGCGTTAATGAAGGCCCTATTGCAAAGTTAGGGTCAGGACAGTTGATTGATGTTTCAGACGTTGTAAATTCCAAATTCCCTTTCAATCAGACGCACTTCTCTCATGCTACCCCCTTGGCACTGGAAGAGTTCAGCGTGTCCCTTTTTTAAGGTGCGCGTAACCTCAAAATCTTTGATCGTCGCATAAGCGGTTTTTAGGCTTTTGAAGCCACGGACCGGCTTGAGCAGCTGTTTGAGTTTGCAGTGTTTTAAAGAAGCTTTCTGTCACGGCGTTATCCCAACAATTGCCCTTGCGGCTCATGGAAACGGTGAAGCCATGCGCAGCAAGCCGTTTACGGTACTCTTAGGAACAATACTGGGATCCGCGATCGGCATAGTGGACACAGCCGGATGTTGGCTGTCGCACAGCAATGGCGCACTCCAAAGCGTTCAGCACCAGCGTCGTCGTCATGCGTGCTCCCGCACTCCACCCGATGACACGACGCGAGAAGAGATCGAGCACCACAGCAAGGTAAACCCAGCCTTCACGCGTCCAAATATACGTCAGGCGTCCCTGTTGCAA
Protein sequences of DBSCAN-SWA_3 >NZ_CP032485|2394719:2405786|2402271_2402679_-|WP_141493662.1|DBSCAN-SWA MIPTLTLRTILIVEDEPFIRFDLVDFFRDRGFHVFEAENADEAIQILATNGIIQAVLTDVHLPGPMDGIRLAHHVRNCCPLTVLVIASGMLKPDDSSLPERAFFVSKPYNLRSVLREIERQAIWGNGEISFENAR >NZ_CP032485|2394719:2405786|2399895_2402178_+|WP_141493661.1|DBSCAN-SWA MGTIDVSNATQNNLRGVSISIPVGKITAFTGVSGSGKSSLVFGVLAAESQRQLNFTYPTYVRNRLPHGGTPQVDHISGLSTAIIIDQKPLGTNRRSTVGTATDAAPLLRLIVSRMGSPFVGYSHVFSFNDKAGMCPRCEGLGETIDLDESELIDFNKSINEGAFTFSGYAVGTWYWKWYKRTGLFDADRKLCDYAAEDLDLLLYAEPKPLKNPPPDWYATAKYEGVVHRFRRMYLGSRQTTHKGQIARDLERIIHRTTCPVCKGARLNAAALSCKIDGRNIDDLSRLPISELLEIVEAWRVPELSPAIENLISQLRTLVELGLGYLQLGRTTPTISGGEGQRIKMVRHLGSSLTRFTYILDEPSTGLHPRDVRHLAMIIRRLRDKGNTVLLVEHDPDLIDIADFIVDMGPGPGDEGGNVLFAGRLEDLKASTTPTGLYLRASRKFSAPRRPHGDLIRVRNARRHNLCGVDVDIPRGLLTALTGVAGSGKSSLVHEILAAAPSAQLIDQSPLRGSIASSIATYTAAMDRIRDLMARANRVSRGWFTSAGKGACPVCKGRGVIVTELAFLDSTETDCEACSGSGFNQTALGFHFGGYNIADILSMSAKRAADFLAPHAPDAAIVLGRLTRVGLGHLAIGRSTASLSGGERQRLKLSSLLDDDIDTLILDEPTTGLHGSDVTRLLALFQGLVGQGKTVVMVEHNLDAMLAADWIVDLGPEAGSGGGQVMYTGPASKILKAKRTATGDELRRYIDIDAQRRKTA >NZ_CP032485|2394719:2405786|2398936_2399086_-|WP_141493958.1|transposase|DBSCAN-SWA MCTHIRQIWTENFCVYGARKVWHQLKREKITVARCLMKRMDLKGAVRGK >NZ_CP032485|2394719:2405786|2405567_2405786_-|WP_141493664.1|integrase,transposase|DBSCAN-SWA MQQGRLTYIWTREGWVYLAVVLDLFSRRVIGWSAGARMTTTLVLNALECAIAVRQPTSGCVHYADRGSQYCS >NZ_CP032485|2394719:2405786|2399105_2399570_+|WP_141493660.1|integrase,transposase|DBSCAN-SWA MCCYCRYEISDRDLESMLAERGVSVDHSRIYRRVQRYAPEMEKRLQWYWKRSGFSSNWRMDETYIKVKGQWAYLSRAVGKNDDTIDFLFSPTQNAKAAKRFSGKALNGLRGDKSPKTAYATIKDFEVMKTLKKGQAKLFQFQEAIMGEVRLIES >NZ_CP032485|2394719:2405786|2395756_2398015_+|WP_141493658.1|integrase|DBSCAN-SWA MSMCTDKSAPSSNDFWEATLPTERSPDLSMPEHNALTILEELFAKHHTQRPKKIASDLQRLLFPLEAALDRIKTAAAPRNSALRVMMQELIIRRRAYWAWSQNDWKDVLGATEGAFHKRNGCHGNCRQYVMMIAYRLGGFDRLEDVGTFFQYRLALKVFGRKDVDAVDERVRNEMRSMGFQSKTPRGLRSALYSALLYQRSSRLEDVTLKTLRRVASSDVKSIRTGAASLSRVLERLGTIERGFCLRQEERRRPPDERKAIEGVPDEWLSWCDRWRVTTTAAPSSVTSIYYGILKCGRWLGDQHPDIQSPAQWDRALALEYAAVVMKMRIGDWATPAGGSTCRIGEPMKPSAIAANLRCIRAFFNDLQGWEWIARRFTPAQTLVPSRSILAKIGPAPRVIADDVWAKLIWAALNLTKTDLQDRSKRVGWADQPPRYPLAMVRAVATLWLFGGLRRDEIYRMRVGGIRWSPSADGEKTSQTCLLDVPVNKTSRAFTKPVDPIVGTAIEKWEAERLPHPYSVDDKTGERVNWLFVYKGRRMGAAYINRALIPLLCAKGGVPERDARGRITSHRARATIATQLFNAKEPLSLFELQAWLGHTSPQATQYYAAVTPTKLARSFEQAGYFERNIRTIEVLIDPAARQTSPEGSAEPWQYYDLGHGYCTYDFFDQCAHRMACAKCSFYEPKESARMQALEAQGNLKKMSQNIPMTPTELEAVNEGERLMAALVAGLEHVQTPDGRTRQQIEAKNSGVP >NZ_CP032485|2394719:2405786|2402680_2405047_-|WP_141493663.1|DBSCAN-SWA MERSLNLAGSDADVVPNMFDLIAEYDWSTSPLGARIAWSDTLRITVDTMLASLHPMCLIWGPKRILLYNDGYAPILGARHPSALGKVTAEVWPELWDDIKPLVDRTFAGKSIAFHDQSLAMTRNGFVEETWYDFAYSPVRGDDGEVVGLLNITADTTARVRAYRERDAVAAELAANEAKWRGLFETLEEGFILGEVVRAECGRIVDWRHQEVNNAWYDLMDVAQGCAVGRTIREIFTSIEDEWVSEFARVVETGEAIRFTRQVGNLQRWYDGVCQPAGGDRFTVIFLEVTNRVRSEQRREALVELGGALEALDKPEDVLRAVAAIVGRTLAVRRVGYGTIGADGETLTVSSDWTVEGYPTLAGTYRMDEYGGYAEDLRQGRTVVIHDIRQDARTFADTESLESVAVRSLINRPIMEKGRTVAVLYVNDDRPRKWTPGEVEFVIDAANRTRASLERRRVEAEARETAVFLQSVLAASTDCIKVVELDGTLSFMSDGGMKVMEIGDFNAVRGCPWPQLLRMEGPAYAREAIEAAKRGETTHFEAPADTFLGTAKHWSISVSPIPGPDGEVARILSVSRDHTALEQAREQQRLLNGELSHRLKNVLTLVQAIANQTLRDADSVEEASAAFSSRLVSLGRATDALTACFWHKASMHNIVEAGLSAFPGVRGRISVAGPGIDLNPQAAMALTLALHELTTNAYKYGALSQEIGSVILNWQLTDSEGGYHFSLLWRELNGPVVSPPTRKGFGSRMIERALRSYFRGKVELIYPPSGVEFRLDAPLGDAGAVTSS >NZ_CP032485|2394719:2405786|2394719_2395760_+|WP_141493657.1|integrase|DBSCAN-SWA MSQKTVHQAEAQIESWIALQRDLGKAPNTVIAYGRGVRDFAAFCCMESFDLLEARKGTIAAYLHHLRSRPPRGGNAGDAGHACATSPIMANASLRHRLTIIRLFYDHLVEEGLRGTNPVPRGLRGAIDGSRHQGARGLVPVHRSLPWIPSEEQWLTILEAARSEPIRNRLMLAFAYDCALRREELCALETSDIDPAQKLITIRAETTKTKAGRVVPYSAVTGELLTVYLRERRALSRSRGPLFLSTSPRNLTMPITKWTWSKVVRALGVRAGEPKLSTHTMRHLCLTDLARVGWDIHEIARFAGHRSVQTTLLYIHLSARDLSAKFAATAAELHEARLASIMEVPQ >NZ_CP032485|2394719:2405786|2398372_2398702_-|WP_141493957.1|transposase|DBSCAN-SWA MSHAKSRRFSITYHSDRGSQYASLRYTQRLAEAGLVASVGSVGDSYDNALAETINGLYKTELIYRQGPWKGKRDVELATLKWVDWFNNRRILSSIGNITPAEAEARLYQ >NZ_CP032485|2394719:2405786|2398584_2398845_-|WP_170211090.1|DBSCAN-SWA MSNFAYVSTRQDFVYVAVIIDVFARVIVGWRVSSSAETAFVLDALEQVLATRNPVGSQSPITPIAAVNMPVCATHNALPKQGLSHP |
10 | Burkholderia_virus(33.33%) | integrase,transposase | attL 2399163:2399179|attR 2414749:2414765 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP032486_1 | 79849-80413 | Unclear |
NA
Consensus repeat of NZ_CP032486_1
|
9 spacers
spacers of NZ_CP032486_1
>1.1|79877|32|NZ_CP032486|CRISPRCasFinder,CRT TTCAAGATCGCCGGAATTGGGACGAGCAGCAC >1.2|79937|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR TCGGATACGCCGCTAAAAGTTTTAAAGCTATA >1.3|79997|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR GATTGAAGATCAGGGCGTTTTACAGTCTGGCA >1.4|80057|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR TTAAGTCTGGAGCGGTGCTGTCCTTACACATT >1.5|80117|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR AAACAGAGACATCAGACGGCAAAGGTTCAATC >1.6|80177|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR AAAATTCAGGAAATAAACAATGACAGCCCCTA >1.7|80237|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR AATAATAATAGCTCTTTTACGCCAGTTCTGTA >1.8|80297|29|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR TCAACAGACGGGGCAGGAGGCGGGCCAAC >1.9|80354|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR TGAACATCAGCGGACAATTCGTTGGCGAACTG |
cas3f,cas1,cas7f,cas5f |
CRISPR arrays and Neighbor proteins around NZ_CP032486_1
The CRISPR arrays of NZ_CP032486_1 >merge|NZ_CP032486|1|79849-80413|CRISPRCasFinder,CRT,PILER-CR GCTCCCCGCCGGGTAGGCGGCTGAGAGATTCAAGATCGCCGGAATTGGGACGAGCAGCACGCTCCCCGCCGGGTAGGCGGCTGAGAGATCGGATACGCCGCTAAAAGTTTTAAAGCTATAGCTCCCCGCCGGGTAGGCGGCTGAGAGAGATTGAAGATCAGGGCGTTTTACAGTCTGGCAGCTCCCCGCCGGGTAGGCGGCTGAGAGATTAAGTCTGGAGCGGTGCTGTCCTTACACATTGCTCCCCGCCGGGTAGGCGGCTGAGAGAAAACAGAGACATCAGACGGCAAAGGTTCAATCGCTCCCCGCCGGGTAGGCGGCTGAGAGAAAAATTCAGGAAATAAACAATGACAGCCCCTAGCTCCCCGCCGGGTAGGCGGCTGAGAGAAATAATAATAGCTCTTTTACGCCAGTTCTGTAGCTCCCCGCCGGGTAGGCGGCTGAGAGATCAACAGACGGGGCAGGAGGCGGGCCAACGCTTCCCGCCGGGTAGGCGGCTGAGAGATGAACATCAGCGGACAATTCGTTGGCGAACTGGCTCCCCGCCGGGTAGGCGGCTGAGAGA >NZ_CP032486|1|1|79849-80413|CRISPRCasFinder GCTCCCCGCCGGGTAGGCGGCTGAGAGA TTCAAGATCGCCGGAATTGGGACGAGCAGCAC GCTCCCCGCCGGGTAGGCGGCTGAGAGA TCGGATACGCCGCTAAAAGTTTTAAAGCTATA GCTCCCCGCCGGGTAGGCGGCTGAGAGA GATTGAAGATCAGGGCGTTTTACAGTCTGGCA GCTCCCCGCCGGGTAGGCGGCTGAGAGA TTAAGTCTGGAGCGGTGCTGTCCTTACACATT GCTCCCCGCCGGGTAGGCGGCTGAGAGA AAACAGAGACATCAGACGGCAAAGGTTCAATC GCTCCCCGCCGGGTAGGCGGCTGAGAGA AAAATTCAGGAAATAAACAATGACAGCCCCTA GCTCCCCGCCGGGTAGGCGGCTGAGAGA AATAATAATAGCTCTTTTACGCCAGTTCTGTA GCTCCCCGCCGGGTAGGCGGCTGAGAGA TCAACAGACGGGGCAGGAGGCGGGCCAAC GCTTCCCGCCGGGTAGGCGGCTGAGAGA TGAACATCAGCGGACAATTCGTTGGCGAACTG GCTCCCCGCCGGGTAGGCGGCTGAGAGA >NZ_CP032486|1|1|79849-80413|CRT GCTCCCCGCCGGGTAGGCGGCTGAGAGA TTCAAGATCGCCGGAATTGGGACGAGCAGCAC GCTCCCCGCCGGGTAGGCGGCTGAGAGA TCGGATACGCCGCTAAAAGTTTTAAAGCTATA GCTCCCCGCCGGGTAGGCGGCTGAGAGA GATTGAAGATCAGGGCGTTTTACAGTCTGGCA GCTCCCCGCCGGGTAGGCGGCTGAGAGA TTAAGTCTGGAGCGGTGCTGTCCTTACACATT GCTCCCCGCCGGGTAGGCGGCTGAGAGA AAACAGAGACATCAGACGGCAAAGGTTCAATC GCTCCCCGCCGGGTAGGCGGCTGAGAGA AAAATTCAGGAAATAAACAATGACAGCCCCTA GCTCCCCGCCGGGTAGGCGGCTGAGAGA AATAATAATAGCTCTTTTACGCCAGTTCTGTA GCTCCCCGCCGGGTAGGCGGCTGAGAGA TCAACAGACGGGGCAGGAGGCGGGCCAAC GCTTCCCGCCGGGTAGGCGGCTGAGAGA TGAACATCAGCGGACAATTCGTTGGCGAACTG GCTCCCCGCCGGGTAGGCGGCTGAGAGA >NZ_CP032486|1|1|79909-80413|PILER-CR GCTCCCCGCCGGGTAGGCGGCTGAGAGA TCGGATACGCCGCTAAAAGTTTTAAAGCTATA GCTCCCCGCCGGGTAGGCGGCTGAGAGA GATTGAAGATCAGGGCGTTTTACAGTCTGGCA GCTCCCCGCCGGGTAGGCGGCTGAGAGA TTAAGTCTGGAGCGGTGCTGTCCTTACACATT GCTCCCCGCCGGGTAGGCGGCTGAGAGA AAACAGAGACATCAGACGGCAAAGGTTCAATC GCTCCCCGCCGGGTAGGCGGCTGAGAGA AAAATTCAGGAAATAAACAATGACAGCCCCTA GCTCCCCGCCGGGTAGGCGGCTGAGAGA AATAATAATAGCTCTTTTACGCCAGTTCTGTA GCTCCCCGCCGGGTAGGCGGCTGAGAGA TCAACAGACGGGGCAGGAGGCGGGCCAAC GCTTCCCGCCGGGTAGGCGGCTGAGAGA TGAACATCAGCGGACAATTCGTTGGCGAACTG GCTCCCCGCCGGGTAGGCGGCTGAGAGA
>NZ_CP032486.1|WP_141494024.1|76406_79724_+|CRISPR-associated-endonuclease-Cas3'' MREIIAVCRSEKKSRERVARILDRYFWRIGDRTWRGKATNACLNRVSRELRKTATRNTAVVIHEIRSAASSRKPLIRIGAQHAFSEEGAVPVATHPAAFSNRKSSLASRETAEAILRIAALFHDLGKATALFQKKLDRAIKKGPPESDAVRHELFSAAVWDDLFGTVTDEKLPHRIKNLTPGEIDASCLRVQDLLGKLHRAPDRNLGLAFTQKADHLTYHIGMLVLTHHRLPTANRDCKTLLAEQHVNQNGALTPHVDLAIAPGTPFWHENWWLTALNREADRLRLGPPIASADMAIRASLMLADHLGSAMKTSREISDGHLANTIRAEAENLILRADSLSRHTKRVYRYARFSFDATHHQRDRYPALSEDAMPPEVAFPQLSCDTRFAWQSTAAQAAKKICSENEGGFFAAIIAGTGTGKTRGAPTILAAASLGDHVPERRYFRMSLGLGLRVLATQSAKEYVKDLNFGQKDVSVSVGAPPLEFGEDLPDSTDESGSESLVNLPEWLRIQQADGPVPEEGAEQEEDWLRGLSLDTGRSLPAFVDIFLDLAGDRNGNGRRFLNAPVLVSTIDYLMPVATPTSARFLLPALRVLSSDLILDEIDQYDGEDLAAIGRLVFQSGAAGRRVIIMSATLTKEIVETLHSSYHAGWKEYARSLGLSERCNLLICGDNPVSLSVNADDEPCDVVFERCCDAILRGFRTAPALRRGKVLPPVPTWEGLITTVDEACSAAHNENAIDIDGVRVSVGLVRMTRISHTADLARRLNAGDIGGRLRVLLCLHSQMTRLVRGWIESRLKQALTRKGDFPEAGVKALCDEVRLFERARKICAKDIEIVVITSPVIETGNDLDFDYAIIDPISTRAIIQTAGRVRRHRPPLGSTPNVWILGQSPIALDTGKLKMPGVETELSGDTQVPRSSLLDGYKGRLFSEIVGNNVFERIDAVAILDDTVSFPLKQAEAELCRKMLSISSDAPLGRYISSSTARWNTKVSQMRKFRRTNQREIRFVLKGETLSDAVWYVDMAPGTRHSVLREAGDKMRSTSLRSPEAVLLADTIERAWRAYAQNSREISSLELASLMSVGLPVWDNKTCVDVTMDDLSGLTRGSFED >NZ_CP032486.1|WP_170211112.1|75430_76396_+|type-I-F-CRISPR-associated-endonuclease-Cas1 MQIKGGPRLLMTDRESAVYLEHARIHVEGERVVYHIDDDENRREFNIPHVNLAVLFIGQGTSITQGAMRLLGEEGVHLAVTGSGGTPLHMGSLTSYTATRHFRELLPIYLSEEKSLKAAISVMQDRTLRMRKLGGKGATKVLRARETMGLSKKCASFEENLKTCASIQQLLGFEGQFTKACYAEFSSIAALPKDSPFRRDAGAGEATKGPTDRVKLINRLIDHGNYLCYGMAGAALWALGIPPHMSIFHGKTRAGGLVFDIADSFKDALVLPLAFAPYKEKSVSEAEKTFRARLIEAFNDHAILKEAIATIDKLILAANLT >NZ_CP032486.1|WP_170211111.1|74790_75441_+|hypothetical-protein MDTLNEGKGRCLATSRLLIRPVRAVIELGAAGDVLSGAIRAVHHANRHGSVEDFIAIAFPTMRMGRETMLSGDDIELIGSDASLGRFLELEGIITLKRRGMLEDTSLDQVYADEGMIGAAYVRDRAVEKHTPGWIRRTEARAARRGKPLGKAVKQRENDLKSLVLTHGTTVLHIREVVGPFTDRPLHVSTYGFSGSGDPAILPVFPESARTVDNAD >NZ_CP032486.1|WP_141494022.1|73708_74791_+|type-I-F-CRISPR-associated-protein-Csy3 MAKTSAPLSLETGMLAFARSLQITEGLFYATRKADSAIAAPIEILEKGVRGQSSEDKAKNPGLSNPQSVEYAIVPQGHDGVRLTFSIRFMPFSRAPHACNNTDVGSAYERLATAYRAADGYKVLAGLYLWNIANARFAWRNRFQSDAMSVSITTSGGTRLKFDPFKLSLTEPASAVELSAALIGGNTADIEKVIDGIAFGLANEDHEAYTVNVSWDAEMEAGQEVFPSQEYVREEKAIANLSRVFAKLPTKWGGRSLMQASMHSQKIGAALRCIDIWHGDEDETKPIPVNPYGGVQETGAVLRNSKTKRSFYDLRKNGETLIDGIESATTVSEISGDAHFVMANLIRGGVFGSSSKKAEG >NZ_CP032486.1|WP_141494021.1|72867_73716_+|hypothetical-protein MPELRNLFVLRDIQASRVNLIMNDYAAGLPSPLSFLGLGDFLARRLKLKPWSASVLPILHAVRVSEGRTKPEMENKSSVFAPIETMEDLVGSVTVSLLINLPGCESENALARALTGCRIAGGIIQNNDVKVQALTPDGSAFRSLRRGYAMLAPEQNERRVISKGDNDSLTEIATLLFPVERPAGFGWIVPAAVGYHLLEDPEHAPKRTRTRSKDIPHVFAEPVLGIAELVSVRNRRLTELSPEAFASAFWHWDAREDMLLGHSAYFSNNKLNDVTKEVLNHG >NZ_CP032486.1|WP_141494020.1|71710_72868_+|hypothetical-protein MTDGAEEETEKSTQAVKTEQLLSFKTSGRNIAPNALSAEFSLMKGCLKSPLTPKVALRLADKSGNYLLKLVSPRTDGDNAILRIEVPRGNKLGDILPSTNAPDIPFCKVIFRPLEIDGYPPRSVVDLIRNPEDHTTIVLDAFAEVFGTDVLETLKTSLLTVLPAPTQLGIGEFPIIFVPRPDGQDLQITPVSPAAAFMGMKRVRKHYFQKTQPDRPMPRSKWTEQAVSAKPQNISGAIGGPRVRFRADMPTQLSHEEADLFRFAQGGSFPLWRDSAVAARILRYGDRLTSDNEFNNKNTRAALNQVADDLISDAVEFIQDTLRDTVDYAKRQGIAEKHSALPSLPQLLLKRRWKNTDEEDKARKALTSPHFELRLAKSRMAAKGL >NZ_CP032486.1|WP_141494019.1|70060_71095_-|AAA-family-ATPase MSFLKEVREKLGLSQSQLKDVLNLRLNRSYDRHTISRWENSRQPLPAEVSSELEALLHGEKRQTTIITFANQKGGVGKTTSALNVSVALSKMGYRVLLIDVDPQASATAALLGMQIVPLYRQGKTLAHALLKDASMSNCIVKKGPIEGVEVEIPVDFCPSHIDLAEVDIRREPGTEGLLKEAISQVQDNYEFIILDSPPHLGFLTWMALASSNTVFVPVRTEPYDVMGVNLILDTITKVNRRSNPRLRLGGVIPTQFVQNQYVDVGIIEHLIRVMDNRAPVLEPVPSSTSFSNAAWASKIPVDVAPRSPSVRVYVRLAEAIAGRRNFINASSVLSLDTDKRENS >NZ_CP032486.1|WP_141494018.1|69191_70064_-|ParB/RepB/Spo0J-family-partition-protein MKRTFKQRPASAFAQTHSQVMEQVDVPFLGDGKFRHTFEASIDQIIPDPNQPRRNFEQASLEELAESLKQQGQLQPILVRQSSENSEKWIIVAGERRFRAAKLADWTSILAIPHDGNSTSAALIENLMRVDLNPVEEAKGIKNLLDYNQWSQRKAAAELGMDQARISRAIKLLSLPESFLEKAGAASVPMNVLVGIARIDDPARRDKLMEKALSGEVTVASLNERNFSIEPDRKDGQSNPDRQLKAINIEKMAPKIIQVIRDFEVKNVKFSDKDMSALRLLHAELTKLVG >NZ_CP032486.1|WP_068173535.1|68539_68914_-|hypothetical-protein MRRYAFPYIPNRPVRVPQRWTDTFAPLSCRRCSAVWEEGDPALRVPCRGCGAGPSEPCRRSKGGNERVCACRDEDAVRLGMLEPCEGLTWDGRHEKPLRLRAEPVPSALMCRAVRTGAPISRWG >NZ_CP032486.1|WP_141494017.1|67781_68045_-|hypothetical-protein MTRQSQPIPAGFSLIQAHREGWTLHTLPHPQNGPEGETQLRGLDCGDQQAWWRVASKARLGSRYHLSALALISAAEREQIALRFGPF >NZ_CP032486.1|WP_141494025.1|80515_81223_+|IS6-family-transposase MGDFKGRHFRGEVILWAVRWYCHYGISYRDLETMLAERGVSVDHSTIYRWVQRYAPEMEKRLRWYWKRPGFSSSWRVDETYIKVKGKWTYLYRAVGKGGDRIDFFLSPTRSAKAAKRFLSKALNGLRRWEKPETINTDKAPTYGRAINEFTKNGKLPDTVKHRQVKYLNSVIEADHGKLKQLIKPVRGFKSLKTAYATIKGFEVMRALKKGQAELFQFQKGIMGEVRLIERQFSF >NZ_CP032486.1|WP_141494026.1|81266_81869_-|CdiI-family-contact-dependent-growth-inhibition-immunity-protein MNELIYTPERIRGLHRKSVLIALSPKYISLESQEIWKGVYFSRTGWCVHASKDATAQWIGENFRKALLSSEYFNTPGRPLDKEDRIKMEEVSDKRRLDFCDEIMRTYGYKKREKIGDRCDVVYASWHDLVDDFVTLSASKRQPGGHSAWGPMDKKKYASTRVTVPLSASDEELGLAIRDALSRCEAPGRQKINSPALLQS >NZ_CP032486.1|WP_141494027.1|81865_85009_-|DUF4214-domain-containing-protein MSDQNVLVINQNQNLLFNDNHGAVSIEASASFFLKEKEFIDAQQAGNKEDIYQYISKGNVDYWNVIGNNKQNTFDFKNYSLSTLSESKSTAFSYDQYGNVYAAPDQGGIYGPSSQMGLYNFGPAPSSWISTHDRHMLTRSYADYYRDNVADKIGSEILNRPLTSAEKQDDWNSIYDTVNSRLAANRVLTPYQAAQDQYIATIKNIRNKYIENSPYVNQSAHKMYDNFYGNLSDGDSRWFKDQLKSGKTFEKIWQEEAHSGRMRAQVLDMINGVQDRNDAKEEWDAPYVDAVTNSLANHSQTFKDIRNNLIDGTSQVAAHKMYDNFYGYTSDTDIAWFKDQLHSGKTTQQVWQEEAHSGRMRAHVMDMINGVQDRNDAKEEWDGNYINGFTDRLANRSASFNDMRNELIDGTSRGSAEKMFNSFYGHIEESNWNWYKEQLHSGKTTQQIRAEQAYSEEVQNNINKLYQDELGRSADPSGLATYQKSLADGGSLQAIRSTIAYSQEAQNNLNKLYQDELGRSADPSGLATYQKSLADGGSLQSIRSAIAYSQEAQNNINKLYQDELGRSADPSGLATYQKSLADGGSLQAIQSTIAYSQESHDTIMAQYRTEVGLYNPNDGQIAFYQKILANSGNFSNVKTALAYSSETVQSLQSSLEFMYGRPFTETDIQWRKNVQDDLASGASSHLYVIKELTESSEFHDGANQLFAHWGLPPINDFQLQTLRTGMTNLYQAKAIVNGQTQEQLEAEAAAYKAPAANATFDDYIASPAASVEQATDLMASMLVEPMMDPAETWQDVDGILLQGVTSTMNAAVSQSITKTVIQQDKDNPQTPCDDVQMFRRLHTTDVTNIGQITSANAKWQTVNQRDWPSSYATDGIVWATGAGKGPTAQGLPYEAYVQQKLNGGNTTGNYVWLQDHKSNWMTFDHWNKDTGDAVSDKVLNTGRKSFQEIDKDKKYSPANIKYQIWKDLKEMSIKYVRGNSRQGSPVTIQFTQSEIESYRFELAVRVDGTTDAQWKQICEAYHGAAAKMASYETNGYKPLTFEIDAIV >NZ_CP032486.1|WP_141494028.1|85270_86182_+|recombinase-family-protein MKGQRVGYVRVSTFDQNVDRQLDGEILDRVFTDKASGKDVQRAQLDELLAFVREGDIVVVHSMDRLARNLDDLRNLVHTLTRKGVCVEFIKERLIFSDKDESLPKLMLSVMGAFAEFERSLIRERQREGIALAKKRGAYRGRKRVLSHEQVADILQRISDGETKAAIARERGISRETLYQYLRAYPNGGVRMRPVKEVVHDAPMGAKPFIATLILNIQSQSASGPSLSSVRAEIEDMLAEDYDVQKNEKGEYRLAIPAEYTRTEHDLTAEIADLFDEIDAIAEEYVCTAVGTLREMGGQERVW >NZ_CP032486.1|WP_141494029.1|86377_86620_+|AbrB/MazE/SpoVT-family-DNA-binding-domain-containing-protein MHGTVRKWGNSAAIRLPTSILEAVQLQIDQPVDVHEEDGRIIIQPIRHKNLSLESLTAAITDENRHGEVDFGVAIGGEAW >NZ_CP032486.1|WP_141494030.1|86619_86949_+|endoribonuclease-MazF MPEWVPDCGEIVWLEFDPQAGREQAGHRPAVVLSPASYNAKSGMIVCCPTTTRIKGYPFEVSLQGQPASVVLSDQVRSLDWRARRAKMKGKVTEDELESVREKVRLLVG >NZ_CP032486.1|WP_141494032.1|87778_88492_-|IS6-family-transposase MTPMSKIPLSYKRHRFPRELIAHAVWLYFRFPLSFRLVEEMLLERGIVVSYETICRWNLKFGAEYVRSLRRKRAKRGDLWHLDEVRVVIGGHIHWLWRAVDQDGYILDEILQKQRNTKAAKRLLTRLLRQQGVRPRRMITDKLKSYEAAKRKLGLSVRHLSHKGLNNRAENSHLPLRKRERTMQRFRSPGGCQRFLTVFSAVRNLFVPPVANANALARHMHRLQAFAQWKTVTILTA >NZ_CP032486.1|WP_170211113.1|92526_94869_+|hypothetical-protein MADTVVSGNQVTRNKTFNNGDRLYVGPNGSADYATLNSGSLATITNSPYGIHDSTVNNGASLVLSSGGFGTTTKVNGGNVTVSSGGTYNYNWTNAGGTVTVLNGGTAWTNFVSGQNAAVIISSGGFGSDARVGSDGRYSVGSGGRLSGANVGSGGTLYAIGGSIATATISAGGTAFIQNGGTLTSATVDGGTLFLNGTDPTNNTIFTSNGGTVYLNNNYRNTTSWGNISSNTTFAVNSGGQLFGGTVLQGGVIRVNEGGKLTSATLNGGTLNLNGANATSNTTFGTSGGTVNLFSGYSNTVAWQNITSNTRFNVLSGAIFSGTNVLSGATVNVASGGSLTGTLSVNPGGTVVLNGTAGSGTVNLSGDGSQLTISGTQMPTNVISGWSPTDKIDLASIPYGSITSVTTTESGVTFHTANGSYSLNIPGANKYGYALNKDSDGSTIYTTCFAEGTHITTEEGDIAVENLKIGTQIHTPNGLMPLKWLGHRSITVAKQKHPEDNWLVRIRRGAFAEGTPARDLLVTQEHCMVFDGRLVPARMLVNNRSVIVDRSINSYTYYHVELESHAAIWAEKTLTESYLDTGNRDQFENNTVVSLSPRRRTGGSVTLPLDTSRDFVEPIFRSIAERAGVAGATSQHGMTCDPDLHLLTETGEVIRPRRISGEQHVFFLPDTIEHVKIMSRSSRPSDVVGPYVDDRRDLGVLIGQMKLFGANKTTSIEVPSSPVELSGWYDCSAETGRWTNGAATLFVGPARNNEPRILTLQILSQGHYRIESEQGTAATA >NZ_CP032486.1|WP_141494034.1|96943_97141_+|HVA1-family-protein MKKNEHVAWNTTRGETTGHVKKQVTHDIKIKGNTVKASGDHPKIVVKSDKTGAEAAHKPESLKKR >NZ_CP032486.1|WP_141494035.1|97625_98123_-|hypothetical-protein MVIRIHDGEEASATRTPLEGGILGWAAVAMLGVCLVVIRVVPSNRASQVRRFAQIWAGGLFCFFAGVHRGASFYDAKGPQLSDPFIFLATHALGLGAILSSSNISWRALQVGALLNMAGDVRLARQGRLPRFLVRLRPAQLSTAFVLLILLEHSTPNHVLSSKSD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP032486_1 | 1.1|79877|32|NZ_CP032486|CRISPRCasFinder,CRT | 79877-79908 | 32 | NZ_CP032486 | Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence | 79877-79908 | 0 | 1.0 |
NZ_CP032486_1 | 1.2|79937|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR | 79937-79968 | 32 | NZ_CP032486 | Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence | 79937-79968 | 0 | 1.0 |
NZ_CP032486_1 | 1.3|79997|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR | 79997-80028 | 32 | NZ_CP032486 | Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence | 79997-80028 | 0 | 1.0 |
NZ_CP032486_1 | 1.4|80057|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR | 80057-80088 | 32 | NZ_CP032486 | Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence | 80057-80088 | 0 | 1.0 |
NZ_CP032486_1 | 1.5|80117|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR | 80117-80148 | 32 | NZ_CP032486 | Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence | 80117-80148 | 0 | 1.0 |
NZ_CP032486_1 | 1.6|80177|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR | 80177-80208 | 32 | NZ_CP032486 | Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence | 80177-80208 | 0 | 1.0 |
NZ_CP032486_1 | 1.7|80237|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR | 80237-80268 | 32 | NZ_CP032486 | Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence | 80237-80268 | 0 | 1.0 |
NZ_CP032486_1 | 1.8|80297|29|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR | 80297-80325 | 29 | NZ_CP032486 | Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence | 80297-80325 | 0 | 1.0 |
NZ_CP032486_1 | 1.9|80354|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR | 80354-80385 | 32 | NZ_CP032486 | Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence | 80354-80385 | 0 | 1.0 |
NZ_CP032486_1 | 1.8|80297|29|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR | 80297-80325 | 29 | NC_008826 | Methylibium petroleiphilum PM1 plasmid RPME01, complete sequence | 244118-244146 | 5 | 0.828 |
NZ_CP032486_1 | 1.8|80297|29|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR | 80297-80325 | 29 | NZ_CP054621 | Azospirillum oryzae strain KACC 14407 plasmid unnamed6, complete sequence | 623349-623377 | 7 | 0.759 |
NZ_CP032486_1 | 1.1|79877|32|NZ_CP032486|CRISPRCasFinder,CRT | 79877-79908 | 32 | MH316562 | Mycobacterium phage Erk16, complete genome | 3745-3776 | 8 | 0.75 |
NZ_CP032486_1 | 1.1|79877|32|NZ_CP032486|CRISPRCasFinder,CRT | 79877-79908 | 32 | DQ398051 | Mycobacterium phage PLot, complete genome | 3745-3776 | 8 | 0.75 |
NZ_CP032486_1 | 1.1|79877|32|NZ_CP032486|CRISPRCasFinder,CRT | 79877-79908 | 32 | NZ_CP030354 | Novosphingobium sp. P6W plasmid pP6W1, complete sequence | 98189-98220 | 10 | 0.688 |
1. spacer 1.1|79877|32|NZ_CP032486|CRISPRCasFinder,CRT matches to NZ_CP032486 (Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
ttcaagatcgccggaattgggacgagcagcac CRISPR spacer ttcaagatcgccggaattgggacgagcagcac Protospacer ********************************
2. spacer 1.2|79937|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP032486 (Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
tcggatacgccgctaaaagttttaaagctata CRISPR spacer tcggatacgccgctaaaagttttaaagctata Protospacer ********************************
3. spacer 1.3|79997|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP032486 (Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
gattgaagatcagggcgttttacagtctggca CRISPR spacer gattgaagatcagggcgttttacagtctggca Protospacer ********************************
4. spacer 1.4|80057|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP032486 (Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
ttaagtctggagcggtgctgtccttacacatt CRISPR spacer ttaagtctggagcggtgctgtccttacacatt Protospacer ********************************
5. spacer 1.5|80117|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP032486 (Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
aaacagagacatcagacggcaaaggttcaatc CRISPR spacer aaacagagacatcagacggcaaaggttcaatc Protospacer ********************************
6. spacer 1.6|80177|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP032486 (Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
aaaattcaggaaataaacaatgacagccccta CRISPR spacer aaaattcaggaaataaacaatgacagccccta Protospacer ********************************
7. spacer 1.7|80237|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP032486 (Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
aataataatagctcttttacgccagttctgta CRISPR spacer aataataatagctcttttacgccagttctgta Protospacer ********************************
8. spacer 1.8|80297|29|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP032486 (Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
tcaacagacggggcaggaggcgggccaac CRISPR spacer tcaacagacggggcaggaggcgggccaac Protospacer *****************************
9. spacer 1.9|80354|32|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP032486 (Neokomagataea tanensis strain AH13 = NBRC 106556 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
tgaacatcagcggacaattcgttggcgaactg CRISPR spacer tgaacatcagcggacaattcgttggcgaactg Protospacer ********************************
10. spacer 1.8|80297|29|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR matches to NC_008826 (Methylibium petroleiphilum PM1 plasmid RPME01, complete sequence) position: , mismatch: 5, identity: 0.828
tcaacagacggggcaggaggcgggccaac CRISPR spacer acaacacaccgggcaggaggcgggccttc Protospacer ***** ** **************** *
11. spacer 1.8|80297|29|NZ_CP032486|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP054621 (Azospirillum oryzae strain KACC 14407 plasmid unnamed6, complete sequence) position: , mismatch: 7, identity: 0.759
tcaacagacggggcaggaggcgggccaac CRISPR spacer cgaacagacgggccagcaggcgggcggtc Protospacer . ********** *** ******** . *
12. spacer 1.1|79877|32|NZ_CP032486|CRISPRCasFinder,CRT matches to MH316562 (Mycobacterium phage Erk16, complete genome) position: , mismatch: 8, identity: 0.75
ttcaagatcg----ccggaattgggacgagcagcac CRISPR spacer ----aggccgatgcccggaattgggacgagcagcgg Protospacer **..** ********************.
13. spacer 1.1|79877|32|NZ_CP032486|CRISPRCasFinder,CRT matches to DQ398051 (Mycobacterium phage PLot, complete genome) position: , mismatch: 8, identity: 0.75
ttcaagatcg----ccggaattgggacgagcagcac CRISPR spacer ----aggccgatgcccggaattgggacgagcagcgg Protospacer **..** ********************.
14. spacer 1.1|79877|32|NZ_CP032486|CRISPRCasFinder,CRT matches to NZ_CP030354 (Novosphingobium sp. P6W plasmid pP6W1, complete sequence) position: , mismatch: 10, identity: 0.688
ttcaagatcgccggaattgggacgagcagcac CRISPR spacer gtcaagatcgccggctttgggacgttcctgct Protospacer ************* ******** * .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 3636
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP032486|0:3636|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >NZ_CP032486|0:3636|1672_2035_-|WP_141493968.1|DBSCAN-SWA MSVQIQESYLHQLLNISQRDQSYDPETFYYTVLELVDLFGKCPRRLTAAQCLEGILGVPTDEHFNDFILRITKEFLSEVPEEVLIDDINKDFIKNIYVYKYKSFTYFFANDLNTCSYNNV >NZ_CP032486|0:3636|2757_3636_-|WP_141493969.1|DBSCAN-SWA MTLIDPRNRYPRPPFKTQPQDFPGLSANMGPEPDYGLDSYQSGERLKGLVALITGGDSGIGRAVAIAYAKEGADIALAYLDAEEKDAQNIASSIEAIGRRCLLLPGDIRKKTVCVDWIDETVKKLGGIDILVNNAAFQHPRKDLLDIEDEEWRCHFDTNIHGMFYLTKAALPHLKPGASIINTSSVNTRTPMSILIPYSMTKAAIANFTVSLAGSLVEKGIRVNSVLPGPIWTPFIATGMPAEQHESFGSQAPMGRPGQPAELAGAYVYLADPNNTYTTGALLPVHGGMPQL |
2 | Trichoplusia_ni_ascovirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
13924 : 14581
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP032486|13924:14581|DBSCAN-SWA GATGCGTGATTTCAAGGGGCGTCATTCTCGTGGTGAAGTGATCCTATGGGCAGTGCGCTGGTATTGCCGTTACGGGATCAGCTACCGCGACCTTGAAACCATGCTGGCCGAGCGCGGTGTGAGCGTGGATCATTCAACAATTTACCGTTGGGTCCAGCGCTATGCACCCGAGATGGAAAAGCGCCTTCGGTGGTATTGGAAACGCCCAGGGTTTTCCAGCAGTTGTCGGGTTGATGAGACTTACATCAAGATCAAAGGGAAATGGGCGTATCTGTACCGGGCTGTCGGCAAAGGCGGTGATACGATTGATTTCTTCCTCTCACAGACCCGGAACGCCAAGGCTGCCAAACGCTTTCTGAGCAAAGCTTTGAACGGCCTGAGGGAGTGGGAGAAGCCTGAGACAATCAATACGGATAAAGCCTCAACCTACGGCATAGCCATCAACGAACTCCAGAAGAACGGTAAGCTTCCCGACACGGTGAAACACCGTCAGGTGAAGTCTCTGAACAACGTGATCGAGGCCGATCACGGCAAGCTGAAACAACTGATCCGGCCGGTTCGTGGCTTCAAAAGCCTGAAAACCGCTTATGCGACGATTAAGGGTTTTGAGGTCATGCGCGTCTTAAAAAAGGGACAAGCTGAACTCTTTCAGCTATAG
Protein sequences of DBSCAN-SWA_2 >NZ_CP032486|13924:14581|13924_14581_+|WP_141493976.1|transposase|DBSCAN-SWA MRDFKGRHSRGEVILWAVRWYCRYGISYRDLETMLAERGVSVDHSTIYRWVQRYAPEMEKRLRWYWKRPGFSSSCRVDETYIKIKGKWAYLYRAVGKGGDTIDFFLSQTRNAKAAKRFLSKALNGLREWEKPETINTDKASTYGIAINELQKNGKLPDTVKHRQVKSLNNVIEADHGKLKQLIRPVRGFKSLKTAYATIKGFEVMRVLKKGQAELFQL |
1 | Escherichia_phage(100.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
22444 : 24553
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP032486|22444:24553|DBSCAN-SWA CATGGCACTCCAGTTACCTCATTCCGAAGCCAACCATGGCGGGCGGCATTCGGAACATCGCAGTCAGAGCAGCCTGATTCAGGCGTCGCTTCTGCTGTCCGCGATCAGGGAAAGCTTCCGTAAGCTTGATCCGCGCATAATGTGGCGTAATCCGGTCATGTTCGTCGTAGAGGTCGTCACCATTCTGACGACTGTCCTGCTGGTACGCGACATAATCATTGGAGGAACCCATCTCGGTTTTGCGATCCAGATCAATCTTTGGCTCTGGTTCACGCTCCTCTTCGCCAATTTCGCCGAGGCCGTCGCGGAAGGGCGTGGCAAGGCGCAGGCCGACAGTCTACGCCGCACCCGAACCGAGACACTCGGAAAGCGTCTCTTGGCCAATGACGACGGAGTCTACTCACATTTCGGTATGTATGAGCAGATCCCGGCTCCTGAACTTGCGGTAGGCGACATCGTGCTGGTTGAAGCGAACGACTTCATCCCGAGCGACGGAGAAGTTATCGGTGGGATCGCATCAGTGGATGAATCTGCCATCACTGGTGAATCCGCTCCCGTGATCCGAGAAAGTGGTGGCGATCGATCCGCGGTGACGGGTGGCACACGCGTCCTGTCCGACTGGGTCATCGTGAGGATTACTGCGGCTCAGGGTTCCACCTTCCTTGACCGGATGATTTCGCTCGTCGAAGGCGCACAGCGCCAGAAAACGCCCAACGAGATTGCGCTAACGATCCTCCTGGCCGGCATGACGCTGATCTTCATATTCGCGGTCGCGACGATCCCAAGCTTTGTGCATTATGCAGGGGGTCATATTTCTGTTTTGATCCTCGTCGCGCTGTTTGTAACACTCATCCCGACGACCATCGGTGCTCTGCTGTCAGCGATCGGTATTGCCGGTATGGATCGCCTGATACGTTTCAATGTGCTTGCCATGTCTGGAAGGGCCGTGGAAGCAGCAGGCGACGTCGATACGCTGCTGCTCGACAAGACCGGGACGATCACCATCGGTGACAGACAAGCTACCGCATTCGCTCCTGTCTCAGGTGTAACGGAACACGAACTTGCTGATGCCGCACAACTTGCATCCCTGGCCGACGAAACGCCCGAAGGGCGCTCTATTGTTGTGCTCGCGAAGGAGAAATTCGGTATTCGCGGGCGGGATATGAAAGCCCTTGGTGCGCATTTTGTGCCCTTTACAGCGCAGACACGCATGAGCGGCGTGGATGTCGGGGACCGACATATCCGCAAGGGCGCAGTAGACAGCGTGATTGCCTATCTCGGGGAAGCTGTGCCATCCGTCGGTCAGATCCGCCAGATCGCCGACGAGATCGCCCGTCAAGGTGGTACGCCGCTTGCTGTAGCGGATAACACGCGCCTGCTTGGCGTGGTACATCTGAAGGACGTCGTCAAGGGCGGTATCCGCGAACGCTTCGCAGAACTGCGACGGATGGGCATCCGAACCGTCATGATCACCGGAGATAATCCCCTGACGGCTGCAGCGATTGCTGCAGAGAGTGGTGTCGATGACTTCCTTGCGCAGGCAACGCCGGAAGCAAAGCTCGCACTGATCCGTTCGGAGCAGGCGTCTGGCAAGCTGGTAGCCATGTGCGGCGATGGTACCAATGATGCTCCGGCTCTCGCGCAGGCCGATGTTGGGGTGGCCATGAATACGGGAACGGTCGCGGCACGCGAGGCCGGCAACATGGTTGACCTGGACAGTGATCCGACGAAACTCATTGAAATCGTGGGGATAGGAAAACAACTTCTGATGACGCGGGGAGCCTTGACGACGTTTTCGATCGCCAACGATGTTGCCAAATATTTCGCCATTATTCCTGCGATGTTTGTGGGCTTCTATCCAGCGCTTTCAGCATTGAACATCATGCATCTTGCCACACCGGAAAGCGCAGTGCTCTCAGCGATCATTTTCAATGCCCTGATCATCATCGCGCTCATTCCTCTGGCGTTGAGGGGTGTTCGTTACCGTCCGGTAGGCGCGGCGTCGCTTCTAAGGCGCAATCTCCTGATTTACGGCATTGGCGGCCTGATCGTCCCGTTCATCGGAATCAAACTCATAGACATGCTCGTGACGAGCATTGGTCTGGCCTGA
Protein sequences of DBSCAN-SWA_3 >NZ_CP032486|22444:24553|22444_24553_+|WP_141493983.1|DBSCAN-SWA MALQLPHSEANHGGRHSEHRSQSSLIQASLLLSAIRESFRKLDPRIMWRNPVMFVVEVVTILTTVLLVRDIIIGGTHLGFAIQINLWLWFTLLFANFAEAVAEGRGKAQADSLRRTRTETLGKRLLANDDGVYSHFGMYEQIPAPELAVGDIVLVEANDFIPSDGEVIGGIASVDESAITGESAPVIRESGGDRSAVTGGTRVLSDWVIVRITAAQGSTFLDRMISLVEGAQRQKTPNEIALTILLAGMTLIFIFAVATIPSFVHYAGGHISVLILVALFVTLIPTTIGALLSAIGIAGMDRLIRFNVLAMSGRAVEAAGDVDTLLLDKTGTITIGDRQATAFAPVSGVTEHELADAAQLASLADETPEGRSIVVLAKEKFGIRGRDMKALGAHFVPFTAQTRMSGVDVGDRHIRKGAVDSVIAYLGEAVPSVGQIRQIADEIARQGGTPLAVADNTRLLGVVHLKDVVKGGIRERFAELRRMGIRTVMITGDNPLTAAAIAAESGVDDFLAQATPEAKLALIRSEQASGKLVAMCGDGTNDAPALAQADVGVAMNTGTVAAREAGNMVDLDSDPTKLIEIVGIGKQLLMTRGALTTFSIANDVAKYFAIIPAMFVGFYPALSALNIMHLATPESAVLSAIIFNALIIIALIPLALRGVRYRPVGAASLLRRNLLIYGIGGLIVPFIGIKLIDMLVTSIGLA |
1 | Streptococcus_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
27870 : 28554
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP032486|27870:28554|DBSCAN-SWA AATGAGCGATCCGTTGGTACTGATTGTGGATGACGAGCCTGCCATTCGTCGGCTTCTTAGAACGTCGCTCGTCAGCCAGTCTTGGCGCGTAACAGAAGCCCGAACGGGAAAAATGGCCCTGAACATGGCAACGGAGGTCGTGCCGGACATCGTTGTGCTTGATCTGGGGCTTCCAGACATGGACGGCGTAGACGTTCTTCGACGTCTGAGAGGTGCTCACCCAACTCTGCCCGTCGTGATCCTTTCGGTCCGTGATGACGAACGGGGCAAAGTTGCAGCCCTTGAAGCGGGTGCTGATGACTATGTAACAAAGCCGTTCAGCATGGCTGAACTTATCGCCCGCATGCGCAATGCCTTGCGACACGCCTTACAGCAAGAAGGCACCATTCCGCAGTTTGTGTCAGGTGATCTGACAATCGATCTTGTACGCCGCCAGATCTTTCGAAGCGAGGATGAGATCCGGCTTTCACCACGGGAGTGGGATATCTTGCGTATGTTGGTTCGTTATGCGGGAAGGGTTCTAACACACCAGACAATCATGAGTCAGCTATGGGGAGCAACGGGCGATGTTCAACAGCTTCGCGTCTATATCCGGCAGATACGTCAAAAGATCGAAATTGATCCAGAGAGGCCACGATATATTATTACTGAAACGGGTGTCGGGTATAGGATGGTTCAATTATAA
Protein sequences of DBSCAN-SWA_4 >NZ_CP032486|27870:28554|27870_28554_+|WP_141493986.1|DBSCAN-SWA MSDPLVLIVDDEPAIRRLLRTSLVSQSWRVTEARTGKMALNMATEVVPDIVVLDLGLPDMDGVDVLRRLRGAHPTLPVVILSVRDDERGKVAALEAGADDYVTKPFSMAELIARMRNALRHALQQEGTIPQFVSGDLTIDLVRRQIFRSEDEIRLSPREWDILRMLVRYAGRVLTHQTIMSQLWGATGDVQQLRVYIRQIRQKIEIDPERPRYIITETGVGYRMVQL |
1 | Bacillus_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
31714 : 35448
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP032486|31714:35448|DBSCAN-SWA CATGAACGTGCTGGAATTCATGCTCAAAAATCGCGGCGTCATGCTCGGGCTGATCATAGCTCTGATGCTAGGCGGGTTGGCGACCATTCCCGGCTTGCCGGTCGAGCCCGCGCCGGATATTTCGCCCCGGCAGGTCATGGTGTCCGTCACGGCCCCCGGACTGGCGACGGAAGAGGTGGAAAAGCTCATCACCCTGCCCGTCGAAACGGCGCTGACGGGAATCCCCGGCACGACGGATCTACGCTCGGTGTCACGCACCGGGGTCAGCGCGATCTATCTCCAGTTCGCCGATGATACCGACATCGATCTAGACCGGACCCGCGTATCGGAGCGTCTGGACATCGCGCGCGGCAACATCACCGTTCCCAACGCGACCCTAATGCTCGGCCCGCGCGCAACAGGCATGGGCGAGATCATGCAGATTCAGCTCAAGGGTCCGGGCTTTTCGCCCATGACGCTGAACCGGCTAATGACATGGACGGTAGCCCCCATGCTCAAGCTTGTGCCGGGCGTGGCCGACGTCAACGTCAATGGCGGCGCGGAAGAAACGTTCCAGATCGCCCTTCAGCCTGCTCGCCTGCGCGCATTAGATATTTCTATAGCTCAGGTCGACCGGGCGATTGAGGAAAACAACGCCGCGTCCGGCGGCGGCTGGATCGCACAGGGAGCGACAGCGCAAGTCATTGTCGGTCGTTCGCTAGTCCAGGGCCTGGACGCATTCGGCGCCATACCCGTGAAATTCGGGGACCGCGGAAACGCCATTCGGTTGCGTGATCTGGGCATCATCTCCGAAGCTGCGCGCATGCGGCTCGGCGCCGTGACGCGCGATGGCCAAGGCGAGATCGTCAACGGTGTCGTCATGATGCAGATCGGGGCGAGCTCGAATGCGACGATCACCGGCATCAGGGCCGCCCTTCCTGCCATCCGCCAGGCACTGCCGACGGGTATCAGCCTTGATCCGTTCTACAGCCGAACCACGCTGACCGATCAGACGATCACAACGATTAAGGAAAATCTCGTGATAGGCGCGGCATTAGTGCTGCTGGTTCTCGTCGTGGTGCTAGGCGACTGGCGCGCGTCGCTTGCGATCGCAAGCGTCATCCCGGTTGCGCTAATTGCGGCGATGGCCGGTATGCGCCTGTTCGGCGTATCGGCCAATCTGCTGAGCTTGGGCGCCATCGATTTCGGGATGATCGTCGATAGCGCCCTCGTCATCGTCGAGCATCTGATGGCTGAGCGGGGCAACCGTCCCAATTCCGCAGCGCTTCCACGGCTTGCGATCCACACCACACGGTTGGTCGCGCGCCCGGTTATCTTTGCGATCGCCGTCATCATCATGGTCTACCTACCGATCCTCACCCTACAGGGGATCGAGGGCAAAATGTTCAAACCGATGGCCCAGACGGTCATCATGGCACTTCTGGCATCCTTGATTTACGGTTTTGTGGGCGTGCCGTTGATAGCAAGCCTCTTGTTGCGTCGCAGCCCGCCCGATCGAGAAACTCGACTCATTGCAAGCATGCGCCGACATTACGAACGCGCGCTTGACTGGAGCCGAACCAGGGCGGAGCTGCTAATTGGCATTGTGGCTGTCGTCGTCGCCACGGCCGCCCTGATGGGTAGTCGATTGGGGGGCGAATTTGTACCACAACTGAAAGAAGGGTCGCTTGTCGTGACGTCAGTGCGCTTGCCAAGCGCGTCGCTAGCGACCGTTCTGGCAGATGTGACGCGCGAGGAACGCATCGTGCGGGGGTTTCCAGAGGTCGCCACGATCGTTAGCAACACGGGCACGGCTGCGATTCCGACCGATCCGATGGGCCCTTATGAAACCGACAGCTTCGTTCTGCTCAAGCCCTCGTCGACCTGGCCTACCGGGATGACCCAGGAAGCGCTGGTTGCCGACCTGAGCGCAACCCTCCAGCGCGAATTGCCCGATGCGGAATATTCCTTTTCGCAGCCTATTCAGATGCGTATGGACGATTTGCTGTCGGGCGTTCGGACACAACTGGCGGTGTTCATCTACGGCGAGGATCTCGATCGTCTCGGCGCGTTGGCCGCGAAAACCGCGACCGTGTTGCGCAGCGTCTCCGGTGCGGCGGACGTTCAGGTGCAGGGAGACGGAACCGTTCCGTTCCTTCGGATCGACGTCAATCGCGATGCCGCAGCTAGGCTTGGTGTAGCGGTTCCCGATATCCTGGCCATGGTTGAAGCGATCGGCGGCCGCGCCGGCAAGCCCGTGATCGTCGACAATGCCATCGTGGGAACCCAAGTCCGCCTTGATCCGTCATTCGTGACGAATGTTGATCGCATCGGCGATCTGCAGATTCGTCGTGCTGACGGGAAAGGCTGGGTGCTGCTGAGCGAAGTGGCGCGTATCGCAACGCTCGACGGACCGCCACGCATCGATCGCGACGGCTTGCAACGACGTATCATCATTCAGGCCAACGTTCGTGGCCGCGACACCAGTTCGTTCGTGACCGACGCCCAAAAGGCGGTGGCGCGCGCCGTCTTCTTGCCGCACGGATATCACATTGTCTGGGCGGGACAATTCCGCAATCTTCAATCGGCGATGGCCCGGCTAAATATCGTGGTGCCGATCGCGCTTGGGTTGATATACGGCCTGCTGGTGGTGGCGTTGGAGTCACAGGGCGCTGCGGCACTTGTGTTCGTCAACCTGCCGGTCGCGGCGACAGGAGGTATCTTCATGCTGGTGTTTCGGGGGTTTCCTTTCAGCATCGCCGCAGGGATCGGGTTCATCGCCCTTTTCGGCGTCGCTATTCTAAATGGTGTCGTGCTGCTCAGTCAGATCGGTTTGTATCGCAAGGCCGGCATGACGCCTGGCGATGCGGCTTTTGCCGCCGCACGCTCACGTTTTCGTCCCGTCATCGCGACTGCATCCGTTGCCATGCTCGGCTTTTTTCCTATGGCTTTTTCCGCTGGTGCTGGTGCTGAGGTCGAGCGCCCGCTGGCTAGCGTGGTCATCGGTGGGCTCGTTTCGTCCACCGCCCTAACATTGTTGTTGCTACCGGCGTTCTATGCCCGGCTTTTCGGCGGTAGACAGTCGTGAGAATTCTAGTGGTTGAGGACGAACCTGCCTTGGGGGCAGCCGTGGCGGAGCGTGTGCGGCAGGCGGGCCATGTCGCAGACTGGGTGGCGACGCTGGCGGACGCGCGCGCAGCATTCAAGGCGTTTACCTATGATTTTGTCTTGCTGGACCTAGGCTTGCGGGACGGCAATGGTCGCGTTTTCTTACGCGAGATCCGCGCCAGAAAAACATCCGCCGCCGTCATGATCACCACGGCCATGGACCAGATCCGCGACCGGATCGGCGGCCTGTCGGACGGAGCGGACGATTATCTGGTCAAACCATTTGATCTCAACGAACTGATCGCCCGGATTGATGCGGTGGCACGGCGATACGTGGCGCTGCCCATCAACACGATCCGACGTGGCGACACGGAAATCGACCTAGCTCGACGGTGCGTATCTTATGCGGGAAATGCGATCGACCTGACGGCCCGCGAATGGGCCGTGGTTGAACTTCTGGCCCGCAGGCCGGGCGCGATTTGCTCGAAAGAGCAGATCGAGGACGCGCTGTATGGTCTCGGCGAGGTGGTCGAAAGCAACGCTATCGAGGTTTTCGTCAGCCGCATCCGCAAGAAAATAGGCTCCAATGCGATCCGCACCCTTCGTGGGCGCGGTTACGCGCTGGCTGGCAGTTCGGAGGACGTATGA
Protein sequences of DBSCAN-SWA_5 >NZ_CP032486|31714:35448|31714_34780_+|WP_141493989.1|DBSCAN-SWA MNVLEFMLKNRGVMLGLIIALMLGGLATIPGLPVEPAPDISPRQVMVSVTAPGLATEEVEKLITLPVETALTGIPGTTDLRSVSRTGVSAIYLQFADDTDIDLDRTRVSERLDIARGNITVPNATLMLGPRATGMGEIMQIQLKGPGFSPMTLNRLMTWTVAPMLKLVPGVADVNVNGGAEETFQIALQPARLRALDISIAQVDRAIEENNAASGGGWIAQGATAQVIVGRSLVQGLDAFGAIPVKFGDRGNAIRLRDLGIISEAARMRLGAVTRDGQGEIVNGVVMMQIGASSNATITGIRAALPAIRQALPTGISLDPFYSRTTLTDQTITTIKENLVIGAALVLLVLVVVLGDWRASLAIASVIPVALIAAMAGMRLFGVSANLLSLGAIDFGMIVDSALVIVEHLMAERGNRPNSAALPRLAIHTTRLVARPVIFAIAVIIMVYLPILTLQGIEGKMFKPMAQTVIMALLASLIYGFVGVPLIASLLLRRSPPDRETRLIASMRRHYERALDWSRTRAELLIGIVAVVVATAALMGSRLGGEFVPQLKEGSLVVTSVRLPSASLATVLADVTREERIVRGFPEVATIVSNTGTAAIPTDPMGPYETDSFVLLKPSSTWPTGMTQEALVADLSATLQRELPDAEYSFSQPIQMRMDDLLSGVRTQLAVFIYGEDLDRLGALAAKTATVLRSVSGAADVQVQGDGTVPFLRIDVNRDAAARLGVAVPDILAMVEAIGGRAGKPVIVDNAIVGTQVRLDPSFVTNVDRIGDLQIRRADGKGWVLLSEVARIATLDGPPRIDRDGLQRRIIIQANVRGRDTSSFVTDAQKAVARAVFLPHGYHIVWAGQFRNLQSAMARLNIVVPIALGLIYGLLVVALESQGAAALVFVNLPVAATGGIFMLVFRGFPFSIAAGIGFIALFGVAILNGVVLLSQIGLYRKAGMTPGDAAFAAARSRFRPVIATASVAMLGFFPMAFSAGAGAEVERPLASVVIGGLVSSTALTLLLLPAFYARLFGGRQS >NZ_CP032486|31714:35448|34776_35448_+|WP_141493990.1|DBSCAN-SWA MRILVVEDEPALGAAVAERVRQAGHVADWVATLADARAAFKAFTYDFVLLDLGLRDGNGRVFLREIRARKTSAAVMITTAMDQIRDRIGGLSDGADDYLVKPFDLNELIARIDAVARRYVALPINTIRRGDTEIDLARRCVSYAGNAIDLTAREWAVVELLARRPGAICSKEQIEDALYGLGEVVESNAIEVFVSRIRKKIGSNAIRTLRGRGYALAGSSEDV |
2 | Leptospira_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
40123 : 40840
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP032486|40123:40840|DBSCAN-SWA TTCAAACGGTGTAAACTCCGGATTGTCTCTCGATGAGACGCACTTCTCCCATGATGCCCTCTTGGAATTGGAAGAGTTCAGCTTGTCCCTTTTTCAAGGCGCGCATGACCTCAAAACCCTTAATCGTCGCATAAGCGGTTTTCAGGCTTTTGAAGCCACGGACAGGTTTAATCAGTTGTTTCAGCTTGCCGTGATCGGCCTCAATCACGTTGTTCAGAGACTTCACCTGACGGTATTTCACCGTATCGGGAAGCTTACCGTTCTTCTTGAGTTCGTTGATGGCTATGCCGTAGGTTGGGGCTTTATCCGTGTTGATTGTCTCGGGCTTCTCCCAGTCCCTCAGGCCATTCAAAGCTTTGCCCAGAAAGCGTTTGGCAGCCTTGGCGTTCCGGGTCTGTGAGAGGAAGAAATCAATCGTATCCCCATCCTTATCGACCGCCCGATAGAGATACGCCCATTTCCCTTTGATCTTGATGCAGGTATCATCAACCCGCCAGCTGCTGGAAAACCCCGGGCGTTTCCAATACCAGCGGGGGCGTTTTTCCATCTCGGGTGTATACCTCTGAACCCAGCGGTAGATCGTTGAATGATCAACGCTCACACCGCGTTCACAGAGCATGGTTTCAAGGTCGCGGTAACTAATCCCATACCGACAATACCAGCGCACTGCCCATAGGATCACTTCACCACAAAAATGACGGCCCTTGAAATCACGCAT
Protein sequences of DBSCAN-SWA_6 >NZ_CP032486|40123:40840|40123_40840_-|WP_141493993.1|transposase|DBSCAN-SWA MRDFKGRHFCGEVILWAVRWYCRYGISYRDLETMLCERGVSVDHSTIYRWVQRYTPEMEKRPRWYWKRPGFSSSWRVDDTCIKIKGKWAYLYRAVDKDGDTIDFFLSQTRNAKAAKRFLGKALNGLRDWEKPETINTDKAPTYGIAINELKKNGKLPDTVKYRQVKSLNNVIEADHGKLKQLIKPVRGFKSLKTAYATIKGFEVMRALKKGQAELFQFQEGIMGEVRLIERQSGVYTV |
1 | Escherichia_phage(100.0%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
60349 : 60631
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP032486|60349:60631|DBSCAN-SWA GATGAAGATTGCAGATCTGGTCGATCATATTGCTTTGACAACAAACACAAGCAAGGCCGAGACCCGCGCTGTATTGGATGAGCTGGTCAAAACCATTACCGCGGCTGCGCAACAGGGTGATGAGATCAGTCTTCCGGCATTGGGAAAGTTTAAAGTCAAAGAGACGCCTGAGCGCGAGGGGCGCAAACCAGCCACTGGGGAAAAGATCACGATTGCAGCATCCCGCAAGCTGACCTTTACACCGGCCAGAGCCTTGAAAGAGGCTCTGAAAGCTTCGGTGTAA
Protein sequences of DBSCAN-SWA_7 >NZ_CP032486|60349:60631|60349_60631_+|WP_141494010.1|DBSCAN-SWA MKIADLVDHIALTTNTSKAETRAVLDELVKTITAAAQQGDEISLPALGKFKVKETPEREGRKPATGEKITIAASRKLTFTPARALKEALKASV |
1 | Pelagibaca_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
64987 : 81223
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP032486|64987:81223|DBSCAN-SWA CTCAAAGCAAAAAACTTAACGTGACATCCCCGGAGAAGGTTTTGCCGGAGCCTACGGCATGCGCCATATAGGTGCTGCCTGCGGTGATGATCCGCCACACCACGCGTTTTTGATGGTCGCGCAGACGAATAACGCTGCTCGCCCCCGGCAATTGCAGATGTCGGCCGTCAAACTGGCGCGGGACAAGGTTGTTATACGTGTCGTTATACAGCCGCACCAAACGCTGTGCCCGCTCACCGTCTTCCCAGACCCACTGGCTAAACGCTTCGCGCAGCGCCATCAGCTTTTCTTTAGCGGCTTCGGTTTCACGGGCGTTCAGAACACGACGTTCACCGCCTTCGGGATCGCGGATCGTATCGAAGATTTTGGGCGTGCTCTGGTTAAGTGCATCTTCCAGAAGGTCTTGTGCAGGGCGTCGCTGTGTTCCCCACGTCGAGGTCGCCTCGGCCTTGCCGTAAAACGGCGCTTTCTCCACGCTCCAGGCCGCGACTTCGGCAGTGTGCCAAACCTGGGCTTCAACCCCGATTTTTTCAGCAATGAAGGCTTCCACATCCGAGGCTGGCACCCATGGTGCACCCAAACGCGCAGTAATATCACTAGGGCGCAAATCCGCAGGCTGCGCGTCTTCTAACGCGCGTACCAAGCTTGTATAGGCCGGATCCGTCTCCGCCGCCTCTCGCGCTTGAGCCAATTTGGTGCGGACCGCGCCCGATAACGCCTCATCGGCCATCACCCAGAGCGGGCGTTCCGGCGTGCTGCGCACAGGGTCACGATACACCGCCTCCCCAAGCTCGGCCTGTGCCGCCTCCTCACTGCACCCCAGCAACTCCGCAATCCGAGGCATATCCACCCCACCCACTTCATGCAGGCAGACCGCAAGCGCGTCATGGGCGCTGGCAATCTCTGGCTCGCTTGGCGCATGGATGATCCGCTCTGAGAAGATCGGACCAGGACGCCCCTCGTCACGGTCTTCGTCATACTCTTCAATCGCACTGACCAGCCAGACATCCGGGTCATCATAGAACGGTTGCAAATGCGGACGGCGTTGTGTGCTGCTCTCCTCGCCGGTTTCCGGGTCAACCCGCACACTGGTTTTTACAAGGTTAATCGGGCCAAAGGCTTTGACGAACGCAGCGTAAGCGCGTTTTAAGTCCTGCTGGAACCCCGCATAAGGCGCATTCTCCATCTGCGCACGCAGCACAGAGCGCGCCGCGTCACGCACGGGGATTAGCCCTTTGATGATATTGGCGTGTTTTTGGAAAATCCCCTCTTTCTGATCCCCCTTTCGGACCCGAACCGGCGTCGTTTCGCCATCGATGATCTGATGCAGTTCTGTCTTAATCAGGACGTAGCTGCCCTCACGCAGCACACTCCCCTGCCCAGCACTGGCTAACGCAGCACGCGCCTCCTGGGCGCTGTGCGGCGTCACAGCCTGCGGCTCTGGGAACCGCACCTGACCTTTGAGACGGCTCAACGCATTGGGCAGCGCCTCGTCCAAAGCCTGCATCGCAGGGGCGGTGCAGGTGTATTCCGGTCCATATTGGCCCGTTGTCCACCCATGTTGACCCAGCACCTGCTCAGGGTGATCGAGGAAATACTGGTTGATCTGAAGCGCACCATCGCCCTGATCACTCTGCGGCACCGTGCCCACATCCTGCCAGAGGGTATCCGCTGTGTCTTCGCCGGGGACATGGCGGCGCAGCACCAGAAGATCCACCACCACCTCGGTTCCGGCATCATCGCGCATCGCCCCTGCGGGCAGGCGCACGGCCCCGACCAAATCCGCCATCTCCGCAATATGCGCACGCGCTTTGTGGTCGGTTTTATCCATCGTCCAACGCGAGGTTACAAACACGGCCAGCGCACCGGGGCGCAAACGGCTGACAGCACGCGCGATGAAGTAATCATGCAAGGACAAACCCATCTTGCCCAACGCATCCGTCCCACGCACGGTGCGGTTGCTAAAGGGCGGGTTGCCCACTGCCAGATCATAGCGGTCAGCCAGACGGGCTTTGGTAAAATCTTCCTGGCGCACCCATTGATTGGGGAAGAGCTTTTGCGTGATCCGCGCGGTCAGCGGGTCGTTTTCAATAGCGGTCCAAGCGGTGTTCTCTGCAAGCGTTTCAGGCATCGCCGCCATGAACAATCCGGTTCCACAGCCCGGCTCCAGCGCCTTACCGCCCTTGAACCCCATTTGCAGGATCACATCCCACACAGCATGGATGATCACTTCGGGGGTGTAATGCGCGTATTGGGTCGCCCGCGCTAGGCCTGCGCGTTCCTGCACACTGGTTGCTTGTTCCAGCTGCTGGCCCAGCTCTTCCCACCCTTCGCGGAACGCTTCGCCAGCACGCGGAAAGAGGCTATTGGCCAGTTCGCCTGCTCCAAAGCCGGTAAAGCGCGCCAATTGCGCTTGTTCATCGATTGTGGCCGGACGGTTTTCTGTCTCCAGCTGTGTGAGCACGCGAATGGCGTTCAGATTGTCGCGCCCACGCGCTTTCCAGCCTGCGGCGAGGCCGCGTGTACCGCTCAGGCGGAAATCACGCTTGACGACGTCCACCACCTCTTGCGCCACGTGGCGGGCTTCGGCGACAGCATGATCACCGGGCAAACTCCCCCAGAACAAAGACGGGGCATCATTCAGGGCGGAAGTCTCGAAAAAGCTCAGCTGGGTCGTACGCATGGCACACTCCGGACAAGCGCCAACACCCGGTTCCTGCGCGGGGCAGACCGGGGGTGTTAGCGTTCAGTATTAAGGGGCAGGACGGGATTGGTGCGGGGTGAGCCGCAAAATGTCAGAAAGGGCCGAAACGCAGGGCGATCTGCTCACGCTCAGCGGCCGAAATCAGGGCGAGAGCGCTGAGATGATAGCGTGAGCCAAGGCGGGCTTTACTGGCCACGCGCCACCACGCTTGCTGATCGCCACAATCCAGACCACGCAGCTGTGTTTCGCCCTCAGGGCCGTTTTGCGGATGGGGCAGAGTATGTAGGGTCCAGCCCTCGCGATGGGCCTGGATCAACGAAAAACCAGCGGGAATGGGTTGGCTTTGGCGGGTCATGCCTGTCTTCTCCTCTCTTGACCCTTTCCGGCAACCACGGCGTCGTTGGGCCGTGCACGGGCGGCGCTTGCGCCGGGACCGCCCGGCCCGCAGCGCAGCGAGGACACGGACGGACCCACCCCTTGCACGGCACAGCGATCCCGTGGTGCCTCCCTTCCCTGTTCTGCCTTTGTCTGCCGGTCTTCTGTCTTTCTCCCTTTCCCTTTTGCTTTTGTGCTGGCGTCTCTTTGTGTGTCTTTTTCCTGTGGACGTGGGAACGCGGTGGTGTCACGGGCATGACGTGACACGTTGTTGGCTGAGGTGGCTGTTCGTCACATTTCAAAACACCGCTCTGCTAACCCGCTCAGTAGCGCACAAGCGCGTCTGTGGGCTGCGGCGAAGGCGCAGTCTACAGACACCGTTTCAGCGCACGCAGCGCGGGCGCGCCGCGCAAGGTGCGCATCCCCGCCATCACTCAAGAGCGTTGTGATGGCAGGGGTCTGTTGCTCTATCGCTTACCCCCATCGGCTGATGGGGGCACCGGTGCGTACAGCGCGGCACATCAAGGCGGAAGGTACAGGCTCAGCCCGAAGACGCAGAGGCTTTTCGTGACGGCCATCCCATGTCAGCCCCTCGCAAGGCTCTAACATCCCCAAACGCACCGCATCCTCATCACGGCACGCACAGACCCGCTCATTGCCCCCTTTGGAGCGACGGCATGGCTCGGAGGGCCCTGCGCCACAGCCACGGCATGGCACGCGCAAAGCAGGATCACCCTCCTCCCAGACCGCCGAACACCGACGGCAGGACAGAGGAGCAAAGGTGTCGGTCCAGCGTTGCGGGACGCGGACAGGGCGGTTCGGAATGTAGGGGAAAGCGTATCGGCGCATGAGGCTCGTTCCAGTCTTGTGGATTTCAAGCATCAAGCCGGGCAGGCGGGCTCTCGGGGTGCAACAGTGCCGCGCAGCGGCAGGATCACCGGCCCCCCGCAGTCACGCGCAAGCGCCAGCGGCGCTGAGCATGGCGAGGAGGGCGGGTGACCCACCTGTTGCAGCGCGAGGGACCGGCTGGCAGGAGGTTAGCTGCCAATTTCTTTTCGTTATCTTTTAATTCAAGTCTTGATGAACTTTGGATCAGGTGATGCAACGTTGCATCACCTGATCCAAACTAACCTACCAACTTTGTTAACTCAGCGTGTAATAAACGCAGAGCGCTCATATCCTTATCGCTGAATTTGACATTTTTAACTTCGAAATCACGTATTACCTGAATAATCTTTGGCGCCATTTTCTCAATATTTATAGCTTTTAATTGCCTATCAGGGTTAGACTGTCCATCTTTTCTATCCGGCTCAATACTAAAATTTCTCTCGTTAAGAGAAGCCACTGTGACTTCACCTGAGAGAGCCTTTTCCATAAGTTTGTCACGACGTGCCGGATCATCAATACGCGCTATCCCAACCAAAACGTTCATTGGTACGCTGGCCGCGCCAGCTTTTTCTAAAAAGCTCTCAGGCAAGCTTAATAACTTTATTGCCCTGCTAATCCTAGCTTGGTCCATACCCAGTTCTGCAGCGGCTTTCCTTTGGGACCACTGATTGTAATCTAATAAGTTTTTAATTCCCTTCGCTTCTTCTACTGGATTTAAATCCACTCTCATCAAATTTTCTATAAGCGCTGCAGAGGTAGAATTGCCATCGTGAGGAATTGCAAGAATTGATGTCCAATCAGCAAGTTTTGCAGCTCGGAACCGCCTCTCCCCAGCTACTATAATCCATTTTTCAGAATTTTCTGATGACTGTCTTACTAAAATTGGTTGCAGCTGCCCCTGCTGCTTCAATGACTCCGCAAGCTCTTCTAAAGAAGCTTGTTCAAAATTTCGCCTAGGCTGATTCGGATCAGGAATAATCTGATCAATCGACGCTTCAAATGTGTGCCTAAACTTACCGTCTCCGAGAAATGGGACGTCCACCTGCTCCATTACCTGAGAATGAGTTTGTGCAAACGCTGAAGCAGGGCGTTGTTTAAATGTGCGTTTCATGAATTTTCTCTTTTATCTGTATCCAAACTCAGAACTGAAGATGCGTTTATAAAATTACGACGCCCGGCTATTGCTTCAGCAAGGCGCACATAAACTCTGACAGAAGGACTACGGGGCGCGACATCCACTGGGATTTTAGATGCCCACGCAGCATTACTAAACGACGTACTGGAGGGTACCGGCTCTAGTACGGGTGCACGATTATCCATTACCCGAATTAAATGCTCTATAATTCCTACATCGACATACTGATTTTGTACAAATTGTGTTGGGATAACACCACCCAGTCTTAACCGAGGATTACTTCGACGATTTACTTTTGTGATTGTATCTAGAATTAGATTAACTCCCATAACATCGTACGGCTCAGTCCTAACGGGAACGAATACTGTATTTGACGACGCCAAAGCCATCCAAGTAAGAAAACCCAAATGTGGCGGAGAATCCAAAATGATAAATTCGTAATTATCTTGTACTTGAGATATTGCTTCTTTCAACAACCCTTCAGTCCCAGGTTCACGTCGAATATCCACTTCAGCCAAATCAATATGACTTGGACAGAAATCAACAGGAATTTCGACCTCCACTCCCTCAATAGGACCCTTTTTCACAATACAATTGGACATCGATGCATCTTTAAGCAAAGCATGCGCAAGTGTTTTTCCTTGACGATATAGAGGAACAATTTGCATTCCCAGCAAAGCTGCGGTTGCACTAGCCTGTGGATCAACGTCTATAAGAAGAACACGATATCCCATTTTAGAAAGTGCCACAGAGACGTTGAGTGCACTTGTGGTTTTCCCAACCCCTCCTTTTTGATTAGCAAAAGTTATAATTGTTGTTTGACGCTTCTCTCCATGGAGTAATGCTTCCAACTCAGAAGACACCTCAGCCGGCAGTGGTTGCCTACTGTTTTCCCAGCGTGAAATAGTATGCCGATCATAGCTTCGGTTCAGTCGTAAATTTAATACATCCTTTAACTGACTTTGCGACAATCCAAGCTTTTCACGAACCTCTTTAAGAAATGACATTAATGTTACTCCATAATACGCACTATCAACATCAACACATCAACATCAACACGTCAACATCAACAATGATCTATATCTAATTTTGTCCCCCTAAAGTGATGCAACGTTGCATCACTTTAGGGACGGATATTGCAAACTTAGGTTGAGATCAGACACTCATTTGACATCATCTAGAAACAGGGACTGTGTCATTGGTCCGCATGATCTGATGGTCGCTGGCTATGCACGCAGTCGAGGACACATTGTTGCAATCGGAAACCTTCATGAGTTTGAAATAATATACAGTCTTTTATCAAAGGATTTGCCCCTTAAATCAGGACACACCAATTCGGGCGATCACGAGATAGTACGGCTAATGGAATAGCGAATCTTCCATTAGGTTGCCTCACCGCGAATATGATGTTCCTTGAAAACACTCATCGGGCATGCATTACTCTAAAACTAAGAGCAGCTTCTAACAGGATCGCCATATTTGCAGCGATGTCCTTCATAGCGAAGAAACTAGAGTTTTTAAGTCCTCTCCCTTCTGGACCTCAGAGCCAGAAAGATTAATCATACATCCAACTAGAGAGTTAGTGATCTAAAGAAAGTACTATTCATTTAAGGAACCTGTTTATGACCGACGGTGCTGAGGAAGAGACCGAGAAATCTACCCAAGCTGTGAAAACGGAGCAGTTACTTTCCTTTAAAACATCAGGCAGAAATATCGCCCCAAACGCCTTAAGTGCCGAATTCAGCCTAATGAAAGGCTGCCTTAAGTCCCCCCTCACGCCGAAAGTAGCACTACGACTAGCTGATAAAAGTGGTAACTATCTGCTCAAACTCGTGAGCCCGCGCACCGATGGTGATAATGCCATCCTGCGGATCGAGGTACCACGTGGAAACAAACTTGGAGACATTCTTCCTTCCACAAACGCACCTGATATTCCATTTTGTAAAGTGATATTTCGACCACTTGAAATCGATGGATATCCGCCTCGCTCCGTCGTCGATCTAATTCGCAACCCCGAAGATCACACGACAATTGTGCTGGATGCATTTGCAGAGGTTTTTGGGACGGATGTTCTTGAAACCTTGAAGACTTCCCTCCTGACAGTTCTTCCTGCTCCAACCCAGCTCGGTATAGGCGAATTTCCGATTATATTTGTTCCAAGGCCAGATGGGCAGGATTTACAAATTACCCCTGTATCTCCTGCGGCCGCTTTTATGGGAATGAAACGCGTTAGGAAGCATTATTTTCAAAAAACACAGCCAGATCGCCCCATGCCGCGCAGCAAGTGGACAGAACAGGCTGTGAGCGCGAAGCCACAAAATATTTCTGGTGCTATCGGAGGGCCTCGGGTCCGTTTCCGGGCTGACATGCCGACACAGTTATCGCACGAAGAAGCGGACCTATTCCGCTTTGCACAAGGCGGCTCATTTCCACTGTGGCGTGACAGCGCGGTTGCCGCGCGGATCTTACGTTACGGGGATCGCTTAACGTCGGATAATGAATTTAACAATAAGAACACGCGGGCAGCTTTGAACCAGGTGGCGGATGACTTGATCTCCGATGCTGTTGAGTTCATCCAAGATACCTTGCGCGACACTGTCGACTACGCGAAGCGACAAGGAATTGCCGAAAAGCACTCGGCTCTACCTTCTCTGCCCCAGTTACTGTTAAAACGGAGATGGAAAAATACAGATGAAGAAGACAAAGCACGCAAAGCTCTGACAAGTCCTCATTTTGAGCTTCGTCTTGCGAAAAGCCGTATGGCCGCGAAGGGTTTGTAATGCCAGAACTCCGTAATCTCTTTGTGCTACGCGACATTCAAGCAAGCCGCGTCAATCTGATTATGAATGATTATGCAGCGGGTCTTCCCAGCCCTTTATCCTTTTTGGGATTAGGAGATTTTTTAGCACGTCGGCTAAAATTAAAGCCTTGGAGCGCGTCTGTTCTCCCTATCCTGCATGCCGTCCGTGTCTCAGAGGGGCGGACAAAGCCAGAGATGGAGAACAAATCTAGCGTTTTCGCGCCAATTGAGACAATGGAAGACTTAGTCGGCTCGGTCACGGTTTCTTTGCTGATTAATCTGCCTGGATGCGAAAGCGAAAATGCCTTGGCGCGTGCCTTAACTGGCTGTCGTATCGCTGGGGGTATTATTCAAAATAATGATGTAAAAGTTCAGGCTCTAACGCCGGATGGGTCCGCCTTTAGAAGCCTTCGGCGCGGATATGCCATGCTGGCTCCAGAACAAAACGAGCGGCGGGTTATTTCCAAAGGCGACAATGATAGCCTGACCGAGATTGCGACACTGTTGTTTCCCGTTGAACGGCCTGCGGGCTTTGGCTGGATCGTACCCGCGGCAGTCGGCTATCATTTGCTTGAAGACCCAGAGCATGCTCCTAAGCGCACCCGCACCAGAAGCAAAGATATCCCTCACGTTTTTGCTGAGCCTGTACTTGGCATCGCTGAGTTAGTCTCTGTGCGCAATCGACGGCTGACGGAACTATCGCCGGAAGCATTCGCCTCAGCTTTTTGGCACTGGGACGCACGCGAAGACATGCTGTTGGGGCATTCTGCCTATTTTTCAAACAATAAATTAAATGATGTGACCAAGGAGGTTTTGAACCATGGCTAAAACCAGCGCTCCACTGTCCCTTGAAACCGGAATGCTTGCTTTTGCACGCTCTCTCCAGATCACGGAAGGCTTGTTCTACGCCACGCGCAAAGCAGATTCAGCTATTGCGGCACCTATTGAAATTCTTGAAAAGGGCGTGCGCGGACAGTCATCGGAAGACAAAGCGAAAAATCCCGGTCTGAGTAATCCTCAATCTGTTGAATATGCAATCGTACCCCAAGGGCATGACGGGGTTCGTTTGACATTCTCTATCCGTTTTATGCCGTTTTCACGTGCTCCGCACGCTTGCAATAATACCGATGTCGGGTCGGCTTATGAACGCCTCGCTACGGCATACCGTGCAGCCGACGGGTATAAGGTTCTAGCCGGTCTATATTTGTGGAATATCGCCAATGCGCGTTTTGCGTGGCGCAACCGTTTCCAAAGCGACGCCATGTCGGTGAGTATTACGACAAGTGGTGGAACACGTCTGAAGTTTGATCCGTTTAAGCTCAGTCTTACAGAACCAGCATCAGCCGTAGAACTGAGTGCCGCCTTAATCGGAGGCAACACTGCCGACATTGAGAAGGTCATTGATGGCATAGCGTTCGGTTTGGCGAACGAAGACCATGAAGCCTACACGGTGAATGTATCTTGGGATGCAGAAATGGAAGCCGGTCAGGAGGTTTTCCCATCGCAAGAATATGTTCGCGAGGAAAAGGCCATCGCCAATTTGAGCCGCGTTTTTGCAAAGCTTCCGACCAAATGGGGTGGTCGTTCGCTGATGCAGGCTTCCATGCATTCGCAGAAAATAGGGGCAGCCCTACGCTGCATCGATATTTGGCACGGTGATGAAGATGAGACGAAACCAATACCGGTTAATCCTTATGGCGGCGTACAGGAAACCGGTGCCGTTTTGCGCAACTCCAAAACAAAACGCAGCTTCTACGACTTACGCAAAAACGGCGAGACGCTGATCGACGGAATAGAGAGCGCCACAACCGTTAGTGAAATTTCGGGCGATGCTCATTTTGTCATGGCAAACCTGATCCGAGGGGGCGTCTTTGGATCAAGCAGCAAAAAGGCAGAGGGGTGATGGACACACTTAACGAGGGAAAGGGGCGTTGTCTTGCGACGTCCCGCCTTCTGATCCGTCCTGTGCGCGCAGTCATCGAACTCGGAGCAGCCGGGGATGTGCTCTCTGGTGCTATCAGAGCTGTGCATCACGCCAACCGGCACGGCTCGGTTGAAGATTTCATTGCCATCGCCTTCCCCACGATGAGGATGGGGCGAGAAACCATGCTCTCCGGCGACGATATTGAGTTGATTGGATCGGATGCGTCGCTTGGTAGATTTCTCGAACTTGAGGGAATCATAACACTGAAAAGACGTGGTATGCTGGAAGACACTTCCCTAGATCAGGTCTATGCCGATGAAGGCATGATAGGCGCGGCTTACGTTCGCGATCGGGCTGTTGAGAAGCACACTCCCGGTTGGATACGTCGTACTGAAGCCCGCGCAGCGCGACGCGGAAAGCCCTTGGGAAAGGCTGTCAAGCAGCGCGAGAACGATCTAAAGTCTCTTGTCCTCACACATGGTACAACGGTTTTGCATATCCGTGAGGTCGTTGGTCCCTTCACTGATCGCCCGTTGCACGTCAGTACTTATGGTTTTTCAGGTTCTGGCGATCCGGCTATTTTGCCCGTGTTTCCAGAAAGTGCCCGTACGGTGGATAATGCAGATTAAAGGTGGCCCCCGCCTTCTGATGACAGATCGGGAAAGCGCTGTTTATCTTGAACATGCAAGAATACACGTTGAGGGAGAACGGGTCGTTTATCACATCGACGACGATGAAAATCGCCGCGAATTCAATATCCCTCATGTCAATCTCGCGGTTCTCTTTATTGGTCAAGGCACCTCAATCACTCAGGGAGCAATGCGTCTGCTGGGGGAAGAGGGCGTGCATCTTGCTGTCACTGGCAGTGGCGGCACCCCTTTGCATATGGGAAGCTTAACAAGCTACACCGCAACGCGACATTTCAGAGAGCTATTGCCAATATACTTGTCGGAAGAAAAGTCCTTGAAAGCGGCCATATCGGTCATGCAAGACCGGACGTTGCGCATGCGTAAGCTGGGAGGCAAGGGGGCTACGAAAGTGCTCAGGGCGCGTGAGACGATGGGGCTATCGAAAAAATGTGCCTCTTTTGAAGAAAACCTCAAAACCTGCGCCAGTATTCAACAACTTTTGGGCTTCGAAGGGCAATTTACAAAGGCATGTTATGCCGAATTTTCCTCAATCGCGGCGCTCCCGAAAGATAGTCCGTTTCGTCGTGACGCCGGTGCAGGCGAAGCAACGAAAGGCCCTACTGACCGGGTCAAGCTCATTAACCGTTTGATCGATCACGGGAATTATCTTTGCTATGGGATGGCCGGTGCCGCACTTTGGGCCTTGGGGATCCCTCCACACATGTCGATCTTCCACGGCAAAACGCGGGCAGGTGGATTGGTTTTTGATATTGCAGATAGCTTCAAAGATGCCTTGGTGCTGCCCTTGGCTTTTGCCCCGTATAAGGAGAAAAGCGTATCCGAAGCGGAAAAAACGTTTCGCGCAAGACTAATTGAGGCTTTTAACGATCACGCTATCCTTAAGGAGGCCATTGCGACAATCGACAAGCTGATCCTCGCGGCCAACCTGACGTAGAATTTCGAATATGCGTGAGATCATCGCCGTCTGTAGATCAGAAAAAAAATCCCGCGAACGGGTCGCCCGAATTCTCGACAGGTATTTCTGGCGTATCGGGGATAGAACTTGGCGCGGCAAAGCGACAAATGCCTGCCTGAACCGCGTTTCGCGTGAGTTGCGCAAGACCGCCACGCGAAATACTGCCGTGGTGATCCATGAGATCCGATCCGCTGCCTCAAGCCGGAAACCGCTGATCCGTATCGGTGCACAGCATGCTTTCTCGGAAGAAGGTGCTGTTCCTGTCGCTACGCACCCCGCTGCTTTTTCTAATCGCAAATCATCCCTTGCATCCCGGGAAACCGCAGAAGCAATCTTACGCATCGCAGCCTTGTTTCACGACCTTGGTAAAGCCACGGCTCTTTTCCAAAAAAAGCTCGACCGAGCAATAAAGAAAGGTCCGCCTGAGTCCGACGCTGTACGTCACGAGCTATTCTCCGCTGCAGTTTGGGACGACCTGTTCGGCACAGTTACAGACGAAAAGTTGCCTCATCGAATCAAAAATCTGACGCCTGGTGAGATCGATGCATCTTGCCTTCGTGTGCAGGATCTCTTGGGCAAGCTGCATCGCGCTCCCGATCGAAATTTAGGCCTTGCCTTTACTCAAAAGGCAGATCACTTGACTTATCATATTGGCATGCTGGTTTTGACCCATCACCGCTTGCCGACGGCAAATCGCGATTGCAAGACCCTTCTGGCCGAACAGCATGTGAATCAGAACGGCGCTCTGACGCCTCATGTTGATCTTGCCATAGCGCCAGGAACACCATTTTGGCATGAAAACTGGTGGCTCACAGCGTTGAACCGGGAAGCTGATAGGCTGCGTTTGGGGCCGCCGATAGCAAGCGCTGACATGGCCATACGTGCCTCGCTCATGTTGGCGGATCACCTAGGGTCTGCGATGAAGACTAGCCGAGAAATCAGTGACGGACATTTAGCGAATACTATTCGAGCCGAGGCAGAGAACCTGATATTACGCGCCGATAGCCTGTCTCGTCACACGAAACGTGTCTACCGATATGCTCGTTTTTCTTTTGATGCGACGCACCATCAGAGGGATCGTTATCCTGCCCTGAGCGAAGACGCGATGCCTCCCGAGGTCGCGTTCCCTCAACTGTCTTGCGACACACGCTTTGCGTGGCAATCGACCGCCGCTCAGGCCGCCAAGAAAATATGCTCGGAAAACGAAGGCGGATTTTTCGCCGCGATTATTGCTGGGACAGGCACAGGGAAAACGCGCGGCGCCCCCACCATTCTTGCGGCTGCTAGTCTAGGAGATCATGTCCCCGAGCGCCGCTATTTCAGAATGAGCCTTGGCCTCGGATTACGCGTTTTAGCGACACAATCGGCTAAAGAATATGTCAAGGACCTGAACTTCGGTCAGAAAGATGTGTCGGTTTCGGTCGGTGCGCCGCCTTTAGAGTTTGGCGAAGATCTACCCGATTCCACTGATGAAAGTGGTTCAGAAAGCCTGGTGAATCTGCCTGAATGGCTACGCATTCAGCAGGCAGATGGTCCCGTTCCCGAAGAGGGCGCTGAGCAAGAAGAAGACTGGCTTCGGGGACTTTCTTTAGATACGGGGCGCTCTCTTCCCGCTTTTGTGGATATCTTTCTCGATCTAGCAGGAGACAGGAATGGAAATGGTCGCAGGTTTTTAAACGCACCTGTTCTGGTTTCAACGATCGATTATCTGATGCCCGTTGCCACCCCGACAAGCGCACGTTTTTTGTTGCCCGCGCTGCGTGTCCTAAGCTCCGATTTAATATTAGATGAGATCGACCAGTATGATGGAGAAGATCTCGCTGCGATCGGGCGTTTGGTATTTCAATCAGGGGCCGCGGGGCGCCGTGTCATCATCATGTCAGCCACCCTCACCAAAGAGATCGTCGAGACACTGCATTCTTCCTATCACGCGGGCTGGAAGGAGTATGCGCGGTCTTTAGGATTATCTGAAAGGTGCAACCTTTTGATTTGCGGAGATAATCCGGTCTCGCTTTCCGTAAATGCTGACGATGAGCCTTGCGACGTGGTGTTTGAGCGTTGTTGTGATGCGATATTGCGTGGCTTTAGAACTGCCCCGGCCCTACGGCGGGGGAAGGTTCTCCCTCCCGTACCGACATGGGAAGGACTTATTACAACTGTTGATGAGGCTTGTTCAGCCGCACACAACGAGAATGCCATAGATATTGATGGTGTTCGCGTTTCTGTGGGGCTCGTTCGGATGACGCGCATTTCGCACACGGCCGATTTGGCACGCCGATTAAATGCTGGCGATATTGGCGGAAGGCTCCGCGTGCTCCTTTGTCTTCATTCACAAATGACCAGACTGGTACGGGGTTGGATTGAGTCCCGCCTGAAGCAGGCCCTTACCCGTAAGGGGGACTTTCCCGAAGCTGGCGTCAAAGCTCTATGTGATGAAGTACGGCTCTTTGAACGTGCACGAAAAATTTGTGCCAAGGATATTGAAATCGTTGTTATCACCTCACCCGTTATTGAGACCGGTAATGATCTGGATTTTGACTATGCAATTATCGATCCGATTTCGACACGCGCCATCATTCAAACCGCAGGAAGGGTAAGGCGGCACCGCCCCCCTCTCGGAAGTACCCCGAATGTGTGGATATTGGGACAAAGTCCTATTGCCTTGGATACTGGGAAGCTCAAAATGCCAGGTGTCGAAACGGAATTGAGTGGGGACACTCAAGTTCCAAGGTCTTCTCTTCTTGATGGCTATAAAGGACGCTTATTTTCGGAGATCGTAGGCAATAACGTGTTCGAGCGCATCGATGCCGTAGCAATCCTTGACGATACTGTGTCCTTTCCCCTCAAACAGGCAGAGGCTGAGCTTTGTCGCAAGATGCTCTCTATCAGCTCTGACGCGCCGTTGGGGCGTTACATTTCATCTTCCACGGCGCGATGGAATACCAAGGTTTCTCAGATGCGTAAATTTCGCAGAACCAACCAGCGCGAAATTCGCTTCGTGCTGAAAGGCGAAACTTTGTCCGATGCTGTTTGGTATGTGGATATGGCCCCTGGCACAAGACATAGCGTATTGCGGGAGGCCGGAGACAAAATGCGCTCTACCTCTCTACGCTCTCCTGAAGCTGTTTTGTTGGCTGATACCATCGAACGGGCCTGGCGGGCCTATGCTCAAAATAGTCGAGAAATTTCGTCATTGGAACTTGCGTCCCTTATGAGCGTGGGGCTTCCTGTCTGGGATAATAAAACGTGTGTGGATGTTACCATGGACGATCTGAGCGGACTAACTCGAGGCTCTTTTGAGGATTAATCATCTATCCTCGGGAAAAGTATTTTATTACAGTGCGTTAACCCCGGTCTTTGAAAAGATGGTATGATAGCTCTAGTGTTGCTAGAGCTCTGAAATAAATAACTTTTGTGTGCGTACACTTACTGCTCCCCGCCGGGTAGGCGGCTGAGAGATTCAAGATCGCCGGAATTGGGACGAGCAGCACGCTCCCCGCCGGGTAGGCGGCTGAGAGATCGGATACGCCGCTAAAAGTTTTAAAGCTATAGCTCCCCGCCGGGTAGGCGGCTGAGAGAGATTGAAGATCAGGGCGTTTTACAGTCTGGCAGCTCCCCGCCGGGTAGGCGGCTGAGAGATTAAGTCTGGAGCGGTGCTGTCCTTACACATTGCTCCCCGCCGGGTAGGCGGCTGAGAGAAAACAGAGACATCAGACGGCAAAGGTTCAATCGCTCCCCGCCGGGTAGGCGGCTGAGAGAAAAATTCAGGAAATAAACAATGACAGCCCCTAGCTCCCCGCCGGGTAGGCGGCTGAGAGAAATAATAATAGCTCTTTTACGCCAGTTCTGTAGCTCCCCGCCGGGTAGGCGGCTGAGAGATCAACAGACGGGGCAGGAGGCGGGCCAACGCTTCCCGCCGGGTAGGCGGCTGAGAGATGAACATCAGCGGACAATTCGTTGGCGAACTGGCTCCCCGCCGGGTAGGCGGCTGAGAGATAGTAGTGCCTGCACCAAGCATCAGCGGCTGGACCCGGCCCTGTTGCAAAGTTGAGTATCCTGTTTGAAACCGCTGCCGGTAGTTAGAGTAGGGTTTGGCAGATGGGTGATTTCAAGGGACGTCATTTTCGTGGTGAGGTAATCCTGTGGGCGGTGCGTTGGTATTGCCATTACGGGATTAGCTACCGGGATTTGGAAACCATGCTGGCCGAGCGGGGCGTGAGTGTTGATCATTCAACGATATATCGCTGGGTCCAGCGCTATGCTCCCGAGATGGAAAAGCGCCTTCGGTGGTATTGGAAACGCCCGGGCTTTTCCAGCAGCTGGCGGGTTGATGAGACCTACATCAAGGTCAAAGGAAAATGGACCTATCTGTACCGGGCTGTTGGCAAAGGCGGTGATAGGATTGATTTTTTCCTCTCACCGACCCGGAGCGCTAAGGCGGCCAAACGTTTTTTGAGCAAAGCTTTAAACGGACTGAGAAGGTGGGAGAAGCCTGAAACAATCAACACAGACAAGGCGCCGACCTATGGTCGGGCCATCAACGAATTCACGAAGAACGGTAAGCTACCCGACACGGTGAAACACCGTCAGGTCAAGTATCTAAACAGCGTGATTGAGGCTGATCATGGCAAACTCAAACAGCTGATCAAGCCGGTCCGTGGCTTCAAAAGCCTGAAAACCGCTTATGCGACGATCAAAGGTTTTGAGGTCATGCGGGCCTTGAAAAAGGGACAAGCTGAACTCTTCCAATTCCAAAAGGGTATCATGGGAGAAGTGCGTCTCATCGAGAGACAATTCAGCTTCTAA
Protein sequences of DBSCAN-SWA_8 >NZ_CP032486|64987:81223|72867_73716_+|WP_141494021.1|DBSCAN-SWA MPELRNLFVLRDIQASRVNLIMNDYAAGLPSPLSFLGLGDFLARRLKLKPWSASVLPILHAVRVSEGRTKPEMENKSSVFAPIETMEDLVGSVTVSLLINLPGCESENALARALTGCRIAGGIIQNNDVKVQALTPDGSAFRSLRRGYAMLAPEQNERRVISKGDNDSLTEIATLLFPVERPAGFGWIVPAAVGYHLLEDPEHAPKRTRTRSKDIPHVFAEPVLGIAELVSVRNRRLTELSPEAFASAFWHWDAREDMLLGHSAYFSNNKLNDVTKEVLNHG >NZ_CP032486|64987:81223|68539_68914_-|WP_068173535.1|DBSCAN-SWA MRRYAFPYIPNRPVRVPQRWTDTFAPLSCRRCSAVWEEGDPALRVPCRGCGAGPSEPCRRSKGGNERVCACRDEDAVRLGMLEPCEGLTWDGRHEKPLRLRAEPVPSALMCRAVRTGAPISRWG >NZ_CP032486|64987:81223|70060_71095_-|WP_141494019.1|DBSCAN-SWA MSFLKEVREKLGLSQSQLKDVLNLRLNRSYDRHTISRWENSRQPLPAEVSSELEALLHGEKRQTTIITFANQKGGVGKTTSALNVSVALSKMGYRVLLIDVDPQASATAALLGMQIVPLYRQGKTLAHALLKDASMSNCIVKKGPIEGVEVEIPVDFCPSHIDLAEVDIRREPGTEGLLKEAISQVQDNYEFIILDSPPHLGFLTWMALASSNTVFVPVRTEPYDVMGVNLILDTITKVNRRSNPRLRLGGVIPTQFVQNQYVDVGIIEHLIRVMDNRAPVLEPVPSSTSFSNAAWASKIPVDVAPRSPSVRVYVRLAEAIAGRRNFINASSVLSLDTDKRENS >NZ_CP032486|64987:81223|73708_74791_+|WP_141494022.1|DBSCAN-SWA MAKTSAPLSLETGMLAFARSLQITEGLFYATRKADSAIAAPIEILEKGVRGQSSEDKAKNPGLSNPQSVEYAIVPQGHDGVRLTFSIRFMPFSRAPHACNNTDVGSAYERLATAYRAADGYKVLAGLYLWNIANARFAWRNRFQSDAMSVSITTSGGTRLKFDPFKLSLTEPASAVELSAALIGGNTADIEKVIDGIAFGLANEDHEAYTVNVSWDAEMEAGQEVFPSQEYVREEKAIANLSRVFAKLPTKWGGRSLMQASMHSQKIGAALRCIDIWHGDEDETKPIPVNPYGGVQETGAVLRNSKTKRSFYDLRKNGETLIDGIESATTVSEISGDAHFVMANLIRGGVFGSSSKKAEG >NZ_CP032486|64987:81223|64987_67669_-|WP_141494016.1|DBSCAN-SWA MRTTQLSFFETSALNDAPSLFWGSLPGDHAVAEARHVAQEVVDVVKRDFRLSGTRGLAAGWKARGRDNLNAIRVLTQLETENRPATIDEQAQLARFTGFGAGELANSLFPRAGEAFREGWEELGQQLEQATSVQERAGLARATQYAHYTPEVIIHAVWDVILQMGFKGGKALEPGCGTGLFMAAMPETLAENTAWTAIENDPLTARITQKLFPNQWVRQEDFTKARLADRYDLAVGNPPFSNRTVRGTDALGKMGLSLHDYFIARAVSRLRPGALAVFVTSRWTMDKTDHKARAHIAEMADLVGAVRLPAGAMRDDAGTEVVVDLLVLRRHVPGEDTADTLWQDVGTVPQSDQGDGALQINQYFLDHPEQVLGQHGWTTGQYGPEYTCTAPAMQALDEALPNALSRLKGQVRFPEPQAVTPHSAQEARAALASAGQGSVLREGSYVLIKTELHQIIDGETTPVRVRKGDQKEGIFQKHANIIKGLIPVRDAARSVLRAQMENAPYAGFQQDLKRAYAAFVKAFGPINLVKTSVRVDPETGEESSTQRRPHLQPFYDDPDVWLVSAIEEYDEDRDEGRPGPIFSERIIHAPSEPEIASAHDALAVCLHEVGGVDMPRIAELLGCSEEAAQAELGEAVYRDPVRSTPERPLWVMADEALSGAVRTKLAQAREAAETDPAYTSLVRALEDAQPADLRPSDITARLGAPWVPASDVEAFIAEKIGVEAQVWHTAEVAAWSVEKAPFYGKAEATSTWGTQRRPAQDLLEDALNQSTPKIFDTIRDPEGGERRVLNARETEAAKEKLMALREAFSQWVWEDGERAQRLVRLYNDTYNNLVPRQFDGRHLQLPGASSVIRLRDHQKRVVWRIITAGSTYMAHAVGSGKTFSGDVTLSFLL >NZ_CP032486|64987:81223|69191_70064_-|WP_141494018.1|DBSCAN-SWA MKRTFKQRPASAFAQTHSQVMEQVDVPFLGDGKFRHTFEASIDQIIPDPNQPRRNFEQASLEELAESLKQQGQLQPILVRQSSENSEKWIIVAGERRFRAAKLADWTSILAIPHDGNSTSAALIENLMRVDLNPVEEAKGIKNLLDYNQWSQRKAAAELGMDQARISRAIKLLSLPESFLEKAGAASVPMNVLVGIARIDDPARRDKLMEKALSGEVTVASLNERNFSIEPDRKDGQSNPDRQLKAINIEKMAPKIIQVIRDFEVKNVKFSDKDMSALRLLHAELTKLVG >NZ_CP032486|64987:81223|80515_81223_+|WP_141494025.1|transposase|DBSCAN-SWA MGDFKGRHFRGEVILWAVRWYCHYGISYRDLETMLAERGVSVDHSTIYRWVQRYAPEMEKRLRWYWKRPGFSSSWRVDETYIKVKGKWTYLYRAVGKGGDRIDFFLSPTRSAKAAKRFLSKALNGLRRWEKPETINTDKAPTYGRAINEFTKNGKLPDTVKHRQVKYLNSVIEADHGKLKQLIKPVRGFKSLKTAYATIKGFEVMRALKKGQAELFQFQKGIMGEVRLIERQFSF >NZ_CP032486|64987:81223|67781_68045_-|WP_141494017.1|DBSCAN-SWA MTRQSQPIPAGFSLIQAHREGWTLHTLPHPQNGPEGETQLRGLDCGDQQAWWRVASKARLGSRYHLSALALISAAEREQIALRFGPF >NZ_CP032486|64987:81223|74790_75441_+|WP_170211111.1|DBSCAN-SWA MDTLNEGKGRCLATSRLLIRPVRAVIELGAAGDVLSGAIRAVHHANRHGSVEDFIAIAFPTMRMGRETMLSGDDIELIGSDASLGRFLELEGIITLKRRGMLEDTSLDQVYADEGMIGAAYVRDRAVEKHTPGWIRRTEARAARRGKPLGKAVKQRENDLKSLVLTHGTTVLHIREVVGPFTDRPLHVSTYGFSGSGDPAILPVFPESARTVDNAD >NZ_CP032486|64987:81223|76406_79724_+|WP_141494024.1|DBSCAN-SWA MREIIAVCRSEKKSRERVARILDRYFWRIGDRTWRGKATNACLNRVSRELRKTATRNTAVVIHEIRSAASSRKPLIRIGAQHAFSEEGAVPVATHPAAFSNRKSSLASRETAEAILRIAALFHDLGKATALFQKKLDRAIKKGPPESDAVRHELFSAAVWDDLFGTVTDEKLPHRIKNLTPGEIDASCLRVQDLLGKLHRAPDRNLGLAFTQKADHLTYHIGMLVLTHHRLPTANRDCKTLLAEQHVNQNGALTPHVDLAIAPGTPFWHENWWLTALNREADRLRLGPPIASADMAIRASLMLADHLGSAMKTSREISDGHLANTIRAEAENLILRADSLSRHTKRVYRYARFSFDATHHQRDRYPALSEDAMPPEVAFPQLSCDTRFAWQSTAAQAAKKICSENEGGFFAAIIAGTGTGKTRGAPTILAAASLGDHVPERRYFRMSLGLGLRVLATQSAKEYVKDLNFGQKDVSVSVGAPPLEFGEDLPDSTDESGSESLVNLPEWLRIQQADGPVPEEGAEQEEDWLRGLSLDTGRSLPAFVDIFLDLAGDRNGNGRRFLNAPVLVSTIDYLMPVATPTSARFLLPALRVLSSDLILDEIDQYDGEDLAAIGRLVFQSGAAGRRVIIMSATLTKEIVETLHSSYHAGWKEYARSLGLSERCNLLICGDNPVSLSVNADDEPCDVVFERCCDAILRGFRTAPALRRGKVLPPVPTWEGLITTVDEACSAAHNENAIDIDGVRVSVGLVRMTRISHTADLARRLNAGDIGGRLRVLLCLHSQMTRLVRGWIESRLKQALTRKGDFPEAGVKALCDEVRLFERARKICAKDIEIVVITSPVIETGNDLDFDYAIIDPISTRAIIQTAGRVRRHRPPLGSTPNVWILGQSPIALDTGKLKMPGVETELSGDTQVPRSSLLDGYKGRLFSEIVGNNVFERIDAVAILDDTVSFPLKQAEAELCRKMLSISSDAPLGRYISSSTARWNTKVSQMRKFRRTNQREIRFVLKGETLSDAVWYVDMAPGTRHSVLREAGDKMRSTSLRSPEAVLLADTIERAWRAYAQNSREISSLELASLMSVGLPVWDNKTCVDVTMDDLSGLTRGSFED >NZ_CP032486|64987:81223|75430_76396_+|WP_170211112.1|DBSCAN-SWA MQIKGGPRLLMTDRESAVYLEHARIHVEGERVVYHIDDDENRREFNIPHVNLAVLFIGQGTSITQGAMRLLGEEGVHLAVTGSGGTPLHMGSLTSYTATRHFRELLPIYLSEEKSLKAAISVMQDRTLRMRKLGGKGATKVLRARETMGLSKKCASFEENLKTCASIQQLLGFEGQFTKACYAEFSSIAALPKDSPFRRDAGAGEATKGPTDRVKLINRLIDHGNYLCYGMAGAALWALGIPPHMSIFHGKTRAGGLVFDIADSFKDALVLPLAFAPYKEKSVSEAEKTFRARLIEAFNDHAILKEAIATIDKLILAANLT >NZ_CP032486|64987:81223|71710_72868_+|WP_141494020.1|DBSCAN-SWA MTDGAEEETEKSTQAVKTEQLLSFKTSGRNIAPNALSAEFSLMKGCLKSPLTPKVALRLADKSGNYLLKLVSPRTDGDNAILRIEVPRGNKLGDILPSTNAPDIPFCKVIFRPLEIDGYPPRSVVDLIRNPEDHTTIVLDAFAEVFGTDVLETLKTSLLTVLPAPTQLGIGEFPIIFVPRPDGQDLQITPVSPAAAFMGMKRVRKHYFQKTQPDRPMPRSKWTEQAVSAKPQNISGAIGGPRVRFRADMPTQLSHEEADLFRFAQGGSFPLWRDSAVAARILRYGDRLTSDNEFNNKNTRAALNQVADDLISDAVEFIQDTLRDTVDYAKRQGIAEKHSALPSLPQLLLKRRWKNTDEEDKARKALTSPHFELRLAKSRMAAKGL |
12 | Vibrio_phage(42.86%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
85270 : 86182
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >NZ_CP032486|85270:86182|DBSCAN-SWA GTTGAAGGGTCAAAGGGTCGGTTATGTCCGTGTGAGTACATTCGATCAAAATGTTGATCGTCAATTGGATGGCGAAATACTGGATCGGGTTTTCACCGATAAAGCGTCCGGCAAAGATGTTCAGCGGGCTCAGCTTGATGAGCTTCTTGCCTTCGTGCGTGAAGGCGACATCGTGGTCGTGCATAGTATGGACCGATTAGCGCGTAATCTCGATGACCTCCGCAATCTAGTTCACACGCTGACGCGAAAAGGCGTTTGTGTTGAGTTTATCAAAGAGCGTCTCATTTTCTCAGACAAAGATGAATCCCTGCCCAAATTGATGCTTTCGGTGATGGGCGCTTTTGCAGAGTTTGAACGCTCTCTCATTCGGGAAAGACAGCGCGAGGGTATTGCGCTGGCTAAAAAGCGTGGTGCCTATCGTGGCCGGAAGCGGGTTCTATCTCACGAACAAGTTGCAGACATCCTCCAGCGTATCAGCGACGGTGAGACAAAAGCTGCGATTGCCCGCGAGCGCGGCATAAGCCGGGAAACTCTCTATCAGTATCTAAGAGCTTATCCGAATGGCGGGGTTCGCATGCGGCCGGTAAAAGAGGTTGTTCACGACGCACCTATGGGCGCAAAGCCGTTCATTGCGACCCTTATTCTGAACATTCAGAGTCAGTCCGCCTCCGGACCGAGCCTTTCCTCGGTTCGAGCTGAGATCGAAGACATGCTGGCGGAAGATTACGATGTGCAGAAGAACGAGAAGGGCGAGTATAGGTTAGCTATTCCTGCAGAGTACACGCGTACGGAGCATGATCTCACTGCTGAGATAGCGGATCTCTTTGACGAAATTGACGCCATTGCTGAAGAGTATGTCTGTACGGCCGTCGGAACGCTTCGTGAAATGGGAGGGCAGGAGCGTGTTTGGTAA
Protein sequences of DBSCAN-SWA_9 >NZ_CP032486|85270:86182|85270_86182_+|WP_141494028.1|DBSCAN-SWA MKGQRVGYVRVSTFDQNVDRQLDGEILDRVFTDKASGKDVQRAQLDELLAFVREGDIVVVHSMDRLARNLDDLRNLVHTLTRKGVCVEFIKERLIFSDKDESLPKLMLSVMGAFAEFERSLIRERQREGIALAKKRGAYRGRKRVLSHEQVADILQRISDGETKAAIARERGISRETLYQYLRAYPNGGVRMRPVKEVVHDAPMGAKPFIATLILNIQSQSASGPSLSSVRAEIEDMLAEDYDVQKNEKGEYRLAIPAEYTRTEHDLTAEIADLFDEIDAIAEEYVCTAVGTLREMGGQERVW |
1 | Salmonella_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|