Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP015017 | Polynucleobacter asymbioticus strain MWH-RechtKol4 chromosome, complete genome | 0 crisprs | c2c9_V-U4,DEDDh,Cas9_archaeal,csa3,cas3,WYL | 0 | 0 | 3 | 0 |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1380998 : 1387992
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP015017|1380998:1387992|DBSCAN-SWA AATGAGTAATGAAATCGTAGAAGCAAGTCAACCAAAACAATCCTTTTCTTTGGCTCCAAAAGACTTTGAGCAAGCCTTGAAGTTTTCCGAAATGATGAGTAAATCCAACTTAGTGCCTAAAGAATTTGTGGGCAACGCTGGAAACATCATGGTGGCAGTCCAATGGGGCATGGAGCTGGGCTTACAACCCATGCAAGCCATGCAGAACATTGCCGTAATCAATGGTCGCCCTTCCCTCTGGGGTGATTCAGTTATGGCCCTGGCACGCAGTTCAAGCCTATGTGAATACATCATTGAAGAAGATGACGGAAAAGTAGCTACTTGCCGCGCTAAGCGCAAAGGCCAAGATGCTGAAATCGTTGCTACATTTTCAATGGAAGATGCCACTAAAGCTGGCCTTGCTGGAAAACAAGGCCCTTGGACTCAGTACCCAAAACGGATGCGTCAAATGCGTGCTCGTGCTTTTTGCTTGAGAGATGCTTTCCCCGATGTATTGCGTGGAATGCCGATCGCTGAAGAGCTACAGGACATTGAAACAGGTGAAGTTAAACCACAACCACAGACAAGCGCTCGCCCAGTGATTGAGAAATCGATTTATTCAGACGAGGAATTTATTGAGAAGTCTGAGAAGTGGGCGCAAATCGTTGAGAGTGGCCGCAAAACTCCTGACGAACTCATCATTTTTATCGAGTCTAAAAATGCAGTTTTGACCGATGAACAAAAAGCAATTATCCAAACTTGGGCAAAGGCAGAATAATCATGCAAGTTCATAACCTGATTCAAGGCAGCGATGCCTGGCACGAATTCCGTGAAACTCATTTTGGGGCAAGTGAAGCCGCTGCCATGATGGGGGTATCTAAATACATGACTCGCAGCGAGCTTTTGAAAATGAAAGCGACTGGCGCAACGAAAGCGATCGATTCATTTACTCAAAAGATTTTTGATAATGGTCATGTAGTCGAAGCACTAGCACGTCCATTAGTTGAGGCCATGATCGATGATGAGCTTTATCCGGTCACTTGCTCTGACAGTAATCTATCGGCTTCTTGCGATGGCCTGACCATGGCTGGCGATATTGCTTTTGAGCATAAACAGTGGAATGAATCATTAGCTGACTCAGTGCGTGATGGTGTTTTACCCGCCGAGCACGTTCCCCAGTGCCAGCAAATCCTGATGATTACTGGTGCCACAAAGGTGATTTTTACGGTATCGGATGGCACGCCTGAAAACATGGTGACGATGGACGTTTTCCCAAGCGAGTATTTCTTTGGTCAGATTCGTGCCGGTTGGGATCAATTCGCCAAAGACTTAGCAGCGTATGAGAATAAGGCCATTGCAGACAAGCCAGTAGCAGACTCTATCGAAGCCTTCCCAGTAGCCACTATTCAGGCTAAGGGTGAGCTCATTTTGAGCAATCTTCCTGAAGTCCTGCCACGTTTTGATTTATTCCTCGCCAATCAAAAGACTGATTTAGTAACCGATGATGATTTTGCTAATGGCGAGGCCGTAGCTAAATTTAGCCGTGAAACTGCTAAGAAATTGCGCCTGGTCGCAGATCAAACCATTGACCAAATTGCCTCAGTAAGTGAAGCCGTAAGAGCTCTTAATAATTATGCCGATCAGTTCGATGGGCTTAGCCTGAAATTGGAAAAGTTGGTTAAGAGCGAAAAAGAGGCTCGTAAGGATTCGATTATCAATGCCGCTAAGTTGAAATGGCGTGAGCACCTGGACGCAATCGATGCTGAGCTCAAGACAGTTCACCTTCAGATCATTGCCCCTGACTTCTTAGCCGCAGTTAAAAGTAAAAGAACGATTGAGTCAATTGAAAACGCAGTAGATACGACTCTAGCGAGTGGGAAGATTGCAGCCGATGCCTTGGCTAAGACTTACCGCGCAAACCTTACATGGGCCCATGAGAACGCT
TCTGAATTCAATTTTTTGTACGCCAATGACTTAAACGTAATTGTCGCTAAGCAACCAGAGGACTTTAAGAACCTGATTGAGTCGCGCATCACTAATTTTAAGACTCAAGAAGCGGATAAGAAAGCCAAGGAAGCAGCCGATCAAGCGATTAAAACCGCCCAAGTGGAGCAGCCTAAACCTGAAGCAAAGACGGAGCCAGTAAAAGCAACTGTGAAGCCTGCGCAACAAGAGCCTGATCTGGTTGCTGGATTTATCAATTCGCGCGAATGGCCTAGCGTTGCAGTAAAGAATAGTGCTCGGGCCGTCATTGTCGAATTCTTAAAGTTTCAAGAAGTGCATTTACAAGCAGCTTAACCAACGGGGCGCACGTTCACTGAAAGGTGTCTTCCCCAAGAAGTTGTGCGCCCCACCCTATTTAAAGATTGATATGACAGATCAAAACCAAATAAACATTAAAAATTCTGAATCTAGAATTTCTGCTTTTTTGAAAATACCCCCCCCCTCACTCATCTTGGAGTCACCAAAGGACGGTCGAGTTTAAGGAAGTCTGCAAGCTGGCAATGAGCTACATGAATCGCCAGCGTAAGAAATTAGATCGAATTAATGAAATTGAGCAACAGTTAAGAAGTTACTTTTAAATGACCAAAAGCAAAAATATCCTAGCACCTCGCCACAAGTGGACTGAACTAGAGCTCAGAACTTTAACCAGTCGATACGCCAACGAGCGAACAGAGCTCATCGCGATAGATATTGGAGTAAGCACCCAGTCTGTGTATACAAAAGCTCAGTCTCTTGGTCTTGTTAAGTCTAAAAAATATTTATGCTCACCAGCAGCTTGCCGTCTTGATGGCATTAAAGGAACTTCAAACCGCTTTCAAAAGGGGCAGCCAGCTTGGAATAAAGGTATGAAGGGACTTCAGACCGGAGGTCAAGCCGGATGGTTCAAGCCTGGTCATCGAGGTGGCAAAGCCTTGGAAAAATACCAACCTATTGGCACGATCAGAACTAGCAAAGATGGTTATCTGCAAATGAAGATACATGACAACATGCCTTTGCAATCTCGCTGGCGTGGGCTTCATCTTGTGAACTGGGAGGCTATTCATGGCCCAGTAAATACCAAGACCCATGCGCTCTGCTTTAGGGATGGCAATAAAGAGAATTGCGACATTAGCAATCTTGAGCTTATTACTAGGGCAGAGCTCGCCCTAAGAAACATGATTACCGCCATTCCGCCAGAAGTTCGCAAAGTCATGCAACTGCGTGGAGCAATCACAAGACAGATCAATAAAAGGAAAGAGCATGAGCAACAACAATAATATTGATGAATTACGACTCCACCTTTTTGACACTCTTCGAGGCCTTAAAAATGGCACCGTAAGTGTTGAAACTGCTCAGGCTATGAGCAATGTAGGAAAGACCATTATCGACACCGCTAAGGTTGAAATTGAGTTCACGAAGGCTACTGGCGAAACAGTTGTAAGTAAGTTTTTAGAATCAGAGCGAGAGCTGCCGCCTGGCATTACTTCAATTCGTCAACATAGGATTGCTTGATGACCGACATATCAAAAACACTAGAAGAGCGCCAAAAGACTCATGGCGAGTTTTCTTCGCACGCTCGAATCTCAATGCAGTTAAAGCTATTCATTGCTGGTGAGATAAACAAAGGTGATAAAGACCTCTCTTATTCTCAGCGTGAGGCTTTAGACATGATCTGCCACAAGATCGCAAGAATCCTTAATGGCAATCCTGACGTGCACGACCATTGGCACGACATAGCTGGATATGCGACTTTAGTGGCTAATAAATTGGAGGCAAATCATGAGTGAATGGATAAGCGTAAAAGATCGATTGCCATCATTGGAATGTTACTGCTTAGTTTTTACAGAAGATGGTTTTTATGAACTTATGTTTTTTGAGAATGATGAATGGCAAAACGATTCTCCAATTTATGGAGCTCCGAGCCATTGGATGCCACTCCCTACTCCACCAAAGGAGCAAAA
ATGAAATATTTACTTTCAGTTCTAATTGTTTTCAGTATTTTGCTATTTATGTATTTATTAGGGTGCTTTTACTTTGTCACTTTTAATATAGGTGAGAGGAGTGATTTTGGAAGATTTCATTTTGTAATTATTTCTCCTATTTCACTATTTATGGGATTTTATTATTTTTGCTTTGGTGAGAGTTTATGAACTGCCCCAAATGCCAAACTAAGACAAAGATGATCGCAGTCAGACCAAACAAGCATGGCTACCCTAAGCGGTCACGCTGCTGTCCCAAGTGTAAGCATCGATTCTTTACGGTTGAAGTTCTTGAAGAAGATGTGACTTTTGAGGGGGTTGTATGAATACGAGTGAACAAAATCAGACCTGTCACACAGATGGAGGCAAATGCGGAGTTGGCGGGTATTGCCGTTTTTGCAAGTTTTCTATCGAATCTCAAGCATTGGATTGGGGAGACAGGGATGATGAGTGGACTGATGAGATTAAGGCAGCTCACCCATTAAACACAGGGTTTTACGATACTTACCATCTTGCTTGTGAAATGGTACATAACCGGCATGGCAAGTTTGCTTTAGTTGCACTGGTGAATTGGCTGATAAGAAAGGCACAAGAGAAATGAACGCAAATGAACTAGCGGATTTGATGGACAGTAGAAGCGTAGAAGGAAGCCAAGCTAGGGAAGTAAGTGCCATGCTACGCCAGCAACAAGCTGAAATAGAAGCGTTGAAAGTTATTGTTGAAAGATTTATTGCCGACTGGTCTGACGGGTTTAAAAATTATGAATATAACTCTGATGAGCCTGAAGAAAACAAATTGCGTGATGATGTTTTTGCAATACTAAGAAAGGCATTCGATAAATGATTGATGAACCAAGTGCTTTTGCATTTATAAACAAACTGCCGTTACCAATGATTGAAAGCAAGGAAATGAACGCAAATGAACTAGCTGATTTAATTGAAGATGAGTATGTATCTATTGACAGCCCAGATTATTTTAGAAAAGATGACGATTGGTGGTGGGTAGAGTTGGGCAATATATTTAACAATATTCCAGCAAATGGTCAATATGATGACCAAAGGTCTGCTGGGTGTGCAATGAAATATATTGAATCATTGGAAAAGCAAATAGCAATACTAAGAAAGGCACAAGAGAAATGATTAAGCACATTTTAACTATTCCGTTATTTTTAATAACTGCTTTATGTTTACCTTTTGCGGTTGCTATCGCTGGTATTGGTGATGGATTGGTAAGTTATTGGAAAGCTATAAAAACACCTATTAAGGCTTACAAAACTTTTTATAAAAATCTTTGGATTTATTAAGAAAGGCAAAAGAATGATTCTTACTGATAAAGAACTTTTTGAGCTTACTGGATATATTCAAGGGTCACGTCAAGCAAAGTGGATCAATGAGAACTATGGATTCAATCCACCCATCCGTGCCGATGGTCATCCAAGCATTACCTGGCATCAAATCAATGCGCCAAAAAATAAACCTCACAATGAGCCAAACTGGGGAGTTGCAGCATGAAACAAAGAAGAGACGGCTTATTGCCACGGATGCAAGCGATCACTCGTAAAAAAGGAATTTATTACCGGTACATTACGCGCGATAACAAGCACGTTGGCCTGGGGTATGACTTAAACAACGCCATCAAAACAGTTTTGGAAATTAATGGGAAAGTTACCGATATTGGAAAGATTAGCAAGCTATGGGAAATTTATTCTTCAAGCGCTGATTTTTTGAGCTTGTCAAAGAATACTCAAAAGGAATACAAGGATTGTGCCAAGCCACTTTTAAAGATTTTTGGGGATGTATATGCTGGGCATATTAAAGCGCCTTGGATATATCGCTATTTGACAGTAGAAAGGGCTTCTGCGCCTGTCAGAGCTAATAGAGAGAAGTCTTTGCTATCCAACCTCATAGATTTGGCAATTCGCCGTGGTGAGGCTGAATTTAACCCATGTAAACAGGTTCGCAGAAATGTGGAGCAGCCTAGAA
CTGAGTCGCCAAGCAAAGATGACTTTGAGGGCTTTATGACATGGCTGAGAAATCAAGGGGGAAGAAGGCTTGCTATCGCTCAAATGGCAGAGTTTTGCGCGTACTCCGGCAGTCGGAAGATTGAGTTTTTGAATTTGACATGGCACCAGGTAGATTTTGAAGGCCAAGTGATTCGCGTTCCTAGAGCTAAACAACGTGGCGCTAAAAAAGGGAATGTGATTGATGCAATCAGCATGAGCCCCAACCTGCTTGACTTGATGGTGACTATTCGGGAATCAAACGCAAATCTAAATTATGTGTTCCCGACCCAAAAGGGCATGCCTTATACGGCATCGGGATTTAAGGGGATGTGGGGGAAATTGAAGAGAGAGGCAAAGGTGGCCAAGATTGAATTTAATGCCACGTTTCATGATCTTAGGGCCTACTATGCAACCACCCATAAAGCGGAAACTGGATTATTGCCAGACCTTCATGCGAACCCTCAAACGACTGCTAGGGTCTATGATCGGAGCAAGGAATTCAAGCGGAGAGCTGTATGAAAAGGGATTTACCTAAACGCGTATATAAATTTAGAAATTTATATAGATACATTCCAAAAGGCGAAAAAGCAATCAATCTTGGTCATAATAAAGATGAGGCATTAGCAAAATTTTATTCAATACAAAACCAAAAAAATATTAACAAAGATGAGATTGTTAGTTTAGAAAAAACAGTCATGATTATGTGGAAAAGGCATTTAAAAGGATCCAAGCAGAGAAAAATTGAATTTCAAATTACTGTTGAAGATATTGAGATGGCGCTTAAGCAACAAAAATTTAAATGCGCAATCACAAAGATTCGATTTAATGAGTCAAAGCCTGACGGGATGAGATTTAGGCCATGGTTGCCTAGCATTGACAGGGTTGATAACTCAAAAGGTTACACAAAAGATAACATCAGAATATTGTGTGCTTTTGTGAATATTGCCATGAACGGCTTTGGTGAAGGATTTTTTAAATATGTCCTTGAGCCACTGGTTGAAGAGCAAGTAAAAGCTCGATTAGAGGTTATAAAACTTACAAATAATCCCTAG
Protein sequences of DBSCAN-SWA_1 >NZ_CP015017|1380998:1387992|1385535_1385781_+|WP_071539368.1|DBSCAN-SWA MNANELADLMDSRSVEGSQAREVSAMLRQQQAEIEALKVIVERFIADWSDGFKNYEYNSDEPEENKLRDDVFAILRKAFDK >NZ_CP015017|1380998:1387992|1381756_1383274_+|WP_071539362.1|DBSCAN-SWA MQVHNLIQGSDAWHEFRETHFGASEAAAMMGVSKYMTRSELLKMKATGATKAIDSFTQKIFDNGHVVEALARPLVEAMIDDELYPVTCSDSNLSASCDGLTMAGDIAFEHKQWNESLADSVRDGVLPAEHVPQCQQILMITGATKVIFTVSDGTPENMVTMDVFPSEYFFGQIRAGWDQFAKDLAAYENKAIADKPVADSIEAFPVATIQAKGELILSNLPEVLPRFDLFLANQKTDLVTDDDFANGEAVAKFSRETAKKLRLVADQTIDQIASVSEAVRALNNYADQFDGLSLKLEKLVKSEKEARKDSIINAAKLKWREHLDAIDAELKTVHLQIIAPDFLAAVKSKRTIESIENAVDTTLASGKIAADALAKTYRANLTWAHENASEFNFLYANDLNVIVAKQPEDFKNLIESRITNFKTQEADKKAKEAADQAIKTAQVEQPKPEAKTEPVKATVKPAQQEPDLVAGFINSREWPSVAVKNSARAVIVEFLKFQEVHLQAA >NZ_CP015017|1380998:1387992|1385777_1386077_+|WP_071539369.1|DBSCAN-SWA MIDEPSAFAFINKLPLPMIESKEMNANELADLIEDEYVSIDSPDYFRKDDDWWWVELGNIFNNIPANGQYDDQRSAGCAMKYIESLEKQIAILRKAQEK >NZ_CP015017|1380998:1387992|1383558_1384236_+|WP_071540144.1|DBSCAN-SWA MTKSKNILAPRHKWTELELRTLTSRYANERTELIAIDIGVSTQSVYTKAQSLGLVKSKKYLCSPAACRLDGIKGTSNRFQKGQPAWNKGMKGLQTGGQAGWFKPGHRGGKALEKYQPIGTIRTSKDGYLQMKIHDNMPLQSRWRGLHLVNWEAIHGPVNTKTHALCFRDGNKENCDISNLELITRAELALRNMITAIPPEVRKVMQLRGAITRQINKRKEHEQQQ >NZ_CP015017|1380998:1387992|1386254_1386449_+|WP_071539370.1|DBSCAN-SWA MILTDKELFELTGYIQGSRQAKWINENYGFNPPIRADGHPSITWHQINAPKNKPHNEPNWGVAA >NZ_CP015017|1380998:1387992|1380998_1381754_+|WP_071540143.1|DBSCAN-SWA MSNEIVEASQPKQSFSLAPKDFEQALKFSEMMSKSNLVPKEFVGNAGNIMVAVQWGMELGLQPMQAMQNIAVINGRPSLWGDSVMALARSSSLCEYIIEEDDGKVATCRAKRKGQDAEIVATFSMEDATKAGLAGKQGPWTQYPKRMRQMRARAFCLRDAFPDVLRGMPIAEELQDIETGEVKPQPQTSARPVIEKSIYSDEEFIEKSEKWAQIVESGRKTPDELIIFIESKNAVLTDEQKAIIQTWAKAE >NZ_CP015017|1380998:1387992|1384738_1384924_+|WP_071539366.1|DBSCAN-SWA MSEWISVKDRLPSLECYCLVFTEDGFYELMFFENDEWQNDSPIYGAPSHWMPLPTPPKEQK >NZ_CP015017|1380998:1387992|1386445_1387459_+|WP_071539371.1|integrase|DBSCAN-SWA 
MKQRRDGLLPRMQAITRKKGIYYRYITRDNKHVGLGYDLNNAIKTVLEINGKVTDIGKISKLWEIYSSSADFLSLSKNTQKEYKDCAKPLLKIFGDVYAGHIKAPWIYRYLTVERASAPVRANREKSLLSNLIDLAIRRGEAEFNPCKQVRRNVEQPRTESPSKDDFEGFMTWLRNQGGRRLAIAQMAEFCAYSGSRKIEFLNLTWHQVDFEGQVIRVPRAKQRGAKKGNVIDAISMSPNLLDLMVTIRESNANLNYVFPTQKGMPYTASGFKGMWGKLKREAKVAKIEFNATFHDLRAYYATTHKAETGLLPDLHANPQTTARVYDRSKEFKRRAV >NZ_CP015017|1380998:1387992|1386073_1386241_+|WP_155763423.1|DBSCAN-SWA MIKHILTIPLFLITALCLPFAVAIAGIGDGLVSYWKAIKTPIKAYKTFYKNLWIY >NZ_CP015017|1380998:1387992|1384470_1384746_+|WP_071539365.1|DBSCAN-SWA MTDISKTLEERQKTHGEFSSHARISMQLKLFIAGEINKGDKDLSYSQREALDMICHKIARILNGNPDVHDHWHDIAGYATLVANKLEANHE >NZ_CP015017|1380998:1387992|1385260_1385539_+|WP_071539367.1|DBSCAN-SWA MNTSEQNQTCHTDGGKCGVGGYCRFCKFSIESQALDWGDRDDEWTDEIKAAHPLNTGFYDTYHLACEMVHNRHGKFALVALVNWLIRKAQEK >NZ_CP015017|1380998:1387992|1387455_1387992_+|WP_071539372.1|DBSCAN-SWA MKRDLPKRVYKFRNLYRYIPKGEKAINLGHNKDEALAKFYSIQNQKNINKDEIVSLEKTVMIMWKRHLKGSKQRKIEFQITVEDIEMALKQQKFKCAITKIRFNESKPDGMRFRPWLPSIDRVDNSKGYTKDNIRILCAFVNIAMNGFGEGFFKYVLEPLVEEQVKARLEVIKLTNNP >NZ_CP015017|1380998:1387992|1384219_1384471_+|WP_071540145.1|DBSCAN-SWA MSNNNNIDELRLHLFDTLRGLKNGTVSVETAQAMSNVGKTIIDTAKVEIEFTKATGETVVSKFLESERELPPGITSIRQHRIA |
13 | uncultured_Caudovirales_phage(33.33%) | integrase | attL 1384528:1384541|attR 1389370:1389383 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
1529985 : 1539303
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP015017|1529985:1539303|DBSCAN-SWA ATTAACGATTGAAAACATTCATGCCAGGGAAGAAGAAAGCAACTTCCACAGCAGCTGTTTCAGGAGCGTCAGAACCATGTACTGCATTAGCATCAATGCTGTCAGCAAAGTCAGCACGAATAGTGCCTTTATCTGCCTTCTTAGGATCAGTAGCGCCCATCAAATCGCGGTTCTTAGCAATGGCGCCTTCGCCTTGCAATACTTGAATCATGACTGGACCAGAAATCATGAAGCTCACCAAATCTTTAAAGAAAGGACGATCTTTATGCACACCGTAGAACTGCTCAGCTTCATTTTGTGACAAATGCGCCATTCTGGAGGCAATGATCTTTAAACCAGCTGATTCGAAACGGTCATAGATTTTGCCGATGACGTTCTTAGCAACAGCATCAGGTTTGATAATAGAAAGGGTGCGTTCAATTGCCATGCAAGACTCCAATAGTGATTTCAAAGGTATTTGTTGAATAGGCGCCGCAAGAATCCTAAAAGGACTCCATGCCCCGCCCTACTTCAACGACCCCCGAATTATACAATGCCTGACCGTATTTACCCCTATAGGGGCTAGGCTAAGCCCTTGAATTTACAAGGCATAACCCCTCAAGCTGTAGTCTTTTTGCTCCTAAACTTGAAATTGGGGTGTTTTGTCTATACTTACTGTCAACAGCCCTTAATGGGCTCATGAGTTTAATTAAAAAGAAGGAGTCAATATGAGTGATCTGAACTCTTACGGCTTTGGGCAATCTAGCTCGATTAGCAATATCCAGGTTCGCAACCGCGTACTTCGAAATACCTATGCATTACTGGCCCTCTCCATGGTGCCAACAGTCATTGGTGCATGGCTTGGCGTAGCAATGGGTCTAAACCTTTTGGCCGGTAGTCCATTTATGGGCTTTATCGTCTTTATGGCTGTCGCCTTTGGATTCTTCTGGGCAATTGAAAAGAACAAAAACACTGGTGTAGGTGTGTTGCTTTTGCTGGGCTTTACCTTCTTCATGGGCATCATGATGTCCCGTCTGATTGGTTTTACCCTCAACAGTTACAGCAATGGTGCGACCCTGATCATGCTGTCTTTTGGTGGTACTGCTGCAATCTTTGCCACCATGGCCACGATTGCCACTGTAAGCAAAAGTGACTTCTCAGGTCTTGGTAAATTCTTGATGGTGGGTGTCCTGTTGTTAATCCTTGCGTCTGTTGCAAACATCTGGCTCCAGTTGCCCGCCTTGATGTTGACAGTGATGGTGCTTGCTATTGCGATCTTCTCCGCATTCATCCTGGTTGACGTTCAGCGCGTCATCAGCGGCGGAGAAACCAACTACATCATGGCTACTTTGGCTATTTACTTGGATGTGTATAACATCTTTACAAACCTGCTTGCCCTCTTGGGTATTGTTGGTGGTAACAGAGATTAAACAACAATTTTTAAGTTAGTTGCTTCTCAAAGGCGCCTGCGGGCGCCTTTTTATTGATTGCCCTCCAATGAGATGTATTTGTAAACGAACGAATTAGTTCGCTACAAACATACTCACCGCATAAGACAGTGCAATCAAAATTAACAAGATCGCAAAACTATCCGAGGCTTTCATGATGTTCTCCAGTTGGTAGGTTGAATCGTTAACAGCCTTAGTCCACTGCACAATTCAAATCTACTCTTCCATAAGACCTAATTCAGAACTTGAATGCGCTTGAATTGGAGTAAGAATCCAAAGTGGTGCCCTATTTGCCCCAACTTAGTACATAGCCTTACTCAGAACGGTCAAAAACTGCCATGGATTCAACATGGGAGGTATGTGGGAACATATTGACTACCCCCGCGCTCTTGAGTGTATAACCAGCGTGATGACATAAGATTTCAGCATCACGGGCTAAGGTTTTAGGATTACAAGATACATACACAATTCGCTGCGGCAACAATAGATTGTCCTC
TGCATGTAACTCTGCAAGTGCCTTACAAATTTCCATTGCACCTTCACGTGGTGGATCCATGAGCCAGCGCTCCGCTTTACCCCATGATGCAATTGTGTCTGGCGTTACTTCAAATAAATCACTTCGCATGAAGCTCACCTTATCCGCTAATCCATTGTGCACAGCATTTGCTTTGGCCCTAGTGGTAAGAGTTTCTAACCCTTCAATACCCAATACTGTTTTTGCTTTTCTAGCGAGCGGTAAGGTGAAATTACCAATACCGCAAAATAGATCTAAGACTCGATCGCTGGGGCTTACTTCTAACAAACGAATCGCTTTGCTAACCAATGCGCGATTCATCAAGTGATTCACTTGAGTAAAGTCAGCCGGCATAAACGGCATTTCGATTTCAAACTCCGGCAAGCGATAGCATAGCTTACCTGTTTCAGGATAAAACGGCGCTACTGTTTCAATCCCCTTAGGTTGCAACCAAACCCAAACCTGATGTTCGTCTGCAAAGGATCTTAAAAGCTGCTCATCATCCTGGGTTAAAGGCTTCAGGTTTCGAAAAACCAAGGCTGTTACTGGCCTCGCTTTTTTGGGATCCAATGAATTTGGCTCTTCTGGCTCACCCACTGCCACTTCTATTTGTGGCATGCGATCAACAATGGATAAGCCGTTCACTAATGCTCTTAACTGCGGCAATAAATTAGAAACATGCTTAGGTAAAATTTCACAAGCAGTCATATCAGCAACATAGCCACTCTTGCCCTCATGAAAACCAATGAGGACAGTGCCCTTTTTGATCGAGCGATTAACAGCGCTTAAACGGGCGCGATGTCGATACTCCCAGGTAGGCCCACCCATGGGACGCAAAATTTCTTGTGGCTTTAATTTTGCTAAGTGCTGCAGATCATCTTCTAATACACGCTGCTTCATAGCAACCTGAGCCCGAATATCTAAGTGCTGCATCGTGCAACCACCGCACACACCAAACGCTTTGCACTTAGGTTCAGCCCTAAAAACTGCAGGCTTTAAGATATCGCGTACTTTTGCCTTACTAAAGCGCGCCTTGTCTCGAGTAATCGTGTACGTCACCAACTCTGTTGGTAAAGCGCCTTGAATAAAAATGACTTTTCCACTATGGCCCTCAGCGGCTTCCTCTTCATTTGGCGCCATGCGCGCAATACCTTGAGCATCTAAATCAAGGGATTCCACTCGAATAGGCTCAGTCACTTCAATATTGACCGGCTTTTCACCTCTACGCATCCGCTGCAAAACCTTGAAGATATTCTTGCCACTGCCCACCGTTAGGCTCTTTCGCTTCTTTTTCAAGATAGGCCCTCACAAAAGCGATCTCCTCTTGGTACTCCTCTAATGAAAAGCCACCCCTGAGCAATTGAAAGCGACAGTACATGAGATAGGTATTGACCACATCGGTTTCGCAATAGCGACGAATATCGTTAATCTTGCCTTCTTGATAAGCAGGCCAAACTTGGCTGCCATCCATTCCCATCTTGCCAGGAAAGCCACAGAGCTTAGCTAGGCCATCTAAAGGAGCATTGGCTCTACCATTAAATTTAGCCAGCAAATCCATCATGTCTAGATGGCGCATGTGATAGCGACTAATGTAGTTATTCCATTTGAATTCTCGACTATCGGATTCTTGGCTCTCACCCATTTCCCAATAACGTGGCGCTTGAACATGATTCGCTAGAGCACGATAGTGCAATACTGGTAAATCAAATCCACTGCCATTCCAAGAAACCAATTGAGGGGTGTACTTCTCAACCAAATCAAAAAAGGTTTGAATTAAGATTTTTTCATCATCTTGCGGCATACCTAAAGTACCCACTTTAATTTGAGGCGAACCCTCTTTCGTAGTTCTACGAATCACACAAGAAATGGCAACGATCTTTTGCAGAAATAGAGGCAGAAATTCACTGCCGGTTTTGGCTGCTCGCTCAGTCATAGCTTGAGCAGCTACTTCAGCATCACTTAAAGTGTCTGGGTAACTTTCC
AAACGACGTAGGCCAGCTACATCCGGAATGGTCTCAATATCAAATACCAGTACGGTGGCCATAGGCGCTTACTGCAAATACGGTGTAGGGTTTACTGGCTTGCCGTTTACACGCAACTCAAAATGGAGTTTTACAGAAGTCGTATCGGTATCGCCCATCTCCGCAATCTTTTGACCTTTGCGTACCGTATCACCCTCTTTAACTAGAAGCTTACTGTTATGAGCATAGGCAGTGAGATAGGTGTTGTCATGTTTAACAATTACCAAATTGCCGTAACCACGCAAACTGTTACCGGCATAAACCACTTTGCCATCCGAAGCAGCCAAGACTGGCTCACCCACTTTTCCAGCAATATCAATACCTTTATTTTTATCGCTAAAGTCATCAGTGACTTTGCCTTTGGCTGGCCAAGATAAACGAATTCCAGGCTCTGCAACTATTTCTGATTTAGTGGGCTCCGGTGTAGGGCTTGGTACATCTGCTTTATCGGTTACAGGGATTGGCTTTTTCTCAGTTACTTTCTCAGCCACTTTGACTGGCTTGGTACCTGCTGGTGCCTTAATCAAAATTAAATCGCCAATTTCAATAATGTTTGGATTAAAGCTTGGATTAACAGCCTTGTTCCACTGCACAACATCTCGAGGCGCTTGACCATTATCCAAGGCAATGCGTGCCAAGGTATCACCCCTTTTCACCCGATAGTAGCCAGGAGGCGCAGGCTCGCTAGATCCGCCACTGCGGTCAGTAACGCTGGCAGGCTTGGTACGAGGCGTAGAACAACCCACTGTTAGCATCAAAGATGCGGTCAGAAGCAAAAACAAAAAGCTTTTAGAAACAGTGTTTGGCATGACGTTGGACATTAGATTCAATATAGATCTCATACTACCCCTGATTGTAAGGGGACAAAAAAGACTCCGTCCAGTACAGTTCTTTGATAGCGCTGAGAACTCATTCTCTCTACTACCACCAGTTGCTGTTCATTCTCATTTTTGGCAACTGGGGCAACTAATCGGCCTCCAATGGCCAACTGATCCATTAAGGCATCAGGAATACCGAGACCAGCCGCAGCCAAAATTATGCCGTCGAACGGTGCGGCCTGGGGTAATCCCAAAATTCCGTCTCCATAAATGAGGCGCAGGTTATTAATACGAAAAGGTCGAAGCTTAGCTCTAGCCATGTCATGCAATGGCCGGATGCGTTCGATTGTATAAACCTCATCCGCAAGCAAACTTAATACCGCAGCTTGGTACCCACAACCGGTCCCGATTTCTAAAACTTTGCCTAGCTTATGTCTTGGCTTGTGCAATAACTCAATCATACGAGCAACTACAGAGGGTTTTGAAATTGTTTGCTCATGGCCTATTGGCAGAGCGGTATCTTCATAAGCTTGCGCATGCAAACCGGGGTCCATAAATGCATGACGTGGAACGGTCGCGATGGCTTCTAAAGTCTTGCCATGTTTCACGCCACTAGCGTGCACCTTGGCTGCTAGAGCTTGGCGATAGGCAGCAAAGCGTTCTGTGGGAGCCTTCAACCGCGATCCCACCCATGTGCACGCATTGCTGCTAAGCGCGCATGGTGCGTCAAGTCCAACTGCATCGGCGTAATCGAAATACATCCTTCGGCAATCGCATGAAAATCGGTACCCTCAGAGCCTTCTTTAACTTCGCCGGCAGCACCAATCCAATAAATTTTTTCGCCACGCGGACTATCTTGCACCACTACTGGTTGCGAGTGATGGCGGTTTCCCAAACGGGTTACACGCCAGCGATAGAGGTCAGCATAAGGACGATTTGGAATATTGACATTGAGTAGTGTTGCATTACCTTCTGTACGAGCCAAGGCGGAAACTAACATTTGCGCAACAACATCATGCGCTGCTTTAGCAGCATCTTCAATCCGATTCCACCCACGATCAATTTGCGAGAACGCAATACCGGGTACACCAAACATGACCCCTTCAACCGCTGCAGCCACCGTACCTGAGTAAAGCGTATCTT
CACCCATGTTCTCGCCTTGATTGATTCCAGAGATAACTAAGTCTGGCTTTTCATCCAAGAAACCAGTCATCGCAACGTGAACGCAATCGGTAGGAGTGCCATTGACGAAGAAAAAGCCATCACGCTCGCCGCCAGCTACTCGATGAATCGATAGAGGCCTAGAGAGTGTGAGTGAATTAGATGCTCCGCTATGGTTTTGCTCAGGAGCAATCACTGTAATTCGACCTAAGGGACGAACTGCATTCACTAAAGCCAATAAACCAGGAGCTAGGTAACCATCGTCATTGGAAACCAGGATATGGGGTTGCTTTGACATGATTAAACGATTCCTGGTTTATGTTTGCCGCGACTTCGCATTGCCGCATAAATCACCGGCAATACCAAAAGCGTCAGAATTGTAGTTGTGACCATGCCGCCAACAATGACGAGAGCTAATGGGCGCTGAGCTTCAGATCCAATCGCATGAGATAAGGCGGCAGGCAAGAGACCCAAGCCCGCCAACATTGCCGTCATGAGAACAGGTCGAACACGTAAAGCTGCGCCATCAACCATGGCATCTTTCATGGCACCATGCTCTTCAGCTACTGTCTTATTGATGTAAGAAATCAGAATCACACCGTCTTGAATGGCAATACCAAATAAAGATAAGAAACCAAATAAAGCAGAAATACTCAGCGTTTCGCCACCAAGATGCAGGGCGATAATTCCGCCAATGGCTGCAAATGGGACATTGATGAGAACAATCACCGCATCTCGGAAGTTACCCAAGGCAGTTACCAACAACAAGAAAATTGCCACCAGCGTGAGCGGAATGATGACCATTAATTTTTGTTGCGCCGCCTTCATCTGATTAAACTGACCATCCCAGGCTATCGAATAGTTTGCAGGTAGCGCTACATTCTTCTCGACTAAATATTTGGCATCTTCGACTGCGCTACCAAGATCTCGATCACGCACGCTGAAAATAACCGCAATGTAACGCTTACCGGCTTCCCGATATATAAAGAATGGACCATCTGTAAGCTCTACATTTGCTACCATCCCCAAAGGCACTCTGGAGCCAACAGGAGTGTCAATCAGTAAATTGGCAATATCAGGCAAATCATTTCGACTATCTTCATTTAAACGCACAGCAATGCCAAACGTCTTTTCGTCCTCCAGGAAATTAGAAACTGGGGCCCCACCGATGGCATTAGCAACTACAGTTTGCACATCGGCCACATTTACCCCATATCGCGCTGCCTTATCGCGATCAATCTGAATATTTAAAGTTGGTTGACCCAGCTCCTTCAGAATGCTTTCGTCCTCAATGCCACGCACTTTCTTTAATTGGGCAATGACCTCATGCGCCTTTGTGTTCAGAATTGTTAAGTCTGTTCCAAAAATCTTGACAGAGTTTTCACCCTTCACCCCAGACAGAGCCTCATTCACGTTATCCTGAATGTATTGAGAGAAACTAAATGAAATCCCAGGAATGCGCTCCAGCTCACCCTGTAAGCGCTTCAATAGATCTGCCTTACTAGACCCCTCCGGCATCAGCTCAGGAGGCTTTAGATAAAGACCGTACTCCTGATTAAATACTCCAGTAGAGTCAGTTCCATCATCTGGGCGACCAATTTGAACAGCCACTCGCTCTATCTCAGGCTGCCTTGTGAATGTCTCTCGCAACTGATTGGCAACACTAATTGAATAATTTAAATCAACAGTATTTGGCAGGACAACACGAAGCCAAATGTTATTTTCTTCTAAGGTGGGTAAAAATGCAGTACCCAAACGAGTTGCACTAATTAAGGTAACGCCAAGCACCAAAATAGAAACAGTAATGACGTGACGAGGGTGATCCATTAAGCGCCGCAATAAGGGCTTGTAATGGTCGAGCATCCAAGTAATAAACTTAGGCGGTTGATGATGGAAATTTTCACCGAACGCGTAAGAAATCGATGCGGGTAAGAAGGTGAGACTCAGAACAATCGAGGCAATCAAGGCAAAGCCCATCGTAA
AGGCCATCGGCTTAAATATAATCCCCTCCACACCACCCATTAAAAATAATGGTGAGTAAGCAACAATAATGATGCTGGTGGAATACACCATAGCCCTTTGCACTTCAGCAGTCGCCAAAATAATGCTTTGATTCAAGCGCTTTCCGCCCTCTTCCAAGTGACGCATCACATTTTCCGTAATGATTACAGCTGCATCTACGATCACCCCGAAGTCAATCGCACCCAAAGAAATCAAGTTGGCTGGTACATTAAATATGTACATCATGATGAAAGAAAAACACAGCGCCAGGGGAATAACTGCTGCAACGACAGCGGCAGCCCGGAAATTACCCAAGAATATGTAAAGCAAAACCAAAACCATAGTGATGCCAAAAAACATGGTGTGCTTAACAGTGCCCACAGTGATATCTAAGAGAACCTGGCGATCATAAAAAGGCTTAACCTCAATGCCCGGCGGCAATACATGGGCATTGATCGTGACAATCTTTTCTCTAACGCGGGCTAATACTTCTGAGGCATTCTCACCTCGACGTAAATACACAATTCCTTCAACCGAGTCTGGATTTTCATCAAACTGGAACAAACCCAAGCGGGGAGCATTTCCAATTTCAACTGTGGCCACATCACCAATGCGAACAGGAACGCCGTTGTTGACAGCAACAACCACTCTTCTAATATCGTCCAAGTTCTTCAGTAGGCCAACCCCACGCACCACAAATTGCTGTTCACCACTTGGCAATACGCCGCCTCCAGTGTTGTCATTTGCCTTCGATAAGGCATCAATTAACTGAGAAATAGTGATATTTTTGGATTGCAGGCTTTCGGGTCGAACGATTACGTTGTACTGACGAACCTTACCGCCAAAAGATGAAACGTCTGCAATGCCTGGAGTTTGTTTGAGCTCTTTGTAGATCTCATAATTTTGCAAAGTCTTTAGACGAGTGGGGGATGCATAGTCTGATTTCACTTGATAGCGAAGAATTTCGCCAGTAGCATCTGAATCAGGGCTCACACTTGAGCTTACGCCCGGCGGAAAAGTTACATTTCCCAAATTAGAAATAAAAATTTGCCGAACCTTAAAGGGGTCAGCATTGTCATTAAATTTTAGCGTGACAACAGAAAGTCCAAATAAAGAAACGGAGCGAAATGCTTTAACCCCTGGAATACCTGCTAAAGCATTTTCTACTGGAATCGTAACTTGTTGTTCTACTTCAGTAGTACTGCGTCCTGGCCACTGAGAAATTGCTTGGATAGTTAATGGCGCTACACCTGGATAAGGCTGAATAGGCAGCTGCTTTAGGCTGTATGCTCCCAAGCATAAAAGTACCGCAGAGGCAAATAGGATTAGTATGCGCTTGTCTAAAACACCTCTTAGAAAATTCGTTGCGAAACTCAA
Protein sequences of DBSCAN-SWA_2 >NZ_CP015017|1529985:1539303|1533147_1533969_-|WP_071539467.1|DBSCAN-SWA MATVLVFDIETIPDVAGLRRLESYPDTLSDAEVAAQAMTERAAKTGSEFLPLFLQKIVAISCVIRRTTKEGSPQIKVGTLGMPQDDEKILIQTFFDLVEKYTPQLVSWNGSGFDLPVLHYRALANHVQAPRYWEMGESQESDSREFKWNNYISRYHMRHLDMMDLLAKFNGRANAPLDGLAKLCGFPGKMGMDGSQVWPAYQEGKINDIRRYCETDVVNTYLMYCRFQLLRGGFSLEEYQEEIAFVRAYLEKEAKEPNGGQWQEYLQGFAADA >NZ_CP015017|1529985:1539303|1534784_1535441_-|WP_011903137.1|DBSCAN-SWA MKAPTERFAAYRQALAAKVHASGVKHGKTLEAIATVPRHAFMDPGLHAQAYEDTALPIGHEQTISKPSVVARMIELLHKPRHKLGKVLEIGTGCGYQAAVLSLLADEVYTIERIRPLHDMARAKLRPFRINNLRLIYGDGILGLPQAAPFDGIILAAAGLGIPDALMDQLAIGGRLVAPVAKNENEQQLVVVERMSSQRYQRTVLDGVFFVPLQSGVV >NZ_CP015017|1529985:1539303|1536225_1539303_-|WP_071539469.1|DBSCAN-SWA MSFATNFLRGVLDKRILILFASAVLLCLGAYSLKQLPIQPYPGVAPLTIQAISQWPGRSTTEVEQQVTIPVENALAGIPGVKAFRSVSLFGLSVVTLKFNDNADPFKVRQIFISNLGNVTFPPGVSSSVSPDSDATGEILRYQVKSDYASPTRLKTLQNYEIYKELKQTPGIADVSSFGGKVRQYNVIVRPESLQSKNITISQLIDALSKANDNTGGGVLPSGEQQFVVRGVGLLKNLDDIRRVVVAVNNGVPVRIGDVATVEIGNAPRLGLFQFDENPDSVEGIVYLRRGENASEVLARVREKIVTINAHVLPPGIEVKPFYDRQVLLDITVGTVKHTMFFGITMVLVLLYIFLGNFRAAAVVAAVIPLALCFSFIMMYIFNVPANLISLGAIDFGVIVDAAVIITENVMRHLEEGGKRLNQSIILATAEVQRAMVYSTSIIIVAYSPLFLMGGVEGIIFKPMAFTMGFALIASIVLSLTFLPASISYAFGENFHHQPPKFITWMLDHYKPLLRRLMDHPRHVITVSILVLGVTLISATRLGTAFLPTLEENNIWLRVVLPNTVDLNYSISVANQLRETFTRQPEIERVAVQIGRPDDGTDSTGVFNQEYGLYLKPPELMPEGSSKADLLKRLQGELERIPGISFSFSQYIQDNVNEALSGVKGENSVKIFGTDLTILNTKAHEVIAQLKKVRGIEDESILKELGQPTLNIQIDRDKAARYGVNVADVQTVVANAIGGAPVSNFLEDEKTFGIAVRLNEDSRNDLPDIANLLIDTPVGSRVPLGMVANVELTDGPFFIYREAGKRYIAVIFSVRDRDLGSAVEDAKYLVEKNVALPANYSIAWDGQFNQMKAAQQKLMVIIPLTLVAIFLLLVTALGNFRDAVIVLINVPFAAIGGIIALHLGGETLSISALFGFLSLFGIAIQDGVILISYINKTVAEEHGAMKDAMVDGAALRVRPVLMTAMLAGLGLLPAALSHAIGSEAQRPLALVIVGGMVTTTILTLLVLPVIYAAMRSRGKHKPGIV >NZ_CP015017|1529985:1539303|1531724_1533155_-|WP_071539466.1|DBSCAN-SWA 
MRRGEKPVNIEVTEPIRVESLDLDAQGIARMAPNEEEAAEGHSGKVIFIQGALPTELVTYTITRDKARFSKAKVRDILKPAVFRAEPKCKAFGVCGGCTMQHLDIRAQVAMKQRVLEDDLQHLAKLKPQEILRPMGGPTWEYRHRARLSAVNRSIKKGTVLIGFHEGKSGYVADMTACEILPKHVSNLLPQLRALVNGLSIVDRMPQIEVAVGEPEEPNSLDPKKARPVTALVFRNLKPLTQDDEQLLRSFADEHQVWVWLQPKGIETVAPFYPETGKLCYRLPEFEIEMPFMPADFTQVNHLMNRALVSKAIRLLEVSPSDRVLDLFCGIGNFTLPLARKAKTVLGIEGLETLTTRAKANAVHNGLADKVSFMRSDLFEVTPDTIASWGKAERWLMDPPREGAMEICKALAELHAEDNLLLPQRIVYVSCNPKTLARDAEILCHHAGYTLKSAGVVNMFPHTSHVESMAVFDRSE >NZ_CP015017|1529985:1539303|1535437_1536223_-|WP_071466864.1|DBSCAN-SWA MSKQPHILVSNDDGYLAPGLLALVNAVRPLGRITVIAPEQNHSGASNSLTLSRPLSIHRVAGGERDGFFFVNGTPTDCVHVAMTGFLDEKPDLVISGINQGENMGEDTLYSGTVAAAVEGVMFGVPGIAFSQIDRGWNRIEDAAKAAHDVVAQMLVSALARTEGNATLLNVNIPNRPYADLYRWRVTRLGNRHHSQPVVVQDSPRGEKIYWIGAAGEVKEGSEGTDFHAIAEGCISITPMQLDLTHHARLAAMRAHGWDRG >NZ_CP015017|1529985:1539303|1529985_1530411_-|WP_011903132.1|DBSCAN-SWA MAIERTLSIIKPDAVAKNVIGKIYDRFESAGLKIIASRMAHLSQNEAEQFYGVHKDRPFFKDLVSFMISGPVMIQVLQGEGAIAKNRDLMGATDPKKADKGTIRADFADSIDANAVHGSDAPETAAVEVAFFFPGMNVFNR >NZ_CP015017|1529985:1539303|1530691_1531393_+|WP_071539465.1|DBSCAN-SWA MSDLNSYGFGQSSSISNIQVRNRVLRNTYALLALSMVPTVIGAWLGVAMGLNLLAGSPFMGFIVFMAVAFGFFWAIEKNKNTGVGVLLLLGFTFFMGIMMSRLIGFTLNSYSNGATLIMLSFGGTAAIFATMATIATVSKSDFSGLGKFLMVGVLLLILASVANIWLQLPALMLTVMVLAIAIFSAFILVDVQRVISGGETNYIMATLAIYLDVYNIFTNLLALLGIVGGNRD >NZ_CP015017|1529985:1539303|1533975_1534788_-|WP_081354849.1|DBSCAN-SWA MRSILNLMSNVMPNTVSKSFLFLLLTASLMLTVGCSTPRTKPASVTDRSGGSSEPAPPGYYRVKRGDTLARIALDNGQAPRDVVQWNKAVNPSFNPNIIEIGDLILIKAPAGTKPVKVAEKVTEKKPIPVTDKADVPSPTPEPTKSEIVAEPGIRLSWPAKGKVTDDFSDKNKGIDIAGKVGEPVLAASDGKVVYAGNSLRGYGNLVIVKHDNTYLTAYAHNSKLLVKEGDTVRKGQKIAEMGDTDTTSVKLHFELRVNGKPVNPTPYLQ |
8 | uncultured_Mediterranean_phage(25.0%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_3 |
2035111 : 2041474
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP015017|2035111:2041474|DBSCAN-SWA TTTAATAAGGGAATTTTGAATAACTAAGGCCAAGCGCATTGACTTCATCAATACAATCGCTTGAGGAATATAACTTTCCATCCTTACTCATATACAGAAATCCTTTAGTAATACTCTCGCCGGGCGCCCACCACCCATTGCTTTCGTTATGTCTCATCCAAAACTTAGGGGCAATCACCGTTTTGCAAAATTTATTTAACCAAGCTCCCCACCAAGAAAAGGTAGAGTTAGCAATGATTAAGTATTTCGCATAACTAATCACTAAAAAGTCTTCTGCAACGCTAAAGTGATATGCGGGGAAATCGGGGAATAATTTTTCCGCGGCATCAACATCATCAGTAATAACGATAAATTTTAAATTGTCATGGAGCTTTTTCATTTGCTCAACTGAATTATTCCAGTAATCAGAGCTTAAATAGACATCCGGAATATCTTTGTAGTCCCCGCCCCTAAAGTTAATGATACATACCTCATCTGTAAGATTTAACTTGTCAATTAGCTCAAGATTAGGATTTTTTAGCTCAAACCATTCTTGGATATTTTTTTTATTTTCAAGAATATATCTTTCTGACTGAAAGAATCCAACTAGTTTTGTGAAATCTTGTAAGTGAATAATATTGGGGTTGTAAAACTGCGCCATAAAACTATTATGCATATAGCTATCTGTATATTGCATCTTTGTTTGATCAACCTCAACACCAAGGGAGCAGTCAAATATTTCTGCGCCCAAAAAAGATCTAGGGACATGGAACTCGTAGCCATTTCTCTCTGCCAACGTACGGCATACGGCATATTGCCACATATGATTTCCAAGGCGGCCTATCATTTCTACGCTAATCATTACAAAGCATTCCTAACTCAACTTATAATCCCTGATAGCTTATCATTTACTTTCAAAGCACCGTGATTTATGAAACCCATTAAACTCATATGAAAATAATAGACGCATTTATATTTTTCAATGAAATTGATACGTTAAAGATTCGACTGGGTCTTCTTTATGAGAAAGTAGACCAGTTTGTAATATGCGAATCTAACATTACTCATAGTGGGCAAACAAAGAAATACAATTTTCTAGACCGTCAAAGTGAATTTTTGCCATGGTTAGATAAAATTACGTTTCTTCAGTATGAGCCAGATGTAAGTCACCTTGATTTCACCAAAAAAGATGAGGCCTACAACCCATCTTCCGCTTCTTGGCAAATCGAGACGGGGCAAAGAAACTATCTTGGCAGCTATATAAGAAATCTCAATTCTGAAGACATGGTTATGGTCTGCGACGTTGATGAAATATGGAATCCTACCTTTGCAGACTTTATTAGATCTGGGCAATATGAGCTAGATGCCGCTAGAATGGAAATGCTATTTCATCACTATTATTTAAATTGCGTAGGTATTAGTCAAAGCAACTCCAAATGGATTCATGCATTTTGTGCAAAAGCGTCTTATTTAAAAACAAAGCAAAATATCTCCCAAATTAGAGTCGGTGAGCAGCTTCCAATTGTTGCTGGTATTGGCTGGCATTTCTCCTACTTAGGAGGCGCCCAAAAAATATCTGAAAAGATTCATGCTTTTGCACATCAAGAAACCAACACAGCAGAAATTAACAACTTAAAGCATCTAGAAAACTGCATCAACCTTGGAATAGACCACCTCGGCAGACCAGATCATGAGTGGGCCTTTCACCCGCTTGGCTACTACCCTCCTGATATCAAGGCTGAAATGGAAAAATTTCCGCATTTAATCAAAAAAAGTTTGGTTTAATTTATTTCATTTCATGCCTGACTACACATCAATCACAGTTACGTCTATTTTTGGTCACAATGATGGGGCATCTGCCATCCCAGCCATCTTAAGATCAATGAAAGAGCTTCCAGGTAGCAAGGGCCTCCTTCTGTCTACCCAGAAACCACAAAATCTTCC
ACCTCAAATAGATTGGACTGAAATACTGCCCTTAGACTATCGCCAATACAGTCTGTTTGTGATGTTCTCGTTACACAATTTTATTCAAACTGAATTTTGCTTAATTGTTCAGGACGACGGCTGGGTTATCAATGGAAAAAGTTGGAAAAAGGAGTATTTTGATTATGACTATATAGGTGGGCCATGCCATGCAGCATTTGTTGGCAGCGAGCTCGTTCCAGCCTATCAATGGGTTGGTACCTCCAATCCAACTCCACTAGTGATTCAGAATGGTGGCTTAAGTTTACGTAGCAAGAAATTCTTAAAAGCGCCCAGCTGTCATGGCGCACTCTACTATTTTTCTGAAGAACAGATCCTACAGAATGAAGATGTACAACTTACTGGTATTTACAGACCTCAATTAGAAGAACTGGGAATCAAATTTGCCCCAAATAACCTAGCAAAACAATTCTCTGTTGAGTATTTGGGACCAATATTCCATGACGATATTGATTTACTTAGCCTCTTAGCGGTGCATGGACAAACCCGTAAGCTTATTGAAGAAAATACCATTCAAATCACCATCCCAAAAGATCAACTGCAAAGCATACATAGGGAGGAGGAATTGTTAAATTACCTATCTTCAGAGCTCCACTACAACATTCGATATATAGCCTAAACCTACCAGTAATAAAAAATGCCCAGTCTTTCGAACCGGGCATTTTTTTATTTTATTTTTTACCTATCGCCTTTGATAGCTATTGGCAAGATTATCTTGTAGAACATAAGCAATGCCGTTAGCTACTAATGACAGGTCCTTCTCTAGCAAAGTACTTCGATACACCCATCCTGTAGGTAACTTGAGTTCCTTTGCTAAATTAGGAAGATCCTTCATGGTCAAATTAGGATTAACAATTTGGGAATAGGACTGCATCACATAAACCTGACCATCCGGAGCAACTAACTCATATATAGGGCTACCGGCTTTATAGATAAAATTTGTTGTACGGTTGATTTGATTGGGAGAGTATGATTTGCTACCCAAAATTTGCTTTAAGAGCCCTATATCAACCGTTGCTCTTAAGTTCATTTCAATACCACCAAAAGATTCTTTGACATTGTTCACGGTGGCACCAGCCGCCTGAATCTCATCCATCATCCAATAACGTGGGCCATTTAGATCTACAAAAGAAGCATCATACTTTTTGGCAACCTCTTCCTTCGTGATGGTTTTCCATTGATCCTCAGGACATAAATTTAGCCCCTGGGTATTGAATACTTTCACTTCTAAATTTAGCCAATGGCGTTTACCGTAAAGGATTTCGCAATAGCGTTGATCGCGCAAATTGGATACGCTTTTTTCGAAAATAGGTTGCGAGTTTGCTAACTCTGCAATCAGGATCAACAAGCTTGGAATTAATTTATATAGTTTGGTCATTTTTCTACTTTACCAAATAAAAAACGCCTGGTCTTTTGAACCAGGCGTTTTAGTTTCTACAGTCAAGCGACTGAGGAAATGTACAGCTATTACATCATGCCGCCCATACCACCCATACCGCCCATACCACCCATATCAGGCATTCCACCAGCACCAGACTCATCTTTTGGTGCCTCGGAGATTGCGCAGTCAGTAGTCAACAAGAGGCCAGCAACAGAAGCAGCATTTACCAATGCAGTTTTTGTTACCTTAGTAGGATCAATAACACCTTGAGCAACCAAGTCGCCATACTCACCAGTAGCAGCGTTGTAACCGTTATTGCCTGTGCTTGCTTGTACCGCATTAACAACTACACCAGCATCTTCACCGGCGTTGCTAACGATAGTACGTAATGGCTCTTGCATAGCGCGCAATACGATGCTGATACCAGCGTCTTGATCAGCGTTATCGCCTTTCAATCCCTTGATACCTTGCATAGCACGAATCAATGCTACGCCACCGCCAGGAACAATACCTTCTTCCACGGCAGCACGAGTTGCATGCAATGCATCATCAACGCGAGCTTTCTTCTCTTTCATTTCAA
CTTCAGTAGCAGCGCCAACACGAATCACCGCAACACCGCCAGCTAATTTAGCTACACGCTCTTGCAATTTTTCCTTGTCGTAATCGCTAGTCGCTTCTTCGATCTGAACACGAATGTTCTTCACGCGAGCTTCAATCGCTTTAGCATCGCCAGCACCATCAATGATGATGGTATTTTCTTTGCCTACTTCGATACGCTTTGCTTGACCCAAGTGCTCAAGAGTTGTTTTCTCGAGTGTCAGGCCGATTTCTTCAGCGATAACAGTTCCGCCAGTCAAGATAGCGATGTCTTCCAACATGGCTTTACGACGATCACCAAATCCTGGAGCCTTAACAGCACAAGTCTTGATAATGCCGCGAATGTTGTTTACAACCAAAGTTGCTAAGGCTTCGCCTTCAACATCTTCAGCAATGATCAACAAAGGACGGCCAGACTTTGCTACTTGCTCGAGTACTGGTAACAAATCACGGATGTTAGCAATCTTCTTATCAAACAAGAGAACGTATGGGCTTTCCAATACAGCAACTTGCTTTTCTGGTTGATTGATGAAGTATGGAGAAAGGTAGCCACGATCAAACTGCATACCTTCAACGACTTCGAGCTCGTCCTCTAAAGACTTACCATCTTCAACAGTGATAACGCCTTCTTTACCTACTTTTTCCATTGCTTCTGCAATGCGCTGACCAATACTATGATCGCTGTTTGCAGAGATAGAACCTACCTGAGCAATTTCTTTAGTGGTCGTGCAAGGCTTGCTAATTTTTGCGAGCTCTGCGATTGCAGCTGTAACAGCTTTATCGATACCACGCTTCAAGTCCATTGGGTTATGACCTGAAACTACATACTTCATGCCTTCACGAACGATAGACTGAGCCAAAACCGTAGCGGTAGTTGTGCCGTCGCCAGCGATGTCAGCAGTTTTGGAAGCAACTTCCTTAACCATCTGCGCGCCCATGTTTTGCAGCTTGTCTTTAAGTTCGATTTCTTTTGCAACGGACACACCGTCTTTAGTGATGGTAGGGCCGCCGAATGAACGCTCGATCACAACATTGCGACCTTTTGGTCCTAAAGTTGTTTTAACTGCATTAGCAAGAATGTTTACGCCTTCCACCATCTTGGTGCGAGCGCTATCTCCAAATACAACGTCTTTTGCTGCCATGATTAAATTCCTCTCTTAAATACCGAAATTACTTCTGTACAACAGCCATGATGTCGTCTTCACGCATCACGATGAGTTCGTCGCCGTCGACCTTAACGGTTTGACCAGCATATTTGCCAAACAACACGCGATCGCCTACTTTGACGTCGAGTGGGTTCAACTTACCGGCTTCATCTCGTTTTCCAGGACCTACTGCCAATATTTCACCTTGATCAGGCTTTTCTGCAGCAGCATCAGGAATGATGATTCCAGAAGCAGTTTTTGATTCTTGATCTAAACGCTTGATGATTACGCGATCATGTAAGGGACGCAAATTCATCTCTTCTCCTATGTTAGTAAGTGTTAACTATTTAAAAAACTATATAAATCAATGCTTTACAATCTCAGTACAAGGAATGTAAGCGAAATTTGGTAATAAATTCCTGCTTTAGCACTCGCATGTAGGGAGTGCTGATTATATAGGTCTGATTCCCTGGATTTCAAGAGCAGATCCCCATAAATTCTTTAAGGATTTAGCTGTGGTTCAAACTTTAACTTGAGACTTTCAAAGTTCAATTTGGGAGCAGGCCTCTAGGTAACATAAGGCCCCTTGAGATTCAGGCTGAAAATTGCACGAATTTTGAGCAATTTCTAAGCCTATTAATTTTTAAAAAATTTAAATAAATAAATTTGGTATCACGATCATTTATTTGATTTCGGCATTTTCCTTAGGTTATTGATTTGATTGGGAATTAGACTCCCAATTTAAACTTTGGAAAAAAGTTGATAAACCCGTAAATCCTTTATAGATCTGGCTTTCACGTCTATCAAATTTGGGAACAAAAAAAATTAGCAGTACTTT
AATATTTCACTGCTAATTTATGTGATGTAGGCCCTTAACTGCCAGCCCGAAGAAAATTTAATTAATTGAAAGAAGCAATCGATCCCGTAACGAGTCAAACACAAAATAACGATCCGTTCCTTTAAGCAAAGATCCAAATTTAGTAAGCATTTCCTCGTCAAGCAGGCTAAATGCGTCCATTTCAGCCTTACGCATAGCAATTTGCGTGGGATTAGCAGCCAGTCTATTGCCGGTTAATAAATAAACCGAGTCTGCCCCAATCTTGGCCGCCAATTCAAAGTATGAGGCCCGTGGCTCACGTCGCCCACCTTCATAGGCACCTTGTGCTAACACCTTTACACCAACACGCTTTGCGAACTCAGCCTGAGTTAGACCCAGGCGTTTACGTTCCACACGAATTCGACCGCCTAAATTTTGAGCAAAATCAGGGTTTATTGTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP015017|2035111:2041474|2035111_2035951_-|WP_071539714.1|DBSCAN-SWA MISVEMIGRLGNHMWQYAVCRTLAERNGYEFHVPRSFLGAEIFDCSLGVEVDQTKMQYTDSYMHNSFMAQFYNPNIIHLQDFTKLVGFFQSERYILENKKNIQEWFELKNPNLELIDKLNLTDEVCIINFRGGDYKDIPDVYLSSDYWNNSVEQMKKLHDNLKFIVITDDVDAAEKLFPDFPAYHFSVAEDFLVISYAKYLIIANSTFSWWGAWLNKFCKTVIAPKFWMRHNESNGWWAPGESITKGFLYMSKDGKLYSSSDCIDEVNALGLSYSKFPY >NZ_CP015017|2035111:2041474|2038539_2040192_-|WP_071539716.1|DBSCAN-SWA MAAKDVVFGDSARTKMVEGVNILANAVKTTLGPKGRNVVIERSFGGPTITKDGVSVAKEIELKDKLQNMGAQMVKEVASKTADIAGDGTTTATVLAQSIVREGMKYVVSGHNPMDLKRGIDKAVTAAIAELAKISKPCTTTKEIAQVGSISANSDHSIGQRIAEAMEKVGKEGVITVEDGKSLEDELEVVEGMQFDRGYLSPYFINQPEKQVAVLESPYVLLFDKKIANIRDLLPVLEQVAKSGRPLLIIAEDVEGEALATLVVNNIRGIIKTCAVKAPGFGDRRKAMLEDIAILTGGTVIAEEIGLTLEKTTLEHLGQAKRIEVGKENTIIIDGAGDAKAIEARVKNIRVQIEEATSDYDKEKLQERVAKLAGGVAVIRVGAATEVEMKEKKARVDDALHATRAAVEEGIVPGGGVALIRAMQGIKGLKGDNADQDAGISIVLRAMQEPLRTIVSNAGEDAGVVVNAVQASTGNNGYNAATGEYGDLVAQGVIDPTKVTKTALVNAASVAGLLLTTDCAISEAPKDESGAGGMPDMGGMGGMGGMGGMM >NZ_CP015017|2035111:2041474|2036887_2037691_+|WP_011903639.1|DBSCAN-SWA MPDYTSITVTSIFGHNDGASAIPAILRSMKELPGSKGLLLSTQKPQNLPPQIDWTEILPLDYRQYSLFVMFSLHNFIQTEFCLIVQDDGWVINGKSWKKEYFDYDYIGGPCHAAFVGSELVPAYQWVGTSNPTPLVIQNGGLSLRSKKFLKAPSCHGALYYFSEEQILQNEDVQLTGIYRPQLEELGIKFAPNNLAKQFSVEYLGPIFHDDIDLLSLLAVHGQTRKLIEENTIQITIPKDQLQSIHREEELLNYLSSELHYNIRYIA >NZ_CP015017|2035111:2041474|2037754_2038450_-|WP_081354874.1|DBSCAN-SWA MTKLYKLIPSLLILIAELANSQPIFEKSVSNLRDQRYCEILYGKRHWLNLEVKVFNTQGLNLCPEDQWKTITKEEVAKKYDASFVDLNGPRYWMMDEIQAAGATVNNVKESFGGIEMNLRATVDIGLLKQILGSKSYSPNQINRTTNFIYKAGSPIYELVAPDGQVYVMQSYSQIVNPNLTMKDLPNLAKELKLPTGWVYRSTLLEKDLSLVANGIAYVLQDNLANSYQRR >NZ_CP015017|2035111:2041474|2040220_2040511_-|WP_011903642.1|DBSCAN-SWA MNLRPLHDRVIIKRLDQESKTASGIIIPDAAAEKPDQGEILAVGPGKRDEAGKLNPLDVKVGDRVLFGKYAGQTVKVDGDELIVMREDDIMAVVQK >NZ_CP015017|2035111:2041474|2041090_2041474_-|WP_071539717.1|DBSCAN-SWA 
MTINPDFAQNLGGRIRVERKRLGLTQAEFAKRVGVKVLAQGAYEGGRREPRASYFELAAKIGADSVYLLTGNRLAANPTQIAMRKAEMDAFSLLDEEMLTKFGSLLKGTDRYFVFDSLRDRLLLSIN >NZ_CP015017|2035111:2041474|2036040_2036874_+|WP_071539715.1|DBSCAN-SWA MKIIDAFIFFNEIDTLKIRLGLLYEKVDQFVICESNITHSGQTKKYNFLDRQSEFLPWLDKITFLQYEPDVSHLDFTKKDEAYNPSSASWQIETGQRNYLGSYIRNLNSEDMVMVCDVDEIWNPTFADFIRSGQYELDAARMEMLFHHYYLNCVGISQSNSKWIHAFCAKASYLKTKQNISQIRVGEQLPIVAGIGWHFSYLGGAQKISEKIHAFAHQETNTAEINNLKHLENCINLGIDHLGRPDHEWAFHPLGYYPPDIKAEMEKFPHLIKKSLV |
7 | uncultured_virus(66.67%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
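
The figure legend for each prophage region says proteins are highlighted when their annotation contains a phage-related keyword. A minimal Python sketch of that kind of check follows. The keyword list is copied from the legend, but the function name `phage_keywords_in` and the case-insensitive substring rule are assumptions for illustration, not the database's documented matching logic.

```python
# Keywords copied from the figure legend. The legend says "such as",
# so the real list used by the database may be longer.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def phage_keywords_in(annotation: str) -> list[str]:
    """Return the legend keywords found in a protein annotation string.

    Assumption: a plain case-insensitive substring test. This is crude
    (for example, 'template' would match 'plate'); stricter word-boundary
    matching may be closer to what a production pipeline does.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


# The Key_proteins column of DBSCAN-SWA_1 lists 'integrase'.
print(phage_keywords_in("integrase"))
print(phage_keywords_in("hypothetical protein"))
```

Under this rule, regions whose Key_proteins column reads `NA` (DBSCAN-SWA_2 and DBSCAN-SWA_3) would be those where no protein annotation matched any keyword.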
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
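
The protein FASTA headers in the prophage regions above appear to follow a pipe-delimited pattern: contig, region span, gene span with strand, protein accession, an optional key-protein label (for example `integrase`), and the tool tag `DBSCAN-SWA`. The Python sketch below parses such a header. The field interpretation is inferred from the records on this page rather than from a format specification, and `ProteinHeader` and `parse_protein_header` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ProteinHeader:
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    protein_id: str
    key_label: Optional[str]  # e.g. 'integrase'; None when absent


def parse_protein_header(header: str) -> ProteinHeader:
    """Parse a header such as
    '>NZ_CP015017|1380998:1387992|1386445_1387459_+|WP_071539371.1|integrase|DBSCAN-SWA'.

    Assumed layout (inferred from this page): 5 fields, or 6 when a
    key-protein label sits before the trailing 'DBSCAN-SWA' tag.
    """
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6) or fields[-1] != "DBSCAN-SWA":
        raise ValueError(f"unexpected header layout: {header!r}")
    contig, region, gene, protein_id = fields[:4]
    key_label = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(gene_start), int(gene_end), strand,
                         protein_id, key_label)


h = parse_protein_header(
    ">NZ_CP015017|1380998:1387992|1386445_1387459_+|WP_071539371.1|integrase|DBSCAN-SWA"
)
print(h.protein_id, h.key_label, h.gene_end - h.gene_start)
```

Parsed coordinates make it straightforward to check, for instance, that each gene span falls inside its region span before using the record downstream.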