Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
LS483402 | Pseudomonas aeruginosa strain NCTC13718 genome assembly, chromosome: 1 | 3 crisprs | csa3,cas2,cas1,cas3,cas6e,cas8e,cse2gr11,cas7,cas5,csb3,csb2gr5,csb1gr7,DEDDh,WYL | 0 | 29 | 5 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LS483402_1 | 16480-18584 | Orphan |
I-C,I-E,II-B
Consensus repeat of LS483402_1
|
34 spacers
spacers of LS483402_1
>1.1|16509|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CGGGTTGATCGATTTGAAAGCTGAACGTGATA >1.2|16570|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CGCCACCGCCACTTCAATCGCTATAGCAACCT >1.3|16631|33|LS483402|PILER-CR,CRISPRCasFinder,CRT CCGATAAGTGGGACAGCACGAAGGGTAACCCGC >1.4|16693|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CGAATTCTTGATCTCATCGACGGACATTTAAC >1.5|16754|32|LS483402|PILER-CR,CRISPRCasFinder,CRT TTACAGGCCGAGGAGTTATTTTTCATGGCTAA >1.6|16815|32|LS483402|PILER-CR,CRISPRCasFinder,CRT TGGTGATGCTAAGTGGGTTGAGTCGGGCAGTC >1.7|16876|32|LS483402|PILER-CR,CRISPRCasFinder,CRT AAAAAGGGCAAAGTTGATCAGGTACGTGTGGG >1.8|16937|32|LS483402|PILER-CR,CRISPRCasFinder,CRT AGGCTTTGAAGGATTCTGTGGCTGGCGCGATT >1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CAATCGCAACAGCACCTATACCATCGACTTCA >1.10|17059|32|LS483402|PILER-CR,CRISPRCasFinder,CRT TCGCGCCTTCAGCTCTTCTATCTCCGCAAGAA >1.11|17120|32|LS483402|PILER-CR,CRISPRCasFinder,CRT AGAATCCTGGCTCGTGAGCTTATCCTCAAGCT >1.12|17181|32|LS483402|PILER-CR,CRISPRCasFinder,CRT TCAAGGAGCACCTGAAATGCTAAATCCCTACG >1.13|17242|32|LS483402|PILER-CR,CRISPRCasFinder,CRT AATCCAAATCAGGGTCGAAAGATGCGAAAGCG >1.14|17303|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GTTTTCATTTGTACGGTAGGCGGGTACAGATG >1.15|17364|33|LS483402|PILER-CR,CRISPRCasFinder,CRT AGGATTGGAATCAGATTGCTAATGCGATCCCTG >1.16|17426|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GGGAGGCCACATCGCGGGCTATGTCTGCGGAT >1.17|17487|32|LS483402|PILER-CR,CRISPRCasFinder,CRT ATTAAGCGTTTTGAGGGAAGGTGAAAGCGATA >1.18|17548|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GTGGCGAAAGCGGAGAAGATGTGCGTGTTTTT >1.19|17609|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GATCACACGATCACCCTTCGCTAGGGAGTTCG >1.20|17670|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CCACTCCATGAAAACATCCTCCTATCACCAAA >1.21|17731|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CAATCGGCTGGCCTATAGTGTTCAAAACTTCC >1.22|17792|32|LS483402|PILER-CR,CRISPRCasFinder,CRT ATGTCTTTATCCTTATGTAGTGGGTGGGTTTT >1.23|17853|32|LS483402|PILER-CR,CRISPRCasFinder,CRT AGACTCTGGCACGTCGTACCATGCGAGGACCA >1.24|17914|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GCGCCCAATATCTGCCAAAGCCTCCGATGTGC >1.25|17975|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GGTGCCAATGGCGGGCTGGTAGCTGTCTACCA >1.26|18036|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GACTGTGGCGAATTCTCGCAGGAAGGAGCTGG >1.27|18097|32|LS483402|PILER-CR,CRISPRCasFinder,CRT ATCAACGGTGAGCTGCGAAATAAGCTCGGCGC >1.28|18158|32|LS483402|PILER-CR,CRISPRCasFinder,CRT TGATAGTGATTAGCTGGTCAATCAGTGTTTCT >1.29|18219|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CTTGGACGTGTCCCGATCGTCATGATGATTAA >1.30|18280|32|LS483402|PILER-CR,CRISPRCasFinder,CRT TGTTGAATCTGGCATCGACGAAGACGGAAAGC >1.31|18341|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CAGCCGATCTCCTACCCACTCTGTGGTGCTTG >1.32|18402|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CTACAAACTTTTCTGCAAACGCCACCTCCTCA >1.33|18463|32|LS483402|PILER-CR,CRISPRCasFinder,CRT ATCATTGCCCAGCTCACGAGCACGCTCGGCGG >1.34|18524|32|LS483402|CRISPRCasFinder,CRT GATACGCTTAGACATAACACCGGCGGCATCAG |
DinG |
CRISPR arrays and Neighbor proteins around LS483402_1
The CRISPR arrays of LS483402_1 >merge|LS483402|1|16480-18584|PILER-CR,CRISPRCasFinder,CRT CTTTTCTCCGCGTATGCGGAGGTAGTTCCCGGGTTGATCGATTTGAAAGCTGAACGTGATACTTTTCTCCGCGTATGCGGAGGTAGTTCCCGCCACCGCCACTTCAATCGCTATAGCAACCTCTTTTCTCCGCGTATGCGGAGGTAGTTCACCGATAAGTGGGACAGCACGAAGGGTAACCCGCCTTTTCTCCGCGTATGCGGAGGTAGTTCCCGAATTCTTGATCTCATCGACGGACATTTAACCTTTTCTCCGCGTATGCGGAGGTAGTTCCTTACAGGCCGAGGAGTTATTTTTCATGGCTAACTTTTCTCCGCGTATGCGGAGGTAGTTCCTGGTGATGCTAAGTGGGTTGAGTCGGGCAGTCCTTTTCTCCGCGTATGCGGAGGTAGTTCCAAAAAGGGCAAAGTTGATCAGGTACGTGTGGGCTTTTCTCCGCGTATGCGGAGGTAGTTCCAGGCTTTGAAGGATTCTGTGGCTGGCGCGATTCTTTTCTCCGCGTATGCGGAGGTAGTTCCCAATCGCAACAGCACCTATACCATCGACTTCACTTTTCTCCGCGTATGCGGAGGTAGTTCCTCGCGCCTTCAGCTCTTCTATCTCCGCAAGAACTTTTCTCCGCGTATGCGGAGGTAGTTCCAGAATCCTGGCTCGTGAGCTTATCCTCAAGCTCTTTTCTCCGCGTATGCGGAGGTAGTTCCTCAAGGAGCACCTGAAATGCTAAATCCCTACGCTTTTCTCCGCGTATGCGGAGGTAGTTCCAATCCAAATCAGGGTCGAAAGATGCGAAAGCGCTTTTCTCCGCGTATGCGGAGGTAGTTCCGTTTTCATTTGTACGGTAGGCGGGTACAGATGCTTTTCTCCGCGTATGCGGAGGTAGTTCCAGGATTGGAATCAGATTGCTAATGCGATCCCTGCTTTTCTCCGCGTATGCGGAGGTAGTTCCGGGAGGCCACATCGCGGGCTATGTCTGCGGATCTTTTCTCCGCGTATGCGGAGGTAGTTCCATTAAGCGTTTTGAGGGAAGGTGAAAGCGATACTTTTCTCCGCGTATGCGGAGGTAGTTCCGTGGCGAAAGCGGAGAAGATGTGCGTGTTTTTCTTTTCTCCGCGTATGCGGAGGTAGTTCCGATCACACGATCACCCTTCGCTAGGGAGTTCGCTTTTCTCCGCGTATGCGGAGGTAGTTCCCCACTCCATGAAAACATCCTCCTATCACCAAACTTTTCTCCGCGTATGCGGAGGTAGTTCCCAATCGGCTGGCCTATAGTGTTCAAAACTTCCCTTTTCTCCGCGTATGCGGAGGTAGTTCCATGTCTTTATCCTTATGTAGTGGGTGGGTTTTCTTTTCTCCGCGTATGCGGAGGTAGTTCCAGACTCTGGCACGTCGTACCATGCGAGGACCACTTTTCTCCGCGTATGCGGAGGTAGTTCCGCGCCCAATATCTGCCAAAGCCTCCGATGTGCCTTTTCTCCGCGTATGCGGAGGTAGTTCCGGTGCCAATGGCGGGCTGGTAGCTGTCTACCACTTTTCTCCGCGTATGCGGAGGTAGTTCCGACTGTGGCGAATTCTCGCAGGAAGGAGCTGGCTTTTCTCCGCGTATGCGGAGGTAGTTCCATCAACGGTGAGCTGCGAAATAAGCTCGGCGCCTTTTCTCCGCGTATGCGGAGGTAGTTCTTGATAGTGATTAGCTGGTCAATCAGTGTTTCTCTTTTCTCCGCGTATGCGGAGGCAGTTCCCTTGGACGTGTCCCGATCGTCATGATGATTAACTTTTCTCCGCGTATGCGGAGGTAGTTCCTGTTGAATCTGGCATCGACGAAGACGGAAAGCCTTTTCTCCGCGTATGCGGAGGTAGTTCCCAGCCGATCTCCTACCCACTCTGTGGTGCTTGCTTTTCTCCGCGTATGCGGAGGTAGTTCCCTACAAACTTTTCTGCAAACGCCACCTCCTCACTTTTCTCCGCGTATGCGGAGGTAGTTCCATCATTGCCCAGCTCACGAGCACGCTCGGCGGCTTTTCTCCGCGCATGCGGAGGTAGTTCCGATACGCTTAGACATAACACCGGCGGCATCAGCTTTTCTCCGCGTATGCGGAAAGTAAGCC >LS483402|1|1|16480-18523|PILER-CR CTTTTCTCCGCGTATGCGGAGGTAGTTCC CGGGTTGATCGATTTGAAAGCTGAACGTGATA CTTTTCTCCGCGTATGCGGAGGTAGTTCC CGCCACCGCCACTTCAATCGCTATAGCAACCT CTTTTCTCCGCGTATGCGGAGGTAGTTCA CCGATAAGTGGGACAGCACGAAGGGTAACCCGC CTTTTCTCCGCGTATGCGGAGGTAGTTCC CGAATTCTTGATCTCATCGACGGACATTTAAC CTTTTCTCCGCGTATGCGGAGGTAGTTCC TTACAGGCCGAGGAGTTATTTTTCATGGCTAA CTTTTCTCCGCGTATGCGGAGGTAGTTCC TGGTGATGCTAAGTGGGTTGAGTCGGGCAGTC CTTTTCTCCGCGTATGCGGAGGTAGTTCC AAAAAGGGCAAAGTTGATCAGGTACGTGTGGG CTTTTCTCCGCGTATGCGGAGGTAGTTCC AGGCTTTGAAGGATTCTGTGGCTGGCGCGATT CTTTTCTCCGCGTATGCGGAGGTAGTTCC CAATCGCAACAGCACCTATACCATCGACTTCA CTTTTCTCCGCGTATGCGGAGGTAGTTCC TCGCGCCTTCAGCTCTTCTATCTCCGCAAGAA CTTTTCTCCGCGTATGCGGAGGTAGTTCC AGAATCCTGGCTCGTGAGCTTATCCTCAAGCT CTTTTCTCCGCGTATGCGGAGGTAGTTCC TCAAGGAGCACCTGAAATGCTAAATCCCTACG CTTTTCTCCGCGTATGCGGAGGTAGTTCC AATCCAAATCAGGGTCGAAAGATGCGAAAGCG CTTTTCTCCGCGTATGCGGAGGTAGTTCC GTTTTCATTTGTACGGTAGGCGGGTACAGATG CTTTTCTCCGCGTATGCGGAGGTAGTTCC AGGATTGGAATCAGATTGCTAATGCGATCCCTG CTTTTCTCCGCGTATGCGGAGGTAGTTCC GGGAGGCCACATCGCGGGCTATGTCTGCGGAT CTTTTCTCCGCGTATGCGGAGGTAGTTCC ATTAAGCGTTTTGAGGGAAGGTGAAAGCGATA CTTTTCTCCGCGTATGCGGAGGTAGTTCC GTGGCGAAAGCGGAGAAGATGTGCGTGTTTTT CTTTTCTCCGCGTATGCGGAGGTAGTTCC GATCACACGATCACCCTTCGCTAGGGAGTTCG CTTTTCTCCGCGTATGCGGAGGTAGTTCC CCACTCCATGAAAACATCCTCCTATCACCAAA CTTTTCTCCGCGTATGCGGAGGTAGTTCC CAATCGGCTGGCCTATAGTGTTCAAAACTTCC CTTTTCTCCGCGTATGCGGAGGTAGTTCC ATGTCTTTATCCTTATGTAGTGGGTGGGTTTT CTTTTCTCCGCGTATGCGGAGGTAGTTCC AGACTCTGGCACGTCGTACCATGCGAGGACCA CTTTTCTCCGCGTATGCGGAGGTAGTTCC GCGCCCAATATCTGCCAAAGCCTCCGATGTGC CTTTTCTCCGCGTATGCGGAGGTAGTTCC GGTGCCAATGGCGGGCTGGTAGCTGTCTACCA CTTTTCTCCGCGTATGCGGAGGTAGTTCC GACTGTGGCGAATTCTCGCAGGAAGGAGCTGG CTTTTCTCCGCGTATGCGGAGGTAGTTCC ATCAACGGTGAGCTGCGAAATAAGCTCGGCGC CTTTTCTCCGCGTATGCGGAGGTAGTTCT TGATAGTGATTAGCTGGTCAATCAGTGTTTCT CTTTTCTCCGCGTATGCGGAGGCAGTTCC CTTGGACGTGTCCCGATCGTCATGATGATTAA CTTTTCTCCGCGTATGCGGAGGTAGTTCC TGTTGAATCTGGCATCGACGAAGACGGAAAGC CTTTTCTCCGCGTATGCGGAGGTAGTTCC CAGCCGATCTCCTACCCACTCTGTGGTGCTTG CTTTTCTCCGCGTATGCGGAGGTAGTTCC CTACAAACTTTTCTGCAAACGCCACCTCCTCA CTTTTCTCCGCGTATGCGGAGGTAGTTCC ATCATTGCCCAGCTCACGAGCACGCTCGGCGG CTTTTCTCCGCGCATGCGGAGGTAGTTCC >LS483402|1|1|16480-18584|CRISPRCasFinder CTTTTCTCCGCGTATGCGGAGGTAGTTCC CGGGTTGATCGATTTGAAAGCTGAACGTGATA CTTTTCTCCGCGTATGCGGAGGTAGTTCC CGCCACCGCCACTTCAATCGCTATAGCAACCT CTTTTCTCCGCGTATGCGGAGGTAGTTCA CCGATAAGTGGGACAGCACGAAGGGTAACCCGC CTTTTCTCCGCGTATGCGGAGGTAGTTCC CGAATTCTTGATCTCATCGACGGACATTTAAC CTTTTCTCCGCGTATGCGGAGGTAGTTCC TTACAGGCCGAGGAGTTATTTTTCATGGCTAA CTTTTCTCCGCGTATGCGGAGGTAGTTCC TGGTGATGCTAAGTGGGTTGAGTCGGGCAGTC CTTTTCTCCGCGTATGCGGAGGTAGTTCC AAAAAGGGCAAAGTTGATCAGGTACGTGTGGG CTTTTCTCCGCGTATGCGGAGGTAGTTCC AGGCTTTGAAGGATTCTGTGGCTGGCGCGATT CTTTTCTCCGCGTATGCGGAGGTAGTTCC CAATCGCAACAGCACCTATACCATCGACTTCA CTTTTCTCCGCGTATGCGGAGGTAGTTCC TCGCGCCTTCAGCTCTTCTATCTCCGCAAGAA CTTTTCTCCGCGTATGCGGAGGTAGTTCC AGAATCCTGGCTCGTGAGCTTATCCTCAAGCT CTTTTCTCCGCGTATGCGGAGGTAGTTCC TCAAGGAGCACCTGAAATGCTAAATCCCTACG CTTTTCTCCGCGTATGCGGAGGTAGTTCC AATCCAAATCAGGGTCGAAAGATGCGAAAGCG CTTTTCTCCGCGTATGCGGAGGTAGTTCC GTTTTCATTTGTACGGTAGGCGGGTACAGATG CTTTTCTCCGCGTATGCGGAGGTAGTTCC AGGATTGGAATCAGATTGCTAATGCGATCCCTG CTTTTCTCCGCGTATGCGGAGGTAGTTCC GGGAGGCCACATCGCGGGCTATGTCTGCGGAT CTTTTCTCCGCGTATGCGGAGGTAGTTCC ATTAAGCGTTTTGAGGGAAGGTGAAAGCGATA CTTTTCTCCGCGTATGCGGAGGTAGTTCC GTGGCGAAAGCGGAGAAGATGTGCGTGTTTTT CTTTTCTCCGCGTATGCGGAGGTAGTTCC GATCACACGATCACCCTTCGCTAGGGAGTTCG CTTTTCTCCGCGTATGCGGAGGTAGTTCC CCACTCCATGAAAACATCCTCCTATCACCAAA CTTTTCTCCGCGTATGCGGAGGTAGTTCC CAATCGGCTGGCCTATAGTGTTCAAAACTTCC CTTTTCTCCGCGTATGCGGAGGTAGTTCC ATGTCTTTATCCTTATGTAGTGGGTGGGTTTT CTTTTCTCCGCGTATGCGGAGGTAGTTCC AGACTCTGGCACGTCGTACCATGCGAGGACCA CTTTTCTCCGCGTATGCGGAGGTAGTTCC GCGCCCAATATCTGCCAAAGCCTCCGATGTGC CTTTTCTCCGCGTATGCGGAGGTAGTTCC GGTGCCAATGGCGGGCTGGTAGCTGTCTACCA CTTTTCTCCGCGTATGCGGAGGTAGTTCC GACTGTGGCGAATTCTCGCAGGAAGGAGCTGG CTTTTCTCCGCGTATGCGGAGGTAGTTCC ATCAACGGTGAGCTGCGAAATAAGCTCGGCGC CTTTTCTCCGCGTATGCGGAGGTAGTTCT TGATAGTGATTAGCTGGTCAATCAGTGTTTCT CTTTTCTCCGCGTATGCGGAGGCAGTTCC CTTGGACGTGTCCCGATCGTCATGATGATTAA CTTTTCTCCGCGTATGCGGAGGTAGTTCC TGTTGAATCTGGCATCGACGAAGACGGAAAGC CTTTTCTCCGCGTATGCGGAGGTAGTTCC CAGCCGATCTCCTACCCACTCTGTGGTGCTTG CTTTTCTCCGCGTATGCGGAGGTAGTTCC CTACAAACTTTTCTGCAAACGCCACCTCCTCA CTTTTCTCCGCGTATGCGGAGGTAGTTCC ATCATTGCCCAGCTCACGAGCACGCTCGGCGG CTTTTCTCCGCGCATGCGGAGGTAGTTCC GATACGCTTAGACATAACACCGGCGGCATCAG CTTTTCTCCGCGTATGCGGAAAGTAAGCC >LS483402|1|1|16480-18584|CRT CTTTTCTCCGCGTATGCGGAGGTAGTTCC CGGGTTGATCGATTTGAAAGCTGAACGTGATA CTTTTCTCCGCGTATGCGGAGGTAGTTCC CGCCACCGCCACTTCAATCGCTATAGCAACCT CTTTTCTCCGCGTATGCGGAGGTAGTTCA CCGATAAGTGGGACAGCACGAAGGGTAACCCGC CTTTTCTCCGCGTATGCGGAGGTAGTTCC CGAATTCTTGATCTCATCGACGGACATTTAAC CTTTTCTCCGCGTATGCGGAGGTAGTTCC TTACAGGCCGAGGAGTTATTTTTCATGGCTAA CTTTTCTCCGCGTATGCGGAGGTAGTTCC TGGTGATGCTAAGTGGGTTGAGTCGGGCAGTC CTTTTCTCCGCGTATGCGGAGGTAGTTCC AAAAAGGGCAAAGTTGATCAGGTACGTGTGGG CTTTTCTCCGCGTATGCGGAGGTAGTTCC AGGCTTTGAAGGATTCTGTGGCTGGCGCGATT CTTTTCTCCGCGTATGCGGAGGTAGTTCC CAATCGCAACAGCACCTATACCATCGACTTCA CTTTTCTCCGCGTATGCGGAGGTAGTTCC TCGCGCCTTCAGCTCTTCTATCTCCGCAAGAA CTTTTCTCCGCGTATGCGGAGGTAGTTCC AGAATCCTGGCTCGTGAGCTTATCCTCAAGCT CTTTTCTCCGCGTATGCGGAGGTAGTTCC TCAAGGAGCACCTGAAATGCTAAATCCCTACG CTTTTCTCCGCGTATGCGGAGGTAGTTCC AATCCAAATCAGGGTCGAAAGATGCGAAAGCG CTTTTCTCCGCGTATGCGGAGGTAGTTCC GTTTTCATTTGTACGGTAGGCGGGTACAGATG CTTTTCTCCGCGTATGCGGAGGTAGTTCC AGGATTGGAATCAGATTGCTAATGCGATCCCTG CTTTTCTCCGCGTATGCGGAGGTAGTTCC GGGAGGCCACATCGCGGGCTATGTCTGCGGAT CTTTTCTCCGCGTATGCGGAGGTAGTTCC ATTAAGCGTTTTGAGGGAAGGTGAAAGCGATA CTTTTCTCCGCGTATGCGGAGGTAGTTCC GTGGCGAAAGCGGAGAAGATGTGCGTGTTTTT CTTTTCTCCGCGTATGCGGAGGTAGTTCC GATCACACGATCACCCTTCGCTAGGGAGTTCG CTTTTCTCCGCGTATGCGGAGGTAGTTCC CCACTCCATGAAAACATCCTCCTATCACCAAA CTTTTCTCCGCGTATGCGGAGGTAGTTCC CAATCGGCTGGCCTATAGTGTTCAAAACTTCC CTTTTCTCCGCGTATGCGGAGGTAGTTCC ATGTCTTTATCCTTATGTAGTGGGTGGGTTTT CTTTTCTCCGCGTATGCGGAGGTAGTTCC AGACTCTGGCACGTCGTACCATGCGAGGACCA CTTTTCTCCGCGTATGCGGAGGTAGTTCC GCGCCCAATATCTGCCAAAGCCTCCGATGTGC CTTTTCTCCGCGTATGCGGAGGTAGTTCC GGTGCCAATGGCGGGCTGGTAGCTGTCTACCA CTTTTCTCCGCGTATGCGGAGGTAGTTCC GACTGTGGCGAATTCTCGCAGGAAGGAGCTGG CTTTTCTCCGCGTATGCGGAGGTAGTTCC ATCAACGGTGAGCTGCGAAATAAGCTCGGCGC CTTTTCTCCGCGTATGCGGAGGTAGTTCT TGATAGTGATTAGCTGGTCAATCAGTGTTTCT CTTTTCTCCGCGTATGCGGAGGCAGTTCC CTTGGACGTGTCCCGATCGTCATGATGATTAA CTTTTCTCCGCGTATGCGGAGGTAGTTCC TGTTGAATCTGGCATCGACGAAGACGGAAAGC CTTTTCTCCGCGTATGCGGAGGTAGTTCC CAGCCGATCTCCTACCCACTCTGTGGTGCTTG CTTTTCTCCGCGTATGCGGAGGTAGTTCC CTACAAACTTTTCTGCAAACGCCACCTCCTCA CTTTTCTCCGCGTATGCGGAGGTAGTTCC ATCATTGCCCAGCTCACGAGCACGCTCGGCGG CTTTTCTCCGCGCATGCGGAGGTAGTTCC GATACGCTTAGACATAACACCGGCGGCATCAG CTTTTCTCCGCGTATGCGGAAAGTAAGCC
>LS483402.1|SQG55661.1|14893_16171_-|phosphoserine-phosphatase MIELDQPQVTVSLRPELTPAVITISGQDRQGVSAAAFRVLAANGVQILDVEQSQFRGFLGLAVFAGVEAAGVETLEIGLKETLKTYGQSVKIELQEVAQSSRPRSTHEVVILGDPVEAHDLSRIAQTLANFDANIDTIRGISDYPVTGLELKITVANRELGAAMPLRKALAELTTELGVDIAIERAGLLRRSKRLICFDCDSTLITGEVIEMLAAHAGREKEVAEVTERAMRGELDFEESLRERVKALAGLDASVIGEVADSIELTPGARTTIRTLKRLGYKTAVVSGGFIQVLEGLAEDLGLDYVRANTLEIVDGKLTGRVIGKVVDRAAKAEFLEEFARESGIEMHQTVAVGDGANDIDMISAAGLGIAFNAKPALREIADTSVNSPFLDEVVHMLGITRADIDAADDSDGKIRRVPLPQQKN >LS483402.1|SQG55660.1|14396_14891_+|N-acetyltransferase-GCN5 MAHVPTLSNESIRLRPLVLSDAHDLTATCVDPLTQKYTTIPAGYTLANAEEFITTEHDNLRWVITNRTSDRFCGQIELRPLAGEHNAMDVGYMTAPWSRGQGLMTSALLLAVDYAFSLGIRRIELRTDPQNKASQRVAEKAGFLYQGLHNDFTVYSLLTDDYRS >LS483402.1|SQG55659.1|12383_14375_+|putative-helicase MGAASTLCPMSDSPLALSTDELLAAAVAALGGSRRNGQVSMANAVTKALESERHLAVQAGTGTGKSLAYLVPALRHAQATDSTIIVSTATIALQRQLVERDLPRLADALEPHMSRRPTFAIMKGRANYVCMNKIAAAEEPEDALIDEEDLSWLGKHVARIYEWANETEVGDRDSLDPGVPDLAWRQVSVSAQECIGASRCPHGEDCFAEIARKKAHDVDVIVTNHALLAIDALSDVNVLPEHEVVIVDEAHELDGRITAVATNEIGVTALTMSSRRAGKLGAGDKDQKLIDISKEWEDAMLAVEPGRLTSLPESLKQQTIALRDAIWSLREHVSRVPEGEAANDPERHAERMSLSNHLGDQHDSVVRILSVFEEEDAASQEDVVWVLHDDRRGVMIKVAPLSIAGLLHARLFSENTVVLASATLNIGGNFNAMAASWGLPKGSWDSLDAGTPFDPAKSGILYTPNSLPDPGRDGLSPEVIDEIYDLIMAAGGRTLGLFSSRRAAQQATEAMRTRLPFDVLCQGDDTTGALVEKFSKQENTCLFGTLSLWQGVDVPGKACSLVIIDRIPFPRPDDPLLQARKDAADAEGRNGFMEVAATHAALLIAQGAGRLLRSVTDRGVVAILDRRIVTKRYGAFFIKSLPAFWRTNDPQVVRGALARLVAK >LS483402.1|SQG55657.1|10988_12344_+|nicotinate-phosphoribosyltransferase MTCASALIVFKAKLVCVTEFESTALLTDMYELTMLQSALADGTAYRNCTFEVFSRRLPNERRYGVVAGTARVLEAIKRYRFTEKQLASLTFLDATTIDFLRSYEFKGQIDGYREGELYFPSSPILTIRGTFAECVILETLILSIMNADSAVASAAARMVTAADGRPIFEMGSRRTHEYAAVTAARAAYLAGFVGTSNLEAVYRYGIPGSGTAAHAWTLLHVNDDGTPNEPAAFQSQINVLGVGTTLLVDTYDIAKGVKTAIEIAGPQLGAVRIDSGDLGVMTRKVRQELDSLGAHNTGIVVSSDLDEYAIAGLRGNPVDAFGVGTSVVTGSGAPTAGMVYKLVEVDGHPVAKRSRGKAMVGGTKRAVRTHRATGTAVEEIVFPYDHETPQIGQLNSYELTIPLMRNGIVVDNLPTLEESRAYLAEQLITLPWEGLALSKDEPVLSTRFIGF >LS483402.1|SQG55656.1|10483_10864_-|ATP-dependent-Clp-protease-adaptor-protein-ClpS MQVNQEDVTHSLNELPSVVLAPTMDVVVSSPMATPELDEDLSVDVASSENLPWMCIVWDDPVNLMSYVTYVFQTILGYSKRRAIELMMQVHTEGKAVVSSGERDKVEGDVKKLHTAGLWATMQQGG >LS483402.1|SQG55655.1|9848_10385_-|Domain-of-uncharacterised-function-(DUF2017) MQPWKKKKGLMRGAHFVCVFEPMEREVLGNLASTVSEALIHRAQTAPKDELAELTGMPSGHKEAPTDPALARLLPDFEKEGDEEFEGDNSLLRCLHETDITRAKVEHLQVLGQSLGPDGGVHVDITEPEAHAWVAALNDIRLYVASGEVFGEEAEQDRDNLVEWLAYNQESLLNAMMG >LS483402.1|SQG55654.1|8944_9763_-|peptidase,-S54-(rhomboid)-family MELRDFFTRLCFIMTNRFNPYAQSDRNTYGGVSTSGGYLPHEYGAQYLPTPGYSADRVTQRGVNTSSWRSMGRKRLVDATVLALGYVVIIWAVHIVNTVFFGGTLAQGLGVHPLDGASIWHIFTSPLVHGNYMHLSANTLPGLIFVFLIGLSGRRAFWEVTMIAAVVGGMGTWIFGGIGTTHIGASGLIYGWLAYLVVRGIFNRSFSQVLLGMVLAFIYGGLIWGVLPGDVGVSWQAHLFGAIGGLIAGATITSDDPPALKARREQRALERS >LS483402.1|SQG55652.1|7984_8878_-|glutamate-racemase MDYVIQDHKSDSLKTEMPGVEPSIVYEGTIDASSPIGIFDSGVGGLTVARAIMEQLPQESVIYIGDTAHSPYGPKPIAQVRELSMRIGDELVARGCKMIVIACNTATSAALRDLRERYSIPVVGVILPAVRRAVATTRNGKIGVLGTQGTIASGAYQELFAASPGVDVYAQACPSFVSFVERGITSGRQILGVAQGYTEGLQAAGVDTLVLGCTHYPLLTGVIQLAVGDNVTLISSSEECVKDVLKTLSCNDMLADAATDKQPIRSFESTGDPALFEQLAMRFLGPHVTHVEKLREV >LS483402.1|SQG55651.1|7144_7912_-|Ribonuclease-Z MKLIILGSSGSLGAPDNAASGYLIQMDNAPSILMDMGPGVLAQLERVQNPSDAHVVFSHLHADHCVDFPSLLVWRRYHPTAAAKGRNLCFGPTDTPIRMGRLSADSVDNIDDMSDTFAFTPWENAQEELVGAVSITPYSVIHPIETFALRVEHKRSGKIIAYSGDSSYTENLIECARNADVFLCEATWGETSEGKAPNMHMSGAEAGRIARLAGVKRLVLVHIPPWGNAEAALEKARSEYDGPIDISYQGMEINI >LS483402.1|SQG55650.1|6371_7106_-|ribonuclease-PH MTTSNFKRADGRAVDQMRTVKITRGFTTNPAGSVLVEFGNTRVMCTASAEIGVPRFKRDSGEGWLTAEYAMLPAATLDRNPRESMRGKVKGRTHEISRLIGRSLRAAVDLSELGENTINIDCDVLQADGGTRTASITGAYVALADAITHLQKQGVVPGNPLKDPVAAVSVGVIDGTVCLDLPYEEDSRADVDMNVIMQSGRFVEIQGTGEHNTFDRDELARILDFAEKGCAELVEVQKAVLGIA >LS483402.1|SQG55667.1|18739_19042_+|Uncharacterised-protein MNTYQKIKEKTGGKIYLGVGLQPGNQDEIAIEACYIEALIDYGYLDVELKKEFLKLWLTDDMYDDLSDLDSIELKTYRNLLKYAGMQPRVDTSVFRPEPA >LS483402.1|SQG55668.1|19608_19953_+|Uncharacterised-protein MHLQTTVTRVLITTTIALSFALASTSTLAHAETKSVPIACQELQADVEAWTKQLKEAESSHDQLSKNGHSKDAIKRYLERSQQYFQECVSSPPKHLHFELSSALPMFMSNLSSS >LS483402.1|SQG55669.1|20095_21817_-|cytochrome-c-oxidase-subunit-I MTAVAPRLENYSEPTRPAPTGGARKGTLAWKMLTTTDHKQLGIMYIIMSFVFFFLGGLMALLIRAELFSPGLQYLSNEQFNQLFTMHGTVMLLLFGTPIVWGFANYILPLQIGAPDVAFPRLNAFGFWVTMIGAAAMLSGFLTPGGAADFGWTMYLPLADSIHSPGIGSDMWIVGVGATGVGTISSAINMITTILCMRAPGMTMFRMPIFCWNIFVASVLVLMIFPLLTAAALGVLYDRKLGGHLFDPGNGGAIMWQHLFWFFGHPEVYVLALPFFGIVSEIIPVFARKPMFGYIGLVFATLSIGSLSMAVWAHHMFVTGAILLPFFSFMTFLISVPTGVKFFNWLGTMWKGHVSWETPMTWTMGFLVTFLFGGLTGIMLASPPLDFHISDTYFVVAHFHYTLFGTVVFASFAGVYFWFPKMTGRMLDERLGKIHFWITFVGFHGTFLVQHWLGNEGMPRRYADYLDSDGFTTLNQISTIFSFLLGMSVLPFIWNVIKSWRYGEVVTVDDPWGYGNSLEWATSCPPPRHNFTSLPRIRSERPAFELHYPHMVERMRREAHVGHHAEPVTKKTS >LS483402.1|SQG55671.1|22284_23271_-|ribonucleoside-diphosphate-reductase-subunit-beta MESYDSYLESHKKPVSAINWNSIPDEKDLEVWDRLTGNFWLPEKVPVSNDLKSWGTLNDLEKTTTMRVFTGLTMLDTIQGTVGAVSMIPDAITPHEEAVYTNIAFMESVHAKSYSNIFMTLASTKEINEAFRWSEENENLQKKAKIVLSYYEGADPLKRKVASTLLESFLFYSGFYLPMYWSSHAKLTNTADIIRLIIRDEAVHGYYIGYKYQQAVRQQTPERQAELKEYTFDLLYDLYDNEIQYTEDLYDDLGWTEDVKRFLRYNANKALNNLGYEGLFPADECKVSPAILSALSPNADENHDFFSGSGSSYVIGKAENTTDDDWDF >LS483402.1|SQG55672.1|23711_24197_+|Ferritin MSINEKLAAALNNQITAELEASMVYLQLSYILDDLSLTGMRNWMQAQHKEELDHAAQFSKHLLDRDYRPQIGDIAPPKLDANSAIEAFEASLAHEQKVTAMIRELAEIADSVKDYDSRPLIDRFLEEQIEEEATVKEILDRLRIADTGSGILRIDAELAAR >LS483402.1|SQG55673.1|24263_26423_-|ribonucleotide-diphosphate-reductase-subunit-alpha MSQSLGKHVAEPVSRTEQLDYHALNALLNLYNADGKIQFDKDREAANQFFLQHVNQNTVFFHDLEEKIEYLVENNYYEPEIIQQYEFAFIKDLFKQAYAHKFRFKSFLGAYKYYTSYTLKTFDGRRYLERFEDRVCMVALTLADGDQDLARNLVDEIMTGRFQPATPTFLNSGKAQRGEPVSCFLLRIEDNMESIGRSINSALQLSKRGGGVALLLSNLREAGAPIKKIENQSSGVIPVMKLLEDSFSYANQLGARQGAGAVYLNAHHPDILNFLDTKRENADEKIRIKTLSLGVVIPDITFELAKRNDDMYLFSPYDVERVYGKAFADISVSEHYAEMVEDPRIRKSKINAREFFQTIAEIQFESGYPYIMFEDTVNKANPIEGRVNMSNLCSEILQVNTPSLFNDDLTYEEVGEDISCNLGSLNIAMTMDSPDFAKTIETAIRGLTAVSEQTAINSVPSIRKGNDAAHAIGLGQMNLHGYLGREHIYYGSEEGLDFTNAYFAAVLYQCLVASNKLARERGRTFAGFETSKYATGEYFDDFDPADFAPKTEKVAKIFADSSIYTPTVADWADLKDAVAAHGLYNRYLQAVPPTGSISYINHSTSSIHPIASKIEIRKEGKIGRVYYPAPHMDNENLDYFADAYEIGFEKVIDTYAVATKYVDQGLSLTLFFKDSATTRDINRAQIYAWRKGIKTLYYIRLRQVALMGTEVEGCVSCML >LS483402.1|SQG55675.1|26572_27010_-|ribonucleotide-reductase-stimulatory-protein MLVVYFSSATENTKRFVHKLGFPAKRIPLHKSSPELVVDEPYVLVCPTYGGGASISGGNTRPVPAQVIRFLNNEHNRGLLRAVIAGGNSNFGLDFGKAGDMIAAKCQVPYVYRFELLGTDEDVRLVRDGLLSNAAALGLLPEPVA >LS483402.1|SQG55676.1|27137_27371_-|glutaredoxin MSITVYTKPACVQCNATKKALDRAGLDYTLVDISIDDEARDYVMALGYLQAPVVEVNGEHWSGFRPERISSLVAQVA >LS483402.1|SQG55677.1|27847_27970_-|50S-ribosomal-protein-L36 MKVRKSLRSLKNKPGAQVVRRRGKVYVINKKDPRFKARQG >LS483402.1|SQG55678.1|28204_29035_+|NAD-synthetase MDTLRSTIKHRLRTQSIINPSEEIAKRVDFLAHYLAASGAKGFALGISGGQDSTLAGRLAQLAVEKLRKEGHPAEFWAIRLPYGVQADEADAQTALAFIQPDHSVTINIKPATDACAADVAQALGLKELGDFNKGNVKARQRMIAQYALAGEKGLLVIGTDHAAENVTGFFTKFGDGAADILPLAGLSKRQGAQLLQALNAPDSTWLKVPTADLEEDRPALPDEAALGVTYSEIDTYIEGTEAVSKEATARIEHLWKVSEHKRHLPVEPGDTWWRR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LS483402_2 | 585781-587275 | TypeI-E |
I-C,I-E,II-B
Consensus repeat of LS483402_2
|
24 spacers
spacers of LS483402_2
>2.1|585810|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GTTCTGGACAACTCTCTTCTTTGTCTTTATAG >2.2|585871|32|LS483402|PILER-CR,CRISPRCasFinder,CRT TCTCCGCATCACGAAGATTGTGATTAGCCTTC >2.3|585932|32|LS483402|PILER-CR,CRISPRCasFinder,CRT TTGAGTGCTGCGCGGAAGTTTCTGTCCACAAG >2.4|585993|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GCTTATCAGCCACACGCATACCAACAAGGGCT >2.5|586054|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GCTTATCAGCCACACGCATACCAACAAGGGCT >2.6|586115|33|LS483402|PILER-CR,CRISPRCasFinder,CRT CGTGGGCCAAACACAAGGCCTGATTGATAAAGG >2.7|586177|32|LS483402|PILER-CR,CRISPRCasFinder,CRT TGGAACCACCCAAATCTGCGGCCAAAACAATC >2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CGCCCCTACCGGCGCGACCCGCAAGGACGCCG >2.9|586299|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GGAAACCGCCCGTGGCGTTTAATGAGGAGCCC >2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GCTCCCTCCGGCGATCAAGATCAATGCGAACA >2.11|586421|33|LS483402|PILER-CR,CRISPRCasFinder,CRT CCCACCCACCGCGATTAAGCCACGTGGTGGGAT >2.12|586483|32|LS483402|PILER-CR,CRISPRCasFinder,CRT ATGAAGACGCCGTGGAGTACCCAGAAAACACG >2.13|586544|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CAGCTTCCCCTAAAGGAGAAAATTCTATGTAT >2.14|586605|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CGTCACCGACTCCGCATTATTCGCCGCAGCCT >2.15|586666|32|LS483402|PILER-CR,CRISPRCasFinder,CRT TAGGCCCCAACGCCTTGCGAAGCGCGCTAGGG >2.16|586727|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CCAATCATGGCACGTGACCAGCGCTTCTACGG >2.17|586788|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CAATTGATCCAATGTGTCCTCGATGCTCATTG >2.18|586849|32|LS483402|PILER-CR,CRISPRCasFinder,CRT ATCTTAGGGCGGGGTGCTCTAAAATAAAAAAT >2.19|586910|32|LS483402|PILER-CR,CRISPRCasFinder,CRT TTAAGATTCGATCACAATTTCTAACCACATGC >2.20|586971|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GGCTTTGGCAGGCAAAGCGCCGGTTTCGCATC >2.21|587032|32|LS483402|PILER-CR,CRISPRCasFinder,CRT TCGGGAAGCTCTTTCACCGTGGCGATGATGTT >2.22|587093|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GCCAAAAACCCGGTAAAACCACTCAAATCTGC >2.23|587154|32|LS483402|PILER-CR,CRISPRCasFinder,CRT GGTAGGGAATTCACCCTGAAAGAAGAAGATAG >2.24|587215|32|LS483402|PILER-CR,CRISPRCasFinder,CRT CGGGCAAAAACATGAGCTCCGAAAGCATATCT |
cas2,cas1,cas3,cas6e,cas8e,cse2gr11,cas7,cas5 |
CRISPR arrays and Neighbor proteins around LS483402_2
The CRISPR arrays of LS483402_2 >merge|LS483402|2|585781-587275|PILER-CR,CRISPRCasFinder,CRT GGAACTACCTCCGCATACGCGGAGAAAAGGTTCTGGACAACTCTCTTCTTTGTCTTTATAGGGAACTACCTCCGCATACGCGGAGAAAAGTCTCCGCATCACGAAGATTGTGATTAGCCTTCGGAACTACCTCCGCATACGCGGAGAAAAGTTGAGTGCTGCGCGGAAGTTTCTGTCCACAAGGGAACTACCTCCGCATACGCGGAGAAAAGGCTTATCAGCCACACGCATACCAACAAGGGCTGGAACTACCTCCGCATACGCGGAGAAAAGGCTTATCAGCCACACGCATACCAACAAGGGCTGGAACTACCTCCGCATACGCGGAGAAAAGCGTGGGCCAAACACAAGGCCTGATTGATAAAGGGGAACTACCTCCGCATACGCGGAGAAAAGTGGAACCACCCAAATCTGCGGCCAAAACAATCGGAACTACCTCCGCATACGCGGAGAAAAGCGCCCCTACCGGCGCGACCCGCAAGGACGCCGGGAACTACCTCCGCATACGCGGAGAAAAGGGAAACCGCCCGTGGCGTTTAATGAGGAGCCCGGAACTACCTCCGCATACGCGGAGAAAAGGCTCCCTCCGGCGATCAAGATCAATGCGAACAGGAACTACCTCCGCATACGCGGAGAAAAGCCCACCCACCGCGATTAAGCCACGTGGTGGGATGGAACTACCTCCGCATACGCGGAGAAAAGATGAAGACGCCGTGGAGTACCCAGAAAACACGGGAACTACCTCCGCATACGCGGAGAAAAGCAGCTTCCCCTAAAGGAGAAAATTCTATGTATGGAACTACCTCCGCATACGCGGAGAAAAGCGTCACCGACTCCGCATTATTCGCCGCAGCCTGGAACTACCTCCGCATACGCGGAGAAAAGTAGGCCCCAACGCCTTGCGAAGCGCGCTAGGGGGAACTACCTCCGCATACGCGGAGAAAAGCCAATCATGGCACGTGACCAGCGCTTCTACGGGGAACTACCTCCGCATACGCGGAGAAAAGCAATTGATCCAATGTGTCCTCGATGCTCATTGGGAACTACCTCCGCATACGCGGAGAAAAGATCTTAGGGCGGGGTGCTCTAAAATAAAAAATGGAACTACCTCCGCATACGCGGAGAAAAGTTAAGATTCGATCACAATTTCTAACCACATGCGGAACTACCTCCGCATACGCGGAGAAAAGGGCTTTGGCAGGCAAAGCGCCGGTTTCGCATCGGAACTACCTCCGCATACGCGGAGAAAAGTCGGGAAGCTCTTTCACCGTGGCGATGATGTTGGAACTACCTCCGCATACGCGGAGAAAAGGCCAAAAACCCGGTAAAACCACTCAAATCTGCGGAACTACCTCCGCATACGCGGAGAAAAGGGTAGGGAATTCACCCTGAAAGAAGAAGATAGGGAACTACCTCCGCATACGCGGAGAAAAGCGGGCAAAAACATGAGCTCCGAAAGCATATCTGGAACTACCTCCGCATACGCGGAGAAAAG >LS483402|2|2|585781-587275|PILER-CR GGAACTACCTCCGCATACGCGGAGAAAAG GTTCTGGACAACTCTCTTCTTTGTCTTTATAG GGAACTACCTCCGCATACGCGGAGAAAAG TCTCCGCATCACGAAGATTGTGATTAGCCTTC GGAACTACCTCCGCATACGCGGAGAAAAG TTGAGTGCTGCGCGGAAGTTTCTGTCCACAAG GGAACTACCTCCGCATACGCGGAGAAAAG GCTTATCAGCCACACGCATACCAACAAGGGCT GGAACTACCTCCGCATACGCGGAGAAAAG GCTTATCAGCCACACGCATACCAACAAGGGCT GGAACTACCTCCGCATACGCGGAGAAAAG CGTGGGCCAAACACAAGGCCTGATTGATAAAGG GGAACTACCTCCGCATACGCGGAGAAAAG TGGAACCACCCAAATCTGCGGCCAAAACAATC GGAACTACCTCCGCATACGCGGAGAAAAG CGCCCCTACCGGCGCGACCCGCAAGGACGCCG GGAACTACCTCCGCATACGCGGAGAAAAG GGAAACCGCCCGTGGCGTTTAATGAGGAGCCC GGAACTACCTCCGCATACGCGGAGAAAAG GCTCCCTCCGGCGATCAAGATCAATGCGAACA GGAACTACCTCCGCATACGCGGAGAAAAG CCCACCCACCGCGATTAAGCCACGTGGTGGGAT GGAACTACCTCCGCATACGCGGAGAAAAG ATGAAGACGCCGTGGAGTACCCAGAAAACACG GGAACTACCTCCGCATACGCGGAGAAAAG CAGCTTCCCCTAAAGGAGAAAATTCTATGTAT GGAACTACCTCCGCATACGCGGAGAAAAG CGTCACCGACTCCGCATTATTCGCCGCAGCCT GGAACTACCTCCGCATACGCGGAGAAAAG TAGGCCCCAACGCCTTGCGAAGCGCGCTAGGG GGAACTACCTCCGCATACGCGGAGAAAAG CCAATCATGGCACGTGACCAGCGCTTCTACGG GGAACTACCTCCGCATACGCGGAGAAAAG CAATTGATCCAATGTGTCCTCGATGCTCATTG GGAACTACCTCCGCATACGCGGAGAAAAG ATCTTAGGGCGGGGTGCTCTAAAATAAAAAAT GGAACTACCTCCGCATACGCGGAGAAAAG TTAAGATTCGATCACAATTTCTAACCACATGC GGAACTACCTCCGCATACGCGGAGAAAAG GGCTTTGGCAGGCAAAGCGCCGGTTTCGCATC GGAACTACCTCCGCATACGCGGAGAAAAG TCGGGAAGCTCTTTCACCGTGGCGATGATGTT GGAACTACCTCCGCATACGCGGAGAAAAG GCCAAAAACCCGGTAAAACCACTCAAATCTGC GGAACTACCTCCGCATACGCGGAGAAAAG GGTAGGGAATTCACCCTGAAAGAAGAAGATAG GGAACTACCTCCGCATACGCGGAGAAAAG CGGGCAAAAACATGAGCTCCGAAAGCATATCT GGAACTACCTCCGCATACGCGGAGAAAAG >LS483402|2|2|585781-587275|CRISPRCasFinder GGAACTACCTCCGCATACGCGGAGAAAAG GTTCTGGACAACTCTCTTCTTTGTCTTTATAG GGAACTACCTCCGCATACGCGGAGAAAAG TCTCCGCATCACGAAGATTGTGATTAGCCTTC GGAACTACCTCCGCATACGCGGAGAAAAG TTGAGTGCTGCGCGGAAGTTTCTGTCCACAAG GGAACTACCTCCGCATACGCGGAGAAAAG GCTTATCAGCCACACGCATACCAACAAGGGCT GGAACTACCTCCGCATACGCGGAGAAAAG GCTTATCAGCCACACGCATACCAACAAGGGCT GGAACTACCTCCGCATACGCGGAGAAAAG CGTGGGCCAAACACAAGGCCTGATTGATAAAGG GGAACTACCTCCGCATACGCGGAGAAAAG TGGAACCACCCAAATCTGCGGCCAAAACAATC GGAACTACCTCCGCATACGCGGAGAAAAG CGCCCCTACCGGCGCGACCCGCAAGGACGCCG GGAACTACCTCCGCATACGCGGAGAAAAG GGAAACCGCCCGTGGCGTTTAATGAGGAGCCC GGAACTACCTCCGCATACGCGGAGAAAAG GCTCCCTCCGGCGATCAAGATCAATGCGAACA GGAACTACCTCCGCATACGCGGAGAAAAG CCCACCCACCGCGATTAAGCCACGTGGTGGGAT GGAACTACCTCCGCATACGCGGAGAAAAG ATGAAGACGCCGTGGAGTACCCAGAAAACACG GGAACTACCTCCGCATACGCGGAGAAAAG CAGCTTCCCCTAAAGGAGAAAATTCTATGTAT GGAACTACCTCCGCATACGCGGAGAAAAG CGTCACCGACTCCGCATTATTCGCCGCAGCCT GGAACTACCTCCGCATACGCGGAGAAAAG TAGGCCCCAACGCCTTGCGAAGCGCGCTAGGG GGAACTACCTCCGCATACGCGGAGAAAAG CCAATCATGGCACGTGACCAGCGCTTCTACGG GGAACTACCTCCGCATACGCGGAGAAAAG CAATTGATCCAATGTGTCCTCGATGCTCATTG GGAACTACCTCCGCATACGCGGAGAAAAG ATCTTAGGGCGGGGTGCTCTAAAATAAAAAAT GGAACTACCTCCGCATACGCGGAGAAAAG TTAAGATTCGATCACAATTTCTAACCACATGC GGAACTACCTCCGCATACGCGGAGAAAAG GGCTTTGGCAGGCAAAGCGCCGGTTTCGCATC GGAACTACCTCCGCATACGCGGAGAAAAG TCGGGAAGCTCTTTCACCGTGGCGATGATGTT GGAACTACCTCCGCATACGCGGAGAAAAG GCCAAAAACCCGGTAAAACCACTCAAATCTGC GGAACTACCTCCGCATACGCGGAGAAAAG GGTAGGGAATTCACCCTGAAAGAAGAAGATAG GGAACTACCTCCGCATACGCGGAGAAAAG CGGGCAAAAACATGAGCTCCGAAAGCATATCT GGAACTACCTCCGCATACGCGGAGAAAAG >LS483402|2|2|585781-587275|CRT GGAACTACCTCCGCATACGCGGAGAAAAG GTTCTGGACAACTCTCTTCTTTGTCTTTATAG GGAACTACCTCCGCATACGCGGAGAAAAG TCTCCGCATCACGAAGATTGTGATTAGCCTTC GGAACTACCTCCGCATACGCGGAGAAAAG TTGAGTGCTGCGCGGAAGTTTCTGTCCACAAG GGAACTACCTCCGCATACGCGGAGAAAAG GCTTATCAGCCACACGCATACCAACAAGGGCT GGAACTACCTCCGCATACGCGGAGAAAAG GCTTATCAGCCACACGCATACCAACAAGGGCT GGAACTACCTCCGCATACGCGGAGAAAAG CGTGGGCCAAACACAAGGCCTGATTGATAAAGG GGAACTACCTCCGCATACGCGGAGAAAAG TGGAACCACCCAAATCTGCGGCCAAAACAATC GGAACTACCTCCGCATACGCGGAGAAAAG CGCCCCTACCGGCGCGACCCGCAAGGACGCCG GGAACTACCTCCGCATACGCGGAGAAAAG GGAAACCGCCCGTGGCGTTTAATGAGGAGCCC GGAACTACCTCCGCATACGCGGAGAAAAG GCTCCCTCCGGCGATCAAGATCAATGCGAACA GGAACTACCTCCGCATACGCGGAGAAAAG CCCACCCACCGCGATTAAGCCACGTGGTGGGAT GGAACTACCTCCGCATACGCGGAGAAAAG ATGAAGACGCCGTGGAGTACCCAGAAAACACG GGAACTACCTCCGCATACGCGGAGAAAAG CAGCTTCCCCTAAAGGAGAAAATTCTATGTAT GGAACTACCTCCGCATACGCGGAGAAAAG CGTCACCGACTCCGCATTATTCGCCGCAGCCT GGAACTACCTCCGCATACGCGGAGAAAAG TAGGCCCCAACGCCTTGCGAAGCGCGCTAGGG GGAACTACCTCCGCATACGCGGAGAAAAG CCAATCATGGCACGTGACCAGCGCTTCTACGG GGAACTACCTCCGCATACGCGGAGAAAAG CAATTGATCCAATGTGTCCTCGATGCTCATTG GGAACTACCTCCGCATACGCGGAGAAAAG ATCTTAGGGCGGGGTGCTCTAAAATAAAAAAT GGAACTACCTCCGCATACGCGGAGAAAAG TTAAGATTCGATCACAATTTCTAACCACATGC GGAACTACCTCCGCATACGCGGAGAAAAG GGCTTTGGCAGGCAAAGCGCCGGTTTCGCATC GGAACTACCTCCGCATACGCGGAGAAAAG TCGGGAAGCTCTTTCACCGTGGCGATGATGTT GGAACTACCTCCGCATACGCGGAGAAAAG GCCAAAAACCCGGTAAAACCACTCAAATCTGC GGAACTACCTCCGCATACGCGGAGAAAAG GGTAGGGAATTCACCCTGAAAGAAGAAGATAG GGAACTACCTCCGCATACGCGGAGAAAAG CGGGCAAAAACATGAGCTCCGAAAGCATATCT GGAACTACCTCCGCATACGCGGAGAAAAG
>LS483402.1|SQG56791.1|585485_585752_-|Uncharacterised-protein MQDYIDRFNTNAFIEPSTTKIYQSRSGAKARADLLESFGYKAIVQRSAPLVWPEGDNTTVETTKINEVFAAIKTLVNHGVVKSADELL >LS483402.1|SQG56789.1|584799_584910_-|Uncharacterised-protein MVEFLTIIGSFSIAFLIPTLLVIIIVLFVLSLKKKK >LS483402.1|SQG56787.1|584530_584662_+|Uncharacterised-protein MVAPARNYLRVRGEKNHYQSEQRPLRSPKYIGIKPFVAFLILS >LS483402.1|SQG56785.1|584162_584390_-|Uncharacterised-protein MDMMNPTERISEDQLRQIRNDLASAEIEGQPRTPEDEKILAAYYLGEISEQESFESLLKAAGVHPPYPSINHDLD >LS483402.1|SQG56783.1|582859_583795_-|Uncharacterised-protein MTTSGSVTGRIKKAKAHGYDEFTGVYVDVPLEVSLKSAQERYIRGAQEYIDGVGYGGRYVPESAIMRAQVPGSDKTRNRQVFEGLTESKVFTATEIWDNNRQDENGNRQPAALIERTTINGKEHHRPVRSGGSDDGRRETSTRQRTEGSGDKLRRDSGDRGRSLPGRPTDRHSGSGNAREDHDGTSKRNVQPRSNNIQLNQTPLAAQTSPVVPTTGHYKGKTVHNIDNTDDILKGHDLPEGMKAVIGEYGEIALKFPNKESEREYRKKHGKNFLNVTTGAQAALHDAVKNNDAIPASFFTYTEVTVPVGWV >LS483402.1|SQG56781.1|582301_582772_-|Uncharacterised-protein MIFSFLPKENVVFSSSYRQRKRLEHLKFLDEALEREWEKDDEESIKKQIKEIEGKFSGKHWVRQPSWMEVAIAGAMITSLTLIVSGAIFFVSHILSEYILRENGWEQGGVLALELRILIIGILLFLIIVLSAPILLDWPGSKAPDTRKKKKKWGKR >LS483402.1|SQG56779.1|581897_582209_-|Uncharacterised-protein MVIDNIYNRIERATDGKVFYGVGVSPENQDNISIDAGYVEALLEYGYLDISLKQEFLELWLTDDEYEDLEALDYVQDLTYKMLLKYADMTPRVDVSRFLKEEG >LS483402.1|SQG56777.1|581366_581753_-|Uncharacterised-protein MTNTFTALTERSEGWWSVQLKEDPGLLTQTRRLDQIADMVRDALELFPELTDDPYKDIVNIEFREGESIADIANQAVQANQKAKQAQEEASQLMRQAAAELSKKGLSYRDIGTLLGVSFQRAQKLATT >LS483402.1|SQG56775.1|579368_581291_+|Uncharacterised-protein MENMSFPARSQRAGKLLCVVACAIALVSSLMMFATGQPLARAESVCSGGDWSELAWKDEHSNPSLRDNTYHGPGSHAEVQFKWKAKADAKQGDKITFTLPPQLQGVDTGSILLQDSKNDLVARGSWDSGRKSFVITLEQFANTHFNVQGTAFVSVKWNRDGIDGDPKKFEGSLNFNGCGSGSLNGKYEEGSEGDSHETSKIGEYRGYDSENKVHKVQWTVGLSGKTGNGQRDLVTDNAPAGWNFACDGKYNDGYAPVYVSSFIKGDPSGERRHQIFNAQNQDTGGIREGLSGVKNLENFVQGYSYRLRCSSDRVEVELPYGISPQSSPLISLLTISTEKPALGSTIYNTAEVNGRKISGSVTFPSAGGQGRGSKGGFTIEKIVSGEHTSKQFSFEWSCTSQSKETKSGTIKLANGDVHHEKQLDKGASCVIKEEDADAASEKKHSLKWSVDGEDKEGESVAISIRQPEEQAVQVVATNIYYQEEPEIPPVPPTTTSSSSPSTTTSTETTTKTTTTTTTATKTTEPSATTTTSPPSSPRTTEPTRAPRNPLLPIPIPIPIPLPPAPPVTTTVTPHAPAPVPPPATPSIVSHKPADAAPQPPAKRLLARTGASVAGLVIPALFLMIGGVGLLMIIRRKRNSE >LS483402.1|SQG56774.1|578230_578446_-|Uncharacterised-protein MDKQTDTNTIRHIQALAGLRKRTQGIKIVLRNRTNDSTALLLSSELQQSSAGYPDSYDLAHSNRGHYKPCD >LS483402.1|SQG56803.1|587293_587665_-|CRISPR-associated-protein-Cas2 MFAVIQGHNLPNHLNGYLSRFLSEVDAGLYVGVLSRAVMENLWEKCQSVDLAGSLTLIHPQYDAEQGFRIRTTGKQRRPVVDLDGLFLSARGLIEDVRFADPLDEADAIIPDEVLEDFCPESE >LS483402.1|SQG56805.1|587665_588619_-|CRISPR-associated-Cas1-family-protein MSYSNEALAFSTIPASEQIRLEDRVSFLYLEYCLIRQDRTGVIAVSRGDEKAPAELKDLPIKARIQLPVGGLAVLMLGPGTSISQPAATSCARAGVSVLFTGGGGVQAYSLSTPLTSSARWAIAQARLASNEAKQRTAARILYKRQLGIEEIEADSIAVMRGIEGRTIRNLYKRLSAQHKIKNFKRNTNATDPVNTNLNLGNSILYGCAASACAALGINPALGIIHRGDIRSLLFDLADLYKPTLTIPAAFKCANNDDDGSEFRRLVRSEIVNQDLLKNMIHIMMEILTPHLPERTDDRLIGGRNHEVPGHTQYGGK >LS483402.1|SQG56807.1|588615_591309_-|CRISPR-associated-helicase-Cas3-family-protein MTNAGTNHHVLWAKFDNVSEPYPLLAHLLDTATAATCLFNHWLRKGLRDRLSTELGPDAEKILGFVAGIHDLGKANPYFQAQRRNKKEEWITLRDAIQKAGFPLSNGTSALFEETKEKRRHENITLSILGWEITKFLQVKDVWPQLAIIGHHGNFSAPGFLSDEDDLEDIEDIFDDNGWSPTHELLVSSLLQAVGLEKQPEIKHISPASAILISGLVVLADRIASQSEMASDGLQALQKEELFFHQPEKWIANRKTFCREIIENTVGTYHPWESEAAGIRAVLGDYEPRFTQKAALNADDGLFNVMETTGAGKTEAALLRHVKRKERLLFFLPTQATTNAIMERIGKIFDGTPNVASLAHGLAVTEDFYAHPIVPVQGSSDDANYKDNGGLYPTEFVRSAGTPRLLAPVCVGTIDQALMGALPSKFNHLRLLALANAHVVVDEVHTMDQYQSELMSGLLEWWSATDTPVTLLTATMPAWQREKFHLSYTGKEPHFKGVFPSLEDWSTPSKNTETSQENIPTEAFTIPINIDKIAHNEIVDSHVQWVIEQRKLFPQARIGIICNTVGRAQSIAEALAHESPIVLHSRMTAGHRKEAATKLEQAIGKKGTATATLVIGTQAIEASLDIDLDLLRTELCPAPSLIQRAGRLWRRLDPQREVRVPGMVGKKLTIAVVDSPSTGQTLPYLRSQLYRVESWLKQRDRIEFPADIQDFIDATTPGLQELFQKVSLPEDCGSAEERETLADDYLNEVASWVTKQRQAGTSRIDFAKHGKPRQVLASDCVVEDFLQITSAKDLEERATRLIDYPTISAILCDPTGTVPGAWTDSVEKLIAIPAKDRESLRRALRASISIPRSDKFVPITSREIPLSEAKTLLSGYSAVHIQPDEYDLQSGLKGPQK >LS483402.1|SQG56809.1|591329_592040_-|CRISPR-associated-protein-Cas6/Cse3/CasE,-subtype-I-E/ECOLI MTNAIYWTHFPAHIALNKSLVLGNSATKDSKNKPRWDVDDPIFRHRAVMALFPEHQSDNARADSNILFRLEALPGQPPYFFVQSSIEPSNRNLDNHIKTRQVDLVSPEAGTPIEFRLSINAVRRKTIDATENTKRKIKTTCLSLKALDSDPTETAAGQWVKEKLSPALENIDIVRHGRQVLGANRNGEKTSNRTVQVDTIDGFAQVKDPEELQKMLIHGIGRAKSYGCGMLTFRPI >LS483402.1|SQG56810.1|592253_593867_+|Uncharacterised-protein MKSNVFKDFPFVKTNKGPMTVEEFFHSSHEESLHLDLSIPGYEYGAIWRLLASLTAVIVQRDPSLLERGESGSELRLDPEFISQILDDLGSKISLTEGKNLFFQRPLLEGENPKDTARYVGPGKDPAWKLSPTAPSEKSQIYWNLEKLKPESLEAVDAIVALMVFSMYSFTGNSKYDGAKCLNGSPGIRFLGGGNTATEFIIEAKTPLLSLLKSIPLEWCEPRGLPAWLDRTGAESRKPNGEMHPLWRATWSSNTAACCWDGETLIGVGIGGIPPEWYAIEMGSKPEARKEWWDQRNTEDPFYFYQPDKGGALKAKRLDLSRDLTELAVEWVAEDLSTALAERVRGRALRVDFKKEDSLLFIRHQIGGNASSAMIRESVVSQARKQQWIFDPTGALQKQVRGKADFILSLRNIVLSPFRRENKSDRDRGRRVLDNLASERPKMNETFWREIAPIYEEFILYFTEQVSGDETRKEVRAQAKKTLKELEKDAVRTAQKAFDMVLEPYLLQNPSQAYEVRRRIHSYLASKIAEANEGDQK >LS483402.1|SQG56812.1|593902_594457_+|CRISPR-type-I-E/ECOLI-associated-protein-CasB/Cse2 MAEFAQRKKDKEFRAKRSALRAGSGIYTEFRAYSYVLPFLGEKASEAQRTALLRCMAALAEYPDIVSSGEKATASSVGQWVNRVAFDGKQGQSEPDSMVASRIKYLHTQDLEEAISSLRRIMAFADRKNMAIKLNPYQFVELFWYWGNGFTDASTKHRLSVLRDFYSTKQKENTDPQSSSEGEK >LS483402.1|SQG56814.1|594456_595524_+|CRISPR-associated-Cse4-family-protein MSRHLTIHVVASVPYSNLNRDDSGTPKNVRRGGVTCALLSSQSIKKGIRTKYEDASLDTSVRSGRIADDVLERAKVLAPEADTKALEKAVKKIIGTLTKVAEANESEGESDRSIWLSAEELEAAAVSVVEQAEKKDFIEDGRTGSLAIAAFGRMFAAAPQKGTEAALSVSPAVTTHGVTIATDYFSTVDDIRERNRDTGATYLGVSQYTTGVFYRTVTIDKEQLRESWTGLDREDAKENLAALVNAIIYGLPRGKQHSTAPFVQPALILAEEQSYRCAYDFESPVQADTREGGYLKPTLEELKRQYDSARAFDADNFGETQVVSGTYPEVSEFFAGAKYADKNGFIDEVVAWIQR >LS483402.1|SQG56816.1|595535_596228_+|CRISPR-associated-Cas5e-family-protein MTTTSVYLRLAGPLQSWAGPAVTGNFVRTEPIPTHSALVGLIAGACGYRRDEWPDWLNRLNFRVRVDHPGKFVDDFQTVSAHEEEMLFRERLIYATGKRPSAKTTRLTPDGRGMTSIIQRTYLAEAEFIVEVASDTHGELLRDALRAPKFSTYLGRKAFAPAFPFYLGATTDVDVLHRIPACDLSGAKRDTARVQIHHCSAELHTSAEHINVPAVQERSDWLEKTKELFV >LS483402.1|SQG56818.1|596304_597216_-|Uncharacterised-protein MRSGTLHQRQMKAFISWILGGKPRDKSDVSRTQRRLMSWCWQVPVPQPSMEDKACILVMNEGTFFQSWCLVVAYNSTHVLGWQWVRKDDKTTCAQVYTYFPRPCAAVISGDESTAQAIQSLWPGIPMRRCLSSIKESVDQKISHTPHTQPAIEIKNLTDSLSYVYTSDQAQLWLDRYNTWETTWKELLKHRTDSPNTNHNKNSCSWQWTYKELRSIRLMYRTLIKKEELFFQLTESAIPLSDAALPCQTSLLGDDLSADIKKLFHTHRGINHEHARRMVEWYLNSKTESPFIPERIIKHDHWE >LS483402.1|SQG56820.1|597753_598677_+|Phospholipase-D-precursor MKKKVVLFLSIIMGILLPVGNAVATPVSHDAASTGNRPVYAIAHRVLTTQGVDDAVAIGANALEIDFTAWRGGWWADHDGIPTSAGATAEAIFKHIAEKRKQGANITFTWLDIKNPDYCTDPDSVCSINALRDLARKYLEPAGVRVLYGFYKTVGGPGWKTITSDLRDKEAIALSGPTHDVLNDFAKAGDKILTKQKIADYGYYDINQGFGNCYGDGNKTCDQLRKSSEARDQGQLGKTFGWTITTGQDDRVNDLLGKAHVDGMIFGFKVTHFYRHADTENSFKAIKTWVDKHSDTHHLATAADNPW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LS483402_3 | 662021-662860 | Unclear |
NA
Consensus repeat of LS483402_3
|
11 spacers
spacers of LS483402_3
>3.1|662057|37|LS483402|PILER-CR,CRISPRCasFinder,CRT CAAAGAACTGAACCGCCACATTCGCGAAGCAGCCGGA >3.2|662130|35|LS483402|PILER-CR,CRISPRCasFinder,CRT ACTTGGCGGAAGAGTTAGGCATCTCAGAGAGTTCG >3.3|662201|38|LS483402|PILER-CR,CRISPRCasFinder,CRT GCCCTCGCCCGATCTGTCGCGCCGGGCATCACATGGGC >3.4|662275|37|LS483402|PILER-CR,CRISPRCasFinder,CRT ATTACCATTGCCCCACAATTCCGCATGCAAAAGAGCC >3.5|662348|35|LS483402|PILER-CR,CRISPRCasFinder,CRT GGTGTGAAAGCTGCTGGTGCTTGTGGGGTTTCTGG >3.6|662419|36|LS483402|PILER-CR,CRISPRCasFinder,CRT CAGCACCTGCACCAACAAACTGTGGCGATGCACAGA >3.7|662491|40|LS483402|PILER-CR,CRISPRCasFinder,CRT TTCTGTCATTAAGGACATCATTTTGGGGCAGTGGTGGCTT >3.8|662567|40|LS483402|PILER-CR,CRISPRCasFinder,CRT CGCCTCAATCTGAGCAGCCGCTGCCAGCTGGAGCGTGCCC >3.9|662643|39|LS483402|PILER-CR,CRISPRCasFinder,CRT TTCACGTGGCGCTGTGTATGGCTCTAAAGCCGGTGCAAT >3.10|662718|35|LS483402|PILER-CR,CRISPRCasFinder,CRT CCTTACCGCACGGATAATGAAAACCGCTATAACCG >3.11|662789|36|LS483402|PILER-CR,CRISPRCasFinder,CRT GGTCTGGGTATCATCGCCACTACAGCCAGTAATAAA |
cas1,csb3,cas3,csb2gr5,csb1gr7 |
CRISPR arrays and Neighbor proteins around LS483402_3
The CRISPR arrays of LS483402_3 >merge|LS483402|3|662021-662860|PILER-CR,CRISPRCasFinder,CRT ACCTGAATGAAAGGCTGCGACCGAAGCCGCAGCGACCAAAGAACTGAACCGCCACATTCGCGAAGCAGCCGGAACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGACACTTGGCGGAAGAGTTAGGCATCTCAGAGAGTTCGACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGACGCCCTCGCCCGATCTGTCGCGCCGGGCATCACATGGGCACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGACATTACCATTGCCCCACAATTCCGCATGCAAAAGAGCCACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGACGGTGTGAAAGCTGCTGGTGCTTGTGGGGTTTCTGGACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGACCAGCACCTGCACCAACAAACTGTGGCGATGCACAGAACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGACTTCTGTCATTAAGGACATCATTTTGGGGCAGTGGTGGCTTACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGACCGCCTCAATCTGAGCAGCCGCTGCCAGCTGGAGCGTGCCCACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGACTTCACGTGGCGCTGTGTATGGCTCTAAAGCCGGTGCAATACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGACCCTTACCGCACGGATAATGAAAACCGCTATAACCGACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGACGGTCTGGGTATCATCGCCACTACAGCCAGTAATAAAACCTCAATGAAAGGCTGCGACTGAAGCCGCAGCGAC >LS483402|3|3|662021-662860|PILER-CR ACCTGAATGAAAGGCTGCGACCGAAGCCGCAGCGAC CAAAGAACTGAACCGCCACATTCGCGAAGCAGCCGGA ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC ACTTGGCGGAAGAGTTAGGCATCTCAGAGAGTTCG ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC GCCCTCGCCCGATCTGTCGCGCCGGGCATCACATGGGC ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC ATTACCATTGCCCCACAATTCCGCATGCAAAAGAGCC ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC GGTGTGAAAGCTGCTGGTGCTTGTGGGGTTTCTGG ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC CAGCACCTGCACCAACAAACTGTGGCGATGCACAGA ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC TTCTGTCATTAAGGACATCATTTTGGGGCAGTGGTGGCTT ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC CGCCTCAATCTGAGCAGCCGCTGCCAGCTGGAGCGTGCCC ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC TTCACGTGGCGCTGTGTATGGCTCTAAAGCCGGTGCAAT ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC CCTTACCGCACGGATAATGAAAACCGCTATAACCG ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC GGTCTGGGTATCATCGCCACTACAGCCAGTAATAAA ACCTCAATGAAAGGCTGCGACTGAAGCCGCAGCGAC >LS483402|3|3|662021-662860|CRISPRCasFinder ACCTGAATGAAAGGCTGCGACCGAAGCCGCAGCGAC CAAAGAACTGAACCGCCACATTCGCGAAGCAGCCGGA ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC ACTTGGCGGAAGAGTTAGGCATCTCAGAGAGTTCG ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC GCCCTCGCCCGATCTGTCGCGCCGGGCATCACATGGGC ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC ATTACCATTGCCCCACAATTCCGCATGCAAAAGAGCC ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC GGTGTGAAAGCTGCTGGTGCTTGTGGGGTTTCTGG ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC CAGCACCTGCACCAACAAACTGTGGCGATGCACAGA ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC TTCTGTCATTAAGGACATCATTTTGGGGCAGTGGTGGCTT ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC CGCCTCAATCTGAGCAGCCGCTGCCAGCTGGAGCGTGCCC ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC TTCACGTGGCGCTGTGTATGGCTCTAAAGCCGGTGCAAT ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC CCTTACCGCACGGATAATGAAAACCGCTATAACCG ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC GGTCTGGGTATCATCGCCACTACAGCCAGTAATAAA ACCTCAATGAAAGGCTGCGACTGAAGCCGCAGCGAC >LS483402|3|3|662021-662860|CRT ACCTGAATGAAAGGCTGCGACCGAAGCCGCAGCGAC CAAAGAACTGAACCGCCACATTCGCGAAGCAGCCGGA ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC ACTTGGCGGAAGAGTTAGGCATCTCAGAGAGTTCG ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC GCCCTCGCCCGATCTGTCGCGCCGGGCATCACATGGGC ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC ATTACCATTGCCCCACAATTCCGCATGCAAAAGAGCC ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC GGTGTGAAAGCTGCTGGTGCTTGTGGGGTTTCTGG ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC CAGCACCTGCACCAACAAACTGTGGCGATGCACAGA ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC TTCTGTCATTAAGGACATCATTTTGGGGCAGTGGTGGCTT ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC CGCCTCAATCTGAGCAGCCGCTGCCAGCTGGAGCGTGCCC ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC TTCACGTGGCGCTGTGTATGGCTCTAAAGCCGGTGCAAT ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC CCTTACCGCACGGATAATGAAAACCGCTATAACCG ACCTCAATGAAAGGCTGCGACCGAAGCCGCAGCGAC GGTCTGGGTATCATCGCCACTACAGCCAGTAATAAA ACCTCAATGAAAGGCTGCGACTGAAGCCGCAGCGAC
>LS483402.1|SQG56938.1|661305_661710_+|Uncharacterised-protein MRKSLNRPSDSRENTDKKPLLFSKWYGIPFLEIALLVVTFIVDIFSSHYVGHIYAIIFISFVGLPILSLIPLRFFSVDLLFISLGVLGILILKFSGDTASVGLGWVSLFLLGLGISGVCCLTRRILANILTKTT >LS483402.1|SQG56936.1|660314_660722_-|DNA-binding-domain-containing-protein MTIQTEASLNLTTEDAKILTAFRHQLAGENPKITSKAGQEAPDQLIDIIRKVLDAVAYGRAISISQLPARITTTTAASMLGVSRPTVMKYIKQGKLSNTMVGSHHRLDTQEVLKLLNDRKEEMRRSVFEVLDLET >LS483402.1|SQG56934.1|659675_660287_-|Uncharacterised-protein MTTAAPNEYTPIPNSVLPDANIWFSTTLHAWIGLLAAETLGSWSFYWTEDILAEAIYHKRREYPKTSSHQIEAIRDRLMTVMAENRITGFAIDESVSYSDTFDAHVHSAAIHGRIQIIVTQDYKDFAGLYSNSDDCPYEVFTPDEFLLLVAESAPEAIDAVIEQQFSYYMQKSHSFNLVQKLISAGCPHFAEYVRTRLQALSP >LS483402.1|SQG56932.1|658152_659661_+|putative-methyl-viologen-resistance-protein MANTANRWAVLTILMIGVSLIVLDSTIVSVSLPTIITSLNLNLTDAQWVSSLYSVVFAALLLFTGTLGDKFGRVLIFRIGLIVFALASLLAAVAGSAALLILARGLQGVGGAMILPSTLATINTVFRDQERAQAFGIWGATMASMAAIGPLLGGWLTQSLSWHWIFLINIPIVIALLIAGHFVFGPDHKGAIVGFDVPGTLLSALALGLTVFGLIEGTSLGWWNSPVPFAIACGLIAAVLFIVLERARQRAGKPVLLDVRLFRIGSFSNGNITTLTVALGEFSALFVLPLYLISVLRLGTIHAGWVLATLALGSILSGAAARHLAATFGPAITVIIGLVLEVVGIAGAGLLIGPATSAPLIALVLAIYGAGVGLASAQLASVVLADVPVESSGMGSATQSTSRQLGSALGVAIAGTVLAIDVVRRVTEGLTNLGMTGPQAEQLAHATADSAGAAIPALAEKMGPDVGSVLSTAFADATSSVLYVSAGVLLLGLISAVRLAKR >LS483402.1|SQG56930.1|657481_658114_-|transcriptional-repressor-BetI MVSRYESIKTHSETLRASALMENMGSREENREKTQAAILDAAEALLREGGVEALTAGAVAQRVGLARNSLYRYVGSMDELRGRVILRHFPDYVDAINQAIHEATSPTEALCAYIEANLQIVAVENHGWLMELAQGVGGEAQANIAHIHRQLIKSLSNLLAPFNLDNPTLAAALIQGLLSTGFSALERGHDVQEVTALCAQGALGIVGKPR >LS483402.1|SQG56927.1|656554_657310_-|VIT-family MTFPDYTAAQPETQTQPETHKESSNRLNSRLNWLRAGVLGANDGIVSVSALILGVIATGVGHGAILAAGIAATVAGAISMALGEFVSVSAQRDSERMVMERERLELLHTPEEERHEIAKILSDYGMSEETALRAATEIGHNDPFPAHLRIEYGIDAQDLTSPWHAALSSAAAFTLGAILPLLMVVIAPQGNSTVGIIAVSSITIIALAVTGYLSAAIAGTSRMRSVLRLVIGGTLGLALTYVAGALFGGIV >LS483402.1|SQG56926.1|655235_656321_-|Inositol-3-phosphate-synthase MSNNRTIRVAIAGVGNCASSLIQGVEYYKDADPATNVPGLMHVQFGDYHVGDIEFVAAFDVDKEKVGLDLSQAINASENCTIKICDVPEQGVTIQRGPTLDGLGKYYRQTITESDTEPVDVVGTLKDVRADVLVSYLPVGSEEADKFYAQCAIDANVAFVNALPVFIASDPEWAEKFEKAGVPIVGDDIKSQVGATITHRVLAKLFEDRGVHLDRTMQLNVGGNMDFKNMLERERLESKKISKTQAVTSNLDQHIEAHDVHIGPSDYVGWLDDRKWAYVRLEGTAFGDVPLNLEYKLEVWDSPNSAGIIIDAIRAAKIAKDRRLGGPVFAASSYLMKSPPKQLRDEHARAELESFIAGDPS >LS483402.1|SQG56924.1|653799_655179_+|glycosyl-hydrolase-family-protein MLMAGMPRSADAGGVIGEVKRPKIFGALTGPFTCSKGWKYVFCAAIFQLGLKFFSVLRCEKAIIVTNVTYVTSGIIGRVAIMMKTRRLKTALCTVLASSTFAVASVQAIPLPFPIGASSGIDVSGHQHPNGSSINWQDVKSHGQSFAFVKATEGLGWTNDFYASDITQAAAQGLKVGSYHYARPGADARQQARHYAKVISHTPNHSLPPVLDLEVAEGKTPQELVNWTRDFVQELEKQTGRVPMIYTYRYFWIEQMANTTEFSQYPLWLAAYQAQVPGTVGGWDQIDFWQRSSSGRINGIVGDVDMNLFNGDDGELAAFAAGNLHAAGNKFASINLPELADLGKSAGGVVAVILALSAGAAAAPQLIQAAEAAGLSSEGAQDLTAVVQALAKAGKLPVDQLNKMASGNYTVGDLVILLDNAAHLAGIDAGQSSQAVMRADGLNIDANQVARVIRGLAAR >LS483402.1|SQG56921.1|650471_653765_+|periplasmic-alpha-amylase MKTYRCHARRIFTALTTLTLITAGGALSRPPAQAADPSSVVIAADFQTKAGCTKDWDPACSQTQMEKQGKFYSKKIKVPKGDWNFKVVLDKNWDTSYGAPGKGYERDNVPLKLAADAELEFIFDPESHHIGLRPTQITTGDHEVKPEDRELIKAPYRQNAAQNNFYFVLTDRFNNGDPKNDRGDASAEQGDRAQHGFDPTSKAFYHGGDIKGIIQKLDYIQGLGTTAIWLTPSFKNKAVQGTGNDASAGYHGYWITDFTQIDPHLGTNQDMKDLIKAAHEKGMKVYFDIVTNHTADLIQLAGGNGSNGSTYVSQQEQPYKDVNGKEFRLEDYAGKGASEFPKLNKESFPYTPQRTNPAEKMTPDWLNDVTLYHNRGNSMFDDGGESVIMGDFFGLDDLMTEHPTVVDGMTKIYNEWVDYGLDGFRIDTVKHVDLAFWKQWTERVHQHAVEKGMGDFFMFGEAYNFSPEALSPFVRETHMDAVLDFAFQNNAVDFAKGGDTNKLKSLFYGDDWYTTTRSDAAVLPTFLGNHDMGRIGSLLQKSGDGTERLRRDQLAHALLYLTRGQPVVYYGDEQGFAGSGSDKDARQDMFATKVTDVHNEQLVNGDQFGTGDHFNSEAPVAKTITELAKLRKENKALVEGAQIERYATQGAGIYAFSRVNREEKQEYLVALNNATTTRDVDLKALTPNAEFERVYISSIYGNEAPTSLTTDNEAKTHVTVPGLTAVVYKVKNGKQVTGSVSGGLNIVGQELKGDAPIMTTVGGNAWSETNFGWRKLGEKEWNYLGTDTGQDARIFHNVRDLEPGTVVEYRTVTVDGDNKETASHGWGVVGVDLAVDSRALSVSATTASATVPAAVVAGNFTKDLGCTGGQEGNWDPACAAAELRDDSSGWKTAELTLKPGEYEYKIATGGSWAQNYGALSEGTRESDEGVLNGKNVKFQVTQDKQKVTFFYHPETHEFFNTAEHRVITLPGTMGGALECPANVEKSDAYGNWGPACLATMLTRTGAHTYGTRLPKVPTPGDYQVKVAYDRDWQESYGPDGRGDSNYLVTVAESGKVLSYKWDEQTKKLTWTTSDQGASLVEDAAMPTELEESVALEN >LS483402.1|SQG56919.1|649968_650190_+|Uncharacterised-protein MSNFYKSQTIVRSMVAVIVGCWAYCLVVAPLLSERSYGEVMAEKSKDIGLGFTLAALLIGVMWLFVARRKSAE >LS483402.1|SQG56940.1|662882_662996_-|CRISPR-associated-protein-Cas2 MRRDDVRRTIIAYDIAHDRRRNKLAKILQKYGDRCLC >LS483402.1|SQG56942.1|662992_664594_-|CRISPR-associated-Cas1-family-protein MQALIDPVPISLVVHTEYCERRTWLELNGEQTDTYQMQAGKSSHVHVDNLKTSTPHRQVSVKVWSDELGILGICDSLETLPDGTIRVVEFKATPVRKAPIVTDANRLQLALQGICLREMGYKKLEYAVYFTDHRKTIEVELSSADFEHAKNQALRTHEIAQASTSPIPLDEDPQCTWCSHLSVCLPDELFQRQPQRRVLAQNPDSQVLHLTEQGSRASKKNGRIEVHRKSELLGSVPIERVQAVVLHGNIDLSSALIRELAWQHRPVVWCSSTGRLYGWMLPGDGPNGLARVRQHVLAETGFIPIASEIISSKIYNQATMLRRHGSAKEAVASLRRLQETARTVNDIPSLFGVEGEAASKYFESFGSMLNDAALHGLGAQWLGRKGRGAQDHINVLLNYAYGMLTAECVRALIACGLDPHAGFLHSSNRNKPAAALDLMEEFRPVVADSVVLTLINRREISSRDFFIRDKGQALTTDGRKKIVKAFERRIQTQFKHPTFGYSVSWRRAIEVQARMMLGVLDGTQLRYKGVKIR >LS483402.1|SQG56944.1|664598_665630_-|Uncharacterised-protein MFTIDIAGDASSALSHFALLGLAAVAEEMGDNSVRLLWSLDSEPKAQLRSIYDPLLIAQRIRELATRWSEDSSWVKARKNYAGKQFAPFSPRIKAIDAEKSPHDWEEHHHIRTSHVDQLLADKRWLDLSFISALGEPSYWHNEKKAPRPDHGASRWEMKTRNRGEEFVQHRLSLMVDELSSWTNEDILAGIQGKQVHDPLGKNSPDSRTSTGLTPPGPTDVALAFVGLLGIASFQLAPQVKEKSVTPGAFPPQALHPVLMVLPMSSTPISLGRARSVLRSEAIACIGGELVRTGDIGTTAVVSASKWLLEHGINAVALFDIKKAGSSSAPERQVQPGSVLPLG >LS483402.1|SQG56946.1|665634_668316_-|CRISPR-associated-protein-Cas3 MPSITFDAFFAELNDGHRPFAWQQRLVDAVIKTGTWPAQIVAPTGTGKSSVVDIHVYLNALYALGECPRVPRRLSVVVNRRALVDSHIDRAETILRTMQEAKAGSVLATLSQALTSLRSDAHQDPFIVSRLRGALTNKTLPVNSLEACAIIAATPDMWGSRALFRGYGSGRLARPRETALFTMDAVVLLDESHLNRQLLTTARRIAALQELEVDLHVPRLQVVAATATSTETLGLAQSIGVFEEDCERDPVIAQRIDSSKHLSLLKLKKWNGRPKNSEIIATAVEEVLRLCADADSTVGCIVNHVETATKIHRILKKKGLRSEILVGRMRPHDVAQMKARRPGLFTIQGSQEVDVLVATQTMEVGVDVDFAHLVTELAPGSSLTQRFGRVNRLGHRVRSEVSVLVPSTADAIKTDVPPYTRKDLLNSLAWLEQLAEAGTVNPRRLLELPAPEESPGRVLLQRLEWADLHNLCRTTDPLFAEPDLDLWLRDSLEKDPALGGVVVRSPLPEDFNAAVELLNATKPQDFETFPANIAVLNRLKDVLAPTDESFKQKASAVRHRAFLYRDSEVVLLDHDKPLRPTDILIIEPGTPFTTEGVASATPEDSELIDPAPLPGIDVHVFDSAMDKAEAESFKQIAAELSEDPEETTTESQRSKGFQCSTMVLETDHHHGFDAVVPWYITETDDAIRAEEEALQEWSPTSKTVTLQQHQADVAEQADNLCTSVGLRHDLHQIVVQAAAHHDDGKIEPRFQTWLRGGKTSDDQEPWAKSAQRNRQEIRRAKNISGVPPKWRHEQLSALKVAVQLGYDTPETELILRIVGCSHGHGRSTFPYSSWEMISPLATDQEQAVARHLFSEGEWDSIIERTNRTIGPYAMAYLEALQRAADAHVSSEGR >LS483402.1|SQG56948.1|668308_669886_-|CRISPR-associated-protein-GSU0054/csb2,-Dpsyc-system MPKYCLTARFPLGVYLGHTGDANRDAYPDPARLHAALMNAAAQGVHAEEDPNEHQLRPSQQSLQALQWLEAHPPTGLAMPEQQWLSPDTSRMMYRNVGSVKIDKTGVTRATENRAVSDGVSVNGAYGYIWDEMPVDIAEAITALLPDVACLGEASSLVVLEQQEIEATLTLDPQATAFSTERVQVRIALPGRTQHLREVFHARYGKKPPSKKADKFLKDDPIHDPPIPKDHLGTARYLRVNAADKECVTPWTKVILLEVHGKQLGAKEQVRAAVALHRALIARIRTDVSPVITGRYAAGAQRPANNLAIQYIPHRHLEALGLKTSAFALLVPQDADSTVYEQLNQALTGPFPLRSGGKLLCQLKYNGHVFRGDAFWPAPQPGTIRMWEPLNVFIPESRPHNKQQGVLWRLADAGLLSVAFVWRDNFPTKETGPARYVELRDAAYNADVRIFHDHPVSRNTRRFVHRTNRSLTIQPWRGLVHLGSLQQDRAIIALGQSRHLGGGLLIPVDIPRSEFETMTSEMTHA >LS483402.1|SQG56950.1|669889_671095_-|CRISPR-associated-protein-GSU0053/csb1,-Dpsyc-system MGTLSYTDLVKACSAGGSSVLTSITELEAAVGQHGSVAPAKFVNRSEPVFAFEDRFIDGESKRTVLIDSKQSQLNRAEAALMQAINEGNETLNRIPRIEVSYNDSKVFSDLELPHRFTDGHIRAGSIDGKPTTENDLYISARNSTPRNMKPLLNLAPSALIFGGWDASRKSDQVKLRSALVGEIIGVLANQNRAESYSRRGGARVDPVAASVKMTGTDLKETALVQSHELSQKTRSKLDNQVKKAKKGETISASSLGLGAILPSLDSLGGVACQRIIRSWALSFAALRQLRFGGTAEQDIAARALLAALGLAAMARAESELNIRANCDLVEQGKPVVTLDLRYGEKRELEPISVEAADELLKEAIAKASACGVADWEGQILHVTGNPVVLRGATEDDAEAE >LS483402.1|SQG56952.1|671480_671732_+|Uncharacterised-protein METNSIGASEEAYSMLSLAETIYGPGRVPRTASLVDLCSRVVGIGEIVRVQLSRAMPRLSLRMLRQRSGRSLLLRLELSLQRQ >LS483402.1|SQG56954.1|671790_672597_-|lipase-LipC MAIIDAPLPLSARLPARGLFEDDWRARPTSRHPYPVILIHGTGVTKGDWMELGTDLRKKGYAVFAPDFGMRSTAAVAESADQVGAYIHAVLKVTGAERVILVGHSQGGILARYWMHHLDGARYVTHLICLAVPNHGTSHGGVISPLTRTARGTVVVDSIITNFFGASGFEMLAESDLIQELNANGDTLPGIYYSCITTKSDTIIQPVESCFLTGPLVRNIYVQAVSKRAIVLHEDVPYDRRVRRIVLSELERVERLTAKKHVRTEHNT >LS483402.1|SQG56956.1|672690_673278_+|nitroreductase MSLTVAEAIANRRATRQYTEQEVSDAVLDVVVSQALQAPSAFNAQRADLVVIRDQAIKDKIFAASGQKQLRDAPVVLVTVARADVPEDLDEVLGVERATFVRNVLAKADAARLRETALKDAMLVAGFALIAAQGEGLATSPTTGWDEAKVLEAIGLADRSDRAVGLVIGMGYPAEFPAHPGRAESRRVNDGYARD >LS483402.1|SQG56958.1|673359_673917_+|anhydrase-family-3-protein MNSGPIILPFNGKTPRVHETAFIAPNATLIGDVEIAAHASVFYGCVLRADINMIRVGARTNVQDNSVLHVDGDAPCILGEDVTVGHMALVHGSTVGNGTLVGMHSALLSRSVIGAGSLIAAGAVVLEGQEIPAGSLAAGVPAQVRRVLSSEQSAGFIPHAGKYVNVASMHRELGMSLSLDQVRFS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
LS483402_2 | 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586238-586269 | 32 | NZ_CP029834 | Azospirillum ramasamyi strain M2T2B2 plasmid unnamed4, complete sequence | 74661-74692 | 6 | 0.812 |
LS483402_2 | 2.24|587215|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 587215-587246 | 32 | NC_027331 | Citrobacter phage Moon, complete genome | 45353-45384 | 6 | 0.812 |
LS483402_1 | 1.1|16509|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16509-16540 | 32 | MK448727 | Streptococcus phage Javan291, complete genome | 5947-5978 | 7 | 0.781 |
LS483402_1 | 1.1|16509|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16509-16540 | 32 | NC_050148 | Pseudomonas virus Pa193, complete genome | 8390-8421 | 7 | 0.781 |
LS483402_1 | 1.25|17975|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17975-18006 | 32 | NZ_CP016453 | Sphingobium sp. RAC03 plasmid pBSY17_1, complete sequence | 451802-451833 | 7 | 0.781 |
LS483402_1 | 1.30|18280|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 18280-18311 | 32 | NZ_CP029830 | Azospirillum ramasamyi strain M2T2B2 plasmid unnamed1, complete sequence | 513867-513898 | 7 | 0.781 |
LS483402_1 | 1.33|18463|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 18463-18494 | 32 | NC_006826 | Sphingobium xenophagum QYY plasmid pSx-Qyy, complete sequence | 4654-4685 | 7 | 0.781 |
LS483402_2 | 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586238-586269 | 32 | NC_010510 | Methylobacterium radiotolerans JCM 2831 plasmid pMRAD01, complete sequence | 212730-212761 | 7 | 0.781 |
LS483402_2 | 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586360-586391 | 32 | NZ_CP015881 | Ensifer adhaerens strain Casida A plasmid pCasidaAA, complete sequence | 301490-301521 | 7 | 0.781 |
LS483402_2 | 2.16|586727|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586727-586758 | 32 | NZ_HG938356 | Neorhizobium galegae bv. officinalis bv. officinalis str. HAMBI 1141 plasmid pHAMBI1141a, complete sequence | 1378731-1378762 | 7 | 0.781 |
LS483402_2 | 2.16|586727|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586727-586758 | 32 | NZ_CP030761 | Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed1, complete sequence | 1031353-1031384 | 7 | 0.781 |
LS483402_2 | 2.21|587032|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 587032-587063 | 32 | NZ_CP029831 | Azospirillum ramasamyi strain M2T2B2 plasmid unnamed7, complete sequence | 287607-287638 | 7 | 0.781 |
LS483402_1 | 1.7|16876|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16876-16907 | 32 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 1860981-1861012 | 8 | 0.75 |
LS483402_1 | 1.7|16876|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16876-16907 | 32 | NZ_CP054622 | Azospirillum oryzae strain KACC 14407 plasmid unnamed7, complete sequence | 36744-36775 | 8 | 0.75 |
LS483402_1 | 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16998-17029 | 32 | NZ_CP019603 | Croceicoccus marinus strain E4A9 plasmid pCME4A9I, complete sequence | 106189-106220 | 8 | 0.75 |
LS483402_1 | 1.10|17059|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17059-17090 | 32 | NC_022044 | Paracoccus aminophilus JCM 7686 plasmid pAMI6, complete sequence | 165671-165702 | 8 | 0.75 |
LS483402_1 | 1.29|18219|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 18219-18250 | 32 | NZ_KY000046 | Agrobacterium genomosp. 1 strain CFBP2177 plasmid pTi_CFBP2177, complete sequence | 21935-21966 | 8 | 0.75 |
LS483402_1 | 1.33|18463|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 18463-18494 | 32 | NZ_CP020900 | Rhizobium phaseoli Brasil 5 strain Bra5 plasmid pRphaBra5d, complete sequence | 568297-568328 | 8 | 0.75 |
LS483402_2 | 2.1|585810|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 585810-585841 | 32 | MH779523 | Lactococcus phage vB_Llc_bIBBAm4, complete genome | 13391-13422 | 8 | 0.75 |
LS483402_2 | 2.1|585810|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 585810-585841 | 32 | NC_017060 | Rahnella aquatilis HX2 plasmid PRA1, complete sequence | 136330-136361 | 8 | 0.75 |
LS483402_2 | 2.1|585810|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 585810-585841 | 32 | NC_015062 | Rahnella sp. Y9602 plasmid pRAHAQ01, complete sequence | 139500-139531 | 8 | 0.75 |
LS483402_2 | 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586238-586269 | 32 | NZ_CP023068 | Ensifer sojae CCBAU 05684 plasmid pSJ05684b, complete sequence | 857324-857355 | 8 | 0.75 |
LS483402_2 | 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586238-586269 | 32 | NC_023497 | Amycolatopsis keratiniphila plasmid pXL100, complete sequence | 22077-22108 | 8 | 0.75 |
LS483402_2 | 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586238-586269 | 32 | NC_048068 | Microbacterium phage OneinaGillian, complete genome | 3594-3625 | 8 | 0.75 |
LS483402_2 | 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586238-586269 | 32 | MT310894 | Microbacterium phage Tempo, complete genome | 3942-3973 | 8 | 0.75 |
LS483402_2 | 2.21|587032|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 587032-587063 | 32 | NC_014838 | Pantoea sp. At-9b plasmid pPAT9B01, complete sequence | 753803-753834 | 8 | 0.75 |
LS483402_2 | 2.21|587032|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 587032-587063 | 32 | NZ_CP012399 | Chelatococcus sp. CO-6 plasmid pCO-6, complete sequence | 251919-251950 | 8 | 0.75 |
LS483402_1 | 1.1|16509|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16509-16540 | 32 | NZ_CP041677 | Lactobacillus reuteri strain LL7 plasmid unnamed, complete sequence | 22008-22039 | 9 | 0.719 |
LS483402_1 | 1.5|16754|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16754-16785 | 32 | NZ_CP014175 | Clostridium argentinense strain 89G plasmid pRSJ17_1, complete sequence | 97930-97961 | 9 | 0.719 |
LS483402_1 | 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16998-17029 | 32 | MN694099 | Marine virus AFVG_250M963, complete genome | 21854-21885 | 9 | 0.719 |
LS483402_1 | 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16998-17029 | 32 | MN694466 | Marine virus AFVG_250M969, complete genome | 16159-16190 | 9 | 0.719 |
LS483402_1 | 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16998-17029 | 32 | MN694378 | Marine virus AFVG_250M964, complete genome | 21909-21940 | 9 | 0.719 |
LS483402_1 | 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16998-17029 | 32 | MN694090 | Marine virus AFVG_250M1127, complete genome | 15005-15036 | 9 | 0.719 |
LS483402_1 | 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16998-17029 | 32 | MN694428 | Marine virus AFVG_250M968, complete genome | 16140-16171 | 9 | 0.719 |
LS483402_1 | 1.21|17731|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17731-17762 | 32 | MK448905 | Streptococcus phage Javan318, complete genome | 13299-13330 | 9 | 0.719 |
LS483402_1 | 1.25|17975|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17975-18006 | 32 | NZ_CP007144 | Hymenobacter swuensis DY53 plasmid pHsw1, complete sequence | 157812-157843 | 9 | 0.719 |
LS483402_1 | 1.27|18097|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 18097-18128 | 32 | AP014287 | Uncultured Mediterranean phage uvMED isolate uvMED-GF-U-MedDCM-OCT-S24-C25, *** SEQUENCING IN PROGRESS *** | 24844-24875 | 9 | 0.719 |
LS483402_1 | 1.32|18402|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 18402-18433 | 32 | NZ_CP032703 | Pantoea dispersa strain DSM 32899 plasmid unnamed1, complete sequence | 356170-356201 | 9 | 0.719 |
LS483402_1 | 1.33|18463|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 18463-18494 | 32 | NZ_CP045339 | Vibrio sp. THAF190c plasmid pTHAF190c_a, complete sequence | 1117986-1118017 | 9 | 0.719 |
LS483402_2 | 2.4|585993|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 585993-586024 | 32 | MK605246 | Nodularia phage vB_NspS-kac68v162, complete genome | 5128-5159 | 9 | 0.719 |
LS483402_2 | 2.4|585993|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 585993-586024 | 32 | NC_048757 | Nodularia phage vB_NspS-kac68v161, complete genome | 5128-5159 | 9 | 0.719 |
LS483402_2 | 2.5|586054|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586054-586085 | 32 | MK605246 | Nodularia phage vB_NspS-kac68v162, complete genome | 5128-5159 | 9 | 0.719 |
LS483402_2 | 2.5|586054|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586054-586085 | 32 | NC_048757 | Nodularia phage vB_NspS-kac68v161, complete genome | 5128-5159 | 9 | 0.719 |
LS483402_2 | 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586238-586269 | 32 | NZ_CP013635 | Rhizobium sp. N324 plasmid pRspN324e, complete sequence | 149950-149981 | 9 | 0.719 |
LS483402_2 | 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586238-586269 | 32 | NZ_LR134451 | Tsukamurella tyrosinosolvens strain NCTC13231 plasmid 9, complete sequence | 48044-48075 | 9 | 0.719 |
LS483402_2 | 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586238-586269 | 32 | NZ_CP048816 | Caulobacter rhizosphaerae strain KCTC 52515 plasmid unnamed | 171858-171889 | 9 | 0.719 |
LS483402_2 | 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586360-586391 | 32 | NZ_CP036488 | Rahnella aquatilis strain MEM40 plasmid pMEM40-1, complete sequence | 108405-108436 | 9 | 0.719 |
LS483402_2 | 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586360-586391 | 32 | NZ_CP032297 | Rahnella aquatilis strain ZF7 plasmid pRAZF7, complete sequence | 460873-460904 | 9 | 0.719 |
LS483402_2 | 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586360-586391 | 32 | NC_017060 | Rahnella aquatilis HX2 plasmid PRA1, complete sequence | 184336-184367 | 9 | 0.719 |
LS483402_2 | 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586360-586391 | 32 | NZ_CP034838 | Rahnella aquatilis strain KM12 plasmid pKM12v1, complete sequence | 173738-173769 | 9 | 0.719 |
LS483402_2 | 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586360-586391 | 32 | NZ_CP034839 | Rahnella aquatilis strain KM25 plasmid pKM12v2, complete sequence | 173738-173769 | 9 | 0.719 |
LS483402_2 | 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586360-586391 | 32 | NZ_CP034837 | Rahnella aquatilis strain KM05 plasmid pKM05, complete sequence | 134061-134092 | 9 | 0.719 |
LS483402_2 | 2.13|586544|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586544-586575 | 32 | NZ_CP014068 | Enterococcus gallinarum strain FDAARGOS_163 plasmid unnamed, complete sequence | 7089-7120 | 9 | 0.719 |
LS483402_2 | 2.17|586788|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586788-586819 | 32 | NZ_CP018222 | Tardibacter chloracetimidivorans strain JJ-A5 plasmid pHSL1, complete sequence | 77545-77576 | 9 | 0.719 |
LS483402_2 | 2.19|586910|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586910-586941 | 32 | NZ_CP051207 | Dolichospermum flos-aquae CCAP 1403/13F plasmid pAfl69, complete sequence | 29362-29393 | 9 | 0.719 |
LS483402_2 | 2.20|586971|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586971-587002 | 32 | KU160494 | Vibrio phage vB_VmeM-32, complete genome | 103841-103872 | 9 | 0.719 |
LS483402_2 | 2.21|587032|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 587032-587063 | 32 | NC_041921 | Dinoroseobacter phage vB_DshS-R5C, complete genome | 19420-19451 | 9 | 0.719 |
LS483402_1 | 1.1|16509|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16509-16540 | 32 | NZ_CP021033 | Rhizobium sp. NXC14 plasmid pRspNXC14c, complete sequence | 566801-566832 | 10 | 0.688 |
LS483402_1 | 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 16998-17029 | 32 | MN694728 | Marine virus AFVG_250M962, complete genome | 15616-15647 | 10 | 0.688 |
LS483402_1 | 1.10|17059|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17059-17090 | 32 | NZ_HG938357 | Neorhizobium galegae bv. officinalis bv. officinalis str. HAMBI 1141 plasmid pHAMBI1141b, complete sequence | 135949-135980 | 10 | 0.688 |
LS483402_1 | 1.10|17059|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17059-17090 | 32 | NC_019919 | Yersinia phage phiR201 complete genome | 83163-83194 | 10 | 0.688 |
LS483402_1 | 1.16|17426|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17426-17457 | 32 | KX889311 | Pseudomonas aeruginosa plasmid pJB12, complete sequence | 19899-19930 | 10 | 0.688 |
LS483402_1 | 1.17|17487|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17487-17518 | 32 | CP053403 | Salmonella enterica strain 2010K-2057 plasmid unnamed1, complete sequence | 77025-77056 | 10 | 0.688 |
LS483402_1 | 1.17|17487|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17487-17518 | 32 | NZ_CP019182 | Salmonella enterica subsp. enterica serovar Inverness str. ATCC 10720 plasmid pATCC10720, complete sequence | 74860-74891 | 10 | 0.688 |
LS483402_1 | 1.20|17670|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17670-17701 | 32 | CP034582 | Lactococcus lactis subsp. lactis strain C10 plasmid pC10B, complete sequence | 23883-23914 | 10 | 0.688 |
LS483402_1 | 1.20|17670|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17670-17701 | 32 | CP029292 | Lactococcus lactis subsp. lactis KLDS 4.0325 plasmid unnamed5 | 39892-39923 | 10 | 0.688 |
LS483402_1 | 1.24|17914|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17914-17945 | 32 | NZ_AP017962 | Synechococcus sp. NIES-970 plasmid plasmid3 DNA, complete sequence | 62920-62951 | 10 | 0.688 |
LS483402_1 | 1.32|18402|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 18402-18433 | 32 | NZ_CP020848 | Klebsiella variicola strain KPN1481 plasmid pKPN1481-1, complete sequence | 134018-134049 | 10 | 0.688 |
LS483402_1 | 1.32|18402|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 18402-18433 | 32 | NZ_CP009856 | UNVERIFIED_ORG: Enterobacter cloacae strain ECNIH5 plasmid pENT-784, complete sequence | 60965-60996 | 10 | 0.688 |
LS483402_1 | 1.32|18402|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 18402-18433 | 32 | NZ_CP008898 | Enterobacter hormaechei subsp. hoffmannii ECNIH3 plasmid pENT-576, complete sequence | 15674-15705 | 10 | 0.688 |
LS483402_2 | 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586238-586269 | 32 | NC_020548 | Azoarcus sp. KH32C plasmid pAZKH, complete sequence | 250469-250500 | 10 | 0.688 |
LS483402_2 | 2.12|586483|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586483-586514 | 32 | NC_023285 | Streptomyces sp. F8 plasmid pFRL5, complete sequence | 381918-381949 | 10 | 0.688 |
LS483402_1 | 1.16|17426|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17426-17457 | 32 | NC_031230 | Gordonia phage Yvonnetastic, complete genome | 89747-89778 | 11 | 0.656 |
LS483402_1 | 1.24|17914|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17914-17945 | 32 | NZ_CP019984 | Pediococcus inopinatus strain DSM 20285 plasmid pLDW-14, complete sequence | 3045-3076 | 11 | 0.656 |
LS483402_1 | 1.24|17914|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17914-17945 | 32 | NZ_CP019984 | Pediococcus inopinatus strain DSM 20285 plasmid pLDW-14, complete sequence | 17644-17675 | 11 | 0.656 |
LS483402_1 | 1.24|17914|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 17914-17945 | 32 | NZ_CP019984 | Pediococcus inopinatus strain DSM 20285 plasmid pLDW-14, complete sequence | 32248-32279 | 11 | 0.656 |
LS483402_1 | 1.30|18280|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 18280-18311 | 32 | NZ_CP017565 | Paraburkholderia sprentiae WSM5005 plasmid pl2WSM5005, complete sequence | 373868-373899 | 11 | 0.656 |
LS483402_2 | 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586360-586391 | 32 | NZ_CP049159 | Caballeronia sp. SBC1 plasmid pSBC1_3, complete sequence | 293424-293455 | 11 | 0.656 |
LS483402_2 | 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT | 586360-586391 | 32 | NZ_CP049319 | Caballeronia sp. SBC2 plasmid pSBC2-3, complete sequence | 87020-87051 | 11 | 0.656 |
1. spacer 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP029834 (Azospirillum ramasamyi strain M2T2B2 plasmid unnamed4, complete sequence) position: , mismatch: 6, identity: 0.812
cgcccctaccggcgcgacccgcaaggacgccg- CRISPR spacer cgccccgtccggcgcgacccgc-accacgtcga Protospacer ****** ************** * ***.**
2. spacer 2.24|587215|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_027331 (Citrobacter phage Moon, complete genome) position: , mismatch: 6, identity: 0.812
---cgggcaaaaacatgagctccgaaagcatatct CRISPR spacer acccg---agaatcattagctccgaaagcatatct Protospacer ** *.** *** ******************
3. spacer 1.1|16509|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to MK448727 (Streptococcus phage Javan291, complete genome) position: , mismatch: 7, identity: 0.781
cgggttgatcgatttgaaagctgaacgtgata CRISPR spacer tgtgttgattgatttgaaagctggacggagta Protospacer .* ******.*************.*** ..**
4. spacer 1.1|16509|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_050148 (Pseudomonas virus Pa193, complete genome) position: , mismatch: 7, identity: 0.781
cgggttgatcgatttgaaagctgaacgtgata CRISPR spacer ggcgatgatcgatctgaaagctgaacttgcca Protospacer * * ********.************ ** .*
5. spacer 1.25|17975|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP016453 (Sphingobium sp. RAC03 plasmid pBSY17_1, complete sequence) position: , mismatch: 7, identity: 0.781
ggtgccaatggcgggctggtagctgtctacca CRISPR spacer ggcacggaaggcgggcaggtagctgtcttcca Protospacer **..* .* ******* *********** ***
6. spacer 1.30|18280|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP029830 (Azospirillum ramasamyi strain M2T2B2 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.781
tgttgaatctggcatcgacgaagacggaaagc CRISPR spacer tctggaatctcggatcgacgaagacggcgggc Protospacer * * ****** * ************** ..**
7. spacer 1.33|18463|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_006826 (Sphingobium xenophagum QYY plasmid pSx-Qyy, complete sequence) position: , mismatch: 7, identity: 0.781
atcattgcccagctcacgagcacgctcggcgg CRISPR spacer ctcctgtgcctgctcacgggcacgctcggcgg Protospacer ** * ** *******.*************
8. spacer 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_010510 (Methylobacterium radiotolerans JCM 2831 plasmid pMRAD01, complete sequence) position: , mismatch: 7, identity: 0.781
cgcccctaccggcgcgacccgcaaggacgccg CRISPR spacer cgccccgaccggcgcgaaccgcacgctgaccg Protospacer ****** ********** ***** * .***
9. spacer 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015881 (Ensifer adhaerens strain Casida A plasmid pCasidaAA, complete sequence) position: , mismatch: 7, identity: 0.781
gctccctccggcgatcaagatcaatgcgaaca CRISPR spacer cctcggggcggcgatccagatcactgcgaaca Protospacer *** ******** ****** ********
10. spacer 2.16|586727|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_HG938356 (Neorhizobium galegae bv. officinalis bv. officinalis str. HAMBI 1141 plasmid pHAMBI1141a, complete sequence) position: , mismatch: 7, identity: 0.781
ccaatcatggcacgtgaccagcgcttctacgg CRISPR spacer gcaaccatggcacgggaccagcgcttcaagct Protospacer ***.********* ************ *
11. spacer 2.16|586727|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030761 (Rhizobium leguminosarum strain ATCC 14479 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.781
-ccaatcatggcacgtgaccagcgcttctacgg CRISPR spacer tctggtcg-ggcacgtgaccagcgcgtctgcgg Protospacer *...**. **************** ***.***
12. spacer 2.21|587032|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP029831 (Azospirillum ramasamyi strain M2T2B2 plasmid unnamed7, complete sequence) position: , mismatch: 7, identity: 0.781
tcgggaa---gctctttcaccgtggcgatgatgtt CRISPR spacer ---gaaaccggctcgttcaccgtggcgatgatgcg Protospacer *.** **** ******************.
13. spacer 1.7|16876|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 8, identity: 0.75
aaaaagggcaaagttgatcaggtacgtgtggg CRISPR spacer gataagggcaaagtcgatcaggtgcggctgaa Protospacer .* ***********.********.** **..
14. spacer 1.7|16876|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP054622 (Azospirillum oryzae strain KACC 14407 plasmid unnamed7, complete sequence) position: , mismatch: 8, identity: 0.75
aaaaagggcaaagttgatcaggtacgtgtggg CRISPR spacer aaaaagggcgaagtggatcaggtctccgtcga Protospacer *********.**** ******** . .** *.
15. spacer 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019603 (Croceicoccus marinus strain E4A9 plasmid pCME4A9I, complete sequence) position: , mismatch: 8, identity: 0.75
caatcgcaacagcacctataccatcgacttca CRISPR spacer caatcgcaacatcaactataccaatatctgcc Protospacer *********** ** ******** .. ** *
16. spacer 1.10|17059|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_022044 (Paracoccus aminophilus JCM 7686 plasmid pAMI6, complete sequence) position: , mismatch: 8, identity: 0.75
tcgcgccttcagctcttctatctccgcaagaa CRISPR spacer ggcaaccttcagctcttcgatctcggcaagac Protospacer .************* ***** ******
17. spacer 1.29|18219|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY000046 (Agrobacterium genomosp. 1 strain CFBP2177 plasmid pTi_CFBP2177, complete sequence) position: , mismatch: 8, identity: 0.75
cttggacgtgtcccgatcgtcatgatgattaa CRISPR spacer tatggaagtgtccagatcgtcatgatcgatag Protospacer . **** ****** ************ . **.
18. spacer 1.33|18463|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP020900 (Rhizobium phaseoli Brasil 5 strain Bra5 plasmid pRphaBra5d, complete sequence) position: , mismatch: 8, identity: 0.75
atcattgcccagctcacgagcacgctcggcgg CRISPR spacer gccgtggtccagctcgcgatcacgctcggcgc Protospacer ..*.* *.*******.*** ***********
19. spacer 2.1|585810|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to MH779523 (Lactococcus phage vB_Llc_bIBBAm4, complete genome) position: , mismatch: 8, identity: 0.75
gttctggacaactctcttctttgtctttatag CRISPR spacer gcattctccaactttcttctttgtctttaaag Protospacer *. .* *****.*************** **
20. spacer 2.1|585810|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_017060 (Rahnella aquatilis HX2 plasmid PRA1, complete sequence) position: , mismatch: 8, identity: 0.75
-gttctggacaactctcttctttgtctttatag CRISPR spacer cgcgatag-caactctcttctttgtatttttaa Protospacer *. *.* **************** *** **.
21. spacer 2.1|585810|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_015062 (Rahnella sp. Y9602 plasmid pRAHAQ01, complete sequence) position: , mismatch: 8, identity: 0.75
-gttctggacaactctcttctttgtctttatag CRISPR spacer cgcgatag-caactctcttctttgtatttttaa Protospacer *. *.* **************** *** **.
22. spacer 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP023068 (Ensifer sojae CCBAU 05684 plasmid pSJ05684b, complete sequence) position: , mismatch: 8, identity: 0.75
cgcccctaccggcgcgacccgcaaggacgccg CRISPR spacer tcgcctggccggcgcgaccagccaggacgccg Protospacer . **. .*********** ** *********
23. spacer 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_023497 (Amycolatopsis keratiniphila plasmid pXL100, complete sequence) position: , mismatch: 8, identity: 0.75
cgcccctaccggcgcgacccgcaaggacgccg CRISPR spacer cgagctgcgcgccgcgacccgcaaggccgccg Protospacer ** *. ** ************** *****
24. spacer 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_048068 (Microbacterium phage OneinaGillian, complete genome) position: , mismatch: 8, identity: 0.75
cgcccctaccggcgcgacccgcaaggacgccg CRISPR spacer cgccactaccggctcgacccgcacgtgcccta Protospacer **** ******** ********* * .* *..
25. spacer 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to MT310894 (Microbacterium phage Tempo, complete genome) position: , mismatch: 8, identity: 0.75
cgcccctaccggcgcgacccgcaaggacgccg CRISPR spacer cgccactaccggctcgacccgcacgtgcccta Protospacer **** ******** ********* * .* *..
26. spacer 2.21|587032|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_014838 (Pantoea sp. At-9b plasmid pPAT9B01, complete sequence) position: , mismatch: 8, identity: 0.75
tcgggaagctctttcaccgtggcgatgatgtt CRISPR spacer taatccagctctttcaacatggcgatgatgct Protospacer * . ********** *.***********.*
27. spacer 2.21|587032|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012399 (Chelatococcus sp. CO-6 plasmid pCO-6, complete sequence) position: , mismatch: 8, identity: 0.75
tcgggaagctctttcaccgtggcgatgatgtt CRISPR spacer tccgcctgctccttcaccgcggcgatgatgcg Protospacer ** * ****.*******.**********.
28. spacer 1.1|16509|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP041677 (Lactobacillus reuteri strain LL7 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
cgggttgatcgatttgaaagctgaacgtgata CRISPR spacer gtaacttatcgatttgaaagcctaacgtgatt Protospacer ...* **************. ********
29. spacer 1.5|16754|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP014175 (Clostridium argentinense strain 89G plasmid pRSJ17_1, complete sequence) position: , mismatch: 9, identity: 0.719
ttacaggccgaggagttatttttcatggctaa CRISPR spacer gttaataatgaggatttatttttcatagctaa Protospacer * * . .***** ***********.*****
30. spacer 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to MN694099 (Marine virus AFVG_250M963, complete genome) position: , mismatch: 9, identity: 0.719
caatcgcaacagcacctataccatcgacttca CRISPR spacer gagctgcaacagcagctataccatctactaag Protospacer *...********* ********** *** .
31. spacer 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to MN694466 (Marine virus AFVG_250M969, complete genome) position: , mismatch: 9, identity: 0.719
caatcgcaacagcacctataccatcgacttca CRISPR spacer gagctgcaacagcagctataccatctactaag Protospacer *...********* ********** *** .
32. spacer 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to MN694378 (Marine virus AFVG_250M964, complete genome) position: , mismatch: 9, identity: 0.719
caatcgcaacagcacctataccatcgacttca CRISPR spacer gagctgcaacagcagctataccatctactaag Protospacer *...********* ********** *** .
33. spacer 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to MN694090 (Marine virus AFVG_250M1127, complete genome) position: , mismatch: 9, identity: 0.719
caatcgcaacagcacctataccatcgacttca CRISPR spacer gagctgcaacagcagctataccatctactaag Protospacer *...********* ********** *** .
34. spacer 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to MN694428 (Marine virus AFVG_250M968, complete genome) position: , mismatch: 9, identity: 0.719
caatcgcaacagcacctataccatcgacttca CRISPR spacer gagctgcaacagcagctataccatctactaag Protospacer *...********* ********** *** .
35. spacer 1.21|17731|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to MK448905 (Streptococcus phage Javan318, complete genome) position: , mismatch: 9, identity: 0.719
caatcggctggcctatagtgttcaaaacttcc CRISPR spacer tagcatgctggcatatagttttcaaaactatc Protospacer .*.. ****** ****** ********* .*
36. spacer 1.25|17975|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP007144 (Hymenobacter swuensis DY53 plasmid pHsw1, complete sequence) position: , mismatch: 9, identity: 0.719
ggtgccaatggcgggctggtagctgtctacca CRISPR spacer gccctccttgacgggctggtagctgactaccc Protospacer * . .* **.************** *****
37. spacer 1.27|18097|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to AP014287 (Uncultured Mediterranean phage uvMED isolate uvMED-GF-U-MedDCM-OCT-S24-C25, *** SEQUENCING IN PROGRESS ***) position: , mismatch: 9, identity: 0.719
atcaacggtgagctgcgaaataagctcggcgc CRISPR spacer atcaacggtgagatgagaaataaagaagctga Protospacer ************ ** *******. * .*
38. spacer 1.32|18402|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032703 (Pantoea dispersa strain DSM 32899 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
ctacaaacttttctgcaaacgccacctcctca CRISPR spacer ccatcgcctgttctgcaaacgccacttcctgt Protospacer *.*. . ** ***************.****
39. spacer 1.33|18463|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP045339 (Vibrio sp. THAF190c plasmid pTHAF190c_a, complete sequence) position: , mismatch: 9, identity: 0.719
atcattgcccagctcacgagcacgctcggcgg CRISPR spacer ggcaatgcccagctcactagcacgcccattcg Protospacer . ** ************ *******.*. . *
40. spacer 2.4|585993|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to MK605246 (Nodularia phage vB_NspS-kac68v162, complete genome) position: , mismatch: 9, identity: 0.719
gcttatcagccacacgcataccaacaagggct CRISPR spacer acttatcagccacactcacaccaaaagctaat Protospacer .************** **.***** *. . *
41. spacer 2.4|585993|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_048757 (Nodularia phage vB_NspS-kac68v161, complete genome) position: , mismatch: 9, identity: 0.719
gcttatcagccacacgcataccaacaagggct CRISPR spacer acttatcagccacactcacaccaaaagctaat Protospacer .************** **.***** *. . *
42. spacer 2.5|586054|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to MK605246 (Nodularia phage vB_NspS-kac68v162, complete genome) position: , mismatch: 9, identity: 0.719
gcttatcagccacacgcataccaacaagggct CRISPR spacer acttatcagccacactcacaccaaaagctaat Protospacer .************** **.***** *. . *
43. spacer 2.5|586054|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_048757 (Nodularia phage vB_NspS-kac68v161, complete genome) position: , mismatch: 9, identity: 0.719
gcttatcagccacacgcataccaacaagggct CRISPR spacer acttatcagccacactcacaccaaaagctaat Protospacer .************** **.***** *. . *
44. spacer 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP013635 (Rhizobium sp. N324 plasmid pRspN324e, complete sequence) position: , mismatch: 9, identity: 0.719
cgcccctaccggcgcgacccgcaaggacgccg CRISPR spacer gatcgttaccggcgcgacccgcgaggaagcgt Protospacer ..* .****************.**** **
45. spacer 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LR134451 (Tsukamurella tyrosinosolvens strain NCTC13231 plasmid 9, complete sequence) position: , mismatch: 9, identity: 0.719
cgcccctaccggcgcgacccgcaaggacgccg CRISPR spacer catcgtgaccggcgcgacccgccaggaggcgc Protospacer *..* . *************** **** **
46. spacer 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP048816 (Caulobacter rhizosphaerae strain KCTC 52515 plasmid unnamed) position: , mismatch: 9, identity: 0.719
cgcccctaccggcgcgacccgcaaggacgccg CRISPR spacer tcccggcttcggcgtgaccagcaaggacgccg Protospacer . ** . .*****.**** ************
47. spacer 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP036488 (Rahnella aquatilis strain MEM40 plasmid pMEM40-1, complete sequence) position: , mismatch: 9, identity: 0.719
gctccctccggcgatcaagatcaatgcgaaca CRISPR spacer caggccgagggcgatcacgaacaatgcgaaca Protospacer ** ******** ** ***********
48. spacer 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP032297 (Rahnella aquatilis strain ZF7 plasmid pRAZF7, complete sequence) position: , mismatch: 9, identity: 0.719
gctccctccggcgatcaagatcaatgcgaaca CRISPR spacer cagaccgagggcgatcacgaacaatgcgaaca Protospacer ** ******** ** ***********
49. spacer 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_017060 (Rahnella aquatilis HX2 plasmid PRA1, complete sequence) position: , mismatch: 9, identity: 0.719
gctccctccggcgatcaagatcaatgcgaaca CRISPR spacer cagaccgagggcgatcacgaacaatgcgaaca Protospacer ** ******** ** ***********
50. spacer 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034838 (Rahnella aquatilis strain KM12 plasmid pKM12v1, complete sequence) position: , mismatch: 9, identity: 0.719
gctccctccggcgatcaagatcaatgcgaaca CRISPR spacer cagaccgagggcgatcacgaacaatgcgaaca Protospacer ** ******** ** ***********
51. spacer 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034839 (Rahnella aquatilis strain KM25 plasmid pKM12v2, complete sequence) position: , mismatch: 9, identity: 0.719
gctccctccggcgatcaagatcaatgcgaaca CRISPR spacer cagaccgagggcgatcacgaacaatgcgaaca Protospacer ** ******** ** ***********
52. spacer 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP034837 (Rahnella aquatilis strain KM05 plasmid pKM05, complete sequence) position: , mismatch: 9, identity: 0.719
gctccctccggcgatcaagatcaatgcgaaca CRISPR spacer cagaccgagggcgatcacgaacaatgcgaaca Protospacer ** ******** ** ***********
53. spacer 2.13|586544|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP014068 (Enterococcus gallinarum strain FDAARGOS_163 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
cagcttcccctaaaggagaaaattctatgtat CRISPR spacer tagcttcctctaaagtagaaaatttttcgata Protospacer .*******.****** ********.* .*
54. spacer 2.17|586788|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP018222 (Tardibacter chloracetimidivorans strain JJ-A5 plasmid pHSL1, complete sequence) position: , mismatch: 9, identity: 0.719
caattgatccaatgtgtcctcgatgctcattg CRISPR spacer gcaaggatcgaatgtgtcctcgacgctcaggc Protospacer * **** *************.*****
55. spacer 2.19|586910|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP051207 (Dolichospermum flos-aquae CCAP 1403/13F plasmid pAfl69, complete sequence) position: , mismatch: 9, identity: 0.719
ttaagattcgatcacaatttctaaccacatgc CRISPR spacer ttctgattctatcaaaatttctaaccagtatt Protospacer ** ***** **** ************ .
56. spacer 2.20|586971|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to KU160494 (Vibrio phage vB_VmeM-32, complete genome) position: , mismatch: 9, identity: 0.719
ggctttggcaggcaaagcgccggtttcgcatc CRISPR spacer aaaattggcaggcagagcgtcggtttctcgtg Protospacer .. **********.****.******* *.*
57. spacer 2.21|587032|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_041921 (Dinoroseobacter phage vB_DshS-R5C, complete genome) position: , mismatch: 9, identity: 0.719
tcgggaagctctttcaccgtggcgatgatgtt CRISPR spacer aggaaagcctctttcaccttgtcgatgatgtc Protospacer *..*. ********** ** *********.
58. spacer 1.1|16509|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP021033 (Rhizobium sp. NXC14 plasmid pRspNXC14c, complete sequence) position: , mismatch: 10, identity: 0.688
cgggttgatcgatttgaaagctgaacgtgata CRISPR spacer atcgtcgatcgatttgaaagctgtaccgacga Protospacer **.***************** ** . *
59. spacer 1.9|16998|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to MN694728 (Marine virus AFVG_250M962, complete genome) position: , mismatch: 10, identity: 0.688
caatcgcaacagcacctataccatcgacttca CRISPR spacer gagctgcaacagcagctataccatctaccagg Protospacer *...********* ********** **. .
60. spacer 1.10|17059|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_HG938357 (Neorhizobium galegae bv. officinalis bv. officinalis str. HAMBI 1141 plasmid pHAMBI1141b, complete sequence) position: , mismatch: 10, identity: 0.688
tcgcgccttcagctcttctatctccgcaagaa CRISPR spacer cgccgcctgcagctcttctatcaccggcccac Protospacer . ***** ************* *** *
61. spacer 1.10|17059|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_019919 (Yersinia phage phiR201 complete genome) position: , mismatch: 10, identity: 0.688
tcgcgccttcagctcttctatctccgcaagaa CRISPR spacer cttagccttcagctcttttatttccgcctcta Protospacer .. *************.***.***** *
62. spacer 1.16|17426|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to KX889311 (Pseudomonas aeruginosa plasmid pJB12, complete sequence) position: , mismatch: 10, identity: 0.688
gggaggccacatcgcgggctatgtctgcggat CRISPR spacer tcagcgccacgtcgcgggcaatgtctgcgtcg Protospacer .. *****.******** *********
63. spacer 1.17|17487|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to CP053403 (Salmonella enterica strain 2010K-2057 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
attaagcgttttgagggaaggtgaaagcgata CRISPR spacer ttctggcgttttgaggaaaggtgtaagcacac Protospacer *. .***********.****** ****.
64. spacer 1.17|17487|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019182 (Salmonella enterica subsp. enterica serovar Inverness str. ATCC 10720 plasmid pATCC10720, complete sequence) position: , mismatch: 10, identity: 0.688
attaagcgttttgagggaaggtgaaagcgata CRISPR spacer ttctggcgttttgaggaaaggtgtaagcacac Protospacer *. .***********.****** ****.
65. spacer 1.20|17670|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to CP034582 (Lactococcus lactis subsp. lactis strain C10 plasmid pC10B, complete sequence) position: , mismatch: 10, identity: 0.688
ccactccatgaaaacatcctcctatcaccaaa CRISPR spacer aatatttgagaaaacatacttctatcaccaaa Protospacer *... ******** **.***********
66. spacer 1.20|17670|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to CP029292 (Lactococcus lactis subsp. lactis KLDS 4.0325 plasmid unnamed5) position: , mismatch: 10, identity: 0.688
ccactccatgaaaacatcctcctatcaccaaa CRISPR spacer aatatttgagaaaacatacttctatcaccaaa Protospacer *... ******** **.***********
67. spacer 1.24|17914|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_AP017962 (Synechococcus sp. NIES-970 plasmid plasmid3 DNA, complete sequence) position: , mismatch: 10, identity: 0.688
gcgcccaatatctgccaaagcctccgatgtgc CRISPR spacer ctgcccaatatccgccaaatcctccagcattg Protospacer .**********.****** *****....*
68. spacer 1.32|18402|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP020848 (Klebsiella variicola strain KPN1481 plasmid pKPN1481-1, complete sequence) position: , mismatch: 10, identity: 0.688
ctacaaacttttctgcaaacgccacctcctca CRISPR spacer gaatccgactttccgcaaacgccacgtcctca Protospacer *. . .****.*********** ******
69. spacer 1.32|18402|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP009856 (UNVERIFIED_ORG: Enterobacter cloacae strain ECNIH5 plasmid pENT-784, complete sequence) position: , mismatch: 10, identity: 0.688
ctacaaacttttctgcaaacgccacctcctca CRISPR spacer gaatccgactttccgcaaacgccacgtcctca Protospacer *. . .****.*********** ******
70. spacer 1.32|18402|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP008898 (Enterobacter hormaechei subsp. hoffmannii ECNIH3 plasmid pENT-576, complete sequence) position: , mismatch: 10, identity: 0.688
ctacaaacttttctgcaaacgccacctcctca CRISPR spacer gaatccgactttccgcaaacgccacgtcctca Protospacer *. . .****.*********** ******
71. spacer 2.8|586238|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_020548 (Azoarcus sp. KH32C plasmid pAZKH, complete sequence) position: , mismatch: 10, identity: 0.688
cgcccctaccggcgcgacccgcaaggacgccg CRISPR spacer gagcccgaccgtcgcgacccgcaaggcgagcc Protospacer . *** **** ************** . *
72. spacer 2.12|586483|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_023285 (Streptomyces sp. F8 plasmid pFRL5, complete sequence) position: , mismatch: 10, identity: 0.688
atgaagacgccgtggagtacccagaaaacacg CRISPR spacer cgcaccacgccgtggaggacccagcaaaccgt Protospacer * *********** ****** ****
73. spacer 1.16|17426|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NC_031230 (Gordonia phage Yvonnetastic, complete genome) position: , mismatch: 11, identity: 0.656
gggaggccacatcgcgggctatgtctgcggat CRISPR spacer atagacccacatcccgggcgatgtctgcgacc Protospacer . ... ******* ***** *********. .
74. spacer 1.24|17914|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019984 (Pediococcus inopinatus strain DSM 20285 plasmid pLDW-14, complete sequence) position: , mismatch: 11, identity: 0.656
gcgcccaatatctgccaaagcctccgatgtgc CRISPR spacer aacgataatatctgtcaaagcctccaatgatt Protospacer . .********.**********.*** .
75. spacer 1.24|17914|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019984 (Pediococcus inopinatus strain DSM 20285 plasmid pLDW-14, complete sequence) position: , mismatch: 11, identity: 0.656
gcgcccaatatctgccaaagcctccgatgtgc CRISPR spacer aacgataatatctgtcaaagcctccaatgatt Protospacer . .********.**********.*** .
76. spacer 1.24|17914|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019984 (Pediococcus inopinatus strain DSM 20285 plasmid pLDW-14, complete sequence) position: , mismatch: 11, identity: 0.656
gcgcccaatatctgccaaagcctccgatgtgc CRISPR spacer aacgataatatctgtcaaagcctccaatgatt Protospacer . .********.**********.*** .
77. spacer 1.30|18280|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017565 (Paraburkholderia sprentiae WSM5005 plasmid pl2WSM5005, complete sequence) position: , mismatch: 11, identity: 0.656
tgttgaatctggcatcgacgaagacggaaagc CRISPR spacer gcgcgaagctggcatcgacgaaaacggctgcg Protospacer .*** **************.**** .
78. spacer 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP049159 (Caballeronia sp. SBC1 plasmid pSBC1_3, complete sequence) position: , mismatch: 11, identity: 0.656
gctccctccggcgatcaagatcaatgcgaaca CRISPR spacer cgagcctgtggcgatcaagatcaatgccggtt Protospacer *** .****************** ...
79. spacer 2.10|586360|32|LS483402|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP049319 (Caballeronia sp. SBC2 plasmid pSBC2-3, complete sequence) position: , mismatch: 11, identity: 0.656
gctccctccggcgatcaagatcaatgcgaaca CRISPR spacer cgagcctgtggcgatcaagatcaatgccggtt Protospacer *** .****************** ...
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
154805 : 163269
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >LS483402|154805:163269|DBSCAN-SWA TTTGTTTTCGCACGACACGGCTGGAGAAACGGTTGAGGATCGACACCGCGATGTGGTGATCGTGGGTGGCGGCTCCGCAGGTTCCGTCTTGGCTAACCGCCTCAGCGAAGATGGCTCCTCGGTCATGGTGCTAGAAGCCGGGCGCTCCGATTCACTCTGGGATCTTTTCATTCACATGCCCGCGGCATTCTCGTTCCCGATTGGCAACAAGTACTACGACTGGGCCTACGAGTCCGAGCCAGAGCCGGAAATGAATGGGCGCCGCATCTACCACGCACGCGGCAAAGTGCTTGGCGGTTCCAGCTCCGTCAACGGCATGATCTTCCAACGCGGAAACCCCATGGATTATGAGAAATGGGGAAAGCTGCCGGGCATGCAGAACTGGGACTACGCCCATGTGCTGCCGTACTTCAACAAGATGGAAACTGCGCTAGCAGCCGACCCCGACGACCCACGTCGCGGCCATGATGGTCCTCTCAAGCTCACCCGTGGTCCTGCCACAAACCCGCTCTTCCAAGCATTTTTCCGCTCCGTGCAGGAAGCTGGCTATAAGCTGACTAACGACGTCAACGGATACCGCCAGGAGGGGTTCGCCCCCTTCGACCGCAACATCTTCAAGGGCAAGCGCCTCTCCGCAGCCCGCGCGTATCTCCACCCAGTGAAATCCCGTAAGAACCTTGACGTGCGCACCCGCGCATTCACCACTCGCATCCTTTTCACCGGTGACAAAGCCACCGGCGTGGAATACGAATGGAAGGGGAAGACTCGTCGAGTCCACGCAGACAAGGTGATTCTCTGCGGCGGTGCGTTCAATACCCCACAGCTGCTACAGGTGAGCGGAATCGGCGACCGCGAGGTCTTGGAAAAGGCCGGCGTGGAAGTTCGTAAGCACCTCCCCGGTGTCGGCGCCAATCTTCAGGACCACCTCGAGGTCTATGTGCAGTACAACTGCACCCAACCGGTGAGCTCGCAGCCCTACTACAAGATGATCAATCGCCCCAAGATTGGTCTGCAATGGCTGCTCACCAAGAAGGGCCCCGTGGCGTCCTCTCACTTTGAGGCAGGCGGATTTGCTCGTTCCAACGACGACGAGGACTACCCCAACCTCATGTTCCACTTCCTTCCAATGGCGATCCGCTACGACGGCTCCCAGCCAGAAGGCGAACACGGCTTCCAATTCCACGTCGGCCCGATGTATTCCGATACCCTCGGCCACGTTCATATCACCTCGCCAGATCCCAAGGAAAAGCCAGAAATTATCTTCAACTACCTGGCTACCGACCAAGATCGACGCGAGTGGGTAGAAGCCGTCCGGACCTCCCGCAAGCTGCTAGACACCCCTGCGATGAAGGAGTTCACAGACGGAGAAATCTCCCCCGGCCCGCAAGTGCAGACTGATGAGGAGATTCTGGAATGGGTGCGTAACGACGCAGAAACGGCTCTGCATCCATCATGTACCGCAAAAATGGGTTCTGCTGACGACGAATACGCGGTGGTAGACCCCGACACGATGCAGGTTCATGGCGTTGAAGGGCTCTATGTCTGCGACGCTTCCGTCATGCCGATTATCACTAACGGAAACATCTACGCCCCCGTGGTCATGATGGCAGAAAAAGCTGCCGACCTGATCAAGGGCATGGAACCTCTAGCCCCCATTCACGAGCCGTTCTACCAGGCAAAGAAGGACATGCCGCTCTACGTCGAGGAACCCCGCGACCACACGACTGTTATTCCTGGTTCAGATCACTAGCACTATCGATCAACTGGACTCCGCCGAAGGTAGCCGCCACATACACGCGGTTATCTTCACAGCGGACATAGTCCGGTTCTACCCCAGCTGCAACAAGCACCTGTTTCGCAGCGTCAACGCCTCTCAGCGCCCCCGCGGTAACCGCAGCAGAAAGAGCCAATGCTTTCTCTCGATCCTCCGCGGGGATCAGGGCGTTATGGGAGGAAATGGCTATTCCGTCTGGCATCCGCACCGTAGGCACCGAATGCACCTTGACCGGTATATGTAGGTCAGTAATCGCACGCTGCGTCTGCAGCAAGAATGCATAATCGTCCTCGCCGTAGATGACGTCTGTAGGGCCGACCAGGTTCAGCAGTGCTATCCGACGCGTCAGCCGAGGGGAGGCGTCGATAAGCGTGCGCTCTGGCTCGCTGTGATCAATGTAGACGGCATCGACTCGTTCTTCGGCGAAAGCCGGCTGGTTCTCACGAGCTACCACGACAACCACGGCCCCCGGGATTCGCTTCGCCGTCCGGAGTAACGCTACGTGTCCCGCGTGCACTTGCGTGCCAATAGAGACCAACACAACGGGCTTGCCTTGTTTCCTAAAAGCACGACTTACCATCGCAATATCGTCAAAAACCTGCGCCTGGCCAAATTGAAACATAATTCCCTTACAACAATTCCTCAGTAATATCGCCCAGCATGGGTTGAAGCAGTACCCGCGCTGAATCACGCATCTGTTCTAGCTGTTGGACCTGTACTAGTCCAGCGAGCAGCTCTGGCCATTTATCTTGCGATACATGAATAGCATGCGCTCCGATCTCTGCGAGCAAGAGCTCAGCCACGGTCGCAGCGACGTCGTTTGTGGCGGTAACAGCGTAAATATCCGGCTTTAATTCTTTAAGGACAAAGGACATACCGGATAATCGCCCCACGAGACCGTCCCAAGCGGGCTCATCAAATGCTAATTCCGCGTGTTCCACAGCACGGCGGTGCCCCACAGCAATCAGCCGAGTAACAAGCTCGTTGTTCCCGCTGATCGAAAGAACCGGCCTCACCGATTGCGCTTTAGCAAATCAGCGACGGAAACCGCGTCTGAGCGCTCTTCTCGTCGGCGCCGGCCATGCGGTTCAGGTTCGGTGGGGACCCAATCCACCTTGGCAAAGCTATTGGTCTCCGCAGCAACGGCATTTCTAAAGGCAGCAGCACGCAGCTCTTCTGCTTTTTCTTGATCGCTAGCCTGCTCGATTTCTGGGATCCGAGTGGCCTGTGCTTGGATCATCGTCGGCTCAACAAAAACCTGCCCCGTGAGGGTTTCCAGTTGAGTCCGCACTTGCGCCACTTGCTCTCTAATCTGCGCCAAGGTCTCTTGTTCCTCGACGCGTTGCTCTAGAGCTTCTGCCTGCGCACGGTAACGGCCCACCAGAAAGAATCCGAGCACGGCTGCCCAGAGCGCCGCAAGCAATGCAAGCTTGAGTACTCCCTGGTTACCAGTGAACAGCATGATTATGCTTGCCACCAAAGACAGCACGACAAGCGCAATGAGTAACTTCTGCCCGTTGTCCTTCATGCCATCCAAACTACCGCACTGCATCCACAGGCGGCACCGTACACGATTTTTCTAACCAATATCCTGCCGCAGACATCGCCACCCCACCGAGCGCAGAGGCAATCACGATAGGTAGATCTTGCGCAGAAATATGCGATGTGAGCACAAAGATTCCCAGGCTCACATAGCCCCCGCCGAAAATAGCTCCTGTCCATGCAGAGGCTTTGCCTATCACAAGGAACTGCGCGGCTGTCACCGGATTCAGTTGTGAGCGATCTTGTCCGATCTGGTCATCTTTAATACGTGTCCGCACCCGCACAGCTAAGAGCACGCACAACACCGTCATCCCCCACAGGCATATCGCCACCCCCAAAGGAACCGAGGGCAGCACGCCATAAAATCGCATCACCAAAATAAAGGACGCAAGAGCGCAGAAGAGGAAAACCCCTAACAGCCCGAGGACGGAGGTTTGTTTCATAAGGTCCCCACAATCGGGAAGGTTCCGGTGATCAACTCCGATAACGGACGCCCGTTTAGGGTTGCGTGAGGGTCCGCCTCAAGCCACGGAACCAGTACGAAATTGCGCTCGTGTGCATAGGGATGCGGCAGTGTCAGTTCGGGGTCATCTGATGTCACGCCCTCAATACACACAATATCTACGTCGAGGGTACGAGGGCCCCAATGCCGAATCCGCTTGCGCTCCGCAGCTTGTTCTAAAGCCTGACCGCGTTTGAGCAGTTCCATCGGTGTCTCATCCACATCCACGATGAGCACCGCGTTAAGGAACTCATCCTGGTCTTCCACGCCCCACGGCGGAGTGGCATAAATACTCGATGCAGCAACGAGGTCAGGTAAGAACTCCTCGTACGCCGACCTTAAGAGGGCATAGCGATCGTCCATGTTAGAACCGATGGATAAGACTGCTCTCATGTTATCGCCTTGCCACCACAGCTACGTCGGCAAATTCGAGCGGGATGGGCGCATGCGGTTTATGCACCGTCACTTCAATGCCCGAAAGCGGGAAAGTGCTCATGGCCGTATCGGCGATTTCCGATGCGACAGTCTCAATCAGATCGCGCGGCTCTCCAGTAAGGATTCCATATGCCAGCTGAGCCAGCTCGCCATAATGCACCGTCTGGGTCAGGTCATCCTGAGCGGGAAAGGTGAGCCAACAGGTGATATCCACAAGAAACTCCTGGCCCTCGCGCTTCTCGTGCTCAAAGACACCGTGATAGGCGTAGCCTTTAAGGCCTTTCAGTTCAATGCGATCCATGCTTCCTCCATTTCGCCGCAACATCCACAGCGGCACGGGATAGCGCCACGTTGTGCACCCGGACGCACCACACGCCTTGATGCGCCGCGATTGCTGTGACTGCGGCAGTCGCCGCGTCCACGTCTGAGCCTAGGCTAGCTAAAAAGCGCTTTCTCGACGCCCCCACGAGTACCGGATACTCTCTCATAAATTCTGGAAGAGCGTTCAATAAAGCCCAGTTGTCCTCTGCAGTTTTAGCAAACCCGAGGCCTGGGTCGATGACAATGGCTTTCGCGTCAATGCCAGCGGCAAGGGCGCTATCAACGAGTATGTCCAGGCCCTCATGAACTTCACGCACCACGTCGGTATGATGCGCGCCTGCGGCATCCCCAAAGGTCCCTGTTTTCCAGTGCATCAAGCACACGGGAAGCTGTGTATCAGCCATAACCGAGTACATGTTCTTATCTGCGAGGCCGCCGGAGACGTCGTTAAGCATGCTAACCCCGGCCTCGGCTGCTGCGACAGCGACCGAGGCACGCATGGTATCTACTGACGTCCTGATTCCTTCAGCATGCAGGGCTTTAATCACAGGAACGACGCGGGCAAGCTCAACCTGCTCGCTCACACGAGTGGCGCCTGGGCGGGTGGACTCGCCACCCACGTCGATGATGTCCGCACCTTCTGCCACGAGTTGCTTGGCGTGCGCCACCGCGGCATCGAGATCGAGGTATTTTCCGCCATCAGAAAAGGAATCGGTGGTCACGTTGACGATTCCCATTACTTGCGTCATGTCAGCTCCGAATAAGGCTCAAAACCTCCGCACGGGAGGCCGCGTTGGTTTGGAAACCTCCTCGAACCGCAGATGTGGTGGTGGTGGCTCCTGGCTTACGTATGCCGCGCATCGCCATGCACAGGTGCTCACATTCGATCACCACGATCACGGCCTGAGCTTCCAGTTTTTCTACCAGAGCATCGGCTACCTGAGAGGTTAATCGCTCCTGCACCTGGGGACGCTTGGCATACAGATCCACCAAACGCGCCAACTTAGACAGGCCCGTCACTTTCCCGGACTGTCCTGGAATGTATCCGATGTGCGCCGTGCCAAAGAAGGGCACCAGGTGGTGCTCGCACGTTGAGTAGATCGGGATGTCTCGAACCAGTACTAGCTCACGGTGGTCCTCGCTAAAAGTCTTGTTCAAGACCTCCGTAGGGTCCGTGTGCAGGCCAGCAAAGACCTCTGCATATGCCTTTGCCACACGCGCCGGTGTTTCTTGGAGTCCTTCACGCTCCGGGTCCTCGCCCACCGCAATAAGGAGTTCCCGGACTGCTGCTTCCGCGCGATCGCGATCAAACACGATCTGCCCTTTCGCTTTGACGACGCGCACGCTCCGCCTTGGAAGCGTCGAGAAGCGTGAACGGCTTCGGGGGTTCTTCGCCGCGCTCGATAGCCAACTCTGTGGGGGTCTTCACTGGCTCGCGTCCGGCTTGACGTGGGAAACGGGCGTCTTCATCCGGGAAGACATCCCCCACCTCACGCGGAACGATGCCGTGGAAGAGCTCCTCCAGATCGGGGCGACGCAGAGTCTCCTTCTCCAAGAGCTTCTCTGCCAAGCGATCCAAATAATCGCGGTACTCGGCAAGGATCATGTACGCCTCGGTGTGCGCACGATCGAGCAGATACTGCATCTCGCGGTCAATCTGCGCCGCGACCTCTGGCGAGTACTCCAGTACTCCGCCGCTGCCACCACGGGCAAAAGGATCACCTTGTTCTTCGCCGTATTTGACCATGCCTAGGGTCGGCGACATGCCGTACTCGGTAATCATGGCCTTGGCTATCTTGGTTGCCTGCTCAATATCTGCGGAGGCGCCGGTGGTGGGCTCACCAAACACCAGCTCCTCAGCCGCGCGTCCACCCATGGCAAAGACGAGGCGTGCGTACAGCTCATCGCGGTTGTACATGCCTTTGTCGTCTTCTTGCGCAGTCATGGCATGTCCACCGGTACGTCCACGCGCCAAGATGGTGACCTTGTACACCCGCTCAATGTTTTTCAGCGCCCAGGCGGCCAACGTGTGTCCACCCTCGTGGTATGCGGTGACCTTCTTTTCCTTCTCGGAAATCACCTTGGAGCGGCGCGGACCGCCGACTACTCGATCTGTGGCCTCTTCCAAAGCATCAGCGGTAATCACGGTCTTGCCAATACGCGCCGTGAGCAACGCAGCCTCGTTGAGTACGTTGGCGAGGTCTGCACCGGACATGCCGGCAGTACGACGAGCCAGTGACTCTATGTCTGCGTCTGGAGCAAAGGGCTTGCCCTTGGCATGGACGCGGAGGATTTGTTCGCGCCCCTTGAGGTCGGGGTTGCTCACGGGGATCTGGCGGTCAAAACGGCCAGGACGCAGCAAAGCCGGATCCAGAATGTCTGGTCGGTTAGTAGCAGCCATGATGATCACGCCTTCGCGGTCACCAAAGCCGTCCATTTCCACCAGCATCTGGTTAAGCGTCTGCTCGCGCTCGTCGTGTCCGCCGCCCATACCGGAGCCACGCTGCCGGCCCACGGCGTCGATCTCGTCGATAAAGATGATGCAGGGACTGTTCTCGCGGGCTTGTTTGAAAAGGTCACGCACGCGGGAAGCACCCACGCCAACGAACATCTCTACGAAGTCAGAACCAGAGATTGAGTAGAACGGCACACCGGCTTCTCCGGCGACAGCCCTAGCCAAGAGGGTCTTACCCGTACCGGGAGGACCGTACAGCAAAACACCACGTGGGATCTTTGCGCCCAGCTGTTCATACCTGGATGGGTCTTCAAGGAAGTCCTTGATCTCATGCAACTCATCCACGGCCTCTTCTGCACCAGCCACGTCTGCAAACGTGTTGGTGGGCATGTCTTTGTTCAGTTCCTTAGCGCGGGAACCACCAAAGCCGAACATGCTGTTTCCCTGCATACGGGAGAACATCCACATGATGAGACCAAAAACGATCAGCATCGGCAGTATGTATCCGAGCATGGAAACGAGGAAGTTCTCCTTATTCACGTTGGTGGAGTACTTCTCTGCCTCGGACTTCTCTACCTTGCTAAAGATCTCCGGGGACGTACGCGCGGGATACCGAGCCAGGATCTCAGAAACCTCGCGACCCTCGTGGTCGATGGCGTTCTTGAGCTTGATGCGGATGCGCTGTTCGCGATCGTCGATTTGGACTTCTTTGACGTTTTTCTTGTCCAACTGAGCGATAGCAACAGATGTGTCTACTTGCTTATAGCCACGGGCGTCGTTGCCCAGCAAAGTAAAGACATAGATGGCCATCAAGACGATGGCGGCAATCACCGAATACTGCAGGATTTTTTTGTTCACTTGTAAACCTTAGGATGCAGCGAAGCGACAAACGGCAAGTCGCGGTACTTCTCTGCGAAGTCTAGACCGTAACCGATCACAAATTCATTGGGAATATCAAAACCGATGTCAAGGCAGTCTATATCCGTCTTCACTGCCTCAGGTTTACGCAGCAACGCGACGATCTCCAAGGACTTTGGCTTACGCCCATTGAGGTTACGAATCAGCCATTTAAGTGTGAGGCCGGAATCAATGATGTCCTCGATAATCACTACGTTGCAGCCCTGGATGTTGCGGTCCAAATCCTTCAGAATCCGCACCATGCCAGAAGACGTGGCGGAGTTACCGTAGGAACTCACCGCCATGAATTCCATCTGGCAGGGGATGTCGAGAGCCCGGGCAAAGTCGGTGAGGAAGAATACCGCACCCTTGAGCACACAGATCAAGACGAGATCGTCCTCTTCGTCGCGGTACTTCTCTGACACCAGCGCCGCCATCTCCTTGATGCGGGTCTGGAGTTCTTCTTCGCTCACGAGGACGGCCTCAACGTCAGCTCCGTAGGAGTTGACGGGCACTTGATAATCCTTCTTGTCGTGCAT
Protein sequences of DBSCAN-SWA_1 >LS483402|154805:163269|160684_162691_-|SQG55935.1|protease|DBSCAN-SWA MNKKILQYSVIAAIVLMAIYVFTLLGNDARGYKQVDTSVAIAQLDKKNVKEVQIDDREQRIRIKLKNAIDHEGREVSEILARYPARTSPEIFSKVEKSEAEKYSTNVNKENFLVSMLGYILPMLIVFGLIMWMFSRMQGNSMFGFGGSRAKELNKDMPTNTFADVAGAEEAVDELHEIKDFLEDPSRYEQLGAKIPRGVLLYGPPGTGKTLLARAVAGEAGVPFYSISGSDFVEMFVGVGASRVRDLFKQARENSPCIIFIDEIDAVGRQRGSGMGGGHDEREQTLNQMLVEMDGFGDREGVIIMAATNRPDILDPALLRPGRFDRQIPVSNPDLKGREQILRVHAKGKPFAPDADIESLARRTAGMSGADLANVLNEAALLTARIGKTVITADALEEATDRVVGGPRRSKVISEKEKKVTAYHEGGHTLAAWALKNIERVYKVTILARGRTGGHAMTAQEDDKGMYNRDELYARLVFAMGGRAAEELVFGEPTTGASADIEQATKIAKAMITEYGMSPTLGMVKYGEEQGDPFARGGSGGVLEYSPEVAAQIDREMQYLLDRAHTEAYMILAEYRDYLDRLAEKLLEKETLRRPDLEELFHGIVPREVGDVFPDEDARFPRQAGREPVKTPTELAIERGEEPPKPFTLLDASKAERARRQSERADRV >LS483402|154805:163269|159016_159358_-|SQG55928.1|DBSCAN-SWA MDRIELKGLKGYAYHGVFEHEKREGQEFLVDITCWLTFPAQDDLTQTVHYGELAQLAYGILTGEPRDLIETVASEIADTAMSTFPLSGIEVTVHKPHAPIPLEFADVAVVARR >LS483402|154805:163269|156531_157200_-|SQG55918.1|DBSCAN-SWA MFQFGQAQVFDDIAMVSRAFRKQGKPVVLVSIGTQVHAGHVALLRTAKRIPGAVVVVVARENQPAFAEERVDAVYIDHSEPERTLIDASPRLTRRIALLNLVGPTDVIYGEDDYAFLLQTQRAITDLHIPVKVHSVPTVRMPDGIAISSHNALIPAEDREKALALSAAVTAGALRGVDAAKQVLVAAGVEPDYVRCEDNRVYVAATFGGVQLIDSASDLNQE >LS483402|154805:163269|157207_157594_-|SQG55920.1|DBSCAN-SWA MRPVLSISGNNELVTRLIAVGHRRAVEHAELAFDEPAWDGLVGRLSGMSFVLKELKPDIYAVTATNDVAATVAELLLAEIGAHAIHVSQDKWPELLAGLVQVQQLEQMRDSARVLLQPMLGDITEELL >LS483402|154805:163269|157590_158106_-|SQG55922.1|DBSCAN-SWA MKDNGQKLLIALVVLSLVASIIMLFTGNQGVLKLALLAALWAAVLGFFLVGRYRAQAEALEQRVEEQETLAQIREQVAQVRTQLETLTGQVFVEPTMIQAQATRIPEIEQASDQEKAEELRAAAFRNAVAAETNSFAKVDWVPTEPEPHGRRRREERSDAVSVADLLKRNR >LS483402|154805:163269|160128_160722_-|SQG55932.1|DBSCAN-SWA MRVVKAKGQIVFDRDRAEAAVRELLIAVGEDPEREGLQETPARVAKAYAEVFAGLHTDPTEVLNKTFSEDHRELVLVRDIPIYSTCEHHLVPFFGTAHIGYIPGQSGKVTGLSKLARLVDLYAKRPQVQERLTSQVADALVEKLEAQAVIVVIECEHLCMAMRGIRKPGATTTTSAVRGGFQTNAASRAEVLSLIRS >LS483402|154805:163269|154805_156554_+|SQG55916.1|holin|DBSCAN-SWA MFSHDTAGETVEDRHRDVVIVGGGSAGSVLANRLSEDGSSVMVLEAGRSDSLWDLFIHMPAAFSFPIGNKYYDWAYESEPEPEMNGRRIYHARGKVLGGSSSVNGMIFQRGNPMDYEKWGKLPGMQNWDYAHVLPYFNKMETALAADPDDPRRGHDGPLKLTRGPATNPLFQAFFRSVQEAGYKLTNDVNGYRQEGFAPFDRNIFKGKRLSAARAYLHPVKSRKNLDVRTRAFTTRILFTGDKATGVEYEWKGKTRRVHADKVILCGGAFNTPQLLQVSGIGDREVLEKAGVEVRKHLPGVGANLQDHLEVYVQYNCTQPVSSQPYYKMINRPKIGLQWLLTKKGPVASSHFEAGGFARSNDDEDYPNLMFHFLPMAIRYDGSQPEGEHGFQFHVGPMYSDTLGHVHITSPDPKEKPEIIFNYLATDQDRREWVEAVRTSRKLLDTPAMKEFTDGEISPGPQVQTDEEILEWVRNDAETALHPSCTAKMGSADDEYAVVDPDTMQVHGVEGLYVCDASVMPIITNGNIYAPVVMMAEKAADLIKGMEPLAPIHEPFYQAKKDMPLYVEEPRDHTTVIPGSDH >LS483402|154805:163269|162687_163269_-|SQG55937.1|DBSCAN-SWA MHDKKDYQVPVNSYGADVEAVLVSEEELQTRIKEMAALVSEKYRDEEDDLVLICVLKGAVFFLTDFARALDIPCQMEFMAVSSYGNSATSSGMVRILKDLDRNIQGCNVVIIEDIIDSGLTLKWLIRNLNGRKPKSLEIVALLRKPEAVKTDIDCLDIGFDIPNEFVIGYGLDFAEKYRDLPFVASLHPKVYK >LS483402|154805:163269|158116_158563_-|SQG55924.1|DBSCAN-SWA MKQTSVLGLLGVFLFCALASFILVMRFYGVLPSVPLGVAICLWGMTVLCVLLAVRVRTRIKDDQIGQDRSQLNPVTAAQFLVIGKASAWTGAIFGGGYVSLGIFVLTSHISAQDLPIVIASALGGVAMSAAGYWLEKSCTVPPVDAVR >LS483402|154805:163269|159344_160127_-|SQG55930.1|DBSCAN-SWA MTQVMGIVNVTTDSFSDGGKYLDLDAAVAHAKQLVAEGADIIDVGGESTRPGATRVSEQVELARVVPVIKALHAEGIRTSVDTMRASVAVAAAEAGVSMLNDVSGGLADKNMYSVMADTQLPVCLMHWKTGTFGDAAGAHHTDVVREVHEGLDILVDSALAAGIDAKAIVIDPGLGFAKTAEDNWALLNALPEFMREYPVLVGASRKRFLASLGSDVDAATAAVTAIAAHQGVWCVRVHNVALSRAAVDVAAKWRKHGSH >LS483402|154805:163269|158559_159015_-|SQG55926.1|DBSCAN-SWA MRAVLSIGSNMDDRYALLRSAYEEFLPDLVAASSIYATPPWGVEDQDEFLNAVLIVDVDETPMELLKRGQALEQAAERKRIRHWGPRTLDVDIVCIEGVTSDDPELTLPHPYAHERNFVLVPWLEADPHATLNGRPLSELITGTFPIVGTL |
11 | Pandoravirus(33.33%) | protease,holin | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1821504 : 1843481
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >LS483402|1821504:1843481|DBSCAN-SWA ATTAATCTTCAATCAGGCTCTTAATCGCTTTCAGATCAGGAGCCGTAGGCGGCGGAGGCGGCGAACCACCATCGGCCCACCGGATCAAAGCATCCAAATGATTCATCGCCGTAATCAACCCCAGCCGCAAACGATGCGCCAAAGTTTCTGCATCAGAACGACGAGACCGCTCAGCATCCAAATCAACCTGAAGCTTGTCAAGCCTCGATTCCATGATGTCGATCCGACTTCCCTTAGCAGCCTGTGTATCAGACCGGGCTTTCATATACCCCATCCACACAGACCCCACGGCAGTCACAAGACCCCCAAACGCGGCAAGCATTGCTGCAATAACACTCATACTCGCCCCCGATCTTCAGCCAACGGAATATCCCCGCGTTTGCCACGCCACACAGCCCACATCAGCGAAACCGTCATGGACAAGATAAGCACACCAACCAACCACAAGACTGCTTGATCATGCTCTACTGCCGCTGCTAAAAGGCTAAAAGTCCAGGAGAGGTTAAGGCCAGTTGCTGCAGCCAACGCCACAACATCGATGTGCCCCCGGTCCGTGAAAGCAGCAACCAGGCACAACAGCCCAACGGCTATATGCACCCAGGCCCACGCTGTGATGGGCATGATCGACTCAACGGGGCGATGCCACTGCAAAGGATCACCCAAAACACCTGGGAGATAATAAAAGCCCACGGCTATTTGCACGATGAACAGGAGCAAAAGGGCCGTGGAATCGGTCATCAAAAAAGCCCGCACACGGCGGGCAGTCGGTTGGACCGAGGTGGGTAATCTTTCAATAGGCATGGGTATCACCCCTGCGTGCTTGTGGAAAGCTGCTCACGCACAGCCTCCAAATCAAAAACCGGGGTCACCGGGGCCACAGCGGTAAGTCGCTTCCCCATCGATGGTGTGATAGCCCCAGGGGTTCCGGCGTGGATGATGATCTGCGCGATGCCGATCACCGCAGCAATGAGGAAGCTCACCCACATTGGGGCACCAACTGCCATGCCGGTGGCGAAATTGCCCACTTGCAAAACCCAGCCAGCAACCGCTGCCAGTGTATCCTTGCGGCGCATAAACCACGGCTGCGTTGCCAAAGCGGCCGCAACCGCGTCCCGAATATCATTCTTGTTACTCATTATTTTGCCTTTCGTTCGTTGTTGATGGCCTCATCAAGAAGCCGATCAGGATCAAGACCCTGCTGCTTCGCCATCGCCTTCACCAGCACAAGCAGCTGCCAGCAGGTGGCATCCACAATGCTGATCAGCGTGGACTGCGCGAAACTCTTTGCGGGATTGATCAGAGATGCGATCATGGGGATTTCTCCTTCTTTCTTTGCCCCCGCCGTAGCAAAGGGCGGGTTGTCGATGTAGTGCTGCACTTTTTTGCGGAAATCGTTCATGTCGATTCCGCCTGGGTCCCACTTCCCCTGTGCAGCGCCGGAATATTCTTTGTGCCCGAGCAGGGTTTGCGTGGTCGCGGATTTCCCGAGATACCACAAAATCGCAGCACAGCAGCGGTAGTAGGCATCCAGCTGGGCCGGTGTCCACGGGCTAGTGCCATCGCTAGCCGCTTCAATGCCGATTGAAACACGATTGGCATCGTTGGTTGGCCAGCCTGGGTAGGAGCCTCTACCTGCGTGATAAGCAATGCCAGCGCCCGTGATAACGGCGGTACCATCACGGTTGAGGTGGATTTGGGAACATAGCCCCAACTGTGGATGTTGTGCGATGTATCCAGGGATGTCTTTATTAGTACCCGTGTGGTGAGCCATAACCCCTTGGATTATGTGGAAGTCGCCGTGTCCACGGTTTCTCCAGCCCTGCCATTCTTGGACTTTGACCCCGAACGCTCGGAGGACATCCGGTAAGAATAACGGGTCACCTCTGTGCGAGGGATTGGGTTTCATCATTTTCTCCTTTTGGGTATAAAAAAATGCACCCTCATGGGGTGCTTTTATTCAGTGTCTAGCTTCAACCACAGCAAAAGATATTCTTTCCCAGAGTTTCTGCCATAGATGCGGATATCCGTTCTCGAAGCGTAGTTGAGGTCGTAAAGAAAGTAAGTCCTTCGTTCGTCCCCAGCAGAAGTCATCAAAGCAAGATAAGGTGTCTCGCCTTCCTTTGTGGGTAGTGCCACACAAACCACGTGTGTCGCTTCACTCTGCGATACTCTTTCAGGGATTTTCCCGGGAATCGTCACCGCCACCACTCTGCTTGCTCCTGCAAGCGTCCCAATCAGTGAACCAAGTTCCATGTTTTATGCTCCTTTAACGTAATAGAGTTTTCCAGTTTTTGTGTCCAAAACGATGTCTTTTGCGCGAGCCTCAGTTGGCGCCTTCCAGGCCCCCTGCCCATCCCATATCCACAAAGCCGGGCGATTTTCCGAGCGCTTCACGATAGCCAGCAGATCTTCAACCTTGGAAAATCCCTCAGCTGCGTTTTCTTTCAACCACGCATAGCGTTCCTCCCAATCTGCCAACTGCTTAGCACGTGCTTCCCACTCTTGGAGCTGCTGGGTTACCTGATCCCGTGCATCTGAAGCTTGGCCAGCCAAGGTTGCTGCAGCACTCTGTGAATCTGAAGCGCTCTTCTTAGCAGCTTCGGCTCCCTCCTGTGCTTGAGTAGCAGCCCTCTGCGCAGCCTGGGCCTGGTCGAGTAGAGACGCGGCCGTGTCTTTCACAGCGGCTGTTGCTTTTTGGACGGTCGCATCCACAGACTCTTTAAGAAGCCGGCTTGCGTCAGCTAGTGTTTTTTCTGCCACAATCGTCTGTTCTCGCACCCTGCGCGCCTCACCAAGGTTATCGGCGGTTTCCTTAGCTAGCTCAGCCAAATGATCATGAACATCTGGAGAAAAAGCCTCAGCATCCTCGACGGCTTGGCGCATCGTCACGGTGTTTTCTCGCACCAAAATCGGAATGGACTCACGCCCTATGCCTCCCGGGCCAATCAAGATTAAGACACCAGCCCCAAGTGCAAGATCGACGGTGAATTCACCGGCGTCATCCACCAAGACACGCACTGGCTCATCCACGATAAGTCCCTCAGCCCCTGGCCGGGTGGAAGGCGGTCTGATCCACACCTCCCGAATGGTCGATGGTTTCATCGTAATGTTTTTAAGTTTTCCTTTAATTTTCACACCACTCATATTTTTCTTTCCTTATACAGCGTTTTCATTAGGTAAGGTCTCGCTTTGCTCTTTCCTCCGCTAGGATCGCTGAAAAATCCGCAGTTGGAATCACCAGAATGGATGCCTCAGCGTAGTTTTCTGCCCACCCAACCGCCCCAGACGCTTTCGGTCGAGACGCCGTTAATCTAATCGGGAAGCTGTGCGTGTACTGCGAAATCGCATTCACGCGAGCATTCACCAACACCGATGCACCAGAAACAAACTTCTGCCCTGGTAGAAGCTCAAAATCAATTTTGTTCCGAGCAGGATAAGTAATCTTCACCGGCCCCAATATCAGCGCGTCATAGTCACTGATCGACCCAGCAAGCGGTTCCTGAAGATCCCCGAGCCTACGCAAAGCTGCATTCTGAGCATCAAGCTCTTTGATCTGCCGGGTTTGGAGAGCATCAAGCTCATCTTTTTGCTGCCGCCACTTTTTTATGCCCTCATCCGCGGTCTCCTGCGCAGCCACCGCCTTACCATCAGCCTGCGCTGCAACCTCACCAGCCGCCGCAGCCTGCCTCGACACCACACCAATCTTTGATGCGGCTTCCCGACGCTCCGCCGCGATCTGCGCAAAAATCTTATCCCGCGAAGACCGAAGCGCCGCAGCATCAGCAATCGGCTGACCTCCAACATGGACACGCCACCGATTCACCGATCCCCGCTCCGACGCAATGTCAATCGTCGTCACAGGAAGATCAGGAAGCCGCTTTCCCCATAGCATCACATCGACGAGATCATCTTCGTGAAAGTCCACACCTGGCTCATAAATACCTAGACCATCAGTAGTGATGTCTTTTTCGAAAAACAGGTTGCCCTCGATGCGCTTCTCGGCAGCATCGAGCACCTGTTCTGTATCCGAATTGGACAAAGAATAATCAATCTCACCGGATTTTTTCACGACCATCGTCTGCGATGAATCAGCGCGGACACCACGTAGGTCAAAACGCCCCTTCGGCAGGTCCTCCGGCTTGTAGAGAAAACCCTCTGCCATGCGAGTATCCGGCTGGGATTTTAATGAAACATCACTCGGCAACGTCACATCCCAACGCCCCCACACAAAGGAGCCAACCTGACGGCCCACAGTGATCTCACCACCATCAGCACGCAAAATAACATCACTCATTGCTGCAACACCTCCACTACTACAGTCGGCTTAGAAAGACGCAGCCCCTCAGGCTGCGGATCCTCCGGCAACCACATAAAACACCTAACCACACAACCGGCCATCGCCGCGGGGGCAGAAATCTCCTCCCAGATACTGCGATCCTCCGGCCTGATAATCAGCTCAGGCGACGGCTTCCCCGTGCTGGTATCGGCCACCTGAATCGGACGATCCGATACCCCGAAAGCCTTATACGTTGCATGTAAAGACTCCTTGATCAGCTTCCTGATCGTCACATCCGCAGGGCCTTGCACACTGAAACCATCAGCAACCGCAGCCATCTTCAACTTCGCTAAATACCGAGGCTTCGAAAACTGAGAGCCGAAATCACCCACAGCACGAGTAAAAGACCCACCAACCTGGCCAGGAATCGACCAACACGGCATAAACCCCAGCTCAGTCAACATATCTGTCCCATGCACCTGCAAAACCCGCGGCGCAATAGGATCACTAGACGACGCGACCACAAACATCACGCGAAAGACTTTCCGCAGACCGTTACGCTCCACAGCAATAAACCGCGTCGCCGCATTCGCAGGCACAAGACGACCCTCCGCATCCACCTTCCCAAGCCCATCAGCCACAAGATCGTCCACACACATGTGCACAACCCCCTGCGGTGATGTGATCTCAAAATCGCCTTGAAAAGACTCAGGGTTCATCCGCGTAGTCGGCGCATGAATCCTGATATATGGTGGCATGTCACAAATCGGAACCCCATCCTCCGACAGCAGCCCGACCCATTCGCCACGATCCTCAATTACCTGACGGCGATGCTTAGCGTGTGCGACCCAATCAAATCCCATTAGAAATCACCTCCACATGCTCAAAAACCCGACCTGCCACAAAAGCTCCGCCCCATCAGGGACAGAAAAAACCCGAGACGCGCCGGGTGGTATGTTCTCGGGTAATGCTTGTCCTCGGACCTTGATCCATGTCTCACGATCCAATTCCCCTTGGTCGTCGACGACTGCGAGGCACTGGGCGATCCCGAGGAGAAGGTTTCGGGGCTGGGACACGGCAGGGAGGGTGAAGGTCGCTCCGGATGGTAGTTTTATCTTCCCTCCGGCCCCCTTCCACCTGATTTTTGGGGATACTGGAACATCGCCGTTATTTGTCACTGTGACGTTATTATTTCCGTGCAGCCAATTCGACCACCACGCCCCATCATCAATCACCACCGAGGCCTCAACGTCCAAAGATGAGACCTCGGCGGGATCACATGCGGGGGATCCGATCTCTGCTGCGAGTCTCACCGCCCCGAACCGAGGCCCCCGCAGCTTCGAAGAAACTTTCAAAACCGCAGGGACATGCGTCGACAGACTCCCCATAAATTCATCGAAGATCTTGGTGCAATCCCCTGACATGTCATAGATAGTCAGAGACAGGGTCCCCTCAAGGGGGCCATGCTCAGCACCAAAAAAGCGCTGCCCCACAGCCCCCACAGCCTGCGACACCAACGGCGACGGAGTCCCCACCAAACCGCTGATACCACCCTCGCGGAGAAACACCCCGCGATCCTTTTTCTCAGTCAGATTCCACTCGACCCCAGAAGGGGTGAGAAGCGTGATCTTCTTAACCTCCGGCTCCGGAAGATTAACCTTCAACAAAATACTCACCCCTTCACTAAACGGTCACGCCCTGACGGGCGCGTGAATAATCCTGAGCCGACATACCATCGCGCCGCACATTGAACTCCACATCAGACAAGCGACGATCCATCTCATCAACGAATTTCGACAGGCGAGAATCCGCAGGTTCACCACGCAAATCAATGTTGAATACGCGTTTACCAGCACCATCACCGCCACCGAGAATCTGGTTCGTGCGCCGCGCCTCAGACACCAAAGCGTCACGAGCGCGAGCAGCCTCCGCCGCCGCCCCCACTAGTGCGGAACGCTGACGAGAACCTTCCTCAGCCTCCACACGCTGGCGGTTAATATCAGCGACATTAAGCTCGGCGTTGAGAGCCTGCTTCCCGATCGCATGCCGAGCATCCAACGCCGCTCGCTCCGCAGCGAGACGCGCCGAATCCATCGTGTGCTGTGCCGTCATACGCTGCATCTCAGCCTGATGACGACGCTCCGCAGCCTCTTGGAAGTGCTTTTGGTAGTCCATCATCGACCCCATGGTCGCACCAGTAAACGCACTGCCAAGCTGATTACCAGCCTCAACCCACTGCGAATTACCCGTGGCAACAGTGCCACCGATGCCTACCACACCACCGAGCACGCCACCTGCGATACCAAGGCCTAGACCAGCGCGAGCCCCAGTTGTGGAGTTCCTCCACGCTTCCTTGAAATCACCGTGATTTGCTTTCGCTGCTCCGATACCACCGACGATATCTTTAATGCCGCCTATGGCACCTGCGCCAGCAGTGACAGCCCCGAGGATATTCCCCGTCGCAAGCGATGCAGCACCAGTGAAAAGCCCACCTATAAATTTGGCGATACCACCAAAAATTCCACCAATACCACCGAGGAATCCCTGCGCCCCACGGGCCTGATTCGAGGTCATGCCGTAGAGACGCTGGGTTTCATCCTGCAGCCGCAGCGTCATCAGACGCAGCGTGTCCGCAGCCTGAGCTTGCGCCATCGTCGCCTCAGCCACCTGCAGCCGTGCTACATCAGCAGCATATTCGGCATCGAGGTTATCGAGATTCGCCTTAGCGCGAGCTGCCTGCAAAGCCCACTCCGCTGCCTCCACCTCACGGGTATTAGCGATGAACGTCGCGCCGAGCGCCTCCATCGACTGCGTACCCGTGATACGGAATCGGTCGACAGCCCGGCCAAGGCCACCAACCCCAGTCGTAGTCAGCGTCAGATACCCGCGCTGCGCCTTCGCAAGATCATCCTCAGCCTTCGCCACAGCAATAAGCCCCTGCATACGCACGCGACTCATATCCCATTCTGCGACGCGCAGCTCATTCGCTGCCTTCACCCCAGAGATACGCAGCATGACCTGCTGCTGCTGTAGCTTACTAATCTCTTCGCGGGTCTTTTCGATCTGCTCCGCGAAGCGTTCAATCTCACCGAAAAAAGCCGAGACCTGCTTAAACGACTCCGCAATGCCGTTAAGGAACTTGGTTGCTACCTCGCCTGTGGCGGTGATTCGAGCCGCCACCACAGTACGCTCAGCAGCCTCCAATTGCAGCGCTGCATTAGCTGACTTCTCACGGATCTCATTGAGCTTCTTGAGCGCGTCAGCTTCCTTCGTAGAGTCACCAGACTTCTTAGCTTCTGCGTATTCACGCTCCGTATCAGTAATCGCAGTGGCCTGCTCAGCGAGCTGCTTGCGGGTATCGGCAAGGCCCTTTTCGGCATCCCACACCAAATCGGTATTGGACAGGAAGCCGCCAAGAGCCTGCCCGAGAGGCTTCACAGCATTAGCTGCGATGGTCTTCCACGCTGGGATAAGCTGCAACAGTGCCGTGCCGATCTTGTCCAGACCACTGAACTCGGGCTTTGCCAGAACTCGCTCCGGAGTACCCGAAAGGTTCACCGCTACACCGCCGTGTGGGATCCAGCCGCCAGCATCGTAGAGTCCGAGAGACTTCTTCGAAGCCTCCCACAGCTGCTGCGCGGCACCCCACGACACCTGAGTACCGTTGCCCATCCTTGCACCATCCACCGAGGTAGACACTGCTTCCAACATGCTAGAACCGGAATTGTCAGCCAACGGCCTATGATAGATTTCCGTGTACTGAGAATGACGTGCCCCAGCAGCACCGCCGCCGATCTGCCCGTTACCACGAGCACCACCCATCTCCACATTGACCACGGAGCCATCAGCACCAAAAATACTGCCCGAGGTGTGCCCACCACCGGGGCCACCATTCAACACCCCAATCTCATAGGCATTTTTGCCACTAGACCGGCCGCGCTTGAATCCCACCGAGATAAGCTGCGGACCCTCATTCACCGTAGAAAACAGTCGTGATGTTGTCGCAGGACGACCCGCCGAGAACAGGGCACCTTGGCTCTGTGCACCGGAGCAATCACCCCAGTTCGTGCCACCCCAGTCATAGCGTGCCCCCTCAAGCGAACGCGCTGCTTTCTGGCCATTCACTAACTCGCCACGGAAGAAACGCAGCAGGTCAGAAGCGCCAACCTGCCCGCCGTCTTTATACCGCGGCAGACCGAGGTCACCCTGCCTGCCATCAACACGACCAGCATTGATAGCAAGCAGAAGCCCAAGGTTCTTCGCGGTAGCCTCGCGGTTCACCACGAATTCGCCGGGCTCGACCTTAGCGACAGGGATGCCACTGGCAGTCACACCAAGGATCGGATCGCGGACACTGTCGGGGATACCCGGCACACGCGGCAGAACACCACCGTCTTTGAATCCCCGTATTTGCCCACCGCCGCTGTGGCCGTTGATGCGACCCGAGGCAGAAGCCGTAGTACCAGTGAAGAACTCAGAGATTTCGCTCTTCTTAGTGCTGAACCACGATCGAACTGCCTCCCATTTTTCTTTCAGACCATTCCACAGACCATCGATAATGTTGCGACCAGTGTTTTTGAGCCATTCACCAGCATTAGCAAAGGTCTCTTTGATCTTGCCAGGCATCTCTTTGACGGTGGCAATGACGCGGTCTTTCATCTCCACGGCTTTAGCGACTGTGTTCGTTACCCAGCTGGCGGCGGTTGCAGCGACTCGGGTTGCCATGTCTGTGAAGAAGCCCTTGATGGCCTCGATAAACTCGCCGGTCTTCGCCTTGAGTGTCTCCCACGCCTCGGAAGTGCGCTGCTTGGTCCCCTCGAGCCACTCGGAGAAGATCGTTTTAATCCACTCCATGCCCTCGGAGAAGTGCCCCTTGATGGTCTCCCAGCCGGCGCGCAGAGTCTCAGGAATCTGAGACCAGTTTCCCGTGACCAGGTCAACAATCACCAGCCATGCGGTAGCAAAAATATCTTTGATGATCTTCCAACCTGTGGAGAACATCGTCTTGATAATCTCCCAGCCGGCGGTGAAATTAGCTTTTAGCTGCTCCCACCACGTTGTGATTATCACGACAAGACCGTCGAGCATTTCTTTGAAACCGGCCGCGAGATCGCTTCCCATGAGCTTGTCGAAGAGCTCTTTAGCTCCCTCAACAATCTTCGAGAAGGCTTCCTTGATCCACTCCGCGCCTGCTTTCACCGCATCGACGAAACCGGACCAGATTTTTTGCCCGATCTCGGTCTTCGTGAAAAACGCCCACATGATGCCGATCGCAGCACTCACCCCGAGGACGACCGCGCCAACGGGGCCGAGTGATGCGAGCCAAGCAGCGGCGATCTTCGCGGCGGACACGGTAGCCTTCACAGAGGTCTTGATCCACGCGGAGCCAACTTTATGCAAGGCTTTTACATTCGCAATGGAACCGGCAATAGCCTCTTTTTTCGTTGTCAGCCACGCGGCAGTGTGTTTAGCAGCCCCGATGATAGCTTTCCTACCCGAGGCCACCCACTGCACGCCAAGACCAATAATCAGTGGCACCACGATAGGTGATACTGCCGTTGCAAAAGCGATCAGCTTGCCTTTATTGTCTTCGACCCAATGCGCCGCGTCTTGCAGTTTAGCTTTGAATGATTCGAATGCGGGACCGATGTTCTCCACTCCGTGGATCATTAGCTCGATCCCGCCCTTAGTGAAATCAGTGATCGCGGTCATCGCAGGCTCCAAAAACTCTTTCATCTGGTCTTTGAAGCGACCCCACTTTTGCCCAGTGGTCTCCAGCTCACCAGCAAGACCATTGATCGTATCGGTTGTGACACCGATATTGTTTTGGAGGTCGTCGACTGATAAAGCGCCAGTTTTGACAGCTTCTACGAACTTTCCTGCACCTTTTGTGCCAAAAAGCTTCGCAGACATGTCAATAGCGGCCGCATCATCGCCCTTCGCAATGAACTCTTCGACCGCACGAGCAGTGTTGAAAAGCTCAGTCTGAGGGTCTTTGCCGGCCTTCGCGAAAGACACCATAGCTTTCGACAAGGATCCGACCGTAGCGTCCGCATCAAGACCAGCCTTATCGAGCTTACCGATAAGAGCCGCGGATTCTCCCATATCGAATCCGAATTGCTGTAACTGGGGGCCACCTTTTGCGGCACTGGTAGCAAGGTCATCGATTGAGACACCGGTTGCTTGCGAGACGCGGAAGAGCTCATTCATTGCCTCTGGCATTTCCTCAGCTTGTAGCCCGAATGCGTTTAGCGCTGTTGATACGGCGGTGACATCAGCGTCGAATCCTAGGTGCTTGAGGTTTTGAAATTGACGTGTGAGCTCTTCAAGGGGCTTTCCTGTCACACCAAGGCGGGTGTTCAGGTCAGCGAGCGTCGTACCAATCTCGGCGAGTCCCCCCTCTGCAGACACGGTGCCAGACACGTTCCGCAGGGACTGCTTGAGGTCCTCGAAAGCCTCACCGCTAGCGCCCGTACCTGCGCGAATCGTATTAAACGTGGTTTCGAACTCGCGTCCCGCGTCGATGATCGTTGTCTTGATCGCGGCGAATCCAGCGAACCCAGCAACCGCCCCCACAGCAAGCGCAGGCATAGACTTGAGTTTGCCCATCACACTTGAAAGAGCCTCGTTGCCTTTGCCACGGAATTTCTGCATGCCATGAGTCATCTTCCCCGTTGACTCCGTTGCAGACTCCTGTGCTTTCTCCAGCTGCTTTGTGCGAGCGGTGATGTTTTCGGTTGCCTCAGCAACCTTAGTTTGCGCCTTAGCCAAGATCTTCTCAGAGTCCGCGACCTTATCCGTCGCCGCGTTCAGCTTGGCACGAGCGGCACTAAGTTTCGTATCCTTTGCAGCCACCTGCGCTGCAGCAGCATCACGAGCTGCCTGCACACGCTTCTCCCCGGCCTCAAGCTCAGCAGCAGATGCCTTGCCCGAATCGCGGAGCTTTTGGAGCGCGACCTCCTCCTTCTGGATACTGGCCGCGCCCTTGGTGCGTGCTGCTTCGAGCTCAAGCTCAGCAGTTTTAACTGCCTTCGTCTTGGTCTCGACATCCTCTTTGGCTTTAGCCACCGCCTTTTCTGCATCGACTGCAGCCTTGGAGGCTTTTTCGTGGAAAGCCTGCGCCGCTTTCAGGGACTTATTCGCTTGCTCAATCCCACCGCCGATCCCCTTTTTCAGGTTTTCACCGGCTTCCTTCGACAGCTTCGCCAGTGGGGCGTTGAGTTCCGTCTCAAACTGGCGCTTCAGCCCGCGAAGCGATGGGGTGATCGGTAATGAAGCGTGCCCGATTGCGGACATGGCAACCTCCTTTAGTTAATCCCGAGCCGCTTCTTACGTGCGCGCTCAGCAGCCAAAACGGCCTTCTTTTTCGCCTCGAACTCCGCCTGCTTACGTGCTTCCTCGCGCCGATTCCACATCGGATGGACCTCCCCAACCAACGCAAGATAGAAGATCATGAGGACATAGTCAGTGGTGCTAAACCGATCGGAATCGCTTATCTCAGCCCAGAACCGTGATGAATTCCGGTCCAATCCATCGACCAACAGCAATAGCCTCCGCGTCGACATATGAGAAGGGCCGCCACCTTCGCGGTAGCGGTCCCTGTAATCCCATCCACGGTCAGCAAAATCCAACTCGACTAGGTCCTCATGCTCACGAATCAGCGGGAGGAGGCCTAATCTTCCCCCAAGCCAGTAGCATCAGACCAAGCGCTCATGATGGTGCCACCGAACTCGGAAGAGCGTGCACCGGCGTTACGAAGCTTCTGCATCTGAGCAGGCCCCAACATGATCGCAAAGGCCTTCATGTACTTGTCATCCTCGAATGCGAGACCGGCATCCATCGGTGCGTCATCGATAGATGCCGGGGCGGTCAGTTCAACGTCCTTGCCGCGAATGTTGACAGTGAAGGTAACGGTTTCGATTTCCTTGACTTCATTCTTGGCTGGGCTGATTTTTTCGGTAGTCATGGCAGACCTCCTATTGAGATATGTGTGAAAAGTTTTTTGGCGGATCTATGGATATAACCCCGCGCCGATTCGCCAGAAACGCGGGGCTGTGGGAAGCAGGGGTTACTTCACAGTGATGTCGGTGGAATCGCCACCAGACAGGGACGTACCAGTGCCGGTCAACTTCCCCTTCACACCGGTGATGGTGTAGGGGCCACCAGCAGAACCAGTTACCTTCGCGGTGCTGACGTTCGAGAGCTTGTTCAACGCGGAGGCGACAGTCGTGCCAATCGCATTGTGCCCAATAGGACCAGTCGTCTGATCATCAACGGTCAGAGTCCATGTGCCGCCGGATACGGATTCTGGGAGGGTTACAGTCTTAGACTGAGCCTCATCTGTAGCATCCTCGTCTGCTTCCTCGAGGGACTTGACGAAGCGGATCGGCTTCATATCACCAGCAGCAGTCTTAGGCTCATCTATAGTGATCTCATCGAAAGCGTCCTTGAGCTTGCCGGTCTGGAAATCGAACTCAATATCCTTCCCCTCAGGGTCCTCCCCAAAGGACAGGTTCGAAATCGTCGCGATCGCCTTATGACGGGTGACGGTGATCTTTTGCTTGCCGTTCTGAGTGGTGCCGATCATCGCAACGTGGCAAAGGGCGACCTTGCCGGAGTGACGGCGGACAACACCGTTATCTTCGCCTTCCTCCGGAACGACAGTGTCCGGCCATGCGATCTTGTCCACAGTCTTATTGCGCTCGAGCACCGAGGCCTTGCCGGTGAGCTCGCCGGGCTTCGTGGAAACAGCGATCACGCCGAAGCCCCAGCCCTTGGTCTTGTTCTTGTCGATAGCACGGGTCAGCTCGACTTTGGAACCATCAGCAAGAATGCCGACAGTCTCCCAGTCCTTACCAAACTCGCCCTCTAGCGAGATCTTCGGATCATCGGCGAACGAGACCAGAACTTGACCATCCACATATGGCTGGACATTCTCTGGGTCACGGAGAGTAACTTTGCTCATGAAAACACCTTTCGGTTAAGCCCGACTGAATACGTCACCGACGCAACCCAACCCCCGACACGGGAGTCCTTCGTCACAATCAGCCCAGTCGAGGCACGTATAGAAAAAGCCCATGGGGCTTGTCCCATGGGCTGTAGGAGATGGCCATCTATTAAGGCCATTAATCGGCGAGCCGTCGGCCCATCCGGTGCGTGAACAGTGACTCTTATCGATTCGCGTGTCCACGATTTGCGAGAGAGCGGAGTGCCATCAGACGTCACAGTGACGAAAGCGCCTTGGCGTGGAGTCCACCCCTTCGGCAGCACTGCGACGATCCTCTCTGGCTTGTCGGTGAATTTCCGTAGACGACGAAGCATCAGTCCCGTCGCGTCCTGTTGAACCCACCCCATCAGCTATCCTCGAGCTCATAGCGACGAATATCGAGGCCGTTCTCAGCCGCTGCCTTCGTTAAAACCCCGTGTTTAGCCTGCATCACCAGCCCACCAGAGTGTGCGATCGTCACCATCCCACGAGGTCTACCTGTGCGGTCGCGATCAACCTTCGTAACTACCTCTACATCGTCTGGGACATCCACAGACGCAGCCACAGCTTTTGTCGCAGCTGCTAATTCATCGCTGTAATTCTCAGCGAGGAAATCCATCAGCGCAGCATGATTAAAAGTCACAGTAGCTTTAGCCATCAGGCCTCCTTCCGCTCGCAAATAATCTGAACCCGCGGCCGATGAGACCGCAGCACAGGGCGACGGCCTACACCCCAATCCCACGGGCGATAAATCACCGAATAGGGTTCGCCGCGGATTACCACCTCATCCCCCACATTGATCGTGGGAGAATTCATGGTTAAAAGCTGCAAGCGCTTGCTATTCCCATCAGCGATCCCAGACTCAGGGGCATCAACCATCCCATCCCCAATGGGGGCGACATCAGCGAAAAACTCCCCCGCCGACACCGCCTCCACGAGATTCCCATCAGGGTCATAATCAGAGGACGCATTAATCAGAATCTTCTCCATCAGCAGCCTCCAAAAACCTCAGGCCAACGCAACGGATCAGGGAAACACCCACGGGGGCCACCCGTAAGCCCCAGCCGCTCACGCTGAACATCAGTCAGTAACACGCCAGCCCATGACACCGAACCGACATCAGCCCATGTCACAGAGTCTGACTGTGGCCCAGTCGTCGACGTAGCCGACCGCTGACCAACATTCCCACCAACGATCGCCGCCGCTGCCACCATCTCGATCACAACGAATCTTGCGGTAGGGCCAAGCCACGCCGAGCCCTGCCCCTCAGCATCAAAGTCGCGACCCGCTCGGAGAAATTCCAAGCGAATCAACTCCTCTGCATCCGAGAGCAGAACCTCAGCACGCTGGCGATCAACTCCCACCAGTGGGATCGGAAGACGGCGCGCCACGTCCTCAACATCGATTTTGAGCATGCACGCCCCCTCTCATCACTCGACTGCGTTCACCGCAGCGATGATCTCCTGCTTCTTCATACCTTTGGTGTCGATCCCAAAGGAATTCGCATATGCACGCCAAGAGTCTGTAGTAGCGGTATGCTTCGGGCGCTCAGATGAAACCTTCTCAGAGGGTTCACCCGATACAACAGCATCGGATACCGCTTCGGCAGGCTCAGCGTCCGTTTCTTGCACCGGCTCGCCGTCACCATCAATGACAACAGCAGCGCCAATGCGGACGAGCCTTTCAGCCCATGGGGAGTCATCTTCAACGACATCGTTTTCTACGAAGTGCTCCCAATCTTCCCCGTCCCATCGGCGCGCCCATGCAACCAACTTGATTTTCACTAGGCCTCCTTAGACACCAGTAATCCAAGCGGCAGCCTTGGGATTGTGAACCGCAAGAATGCGCTTCTGAGAAAGCTTGTAACCCACCTTGTCACGACGAGGATCGTCATATGGACCAGTCATCTCAAGACGCTTTGTATGGGAGACAAGACCAGGCTTGTTGTTCTCATCGAGGACGAGAATCTTATCCTTGTACCAGAACTTCGGAATGATGATCTGCAATCCATTGAAACCGCGAGCGGTCACACCCTTGTAACCAGCAAATGCAGGATTGCTCTGATCGAGGTTGCCCTCATAAGCGGATCGGAGCTCTTCGGTCTTGACGAAAGCGCTGGCCTTTGACCGTGGAACGATCATGGTCGTGGGGTCAAAACCGTAAGTCGCTTCTTTCTTCATCTCAGATGGGATTTCCGCATCAGCGATCTTATCGATTGCTTTATCGACATCCCCGAAGACATCAGCAGACTTCGTCTTCCATGTAGCAGCTGCTGCAACAGTACCGATGCCACCCTTGTCGAGCACTGACTTCAAGCGCTCGTAATTCGAGTACGCTGCAGCGTTACGGAATTGCGTCAGAGCGCGGTTGACCTGATCAACCTTGTTCTCATCTCGCATCTCGTAGGAGAAATCAAGCCAACGGCCTTCCTTCTCCGCTGCTGCCATAATCTTCTCACCTGTAACAGTGATAGTCGCAGGATATTCAGCAAATTCCGCAATGGTCTCAAGACCATCCAACGCGAATGGGGCAACATCCTTCTCATATCCCACAGAACCGTTGTTAGGCCCCACGTCTTCGAAGATTGCCGAGGTCAGATCCCAATCTGTGAGGAATTCGAGGATTGGTTTAATGAAGATCTCTGGGGCTCCCATGAGCTCAGAGACTGTGTACTGCACGCCATCATGGATGGAGTGAGTGGTAATAGGAGTATTCATACCTTTTGCCTTTCTTAAGCCAAGCGCACCAAAGCGCGGTTTTCGGAAGGCGCAGGGTCCGTGACGATGCCAACGACATCTCCTGAGGAATGCGCCTTAGCTGCACCATTTTCATCAGCTCCGACCTTGTCGCCGAAGTTCAGCTGTGAAGTGGTAGTCAGATAGACCTCCATGCCGGAGTACGCGACAGCTACATGCTCTGGCTTTTGAGCTACCAGTTCGTTTTCTTTAGCCTTTACTTGTGGGCTGCCATCGGTGATGGCTACGCCGATAACCTTTTCGGCATCAGCAGTCGCAGGCTCAACACCGCCGGTGCCTGCGACGACGATTTGGCCTCCCTTAACAGCTTTTACCGCTTTGAATGTACGGGGGCCGCGCTCTGTGATGATACGAATCGCGCTCATTTTAAAACTCCTTAGTTGTTGAGGTTGAGGGAACGTCGGATCTCAGCTATGCGATCCTCATCCATCTGGGCTTCTTCTGAACCGCTGTCATCCTTGCCGAAACCGATCTCATGAGTGGGGATCGTGTCCTTAGGATTTGAGCCGTAGAGGTCTCGTGCAAGTGATGCATCACGTCGCATCGCCGAGATCGCTTTATTTCGAAGTGCTGCAGAGATTCGGCCATCTCGAATCCACGAGTCGACCTCTTCGACTAGCTTCGAATCCTTGTCTAGTTGTAGGGCTTTCCACCCGAATTGAGCAGCAGCTTTCAGCTCTGCGTATGTCTCACGATCCAAGCGGATCGAATCATCTGCTGGCTCAGCGACAGGATCGGAAGCGGCTGGAGTTTCCTCCGCAGCGGACTTGACCGAAACACGCGCGGTTAAATCAACTGGTTCCCCATCACCAGAAACGACGGTCACAACAAAATCAGAAGAGGCTCCAGGCTCTGCATTTGGAGCAGTAACGGACAACACACCAGTAACGTTGTCGACCTCAGCTGACCACCCTTCTTCTACCTCACCAAGGGCAAAAGAAAGACCCTGTGGAACATCTCCAACAGGGTTAATCTGTGCGGTGCCGGTAGGCACGACTTGGGTTTCGTCTGGATACGAAAGTTCTATTTCTGTTCGTACCTGCACTGTCTCATTAAAAAAGCCGGAGAGTGCTGCTTTCACATCCTCCGGCGACTTGCCAAATTCTTTAGCAAGCTGATCCAAGATATTCATCCCAGCTCCATCCATAGTTTTTTCACTGCTATTTTTCTTCACCGCTGGTGGCGGTGCCGCTGAGCGGTTCGCAAACTTAAACCGGCGGCGAGCATACTTCGCATCCGGAGCTGTTGCTTTAGCTGCAGGCTTTGCCGAAATAATCTCATCAGCAAGGCCCGCCTCCACAGCTTCTTGAGCCGTGTACCACGTCTCAGAAGACATCGCGGCAAGCCAGTCGTCTACTGTTCCGCCGGCTTTACCCGCATAAATATTCGCGAGTTTCACATCTTGACGCTCCAAATCTGAAAGCGTCTTATTCGCCTCATCGACATTTCCCTCGAAGTAAGTCCAAGCACGGTGAATCATCATCTCCGATGAATCACGCATGAGAACACGATCAGCGCCGCCAACAGCAATAAACGAAGCTGCCGAAGCGGCCAACGACTCGATGATTACGGTCACGGTGCCGCGATCATAATTTTTCAGCGCATTCATGATGTCAATGCCTGCATAGACATCTCCGCCACCTGAGCTGATTCGAACGTTGACGTCTCCTGTCATATCTGCAAGCTGCGTCATCACACTCTTAGCGGTGATAGAGTTCTCCGGCTCCCAGAAGTCTTCACCAATGGGTCCATACATGAGAATTTCGTTCATCGTTCACCTCTTCTTTCGAGTAGATCAAATAAAACGGCATTGTTTTTCAACGCCTGCTCATAGCGGGAATTCATATTCCCCTGCTCGGTACTTTCGACCAACTTCACACCCATTTGCTCTTCCAACCGCTGCCTGTCTTTCTTCGACTGCAAAGCATCAGAAAGCTGCTGCTTTGGAGGAAGCGTGAATCTACGGCGAAGATCCTCTTCCAGATCCTTATCCGCCATGATCAGTCCAGCGTTCTTCAGCTGAGCAAGATCACCAGGACTGACTTCCTTCTTCGACGCAATCGGATCGAGTGTGATTCGAGGTATCAGCCCTGAGTACTCGGGGAAAGCCACACGAACGAGGTCCTCCACGATATGTTGAGTTGCCACATCAGCGATCCACTCGGCGGTTGTTTGCAGAGACTGGATAAAAAGATCGCTCTGGGTTTCCGCGAGAGCATAAGATCCGCCTTTACCTTCCAAATTTAGAAAGTGAGCAAGAACGCTCTTTGCGATCATCGAATCGTGATACAAAATCGCCTCACGCGGTGACATCAACTGACCGGAAACACCAACCAGAGTCAGCTTCGCCTTCGCAGGGATCGTTGCAGCTGAATGCTTACCAGATCGAAGCCCCTCAACCACTCGTTGGCCATTCCTCAGATCCCCCTCTGGATCTTTAGCAATCTCCGATCCCTCATAAACTGGGATACCCATGCTATTTCGCTCAATAGCATTGAACTCCATCCTGAGCAGATCATCACGCAGCTGCCAATGCTTATAGGCTGCGCGCAAAACCGATGAACCAGTCCACTCAGAGCCAATATCATCGAAAGCATAAGCGACTAGACGTTCAACTGGGATCTCTACAAAGCCATGACCAAACTCAACATTGCGTTGCTCAACAGATTCCAATCCCCCATCTGCTGCAACGTTCACCTTCTGGATAGTCGCAGGCCACCTAGGGGCAAGTTTGACCAGATGATGTCGACCGTCAGACCGTGGGGCATACACCTGCTCGAAAAACATGCACCCAAACTGGAGCGATCTCAGCGCATCTTCAAGGTGTTTTTCCCATGAAACCCTCCCAGCGCGGGAAGCCAAAGGCTCACCAGGATCTTCGCCCTTGACTCTCAGCCTCAGATCCTCAGCAACAAGGCCCACAACCTCTTCTGGGGCACCATTTGGTTCGAGATACCAAGTTGCGCGCCGAATCGGGAGCATCACAGCACGCAAAACTGACTTAACCTGTGAATCCTCCCGCCCCATCTTGGCGTAGACCTTTGCCGAATGAGGAAATTTTAGCCGCCAGTTATCCTCCCCCATCGCAGAATACCCAGAAGGCCGAGCAACACCGACTTCCTTAGCTACTGTCCCCATGAAACACCTCCTTAAAACTCAATTTCCGAAGCACGAATACCCGTCCCAGCAGCAGAAACAACATCTGCCGAACCAACAAAACGCTTCTTAGACTTCACATCGACAGAATCAGGGATAGAGAACTCTCGAAGCCCCCACAACGCCAAAGAAGCAGCTACAAGTGGGGCAACATTCCCCACAAACCTATTCAAGCTCCGATATGTTGCATTCCGGCGTTCTTCCGCAACCTCCCAGCACTCCAGCCAGCGAGGATTACCATCGTGAATGATCTTGCCTTCGCGAATCATAGAAAGAAGCAACTCATACGCCGCGGAAATCTTCCCGCCAGAAAGCTGCTCGGGCTCGATGCCATGCCTCTCTAAAGGGGAAATAAGCGTCCCCACAGGCCCTGATTTATCCATAAGGATTGCAAGGGGGTCATTTTCTTCAACTGTTCCGGCGACGGAAGCAACAACAGCATCAGAATCGAAGGATTCTTGATTACCCAAAGTCAAAAATGCGCGCCCATCTCCGAAAGCCACAGCCGCAGCCATAGCGATCTTTTCACCATCAGGAGTCGCATCAAGAGCGAGAACGCTATCACCAAGTGCGCCATGTGGAACCCCCTGCAATCGACTCCACTCTTCGGGCACTACGATGTAGCTCTTCTCGTCTTCCGCATTATCCCGCGGAACCCAGTTGCCAACGCCTAATGACTCGACAAGATACGCAGCTTTCAAAACCTCAGATGAACGAGCCGCATCAGAATCAGAACGAATATCCGCCAGCTGAGCACCAACGCCACGCAGAACTAGCGATGGATTCGATTGCTTCCACGCCTCTTCCGAAAAAATATCAACACTTTCGACATCGGCAGACCATTCCCGAAAAAGAACCCCACCAACGTTATCTATGCCCGCCCACCTTTTCGCCGAGAACACAGCACCATGCATATGTTGGGTTCGATCAACAGGTGAAGAGATAAAAATAGTATGGCTGTGCTCACGGGCACGCGTTGTTTTCGAAATTGCCGAATAAATCTGATTCGGGAGATTGAAACACTCGTCAAAAACAAGCAGCTCAATCGAAAGGCCACGTCCGGTTTTCTCAGTTCTAGTACGAAATCTGATTTTCGCACCATTGGGGAATTCAATCCCCTCTTTGCCATTAGTCTTAACTAGCCAAGGCCTTCCATCATGCTCACCTACCCACCATTCCATGAGATCATCGTTGCTTTCGATGACCTCCCAGAGTCGATCACGAGCCTCAATCGACGTATCAAGAAAGTGAGCGGTATGCAGGATCTCTTTTTCCCCAAAAAGATAAATCCCAGCAAGCTCACGTGCAATAAGAACCTCACCTTTGCCATTCTGCCGAGGAACCACGGCAACAGTCTCACGGTACCGCCACAAACCATCTTGGTCGGTGCGGCACATATCTCGAAGGAGGTTCTCCTGCCAAGGGAAAAGCGTCATCCCCGCCCACCGGCAGAACTCGACTGCGGCATCTCCACGAGTCGTATCACCAACAGCGCCTGTTCCAAGACGAGGTATCTGCGATCCGACAAGACGATCATCTACAACCGTGGACATGCTGCTTCACCCCCTATGCAAATTCATGCCAATCCGAACGAAGGTCTCTCTCGAGTCCGCTGTGGACGGCGTTCTCCGAAAAGATCAGGTCTGTCTTTAACCCATCCTCTCAGCTCTAAGGAAGCCTGACGCTCAACTTTCGCTGCGGGGTGTTCAATAAGCTCGCCAGATTCCTTCGCGGAGACGGGACCTTCTGATTCCAGTATCTTCTGGCATTGCTCACGCCGAATGATTAGAGCTGCAACTGCTTCTACAGCAGCGCGATCGAAGGTGTTGAGCTGTAGCCCTCCAGATAGGTCTTCTACAAGGCGAAGTGAGTGATCATGTTCATGTTTTTCTAAAAGCAAAACGACCTCCTAGCTGCTGAAACGTTGCTTCCGTTAAGTACATGGGAAATGTGAATAATTATTCGGGCTTATCAGGGAGAGAGAAGGCCGGACTGGGGACGAACGGGGCTGGGTCCGGCGTGGGAACGCTCCAAGATTTTTACCACCCCGGTCATTTCCAGATGAATGCTTCATTTTTTATTTCTGTTTTATTCATGGCTGGTGTCTTGGGGAGTGGTTGTGATGGGTGGCGTCCGGTTATTGCTGGGCGCGTGAAATCGCGTGAGCCGTCTCCTCGTTGTGAGTTGCATTTGAAGTGGAGGAGGCTGTCGGGTTGTTCTCCGTTACGCGCCCCGTGAGGGTTGTAATGGTCTGCTGCTAGCGGCAGACCATCCCAGTTTTGGGTTTTGTCTTTGTGCAGTGGAAGTCCGCACCAGTAGCAGGGTGTTCCGTCGCGTAGTGCCATGAGGAGGCGTTTGCGTGGGATTTCGTGGTTGTGATGTGTGTAGCCGCGTTGTCGTGCGCTTTTGGGTTTTGCGGGTTCGTACCATTGATATGCGGCTGTGAGTGATTCTGGGGGGCGCTGGTTTTTGCAGCGTTGGATCACTGTTTGTTTGCCGGGGTCAATGGTGATGATTTCCGCATTTGCTGCTTTGTATTCGGCTAGTTGCTTTGGTGTCGGCTTCGTGTGGATCAGCCAGATGTCGCAGGTGTCTGCGAGTTTGAGGGCTTGGCGGCGCATGGCTTCTCTGCCTGCGCGGACGAGCTGGATTGTCTCGTGGGTGTGCGTGTGGTTGTCGTCGGGTTGTCCGGTGATGAGGTTGGCAATGCGGTCGAAGTCGATGCGAATATCGCTTTGCTCGGCGTGTTGTATGAGGTAGGTGGTCTTGCCTGCGGCTGGTGGGCCGGTGATGACGCGCAGCAT
Protein sequences of DBSCAN-SWA_2 >LS483402|1821504:1843481|1840670_1842230_-|SQG58796.1|terminase|DBSCAN-SWA MSTVVDDRLVGSQIPRLGTGAVGDTTRGDAAVEFCRWAGMTLFPWQENLLRDMCRTDQDGLWRYRETVAVVPRQNGKGEVLIARELAGIYLFGEKEILHTAHFLDTSIEARDRLWEVIESNDDLMEWWVGEHDGRPWLVKTNGKEGIEFPNGAKIRFRTRTEKTGRGLSIELLVFDECFNLPNQIYSAISKTTRAREHSHTIFISSPVDRTQHMHGAVFSAKRWAGIDNVGGVLFREWSADVESVDIFSEEAWKQSNPSLVLRGVGAQLADIRSDSDAARSSEVLKAAYLVESLGVGNWVPRDNAEDEKSYIVVPEEWSRLQGVPHGALGDSVLALDATPDGEKIAMAAAVAFGDGRAFLTLGNQESFDSDAVVASVAGTVEENDPLAILMDKSGPVGTLISPLERHGIEPEQLSGGKISAAYELLLSMIREGKIIHDGNPRWLECWEVAEERRNATYRSLNRFVGNVAPLVAASLALWGLREFSIPDSVDVKSKKRFVGSADVVSAAGTGIRASEIEF >LS483402|1821504:1843481|1837496_1837886_-|SQG58793.1|DBSCAN-SWA MSAIRIITERGPRTFKAVKAVKGGQIVVAGTGGVEPATADAEKVIGVAITDGSPQVKAKENELVAQKPEHVAVAYSGMEVYLTTTSQLNFGDKVGADENGAAKAHSSGDVVGIVTDPAPSENRALVRLA >LS483402|1821504:1843481|1821504_1821843_-|SQG58774.1|DBSCAN-SWA MSVIAAMLAAFGGLVTAVGSVWMGYMKARSDTQAAKGSRIDIMESRLDKLQVDLDAERSRRSDAETLAHRLRLGLITAMNHLDALIRWADGGSPPPPPTAPDLKAIKSLIED >LS483402|1821504:1843481|1835422_1835755_-|SQG58789.1|DBSCAN-SWA MEKILINASSDYDPDGNLVEAVSAGEFFADVAPIGDGMVDAPESGIADGNSKRLQLLTMNSPTINVGDEVVIRGEPYSVIYRPWDWGVGRRPVLRSHRPRVQIICERKEA >LS483402|1821504:1843481|1836195_1836549_-|SQG58791.1|DBSCAN-SWA MKIKLVAWARRWDGEDWEHFVENDVVEDDSPWAERLVRIGAAVVIDGDGEPVQETDAEPAEAVSDAVVSGEPSEKVSSERPKHTATTDSWRAYANSFGIDTKGMKKQEIIAAVNAVE >LS483402|1821504:1843481|1836558_1837482_-|SQG58792.1|DBSCAN-SWA MNTPITTHSIHDGVQYTVSELMGAPEIFIKPILEFLTDWDLTSAIFEDVGPNNGSVGYEKDVAPFALDGLETIAEFAEYPATITVTGEKIMAAAEKEGRWLDFSYEMRDENKVDQVNRALTQFRNAAAYSNYERLKSVLDKGGIGTVAAAATWKTKSADVFGDVDKAIDKIADAEIPSEMKKEATYGFDPTTMIVPRSKASAFVKTEELRSAYEGNLDQSNPAFAGYKGVTARGFNGLQIIIPKFWYKDKILVLDENNKPGLVSHTKRLEMTGPYDDPRRDKVGYKLSQKRILAVHNPKAAAWITGV >LS483402|1821504:1843481|1833451_1833745_-|SQG58785.1|DBSCAN-SWA MTTEKISPAKNEVKEIETVTFTVNIRGKDVELTAPASIDDAPMDAGLAFEDDKYMKAFAIMLGPAQMQKLRNAGARSSEFGGTIMSAWSDATGLGED >LS483402|1821504:1843481|1826648_1827455_-|SQG58782.1|DBSCAN-SWA MSILLKVNLPEPEVKKITLLTPSGVEWNLTEKKDRGVFLREGGISGLVGTPSPLVSQAVGAVGQRFFGAEHGPLEGTLSLTIYDMSGDCTKIFDEFMGSLSTHVPAVLKVSSKLRGPRFGAVRLAAEIGSPACDPAEVSSLDVEASVVIDDGAWWSNWLHGNNNVTVTNNGDVPVSPKIRWKGAGGKIKLPSGATFTLPAVSQPRNLLLGIAQCLAVVDDQGELDRETWIKVRGQALPENIPPGASRVFSVPDGAELLWQVGFLSMWR >LS483402|1821504:1843481|1822309_1822639_-|SQG58776.1|DBSCAN-SWA MSNKNDIRDAVAAALATQPWFMRRKDTLAAVAGWVLQVGNFATGMAVGAPMWVSFLIAAVIGIAQIIIHAGTPGAITPSMGKRLTAVAPVTPVFDLEAVREQLSTSTQG >LS483402|1821504:1843481|1834740_1835133_-|SQG58787.1|DBSCAN-SWA MGWVQQDATGLMLRRLRKFTDKPERIVAVLPKGWTPRQGAFVTVTSDGTPLSRKSWTRESIRVTVHAPDGPTARRLMALIDGHLLQPMGQAPWAFSIRASTGLIVTKDSRVGGWVASVTYSVGLNRKVFS >LS483402|1821504:1843481|1837897_1839292_-|SQG58794.1|protease|DBSCAN-SWA MNEILMYGPIGEDFWEPENSITAKSVMTQLADMTGDVNVRISSGGGDVYAGIDIMNALKNYDRGTVTVIIESLAASAASFIAVGGADRVLMRDSSEMMIHRAWTYFEGNVDEANKTLSDLERQDVKLANIYAGKAGGTVDDWLAAMSSETWYTAQEAVEAGLADEIISAKPAAKATAPDAKYARRRFKFANRSAAPPPAVKKNSSEKTMDGAGMNILDQLAKEFGKSPEDVKAALSGFFNETVQVRTEIELSYPDETQVVPTGTAQINPVGDVPQGLSFALGEVEEGWSAEVDNVTGVLSVTAPNAEPGASSDFVVTVVSGDGEPVDLTARVSVKSAAEETPAASDPVAEPADDSIRLDRETYAELKAAAQFGWKALQLDKDSKLVEEVDSWIRDGRISAALRNKAISAMRRDASLARDLYGSNPKDTIPTHEIGFGKDDSGSEEAQMDEDRIAEIRRSLNLNN >LS483402|1821504:1843481|1833847_1834744_-|SQG58786.1|DBSCAN-SWA MSKVTLRDPENVQPYVDGQVLVSFADDPKISLEGEFGKDWETVGILADGSKVELTRAIDKNKTKGWGFGVIAVSTKPGELTGKASVLERNKTVDKIAWPDTVVPEEGEDNGVVRRHSGKVALCHVAMIGTTQNGKQKITVTRHKAIATISNLSFGEDPEGKDIEFDFQTGKLKDAFDEITIDEPKTAAGDMKPIRFVKSLEEADEDATDEAQSKTVTLPESVSGGTWTLTVDDQTTGPIGHNAIGTTVASALNKLSNVSTAKVTGSAGGPYTITGVKGKLTGTGTSLSGGDSTDITVK >LS483402|1821504:1843481|1839288_1840659_-|SQG58795.1|DBSCAN-SWA MGTVAKEVGVARPSGYSAMGEDNWRLKFPHSAKVYAKMGREDSQVKSVLRAVMLPIRRATWYLEPNGAPEEVVGLVAEDLRLRVKGEDPGEPLASRAGRVSWEKHLEDALRSLQFGCMFFEQVYAPRSDGRHHLVKLAPRWPATIQKVNVAADGGLESVEQRNVEFGHGFVEIPVERLVAYAFDDIGSEWTGSSVLRAAYKHWQLRDDLLRMEFNAIERNSMGIPVYEGSEIAKDPEGDLRNGQRVVEGLRSGKHSAATIPAKAKLTLVGVSGQLMSPREAILYHDSMIAKSVLAHFLNLEGKGGSYALAETQSDLFIQSLQTTAEWIADVATQHIVEDLVRVAFPEYSGLIPRITLDPIASKKEVSPGDLAQLKNAGLIMADKDLEEDLRRRFTLPPKQQLSDALQSKKDRQRLEEQMGVKLVESTEQGNMNSRYEQALKNNAVLFDLLERRGER >LS483402|1821504:1843481|1842253_1842577_-|SQG58797.1|terminase|DBSCAN-SWA MLLEKHEHDHSLRLVEDLSGGLQLNTFDRAAVEAVAALIIRREQCQKILESEGPVSAKESGELIEHPAAKVERQASLELRGWVKDRPDLFGERRPQRTRERPSFGLA >LS483402|1821504:1843481|1833086_1833410_-|SQG58784.1|DBSCAN-SWA MDFADRGWDYRDRYREGGGPSHMSTRRLLLLVDGLDRNSSRFWAEISDSDRFSTTDYVLMIFYLALVGEVHPMWNRREEARKQAEFEAKKKAVLAAERARKKRLGIN >LS483402|1821504:1843481|1823759_1824647_-|SQG58779.1|DBSCAN-SWA MSGVKIKGKLKNITMKPSTIREVWIRPPSTRPGAEGLIVDEPVRVLVDDAGEFTVDLALGAGVLILIGPGGIGRESIPILVRENTVTMRQAVEDAEAFSPDVHDHLAELAKETADNLGEARRVREQTIVAEKTLADASRLLKESVDATVQKATAAVKDTAASLLDQAQAAQRAATQAQEGAEAAKKSASDSQSAAATLAGQASDARDQVTQQLQEWEARAKQLADWEERYAWLKENAAEGFSKVEDLLAIVKRSENRPALWIWDGQGAWKAPTEARAKDIVLDTKTGKLYYVKGA >LS483402|1821504:1843481|1835754_1836180_-|SQG58790.1|DBSCAN-SWA MLKIDVEDVARRLPIPLVGVDRQRAEVLLSDAEELIRLEFLRAGRDFDAEGQGSAWLGPTARFVVIEMVAAAAIVGGNVGQRSATSTTGPQSDSVTWADVGSVSWAGVLLTDVQRERLGLTGGPRGCFPDPLRWPEVFGGC >LS483402|1821504:1843481|1824675_1825797_-|SQG58780.1|DBSCAN-SWA MSDVILRADGGEITVGRQVGSFVWGRWDVTLPSDVSLKSQPDTRMAEGFLYKPEDLPKGRFDLRGVRADSSQTMVVKKSGEIDYSLSNSDTEQVLDAAEKRIEGNLFFEKDITTDGLGIYEPGVDFHEDDLVDVMLWGKRLPDLPVTTIDIASERGSVNRWRVHVGGQPIADAAALRSSRDKIFAQIAAERREAASKIGVVSRQAAAAGEVAAQADGKAVAAQETADEGIKKWRQQKDELDALQTRQIKELDAQNAALRRLGDLQEPLAGSISDYDALILGPVKITYPARNKIDFELLPGQKFVSGASVLVNARVNAISQYTHSFPIRLTASRPKASGAVGWAENYAEASILVIPTADFSAILAEERAKRDLT >LS483402|1821504:1843481|1825793_1826642_-|SQG58781.1|DBSCAN-SWA MGFDWVAHAKHRRQVIEDRGEWVGLLSEDGVPICDMPPYIRIHAPTTRMNPESFQGDFEITSPQGVVHMCVDDLVADGLGKVDAEGRLVPANAATRFIAVERNGLRKVFRVMFVVASSSDPIAPRVLQVHGTDMLTELGFMPCWSIPGQVGGSFTRAVGDFGSQFSKPRYLAKLKMAAVADGFSVQGPADVTIRKLIKESLHATYKAFGVSDRPIQVADTSTGKPSPELIIRPEDRSIWEEISAPAAMAGCVVRCFMWLPEDPQPEGLRLSKPTVVVEVLQQ >LS483402|1821504:1843481|1821839_1822304_-|SQG58775.1|DBSCAN-SWA MPIERLPTSVQPTARRVRAFLMTDSTALLLLFIVQIAVGFYYLPGVLGDPLQWHRPVESIMPITAWAWVHIAVGLLCLVAAFTDRGHIDVVALAAATGLNLSWTFSLLAAAVEHDQAVLWLVGVLILSMTVSLMWAVWRGKRGDIPLAEDRGRV >LS483402|1821504:1843481|1827462_1833075_-|SQG58783.1|tail|DBSCAN-SWA MSAIGHASLPITPSLRGLKRQFETELNAPLAKLSKEAGENLKKGIGGGIEQANKSLKAAQAFHEKASKAAVDAEKAVAKAKEDVETKTKAVKTAELELEAARTKGAASIQKEEVALQKLRDSGKASAAELEAGEKRVQAARDAAAAQVAAKDTKLSAARAKLNAATDKVADSEKILAKAQTKVAEATENITARTKQLEKAQESATESTGKMTHGMQKFRGKGNEALSSVMGKLKSMPALAVGAVAGFAGFAAIKTTIIDAGREFETTFNTIRAGTGASGEAFEDLKQSLRNVSGTVSAEGGLAEIGTTLADLNTRLGVTGKPLEELTRQFQNLKHLGFDADVTAVSTALNAFGLQAEEMPEAMNELFRVSQATGVSIDDLATSAAKGGPQLQQFGFDMGESAALIGKLDKAGLDADATVGSLSKAMVSFAKAGKDPQTELFNTARAVEEFIAKGDDAAAIDMSAKLFGTKGAGKFVEAVKTGALSVDDLQNNIGVTTDTINGLAGELETTGQKWGRFKDQMKEFLEPAMTAITDFTKGGIELMIHGVENIGPAFESFKAKLQDAAHWVEDNKGKLIAFATAVSPIVVPLIIGLGVQWVASGRKAIIGAAKHTAAWLTTKKEAIAGSIANVKALHKVGSAWIKTSVKATVSAAKIAAAWLASLGPVGAVVLGVSAAIGIMWAFFTKTEIGQKIWSGFVDAVKAGAEWIKEAFSKIVEGAKELFDKLMGSDLAAGFKEMLDGLVVIITTWWEQLKANFTAGWEIIKTMFSTGWKIIKDIFATAWLVIVDLVTGNWSQIPETLRAGWETIKGHFSEGMEWIKTIFSEWLEGTKQRTSEAWETLKAKTGEFIEAIKGFFTDMATRVAATAASWVTNTVAKAVEMKDRVIATVKEMPGKIKETFANAGEWLKNTGRNIIDGLWNGLKEKWEAVRSWFSTKKSEISEFFTGTTASASGRINGHSGGGQIRGFKDGGVLPRVPGIPDSVRDPILGVTASGIPVAKVEPGEFVVNREATAKNLGLLLAINAGRVDGRQGDLGLPRYKDGGQVGASDLLRFFRGELVNGQKAARSLEGARYDWGGTNWGDCSGAQSQGALFSAGRPATTSRLFSTVNEGPQLISVGFKRGRSSGKNAYEIGVLNGGPGGGHTSGSIFGADGSVVNVEMGGARGNGQIGGGAAGARHSQYTEIYHRPLADNSGSSMLEAVSTSVDGARMGNGTQVSWGAAQQLWEASKKSLGLYDAGGWIPHGGVAVNLSGTPERVLAKPEFSGLDKIGTALLQLIPAWKTIAANAVKPLGQALGGFLSNTDLVWDAEKGLADTRKQLAEQATAITDTEREYAEAKKSGDSTKEADALKKLNEIREKSANAALQLEAAERTVVAARITATGEVATKFLNGIAESFKQVSAFFGEIERFAEQIEKTREEISKLQQQQVMLRISGVKAANELRVAEWDMSRVRMQGLIAVAKAEDDLAKAQRGYLTLTTTGVGGLGRAVDRFRITGTQSMEALGATFIANTREVEAAEWALQAARAKANLDNLDAEYAADVARLQVAEATMAQAQAADTLRLMTLRLQDETQRLYGMTSNQARGAQGFLGGIGGIFGGIAKFIGGLFTGAASLATGNILGAVTAGAGAIGGIKDIVGGIGAAKANHGDFKEAWRNSTTGARAGLGLGIAGGVLGGVVGIGGTVATGNSQWVEAGNQLGSAFTGATMGSMMDYQKHFQEAAERRHQAEMQRMTAQHTMDSARLAAERAALDARHAIGKQALNAELNVADINRQRVEAEEGSRQRSALVGAAAEAARARDALVSEARRTNQILGGGDGAGKRVFNIDLRGEPADSRLSKFVDEMDRRLSDVEFNVRRDGMSAQDYSRARQGVTV >LS483402|1821504:1843481|1822638_1823409_-|SQG58777.1|DBSCAN-SWA MKPNPSHRGDPLFLPDVLRAFGVKVQEWQGWRNRGHGDFHIIQGVMAHHTGTNKDIPGYIAQHPQLGLCSQIHLNRDGTAVITGAGIAYHAGRGSYPGWPTNDANRVSIGIEAASDGTSPWTPAQLDAYYRCCAAILWYLGKSATTQTLLGHKEYSGAAQGKWDPGGIDMNDFRKKVQHYIDNPPFATAGAKKEGEIPMIASLINPAKSFAQSTLISIVDATCWQLLVLVKAMAKQQGLDPDRLLDEAINNERKAK >LS483402|1821504:1843481|1842728_1843481_-|SQG58798.1|DBSCAN-SWA MLRVITGPPAAGKTTYLIQHAEQSDIRIDFDRIANLITGQPDDNHTHTHETIQLVRAGREAMRRQALKLADTCDIWLIHTKPTPKQLAEYKAANAEIITIDPGKQTVIQRCKNQRPPESLTAAYQWYEPAKPKSARQRGYTHHNHEIPRKRLLMALRDGTPCYWCGLPLHKDKTQNWDGLPLAADHYNPHGARNGEQPDSLLHFKCNSQRGDGSRDFTRPAITGRHPSQPLPKTPAMNKTEIKNEAFIWK >LS483402|1821504:1843481|1823456_1823756_-|SQG58778.1|DBSCAN-SWA MELGSLIGTLAGASRVVAVTIPGKIPERVSQSEATHVVCVALPTKEGETPYLALMTSAGDERRTYFLYDLNYASRTDIRIYGRNSGKEYLLLWLKLDTE >LS483402|1821504:1843481|1835132_1835423_-|SQG58788.1|DBSCAN-SWA MAKATVTFNHAALMDFLAENYSDELAAATKAVAASVDVPDDVEVVTKVDRDRTGRPRGMVTIAHSGGLVMQAKHGVLTKAAAENGLDIRRYELEDS |
25 | Corynebacterium_phage(70.59%) | terminase,tail,protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1846489 : 1855204
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >LS483402|1846489:1855204|DBSCAN-SWA GTCATTTTTTCTCTTTCCTTATCTGTGCTTTCACCGCGTTGATAAGTGCGGTCTGGGTGGTGTCTTTTTTCTCAAGTGCTTTAGCAACGTCTGCGTCGATGGTGTCGGATGTCTGAATGTGAATAATGCTCACGGGTTCGGTTTGTCCTTGGCGGAAGAGCCTCGCGTTGGTTTGTTCCACGAGTTCTAAGGACCAGGGGGTGGTGAGCCAGACCATGATGTGGCCTCCCGATTGGAGGTTGAGTCCATGCCCTGCAGACGCTGGGTGGATGAACCCGATGGGGATTTTTTGTTCGCACCAGTCAGCCATGTCTTGGCTGGTTGAGAGTTCTTTTCCTTGGGGGAATTTTGCTTCGAGTCTTTCTAGTTCATGTTTGAACCAGTAAGAAACGAGGATGGTGTTTCCTGATGCCTGCTCGACGATGTCAGCTAATGCGTCAAGCTTTTGATCGTGGATGTAGCGCACCCCGTCGTCTGTGTAGATGGCCCCTGAGGAGAGCTGCTGGAGTTTGTTGGAGAGTGTTCCGGCGGTTCCTGCGTCGATGGTGGCTGTGCCGATGGTGGTGACCATATCGTCGCGTAGCTGGTTGTAGATGGTTCTTTCTTTGGGGTTGAGGTGGACGTTTATTGTGGTTGTGGCTAGTGGTGGTAGGTCGAGGTAGTCGGTTGTTTTCATGGATACGGTGATGTCTTTGATGAGGTCGTAGATCTGTTTGTCTGCTCCTGGTCGGATTTTGTAGGAGTAGATTTGGGGGCCGTTTCGCTTGTCTGGGGTGAAGAATTTGTCCCTGTAGTGGGTGAGGTATTTTCCTAGTCGTTCTCCTCCGTCGATGAGGCGGAATGGGGCCCAGAGGTCTTCGAGACTGTTGGGGGCTGGGGTTCCTGTGAGGCCGATGATTCGGTTGATTTTTGGGAGTCGTGTTTTCAACGCTTTGAAACGTTGGGCTTTGTGGTTTTTGAAGCTGGAGAGTTCATCGATAACGACGGTGTCGAAGGGCCAATTATTGCCTAGCTCTTTAACAAGCCATGGGATGTTTTCTCGATTAATGATGTAGATGTCTGCGTCTGTGTTGAGGGCGTTTCGTCTGGTGGTGGGGTTTCCGATGATGATGCTGAGTCGGAGGTTTTTGAGGTGGTCCCATTTTTGTTGTTCGTTGGACCAGGTGTCTCTGGCGACTCTGAGTGGGGCGATGATGAGTGCTTTGGTGGTGGCGTATCGGTCGGTGATCAAGTTGTTGAGGGCTGTGAGGGTGATGATTGTTTTTCCCATTCCCATGCCGAGGAAGATTGCGGCTTGGGGGTGGGTTTCTATGAAGTTTGTTGCGAATCGTTGGTAGCTATGTGGTTTGTATTTCATGGATTGCGTCTCGGATTTGGTTGGTGTTGTTGATGTAGGTGGCGTGGAAGCCGAGTTTTTTGAGTTGGTTGATGCGGTGTTGTTGGAGTGGTCGTGGTTTTTTGCCGGGTGCTTTCAGTTCGATGAATCCGATTCGTCCGTTGGGTAGGAGGATGAGCCTGTCGGGGATTCCTGCGTGTCCTGGGGTGGTGAGTTTGAGTGCTATTCCGCCCATGTTTTTGACGTGTTGGGTGAGCTTTTTTTCGATGTGTTTTTCGTCCATGTGCGATGGCCTCTTTTTCTCTTCAGGACAACCTCGGTTGTTTTTCTGTTAGGTTGTGCGATTTAAGGGTTTTAAGGGTTATGGGGCTGTATGTCCCCTTAAATCGTCTTTTTATCTTTTCTTTGTATTTAGTTGTCCACTTGTCCTATTAGGTGTAGAAAATAGGCTTTTACTTGGGGTTTTATTGGTGGACAACTTGTTGGACAAGCGGGACAACTGGGACAACCTCAATCACCCGATTTTGTGACCTAGGTCATTAGCAATTAGTGGCGGCCTCGCCTGTGGGAGGTTGTCCCAACCTAGTTGTCCCGCCTAAAACGGCGGCTCGTCACTAACAATAAACCGGTAGACGCGCTGCCTCCCATACGGCGGCACGACGCGCCGTTCCGGCATCCTCTCCCACATGGGGAGCTTCTGCATAATCGCCGCAATCGCATAAGAATCAGCAGGCTTCATATCCGATGGGTCACGCCCAAAACACTCAGCCCAGATCTCCGCGTTCGACACGAAACCACGCCGATCCCCCGAAGCGGCGCCACCGACACCAACGAAATCACCGTCAGCACCCGACAGGTAAGCGCGTCGCTCACGAGTCCCCATCCCCCACCAACCATCCGGCAGCGGGGTGTCCAAGTACTCCTCAACCAGCCCGGCGCGCTCATCTGATTCAATGGCGGCGTCCTGCTGCTGGCGGGCTGCCTCAGCAACCACACCAGTCAGGTGCAGTTGCTCACCCGAGCGGAACCGAAACAGGGCCTCCGCCCAAATCTGATCCACTGTAGACGTATCAAGCTCCCACGACTTCTTCGGGGACTCGCCCGTGATCTTCGCCGGCCAGAAGCGCCGCCCACCAGTGTTGTCCCGCAAGAAGCCATTCTCGGCGTTCGTCGAACCGACGATGATGCACTGCCGTGGGTGCGACTCGACGGTTCGTCCGTAGGCGGCACGGAACTTGTCATCTGTTCGCGACAAGAAACCCTTGACGGTTTCCACATCCATCTTCCGCATACCGGCGAGTTCTCCAAGCTCTAGAATCCAGTACCCTTGGAGCTTTTCAGCACCGGTCTTGTCCCGCATGTCTGTCAGTGACAGCGCATCAGAAAACCACGCCCCGGCGAGCTTCGCGAAGAGCGTGGACTTGCCTGTGCCCTGCGGGCCGTTGAGGATGAGCACCGTGTCGAACTTCGTCCCGGGGTGGAAAACACGAGCTACTGCTGCGGTGAAGGTTTTCCGCGTGATCGCTTTGATGTAATCGGTGTTGTCCGCCCCTAGATAGTCGATCAGTAGCGTGTCGAGGCGGGGCTGTTGATCCCACTTGGGTAACGCGGTGAGGTAGTCCCTGATGGGGTGATAGCACCGTGACGCGGCGGCGATCTGCAGCGCCTCCGCTGTTTTTGTCCCTGAGTACAGGCCGAAGGCTTTTTCCAGGTAGAGCTTCAGCTGGGCGACATCAGCGTCCGACCAGCCTGCTTTTGTTTGCACCCATGGGAGCTTGGCGTCATCGTCTACGCAGATACTTTCAGATAGCAAGTTGTATTTGATGGGCTGTAGTAGTGGGTCGTAGGTGAGAATTTTTGTAAGGTTTTCGAGGGTGTCGTCGTATTCTCCGGAGTTTTTTCTCGTGAGGCCAGCTGTGGTGCGCCAGCTTTCCTTTTGCTTCTGCGTCTCTGGTTCCGTGGTGTGATCATCGGTGTTTCCTTGGTCTAGGAGGTCGCCGAAGTCTTCGGCGGCTTTTTGGTCGGCTTCCCTGTCGAGGAGAGCTTTGACGTCGTGGTCTTCCCTCGCGAGCTCGAGCATCGCCTTGTATGAGGGGAGTTTGTGGGTTGGGGTTCCGTGCTTAGCGTCGTCGTCCCATGTTCCGAATTTGTGTAGGCGGATGAGGTCGAAGGCGTTGACGAGTTGCCCGCCGGCGGGGTCAGTTCCGTGGTGGCTGTAGGAGAAGTTGTCGTTTCCGTAGGTGACAACACCGGATGTTGATTCACCTGGTGTGTAGGTGTAGCGCCCTTGTGTGGCGGTCGGTGTGTATGTCTCTGGGAGGAATACTTCGATGGCTTTACTGATTGGGTAGGTTCTGCAGAATGCTCCCACGAGCCCGGGTTTAGTTAGCGGGTCGGCTTGTTTATCGGCTCGCGTTTTGAGGTGTTCGGCTTGTCTGGAGGATGTTGGCCATGTGGACATGTCCCGCCAGTTGTCGTAGCGTGCGAGTACCTCATCTGGGTTTATCCACGGACCGGTGTTGGTTTTGTGCAGCGGCTCAACATCGACAGGGTGGGTGGGCCAGTACATCAGCCGGTGGGCTTCGTAGGTGGTGTCGTCGAAGGCATCGATGCCGATGTCCGCGGCAAGACGCCGGCATATCGCCGTGTACTCGTCCGCGGTGACATCGCGTGTGAGCGGGGCGATGATGCGGAAGCGGGGTAGGTCAGCGGTGTGGGAGTGGGTTGAGTAGAGCACCCACTCATATGGGAGCGTGGTGGGTAGCTCTGTCAGCGTTTGGTGTGTGGGGGTATCGGCATCGAGGGCTATGAGGCTTCGGGTGAGGATATTGTTTTTGCGTCGACGACCGTTGGCGAGGTGTCCGCCGACGAACCCGCCGAAGTCTTTTTCGTCGTCGCGTTGGGTTTTAGGAAGCGCCTGGTATTCCGCTGCTGTTTTTGTGCCTGGGCGGCTGTTGTTGAGGCGTTGTTTGAGGGTGGGCCAGTCGGTGAGCTGGTTGTCCCAGAGCATGGCGAGTCGTGATCCGGCTGTGGATATTTTTAGTTCTCGGTTCATTGCTGAGCTCCTCCTTTCGTCAGTCTTTTTGGTAGAAAGGGCATTCATAGCCATCCGCTGTCAACGGGATCCCAGAAGCCCACTCGGGGGCTTGCTCCATCAGCCGACACACCACATCAACTGGTGTCGTGGAATCGGCTTCGATGACGGCTTCGTCGTGGATGTGCATGACGATGGCGTGTCCCGCTTTTTCTAAGGTTGTGAGGGCGTGAGCCAGTAGATCTCGGGCGACCGCTTGGGTGATGTTCTCAACGAGCTTGCCACCGTAGGTTTCTTGTTTGACGAATTTTCGATTGATTCCAACGCCGTGGAAGGTAATCGTGTCATTACCGAACCTGTTTATTCCGATGCCAGCTTTGGGGTACACCAAGGATCTTCCAGATGGGAGGGTGATAAGGAGCATTCCTCCGTCGATTCGCATAGTGATGCTACGAACGGTAGAGGGTTCCCCCGTTGTGATGGTGTGCTTTGCGGCTTCGTCGATGGCATACCAGTAGGCAACGATTTTGTGGTTAGCGGATCGCCATTTGTCGACGATGGTGCGCATTTCGTCTTCGGTGAAACCCAATTGTTCACCACCCATGGTTTTAATGGCGCCTACGCCGCCTTGATATCCGCAGGCGAGGACCGCGATTTTTCCTTTTTGGCGCAAGTCCGCGTTTTGGCCATTCTTTTCGACGGGTACACCGAACATTTGGGCAGCGGTAGCACAGTAGAGGTCTTTGCCATCGATAAAAGCTTGGATAGTGGTGTCTTGGCCTGCTAGCCATGCCAACACGCGGGCTTCGATGGCCGAGTAGTCGGCGACAATGAACTTTTTCCCCTCGGCGGGGATGAACGCAGTGCGAATCAGCTGGCTGAGGGTATCTGGAACCGAGTCGAATAAAAGTTCAAGGAGGTTGTGGTGCTCTCCACGTATGAGCTCACGGGCCCCAGCTAAATCTTGGATGTAATTTCTGGGAAGGTTTTGGACTTGGACAAGCCTGCCAGCCCACCTCCCCGTTCGTCCCGCACCGTAAAACTGGAGAAGCCCATGGGCTCGGCTAGTGGCTGGTATCGCACATTGTTGCATGGCTTGGTACTTTTTCACGGAGCTTCTGGACATGTCTTGGCGTAGCTCGAGGACACGGCGTGTGGATCCTGTGGCTGTTTTTAGTGCAGCTTGGACGTGCTCTTTAGCCATGGATTCGATAGGGCATCCGTTGACGTTCAACCAGTTTTGTAGCTGCGTTGGAGAGCCAGGGTTTTCGAGTCCGGTGAGTTCTTGGGCTTCGTCGACGCAATGTTCGCGGTAGACGTCGTCGACGGTGATGGCGTGAGCTGCGAGATCGAGGTCAATACGGATGCCGTTGTCGTTGATTCGTTGATCTGTAGCGTATTGATCCCACACCCAGTCTGGGAGGGGAAACCGCTCGAGTTTTGTGCGCATTTCTTGCTCGGTTTCCACGTCACGACGGTTGTATTCGATGAAGTCCACCCAGCGTTCAGGGGCTGTTTGTGGGTGGTTCCGATGGGTCATACCTGAGGCGTCGAACAATACCGGGGCGGCGACCTCTCGTGGTTTTGTCGGTAAGCAAAAATACTGGATAAGACTTTTACCTTCGGATATTTTTTGGGCTTTCAGGCTGAGAACTTTCGCAGCTTGGTCCAGTGACATCGGCAGTCCGATGGAGGCACACCACACCATGGTGCATCGCCAACCGCCTGGGTCAATGAACTCTTCTTTCGGTAGTAAGCTTTGCTGCTGTAACCATCGGGATAGGCAGACTCGTTCGAATTGGGCATTGAACGCATATTTCGTCACCGAAGGGTCCGTGAGCGCTGAAATAATATCGGCGGGTACGGTCTCGCCGGTGGCGATTGACACCACGGTGACGGGGCTGTCATCGACGGAGTAAGCGAAGAGAAGAATTTCGAAATCGTCTGCTGCCGCATATTTATAAGCGCCTCCTTTGGCGATGTTGGCGGAGCTGAAGGTTTCAATGTCGATAGAGAGCGTTCGCATGATTCAGTCGGTCTTTCTAATAAATATTTTGTTAAAAAGTAGGGCGCCCTCTTGATAAATAGGTGGGGCGCGTAACCTATCAGTGCGGTTTAGTCAAGAAATGACGGGGACCCGAAACCACCGGTACTACTCGCGCCGAACTCGGAGGCCGCAGTAGGAGCAATATCACCAAAGTCATCCTCAGCAGTCGCACCGCCCGACAACGGGTCGCCGTCTCGAAGCTTCAAAACATTGCCGAGACCAGCCCCCACGCCCTTGTTGCCGTTGGTGGAGTAAGCGAAGAACTCGAGGGTTACCAGACCATAGGCTCCCGAGTACACATCCCCAGCCTGCGCTGGGATAGCCTTACCACCGGATAGCTTGAGCAATTGCGGCTGACGATCGACATTTGCGTTGGCGTTGATGAAATAGTGGCCAGCGTAAACGGGGTCGTCGCGTTCGATGTCACCATCACGCAAAGGGAGCTTCAAGGCGCCACGTGGCGGGATTTTTCCACCGAATTTTCCGATGCCGTCTTGTAGCGCGTTTTCTACGGCCTGCTCGACCAGCTGCAGGGTTTCGGTATCCGTCTTGGGAATAAGGACGGCCATTCCGTAGGTTTCTTTTCCGGACTCCTTGTTGACTCTCGGGGTATCGACATGAACATAGGAGAAACGAGCCTCGCCGGTGCGGACACGACGAGCGGACATCAACTTAGCGTTGGACATGATAAAAAGGTTTCCTTTCTCGGAAATACAAAAGGTAAGTAATTGTTTTTGGGAAGTGTTTTCGCCCCGTCAGGGAGGTTTCTTCACTCTGAAACAGAGCATGAAATCAACAGACTTTTCTCAACAGTCGGTTACGCAACATCCGCAAAATCATCGGCGGCACTGTGGGAGGCAATCTCAGGGCGATGATCAGCAACCGGAACCAGCGTGGGCTTACCCTCCGCCTTATGAACAAGGCCACCGAGGACCTCGTTGAAGGTGTTTTTGCCCATCAGTTTCTCCATTTTGGACAAGCTGATTAGCTTCCGGTCATAGATATCGGTGTATCCGGCGGTTGCCGCAGCCGTCGCAACCGCATCAGGATCCGTGTATTTTCGTGCTGAACGGCCAGCGACTAGCTTGAAGCCCGGCCAGACGCGTCCATTCCCCACCGCCTGATCAGTGGTGAAGCGCTCCACGTCCGCGATCCACGCCTTCACCTTCGGAGCCAACGTCAGTACCTCCGCAATCTCCTCATCGGATAGCTCGGTCGCATCGGCAAATTCAAATTTGGCGATAACGAGGTTCTCTTCGGCACGCTGCCGGCAGGTCGCTTTGATCTTGCAGAACTGGCACCACTCCCCTGCCTTGAACTCCCCCTCTCCTGCCGCGGCAAGTTTTGCGATCGGGGCAACGGTGTTCTCCGCCCAGTCGAGAAGGTCAGTGACTGACATGGAGAACATGCTGATGTTGTCTCGCCGCGGCTGATAAATAACCATTTCGATGGTGGTGAAATCAAAAATGAAGTCGAACGCTTTCAACGCCCCCAGCGCGTAGAGCTTCATCTGCGGATTATCCCAACTGTCGACGAGAACACCGGCCCCGTATTTAAAATCGATAACGGTGAGGGTGTCACCGTGGGCGATAAGGCAGTCGCCAGTGCCGAATCCCTCTGGAACAGTCTCGGAGAAGTCCAGACGTTGCTCTAGGAAGATCAGCGAGGATGGGTCTTCAGCTTGGGCTTGCCTCCACCTAGTCAGCACATAGTCCGCGTAGGCGTCGGTGTAGTCTTCCATGTCATCGTCGTTGTATTTAGACACGGGGCGCTGCGAGCGTTGGTTGAGACCCTTTCGGATTTTGTGTTCACCAAGGGCATGTGCGGCTGTACCTTCTTCAGCTGCTGTGGAAGTTGACTCCGCAAGGTTTTCCTCAAGTGTGGCGCTCGGCGGGCAATTGAGCCACCGGTGGGCACCCGAAGCGGATAGCACCGCATGCGCACGCTCCTCCGGTTCGGATTGTGGGGCTTCCTCCTCGGCTTTTTCAGCGGCTTTGGTTGCGGTTTCTGTGCCGTCGGCTTCAATGGTTTGGGGGTATTCGGTGGGTGTCCCGTCCCAGCTGATACCGAATGAGTGGACAGAGCTGAAGTTGATTCGGGATGGCCCTTCGTCTGTGGTTTCTAGTGGGACGCTTTTCTTGTTGAGGTCACCGGTTATTTTCCAGACGCGCCCTTGTCTATCTTTGACGTGGGTGAGGCCATATTCAGCGATGGTGGTTTTGAGGGCCTTGATTTCGTCGGGTTGTTCGAAGTCGATGTCTTTGGCATATATTTTGGGGTCATAGTTGAGGTGTTCCAT
Protein sequences of DBSCAN-SWA_3 >LS483402|1846489:1855204|1847825_1848107_-|SQG58805.1|DBSCAN-SWA MDEKHIEKKLTQHVKNMGGIALKLTTPGHAGIPDRLILLPNGRIGFIELKAPGKKPRPLQQHRINQLKKLGFHATYINNTNQIRDAIHEIQTT >LS483402|1846489:1855204|1846489_1847845_-|SQG58804.1|DBSCAN-SWA MKYKPHSYQRFATNFIETHPQAAIFLGMGMGKTIITLTALNNLITDRYATTKALIIAPLRVARDTWSNEQQKWDHLKNLRLSIIIGNPTTRRNALNTDADIYIINRENIPWLVKELGNNWPFDTVVIDELSSFKNHKAQRFKALKTRLPKINRIIGLTGTPAPNSLEDLWAPFRLIDGGERLGKYLTHYRDKFFTPDKRNGPQIYSYKIRPGADKQIYDLIKDITVSMKTTDYLDLPPLATTTINVHLNPKERTIYNQLRDDMVTTIGTATIDAGTAGTLSNKLQQLSSGAIYTDDGVRYIHDQKLDALADIVEQASGNTILVSYWFKHELERLEAKFPQGKELSTSQDMADWCEQKIPIGFIHPASAGHGLNLQSGGHIMVWLTTPWSLELVEQTNARLFRQGQTEPVSIIHIQTSDTIDADVAKALEKKDTTQTALINAVKAQIRKEKK >LS483402|1846489:1855204|1850898_1852884_-|SQG58807.1|DBSCAN-SWA MRTLSIDIETFSSANIAKGGAYKYAAADDFEILLFAYSVDDSPVTVVSIATGETVPADIISALTDPSVTKYAFNAQFERVCLSRWLQQQSLLPKEEFIDPGGWRCTMVWCASIGLPMSLDQAAKVLSLKAQKISEGKSLIQYFCLPTKPREVAAPVLFDASGMTHRNHPQTAPERWVDFIEYNRRDVETEQEMRTKLERFPLPDWVWDQYATDQRINDNGIRIDLDLAAHAITVDDVYREHCVDEAQELTGLENPGSPTQLQNWLNVNGCPIESMAKEHVQAALKTATGSTRRVLELRQDMSRSSVKKYQAMQQCAIPATSRAHGLLQFYGAGRTGRWAGRLVQVQNLPRNYIQDLAGARELIRGEHHNLLELLFDSVPDTLSQLIRTAFIPAEGKKFIVADYSAIEARVLAWLAGQDTTIQAFIDGKDLYCATAAQMFGVPVEKNGQNADLRQKGKIAVLACGYQGGVGAIKTMGGEQLGFTEDEMRTIVDKWRSANHKIVAYWYAIDEAAKHTITTGEPSTVRSITMRIDGGMLLITLPSGRSLVYPKAGIGINRFGNDTITFHGVGINRKFVKQETYGGKLVENITQAVARDLLAHALTTLEKAGHAIVMHIHDEAVIEADSTTPVDVVCRLMEQAPEWASGIPLTADGYECPFYQKD >LS483402|1846489:1855204|1852973_1853591_-|SQG58808.1|DBSCAN-SWA MSNAKLMSARRVRTGEARFSYVHVDTPRVNKESGKETYGMAVLIPKTDTETLQLVEQAVENALQDGIGKFGGKIPPRGALKLPLRDGDIERDDPVYAGHYFINANANVDRQPQLLKLSGGKAIPAQAGDVYSGAYGLVTLEFFAYSTNGNKGVGAGLGNVLKLRDGDPLSGGATAEDDFGDIAPTAASEFGASSTGGFGSPSFLD >LS483402|1846489:1855204|1848419_1850879_-|SQG58806.1|DBSCAN-SWA MNRELKISTAGSRLAMLWDNQLTDWPTLKQRLNNSRPGTKTAAEYQALPKTQRDDEKDFGGFVGGHLANGRRRKNNILTRSLIALDADTPTHQTLTELPTTLPYEWVLYSTHSHTADLPRFRIIAPLTRDVTADEYTAICRRLAADIGIDAFDDTTYEAHRLMYWPTHPVDVEPLHKTNTGPWINPDEVLARYDNWRDMSTWPTSSRQAEHLKTRADKQADPLTKPGLVGAFCRTYPISKAIEVFLPETYTPTATQGRYTYTPGESTSGVVTYGNDNFSYSHHGTDPAGGQLVNAFDLIRLHKFGTWDDDAKHGTPTHKLPSYKAMLELAREDHDVKALLDREADQKAAEDFGDLLDQGNTDDHTTEPETQKQKESWRTTAGLTRKNSGEYDDTLENLTKILTYDPLLQPIKYNLLSESICVDDDAKLPWVQTKAGWSDADVAQLKLYLEKAFGLYSGTKTAEALQIAAASRCYHPIRDYLTALPKWDQQPRLDTLLIDYLGADNTDYIKAITRKTFTAAVARVFHPGTKFDTVLILNGPQGTGKSTLFAKLAGAWFSDALSLTDMRDKTGAEKLQGYWILELGELAGMRKMDVETVKGFLSRTDDKFRAAYGRTVESHPRQCIIVGSTNAENGFLRDNTGGRRFWPAKITGESPKKSWELDTSTVDQIWAEALFRFRSGEQLHLTGVVAEAARQQQDAAIESDERAGLVEEYLDTPLPDGWWGMGTRERRAYLSGADGDFVGVGGAASGDRRGFVSNAEIWAECFGRDPSDMKPADSYAIAAIMQKLPMWERMPERRVVPPYGRQRVYRFIVSDEPPF >LS483402|1846489:1855204|1853722_1855204_-|SQG58809.1|DBSCAN-SWA MEHLNYDPKIYAKDIDFEQPDEIKALKTTIAEYGLTHVKDRQGRVWKITGDLNKKSVPLETTDEGPSRINFSSVHSFGISWDGTPTEYPQTIEADGTETATKAAEKAEEEAPQSEPEERAHAVLSASGAHRWLNCPPSATLEENLAESTSTAAEEGTAAHALGEHKIRKGLNQRSQRPVSKYNDDDMEDYTDAYADYVLTRWRQAQAEDPSSLIFLEQRLDFSETVPEGFGTGDCLIAHGDTLTVIDFKYGAGVLVDSWDNPQMKLYALGALKAFDFIFDFTTIEMVIYQPRRDNISMFSMSVTDLLDWAENTVAPIAKLAAAGEGEFKAGEWCQFCKIKATCRQRAEENLVIAKFEFADATELSDEEIAEVLTLAPKVKAWIADVERFTTDQAVGNGRVWPGFKLVAGRSARKYTDPDAVATAAATAGYTDIYDRKLISLSKMEKLMGKNTFNEVLGGLVHKAEGKPTLVPVADHRPEIASHSAADDFADVA |
6 | Corynebacterium_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1858326 : 1866480
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >LS483402|1858326:1866480|DBSCAN-SWA TCTACGCGGCGTAGTATTGGTTCCAAACCTGCTCCATGAGTGGACGGTCAGCTTCCGTGTAGGCGCAGACGTTCCGTATCTGCCCATTGGGTAAATTCAGCGGGTATTTCCCTGGATCTTCGCCGTGTTCTAGGGTGTATGCGGCTTTTACTCGCTTACCGAAAACGCTGGCTACAGATTTGCGTTTCTTATCCGAAAGGCCTTTTTCTTTTAGGAAGTCGGCCGTATATAGAAGCTGGTTAGCGGTGTTTAGCTCCGGCGCTTCACCCAAGCCGCGGGCGAGGATGATGCGGGCTTTTGCTTCTAGATGGTCCTGTTGGATGAGGCCTTTAGCTGCTTGGCAGAGCTCTATTTGGGAGCGGGCCTGGAAAATTAATGCATTGATCTGATGTTCTGTGGCGCGGGGGTTGATCGCTCCACCTTTTGCCCAGTACGACTCGATTACGTCCGCGATCTCTTGCTGGTAGGCCACGACTAGTGGGCGGGCTTCTTCGCTGACACGGTTCTCGTCGATGGTGGCAAGCCACATCGTCAAAGTCCGAAGATCGATCAGATTCGTTTCCTGCGGTCCGCCATTTGAGGGGATGGTCATCTTGACCATGCCTGCCCACGACTTTGAGGATAGGCGGCGGGCCTGGCTGCGGTAATCCATTCCGATGGATTCAACGAGCGGGCGGAACACTACGTGTGGCTTGCCACCAATTTCGACAGACTGCACCTGTTGCCCGTGAAAAGGGATAATGGCTAATTGATTAGACATGGTATAGTCTCCTTGTTGAATAGTTTTTTTGAATTTGCCTCGCGTCTCTAGCGCGGGGCTTTTCTTATGCCGCGATTGCCGCGTCGTTTTCGACCACTAGCAATCGGGCGGGATTCGCCCCTAGCGCGACGAGCGCATTGAGGACTGCGATGGTTGGTTTTCTTGAAGTGGTGGCGGTTGACCAGGTTTTTCGCGATAAGTCTGTTCGTTCTGCAAGTAAGGTGTTTGAGGCGAGATTGTTTAAACGTTTTACGCGGTCAATTTCGTTTAGATTGAGTAGCAACATTCCCATTTCCTCCTTTCGTGTTGCTGTTATGTAACTCAATTTACACAGATGGGAACTTCCCGGCAATACCCGTGTAACAAATAGTTCGCAAAAGGGGGTAGTTTGGTTACAGTTTGTCACGCTAGATGGCTATTTAGTGAGTAAAAAGTTACTCACTGTGCTAGGGTTTAAGCCATGGACATAAAACAATGGCTAAGCGAAAACGCGCACCGACGCGTCACCGACAGCGAAGTCGCCGAAATTCTCGGAGTAACCAGAAAAACCGTGAACACCCGACTTAATGCTGGAACCCTCACCGCAGATGACCTCCTACTAATCTGTGAACGCCTAGGCATCAACCGCACACTCGCACTGGTCGAACTCGAGCGACTCCCCCACTCAGATGTTCTCGAATATCTAGACAGCGACGGAGCCCTCCTAGCCACCGCAGAAGACGGAGACCTAGCGATAGAACTCGCTCGCAGACTAAACCCGGCCACCAAGGCGCACGAGATCGACGAACTTGCAGCTAGGCGCCGTCGTAGCACCCCCGCTACTCCCCCGAGTGTCAAACCCCTAAGCGATGATGACGGCACCATCATGGATTGGGACGAAAGCCTGCCACACGCGGCAGACTCCAGCCCAGACGAAACAGAAGAACGGCTAAGGAGAGGCGAGAGCCCAGTTGACTAAGAGGTAAAAGACTTTTACAAAAGCCTGGTCACAACACCTTTCCTAATTAGAAAGAGATTCACCTAACCAGGCACTTTCAAACAGGTTTTGAAACCCGCACTTCACTTAGACCAACAGAAAACGGTAAGGTAATGAACCATGACAATCATGAACCCCAGCCGCAAACGCCGGGCGTTCCTTACGGGCCTCGGCAGCCTCATGGACCTCAGCGGCCAAACGACGTACAGGCAAGCCCAGAAACTCCTGCCACCAGCAGCCAGAACCAATATCAACCAGCAGCTCCTCCGAGCTTCAGCGCAGACGAGCTTAGAGAAAACCCAATTCTCCTAGCTCAGGTCACCGCCTACCTGCAGGAAAATCACCTCCATCTACCTCTCCTCACTCCCGACCTAGACGAAATGACCCGGATGAGGAAGGACACCCCTGAGCTTTACCAGGCCTACGTCAAAGCCATCAACGCCCAAATCGACGCAGACCATAAAGCCCGCACGCTGCCCTACGTCGAACCTGGAACCATCGCCAAGCGCGGTCAAATATTCGGTCTCATAGCAGTAGTTGCGGTGTTGGGCTTCTGCGCCTACCTTGCATTCCTCGGACATGTGATCAGCGCCAGCATCATCGCGGCATTCGACCTTGTTGCACTCGCAAGCGTCTTCGCAAGTGGAAAGGACGAAAAGCAAGACGAACGGAAGCAAAGTGTCCATGATTGATGCCCTCATCGACGCCGCCGAAGCCCGCGGCTACCGCATCCGCTGGCACCGCGGTGGACCAAAAGCAGCCTGGCTACCCCACACCCACACCATCAGCGTGAGGGTGGGCATGGACGATGTGCAGACCCTATGCTCACTAGCACACGAGCTCGGACACGCACACCACAATGACCCTCCAGGGCACACCGGATTCCGGGAGCAGCGAGCCAACCGCTTCGCAGCCCAGCTCCTCATCTCCCCCGTTGAGTACGCCACAGCAGAGACGATCTACGGCCCGCACCCAGCACGCCTAGCCCACGAGCTCGGCGTCACCGTCGAAATCATCGAAGCCTGGCAACAGCTACCACAACAAGTCTCAGCATGATCTGAACACAACGGTAACATCACCCCAAAAACTACCCGGAAAACATTAGAAGATATGGCTGATCAAGCCATGAAGGAATGTCTGTATAGGTAAATAACATAAAAAGCCTCTTTGCTCGATCGACGCCAATCAACCCAAACAAAGAGGCAAAGCCACCAATTATCAAAATCACTGACTTAGGGAGAAGTCTATCTCATGCCGCAAAAACGCACAGACAAAAACGGAAAAACACGCTGGATCGGACGATACCGCGACAAAAACAAAAAAGAATACACACGATCATTCCCCACACGACGCGAAGCCAAACAATGGGAAGACGAACGAAAAACCCAAGTCACAAAAGGCACACACGTCACCCCACAACGGGAAAAAACCACCATCCTAGAAATGTACGACGCATGGACAACACGAGACCTCTCCGACGGAACCCTCCTGTCCTACGCCCAAACCAGACGCGAACTACAAGACTCAATCGGCGCAGCCCAAGCCATCCACACCACCGTCACAGACATCAACAAATGGCACCTCCAACTCATCAACGGCCGCCCATGGATGGACAACAAAACCCTGGCACGAACAACAGCCCGCGAACACATGGTCAGGCTCTCTAGCGCATTCAACTTCGCAATCCGCGAAGGGTGGCTCTACCGCAACCCCGTCATGGTCCCACCCGCAGCCACAACAACCGCAGTAAAAGCAAAAGAAATCCCCACCCTCGACGAAATCCAAAGCCTAATTACACAGGTAGAAACCGGCGGATCCATATACCAAGGCTATTCAGTTCGACGAGGCAAAAGCGTACCCGACACATTCACCTCCCAACCAGCCCCAGTCGTTGCCGACATGATGCGGCTTGGCGTCGGCAGCGGGCTGCGAATAAGCGAAGTGTGTGGGTTAATCGTTAGCGACATCAACGTCTGCGCACGCGAACTTCATGTCACGGCCCAGATTCACCGTGAAGGTAAGCGGCGCGTGGCGTTGAAAACAAAGGCTTCCGAACGTGTCGTGCCCCTGGCAGATGATGCGCTGGAATTACTGGGAAAATATACCAAAGGTAAGGACCCTGATGATTGGGTTTTCGCGACCAAATGCGGGACTCCGTATCGGGCTTCATCGTTGGGTGGGGCGATTCGTCATGCTTCGCGTCATTTGGGGGTTGAGTGGACGTTTCACTCATTGAGGCATTTGTATGCTTCGCGTCTTATTGCCGCGGGTGTGCCTGTGAATGTGGTGCAGAAGTTGATGGGGCACGCAAGCGCGACGGTAACGCTGGACACGTACACACATCTGTGGCCTAAGGCTGATGATGTGGCACGGTCAGCGATTGCGGGCGCGGTTGCGGCGTGCGGGCAAAATGCGGGCAAAGGTGGTGTTTAGAGGGTGTTTTAGCTGCGTTTTTGCAGGTCAGGCTAAGCCAATTCCACGATCTCCATGTAAGTCTCGCTCCACAGGTCTTCGTCTCCGTCAGGAAGCACTATGACGCGTTCGGGTTCTAGGGCTCGCACCGCTCCAGGGTCGTGAGTCACAAGCACCACAGCGCCTTTATAGGTTCGGAGTGCGTCAAGTACTTGTTCACGGGAAACCGGGTCAAGGTTGTTCGTTGGCTCGTCGAGAAGCAAGACGTTTGCACGCGAAGAAACCAAGGCTGCCAGCGCCAATCGTGTCTTCTCACCGCCTGATAGCGTCCCTGCAGGCTGATCGAGCTGCTCACCGGTAAACATGAAGGCACCGAGGAGACCGCGGAGGTCTTGTTCGCCAGCTTCTGGGCAAGCATCGATAGTGTTCTGCCAGACAGATTTTTGTGGATCGATGGTGTCATGTTCCTGGGCAAAATAGCCGATTTTCAGGCCATGTCCGGAAACGATTCCGCCTTCGCCATCTGTTCTTTCAACGCCTGCAAGAAGTTTCAGAAGTGTCGTCTTACCAGCACCATTGAAGCCTAGGACCACTACTCGGGATCCTTTATCAATGGCCAGGTCCACGCCAGCAAATACTTCCAAAGAACCGTACATCTTGGTCAACCCGGTCGCGTTCAAAGGGGTTTTTCCGCAAGGTGCTGGCTCAGGGAAAGAGATGTTAGCTACTCGGTCTGCAATGCGGATCTCATCAAGGTTGCCCATCATTTTTTCAGCGCGTGCCAGCATCTGTTTAGCTGCGGCGGCTTTGGTGGCCTTGGCTCCAAGTCGAGCTGCTTGATCCTTCAACGCGGCGGCTTTCTTCTCCGCATTGGCTCGCTCACGACGACGACGGGCCTCGTCAGTGGCACGCGCATCTTTGTACTTAGAAAAGCTCATGTTGTAGATGTCCGCTTCGCCGCGCACAGCATCAAGGAACCAGACTTTATTACAGACCGCATCAAGCAGTTCTACATCGTGTGAAATCATGATGAGGCCACCCTCGTGCTTGCTCAAAAAATCGCGCAGCCACGTAATAGAATCGGCATCAAGGTGGTTCGTCGGCTCGTCGAGAAGCAATGTCGTGCTTGATTTTCCAGAGCCTGCTGACGCCGCAAAAAGGATTTGCGCCAGCTCTACCCTACGGCGTTGTCCGCCAGAAAGCGTCTTGAGTGGTTGGTCGAGGATCCGTGCGGGAAGACCCAAGTTATCGCAGATGCGAGCTGCTTCAGCAGCTGCTTCATAACCGCCAAGCGCATGATAGCGTTCCTCAAGCCGCGAATACTTGCGAATAGCGGCATCGCGCTTTTTATCATCCGTCGTGGTCTCCATGATTTCTTGCTGGCGTTCCATGGAGGTTTGGATTTGGTCTAATCCACGCGCAGAAAGAACACGGTCACGCGCGGATTGATCGATATTGCCTTCTCGCGAATCTTGCGGAAGATAACCAATGTCACCACTACGAGTCACTACGCCGGCATAGGGTTCAGTCTCCCCCGCCAAAATTCGCATCGTAGTAGTTTTGCCGGCACCGTTGCGGCCAACAAGTCCAATTCGATCTCCCGGCTGTACACGAAGATGCTGGCCAGGAGCATTCAGGAGGGTTCTTGCCCCTACGCGGACTTCAAAATCATTGGTCACAATCACAACGGCGGATTATATCAAGTAACCGTCCCTATCCTGTAACAGGATGATGTTCCCCGCGCATCATGGGACGCAAGGGGAACATCTGCGAGTTATGGCGTTAAGTTAAAAACCTAAACCGAGAAACCAAGCGCACGCAATTGGTCACGGCCATCTTCCGTAATCATATGCGGGCCCCATGGAGGCATCCATACCCAATGAATGCTCAGGGACTCTGCGATCTTGTTTCCTACAACTGCAGATTGCGCCTGATCTTCTAGGACGTCCGTAAGCGGACACGCAGGCGAGGTAAGAGTCATGTTCACGTGTGCGTGAACCCCGTCTACCATCCATACGTCATATACCAATCCGAGGTCGACAACGTTGATGCCTAGTTCGGGGTCAATAACATCACGAAGATACTCTTCCACATCACTAGCTTTAGCAATGTCTTCCTCTGATTGCTCCGGACGCAAAGCACCTGCTGCGAGATCAGATTGTTCCTCAGAAGCTGCCTCGGTAGCGGTGCCATCAGATAGGTCTTCCACTTGGGGCTCTACCGAGGTGCCTGCTGGCTCTTCCGTGGCTTTGTTTTCTACGTGTTCTTCGTTCACTTCTTCTCCTCTAAAGCATCAGCAGTGGCAGCCTGGAATGCCTTCCACCCCAGAAGCGCACATTTCACCCGCGCAGGATATTTGGACACTCCGGCAAAGGCAATGCCATCGCCGATAAGGTCTTCATCTCCTTCTACCGTTCCGCGAGATGTAATCATCCGCTCAAATTCTTCGAGCTTATCCATGGCTTCTGCAACGCTCTTGCCAACGATTTCTTCCGCCATGACCGACGTGGAAGCTTGACTAATAGAGCAACCTTCCGCGTCATAAGAAACGTCTGCGACAGTTAAGCCATCCTCTGAAAGATGCACCCGGAGTGTGATCTCATCACCGCATGAGGGATTCACATGATGGACTTCGGACTCATAAGGTTCACGCAGACCCGCATGCATCGGATTTTTATAGTGATCGAGGATCACTTCTTGATACATGGACTCGAGGTTCATAGTGCTCCTCCTTCCGAGGCAGTAACTCCAAAGAATTTTTTAGCCGCTTCAATCGCCTCAACGAGTCGGTCAATCTCTTCTAGCGTGTTGTAGAAGTAAAAAGATGCTCTCGCAGTCGATTGTGCATTAAGTGCGCGGTGTACCGGCCACGCACAGTGATGTCCAACCCGAATGCACACACCGTGATCATCAAGAACTTGGCCGAGGTCATGAGGGTGTATCCCCTCGACCTTAAAGCTCACGGCAGAACCACGATCTACATTGGTTGTAGGGCCGTAGATGCTGAGACCCTCAATGCTGCTCAGACGGTTGAGAGCGTAGTCCGTGAGCTTGTGCTCATGGCGTGCAATAGCACTCATCCCTATCTCTTGCAGAAAATTTACTGCTTCTCCCAATCCCACCACTTGGCTGGTCATCTGTGTACCGGCTTCAAATCGTTGGGGAATGTCTGCAAACGTCGTCTTTTCCATAGTCACTACGGAAATCATCGAGCCACCGGTAAGGAAAGGCGGAAGTTGTTCCAGTAACTTTCGTTTTCCATAGACAGCGCCAACGCCACTCGGGCCACACATTTTATGACCCGAAAACGCAGCAAAATCGACATCAAGATCATGGAAATTTACCGGCATGTGCGGGACCGACTGACACGCGTCGAGGACCACAAGCGCGCCTACCTCGCGGGCACGACGCACCAGCTCTGCGACCGGCGCTACTGCCCCTGTGACATTCGACTGATGCGTAAATGCAACCACTTTTACCGTCTGGTCAAGTTCTAAGGAGTCAAGATCGATGCGCCCATCTTCGGTCACTGAGTACCACTTCAGCGTGGCACCCGTACGTTCGCACAATTCTTGCCAAGGCACCAAGTTAGCATGATGCTCTAGCTCTGTGACTACGACGGTGTCGCCTTCCGTAACTTGGAGGTCACCGGCTCGGAGGTCTCCCAGCACGTATGCGACTAGGTTAAGGGCTTCCGTGGCGTTTTTAGTAAACGCTATTTCGTCCCACTCCGCTCCAACGAACCCTGCGATCGCGGCACGAGCGTCTTCGTAGGCATCTGTGGCTTCCTCGGCAAGTTGGTAAGCACCCCGGTGCACAGGAGCATTAGTGTGCAAAACAAAACGCTCTTCTGCACGCCACACTCGCTCTGGGCGTTGCGACGTTGCACCGGAGTCCAAATAGACCAAGCTTTTACCATCGCGCACAGACCGCGACAATATAGGAAACTCTTGACGCAGTCGCTGCGTATCGAGCTCTCCTGACTCAGTCAGATATGTATTGCTCAT
Protein sequences of DBSCAN-SWA_4 >LS483402|1858326:1866480|1858326_1859085_-|SQG58816.1|DBSCAN-SWA MSNQLAIIPFHGQQVQSVEIGGKPHVVFRPLVESIGMDYRSQARRLSSKSWAGMVKMTIPSNGGPQETNLIDLRTLTMWLATIDENRVSEEARPLVVAYQQEIADVIESYWAKGGAINPRATEHQINALIFQARSQIELCQAAKGLIQQDHLEAKARIILARGLGEAPELNTANQLLYTADFLKEKGLSDKKRKSVASVFGKRVKAAYTLEHGEDPGKYPLNLPNGQIRNVCAYTEADRPLMEQVWNQYYAA >LS483402|1858326:1866480|1860174_1860753_+|SQG58819.1|DBSCAN-SWA MNHDNHEPQPQTPGVPYGPRQPHGPQRPNDVQASPETPATSSQNQYQPAAPPSFSADELRENPILLAQVTAYLQENHLHLPLLTPDLDEMTRMRKDTPELYQAYVKAINAQIDADHKARTLPYVEPGTIAKRGQIFGLIAVVAVLGFCAYLAFLGHVISASIIAAFDLVALASVFASGKDEKQDERKQSVHD >LS483402|1858326:1866480|1860745_1861117_+|SQG58820.1|DBSCAN-SWA MIDALIDAAEARGYRIRWHRGGPKAAWLPHTHTISVRVGMDDVQTLCSLAHELGHAHHNDPPGHTGFREQRANRFAAQLLISPVEYATAETIYGPHPARLAHELGVTVEIIEAWQQLPQQVSA >LS483402|1858326:1866480|1864268_1864748_-|SQG58823.1|DBSCAN-SWA MNEEHVENKATEEPAGTSVEPQVEDLSDGTATEAASEEQSDLAAGALRPEQSEEDIAKASDVEEYLRDVIDPELGINVVDLGLVYDVWMVDGVHAHVNMTLTSPACPLTDVLEDQAQSAVVGNKIAESLSIHWVWMPPWGPHMITEDGRDQLRALGFSV >LS483402|1858326:1866480|1861312_1862494_+|SQG58821.1|DBSCAN-SWA MPQKRTDKNGKTRWIGRYRDKNKKEYTRSFPTRREAKQWEDERKTQVTKGTHVTPQREKTTILEMYDAWTTRDLSDGTLLSYAQTRRELQDSIGAAQAIHTTVTDINKWHLQLINGRPWMDNKTLARTTAREHMVRLSSAFNFAIREGWLYRNPVMVPPAATTTAVKAKEIPTLDEIQSLITQVETGGSIYQGYSVRRGKSVPDTFTSQPAPVVADMMRLGVGSGLRISEVCGLIVSDINVCARELHVTAQIHREGKRRVALKTKASERVVPLADDALELLGKYTKGKDPDDWVFATKCGTPYRASSLGGAIRHASRHLGVEWTFHSLRHLYASRLIAAGVPVNVVQKLMGHASATVTLDTYTHLWPKADDVARSAIAGAVAACGQNAGKGGV >LS483402|1858326:1866480|1859545_1860043_+|SQG58818.1|DBSCAN-SWA MDIKQWLSENAHRRVTDSEVAEILGVTRKTVNTRLNAGTLTADDLLLICERLGINRTLALVELERLPHSDVLEYLDSDGALLATAEDGDLAIELARRLNPATKAHEIDELAARRRRSTPATPPSVKPLSDDDGTIMDWDESLPHAADSSPDETEERLRRGESPVD >LS483402|1858326:1866480|1859149_1859371_-|SQG58817.1|DBSCAN-SWA MLLLNLNEIDRVKRLNNLASNTLLAERTDLSRKTWSTATTSRKPTIAVLNALVALGANPARLLVVENDAAIAA >LS483402|1858326:1866480|1865190_1866480_-|SQG58825.1|DBSCAN-SWA MSNTYLTESGELDTQRLRQEFPILSRSVRDGKSLVYLDSGATSQRPERVWRAEERFVLHTNAPVHRGAYQLAEEATDAYEDARAAIAGFVGAEWDEIAFTKNATEALNLVAYVLGDLRAGDLQVTEGDTVVVTELEHHANLVPWQELCERTGATLKWYSVTEDGRIDLDSLELDQTVKVVAFTHQSNVTGAVAPVAELVRRAREVGALVVLDACQSVPHMPVNFHDLDVDFAAFSGHKMCGPSGVGAVYGKRKLLEQLPPFLTGGSMISVVTMEKTTFADIPQRFEAGTQMTSQVVGLGEAVNFLQEIGMSAIARHEHKLTDYALNRLSSIEGLSIYGPTTNVDRGSAVSFKVEGIHPHDLGQVLDDHGVCIRVGHHCAWPVHRALNAQSTARASFYFYNTLEEIDRLVEAIEAAKKFFGVTASEGGAL >LS483402|1858326:1866480|1864744_1865194_-|SQG58824.1|DBSCAN-SWA MNLESMYQEVILDHYKNPMHAGLREPYESEVHHVNPSCGDEITLRVHLSEDGLTVADVSYDAEGCSISQASTSVMAEEIVGKSVAEAMDKLEEFERMITSRGTVEGDEDLIGDGIAFAGVSKYPARVKCALLGWKAFQAATADALEEKK >LS483402|1858326:1866480|1862526_1864158_-|SQG58822.1|DBSCAN-SWA MIVTNDFEVRVGARTLLNAPGQHLRVQPGDRIGLVGRNGAGKTTTMRILAGETEPYAGVVTRSGDIGYLPQDSREGNIDQSARDRVLSARGLDQIQTSMERQQEIMETTTDDKKRDAAIRKYSRLEERYHALGGYEAAAEAARICDNLGLPARILDQPLKTLSGGQRRRVELAQILFAASAGSGKSSTTLLLDEPTNHLDADSITWLRDFLSKHEGGLIMISHDVELLDAVCNKVWFLDAVRGEADIYNMSFSKYKDARATDEARRRRERANAEKKAAALKDQAARLGAKATKAAAAKQMLARAEKMMGNLDEIRIADRVANISFPEPAPCGKTPLNATGLTKMYGSLEVFAGVDLAIDKGSRVVVLGFNGAGKTTLLKLLAGVERTDGEGGIVSGHGLKIGYFAQEHDTIDPQKSVWQNTIDACPEAGEQDLRGLLGAFMFTGEQLDQPAGTLSGGEKTRLALAALVSSRANVLLLDEPTNNLDPVSREQVLDALRTYKGAVVLVTHDPGAVRALEPERVIVLPDGDEDLWSETYMEIVELA |
10 | Corynebacterium_phage(37.5%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2415298 : 2451328
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >LS483402|2415298:2451328|DBSCAN-SWA CTTAGTTCAGCTCCTCAAGGCGGGCGTTGATGCGCGCCACCTCTTCTTCTGCGATCTTCTGGCGGTTGCGGATTTTCTCTACCACTGCCTCGGGCGCCTTGGACAAGAATGCCTCGTTGCCGAGTTTCTTAGCTGTGGTCTCCAATTCCTTGGTTGCCGCTGCCAAATCCTTCTCCAGGCGCTTACGCTCAGCAACCTTATCCACGGTTCCGGAGGTATCCAGCGAAACCACAACGGTGGCCTGCGACAAGCGCACTTCAATGCTTGCCGACGCCGCGAAGTCCTCCGCAGGAGTCTCAAGGCGAACCAAGGAACGCACGATGCTCTCCTGCTCGTCCAGGTCACAAGCGCTAAAGTCTACGCGTGCGGGCACCTTCTGGGACGGCTTGACGCCCTGGTCTGAGCGGAAGCGGCGTACCTCGGTGACGAGCTTCTCCACATCAGCTATACGACGAGCAGCGGTGGCGTCGATAAGCGCGCCGCCGTTTGTTTCCGCCTCGGTGGGCCATGCCGCTACGTTGAGGGACTCGCCATCGGTCAGAGCTTTCCACAGGACTTCCGTGACAAACGGCATCGCGGGATGCAGCATACGCAGTACAGCGTCCAGTACTTGCCCCAGCACGATCTGAGTGTTCTCGCCACGACGACGCTCCTCCGGAGTAGCGGAGTCAATATCACGTGGAATCTGCACCTTGGCAAACTCCAAGTACCAGTCGCAGAACTCTCCCCAGGTGAACTGGTAGAGGGCCTCATTTCCCTTAGCAAACTGATACGCATCAAAATAAGAATCTACCTGCGCGCGGACCTGCTCAAGGCGGTCGAGGATCCACCGGTCGGCGTCGCTCAGCTCCTCGCGGGCAGGAAGCTCGCCCACGCGTGCGCCATTCATGAGCGCAAATTTGGTGGCGTTGTAGAGCTTGGTAGCAAAGTTGCGGGAGCTCTGTGCGGAGTCTTCACCAACGGGCAGATCGACGCCGGGGTTGGCGCCACGGGCAAGAGTAAAGCGCAACGCATCAGCACCAAAACGTTCCACCCAGTCCATGGGGTCGATGCCGTTGCCCAAAGACTTACTCATCTTGCGCCCGTGTTCGTCGCGAACAAGACCGTGCAGGAAGAGGTCCGTAAACGGCACCTGCGGACGACCATCGCGGCCTTCCCCAAGAAGCTCCGGGGTGGTCTCAGCAGCAAGAGTGCCAAACATCATCATGCGGGCAACCCAGAAGAACAAGATGTCATAAGCAGTAACCAGAACCGAGGTTGGATAGAATTTTTCCAGCTCAGGTGTCTTTTCTGGCCAGCCCATCGTGGAGAACGGCCACAAAGCCGAGCTAAACCAGGTGTCCAGGACGTCAGGATCCTGCGTGTATCCCTCGGGCGCCTGCTCATCGGGCCCCACACACACGATGTCTCGCTGTCCGTTAGCGTCCTCAGGTCCGTACCAAATCGGAATGCGATGACCCCACCACAGCTGACGCGAAATGCACCAATCGTGCATGTCATCAACCCACTCAAAGTAACGCGGCTCTGCAGACTTGGGGTGGATGACCGTATCACCTTGGCGTACGGCATCGCCAGCCATAGCGGCGAGTTCTTCCACGCTGACAAACCACTGGAGGGACAAACGCGGCTCAATGGCTTCACCCGAACGCTCTGAATGCCCTACCGAGTGCACATACGGGCGAATTTCCTTAACGATGCGCCCTTGCTCCGCAAGCGCCTCGCGGATCTTGACGCGAGCCTCCGCCCGATCCATTCCATCAAACTGTGTTCCCGTACCTGCGATATGGCCGGTCGAGTCCATAATGGTTGGCATATCAAGGTTGTGGCGCAAACCAAGCGCGTAATCGTTCGGGTCATGCGCCGGAGTGATCTTGACTGCGCCGGTACCAAACTCTGGGTCAACGTAATCATCAGCGACTACGATCATCTGCCGATCCGCAATAAACGGGTGTGGCAACGAGGTGCCTACCAGGTCCTTGTAGCGCTCATCCTCTGGGTGCACAGCCACTGCGACGTCACCCAGCATGGTCTCCACGCGCGTGGTGGCAACGATGACGTTTGGCTCGGAATCATCCAGGGATCCATACCGGATGGAGACAAGCTCGCCTTCAACATCCTTGTACACGACCTCAATGTCGGAGACAGCGGTCTCCAGGACCGGTGACCAGTTGACCAGGCGATTGGCCTGATAAATCATCCCGCGGTCATACATCTGCTTGAAAATAGTTTGAACTGCGCGCGAGAGGCCCTCATCAAGGGTAAAGCGCTCGCGGGACCAATCGACGGAATCGCCGATAGCTCGCATCTGCTTACCGATAGTTCCGCCGTACTCGTTTTTCCAGTTCCAGACATGGCCGATGAATTCTTCGCGGTCATAGTCCCACCGGCTCTTGCCCTCTTTTTCCTTGAGCATGGCCTCGACTTTGGTCTGAGTCGCGATTCCGGCGTGATCCATACCCGGCAGCCACAAGACTTCAAAGCCTTGCATCCTCTTGCGTCGAGCGATGGAGTCCATCAGCGTATGGTCCAAAGCATGGCCCATATGGAGCTGGCCAGTTACGTTTGGCGGAGGAAGCACAATAGAGAACGGCGGCTTGGAGCTAGAGGTGTTAGCGGTGAAGTAGCCAGCATCCACCCAGCTTTGATAAAGCTCCTGCTCATGAGCTTGGGGATCCCAGCTCGCGGGGAGGCGGTCAGCGCGGTTTGCATTCTCATTATTCGCAGTCACGTCGACCATCTTAGCGTTAGCACCAAGCACACTTCACGCTGGGGTTTTTAGGTAGCTCTCGGACCTTCCCCGCTCTCCTTATTCACCGAATCTATAAAACCAAAAATGCTCCCGATCCAAGAGCCACACAAGGCTCTGTCGGGAGCATTATTTTGGATAAGGCTTTTAGAGGGGCTACTTCAGCAAGTCGGCAACAGCCGCGCGCTCCTCTTCGAGCTCTTTCACGTTACGAGCGATACTTTCCTTCTGGAAGTCCGAGAGCTCAAGGCCATCGACGATCTTCCATTCGCCGCCTTCAGCGATAGTCGGGAAACCGAAGATAAGGCCCTCTGGGACGCCATAGGAACCATCCGAAGGGATAGCGGCCGTACGCCACTGGCCTTCCGTTCCATTGATCCAGTCATGCATATGATCGATTGCCGAGGACGCAGCCGAAGCTGCCGAAGACTTTCCGCGAACTTCAATGATCTCAGCGCCGCGCTTAGCCACTCGAGGAATGAACTCGTCTCGGTACCATGCATCATCAACCAAGTCAGCCAGCTTTTCTCCGTCCACGATTGCATAAGCAATGTCCGGGAACTGGCCTGCTGAGTGGTTACCCCACACAACAAAGTTCTCAATGCCGTTCTTATCGCGGTTAATCTTATCCGCAAGCTGGGAGATACCGCGGTTATGGTCTAAGCGCATCATTGCATTGAAACGGTCAGCGGGAATGTCTTTAGCGGCAGACTGTGCAATCAGAGCGTTGGTGTTGGCAGGGTTTCCTACAACCAAGACGCGAATGTCATCTGCGGCGTTGCGGCTCAGAGCATCCCCCTGGGGGCCAAAGATCTTACCGTTAGCAGTCAGCAGCGCAGCCCGCTCCTCCCCTTTTCCTCGTGGCTTTGCGCCCACAAGGAATGCAGCATTCGTACCATCAAAAGCAACATCGGCGGAGTCAGTCACGGTGATGTTCTTTAAAAGTGGGAAGGCCGAGTCAAGCAATTCCATTGCCACACCCTCAGCGCCACCGATTGCCTGAGGGATCTCAAGAAGCTGAAGCTCAATTGGGGTGTCTTTGCCGTACACATCGCCATTGGCGATACGCCACAGCAAAGAATAGGCAATCTGCCCAGCCGCACCAGTGACAGCGATCTTCTTCACAGCGTGATTCATCGTGAGTTGTACTCCTTACAAATAAAGAACCTAGCTTTATTCGACGCTCACACCGCACGACTGCACCCACCCCACGGTGGCACCAAGCGTTTACTTTGCACGGCTGTGAACTACGTCAAATACCACCCTCAAGAGTAATGGCAATGCCACTTTTTTGCCTGTCGACTCATCGTTTGCGCAGGTCACCCACACAAGACAATGATCACATGTGAATTTCAACACTCTTTTTAATCGGTTTACTTTACCCCCGAAAGTGCATTTTTAGCAACGTTTCGCCGACAACGGCGCTGGAGCACCATCGAGAAGGACGCGGATAAAACATCTCAAACACCCCCAAAAAGCCAGAGACGTTAACCCCGAATAGTCTTTAACAGTGAAACCCAGCACTTTTCGACTATCTTTAGCCACGAGGGGTAATGTTTTTCTCTTACTTCCCCTAAAAGTTGTCGTTTCCACCGCTAGGTTAACCTGCGTTCTGCGTTCAAACTGTCATAAGGCAGATCATCTTTATGGAGGAAACAGTTAAAAATATCACTAATTTATCTATATATTTCGCGAGGATTAATCGGCCCATGCATCAGTCCTCGACGTCATTGTTCGAGATAGGAGTACATCGTGAATGGCACTGTGGATCACATCGAGAATACGTCGGTAGATAATACGCCAGAGGTCTCCCCGCAGACGGTGCTTGCAATTGCTCTAGAGATGTTCTCCGAACTCGGTTTCTCCGATGCCAAGCTTGAAGCAATCGCAAAGCAATCCGGGATGTCTAAACGAATGATCCATTATCACTTTGGTGATAAGAAGGGTCTTTATCGCAGATGTTTGGAAGAAGCAGTCCGCAGGCTCAGACCATCAGCGGAGGAAATGCAGCTAGAAACTCCTGTTCCTGTCGATGGAGTCCGCAAAGTCGTGGAAGCTGTTTTCCGAACCTATGTCATGCACCCAGAATCTATCCGTATTTTGCAGATGGAAAACCTGCATCACTTTGGCAAAATCGCAGAGGCGAGCCCACTCTCCGATCAGTCTTCCATCATGCTTCAGCTAGACAAATTGCTCATGCTGGGACAAGACGCCGGCGCGTTCCGCCCTGGGATCTCCGCACAGGATGTGTTCACCCTCATCGCCTCGCTAGCGGTTTTCCGAGTCAATTCTCGCTCCACTACGCTCAACCTCTACAGCGTAGACATGATGGATGAGGACAACACTCAAGGTATGGCACGTCTGGCAGTCGACGCGGTTTTGGCATTCCTCACATCAAACCTCAAAGGCTCTGATGACGTCAGTTACCTCACGTCGACGGCCCCCAGTGATTCGATCACACGCAACATGGAGGAAGCCTCCTACCAGGTCGACGCAGACCCATTTTCTTAAATAATCAGTATCTAAAAAGCGGTTCCACTATCCCGTGGAACCGCCTTTTTTCATCCTTTTTTACGCAGTCTTTTCTTCTCCAGCTTGTTCAGACATCTGCGGAGGTTCTTCGCCTCGCACCGATGCTGCCGTCACCGTGACCACTCCCACATCTTCTAGGTCAGGAATATCGTACATAACGGGGACAAGAAGTTCTTCCATGATGGCTCGCAGCCCGCGTGCGCCGGTACCACGCTCTATTGCCAGATCAGCAATGATCTCAAGGGCATCATCATCAAATTCCAGAGCCACCCCATCCATCTCAAAAAGACGCTGATACTGCTTTACCAGAGAATTTTTAGGCTCGGTGAGCACCTTGACCAGAGACCTTTGGTCCAAATTCCCCACGGTGGCAACGATCGGCAACCGGCCAATAAACTCTGGGATCAGCCCAAACTTGACTAGGTCTTCTGGCAGGACTTCTTTGAATACGTCAATCTCGTCGATATCTTCTTTTGTGGTCAGCTCGGCACCAAAACCAATGCCTTTTTTGCCACGACGCTCCTCGATAACCTTTTCCAACCCCGCAAATGCCCCTGCAACGATAAACAAAATGTTTGAGGTATCCAGCTGGATAAAATCTTGATTGGGGTGCTTACGCCCGCCCTGTGGCGGGATGGAAGCCACTGTCCCCTCAAGAATCTTGAGCAATGCCTGCTGCACGCCCTCACCGGACACATCTCGCGTAATAGACGGGTTATCAGACTTCCGAGAAATCTTATCTACCTCATCGACATAGATAATTCCCCGTTGCGCACGCTGGACATCAAACTCTGCGGCCTGCAACAACTTCAAAAGAATATTCTCTACGTCTTCACCCACATAGCCGGCCTCGGTCAGTGACGTGGCGTCTGCGATGGCGAAGGGGACGTCGAGAAGCCTAGCCAAGGTCTGTGCCAGATACGTCTTTCCGGAACCCGTTGGCCCAAGCATCAAAATGTTGGACTTTTGCAGCTCTGTCTCGTCGTCTTTATTTTTACGCGTGCTCAGAACACGAGATTCTTCAGCACGCACCCGTTTGTAGTGGTTATACACCGCCACAGACAGGATTCGTTTAGCGTCATCTTGCCCAATCACATACTTATCAAGGAACGCGGAAATCTCCGAAGGGCGCGGCAGCTTAGTGCCCTCGTCCTTCTTCTCCTCAGCAGCAGTGCTGAGCTCTTCCTCGATGATCTCATTGCATAGTTCGATACATTCGTCGCAGATATAAACTCCGCCACCCGCGATCAGCTTCTTTACCTGCTTCTGACTCTTTCCACAGAAGGAGCATTTGAGCAGGTCAGCGCTTTCTTGCATACGTGTCATGGTGAGGTTGAAGCTCTCAATTCTTCCCAGCCGTCAGCGGCCAAACGGATTAGTAATCGACTGTCCTTCACTTTAGCGGGTACGTCAAACTAACGGGCTCGTACGCGCCCAGCGCCTTTTTACCCCAAGGCACAAGACAATCCACCGTAGCAGAATTATCCCTTAGTTTTCTCTGATTAGTGGCTGCTGTTTGCCGGCCAACATATGACGGAATCGCGCTAGTCGTTGGGTATCGCACGGCACTGGCTCACCATCAAGCTCACTCACCCGAGAAATTCCGTGCAACGCATTGACAACCCACAGCTCAGCTCCGTGAAGCGCTGAGCGATCATACGTTTTCTGGGTCACTCGATACCCTAATTCCCGAGCAATCTCCTTGACCATGCGCTCGGTTACTGAGCTTAAGCGTTCTGCCTGCATGGACACGAGTTCATTGTCTTTCCAGGCCATGAGTGCTGCAGTGGTCGTCTCATGTACTCCCCGTGGGGAAACAAGCACAGCATCGTCGGTGGGCACTGCATTTTTGATCCTTAACAGTTCAGACAGGTCGGGCCCTTTTATCGTCGGATGTTGAGTTTGCCCAGGCTGCACCATAACGGTGGTCAGCGTCGTAGCTTCTCGACGCTTCGGCGCAGGCCTAAACCGCACGCGCAACGTGCCGCCGGGGAGGCTTTCCAAGCGAGGAAACCACTCCCCTTGCAGAGGAAGCTCTCGCTTGACGGCGGCTAAGAATCTATCGATCTCGTCTTCTCGCGCTGTCTGCAGCTCGAGACAACTGGCGCGGAAGCGCCGGCAATGGGAGTCAAAGCCGCGGGCGTGGCCGTCGACAAGCAAAAAGGAATCAATCACCGTCGGGGGTTGCGTGTGGAGTGCCGGTTGGAGCCTGGCGCCGTCAAATTCCAGAAGCTCTTCGTGGGGAAAGCCCTGCGCAACCAGGCTCAGCAACGGTGCAGACTTTGTGACTATCTCCTCCCATTCTGCGTCGGCGTTCGATAACGCGATAATGGCGCCCCCTACCCCATAGCTCAGGCGCTGCTCCTGCACGACCGCGGTACGGATCGTCATTGCCAGATCCATGTTCCCGTCCACGGAGATGAAGCCGACCGCCCCGGAATAGACCCCGCGTGGGTGTCCTTCCAGCTCAGCGATAATCTCCATCGTCCTATACTTGGGAGCACCCGTCATTGAGCCGCCGGGGAAAGCCGCGCGGATCACATCCACGGGGGTTGCAGACTCTCTCACTTTCCCGCTGACAGTGGAGACCAGCTGATGTGCCCTGGAAAAAGTCTTTACTTGGCACAACTCGTCTACCCGTACGCTGCCGTATTCGCAGACATGTGCAAGATCATTGCGCACTAAATCCACAATCATGAGGTTTTCTGCACGGTCTTTCTTATTCGTGGCCAAGTCGTGCCGCATGTCGGCATCTTTTTTCTCGTCTTGACACCGCGCGCGAGTGCCTTTGATCGGCTCTGAAGAGACCACCCCTTGGCTCATCTTTAAAAATCGCTCCGGTGACGAGCTCACTACGTGGGTATCACCTAGAACAAGCAGCGAACGCATGGGGGCTGGAGCGATCTCTGTGAGGCGTCGATATGCCGCAGGCGCGTCAAAGTCACCGTCAATGGGGGCTTCTAGCTGGGTGGTCAGGCACACCTCATACGTTGCGCCTTGGCTAATGAGCTCTTGAATCTCAGTAATACTGTGCAGATACTTTCGTCGCGATTCCCGCACTCGCAGTCGCCCCACCGCAGAAGGGTCAAAAGTCCCCACGGGGGGCGCAGCTTGTATCTGCGCCATCGCAGCGTCACACCAGTCTATGGCTTCCCGTGTCTCGCGATCATTGCGTGAGACAAGCGCTACCAGCTGCAAACGCCCTCGTTCCATAACCACAATCCGCTCAGCAAAAAACATTTTTAGTGCTTCACCGAGTACGGGAGCTTGTGCCTGTGCCTGCGGACCAAAGTCAGGATGGTTCGCCTCATAGCCTACGTAGCCGAACCATCCTGGTAGTGCCGCAGGGATCGCGCCATCGTTGCTCTCAACGTGCACCTGCGGGCATGCGTCCATGCTCAGAGACAAGGCATCGAGGGAATCAATTATCTTCCCACCATCATGAGGTGCAATGATGCTTTTACCCTCGAACTCCACCAGCACGCCCTGTCCACCGAGAGCAGCAAAAATATCAACGGGTCCTACGCCCTCTGGGACGGCTGACTCACGCACCACAACAGACCTGCTGTGAGAGGCACGGCCAGTCTCGCGCTGCTCCGCGATCCCCACAAAGTTAGCAATGAGCTGCTCGCCGTATTCGCTACCGATAGACTCCGGGTGAAATTGCACTCCCCACCACGGTTTTGTCCTGTGTTCTAGGGCCATAACAAGACCCTCAGCATTCCTCGCGGTCACCTGCATGGAGTCGGGAACGTCCGTGACCACGAGCGAGTGGTAGCGCACCACAGTAAAAGAATCCGCGATTTTTTCCCACAATGGGCCGGTGGATGTCACCCAGATGGTATCTTCCCTGCCGTGGACCGGCTCGGGCGCCTTTTCTACCCTCCCGCCCTCAACGTAAGCCATCGCTTGCATACCCAGGCATACGCCCAGTACCGGAACCTCCGAATGCTCCAACGCTGCAGCAGAGACTCCCAAGTCTGAAGGAATAGTGGGAGTGCCAGGTCCCGGAGAAATCACGACCGCGTCAAAGGCATCAGGGGACGCGCTGCCCTGCTTACTGATGCGATTGATATCCAGGGGGGCGTCGTTACGCATCACCGTGACCGCAGCCCCGCAGCGCGCAAGATAGTCCACAATGTTGTACGTAAAAGAATCGTAATTATCGATAACTAAAACGCGGAGAGCAGTCTTCTCGGGCTCGCCGCTAGCTTCTCGACGTCCCCTCGCATCGCAAGCAGGTGTAATCACTCTCCCGATTATGGCAAAGTCCCCGCAGCACCAATAATTTAGGGTGCGGCGGGGACTTCCTTAAGACCAGAACTTGCTAGCCGTTGAGCTTGCGATAATCAAAAACCTGGTCAATGATGCCGTATTCCACGGCTTCTTGTGCGGTGAGAATCTTATCGCGGTCAGTATCAATGCGAACCTGCTCAGGAGTACGACCAGTGTGACGCGAAAGGGTCTGCTCCATCAGCGCACGCATGCGCTCGATCTCTTTCGCCTGAATCTCCAGGTCCGAAACCTGTCCCTGCGTCCCCTGGGTAGCAGGCTGGTGAATCAGCACACGGGAGTTAGGCAAGCAGGCACGCTTACCTGGTGCGCCCGCAGCGAGCAGCACCGCTGCTGCCGACGCCGCCTGACCCAAGCAGACGGTGCGCACGTCAGGACGCACGTACTGCATGGTGTCGTAGATAGCCATCAGGGAGGTAAAAGAGCCGCCTGGAGAGTTGATGTACATCGTGATGTCACGATCAGGATCAAGCCCCTCCAACACCAATAGCTGCGCCATGATGTCATTGGCAGAGGTATCATCCACCTGCGTGCCCAAGAAGATAATGCGCTCTTCAAAGAGCTTGGCGTAAGGATTAGTCTCCTTGGTGCCATAAGTAGAGTGCTCGATAAACGAAGGCAGCACGTAACGAGCAGAGGGCATCTGCATTCCGGAAGTAGTCATGTATATTCTCCTGCCTTAGTTAGAAAGCGGACCCTGAGCGGACGCAATCACGTGGTCGACAATGCCATATTCTTTAGCCTGCGCAGCCGTAAACCAACGGTCACGATCAGAATCCTTGGTGATCTGCTCAAAAGTCTGACCGGTGTGTTCTGCGATAAGCTCAGCCATCTCACGCTTGGTCTGCGCGAACTGCTCAGCCTGAATAGCGATATCAGCTGCCGTTCCGCCCACACCTGCTGAAGGCTGGTGCATCATAATGCGTGCGTGAGGAAGAGCATAGCGCTTCCCCTTTGTGCCACCGGAGAGCAAGAACTGCCCCATAGAAGCAGCAAGCCCCATGCCGTACGTAGCGATATCGCAAGGCGAGTACTTCATCGTGTCATAAATGGCCATACCGGCAGTGACAGAGCCACCCGGCGAGTTAATGTACAGGGAAATATCGCGGTGCGGATCCTCAGCCGACAGCAGCAAGATCTGTGCGCAGAGCTTATTTGCGATTTCATCATCCACTTGCGTTCCAAGGAAGATAATTCGCTCGCGCAAGAGGCGCTCATACACGGAATCACTCAGGTTCAGGCCAGCGCTGCCCTGAGCCATGCGGATCTGGTCACTCATTCCAGTACTCCTTACAGATTTTCATTTTTAGCCAGCCACGTGACTATCGAATGCAACCCACGTTACCTGCCTATTAGTCCTCCCATGGCACCTGTTCGCTCACAGCGTAGGACAAACTAACAAGCAGAGCCACTATCCCACCGCTATCCCACAGCGATACAGCCTGTTGCGCGGTTGTATTGCCCAGTTGTCGTTTTTGAGAACGCTAGCAGGGTCGAGTTCTACAAACCCCCACGTCGAGTTTTCTAGCGTTCTCAAAAACGACAAAACTTCTATCCCGGCATAGCAAAAGGCCTCAATGAGCGTTGCATCTCATTGAGGCCTTTTGTGTATAAGCCTATTACTTAGCCTCGTCCTCGGAAGCTTCTGCTTCTTCCTCACCAAAGTACTGAGTCGGGTCAACTACGTTGCCTTCCTCGTCCTTAACGGAGACACGGCAGATAGCTGCTGCCAAAGCCTTGCCACGGCGAACATCGGAGAACAAGTTAGCGATCTGACCGGACTGCTGCAGCTGCATCACAAACTGGTTTGGCTCCATGCCATAATTCTGTGCAGTGAAGAGGATATGATCGGTCAGCTCCTGCTGAGAGACCTCAGGTGCTTCCTGCTCTGCAAGAGCGTCGAGGAACAGCTGAGTACGGACTGCCTCTTCAGAGTTCTTACGGTTCTCTGCGTCGAAGTCTTCGCGAGTGGTTCCCTGTGCTTCAAGAGCAGCGTTCAAAGCGGCCTCATCGTGAGCGAGCTGGCCAAGCAGCTGGTGCAGCTGAGCCTCAACCTGCTCGTTAACAACACCTTCCGGAAGCTCGAATGTGGACTCTGCGAGTGCCGCCTTCAAGACTTCATCACGAATAGCAGTTGCCTGTGCGGCCTTTGCCTTTTCCTCAACCTGTGCGGTGACGGACTCGCGGAGTTCCTCAACGGTGTCAAACTCGGAAGCCATCTGCACGAACTCTTCGTCGACCTCAGGAAGCTTGCGCTCCTTGGTCTGCTGAACAGTTACGGTGACAGTTGCTTCTTTACCAGCGTACTCGCCAGCCTGAAGAGTGGTGGTGAATTCTGCAGACTCACCGGTCTTGAGACCACGGAGAGCGGTATCTAGGCCGTCAATGAGATCGCCAGCACCAACCTGGTAAGACATACCCTCGGTTGTAGCTTCTTCGATTGCCTCACCGTCAACGGAAGCCGCAAGATCAATGGTGGCAAAGTCGTTGGTCTTCAGCTTGCGCTTGGTGTCCTTCAGCTCGCCGAAGCGCTCACGCAAACGATCAATCTCAGCGTCGACTGCTTCTTCATCAACCTTGAGTGCAGGAACCTCTACATTGAAGGCAGCAAAATCAGGGACGGTGATTTCTGGACGAACATCGACCTCAGCAGTGAACTCAACAACGTCGTTGTCTTCGATCTTTGTGATATCGATAGCAGGCTGACCCAAGACCACGAGCTCGTTCTCTTCACAGACCTGCTGGTAACGGGTAGGAAGCATGTCGTTGACAACCTGCTCCAAAACCGGTCCGCGACCGATGCGGGCGTCGATAAGCTGGCGCGGCGCTTTGCCACGACGGAAGCCTGGAATGTTGATCTGCTGAGCCAACGCCTTGTAGGCCTGATCGATCTCAGACTTCAGCTCCTCGAAAGGAACCTCAACGTTGAGCTTGACGCGGGTGTCGCTCAGCTTTTCGACGGAACTCTTCACGAGCCATTCTCCTGATCGTTGGGTACTCAATAAATTTAAAAAATCCCTCCACCCTTACCAAGTGGAGGGATCTACGTCGGGACGACAGGATTTGAACCTGCGACCCCCTGCTCCCAAAGCAGGTGCGCTACCAAACTGCGCCACGTCCCGTTGTCGAACTTTCAAACAAACGCGCACACAATACAAACATGGAATACGGATTCTCCGCACCAAGTTCGTACATGCAGTGTACAAGCTGCCCCTAAAAAGTTACGCCATAGAATCGGTTTTTCCCAAAACGCCAGTTCAAAGGCACTAAAAGAGGAAATTACTACTACACGTTAGGACAATTTTTCGCGCTTCAGTCTTCCGCCGTCGCTTACCCCCGCCCACTCCACAAAAGCCAAGAATTGTCGTTTTTGAGAACGCTAGCGACCATGCGCCCCACAAAACCGCCAGTCAAATTTTTCAGCGTTCTCAAAAACGACAATCCCCCACTCACAACAACATTGCAAAAAGCCGGCAACCTTGAATCAAGGCCACCGGCTTTTAAAAGTGGAGGGAATGACGGGAATCGAACCCGCGTCTTCAGCTTGGAAGGCTGAGGTATTAGCCACTATACGACATTCCCACTAGCGTTTGAGCGTGTCGAACGCTGTGCGCATAACAGTAGCGGACTTTCGCGATAAGAACAAAACCCCAGCACCTCCCCGTAACTTTCACAATTCCACGTACCACCACCCACCCTTTCCCGCCTGTCAGTAAAACCAATTTTTCTACATAATTTCTCTAAAGTTTTTCACCTCTCAAAAATCTTCAAACCCCTCTTTTACCCAACACCAAGGCCCGATAACCTGCACAAAAGACACATTATTGATTAGTGAGTTATTATGTTTTCATGCAACCGAAGATCATCGACGCAGATACCGGCCGCGAATTGTGGACCGCAAATGACTGCGCAGAGTTCTCGGGCACGGCTCGTGGCACTTTTACGAGTTACGCAGGTCGAGGACGTGCCCCTTCCCCGTAGCTAAACACCATGGTCTAACTTTGTGGGATTCCGAAGCAGTCAAAGAATGGGTCAATGCCCGCAACGCTATGAAAGACCCAGAACCTGAAGACCCCGTCGCAGATGTATAAAGTCACATCTCGCAAAGTTTCTCTTCCCGCGTTCCTCAGCCCCCATCCGCAAGGTTAGGCTGAGGAACTTTTTGTTTCCTTGGAACGTTGGTCAAAGTAATTAACGTCCCCAATTTTCCACGGAGGCATTCACAATGATTGCGTTATTACTTGTCGTCCTCGTTGTCGGCGGTGCCGTCTTCTTCCTCAGCAGAGGCAATAAAAGTAACCGGCAAATCGAGTCAGACAATCTCCAAGACTCAATCGCAGAAGCTCGCCGTTGGATCGAAAGACTAGGGTCACAAGTACTCACGATCTCAGGCACCGATGCGGCTTCCACACAAGCCATTGCCGACGCCTCAGAGCGCTACAACGCAGCATCTTCCCAAATCAGCACGGCCACGACGGTGCGCCAAGCAGAATTAGCACGCGAGTCTGCACTAGAAGGCCTGCACTATATGAATGCCGCCCGCGAAATCATGGGACTGCCTGCTGGTCCTGAGCTACCGCCGCTAGAAGGCCAACGCCAAGCAGGAAAAGTCACCGAAAACCGAACCATCGACTTCGAGGGCCAACAAATCACGGCCTCTCCTCACGCAACACCCGATACCCCGAATTACTACCCTGGCGGAATGGTTGCTGGCCGCCCGGTTCCTGCCGGATGGTATTCCGAGCCGTGGTGGGCGGGCGCGCTACGTAGTGGCCTCTGGACTGCCGGATCCGTTTTTCTCTTTAGCTCATTGTTCAACGGTATGTCTGGCGTCGGGTACAGCGCTCATGCCTTTGAGTCCGGCTTTGATAAGGGCTATGCAGAAGGCCTTGCAGCCAATGGTGGCGATGGCGGCGGAGACATAGGAGACGTCGGAGGAGATTCCGGCGATGACGGTGGATTCTTTGATGGTTTCTTCGGCGGCGACGGTGGCGATGGCGGCGGATTCGACTTTGACTTCGATTTTTAGCCGCTAAGCCCCTCCGCAAAGTGTGAAGGTGCCCGTGCGTCCATGCACAGGCACCTTCACTTTTGTTTGTTGCTATTCCTGCCTATGGCCGCGCATACGGCGCCGGGTTTCTCCGCGGCCGTAGCGTCACGTTGGGCAACGTGGGGGCTTCAATTCGCGCTAATCCCTGCGCGTTGATCCCCTCGCCCCGAGCATTCACTCCGCCATGGCCTTGATACCCGACGACCTTGCCAAAACGATCTTCTTGGGCTTGCCAATCATCTCTGCTCTTGCGGATCTCTTCGGTCGACCGTCCTACAAAGTTCCACCACATAACGATCTCTTCCGCAAACGGCTGCCCACCGATCAGCAGTACTCTGCCGTCGCTATCGCTGTGGTTGGATATCACGATGCGTTCCGTGCCAACCCCTACATAAGCAAGCGCCGCGTGGGGAATAGCAACGTCTTCTACCGTGATCGTTCCAGCGTCGACAAGCACTCCATGCTCAAAAGCTGGATCGAGGGGTATGGCGAGGGAAGAATTCTTCTTGACACGTATTTCCGCTCCTACCAGCGGGGTAAACGTTTGTACTGGACTTTTCATGCCGCAGAGCTCGCCAAGAAAAACAGTTGCCTGCCCCTCCCCTAGATCTACAGGTTCAGGAGAATAATGTTCGAAGGAGCGTGGCGCTATGTTCCGGGCTGAATCCGGCAACGCAATCCAGAGCTGGACGCCGTGGAGAGTTTCTGTGGTTGGCGTAGATACCTCCGAATGGCAAATTCCAGCTCCCGAAGTCATTAAATTAACCTCTCCAGGGCGCACCGTGCCGACGTTTCCGCCTGAGTCGCGGTGCTCTATTTCCCCACGGAACAACCAGCTAACGGTCTGCAAACCAGTGTGAGGATGCGGCGCCACATCCATGCCTCCAGTCCGAGAAACCTCATCGGGGCCATAGTGATCTACAAAGCACCATGCCCCGATCATCGTCCGTTTTTTCTGAGGTAGTGTCCGACGCACGGTCATCGCACGTAATCCCCCGAGTGGAACCTCCCGAGCAGTGATGATTTCCACTGGGGCCCGCATGGGCTGACTGTTGCTATGCATATTCGTCATCGCTTTCCACCTCATCGATTGCACGTAGTAAAAAATCCCCGATATCACGGCTGGAGAGAGGCCTCGGATATCAGGGATCGTTAGTTGTAAAAAGACAGCTGTAACGCTAGAGAAGAATTAAGCTTCCGGAAGCGCCGGGGCGATTCCTGTGCGCTCGTATTCGGAGAGGATATCGATGCGCCGCTGGTGGCGTTCCTCCTTGCTCCACTCCTGTTCAATAAAGGCGTCAACGATCTGGAGTGCTTCTTCTTCAGAATGCATCCGGCCGCCCAAACCGATCAGCTGGGCGTTGTTGTGCTCACGAGCCAGCCGGGCAGTTTCCACCGACCATGCCAATGCGCAACGAGCGCCCTTGACTTTATTAGCGGCGATCTGTTCGCCGTTACCCGAACCACCCAACACGATGCCGAGTGAACCAGGGTCATTAACCACGCGGCTAGCAGCCTCAATACAGTACGCCGGATAGTCATCCTGGGCGTCATAAGTGTGTGCACCGCAGTCAATAACCTCATGTCCCTTGGTCTTGAGGTGTTCCGCGATAATGTTCTTCATTTCGAACCCTGCATGGTCCGCTCCGAGGTAAACGCGCATAGTTTCAAAGTGTATCGCATCACCGCACCCCATTAAGAAGGGGACAAATCACCTCATACCTTCTCCCCTTGGCTACTTAACTTCCCTTAAGGCCTTAACAAGGAAATATATCGTTATATGAGTCGCTGGATGGGATCATACGCAGACCACACGCAGATCATACGAGTGCACCACTAAAAAACATGTGACGCACTGCACCTCGTACAAAGCCTCCTACAGGGGTTAGTCAAAGCGAGGGCTCTCTGTCCTCGACCGCTTTAGCTCGAAGAAATAGGGATAGGAACCTAGTGCGATGGAGGCGTCGAAAAGCTTCCCAGCTTCTTCCCCACGCGGAATCCGCGTGATAACAGGACCAAAGAACGCAGAATCTCCCAGCTTGATCACTGGTGTCCCCACATCGTTGCCCACTAGTTCCATGGCACCGCGATGATATTCACGCAATTCTTTATCCCAAGTTTCAGTGTTAGCCACTTCTGCCAACGTGGGATCCAATCCGACCGTGGACAGTGCAGACTTAATGATCTCGTCATAAGCGCCAAAGCCCTGCTTACCGCCTTGACCGTTGTTATGAATCTCAGTTCCCATGACTGTATAGAGCTCATCAAGCTTATCGGGGTGCTGTGAGGCGACTGCCGCGAACACTCGCGCTGGCCCCCAGTTCGCCTTCATCTTCTCTTTGTAGTCTTCCGGCAGCTCGTCTCGACCTTCGTTGAGAACAGAAAGGCTCATCGGCACCCATTCAACGTGAATGTCGCGAACCTGCTCCACTTCTTTGATCCACCGAGAGGTCACCCAACAAAAGGGGCAGCTCACGTCGAACCAGAATGTCACGTGCTCGGTCATAGAAGTCTCCTTTAATCTCTCATGCGAGTTTCTCATCCAACGCACTTCGAAGTTCTCGGCCCTTAGCCTTCCCTCAAGTGCGCAACACTGCCCTTTTCAACGCATCTCCCCACGCGATCATTCCACGCCATGCTTGTCCTTCTGCGTTTGTCCTACTGCAAAAAAGACTTTGCCCAATAGCCGGTTGCACTGTGGCATCCAGTTGTAAAGTAAATCCACAGATCAAACGCAGTACCCACAGAGCTTTTCGCGTCTAGCCCTCTGGCTAGCATCATCCCAAGCTAAAGGAGCTCCTTAATCACATGTCTTCCATCAATCTCACGCAAGCAGAAGCCGAGCAGCGTTCCCGTATCCTTGACGTGCATCATTACGATATTGCTCTCGATTTGACTGAAGGGGATAAAGAGTTTCCTTCTATCACCACAGTGTCTTTCACAGTAAAAGAAGCCGGGGATACTTTTATTGACCTGCGGGCAGCATCCGTAGCAGAGGTGCTTCTCGACGGCACAGACGTCACAGCAGCAGCGGTTCCGCTGACCCCCTCCGGCTACGATGAGACCCAGGGACTCGCTTTACGAGGGCTAACTCCAGGACTGCATTCCCTTACTGTGACCGCTTCATGCGTCTATTCGCATACAGGACAGGGGCTTCACCGCTTTATTGATCCGGAAGATCAGCGCGTCTATTTGTACACGCAGTTTGAAACAGCTGATGCCAAACGCATGTTTGCGTGCTTTGACCAGCCGGATCTCAAGGCTACCTACGCTTTTACTATCACAGCACCCACAGCGTGGAAAGTAATCACCAACGCGCATACCCAGATAACAACCGCCGCGGACAAAGCTATCCATAGGGCACACGTGGATTACAAGCTTTCCACGTATCTGGTCGCGTTGTGCGCCGGTGATTATCACGAGGTCTCAGACACCTGGTCTGGCGCGCTGACGCACCATCCGGAGACTCCCGCAGACCAGCCCACAGCGCTAGAGATTCCAATGTCCATCTACTGCCGGAAATCATTGGCACAGTATTTAGACGCGGACACCTTGTTGAGGGAAACCAAGCAAGGGTTCACGTTCTATCACGAGAACTTTGGCATGGCGTACCCGTTTTACAAGTATGACCAGATCTTTGTCCCGGAGTTCAACATGGGCGCCATGGAGAACGCCGGTGCGGTGACCTTCCGCGATGAGTATGTCTTTTCGTCTAAGGTCACCAAATACCGCTATGAGCGTCGGTGCGACACAATCCTGCACGAGATGGCTCACATGTGGTTTGGCGATCTTGTCACCATGAAGTGGTGGGGCGACCTCTGGCTCAACGAGTCCTTTGCCACGTGGGCCGCGGCGATCTCTCAAGCCGAGGCAACGGAATACTCCACGGCATGGGTCACTTTTGCCAATGTGGAAAAGTCATGGGCCTATCACCAAGACCAACTCCCCTCCACGCATCCGATCACAGCCGACGCCTCCGATATTGAAACCGTTGAGCAGAACTTCGACGGCATTACTTATGCCAAGGGTTCTTCGGTGCTCAAACAGCTACAGGCTTTTGTAGGGCGAGATGCCTTCCTGGCGGGTGTTCGCAAGCACTTTGCCAACCACGCGTTTGCTAACGCGACCTTTGATGACCTGCTTGGAGCCTTTGAAGAGGCCTCCGGCCGCGATCTTTCCCAATGGGCCGACCAATGGCTCAAGACCACTGGAATTAACAAGCTTTCCCCCACCTTTACCGTCAAGGATGGCGTTTATTCCGAGTTCGCCGTTCAGCAAAGTGGAGCAGCGCCCGGCGCAGGAGAGCTTCGTACCCACAGGATCGCCGTGGGGCTTTATTCGCTTATCGACGGCCAGGTGAAGAGAACTCATCGATGTGAGATTGACGTTGAAGGCACATCCACGCCGGTTCCAGAAGTGGTCGGCCTTGCGCAAGCCGATCTGATTCTGGTCAACGATGACGACCTCACGTATTGCTTGATGCAGCTGGACCCAGCTTCATTGGATTTCATCGTGAACAACATCGATAAAATCTCTGATCCCATGGCGCGCACACTGTGTTGGTCCGCCGCATGGGAAATGACAAGAGACGGCAGTATGCGCGCTCGTGACTTTGTCACGCTCGTTGCACGTGGCGCCCAGTTTGAGACTGAAATTGCAGTGCTTGAACGCATCCTAAGCCAGGCTGCAAAGGCTGTACGGTCCTACGTAGATCCTGCGTGGGCGGATAGCACCGGACGCGACATGCTTGCCGACGCTCTCTTGGTCGGTGCTCGCAACGCGCAGGCTGGTTCTGACGCGCAGCTGGCGTTTGTCCAAGCTCTGGCAAAAATACGCATTACCAAAGATGCTGCCGCCGAATTTGCTGCGATTGTGCAAGGTTCTACGTCACTTCCAGGCCTTACCGTAGATTCTGACCTGCGCTGGTGGGCACTGACAGCGCTCATCGCTCATGGTGAGATCACCGGAACAGCGGTTCACGAGAGCATAGAGAAACTACGAGGCATAGATCGTTCTTCAGCCGGTGAGCTTGCGGCGTTGCGGGCATATGCAGCACAACCGGATGCTGATGTTAAGGCCGACATCTTTGATGAGGTCACCGACACAAAGAACACGCTTTCTAACCTCTTCTTGCGCCATAAGCTCGAAGGTCTTACTTTCACAGGATCTGGCCCATACCTCGCGCAATTTAACTCCGCAGTTTTTGCCTTGGCAGAGAAGATCTGGGCTGAGATGTCTTCTGAGGTCGCGCTGGTGACGCTGTCCGGAATCTATCCTTATTGGGATATTTCCGCGCAAGGTGTAGAAAACGCCCACGCGTTCCTTAACAAGGACTCTTTGCCAGCTGGTGTTCGCCGGGTGGTGTCTGAAGGGATGTCTGAGCAAGAACGTGCCTTGCGCCTACGTGAAATAGACGCACGCTAGTTTTGTTTCGCCCCTATCCGTAGGACGTTAACTAGTACTTCCTGCGGATAGAGGATTCTCCTCGAGGGGGTTTCCTCATGGATTACATACACAACCTCCCCTCCCGAAGCTTTAGTGCACAGCACGCTGCTACGGTGGGCAACGATGACAATCTTTAGCGCTACAGAACAGACCGAGTCGGTCTCCTCTTGGTTAGGATCCGTAAACACCCAAGAATGGCTGATAGACAAACCGATCCAGATAGGGATCACTATCGTTATCGGTCTCATCGCAAACTGGCTTCTGCGGAAAACCATTACCAAAGCGGCTCACATCAACATCAACAAAAAGCCCTCGAAGATTTCTTCGGTGCTACCTCTGCGGGGAAAGACCGCTAACAAGTCTTCTGAGGCCCTCAGTGCGACTCAGGAACAACGTAGACAATCGCGGATGCTCACACTTGCAGCGGTAGGACGCTCTGCAGTCTCTGTCGTGGTGTGGGTGTGGGTGTGCCTTGCTGTCCTCACGTACCTGGGGATCAACGTCACCCCAATCGTGGCATCAGCAGGCGTCGTCGGCGTGGCTCTTGGTTTTGGCGCGCAATCGTTGGTCAAAGACTTCTTATCAGGTGTCTTTATGCTCATCGAGGACCAATATGGTGTTGGCGACACTATCGACGTAGGAGACATCGTGGGAACCGTCGAAGACGTCAGCCTTCGACTCACAACATTGCGCGATATTCACGGCACCCAGTGGTTTGTCCGCAACGGAGAAATTCTTAGGATCGGTAACTTCAGTCAAGAATATGCCGTGGCCCTCATCAATATTCCGGTTGCACTAGACGAGAACGCTAGCGCAGCCATTGAAGCTGTCACTGACGCAGTAAACGCCGCTTCACAAGAGCCGGCTATTAACGACGTACTCCTCGACTCCCCTATCGTCGATGGTGTTAATTCCATTGGGCTCGACCACATGCTGATCCGCGCCCGAGTGACAACCCTCCCGGATCAACAATGGTATGTCACCCGAGAGCTAACAGCGCGCGTCCTCACTGCTCTCCAGCACAACAACATTGATACCCCTTATCCTGAAGGCATTGCGGCTTCACGACGCATCTCCGACTAACACCCCCTCAGAAAACGAAAGTCCCATGTCTACTCCCCAATCCTTTTACGATGCTGTGGGCGGCGAGTCCACATTCCGTGCATTCGTCCACCGCTTCTACGAGCTCGTCCGCACCGATGACATCCTGGGCCCCATGTACCCGCACGACGATTGGGAGGGTGCGGAAGACCGACTCCGTTGGTTCCTCGTCCAATACTGGGGTGGCCCTCAAACGTTCTCAGAGAATCGTGGGCATCCGCGCCTTCGTATGCGGCATGCGCACTTTCCTATCGACCAAGCCGCCGCAGACCGCTGGCTACTACTCATGGAATCCGCGCTGGATAGCCTCGATGAAGAGACTTTGCCTACCGCCTATCGCGCTGCTCTGTGGGATCATATGCAGCGCGTAGCCGATATGCTCATCAACCGCGCTTCTTAGTCGAGTTTCGCATGTTGTGGATTCTTCTTGGTGTCGCCGCCGGGGCGGTTATCCCCATCCAAACGGTTGTTAATACTCGGCTCAGCGCATCCACTGGAACACCGTTTTCCTCCTCAATGATCTCCTTTTGCGTGGGGACGCTAACCCTCGCGGTAGCTCTCATAGCTGCCACAGGCCAGCTTCCCAATGTCTCAGCTGCTTATCACGCCCCCGCGTGGATTTGGCTTGGCGGGCTTTTGGGGGTTATCGCCCTTACCGCGAATATTTTCATGTTCCCCCGGCTCGGCGCAGTACAAACTGTTGTTTTGCCTATCTCAGGGCAAATCTTCATGGGGCTAGCCATCGATCATTGGGGCTTATTCGACGCTCCGCAGAACTCGATAACCCCGCTCCGAGTAGTCGGAGCGCTCTTGGTGTTTGCCGGAGTACTCGCCACCGTGGGGAAACCGAAAAGTTCTGAGACTCAATCGGGGAGCTTTATTCACTGGATCTGGCGGCTTGCCGGCGTTTCATTTGGCATGCTTACCGCCATGCAGTCAGCCATCAACGGCAGGCTTGGCGCTGTGCTTCACTCGGCGGCAACAGCGGCACTCGTGTCATTCTCTGTAGGAGCCGCAGCCCTTATTGTGCTCAACATCGTTTTACGGTGGCGTCCACGGATACAACGCCTCGGGTCTTCTCATCCCTGGTGGATGTGGTTTGGAGGAAGCCTCGGGGCATTGTTTGTTTTTGCTAACGCGGCATTGGTGCCAAAGATCGGAACGGGCCTTACCGTCGTTGCAGCGTTACTCGGGATGATGATCTCAAGCATTGCAATTGAGCGCATCCGTGGTGGGCACAGTGGAATCCGGCAGATTCTCGGAGTAGTTGCAATGCTCATAGGAATCGTAGCGATCCGACTGCTTTAGCGCCTACGGCAACAGGGAAAGTCCTTGAGACTCATAAACCGTTCCGTACGGAGCATCAATGCGTGCCCACCGTCCGGTGCTGGATACTCGGAGATGCCTCGGTATATGCGCTGGTGCGGTAGCCGGAGGGATAAATCCCAGGGAGGTGCAGGCAAAGATCATTCGCATCGAAATCTCCACGGAATTATCTGCATTAGATACCGTCATAACTTTTTGATCCAACAGGCTCGCGGGCGGCCCCATCGGGCCGGAAAACTGTCGAGCCAAAGAACGTCCTTTTTCTGCAAGGTCGCTGGCAACACCTACGGGAATTTCGTCGATAAGCTTAAAGCCCGTTAATGGGGGAAGGGCACCTGCCCACGAGGCGTCATGCGGTTTGCCTTCACCTCGTGCCAGCAAGTCAGTAGCCAGTACAACCGCGCCATCCCTCGATGCTTCTCCCCTGACTCTCCGCGATGCGACCACGCCAAAAGGAGTAGTCACATACACGTTCACGCAGTCTTCTACCTGCTGCAGCCGTGCATAAGCGTGTCTATCTAAGCCAACAGCGCGCTGCAGCAGGCTCGTTATTCCCGTTGCCGGGCCTTCGGGAATACTGAGGCTCTCGTTCATCTTGATTAACGCGCCGATTCTGCAATCTTGGTAAGAACCTGCAGCTCATGGGCCTCAATCTCACGCGGACGCGCAGTCTTAAGATCCACTGCTACCTGCACGGCGAGCACAACCGCGCACACGTGTCCCTCGCAGTCCTTCAAACTTTGACGAGTGGTAAACGAGGTAGTCCCGATCTGGACAATGGAGGTTTCAACCTCCACCTCAGCGGTACTGGGCATGATGGGACGCAAATAGTCCACTTCAATTCGGCGAACAAATACCGCAGGAATATCGTGCCCCTGCGCCAAAAACTCATCATTCGCCCACCTGGTGCGCGCTTCTTGAGCTAAATCAACATACGCTGAATTTGTCACGTGACCGAAACGGTCGAAATCAGTCCACCGAACGGGAACTGTAGTTGTATGGACTTGTTGTGTGGATTCTGCCGCCATGCTCTACTTTCTCAAAGTTTTCTATCAGGTTAAAGCACACATCATTCTTCATCTGTGTGTGCTGTTGCACAATGTTAGCCTCGTAACACGTGTTATGTGCAACGATGCCCCTCAGCACTCGTGAACCGGTTTTCCGATCACACATCAAGTACCGAGGGGCATCAACTAGCGCAGTACACTACCGGCCATAATCGCTATGACCTAGCGGGTCAGTTTACGGTGTGTGACGCGAGATGGACGTGCGGCATCAGCGCCGAGACGCTCAACTTTGTTCTTTTCATAATCCTCAAAGTTGCCCTCGAACCAGAACCACTGGCCCTCTTCCACGTTGCCTTCCCATGCAAGGATGTGCGTACACGTACGGTCCAGGAACCAACGGTCGTGAGAGATCACCACGGCACAACCCGGGAACTTCTGGAGAGCGTTTTCCAGGGAACCCAACGTCTCCACGTCGAGGTCGTTGGTTGGCTCATCGAGAAGAATCAGGTTGCCGCCCTGCTTCAACGTCAAGGCGAGGTTCAGTCGGTTGCGCTCACCACCAGACAAGACCTTGGATGGCTTCTGCTGGTCAGGGCCTTTAAAACCAAAGGCGGACAGATACGCGCGCGAAGGCATCTCGTTTTGTCCAACGTGGATGTAATCGAGGCCGTCGGAGACAACTTCCCACACGGTCTTTTCTGGGTCGATGTTCTCACGGTTCTGGTCGACATAGCTGAGCTTGACGGTTTGTCCAACCTTGACGTCACCGGAATCCGGGTTCTCTAGTCCTACGATGGTCTTGAACAGCGTCGACTTACCCACGCCATTGGGGCCAATAACGCCCACAATGCCGTTGCGTGGCAGGGTGAAGGAAAGGTCTTTAATCAGAACGCGGCCATCAAAGCCCTTATCCAGGTGGTCCACTTCCACGACCTGGTTGCCCAAACGCGGCGGAGTAGGAATCTGGATTTCTTCAAAGTCGAGCTTCTTGTACTGCTCAGCCTCTGCAGCCATTTCCTCGTAACGCTGCAGACGTGCCTTGTTCTTAGCCTGACGCGCCTTAGCTCCCGAGCGCACCCACGCAAGCTCTTCCTTCAGTCGCTTCTGCAGCTTCTGGTCCTTCTTGCCGGCAACCTCTAGTCGCTGAGCCTTAGTCTCCAGGTAGGTAGAGTAGTTGCCCTCGTAAGGGTAAAGCTTTCCACGGTCAACCTCACAAATCCAGCCTGCAACATGGTCCAGGAAGTAACGATCGTGAGTAACTGCCAAGACAGCGCCCTTATAGTCTGCAAGGTGCTTTTCCAGCCACAGCACGGATTCTGCGTCCAGGTGGTTGGTGGGCTCGTCGAGAAGCAAAAGGTCGGGCTCAGAGAGCAGCAGCTTAGCCAGTGCTACTCGACGACGCTCGCCTCCCGAGAGGTGCGTAACCGGATCGTCGGACGGCGGGCAGCGCAGTGCCTCAAGCGCTTGGTCGATCTTAGAGTCGATCTCCCAGGCGTCAGCGGCATCCAGTTCTTCCTGGAGCTTGCCCATCTCATCCATGAGCTCATCGGTGTAATTGGTCGCCATTTCCTCGGCGATCTGCTCGAAGCGCTGCTTCTTCTCAAAAATCTCGCCGAGGCCTTCCTCAACGTTTCCACGAACGGTCTTGTCCTCGTTCAGTGGCGGCTCCTGAAGAAGAATGCCCACGGTAGCACCGGGGTCAAGGAATGCTTCGCCGTTGGACGGCTGGTCAAGTCCAGCCATGATCTTGAGGATCGAAGACTTACCAGCGCCGTTCGGGCCCACAACGCCGATCTTGGCGCCTGGGTAAAAGGCCATGGTGACATTGTCCAAAATGAGCTTGTCACCGATAGCCTTGCGCACGTTTTTCATCGTGTAGATGAATTCGCCCACAGTGATATTCCCCTTTAAATGTTGAAATAGGTTTCAGTCACGTCAAAGGGTACATCACTCTTCCTATAAATTCGCCCCGCGCTGCGCTCTCCTAACCTAAAAGAGGTTTTATTTCCTCACCGTGTGAGCCCCGCCTCCCCCTGTTAGTCGACGAGTTTTTTAAAAGGCGGTTAAAAGGGCGGATTGTGGGAACTCGTTGTCGTTGCTCCCACCAACTCACCGGACTCCTTCGCGGCACCACCTTCTTCATTCAAAAAGTCATTGGTTCTTGAATAGTCCCGGTCTGGGATAAAGGGTTGGTTCTCCGTATTCTCTGGAGCTGGTAGCCCAGGGACATGGTGGATGTCCTCCACAGTCGATTTACGCGAAGACACAACGTAACGGCTCAACTCAAACCCCACGTAGCTAGCTTTGAGAACGATCTTTGATGCCAGCTTTCCCGTTGCGGCATCAGTCCATTCTTGTGTGACCAAATAACCAGAACAGATCACGGGGCGCCCACGTTCCAACGACATCCGGGCGTTCACACCTAGCTGACCCCAGCATTCCACGTCAATGTAGTTTTGGTCCGTATCAATCCACTGAGACTCTTCCCCGGGAACATCGTTGTTGACAGGATGCTTGCGCATCCTGCGACTCGCAGCGATACGAAACTTGCAGACTCCTCCACTAGTGAATTGCACGAATGTTGGCTCGTTGGTGAGGTTTCCTACGATGGTGCTTTGAATATGCATGATGTGATTCCATTCCTTGAGTTCATTGGTATGAGCTGATAACCACGCAGGTCTGCGGCATGTCGCAGAACTTTGTTCCGTAGCACAAGGGCTAACGCCGATTCCCTGCCCTCCACATCTTGCGCACCTTTCCCCCACACGAATATCCCTCTCACCGTGCAGACTGTCTTTCTGTGGACAACTCTGCTTCTAGTCGCAAGCTATCCACAGAAGCTCTCCGAATAGGCTGCGGGCTCTTGATGTTGTGAACCTTTTATGCTGCGGGACGAATCGCCCGCTAGCGCACTCAAGCCCCTCCGCCCATCACCAGCAGTTTCTGATAAATATCTATCTGTTTAATAACTTTTCTGAACCTAAGCCACATTTTGCGGTAAGCTTTTGTCAATAAATTTTTGATTCTTCTTCGGATCACCCCCGAGCCACCACCCCGCTTATCGTCACGCTCGTTCCCGTCGAAAAGCCCTTTGTGCTCTAGAAATCCGACGATGAGTTGAGGGTTTGTGTGAGATTGCTCCCCAAAGGAATGGTGATCGTATGACCCGTATCGTACAGAATTCCCCACGTCCCTCCGGCTTGCCAGAGACTGAGAATTCCACAGCGGCTCAGGGATGGCGACCATTCCTGCACAGTAATGTGATGTTCTCGGAGACACCCACGGGCGTGGAGTTTCACGATGACTTCCGCCGGTTTTCTATCGACGGCCCCGGCGCATACAAAGCTTTCCGTGCCGCAACGCCCCTTTTTGAAGGCGACGTACTCTTCGCAGATGCTCTCCAACACCTGGGCGCTGCCGGTGCTCAGACCATCAAGCTATTCGAATCCGAGTTATACCGGCATGGGATGCTCACCCGGCTGACCCCGGAAAACCCTTCTGTTGCCGCATACCAGTGGGGCGGCCCCTGGGACGGGATCTTGAGGCTCCTCGCAAACTACACGCCCACTCCCATTGCTGTTCTGGAGAAGATCAGCCACACGCCTTTTACTATCTGCTGCAACCACCCGGAAAGCGCTCATCTAATCAAAGCAAGTTTGGAAGAAAACGGCTGTGCAACAGTGCGTTGTGCAGCGCTTTCTCCGGATACGTCGCACCTCCGCCCCCTGTTCTCTCTGAGCTGGCGCGAACTCAGCGACGCCGCCACCATCACCGTCCGCAGAATTGGCAGCGGATTAGTCATCGCATCAGATGTCGCAGCTGAGGCAGCGGCTGCGACGCGCGTGTTGGAGTCTTTATCCACTGAGAAAGAAACCGCTGAGGTTCCAGAAGCTCTTATCCGCATGGCTGGAATCGTCGCGTCCTTTGAGGCTTTCAAGATTGTCTCTGGCGTCATGCCAGCAACACTCACACAAGCACTAGTTCGCATCGACCTTTCCACCGGTGAGGTGACCCAACACCAGCTAGGACAGCAGCAGCACGGTAATGGCGATTCTCCCCTGTTTTCCCTGGTTGATCCACTCCTGGGTTTCACGCCACGCTTCCTTGACGACGATATAACTCAGATGCCCTTGCGACTATCTCAAGTATCCGTGACCGGTCCCGATCAACGTCGTTGGATCGTCACCGGTTGGGCGCTAGAGACTCTGGAGGCTGCACGTGAACGCGCAGTTGAGGAAGCCACTACCCGTGTCCTACTGTCCCAACAACTAACACAGACTATGCCGGACGCCCCTTTAATGCAGCCCACTGGGCTTCCCACACCGGCTGCCGTGGGAAGTTCCCAGGAAGAAGCCATGGAACGCGCTCTCCCGCGCGCAGCCGCTTATTGGGCTTTTCACTCTGCAACGCAGGGCACCTCGGCGGTTGTTCCGTTTAATGCAAGTCCTGAGGCCTCATCCTGCGTACGAGATGCTCAGGAACTCTATGGTGCAGCCACGTCTACGTTTTTGGAGCTCACGCCTCTGTGGGACCATTCGGTAGTCGTTGTCCAATGTGACAATCCGATTCTTAGCGTCGTGGCGGCGGGAGAGTCGGCGGAACAAGCCGCGGTATCCGCCGCCTACGGCCACCTCGCGCTTGCACAACTGCATGCGGACAAGCACTGTCCGCCGCACGACGGCTGGTCGCATGCACTTCCGGCGTTAGCCTTAAGAGAATCCAGCATGCGCTGCCTTGCAGAGGAACCGGTGACTGAGCTTTTAACAGATCCCCGTCTCTCTGCCGCCGGGCTCACTGCCGTAGCGTTGCGCCCACGCACGTGGATACCTGCTTCCCCGCCCGCCGTCGACCCGGCCGTTCATCGGCCTCTTGTCTCATGACTACGCGCGTCGTCTACCTGCACTGCCCGGCGCGCAGTTTCCCAGCGCACGCCCTGCCTCCGCGTCATGAGCAGTGGATAGCAGTCCCCTCCGAGGTATGCAGGGTCTGCCTTAGCGCATGGGCGGATCGCACGGAAGAGGCTGCGGACTGGGTTAAGGCTGCTAGAGTACCCGGCTTCATCAGACGCTGGCTGTCTGAGCGTGTGCACTATGTCGAGCGCCCCCAAGGAGGCTGGAGACTAGCAGCTACTGATGCACAGTGCCTCTCCAGTTCTACGGTGTTTATTCCGCCAAGCCCGTGCTGCACTGAGCACCACGATTGTGCTGCTAACGCAGAGGTTTCCGATCTCACAGGGCCTGTGTTGGCACCGTGTGGAACGGGGCCGATCACGGAGATTCGCCGTCCTGGAATGGTGGTTTCCCGTGGAACCATGCCTGCTGTGGGATCTCGCCCAGCATTCCATTGGAGCGGTCAGGCACCAACTATTGCTGAGAGTCGAAAGCTAGCTCTTTACGAGGCCGTCGAGCGCGCATCCGCATGTGGGAATGAGGGAGCGAGAGGCGTCGATACGCATATTCCCTACGTCCCTGCCACTGATTTTGGGGTGGACAACGAACGGTGGAACCGTTCTTATGATCACTGCCGCGATTGGACACGCGCTATTCGCCTGGGAGATGAAACCGCATGGGCAGTCCCCACGGATATGGCGTTTTTCTGGTCTGACGCACAAGCTCGTTTTTGCTTCGATTCCTCCAGCGGCGCCGCGGTGGGGCACACATGGGAGGACGCGGTGATGTCCGGGTTAGTAGAGGTCATCGAACGTGACGCTGTCCTGGCCGTATGGCATGGTTCCATGACCGTCCCAGAGATCGATGTTGACTCGATCAATGATCGCACTTACCAGGCAATGCTGCGGCATCTTCGCAGGCAAGGGCTGGTTATCCGCGCGTTTTACTGCCCGCTGAGTGTAGGAGTTCCTGCTGTTATTGCGGTATGCACGGATACTGAGCGCACTTTCCTCTGTGTTGGGGCAGCCGCCGCACCCGATCCTTATGTCGCAGTACGCAAAGCTCTACGGGAGGTCATGGCGGATTATCCGCAGTCGCGTTTGCTTGCTAGTGCACGGTTGTCAGACGCCGCCGCAGTGCGTGCGGACGGTTCCGGGGCAGCTCATCGCCTCAGCGTTGCCGCTTCCGAGCTTATCGACGCCGCCGCTTTCCTCCTCCTGCCCCGTAAAGAGCTGCTCCGTGTCTCCGACATCCCGGGGTGCCCTCGCTTGTCTCTCGTTGAGCTAGTCGAGCGTCTCAAAGCACATGGCTTTTACGGCTATGTTGTCGACTTCACCCAGAGTTATCACCAAATGGTGGGGCTTTCAGCTGTCAAGGTTATTGTCCCTGGTCTCTTGCCGCTGGAATATATCGGACAGCTGACCAGGGCATTGCATATGCCTAGGTTGAGGCAACAGATGACATGTTTTCGCGCTTTGGGTCTAGCCCCTCCCAGCTCACCTCCCCGTCTCAATCTTGTTCCGCACCCGCTTCCCTAATGTTCCTCATGCTTCGCTGAAAGGTGGTGAGTCCTATGGATCGCCATACTGCAGATTTTTGGGCCACAACGGATAACGCTGCGACCGAATACACGCAGCTCATTCTGGAGCGCAAAGAACATGGCATGGTCTTCCCTGCTCAAGGTCCTTTTTGGAACCATCAACCCTATCCCGCAAAAATTGTTCCAGACGCCCCTCGTTTCCAGCTTCATACACACGCCATGTCCCCCACGGACATCGCTATTGCGCAAGCGCTGGAGGATTCTCTGATCAGAACTCATCTGCGCGCAGAAGTCGACTGCAACTCGCCGACAAGGACCCGATCTGAGGCGCAATCTTTTCAGTGGTCTCGGAATACAGCCTCAGGCGGAGGCTTATACCCAGTGAACGTCTACAGATACAGCCCCGGAGATAGCCACCTGCCCGCTGGCCTGTACTTATTCAACCCCATAACCTGCCAATGGCAGCAGTTGAGGGCTGATTCCCCACGAGGCGAACGCTCGCGTTCTGCCGGTGAGACCCTGCTGGTTACCGTGGAGTTTTGGCGTTCAGCGTTTAAGTACGGTGACTTTGCGTATCAAGCGACCTCCGTGGATGTGGGGATCGTTGTCGCGGCGTTAGTCTCCCAACTGGATGCAGCTGTTGGGCCGGTGGCGATCGACTGGTCACCCGACGAGCTTGCTCTCTCCGAATTCCTTGGTAATGATCCCCTCGACGAGGCCATATATTGCACGATCACGCTGCCTAACGGCTCCCCTTCATACACTGTTACCGCTGGCGCGCCATCAGCCGTTCTCCAGACTGCTGCTCGACTCGCACACTCGGGAACGATGCCCGTACGTTTTCCCACCACTGTGGCTTTGCAAAAACAGCGGCTCCGAGAAATGCAGAGTATGCGTGCGCCTTTTTCTACGGAAAGAATCGCCGCCCCGCCACCAGCAATAATCAAGCGAGGCTCCAGCTCTTTTGGGCGCTATTCCGGAGCTCCAATCGACGTGGGTGTGCTTACCCGCATGGTGCAACGCGGGCGCGCCACGGCCGCTTCACTCCTGGGAACCCCTCCTGAAACCGATTATTCCTCGGGTATCCAAGCAGCCGCGCTGTGCGTCAACGTCATGGGCTTAGCACATTCGCTTATAGCGGATAGTGAGACATATCCTGCGGCGGCTCCTGCGCGCCCGTGTCCACAGCTTCCCGAGTTATTGCGCAACACCTATCTCCTGAAAAACTACGACCCCCTGCGCAGCTCAGCCGTCTTAGTGCTGTGCGCAGATTTGCAGCGGGTAACCACTACTTACGGCGCCTCTGGTTATCGGTGGGCATGTGCTGAGGTCGGGGCCTTTTGTCACGCTGTCTATGCTGTTGCGGCACAGGAGCGAGTCTCTGTGGGCGCTGTGTTGGGCTTTGATGCTCAGTACCAGCGCAACTACCTCGGTTTAGCGGATAATCTCATCCCGGTCCTCAACATCCTTGTGGGTGTGGATCGGCCTCACGCACGGTGGAGGAACTCGTTACTATGACAACCATTTCATGGAAGCTTCTTAATGAAGCCCTGGTTCGCAGCTGCAGTGTGCCGTATGAGGCGTTGCGGCAGCTTTGCGACGCCCCCACCGACGCACTACTTACTGCCGATCTGGACCAGCGACTCCGCGTAGACGCGGCCCTTGATGCCCTTGGTTCTGCGGCCCGCCAGGACGTCGCGGGGTGTACACACACAGTGTTGCGCCACACGCTACTCGATCTCAAGCGAGCCGCACACCACCATGACGTGCGCAAAGTTCGTGCCCTACTGGCTAAAGCTACATCCGCTGGTGTGCGGTTGCCTGCTGGCGTAACGACTGCCGCAGATACGGTGATCACTGCCGCCGAAAACGTGCTGTCCTCAGAGCAGCTGCACAAGGTACTCATGAGTTCCAAGAAAAGAGAGCGCGAGTGCCTCGGGAAGATCGCCCGTGACCTGGGAGTCGATTCAGCAGTACTGGCCGCATCAGTCCCAGCGGCAAAAGCCATCCGGAAACTGAGCACAGATTGTGCGATGTCTGCCAAGCAGCTAGCTCGTGCCCACCGCACCGCGCTGGGATACGTGATCCGCAGCGCCACCCGGTCAGTACCGTTTTCCGCCCTGTGCGCAATCGCGCCGTCTCAGTTGTCCCACGCCAGCCTGCACTCGGATGAGACTCTTGTCCCAACATCTGTGCACACTATTACGCGTTGGAACGTCTACGCCATGGCTCAGATTTTCTCTGCTATGAAGAAGGACTTTGGTTTTATTGCCACACTTCCGGTTCTGGTTAATCCTGACGCGTTATCAGAGCATGGTCACTGTGCACTGCCTCGGTGTTCTGTCGAATATCTAGGGCACGTCGGCGACCGGGACTTGGCCGTTTACCGGGAGGAACGCCGAGTGGTCGATTCCAATGGACTATTTGGAAAGGTCATGGCCTTAGCCGCAAGTGCTCCCCAAAACCACGAATACACGTGTGAGCAGCTGGCCGCAGAGCTCTCGGCGCGTACAGGTTTAAGCAAGGCACAAACAAAGTGCATTGTTCTCGATGCCATCCGAATTTCGGCCTTAGTTGTTCCTACTCTCGACCTATCGCCGTCCACAGCGGTATCTGAACAGCCGATAGTTACGCACTTGGCACGTGGATCGGACAAGGCCGTATCTGCTGCGCGTCTCATTGCCCGCATCGCGGAGGAATGCAACGCGGTGGCGTCGATAAGCGACTTTGACCGCAGGCACACTCAGATCCTAGATCTCGCGCACCACTTGGATAGCCTCCGGCGACTCGTCGATCCCAGCATGCCGATGTTCCATACCCATGTGTATGAGGATGGGATCGGGAAAGAATCGACGATTCCCTCCTCCATCGCGGACTCGTGCACAGCCCTTGACTGGGAAGCCCTCGCAGACCTGGTCGATCTGCTTGATGTGCGGCAGGCAGAACGCGCGTTGTTTGAGGAGTTTGTGTCCGCAAAATTCCCCCGTGGGGAAATCTGCCACGATGTTCCGGCGCTGGTCAACAGTTTTGTCGCCGAGGTCCTCACGCCACTGCGCCAAATTGATATTGAGGCAGTCGATGAAAATGACCTGAAGTCCACTGCATCTTTGCCGCTGGGTAAGGCTTGGGAATGGATCCGTGCGCGGCGTCGCTTCCTGGCACACGTGGCAGCATTAAGAAGCAACGCCGCTGGCCCCGTGGACATACGTGATCTCCTCATCGATCATCGCCCTCTTGTGCAATCGCGTAGATATCCGCTTCGGTCTTTGAACGCTTATGTTCAGCAAGGCGACAAAGCCAAGATCGTGATTAACCGGACACTTGGCGGGCCAGGTTTCCCGTGGTCTCGGTTCGCGCATGCGATGCCAGATTCCGCAACACGTGCATGGTCCGAATTATCTGATTATGCGTCTGATGCGGGAGTGAAACTCGTCGAGCTGACAGCCGGAAAAGTCGTCTCCAACCTCAACGCGCATCCGGCTACCTATCCAACCACCCTTCTCATTCCGGGACACCCTCGAAAAACAACACGTGCGTCTGATATTCGACTCGCAGACACGCATCTGGCCTACTGCGCATCCAGTGGACGCTTGCAGCTTTTCGACGCCCACGGCACCGAACTCCTCCCCGCATACATGGGGTACATCACCGACCGAGGTCTGCCATTATCTACGCAGGCGCTCATGCTCCTAGCACCACCTATGCATTGCTCTTTGGACTTTTTCCCGCGTACGAATTCAGAGATAACGCATCAAGCGCGTCTGATGCTAGGAGATGCTGTCCTCGCGCGAGAAAGCTGGATCTTTCCTACTTCAAGCACCTTTATGAATATCCCGTGTCTCTTAGAGGAAGCCTTGGTATGGTGGCGCAGCTTTTCCCGGTATCAGGGTCTACCCGAGTGTGGAGTTCTGCGAACATTCGACGCCCACGGCGTTGTGGGCAAAGGACAGTTTTACAACTCCCAGATCATGGGAACGATCACCAACCTCATACGTGCATTACGCAACGCTCACTACGGATGCGTCATTGAGGAATTCTTTCCTCTTGTTGGCGAGTCCGGCGTGGCACAGGAACACATAGTCACAGGAACACGCTCTTTGAAGGAGGGTATCTAATGCGCACAACCGACCCCTGGCTCGCTTTTCATATTTTTTATGGAGAAGATCCCTCTCTTTTGCTCCGCGACTGCCTACTGCCCTTTGCCCATACATGCGTGACAGAAGGCCTAGTACAACGGTTCTTCCATATGAATTACTGGTTGGAGGGCGCACATGTCCGCTTAAGGCTAGAGCTTTGCAATCCGGCAGATCGCGAGCACGTCATATGCGCAGCACATCAGGCTATCCAACCGTGGATTGACGCTCATCCTTCCAGCGCACCCCAACTGTCGCTGCGCAATCCCGAAGGCTATCGGCGCCTCTTTGAACACGAGTACCCCATGTCCAGGTTCTCCGACTACATCGATCAGGATGGCCTGCCGCGCCTAGAACCAGATAACTGCATCCGAGAGCGCTACTACGAGCGTGAATATGACCGCTACGGCGGACGGATAGGGATGTCGCTTAGCCAAGATGTTTTTCAAACCTCGACATTGTTCGTCGAAAAGCTTTTACACTCTGGCGTTTTAGAAGCACGTACTTCACGATTAGCCGCGGCCGCGCTAGGAATGATCTGCACTGCGCATACGGTTTTGGATTCCGACGCATCCATAGGACGCTTTTGGGAAAGCTACCACGCAGGTTGGACCTCGTCTTTTTCTATGCCCACCAGCTACACCTCGCCCACAGCACAGCACAAAATCTCCGCTGAGGCTCAGGCGCTCTCACGCGCAGCCACGCCATTTCGTGAAGCCTTACGCCGCACCCCAGACGGCTCAGGTCTCCCCGAACCCTATGCCACTTTCACTCAACACATGACGTCTGCAGTGGAGAAACTCCATGCGGCGCATGCCTCTCACGAGCTTGATTTTGGTGATTCTGCGGCAGAGGCCGGCCACACGTGGAAAGAGAACTCCATTTTTCTTCTGCGCTCTTATGTCCACATGACCAACAACAGGATGTCCGTCAATATCTCAGATGAAGCCTATTTAGCTTATTTACTGCACGCGTTATATCAGGAGGCAATATGACCAGCATCCACGATCTCCGTTTTGCTCCCGGCGCGGCATTGGGTCCGATTATCCACAGTCCTGATCGAAGTTTCGCAGTAGCACGTAATCCGCAGGGCGAACTCAGCGAACTAGGCCCACAGGCCTGGATGCTGCTGGATCTCTTCCGGCAACGCGATAATGGGAATAGCATCGACATGAGAATGCTTGTCATTCCGCAGGTTGCTCAAGCCATACGTGCGATGGAGCACCGGGGACTGCTTACTGCGGCCCCGGATACCGGCGCGTCTTCGCATCCCGTAGGCGTGTTGGGGCGGCCCCGTGGAGTGTGGTTCGTTTCTTTTGTCACCTTAGTCATGGCACTTGCGGCGGCATCGACGGCCCTGCTCCTGTCTGCGGGCTCCGCTTTTACCCAAACGCATAGCGCGCCTAATCTCACCGGTTTTTTCACAATTCTCGTCGGCGTAATCATCACCATTATTGCCCACGAGTGTGCTCATGGAATCGCCTTTTTTCTGCTGTGCGGAATCAAACCGGCAGTATTTTCCACTGCCTCTTTTCCCCGCCGATTCCTCAGCTTGCAATTACCGGGAATATTGGCAATATCTTCTCGCGCCGGGAAGGTTGCAGTCCTTGCGGTCGGCCCTGCCACCACTCTCATATTCACAGGGCTTTCCGTATTTCTCTACAACAAAGGGTTGCTTCCCGACGGCGCCTCCTGGCTCCCCAGAATCTTATTTCTCACCTTTGTTGGCTCCCTTATTCCTGTTCCACATTCAGACGGAACCAAGATTTTGGAGACGATATCCAGAACCAATAATTTACCAAAATTTGCATGGAATTTTGCACGGAGAAAACAACTACGTTCTGAGATATTTCAACAGCAAATCGTTACCATAATTGTTATTTACATTTCACTTTATCTGGGTACCATAATGTTATGGGCAATCATAGTGCTATTCTTTATAGTTTTCCCTGGTAGTCCATATCAGTAG
Protein sequences of DBSCAN-SWA_5 >LS483402|2415298:2451328|2424902_2425532_-|SQG59319.1|protease|DBSCAN-SWA MTTSGMQMPSARYVLPSFIEHSTYGTKETNPYAKLFEERIIFLGTQVDDTSANDIMAQLLVLEGLDPDRDITMYINSPGGSFTSLMAIYDTMQYVRPDVRTVCLGQAASAAAVLLAAGAPGKRACLPNSRVLIHQPATQGTQGQVSDLEIQAKEIERMRALMEQTLSRHTGRTPEQVRIDTDRDKILTAQEAVEYGIIDQVFDYRKLNG >LS483402|2415298:2451328|2439019_2440690_-|SQG59332.1|DBSCAN-SWA MGEFIYTMKNVRKAIGDKLILDNVTMAFYPGAKIGVVGPNGAGKSSILKIMAGLDQPSNGEAFLDPGATVGILLQEPPLNEDKTVRGNVEEGLGEIFEKKQRFEQIAEEMATNYTDELMDEMGKLQEELDAADAWEIDSKIDQALEALRCPPSDDPVTHLSGGERRRVALAKLLLSEPDLLLLDEPTNHLDAESVLWLEKHLADYKGAVLAVTHDRYFLDHVAGWICEVDRGKLYPYEGNYSTYLETKAQRLEVAGKKDQKLQKRLKEELAWVRSGAKARQAKNKARLQRYEEMAAEAEQYKKLDFEEIQIPTPPRLGNQVVEVDHLDKGFDGRVLIKDLSFTLPRNGIVGVIGPNGVGKSTLFKTIVGLENPDSGDVKVGQTVKLSYVDQNRENIDPEKTVWEVVSDGLDYIHVGQNEMPSRAYLSAFGFKGPDQQKPSKVLSGGERNRLNLALTLKQGGNLILLDEPTNDLDVETLGSLENALQKFPGCAVVISHDRWFLDRTCTHILAWEGNVEEGQWFWFEGNFEDYEKNKVERLGADAARPSRVTHRKLTR >LS483402|2415298:2451328|2419795_2420554_+|SQG59316.1|DBSCAN-SWA MNGTVDHIENTSVDNTPEVSPQTVLAIALEMFSELGFSDAKLEAIAKQSGMSKRMIHYHFGDKKGLYRRCLEEAVRRLRPSAEEMQLETPVPVDGVRKVVEAVFRTYVMHPESIRILQMENLHHFGKIAEASPLSDQSSIMLQLDKLLMLGQDAGAFRPGISAQDVFTLIASLAVFRVNSRSTTLNLYSVDMMDEDNTQGMARLAVDAVLAFLTSNLKGSDDVSYLTSTAPSDSITRNMEEASYQVDADPFS >LS483402|2415298:2451328|2422063_2424826_-|SQG59318.1|DBSCAN-SWA MITPACDARGRREASGEPEKTALRVLVIDNYDSFTYNIVDYLARCGAAVTVMRNDAPLDINRISKQGSASPDAFDAVVISPGPGTPTIPSDLGVSAAALEHSEVPVLGVCLGMQAMAYVEGGRVEKAPEPVHGREDTIWVTSTGPLWEKIADSFTVVRYHSLVVTDVPDSMQVTARNAEGLVMALEHRTKPWWGVQFHPESIGSEYGEQLIANFVGIAEQRETGRASHSRSVVVRESAVPEGVGPVDIFAALGGQGVLVEFEGKSIIAPHDGGKIIDSLDALSLSMDACPQVHVESNDGAIPAALPGWFGYVGYEANHPDFGPQAQAQAPVLGEALKMFFAERIVVMERGRLQLVALVSRNDRETREAIDWCDAAMAQIQAAPPVGTFDPSAVGRLRVRESRRKYLHSITEIQELISQGATYEVCLTTQLEAPIDGDFDAPAAYRRLTEIAPAPMRSLLVLGDTHVVSSSPERFLKMSQGVVSSEPIKGTRARCQDEKKDADMRHDLATNKKDRAENLMIVDLVRNDLAHVCEYGSVRVDELCQVKTFSRAHQLVSTVSGKVRESATPVDVIRAAFPGGSMTGAPKYRTMEIIAELEGHPRGVYSGAVGFISVDGNMDLAMTIRTAVVQEQRLSYGVGGAIIALSNADAEWEEIVTKSAPLLSLVAQGFPHEELLEFDGARLQPALHTQPPTVIDSFLLVDGHARGFDSHCRRFRASCLELQTAREDEIDRFLAAVKRELPLQGEWFPRLESLPGGTLRVRFRPAPKRREATTLTTVMVQPGQTQHPTIKGPDLSELLRIKNAVPTDDAVLVSPRGVHETTTAALMAWKDNELVSMQAERLSSVTERMVKEIARELGYRVTQKTYDRSALHGAELWVVNALHGISRVSELDGEPVPCDTQRLARFRHMLAGKQQPLIREN >LS483402|2415298:2451328|2438389_2438818_-|SQG59331.1|DBSCAN-SWA MAAESTQQVHTTTVPVRWTDFDRFGHVTNSAYVDLAQEARTRWANDEFLAQGHDIPAVFVRRIEVDYLRPIMPSTAEVEVETSIVQIGTTSFTTRQSLKDCEGHVCAVVLAVQVAVDLKTARPREIEAHELQVLTKIAESAR >LS483402|2415298:2451328|2429979_2430990_-|SQG59323.1|DBSCAN-SWA MTNMHSNSQPMRAPVEIITAREVPLGGLRAMTVRRTLPQKKRTMIGAWCFVDHYGPDEVSRTGGMDVAPHPHTGLQTVSWLFRGEIEHRDSGGNVGTVRPGEVNLMTSGAGICHSEVSTPTTETLHGVQLWIALPDSARNIAPRSFEHYSPEPVDLGEGQATVFLGELCGMKSPVQTFTPLVGAEIRVKKNSSLAIPLDPAFEHGVLVDAGTITVEDVAIPHAALAYVGVGTERIVISNHSDSDGRVLLIGGQPFAEEIVMWWNFVGRSTEEIRKSRDDWQAQEDRFGKVVGYQGHGGVNARGEGINAQGLARIEAPTLPNVTLRPRRNPAPYARP >LS483402|2415298:2451328|2435486_2436446_+|SQG59327.1|DBSCAN-SWA MTIFSATEQTESVSSWLGSVNTQEWLIDKPIQIGITIVIGLIANWLLRKTITKAAHININKKPSKISSVLPLRGKTANKSSEALSATQEQRRQSRMLTLAAVGRSAVSVVVWVWVCLAVLTYLGINVTPIVASAGVVGVALGFGAQSLVKDFLSGVFMLIEDQYGVGDTIDVGDIVGTVEDVSLRLTTLRDIHGTQWFVRNGEILRIGNFSQEYAVALINIPVALDENASAAIEAVTDAVNAASQEPAINDVLLDSPIVDGVNSIGLDHMLIRARVTTLPDQQWYVTRELTARVLTALQHNNIDTPYPEGIAASRRISD >LS483402|2415298:2451328|2425547_2426147_-|SQG59320.1|protease|DBSCAN-SWA MSDQIRMAQGSAGLNLSDSVYERLLRERIIFLGTQVDDEIANKLCAQILLLSAEDPHRDISLYINSPGGSVTAGMAIYDTMKYSPCDIATYGMGLAASMGQFLLSGGTKGKRYALPHARIMMHQPSAGVGGTAADIAIQAEQFAQTKREMAELIAEHTGQTFEQITKDSDRDRWFTAAQAKEYGIVDHVIASAQGPLSN >LS483402|2415298:2451328|2429093_2429897_+|SQG59322.1|DBSCAN-SWA MIALLLVVLVVGGAVFFLSRGNKSNRQIESDNLQDSIAEARRWIERLGSQVLTISGTDAASTQAIADASERYNAASSQISTATTVRQAELARESALEGLHYMNAAREIMGLPAGPELPPLEGQRQAGKVTENRTIDFEGQQITASPHATPDTPNYYPGGMVAGRPVPAGWYSEPWWAGALRSGLWTAGSVFLFSSLFNGMSGVGYSAHAFESGFDKGYAEGLAANGGDGGGDIGDVGGDSGDDGGFFDGFFGGDGGDGGGFDFDFDF >LS483402|2415298:2451328|2445259_2446744_+|SQG59336.1|DBSCAN-SWA MDRHTADFWATTDNAATEYTQLILERKEHGMVFPAQGPFWNHQPYPAKIVPDAPRFQLHTHAMSPTDIAIAQALEDSLIRTHLRAEVDCNSPTRTRSEAQSFQWSRNTASGGGLYPVNVYRYSPGDSHLPAGLYLFNPITCQWQQLRADSPRGERSRSAGETLLVTVEFWRSAFKYGDFAYQATSVDVGIVVAALVSQLDAAVGPVAIDWSPDELALSEFLGNDPLDEAIYCTITLPNGSPSYTVTAGAPSAVLQTAARLAHSGTMPVRFPTTVALQKQRLREMQSMRAPFSTERIAAPPPAIIKRGSSSFGRYSGAPIDVGVLTRMVQRGRATAASLLGTPPETDYSSGIQAAALCVNVMGLAHSLIADSETYPAAAPARPCPQLPELLRNTYLLKNYDPLRSSAVLVLCADLQRVTTTYGASGYRWACAEVGAFCHAVYAVAAQERVSVGAVLGFDAQYQRNYLGLADNLIPVLNILVGVDRPHARWRNSLL >LS483402|2415298:2451328|2432726_2435342_+|SQG59326.1|DBSCAN-SWA MSSINLTQAEAEQRSRILDVHHYDIALDLTEGDKEFPSITTVSFTVKEAGDTFIDLRAASVAEVLLDGTDVTAAAVPLTPSGYDETQGLALRGLTPGLHSLTVTASCVYSHTGQGLHRFIDPEDQRVYLYTQFETADAKRMFACFDQPDLKATYAFTITAPTAWKVITNAHTQITTAADKAIHRAHVDYKLSTYLVALCAGDYHEVSDTWSGALTHHPETPADQPTALEIPMSIYCRKSLAQYLDADTLLRETKQGFTFYHENFGMAYPFYKYDQIFVPEFNMGAMENAGAVTFRDEYVFSSKVTKYRYERRCDTILHEMAHMWFGDLVTMKWWGDLWLNESFATWAAAISQAEATEYSTAWVTFANVEKSWAYHQDQLPSTHPITADASDIETVEQNFDGITYAKGSSVLKQLQAFVGRDAFLAGVRKHFANHAFANATFDDLLGAFEEASGRDLSQWADQWLKTTGINKLSPTFTVKDGVYSEFAVQQSGAAPGAGELRTHRIAVGLYSLIDGQVKRTHRCEIDVEGTSTPVPEVVGLAQADLILVNDDDLTYCLMQLDPASLDFIVNNIDKISDPMARTLCWSAAWEMTRDGSMRARDFVTLVARGAQFETEIAVLERILSQAAKAVRSYVDPAWADSTGRDMLADALLVGARNAQAGSDAQLAFVQALAKIRITKDAAAEFAAIVQGSTSLPGLTVDSDLRWWALTALIAHGEITGTAVHESIEKLRGIDRSSAGELAALRAYAAQPDADVKADIFDEVTDTKNTLSNLFLRHKLEGLTFTGSGPYLAQFNSAVFALAEKIWAEMSSEVALVTLSGIYPYWDISAQGVENAHAFLNKDSLPAGVRRVVSEGMSEQERALRLREIDAR >LS483402|2415298:2451328|2431803_2432424_-|SQG59325.1|DBSCAN-SWA MTEHVTFWFDVSCPFCWVTSRWIKEVEQVRDIHVEWVPMSLSVLNEGRDELPEDYKEKMKANWGPARVFAAVASQHPDKLDELYTVMGTEIHNNGQGGKQGFGAYDEIIKSALSTVGLDPTLAEVANTETWDKELREYHRGAMELVGNDVGTPVIKLGDSAFFGPVITRIPRGEEAGKLFDASIALGSYPYFFELKRSRTESPRFD >LS483402|2415298:2451328|2443673_2445224_+|SQG59335.1|bacteriocin|DBSCAN-SWA MTTRVVYLHCPARSFPAHALPPRHEQWIAVPSEVCRVCLSAWADRTEEAADWVKAARVPGFIRRWLSERVHYVERPQGGWRLAATDAQCLSSSTVFIPPSPCCTEHHDCAANAEVSDLTGPVLAPCGTGPITEIRRPGMVVSRGTMPAVGSRPAFHWSGQAPTIAESRKLALYEAVERASACGNEGARGVDTHIPYVPATDFGVDNERWNRSYDHCRDWTRAIRLGDETAWAVPTDMAFFWSDAQARFCFDSSSGAAVGHTWEDAVMSGLVEVIERDAVLAVWHGSMTVPEIDVDSINDRTYQAMLRHLRRQGLVIRAFYCPLSVGVPAVIAVCTDTERTFLCVGAAAAPDPYVAVRKALREVMADYPQSRLLASARLSDAAAVRADGSGAAHRLSVAASELIDAAAFLLLPRKELLRVSDIPGCPRLSLVELVERLKAHGFYGYVVDFTQSYHQMVGLSAVKVIVPGLLPLEYIGQLTRALHMPRLRQQMTCFRALGLAPPSSPPRLNLVPHPLP >LS483402|2415298:2451328|2436471_2436864_+|SQG59328.1|DBSCAN-SWA MSTPQSFYDAVGGESTFRAFVHRFYELVRTDDILGPMYPHDDWEGAEDRLRWFLVQYWGGPQTFSENRGHPRLRMRHAHFPIDQAAADRWLLLMESALDSLDEETLPTAYRAALWDHMQRVADMLINRAS >LS483402|2415298:2451328|2449340_2450354_+|SQG59338.1|bacteriocin|DBSCAN-SWA MRTTDPWLAFHIFYGEDPSLLLRDCLLPFAHTCVTEGLVQRFFHMNYWLEGAHVRLRLELCNPADREHVICAAHQAIQPWIDAHPSSAPQLSLRNPEGYRRLFEHEYPMSRFSDYIDQDGLPRLEPDNCIRERYYEREYDRYGGRIGMSLSQDVFQTSTLFVEKLLHSGVLEARTSRLAAAALGMICTAHTVLDSDASIGRFWESYHAGWTSSFSMPTSYTSPTAQHKISAEAQALSRAATPFREALRRTPDGSGLPEPYATFTQHMTSAVEKLHAAHASHELDFGDSAAEAGHTWKENSIFLLRSYVHMTNNRMSVNISDEAYLAYLLHALYQEAI >LS483402|2415298:2451328|2441958_2443677_+|SQG59334.1|DBSCAN-SWA MTRIVQNSPRPSGLPETENSTAAQGWRPFLHSNVMFSETPTGVEFHDDFRRFSIDGPGAYKAFRAATPLFEGDVLFADALQHLGAAGAQTIKLFESELYRHGMLTRLTPENPSVAAYQWGGPWDGILRLLANYTPTPIAVLEKISHTPFTICCNHPESAHLIKASLEENGCATVRCAALSPDTSHLRPLFSLSWRELSDAATITVRRIGSGLVIASDVAAEAAAATRVLESLSTEKETAEVPEALIRMAGIVASFEAFKIVSGVMPATLTQALVRIDLSTGEVTQHQLGQQQHGNGDSPLFSLVDPLLGFTPRFLDDDITQMPLRLSQVSVTGPDQRRWIVTGWALETLEAARERAVEEATTRVLLSQQLTQTMPDAPLMQPTGLPTPAAVGSSQEEAMERALPRAAAYWAFHSATQGTSAVVPFNASPEASSCVRDAQELYGAATSTFLELTPLWDHSVVVVQCDNPILSVVAAGESAEQAAVSAAYGHLALAQLHADKHCPPHDGWSHALPALALRESSMRCLAEEPVTELLTDPRLSAAGLTAVALRPRTWIPASPPAVDPAVHRPLVS >LS483402|2415298:2451328|2431107_2431614_-|SQG59324.1|DBSCAN-SWA MGCGDAIHFETMRVYLGADHAGFEMKNIIAEHLKTKGHEVIDCGAHTYDAQDDYPAYCIEAASRVVNDPGSLGIVLGGSGNGEQIAANKVKGARCALAWSVETARLAREHNNAQLIGLGGRMHSEEEALQIVDAFIEQEWSKEERHQRRIDILSEYERTGIAPALPEA >LS483402|2415298:2451328|2415298_2418055_-|SQG59314.1|tRNA|DBSCAN-SWA MLGANAKMVDVTANNENANRADRLPASWDPQAHEQELYQSWVDAGYFTANTSSSKPPFSIVLPPPNVTGQLHMGHALDHTLMDSIARRKRMQGFEVLWLPGMDHAGIATQTKVEAMLKEKEGKSRWDYDREEFIGHVWNWKNEYGGTIGKQMRAIGDSVDWSRERFTLDEGLSRAVQTIFKQMYDRGMIYQANRLVNWSPVLETAVSDIEVVYKDVEGELVSIRYGSLDDSEPNVIVATTRVETMLGDVAVAVHPEDERYKDLVGTSLPHPFIADRQMIVVADDYVDPEFGTGAVKITPAHDPNDYALGLRHNLDMPTIMDSTGHIAGTGTQFDGMDRAEARVKIREALAEQGRIVKEIRPYVHSVGHSERSGEAIEPRLSLQWFVSVEELAAMAGDAVRQGDTVIHPKSAEPRYFEWVDDMHDWCISRQLWWGHRIPIWYGPEDANGQRDIVCVGPDEQAPEGYTQDPDVLDTWFSSALWPFSTMGWPEKTPELEKFYPTSVLVTAYDILFFWVARMMMFGTLAAETTPELLGEGRDGRPQVPFTDLFLHGLVRDEHGRKMSKSLGNGIDPMDWVERFGADALRFTLARGANPGVDLPVGEDSAQSSRNFATKLYNATKFALMNGARVGELPAREELSDADRWILDRLEQVRAQVDSYFDAYQFAKGNEALYQFTWGEFCDWYLEFAKVQIPRDIDSATPEERRRGENTQIVLGQVLDAVLRMLHPAMPFVTEVLWKALTDGESLNVAAWPTEAETNGGALIDATAARRIADVEKLVTEVRRFRSDQGVKPSQKVPARVDFSACDLDEQESIVRSLVRLETPAEDFAASASIEVRLSQATVVVSLDTSGTVDKVAERKRLEKDLAAATKELETTAKKLGNEAFLSKAPEAVVEKIRNRQKIAEEEVARINARLEELN >LS483402|2415298:2451328|2426487_2427840_-|SQG59321.1|DBSCAN-SWA MKSSVEKLSDTRVKLNVEVPFEELKSEIDQAYKALAQQINIPGFRRGKAPRQLIDARIGRGPVLEQVVNDMLPTRYQQVCEENELVVLGQPAIDITKIEDNDVVEFTAEVDVRPEITVPDFAAFNVEVPALKVDEEAVDAEIDRLRERFGELKDTKRKLKTNDFATIDLAASVDGEAIEEATTEGMSYQVGAGDLIDGLDTALRGLKTGESAEFTTTLQAGEYAGKEATVTVTVQQTKERKLPEVDEEFVQMASEFDTVEELRESVTAQVEEKAKAAQATAIRDEVLKAALAESTFELPEGVVNEQVEAQLHQLLGQLAHDEAALNAALEAQGTTREDFDAENRKNSEEAVRTQLFLDALAEQEAPEVSQQELTDHILFTAQNYGMEPNQFVMQLQQSGQIANLFSDVRRGKALAAAICRVSVKDEEGNVVDPTQYFGEEEAEASEDEAK >LS483402|2415298:2451328|2450350_2451328_+|SQG59339.1|protease|DBSCAN-SWA MTSIHDLRFAPGAALGPIIHSPDRSFAVARNPQGELSELGPQAWMLLDLFRQRDNGNSIDMRMLVIPQVAQAIRAMEHRGLLTAAPDTGASSHPVGVLGRPRGVWFVSFVTLVMALAAASTALLLSAGSAFTQTHSAPNLTGFFTILVGVIITIIAHECAHGIAFFLLCGIKPAVFSTASFPRRFLSLQLPGILAISSRAGKVAVLAVGPATTLIFTGLSVFLYNKGLLPDGASWLPRILFLTFVGSLIPVPHSDGTKILETISRTNNLPKFAWNFARRKQLRSEIFQQQIVTIIVIYISLYLGTIMLWAIIVLFFIVFPGSPYQ >LS483402|2415298:2451328|2437775_2438384_-|SQG59330.1|DBSCAN-SWA MNESLSIPEGPATGITSLLQRAVGLDRHAYARLQQVEDCVNVYVTTPFGVVASRRVRGEASRDGAVVLATDLLARGEGKPHDASWAGALPPLTGFKLIDEIPVGVASDLAEKGRSLARQFSGPMGPPASLLDQKVMTVSNADNSVEISMRMIFACTSLGFIPPATAPAHIPRHLRVSSTGRWARIDAPYGTVYESQGLSLLP >LS483402|2415298:2451328|2436875_2437772_+|SQG59329.1|DBSCAN-SWA MLWILLGVAAGAVIPIQTVVNTRLSASTGTPFSSSMISFCVGTLTLAVALIAATGQLPNVSAAYHAPAWIWLGGLLGVIALTANIFMFPRLGAVQTVVLPISGQIFMGLAIDHWGLFDAPQNSITPLRVVGALLVFAGVLATVGKPKSSETQSGSFIHWIWRLAGVSFGMLTAMQSAINGRLGAVLHSAATAALVSFSVGAAALIVLNIVLRWRPRIQRLGSSHPWWMWFGGSLGALFVFANAALVPKIGTGLTVVAALLGMMISSIAIERIRGGHSGIRQILGVVAMLIGIVAIRLL >LS483402|2415298:2451328|2446740_2449341_+|SQG59337.1|DBSCAN-SWA MTTISWKLLNEALVRSCSVPYEALRQLCDAPTDALLTADLDQRLRVDAALDALGSAARQDVAGCTHTVLRHTLLDLKRAAHHHDVRKVRALLAKATSAGVRLPAGVTTAADTVITAAENVLSSEQLHKVLMSSKKRERECLGKIARDLGVDSAVLAASVPAAKAIRKLSTDCAMSAKQLARAHRTALGYVIRSATRSVPFSALCAIAPSQLSHASLHSDETLVPTSVHTITRWNVYAMAQIFSAMKKDFGFIATLPVLVNPDALSEHGHCALPRCSVEYLGHVGDRDLAVYREERRVVDSNGLFGKVMALAASAPQNHEYTCEQLAAELSARTGLSKAQTKCIVLDAIRISALVVPTLDLSPSTAVSEQPIVTHLARGSDKAVSAARLIARIAEECNAVASISDFDRRHTQILDLAHHLDSLRRLVDPSMPMFHTHVYEDGIGKESTIPSSIADSCTALDWEALADLVDLLDVRQAERALFEEFVSAKFPRGEICHDVPALVNSFVAEVLTPLRQIDIEAVDENDLKSTASLPLGKAWEWIRARRRFLAHVAALRSNAAGPVDIRDLLIDHRPLVQSRRYPLRSLNAYVQQGDKAKIVINRTLGGPGFPWSRFAHAMPDSATRAWSELSDYASDAGVKLVELTAGKVVSNLNAHPATYPTTLLIPGHPRKTTRASDIRLADTHLAYCASSGRLQLFDAHGTELLPAYMGYITDRGLPLSTQALMLLAPPMHCSLDFFPRTNSEITHQARLMLGDAVLARESWIFPTSSTFMNIPCLLEEALVWWRSFSRYQGLPECGVLRTFDAHGVVGKGQFYNSQIMGTITNLIRALRNAHYGCVIEEFFPLVGESGVAQEHIVTGTRSLKEGI >LS483402|2415298:2451328|2440860_2441424_-|SQG59333.1|DBSCAN-SWA MHIQSTIVGNLTNEPTFVQFTSGGVCKFRIAASRRMRKHPVNNDVPGEESQWIDTDQNYIDVECWGQLGVNARMSLERGRPVICSGYLVTQEWTDAATGKLASKIVLKASYVGFELSRYVVSSRKSTVEDIHHVPGLPAPENTENQPFIPDRDYSRTNDFLNEEGGAAKESGELVGATTTSSHNPPF >LS483402|2415298:2451328|2418199_2419180_-|SQG59315.1|DBSCAN-SWA MNHAVKKIAVTGAAGQIAYSLLWRIANGDVYGKDTPIELQLLEIPQAIGGAEGVAMELLDSAFPLLKNITVTDSADVAFDGTNAAFLVGAKPRGKGEERAALLTANGKIFGPQGDALSRNAADDIRVLVVGNPANTNALIAQSAAKDIPADRFNAMMRLDHNRGISQLADKINRDKNGIENFVVWGNHSAGQFPDIAYAIVDGEKLADLVDDAWYRDEFIPRVAKRGAEIIEVRGKSSAASAASSAIDHMHDWINGTEGQWRTAAIPSDGSYGVPEGLIFGFPTIAEGGEWKIVDGLELSDFQKESIARNVKELEEERAAVADLLK >LS483402|2415298:2451328|2420614_2421901_-|SQG59317.1|protease|DBSCAN-SWA MTRMQESADLLKCSFCGKSQKQVKKLIAGGGVYICDECIELCNEIIEEELSTAAEEKKDEGTKLPRPSEISAFLDKYVIGQDDAKRILSVAVYNHYKRVRAEESRVLSTRKNKDDETELQKSNILMLGPTGSGKTYLAQTLARLLDVPFAIADATSLTEAGYVGEDVENILLKLLQAAEFDVQRAQRGIIYVDEVDKISRKSDNPSITRDVSGEGVQQALLKILEGTVASIPPQGGRKHPNQDFIQLDTSNILFIVAGAFAGLEKVIEERRGKKGIGFGAELTTKEDIDEIDVFKEVLPEDLVKFGLIPEFIGRLPIVATVGNLDQRSLVKVLTEPKNSLVKQYQRLFEMDGVALEFDDDALEIIADLAIERGTGARGLRAIMEELLVPVMYDIPDLEDVGVVTVTAASVRGEEPPQMSEQAGEEKTA |
26 | Agrobacterium_phage(28.57%) | tRNA,protease,bacteriocin | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|