Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
LR134303 | Shewanella putrefaciens strain NCTC12093 genome assembly, chromosome: 1 | 3 crisprs | DEDDh,DinG,WYL,RT,cas3,csa3,csx1,cas2,cas1,csx16,cas6,csm3gr7,csx10gr5,cas10 | 0 | 10 | 5 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134303_1 | 972053-972187 | Orphan |
NA
Consensus repeat of LR134303_1
|
1 spacer
spacers of LR134303_1
>1.1|972097|47|LR134303|CRISPRCasFinder CGACTATGCTTTAGTTAAGCATTGAACAAGGAGGTTTGTATGCCGAG |
CRISPR arrays and Neighbor proteins around LR134303_1
The CRISPR arrays of LR134303_1 >merge|LR134303|1|972053-972187|CRISPRCasFinder AACAAAACCGGACACCCAACCTCAATAGCGCGGTAGCTAACCTCCGACTATGCTTTAGTTAAGCATTGAACAAGGAGGTTTGTATGCCGAGAACAAAACCGGACACCCAACCTCAATAGCGCGGTAGCTAACCTC >LR134303|1|1|972053-972187|CRISPRCasFinder AACAAAACCGGACACCCAACCTCAATAGCGCGGTAGCTAACCTC CGACTATGCTTTAGTTAAGCATTGAACAAGGAGGTTTGTATGCCGAG AACAAAACCGGACACCCAACCTCAATAGCGCGGTAGCTAACCTC
>LR134303.1|VEE61135.1|971561_972008_+|Uncharacterised-protein MARKEKAESESYRKFIDEQAKLAYEELVKNQSPKKAFFGAILGVFLGLALLILFVWNGLVFYWMLFVPAAVIGYLACKFGKIYESKYANMIGVIGLLTNGFAVMTLYNYEAIALSTIPIAFIVTRYFAKLKLTEAQERAIWRKEIGKL >LR134303.1|VEE61134.1|969078_971376_+|Tetratricopeptide-repeat MRHKSNSLPAKAIPRWKTKLKQTSLIGLSLFSPVLLACGPDFPLQLTQDRQNNLSYLPQTSFSQQLSGLAKPLAWQFQGAPAAQEYLWDEAHSRYLSQTRAYENSELSEAQLAQVNSLRDAQSLAEAEQIAAQLKESLAPALTWYSLGAMAFDAKEYDKASDYFKKVIALPESERAGRSLWALYSLSRIELIKSKTASDNSHFVQANAYLQQLQTEVTQGAADPLRLSLAGLGEQAYILLHQGQAQIEVSRGEYEPPKIDVALNPATLDKVIELYATQSVQGDSSGYDSLLMLSRSLMSKNINELKPLLQQPSVQQLLIAYWQSSANDFAFDGQLMEMGQKVAEILTVFPTDGVVLSQGDKLAAIYYQLGDYASAERMLALAKPSGLTWWLTAKLMLQKGDQAQAAKAYAQAVRHFPTDMNATAATGSQQDAQQQAIEADAEQATYCRIRAEQGVLSLERGEYVDALGQLLASGDEYWQDIAYVAERVLTTAELKLFIDEHVPVMTFEYPKDSDWYDSVEPINNRLRYLLGRRLLREGAIAEAPVYFSNPTLNANVLEYGKALTAAKNSKGIESARAYWVAAELARHQGMEILGFELAPDYSIYAGMFDLRDWYAADKLSDKEQQRISASQAIPDKRFHYRYQAAELANRAADLVPHNSQAYAALLCQATGWVLYRDDELARRYYKKYVANGPFVPWAENFGTQCETPDFDRAVEREKANQIAEWNAMYHKLKKPVAYGSLVITAALIGFIGLRRRSRRAKNHQS >LR134303.1|VEE61133.1|967574_969089_+|Protein-of-uncharacterised-function-(DUF3142) MAQSTLGFYSYPMDLRSRYSTRFQPKHFESKHFEPRRSKSLQAIHRTLSLLFRLSQLLALSFGLVFLSACQPANQDTPSIKTATTAPRELTQEVYVWQRQWRDANQSALVESQSTFNGVRILALQAHPKPNGADIWFEVQVNHTWLQADPRPKVAVIRLDGQLTRLNNREALQKILALIQDWQAKGTRLAGIEIDHDSASSKLAAYNAFLRELKSQLPHSLKLSITSLPAWMSSSEFPALFDNIDELVLQIHSVSDPRLGLFDATQGWQWVEQLSRLAKVPYLIALPSYGSAVYSTAAGYRVESEVPMQMPLADTASSQHLARQELMADPLVLQSFVKKLHTFADTKLKGIIWFRLPLEGDKRVWPLSTLIAVAKQQPLAAHIELEILSQANTGSAQHEAPGSRLFQLVLVNKGNLAGKLPGQLSLAAQACSGYDAQNGYQAKLTQGILVWQLPQATSTERAAAPASSQFAPNLISAIELSPNGRRVIGWARCESLHLQGIYAP >LR134303.1|VEE61132.1|966197_967373_+|Phosphoribosylglycinamide-formyltransferase-2 
MIGTPYTEGARRAMLLGCGELGKEVAIELQRLGVEVIGVDRYPNAPAMQIAHRSHVINMLDAKALRAVIELEKPHLVIPEIEAIATQTLVEMEAEGLNVVPTARATKLTMDREGIRRLAAETLGLPTSPYFFCDTETEFNQAISKIGVPCVVKPVMSSSGKGQSVIRDTAQSAKAWQYAQEGGRAGGGRVIVEGFIPFDYEITLLTISAVNGIHFCAPIGHRQEDGDYRESWQPQAMSADVLAKSQAIASKVVEALGGYGLFGVELFVKGSDVYFSEVSPRPHDTGLVTLISQDLSEFALHVRAILGLPIPNIHQHGPSASAVVLVEGKSKNIRYQGLADALAAENTQLRLFAKPEIDGRRRLGVALARDKDIESAVNKALDSASKVKVIF >LR134303.1|VEE61131.1|964493_966062_-|ATP-dependent-zinc-metalloprotease-FtsH-3 MLGIVKILFVAVFYVNFSIHNGLWKPKMQDIRDLTSVLRSKTPIVVIETYEEYRVVDMLKRVANVLYQPLFTWSITQGLTRVDKPMAAQKFNTEPGDILGQIKSTTQQGIYVLCDFHPFVIDAPKNVRLLKEIALEYDALQHTLVLVSHAFEIPPEIKRYCAYFQLTLPSTSQLENLIYLEVDKVRGQGMPLTVDDKAVVKLAENLRGVTLDDARRLIHKAIVDDGAITHSDVDLINKAKFQLLDLNGILQFEYDTSDFSQIAGLHNLKKWLKQRAPVTQATTDAKATVQPDTPKGVLLLGVQGSGKSLAAKAVAGVWQRPLLRLDMGALYNKYIGETEKNLRNALELADMMSPCILWIDEIEKGLSGSSSDEGTSTRILGTLLTWMAERKSEVFVVATANDIQALPPELMRKGRMDEIFFVDLPDEAIRQAIFLIHCQRRGIDVTRLDLAQLSRHSQGFSGAEIEQAVIAAMYSARSLGRSLDQGMLLEELTKTKPLSIVMGDKINALRQWAAGRTVNAHE >LR134303.1|VEE61130.1|963802_964276_-|Uncharacterised-protein MISREEMSFEQEFTEKVGGYLDNLTDELQPILKQLIEHDYPQEVVTLAFEVFADSFSSQFPVRVFFMDIDGKATYASPVNPYLLDIDHVYPDEFEEKYIENDEDLDPWQIATNALIEWFSKCWIAAGGQHFKLKANIAPHDSHYEFNLVECQWQERC >LR134303.1|VEE61129.1|962100_963231_+|Sulfate/thiosulfate-import-ATP-binding-protein-CysA MSIRLTNISKKFGQFQALSPLNLDIQEGEMIGLLGPSGSGKTTLLRIIAGLEGADSGQIHFGNRDVTQVHVRDRRVGFVFQNYALFRHMTVADNVAFGLEVIPKNQRPSKAEIQKRVSHLLEMVQLGHLAQRYPEQLSGGQKQRIALARALATQPEVLLLDEPFGALDAKVRKELRRWLRSLHDELKFTSVFVTHDQDEALELSDRVVVMSNGHIEQVNTPIELYAQPNSRFVFDFLGNVNRFEASWQQNRWTNGDAFLVPPEQAPLQQNGALYVRSHELALADKPNSQAHIPFTIVAITPVGAEVRVELAPIGWQSEELWEATFTHHHLQELGLQKGSVVYATPRTAYFFGEQGDGSPIRQNWPFLPPGSLAFDI >LR134303.1|VEE61128.1|961234_962104_+|Sulfate-transport-system-permease-protein-CysW 
MNSFKPLRVGEAPLIKWSLITLAVFLAMVLLLLPLVSIFQQAFVGGWERYIKHLSQPDSLHAIGLTLMVAALTVPINLVFGVLLAWSVTRFEFPGRKLLITLIDIPFAVSPVVAGLLYLLLYGNSGWLGAWLFEHDLQIMFAWPGILLVTIFVTCPFVARELIPLMQQQGASEEEAAVILGASWWQLFRRVTLPNIKWALIYGVILTNARAVGEFGAVAVVSGNIRGETNTLPLHVQLLYEDYQAEAAFASASLLALIALCTLLLKALVEWRQQRSLSANDNQEQSQSL >LR134303.1|VEE61127.1|960379_961225_+|Sulfate-transport-system-permease-protein-CysW MIFNNGRLRHKRVLPGFSISLGVSLLFVSLILLLPTTGLIMQTSQMSWAEYWGVIADPRVLASYKVTILSALVASLFNCLFGLLLAWVLVRYEFPGKRILDALVDLPFALPTAVAGITLATLYAENGQIGSLLAEIGIKVAYTPLGIVVAMIFTSIPFVVRTVQPVLEELSHDEEEAGMTLGATDGAVFWRVILPSLWPALVVGTALSFTRSLGEFGAVIFIAGNMPYISEITSLMIFVRLQEFDFAGASAIASVVLLTSLLLLLLINLWQARYLRRIHGR >LR134303.1|VEE61126.1|959188_960196_+|Thiosulfate-binding-protein-precursor MPVKLTKALLGTLLLGTTLNVAAADQTLLNSSYDIARELFNAYNPVFAKHWQEQTGKTVEIKQSHAGSSAQARSILQGLPADVVTFNQVTDVQILHDRGKLIPENWQQLLPNASSPYYSTIAFLVRKGNPKQISDWNDLAKDDVKLVFPNPKTSGNARYTYLAALGYAQKNYGKDNQVSLDEFLKKFLGNVAVFDTGGRGATTSFVERGIGDVLITFESEVNNIRQQYGADDYQVVVPKTSILAEFPVAVVEKNAKRNGTQELATEYLNYLYSEEAQRLLAGFNYRVHNEKVVAEFTKQFPAVELMTVEQIIGNWDNAMKTQFANGAKLDQLLKR >LR134303.1|VEE61136.1|972247_973225_+|Transposase-and-inactivated-derivatives MPRPRRTQISLEDTPYYHCCSRVVRRAFLCGDDTYSGKNYDHRRAWVESLLFELEAVFAIDIAAFAVMSNHLHVVLRVDIETANRWTDREVLEQWHKLFKGDELTQKFVKGELVEAHQVNRLKHSIALYRSRLCDISWFMRCLNEPIARQANQEDNCTGRFWEGRFKSQALLDEAAVLACMTYVDLNPIRAQLADTPEQSDHTSIQLRIRAALKGEQPSNLLPFIGNECDNRPNGIAFSLKDYLQLVDDTGRIIRNDKRGYISESSAKILNRLNITHDNWLKLTTEFGKLFHGPVGTLQELTDYCEHLEKRRRHFATSCQHFNSN >LR134303.1|VEE61137.1|973561_974950_+|RHS-repeat-associated-core-domain 
MHYVIDLKWVARFTSFVYGSDNMRAKQSRTVDSATTTTYYVDKYYEADSDGSWRAYLDDIAVLSYTSQRGHLLQFTLKDRLGSATTLADQNGNIVSQRYFDPFGRTASTGGGHGTDIVNKNTLQSKLQDLDITNKNRRGFTDHEHLNEQQLIHMNGRIYDYNLGRFMSVDPFIQSPTSTQSVNPYSYIMNNPLAGIDPTGYLAECPDRNEGCPKPEEEKPEKKRRMDGSGRGSGWTVIYDRSSNGSSRQSSSATGTQKTEAIGSPKQNANKQYPSYGNDANFAKGYGSLKGDMSGILSMTGPALAVDAVKAGIEIANEIADEYEQSGLSMSTFIVAGKAAGEAAIDVGAKKLRLPDVVTGGPHGKIKGISGHESHHMPADSVSPIAKDKGPAISMEKGDHRQTASWGNSREARAYRSQQAQLIQRGDFKGAQQMDINDVRSKFGEKYDKAINQMLDYSKTLD >LR134303.1|VEE61138.1|974970_975393_+|Uncharacterised-protein MDFSIIPFEGVGSIKFGMTPKQVRTNLGSGFKSFKRTPDSVLPCDYYEALGVFIYYKLPSVVEAIEFAEPANPELDKAGLLTMSFNEVHNFLAAKDPSLEIQSDSLTSYNLGISVYAPNADEDPNLLIESILLFENGYYD >LR134303.1|VEE61139.1|976082_976796_-|Uncharacterised-protein MLKMVIGIFIALFAAFLFFLNYNDNAIVTADKYEAAPVSTSSIIHKEEKVEVSLKNDEVSLEKVEALVSVPEKKSVEEKSKVDSIEQNDEAFFSQFTRLTKNDFQTPMMLTSAFLNENGNVSKAALEDAFNASDFNELIHVINSIDKTENGTARENLLSERLYKLDNLQVHSETYSCSGKICLVSFDFEGDESSATELSNFTKNYSFTNIVDGDNGFKKFKAIYIQTDDPSTLTLSY >LR134303.1|VEE61140.1|976926_977073_+|Uncharacterised-protein MEPEEQLQVIEYTVAELRTAIEVTKPSKTFFIICPYIQVCQIILARLT >LR134303.1|VEE61141.1|977194_977371_-|Uncharacterised-protein MLPHVTEKMLGHLLGGVMAVYNKHDWLEEQAEAYEQWTSKIKLAALGDGSVVVLERRA >LR134303.1|VEE61142.1|977711_978746_+|Transposase-and-inactivated-derivatives MTSARRQLIDANATPFYHVINRCVRRAFLCGEDKLSGRSYEHRRGWIVDKIKALSAIFCIDICAYAVMSNHYHLVLKIDVDKAKSLTQKDIISRWCQITKGHAIATKYMNDDALIEGERMLLDGLITEWHERLSSISWFMRCLNEEIARKANREDECKGAFWEGRFKSQALLDEQALLACMMYVDLNPIRAGIADSLQSSDFTSIQERINSLNTPNHPLTIPVSQTDSATNQAHILRKSLVQFDGAAHLNQQVGIPFHFADYLELIDWTGRAIRLDKKGYIDNQRPKLLNELGIAPDAWLTSAKEFRRQYSGISGRWDSMCAFKKQHNSGKWCKGKASSQALHP >LR134303.1|VEE61143.1|979253_979622_-|Uncharacterised-protein MYRTLLVCMLGFALNGCAHQIRVPSSCQTVNECAELIKLKIQSNLEFDESFKDQTVKVNFHLDQSANVISYKMLEPSKVIKLNEAVKSAVLTSSPFTEVLSADSEVFSEIKEVNLIVIPRLD >LR134303.1|VEE61144.1|979639_980596_-|Glycerate-dehydrogenase 
MKIVVLDGETLNPGDLTWQAVSALGEFSCFARTPSAEIIPRAQDAEIVLTNKTPLDANTLAQLPKLKYIGVLATGTNVVDLAVAKELGIVVTNVPAYGPDAVAQMVFAHILHHTQAVAAHHQAVAAGQWSNCSDFCFTLMPLQSLKGKTLGLIGYGDIGQQVAKLALAFGMKVLVNTRTEPSDLPQGVSWTSRDTVFKESDILSLHCPLTPETTELINAQTLELMKPQALLINTARGGLIDEAALAAALTQGKVFAGVDVLSTEPPSADNPLLTAPNITISPHNAWATKEARQNLFNIATANLKSFLQGNIRNCVNSK >LR134303.1|VEE61145.1|980641_981286_-|Uncharacterised-protein MKYVNRLTVIAGLLPLTLTLPTYAEPELFVAPYGGYSFGGSSFDINELDANKAETDNKQSVGIEEASHYGIMLGIGTNDPGNIYLLYSRQSSELKSGGLFTPDLVASLDVDYIHLGGTLFFPRGDFQPYITASAGVTRMMPDDWSTETRFSMGIGGGAEYRMTQNFALFADLRGYATFIDSDSSLFCNEDQCLWHVTSDVMWQVQANIGVKLSF |
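
The spacer record and array coordinates reported for LR134303_1 can be cross-checked with a little arithmetic. The sketch below assumes that the pipe-separated header fields mean spacer ID, 1-based start position, spacer length, contig, and detecting tool(s); the page does not state this, it is inferred from the values shown. `Spacer` and `parse_spacer` are hypothetical helper names introduced for illustration only.

```python
from dataclasses import dataclass


@dataclass
class Spacer:
    spacer_id: str
    start: int        # assumed: 1-based start coordinate on the contig
    length: int       # assumed: spacer length in bp
    contig: str
    tools: list[str]  # assumed: tools that reported the spacer
    sequence: str


def parse_spacer(record: str) -> Spacer:
    """Parse one '>id|start|length|contig|tools SEQUENCE' record."""
    header, sequence = record.lstrip(">").split()
    spacer_id, start, length, contig, tools = header.split("|")
    return Spacer(spacer_id, int(start), int(length), contig,
                  tools.split(","), sequence)


# Values copied from the LR134303_1 row above.
record = (">1.1|972097|47|LR134303|CRISPRCasFinder "
          "CGACTATGCTTTAGTTAAGCATTGAACAAGGAGGTTTGTATGCCGAG")
repeat = "AACAAAACCGGACACCCAACCTCAATAGCGCGGTAGCTAACCTC"
array_start, array_end = 972053, 972187

spacer = parse_spacer(record)

# The declared length should match the sequence itself.
length_ok = len(spacer.sequence) == spacer.length
# The spacer should start right after the first repeat.
start_ok = array_start + len(repeat) == spacer.start
# repeat + spacer + repeat should end at the reported array end (inclusive).
end_ok = spacer.start + spacer.length + len(repeat) - 1 == array_end

print(length_ok, start_ok, end_ok)
```

Under these assumptions all three checks hold for LR134303_1: the 44 bp repeat, the 47 bp spacer, and the second 44 bp repeat together span the 135 positions from 972053 to 972187.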
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134303_3 | 4614027-4616302 | TypeIII-A |
NA
Consensus repeat of LR134303_3
|
32 spacers
spacers of LR134303_3
>3.1|4614064|33|LR134303|CRISPRCasFinder,CRT GCTAAGGTGCCAACAGTATGACTATTGATGACT >3.2|4614134|32|LR134303|CRISPRCasFinder,CRT GCTGATGCCGACTACGAGCCGAACTTCAAATT >3.3|4614203|32|LR134303|CRISPRCasFinder,CRT GTAGTAGTTTCAGCATCATGGCTACTCGAACT >3.4|4614272|34|LR134303|CRISPRCasFinder,CRT AAGAATCCAAAGCAAGAAAGGTTAATAGCTAATC >3.5|4614343|33|LR134303|CRISPRCasFinder,CRT,PILER-CR TGACACTATCGCTCCTATCGTTAACGTTCGTCT >3.6|4614413|33|LR134303|CRISPRCasFinder,CRT,PILER-CR CCACAGCATCAGCATATGCTGCAAATGTAAAGT >3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR GTAGAGTTCTTCCCAAAAGCAAACAAACCACC >3.8|4614552|34|LR134303|CRISPRCasFinder,CRT,PILER-CR CTATTTCCACAAACGTATAGGTAACTGTGCTTAT >3.9|4614623|33|LR134303|CRISPRCasFinder,CRT,PILER-CR GTAGGAGTAGTAGTCTCAGCCTCATGGCTACTT >3.10|4614693|35|LR134303|CRISPRCasFinder,CRT,PILER-CR GTTCTATGCAACTAAACACCCCGTAGTAGGAATGG >3.11|4614765|33|LR134303|CRISPRCasFinder,CRT,PILER-CR GAAGTATCTTTAGCAATCGCTCAACGTAAATCA >3.12|4614835|34|LR134303|CRISPRCasFinder,CRT,PILER-CR TCGAAGCTGAGTTCGGATACCTGTTGTTGTCGTA >3.13|4614906|34|LR134303|CRISPRCasFinder,CRT,PILER-CR AGAGGCTCATACAAGCGTTATTAGCATATGTGTG >3.14|4614977|31|LR134303|CRISPRCasFinder,CRT,PILER-CR AAGTCATTGGTAACGGCGTCTACCCTTCAAG >3.15|4615045|31|LR134303|CRISPRCasFinder,CRT,PILER-CR ATAAATGCTATTGTGGTGTTTACTGATTCTC >3.16|4615113|35|LR134303|CRISPRCasFinder,CRT,PILER-CR AGCTCCACAGGTAATCCCCGTAGCTCAGGGATTGG >3.17|4615185|33|LR134303|CRISPRCasFinder,CRT,PILER-CR ACTTACCATCGATTACTTGATCCACCGCGGTAT >3.18|4615255|31|LR134303|CRISPRCasFinder,CRT,PILER-CR TGCGCAGGTGCTGTTTTGATGATTTATTCGC >3.19|4615323|31|LR134303|CRISPRCasFinder,CRT,PILER-CR AAAAAAGTTGTCATCGTTAAATCTCCGTAAT >3.20|4615391|33|LR134303|CRISPRCasFinder,CRT,PILER-CR CGCAAGCACTCCGTAGCCAACGGGATTGGTTGG >3.21|4615461|33|LR134303|CRISPRCasFinder,CRT,PILER-CR GTAACGCCATGGTGTGCTGCGTGTGGTCGGCCC >3.22|4615531|36|LR134303|CRISPRCasFinder,CRT,PILER-CR GCTAAGGCAGTATCGCGTTCATCTATATGGATTTCT >3.23|4615604|32|LR134303|CRISPRCasFinder,CRT,PILER-CR 
GAAGTCGGACACGTTTCATCTTACTTTGCTAA >3.24|4615673|34|LR134303|CRISPRCasFinder,CRT,PILER-CR CATGATGCGCACTGTCTACATCAAGGCAGCCTAA >3.25|4615744|33|LR134303|CRISPRCasFinder,CRT,PILER-CR GGTTAAAGGGGGCCGACTAAGACCTTTGGCATC >3.26|4615814|31|LR134303|CRISPRCasFinder,CRT,PILER-CR TTAAACCCGAACGGTTAGGGCTGATTTCACC >3.27|4615882|34|LR134303|CRISPRCasFinder,CRT,PILER-CR ATGAAACCCAGACCCCTGAGAAATATGTCCAACT >3.28|4615953|33|LR134303|CRISPRCasFinder,CRT,PILER-CR TCTCATTAAGAAACGTCTTGAAGATTTATCTAC >3.29|4616023|33|LR134303|CRISPRCasFinder,CRT,PILER-CR CAAGCGGTATGTATCGCCCTAGGTCGCCCACAA >3.30|4616093|33|LR134303|CRISPRCasFinder,CRT,PILER-CR AAAGGTATCACTCGCTTTATAGACCAAGGTGCG >3.31|4616163|32|LR134303|CRISPRCasFinder,CRT,PILER-CR AAATACCCTGAATACCAAGAAAAAGAAAGATA >3.32|4616232|34|LR134303|CRISPRCasFinder,CRT,PILER-CR TCAATAACGGCAAAACAGGCGGGACGGTAACTAT |
csx1,cas2,cas1,DinG,csx16 |
CRISPR arrays and Neighbor proteins around LR134303_3
The CRISPR arrays of LR134303_3 >merge|LR134303|3|4614027-4616302|CRISPRCasFinder,CRT,PILER-CR GTCTAAGTCCCTTTAAATGGCGGGGCGTCTTTCAGAGGCTAAGGTGCCAACAGTATGACTATTGATGACTGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGCTGATGCCGACTACGAGCCGAACTTCAAATTGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGTAGTAGTTTCAGCATCATGGCTACTCGAACTGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGAAGAATCCAAAGCAAGAAAGGTTAATAGCTAATCGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGTGACACTATCGCTCCTATCGTTAACGTTCGTCTGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGCCACAGCATCAGCATATGCTGCAAATGTAAAGTGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGTAGAGTTCTTCCCAAAAGCAAACAAACCACCGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGCTATTTCCACAAACGTATAGGTAACTGTGCTTATGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGTAGGAGTAGTAGTCTCAGCCTCATGGCTACTTGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGTTCTATGCAACTAAACACCCCGTAGTAGGAATGGGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGAAGTATCTTTAGCAATCGCTCAACGTAAATCAGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGTCGAAGCTGAGTTCGGATACCTGTTGTTGTCGTAGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGAGAGGCTCATACAAGCGTTATTAGCATATGTGTGGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGAAGTCATTGGTAACGGCGTCTACCCTTCAAGGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGATAAATGCTATTGTGGTGTTTACTGATTCTCGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGAGCTCCACAGGTAATCCCCGTAGCTCAGGGATTGGGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGACTTACCATCGATTACTTGATCCACCGCGGTATGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGTGCGCAGGTGCTGTTTTGATGATTTATTCGCGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGAAAAAAGTTGTCATCGTTAAATCTCCGTAATGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGCGCAAGCACTCCGTAGCCAACGGGATTGGTTGGGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGTAACGCCATGGTGTGCTGCGTGTGGTCGGCCCGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGCTAAGGCAGTATCGCGTTCATCTATATGGATTTCTGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGAAGTCGGACACGTTTCATCTTACTTTGCTAAGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGCATGATGCGCACTGTCTACATCAAGGCAGCCTAAGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGGTTAAAGGGGGCCGACTAAGACCTTTGGCATCGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGTTAAACCCGAACGGTTAGGGCTGATTTCACCGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGATGAAACCCAGACCCCTGAGAAATATGTCCAACTGTCTTAATCCCCTTAA
ATGGCGGGGCGTCTTTCAGAGTCTCATTAAGAAACGTCTTGAAGATTTATCTACGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGCAAGCGGTATGTATCGCCCTAGGTCGCCCACAAGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGAAAGGTATCACTCGCTTTATAGACCAAGGTGCGGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGAAATACCCTGAATACCAAGAAAAAGAAAGATAGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGTCAATAACGGCAAAACAGGCGGGACGGTAACTATGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG >LR134303|3|3|4614027-4616302|CRISPRCasFinder GTCTAAGTCCCTTTAAATGGCGGGGCGTCTTTCAGAG GCTAAGGTGCCAACAGTATGACTATTGATGACT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GCTGATGCCGACTACGAGCCGAACTTCAAATT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAGTAGTTTCAGCATCATGGCTACTCGAACT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAGAATCCAAAGCAAGAAAGGTTAATAGCTAATC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TGACACTATCGCTCCTATCGTTAACGTTCGTCT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CCACAGCATCAGCATATGCTGCAAATGTAAAGT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAGAGTTCTTCCCAAAAGCAAACAAACCACC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CTATTTCCACAAACGTATAGGTAACTGTGCTTAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAGGAGTAGTAGTCTCAGCCTCATGGCTACTT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTTCTATGCAACTAAACACCCCGTAGTAGGAATGG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAAGTATCTTTAGCAATCGCTCAACGTAAATCA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TCGAAGCTGAGTTCGGATACCTGTTGTTGTCGTA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AGAGGCTCATACAAGCGTTATTAGCATATGTGTG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAGTCATTGGTAACGGCGTCTACCCTTCAAG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ATAAATGCTATTGTGGTGTTTACTGATTCTC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AGCTCCACAGGTAATCCCCGTAGCTCAGGGATTGG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ACTTACCATCGATTACTTGATCCACCGCGGTAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TGCGCAGGTGCTGTTTTGATGATTTATTCGC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAAAAAGTTGTCATCGTTAAATCTCCGTAAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CGCAAGCACTCCGTAGCCAACGGGATTGGTTGG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAACGCCATGGTGTGCTGCGTGTGGTCGGCCC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG 
GCTAAGGCAGTATCGCGTTCATCTATATGGATTTCT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAAGTCGGACACGTTTCATCTTACTTTGCTAA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CATGATGCGCACTGTCTACATCAAGGCAGCCTAA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GGTTAAAGGGGGCCGACTAAGACCTTTGGCATC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TTAAACCCGAACGGTTAGGGCTGATTTCACC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ATGAAACCCAGACCCCTGAGAAATATGTCCAACT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TCTCATTAAGAAACGTCTTGAAGATTTATCTAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CAAGCGGTATGTATCGCCCTAGGTCGCCCACAA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAAGGTATCACTCGCTTTATAGACCAAGGTGCG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAATACCCTGAATACCAAGAAAAAGAAAGATA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TCAATAACGGCAAAACAGGCGGGACGGTAACTAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG >LR134303|3|1|4614027-4616302|CRT GTCTAAGTCCCTTTAAATGGCGGGGCGTCTTTCAGAG GCTAAGGTGCCAACAGTATGACTATTGATGACT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GCTGATGCCGACTACGAGCCGAACTTCAAATT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAGTAGTTTCAGCATCATGGCTACTCGAACT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAGAATCCAAAGCAAGAAAGGTTAATAGCTAATC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TGACACTATCGCTCCTATCGTTAACGTTCGTCT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CCACAGCATCAGCATATGCTGCAAATGTAAAGT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAGAGTTCTTCCCAAAAGCAAACAAACCACC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CTATTTCCACAAACGTATAGGTAACTGTGCTTAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAGGAGTAGTAGTCTCAGCCTCATGGCTACTT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTTCTATGCAACTAAACACCCCGTAGTAGGAATGG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAAGTATCTTTAGCAATCGCTCAACGTAAATCA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TCGAAGCTGAGTTCGGATACCTGTTGTTGTCGTA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AGAGGCTCATACAAGCGTTATTAGCATATGTGTG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAGTCATTGGTAACGGCGTCTACCCTTCAAG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ATAAATGCTATTGTGGTGTTTACTGATTCTC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AGCTCCACAGGTAATCCCCGTAGCTCAGGGATTGG 
GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ACTTACCATCGATTACTTGATCCACCGCGGTAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TGCGCAGGTGCTGTTTTGATGATTTATTCGC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAAAAAGTTGTCATCGTTAAATCTCCGTAAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CGCAAGCACTCCGTAGCCAACGGGATTGGTTGG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAACGCCATGGTGTGCTGCGTGTGGTCGGCCC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GCTAAGGCAGTATCGCGTTCATCTATATGGATTTCT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAAGTCGGACACGTTTCATCTTACTTTGCTAA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CATGATGCGCACTGTCTACATCAAGGCAGCCTAA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GGTTAAAGGGGGCCGACTAAGACCTTTGGCATC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TTAAACCCGAACGGTTAGGGCTGATTTCACC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ATGAAACCCAGACCCCTGAGAAATATGTCCAACT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TCTCATTAAGAAACGTCTTGAAGATTTATCTAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CAAGCGGTATGTATCGCCCTAGGTCGCCCACAA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAAGGTATCACTCGCTTTATAGACCAAGGTGCG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAATACCCTGAATACCAAGAAAAAGAAAGATA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TCAATAACGGCAAAACAGGCGGGACGGTAACTAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG >LR134303|3|1|4614306-4616302|PILER-CR GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TGACACTATCGCTCCTATCGTTAACGTTCGTCT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CCACAGCATCAGCATATGCTGCAAATGTAAAGT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAGAGTTCTTCCCAAAAGCAAACAAACCACC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CTATTTCCACAAACGTATAGGTAACTGTGCTTAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAGGAGTAGTAGTCTCAGCCTCATGGCTACTT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTTCTATGCAACTAAACACCCCGTAGTAGGAATGG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAAGTATCTTTAGCAATCGCTCAACGTAAATCA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TCGAAGCTGAGTTCGGATACCTGTTGTTGTCGTA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AGAGGCTCATACAAGCGTTATTAGCATATGTGTG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAGTCATTGGTAACGGCGTCTACCCTTCAAG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG 
ATAAATGCTATTGTGGTGTTTACTGATTCTC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AGCTCCACAGGTAATCCCCGTAGCTCAGGGATTGG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ACTTACCATCGATTACTTGATCCACCGCGGTAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TGCGCAGGTGCTGTTTTGATGATTTATTCGC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAAAAAGTTGTCATCGTTAAATCTCCGTAAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CGCAAGCACTCCGTAGCCAACGGGATTGGTTGG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAACGCCATGGTGTGCTGCGTGTGGTCGGCCC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GCTAAGGCAGTATCGCGTTCATCTATATGGATTTCT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAAGTCGGACACGTTTCATCTTACTTTGCTAA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CATGATGCGCACTGTCTACATCAAGGCAGCCTAA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GGTTAAAGGGGGCCGACTAAGACCTTTGGCATC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TTAAACCCGAACGGTTAGGGCTGATTTCACC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ATGAAACCCAGACCCCTGAGAAATATGTCCAACT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TCTCATTAAGAAACGTCTTGAAGATTTATCTAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CAAGCGGTATGTATCGCCCTAGGTCGCCCACAA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAAGGTATCACTCGCTTTATAGACCAAGGTGCG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAATACCCTGAATACCAAGAAAAAGAAAGATA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TCAATAACGGCAAAACAGGCGGGACGGTAACTAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG
>LR134303.1|VEE64244.1|4613172_4613850_-|Uncharacterised-protein MLVLNQYAFAPKTGSDLTSSLRSRGIQRLLHFTLVDNLASIIKYGLLGRTELEQRQLNYIYTDNKRLDGNPNAICLSITSPNYKMFFSKRNSFSQLGYQDSDWVVILLKPEVLLNLPAAFTFTNAADSRCRGQWQRHSSREAFEAMFYDEALRATLGLNPKQPTNPQAEIQVMRSIPSDWIDAIVCHPNADIRKLQSVAGNLPVLPIYEAFAPRHDYSYWSVNYN >LR134303.1|VEE64243.1|4611866_4613153_-|CRISPR-associated-protein,-NE0113-family MQKHILLAITGLSPQVVTETLYAIHQQQLEWPTEIRLITTLTGKNQARLGLLIPPEGKSLNRLQQLCADYNRPVPTFTEAHIAIVPNAEGQEVDDARSNADQESLADFIVSQVAQLANDDECQIHASLAGGRKTMTFYLGYAMSLFARPQDRLSHVLVDAQFEGLRDFYYPTPATCVINGRDPNTNLDAADAKVTLAEIPFISQRQSLAKTTLAAFAHSPAVQPPSYRDLVRYENLAQDRDNLSLTFNTKDRLVVLGSTQGEDINIDFNDAPLQLAFFAMMARHSQLANPIKLVRSKDANKTYAHLFLQELERIEGLPVTHMATEADYGYRIEQFRNQFLRSTQDRTLDSLKQGMDAQFWDDRKNQLKQYLSKQFPESMLDCLLPGPVFKPKKQNRNQFEPTAFDGRNPQGAAWGLWLKQTNIHFNDQ >LR134303.1|VEE64242.1|4611135_4611831_-|Uncharacterised-protein MANRPLFTPHSASISTKTHSVEFNWHPGLAASQKHKSVAELHQSAKQQGLCRRPLEVSSKSLEPLGIALSAFNLQVPLGNGKHTYVENVFQAAKVFEHAGPFIDLLYVSPLEAKRDPRLKENGMLQYFHGNAGHRWPLQPTTLFYDWVYLNALVRNPQLCEQVMDYDGFTDIEFNPNKSLNCQAQSVALFISLTQAGLLRQALASPLAFTELTTKVWREISSAIPEQQSLL >LR134303.1|VEE64241.1|4607844_4610949_+|Site-specific-recombinase-XerD 
MATTKIRYFINQLYSQGKLHPLYREVFDAFVKQNGHTIDKEKQNKVKDAFFLSYKQLLGHVKTRFILSETKVLPEELTDLIAQLKPAVLNKQVQLLILQSILNYAKKHHGIDSPNIPAIVTLKRDKPILTPVELVKLPIVERIYALLDKELITPHRTLTSEQKIGRAALLVYLTVNVSKTEELLSILETPNNVFYVGGICYWQSQQTNAASTRFVLSDVAVMALQQLSSLKQGSIKIAACIIKYLNITSEFDWSDLSILKLRTLRKIDNVIHYGPVQYQMYCLPSVSQALPEHAFMRVLTDKACLYSAPHLLGTMREGANELKPWRKVIKQHHSFIAIDVTLRQLDDIFLRLSGLDESNAPRANCIAFISEVLKSSEVTANPYLWMLCSWLLSLLKNGGTYKHRLKVSTTVDYVKSFSKPFLTEFCTSDIGLLSGEDWANKLNEAAELFTSAQRKKYVYYFAQFLIETNLVRDLCLSDIDVVGSSSEVDANLISATHIDEVLKYLTKHVEEGLIYQYAYFLLCFCFFTGLRRNETAKLTWADLSFALSAPDDEQFDYVQLSVRPNKHRTLKTTSARRELPLDALWPKSALQKLRIQHQIHQNSKANKKALLFDNPKLVNQAYDLITDLIRHYTQDYGLRVHHLRHSFANWTWCRLNPKIINQGKTQLTMFDHEQFNHDYLERLQTRLRHHMNTRKKMFMLAHLLGHKDVQSTLNSYLHLKDILYYLELTPRFALTKYFTSECVGRVSLLEPEQGLSLAERVIYYTRDIEAKLAIKPASLASSLFRPTLSTFVLDIKPNIEISSLTWAKALNALKVCSVIESSQHFNVPMEQLQQLLHNAEQVHQLYPRVGKPLPLIPAFPKLNHSHNEAEPDEHLAKSSKVFIYLCNKFDKNVGIEPLTLANVRLSMEILNYAVPGKDYALRCPNSKVSRMFIRFCQLLGLKDRHLKFRFHRADLTIEESQRVENRWKKTVLGCGFNDGNFVVASESEGRFLGKHDGNGFLEIALVNNGYKRVQRHQSLFSFLHFLLILSYQEE >LR134303.1|VEE64240.1|4605337_4607875_+|Uncharacterised-protein MTKLKQHLHEHICRYCDHMMLGIGKDEVLEDLEELIERVFAPESLGGFKQLIDLVVRVRMHSFHSAQSIMKDLHLAPYIFDALHNYSFEADDKSINSEYHRNSYKNGWCSEYPFYVLLLKSHEYHFTIKSCELLAAFIKHYYSVLDTLRDHDLARASSTREEDACANFRLFMKTNVAEFNFVRESIPKTALDSPISIANQIDQYLISKETWPYLVHKNYLRMLCHFFYNDWEQPKHFTRRGSPSDRVPKRYKDPIAIPIVGAYDDTFALIPGKPSLPNSDGLDDDDQYAAQTFVVNNRDVNTQRDKTELLDTAIPFNKHVQSRTAIDVTASVRRSHNMGLQNTQLLMPEELNLLINKLIKLANQPVNLETAIVMWLMMLLSKSIEDIHNLVVFTDLRAKQQGLYIDEFGQGWWLFYVSHSAKSKLDNVGLRPVKEDVFTACPDFLLKLIVKNMGARANGPIINEENTQVIIDNVAKKLKKISDRHSSGRLSVRRLVNFTSYYLNSTDVIDPIYIDYSYAVNMYTTRVARSYANLRDHARSQQLDKLWKSVEQDIELYSGKPLSISLFDLRHLSQCEQFIGSSFTPTKTVVSTLINSLTQRVLSSKPSFQHRLVDIIEYHNAFTAYTAWMLLFGTGYRAAWNPLPTFALFLPSLNLMGISDKDDSDFSHSRIVAVPTALATQLKEYKRHLGCLRSLLRVLMPKLCSRIDRIVDVDQHVLSFNYSQASQWYKVIRNSRKEQGPFFFFHQQGTSVVTQNLSPSALVNYCRDAILLPSNAGRHWLKSHLLEKNITPELINFQMGHWQAGEVPLGHYSALSHVEAINDIVPVLDELFEEVGWLPLKSVIL 
>LR134303.1|VEE64239.1|4604762_4605341_+|Uncharacterised-protein MNIHAEHFKASLDRLLVDVKLEKVAFHQIQLDRELKIFCESFNLLSTLKPSSDDGFESSASILLDSYQVPLLVSKTEAGYYRLISGLLTFQKLCKIYTVNDKSLVPCLVLPRRPNKEILRLLILNDLVRPLLKQFVDVTGDTITQTLSTLFTSVANPSVFNSPQWQSLFPMIKTKTQLCEWLHISTKTVKLK >LR134303.1|VEE64238.1|4602594_4603752_+|Alcohol-dehydrogenase-YqhD MNNFSFFNNTKIIFGEGQINQLTQEIPADATVMIIYGGGSIKKNGIFDQVVNALEGKKWVEFGGIEPNPHYDTCLKAVEKIKQEDVDFLLAVGGGSVIDATKFIAAAAKHQGDPWAIIQSFGGAVQGALPIGCVLTLAATGSEMNPTAVVTRADTQDKLFFNSDYVRPHFSILDPQTTYSLPARQVGNGVVDSFVHVLEQYVTYPVNAKVQDRFAEGLLATILEEGPRALIEPENYDVRANLMWAATMALNGLIASGVPADWATHLIGQEITGLYGLDHAQTLAIVMPAVWTYKVEQKREKLAQYGRRVLGIIEADDLVAAEKAIVRSREFFEAMGVRTRLADYGLSADIIPNIIAKLQEHQFIQLGEHGDITPDDVSKILKLAL >LR134303.1|VEE64237.1|4600874_4602053_-|multidrug-efflux-system-protein-MdtL MDSTGNKTTFWAASSALAATFIASATPIPLYGTYQRVDGISYLGLSLSSVIYFVGAVTALLIFGRLSNHFGRKPISLIAVLLAALASSLFLDVHTVTPLLIGRLFQGLSCGLASTALAAWIVDCATTVPKWVAPAVISCGPMTGLTIGGIVSGLLVEYSALPRQLPFYLVLALLLICIFTLIKSKETVRHTSGAINSLLPRFALPETAKRAYPIAAITFVCTWSLGGFFQAFGPAMAAEQLHSTNAVAAALVFASIMAPSVVGASLASKMTPKSAQYYGMLSFTVFVFAVWLSLYVGILTAFLISSVLAGIAQGLVLTGSIQTMVGQLKAEQRANVLSVIYATSYTGAAIPTLIAGRLSENVGLLGTASGYVGLAFIGAVAVIRHTRSPIKA >LR134303.1|VEE64236.1|4600414_4600858_-|Uncharacterized-conserved-protein MGNARGWLMLLSLFSTVSSAEITSKSQENLVKIYIDIQGNTISGTLKSDSDAAKSFAALLPLQLKLSDYAATEKVADLPKPLSTQNEPAGTAAKVGDITYYAPWGNLAIFHKSFGYANGLVALGTLDSGIELMRKPGELDVTIRLAD >LR134303.1|VEE64235.1|4599258_4600347_-|Uncharacterized-oxidoreductase-yvaA MAQPIRIGFIGLNPDSHWAATAHIPALSTLIDKFQVIGVANSSAESAQRTAKALNLPYAFSDVDELVNSPEIDLVVVTVKVPHHLKLVTAAINAGKHVYCEWPLANGVAEAKTLANLANSKGVVAVCGTQARMAPEIIHLTKLISEGYVGKVLSTSLIGSGGNWAGETINEYYYLFDAQNGANMQTIPLAHTLAAIKDVLGEFGSLSARFLSNFDTVTVTDTNEIKPKTVPDQLMIHGCLKSGAAIAVHYRGGVNRGTNFLWEINGTEGDIQVSAGLGHAQMVQLTIKGARGDETTLKTILPAPSLYQGKPEAAAARNVAGVYNALYHDIVNGTQTAPSFNDAVALHQLIDNIERSAKCCAE >LR134303.1|VEE64246.1|4616439_4616757_-|CRISPR-associated-endoribonuclease-Cas2 
MNSTEHWYLICYDIRKPQRLQKLQRYLRRCCLKLQDSVYLFCGNYRQGEQLRQAISQRISRSVDDVRVYQLPSQSQFKFYGQLPWTVDIFYPGLPSFSHTPATVA >LR134303.1|VEE64247.1|4616743_4617703_-|CRISPR-associated-endonuclease-Cas1 MTTLILEKHGISLEYDTDVLVIREAEMPPRTIPLSRIDKVVCLHNTALSTQLLGQLHQRGIDFIVLNSRYTDHCVALYADTHGNCLRRAAQYTLQLDNKERLPLAKSLCHHKLQQSLRVLDAQQPGRLQSQLQMAQESVRNCQDEQQLRGLEGSAQRALFQHWRQQLDPKWGFERRERRPPPDPVNGLLSLTYTLVHHEAIRQCKQYGLDSTLGIYHRMAYGRHSLACDLMEPTRPIIEQWLVQLLDTDLLTQRHFTKGKPGCYLGKEGRLLFYPQLETQMALIRRKLAANARWMVQQLDHGLTQGQYTLSEASAYELN >LR134303.1|VEE64248.1|4617706_4617985_-|CRISPR-associated-protein-Cas2 MMKQQNILIAYDIQSDNARSCALYNLRKFAISYQDSVFELQLSAAKLQRLILKLQPYISTNDTLLSVHFSPNACWQLGCGLPSISGQFLVIG >LR134303.1|VEE64249.1|4618008_4619031_-|Uncharacterized-protein-conserved-in-bacteria MSQDILWQARNEIIALSAEVGIDGSALLNALPSSSQLLTGSQVPTLSPRYRGKCSVLFHIHQMRNGKSWPWMRLHTFKHGGISRVFNGLQWQRQNSATHWPITPHFGQSNTVSTPQPPSASATLADWRATRFASLQVQYQALPTLTPTHPWLQQRFAAQLPSAVLQRCHLRGNHQQLLLPLQHHHGEISGFQLLTWGENAQKRFWLRRQGLLTGSYALISGQPGPVALCEGLATALSIALAWPGKIYIGLCANNLAAVRAKITGKVVIFYDNDNWKPDYGNPGQQAALACRQTRDILVGPQFSSEQQQTKPTDFNDLHNLAGIAELMHQIRRHWTVKSSY >LR134303.1|VEE64250.1|4619031_4619619_-|Helix-destabilizing-protein MTTGINKVILIGHLGKAPEIKTLNDGVRLCNLSLATSERWTNKQTGQAQEHTEWHRLVLSDRLAEIAEKYLQKGSKIYIEGKLRTRKWQNQLGQEQYTTEVRVAQLQILSSRAQEATKTAEDYSKAVSPVQPVSPVQQVSPTITKLQPIRRMLTILTWMIGTIFLFEVVRHGTISLGNSIVNHSIMWKSKPHLGG >LR134303.1|VEE64251.1|4619619_4621851_-|Probable-ATP-dependent-helicase-dinG-homolog 
MLNYQDNNPTQQHSIDIAEVKLANAKLWQAQSQQLYTTLSQAIAGFKSRAGQQQIVELCANNSAQHGQLVVEAGTGVGKTFGYILGHLPAILQNEQRLLISTNTITLQNQLMEKDLPQLLAILAPELTYTIAKSGSRYLCPAKVRSLLTKTEKNTDADSTDDMFDSASPVNMSSAQLQCLKEVYTLFTEGNFAGDLDQLQVANAPQYQKLLNRDQQRCPGQRGCSDGKQCPFYLQREKISKTDIVVTNHAFLVNVLMHDSNAFGDLTNTMLVLDEAHHLPKVIRDVNQAELDLSQAAEIADQFDKLEKASNKLIKTQPALFDDINKFTTSIQRLKPAITDFCEQLQQLNEHMSLNFMAYRGKKQNSFDDIEQWLMSYAPMPISFVTPLLNLYQSCATARTYVEQLNGQFRQQLDNKDIPQAAENACNRWLQSSANLLAMVAQAHHCLDYYYQFTNQQTEDQRLDVGIARWVKNDAATDKITLLANVIDVSQHFQLRIAQAFKSQVLTSATLKTLGSFNHINRQLGINSETSIHADIPSDFNYRHAFLSWAPSNAPEPNNEHFIGYLTQQIMELTQRHSAILVIFTAHNMLESTFNLMPTALKQQILCQDKRSKSELIQEHKRRIDAGQTSILFGVDSLSEGLDLQGKYLTGVVIAKLPFPSLSEPLINYEKITLDARGINSFAQMMVPECSRKLIQSVGRLIRSEQDWGEVFIADRRIQPRPFSNYYGNILLNALPIFQQGVR >LR134303.1|VEE64252.1|4621864_4622155_-|Uncharacterised-protein MMSSSSPYIFGQQANSQWQKRPLGRVGDLQGWHQRMKALCPNPRLGQVVWLENNHWDNGEGQYVHSSTGYKVINFRANGDPVWERGQVIILSTDDE >LR134303.1|VEE64253.1|4622462_4622765_-|putative-CRISPR-associated-protein,-VVA1548-family MNKRIWLVSRHPGAQTWLKQQGFQGQQIAHLDIDTIAAGDIIIGSLPIHMVVALTLKNAEYWHLSMTIPECWRGTELTPEQMQLCQIKLQKITAQAEDFF >LR134303.1|VEE64254.1|4622761_4623550_-|GTPase-Era MKPTLRCTVKIERDVVALVQARNAFGFLHNRQPLLSLASRWQRGDSLSHDIQVIAIGKSGYGKSTTLNQLVGEAVFETNAIAGCTREMQSAEYRFPTKEAQCHFSLADLPGIGEHPELDKQYIQLYRDAIAKAHSVIYFLRADQRDYVIDQWAFAQLFNTDAERRKVIIALNAVDKIEPLNRSQPFALTAEQIQSLRLKRHTICQQFAVCEDQIVTLSATEQHNLDGLVDRLADVLQPYLSQADTTSAYQAHWTSSVNGATY >LR134303.1|VEE64255.1|4623549_4624431_-|GTPase-Era MMNNIIAILQDLDSQLAPAQVERIREKLTSMLNYRPRIGVFGKTGVGKSSLCNALFGKPICAISDIEACTRDPQEVMLNMENGKGITLLDVPGVGENLQRDEEYSELYRKLLPELDMVLWVLKADDRAYSVDIEFYTNVVKPHLQQGKPFILVLNQVDKIEPFRDWDGENRKPGPSQAQNIALKAANVATHFQVKQSLVVPVSADEKFGLTKLVDEIIFSLPDEKKVSIAREVPAENLSTESKAEVKNSLSRVLTRTLTGVATGAALGSKIAGKAGAVVGGVIGGIGGFLGLW |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134303_4 | 4625297-4626449 | TypeIII |
NA
Consensus repeat of LR134303_4
|
16 spacers
spacers of LR134303_4
>4.1|4625334|33|LR134303|CRISPRCasFinder,CRT CCGTCAGAACTTCTCACCGAGTGATGTCGGTAA >4.2|4625404|32|LR134303|CRISPRCasFinder,CRT,PILER-CR AAAATACACCTATCTCCCGGGGATGTTCTCAC >4.3|4625473|31|LR134303|CRISPRCasFinder,CRT,PILER-CR ACCATTGCGTCTTACAAGAAGACCGAACAAT >4.4|4625541|33|LR134303|CRISPRCasFinder,CRT,PILER-CR CCAACACCCATGAATCATGCGGGTAAGTATCTA >4.5|4625611|33|LR134303|CRISPRCasFinder,CRT,PILER-CR TCAAAAAATCCAACTGACATCGAAAAACAAGTG >4.6|4625681|33|LR134303|CRISPRCasFinder,CRT,PILER-CR CAACTACACACATATTGCCAATATCAGCGCTAC >4.7|4625751|33|LR134303|CRISPRCasFinder,CRT,PILER-CR GAGTACGACGCGATAGTGGCCGACGACTTCGAT >4.8|4625821|32|LR134303|CRISPRCasFinder,CRT,PILER-CR GCATTATCAGCTACAGTGCAGGTCGGATACAC >4.9|4625890|32|LR134303|CRISPRCasFinder,CRT,PILER-CR ATTTCATTGGCTTCTTCAATAACCTTTTTAGT >4.10|4625959|32|LR134303|CRISPRCasFinder,CRT,PILER-CR CTATAAGACAATCCGTCTTATCTGTGAGGAAC >4.11|4626028|34|LR134303|CRISPRCasFinder,CRT,PILER-CR TTACAGCTTAACCTTACCGCGTACTCAGAGCTTC >4.12|4626099|33|LR134303|CRISPRCasFinder,CRT,PILER-CR GAAAAGCATAATGCATTCAGTTTCACTAAGTTT >4.13|4626169|34|LR134303|CRISPRCasFinder,CRT,PILER-CR GAAGTATGAACAAGAGAAGTCTACTTGGAATATC >4.14|4626240|33|LR134303|CRISPRCasFinder,CRT,PILER-CR CGAGTGATGCTAGGCGTGATGGTAGGTAGCTGG >4.15|4626310|33|LR134303|CRISPRCasFinder,CRT,PILER-CR AGCCAAGCAGCAGCTCACGAAGTAAATCAAGCA >4.16|4626380|33|LR134303|CRISPRCasFinder,CRT,PILER-CR GTAGTACGTGTATTCAGTACTATTGATAAAACA |
csx16,DinG,cas2,cas1,csx1,cas6,csm3gr7,csx10gr5,cas10 |
CRISPR arrays and Neighbor proteins around LR134303_4
The CRISPR arrays of LR134303_4 >merge|LR134303|4|4625297-4626449|CRISPRCasFinder,CRT,PILER-CR ACATTAACCCCCTAAAATCGCGGGGAGTCTTTCAGAGCCGTCAGAACTTCTCACCGAGTGATGTCGGTAAGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGAAAATACACCTATCTCCCGGGGATGTTCTCACGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGACCATTGCGTCTTACAAGAAGACCGAACAATGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGCCAACACCCATGAATCATGCGGGTAAGTATCTAGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGTCAAAAAATCCAACTGACATCGAAAAACAAGTGGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGCAACTACACACATATTGCCAATATCAGCGCTACGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGAGTACGACGCGATAGTGGCCGACGACTTCGATGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGCATTATCAGCTACAGTGCAGGTCGGATACACGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGATTTCATTGGCTTCTTCAATAACCTTTTTAGTGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGCTATAAGACAATCCGTCTTATCTGTGAGGAACGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGTTACAGCTTAACCTTACCGCGTACTCAGAGCTTCGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGAAAAGCATAATGCATTCAGTTTCACTAAGTTTGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGAAGTATGAACAAGAGAAGTCTACTTGGAATATCGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGCGAGTGATGCTAGGCGTGATGGTAGGTAGCTGGGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGAGCCAAGCAGCAGCTCACGAAGTAAATCAAGCAGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAGGTAGTACGTGTATTCAGTACTATTGATAAAACAGTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG >LR134303|4|4|4625297-4626449|CRISPRCasFinder ACATTAACCCCCTAAAATCGCGGGGAGTCTTTCAGAG CCGTCAGAACTTCTCACCGAGTGATGTCGGTAA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAAATACACCTATCTCCCGGGGATGTTCTCAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ACCATTGCGTCTTACAAGAAGACCGAACAAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CCAACACCCATGAATCATGCGGGTAAGTATCTA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TCAAAAAATCCAACTGACATCGAAAAACAAGTG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CAACTACACACATATTGCCAATATCAGCGCTAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAGTACGACGCGATAGTGGCCGACGACTTCGAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GCATTATCAGCTACAGTGCAGGTCGGATACAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ATTTCATTGGCTTCTTCAATAACCTTTTTAGT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG 
CTATAAGACAATCCGTCTTATCTGTGAGGAAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TTACAGCTTAACCTTACCGCGTACTCAGAGCTTC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAAAAGCATAATGCATTCAGTTTCACTAAGTTT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAAGTATGAACAAGAGAAGTCTACTTGGAATATC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CGAGTGATGCTAGGCGTGATGGTAGGTAGCTGG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AGCCAAGCAGCAGCTCACGAAGTAAATCAAGCA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAGTACGTGTATTCAGTACTATTGATAAAACA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG >LR134303|4|2|4625297-4626449|CRT ACATTAACCCCCTAAAATCGCGGGGAGTCTTTCAGAG CCGTCAGAACTTCTCACCGAGTGATGTCGGTAA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAAATACACCTATCTCCCGGGGATGTTCTCAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ACCATTGCGTCTTACAAGAAGACCGAACAAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CCAACACCCATGAATCATGCGGGTAAGTATCTA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TCAAAAAATCCAACTGACATCGAAAAACAAGTG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CAACTACACACATATTGCCAATATCAGCGCTAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAGTACGACGCGATAGTGGCCGACGACTTCGAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GCATTATCAGCTACAGTGCAGGTCGGATACAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ATTTCATTGGCTTCTTCAATAACCTTTTTAGT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CTATAAGACAATCCGTCTTATCTGTGAGGAAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TTACAGCTTAACCTTACCGCGTACTCAGAGCTTC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAAAAGCATAATGCATTCAGTTTCACTAAGTTT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAAGTATGAACAAGAGAAGTCTACTTGGAATATC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CGAGTGATGCTAGGCGTGATGGTAGGTAGCTGG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AGCCAAGCAGCAGCTCACGAAGTAAATCAAGCA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAGTACGTGTATTCAGTACTATTGATAAAACA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG >LR134303|4|2|4625367-4626449|PILER-CR GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AAAATACACCTATCTCCCGGGGATGTTCTCAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ACCATTGCGTCTTACAAGAAGACCGAACAAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CCAACACCCATGAATCATGCGGGTAAGTATCTA 
GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TCAAAAAATCCAACTGACATCGAAAAACAAGTG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CAACTACACACATATTGCCAATATCAGCGCTAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAGTACGACGCGATAGTGGCCGACGACTTCGAT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GCATTATCAGCTACAGTGCAGGTCGGATACAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG ATTTCATTGGCTTCTTCAATAACCTTTTTAGT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CTATAAGACAATCCGTCTTATCTGTGAGGAAC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG TTACAGCTTAACCTTACCGCGTACTCAGAGCTTC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAAAAGCATAATGCATTCAGTTTCACTAAGTTT GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GAAGTATGAACAAGAGAAGTCTACTTGGAATATC GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG CGAGTGATGCTAGGCGTGATGGTAGGTAGCTGG GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG AGCCAAGCAGCAGCTCACGAAGTAAATCAAGCA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG GTAGTACGTGTATTCAGTACTATTGATAAAACA GTCTTAATCCCCTTAAATGGCGGGGCGTCTTTCAGAG
>LR134303.1|VEE64256.1|4624494_4624833_-|Uncharacterized-conserved-protein MMTPNYSSQFSTSMFWSSLAKDMKRLGRKSLLLALKLYYAAQDPATPVWAKGIIFSALGYLISPIDSIPDMLPVIGLTDDLAVLTAALATVAAHVTESHRLKAEARLQAMFD >LR134303.1|VEE64255.1|4623549_4624431_-|GTPase-Era MMNNIIAILQDLDSQLAPAQVERIREKLTSMLNYRPRIGVFGKTGVGKSSLCNALFGKPICAISDIEACTRDPQEVMLNMENGKGITLLDVPGVGENLQRDEEYSELYRKLLPELDMVLWVLKADDRAYSVDIEFYTNVVKPHLQQGKPFILVLNQVDKIEPFRDWDGENRKPGPSQAQNIALKAANVATHFQVKQSLVVPVSADEKFGLTKLVDEIIFSLPDEKKVSIAREVPAENLSTESKAEVKNSLSRVLTRTLTGVATGAALGSKIAGKAGAVVGGVIGGIGGFLGLW >LR134303.1|VEE64254.1|4622761_4623550_-|GTPase-Era MKPTLRCTVKIERDVVALVQARNAFGFLHNRQPLLSLASRWQRGDSLSHDIQVIAIGKSGYGKSTTLNQLVGEAVFETNAIAGCTREMQSAEYRFPTKEAQCHFSLADLPGIGEHPELDKQYIQLYRDAIAKAHSVIYFLRADQRDYVIDQWAFAQLFNTDAERRKVIIALNAVDKIEPLNRSQPFALTAEQIQSLRLKRHTICQQFAVCEDQIVTLSATEQHNLDGLVDRLADVLQPYLSQADTTSAYQAHWTSSVNGATY >LR134303.1|VEE64253.1|4622462_4622765_-|putative-CRISPR-associated-protein,-VVA1548-family MNKRIWLVSRHPGAQTWLKQQGFQGQQIAHLDIDTIAAGDIIIGSLPIHMVVALTLKNAEYWHLSMTIPECWRGTELTPEQMQLCQIKLQKITAQAEDFF >LR134303.1|VEE64252.1|4621864_4622155_-|Uncharacterised-protein MMSSSSPYIFGQQANSQWQKRPLGRVGDLQGWHQRMKALCPNPRLGQVVWLENNHWDNGEGQYVHSSTGYKVINFRANGDPVWERGQVIILSTDDE >LR134303.1|VEE64251.1|4619619_4621851_-|Probable-ATP-dependent-helicase-dinG-homolog 
MLNYQDNNPTQQHSIDIAEVKLANAKLWQAQSQQLYTTLSQAIAGFKSRAGQQQIVELCANNSAQHGQLVVEAGTGVGKTFGYILGHLPAILQNEQRLLISTNTITLQNQLMEKDLPQLLAILAPELTYTIAKSGSRYLCPAKVRSLLTKTEKNTDADSTDDMFDSASPVNMSSAQLQCLKEVYTLFTEGNFAGDLDQLQVANAPQYQKLLNRDQQRCPGQRGCSDGKQCPFYLQREKISKTDIVVTNHAFLVNVLMHDSNAFGDLTNTMLVLDEAHHLPKVIRDVNQAELDLSQAAEIADQFDKLEKASNKLIKTQPALFDDINKFTTSIQRLKPAITDFCEQLQQLNEHMSLNFMAYRGKKQNSFDDIEQWLMSYAPMPISFVTPLLNLYQSCATARTYVEQLNGQFRQQLDNKDIPQAAENACNRWLQSSANLLAMVAQAHHCLDYYYQFTNQQTEDQRLDVGIARWVKNDAATDKITLLANVIDVSQHFQLRIAQAFKSQVLTSATLKTLGSFNHINRQLGINSETSIHADIPSDFNYRHAFLSWAPSNAPEPNNEHFIGYLTQQIMELTQRHSAILVIFTAHNMLESTFNLMPTALKQQILCQDKRSKSELIQEHKRRIDAGQTSILFGVDSLSEGLDLQGKYLTGVVIAKLPFPSLSEPLINYEKITLDARGINSFAQMMVPECSRKLIQSVGRLIRSEQDWGEVFIADRRIQPRPFSNYYGNILLNALPIFQQGVR >LR134303.1|VEE64250.1|4619031_4619619_-|Helix-destabilizing-protein MTTGINKVILIGHLGKAPEIKTLNDGVRLCNLSLATSERWTNKQTGQAQEHTEWHRLVLSDRLAEIAEKYLQKGSKIYIEGKLRTRKWQNQLGQEQYTTEVRVAQLQILSSRAQEATKTAEDYSKAVSPVQPVSPVQQVSPTITKLQPIRRMLTILTWMIGTIFLFEVVRHGTISLGNSIVNHSIMWKSKPHLGG >LR134303.1|VEE64249.1|4618008_4619031_-|Uncharacterized-protein-conserved-in-bacteria MSQDILWQARNEIIALSAEVGIDGSALLNALPSSSQLLTGSQVPTLSPRYRGKCSVLFHIHQMRNGKSWPWMRLHTFKHGGISRVFNGLQWQRQNSATHWPITPHFGQSNTVSTPQPPSASATLADWRATRFASLQVQYQALPTLTPTHPWLQQRFAAQLPSAVLQRCHLRGNHQQLLLPLQHHHGEISGFQLLTWGENAQKRFWLRRQGLLTGSYALISGQPGPVALCEGLATALSIALAWPGKIYIGLCANNLAAVRAKITGKVVIFYDNDNWKPDYGNPGQQAALACRQTRDILVGPQFSSEQQQTKPTDFNDLHNLAGIAELMHQIRRHWTVKSSY >LR134303.1|VEE64248.1|4617706_4617985_-|CRISPR-associated-protein-Cas2 MMKQQNILIAYDIQSDNARSCALYNLRKFAISYQDSVFELQLSAAKLQRLILKLQPYISTNDTLLSVHFSPNACWQLGCGLPSISGQFLVIG >LR134303.1|VEE64247.1|4616743_4617703_-|CRISPR-associated-endonuclease-Cas1 MTTLILEKHGISLEYDTDVLVIREAEMPPRTIPLSRIDKVVCLHNTALSTQLLGQLHQRGIDFIVLNSRYTDHCVALYADTHGNCLRRAAQYTLQLDNKERLPLAKSLCHHKLQQSLRVLDAQQPGRLQSQLQMAQESVRNCQDEQQLRGLEGSAQRALFQHWRQQLDPKWGFERRERRPPPDPVNGLLSLTYTLVHHEAIRQCKQYGLDSTLGIYHRMAYGRHSLACDLMEPTRPIIEQWLVQLLDTDLLTQRHFTKGKPGCYLGKEGRLLFYPQLETQMALIRRKLAANARWMVQQLDHGLTQGQYTLSEASAYELN 
>LR134303.1|VEE64258.1|4626828_4627335_-|Uncharacterised-protein MNTRVLTRQSAMLHYLEQVWQLLQDSYADVPGGLHFASRQQLLETTVRWKLVLQDKQLLAVTVYKAKKGLKLVAMGINQGLQALGKAALILIIKQDLPSCWMELSEKAERFVLKYCDGHNYIIHSSLVAQLLDKPITPHDADSYHYCRFVQQLMKVKIALGSPQLTQS >LR134303.1|VEE64259.1|4627641_4628913_-|CRISPR-associated-protein,-NE0113-family MKHILLAVSGMTPQIITETLYGIYKKDCSQMPTEIHVITTGAGSDKLINALMGSDNKLEQFCRDYQLPQILFTEEHIHIPKGDDGLKIVDVRSEREQEIIADFITQFVRDKTTQADIAIHASLAGGRKTMGFAMGYAMSLFGRHQDSLSHVLVSEPYEIVPDFFYPTPQETWRADKNGSRHDMSKAEVTLATIPLVLMREEMPTALLSNTQLSYTETVSRVNKANALNAEDASVILDYQRLTINCDGYEVAMKPDCFAFYSWLAQDSKENPGEGTEAPCSGMKCGELNQRLRKFYLALLPPQWVLRDEYTEISLEELAEVTKDHLDRLEPQPKSSWLLQDNDNTQQLLNADADTCDAELVKKHNTLWHRLLRETNKALEDVMGKKLAKFYQIQTVNHVKGSVKTAVQDFKGLAIQAHKIQFID >LR134303.1|VEE64260.1|4629041_4630235_-|Domain-of-uncharacterised-function-(DUF1887) MKTIHINLVSEQLIPNLISTLGDENCCGVVLVLGDSKFADKADRLENLYKRNQLKVLWRSQGMSSTRLPQLQTQANALIDYLATNHSDCHWVLNATCGTKPMALAFANAFNQQDKQQALVIYTDTEHKEIPLLNPGVDFTLPFKSVLSLDDYLLANGFEYEQAINSDNDAQLQQHAPLTRYLAQQLVGKCQHMLGSLQSMATIAAKDFPHSQIQTMPSVPHGDYAKLYLRLNDEGLLSWDGQCKQITFQSEDACRYLAGRWLEEFTYLTALECGAKEVAMSVTGRWLQDSRNFGSDDLTNEFDVLILHNNQLLTIECKASNWFRQEEHGSKNQDIIHKLDTLSKNLGGLFGSPMLVTAQQLSDAARSRVTYNRFRCCEQATEKSLKKALCAWVGTVG >LR134303.1|VEE64261.1|4630313_4631192_-|Uncharacterized-conserved-protein-(DUF2276) MTPPTVLLDLAEQFEILHLRCNLILQADGVLPAFKGSLWHGWLGHAIKGLDEPLYQLLYGSHAEQQPKPYLIKAGADHKQQWRAGEMLEFDIILLGSACQLGQRLVSALLSGQRLGLGTNRIPFRVQSLTSVLPMRQTPGLHVARLIDWLMPMDVGLDCEIALQLHSPLRLKQNGNIIKSATPALPELIKQISRRMALLTLFWVNEDPCLQAALFKTLPILGDYQSDGSHVWFEDWQRYSARQHEQLPFGGLKGQLCYRGDLSAAIPWLQLGELLHMGGKTTFGLGQYRLIY >LR134303.1|VEE64262.1|4631188_4633483_-|CRISPR-associated-protein 
MSSQVHAPYHFAPLSSWVYMPEWAHLVSHDHPFEDSLSGVIEYELTNKTPLLVGGEQLRADNQPSKILWSRDPDGKPVIPGSSVKGMLRSVLEIASFAKFSMVDDHHFGFRDISNSDSRYAQKVLDRETETQTYWLKYDADRCQWMLRKCQHTVLFTDDFNQFSGLRFHNQSFKLPAKEKYEQWPLNKPAIPFELGTRTMMGTKGKPVTITCAQQLGTGKLAGVPVFTGSRPGKNEVKEQRLNFNYVFYAHEAEAKPLDNGAAKVTKMFAAHDEDLVNHLKKFGHPELGMPIFAKEKQGKIIAMGFAKMPKVLYDNSIGDIANKQQKPLNSQSVFDMPGLMFGTLAESGFSLKSRVAFSDASCSHNAGITLSKPVILGQPKASYLNAYLEQNAQSDRVRGELSQYENNSKLAGWKRYPAQSGFNAHLPADLARKTSVQSQLELMNPGSRFNGKLVFHNLKPVELGALLWVLNPDGDKRVFHHALGHGKSLGAGAVQFNAKITIAHSDLTPELNELRSLFVEHMNAVYPSSQPTASSWLLSPQIRHLLAFGDRGDNQNKDLSYMQLNSKDSQEITYTSSAKGREKLVLPAFCHQSDALGRNEQLKQSAPQLGRGRLYQLLEQLEKTDNISEFEREQLEQSKQAAEQAAKNALFAQASPQYREYLNIIDELTPYRNQSNVDADNKRQGMRGRIDSLLTNCLSNENEADETELQQICRFVFDFPFSGYIDPSIKRKDLSKKAKEKFDERKALVDALLGKCKLEAADI >LR134303.1|VEE64263.1|4633484_4634006_-|Uncharacterised-protein MSNSETMTIYNTLASKSYTEPELLGCALPTLYGGRIEAQHQICNRDELKLMLAQWQDAEGWYQSRADVGLGMPVNLDELLEGQWHRADQSLHLELRSDQTYNLTLFIQHEVSQGQLARQCYSDQLIFLRPSLKSQNQTLRYRIWWQQAETGPNEGGWRPLLQQFLGLAQQKEQ >LR134303.1|VEE64264.1|4634009_4635419_-|CRISPR-associated-RAMP-protein,-SSO1426-family MSQMYLTRLLIETTAPLAINSGGRETGFDSQLARDANQLPYIPASSLAGVWRHLVHSCVGADASKRWFGCIDDNGESSRLFVQDGLLLDSQGHIVQGLVEAERIAADPLLQRLQQTRPHHRERVRINDRGVASDLGKFDQIILPTGIRFCIDIRWMSDDVSENDRQEWQQLLSYFSHPAFALGSSTRNGLGRFKIIADEQNILTLTNNPSAGAALRKFVKRETLPTKVSSITNDYRPFARLELTALDSWRCGQGSRPLGDKTDQHTDSFTYSEPTVRWHKGKAHWNDKPQAILCGSGIKGILAHRLAYHYRRRKGAFAEIMAEASHQEWESRPQELGQLLGSSREKNGAGSSEQEVAGQLFVDDAVIVCDKTLIRTHNSIDRFTGGVRQGALFSEELLWQPKIALTLRLASNAKISQTLADALADTLEDLKLGLLPLGAGSGRGNSLVEHQTGQIWDIDWAQLTILAQE >LR134303.1|VEE64265.1|4635421_4636954_-|CRISPR-associated-RAMP-protein,-Csx10-family 
MRLYFSLTNLEPLVISQSSASTNNHQCLDHIPGSAILGAIASKLYSSLSAEQSFALFHSGACRFGPAYPIHNDEIAMPIPASWHKIKNDGHTLYNHAAANFSRDKAKQYQQCRNGYITSKSIDVFVKQGLTTRTAIDEQTQRASDGQLYSYAFIEAGQHFGAWVDVNDSSLLAILRPLLNGELNIGRSRSSEFGRVQLHCPTQQPATQQACKLENQLVLWCLSDAECRDEWGMPTVTPRAEDLHPALKGELDTTRSFIRSHKVRRFNRARNGFDTEQQLISRGSILTFTLHETASDDVLREMASQGIGCNRQQGLGWVSANPQWASMAQPNSLAIFDAITLTPPPKQETSGAQTETPLLRWVRVKQQESQTNKDQSTRIKALHQAIYAAYSNARHYRNVPRNYQAGPSSSQWRRLTDLVRSRQDESWFKVAFVSDAAICKSNNDPDGWGIDWQEQGVLLTFSEKMQKIFDGADILLMRQLLEILCRYDLSTSDGLQRFHTNHLAPRKGGE >LR134303.1|VEE64266.1|4636953_4637553_-|CRISPR-associated-RAMP-protein,-SSO1426-family MIKSIQVMLTFDLRSEWHLGSGREGGAYADNLVQKNPDGLPIINGKTIKGLLRQAFNDANKYQWFPDADVDTIEHLFGREGTDLIASGALKCDSASLNLAEQHYFKENPSAIGHLYRVRHATAIDPQTGTAQDGSLRAMEVVIPMVMQSRLTLTTDNPKILKQFSKWLDLSLPLLCALGGKRRRGLGEVIVTATGQENY >LR134303.1|VEE64267.1|4637549_4639274_-|Uncharacterised-protein MYVYLFEAKSIQTYLFRSGRLKDVISASERLDRLVDDDSGSVLAQVLHSAGLKSDLLESVLVSDQVIGFTRCKGGAFYAYCEQREPLLTLRSLWTLTVQQLFPGLEFADALTQGETLAEAMDAGHPALAASRNCPTVKFPLATAPCDHAPRTGLAAVPLSAAAKTEVKSSEKDERIDLDTEHHRQGYRLLRLRKEPLLHKFITSIEGDKQLPDSLNFPLNMEDFPAFNDPDEGHSVVKDLALIHIDGNGLGLLLRSLQGALKGIGDQEYGRAFRTFSSALARATQQAAFQASYWLYQEQKRGLDAGCEPDMLAMRPIVLGGDDITLFCQADLALGYAETFCIAFKSCSERELAPLYHDYLQGTPLKPYLTASGGILYHKATHPFTTCHNLVEGLCKEAKQLTQSLDSHVGPAAISFYRLSHALAEDIVTLRTQSQQFLIEDRQLLTSIGGYLVEDKTQNNAHKTNTYTPSLSNLKQAITLLRDKRSSLSIAKFRQMATELARGDMDEAERIYERAYEQLSPATLKEWQTILASLFPKADTPNWYWNNKSWLSDLLTIAHFLPQSTTSVTQGSKK |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
LR134303_4 | 4.13|4626169|34|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4626169-4626202 | 34 | MN582077 | Podoviridae sp. ctLUJ1, complete genome | 13269-13302 | 0 | 1.0 |
LR134303_4 | 4.16|4626380|33|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4626380-4626412 | 33 | MN582077 | Podoviridae sp. ctLUJ1, complete genome | 13120-13152 | 3 | 0.909 |
LR134303_3 | 3.19|4615323|31|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4615323-4615353 | 31 | NC_047960 | Enterobacteria phage vB_EcoS_IME347, complete genome | 9292-9322 | 5 | 0.839 |
LR134303_3 | 3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4614483-4614514 | 32 | NC_027625 | Propionibacterium phage Kubed, complete genome | 8623-8654 | 6 | 0.812 |
LR134303_3 | 3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4614483-4614514 | 32 | NC_022340 | Propionibacterium phage PHL114L00, complete genome | 8622-8653 | 6 | 0.812 |
LR134303_3 | 3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4614483-4614514 | 32 | KJ578775 | Propionibacterium phage PHL114N00, complete genome | 8622-8653 | 6 | 0.812 |
LR134303_3 | 3.11|4614765|33|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4614765-4614797 | 33 | NZ_AP017309 | Leptolyngbya sp. NIES-3755 plasmid plasmid1 DNA, complete genome | 41018-41050 | 6 | 0.818 |
LR134303_4 | 4.9|4625890|32|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4625890-4625921 | 32 | MF370965 | Pseudoalteromonas phage SL25, complete genome | 31808-31839 | 6 | 0.812 |
LR134303_3 | 3.4|4614272|34|LR134303|CRISPRCasFinder,CRT | 4614272-4614305 | 34 | NZ_CP033067 | Pseudoalteromonas agarivorans strain Hao 2018 plasmid unnamed, complete sequence | 39862-39895 | 7 | 0.794 |
LR134303_3 | 3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4614483-4614514 | 32 | NC_011561 | Candidatus Azobacteroides pseudotrichonymphae genomovar. CFP2 plasmid pCFPG2 DNA, complete sequence | 18306-18337 | 8 | 0.75 |
LR134303_3 | 3.18|4615255|31|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4615255-4615285 | 31 | NZ_CP013238 | Clostridium butyricum strain CDC_51208 plasmid pNPD4_2, complete sequence | 796506-796536 | 8 | 0.742 |
LR134303_3 | 3.31|4616163|32|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4616163-4616194 | 32 | CP000620 | Burkholderia vietnamiensis G4 plasmid pBVIE04, complete sequence | 82074-82105 | 8 | 0.75 |
LR134303_4 | 4.9|4625890|32|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4625890-4625921 | 32 | NZ_CP054613 | Paenibacillus cellulosilyticus strain KACC 14175 plasmid unnamed4, complete sequence | 2257422-2257453 | 8 | 0.75 |
LR134303_3 | 3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4614483-4614514 | 32 | NC_020865 | Cyanophage KBS-P-1A genomic sequence | 14211-14242 | 9 | 0.719 |
LR134303_3 | 3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4614483-4614514 | 32 | HQ317389 | Synechococcus phage S-RIP2 genomic sequence | 4195-4226 | 9 | 0.719 |
LR134303_3 | 3.3|4614203|32|LR134303|CRISPRCasFinder,CRT | 4614203-4614234 | 32 | NZ_CP048417 | Citrobacter freundii strain CitB plasmid pA_CitB, complete sequence | 79497-79528 | 10 | 0.688 |
LR134303_3 | 3.31|4616163|32|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4616163-4616194 | 32 | MN692999 | Marine virus AFVG_117M75, complete genome | 53757-53788 | 10 | 0.688 |
LR134303_4 | 4.9|4625890|32|LR134303|CRISPRCasFinder,CRT,PILER-CR | 4625890-4625921 | 32 | NC_049805 | Lactococcus phage phiQ1 DNA, complete genome | 21438-21469 | 10 | 0.688 |
LR134303_3 | 3.4|4614272|34|LR134303|CRISPRCasFinder,CRT | 4614272-4614305 | 34 | AP014258 | Uncultured Mediterranean phage uvMED isolate uvMED-GF-U-MedDCM-OCT-S27-C55, *** SEQUENCING IN PROGRESS *** | 9704-9737 | 11 | 0.676 |
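The columns of the table above appear to be related by simple arithmetic. This is inferred from the listed rows and is not a documented rule of the database. The Spacer_Info field reads as `spacer id|start|length|contig|detection tools`, Spacer_region runs from `start` to `start + length - 1`, and Identity matches `1 - mismatch / length` rounded to three decimals. A minimal Python sketch under those assumptions follows; the names `SpacerInfo`, `parse_spacer_info`, `spacer_region`, and `identity` are hypothetical helpers, not part of any tool named in the table.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class SpacerInfo:
    spacer_id: str    # e.g. "4.13"; assumed to mean array 4, spacer 13
    start: int        # 1-based start coordinate on the contig
    length: int       # spacer length in nucleotides
    contig: str       # contig accession, e.g. "LR134303"
    tools: List[str]  # tools that reported the spacer


def parse_spacer_info(info: str) -> SpacerInfo:
    """Split a Spacer_Info string on '|' (assumed field order, see lead-in)."""
    spacer_id, start, length, contig, tools = info.split("|")
    return SpacerInfo(spacer_id, int(start), int(length), contig, tools.split(","))


def spacer_region(info: SpacerInfo) -> Tuple[int, int]:
    """Inclusive start and end coordinates of the spacer."""
    return info.start, info.start + info.length - 1


def identity(length: int, mismatches: int) -> float:
    """Fraction of matching positions, rounded to three decimals (assumed rule)."""
    return round(1 - mismatches / length, 3)


if __name__ == "__main__":
    hit = parse_spacer_info("4.13|4626169|34|LR134303|CRISPRCasFinder,CRT,PILER-CR")
    print(spacer_region(hit))  # (4626169, 4626202)
    print(identity(33, 3))     # 0.909
```

Running the same functions over the other rows is a quick consistency check on the table before using the hits downstream.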
1. spacer 4.13|4626169|34|LR134303|CRISPRCasFinder,CRT,PILER-CR matches MN582077 (Podoviridae sp. ctLUJ1, complete genome) position: 13269-13302, mismatch: 0, identity: 1.0
gaagtatgaacaagagaagtctacttggaatatc CRISPR spacer gaagtatgaacaagagaagtctacttggaatatc Protospacer **********************************
2. spacer 4.16|4626380|33|LR134303|CRISPRCasFinder,CRT,PILER-CR matches MN582077 (Podoviridae sp. ctLUJ1, complete genome) position: 13120-13152, mismatch: 3, identity: 0.909
gtagtacgtgtattcagtactattgataaaaca CRISPR spacer gtagtacgtgtattcagtactatcgataagact Protospacer ***********************.*****.**
3. spacer 3.19|4615323|31|LR134303|CRISPRCasFinder,CRT,PILER-CR matches NC_047960 (Enterobacteria phage vB_EcoS_IME347, complete genome) position: 9292-9322, mismatch: 5, identity: 0.839
aaaaaagttgtcatcgttaaatctccgtaat CRISPR spacer aaaaaagttgtcatggttaaatttcccttac Protospacer ************** *******.*** * *.
4. spacer 3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR matches NC_027625 (Propionibacterium phage Kubed, complete genome) position: 8623-8654, mismatch: 6, identity: 0.812
gtagagttcttcccaaaagcaaacaaaccacc CRISPR spacer gccaacttcatcccagaagcaaacaaaccacc Protospacer *. .* *** *****.****************
5. spacer 3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR matches NC_022340 (Propionibacterium phage PHL114L00, complete genome) position: 8622-8653, mismatch: 6, identity: 0.812
gtagagttcttcccaaaagcaaacaaaccacc CRISPR spacer gccaacttcatcccagaagcaaacaaaccacc Protospacer *. .* *** *****.****************
6. spacer 3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR matches KJ578775 (Propionibacterium phage PHL114N00, complete genome) position: 8622-8653, mismatch: 6, identity: 0.812
gtagagttcttcccaaaagcaaacaaaccacc CRISPR spacer gccaacttcatcccagaagcaaacaaaccacc Protospacer *. .* *** *****.****************
7. spacer 3.11|4614765|33|LR134303|CRISPRCasFinder,CRT,PILER-CR matches NZ_AP017309 (Leptolyngbya sp. NIES-3755 plasmid plasmid1 DNA, complete genome) position: 41018-41050, mismatch: 6, identity: 0.818
gaagtatctttagcaatcgctcaacgtaaatca CRISPR spacer tcagactctttagcaatcgctcaacgtaaattt Protospacer ** *************************.
8. spacer 4.9|4625890|32|LR134303|CRISPRCasFinder,CRT,PILER-CR matches MF370965 (Pseudoalteromonas phage SL25, complete genome) position: 31808-31839, mismatch: 6, identity: 0.812
atttcattggcttcttcaataacctttttagt CRISPR spacer tctgccttagcttcttcaatagcctttttagt Protospacer .* * **.************.**********
9. spacer 3.4|4614272|34|LR134303|CRISPRCasFinder,CRT matches NZ_CP033067 (Pseudoalteromonas agarivorans strain Hao 2018 plasmid unnamed, complete sequence) position: 39862-39895, mismatch: 7, identity: 0.794
aagaatccaaagcaagaaaggttaatagctaatc CRISPR spacer aactatccaaagaaataaaggttaatagctatag Protospacer ** ******** ** ***************
10. spacer 3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR matches NC_011561 (Candidatus Azobacteroides pseudotrichonymphae genomovar. CFP2 plasmid pCFPG2 DNA, complete sequence) position: 18306-18337, mismatch: 8, identity: 0.75
gtagagttcttcccaaaagcaaacaaaccacc CRISPR spacer ttagcattcttcccaaaagcaaacaacagtgc Protospacer *** .******************** *
11. spacer 3.18|4615255|31|LR134303|CRISPRCasFinder,CRT,PILER-CR matches NZ_CP013238 (Clostridium butyricum strain CDC_51208 plasmid pNPD4_2, complete sequence) position: 796506-796536, mismatch: 8, identity: 0.742
tgcgcaggtgctgttttgatgatttattcgc CRISPR spacer acagcagctgctgctttgatgatttatgctg Protospacer **** *****.************* *
12. spacer 3.31|4616163|32|LR134303|CRISPRCasFinder,CRT,PILER-CR matches CP000620 (Burkholderia vietnamiensis G4 plasmid pBVIE04, complete sequence) position: 82074-82105, mismatch: 8, identity: 0.75
aaatacc----ctgaataccaagaaaaagaaagata CRISPR spacer ----accatgactgaataccatgaagaagaaagagc Protospacer *** ********** ***.********
13. spacer 4.9|4625890|32|LR134303|CRISPRCasFinder,CRT,PILER-CR matches NZ_CP054613 (Paenibacillus cellulosilyticus strain KACC 14175 plasmid unnamed4, complete sequence) position: 2257422-2257453, mismatch: 8, identity: 0.75
atttcattggcttcttcaataacctttttagt CRISPR spacer gctatattggcttcttctttaacctttttgat Protospacer ..* .************ **********..*
14. spacer 3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR matches NC_020865 (Cyanophage KBS-P-1A genomic sequence) position: 14211-14242, mismatch: 9, identity: 0.719
gtagagttcttcccaaaagcaaacaaaccacc CRISPR spacer ttagagttctacacaaaagcaaacggaagctc Protospacer ********* * ***********..* .*
15. spacer 3.7|4614483|32|LR134303|CRISPRCasFinder,CRT,PILER-CR matches HQ317389 (Synechococcus phage S-RIP2 genomic sequence) position: 4195-4226, mismatch: 9, identity: 0.719
gtagagttcttcccaaaagcaaacaaaccacc CRISPR spacer ttagagttctacacaaaagcaaacggaagctc Protospacer ********* * ***********..* .*
16. spacer 3.3|4614203|32|LR134303|CRISPRCasFinder,CRT matches NZ_CP048417 (Citrobacter freundii strain CitB plasmid pA_CitB, complete sequence) position: 79497-79528, mismatch: 10, identity: 0.688
gtagtagtttcagcatcatggctactcgaact CRISPR spacer cggttagtttcagcatcatgccttctccttcg Protospacer . **************** ** *** *
17. spacer 3.31|4616163|32|LR134303|CRISPRCasFinder,CRT,PILER-CR matches MN692999 (Marine virus AFVG_117M75, complete genome) position: 53757-53788, mismatch: 10, identity: 0.688
aaataccctgaataccaagaaaaagaaagata CRISPR spacer gctctggctgaataccaagaaatggaaagaca Protospacer . . *************** .******.*
18. spacer 4.9|4625890|32|LR134303|CRISPRCasFinder,CRT,PILER-CR matches NC_049805 (Lactococcus phage phiQ1 DNA, complete genome) position: 21438-21469, mismatch: 10, identity: 0.688
atttcattggcttcttcaataacctttttagt CRISPR spacer caatcactggtttcttcaataaccttcctctc Protospacer ***.***.***************..* .
19. spacer 3.4|4614272|34|LR134303|CRISPRCasFinder,CRT matches AP014258 (Uncultured Mediterranean phage uvMED isolate uvMED-GF-U-MedDCM-OCT-S27-C55, *** SEQUENCING IN PROGRESS ***) position: 9704-9737, mismatch: 11, identity: 0.676
aagaatccaaagcaagaaaggttaatagctaatc CRISPR spacer cctaatccaaaccaagaaaggctaatatgataca Protospacer ******** *********.***** *.
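The symbol row under each alignment above has lost its runs of spaces, so columns no longer line up with the sequences. The sketch below regenerates that row from a spacer/protospacer pair and counts mismatches. The symbol convention is an assumption inferred from alignments 2 and 3, not a documented format: `*` for identical bases, `.` for a transition (a/g or c/t), and a space for any other difference. It handles only ungapped, equal-length pairs, so the gapped display in alignment 12 is not covered. The names `match_line` and `count_mismatches` are hypothetical.

```python
# Purine-purine and pyrimidine-pyrimidine substitutions (transitions).
TRANSITIONS = {("a", "g"), ("g", "a"), ("c", "t"), ("t", "c")}


def match_line(spacer: str, protospacer: str) -> str:
    """Build the per-column symbol row for an ungapped pairwise alignment."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have equal length (ungapped alignment)")
    symbols = []
    for a, b in zip(spacer.lower(), protospacer.lower()):
        if a == b:
            symbols.append("*")   # identical base
        elif (a, b) in TRANSITIONS:
            symbols.append(".")   # transition (assumed meaning of '.')
        else:
            symbols.append(" ")   # any other mismatch
    return "".join(symbols)


def count_mismatches(spacer: str, protospacer: str) -> int:
    """Number of columns where the two sequences differ."""
    return sum(symbol != "*" for symbol in match_line(spacer, protospacer))


if __name__ == "__main__":
    # Alignment 2 above: spacer 4.16 against MN582077.
    spacer = "gtagtacgtgtattcagtactattgataaaaca"
    proto = "gtagtacgtgtattcagtactatcgataagact"
    print(repr(match_line(spacer, proto)))
    print(count_mismatches(spacer, proto))  # 3
```

Printing the row with `repr` keeps trailing spaces visible, which matters because a mismatch in the final column is rendered as a space.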
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1915328 : 1927825
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >LR134303|1915328:1927825|DBSCAN-SWA ACTACGCATCCTCCAACATCATCGCCCGCAAAGTCTCCAAAGCCTGTGTAAATGCGAGTGATTGAACCAGTCCATTGAGTTCAGCCCATTGCGGGTGCTGTGATAGCGGTTTTGCGAGTTTGTCGACTAAATTCACGGCTTCTAAATTATTTTCCTGTAAGAGAGCAACCAAGGTATCCAGCTCGGGGCGCATACTCTGCATATCCACGGCGGTACCGTTTACAGGCTCAATCGATTGAGGCTGTTTGAGCTGCCCCTTAGGTAAAAAGCCTGCAAGCTGTTCAATACTTTGGTGTATTAAGGACTCCAGTGTATCCGTCCAGCGTTTTATTTCCAGCAGCTCGAGGCCTTCTTGTTTAAATTGTTTTTCTAAAAACGCCGCATGGACGGCTAAACGCCGCGCGCCAAAGTTGCTGGCAATGCCTTTAATCGCATGACTGATCGCAGCTGTAGTGGCGTAATCAAAGGTTTTGGTCGATTGCTTGAACAAGCTTAGCTGCTTGGTCATTTCGGGGGCAAAACTGCTCGCCATTTTTTCGAAGAACACTTGGTTGCCGCCAAAACGGCGCAGGATCAAACGAATATCGTCGAGCAGGGTTTCGCCCTCGAGATTGTGTTGTGAGTCGAGATGCGCAGACTCATGCTGCTCGAGGGATGTCACGTCACGCCCCACTAGACGCAGAATATTCGGCAACAGCAGCTGCATATCGATAGGCTTACCGACGTGGGCATTCATCCCCGCCTCGAGGCATTCTTGTTTATCGGTTTGTGAGGCATTGGCCGTCATCGCCAAAATAGGCAACTGGGCGAATCGACCATCGGCACGGATCCGCCGCGTCGCCTCTAACCCATCGATGTCGGGCATTTGCACATCCATAATCACGGCATCGTATAAATCACCGCTTTCGAGTACCTGTGCCACCCCTTCGACACCGCCACCTGCGAGCACAACGGTGGCGCCTTCATAACTCAGCAGCTCCTCAATCACCTCACGGTTTAATTGATTGTCTTCAACCACTAACAGGGTGAGCCCTGCTAACAGACGCTGTGAGCGCGGCTTAGGATTCGAGTCCATGGTCTTGCCTTCGATGGCATTGAGCACGGCTTCAACCAACACTTGCGAGGTGACAGGTTTAGTTAAGAAGTTGACGAAGGGAACGTTATTAATGTGTTGGCTCTCTGCAATCACTTCATGGCCGTAGGCCGTCAACATCACCACCAGCGGGGTATAACTGCCGGTTCCGGCATTTTTCAACATTTCCGCGGTTTGTAGACCGTCGATATCCGGCATACGCCAATCCATTAGCACCACATCAAACTGCTGTCCGGCTGCATTGGCCTGTTTTACCTTTTCAATGGCCTGATAACCGCCAGAAGCGGTTTCAACCAGACAGCCATAGTCGCTGAGGATTTTCGATAAGATCTCGGTCGTGATTTGATTATCGTCGACCACCAGAATGCGATATTGGCTCAAATCGGCGCGGGCGCGGGCTTCAACCTCCATCACTTGGAAGGTCAAATCGAACCAGAAGCGACTACCAACCCCGACTTTACTGGTGACTTGCAGTTGTCCGCCCATTAACTCGACAAGACGTTTACTGATAGCCAGCCCAAGTCCCGTGCCCCCAAAACGGCGTGAGGTTGAGGATTCAGCCTGCTCAAACCCAGTGAAAATCCGCTCAATTTGCTCTTCGCTGATGCCAATACCACTATCGACAATCGAAAACTGCACAGTGACGGTATCGGCCTCATGGCGAATACATTCTAGGCCGACAATCACTTGCCCGTGAGGGGTAAACTTGAGTGCGTTGCCCGCGAGGTTGATCAGAATTTGCTGCAGCCTTAATTGATCGGCCAGCAGCCAAGGAGGCAACGCCGCGTCGAGATCGAACATCACTTCCACATCGGTATTGTTGTGGCTCGCCG
ATAACACCACAGCCAGATCCCGCATCAATAACTCAATCGAGCAAGGGTGTGGATCGAGTTTTAATTTGCCCGCATCGATTTTTGAGAAGTCTAAAATATCGTTAAGCAGCCCTAACAATGACTTAGCCGCCGTTTGCGCCTTGTTCACATAGCCCTGCTGCTGCACCGAGAGCGAGGTATATTGCATCAACTGCAACATCCCCAACACCGCATTCATGGGGGTACGGATTTCGTGGCTCATATTGGCTAGGAACGCCGACTTAGCGGCATTGGCGTTATCCGCCTCCAGCTTGGCACTCCTTAAATTCGCCTCAATCTGCCGCTGGGCGGTAATATCGATGTTGATCCCAATCACTCGGATCACATTCCCTTGCCTGTCGCGCTCAATTTGGGCACCCGCCTGCACATAGCGCACCGCACCATCCGTGGTTAACACGCGGAAAATTGGATGATAGGGTTCATCATACTCAACCGCGCGGTTAAGCTTTTCTTCGGCCTCTTCAACGTCATCGGGATGGACACGCATCCGCCAATGTTCATAGCTAAGACCTTGCTGTTTAAGGCTTTCAGGCTGGTCATACATGGCAAACATGCGGTCGTTCCACTGCAACGAGTTATCGAGCAGGTTCCAGGTCCAAATCCCCAGTTTAGCCACTTCCGCCGCCTTACTGAGGTGATTACTCGCCGTGAGTAAGGCCTCTTGCTGGGCAACAATTTGCGAAATATCCACACCAATCCCAAGATAACCTAGGATTTCACCGTCATTACCACGCATCGCGGTTACTGACAGTGAGATTGGTAACTGGCTACCATCCTTACGCACATAGGTCCAAGTACGAGTCTCGGAGCCCTCTTCCCGCGCTTTGTGGACAAAGGTGTCAAAACCTTGAATTTTTTGGCCGTATTCAACAGAAAGCTCGGCGGCGCGAGCGGCAATTTCATCGGCAATATGGAAAGGTGCAGGGGTGGTTTTGCCTATGATCTCTTCAGCGGAATACCCCAACATGCGCTCGGCACCGCGGTTGAAGATGGTGATAATGCCCTGTGGATCGGCGGCAATTATGGTCATTTCGGAGGCCGCATCCAGCACATTGGTCAGCAGAGTCGCAATCCGATTTTTGTCGAGCTGGGCTAATTTACGCTCGGTAATATCGGTTTGCAGCGCGACAAAACGTTCCACTTTGCCACGCTCATCGAAGACTGGGCCAATCACAGTATCGAACCATTTCAGCTGCTGATCTTTCCCTTGGTTACAGATTTCCCCATGCCAAGATTTACCGGCCTTAATCTGTTGCCACATCCGCTGCCAAAACGCGGCATCGTGCTCCCCAGACTTCAGCACTGAATGGGTTTGACCTATTAACTCCTCCCGCGCGTAGCCACTGATCAAACAGAAGTGTTCATTCACATCTAAAATCACCCCGTCTAAGTCGGTGACCGAGTAATGCAGCTGCTGGTTGATGGTATCGAGCAGTGCCTGATTCTCGAGTAAAGCCTGCTGCAAAGCATGGGTACGCTTGGTCACCTGTCGTTCGAGGCTAGCGTTAAGGTTGAGAATATGTCGCTCGGCATCCTGCTGGGCCGTGATATCCCGAATGGTTTGACTCAAACCCACTATGTCGCCATGTTCGTCGTAAATCGGTAAGGTAGTGGTCGAGGTCGACAGGTTGCTGCCATCGGCGCGTTGATGGCGCGATACTCGATTGAGTACCGTCTTGCCGGTCAACACATCGGCCATAAGAGCCTGTTCCTCCAGCACTATGCTGGGTGGAATAATCAATTCGCTACTGGGTAAGCCCAGCGCCTGCTGCTCTGTGTAACCAAAGAGTCGTTCAGCACCTTGGTTCCAACTGGTTATATTGCCAGCGAGGTCATAACTGATAATGCCGTCGAGGGAATGCTCCAACATGCTGGCGCGGCGCGCTTGCTCGGCTAACACTTGCTGTTTACGTTGCAAGGTTAAGGACCACATAGCAATCAACGCCGCCAGCAACACACTG
AACAAGCTGCCACTGAGCAACACTAAGCTGGGTTTATTGAGATGTAATGACTTGATAAAACCTGGATAGGCCATTACCTCAAACTGCCAATGCCTACCAAAAATCGTCTTATTTAACTTGTAGCTGTAATCCGATAGCGGGCTTAAATCATTGGCGTGGGTTTCGAAGAAATTGATCGGCTGCTTATCGTCCGTCACGTCGCTTAACATCAGCTTGGTCATTTTCTGATTCAGCGATAATCCCGTTAAAATTTCGGTCGTCACCAGCGGCGCATAACTCCAACCAAACCCTTGGGCCAATCGCTCTTCCGGAGTTGGAGGGACAATACCCGTTCGGTAAACAGGTAATAAAATTAAGAAGGATTGCAGCGGCTTACCCGTGGCCTGCACTAAGGTAATCGGCGCGGAAATTTGCACTTCTCCCGACAGCATGGCGCGATCGGCGGCCTCTTTTCTGTGGGCCTCTGAGGCGATATCTAAACCAACCGCCGCCTTATTGCGCTCGAGCGGTTCAATATATTCAATAAGATACTTCTCCCCCTCGTTGGGGTTGAGCTGACGAACATGGTAGTCAGGCCAATCGACCGCCTTTACCCGCGCTAAAAACTGCGCTTCGTTTTCTTTGGAGACCCGCCGAATAAAGCCAAATCCGCGTGCGCCGGGAAACTCCTCATCCACATCCCGCGTCAGACTATAGTTTTGAAACATGGGGCGGCTGATATGGTCTTCACCCGCCGTGAGGATCATGCCTCGGGCGCCCCTCAAGCCATATTGGTAGAGGGCAATCCTATTGGTGACATTCTCACTAATCTGCTGCGCGCTCGCTTCGAGCGCCAAAGAAATGGTTTGCGAATTGATACGACTCACCTGCCAAGCGAGCACGGCACTAAAAAAAAGTCCCACGAGCAAGACGAAAAATCCCCACTTGGAGGCTTGCTTGTAACTCATCGACAGGGTCATATACACCTAATAATTGATAATCATAGAGTTATGATTCTGGACTAATCAAAATACTTTTGCCATAAAGCTGATAAGTTAACTCAACAAATGCATCAAAAGTAGCTAATGCCAATTTAGTTACAATTTCAGTTACATACTAGATAAATGTGATGATTCCCATGCAATTTCATTTGCTTAGTATTAACATGAGTTGGCAATTAGCGGTATTCGGTCGACATTGACTCACATCATGAGCTAGTCGCTCAGATTGAATCCAGCTATATTAAGCACATTCTTGGCAGATTCGTTTTCAATTTCAGCCAGTTAATCTTTGTTGTATGACGGCATAAAGAATAAAAGCAATAATCCGAGCAATTTAAGGACATCCCATGGGACAAGAAACCTCGAAAATCCTCGTCGTCGATGATGATATGCGCCTACGAGCACTACTCGAGCGTTACCTGATGGAGCAAGGCTATCAGGTGCGCAGTGCGGCCAATGCTGAGCAGATGGACCGTTTATTGGAGCGTGAAAACTTCCACCTAATAGTGCTCGATTTGATGTTGCCCGGTGAAGATGGGCTATCCATCTGCCGTCGTCTGCGCCAACAGGGCAGTACGATTCCGATTGTAATGCTTACCGCCAAGGGCGATGAGGTCGACCGGATTATCGGTTTAGAACTCGGTGCCGACGATTATTTGCCAAAACCCTTTAACCCGAGGGAGTTATTGGCGCGGATCAAAGCGGTAATGCGCCGTCAAATCCAAGATGTCCCCGGCGCGCCGGCGCAGCAAGAGGCTGAAATTAGCTTTGGCGAGTTCTCCCTCGACTTAGCCACCCGCGAGATGTACCACGGTAATGAGGCCATCGCACTCACCAGTGGCGAGTTCGCCGTATTAAAAGTGTTAGTCACTCATCCACGCGAACCCTTGTCGCGGGATAAACTGATGAACCTCGCCCGTGGCCGTGATTATTCGGCGCTGGAGCGTTCGATTGACGTACAGGTTTCGCGCCTGCGCCGTTTAATTGAGAAGGATCCCGCCAACCCAAGGTATATCCAAA
CCGTGTGGGGCCTAGGTTATGTGTTTGTGCCCGATGGCGCCGCCCGTCGATGAGCCATTCTTGCCTGAAGTCGTCAGTGCGTTGGCATCAAGATGAAATCTAAGTTTTGGTGGCGCTTTCTTCCCCGCAGCGCCTTTAGCCAAACCGTTATGCTGATTGGTTGTCTACTGTTGATCAATCAGCTGGTTTCCTATGTCACTGTGGCTGTGTATGTATTAAAGCCCAGTTATCAGCAAATCAACCAGTTAATTGCCCGCCAAATCAATCTGTTATTTGTCGATGGTATCGATATCGGCCGCGAACACTTAACCATAGTCGATGCGCTCAATGCCAAAGTCCGTGACGATGGCATGAAGGTCTACAACCAACAACAGGCCCGCGAGGCGGGAATCGAGCAGGCCACCTACTACGGTTTTTGGTCATCGCAGATGTCGGAATACCTAGGTGGCGAGGCAGAAGTTCGCGTCACCCACGGCACAGTATTACAAATTTGGATCCGCCCCCCACAGGCGCCATCGGTGTGGATTAAAGTGCCGCTCATCGGGCAAAACGTTTCCGATTTGTCGCCACTCACCTTGTACTTAATGGTGATCGGCGCGCTCAGTGTCGCCGGTGGATGGTGGTTTGCCCGCCAGCAAAACAGGCCGCTTAGACGCTTACAAAAAGCCGCGATCTCGGTCTCACGCGGCGAGTTTCCCGATCCTCTGCCGCTAAAAGGCTCGAGCGAACTGGTGGAAGTGACCAATGCCTTCAACCAAATGTCCCACAGCATGAAACAGCTCGAACAGGACAGAGCCCTGTTGATGGCGGGTATTTCCCACGATTTACGCACGCCACTGACGAGGATTCGCCTCGCCTCCGAGATGATGGTCGAGGAAGATCAATATCTTAAAGATGGTATCGTCAACGATATCGAAGATATGGACGCCATTATCAGCCAGTTTATTGCCTACATTCGCCAAGATCAGGAGGCGAGCCGCGAGCTAGGGCAAATTAATAAACTCATTCAAGATATTGCGCAGGCCGAAGCCAACCGCGACGGTGAAATTGAAGTCGTACTAAGCGACTGCCCCGAGGCGCTGTTCCAAGGGCTGGCGATTAAGCGAGTGCTCAGTAATCTGGTCGAAAATGCCTTCCGTTATGGTTCGGGCTGGGTGCGGATAAGCTCGCAATTTGATGGTAAGCGTATCGGTTTTAGCGTTGAGGATAATGGCCCTGGGATTGATGAGTCGCAAATTCCCAAACTGTTTCAACCTTTTACCCAAGGTGATATTGCGCGCGGCAGTGTTGGCTCGGGCCTAGGGCTCGCCATCATCAAACGGATTATTGACCGTCACCAAGGCCAAGTGACCTTATCTAACCGCACTGAGGGTGGCTTAAAAGCCCAAGTCTGGCTCCCCTTGGAGTAAATCCAAGATTCAGCATGACAACGCTGACATTCAACTTGCGTTGTCATATTTATGTCACGCGAAATTCATAAACTCTAGGCAGTTAACTTCCTAAAGTTTCCAATGAAGATTGCATCATGAAGATTTCAACGCTGTCACTCTCAGCTTCGGCGCTGTTGTTATTGCTTGCTGGACTATTAGCAGCCGTGGTGCTGTGGAGCAGCGATCAAAGACAACAGATTGAGCAACAAACCCAAGTACTGCAAGGCCTACAACAGGATTTTCTCGTAGGAGTACGCCGCGATCTCGATGGCTATTTAGCCAGTGGTAACGCGACCCAACTCGAAGAAGCCAAAGCCAAACTGAGCAAGATTAAAACAGCGCTCAGCGAACTCAACCTCGCCGCAGCGGGCAGTGCCGATGAACAATTACAAACTGGCCTGAGCCAATTCATTCAAGACTTAGATACCAAATACCGCGCCGCCGGCAAACTCGCGGGCAATCCAAGACAACTACTGGCCCACGCCGAATCCGAAATGCTCGACTATAACCGCAGACTCGGCAGTTACGCCGACAAAGGTTTAGCCGTTAATGCCACTGTGGCCGAACAAT
ATCTGCAATTAAGTCGCGACTTACCCAGCATTGTTTATCAATTATCGCAGCTTACCGATGGCTATCTCATCGATAAAAATCAGCAACTTAAAAACATTCTCGACAGCACCAGCAAGGAGCTAAACCAGTGGCGTGACCGACTGAACGCCCTGCCGTTAATCGGCGTGTACGAGCAGCAGGAAGCCGATGAATTTGCCCTCGGAGCAAGTGAGCCAGAGCAGATTGAAGTGGGTGAAAATGATCGCAGCGAGCTGTTAAGCCTTGCCAACCGCTACAATAAAGAAGTCGCCAACACCCACCAACTGCTGCAAGCCAATCAGGAGATGCAAGAGCAGTTAATTCAAGCCATTAGCAGAGTTGAACAGCAGCTTATCGCCCTCGGTGAGGCGCAGGCCGCCAAAAACCAACAGCTCAAATATGAGTTACAAGTCATTCTTTATGCGATGGTTTCGATTATGGCGCTGTTTGCTATCGGCTATTTAATCCTGCAACAGAACCGTGTGGTTAAGCCCCTTAAACGCCTCAATCAAGCCTTTATGCAATTAAGCGAGTCCAACAGTCGTGAACGCTTGGACATCAATCGCCGCTGCGAAACTGGCCAAATTGCAGGCCATTTCAACCAATTACTGCACAGATTCGAGCAAGAAGACGAACTCCAGCGCCAACAAATGACTAAGGTTTCTCAATCCTTGAGTCAGTTGGTCGCGCGCATTACTCAACTGTCGCAACACACCGAACACACTCAGACCATAGTTGCCGACACTCAGTCGCAAACCGAGCATATCCGCAGCCTCGCCAATGAGGTCAGCCACACTTCGGCCCTTGTTGAACAAAGCGCGGCCGAAACCATGCGTCAAATGCAGTCGAGCCAAACCGAGGCCGAAGCCGTGCTGAGTGCCACAGAACAAACCCAAACAGCGGTTGGCCTTTGCCGTGCCTCGCTCGAAAGCCTGAATAACTCAGTGGCAGATGTTGCTAAAATCATTGATGTAATTGGTAATATTGCCGAGCAAACCAATTTGTTGGCGCTTAATGCCGCCATCGAAGCCGCCCGTGCGGGCGAACAAGGTCGCGGTTTTGCCGTGGTCGCCGATGAGGTGCGAAGCCTGAGTCAACGTACTCAAGTGTCGTTAAATGAAATCGTGAAAATCCTGCATCAACTCACCCAATCGAACCTCGCACTCGGCGAGAGTGTCGATGGCATTGCCCAAGCGACCGATAGCCAAAAACAGCGGGCACAGAGCCTATGGCATGTGGCGCAAACCGTGCAAAATCAAGCGAGTGAAATGGCCAATACCGCCAAGCAGGGTTCGCTTAACGCCAAAGAACAAGTCGATTACCTCGATGAATTTGTCCGCAGCATGGATAACCTAAAAGACCAAGCGCAAACCAGTTCGCAGCAGAGTGAAGTGATTGCCCAAGAAGTGCAGCAAAGCGTGGAAGATATTGAGACCAGTCTAGGCATAGCCGACACCAACACAGTATCTGCCCGAGCGGCTTAAGCTTGATGCCAATAAAAATGGGAGCCTAGGCTCCCATTTTGTTTTTTAATCAGGCATTAAAGTGCCGCTAATACCACTTCTGCTTTACTCACTTCAAATGACTTAGGTGCTTCTACATTCAGTAAAGTCACCACGCCGTTCTCGATGATCATCGCATAACGTTGTGAACGTACACCGCCAAAGCCAGCGGTATCCATTTCGAGTCCAAGCGCCTTAGTAAAGCTGGCATCGCCATCGGCAAGCATCAGTAGCTCAGACGCATTTTGCGCTTCGCCCCACGCTTTCATCACAAAGGCATCGTTTACCGATACACAGGCAATTAAATCAACGCCTTTGGCTTTAAACTGATCGGCCAGCACCACATAACCTGGCAAATGCGCTTCAGAACAGGTTGGTGTGAAAGCACCAGGCACTGCAAATAACACCACTTTTTTACCCGCAAACAGCTCGGTAACTTGATGATTTACCATGCCATCTTTAGTCAGTTGGCTTAGCGTC
GCCGCTGGTAATGTTTGACCTTGAGCAATCATGTCTCTCTTCCTTTAGTTAACGTAACTTGCCCATACTAGCCCGAATCGGAAATGATAAACACAGAGAGAAAAAGAGGATTTGTAGCAGAGGAGGGCAAATTGCCGATAAATCGCAAGGTTGCTTTTGTTACCCGCATTGCGAAATATAAAAGCAGCGTATCGATGGATACGCTGCTTTTAGTCTTAGCCAAGCTGTTTACTCTAAATAGGCGCTGACTTAGTAAAACCCTGATGCCTTACAAAAGAGTTTATTTCAGGGTATTAAACCCATTTTTCACGCTTGCGTAGTGCAGCGAACAACCCAAGGCCAAGCAAGGCAAAGATTGACAATGCACCGCCGCTTTCTTCCACCACAATCACTTCGTCAACACGGTCACGGTCGCTACTTTCCAGCGGCAAGGCATATAAGCCATCGGTGTCATCGGCGCTAATGGTGGCGACAATATCGCTATAACCCACTTCATAAACATCGATTAATACATCGTAGTGATCCGTTGGATAGCCAGTGTAGAGGGTCGTCAGCACTTCATAATCGTCCTGCGTTGAGTCACCGTAAATGGTAAACACGTCCGTGGTGTAGTAATGCACCCAAGGGCCACCGTTACGGCTGAGGTATAACTCGGCAAACAGGTCGGCGCGCTCATTAAGATATGAGCCAAACACATCTACATCGAAGGTCACGCTGAAGGTTTGATAAAACCCATCGTAATCAAAGTCTTCAAACAAACGGCTGCTGGCATCAAAAATCGCAAAGCTGTGATACACGGGGGCGCGATAGGGATCTTCACTGGTCGCACTCGAGCTTGGGATAGGCTGACTCGTATGCTTAGCAATAACTTGTTCGCGGGTCATGGGGGTCGCGCCCATCAATTGCACGCGATGAGCACTCGGACTCTTTGGCGACAACGACGGCGCTAAAGATTTAACTGCGGCCGCAGCGGCTTCACCCACACTTTGAGCGGGCGCCGACTGCTGCAATAGGCTTAGGGCTTGTTTTTCTTGCTCTGTGGCCTGCTCGGCATCGGCTTTTCTCGCCATCCCAATACTGGCCGCGCTAAAGGGGGCCAAGTTTGACTCGTCGAAGGACTCTGCGCTCACCATGGCTGGCGCCAGTAATGCACCTGCCAATGCGATTGCCTTGATAAAGGTAGAGTGAGTTGTGCCTACGATACCTGATTGATTGAGTGTGCTCATCTTGAATACCCCATTGGATAATCCGTTCAAATTGGTGACATTAGAGGGCACTCAAGGTGAACATAAGCTGAACATTAAAAACCGTGATTTTGGGGATAAAAAGCCGTTCAGCCTCAATTCATCTGTTTTGGTTAACACTGCTAATATGTCACAGGATATGACTCCTGTTCTGTAAAGAGGTTCGATGGGGATTAACTCGATAAAATCCCACGAATCGCAGTGAATTTTAAAGGAATTGTGTTTACCAATTGGCTGATCTCTATGTGTTTGTTATGGTGATAAGGTATTCTTAGTCGACTCGATAAGCTATCCACCTAGCTACTCACTTAAGATCGCCGAACATGCGCGGGTAACAGATATGAAACTTGGTTTGACACTGACCCTAGTCGCTTGCTTATGCACCTCCTTTGGCAGCATGGCGGGCAATGATAAACACGATGATCGTAATCAATCGCGCGGCCAAGGTGTGAAGAATGAACAGCGCCGCCTTGTGGTCAACAGTCCAGATCAAGCCGTGGCGATGGCGCAACGTCAATATCGAGGAAAAGTGCTCAGCGTGCAGTCGAGTGGCTCAGGCTATAGAGTCAAAATCCTCAATAACGATGGCCAAGTTTTTTCTGTTTCGGTGGATGCCGCCACTGGGCGTGTTTCGAGGAATTAATATGCGACTCTTACTCGTTGAAGACGATTTAGCGCTTCAAGCCAACTTAAAACAGCACTTGCTCGATGCTCACTACAGCATAGATGTCGCCAGCGACGGTGAAGAAGGCCTGT
ATCAAGCCATTGAATATAATTATGATGCCGCGATTATCGATGTCGGTTTGCCTAAACTCGACGGCATAAGCCTTATTCGCCGCGTGCGCCAAAAAGAGCGCGCGTTCCCTATCCTGATTTTAACCGCGCGCGATAGCTGGCAGGATAAAGTCGAGGGCCTCGATGCGGGTGCCGACGACTATCTCACTAAGCCCTTTCATCCCGAGGAATTAGTGGCTCGGCTCAAAGCCTTGATACGCCGCTCGGCCGGTAAAGCCAGCCCAGTGATTACTAACGGCCCCTTTAGCTTAAATACCAGTAGCTTAGAAGTGCGCAAAGGGGATGAACTCGTCACTCTAAGCGGCTCTGAATACAAGCTATTTGAGATTTTTATGCTGCATCAGGGCGAAGTGAAATCGAAAACTGCGCTCACCGAACATATCTACGATCAGGATTTTGACCTCGACTCTAACGTTATCGAAGTCTTTATCCGCCGTTTACACAAAAAACTCGACCCAGATAACCAATACAATCTGATCGAAACCCTGCGCGGCCAAGGCTATCGTTTAAGAGTCATCTCCCCAGATGAGTAA
Protein sequences of DBSCAN-SWA_1 >LR134303|1915328:1927825|1926837_1927140_+|VEE61957.1|DBSCAN-SWA MKLGLTLTLVACLCTSFGSMAGNDKHDDRNQSRGQGVKNEQRRLVVNSPDQAVAMAQRQYRGKVLSVQSSGSGYRVKILNNDGQVFSVSVDAATGRVSRN >LR134303|1915328:1927825|1921345_1922662_+|VEE61953.1|DBSCAN-SWA MKSKFWWRFLPRSAFSQTVMLIGCLLLINQLVSYVTVAVYVLKPSYQQINQLIARQINLLFVDGIDIGREHLTIVDALNAKVRDDGMKVYNQQQAREAGIEQATYYGFWSSQMSEYLGGEAEVRVTHGTVLQIWIRPPQAPSVWIKVPLIGQNVSDLSPLTLYLMVIGALSVAGGWWFARQQNRPLRRLQKAAISVSRGEFPDPLPLKGSSELVEVTNAFNQMSHSMKQLEQDRALLMAGISHDLRTPLTRIRLASEMMVEEDQYLKDGIVNDIEDMDAIISQFIAYIRQDQEASRELGQINKLIQDIAQAEANRDGEIEVVLSDCPEALFQGLAIKRVLSNLVENAFRYGSGWVRISSQFDGKRIGFSVEDNGPGIDESQIPKLFQPFTQGDIARGSVGSGLGLAIIKRIIDRHQGQVTLSNRTEGGLKAQVWLPLE >LR134303|1915328:1927825|1915328_1920212_-|VEE61951.1|DBSCAN-SWA MTLSMSYKQASKWGFFVLLVGLFFSAVLAWQVSRINSQTISLALEASAQQISENVTNRIALYQYGLRGARGMILTAGEDHISRPMFQNYSLTRDVDEEFPGARGFGFIRRVSKENEAQFLARVKAVDWPDYHVRQLNPNEGEKYLIEYIEPLERNKAAVGLDIASEAHRKEAADRAMLSGEVQISAPITLVQATGKPLQSFLILLPVYRTGIVPPTPEERLAQGFGWSYAPLVTTEILTGLSLNQKMTKLMLSDVTDDKQPINFFETHANDLSPLSDYSYKLNKTIFGRHWQFEVMAYPGFIKSLHLNKPSLVLLSGSLFSVLLAALIAMWSLTLQRKQQVLAEQARRASMLEHSLDGIISYDLAGNITSWNQGAERLFGYTEQQALGLPSSELIIPPSIVLEEQALMADVLTGKTVLNRVSRHQRADGSNLSTSTTTLPIYDEHGDIVGLSQTIRDITAQQDAERHILNLNASLERQVTKRTHALQQALLENQALLDTINQQLHYSVTDLDGVILDVNEHFCLISGYAREELIGQTHSVLKSGEHDAAFWQRMWQQIKAGKSWHGEICNQGKDQQLKWFDTVIGPVFDERGKVERFVALQTDITERKLAQLDKNRIATLLTNVLDAASEMTIIAADPQGIITIFNRGAERMLGYSAEEIIGKTTPAPFHIADEIAARAAELSVEYGQKIQGFDTFVHKAREEGSETRTWTYVRKDGSQLPISLSVTAMRGNDGEILGYLGIGVDISQIVAQQEALLTASNHLSKAAEVAKLGIWTWNLLDNSLQWNDRMFAMYDQPESLKQQGLSYEHWRMRVHPDDVEEAEEKLNRAVEYDEPYHPIFRVLTTDGAVRYVQAGAQIERDRQGNVIRVIGINIDITAQRQIEANLRSAKLEADNANAAKSAFLANMSHEIRTPMNAVLGMLQLMQYTSLSVQQQGYVNKAQTAAKSLLGLLNDILDFSKIDAGKLKLDPHPCSIELLMRDLAVVLSASHNNTDVEVMFDLDAALPPWLLADQLRLQQILINLAGNALKFTPHGQVIVGLECIRHEADTVTVQFSIVDSGIGISEEQIERIFTGFEQAESSTSRRFGGTGLGLAISKRLVELMGGQLQVTSKVGVGSRFWFDLTFQVMEVEARARADLSQYRILVVDDNQITTEILSKILSDYGCLVETASGGYQAIEKVKQANAAGQQFDVVLMDWRMPDIDGLQTAEMLKNAGTGSYTPLVVMLTA
YGHEVIAESQHINNVPFVNFLTKPVTSQVLVEAVLNAIEGKTMDSNPKPRSQRLLAGLTLLVVEDNQLNREVIEELLSYEGATVVLAGGGVEGVAQVLESGDLYDAVIMDVQMPDIDGLEATRRIRADGRFAQLPILAMTANASQTDKQECLEAGMNAHVGKPIDMQLLLPNILRLVGRDVTSLEQHESAHLDSQHNLEGETLLDDIRLILRRFGGNQVFFEKMASSFAPEMTKQLSLFKQSTKTFDYATTAAISHAIKGIASNFGARRLAVHAAFLEKQFKQEGLELLEIKRWTDTLESLIHQSIEQLAGFLPKGQLKQPQSIEPVNGTAVDMQSMRPELDTLVALLQENNLEAVNLVDKLAKPLSQHPQWAELNGLVQSLAFTQALETLRAMMLEDA >LR134303|1915328:1927825|1927141_1927825_+|VEE61958.1|DBSCAN-SWA MRLLLVEDDLALQANLKQHLLDAHYSIDVASDGEEGLYQAIEYNYDAAIIDVGLPKLDGISLIRRVRQKERAFPILILTARDSWQDKVEGLDAGADDYLTKPFHPEELVARLKALIRRSAGKASPVITNGPFSLNTSSLEVRKGDELVTLSGSEYKLFEIFMLHQGEVKSKTALTEHIYDQDFDLDSNVIEVFIRRLHKKLDPDNQYNLIETLRGQGYRLRVISPDE >LR134303|1915328:1927825|1925546_1926479_-|VEE61956.1|DBSCAN-SWA MSTLNQSGIVGTTHSTFIKAIALAGALLAPAMVSAESFDESNLAPFSAASIGMARKADAEQATEQEKQALSLLQQSAPAQSVGEAAAAAVKSLAPSLSPKSPSAHRVQLMGATPMTREQVIAKHTSQPIPSSSATSEDPYRAPVYHSFAIFDASSRLFEDFDYDGFYQTFSVTFDVDVFGSYLNERADLFAELYLSRNGGPWVHYYTTDVFTIYGDSTQDDYEVLTTLYTGYPTDHYDVLIDVYEVGYSDIVATISADDTDGLYALPLESSDRDRVDEVIVVEESGGALSIFALLGLGLFAALRKREKWV >LR134303|1915328:1927825|1924811_1925285_-|VEE61955.1|DBSCAN-SWA MIAQGQTLPAATLSQLTKDGMVNHQVTELFAGKKVVLFAVPGAFTPTCSEAHLPGYVVLADQFKAKGVDLIACVSVNDAFVMKAWGEAQNASELLMLADGDASFTKALGLEMDTAGFGGVRSQRYAMIIENGVVTLLNVEAPKSFEVSKAEVVLAAL >LR134303|1915328:1927825|1920580_1921306_+|VEE61952.1|DBSCAN-SWA MGQETSKILVVDDDMRLRALLERYLMEQGYQVRSAANAEQMDRLLERENFHLIVLDLMLPGEDGLSICRRLRQQGSTIPIVMLTAKGDEVDRIIGLELGADDYLPKPFNPRELLARIKAVMRRQIQDVPGAPAQQEAEISFGEFSLDLATREMYHGNEAIALTSGEFAVLKVLVTHPREPLSRDKLMNLARGRDYSALERSIDVQVSRLRRLIEKDPANPRYIQTVWGLGYVFVPDGAARR >LR134303|1915328:1927825|1922778_1924755_+|VEE61954.1|DBSCAN-SWA 
MKISTLSLSASALLLLLAGLLAAVVLWSSDQRQQIEQQTQVLQGLQQDFLVGVRRDLDGYLASGNATQLEEAKAKLSKIKTALSELNLAAAGSADEQLQTGLSQFIQDLDTKYRAAGKLAGNPRQLLAHAESEMLDYNRRLGSYADKGLAVNATVAEQYLQLSRDLPSIVYQLSQLTDGYLIDKNQQLKNILDSTSKELNQWRDRLNALPLIGVYEQQEADEFALGASEPEQIEVGENDRSELLSLANRYNKEVANTHQLLQANQEMQEQLIQAISRVEQQLIALGEAQAAKNQQLKYELQVILYAMVSIMALFAIGYLILQQNRVVKPLKRLNQAFMQLSESNSRERLDINRRCETGQIAGHFNQLLHRFEQEDELQRQQMTKVSQSLSQLVARITQLSQHTEHTQTIVADTQSQTEHIRSLANEVSHTSALVEQSAAETMRQMQSSQTEAEAVLSATEQTQTAVGLCRASLESLNNSVADVAKIIDVIGNIAEQTNLLALNAAIEAARAGEQGRGFAVVADEVRSLSQRTQVSLNEIVKILHQLTQSNLALGESVDGIAQATDSQKQRAQSLWHVAQTVQNQASEMANTAKQGSLNAKEQVDYLDEFVRSMDNLKDQAQTSSQQSEVIAQEVQQSVEDIETSLGIADTNTVSARAA |
8 | Bacillus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
3469263 : 3476707
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >LR134303|3469263:3476707|DBSCAN-SWA CTTACTTAGTCATACGCTTGTATTTTAAGCGGTGTGGTTCAACCACATCGGCACCATAGGTGTTCTTCAACCACGCCGAGTATTCGGTGTAGTTACCTTCGTAGAAGTTCACTTGGCCTTCGTCGCGGTAGTCCAGAATATGGGTCGCGATACGGTCTAAGAACCAACGGTCGTGCGAGATCACCATAGCGCAACCTGGGAATTCGAGAATCGCTTCTTCCAGTGCGCGCAGTGTTTCAACGTCTAAGTCGTTGGTAGGTTCGTCGAGGAGCAGTACGTTACCGCCAGCCTGCAGCAGTTTAGCTAAGTGAACACGGTTGCGCTCACCGCCCGACAGCGTGCCAATCACTTTTTGCTGATCAGCACCACGGAAGTTGAAGCGGCCCACATAGGCGCGGCTTGGGATTTCCATGTTGTTGATACGCATGATGTCTTGACCACCGGAAATTTCTTGCCAAATGGTGTTCTTGTCATTCATTGAATCGCGGAACTGTTCAACCGACGCCAGTTGCACGGTTTCACCCAGTTCGATGGTGCCGCTATCAGGCTGCTCGCTGCCAGACAGCATGCGGAACAGGGTCGATTTACCTGCACCGTTGGCACCGATAATACCGACGATCGCGCCCTTAGGAATGGAGAACGACAGGTTATCGATCAGAACCCGGTCACCGTAGGATTTGGTTAAGTTGTTAACCTCAATCACTTTGTCACCTAAACGCGGTCCGGGCGGAATAAACAGCTCGTTGGTTTCGTTACGTTTTTGGTAATCAGTGGTGTTCAGCTCTTCGAAGCGCGCCATACGGGCTTTGCCCTTAGACTGACGGCCTTTCGCGCCTTGGCGAACCCATTCTAATTCTTTCGCAATGGTCTTTTGACGCGCGCTTTCTGCGGCAGATTCCTGCTTCAAGCGAGCATCTTTTTGCTCAAGCCATGAAGAGTAGTTGCCTTCCCATGGGATACCCTCACCACGGTCAAGTTCTAAAATCCAGCCAGCAGCATTGTCGAGGAAGTAACGGTCGTGGGTAATCGCCACAACGGTGCCCGAGTATTCCTGCAGGAAGTGCTCGAGCCAAGCGACTGATTCGGCGTCCAAGTGGTTGGTGGGTTCGTCGAGTAGCAGCATCTCTGGTTTTTCCAGCAGCAGACGACAAATTGCCACACGGCGGCGCTCACCACCCGATAAGACTTCAATCTTTTCATCCCAATCTGGCAGACGCAGGGCATTCGCCGCACGCTCAAGGATGTTGTCCAAATTGTGGGCGTCCTGCGCCTGAATGATGGCTTCGAGTTCGCCTTGCTCTTTGGCAAGCGCGTCGAAGTCCGCATCGGGATCGGCGTAAGCCGCATAGACTTCGTCTAAGCGTTTGAGGGCATTTTTGGCTTCAGAAACCGCTTCTTCAATCGCTTCACGCACGGTTTGCGTCGGATCGAGTTTCGGTTCCTGTGGCAGATAACCAATCTTCAGACCTGGCATTGGGCGCGCTTCACCTTCAATCTCGGTATCGATACCTGCCATGATGCGCAGTAGGGTGGATTTACCTGAACCGTTAAGACCTAACACACCGATCTTAGCGCCGGGGAAAAAGCTTAAGGAAATGTCTTTAAGGATCTGCTTCTTAGGAGGAACAACCTTGCCCACCCGCAGCATGCTGTAAACAAACTGAGCCATTTTTCTGCATCTATAAGTGACTAATGATGCGGCTAATTCTACTCGATTGCGCGCCGAACTCAACTCGGCATTGAGGCGAGTTCAAGGTAAAACTGTAGGATTTTCATGTTTCAGAGACTGGGATTGTCGGCTCGTTACGAGTAGAATACCGCGCAACCCTACCTACAAGAATTGGCAGCTGGAGTAAGCAATGCTGAAAAAAGATATGAATATCGCAGATTATGATCCGGAACTGTTCAACGCAATTCAGAACGAAA
CTCTGCGTCAAGAAGAGCATATTGAGCTGATTGCTTCTGAAAACTACACCAGTCCACGCGTGATGCAAGCGCAAGGTTCACAATTAACCAACAAGTACGCCGAAGGTTATCCTGGCAAGCGTTACTACGGTGGTTGTGAGTATGTGGACGTAGTTGAAACCTTAGCGATTGAGCGTGCAAAACAACTGTTTGGTGCGACTTACGCAAACGTGCAACCTCACTCAGGTTCTCAAGCAAACAGCGCCGTTTACATGGCATTGTTAAAACCAGGCGATACCGTTTTAGGTATGAACCTCGCTCACGGTGGTCACTTGACCCACGGTTCGCCAGTAAACTTCTCCGGTAGACTGTACAACATCATTCCTTACGGCATCGATGAATCAGGCAAAATCGACTATGACGAAATGGAACGTCTGGCGGTTGAACATAAGCCTAAGATGATGATCGGTGGTTTCTCTGCTTACTCAGGCATCGTTGACTGGGCAAGAATGCGCGAAATCGCAGACAAAATCGGTGCTTACCTGTTTGTCGACATGGCGCACGTTGCGGGTCTTATCGCCGCTGGTGTGTATCCAAACCCAGTGCCACATGCGCACGTTGTGACCTCTACTACTCACAAGACCTTAGCCGGTCCTCGTGGCGGTATCATTCTGTCTGCTGCCGATGATGAAGAGCTATACAAAAAGCTGAACTCTGCGGTATTCCCAGGCGGTCAAGGCGGTCCTTTGATGCACGTTATCGCGGGTAAAGCGGTAGCCTTCAAAGAAGCTTTAGAGCCAGAGTTCAAAGCTTACCAACAACAAGTGGTTAAGAACGCTAAAGCCATGGTTGAAGTGTTCTTAGAGCGCGGTTACAAGATCGTTTCTGGCGGTACTGACAACCACTTAATGCTGGTGGACTTAATCGGTCGCGACCTGACGGGTAAAGAAGCCGATGCCGCTTTAGGTAGCGCGAACATCACAGTAAACAAAAACTCTGTGCCAAATGACCCACGTTCTCCATTCGTGACCTCTGGTGTGCGTATCGGTACGCCTGCGATCACTCGCCGCGGCTTTAAAGAAGCTGAAGCGAAAGAGTTAACCGGTTGGATCTGTGACATCCTCGACGATGCCCACAACCCAGTGGTTATCGAGCGCGTAAAAGGCCAAGTATTGGCACTGTGCGCCCGTTTCCCTGTTTACGGTTAATTCGTTAATTTATCTGAATATGGCGGCTGAGTTCACAACTGCTTAGGGTGTCACTCTTAGGTGGTTAACCTCAGCCGATAAATAGGATAAAATCCCATGGCCGCATTAAGCGGCCATTTTTATTGCTAAAATCTTAGACTTAAACCCGCTCGGTTTAAGCAAGCTACAGCGAAAGCCATAGGCTTTATTCCATTCACGGCAGGAGGCTCAATGCATTGTCCATTTTGCAGCGCGACAGATACTAAAGTGATCGATTCCCGATTAGTGGCGGAAGGCCATCAAGTGCGTCGTCGCCGAGAATGCACCGAATGCCACGAAAGATTTACTACCTTCGAAGGGGCTGAATTAGTCATGCCACGGGTGATTAAACGCGATGGCACGCGCCAACCCTTCGATGAAGAAAAGCTGCAAGCAGGCATGTTACGCGCGGTCGAAAAGCGCCCCGTGTCTATGGATGAAATCGAGCAGGCCTTAAGTAAAATCAAGTCAACGCTGCGGGCTACTGGTGAGCGCGAAGTGCCATCCGAGATGATAGGTAACTTGATGATGGAACAGTTAATGAGCCTAGATAAGGTTGCCTATATTCGTTTTGCCTCGGTTTATCGCGCCTTTGAAGACGTCTCCGAATTTGGTGAGGCGATTGCGAAACTGCAAAAGTAACGTGGGTTATCTTAGGTTAGTGCTTAGGTTAGCGCCTTGTTCGGGCACGCTTCCTCAAGATTTCATAGGGTTTATATGAATTGGTCAGAACTCGATAACCAGATGATGAGCCGAGCCATACAACTGGCTCGCAAAGGTTTTTATACCACTCGC
CCCAATCCCAGTGTGGGCTGCGTTATCGTAAAAGATAATCAGATTGTCGGTGAAGGTTATCATCAAAAAGCCGGCGAGCCCCACGCCGAGGTGCATGCACTGCGCATGGCTGGCGAGCTTGCCCGTGGCGCGACCGCCTATGTCACCTTAGAACCTTGCAGCCATTATGGCCGCACACCGCCGTGTGCCTTGGCGCTGATCAATATTGGTGTAAAACGTGTAGTGGTCGCGGTTGAAGATCCCAATCCGCAGGTCGGTGGTCGCGGTATTCAAATGCTGCGCGATGCGGGCATTGAAGTGGATGTCGGCTTACATCGCGACGAAGCTTATGCTTTAAATCTTGGTTTTATGAAGCGCATGGAATCGGGCTTACCTTGGGTGACGGTAAAGCTTGCCGCGAGTCTCGATGGTAAAACCGCATTATCCAACGGTGTCTCTAAGTGGATTACAGGCCCCGAAGCTCGCCGCGATGTGCAGCGCTTACGTTTGCGCGCCTGTGCACTGGTCACTGGGATTGAAACCGTACTGGCCGATGCCCCTTCGCTTAATGTGCGCTACTCAGAGCTTGGCAGTCTTAACTTGCAATTGAGTGAAGCGCAAATTTTACAACCGCTGCGGGTGATTTTAGATAGTCGTTGCCGCATGCCGATTACGGCAGCCTTGCTTGCGATTGAATCGCCGATTTTATTAGTCTCAACAGAGCCTTACTCGCCAGCCTTTATGGCGCAGTTGCCTGCCCATGTGACTTGTCTTCAATTACCGGCGATTGATGGTCGAATCTCGCTGCCTGCACTCTTAAGCTATTTAGGCAAAAGCTGTAATCAGGTGCTTATCGAAGCGGGCGCGACCTTAGCCGGCGCCTTTATCGGTGACGGATTAGCCGATGAGTTAGTGCTGTATCAAGCGATGAAAATCCTTGGGGCACAAGGACGTAATCTACTCGAATTACCCGATTATCAAATGATGGCCGATATTCCGACCCTCAAACTGGTCGATGAGCGTAAAGTGGGCGCGGATATGCGTTTCACCTTGCGGCTCACGTCCAATCCATCTTTAGCTAATAAGTGAGTTAACCATGTTTACTGGGATTATTGAGGCCGTAGGCACGCTGCGAAAGCTTGAACGTAAAGGCGATGATATTCGTTTGACGGTCGCCAGTGGCAAACTGGATTTAAGCGATGTGCGTTTAGGCGACAGTATCGCCACCAATGGTGTGTGTTTGACTGTGGTTCAGCAATTAGCCGACGGCTATGTGGCGGATGTGTCGGCTGAAACTGTCAGTCTCACAGGCTTTGCTAACTATAAAGTGGGCACTAAGGTTAATCTTGAAAAAGCCGTTACCCCGACAACTCGCCTCGGCGGGCATATGGTCAGCGGCCATGTGGATGGCATTGCCACCGTAGAGCAGCGTTTAGCCCGGGGCCAAGCAATCGAGTTTTGGTTAGCGGCGCCAACTGAGCTGGCGCGCTATATCGCCCATAAAGGTTCTATCACCATCGATGGTGTGAGCCTGACGGTAAATGAAGTCGATGGACACCGTTTTCGTTTAACCATAGTGCCTCATACTGCGGGTGAAACCACACTGGTGGATTTAAAAGCTGGCGATAAGGTGAATATTGAAGTGGATTTAATCGCCCGCTATTTAGAGCGTTTAATGCGCTTTGATACTAAAGAAACCCAAGGCGGTGGGGTGACCATGGAAATGTTAGCCCGTGCTGGCTTTGTGCGTTAGGGCACTGGCACTTAGCTCACTTAGGTATAGAATTTCATAACAACAGTAAAATCATAAAGGTCCTACAATGGCGCTGCACAGTATAGAAGAGATCATCGAGGATATTCGTCAAGGCAAAATGGTTATTTTGATGGATGACGAAGACAGAGAAAACGAAGGCGACCTGATTATGGCGGCCGAGCTGGTGACGCCTGAAGCGATTAACTTTATGGCGAAATACGGCCGTGGACTTATCTGCCAAACGATGACTAAGGCCCGTTGTCAGCAG
CTAAACCTGCCCTTGATGGTGACTAATAACAATGCGCAGTTCTCGACTAACTTTACGGTATCGATTGAAGCGGCAGAAGGTGTAACCACCGGGATTTCGGCCCACGATCGCGCTGTAACGGTAAAAGCTGCCGTGGCTAAAGATGCCAAGGCATCGGATTTAGTACAACCTGGTCATATCTTCCCATTAATGGCGCAGGACGGCGGCGTGTTAACCCGCGCAGGTCACACGGAAGCGGGTTGTGATTTAGCCCGTTTAGCAGGACTTGAGCCATCGGGGGTGATTGTTGAGATCCTCAATGAAGACGGCACTATGGCTCGTCGCCCCGATTTAGAGATTTTCTCTGAATTACACGGGATTAAAATCGGCACTATTGCGGCGCTTATCGAATACCGCAACACCAAAGAAACCACTGTCGTGCGCGAAGCCAAATGTAAACTGCCAACCCGCTTTGGCGAGTTCGACATGGTGACCTTTAGGGATACTATCGACAATCAACTGCACTTTGCCTTAGTGAAGGGCGAGGTGAAACCGGATTGTTTAGTGCGCGTACACTTACAAAATACCTTCAACGATTTACTCCACTCAGAGCGCGATCAGCAGCGTAGCTGGCCGCTTGAAAAAGCCATGGAGCGCATTTCTGCCGAAGGTGGAGTGTTGGTGCTACTGGGCAATCAAGAGCATACCTGTGAAATTCTCTCTAAAGTCAAAGCCTTTGAAGCAGAAGATCAAGGTCAAGCGCCAGCCTCTGCAAAATGGCAAGGCACTTCACGCCGCGTGGGTGTAGGTTCGCAAATCCTTGCCAGCCTTGGGGTAACCAAAATGCGTCTCTTGAGTTCGCCTAAGCGTTATCACTCGCTCTCTGGCTTTGGGCTTGAAGTGACTGAATACGTAGCGGAGTAACAGGCTGAGCTTGTTAGACGTTGCACTAAGGTAAAATTAATCATTTGTATAATTTGCATAGGGCTGTGGGCGATTGACTGTGGTATCATGTCGCCACTTTTCGCCCAAGCCGGGTGCTTTAGCTAAATTAGGTAAGAAAATGAACGTAGTTCAAGGTAATATCGAAGCGAAGAATGCCAAAGTTGCGATTGTAATTTCGCGTTTTAACAGCTTTTTAGTTGAGAGCCTGCTTGAAGGTGCACTTGACACGCTGAAACGTTTTGGCCAAGTCAGTGATGAAAACATCACTGTCGTCCGTGTACCTGGTGCGGTTGAGTTACCGCTGGCTGCACGTCGTGTTGCCGCAAGTGGTAAGTTTGACGGTATCATCGCACTTGGTGCTGTGATCCGTGGTGGTACCCCTCATTTTGATTTTGTTGCAGGTGAATGTAACAAAGGTCTAGCTCAAATCGCATTAGAGTTCGATCTGCCCGTTGCCTTCGGTGTATTGACGACAGATACCATTGAACAAGCCATTGAGCGTTCAGGTACTAAAGCAGGTAACAAGGGCGGCGAAGCTGCACTAAGCTTGCTTGAAATGGTCAATGTTCTGCAACAGCTAGAACAACAGTTGTAA
Protein sequences of DBSCAN-SWA_2 >LR134303|3469263:3476707|3476230_3476707_+|VEE63274.1|DBSCAN-SWA MNVVQGNIEAKNAKVAIVISRFNSFLVESLLEGALDTLKRFGQVSDENITVVRVPGAVELPLAARRVAASGKFDGIIALGAVIRGGTPHFDFVAGECNKGLAQIALEFDLPVAFGVLTTDTIEQAIERSGTKAGNKGGEAALSLLEMVNVLQQLEQQL >LR134303|3469263:3476707|3474263_3474920_+|VEE63272.1|DBSCAN-SWA MFTGIIEAVGTLRKLERKGDDIRLTVASGKLDLSDVRLGDSIATNGVCLTVVQQLADGYVADVSAETVSLTGFANYKVGTKVNLEKAVTPTTRLGGHMVSGHVDGIATVEQRLARGQAIEFWLAAPTELARYIAHKGSITIDGVSLTVNEVDGHRFRLTIVPHTAGETTLVDLKAGDKVNIEVDLIARYLERLMRFDTKETQGGGVTMEMLARAGFVR >LR134303|3469263:3476707|3469263_3470931_-|VEE63268.1|DBSCAN-SWA MAQFVYSMLRVGKVVPPKKQILKDISLSFFPGAKIGVLGLNGSGKSTLLRIMAGIDTEIEGEARPMPGLKIGYLPQEPKLDPTQTVREAIEEAVSEAKNALKRLDEVYAAYADPDADFDALAKEQGELEAIIQAQDAHNLDNILERAANALRLPDWDEKIEVLSGGERRRVAICRLLLEKPEMLLLDEPTNHLDAESVAWLEHFLQEYSGTVVAITHDRYFLDNAAGWILELDRGEGIPWEGNYSSWLEQKDARLKQESAAESARQKTIAKELEWVRQGAKGRQSKGKARMARFEELNTTDYQKRNETNELFIPPGPRLGDKVIEVNNLTKSYGDRVLIDNLSFSIPKGAIVGIIGANGAGKSTLFRMLSGSEQPDSGTIELGETVQLASVEQFRDSMNDKNTIWQEISGGQDIMRINNMEIPSRAYVGRFNFRGADQQKVIGTLSGGERNRVHLAKLLQAGGNVLLLDEPTNDLDVETLRALEEAILEFPGCAMVISHDRWFLDRIATHILDYRDEGQVNFYEGNYTEYSAWLKNTYGADVVEPHRLKYKRMTK >LR134303|3469263:3476707|3471121_3472375_+|VEE63269.1|DBSCAN-SWA MLKKDMNIADYDPELFNAIQNETLRQEEHIELIASENYTSPRVMQAQGSQLTNKYAEGYPGKRYYGGCEYVDVVETLAIERAKQLFGATYANVQPHSGSQANSAVYMALLKPGDTVLGMNLAHGGHLTHGSPVNFSGRLYNIIPYGIDESGKIDYDEMERLAVEHKPKMMIGGFSAYSGIVDWARMREIADKIGAYLFVDMAHVAGLIAAGVYPNPVPHAHVVTSTTHKTLAGPRGGIILSAADDEELYKKLNSAVFPGGQGGPLMHVIAGKAVAFKEALEPEFKAYQQQVVKNAKAMVEVFLERGYKIVSGGTDNHLMLVDLIGRDLTGKEADAALGSANITVNKNSVPNDPRSPFVTSGVRIGTPAITRRGFKEAEAKELTGWICDILDDAHNPVVIERVKGQVLALCARFPVYG >LR134303|3469263:3476707|3474987_3476091_+|VEE63273.1|DBSCAN-SWA 
MALHSIEEIIEDIRQGKMVILMDDEDRENEGDLIMAAELVTPEAINFMAKYGRGLICQTMTKARCQQLNLPLMVTNNNAQFSTNFTVSIEAAEGVTTGISAHDRAVTVKAAVAKDAKASDLVQPGHIFPLMAQDGGVLTRAGHTEAGCDLARLAGLEPSGVIVEILNEDGTMARRPDLEIFSELHGIKIGTIAALIEYRNTKETTVVREAKCKLPTRFGEFDMVTFRDTIDNQLHFALVKGEVKPDCLVRVHLQNTFNDLLHSERDQQRSWPLEKAMERISAEGGVLVLLGNQEHTCEILSKVKAFEAEDQGQAPASAKWQGTSRRVGVGSQILASLGVTKMRLLSSPKRYHSLSGFGLEVTEYVAE >LR134303|3469263:3476707|3472585_3473035_+|VEE63270.1|DBSCAN-SWA MHCPFCSATDTKVIDSRLVAEGHQVRRRRECTECHERFTTFEGAELVMPRVIKRDGTRQPFDEEKLQAGMLRAVEKRPVSMDEIEQALSKIKSTLRATGEREVPSEMIGNLMMEQLMSLDKVAYIRFASVYRAFEDVSEFGEAIAKLQK >LR134303|3469263:3476707|3473110_3474256_+|VEE63271.1|DBSCAN-SWA MNWSELDNQMMSRAIQLARKGFYTTRPNPSVGCVIVKDNQIVGEGYHQKAGEPHAEVHALRMAGELARGATAYVTLEPCSHYGRTPPCALALINIGVKRVVVAVEDPNPQVGGRGIQMLRDAGIEVDVGLHRDEAYALNLGFMKRMESGLPWVTVKLAASLDGKTALSNGVSKWITGPEARRDVQRLRLRACALVTGIETVLADAPSLNVRYSELGSLNLQLSEAQILQPLRVILDSRCRMPITAALLAIESPILLVSTEPYSPAFMAQLPAHVTCLQLPAIDGRISLPALLSYLGKSCNQVLIEAGATLAGAFIGDGLADELVLYQAMKILGAQGRNLLELPDYQMMADIPTLKLVDERKVGADMRFTLRLTSNPSLANK |
7 | Staphylococcus_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_3 |
3498282 : 3510042
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >LR134303|3498282:3510042|DBSCAN-SWA TATGGCTAAGATTATTAACGTGATTGGTCGCGAGATTATGGATTCTCGTGGTAACCCAACAGTTGAAGCCGAAGTGCATTTAGAAGGTGGTTTTATCGGTATGGCGGCTGCGCCATCTGGTGCTTCTACTGGTAGCCGCGAAGCGCTGGAACTGCGTGATGGCGACAAGAGCCGTTACTTAGGTAAAGGTGTATTGACTGCTGTGGCTAACGTAAACGGCCCTATCCGTGCAGCGTTAATCGGTAAAGATGCGACTGCACAAGCAGAACTCGATCAAATCATGATCGACTTAGACGGCACTGAAAACAAAGACAAGTTAGGCGCTAACGCGATTCTGGCTGTGTCTTTAGCGGCGGCTAAAGCGGCTGCAGCATTCAAAGGCATGCCTCTATACGCTCACATCGCTGAATTAAACGGCACACCTGGCCAATACGCTATGCCAGTGCCTATGATGAACATCCTTAACGGTGGTGAGCACGCTGATAACAACGTTGATATCCAAGAGTTCATGGTTCAACCTGTTGGCGCGAAAAACTTCCGCGAAGCTTTACGTATGGGCGCTGAGATTTTCCACACCCTGAAAAAAGTACTGCACGGTAAAGGTTTAAGCACTTCTGTGGGTGACGAAGGTGGTTTCGCACCTAACCTGTCTTCTAACGCAGATGCATTAGCGGTAATCAAAGAAGCTGTTGAATTAGCGGGTTACAAGCTGGGTACCGACGTGACTCTGGCATTAGACTGCGCGGCTTCTGAGTTTTACAAAGACGGCAAATATGACCTGTCTGGCGAAGGCAAAGTATTCGATTCAAACGGTTTCTCTGACTTCCTGAAATCACTGACTGAGCAATATCCAATCGTCTCTATTGAAGACGGTCTGGACGAGTCAGATTGGGATGGTTGGGCATATCAAACTAAGATCATGGGCGACAAGATCCAATTAGTGGGCGACGATTTATTCGTCACTAACACTAAGATCTTAACCCGTGGTATCGAGAACGGCATCGCTAACTCAATCCTGATCAAGTTCAACCAAATCGGTTCATTAACTGAAACCTTAGCGGCTATCCGTATGGCAAAAGCAGCGGGTTACACTGCGGTGATTTCACACCGTAGCGGTGAAACTGAAGACGCTACTATCGCTGATTTAGCGGTAGGTACTGCGGCTGGCCAAATCAAGACCGGTTCACTGTGCCGTTCTGACCGTGTTGCTAAATACAACCAATTGCTGCGTATCGAAGAGCAATTAGGTGAAAAAGCACCATACCGCGGTTTGAAAGAAATCAAAGGTCAGGCGTAATTTAGTCGTCTGATGTAAAAAGGCCACCACTTGGTGGCCTTTTTTGTTGTACTATCTGCAGTATATTTTTTTAATTCAAGATACTGACGCTAATTCCTCCATGAAATTCTTTGTCATTGCACTCATAGTGCTTCTCGGTTTGCTGCAATATCGGCTGTGGTCGGGCGATAACAGCCTGCCCGAATACTTTGTTCTGCAAAAACAGATCGCGGCTCAGCAAGAAGGTAATGCAAAACTCAATGAGCGTAATCAGGTGCTTAAAGAGGAAATTATCGATCTTAAGAGTGGTACCGAAGCGATTGAAGAGCGGGCGCGTAACGAGCTAGGCATGGTGAAAGAAGGCGAGACCTTCTATCGCGTGGTGGGCGGTGACCGTTCAGTATCGAGTCCCTCGCAGTAACGGAGCGCCTATGGATACCGTATTAGAAGTCGCGCCGCAGGTCGAAACCCAAGTTGCAGTTCAAATTGATCCTTTCTCGCACCATGTGGTCGCGATTGTGCCCGCCGCGGGGATTGGCAGTCGCATGGGCGCGGGCAAACCCAAGCAATATTTAACCTTGCTGGGGCAAAGTATTCTGGCCCATACCTTAGACAAGTTATTGTCCCATCCGCAAATTAACCAAGTGATT
GTGGCACTGCATCCAGAGGATACCGAGTTTGCTGCGTTGCCGCAGGCCAAGCATCCCAAGCTGGTGACTGTCATTGGCGGCTCTGAGCGCGCCGACTCGGTATTGGCGGCACTCGATAAGGCGCCTGATAATGGTTGGGCTTTAGTACACGATGCGGCCAGACCCTGCCTGATGGCGGGGGACATAGATAAATTGCTGGCTTCGCGGGTGCACTTTCCCCAGGGGGCGATTTTAGCCATGCCTGTGCGGGATACCATGAAGCGCGCCAATAGCTTAGGTGAAATTAGCTCGACCGTTTGCCGCGATAATCTCTGGCATGCGTTGACGCCGCAGTTATTTCCAACCTCCTTGCTGCGATTACATTTAAAAGCCGCGCTTGCCGCGGGTGCTGTTGTAACCGATGAGGCCTCGGCGATGGAGTGGGCGGGCATTTCCCCTGGTTTGGTCGCCGGGCGGGCGGATAACATTAAAGTGACCCATCCCGATGATTTGGAGTTGGCAGAGCTCTTTTTACTGCGCGCGAATGCTTGAAGGCAAGTTGAGTGAATTGCGCAATTTATTAAATACGTTAGGAATTAAGTAATGAAAATCCGAATCGGCCATGGTTTTGATGTCCATAAATTTGGTGAAGCGCGCCCGTTAATTTTATGTGGGGTCGAAGTCCCCTACGAAACCGGGCTGGTGGCTCATTCCGATGGCGATGTGGTGCTGCATGCCATTTCCGATGCCATTTTAGGGGCGATGGCCCTTGGGGATATTGGTAAACATTTCCCCGATACTGATGCCGCCTATAAGGGCGCCGATAGCCGCGTCTTGCTGCGCCATTGCTATGCGTTAGCGCGGGCGAAGGGATTTGAGCTGGGTAATCTGGATGTCACTATCATTGCCCAAGCGCCTAAGATGGCGCCGCATATCGAGGATATGCGCCACGTGTTGGCGGCCGATCTTAATGCTGACGTTGCCGATATCAACGTTAAGGCAACCACTACCGAAAAACTCGGGTTTACCGGCCGTAAAGAAGGCATTGCGGTCGAAGCCGTCGTATTACTCAGTCGCCAATAGCCTGAAACTTTAAGAGATAAGAAAGACCCATGAGCGAACTACATTACCTGTACGGCAAACCGACGGGCACCGCAGATTTAAGAACCGTTAACAGCGACTTTATCGTGAAAGAGATTTTGCCTTTTAGCCCGTCGGGCGAGGGCGAGCATCATTTAGTCCACATTCGTAAAGATGGGCTGAATACGGTGCAAGTGGCTGAAATGCTAGCGAAGTTCGCTAAGGTTCATCCCAAAGAGGTGACCTATGCTGGACAAAAAGATAAAAATGCCATCACAGAACAGTGGTTTGGCATTCGTATCCCAGGTAAAGAAACCCCAACGTGGAGCGAGCTAAACAGCGAGCGCTTAACCATTTTATCCAGCAGTCGCCACAGCAAAAAACTACGTATTGGCGCGCTCTTGGGCAACCGTTTTATTCTTACCCTGCGCAATGTCACAAACGTAGAAGACATTATCAGCCGCATCGAAAAAGTCAGCCAGATTGGTGTGCCTAATTATTTTGGTGAGCAGCGTTTCGGTCACGATGGTAAAAACCTCGTGATGGGACGGCAAATGCTGGCGGGCAAAAAGGTGAAAGACCGTAATAAGCGCAGCATGTATCTGTCTGCGGTGCGCTCCAATCTGTTCAATACCGTCGTCTCCTATCGTTTGGCGAATCATGGCACTAAACCCTTAGCGGGGGATTGCGTGATGCTGGCGGGCAGTAAGAGCTTTTTCGTTACGCCAGAATGGGACTTAGTGGTATTAAAGCGCTTGATTGAGAAAGATATTCAGCTTTCTGCCCCACTTTGGGGTCGCGGAAAAATGCTGCCGCAGGGCGAAGCCGCCGAGGTTGAAACCCTCGCCATGGCAGAGCTGAGCGAAGATTGCTACGGCCTTGAGCACGCGGGGCTTGAGCAGGAGCGTCGTCCATTGCTGCTCGAACCTCAAGGTCTTAAA
TACGAGCAAACTGCGGATGGATTAGTGCTCGAATTTATCTTACCTGCGGGCAGCTTTGCGACATCGCTGTTAAGAGAGTTGGTTGATTATCAAGATGTGAAAGAGCTGCAATGGCAAGCGACAGTCTCGCCTGAGGCGAATACTGCAACAGAGAGTCAGACGGCAGAGCTCGATACACCAGAAACCATTGCGCCAGAGCCTGATGAGTCCGCTTCATGATCCGCATCTTAGTCAGTAATGATGATGGTGTGAATGCGCCAGGGATCAGAGCCTTAACCGAGGCGCTCGCCGAAATCGCCACTGTGATGACGGTTGCGCCCGATCGTAATTGTTCCGGCGCAAGTAACTCTTTAACCTTGACTAACCCATTAAGAATTAATAGGTTAGATAATGGTTATATTTCGGTTCACGGTACACCCACGGATTGCGTTCACTTAGCCATACGTGAGCTTTGTGATGGAGAGCCGGATATGGTGGTATCGGGCATCAATGCTGGCGCGAATATGGGGGATGACACTTTATATTCGGGCACAGTAGCGGCGGCGATGGAGGGGCGTTTTTTAGGTTTCCCCGCCGTTGCGATTTCGCTTAATGGTAAGGCATTAAAGCATTATCACTCTGCGGCTGTGTATGCGCGGCGGATCGTGCAGGGGCTGTTAGCGCATCCGATTGCGAGCGATCAGATCCTCAATATCAATGTGCCCGATTTGCCGCTGGATGAGATTAAAGGGATCCGGGTGACGCGCCTAGGTGCACGGCATAAGGCCGAAGGCATAGTGCGAACGCAGGATCCTGCGGGGAAAGAGATTTTTTGGCTTGGCCCACCGGGTGTTGAGCAAGATGCGAGTGAAGGAACGGATTTCCATGCGATTGCCCATGGTTATGTGTCGATCACTCCCTTAACCGTGGATTTGACCGCGTATAGACAATTATCGGTATTGCAAGATTGGGTAGATAAAATATGACTCGAGTTGCCTTAACATCGGCGGTGAATTTAGCAAAAAAGCTTCAGGAGGCGGGGATCCGCCATCCAGCCGTTCTTAAGGCAATATCCCATACCCCGCGCGAGTTGTTTCTTGATAATGCGCTGGCCCATAAAGCCTACGAAAACACCGCCTTGCCCATAGGCCAAGGACAAACCATTTCTCAGCCTTATATCGTGGCGCGTATGACAGAGTTACTGCTCCAACATCAGCCGCAAAAGGTGCTTGAGGTGGGAACGGGCTCAGGCTACCAAGCGGCGATCCTCGCGCAATTAGTGCCTGAACTGTGCACCATTGAGCGTATTAAAGGTTTACAGATCCAAGCGAGACAAAGATTAAAGCGACTCGATCTTCATAACGTGTCATTCAAATATGGCGATGGCTGGCAGGGCTGGCCGAATCGCAGCCCCTTTGATGGGATTATGGTCACGGCGGCAGCGGCTAAGGTTCCCGAAGCTTTATTATCCCAGCTAGCCGAAGGCGGCGTGCTGATCATCCCTGTGGGGGAAGAGACGCAGCAACTGATGCGCTTTACCCGCCGCTCTGACCGTTTTAGTTCTGAAGTGATAGAAACCGTCAAATTTGTTCCCTTGGTCAATGGCGAGCTCGCTTAAGTTATTCGCTGAAGTATTTAGCTTTAGGTCACTTAGTTAAACACTTGGCCTTTTATGATTAGGTAAAACAGTAAGATAGCCCTAAAGAGGAGTTTTTATTGTTGAATGCGGGTTTACTCTTAAACCTCTGTTTAGTGTTGTTGCTTGCGGGCTGTAGCTTTCAGGCGAGTCGCCCCGCACCTGTCGAAAGTCTCTCCCACAGTTATTCCAAACATAATAAAGGCCACATTAAATCTAATTCATATAAAGTTAAGAAAGGTGACACCCTTTATTCGATATCTTGGGCTGCGGGTAAAGATTTCGCCGAAATTGCCAAAATTAATCAATTAGATAAATCGTACACCATTTACCCTGGACAGATTTTATATTTAACGAATGACACAGGGAAAAATGGCAAAGGCTC
TACAACTTTAGGTGGAAGTAATTCAGCGTCTAAAGGGCAAAATAAAGCTAATTCTCTTGATAAACAATCGGCTAGTAACAATTCTTCCGCGAAAAACTTGTCAAGTGAACAGCAGAAAAAAACACTTGATCAGAAAGCGAAGCCTGCGTATTCTGCAACAAGCTCTCAACAAAGTGTTAACCCTTCGATCGTCGCCCCGACATCAACACTGCCAGACAGTGTCAGTCAGTGGCAATGGCCAGTAAGAGGTAAATTGATTGGGACATACTCTGCCAATGAGCAGGGAAATAAAGGAATTAAGATCGCAGGAAAACGCGGAGATATCATCAAAGCCGCTGCAGATGGGCGGGTGGTATACGCAGGTAGTGCTCTTAGGGGTTATGGTAATTTAGTGATTATTAAACATAGTGACGATTACCTTAGTGCCTATGCTCATGCAGATCAGATCTTAGTCGAAGAAAAGCAACATGTCCTTGCTGGACAGACAGTTGCAAAAATGGGCAGTACAGGTACCAATCAGGTAATGCTTCGTTTCGAGATCCGTTACCACGGTCAGTCTGTTAACCCACTTAACTATTTACCTAAGCAATGATTGTTTAGGGCCTTGTTGGCATGGAGGATAAAAAATTGCAGTAATTCAAGCAGTTTACATTAGGTTTGGGAGATTTGATCATGAGCCGCATAAATAGCACTGCCGCAGAAGAACTAGTAGATTTTTCCGTAGATACCGCAGAGTTTGATCTCGATAAAGAGGATATTGCCGCTGATTTAGTTCAAGAACTAGGACTCGAACAACAGGTTCAAGATGACCTGCAAAAAAATCTTGATGCCACCCAGCTCTATTTAGGTGAAATAGGGTTTTCCCCACTGCTTAGCGCAGAAGAAGAAGTTTACTTTTCCCGTAAAGCCTTAAAAGGCTGCGAAAAATCCCGTAATCGCATGATTGAGAGTAACCTGCGACTCGTGGTTAAAATTGCCCGTCGTTACAATAATCGTGGCCTTGCGCTGCTGGATTTAATCGAAGAGGGTAATCTTGGCTTGATCCGTGCTGTGGAAAAATTCGACCCAGAAAGAGGCTTCCGTTTCTCAACCTATGCGACTTGGTGGATCAGACAAACGATCGAACGTGCCATCATGAATCAAACTCGCACGATTCGCTTGCCAATCCATGTGGTAAAAGAGCTCAACGTGTATTTACGTACGGCTCGGGAATTAGCGCAAAAACTTGACCACGAACCTACGGCAGAAGAAATTGCCGAGAAACTGCAAGTGTCCAGTGTTGACGTCAGTCGTATGCTGAAGCTCAACGAGAAGATCACCTCTGTCGATATCCCCTTAGGTGGTGATAACGACAAGGCGTTACTCGATGTGCTTGCCGATGACGACAACGTAGGCCCTGACTACAAAGTACAAGATGAAGACATTTCAAACTCAGTGGTGAAATGGCTCAACGAACTCAATACCAAGCAAAGAGAAGTGTTAGCGCGCCGCTTTGGCTTGTTGGGTTATGAACCCTCAACCCTTGAAGACGTAGGCGCAGAAATTGGCCTCACCCGTGAACGTGTTCGCCAAATACAAGTAGAAGCCCTTAAACGCCTACGTGATTTGCTGGGCGCTCAAGGTCTCTCTGTAGAGGCTCTATTTAGAAACTAAGGTTTAGAGATTAAGGTTTAGAAACTTTGGCTTGGACACTTAGCCCTAAACCCGTTCATCAAGTCATAAAAAACGCCCAATATTCAGGGCGTTTTTTTATGTTTTCACGTTTATGTTATCGCGAGTTTTCACTCTATGTGTTAACTCAAGCGTTTTAGCTCGTAGAGTAAATCCAGCGCCTGTTTTGGGGTCAGGTTATCGGGATTGATGGCTTTTAATTTGCTGACAGCGGGATTCTCCACCGGCTCAGGCAGAGCGAGCAAGGTTTGAATCGGTGCCCTAGTGCCGTTAACATTAGCACCTTCTACTTGATGATCGCGGCTCTCAAGCTGATGTAACTTGTGTTT
TGCCGCCTTAATCACCCGAGCAGGCACCCCCGCAAGGGCCGCAACCTGCAAACCATAACTCTTACTGGCCGCACCTTCTTGCACCGCATGCATAAAGGCGATGGTGTCTTCGTGTTCAATCGCATCGAGGTGCACGTTATAGACGCCTGCCATAATTTCCGGTAATTGCGTTAACTCGAAATAATGGGTCGCAAACAGCGTCATTGCGCCGACTTGCTGTGCTAAATATTCAGCCGCAGACCAAGCTAATGACAGGCCATCGTAGGTTGAGGTGCCACGGCCAATTTCATCCATCAAGACTAAACTTTGCGCGGTCGCATTATGGAGAATATTTGCCGTTTCTGTCATTTCCACCATAAAGGTTGAGCGACCAGAGGCGAGATCGTCTGAGGCGCCAATCCGAGTAAAAATCCGATCGATAGGGCCAATGATGGCGCGATCAGCTGGCACAAAGCAGCCAATATGTGCCATTAGGGTGATAAGCGCCACTTGGCGCATGTAGGTCGATTTACCGCCCATGTTTGGACCTGTAACAATCAACATCCGGCGTTGATTGTGTAGGGTGACAGGGTTGGCGATAAAAGGCGTTTGACTCACACGTTCAACCACTGGGTGGCGACCCGCTTCGATTTTTACCCCGATTTCGCTGCTCAGTTCTGGGCAGGTATATCCTAAAGTTTCGGCACGTTCGGCAAAGTTACTCAGCACATCGAGTTCGGCTGCCGCTCTGGCGAAGGCTTGTAGTTCATGCAACTTGGGGAGGATTAAATCAAACAATTCGTCCCATAGTTGTTTTTCAAGGGCGAGCGCCTTACCTTGGCTCGACAGCACTTTTTCTTCGTATTCCTTAAGTTCGGGCGTGATGTAACGCTCCATATTCTTAAGGGTTTGACGACGTTGGTAATTTAATGGCACTTGCTGCGATTGCAGGCGACTGACCTCGATGTAGTAGCCATGCACTCGGTTGTAGCCAACCTTCAACGTGGCAATGCCTGTACGCTCTTTTTCCCGAGCCTCCAGCTGAACTAGATAATCAGTGGCTCCTTCACTTAAGCCTCGCCATTCATCTAACTCGGCATTGTAGCCTTCACGGATCACGCCGCCATCACGGATAAGCATCGGCGGATTATCGACAATGGCGCGCTCAAGCAGTTGTTGCTCTTCGGGAAACTCACCTAAGAGTTGGCCTAGTTTCACCGTATGGGGCGCACTTAACTGCGCGAGTGAATGCTGCAACTGAGGCAATAGGTTTAATGCTTGGCGTAGGCGGGCAAAATCCCTTGGACGAGCGGTACGCAGTGCCAGCCTTGCCATGATACGTTCAATATCGCCTAGGGCTTTAAGCTGTTCATGCAATGATTCATGGGCAGTGGTTTCAAGTAATTCATTCACGGCCGTTTGGCGAGCAAAAATCAGGGCATGGTCTCTTAGCGGCTGATGGATCCAGCGTTGCAACATCCGGCTGCCCATGGCTGTGGCGGTATTGTCAAGAACGGCTGCTAAGGTGTTATCTCGACCACCACTCAAATTTTGTGTCAGTTCGAGGTTACGGCGCGTTGCCGCATCCAAGACAATCGTATCGGTTTGATTGAAACGGGTAATGGCATTGATATGGGGCAGGGCGGTGCGTTGGGTGTCTTTCACATATTGCATTAAGCAGCCAGCCGCTTGTAGCGAAAGCCTTGCATCGGTAATCCCAAAGCCGTGCAAATCCTTAGTGCCAAATTGTGCCAGCAGCAGTTTAATGCTGGTATCGTAATCAAACTCCCACTCGGGGCGGCGACGCTTGCCTTTAAAATGCTGCAATAGCTCCATCGCGCCAAAATCTTCACTATAGAGAATTTCGACCGGATTGGTACGTTGCAGTTCGGCTTCAAGGGACTCTTTTGTCTCTAATTCGGCAATCACGAAGCGGCCGGAAGACACATCGAGGGTCGCGTAGCCAAAATCAACTTTACCTTGGTAAACCGCGGCTAACAGATTGTCTTGTCTCTCCTGCA
ATAGGGCTTCGTCGGTCAAAGTACCAGGGGTGACGATGCGGACCACTTTTCGCTCAACTGGGCCTTTTGACGTGGCAGGATCGCCAATCTGCTCACAGATCGCGACCGATTGACCAATTTGAACCAATTTGGCCAGATAGCCTTCAACGGCATGGTAAGGGATACCCGCCATCGGGATCGGGTCACCGCCACTTTTGCCGCGGGCCGTAAGGGAAATGCCCAATAATTCAGAGGCGCGTTTTGCGTCATCATAGAAGAGTTCATAGAAGTCACCCATACGATAAAACAGCAGCATGTCGTGATGCTCTGCTTTCATGGTCAAATATTGACGCATCATAGGGGTATGTTTTTCTAAATCATCGGTATCTATAGGATTCATTAAATCGTTTTCGTCCGTTATGCCGGCCCAAAGCGGGGAAATAAGTCGTACAGTAATGGGATAAAAAATCAGTGGATGTGCACTATTACTGTCTAAAATAAATTGACGCCAATCTTAGCAATAATTTTCTGAAAATTGACCCTTAAACCATGGATATTCACTGATATCCTGACAAATCTGCAGCACGAGTCGGTAAAAGCTTGTCTCAACGGGAAAATTTTCTGACTTAAAAATAAATCATTCACAGGTCTTGATACTGTATGATTGTACAGTATACTAGTCAGCAGATTATGACAAAGAGCCTCGCTTTTTGTATTACCCGCTTTATTAAATTGAATATCACTTTTGACAGTACTGATACTGTCTAGATGAGGGAACAGGAATGAAGGTCGATCCAAACAAAGAGAAAGCACTTGCTGCGGTATTGAGCCAAATTGAAAAGCAATTTGGTAAAGGCTCCATCATGAAGCTGGGCGAAGACCGCTCTATGGATGTTGAGACTATTTCTACTGGTTCACTTTCCCTTGACGTCGCCTTAGGTGCTGGCGGTTTACCAATGGGACGTATCGTTGAGATTTATGGTCCAGAATCATCAGGTAAAACAACGCTAACATTAGAAGTGATTGCTGCCGCTCAGCGCGAAGGTAAAACCTGTGCCTTTATCGACGCCGAGCACGCGCTAGACCCTATCTATGCTAAAAAGTTAGGCGTTGATATCGATAACTTACTGTGTTCACAACCCGATACCGGTGAACAAGCCCTTGAGATCTGTGATGCGTTAACCCGTTCTGGCGCGGTAGATGTGATCATCGTTGACTCGGTCGCGGCATTAACACCCAAAGCGGAAATTGAAGGTGAAATTGGTGACTCACACATGGGCTTAGCGGCGCGTATGATGAGCCAAGCGATGCGTAAGCTAGCGGGTAACCTCAAGCAATCAAATACACTGTTAATCTTCATTAACCAAATTCGTATGAAGATTGGTGTGATGTTTGGCAACCCAGAAACCACAACGGGTGGTAACGCGCTGAAGTTCTACGCCTCTGTTCGTCTCGATATTCGCCGTACTGGTGCTATCAAAGAAGGCGATGAGGTGGTAGGTAACGAGACGCGCGTTAAAGTGGTTAAAAACAAAGTCGCTGCACCGTTTAAGCAAGCTGAGTTCCAAATTCTTTACGGTCAAGGTATTAACCGTACCGGTGAATTAGTCGATTTAGGTGTGGCTCATAAACTGATCGAAAAAGCCGGTGCATGGTACAGCTACAAGGGTGACAAGATTGGTCAAGGCCGTGCAAATGCCGGTAAATATCTGACTGAAAACCCAGAAATTGCTGCTGAAATCGATAAAACACTGCGTGAGTTACTGCTGAGTAACCCAAGTGCTATGTCTTCATCTTCTTCTGATGATGAAAATAGCGAAGGCAATGTTGATTTCGAAACAGGCGAAGTATTCTGA
Protein sequences of DBSCAN-SWA_3 >LR134303|3498282:3510042|3499988_3500738_+|VEE63293.1|DBSCAN-SWA MDTVLEVAPQVETQVAVQIDPFSHHVVAIVPAAGIGSRMGAGKPKQYLTLLGQSILAHTLDKLLSHPQINQVIVALHPEDTEFAALPQAKHPKLVTVIGGSERADSVLAALDKAPDNGWALVHDAARPCLMAGDIDKLLASRVHFPQGAILAMPVRDTMKRANSLGEISSTVCRDNLWHALTPQLFPTSLLRLHLKAALAAGAVVTDEASAMEWAGISPGLVAGRADNIKVTHPDDLELAELFLLRANA >LR134303|3498282:3510042|3502422_3503172_+|VEE63296.1|DBSCAN-SWA MIRILVSNDDGVNAPGIRALTEALAEIATVMTVAPDRNCSGASNSLTLTNPLRINRLDNGYISVHGTPTDCVHLAIRELCDGEPDMVVSGINAGANMGDDTLYSGTVAAAMEGRFLGFPAVAISLNGKALKHYHSAAVYARRIVQGLLAHPIASDQILNINVPDLPLDEIKGIRVTRLGARHKAEGIVRTQDPAGKEIFWLGPPGVEQDASEGTDFHAIAHGYVSITPLTVDLTAYRQLSVLQDWVDKI >LR134303|3498282:3510042|3506000_3508586_-|VEE63300.1|DBSCAN-SWA MNPIDTDDLEKHTPMMRQYLTMKAEHHDMLLFYRMGDFYELFYDDAKRASELLGISLTARGKSGGDPIPMAGIPYHAVEGYLAKLVQIGQSVAICEQIGDPATSKGPVERKVVRIVTPGTLTDEALLQERQDNLLAAVYQGKVDFGYATLDVSSGRFVIAELETKESLEAELQRTNPVEILYSEDFGAMELLQHFKGKRRRPEWEFDYDTSIKLLLAQFGTKDLHGFGITDARLSLQAAGCLMQYVKDTQRTALPHINAITRFNQTDTIVLDAATRRNLELTQNLSGGRDNTLAAVLDNTATAMGSRMLQRWIHQPLRDHALIFARQTAVNELLETTAHESLHEQLKALGDIERIMARLALRTARPRDFARLRQALNLLPQLQHSLAQLSAPHTVKLGQLLGEFPEEQQLLERAIVDNPPMLIRDGGVIREGYNAELDEWRGLSEGATDYLVQLEAREKERTGIATLKVGYNRVHGYYIEVSRLQSQQVPLNYQRRQTLKNMERYITPELKEYEEKVLSSQGKALALEKQLWDELFDLILPKLHELQAFARAAAELDVLSNFAERAETLGYTCPELSSEIGVKIEAGRHPVVERVSQTPFIANPVTLHNQRRMLIVTGPNMGGKSTYMRQVALITLMAHIGCFVPADRAIIGPIDRIFTRIGASDDLASGRSTFMVEMTETANILHNATAQSLVLMDEIGRGTSTYDGLSLAWSAAEYLAQQVGAMTLFATHYFELTQLPEIMAGVYNVHLDAIEHEDTIAFMHAVQEGAASKSYGLQVAALAGVPARVIKAAKHKLHQLESRDHQVEGANVNGTRAPIQTLLALPEPVENPAVSKLKAINPDNLTPKQALDLLYELKRLS >LR134303|3498282:3510042|3508968_3510042_+|VEE63301.1|DBSCAN-SWA 
MKVDPNKEKALAAVLSQIEKQFGKGSIMKLGEDRSMDVETISTGSLSLDVALGAGGLPMGRIVEIYGPESSGKTTLTLEVIAAAQREGKTCAFIDAEHALDPIYAKKLGVDIDNLLCSQPDTGEQALEICDALTRSGAVDVIIVDSVAALTPKAEIEGEIGDSHMGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKIGVMFGNPETTTGGNALKFYASVRLDIRRTGAIKEGDEVVGNETRVKVVKNKVAAPFKQAEFQILYGQGINRTGELVDLGVAHKLIEKAGAWYSYKGDKIGQGRANAGKYLTENPEIAAEIDKTLRELLLSNPSAMSSSSSDDENSEGNVDFETGEVF >LR134303|3498282:3510042|3503902_3504799_+|VEE63298.1|DBSCAN-SWA MLNAGLLLNLCLVLLLAGCSFQASRPAPVESLSHSYSKHNKGHIKSNSYKVKKGDTLYSISWAAGKDFAEIAKINQLDKSYTIYPGQILYLTNDTGKNGKGSTTLGGSNSASKGQNKANSLDKQSASNNSSAKNLSSEQQKKTLDQKAKPAYSATSSQQSVNPSIVAPTSTLPDSVSQWQWPVRGKLIGTYSANEQGNKGIKIAGKRGDIIKAAADGRVVYAGSALRGYGNLVIIKHSDDYLSAYAHADQILVEEKQHVLAGQTVAKMGSTGTNQVMLRFEIRYHGQSVNPLNYLPKQ >LR134303|3498282:3510042|3503168_3503804_+|VEE63297.1|DBSCAN-SWA MTRVALTSAVNLAKKLQEAGIRHPAVLKAISHTPRELFLDNALAHKAYENTALPIGQGQTISQPYIVARMTELLLQHQPQKVLEVGTGSGYQAAILAQLVPELCTIERIKGLQIQARQRLKRLDLHNVSFKYGDGWQGWPNRSPFDGIMVTAAAAKVPEALLSQLAEGGVLIIPVGEETQQLMRFTRRSDRFSSEVIETVKFVPLVNGELA >LR134303|3498282:3510042|3501298_3502426_+|VEE63295.1|tRNA|DBSCAN-SWA MSELHYLYGKPTGTADLRTVNSDFIVKEILPFSPSGEGEHHLVHIRKDGLNTVQVAEMLAKFAKVHPKEVTYAGQKDKNAITEQWFGIRIPGKETPTWSELNSERLTILSSSRHSKKLRIGALLGNRFILTLRNVTNVEDIISRIEKVSQIGVPNYFGEQRFGHDGKNLVMGRQMLAGKKVKDRNKRSMYLSAVRSNLFNTVVSYRLANHGTKPLAGDCVMLAGSKSFFVTPEWDLVVLKRLIEKDIQLSAPLWGRGKMLPQGEAAEVETLAMAELSEDCYGLEHAGLEQERRPLLLEPQGLKYEQTADGLVLEFILPAGSFATSLLRELVDYQDVKELQWQATVSPEANTATESQTAELDTPETIAPEPDESAS >LR134303|3498282:3510042|3504879_3505860_+|VEE63299.1|DBSCAN-SWA MSRINSTAAEELVDFSVDTAEFDLDKEDIAADLVQELGLEQQVQDDLQKNLDATQLYLGEIGFSPLLSAEEEVYFSRKALKGCEKSRNRMIESNLRLVVKIARRYNNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQTRTIRLPIHVVKELNVYLRTARELAQKLDHEPTAEEIAEKLQVSSVDVSRMLKLNEKITSVDIPLGGDNDKALLDVLADDDNVGPDYKVQDEDISNSVVKWLNELNTKQREVLARRFGLLGYEPSTLEDVGAEIGLTRERVRQIQVEALKRLRDLLGAQGLSVEALFRN >LR134303|3498282:3510042|3500789_3501269_+|VEE63294.1|DBSCAN-SWA 
MKIRIGHGFDVHKFGEARPLILCGVEVPYETGLVAHSDGDVVLHAISDAILGAMALGDIGKHFPDTDAAYKGADSRVLLRHCYALARAKGFELGNLDVTIIAQAPKMAPHIEDMRHVLAADLNADVADINVKATTTEKLGFTGRKEGIAVEAVVLLSRQ >LR134303|3498282:3510042|3498282_3499578_+|VEE63291.1|DBSCAN-SWA MAKIINVIGREIMDSRGNPTVEAEVHLEGGFIGMAAAPSGASTGSREALELRDGDKSRYLGKGVLTAVANVNGPIRAALIGKDATAQAELDQIMIDLDGTENKDKLGANAILAVSLAAAKAAAAFKGMPLYAHIAELNGTPGQYAMPVPMMNILNGGEHADNNVDIQEFMVQPVGAKNFREALRMGAEIFHTLKKVLHGKGLSTSVGDEGGFAPNLSSNADALAVIKEAVELAGYKLGTDVTLALDCAASEFYKDGKYDLSGEGKVFDSNGFSDFLKSLTEQYPIVSIEDGLDESDWDGWAYQTKIMGDKIQLVGDDLFVTNTKILTRGIENGIANSILIKFNQIGSLTETLAAIRMAKAAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSLCRSDRVAKYNQLLRIEEQLGEKAPYRGLKEIKGQA >LR134303|3498282:3510042|3499621_3499978_+|VEE63292.1|DBSCAN-SWA MLYYLQYIFLIQDTDANSSMKFFVIALIVLLGLLQYRLWSGDNSLPEYFVLQKQIAAQQEGNAKLNERNQVLKEEIIDLKSGTEAIEERARNELGMVKEGETFYRVVGGDRSVSSPSQ |
11 | uncultured_Mediterranean_phage(28.57%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
4687569 : 4698866
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >LR134303|4687569:4698866|DBSCAN-SWA TTTGTCTCAAGGAAATAGTGTACGCACGCTCTCAGGGCTGCAGCGTCTGCTCGAAGGCGGACTCATCATTTGCTGTGTTTTAGCCACCTATATTTTGCTTGCCTTAACCAGTTTTAGTCCGTCAGATCCGGGCTGGAGCCAGTCTCATTTTCAAGGCGATATCAAAAACTGGACGGGTGCAGTCGGGGCGTGGATTGCCGACATTCTGCTGTACTTTTTTGGCGTCACTGCTTACATCATGCCGATCATTGTGGCCTCCACAGGTTGGCTGCTTTTCAAACGCGCCCATGATTTATTGGAAATTGACTACTTTTCAGTCGCACTGCGCATCATAGGTTTTCTACTGCTTATTCTTGGTTTTTCAGCATTAGCCAGCATGAATGCCAATAATATCTATGAGTTTTCCGCAGGCGGCGTGGCAGGGGATGTTATCGGCCAAGCCATGTTGCCGTATTTTAATAAGCTCGGTACGACCTTGTTGCTGCTGTGCTTTTTAGGCTCGGGTTTTACGCTGTTAACAGGGATCAGCTGGTTGACAGTGGTCGAGAAGGTTGGCTTTGTCTCCATTTGGTGTTTCAGAAAACTCAAGCGTCTGCCGCAGGCGTTAAAACGCGAGCGTGAAACCGAAGATACCCGCGGCTTCATGACGGTCGTTGATAAGTTTAAACAGCGCCGTGACTCACAGCATCAACTTGAAAAAGCCCGAGTGCGTGAGCCAGAGGTGGCACCGAGTCGTATCTTTACGACTCGCCCCGTGAAGGAAGAAGAGGCCAGTGACGAAATCATCACCGAAGCAAGCTCCAGTAAAGGCAAGCTGTCGGCGCTCGCAAAAATCTTAAGTTTAAACAGCAATAAAGCCAAGGCTGAACCTAAAGGCCTACAAAGAGTCGAGCCCCAATTAGATCAAGCCAGTGCCGTAGCTGAGCATGGTCATTTTGAAGCGCCGCCTTGGGTCGCCAAGCCCAAAGAGGCCGAGTTAGATTTAGATGATGAAACTGAGTTCAAGGCGCACGTTTTCGAAGATGAAGATGACGATGAACCCGTATTCCATCGTGAAACCATGCTCGATGACGAGGATGAAGACGAGTTAGGTTTCAACGATGAAGATGTCATCGATTTTGATACTAAAGCCTCAACGGGCGCTGTGACTCAGGCGCAACGTCAAAAAGAAGCGCCAAAGGCGAAGATTGTCGATGGTATTGTGGTTTTACCGGGACAAGAAGATAAACCTGTTCCAGCAAAACCTATGGATCCGCTGCCAAGCATCAGCTTGCTGGATGTGCCTAATCGTAAAAAGAATCCGATAAGCCCAGAGGAATTAGAGCAAGTTGCCCGCTTAGTCGAGGCCAAACTTGCCGACTTTAACATTGTGGCCACTGTGGTGGGGGTTTATCCCGGTCCTGTGATCACCCGTTTTGAGCTAGATTTAGCGCCAGGTATTAAAGCTTCTAAGATTTCAAACCTCGCAAACGACTTAGCGCGTTCGCTACTGGCTGAGCGTGTGCGTGTGGTTGAAGTCATTCCTGGTAAATCCTATGTGGGTCTAGAGCTGCCGAATAAGTTCCGCGAAACCGTGTACATGCGCGATGTACTCGATTGTGAAGCCTTTACCGAAAGCAAATCGAACCTGACCATGGTGCTTGGCCAAGACATTTCCGGCGAGCCTGTGGTGGTTGATTTAGGCAAAATGCCGCACTTACTGGTAGCCGGTACCACGGGTTCGGGTAAATCGGTCGGGGTGAACGTGATGATCACCAGCTTATTGTATAAGTCTGGCCCTGAAGACGTTCGCTTTATCATGATCGACCCGAAAATGCTGGAATTATCGGTTTATGAAGGCATCCCACATTTACTTTGTGAAGTGGTTACCGACATGAAAGAAGCGGCCAATGCGCTGCGCTGGTGTGTAGGTGAGATGGAGCGCCG
CTACAAGCTAATGTCTATGATGGGCGTTCGTAACATCAAGGGCTATAACGCCAAGATTGCCGAAGCGAAAGCGAATGGCGAAGTGATTTTAGATCCCATGTGGAAGTCATCTGACAGCATGGAGCCAGAGGCGCCAGCCTTAGATAAACTGCCATCGATTGTGGTTGTCGTCGACGAATTTGCCGACATGATGATGATTGTCGGTAAAAAAGTTGAAGAACTGATCGCCCGTATCGCCCAAAAGGCTCGTGCTGCGGGTATCCATTTAATTCTCGCGACCCAACGTCCATCGGTGGATGTGATCACAGGCTTAATTAAGGCCAACATCCCGACACGGATGGCCTTCCAAGTGTCTTCCCGTATCGACTCCCGTACCATTTTGGATCAGCAGGGTGCTGAAACCTTATTGGGTATGGGTGACATGCTGTATTTACCACCCGGAACCGCAGTGCCAAACCGTGTTCATGGTGCCTTTATCGATGACCATGAAGTTCACCGTGTGGTGGCCGATTGGTGCGCCCGTGGTAAGCCACAATACATTGATGAAATCCTCAATGGCGTGAGTGAGGGCGAACAAGTGCTCTTGCCGGGTGAAACCGCCGAATCCGATGAAGAATACGATCCGCTTTACGATGAGGCCGTCGCCTTCGTGACCGAAACCCGCCGTGGTTCAATTTCGAGCGTGCAGCGTAAATTTAAGATTGGTTATAACCGCGCAGCGCGTATTATCGAACAAATGGAAATGCAGGGTGTGGTCTCGGCTCAGGGCCATAATGGTAACCGCGAAGTGCTGGCACCGCCGGCCCCGAAACACTATTAATGTTAAGCTAGATGTCAGTTTGAACTCGCCATAATGCGGCGAGTTCAACTTTTAGTCCAAAGGATACCTATGAAAAAACTGTTGTGTGCTGTGTTGTTATCACCATTGTTATACAGCAATGCAGTATTGGCCGATGACGCAAAGCAATTACGCGAAACCTTAAACGGCACTGAGTCACTCAAGGCGGATTTCAAGCAAACCGTTACCGATATCAATAAAAAGGTTATCCAAACGGGCGCGGGTGTATTCGCACTGGCCCACCCAAACCAGTTCTATTGGCATTTAACCGCGCCCGATGAATCCCAAATTGTCGCCGACGGTAAAGATTTATGGATCTACAATCCCTTCGCCGAAGAAGTGGTGATTATGGATTTTGCCGAAGCCATTAATGCATCGCCCATCGCCTTATTGGTTCACCGCGATGACGCCACTTGGTCGCAATATAGCGTGACCAAAAAGCAAGACTGTTATGAAATCAAACCTAAGTCCACGGATGCCGGGATTACCTCAGTCAATGTCTGTTTTAACAAAGGCACACTGAATAAATTTAATGTGCTCGATGATAAAGGCAACCTGAGCCAGTTTGACTTGAGCAATCAACACAGCATTAGCGCTGCGGATAAAGCGCTGTTTAAGTTTGTGTTGCCAGAGAACGTTGATGTGGATGATCAACGTCTTAAAACCCAGTAGGTTAGCGCGTGAGCAGTTTATCGTTTAATTTTGCGCCTGATTTTCGTCCCTTAGCCGCGCGTATGCGGCCAAGGACGATTGCCGAGTATATCGGTCAAGCCCATTTGTTGGGGGAAGGCCAACCATTGCGCCAAGCATTAGAAGCGGGCCGTGCCCATTCGATGATGTTATGGGGACCGCCGGGCACAGGTAAAACGACTCTTGCCGAACTTATCGCCCATTATTCCAATGCCCATGTTGAACGCATCTCGGCGGTCACCTCGGGGGTAAAAGACATTCGCGCCGCCATCGAGCAGGCGCAAGCCGTTGCCCAATCCCGCGGACAACGCACTTTGTTGTTTGTCGATGAAGTGCATCGCTTTAATAAGAGTCAGCAGGACGCCTTCTTGCCCTTCATTGAAGATGGCACTGTGATTTTTATTGGGGCGACCACTGAAAACCCTTCATTTGAAATCAATAACGCCTTACTGTCGCGGGCGCGTGTCTATCT
TATCAAGCGCTTAAGCCAAGATGAGATAGTCCATATCATCACCCAAGCCTTAACAGATCCCGAGCGAGGGTTAGGACAACGTCAGTTAGTCATGCCAACGGATGTGCTTAATAAGCTGGCGCAGCTCTGTGACGGCGATGCGCGCAAAGCGCTAAACCTGCTTGAATTGATGAGCGATATGGTGGCCGATGGCGGCAGTTTTACCACTGAAATGCTGGTACAAGTCGCCGGCCATCAAGTGGCGGGGTTCGATAAGAATGGCGATCAGTTTTACGATTTGATTTCGGCAGTACATAAATCGATTCGTGGCTCAGCGCCCGATGCAGCACTGTATTGGTTTTGCCGTATTCTAGAAGGCGGCGGCGATCCGCTATATGTCGCCAGACGCTTATTAGCCATAGCCTCGGAAGATGTGGGCAATGCAGACCCTAATGCCATGACAGTCGCGCTTAATGCCTGGGATTGTTTCCATCGCGTTGGGCCTGCTGAAGGTGAGCGGGCGATCGCCCAAGCGATTGTGTATTTAGCCAGTGCGCCTAAGAGTAATGCCGTTTACACCGCCTTTAAGGCGGCGAGGGCACTGGCCCGCGAAACGGGCCATGAGGCCGTACCTTATCATCTGCGTAATGCGCCGACCAAGCTCATGGCGGAAATGGGTTTTGGCGCCGAATATCGCTATGCCCACGATGAACCCAATGCCTATGCCAGCGGCGAGAATTATTTCCCCGAATCCTTACAAGAGTCGCAATTTTATTTCCCGACCGAACGCGGATTTGAGAAGCGGATTAAGGAAAAATTGGCGCAATTAGCGCAATTAGATCAAGCAAGTGGGAGAAAAAGGTATGAATAATCTCCTACTTGTGGCCTTGGGTGGTTCCATTGGTGCGGTTTTTCGCTATCTTATTTCAATATTCATGATCCAAGTATTTGGCAGCAGTTTTCCTTTTGGTACACTGTTGGTCAATGTCCTCGGTTCATTTTTAATGGGCGTCATTTACGCACTGGGACAAATGAGTCATATCAGCCCAGAACTCAAAGCCCTGATCGGCGTTGGCCTATTAGGTGCTTTGACAACGTTTTCAACTTTCTCAAACGAAACTTTATTGCTGATGCAAGAAGGAGATTGGTTGAAGGCGGCTTTGAATGTGGTGTTGAACCTAAGTCTATGTTTGTTTATGGTTTACTTAGGCCAACAACTGGTTTTTTCTCGCATTTAACTATTAAGAATATATCACATGTTAGATCCTAAATTTTTGCGCAACGAATTAGCAGTTACCGCTGAGCGATTAGCTACCCGTGGTTTTATTTTAGATGTCGCTCATCTCACTCAATTAGAAGAAAAACGTAAGTCACTGCAAGTGGCGACTGAAGAGTTACAAGCTTCGCGTAATGCTATTTCCAAGTCCATCGGACAAGCAAAAGCCCGCGGCGAAGATGTGGATGCCATCATGGCGCAGGTTGGCGATTTAGGTGCGCAATTAGATGCGAAGAAAGTCGAGCTGGCTGCGGTACTTGAAGAAGTGAACGCGATTGCCATGTCGATGCCAAACCTGCCGGATGAGTCAGCGCCTATCGGTGCTGACGAGACTGAAAACGTCGAGATCCGCCGTTGGGGCACACCACGCAGCTTCGATTTCCCTGTTAAAGATCATATTGACTTAGGTGAAGGCCTAAACGGTTTAGATTTTAAGAGCGCCGTGAAAATTACTGGCTCACGCTTTATCGTTATGAAAGGCCAAATCGCCCGTTTAAACCGCGCATTAGGTCAGTTCATGTTAGATCTGCACACCACTGAGCACGGTTATACCGAAGCTTACGTGCCATTACTGGTTAACGAAGCAAGCTTACTGGGTACTGGCCAATTGCCTAAGTTTGGTGAAGACTTGTTCCACACTAAACCTGCGACCGAAGAAGGCCAAGGTTTAAGCCTGATCCCAACCGCAGAAGTGCCATTAACGAACTTAGTGCGTGACAGCATTGTCGATGAAGACGAATTACCG
ATTAAGTTAACCGCGCATACCGCCTGTTTCCGCAGTGAAGCCGGCTCATACGGTAAAGATACCCGTGGTCTTATCCGTCAGCACCAATTCGATAAAGTGGAATTAGTGCAATTGGTTAAGCCAGAAGACTCAATGGCAGCGCTCGAAGCACTAACGGGCCACGCTGAAACCGTACTGCAACGCCTCGGTCTGCCATACCGCACAGTGATCCTGTGTACTGGTGACATGGGCTTTGGTTCAAGCAAAACCTACGATATCGAAGTGTGGTTACCAGGCCAAAACACTTACCGCGAGATTTCTTCATGTTCAAATATGAAAGACTTCCAAGCCCGTCGTATGCAAGCCCGTTACCGCGTTAAGGCCGATAACAAGCCAGCCTTGCTGCACACCTTAAACGGCTCAGGCCTAGCGGTAGGTCGTACTTTAGTGGCGATTTTAGAGAACTATCAAAATGCCGATGGTAGCGTGACCATTCCTGAAGCACTGCGTCCATACATGGGCGGACTGACTCAGATCGGTTAATTGGCGTAACACAACACCATGGATAAAGGTCAACGTTATCTTCGTTGGCTTTCCTACACCGCTGTCATCGCCGTGTTTTGCGCTGTGATGTTGGCCACCCTAGGTAAAGCCGTCTGGGTTGTGTTAGAAAACGTGAAATAGCACGACTCAAATAAAAAGGTATATTCCACTGGAATATACCTTTTTTGTTAGCAGAAGTGATTAAACCTTAAATACACTTGGCAGGTTTAGGCAGACCCGCAATCTTTGTCGCCTGTTTGGCAGGACCAACCGGAAAGAGGGTATAGAGGTACTTAGAGTTACCTTTATCCGGCCCTAAGGCTTGACCAATGGCTTTGACTAACACCCTGATCGCAGGGCTGGTTTTATATTCCAGATAAAAATCTCGAACAAAATGGATCACTTCCCAGTGGGCACTGGTCAGTTCAATCTGTTCTTCTTGGGCTAACAGCGGCGCCATATCCGGCTGCCAGTCGGCGATATTCTTCAAATAGCCTTGATGGTCGCGTTCAATTTCGACGCCATTGAATATAAGCGGATTTACCACGTGATCACCTTATCGTGCTGCAAAGATTGAGCGACAAACTCGTTGTAATCAATCTGACTAATGTTTTTAAGACGTTCCGTTAAGCCGCGGGCGATGACATCGTCTTTCAGCACCATCACCTTAAAGGGCGAAAGCGCCATGGCCCATTGGCGCAGTAATAATGCGTTAACGCCATCGCTCGAGAGTAAAATCGCATCCTCTTTGCAAGCATAACGCAGGCAAAGTTTGAGGGCATTGTCTCGGCTTGGCGAGGTTTGGATATGATGTAAAATCATTAGAATACTAAGACCTCATCGACGGCTTTTAAGTGAGCCGAGATGGCTTCATCGTTTAAGATAGTCACCGGAATCGACAGCAAGCTATGGCTTAAGCCGTAGTCACCTAAGGACTGTTTGCAGGCGAACACTGATTCGATGTCGTACAGCGGGAGCGCTTTTAGCGCCGCAAGGTAATCTTTACCGCCGATTAGCTCGGGTTGTTGATCTTTGAGCAGATGTAGCACGCCTTCATCGACAAACACTAAACTCACTTCTTGCTCAAAGCTGGCGCTTAACAGGGCAAAGTCTAAGGCTTCACGGCCCTTAGTGGTGCCATGGGGCGCGCTTCGGAACAATATACAGATTTTTTTCATAACATTTCCGAAGCTTAAAAGCAGATCAAACGATCGGCTGATTCAATTCCCGTGACCAGTTCACCCAGTCCACCCATGATAAAAGAATGTTCAACATTCCAGTGGCTTAGCCCGTTTTCCTGCGCCTCTTGCTGCGAGACTATGCCACGTCGTAATGCCGCTGAAACACAGTTCACTAACGAGACTTGATGCTCGGCTGCCAGTTGTTTCCAACCTTTAACCACATCATATTCATCGGATGTGGGTAAGTTAAAGTCCGTCGAGTTATACACGCCATCCTGATAGAAAAACACGCAGACA
ATCTCATGTCCGCTTTGCAAGGAGGCTTGGGTGAAACGTAGGGCATTCACACTTGCCGACGTGCCATAGGCTGGCCCGTTTACTTGGATAATAAATTTGCTCATCATAAAAAAAATGGCCCTAAAAGTAGGGCCATTTTAGCGTAAGTTTCCCGAAAAGGGGATTAATCATCGTTACCAATACCAAGCAAATGCAAAATGGCAATGAATAGGTTTAAAAAGTCCAGATACAAGGAGATAGTGGCGCGGATATAGTTAGTCTCGCCACCATTCACGATACGGCTAGTATCGAACAGGATAAAGCCGGTCATTAACAGCGCAAGACCCGCGTTAATCGCCATAAAGGCCACACTGTTACCCACGAAAATATTAATCAGCGCAGCGGCGATCACCACAATCAATCCGGCGAACAGGAAACCACGTAGGAAAGAGAAATCTTTCTTCGTCGTGACAGCATAGGCCGAGAGCGCAATAAAGATAACCGAGGTTAATCCTAAAGCTTGCATGATAAGCTCAGAGCCATTGGTCATGCCAGCGTAGTGGTTAAGCATGTAACCTAATGAGGCGCCTTCCATACCAGTAAAGGCAAATACCCAGAAGATGCCCGCAGCCGAGTCGGCTTTACGTAGGGTAACGAACAACAGCACTAAACCACCGATAGATAAACCGAGTGACATCAGAGGGCTGATATTGAGTGCCATCGCTAAGCCTGCACACAATGCAGAGAAGGCGAGCGTCATGGCCAACAATAAATAGGTATTTTTAAGAAGTTTATTCACTTCTAAGGTGGATGCACTCGCTGAGTACAAGGTTTGCTGAGTCATATCTTTCTCCATTAGCTTTTAAATCCAAACTGATCATTTTGATTTTTAACGGGGACGAGAGTTCCCATCAATATCGAAAATCCAGTAAAACTGGGTGAATACTACCGAAAACAGCGCTGTGCTTAAAGGATTCACGCCACAAATAACAAGTTGTAATGACTGATTTTTCTGTGCGGCCTAACGCTTAAGGCGTATTTAATTACACCAGTGCAACACAAAGACTTTAAAACAAGCTTAATCCTGTGGTAATACCGAAATATTACGCCAGCCATTCACGGTGTAATTTATCGGTATCCCCTAAATACTCGAGCACCCAAGCTAGGGCGGGCGACATCTTATCTGCGTTCCAAGCGAGGCAACAGGGGCTGGTTAATTTCGGATTTTCAAGCTGCTTTTCAACCAAGGCGCCCGCTTTAATAAATACGCTGGCCAAATGTACTGGCATATAGCCGATACCTAATCCCTCTCTAAAGCAGTTAATTGCGCGGATCCAATCCGGCACCACGAGTCTGCGTTGATTCTCCAATAACCAGGTCATGCGCTTAGGAATTTCCCGCGAGGTATCTTCCAAACAAATGGAGGGGAAGGGACGCAGTTCATCATCGGTCAAGGGCCGGTCAATACTGGCTAAGGGATGGTTTTTACTCACCAAGAATGCCCATTGAATATCGCCCATGTCCTTATATTGGTATACGCCTCCCACTGGGATCGCCGTTGTCGCACCAATGGCGATATCACTGCGACCGGTCGCAAGCGCCTCCCACACACCGTTAAAGACCTCGATGCGGATGATAAGCTCAATATCATGAAAATGACGGTAAAAGTCGGCAATTAATACGCTAATTCTGTCGGCGCGCACTATGTTATCTAAGGCGATAGACAATGTGGGTTGCCAGCCGTTGGCCACACGCTGGGTGCCACGTTTCATCTCATCCATTTGAGTCAGCAAGTTTCTGGCTTGCTTAACAAAATGCTCTCCTGCGGGGGTCAGGGTCACGCTGCGGTGGTGGCGCTCAAATAACACTACGCCAAGTTCTTCTTCGATTTGTTTAACAGCATAACTGACCGCAGAAGGGACCTTGTGCAGTCGATTGGCTGCCGCCGTAAAACTCCCAACACGGGCTACGATATCAATGAGTTCTAACGCTTGTTCCGAGAGCATAGCTGACATCCATTG
TGATGAAAATAATTGATAGCATCTGTCAAAAATAAACGTTTCCAATCAAATTAACAAGTATTTAGACTGCACGCCTTAAGTCTTTTGACATGATTATTTTATGAAAACGTCTAGTAATATCTTTGTAAATATGAAGTTTTTTATATTCTTATTCTATTTGGCCTTATTAAGCATGTTAGGCTTTATTGCCACTGATATGTATTTGCCTGCTTTCAAAGCTATTGAAAGTTCGTTCAATTCTTCACCGTCTCAAGTGGCGATGTCGCTCACCTGTTTCTTGGCTGGTTTAGCATTAGGGCAACTGATTTATGGCCCTTTGGTCAGTAAATTAGGCAAACGTTACGCTCTTATCATCGGCCTTGGCATTTTTGCGCTCGCCAGTGTGGCCATCGCCAATAGCGACTCGATATTGATGTTAAACATCGCTCGCTTCTTCCAAGCCGTAGGCGCCTGTAGCGCAGGGGTCATTTGGCAAGCGATTGTGGTCGAGCAATATGATGCCGAAAAAGCACAGGGGATTTTCAGTAACATTATGCCGTTAGTGGCATTATCACCCGCATTAGCCCCCATCCTTGGCGCTTATATTCTGAACGATTTTGGATGGCGTGCAATCTTTATCTCATTGTGTGTGATTGCCTTTTTATTGGTGTTGATGACCTTATACTTCGTGCCGAGCCATGCAGAGCATCAGGATGCTAAGCCAAGCGCTGTTTCTTACGGGCAGATTTTGAAAAATACCCGTTACCTAGGCAATGTGGTGATTTTTGGTGCCTGTTCGGGTGCGTTTTTCGCATATCTTACTGTATGGCCGATTGTGATGGAGCAACACGGCTATCAGGCAACTGAGATTGGGCTGAGCTTTATTCCACAAACCATTATGTTTATTGTGGGTGGATACGCGAGTAAGTTATTGATAAAACGCATTGGTGCCGACCGTACACTCAACGTATTGCTGACCATTTTTGGACTCTGCGTTATCTCGATTGTGTTTTTCACCTTATTAATGAAGGCGGAAACCATTTTCCCTCTGCTGATTTCCTTCTCGATACTCGCAGCGGCGAATGGGGCGATTTATCCTATTGTGGTGAACAGTGCTTTACAACAATTCACTCAAAATGCGGCTAAGGCGGCAGGATTACAGAACTTTTTGCAAATCACCATCGCCTTTGGCACCTCAAGTTTAGTCGCACTCTGGGCAAGTTCAGGAGAAGTCGCCATAGGTTGGGGCATTCTGAGCTGTTCATTAGTAGTGATCTTGGGTTACCTGTTAAAAACCGAACAAACTTGGGCTGATTTTGCCAAACACTTTACGGCGCCGGATCCTGCTCGTCTTGGGATCAATGCAGATACGAAGCAAAATCAAGCAGATTGA
Protein sequences of DBSCAN-SWA_4 >LR134303|4687569:4698866|4690383_4691004_+|VEE64311.1|DBSCAN-SWA MKKLLCAVLLSPLLYSNAVLADDAKQLRETLNGTESLKADFKQTVTDINKKVIQTGAGVFALAHPNQFYWHLTAPDESQIVADGKDLWIYNPFAEEVVIMDFAEAINASPIALLVHRDDATWSQYSVTKKQDCYEIKPKSTDAGITSVNVCFNKGTLNKFNVLDDKGNLSQFDLSNQHSISAADKALFKFVLPENVDVDDQRLKTQ >LR134303|4687569:4698866|4691012_4692344_+|VEE64312.1|DBSCAN-SWA MSSLSFNFAPDFRPLAARMRPRTIAEYIGQAHLLGEGQPLRQALEAGRAHSMMLWGPPGTGKTTLAELIAHYSNAHVERISAVTSGVKDIRAAIEQAQAVAQSRGQRTLLFVDEVHRFNKSQQDAFLPFIEDGTVIFIGATTENPSFEINNALLSRARVYLIKRLSQDEIVHIITQALTDPERGLGQRQLVMPTDVLNKLAQLCDGDARKALNLLELMSDMVADGGSFTTEMLVQVAGHQVAGFDKNGDQFYDLISAVHKSIRGSAPDAALYWFCRILEGGGDPLYVARRLLAIASEDVGNADPNAMTVALNAWDCFHRVGPAEGERAIAQAIVYLASAPKSNAVYTAFKAARALARETGHEAVPYHLRNAPTKLMAEMGFGAEYRYAHDEPNAYASGENYFPESLQESQFYFPTERGFEKRIKEKLAQLAQLDQASGRKRYE >LR134303|4687569:4698866|4694224_4694563_-|VEE64315.1|DBSCAN-SWA MVNPLIFNGVEIERDHQGYLKNIADWQPDMAPLLAQEEQIELTSAHWEVIHFVRDFYLEYKTSPAIRVLVKAIGQALGPDKGNSKYLYTLFPVGPAKQATKIAGLPKPAKCI >LR134303|4687569:4698866|4692729_4694016_+|VEE64314.1|tRNA|DBSCAN-SWA MLDPKFLRNELAVTAERLATRGFILDVAHLTQLEEKRKSLQVATEELQASRNAISKSIGQAKARGEDVDAIMAQVGDLGAQLDAKKVELAAVLEEVNAIAMSMPNLPDESAPIGADETENVEIRRWGTPRSFDFPVKDHIDLGEGLNGLDFKSAVKITGSRFIVMKGQIARLNRALGQFMLDLHTTEHGYTEAYVPLLVNEASLLGTGQLPKFGEDLFHTKPATEEGQGLSLIPTAEVPLTNLVRDSIVDEDELPIKLTAHTACFRSEAGSYGKDTRGLIRQHQFDKVELVQLVKPEDSMAALEALTGHAETVLQRLGLPYRTVILCTGDMGFGSSKTYDIEVWLPGQNTYREISSCSNMKDFQARRMQARYRVKADNKPALLHTLNGSGLAVGRTLVAILENYQNADGSVTIPEALRPYMGGLTQIG >LR134303|4687569:4698866|4695208_4695601_-|VEE64318.1|DBSCAN-SWA MMSKFIIQVNGPAYGTSASVNALRFTQASLQSGHEIVCVFFYQDGVYNSTDFNLPTSDEYDVVKGWKQLAAEHQVSLVNCVSAALRRGIVSQQEAQENGLSHWNVEHSFIMGGLGELVTGIESADRLICF >LR134303|4687569:4698866|4692336_4692711_+|VEE64313.1|DBSCAN-SWA MNNLLLVALGGSIGAVFRYLISIFMIQVFGSSFPFGTLLVNVLGSFLMGVIYALGQMSHISPELKALIGVGLLGALTTFSTFSNETLLLMQEGDWLKAALNVVLNLSLCLFMVYLGQQLVFSRI >LR134303|4687569:4698866|4695657_4696317_-|VEE64319.1|protease|DBSCAN-SWA 
MTQQTLYSASASTLEVNKLLKNTYLLLAMTLAFSALCAGLAMALNISPLMSLGLSIGGLVLLFVTLRKADSAAGIFWVFAFTGMEGASLGYMLNHYAGMTNGSELIMQALGLTSVIFIALSAYAVTTKKDFSFLRGFLFAGLIVVIAAALINIFVGNSVAFMAINAGLALLMTGFILFDTSRIVNGGETNYIRATISLYLDFLNLFIAILHLLGIGNDD >LR134303|4687569:4698866|4694556_4694838_-|VEE64316.1|tRNA|DBSCAN-SWA MILHHIQTSPSRDNALKLCLRYACKEDAILLSSDGVNALLLRQWAMALSPFKVMVLKDDVIARGLTERLKNISQIDYNEFVAQSLQHDKVITW >LR134303|4687569:4698866|4697594_4698866_+|VEE64321.1|DBSCAN-SWA MKTSSNIFVNMKFFIFLFYLALLSMLGFIATDMYLPAFKAIESSFNSSPSQVAMSLTCFLAGLALGQLIYGPLVSKLGKRYALIIGLGIFALASVAIANSDSILMLNIARFFQAVGACSAGVIWQAIVVEQYDAEKAQGIFSNIMPLVALSPALAPILGAYILNDFGWRAIFISLCVIAFLLVLMTLYFVPSHAEHQDAKPSAVSYGQILKNTRYLGNVVIFGACSGAFFAYLTVWPIVMEQHGYQATEIGLSFIPQTIMFIVGGYASKLLIKRIGADRTLNVLLTIFGLCVISIVFFTLLMKAETIFPLLISFSILAAANGAIYPIVVNSALQQFTQNAAKAAGLQNFLQITIAFGTSSLVALWASSGEVAIGWGILSCSLVVILGYLLKTEQTWADFAKHFTAPDPARLGINADTKQNQAD >LR134303|4687569:4698866|4694837_4695194_-|VEE64317.1|tRNA|DBSCAN-SWA MKKICILFRSAPHGTTKGREALDFALLSASFEQEVSLVFVDEGVLHLLKDQQPELIGGKDYLAALKALPLYDIESVFACKQSLGDYGLSHSLLSIPVTILNDEAISAHLKAVDEVLVF >LR134303|4687569:4698866|4687569_4690314_+|VEE64310.1|DBSCAN-SWA 
MSQGNSVRTLSGLQRLLEGGLIICCVLATYILLALTSFSPSDPGWSQSHFQGDIKNWTGAVGAWIADILLYFFGVTAYIMPIIVASTGWLLFKRAHDLLEIDYFSVALRIIGFLLLILGFSALASMNANNIYEFSAGGVAGDVIGQAMLPYFNKLGTTLLLLCFLGSGFTLLTGISWLTVVEKVGFVSIWCFRKLKRLPQALKRERETEDTRGFMTVVDKFKQRRDSQHQLEKARVREPEVAPSRIFTTRPVKEEEASDEIITEASSSKGKLSALAKILSLNSNKAKAEPKGLQRVEPQLDQASAVAEHGHFEAPPWVAKPKEAELDLDDETEFKAHVFEDEDDDEPVFHRETMLDDEDEDELGFNDEDVIDFDTKASTGAVTQAQRQKEAPKAKIVDGIVVLPGQEDKPVPAKPMDPLPSISLLDVPNRKKNPISPEELEQVARLVEAKLADFNIVATVVGVYPGPVITRFELDLAPGIKASKISNLANDLARSLLAERVRVVEVIPGKSYVGLELPNKFRETVYMRDVLDCEAFTESKSNLTMVLGQDISGEPVVVDLGKMPHLLVAGTTGSGKSVGVNVMITSLLYKSGPEDVRFIMIDPKMLELSVYEGIPHLLCEVVTDMKEAANALRWCVGEMERRYKLMSMMGVRNIKGYNAKIAEAKANGEVILDPMWKSSDSMEPEAPALDKLPSIVVVVDEFADMMMIVGKKVEELIARIAQKARAAGIHLILATQRPSVDVITGLIKANIPTRMAFQVSSRIDSRTILDQQGAETLLGMGDMLYLPPGTAVPNRVHGAFIDDHEVHRVVADWCARGKPQYIDEILNGVSEGEQVLLPGETAESDEEYDPLYDEAVAFVTETRRGSISSVQRKFKIGYNRAARIIEQMEMQGVVSAQGHNGNREVLAPPAPKHY >LR134303|4687569:4698866|4696576_4697479_-|VEE64320.1|DBSCAN-SWA MLSEQALELIDIVARVGSFTAAANRLHKVPSAVSYAVKQIEEELGVVLFERHHRSVTLTPAGEHFVKQARNLLTQMDEMKRGTQRVANGWQPTLSIALDNIVRADRISVLIADFYRHFHDIELIIRIEVFNGVWEALATGRSDIAIGATTAIPVGGVYQYKDMGDIQWAFLVSKNHPLASIDRPLTDDELRPFPSICLEDTSREIPKRMTWLLENQRRLVVPDWIRAINCFREGLGIGYMPVHLASVFIKAGALVEKQLENPKLTSPCCLAWNADKMSPALAWVLEYLGDTDKLHREWLA |
12 | uncultured_Caudovirales_phage(28.57%) | tRNA,protease | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
4943792 : 4954010
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >LR134303|4943792:4954010|DBSCAN-SWA CATGGAGAAGTTATCCGGCGCCAGCATGATTGTGCGATCCCTTATCGATGAAGGTGTAAAGCACATATTTGGTTACCCTGGCGGCTCAGTGTTAGATATCTACGACGCCCTGCACCTTATCCCCGGTATTGAACATATTCTGGTTCGTCACGAACAAGCCGCCGTGCATATGGCCGATGGTTATGCCCGTGCCACGGGTAAAGTGGGCGTGGTACTCGTGACCTCAGGCCCAGGTGCCACCAATGCTATTACCGGCATAGCAACCGCCTATATGGATTCAATTCCATTAGTGGTGTTATCTGGCCAAGTACCGAGCAATTTAATCGGTAACGATGCGTTCCAAGAATGCGACATGATTGGGATTTCACGCCCAGTCGTCAAACACAGCTTTTTAGTACAAGATCCCACTCAAATTCCTGAGATCATTAAAAAAGCCTTTTATATTGCCTCCACGGGTCGCCCCGGCCCCGTGGTTGTCGACTTGCCAAAGGATTGCCTCAATCCAGCATTACTGCACGACTATATTTACCCCGAGAGCATTAAAATGCGCTCCTATAATCCAACGACATCGGGCCATAAAGGTCAAATTCGCCGTGGATTACAGGCATTACTCGCCGCGAAAAAACCTGTGCTTTATGTGGGTGGCGGCGCCATTATCTCAAGCAGTGAAAAGCAAATCTTAGCCCTAGCGGAAAAATTAAATATCCCCGTTGTCAGCACCTTGATGGGCCTTGGCGCCTTCCCTGGCACCCATAAAAACAGCTTAGGAATGCTGGGGATGCATGGTCGCTATGAAGCCAACATGACCATGCATAACTGCGATCTTATCTTCGGGATTGGCGTGCGTTTCGATGACAGAACCACCAACAACATCGAAAAATACTGCCCTAATGCCACCATCTTACATATTGATATCGACCCTTCATCGATTTCAAAAACCGTACGAGTGGATATCCCCATCGTTGGCTCGGCCGATATCGTATTAGACAGCATGCTAGCCCTGCTCGAGGAGTCGAAGGAGAGCAACGATAGCGAGGCAATTAATCAATGGTGGCAAGAAATCACTGTGTGGCGTAACCGCAACTGCTTGGCATACGATAAGGAAAGCCATCGGATCAAACCACAGCAAGTGATTGAAACTCTTTACAAACTCACTAATGGTGAGGCTTATGTCGCCTCGGATGTGGGCCAGCATCAAATGTTTGCCGCGCTCTATTATCCCTTTGATAAACCTCGTCATTGGATTAACTCCGGCGGCCTTGGCACCATGGGCTTTGGCTTACCCGCTGCCATGGGAGTCAAAATGGCAATGCCGGACGAAACCGTTGTGTGCGTCACGGGCGATGGTTCGATTCAGATGAATATTCAAGAACTTTCAACCGCCCTGCAATACGATACGCCGGTTAAAATTATCAACTTAAACAACCGTTTCCTTGGCATGGTGAAACAGTGGCAGGATATGATCTACTCGGGTCGCCATTCCCATTCCTATATGGATTCGGTCCCTAACTTTGCCAAAATTGCCGAAGCCTATGGCCATGTTGGCATGACAATTAGCGATCCTGCTGAGCTTGAGTCTAAGATGGCCGAAGCCCTTGCCATGAAAGACAGACTGGTATTTATCGACATTATGGTGGATGAGACTGAGCACGTGTATCCCATGCTTATCCGCGGCGGCGCGATGAATGAAATGTGGTTAAGCAAAACGGAGAAATGCTAATGCGTCGAATTATATCTGTATTACTCGAAAACCAATCCGGCGCCCTATCCCGTGTTGTCGGTCTGTTTTCCCAGCGCGGTTACAACATCGAAAGTTTAACTGTGGCACCAACCGATGATAACACCTTATCGCGCCTCAATATTACCGTAGTGGCCGACGAGAAAGTGCTAGAGCAAATCGAGAAACAGCTGCATAAGTTAGTCGAT
GTGCTTAAGGTGTCGAATGTCACCGAATCGGCTTATATCGAGCGGGAGTTGGCACTGGTTAAAGTCCGCGCCCAAGGGGAGTTACGGGAAGAAATTAAGCGCACCGCCGATATCTTCCGTGGGCAAATTGTCGATGTCACCGCCAATCTCTACACCATTCAATTGGTGGGCACGACCGAGAAACTCGATGCCTTTATTCACTCCATGGGTGAAGTGACCAAGGTAGTTGAAGTATCGCGCTCGGGCGTTGTGGGTCTGGCCCGCGGCGAGAAAGCGATGAGAGCATAGCGCTGAAATCCCCATTAACCCTATGTGAGCTTGCCATCGCAAGGCTTTGACATAGACCACATGGGATTTTAATTGGTGGGATAACAATAAGGGGAGCATTAAGCTCCCCTTCTTTATACGCGATATGCAGGATTAGGATTGACCATCTAATAGGCGCGAAGTGGTGATTTCAATCTTGCGTGGTTTTAACGCTTCTGGAATTTCACGCACTAAATCAATGTTCAATAAACCATTCTCTAAACTGGCACCCAATACGGTCACATAATCCGCCAGTTGGAAGGTACGTTCAAACCCGCGCTCGGCAATACCTTGGTATAAGTATTTACGCTCGGTTTGGCTTTCGGCCTTATTGCCTTTCACCAGCAATTTCTCACCTTCGCTACTGATCTCAAGCTCTTCCATCGAGAACCCGGCCACGGCCATAGTAATGCGATAACGATTTTCGCCAAGGAGTTCAATGTTATAGGGAGGATAACCTGAGTTGCCATTATTCGCGGCAGCATGTTCTGCCATCTGGGCTAAACGATCAAAACCAATAGCACTACGGTAAAGAGGAGTTAAATCATAAGTACGCATATTCATATCCTTGTCTTAAGCAATATGGAAAGTGGCGAACGTTTTTCAATTGAAAACGCATTTACCTCGCCTAAATTAACTTCTGTCCTAGCGGCACAGAATCGATTGGGATCCTCATCGAGCGATCCTGCAATAGATATAAGGATAGGAAAGCGGATTTCAAGTAAGGTTTTTAAAATTTTTTAAAAACTGAATAAAAATAATCTGTTACTAGCATTAAACACTTTATCTTTGCCGTAATGAGCAAAATGCCGCCAATAAAAAAGGAGCCCTTAGGCTCCCTTCTTCGGTCTCGCACTCAGATTAACGAGTACGTGGACACAGTTCTTCAGCACTGAAGAAATAAGCAATTTCACGTGCTGCTGATTCTAAAGAATCAGAACCGTGAGCTGCGTTTTCGTCGATGCTTTGAGCGAAATCAGCACGGATAGTACCAGGAGCCGCTTGAGCTGGGTTAGTTGCACCTAAAATTTCGCGGTGAGCTAAAACAGCGTTTTCGCCTTCTAATACTTGAACCATGATTGGGCCAGAAGTCATGAAAGCAACCAGAGCACCGAAGAAGCCACGCTCGCTATGCTCAGCATAGAAACCTTCAGCTTGTTCTTTAGTCAGGTGTAACATTTTAGCTGCAACGATCTTCAGACCAGCAGTTTCAAAACGGTTGTAGATTGCACCGATGTGGTTTTTAGCAACTGCATCAGGCTTAATGATAGAAAATGTGCGTTCGATCGCCATAACAAGCTTCCTTTGTCAGCAAGTTTGAAAAATTCGCGCGGATTATACGAAATTTAAAAAACAAATCCTATCTTTAAAGCAAATAAGTCTGAGATATTGCGCACATTGGCTGGATTTGTAACCAGATTGATCACACAAAAACGCCAGACCTGTCATTCGACTGGACTGGCGAAAGGATAACAACACAACTAAATTGGCATATTTGCCAAAGACTCAGGGGTTATTTTTTATTTTTTGCCCAAATGCTCGCCCACGCAATCAAGGCGACCAAAATAATGGCAGGGGCTAAATCGAGCAAATGATGGAAGATACCAACGCCACCATGACCAGGATGAGCCCATGCCAAGGCGGGTAACAGTGTGAATAACAAAGCAAACAAAAGTTTCATGTGAACATCCTTG
CGTAGCCAGAATAACATTAAGCGCTTAACCACTTAACTGATACATAACAAGGTACGTCATTCACTTAAAATGAATTTGATGCCGATCAAACTCCTACAAAAATCCCCTTTTAGGGATCACTAAACGTGACAATTGTCACGTAAGTACGGTTTTTGCGGGACAAGATGCCCCATCAGGGCGATGGTTATTTCTCTTCAATCCAGCATTGCTGAATGGCTTCTAATATCCGCTCATTACAATGCTTAGGATCGTCATCAAAATCATCGAGGGCTAATACCCACTCATATAACTGAGTAAAATGCAGCTTGTGGGGGTCAACCTCAGAATGCTTCTCCAGCAATTCTAAGGCGATATCTAATGAGTCAATCCACTTTAACGACATCCGCTCCCCCTTGTGCCATCTGCAGCTAAACCTTGCTCTCCTGAGTGTAGCTTAAGAACGCGTTAACTCTACGCTTGAGTGGGCTTTTTTGCTTTGAGGGCGTTCTTTTCAACAAACTCAATAATGGCACCAGCAACATCTTTTGCGGTAGCGCCTTCAATCCCTTCAAGGCCTGGCGAGGAGTTTACTTCCATCACCACAGGGCCATGATTTGAACGCAATAAATCGACACCCGCCACATTGAGCCCCATGGTCTTGGCCGCACGGATTGCAACTGAACGCTCTTCAGGCGTCAGTTTAACTAAACTTGCGGTTCCGCCACGGTGTAGGTTTGAACGAAACTCCCCAGGCATGGCTTGGCGTTTCATCGCAGCAATCACTTTATCGCCCAGTACAAAGCAGCGAATGTCGGCGCCATTGGCTTCTTTAATGTATTCCTGCACCATGATGTTGGCTTTCAATCCCATAAAAGCTTCAATCACGCTCTCAGCCGCTTTGCGAGTTTCGGCTAACACCACCCCAATGCCCTGCGTGCCCTCAAGCAACTTGATCACTAACGGCGCGCCGCCCACCATGTCGATAAGATCTGGAATATCGCTCGGCTTATTGGCAAAGCCGGTGATCGGCAAGCCGATACCACGGCGTGACATCAACTGCATCGAACGTAACTTATCGCGGGAGCGCGAAATGCCGACGGAATCATTGAGCGCATACACGCCCATCATCTCAAACTGGCGCAGCACAGCTGAGCCATAAAAAGTAATAGATGCACCAATACGCGGGATCACCGCATCAAAAGGCGGCAATTCTTCCCCACCAATATGGATGCTCGACTGACGCATATTGATGTTCATATAACATTCGAGCGGATTGATCACTTTGACTTCGTGACCACGTAATGTCGCAGCCTCAACGAGTCGCTTTGTGGAATATAGCTCAGGCCCTTGCGATAGGATCGCGATTCTCATTTTGGTCTCCGACAACCATTTCATTTTGAGCGCATCATACGCTCGGATTAGGATAAATCAATCATGGCAATTATTGCTTGCGCAAATGCTGGGCGCCAGAAGCCAAAAACCAGTCACATGGACTGGTTTTTGACGATATCATCGCATCGGGTTGATTAACCTTCGCTCACCATATTCACAGTGTACTTCGGAATTTCAATCACCAGATCGCTATCGACAACTTTGGCTTGGCAAGACAAGCGGCTCTCAGGCTCAAGTCCCCATGCCTTATCGAGCATATCATCTTCAAGCTCATCGCTTGGCTCTAACTCATCAAAACCTTCACGCACGATGCAATGGCATGTGGTGCAGGCACAGGATTTTTCACAGGCATGCTCAATATTGATGCCGTTACGCAGCGCCACATCGAGAATCGTTTCACCTACTTTGGCTTCCAACACTGCGCCATCGGGACATAACTCAGCATGGGGAAGAAAGACTAACTGGGGCATCACATCACCTATATGTTGTCAATTGACTGACCCTTAAAGGCCACTCGAATAGAATTGTCCATACGCTTCGCGGCGAAATCCTGAGTATGTTCATCGAGGACTTCAATCGCTTGCTTAATCGCATCGGCATCATCACCACGGGCAATTTCGGCAAGCTGC
GCCATGACTGAATCAATCTGCTGACGCTCATCGCTTGTCAGCAGGTCGCCATCTTTTGCAAGCGCCGCATTTAACGACTCAAGTACCCTTGCCGCCTCCACTTGCTGCTCGGCCAGCATACGACGACTAATGTCTTCCTTCGCATGCTTCATCGAGTCCTTGAGCATAGTAGCGATTTCAGTATCCGATAAACCAAACGAAGGTTTTACTTGAATACTCGATTGCACACCGGTGGATTTCTCCATCGCGGTAACGCTAAGCAACCCATCGGCATCGACTTGGAAGGTCACACGAATATGGGCAGCGCCCGCCGCTAACGGGGGAATACCTTTAAGCGTAAAGCGTGCGAGTGAGCGACAATCATCGACTAACTCACGCTCGCCCTGCACCACATGGAAGGCCATCGCCGTTTGACCATCCTTAAAGGTGGTAAACTCTTGGGCGCGGGCCACTGGAATCGTGGTGTTACGAGAAACCACTTTCTCCACTAAACCGCCCATGGTCTCAATCCCTAATGACAAAGGGATCACATCGAGCAGTAATAATTCAGACTCGGGTTTGTTACCGACCAGAATATCGGCCTGAATCGCCGCGCCAATCGCAACGACGCGATCTGGGTCGATAGAGGTTAATGGCGCCTTACCGAAAAAGGCTTCAACCTGTTCACGCACTAATGGCACGCGGGTTGAACCACCTACCATCACGGTTTCTAACACTTCATCAGCCGTTACGCCCGCATCACGTAGGGTGCGACGACAGCTGGCGATGGTTTTCTTCACTAATGCGCTGATTAAGTTATCAAATTCAGCTTTGGTCACGACTTGCTTGAGCACAGTGCCATCGGCAAGTGCCAGGTTCGCTTCGACGTCACTCGCATCGGTTAAGGCTTCTTTCACACGGCGCGCTTCAATCAATAACTGACGGCTTAGCTGCGGATCAAGGTTCGTCAATTGCCATACTTGCTGCATATGCGCCTGCAACAGATGGTCAAAGTCATCGCCGCCCAGCGCCGAATCGCCGCCCGTGGCCAGCACTTCAAAAACGCCGCGATTTAAACGCAGAATAGAGATATCAAAGGTACCGCCGCCGAGATCGTAAATCGCGATCACACCTTCTTGCTTAGAATCTAACCCGTAGGCGATTGCCGCCGCAGTAGGCTCGTTTAGCAGGCGCAATACTTTAACACCGAGCAAAGACGCGGCATCTTTTGTCCCTTGACGCTGTGCATCATCGAAATACGCTGGCACAGTGATCACGACACCCTGCAGCTCGCCACCAAGGGTTTTCTCGGCGCGTTCAACCAGAGGACGTAAAATCTCCGCTGACACTTGCACCGGATTCACTTGCCCTTGAGGCGTAACAAACAGCGGTAAACCGTTTTCACTGGCTTCAAACTGGTATGGGAACGCTTGCTCACCAGATTGAATATCCGTAAGGCTGCGGCCCATAAAGCGTTTGACCGATACGATAGTATTTTTAGGATCCTGCGCAGAACTTAACGCCGCCACATGTCCGACTTCAATGCCATCTTGAGTGTAACGGACGATAGAAGGTAAAGAGTGCTGCCCATTTTCATCGGGTAAGGTCGCCGTTACGCCGCTGCGAACTGCGGCCACGAGGGAATTAGTGGTACCTAAATCGATGCCAACGGCAAGTCTATGTTGGTGCGGCGCGGCACTTTGTCCGGGCTCTGCAATCTGCAAAAGGGCCATATGTATCCAATTTAGTAGCGCCGACTCAGACCCGTCCGAATCGGCTAAATAACTGCAATAGGGAAAATTAATCGAGCAACGCGTCTTCGATACGCGTCAGCTCATCGTGTAATTTTGCCATAAATTTAAGCTTGCGTACTTGATCTGCCGCTAACAAGGCATCTTCTTCGCTGTCACTGCTTAGCTGTGCGGTGAGCAATTTCGTCAGCTTTGTCCGGTATGCCGCGAAAGAGTCGTATAACTCATCAATGCTTTCTTGAGGATCGGCGCTATCGCGAATGTCTTCTA
ACGCCTCACGCCATTCCATCTGCTGCATCAAAAATGCAGTGTCTTTCACTGTTGTGGTTTCGTGGCTTAAATCGATACCACGTAGCGACAGCATATGCTCGGCACGGCGGATAGGATCTTTTAGGGTTTGATAACCGTCATTCACTTGCGCAGTGCGTTGCACCGACAATAAGCGTTGTTGCTCGGTATCGTTGGCAAACTTATCGGGATGCACCGCCCGTTGCAGTTCGCGGTAGCGTTCTGCAAGCACAGCGGTGTCAATATCGAAGGCAGGGGAAAACTTAAACAGCTCGAAATAATTCATGCGTCAATTCGGCAATTAAACAGTGAAACTCTCACCGCAACCGCACTCACCTTTTGCGTTAGGATTGTTAAATTGGAAACCTTCGTTCAGGCCTTCTTTAACAAAATCCAACTCAATGCCCTGTAGATAGATGAAGCTTTTCGCATCGATAATGATTTTAACACCATCAATATCATACACTTCATCGTCATCATTCAGCGCATCAACAAACTCAAGCACATAAGCCATACCGGAACAGCCAGAGGTCTTTAAGCCGAGGCGCAGGCCGATGCCCTTGCCTCGATTCGCTAAGAAAGATCTGACACGCTCAGTTGCCGCTGGGGTCATTGTAATCGCCATCTTAACTCCAGATGTTACTTAGCTTGTTTCGATTTGTACTCGTCGATCGCCGCTTTAATGGCATCTTCTGCCAAAATGGAACAGTGGATTTTCACTGGTGGCAAGGCTAATTCTTCCGCGATATCGGTGTTTTTGATGGCCGCCGCTTGCTCGATAGTCTTGCCTTTCACCCACTCGGTTACCAGTGAGCTAGACGCAATCGCGCTGCCACAACCGTAGGTTTTGAACTTAGCGTCTTCGATGATGCCGTCAGCGCCGATTTTCAGCTGTAACTTCATCACGTCACCACAGGCAGGTGCTCCCACCATACCGGTCACGACCGAAGGATCGTTCTTGTCGAAAGAACCAACGTTACGTGGGTTCTCATAATGATCTATCACTTTTTCACTGTAAGCCATGGTACTGCTCCAAAATTATCTATTGTAAATGGTCGAATTAATGATGTGCCCACTGCACCTGATTCAGGTCGATGCCATCTTTGAACATCTCCCACAATGGAGACATTTCTCTTAATTTATCAATAGATTGTGTAATAGTTTCGATAGCGTGGTCGATTTCTTCTTCGGTGGTAAAACGACCGATTGAGAAACGAATTGAGCTATGCGCCATTTCATCATTTAAACCGAGCGCACGTAACACGTAGCTTGGCTCGAGACTGGCTGACGTACAGGCAGAACCCGATGAAACCGCTAAATCTTTTAATGCCATCATCAGCGACTCACCTTCAACATAGTTGAAGCTGACGTTCAGGCTACCACTCACACGGTGAGTCATGTCGCCGTTTACATACGTCTCTTCGATGTGCTTGATGCCATTCCATAACTTGTCACGCAGTTTAGCGATACGGGCGTTGTCTGTTGCCATTTCAGCTTTGGCAATCGCAGCCGCTTCACCTAAACCAACGATTTGGTGAGTCGCCAGTGTACCACTGCGCATACCACGCTCATGACCACCGCCGTGCATTTGGGCTTCTAAGCGAATACGTGGCTTACGGCGAACGTACAGAGCACCAATCCCTTTTGGACCATACATTTTGTGGCCAGAGATAGAGATCAGGTCGACCTTAGTCGTTTGTACATCGATAGGTAATTTGCCCGCGCTTTGTGCAGCATCCATATGGAAGATAATGCCTTTTGAGCGGCATAATTCGCCGATGGCATCTACATCATGGATCACACCGATTTCGTTATTCACATGCATGATGCTGACAAGAATGGTGTCGTCGCGCATCGCCGCTTCTAAACGTTCCATCGGAATGATGCCGTTGGCATCTGGCTCTAAGTAAGTGACTTCAAAACCTTCACGCTCAAGCTGACGGCAGGTATCGAGTACCGCTTTATGTTCAGTCTTGCTGGTG
ATGATGTGCTTGCCTTTCTTATTGTAGAAATGCGCTACGCCTTTGATCGCCAGGTTGTTTGATTCGGTTGCACCTGAGGTAAACACGATTTCACGGTGATCGGCATTGATCAAATCAGCCACTTGGCTGCGAGCCACATCCACCGCTTCTTCCGCTTGCCAGCCATAACGGTGAGAACGTGATGCTGGGTTACCGAAGATGCCATCCATCGTCATGTATTGGAACATTTTTTCCGCGACTCGCGGATCAACTGGCGTGGTGGCGGCATAATCTAAATAGATAGGAAGCTTCAT
Protein sequences of DBSCAN-SWA_5 >LR134303|4943792:4954010|4952795_4954010_-|VEE64551.1|DBSCAN-SWA MKLPIYLDYAATTPVDPRVAEKMFQYMTMDGIFGNPASRSHRYGWQAEEAVDVARSQVADLINADHREIVFTSGATESNNLAIKGVAHFYNKKGKHIITSKTEHKAVLDTCRQLEREGFEVTYLEPDANGIIPMERLEAAMRDDTILVSIMHVNNEIGVIHDVDAIGELCRSKGIIFHMDAAQSAGKLPIDVQTTKVDLISISGHKMYGPKGIGALYVRRKPRIRLEAQMHGGGHERGMRSGTLATHQIVGLGEAAAIAKAEMATDNARIAKLRDKLWNGIKHIEETYVNGDMTHRVSGSLNVSFNYVEGESLMMALKDLAVSSGSACTSASLEPSYVLRALGLNDEMAHSSIRFSIGRFTTEEEIDHAIETITQSIDKLREMSPLWEMFKDGIDLNQVQWAHH >LR134303|4943792:4954010|4949222_4949558_-|VEE64546.1|DBSCAN-SWA MPQLVFLPHAELCPDGAVLEAKVGETILDVALRNGINIEHACEKSCACTTCHCIVREGFDELEPSDELEDDMLDKAWGLEPESRLSCQAKVVDSDLVIEIPKYTVNMVSEG >LR134303|4943792:4954010|4943792_4945511_+|VEE64539.1|DBSCAN-SWA MEKLSGASMIVRSLIDEGVKHIFGYPGGSVLDIYDALHLIPGIEHILVRHEQAAVHMADGYARATGKVGVVLVTSGPGATNAITGIATAYMDSIPLVVLSGQVPSNLIGNDAFQECDMIGISRPVVKHSFLVQDPTQIPEIIKKAFYIASTGRPGPVVVDLPKDCLNPALLHDYIYPESIKMRSYNPTTSGHKGQIRRGLQALLAAKKPVLYVGGGAIISSSEKQILALAEKLNIPVVSTLMGLGAFPGTHKNSLGMLGMHGRYEANMTMHNCDLIFGIGVRFDDRTTNNIEKYCPNATILHIDIDPSSISKTVRVDIPIVGSADIVLDSMLALLEESKESNDSEAINQWWQEITVWRNRNCLAYDKESHRIKPQQVIETLYKLTNGEAYVASDVGQHQMFAALYYPFDKPRHWINSGGLGTMGFGLPAAMGVKMAMPDETVVCVTGDGSIQMNIQELSTALQYDTPVKIINLNNRFLGMVKQWQDMIYSGRHSHSYMDSVPNFAKIAEAYGHVGMTISDPAELESKMAEALAMKDRLVFIDIMVDETEHVYPMLIRGGAMNEMWLSKTEKC >LR134303|4943792:4954010|4951496_4952021_-|VEE64548.1|DBSCAN-SWA MNYFELFKFSPAFDIDTAVLAERYRELQRAVHPDKFANDTEQQRLLSVQRTAQVNDGYQTLKDPIRRAEHMLSLRGIDLSHETTTVKDTAFLMQQMEWREALEDIRDSADPQESIDELYDSFAAYRTKLTKLLTAQLSSDSEEDALLAADQVRKLKFMAKLHDELTRIEDALLD >LR134303|4943792:4954010|4946884_4947316_-|VEE64542.1|DBSCAN-SWA MAIERTFSIIKPDAVAKNHIGAIYNRFETAGLKIVAAKMLHLTKEQAEGFYAEHSERGFFGALVAFMTSGPIMVQVLEGENAVLAHREILGATNPAQAAPGTIRADFAQSIDENAAHGSDSLESAAREIAYFFSAEELCPRTR >LR134303|4943792:4954010|4949566_4951429_-|VEE64547.1|DBSCAN-SWA 
MALLQIAEPGQSAAPHQHRLAVGIDLGTTNSLVAAVRSGVTATLPDENGQHSLPSIVRYTQDGIEVGHVAALSSAQDPKNTIVSVKRFMGRSLTDIQSGEQAFPYQFEASENGLPLFVTPQGQVNPVQVSAEILRPLVERAEKTLGGELQGVVITVPAYFDDAQRQGTKDAASLLGVKVLRLLNEPTAAAIAYGLDSKQEGVIAIYDLGGGTFDISILRLNRGVFEVLATGGDSALGGDDFDHLLQAHMQQVWQLTNLDPQLSRQLLIEARRVKEALTDASDVEANLALADGTVLKQVVTKAEFDNLISALVKKTIASCRRTLRDAGVTADEVLETVMVGGSTRVPLVREQVEAFFGKAPLTSIDPDRVVAIGAAIQADILVGNKPESELLLLDVIPLSLGIETMGGLVEKVVSRNTTIPVARAQEFTTFKDGQTAMAFHVVQGERELVDDCRSLARFTLKGIPPLAAGAAHIRVTFQVDADGLLSVTAMEKSTGVQSSIQVKPSFGLSDTEIATMLKDSMKHAKEDISRRMLAEQQVEAARVLESLNAALAKDGDLLTSDERQQIDSVMAQLAEIARGDDADAIKQAIEVLDEHTQDFAAKRMDNSIRVAFKGQSIDNI >LR134303|4943792:4954010|4946137_4946581_-|VEE64541.1|DBSCAN-SWA MRTYDLTPLYRSAIGFDRLAQMAEHAAANNGNSGYPPYNIELLGENRYRITMAVAGFSMEELEISSEGEKLLVKGNKAESQTERKYLYQGIAERGFERTFQLADYVTVLGASLENGLLNIDLVREIPEALKPRKIEITTSRLLDGQS >LR134303|4943792:4954010|4947536_4947704_-|VEE64543.1|DBSCAN-SWA MKLLFALLFTLLPALAWAHPGHGGVGIFHHLLDLAPAIILVALIAWASIWAKNKK >LR134303|4943792:4954010|4945510_4946005_+|VEE64540.1|DBSCAN-SWA MRRIISVLLENQSGALSRVVGLFSQRGYNIESLTVAPTDDNTLSRLNITVVADEKVLEQIEKQLHKLVDVLKVSNVTESAYIERELALVKVRAQGELREEIKRTADIFRGQIVDVTANLYTIQLVGTTEKLDAFIHSMGEVTKVVEVSRSGVVGLARGEKAMRA >LR134303|4943792:4954010|4952036_4952360_-|VEE64549.1|DBSCAN-SWA MAITMTPAATERVRSFLANRGKGIGLRLGLKTSGCSGMAYVLEFVDALNDDDEVYDIDGVKIIIDAKSFIYLQGIELDFVKEGLNEGFQFNNPNAKGECGCGESFTV >LR134303|4943792:4954010|4947901_4948099_-|VEE64544.1|DBSCAN-SWA MSLKWIDSLDIALELLEKHSEVDPHKLHFTQLYEWVLALDDFDDDPKHCNERILEAIQQCWIEEK >LR134303|4943792:4954010|4952374_4952758_-|VEE64550.1|DBSCAN-SWA MAYSEKVIDHYENPRNVGSFDKNDPSVVTGMVGAPACGDVMKLQLKIGADGIIEDAKFKTYGCGSAIASSSLVTEWVKGKTIEQAAAIKNTDIAEELALPPVKIHCSILAEDAIKAAIDEYKSKQAK >LR134303|4943792:4954010|4948167_4949067_-|VEE64545.1|DBSCAN-SWA 
MRIAILSQGPELYSTKRLVEAATLRGHEVKVINPLECYMNINMRQSSIHIGGEELPPFDAVIPRIGASITFYGSAVLRQFEMMGVYALNDSVGISRSRDKLRSMQLMSRRGIGLPITGFANKPSDIPDLIDMVGGAPLVIKLLEGTQGIGVVLAETRKAAESVIEAFMGLKANIMVQEYIKEANGADIRCFVLGDKVIAAMKRQAMPGEFRSNLHRGGTASLVKLTPEERSVAIRAAKTMGLNVAGVDLLRSNHGPVVMEVNSSPGLEGIEGATAKDVAGAIIEFVEKNALKAKKPTQA |
13 | Yellowstone_lake_phycodnavirus(12.5%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
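
The protein headers in the prophage sections above appear to follow a pipe-delimited layout, with an optional extra field (for example `tRNA` or `protease`) on proteins that matched a phage-related keyword. The following is a minimal Python sketch for parsing such headers. The field layout is an assumption inferred from the headers shown on this page, not taken from DBSCAN-SWA documentation, and `ProteinHeader` and `parse_header` are hypothetical names introduced only for illustration.

```python
from typing import NamedTuple, Optional


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    start: int
    end: int
    strand: str
    protein_id: str
    keyword: Optional[str]  # e.g. "tRNA" or "protease"; None if no keyword field


def parse_header(header: str) -> ProteinHeader:
    """Parse one '>'-prefixed protein header from a prophage section.

    Assumed layout (inferred from this page, hypothetical):
    contig | region_start:region_end | start_end_strand | protein_id | [keyword] | tool
    """
    fields = header.lstrip(">").strip().split("|")
    contig, region, coords, protein_id = fields[:4]
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = coords.split("_")
    # Anything between the protein ID and the trailing tool name is
    # treated as the keyword field.
    extra = fields[4:-1]
    keyword = extra[0] if extra else None
    return ProteinHeader(
        contig, region_start, region_end,
        int(start), int(end), strand, protein_id, keyword,
    )


flagged = parse_header(
    ">LR134303|4687569:4698866|4695657_4696317_-|VEE64319.1|protease|DBSCAN-SWA"
)
plain = parse_header(
    ">LR134303|4687569:4698866|4690383_4691004_+|VEE64311.1|DBSCAN-SWA"
)
print(flagged.protein_id, flagged.keyword)
print(plain.protein_id, plain.keyword)
```

Filtering the parsed records on `keyword is not None` would then list the proteins that the page displays as colored, under the same layout assumption.
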
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|