Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
LR131272 | Kocuria rosea strain NCTC2676_1 genome assembly, chromosome: 1 | 1 crisprs | cas3,DinG,csa3,WYL,DEDDh,RT | 0 | 0 | 4 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR131272_2 | 734514-734614 | Orphan |
NA
Consensus repeat of LR131272_2
|
1 spacers
spacers of LR131272_2
>2.1|734541|47|LR131272|CRISPRCasFinder CAGGTCAGGGGGCGTGGACCGCGTGGAGGTCAGGGTCGTTGACCACG |
CRISPR arrays and Neighbor proteins around LR131272_2
The CRISPR arrays of LR131272_2 >merge|LR131272|2|734514-734614|CRISPRCasFinder TGCAGGTCAGCTCCTGCCGACCTCCCGCAGGTCAGGGGGCGTGGACCGCGTGGAGGTCAGGGTCGTTGACCACGTGCAGGTCTGCTCCTGCCGACCTCCCG >LR131272|2|2|734514-734614|CRISPRCasFinder TGCAGGTCAGCTCCTGCCGACCTCCCG CAGGTCAGGGGGCGTGGACCGCGTGGAGGTCAGGGTCGTTGACCACG TGCAGGTCTGCTCCTGCCGACCTCCCG
>LR131272.1|VDR31422.1|733841_734444_+|Uncharacterised-protein MNLTRIAAGAAVTLLALTGCAGTSEQAPATPADTTPSPASSSAPAANIERPATNDLVDDASPEGARAFLYHYFELKAYVLQTGDAAALVELTDGAEAEEAEAARLQAVYDDGGWVLGGQPKVKNVLLTTPEDEIAEGVDVTAVIPVNPDAYTEFSADGAITEQRPFSPDGTVYSATVRYSDGAWKLASLEETPDAELPEA >LR131272.1|VDR31421.1|731788_733690_-|Alpha-amylase-precursor MRPRHATVVVLTALVAGSMTSLPAHAGPDKPGSAPPTTAKHSLRSAVTDENFYFVMADRFQNGDPANDDGGLGPDPSVSGFDPTSKGYYNGGDLAGLLQELDYIQGMGTTSIWLTPSFKNRAVQPEDNSAGYHGYWITDFTQIDPHLGTNEDLRTLIDAAHARGMKVYFDIITNHTADVIAYDAGERAPYVSKDEEPYRDAEGNAFDDRDVAGTGAFPELDAATSFPYVPVVPAADRDLKVPDWLNDPTLYHNRGDTTFAGEDSFYGDFFGLDDLFTEHPRVVDGMIDIYKTWIADFGVDGFRIDTMKHVNDEFWQQFGPEVLSYARAQGKDEFFMFGEVFDTSKAFTSQFTTRNRMQSVLDFPFQAAARGYASQGAPASELETFFAGDDWYTDADSNAYSLPTFLGNHDMGRIGGFITADNPGASDDELLARDTLAHELMYFSRGNPVIYYGDEQGFTGPGGDQDARQTLFASQVPDYLDDDLLGTDATHAEDTFDVDHPLYRGIASLAGVTAEHPALRNGAHQHRYASEGPGIYAFSRTDATDQREYVVALNNSTEPQTAAIPTYAAKRSFGLVYGDASQRAKSAADGTLTVTVPALSTVVYELSGRVPPRRPRRPSRCRPRPPRPRTTGA >LR131272.1|VDR31420.1|730973_731792_-|Alpha-amylase-precursor MRIAADVAGSSFYEVTFEARTAGAEWTGIGTDDNAPYSVYHDTSALPAGTAVEYRAVVLDNGRHSRTSATASAAVPAPKPTMGLPAEGTQVQGAVDVTATVSPERSDYAVTLERSVAGGTWTAIGTDESSPTYSAVDDLTALELADGTQVRYRAVLGTSVSDVRTVTVGSIVDQPGTVAVAGSLDSELGCGGDWDPACDAAQMTFDDASQAWLLEVQDLPAGSYEYKAALNRAWTENYGAGGAPDGANIVLNHPGGAITFVYDHRTKVIGTR >LR131272.1|VDR31419.1|729893_730910_+|Magnesium-transport-protein-CorA MSLVANAVYVDGKKRIDPDTLETTFELMRTSGGMGWIGLYRPSAEEISAVAAEFGLHPLAVEDATNGHQRAKLERYGETLFLVLRPARYLDVEEKVEFGELHLFIGHDFVVTIRHAESPDLGIVRRRLEADPELLGTGPQAVLYALLDQVVDEYEPVVLGLENDIDEIENELFGGAPDVSRRIYELHREVIQFQRATQPLQTMMESLLRGSEKYQLEAELSRNLRDVQDHVIRVVERVNTFRALLQNALTVHSTLVAQRQNEEMTRLTETSLSQGEEVKRISSWAAILFAPTLIASIYGMNFDVMPELHWAIGYPFALLLMLGMGVGLWWTFRRNDWL >LR131272.1|VDR31418.1|729300_729894_+|Phosphoribosylglycinamide-formyltransferase MRIVVLVSGTGSNLQAVMDAVEDGSLPVTIAAVGADRPGTGGVRRAAAAGIETFEVDFRAFADRAAWNRALTEAAAAHEPDYVVSSGFMRILDQQFLDRFPDRYLNTHPALLPSFPGAHGVRDALAYGVKVTGCTVMIADAGVDTGPILAQAAVEVRPDDTEDSLHERIKIEERRLLIETLQNLAARLLPASGTADA >LR131272.1|VDR31417.1|728803_729244_-|Uncharacterised-protein MTQRDTGPVDAGRGRTDGPGRGEDGRTRAAASARRWLRLFLLALVGALLTTGLVLPWKVIALVLSLFALTAGVVALVKALAAKMPRLVVLTTSIGLVGALFLAVGTGASVLLWPVTKTYEDCMARALTLRAESDCRDGLTRLDGLG >LR131272.1|VDR31416.1|727467_728757_+|Uncharacterised-protein MEVMKLPSKPRALPMPLWLQGVVELGQAAVISALLVLLPVAAVWLTGGFADRTPESAARLAGQGWLVMHGVPLVLQFPPGVAGEEPASGVLHVIPLGLVLIPLLLAWRAGRRLARASYTDQLWQALLGALVTYALIGAGIAYLSVTADASAHVVTGALVPPVSAGIGLITGAYREAGAWSRLVGVDFAAWVGRTSQHSRWAGSYAWSVLRSGFLAVLVALSLSAVLLAVAIGLNWAGIAAIYERLDGGIAGASVVTLFQLGLLPNLAVWTMSWSSGAGFALGTGSSLTPLGSAVGPLPALPIFGALPAGTLEYGYAALAIPVLAGLLAGWWFFREGENHLDEWLVLHSPRRWLTWTASTFSLAVLIGLAAGLGGAFTALVSRASVGLGRFTDLGPDPLVVGGWLALEVAVGAVLGHAVGPLLERDPSRR >LR131272.1|VDR31415.1|726597_727359_+|Sulfite-exporter-TauE/SafE MARVTLGIFCIVLASILVGAVAQRIAGLGFALLIAPFLVIILGPHEGVLLVNICGVVSSSIIVGRVWKDIDWSMFRWLVVPSLFGSVPGSFLAVAVPSAPLSVTVGSVVLVALTVSLMLQRSAVVVRGSVPKVVAGFTAGVTNSMAGVGGPAVSAYALLSRWPQRPFAATLQPFFVCIGSVTLVVKLLLDPSQAPVLAAWMWVAIGVAIVAGIFTGEKLSRFVRDDQARLFVIVIAFIGAGLAVVKGLVDILG >LR131272.1|VDR31414.1|725890_726526_-|Protein-of-uncharacterised-function-(DUF1684) MTALTTTLATADWRRRVFGLYEDVRRCAAEDSPEAAHGLWQRGRNDLLRDHPASALHAGARTGFTGLEVAAYDPTFRFEVAVDDAGAGEVMDVATGTDGVVPFRRLGTLVLPGLGTLALWKLASYGGGLFLPLRDATAGTPGGTYGGGRYVLDTVKGAHLGEGRTPGSLVVDLNFAYNPSCAYDEQWACPLPGPDNRLTAEVPVGELYRTY >LR131272.1|VDR31413.1|724138_725764_-|Capsular-polysaccharide-phosphotransferase-SacB MRAKPLPEVPITDTAAQHDIYFGAPEPENEAEILESASRAAVARFEGRSDITTVKRRFTLVNSDRTPHQAMVEDLLFIRAALDDAGIDFLLVRGNDQRPVIAVDLADRDRLRTALVEACRDEPFYSRTVDTKRRTTLLVSDGELSKSGKARIFRLFRPRIEPLGGLAYGPSAGVQIELWSLGDTAIELPVENSLTRRTLPAAEVVRGEVERHGHTWPTIDNMFADHATDIDFDIDLVFSWVDGSSPEYQAARAARMQGVVVGEGDDHEARFRQIDELKYALRSVYMFAPWVRRIFIATDSDRPAWLADHPSVTFVPSEEHFRDPSVLPTHNSQAVEAQLQHIPGLSEYFLYSNDDMFFGRPVAPDMFFSPGGVTKFIEADTRIGLGDNDPERSGFENAARVNRRLLHERFGRITTRHLEHTAAPLRKSVLLEMENEFAAEFAATAASRFRAKDNISVTNSLYHYYALLTGRAVTQESARVAYVDTTQRSGLKSLNKMLDKRNHDFFCLNDGSFPEVPAEERARLVTDFLEKYFPVKAPWEI >LR131272.1|VDR31423.1|734637_736146_-|Putative-niacin/nicotinamide-transporter-NaiP MEGAVAGHFCPLGPAAGTDGTHAGQGTTSAAGGRSHRRRYSFREYRPLHEGIMSINTQLPTGDQVVQDLPWRWKVQGKIFLIGGLGFMFDAWDVTLNAYLIPLLIADWGLVPGQAAWIATSNLIGMAVGAFVWGSVADLIGRKKAFTLTLLVFSIFTVLGAFSPDIVWFCVFRFLAGFGLGGCIPVDYALVGEFTPRRQRGRVLTAMDAWWPVGAFLCGVVTTLVVAQTGDWRYAMLVMVLPALLVFWVRRGVPESPLYLVQRGRGEEARAVIDGLVARTGGEQRAWRLPEPEETPKLSLGSISGQLTGLWRFNWRITLTAWSLFLTILLVYYGALTWMPRILIASGYAQSVAFITTTFMTGVGFLGVVAAALLVERVGRKWLLAITGPGSAVLLVVFALTLDLPAVATAWLLAFGFVVQIAIPVLYTYVSELYPTELRGSGFGWASTISRIGAGLVPLIFGSLLWPYLGLPLTFALIGALVVLAVVFMAFNAPETRAAKLR >LR131272.1|VDR31424.1|736189_737179_+|Uncharacterised-protein MENVLGSGARAHDGSMPTDPYATALLNLALKRCFSVEAARDLGIPARVLRRARYRRATRSLRVLDAAPADLSDVVACLGSLTPGTVASHQTAAVLWGLPLPLRSADGLLHLTRSTGCSRPRRSGVVGHVSRLGAGDVVTAYGVPLTSPERTWCDLAATLSRPELVALGDALLRRWDAPRRPAAINEPDPLSSVEALAAAPARRSGARGAATARAALPLLRSGVDSAPESLLRLLIVDAGLPEPEVNQWILDSAGRRVSRPDLQYRARRIALEYEGEHHLTDPRQWARDIERDDRLRALGWIVLRFTKRHLGAGRDDGLARIRHALALRP >LR131272.1|VDR31425.1|737214_738930_+|Bifunctional-purine-biosynthesis-protein-PurH MTRSPVISVLSRPEPAPASDAYRLVADPQRSKMTEICVSLTHLDRVPIRRALISVFDKTGLEELAQGLHRAGVAIVSTGSTAQRIAAAGVPVTEVSEVTGFSETLDGRVKTLHPKVHAGILADRRRQEHIDQLAELDIEAFDLVVVNLYPFVETVRSGAEPDAVVEQIDIGGPAMVRSAAKNHPSVAIVVDPARYADVVDAAAEGGFDLVARRRLAALAFAHTAAYDTAVAAWTAEQFETDDGAFAFPGYAGLALERSEVLRYGENPHQQAALYVEKGATPGIAQADQLHGKSMSYNNFVDADAALRAAFDFDEPAVAIIKHANPCGVAVGSADAADPIADAHAKAHACDPVSAFGGVIAANREVTAGMAATVRDIFTEVVIAPSFSAEAVEILTQKKNIRLLTLPEGYGRNPVEFRQVSGGVLMQVADTLDADGDDPSGWTLAAGEAADAATLADLAFAWKACRAAKSNAILLASDGASVGVGMGQVNRVDSCRLAVERANSLADSERARGAVAASDAFFPFADGLQILLDAGVRAIVQPGGSVRDQEVIDAANAAGATLYLTGARHFFH >LR131272.1|VDR31426.1|739108_741331_+|Isocitrate-dehydrogenase-[NADP] MAKIIYTHTDEAPMLATYSFLPIIEAFASTAGVEVETRDISLSGRIIALFTDRLPADQQMADALAELGALAKTPDANIIKLPNISASIPQLKAAIAELQGQGYAIPDYPDHPSSDEERDVRARYDKVKGSAVNPVLREGNSDRRAPASVKNYARQNPHSMGAWTPESKTNVAHMTSDDFRSNEQSVVIPADTTVSIQHVDAEGNVTELKGSFPVLAGEVLDGTVLRADALNAFLKAQVARAKDEGVLFSAHLKATMMKVSDPIIFGHVVRAFLPELFETYGEQLSKAGLSPNNGLASIIGGLDKLPEDVREGVRQAIAQGMEDGPAIAMVDSDKGITNLHVPSDVIVDASMPAMIRTSGHMWGPDGEEADTLAVIPDSSYAGIYQVVIDDCRAHGAFDPTTMGTVPNVGLMAQAAEEYGSHDKTFEIPADGKVQVVDADGAVLIEHDVAPGDIWRACQTKDAAILDWVKLAMTRARASATPAVFWLDEGRAHDANLIAKVKEYLAEHGTDDVTLEIMTPVDATAFTLKRIREGADTISVTGNVLRDYLTDLFPILELGTSAKMLSVVPLMNGGGLFETGAGGSAPKHVQQLVRENHLRWDSLGEFLALAVSFEHLATTTGNKRAQILADTLDRATGTFLLENKSPSRRVGEIDNRGSHYFLARYWAEELARQEQDAELAESFARVAEALTGNEDAIVSELLAVQGQPADIGGYFHPDVIKVSRVMRPSATLSEVLDILAV >LR131272.1|VDR31427.1|741411_742539_-|1,5-anhydro-D-fructose-reductase MLVWSVDAAPGSGKDGWMNPQTRIAVPPCAEPGAPSPIAATGRALKWGVIATGGIASKVTADIALLEDAVLHAVSSRSTESAAAFAERFGFATSYGNDGGVDGYQRLVDDPEVDVVYITTPHGQHYDVAKAALIGGKHVLCEKPFTINAAEAEELAALAADRGLFLMEAVWTRFLPSVNRAWEIIHSGDLGDVRWIQADLGFSAPDDPTSRLWDPAAGGGALLDLTVYPLTWALGSLGFPDSVSAVGTLNDDGVDLQTAVTLTYEHGAYAQLSSSFIASCPGQATVSGSKGWLKTGGGPLHNPKELTVSTGQGDPRVEHFEQVGAGYTYELREVTRCIQAGLTESPTMPVADTVRTMRLLDGVRAQIGLRYANDA >LR131272.1|VDR31428.1|742685_743798_-|Multiple-sugar-binding-periplasmic-protein-sbpA-precursor MRNFAKSLAALTAISALALTSCGREDAGTDTGTDTGTAAASTCEGFEDGAAIGVALPQKTSENWVLAEQLFNEGLSDAGYEPSVQFANGGVSEQQAQINAMITNGVEVLVVGAIDGAQLGNQLQQAKDAGITVLAYDRLLTNTENVDYYVAYDNFKVGELQGQALLDGLAERKGEAPYNIELFAGSPDDANAQVFFDGAMSVLQPKIDDGTLNVVSGQTAFDQAVTQGWKAENAQKRADTLLSGNYASEDLDGVLSPNDTLARAVLTSVKAAGKDIPVVTGQDSEVESVKSIVAGEQYSTINKDTRDLVEHTITMVEGLGACEEIEVNDTDSYDNGVKTVPAYLLEPQIVTEENAADAYADDPTLSEITK >LR131272.1|VDR31429.1|743842_745135_-|Xylose-transport-system-permease-protein-xylH MNSLKQMFGGNTRQFGMIFALIALILFFQWQTGGNTLTPTNVINLFNGNAYILILAIGMVLVIIAGHIDLSVGSVAAFSGVVVAIVIRDWGIPWYLGIVVGLLLGALIGAWQGFWVAYVGIPAFIVTLAGMLLFRGANQYVGESNTIPVPQAFQYIGAGYLPEVGPVTGYNNLTVLLGLLAVAFFVYQEFRSRRKAAQLGSQLPPLWVSVAKLVLLSAAILYVTSLFATGRPGTSFPIPGLILAVLVIIFAFVSNKTVLGRHVYAVGGNRHAAELSGVQSKRVNFMVMMNMSILASLAGMIFVGRSTASGPFDGVGWELDAIAAVFIGGAAVTGGVGTVIGSVIGGLVMAVLNNGLQLMGIGADMTQIIKGLVLLIAVAFDVYNKSQGKPSITGLLTRNFGSRSKPDPTPYGSSQTEQPVGTKEKISHDV >LR131272.1|VDR31430.1|745157_746696_-|Xylose-import-ATP-binding-protein-XylG MNNHTILEMRSITKEFPGVKALADVSIEVQAGEIHAICGENGAGKSTLMKVLSGLYPYGDYSGQIFFQGKEVQFKDIRSSEAAGIVIIHQELALIPELSIAENIFLGNEPTRFGVIDWDYVNRTTLELMARVGLSEDPVTKVKDIGVGKQQLVEIAKALNKSVKLLILDEPTAALNESDSQHLLDLMAGLRSKGISCIMISHKLNEIEQIADSITIIRDGRSIETLHVANDGVDEDRIIRGMVGRSLESRFPDHTPTIGETFFEVRNWTVGHPNIPDRLVCKNSNFHVRRGEIVGFAGLMGAGRTELARSVFGRSYGTFKSGQILIDGREVTLRTVPQAINAGLAYVTEDRKSLGLNLLDDIKSTTVSANLRKITHGLVVDDDEEYRVAEEYRTSLRTKTPSVNEGVSKLSGGNQQKVVLAKWMFTDPELLILDEPTRGIDVGAKYEIYGIIQKLADQGKGVIVISSELPELLGLSDRIYTIFEGSITGEVAREDASQEALMKLMTASRKSA >LR131272.1|VDR31431.1|746843_746939_-|Uncharacterised-protein MSSLGDRAEALGAAAVVLAQPGLSPALAVTA >LR131272.1|VDR31432.1|747028_748075_-|Making-large-colonies-protein MPAESPSMARPNPSASLPKPGSQSALRERNQQRVIAALMSGGPQTQAELSRQTGLSTATVSNIVKVMASTGIVSTAPTTSSGRRALSVILNETGQVAAGIDIGRRHLRVVLATPTYRVLQEAAVALPLGHSAMDGLTAASELLDTLLESGGIPRSALLGAGIGIPGPIDRRTGTVVQGAILPEWVGINIHETFSEQLQVPVLIDNDANLGALAQVTWGPHGAVDNLMFMKVASGIGSGLVLNGTLYYGNVGVTGELGHTTINEQGAICRCGNRGCLETVASTSTMIDLLGHRGRDAGEPVDTRQIIDWALTGDTATLRVIDDAGTAIGRALAHMANLINPETIVIGGR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
524283 : 533394
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >LR131272|524283:533394|DBSCAN-SWA ACTAGGGGGTGCGGACGACGGCGGACGTCCTCGCCCACTCGATCGGGTCAGCTCCCTGCTCGGCGAGCAGGCTGTTGACGCGGGTGAACGGCCGTGAGCCGAAGAACCCCCTCGACGCCGACAACGGGCTGGGATGCGCGGATTCGATGGTCGGTGTGGCGCCCAGGAGCGGCTTGAGCCGCTGGGCGTCCTTGCCCCACAGCACGGCGACGAGCGGGCCGCCCCTGGCGACGAGTGCGCGGACGGCGGCGTCGGTGACCTGTTCCCAGCCGAGTCCGCGATGCGATGCGGCCTCGCCGGCGCGCACCGTCAGGACCCGGTTCAGCAGGAGTACCCCCTGCGCGGCCCAGCCGGAAAGATCTCCGGATGCGGGGACGGCGGCGCCAGTGTCGGAGGAGAGTTCCCGGTAGATGTTGGCAAGGCTCCGGGGGACGGGACGCACGGCCGGATGCACGGAAAAGCTCAACCCCACGGCATGCCCCACGGTGGGGTAGGGGTCCTGCCCCACGATGACCACCCGCACCTCTGCGAGGCGAGTGGTGAAGGCGCGGAGGATGTCGTGGGAGTCGGGCAGGACACGCGCTCCGCGAGCGAGATCTGCGGCCAGGGCGGTGGCCAGTGCGTGCAGCACGGGTTGCAGCGGCTCGAGTGCCGGCACCCAGTCCGGCGCGACGAGATCAGCGACGCCGGCAGGGGCCGCGAAGGGGAAGACGGATTCGTCGGCGGAGGACGTCGGTGCCCCGGCGGCGGGGGTGACGGGAAGGTCGAAGAGTGCGCCGTCGTCGTGGGTCATGCCTCCATTCTGACCTGCCCGGCGGTGGCGTCCTGCGCGAATGACAGGCATGTAATTCTTCTCTACCATGGGGTGACACCCGAGCTGGAGATGGAAGGCGGTAGCAGGTATGGCAGAGGCACGACGACAGGCGGACAGCCCCCACCCCGCACCGGATGCCCGCCTGACCCTGACCGACCGCGAGCAGCGCATGCTCGCCCTCGAACGCGAGTGGTGGAAGTACTCGGGAGCCAAGGAACAGGCCATCCGCGATCTCTTCACGCTGTCCGCGACGCACTATTACCAGCTGCTGAACGCGCTGATCGACACGGAAGCGGCGCTCGCGCATGATCCCATGCTCGTGAAGCGGCTGCGTCGCCTCCGGTCCACACGCCAGAAGGTCCGGTCGGCACGGCGCCTCGGCGCGGACATCTAGCAGTACCGCATGCAGGACCCGCATCCGGACGAGCCCACGTCATGACTGACCACCCGAGGGACGAGTTCGACGCGGTCCCCGAGACATCGGACCGCCAGGGCGTGCATCGGGAGCACCTCGATCCTGCTCGCTCCAGCGGTCTGGGGCTGAAGATCACGGTCGGCGTCCTCGCGCTCCTGATCGTCCTCGGTGTCTTCTTCCTGCTCCCTCGGCTCGGCCTCTTCGGAGGGAACGACACGGGTCCGGGTGCCGCCCCCGCCTCGAGCACGACGCCGGGTCCGTCCGCGGATGCCCTGGCTCCGGACCCGACGGGCGGTGCGTCAGGGGGCCCCGCGGCGTCGGCACCCGACAGTGATCCCGGAGTGGCATCCGCCACGGCGACCGATGCCGGGAACGACGCGGGGGACGATCCGGACCGCGACCAGACCGTCAACGTGCTCAACGGCACCGGCGCTCCCGGCCTCGCGACCGTCGCGGCGGAGCGTCTGGCCACCGCCGGATGGACGGCTGCGCTGCCCGGGAACTGGGCCGGAGCGCCGCTCGACTCCTCCGTGGTCTTCTACAACGGGGAGGCGCAACGCGAGGCCGCCGAGTCCGTCGCGGCGGACCTCGACATCGCGACCGTGGTCGAGAGCCCGGACGTCTCTCCGGACATCTCGGTGGTGCTGGGCCCCGACTTCGAGTAGCCGCAGGTGCGACGGCGGGCCTTCAGATCGAACGGGAACCGGATCGGCCTCGGGACCCGGCATCGGTGCACGCACGTGCTCAGTCCTTGAGGGGAATGCCCTTCGCGTCCCGTTGGAACTGCTCCGATCCCACCAGGATCATGATGCCCTCGATCAGTCCCCAGATTCCGGCGGCGATCGCGCTCAGACCGAAGGTGAGGAACGCGCCCACGGTGGACATGAGGATCAGGATGACGCCGATCGTGGTGAAACCGAGGTAGAAGCGGTGGATGCCGAGCGATCCCAGCAGGATCCCGAGGACACCGGCCACCACCCGGGACTTCTGGGGAGCGTAGGGCCGGCCGGAGGGGCCCTGAGCGAACCCACCCTGCCCGTAGGGGGCCGGCTGGTACGGGCCTGGCTGGTACGAGCCCGGCTGGTACGGGCCCTGCTGGTACGAGCCCGGCTGGTACGGGCCTTGCTGGTACGAGCCCGGCTGGTAGGGGCCCGGCTGGGACGGGCCCGGACCGTGCCCGTTCTGCGGGTACCCGTACGGCGGGTGGTCAGCCGGGCCGATACCCTCGCCCGAGGGCAGAGGCCCCGACGGCCGGTCGTGCCGGGTGCCGGCATCCGCCCGTGGTGGGGCAGCGGGGGAGGATGCGGGTTCCTCACCAGGGGCGCTGCCCGCCGCTCCCTGCACGTCGTCTTTCCCGTTGTCCGGACCGGATGCTGACGTCGACACGATGCCTCCTGTTGCTCGATGTCCGCCCAACCTATGCCCTGCGAGAAATCGGCGCCATGGTTGGGGCTCCCGTCCCCTCACCACGGCCTGCCGAGCCCGTCTTCGTGGTCCCCCCGACTCCCCTCAGCAGGTTGCACTCGGGGGGTGGGAGTGCTAATTATGGAGTTAGCACTCGCGCGCGCAGACTGCTAAAGCTGCGCGTGGCGGGTGAAGCAGGCAAACCTCTGTGGTGGGGGTGTCGCCCTGGTCGTGAAAGAGATCGTCCAGGTGATACCCGGCCGTCGCGGGCACCACACGCAGGCAAGCATTCAAGTGCTGAAATGCAACCGTCCCGAAAGGACCTACGCCGTTATGGCCAAGATCATTTCTTTTGATGAAGAGGCCCGCCGCGGCCTCGAGCGAGGCCTCAATATCCTCGCTGATGCCGTCAAGGTAACGCTCGGCCCGCGTGGCCGCAATGTCGTCCTCGAGAAGAAGTGGGGCGCCCCCACCATCACGAACGACGGCGTCTCAATCGCCAAGGAGATCGAGCTCGACGACCCGTTCGAGAAGATCGGCGCGGAGCTCGTCAAGGAAGTTGCCAAGAAGACGGACGACGTCGCAGGCGACGGAACCACCACCGCCACGGTGCTCGCCCAGGCGCTCGTCAAGGAAGGCCTGCGCAACGTCGCGGCCGGAGCGGACCCCCTGAGCCTCAAGCGCGGCATCGAGAAGGCCGTCAAGGCCGTCACCGATGAGCTGCTCGCCTCCGCCAAGGAGATCGAGACCAAGGAGCAGATCGCTGCCACCGCCTCCATCTCGGCCGGTGACAAGCAGATCGGTGACCTGATCGCCGAGGCCCTCGACAAGGTGGGCAAGGAAGGCGTCATCACCGTCGAGGAGTCGAACACCTTCGGCCTCGAGCTCGAACTCACCGAGGGCATGCGCTTCGACAAGGGTTATATCTCGGGCTACTTCGTCACGGACGCCGAGCGCCAGGAGACGGTCCTCGAGGACCCGTACATCCTGATCGTCAACTCGAAGATCAGCAACGTCAAGGACCTCGTCGCGGTCCTCGAGAAGGTCATGCAGTCCGGCAAGCCGCTGCTGATCATCGCCGAGGACATCGAAGGCGAAGCCCTCGCGACCCTCGTGGTCAACAAGATCCGCGGCACCTTCAAGTCCGTCGCCGTCAAGGCCCCGGGCTTCGGTGACCGTCGCAAGGCGCAGCTCGCCGACATCGCCATCCTCACGGGCGGCCAGGTCATCGCCGAGGAGGTCGGCCTCAAGCTCGAGACCGCGACCCTCGACCTGCTGGGTACCGCCCGCAAGGTCGTCGTGACCAAGGACGAGACCACGATCGTCGAGGGAGCCGGCGACGCCGACGCCATCGCCGGCCGCGTGGCCCAGATCCGCGCGGAGATCGACAACTCCGATTCCGACTACGACCGTGAGAAGCTGCAGGAGCGCCTGGCCAAGCTGGCCGGTGGCGTTGCAGTCATCAAGGCCGGAGCGGCCACGGAGGTCGAGCTCAAGGAGCGCAAGCACCGCATCGAGGACGCCGTCCGCAACGCGAAGGCTGCCGTCGAAGAGGGCATCGTCGCCGGTGGCGGCGTGGCCCTCATCCAGGCCGGCGCCAAGGCGTTCGCCAACCTCGAGCTCACGGGTGACGAGGCGACCGGCGCGAACATCGTGCGCGTCGCCATCGACGCCCCGCTCAAGCAGATCGCCTTCAACGCCGGCCTCGAGCCCGGCGTCGTGGTCGACAAGGTCCGCGGCCTGCCCGCCGGTCACGGCCTCAACGCCGCAACCGGCGAGTACGAGGACCTGCTGGCCGCCGGCGTCAATGACCCCGTGAAGGTCACCCGCTCGGCCCTGCAGAACGCGGCGTCGATCGCCGGCCTGTTCCTCACCACCGAGGCCGTCGTGGCCGACAAGCCGGAGAAGGCAGCGCCTGCCGGTGGCGGCGGGGACGACATGGGCGGCATGGGCGGGATGGGCGGCTTCTAGCCTCCCCCCGCATCGATCGCACCACCTGCACAGCAGAAGAGGGCGGCACCCCTGACCGGGGGTGCCGCCCTCTTCTTCGTCCTGACTGTCACACGGGAACGGGGCAGGACACGGGCTAGGCTCGGCGCCCGGCAGGGCGTTCGCCGTCCTGCGGCGATCGGCAGGTGTCAGCTCATGAACATACGCGCGTTGCTCTGCTCCGCTTCCGCGTACTGCTGCGTCGCGACGGTGAGCGCCTGGTTGATGCCGGTGAGCGACTCCTCGACCTGCTGCTGCGTGGCCCTCCATTGCACGATGAGCTGCTGGAAGTTCGACGCGGCCTGGCCCGTCCACAGACCCTCCAGTTCACGGAGCCCCGCCTGCATCGAATCGACCTCCGCCTGAAGCCGCGAGATGGTGCCCTGGACCTGGCTGCTCTTGGCCGCGAGGCTGTCGGTGTCTACGTTGAAGAATGCCATTGCTGTCTCCCCTGGGATCGATCGCACGTGCTGTGACCCCTCGACGGTAGGTGAGGATGCGGGGCGGTCAGCCGTCGTCGCGGCCTTTGTGGACAACCGGCTGTGCGGCTGGACGCTGTGGAGGCACGATGCCGTTCACCGGCGCGGATGCTCCGCGGGACGCACCTGCGTCGTCGCCGTCCTGCTGGTCGGCCGGGATGTGGGGCAGCCGTATCGACAGCGTCGCACCGCCGCCCTGGGTCTCCTCGAGCCGTACGCTGCCGTCGTGCTGGGCCACCAGGGCAGCGACGATCGCGAGGCCGAGGCCCGTGCCGCCGGTCTCGCGGTACCGTGACGAGTCCGCGCGGTAGAACCGCTCGAAGACCTTGGCCGCATCGGCATCGGAGATGCCCGGCCCGTGGTCGCGCACCTCCAGCACGGAATCGCTGTGGCCGTGCAGCACGGGTGCCACTCCCACGGCGATCTCGATCGGCGTGCCCTCGGGCGTGTACCGCAGCGCATTCGTCATCAGGTTGGCGATCACCTGGCGGAGCCTGGCTTCGTCACCCCGTGTCGGGGCAGGGCGCGGGGTTCCGCCGTCGAGCCCGATCACGGTGATCGAGCGGTCCGGCGCGGTCGCGCGGGCATCGAACGCCGCGTCATGGCCGAGAACCTTGAGATCCACCGGCTTGACCTCGAGCGGGCGCTGTTCGTCGATGCGCGCCAGCGTCAGGAGGTCCTCCACCAGTTGGCCCATGCGGATCGCCTCGCTCTCGATCCGGCCCATGGCCGCCGCGACGTCCTCGTCCTTCTGCAGCGCACCGTGCCGATAGAGCTCGGAATAGCCGCGGATGGTGACGAGTGGAGTGCGCAGCTCATGCGACGCGTCCGCGACGAACCTCCGCATCTTGGTCTCCGAGGCGGTGCGCGCGGCGAAGGCACGCTCGATGTGGGCGAGCATCGCGTTGAGCGATCGGGAGAGCCGCCCGATCTCCGTGGCCGGCCTTTCCACGGCGACGCGGCGCGTGAGGTCCCCGGCGGCGATGCCTGCCGCCGTGCGCTCGACGCGTGCGAGCGGGGCGAACTGACGCGTCACGGTGATGAAGGCGATGAGCGATGCAAGGGTCGTGGTCACGAGACCCGACGTGAAGATGATCTCCGTGGCCTCCTGCAGGGAATCGTTCACGGTCTTCAGCGGGGACGCTATGGCGAAGAATCCGTCATTGGCGCGGAACGAGTACACCTGGACCCGCCAACCCGGGCCCGTCGGGTCGGTGCTCGGTACGTTCAGGCCAGAGCTCTGGAGCTCGAGGATCCGTTCCCTCGTCAGGCCGCTGAGATCGGGCAGGTCGGGAGCGCCGTCGAGCTGGTTGGTGCCGGTCACCTGGCCGCTGCCGTCCAGGACGGCCGCGTAGTACCTGAGGACCGAGCTGTCGCCCTGGGGATTCGTGTAGAAGTTCCTCGACACGTTCTGCACATTGTTGCTGAGGTCCTGGTCCACGCGATCGATCAGACCGTCGCGGAGCAGCGAGACAGTGGCGAGCCCCGTGATGCCCACCGTGACGATCATGAGCAGCATGATGATCGCCACGAGCTGGGACCGGAGGGACGCCGCGTTCCACCGCTTGATCACGAGGGGCCCGCGCTACCGCTTGTCGGCGGTACGGAGCAGGTAGCCCACGCCGCGCTTGGTCTGGATGAGTGCGGCGGCGTCGGGGTTCCTGTCGATCTTGCGCCGCAGGTAGGAGATGTAGGACTCGACGATCGAGGCGTCGCCGTTGAAGTCGTACTCCCAGACATGATCGAGGATCTGTGCCTTCGACAGGACGCGGTTCGGGTTCATCATCAGGTAGCGCAGCAGCTTGAACTCGGTGGGGGAGAGGTCGATGACCTCACCTCCGCGGCGGACCTCGTGGGCGTCGTCGTCGAGTTCGAGGTCGTCGACGCGGATCACGGCGTCGTCGTCCTCGAGCGGATGGGTGCGGCGCAGGACCGCACGGATCCGGGCCACGACCTCGTCGAGGCTGAAGGGCTTGGTGACGTAGTCGTCGCCGCCGACCGTGAGCCCGGTGACCTTGTCGTCCGTGTCATCGCGAGCGGTCAGGAAGACGACCGGGAAGTGCCGGCCCGCGGCGCGCAGGCGCCGCGTGACCGTGAACCCGTCCATGTCGGGGAGCATGACGTCGAGGACCGCGAGGTCCGGGTTGTGCAGCTCCGCGGCGGCGAGGGCGTCACGGCCGTTGCCGGCCGCGACCACTTCGAAACCTGCGAAACGCAGCGAGGTGGAAAGGAGCTCGCGGATATTGGGTTCGTCATCCACGACGAGCAGTTTTGCTTCGGGGCCTGTTTTCTTCACCCCACCAGTATCCGCACGTTCTCTGTGAGTTCGCTGGGTCGGCCCTGAGAGTGGCCTGGTCAGTCGCTGGGCCCGGTCTCGCCGATCGACGCCGCATCGAGGATCCGGTAGGCATAGCCCTGCTCTGCCAGGAATCGTTGGCGTTTCGCAGCGAAGTCCTGGTCGAGGGTGTCGCGGGCGACGACGGTGTAGAAGCGTGCGGCACGACCATCCGCCTTCGGCCGCAGCAGGCGGCCGAGCCGCTGGGCCTCCTCCTGACGGGAACCGAACGAGCCGGAGACCTGGACGGCCACCGAGGCCTCCGGCAGGTCGATGGAGAAGTTCGCGACCTTGGAGACCACGAGGACGTGGATCTCGCCGGCCCGGAACGCATCGAACAGGCGCTGGCGCTCCTTCACGGTCGTCTCACCCTTGATGACGGGTGCGTCGAGCCGCGCGGCGAGGTCGTCGAGCTGGTCGATGTACTGCCCGATCACGAGCAGTTGCTCGCCCCTGTGCGACGCGACCAGTTTCTCCACCACGTCGGACTTCGTGTCCGACGTCGCACACAGACGGTACTTGTCGCCGTCCTCGGCCATCGCATAGGCGACGCGCTCGTCACGTGGCAGGTCCACCCGCACCTCGACGCAGTCCGCGGGCGCGATGTAGCCCTGCGCCTCGATGTCCTTCCAGGGGGCGTCGTAGCGCTTGGGCCCGATGAGGGAGAACACCTCGCCCTCGCGGCCGTCCTCGCGCACGAGCGTGGCGGTGAGGCCTAGCCGGCGGCGCGCCTGTAGATCGGCGGTCATCTTGAAGATCGGCGCGGGGAGCAGATGCACCTCGTCGTAGACGATCAGGCCCCAGTCGTTGGCGTCGAGGAGTTCGAGGTGCGGGTACAGCCCGCCCCGCTTCGTCGTGAGCACCTGGTACGTCGCGATGGTGACGGGACGGACCTCCTTGACGGCGCCCGAGTACTCCCCGATCTCCTCCTCCGTGAGCGAGGTCCGCTTGAGCAGCTCATCCTTCCACTGCCGGGCGGAGACGGTGTTGGTCACGAGGATCAGGGTGGTGGTCTGGCTCGTGGCCATCGCGGCAGCGCCGACCAGTGTCTTGCCGGCACCGCAGGGCAGGACGACGACGCCCGAGCCGCCGGACCAGAAGTTCTCGACGGCGAGCTGCTGGTAGGGGCGCAGCGTCCACCCGTCCTCGTCCAGCGCGATGGGGTGCGGCGTTCCGTCCACGTAGCCGGCGAAGTCCTCCGCGGGCCAGCCGAGCTTCAGGAGCAGCTGCTTGAGCTGCCCGCGCTGCGAGGAGTGGACGACGACGGTCTCGCCGTCGATCCTCGGGCCGAGGAGGGGCTGGATCTTCTTCGCGTGCAGCACCTCCTCGAGGACGGGGTAGTCGGTGGTGCGCAGGACCAGCCCGTGCTGCGGGTCCTTCTCGAGGCGCAGCCGCCCGTACCGCGACATGGTGTCCTCGATGTCGATGAGGAGCGCGTGCGGCACGGGGAAGCGCGAATACTTCAGCAGGGTGTCGAGGACCCGCTCCGCGTCCAGCCCGGCCGCCCGCGCGTTCCACAGGCCCAGCGGCGTGAGGCGGTAACTGTGCACGTGCTCGGGGGCGCGTTCGAGTTCCGCGAACGCCGCGATGGCGTGCCGCGCCTCCGTGGCCTGCTCGTGGTCCACCTCGAGCAGGATGGTCTTGTCGCTCTGGACGATCAGCGGTCCGTCAACCAT
Protein sequences of DBSCAN-SWA_1 >LR131272|524283:533394|525184_525490_+|VDR31229.1|DBSCAN-SWA MAEARRQADSPHPAPDARLTLTDREQRMLALEREWWKYSGAKEQAIRDLFTLSATHYYQLLNALIDTEAALAHDPMLVKRLRRLRSTRQKVRSARRLGADI >LR131272|524283:533394|526255_526897_-|VDR31231.1|DBSCAN-SWA MSTSASGPDNGKDDVQGAAGSAPGEEPASSPAAPPRADAGTRHDRPSGPLPSGEGIGPADHPPYGYPQNGHGPGPSQPGPYQPGSYQQGPYQPGSYQQGPYQPGSYQPGPYQPAPYGQGGFAQGPSGRPYAPQKSRVVAGVLGILLGSLGIHRFYLGFTTIGVILILMSTVGAFLTFGLSAIAAGIWGLIEGIMILVGSEQFQRDAKGIPLKD >LR131272|524283:533394|531738_533394_-|VDR31236.1|DBSCAN-SWA MVDGPLIVQSDKTILLEVDHEQATEARHAIAAFAELERAPEHVHSYRLTPLGLWNARAAGLDAERVLDTLLKYSRFPVPHALLIDIEDTMSRYGRLRLEKDPQHGLVLRTTDYPVLEEVLHAKKIQPLLGPRIDGETVVVHSSQRGQLKQLLLKLGWPAEDFAGYVDGTPHPIALDEDGWTLRPYQQLAVENFWSGGSGVVVLPCGAGKTLVGAAAMATSQTTTLILVTNTVSARQWKDELLKRTSLTEEEIGEYSGAVKEVRPVTIATYQVLTTKRGGLYPHLELLDANDWGLIVYDEVHLLPAPIFKMTADLQARRRLGLTATLVREDGREGEVFSLIGPKRYDAPWKDIEAQGYIAPADCVEVRVDLPRDERVAYAMAEDGDKYRLCATSDTKSDVVEKLVASHRGEQLLVIGQYIDQLDDLAARLDAPVIKGETTVKERQRLFDAFRAGEIHVLVVSKVANFSIDLPEASVAVQVSGSFGSRQEEAQRLGRLLRPKADGRAARFYTVVARDTLDQDFAAKRQRFLAEQGYAYRILDAASIGETGPSD >LR131272|524283:533394|529053_529344_-|VDR31233.1|DBSCAN-SWA MAFFNVDTDSLAAKSSQVQGTISRLQAEVDSMQAGLRELEGLWTGQAASNFQQLIVQWRATQQQVEESLTGINQALTVATQQYAEAEQSNARMFMS >LR131272|524283:533394|530968_531679_-|VDR31235.1|DBSCAN-SWA MKKTGPEAKLLVVDDEPNIRELLSTSLRFAGFEVVAAGNGRDALAAAELHNPDLAVLDVMLPDMDGFTVTRRLRAAGRHFPVVFLTARDDTDDKVTGLTVGGDDYVTKPFSLDEVVARIRAVLRRTHPLEDDDAVIRVDDLELDDDAHEVRRGGEVIDLSPTEFKLLRYLMMNPNRVLSKAQILDHVWEYDFNGDASIVESYISYLRRKIDRNPDAAALIQTKRGVGYLLRTADKR >LR131272|524283:533394|529411_530956_-|VDR31234.1|DBSCAN-SWA MIKRWNAASLRSQLVAIIMLLMIVTVGITGLATVSLLRDGLIDRVDQDLSNNVQNVSRNFYTNPQGDSSVLRYYAAVLDGSGQVTGTNQLDGAPDLPDLSGLTRERILELQSSGLNVPSTDPTGPGWRVQVYSFRANDGFFAIASPLKTVNDSLQEATEIIFTSGLVTTTLASLIAFITVTRQFAPLARVERTAAGIAAGDLTRRVAVERPATEIGRLSRSLNAMLAHIERAFAARTASETKMRRFVADASHELRTPLVTIRGYSELYRHGALQKDEDVAAAMGRIESEAIRMGQLVEDLLTLARIDEQRPLEVKPVDLKVLGHDAAFDARATAPDRSITVIGLDGGTPRPAPTRGDEARLRQVIANLMTNALRYTPEGTPIEIAVGVAPVLHGHSDSVLEVRDHGPGISDADAAKVFERFYRADSSRYRETGGTGLGLAIVAALVAQHDGSVRLEETQGGGATLSIRLPHIPADQQDGDDAGASRGASAPVNGIVPPQRPAAQPVVHKGRDDG >LR131272|524283:533394|525531_526176_+|VDR31230.1|DBSCAN-SWA MTDHPRDEFDAVPETSDRQGVHREHLDPARSSGLGLKITVGVLALLIVLGVFFLLPRLGLFGGNDTGPGAAPASSTTPGPSADALAPDPTGGASGGPAASAPDSDPGVASATATDAGNDAGDDPDRDQTVNVLNGTGAPGLATVAAERLATAGWTAALPGNWAGAPLDSSVVFYNGEAQREAAESVAADLDIATVVESPDVSPDISVVLGPDFE >LR131272|524283:533394|524283_525075_-|VDR31228.1|DBSCAN-SWA MTHDDGALFDLPVTPAAGAPTSSADESVFPFAAPAGVADLVAPDWVPALEPLQPVLHALATALAADLARGARVLPDSHDILRAFTTRLAEVRVVIVGQDPYPTVGHAVGLSFSVHPAVRPVPRSLANIYRELSSDTGAAVPASGDLSGWAAQGVLLLNRVLTVRAGEAASHRGLGWEQVTDAAVRALVARGGPLVAVLWGKDAQRLKPLLGATPTIESAHPSPLSASRGFFGSRPFTRVNSLLAEQGADPIEWARTSAVVRTP >LR131272|524283:533394|527104_528886_+|VDR31232.1|DBSCAN-SWA MKQANLCGGGVALVVKEIVQVIPGRRGHHTQASIQVLKCNRPERTYAVMAKIISFDEEARRGLERGLNILADAVKVTLGPRGRNVVLEKKWGAPTITNDGVSIAKEIELDDPFEKIGAELVKEVAKKTDDVAGDGTTTATVLAQALVKEGLRNVAAGADPLSLKRGIEKAVKAVTDELLASAKEIETKEQIAATASISAGDKQIGDLIAEALDKVGKEGVITVEESNTFGLELELTEGMRFDKGYISGYFVTDAERQETVLEDPYILIVNSKISNVKDLVAVLEKVMQSGKPLLIIAEDIEGEALATLVVNKIRGTFKSVAVKAPGFGDRRKAQLADIAILTGGQVIAEEVGLKLETATLDLLGTARKVVVTKDETTIVEGAGDADAIAGRVAQIRAEIDNSDSDYDREKLQERLAKLAGGVAVIKAGAATEVELKERKHRIEDAVRNAKAAVEEGIVAGGGVALIQAGAKAFANLELTGDEATGANIVRVAIDAPLKQIAFNAGLEPGVVVDKVRGLPAGHGLNAATGEYEDLLAAGVNDPVKVTRSALQNAASIAGLFLTTEAVVADKPEKAAPAGGGGDDMGGMGGMGGF |
9 | Bacillus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1293503 : 1303510
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >LR131272|1293503:1303510|DBSCAN-SWA GATGAGCACCACGGTTGTTCCTTCGCAGGCGGAGGTGGATCCGGCTGCCTGGCCGGGCATCGCCGGTGTCCCCAGCGGCACGCGGGTCAGGGTCCGGGCTGCCATCGCCGAGAAGATCTTCTCGACCGCGGTCAACACCCTCGAGCTCCGGGTCATCTACGACGACGGCACCGTCCTGGGCGCGCCAGGTGGCGGCCCGCGGCCCGAGATGCGCATCCATCGACCGGCCGACTTCTACGCGAGGCTCGGTGACAGCGGCCTGATCGGTCTCGGCGAGTCCTACATGGCGAAGGACTGGACGGCGTCGGACCTCACCGACGTGATCGAGGTCTTCGCGGCGCGTGCCGCCCACCTCGTGCCCGTGCCGCTGCAGAAGCTCCGGGGTCTCTCGCTGCCCAAGCCGCCCCGCCGGGAGCGCAACAGCACGGCCAATACGCGCTCGAATATTTCGCGGCATTACGATCTCTCCAACGAACTCTTCGCCCTGTTCCTCGACGGGACGATGTCCTACTCGAGTGCCCTGTTCGACGCCACCGGGAAGGACCTCCTGCACGTCGCGTGGGACGAGCTCGCCACGGCGCAGCATCGGAAGATCGACCGCCTGCTCGACCAGGCGAATGTGGGGCAGGGCACGCGCGTGCTCGAGATCGGCACCGGCTGGGGGGAGCTCGCGCTGCGGGCCGCAGCGCGTGGAGCCACCGTGGTCTCCATGACCCTCTCCAGTGAGCAGAAGGACCTGGCGGAGCAGCGCATCCGCGACGCCGGCTTCGCCGACCGCGTCGAGGTCCAGCTGAAGGACTATCGCGGCGTCGAGGGTCAGTACGACGCCGTGGTGTCGGTCGAGATGATCGAGGCCGTGGGCTACGAGTTCTGGGCGGTGTACTTCCAGACCATCGACCGCGTCCTCGCGCCGGGAGGCCGCGTGGCCATCCAGGCGATCACCCTCCCGCACGAGCGCATGCTGGCGACGCGGCGGTCCTACACGTGGATCCACAAGTACATCTTCCCCGGCGGGTTCATCCCGTCGGTCCGAGCCATCGAGGACGTCACGGACCGCTACACGAGCCTCCGTGTGCGGGAGCGCCTGTCGATGGGGGACCACTATGCACAGACGCTGCGCCTGTGGGAGGAGCGCTTCCTCGCCCGGCACGCGGACGTCGAGGCCCTCGGCTTCGACGAGACGTTCGAGCGGATGTGGCAGTTCTACCTCTGCTACTCCCGGGCGGGCTTCCAGGCCGGCTACCTCGACGTGCAGCAGCTGATCTTCGACCGGCGGCCCTGACGCCCGTCGCGCACCACCCTGCCGTGCACCCGCTCGCGCCGGACCGTTCCACCAAGGAGTTTCGCGTGACTGATCCCGAGCACGGCACCACGGCGACCGCCGGCCGCCCGCACAGCGGCACGGAGGGCGTCGCGCAGAAGCTGGCGGCGATCCTCGGGCCGATCGTGGGTGGCGAGCTCCCCGTCCGGCTGGTCGCCTGGGACGGCAGCGCCGCCGGGGGCGAGGACCAGCCCGAGGTCCGCCTGAACAGCGTCGACGCGTTGCGCCGCATCCTCTGGAGTCCGGGCGAGCTCGGCGTGTCGCAGGCGTACGTCACCGGCGAACTGGATGTGCCCGGGTCCCTCGGGCAGGCGCTGACCCATCTGTGGACCGTGATCGGGCAGCGCGGCCTCTCGATGCCGCGGCTGACCCCCCAGCTCGTCCTGCAGCTGACGAAGGTGGCGCGCGAGGTCGGGGCCTTCGGAGCGCCGCTGCCGGCGCCGGCATCGCAGGCGAAGATCAAGGGCCGCCTGCACAGCCTGCTGCGGGATCGAGCGGCCATCAGCCACCACTACGATCTCTCCAACGACTTCTACGGCCTCATCCTCGACGACCACATGGCGTACTCCTGCGGGTACTGGCTGTCCGAGGACCCCGACTACTCCCTCGAGGACGCCCAGCGCGACAAGCTGGACCTCGTGTGCCGCAAGGTGGGCCTCGACACGGAGCCTGGCCTGCGGTTCCTCGACGTCGGCTGCGGCTGGGGGTCCCTGAGCCTGTACGCCGCCGAGCACTTCGGCGCGAAGGTCACCGGTGTGACGATCTCGCGTGAGCAGAAGGCGTTCATCGACCAGCGGATCGCGGAGCGGGGGCTCGGCGACCGCGTCGAGATCCGGCTCCAGGACTATCGCGACGTCCGCGATGGCCCGTACGACGCCGTCGCCTCGCTGGAGATGGGAGAGCACGTGGGCCAGGAGAACTACGGCACCTACACGTCCACGCTCTTCCGGAACGTCGCGCCGGGCGGCAAGGTGCTGATCCAGCAGATGTCGCGCCACGGCAGGCACCCGGGCGGAGGTCCGTTCATCGAGTCCTTCATCGCGCCGGACATGCACATGCGTCCCGTGGGGGACACCGTGAACCTGATCGAGGGTGCGGGCTTCGAGATCCGCGGTGTCCAGGCGCTCCGCGAGCACTACGTCCGGACGATCGACGCGTGGGAGGACCGCTTCGAGGCCAACTGGGACCGGGCCGTCGCGATGATGGGCGAGGAAGTCGCGCGCGTCTGGCGCCTGTACATGGTCGGCGCATCCATGACCTTCCGGGACGGCCGCATGGGCGTCGACCAGATCCTCGCGCAGCGCCCACTGAGGTAGGCCCGCGGACGGCGCGAGCCGTCACGCCGACATATGGCCCACGGCCGGTACCGCTCCTTGAGGATGCGGTGCCGGCCGTCGTCGTGGGCGGTGGGAACCGACCGCCATCGAGTCCGCCAGCCCGTCCGCGACACGCCGCGGAATCGCGACACGGAGTTTTTTCTGCCTGTGCGGAACTCCTCCACAGCATGTGGACCGGCATCCCGTGAGGCCCCCGGAATCGCTCGTTTCCGGGCTTTCCACGGGTGTCCGACTGTGGACATCGTGTGGACTACTCGCACCGGTAATGACATACTTGTAATACATCATCTAGGGGTCGACCCGAGCGTGCCGCACTAGATGTAGTATCGAACTACAGAAACAACCGCTGGACGGGCATGGGCCCGGAAGCGCGGAAATCCCCGGATTTCTTCCCCTGTCCGGGGCCGCGCACCGCCCACGCCACAGCACGACAGCAGTAGACGAAAAGGGGACTAGGACATGACCGTCACGGTTTACACGAAGCCGGCTTGCGTACAGTGCAATGCCACCTACCGCGCTCTGGACAAGAAGGGCATCACCTACCAGAGCGTCGACATCTCGCAGGACCCCGCAGCACTCGAGCGCGTCCGCTCCATGGGGTACATGCAGGCCCCCGTGGTCATCACCGACAACGACCACTGGTCCGGCTTCCGCCCCGACAAGATCAGCGCGATCGCCGACTCGGCCGTCAGCTCGGTGGCCTGACCCATCCCGTCCCGGGATCAGCAGGCAAGGCATCAGGCAGTGACCACCCTCGCCGTGGACGACCGACCCACCACCGACCCGGCAACGGCCGGGACGGCGGAGGGTGCGGTAACCGATGCCGCCCTGATCTATTTCTCGTCGGTCTCCGATAACACGCATCGCTTCGTCGAGAAACTCGGAGTGCGGGCCGCCCGCATGCCGGTGCTCACGAAGGAGCCGACGCTGAGGGCCACGCGGCCCTACGTCCTGGTCCTCCCGACCTACGGCGGGATCACCGGCAAGGGTGCAGTGCCCCGCCAGGTCGTGAAATTCCTGAACAACGAACAGAACCGCAGCCTCCTCCGGGGGGTCATCGGGGCGGGCAACACCAATTTCGGTGAGACCTACTGCCTCGCAGCCGACATCGTCGCCGCCAAGTGCCATGTGCCGGTTCTCTACAGATTTGAAGTCATGGGAACGTCCGAAGACGTCGCCCGCGTAACCACAGGATTGGAAGAGTTTTGGACATGAGTGTTGCAGAGACGGAGACGGGTGCGCCCGCAGCTTCAGTAGAGCGCTTCAAGGACATGGGCTACCACGAGCTCAATGCCATGCTCAACCTGTACGGGCCGAATGGGGAGATCCAGTTCGACGCCGACCGCGAGGCGGCGCACCAGTACTTCCTGCAGCACGTGAACAACAACACCGTGTTCTTCCATGATCTCGAGGAGAAGCTCGACTACCTCGTGAAGAACGAGTACTACGAGCGCGAGACGCTCGACCAGTACACGATGAACTTCGTCCGCGACCTCTACAAGCGCGCCTACTCCAAGAAGTTCCGTTTCGAGACGTTCCTGGGCGCCTTCAAGTTCTACACCTCGTACACGCTGAAGACGTTCGACGGCAAGCGTTACCTCGAGCGCTACGAGGACCGCGTCTGCATGGTGGCCCTGCACCTGGCCCGTGGCAACGAGCAGCTGGCGGAGCAGATGGTCGACGAGATCATCGACGGCCGCTTCCAGCCCGCCACCCCGACCTTCCTCAATGCCGGCAAGCGCCAGCGCGGCGAGCTGGTCTCCTGCTTCCTCCTGCGCATCGAGGACAACATGGAGTCGATCGGCCGCTCCATCAACTCGGCCCTGCAGCTCTCCAAGCGCGGCGGCGGCGTCGCCTTCGCCCTCACGAACATCCGCGAGGTCGGGGCGCCCATCAAGCAGATCGAGAACCAGTCCTCCGGCGTCATCCCCGTGATGAAGCTCCTCGAGGACAGCTTCTCCTATGCGAACCAGCTCGGTGCCCGTCAGGGTGCGGGAGCCGTGTACCTGCACGCCCACCACCCGGACATCAACCGTTTCCTCGACACCAAGCGCGAGAACGCGGACGAGAAGATCCGCATCAAGACGCTGTCCCTCGGCGTCGTGGTTCCCGACATCACGTTCGAGCTCGCCAAGCGCGACGAGGACATGTACCTCTTCTCGCCGTACGACGTCGAGCGCGTCTACGGCATGCCGTTCTCCGACATCTCGGTCACCGAGAAGTACTACGAGATGGTGGACGACGCCCGCATCAAGAAGACCAAGATCAAGGCGCGCGAGTTCTTCCAGACGCTCGCCGAGATCCAGTTCGAGTCCGGCTACCCGTACATCATGTTCGAGGACACCGTGAACCGGGCCAACCCGATCGACGGCAAGATCATCATGTCCAACCTGTGCTCCGAGATCCTCCAGGTATCGCAGCCCACCACGTACAACGAGGACCTGTCCTACGACACCGTGGGCAAGGACATCTCCTGCAACCTGGGCTCGCTGAACATCGCGAAGACCATGGACTCGCCCGATTTCGGCCGGACCATCGAAACCGCCATCCGTAGCCTCTCGGCTGTTTCGGACATGAGCCACATCAGCTCGGTCCCGTCGATCGCCGCCGGTAACGACGCCTCGCACGCCATCGGCCTCGGGCAGATGAACCTGCATGGCTACCTGGCCCGCGAGCGTGTCCACTACGGCTCCGAGGAGGGCCTGGACTTCACGAACATCTACTTCTACTCCGTGGTGTACCACTGCATCCGGGCGTCGAACCTGATGGCCATCGAGACCGGCCGTACCTTCGCAGGCTTCGAGAAGTCGAAGTACGCGAGCGGGGAGTTCTTCGACAAGTACACGGAGCAGGTCTGGGAGCCGACGACGGCGCGCGTCCGCGAGCTGTTCGCCGACATGCACATCCCCACGCAGGAGGACTGGCGTGCGCTCAAGGCCTCGGTCGTGGAGCACGGCATCTACAACCAGAACCTGCAGGCGGTCCCGCCCACCGGCTCGATCTCGTACATCAACAACTCGACGTCCTCCATCCACCCGGTCGCGTCGAAGATCGAGATCCGCAAGGAAGGCAAGCTGGGTCGCGTGTACTACCCGGCGCCGTACCTGACCAACGACAACCTGGAGTACTACCAGGACGCGTACGAGATCGGCTACGAGAAGATCATCGACACCTACGCCGCTGCCACGCAGCACGTGGACCAGGGCCTGTCCCTGACGCTGTTCTTCAAAGACACCGCCACCACCCGTGAGATCAACAAGTCGCAGATCTACGCCTGGCGCAAGGGCATCAAGACCGTCTACTACATCCGTCTCCGCCAGCTCGCGCTGGAAGGGACCGAGGTAGAGGGTTGCGTCAGTTGCATGCTCTAGGCAACTAGATTGTCAGGCCGGACGTCATGTCCGGCCTGGCACCGCAGGCTCCACCGGGATCACCGGCATTACTTTCACCCAGACAAGGAACACACCATGAGCGAGAAGCTGAAGCTCGTAGACCAGGTCCAGGCCATCAACTGGAACCGTATCCAGGACGACAAGGACGTGGATGTCTGGAACCGCCTGGTGAACAACTTCTGGCTGCCCGAGAAGGTGCCGCTGTCCAACGACGTGCAGTCGTGGGCCACGCTGACGCCGGAGGAGCAGCAGCTCACCATGCGGGTCTTCACGGGCCTGACCCTGCTGGACACCGTGCAGGCCACCGTCGGCGCGGTCAGCCTCATCCCGGATGCGCTGACCCCGCACGAGGAAGCCGTGCTCACGAACATCGCGTTCATGGAGTCGGTGCACGCCAAGAGCTACTCGTCGATCTTCTCCACGCTGGCCTCCACCAAGGAGATCGACGAGGCGTTCCGCTGGTCCACGGAGAACGTGAACCTGCAGAAGAAGGCGCACATCGTCACCGACTACTACCGTGGCGACGATCCCCTGAAGCGCAAGGTCGCCTCGACGCTGCTCGAGAGCTTCCTGTTCTACTCCGGCTTCTACCTGCCGATGTACTGGTCCTCACGCGCCAAGCTCACGAACACCGCTGACCTGGTGCGCCTGATCATTCGCGACGAGGCCGTGCATGGCTACTACATCGGGTACAAGTTCCAGCGTGGTCTGGAGAGCGCGACGCCGGAGCGCCGCCAGGAGCTGAAGGACTACACTTTCGAACTGCTCTTCGAGCTGTACGAGAACGAGGTCCAGTACACGCACGACCTCTACGACTCCGTGGGCCTGGCCGAGGACGTCAAGAAGTTCCTGCACTACAACGCCAACAAGGCGCTGATGAACCTGGGCTACGAGGCGATGTTCCCGTCCTCCGTCACCGATGTGAACCCGGCGATCCTCTCGGCGCTGTCACCGAACGCGGACGAGAACCACGACTTCTTCTCCGGCTCGGGCTCGTCCTACGTGATCGGCAAGGCCGTGAACACCGAGGACGAGGACTGGGACTTCTAGCTCTTCACAGGAAGTGCGGTAGACAGCCTCGCCCGAAGGCCTGGTAGCGACGGGACGGGATCGGGAGATCCCATCGACTGCTACCAGGCCTTCGTCGTGCCTACCCTGAGGGGGCATCCTTGCGGCATGTACGGCACGACCAGTACCCGCGCAACGGTGTTTTCTTGAGGTGCTCGTTCTGCTTTCGGTGGACAGTGCGGCACAGCCTGGCTCGATGTCACGCGATGAGCGGCGCCGACCGAGGTGCGCGCCGCAACGCCTTCACCCGTTGACGCATCATGCGCCGGTCGCGGCGCCGCCGCAGCAACTCTTCGGCGAGATCATCGGCGCTGCCAGGCGGTGAGGGAGCGATCGATCGGTACCTGTGACCCGTCGGGGTTTCGACGTCGATGCTATGCCTCAGACCGGAGCCGGTTCGGGCCCGCCATCCTCGGACTTCCTTCGAGTGGTTGCATGCCTCGCAGAGGCCGGCTCCGTTGGACAGACTCGTTCGTCCGCCGTCGTGCCACGGTACAACGTGATCATGGTGGCGGATCGGCGCGTCGCAGTAGGGGGTTCGGCAGGTATGGTCGCGGGCCTCGAGGAACCGCCGCTGGGACGCCGTGAACAGGCGGGCGCGGGAATCCATCGCGAGCAGCTCACCACTCCCAGGGGCGGTGTACAAGCGCCGAATCAGGACCCGAAGGTCCTGTCCGAGGCCGCCCTCTCGGGTGGAGGCGAGTGTCCTCGCCCAACCGGCAGGAACCACGCCGTATCCCGAGAGGCGCGCCGGTTCACTGTCCCCCTGGAAAAGGGCGCGGTCGGTCATGACCAGTTGGATCTCCACGCCGGACACACCTCCAGGTGTACCCGTGGTGCGTTCCACCAGAGTGTCGGCCATGAGTTGACCACGGCTTCGGATGTCGCCTTCTGATCGAAGCGAACCGGCGTGCCGCGCGAGCTCCGCGTAGACCGCCACGCCTTGAGCGACCGGAAGCAGGGCGGTGAGGTAAGCCATCGTGTCCGGTGCCGGCCGAAGGCTGACATACCGTTCCGCGGCGGCGTGCCGCGCCCGATCCGTCACAGAGCGTGGGTCTCTGCGGTATGCAGCGGCCCGGGCTGCAGCGACGATGGCCCGGTCTCCCTGTCCGAGGAAGGATCCGGTGTCGGCAGAGAGCTCCTCGTCCACGGCGCATCGATCCGAAGCAGCCAGACAGGCGGTTTCCCTCACCAGCAGGGTGGCGCGCCATTCGTTCAGCTGGCCACTCTCCAGAGCTGCGAGAGTGTGCGGCATCTCCGTGATGAGCGCACGAGCGAATCCGAGCAGGCGGCCACCGCGTGCGGGAGATTCCTTACGCGCCAGCGCCAGCTGAGCGCCCACGCCCTGCCCCTGCGCGGAAGCGGGAACGCCCCACAGGGCCTGGTTCTTCCGCTCCGCGAGATCGAAGGCGACGGCGGCGCGGGCCTGCAGACCCGCGACGGCCGAGGTGAGGTCTTCGAGGTCACGGATCTGATCGATCAGCTCTGCGCTCGTAGAACCGAGCCGGAAGGACATCAGGCCTTCGATGAGCTCCCGAACATCGGTCGGCAGAACAGCGTCGTCCATCGGCACGTCACAAGGAAGCAGGGCCTCTTCAGATGGGGCCGTCCTTGTGCTGTCCATGGTCCATTTTCGCGCGCACCACTGACATTCCAAGCGGGTGTCAGGCTTGGGAGTACCCCGCGTTCTTCGCCGGTGCGGCGTGGCAACCAGGCGGTCGGATTCTGTCGGGGACGCCCCTGAGGACAGGGCAGACTGCCATCACTCACCGCGGCTTCGATCCCGGCGCAGGTGGACAAGCGGCAGGTGAAGTGTGCACTGCGCGATCGCACCGATTTCGAGCCGTCGCTACGGCCAGGTCCGCCAGTTCCGCCAGCGCGACGGTTCGGGCTCGGGCTCCGGTTCCGGCTTCGGCTCTCCTCCCAGCGCCTGCGCGGCCTCGACGAGATCCGCGGCGGTGAGCGTCATGACGTCGTCGCGGTCGAGCGCATCGAGTTCGTGATCCGGGTCGAGCGACAGCCTCAGAGCCTGCCGGTTCAGCGCCTGCTCGAACAGGGTGCGGGCGAAGCGCGCGTTGCCCGAGTCCTCCCCGACGTGGAGGCCCAGGAAGATACGGCGCAGCATGTCGTCCGCGCCGTCGGCCAGGACGTACTCGTGCTGCCCGAGCATCTGGTGGAAGATCGTCTGCAGTGCGTCCACCGAGTAGTCCGGGAAGGTGATCTCCCGCGCGAAGCGGGACCGCAGGCCGGGGTTCGAGAGGAGGAACGCCTCCATGAGTTGCGGGTACCCGGCCACGATCACCACGAGCCGATGGCGGTGGTCCTCCATCCGCTTGAGCAGGACCTCGATGGCTTCGGGGCCGAAGTCCATCCGACCGTCCTCCGGCGTCAGGGCGTACGCCTCGTCGATGAACAGGACGCCGTCCAGCGCACGCCGGATCACGCGGTCGGTCTTGATGGCGGTGGCCCCCACATACTGCCCGACGAGCCCCGACCGGTCGACCTCCACGAGGTGGCCCTTCTGCAGCAGACCCACCGCGCGGTACATCTCGGCGAGGAGCCGCGCCACGGTGGTCTTCCCGGTGCCCGGGTTCCCCAGGAACACCAGGTGCTGGGAGGTCGCCACCTCCGGCAGCCCGTGAGCCTTCCGCCGGGCCTGCACCTGCAGCAGGGCGACGAGCGCGCGGACCTGCTCCTTGACGGTGTCCAGCCCCACCAGGGCGTCGAGTTCGGCCTGCACCTCGGCCAGCGGGCGAGCCGGACCGGGACGCGCTGCGAAGTATTCGCCGCGCAGGTCCTCGACCCGATCCGACCCCGACGATTTGAGCTGCTCGGTGAGATGGCCGAGCGTCTCGCGCAGGTCATCGAGCGGATTGCGGCTGGCAGCCAT
Protein sequences of DBSCAN-SWA_2 >LR131272|1293503:1303510|1297371_1299531_+|VDR31902.1|DBSCAN-SWA MSVAETETGAPAASVERFKDMGYHELNAMLNLYGPNGEIQFDADREAAHQYFLQHVNNNTVFFHDLEEKLDYLVKNEYYERETLDQYTMNFVRDLYKRAYSKKFRFETFLGAFKFYTSYTLKTFDGKRYLERYEDRVCMVALHLARGNEQLAEQMVDEIIDGRFQPATPTFLNAGKRQRGELVSCFLLRIEDNMESIGRSINSALQLSKRGGGVAFALTNIREVGAPIKQIENQSSGVIPVMKLLEDSFSYANQLGARQGAGAVYLHAHHPDINRFLDTKRENADEKIRIKTLSLGVVVPDITFELAKRDEDMYLFSPYDVERVYGMPFSDISVTEKYYEMVDDARIKKTKIKAREFFQTLAEIQFESGYPYIMFEDTVNRANPIDGKIIMSNLCSEILQVSQPTTYNEDLSYDTVGKDISCNLGSLNIAKTMDSPDFGRTIETAIRSLSAVSDMSHISSVPSIAAGNDASHAIGLGQMNLHGYLARERVHYGSEEGLDFTNIYFYSVVYHCIRASNLMAIETGRTFAGFEKSKYASGEFFDKYTEQVWEPTTARVRELFADMHIPTQEDWRALKASVVEHGIYNQNLQAVPPTGSISYINNSTSSIHPVASKIEIRKEGKLGRVYYPAPYLTNDNLEYYQDAYEIGYEKIIDTYAAATQHVDQGLSLTLFFKDTATTREINKSQIYAWRKGIKTVYYIRLRQLALEGTEVEGCVSCML >LR131272|1293503:1303510|1296904_1297375_+|VDR31901.1|DBSCAN-SWA MTTLAVDDRPTTDPATAGTAEGAVTDAALIYFSSVSDNTHRFVEKLGVRAARMPVLTKEPTLRATRPYVLVLPTYGGITGKGAVPRQVVKFLNNEQNRSLLRGVIGAGNTNFGETYCLAADIVAAKCHVPVLYRFEVMGTSEDVARVTTGLEEFWT >LR131272|1293503:1303510|1296619_1296865_+|VDR31900.1|DBSCAN-SWA MTVTVYTKPACVQCNATYRALDKKGITYQSVDISQDPAALERVRSMGYMQAPVVITDNDHWSGFRPDKISAIADSAVSSVA >LR131272|1293503:1303510|1302469_1303510_-|VDR31905.1|DBSCAN-SWA MAASRNPLDDLRETLGHLTEQLKSSGSDRVEDLRGEYFAARPGPARPLAEVQAELDALVGLDTVKEQVRALVALLQVQARRKAHGLPEVATSQHLVFLGNPGTGKTTVARLLAEMYRAVGLLQKGHLVEVDRSGLVGQYVGATAIKTDRVIRRALDGVLFIDEAYALTPEDGRMDFGPEAIEVLLKRMEDHRHRLVVIVAGYPQLMEAFLLSNPGLRSRFAREITFPDYSVDALQTIFHQMLGQHEYVLADGADDMLRRIFLGLHVGEDSGNARFARTLFEQALNRQALRLSLDPDHELDALDRDDVMTLTAADLVEAAQALGGEPKPEPEPEPEPSRWRNWRTWP >LR131272|1293503:1303510|1294849_1296139_+|VDR31899.1|DBSCAN-SWA MTDPEHGTTATAGRPHSGTEGVAQKLAAILGPIVGGELPVRLVAWDGSAAGGEDQPEVRLNSVDALRRILWSPGELGVSQAYVTGELDVPGSLGQALTHLWTVIGQRGLSMPRLTPQLVLQLTKVAREVGAFGAPLPAPASQAKIKGRLHSLLRDRAAISHHYDLSNDFYGLILDDHMAYSCGYWLSEDPDYSLEDAQRDKLDLVCRKVGLDTEPGLRFLDVGCGWGSLSLYAAEHFGAKVTGVTISREQKAFIDQRIAERGLGDRVEIRLQDYRDVRDGPYDAVASLEMGEHVGQENYGTYTSTLFRNVAPGGKVLIQQMSRHGRHPGGGPFIESFIAPDMHMRPVGDTVNLIEGAGFEIRGVQALREHYVRTIDAWEDRFEANWDRAVAMMGEEVARVWRLYMVGASMTFRDGRMGVDQILAQRPLR >LR131272|1293503:1303510|1300819_1302187_-|VDR31904.1|DBSCAN-SWA MDDAVLPTDVRELIEGLMSFRLGSTSAELIDQIRDLEDLTSAVAGLQARAAVAFDLAERKNQALWGVPASAQGQGVGAQLALARKESPARGGRLLGFARALITEMPHTLAALESGQLNEWRATLLVRETACLAASDRCAVDEELSADTGSFLGQGDRAIVAAARAAAYRRDPRSVTDRARHAAAERYVSLRPAPDTMAYLTALLPVAQGVAVYAELARHAGSLRSEGDIRSRGQLMADTLVERTTGTPGGVSGVEIQLVMTDRALFQGDSEPARLSGYGVVPAGWARTLASTREGGLGQDLRVLIRRLYTAPGSGELLAMDSRARLFTASQRRFLEARDHTCRTPYCDAPIRHHDHVVPWHDGGRTSLSNGAGLCEACNHSKEVRGWRARTGSGLRHSIDVETPTGHRYRSIAPSPPGSADDLAEELLRRRRDRRMMRQRVKALRRAPRSAPLIA >LR131272|1293503:1303510|1299627_1300602_+|VDR31903.1|DBSCAN-SWA MSEKLKLVDQVQAINWNRIQDDKDVDVWNRLVNNFWLPEKVPLSNDVQSWATLTPEEQQLTMRVFTGLTLLDTVQATVGAVSLIPDALTPHEEAVLTNIAFMESVHAKSYSSIFSTLASTKEIDEAFRWSTENVNLQKKAHIVTDYYRGDDPLKRKVASTLLESFLFYSGFYLPMYWSSRAKLTNTADLVRLIIRDEAVHGYYIGYKFQRGLESATPERRQELKDYTFELLFELYENEVQYTHDLYDSVGLAEDVKKFLHYNANKALMNLGYEAMFPSSVTDVNPAILSALSPNADENHDFFSGSGSSYVIGKAVNTEDEDWDF >LR131272|1293503:1303510|1293503_1294784_+|VDR31898.1|DBSCAN-SWA MSTTVVPSQAEVDPAAWPGIAGVPSGTRVRVRAAIAEKIFSTAVNTLELRVIYDDGTVLGAPGGGPRPEMRIHRPADFYARLGDSGLIGLGESYMAKDWTASDLTDVIEVFAARAAHLVPVPLQKLRGLSLPKPPRRERNSTANTRSNISRHYDLSNELFALFLDGTMSYSSALFDATGKDLLHVAWDELATAQHRKIDRLLDQANVGQGTRVLEIGTGWGELALRAAARGATVVSMTLSSEQKDLAEQRIRDAGFADRVEVQLKDYRGVEGQYDAVVSVEMIEAVGYEFWAVYFQTIDRVLAPGGRVAIQAITLPHERMLATRRSYTWIHKYIFPGGFIPSVRAIEDVTDRYTSLRVRERLSMGDHYAQTLRLWEERFLARHADVEALGFDETFERMWQFYLCYSRAGFQAGYLDVQQLIFDRRP |
8 | Mycobacterium_phage(42.86%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1974602 : 1986869
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >LR131272|1974602:1986869|DBSCAN-SWA CATGCGAATCAGTGAACTGCAAACCCTCTGCTACAAGCAGTCAGCCGACAAAGGCTTCCACGACGACGAACCAACTGAGGGACGGGCGCTCCTGTCCCTCAACGCTGAGCGCATCGCCCTCATGCACTCCGAGCTTTCCGAAGCGCTCGAGGAGCTGCGCACCGGTCGGGCCGCGAACGAGACCTGGTACGCGCCACTGGTCATTCCAACCTCGCTCGCAATCGAAGCTGGCTACGCGGAGGCCGAGCGACTCATCAAGCAGGAGAACGAGGGCAAGCTGCAGAAGCCCGAGGGAGTCCCCTCCGAAATGGCCGACGTCGTGATCCGAGTACTCGACTTTTGCGGAGCCAACGACATCGACCTCGAGTCCATGATCACCGAGAAGCTCGCCTACAACGCCAGCCGCGGCTTCAAGCACGGCAAGAAGTTCTGAGCCGACCCATGAAGATCACGATCGAGTCGGGGCAGCTGCTCGCGGCGCTCGCCGCGACGGTTGCGCATACGACGTCGGGCAAGGAGGACGAGTACATGCGGACGGTGGATGTCCAGATCATCGATGACCACCTGATGTTGACGGCGTCGAATGGCGGGACGACGGGAGTGGCCCGGGCGGCGCTGATGGACACGACGGGTGAGCTGTCTCGCTTCGCCCTGGACCGTGACGACATTGCCACGATTCGGGGGCTGTTCGCGGGGACGATCAAGGACCTCGAGGTGACGGTCAACGTTATGACGACCACCCCGATGTCCCCGGACGATAAGCCCGAGGTCATCAACCAGGTGCAGATCCGCGAGCTCGGCGCCCTCTTCGGCGGCCGCGAGATCCGCCTGTCCACTCCGGACCAGTCCCGGCGCGACGTCGAAGGTGTCTGGCACGCGATCGCCGCCGGTCTCCGCAAGCGCGCCCCGGCCCTCCCAAGGACGTCGCTCGAGCCGAAGGACTTCGCGCTCTTCCGGGCCGCTGCCTCGGCCTACAAGGAACCCTTGAGCGTCGAGCCGGCAGACGGCTTCGGCAGCCTCATCGTCCAGTGCGGGCGCGACTTCATCGGGTTCTGCAACGCGGACGCCCGAGACAACCCCGCACTCACCAGCGCTCGCGACAGCTGGCGCAACACCCTCCCCCTCAAGATGCGAATGGCGGACACAGGATCATGACCACCACACCAGCAGACCCAGCCCCAGCCCCAACCAACAACCCATTCCAGTACGTCGACGAACCAGGCAGAGCCCTCGGCATGTATGCCGCGTTCGTCGACAGCCTCCCCGAAGTCCAAGCCGAACGCGGTACCGGGCCCGGACAGATCACCTTCCGCACACCCAACGGCGCGCTCGCCGCCACCGGCATCAGCACCTCCGAGCGTCTCTTCGCGGCCGCCGCTGACGCCCTCGCCCTCGCGCACGCCGTGGCCGCCCATGAGGACGCGGAGCGTGACAGCGAGCGCCAGGCCACCCTCAGTCCTGCGCTCGCCGATCAGAAGGGCCGGGCGCTCGTGCAGGCCATCCACGACCGAAAGCTGAACCGCTAATCCATGCCATGGCTCCGCTCCGGCGATACCGCCGCTAACCACCCCATCGCGCTCGCCGCTCTCGAGCACGACAACTTCGACGATCGCCTTCTCAATGAGGTTTTCGGGTTCGTCATGCGCTGCGCGCTTCAGTCGACCGCTCACCTGACCGACTACGTCGTGTCCCGAGGCACCGCCATCCAGATGGCCGGCGCCTCGAGGGTGGGCCTGCTGACGGACGTGGCCACGTTCGCCGGCTACTGGGTGGAGCAGCGCGTAGAGACCGACGGGGCGGAGCGCACGGTCTACAAACTTGTGGAGGACCCCGAGTTCATCCACATGAGGACGAAGGAGGAAATCGAGTGGGAGCGGCAGCGCAAGACGGACAACTCTAATCCCGCCCTGATCATCCCGGTCCGCTTCCGCGACGGTGACGCCTGCCGCTACTGCGGCAAGGTCGTGAACTTCCTTGCGCGCAAGGGACGCCTCGCGGGCACGTACGACCACAGAGAGCCTGGGAAGACGGCGACGGTAGACACCTACGTCGTCGCCTGCGGCGCCTGCAATGCCGGCCGGTCCGATGACCCCGCTGCAGACACCGCCTACCCGCTCCTGCCAGCACCACCGAAGCCGTACTACTCCCCGAAGACCATCGACTGGTTTAGGGGTCACGAGTGGGCGCAGAGCAACGGATACAGGCCCCCACGGCCTCCGGCTCGAACCATTGCACCAGGACAGCCGTCCGGCAACGGAACACCCGGCCAGACAGCACCAGCAGCACCACAGACGCAGGGCAACGGCGACACCGGCCCTCAGAACGGCGCAGAACCGCGCAAACGAGCCGAAGAGAAGCCCCCTCCACAGCTACCTGCCAGAAATCTGCGGATTCCAGCAGATCCAGCAGGTCAGCAGTGTGCAACTTCTAGAAATTCCGGGACGGGTAGGGACGGGACGGGACGGGTAGGTGAGGTGAGGTCTGGTCTTGCAGACCAACCACCCTCTCAACCTGACCCTGCCTCCAAACCGTCGAAGCGCAGACGAAACAGATCCAGGCCCCGCAGGAGAGGCAATAGCTCATGACCCGATTCCCCGTCCTCACCGCCGCCGAGCACAGGCTCAAGATGTGCAACGCCTGGTCCGAGCAGCAGCTGCAGGACAAGGTCGTCCAGCTCGCTGAAGAACTCGGCTGGATGACCTTCCACGTCTATGACAGCCGGCGGTCTAATCCGGGCTGGCCTGACCTCGTCCTCGTCCACCCAGTCTGCCGGCGCCTCATCATCTGGGAGCTCAAGAGCAAGAACGGTCGAGCAACTCCGGCCCAACTCACCTGGCTGACAGCACTCCGGAACCTCGGCCTGAACGTTGGCATCAAAAAGCCCGAGGACTGGGCCAGCGAATCCATACAGCGCGAGCTCGCAGCACGAGCAGACAGCACCATCGAAGGGACACCATCCGCGTGAGCACCGGACCATGCACCTATCCACGCTGCACCGACGGCGAGGGCAACCCAGTACTGACCAGCCTCGGCATCTGCGACAACGACCGGATTCGCTACGGGCAGGTTATCCGCTGGCTAGCAGACGACTACGCCACCCTCCGCCTCGGCCTGCCTTCCCCCACGATTCGAGGCGTCCGCGTCAGGACAACGACGCGCGAAGAGTTCGGCCACCCAGCCGAGTGGGCTTCCGACGAAGCGCGGTCGATCGCGCTGCTCTTGAACCAAATCGAGGACGGCCTGCGCGAGTACTCCGGCCAAGAACCAGCGGACCACCCGGGCATCTTCCAGGACCGCCTCGCCCTGCAGGGTTTCGCCTACCTCGCCCTCCACTTCGAATCGCTGTGCACCTACCCCGAGGCCGGCACCGCTGCTACAGCCATCGTCGACAAGCACCGCACCATTCGGGCAGGTCTCGGCTACACCAGGCAGGCCGACAAGCTCCCCACCCCCTGCCCGAAGTGCAACACCGTCGGCCTCGTCTCCCTCGGCGGAAACCGAGCCGTTATCGAGTGCCAGGCATGTGGCCATCGAGTCAGGCCCGAGCACTACGAGTACCTCACGAAGGTGGCCGCCGAGACCGCGATCACAACAAGCAGAGCTAGCGACTGGGATCTTCTAGACCGCTACGATGACGCGAACTCAAGACGAGAAGTGGAGCATGGATGGAACACCTGATTGACGCACTGGAAGCTGCCCTCGAGCAGAAGAATTGGTACGCAGCCCTGACGATGGCGCTTTCCCTCCCGGACATCTGCGGAAAGCTCGAGGGAGAGGCGCCACAGACCTCCCGGGCCCGATACGTCCGTTGGTGCGAGAAGTACTTGCAGCCCAAATACACCGCGCCCGCAGACTGGGATGAGATTCCGCACGTGTTTCTGAGCGGCTCGGACTGCTATGCGCTGCGATGCGCCCTACTCCACGAAGGAAGCGCAGAGATCATCGAGCAGAGCGCGCGCGACGCGCTCGACTCGTTTGAGTTCACGGTGACTGACGGAGTGGACTGGAACATTCACATGAATCAGACTGACCTCCGCCTACAGCTCAGCATTGATGTCTTCTGCGACGATCTAGCTGCTGGAGCAGAACGCTGGCTTGAGGACATGGAGCCTAACGCTGAGGTACAGGCTCGAATGTCGAGGATGATGCGAATCACCAACATCGGGCCGAGACGCAGTGCCGATTAGTTCGATGACTTACAGCTTGCACACCGCGTGCCTCAGTGTCACACTGTTTGCAGCGCCACATCTGTATCCAAGGCCGGGCATCAGGACACTGACCCGGCCTTTCTCATGCCCGGAGGTATCCCCATGCTGACCCCGGAAGAGCTGGCCGCTCTCGAATCCCACGTCACGACCACGCAGGCGGCCGAGCGCGTGCGACGATCACCTGCTGCCATCCGTCAGTGGGTGAAGCGCGGACACCTGACGCCCATCAACCCAAGCAGCACGGGACCGAAGTACTTCCGAGTCGTCGACGTCCTACGATGCGACCGCGATCGGCGCGCCAAAATCATGCAGCTGCTGTCCGCATGACGAAGCGGGTGTGCCGGATGCACGGCTGCCCTCGCATCCAGGACGCCAAGCTATGCGACGAGCATGCTCGATCGCACGAGAAGCGACGCGGCACGCGACAGCAGCGCGGATACGACGCAGCCCACGACAAGGAACGACGTCGCATCGAGAAGCAGGGCATCGAGAACCATCGCTGCGCTCGCTGCGGCGCATGGTTCGAGCGCGGCGAGCCGTTCCAGCTCGGTCACACCGACGACAGGCTCAACTGGTCGGGTCCGGAGCACATCCGATGCAACACATCAGCGGCCGGCGCCGCATCCCATGCGGTGTTCAAGCGCCCCGAAGGCCGCCGCTGAGGGTGGGGGGCACCCCCTAATCGAACGCGAACCGCCGGACCGCCGGGGAGGGCTCTCGATGGTTCGTCAGGTTCATATGACTTTTGAGCTCCCTGGTCAGGTTCAAAGAGTTGGCCTGTCAGGTTCAAAGCGTCCGAGGTGAGAGCCCCGGACGGGCTGCGAGTCGCCGCGCGAAGGAACACGCGGGCGTTGGCTCACCGTAGGGTTCGAGGCCCTGCCAGCCCACCGACTTCCCCAAGCCCCCTTTCGGAACGTAGAACCGGGAGGGGGTACCACCATGCGCGGAGTCGGCAACTGGCCCTTCTAGGCCACACACGGCGGGTGCGACCGCGACCCGACAAGCGGGCCGCGCCCGCCGTTCGAACGCTGTACCAGGCCGCCGGCGCGATGCCCCGGCCTTTCTGCTGCGCGATGCAGCGAGGAAATGAGGCCAGATGCCACGCGGTGGAGCTCGAGTGAACAGCGGCCCAGCCGCGGACCCGCAGGCACTCCGACGGGATCGGCCGGCGGACAAGGACTCGTGGACGGTCCTCCCATCCGAGGGGCGCAAGGGCAATGCCCCCGCCTGGCCGTTGTCGAAGTGGCGTGACCAGGAGAAAGCGAAGGACCCGGAGGAGCGCGACGACGCCGGCGCTCGCGCCCTCGATGCGCGCGAGCTGGTCGTATGGCGACAGATCTGGAAGACACCGCAGGCGGCCCAGTGGGAGAAGCTCGGCTGGAAGCACGACGTCGCCCTCTACGTCCGGATGCTCGTCGGAGCTGAGCAGGGCAACATGAAGGCCGCCAGCGAGGCCCGGCAGTGGTCGGACCGCCTCGGCCTCTCCCAGATGGCCATGCTCCGAAACCGGTGGCGTATCGCCGCCGACGAAGTCGGCGCTCGCCGGACGCAGCAGCGCCCAGCAACGCAGCGCCCGAAGTCGTCCAGGGACCGCTTCCGCGTGGTGAGCAGTGCAGGGGCCTAGCGCGTTCGTCGTCGACTTCCCCACCCTCGGCTTCCTCGCAGCGGACTACATCGAAGCGCACTGCATCGTCCCGAAGGGCTTCAAACGCGGGCTGCCGTTCACCATGGCTGACTGGCAGCTGTGGTGCACACTCAACCACTACCGGGTGAAGCCCACTGCCCTGTGGCGTCCAGAAGATCCGATGCTGGCCACGGCTTTCCAGAACCGGCGCTCGCAGGTCGTCGCTCCGCAGAAGACGGGCAAGGGCCCGTGGTCCGCGGCGATGGTGTGCAACGAAGCGACCGGGCCGTCTCTGTTCGCCGGTTGGGCTCGCGCAGGCGATGTCTACGACTGCGCGGACGATGACTGCCACTGTGGGTTCGTCTACGAGTACTCCCCCGGCGAGCCGAAGGGCATGCGCCGGCCGGACCCGGAGATCCAGCTGCTCGCTGCCTCCGAGGACCAGGTGAGGAACGTCTACGGGCCGTTGCAGGAAATGGTCAAGGGTGGGCCGCTGGCGCAGATGATGCGCGTGGGCGAAGAGTTCATCCGCATCAACGACGGGGGCCGCATCGACCCCATCACCTCCTCGGCGCTCTCCCGCCTGGGCAACCCGATCACCTACGCCAACATGGACGAGACCGGCACCTACACAGACCGGAACAAGCTCAAGGGCGTCGCGCGCACCATGCGCCGAGGCCTCGCCGGCATGGGTGGTCGTGCGGTGGAGACCACGAACTCCTGGGACCCCAGCGAGAACTCCACGGCGCAGAACACCTACGAGGCCGCCGCGGCCGACATCTTCCGCTTCTTCCGCCAGCCTCCACCGAACCTCAAGTTCAAGAACAAGGCCGACCGTCGGAAGATCCTGCGGTACGTGTACGAGGGGTCCACGTGGGTGGACCTCGACGGCATCGAGGCCGAGGCTCTGGAGCTGATGGAGAAGGACCCGCAGGAAGCCGAACGCTTCTTCGGGAACATCCTGTCGCAGGGTAAGGGCGCATGGATGCCGGAAGGCCTCTGGAAGCAGACGGAGGACGTGCCGAAATGAGGATCTGCATCGGCTTCGACGGCTCCGACAAGGACGACTTCACCGGCATCCGCGCAGAAACCATCGAAGGCTGGCAATTCACGCCAACCTACGGGCCGGACAAGCGTCCCACGATCTGGAATCCAGCCGAGTGGGGCGGGCGCATACCCCGCTCTGAGGTGCACGCGGCCATGGACGAAATCTTCGAAGAGCACGACGTCGAACGCGGCTACTTCGACCCTCCGCTGTGGAAGACAGAGATTGAGGAGTGGGCGCTCAAGTACGGCGACGAACGGGTCATCCCGTGGGAAACCTACCGGGTCGCTCCCATGCATGCCGCGCTCGAGCGGTTCATCGTGGACCTCGGCTCCGGAGCGCTCACACATGACGGCTGCCCTACGACGGAGGCACACATCGCCAACGCGCGCATGTTCGCCCGGACTAACCAGCGGTACATCATTCTCAAGCCCTCGCAGACGCAGAAGATCGACCTCGCGGTGACCTCGGTCGTCACCCACGAGGCAGCTGCCGACGCGCGGGCCGACGGATGGACAGACAGGTACGTGGCAGGCAAGGGCGTTTCGACGGCCGCATACGGATTCAGCTAAGGAAGAGGTGGTGGCGTGAAGGAAGCGCAGGCACGCAGCTACCTGGAGACGGGACTGAAGAAGCTCAGCGAGCGGGCCCCGGCGTGGAACCGCCGGCAGGACTACTTCGAGGGCAAGCAGGATCTCCCCTACGCGCCTCCTGGGGTGAACGCCGAGTACCTCGAGCTTCGGGAGATGTCTCTCGCCAACGTCCTCGGCCTGTGCATGAAAGCCCCGATCCAGCGAATGCAGGCCGACGGCTTCCGCACCGGACGCCAGGGCGAAGCAGACCTCCGCTCGTGGGTCGAGATCTGGCAGCCGAACAAGCTCGACTCGCGCTCGCCGATCGTCTTCCAGCAGATGTTCAATCACGGCCGCGGCATCATGTCGGTCTCCCCGAACACCCGGAACCGCAAGAGCCCGATCATCCGCCCGGAGAACGCGAAGCGGGTATGGCTAGAACCGAACCCCGAAGACCCGTTCGAGGCCCTGTTCGCGGTAAAGACGCTCGAGGTCGCAGCCGACGCTGGTCCGATCAGCACGCTCTGGACCCCGGACTCCTACAGCAACACGGTCACGACGAAGATCGCTTACGTCTACACGGCGATGGAGTGGGTGCGCTTCGAAGCGAAGGGCATCTCGGGCAACGTCTGGGAGCTCGCCGACGCCGGCCGCCACAACATGCGCGGAATCCCGTTCGTCGGCTTCGACTTCAACGTGGACGCAGACGGCAACCCCCGCTCAGCGATGGACGAGCTGATCCCGCAGCAAAACGCGGTCAACACCATTCGCTTCCAGACCCTGCTGGCCATGCAGTTCAGCGCATACCGGCAGCGCGTCTTCACCGCGTACGATCCGGTGGTTCGGGACAAGCGCGGCGAGCCCATCTGGCAGACCAACCCGGACGGGACGCTCGTCCTGGATAGCAACCGTCAGCCACTGCCCATCGTCACCTCACCCGGCAGGATCGGCGTCGACCGCGCTCTGGTGTTCCCTGGTGCTGACACGAAGGTGTTCGACCTGCCGGAGTCGAATCTGGACAACTACATCAAGGTCCTCCAGCAGTTCCTCACCGATATGCTCGCCACTGGTCAGATCCCGCCGTCCTATGCACTCACGAAGATGGCGAACCTCACCGGCGATGGCATGGCCGGCGCGGAGTCCACCTTCCAATCCCTCATCAACGACATCCAACGTGCGGCGGGCGAGGGCATCGAGCAGGTGATGCGCCTGGCGAACGTCGCTCGAGGTGAGACGGAGGAAGACCTTGCCTCGGAGGTGATCTGGGCCGACACGGAGATCCGTTCGTTCGCGCAGATTATCGACGCCATCGGCAAGCTCATCACCTCCGGGATGGCCCGCACCGACGCGTGGGCGTTCCTTCCGAGCGCCACCCCGGCGCGCGTCGCCAACTGGGTGTCGAACTCGGATGGCGAAGCGGAAGCCCGCGACGAGCAGCTCCTGCGCACCACGGAGAAGCTGGCGCTCGATGCCTGACTACCCCCAGGCGGCGCAAGCCTACGACCTGCGCATGCGTCGCCTGTCGGTGCGGGCAATCGCTGCAGGGAAGGTCCACTGGCGCAGCGTCAGCGGTGCGAACATCTCCGCGAGCTGGCAGGAAGCATTGGCCTCGCTCGTCCCGGCGGTAACTACGCTTCAGCTCAGTGTCGCCGCCGAGTCTGTCGACTACGTCAGTGCTGCGCTCGCAGGGCAAGGCCTCTACGAAGCCCCGGACGGTTTCGCGGACCCGCGCAGCTTCGTAGGCGTGGCCCCCGACGGCCGGTCGCTGTCAGGGCTCCTGCTTTCGCCAGCCACCAGGGCCAAGACCGCCATAGGGGCAGGCGCATCCCTCGCGGAGGCGCTCGCCGAGGGTGGGAGGGCTTTGGACATGCTGATGAAGACCACGATCTCCGACACGGGCCGCGCGGCGGCAAGCGTCGATCTGGCTGCGCGCCCGAGGACCGGGTACGTGCGGATGCTGAATCTGCCATCGTGCGGCCGCTGCGTGGTCATGGCTGGCCGGTTTTACAGGTGGAATGCGGGGTTCCTCCGGCATCCACGCTGCGACTGCCGGCACATCCCTTCGACGGAGAACGTCGCAGGCGACCTGCGGACGGACGCCTACGAGGCGTTCAACTCCTACACTCCCGCGCAGCAGGACAAGCACTTCGGGAAGGCCGGGGCCCAAGCTATTCGCGACGGCGCCGACATCTCGCAGGTCGTGAACTCTCGACGCGGCATGTCTGCGTCCGGGCTGATGACGAAGGAAGGCACCTCAAAGCGGGGCAACTTCCGCCAAGCATCCGGACGATCGCAGCGCCTCACCCCGCAATCCATCTACCAGCTCCACAAGGGCGACCGCGTGGCTGCAGTCAAGGACCTGGAACGCTACGGATACATCTTGCCCGGCGGACAGAACCCGCTCGGCGCTCTGCGCGGACAGCGCGAAGGCTACGGCGCTCTCGGTCGCGGGGGAACCCGTGTAGGGCCCACTGCCGCCGTAGATCGGGCAAGGGAGACCGGCGCCCGGGACCCGCGCATCAGGGCGACGATGACCGAAGCCGAGCGCCGGCTCGCGGACGCCGAGTCAAACTGGCGGGCAGTCCAGCAGGGACGAAACCCGGCGGGGCGCTCACCGCTCACGCCGGTCGCTGCCGCGAAGGCCGAGGCCGAGTACCGCAAGCTGCTGCGCATGAACGGCGAGGTCTTCACCCGCTGACCACATTCCTACCCGACGGCGCGAGGCCGGCGGGTCCAACTCCGCGATGGAGGAGCACCACCATGAAGAAGAACACCACGCTCGCCCCAGCAGTACTCGCCCCGCACGGTATCGACCTCAACGCTCCCGGCGGCATCGAGGCGCTGCTCGAGCACCACCGGCTGACCTTCGGCGACGCCGTGATGCAGGACGACGGCGCGGGGGCCGGCGGCGACGGCGCGGGTGCCGGCGGCGACGGCTCTGGCAGTGCGTCGGGCGATGGTTCCGGCGAGGGCTCCGGTGACGGCAAGGACGGCACCGATGGCCAGCTCGGCCCCAACGGCCAGAAGGCTCTGCAGACCGAGCGCGACGCTCGGAAGACCGCGGAGAAGCTGGCCAACGAACGAGCCGCCCGGATTACGGAGCTCGAGAATGCTACGAAGTCGGACGAGGAAAAGCGCTCGGAGCGATTCCAGCAGCTCGAGACCACCGAACGCGAGCAGTCAGCCACGATCACCCAGCAGCAGGGCATCATCGACCGCTACCGGGTAGCGGCGGCCAAGGGCCTCGACCTCGAAGCTGCGGAGCGACTCCGCGGTACTACGAAGGCAGAGCTCGAGAAGGACGCTGACAGCTGGATCGCGAAGTGGGGCAACACCGGCGGCGCCCAGCAGGTGCCCGGCGCTGGCGCCCGCGGGAGCGACCGAGTCCAGTCCTCCCCCGGCATCGGAACCCTGCGTGCGGGGTACGACGAGAAGAAGTAAGCACCCAACGGCGGCAGCCCGAGGGCCCATCTAACAGGAAAAGGAGCCTTCAATGGCTGTCACCCTCGTTGAGTCAGCGAAACTGTCTCAGAACACCCTGCAGCGAGGCGTCATCGAGACGTTCGTGCAGACCTCCCCCATCCTCGACCGGCTTCCGCTGATGCCGATCGAGGGCAACGCCTACTCGTACAACTCCGAGGGAACACTCCCCGGAGTCGCCTTCCGTTCGGTCAACGAGGCCTACACCGAGTCCACGGGCACGGTGAACCAGTCGAGCGAGTCGCTGGTCATCCTCGGTGGCGACGCGGACGTGGACCGCTTCATCGTCCAGACCCGCGGCAACCTCAACGACCAGCGGGCCACGCAGACCGCGATGAAGGTCAAGGCTGCCTCCTACAAGTTCCAGGAGTCGTTCTTCAACGGCGACGTCGCGGTGGAGCCCAAGGGCTTCGACGGGCTGCGGAAGCGGCTCATCGGCAACCAGGTCATGGACGCCGCCACGAACGGGTCCCCTGTCCTCGGCAACGGGGCCTCCGACGCCCAGACGTTCTTCGATATGCTCGATGACCTGATCTCCCGGGTCCCCGGCATCGACGGGACGAACGGCGCGCTCTACGCCAACTCCGTCCTGCAGGGGAAGATCCGCTCCGCGGGCCGTCGCCTCGGCGGTGTCGAGACGGTCCGCGAGGACCTCACCGGCAAGCGCGTCCTGCAGTGGAACGGCATCTCCATCCTGGACCCCGGTAACAACGCCGCCGGTGCCCCGATCCTCGGCCAGAACGAGGCCCAGGGCACCTCCGCCGACGCGTCCTCGATCTACGCCGTCAAGTTCGGCAACGACGAAGGCGACCAGGCGGTCACCGGCCTCACCAACGGCGGCGTGCAGGTCGATGACCTCGGCCAGCTGCAGGAGAAGCCGGCGTACCGCACCCGCATCGAGTTCTACTGCGGGCTCGCACTGTTCGGCGGCAAGGCAGCGGCCCGCCTCCGCGGCATCCGCAACCTGTAGCCGACCGGCTCACCGGCCCTACCAGAGGAAGGCAGCACCATGGCGAACGCCACCCAGCCCAAGGCCGAGAAGAGCACGACGCTCGACTCGGACACCACGAAGCCGTCCGTTACCGCTCCCGGCGACGGCCCGGCCGACACGACTGACCCGGACGAGCGCGCGCAGTCCGCGCCCGTGAACCCCGGCCCCGACGCGCTCGCCGCCGGCACGGTCAACGCGGTGAAGTCGGTTCCGAAGTCGAAGGCCGCGACGCGGTCCGCGGCGAAGGAGCGCGTCGAGAGGTACAAGGCGACCAAGCCCAACGGCGACGTCGTCACGGTGACCCGGAACATCGACACCGGTGTTTCCAAGGTCGAGGCGGACAGCTGA
Protein sequences of DBSCAN-SWA_3 >LR131272|1974602:1986869|1977562_1978279_+|VDR32529.1|DBSCAN-SWA MSTGPCTYPRCTDGEGNPVLTSLGICDNDRIRYGQVIRWLADDYATLRLGLPSPTIRGVRVRTTTREEFGHPAEWASDEARSIALLLNQIEDGLREYSGQEPADHPGIFQDRLALQGFAYLALHFESLCTYPEAGTAATAIVDKHRTIRAGLGYTRQADKLPTPCPKCNTVGLVSLGGNRAVIECQACGHRVRPEHYEYLTKVAAETAITTSRASDWDLLDRYDDANSRREVEHGWNT >LR131272|1974602:1986869|1978266_1978788_+|VDR32530.1|DBSCAN-SWA MEHLIDALEAALEQKNWYAALTMALSLPDICGKLEGEAPQTSRARYVRWCEKYLQPKYTAPADWDEIPHVFLSGSDCYALRCALLHEGSAEIIEQSARDALDSFEFTVTDGVDWNIHMNQTDLRLQLSIDVFCDDLAAGAERWLEDMEPNAEVQARMSRMMRITNIGPRRSAD >LR131272|1974602:1986869|1979132_1979471_+|VDR32532.1|DBSCAN-SWA MTKRVCRMHGCPRIQDAKLCDEHARSHEKRRGTRQQRGYDAAHDKERRRIEKQGIENHRCARCGAWFERGEPFQLGHTDDRLNWSGPEHIRCNTSAAGAASHAVFKRPEGRR >LR131272|1974602:1986869|1975836_1976127_+|VDR32526.1|DBSCAN-SWA MYAAFVDSLPEVQAERGTGPGQITFRTPNGALAATGISTSERLFAAAADALALAHAVAAHEDAERDSERQATLSPALADQKGRALVQAIHDRKLNR >LR131272|1974602:1986869|1985543_1986500_+|VDR32539.1|DBSCAN-SWA MAVTLVESAKLSQNTLQRGVIETFVQTSPILDRLPLMPIEGNAYSYNSEGTLPGVAFRSVNEAYTESTGTVNQSSESLVILGGDADVDRFIVQTRGNLNDQRATQTAMKVKAASYKFQESFFNGDVAVEPKGFDGLRKRLIGNQVMDAATNGSPVLGNGASDAQTFFDMLDDLISRVPGIDGTNGALYANSVLQGKIRSAGRRLGGVETVREDLTGKRVLQWNGISILDPGNNAAGAPILGQNEAQGTSADASSIYAVKFGNDEGDQAVTGLTNGGVQVDDLGQLQEKPAYRTRIEFYCGLALFGGKAAARLRGIRNL >LR131272|1974602:1986869|1982065_1983526_+|VDR32536.1|portal|DBSCAN-SWA MKEAQARSYLETGLKKLSERAPAWNRRQDYFEGKQDLPYAPPGVNAEYLELREMSLANVLGLCMKAPIQRMQADGFRTGRQGEADLRSWVEIWQPNKLDSRSPIVFQQMFNHGRGIMSVSPNTRNRKSPIIRPENAKRVWLEPNPEDPFEALFAVKTLEVAADAGPISTLWTPDSYSNTVTTKIAYVYTAMEWVRFEAKGISGNVWELADAGRHNMRGIPFVGFDFNVDADGNPRSAMDELIPQQNAVNTIRFQTLLAMQFSAYRQRVFTAYDPVVRDKRGEPIWQTNPDGTLVLDSNRQPLPIVTSPGRIGVDRALVFPGADTKVFDLPESNLDNYIKVLQQFLTDMLATGQIPPSYALTKMANLTGDGMAGAESTFQSLINDIQRAAGEGIEQVMRLANVARGETEEDLASEVIWADTEIRSFAQIIDAIGKLITSGMARTDAWAFLPSATPARVANWVSNSDGEAEARDEQLLRTTEKLALDA >LR131272|1974602:1986869|1974602_1975034_+|VDR32524.1|DBSCAN-SWA MRISELQTLCYKQSADKGFHDDEPTEGRALLSLNAERIALMHSELSEALEELRTGRAANETWYAPLVIPTSLAIEAGYAEAERLIKQENEGKLQKPEGVPSEMADVVIRVLDFCGANDIDLESMITEKLAYNASRGFKHGKKF >LR131272|1974602:1986869|1983518_1984748_+|VDR32537.1|DBSCAN-SWA MPDYPQAAQAYDLRMRRLSVRAIAAGKVHWRSVSGANISASWQEALASLVPAVTTLQLSVAAESVDYVSAALAGQGLYEAPDGFADPRSFVGVAPDGRSLSGLLLSPATRAKTAIGAGASLAEALAEGGRALDMLMKTTISDTGRAAASVDLAARPRTGYVRMLNLPSCGRCVVMAGRFYRWNAGFLRHPRCDCRHIPSTENVAGDLRTDAYEAFNSYTPAQQDKHFGKAGAQAIRDGADISQVVNSRRGMSASGLMTKEGTSKRGNFRQASGRSQRLTPQSIYQLHKGDRVAAVKDLERYGYILPGGQNPLGALRGQREGYGALGRGGTRVGPTAAVDRARETGARDPRIRATMTEAERRLADAESNWRAVQQGRNPAGRSPLTPVAAAKAEAEYRKLLRMNGEVFTR >LR131272|1974602:1986869|1986539_1986869_+|VDR32540.1|DBSCAN-SWA MANATQPKAEKSTTLDSDTTKPSVTAPGDGPADTTDPDERAQSAPVNPGPDALAAGTVNAVKSVPKSKAATRSAAKERVERYKATKPNGDVVTVTRNIDTGVSKVEADS >LR131272|1974602:1986869|1981459_1982050_+|VDR32535.1|terminase|DBSCAN-SWA MRICIGFDGSDKDDFTGIRAETIEGWQFTPTYGPDKRPTIWNPAEWGGRIPRSEVHAAMDEIFEEHDVERGYFDPPLWKTEIEEWALKYGDERVIPWETYRVAPMHAALERFIVDLGSGALTHDGCPTTEAHIANARMFARTNQRYIILKPSQTQKIDLAVTSVVTHEAAADARADGWTDRYVAGKGVSTAAYGFS >LR131272|1974602:1986869|1980419_1981463_+|VDR32534.1|DBSCAN-SWA MQGPSAFVVDFPTLGFLAADYIEAHCIVPKGFKRGLPFTMADWQLWCTLNHYRVKPTALWRPEDPMLATAFQNRRSQVVAPQKTGKGPWSAAMVCNEATGPSLFAGWARAGDVYDCADDDCHCGFVYEYSPGEPKGMRRPDPEIQLLAASEDQVRNVYGPLQEMVKGGPLAQMMRVGEEFIRINDGGRIDPITSSALSRLGNPITYANMDETGTYTDRNKLKGVARTMRRGLAGMGGRAVETTNSWDPSENSTAQNTYEAAAADIFRFFRQPPPNLKFKNKADRRKILRYVYEGSTWVDLDGIEAEALELMEKDPQEAERFFGNILSQGKGAWMPEGLWKQTEDVPK >LR131272|1974602:1986869|1977182_1977566_+|VDR32528.1|DBSCAN-SWA MTRFPVLTAAEHRLKMCNAWSEQQLQDKVVQLAEELGWMTFHVYDSRRSNPGWPDLVLVHPVCRRLIIWELKSKNGRATPAQLTWLTALRNLGLNVGIKKPEDWASESIQRELAARADSTIEGTPSA >LR131272|1974602:1986869|1975042_1975756_+|VDR32525.1|DBSCAN-SWA MKITIESGQLLAALAATVAHTTSGKEDEYMRTVDVQIIDDHLMLTASNGGTTGVARAALMDTTGELSRFALDRDDIATIRGLFAGTIKDLEVTVNVMTTTPMSPDDKPEVINQVQIRELGALFGGREIRLSTPDQSRRDVEGVWHAIAAGLRKRAPALPRTSLEPKDFALFRAAASAYKEPLSVEPADGFGSLIVQCGRDFIGFCNADARDNPALTSARDSWRNTLPLKMRMADTGS >LR131272|1974602:1986869|1979926_1980433_+|VDR32533.1|DBSCAN-SWA MNSGPAADPQALRRDRPADKDSWTVLPSEGRKGNAPAWPLSKWRDQEKAKDPEERDDAGARALDARELVVWRQIWKTPQAAQWEKLGWKHDVALYVRMLVGAEQGNMKAASEARQWSDRLGLSQMAMLRNRWRIAADEVGARRTQQRPATQRPKSSRDRFRVVSSAGA >LR131272|1974602:1986869|1984810_1985491_+|VDR32538.1|DBSCAN-SWA MKKNTTLAPAVLAPHGIDLNAPGGIEALLEHHRLTFGDAVMQDDGAGAGGDGAGAGGDGSGSASGDGSGEGSGDGKDGTDGQLGPNGQKALQTERDARKTAEKLANERAARITELENATKSDEEKRSERFQQLETTEREQSATITQQQGIIDRYRVAAAKGLDLEAAERLRGTTKAELEKDADSWIAKWGNTGGAQQVPGAGARGSDRVQSSPGIGTLRAGYDEKK >LR131272|1974602:1986869|1976130_1977186_+|VDR32527.1|DBSCAN-SWA MPWLRSGDTAANHPIALAALEHDNFDDRLLNEVFGFVMRCALQSTAHLTDYVVSRGTAIQMAGASRVGLLTDVATFAGYWVEQRVETDGAERTVYKLVEDPEFIHMRTKEEIEWERQRKTDNSNPALIIPVRFRDGDACRYCGKVVNFLARKGRLAGTYDHREPGKTATVDTYVVACGACNAGRSDDPAADTAYPLLPAPPKPYYSPKTIDWFRGHEWAQSNGYRPPRPPARTIAPGQPSGNGTPGQTAPAAPQTQGNGDTGPQNGAEPRKRAEEKPPPQLPARNLRIPADPAGQQCATSRNSGTGRDGTGRVGEVRSGLADQPPSQPDPASKPSKRRRNRSRPRRRGNSS >LR131272|1974602:1986869|1978911_1979136_+|VDR32531.1|DBSCAN-SWA MLTPEELAALESHVTTTQAAERVRRSPAAIRQWVKRGHLTPINPSSTGPKYFRVVDVLRCDRDRRAKIMQLLSA |
17 | Gordonia_phage(75.0%) | portal,terminase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1990496 : 2004601
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >LR131272|1990496:2004601|DBSCAN-SWA CATGGCCGATCGCTCTATAACCCTGCGCCTGACTGCGAACGTGCAGGGGCTGGTCGCAGGATTCCGGACTGGGCAGCAGGCCGCGAAGGACTTCGGATCGCAGACGCAGCGGTTCGCGCAGGACAACCGCCAGTCCTTGGAGCAGCTCGGGCAGGCGGGCATGGCAGTCGGCGGGACCCTGGCCGCTGGTTTCGGCCTCGCAGTCACCAAATTCGCCAGCTTCGACAAGGCCATGTCCTCGGTACGCGCCGCAACGCACGAGACCGAGGGCAACATGGACCTGCTCCGCGAGGCAGCAGTCCGGGCCGGAGCCGACACAGCATTCTCGGCCGAGGAAGCAGCTCGCGGCATCGAGGAAATGGCGAAGGCCGGAGTCTCCACCGCCGACATTATGGGCGGAGGCCTCGACGGCGCGCTCGCGCTCGCAGCAGCCGGCGCACTGGATGTCGGTGACGCTGCCGAGCTGGCCGCTACGGCGATGACCCAGTTCGGCCTCTCCGGAGAGGACATCCCCCATATCGCGGATCTCCTCGCGGCGGGCGCGGGGAAAGCTCAGGGCTCCGTCGAGGACATGGGTATGGCACTCAAGCAGGCAGGGCTCGTCGCGGACCAGACGGGCCTGTCCATCGAGGAGACGACCGGGGGCTTGGCTGCTTTCGCGGCCGCCGGCCTCGTGGGCTCGGATGCGGGCACATCCTTCAAGTCGATGCTCCAGCGGCTGACACCCCAGTCGAAGGAAGCCCAGGAGAAGATGGCGGAGCTGGGCATCAGCGCCTACGACGCCCAGGGCAACTTCATCGGCCTATCCGAGTTCGCCGGCAACCTGAAGCAGTCTATGGCTGAACTGACCGTCGAAGAGCGCAACGCCGCGATGGCTACTATCTTCGGGTCCGACGCCGTGCGCGCCGCGTCAGTGCTGTACGAGAACGGCGCAGAAGGCGTCCAGAAGTGGGAAGACGCTGTCAACGACGCGGGGTACGCGGCAGAGACGGCAGCACTCATGCAGGACAACCTGGCAGGCGACCTCGAGAAGCTCGGCGGGGCCTTCGACACCGTGTTCCTCCAGGCAGGGTCCGGCGCGAACGACTCCCTCAGGTCCCTGGTGCAGACCCTCGAGGACGTAGTCGACCGGGTCGGGGAAATTCCCGGCCCTGTCCTTTCTGCTGTCGGCATCGTCGCCGGCCTCACTGGTGGAGCACTCCTACTAGGAGGGGCCTTCCTCACCGTCGTACCCCGAATCGCAGACACCGTCAGCGCAATCTCTGACCTGCGAGCCAAGGCCCCCGGTGCCACGACCGCGATCGGCAGGCTCGGGCGGGTCGCCGGGGTAGCAGCTGTGGCAATGGTCGCACTTCAGGTCGGGGCCGCGATCAACGACAGCTTGGGTGAAGCGACGAAGAGCGCTGAGGAAATGGCGCAAGCCATCCTCCGTGTGACCCGCGAAGGAGAGGCCTTCAGCGACGTATTCACGTCCGACCTGTTCGCGAGTTCCAACGGCTTCGCGATGCGCGACGAGATCACCGGAATGGGTGATGCGCTCGCCGAGGTGAATTCGATTGATTTCGGGGAGACCGTCAACGACTGGTATCACGGTCTCACAGGGTGGAAGTCCCAGATCAACGAGACGCGCGAAACCATCGCGAACATGGACACCGCCCTGACCGACTTCGCGTCCTCGGGAAATCTCGAAGGAGCAGCTGAGGGGTTCCGACAGATTGCCGCTGATGCCAAGGAATCGGGCGTCAGCCTCGAAGACACGGCAGCCCAGTTCCCGCAGTATCTAGATCACCTTCGGTCGCTTGCCACGGAGGCCGGAGTTGCACTGGATGAGCAGGAGCTTCTGAACTGGGCGATGGGTGAAGTGCCTCCAGCGATGGAGGCAGCTGCCTCCTCAACCGAGGGTTCCGCTGCGGCTCTGGAGGCACAGGGGGCAGCAGCTGAGCAGGCAGCCGAGCTGAACGAGGCGATGCTGGACCGGCTGAAGGAGCTCGGTCTCGCGGCGGACGGAACGATCGTGAACATCCAGGCATACACAGAGGCGCTCTTCGCCTCCGGTCTCGCTGTGATGTCCTCGAGGGACGCGTCGTTCCAGTGGGAGGACTCGCTGCGTGGCATGAACGACGCGATCTTGGAGGTCCTGAACACGCAGGGCGAGCTTGGAGCCGTCCTGGACGGCACGGCCACCGACTTCAATACGATGACGGACTCTGGTCTCGCGGCGAACGACGTCTTTCAGGGTCTGGTCGAGCAGGGCCTGAGTGTGGCTTCAACCTTTTCGGGTGACACAACGAAGTCCATTCAGGACGTGCAAGCGCAGCTGCTCTCGACATATGACGCCGGCGTGCAGTCCGCGATGGGCTTCGGGATGTCGAAGGAAGCTGCGATTGCACTCACACGCGAAGTCCTGCAAATCCCTGAGGGCGTGACCATCGAGTCGTGGATGTCGGAGGAAGCCAAGAACATGGCTGCCGAGACAACCGGTGCAGCCCAGGCTGTCCCGACCAATGTGCGCATCGAGTCCTCGATGTCCGAGGCCGCGAAGCTGACGGCAGACGCGACGAAAAAGTCCGCCGACGACGTCCCTGAGAAGGAGACGATCGACTCATGGATGTCCGACGCGGCGTTCATCGAAGCGGTGCGGACTCGTGCAGCTGCTCTCGGCATTCCGGAAAAGGAAGCCATCGACAGCTTCATGTCCTCGGCCGCCAGGAACGAGGCCGACAACACGACGGCTCGCATCCTCGGCATCCCACCCGGGGTGTCAGTCACGTCCTACATGGACAACTACGCCCGCATCGAAGCGCAGAACACGAAAGCGGCAATCGACGCGATTCCCTCCTACAAGGAATCAAGGATCGTCGTCGTCACCGAACGCAATGAGCGGGTCGGCGTCGGACAGGTAGGCCTCAAAGCCCTCGGCGGACGCCTGCCAGCCCACGCCGCCGGGTACCGGCTTCCATCCACCGGGCCCGGAACGGACGTCACAGACGGCATCCTCGGCATCAGCTCCAAGACCGGCGCGCCCATGTCCTGGCTCGACGGCGGGGAATGGGTCATCAACAGCAAGTCCTCGGAGAAGTACCACCGTGTCCTCGGAGCCATCAACCGCGACGACCCATCGGTCTCCAGCCTCTCCGCGCTCGCAGGCGGCGGGCAGGTCATGCGTAGCGGGCAGCACCCCCCGGCGGGCACCGGGGGTATCGGTTCCGCGCCGCCGGCGATTGACCTCGTGGCGTATGTGCAGAACCCGTGGACGGGCGAGCAGATGCTTGCTCCCGTCCGAGCTGTCGCAAGGGAGGAAGCGGATGCCTCTATATCGAGGATTGACCATCAGATGACACGGTACGACCGTGGTGGGAAGTATGTGGAGCGTATATGGTGACCGACCTTCTGCATGGAGACTTCGAACTCGACGGATATGTCATTGGAGGAGACAGAGACCGCCCTGTCTATATCAAATCCTTTCTACCCGGCCGGTCCGATGTCCGCAGCCAGGACTTCGACAACCCGGTAGGTGATGGTCGGCTGTTCGGCCGTGATGCAGCCCTCGGGTCGACGTGGTCGTTCAGTTTCGGGATGGCCTCCGAGTCGTCGGCCGCTTCGCTGGCCGCATTGAATAGCCTCTCTGCTGCGTGGAAGTACGTTCACCGAGAGCCTGGCGCGGAATCGGTGTTGCGGTACAAAGTGGGAGGCCGGGTTCGGAGAGTTTATGGCCGAGCGCGTCATTTTGACTACGACCCGAACTACGTCTTCTTCACCGGCTACGCAGTAGCATCTGGAGAATTTGTTACGTCGGACGCCGTCCACTATGAGGACGCGTTGCGTTCGGTGTCGGTGGGGCTCCTCGCTGAGGTCTCCGGGTCCCAGCCTTTGCCGTCGGCGCTGCCGTTCACGTTCGCCCCGGGCGGACCCCGCTCGGGGATCATCCAGGACGTGGGCGGGGACGCACCGGCCCCGGTGGAGGTCACCTTCGCTGGGCCGGTCACCGGCCCGTCAGCCACTATCGGCGGTCAGCTCGTCGCCCTGCCCGGGCTGACGCTGGCTTTCGATCAGTCGGTGACGGTGAATACCCGGACGATGACCGTCACCCGCAACGACGGCGCGTCCCTGGCGGGCGCCTTGTCGAGGCGGACCTACCTCGAGGACGTCCGGCTGCAGCCGGGCTTGTCGGAGGTCGTGTACTCCGGTTCGGACCTCACTGGCACCTCCCGCTGCACCGTCGCCTGGCGGCCCGCCCACTACGGATTCTAGAAAGGCACCTCATGGCCATCGAAGTACTGGCAATCGAAGCATCCGGCGTCACGGCCGCAGGGCTCCGCCTGGCCGCGTTCCATGCCACCCGTGGCGGGAACGGGATCAGCCTCCCGGAGGATCTGAAGGTCACGGCCCTGCCGGTACCTGGCGCATTTGTCCGGGTCGTCCGCGGCCCGTCCGGGTCTGCCGCACTGCGGTCCCGGTATGCGTCCGCGGCCGGGCAGTCGTACCTGACGCTCCTGTCGGCGACGGAGGACGTCTCGGTCCCCGCGACGGGCTCCGGGTCGGGCGCGACCCGGTACCTGATCCAGCGGGTGTTCGATCCGAAGTTCGAGGGGCAGGCCCCGCTCGCGGAGTTCGCCCTCGTCCCGGGATCTGCGACCGGGGTGTGGCCGTATAACCTCGGGCTGTCCTACCCGCACGTGGTGCTCGCGCGGATCAATCAGCCGGCGTCGACGGCGACGATCACGCAGGCCATGATCGAGGACCTCCGGCAGCTCGCCTACCCGCTACAGGACCGGCAGCTTCTACCCCCGATCTACCCGTCGGCGAACGCGAACATCCCGACCGCCGGGTACGCCTCGTGGCCGATCACCGCCGCGCAGCGGCCCATCCTGCAGGTCCCAGAGTGGGCGACCCGCCTCGACGTCGTCGCACACCTCTCCGGGATCGAGTTCATCGGCACCACGAAGACGGTCGCCGGCATCCGGTCCGGGTTCGGCGCGGAAGGGGCTGAGAACGGGATCATCATCGCGGACAAGGCCGGCCGCGGCCACTACACGGTGGTCGGTACCCACCAGGTGCTCGCCGCGCAGCGCGGCACCCGGCAGGCACTGAACCTGCAGGGCGTCCGATCCTCGGGGACCGGGTCGTGGCAGGCCGATTACCAGTCCGCGGTTGCAATCGATGTGCAGTTCAGCGCGGTCCCTGTCTGATGCGCTACATCGCCACCCGGCTCAACGGCGACGGCACAGAGACCCCGCTGTCCTTCGACGTGCCGCTGCAGGGCGTGCGGACCACCGACGACCTGTCCGGCCCCGGCGGCCTCGAGGGCTCCATCAGCCCGGAGGTGCAGCGGCTGCAGACCGCCGGCGGGGAACCGATCTTCCACGCGTGGTCGACGGCGCTCTACGCGGAGGTCGACGGGAAGCTGCGCGGAGGCGCGATCCTCGCCGGCCTCAAAGCGCAGGGCCCGTCCCTGTCCCTGGACTGTGTGGGCTTCACCGGGTACCTGAAGGACGAGCCCTACACCGCGGACTACTCCAGGGTCGCCGTCGACCCACTCGACGTCGCACGCCACCTATGGGAGCACCGGCAGGCCAAGACCAACGGGAACATCGGCCTGACCGTCGACACCACAACGTCACCAATCCGGATCGGCACCCCGGTGAAGGAGACCAGCTTCACGACCGGCGCCGGCGAGGACGTCAACTTCGAATCCGGCCCCTACACCCTGGCCTGGTGGAAGACCCGGGACCTTGGCAAGGAATTCGACGACCTCGCCACCAGCACCCCCTTCGATTACCGGGTGTCCCATGAGTGGGACGGTGAGACGATCCGACACCGCCTGATCCTGGGCTACCCGAACCTCGGCGCTCGCCGGGAGGATCTGCGGTTCGTTCTCGGCGAGAACCTGTTCCAGCGGCCGACGATCGACCTGACCGGCGGGGACTACGCCTCCGAGGTCCTCGTCCTCGGCGCCGGCGAAGGCCGGAAGATGGTCCGCAGCGTCCAGTCCACCCCGACCAGGCGCCTGCACCGGACCGCGGTCCTGGAGGACAAGTCCCTGCAGTCCAAGGCCGCAGCCGACCGGGCCGCGCTGCTGGAGCTGCGGTCCCGGCTGGGCGAGGTCGACATCACCGAAGTCGACGTGACCAATCACCCGCACGCCCGCATCGGGGAGTACTCCCCCGGCGACGAGATCCTCATCCAGACCCGCCACGACTGGGCCGGGGAGGTGTCCCTGTGGGTGCGCATCCTGTCGATCACCCTTGACCCGCAGACGGAGCGTTCCGTGCTGTCCGTCACCCGAGTAGAAAGGGTCTGAACCACGATGACCCAGCAGGACTACGCACGGATCGCCCGCCGTCTCCACGCGCTCGAGCGGAACCTGGCAGGGCTCTCCACCCCGCAGCTCATCCACTCCTCGATCGAGAATGGCAGCATCGACGAGTACGACGAGGCCGGGAACCTCGTCGGCGTGGTCGGCAAGCAGTTCGACGGCACCCACGGCGCCGTAGCCTTCCAGGGCCCCATCCCGCCAGCACCGACCGCCGCAACCGTCACTGGGGCGCCCGGTGCCCTATCGATCGTCTGGGACGGGTACTTCGCTGGCAATGCCGCCCGGCCTTTGGACCTCGATTTCATCCGCACCCGGGTCGCGGACAACGAGGCCATGACTAACCCGGTCTCCGCCGGCACACTGGTCGCCGCCGGGCAGGTCGTTGCGCAGATCGCCGCTGGCACCTACTGGGTGCAGCTCGTCGCCGAATCCAAGCCCGGCCGCCAGTCCGACCCGAGCCCCGCCGTGCAGGTCGAGGTCCTCCTCCCCGTCGACGTCGACGCCATCCAGGACGAGATCGACCAGGCCAACACCCGACTCGACGAAGCCAAGGCGGAGCTGGAGGCCGCGCAGCAGCAACTCGCCGCCGCGCTCGCCGCCAGCGACCTCGACCAGGCCGCCCTCGCAGCCACCGTGTCCACGCTGAAGAACACCACCCTGCCCGCGCTGACCACGGACCTATCCAACGCGCAGGTCCGTCTGGACGCCGCCGAATCCGACCTCTCGGACGCGTTCGGGGCGATCAACGCCGTCCCCGCCCAGATCAGCACCGCGAAGCAGCAAACCCTCGACGCCGCCGCGGCCGACGCCACCGCGAAGGCCGACGCGGCGAAGGCTGCTGCCGCCCGCGCGGCAGGGATGGCCGGCGGGATCACGTGGGTGCAGTCCACGGAGCCAATCGTCAGCACCCAGTACGCGTGGACCGGCACACCCGGCCGCAGCGCGAGCACGTGCACGATGCCCGACGGAACAGTCCGCACGAACCACTCGACCAGCCCGCAGGGCGGCGAGCCCGGCTGGGCGACGAGCTCGGGCTACAACAGCCAGCCGACCTCGACCATGCAGCGCCGGCCGGGCATCAATACGCGCATGTGGGTGAAGTCCGGCACAGGACGATACAGCCCTTACGGGCGTGCCGCGGGAGCCGGGAACACCGGGGCGAGCGGTAGCCAGCTCGCCGCAGCGACCATGCGCGTCGCCCCCGGCCAGTCCGTCGGAGCCGGCCTGTGGATGTACTCCCCCGTCGCAATTCCCATCGCCCGCCTGACCATGCGTTGGCAGGACGACGCGGGAGCCAGCAAGGGCACCGTGAACGGGCCCACGGCCTCACTGCCAGCGAACACGTGGCAGTGGTTCGCGGCCGTCGGCGTAGCACCCGCGGGCACCACCAACGTCCAGCTCGAGAACGTCCTCGACTTCGGCACTGACGGCCTCGGCATCGACGGGCTGCGGACCTACGGCAGCGATGCACTGCAGGAGCTCCGCACGACGACGCCCACCAGCGACGAGTACTTCGACGGCGGCACGGGAGACCTTCGCCTCACCGGCCTGTGGATAGACACCACGGACGGGAAGAACATCCCGAAGAAGTGGAACGGGGCAACCTGGACGCCGATCACCGATCAGGCCGCCATCAAGGCCGCCACCGACGCAGCGGCCCTGGACGCTCAGGCCCGCGCGGATAAGGCCCTCGCCGACGCGAAAACCGACGCTGCAGCGAAGGACGCCCTGGTGCGGGAGGCCGCAGCATCCGACGCGACGACGAAAGCCGACGCGGCGAAAGCTGTGGCCACGACGGCACAGACGGCCGCGGACCAGGCGAAGGCGGACGCGCTCGCCGCGTCCGGTCTGGCCGGCAGCAAGGGTGAGGTGATCTACCAGCTCAGCGCACCCACCGGTACCCGGGCTGTCGCGCAGAACCTCTGGATTCGCTCGTCGGACAACAAGCCGCACCGGTGGAACCCGGACAGCACGAACGTCTCGAAGTGGGAGGCCGTCACCGACAAGGCGGCGCTCGACGCCGCCACCGCGGCAGGGGCTGCCCAGACAAAGGCGAACGAAGCCGCAGCCGCGGCCGCGACCGCCCAGACCCGGGCGAACGACGCCTACTCGCTCGCCGGCGGAGCGAAGGAGACCGCCGACCTCGCCCTGTCCAGCGCGAACGGGAAGACCAAGGTCTACCGGGACCTCGCTGCCCCGAGCGGCGCCGGGAGCACGGCCGGTGACATCTGGTGGAGGTTCGCGGACGACACCTACAAGGTCGTCATCGATGAATGGACCTGGACCGGCACCGCCTGGTCGCAGCAGCAGCGCGGGCACCAGTCCATCGCCAGCGTCGACCTCGGCGCCCTCACCGTCGTGGGAACGTCCACGCTGGCCGACGTCGTCGCCAGGAGCGTCGCGGGTAGCACCGCGTCCTTTCAGCAGGTCGACGCGAAGAACCTCTTCGTCACCGGCACGGCGTCCCTCGCGGACGCCGTCGCGAAACGGCTGGCCGCGGAGACCGGAAGCTTCATCAGCCTCGCCGTATCGCAGCTCACCGCCGGCGCCGCCAGCATCAGCACCGGCGTCATCGACAAGCTCTACACCGAAGTCGTCAACAGCCGGAAGATCACCGCCGGGCAGATCGCTATCGGTGACTACACCAACCTCCGGGCGAACGGGGATTTCGCGCTTGGCTTCGACCAGTGGTCCGCCGACGGATGGACGATCGTCACCGGGGCAGGGCCAAACGGGGAGAACGTCGCTCAGTACGTCAACACAACCCCAAGCACGGTCTACGCGCCGACCTCCTACGCCTGGCGAATCCCGGTCACCCCCGGGGAGCTGCTGTACTTCTCAGCAACGGTAAAGCTCACCGGCGGGTCAATGCCCTCGGGCGCTGACCTCAGGGGTTACTTCTACGACGCCAACAACAGCAACATGAACGTCTCGGTCCTCGGCACCAAACCCATGGCAGACGGCGTTTGGCACACCTACGAAGGGACCTACGAGGTCCCCGCCGGCCGACGGCTCATGTACCCCCGGTTCACCTGGTACAACCCGGCCAACGGGACCTACCAGATCACCAACGTCATCGTCCGCCGTGCCGTCGGAGCGGTCATGATCGGAGATGGGGCGGTCACCGCACCGAAGATCGTCGCAAGCGAGGAACTGACAGCGAAGATCGCGCAGTTCCTCACCGTGAAAGCGAACCAGGTCGACGTGAATGACCTGTGGGCGGACACTGCGTGGATCTCCAAGGCCAACGCCCAGGTCCTCACACTCCTGTCGAACACGGACGGGTCAGGGTTCACCTCGCAGATCACCAGCGAAGGCCTCCGCGTCTTCTACACCGACCCGGTCACCGGGGAGGAGCAGGACCGGATCAGCCTCGGCACGTTCAACGGGACCGCCGACTACTTCGGCCTCACCGACGACAGCAACGAGCTGTCGGTCTCAATCGACAAGGACGGAGGTATCGCCGGCCGGGACCTCGCAGCCCGGGACTCCTTCACCTACAAGGGCACCGAGCTGCAGGACCTCCTCGACGGGTCCGGCAGCAGGCTGGCAGCCTGGGCTTCCCGTGCCAGCTCAAGCCTGTACTGGGCGAACGCACTCCAGCCATACCTTCACCTCCGGTTCGAGGCCGCGCCCGGCCGCGCCTACATGATCCAGACCACCCCGATCTGGTTGGACGGCGACACCGCCAACACCGAAGCCCGCGTCCACCTTCACTACGAAACGTCGGGCACGAACGCGACCACGGCGTCCCCGGTGATCGCGGAAGGCATGTCCGTGCAAGCCAGCCTGCAGACCCGGCGCTCGCCGGTCACGATCAACCGGCTCATCACACCGCCCGCCGGTGACGTCAGCCTTCTCCTGTCCTACTCGTGCCTGGGCGGGAGAGCGAAGATCTCCGCAGGTACGGGCACCAGGACGGTGAGCTTCACCGTGATCGACATCGGGCCTGCCATCCCGGAGACCGGTGAGATTCGCAACGGATCAGGAGACGCGCCCACGGGGTCGGCCACCCAGCCGCCCCAGACCGATCCGACGCCGCCGCCGATCGTGCGGAACTACGACCGGTACTTCGAATACACCGGCGTCAGGTCCTACCTCGGCAGCGGGGCCCAGTACACCTACCAGCCGGGAAAGGGATACCAGGGCCTCCAGCCCTACACGAACAGCGGCAACCTGAAATCGATCTGGACGTTCCCGTCCCTGACCGCCGAGCTTTCGGGCGCGACGATCACGGACATCTACGCCTACTTCTACTTTGAGCACTGGAACTACGGCGCCGGCGGCACCGCCCGCATCCGCACCCACGGCCACAGCTCAGCCCCCACCACCTACGGGGGCATCGGCAACGGCATCGACCGCGCCAAGTGGCCACGCGCCACCGGGCTCTGGGTGCGCCTCCCCGACAGCCTCTACGCCGGCTTCAAATCCGGTGCGGCCCGCGGGCTCGCCCTCGACGGAGACAACACACTCCAGACCTACGGCATCGCCAACCGGGCACGCCTGCGGATCAAGTACACCAAGTAAGGAGAACCACCATGTCCGACCTGCTGAAATACGCCCGGGCCCGCGACGACACGGACTTCGTGTCCCGCATCGCCGCAGCGATGACCGTCCGCGCCCAAGAGATTGAGCTGTTCGAGCTGAGCCCGCCGTCCAGGGCCCTGTGCTCCTGGGTGCTGGAGAACCCGATGCAGGTCGTGGACCGCATGGTCGCCCACGTGTCGACCAGCCCGGGCATCGCCGCGAACGTCACCGTGAGCGAGGGCACCGTCGACGCATCGACGGTGCCCGACGCCGACATCCAGTACACGGTGAACGAGAAGTGGGACGCCGTCGCCGGCTACCTCCACCGAGGCACCGAATCGACGACAGCATGAACCGCGGCATGCTGCGTCGCGCGAAACCCTATGGGCCCCGCGGGGTGATCATGCTCGCCCTGTCAATCGTCGCGCTCGGCAGGGCGGTCGCCTACCTGCCCTCCGAGCACCCCACGTTCACGCCGGCACTGCTGCAGGAGTGGCCGATACCGATACCCGTCTGGGGTGCCCTCTGGGGTGCGGTTGGCATCCTCCTGCTCGTGCAAGCGTTCCGCCGCGACCACGCGCTCGCGCTAGCCGTCATGGCCTCGATGGCAACCCTGTGGGCCGGCGTGTATGTGTGGGCCGCGAGTACGCGGACCTACGTGGACGGACTCGACGCCGCCCGTGGCTCATGGATCACCGCTGCCACGTACATCGCGACGGCGGTGATCGTCGTGTGTATCTCTCGGATGATCAACAAACTCGATCGGGAGGTTCCGCATGCTTGACGCCATCCTGCAAGTGACCGGCAGCCTGGCTCTCGCGTCCGTTTCGCTCGCCTCAGCATGGTTCGTGTACCACTCCACCAGAAAACGAGACAAGGTCACAGCCGAACTGTCCAAGGAAGCGAACGCCATCACCTGGTCGCAGGATCTCGTGAAGCGCCTCGACAAGCTCGAGGACGAGCTCAAGGAAGTGCGAGCGGACCTCGACAAGGTAACCCGCACTTTCCGAATCAGCATGAACTACATCGAGCGCTTGTGGCAGTGGGCGAAGACCGGGTCCCGCCCTCCCATCCCGGACATCCCCGAGTCGCTCTACGAGCACCTCGACCCATCGCTCATCGACGAGCACCACCGACAGCAGCGGGAGCACGACGCGTCCCCAAAAGCCTGACCGGTCTCACGATCAGCACGAGCCCCACCATCTGGTCGGGGCTTTTTCTATGCCCAACGGAAGGACACGCTCATGCCTGATCTCTGGCTGCCGGGCGCGGTGAAGGACCCGCAGCCCGGCGGCGTACGCCTGAACAAGGCGCTGCCAGCGCGCGGTACGTGGCACATCACCGCGGACCGCCTCGACCCGGTTACCATGGCGCAGCCCGCCCGGGCCAACGTCTGGAACTACCTGCGGAACGTCGGCTACTGCCCGCACCTGCTCTGGGACCCCTTCGACGGGTACCTGGTGCAGGCGTACCCGGCCGACGTCGGCGCCCGGGCGCTGTCTCGTTGGAACGAGGACGGGGCGGTGAACCTGCAGGTCGAGATCTACTTCAGCCCCGGAGTGATCCGCGGGGGAACCCAGTACCTCAGCGTTGACCAGACACCCTGCACGGGTCTGGAGGAGATCGTGGACTGGATGGAGTCCTGGGGCGTTCCGCCCGTCTGGCCGCTCGGCTCACCCACCTGGGAATCGACGCAGGACGCTGACGTGTGGAACGCCCGGGCAGGCCACTACGGCCACATCCAAGTCCCCGGCGAGAACCACCGGGACCCTGGGCCCATGCCGGGACTCCGAGCAGCAGTCACCGAAGCCGCCGCTGGCCTGCTCATCCCCGAGCTGGGAGGGCTGAGCGCATGAGTATCGCGATGCTCGAGCGGCTGCCCGTCGGCGACTTCATCAGCCAGGCGTGGAAAGCGAATGCCACCGCCGGCGTCACCCCGAACCCGAACGGCACGACGGTCGAGCAGCTCGTCGCGAAGCACGGCAACTACCAGGAGTACGGCCACGACGGCATCGACTTCGGCTGCCAGATGCGGACCCTCGTCTTCGCCCCGGGCGCCGGCCGTGTCGACTTCGCCGGCTGGTTCAGGGACATGCCGCAATGGGTGGCCGACAAGTACGGGTACCTGGTCAACGACGACAGCGGTGGCATCGCGGTCCTCATCGACCACGGCAACGGGCTCCTGTCCGCGCTACTCCACCTCGACCAGACCGACCTCACCGCCGGCACCTGGGTGCTGGCGGGCAACCTCTGCGGATACAGCGGCACCACCGGGCGCTCCGGCGGCCCGCACCTCCACTGGTCCCTCATCGTCGCCGCCGAGGTCTACACCACGGTCATGTATGGGCGCATCAACCCGCTCTCCCGCATCCCCGCGGGCCTGACCATCCCCATCGCCCCGGGCAACACCGGCGGCGCAGCAGACACAACCGACACCCTCATCGCCGGGATCTCCGGCCTTTCCAAGTAAGGATCACCACCATGGCAACACCACGCGAGAAGAACTTCGACGGCCCGGTCGTCGACGGAGCCAAGAAGGCCGTCGCCTACCTCGACCTGAAGATCAGCGACGTCGCCGCCAAGGTCGCATCCACCCCGATCAAGCTCCCCAACGGGGAGGTGCGACCGCTGGGCAACACCATCGGGTGGATCGACAACAACCAGAACGCCAACGCCGCCAAGATCATCGGCATGCTGGACGGGCTGACGCAGACCATCGCTGCGCAGTCCACTGACCCGGCTTTGAGCCCGGCGGTCGTGCAGAAGATCATGGGCGATGCGGCTCAGAAGGCGTTCGGGGGCTTCCTCGACCAGCTGCGTCGCGACGCCGATGCCCAGGGAGGCGAGGAGTGATGGCCACCATCGAGGCAACACCCACGCAGGCCGCGCACCCGTGGCGGGCCACCCTCCGCACCGCCGTGGCGGTCGGCATCCCGGCCTTCGCCGGCCTCGTCCTCCTGCTGCCCCTGGTGCTCTCTGAGCTCGCCAGCGGGCCGCTCAGCGAATACTTGCCACCTGGCTTCATCGCCTGGCTCGTCGCGGCCGCCGGGTTCATCACCGCGGCGTCCGCGGCGATCACCCGCATCATGGCGATCCCGGGCGTCGTCGAGTGGTTCCGCAAATACCTGCGGGCACTCTCCCCCGACGGCAACCCGCCCGGACGCCACGAGGCCAAGCCGATCGAGGACACCACCGCGGACGCTGCAACCAGGCAGCAGGCGGCGTCCCTCGACCGCGTCGACGGCCCCGACCACCGACTCTGA
Protein sequences of DBSCAN-SWA_4 >LR131272|1990496:2004601|1996802_2001338_+|VDR32554.1|DBSCAN-SWA MTQQDYARIARRLHALERNLAGLSTPQLIHSSIENGSIDEYDEAGNLVGVVGKQFDGTHGAVAFQGPIPPAPTAATVTGAPGALSIVWDGYFAGNAARPLDLDFIRTRVADNEAMTNPVSAGTLVAAGQVVAQIAAGTYWVQLVAESKPGRQSDPSPAVQVEVLLPVDVDAIQDEIDQANTRLDEAKAELEAAQQQLAAALAASDLDQAALAATVSTLKNTTLPALTTDLSNAQVRLDAAESDLSDAFGAINAVPAQISTAKQQTLDAAAADATAKADAAKAAAARAAGMAGGITWVQSTEPIVSTQYAWTGTPGRSASTCTMPDGTVRTNHSTSPQGGEPGWATSSGYNSQPTSTMQRRPGINTRMWVKSGTGRYSPYGRAAGAGNTGASGSQLAAATMRVAPGQSVGAGLWMYSPVAIPIARLTMRWQDDAGASKGTVNGPTASLPANTWQWFAAVGVAPAGTTNVQLENVLDFGTDGLGIDGLRTYGSDALQELRTTTPTSDEYFDGGTGDLRLTGLWIDTTDGKNIPKKWNGATWTPITDQAAIKAATDAAALDAQARADKALADAKTDAAAKDALVREAAASDATTKADAAKAVATTAQTAADQAKADALAASGLAGSKGEVIYQLSAPTGTRAVAQNLWIRSSDNKPHRWNPDSTNVSKWEAVTDKAALDAATAAGAAQTKANEAAAAAATAQTRANDAYSLAGGAKETADLALSSANGKTKVYRDLAAPSGAGSTAGDIWWRFADDTYKVVIDEWTWTGTAWSQQQRGHQSIASVDLGALTVVGTSTLADVVARSVAGSTASFQQVDAKNLFVTGTASLADAVAKRLAAETGSFISLAVSQLTAGAASISTGVIDKLYTEVVNSRKITAGQIAIGDYTNLRANGDFALGFDQWSADGWTIVTGAGPNGENVAQYVNTTPSTVYAPTSYAWRIPVTPGELLYFSATVKLTGGSMPSGADLRGYFYDANNSNMNVSVLGTKPMADGVWHTYEGTYEVPAGRRLMYPRFTWYNPANGTYQITNVIVRRAVGAVMIGDGAVTAPKIVASEELTAKIAQFLTVKANQVDVNDLWADTAWISKANAQVLTLLSNTDGSGFTSQITSEGLRVFYTDPVTGEEQDRISLGTFNGTADYFGLTDDSNELSVSIDKDGGIAGRDLAARDSFTYKGTELQDLLDGSGSRLAAWASRASSSLYWANALQPYLHLRFEAAPGRAYMIQTTPIWLDGDTANTEARVHLHYETSGTNATTASPVIAEGMSVQASLQTRRSPVTINRLITPPAGDVSLLLSYSCLGGRAKISAGTGTRTVSFTVIDIGPAIPETGEIRNGSGDAPTGSATQPPQTDPTPPPIVRNYDRYFEYTGVRSYLGSGAQYTYQPGKGYQGLQPYTNSGNLKSIWTFPSLTAELSGATITDIYAYFYFEHWNYGAGGTARIRTHGHSSAPTTYGGIGNGIDRAKWPRATGLWVRLPDSLYAGFKSGAARGLALDGDNTLQTYGIANRARLRIKYTK >LR131272|1990496:2004601|2004190_2004601_+|VDR32561.1|DBSCAN-SWA MATIEATPTQAAHPWRATLRTAVAVGIPAFAGLVLLLPLVLSELASGPLSEYLPPGFIAWLVAAAGFITAASAAITRIMAIPGVVEWFRKYLRALSPDGNPPGRHEAKPIEDTTADAATRQQAASLDRVDGPDHRL >LR131272|1990496:2004601|1990496_1993877_+|VDR32550.1|tail|DBSCAN-SWA MADRSITLRLTANVQGLVAGFRTGQQAAKDFGSQTQRFAQDNRQSLEQLGQAGMAVGGTLAAGFGLAVTKFASFDKAMSSVRAATHETEGNMDLLREAAVRAGADTAFSAEEAARGIEEMAKAGVSTADIMGGGLDGALALAAAGALDVGDAAELAATAMTQFGLSGEDIPHIADLLAAGAGKAQGSVEDMGMALKQAGLVADQTGLSIEETTGGLAAFAAAGLVGSDAGTSFKSMLQRLTPQSKEAQEKMAELGISAYDAQGNFIGLSEFAGNLKQSMAELTVEERNAAMATIFGSDAVRAASVLYENGAEGVQKWEDAVNDAGYAAETAALMQDNLAGDLEKLGGAFDTVFLQAGSGANDSLRSLVQTLEDVVDRVGEIPGPVLSAVGIVAGLTGGALLLGGAFLTVVPRIADTVSAISDLRAKAPGATTAIGRLGRVAGVAAVAMVALQVGAAINDSLGEATKSAEEMAQAILRVTREGEAFSDVFTSDLFASSNGFAMRDEITGMGDALAEVNSIDFGETVNDWYHGLTGWKSQINETRETIANMDTALTDFASSGNLEGAAEGFRQIAADAKESGVSLEDTAAQFPQYLDHLRSLATEAGVALDEQELLNWAMGEVPPAMEAAASSTEGSAAALEAQGAAAEQAAELNEAMLDRLKELGLAADGTIVNIQAYTEALFASGLAVMSSRDASFQWEDSLRGMNDAILEVLNTQGELGAVLDGTATDFNTMTDSGLAANDVFQGLVEQGLSVASTFSGDTTKSIQDVQAQLLSTYDAGVQSAMGFGMSKEAAIALTREVLQIPEGVTIESWMSEEAKNMAAETTGAAQAVPTNVRIESSMSEAAKLTADATKKSADDVPEKETIDSWMSDAAFIEAVRTRAAALGIPEKEAIDSFMSSAARNEADNTTARILGIPPGVSVTSYMDNYARIEAQNTKAAIDAIPSYKESRIVVVTERNERVGVGQVGLKALGGRLPAHAAGYRLPSTGPGTDVTDGILGISSKTGAPMSWLDGGEWVINSKSSEKYHRVLGAINRDDPSVSSLSALAGGGQVMRSGQHPPAGTGGIGSAPPAIDLVAYVQNPWTGEQMLAPVRAVAREEADASISRIDHQMTRYDRGGKYVERIW >LR131272|1990496:2004601|1993870_1994746_+|VDR32551.1|DBSCAN-SWA MVTDLLHGDFELDGYVIGGDRDRPVYIKSFLPGRSDVRSQDFDNPVGDGRLFGRDAALGSTWSFSFGMASESSAASLAALNSLSAAWKYVHREPGAESVLRYKVGGRVRRVYGRARHFDYDPNYVFFTGYAVASGEFVTSDAVHYEDALRSVSVGLLAEVSGSQPLPSALPFTFAPGGPRSGIIQDVGGDAPAPVEVTFAGPVTGPSATIGGQLVALPGLTLAFDQSVTVNTRTMTVTRNDGASLAGALSRRTYLEDVRLQPGLSEVVYSGSDLTGTSRCTVAWRPAHYGF >LR131272|1990496:2004601|1995683_1996796_+|VDR32553.1|DBSCAN-SWA MRYIATRLNGDGTETPLSFDVPLQGVRTTDDLSGPGGLEGSISPEVQRLQTAGGEPIFHAWSTALYAEVDGKLRGGAILAGLKAQGPSLSLDCVGFTGYLKDEPYTADYSRVAVDPLDVARHLWEHRQAKTNGNIGLTVDTTTSPIRIGTPVKETSFTTGAGEDVNFESGPYTLAWWKTRDLGKEFDDLATSTPFDYRVSHEWDGETIRHRLILGYPNLGARREDLRFVLGENLFQRPTIDLTGGDYASEVLVLGAGEGRKMVRSVQSTPTRRLHRTAVLEDKSLQSKAAADRAALLELRSRLGEVDITEVDVTNHPHARIGEYSPGDEILIQTRHDWAGEVSLWVRILSITLDPQTERSVLSVTRVERV >LR131272|1990496:2004601|2002582_2003194_+|VDR32558.1|DBSCAN-SWA MPDLWLPGAVKDPQPGGVRLNKALPARGTWHITADRLDPVTMAQPARANVWNYLRNVGYCPHLLWDPFDGYLVQAYPADVGARALSRWNEDGAVNLQVEIYFSPGVIRGGTQYLSVDQTPCTGLEEIVDWMESWGVPPVWPLGSPTWESTQDADVWNARAGHYGHIQVPGENHRDPGPMPGLRAAVTEAAAGLLIPELGGLSA >LR131272|1990496:2004601|1994757_1995684_+|VDR32552.1|DBSCAN-SWA MAIEVLAIEASGVTAAGLRLAAFHATRGGNGISLPEDLKVTALPVPGAFVRVVRGPSGSAALRSRYASAAGQSYLTLLSATEDVSVPATGSGSGATRYLIQRVFDPKFEGQAPLAEFALVPGSATGVWPYNLGLSYPHVVLARINQPASTATITQAMIEDLRQLAYPLQDRQLLPPIYPSANANIPTAGYASWPITAAQRPILQVPEWATRLDVVAHLSGIEFIGTTKTVAGIRSGFGAEGAENGIIIADKAGRGHYTVVGTHQVLAAQRGTRQALNLQGVRSSGTGSWQADYQSAVAIDVQFSAVPV >LR131272|1990496:2004601|2001349_2001691_+|VDR32555.1|DBSCAN-SWA MSDLLKYARARDDTDFVSRIAAAMTVRAQEIELFELSPPSRALCSWVLENPMQVVDRMVAHVSTSPGIAANVTVSEGTVDASTVPDADIQYTVNEKWDAVAGYLHRGTESTTA >LR131272|1990496:2004601|2001687_2002122_+|VDR32556.1|DBSCAN-SWA MNRGMLRRAKPYGPRGVIMLALSIVALGRAVAYLPSEHPTFTPALLQEWPIPIPVWGALWGAVGILLLVQAFRRDHALALAVMASMATLWAGVYVWAASTRTYVDGLDAARGSWITAATYIATAVIVVCISRMINKLDREVPHA >LR131272|1990496:2004601|2003819_2004191_+|VDR32560.1|DBSCAN-SWA MATPREKNFDGPVVDGAKKAVAYLDLKISDVAAKVASTPIKLPNGEVRPLGNTIGWIDNNQNANAAKIIGMLDGLTQTIAAQSTDPALSPAVVQKIMGDAAQKAFGGFLDQLRRDADAQGGEE >LR131272|1990496:2004601|2003190_2003808_+|VDR32559.1|DBSCAN-SWA MSIAMLERLPVGDFISQAWKANATAGVTPNPNGTTVEQLVAKHGNYQEYGHDGIDFGCQMRTLVFAPGAGRVDFAGWFRDMPQWVADKYGYLVNDDSGGIAVLIDHGNGLLSALLHLDQTDLTAGTWVLAGNLCGYSGTTGRSGGPHLHWSLIVAAEVYTTVMYGRINPLSRIPAGLTIPIAPGNTGGAADTTDTLIAGISGLSK >LR131272|1990496:2004601|2002114_2002510_+|VDR32557.1|DBSCAN-SWA MLDAILQVTGSLALASVSLASAWFVYHSTRKRDKVTAELSKEANAITWSQDLVKRLDKLEDELKEVRADLDKVTRTFRISMNYIERLWQWAKTGSRPPIPDIPESLYEHLDPSLIDEHHRQQREHDASPKA |
12 | Brevibacterium_phage(37.5%) | tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|