Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
LR134000 | Escherichia coli strain NCTC9066 genome assembly, chromosome: 1 | 9 crisprs | cas3,csa3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG,RT | 0 | 24 | 12 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134000_1 | 995042-995926 | Orphan |
I-E
Consensus repeat of LR134000_1
|
14 spacers
spacers of LR134000_1
>1.1|995071|32|LR134000|PILER-CR,CRISPRCasFinder,CRT TTCCGCGACCCGGCGATAAGGGAAGATGGGTG >1.2|995132|32|LR134000|PILER-CR,CRISPRCasFinder,CRT TAACGACAGAGGGATTCGGCAGCGAAGAGGAT >1.3|995193|32|LR134000|PILER-CR,CRISPRCasFinder,CRT CGTAGTTTCGGCAGTCCAGTGCCTCGTTACGT >1.4|995254|32|LR134000|PILER-CR,CRISPRCasFinder,CRT ATAGAACGGGACGAGATTTTTAAACAATGGCT >1.5|995315|32|LR134000|PILER-CR,CRISPRCasFinder,CRT CAATCTGAGCCAGACGCGACGAATAAAAGCAT >1.6|995376|33|LR134000|PILER-CR,CRISPRCasFinder,CRT TTGACGTTGATTTTGTTCGTTATGTTGCCAGCC >1.7|995438|32|LR134000|PILER-CR,CRISPRCasFinder,CRT CTCTGATTCATCGGCGGCGATACTGTCATCAC >1.8|995499|32|LR134000|PILER-CR,CRISPRCasFinder,CRT GAAAAACAAATAGATGGATAGCTCGATATCAT >1.9|995560|32|LR134000|PILER-CR,CRISPRCasFinder,CRT CGGCTTATTGCTCTTGCCGACGGATTACAGTG >1.10|995621|32|LR134000|PILER-CR,CRISPRCasFinder,CRT GGCTGGTGGGTTCGGGTAACTGGTTTGCTGTC >1.11|995682|32|LR134000|PILER-CR,CRISPRCasFinder,CRT TACATGTTGATGACGTTTGCCAAATGCCATGG >1.12|995743|32|LR134000|PILER-CR,CRISPRCasFinder,CRT ATTATTAATTCTGGTGGCGCTGGTCGCCCTGG >1.13|995804|32|LR134000|PILER-CR,CRISPRCasFinder,CRT AGCGCGCGCGGGCTACTGCACTCGGTGATAAC >1.14|995865|32|LR134000|PILER-CR,CRISPRCasFinder,CRT CCGAGCATTATATCCTGTGCGTCGTTCATTTA |
CRISPR arrays and Neighbor proteins around LR134000_1
The CRISPR arrays of LR134000_1 >merge|LR134000|1|995042-995926|PILER-CR,CRISPRCasFinder,CRT GAGTTCCCCGCGCCAGCGGGGATAAACCGTTCCGCGACCCGGCGATAAGGGAAGATGGGTGGAGTTCCCCGCGCCAGCGGGGATAAACCGTAACGACAGAGGGATTCGGCAGCGAAGAGGATGAGTTCCCCGCGCCAGCGGGGATAAACCGCGTAGTTTCGGCAGTCCAGTGCCTCGTTACGTGAGTTCCCCGCGCCAGCGGGGATAAACCGATAGAACGGGACGAGATTTTTAAACAATGGCTGAGTTCCCCGCGCCAGCGGGGATAAACCGCAATCTGAGCCAGACGCGACGAATAAAAGCATGAGTTCCCCGCGCCAGCGGGGATAAACCGTTGACGTTGATTTTGTTCGTTATGTTGCCAGCCGAGTTCCCCGCGCCAGCGGGGATAAACCGCTCTGATTCATCGGCGGCGATACTGTCATCACGAGTTCCCCGCGCCAGCGGGGATAAACCGGAAAAACAAATAGATGGATAGCTCGATATCATGAGTTCCCCGCGCCAGCGGGGATAAACCGCGGCTTATTGCTCTTGCCGACGGATTACAGTGGAGTTCCCCGCGCCAGCGGGGATAAACCGGGCTGGTGGGTTCGGGTAACTGGTTTGCTGTCGAGTTCCCCGCGCCAGCGGGGATAAACCGTACATGTTGATGACGTTTGCCAAATGCCATGGGAGTTCCCCGCGCCAGCGGGGATAAACCGATTATTAATTCTGGTGGCGCTGGTCGCCCTGGGAGTTCCCCGCGCCAGCGGGGATAAACCGAGCGCGCGCGGGCTACTGCACTCGGTGATAACGAGTTCCCCGCGCCAGCGGGGATAAACCGCCGAGCATTATATCCTGTGCGTCGTTCATTTAGAGTTCCCCGCGCCAGCGGGGATAAACCGC >LR134000|1|1|995042-995925|PILER-CR GAGTTCCCCGCGCCAGCGGGGATAAACCG TTCCGCGACCCGGCGATAAGGGAAGATGGGTG GAGTTCCCCGCGCCAGCGGGGATAAACCG TAACGACAGAGGGATTCGGCAGCGAAGAGGAT GAGTTCCCCGCGCCAGCGGGGATAAACCG CGTAGTTTCGGCAGTCCAGTGCCTCGTTACGT GAGTTCCCCGCGCCAGCGGGGATAAACCG ATAGAACGGGACGAGATTTTTAAACAATGGCT GAGTTCCCCGCGCCAGCGGGGATAAACCG CAATCTGAGCCAGACGCGACGAATAAAAGCAT GAGTTCCCCGCGCCAGCGGGGATAAACCG TTGACGTTGATTTTGTTCGTTATGTTGCCAGCC GAGTTCCCCGCGCCAGCGGGGATAAACCG CTCTGATTCATCGGCGGCGATACTGTCATCAC GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAAAACAAATAGATGGATAGCTCGATATCAT GAGTTCCCCGCGCCAGCGGGGATAAACCG CGGCTTATTGCTCTTGCCGACGGATTACAGTG GAGTTCCCCGCGCCAGCGGGGATAAACCG GGCTGGTGGGTTCGGGTAACTGGTTTGCTGTC GAGTTCCCCGCGCCAGCGGGGATAAACCG TACATGTTGATGACGTTTGCCAAATGCCATGG GAGTTCCCCGCGCCAGCGGGGATAAACCG ATTATTAATTCTGGTGGCGCTGGTCGCCCTGG GAGTTCCCCGCGCCAGCGGGGATAAACCG AGCGCGCGCGGGCTACTGCACTCGGTGATAAC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCGAGCATTATATCCTGTGCGTCGTTCATTTA GAGTTCCCCGCGCCAGCGGGGATAAACCG >LR134000|1|1|995042-995926|CRISPRCasFinder 
GAGTTCCCCGCGCCAGCGGGGATAAACCG TTCCGCGACCCGGCGATAAGGGAAGATGGGTG GAGTTCCCCGCGCCAGCGGGGATAAACCG TAACGACAGAGGGATTCGGCAGCGAAGAGGAT GAGTTCCCCGCGCCAGCGGGGATAAACCG CGTAGTTTCGGCAGTCCAGTGCCTCGTTACGT GAGTTCCCCGCGCCAGCGGGGATAAACCG ATAGAACGGGACGAGATTTTTAAACAATGGCT GAGTTCCCCGCGCCAGCGGGGATAAACCG CAATCTGAGCCAGACGCGACGAATAAAAGCAT GAGTTCCCCGCGCCAGCGGGGATAAACCG TTGACGTTGATTTTGTTCGTTATGTTGCCAGCC GAGTTCCCCGCGCCAGCGGGGATAAACCG CTCTGATTCATCGGCGGCGATACTGTCATCAC GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAAAACAAATAGATGGATAGCTCGATATCAT GAGTTCCCCGCGCCAGCGGGGATAAACCG CGGCTTATTGCTCTTGCCGACGGATTACAGTG GAGTTCCCCGCGCCAGCGGGGATAAACCG GGCTGGTGGGTTCGGGTAACTGGTTTGCTGTC GAGTTCCCCGCGCCAGCGGGGATAAACCG TACATGTTGATGACGTTTGCCAAATGCCATGG GAGTTCCCCGCGCCAGCGGGGATAAACCG ATTATTAATTCTGGTGGCGCTGGTCGCCCTGG GAGTTCCCCGCGCCAGCGGGGATAAACCG AGCGCGCGCGGGCTACTGCACTCGGTGATAAC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCGAGCATTATATCCTGTGCGTCGTTCATTTA GAGTTCCCCGCGCCAGCGGGGATAAACCGC >LR134000|1|1|995042-995925|CRT GAGTTCCCCGCGCCAGCGGGGATAAACCG TTCCGCGACCCGGCGATAAGGGAAGATGGGTG GAGTTCCCCGCGCCAGCGGGGATAAACCG TAACGACAGAGGGATTCGGCAGCGAAGAGGAT GAGTTCCCCGCGCCAGCGGGGATAAACCG CGTAGTTTCGGCAGTCCAGTGCCTCGTTACGT GAGTTCCCCGCGCCAGCGGGGATAAACCG ATAGAACGGGACGAGATTTTTAAACAATGGCT GAGTTCCCCGCGCCAGCGGGGATAAACCG CAATCTGAGCCAGACGCGACGAATAAAAGCAT GAGTTCCCCGCGCCAGCGGGGATAAACCG TTGACGTTGATTTTGTTCGTTATGTTGCCAGCC GAGTTCCCCGCGCCAGCGGGGATAAACCG CTCTGATTCATCGGCGGCGATACTGTCATCAC GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAAAACAAATAGATGGATAGCTCGATATCAT GAGTTCCCCGCGCCAGCGGGGATAAACCG CGGCTTATTGCTCTTGCCGACGGATTACAGTG GAGTTCCCCGCGCCAGCGGGGATAAACCG GGCTGGTGGGTTCGGGTAACTGGTTTGCTGTC GAGTTCCCCGCGCCAGCGGGGATAAACCG TACATGTTGATGACGTTTGCCAAATGCCATGG GAGTTCCCCGCGCCAGCGGGGATAAACCG ATTATTAATTCTGGTGGCGCTGGTCGCCCTGG GAGTTCCCCGCGCCAGCGGGGATAAACCG AGCGCGCGCGGGCTACTGCACTCGGTGATAAC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCGAGCATTATATCCTGTGCGTCGTTCATTTA GAGTTCCCCGCGCCAGCGGGGATAAACCG
>LR134000.1|VDY67440.1|994031_994703_+|7-carboxy-7-deazaguanine-synthase;-queosine-biosynthesis MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAIIGRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >LR134000.1|VDY67439.1|992865_993738_-|protein MRYFILMFTFVCSFVAAQPTIVPQLQQQVTDLTSSLNSQEKKELTHKLESIFNNTQVQIAVLIVSTTKDETIEQYATRVFDNWRLGDAKRNDGILIVVAWSDRTVRIQVGFGLEEKVTDALAGDIIRSNMIPAFKQQKLAQGLELAINALNNQLTSQHQYPTNPSESESASSSDHYYFAIFWVFAVMFFPFWFFHQGSNFCRACKSGVCISAIYLLDLFLFSDKIFSIAVFSFFFTFTIFMVFTCLCVLQKRASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASGRW >LR134000.1|VDY67438.1|991507_992806_+|enolase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >LR134000.1|VDY67437.1|989782_991420_+|CTP-synthetase MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >LR134000.1|VDY67436.1|988763_989555_+|nucleoside-triphosphate-pyrophosphohydrolase 
MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >LR134000.1|VDY67435.1|988357_988693_+|toxin-ChpA MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >LR134000.1|VDY67434.1|988109_988358_+|antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >LR134000.1|VDY67433.1|985797_988032_+|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >LR134000.1|VDY67432.1|984448_985750_+|23s-rRNA-(uracil-5-)-methyltransferase MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKALFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHEVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILVPQLEALLPKVRACLGSLQAIRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK >LR134000.1|VDY67431.1|981635_984392_-|two-component-sensor-kinase/response-regulator 
MTNYSLRARMMILILAPTVLIGLLLSIFFVVHRYNDLQRQLEDAGASIIEPLAVSTEYGMSLQNRESIGQLISVLHRRHSDIVRAISVYDENNRLFVTSNFHLDPSSMQLGSNVPFPRQLTVTRDGDIMILRTPIISESYSPDESPSSDAKNSQNMLGYIALELDLKSVRLQQYKEIFISSVMMLFCIGIALIFGWRLMRDVTGPIRNMVNTVDRIRRGQLDSRVEGFMLGELDMLKNGINSMAMSLAAYHEEMQHNIDQATSDLRETLEQMEIQNVELDLAKKRAQEAARIKSEFLANMSHELRTPLNGVIGFTRLTLKTELTPTQRDHLNTIERSANNLLAIINDVLDFSKLEAGKLILESIPFPLRSTLDEVVTLLAHSSHDKGLELTLNIKSDVPDNVIGDPLRLQQIITNLVGNAIKFTENGNIDILVEKRALSNTKVQIEVQIRDTGIGIPERDQSRLFQAFRQADASISRRHGGTGLGLVITQKLVNEMGGDISFHSQPNRGSTFWFHINLDLNPNIIIEGPSTQCLAGKRLAYVEPNSAAAQCTLDILSETPLEVVYSPTFSALPPAHYDMMLLGIAVTFREPLTMQHERLAKAVSMTDFLMLALPCHAQVNAEKLKQDGIGACLLKPLTPTRLLPALTEFCHHKQNTLLPVTDESKLAMTVMAVDDNPANLKLIGALLEDMVQHVELCDSGHQAVERAKQMPFDLILMDIQMPDMDGIRACELIHQLPHQQQTPVIAVTAHAMAGQKEKLLGAGMSDYLAKPIEEERLHNLLLRYKPGSGISSRVVTPEVNEIVVNPNATLDWQLALRQAAGKTDLARDMLQMLLDFLPEVRNKVEEQLVGENPEGLVDLIHKLHGSCGYSGVPRMKNLCQLIEQQLRSGTKEEDLEPELLELLDEMDNVAREASKILG >LR134000.1|VDY67442.1|995972_997247_-|IS186,-transposase MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPEVPDPKRRTNSLWRITKMVIWSLQVAIRGTVSLTAYKTQLKNARHRLNEAPRRRILQMVQPLS >LR134000.1|VDY67443.1|998583_1000062_-|putative-sugar-kinase MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDARAQDLSPEDLLNKKASSVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNHFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDKMVRVKDIFIPIESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNVDSIQSWSNA 
>LR134000.1|VDY67444.1|1000088_1001366_-|major-facilitator-superfamily-protein MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQIAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSVLGLLTLTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFTFLLFQKIRTADSAPAMASSK >LR134000.1|VDY67445.1|1001684_1002470_+|putative-short-chain-dehydrogenase MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVDITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASQASNYVNGHLLVVDGGYLVR >LR134000.1|VDY67446.1|1002539_1003994_+|putative-FAD-binding-oxidoreductase MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREIMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKATGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >LR134000.1|VDY67447.1|1004015_1005425_+|major-facilitator-superfamily-protein MTGRCLFGFSGEKPFLLPDNEGVKMNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV 
>LR134000.1|VDY67448.1|1005402_1006182_+|electron-transfer-flavoprotein-subunit-YgcR MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWQDYLRQRMQP >LR134000.1|VDY67449.1|1006178_1007039_+|electron-transfer-flavoprotein-subunit-YgcQ MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLNIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >LR134000.1|VDY67450.1|1007186_1007762_-|glycerol-antiterminator-regulatory-protein MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >LR134000.1|VDY67451.1|1007778_1008039_-|ferredoxin-like-protein-YgcO MSVARNLWRVADAPHIVPADSVERQTAERLINACPAGLFSLTPEGDLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG |
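
The spacer entries listed for LR134000_1 above appear to follow the pattern `>array.index|start|length|contig|tools SEQUENCE`. The following is a minimal sketch, assuming that reading of the header fields (the field names and the `parse_spacers` helper are illustrative labels, not something the database states). It parses three records copied from the row, checks that each declared length matches its sequence, and checks that consecutive start coordinates are separated by the spacer length plus the 29-nt repeat unit printed in the array listing.

```python
import re
from typing import NamedTuple

class Spacer(NamedTuple):
    spacer_id: str   # e.g. "1.6"; read here as array number . spacer index (assumption)
    start: int       # coordinate as printed in the header
    length: int      # declared spacer length
    contig: str
    tools: tuple     # prediction tools that reported this spacer
    seq: str

# One record: ">id|start|length|contig|tool,tool SEQUENCE"
RECORD = re.compile(r">(\d+\.\d+)\|(\d+)\|(\d+)\|([^|\s]+)\|(\S+)\s+([ACGT]+)")

def parse_spacers(text):
    """Extract every spacer record found in a flattened Spacer_info cell."""
    return [Spacer(m[1], int(m[2]), int(m[3]), m[4], tuple(m[5].split(",")), m[6])
            for m in RECORD.finditer(text)]

# Three consecutive records copied from the LR134000_1 row.
cell = (">1.5|995315|32|LR134000|PILER-CR,CRISPRCasFinder,CRT CAATCTGAGCCAGACGCGACGAATAAAAGCAT "
        ">1.6|995376|33|LR134000|PILER-CR,CRISPRCasFinder,CRT TTGACGTTGATTTTGTTCGTTATGTTGCCAGCC "
        ">1.7|995438|32|LR134000|PILER-CR,CRISPRCasFinder,CRT CTCTGATTCATCGGCGGCGATACTGTCATCAC")

# Repeat unit as printed in the array listing for LR134000_1.
REPEAT_LEN = len("GAGTTCCCCGCGCCAGCGGGGATAAACCG")

spacers = parse_spacers(cell)
for s in spacers:
    # Declared length should equal the actual sequence length.
    assert len(s.seq) == s.length
for a, b in zip(spacers, spacers[1:]):
    # Next spacer starts one spacer plus one repeat further along.
    assert b.start - a.start == a.length + REPEAT_LEN
```

The spacing check is what accounts for the single 62-nt step after spacer 1.6, the only 33-nt spacer in the array.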
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134000_2 | 997307-997945 | Unclear |
I-E
Consensus repeat of LR134000_2
|
10 spacers
spacers of LR134000_2
>2.1|997336|32|LR134000|CRISPRCasFinder CTCTTCAGCAATGAAATCGTCAAACGAGATTA >2.2|997397|32|LR134000|CRISPRCasFinder ATTACGCCGCCTCGCGTTTTTAGTCATTTCTA >2.3|997458|32|LR134000|CRISPRCasFinder AGGAGTTTAATTTCCAGATTGAGCGCTGGATA >2.4|997519|32|LR134000|CRISPRCasFinder CGTGGTCGGGATTGTTGCGCCAGTCTCCGGGG >2.5|997580|32|LR134000|CRISPRCasFinder CACGGCTGGCCATTTGAAATACCTGTTGCTCT >2.6|997641|32|LR134000|CRISPRCasFinder AACAGCGAGCCAACTGGTTTCAGATTGCTGAA >2.7|997702|32|LR134000|CRISPRCasFinder GCGATCTCGCGGAATACACCGACGAGGCGGGC >2.8|997763|32|LR134000|CRISPRCasFinder TAAGGCCGTCGCCGGATCAGCCTGGCTATGCC >2.9|997824|32|LR134000|CRISPRCasFinder GAGCCTGACGAGACTACTGAGGCCGTTCTGTC >2.10|997885|32|LR134000|CRISPRCasFinder GACGCCGCCGCCGCGAAGCCGTTTCCGATGTT >2.11|997336|34|LR134000|PILER-CR,CRT CTCTTCAGCAATGAAATCGTCAAACGAGATTAGA >2.12|997397|34|LR134000|PILER-CR,CRT ATTACGCCGCCTCGCGTTTTTAGTCATTTCTAGA >2.13|997458|34|LR134000|PILER-CR,CRT AGGAGTTTAATTTCCAGATTGAGCGCTGGATAGA >2.14|997519|34|LR134000|PILER-CR,CRT CGTGGTCGGGATTGTTGCGCCAGTCTCCGGGGGA >2.15|997580|34|LR134000|PILER-CR,CRT CACGGCTGGCCATTTGAAATACCTGTTGCTCTGT >2.16|997641|34|LR134000|PILER-CR,CRT AACAGCGAGCCAACTGGTTTCAGATTGCTGAAGT >2.17|997702|34|LR134000|PILER-CR,CRT GCGATCTCGCGGAATACACCGACGAGGCGGGCGT >2.18|997763|34|LR134000|PILER-CR,CRT TAAGGCCGTCGCCGGATCAGCCTGGCTATGCCGA >2.19|997824|34|LR134000|PILER-CR,CRT GAGCCTGACGAGACTACTGAGGCCGTTCTGTCGA >2.20|997885|34|LR134000|PILER-CR,CRT GACGCCGCCGCCGCGAAGCCGTTTCCGATGTTGA |
CRISPR arrays and Neighbor proteins around LR134000_2
The CRISPR arrays of LR134000_2 >merge|LR134000|2|997307-997945|CRISPRCasFinder,PILER-CR,CRT GAGTTCCCCGCGCCAGCGGGGATAAACCGCTCTTCAGCAATGAAATCGTCAAACGAGATTAGAGTTCCCCGCGCCAGCGGGGATAAACCGATTACGCCGCCTCGCGTTTTTAGTCATTTCTAGAGTTCCCCGCGCCAGCGGGGATAAACCGAGGAGTTTAATTTCCAGATTGAGCGCTGGATAGAGTTCCCCGCGCCAGCGGGGATAAACCGCGTGGTCGGGATTGTTGCGCCAGTCTCCGGGGGAGTTCCCCGCGCCAGCGGGGATAAACCGCACGGCTGGCCATTTGAAATACCTGTTGCTCTGTGTTCCCCGCGCCAGCGGGGATAAACCGAACAGCGAGCCAACTGGTTTCAGATTGCTGAAGTGTTCCCCGCGCCAGCGGGGATAAACCGGCGATCTCGCGGAATACACCGACGAGGCGGGCGTGTTCCCCGCGCCAGCGGGGATAAACCGTAAGGCCGTCGCCGGATCAGCCTGGCTATGCCGAGCTCCCCGCGCCAGCGGGGATAAACCGGAGCCTGACGAGACTACTGAGGCCGTTCTGTCGAGTTCCCCGCGCCAGCGGGGATAAACCGGACGCCGCCGCCGCGAAGCCGTTTCCGATGTTGAGTTCCCCGCGCCAGCGGGGATAAACCA >LR134000|2|2|997307-997945|CRISPRCasFinder GAGTTCCCCGCGCCAGCGGGGATAAACCG CTCTTCAGCAATGAAATCGTCAAACGAGATTA GAGTTCCCCGCGCCAGCGGGGATAAACCG ATTACGCCGCCTCGCGTTTTTAGTCATTTCTA GAGTTCCCCGCGCCAGCGGGGATAAACCG AGGAGTTTAATTTCCAGATTGAGCGCTGGATA GAGTTCCCCGCGCCAGCGGGGATAAACCG CGTGGTCGGGATTGTTGCGCCAGTCTCCGGGG GAGTTCCCCGCGCCAGCGGGGATAAACCG CACGGCTGGCCATTTGAAATACCTGTTGCTCT GTGTTCCCCGCGCCAGCGGGGATAAACCG AACAGCGAGCCAACTGGTTTCAGATTGCTGAA GTGTTCCCCGCGCCAGCGGGGATAAACCG GCGATCTCGCGGAATACACCGACGAGGCGGGC GTGTTCCCCGCGCCAGCGGGGATAAACCG TAAGGCCGTCGCCGGATCAGCCTGGCTATGCC GAGCTCCCCGCGCCAGCGGGGATAAACCG GAGCCTGACGAGACTACTGAGGCCGTTCTGTC GAGTTCCCCGCGCCAGCGGGGATAAACCG GACGCCGCCGCCGCGAAGCCGTTTCCGATGTT GAGTTCCCCGCGCCAGCGGGGATAAACCA >LR134000|2|2|997309-997945|PILER-CR GTTCCCCGCGCCAGCGGGGATAAACCG CTCTTCAGCAATGAAATCGTCAAACGAGATTAGA GTTCCCCGCGCCAGCGGGGATAAACCG ATTACGCCGCCTCGCGTTTTTAGTCATTTCTAGA GTTCCCCGCGCCAGCGGGGATAAACCG AGGAGTTTAATTTCCAGATTGAGCGCTGGATAGA GTTCCCCGCGCCAGCGGGGATAAACCG CGTGGTCGGGATTGTTGCGCCAGTCTCCGGGGGA GTTCCCCGCGCCAGCGGGGATAAACCG CACGGCTGGCCATTTGAAATACCTGTTGCTCTGT GTTCCCCGCGCCAGCGGGGATAAACCG AACAGCGAGCCAACTGGTTTCAGATTGCTGAAGT GTTCCCCGCGCCAGCGGGGATAAACCG GCGATCTCGCGGAATACACCGACGAGGCGGGCGT GTTCCCCGCGCCAGCGGGGATAAACCG TAAGGCCGTCGCCGGATCAGCCTGGCTATGCCGA 
GCTCCCCGCGCCAGCGGGGATAAACCG GAGCCTGACGAGACTACTGAGGCCGTTCTGTCGA GTTCCCCGCGCCAGCGGGGATAAACCG GACGCCGCCGCCGCGAAGCCGTTTCCGATGTTGA GTTCCCCGCGCCAGCGGGGATAAACCA >LR134000|2|2|997309-997945|CRT GTTCCCCGCGCCAGCGGGGATAAACCG CTCTTCAGCAATGAAATCGTCAAACGAGATTAGA GTTCCCCGCGCCAGCGGGGATAAACCG ATTACGCCGCCTCGCGTTTTTAGTCATTTCTAGA GTTCCCCGCGCCAGCGGGGATAAACCG AGGAGTTTAATTTCCAGATTGAGCGCTGGATAGA GTTCCCCGCGCCAGCGGGGATAAACCG CGTGGTCGGGATTGTTGCGCCAGTCTCCGGGGGA GTTCCCCGCGCCAGCGGGGATAAACCG CACGGCTGGCCATTTGAAATACCTGTTGCTCTGT GTTCCCCGCGCCAGCGGGGATAAACCG AACAGCGAGCCAACTGGTTTCAGATTGCTGAAGT GTTCCCCGCGCCAGCGGGGATAAACCG GCGATCTCGCGGAATACACCGACGAGGCGGGCGT GTTCCCCGCGCCAGCGGGGATAAACCG TAAGGCCGTCGCCGGATCAGCCTGGCTATGCCGA GCTCCCCGCGCCAGCGGGGATAAACCG GAGCCTGACGAGACTACTGAGGCCGTTCTGTCGA GTTCCCCGCGCCAGCGGGGATAAACCG GACGCCGCCGCCGCGAAGCCGTTTCCGATGTTGA GTTCCCCGCGCCAGCGGGGATAAACCA
>LR134000.1|VDY67442.1|995972_997247_-|IS186,-transposase MNYSHDNWSAILAHIGKPEELDTSARNAGALTRRREIRDAATLLRLGLAYGPGGMSLREVTAWAQLHDVATLSDVALLKRLRNAADWFGILAAQTLAVRAAVTGCTSGKRLRLVDGTAISAPGGGSAEWRLHMGYDPHTCQFTDFELTDSRDAERLDRFAQTADEIRIADRGFGSRPECIRSLAFGEADYIVRVHWRGLRWLTAEGMRFDMMGFLRGLDCGKNGETTVMIGNSGNKKAGAPFPARLIAVSLPPEKALISKTRLLSENRRKGRVVQAETLEAAGHVLLLTSLPEDEYSAEQVADCYRLRWQIELAFKRLKSLLHLDALRAKEPELAKAWIFANLLAAFLIDDIIQPSLDFPPEVPDPKRRTNSLWRITKMVIWSLQVAIRGTVSLTAYKTQLKNARHRLNEAPRRRILQMVQPLS >LR134000.1|VDY67441.1|995465_995936_-|Domain-of-uncharacterised-function-(DUF2825) MFSGGLSPLARGTLNERRTGYNARRFIPAGAGNSLSPSAVARARSVYPRWRGELPGRPAPPELIIGLSPLARGTPMAFGKRHQHVRFIPAGAGNSTANQLPEPTSPVYPRWRGELHCNPSARAISRGLSPLARGTHDIELSIYLFFRFIPAGAGNS >LR134000.1|VDY67440.1|994031_994703_+|7-carboxy-7-deazaguanine-synthase;-queosine-biosynthesis MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAIIGRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >LR134000.1|VDY67439.1|992865_993738_-|protein MRYFILMFTFVCSFVAAQPTIVPQLQQQVTDLTSSLNSQEKKELTHKLESIFNNTQVQIAVLIVSTTKDETIEQYATRVFDNWRLGDAKRNDGILIVVAWSDRTVRIQVGFGLEEKVTDALAGDIIRSNMIPAFKQQKLAQGLELAINALNNQLTSQHQYPTNPSESESASSSDHYYFAIFWVFAVMFFPFWFFHQGSNFCRACKSGVCISAIYLLDLFLFSDKIFSIAVFSFFFTFTIFMVFTCLCVLQKRASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASGRW >LR134000.1|VDY67438.1|991507_992806_+|enolase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >LR134000.1|VDY67437.1|989782_991420_+|CTP-synthetase 
MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >LR134000.1|VDY67436.1|988763_989555_+|nucleoside-triphosphate-pyrophosphohydrolase MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >LR134000.1|VDY67435.1|988357_988693_+|toxin-ChpA MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >LR134000.1|VDY67434.1|988109_988358_+|antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >LR134000.1|VDY67433.1|985797_988032_+|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS 
>LR134000.1|VDY67443.1|998583_1000062_-|putative-sugar-kinase MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDARAQDLSPEDLLNKKASSVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNHFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDKMVRVKDIFIPIESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNVDSIQSWSNA >LR134000.1|VDY67444.1|1000088_1001366_-|major-facilitator-superfamily-protein MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQIAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSVLGLLTLTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFTFLLFQKIRTADSAPAMASSK >LR134000.1|VDY67445.1|1001684_1002470_+|putative-short-chain-dehydrogenase MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVDITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASQASNYVNGHLLVVDGGYLVR >LR134000.1|VDY67446.1|1002539_1003994_+|putative-FAD-binding-oxidoreductase MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREIMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKATGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK 
>LR134000.1|VDY67447.1|1004015_1005425_+|major-facilitator-superfamily-protein MTGRCLFGFSGEKPFLLPDNEGVKMNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >LR134000.1|VDY67448.1|1005402_1006182_+|electron-transfer-flavoprotein-subunit-YgcR MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWQDYLRQRMQP >LR134000.1|VDY67449.1|1006178_1007039_+|electron-transfer-flavoprotein-subunit-YgcQ MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLNIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >LR134000.1|VDY67450.1|1007186_1007762_-|glycerol-antiterminator-regulatory-protein MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >LR134000.1|VDY67451.1|1007778_1008039_-|ferredoxin-like-protein-YgcO MSVARNLWRVADAPHIVPADSVERQTAERLINACPAGLFSLTPEGDLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >LR134000.1|VDY67452.1|1008029_1009301_-|putative-FAD-dependent-oxidoreductase 
MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERRITHESLSLLTPDGVTTFSSLQPGGESWSVLRARFDPWLVAEAEKEGIECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL |
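
The LR134000_2 row above reports 10 spacers but lists 20 records: each start coordinate appears once as a 32-nt CRISPRCasFinder call and once as a 34-nt PILER-CR/CRT call. Comparing the sequences suggests the 34-nt versions are the 32-nt spacers plus two bases that CRISPRCasFinder assigns to the following repeat (29-nt versus 27-nt repeat units in the array listing). The sketch below checks that interpretation for two pairs copied from the row; the `boundary_shift` helper and the dictionary names are hypothetical.

```python
# Sequences keyed by start coordinate, copied from the LR134000_2 spacer list.
casfinder = {
    997336: "CTCTTCAGCAATGAAATCGTCAAACGAGATTA",
    997580: "CACGGCTGGCCATTTGAAATACCTGTTGCTCT",
}
piler_crt = {
    997336: "CTCTTCAGCAATGAAATCGTCAAACGAGATTAGA",
    997580: "CACGGCTGGCCATTTGAAATACCTGTTGCTCTGT",
}
# Repeat unit that follows each spacer in the CRISPRCasFinder array listing.
next_repeat = {
    997336: "GAGTTCCCCGCGCCAGCGGGGATAAACCG",
    997580: "GTGTTCCCCGCGCCAGCGGGGATAAACCG",
}

def boundary_shift(short, long, repeat):
    """Return the extra bases in `long` if it is `short` plus a prefix of `repeat`, else None."""
    if not long.startswith(short):
        return None
    extra = long[len(short):]
    return extra if repeat.startswith(extra) else None

# Extra bases per start coordinate; None would mean the calls differ in some other way.
shifts = {pos: boundary_shift(casfinder[pos], piler_crt[pos], next_repeat[pos])
          for pos in casfinder}
```

If the two call sets are to be merged into one spacer list, deduplicating on the start coordinate and keeping a single boundary convention avoids counting each spacer twice.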
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134000_3 | 1023495-1023889 | TypeI-E |
I-E
Consensus repeat of LR134000_3
|
6 spacers
spacers of LR134000_3
>3.1|1023524|32|LR134000|PILER-CR,CRISPRCasFinder,CRT GGTTACGCCTGCACAGAGTACAATGCGTGGGG >3.2|1023585|32|LR134000|PILER-CR,CRISPRCasFinder,CRT CGGTGGCAGTGATGAGGCGTTCCCAATTAATG >3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT CGCACTCAAAATAGTAAATTAATTTATGAATT >3.4|1023707|32|LR134000|PILER-CR,CRISPRCasFinder,CRT ATCGGACGATGGCGATCGCAATCGCGCGGGAA >3.5|1023768|32|LR134000|CRISPRCasFinder,CRT TTTTTGTTCTCTTCAAAACGCCGAACAACCAA >3.6|1023829|32|LR134000|CRISPRCasFinder,CRT GACGCACTGGATGCGATGATGGATATCACTTG |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around LR134000_3
The CRISPR arrays of LR134000_3 >merge|LR134000|3|1023495-1023889|PILER-CR,CRISPRCasFinder,CRT GAGTTCCCCGCGCCAGCGGGGATAAACCGGGTTACGCCTGCACAGAGTACAATGCGTGGGGGAGTTCCCCGCGCCAGCGGGGATAAACCGCGGTGGCAGTGATGAGGCGTTCCCAATTAATGGAGTTCCCCGCGCCAGCGGGGATAAACCGCGCACTCAAAATAGTAAATTAATTTATGAATTGAGTTCCCCGCGCCAGCGGGGATAAACCGATCGGACGATGGCGATCGCAATCGCGCGGGAAGAGTTCCCCGCGCCAGCGGGGATAAACCGTTTTTGTTCTCTTCAAAACGCCGAACAACCAAGAGTTCCCCGCGTCAGCGAGGATAAACCGGACGCACTGGATGCGATGATGGATATCACTTGGAGTTCCCCGCCCCTGCGGTAGAACTCCC >LR134000|3|3|1023495-1023767|PILER-CR GAGTTCCCCGCGCCAGCGGGGATAAACCG GGTTACGCCTGCACAGAGTACAATGCGTGGGG GAGTTCCCCGCGCCAGCGGGGATAAACCG CGGTGGCAGTGATGAGGCGTTCCCAATTAATG GAGTTCCCCGCGCCAGCGGGGATAAACCG CGCACTCAAAATAGTAAATTAATTTATGAATT GAGTTCCCCGCGCCAGCGGGGATAAACCG ATCGGACGATGGCGATCGCAATCGCGCGGGAA GAGTTCCCCGCGCCAGCGGGGATAAACCG >LR134000|3|3|1023495-1023889|CRISPRCasFinder GAGTTCCCCGCGCCAGCGGGGATAAACCG GGTTACGCCTGCACAGAGTACAATGCGTGGGG GAGTTCCCCGCGCCAGCGGGGATAAACCG CGGTGGCAGTGATGAGGCGTTCCCAATTAATG GAGTTCCCCGCGCCAGCGGGGATAAACCG CGCACTCAAAATAGTAAATTAATTTATGAATT GAGTTCCCCGCGCCAGCGGGGATAAACCG ATCGGACGATGGCGATCGCAATCGCGCGGGAA GAGTTCCCCGCGCCAGCGGGGATAAACCG TTTTTGTTCTCTTCAAAACGCCGAACAACCAA GAGTTCCCCGCGTCAGCGAGGATAAACCG GACGCACTGGATGCGATGATGGATATCACTTG GAGTTCCCCGCCCCTGCGGTAGAACTCCC >LR134000|3|3|1023495-1023889|CRT GAGTTCCCCGCGCCAGCGGGGATAAACCG GGTTACGCCTGCACAGAGTACAATGCGTGGGG GAGTTCCCCGCGCCAGCGGGGATAAACCG CGGTGGCAGTGATGAGGCGTTCCCAATTAATG GAGTTCCCCGCGCCAGCGGGGATAAACCG CGCACTCAAAATAGTAAATTAATTTATGAATT GAGTTCCCCGCGCCAGCGGGGATAAACCG ATCGGACGATGGCGATCGCAATCGCGCGGGAA GAGTTCCCCGCGCCAGCGGGGATAAACCG TTTTTGTTCTCTTCAAAACGCCGAACAACCAA GAGTTCCCCGCGTCAGCGAGGATAAACCG GACGCACTGGATGCGATGATGGATATCACTTG GAGTTCCCCGCCCCTGCGGTAGAACTCCC
>LR134000.1|VDY67464.1|1023104_1023389_+|CRISPR-associated-protein-Cas2 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTESGFEFQTFGVNRRTPVDLDGLRLVSFLPV >LR134000.1|VDY67463.1|1022185_1023103_+|putative-CRISPR-associated-protein-Cas1 MTWLPLNPIPLKDRVSMIFLQYGQIDVIDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVRLAAQVGTLLVWVGEAGVRVYASGQPGGARSDKLLYQAKLALDEDLRLKVVRKMFELRFGEPAPARRSVEQLRGIEGSRVRATYALLAKQYGVTWNGRRYDPKDWEKGDTVNQCISAATSCLYGVTEAAILAAGYAPAIGFVHTGKPLSFVYDIADIIKFDTVVPKAFEIARRNPGEPDREVRLACRDIFRSSKTLAKLIPLIEDVLAAGEIQPPAPPEDAQPVAIPLPVSLGDAGHRSS >LR134000.1|VDY67462.1|1021570_1022170_+|CRISPR-associated-Cse3-family-protein MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVITINDAPALIDLVQQGIGPAKSMGCGLLSLAPL >LR134000.1|VDY67461.1|1020909_1021584_+|CRISPR-associated-Cas5-family-Ecoli-subtype-protein MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPQATMVMSELEKAVLKPRYTPYLGRRSCPLTQPLFLGTCQASDPQKVLLNYEPVGGDIYSEESVDGHHLKFTVRDEPMITLPRQFASREWYVIKGGMDVSQ >LR134000.1|VDY67460.1|1019815_1020907_+|CRISPR-associated-Cse4-family-protein MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSDYYAQNIGESSLRTIHLAQLRDVLRQKLSERFDQKIIDKTLALLSGKSVNEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMSELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKANDGFLQPSLQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITGQVQQMPTLEQLKSWVRNNGEA >LR134000.1|VDY67459.1|1019320_1019803_+|CRISPR-associated-Cse2-family-protein MADEIDAMALYRAWQQLDNGACAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGKNVIRHQDKKPEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRLLAHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA >LR134000.1|VDY67458.1|1017819_1019328_+|Cse1-family-CRISPR-associated-protein 
MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIDPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPITTFVRGIDLRSTVLLNVLTIPRLQKQFPNESHTENQPTWVKPVKPNESVPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGGPSNG >LR134000.1|VDY67457.1|1014738_1017405_+|CRISPR-associated-helicase-Cas3 MEPFKYICHYWGKSSKSLTKGNDIHLLIYHCLDVAAVADCWWDQSVVLQNTFCRNEMLSKQRVKAWLLFFIALHDIGKFDIRFQYKSAESWLKLNPATPSLNGPSTQMCRKFNHGAAGLYWFNQDSLSEQSLGDFFSFFDAAPHPYESWFPWVEAVTGHHGFILHSQDQDKSRWEMPASLASYAAQDKQAREEWISVLEALFLTPAGLSINDIPPDCSSLLAGFCSLADWLGSWTTTNTFLFNEDAPSDINALRTYFQDRQQDASRVLELSGLVSNKRCYEGVHALLDNGYQPRQLQVLVDALPVAPGLTVIEAPTGSGKTETALAYAWKLIDQQIADSVIFALPTQATANAMLTRMEASASHLFSSPNLILAHGNSRFNHLFQSIKSRAITEQGQEEAWVQCCQWLSQSNKKVFLGQIGVCTIDQVLISVLPVKHRFIRGLGIGRSVLIVDEVHAYDTYMNGLLEAVLKAQADVGGSVILLSATLPMKQKQKLLDTYGLHTDPVENNSAYPLINWRGVNGAQRFDLLAHPEQLPPRFSIQPEPIYLADMLPDLTMLERMIAAANAGAQVCLICNLVDVAQVCYQRLKELNNTQVDIDLFHARFTLNDRREKENRVISDFGKNGERNVGRILVATQVVEQSLDVDFDWLITQHCPADLLFQRLGRLHRHHRKYRPAGFEIPVATILLPDGEGYGRHEHIYSNVRVMWRTQQHIEELNGASLFFPDAYRQWLDSIYDDAEMDEPEWVIKGMDKFESAECEKRFKARKVLQWAEEYSLQDNDETILAVTRDGEMSLPLLPYVQTSSGKQLLDGQVYEDLSYEQQYEALALNRVNVPFTWKRSFSEVVDEDGLLWLEGKQNQDGWFWQGNSIVITYTRDEGMTRVIPANPK >LR134000.1|VDY67456.1|1013645_1014380_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >LR134000.1|VDY67455.1|1011858_1013571_+|sulfite-reductase-(NADPH)-hemoprotein-beta-subunit 
MSEKHPGPLVVEGKLTDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLSGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD >LR134000.1|VDY67465.1|1023909_1024947_-|alkaline-phosphatase-isozyme-conversion-protein MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEVFDKAGIAVLSVEATNWNLGNKDGYQQRAKTAAFPAGNSWHDVRLDNQQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >LR134000.1|VDY67466.1|1025198_1026107_+|sulfate-adenylyltransferase MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >LR134000.1|VDY67467.1|1026108_1027536_+|sulfate-adenylyltransferase MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEKTFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMAWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >LR134000.1|VDY67468.1|1027535_1028141_+|adenosine-5-phosphosulfate-kinase 
MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRYFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >LR134000.1|VDY67469.1|1028190_1028514_+|putative-cytochrome-oxidase-subunit MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >LR134000.1|VDY67470.1|1028707_1029019_+|cell-division-protein MGKLTLLLLAILVWLQYSLWFGKNGIHDYIRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR >LR134000.1|VDY67471.1|1029037_1029748_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >LR134000.1|VDY67472.1|1029747_1030227_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >LR134000.1|VDY67473.1|1030223_1031273_+|tRNA-pseudouridine-synthase-D MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLNDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >LR134000.1|VDY67474.1|1031253_1032015_+|stationary-phase-survival-protein-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVTCSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134000_4 | 1516188-1516305 | Orphan |
NA
Consensus repeat of LR134000_4
|
1 spacer
spacers of LR134000_4
>4.1|1516219|56|LR134000|CRISPRCasFinder TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA |
CRISPR arrays and Neighbor proteins around LR134000_4
The CRISPR arrays of LR134000_4 >merge|LR134000|4|1516188-1516305|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGCTGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAACCGAGCCGTAGGCCGGATAAGGCGTTTACGC >LR134000|4|4|1516188-1516305|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGC TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA CCGAGCCGTAGGCCGGATAAGGCGTTTACGC
>LR134000.1|VDY67912.1|1514960_1516091_-|ribonucleotide-diphosphate-reductase-subunit-beta MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDSEVDTDDLSNFQL >LR134000.1|VDY67911.1|1514706_1514961_-|putative-ferredoxin MARVTLRITGTQLLCQDEHPSLLAALESHNVAVEYQCREGYCGSCRTRLVAGQVDWIAEPLAFIQPGEILPCCCRAKGDIEIEM >LR134000.1|VDY67910.1|1514002_1514653_+|pH-inducible-protein-involved-in-stress-response-(putative-kinase) MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIYVKTEGKAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM >LR134000.1|VDY67909.1|1513581_1513788_-|yfaH MNFIRQGLGIALQPELTLKSIAGELCSVPLEPTFYRQISLLAKEKPVEGSPLFLLQMCMEQLVAIGKI >LR134000.1|VDY67908.1|1512463_1513540_+|glycerophosphodiester-phosphodiesterase MKLTLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDHLVVLHDHYLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMELNLVQLIAYTDWNETQQKQPDGSWVNYNYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTTDVNQLYDALYNKAGVNGLFTDFPDKAVKFLNKE >LR134000.1|VDY67907.1|1511100_1512459_+|glycerol-3-phosphate-transporter 
MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSKFIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELTAKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFFYEYAGIPGTLLCGWMSDKVFRGNRGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAASAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQKRNGG >LR134000.1|VDY67906.1|1509199_1510828_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-A MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGIIAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQEPAEVTLRKVISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL >LR134000.1|VDY67905.1|1507950_1509210_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-B MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSHLPDGQPVADIHSGLESLRQQAPAHPYSLLGPQRVLDLACQAQALIAESGAQLQGSVELAHQRITPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDFQAHLAAASLRELDLSVETAEIELPELDVLRNNATEFRAVNIARFLDNEENWPLLLDALIPVANTCEMILMPACFGLADDKLWRWLNEKLPCSLMLLPTLPPSVLGIRLQNQLQRQFVHQGGVWMPGDEVKKVTCKNGVVNEIWTRNHADIPLRPRFAVLASGSFFSGGLVAERNGIREPILGLDVLQTATRGEWYKGDFFAPQPWQQFGVTTDETLRPSQAGQTIENLFAIGSVLGGFDPIAQGCGGGVCAVSALHAAQQIAQRAGGQQ >LR134000.1|VDY67904.1|1506763_1507954_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-C 
MNDTSFENCIKCTVCTTACPVSRVNPGYPGPKQAGPDGERLRLKDGALYDEALKYCINCKRCEVACPSDVKIGDIIQRARAKYDTTRPSLRNFVLSHTDLMGSVSTPFAPIVNTATSLKPVRQLLDAALKIDHRRTLPKYSFGTFRRWYRSIAAQQAQYKDQVAFFHGCFVNYNHPQLGKDLIKVLNAMGTGVQLLSKEKCCGVPLIANGFTDKARKQAITNVESIREAVGVKGIPVIATSSTCTFALRDEYPEVLNVDNKGLRDHIELATRWLWRKLDEGKTLPLKPLPLKVVYHTPCHMEKMGWTLYTLELLRKIPGLELTVLDSQCCGIAGTYGFKKENYPTSQAIGAPLFRQIEESGADLVITDCETCKWQIEMSTSLRCEHPITLLAQALA >LR134000.1|VDY67903.1|1505667_1506570_-|transposase-YhgA-family-protein MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYSDVLWSVETSDGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPLLFYHGETSPYPYSLNWLDEFDEPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQKHIRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIEEIAERSPLQKERLMTIAERLRQEGHQIGWQEGMHEQAIKIALRMLEQGIDRDQVLAATQLSEADLAANNH >LR134000.1|VDY67913.1|1516324_1518610_-|ribonucleoside-diphosphate-reductase MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEELAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI >LR134000.1|VDY67914.1|1519512_1523058_+|adhesin 
MTNNASGGAVFLQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNAMFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYGGAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVDSIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRSNSLMNVGDTHCQDDPQDCYGLTIGSIDQYQNQAELNVGSTQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQSMALTGDIVVDDGAVLSLEGDAADLTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQVSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAGVDTQWGALMADSSGQHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDIQSIDAISSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSGAVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPNPTPDPEPTPAYQPVLNAKVGGYLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSGDLFSGRWGTDGEWMLGIVGGYSDNQGDSRSNMTGTCADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHSSGIIASLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW >LR134000.1|VDY67915.1|1523185_1523908_-|3-demethylubiquinone-9-3-methyltransferase MNAEKSPENHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGATVTGLDMGFEPLQVAKLHALESGIQVDYVQETVEEHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTLNRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNTFKLGPGVDVNYMLHTQNK >LR134000.1|VDY67916.1|1524054_1526682_+|DNA-gyrase-subunit-A 
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVEGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGEGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE >LR134000.1|VDY67917.1|1526830_1528519_+|putative-membrane-protein-YfaA MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNNLQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLGIEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETVPVYQLRYNGNNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIAGDLLSGKKRWQASFGLEERTAEKTPVRQRIVVSARWLGFGYQRLMPSFAGVHFEMGNDGWHSFVALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYINPQGIAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGAAWQWLPITWQPL >LR134000.1|VDY67918.1|1528515_1529139_+|protein MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFLAR >LR134000.1|VDY67919.1|1529282_1533677_+|large-extracellular-alpha-helical-protein 
MRLEAPGRDYRRYQMEEYGGVDVRLYRIPDPMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSSQSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVEQFRYPLWQAKPFEPQQGVKLEGASSNFISPQPGNIYIPLGQQEPGLYLVEAMVGGYRATTVVFVSDTVALSKVSGKELLVWTAGKKQGEAKPGSEILWTDGLGVMTRGVTDDSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTDRPLYRAGDRVDVKVMGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVDVTLDARNGGQGSFRLPENAVAGGDELRLAYRNQVYSSSFRVANYIKPHFEIGLALAKKEFKTGEAVSGKLQLLYPDGEPVKNARVQLSLRAQQLSMVGNDLRYAGRFPVSLEGSETVSDASGHVALNLPAADKPSRYLLTVSASDGAAYRVTTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWLRLEDRTSHSGELPSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSGKGSTAHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQSLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQNAGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVDEMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGATNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWRITARGMNGDGLVGQGRAYLRSEKNLYMKWSMPTVYRVGDKPAAGLFIFSQQDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNGQVQDSISTKLSFVDNSWPVEQQKNVMLGGGDNALMLPEQASNIRLQSSETPQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQMIQDNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQAIGVTQQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAIARRGTKTEDFSEEDTRDINDSLILDTPESPLADAVANVLTMTLLKKAQLKSTVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQAAAILSGLTAEQSTIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEYWRWVGQGVPDILSFGDELSPQNVQVRWREPAKTAQQSNIPVTVERQLYRLIPGEEEMSFTLQPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTWGISVNKPNAAKQQGQLLEKARNEMGELAYMVPVKELTGTVTFRHLLRFSQKGQFVLPPARYVRSYAPAQQSVAAGSEWTGMQVK >LR134000.1|VDY67920.1|1533677_1535327_+|protein 
MNWRRIVWLLALVTLPTLAEETPLQLVLRGAQHDQLYQLSSSGVTKVSALPDSLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESITRDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSVTVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWFADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQVASGQCVEVELFARYPLKKITAEKSTTAVKPGVLNGRYRVTFTNGNHITFVSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMAAWTQDLIYAGDPVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRSTCQLLPKAKAWLAKKMPQWRRILQAETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >LR134000.1|VDY67921.1|1535331_1536108_+|DUF2135-family-protein,-function-uncharacterised MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPAEGEDASFSQSINYPASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDGSFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWDTDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPIHGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGELTLVKSFDW >LR134000.1|VDY67922.1|1536181_1537366_-|acetyl-CoA-acetyltransferase MKNCVIVSAVRTAIGSFNGSLASTSAIDLGATVIKAAIERAKIDSQYVDEVIMGNVLQAGLGQNPARQALLKSVLAETVCGFTVNKVCGSGLKSVALAAQAIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLMCATHGYHMGITAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEIVPVNVVTRKKTFVFSQDEFPKADSTTEALGALRPAFDKAGTVTAGNASGINDGAAALVIMEESAALAAGLTPLARIKSYASGGVPPALMGMGPVPATQKALQLAGLQLADIDLIEANEAFAAQFLAVGKNLGFDSEKVNVNGGAIALGHPIGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN |
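
The spacer entries in these tables appear to follow a fixed header layout: `>array.index|start|length|contig|tools`, followed by the spacer sequence. The sketch below is one possible way to parse such an entry and check that the declared length matches the sequence. The field names (`spacer_id`, `start`, `length`, `contig`, `tools`) and the function `parse_spacer` are illustrative assumptions inferred from the entries on this page, not a documented schema.

```python
from typing import List, NamedTuple


class Spacer(NamedTuple):
    # Field names are assumptions inferred from the entries on this page.
    spacer_id: str     # "<array number>.<spacer index>", e.g. "4.1"
    start: int         # start coordinate on the contig
    length: int        # declared spacer length in bp
    contig: str        # contig accession
    tools: List[str]   # tools that reported this spacer
    sequence: str


def parse_spacer(entry: str) -> Spacer:
    """Parse one '>id|start|length|contig|tools SEQUENCE' entry."""
    # Drop surrounding whitespace and the trailing table-cell pipe, if any.
    text = entry.strip().rstrip("|").strip()
    header, sequence = text.lstrip(">").split(maxsplit=1)
    spacer_id, start, length, contig, tools = header.split("|")
    spacer = Spacer(spacer_id, int(start), int(length), contig,
                    tools.split(","), sequence)
    # Reject entries whose declared length disagrees with the sequence.
    if len(spacer.sequence) != spacer.length:
        raise ValueError(f"{spacer_id}: declared {length} bp, "
                         f"found {len(sequence)} bp")
    return spacer


# The single spacer of LR134000_4, copied from the table above.
entry = (">4.1|1516219|56|LR134000|CRISPRCasFinder "
         "TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA |")
spacer = parse_spacer(entry)
print(spacer.spacer_id, spacer.start, spacer.length, spacer.tools)
```

Raising on a length mismatch, rather than silently truncating, makes copy or extraction damage in a sequence visible early.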
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134000_5 | 2156054-2156177 | Orphan |
NA
Consensus repeat of LR134000_5
|
1 spacer
spacers of LR134000_5
>5.1|2156097|38|LR134000|CRISPRCasFinder CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around LR134000_5
The CRISPR arrays of LR134000_5 >merge|LR134000|5|2156054-2156177|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA >LR134000|5|5|2156054-2156177|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>LR134000.1|VDY68513.1|2153883_2155488_-|conserved-protein-with-FAD/NAD(P)-binding-domain MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENSKMMLANIASIEIPPINCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRILLGEYFRDQFLRLVDQARKQKFAVAVYESCQVTDLQITNAGVMLATNQGLPRETFDLAVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTSLSGLDAAMAVAIQHGSFIEDDKQHVVFHRDNASEKLNITLMSRTGILPEADFYCPIPYEPLHIVTDQALNAEIQKGEEGLLDRVFRLIVEEIKFADPDWSQRIALESLNVDSFAQAWFAERKQRDPFDWAEKNLQEVERNKREKHTVPWRYVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLALREAGIIHILALGEDYEMEINESRTVLKTEDNSYSFDVFIDARGQRPLKVKDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEDIRGRVAFGALPWLMHDQPFVQGLTACAEIGEAMARAVVKPASRARRRLSFD >LR134000.1|VDY68512.1|2153107_2153872_+|putative-oxidoreductase-subunit MYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPGEIPFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLAQIASFGANARIANSGDNVHIIASGENSTVVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGENNIRAGVRYRLNEQHQFVEC >LR134000.1|VDY68511.1|2152270_2153056_+|putative-oxidoreductase,-cytochrome-b-subunit MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALLRARGVKKSATDHGEKVYLYSKAVRLWHWSNALLFVLLLASGLINHFAMVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGDNGHHYRIRRQGWLERAAKQTRFYLFGIMQGEEHPLPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHETFKSMVDGYHRH >LR134000.1|VDY68510.1|2151605_2152274_+|putative-oxidoreductase-Fe-S-subunit MSFTRRKFVLGMGTVIFFTGSASSLLANTRQEKEVRYAMIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDNDNETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLAKGFPPICVSACPEHALIFGREDSPEIQAWLQDNKYYQYQLPGAGKPHLYRRFGQHLIKKENV >LR134000.1|VDY68509.1|2150903_2151542_+|putative-oxidoreductase-subunit MNHRDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYSVAEEELSVLLESIKQNGDYADIACMTGSQDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQIEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP >LR134000.1|VDY68508.1|2148788_2150891_+|putative-aldehyde-ferredoxin-oxidoreductase 
MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITSLSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLKIKDDKVSLEKADFLWGKGTRATTEEICRLTSPETCVAAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTPQSWAEYSDPKSRWTARKGLFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNIPRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVLPAEEYAEIHWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQVGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNWVWPMTVSPLKSRNYRGDLALEAKFFKAITGEEMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQIPVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRETLQRLGLEDIAADLAAHNLLPV >LR134000.1|VDY68507.1|2148141_2148768_+|putative-oxidoreductase-Fe-S-subunit MNPVDRPLLDIGLTRLEFLRISGKGLAGLTIAPALLSLLGCKQEDIDSGTVGLINTPKGVLVTQRARCTGCHRCEISCTNFNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCSACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV >LR134000.1|VDY68506.1|2147477_2147687_+|protein MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK >LR134000.1|VDY68505.1|2145508_2146921_-|pyruvate-kinase MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKTAAILLDTKGPEIRTMKLEGGNDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNLPGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGIMVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKYPLEAVSIMATICERTDRVMNSRLEFNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARAVRKYFPDATILALTTNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAHKGDVVVMVSGALVPSGTTNTASVHVL >LR134000.1|VDY68504.1|2144961_2145198_-|murein-lipoprotein MKATKLVLGAVILGSTLLAGCSSNAKIDQLSSDVQTLNAKVDQLSNDVNAMRSDVQAAKDDAARANQRLDNMATKYRK >LR134000.1|VDY68514.1|2156491_2157748_+|Fluffing-protein 
MGSDAKNLMSDGNVQIVKTGEVIGATQLTEGELIVEAGGRAENTVVTGAGWLKVATGGIAKCTQYGNNGTLSVSDGAIATDIVQSEGGAISLSTLATVNGRHPEGEFSVDKGYACGLLLENGGNLRVLEGHRAEKIILDQEGGLLVNGTTSAVVVDEGGELLVYPGGEASNCEINQGGVFMLAGKASDTLLAGGTMNNLGGEDSDTIVENGSIYRLGTDGLQLYSSGKTQNLSVNVGGRAEVHAGTLENAVIQGGTVILLSPTSADENFVVEEDRAPVELTGSVALLDGASMIIGYGAELQQSTITVQQGGVLILDGSTVKGDSVTFSVGNINLNGGKLWLITGAATHVQLKVKRLRGEGAICLQTSAKEISPDFINVKGEVTGDIHVEITDASRQTLCNALKLQPDEDGIGATLQPA >LR134000.1|VDY68515.1|2157788_2159162_-|multidrug-resistance-protein MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALAEKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPGMVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLMVGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGLPSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRMPSAIILQRASR >LR134000.1|VDY68516.1|2159376_2160018_+|riboflavin-synthase-subunit-alpha MFTGIVQGTAKLVSIDEKPNFRTHVVELPDHMLDGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA >LR134000.1|VDY68517.1|2160057_2161206_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIAGARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMICEKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEHVGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSESHFVMEDWHNFGADYDTTLMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR >LR134000.1|VDY68518.1|2161496_2162708_-|major-facilitator-superfamily-protein 
MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTIFALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFATIMPLVGLSPALAPLLGSWLLVHFSWQAIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVVAQALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVGCQNHGNAEVAHSESH >LR134000.1|VDY68519.1|2162820_2163753_+|LysR-family-transcriptional-regulator MWSEYSLEVVDAVARNGSFSAAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTAAGAWFLKEGRSVVKKMQITRQQCQQIANGWRGQLAIAVDNIVRPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATRAIPVGGRYAFRDMGMLSWSCVVASHHPLALMDGPFSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRVVVPDWESSATCISAGLCIGMVPTHFAKPWLNEGKWVALELENPFPDSACCLTWQQNDMSPALTWLLEYLGDSETLNKEWLREPEETPATGD >LR134000.1|VDY68520.1|2163749_2164775_-|purine-nucleotide-synthesis-repressor MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR >LR134000.1|VDY68521.1|2165328_2166498_+|major-facilitator-superfamily-protein MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIFTLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRMSFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVTAMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPETVCVANS >LR134000.1|VDY68522.1|2166643_2167225_-|superoxide-dismutase MSFELPALPYAKDALAPHISAETIEYHYGKHHQTYVTNLNNLIKGTAFEGKSLEEIIRSSEGGVFNNAAQVWNHTFYWNCLAPNAGGEPTGKVAEAIAASFGSFADFKAQFTDAAIKNFGSGWTWLVKNSDGKLAIVSTSNAGTPLTTDATPLLTVDVWEHAYYIDYRNARPGYLEHFWALVNWEFVAKNLAA 
>LR134000.1|VDY68523.1|2167352_2168168_-|putative-exported-hydrolase MARINRISITLCALLFTTLPLTPMAHASKQARESSATTHITKKADKKKSTATTKKTQKTAKKAASKSTTKSKTASSVKKSSITASKNAKTRSKHAVNKTASASFTEKCTKRKGYKSHCVKVKNAASGTLADAHKAKVQKATKVAMNKLMQQIGKPYRWGGSSPRTGFDCSGLVYYAYKDLVKIRIPRTANEMYHLRDAAPIERSELKNGDLVFFRTQGRGTADHVGVYVGNGKFIQSPRTGQEIQITSLSEDYWQRHYVGARRVMTPKTLR |
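
The array listings give each array as alternating repeat and spacer units together with a `start-end` location. As a consistency check, the sketch below reassembles the LR134000_5 array from its units and compares the result with the reported location and spacer start. It assumes the coordinates are 1-based and inclusive, which is an inference from the numbers on this page rather than a documented convention. The helper names `span_length` and `spacer_starts` are hypothetical.

```python
from typing import List

# Units of LR134000_5 as listed above: repeat, spacer, repeat.
REPEAT = "CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA"
SPACER = "CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA"
UNITS = [REPEAT, SPACER, REPEAT]
LOCATION = "2156054-2156177"


def span_length(location: str) -> int:
    """Length of a 'start-end' span, assuming 1-based inclusive coordinates."""
    start, end = (int(part) for part in location.split("-"))
    return end - start + 1


def spacer_starts(array_start: int, units: List[str]) -> List[int]:
    """Start coordinate of each spacer; units alternate repeat, spacer, ..."""
    starts = []
    position = array_start
    for index, unit in enumerate(units):
        if index % 2 == 1:  # odd positions hold spacers
            starts.append(position)
        position += len(unit)
    return starts


array = "".join(UNITS)
array_start = int(LOCATION.split("-")[0])
print(len(array), span_length(LOCATION), spacer_starts(array_start, UNITS))
```

If the assembled length and the span length disagree, either a unit was copied incorrectly or the coordinate convention differs from the one assumed here.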
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134000_7 | 2974507-2974711 | Orphan |
I-F
Consensus repeat of LR134000_7
|
3 spacers
spacers of LR134000_7
>7.1|2974535|28|LR134000|CRISPRCasFinder,CRT ATGGTGGGTGGAGTATGTTACCTGTGAA >7.2|2974591|32|LR134000|CRISPRCasFinder,CRT,PILER-CR GCCAACTTTCACAGCCTTTGCAAGATTGTTCC >7.3|2974651|33|LR134000|CRISPRCasFinder,CRT,PILER-CR GTTCGCTGCAACCGCTAGCCAAGACGGTAGGTT |
CRISPR arrays and Neighbor proteins around LR134000_7
The CRISPR arrays of LR134000_7 >merge|LR134000|7|2974507-2974711|CRISPRCasFinder,CRT,PILER-CR CATTTTATCTGTCTGTACGGCAGTGAACATGGTGGGTGGAGTATGTTACCTGTGAATTTCTAAGCTGCCTGTACGGCAGTGAACGCCAACTTTCACAGCCTTTGCAAGATTGTTCCTTTCTAAGCTGCCTGTACGGCAGTGAACGTTCGCTGCAACCGCTAGCCAAGACGGTAGGTTTTTCTAAGCTGCCTGTACGGCAGTGAAC >LR134000|7|7|2974507-2974711|CRISPRCasFinder CATTTTATCTGTCTGTACGGCAGTGAAC ATGGTGGGTGGAGTATGTTACCTGTGAA TTTCTAAGCTGCCTGTACGGCAGTGAAC GCCAACTTTCACAGCCTTTGCAAGATTGTTCC TTTCTAAGCTGCCTGTACGGCAGTGAAC GTTCGCTGCAACCGCTAGCCAAGACGGTAGGTT TTTCTAAGCTGCCTGTACGGCAGTGAAC >LR134000|7|4|2974507-2974711|CRT CATTTTATCTGTCTGTACGGCAGTGAAC ATGGTGGGTGGAGTATGTTACCTGTGAA TTTCTAAGCTGCCTGTACGGCAGTGAAC GCCAACTTTCACAGCCTTTGCAAGATTGTTCC TTTCTAAGCTGCCTGTACGGCAGTGAAC GTTCGCTGCAACCGCTAGCCAAGACGGTAGGTT TTTCTAAGCTGCCTGTACGGCAGTGAAC >LR134000|7|4|2974563-2974711|PILER-CR TTTCTAAGCTGCCTGTACGGCAGTGAAC GCCAACTTTCACAGCCTTTGCAAGATTGTTCC TTTCTAAGCTGCCTGTACGGCAGTGAAC GTTCGCTGCAACCGCTAGCCAAGACGGTAGGTT TTTCTAAGCTGCCTGTACGGCAGTGAAC
>LR134000.1|VDY69264.1|2973903_2974122_+|translation-initiation-factor-IF-1 MAKEDNIEMQGTVLETLPNTMFRVELENGHVVTAHISGKMRKNYIRILTGDKVTVELTPYDLSKGRIVFRSR >LR134000.1|VDY69263.1|2972914_2973619_+|leucyl/phenylalanyl-tRNA-protein-transferase MRLVQLSRHSIAFPSPEGALREPNGLLALGGDLSPARLLMAYQRGIFPWFSPGDPILWWSPDPRAVLWPESLHISRSMKRFHKRSPYRVTMNYAFGQVIEGCASDREEGTWITRGVVEAYHRLHELGHAHSIEVWREDELVGGMYGVAQGTLFCGESMFSRMENASKTALLVFCEEFIGHGGKLIDCQVLNDHTASLGACEIPRRDYLNYLNQMRLGRLPNNFWVPRCLFSPQE >LR134000.1|VDY69262.1|2971151_2972873_+|ABC-transporter-ATP-binding-protein/permease MRALLPYLALYKRHKWMLSLGIVLAIVTLLASIGLLTLSGWFLSASAVAGVAGLYSFNYMLPAAGVRGAAITRTAGRYFERLVSHDATFRVLQHLRIYTFSKLLPLSPAGLARYRQGELLNRVVADVDTLDHLYLRVISPLVGAFVVIMVVTIGLSFLDFTLAFTLGGIMLLTLFLMPPLFYRAGKSTGQNLTHLRGQYRQQLTAWLQGQAELTIFGASDRYRTQLENTEIQWLEAQRRQSELTALSQAIMLLIGALAVILMLWMASGGVGGNAQPGALIALFVFCALAAFEALAPVTGAFQHLGQVIASAVRITDLTDQKPEVTFPDTQTRVADRVSLTLRDVQFTYPEQSQQALKGISLQVNAGEHIAILGRTGCGKSTLLQLLTRAWDPQQGEILLNDSPIASLNEAALRQTISVVPQRVHLFSATLRDNLLLASPGSSDEALSEILRRVGLEKLLEDAGLNSWLGEGGRQLSGGELRRLAIARALLHDAPLVLLDEPTEGLDATTESQILELLAEMMREKTVLMVTHRLRGLSRFQQIIVMDNGQIIEQGTHAELLARQGRYYQFKQGL >LR134000.1|VDY69261.1|2969384_2971151_+|ABC-transporter-ATP-binding-protein/permease MNKSRQKELTRWLKQQSVISQRWLNISRLLGFVSGILIIAQAWFMARILQHMIMENIPREALLLPFTLLVLTFVLRAWVVWLRERVGYHAGQHIRFAIRRQVLDRLQQAGPAWIQGKPAGSWATLVLEQIDDMHDYYARYLPQMALAVSVPLLIVVAIFPSNWAAALILLGTAPLIPLFMVLVGMGAADANRRNFLALARLSGHFLDRLRGMETLRIFGRGEAEIESIRSASEDFRQRTMEVLRLAFLSSGILEFFTSLSIALVAVYFGFSYLGELDFGHYDTGVTLAAGFLALILAPEFFQPLRDLGTFYHAKAQAVGAADSLKTFMETPLAHPQRGEAELALTDPLTIEAEDLFITSPEGKTLAGPLNFTLPAGQRAVLVGRSGSGKSSLLNALSGFLSYQGSLRINGIELRDLSPESWRKHLSWVGQNPQLPAATLRDNVLLARPDASEQELQAALDNAWVSEFLPLLPQGVDTPVGDQAARLSVGQAQRVAVARALLNPCSLLLLDEPAASLDAHSEQRVMEALNAASLRQTTLMVTHQLEDLADWDVIWVMQDGRIIEQGRYAELSVAGGPFATLLAHRQEEI >LR134000.1|VDY69260.1|2968296_2969262_+|thioredoxin-reductase 
MGTTKHSKLLILGSGPAGYTAAVYAARANLQPVLITGMEKGGQLTTTTEVENWPGDPNDLTGPLLMERMHEHATKFETEIIFDHINKVDLQNRPFRLNGDNGEYTCDALIIATGASARYLGLPSEEAFKGRGVSACATCDGFFYRNQKVAVIGGGNTAVEEALYLSNIASEVHLIHRRDGFRAEKILIKRLMDKVENGNIILHTNRTLEEVTGDQMGVTGVRLRDTQNSDNIESLDVAGLFVAIGHSPNTAIFEGQLELENGYIKVQSGIHGNATQTSIPGVFAAGDVMDHIYRQAITSAGTGCMAALDAERYLDGLADAK >LR134000.1|VDY69259.1|2967257_2967752_-|leucine-responsive-transcriptional-regulator MVDSKKRPGKDLDRIDRNILNELQKDGRISNVELSKRVGLSPTPCLERVRRLERQGFIQGYTALLNPHYLDASLLVFVEITLNRGAPDVFEQFNTAVQKLEEIQECHLVSGDFDYLLKTRVPDMSAYRKLLGETLLRLPGVNDTRTYVVMEEVKQSNRLVIKTR >LR134000.1|VDY69258.1|2963133_2967123_-|cell-division-protein-FtsK MSQEYTEDKEVTLTKLSSGRRLLEALLILIVLFAVWLMAALLSFNPSDPSWSQTAWHEPIHNLGGMPGAWLADTLFFIFGVMAYTIPVIIVGGCWFAWRHQSSDEYIDYFAVSLRIIGVLALILTSCGLAAINADDIWYFASGGVIGSLLSTTLQPLLHSSGGTIALLCVWAAGLTLFTGWSWVTIAEKLGGWILNILTFASNRTRRDDTWVDEDEYEDDEEYEDENHGKQHESRRARILRGALARRKRLAEKFINPMGRQTDAALFSGKRMDDDEEITYTARGVAADPDDVLFSGNRATQPEYDEYDPLLNGAPITEPVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKATLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASYGIKLPSQRAAEEKAREAQRNQYDSGDQYNDDEIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAELARQFAQTQQQRYSGEQPAGANPFSLDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVPPQPQYQQPQQPVAPQPQYQQPQQPVAPQQQYQQPQQPVAPQQQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVEPVDTFALEQMARLVEARLADFRIKADVVNYSPGPVITRFELNLAPGVKAARISNLSRDLARSLSTVAVRVVEVIPGKPYVGLELPNKKRQTVYLREVLDNAKFRDNPSPLTVVLGKDIAGEPVVADLAKMPHLLVAGTTGSGKSVGVNAMILSMLYKAQPEDVRFIMIDPKMLELSVYEGIPHLLTEVVTDMKDAANALRWCVNEMERRYKLMSALGVRNLAGYNEKIAEADRMMRPIPDPYWKPGDSMDAQHPVLKKEPYIVVLVDEFADLMMTVGKKVEELIARLAQKARAAGIHLVLATQRPSVDVITGLIKANIPTRIAFTVSSKIDSRTILDQAGAESLLGMGDMLYSGPNSTLPVRVHGAFVRDQEVHAVVQDWKARGRPQYVDGITSDSESEGGAGGFDGAEELDPLFDQAVQFVTEKRKASISGVQRQFRIGYNRAARIIEQMEAQGIVSEQGHNGNREVLAPPPFD 
>LR134000.1|VDY69257.1|2962367_2962979_-|outer-membrane-lipoprotein-carrier-protein MKKIAITCALLSSLVASSVWADAASDLKSRLDKVSSFHASFTQKVTDGSGAAVQEGQGDLWVKRPNLFNWHMTQPDESILVSDGKTLWFYNPFVEQATATWLKDATGNTPFMLIARNQSSDWQQYNIKQNGDDFVLTPKASNGNLKQFTINVGRDGTIHQFSAVEQDDQRSSYQLKSQQNGAVDAAKFTFTPPQGVTVDDQRK >LR134000.1|VDY69256.1|2961013_2962357_-|putative-ATPase MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHLHSMILWGPPGTGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQNRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELNSALLSRARVYLLKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAIAELVNGDARRALNTLEMMADMAEVDDSGKRVLKPELLTEIAGERSARFDNKGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIASEDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVYTAFKAALADARERPDYDVPVHLRDAPTKLMKEMGYGQEYRYAHDEANAYAAGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR >LR134000.1|VDY69255.1|2959630_2960923_-|seryl-tRNA-synthetase MLDPNLLRNEPDAVAEKLARRGFKLDVDKLGALEERRKVLQVKTENLQAERNSRSKSIGQAKARGEDIEPLRLEVNKLGEELDAAKAELDALQAEIRDIALTIPNLPADEVPVGKDENDNVEVSRWGTPREFDFEVRDHVTLGEMHSGLDFAAAVKLTGSRFVVMKGQIARMHRALSQFMLDLHTEQHGYSENYVPYLVNQDTLYGTGQLPKFAGDLFHTRPLEEEADTSNYALIPTAEVPLTNLVRGEIIDEDDLPIKMTAHTPCFRSEAGSYGRDTRGLIRMHQFDKVEMVQIVRPEDSMAALEEMTGHAEKVLQLLGLPYRKIILCTGDMGFGACKTYDLEVWIPAQNTYREISSCSNVWDFQARRMQARCRSKSDKKTRLVHTLNGSGLAVGRTLVAVMENYQQADGRIEVPEVLRPYMNGLEYIG >LR134000.1|VDY69265.1|2974920_2977197_-|ATP-dependent-Clp-protease-ATP-binding-subunit 
MLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACSVDLVALRQELEAFIEQTTPVLPASEEERDTQPTLSFQRVLQRAVFHVQSSGRNEVTGANVLVAIFSEQESQAAYLLRKHEVSRLDVVNFISHGTRKDEPTQSSDPGSQPNSEEQAGGEERMENFTTNLNQLARVGGIDPLIGREKELERAIQVLCRRRKNNPLLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKALLKQLEQDTNSILFIDEIHTIIGAGAASGGQVDAANLIKPLLSSGKIRVIGSTTYQEFSNIFEKDRALARRFQKIDITEPSIEETVQIINGLKPKYEAHHDVRYTAKAVRAAVELAVKYINDRHLPDKAIDVIDEAGARARLMPVSKRKKTVNVADIESVVARIARIPEKSVSQSDRDTLKNLGDRLKMLVFGQDKAIEALTEAIKMARAGLGHEHKPVGSFLFAGPTGVGKTEVTVQLSKALGIELLRFDMSEYMERHTVSRLIGAPPGYVGFDQGGLLTDAVIKHPHAVLLLDEIEKAHPDVFNILLQVMDNGTLTDNNGRKADFRNVVLVMTTNAGVRETERKSIGLIHQDNSTDAMEEIKKIFTPEFRNRLDNIIWFDHLSTDVIHQVVDKFIVELQVQLDQKGVSLEVSQEARNWLAEKGYDRAMGARPMARVIQDNLKKPLANELLFGSLVDGGQVTVALDKEKNELTYGFQSAQKHKAEAAH >LR134000.1|VDY69266.1|2977227_2977548_-|ATP-dependent-Clp-protease-adaptor-protein MGKTNDWLDFDQLAEEKVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKFFSYDVERATQLMLAVHYQGKAICGVFTAEVAETKVAMVNKYARENEHPLLCTLEKA >LR134000.1|VDY69267.1|2977870_2978095_+|stationary-phase/starvation-inducible-regulatory-protein-CspD MEKGTVKWFNNAKGFGFICPEGGGEDIFAHYSTIQMDGYRTLKAGQSVQFDVHQGPKGNHASVIVPVEVEAAVA >LR134000.1|VDY69268.1|2978167_2980114_-|macrolide-export-ATP-binding/permease-protein MTPLLELKDIRRSYPAGDEQVEVLKGITLDIYAGEMVAIVGASGSGKSTLMNILGCLDKATSGTYRVAGQDVATLDADALAQLRREHFGFIFQRYHLLSHLTAEQNVEVPAVYAGLERKQRLLRAQELLQRLGLEDRTEYYPAQLSGGQQQRVSIARALMNGGQVILADEPTGALDSHSGEEVMAILHQLRDRGHTVIIVTHDPQVAAQAERVIEIRDGEIVRNPPAIEKVNVAGGTEPVVNTVSGWRQFVSGFNEALTMAWRALAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMVLADIRSIGTNTIDVYPGKDFGDDDPQYQQALKYDDLIAIQKQPWVASATPAVSQNLRLRYNNVDVAASANGVSGDYFNVYGMTFSEGNTFNQEQLNGRAQVVVLDSNTRRQLFPHKADVVGEVILVGNMPARVIGVAEEKQSMFGSSKVLRVWLPYSTMSGRVMGQSWLNSITVRVKEGFDSAEAEQQLTRLLSLRHGKKDFFTWNMDGVLKTVEKTTRTLQLFLTLVAVISLVVGGIGVMNIMLVSVTERTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGALGITLSLLIAFTLQLFLPGWEIGFSPLALLLAFLCSTVTGILFGWLPARNAARLDPVDALARE >LR134000.1|VDY69269.1|2980110_2981208_-|macrolide-specific-efflux-protein 
MKKRYVIALVIVIAGLITLWRILNAPVPTYQTLIVRPGDLQQSVLATGKLDALRKVDVGAQVSGQLKTLSVAIGDKVKKDQLLGVIDPEQAENQIKEVEATLMELRAQRQQAEAELKLARVTYSRQQRLAQTQAVSQQDLDTAATEMAVKQAQIGTIDAQIKRNQASLDTAKTNLDYTRIVAPMAGEVTQITTLQGQTVIAAQQAPNILTLADMSTMLVKAQVSEADVIHLKPGQKAWFTVLGDPLTRYEGQIKDVLPTPEKVNDAIFYYARFEVPNPNGLLRLDMTAQVHIQLTDVKNVLTIPLSALGDPVGDNRYKVKLLRNGETREREVTIGARNDTDVEIVKGLEAGDEVVIGEAKPGAAQ >LR134000.1|VDY69270.1|2981340_2982333_+|virulence-protein MVKSTSCITIDFMNMSQLTERTFTPSESLSSLSLFLSLARGQCRPGKFWHRRSFRQKFLLRSLIMPRLSVEWMNELSHWPNLNVLLTRQPRLPVRLHRPYLAANLSRKQLLEALRYHYALLRECMSAEEFSLYLNTPGLQLAKLEGKNGEQFTLELTMMISMDKEGDSTILFRNSEGIPLAEITFTLCEYQGKRTMFIGGLQGAKWEIPHQEIQNATKACHGLFPKRLVMEAACLFAQRLQVEQIIAVSNETHIYRSLRYRDKEGKIHADYNAFWESVGGVCDAERHYRLPAQIARKEIAEIASKKRAEYRRRYEMLDAIQPQMATMFRG >LR134000.1|VDY69271.1|2982329_2983988_-|nucleoside-triphosphate-hydrolase-domain MILERVEIVGFRGINRLSLMLEQNNVLIGENAWGKSSLLDALTLLLSPESDLYHFERDDFWFPPGDINGREHHLHIILTFRESLPGRHRVRRYRPLEACWTPCTDGYHRIFYRLEGESAEDGSVMTLRSFLDKDGHPIDVEDINDQARHLVRLMPVLRLRDARFMRRIRNGTVPNVPNVEVTARQLDFLARELSSHPQNLSDGQIRQGLSAMVQLLEHYFSEQGAGQARYRLMRRRASNEQRSWRYLDIINRMIDRPGGRSYRVILLGLFATLLQAKGTLRLDKDARPLLLIEDPETRLHPIMLSVAWHLLNLLPLQRIATTNSGELLSLTPVEHVCRLVRESSRVAAWRLGPSGLSTEDSRRISFHIRFNRPSSLFARCWLLVEGETETWVINELARQCGHHFDAEGIKVIEFAQSGLKPLVKFARRMGIEWHVLVDGDEAGKKYAATVRSLLNNDREAEREHLTALPALDMEHFMYRQGFSDVFHRMAQIPENVPMNLRKIISKAIHRSSKPDLAIEVAMEAGRRGVDSVPTLLKKMFSRVLWLARGRAD >LR134000.1|VDY69272.1|2984412_2985108_+|aquaporin-Z MFRKLAAECFGTFWLVFGGCGSAVLAAGFPELGIGFAGVALAFGLTVLTMAFAVGHISGGHFNPAVTIGLWAGGRFPAKEVVGYVIAQVVGGIVAAALLYLIASGKTGFDAAASGFASNGYGEHSPGGYSMLSALVVELVLSAGFLLVIHGATDKFAPAGFAPIAIGLALTLIHLISIPVTNTSVNPARSTAVAIFQGGWALEQLWFFWVVPIVGGIIGGLIYRTLLEKRD >LR134000.1|VDY69273.1|2985602_2986502_+|transporter 
MFSGLLIILVPLIVGYLIPLRQKAALKVINQLLSWMVYLILFFMGISLAFLDNLASNLLAILHYSAVSITVILLCNIAALMWLERGLPWRNHHQQEKLPSRIAMALESLKLCGVVVIGFAIGLSGLAFLQHATEASEYTLILLLFLVGIQLRNNGMTLKQIVLNRRGMIVAVVVVASSLIGGLINAFILDLPINTALAMASGFGWYSLSGILLTESFGPVIGSAAFFNDLARELIAIMLIPGLIRRSRSTALGLCGATSMDFTLPVLQRTGGLDMVPAAIVHGFILSLLVPILIAFFSA >LR134000.1|VDY69274.1|2986645_2988298_+|hydroxylamine-reductase MFCVQCEQTIRTPAGNGCSYAQGMCGKTAETSDLQDLLIAALQGLSAWAVKAREYGIINHDVDSFAPRAFFSTLTNVNFDSPRIVGYAREAIALREALKAQCLAVDANARVDNPMADLQLVSDDLGELQRQAAEFTPNKDKAAIGENILGLRLLCLYGLKGAAAYMEHAHVLGQYDNDIYAQYHKIMAWLGTWPADMNALLECSMEIGQMNFKVMSILDAGETGKYGHPTPTQVNVKATAGKCILISGHDLKDLYNLLEQTEGTGVNVYTHGEMLPAHGYPELRKFKHLVGNYGSGWQNQQVEFARFPGPIVMTSNCIIDPTVGAYDDRIWTRSIVGWPGVRHLDGDDFSAVITQAQQMAGFPYSEIPHLITVGFGRQTLLGAADTLIDLVSREKLRHIFLLGGCDGARGERHYFTDFATSVPDDCLILTLACGKYRFNKLEFGDIEGLPRLVDAGQCNDAYSAIILAVTLAEKLGCGVNDLPLSLVLSWFEQKAIVILLTLLSLGVKNIVTGPTAPGFLTPDLLAVLNEKFGLRSITTVEEDMKQLLSA |
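The reported location of LR134000_7 (2974507-2974711) and the spacer start positions in its spacer list can be cross-checked against the repeat and spacer lengths in the CRISPRCasFinder record. The sketch below assumes 1-based inclusive coordinates and a strictly alternating repeat/spacer layout; the function name `spacer_starts` is illustrative and not part of any tool named in the table.

```python
# Alternating repeat/spacer segments of LR134000_7, copied from the
# CRISPRCasFinder record in the table (repeat, spacer, repeat, ...).
ARRAY_START = 2974507  # assumed 1-based start of the reported location
SEGMENTS = [
    "CATTTTATCTGTCTGTACGGCAGTGAAC",       # repeat
    "ATGGTGGGTGGAGTATGTTACCTGTGAA",       # spacer 7.1
    "TTTCTAAGCTGCCTGTACGGCAGTGAAC",       # repeat
    "GCCAACTTTCACAGCCTTTGCAAGATTGTTCC",   # spacer 7.2
    "TTTCTAAGCTGCCTGTACGGCAGTGAAC",       # repeat
    "GTTCGCTGCAACCGCTAGCCAAGACGGTAGGTT",  # spacer 7.3
    "TTTCTAAGCTGCCTGTACGGCAGTGAAC",       # repeat
]


def spacer_starts(array_start, segments):
    """Return (spacer start coordinates, array end), 1-based inclusive."""
    starts = []
    pos = array_start
    for index, segment in enumerate(segments):
        if index % 2 == 1:  # odd indices hold spacers
            starts.append(pos)
        pos += len(segment)
    return starts, pos - 1


starts, array_end = spacer_starts(ARRAY_START, SEGMENTS)
print(starts, array_end)
```

If the computed values disagree with the spacer headers or the location column, the record was probably truncated or merged differently by one of the prediction tools, as with the shorter PILER-CR call (2974563-2974711) for this array.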
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134000_8 | 3094785-3094929 | Orphan |
NA
Consensus repeat of LR134000_8
|
1 spacer
spacers of LR134000_8
>8.1|3094837|41|LR134000|CRISPRCasFinder TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC |
CRISPR arrays and Neighbor proteins around LR134000_8
The CRISPR arrays of LR134000_8 >merge|LR134000|8|3094785-3094929|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGCTGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTCGTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC >LR134000|8|8|3094785-3094929|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC
>LR134000.1|VDY69386.1|3093435_3094719_+|acyl-CoA-thioester-hydrolase MNTFSVSRLALALAFGVTLTACSSTPPDQRPSDQTAPGTSSRPILSAKEAQNFDAQHYFASLTPGAAAWNPSPITLPAQPEFVVGPAGTQGVTHTTIQAAVDAAIIKRTNKRQYIAVMPGEYQGTVYVPAAPGGITLYGTGEKPIDVKIGLSLDGGMSPADWRHDVNPRGKYMPGKPAWYMYDSCQSKRSDSIGVLCSAVFWSQNNGLQLQNLTIENTLGDSVDAGNHPAVALRTDGDQVQINNVNILGRQNTFFVTNSGVQNRLETNRQPRTLVTNSYIEGDVDIVSGRGAVVFDNTEFRVVNSRTQQEAYVFAPATLSNIYYGFLAVNSRFNAFGDGVAQLGRSLDVDANTNGQVVIRDSAINEGFNTAKPWADAVISNRPFAGNTGSVDDNDEIQRNLNDTNYNRMWEYNNRGVGSKVVAEAKK >LR134000.1|VDY69384.1|3092807_3093284_+|putative-phosphatidylethanolamine-binding-protein MKLISNDLRDGDKLPHRHVFNGMGYDGDNISPHLAWDDVPAGTKSFVVTCYDPDAPTGSGWWHWVVVNLPADTRVLPQGFGSGLVAMPDGVLQTRTDFGKTGYDGAAPPKGETHRYIFTVHALDIERIDVDEGASGAMVGFNVHFHSLASASITAMFS >LR134000.1|VDY69382.1|3091459_3092749_+|adenosylmethionine-8-amino-7-oxononanoate-aminotransferase MTTDDLAFDQRHIWHPYTSMTSPLPVYPVASAEGCELILSDGRRLVDGMSSWWAAIHGYNHPQLNAAMKSQIDAMSHVMFGGITHAPAIELCRKLVAMTPQPLECVFLADSGSVAVEVAMKMALQYWQAKGEARQRFLTFRNGYHGDTFGAMSVCDPDNSMHSLWKGYLPENLFAPAPQSRMDGEWDERDMVGFARLMAAHRHEIAAVIIEPIVQGAGGMRMYHPEWLKRIRKICDREGILLIADEIATGFGRTGKLFACEHAEIAPDILCLGKALTGGTMTLSATLTTREVAETISNGEAGCFMHGPTFMGNPLACAAANASLAILESGDWQQQVADIEVQLREQLAPARDAEMVADVRVLGAIGVVETTHPVNMAALQKFFVEQGVWIRPFGKLIYLMPPYIILPQQLQRLTAAVNRAVQDETFFCQ >LR134000.1|VDY69380.1|3090332_3091373_-|biotin-synthetase MAHRPRWTLSQVTELFEKPLLDLLFEAQQVHRQHFDPRQVQVSTLLSIKTGACPEDCKYCPQSSRYKTGLEAERLMEVEQVLESARKAKAAGSTRFCMGAAWKNPHERDMPYLEQMVQGVKAMGLEACMTLGTLSESQAQRLANAGLDYYNHNLDTSPEFYGNIITTRTYQERLDTLEKVRDAGIKVCSGGIVGLGETVKDRAGLLLQLANLPTPPESVPINMLVKVKGTPLADNDDVDAFDFIRTIAVARIMMPTSYVRLSAGREQMNEQTQAMCFMAGANSIFYGCKLLTTPNPEEDKDLQLFRKLGLNPQQTAVLAGDNEQQQRLEQALMTPDTDEYYNAAAL >LR134000.1|VDY69378.1|3089181_3090336_-|8-amino-7-oxononanoate-synthase 
MSWQDKINAALDARRAADALRRRYPVAQGAGRWLVADDRQYLNFSSNDYLGLSHHPQIIRAWKQSAEQFGVGSGGSGHVSGYSVAHQALEEELAEWLGYSRALLFISGFAANQAVIAAMMAKEDRIVADRLSHASLLEAASLSPSQLRRFVHNDVTHLARLLASPCPGQQMVVTEGVFSMDGDSAPLAEIQQVTQQHNGWLMVDDAHGTGVIGEQGRGSCWLQKVKPELLVVTFGKGFGVSGAAVLCSSTVADYLLQFARHLIYSTSMPPAQAQALRASLAVIRSDEGDARREKLAALITRFRAGVQDLPFTLADSCSAIQPLIVGDNSRALQLAEKLRQQGCWVTAIRPPTVPAGTARLRLTLTAAHEMQDIDRLLEVLHGNG >LR134000.1|VDY69376.1|3088439_3089195_-|biotin-biosynthesis-protein-BioC MATVNKQAIAAAFGRAAAHYEQHADLQRQSADALLAMLPQRKYTHVLDAGCGPGWMSRHWRERHAQVTALDLSPPMLVQARQKDAADHYLAGDIESLPLATATFDLAWSNLAVQWCGNLSTALRELYRVVRPKGVVAFTTLVQGSLPELHQAWQAVDERPHANRFLPPDEIEQSLNGVHYQHHIQPITLWFDDALSAMRSLKGIGATHLHEGRDPRILTRSQLQRLQLAWPQQQGRYPLTYHLFLGVIARE >LR134000.1|VDY69374.1|3087769_3088447_-|dithiobiotin-synthetase MSKRYFVTGTDTEVGKTVASCALLQAAKAAGYRTAGYKPVASGSEKTPEGLRNSDALALQRNSSLQLDYATVNPYTFAEPTSPHIISAQEGRPIESLVMSAGLRALEQQADWVLVEGAGGWFTPLSDTFTFADWVTQEQLPVILVVGVKLGCINHAMLTAQVIQHAGLTLAGWVANDVTPPGKRHAEYMTTLTRMIPAPLLGEIPWLAENPENAATGKYINLALL >LR134000.1|VDY69373.1|3085169_3087191_-|UvrABC-system-protein-B-(excinuclease-ABC-subunit-B) MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIANVIADLQRPTMVLAPNKTLAAQLYGEMKEFFPENAVEYFVSYYDYYQPEAYVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVVVVASVSAIYGLGDPDLYLKMMLHLTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVIDIFPAESDDIALRVELFDEEVERLSLFDPLTGQIVSTIPRFTIYPKTHYVTPRERIVQAMEEIKEELAARRKVLLENNKLLEEQRLTQRTQFDLEMMNELGYCSGIENYSRFLSGRGPGEPPPTLFDYLPADGLLVVDESHVTIPQIGGMYRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYELEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVLVTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERMEIIRDLRLGEFDVLVGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKAILYGDKITPSMAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALGQNIAKTKAKGRGKSRPIVEPDNVPMDMSPKALQQKIHELEGLMMQHAQNLEFEEAAQIRDQLHQLRELFIAAS >LR134000.1|VDY69371.1|3084069_3084978_+|transferase-with-NAD(P)-binding-Rossmann-fold-domain;-UPF0052-family 
MRNRTLADLDRVVALGGGHGLGRVLSSLSSLGSRLTGIVTTTDNGGSTGRIRRSEGGIAWGDMRNCLNQLITEPSVASAMFEYRFGGNGELSGHNLGNLMLKALDHLSVRPLEAINLIRNLLKVDTHLIPMSEHPVDLMAIDDQGHEVYGEVNIDQLTTPIQELLLTPNVPATREAVHAINEADLIIIGPGSFYTSLMPILLLKEIAQALRRTPAPMVYIGNLGRELSLPAANLKLESKLAIMEQYVGKKVIDAVIVGPKVDVSAVKERIVIQEVLEASDIPYRHDRQLLHNALEKALQALG >LR134000.1|VDY69369.1|3082149_3082662_-|molybdenum-cofactor-biosynthesis-protein-B MSQVSTEFIPTRIAILTVSNRRGEEDDTSGHYLRDSAQEAGHHVVDKAIVKENRYAIRAQVSAWIASDDVQVVLITGGTGLTEGDQAPEALLPLFDREVEGFGEVFRMLSFEEIGTSTLQSRAVAGVANKTLIFAMPGSTKACRTAWENIIAPQLDARTRPCNFHPHLKK >LR134000.1|VDY69387.1|3094952_3097214_-|putative-aconitase MIKLSEKGVFLASNNEIIAEEHFTREIKKEEAKKGTIAWSILSSHNTSGNMDKLKIKFDSLASHDITFVGIVQTAKASGMERFPLPYVLTNCHNSLCAVGGTINGDDHVFGLSAAQRYGGIFVPPHIAVIHQYMREMMAGGGKMILGSDSHTRYGALGTMAVGEGGGELVKQLLNDTWDIDYPGVVAVHLTGKPAPYVGPQDVALAIIGAVFKNGYVKNKVMEFVGPGVAALSTDFRNSVDVMTTETTCLSSVWQTDEEVHNWLALHGRGQDYCQLNPQPMAYYDGCISVDLSAIKPMIALPFHPSNVYEIDTLNQNLTDILREIEIESERVAHGKAKLSLLDKVENGRLKVQQGIIAGCSGGNYENVIAAANALRGQSCGNDTFSLAVYPSSQPVFMDLAKKGVVADLIGAGAIIRTAFCGPCFGAGDTPINNGLSIRHTTRNFPNREGSKPANGQMSAVALMDARSIAATAANGGYLTSASELDCWDNVPEYAFDVTPYKNRVYQGFVKGATQQPLIYGPNIKDWPELGALTDNIVLKVCSKILDEVTTTDELIPSGETSSYRSNPIGLAEFTLSRRDPGYVGRSKATAELENQRLAGNVSELTEVFARIKQIAGQEHIDPLQTEIGSMVYAVKPGDGSAREQAASCQRVIGGLANIAEEYATKRYRSNVINWGMLPLQMAEVPTFEVGDYIYIPGIKAALDNPGTTFKGYVIHEDAPVTEITLYMESLTAEEREIIKAGSLINFNKNRQM >LR134000.1|VDY69389.1|3097396_3098830_-|putative-Sodium:sulfate-symporter MNKKSLWKLILILAIPCIIGFMPAPAGLSELAWVLFGIYLAAIVGLVIKPFPEPVVLLIAVAASMVVVGNLSDGAFKTTAVLSGYSSGTTWLVFSAFTLSAAFVTTGLGKRIAYLLIGKIGNTTLGLGYVTVFLDLVLAPATPSNTARAGGIVLPIINSVAVALGSEPEKSPRRVGHYLMMSIYMVTKTTSYMFFTAMAGNILALKMINDILHLQISWGGWALAAGLPGIIMLLVTPLVIYTMYPPEIKKVDNKTIAKAGLAELGPMKIREKMLLGVFVLALLGWIFSKSLGVDESTVAIVVMATMLLLGIVTWEDVVKNKGGWNTLIWYGGIIGLSSLLSKVKFFEWLAEVFKNNLAFDGHGNVAFFVIIFLSIIVRYFFASGSAYIVAMLPVFAMLANVSGAPLMLTALVLLFSNSYGGMVTHYGGAAGPVIFGVGYNDIKSWWLVGAVLTILTFLVHITLGVWWWNMLIGWNML >LR134000.1|VDY69391.1|3098905_3099958_-|3-methylitaconate-isomerase 
MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGIGGGNPLTSKVAIISRSSDPRADVDYLFAQVIVHEQRVDTTPNCGNMLSGVGAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARIDGVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVIIPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGLGDVSNMVIPKPVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIVPSVGYGNINIEHPSGALDVHLSNEGQDATTLRASVIRTTRKIFSGEVYLP >LR134000.1|VDY69393.1|3100141_3101095_+|DNA-binding-transcriptional-regulator MKHELSSMKAFVILAESSSFNNAAKLLNITQPALTRRIKKMEEDLHIQLFERTTRKVTLTKAGKRLLPEARELIKKFDETLFNIRDMNAYHRGMVTLACIPTAVFYFLPLAIGKFNELYPNIKVRILEQGTNNCMESVLCNESDFGINMNNVTNSSIDFTPLVNEPFVLACRRDHPLAKKQLVEWQELVGYKMIGVRSSSGNRLLIEQQLADKPWKLDWFYEVRHLSTSLGLVEAGLGISALPGLAMPHAPYSSIIGIPLVEPVIRRTLGIIRRKDAVLSPAAERFFALLINLWTDDKDNLWTNIVERQRHALQEIG >LR134000.1|VDY69395.1|3101135_3102131_-|6-phosphogluconolactonase MKQTVYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSPDKRYLYVGVRPEFRVLAYRIAPDDGALTFAAESALPGSPTHISTDHQGQFVFVGSYNAGNVSVTRLEDGLPVGVVDVVEGLDGCHSANISPDNRTLWVPALKQDRICLFTVSDDGHLVAQDPAEVTTVEGAGPRHMVFHPNEQYAYCVNELNSSVDVWELKDPHGNIECVQTLDMMPENFSDTRWAADIHITPDGRHLYACDRTASLITVFSVSEDGSVLSKEGFQPTETQPRGFNVDHSGKYLIAAGQKSHHISVYEIVGEQGLLHEKGRYAVGQGPMWVVVNAH >LR134000.1|VDY69397.1|3102285_3103104_+|Phosphotransferase MTTRVIALDLDGTLLTPKKTLLPSSIEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYHAKTVLEADPMPVIKALQLIEMLNEHHIHGLMYVDDAMVYEHPTGHVIRTSNWAQTLPPEQRPTFTQVASLAETAQQVNAVWKFALTHDDLPQLQHFGKHVEHELGLECEWSWHDQVDIARGGNSKGKRLTKWVEAQGWSMENVVAFGDNFNDISMLEAAGTGVAMGNADDAVKARANIVIGDNTTDSIAQFIYSHLI >LR134000.1|VDY69399.1|3103104_3104163_-|molybdenum-ABC-transporter-ATP-binding-protein MLELNFSQTLGNHCLTINETLPANGITAIFGVSGAGKTSLINAISGLTRPQKGRIVLNGRVLNDAEKGICLTPEKRRVGYVFQDARLFPHYKVRGNLRYGMSKSMVDQFDKLVALLGIEPLLDRLPGSLSGGEKQRVAIGRALLTAPELLLLDEPLASLDIPRKRELLPYLQRLTREINIPMLYVSHSLDEILHLADRVMVLENGQVKAFGALEEVWGSSVMNPWLPKEQQSSILKVTVLEHHPHYAMTALALGDQHLWVNKLDEPLQAALRIRIQASDVSLVLQPPQQTSIRNVLRAKVVNSYDDNGQVEVELEVGGKTLWARISPWARDELAIKPGLWLYAQIKSVSITA >LR134000.1|VDY69401.1|3104165_3104855_-|molybdenum-ABC-transporter-permease 
MILTDPEWQAVLLSLKVSSLAVLFSLPFGIFFAWLLVRCTFPGKALLDSVLHLPLVLPPVVVGYLLLVSMGRRGFIGERLYDWFGITFAFSWRGAVLAAAVMSFPLMVRAIRLALEGVDVKLEQAARTLGAGRWRVFFTITLPLTLPGIIVGTVLAFARSLGEFGATITFVSNIPGETRTIPSAMYTLIQTPGGESGAARLCIISIALAMISLLISEWLARISRERAGR >LR134000.1|VDY69402.1|3104854_3105628_-|molybdate-transporter-periplasmic-protein MARKWLNLFAGAALSFAVAGNALADEGKITVFAAASLTNAMQDIATQFKKEKGVDVVSSFASSSTLARQIEAGAPADLFISADQKWMDYAVDKKAIDTATRQTLLGNSLVVVAPKASVQKDFTIDSKTNWTSLLNGGRLAVGDPEHVPAGIYAKEALQKLGAWDTLSPKLAPAEDVRGALALVERNEAPLGIVYGSDAVASKGVKVVATFPEDSHKKVEYPVAVVEGHNNATVKAFYDYLKGPQAAEIFKRYGFTIK >LR134000.1|VDY69404.1|3105794_3105944_-|outer-membrane-or-exported-protein MLELLKSLVFAVIMVPVVMAIILGLIYGLGEVFNIFSGVGKKDQPGQNH |
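The spacer entries in these tables are flattened FASTA-like records of the form `>id|start|length|contig|tools SEQUENCE`. A minimal parsing sketch follows, assuming that field order (inferred from the entries shown here, not from any documented schema); `parse_spacers` and the dictionary keys are hypothetical names chosen for illustration.

```python
import re

# One flattened spacer line as it appears in the table for LR134000_8.
LINE = (
    ">8.1|3094837|41|LR134000|CRISPRCasFinder "
    "TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC |"
)

# Assumed layout: >id|start|length|contig|tool[,tool...] followed by the sequence.
RECORD = re.compile(r">(\S+?)\|(\d+)\|(\d+)\|([^|\s]+)\|(\S+)\s+([ACGT]+)")


def parse_spacers(line):
    """Split a flattened spacer line into one dict per spacer record."""
    records = []
    for match in RECORD.finditer(line):
        spacer_id, start, length, contig, tools, seq = match.groups()
        records.append({
            "id": spacer_id,
            "start": int(start),
            "contig": contig,
            "tools": tools.split(","),
            "sequence": seq,
            # True when the declared length matches the sequence itself.
            "length_ok": int(length) == len(seq),
        })
    return records


spacers = parse_spacers(LINE)
for spacer in spacers:
    print(spacer["id"], spacer["start"], spacer["tools"], spacer["length_ok"])
```

The same pattern also matches lines that carry several records, since each record begins with `>` and the sequence is restricted to A, C, G and T.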
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134000_9 | 3473302-3473446 | Orphan |
NA
Consensus repeat of LR134000_9
|
1 spacer
spacers of LR134000_9
>9.1|3473345|59|LR134000|CRISPRCasFinder CGGAGCACTTATTGCCGGATGCGGCGTGAACGCCTTATCCGGCCTACGGTTCTGGCACC |
CRISPR arrays and Neighbor proteins around LR134000_9
The CRISPR arrays of LR134000_9 >merge|LR134000|9|3473302-3473446|CRISPRCasFinder TTTTGTAGGCCTGATAAGACGCGACAAGCGTCGCATCAGGCATCGGAGCACTTATTGCCGGATGCGGCGTGAACGCCTTATCCGGCCTACGGTTCTGGCACCTTTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAT >LR134000|9|9|3473302-3473446|CRISPRCasFinder TTTTGTAGGCCTGATAAGACGCGACAAGCGTCGCATCAGGCAT CGGAGCACTTATTGCCGGATGCGGCGTGAACGCCTTATCCGGCCTACGGTTCTGGCACC TTTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAT
>LR134000.1|VDY70009.1|3471956_3473225_+|MFS-transporter-AraJ MALLVVILQAITLLATVIGSRSGGCDGGMKKVILSLALGTFGLGMAEFGIMGVLTELAHNVGISIPAAGHMISYYALGVVVGAPIIALFSSRYSLKHILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIKPGKVTAAVAGMVSGMTVANLLGIPLGTYLSQEFSWRYTFLLIAVFNIAVMASVYFWVPDIRDEAKGNLREQFHFLRSPAPWLIFAATMFGNAGVFAWFSYVKPYMMFISGFSETAMTFIMMLVGLGMVLGNMLSGRISGRYSPLRIAAVTDFIIVLALLMLFFCGGMKTTSLIFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAFNLGSAVGAYCGGMMLTLGLAYNYVALPAALLSFAAMSSLLLYGRYKRQQAADTPVLAKPLG >LR134000.1|VDY70007.1|3468771_3471915_+|exonuclease,-dsDNA,-ATP-dependent MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAICLALYHETPRLSNVSQSQNDLMTRDTAECLAEVEFEVKGEAYRAFWSQNRARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRSMLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTELEKLQAQASGVALLTPEQVQSLTASLQVLTDEEKQLLTAQQQEQQSLNWLTRLDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIAEHSAALAHTRQQIEEVNTRLQSTMALRASIRHHAAKQSAELQQQQQSLNTWLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALAAITLTLTADEVATALAQHAEQRPLRQHLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGAALRGQLDALTKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPQDDIQPWLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTALQNRIQQLTPILETLPQSDDLPHSEETVALDNWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKAQAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLVTQTAETLAQHQQHRPDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQQLKQDADNRQQQQTLMQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQGLTLDNLVHLANQQLTRLHGRYLLQRKASEALEVEVVDTWQADAVRDTRTLSGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDALDALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESTFAVK >LR134000.1|VDY70005.1|3467572_3468775_+|exonuclease-SbcD 
MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETAQTHQVDAIIVAGDVFDTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFLNTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIITSQAGLNGIEKQQHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIYIGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECGKSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVSQEPPVWLDIEITTDEYLHDIQRKIQALTESLPVEVLLVRRSREQRERVLASQQRETLSELSVEEVFNRRLALEELDESQQQRLQHLFTTTLHTLAGEHEA >LR134000.1|VDY70003.1|3466693_3467383_-|phosphate-regulon-two-component-system,-response-regulator MARRILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGIQFIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRISPMAVEEVIEMQGLSLDPTSHRVMAGEEPLEMGPTEFKLLHFFMTHPERVYSREQLLNHVWGTNVYVEDRTVDVHIRRLRKALEPGGHDRMVQTVRGTGYRFSTRF >LR134000.1|VDY70001.1|3465340_3466636_-|phosphate-regulon-two-component-system,-sensor-kinase MLERLSWKRLVLELLLCCLPAFILGAFFGYLPWFLLASITGLLIWHFWNLLRLSWWLWVDRSMTPPPGRGSWEPLLYGLHQMQLRNKKRRRELGNLIKRFRSGAESLPDAVVLTTEEGGIFWCNGLAQQILGLRWPEDNGQNILNLLRYPEFTQYLKTRDFSRPLNLVLNTGRHLEIRVMPYTHKQLLMVARDVTQMHQLEGARRNFFANVSHELRTPLTVLQGYLEMMDEQPLEGAVREKALHTMREQTQRMEGLVKQLLTLSKIEAAPTHLLNEKVDVPMMLRVVEREAQTLSQKKQIFTFEIDNGLKVSGNEDQLRSAISNLVYNAVNHTPEGTHITVRWQRVPHGAEFSVEDNGPGIAPEHIPRLTERFYRVDKARSRQTGGSGLGLAIVKHAVNHHESRLNIESTVGKGTRFSFVIPERLIAKNSD >LR134000.1|VDY69999.1|3463614_3464934_-|branched-chain-amino-acid-transport-system-II-carrier-protein MTHQLRSRDIIALGFMTFALFVGAGNIIFPPMVGLQAGEHVWTAAFGFLITAVGLPVLTVVALAKVGGGVDSLSTPIGKVAGVLLATVCYLAVGPLFATPRTATVSFEVGIAPLTGDSALPLFIYSLVYFAIVILVSLYPGKLLDTVGNFLAPLKIIALVILSVAAIVWPAGSISTATEAYQNAAFSNGFVNGYLTMDTLGAMVFGIVIVNAARSRGVTEARLLTRYTVWAGLMAGVGLTLLYLALFRLGSDSASLVDQSANGAAILHAYVQHTFGGGGSFLLAALIFIACLVTAVGLTCACAEFFAQYVPLSYRTLVFILGGFSMVVSNLGLSQLIQISVPVLTAIYPPCIALVVLSFTRSWWHNSSRVIAPPMFISLLFGILDGIKASAFSDILPSWAQRLPLAEQGLAWLMPTVVMVVLAIIWDRAAGRQVTSSAH >LR134000.1|VDY69997.1|3462165_3463539_-|proline-specific-permease 
MESKNKLKRGLSTRHIRFMALGSAIGTGLFYGSADAIKMAGPSVLLAYIIGGIAAYIIMRALGEMSVHNPAASSFSRYAQENLGPLAGYITGWTYCFEILIVAIADVTAFGIYMGVWFPTVPHWIWVLSVVLIICAVNLMSVKVFGELEFWFSFFKVATIIIMIVAGFGIIIWGIGNGGQPTGIHNLWSNGGFFSNGWLGMVMSLQMVMFAYGGIEIIGITAGEAKDPEKSIPRAINSVPMRILVFYVGTLFVIMSIYPWNQVGTAGSPFVLTFQHMGITFAASILNFVVLTASLSAINSDVFGVGRMLHGMAEQGSAPKIFSKTSRRGIPWVTVLVMTTALLFAVYLNYIMPENVFLVIASLATFATVWVWIMILLSQIAFRRRLPPEEVKALKFKVPGGVATTIGGLIFLLFIIGLIGYHPDTRISLYVGFAWIVVLLIGWMFKRRHDRQLAENQ >LR134000.1|VDY69995.1|3460192_3462010_-|maltodextrin-glucosidase MMLNAWHLPVPPFVKQSKDQLLITLWLTGEDPPQRIMLRTEHDNEEMSVPMHKQRSQPQPGVTAWRAAIDLSSGQPRRRYSFKLLWHDRQRWFTPQGFSRMPPARLEQFAVDVPDIGPQWAADQIFYQIFPDRFARSLPREAEQDHVYYHHAAGQEIILRDWDEPVTAQAGGSTFYGGDLDGISEKLPYLKKLGVTALYLNPVFKAPSVHKYDTEDYRHVDPQFGGDGALLRLRHNTQQLGMRLVLDGVFNHSGDSHAWFDRHNRGTGGACHNPESPWRDWYSFSDDGTALDWLGYASLPKLDYQSESLVNEIYRGEDSIVRHWLKAPWSMDGWRLDVVHMLGEAGGARNNMQHVAGITEAAKETQPEAYIVGEHFGDARQWLQADVEDAAMNYRGFTFPLWGFLANTDISYDPQQIDAQTCMAWMDNYRAGLSHQQQLRMFNQLDSHDTARFKTLLGRDIARLPLAVVWLFTWPGVPCIYYGDEVGLDGKNDPFCRKPFPWQVEKQDTALFALYQRMIALRKKSQALRHGGCQVLYAEDNVVVFVRVLNQQRVLVAINRGEACEVVLPASPFLNAVQWQCKEGHGQLTDGILALPAISATVWMN >LR134000.1|VDY69993.1|3459606_3460188_+|acyl-carrier-protein-phosphodiesterase MNFLAHLHLAHLAESSLSGNLLADFVRGNPEESFPPDVVAGIHMHRRIDVLTDNLPEVREAREWFRSETRRVAPITLDVMWDHFLSRHWSQLSPDFPLQEFVCYAREQVMTILPDSPPRFINLNNYLWSEQWLVRYRDMDFIQNVLNGMASRRPRLDALRDSWYDLDAHYDALETRFWQFYPRMMAQASRKAL >LR134000.1|VDY69991.1|3458443_3459514_-|S-adenosylmethionine--tRNA-ribosyltransferase-isomerase MRVTDFSFELPESLIAHYPMPERSSCRLLSLDGPTGALTHGTFTDLLDKLNPGDLLVFNNTRVIPARLFGRKASGGKIEVLVERMLDDKRILAHIRASKAPKPGAELLLGDDESINATMTARHGALFEVEFNDERSVLDILNSIGHMPLPPYIDRPDEDADRELYQTVYSEKPGAVAAPTAGLHFDEPLLEKLRAKGVEMAFVTLHVGAGTFQPVRVDTIEDHIMHSEYAEVPQDVVDAVLAAKARGNRVIAVGTTSVRSLESAAQAAKNDLIEPFFDDTQIFIYPGFQYKVVDALVTNFHLPESTLIMLVSAFAGYQHTMNAYKAAVEEKYRFFSYGDAMFITYNPQAINERVGE >LR134000.1|VDY70011.1|3473469_3474378_-|fructokinase 
MRIGIDLGGTKTEVIALGDAGEQLYRHRLPTPRDDYRQTIETIATLVDMAEQATGQRGTVGMGIPGSISPYTGVVKNANSTWLNGQPFDKDLSARLQREVRLANDANCLAVSEAVDGAAAGAQTVFAVIIGTGCGAGVAFNGRAHIGGNGTAGEWGHNPLPWMDEDELRYREEVPCYCGKQGCIETFISGTGFAMDYRRLSGHALKGSEIIRLVEESDPVAELALRRYELRLAKSLAHVVNILDPDVIVLGGGMSNVDRLYQTVGQLIKQFVFGGECETPVRKAKHGDSSGVRGAAWLWPQE >LR134000.1|VDY70013.1|3474436_3475414_+|exonuclease-RdgC MQGRRQFVIMPAKFNDKAVEIIMLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMGSHSDALTHVANGQIVICARKEEKILPSPVIKQALEAKIAKLEAEQARKLKKTEKDSLKDEVLHSLLPRAFSRFSQTMMWIDTVNGLIMVDCASAKKAEDTLALLRKSLGSLPVVPLSMENPIELTLTEWVRSGSAAQGFQLLDEAELKSLLEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRIQFVMCDDGSLKRLKFCDELRDQNEDIDREDFAQRFDADFILMTGELAALIQNLIEGLGGEAQR >LR134000.1|VDY70015.1|3475774_3475894_+|Uncharacterised-protein MSLTTTPGYVRCAMPFGVNDHCDFAAMHATIHHLIYLLS >LR134000.1|VDY70017.1|3476060_3476345_-|conserved-protein,-UPF0345-family MLQSNEYFSGKVKSIGFSSSSTGRASVGVMVEGEYTFSTAEPEEMTVISGALNVLLPDATDWQVYEAGSVFNVPGHSEFHLQVAEPTSYLCRYL >LR134000.1|VDY70019.1|3476416_3477094_-|putative-chorismate-biosynthesis-protein MSASLAILTIGIVPMQEVLPLLTEYIDEDNISHHSLLGKLSREEVMAEYAPEAGEDTILTLLNDNQLAHVSRRKVERDLQGVVEVLDNRGYDVIILMSTANISSMTARNTIFLEPSRILPPLVSSIVEDHQVGVIVPVEEMLPVQAQKWQILQKSPVFSLGNPIHDSEQKIIDAGKELLAKGADVIMLDCLGFHQRHRDLLQKQLDVPVLLSNVLIARLAAELLV >LR134000.1|VDY70021.1|3477351_3477543_-|protein MPTKPPYPREAYIVTIEKGKPGQTVTWYQLRADHPKPDSLISEHPTAQEAMDAKKRYEDPDKE >LR134000.1|VDY70023.1|3477592_3478117_-|shikimate-kinase MTQPLFLIGPRGCGKTTVGMALADSLNRRFVDTDQWLQSQLNMTVAEIVEREEWAGFRARETAALEAVTAPSTVIATGGGIILTEFNRHFMQNNGIVVYLCAPVSVLVNRLQAAPEEDLRPTLTGKPLSEEVQEVLEERDALYREVAHIIIDATNEPSQVISEIRSALAQTINC >LR134000.1|VDY70026.1|3478293_3478758_-|putative-cytoplasmic-protein MTIWVDADACPNVIKEILYRAAERMQMPLVLVANQSLRVPPSRFIRTLRVAAGFDVADNEIVRQCEAGDLVITADIPLAAEAIEKGAAAINPRGERYTPATIRERLTMRDFMDTLRASGIQTGGPDSLSQRDRQAFAAELEKWWLEVQRSRGQM >LR134000.1|VDY70028.1|3478877_3479687_+|pyrroline-5-carboxylate-reductase 
MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDKVAALHDQFGINAAESAQEVAQIADIIFAAVKPGIMIKVLSEITSSLNKDSLVVSIAAGVTLDQLARALGHDRKIIRAMPNTPALVNAGMTSVTPNALVTPEDTADVLNIFRCFGEAEVIAEPMIHPVVGVSGSSPAYVFMFIEAMADAAVLGGMPRAQAYKFAAQAVMGSAKMVLETGEHPGALKDMVCSPGGTTIEAVRVLEEKGFRAAVIEAMTKCMEKSEKLSKS >LR134000.1|VDY70029.1|3479703_3480819_-|putative-signal-transduction-protein MFPKIMNDENFFKKAAAHGEEPPLTPQNEHQRSGLRFARRVRLPRAVGLAGMFLPIASTLVSHPPPGWWWLVLVGWAFVWPHLAWQIASRAVDPLSREIYNLKTDAVLAGMWVGVMGVNVLPSTAMLMIMCLNLMGAGGPRLFVAGLVLMVVSCLVTLELTGITVSFNSAPLEWWLSLPIIVIYPLLFGWVSYQTATKLAEHKRRLQVMSTRDGMTGVYNRRHWETMLRNEFDNCRRHNRDATLLIIDIDHFKSINDTWGHDVGDEAIVALTRQLQITLRGSDVIGRFGGDEFAVIMSGTPAESAITAMLRVHEGLNTLRLPNTPQVTLRISVGVAPLNPQMSHYREWLKSADLALYKAKKAGRNRTEVAA |
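The single-spacer arrays reported with repeat type NA (LR134000_8, LR134000_9, LR134000_10) each consist of two repeat copies that are not perfectly identical. A small sketch for listing the mismatching positions between two equal-length repeat copies, using the two LR134000_9 repeats from the CRISPRCasFinder record; `mismatches` is a hypothetical helper, and positions are counted from 1 within the repeat.

```python
# The two repeat copies flanking spacer 9.1, copied from the table.
REPEAT_1 = "TTTTGTAGGCCTGATAAGACGCGACAAGCGTCGCATCAGGCAT"
REPEAT_2 = "TTTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAT"


def mismatches(first, second):
    """List (position, base_in_first, base_in_second) for differing sites."""
    if len(first) != len(second):
        raise ValueError("repeat copies must have the same length")
    return [
        (index + 1, a, b)
        for index, (a, b) in enumerate(zip(first, second))
        if a != b
    ]


diffs = mismatches(REPEAT_1, REPEAT_2)
print(len(REPEAT_1), diffs)
```

This is a plain position-by-position comparison, not an alignment, so it only applies when both copies have the same length, as they do for the three arrays above.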
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134000_10 | 3630888-3630984 | Orphan |
NA
Consensus repeat of LR134000_10
|
1 spacer
spacers of LR134000_10
>10.1|3630915|43|LR134000|CRISPRCasFinder CAATTGCCGGATGCGGCACAAGTTTGTAGGCACGATAAGACGC |
CRISPR arrays and Neighbor proteins around LR134000_10
The CRISPR arrays of LR134000_10 >merge|LR134000|10|3630888-3630984|CRISPRCasFinder GCCAGCGTCGCATCAGGCATCCGCGCACAATTGCCGGATGCGGCACAAGTTTGTAGGCACGATAAGACGCGCCAGCGTCGCATCAGGCATCTGCGCA >LR134000|10|10|3630888-3630984|CRISPRCasFinder GCCAGCGTCGCATCAGGCATCCGCGCA CAATTGCCGGATGCGGCACAAGTTTGTAGGCACGATAAGACGC GCCAGCGTCGCATCAGGCATCTGCGCA
>LR134000.1|VDY70328.1|3628678_3630772_+|lateral-flagellar-export/assembly-protein-(FlhA-like) MAKTTKSFLALLRGGNLGVPLVILCILAMVILPLPPALLDILFTFNIVLAVMVLLVAVSAKRPLEFSLFPTILLITTLMRLTLNVASTRVVLLHGHLGAGAAGKVIESFGQVVIGGNFVVGFVVFIILMIINFIVVTKGAERISEVSARFTLDAMPGKQMAIDADLNAGLINQAQAQTRRKDVASEADFYGAMDGASKFVRGDAIAGMMILAINLIGGVCIGIFKYNLSADAAFQQYVLMTIGDGLVAQIPSLLLSTAAAIIVTRVSDNGDIAHDVRHQLLASPSVLYTATGIMFVLAMVPGMPHLPFLLFSALLGFTGWRMSKRPQAAEAEEKSLETLTRTITETSEQQVSWETIPLIEPISLSLGYKLVALVDKAQGNPLTQRIRGVRQVISDGNGVLLPEIRIRENFRLKPSQYAIFINGIKADEADIPADKLMALPSSETYGEIDGVLGNDPAYGMPVTWIQPAQKAKALNMGYQVIDSASVIATHVNKIVRSYIPDLFNYDDITQLHNRLASMAPRLAEDLSAALNYSQLLKVYRALLTEGVSLRDIVTIATVLVASSAVTKDHILLAADVRLALRRSITHPFVRKQELTVYTLNNELENLLTNVVNQAQQGGKVMLDSVPVDPNMLNQFQSTMPQVKEQMKAAGKDPVLLVPPQLRPLLARYARLFAPGLHVLSYNEVPDELELKIMGALM >LR134000.1|VDY70326.1|3627555_3628695_+|lateral-flagellar-export/assembly-protein-(FlhB-like) MADSSSEEKTEKPSAQKLRKAREEGQLPRSKDMGLAASLFAAFVVISSSFPWYADFVRESFISVHQYAQEINNPQVIGQFLRHHLLILGKFILTLLPMPAAALLSSLVPGGWLFLPKKILPDFSKISPLKGIGRLFSSEHLAETGKMTVKSVVVLVMLWISLRNNFAAFLGLQALPFKLAMNEGLSLYASVMRNFVILFIFFALLDVPLAKALFTKGLKMTKQELKEEYKNQEGKPEVKARVRRLQRQLAMGQIRKVVPKANVVITNPTHYAVALQYDQSRAAAPFVVAKGTDEIALYIRQVAAENQVEVVEFPRLARSVYYTTQVNQQIPFQLYRAIAHVLTYVLQMKHWREGAQPRPALNRDISIPKEVLKLDGENN >LR134000.1|VDY70324.1|3626783_3627566_+|flagellar-biosynthetic-protein MRTSDVTQLMDLALGLWFPFVRIMAFLRYVPVLDNSALTVRVRIILSLALAIIITPLIPHPIPHDLLSLNSLILTVEQILWGMLFGLMFQFLFLALQLAGQILSFNMGMSMAVMNDPSSGASTTVLAELINVYAILLFFAMDGHLLLVSVLFKGFTYWPIGNALHPQTLRTIALAFSWVLASASLLALPTTFIMLIVQGCFGLLNRIAPPLNLFSLGFPINMLAGLVCFATLLYNLPDHYLHLANFVLQQLDALKGHYGG >LR134000.1|VDY70322.1|3626509_3626782_+|flagellar-biosynthetic-protein-FliQ MLTVDVAADIVASGIKVVILLVSVLVVPSLLVGLLVSVFQAVTQINEQTLSFLPRLIVTLVVLGVCGKWMIIQLHDLCIHLFTQAALLVH >LR134000.1|VDY70320.1|3625754_3626507_+|lateral-flagellar-export/assembly-protein-(FliP-like) 
MKRSTQLTLGLGLLALAPLAIAQGGDIALLNVVTHGNTQEYSVKIQVLILMTLVGLLPTMVLMMTCFTRFIIVLSLLRQALGLQQTPPNRILIGIALSLTMLVMRPIWLNIYDHAVVPFENDRITLTDALSTAATPLKRFMLAQTDKKAMAQIMTIGGAKGNAADQDLSIVVPAYVLSELKTAFQIGFMIYIPFLVIDLIVASVLMAMGMMMLSPLIVSLPFKLMLFVLIDGWSLTIGTLTTSIRGLGLG >LR134000.1|VDY70318.1|3625386_3625758_+|Flagellar-motor-switch-protein-FliN MSKQEDILAGDFGLTDEVAPVAKSADAETLVTRLEDRFSDSMTLLKRIPVTLTLEVSSVEIMLADLLNIDDDTVIELDKLAGEPLDIKVNNILLGKAEVVVVNEKYGLRVLEFNTRDINDLAP >LR134000.1|VDY70316.1|3624542_3625394_+|lateral-flagellar-export/assembly-protein-(FliM-like) MLKYSKTPGIFKLEGNRLGRPYHHLPTLFTGNFDVIDSHLGSYFLKKHRSNITLKKIGCEMDIINKNAELMVSQVGHLAFDIDRSLLLMLLGNFYGLESSLEEAKAHNGLPTKTETRLKNRLALDICTLIFNLQTSGIALKLKLDSSTVITHWAYQLTFTLAGDEESCFRILLDDAHTDFILNLIRHSEHSKPQQVAKSVNKPALIKEIIRSLPLTLNVKIAELSMNVADLTQIKAGDILPISLGETFPVAIGQSELFSALIVEDKDKLFLSELAGKNENSHE >LR134000.1|VDY70314.1|3623169_3624156_-|putative-sigma-54-specific-transcriptional-regulator MSELIATAASSINAFTLAKRVAAFNVPVLIQGETGAGKECVAKYIHTVAFGENDNAPYIGVNCAAIPENMLEATLFGYDKGAFTGAIASVPGKMELANNGTLLLDEIGDMPLALQAKILRVLQEQLVERLGSNRQIKLNFRLIACTNKNLEQEVSAGRFREDLYYRLAVIPITMPPLRERLNDIIPLAESFIKKYSTVLVKNITLSESTRRALLNYRWPGNVRQLENAIQRGMILNRDGVIYLDALGLPENDIADRSELQWPVQPAVHIAETSDLGQHGRSAQYQYIADLMRKYQGNRSKIADLLGITPRALRYRLASMRKQGIEVFS >LR134000.1|VDY70312.1|3622813_3623155_-|lateral-flagellar-basal-body-component-protein-(FliE-like) MSITTINSTMQAQIMQDVQRMQANAQAPVLPAMTFSSTNPDVSFNRIMSGALGHVDQFQQVAEQQQTAIDTGKSDDLAGAMIASQQASLSFSALVQVRNKIATGFNDLMSMSI >LR134000.1|VDY70310.1|3621162_3622809_-|lateral-flagellar-M-ring-protein-(FliF-like) 
MNAQIKKLTQAFPAFRLRLADNKRWALMAGVGLAVAATAIIVSVLWTGNRGYVSLYGRQENLPVSQIVTVLDGEKLSYRIDPQSGQILVPEDELSKTRMTLAAKGVQAILPSGYELMDKDEVLGSSQFMQNVRYKRSLEGELAQSIMSLDAVESARVHLALNEESSFVVSDEPQNSASVVVRLHYGAKLNMDQVNAIVHLVSGSIPGLQASKVSVVDQAGNLLTDGIGAGEAVSAATRKRDQILKDIQDKTRASVANVLDSLVGSRNYRVSVMPDLDLSTIDETQEHYGDAPKINREESVLDSDTNQVAMGVPGSLSNRPPVAANQMTNGTEENRSPEALSKHSESKRDYSYDRSVQHIQHPGFAVKRLNVAVVLNQNAPALKNWKPEQTTQLTALLNNAAGIDAQRGDNLTLSLLNFVPQVVPVEPVIPLWKDDSVLAWVRLIGCGLLALLLLFFVVRPVMKRLTAVRAPVITPEPEAVSEPWIAMPEEERKNVDLPSLPGDDSLPSQSSGLEVKLEFLQKLAMSDTDRVAEVLRQWITSNERIDNK >LR134000.1|VDY70330.1|3630987_3631485_-|RAYT-REP-element-mobilizing-transposase;-TnpA(REP) MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRNAIIKVKRDRPFEINAWVVLPEHMHCIWTLPEGDDDFSSRWREIKKQFTHACGLKNIWQPRFWEHTIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVERGLYPIDWAGDVTDINAGERIIL >LR134000.1|VDY70332.1|3631660_3632434_-|NlpC/P60-family-protein MHDQPLLRMSLPSIPSFVLSGLLLLCLPFSSFASATTSHISFSYAARQRMQNRARLLKQYQTHLKKQASYIVEGNAESRRALRQHNREQIKQHPEWFPAPLKASDRRWQALVENNHFLSSDHLHNITEVAIHRLEQQLGKPYVWGGTRPDQGFDCSGLVFYAYNKILEAKLPRTANEMYHYRRATIVANNDLRRGDLLFFHIHSREIADHMGVYLGDGQFIESPRTGETIRVSRLAEPFWQDHFLGARRILTEETIL >LR134000.1|VDY70334.1|3632619_3632880_+|antitoxin-of-YafQ-DinJ-toxin-antitoxin-system MAANAFVRARIDEDLKNQAADVLAGMGLTISDLVRITLTKVAREKALPFDLREPNQLTIQSIKNSEAGVDVHKAKDADDLFDKLGI >LR134000.1|VDY70336.1|3632882_3633161_+|toxin-of-the-YafQ-DinJ-toxin-antitoxin-system MIQRDIEYSGQFSKDVKLAQKRHKDMNKLKYLMTLLINNTLPLPAVYKDHPLQGSWKGYRDAHVEPDWILIYKLTDKLLRFERTGTHAALFG >LR134000.1|VDY70338.1|3633316_3634057_+|membrane-protein MRKIALILAMLLIPCVSFAGLLGSSSSTTPVSKEYKQQLMGSPVYIQIFKEERTLDLYVKMGEQYQLLDSYKICKYSGGLGPKQRQGDFKSPEGFYSVQRNQLKPDSRYYKAINIGFPNAYDRAHGYEGKYLMIHGDCVSIGCYAMTNQGIDEIFQFVTGALVFGQPSVQVSIYPFRMTDANMKRHKYSNFKDFWEQLKPGYDYFEQTRKPPTVSVVNGRYVVSKPLSHEVVQPQLASNYTLPEAK >LR134000.1|VDY70340.1|3634027_3634795_-|amidotransferase 
MCELLGMSANVPTDICFSFTGLVQRGGGTGPHKDGWGITFYEGKGCRTFKDPQPSFNSPIAKLVQDYPIKSCSVVAHIRQANRGEVALENTHPFTRELWGRNWTYAHNGQLTGYKSLETGNFRPVGETDSEKAFCWLLHKLTQRYPRTPGNMAAVFKYIASLADELRAKGVFNMLLSDGRYVMAYCSTNLHWITRRAPFGVATLLDQDVEIDFSSQTTPNDVVTVIATQPLTGNETWQKIMPGEWRLFCLGERVV >LR134000.1|VDY70342.1|3635000_3635579_-|phosphoheptose-isomerase MYQDLIRNELNEAAETLANFLKDDANIHAIQRAAVLLADSFKAGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPAIAISDVSHISCVGNDFGFNDIFSRYVEAVGREGDVLLGISTSGNSANVIKAIAAAREKGMKVITLTGKDGGKMAGTADIEIRVPHFGYADRIQEIHIKVIHILIQLIEKEMVK >LR134000.1|VDY70344.1|3635818_3638263_+|acyl-CoA-dehydrogenase MMILSILATVVLLGALFYHRVSLFISSLILLAWTAALGVAGLWSAWVLVPLAIILVPFNFAPMRKSMISAPVFRGFRKVMPPMSRTEKEAIDAGTTWWEGDLFQGKPDWKKLHNYPQPRLTAEEQAFLDGPVEEACRMANDFQITHELADLPPELWAYLKEHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAITVGVPNSLGPGELLQHYGTDEQKDHYLPRLARGQEIPCFALTSPEAGSDAGAIPDTGIVCMGEWQGQQVLGMRLTWNKRYITLAPIATVLGLAFKLSDPEKLLGGAEDLGITCALIPTTTPGVEIGRRHFPLNVPFQNGPTRGKDVFVPIDYIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYAHIRRQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAIVKYHCTHRGQQSIIDAMDITGGKGIMLGQSNFLARAYQGAPIAITVEGANILTRSMMIFGQGAIRCHPYVLEEMEAAKNNDVNAFDKLLFKHIGHVGSNKVRSFWLGLTRGLTSSTPTGDATKRYYQHLNRLSANLALLSDVSMAVLGGSLKRRERISARLGDILSQLYLASAVLKRYDDEGRNEADLPLVHWGVQDALYQAEQAMDDLLQNFPNRVVAGLLNVVIFPTGRHYLAPSDKLDHKVAKILQVPNATRSRIGRGQYLTPSEHNPVGLLEEALVDVIAADPIHQRICKELGKNLPFTRLDELAHNALVKGLIDKDEAAILVKAEESRLRSINVDDFDPEELATKPVKLPEKVRKVEAA >LR134000.1|VDY70346.1|3638305_3638752_-|inhibitor-of-vertebrate-lysozyme-precursor MFKAITTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYTPAQTVTLGDETYQVMSACKPHDCGSQRIAVMWSEKSNQMTGLFSTIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENHPDGFNFK >LR134000.1|VDY70348.1|3638932_3639703_+|putative-carbon-nitrogen-hydrolase MPGLKITLLQQPLVWMDGPANLRHFDRQLEGITGRDVIVLPEMFTSGFAMEAAASSLAQDDVVNWMTAKAQQCNALIAGSVALQTESGSVNRFLLVEPGGTVHFYDKRHLFRMADEHLHYKAGNARVIVEWRGWRILPLVCYDLRFPVWSRNLNDYDLALYVANWPAPRSLHWQALLTARAIENQAYVAGCNRVGSDGNGCHYRGDSRVINPQGEIIATADAHQATRIDAELSMMALREYREKFPAWQDADEFRLR |
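The array record for LR134000_10 lists its segments in repeat, spacer, repeat order, and the spacer coordinates reported elsewhere on this page (3630915-3630957 for spacer 10.1) can be recovered from the array start and the segment lengths. The following is a minimal sketch, assuming 1-based inclusive coordinates and space-separated segments that alternate repeat/spacer starting with a repeat; the function name `segment_coordinates` is hypothetical and not part of any tool named in this report.

```python
def segment_coordinates(array_start, segments):
    """Return (kind, start, end, sequence) tuples for an alternating
    repeat/spacer list, using 1-based inclusive genome coordinates."""
    coords = []
    position = array_start
    for index, seq in enumerate(segments):
        # Assumption: even-indexed segments are repeats, odd-indexed are spacers.
        kind = "repeat" if index % 2 == 0 else "spacer"
        end = position + len(seq) - 1
        coords.append((kind, position, end, seq))
        position = end + 1
    return coords


if __name__ == "__main__":
    # Segments copied from the LR134000_10 array record (3630888-3630984).
    record = (
        "GCCAGCGTCGCATCAGGCATCCGCGCA "
        "CAATTGCCGGATGCGGCACAAGTTTGTAGGCACGATAAGACGC "
        "GCCAGCGTCGCATCAGGCATCTGCGCA"
    )
    for kind, start, end, seq in segment_coordinates(3630888, record.split()):
        print(kind, start, end, len(seq))
```

For this record the sketch places the single 43-nt spacer at 3630915-3630957, flanked by two 27-nt repeats.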
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
LR134000_6 | 6.1|2832751|40|LR134000|CRISPRCasFinder | 2832751-2832790 | 40 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 47951-47990 | 0 | 1.0 |
LR134000_3 | 3.6|1023829|32|LR134000|CRISPRCasFinder,CRT | 1023829-1023860 | 32 | LC542972 | Escherichia coli IOMTU792 plasmid pIOMTU792 DNA, complete sequence | 246937-246968 | 1 | 0.969 |
LR134000_2 | 2.3|997458|32|LR134000|CRISPRCasFinder | 997458-997489 | 32 | NZ_KY515226 | Salmonella enterica subsp. enterica serovar Derby strain S701 plasmid AnCo3, complete sequence | 63538-63569 | 2 | 0.938 |
LR134000_2 | 2.3|997458|32|LR134000|CRISPRCasFinder | 997458-997489 | 32 | MT074430 | Salmonella phage pink, complete genome | 39944-39975 | 2 | 0.938 |
LR134000_2 | 2.3|997458|32|LR134000|CRISPRCasFinder | 997458-997489 | 32 | KY709687 | Salmonella phage 29485, complete genome | 17835-17866 | 2 | 0.938 |
LR134000_2 | 2.3|997458|32|LR134000|CRISPRCasFinder | 997458-997489 | 32 | NC_022752 | Salmonella phage SETP13, complete genome | 32901-32932 | 2 | 0.938 |
LR134000_2 | 2.3|997458|32|LR134000|CRISPRCasFinder | 997458-997489 | 32 | KY176369 | Salmonella phage STP03, complete genome | 33437-33468 | 2 | 0.938 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | NZ_CP021720 | Escherichia coli strain AR_0128 plasmid tig00000793, complete sequence | 108175-108206 | 2 | 0.938 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | NZ_CP020051 | Escherichia coli strain AR_0118 plasmid unitig_3, complete sequence | 58913-58944 | 2 | 0.938 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | KP869108 | Escherichia coli O157 typing phage 10, complete genome | 36670-36701 | 2 | 0.938 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | KP869107 | Escherichia coli O157 typing phage 9, partial genome | 36794-36825 | 2 | 0.938 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | NC_019711 | Enterobacteria phage HK629, complete genome | 30061-30092 | 2 | 0.938 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | NC_007804 | Escherichia phage phiV10, complete genome | 36256-36287 | 2 | 0.938 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | DQ126339 | Enterobacteria phage phiV10, complete genome | 36256-36287 | 2 | 0.938 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | MT225100 | Escherichia phage Lys8385Vzw, complete genome | 29555-29586 | 2 | 0.938 |
LR134000_5 | 5.1|2156097|38|LR134000|CRISPRCasFinder | 2156097-2156134 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 2 | 0.947 |
LR134000_2 | 2.3|997458|32|LR134000|CRISPRCasFinder | 997458-997489 | 32 | CP053410 | Salmonella enterica strain 2014K-0203 plasmid unnamed, complete sequence | 46242-46273 | 3 | 0.906 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | NZ_CP042632 | Escherichia coli strain NCYU-25-82 plasmid pNCYU-25-82-5, complete sequence | 38424-38455 | 3 | 0.906 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | NZ_CP021537 | Escherichia coli strain AR_0119 plasmid unitig_3, complete sequence | 25662-25693 | 3 | 0.906 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | NZ_CP047663 | Escherichia coli strain LD93-1 plasmid pLD93-1-90kb, complete sequence | 60243-60274 | 3 | 0.906 |
LR134000_2 | 2.13|997458|34|LR134000|PILER-CR,CRT | 997458-997491 | 34 | NZ_KY515226 | Salmonella enterica subsp. enterica serovar Derby strain S701 plasmid AnCo3, complete sequence | 63538-63571 | 3 | 0.912 |
LR134000_2 | 2.13|997458|34|LR134000|PILER-CR,CRT | 997458-997491 | 34 | MT074430 | Salmonella phage pink, complete genome | 39942-39975 | 3 | 0.912 |
LR134000_2 | 2.13|997458|34|LR134000|PILER-CR,CRT | 997458-997491 | 34 | KY709687 | Salmonella phage 29485, complete genome | 17835-17868 | 3 | 0.912 |
LR134000_2 | 2.13|997458|34|LR134000|PILER-CR,CRT | 997458-997491 | 34 | NC_022752 | Salmonella phage SETP13, complete genome | 32899-32932 | 3 | 0.912 |
LR134000_2 | 2.13|997458|34|LR134000|PILER-CR,CRT | 997458-997491 | 34 | KY176369 | Salmonella phage STP03, complete genome | 33437-33470 | 3 | 0.912 |
LR134000_2 | 2.13|997458|34|LR134000|PILER-CR,CRT | 997458-997491 | 34 | CP053410 | Salmonella enterica strain 2014K-0203 plasmid unnamed, complete sequence | 46242-46275 | 4 | 0.882 |
LR134000_2 | 2.15|997580|34|LR134000|PILER-CR,CRT | 997580-997613 | 34 | KP869108 | Escherichia coli O157 typing phage 10, complete genome | 36670-36703 | 4 | 0.882 |
LR134000_2 | 2.15|997580|34|LR134000|PILER-CR,CRT | 997580-997613 | 34 | KP869107 | Escherichia coli O157 typing phage 9, partial genome | 36794-36827 | 4 | 0.882 |
LR134000_2 | 2.15|997580|34|LR134000|PILER-CR,CRT | 997580-997613 | 34 | NC_019711 | Enterobacteria phage HK629, complete genome | 30061-30094 | 4 | 0.882 |
LR134000_2 | 2.15|997580|34|LR134000|PILER-CR,CRT | 997580-997613 | 34 | MT225100 | Escherichia phage Lys8385Vzw, complete genome | 29555-29588 | 4 | 0.882 |
LR134000_2 | 2.15|997580|34|LR134000|PILER-CR,CRT | 997580-997613 | 34 | NZ_CP021720 | Escherichia coli strain AR_0128 plasmid tig00000793, complete sequence | 108173-108206 | 4 | 0.882 |
LR134000_2 | 2.15|997580|34|LR134000|PILER-CR,CRT | 997580-997613 | 34 | NZ_CP020051 | Escherichia coli strain AR_0118 plasmid unitig_3, complete sequence | 58911-58944 | 4 | 0.882 |
LR134000_2 | 2.15|997580|34|LR134000|PILER-CR,CRT | 997580-997613 | 34 | NC_007804 | Escherichia phage phiV10, complete genome | 36254-36287 | 4 | 0.882 |
LR134000_2 | 2.15|997580|34|LR134000|PILER-CR,CRT | 997580-997613 | 34 | DQ126339 | Enterobacteria phage phiV10, complete genome | 36254-36287 | 4 | 0.882 |
LR134000_2 | 2.7|997702|32|LR134000|CRISPRCasFinder | 997702-997733 | 32 | NZ_CP017563 | Paraburkholderia sprentiae WSM5005 plasmid pl1WSM5005, complete sequence | 632328-632359 | 5 | 0.844 |
LR134000_2 | 2.15|997580|34|LR134000|PILER-CR,CRT | 997580-997613 | 34 | NZ_CP021537 | Escherichia coli strain AR_0119 plasmid unitig_3, complete sequence | 25662-25695 | 5 | 0.853 |
LR134000_2 | 2.15|997580|34|LR134000|PILER-CR,CRT | 997580-997613 | 34 | NZ_CP042632 | Escherichia coli strain NCYU-25-82 plasmid pNCYU-25-82-5, complete sequence | 38422-38455 | 5 | 0.853 |
LR134000_2 | 2.15|997580|34|LR134000|PILER-CR,CRT | 997580-997613 | 34 | NZ_CP047663 | Escherichia coli strain LD93-1 plasmid pLD93-1-90kb, complete sequence | 60241-60274 | 5 | 0.853 |
LR134000_1 | 1.3|995193|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 995193-995224 | 32 | MT375042 | Acinetobacter phage vB_AbaP_Alexa, complete genome | 10578-10609 | 6 | 0.812 |
LR134000_3 | 3.4|1023707|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023707-1023738 | 32 | NZ_CP025114 | Bradyrhizobium sp. SK17 strain CBNU plasmid unnamed, complete sequence | 241497-241528 | 6 | 0.812 |
LR134000_3 | 3.4|1023707|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023707-1023738 | 32 | NZ_CP044544 | Bradyrhizobium betae strain PL7HG1 plasmid pBbPL7HG1, complete sequence | 137039-137070 | 6 | 0.812 |
LR134000_1 | 1.3|995193|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 995193-995224 | 32 | MT708550 | Achromobacter phage Mano, complete genome | 2350-2381 | 7 | 0.781 |
LR134000_2 | 2.10|997885|32|LR134000|CRISPRCasFinder | 997885-997916 | 32 | KR080197 | Mycobacterium phage FlagStaff, complete genome | 15476-15507 | 7 | 0.781 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_CP038925 | Piscirickettsia salmonis strain Psal-011 plasmid unnamed2, complete sequence | 67772-67803 | 7 | 0.781 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_CP038813 | Piscirickettsia salmonis strain Psal-001 plasmid unnamed2, complete sequence | 67771-67802 | 7 | 0.781 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_CP033943 | Piscirickettsia salmonis strain EM-90 plasmid pPSEM90-6, complete sequence | 12990-13021 | 7 | 0.781 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_CP013780 | Piscirickettsia salmonis strain PM51819A plasmid p2PS5, complete sequence | 23367-23398 | 7 | 0.781 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_CP013780 | Piscirickettsia salmonis strain PM51819A plasmid p2PS5, complete sequence | 106639-106670 | 7 | 0.781 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_CP038915 | Piscirickettsia salmonis strain Psal-010a plasmid unnamed2, complete sequence | 67771-67802 | 7 | 0.781 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_CP038934 | Piscirickettsia salmonis strain Psal-025 plasmid unnamed2, complete sequence | 68060-68091 | 7 | 0.781 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_CP038974 | Piscirickettsia salmonis strain Psal-069 plasmid unnamed2, complete sequence | 2466-2497 | 7 | 0.781 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_CP038888 | Piscirickettsia salmonis strain Psal-004 plasmid unnamed2, complete sequence | 67773-67804 | 7 | 0.781 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_CP039216 | Piscirickettsia salmonis strain Psal-104a plasmid unnamed2, complete sequence | 2466-2497 | 7 | 0.781 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_CP038878 | Piscirickettsia salmonis strain Psal-002 plasmid unnamed2, complete sequence | 67772-67803 | 7 | 0.781 |
LR134000_3 | 3.6|1023829|32|LR134000|CRISPRCasFinder,CRT | 1023829-1023860 | 32 | MK455769 | Aeromonas phage MJG, complete genome | 10369-10400 | 7 | 0.781 |
LR134000_1 | 1.6|995376|33|LR134000|PILER-CR,CRISPRCasFinder,CRT | 995376-995408 | 33 | NC_021529 | Vibrio phage nt-1, complete genome | 92163-92195 | 8 | 0.758 |
LR134000_1 | 1.12|995743|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 995743-995774 | 32 | NZ_CP040765 | Paracoccus sp. 2251 plasmid unnamed1, complete sequence | 230307-230338 | 8 | 0.75 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | HQ634174 | Cyanophage MED4-213, complete genome | 93296-93327 | 8 | 0.75 |
LR134000_2 | 2.7|997702|32|LR134000|CRISPRCasFinder | 997702-997733 | 32 | NZ_CP029830 | Azospirillum ramasamyi strain M2T2B2 plasmid unnamed1, complete sequence | 460575-460606 | 8 | 0.75 |
LR134000_2 | 2.9|997824|32|LR134000|CRISPRCasFinder | 997824-997855 | 32 | MK814759 | Gordonia phage Reyja, complete genome | 4467-4498 | 8 | 0.75 |
LR134000_2 | 2.10|997885|32|LR134000|CRISPRCasFinder | 997885-997916 | 32 | NZ_CP014600 | Yangia sp. CCB-MM3 plasmid unnamed4, complete sequence | 15418-15449 | 8 | 0.75 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | MN693954 | Marine virus AFVG_250M569, complete genome | 38214-38245 | 8 | 0.75 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | MT074146 | Bacteroides phage SJC03, complete genome | 43512-43543 | 8 | 0.75 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | MT074139 | Bacteroides phage DAC19, complete genome | 51209-51240 | 8 | 0.75 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | MT074140 | Bacteroides phage DAC20, complete genome | 51209-51240 | 8 | 0.75 |
LR134000_1 | 1.7|995438|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 995438-995469 | 32 | MK693005 | Klebsiella phage KPR2, complete genome | 32997-33028 | 9 | 0.719 |
LR134000_1 | 1.7|995438|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 995438-995469 | 32 | NC_031267 | Gordonia phage Lucky10, complete genome | 11488-11519 | 9 | 0.719 |
LR134000_1 | 1.9|995560|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 995560-995591 | 32 | NC_008760 | Polaromonas naphthalenivorans CJ2 plasmid pPNAP04, complete sequence | 97029-97060 | 9 | 0.719 |
LR134000_1 | 1.12|995743|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 995743-995774 | 32 | NZ_CP046574 | Rhodococcus sp. WAY2 plasmid pRWAY02, complete sequence | 32983-33014 | 9 | 0.719 |
LR134000_1 | 1.12|995743|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 995743-995774 | 32 | NZ_CP009112 | Rhodococcus opacus strain 1CP plasmid pR1CP1, complete sequence | 288224-288255 | 9 | 0.719 |
LR134000_1 | 1.12|995743|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 995743-995774 | 32 | NZ_CP054616 | Azospirillum oryzae strain KACC 14407 plasmid unnamed2, complete sequence | 298192-298223 | 9 | 0.719 |
LR134000_2 | 2.7|997702|32|LR134000|CRISPRCasFinder | 997702-997733 | 32 | NZ_CP006988 | Rhizobium sp. IE4771 plasmid pRetIE4771b, complete sequence | 91348-91379 | 9 | 0.719 |
LR134000_2 | 2.8|997763|32|LR134000|CRISPRCasFinder | 997763-997794 | 32 | NZ_CP030074 | Streptomyces sp. ZFG47 plasmid unnamed1, complete sequence | 224876-224907 | 9 | 0.719 |
LR134000_2 | 2.10|997885|32|LR134000|CRISPRCasFinder | 997885-997916 | 32 | NZ_CP009292 | Novosphingobium pentaromativorans US6-1 plasmid pLA3, complete sequence | 139210-139241 | 9 | 0.719 |
LR134000_2 | 2.17|997702|34|LR134000|PILER-CR,CRT | 997702-997735 | 34 | NZ_CP029830 | Azospirillum ramasamyi strain M2T2B2 plasmid unnamed1, complete sequence | 460575-460608 | 9 | 0.735 |
LR134000_3 | 3.4|1023707|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023707-1023738 | 32 | NZ_CP054028 | Rhizobium sp. JKLM19E plasmid pPR19E01, complete sequence | 1548232-1548263 | 9 | 0.719 |
LR134000_3 | 3.6|1023829|32|LR134000|CRISPRCasFinder,CRT | 1023829-1023860 | 32 | NC_047954 | Pseudomonas phage Njord, complete genome | 11435-11466 | 9 | 0.719 |
LR134000_7 | 7.3|2974651|33|LR134000|CRISPRCasFinder,CRT,PILER-CR | 2974651-2974683 | 33 | MN855903 | Myoviridae sp. isolate 490, partial genome | 4171-4203 | 9 | 0.727 |
LR134000_10 | 10.1|3630915|43|LR134000|CRISPRCasFinder | 3630915-3630957 | 43 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 4037-4079 | 9 | 0.791 |
LR134000_10 | 10.1|3630915|43|LR134000|CRISPRCasFinder | 3630915-3630957 | 43 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 4036-4078 | 9 | 0.791 |
LR134000_10 | 10.1|3630915|43|LR134000|CRISPRCasFinder | 3630915-3630957 | 43 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 4036-4078 | 9 | 0.791 |
LR134000_10 | 10.1|3630915|43|LR134000|CRISPRCasFinder | 3630915-3630957 | 43 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 4036-4078 | 9 | 0.791 |
LR134000_1 | 1.12|995743|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 995743-995774 | 32 | NZ_CP022367 | Azospirillum sp. TSH58 plasmid TSH58_p02, complete sequence | 117189-117220 | 10 | 0.688 |
LR134000_2 | 2.5|997580|32|LR134000|CRISPRCasFinder | 997580-997611 | 32 | AP014501 | Uncultured Mediterranean phage uvMED isolate uvMED-GF-U-MedDCM-OCT-S32-C4, *** SEQUENCING IN PROGRESS ***, 3 ordered pieces | 17722-17753 | 10 | 0.688 |
LR134000_2 | 2.10|997885|32|LR134000|CRISPRCasFinder | 997885-997916 | 32 | NZ_CP012383 | Streptomyces ambofaciens ATCC 23877 plasmid pSAM1, complete sequence | 14130-14161 | 10 | 0.688 |
LR134000_2 | 2.17|997702|34|LR134000|PILER-CR,CRT | 997702-997735 | 34 | NZ_CP006988 | Rhizobium sp. IE4771 plasmid pRetIE4771b, complete sequence | 91346-91379 | 10 | 0.706 |
LR134000_2 | 2.20|997885|34|LR134000|PILER-CR,CRT | 997885-997918 | 34 | NZ_CP014600 | Yangia sp. CCB-MM3 plasmid unnamed4, complete sequence | 15418-15451 | 10 | 0.706 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_LR215032 | Mycoplasma gallopavonis strain NCTC10186 plasmid 2 | 248909-248940 | 10 | 0.688 |
LR134000_3 | 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT | 1023646-1023677 | 32 | NZ_LR214943 | Mycoplasma orale strain NCTC10112 plasmid 4 | 4362-4393 | 10 | 0.688 |
LR134000_3 | 3.6|1023829|32|LR134000|CRISPRCasFinder,CRT | 1023829-1023860 | 32 | NC_047894 | Pseudomonas phage uligo, complete genome | 11353-11384 | 10 | 0.688 |
LR134000_9 | 9.1|3473345|59|LR134000|CRISPRCasFinder | 3473345-3473403 | 59 | MT230312 | Escherichia coli strain DH5alpha plasmid pESBL31, complete sequence | 97-155 | 10 | 0.831 |
LR134000_2 | 2.18|997763|34|LR134000|PILER-CR,CRT | 997763-997796 | 34 | NZ_CP030074 | Streptomyces sp. ZFG47 plasmid unnamed1, complete sequence | 224874-224907 | 11 | 0.676 |
LR134000_2 | 2.20|997885|34|LR134000|PILER-CR,CRT | 997885-997918 | 34 | NZ_CP009292 | Novosphingobium pentaromativorans US6-1 plasmid pLA3, complete sequence | 139208-139241 | 11 | 0.676 |
LR134000_9 | 9.1|3473345|59|LR134000|CRISPRCasFinder | 3473345-3473403 | 59 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 40375-40433 | 11 | 0.814 |
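The Mismatch and Identity columns above appear to be related by identity = matches / spacer length, rounded to three decimals (for example, a 32-nt spacer with 2 mismatches gives 0.938). The match lines in the alignments that follow the table appear to use `*` for identical bases, `.` for transitions (a/g or c/t) and a space for transversions or gaps. The sketch below reproduces both under those assumptions, which are inferred from the rows shown here rather than taken from the database's documentation; the names `match_symbol` and `compare` are hypothetical.

```python
PURINES = {"a", "g"}
PYRIMIDINES = {"c", "t"}


def match_symbol(a, b):
    """Return '*' for identical bases, '.' for a transition, ' ' otherwise."""
    a, b = a.lower(), b.lower()
    if a == b and a != "-":
        return "*"
    if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
        return "."
    return " "  # transversion or gap


def compare(spacer, protospacer):
    """Compare two aligned sequences of equal length ('-' marks a gap).

    Returns (match_line, mismatches, identity). The spacer length is the
    number of non-gap characters in the spacer (an assumption).
    """
    if len(spacer) != len(protospacer):
        raise ValueError("aligned sequences must have equal length")
    line = "".join(match_symbol(a, b) for a, b in zip(spacer, protospacer))
    length = sum(1 for base in spacer if base != "-")
    matches = line.count("*")
    return line, length - matches, round(matches / length, 3)


if __name__ == "__main__":
    # Spacer 3.6 against LC542972, sequences copied from this report.
    line, mismatches, identity = compare(
        "gacgcactggatgcgatgatggatatcacttg",
        "gacgcactggatgcgatgatggacatcacttg",
    )
    print(line)
    print(mismatches, identity)
```

Python's built-in `round` rounds exact ties to the nearest even digit, which is consistent with the table listing 26/32 as 0.812 and 30/32 as 0.938.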
1. spacer 6.1|2832751|40|LR134000|CRISPRCasFinder matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: , mismatch: 0, identity: 1.0
gcgctgcgggtcattcttgaaattacccccgctgtgctgt CRISPR spacer gcgctgcgggtcattcttgaaattacccccgctgtgctgt Protospacer ****************************************
2. spacer 3.6|1023829|32|LR134000|CRISPRCasFinder,CRT matches to LC542972 (Escherichia coli IOMTU792 plasmid pIOMTU792 DNA, complete sequence) position: , mismatch: 1, identity: 0.969
gacgcactggatgcgatgatggatatcacttg CRISPR spacer gacgcactggatgcgatgatggacatcacttg Protospacer ***********************.********
3. spacer 2.3|997458|32|LR134000|CRISPRCasFinder matches to NZ_KY515226 (Salmonella enterica subsp. enterica serovar Derby strain S701 plasmid AnCo3, complete sequence) position: , mismatch: 2, identity: 0.938
aggagtttaatttccagattgagcgctggata CRISPR spacer aagagtttaatttccagatagagcgctggata Protospacer *.***************** ************
4. spacer 2.3|997458|32|LR134000|CRISPRCasFinder matches to MT074430 (Salmonella phage pink, complete genome) position: , mismatch: 2, identity: 0.938
aggagtttaatttccagattgagcgctggata CRISPR spacer aagagtttaatttccagatagagcgctggata Protospacer *.***************** ************
5. spacer 2.3|997458|32|LR134000|CRISPRCasFinder matches to KY709687 (Salmonella phage 29485, complete genome) position: , mismatch: 2, identity: 0.938
aggagtttaatttccagattgagcgctggata CRISPR spacer aagagtttaatttccagatagagcgctggata Protospacer *.***************** ************
6. spacer 2.3|997458|32|LR134000|CRISPRCasFinder matches to NC_022752 (Salmonella phage SETP13, complete genome) position: , mismatch: 2, identity: 0.938
aggagtttaatttccagattgagcgctggata CRISPR spacer aagagtttaatttccagatagagcgctggata Protospacer *.***************** ************
7. spacer 2.3|997458|32|LR134000|CRISPRCasFinder matches to KY176369 (Salmonella phage STP03, complete genome) position: , mismatch: 2, identity: 0.938
aggagtttaatttccagattgagcgctggata CRISPR spacer aagagtttaatttccagatagagcgctggata Protospacer *.***************** ************
8. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to NZ_CP021720 (Escherichia coli strain AR_0128 plasmid tig00000793, complete sequence) position: , mismatch: 2, identity: 0.938
cacggctggccatttgaaatacctgttgctct CRISPR spacer cacggcaggccatttgaaatacccgttgctct Protospacer ****** ****************.********
9. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to NZ_CP020051 (Escherichia coli strain AR_0118 plasmid unitig_3, complete sequence) position: , mismatch: 2, identity: 0.938
cacggctggccatttgaaatacctgttgctct CRISPR spacer cacggcaggccatttgaaatacccgttgctct Protospacer ****** ****************.********
10. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to KP869108 (Escherichia coli O157 typing phage 10, complete genome) position: , mismatch: 2, identity: 0.938
cacggctggccatttgaaatacctgttgctct CRISPR spacer cacggcaggccatttgaaataccagttgctct Protospacer ****** **************** ********
11. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to KP869107 (Escherichia coli O157 typing phage 9, partial genome) position: , mismatch: 2, identity: 0.938
cacggctggccatttgaaatacctgttgctct CRISPR spacer cacggcaggccatttgaaataccagttgctct Protospacer ****** **************** ********
12. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to NC_019711 (Enterobacteria phage HK629, complete genome) position: , mismatch: 2, identity: 0.938
cacggctggccatttgaaatacctgttgctct CRISPR spacer cacggcaggccatttgaaataccagttgctct Protospacer ****** **************** ********
13. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to NC_007804 (Escherichia phage phiV10, complete genome) position: , mismatch: 2, identity: 0.938
cacggctggccatttgaaatacctgttgctct CRISPR spacer cacggcaggccatttgaaataccagttgctct Protospacer ****** **************** ********
14. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to DQ126339 (Enterobacteria phage phiV10, complete genome) position: , mismatch: 2, identity: 0.938
cacggctggccatttgaaatacctgttgctct CRISPR spacer cacggcaggccatttgaaataccagttgctct Protospacer ****** **************** ********
15. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to MT225100 (Escherichia phage Lys8385Vzw, complete genome) position: , mismatch: 2, identity: 0.938
cacggctggccatttgaaatacctgttgctct CRISPR spacer cacggcaggccatttgaaataccagttgctct Protospacer ****** **************** ********
16. spacer 5.1|2156097|38|LR134000|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 2, identity: 0.947
cggacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *.*******.****************************
17. spacer 2.3|997458|32|LR134000|CRISPRCasFinder matches to CP053410 (Salmonella enterica strain 2014K-0203 plasmid unnamed, complete sequence) position: , mismatch: 3, identity: 0.906
aggagtttaatttccagattgagcgctggata CRISPR spacer aagagtttaatttccagacagagcgctggata Protospacer *.****************. ************
18. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to NZ_CP042632 (Escherichia coli strain NCYU-25-82 plasmid pNCYU-25-82-5, complete sequence) position: , mismatch: 3, identity: 0.906
cacggctggccatttgaaatacctgttgctct CRISPR spacer cgcggcaggccatttgaaatacccgttgctct Protospacer *.**** ****************.********
19. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to NZ_CP021537 (Escherichia coli strain AR_0119 plasmid unitig_3, complete sequence) position: , mismatch: 3, identity: 0.906
cacggctggccatttgaaatacctgttgctct CRISPR spacer cgcggcaggccatttgaaatacccgttgctct Protospacer *.**** ****************.********
20. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to NZ_CP047663 (Escherichia coli strain LD93-1 plasmid pLD93-1-90kb, complete sequence) position: , mismatch: 3, identity: 0.906
cacggctggccatttgaaatacctgttgctct CRISPR spacer cgcggcaggccatttgaaatacccgttgctct Protospacer *.**** ****************.********
21. spacer 2.13|997458|34|LR134000|PILER-CR,CRT matches to NZ_KY515226 (Salmonella enterica subsp. enterica serovar Derby strain S701 plasmid AnCo3, complete sequence) position: , mismatch: 3, identity: 0.912
aggagtttaatttccagattgagcgctggataga CRISPR spacer aagagtttaatttccagatagagcgctggataca Protospacer *.***************** ************ *
22. spacer 2.13|997458|34|LR134000|PILER-CR,CRT matches to MT074430 (Salmonella phage pink, complete genome) position: , mismatch: 3, identity: 0.912
aggagtttaatttccagattgagcgctggataga CRISPR spacer aagagtttaatttccagatagagcgctggataca Protospacer *.***************** ************ *
23. spacer 2.13|997458|34|LR134000|PILER-CR,CRT matches to KY709687 (Salmonella phage 29485, complete genome) position: , mismatch: 3, identity: 0.912
aggagtttaatttccagattgagcgctggataga CRISPR spacer aagagtttaatttccagatagagcgctggataca Protospacer *.***************** ************ *
24. spacer 2.13|997458|34|LR134000|PILER-CR,CRT matches to NC_022752 (Salmonella phage SETP13, complete genome) position: , mismatch: 3, identity: 0.912
aggagtttaatttccagattgagcgctggataga CRISPR spacer aagagtttaatttccagatagagcgctggataca Protospacer *.***************** ************ *
25. spacer 2.13|997458|34|LR134000|PILER-CR,CRT matches to KY176369 (Salmonella phage STP03, complete genome) position: , mismatch: 3, identity: 0.912
aggagtttaatttccagattgagcgctggataga CRISPR spacer aagagtttaatttccagatagagcgctggataca Protospacer *.***************** ************ *
26. spacer 2.13|997458|34|LR134000|PILER-CR,CRT matches to CP053410 (Salmonella enterica strain 2014K-0203 plasmid unnamed, complete sequence) position: , mismatch: 4, identity: 0.882
aggagtttaatttccagattgagcgctggataga CRISPR spacer aagagtttaatttccagacagagcgctggataca Protospacer *.****************. ************ *
27. spacer 2.15|997580|34|LR134000|PILER-CR,CRT matches to KP869108 (Escherichia coli O157 typing phage 10, complete genome) position: , mismatch: 4, identity: 0.882
cacggctggccatttgaaatacctgttgctctgt CRISPR spacer cacggcaggccatttgaaataccagttgctcttg Protospacer ****** **************** ********
28. spacer 2.15|997580|34|LR134000|PILER-CR,CRT matches to KP869107 (Escherichia coli O157 typing phage 9, partial genome) position: , mismatch: 4, identity: 0.882
cacggctggccatttgaaatacctgttgctctgt CRISPR spacer cacggcaggccatttgaaataccagttgctcttg Protospacer ****** **************** ********
29. spacer 2.15|997580|34|LR134000|PILER-CR,CRT matches to NC_019711 (Enterobacteria phage HK629, complete genome) position: , mismatch: 4, identity: 0.882
cacggctggccatttgaaatacctgttgctctgt CRISPR spacer cacggcaggccatttgaaataccagttgctcttg Protospacer ****** **************** ********
30. spacer 2.15|997580|34|LR134000|PILER-CR,CRT matches to MT225100 (Escherichia phage Lys8385Vzw, complete genome) position: , mismatch: 4, identity: 0.882
cacggctggccatttgaaatacctgttgctctgt CRISPR spacer cacggcaggccatttgaaataccagttgctcttg Protospacer ****** **************** ********
31. spacer 2.15|997580|34|LR134000|PILER-CR,CRT matches to NZ_CP021720 (Escherichia coli strain AR_0128 plasmid tig00000793, complete sequence) position: , mismatch: 4, identity: 0.882
cacggctggccatttgaaatacctgttgctctgt CRISPR spacer cacggcaggccatttgaaatacccgttgctcttg Protospacer ****** ****************.********
32. spacer 2.15|997580|34|LR134000|PILER-CR,CRT matches to NZ_CP020051 (Escherichia coli strain AR_0118 plasmid unitig_3, complete sequence) position: , mismatch: 4, identity: 0.882
cacggctggccatttgaaatacctgttgctctgt CRISPR spacer cacggcaggccatttgaaatacccgttgctcttg Protospacer ****** ****************.********
33. spacer 2.15|997580|34|LR134000|PILER-CR,CRT matches to NC_007804 (Escherichia phage phiV10, complete genome) position: , mismatch: 4, identity: 0.882
cacggctggccatttgaaatacctgttgctctgt CRISPR spacer cacggcaggccatttgaaataccagttgctcttg Protospacer ****** **************** ********
34. spacer 2.15|997580|34|LR134000|PILER-CR,CRT matches to DQ126339 (Enterobacteria phage phiV10, complete genome) position: , mismatch: 4, identity: 0.882
cacggctggccatttgaaatacctgttgctctgt CRISPR spacer cacggcaggccatttgaaataccagttgctcttg Protospacer ****** **************** ********
35. spacer 2.7|997702|32|LR134000|CRISPRCasFinder matches to NZ_CP017563 (Paraburkholderia sprentiae WSM5005 plasmid pl1WSM5005, complete sequence) position: , mismatch: 5, identity: 0.844
gcgatctcgcggaatacaccgacg-aggcgggc CRISPR spacer acgatctcgcggaatacgtcgacgtaggccgg- Protospacer .****************..***** **** **
36. spacer 2.15|997580|34|LR134000|PILER-CR,CRT matches to NZ_CP021537 (Escherichia coli strain AR_0119 plasmid unitig_3, complete sequence) position: , mismatch: 5, identity: 0.853
cacggctggccatttgaaatacctgttgctctgt CRISPR spacer cgcggcaggccatttgaaatacccgttgctcttg Protospacer *.**** ****************.********
37. spacer 2.15|997580|34|LR134000|PILER-CR,CRT matches to NZ_CP042632 (Escherichia coli strain NCYU-25-82 plasmid pNCYU-25-82-5, complete sequence) position: , mismatch: 5, identity: 0.853
cacggctggccatttgaaatacctgttgctctgt CRISPR spacer cgcggcaggccatttgaaatacccgttgctcttg Protospacer *.**** ****************.********
38. spacer 2.15|997580|34|LR134000|PILER-CR,CRT matches to NZ_CP047663 (Escherichia coli strain LD93-1 plasmid pLD93-1-90kb, complete sequence) position: , mismatch: 5, identity: 0.853
cacggctggccatttgaaatacctgttgctctgt CRISPR spacer cgcggcaggccatttgaaatacccgttgctcttg Protospacer *.**** ****************.********
39. spacer 1.3|995193|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to MT375042 (Acinetobacter phage vB_AbaP_Alexa, complete genome) position: , mismatch: 6, identity: 0.812
cgtagtttcggcagtccagtgcctcgttacgt CRISPR spacer catagacacggcagtccagcgcttcgttacgt Protospacer *.*** . ***********.**.*********
40. spacer 3.4|1023707|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP025114 (Bradyrhizobium sp. SK17 strain CBNU plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.812
atcggacg-atggcgatcgcaatcgcgcgggaa CRISPR spacer -ttagacgcatggcgattgcaatcgcgtgggat Protospacer *..**** ********.*********.****
41. spacer 3.4|1023707|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP044544 (Bradyrhizobium betae strain PL7HG1 plasmid pBbPL7HG1, complete sequence) position: , mismatch: 6, identity: 0.812
atcggacg-atggcgatcgcaatcgcgcgggaa CRISPR spacer -ttagacgcatggcgattgcaatcgcgtgggat Protospacer *..**** ********.*********.****
42. spacer 1.3|995193|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to MT708550 (Achromobacter phage Mano, complete genome) position: , mismatch: 7, identity: 0.781
cgtagtttcggcagtccagtgcctcgttacgt CRISPR spacer cgtacacgcggcagtccagggcctcgttgcgg Protospacer **** . *********** ********.**
43. spacer 2.10|997885|32|LR134000|CRISPRCasFinder matches to KR080197 (Mycobacterium phage FlagStaff, complete genome) position: , mismatch: 7, identity: 0.781
gacgccgccgccgcgaagccgtttccgatgtt CRISPR spacer gacgccggcgccgcgatgccgttcgccgggtt Protospacer ******* ******** ******. * . ***
44. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP038925 (Piscirickettsia salmonis strain Psal-011 plasmid unnamed2, complete sequence) position: , mismatch: 7, identity: 0.781
---cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer aggcgagc---aaatagtaaattaatttataaata Protospacer ** .* *******************.***
45. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP038813 (Piscirickettsia salmonis strain Psal-001 plasmid unnamed2, complete sequence) position: , mismatch: 7, identity: 0.781
---cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer aggcgagc---aaatagtaaattaatttataaata Protospacer ** .* *******************.***
46. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP033943 (Piscirickettsia salmonis strain EM-90 plasmid pPSEM90-6, complete sequence) position: , mismatch: 7, identity: 0.781
---cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer aggcgagc---aaatagtaaattaatttataaata Protospacer ** .* *******************.***
47. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP013780 (Piscirickettsia salmonis strain PM51819A plasmid p2PS5, complete sequence) position: , mismatch: 7, identity: 0.781
---cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer aggcgagc---aaatagtaaattaatttataaata Protospacer ** .* *******************.***
48. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP013780 (Piscirickettsia salmonis strain PM51819A plasmid p2PS5, complete sequence) position: , mismatch: 7, identity: 0.781
---cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer aggcgagc---aaatagtaaattaatttataaata Protospacer ** .* *******************.***
49. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP038915 (Piscirickettsia salmonis strain Psal-010a plasmid unnamed2, complete sequence) position: , mismatch: 7, identity: 0.781
---cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer aggcgagc---aaatagtaaattaatttataaata Protospacer ** .* *******************.***
50. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP038934 (Piscirickettsia salmonis strain Psal-025 plasmid unnamed2, complete sequence) position: , mismatch: 7, identity: 0.781
---cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer aggcgagc---aaatagtaaattaatttataaata Protospacer ** .* *******************.***
51. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP038974 (Piscirickettsia salmonis strain Psal-069 plasmid unnamed2, complete sequence) position: , mismatch: 7, identity: 0.781
---cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer aggcgagc---aaatagtaaattaatttataaata Protospacer ** .* *******************.***
52. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP038888 (Piscirickettsia salmonis strain Psal-004 plasmid unnamed2, complete sequence) position: , mismatch: 7, identity: 0.781
---cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer aggcgagc---aaatagtaaattaatttataaata Protospacer ** .* *******************.***
53. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP039216 (Piscirickettsia salmonis strain Psal-104a plasmid unnamed2, complete sequence) position: , mismatch: 7, identity: 0.781
---cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer aggcgagc---aaatagtaaattaatttataaata Protospacer ** .* *******************.***
54. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP038878 (Piscirickettsia salmonis strain Psal-002 plasmid unnamed2, complete sequence) position: , mismatch: 7, identity: 0.781
---cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer aggcgagc---aaatagtaaattaatttataaata Protospacer ** .* *******************.***
55. spacer 3.6|1023829|32|LR134000|CRISPRCasFinder,CRT matches to MK455769 (Aeromonas phage MJG, complete genome) position: , mismatch: 7, identity: 0.781
gacgcactggatgcgatgatggatatcacttg- CRISPR spacer gacgcactgtatgcaatgatgga-atcggaggc Protospacer ********* ****.******** ***. *
56. spacer 1.6|995376|33|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NC_021529 (Vibrio phage nt-1, complete genome) position: , mismatch: 8, identity: 0.758
ttgacgttgattttgttcgttatgttgccagcc CRISPR spacer gagacgttgattttgctcattatgttgactttc Protospacer *************.**.******** * .*
57. spacer 1.12|995743|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP040765 (Paracoccus sp. 2251 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
attattaattctggtggcgctggtcgccctgg CRISPR spacer gttgtacgatctggtggcgctggtggtcctgg Protospacer .**.* . *************** *.*****
58. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to HQ634174 (Cyanophage MED4-213, complete genome) position: , mismatch: 8, identity: 0.75
cacggc-tggccatttgaaatacctgttgctct CRISPR spacer -atagcacctccaattgaaatacctgttgcact Protospacer *..** . *** **************** **
59. spacer 2.7|997702|32|LR134000|CRISPRCasFinder matches to NZ_CP029830 (Azospirillum ramasamyi strain M2T2B2 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
gcgatctcgcggaatacaccgacgaggcgggc CRISPR spacer gcctgctggcggaatacgccgacgaggcgacg Protospacer ** ** *********.***********.
60. spacer 2.9|997824|32|LR134000|CRISPRCasFinder matches to MK814759 (Gordonia phage Reyja, complete genome) position: , mismatch: 8, identity: 0.75
gagcctgacgagactactgaggccgttctgtc- CRISPR spacer aagcctgacgaggctactggggcca-gcggtgg Protospacer .***********.******.****. * **
61. spacer 2.10|997885|32|LR134000|CRISPRCasFinder matches to NZ_CP014600 (Yangia sp. CCB-MM3 plasmid unnamed4, complete sequence) position: , mismatch: 8, identity: 0.75
gacgccgccgccgcgaagccgtttccgatgtt CRISPR spacer gtcgccgccgccgcgcagccctttccacagcc Protospacer * ************* **** *****. *..
62. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to MN693954 (Marine virus AFVG_250M569, complete genome) position: , mismatch: 8, identity: 0.75
cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer atcaagaaaaatagtaaattcaattatgaatc Protospacer ** ************* * ********.
63. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to MT074146 (Bacteroides phage SJC03, complete genome) position: , mismatch: 8, identity: 0.75
cgcact-caaaatagtaaattaatttatgaatt CRISPR spacer -gaattaaaaaatagtaaattgatttataaaac Protospacer * *.* *************.******.** .
64. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to MT074139 (Bacteroides phage DAC19, complete genome) position: , mismatch: 8, identity: 0.75
cgcact-caaaatagtaaattaatttatgaatt CRISPR spacer -gaattaaaaaatagtaaattgatttataaaac Protospacer * *.* *************.******.** .
65. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to MT074140 (Bacteroides phage DAC20, complete genome) position: , mismatch: 8, identity: 0.75
cgcact-caaaatagtaaattaatttatgaatt CRISPR spacer -gaattaaaaaatagtaaattgatttataaaac Protospacer * *.* *************.******.** .
66. spacer 1.7|995438|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to MK693005 (Klebsiella phage KPR2, complete genome) position: , mismatch: 9, identity: 0.719
ctctgat-----tcatcggcggcgatactgtcatcac CRISPR spacer -----acagtcatcatcggcagcggtactgtcatcaa Protospacer *. ********.***.***********
67. spacer 1.7|995438|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NC_031267 (Gordonia phage Lucky10, complete genome) position: , mismatch: 9, identity: 0.719
ctctgattcatcggcggcgatactgtcatcac CRISPR spacer gtgccgttcatcggctgcgataccgtcatacc Protospacer * . .********* *******.***** *
68. spacer 1.9|995560|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NC_008760 (Polaromonas naphthalenivorans CJ2 plasmid pPNAP04, complete sequence) position: , mismatch: 9, identity: 0.719
cggcttattgctcttgccgacggattacagtg---- CRISPR spacer tcacttattgcccttgcagacggatt----tgtccc Protospacer . .********.***** ******** **
69. spacer 1.12|995743|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP046574 (Rhodococcus sp. WAY2 plasmid pRWAY02, complete sequence) position: , mismatch: 9, identity: 0.719
attattaattctggtggcgctggtcgccctgg CRISPR spacer ggtcggctttctggtcgcgctggtcgcgctgg Protospacer . * ******* *********** ****
70. spacer 1.12|995743|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP009112 (Rhodococcus opacus strain 1CP plasmid pR1CP1, complete sequence) position: , mismatch: 9, identity: 0.719
attattaattctggtggcgctggtcgccctgg CRISPR spacer cgtccccactctggtcgccctggtcgccctgg Protospacer * .. *.****** ** *************
71. spacer 1.12|995743|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP054616 (Azospirillum oryzae strain KACC 14407 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
attattaattctggtggcgctggtcgccctgg CRISPR spacer gattctgggtctggtggcgctggtggcgctgg Protospacer . * .*.. *************** ** ****
72. spacer 2.7|997702|32|LR134000|CRISPRCasFinder matches to NZ_CP006988 (Rhizobium sp. IE4771 plasmid pRetIE4771b, complete sequence) position: , mismatch: 9, identity: 0.719
gcgatctcgcggaatacaccgacgaggcgggc CRISPR spacer cgcacgtcgcggattacatcgacgaggcggta Protospacer *. ******* ****.***********
73. spacer 2.8|997763|32|LR134000|CRISPRCasFinder matches to NZ_CP030074 (Streptomyces sp. ZFG47 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
taaggccgtcgccggatcagcctggctatgcc CRISPR spacer cggggccgtcgccggatcagccggtctgctgc Protospacer ...******************* * **.. *
74. spacer 2.10|997885|32|LR134000|CRISPRCasFinder matches to NZ_CP009292 (Novosphingobium pentaromativorans US6-1 plasmid pLA3, complete sequence) position: , mismatch: 9, identity: 0.719
gacgccgccgccgcgaagccgtttccgatgtt CRISPR spacer gaagccgccgccgcgaacccgtttttcccggg Protospacer ** ************** ******.. .*
75. spacer 2.17|997702|34|LR134000|PILER-CR,CRT matches to NZ_CP029830 (Azospirillum ramasamyi strain M2T2B2 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.735
gcgatctcgcggaatacaccgacgaggcgggcgt CRISPR spacer gcctgctggcggaatacgccgacgaggcgacgct Protospacer ** ** *********.***********. *
76. spacer 3.4|1023707|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP054028 (Rhizobium sp. JKLM19E plasmid pPR19E01, complete sequence) position: , mismatch: 9, identity: 0.719
atcggacgatggcgatcgcaatcgcgcgggaa CRISPR spacer aaaagacgatggcgatcgcaatcgtgcctttc Protospacer * .********************.**
77. spacer 3.6|1023829|32|LR134000|CRISPRCasFinder,CRT matches to NC_047954 (Pseudomonas phage Njord, complete genome) position: , mismatch: 9, identity: 0.719
gacgcactggatgcgatgatggatatcacttg CRISPR spacer gacgcactcgatgcgatgatggaggccgaggc Protospacer ******** ************** ..*.
78. spacer 7.3|2974651|33|LR134000|CRISPRCasFinder,CRT,PILER-CR matches to MN855903 (Myoviridae sp. isolate 490, partial genome) position: , mismatch: 9, identity: 0.727
gttcgctgcaaccgctagccaagacggtaggtt CRISPR spacer gttcgctgcaaccgttcgccaagaccacaaagg Protospacer **************.* ******** ..*..
79. spacer 10.1|3630915|43|LR134000|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 9, identity: 0.791
caattgccggatgcggcacaagtttgtaggcacgataagacgc CRISPR spacer cggcctaccgatccggcacaagtttgtaggcatgataagacgc Protospacer *.... * *** *******************.**********
80. spacer 10.1|3630915|43|LR134000|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 9, identity: 0.791
caattgccggatgcggcacaagtttgtaggcacgataagacgc CRISPR spacer cggcctaccgatccggcacaagtttgtaggcatgataagacgc Protospacer *.... * *** *******************.**********
81. spacer 10.1|3630915|43|LR134000|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 9, identity: 0.791
caattgccggatgcggcacaagtttgtaggcacgataagacgc CRISPR spacer cggcctaccgatccggcacaagtttgtaggcatgataagacgc Protospacer *.... * *** *******************.**********
82. spacer 10.1|3630915|43|LR134000|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 9, identity: 0.791
caattgccggatgcggcacaagtttgtaggcacgataagacgc CRISPR spacer cggcctaccgatccggcacaagtttgtaggcatgataagacgc Protospacer *.... * *** *******************.**********
83. spacer 1.12|995743|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP022367 (Azospirillum sp. TSH58 plasmid TSH58_p02, complete sequence) position: , mismatch: 10, identity: 0.688
attattaattctggtggcgctggtcgccctgg CRISPR spacer gctgacccatctgggggcgctggtcgcgctgg Protospacer ..*. . ***** ************ ****
84. spacer 2.5|997580|32|LR134000|CRISPRCasFinder matches to AP014501 (Uncultured Mediterranean phage uvMED isolate uvMED-GF-U-MedDCM-OCT-S32-C4, *** SEQUENCING IN PROGRESS ***, 3 ordered pieces) position: , mismatch: 10, identity: 0.688
cacggctggccatttgaaatacctgttgctct CRISPR spacer aattcaataccatatgcaatacctgttgctct Protospacer *. .**** ** ***************
85. spacer 2.10|997885|32|LR134000|CRISPRCasFinder matches to NZ_CP012383 (Streptomyces ambofaciens ATCC 23877 plasmid pSAM1, complete sequence) position: , mismatch: 10, identity: 0.688
gacgccgccgccgcgaagccgtttccgatgtt CRISPR spacer ttgcccgccgcggcgaagccgcttccgtcctc Protospacer ******* *********.***** . *.
86. spacer 2.17|997702|34|LR134000|PILER-CR,CRT matches to NZ_CP006988 (Rhizobium sp. IE4771 plasmid pRetIE4771b, complete sequence) position: , mismatch: 10, identity: 0.706
gcgatctcgcggaatacaccgacgaggcgggcgt CRISPR spacer cgcacgtcgcggattacatcgacgaggcggtact Protospacer *. ******* ****.*********** *
87. spacer 2.20|997885|34|LR134000|PILER-CR,CRT matches to NZ_CP014600 (Yangia sp. CCB-MM3 plasmid unnamed4, complete sequence) position: , mismatch: 10, identity: 0.706
gacgccgccgccgcgaagccgtttccgatgttga CRISPR spacer gtcgccgccgccgcgcagccctttccacagcccc Protospacer * ************* **** *****. *..
88. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LR215032 (Mycoplasma gallopavonis strain NCTC10186 plasmid 2) position: , mismatch: 10, identity: 0.688
cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer gaatctcaaaatattaaattaattgatttagg Protospacer . ********* ********** ** *
89. spacer 3.3|1023646|32|LR134000|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LR214943 (Mycoplasma orale strain NCTC10112 plasmid 4) position: , mismatch: 10, identity: 0.688
cgcactcaaaatagtaaattaatttatgaatt CRISPR spacer aaatatgaaaacagtacattaatttatgaaaa Protospacer . * ****.**** *************
90. spacer 3.6|1023829|32|LR134000|CRISPRCasFinder,CRT matches to NC_047894 (Pseudomonas phage uligo, complete genome) position: , mismatch: 10, identity: 0.688
gacgcactggatgcgatgatggatatcacttg CRISPR spacer gacgcactcgatgcgatgatggaggcagaagc Protospacer ******** ************** .. .
91. spacer 9.1|3473345|59|LR134000|CRISPRCasFinder matches to MT230312 (Escherichia coli strain DH5alpha plasmid pESBL31, complete sequence) position: , mismatch: 10, identity: 0.831
-cggagcacttattgccggatgcggcgtgaacgccttatccggcctacggttctggcacc CRISPR spacer tcagtgcac-gatcgccggatgcggcgtgaacgccttatccgtcctacggttctgtgctc Protospacer *.* **** **.**************************** ************ .*
92. spacer 2.18|997763|34|LR134000|PILER-CR,CRT matches to NZ_CP030074 (Streptomyces sp. ZFG47 plasmid unnamed1, complete sequence) position: , mismatch: 11, identity: 0.676
taaggccgtcgccggatcagcctggctatgccga CRISPR spacer cggggccgtcgccggatcagccggtctgctgctc Protospacer ...******************* * **.. *
93. spacer 2.20|997885|34|LR134000|PILER-CR,CRT matches to NZ_CP009292 (Novosphingobium pentaromativorans US6-1 plasmid pLA3, complete sequence) position: , mismatch: 11, identity: 0.676
gacgccgccgccgcgaagccgtttccgatgttga CRISPR spacer gaagccgccgccgcgaacccgtttttcccgggcc Protospacer ** ************** ******.. .*
94. spacer 9.1|3473345|59|LR134000|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 11, identity: 0.814
cggagcacttattgccggatgcggcgtgaacgccttatccggcctacggttctggcacc- CRISPR spacer ggtacggctttttgccggatgcggcgtaaacgccttatccggcctacggtt-tggtgcga Protospacer * * .*** ****************.*********************** ***..*
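
The identity values reported in the hits above are consistent with (spacer length - mismatch count) / spacer length, rounded to three decimals. For example, a 34-nt spacer with 4 mismatches gives 30/34 = 0.882. The following Python sketch parses a spacer header and recomputes that value. It assumes the pipe-separated header fields are spacer ID, start coordinate, spacer length, contig accession and detecting tools, which is an interpretation of the listing rather than a documented schema. The names `SpacerHit`, `parse_hit` and `identity` are hypothetical helpers introduced here for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SpacerHit:
    spacer_id: str      # e.g. "2.15" (assumed: array number . spacer index)
    start: int          # assumed: spacer start coordinate on the contig
    length: int         # spacer length in nucleotides
    contig: str         # contig accession, e.g. "LR134000"
    tools: List[str]    # tools that reported the spacer
    mismatches: int     # mismatch count reported for the hit


def parse_hit(header: str, mismatches: int) -> SpacerHit:
    """Split a header such as '2.15|997580|34|LR134000|PILER-CR,CRT'."""
    spacer_id, start, length, contig, tools = header.split("|")
    return SpacerHit(spacer_id, int(start), int(length), contig,
                     tools.split(","), mismatches)


def identity(hit: SpacerHit) -> float:
    """Assumed formula: fraction of spacer positions that match."""
    return round((hit.length - hit.mismatches) / hit.length, 3)


if __name__ == "__main__":
    hit = parse_hit("2.15|997580|34|LR134000|PILER-CR,CRT", 4)
    print(hit.spacer_id, hit.length, identity(hit))
```

Recomputing identity this way can serve as a quick consistency check when the listing is filtered, for instance to keep only hits above a chosen identity threshold.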
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
45493 : 52666
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >LR134000|45493:52666|DBSCAN-SWA GTCAGGCTGTTGCAGGGTCGTCACATTTTGGCAGCCAGTCGCCGTAGCTTTCCTCTTTCAGCGTCAGGTTGGTCTGTATCCCCTGTTTGGTATGGCGCTTCTCGTAATTCAGTCCGTATTCCTTCAGCATCACCGGCAGCCCCAGCCCGAACATTTTCAGACTGAGTACATTCCGGTAGCCGTTTGCCTCCATGTAGGCCAGATAGGCGTGATAGAGGTATTTACGGTAATTACGCGGGATGATACTGGCGTTCCCCATATACATGCCGCTGGTCTGCGGCAGGGTTTCCAGATAGCCGATAAAATCAAACGTCGGGTCGGCATCCCGTTTGATGTTCAGCGCCTCGTCTGAGTTTTGCTGGGACTGAAGCAGTGACCGGGCGAGCATCGGGTCGCTGAATTTCTGCATCAGGTGACGCACGATGACCGCCAGCTCGCGGGTGATTTTGTCCTTAAGCTGCGGGTCGCGCTCCTGCGGGGCTATCTGTTCCGGGAAGTGAATAATCACCCGCCGGCGTGACACGCCGCCGCTGCGGTCGGTGAAGCGCATCGGGTTATTGTTCACGGCCAGAATTACCGCCGGGATGTGCGTGGAGTACGCATCCCGGTATTTCGGGTCAACGGACACCGCATCGCCGCCGGTGATGGCCTTGAGTCCGGCTCCGTCGCCGCTCCATTTTTCCTGGTCCGGCAGGCGTATCAGTGAGAAGCCAGTTAACGCGGCACGTTCACGCGGGGATTCCAGCGTCTCGATGGTGGCCGACGTGGCGTTATCCTCCCCGGCCAGCAGGGTGGCTATTTCGGCCATGATACTTTTGCCGCTGCCGCCGGGACCGGTCACCTCCAGAAAGAGCTGCCAGTCGTAGCGGTTTGCCAGCACCATAAACAGTGCGGCCAGAATCACGTCGCGTTTTTCCGCACGGCCACCGGCGGCACGGTCAAGCCAGCGCCAGAACGCGGGGGCGTGGGTTTCCAGCGTTTCCCCTTCCACTGGCGGGGTGAAATCCACATCACACAGGGTGCGCATCCAGTGTGACGGACTGTGCGGGTGGAACGTGCCGTTCTGCGTGTCGAGCACGCCGTTACGAAAGCCAATCAGGCAGCGGGAGGGGGCTTCCTGCTGCGGAATAATCAGCTTCAGGGTGTCCACCACGGAGGCCACCTTCCCGGAGGAGAACGGCGCACGCAGCCGCTGAAACAGCCCGGCCACATCCCGGGCAAAGTCCTGTGGTGGCAGCACCTTCCAGACACCATTTTCATAACGGGACAGAAGCTGGCCGTTGGCATCGACCGCGAGCGCCTCGCCGTAATGCTCATAGATACGCATGGCCTTTTCGCTGGTACTCATGGCGGAAAACTCCGCTTCGCTCATGGTGTCGAACGGGCTTTCAGCCGGTGGCCGGATGGCATCGTAAATGGCCTTACGGGTGGCCTCCCCGCCGTACTGCGTGAAGGCATCATTCCAGTCACCGAAGACCGGCGGCAGGGCAACAACGCCCTCACACGCATCTGCGGCTGCGGCGGCTTTTTTCTGGCCGTCACCGCTGAGGTCACGGTCTGCGGCAAGGACAATCTGACAGGCCGGATGCTTCTGCCGGGCAAGGCTGGCCAGAGAAAGGAGGTTCACTGAAGAAAGCGCCACCATCACCGTTTCACCGGTCAGGTGATGTACGGTAAGTGCGGTCGCGTATCTCCGGCCATTTCTTTCAGTACAGCATGCCGGTTTACGGGGCTGCGTTTGAACAGGTCAGGACGGTCACAGGTAAATTCCCGCAGAAAACGCCCCAGCGGGATGTCTGTGGTGCGCCCGTCAGCGAGGATACGCACAAGGATACTGAATTTACGGCGGTACGGATTCGGGTAGCGCTTTATTTTGTGAATCTTTTTGGCAGACGCAACAGGGGGGATTTGTTCCGGCCGCCTTACAATG
ACTGTGTGTTTTTTGTTCATTCCCACTTAAAGTCATTTAAAGCCACTTAAAGCAATTCGTAATTTTTATAGTGAAATACAAATCGTTTCTTCTTATTCATTCCCGGCGAATTAATAAAAACAAACAGTAGTAAACAGCACAAAAAGTCCATGAGCGGGTGAACAGTGGTGAACAGACGGTGAACAGTCATTACTGCGATTGTTCACCATTTAACTTACTGTATTAATTATCTTTTTTCTTATGGTGAACAGAGGTGAACAGTAAAATATAAAAAAACAAACAGTAAGCCGGTTTTTCCTGCGACCTTTTCCTGGCTTGCCGGTCTGAGGATGAGTCTCCTGTGTCAGGGCTGGCACATCTGCAATGCGTCGTGTTGTTGTCCGGTGTACGTCACAATTTTCTTAACCTGAAGTGACGAGGAGCCGGAAAATGTCTGACAACACCATCCCTGAATATCTGCAACCCGCGCTGGCACAACTGGAAAAGGCCAGAGCCGCTCATCTTGAGAACGCCCGCCTGATGGATGAGACCATCACGGCCATTGAACGGGCAGAGCAGGAAAAAAATGCGCTGGCGCAGGCCGACGGAAACGACGCTGACGACTGGCGCACGGCCTTTCGTGCAGCCGGTGGTGTCCTGAGCGACGAGCTGAAACAGCGCCACATTGAGCGCGTGGCACGCCGGGAGCTGGTACAGGAATATGACAATCTGGCCGTGGTGCTGAATTTCGAACGTGAACGCCTGAAAGGGGCGTGTGACAGCACGGCCACCGCCTACCGGAAGGCACATCATCACCTTCTGAGTCTGTATGCAGAGCATGAGCTGGAACACGCCCTGAATGAAACCTGTGAGGCGCTTGTCCGGGCAATGCATCTGAGTATCCTGGTACAGGAAAATCCGCTCGCCAACACCACCGGCCATCAGGGCTACGTCGCACCCGATAAAGCTGTCATGCAGCAGGTGAAATCATCGCTGGAACAGAAAATTAAACAGATGCAAATCAGCCTCACCGGCGAGCCGGTTCTCCGGCTGACCGGACTGTCAGCGGCAACACTCCCGCACATGGATTATGAGGTGGCAGGCACACCGGCACAGCGCAAGGTGTGGCAGGACAAAATAGACCAGCAGGGAGCAGAGCTTAAGGCCAGAGGACTGCTGTCATGATGCGCTGCCCTTTCTGTCGTCATTCAGCGCATACCCGCACCAGCCGGTATGTGAGTGACAATGTCAAAGAAAGTTATCTGCAGTGCCAGAATATTTACTGTTCGGCGACATTTAAAACCCATGAGTCAATTTGTGCCGTGATTCGTTCTCCGGTCACGGAGGAAAAACCAGCACCGGCAAGCACAGCACCGGCTGTTGTCCGAAAAGTTAAAGGCTGTTACAGCTCACCATTCAACCATTAATCAGGAGACGGGGTGTGACCACAATGTCATTACAGCAGGCTTTTGAGGTCTGCCAGAATAACAAAGCGGCATGGCTGCAACGCAAAAATGAGCTGGCTGCAGCCGAACAGGAATATCTGCGGCTTCTGTCCGGGGAAGGCAGAAACGTCAGCCGCCTGGACGAATTACGCAATATTATCGAAGTCAGAAAATGGCAGGTGAATCAGGCCGCCGGTCGTTATATTCGTTCGCATGAAGCCGTTCAGCACATCAGCATCCGCGAGCGGCTGAATGATTTTATGCAGCAGCACGGCACAGCACTGGCGGCCGCACTGGCACCGGAGCTGATGGGCTACAGTGAGCTGACGGCCATTGCCCGAAACTGTGCCATACAGCGTGCCACAGATGCCCTGCGTGAAGCCCTTCTGTCCTGGCTTGCGAAGGGGGAAAAAATTAATTATTCCGCACAGGATAGCGACATTTTAACGACCATCGGATTCAGGCCTGACGAGGCTTCGGTGGATGACAGCCGTGAAAAATTCACCCCTGCGCAGAACATGATTTTTTCGCGTAAAAGTGCAGAACTGGCATCACGTCAGTCTGTGTAAAACTCCC
CGAAAATCCGCCTGTTTTTACTGAAAAAAGCCATGCATCGATAAGGTGCATGGCTTTGCATGCGTTTTCTTGCCTCATTTCCTGCAGATCACGCCGCGCCCGACGCGGCGTGAGCGTGTCAGTGCAACTGCATTAAAACCGCCCTGCAAAGCGGGCGGGCGAGGCGGGGAAAGCACTGCGCGCAAAAGGGGGGGCTATTAAATTTTTCTCTGGCTTGATGAATGATAACCAAACCAGAGAGGGCATGAGCTTTACGGAGTTTTATCAAGTAATTATTGCGATGCGTCTTCTTTTAAAGCGATGCTTTGTTCTAAAGCTTCTTTTATGCGTTTAACGCATTCTGTTAGTTCAATTATGGCTCCGCGTCGAAAAGAACTGTTAATTTTTTTATGATGAGCGTTTATGGCTGAATGTAGCTTCTTAAGGTATGGTAGTTCAGCAGAAAGGTAGTCCGCAAGATTTGAGAAGTTATAGAGTGATGATAGATGGTTTGAAAACAACTGAATATCGCGAATGGACCAAGTGTTTTTTATTCCATGTATGATTAGGTTTACATCTAATGTTTCAAGTGGATTAAATTTGAATTTTGATTGATAAATATCCATATCAATAGCACTCCAACCGTTAACATTCATTTTTTCTTTCAATTCTTTTATTTTCTCCTCCTTTATTATATCGCAATATTTATCAAGGAAGTAATTATGAAGTTCATTTCCTTCTCTCCTTAATATATGTGGTGGAATATTATCAATATTGGGATTTGATTCTAGGAAATATTCTTTTATTTCTTTATTACTAAAGCTACGTTTGTTTTTTTCAATGGTTTTGTTGGTTAACCCTCTTATGCGAGGGATGTATTTTGATTTTGAGAGTCTAATATAGTTGTTTGTAGCGATTAGCCAAGTAGATAGTGTCGGTGATGAAGTTTTTAGTATTACTTTTTTAATTTCATCCAAATAATCATTTTCAGGTATATTATACCATTCCGGTTGTTTTAATGTGGCATAATCACTTTTTTTTAAAGGTTCTTGTGAATTGAATATGTATGGAATTATATCCTTTTGGGATATCTCATTAAAACAATATGGAACCATTAAGTTGTTATAAGTCATATGAGCAGTGATATTTAAAAGGTTACTCTCTTCTTCGCTTATTTTATCTGGATCATTCTCAGATGAAGTCGTAAGTGTTTTGAAAGAAGATGTGATATAATTATGATGAAAATCATTCTCTTGATATGAAAATTTCTCTTTTAGTATTATATGAGCACAAAGTGAAGTGACTAGGTTTTTGATGTCTATATCAGAGATTTTTTGCTCAGGTTCTTGTTTTTCAAAAAGAGGTGTCAATTTTGAAATTACTCTATTAATAATTCGTAGGTTTGTTTCTTCGAACCCAATAATAACCTGAGTGATTAAATATTTATGTCTCTCTTCTAATGGAGCCAGTTTTTGCTCTAATATATCGGTAAGGTTATTAATAGAGAAATATATTTCGTCGCTTACAACTTTTTCTTTATGACTTAATACCTCACTACTCTGCTTTGAAAAGTTACCCACTAAAATAAAGTCTAACCGATTATCATTTTGATAACTTTGTAGGCAAAAGGTTGCTATTTCATCTCTCAAAGATTGAGGGATTCTTTCTAGATCATCAATGACAAAAACCCCTGAAAGATCCTTAAGTACATAATCTCTCATTGCACCTGAAATGGTAGATATAATTTGTTCTGTCAACTTTCCAGTGCTTTCGTCTTGGGTTAATGCTGATGCAGCGCTTGAGGTTAAATCCCCAAGTTTTTTTATCTCTGAAGGGGTGTTTAGATACGTTATGCTTAGCATTCTATCTTTAAAATCTTGTAAACTGTTTAACCCAAGAACAGACAAATAAAAATGACTCGTATCTGAATAAAGTGTTCTAAATTCAGTTCGAAGGAAATATGTCTTACCTACACCCCATTCCCCATTTATTAATATCAGACCATCACGTTTCTCTTTCAA
TATACGAATCAATATAGATACTATAGTACCTTTGGTCATAAAAAGCTCCGTTCAAAATTCCATTATGATTATTTTATTTTTGGATAAAGTAAGTCACTATACCACTGCATCATATGTAATCTTTTCTCTAAATATTGAGCATGATTATAAGTACCTCTTATACTGTTTTTATCAACATGTGCTAATTGCATCTCAATCCATGCACTATCAAATCCATGTTCATGTAAGATTGTCGATAATGAATGTCTAAAACCGTGACCTGTAGCACGTCCTTTGTAACCAAGTAACTCAATTACTTGAGATACGCTTTCTTTTGAAATTGGTTTGCTACGATTGTTCCTTCCAATAAATATGTAGGGGTAATGGCCGGTAATAGGCTTGAGTTGTTCGAAAAGGGCAATTACCTGAGTAGATAAAGGAACAATGTGTGGCCTACGCATTTTCATCCTTTCTGCAGGTATTTCCCATATGCCTTTCTCAAGATCAACCTCATTCCAAGTAGCCAAACGCATTTCCTGCGTTCTAACGCCGGTCAGCATAACTATCTTAGTAGCATTTTTAGTAATGATACTTCCGGTATACGCTTCCAAATCCTGAATGAAATGAGGTAGCTCTTCAGCGGATAAAAAAGGATGATGTTTTTGCTTAGGAACAGCCAGAGCGATGGCTAAATCAGGCGCAGGATTGTATTCAGCACGGCCAGTTATGATCGCATAGCGGTAGACTTCACCACATCTTTGCCGCACTTTTCTGGTCTTCTCTAATGCCCCACGCTTTTCTATTCTTCGCAATACTTCGAGCAGTTCTAATGGTTTAATTTCACTGATAGGGCGTTTACCAATGAAAGGGAATACATCTTGCTCAAAAGTCTTCATAATTTCTTCTCGATAGGCCACCGTCCAGCGGTCAGCTTTGTTGGTATGCCATTCTCGACATATTGCTTCGAAAGAATTTTCTGTTGATAGCTTTTGAGCGAGTCTTAAAGCTTTGCGTTCCTCTACTGGGTCAATGCCATTAGCAACCTGCTTACGGGCGATGTCACGCTTCTCACGTGCTTCTGCGAGGCTTACTAAATCATAGCTGCCAAATGACATTAACCGCGCTTTCCCAGCAAATCTGAAACGGAAACGCCAGCCTTTCGTGCCATCTGGATTGATAAGCAACGACAGACCTTGTCCATCGTTCAATGTGTATGGCTTGTCTTGTGGTTTTGCACGTTTGATCTGTGTATCGGTTAGAGCCAT
Protein sequences of DBSCAN-SWA_1 >LR134000|45493:52666|47151_47442_-|VDY66564.1|DBSCAN-SWA MNKKHTVIVRRPEQIPPVASAKKIHKIKRYPNPYRRKFSILVRILADGRTTDIPLGRFLREFTCDRPDLFKRSPVNRHAVLKEMAGDTRPHLPYIT >LR134000|45493:52666|48582_48828_+|VDY66566.1|DBSCAN-SWA MMRCPFCRHSAHTRTSRYVSDNVKESYLQCQNIYCSATFKTHESICAVIRSPVTEEKPAPASTAPAVVRKVKGCYSSPFNH >LR134000|45493:52666|51490_52666_-|VDY66569.1|integrase|DBSCAN-SWA MALTDTQIKRAKPQDKPYTLNDGQGLSLLINPDGTKGWRFRFRFAGKARLMSFGSYDLVSLAEAREKRDIARKQVANGIDPVEERKALRLAQKLSTENSFEAICREWHTNKADRWTVAYREEIMKTFEQDVFPFIGKRPISEIKPLELLEVLRRIEKRGALEKTRKVRQRCGEVYRYAIITGRAEYNPAPDLAIALAVPKQKHHPFLSAEELPHFIQDLEAYTGSIITKNATKIVMLTGVRTQEMRLATWNEVDLEKGIWEIPAERMKMRRPHIVPLSTQVIALFEQLKPITGHYPYIFIGRNNRSKPISKESVSQVIELLGYKGRATGHGFRHSLSTILHEHGFDSAWIEMQLAHVDKNSIRGTYNHAQYLEKRLHMMQWYSDLLYPKIK >LR134000|45493:52666|47851_48586_+|VDY66565.1|DBSCAN-SWA MSDNTIPEYLQPALAQLEKARAAHLENARLMDETITAIERAEQEKNALAQADGNDADDWRTAFRAAGGVLSDELKQRHIERVARRELVQEYDNLAVVLNFERERLKGACDSTATAYRKAHHHLLSLYAEHELEHALNETCEALVRAMHLSILVQENPLANTTGHQGYVAPDKAVMQQVKSSLEQKIKQMQISLTGEPVLRLTGLSAATLPHMDYEVAGTPAQRKVWQDKIDQQGAELKARGLLS >LR134000|45493:52666|49694_51461_-|VDY66568.1|DBSCAN-SWA MTKGTIVSILIRILKEKRDGLILINGEWGVGKTYFLRTEFRTLYSDTSHFYLSVLGLNSLQDFKDRMLSITYLNTPSEIKKLGDLTSSAASALTQDESTGKLTEQIISTISGAMRDYVLKDLSGVFVIDDLERIPQSLRDEIATFCLQSYQNDNRLDFILVGNFSKQSSEVLSHKEKVVSDEIYFSINNLTDILEQKLAPLEERHKYLITQVIIGFEETNLRIINRVISKLTPLFEKQEPEQKISDIDIKNLVTSLCAHIILKEKFSYQENDFHHNYITSSFKTLTTSSENDPDKISEEESNLLNITAHMTYNNLMVPYCFNEISQKDIIPYIFNSQEPLKKSDYATLKQPEWYNIPENDYLDEIKKVILKTSSPTLSTWLIATNNYIRLSKSKYIPRIRGLTNKTIEKNKRSFSNKEIKEYFLESNPNIDNIPPHILRREGNELHNYFLDKYCDIIKEEKIKELKEKMNVNGWSAIDMDIYQSKFKFNPLETLDVNLIIHGIKNTWSIRDIQLFSNHLSSLYNFSNLADYLSAELPYLKKLHSAINAHHKKINSSFRRGAIIELTECVKRIKEALEQSIALKEDASQ >LR134000|45493:52666|48842_49415_+|VDY66567.1|DBSCAN-SWA MTTMSLQQAFEVCQNNKAAWLQRKNELAAAEQEYLRLLSGEGRNVSRLDELRNIIEVRKWQVNQAAGRYIRSHEAVQHISIRERLNDFMQQHGTALAAALAPELMGYSELTAIARNCAIQRATDALREALLSWLAKGEKINYSAQDSDILTTIGFRPDEASVDDSREKFTPAQNMIFSRKSAELASRQSV 
>LR134000|45493:52666|45493_47137_-|VDY66563.1|DBSCAN-SWA MVALSSVNLLSLASLARQKHPACQIVLAADRDLSGDGQKKAAAAADACEGVVALPPVFGDWNDAFTQYGGEATRKAIYDAIRPPAESPFDTMSEAEFSAMSTSEKAMRIYEHYGEALAVDANGQLLSRYENGVWKVLPPQDFARDVAGLFQRLRAPFSSGKVASVVDTLKLIIPQQEAPSRCLIGFRNGVLDTQNGTFHPHSPSHWMRTLCDVDFTPPVEGETLETHAPAFWRWLDRAAGGRAEKRDVILAALFMVLANRYDWQLFLEVTGPGGSGKSIMAEIATLLAGEDNATSATIETLESPRERAALTGFSLIRLPDQEKWSGDGAGLKAITGGDAVSVDPKYRDAYSTHIPAVILAVNNNPMRFTDRSGGVSRRRVIIHFPEQIAPQERDPQLKDKITRELAVIVRHLMQKFSDPMLARSLLQSQQNSDEALNIKRDADPTFDFIGYLETLPQTSGMYMGNASIIPRNYRKYLYHAYLAYMEANGYRNVLSLKMFGLGLPVMLKEYGLNYEKRHTKQGIQTNLTLKEESYGDWLPKCDDPATA |
7 | Enterobacteria_phage(100.0%) | integrase | attL 45551:45564|attR 57438:57451 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
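The keyword-based highlighting described above can be illustrated with a minimal Python sketch. It assumes a case-insensitive substring match against the annotation text, which this page does not confirm; the names `PHAGE_KEYWORDS`, `phage_keywords_in` and `is_phage_related` are hypothetical and are not part of DBSCAN-SWA.

```python
# Keywords as listed in the legend for the homologous phage analysis.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def phage_keywords_in(annotation):
    """Return the legend keywords found in an annotation string.

    Assumption: matching is a case-insensitive substring test; the
    actual tool may tokenize or match differently.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


def is_phage_related(annotation):
    """True if at least one legend keyword occurs in the annotation."""
    return bool(phage_keywords_in(annotation))


if __name__ == "__main__":
    # Annotation strings taken from the summary rows on this page.
    for label in ("integrase", "transposase,protease", "hypothetical protein"):
        print(label, "->", phage_keywords_in(label))
```

A plain substring test is deliberately simple; short keywords such as 'head' or 'coat' can match inside unrelated words, so a stricter word-boundary match may be preferable on free-text annotations.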
DBSCAN-SWA_2 |
752456 : 773354
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >LR134000|752456:773354|DBSCAN-SWA AATGCGTATTTTAGTCTTAGGGGTCGGCAATATTTTGCTGACCGATGAAGCCATCGGTGTGCGGATTGTCGAAGCGTTAGAGCAACGATACATTCTGCCGGATTATGTTGAGATCCTCGATGGCGGCACGGCGGGAATGGAGCTGCTTGGCGACATGGCAAATCGCGATCATTTGATTATTGCGGATGCCATTGTCTCGAAAAAGAACGCGCCGGGAACGATGATGATCCTGCGGGATGAAGAAGTTCCGGCGTTGTTTACCAACAAAATCTCTCCGCATCAGCTTGGCCTGGCCGACGTCTTGTCGGCCCTGCGCTTCACCGGCGAGTTTCCGAAAAAGCTGACCCTGGTCGGCGTGATCCCGGAATCGCTGGAGCCACACATCGGCTTAACGCCGACGGTTGAAGCAATGATTGAACCTGCGCTTGAGCAGGTTCTGGCTGCGCTGCGTGAATCTGGCGTGGAAGCCATCCCACGGGAGGCGATTCATGACTGAAGAGATAGCAGGTTTCCAGACCTCCCCGAAGGCGCAAGTACAGGCAGCGTTTGAAGAAATTGCCCGGCGTTCGATGCACGATCTCTCTTTTCTGCATCCTTCAATGCCGGTGTATGTTTCTGATTTTACGCTGTTCGAAGGTCAGTGGACGGGGTGTGTGATCACCCCGTGGATGCTGAGTGCAGTTATCTTCCCCGGCCCGGATCAACTCTGGCCGCTGCGCAAAGTGAGTGAAAAAATTGGTCTGCAACTGCCGTATGGCACTATGACCTTTACCGTTGGCGAACTGGACGGTGTTTCGCAATATCTCTCCTGTTCGCTGATGTCGCCGCTTTCGCACAGCATGTCGATTGAAGAGGGCCAACGCCTGACGGATGACTGCGCACGAATGATCCTTTCGCTGCCAGTCACGAATCCGGATGTACCACACGCAGGGCGTCGCGCCCTGCTGTTTGGTCGCAGGAGTGGCGAAAATGCATGAGTTGTCTCTTTGCCAGAGCGCCGTTGAAATTATCCAACGGCAGGCGGAGCAGCACGATGTTAAGCGCGTCACCGCCGTGTGGCTGGAAATTGGCGCGCTCTCCTGCGTTGAGGAGAGCGCCGTCCGTTTTAGTTTTGAAATTGTCTGCCACGGAACGGTGGCGCAAGGGTGCGATTTACATATCGTCTATAAACCCGCCCAGGCCTGGTGCTGGGATTGCAGCCAGGTGGTGGAGATTCATCAGCACGATGCGCAGTGTCCGCTCTGTCACGGCGAGCGGTTGCGTGTCGATACCGGCGATTCGCTGATCGTCAAAAGTATTGAAGTTGAATAACCGGAGTTAATAATGTGTATTGGCGTTCCAGGCCAGGTGCTGGCTGTCGGTGAAGATATTCACCAGCTTGCGCAGGTTGAAGTATGTGGTATCAAGCGCGATGTGAATATCGCCCTGATTTGTGAAGGTAACCCTGCCGATCTACTGGGCCAGTGGGTGCTGGTACACGTCGGATTTGCCATGAGCATCATCGACGAAGATGAAGCCAAAGCCACATTAGACGCACTGCGCCAAATGGATTACGACATTACCAGCGCGTGATGATTAGCTTTCGTAGGCCTGATAAGACGCGGCAGCGTCGCATCAGGCATTGTGCACGATTGCCGGATGCGGCGTGAACGCCTTATCCGGCCTGGATGGCTTGCTGCGACGAACACCAACCCTTACCCCTGACGCTTATCTTCCGTATTCGTCTCGAAATCACTGGCGTCATGGCGCTCATGCAACTGCTCATTCAGCGGTCCGTTGGTGCGGTTAACAATACGCCCACGTTTCACCGCCGGACGTTCGCCTACTTCTTTCGCCCAGCGTTGTACATGCTTATAACTGCCCGCATCAAGAAACTCAGCGGCATCATACACACCACCTAACACCACGTTGCCAAACCACGGCCAAAT
CGCCATATCCGCAATGGTGTACTCATCGCCCGCAACAAACTTATGCTGCGCCAGTTGCTTATCCAGCACGTCGAGCAGACGTTTGGCTTCCATGGTAAAGCGGTTGATGGCGTACTCAATCTTTACCGGTGCGTAATGGTAAAAGTGACCAAAACCACCGCCGAGGAACGGTGCCGCGCCCTGTAACCAGAACAGCCAGTTCATCGTTTCAGTACGCTTTGCCAAATCCTGCGGCAGGAAGTAGCCAAATTTCTCCGCCAGATAAAGCAGGATCGAACCAGATTCAAACACGCGGATCGGCGGATTATGCGTATGATCGCGCAGCGCCGGGATCTTCGAGTTTGGGTTCACTTCGACAAAGCCGCTGGAGAATTGATCACCATCGCCAATACGAATCAGCCAGGCGTCGTATTCTGCACCAGTAACGCCCAGCGCCAGCAGCTCCTCAAGCATAATCGTTACTTTCTGACCGTTCGGCGTTCCCAGCGAATAAAGTTGCAATGGATGTTTGCCAACGGGCAGCGTTTTTTCATGCGTCGGACCAGAAACCGGGCGATTGATATTGGCGAACGCGCCGCCAGCGGATTTATCCCACGTCCAGACTTTCGCGGGCTGATAAGTATTGTCTGTCATAGTGAGTTGCCTTCTGAGTGGTAGTGTTGAAGCAGTGTAGCAGGTCACTTGCTGTACAGAATGTGAATTTTTTAACCCTTTGCGCCAGCCGCTGCGATGAGGTGTATGATTAGTCTGACCTCTGTCCGGTTAATGTCGCTGAGCAGGTAATCAGAGCACAACAATTACGCCAACTGCTTAGGTTGTATTGTTAAAGGTAAAGTGATGAGCAAAGGAACGACCAGCCAGGATGCCCCGTTCGGGACATTATTGGGCTACGCCCCAGGTGGGGTAGCAATCTACTCTTCAGATTACAGTTCTCTCGATCCGCAGGAATACGAAGATGACGCCGTATTCCGTAGCTATATCGACGACGAATATATGGGCCACAAGTGGCAATGCGTTGAATTTGCTCGCCGTTTTCTCTTTCTGAATTACGGTGTGGTCTTTACTGACGTGGGTATGGCGTGGGAGATTTTCTCGCTGCGCTTCCTGCGAGAAGTGGTTAATGACAACATCCTGCCATTGCAGGCATTTCCTAACGGCTCGCCGCGTGCGCCGGTCGCGGGTGCGCTTCTTATCTGGGATAAAGGCGGTGAATTTAAAGACACTGGCCATGTCGCCATCATTACCCAATTGCATGGCAACAAAGTCCGTATTGCGGAACAGAACGTGATTCATTCCCCGTTGCCGCAAGGGCAACAGTGGACGCGCGAGCTGGAGATGGTGGTCGAAAACGGCTGCTATACCCTGAAAGACACTTTTGATGACACCACCATTCTGGGCTGGATGATCCAGACGGAAGATACTGAATACAGCTTACCGCAGCCGGAAATTGCAGGCGAGCTGCTGAAAATCAGCGGAGCGCGCCTGGAAAACAAAGGCCAGTTTGACGGTAAATGGCTGGATGAAAAAGATCCGCTGCAAAACGCCTATGTGCAGGCCAACGGTCAGGTGATCAATCAGGATCCTTATCATTACTACACCATTACCGAGAGTGCCGAGCAGGAGCTAATTAAAGCCACCAACGAGCTGCACCTGATGTATCTTCACGCAACCGACAAGGTGCTGAAAGATGACAACCTGCTGGCGCTGTTCGACATCCCGAAAATCCTCTGGCCACGTTTGCGTCTCTCCTGGCAGCGTCGCCGTCACCATATGATCACTGGTCGTATGGATTTCTGCATGGATGAGCGTGGCCTGAAGGTTTACGAGTACAACGCCGACTCCGCCTCCTGTCATACCGAAGCGGGCTTGATCCTCGAACGTTGGGCGGAGCAGGGCTATAAAGGCAACGGCTTCAATCCGGCGGAAGGGCTGATTAACGAATTGGCTGGTGCCTGGAAACACAGTCGTGCACGTCCGTTTGTCCATATCATGCAGGACAA
AGATATCGAGGAAAACTATCACGCGCAGTTTATGGAGCAGGCGCTGCACCAGGCGGGCTTTGAAACGCGTATCTTGCGTGGATTGGATGAACTGGGCTGGGATGCTGCCGGGCAACTGATTGATGGGGAAGGGCGACTGGTTAACTGCGTGTGGAAAACCTGGGCGTGGGAAACCGCGTTTGATCAGATTCGTGAAGTTAGCGACCGTGAGTTTGCTGCGGTGCCAATCCGTACCGGTCATCCGCAAAACGAAGTGCGTCTTATCGACGTATTGCTGCGCCCGGAAGTGCTGGTCTTTGAGCCGCTGTGGACGGTGATCCCCGGCAACAAAGCGATTCTGCCGATCCTCTGGTCGCTGTTCCCGCACCATCGTTACCTGCTGGATACCGATTTCACTGTTAATGATGAACTGGTGAAAACAGGTTACGCAGTGAAACCGATCGCCGGTCGCTGTGGCAGCAATATCGACCTCGTCAGCCATCATGAAGAGGTGCTGGACAAAACCAGCGGTAAATTTGCCGAGCAGAAAAACATCTATCAGCAACTGTGGTGTTTGCCGAAAGTGGACGGTAAATACATTCAGGTATGTACCTTCACCGTTGGCGGCAACTACGGTGGGACGTGTTTGCGCGGTGATGAATCACTGGTCATCAAAAAAGAGAGTGATATTGAACCGTTAATTGTGGTGAAAAAGTAATACATGTTGTAGCAACCCCGTACTGATTATGAAGAATAATCAGTACGGGGATTAATCAAAATATTCAATATTTATATATTCCAATATTTATTCATTTCATATGTGAATATATTAACCAGTGGAATACCTGTGTTTTGATTATTTCTAAAGGTTTTGAATGAATGCTTATTGTCTGATACACAAGAAATAACACTCTTTTTATCGTTAAAAAATGATATTTCACTTTGCCCATGCCGTTAAAGTCACTGATAATGCGTCCGTTCGTAAATTCAAAATGGCGTAATCTAATATATGCTAAATTTATTTGTTGGCCTTGATATATACACAGGGCTTTTGTTATTGCTTGCTCTGGCATTTGTGTTGTTCTACGAAGCAATCAATGGTTTTCATGACACGGCGAATGCGGTGGCAACCGTTATTTATACTCGTGCCATGCAACCACAACTTGCTGTGGTGATGGCGGCATTTTTTAACTTTTTTGGCGTGTTATTGGGCGGACTTAGCGTTGCCTATGCCATTGTCCATATGTTGCCAACCGATTTGTTGCTGAATATGGGGTCAACCCACGGCCTGGCGATGGTCTTTTCCATGCTGCTGGCGGCGATTATCTGGAACCTGGGAACGTGGTTCTTCGGTTTACCGGCCTCCAGTTCGCACACCTTGATTGGTGCGATTATCGGCATCGGTTTAACCAACGCGCTGTTAATCGGCTCATCGGTGATGGATGCGTTAAACCTGCGTGAAGTGACCAAAATTTTCTCCTCGCTGATTGTTTCCCCTATCGTCGGCCTGGTCATTGCGGGAGGCCTGATATTCCTGCTGCGACGCTACTGGAGCGGGACGAAAAAGCGTGACCGTATTCACCGCATTCCGGAAGATCGCAAAAAGAAAAAAGGCAAACGTAAACCGCCATTCTGGACGCGTATTGCGCTGATTGTTTCCGCTGCGGGCGTGGCGTTTTCGCACGGCGCGAACGACGGACAAAAAGGGATCGGCCTGGTAATGCTGGTACTGGTGGGGATTGCCCCTGCTGGCTTCGTCGTCAATATGAATGCGTCCGGCTATGAAATTACCCGTACCCGCGATGCCGTTACCAACTTCGAACACTACCTGCAACAGCATCCTGAACTGCCGCAGAAGTTGATTGCGATGGAACCTCCATTGCCTGCAGCATCGACTGATGGCACGCAAGTAACAGAGTTTCACTGTCATCCGGCAAATACCTTTGATGCTATTGCGCGCGTTAAAACGATGCTGCCAGGCAATATGGAAAGTTACGAGCCGTTAAGCGTGAGTCAGCGCA
GCCAGCTGCGCCGCATTATGCTGTGCATCTCTGATACCTCCGCGAAGCTAGCGAAACTGCCAGGCGTCAGTAAAGAAGACCAGAACCTGCTGAAAAAACTTCGCAGCGATATGTTAAGCACCATTGAGTACGCTCCGGTGTGGATCATCATGGCGGTAGCACTGGCGCTCGGCATTGGCACCATGATTGGCTGGCGTCGTGTAGCGATGACCATCGGTGAGAAGATTGGTAAGCGCGGCATGACGTATGCGCAAGGCATGGCGGCACAAATGACGGCGGCAGTGTCTATCGGTCTTGCCAGTTATATTGGGATGCCCGTCTCCACAACACACGTCCTCTCGTCTGCAGTTGCAGGGACGATGGTGGTGGACGGCGGTGGGTTACAGCGTAAAACGGTAACCAGCATCCTGATGGCGTGGGTATTTACTTTACCGGCGGCAATTTTTCTTTCTGGTGGGCTGTACTGGATAGCATTGCAGTTGATTTAACCTTCCTGAAAATGCCCGGTCCTGGCGATCGGGCATTTCCATTTTTGACTAGTGATAACCACGCGCGGTCATAAAATCCGTAATCGCTTTTTCTGCATCAACCAACACCTGTTCCAGTGGCTGATTGGCATCGATATCAACCAGTTGTGCACCACCAAAGGTTAACTGTGGCGTTATGGCAATCTTCCTCGCCAGCGATTCCCGTTTATGGTCGGGTTTACGTGCACAGGCAACTTCAAGGTCAACATTGAGTTTGATGACCAGATCAGGCTTATGGCTCGCCATCCAGTGAAACGCTTTACGTTCCTGGCTTGCCAGCCATGAGACAAAACGACCACCTTCAACGTTAGGTGGGAACACCGTACCATCGTAAGCGCCAGGAATTTGGTCCTGAGGATAACGGTCGGTTAGAACAATTAACCCGCGACGACGACAGGCAAGCATATGACGAAAGCGCAGTAAGCGACGGGCGACAAACGCTGTAATTACCAGCGCCGGAACTGGTCCAGGCAATTTTTTTGCTGTTTTCACCTGATTTCGTTCAATTGTTTTATGTAAGGATTTTCCCATCAACGGTAATTTTGTCACTGCACGACCGACATTTCCGGCCTGTTTTCCTAAATGAACTCTTTCGGCAGCACCATATTTTTCGACAACGGTAATAAGATGTTCACACACCGTTGACTTGCCTGAACCATCGCTACCAATAACGGCAATTAATGGAGGTGTTATTGACTGCATGAATATATCCTTTAATTTAAATATCCATTAAAAATATTTATTTGGTTAATATGTTTTTATGAAAGCGTAATTCAGGTCAATGTCACAATTAACCATTGTCACAATAAGGTTGAACGGATATTCTGAGCACGGACCACGTAATATTCAATACATTATTTACTGCCGGGTTTTTCGTGAATCAACGTAATATGTCAATAATTAATTCCACGCCAGTGCGTGTTATTGCCATTGTAGGATGTGATGGTTCAGGTAAATCGACCCTCACGGCAAGCCTGGTAAATGAACTGGCAGCAAGAATGCCAACAGAACACATTTATCTCGGGCAATCGTCCGGGCGAATTGGCGAATGGATTTCACAGCTCCCTGTTATTGGCGCACCTTTTGGGCGTTATCTGCGAAGTAAAGCGGCACATGTGCACGAAAAGCCCTCAACACCGCCTGGCAATATTACTGCACTGGTTATCTATCTGCTTTCCTGCTGGCGGGCGTACAAGTTTCGCAAAATGTTGTGTAAAAGCCAGCAAGGCTTTCTGCTCATCACCGACCGCTACCCGCAGGTTGAAGTGCCGGGGTTTCGCTTTGATGGCCCGCAATTGGCAAAAACCACGGGCGGTAACGGTTGGATAAAAATGTTAAGGCAGCGCGAGCTGAAGCTGTACCAATGGATGGCATCTTATTTGCCCGTATTGTTGATTCGTCTTGGCATTGATGAACAAACCGCGTTTGCGCGTAAACCTGACCACCAACTGGCAGCGTTACAGGAAAAAATCG
CCGTTACGCCGCAACTGACATTCAATGGCGCAAAGATCCTTGAACTGGATGGGCGACACCCCGCCGATGAAATTCTGCAAGCGTCACTACGCGCAATTCACGCCGCGCTTTCCTGATCACCAAAATGGATAGGGGATGATTTAACGAATGGATGCACTACAAACTCAAACTGTTAATAGTACAACCGCGCCGCAGCCCAACTACATTCCGGGGCTGATTGCGGTGGTCGGGTGTGATGGCACCGGTAAATCCACACTGACCACCGACCTGGTGAAATCGCTGCAACAACACTGGCAAACCGAGCGGCGCTATCTGGGGCTGCTCTCCGGCGAAGACGGCGACAAAATCAAACGATTGCCGTTGGTTGGCGTCTGGCTGGAACGGCGACTGGCGGCCAAATCCTCGAAAACCCAAAGCATGAAAACCAAATCTCCGGCGCTATGGGCGGCGGTGATTATGTACTGCTTCTCGCTGCGAAGAATGGCGAATCTACGCAAGGTTCAGCGACTGGCGCAAAGTGGCGTTCTGGTGGTCAGCGATCGCTTCCCGCAGGCTGAAATTTCGGGCTTTTATTATGATGGACCGGGGATTGGCGTCGAACGTGCGACCGGGAAAATCAGCATGTTTCTGGCGCAGCGCGAACGGCGTTTATACCAACAAATGGCGCAATATCGCCCGGAATTAATTATTCGCCTGGGCATTGATATTGAGACTGCCATCTCCCGCAAGCCTGACCATGACTATGCCGAGCTGCAGGACAAAATCGGTGTCATGTCGAAGATTGGCTATAACGGCACAAAAATTCTTGAGATTGATTCCAGAGCGCCCTACAGCGAAGTGCTGGAACAGGCACAGAAGGCGGTTTCTCTGGTTGCCATCGTTTCTGACCGCCGAAGTTTAACTTAAGGCTGGCGGGATACTTTTTACTATCAACGATAAGGTCAGGCTGAATTGGCGGGTTTTAACATCAAACTTTGGTTTGCAGATGGCGCGTTTCGCACCATTATTCGCAATAGCGCCTGGTTAGGCTCCAGTAATGTCGTGAGCGCCTTGCTGGGTCTGTTGGCGCTCTCGTGTGCCGGTAAAGGGATGACGCCCGCCATGTTTGGCGTACTGGTGATTGTGCAATCGTACGCCAAGTCGATCAGCGATTTTATTAAGTTTCAGACATGGCAACTGGTGGTTCAGTACGGAACACCAGCATTAACCAACAATAATCCGCAGCAATTCCGCAATGTCGTCTCATTTTCCTTCTCGCTGGATATCGTCAGCGGCGCGGTGGCGATTGTCGGTGGCATTGCCTTACTGCCATTCCTTTCCCATTCATTAGGTCTGGATGACCAAAGTTTTTGGCTGGCAGCGCTCTATTGCACGCTCATTCCTTCAATGGCTTCCTCCACGCCGACCGGCATTCTGCGTGCGGTAGATCGCTTCGATTTAATTGCTGTACAGCAGGCGACGAAACCTTTTCTGCGCGCAGCGGGGAGCGTCGTAGCCTGGTATTTTGACTTTGGTTTTGCGGGTTTTGTTATTGCCTGGTACGTGTCGAATCTGGTTGGCGGCACCATGTACTGGTGGTTTGCCGCGCGCGAATTACGCCGCCGAAATATCCATAACGCCTTCAAATTGAATCTGTTTGAGTCTGCCCGACACATTAAAGGCGCGTGGAGTTTTGTCTGGTCAACCAACATTGCCCACTCCATCTGGTCGGCGCGTAATGATCTTACCCAGCAATAGTGGACACGCGGCTAAGTGAGTAAACTCTCAGTCAGAGGTGACTCACATGACAAAAACAGTATCAACCAGTAAAAAACCCCGTAAACAGCATTCGCCTGAATTTCGCAGTGAAGCCCTGAAGCTTGCTGAACGCATCGGTGTTACTGCCGCAGCCCGTGAACTCAGCCTGTATGAATCACAACTCTACAACTGGCGCAGTAAACAGCAAAATCAGCAGACGTCTTCTGAACGTGAACTGGAGATGTCTACCGAGATTGCACGTC
TCAAACGCCAGCTGGCAGAACGGGATGAAGAGCTGGCTATCCTCCAAAAGGCCGCGACATACTTCGCGAAGCGCCTGAAATGAAGTATGTCTTTATTGAAAAACATCAGGCTGAGTTCAGCATCAAAGCAATGTGCCGCGTGCTCCGGGTGGCCCGCAGCGGCTGGTATACGTGGTGTCAGCGGCGGACAAGGATAAGCACGCGTCAGCAGTTCCGCCAACACTGCGACAGCGTTGTCCTCGCGGCTTTTACCCGGTCAAAACAGCGTTACGGTGCCCCACGCCTGACGGATGAACTGCGTGCTCAGGGTTACCCCTTTAACGTAAAAACCGTGGCGGCAAGCCTGCGCCGTCAGGGACTGAGGGCAAAGGCCTCCCGGAAGTTCAGCCCGGTCAGCTACCGCGCACACGGCCTGCCTGTGTCAGAAAATCTGTTGGAGCAGGATTTTTACGCCAGTGGCCCGAACCAGAAGTGGGCAGGAGACATCACGTACTTACGTACAGATGAAGGCTGGCTGTATCTGGCAGTGGTCATTGACCTGTGGTCACGTGCCGTTATTGGCTGGTCAATGTCGCCACGCATGACGGCGCAACTGGCCTGCGATGCCCTGCAGATGGCGCTGTGGCGGCGTAAGAGGCCCCGGAACGTTATCGTTCACACGGACCGTGGAGGCCAGTACTGTTCAGCAGATTATCAGGCGCAACTGAAGCGGCATAATCTGCGTGGAAGTATGAGCGCAAAAGGTTGCTGCTACGATAATGCCTGCGTGGAAAGCTTCTTTCATTCGCTGAAAGTGGAATGTATCCATGGAGAACACTTTATCAGCCGGGAAATAATGCGGGCAACGGTATTTAATTATATCGAATGTGATTACAATCGGTGGCGGCGGCACAGTTGGTGTGGCGGCCTCAGTCCGGAACAATTTGAAAACCAGAACCTCGCTTAGGCCTGTGTCCATATTACGTGGGTAGGATCAGTTTTGAAATAGCTCCAGAAGTCGTACACAGGCTCAGTAGAACAATATAGTTTGGAACGCATTACCGCACCTTCCCAGCCACTCCAGAAAATAGCCGCCAGTTGTGTATTATTCATATTTGAACTTATTTCGCCTGAAGATAACGCGTCGGACAGGCAGGCCGCAACAAGTGCCTGCCAGCTCTCAAGAATATTCTGTAGCACTTTAATAAAGGATTGGGGAAGCCCCGGAGTCTCCTGCATCATATTTCCAACCAGGCAGCCGCGGGTAAAATTGTATTTTTTAATCCCCTCACAGGCATCATTGATGAAATTTTCCAGACGTACCATCGGTGTGCAGGATGATTCATGTAGATGTTTTTTGAGTTTATGCTCGAAGAAACTGTCATAGGCGTTCAGAACGGTCTGAGCATAATCCTCTTTACTCTTAAAGTAGTAATAAAATGAGCCTTTAGGGACATTGGCATTTTTAACAATGGCATCGACTCCCGTCGCAAGAAAACCATTCTGCGTCAGTAGTTCAAGGCCGCTGCGGATCAGATCATTACGAGTATCTGCATAGGTACTTTCTGTCTTTCGCGGCCTTCCCCGGCCACGATTTTCGCCTGTCATGACTCATCTCCTACATTTTATTAACGCAAAAGAATAAGTGATTTTGCGCGAGAGCTTCAATCAGCTTACCCCCGCAGTGGCAAAGTGTTCGCTTTTGCCGACTGGACAGTGGCAGTGCCTGTCGCTACTGCACCATCGCTGAACAGTGCAGAATTAGCAACGTTTACTTTTGGCACAAAGCATACGCCCATTCTCACGCGAATGCTTTGCCCTTGACTTGCCAACAACCGAATCGCTTGCTCTGCAGCTATGGTTTTTTATTAGAATCTTCGCCACTAAATAAATTATAATAAATTGATTTTATTAGATATCTATCAGAATCTATATGGTCAGCACGAGTTGCATGCTCTAATGTTTCAAATGCCAGAGTATGTGCGGCTTCAGCAAATATCCCAGGCCAACCCTTACTCAGC
ATCGACAGTCCTTTAAGGACTCTTATCTGAATCTCCCTCATACTGGCACCATCGCGCGCGACAGGTGAGAAAAAGTCTTGCAGTAGATCGTTATTCTGAAGTGGTGCAACATGTACTGAAGGATATTTCACTTCTATTTCATCAGATTTATTCTGCGCGTAAGTGGAAAGTATACGAACACCTCTGCCAATGACATCAATGGCGGTTCCAGGATCGTTCACTGCAGGGGAAAGGGCCCGGCAGGCTATCTCGGCCATGACGCTAAGACAAAATCGAGGATCCTGAGCAAATGAACGTGCATCCGAGACAATAAGCGTCTCAAGTAAATCGGTGCTGATTGACGACTCCTGGCCCTGACTCAGGTACAAAACTGGCATGGACGGATGTATGAAACTGCCTGGCTGCGCCACGAGGTATACATGACAGGGATCATTGGTCAGCAGCTTGCTAAGTTTCACCATATCAATATATTCAACATAGCCAATCTTCTTCGGATAAACTGCAACCGTTCCTTTCGGCTGTTCATTGTTCTCAAGCCATGGATATCCGCCGAGACAGGGATTTCTTGCTCTCGCAATAAATGTTTCGATGGCCGCCTGTTCTACTTGTGCCGTTGTCTCACCAACCCTCCCCAGAGAGGTCAAATGCTGTATCCAGCGAAGCAATGTGATGAGGATTAAGGCAATGACAACCAGTGTGACAATGAATAAAATGACTCTCCCCCTTTCTCCATAAGCTCCCATATTGAGGGCAATAATCCCTACCAGACTGAAGAGAAAAGAACCGATGAAGGTGGCCAGTACATTTTGTGTGGTGACGTCTTCAACAACTAAACGCGTAGCACTGGGAGTCACATTAGTAGTGGCTGAACCGTAGGCTGTGACCATGATACTCAGCGAAAATGTGGTCACTGCCAGCATACTCGATGCCAGTATGTTCAGAATGTTATCGACTGCTTCCGCACCAACCTTCACGGAAACCGACTCAGGTATCATTGATTTAAAAAGAATTGATAAAAGGGCCGTTATTATCGCGACAATTGCGAATAACGTTGCCCTGAACCATAGTTTTTTAATTGTCTGCTTCAGCATCCATTTCCAGCGTGAAATCATTCTGCATTCCCTCATATTGCGGCCGGGATAATAATTGCCGCCCCCAGGACGGCATGTTGCTATTAACGAACCAGGTGCAGGAAGTGCAGGTGTTTTTCGTACTGGTCCAGTATGTCATTAATCAGCTGTTCCTGGTTCCAGCCCATCACATCATAATTTTGACCGCCTTCTTTGAGATAAACCTCAGCGCGATAATATCGATGTTGTTCAGTCTGCTGCTCATCATTATCCATTGCGGCGAGCGCGAAGGTCGGTGAGATATACCCGCGAAGCCTCACTTCATATATAAAATTCAGCTCGTTGCCCAAATCGACTTCAAGACGAATACGATCGTCGACTGCATCACTAATGTGGCTTATCGTCCCCTGCTTGTTCAGTTCCTCCTGAACCAGCGTCATGGCGGGCTGGATGACGTCGACCATAAAACGTTTCACAAGAGATCGCTTCGGCAGATACGCGATATTGCGTAACCTTCTCTGCCAAGGAATTGGGTTACGTGCAGCCGTAGGAGCAATTGTCGCCATGCTCAGGCTTTCACGCTTGGTCAAATCCCGGCGCAAAGCTTTTAAAAGTCCGTATATGGATATTAGTAAGATCACTGAGAAGGGCAATGCACTCGCTATTGTCACCGTTTGCAGCGCACTTAGCCCTCCGGCAAGGAGAAGCGCAATTGCAACAATGCCCATGAGCGAGGCCCAGAATATTCGCTGCCAGACGGGTGTGTTTGCCACTCCACCTGATGCCAGAGTATCCACAACCATTGCCCCCGAATCAGCAGACGTTACAAAGAAGACGATGACCATCGCCATTGCAATGAATGACAGCACAGAAGAGAACGGGAAATGCTCCAGGAAATTAAACAGGGCCAGCGACACATCCTGCTGAACA
GTATTGGCGAGGTCTGTGGCCCCCTGGTTCATAATGAGATAGATCGCGCTGTTACCAAACACCGTCATCCACATTAGCGTAAAACCCGCTGGAACAAACAGCACGCCGGTGACAAACTCGCGAATGGTTCGCCCGCGGGAGACCCGTGCGATGAACATCCCCACAAACGGCGACCATGAAAGCCACCATCCCCAGTACAGTAATGTCCAGCCCCCCAGCCAGTTGCTCGACTTGGGCTCATACGCGTAAAGGTTGAACGTTTTACTCACCAGTTCCGAAAGATAACCGCCCGTATTTTCCACAAATGACTTCAGCAGAAGCACGGTTGGTCCCAGACACAGGACCAGCGCCAGGAGCAACAAAGCCAGACCCAGATTGAGTTCAGACAGGATACGTATTCCCTTATCCAGACCGGACACCACTGAAATCGTCGCTAACCCCGTGATGACCACGATCAGAATTACCTGCACCGTTTCATTGATGGGCACCCCGAAAAGATGGTTCAAACCGGCATTCACCTGCAAAACACCGTAACCCAGTGATGTCGCAACGCCAAAGACCGTGCCTATAACAGCGAAAATATCAACCGCATGTCCTACAGGTCCGTATATGCGATCGCCAATAATGGGATAGAGTGCGGAGCGCAGAGTTAAAGGCAGACCGTGACGGTAACTGAAGAACGCCAGAATCAGCGCCACAATGGCATAAATTGCCCATGCGTGCAGTCCCCAGTGGAAAAAGGTCAGACGCATTGCTTCCTTAGCTGCCGCAACGGTTTCTGGAGTGCCAACGGGTGGCGAAAGATAATGCATTACAGGTTCGGCAACGCCAAAGAACATCAGGCCGATCCCCATCCCTGCCGAAAAAAGCATCGCAAACCAGGAGTGGTAGCTGAAATCAGGCTGCGCATGGTCCGGGCCCAGCTTGATATCACCGTAGCGTGAGAGTCCAAGGAACGTGACACTCAGTAAAATCAGGGCCACAGCAAGGATGTAGAACCAGCTGGCATTCGTGAAGATTTGTTGCTGAAGTAGTTTAAAATTTTTGTCGGCGACATCCGGGAATACGGCGGCAAAGGCGACAAGAAGGAAAATTAGCAAAGCAGATGTAAAGAATACCGCTTTGTTAATCTGACTTGTAGACTTCTTTGGGATTGTATCATTTTCACTCATAATCATTTATTCCATTAATTAAATTCGCACTGGATAGAAGGTACTACCTCTCACAAAGTAGCAAAGTTCAACTGGCTATGCGGCGTTAATCGGAATGCATGGTGGAATGTTGAGGTGTACTGGCAATAGCGGACACTACCATTTGTTCTTTTTTTAAGCAGCCATCTGATGATATTTTTCCCTGAAGGCTGCCGGGGAGATATTCCCCAGACGAGAGTGACGACGCTGACGATTGTAGAAAATCTCAATGTATTCCCGTATTACTGAGATGGCTTCATCCCGGTTATTAAAACGATAGTGGCTCAGGCTCTCATTTTTCAGTGTTCCCCAGAAGCTTTCCATCGGAGCGTTGTCGTAACAGTTACCTTTACGCGACATTGATGTTTTCAGACCAAACTGCTCCTGTATGACCCGGTAATCGTATGCGCAGTACTGTGAACCTCGATCAGAGTGGTGGATTAGCCCGGCAGGTGGGCGCTGGCTCCTGAGCGCCATAAACAGGGCTTTACCTGTCAGCTCTTTTGTCATGCGCTCTCCCATGGCGTAGCCGACAATTTCGCACGTATAAACATCTTTGATGCCAGCGAGGTACAACCATCCCTCCTGTGTGGCAACATACGTCAGGTCCGCCACCCAGACCTGATTTGGTGCTGTAGGAGCGAACGTCTGGTTCAGCAGATTTGGCGCAACTGGCAGATTGTGGTTCGAGTTCGTAGTCGCTCTGAACTTGCGTTTCTGCTTACAGCGTAGCCTTAGCTCCTTACGAAGACGTGCCAGTCGGTCACGACCAACGATGATGCCATTCTCTGCCAGCTCCGTCTGGAGCCG
CCGGGTTCCATATGTTTCGCGAGTGCGGATATGTGCCACCTTAATCTCCAGTTTTAGCCGCTCATCACTTTGTTTTCTGTCTGAGGGTTCATGCTGTACCCAGTTGTAATAACCGCTCCTGGATACACCAAATACCTGACACATCGCTTCAATGGGAAATTGTTGTCGCCATTGTTCGATTAACGCGTATTTTTCAGCGACTCCTGTGCAAAATACGCTGTTGTAGATTCAATCTGTCAATGCAACACCCCTTTCAATTATCTCTTTCGGTGTTTTGAACTTCAGTGTCTTTCTCGGTCTGTTGTTTAGCTGAGCAGCAACCAGATCCAGTTCATGTTGAGTATATTGGGCAAGACATGTCTTTTTAGGAAAGTACTGCCGAATTAGCCCATTTGTGTTCTCATTTGTTCCCCGCTGCCAAGGACTCTGAGGATCGCAGAAGTAAACTTTAACGCCGGTGCTGACAGTAAATTCTAGATGTCTGGCCAGTTCCATTCCTCTGTCCCATGTCAGTGATTTTCTGAGTTCTGACGGTAAACTCAGGAATTTGTCGGTAAGAGCCTGATTTACTGAGACAGAATCTTTGCCCCTGAGTCTAAGGATGATCGTATAACGTGATTTTCGGTCTACAAGTGTGGCTATATGAGAGTTTTTTGTACCTGAGACTAAATCGCCCTCCCAATGCCCCAGAGAGCGTCTGTTATCGATATTTCGGGAACGTTCGTGAATTGGTGTTCCGTTCACTATGTTAATCGTACCTCTTTCGCCTTTGCGGGTATGACGCCTGCCATGGCGAAGGCTATGCGACCGTCGCAGATGCTGTATATTCAGGTGGTGTAGCGCTTCACGGCTACGAAAGTACAGCGTTTTATAAATTGTCTCAGGTGATATTCGCAGCGTTTTTTGACGTGGTTTTGTTCGCCTTAACCATCCTGATATTTGCTCTGGAGACCATTTCATCTCCAGCTTTTCCAGAACAAGCTTTCGCAATGGTAAATTTTGATCCAGTAAGCACGGTTTTGGCCTTTTCGCCATTCTGTTGGCTCGGTTATTAGCATCAACAGCTTTGTAATAGCGTCTGCCCCGATTACGCTGAACTTCACGTGAGATCGTCGAAGGACTGCGATTCAGCGCAGTAGCTATCGCACGAATGCTCATTTTGGCTGACAAACCAGCTCGTATCTCCTCGCGCTCAGACAGTGTCAGGTGAGCTACAGCCCGCTTACGCTCATGGGGTTTTATGCCGCCAGTATCCCTTAACATAGTGAAGATCGTTCCGGGTTTTGAACCCAGGATATTCGCTATTTCACTGAAGCCTGTTCCGTTCTTCCATAGTTCAAAAACAGAGGCTTTTTCCTCTGCTGTAAATGTTCGTCTCATTCAAAAAACCTCCGCAACCCCATGTTTTCACATAACTGTTGCGTTGACCAATTGAATCTACAACCGCGCTCTTGATGTCAGACTCCCTGAACAGTTTTCGATAATCGGGAAACTCAGGGCGCGTTATCCTATGGCCACTCTCTGCCATGTGTTCGGGGTCCATCGCAGCAGCTACAAATACTGGAAAAACCGTCCTGAAAAACCAGACGGCAGACGGGCTGTATTACGCAGCCAGGTACTTGAACTGCATGGCATCAGCCACGGCTCTGCCGGAGCAAGAAGCATCGCCACAATGGCAACCCAGAGAGGCTACCAGATGGGGCGCTGGCTTGCTGGCAGACTCATGAAAGAGCTGGGGCTGGTCAGTTGCCAGCAGCCGACTCACCGGTATAAGCGTGGCGGTCATGAGCACGTTGTTATCCCGAATCATCTTGAGCGACAGTTCGCCATAACGGAACCAAATCAGGTGTGGTGCGGTGATGTGACCTATATCTGGACGGGTAAGCGCTGGGCGTACCTCGCCGTTGTTCTCGACCTGTTCGCAAGAAAACCAGTGGGCTGGGCCATGTCGTTCTCGCCGGACAGCAGGCTTACCATGAAAGCACTGGAAATGGCATGGG
AAACCCGTGGTAAGCCCGTCGGGGTGATGTTCCAAGCGATCAAGGCAGTCATTATACGAGCAGGCAGTTCCGGCAGTTACTGTGGCGATACCGGATCAGGCAGAGTATGAGTCGGCGTGGAAACTGCTGGGATAACAGCCCAATGGAGCGCTTCTTCAGGAGTCTGAAGAACGAATGGGTGCCAGCGACGGGCTATGTAAGCTTCAGCGATGCAGCTCACGCCATAACGGACTATATCGTTGGATATTACAGCGCACTAAGACCGCACGAATATAATGGTGGGTTACCACCAAACGAATCAGAAAACCGATACTGGAAAAACTCTAACGCGGAGGCCAGTTTTAGTTGACCACAACAGACTACTTGAAGGGAGCCGCGGTCGCCTGGCAGTTGCAGTAGCAGGAGATCATCCAGCCGCAGTACAGATCACGATGACTCTGGTTAATGATACCGGCTTTGACCCCGTATTTTCCGGCTCTATCGCTGAATCATGGCGTCAGCAGCCGTGCACACCATCCTATTGTTGTGACTGGGAGGCTGCCACCATGCTTCGCGCTTTCCCTCTGGCGAAAAAGGGAGAAGGACGGGCCCGTCTGCCTTCACTTTATGCCAGCTTCGGTAAGCTGGGTGAGACACCGACTCATGAAGATATCATTGATAACAATCGATCCATCAACTGGCCTGTATAACGTGGCTGCCGGTGATTAAGAAAGCTGCACCTACCTAAGTAGTAGCAAACGCACACTTTTTAGAAAAATCGATGGTCAGAAACTGGATTAGCAATTCCGTTCCAGGGTTGCTTTTGATTTACGTTGGCGTCTGATCATTGATTTATCCTCAAAAGCCCAACCTCATTGGTAATGAACCAGCTCCGTGAATGTCCGCTCTGGCACAGAGCGAAATTTTTTGATCTCCCCCCCTGAAATCTAAACTTAGTCATGTCACGTTTTTGGGTTTCTAAAATTTTAACTTCGCGTTTTTCGTTGCCGTAAGGGTTATACAGAAATGTCCGTTAAGCAGAGTTCAAAATTGATTGCTGTGATCACGACTGGTTTGAAAGCCGCGCCCAAGCCTGTACAGCTCTGGTTTGCGTTGATTATGAACCTGTCAGCCTAAAGCAAGCGGATGGACGATGAGTATTGGTAATCTTTCAGAGTCCGGAAAAGTTCAGCCCCAGTCTGAACAGGCTTGCTGGCGCCAGTCCAGTTTCATTCAGTCGTGGTTTGGTTCTTACGGCCTGTGCAATCTACCTCATTAGGCACATCGGCCTGCCAGATACCGGCTCGGGGTGTATTTCCGCTTCCACGCTGAATACTGTTCTCAGCAATCCAGGGGTCATCACCTCTTCTGGTGTGCCTTGCGCCATAACATGTCCGTTTGCCATTACCACCAGTTGATCGCAGTACCGGCTAGCCTGATTAAGGTCGTGCAGCACAGCGACCACCGTTTTCCCCTGAGTCCGGAGTTCGCCCATCAACCGCATCAGGTCCACCTGGTGATTGATATCAAGATAGGTGGTTGGCTCATCAAGTAATACAACGGGCGTATTCTGGGCCAGGACCATCGCCAGAAATGCGCGCTGGCGCTGACCGCCGGAAAGCTCGGTTAACCGACGAACGGCAAGATGATTGATCCGGGTCTGGTTCATGGCGACATTAACTCGTGCATTGTCTTCAGCGGAGAGACGCCCCCAGAGTGACAGCCAGGGATTACGACCATACGAAACCAGCTCCTGGACTGTGATCCCCTCTGGCGTTAAATGGTGCTGAGGCAGCAGCGAAAGCCTGCGGGCCAACTGGCGCGATGAGAGCATATTTATGGGATTATCGCCGAGAAATACGGTGCCAGACTGCGGCATTAAAAGCCGCGAAAAACAGTTTAACAGCGTCGATTTCCCGCAACCGTTAGGACCGATCAGGGCGGTGATCTTCCCCGTTGGCAGTGAGAGTGAAACGTCGTTAAGTACCTTGTCTGTCCCGTAACTGACCGTCAGATTTTCA
GTTCGTAAAGTCATTTATCGCATTCTCACAAGCAACCAGACAAACCACGGCGCACCGATAATGGCGGTCAGCACGCCAACCGGGAGCTCCAGTGGGGGATGAATAATTCTCGCCAGCAGATCGGCAACCACCAACAGCAACGCACCTGTCAGGGCCGAAACAGGCAGCAGTCTGCGGTGACGTCCACCGGTGATGCTACGCATCATATGCGGCACCACGAGACCAATAAAGCTAATCGGGCCGCAGGCGGCCACGCCGGTAGATGTCATGGCGACAGCTAGTAACAAAGCCCAGAATCGGGTATGGGGCACCGACACACCGAGCGTGGTGGCGCGCGCATCGCCGAGTGCAAGGAGGTCGAGATCGCGGCAAAAACTCAGGCTCAGCGGCAGAAATAAAATCATCAGCGGGATGGCAATCTTCACAAAGCTCCAGTCACGGCCCCATAAGCTGCCGGTCAGCCACAGCAGGGCGTTGTTCACATCCTGCGGGCGCGAGAGCATCAGATAATCCGTCAGGCTGGCCCAGCATGCAGAAAGCGCCACGCCGGTGAGCGCCAGCTTCATCGGCTGGTGGGTCTTTGCCAGCATCTTCAGCAATATCAACCCCGCCATGCCGCCCGCAAAGGCCAGCAGCGGCAGCACCATCACGGGCAGTGACGGCATAAGAAGTAGAGCCCCCACAGAGGCCAGGCTGGCGGCATGGTTAACGCCGAGAATATCCGGTGATGCCAGAGGGTTGCGCACAATCCCCTGTATCAGCACGCCCGCCACGGCGAGGGCTGCACCGACAAACAGTGCCAGCAGCAAGCGCGGCAGTCGGTACTCCATCAATACATAATAATGCTCGCGTCCGGCCTGCCAGTCGGTCAGCAGCGCGCGCCACGGCACGGGGATCACTCCCATATGGAGTGATAACAGCGCACAGCCCGCCAGGGCAAGGGTGATGAAAATAACCAGCGCAATTTTCAT
Protein sequences of DBSCAN-SWA_2 >LR134000|752456:773354|770079_770493_+|VDY67235.1|transposase|DBSCAN-SWA MGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVVIPNHLERQFAITEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPVGVMFQAIKAVIIRAGSSGSYCGDTGSGRV >LR134000|752456:773354|765454_765613_-|VDY67231.1|DBSCAN-SWA MTYWTSTKNTCTSCTWFVNSNMPSWGRQLLSRPQYEGMQNDFTLEMDAEADN >LR134000|752456:773354|762166_762466_+|VDY67227.1|transposase|DBSCAN-SWA MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWRSKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILQKAATYFAKRLK >LR134000|752456:773354|772397_773354_-|VDY67239.1|DBSCAN-SWA MKIALVIFITLALAGCALLSLHMGVIPVPWRALLTDWQAGREHYYVLMEYRLPRLLLALFVGAALAVAGVLIQGIVRNPLASPDILGVNHAASLASVGALLLMPSLPVMVLPLLAFAGGMAGLILLKMLAKTHQPMKLALTGVALSACWASLTDYLMLSRPQDVNNALLWLTGSLWGRDWSFVKIAIPLMILFLPLSLSFCRDLDLLALGDARATTLGVSVPHTRFWALLLAVAMTSTGVAACGPISFIGLVVPHMMRSITGGRHRRLLPVSALTGALLLVVADLLARIIHPPLELPVGVLTAIIGAPWFVWLLVRMR >LR134000|752456:773354|771629_772397_-|VDY67238.1|DBSCAN-SWA MTLRTENLTVSYGTDKVLNDVSLSLPTGKITALIGPNGCGKSTLLNCFSRLLMPQSGTVFLGDNPINMLSSRQLARRLSLLPQHHLTPEGITVQELVSYGRNPWLSLWGRLSAEDNARVNVAMNQTRINHLAVRRLTELSGGQRQRAFLAMVLAQNTPVVLLDEPTTYLDINHQVDLMRLMGELRTQGKTVVAVLHDLNQASRYCDQLVVMANGHVMAQGTPEEVMTPGLLRTVFSVEAEIHPEPVSGRPMCLMR >LR134000|752456:773354|757371_758871_+|VDY67222.1|DBSCAN-SWA MLNLFVGLDIYTGLLLLLALAFVLFYEAINGFHDTANAVATVIYTRAMQPQLAVVMAAFFNFFGVLLGGLSVAYAIVHMLPTDLLLNMGSTHGLAMVFSMLLAAIIWNLGTWFFGLPASSSHTLIGAIIGIGLTNALLIGSSVMDALNLREVTKIFSSLIVSPIVGLVIAGGLIFLLRRYWSGTKKRDRIHRIPEDRKKKKGKRKPPFWTRIALIVSAAGVAFSHGANDGQKGIGLVMLVLVGIAPAGFVVNMNASGYEITRTRDAVTNFEHYLQQHPELPQKLIAMEPPLPAASTDGTQVTEFHCHPANTFDAIARVKTMLPGNMESYEPLSVSQRSQLRRIMLCISDTSAKLAKLPGVSKEDQNLLKKLRSDMLSTIEYAPVWIIMAVALALGIGTMIGWRRVAMTIGEKIGKRGMTYAQGMAAQMTAAVSIGLASYIGMPVSTTHVLSSAVAGTMVVDGGGLQRKTVTSILMAWVFTLPAAIFLSGGLYWIALQLI >LR134000|752456:773354|753778_754027_+|VDY67219.1|DBSCAN-SWA MCIGVPGQVLAVGEDIHQLAQVEVCGIKRDVNIALICEGNPADLLGQWVLVHVGFAMSIIDEDEAKATLDALRQMDYDITSA 
>LR134000|752456:773354|753424_753766_+|VDY67218.1|DBSCAN-SWA MHELSLCQSAVEIIQRQAEQHDVKRVTAVWLEIGALSCVEESAVRFSFEIVCHGTVAQGCDLHIVYKPAQAWCWDCSQVVEIHQHDAQCPLCHGERLRVDTGDSLIVKSIEVE >LR134000|752456:773354|765557_767555_-|VDY67232.1|DBSCAN-SWA MSENDTIPKKSTSQINKAVFFTSALLIFLLVAFAAVFPDVADKNFKLLQQQIFTNASWFYILAVALILLSVTFLGLSRYGDIKLGPDHAQPDFSYHSWFAMLFSAGMGIGLMFFGVAEPVMHYLSPPVGTPETVAAAKEAMRLTFFHWGLHAWAIYAIVALILAFFSYRHGLPLTLRSALYPIIGDRIYGPVGHAVDIFAVIGTVFGVATSLGYGVLQVNAGLNHLFGVPINETVQVILIVVITGLATISVVSGLDKGIRILSELNLGLALLLLALVLCLGPTVLLLKSFVENTGGYLSELVSKTFNLYAYEPKSSNWLGGWTLLYWGWWLSWSPFVGMFIARVSRGRTIREFVTGVLFVPAGFTLMWMTVFGNSAIYLIMNQGATDLANTVQQDVSLALFNFLEHFPFSSVLSFIAMAMVIVFFVTSADSGAMVVDTLASGGVANTPVWQRIFWASLMGIVAIALLLAGGLSALQTVTIASALPFSVILLISIYGLLKALRRDLTKRESLSMATIAPTAARNPIPWQRRLRNIAYLPKRSLVKRFMVDVIQPAMTLVQEELNKQGTISHISDAVDDRIRLEVDLGNELNFIYEVRLRGYISPTFALAAMDNDEQQTEQHRYYRAEVYLKEGGQNYDVMGWNQEQLINDILDQYEKHLHFLHLVR >LR134000|752456:773354|752943_753432_+|VDY67217.1|DBSCAN-SWA MTEEIAGFQTSPKAQVQAAFEEIARRSMHDLSFLHPSMPVYVSDFTLFEGQWTGCVITPWMLSAVIFPGPDQLWPLRKVSEKIGLQLPYGTMTFTVGELDGVSQYLSCSLMSPLSHSMSIEEGQRLTDDCARMILSLPVTNPDVPHAGRRALLFGRRSGENA >LR134000|752456:773354|758919_759612_-|VDY67223.1|DBSCAN-SWA MQSITPPLIAVIGSDGSGKSTVCEHLITVVEKYGAAERVHLGKQAGNVGRAVTKLPLMGKSLHKTIERNQVKTAKKLPGPVPALVITAFVARRLLRFRHMLACRRRGLIVLTDRYPQDQIPGAYDGTVFPPNVEGGRFVSWLASQERKAFHWMASHKPDLVIKLNVDLEVACARKPDHKRESLARKIAITPQLTFGGAQLVDIDANQPLEQVLVDAEKAITDFMTARGYH >LR134000|752456:773354|764217_765495_-|VDY67230.1|DBSCAN-SWA MISRWKWMLKQTIKKLWFRATLFAIVAIITALLSILFKSMIPESVSVKVGAEAVDNILNILASSMLAVTTFSLSIMVTAYGSATTNVTPSATRLVVEDVTTQNVLATFIGSFLFSLVGIIALNMGAYGERGRVILFIVTLVVIALILITLLRWIQHLTSLGRVGETTAQVEQAAIETFIARARNPCLGGYPWLENNEQPKGTVAVYPKKIGYVEYIDMVKLSKLLTNDPCHVYLVAQPGSFIHPSMPVLYLSQGQESSISTDLLETLIVSDARSFAQDPRFCLSVMAEIACRALSPAVNDPGTAIDVIGRGVRILSTYAQNKSDEIEVKYPSVHVAPLQNNDLLQDFFSPVARDGASMREIQIRVLKGLSMLSKGWPGIFAEAAHTLAFETLEHATRADHIDSDRYLIKSIYYNLFSGEDSNKKP >LR134000|752456:773354|767708_768527_-|VDY67233.1|transposase|DBSCAN-SWA 
MCQVFGVSRSGYYNWVQHEPSDRKQSDERLKLEIKVAHIRTRETYGTRRLQTELAENGIIVGRDRLARLRKELRLRCKQKRKFRATTNSNHNLPVAPNLLNQTFAPTAPNQVWVADLTYVATQEGWLYLAGIKDVYTCEIVGYAMGERMTKELTGKALFMALRSQRPPAGLIHHSDRGSQYCAYDYRVIQEQFGLKTSMSRKGNCYDNAPMESFWGTLKNESLSHYRFNNRDEAISVIREYIEIFYNRQRRHSRLGNISPAAFREKYHQMAA >LR134000|752456:773354|754149_755016_-|VDY67220.1|DBSCAN-SWA MTDNTYQPAKVWTWDKSAGGAFANINRPVSGPTHEKTLPVGKHPLQLYSLGTPNGQKVTIMLEELLALGVTGAEYDAWLIRIGDGDQFSSGFVEVNPNSKIPALRDHTHNPPIRVFESGSILLYLAEKFGYFLPQDLAKRTETMNWLFWLQGAAPFLGGGFGHFYHYAPVKIEYAINRFTMEAKRLLDVLDKQLAQHKFVAGDEYTIADMAIWPWFGNVVLGGVYDAAEFLDAGSYKHVQRWAKEVGERPAVKRGRIVNRTNGPLNEQLHERHDASDFETNTEDKRQG >LR134000|752456:773354|760530_761289_+|VDY67225.1|DBSCAN-SWA MDALQTQTVNSTTAPQPNYIPGLIAVVGCDGTGKSTLTTDLVKSLQQHWQTERRYLGLLSGEDGDKIKRLPLVGVWLERRLAAKSSKTQSMKTKSPALWAAVIMYCFSLRRMANLRKVQRLAQSGVLVVSDRFPQAEISGFYYDGPGIGVERATGKISMFLAQRERRLYQQMAQYRPELIIRLGIDIETAISRKPDHDYAELQDKIGVMSKIGYNGTKILEIDSRAPYSEVLEQAQKAVSLVAIVSDRRSLT >LR134000|752456:773354|768611_769763_-|VDY67234.1|transposase|DBSCAN-SWA MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPHERKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRGRRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISGWLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRRHTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATLVDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELARHLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHELDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD >LR134000|752456:773354|770489_770732_+|VDY67236.1|transposase|DBSCAN-SWA MSRRGNCWDNSPMERFFRSLKNEWVPATGYVSFSDAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKNSNAEASFS >LR134000|752456:773354|759800_760499_+|VDY67224.1|DBSCAN-SWA MSIINSTPVRVIAIVGCDGSGKSTLTASLVNELAARMPTEHIYLGQSSGRIGEWISQLPVIGAPFGRYLRSKAAHVHEKPSTPPGNITALVIYLLSCWRAYKFRKMLCKSQQGFLLITDRYPQVEVPGFRFDGPQLAKTTGGNGWIKMLRQRELKLYQWMASYLPVLLIRLGIDEQTAFARKPDHQLAALQEKIAVTPQLTFNGAKILELDGRHPADEILQASLRAIHAALS >LR134000|752456:773354|761334_762120_+|VDY67226.1|DBSCAN-SWA 
MAGFNIKLWFADGAFRTIIRNSAWLGSSNVVSALLGLLALSCAGKGMTPAMFGVLVIVQSYAKSISDFIKFQTWQLVVQYGTPALTNNNPQQFRNVVSFSFSLDIVSGAVAIVGGIALLPFLSHSLGLDDQSFWLAALYCTLIPSMASSTPTGILRAVDRFDLIAVQQATKPFLRAAGSVVAWYFDFGFAGFVIAWYVSNLVGGTMYWWFAARELRRRNIHNAFKLNLFESARHIKGAWSFVWSTNIAHSIWSARNDLTQQ >LR134000|752456:773354|763325_763970_-|VDY67229.1|DBSCAN-SWA MTGENRGRGRPRKTESTYADTRNDLIRSGLELLTQNGFLATGVDAIVKNANVPKGSFYYYFKSKEDYAQTVLNAYDSFFEHKLKKHLHESSCTPMVRLENFINDACEGIKKYNFTRGCLVGNMMQETPGLPQSFIKVLQNILESWQALVAACLSDALSSGEISSNMNNTQLAAIFWSGWEGAVMRSKLYCSTEPVYDFWSYFKTDPTHVIWTQA >LR134000|752456:773354|762462_763329_+|VDY67228.1|transposase|DBSCAN-SWA MKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCDSVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAVVIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQYCSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISREIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA >LR134000|752456:773354|755220_757080_+|VDY67221.1|DBSCAN-SWA MSKGTTSQDAPFGTLLGYAPGGVAIYSSDYSSLDPQEYEDDAVFRSYIDDEYMGHKWQCVEFARRFLFLNYGVVFTDVGMAWEIFSLRFLREVVNDNILPLQAFPNGSPRAPVAGALLIWDKGGEFKDTGHVAIITQLHGNKVRIAEQNVIHSPLPQGQQWTRELEMVVENGCYTLKDTFDDTTILGWMIQTEDTEYSLPQPEIAGELLKISGARLENKGQFDGKWLDEKDPLQNAYVQANGQVINQDPYHYYTITESAEQELIKATNELHLMYLHATDKVLKDDNLLALFDIPKILWPRLRLSWQRRRHHMITGRMDFCMDERGLKVYEYNADSASCHTEAGLILERWAEQGYKGNGFNPAEGLINELAGAWKHSRARPFVHIMQDKDIEENYHAQFMEQALHQAGFETRILRGLDELGWDAAGQLIDGEGRLVNCVWKTWAWETAFDQIREVSDREFAAVPIRTGHPQNEVRLIDVLLRPEVLVFEPLWTVIPGNKAILPILWSLFPHHRYLLDTDFTVNDELVKTGYAVKPIAGRCGSNIDLVSHHEEVLDKTSGKFAEQKNIYQQLWCLPKVDGKYIQVCTFTVGGNYGGTCLRGDESLVIKKESDIEPLIVVKK >LR134000|752456:773354|770928_771072_+|VDY67237.1|DBSCAN-SWA MLRAFPLAKKGEGRARLPSLYASFGKLGETPTHEDIIDNNRSINWPV >LR134000|752456:773354|752456_752951_+|VDY67216.1|protease|DBSCAN-SWA MRILVLGVGNILLTDEAIGVRIVEALEQRYILPDYVEILDGGTAGMELLGDMANRDHLIIADAIVSKKNAPGTMMILRDEEVPALFTNKISPHQLGLADVLSALRFTGEFPKKLTLVGVIPESLEPHIGLTPTVEAMIEPALEQVLAALRESGVEAIPREAIHD |
24 | Acinetobacter_phage(25.0%) | transposase,protease | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those whose annotation contains a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
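The protein records listed for each region carry their coordinates and annotation in the header line. As a rough sketch, the headers can be split into fields as follows. The field meanings (contig, region `start:end`, gene `start_end_strand`, protein accession, optional annotation, source) are inferred from the headers shown on this page and are an assumption, not a documented format; `ProteinHeader` and `parse_header` are hypothetical names.

```python
from typing import NamedTuple, Optional


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    annotation: Optional[str]  # e.g. "transposase"; None when absent
    source: str


def parse_header(header: str) -> ProteinHeader:
    """Split a header such as
    '>LR134000|752456:773354|770079_770493_+|VDY67235.1|transposase|DBSCAN-SWA'
    into its pipe-separated fields (layout assumed, see lead-in)."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected header layout: {header!r}")
    contig, region, gene, accession = fields[:4]
    source = fields[-1]
    # The annotation field is only present in six-field headers.
    annotation = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(
        contig, region_start, region_end,
        int(gene_start), int(gene_end), strand,
        accession, annotation, source,
    )


if __name__ == "__main__":
    h = parse_header(
        ">LR134000|752456:773354|770079_770493_+|VDY67235.1|transposase|DBSCAN-SWA"
    )
    print(h.accession, h.annotation, h.strand, h.gene_end - h.gene_start)
```

Returning a named tuple keeps the parsed fields immutable and self-describing, which makes it straightforward to filter the records of a region by annotation or strand.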
DBSCAN-SWA_3 |
1031253 : 1044436
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >LR134000|1031253:1044436|DBSCAN-SWA TATGCGCATATTGCTGAGTAATGATGACGGGGTACATGCACCCGGTATACAAACGCTGGCGAAAGCCTTGCGTGAGTTTGCTGACGTTCAGGTGGTCGCCCCCGATCGTAACCGCAGCGGCGCTTCAAATTCTCTGACACTGGAATCCTCCCTGCGCACGTTTACCTTTGAAAATGGTGATATTGCTGTGCAAATGGGAACCCCGACCGATTGCGTCTATCTTGGCGTGAATGCTCTGATGCGTCCGCGCCCGGACATTGTTGTGTCCGGAATTAACGCCGGGCCGAATCTGGGGGATGATGTTATTTATTCCGGTACGGTAGCCGCCGCGATGGAAGGCCGTCATTTAGGTTTTCCGGCGCTTGCCGTCTCGCTTGACGGGCATAAACATTACGACACTGCCGCGGCGGTAACCTGTTCAATTTTGCGCGCACTGTGTAAAGAGCCGCTGCGCACCGGGCGTATTCTTAATATTAACGTTCCGGATTTACCCTTGGATCAAATCAAAGGTATTCGCGTGACGCGCTGCGGTACACGACATCCGGCAGATCAGGTGATCCCGCAGCAAGATCCGCGCGGCAATACGCTGTACTGGATTGGCCCGCCGGGCGGTAAATGTGATGCCGGTCCGGGGACCGATTTTGCTGCGGTAGATGAGGGCTATGTCTCCATCACGCCGCTGCATGTGGATTTAACTGCGCATAGCGCGCAAGATGTGGTTTCAGACTGGTTAAACAGCGTGGGAGTTGGCACGCAATGGTAAGCAGACGCGTACAAGCACTTCTGGATCAATTACGTGCGCAAGGTATTCAGGATGAGCAGGTGCTGAATGCACTTGCCGCCGTGCCGCGTGAAAAATTCGTTGATGAAGCGTTTGAACAAAAAGCCTGGGACAATATCGCCTTGCCGATAGGTCAGGGGCAGACAATTTCGCAGCCATATATGGTGGCGCGAATGACCGAATTACTCGAGCTGACGCCGCAGTCGCGGGTGCTGGAAATTGGCACCGGTTCGGGATATCAAACGGCAATCCTGGCGCATCTTGTCCAGCATGTTTGCTCGGTTGAACGGATTAAAGGCTTGCAGTGGCAGGCACGTCGCCGCCTGAAAAATCTTGATTTACATAATGTTTCAACCCGTCATGGCGATGGATGGCAAGGTTGGCAGGCACGTGCGCCGTTTGACGCTATCATTGTTACGGCGGCACCGCCGGAAATTCCAACTGCGCTAATGACGCAGCTGGACGAAGGCGGGATTCTCGTCTTACCCGTAGGGGAGGAGCACCAGTATTTGAAACGGGTGCGTCGTCGGGGAGGCGAATTTATTATCGATACCGTGGAGGCCGTGCGCTTTGTCCCTTTAGTGAAGGGTGAGCTGGCTTAAAACGTGAGGAAATACCTGGATTTTTCCTGGTTATTTTGCCGCAGGTCAGCGTATCGTGAACATCTTTTCCAGTGTTCAGTAGGGTGCCTTGCACGGTAATTATGTCACTGGTTATTAACCAATTTTTCCTGGGGGATAAATGAGCGCGGGAAGCCCAAAATTCACCGTTCGCCGCATTGCGGCTTTGTCACTGGTTTCGCTATGGCTGGCAGGCTGTTCTGACACTTCAAATCCACCGGCACCGGTCAGCTCCGTTAATGGCAATGCGCCTGCAAATACTAATTCTGGTATGTTGATTACGCCGCCGCCGAAAATGGGGACGACGTCTACAGCGCAGCAACCGCAAATTCAGCCGGTACAGCAGCCACAAATTCAGGCTACTCAACAACCGCAAATCCAGCCAGTGCAGCCAGTAGCTCAGCAGCCGGTACAGATGGAAAACGGACGCATCGTCTATAACCGTCAGTATGGGAACATTCCGAAAGGCAGTTATAGCGGCAGTACCTATACCGTGAAAAAAGGCGACACACTTTTCTATATCGC
CTGGATTACTGGCAACGATTTCCGTGACCTTGCTCAGCGCAACAATATTCAGGCACCATACGCGCTGAACGTTGGTCAGACCTTGCAGGTGGGTAATGCTTCCGGTACGCCAATCACTGGCGGAAATGCCATTACCCAGGCCGACGCAGCAGAGCAAGGAGTTGTGATCAAGCCTGCACAAAATTCCACCGTTGCTGTTGCGTCGCAACCGACAATTACGTATTCTGAATCTTCGGGTGAACAGAGTGCTAACAAAATGTTGCCGAACAACAAGCCAGCTGCGACCACGGTCACAGCGCCTGTAACGGTACCAACAGCAAGCACAACCGAGCCAACTGTCAGCAGTACATCAACCAGTACGCCTATCTCCACCTGGCGCTGGCCGACTGAGGGCAAAGTGATCGAAACCTTTGGCGCTTCTGAGGGGGGCAACAAGGGGATTGATATCGCAGGCAGCAAAGGACAGGCAATTATCGCGACCGCAGATGGCCGCGTTGTTTATGCTGGTAACGCGCTGCGCGGCTACGGTAATCTGATTATCATCAAACATAATGATGATTACCTGAGTGCCTACGCCCATAACGACACAATGCTGGTCCGGGAACAACAAGAAGTGAAGGCGGGGCAAAAAATAGCAACCATGGGTAGCACCGGAACCAGTTCAACACGCTTGCATTTTGAAATTCGTTACAAGGGGAAATCCGTAAACCCGCTGCGTTATTTGCCGCAGCGATAAATCGGCGGAACCAGGCTTTTGCTTGAATGTTCCGTCAAGGGATCACGGGTAGGAGCCACCTTATGAGTCAGAATACGCTGAAAGTTCATGATTTAAATGAAGATGCGGAATTTGATGAGAACGGAGTTTAGGTTTTTGACGAAAAGGCCTTAGTAGAAGAGGAACCCAGTGATAACGATTTGGCCGAAGAGGAACTGTTATCGCAGGGAGCCACACAGCGTGTGTTGGACGCGACTCAGCTTTACCTTGGTGAGATTGGTTATTCACCACTGTTAACGGCCGAAGAAGAAGTTTATTTTGCGCGTCGCGCACTGCGTGGAGATGTCGCCTCTCGCCGCCGGATGATCGAGAGTAACTTGCGTCTGGTGGTAAAAATTGCCCGCCGTTATGGCAATCGTGGTCTGGCGTTGCTGGACCTTATCGAAGAGGGCAACCTGGGGCTGATCCGCGCGGTAGAGAAGTTTGACCCGGAACGTGGTTTCCGCTTCTCAACATACGCAACCTGGTGGATTCGCCAGACGATTGAACGGGCGATTATGAACCAAACCCGTACTATTCGTTTGCCGATTCACATCGTAAAGGAGCTGAACGTTTACCTGCGAACCGCACGTGAGTTGTCCCATAAGCTGGACCATGAACCAAGTGCGGAAGAGATCGCAGAGCAACTGGATAAGCCAGTTGATGACGTCAGCCGTATGCTTCGTCTTAACGAGCGCATTACCTCGGTAGACACCCCGCTGGGTGGTGATTCCGAAAAAGCGTTGCTGGACATCCTGGCCGATGAAAAAGAGAACGGTCCGGAAGATACCACGCAAGATGACGATATGAAGCAGAGCATCGTCAAATGGCTGTTCGAGCTGAACGCCAAACAGCGTGAAGTGCTGGCACGTCGATTCGGTTTGCTGGGGTACGAAGCGGCAACACTGGAAGATGTAGGTCGTGAAATTGGCCTCACCCGTGAACGTGTTCGCCAGATTCAGGTTGAAGGCCTGCGCCGTTTGCGTGAAATCCTGCAAACGCAGGGGCTGAATATCGAAGCGCTGTTCCGCGAGTAAGTAAGCATCTGTCAGAAAGGCCAGTCTCAAGCGAGGCTGGCCTTTTCTGTGCACAATAAAAGGTCCGATGCCCATCGGACCTTTTTATTAAGGTCAAATTACCGCCCATACGCACCAGGTAATTAAGAATCCGGTAAAACCGAGAATGGTCGTTAACACTGTCCAGGTTTTCAGACCGTCTGCTACCGACAACCCCAGATATTTGGTCA
CAATCCAGAACCCTGAGTCATTAATATGTGACGCGCCAAGCCCACCAAAGCAGGCTGCCAGCGTCACCAATACGCACTGAATCGGATTCAATCCCATCACCGCTTCTGAGAGTAACCCGCCGGTTGTCAGGATTGCTACGGTTGCTGACCCCTGCGATGCACGCAGCGCCAGTGAAATAATAAATGCGGCTGGTAACAGAGGCAGGTCAATCATTTGTAGCATGTTGGCAAGGGCTTTGCCGACGCCCGATTCCACCAGCACTTTGCCAAATACCCCTCCAGCACCAGTAACCAAAATCACTACCGCCGCAGTGGGAAGGGCTGAGCCCATAATGTCGCTGGTGTGTTGTAAGCTCCAGCCGCGACGTAAAGCCAATAACCAGAATGCCAGCACCAGCGCAATCATTAGAGCTACCATTGGTGAGCCGATCAGCTGTAGCGTACCAAGCAGGGGATGCGAAGGCGGCATCAGTGTTGCGGAAACCGTACCCGCCATGATAATCGCGATAGGAATAACAATTAGCGAGGTGACCAGCGCGACGCCCGGTGGATTTATTTTATCGCTTAATTTTGTCGCGCCTTCCTCACTGGCCGGAGCCAGTTGCATCTGTTCCAGTACTTCTACTGACATCGCATATTGGCGCTTATTGATTATTTTCGCTGCAAAGTAGCCAACAACCCCTACGGGAATAGAAATCGCAATACCGATGATGGTTAGCCAGCCGATGTCTGCGTGGAGTAACCCCGCTGCGGCGACAGGGCCTGGATGCGGCGGTACCGCCACGTGAACAGTGAGCATGATCCCAGCGACAGGCAGGCCAAATTTGAGTGGCGATATTTTGGCAACCTTGGCAAAACCGTAAATGATTGGCGCAAGAATAATAAAGCCGACATCAAAGAAGACGGGAATACCGAGGAAGAACGCTGCCAGAGTCAGCGCAGCGATAGTTCGTTTGTCACCTAACTTGCGACTGAAATAATTAGCCAGTGACTCTGCACCACCAGAGTGTTCGATCATACGCCCCAGCATAGCGCCCAGACCAATAATAATAGTGACGGAACCAAGCACACCGCCCATCCCGGCGATCATCACTTTACCCACTTCGCCCGCCGGTATACCCGCCGCAAGTGCGACTAACAGGCTGACGAGGAGCAGAGCAACGAATGGTTGTACCTTTGCCTTGATGACCAGCAGCAACAGCATGATTACGCCAGTTAACGCAATGCATAACAATGTAATTGTGGACATGGGAAACCCTGTCTGAAAGTTATAGTTAACCTACCCCATCCGTAGATGGGGGGATGTATGGGTACGTTGTAATTAGGGATTTAACGAATTAGCGCCAGGCGTCAAACCAGCCAAGCCCTTCTTCGGTGAGGCCACGAGGTTTATATTCACAACCGATCCAGCCCTGATATCCCACCTCATCGAACAGGCGGAACAGCCACGGATAGTTGATTTCTCCATCGTCCGGTTCATGTCGATCAGGTAGTCCGGCAATTTGTACGTGCGCATATTTCCCGGCGTAGTCGCGGATTAAATGCGTCAGGTTGCCATCTACTTTTTGCGCATGAAAAGTATCTAGTTGAATAAACACGTTATCTCGCGCAACTTCTTCAACAATAGCCAGTGCCTGATACTGGCTGGAGAAGAGATAATGAGGCTTAACGCCGGGGCTGAGTGCTTCAACTAATATTCGCTTGCCGTGTGGCGCAAAGCGGTCGGCAGCGTAGCGGAGATTATCGATAAATACTGCCCGGTACCGTTCAGCATCTTCGCCAGCGGGCACGACGCCTGCCATCACATGGACTTGTTCACAATTGAGCGCCAATGCATATTCCAGTGCCAGGTCGATGTCTGCGTGTGCTTCGTGCTCACGTCCGGGAAGGGCGGATAATCCCCATTCCCCCGCATTAATATCTCCGGGAGCGGTATTGAACAGCGCCAGTGTCAGATGGTTTTGCTCCAGTTGCTTTTGGATTTGCAGGGTGGAGTAGTC
ATAGGGAAACAGAAATTCCACAGCATCGAACCCGGCTTTTCGCGCTGCGGCGAAGCGTTCAATAAAAGGCACTTCGGTGAACATCATGGATAAATTAGCTGCAAAACGAGGCATTGCATTAACTCCTTAATTCCGCAATTTCACCTGCGGTCAGATAACGGATCGGGCGGTCACCGAGAATAAAAATCAGCTTTGCCGTTTCCTCCAGCTCTTCCATATTGTTGGCGGCTTCTTGCAGGCTTTCACCGCAAACCACTGGGCCATGATTTGCCAGTAAAAAAGCCTGATTGTCTGCTGCCAGTTCCGCCAGATCCTGTGCGATGCGTTTATCGCCCGGTCGGTAATAAGGCACCAGCGGGACATTTCCCATCCGCATCACCACGTATGGTGTGAACGGACGAATAACGTTGCTGCTGTCCAGCCCTTGCAGGCAGGAAAGCGCCGTCGACCATGTGCTGTGCAAATGCACCACCGCTTTACAACGCGGATTGTTGCGATACAGCGCCAGATGAAAGAGCACCTCTTTCGAGGGTTTGTCACCACTTAACCATTCGCCATCCGCGGCGACTTTGGAAAGCCGCTGCGGATCGAGATTGCCCAGGCATGAACCTGTCGGTGTCGCCAGTAAATTCCCGTCAGGTAAAAGCAGCGACAGATTGCCAGCCGAACCGGTTGCATAGCCGCGCTGAAAGAATGAACTGGCAATCCGCGTCATCTCCTCTCGCAAAGACTGCTCTACTTTTGCGAAATCGCTCATGATAAAAACTCTCTTTGGGCTCGTGAAAAAAAGGCGTCATCACCGAAGTTGCCAGATTTAAGGGCGAGTGAGACAGGCTTATCCAGTGCGTTTACCCACGGCACGCCGGGGGAAATGGTTGGGCCAATATGAAACCCTTTTATTCCCAGGCTCTGTGTGACTACGCCGGAGGTCTCACCGCCTGCGACAATAAAGCGTGTCACGCCTTCCGCTGCTAACCGCGCCGCTAGTTGAGAAAACAGTGTTTCTACTGCCTGACTGGCTTTTTGTGCACCGTATTGCTGTTGAATTGCTGCCAATGCGTCAGTGCTGGCGGTGGCAAAAACCAGTGGAGCAAGTACACTTTCCTGGCCCAGAACCCACTCTGCCAGTTCGTGTGCATAAGCGGCCAGAGTTTCAATTGAGAGGCAGCGTGCCACATCAACTTCACGGGCTGGTGCAATTTGACGGTAATGTGCTACCTGGCGGTTGGTCATTTGAGAGCATGAACCGGAGAGCACTACGCCGCGCCCAGCGAGCGGACGCCCTGCTTTGCGAGCCTGGTTACCGTTTTCTTGCGCCCACTGCCGGGCCAGGCCAATCGCCAGACCAGAACCGCCCGTTACCAGTGGGGCATCGCGCAAGGCTTCTCCCTGAATTTCCAGATGGTGTTCGGTCAGCGCATCAAGCACCGCGTAGCGGTAGCCCTCTTGCTGTAAGCGAGCCAGCTCTTGACGAACGGCATCCACACCTTGTTCGAAAACATGTGCCGAAACGACGCCGCAGCGCCCTGTGGATTGCGCTTCAACCAGACGGGGAAGATAGCTGTCGGTCATGGGATTTACCGGGTGATGGCGCATCCCGGATTCGGCCAGCAGTTGATTCATTACGAACAAATACCCCTGATAAACCGTACGTCCGTTGACCGGCAGGGCCGGAGAGAAGACCGTAAACGGCGTGTCGAGAGCATCCATTAAGGCATCGGTAACCGGGCCAATATTACCTTTCGCCGTACTGTCGAAAGTAGAGCAGTATTTGAAATAGATCTGTTTGCAACCTTGCTGTTGCAACCAGCTCAGAGCCGCCAGCGATTGCTGTGTGGCTTCAACCACTGGACAGGAGCGCGTTTTCAGGCTGATCACCAGTGCGTCGATTGCTTCCGGCATTTTACCTGTTGGAACACCGTTAATTTGTACCGTTGGTAGACCGTTTTCCACCAGAAAACTGGCGATATCCGTCGCGCCGGTAAAATCATCGGCGATAA
CGCCAATCTTGATCATGATTTCGCTCCCGGTAGAGTGATGCCAGAGAAAATCTTGATAACTGCGCTATCGTCTTCTTTCCCGTAACCCGCGTTACTGGCGCTGGTGAACATATTCAATGCTGTTGAGGCCAATGGCAGCGGGAAGTGCAGGGCTTTGGCTGTATCGGCAACCAGACCAAGATCCTTAACAAAAATATCGACGGCTGAATGCGGGGTGTAATCGCCATCCACCACATGACGCATCCGGTTTTCGAACATCCAGGAATTTCCGGCGGCATTGGTCACGACGTCATACATCACATCCAGCGGGATCCCCGCACGGGCTGCAAGTGCCATCGCTTCGGCTCCGGCAGCAATATGTACGCCCGCTAACAACTGGTGAATAATTTTTACGGTCGAACCTAGTCCCGGTTCTGCACCTATGCGATAAACTTTTCCGGCAACGGCTTCCAGCACGGGTGCCAGTCGTTCAAAGGCAATATCGCTACCGGAGGCCATGATAGTCATTTCACCGTTAGCGGCTTTTACTGCACCACCAGAAACTGGCGCATCCAGCATTTCCAGATCGAATCCAGCCAGAGCGATAGCAATTTCTTGCGCATCAGCACTAGCGATAGTGGAAGAAACCATTACTGCCGTACCGGGTTTCAGATGTTGTGCAACGCCTGTTTCACCAAACAGCACCTGTTTAACCTGGGCCGCATTGACCACCAGCACCAGCAGAGCGTCCAGTTTTTCGGCAAACGTCGCGGCGTTATCAGAAACCCCGCAAGCACCTGCCTCTTTCAACGTAGCGCAGGCATTGCTGTTCAGGTCTGCGCCCCAGGTAGAAAGACCTGCGCGGACACATGACAGTGCTGCTCCCATTCCCATTGACCCTAAGCCAACGATACCGACATGAAACTCAGATCCCGTTTTCATCTGCTCTCCTTGTTAATTTAAGTGATATTTTGTTTGATATTGTGAATATAAGCGCTGGAAGATAACGATATGGTGAGCTGATTCACATAAATTAACATTGTGTGTTATTTTATGTGAACTAAGCGTTAGTTGCGCCGCGCACGTTTCGCAGGCAAATAGCGTAGAATGTCAGCAGGACAAAGGAAGGAGCAAAAGTTGATACCCGTAGAGCGTCGCCAAATCATCCTTGAGATGGTAGCTGAAAAAGGCATTGTCAGTATTGCTGAACTAACGGACAGAATGAATGTGTCACATATGACCATTCGTCGGGATTTACAAAAACTGGAGCAGCAGGGAGCCGTTGTGCTGGTGTCCGGAGGCGTCCAGTCTCCGGGACGCGTGGCGCATGAACCTTCTCATCAGGTAAAAACTGCGCTGGCAATGACGCAAAAAGCGGCTATTGGCAAGCTGGCGGCAAGTCTTGTTCAGCCGGGAAGTTGTATCTATCTGGATGCGGGAACGACCACGTTAGCGATAGCACAGCATCTGATTCACATGGAGTCACTGACTGTGGTCACAAACGATTTCGTTATTGCAGACTACTTGCTCGACAACAGTAATTGCACAATTATTCACACTGGCGGTGCAGTGTGTCGGGAAAACCGTTCCTGTGTCGGGGAAGCCGCTGCGACCATGCTGCGCAGCCTGATGATTGATCAGGCTTTTATTTCTGCATCGTCATGGAGTGTGCGGGGGATTTCTACGCCAGCGGAAGATAAAGTCACGGTGAAACGGGCGATTGCCAGTGCCAGCCGCCAGCGAGTTTTGGTCTGTGATGCGACGAAATATGGTCAGGTGGCGACATGGCTGGCGTTACCATTAAGCGAGTTTGATCAGATTATCACAGACGACGGTTTGCCGGAGAGTGCCAGTCGCGCGCTGGCGAAGCAGGATCTCTCTTTGCTGGTAGCGAAAAATGAATAATGGCCTGCAATAACATTTGGTTACTCATGCTTCACAGAAGAAGCATGAGACTACTTTATTTTATAAAATGACAGCCGCCCGCTTTTCGGCGTGCCGGTATCAATATAAATCTGGTT
AGCGAACGTCTGAATGTTATCAAACATCATATGTCCAAATATAAAATAATCAGCGCCGTTTATTTGTTGTAACTCGCCATTAAGCGATTTCTGCACACGATCAACAGGCCAGAGTAATTCGCTTTCTACTATTTCTTTACCAAACTGATATTCATCACCCGGATAATCTGCATGTGCGATGACATATTTTATGTTGTCATTAATGATTTCAATAATATGTGGAAGGTGATGGAATTTCAGCAACAGATCTATTGCCTCTTGTTGCTCTGAATCATTTAAATCGAAAAACCAGTCACCACCGCTGGCAAGCCACATATTTCCATCGCCGGTCGCGAACGCATCCAGCGCCATTGCTTCGTGGTTGCCTTTAACCGGCGTAAACCAGGGCTGGTTTAGCAGGCGCAGCACGTTAAGACTCTCCGGCCCACGATCGATGTTATCGCCTACCGAAATAAGTAAGTCGGTTTCAGGGCAAAAAGAGAGTTGATGTAAGCGGGATTGTAATAATTGATATTCACCATGAATATCACCAACGACCCATATATGGCGATAGTGATGGGCATTAATTTTTTGATAGCGTGTAGATGGCATGGTTTTACCCTGTAAAATAAGCTTTCCTATTATACAGGGTGTTTTTATTTTATTCGTCAGTTGTCGTTAATATTCCCGATAGCAAAAGACTATCGGGAATTGTTATTACACCAGGCTCTTCAAGCGATAAATCCATTCCAGCGCCTGACGCGGAGTCAGTGAATCCGGGTCCAGGTTTTCCAGAGCTTCGACCGCTGGCGAGGTTTCTTCCGGTACTGACAGCAAAGACATTTGCGTACCATCCACTTGCGTAGCGGCGGCGTTCGGCGAAATGCTTTCCAGCTCACGCAGTTTTTGCCGTGCGCGCTTAATCACCTCTTTTGGCACGCCTGCCAGAGCTGCAACCGCCAGGCCGTAGCTTTTGCTCGCCGCGCCATCCTGCACGCTGTGCATAAAGGCAATGGTGTCGCCGTGCTCCAGTGCATCGAGATGCACGTTGGCGACGCCTTCCATTTTCTCCGGTAACTGTGTCAGCTCGAAATAGTGGGTGGCAAACAGCGTCAACGCTTTAATCTTATTCGCCAGATTTTCCGCGCACGCCCACGCCAGCGACAGACCATCGTAAGTGGAAGTTCCGCGCCCAATCTCATCCATCAGCACCAGACTGTACTCGGTGGCGTTATGTAAAATATTGGCGGTTTCAGTCATCTCCACCATAAAGGTTGAGCGCCCGGAAGCCAGGTCATCTGCCGCGCCTACGCGGGTAAAGATACGGTCAATCGGGCCAATCTCGACTTTTTGCGCCGGTACGTAGCTGCCGATATAGGCCATCAGCGCAATCAACGCGGTCTGGCGCATATAGGTACTTTTACCGCCCATGTTCGGACCGGTGATGATCAACATACGGCGCTGCGGCGACAGATTCAGCGGGTTAGCGATAAATGGCTCATTCAGTACTTGTTCAACTACCGGATGGCGACCTTCGGTAATGCGAATGCCCGGTTTATCAATGAAGGTCGGGCAGGTGTAGTTCAGGGTATAGGCCCGTTCCGCCAGGTTAACCAGCACGTCGAGTTCCGCCAGCGCGCTCGCGCTCTGTTGCAACGCTTCCAGATGCGGCAACAGCAGGTCGAACAGCTCTTCATAAAGCTGTTTTTCCAGTGCCAGTGCTTTGCCTTTTGAGGTGAGAACTTTATCTTCGTACTCTTTTAGCTCTGGAATGATGTAGCGCTCGGCGTTTTTCAGCGTCTGGCGACGCATGTAGTTGATGGGTGCCAGATGGCTTTGCCCACGGCTGATTTGAATGTAGTAGCCGTGCACCGCATTAAAGCCAACTTTCAGCGTGTCCAGGCCGGTACGTTCACGCTCGCGGACTTCCAGACGCTCCAGATAATCGGTCGCGCCGTCAGCCAGCGCGCGCCACTCATCCAGCTCTTCGTTATAGCCCGATGCGATAACACCACCGTCGC
GTACCAGCACCGGCGGTGTGTCGATGATTGCTCGCTCCAGCAGATCGCGCAGCTCGGCAAACTCGCCCATCTTCTCACGTAGCGCCTGTACCGGTGCACTATCGACAGTTTCTAACTGCGCACGCAGCTCCGGCAGTTGCTGGAAAGCGTGGCGCATACGGGCCAGATCGCGTGGGCGAGCAGTTCGTAAAGCCAGACGTGCCAGAATACGTTCCAGGTCGCCGACCTGACGCAGTACCGGCTGTAGCCCGGCGGTGAAATCCTGCAATGCGCCAATAGTTTGCTGGCGCTCAAGCAACACGCGGGTATCGCGCACTGGCATATGCAGCCAGCGTTTCAGCATACGGCTGCCCATCGGCGTGACGGTGCAGTCGAGCACAGAAGCCAGCGTATTTTCCGCACCACCCGCCAGGTTCTGGGTGATTTCCAGATTACGACGCGTCGCGGCATCCATAATGATGCTGTCCTGCTCACGTTCCATGGTGATGGAACGAATATGCGGCAGAGTCGTACGTTGGGTATCTTTCGCATACTGCAACAGACAACCGGCAGCACAAAGTCCGCGCGGCGCGTTCTCGACGCCAAAACCGACCAGATCGCGGGTCCCAAATTGCAGATTCAACTGCTGGCGCGCGGTGTCGATTTCAAACTCCCACAGCGGGCGACGGCGCAGGCCGCGACGGCCTTCAATTAACGACATTTCAGCAAAATCTTCTGCATACAGCAGTTCCGCAGGATTAGTGCGTTGCAGTTCTGCCGCCATCGTTTCGCGGTCAGCCGGTTCGCTCAGGCGAAAACGCCCGGAACTGATATCCAGCGTCGCGTAGCCGAAACCTTTGCTGTCCTGCCAGATAGCCGCCAGCAGGTTGTCCTGACGCTCCTGCAACAGGGCTTCATCGCTGATGGTGCCTGGCGTAACGATACGCACAACTTTGCGCTCAACCGGACCTTTGCTGGTCGCCGGATCGCCAATTTGTTCGCAGATGGCAACGGACTCTCCCTGATTCACCAGTTTGGCGAGATAGTTTTCCACCGCATGGTAGGGAATCCCCGCCATCGGGATCGGCTCTCCCGCCGAAGCACCGCGTTTGGTCAGTGAAATATCCAGCAGTTGCGACGCGCGTTTTGCGTCGTCATAAAACAGTTCATAAAAATCACCCATCCGGTAAAACAGCAGGATCTCGGGATGCTGGGCTTTCAGCCTGAGATACTGCTGCATCATGGGCGTATGGGCGTCGAAATTTTCTATTGCACTCAT
Protein sequences of DBSCAN-SWA_3 >LR134000|1031253:1044436|1041112_1041769_-|VDY67484.1|DBSCAN-SWA MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFCPETDLLISVGDNIDRGPESLNVLRLLNQPWFTPVKGNHEAMALDAFATGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEIINDNIKYVIAHADYPGDEYQFGKEIVESELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGTPKSGRLSFYKIK >LR134000|1031253:1044436|1036515_1037292_-|VDY67479.1|DBSCAN-SWA MPRFAANLSMMFTEVPFIERFAAARKAGFDAVEFLFPYDYSTLQIQKQLEQNHLTLALFNTAPGDINAGEWGLSALPGREHEAHADIDLALEYALALNCEQVHVMAGVVPAGEDAERYRAVFIDNLRYAADRFAPHGKRILVEALSPGVKPHYLFSSQYQALAIVEEVARDNVFIQLDTFHAQKVDGNLTHLIRDYAGKYAHVQIAGLPDRHEPDDGEINYPWLFRLFDEVGYQGWIGCEYKPRGLTEEGLGWFDAWR >LR134000|1031253:1044436|1037931_1039194_-|VDY67481.1|tRNA|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQARKAGRPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSIETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPGVPWVNALDKPVSLALKSGNFGDDAFFSRAQREFLS >LR134000|1031253:1044436|1032008_1032635_+|VDY67475.1|DBSCAN-SWA MVSRRVQALLDQLRAQGIQDEQVLNALAAVPREKFVDEAFEQKAWDNIALPIGQGQTISQPYMVARMTELLELTPQSRVLEIGTGSGYQTAILAHLVQHVCSVERIKGLQWQARRRLKNLDLHNVSTRHGDGWQGWQARAPFDAIIVTAAPPEIPTALMTQLDEGGILVLPVGEEHQYLKRVRRRGGEFIIDTVEAVRFVPLVKGELA >LR134000|1031253:1044436|1039190_1040099_-|VDY67482.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNAAQVKQVLFGETGVAQHLKPGTAVMVSSTIASADAQEIAIALAGFDLEMLDAPVSGGAVKAANGEMTIMASGSDIAFERLAPVLEAVAGKVYRIGAEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS >LR134000|1031253:1044436|1035062_1036427_-|VDY67478.1|DBSCAN-SWA 
MSTITLLCIALTGVIMLLLLVIKAKVQPFVALLLVSLLVALAAGIPAGEVGKVMIAGMGGVLGSVTIIIGLGAMLGRMIEHSGGAESLANYFSRKLGDKRTIAALTLAAFFLGIPVFFDVGFIILAPIIYGFAKVAKISPLKFGLPVAGIMLTVHVAVPPHPGPVAAAGLLHADIGWLTIIGIAISIPVGVVGYFAAKIINKRQYAMSVEVLEQMQLAPASEEGATKLSDKINPPGVALVTSLIVIPIAIIMAGTVSATLMPPSHPLLGTLQLIGSPMVALMIALVLAFWLLALRRGWSLQHTSDIMGSALPTAAVVILVTGAGGVFGKVLVESGVGKALANMLQMIDLPLLPAAFIISLALRASQGSATVAILTTGGLLSEAVMGLNPIQCVLVTLAACFGGLGASHINDSGFWIVTKYLGLSVADGLKTWTVLTTILGFTGFLITWCVWAVI >LR134000|1031253:1044436|1034135_1034969_+|VDY67477.1|DBSCAN-SWA MLDATQLYLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILADEKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQGLNIEALFRE >LR134000|1031253:1044436|1032774_1033914_+|VDY67476.1|DBSCAN-SWA MSAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMGTTSTAQQPQIQPVQQPQIQATQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSGSTYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVGQTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPAATTVTAPVTVPTASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRVVYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR >LR134000|1031253:1044436|1037296_1037935_-|VDY67480.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS >LR134000|1031253:1044436|1040294_1041062_+|VDY67483.1|DBSCAN-SWA MIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGSCIYLDAGTTTLAIAQHLIHMESLTVVTNDFVIADYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALAKQDLSLLVAKNE >LR134000|1031253:1044436|1041874_1044436_-|VDY67485.1|DBSCAN-SWA 
MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAGLQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >LR134000|1031253:1044436|1031253_1032015_+|VDY67474.1|DBSCAN-SWA MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVTCSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
12 | Escherichia_phage(50.0%) | tRNA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
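The keyword rule in the legend above can be sketched in Python. This is a minimal illustration, not the DBSCAN-SWA implementation: the header layout (contig, region, `start_end_strand`, protein ID, optional keyword, tool name) is inferred only from the FASTA headers shown on this page, and `parse_header` and `is_phage_related` are hypothetical helper names.

```python
from typing import NamedTuple, Optional

# Keywords copied from the legend; matching is done case-insensitively.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


class ProteinHeader(NamedTuple):
    contig: str
    region: str
    start: int
    end: int
    strand: str
    protein_id: str
    keyword: Optional[str]  # None when the header carries no keyword field


def parse_header(header: str) -> ProteinHeader:
    """Split a protein FASTA header as it appears on this page.

    Assumed layout (an inference from this record, not a documented spec):
    >contig|region|start_end_strand|protein_id[|keyword]|tool
    """
    fields = header.lstrip(">").strip().split("|")
    contig, region, gene, protein_id = fields[:4]
    start, end, strand = gene.split("_")
    # Anything between the protein ID and the trailing tool name is
    # treated as the keyword annotation.
    extra = fields[4:-1]
    keyword = extra[0] if extra else None
    return ProteinHeader(contig, region, int(start), int(end), strand,
                         protein_id, keyword)


def is_phage_related(header: ProteinHeader) -> bool:
    """True when the header's keyword field is one of the legend keywords."""
    known = {k.lower() for k in PHAGE_KEYWORDS}
    return header.keyword is not None and header.keyword.lower() in known
```

For example, `parse_header(">LR134000|1031253:1044436|1037931_1039194_-|VDY67481.1|tRNA|DBSCAN-SWA")` yields a record whose `keyword` is `"tRNA"`, while headers without the extra field, such as the one for VDY67484.1, yield `keyword=None`.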
|
DBSCAN-SWA_4 |
1124161 : 1134272
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >LR134000|1124161:1134272|DBSCAN-SWA GTCAGGTTGTTGCAGGGTCGTCACATTTTGGCAGCCAGTCGCCGTAGCTTTCCTCTTTCAGCGTCAGGTTGGTCTGTATCCCCTGTTTGGTATGGCGCTTCTCGTAATTCAGTCCGTATTCCTTCAGCATCACCGGCAGCCCCAGCCCGAACATTTTCAGACTGAGTACATTCCGGTAGCCGTTTGCCTCCATGTAGGCCAGATAGGCGTGATAGAGGTATTTACGGTAATTACGCGGGATGATACTGGCGTTCCCCATATACATGCCGCTGGTCTGCGGCAGGGTTCCCAGATAGCCGATAAAATCAAACGTCGGGTCGGCATCCCGTTTGATGTTCAGCGCCTCGTCTGAGTTCTGCTGGGACTGAAGCAGTGACCGGGCGAGCATCGGGTCGCTGAACTTCTGCATCAGGTGACGCACGATGACCGCCAGCTCGCGGGTGATTTTGTCCTTGAGCTGCGGGTCGCGCTCCTGCGGGGCTATCTGTTCCGGGAAGTGAATAATCACCCGCCGGCGTGACACGCCGCCGCTGCGGTCGGTGAAGCGCATCGGGTTATTGTTCACGGCCAGAATCACCGCCGGGATATGCGTGGAGTACGCATCCCGGTATTTCGGGTCCACGGACACCGCATCGCCGCCGGTGATGGCCTTGAGTCCGGCACCGTCGCCGCTCCATTTTTCCTGGTCCGGCAGGCGTATCAGTGAGAAGCCAGTTAACGCGGCACGTTCACGCGGGGATTCCAGCGTCTCGATGGTGGCCGACGTGGCGTTATCCTCCCCGGCCAGCAGGGTGGCGATTTCGGCCATGATACTTTTGCCGCTGCCGCCGGGACCGGTCACCTCCAGAAAGAGCTGCCAGTCGTAGCGGTTTGCCAGCACCATAAACAGTGCGGCCAGAATCACGTCGCGTTTTTCCGCACGGCCACCGGCGGCACGGTCAAGCCAGCGCCAGAACGCGGGGGCGTGGTTTTCCAGCGTTTCCCCTTCCACCGGCGGGGTGAAATCCACATCGCACAGGGTGCGCATCCAGTGTGACGGACTGTGCGGGTGGAACGTGCCGTTCTGCGTGTCGAGCACGCCGTTACGAAAGCCAATCAGGCGGCGGGAGGGGGCTTCCTGCTGCGGAATAATCAGCTTCAGGGTGTCCACCACGGAGGCCACCTTCCCGGAGGAGAACGGCGCGCGCAGACGCTGAAACAGCCCGGCCACATCCCGGGCAAAGTCCTGTGGCGGCAGCACCTTCCAGACACCATTTTCATAGCGGGACAGAAGCTGGCCGTTGGCATCGACCGCGAGCGCCTCGCCGTAATGCTCATAGATACGCATGGCCTTTTCGCTGGTACTCATGGCGGAAAACTCCGCTTCGCTCATGGTGTCGAACGGGCTTTCAGCCGGTGGCCGGATGGCATCGTAAATGGCCTTACGGGTGGCCTCCCCGCCGTACTGCGTGAAGGCATCATTCCAGTCACCGAAGACCGGCGGCAGGGCAACAACACCTTCACACGCATCTGCGGCAGCGGCGGCTTTTTTCTGGCCGTCACCGCTGAGGTCACGGTCTGCGGCAAGGACAATCTGACAGGCCGGATGCTTCTGCCGGGCAAGGCTGGCCAGAGAAAGGAGGTTCACGGAAGAAAGCGCCACCATCACCGTTTCACCGGTCAGGTGATGCACGGTAAGTGCGGTCGCGTATCCCTCCGCTATCCACAGACGTTTTCCGGCCTGATTCTGTCCTTCAAGGATGTGACAGGTGCCCCTGACCTGTCCGCCTTTCAGGGTGCGCTTACGGCCGTCAGCACTGATTAACTGAAGGTTAACCAGTTCGCCGCTGTCGTCATACAGTGGCACCACAAGGTCACCGGCGCGCCAGCTCACGCCACCGGCTCTGTGTGTGCCGGTCAGCATCCGGCATTCCCGGCCGGGAAAGC
CCTTGCGGGTCAGGTAGGCGTTACCGGTTCCGGGACGGGTTTTCGCCATCAGGGTTTGTGCCAGTGCGGCGGCGTTCTTCCGGGCGGCTTCTGTTTCAGCAACGGCGGCGGCCGTCACTGCCGGGTCAGCCGGTGGCAGGCTGCCGGTCACGGCAGCCACCTTTGCGGCCGCGTCGGACGGGGAAACACCAAACACCTTTTCAACCAGTTTCAGGCCGTCACCGGCACCACACTGATTGCAGTACCAGGTGCCGCGCCCCTCCCTGTCATCAAAACGGAAGCGGTCACTCCCGCCACAGACCGGACAGGGCTGATGACGGTTTTTCAGCACCTGAATCCCCAGCGCCGGGAGAATACGCGGCCAGTGGCCGAGCGCATGGCTGACGGTGGCGGTTACGTTCATTTTCATGGTGTTGTTCTCCTTCAGTGCAGTACCGGCGCTTTTATGTGACGGGCACAGAGTTCATCCATCACAACCAGCCCGAGAAAGGACAGCGACGGCGCGGCCTTCAGGGGGCCGGATTCCATTAAATCTTCCAGCAGGGCACAGGCTATCTGACGCCCTTTTTCCTCACCGTGCTGGCGCAGATAAAAGCCCTCCAGCTCAGCGGCGATGGCCGCCTCCAGTGATTCAAGGGTGAGATGCGGGTAGCGGTGCTGACGTTCGCACACGGTCAGCCAGGCACAGGCGACAGCGCGACGGTAAAGGGCAGCGCGTAAGACGGGCGGTAAGGGTGTTTTCATTTGTTTTCCTCCCTGTGACAGATGACTGCATTCCGTGCCGGTTGCATTAACTGATAAGGCATATCTGCGTCTCCTGAAGACGTGCGTATCCCTGCGCGAATACGCACATTTAATTTTTCGGGTGTCGTTTTTTAATTACAGATAATTGCGGTAACTGTTATCCGGGGTGATTTCCGGGTCAGGCTCCGTACGGGGAATTTCCCGCCATTCCCGCGCCACCGGTGCCGCCCGGCTGGCCGGAACAGTGTCCTGCGGGTAAATATCCAGATATTTCTCCCGCCATTTCTGTAATTCCGGGTCTCCGGCCATTTCTTTCAGTACCGCATGCCGGTTTACGGGGCTGCGTTTAAACAGGTCAGGACGGTCACAGGTAAATTCCCGCAGAAAACGCCCCAGCGGGATGTCTGTGGTGCGCCCGTCGGCGAGGATACGCACAAGGATACTGAATTTACGGCGGTACGGGTTCCAGACAATGTCCGGGCAGCGGTACGGCATTTCCCACGGAATACCGTCTTCCAGAATGCCGACCACGGCCACATCGGGAAAACCGGCAGAACGGTAAATCTCACCGGGCTGGGGAAAATCAAACATGCGTCCTGTCTCCCCGGTCTTTCTGCTGGGCGAGAAAATCGCGGCACAGGCCTTTGGCTTTCAGCTCATTCAGCACAAAATCAATATCTTCATTCAGGTAGCTGAAAATATGCGGAATGTAGAGCTGATGCAGGCCGGAGAGTTCACGGTGAATCAAATCACCCCCAACAAACTGGGATACGGCGCTGGCGCGGTTGAGCTTATGGTAAGCCTCAATGCTGAGGTGTTCACGGGCGTCATGACGTGCTGAGACGGTCTGAGGGGCTTTTTTATTACGCACGGGACACCTCCACCACCGGCAGACGGGCAGCAAGGGAGAGCACATAGTCACGGACAAGGGAACGGCGGGCGCTGCGTTCATCACCGGCGACGGTGCGAAGCATGCAGATACGGGGATGACGGTCTGCGCGACGGACAGCCGCAAACACAAAGACAAATTCAGGGTGTGAGGGGGTAAGGGTTGTAGCCATAAGGCAGCCTCCTTCGAGTAGCAAATAACTGCTATCGCCGGAGTTCTCACGCTCGATGGCGATAGCCCAGACGGGGGTGAGAATACCGGCCTCGAAGAATACCGGCCAGCCCGGAGGCTGCCCCGCCTGAGCTACCATTGACTCAGTGGCATAACATGCGATTGCGAACAGGATCATACCTGCACGGCAAACCAC
ACGCCACACCATAATCTGGCGCTCTGTGGCGTTGATTGCGACACAAAAAAAGACGCATGGCGCGTCATATGTCGCCTTCGAGTTACACGGGTTCTCACGCCCGGCTGCCGATTTTGCGGCAACGGAAAAACTATATCCGCAAATGCCGGAAAAAGGCAAGCCAGAAAAAGGGAGTTTTTGCAGAGCGGGCATCATCATGCGTCGTACCCCCGTTTGCGTCCGGCAATGCGTCCGGCCATCCATGCGGTGACTTCAGAGTGCAGCCAGGCCACATTTTTACCGCCAAGACTCACCTGCGGCGGAAATTCCCCCTTACGGATGAGTTCATAGATGGTCGAGCGTGACAGGCCGCACAGGTGCATCACTTCCGGCAGACGTAAAAAACGCTCCTGCGTGATGTCCGGCAGCGGCATCAGTGGCGTCACAGGGGCGGGAGACGGGGAAGAAAAAACAGCTTGCATCGGGCTACCTCGTTAATGTCCATACAGCACCGGATAAGTCCGTCCGGCTTCGGGTAGCGCTTTATTTTGTGAATATTTTTGGCAGACGCAACAGGGGGGATTTGTTCCGGCAGCCTTACAATGCTTGTGTGTTTTTTGTTCATCTCCACTTAAAGTCATTTAAAGCCACTTAAAGCAATTCGTAATTTTTATAGTGAAATACAAATCGTTTTTTCTTATTCATTCCCGGCGAATTAATAAAAACAAACAGTAATAAACAGCACAAAAAGCCCATCAACGGGTGAACAGTGGTGAACAGACGGTGAACAGTCATTACTGCGATTGTTCACCCTTTAACTTACTGTATTACTTATCTTTTTTCTTATGGTGAACAGAGGTGAACAGTAAAATATAAAAAAACAAACAGTAAGCCGGTTTTTCCTGCGACCTTTTCCTGGCTTGCCGGTCTGAGGATGAGTCTCCTGTGTCAGGGCTGGCACATCTGCAATGCGTCGTGTTGTTGTCCGGTGTACGTCACAATTTTCTTAACCTGAAGTGACGAGGAGCCGGAAAATGTCTGACAACACCATCCCTGAATATCTGCAACCCGCGCTGGCACAACTGGAAAAGGCCAGAGCCGCCCATCTTGAGAACGCCCGCCTGATGGATGAGACCGTCACGGCCATTGAACGGGCAGAGCAGGAAAAAAATGCGCTGGCGCAGGCCGACGGAAACGACGCTGACGACTGGCGCACGGCCTTTCGTGCAGCCGGTGGTGTCCTGAGCGACGAGCTGAAACAGCGCCACATTGAGCGCGTGGCACGCCGGGAGCTGGTACAGGAATATGACAATCTGGCCGTGGTGCTGAATTTCGAACGTGAACGCCTGAAAGGGGCGTGTGACAGCACGGCCACCGCCTACCGGAAGGCACATCATCACCTTCTGAGTCTGTATGCAGAGCATGAGCTGGAACACGCCCTGAATGAAACCTGTGAGGCGCTTGTCCGGGCAATGCATCTGAGCATCCAGGTACAGGAAAATCCGCTCGCCAACACCACCGGCCATCAGGGCTACGTCGCACCGGAAAAGGCTGTCATGCAGCAGGTGAAATCATCGCTGGAACAGAAAATAAAACAGATGCAAATCAGCCTCACCGGCGAGCCGGTTCTCCGGCTGACCGGACTGTCAGCGGCAACACTCCCGCACATGGATTATGAGGTGGCAGGCACACCGGCACAGCGCAAGGTGTGGCAGGACAAAATAGACCAGCAGGGAGCAGAGCTTAAGGCCAGAGGACTGCTGTCATGATTTACTGCCCGTCGTGTGGACATGTTGCTCACACCCGTCGCGCACATTTCATGGACGATGGCACCAAGATAATGATTGCACAGTGCCGGAATATTTATTGCTCTGCGACATTTGAAGCGAGTGAAAGCTTTTTCTCTGACTGTAAAGATTCAGGAATGGAATACATTTCAGGCAAACAGAGATACCGCGATTCACTGACGTCAGCCTCCGGCAGTATGAAACGCCCGAAAAGAATGCTTGTTACCGGATATT
GTTGTCGGAGATGTAAAGGCCTTGCACTGTCAAGAACATCGCGGCGTCTTTCTCAGGAAGTCACCGAGCGTTTTTATGTGTGCACGGATCCGGGCTGTGGTCTGGTGTTTAAAACGCTTCAGACCATCAACCGCTTCATTGTCCGCCCGGTCACGCCGGACGAACTGGCAGAAAGCCTGCATGAAAAACAGGAACTGCCGCCAGTACGGTTAAAAACACAATCATATTCGCTGCGTCTGGAATGAGGGCTGCCGGTTAACCCCGGCCGTCGCCGCACACCGTATTTTTATTCTTCAGCATGATGAGAAAGAGATAACGATGGAAAGCACAGCCTTACAGCAGGCCTTTGACACCTGTCAGAATAACAAAGCAGCATGGCTGCAACGCAAAAATGAGCTGGCTGCGGCCGAACAGGAATATCTGCGGCTTCTGTCAGGAGAAGGCAGAAACGTCAGTCGCCTGGACGAATTACGCAATATTATCGAAGTCAGAAAATGGCAGGTGAATCAGGCCGCCGGTCGTTATATTCGTTCGCATGAAGCCGTTCAGCACATCAGCATCCGCGATCGGCTGAATGATTTTATGCAGCAGCACGGCACAGCACTGGCGGCGGCACTGGCACCGGAGCTGATGGGCTACAGTGAGCTGACGGCCATTGCCCGAAACTGTGCCATACAGCGTGCCACAGATGCCCTGCGTGAAGCCCTCCTGTCCTGGCTTGCGAAGGGTGAAAAAATTAATTATTCCGCACAGGATAGCGACATTTTAACGACCATCGGATTCAGGCCTGACGCGGCTTCGATGGATGACAGCCGTGAAAAATTCACCCCTGCGCAGAACATGATTTTTTCGCGTAAAAGTGCGCAACTGGCATCACATCAGTCTGTGTAAAACTCCCCGAAAATCCGCCCGTTTTTACTGAAAAAAGCCATGCATCGATAAGGTGCATGGCTTTGCATGCGTTTTCCTGCCTCATTTTCTGCAAACCGCGCCATTCCCGGCGCGGCCTGAGCGTGTCAGTGCAACTGCATTAAAACCGCCCCGCAAAGCGGGCGGGCGAGGCGGGGAAAGCACTGCGCGCGGAGTAACATAAAATTTTTATAACGCTTTCTATCTATTGTTAATAACACTCTCTAATGCAGATTGGATGATGAGTATATAATGAAGTACTCCCACAATGTTATTGATGCTGTGAAGCAGTACTATTACTGCTAGCGTTTCGTTATACATTTGGCTGAAACTTAGCCAAGATCCTCATAGGAGAACCTTGGCTGGAATGGAATGGAATGGAATGGAATGGAATGCTACTTTCGTAATACTCTAATTTCTTCCTTTCTTTTTATGAACTTAACAAAATAGTTGGAAATGAATATAACAACCCCCATTCCAAATATCACTTTTAAAACATCTATTAAAGAGGTGTTAAAATAAGATAGTTTGCTGTTTAAAGTGGATATGGGCTTTAAATTCATAATCTTAGTGTTAAGGGCTAGTTTTTCCGCAGTACCAAAAATTTTAATTTTGTTATTTTTGGTGACAAATCCAACAAGGTTATTTCCGGTTGCATATAGGAGGTCGGAGTCAGAAACATCTATCTTCTTATTGTTAACCTTTAGTTCTTCAAGCCCGATCAAATTAATGTGTTTTAGATAGCCATTACTAATATCCGAAAGAGGGGCTTTACTGTCCTTTGTTTTCGTGAAGTCACCGTTTTTTATATTAATTGTCAAAAGTGGGACGTTAGATAAAGTGAGGTGACCTTTAGGAATGAATCTAAAAGCAATTGATGCAACACCAGTAATGTTATTCTTAGTAGGAATTCTATTATCTGGTGTAACTATTTGACGACATTTTAATTCTTTTTTATTTATTATATTGTACTGTTGCACTATATTGGTGATTAAATTTTCTTGCTTGTTGATGGTTAATTTGTATTCTGTTTTTTCTAAAAATGATTTTTCATCAAATGGTGCTATGTTAACCATTAATGTAAATGGCAA
ATAAGTACTTTTAAATTTAGCTGAACCATCCTCAATTTGACTTCTTATTTTGTCTTTGTTGCTATAGTTGACCTCTTTTGAGTTTTGAATTGATGTTGAAAGTTTTTCTCCCTTGATGAAATAGATCACTGGTCTGTCAGATGAAAACTCTAATGTTTTTACATCAATTAATTTCACAGGTTGTTGTTTGTACTTATTGAGTGATGTTGAGTTTAAAATACAATCAGAGAATAGATAAGCTTCGACATCATAACTTTGATCGCCAGACATCCATGTAAAATTGTTATTTAAATAGGACTGTATGGTTATCTCATCATTTATCGTTTTTGAGGTAGTGAGGAAACCAAAACCCTCATCATCCGCAGTATGTATCGAACCTAATAATTTTTCATTGAAAAGCGCATCTATATGAACAGGACTTCTTTCATAAATTAGATATAAAAACCCATAAATCATAAAGCTAAGTACGAAGGAGGAGGATAGTGGAATGATTGTAGCAATACCAATGCTAGTTTTGGTATTTTTCAATAAATAACTAATGCTCGCAAGGTTTAATAAAACGAATAGGGAAAGAAAAAATCCACCAAAACTATTAAACATCAAGTTTTCAATGGTGTAAGGAATTAAACTTATATTTTCCTTTGCTAAGGTGACGTCAAATACTGTAATAGTAAATGCTGCCGCCAGCCAAGTTGTAATAAGTATTTTAATCCTTTGTTTAAATGCCAATGAAAATATGAAAATAATCATCGACATGGTGAGGTAATAAATGTCAGGTTTATTGCTAATAATCTTTGTGAAATCCTCAACTTCAGTAAAGCAGTTTTCACATGAATAAATTCCGAATGAACTTATATAGCCTTGAGCAAGAGCACCAAAGATTAACAGTAAATAAATTGAATATGAAAAGCTCAGAAAAAGAAAAGTAAAATTCAACCAATTAATAGATGTCAGTGTATTTTTAATGTGGTGCTTTATATTATTTTTCATTTGTAATTTTTCCCTTCAACATAGTCTCCCCACCATTGCATAAGCTCAATTCTTTTATTTAAATAGGTGGTTCGGTTGTAAGCCCTTCTGACCTCATTTTTATCAGAATGAGCAAGGCTAGCCTCAATAACGTCTGCATTGAACCCTGCTTCATTCATTGCTGTACTTGCAATAGATCGGAGGCCATGAGCAACTAGCTTTCCTCCAAAACCAATACGTTTTAAAGCTGCATTAGCAGTTTGGCTATTCATTGATTGCTTAGGATTATTTCTGCTAGGAAAAATATGTTCACGATGAGCACTGATGGGCTTCATCACTTCCAAAATATCTAAAGCTTGGGGTGATAAAGGAACAATGTGCTCTCGTTTGGCTTTCATTCTCTCTGCCGGAATAGTCCAGAGCTTTGTATCAAGATCGAGCTCTGCCCAACGAGCACCAGAGGCTTCAGAAGGACGCACAAGTGTCAGGAGTTGCCATTCAATGAGACAGCGAGTCGAAACAGATAGGTTTGACATAGCTAAAGAACGCATCAGCTTCGGCAATTCTTCTGGCCGCAATGTTGGCATGTTTTGCTTTTTGGGCTTCTCAAAGGCCATCCCAACCCCCGATGCTGGATTGGCATCAATCAGACCAGTGTTTACAGCGTAAATCATTATCTCGTTAATCCGCTGTATCAGTCGACGTACAGTTTCAAGAGCCCCACGAGCTTTGATTGGCTCAAGAGCTTCGACTAGCGTACGGGCTTTGATTTGTTGGACGGGAATCTCTCCAATGGCTGGGAATATGTCTTTTTCCAGCGAGCGCCATATATCTTTAGCGTAATCAGGGGAAACACTTTTGCTTTTTATCTGGAACCAGTTGGTCGCAACAGTCGAAAAAATACTGTCTAGAGCTATCTGTTGTTGTTCTTCGATAAATTCAGCTTGAACTTGTGGGTCTATACCATTAGCCAACAAAGCAAGGTAATCAGCTCTTAACCGTCGAGCATCAGCAAGTGAAAGGGCGGGGAA
AGCACCAAGCCCCATCATTGTCCGCTGTTTTGTTGCTGGACGTTGATAACGGAAACGCCACAACTTCTTACCGTTCGTTTTAACGAGCAGAAAAAGGCCATCGCCATCATGTAACGTTAGATCCTTTTCTAACGCTTTAGCGCGCAGAACTTCTGTGTTGGTCAGGGGGCGTGTCGTTCTTGCCAC
Protein sequences of DBSCAN-SWA_4 >LR134000|1124161:1134272|1131390_1133076_-|VDY67567.1|DBSCAN-SWA MKNNIKHHIKNTLTSINWLNFTFLFLSFSYSIYLLLIFGALAQGYISSFGIYSCENCFTEVEDFTKIISNKPDIYYLTMSMIIFIFSLAFKQRIKILITTWLAAAFTITVFDVTLAKENISLIPYTIENLMFNSFGGFFLSLFVLLNLASISYLLKNTKTSIGIATIIPLSSSFVLSFMIYGFLYLIYERSPVHIDALFNEKLLGSIHTADDEGFGFLTTSKTINDEITIQSYLNNNFTWMSGDQSYDVEAYLFSDCILNSTSLNKYKQQPVKLIDVKTLEFSSDRPVIYFIKGEKLSTSIQNSKEVNYSNKDKIRSQIEDGSAKFKSTYLPFTLMVNIAPFDEKSFLEKTEYKLTINKQENLITNIVQQYNIINKKELKCRQIVTPDNRIPTKNNITGVASIAFRFIPKGHLTLSNVPLLTINIKNGDFTKTKDSKAPLSDISNGYLKHINLIGLEELKVNNKKIDVSDSDLLYATGNNLVGFVTKNNKIKIFGTAEKLALNTKIMNLKPISTLNSKLSYFNTSLIDVLKVIFGMGVVIFISNYFVKFIKRKEEIRVLRK >LR134000|1124161:1134272|1130404_1130977_+|VDY67566.1|DBSCAN-SWA MESTALQQAFDTCQNNKAAWLQRKNELAAAEQEYLRLLSGEGRNVSRLDELRNIIEVRKWQVNQAAGRYIRSHEAVQHISIRDRLNDFMQQHGTALAAALAPELMGYSELTAIARNCAIQRATDALREALLSWLAKGEKINYSAQDSDILTTIGFRPDAASMDDSREKFTPAQNMIFSRKSAQLASHQSV >LR134000|1124161:1134272|1126509_1126830_-|VDY67560.1|DBSCAN-SWA MKTPLPPVLRAALYRRAVACAWLTVCERQHRYPHLTLESLEAAIAAELEGFYLRQHGEEKGRQIACALLEDLMESGPLKAAPSLSFLGLVVMDELCARHIKAPVLH >LR134000|1124161:1134272|1129830_1130331_+|VDY67565.1|DBSCAN-SWA MIYCPSCGHVAHTRRAHFMDDGTKIMIAQCRNIYCSATFEASESFFSDCKDSGMEYISGKQRYRDSLTSASGSMKRPKRMLVTGYCCRRCKGLALSRTSRRLSQEVTERFYVCTDPGCGLVFKTLQTINRFIVRPVTPDELAESLHEKQELPPVRLKTQSYSLRLE >LR134000|1124161:1134272|1133072_1134272_-|VDY67568.1|integrase|DBSCAN-SWA MARTTRPLTNTEVLRAKALEKDLTLHDGDGLFLLVKTNGKKLWRFRYQRPATKQRTMMGLGAFPALSLADARRLRADYLALLANGIDPQVQAEFIEEQQQIALDSIFSTVATNWFQIKSKSVSPDYAKDIWRSLEKDIFPAIGEIPVQQIKARTLVEALEPIKARGALETVRRLIQRINEIMIYAVNTGLIDANPASGVGMAFEKPKKQNMPTLRPEELPKLMRSLAMSNLSVSTRCLIEWQLLTLVRPSEASGARWAELDLDTKLWTIPAERMKAKREHIVPLSPQALDILEVMKPISAHREHIFPSRNNPKQSMNSQTANAALKRIGFGGKLVAHGLRSIASTAMNEAGFNADVIEASLAHSDKNEVRRAYNRTTYLNKRIELMQWWGDYVEGKNYK >LR134000|1124161:1134272|1128280_1128547_-|VDY67563.1|DBSCAN-SWA MQAVFSSPSPAPVTPLMPLPDITQERFLRLPEVMHLCGLSRSTIYELIRKGEFPPQVSLGGKNVAWLHSEVTAWMAGRIAGRKRGYDA 
>LR134000|1124161:1134272|1126965_1127421_-|VDY67561.1|DBSCAN-SWA MFDFPQPGEIYRSAGFPDVAVVGILEDGIPWEMPYRCPDIVWNPYRRKFSILVRILADGRTTDIPLGRFLREFTCDRPDLFKRSPVNRHAVLKEMAGDPELQKWREKYLDIYPQDTVPASRAAPVAREWREIPRTEPDPEITPDNSYRNYL >LR134000|1124161:1134272|1124161_1126495_-|VDY67559.1|DBSCAN-SWA MKMNVTATVSHALGHWPRILPALGIQVLKNRHQPCPVCGGSDRFRFDDREGRGTWYCNQCGAGDGLKLVEKVFGVSPSDAAAKVAAVTGSLPPADPAVTAAAVAETEAARKNAAALAQTLMAKTRPGTGNAYLTRKGFPGRECRMLTGTHRAGGVSWRAGDLVVPLYDDSGELVNLQLISADGRKRTLKGGQVRGTCHILEGQNQAGKRLWIAEGYATALTVHHLTGETVMVALSSVNLLSLASLARQKHPACQIVLAADRDLSGDGQKKAAAAADACEGVVALPPVFGDWNDAFTQYGGEATRKAIYDAIRPPAESPFDTMSEAEFSAMSTSEKAMRIYEHYGEALAVDANGQLLSRYENGVWKVLPPQDFARDVAGLFQRLRAPFSSGKVASVVDTLKLIIPQQEAPSRRLIGFRNGVLDTQNGTFHPHSPSHWMRTLCDVDFTPPVEGETLENHAPAFWRWLDRAAGGRAEKRDVILAALFMVLANRYDWQLFLEVTGPGGSGKSIMAEIATLLAGEDNATSATIETLESPRERAALTGFSLIRLPDQEKWSGDGAGLKAITGGDAVSVDPKYRDAYSTHIPAVILAVNNNPMRFTDRSGGVSRRRVIIHFPEQIAPQERDPQLKDKITRELAVIVRHLMQKFSDPMLARSLLQSQQNSDEALNIKRDADPTFDFIGYLGTLPQTSGMYMGNASIIPRNYRKYLYHAYLAYMEANGYRNVLSLKMFGLGLPVMLKEYGLNYEKRHTKQGIQTNLTLKEESYGDWLPKCDDPATT >LR134000|1124161:1134272|1129099_1129834_+|VDY67564.1|DBSCAN-SWA MSDNTIPEYLQPALAQLEKARAAHLENARLMDETVTAIERAEQEKNALAQADGNDADDWRTAFRAAGGVLSDELKQRHIERVARRELVQEYDNLAVVLNFERERLKGACDSTATAYRKAHHHLLSLYAEHELEHALNETCEALVRAMHLSIQVQENPLANTTGHQGYVAPEKAVMQQVKSSLEQKIKQMQISLTGEPVLRLTGLSAATLPHMDYEVAGTPAQRKVWQDKIDQQGAELKARGLLS >LR134000|1124161:1134272|1127413_1127701_-|VDY67562.1|DBSCAN-SWA MRNKKAPQTVSARHDAREHLSIEAYHKLNRASAVSQFVGGDLIHRELSGLHQLYIPHIFSYLNEDIDFVLNELKAKGLCRDFLAQQKDRGDRTHV |
10 | Enterobacteria_phage(85.71%) | integrase | attL 1123142:1123155|attR 1135642:1135655 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
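The DBSCAN-SWA_4 row reports attachment sites (attL 1123142:1123155, attR 1135642:1135655) alongside the region bounds 1124161 : 1134272. The following sketch checks that the two att sites sit outside the region on either side. The field formats are assumed from this row only, and `parse_range` and `att_sites_flank` are hypothetical helper names.

```python
from typing import Dict, Tuple


def parse_range(text: str) -> Tuple[int, int]:
    """Parse 'start:end' (spaces around the colon are tolerated)."""
    start, end = (int(part) for part in text.replace(" ", "").split(":"))
    return start, end


def parse_att_field(att_field: str) -> Dict[str, Tuple[int, int]]:
    """Parse 'attL a:b|attR c:d' into {'attL': (a, b), 'attR': (c, d)}."""
    sites = {}
    for item in att_field.split("|"):
        name, coords = item.split()
        sites[name] = parse_range(coords)
    return sites


def att_sites_flank(region: str, att_field: str) -> bool:
    """True when attL ends before the region and attR starts after it."""
    region_start, region_end = parse_range(region)
    sites = parse_att_field(att_field)
    return sites["attL"][1] < region_start and region_end < sites["attR"][0]


# Values copied from the DBSCAN-SWA_4 row on this page.
REGION = "1124161 : 1134272"
ATT_FIELD = "attL 1123142:1123155|attR 1135642:1135655"
flanked = att_sites_flank(REGION, ATT_FIELD)
```

With the values from this row, both sites span 14 bp and lie outside the predicted region, so `flanked` evaluates to `True`. Rows that report `NA` for the att sites, such as DBSCAN-SWA_3, would need to be skipped before calling these helpers.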
|
DBSCAN-SWA_5 |
1644825 : 1654270
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >LR134000|1644825:1654270|DBSCAN-SWA AATGATTGAATTTAGCCATGTCAGCAAACTGTTCGGCGCACAAAAAGCCGTTAACGATCTCAATCTCAATTTTCAGGAAGGGAGTTTTTCGGTGCTGATTGGCACATCTGGCTCCGGCAAATCCACCACCCTGAAAATGATTAACCGCCTGGTGGAACATGACAGCGGCGTGATCCGCTTTGCCGGAGAAGAAATTCGCTCGCTGCCAGTACTGGAGTTGCGCCGCCGGATGGGCTATGCCATTCAATCTATTGGCCTGTTCCCCCACTGGAGCGTGGCGCAAAACATCGCTACCGTGCCGCAATTACAAAAATGGTCGCGGGCACGGATTGACGATCGTATCGACGAATTAATGGCGCTACTGGGGCTGGAGTCAAATTTGCGTGAGCGTTATCCGCATCAGCTTTCCGGTGGTCAGCAGCAACGTGTGGGAGTGGCGCGCGCACTGGCTGCCGATCCGCAAGTCTTACTGATGGATGAACCTTTTGGCGCACTGGACCCGGTAACGCGCGGCGCGTTGCAACAAGAGATGACGCGCATTCACCGTTTGCTGGGGCGCACTATTGTGCTGGTCACTCATGATATTGATGAGGCGCTAAGGCTGGCAGAACATCTGGTATTGATGGATCACGGTGAAGTGGTGCAGCAGGGGAATCCGCTGACGATGCTGACTCGTCCGGCGAATGATTTTGTCCGCCAGTTTTTTGGACGTAGTGAACTGGGGGTGCGCCTGCTTTCGTTACGTAGTGTGGCGGATTACGTGCGTCGCGAAGAACGGGCAGAAGGTGAGGCACTGGCAGAAGAGATGACGCTACGCGATGCGCTCTCCCTGTTTGTCGCGCGGGGATGCGAGGTGCTGCCGGTGATGAACACGCAGGGCCAGCCTTGCGGCACGCTGCATTTTCAGGATCTGCTGGAGGAGGCGTAAGCGTATGAAGATGTTGCGCGATCCGCTGTTCTGGCTCATTGCTCTGTTTGTGGCGCTGATTTTCTGGCTGCCTTACAGCCAGCCGCTGTTTGCTGCCTTGTTCCCACAACTGCCACGACCCGTTTATCAGCAAGAAAGTTTTGCAGCTCTGGCACTGGCTCATTTCTGGCTGGTGGGAATTTCGAGTTTGTTTGCGGTGATCATTGGCACTGGTGCCGGAATTGCTGTCACTCGCCCGTGGGGCGCGGAATTTCGCCCACTGGTGGAAACTATTGCCGCCGTTGGACAGACTTTTCCGCCCGTCGCAGTGCTGGCGATCGCCGTTCCGGTGATCGGCTTTGGTCTGCAACCAGCGATTATCGCCTTGATCCTTTACGGTGTGCTGCCCGTCCTGCAGGCGACACTTGCCGGGCTGGGAGCGATTGATGCCAGCGTGACAGAAGTTGCGAAAGGTATGGGAATGAGTCGTGGTCAGCGACTGCGTAAGGTCGAGCTACCGCTGGCGGCTCCGGTGATTCTGGCGGGCGTGCGAACTTCGGTGATTATCAACATTGGTACGGCGACGATCGCCTCAACGGTAGGGGCCAGCACGCTGGGTACGCCCATCATCATCGGGCTTAGCGGATTTAATACCGCGTATGTGATCCAGGGGGCGTTACTGGTGGCACTGGCGGCGATCATCGCAGACCGCCTGTTTGAAAGGCTGGTGCAGGCGCTTAGCCAGCACGCAAAATAAAGGTATAACCTGCGAGCATGACGCCACCAATTCCGCCTAACGCCATAAACAGGAACAGGGCGATGACCCCAATTTTAGCTATGCGCATAATGCACTCCTTATGTTAACGAAAGGATTGTACAGTAAAGCGCATTTGTTAACGAATCATTAAATGCCGAGTGGGAAAATATCATGGCCTTGTTCTTGCCAACTGGTGAGTTGCTGCTGTTGGGCGGAGGTTCGATTTTCACCGCACCACACCAGCAATGTACGGCCTTCGAAT
AGTTCAGGGCGTAGTTGATTGAGCGAGTGGGCGAGGACATCAATGCGCCATCCTTGTTGACTGGCAATCCAGCCCTCCAGCCACAGACGGGTGGTATCCTGAATATTCCAGCCAACCACCAGCGCATCTTTACCCTGTTTTTTACGTGCCGAAGCCAGACAAATGGCGATGTAGTTGATCAGTACGCCGTCGAGGATCGCCAGCAGCGCCTGGAGAGTCGGTTGTTGGCACTGAAGCCGTCGGCGCAGAGGAATAAACAGATGTGTGGTGAGTGTCTGGGCGGGGTAATCCTGACCGCGCTCTTTGATCCACGTTCGCAGGCTATGTAGATTGCCGCTTTGCAGGTAAGTCAGTAATGTTTCTTGCTGATCGCGCCAGCCGTTCTGCACATCAACATTTTCATTACTGAGCAGCATTTTAACTTTGCTGACCTGCACGCCGTTGTCGATCCAGCGTTTGATCTCGCGGATCCGGTCAATATCGGCATCGTTGAACAGCCGATGACCGCCGTCTGTCCGTTGCGGTTTCAGCAATCCGTAACGCCTCTGCCACGCGCGTAACGTGACAGGATTAATATCACAAAGCAACGCCACTTCACCAATTGTGTAAAGCGCCATCGTCTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCAGTTTTGCGAACCAGGTAGTTTTGCCCGTTTTTTGTGCATCTATAGGGTGATTTTATTTTTGCCAGGCGATTTTGAGTGATCGTACTCACGAATTCTCATTTTTCTGCAAGAGTTCAAAGAAAGTTAAACGCAGGCAATGTATGTTACGCGTTTTAAAGGGAAGTGTGGTTTGCGGGTATGTACGATTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTTTTTTTAGTCATTGCGTGGTTAATGAGTAAAACGCCATTATTCATACCGTTAATGCAGGTCACGGTTCGTCTGCCGCATAAATTTCTCTGCTACATCGTCTTTTCCATCTTCTGCATCATGGGCACCTGGTTTGGGTTGCACATTGACGATTCTATTGCCAATACCCGTGCGATAGGCGCGGTAATGGGCGGCTTACTCGGCGGTCCGGTCGTCGGTGGGCTGGTTGGTCTGACCGGCGGCTTACATCGATATTCGATGGGGGGCATGACCGCGCTAAGTTGCATGATCTCAACCATCGTTGAAGGATTGCTCGGCGGCCTGGTACACAGCATCCTGATCCGTCGCGGGCGCACTGATAAAGTCTTTAACCCCATTACCGCCGGTGCCGTCACGTTCGTCGCTGAAATGGTGCAAATGTTGATCATCCTGGCGATCGCCCGACCTTATGAAGATGCGGTACGTCTGGTGAGTAATATTGCTGCGCCAATGATGGTCACCAATACCGTCGGCGCGGCGCTGTTTATGCGTATATTGCTCGATAAACGCGCGATGTTTGAAAAATACACTTCAGCTTTTTCTGCCACTGCGCTGAAAGTGGCGGCCTCGACGGAAGGCATTTTGCGCCAGGGGTTTAACGAAGTGAACAGCATGAAAGTGGCACAGGTGCTGTATCAGGAGCTGGATATTGGTGCAGTCGCGATTACCGATCGAGAGAAATTGCTAGCCTTTACCGGAATTGGTGACGACCACCATTTACCCGGCAAACCGATTTCTTCGACTTATACCTTAAAAGCGATTGAAACCGGTGAAGTGGTCTACGCTGATGGCAACGAAGTACCTTACCGTTGCTCTTTGCATCCGCAATGCAAACTGGGGTCGACGCTGGTAATTCCGTTGCGTGGTGAAAATCAGCGGGTGATGGGCACCATCAAATTGTATGAAGCCAAAAACCGTTTATTCAGTTCAATCAACCGCACGCTGGGCGAGGGGATTGCGCAACTGCTTTCGGCGCAGATCCTTGCCGGGCAATATGAGCGGCAAAAAGCGATGCTCACCCAGTCAGAGATCAAACTGCTTCACGCCCAGGTGAATCCCCATTTTTTGTTTAATGCGC
TTAACACCATTAAAGCGGTGATCCGTCGCGACAGCGAACAGGCCAGCCAGCTGGTGCAGTATCTTTCCACTTTTTTCCGCAAAAACTTAAAGCGGCCTTCGGAGTTTGTTACTCTCGCCGACGAAATTGAACATGTGAATGCTTATCTGCAAATTGAAAAGGCGCGCTTCCAGTCGCGGTTGCAGGTCAACATTGCTATTCCGCAAGAATTATCCCAGCAGCAATTGCCCGCGTTTACCCTGCAACCGATAGTGGAAAACGCCATTAAACATGGGACATCACAACTGCTGGATACAGGGCGAGTGGCAATCAGCGCCCGACGTGAGGGGCAACATTTAATGCTGGAGATCGAAGACAATGCCGGTTTGTATCAACCGGTAACCAATGCCAGTGGGCTGGGGATGAATCTGGTGGATAAGCGTTTACGTGAACGGTTTGGCGATGACTATGGGATAAGCGTCGCCTGTGAGCCTGATAGTTACACCCGAATAACGTTACGACTACCATGGAGGGACGAGGCATGATTAAAGTCTTAATTGTCGATGATGAACCGTTAGCACGGGAGAACCTGCGCGTATTTTTGCAGGAGCAGAGCGATATTGAAATCGTTGGTGAGTGTTCAAACGCCGTGGAAGGGATCGGCGCGGTGCATAAACTGCGCCCGGATGTGCTGTTTCTCGATATCCAGATGCCGCGCATCAGTGGTCTGGAAATGGTGGGGATGCTCGACCCGGAACATCGTCCGTATATTGTTTTTCTCACCGCGTTTGACGAATACGCCATTAAAGCCTTTGAAGAACACGCCTTTGATTATCTGCTGAAGCCAATTGATGAAGCGCGACTGGAGAAAACGCTGGCGCGTTTGCGGCAGGAACGCAGCAAGCAGGATGTTTCGCTGTTACCAGAAAATCAACAGGCGCTGAAATTTATCCCTTGTACTGGGCATAGTCGGATTTATTTGCTGCAAATGAAAGATGTGGCATTTGTCAGCAGTCGGATGAGCGGTGTCTACGTTACCAGCCACGAAGGGAAAGAGGGCTTTACCGAATTGACATTACGTACCCTGGAAAGTCGTACACCACTACTGCGCTGCCATCGTCAGTATCTGGTTAACCTCGCGCATTTACAGGAGATTCGTCTGGAAGATAACGGCCAGGCCGAGTTGATTTTGCGTAATGGCTTAACCGTGCCGGTCAGCCGCCGTTATCTGAAAAGCTTAAAAGAGGCGATTGGCCTGTAAAAGACTGCTAAAATGGCTTTTTGCCTCATCAACACCTGAAGGCCTCATGCTAAGTAACGATATTCTGCGCAGCGTGCGCTACATTTTGAAAGCCAATAATAATGACCTGGTGCGTATTCTGGCGCTGGGTAATGTCGAAGCCACCGCGGAACAGATCGCCGTCTGGCTACGTAAAGAAGACGAAGAGGGTTTTCAGCGTTGTCCGGACATTGTTTTGTCGTCATTCCTCAATGGCCTGATTTATGAAAAACGCGGCAAGGATGAGTCTGCTCCGGCACTGGAGCCGGAACGTCGCATTAATAACAACATCGTGCTGAAAAAATTACGCATCGCGTTTTCGCTGAAAACCGATGACATTCTGGCGATCCTCACCGAACAGCAGTTCCGCGTTTCGATGCCGGAAATTACGGCGATGATGCGTGCACCGGATCATAAAAACTTCCGCGAATGCGGCGATCAATTTTTACGTTATTTTCTGCGTGGACTGGCAGCGCGCCAGCATGTGAAGAAAAGCTAAGACGGGTATGGCGGCCATGCGAAACATGGCCGCCGACAGATTATTTCACTTCTTTAAAACCAGCGGCTTTCATCACCAGTTCCATTTGCGCCATAGTGATACCTTTTTTGGCATCTTCAGCAGAAACGTTGATTCCTGAAATACCCTGCAGGGCTTTAAAATCCACTTTTTCCATATCGATAGTCACGTTTTCCTGCGCGTAGGTATCGGTATAGGTTAATTTTTCTTCAACACCCGCGATGT
TTTTGTATTTGGCGCTTAACGGCTCAAGTGCCCTGGCAGCATCTTCTTTGGTGGTTGCACTAATGGAGGCAAATTGAATTTTGGTTTCAGAAGATTGCTTAAGCACCTTGTCACCTTTGTAGACATAGGTAATGGCAATTTCAGTGCCGTTCAGATTGGCGCTGAATTTCTTCGATTCTTCTTTGTCACCGCAGCCAGCAAGAGAGAAAACCAGAACAGATGCAACAACGAGGGAAAACAGCTTATTGAAAGCCTTCATGTAAAACTCCATTTTATTTAATCAAGAAACTGGTGACTCTCACCAGGGGCTATATAGAATATGCCTAATACCGTGGCGTGAGCAGTCCGGAACTGGAGTAGAACTCTTAGTAAAAAGCACTATTTCATCCTTGTTGCTGAAGCATGGGGAATAATTGTTCGCAAAGCAAAACACCGTTATTCATTGCTTCTACCCGTGCCTCGCTTTCTGTATTACGAAATTGTGCCAACACATGTGCCAGCCGATAAAAACCCACCGCGGTGAGGTCATTCGCCAGCAACTCTGCCTGACTAATAGCGCTCTGTTCCTGATAGCGCCAGCCGTTATGGAGCAGTTGAATAAGTAACGCCTGGCAGCGCATCAGCAACTGATGAGCAGTAGACGGCACAGGCAGAACGCTGGTAGAAGGTAGTGATACCACCACAGGCGCAGTTTCTGCGTCCAGCGCCCAGGCACGAGTCTTTGTCATCATTACCTGTGGTTCCAGTGTCAATTGCCCATCAACAAAACTGACAAAGCCAGAAACCAGATACACGGGGTCGTCTGTTTGTTGCAAAAGCGCCGCCATGCGTTCAACGGCATAAGGCGCGCTGGCTGAGGCTGGCAGTGATAACGTCAGCAGATTATCTTCACCTTCGCCGCTGATTACCTGCGCATCCAGCGTCTGGCGGCTGCTATCCCAACCGAGCGAAATACACTCAGCGACCGGCAGAATAAATAAGTTATCGACCTGATTAAGAGGCCGTATGCAGGCGGGGGGACGCTGGCGTAAATATTCCCGCAAAGCCACAATGCCCGGCTGGCGTAACGGCGCGCTCAACATTTGCCAGGCATCAGGCGACAGCGGCACAACGCTGCTTAAGCGGTTGCGGGTAGCTAACAGCAGCTCGCCATCGGCACTGCGTTTTGCTGCTTGTGAAACAATTTGCCCGCCCGCCAGTGCGCCAGCCTGAAAACTAAACAGCCGACGCGTAGCTGCCGGTGAGTTTTCCTGTTCACTTCGCGGCCAACTGCGCGAAAGGTGCAAAATACTGCCGGTGTCGGGATTGGTAAACCAGATGCGTAAACCATAATGCTCAATATCCTGCCAGCAACGCATACCTAAAGACACCAGCCGCAGATGATCAAGCTTTGCTTCTCCGGCAATGCCTGCGCCAACGACCGTGCGCCACGGCATGGGAGGAATTTCACCAACACTGTCGCGCTGCGCCATTTCGTGTGCGCAATTTAATCGACTGTTTAATGCGGCAAGCTGACACAAGCATTCTCCGGCATGATAATGGCTTGCGCGGGCGTGGAAGGCATCAACGCTGGCGCGCAGTTGCCGTAGCGATTCACTCACCCATCGCCAGTTGCAGCGTTCCGCCGCTTGCTGCGCGCGGCTGAAAGCGGCCTCGTAGTGAATAAGCGGCTGGCTGATGCCGCCAAGCCATAATGCCTGGCTTAATTGCTGAACATATTGACGACACGTTTGGCCTTCTTCGCTGGCAAACGGATCGTCAGATGATGTGACGTGTTCGCTGCGCATCTGCCAGATTAAATGGTTAAATTCTGCTTGCTGCGCTTTGGCCTCGACGAAGGCCTGTACCGCCAGTACGATATGTTCGCAAAGAGTGCCTTCAATACAATCACAACGGGCGAAACGAATGCTGCTGCGGGAATAAAAACGCACATCGCTCATCGGTAAGCGGGCAGAGGGAATTTCGCCCGGCGTACAGAACAACTCAATGGTGATGCCTTT
AGCGACCAGCGCCTGCGCGCGTTTGCGGGTGGCATCGGGCAGGGTAGCCAGTTCTTCCAGCCAGATTGCCGGATCCCACTCTTCTTCTTTTTCCGCAGGCTGGGCGGTAGCACAAAGTCGTTGATAACTTAACACCAGCATCACGCGATGACGGCACATGCCGCTGGCCCCACAGGTGCACTGAGCCTCTTTCAGTGCCTGGCCGTTCGCCAGCTGGGTACGGACACCGTCACTGAAGGTGGCGATTAAAGCGCCGTTCTCATGGCTGATTTCCGGGACGTTGCCATTTTCCAGTTCCTTAAGGCTGCGCTTAACAAAACCGGCATTGCTTAACGCCGTCAGTGCCTGTGGTGTCAGTTCTAATAATTCCGGACGTAGTGAATTCATGACTGAAGGTTCTCTGCAAGCCATGATGCCAGCTCGCCCGGCGTCATGGCGGCTATTTGTGCGCCGACATTAACCAGCGCCTGGGCCGTATCGCGGTCATAGCAAGGCGTTGCTGTGCTATCGAGCGCTGCCAGTCCCAGCACTTTGATGCCGCTCTGGACACACTTTTTCACCTGATGCGTCAGCAATGATGATGAACCCCCTTCATAAAAATCGCTCACGAGGATAATGACGCTTTTTGCTGGTTGTTCAATAAGTTGCCGACCATACTCCACGGCACTGGCGATATTGGTTCCGCCGCCCAGTTGTACTTTCATTAATAGTTCTACCGGATCGGCAACGTCTGCCGTGAGATCAACGACGCTTGTGTCAAACGCCACCAGATGGGTACGAATGCCGGGTAACTGCCACAAACAGGCCGCCATCACCGCAGAGTGGATCACCGAATCGACCATCGAGCCGCTTTGATCAACCAGTAAGACCAGTTGCCATTGTTCGCTTTGGCGTTTAATGCGGCTGTTAAATCGGGGGGATTCGATATACAACTTGCCGTGTTGCGGGTGCCAGTGTTGCAGGTTGGCGCGCAGAGTACTTTTGAAATCAAAGTTTCGCGCCAGTGGAATAAATGAGCGGCGACGGCGATCGCGGACACCAGAAAAAGCCTGACGAACTTCCTTTGCCAGTCGAGCCATAATTTCTTCAACAACCTGGCGCACTATCCGGCGGGCGGTAGCCAGTACTTCGGGGTTCATCAGATGTTTGGTATGCAAAACGGCGCGTAGCAGGCTTTCAGAAGGCTGCATACGTTCCAGCACGTCGAGATTCGTCACCACATCTTCAATGCCATAGCGCAGTACGGCATCGCTTTCCAGCCGCTCAATCACCTGTTGCGGAAACAGCGTGTGAATACTGTTGATCCACTCAGGGGTGGTGAGATTTGAGCCACCTAATCCACCGGAGCGTTCACCACGCTGGAGCCGTTCAGGATCGCGCCCATACAGCCATTCCAGCGCGTGGTCTATCTGCCGGGCGTTGTCATCCAGCCCACAAAGCGTCGTTTCTGCCGCTTCGCCAAGAATTAATCGCCAGCGTTGTAGCTCACGGGTGGTCAGAAGATCGTTCAGTTCAGACAT
Protein sequences of DBSCAN-SWA_5 >LR134000|1644825:1654270|1650036_1650507_+|VDY68021.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKS >LR134000|1644825:1654270|1644825_1645752_+|VDY68015.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGVIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMALLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSELGVRLLSLRSVADYVRREERAEGEALAEEMTLRDALSLFVARGCEVLPVMNTQGQPCGTLHFQDLLEEA >LR134000|1644825:1654270|1646468_1646576_-|VDY68017.1|DBSCAN-SWA MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG >LR134000|1644825:1654270|1650547_1651009_-|VDY68022.1|DBSCAN-SWA MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASISATTKEDAARALEPLSAKYKNIAGVEEKLTYTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK >LR134000|1644825:1654270|1653133_1654270_-|VDY68024.1|DBSCAN-SWA MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLATARRIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKSTLRANLQHWHPQHGKLYIESPRFNSRIKRQSEQWQLVLLVDQSGSMVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIAAMTPGELASWLAENLQS >LR134000|1644825:1654270|1651133_1653137_-|VDY68023.1|DBSCAN-SWA 
MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENGALIATFSDGVRTQLANGQALKEAQCTCGASGMCRHRVMLVLSYQRLCATAQPAEKEEEWDPAIWLEELATLPDATRKRAQALVAKGITIELFCTPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHIVLAVQAFVEAKAQQAEFNHLIWQMRSEHVTSSDDPFASEEGQTCRQYVQQLSQALWLGGISQPLIHYEAAFSRAQQAAERCNWRWVSESLRQLRASVDAFHARASHYHAGECLCQLAALNSRLNCAHEMAQRDSVGEIPPMPWRTVVGAGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTNPDTGSILHLSRSWPRSEQENSPAATRRLFSFQAGALAGGQIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWQMLSAPLRQPGIVALREYLRQRPPACIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGEDNLLTLSLPASASAPYAVERMAALLQQTDDPVYLVSGFVSFVDGQLTLEPQVMMTKTRAWALDAETAPVVVSLPSTSVLPVPSTAHQLLMRCQALLIQLLHNGWRYQEQSAISQAELLANDLTAVGFYRLAHVLAQFRNTESEARVEAMNNGVLLCEQLFPMLQQQG >LR134000|1644825:1654270|1647588_1649274_+|VDY68019.1|DBSCAN-SWA MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >LR134000|1644825:1654270|1646635_1647367_-|VDY68018.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI >LR134000|1644825:1654270|1645756_1646488_+|VDY68016.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRLRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK >LR134000|1644825:1654270|1649270_1649990_+|VDY68020.1|DBSCAN-SWA 
MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL |
10 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_6 |
1779850 : 1832609
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >LR134000|1779850:1832609|DBSCAN-SWA CATGACGGTCGCCCAGTGTCTCGACTACTGGTTCGACAACTACGTCTCTACAACTCTCAGAGAAAAGACCCAGGCACTTTACCGATCAACGGTTATGAAGCGCATGCATGACGCCTTTCCTAATCGTCCGGCATCTTCTATCACGGTTAAGCAATGGGTTGACCTGCTTACCGAAGAAGAAAGAGATAATCCACGCCGAGCAAGGCAGGTGCTAAGTCAACTAAGATCAGCAATAAGTTGGTGCATGCGGCGTCAGTTGATAGATAGTTGCGCAATTATGAGCATCCAACCAAGGGACTTCGGCTCCCGCGCTGAGGTAGGGGATCGGGTACTGTCGTATCACGAACTGGCTAAGATTTGGCTTGCTATTGAAAGAAGCCGTGCGTCTACGTCAAATAAGCTACTTCATCAGATGCTTATGCTGTGGGGGGCGAGGCTCTCAGAGCTTAGGCTGGCAAAAAAGACAGAATTTGACCTGCTGGAAAACGTATGGACCGTACCGAAAGAGCATAGCAAGATGGGTAATGTTATCCGCCGTCCAATCTTCGAACAAATTAAGCCTTTCCTCGAAAAGGCCATGACAACGTACAATGATGTTCTTTTCCCTGGAGAAGACATAAACAAACCGATCAGCATCGCTGCAGCCAACCGATTCGTAAATAGAATAAGGGGAGGGATGGATCTAGGTTACTGGCGAACACACGATTTCAGAAGAACGCTTGTTACACGTCTGTCCGAGATGAATGTCGAGCCTCATGTTACTGAGCGAATGCTCGGTCATGAACTTGGCGGGATAATGTCCGTGTACAATAAACACGACTGGATAGAGGCTCAGCGCAAAGCGTATGAGCTTCACGCTGATAAATTGTTCTGGCACATCAGGAGCATTTCTGATTAACGCCACCGTTAAGAATCCACCCTTCAACAGCTTCACGAAGGTATGATTTGGGGTGGGTTCTGACTGGCTTCGGAAATCCGTGCCGTTTGGTATAGTTCCAGATTGTCTGACGTGATGAAACACCGAGCTTGTTCATTACTTCTTTCTCAGGAATCAGGCTGGTATCGGTCATCTTAATTCTCCAGGCAAAAAGAAACCGCCATCAGGCGGCTTGGTGTTCTTTCAGTTCTTCAATTCGAATATTGGTTACATTGTTTTCATATATGAATAAATAAATTAGCTTTTTTCGTTGCCTTCGCGTTCTTTATTAATTTTAACAAACTCGTTTTTACCACGCTCTCCAAATGCGTCTTTAGAGTCGTTGTATCCGCAATCGCAGCACACATAATCACCAGACCATCCACGCATTGTTTTTTCTTTTGCAATATTTCCAGAACCGCATTTTGGACAAGACATATCACTACCTCCAAAGCATGAGTGAGATGACAACGTAACATTGATTGGAGATTAACAATAGATTGCTGATGTAAAAGATATGTATAAGCTTCGCTTTCAAAGTGGAGGCTCTGGTAGCGGCATCCAGTGAGTTACGTCATCCAAGATATTTCCTGATAAATACGTGAAAGCTCTATATTTTTTGTAATCAATTGGATTTACAACCCAGTTCCAATATGCGGCCACGATTTCACCTTGACTAAATGCCAGTAACATTTTGGTGTCTTCCGGCATTCGATCACTACAGCTTATCCAACCACCCGGAGTTACCGGAGAGTTGCCATTTATATCGAAGTTTGGCTCTGCGTCCTGAACTAGGAGGATGTAACCATTCTTGGCAGTATCAAGTTCCGACGCCTCGGTGACGGTGCCGAAATAGCGATTACCTAAATCAGCATCACAAGTACTTACATCAATGGAAACTTCCATGCCTTCGATTAATTCTGGCAAGTTGTAAGCTTGGCTTACAGGTTCGGCTTCCAGTTCTGCTATGCGCTTTTTTGCTGTTTCCAGCTCGCCCAGCAGCGCCAAGA
CGGTAGCCGGATTGGCTGCAGCGATGAATTCAGCATTGGCCTGCTGTTCCATTTGGAAATCTTCATCGAAACCGCTTTCAGGATGCGCTCCTTCAATTCTGCAAATGGGAATATATCCAGCAACTTCACGATGAATTAGCGCATCATCACCATCAAATCGGCTCTCTCCATATTCGAGCGACCACTCACCACACGTTGCTTTTTCTGCCTTGGCCCGCAGTGCCTGAGAGTTAATTTCGCTCACTTCGAACCTCTCTGTTTACTGATAAGCTCCAGATCTTCCTGGCAACTTGCACAAGTCCGACAACCCTGAACGGCCAGGCGTCTTCGCTCATCTATCGGATCGCCACACTCACAACAATGAGTGGCAGATACAGTCTGGTAGTTCAGACGACGCATTTTTATTGCTGTATTGCGCTGTAATTCTTCGATTTCTGATGCTGAATCAATGATGTCTGCCATCTTCCATTAATCCCTGAATTGTTGGTTAATACGCTTGAGGGTGAATGCGAATAATAAAAAAGGAGCCTGTAGCTCCCTGATGATTTTGTTTTTCATGCTCACCGTTCCTTAAAGACGCCGTACAGCATGCTGATATGAGACAATGTTGATTCATTAAGTTGATTCCAGACTTCCTTTGGTAAAAGCTTGTATCAGTCTGTTTGCTGCTGCTTTCTGCGCTGCCACATTGGCAATAACAGATAGTTTTTCCTGGCTGGCTTTCGTGCAGATCCCCGCCCAGTTATCCATCAGAAAAAAATCCTCTCTTTCTGCAGAGCTGGTAGTTGCACATAGTTTTTCGATCATAGAAGTTATTTCTGCGATGGAATGATTAACCATCATCTGTTGAACCGCAAAACCGAAAGCGTTAATCATTACTCCATGGAACTGAATATAATCGCGCTTGTACGTAGCGTGGTGTACACCATGTCGGATTGAGTCAATCTGAGTTAATGTAATCCATGCCTCCCATACAGATTCTATATATCCCATTTCAAGTTGTTGATTGCCGTTCCTAGCGAACTTTGACGTTGCATCAGTGAGTGCCTTGAAACTCACCCACATATTACTTTTTAATGGCACTACGTTGTGTTCAAAATCGGTTATATCGGCAAATACAGTATGTTGGGTCAGGAAGGATATCATTCCCTGAGCAATATCATCCCGGCCGTTATACGCCATATTGATGGTCGCTGATGGCTTAGAAACGTTGTTATTTATGTCCGAAAAGAACTGCTGCCGGGTTTTTAGCGGCAGATTCATTGTAAGCATCATGGGAACCATGAGCGTTGATGGGGAGCTTCGGCAAAATATCTCAATGCCAGCTGCACGATGTTGACCATCAAAAAGTTTTATTTCGGCGTCGAGGGGAATTCTGGCTATACCAACATTTGTGTTGCCAAACGGTACAAATTCTATATTCGAATCACAGTTACCTACGAGAGGGGGAATGATAAAAGGCTCATTTCTTGAGTCTGCGTTAGTGAGATAATTTAAAAATTTTCGTACTCGATTTGGATTAATTTCTCGCTGAGAGCGTTCCAGTGTATGGCCGTAATTATCTGAAGCGAGGAAACGAGCCAGCGATCTTCCTGGTATGGTAAGGAAGAGTGTAACAGTACCACCCTGTACACCTTGCGATGCCGGAAATTCGAATGAATGATTACCAACCTGACTCATATATCCTCCTGTTTATTATTTATCTTCTCAGCCAGCCGCTGTGCTTTCAGTGGATTTCTGATAACAGAAAGGCCGGGAAATACCCAGCCTCGCTTTGTAATGGAGTAGACGAAAGTGATCGCGCCTACCCGGATATTATCGTGAGGATGCGTCATCGCCATTGCTCCCCAAATACAAAACCAATTTCAGCCAGTGCCTCGTCCATTTTTTCGATGAACTCCGGCACCATCTCGTCAAAACTCGCTATGTACTTTTCATCCCGCTCAATCACGACATAATGCAGGCCTTCACGCTTCATGCGCGGGTCATAGTTGGCAAAG
TACCAGGCATCTTTTCGCGTCACCCACATGCTGTACTGCACCTGGGCCATGTAAGCCGATTTTATTGCCTCAAAACCACCGAGCCGGAATTTCATGAAATCCCGGGAGGTAAACGGGCATTTCAGTTCAAGGCCGTTGCCGTCACTGCATAAACCATCTGGAGAGCAGGCGGTGCGCATACTTTCGTCGCGATAGATGATCGGGGATTCAGTAACATTCACGCCGGAAGTGAATTCAAACAGGGTTCTGGCGTCGTTCTCGTACTGTTTTCCCCAGGCCAGCGCCTTAGCATTAACTTCCGGAGCCACACCGGTGCAAACCTCAGCCAGCAGGGTGTGGAAGTAGGACATTTTCATGTCAGGCCACTTCTTTCCTGAGCGGGGCTTTGCTATCACGTTGTGAACTTCTGAAGCGGTGATGACGCCGAGCCGTAATTTGTGCCATGCATCATCCCCCTGTTCGACAGCTCTCACGTCGATCCCGGTACGCTGCAGGATAATGTCCGGTGTCATGCTGCCACCTTCTGCTCAGTGGCTTTCTGTTTCAGGAATCCAAGAGCTTTCACTGCTTCGGCCTGTGTCAGTTCTGACGATGCGCGAATGTCGCGGCGAAATATCTGGGAACAGAGCGGCAATAAGTCGTCATCCCATGTTTTATCCAGGGCGATCAGCAGAGTGTTAATCTCCTGCATGGTTTCATCGTTAACCGGAGTGATGTCGCGTTCCGGCTGACGTTCTGCAGTGTATGCAGTATTTTCGACAATGCGCTCGGCTTCATCCTTGTCATAGATACCAGCAAATCCGAAGGCCAGACGGGCACACTGAATCATGGCTTTATGCCGTAACATCCGTTTGGGATGCGACTGCCACGGTCCGGTGATTTCTCTGCCTTCGCGGGTTTTGAATGGTTCGCGGCGGCATTCATCCATCCACTCGGTAACGCAGATCGGATGATTACGGTCCTTGCGGTAAATCCGGCATGTACAGGATTCATTGTCCTGCTCAAAGTCCATGCCATCAAACTGCTGGTTTTCATTGATGATGCGGGACCAGCCATCAACGCCCACCACCGGAACGATGCCGTTCTGCTTATCAGGGAAGGCGTAAATTTCTTTCGTCCACGGATTAAGGCCGTACTGGTTGGCGACGATCAACAATGCGATGAACTGCGCATCGCTGGCATCACCTTTAAATGCCGTCTGGCGAAGAGTGGTGATCAGTTCCTGTGGGTCGACAGAATCCATGCCGACACGTTCAGCCAGCTTCCCTGCCAGCGTTGCGAGTGCTGTACTCATCCGTTTTATACCTCTGAATCAATATCAACCTGGTGGCGGGCAATAGTTTCAACCATGTACCGGATGTGTTCTGCCATGCGTTCCTGAAACTCAACATCGTCATCAAACGCACGGGTAATGGCTTTTTTGCTGGCCCCGTGGCGTTGCAAATGATCGATGCATAGCGATTCAAACAAATGCTGGGGCAGACCCTTTTCCAGGTCGTCTGCCAGTTCTGCCTCTTTCTCTTCACGGGCGATCTGCTGGTAGTGACGCGCCCAGCTCTGAGCCTCAAGACGATCCTGAATGTAATAAGCGTTCATGGCTGAACTCCTGAAAATGGCTGTGAAAATATCGCCCGCGAAATGCCAGGCTGATTAGGAAAACAGGAAAGGGGATTAGTGATTGAGGCCGTTACCGCGTCCGTCGAGAAAAACTTCCACGAGCAAATCACGGGTATAAGTGCGCTCGATGCCGCGATGCAGATAAAGCCGTCCGCGTAAATTAGCTGATGCAGTCCAGGTACCATCTTTGTGTTTGATCAGCATTCCTGGCATGACCGCACCGCGATTAACGGTCTGCGTTCCGTAATGTTGATGAACCATAAAAACTCCTGCCCGTAAGCTGGGCTGCTGAACATATAGAGACTTCTGCGCGTATTCAGGCGGTGGATGGCCGCCGGTTGTCATAACTAAGCCGCCTCGTTGAAGCGACTGAGGTATAAAGT
GTTGTGTTGATTTCAGCTGGTCACACCGACGTTCACGCGTCCGTTTCACCCCTCGCACTTCCCGAAGCCTGCTGAAATTCAAGCTGCGGATCTAAGCGGTCATCGCAACGGTGAATCAGGTGGTTGCCGTATCGTTGTGTTGTTGCGATGAATTTATTTAAAACTATAGTTGTTTTATCGTCAACAACAAAAGTTGTTTTATTGGTTGTTTTAGATGTAACTGGTTGTATTTAGGATGGATTTATTTTGTGACTTGCATCGCATAGCGATAACTGAAGGGAGATTGTGGTGGTTTTTTGAACGGTATACGTGATGAGGGGAGGGGATAAAAGAAAACCCGGCACGGTGGCCGGGAAGTTTACAAATACTTTACATTTACCGCTATAAGAGAATAATCGTCAGTCGGGCCTTTTGTCTCTATTCTTCGCATTAGGCTGGAGGCAAACCTTGATGCGGTTGACAATGTACGGTAAGAAAATCGTGGGCGTTGCTCCCAAAAGGTATGCGCGCCATCAGACATGATGAATAGAGATAGAACACCATCGTCAAGAGGAAGTTCATCTTTATCAATATTTATGACTTGATGTTGAAGAGGGATCTTGGCAGCAAGTGCAGTTGTAATAATATTTTTACCTTTTGCATTTTTGAGCTGTCGAGCTGTATAAATACCGCTATCTATCAACATTTGATGTTGAGTATGATCTGTTGTTAATTGGATTAGCTTATTTCCATTTTTTAAATAAACACGACAGTCACCTACATGTATAATTGTTAGACCTTGATTGTGAAGTAGGCACATTGTTAAAGTTGTAGCAGCAGATGCTAATTCTTTATCAACTGACGAAAGAGATGCCACCTTTTCTTGTAGGGTCTTAAGTAAGGATAAGACATTGTCTTCATGAGTGAAGCTGTCTTTAATACTATACAAATGATTTATTACTGCTGCAGATGCTTCTTTTCCTCCTTTATAACCACCAACCCCATCAGCTATAGCCATAAGATAACCATTTGGTGTTTGAAGTGGATAAAGCACGCTATCTTGATTTTCCCTGCCATTATGTTTTGGTACAGAAAATATGGATGATGAGAGTATGCTAATCATAAGCGTTTCAACTCCAATTCCTCAAAGATCTGCTCTACGGTCAGGAATCTTTTGTTAAGTTGTCTATGCGTACATTTGTTTATTATATCATCAAAACCATCAATATTTAATTCTTCAATTAAAATACCAATTGAATATATATCTGATTGTTTTGAATATCCGTTGAGGAAAACGTTGTAATCAAAGTATTTAGGAGTAGCGGGGTATTGGCCAATTTGTGTTAATAGCTGTGTATTGGCATCTGGTGACACATTTTTTGCAAGGCCGAAGTCAGATAACTTATAAATTCCATCTTTAAATTTTAGTACATTTAGTGGTTTTATATCCCGATGAAGATATCCTTTCTTATGAATCCATCCGACGGCATTTAGTACCATTTTTACAATAGAGATTTTTTCAGCCTTTGTAAGAGTTCCACTTTTTATTTCTTCTTCAAGATTTGTTTCTGCTAGTTCCATTACGAACCAAGGCTGAGCATTCTGTAAGTGGCATATGAATATTTGGACAATATTAGTATGCAGGCATTCAGTTTGATATCTTACTTCTCGTTCAAATCTGGTAAAGAGTTCAGGATCTGTTGCATCAGGTCGTAAAATTTTTCTGGCATATAGACCACAAATTTTTCTTTGTGAGTTATATAACCTTATTTTTTCTACAAGGCCAAAACCGCCAGAACCAATTACTTCTAGGGGGTCTATTAGGTAATTCCCACGTTCTTCCATTGTTCATCCTTTTAAGTTAGACAAAAGGAAAAATATAATGCAATGTTATCGATGCTTTTTTGAAGCCAAAGTTAACTTGTTGAATTTATGTCGAATTATCCATGCTTCCTATACGTCTGCGGCATGCTCCCGATGACTTTCCCGAAGATGAATACCCGGTTCATCTCGTCTTTCTC
GATTGGGTCCCACGGTGAGTAGCTCTTGTTATCAGAGATAACCAGCAGTTTATCCTTCATCATTTGCAGGCGCTTTACATGGGCTGTGTCATCGTACAGAAACGCATAGATACCATCACCGTCGAAAGATTTAACAGTGATATCAACGAACAGCAGATCACCTGGTTCGATCGTTCCTGACATGCTGTCACCGCGTACGTTAATGATACGGATATTTTCCGCCTTCCTGCCATCGAACATGTGACGAGCATCGTCAAACGAATACTCAACCGAACGTAGAACTTCTACAAACTCACGGTTGATGACTCCCGGCCCGGCACTCACTTCGATATCAAGAACGTCAATCTTAAAGTATTTGGAATGGCTGACAGTTGTTTGTATTGGTTGCACTGTACTGTCTGACATATTTCCAACGCCAGAAGATAACCATTCTGCGCGCACACCCAAAGCGTTCGCGATCTCCACGATTTTAGTTGTTTGGTTAGCTTTCCCTGTTTCGATTTTCTGAATAGCTGCTTGGCTAACCCCGACCAAATCCCCAAGCGCCTTTTGTGTAAGGCCTCGCGCTAATCTGGCTTCTTTAAGTCTTTCTGAGAGTGTTGTTTTCATAGTCCAAATGTACAACCAAGGTTTTATTTCATCAAACGAAAATGGTTGTTGACTAAAAACAACCATAGTTTTAATCTTGATTCAAATTAACCACGGAGGTTGTTATGAACCCAGCCATCAAAACAGCGATCAATATCGTTGGTTCACAAAAGAAACTAGGCGCTGCCTGCGAAGTTTCACAGCAGGCCGTCTATAAGTGGCTTCACAACAAAGCAAAGGTATCCCCTGAACATGTCGGCAGCATTGTTACGGCTACTGGTGGAGTAGTGAAGGCATACCAGATTCGCCCGGATCTTCCGAAGTTGTTTCCACACACTGAAAAGAACGCAGCTTAAATTTCCATTTCACGCTCTTTAACAATAAGCAATCAACTTAACAGTCAATTCAAACTAAAGGAGTCAATTATGCAACCACTTACATACCAACAGACTAGCGGATTTAGCCCGACTGCGGTGATAAATCGTTCTCAAACAAAACAAGCTCCAGGCCACGAAAAAATCCGTGATGCCGTCCGCGCCTGGTCGGCTGTAGATAATCAGGATGTCGTTGCCGCACTCATTGTGAATGAGTATCGGGAGCAGGGCGGCGGCACCATCGATTTCCCTGATGATGTCAGCCGTGCACGCCAGAAGCTGTTCCGCTTCCTCGATAACAAATTCGATTCTGAAAAATACCGAAATAACGTGCGTGAATTGACCCCGGCAATTCTGGCGGTACTACCGCTGGAATATCGCGGCCACCTGGTTGAGCAGGATAGCTTCATGGCTCGGCTGGCTGAAATGGAAAAGGAACTCAGTGAGGCAAAACAGGCTGTCATTCTCAACGCACCACGCCACCAGAAACTGAAGGAGATGAGTGAAGGCATTGTGTCGATGTTTCGTGTGGACCCGGATTTGGCTGGTCCACTGATGGCGATGGTCACCACCATGCTGGGGGCAATATGACAGGTTCGGAAATGGCGAAAGCCGGTCTGCGGGAACAGAGCCGACTTTCAGGTGCAAATCGTAACGCACTCATTGCGGGAGGAATTATGGCAAACACTGCTGAGATATTCAATTTTCCAGTGCCGGATGTGGCACAAAAGGAGCGGCGCGTGGCAGATCTCGACGATGGTTATACGCGCATTGCAAATGAGTTGCTGGAAGCTGTGATGCTGGCCGGATTAACACAGCACCAGCTTCTGGTCTTCCTGGCTGTCATGCGCAAAACATATGGCTTTAATAAAAGACTGGATTGGGTGAGCAACGAGCAACTTTCCGAATTGACCGGGATATTGCCGCACAAGTGTTCTGCTGCAAAAAGTGCTCTGGTAAAGCGTGGGATTTTTATTCAGAGCGGGCGGAATATAGGCATTAATAATGTGGTCAGTGAATGGTCAACATTACCCGAAT
CAGGTAAGAAAAATAAAGTTTACCTGAAAGAGGTAAATTTACCTGAATCAGGTAAGAAAAGTTTACCCAAATCAGGTAAAGGCGTTTACCCGAATCAGGTAAACACAAAAGACAAACTAACAAAAGACAATATAAAACCTTTTTCGTCCGAGAATTCTGGCGAATCCTCTGACCAACCAGAAAACGATCTTCCTGTGGAGAAACCAGATGCTGCAATTCAGAGCGGCAGCAGGTGGGGGACAGCAGAAGACCTGACCGCCGCAGAGTGGATGTTTGACATGGTGAAGACCATCGCGCCATCAGCCAGAAAACCGAATTTTGCAGGGTGGGCTAACGATATCCGCCTGATGCGTGAACGTGACGGCCGTAACCACCGCGACATGTGCGTGCTGTTCCGCTGGGCATGCCAGGACAACTTCTGGTCCGGTAACGTGCTAAGTCCGGCCAAACTCCGCGACAAGTGGACCCAACTCGAAATCAACCGTAACAAGCAACAGGCTGGCGTGACAGCCGGAAAATCAAAACTCGACCTGACAAACACTGACTGGATTTACGGGGTGGATTTATGAAAAACATCGCCGCACAGATGGTTAACTTTGACCGTGAGCAGATGCGCCGGATCGCCAACAACATGCCGGAACAGTACGACGAAAAGCCGCAGGTACAACAGGTAGCGCAGATCATCAATGGTGTGTTCAGCCAGTTACTGGCAACTTTCCCGGCGAGCCTGGCTAACCGGGACCAGAACGAACTGAACGAAATCCGCCGCCAGTGGGTTCTGGCTTTCCGGGAAAACGGGATCACCACGATGGAACAGGTTAATGCAGGAATGCGCGTAGCCCGTCGGCAGAATCGACCATTCCTGCCATCACCCGGGCAGTTTGTTGCCTGGTGCCGGGAAGAAGCATCCGTTACCGCCGGGCTGCCAAACGCCAGCGAGCTGGTTGATATGGTTTACGAGTATTGCCGGAAACGTGGCCTGTATCCGGATGCAGAGTCTTATCCGTGGAAATCAAACGCGCACTACTGGCTGGTTACCAACCTGTATCAGAACATGCGGGCCAATGCGCTGACTGATGCGGAATTACGGCGCAAGGCTGCCGATGAACTGGCCTGTATGACAGCGCGAATTAACCGTGGTGAGGCGATACCTGAACCAGTAAAACAACTTCCTGTTATGGGCGGGAGACCGCTTAATCGTGTTCAGTCTCTGGCGAAGATCGCAGAAATCAAAGCGAAGTTTGGGCTGAAAGGAGCAAGTGTATGACGGGCAAAGAGGCAATTATTCATTACCTGGGGACTCATAAGAACTTCTGTGCGCAGGACGTTGCCGCGGTAACAGGCGCAACCGTAACCAGCATAAATCAGGCTGCGGCTAAAATGGCGCGGGCAGGAATCCTGGTCGTTGATGGTAAGGTCTGGCGAACGGTGTATTACCGGTTCGCTACCAGAGAAGAACGGGAAGGAAAGGTGAGCACGAATCTGATTTTTAAGGAGTGTCGTCAAAGTGCCGCGATGAAGCGGGTGTTGGCTTTATATGGAAGAGAGTAGGTATGAGCAATTATTGTTACTAATTTTAGTTTTACGACATTCGTGATAACTAAATAATTGATGTGTGGAACTGAATTATAAAGGGGATGATGTTTTGGGAAATAAAGAAAATATCGATTGTAAGCACACAAGAAGCTCATGCTATAAAAACAAGCAGATGAAAGATGTTATTTATATTACATTGCCTAAACTCACTGAAGAAGAAGTAGAGATTTTTAAGGGACCAATGCATAAAGCATTGCTTGCAGGGATAAATGTTACAAAAAAGGCAGTTTCTGATGCCCTGCTAAACAAAGGGATAAAAGTTGAATTTAAATAGAGTAATTCAGTGGCAACAATAGCACTCATTTGTGAGTGCTATTGAAATTTATTAGAAAATAATGTTTGCTATATCCGAGATAGCATTTAAAGAACCTCTGTGGTCACTTCCTGTTTTAGGTAAAATATAG
GAATATCGAGGGTCTTCATTATATGTTAACTTCCAGTGTTTTCCATCGCTTGACGCACTGAAACCTAAGTCCTTTAAATTTCTTTGTGTTGCGCTATCCATGCTTCTGTAACCTGTTAAAGTCCTTTTAAGTAATTGGCGGCGACTCTCGGTTTCTTTATTGTATTCATTATTGGCAATTAGGGATGATAAGATATGGTAGCTTCTACCAAATTCATTTTTATTTTTTATGGCAGTTTTTAGTGCGTCAATAATTATATTTTTGATTTCTCCATCAAAAAAGTCAGTCTCTTCACCAGCATTAAGCGCAATGCTTCCTTGGGCTGAAGCTTGTGATTGAAGAGTGCGTACTCTGTGCTCTAAAGAGGAGATCTTATGCTTAAGATCTTCTATTTGGTCATCTTTCGCAACATTATCGGCTTCGTAAAGCGCCATCAGTTCGCGTGTGTATTCTCCTCTTTCTTTGAGTGAGTTAATAGAGTCTTTCGTTTTTCTGGTTTGTATCTCACTCCATCCACTATCTGAGACAGGAGCCATTGTAGTCGTAGCTCTTACAACATCATCAAATAACTCATCTTCAAATTCTTTTGCGGTTTTCTCACCACGGCGATAAAAACTGATATTTTGACCCCGTGGCCAATAGATACCAACAGCACCAGCATAGGCATTTTTAGCATTTGTCTCATTTTTTAGCTTAATAGAAAATAATCTATTGCTAGGCTCAATTAATACATGTGCTAATCCGCATACCTTTCTTGCAAGGCGTTCTGGGATGATATTGTGAGCATGTTCATTAAAAAAATACTTGGAGCTAACGTATATTATTGGTAGTCTGTTGTCAGTTTCACCATTTATAACTTTAGCTGCTATATTTAAATGTTCATCAGTATCATCTAAGGAATGAGGTTCGACTGACACCTTGAAAATATCATCAAGACCACCAGGAAATTTATCAATCAATCTCATAACAACTAATGGTTTCTTAGGTTGCGGGGCTAGATAAGCTGCATCTTGGCTTACAACGCTAGATTCCACCTGAATCCACATGGTATCAGTTTTGAGATTTTTATTAATTGAAATATCTGTTATCCATTTGTGTGGCTCAGATATTTTCGAATACCTAAAACAGCCTAAACTAGTATCTTTGTTTTTATAAGTTATTATATCAATTCTTTCATTTTTAGACTCTATGAAGTACTCCTCGCTTTTATAATCACATACTAATTGTGCTGGAATAAAAGTTGTATGTGGTGAGTCGTTTACCCAAGAAAAACATTCATTAAAGATCTCAGCTAGAGATGTGGAGTTTGAAACATAAAAACCAGTTGAGAAATATTTCATGAAATTTCCTTATTAAAAATGAGATTAATCTCATTGGCTATGATAATTGTTACCCTCCTTTGGATCTAGTGTTCATTTGACATAAGAACCTATTGATTCATCATAATCAACTCGCCATAATCATGTCATCGGAGCCTGAACAACTCCGGTGACTTCTGCGCTTTGAGGGGACTCAAAGTGCAAACGACAATCAGAACGCCTTTCAACCAGTCACAGATGCAGAAATGCACCTGCGATATCTTGCATCCAACGTTTGATCTCTGCGGAGGTGAAGCGTGAACCTCCCACAAGATGGTATCAAATTGCATCGCGGTAACTTCACCGCTATCGGTCGGCAGATCCAGCCTTATCTGGAGGACGGCAAATGCTTTCGCATGGTGCTTAAACCGTGGCGCGAGAGACGCAGTCTTTCCCAGAATGCACTCAGCCACATGTGGTACAGCGAAATCAGTGAATACCTCATCAGCAGGGGTAAAACGTTCGCCACTCCAGCTTGGGTAAAAGATGCTCTCAAACACACATATCTCGGTTATGAAACCAAAGACCTGGTTGATGTCGTAACCGGTGATATCACCACTATCCAGTCGTTACGCCATACCTCCGATCTTGATACCGGAGAGATGTATGTCTTCCTGTGTAAGGTTGAAGCCTGGGCGATGA
ATATTGGTTGCCACCTGACTATTCCACAGAGCTGCGAGTTCCAGCTGCTGCGCGACAAGCAGGAGGCGTAATGGCTACACCGCTTATTCGTGTCATGAACGGACACATCTACAAAGTACCAAATCGTCGTAAGCGTAAACCTGAGCTGAAGCCATCCGAAATACCAACTCTGCTCGGATATACCGCCAGCCTGGTTGATAAAAAATGGTTGCGACTGGCAGCAAGGAGGAATCATGGCTGATTTGAGAAAAGCAGCGCGTGGTCGGGAATGCCAGGTAAGAATCCCTGGCGTATGTAATGGCAATTCTGAAACGTCTGTACTGGCACATATCCGGCTGGCTGGATTGTGCGGTACCGGTACCAAACCGCCAGACCTGATTGCCACCATTGCATGTTCTGCCTGTCACGATGAGATCGACCGTCGCACGCATTTTGTTGACGCTGGATATGCAAAAGAATGCGCGCTGGAAGGTATGGCGAGAACGCTGGTTATCTGGCTGAAAGAGGGGGTTATTAAGGCGTGAATACTTACAGCATCACATTACCCTGGCCTCCGAGCAATAATCGCTATTACCGCCATAATCGCGGGCGCACGCACATCAGCGCAGAGGGGCAGGCATACCGCGATAACGTCGCCCGAATCATTAAAAACGCAATGCTGGATATCGGCCTGGCTATGCCAGTGAAAATCCGTATTGAGTGCCACATGCCGGATCGCCGTCGCCGTGACCTGGATAATCTGCAAAAAGCCGCTTTTGACGCACTCACCAAAGCAGGTTTCTGGCTGGATGATGCTCAGGTCGTTGATTACCGTGTTGTGAAGATGCCCGTTACCAAAGGTGGGAAGCTGGAACTGACCATCACTGAACTGGGAGATGAATGATGTTTGAGTCTTATATGGCAGAACGTCTTCGCCGCCGCTGGGTGCGCCTGCGCTTATATCGTTTTCCCGGTTCTGTTTTGACCGATTACCGAATACTGAAGAATTACGCCAAAACACTGAAAGGAGCTGCCGCATGAATATCCAATATTTACAGTATGTTCGCGAGCAACTCATGGTGGCTACCGCTGATTTGAGCGGAGTAACGAAAGGCCAGCTTGAAGCCTGGCTGGAGCATGCTCAATTTGATACTGGTACATACAAACGAAAGAAGCCGCGCATTCTGGATGAGGTAACTGGCAGGATGATTACGCTGGATAATCCGCCGATTTTCGGTAAGCAGTCGTACGCAAAAGGTTCATCCATTGCACTGGTCAGCCAGGTTGAGTTCTCAACATCTTCATGGCGCCGCGCGGTTCTGTCTCTCGAAGAACACCAGAAAGCGTGGTTGCTGTGGAGTTACAGCGAAAGTGTTCGCTGGGAGCATCAGGTTGCCATAACGCAGTGGGCATGGAACGAGTTTAAGTCTCTGTTAGGTAAAAGAAAAATTGCCAGTAAGACACTGGAACGCTTAAAGAAGTTGATCTGGCTGGCGGCACAGGATGTGAAGAACGAGCTGGCAGGGCGTAAGACCTATGAATACCAGGAGCTGGCATCACTGGTGGGAGTGACATCAAAAAACTGGTCTGAGACATTTACTGAACGCTGGGTTGCAATGAAGCACATTTTTCTACAGCTTGATAGCCAAGCTTTATTGCTTTTAACGAAAACACGTTCAAAACAAAAGACGACATTTTCACAGCAAAGTATTGCAAAACTGGATTAAAAAGCATATATTTCGTGTAAATCTGATATTTTGCCAATGTTGTACGCACTGGCAGTAATCCAAATTCAAGCCCGAGGTTTAAAACTTTGGGCTTTTCTGTTTCTGGACGGTGAGTAGCCTTCCAACCTACCCCAGCCAGGGTGTCTTCAACTGTTGAGTTGATATTGCTTAACCCTCTGTTGCCAGCTACATGCTGGCTTTTTTATTCCAGGCTTGCGGGGAGCATCAACTCCGTGCTTTGTCGTTAAATTACCCCGTGAGCCTGATTTCTGACATTTAACGTCCCGGCCTTTT
GTCGGCGGCGAAACATTGGCTATTCATATGCACGAAAAAGAGAGCCTTGCCGGAGCGTTCTGGCTCGTTTTGCTGATCATCGCAGGTTGGGGCGGTCTAGTCCGCTACCTGATAGATGTGAAGCAGAGTAAAGCAACGTGGAGTTGGATAAATGCTCTGGCTCAGATAGTGGTATCAGGATTCACCGGTGTTATTGGTGGCCTGATCAGCATCGAAAGTGGATTCAGTATTTACATGATTCTCGCGACAGCGGGGATTAGTGGTGCGATGGGTTCGGTTGCACTGACGTACTTCTGGGAACGACTGACAGGGGTGAAAAATGCAAAATCTTAATCCTCAGCGTAAGGCTTTCCTCGATATGGTGGCATGGTCAGAAGGAACGGATAACGGACGGCAGAAAACCAGAAATCATGGTTATGACGTCATTGTAGGCGGAGAGCTATTTACTGATTACTCCGATCACCCTCGCAAACTTGTCACGCTAAACCCAAAACTCAAATCAACAGCAGCCGGGCGCTACCAGCTTCTTTCCCGTTGGTGGGATTCCTATCGTAAGCAGCTTGGCCTGAAAGACTTCTCTCCGAAAAGCCAGGATGCTGTGGCATTGCAGCAGATTAAGGAGCGTGGCGCTTTGCCGATGATTGATCGCGGTGATATCCGTCAGGCTATCGACCGTTGCAGCAATATCTGGGCTTCGTTGCCCGGGGCTGGTTATGGCCAGTTCGAGCATAAGGCTGACAGCCTGATTGCAAAATTCAAAGAAGCTGGCGGAACAGTCAGAGAGATTGAGGTATGAGCAGAGTAACCGCGATTATCTCCGCTCTGGTTATCTGCATCATCGTCTGCCTGTCATGGGCGGTTAATCATTACCGTGATAACGCCATTACCTATAAAGAACAGCGCGATAAGGCCGCATTCATTATCGCTGACATGCAGAAACGTCAACGCAATGTAGCAGAACTCGACGCCAGATATACAAAGGAGCTTGCTGATGCTAACGCGACTATCGAAAGTCTCCGTGCTGATGTTTCTGCTGGTCGTAAGCGCATGCAAGTCGCCGCCACCTGTGCAAAGTCAACGACCGGAGCCAGCAGCATGGGCGATGGAGAAAGCCCAAGACTTACAGCAGATGCTGAACTCAATTATTACCGTCTACGAAGTGGAATCGACAGGATAACCGCGCAGGTTAACTACCTGCAGGAGTACATCAGGACGCAATGCCTGAAATAATTTTTTTGCAAATCACAAAGTCCATTTAATGAGCCTCGCGATGCGGGGCTTTTTGCAATAAATGCGTACCGCAACGCATGTTTTTTACACCGAACCTGCCCCTTTGGAATGGGCCTTTGAGGATACCAGTTAGTGCTGGCGAGCCTCGGTGGGCTGGTTTCCTGTGCGGCAAAGGTTCATTTCAAAGAGTAGGTACACGCTATGGAATCATTAACCCTCTTCAATCAACCAATTCGTATCGGTGAAGATGGCATGATCTGCCTCACTGATATGTGGAAAGCCAGTGGTAAAAGTGAATCTGAATCGCCTTACCACTACCTGCGAAACAAGCAGACCAAAGAGTTCTTAGCCGAGCTGGAGAAAAACCACGAATCTGTGGTTTTTACTGAGCGCGGTGTACACGGTGGAACATATGGCGGGAAGTTTGTTGCTTACGATTATGCGGCTTGGTTAAACCCCGGGTTCAAGTACGCGGCCTATAAAGTCCTCGATGACTACTTCACCGGAGAACTTCAGCATCGCAACAGCTTAAGTGCGCAGCTCAATATGAAGTGTCATGAGTTTGACCAGAAGAAAGACATGGCGAGCTTCTGCGGACAAGGGCTGGCAGCATGGCGCTATACGAAGCCAGTGTTGGTCGCTGAGATTAACTCCCTGGCTAACCAGCTGCAGATTACGATCCCCGGGCTTCCGGGATGAGTGATCGTGTCATTGAATGCGCCTCCAGAGCGGGGCGTGACTTCTCAGAGTTCATGAAAGGTGAGAAGGGC
ATGATGGAAGCATTGGCCTCGGTGGATGAGTTTGGCGAGCAGCTGCGCCTCAACGGCTGTGTCAATCATCACTTTGTTAGCTACATGATGCGGAACTCGATCATGCAGGCATTCATGGACATGGCAAAAGCCGAGAGGAAAGAAGAGCGCCGGCGTAAGCGAGCGGAAGCAAAAGCGAAGTAGCCATTACAAAGCTCATCTGCGGGTGGGCTTGATAATGGTGATGAACATCTCAATTTTTATTGTGGTTGACATAAAATCCTTTATTTTCATTGTGATATAATTATCAGGCATCAATGGAAAGGAGGTGCATATGTTCGACGTAGTTGTGTTCGGCGCGGGGCGTTTTGGTTCGGTTTGCCACGTTGAAAGGCATGAGTTGACGATCAAGGTTCCGGACTGTCAAGTGAAGGCTCGGGAAATGGTTGGTGGCGTGGCGGTCATCCCCGATGTGGAGTATCAGATCTTTGAATTCCAGGTGGATGACGAGATTTATCTGATCGGAGTTAACGGACGCAGGCCAAGCGACGATGTGATTAAGTCGCACATCAGGCATGGCAACCCAAAGCCAAAGCCATACAAAAAACTATAACCGCCTCCGGGCGGTTTTTTATTGGAGTTAAAATGCCACCTCGAACACCAAAAGCTTGCCGTGTTCGCGGCTGCCGTTCCACGACAACAGACCCATCAGGTTACTGTGAGTCGCACAAAGGCGAAGGCTGGAAATCCTACAAATCAGGACAATCAAGGCATCAGCGTGGATATGGGACGAAGTGGGAAGTCATACGCGCTCGCATACTGAAGCGCGACAAAGCTCTCTGCCAGAACCATCTACGTCAGGGAGTAGCAAAGCAGGCATCATGTGTCGACCACATCAAGGCTAAGGCCCATGGCGGTACTGATGAAGACAGCAACCTTGAGAGTCTGTGCTGGTCGTGCCACGCCGCGAAGACCGCGCGTGAGCGGCTCAAGTGAGAATCGATGTCATCATCAGCCAGGGGAGGGGGAGGTCAAATCCCTGCGGCCGCGTGCCTTCCGGACTGCCCGCCCCATCGTTTTTTTATACCCGCGAAAAATGAAATTTAACCAGGAGTGCCGCATATGGCTGGAACGGCGGGGCGTTCCGGGCGTCGCCCCAAGCCAACGGCGCGCAAGGCGCTGGCCGGAAACCCCGGCAAGCGAGCCCTGAACAAAGATGAACCTGTTTTTACGCCCATCAAAGGTGTTGAGCCACCGGAGTGGTTCGCTGAAGAAGATCTCCCTCTCGCCACGATCATGTGGCAATTGACAACTAAAGAACTTTGCGGTCAGGGCCTGCTGTGCGTGACTGACCTCGCGGTGCTTGAGCGGTGGTGTGTGGCCTATGAGTTCTGGCGACGTGCCGTGAAAAATATTGCCAGACAGGGCAACACCATCACCGGTGCAATGGGCGGCAGGGTCAAAAATCCTGAGCTGACCGCCAAAAAAGAACAGGAGTCCGAGATGAGCAGCACGGGGGCAATGCTCGGACTCGACCCCAGCAGCCGCCAGCGTCTGATTGGCCTGGCGGGGCAGAAGAAAGCTACTAACCCGTTTCTGAAAATCATCGAATCATGAGCCGGAAATCTTACCCCAACGTAAATGCTGCCAATCAGTATGCCCGGGATGTTGTGCGCGGAAAGATTGTGGCCTGCCAGTTTGTGATTCAGGCCTGCCAGCGCCATCTTGATGACCTGATGGCGGAAAAAAGTAAGTCGTTTCGTTACCGCTTCGACAAGGACCTGGCTGAACGGGCCGCCAAATTTATTCAGCTGTTGCCGCACACCAAGGGTGAGTGGGCATTCAAGAGGATGCCCATCACGCTGGAGCCGTGGCAGCTCTTTGTGATCTGCTGTGCGTTTGGCTGGGTCAATAAAGGCTCCCGGCTGCGCCGCTTCCGGGAGGTGTATACCGAAATCCCCCGTAAGAACGGCAAATCGGCAATCTCTGCCGGTGTCGCCCTGTATTGTTTTGCCTGTGAT
AACGAGTTTGGCGCGGAAGTGTATTCCGGTGCCACGACAGAGAAACAGGCGTGGGAAGTCTTTCGCCCGGCGCGACTGATGTGTAAACGCACACCCATGCTGACGGAAGCGTTCGGGATTGAGGTTAACGCCTCAAACATGAACCGTCCGGAGGATGGCGCGCGGTTTGAACCGCTGATCGGTAACCCCGGTGATGGATCATCCCCCCACTGTGCGGTGGTGGATGAATATCACGAGCACGCCACCGATGCGCTTTACACCACGATGCTTACCGGGATGGGGGCGCGACGTCAGCCACTGATGTGGGCCATTACTACTGCCGGGTACAACATTGAGGGGCCGTGCTACGACAAGCGGCGGGAAGTCATCGAGATGCTCAACGGCTCGGTGCCCAACGATGAACTGTTCGGGATCATCTATACCGTTGATGAAGGTGACGACTGGACCGACCCGCAGGTGCTGGAAAAAGCCAATCCAAATATTGGCGTGTCGGTTTATCGCGAATTTTTGTTAAGTCAGCAGCAGCGTGCGAAAAATAACGCCCGTCTGGCAAACGTCTTTAAAACAAAACACCTCAATATCTGGGTGTCGGCGCGTTCGGCGTATTTCAACCTGGTGAGCTGGCAGAGCTGCGAGGATAAATCACTGACCCTTGAGCAGTTCGAGGGGCAGCCGTGCATTCTGGCCTTTGACCTGGCGCGTAAGCTGGATATGAACAGCATGGCGCGACTTTATACCCGCGAGATTGACGGTAAAACGCATTACTACAGTGTGGCCCCGCGTTTCTGGGTACCGTATGACACGGTGTACAGCGTCGAGAAAAATGAAGATCGACGGACAGCCGAACGCTTTCAGAAATGGGTGGAAATGGGCGTTCTGACCGTTACCGATGGTGCAGAGGTGGATTATCGCTACATCCTCGAAGAGGCCAAAGCGGCGAACAAAATCAGCCCGGTCAGTGAGTCACCCATCGACCCTTTCGGAGCGACCGGGCTCTCACATGACCTTGCTGATGAAGACCTGAACCCCATCACTATCTTCCAGAACTTCACCAATATGTCCGACCCGATGAAAGAGCTGGAAGCGTCAATTGAATCGGGGCGCTTTCATCATGATGGCAATCCCATCATGACCTGGTGTATCGGCAACGTGGTCGGCAAAACCATTCCGGGTAACGATGATGTGGTGAAGCCCGTCAAAGAGCAGGCGGAAAACAAAATCGATGGTGCAGTTGCGCTGATTATGGCGGTTGGCAGAGCCATGCTGTACGAGAAAGAAGACACGCTGTCTGACCACATTGAGTCCTACGGGATCCGCTCGCTTTAACTGAGGTAATTATGATCATGCTGATTCTCGCGCCTCTGGTAGGCGTGCTGGGGGCGCTTTTGCTGGCGTATGGTGCCTGGCTGATTTATCCCCCGGCGGGGTTTGTTGTTGCCGGGGCGTTGTGCCTGTTCTGGTCGTGGCTGGTGGCGCGATATCTCGACCGTACACAGCTGTCTGTTGGTGGAGGTAAATAGTGTTCTTTTCGGGATTATTTCAACGAAAAAGTGACGCACCGGTGACCACGCCAGCAGAGCTGGCGGATGCCATCGGGTTGTCCTACGACACCTATACCGGAAAGCAGATCAGCAGCCAGCGGGCCATGCGACTGACGGCGGTTTTTTCCTGCGTCAGGGTGCTGGCAGAGTCGGTCGGGATGTTGCCCTGCAATCTGTATCACCTGAACGGCAGCCTGAAACAGAGAGCCACCGGCGAACGTCTGCATAAGCTGATCTCCACGCATCCCAATGGCTATATGACGCCGCAGGAGTTCTGGGAGCTGGTGGTCACCTGTCTGTGCCTGCGGGGCAACTTTTACGCCTACAAAGTGAAAGCATTTGGCGAAGTGGCTGAACTGCTGCCCGTCGACCCCGGTTGTGTGGTACCGAAGCTTAACAGTAGCTGGGAACCGGTTTACCAGGTCACATTCCCGGATGGCTCCACGGATGTACTGAGCC
AGGAGGATATCTGGCATGTGCGCACGCTGACGCTGGACGGACTGGTGGGGCTGAACCCCATCGCCTATGCCCGCGAGGCAATATCGCTGGCGGCAGCGACCGAAGAGCACGGGGCCAGACTGTTCAGCAATGGCGCGGTGACGTCGGGTGTGTTGCGTACAGAGCAGACGCTGTCAGATCAGGCTTATGAGCGCCTGAAGAAAGATTTTGAGGAGCGTCACACCGGGCTTGGCAATGCTCACCGCCCGATGATCCTTGAGATGGGGCTGGACTGGAAGTCGATGGCGCTGAACGCCGAGGACAGCCAGTTCCTGGAAACCCGCAAGTTTCAGCTTGAAGAAATCTGTCGTCTGTTCCGGGTGCCGTTGCACATGGTGCAGAACACCGATCGCGCCACCTTCAACAATATCGAAGAGCTGGGGCTGGGATTTATCAACTATTCACTGGTGCCGTATCTGACCCGCATCGAACAGCGGATCAACACCGGACTGGTACGAAAAAGTAAGCAGGGCGTTTATTACGCCAAATTTAACGCCGGGGCGTTACTGCGCGGGGATATGAAGTCCCGTTTTGAAGCCTACGCCACCGGGATTAACTGGGGAATTTACTCTCCCAATGACTGCCGCGACCTGGAAGATATGAATCCGCGTCCCGGTGGTGATGTCTATCTCACACCGATGAACATGACCACGAAACCCTCCGATGGCAGTAAAGCCGGTAAGCAGAAGGATAACGCCAATGCAGACGAAACAACGTCTTGATGTACCGCTGAGTCTGAAATCTGTCAGTGACTCCGGTGAGTTTGAAGGGTATGGCTCCGTCTTTGGTGTAAAGGACAGCCACGATGATGTGGTGATGTCCGGGGCATTTGCTGCTTCCCTGCGGGCGTGGAGTGACAGAAAAGCGTTACCTGCGCTGCTCTGGCAGCACCGCATGGATGAACCCATCGGTGTTTACACCGAAATGAAGGAAGACGATGTCGGGCTTTACGTCAGGGGACGGTTGCTTATTGATGATGATCCCCTGGCAAAACGCGCACATGCACATATGAAGGCCGGTTCGTTAACCGGCCTTTCTATTGGGTACGTCCTGAAAGACTGGGAATATGATCGGAGCAAAGAAGCCTTTCTGCTGAAAGAAATCGACCTCTGGGAAGTCAGTCTGGTGACGTTCCCGTCTAACGACGAAGCGCGGATCAGCGACGTCAAGAACGCACTGGCCCGCGGGGAAATCCCCGAACAGAAAAAAATCGAAAGAGTCCTGCGTGATGTCGGACTCTCCCGTTCCCAGGCCAAAGCATTCATGGCCGGGGGCTATGGCGCACTGTCCCTGCGCGACGCTGAGGATGTGGGCTCTGCACTGAATGCACTGAAAAATCTGAACTTCTAATCAGGAGAAATACGATGGCGGTTGATATTAAAGATGTCGAACAGGTCGCGCAGGAGTTGCAGCAGAAGTTTGACGACTTCAAAGCAAAGAACGACAAGCGCGTGGATGCGATTGAGCAGGAAAAAGGCAAACTTGCCGGGCAGGTGGAAACCCTGAACGGGAAACTCAGCGAGCTGGAAAACCTCAAAAGCGATCTTGAAAAAGAGCTGCTTGAGCTGAAACGTCCGGCAGGTGGTGCGCAAAATAAACTGACCACCGAGCATAAAGAAGCGTTTGTGGGCTTCCTGCGTAAAGGCCGTGAAGATGGTCTGCGCGATCTGGAGCGCAAGGCATTACAGGTGGGCACCGATGAAGACGGCGGCTATGCCGTGCCGGAAGCGCTGGATCGCAACATTCTGACCTTGCTGAAAGATGAAGTGGTGATGCGTCAGGAAGCCACGGTGATCACCATTGGCGGTTCCGACTACAAAAAACTGGTGAATCTGGGCGGTACGGCTTCCGGATGGGTGGGGGAAACAGATACGCGATCCCAGACTGCCACCTCCAGACTGGAGCTGATTGAACCTCTCATGGGGGAAATCTACGGTAACCCGCAGGCCACCCAGAAAATG
CTGGACGATGCCTTTTTCAACGTGGAGGCCTGGATCAACAGCGAGCTGGCAACCGAATTTGCCGAACAGGAAGAAATTGCCTTTACCTCAGGCGATGGCACTAAGAAGCCGAAAGGGTTCCTGGCGTATGAATCCACTGATGAAACCGATAAGGTCCGGGCGTTCGGCAAACTTCAGCATATTGTATCCGGCGACGCGACTGCGGTGACCGCAGACGCCATTATCAAACTGATTTACACGCTGCGTAAGGCACACCGCACCGGCGCGAAGTTCATGATGAACAACAACAGCCTGTTTGCCATCCGTCTGCTGAAAGACACCGAGGGTAACTATCTGTGGCGTCCGGGGCTGGAACTGGGGCAGCCGTCCTCTCTGGCGGGTTACGGTATCGCTGAAAACGAACAGATGCCGGATATCGCCGCTGATGCGAAAGCCATTGCATTTGGTAACTTCAAACGGGGTTACACCATCGTTGACCGTATCGGCACCCGCATTCTGCGTGACCCGTACACCAATAAACCGTTTGTCGGTTTTTATACCACCAAGCGCACCGGCGGGATGCTGGTCGATTCGCAGGCCATCAAACTGCTGAAGATTGCAGCGGCGTAATCACTCAGGGGCGCGGAGCCGCGCCCCCTGTTCTGACGGGTGAAGAATCATGATCCTGAAACAAGATCTGAAATGGTCACCGGACGGTATGCGTGTTGAGGTCATTCGGGCCGGTGAGTATGACGACGGGGCGCTTCCTGCCCGGGTGCAGGAGATTGCACTTCAGGCCGGGTTAGCAGAGCGCGGAACCAGTGCAAAAAGCAGTAAAGCGACAAAAGAGAAAAAAGCCACGACCAGTAAAGAGGGCTGAGTATGCTTCTGACAATGGAAGAGATTAAAGCCCAACTCCGGCTGGATGAGGATTTCGATGCTGATGACCGCCATCTGCAACTGCTGGCCTGTGCGGCGCAAAAGCGGACGGAAACGTATCTGAACCGGAAGCTCTATGCACCGGATGAAACCATTCCGGACAGCGACCCGGACGGACTGCACCTGCCGGATGATATTCGTCTGGGGATGCTGATGCTTATCAGCCATTTTTACGAAAACCGCTCGTCGGTTACGGAAGTGGAGAAACTCGACATGCCGCAGAGTTTTGGCTGGCTTGTCGGCCCGTACAGGTACTTTCCGCAATGAAAATTCGTCAGGCGCAGACCAGCGCAACCTACATTCTGCCGGACCCCGGCGAACTGAATAAACGCGTCCTGATCCGCCTGCGGGTGGATATGCCCGCGGATAACTTTGGCGTGGAGCCTCAATACCCGGTTACGTTCCGGACATGGGCGAAGGTTATCCAGACCAGTGCCACCACCTGGCAGGAAACCGCGCAGACCGGGGACGCCATCACCCATTACATCACCATTCGTTACCGCCGGAGGATCACCGCTGATTATGAGGTGGTCTGCGGTGACAGTGTGTACCGGGTGAAACGTCAGCGCGATCTGAACGGGGCGCGGCGCTTTCTGCTGCTGGAGTGTACGGAGCTGGGCGAATGTAGGCAGAGTCACGGAGGCAGCAATGGCGACTCCCTTTTTTCACGTTGATGTTCAGCAGCCCGCCGAGATGCGCTTTAACCGCGCCCGTGTCCGGCGGGCGTTTGTCACGATTGGGCAGCGTCATATGCGTGATGCCCGTCGGCTGGTGATGCGCCGTGCGCGGTCGGCACCGGGTGAAAACCCCGGTTATCAGACCGGACGCCTGGCTCGTTCGATTGGTTATATGGTGCCGAGAGCCAGTAAAAAGCGAGCCGGTTTTATGACACGCATTGCCCCTAACCAGCGCAACGGGAAGGGGAACCGGATGATCTCTGGTGACTTCTATCCGGCGTTTCTGTTTTTTGGTGTCCGGGGAGGAGCAAAACGTCGTCGTAGTCATCATCGTGGTGCATCCGGTGGCAGCGGCTGGCGACTGGCTCCACGTAATAACTTCATGGTGGAAACTCTTGAAAAGAA
CCGCAGCTGGACACGCTATTTTCTGGCGCGGGAATTGCGTAAATCACTGAAGCCGGAGCGACGACACAGATGAAACTGACGCCTGTTATTGCTGCGCTGCGTGCCCGCTGCCCGTATTTTGAAAACCGGGTGGCAGGCGCGGCACAGTTCAAAAATCTGCCGGAGGTCGGAAAGCTGAGACTCCCGGCGGCGTATGTGGTACCGGGTGATGATTCTCCGGGAGAAAACAAAAGCCAGACCGACTACTGGCAGGAGCTGAAAGAGGGCTTCTCCGTGGTTGTCATACTGAGTAACGGGCGTGATGAGCGCGGTCAGTTTGCCTCGTATGATGTGGTGGACGATGTCCGGCAGATGCTCTTTAAGGCCCTGCTGGGCTGGAACCCGGAAGCGTGCGGTAACCCGATTACCTATGACGGCGGCACGCTGCTGGATCTGAATCGTCATGAGCTGATTTATCAGTTCGATTTTTCGGTCATCAGCGAGCTGACTGAAGACGATACCCGCCAGCAGGATGATCTGAACAGTCTGGATGAACTGCAAACGCTGGCGATTGATGTTGATTATCTCGAGCCCGGTAACGGGCCTGACGGCGATATCGAACATCACACCGAAATAACCCTTCCTTCCTGAGGAGCCTCATGTTTGTCAAACCTGTTAAAGGGCGGTCAGTTCCTGACCCTGCCCGCGGCGACCTTTTGCCCGCCGAAGGGCGAAATGTTGACGAGAACAACTACTGGCTGCGCCGTGAAGCAGCGGGTGATATCCGGCGCGTGAATAAAAAGGTGAACACCGATGACGATAAGCTTTAACACCATTCCGTCGAATACGCTGGTTCCGCTGTTTTATGCGGAAATGGATAACCAGGCGGCGAATACTGCACAGGACAGCGGAGCATCGCTGCTGATTGGTCATGCCAATAACGGTGCAGAGATTGTTGCCAACAGTCTGGTACTGATGCCGTCGGCAGACTATGCACGCCAGATTTGTGGTGCGGGAAGTCAGCTGGCGCGTATGGTCGAGGCTTATCGCCAGACCGACCCCTTTGGCGAGCTGTATGTGATTGCCGTTCCGGAAGCCACAGGCGCGGCGGCAACGGTTACGCTGACGGTGACCGGAGCAGCAACCGAAACCGGCACGGTGAATGTTTATGTGGGACGTACCCGCGTGCAGGCACCGGTGACCAACGGCGATAACGTCACGACGATTGCCAGCAGTATCCAGGATGCCATCAATGCCGTTCCGACTCTGCCGTTTACAGCTTCATCTTCGGCTGGTGTTGTCACGCTGACCGCGCGTCATAAGGGGCTTTGCAGGAATGAAATTCCTGTCAGCCTCAATTACTACGGCTTCGGTGGGGGCGAAGTGCTGCCAGCGGGCGTACAGATTGCCGTGGCGACGGGGACCGCCGGAACGGGCGCTCCGGTTCTCACCGGCGCGGTGGCTGCAATGGCGGATGAGCCGTTTGATTATATCGGTCTGCCGTTCAACGACACGGCCTCCGTTAACACGCTGGTGACCGAGATGAACGATACCAGCGGTCGCTGGAGCTATGCGCGTCAGCTGTATGGTCATGTGTATACGGCAAAGACCGGCACACTGTCAGAACTGGTGAACGCAGGTGACCAGTTTAACCAGCAGCACATCACCCTGGCGGGGTACGAAAAAGAGACCCAGACGCCTGCCGACGAGCTGGCGGCAAGCCGTACCGCCCGCGCAGCGGTGTTTATCCGCAACGATCCGGCACGTCCCACGCAGACCGGTGAGCTGGTGGGTATGCTGCCTGCGCCGAAGGGGAAACGGTTCACGATGATCGAGCAGCAGACCCTGCTGTCTCATGGCGTGGCAACGGCGTATGTCGAAAGCGGGGTGCTGCGCATTCAGCGTGATGTCACCACGTACAGGAAAAATTCTTACGGGGTTGCGGATAACAGCTACCTCGACAGCGAGACGCTGCATACCAGTGCGTATGTACTGCGCAAACTGAAATCCGTCATTACCAG
TAAGTACGGGCGTCACAAGCTTGCCAGCGACGGTACCCGCTTTGGTCCCGGTCAGGCGATTGTCACCCCGGCGGTGATCAAAGGGGAACTGCTGGCAACCTACCGTCAGCTTGAGCGTGCGGGGATCGTGGAAAACTACGAACTGTTTAAGCAGTACCTGGTTGTGGAGCGTGATGCCAGCGATCCGAACCGCCTGAACACGCTGTTCCCGCCTGACTATGTTAACCAGTTGCGTGTTTTTGCCGTGGTTAACCAGTTCCGTCTTCAGTATTCAGAGGAGTCTGCATAATGGCCCGTATCGGGGGAACCTGTTATTTCAAAATTGACGGTCAGCAGCTATCGCTGACCGGCGGCATTGAGGTGCCCATGAACAGGACGGTCAATGATGACATCATCGGCCTGGACGGTTCAGTGGACCGCAAGGAAACTCACCGTGCGCCTTATGTCAAAGGGACCTTCAAGGTGCCGAAGAATTTTCCGGTGAGTAAAATCACCTCGTCTGATGAGATGACCATCACTGCCGAGCTGGCGAACGGTCAGGTCTATGTATTGTCGTCCGCCTGGCTGCACGGAGAAGCGAACCATAATGCCGAAGAAGGCACGGTTGATCTTGAGTTCCACGGTGAAGAAGGGGATTACCAGTAATGAAAGAGCTTGAGTTAAAGAAACCGATTATCGCTCATGGTGAGACACTCTCCGTACTGGAGTTTGATGAACCCACCGGGAAGGATGTCCGCGAGCTGGGGTATCCCTACCAGATGAATCAGGATGAGTCCGTCAGACTTCTGGCGCATGTGGTGTCGAAATACATTGTGCGGCTGGCGAAAGTGCCGCAAAGCTCTGTCGACCAGATGTCTCCGGCAGACCTGAATGCAGCGGCGTGGCTTGTGGCTGGTTTTTTCCTCCAGGCCTGACGGCTGAATACCTCACTGATCGCTTCTTTGACTGCGCCAGCTACTGGCGCATTAATCCCTTCGAATTGCTGAATATGCCGATCAGTGAAATTCCCTTGCTGGTCAGTCAGGCAAACAGGATAGAGCAGGAGAAACGCACACATGGCTGAATTTGAGCTTAAGGCGTTGATCACCGGTGTCGACAGGCTTTCTCCCGCGCTGTCGAAAATGCAAAAGAAAATCCGGGGATTTAAACGCCAGGCGGAAGAAGCGTCACAGGGTGGGCTGGCGCTTGGTGGCGGACTGGCAGCGGGTCTGACGCTTTCCCTGAAATCTTATGCTGATCAGGAAAACGCCGCCACCGGGCTGAAAGTCGCCATGATGGATGCGAACGGCGAGGTTGGAAAGAGCTTTCAGGACATCAATAAACTGGCTATTGGCCTGGGTAACCAGCTACCCGGTACAACGGCTGATTTCCAGAACATGATGCAGATGCTGGTGCGTCAGGGGATCCCGGCAGAAAACATTCTTGGTGGTGTGGGTAAAGCGACAGCTTATCTTGCGGTACAACTGAAAAAAACACCGGAAGCGGCTGCTGAGTTTGCTGCAAAGATGCAGGATGCTACCGGAACGGCGTCAGAAGACATGATGGGGCTGTTCGACACTATCCAGAAGGCGTTTTATCTGGGCGTTGACGATACCAACATGTTGTCCTTCTTCACTAAAACCAGTTCTGTTCTGAAGATGGTGAACAAGGACGGTCTTCAGGCTGCACAGAGCCTTGCCCCCATCAGCGTCATGATGGATCAGATGGGGATGAACGGGGAGTCGGCAGGTAACGCCCTGCGAAAAGTTATCCAGTCCGGATTAAGTGTTAAGAAAATCAGGGACGTCAATAAAGTCATGGCCCGCCAGAAACTCGGGGTACAGCTCGATTTTACTGACGGCAAAGGAAGTTTTGGCGGTCTTGATAACATGTTCAGGCAACTGGCAAAGCTGCGAAAACTTACCGACGTTAAGCGAACAGGTGTACTTAAGGCAATATTTGGTGATGATGCCGAAACCCTTCAGGTGGTCAATGCACTAATCGATAAAGGAAAGGATGGC
TACGATCAGATCCAGCAGAAGATGAATAAACAGGCCAGCCTGAATAAACGTGTTCAGGCCCAGCTTGGTACGCTGTCCAACCTGTGGGAGGCAATGACGGGGACCGCAACTAACGGCCTTGCGGCTATTGGCGGCGCATTTTCTGGTGACGCCAAAAATATCACGCAATGGCTGGGGGAGTTAGGGGAAAAATTCACGAAGTTTGCGGATGAAAATCCCCGGGTTATTCGCGGCGTCGTCGGGCTTGCTGCCGGTCTTGCGATTCTGAAACTGGGATTGATGGGCGTTGGCGGTGCCATCAGTATTGTCAGCAGGATCATGTCGATGACGCCGATTGGCATGATTGCGACGGCGATAGCCCTGGCTGCGGGATTAATTATCACTAACTGGGATGTTGTCGGACCTTATTTTAAGAAACTCTGGGAAACCATTGGTCCTTATTTTGAGGCTGGCTGGGAACTCCTTAAGAAAGTTTTTGCCTGGTCGCCGCTGGGGATGGTGATCAATAACTGGGGGCCGGTTGTTAAGTGGTTTCAGGATATGTGGGACAAGCTGAAGCCAATTATTGAGTGGTTTACCGACAGTTCCGGTGACACGGTCGATGCCATTAACTCTGCGCAGTGGGGCGCGGGTGCTTATGATGCTTATGGGACGGGAATACCGGCGCGGGGATACACACCTTATCCGGCGGTGGATCCGGCTCAGTCAAACAACGCCTCCGATGCCACAGGCCCGAATCCCTTCATGATTAACAAAGCTTCTGCGCCAAAAGTTGATGGTGAGATCAAGGTCTCTTTTGTGAATTCGCCTCCGGGGATGCGGGTTATGGAAACGCGATCCAGCGGCTTTGATGTCAGCCATGATGTTGGCTATACGCGCTTTGGCAGGTAATGAAAAATTAATCTGTTAATGAGTCCCACTCCGGTGGGATTTTTTATGTACGGAGTTTATATGACGTGGAAAGACAGACTTCAGGACGCGTCATTTCGCGGTGTGCCGTTTAAGGTTGAAGAAGAAAGTGCGGGAACCGGTCGTCGTGTGGAAACGCACGAATACCCGAACCGCGACAAACCCTATACCGAAGACCTGGGGAAAATCACTTTCCGCCCGTCCATCACAGCTTATGTGGTGGGAGATGACTGCTTTGACCAGCGCGATCGCCTGATTGACGCGCTGAATAAACCCGGTCCCGGCACGCTTGTCCACCCGACATATGGTGAGCTGAAAGTCTGTGTTGACGGGGAGGTTCGGGTCAGCACATCGAAGAGTGAAGGGCGTATTGTCCGCTTTGACCTGAAGTTTGTCGAAGCGGGAGAACTCTCTTACCCCACATCAGGTGCGGCGACGGCGCAGACGCTGATGTCATCCTGTTCTGCACTGGATGACTGCATCAGTGACAGTTTCAGTGGTTTCAGTATCGATGGCGTGGCAGATTTTGTGCAGAACGACGTCGTCGGTAATGCCAGCATAATGCTGGGGTATGTTTCTGATGCGATGAAAGTGGTGGATTCTGCCGTATCGGATGCCGCCAGGCTGTTGCAGGGGGATATCTCGGTACTTCTGCCGCCGCCATCGTCAGGCAAAAATTTCGTTGAGCAGGTGCAGAAAATGTGGCGTACCGGGAAACGCCTTTATGGTAACGCCAGCGACCTGGTCACCATGATCAAAACGCTTTCCGGTGTCAGCCTCGGCAGCGATCTGCAACCGCGCGGCATCTGGAAAACGGACAGTAAAACCACCGCCACGGCGACGCAGCAGCGTAACGTGGTTGCCAGCACCCTTCGTACGACCGCAATCAGCGAAGCGGCGTATGCCGTCACCCGATTGCCTGCGCCAACAACTTCCGCGGTGATGCAGAATTCCGCAGTGGGGCAGGCAACAACACCCGCGCAGAGCACTGGCTGGCCTTCCGTCACGCATCCGGCACTGAACAATGCACCGGCGGTGAAAAACACGGTTGACCTGCCGACGTGGGAAGAACTGACTGACATTCGCGACAC
ACTGAATACGGCAATTGATAAGGAGTTGTCCCGTACAACCAGTGATGCGCTGTTTCTGGCGCTGCGCCGGGTGAAAGCAGATCTGAATGCGGATATCAACACGCGCCTTGAACAGTCTGCACGGATCATTCAGCGCACACCGGATGAGGTTTTACCCGCGCTGGTGCTGGCGGCGACCTGGTTTGATAACGCGGCGCGTGACGCGGACATTATCCGGCGTAATGCCATTACGCATCCCGGCTTTGTGCCGGTGATCCCTCTGAAGGTGCCAGTGCAATGAACGACAATGTCACGCTACGGGTAAATGGCCGGGAGTGGAATGGCTGGACATCGGTGCGCATCGGTGCCGGTATTGAACGGCTGGCGCGGGATTTCAGTGTGGAGATCACCCGCCAGTGGCCGGGAGATGAGGGTATTACCACGCTTCAGCCGCGCATTAAAAACGGTTCAAAAGTGGAGGTGCTGATTGGTGATGAGCTGGTGATCACCGGCTGGGTGGAGGCGACGCCCGTTCGTTACGATGCCCGTTCTGTCAGCACCGGTATTGCCGGACGCAGTCTGACCGCTGACCTGATTGACTGTGCAGCCGAACCGACACAGTTTAACGGACGATCGCTGGTACAGATTGCGCAGGCGCTTGCTGCGCCTTTCGGCATTGAGGTGGTGAACAACGGTGCGCCGCCGGGTGTTATTCCTGATGTTCAGCCTGATCACGGTGAAACGGTGATTGAGGTAATCAACAAAATACTCGGTCAGCAGCAGGCGCTGGCTTACGACGACCCGCACGGCAGGCTGGTGATTGGCGGTATTGGCTCAACGCGGGCACATACCGCGCTGGTACTTGGGGAAAACATCCTTTCCTGTGATACGGAGAAGAGTATCCGGGAGCGGTTTTCTGTTTACCAGGTGGCGGGGCAGCGTGCCGGAAACGACGATGATTTCGGTGAGGCCACCACAACCGCGCTGCGGGCCCGCACAGAGGACGCATTTATTGCCCGTTACCGTCCGATGTATATCAGGCAGACAGGGCAGGCTACGGGGGCAGGCTGTATTGCCCGTGCTGACTTTGAAGCCCGACAACGGGCGGCGCGGACGGATGAAACCACCTATGTGGTGCAGGGCTGGCGACAGGGTAACGGTACGCTGTGGCAGCCCAACCAGCGGGTGATTGTCTTCGATCCGGTCTGTGGTTTCGACAATACCGAACTGCTTGTCTCGGAAGTCACGTTTACTCAGGACCAGAACGGCACCCTGACGGAAATCCGTGTCGGCCCACCTGATGCTTATCTGCCTGAACCCGAAGCCCCCGGCGCGCGGAAAAAGAAAAAAGCCAGAGTACAGGAGGACCCGTTCTGATGAGTACGATTGAAGCCATGCAGCGACAACTCCTCGGCCTGATTGGGCGGGCCGTGGTGAAAAGCATCAGTGCCGCCACGAAATGTCAGACCGTGGATGTGTCCCTGATTGCCGGTGAACCCAAAGCCGGGGTTGAACATCTTGAACCCTACGGTTTTACCGCAAGGGCAAACAGCGGTGCGGAAGCGGTGGTGTTGTTTCCGGATGGCGACCGTTCTCATGCGGTGGTTGTTACGGTGTCGGACCGTCGCTACCGCCTGAAAGGGCTGCAGACGGGTGAGGTGGCTGTCTATGACGATCAGGGGCAGTCCGTGACGCTGACCCGGGAGGGGATCGTGGTGGACGGTGCAGGTAAAACGATCACGTTTCACAATGCGCCTAAGGCACGTTTTGAAATGGACCTGGAAGTGACCGGACAGGTGAAAGACCTGTGCGACTCCGGCGGCACCACCATGTCAGCGATGCGGCTTGCCTATAACGGGCATCGTCACAGAGAGAACGGTCAGGGCAGTAACACCGACAAACCTGATAAAGCGATGGAGGCATGATGGAAGTGTGGCTGACGGTGAACGGTAAACGCACCTGCGCCAGCGCACCGCTGGATCCGCTGACCCGCGCCGTGGTGATTTCCCTGTTTACCTGGC
GGCGGGCGGAGCCTGATGACAATGCCGACGTCCCGATGGGATGGTGGGGGGATACCTGGCCTGCGGTACAGAATGACCGTTACGGCTCCCGACTGTGGCTGCTTCAACGCAGCAAACTGACCAATCAACTGGTGCAGACGGTAAGGGGGTATATCCGCGAATGCCTGCAATGGATGATTGATGACGGCGTGGTGTCCCGTATTGATCTGGATATCCGCCGCACCGGGATTAATGAACTGGGTAACAGTATTACTCTCTGGCGTCGTGACGGACCGGTAATGATTTCTTTTGATGATCTGTGGAGTGCGATAACGCATGGCGGACAGTGAATTTCAGCGCCCGACGCTGGCAGAAAATATCAGTATGCTCCGTAACGATTTATTCGCCAGGCTGGACGTCAGCGACACGCTCCGGCGCATGGATGAAGACGTGCGGGCAAAGGTGTATGCGGCGGCGCTGCATACGGTTTACGGTTACATCGATTATCTGGCAATGAACATGCTGCCTGACCTGTGCGATGAGTCCTGGCTGGCACGACATGCTGCGATGAAACGGTGTCCGCGCAAGGGGGCCACGTCTGCCAGCGGGTATATGCGCTGGGAAGGTGTCAGCGATGGCCTGAAGGTGACCGCCGGGAGTGTTATTCAGCGCGATGACCTGGCTCAGTACACGGCAACTGCCGATGCAACCAGCTCCGGTGGTGTCCTGCGCGTGCCGATCGCCTGCTCAAGTGCAGGCGCGGTCGGTAACGCTGACGACGGTACGTCATTAATCCTGGTCACGCCGGTGAATGGTCTGCCGTCTTCCGGCGTGGCAGACACTCTGACAGGTGGATTTGATACTGAAGAGCTGGAAACGTGGCGCGCCCGCGTCATTGAGCGGTATTACTGGACGCCTCAGGGCGGGGCTGACGGGGACTATGTTGTCTGGGCTAAAGAAGTGCCCGGCATTACCCGTGCATGGACATACCGCCACTGGATGGGAACGGGGACTGTCGGTGTGATGATTGCCAGCAGTGACCTGATTAATCCCATTCCGGAAGAATCAACGGAAACGGCGGCAAGACAACACATTGAGCCACTGGCCCCGGTGGCAGGCTCTGATTTGTATGTATTCAGGCCGGTGGCGCATAAAGTGGATTTTCATATCCGCGTGACGCCGGACACACCGGAAATACGGGCTGCCATCACCGCCGAGTTGCGTTCGTTCCTGCTGCGTGATGGTTATCCGCAGGGAGAACTGAAGGTGTCGCGTATCAGTGAGGCGATTTCCGGTGCGAACGGGGAATACAGCCATCAGTTGCTTGCACCGGCGGACAATATCTCCATTGCAAAAAATGAACTGGCGGTACTGGGGACGATTTCATGGACGTGACAAACGATGATTACATCCGTCTGTTGTCGGCACTGTTGCCCCCCGGTCCGGCGTGGTCAGCCAGCAATCCGGCGATTGCCGGTGCGGCACCGTCATTAACCCGCGTTCATCAGCGTGCGGATGCCCTGATGCGGGAGCTGGATCCGCGCACCACCACCGAACTGATAAATCGCTGGGAGCGTCTGTGCGGCCTGCCGGATGAATGTATTCCCGCAGGGACACAGACCCTTCGCCAGCGTCAGCAACGACTGGATGCGAAGGTTAACCTGGCGGGCGGCATCAATGAGGATTTTTACCTTGCACAGCTTGCTGCCCTGGGCAGACCAGACGCCACCATCACGCGATACGACAAAAGCACGTTCACCTGCTCATCGGCCTGTACTGACGCGGTGAATGCGCCGGAATGGCGGTATTACTGGCAGGTCAACATGCCAGCCGCCACCAACACCACCTGGATGACATGTGGTGATCCCTGTGATTCCGCACTGCGTATCTGGGGCGACACCGTTGTCGAGTGTGTGCTTAACAAACTCTGCCCGTCGCATACCTACGTAATTTTTAAATATCCGGAGTAATCCATGCATCGTATAGACACGAAAACCGCGCAGAAGGATAAGTTCGGCGCG
GGTAAGAACGGTTTTACCCGTGGTAATCCCCAGACTGGTACGCCTGCCACCGATCTGGATGATGACTACTTTGACATGTTGCAGGAGGAGCTTTGCAGCGTTGTGGAGGCATCCGGTGCCAGCCTGGAGAAGGGGCGGCATGACCAGCTGCTTACCGCGCTTCGTACGCTGCTGTTAAGCCGCAAGAATCCGTTTGGCGATATCAAATCGGATGGCACGGTGAAAACGGCTCTCGAAAACCTTGGTTTGCAATACTCGATCAGTGAACCAGATACAGAAACTGTTGTTTACACACTTCCTGGTGGATATAAGCTCATGGCTTTTAACCGATTAGTGAACAACTCAACCGCTGTCGGTACGGTTGTAACAACCCACATTACGTTCCCTCAGGCCTTCCCTTCGCGATTGATTGCTGTGTTCGCTACGAAGAAAAACTATGTTCAGGCAGCTGTGAGTTGCGAGAACCAGTCCCTGACTGGCTTTGATGCGGTGGTAACGCTCATTACAACTATTGCGGGTGGTATTACAAGCACTCGCGCAATGTTTCTGGCTATCGGGAAATAAAAATGACCTATGCATATAGCGCTTCAAAAAATGCATTCTATTCGCATGAATGGAAGGAAGAGTATGATGCAGCCGGTACATGGCCTGCAGATGCTGTAGATGTCAGCGATGATGTCTGGAAGGAATATTCATCAGAGCCACCAGAAGGAAAGGTTAGGGCGTCTGATTCTTCAGGACTGCCTGTCTGGATTGATAAACCGGCTCCGCCTAATTCTGAGTTGCGCAAAGCAGCTTTGTCCGCCTTGAGTAACACCTATCAGGATGACATTGAGAAGCTTAACAGGGCATGGCTGGCGGCGGCAGTAAATGATGGTGTTAATGAAACCGCCAAAAAAGACGTGGTTCTGGCGCAGATCAACACAAGAAAGACACAATATGCAAATGACAGGGCAGCCATAATTGCTCAATACCCGGTTTAATGGAGAAAACTATGTCAGATAACAATGCGGCACCCGATAATGAGAATCAGGCATCAACTGCAACTACTGAGGTAAGATTTTGCCCTATCTGTGGAACGCAGATGTACCAGGGTGAGCGGTATGGGTTTTTGTGCTGGATTTGTCCGGAATGCGACTTTGACGAGCCCGTAAGTTAATAAACTGATCCATAAAAATAGGCCGCGTTTGCGGCCTAAGTATATTAAGCAATCTTAGCTATCATGGAACGCGTAAGAAGAAAATCCCTTACCTTATTCGACAGCCATACTTCTATGAAGTTTCTTGAAACATGTGCAACTAGTATTGTTGACGACACAATCAATAATCCCATCAAAGGTCCTCCTGCATATTCCTCATGCCCAGACTCTATAAATATTATCTGAAGCAGCTCTTTCACAAACGGATGCAAGAGGTACACCGAAAAAGATATGCTCCCCAGATATGTCAAAGGCGCACAAACAAACCCAAACCTGCTCTCATATATCACGCAGACAAAGACAAGTGCAGCTGAGACAAGCCCCCACTGCAAGAATCCCCATCCAGCCCTAAATCCAGAGAAATACTGCCACGCAAAGGCTGTAACAACAGCGAAAACCAACCAGGTAAGCGATGATTTACTTACTTTGCTAAAATCAACAAAATCTACCAACTTGGCAATCAATACCCCGGCAAGAAAATTTATAATGATAGGGTTTGTCACAAGGGACATATAGGCAACACTAAAACCATAATCAACCTCTGTGACGAAGCTAAATGATGAAAAGATAGATGGTATTATGTAAAGAATAGTCATGAACGCTAAAGAGAAAACAGTCCACATCCATCTATTAAATGCAAGAGATACAGCAAATACAGCATAGAAGAACATCTCATAGTTCAATGTCCATCCAACAAAGAGAGACGATAACCCAAGATTCGGAGCGTTTCTGGCGTCAAGAGGATAAAACAGCAATGATTTTATTATGTCAGTATAATTATCGAATTCGTGAGCCGTGGGAA
CTGAATATTTAAGGACTATCAGCCATAGTATTGTCGCTATGAAATAAAGTGGATATACGCGTGAAAACCGCTTTATGATAAATAAGATTGACTCATGAACGCCTGTCCTATTGTGCGTATAGGTCATGACGAATCCACTAATAATGAAAAATAAAGGCACGCCCATCCCACCATAAGAGAACAGAGCATCGCCAAAATTATAAAAAGGCATATTGAAATACACCTTAAAATGCACAAGAACAACCATTAAAGCGGCAATTCCTCTCAATGCCTGAATGGTATTAATTGTTCTCTTTGCGTCGTTAGAGTCATTGACCGACATCGCAACATCATCCCGTTTCGTATGCAATAAGTATTTTCTGCATTCTATCATGAGTTTTCGCGTTCTCTATGTTCAGTGACACATTTGTGATTGACGAGGCAAATCCGGATGAACTACTGTATGCATGAACAGTATTTATCGGAGGTGCATTATGGGATTCCCGAGTCCAGCGCAAGACTACGTTGAAGAGCGTATATCACTCGATAAGCGGCTTATCGCTCATCCATCAGCCACGTACATGATGATAGCTGGCACGACATATTTGCGTGCGGGCATCATGAAGGGTGCCATGCTTATCGTCGACTCATCGCTGACACCGAAAGACGGTTCTCTGCTTGTCTGTGCCATTGATGGTGAGTTCAGGATAATGCGCTACAGGACGCTACCGCATCCTTGTCTGGAGAACCCTGAGAATGGAAGGAGGGAGCCGTTACCATCGAAGGATGAGGTGTCGGATACGTCTCGGCCAGTGTTTGGGGTGATCACATACACCATCAATGATGCTCGTTATGGTGAGTTTGATGACTACCCGCTGAAGTGAAAATAGTGTTGTGTACCAAATTGCGTACCAAACTAAAATCACAAATCATGAAACCCTTGTTCATGGCGGTTCTCAGGGGTGTTGCGCGTAATCGTGAAACAAAAAGGTAGATTGTTGCTTACCGTCATTCATCATTAGGTTAAATCCGTTATTTCTGCTGTCTGCCAGAGTATCAAATATCACCGTGCTAATCAGCTTTAGCGCGACAATTTGACAGCGAGTAGCAACAGATCATGTCAGATGAAAATGAGAGGGTAGTCACATTTTCTTGCACTTTATTCCAGCCAGTTCATAAGTATTTCCGTAAAAAGAACAGCTATTTGAAACTCCTGAGGGTTTGCTGTTGAAACGCCGTCTTATTATTGCTGCTTCTTTGTTCGTTTTTAACTTATCGTCTGGTTTTGCGGCGGAAAACATTCCTTTTTCACCTCAGCCTCCACAGATTCATGCCGGGTCCTGGGTACTGATGGATTACACCACCGGTCAGATCCTCACCGCGGGTAATGAGCATCAACAGCGCAATCCCGCCAGCCTGACAAAGCTGATGACGGGTTATGTCGTGGATCGCGCTATCGATAGTCATCGCATTACGCCAGACGATATTGTCACCGTGGGGCGCGATGCGTGGGCGAAAGATAATCCGGTGTTTGTCGGTTCTTCACTGATGTTTTTGAAAGAGGGCGATCGCGTATCGGTACGTGATTTAAGCCGTGGTTTAATTGTGGATTCCGGAAATGACGCTTGTGTTGCTCTGGCTGACTATATTGCCGGTGGGCAACGGCAGTTTGTTGAAATGATGAACAACTATGCCGAGAAGCTGCATCTCAATGATACGCATTTTGAAACAGTGCATGGTCTGGATGCACCTGGCCAGCATAGCTCGGCTTATGATTTAGCTGTGCTTTCTCGCGCTATCATCCACGGCGAGCCCGAGTTTTATCATATGTACAGTGAGAAAAGTCTCACCTGGAACGGTATCACCCAGCAAAACCGTAACGGGTTGTTGTGGGATAAAACCATGAATGTTGACGGCCTGAAAACGGGTCATACTTCTGGTGCCGGGTTTAATCTCATTGCTTCGGCTGTAGATGGGCAGCGTCGTCTCATTGCAGTGGTAATGGGGGCTGACAGTGCAAA
AGGTCGTGAGGAAGAGGCAAGAAAATTACTGCGTTGGGGGCAACAAAACTTTACTATGGTGCAAATTTTGCACCGTGGGAAAAAGGTCGGTACGGAACGCATCTGGTATGGCGATAAAGAAAATATCGCCCTGGGAACGGAACAAGAGTTCTGGATGGTGCTACCGAAAGCCGAAATTCCACATATCAAAGCCAAATATACCCTTAATGGTAAAGAGCTCACCGCGCCAATTAGCGCCCATCAGCGGGTAGGGGAAATTGAACTTTACGACCGTGATAAACAGGTGGCGCACTGGCCGCTGGTTACCCTGGAATCTGTCGGGGAAGGCAGCATGTTTTCTCGCCTGAGTGATTATTTCCACCATAAGGCCTGACCTTTCTTTTGCAGCAGACTGGCAGGAGTGCGAGTCTGCTCGCATAATCAACACTCATTCCTTGTGGTTTTAATATTGCAACTATACTGTATATAAAAACAGTAATAATGGAGGCGTCATGAACTACGAGATTAAGCAGGAAGATAAACGTACCGTTGCAGGTTTCCATCTCGTTGGCCCGTGGGAACAGACGGTAAAGAAAGGTTTTGAGCAGTTGATGATGTGGGTAGATAGCAAAAATATTGTGCCGAAGGAGTGGGTTGCTGTCTATTACGACAATCCGGATGAAACACCCGCCGAAAAATTACGCTGCGACACCGTCGTGACGGTACCGAATAACTTTACGCTCCCCGAAAACAGTGAGGGCGTCATTCTGACAGAAATTTCAGGTGGTCAATATGCGGTGGCGGTGGCTCGTGTAGTCGGTGATGATTTTGCTAAACCCTGGTATCAGTTCTTTAATAGCCTCTTGCAGGACAGTGCTTATGAAATGTTACCAAAGCCCTGCTTCGAGGTTTACTTGAACAATGGCGCGGAAGATGGGTACTGGGATATCGAAATGTATGTTGCGGTGCAGCCAAAACATCACTAATTCATCTCAGGGCGGTGTGTTGACGCGAAGACCACTCTTTTTTTGAAAGCGAAAAGAGTAAGATGCGCCTTTCAATTTTTTCGCTCCTGCCGGGAAATTACACTGTTCCCGGTTTGTCCGTCGGATAATTCAGAGGCGCGCCTTCTGGCCGACAGATGAGTTATGAGCGCTTTTAATCTTCATTACGGAGTTTCTGCGTGCGTGCCGATAAGTCATTAAGCCCGTTTGAAATCCGGGTATACCGCCATTACCGCATTGTGCATGGTACTCGGGTCGCGCTGGCATTCCTGCTCACTTTTCTCATTATCCGCCTGTTTACTATCCCGGAAAGCACCTGGCCGCTGGTCACCATGGTGGTGATTATGGGGCCAATCTCGTTCTGGGGGAACGTTGTCCCTCGCGCATTCGAACGTATTGGCGGTACGGTGTTGGGGTCGATTTTAGGTCTTATCGCTCTGCAACTGGAGTTAATCTCGTTACCGCTGATGTTAGTCTGGTGCGCGGCGGCTATGTTTCTTTGCGGTTGGCTGGCGCTGGGCAAGAAACCGTATCAAGGTTTATTGATTGGGGTGACGCTGGCAATTGTTGTTGGTTCCCCGACAGGTGAAATTAATACGGCGTTATGGCGAAGCGGCGATGTGATCCTCGGCTCTTTACTGGCAATGTTGTTTACCGGTATCTGGCCACAACGGGCGTTCATCCACTGGCGCATTCAACTGGCAAAAAGTCTGACCGAGTATAATCGGGTCTATCAATCTGCATTCTCACCGAACTTACTCGAACGCCCGCGTCTGGAAAGCCATCTACAAAAACTCCTGACCGATGCCGTGAAAATGCGTGGGCTGATTGCGCCCGCCAGCAAAGAAACCCGTATTCCAAAATCGATATATGAAGGTATCCAGACCATTAATCGCAATCTGGTTTGTATGCTGGAGTTGCAAATCAATGCATACTGGGCCACGCGCCCCAGCCATTTCGTGTTATTGAACGCGCAAAAACTTCGTGATACCCAGCATATGATGCAGCAAATACTGC
TGAGCCTTGTTCATGCGCTGTACGAAGGTAATCCGCAGCCGTTTTTTGCCAATACGGAAAAATTGAACGATGCTGTGGAAGAGCTGCGTCAGTTGCTGAATAACCACCATGACCTGAAGGTTGTGGAAACACCAATCTATGGTTATGTGTGGCTGAACATGGAAACGGCACATCAGCTTGAGTTGCTATCGAATCTGATTTGCCGGGCCTTGCGCAAATAATTCCTGAACTTCAGAATCATCTTGCTGCTGCTTCGATTCAGCAAGGATAAAGGGTATGATAGTGAAAAGGGATAAAAGCATTGTCATCTGCGGCAGCTATGAGTAATGTTGGCCCTAACGAATAGCGGTTGCTTAAACGAATCCGACTCTCACATTATCAGGGGTATAAAAATGGAAACTACCAAGCCTTCATTCCAGGACGTACTGGAATTTGTTCGTCTGTTCCGTCGTAAGAACAAACTGCAACGTGAAATTCAGGACGTTGAGAAAAAGATCCGTGACAACCAGAAGCGCGTCCTGTTGCTGGACAACCTGAGCGATTACATCAAGCCAGGGATGAGCGTTGAAGCAATCCAGGGCATCATCGCCAGCATGAAAGGTGACTATGAAGATCGCGTTGACGATTACATCATCAAAAATGCCGAGCTCTCCAAAGAACGCCGCGATATCTCCAAAAAGCTGAAAGCTATGGGCGAAATGAAAAACGGCGAAGCGAAGTAATTCCCGTTTTATTCAATGAGGGTTGCCCGGCAACCCTCATTGCTCATTGATTCTTATCTGTGTATCACCGTCATCATTCTCATCTGAGAACCAATCGAAATTAACAACAGCCTTCTTCTGTATGCAGCAAGGCAAAAAGTTCTGTAACTCCATTGTTATTAACTGCACTGGTTACTAACACGTTGTGCGCTCCAGCTTCCCGTAACCAACTTTTCACCAAAGATATTTGTTCCATGCTGGCTAAATCTGCTTTGGTTACTACTCCAATGACCGGGTGATTCATGGCCCGGTAGGGCGTTTTTACCTCTAATTCCTGCTTCAGCGCTGAGATTCCTGCTTGTTGCCGGAAAAGCAGGCACCTGCCGTCTGAACGGTATCGATCCGGAAGCGTATCTGCGCCATATTCTGAGCATACTGCCGGAATGGCCCTCCAACCGTGTTGACGAACTCCTGCCATGGAACGTAGTTCTCACCAATAAATAAGCGTCAATACGGTGCTCCGTTGACGCTTACTTTAATCCGTTTTCTGAGCTTAGATTTTTCTGTTCCTACCCAGACCGTGGATTTGGAATTCATTTTTTGTCAGGTCAATGCTAATCGTTAAAACGGCCATAACGCACCTCCACTGGAAGGTTTATCAGTAATATCCTACCGTTGCTTCATGGTCGGGGCGTCCATTCCATTATTATCGGTTTTCCTTTCTGTTTATAAATGAAAACGCCAGCTGTATTCAGGCTGGCGTCAGGGGAAATGAAGCCTGTTGAGTGAGATTCACCGGTTCTGGTGCAGAAGCTGCAGATGGCGCAGGGTAAGGACATCTTCTCTTGCCTGCTTAATACAGGACAACAGTGCTGTTTCTGTGGTTAATGGCACTATACCCTGCATCACTGCATTGTCTTACAGCAACTGGTGAGGGCTGGCATAAGTTGCCGGTGTTTCGGTAGAGTCACGGTCATCCACCACGCGGAGTGAATAACCTTTTTCAGCGGCCTTATTAATCAGTTCAGTGATATTAACACCATCAATGTCAACGACAATGTGCCCCATATCCAGCGCCTGTACGTTAACGCTGTCGGCTTCCAGTGTCATCATTTCGCCTCCGGATACTTACCCAGGGTAATGTTATTTACCGTTCTGTAATTGTCGCGGGTCATCAGGCCGGTCGCCCTGCGAGCCCGGAGGATATCGATGCTGTTTATTAACTGAGAGCGGGTACCGTTCATGGCATTGAACAGTACCCATGTTTCGTCATCATCGTCATCCGGTTCGGGTGCCATA
AATGCCCCGCCGTTGTTCAGGGTGTACAGATTCCAGATACCACCGCAGTAGTCTTCGCACAGACGGTCCATCCAGCCGAAGACACGGGGCTCCAGGGTCACCCACTGTGGAATGAGGCCAAAATGCTGCGGCCAGAAGCTGATGCGCTGTTCATCAGGGACGAGGCTGGCAACCAACTGAGGCGGATTATTCTCTGGTGTTGTAGCTGAATAAATTGTGGGGGTATTCTGAGAAACGGTTTTCATGAGATAACTCCTTAAATAACAAATAAATGTGTGTTTTTCAGGAACGAAAGCTGAATGATTCCCTCACTCAGGGAAACGGCATCAGCGCAGGCTCTCCAGCATCGTTTCTGCCATCACCCACAATGCGCGATTGAGCTTAATGTCGGTATCAATGCTGTGAATGGCCCGGGTATGGATACGTTTTCCTTTTGCACTGCGACCGGAAATCCCGCCTTTCAGCATATTCTCCTGGATGGTCTGATATGCGCTCCACAGGTCCTTACCGTAATCCTCCCGGCGTCGCGGCGTCAGAATGTCGGCGGTGGTGACGGGCTGATGTTCGTCACCATAACGGTAAGTCAGTGCCGCCTGTGCCAGCGCCTGGCGTGCCGGTGGCGGCAGGACCAGCGACTGCATGGCATCACGCTTCTCCTCTATCCGGTCAAACACGCCCACCACCTCGTAAGCCCCTTCGATAACTTTCTCCACTACATTTCCCCGGTGTGGAACACGCACTTCCCCCAGAGACTGGCCACAGACGCACCCGTTCTGGCAGACGAACCTGAAGTAACCCGGCAGCATCTGGTAGCTGGAGGTACCGTCATGAGAGTTGAGCAGAATAATTTCAGGGACATGTTCTCCGTTTATCTCTCCGGCCCGCCGCAGACGCAGCATGTGTTTTGTGTATCCCCGGCGGCCCGGGTCGCGCACACGGGTCTGGCAGGCGAAGAACGGCTGAAAGCCTTCCCGCTGCAGATTTTCCAGGACGGTAATGGTGGGAATGTACGCATACCGTTCACTGCGGGAGGTGTGCCGGTCTTCCCCAAAAATACTGGGTACATGGCGTATCAGTTCTTCATGTGTCAGCGGACGGTCTGTGGGGTACAGCGACACATACAGCGGTAGCCCGTCTGCCAGAGCATCAGTATCAAGATAATGATCCATCAGCGCTGTCAGGGGCTCATCTGACAACAACGGCTGAGCCATCAGATTGTCAAACCCGTTGATTGTCGATATATGGCGAAGAAGCGTCGTAAGCAGTGCTCCTGCACGTCCGGGGAACCGGCTCAGTTTTTCCGGGAACGTGGCGGATGAGCGTCGGTACAGAGGTCATGGGGCGGTTACGGTTAGGCAGAATTTCCTGCCCCACAATGGCGTTGATGGTGGTTGATTTGCCTGCTTTCATCGTGCCGACAACAGCCAGCACCATCTCCCTGCGGGCGGTTTTGATTTGTTCGCCCTCCAGCTCCTCAATTCTTTTCAGTGCTTTCTGTTTATCAAAAAGCAGTCTGTTTTCGTTCTTACTGTCAGATAACACATCTGGCTCCTCAACCATTTGCCGGAGCAGATTAATGTTCAGTTGCAAAAGTCGGTCGGCTTCATCACAAAGCAGGGCGATGTTCTTTTCGTGCATTATTTAACCTTCCTAAGACGCATTTTCATGACAACCACGGAGGAGGTCATTTACTACAAACGATCGTTTATAACGATCGATATGATGGTAACGATCGATGTATTCTATTAACTTGAGAAACACAACGCAAGGGGACAACCCGGGTATGGCTGCGTTTGCATAGTGTTTTCATGGAGTTAAAAATCTGTAACAACAGCCTGGTACATCACCACCGGGTTCAGAAGAAAATCCAGTTCCATACCGCGCGGGCAACAGAGACCACCGAGTCACGGACGGCGCGCAGAACCGTGCGTGCAACAGAGGCAATGCAGACGCTCTCCGCCGTGTCAAATATCCGGTCCACCGCACCGGTAAACTGTTCA
CGGGCCTGAGACCGGACAGACTCCGTGCGCAGCTCGTCCTGCAGGCGGGTCATCAGGGGACTGGCGGCATGGTCGGGAAGTGCTGTCATGAGCGCACTGACCAGCGTATCCAGTTCCCAGCCGGTGCGGGCCGATACGGCAACAACCCGATGTACGGGCCGGAACAGACGGAATACCGCCTCCGTTTTTTCGCGAATGTTCTGTGCCTCTGCGGGAGAAGGCTGAATACCGGCCATATCCCATTCATGGCAGGGCTCCGTTTTATCGGCCTGCATCACCACAAACAGTACCCGCTGATGTCCCCGGTGCAGGATGTGTCGCCAGAAATACTCATCCACAGACAGGGCACGGTCATCGGCTTTAATCAGCCACAGTACCAGGTCCAGTTCGAGCAGAATGTCACGGTACAGGGCTTCATACTCTGCATCCCTGTCCCGGCTCTCGCCCACCCCGGGCAGGTCAGTGATAACCATGCTGTGACCATGGCCATTCAGACGGAAGCGCTGCACTTCCCGGGTGCCGGCGTGAACATCACTGACCGGGGTGACCTCCCCCTGAAACAGCGCATTACAGAGTGAGGATTTACGGGCCCGCTTTTACCCATAATGCCAGGCACGGGTTCGTGACTGGTGAGTTTGCGCAGATGTTCCAGGATGTGACGGGAAAGTGAGTAAGGCAGGGAGGAGAGCGGTTTTTCAATTGCCTCAATGGCATCAGACGGATTCATACATGTATTCCCGGTGAAAAAAGACAAAACCCCGCGCACCGGAAAGGTGGCGGGGGATTCACGGAAATAATATCTGTCTGAAATTATCTGGAATCGATTTGGCTGTTCTGCCTGGTTGCCACCAAATCATGGTACGCCTGCTCAATCATTTCATTCAGCAGTTTTTTTGGGAGGACACCACGACTTCGGGCAATACTTTCAAGCTGTTTGTGGATTTTTCCGGACAGTTTCAGAGGGACGGAACCATCAGCACGTTCATCACGCCTTTTTTGTTGCTCCCGGGCACTTTTCAGCTCATCAAGAAGAATCGCTTTATGCTGAATCCTGGGGGCCTGGAAGGAGACCTGCGTAAACTTCGCTCTGTTCCGTCTGGGATTACTTTCCAGTAAAAACTGTACCTGTTGCTCATTCCAGCCCTCCCAGTTATCAAACGTGGCGCAGGCAAACCAGTATTTCTGCTCCGGGGTAAGGGGATTCAGCGGGGTTCCCACACACCGGACCTGCATGGCGTTCCATAACCAGTCACACTGTTCACTGTCCCTGCTGTCCACCCACGAAAAGGGGGAGGGGTAATCATTGTTTATGCTGGCCCATTCAGAGAATACTGAGTTGATAATCGTCCTCGTAGCCTCTGGTCCGGATTTCAGCAGGTAACTGGAGCAGAACCATATAATTTCACGGTAGATTTTATCCGGATGCACGGTGCGCATCGGCTTTTCTCTGGGACCGGCAATCTTCGGTTGCGATGGAAACCCCGTGGCAAACCGCAGATGTTCCGGGGGCATTCCTGAACGTTTTTCAGGAATGGGTGTGAGTCCTGCGTTTCTGGTCTTTTCCGCTTGTCCGGTATTGCCTCCGGCCGGAGTGTTGTCGGGAGTATCTTCCGGAGGCGAGGTGTTTTCATTAGTTGGGGGAACAACAGCAGAACTGTCCATCTCCGGAATGGGTTGAGGGGCATAATCACTGTGGTAGCCAGGGGAAAGCGTTAAGTCCCGAAGGGTGACATCAGGTTCCGGGCAGTGAGTGCCACCGGGAGTGAACGGGAGTTGTTGCTTTTGGGAGAAAAATGTATTGTGAATGCGGGTATACAGCTGGAACAGAAAAAAATAAGATGCCCGTGGTTCTTTTTTTAACCACTGTGTATATTCCTCTATGGCACCGAGCTGTTGTTTGTGCCGTTTTACAGCATCGATATACGGACAGCTGGTGCAGATATCATCCAGTTCCTTTCCCACAATACACCTCACCTGCTTTTCTCACATTTGCAGATTTT
AAAAAAGCAAACAAAAACCTCCGTACATGATGTACAGAACAATCTACCAGGTAGTGGCGTCAGAAAACCTCCGGGCCGCCTGCACTTTCCGGAATGCGCCGGTTATGAAAAACCGGAAATTCAGTCTCTGTTGTAAGATGTACCCTCTGATACTACGTTTTCTCTGTGTATTTAACTCACTGTATTTATTGTCTTTTTAAGCTTGACTGATATTCTCCGGTCATAGTGTGCCGGATACACATCCCGTAAACTTCTCATTTTTATATAAAAATACCTGCACTCTCTTCAGCGCATCTGTCTCCTTTAAGTAAAAAATGTTTTTACGTATTGCTGTAAATTCATTTTAAATCAGTATATTACCTTGATATTGCTGTCTCCTGGTTTGATGCGTAATCGTTCTGCCTTTCTGGATGATTCTGGTGCTTCCGGGTGCATTCTCCTCTGCACATGCAACAGTTCGTCATCGCCACGCTCCCAAAAAGCATATAAAAACGAATCGAATTTGATATTTTTTTCAATCGTTTTTCATAGGAATTTCGTAGGGTATTCTCCGCTATAACCTCCATATTTGCTGTCTTTTTCTTCATAAATCGTATGCAATGTAGCCTGTCTGTCAAGACCTCTTCTCTGCTGTTGAGGAGACGATCTCATTTTTGGTCTTTTCCGGGAGAGAGGCTGTCAGATGAGCGAGCCGGATATACTGTCAGAGGTACACCCCGTAGCAGACCTGTGCTCCCATTTCCGCGATCCGGAACCAACAACACCCTACGGTTAATACCATTTGAAGCGTCAGGTACTGCAGGAAGTTCCCCCCTGCGCGGACATCGGTTCCTGCCGACATCTCACAATACGGCCAGACTGTCATGCAGTAACACCCGCATGCGGTCTGCGGTTTTAAGCGCACTCTCCCGTTTTCAGTGCACCGATAATGCGTCATAGTAAGTCTGGGGGAAGTTACGGAATGCCTGTTAAGTGGTGCCGTGGGTGGAGACAGCTTACTTTTCTACAGCAACTGCAGTCATTCAGGATGATCAGGAGATATTCTGTCAATGTCACGATCCTGCCATTTTCGTACCCGCTAAGATACTTCCGTTACTGATGCTGTCCGCTTTTCACCATTTCATCCTCACTGACAGGCGTTGCGTTTTGGGTCATCACCACAGCTTTACGTTTCTCTTTTAACCGGTAACGCTCTCCTTTGATATTCAGCGTGGTTGAGTGATGCAGCAGTCAATCCAGGAGCGCTGTTGTCCGGCAACCACCATATCAACTTCTTCCCGGGTGTGGCGGAGAGGTGGTATCCTGGCTGAGTTTCGAAGCGAACCGTTCTTTTTGACGGGCGCATTTTACGTCCGGGCTGAATGTAGTAACGCGGCATGGAACGTCGGCCCGTATACCATTGCCTTAATCTCCGCAAAGATAACCTCGTCATTCCGGACATTCTCTGCCAGGCATATGTCGATGTAATCCATAAACGATTTCAGCTTACCCCTTTTGTTGTCACGAACGGTGCAATAGTGATCCACACCCAACGCCTGAAATCAGATCCAGGGGGTAATCTGCTCTCCTGATTCAGGAGAGCTTATGGTCACTTTTGAGACAGTTATGGAAATTAAAATCCTGCACAAGCAGGGAATGAGTAGCCGGGCGATTGCCAGAGAACTGGGGATCTCCCGCAATACCGTTAAACGTTATTTGCAGGCAAAATCTGAGCCGCCAAAATATACGCCGCGACCTGCTGTTGCTTCACTCCTGGATGAATACCGGGATTATATTCGTCAACGCATCGCCGATGCTCATCCTTACAAAATCCCGGCAACGGTAATCGCTCGCGAGATCAGAGACCAGGGATATCGTGGCGGAATGACCATTCTCAGGGCATTCATTCGTTCTCTCTCGGTTCCTCAGGAGCAGGAGCCTGCCGTTCGGTTCGAAACTGAACCCGGACGACAGATGCAGGTTGACTGGGGCACTATGCGTAATGGTCGCTCACCGCTTCA
CGTGTTCGTTGCTGTTCTCGGATACAGCCGAATGCTGTACATCGAATTCACTGACAATATGCGTTATGACACGCTGGAGACCTGCCATCGTAATGCGTTCCGCTTCTTTGGTGGTGTGCCGCGCGAAGTGTTGTATGACAATATGAAAACTGTGGTTCTGCAACGTGACGCATATCAGACCGGTCAGCACCGGTTCCATCCTTCGCTGTGGCAGTTCGGCAAGGAGATGGGCTTCTCTCCCCGACTGTGTCGCCCCTTCAGGGCACAGACTAAAGGTAAGGTGGAACGGATGGTGCAGTACACCCGTAACAGTTTTTACATCCCACTAATGACTCGCCTGCGCCCGATGGGGATCACTGTCGATGTTGAAACAGCCAACCGCCACGGTCTGCGCTGGCTGCACGATGTCGCTAACCAACGAAAGCATGAAACAATCCAGGCACGTCCCTGCGATCGCTGGCTCGAAGAGCAGCAGTCCATGCTGGCACTGCCTCCGGAGAAAAAAGAGTATGACGTGCATCTTGATGAAAATCTGGTGAACTTCGACAAACACCCCCTGCATCATCCACTCTCCATCTACGACTCATTCTGCAGAGGAGTGGCGTGATGATGGAACTGCAACATCAACGACTGATGGTGCTCGCCGGGCAGTTGCAACTGGAAAGCCTTATAAGCGCAGCGCCTGCGCTGTCACAACAGGCAGTAGACCAGGAATGGAGTTATATGGACTTCCTGGAGCATCTGCTTCATGAAGAAAAACTGGCACGTCATCAACGTAAACAGGCGATGTATACCCGAATGGCAGCCTTCCCGGCGGTGAAAACGTTCGAAGAGTATGACTTCACATTCGCCACCGGAGCACCGCAGAAGCAACTCCAGTCGTTACGCTCACTCAGCTTCATAGAACGTAATGAAAATATCGTATTACTGGGGCCATCAGGTGTGGGGAAAACCCATCTGGCAATAGCGATGGGCTATGAAGCAGTCCGTGCAGGTATCAAAGTTCGCTTCACAACAGCAGCAGATCTGTTACTTCAGTTATCTACGGCACAACGTCAGGGCCGTTATAAAACGACGCTTCAGCGTGGAGTAATGGCCCCCCGCCTGCTCATCATTGATGAAATAGGCTATCTGCCGTTCAGTCAGGAAGAAGCAAAGCTGTTCTTCCTGGTCATCGCTAAACGTTACGAAAAGAGCGCAATGATCCTGACATCCAATCTGCCGTTCGGGCAGTGGGATCAAACGTTCGCCGGTGATGCAGCACTGACCTCAGCGATGCTGGACCGTATCTTACACCACTCACATGTCGTTCAAATCAAAGGAGAAAGCTATCGACTCAGACAGAAACGAAAGGCCGGGGTTATAGCAGAAGCTAATCCTGAGTAAAACGGTGGATCAATATTGGGCCGTTGGTGGAGATATAAGTGGATCACTTTTCATCCGTCGTTGACATTTGTGGCCAGTCTTTCTGGCTGGCGGTTCAGGGGATGTTTTCTGGTTATCGTGTACCACAAAGCAGGGATATTGCCCCGATAGTCACATTTCAGATTGATCGCGGTCTGAAGATGTGGCACTGAGGTAACGTGCTGGTGTCACTGGTATTGCTGAAGTTATGGTTTTTCGGGGTCAGGTGAGCCTGTTGTGTGTATGCTTTTTGTTCAGGTCAGAGGGGGACTGTGTGGACAGCGGCTTTTGTCTCCGGGCATTCACGCCTGAACGATGAATGCGATTTTCAGCAATATTTGTCTTTGTGACACTCTCTGGTTGAGAAGCTAATGCGCGCCCCTGTTTCCACAGAGAATGAGGCTTGCCATCAGCCGAAACATCAGTATTCCACGCCATACAGACGGCGCCTGACCATGTGCTGATGCAGGTAAAATGTTTCACTACAACTGTTCGGGTTGCTATGCAACGAAACAGCGATTCTCACGATCTTTAACTGCAACGTTTTTCCGCCTGAAAGCTGACTGCCTCCTGCTGAGGTGGGAGGCTATGATTCA
GTGCGTTTATAGTCTCCGGGGTATGCATGTACCGGAGCATACCATTTCTGGTGGATAACAAGCCGGCACATTCAGACTGCAGTGAAGATGGAGGCACCTGGTTGTGAAGAACGTTTCTGCCCGTGGCTGCTTCGCATCATCAATGGTATTCATACCCGCTCAGAAATGACTGGGTGCTGAAGCCCTCCGTTATCGCGCAGTATAATGACGTATATGATCTGTGGACGAGTGAGCGGCGTGCCTGATGTACGCCGCTATTACTGTATTTTACCACAGGGCTGGTTTTCTGCTGCCAGCCCTGTGGTTCTCAGACATCAACTGTGAGCGGCATCAACCGATGTACGGTATGCCATAAGCACGTACTGATCCATCAGATTTAGACTGGCCCCCTGAATCTCCAGACAACCAGTATCACTTAAATAAGTGATAGTCTTAATACTAGTTTTTAGACTAGTCATTGGAGAACAGATGATTGATGTCTTAGGGCCGGAGAAACGCAGACGGCGTACCACACAGGAAAAGATCGCAATTGTTCAGCAGAGCTTTGAACCGGGGATGACGGTCTCCCTCGTTGCCCGGCAACATGGTGTAGCAGCCAGCCAGTTATTTCTCTGGCGTAAGCAATACCAGGAAGGAAGTCTTACTGCTGTCGCCGCCGGAGAACAGGTTGTTCCTGCCTCTGAACTTGCTGCCGCCATGAAGCAGATTAAAGAACTCCAGCGCCTGCTCGGCAAGAAAACGATGGAAAATGAACTCCTCAAAGAAGCCGTTGAATATGGACGGGCAAAAAAGTGGATAGCGCACGCGCCCTTATTGCCCGGGGATGGGGAGTAA
Protein sequences of DBSCAN-SWA_6 >LR134000|1779850:1832609|1794295_1794658_+|VDY68150.1|DBSCAN-SWA MNTYSITLPWPPSNNRYYRHNRGRTHISAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGKLELTITELGDE >LR134000|1779850:1832609|1794657_1794795_+|VDY68151.1|DBSCAN-SWA MFESYMAERLRRRWVRLRLYRFPGSVLTDYRILKNYAKTLKGAAA >LR134000|1779850:1832609|1808419_1808689_+|VDY68173.1|DBSCAN-SWA MKELELKKPIIAHGETLSVLEFDEPTGKDVRELGYPYQMNQDESVRLLAHVVSKYIVRLAKVPQSSVDQMSPADLNAAAWLVAGFFLQA >LR134000|1779850:1832609|1789339_1790344_+|VDY68143.1|DBSCAN-SWA MAKAGLREQSRLSGANRNALIAGGIMANTAEIFNFPVPDVAQKERRVADLDDGYTRIANELLEAVMLAGLTQHQLLVFLAVMRKTYGFNKRLDWVSNEQLSELTGILPHKCSAAKSALVKRGIFIQSGRNIGINNVVSEWSTLPESGKKNKVYLKEVNLPESGKKSLPKSGKGVYPNQVNTKDKLTKDNIKPFSSENSGESSDQPENDLPVEKPDAAIQSGSRWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIRLMRERDGRNHRDMCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQQAGVTAGKSKLDLTNTDWIYGVDL >LR134000|1779850:1832609|1801294_1802536_+|VDY68162.1|portal|DBSCAN-SWA MFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLKQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKAFGEVAELLPVDPGCVVPKLNSSWEPVYQVTFPDGSTDVLSQEDIWHVRTLTLDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLTRIEQRINTGLVRKSKQGVYYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTTKPSDGSKAGKQKDNANADETTS >LR134000|1779850:1832609|1822167_1822497_+|VDY68188.1|DBSCAN-SWA METTKPSFQDVLEFVRLFRRKNKLQREIQDVEKKIRDNQKRVLLLDNLSDYIKPGMSVEAIQGIIASMKGDYEDRVDDYIIKNAELSKERRDISKKLKAMGEMKNGEAK >LR134000|1779850:1832609|1794791_1795481_+|VDY68152.1|DBSCAN-SWA MNIQYLQYVREQLMVATADLSGVTKGQLEAWLEHAQFDTGTYKRKKPRILDEVTGRMITLDNPPIFGKQSYAKGSSIALVSQVEFSTSSWRRAVLSLEEHQKAWLLWSYSESVRWEHQVAITQWAWNEFKSLLGKRKIASKTLERLKKLIWLAAQDVKNELAGRKTYEYQELASLVGVTSKNWSETFTERWVAMKHIFLQLDSQALLLLTKTRSKQKTTFSQQSIAKLD >LR134000|1779850:1832609|1813130_1813679_+|VDY68177.1|plate|DBSCAN-SWA 
MSTIEAMQRQLLGLIGRAVVKSISAATKCQTVDVSLIAGEPKAGVEHLEPYGFTARANSGAEAVVLFPDGDRSHAVVVTVSDRRYRLKGLQTGEVAVYDDQGQSVTLTREGIVVDGAGKTITFHNAPKARFEMDLEVTGQVKDLCDSGGTTMSAMRLAYNGHRHRENGQGSNTDKPDKAMEA >LR134000|1779850:1832609|1791038_1791326_+|VDY68145.1|DBSCAN-SWA MTGKEAIIHYLGTHKNFCAQDVAAVTGATVTSINQAAAKMARAGILVVDGKVWRTVYYRFATREEREGKVSTNLIFKECRQSAAMKRVLALYGRE >LR134000|1779850:1832609|1824993_1825413_-|VDY68192.1|DBSCAN-SWA MHEKNIALLCDEADRLLQLNINLLRQMVEEPDVLSDSKNENRLLFDKQKALKRIEELEGEQIKTARREMVLAVVGTMKAGKSTTINAIVGQEILPNRNRPMTSVPTLIRHVPGKTEPVPRTCRSTAYDASSPYIDNQRV >LR134000|1779850:1832609|1799367_1801101_+|VDY68160.1|terminase|DBSCAN-SWA MSRKSYPNVNAANQYARDVVRGKIVACQFVIQACQRHLDDLMAEKSKSFRYRFDKDLAERAAKFIQLLPHTKGEWAFKRMPITLEPWQLFVICCAFGWVNKGSRLRRFREVYTEIPRKNGKSAISAGVALYCFACDNEFGAEVYSGATTEKQAWEVFRPARLMCKRTPMLTEAFGIEVNASNMNRPEDGARFEPLIGNPGDGSSPHCAVVDEYHEHATDALYTTMLTGMGARRQPLMWAITTAGYNIEGPCYDKRREVIEMLNGSVPNDELFGIIYTVDEGDDWTDPQVLEKANPNIGVSVYREFLLSQQQRAKNNARLANVFKTKHLNIWVSARSAYFNLVSWQSCEDKSLTLEQFEGQPCILAFDLARKLDMNSMARLYTREIDGKTHYYSVAPRFWVPYDTVYSVEKNEDRRTAERFQKWVEMGVLTVTDGAEVDYRYILEEAKAANKISPVSESPIDPFGATGLSHDLADEDLNPITIFQNFTNMSDPMKELEASIESGRFHHDGNPIMTWCIGNVVGKTIPGNDDVVKPVKEQAENKIDGAVALIMAVGRAMLYEKEDTLSDHIESYGIRSL >LR134000|1779850:1832609|1801112_1801295_+|VDY68161.1|DBSCAN-SWA MIMLILAPLVGVLGALLLAYGAWLIYPPAGFVVAGALCLFWSWLVARYLDRTQLSVGGGK >LR134000|1779850:1832609|1791696_1793115_-|VDY68146.1|DBSCAN-SWA MKYFSTGFYVSNSTSLAEIFNECFSWVNDSPHTTFIPAQLVCDYKSEEYFIESKNERIDIITYKNKDTSLGCFRYSKISEPHKWITDISINKNLKTDTMWIQVESSVVSQDAAYLAPQPKKPLVVMRLIDKFPGGLDDIFKVSVEPHSLDDTDEHLNIAAKVINGETDNRLPIIYVSSKYFFNEHAHNIIPERLARKVCGLAHVLIEPSNRLFSIKLKNETNAKNAYAGAVGIYWPRGQNISFYRRGEKTAKEFEDELFDDVVRATTTMAPVSDSGWSEIQTRKTKDSINSLKERGEYTRELMALYEADNVAKDDQIEDLKHKISSLEHRVRTLQSQASAQGSIALNAGEETDFFDGEIKNIIIDALKTAIKNKNEFGRSYHILSSLIANNEYNKETESRRQLLKRTLTGYRSMDSATQRNLKDLGFSASSDGKHWKLTYNEDPRYSYILPKTGSDHRGSLNAISDIANIIF >LR134000|1779850:1832609|1804433_1804634_+|VDY68165.1|DBSCAN-SWA 
MILKQDLKWSPDGMRVEVIRAGEYDDGALPARVQEIALQAGLAERGTSAKSSKATKEKKATTSKEG >LR134000|1779850:1832609|1824101_1824986_-|VDY68191.1|DBSCAN-SWA MAQPLLSDEPLTALMDHYLDTDALADGLPLYVSLYPTDRPLTHEELIRHVPSIFGEDRHTSRSERYAYIPTITVLENLQREGFQPFFACQTRVRDPGRRGYTKHMLRLRRAGEINGEHVPEIILLNSHDGTSSYQMLPGYFRFVCQNGCVCGQSLGEVRVPHRGNVVEKVIEGAYEVVGVFDRIEEKRDAMQSLVLPPPARQALAQAALTYRYGDEHQPVTTADILTPRRREDYGKDLWSAYQTIQENMLKGGISGRSAKGKRIHTRAIHSIDTDIKLNRALWVMAETMLESLR >LR134000|1779850:1832609|1808063_1808420_+|VDY68172.1|DBSCAN-SWA MARIGGTCYFKIDGQQLSLTGGIEVPMNRTVNDDIIGLDGSVDRKETHRAPYVKGTFKVPKNFPVSKITSSDEMTITAELANGQVYVLSSAWLHGEANHNAEEGTVDLEFHGEEGDYQ >LR134000|1779850:1832609|1781299_1782019_-|VDY68130.1|DBSCAN-SWA MSEINSQALRAKAEKATCGEWSLEYGESRFDGDDALIHREVAGYIPICRIEGAHPESGFDEDFQMEQQANAEFIAAANPATVLALLGELETAKKRIAELEAEPVSQAYNLPELIEGMEVSIDVSTCDADLGNRYFGTVTEASELDTAKNGYILLVQDAEPNFDINGNSPVTPGGWISCSDRMPEDTKMLLAFSQGEIVAAYWNWVVNPIDYKKYRAFTYLSGNILDDVTHWMPLPEPPL >LR134000|1779850:1832609|1802513_1803164_+|VDY68163.1|head,protease|DBSCAN-SWA MQTKQRLDVPLSLKSVSDSGEFEGYGSVFGVKDSHDDVVMSGAFAASLRAWSDRKALPALLWQHRMDEPIGVYTEMKEDDVGLYVRGRLLIDDDPLAKRAHAHMKAGSLTGLSIGYVLKDWEYDRSKEAFLLKEIDLWEVSLVTFPSNDEARISDVKNALARGEIPEQKKIERVLRDVGLSRSQAKAFMAGGYGALSLRDAEDVGSALNALKNLNF >LR134000|1779850:1832609|1794008_1794299_+|VDY68149.1|DBSCAN-SWA MADLRKAARGRECQVRIPGVCNGNSETSVLAHIRLAGLCGTGTKPPDLIATIACSACHDEIDRRTHFVDAGYAKECALEGMARTLVIWLKEGVIKA >LR134000|1779850:1832609|1796567_1797005_+|VDY68155.1|lysis|DBSCAN-SWA MSRVTAIISALVICIIVCLSWAVNHYRDNAITYKEQRDKAAFIIADMQKRQRNVAELDARYTKELADANATIESLRADVSAGRKRMQVAATCAKSTTGASSMGDGESPRLTADAELNYYRLRSGIDRITAQVNYLQEYIRTQCLK >LR134000|1779850:1832609|1780727_1780919_-|VDY68128.1|DBSCAN-SWA MTDTSLIPEKEVMNKLGVSSRQTIWNYTKRHGFPKPVRTHPKSYLREAVEGWILNGGVNQKCS >LR134000|1779850:1832609|1808830_1810666_+|VDY68174.1|tail|DBSCAN-SWA 
MAEFELKALITGVDRLSPALSKMQKKIRGFKRQAEEASQGGLALGGGLAAGLTLSLKSYADQENAATGLKVAMMDANGEVGKSFQDINKLAIGLGNQLPGTTADFQNMMQMLVRQGIPAENILGGVGKATAYLAVQLKKTPEAAAEFAAKMQDATGTASEDMMGLFDTIQKAFYLGVDDTNMLSFFTKTSSVLKMVNKDGLQAAQSLAPISVMMDQMGMNGESAGNALRKVIQSGLSVKKIRDVNKVMARQKLGVQLDFTDGKGSFGGLDNMFRQLAKLRKLTDVKRTGVLKAIFGDDAETLQVVNALIDKGKDGYDQIQQKMNKQASLNKRVQAQLGTLSNLWEAMTGTATNGLAAIGGAFSGDAKNITQWLGELGEKFTKFADENPRVIRGVVGLAAGLAILKLGLMGVGGAISIVSRIMSMTPIGMIATAIALAAGLIITNWDVVGPYFKKLWETIGPYFEAGWELLKKVFAWSPLGMVINNWGPVVKWFQDMWDKLKPIIEWFTDSSGDTVDAINSAQWGAGAYDAYGTGIPARGYTPYPAVDPAQSNNASDATGPNPFMINKASAPKVDGEIKVSFVNSPPGMRVMETRSSGFDVSHDVGYTRFGR >LR134000|1779850:1832609|1825630_1826248_-|VDY68193.1|DBSCAN-SWA MVITDLPGVGESRDRDAEYEALYRDILLELDLVLWLIKADDRALSVDEYFWRHILHRGHQRVLFVVMQADKTEPCHEWDMAGIQPSPAEAQNIREKTEAVFRLFRPVHRVVAVSARTGWELDTLVSALMTALPDHAASPLMTRLQDELRTESVRSQAREQFTGAVDRIFDTAESVCIASVARTVLRAVRDSVVSVARAVWNWIFF >LR134000|1779850:1832609|1832243_1832609_+|VDY68197.1|transposase|DBSCAN-SWA MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >LR134000|1779850:1832609|1823394_1823589_-|VDY68189.1|DBSCAN-SWA MMTLEADSVNVQALDMGHIVVDIDGVNITELINKAAEKGYSLRVVDDRDSTETPATYASPHQLL >LR134000|1779850:1832609|1787694_1788450_-|VDY68140.1|DBSCAN-SWA MVVFSQQPFSFDEIKPWLYIWTMKTTLSERLKEARLARGLTQKALGDLVGVSQAAIQKIETGKANQTTKIVEIANALGVRAEWLSSGVGNMSDSTVQPIQTTVSHSKYFKIDVLDIEVSAGPGVINREFVEVLRSVEYSFDDARHMFDGRKAENIRIINVRGDSMSGTIEPGDLLFVDITVKSFDGDGIYAFLYDDTAHVKRLQMMKDKLLVISDNKSYSPWDPIEKDEMNRVFIFGKVIGSMPQTYRKHG >LR134000|1779850:1832609|1788488_1788719_+|VDY68141.1|DBSCAN-SWA MNPAIKTAINIVGSQKKLGAACEVSQQAVYKWLHNKAKVSPEHVGSIVTATGGVVKAYQIRPDLPKLFPHTEKNAA >LR134000|1779850:1832609|1798876_1799371_+|VDY68159.1|terminase|DBSCAN-SWA MAGTAGRSGRRPKPTARKALAGNPGKRALNKDEPVFTPIKGVEPPEWFAEEDLPLATIMWQLTTKELCGQGLLCVTDLAVLERWCVAYEFWRRAVKNIARQGNTITGAMGGRVKNPELTAKKEQESEMSSTGAMLGLDPSSRQRLIGLAGQKKATNPFLKIIES >LR134000|1779850:1832609|1783451_1783610_-|VDY68133.1|DBSCAN-SWA 
MTHPHDNIRVGAITFVYSITKRGWVFPGLSVIRNPLKAQRLAEKINNKQEDI >LR134000|1779850:1832609|1786133_1786877_-|VDY68138.1|DBSCAN-SWA MISILSSSIFSVPKHNGRENQDSVLYPLQTPNGYLMAIADGVGGYKGGKEASAAVINHLYSIKDSFTHEDNVLSLLKTLQEKVASLSSVDKELASAATTLTMCLLHNQGLTIIHVGDCRVYLKNGNKLIQLTTDHTQHQMLIDSGIYTARQLKNAKGKNIITTALAAKIPLQHQVINIDKDELPLDDGVLSLFIMSDGAHTFWEQRPRFSYRTLSTASRFASSLMRRIETKGPTDDYSLIAVNVKYL >LR134000|1779850:1832609|1818248_1818638_+|VDY68184.1|DBSCAN-SWA MGFPSPAQDYVEERISLDKRLIAHPSATYMMIAGTTYLRAGIMKGAMLIVDSSLTPKDGSLLVCAIDGEFRIMRYRTLPHPCLENPENGRREPLPSKDEVSDTSRPVFGVITYTINDARYGEFDDYPLK >LR134000|1779850:1832609|1820937_1821996_+|VDY68187.1|DBSCAN-SWA MRADKSLSPFEIRVYRHYRIVHGTRVALAFLLTFLIIRLFTIPESTWPLVTMVVIMGPISFWGNVVPRAFERIGGTVLGSILGLIALQLELISLPLMLVWCAAAMFLCGWLALGKKPYQGLLIGVTLAIVVGSPTGEINTALWRSGDVILGSLLAMLFTGIWPQRAFIHWRIQLAKSLTEYNRVYQSAFSPNLLERPRLESHLQKLLTDAVKMRGLIAPASKETRIPKSIYEGIQTINRNLVCMLELQINAYWATRPSHFVLLNAQKLRDTQHMMQQILLSLVHALYEGNPQPFFANTEKLNDAVEELRQLLNNHHDLKVVETPIYGYVWLNMETAHQLELLSNLICRALRK >LR134000|1779850:1832609|1779850_1780747_+|VDY68127.1|integrase|DBSCAN-SWA MTVAQCLDYWFDNYVSTTLREKTQALYRSTVMKRMHDAFPNRPASSITVKQWVDLLTEEERDNPRRARQVLSQLRSAISWCMRRQLIDSCAIMSIQPRDFGSRAEVGDRVLSYHELAKIWLAIERSRASTSNKLLHQMLMLWGARLSELRLAKKTEFDLLENVWTVPKEHSKMGNVIRRPIFEQIKPFLEKAMTTYNDVLFPGEDINKPISIAAANRFVNRIRGGMDLGYWRTHDFRRTLVTRLSEMNVEPHVTERMLGHELGGIMSVYNKHDWIEAQRKAYELHADKLFWHIRSISD >LR134000|1779850:1832609|1815139_1815724_+|VDY68180.1|tail|DBSCAN-SWA MDVTNDDYIRLLSALLPPGPAWSASNPAIAGAAPSLTRVHQRADALMRELDPRTTTELINRWERLCGLPDECIPAGTQTLRQRQQRLDAKVNLAGGINEDFYLAQLAALGRPDATITRYDKSTFTCSSACTDAVNAPEWRYYWQVNMPAATNTTWMTCGDPCDSALRIWGDTVVECVLNKLCPSHTYVIFKYPE >LR134000|1779850:1832609|1818981_1820148_+|VDY68185.1|DBSCAN-SWA 
MKRRLIIAASLFVFNLSSGFAAENIPFSPQPPQIHAGSWVLMDYTTGQILTAGNEHQQRNPASLTKLMTGYVVDRAIDSHRITPDDIVTVGRDAWAKDNPVFVGSSLMFLKEGDRVSVRDLSRGLIVDSGNDACVALADYIAGGQRQFVEMMNNYAEKLHLNDTHFETVHGLDAPGQHSSAYDLAVLSRAIIHGEPEFYHMYSEKSLTWNGITQQNRNGLLWDKTMNVDGLKTGHTSGAGFNLIASAVDGQRRLIAVVMGADSAKGREEEARKLLRWGQQNFTMVQILHRGKKVGTERIWYGDKENIALGTEQEFWMVLPKAEIPHIKAKYTLNGKELTAPISAHQRVGEIELYDRDKQVAHWPLVTLESVGEGSMFSRLSDYFHHKA >LR134000|1779850:1832609|1783606_1784287_-|VDY68134.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVIERDEKYIASFDEMVPEFIEKMDEALAEIGFVFGEQWR >LR134000|1779850:1832609|1829359_1830382_+|VDY68195.1|transposase|DBSCAN-SWA MVTFETVMEIKILHKQGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRAFIRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVLGYSRMLYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTGQHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMTRLRPMGITVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSMLALPPEKKEYDVHLDENLVNFDKHPLHHPLSIYDSFCRGVA >LR134000|1779850:1832609|1805341_1805848_+|VDY68168.1|DBSCAN-SWA MATPFFHVDVQQPAEMRFNRARVRRAFVTIGQRHMRDARRLVMRRARSAPGENPGYQTGRLARSIGYMVPRASKKRAGFMTRIAPNQRNGKGNRMISGDFYPAFLFFGVRGGAKRRRSHHRGASGGSGWRLAPRNNFMVETLEKNRSWTRYFLARELRKSLKPERRHR >LR134000|1779850:1832609|1793845_1794016_+|VDY68148.1|DBSCAN-SWA MATPLIRVMNGHIYKVPNRRKRKPELKPSEIPTLLGYTASLVDKKWLRLAARRNHG >LR134000|1779850:1832609|1793390_1793846_+|VDY68147.1|DBSCAN-SWA MNLPQDGIKLHRGNFTAIGRQIQPYLEDGKCFRMVLKPWRERRSLSQNALSHMWYSEISEYLISRGKTFATPAWVKDALKHTYLGYETKDLVDVVTGDITTIQSLRHTSDLDTGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQEA >LR134000|1779850:1832609|1795802_1796108_+|VDY68153.1|holin|DBSCAN-SWA MHEKESLAGAFWLVLLIIAGWGGLVRYLIDVKQSKATWSWINALAQIVVSGFTGVIGGLISIESGFSIYMILATAGISGAMGSVALTYFWERLTGVKNAKS >LR134000|1779850:1832609|1790340_1791042_+|VDY68144.1|DBSCAN-SWA 
MKNIAAQMVNFDREQMRRIANNMPEQYDEKPQVQQVAQIINGVFSQLLATFPASLANRDQNELNEIRRQWVLAFRENGITTMEQVNAGMRVARRQNRPFLPSPGQFVAWCREEASVTAGLPNASELVDMVYEYCRKRGLYPDAESYPWKSNAHYWLVTNLYQNMRANALTDAELRRKAADELACMTARINRGEAIPEPVKQLPVMGGRPLNRVQSLAKIAEIKAKFGLKGASV >LR134000|1779850:1832609|1830381_1831161_+|VDY68196.1|DBSCAN-SWA MMELQHQRLMVLAGQLQLESLISAAPALSQQAVDQEWSYMDFLEHLLHEEKLARHQRKQAMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRFTTAADLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFLVIAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLDRILHHSHVVQIKGESYRLRQKRKAGVIAEANPE >LR134000|1779850:1832609|1796094_1796571_+|VDY68154.1|DBSCAN-SWA MQNLNPQRKAFLDMVAWSEGTDNGRQKTRNHGYDVIVGGELFTDYSDHPRKLVTLNPKLKSTAAGRYQLLSRWWDSYRKQLGLKDFSPKSQDAVALQQIKERGALPMIDRGDIRQAIDRCSNIWASLPGAGYGQFEHKADSLIAKFKEAGGTVREIEV >LR134000|1779850:1832609|1797700_1797958_+|VDY68157.1|DBSCAN-SWA MSDRVIECASRAGRDFSEFMKGEKGMMEALASVDEFGEQLRLNGCVNHHFVSYMMRNSIMQAFMDMAKAERKEERRRKRAEAKAK >LR134000|1779850:1832609|1797206_1797704_+|VDY68156.1|DBSCAN-SWA MESLTLFNQPIRIGEDGMICLTDMWKASGKSESESPYHYLRNKQTKEFLAELEKNHESVVFTERGVHGGTYGGKFVAYDYAAWLNPGFKYAAYKVLDDYFTGELQHRNSLSAQLNMKCHEFDQKKDMASFCGQGLAAWRYTKPVLVAEINSLANQLQITIPGLPG >LR134000|1779850:1832609|1786873_1787599_-|VDY68139.1|DBSCAN-SWA MEERGNYLIDPLEVIGSGGFGLVEKIRLYNSQRKICGLYARKILRPDATDPELFTRFEREVRYQTECLHTNIVQIFICHLQNAQPWFVMELAETNLEEEIKSGTLTKAEKISIVKMVLNAVGWIHKKGYLHRDIKPLNVLKFKDGIYKLSDFGLAKNVSPDANTQLLTQIGQYPATPKYFDYNVFLNGYSKQSDIYSIGILIEELNIDGFDDIINKCTHRQLNKRFLTVEQIFEELELKRL >LR134000|1779850:1832609|1804956_1805367_+|VDY68167.1|DBSCAN-SWA MKIRQAQTSATYILPDPGELNKRVLIRLRVDMPADNFGVEPQYPVTFRTWAKVIQTSATTWQETAQTGDAITHYITIRYRRRITADYEVVCGDSVYRVKRQRDLNGARRFLLLECTELGECRQSHGGSNGDSLFSR >LR134000|1779850:1832609|1782015_1782237_-|VDY68131.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNYQTVSATHCCECGDPIDERRRLAVQGCRTCASCQEDLELISKQRGSK >LR134000|1779850:1832609|1784283_1785069_-|VDY68135.1|DBSCAN-SWA 
MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKATEQKVAA >LR134000|1779850:1832609|1806567_1808064_+|VDY68171.1|tail|DBSCAN-SWA MTISFNTIPSNTLVPLFYAEMDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADYARQICGAGSQLARMVEAYRQTDPFGELYVIAVPEATGAAATVTLTVTGAATETGTVNVYVGRTRVQAPVTNGDNVTTIASSIQDAINAVPTLPFTASSSAGVVTLTARHKGLCRNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGTLSELVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMIEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNSYGVADNSYLDSETLHTSAYVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASDPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA >LR134000|1779850:1832609|1806413_1806584_+|VDY68170.1|DBSCAN-SWA MFVKPVKGRSVPDPARGDLLPAEGRNVDENNYWLRREAAGDIRRVNKKVNTDDDKL >LR134000|1779850:1832609|1788788_1789328_+|VDY68142.1|DBSCAN-SWA MQPLTYQQTSGFSPTAVINRSQTKQAPGHEKIRDAVRAWSAVDNQDVVAALIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGHLVEQDSFMARLAEMEKELSEAKQAVILNAPRHQKLKEMSEGIVSMFRVDPDLAGPLMAMVTTMLGAI >LR134000|1779850:1832609|1816341_1816758_+|VDY68182.1|DBSCAN-SWA MTYAYSASKNAFYSHEWKEEYDAAGTWPADAVDVSDDVWKEYSSEPPEGKVRASDSSGLPVWIDKPAPPNSELRKAALSALSNTYQDDIEKLNRAWLAAAVNDGVNETAKKDVVLAQINTRKTQYANDRAAIIAQYPV >LR134000|1779850:1832609|1815727_1816339_+|VDY68181.1|DBSCAN-SWA MHRIDTKTAQKDKFGAGKNGFTRGNPQTGTPATDLDDDYFDMLQEELCSVVEASGASLEKGRHDQLLTALRTLLLSRKNPFGDIKSDGTVKTALENLGLQYSISEPDTETVVYTLPGGYKLMAFNRLVNNSTAVGTVVTTHITFPQAFPSRLIAVFATKKNYVQAAVSCENQSLTGFDAVVTLITTIAGGITSTRAMFLAIGK >LR134000|1779850:1832609|1814090_1815149_+|VDY68179.1|plate|DBSCAN-SWA 
MADSEFQRPTLAENISMLRNDLFARLDVSDTLRRMDEDVRAKVYAAALHTVYGYIDYLAMNMLPDLCDESWLARHAAMKRCPRKGATSASGYMRWEGVSDGLKVTAGSVIQRDDLAQYTATADATSSGGVLRVPIACSSAGAVGNADDGTSLILVTPVNGLPSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYVVWAKEVPGITRAWTYRHWMGTGTVGVMIASSDLINPIPEESTETAARQHIEPLAPVAGSDLYVFRPVAHKVDFHIRVTPDTPEIRAAITAELRSFLLRDGYPQGELKVSRISEAISGANGEYSHQLLAPADNISIAKNELAVLGTISWT >LR134000|1779850:1832609|1803178_1804384_+|VDY68164.1|capsid|DBSCAN-SWA MAVDIKDVEQVAQELQQKFDDFKAKNDKRVDAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPAGGAQNKLTTEHKEAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEALDRNILTLLKDEVVMRQEATVITIGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLELIEPLMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTSGDGTKKPKGFLAYESTDETDKVRAFGKLQHIVSGDATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA >LR134000|1779850:1832609|1812051_1813131_+|VDY68176.1|tail|DBSCAN-SWA MNDNVTLRVNGREWNGWTSVRIGAGIERLARDFSVEITRQWPGDEGITTLQPRIKNGSKVEVLIGDELVITGWVEATPVRYDARSVSTGIAGRSLTADLIDCAAEPTQFNGRSLVQIAQALAAPFGIEVVNNGAPPGVIPDVQPDHGETVIEVINKILGQQQALAYDDPHGRLVIGGIGSTRAHTALVLGENILSCDTEKSIRERFSVYQVAGQRAGNDDDFGEATTTALRARTEDAFIARYRPMYIRQTGQATGAGCIARADFEARQRAARTDETTYVVQGWRQGNGTLWQPNQRVIVFDPVCGFDNTELLVSEVTFTQDQNGTLTEIRVGPPDAYLPEPEAPGARKKKKARVQEDPF >LR134000|1779850:1832609|1816978_1818097_-|VDY68183.1|DBSCAN-SWA MSVNDSNDAKRTINTIQALRGIAALMVVLVHFKVYFNMPFYNFGDALFSYGGMGVPLFFIISGFVMTYTHNRTGVHESILFIIKRFSRVYPLYFIATILWLIVLKYSVPTAHEFDNYTDIIKSLLFYPLDARNAPNLGLSSLFVGWTLNYEMFFYAVFAVSLAFNRWMWTVFSLAFMTILYIIPSIFSSFSFVTEVDYGFSVAYMSLVTNPIIINFLAGVLIAKLVDFVDFSKVSKSSLTWLVFAVVTAFAWQYFSGFRAGWGFLQWGLVSAALVFVCVIYESRFGFVCAPLTYLGSISFSVYLLHPFVKELLQIIFIESGHEEYAGGPLMGLLIVSSTILVAHVSRNFIEVWLSNKVRDFLLTRSMIAKIA >LR134000|1779850:1832609|1823585_1824020_-|VDY68190.1|DBSCAN-SWA MKTVSQNTPTIYSATTPENNPPQLVASLVPDEQRISFWPQHFGLIPQWVTLEPRVFGWMDRLCEDYCGGIWNLYTLNNGGAFMAPEPDDDDDETWVLFNAMNGTRSQLINSIDILRARRATGLMTRDNYRTVNNITLGKYPEAK >LR134000|1779850:1832609|1805844_1806405_+|VDY68169.1|DBSCAN-SWA 
MKLTPVIAALRARCPYFENRVAGAAQFKNLPEVGKLRLPAAYVVPGDDSPGENKSQTDYWQELKEGFSVVVILSNGRDERGQFASYDVVDDVRQMLFKALLGWNPEACGNPITYDGGTLLDLNRHELIYQFDFSVISELTEDDTRQQDDLNSLDELQTLAIDVDYLEPGNGPDGDIEHHTEITLPS >LR134000|1779850:1832609|1781023_1781203_-|VDY68129.1|DBSCAN-SWA MSCPKCGSGNIAKEKTMRGWSGDYVCCDCGYNDSKDAFGERGKNEFVKINKEREGNEKS >LR134000|1779850:1832609|1785446_1785653_-|VDY68137.1|DBSCAN-SWA MVHQHYGTQTVNRGAVMPGMLIKHKDGTWTASANLRGRLYLHRGIERTYTRDLLVEVFLDGRGNGLNH >LR134000|1779850:1832609|1810726_1812055_+|VDY68175.1|DBSCAN-SWA MTWKDRLQDASFRGVPFKVEEESAGTGRRVETHEYPNRDKPYTEDLGKITFRPSITAYVVGDDCFDQRDRLIDALNKPGPGTLVHPTYGELKVCVDGEVRVSTSKSEGRIVRFDLKFVEAGELSYPTSGAATAQTLMSSCSALDDCISDSFSGFSIDGVADFVQNDVVGNASIMLGYVSDAMKVVDSAVSDAARLLQGDISVLLPPPSSGKNFVEQVQKMWRTGKRLYGNASDLVTMIKTLSGVSLGSDLQPRGIWKTDSKTTATATQQRNVVASTLRTTAISEAAYAVTRLPAPTTSAVMQNSAVGQATTPAQSTGWPSVTHPALNNAPAVKNTVDLPTWEELTDIRDTLNTAIDKELSRTTSDALFLALRRVKADLNADINTRLEQSARIIQRTPDEVLPALVLAATWFDNAARDADIIRRNAITHPGFVPVIPLKVPVQ >LR134000|1779850:1832609|1804636_1804960_+|VDY68166.1|DBSCAN-SWA MLLTMEEIKAQLRLDEDFDADDRHLQLLACAAQKRTETYLNRKLYAPDETIPDSDPDGLHLPDDIRLGMLMLISHFYENRSSVTEVEKLDMPQSFGWLVGPYRYFPQ >LR134000|1779850:1832609|1785074_1785371_-|VDY68136.1|DBSCAN-SWA MNAYYIQDRLEAQSWARHYQQIAREEKEAELADDLEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIARHQVDIDSEV >LR134000|1779850:1832609|1813678_1814104_+|VDY68178.1|tail|DBSCAN-SWA MEVWLTVNGKRTCASAPLDPLTRAVVISLFTWRRAEPDDNADVPMGWWGDTWPAVQNDRYGSRLWLLQRSKLTNQLVQTVRGYIRECLQWMIDDGVVSRIDLDIRRTGINELGNSITLWRRDGPVMISFDDLWSAITHGGQ >LR134000|1779850:1832609|1782390_1783455_-|VDY68132.1|DBSCAN-SWA MSQVGNHSFEFPASQGVQGGTVTLFLTIPGRSLARFLASDNYGHTLERSQREINPNRVRKFLNYLTNADSRNEPFIIPPLVGNCDSNIEFVPFGNTNVGIARIPLDAEIKLFDGQHRAAGIEIFCRSSPSTLMVPMMLTMNLPLKTRQQFFSDINNNVSKPSATINMAYNGRDDIAQGMISFLTQHTVFADITDFEHNVVPLKSNMWVSFKALTDATSKFARNGNQQLEMGYIESVWEAWITLTQIDSIRHGVHHATYKRDYIQFHGVMINAFGFAVQQMMVNHSIAEITSMIEKLCATTSSAEREDFFLMDNWAGICTKASQEKLSVIANVAAQKAAANRLIQAFTKGSLEST >LR134000|1779850:1832609|1820266_1820740_+|VDY68186.1|DBSCAN-SWA 
MNYEIKQEDKRTVAGFHLVGPWEQTVKKGFEQLMMWVDSKNIVPKEWVAVYYDNPDETPAEKLRCDTVVTVPNNFTLPENSEGVILTEISGGQYAVAVARVVGDDFAKPWYQFFNSLLQDSAYEMLPKPCFEVYLNNGAEDGYWDIEMYVAVQPKHH >LR134000|1779850:1832609|1826585_1827737_-|VDY68194.1|DBSCAN-SWA MGKELDDICTSCPYIDAVKRHKQQLGAIEEYTQWLKKEPRASYFFLFQLYTRIHNTFFSQKQQLPFTPGGTHCPEPDVTLRDLTLSPGYHSDYAPQPIPEMDSSAVVPPTNENTSPPEDTPDNTPAGGNTGQAEKTRNAGLTPIPEKRSGMPPEHLRFATGFPSQPKIAGPREKPMRTVHPDKIYREIIWFCSSYLLKSGPEATRTIINSVFSEWASINNDYPSPFSWVDSRDSEQCDWLWNAMQVRCVGTPLNPLTPEQKYWFACATFDNWEGWNEQQVQFLLESNPRRNRAKFTQVSFQAPRIQHKAILLDELKSAREQQKRRDERADGSVPLKLSGKIHKQLESIARSRGVLPKKLLNEMIEQAYHDLVATRQNSQIDSR >LR134000|1779850:1832609|1798088_1798367_+|VDY68158.1|DBSCAN-SWA MFDVVVFGAGRFGSVCHVERHELTIKVPDCQVKAREMVGGVAVIPDVEYQIFEFQVDDEIYLIGVNGRRPSDDVIKSHIRHGNPKPKPYKKL |
71 | Enterobacteria_phage(29.51%) | holin,capsid,tail,portal,head,terminase,protease,plate,transposase,integrase,lysis | attL 1770518:1770532|attR 1784904:1784918 |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
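The keyword rule described above can be sketched in Python. This is only an illustration, not the DBSCAN-SWA implementation: the function names `matched_keywords` and `header_labels` are hypothetical, case-insensitive substring matching is an assumed simplification, and the header layout (contig, region, gene coordinates, protein ID, optional comma-separated labels, tool name) is inferred from the FASTA headers in this record.

```python
# Keyword list copied from the description of colored proteins in this record.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation: str) -> list[str]:
    """Return the phage-related keywords found in a protein annotation.

    Assumption: plain case-insensitive substring matching, which can
    over-match (for example 'tail' inside an unrelated word).
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def header_labels(header: str) -> list[str]:
    """Extract the optional label field from a header of this record.

    Assumed layout (inferred from this record, not from tool documentation):
    >contig|region|gene|protein_id[|label1,label2]|DBSCAN-SWA
    """
    fields = header.lstrip(">").strip().split("|")
    # Six fields means the optional label field is present at index 4.
    return fields[4].split(",") if len(fields) == 6 else []


if __name__ == "__main__":
    print(matched_keywords("phage portal protein"))
    print(header_labels(
        ">LR134000|1779850:1832609|1802513_1803164_+|VDY68163.1|head,protease|DBSCAN-SWA"
    ))
```

Headers without a label field, such as the one for VDY68150.1, yield an empty list under this layout, which corresponds to an uncolored protein.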
DBSCAN-SWA_7 |
2229639 : 2272322
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >LR134000|2229639:2272322|DBSCAN-SWA ATTATTTTTGCGACAGTTGATCCAGCTTGTCGCGAAAACCGGTAACAGAAATGGCCCGGTTATCGGCGCGCCAGCGATCTTTCGCGGCAGGAGCCGAACTTTGCACCCCAATTAATTGCCAGCCGTCATCGGTATGCAACATCAGAGGCGAACCGCTGTCACCCGGCAAGGTATCGCACTGATGTGACATCACCGACGTTTGCGCCCAGCCAGTCACTTCACAGTTTTGATGACTGTACAACGTATCGAGATGATCTTCAGGGTAGCCTGCCTGAGTCACTTTACGACCTGCCGCTTTTAATGCGGCGGTAAGCGCGGCTTTATCTCCCTCAAATAACGGCAACGGCGTAATGCCAGAAGGGGGATTACGTAGCACAATCAATCCGAAGTCCCACGGCGCGGCTGCGGGAGGTACAATCCAACCATCCCCATCTGCTTTTAACCGCTTTCCCAGTGTCGGATCAACGCGGCCTTCTATGTCGTGGATCTCATAGCGCCAAAGACCTTTATTTGACACAAAACGCAGCGCCACTGCTTTATCGGCTTTACCCTTTGGAGGTGTCAATAAACAGTGTCCTGCCGTTAATGCCAGATTGGGTGCAATCAGCGTCGCCGTACATAAATTGCCGCTGGCCGTTTCCAGTTGCCCAACCGCATCCCACGGTGATTGGGTCGTGTCATTCACTGGCACACGATCATCATGACCAAAAAACAGGGTGCTGACCTCATCGTTTGCCGATCTGGCAACGTCTGGTTTATCTGCAAACACAAAAGCAGACGTCAAACTAATTGCACCCAACACTACAGCAATGGTTGTACGCATATCACACTCTGGTGGGTAATTATGATTATTAAAAGCAAACCCTCATAAATACTATAGACGGGACGGGATGAAAGTGGGAGTAAAATCAGATAACTACAATCAGGAAAGATAAAAACGGGCGGCAATCAGTGCGCATAAAATAAGGATGATCAGAATGAACTCAAAGCGATAACGGCCCACCATACTACCCTCCGCTAAAGGCGGTGCTAAACGTTACCCGCTTAGCACCGTAATTTTCAGGCGCGAGGGGGCGCGCCAGCATTACTGTTGAAAACTTACGCTGCGGGTTGTGCAGCAGGTTTTGCCGGTTGCTGATGGCTGTGTTTCTTGGCGTGTTTCTTCGCTGCCTGCGCTTTTTGTTCAGGGGCTTTCTGTTCAGCTTTCGTATTTTTATGATGCTTTTTAGCCGCCTGCGCTTTCTGGGCAGGTGCTGCTTTATGCTGTTTTTTATGATGTGTAGTTTTCGCCGGCGCTGCTTTGGTGGTCGTCGCAGTCGGAGCAGGTGTGGTCGTAGTCTCTGCAGCAAAGGCGGCAGAAGACAGACCCATAGCAGCGGCAACAACCAGAGCTAATACTTTTTTCATTGTCATACCCTCAATTTGTTTTTTCATTTAACCCCACTGCGGGGCCGTTGAAATAACTATATCCCTGGGAATGGCAGACTTCCGTGAGTGGTTGGTTTCAGCGTGTAACGATATGTACAAACGCTGAATAAATTACAGCGTTGATAATAACGTATTGTGACTTAAGGGAAATTTAGCTACCAATAATAGTAGTCTTGATCGGGTTAACTATTTTACTACTTACAAAGCAGTAGAATAACTGCGCATCAGTAATAAATGACACACAGCAAAATGAATCCGTTTATTTGGGTACTTATAATCCTGATGACGCTAGACGCGCTGCGGGAATTGGCTGGCGCTTCGTCTATTTTAGGATGGCTATTAACGTTGGTTTGAGCTGGCAATAAGTCCGGACGGGTATTTACCGCAGTCCGGACTTATTTTTCAGGCGTGCAGACGACGATGCAAACGCGTCCCGACCAGCAGAGCAATGACCAGCATCAGAGCAATAAATGCGCCGACGCCGTTCCAGCCATAGTT
ATGCCAGAAAACACCACCCAGCGTCCCGGCAATACTCGACCCCAGATAGTAACTGAACAGATACAGCGAGGAGGCCTGGCCTTTAGCGCGTTTTGCGCGGGGGCCGATCCAGCTGCTGGCTACTGAGTGGGCTGCGAAGAATCCTGCTGAGAAGAGTAACATTCCGGCAAAGATCAGCCACAGCGAGCTGAATAAGGTCATCAGTAAACCAAACAGCATAACCCCCGTCGAAAACAACATCACTGGACCACGCCCATAGCGGGTGGTCATGGTTCCGGCTTTGGGTGAGCTCCATGTACCGGTCAAATAAGCCAGCGATAATAAGCCAACCACGGCCTGACTGACATGCCAGGGGGAGAGCATCAACCGATAGCCGATGTAATTAAACAGCGTGACGAACGACCCCATCAGCAAAAAGCCTTCTGCGAACAATAACGGTAATCCCCGGTCACGCCAGTGCAGACGAAAGTTGATAAACAACGTCTTAGGGCGCAGCGAAGTCGGGCGAAAATGGCGTGATTCAGGGAGGATTTTCCAGAACATCAACGCCGAGGCCAGCGCGAAACAACCGATTGCCGCCAGAGCAATTCGCCAGTTGAAAAAGTCCGTGAAGACACCGCTAATTAAGCGTCCGCTCATGCCGCCAATTGAGTTGCCGCTGATATACAACCCCATTGAAAAGGCCACGAAACTGGGATGGATTTCCTCGCTAAGATAAGTCATGCCAACAGCTGCCACGCCACTTAACGAAAGCCCAATCAAGGCGCGCATAATCAAAATGCCGTGCCAGCTGGTCATCATTGTCGAAAGTAACGTACAAATGGAGGCCAACAGTAGCGCCGTGACCATCACTGGTTTGCGACCAATGGCATCGGATAGCGGGCCAGTAAACAGCAAACCAATAGCCAACATCGCCGTGGAAATGGACAGTGAAATACTACTGTTCGCGGGGGTTAAGCCAAACTCCTGCGAAAGCACCGGAAGGATAGGCTGCACACAATAGAGAAGTGCAAATGTTGCCAGTCCGGCAGAGAACAGCGCCAGGGTGACGCGCATAAATTGCGGCGTACCGCGTTTAATAAATTGATTTGGCTGAGAAATGCTTTGCTTGTCAGTGTCGCTTGCCGGAGCGCCATCAACAGTTGTAGTACGGCTCACTTGAAATCCTTGCTAAATATGCCTGTAGATCAGGCTTATACATAGGGTAGGAAAATCGAATTGTTCTGTCTAATATATTAATAATCTCAAATAAGATGTTTTAAATATGAATATTGAACTTCGTCATCTGCGTTACTTTGTTGCTGTTGCGGAAGAGCTGCATTTCGGGCGCGCCGCTGCCCGCCTGAATATTTCGCAACCGCCGCTAAGTCAGCAGATTCAGGCGCTGGAGCAACAAATTGGTGCCCGACTGCTGGCACGAACCAATCGCAGTGTATTGCTGACGGCAGCAGGAAAACAGTTTCTTGCAGATAGTCGGCAAATCCTGTCTATGGTGGATGACGCTGCCGCTCGCGCTGAAAGGCTGCATCAGGGTGAAGCGGGGGAGTTGCGCATTGGTTTTACTTCGTCGGCTCCTTTTATTCGGGCGGTGTCCGATACGTTATCGCTGTTTCGCCGTGATTATCCTGATGTCCATTTACAAACCCGCGAAATGAACACTCGCGAGCAAATCGCTCCGCTCATTGAAGGAACGCTGGATATGGGATTGCTGCGTAACACAGCGTTACCGGAGTCGCTTGAACACGCAGTCATCGTCCATGAACCGCTTATGGCGATGATCCCGCACGATCATCCCCTGGCAAATAACCCGAATGTAACGCTGGCTGAACTGGCGAAAGAACCCTTTGTCTTTTTTGATCCGCACGTCGGGACAGGGCTGTATGACGATATTCTCGGGCTGATGCGACGTTACCATTTGACGCCCGTCATCACTCAGGAGGTGGGCGAGGCAATGACCATCATCGGTCTGGTTTCCGCCGGTCTGGGTGTTTCAATTTTGC
CTGCGTCATTTAAACGTGTTCAGCTCAACGAAATGCGCTGGGTGCCGATTGCTGAAGAGGATGCGGTTTCTGAAATGTGGTTGGTCTGGCCGAAACATCATGAACAAAGTCCGGCTGCGCGTAACTTTCGTATTCATCTGCTGAATGCTCTCAGGTGAGGGAAATTTCAGCGAAAAAGCCCGAAAAATGTGCTGTTAATCACATGCCTAAGTAAAAATTTGACGACACGTATTGAAGTGCTTCACCATAGCCTACAGATTATTTCGGAGCGCGAAAATATAGGGAGTATGCGGTGGTTGCTGAAAACCAGCCTGGGCACATTGATCAAATAAAGCAGACCAACGCGGGCGCGGTTTATCGCCTGATTGATCAGCTTGGTCCAGTCTCGCGTATCGATCTTTCCCGTCTGGCGCAACTGGCTCCTGCCAGTATCACTAAAATTGTCCGTGAGATGCTCGAAGCACACCTGGTGCAAGAGCTGGAAATCAAAGAAGCGGGGAACCGTGGCCGTCCGGCGGTGGGGCTGGTGGTTGAAACTGAAGCCTGGCACTATCTTTCTCTGCGCATTAGTCGCGGGGAGATTTTCCTTGCTCTGCGCGATCTGAGCAGCAAACTGGTGGTGGAAGAGTCGCAGGAACTGGCGTTAAAAGATGACTTGCCATTGCTGGATCGTATTATTTCCCATATCGATCAGTTTTTTATCCGCCACCAGAAAAAACTTGAGCGTCTAACTTCGATTGCCATAACCTTGCCGGGAATTATTGATACGGAAAATGGTATTGTACATCGCATGCCGTTCTACGAGGATGTAAAAGAGATGCCGCTCGGCGAGGCGCTGGAGCAGCATACCGGCGTTCCGGTTTATATTCAGCATGATATCAGCGCATGGACGATGGCAGAGGCCTTGTTTGGTGCCTCACGCGGGGCGCGCGATGTGATTCAGGTGGTTATCGATCACAACGTGGGGGCGGGCGTCATTACCGATGGTCATCTGCTACACGCAGGCAGCAGTAGTCTCGTGGAAATAGGCCACACACAGGTCGACCCGTATGGGAAACGCTGTTATTGCGGGAATCACGGCTGCCTCGAAACCATCGCCAGCGTGGACAGTATTCTTGAGCTGGCACAGCTGCGTCTTAATCAATCCATGAGCTCGATGTTACATGGACAACCGTTAACCGTGGACTCATTGTGTCAGGCGGCATTGCGCGGCGATCTACTGGCAAAAGACATCATTACCGGGGTGGGCGCGCATGTCGGGCGCATTCTTGCCATCATGGTGAATTTATTTAACCCACAAAAAATACTGATTGGCTCACCGTTAAGTAAAGCGGCAGATATCCTCTTCCCGGTCATCTCAGACAGCATCCGTCAGCAGGCCCTTCCTGCGTATAGTCAGCACATCAGCGTTGAGAGTACTCAGTTTTCTAACCAGGGCACGATGGCAGGCGCTGCACTGGTAAAAGACGCGATGTATAACGGTTCTTTGTTGATTCGTCTGTTGCAGGGTTAACATTTTTTAACTGTTCTACCAAAATTTGCGCTATCTCAATTTGGGCCAGGAAAGCATAACTTAGACTTTCAAGGTTAATTATTTTCCTGGTTTATATTTGTGAAGCATAACGGTGGAGTTAGTGATGCTGAAGCGTTTCTTTATTACCGGTACAGACACTTCTGTAGGGAAAACGGTGGTTTCCCGCGCATTGCTACAAGCGTTAGCCTCCCAGGGAAAAACGGTTGCGGGATATAAACCCGTAGCGAAGGGGAGCAAAGAGACACCCGAAGGGCTGCGTAATAAAGATGCCCTGGTGTTGCAGAGTGTTTCAACCATCGAACTGCCTTATGAAGCAGTTAATCCTATCGCGTTAAGCGAAGAAGAAAGTAGCGTGGCGCACAGTTGCCCAATCAATTACACCCTCATTTCAAACGGCCTGGCAAACCTGACCGAAAAAGTCGATCATGTCGTGGTAGAAGGGACTGGCGGCTGGCGCAGTCTGATG
AATGATTTGCGTCCACTCTCTGAATGGGTAGTGCAGGAACAACTGCCGGTGTTGATGGTTGTCGGTATTCAGGAAGGTTGCATTAACCATGCACTGCTAACAGCTCAGGCGATCGCCAACGACGGGCTGCCGCTCATTGGCTGGGTGGCTAACCGAATCAACCCAGGACTGGCGCATTATGCGGAAATCATTGATGTGCTGGGTAAAAAACTTCCGGCACCGCTCATTGGTGAACTGCCTTATCTGCCGCGCGCCGAACAGCGTGAACTGGGGCAATACATCCGCTTAGCTATGCTGCGCAGTGTGCTGGCGGTAGATAGAGTCACGGTGTAACGTCCGCGAAATTACCGACGCAATTACGCAGGCAATCAATAAACCGGGGAGTAGCTGATACTCCCCGGTCATTTCACATATCATCAACGTCGACATAATCGGCGCGTGCGTGGTCGCCGCCAACAGTGTCGCCATCCCGGTCAATCCGAGTAAAAGTGTAATTTCTTCGCCATCAGGGAACCATAATCCCAGGCTACGACCATACAACATGCCAATGGCAAGACCGATAAATAGCGTCGGTGTAAAGACCCCACCGGGTGCGCCGGAACCGCTACTCGCCAGCACGGCACACAGTTTACAAAGGAAGATCCCGGCAATGATCATTAACAGTGGTGGGGCGGTTAAAAAGGATTGTACGGTGCTATAGCCGTTGCCCCACACTGCAGGTGTGAACAGGGAAAGCAGACCCACGATCAACCCGCCTAGTGCCAGTTGCCAGGGCGGCGCAAGTTTGAGACTCACAAATCCACGATGACAGGCGTTCATTAACGTTAACAACAGTGGTCCGCACAGACCTGCCAGCACACCTGTACTGATAATCAGCGCATAGTCACGAGCCTGAACCGTCACTGAGAGTTGTACGTTGTAGAGTAACGCGTCGCTATGATTAATCAGATTGCTAACCAGCAATGCCACGACGGCGGAAATAATCACCGGGCCGAGAGAGGCCAACATCATAGTGCCAAACAGCACTTCGGCTATAAATAAACTGCCAGCAAGCGGGGCACGATAGGCCGCAGCCATTCCCGCCGCGGCCCCACAGGCGATCCATAATTTCCACTCCTGGCGTGGCGTAAAACGTTGGGCAAAACAGGAGGCGGCAAGGGCAGCTAAAAGAATCATCGCACCTTCGCGACCAATTGCACTGCCGCTGGTTACTACCAGCAGAGAGGCAAGCGATTTAACCAGGCTTGCTGCGTAATCGAACTGTCCATCGGTTTGCAACGCTTCCATGTAATCGGTCGGCGCATGAGGGCGTTGTTGGGTAAATTTCTGCCAGCCCATCAGCAACAAACCCGCCGCCAGTCCGCCGAGCGCCGGAGTTAGCAACCGTCGCCAGGGGGAAAGGTTTGTCGCTGCATTGACCAGACTGCCGGAGTCATTATTGAGGAACAACCACTCCAGTAGCAGCATCGCATGACGAAACCCGGCAACGGCAAAGGCCGCGAGAATACCGACGACTGTTGCGATAAGCAGACGGCGGAACATAGTGCGCAGATCCGGATAGATGTGTAGATGATGCATGAGCAGAAATGGGCAATGAAACACGGGCGACTATTTTGCGGCAAAAGAGGCCAGCCGCAAAAGCAGTGCGTGCTTTAAGAGATAAAAAAACGTGACACTGTAACCTATTATTGCCGGATTGCGCGTAATCGTCACCATCCGGCAATATTACGGTGATCCTATCGAAACAGCGGTTTAACCGCGACAGGAATTAACAGTTGTGATTGCCATTGCGCCAGCGTTAAGCGAGCCAGTTCACCCAGTGCACGGTAGAAAGGGTGTTCTGCTTTTTCGATAAAAACATCAAGAAAACGTGTTGACCACGGAAAAAGGTGCCATGCCAGCAATTCTTCGCATTCTGTCTGGCGACCATTCTCCGCCAACCACGCAGCCATCAACAGCAGTGAACCAAAATGATCTTCCGGTTCGTTTTGCTTCATTTC
AAACTGAATGCCTTTCTCGCGCATCCACTGACGAAGTGCCAATGTTGAATCGCCAAACAGCACAGATTCGCGATCCAGCCAGACCGACCCCCACGGCGGAGACGGCAGTGCCCACGGGCCGACAAACAAACGCTGCCAGGCCTGGGCGTGAGTCTCTTCACACTGTGTCTGAAAAGCCGTCACCAGCGGCGCTAATGACGCTTCTGGTAGAGGCCACTGAGTTTCCCAGCCGTCACTGGTGAGTACCGCAACCAGAGGTGCGGCTTCCGCACTCTCTGGAGCGTAATAAAACAGCGCGCCCAACACGCGCGCCGCGACAGAAAAATTATCTTGCTGTGAAAAATGGGTCATTCCTAACATCCTGAAATGTGCGGGTTGCCCCGCACCAATTTGTTAACCTGCAATAGCCATTCCTACTGTCATGTGCAGGCCATAAAAGAGCACACGCCCGATCATCTCACCGCCAAGTATGAGGATTAGCCCCAGAATAAGTCCGGCAACGTGAGGTTCACGGCGACGAATGAGCGGGCAAAGCCAGCATCCAAGACCTGCGCATAACAATACCACACGCCAGACCTGTAAGGAGGCGTAATCTGGTACCAGTGCGCTGGCTTGTTGCACGGATGAGTGAATAGACGCCAGAGACAAACCTTGCAGCACAATCACCCCGGCACAAGCGATCAGCGCCAGAACGCTAATGATGGCAAATGGTGTGGTATTAAATGTAACGCGAGCTGCCCGCAGAATTGCGGCAGCAAGTATTGGACCGCTCAACAATACCGTCAGGAAGAAAGCCAGCGTAGTGTAACCGTTATGCCAGGTTGGCACGGTGTCGATTTGATACACGCAGGTCATCATCCAGACGAAAATGACGCCGAGCGCCATGCTGAAAAGTAACCAGAGTTTCCCCAACGCTTGCGGCATTTTACCGATGACTGCCACCAGCCACCATAATCCGCCAACGGCAAAAAATATGGAACCGGCAGCAATCTCGTTACTCAGACCAGAAGCTCCGATTCGATTCAGCGAGTTAAACGCCCGCAGTGGCGATCCCAGGTGCATGACAGAGGCAATAAACCCGACGCCCATCAACAGCCAGAGGAAAAACATGCCACGGACAATACGCTGACGGTCTGCGTCATTTTTGGCGGCGAACCAGCCAATTCCGCTAACAATCAGCGCGCCTACTACGCATTGACCCAATACGGTAAAGATAACCAGTGGCCACTCATGCCATCCATTTCCCATTTTACACTTCCTCCGGATTAGCCAGATAACCAGTGGTATCGCCGGTCGGGCGGCTGTTGGCGTTAGGTTTGATAACGATATTGGGTTTTGTGAAATGCGCGCGCGGCAGTGGCGCGACGGCAGCAAGAGTGCCGTGCTTCTGGCGCAGCTCTTCAATTGGACCGAACTCAAGTGCACGCAGCGGGCAGGATTCGACACATATGGGTTGTTTCCCCTCGGCGACGCGCGAATAACAACCATCGCACTTCGTCATGTGCCCTTTTTCAGCATTGTACTGTGGCGCGCCGTACGGACAAGCCATGTGGCAGTAGCGGCAGCCGATACAAACATCTTCATCAACCACCACAAAGCCATCTTCACGCTTATGCATTGCGCCGCTGGGGCAGACTTTTGTACATGCGGGGTCATCGCAATGGTTACAGGAGATGGAGAGATAATAAGCGAACACGTTCTGGTGCCAGACGCCATTATCTTCCTGCCAGTCGCCGCCAGCGTATTCATAAATACGGCGGAAACTGACTTCCGGGCCAAGATCTTTAAAATCTTTGCACGCCAGTTCGCAGGTTTTACAGCCAGTACAGCGGCTGGAATCGATAAAAAATCCATATTGTGTGGTCATGGGCTACTCCTTAAACCTTTTCGATCTGGACAAGATTGCTGTGCGACGGGTTTCCCTTTGCCAGCGGTGAAGGGCGGTGAGAGGTCAGAATATTGATACTGCCGCCGTGATCGACCCGGTCACCAAACATATC
CGCTTTAAGCCACGCACCTTGCCCGATGGCGGTAACGCCAGGCAGAATACGCGGAGTCACTTTTGCGGCAATCAGCATTTCTCCATTATTGTTAAATACCCGCACGGTATCGCCATGACGGATACCGCGTGCCTGAGCATCAATGGGGTTGATCCACACCTCTTGTGGGCAGGCCTGCTGTAACACATCAATATTGCCGTAGCTGGAGTGGGTACGCGCTTTGTAATGGAAGCCCGTTAACTGCAGTGGATAGGTTTTCCGCAGGGGATCGTCCCAGCCATCAAAACCTGGGGTATACGCAGGAAGGGGATGAATAATTTCATCTTTTTTCAATTCCCAGGTATCTGCAATCTTCGCCAGTCGTTCAGAATAAATTTCGATTTTCCCCGAAGGTGTTTTCAACGGGTTTGCCTGTGGATCTTCACGGAATGCGCGGAAAGCGACGTAGTGTTCTTCCGGGCATTTTTTCTTAAAGATCCCGGTCGTTTTCATCTCCTCGTAGTCGGGCATCTCAGGGTTACGTTCCTTCGTTTTCGCATGGAGATATTTGATCCATTCATGCTGACTGCGACCTTCAGTAAAGGTTTGATAAACGTCTGGTCCTAAGCGTTTGGCGACTTCACTCAGCATCCAGTAGATGGGTTTGCGTTCAAATTTTGCTGAGGTTGCGGGTTGGGCGAGGATCACATAGCCCATATTCCCTGCAGATTCATGAGAGATAAGGTCTTCTTGCTCTGTTGGCATCAGGTCGGGCAACAGGATATCGCAATACTTAGCCGAGGCCGTCATGAAGTGGTCAATGCCAACAATCATCTCGCACTTGCTGTCATCCTGAAGCACCTCATGGGTGTGATTGATGTCGCCATGTTGATTGATCAATGTGTTACTGGCGTAGCACCATAAAAACTTGATGGGGACATCCAGTTTTTCTTTTCCACGAACACCATCACGGGTCGCGGTCATTTCCGTACCATGGTCGATGGCATCTGTCCATGTAAAGACGGAAATCTGCGTTTTAACAGGATTCTCGAGCATCGGGAACCATTCTACCCCCAGATCCCAGCTACCTTCGCGTACGCCTGAGTTGCCGCCGTTTATGCCGACGTTGCCGGTGAGAACGGAAAGCATGGCAATAGCGCGGGATGTTTGCTCGCCGTTGGAATGTCGTTGTGGCCCCCAACCCTGACAAATATAAGCAGGTTTTGCTGAACCGATCTCTCGTGCCAACTGGATAATTTTTTCTGCCGGGATGCTGGTGATTTTTGCTGCCCATTCCGGCGTTTTAGCTATGCCGTCAGGCCCTTCGCCCAGAATATAGGCTTTATAATGCGCGTTACGTGGTGCGTTGGCGGGCAGCGTTTTTTCATCGTAACCAACACAATATTTGTCGAGAAATGGCTGATCGACCATGTTTTCAGTAATCAGTACCCAGGCAATCGCACAGGCCAGTGCGCCATCGGTGCCAGGGCGAATGGGCAGCCATTCATCTTCACGCCCGGCAGCAGTGTCGTTATAACGTGGATCGATGACGATCATGCGTGCGTTTGAACGTTCGCGGGCTTGCTCGACGTAGTAAGTGACACCACCGCCGCTCATCCGCGTTTCTGCCGGGTTATTTCCGAACATAACGACCAGTTTCGTATTGGCGATATCATCCGGGCTGTTGCCATCATTGGCACCGAACATATAACTCATTGCGGCACTGATCTGTGCGGTACTGTAGCTGCCATAGCGACTGAGAAAACCACCGCAAGAGTTCATCAGACGGTACGGGACGTTTGAGTTGGTGATGTTTCCGCCATCTACGCCTGTTCCGTACAGGACATGTACAGCCTCATTGCCGTAATCTTTCAGGATCCGCCGAAGATTATCACTGATGGTATCCAGGGCTTCGTCCCAACTTATCCGTTCAAATTTACCTTCACCGCGCTTGCCGACGCGCTTCATGGGATATTTCAACCTATCAGGATGATTCATCCGTCGGCGGATAGAGCGCCCGC
GTAAACACGCTCGAACCTGATGATTACCGTAGACGTCGTCACCTGTCGTATCAGACTCCACCCAGTACACGGTGTCATCTTTCACATGCAAACGTAACAGACAGCGGCTCCCGCAGTTAACGGTGCAGGAACTCCAGACCGCTTTCTCTTCTACCGGAGCCTCTGCCGCCCGGACCATTTGGGAAAATGGCAGAGTGAAAGCACTGCTTGCCAGCGCAAGACTGCCAAGTGCGGAGGTTTTCATCAGACTTCTACGGCTGATTTCAGCCTTCATGAGCGCCTCTGTGGTATGGATTTTCATCATTACTCACTTATTGCTTTTCAAACAAAATGTCATGCCAGAATTTATGGTTGTCGTGGGTTATATTTTTTCGATCTCGACCAGATTAGTGTGCTGCGGGTTTCCCTTCGCCAGTGGTGAAGGGCGCAGAGTGGTTAGCGTATTCACACAGCCGCCATGGTCGATTTTATCGCCAGACATATTGGCCTCGTGCCAGGCTCCCTGGCCCATAGCGCTAACTCCAGGGAGAATACGTGGTGTTACTTTGGCTGGTAGCCGAACTTCGCCACGATGGTTAAACACCCGCACCATATCGCCGTTGGCAATCCCACGTTTCTGCGCATCTATAGGGTTGATCCACACCTCCTGACGGCAGGCAGCCTTCAGGAGATCAATATTGCCGTAGGTCGAGTGAGTACGGGATTTGTAATGGAAACCAAACAGTTGCAGTGGGAAGGTTCTACGTTCAGGGGAGTTCCAGCCTTCAAAGGTTGAGGCATAAACTGGCAATGGGCTTATCACTTCATCTTTTTCCAGTTCCCAGGTACGGGCAATTTCCGCCAGCCTGCTGGAATAAATTTCAATCTTACCGGAAGGCGTTTTAAGTGGATTTGCCTCGGGGTCGTCACGAAATGCTTTGTAGGCGACAAAATGGCCATTGGGATCTTTACGCTTATAGATACCCATTTTTTTCAGTTCGTCGTAAGACGGTAACGCCGGATCTTTGGCAAGCATTTTGGCGTACAGATGTTGTAACCATTGTTCCTGCGTGCGACCTTCTGTGAACTTTTGATAGACGTCAGGTCCAAGACGTTTCGCGACTTCACTCAGGATCCAGTAAATCGGTTTGCGTTCGAATTTTTCGCTGGTGACAGGCTGGAGGAAAATGAGATATCCCATGTTACCGGCGTAGTCGTTAGGAATAATATCTTCCTGCTCAACGGTCATCAGGTCTGGCAGCAGAATGTCGGCATATTTTGCCGATGAGGTCATAAAGTTTTCGATGACCACAATCATTTCGCATTTCGATTCGTCCTGCAGAATTTCATGCGTTTTGTTGATGTCAGAATGCTGATTAACGAGGGTATTTCCCGCGTAGTTCCAGATGAACTTAATGGGCACATCCAGTTTATCTTTGCCGCGGACGCCGTCGCGGATTGCCGTCATTTGCGGACCATGATCGATAGCATCTGTCCAGCTGAAGCAGGAGATTGACGTTTTGACCGGATTATCCAGCACCGGCAGGCGTTCTATGGTAATGGTATAGGTCGATTCACGCGCGCCACTATTTCCGCCGCTGATGCCGACATTGCCCGTCAAAATAGGTAACATAGCAATAGCGCGTGCAGTCAGTTCGCCGTTTGCCTGGCGTTGTGGCCCCCAGCCCTGGCAGATATAAGCGGGTTTTGCTGTGCCAATTTCACGCGCCAGTTTGATGATACGGTCCTCCGGGATACCGGTAATTTGCGAAGCCCACTGCGGCGTTTTCGCTGTTTTATCGTCACCTTCACCAAGAATATAGGCTTTATAGTGACCATTTTTGGGTGCATCTGCGGGTAAGGTTTTTTCGTCATAGCCGACGCAGTATTTATCGAGAAAAGGTTGATCAACGAGATTTTCGTTAATCAATACCCAGGCAATACCCGCAACCAGCGCGGCATCGGTGCCCGGGCGAATAGGGAGCCATTCGTCTTCACGACCGGCAGCCGTATCGGTATAT
CGCGGATCGATAACAATCATTTTGGCGTTCGATTTCTCGCGCGCTTTTTCAAGAAGATAAGTGATGCCACCACCGCTCATGCGGGTTTCTGCCGGGTTGTTACCAAACATCACGACCAGCTTGCTGTTTTCAATATCCGTGGTGCTGTTGCCATCATTACTGCCGTAGGTGTAGGGCATGGCACAGGAAATTTGCGCAGTGCTGTAGGAGCCATACTGGTTGAGTGAACCGCCGTAGCAGTTCATCAGGCGTTTGACCGCCGAGGCTGATGGCGAAGAGCGGGTCATATTGCCGCCAACGATCCCCGAAGAGTACTGAATATATACAGCCTCATTGCCATATTGTTCGACGGTTTTCTTCAGGCTACTGGCGATAGTATCCAGGGCTTCATCCCAGCTAATCCGTTCGAATTTGCCTTCGCCGCGTTTGCCCACGCGTTTCATTGGGTAATTCAAGCGATCGGGATGATTAATACGCCGGCGGATGGAGCGACCGCGCAAACAGGCGCGTACCTGATGGTTGCCGTACTCATCGCTGCCGGTATTGTCAGTTTCCACCCAGGTCACTTCATTATCTTTAACATGTAGACGAAGTGCACAGCGGCTACCACAGTTGACGGAACAGGCACCCCAGACCACTTTTTCGCGGGCCTGTTGTACCGATGCTGCTGCATTGCGCAGGGTAAACGGCAAAGAAAAACCGCCTGCAGCCAGCGCCAGAGAACCTATCGCGGTAGATTTAACGAGTGTTCTGCGGCTGATGCCCACCATTCGTTCATTTTTGGACATAACTCACTCCCTGTTCTTTATCGTTATATAAAAGTTTATATATTGAATATTTAGCGCGCTAACAATAGAGGGAGTCTACCCATTTTGGGTTAAGAATTATTAATCCATATCAATAGAAGGGTATGAGTAATAAGGTGGGATTATGTTGTATGTTCAAATCGCCGGATTTGTCATATCCGGCGTTCAGTCGATAATGTGTTACTGCGGTTCGGCAGGCGCGCCATCCTGGCTAGACTGCGCGGGAGCAGAGACGTTACCGCTGGTGGTGCGGGTATAGAGAATTTTATGCGTATCATTAGCGCAATGGCCGACGACCTGGGAATCAGGCTGATCAACCTGGTCATTGGGTACAATACTTAACGTGAAGCTGCTTTCGGGTACGCCATTATTGATAATGCGCTGTGATATATCGCTCTGTATGCGCTCACAGGATCCCGGCGCGGCGAGTACCGCGGGTGAGGCGAGGGCGAGCAGAAGCGCGGCACAGCAGGTTGAGAGTTTCATCATAAGCTCCTTACGCGAAGATAACTTCTTTAAGCATAGCATTTAACGTGTAAAGTACTGTATTTGCTACTATGATTGAGAATCATCTCTACTCTCTGGTGACTGTTGTGAAATACAAATTACTACCATGCTTACTCGCGATATTCCTCACAGGATGTGACCGCACAGAGGTAACACTTTCATTTACCCCTGAGATGGCCAGTTTCTCTAATGAATTCGATTTTGATCCGCTGCGTGGTCCGGTAAAAGATTTCACTCAGACATTAATGGATGAGCAAGGTGAAGTGACGAAACGTGTTTCTGGGACTTTGTCGGAAGAAGGCTGTTTTGATTCACTCGAATTACTGGATCTGGAAAATAATACCGTGGTCGCTCTGGTACTGGACGCCAATTATTACCGTGATGCCGAGACGCTGGAGAAGAGAGTACGTTTACAGGGAAAATGCCAGCTAGCAGAATTACCTTCTGCCGGGGTGAGTTGGGAAACCGATGATAATGGCTTCGTGATTAAAGCCAGCAGCAAACAAATGCAGATGGAATATCGCTATGATGATCAGGGTTATCCGCTGGGTAAAACCACGAAAAGTAACGACAAAACATTATCTGTCAGCGCCACGCCATCAACGGATCCGATCAAAAAATTAGATTACACAGCGGTTACTTTACTGAATAATCAACGGGTTGGTAATGTAAAACAGAGCTGTGAAT
ATGACAGTCACGCTAATCCGGTGGACTGTCAGCTAATCATTGTTGATGAAGGAGTAAAACCCGCCGTCGAACGGGTTTACACCATCAAAAATACGATCGATTATTATTAATGCTATTGTGCGGTCGGCTTCAGGAGAGTCTGACCCGGTGTTTTGTGCTCTGCCAGATACTGATGCTGGAATATACACATGCGAATGGCATTACGATATTGACCATTAATAAAGAACTCGTGCATCAATTCACCTTCAACCGAAAAGCCAAGCTTGCGGTAAATGTGAATCGCTTTTTCATTCTCTTTATCAACGATCAGATACAGCTTATAGAGATTGAGAACGGTAAAGCCATAGTCCATTGCTAATTTGGCGGCACGGGTTGCCAGACCTTTCCCCTGATACTCCGGGGAGATAATTATCTGAAATTCTGCGCGGCGATGAACATGGTTAATTTCCACCAGCTCCACCAGACCGGCTTTTTCGCCGTCACATTCCACCACAAAGCGCCGTTCGCTCTGATCGTGAATATGCTTATCATACAGATCAGAGAGTTCAACAAAGGCTTCGTAGGGTTCCTCAAACCAGTAACGCATCACACTGGCGTTATTGTCGAGTTGATGTACATAGCGTAAATCTTCACGCTCCAGCGGGCGTAGCTTAACACTGTGGGCGCTTGGCATAACGTGTCCTTACATTCCTTAAATCAATAACAGGTTAGGGGGTAATAACGCGGCCAGTTCGACGGTCCAGGCAGCGCAAAGTATTGGGCTCCCAGTAGGCATTGATGTTGGCGCTTTGCTCACATTTATCGCGGTTATCAAAAGCGGCGTCGGCTTTATCCCACTCTTTTTCAGTGCGTTTATTCACTTTCTGGCGCAGATTGCGCGTGTCATTCCATTGCTCTTTTTCCATAGCGGCGTGCTGGCGGCTTTGTGCACTGTCGCCAGACTCAATCACCAGTTTGTTAGTTTCGGCATGAACAGTTGTGCTCAATGCCAGTGCGCAAGGCAGCAGAATAGCGAGCAGGCCGATTCGTTTGCTGAGAGTGATTTTCATAATTCATTCCCTGTATGAATGATTAAAGGTGATTCTACACCATCCACTGCGGACGCAAAACGTACCAGGAGGGTGTTTATATTGATGATATTATGTCGCCCTATAACTATACATGATGTCAATAAGAGACAAAGATGATTAAAACAACGTTACTATTTTTTGCTACTGCGCTGTGTGAAATTATTGGATGCTTTCTGCCCTGGTTGTGGTTAAAACGAAACGCCAGTATCTGGCTGTTGCTTCCGGCGGGGATTTCACTGGCGCTGTTTGTCTGGTTGTTAACGTTGCATCCAGCGGCGAGTGGGCGTGTTTACGCGGCTTATGGTGGCGTTTATGTCTGCACGGCGTTGATGTGGCTGCGCGTTGTGGATGGCGTGAAACTGACTCTTTATGACTGGACGGGTGCGTTGATTGCGCTTTGCGGCATGTTGATCATTGTTGCGGGCTGGGGGCGCACGTAGGAACATAAATCCATTTTATCAATAAGATAAGAGGAAGTGTCAGCTGACAAAAGGTATTCTATTTCATCTTTTGTCAACCATTCACAGCGCAAATATACGCCTTTTTTTGTGATCACTCCGGCTTTTTTCGATCTTTATACTTGTATGGTAGTAGCTCAGTTGCGTAGATTTCATGCATCACGACAAGCGATGCAAGGAATCGAACATGAAGATCGTAAAGGCTGAAGTTTTTGTTACCTGTCCGGGGCGTAATTTCGTCACATTAAAAATCACCACTGAGGACGGTATTACGGGCCTTGGGGATGCCACCCTCAATGGACGTGAGCTTTCCGTGGCCTCTTATTTGCAGGATCACCTTTGTCCGCAGCTTATTGGTCGCGATGCGCACCGTATCGAAGATATCTGGCAGTTTTTCTATAAAGGTGCTTACTGGCGTCGCGGTCCGGTTACGATGTCGGCCATTTCAGCGGTTGATATGGCGCTGTGGGAT
ATTAAAGCCAAAGCTGCCAACATGCCGCTTTACCAGTTACTCGGCGGCGCGTCTCGTGAAGGGGTGATGGTTTATTGCCATACCACCGGTCACAGTATTGATGAAGCTCTGGATGATTATGCCCGTCATCAAGAGCTTGGATTCAAAGCCATCCGCGTGCAGTGCGGAATCCCTGGTATGAAAACCACCTACGGCATGTCGAAAGGTAAAGGTCTGGCTTATGAACCCGCAACCAAAGGACAGTGGCCGGAAGAGCAGCTGTGGTCGACGGAGAAATACCTCGATTTCATGCCGAAATTGTTTGACGCGGTACGTAACAAGTTTGGTTTTAATGAACATTTGCTGCATGACATGCACCATCGCTTAACGCCTATTGAAGCGGCGCGCTTTGGTAAAAGCATTGAAGATTATCGCATGTTCTGGATGGAAGACCCGACGCCTGCGGAAAACCAGGAATGCTTCCGTCTCATTCGCCAACATACCGTCACACCCATCGCAGTGGGTGAAGTCTTCAACAGCATCTGGGACTGCAAACAACTGATTGAAGAGCAACTCATCGATTATATCCGCACCACGCTGACCCATGCAGGCGGAATTACCGGTATGCGCCGGATTGCCGATTTTGCTTCGCTGTATCAGGTACGTACTGGCTCACACGGTCCTTCCGATTTGTCACCAGTCTGCATGGCTGCGGCGCTGCACTTTGATCTGTGGGTCCCCAATTTCGGTGTCCAGGAATACATGGGTTATTCCGAACAAATGCTCGAAGTCTTCCCGCACAACTGGACTTTCGATAACGGCTATATGCATCCGGGAGACAAACCGGGTCTTGGTATCGAATTCGATGAAAAGCTGGCGGCGAAATATCCCTATGAACCTGCTTATCTACCAGTCGCACGTCTGGAAGATGGCACGCTGTGGAACTGGTAAGGAGTAAGATAATGAAAAGCATATTAATTGAAAAACCGAATCAACTGGCGATTGTCGAACGTGAAATACCCACCCCGTCAGCGGGTGAAGTACGAGTAAAAGTGAAACTTGCCGGAATTTGTGGTTCAGATAGCCATATTTATCGTGGGCATAATCCTTTTGCGAAATATCCGCGCGTCATTGGTCATGAATTCTTTGGCGTCATTGATGCAGTGGGTGAAGGCGTGGAAAGCGCCAGAGTCGGTGAACGTGTTGCTGTCGATCCGGTGGTCAGCTGTGGGCATTGCTATCCGTGCTCTATAGGTAAACCGAACGTTTGTACGACACTGGCTGTATTAGGTGTGCACGCTGACGGTGGTTTCAGTGAATATGCCGTGGTTCCGGCAAAAAATGCGTGGAAAATTCCTGAAGCAGTGGCCGATCAATATGCGGTAATGATCGAACCTTTTACCATTGCGGCTAACGTAACCGGACATGGTCAACCGACTGAAAATGATACCGTTCTGGTTTATGGTGCCGGTCCAATCGGCCTGACGATCGTTCAGGTATTAAAAGGCGTCTATAACGTTAAAAATGTGATTGTTGCCGATCGCATTGATGAACGACTGGAAAAAGCGAAAGAGAGCGGGGCTGACTGGGCGATTAATAACAGCCAGACACCGCTTGGCGAGATTTTCACTGAAAAAGGCATCAAGCCGACATTAATTATCGATGCGGCTTGTCATCCTTCTATCCTGAAAGAGGCCGTAACGCTGGCTTCTCCAGCGGCACGTATTGTATTGATGGGGTTCTCCAGTGAACCGTCTGAAGTGATTCAGCAAGGAATTACCGGAAAAGAACTCTCTATTTTCTCTTCACGCTTAAATGCAAATAAATTCCCGATCGTTATCGACTGGTTAAGTAAAGGGTTAATTAAACCAGAAAAATTAATTACCCATACGTTTGATTTCCAGCATGTTGCTGATGCCATTAGTTTATTTGAACAGGATCAAAAACATTGCTGCAAAGTCTTACTCACTTTTTCTGAATAATACCAATAACGGCGAGTAAGTAGTACGCATCTTACCTCT
TTTTTAGAGATAACCATTATGACAATAGAAAAACACGAAAGAAGCACTAAGGATTTGGTGAAAGCAGCAGTATCGGGATGGCTGGGCACTGCGCTTGAATTTATGGATTTCAAGAGTCATGCGTGTTAACTATTTGATAAATATTAAATTAATTTTTCATTGCTTCGTTATGGGGCATGGTTGGGGCAAACTCGCTTAACTGTGTATTTAACAAAGCTACCTGTGCATTATTGTTTTCAGACATCCATTTTCCGTATACCTGAAATACCATTTGCGCATCTGCATGGCCCATCTGGTTTGCTATAAATGCCGGGTTAGCACCAGCTGTCAGCGACCAGCAGGCATAAGTATGTCTCGACTGATATGATTTTCGATGGCGGAGTCCGGCACGTTTTATCGCTGCGTCCCACATCTGCCTTATTGAGTCAACGGTAAAATGGTCACCATAATTTTTTACTCTCGCTGACACTTCAGGTTGAAAAACAAAGGTGCATTTTTGTTTTTCTGTTCTGCCATACTCTCTGAGGTGAACATCAATGATATGCTCTTTGCTCAGTCTCGTTAATGTCATCTGACTCCGGAGAGCGTCGATTGCTGGCTTAATAAGATGAATGACCCGATTGGTTCCCGCCTGTGTTTTTGGTACCGTGAAACGGTCTTTTGCTAAATTTCTCCTGATCATCATTGTTCCATTTTTCAGATCTATGTCCTCCCATCCAAGTGCACACAGCTCACCAGGGCGAACTCCAGTATAAACAGAAACACACCATAAATTTTTTGCTTGCTGATTTCTGCACGCATCGATAAGACGGATAAATTCTTCCCGCGAAAGAGGATCCGGAATGGTTCTTGATTCCTTTAATGGCGAGATCCCCTTAAACGGATTATCTGCCAGGTAACCGTTATCAACACCAAACTGGAACACGGCGTTAAGATTTGTCATGTAATTATTTACAGTTACAGCCGATCTCCCTGGTTGTGTAACAATATAGTTACTTTTGGGGATCTGGTATCCAGTCAGTAACTCTTTACGAACCTCCAGTAATTTTTCTTTATTAATCGATGAGGCAAGATTTTTTTCACCGATTATGCTCAGGATATTTTTGATGACGGCACGGTATGTGTTGAGTGATGTTTTGGCGACTTCAGTTTCTTTCAGTGCCAGAAATTTTTCAGCCAGTTCTTTTATGGTTAAATCTTGTCGGGCCTCACCAAATTTTTCCAGATTGCGTGAGGAGGGAAACTGTTTTGCATAGTCGAAAACACCAGTTTTTATTGCGTAACAAACAGAGGAGCGTAGTTCACCTGCAACGCGCCTGTTTTTTGCTGTGTCAGGAACCCCCAGATTTTCCCTGACTCTTACGTCTTTATAAACAAACCAGATACGTAATTTCCCTCCATGGTTTTCCACGCCTGTCGGATATTTCATTTCAACTTCTCTCATTAGTTAGTGTGGCTTTTAGTCAAGTAAGATGACGTCTTGGTCTCGCTGATGCCTGGCGCTCAATCCAGCGATCAATTTCTTCCAGGTTGTAAAAGCATGGACTGTTATCCCATGGCATACCGTCATGAGCGACATGCTTATATTCCCTTCCTTCCATAAACGATTTTTCCCGGGCCTTTTTTAACGTACCTTTTTTTATTCCTTTCAGCGCAATTAACTGCTCTTCGGATACCCATTTGCCGGGAGAGACAATCATGATTACTTCGCTCATCGATTTCTTTATCTCTTACATCAGACGAGCGCCGGTTGCAGAATACCAGTCACAACCGGCGACAGTTGAACATTAAGAATCAGCCTGACTCGGGATCAGTTTTTGCCAGATAACTGAAACGTATTTTGCCTGGTAACGGGCGTCATCAAGTGCATTATGGCGCTCACCTTCGAATGGAATAGCCGTTCTGGCATCGAAGTCTATGGCTTTCCCCAGCTCAACGATTGTGCGTACATCGCGATCGTTGTAGTAACGCCACGGGCAGGGGATCCCCTGCCGTTCGTA
TGAACGGCGCAAAATCGTGTTGTCGAAGTTGGCTCCATTTCCCCAAACCTGAACAAAAAATTCACCGGAGTTTTCGTCGATAAATTCTCGCAATTGTAACAGTGCATCATCTAACGGGATTTCATCGGTCATAATGGCAGATTGCGCTTCGCGTGATTGCTTAAGCCACCATTTAATGGTGTCCCGATCAATGACTCCGCCAGCAGTTTCCAGATCGATAGTCTTACTAAATTCCGGTCCCATATCTCCGGTTTGCGGATCGAAAAATATTGCACCTATTGAGATGATCGGGGCATCAGGATTTTTTCCCATGGTTTCAAGGTCGATCATTAGATGGTCACACGTCCTGCTGGTGGATGTGATTTCGTGATGACCGTTCACCTTAATTGGGTGATCTGCCGTCTCGCCAGTTTCATTATCGCTATTGTGATGCTGATTGCCGCCAGTGTTCTCCTTGTGTGGATGTTCAGCGCCTTCCATTTCCTCCGGATCATCTTCCTGAACTTCAACCTGATACTCTTCATCGAATGTTTCTTGGTATGTTGCGTCGCCCATCACCGCGCCACAATCAGGGCAGTTGCCGCCGCCGGTCTGACCGCAGGCGGTGCAGACTTTTTCCACTTCCTGTTGCGCTACTGGTTCAGGCTGTTTCGTTTCTGGCTCGTTTTGTAACGCATTTGGACTGTTTTGTTCCGCTTTTTGGTAGTTCCGTTCCGATTCATGCTGGTTCTGGTTCACAGAATCGCGGGTCTGGAGCCCCTTAACCCATTTCGGATCATTCGGGTCACTAATCCCTTCAACAAATTCACCACGTGATGCAGCAAGCAACTTATCGGCGTCAGGCTGGCTGATACGGGTTATTCCCCTCTGAACAGGCGCGTAAGGACTGGCAAAACGCCCGCAAAAAACTCTCGAGGGCAAAGGTGAAGAAACCTGCTGTGGTTGATCCGGACCTTATCTGGTCATCACCTGACGGAGAAATACGTCGCTACGACAGTCGCCTAAACATAATCTGTCGCGAGTGCCGGAAGAGCGAAGTTATGCAGCGCATACTGGCTTTCTATCAGGGTAATTTTCAGGACGTGGCGCAGTGAGTGCACCGGCAACCATTCTTGATATGTGCTGTGGCAGCCGCATGTTCTGGTTCGATAAGTCTGACGAACGGGCGATATTCAGCGATATCAGAAAAGAAGGATACACATTACGCAATGGGAGACGCTTGATTATCAGCCCTGACATTATCGCAGATTTTCGTGCATTATCATTTGCAGACGCATCTTTTTCGATGGTTGTATTAGACCCTCCGCATCTTGAGAGTGTTGGTGATAACGCCTGGATGGGAAAGAAATATGGACGGCTGAATAAAGATGCCTGGCGTGATGATTCGCGACAGAGATTTAAAGAAGCCTTTCGGGTGTTGAGGCCGCACGGCGTTCTGATTTTTTAATGGAATGAAACGCAAATACCGGTAAGCCAGATTCTGGCACTGACAGACAGAAAACCTGTTATCTGTCAACGAACAGGGAAAAAACGATAAAACCCACTGGATTATTTTTATGAAAGAGGCATCCAGTGAGTAGATTCGTAAGGCTACAGATACGTATATCTGAATAATTAAATTCAGTTCTGTAAATAAAATTTAATCCTTAACCGGATGGATTTCTGCACCCTCAGAACATCAGGAGGCCGCCCGAAAGGGCGGTAGTGAAATGCGAAAGTTCAAAATTATTATTGAAACGGGAATAGCTGGTGGAGATTTTGAGGATGTATTCGAAGTGGACGATGACGCAACACCTGAGGAAATTCATGACGGAGCATAAGAAATTTTCTTTAACTACTGCAATTACTCATACCACGAAATAAAAGACGATGAGGAAGAACAAAGTGGCTGATTTTTGTTCAGCTAAATATAACATCAGTTTTGAAGAGCGGGATGAACTATTAATGGACTATGGTGAGTTACGCGGTGGAAGTGCTGCTGATGTCGAATCCTAGCGTGA
TGACTATGAAGCGGGAAAAACTCCGGTCGAAGCATATTGTGATGAGTGGGGCGATGAATGAGCGAGATTAATTATCAGGAAGGGCATGAAACGGCGGGGCAAGCAAAACCAGTGGCATGGCGATATCGCTACGTGAAAAAAGGCGTTACGGACTTTCAGGGGAAGTAGTGGTCTGGTGACTGGAAATATGTACCGAAAAAAGAGGATTGTAACGACAGGCCGAACTATCAAATTCAGGCCTTATTCACTGCCTCACCAGTCCCAGTTACATCAGAAGAACTGGTTAAAGCTGTGCACTTTTATGAACAACTAAAACGCGAAAATCCACCAACATCCGGCAACTAGATTAATGGATTAACTATGTCGGTTAAACGACCAGCCAACTGAAAAAGCGGAAACCTGATTACAGGTTGCCAGATAAGGCAATGAGCTACCTGGCGCGGAACGGACTGATAAGTATGGGGAATGTTTTACGATGAACCTTTAGACTAAAGAGTTTGTAACGCTATGTAAGTGATTTTTTCTGGTTTAGATATTTATATGTCCGGCCAAATTGAGGTGTGTTTAAATGTAATTGCACATTGATTGTAGGAGGAATAATGAAAAACGCATTGCAGTTTTTGTTTGTTGCGTTCTGGTTGTTCGCATCATGTATGCCCATCATCTTCACAGCAAGGTATATGGAAAAAGTTGATGTTTTGATATTAATATTTGGATATATAAATGCCCTTTTTTTAGGGGTGTTCATGGCGGTCATGTGCATTGAATACTGGCGGTAAATACAGCGAACTCCATTGGTTTAGTTGGATATTTACTGTGCTGGACAAAAACGGTTTGCGGGGAAATCTTAGTTAAGTAGAATGACTGCGGGTGCTTGAGGCTATCTGCCTCGGGCATGGACACCAACGGCAGATAGAGAAAAGCCCCAGTTAACATTACGCGTCCTGCAAGACGCTTAACATTAATCTGAGGCCAATTTCATGCTAGACACATGTAGGTTAGCCTCTTACGTGCCGAAAGGCACGGAGAAGCAGGCTATTGTTAACACCAAGCTGTAATGTCCCCTTTGAACCATTCTAAAATGTCCCCAGACAATTCTCTGGGGGATTTTTCATGATCAAAGAGACTGTTACGATGAGTCATAAGGAACTCCACCGACTTCAGATTATTCAGGAACAAGCTGCGGCACGCATTGGCATTTCTATTCGGCAGGTTAAACGTCTGGTGCAACGGTATAGAAATGAAGGGCCTTCTGGTCTGGTTTCCCACCGACGTGGAAAGCGTCCTAATAATTCCTTTTCTACTGAATTCAGAGCAACAGTAATTTCACTCCTCAAAGGCCGTTACGCTGATTTTGGACCTACGTTTGCGTGCGAAAAATTGCGCGAGATACACGGTTTATCTTTATCCGTTGAAACTCTCAGAAAGTGGATGATAGAAGAGGGGTTATGGCGTGAACGCCGTCGTAAAATTGCCCGTATATATCAACGCCGCATGCGACGACCATCTTACGGTGAACTGATCCAGATTGATGGCTCACCTCATGACTGGTTTGAAAATCGAGGCCCCAGATGTACACTGATCGTTTTCATTGATGATGCCACCAGTGCGTTGATGGCGTTGCGTTTTGTGCCTGCTGAAACAACCCGGGCTTACATGGAAACCCTCCGGGGTTACCTTAATGATCATGGCGTACCGCTCGCTCTCTACTCTGATAGACACAGTATATTCAGGGTAAATAACCCAGAGCGGGAAGGTGAGCTGACCCAGTTCACTCGTGCGATAAAGACACTGGGCATCGAGCCAATCCATGCCAACAGCCCGCAGGCAAAAGGGCGGGTAGAGCGCGCCAATCAGACACTACAGGACAGGCTGGTCAAAGAAATGCGGCTTCAGAATATCAGTGATATTGAAACAGCAAATGCATGGTTGCCGACCTTTATTGAAGCCTATAACAACCGGTTCGCTACGTCGCCTCGTACTACTGATAATGCTCATCT
TGATGTGCACCATTCTGAAGAGGAACTGGGTTATATCTTCAGCCTACAGGCGAAGCGCGTTCTGTCTAAAAATCTCACTTTCCAGTACAAAAGCAGTGCGTTTCAGGTACGCAGTGAGGGCCGGGGATATCGACTTAGGCATTCGGTTGTTACTGTATGCGAGAACTTTGACGGTGAAATTAACGTTCTGTATGACGGGAAAGCGCTGGGCTGGGAAAAGTATGTTGATGGCCCGGAGCCTATACCACTGGATGATGAAAAGAGTGTCCATGAACGAGTGGATAATGCCCGTATTGATTTACGCTCAAAATACTATGTTAAACCTAAAGCTGACCATCCCTGGCTTACGCGCCGAACGCAAAGTCATCAGCAAGTTAAGCCCCCGAAGTTACCTAAAAAGAAGCCTGATCCCGATAAAAAAGATTGAAACCAAGATCGATTCGGTTGAGTGCATATCCATTCATAGGGTAGATTCTTAAGTCGCGTTTCTGGTGTTCATTTTCGGGTGGTTTGTTACTTGTTTTACCGGGGATATGCCAGAAACGCGCTGAGTCAGTCTGGGCGGTGCGCGTAATGAGGCGTTATGGTAAATAGCCTATGCTAATGTCCGCTAAGAGCAAGAAGCGGAAGTTGGCAGTTTTGTGGACTGTCCCCACAAAAGTGACTACAGAAATAGTTGCAATTCATAATTGATCATGGGTTGTCAGTTAAACTCGTGGCGATTTAAATAGACTAATTGGGAGTGCGTCCATTACTTATATCTTGTAATGTTAACTATCAGAAATTCTTGTAATGTTAACTATCAGAAATGATACAAAGATAATATGTCTTTAAAGAAAAGGCTGATGGCGAAAAGTGGCCCGATGAGGGCCACAATACGGCTGTCACTTAGACGTAAATATCAATGGTGCCAGCGGTATTTGTATCGTCTTTTTTCTCTTCTTTTTTATCAGGCTGAACTGTCGCGTCTTCATTCTTTTTCTCTGCCTGCTGCCTTAACAACTGCTCCAGTTGAGCCCAGAGGCTTTCAATTTGCTTCTGTACCAATGCAGCCATTTCTTTTTTCTGCTGTGTCGTCATCCCCTCTTCCGATGAGATTTTCCCAAGCTTTTCAGTCAGCACCTGAATTTGTCTTGTGATTTTGGCTATTTCTGATGTTCCTTCCGGGGCGGAGTTGTTTGAAATAACGGTTGAGGTATTTCCCTGAATTGTGACAGACATAGATTTCTCCTTTTAAAAAAGCACTATCGGCATGCACAAAAAAATCTTTAATCGTATTTCTTGTGTCATTAATTGTTTGATGTTCAGATTGTTTTCCTCGCGGGCTGGCGCGCCTCAGAAAGTAAAGCTTGTTGACAGGGGTAAACGTTCGGCAATAATTTTCTGCCGCATGCGGGTGTTGCATAAAACGTGTTACGTTCCTTTATCGACAGGTCAGGTCACCGCTCACCCGCCGACGAGAAAGCAACACTGACATGCTAAAGCAAAAAATAGATGAATAAGTTGAGTTGTGCATATGTAGCCTGACCGTCACAAAGTATATGGTGTCTGTACCAGTAAGATGATGGCCGGACTCTTTAAAAACGAGCTGACCTGCACAATACAGGATGGACTTAGCAATGGCTGCTCCTGGCACAAAGCGGACAGTGATCACCGTTCTTACGACTACTTTCTGACTTCCTTCGTGACTTGCCCTAAGCATGTTGTAGTGCGATACTTGTAATGACATTTGTAATTACAAGAGGTGTAAGACATGGGTAGCATTAACCTGCGTATTGACGATGAACTTAAAGCGCGTTCTTACGCCGCGCTTGAAAAAATGGGTGTAACTCCTTCTGAAGCGCTTCGTCTCATGCTCGAGTATATCGCTGACAATGAACGCTTGCCGTTCAAACAGACACTCCTGAGTGATGAAGATGCTGAACTTGTGGAGATAGTGAAAGAACGGCTTCGTAATCCTAAGCCAGTACGTGTGACGCTGGATGAACTCTGATGGCGTA
TTTTCTGGATTTTGACGAGCGGGCACTAAAGGAATGGCGAAAGCTGGGCTCGACGGTACGTGAACAGTTGAAAAAGAAGCTGGTTGAAGTACTTGAGTCACCCCGGATTGAAGCAAACAAGCTCCGTGGTATGCCTGATTGTTACAAGATTAAGCTCCGGTCTTCAGGCTATCGCCTTGTATACCAGGTTATAGACGAGAAAGTTGTCGTTTTCGTGATTTCTGTTGGGAAAAGAGAACGCTCGGAAGTATATAGCGAGGCGGTCAAACGCATTCTCTGAACCAAAGCATGACATCTCTGTTTCGCACCGAAGGTGACACTTCTGCTTTGCGTTGACAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAGTGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCCGGCGGGGGAGAAATCCCTCGCCACCTCTGATGTGGCAGGCATCCTCAACGCACCCGCACTTAACCCGCTTCGGCGGGTTTTTGTTTTTATTTTCAACGCGTTTGAAGTTCTGGACGGTGCCGGAATAGAATCAAAAATACTTAAGTAGCGCGCAGGGATAAGAGGGATGGTCCCTTAAAGGGGAGAGCTAATTATCCGGAAGGATTCTGATGATGAACATCGAAGAACTGCGTAAAATTTTTTGTGAAGATGGCCTCTATGCTGTGTGCGTTGAAAATGGAAATCTTGTTAGTCATTACCGCATTATGTGTTTGCGAAAGAATGGGGCTGCGTTAATTAATTTTGTGGATGCTCGGGTCACGGACGGATTTATCTTGCGCGAAGGTGAGTTTGTCACTTCATTACAGGCATTGAAAGAGATCGGAATAAAAGCTGGCTTTTCTGCTTTTTCAGGAGAATAAACTCATCTACAATCTTGCGCGGGGCTGAACTCCCGCTGAGTAACACCGTGCCACCGGAGAAAACCGATGGCACGCAACGCAAAATATTACAATTCTGATAATTCGCCCGTTCTTGCCTGCACGCACGGGCGGTATTCTCACGCATTCAAGTCTGAATGGTTCCAGCACCCTCCATGCACTGCAGAACAGGCCGAATGGCTGATTCATTCTTACCGCAGGCGCGGGTTCGAGGTTAAGAAAGCTCTCAGTCTCGACTATCGGCACTGGATAATCTCTGTCAGGCTGCCTTATTCCGAACGCCCACCACGTGCGTCCCGCACTTTCCAGCAACGGATCTGGAGGTAACGTGCGGGTATTACTTAGACCTGTTCTGGTGCCTGAGCTTGGGCTGGTGGTCCTTAAGCCGGGCCGTGAATCCATACAGATATTTCATAATCCTCGAGTGCTGGTGGAACCGGAACCAAAAAGCATGCGTAATCTGCCATCCGGAGTCGTTCCTGCCGTTCGCCAGCCGCTGGCGGAAGACAAAACATTGCTGCCGTTTTTTAGTAACGAACGGGTGATTCGTGCTGCTGGCGGCGTTGGCGCATTGTCCGACTGGCTATTACGTCATGTTACATCCTGCCAGTGGCCTAATGGCGATTACCATCACACTGAAACAGTCATTCACCGTTATGGTACCGGCGCAATGGTGTTGTGCTGGCACTGCGACAACCAACTGCGTGACCAGACATCGGAATCACTGGAGCTGCTTGCTCAACAAAATCTGACAGCATGGGTGATTGACGTCATCCGTCACGCAATAAGCGGTACGCAGGAGCGGGAATTATCTTTGGCTGAATTATCCTGGTGGGCGGTCTGCAATCAGGTGGTGGATGCACTACCTGAGGCTGTATCGCGTCGTTCGCTGGGATTACCAGCGGAAAAAATCTGCTCGGTGTACCGCGAAAGCGACATCGTACCGGGAGAGCAGACCGCCACCAGCATATTGAAACAACGCACAAAAAATCTTGCACCGTTGCCTT
ACGCCCACCAGCAACAAAAATCACCACAGGAAAAGACGGTGGTAAGCATCACCGTTGATCCAGAGTCTCCGGAATCTTTCATGAAGCTGCCTAAACGTCGCCGCTGGGTTAAGGAGAAATACACACGTTGGGTTAAGACACAGCCGTGTGCTTGCTGCGGTATGCCAGCCGACGATCCGCATCATCTGATTGGTCACGGGCAGGGCGGAATGGGAACAAAAGCACATGATCTCTTTGTGTTGCCTTTGTGCAGAAAGCATCACAACGAGCTGCATACGGATACAGTGGCATTTGAAGATAAGTATGGCTCCCAACTGGAGCTGATATTTCGTTTTATCGATCGCGCGCTGGCAATTGGCGTACTGGCGTAAGTGGAGAACGAGCATGAACCTTGAAGCCTTACCAAAATATTACTCCCCAAAATCTCCAAAATTGAGCGATGACGCTCCAGCGACAGGCACCGGTTGTTTAACAATTACGGATGTAATGGCAGCGCAGGGGATGGTGCAGTCGAAAGCACCACTTGGGTTGGCCTTATTTCTGGCAAAAGTTGGTGTTCAGGACCCTCAGTTTGCGATTGAAGGCCTGCTAAATTACGCGATGGCACTGGATAACCCGACATTGAACAAATTGAGTGAAGAAATCCGGTTACAGATTATTCCTTACCTCGTGAGTTTTGCCTTTGCTGATTACTCCAGGTCTGCGGCAAGTAAGGCTCGCTGTGAGCATTGTTCAGGTACGGGATTTTATAATGTATTGCGCGAAGTGGTGAAACACTACAGACGCGGGGAATCTGTAATCAAGGAAGAATGGGTGAAGGAACTATGTCAGCATTGCCATGGTAAGGGCGAAGCCAGCACAGCGTGCAGAGGGTGTAAGGGTAAAGGGATTGTTCTGGATGAAAAAAGAACCCGGTTTCATGGCGTACCGGTATATAAGATTTGTGGGCGTTGTAATGGAAACCGGTTTAGTCGTTTACCGACCACGCTGGCACGACGTCATGTCCAGAAGCTGGTACCAGACCTGACCGATTATCAGTGGTATAAGGGGTATGCGGACGTCATTGGTAAACTGGTAACAAAGTGCTGGCAGGAAGAAGCATACGCGGAAGCGCAATTGAGGAAGGTGACGAGATAAATGATTTTTGCTGAAGATGGCGACATGATGTTTGCATTTTTCAAAAAATATGGATAAAATTTTTTCAACGATGGGCTTTGTATACCCGACGTTAAGAAAAAGTAGAAAACCCGCTGATGAGCGGGTTTTGTGCTTTAAATGGGGCAATGGTAATGTTGAATCTCATCCCGGGACTCATGTCTGTTAACTTATTATTTAGCTGGTGACTTGGTTATTTGCCTGATGTTTAAAATGTTTTCTTCCAGTACAATGTCCCTAAACACAATGAGTCTGCTTATTATATTATTAGCAGAGCTATTACGGCCAAAGTACAGCATAAGCTTTTAAAGCCAATCAACCAGTCATCAAGACAGACGGGGTTATTCATAAAAACTCTCCATGTGTGATCCGATGGGGCCTGAAATTAAAGCTTTAATATAGCTCATGAAAGGTAAACATTGGCAGCTGAAGGGCCACGCAGACCATTTATCCGGCAAAATTCCACGCGTAATCCGGTGGTAATTTCTTCTGCATCGCGGAGATTGAGCGCTGAAACATGAAGCTGGACATCGATACGACCATCGGATGGGGTGATAAGACCCTTGCCGCTTTTGCCGTCAAAGGTTTTGACAATTCCTGTCATTTTACGGGACAAAAAAATTCCTTAATACTGATAACTTGGCGCACTATACACACGTTCCTGAAGAAAGCTATAGTTTTTTGATGGGGTTGAAGATGGCTGGATGTCTAAAATAAACATTGCTTCATATGTTCAACTATGCGTTAATGATTGCGTCGGTTTGAAGAACAGACGATATACGAAGTAGTTTACTAAAGCAGTTCTCATTTCAGGTGTTATTCACTTATTCCTTCTTTGAGTCTCT
CCAATTAAGTACGAAGTCGTTTCTGTTATGCAAACCATTTATGCCGAAAGGCTCAAGTTAAGGAATGTAGAATGTCAAATAAAATGACTGGTTTAGTAAAATGGTTTAACGCTGATAAAGGTTTCGGCTTTATTTCTCCTGTTGATGGTAGTAAAGATGTGTTTGTGCATTTTTCTGCGATTCAGAATGATAATTATCGAACCTTATTTGAAGGTCAAAAGGTTACCTTCTCTATAGAGAGTGGTGCTAAAGGTCCTGCAGCAGCAAATGTCATCATTACTGATTAAAATTCATCGCTCGTCTGTATACGATAACGAAGAAGGCTGATGCCTGAGTAGAGATACGGACAGAGTAGTGAATATTGGATCTCTTTAATAAAAAGTAAGGAGGTCCAATACATGAAACAATGGCTAGCATATTTGGCAAAATCTTAATCAGGAAAAGTATGCTAACCATTGTGGTGAAGTGCAGGTTTGCTGCATGAATAGTTTTACAGCAGAAGCTAACTGCTGGCATGGCAAAACAAAGTGCGTAAGTGGATGACTCCCACAAAAAGCACCACAATCTCAAACCCGCTCAGGCGGGTTTTTTATTATCTGCTTTAAATATATTATTAAAATATAAAAAATACTTGTTACTAATAAAATCAATCAGGCTACAGCTTTAAGATTTGTCTGGAATACTTTGTTGCAATGAGGGCAGATCAAAAGGGCACCTTTTTGTACTCTTGAAAAACTGTGTTCTGACTCTTGGGTGCAGTTTGGGCAGGAACATTTAACGAGATAATTACGGCGTGATTTTGAGTTTTTACGTTCTGACATAGGCTTTTCCTGTATAAATGGCCGTATACAGTACACTAAATATGAAAACATTTCTCGTATTATTATTTTATATATGACTTTCTTTCAAAATAATTACCCACATTTTTAATGTGTATGTTTTTTTAGCGCCGTTGAGAACAACGTGTGCTGTCAAAACTACCCCGTAGACTCCGATCTTTTCAAACATATTGCACCATCCGTGTACATCGGGGTGAGGATATGAAATCAATGGATAAGTTAACAACAGGTGTTGCCTATGGCACATCGGCGGGTAATGCTGGTTTCTGGGCATTGCAGTTACTCGATAAAGTAACTCCGTCACAGTGGGCTGCAATCGGTGTGCTGGGTAGCCTGGTTTTTGGCCTGCTGACGTATCTGACAAATCTTTATTTCAAGATTAAAGAAGACAGGCGTAAGGCTGCGAGAGGAGAGTAATCCAATGACTCAAGACTATGAACTGGTTGTGAAAGGAGTCCGTAATTTTGAGAATAAAGTTACGGTAACTGTAGCCTTACAGGACAAAGAACGCTTTGACGGTGAAATTTTTGACCTGGATGTCGCCATGGACCGTGTTGAAGGAGCTGCGCTGGAGTTTTATGAGGCAGCAGCCAGAAGGAGCGTCCGGCAAGTCTTCCTGGAAGTAGCAGAAAAATTGTCAGAAAAAGTTGAGTCTTATCTGCAGCATCAGTACTCCTTTAAGATTGAAAATCCTGCCAATAAGCACGAGCGTCCTCATCATAAATATCTATGAACACAAAAATCAGATACGGCCTGTCGGCTGCCGTTCTGGCGCTGATTGGTGCTGGCGCATCTGCTCCTCAGATACTTGACCAGTTTCTGGACGAAAAAGAAGGTAACCACACAATGGCATACCGCGATGGTTCTGGCATATGGACCATCTGTCGGGGTGCCACAGTGGTGGATGGAAAAACCGTTTTTCCCAATATGAAACTGTCGAAGGAAAAATGCGACCAGGTCAACGCCATTGAGCGTGATAAGGCGCTGGCATGGGTGGAGCGCAATATTAAAGTACCACTGACCGAACCACAAAAAGCGGGTATCGCGTCATTTTGTCCCTATAACATTGGCCCCGGTAAGTGTTTCCCGTCGACGTTTTATAAGCGGCTGAATGCTGGTGATCGTAAAGGTGCATGCGAAGCGATTCGCTGGTGGATTAAG
GATGGCGGACGCGATTGCCGCATTCGTTCAAATAACTGTTACGGTCAGGTTATTCGTCGTGACCAGGAGAGCGCATTAACCTGCTGGGGGATAGAACAGTGAATCAGATATTCATGGTGATTTTTCTCGTGTTGTCAGGATTTATCGTCGGAAATGTCTGGAGCGACCGAGGATGGCAAAAAAAATGGGCGGAACGTGATGCTGCCGCATTATCACAAGAGGTAAATGCTCAATTTGCTGCTCGAATAATTGAACAGGGGCGAACTATAGCCCGTGATGAGGCTGTTAAAGATGCGCAACAGAAATCTGCTGAAATTTCTGCCAGGGCTGCTTATCTGTCTGATAGTGTTAACCAGTTGCGTGCCGAAGCAAAAAAATATGCCATACGCCTTGACGCAGCGAAGCATACCGCAGATCTTGCCGCTGCCGTCAGAGGCAAAACAACCAAAACCGCCGAAGGAATGCTCACCAACATGCTCGGAGATATTGCAGCAGAAGCTCAGCTTTATGCTGAAATTGCTGACGAACGCTACATCGCAGGAGTGACTTGTCAACAGATCTATGAATCTTTAAGAGATAAAAAGCATCAAATGTAGGGTAATATTAAATCGGAACATTTACATCGCGGAATGTAAAATTTAAATAAAAAGGACTCTTCCATGAGCCAAAATTCCTGAAATCTTAAGGGTAAGATAAAAGGTCTTAATCAGAATGACACGTTTTATTAATAAATAAAGCTATTCTTTCATTGCTGTGTTTTTCTTTACAAAAGTAATCCTTGCTATGGGTGGTTAATCATGCGTTAATGGTGTTCTGGTTTGTTACAAATTTATCTGAAGCAGTCATTGTTATAATTTTATTATTTGTACCTCTTGAGATTTCCTTGTTGGTTTTTCTCTCTGATATTTTTTTTCGGACCATTCTGCCCAAGGGCTAATTTCTTCAAAAGGTAATAATTATGTCTAACAAAATGACTGGTTTAGTGAAATGGTTTAACCCTGAAAAAGGTTTTGGTTTCATCACGCCGAAAGATGGCAGCAAAGATGTGTTTGTCCATTTCTCAGCAATTCAGAGCAACGATTTCAAAACATTAACTGAGAATCAGGAAGTTGAATTTGGTATTGAGAACGGACCTAAAGGTCCTGCCGCTGTTCATGTAGTGGCGCTTTGAGGTAGACAATATTACAAACCATATTCACTTTAGATGCCCGTGTTGTCATGGTTCCCAGTATAGAACATCATCTTTTGATGTTTCTGACATGAATCCTTTCGGGGCAAAATGTATCTTTTGTAAATCAATGATGATTACATTTGATAATATTTCACAATACTTAAATGCCAGCCGTCTGTCGTTGGATTTGAAAAAGTGAAAATGAAGGCTCCTTCGGGAGCTTTTTTGCTTGGTGTCTATTCGATGGATACTCACATACTACGGTAACATCATGAAAAAAATCATAGTTTTTTTTAACTCTGAACCAGCAGTGGTAGTGCCAGCGATGACTGGAGTTAACACCATCATGCGTGAATATCCAAATGGCGAAAAAACACACCTTACTGTAATGGCCGCAGGGTTTCCATCTCTGACCGGAGATCATAAAGTCATTTATGTAGCCGCGGATCGACATGTTACTTCAGAAGAAATTCTGGAAGCAGCAATAAGGCTCTTGAGTTGATTTGATGCTATTGCATTGATAATTCAGGAAAATTCTCTTTGTCTGTTTGTGTAAAATTTAGACTATCGTATGTTGATTATTGCGATGTTTCATCTTATCTTTTACACGTTTGCACCATATAATCGACTTACTGTGTAACTGGAAAGTCATAACAGACTAAAAGAGGAAATGATGAATATTGAAAACTTAAAAACAAAAGCAGAAGCAGATATTTCTGAATATATAACAAAAAAAATTATTGAACTTAAGAAAAAGACCGGGAAAGAAGTTACCAGTATTCAGTTTACCGCACGGGAAAAAATGACGGGTCTTGAAAGCTATGATGTC
AAGATTAATTTAATCTGATGTATTCAATAATAAAATTTATCCATAAACCTCGTTTTTACGGGGTTTTGTTATATTTGAATGGTTCCGAATATCTAAATCACAATTGTTGATGGTTTTTATTAAACCAATGCAGTCCGGCTCAGGAGTGAGAGAAGCCGGACGTTATGGTTTAGCGTGGTAAGATCTGTGTAGTTTTCTGGATGCTTTCAGTAAATAGTAATGAATTATCAAAGGTATAGTAATATCTTTTTTGTTCGTGGATATTTGTAACCCACCGAAAAACTCCTGCTTTAGCAAGGTTTCTTCTGTATTCCTGAAATGTGATCTCTCTGGATTTCAGCTTATTAGAGGTCGTTTCTATAAGATGCCTATCCTTTGAAAATTTGACAGACACAATGTTTTTTAGGCCCTTTAATAACACTGTATTATCATTTTTTAATACAATATGAACATTCTCTGTGGCTAAATAGTAAATGTAATGTGAGACATTGTGACGTTTTAGCTCAGAATAAAACCATTGATAGTTTAAATCGTTTCGAACTTTATCAAATATTTGTTTAAAAATGACTACCTGATCCATAGATAAACCTTCCATGTGATATGAGGGGGCGTAGTCTGCACGATTATCTAAATTGCTTCAATCTGGTCTGACCTGTTTTCTGAGCAATTCAGTAATGTCACTCTTTTCTTTGTTTGCTTCAGAAGAAACTCTTTTTTCTGAGCACAGTCTCCGGCGGCAGGCTTCAATGACCCAGGCTGAGAAATTCCCGGACCCTTTTTGCTCAAGAGCGATGTTAATTTGTTCAATCATTTGGTTAGGAAAGCGGATGTTGCGGGTTGTTGTTCTGCGGGTTCTGTTCTTCGTTGACATGAGGTTGCCGGGGTTGCTGAGTGAATATATCGAACAGTCAGGTTAACAGGCTGCGGCATTTTGTCCGCGCCGGGCTTCGCTCACTGTTCAGGCCGGAGCCACAGACCGCCGTTGAATGGGCGGATGCTAATTACTATCTCCCGAAAGAATCCGCATACCAGGAAGGGCGCTGGGAAACACTGCCCTTTCAGCGGGCCATCATGAATGCGATGGGCAGCGACTACATCCGTGAGGTGAATGTGGTGAAGTCTGCCCGTGTTGGTTATTCCAAAATGCTGCTGGGTGTTTATGCCTACTTCATAGAGCATAAGCAGCGCAACACCCTTATCTGGTTGCCGACGGATGGTGATGCCGAGAACTTTATGAAAACTCACGTTGAGCCGACCATCCGTGATATTCCTTCGCTGCTTGCGCTGGCCCCGTGGTATGGCAAAAAGCACCGGGATAACACGCTCACCATGAAGCGTTTCACCAATGGTCGTGGCTTCTGGTGCCTGGGCGGTAAAGCGGCAAAAAACTACCGTGAAAAGTCGGTGGATGTGGCGGGTTATGATGAACTTGCTGCTTTTGATGATGATATTGAACAGGAAGGCTCTCCGACGTTCCTGGGTGACAAGCGTATTGAAGGCTCGGTCTGGCCAAAGTCCATCCGTGGCTCCACGCCCAAAGTGAGAGGCACCTGCCAGATTGAGCGTGCAGCCAGTGAATCCCCGCATTTTATGCGTTTTCATGTTGCCTGCCCGCACTGCGGGGAGGAGCAGTACCTTAAATTTGGCGATAAAGAGACGCCATTTGGCCTCAAATGGACGCCGGATGATCCCTCCAGCGTGTTTTATCTCTGCGAGCATAATGCCTGCGTCATCCGCCAGCAGGAGCTGGACTTTACTGATGCCCGTTATATCTGCGAAAAGACCGGGATCTGGACCCGTGATGGCATTCTCTGGTTTTCGTCATCCGGTGAAGAGATTGAGCCGCCGGACAGTGTGACCTTTCACATCTGGACGGCGTACAGCCCGTTCACCACCTGGGTGCAGATTGTCAAAGACTGGATGAAAACGAAAGGAGATACGGGAAAACGTAAAACCTTCGTGAACACCACGCTCGGTGAGACGTGGGAAGCGAAAATCGG
CGAACGTCCGGATGCTGAAGTGATGGCAGAGCGGAAAGAGCATTATTCAGCGCCCGTTCCTGACCGTGTGGCTTACCTGACCGCCGGTATCGACTCCCAGCTGGACCGCTACGAAATGCGCGTATGGGGATGGGGGCCGGGTGAGGAAAGCTGGCTGATTGACCGGCAGATTATTATGGGCCGCCACGACGATGAACAGACGCTGCTGCGTGTGGATGAGGCCATCAATAAAACCTATACCCGCCGGAATGGTGCAGAAATGTCGATATCCCGTATCTGCTGGGATACTGGCGGGATTGACCCGACCATTGTGTATGAACGCTCGAAAAAGCATGGGCTGTTCCGGGTGATCCCCATTAAAGGGGCATCCGTCTACGGTAAGCCGGTGGCCAGCATGCCTCGTAAGCGAAACAAAAACGGGGTTTATCCTGGCTGGATGCCGCAGAAATGGACATGGATACCCCGTGAGTTACCCGGCGGGCGCGCCTCGTTCATTCACGTTTTTGAACCCGTGGAGGACGGGCAGACCCGCGGTGCAAATGTGTTTTACAGCGTGATGGAGCAGATGAAGATGCTTGACACGCTGCAGAACACGCAGCTGCAGAGCGCCATTGTGAAGGCGATGTATGCCGCCACCATTGAGAGTGAGCTGGATACGCAGTCAGCGATGGATTTTATTCTGGGCGCGAACAGTAAGGAGCAGCGGGACAAGCTGACCGGCTGGATTGGTGAAATTGCCGCGTATTACGCCGCAGCACCGGTCCGTCTGGGAGGCGCAAAAGTGCCGCACCTGATGCCGGGGGACTCACTGAACCTGCAGACGGCTCAGGACACGGATAACGGCTACTCCGTGTTTGAGCAGTCACTGTTGCGGTATATCGCTGCCGGGCTGGGTGTCTCGTATGAGCAGCTTTCCCGGAATTACGCCCAGATGAGCTACTCCACGGCACGGGCCAGTGCGAACGAGTCGTGGGCGTACTTTATGGGGCGGCGAAAATTCGTCGCATCCCGTCAGGCGAGCCAGATGTTTCTGTGCTGGCTGGAAGAGGCCATCGTTCGCCGCGTGGTGACGTTACCTTCAAAAGCGCGCTTCAGTTTTCAGGAAGCCCGCAGTGCCTGGGGGAACTGCGACTGGATAGGCTCCGGTCGTATGGCCATCGATGGTCTGAAAGAAGTTCAGGAAGCGGTGATGCTGATAGAAGCCGGACTTCCGCAGCGAAGACATCCGAGACGAACGCGAAAGCGTCGGAAACCAGCGCAGAATCCTCAAAAACGGCTGCCGCATCGTCCGCCAGTTCGGCGGCGTCATCGGCATCATCGGCGTCAGCTTCAAAAGATGAGGCGACCAGACAGGCGTCAGCAGCGAAGGGCAGCGCCACGACAGCATCCACGAAGGCGACAGAGGCAGCTGGCAGTGCGACGGCGGCAGCACAGAGCAAAAGTACGGCGGAATCCGCGGCAACGCGCGCCGAGACAGCAGCTAAACGGGCAGAGGATATTGCATCCGCCGTGGCGCTTGAGGATGCAAGTACGACGAAAAAGGGGATAGTACAGCTCAGCAGTGCGACCAACAGTACGTCTGAAACGCTGGCGGCAACGCCAAAGGCAGTAAAATCAGCCTATGACAATGCAGAGAAACGTCTGCAGAAAGACCAGAACGGCGCTGATATACCCGATAAGGGACGCTTCCTGAACAACATTAACGCGGTCAGTAAAACAGACTTTGCTGATAAGCGTGGTATGCGTTATGTGCGGGTTAACGCTCCTGCAGGTGCAACATCTGGAAAATATTACCCTGTTGTTGTTATGCGTTCTGCTGGCTCAGTAAGCGAACTGGCATCAAGGGTCATTATCACCACGGCACCGCGAACCGCAGGCGATCCGATGAATAACTGCGAGTTTAACGGATTTGTTATGCCTGGTGGCTGGACTGACAGGGGGCGTTATGCTTATGGAATGTTCTGGCAATATCAAAACAATGAACGAGCCATCCACTCAAT
AATGATGAGTAATAAGGGCGATGATTTGCGCTCTGTGTTCTATGTTGATGGCGCTGCTTTCCCTGTTTTTGCGTTTATCGAAGATGGCCTGTCAATATCCGCACCTGGTGCTGATCTCGTTGTTAATGATACGACCTATAAGTTTGGGGCAACAAATCCAGCGACTGAATGTATCGCGGCGGACGTTATCCTTGATTTTAAGAGTGGGCGTGGTTTTTATGAGTCTCATTCGTTAATCGTTAACGATAACTTGTCGTGCAAAAAACTTTTTGCCACAGACGAAATTGTAGCGCGTGGTGGTAATCAGATTCGAATGATAGGTGGGGAGTATGGTGCATTATGGCGTAATGATGGCGCTAAAACTTACCTGCTGCTTACCAATCAAGGTGATGTTTATGGTGGCTGGAATACATTAAGACCGTTTGCTATTGATAACGCAACCGGCGAACTGGTTATTGGAACCAAACTGTCCGCAAGTCTGAACGGTAATGCATTAACAGCAACAAAGCTGCAAACGCCAAGACTGGTTTCTGGTGTTGAGTTTGATGGTTCCAAAGATATTACTTTAACCGCCGCGCATGTGGCTGCTTTTGCCAGAAGGGCAACGGATACATATGCCGATGCGGATGGTGGCGTTCCCTGGAATGCCGAATCAGGCGCTTACAATGTCACCCGCTCTGGCAACAGCTATATTCTGGTTAACTTCTATACCGGAGTCGGAAGTTGCCGGACATTGCAGATGAAGGCGCATTACAGAAATGGAGGTCTGTTCTACCGTTCCTCAAGAGATGGCTATGGTTTTGAGGAAGACTGGGCAGAAGTTTATACCTCGAAAAATCTTCCACCAGAAAGCTACCCAGTCGGTGCACCAATCCCGTGGCCATCAGATACCGTTCCGTCTGGTTATGCCCTGATGCAGGGGCAGACTTTTGACAAATCTGCTTACCCGAAACTTGCAGCCGCTTATCCGTCAGGCGTGATCCCTGATATGCGTGGCTGGACGATTAAGGGCAAGCCCGCCAGTGATCGAGCCGTATTGTCTCAGGAACAGGACGGCATTAAATCACACACCCACAGCGCCAGCGTATCCAGTACGGATTTGGGGACGAAAACCACATCGTCGTTTGATTACGGCACTAAATCCACGAATAACACTGGTGCGCATACCCATAGTGTTAGCGGTACGGCTGCTTCAGCCGGTGCACATACCCATTCGATGACATTTGTTTCAGGTGGTTCCAGTGGTGCTCCGGGAAGTGGATCACCTGATTATTCTAAATACAGTGTTAACACTTCTTCTGCAGGCGCTCATACGCACTCTGTATCGGGTACTGCTGCAAGCGCAGGTGCACACGCACATACTGTCGGTATTGGTGCTCATACGCACTCCGTTGCGATTGGTTCACATGGACATACCATCACCGTTAACGCTGCTGGTAACGCGGAAAACACCGTCAAAAACATCGCATTTAACTATATTGTGAGGCTTGCATAATGGCATTCAGAATGAGTGAACAACCACGGACCATAAAAATTTATAATCTGCTGGCCGGAACTAATGAATTTATTGGTGAAGGTGATGCATATATTCCGCCTCATACAGGTCTGCCAGCAAACAGTACCGATATTGCACCGCCAGATATTCCGGCTGGCTTCGTGGCTGTTTTCAACAGTGATGAGTCATCGTGGCATCTCGTTGAAGATCATCGGGGTAAAACGGTTTATGACGTGGCTTCCGGCGACGCGTTATTTATTTCTGAACTCGGTCCGTTACCGGAAAATGTTACCTGGTTATCGCCGGAAGGGGAGTTTCAGAAGTGGAACGGCACAGCCTGGGTGAAGGATACGGAAGCAGAAAAACTGTTCCGGATCCGGGAGGCGGAAGAAACAAAAAACAACCTGATGCAGGTAGCCAGTGAGCATATTGCGCCGCTTCAGGATGCTGCAGATCTGGAAATCGCAACGGAGGAAGAAATCTCGTTGCTGGAAGCATGGAAAAA
GTATCGGGTATTGCTGAACCGTGTTGATACGTCAACTGCACAGGATATTGAATGGCCAGCACTGCCGTAGGGTAAAACATATAAATTCTATAATTAGATGTATCTTTCCATTTACGGCAAGGAAGGGGGCTTGGAAGACGTAAAGCATCTCACACCGAGATTATTTTTTATATGTCAGGTGTCTGAAGTTTTGCTTTGGCTCTTAAAATGGTTTGCCGCGAGGTTTTGAATTCCCGGGCAATGGCACTTATACTTACACCTGACTTAATTCGTTCGAATACCGCCTGTTTCTGTTCTTCATTTAACACAGGTGGTCGACCAAAACGTTTCCCTGCGCCGCGGGCTCTTACTATCCCGGAATGAGTGCGTTCAAGTAAAAGGTCTCGTTCAAATTCAGCGACTGCTGAAATTACTTGCATCATCATTTTTCCTGTTGGACTGGTCAGGTCAATGCCCCCCAATGCTAAGCAATGCACTCTGATACCTGTTTCGGTCAGTTGTTCCACTGTTTTCCTGATATCCATTGCATTACAACCAAGGCGATCCAGTTTTGTCACAATCAATTGATCACCACATTTCAGGCGAGCAAGCAACCGGTTAAAACCAGGACGCTCACTGGTTGCTGCTGAGCCGCTAATGTGTTCTTCGATTATTTGCTGAGGTTTGATTTTAAAACCTGCACTTTCGATTTCCCGGCGTTGATTTTCGGTGGTCTGATCCAGCGTTGATATCCGACAGTAAGCAAAAATTCGAGACAT
Protein sequences of DBSCAN-SWA_7 >LR134000|2229639:2272322|2238813_2239431_-|VDY68591.1|DBSCAN-SWA MTTQYGFFIDSSRCTGCKTCELACKDFKDLGPEVSFRRIYEYAGGDWQEDNGVWHQNVFAYYLSISCNHCDDPACTKVCPSGAMHKREDGFVVVDEDVCIGCRYCHMACPYGAPQYNAEKGHMTKCDGCYSRVAEGKQPICVESCPLRALEFGPIEELRQKHGTLAAVAPLPRAHFTKPNIVIKPNANSRPTGDTTGYLANPEEV >LR134000|2229639:2272322|2252654_2253011_+|VDY68604.1|DBSCAN-SWA MSAPATILDMCCGSRMFWFDKSDERAIFSDIRKEGYTLRNGRRLIISPDIIADFRALSFADASFSMVVLDPPHLESVGDNAWMGKKYGRLNKDAWRDDSRQRFKEAFRVLRPHGVLIF >LR134000|2229639:2272322|2257915_2258071_+|VDY68610.1|DBSCAN-SWA MKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >LR134000|2229639:2272322|2229639_2230461_-|VDY68582.1|protease|DBSCAN-SWA MRTTIAVVLGAISLTSAFVFADKPDVARSANDEVSTLFFGHDDRVPVNDTTQSPWDAVGQLETASGNLCTATLIAPNLALTAGHCLLTPPKGKADKAVALRFVSNKGLWRYEIHDIEGRVDPTLGKRLKADGDGWIVPPAAAPWDFGLIVLRNPPSGITPLPLFEGDKAALTAALKAAGRKVTQAGYPEDHLDTLYSHQNCEVTGWAQTSVMSHQCDTLPGDSGSPLMLHTDDGWQLIGVQSSAPAAKDRWRADNRAISVTGFRDKLDQLSQK >LR134000|2229639:2272322|2230736_2231045_-|VDY68583.1|DBSCAN-SWA MKKVLALVVAAAMGLSSAAFAAETTTTPAPTATTTKAAPAKTTHHKKQHKAAPAQKAQAAKKHHKNTKAEQKAPEQKAQAAKKHAKKHSHQQPAKPAAQPAA >LR134000|2229639:2272322|2245676_2246237_-|VDY68596.1|DBSCAN-SWA MPSAHSVKLRPLEREDLRYVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECDGEKAGLVELVEINHVHRRAEFQIIISPEYQGKGLATRAAKLAMDYGFTVLNLYKLYLIVDKENEKAIHIYRKLGFSVEGELMHEFFINGQYRNAIRMCIFQHQYLAEHKTPGQTLLKPTAQ >LR134000|2229639:2272322|2254677_2255991_+|VDY68606.1|integrase,transposase|DBSCAN-SWA MIKETVTMSHKELHRLQIIQEQAAARIGISIRQVKRLVQRYRNEGPSGLVSHRRGKRPNNSFSTEFRATVISLLKGRYADFGPTFACEKLREIHGLSLSVETLRKWMIEEGLWRERRRKIARIYQRRMRRPSYGELIQIDGSPHDWFENRGPRCTLIVFIDDATSALMALRFVPAETTRAYMETLRGYLNDHGVPLALYSDRHSIFRVNNPEREGELTQFTRAIKTLGIEPIHANSPQAKGRVERANQTLQDRLVKEMRLQNISDIETANAWLPTFIEAYNNRFATSPRTTDNAHLDVHHSEEELGYIFSLQAKRVLSKNLTFQYKSSAFQVRSEGRGYRLRHSVVTVCENFDGEINVLYDGKALGWEKYVDGPEPIPLDDEKSVHERVDNARIDLRSKYYVKPKADHPWLTRRTQSHQQVKPPKLPKKKPDPDKKD >LR134000|2229639:2272322|2251027_2251264_-|VDY68602.1|DBSCAN-SWA 
MIVSPGKWVSEEQLIALKGIKKGTLKKAREKSFMEGREYKHVAHDGMPWDNSPCFYNLEEIDRWIERQASARPRRHLT >LR134000|2229639:2272322|2262613_2262820_+|VDY68616.1|lysis|DBSCAN-SWA MDKLTTGVAYGTSAGNAGFWALQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDRRKAARGE >LR134000|2229639:2272322|2235849_2237106_-|VDY68588.1|DBSCAN-SWA MFRRLLIATVVGILAAFAVAGFRHAMLLLEWLFLNNDSGSLVNAATNLSPWRRLLTPALGGLAAGLLLMGWQKFTQQRPHAPTDYMEALQTDGQFDYAASLVKSLASLLVVTSGSAIGREGAMILLAALAASCFAQRFTPRQEWKLWIACGAAAGMAAAYRAPLAGSLFIAEVLFGTMMLASLGPVIISAVVALLVSNLINHSDALLYNVQLSVTVQARDYALIISTGVLAGLCGPLLLTLMNACHRGFVSLKLAPPWQLALGGLIVGLLSLFTPAVWGNGYSTVQSFLTAPPLLMIIAGIFLCKLCAVLASSGSGAPGGVFTPTLFIGLAIGMLYGRSLGLWFPDGEEITLLLGLTGMATLLAATTHAPIMSTLMICEMTGEYQLLPGLLIACVIASVISRTLHRDSIYRQHTAQHS >LR134000|2229639:2272322|2233856_2235077_+|VDY68586.1|DBSCAN-SWA MVAENQPGHIDQIKQTNAGAVYRLIDQLGPVSRIDLSRLAQLAPASITKIVREMLEAHLVQELEIKEAGNRGRPAVGLVVETEAWHYLSLRISRGEIFLALRDLSSKLVVEESQELALKDDLPLLDRIISHIDQFFIRHQKKLERLTSIAITLPGIIDTENGIVHRMPFYEDVKEMPLGEALEQHTGVPVYIQHDISAWTMAEALFGASRGARDVIQVVIDHNVGAGVITDGHLLHAGSSSLVEIGHTQVDPYGKRCYCGNHGCLETIASVDSILELAQLRLNQSMSSMLHGQPLTVDSLCQAALRGDLLAKDIITGVGAHVGRILAIMVNLFNPQKILIGSPLSKAADILFPVISDSIRQQALPAYSQHISVESTQFSNQGTMAGAALVKDAMYNGSLLIRLLQG >LR134000|2229639:2272322|2244550_2244856_-|VDY68594.1|DBSCAN-SWA MKLSTCCAALLLALASPAVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTRTTSGNVSAPAQSSQDGAPAEPQ >LR134000|2229639:2272322|2258885_2259935_+|VDY68612.1|DBSCAN-SWA MRVLLRPVLVPELGLVVLKPGRESIQIFHNPRVLVEPEPKSMRNLPSGVVPAVRQPLAEDKTLLPFFSNERVIRAAGGVGALSDWLLRHVTSCQWPNGDYHHTETVIHRYGTGAMVLCWHCDNQLRDQTSESLELLAQQNLTAWVIDVIRHAISGTQERELSLAELSWWAVCNQVVDALPEAVSRRSLGLPAEKICSVYRESDIVPGEQTATSILKQRTKNLAPLPYAHQQQKSPQEKTVVSITVDPESPESFMKLPKRRRWVKEKYTRWVKTQPCACCGMPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHNELHTDTVAFEDKYGSQLELIFRFIDRALAIGVLA >LR134000|2229639:2272322|2257317_2257557_+|VDY68608.1|DBSCAN-SWA MGSINLRIDDELKARSYAALEKMGVTPSEALRLMLEYIADNERLPFKQTLLSDEDAELVEIVKERLRNPKPVRVTLDEL >LR134000|2229639:2272322|2257556_2257844_+|VDY68609.1|DBSCAN-SWA 
MAYFLDFDERALKEWRKLGSTVREQLKKKLVEVLESPRIEANKLRGMPDCYKIKLRSSGYRLVYQVIDEKVVVFVISVGKRERSEVYSEAVKRIL >LR134000|2229639:2272322|2265733_2266144_-|VDY68621.1|DBSCAN-SWA MDQVVIFKQIFDKVRNDLNYQWFYSELKRHNVSHYIYYLATENVHIVLKNDNTVLLKGLKNIVSVKFSKDRHLIETTSNKLKSREITFQEYRRNLAKAGVFRWVTNIHEQKRYYYTFDNSLLFTESIQKTTQILPR >LR134000|2229639:2272322|2259948_2260701_+|VDY68613.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATGTGCLTITDVMAAQGMVQSKAPLGLALFLAKVGVQDPQFAIEGLLNYAMALDNPTLNKLSEEIRLQIIPYLVSFAFADYSRSAASKARCEHCSGTGFYNVLREVVKHYRRGESVIKEEWVKELCQHCHGKGEASTACRGCKGKGIVLDEKRTRFHGVPVYKICGRCNGNRFSRLPTTLARRHVQKLVPDLTDYQWYKGYADVIGKLVTKCWQEEAYAEAQLRKVTR >LR134000|2229639:2272322|2261122_2261335_-|VDY68614.1|DBSCAN-SWA MSRKMTGIVKTFDGKSGKGLITPSDGRIDVQLHVSALNLRDAEEITTGLRVEFCRINGLRGPSAANVYLS >LR134000|2229639:2272322|2237957_2238812_-|VDY68590.1|DBSCAN-SWA MGNGWHEWPLVIFTVLGQCVVGALIVSGIGWFAAKNDADRQRIVRGMFFLWLLMGVGFIASVMHLGSPLRAFNSLNRIGASGLSNEIAAGSIFFAVGGLWWLVAVIGKMPQALGKLWLLFSMALGVIFVWMMTCVYQIDTVPTWHNGYTTLAFFLTVLLSGPILAAAILRAARVTFNTTPFAIISVLALIACAGVIVLQGLSLASIHSSVQQASALVPDYASLQVWRVVLLCAGLGCWLCPLIRRREPHVAGLILGLILILGGEMIGRVLFYGLHMTVGMAIAG >LR134000|2229639:2272322|2263662_2264160_+|VDY68619.1|DBSCAN-SWA MNQIFMVIFLVLSGFIVGNVWSDRGWQKKWAERDAAALSQEVNAQFAARIIEQGRTIARDEAVKDAQQKSAEISARAAYLSDSVNQLRAEAKKYAIRLDAAKHTADLAAAVRGKTTKTAEGMLTNMLGDIAAEAQLYAEIADERYIAGVTCQQIYESLRDKKHQM >LR134000|2229639:2272322|2258287_2258539_+|VDY68611.1|DBSCAN-SWA MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDARVTDGFILREGEFVTSLQALKEIGIKAGFSAFSGE >LR134000|2229639:2272322|2237300_2237915_-|VDY68589.1|DBSCAN-SWA MTHFSQQDNFSVAARVLGALFYYAPESAEAAPLVAVLTSDGWETQWPLPEASLAPLVTAFQTQCEETHAQAWQRLFVGPWALPSPPWGSVWLDRESVLFGDSTLALRQWMREKGIQFEMKQNEPEDHFGSLLLMAAWLAENGRQTECEELLAWHLFPWSTRFLDVFIEKAEHPFYRALGELARLTLAQWQSQLLIPVAVKPLFR >LR134000|2229639:2272322|2269073_2271059_+|VDY68624.1|tail|DBSCAN-SWA 
MALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGRFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTAPRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRLVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGNSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASDRAVLSQEQDGIKSHTHSASVSSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGTAASAGAHTHSMTFVSGGSSGAPGSGSPDYSKYSVNTSSAGAHTHSVSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA >LR134000|2229639:2272322|2241925_2244352_-|VDY68593.1|DBSCAN-SWA MSKNERMVGISRRTLVKSTAIGSLALAAGGFSLPFTLRNAAASVQQAREKVVWGACSVNCGSRCALRLHVKDNEVTWVETDNTGSDEYGNHQVRACLRGRSIRRRINHPDRLNYPMKRVGKRGEGKFERISWDEALDTIASSLKKTVEQYGNEAVYIQYSSGIVGGNMTRSSPSASAVKRLMNCYGGSLNQYGSYSTAQISCAMPYTYGSNDGNSTTDIENSKLVVMFGNNPAETRMSGGGITYLLEKAREKSNAKMIVIDPRYTDTAAGREDEWLPIRPGTDAALVAGIAWVLINENLVDQPFLDKYCVGYDEKTLPADAPKNGHYKAYILGEGDDKTAKTPQWASQITGIPEDRIIKLAREIGTAKPAYICQGWGPQRQANGELTARAIAMLPILTGNVGISGGNSGARESTYTITIERLPVLDNPVKTSISCFSWTDAIDHGPQMTAIRDGVRGKDKLDVPIKFIWNYAGNTLVNQHSDINKTHEILQDESKCEMIVVIENFMTSSAKYADILLPDLMTVEQEDIIPNDYAGNMGYLIFLQPVTSEKFERKPIYWILSEVAKRLGPDVYQKFTEGRTQEQWLQHLYAKMLAKDPALPSYDELKKMGIYKRKDPNGHFVAYKAFRDDPEANPLKTPSGKIEIYSSRLAEIARTWELEKDEVISPLPVYASTFEGWNSPERRTFPLQLFGFHYKSRTHSTYGNIDLLKAACRQEVWINPIDAQKRGIANGDMVRVFNHRGEVRLPAKVTPRILPGVSAMGQGAWHEANMSGDKIDHGGCVNTLTTLRPSPLAKGNPQHTNLVEIEKI >LR134000|2229639:2272322|2251351_2252485_-|VDY68603.1|DBSCAN-SWA MPSRVFCGRFASPYAPVQRGITRISQPDADKLLAASRGEFVEGISDPNDPKWVKGLQTRDSVNQNQHESERNYQKAEQNSPNALQNEPETKQPEPVAQQEVEKVCTACGQTGGGNCPDCGAVMGDATYQETFDEEYQVEVQEDDPEEMEGAEHPHKENTGGNQHHNSDNETGETADHPIKVNGHHEITSTSRTCDHLMIDLETMGKNPDAPIISIGAIFFDPQTGDMGPEFSKTIDLETAGGVIDRDTIKWWLKQSREAQSAIMTDEIPLDDALLQLREFIDENSGEFFVQVWGNGANFDNTILRRSYERQGIPCPWRYYNDRDVRTIVELGKAIDFDARTAIPFEGERHNALDDARYQAKYVSVIWQKLIPSQADS 
>LR134000|2229639:2272322|2248505_2249525_+|VDY68600.1|DBSCAN-SWA MKSILIEKPNQLAIVEREIPTPSAGEVRVKVKLAGICGSDSHIYRGHNPFAKYPRVIGHEFFGVIDAVGEGVESARVGERVAVDPVVSCGHCYPCSIGKPNVCTTLAVLGVHADGGFSEYAVVPAKNAWKIPEAVADQYAVMIEPFTIAANVTGHGQPTENDTVLVYGAGPIGLTIVQVLKGVYNVKNVIVADRIDERLEKAKESGADWAINNSQTPLGEIFTEKGIKPTLIIDAACHPSILKEAVTLASPAARIVLMGFSSEPSEVIQQGITGKELSIFSSRLNANKFPIVIDWLSKGLIKPEKLITHTFDFQHVADAISLFEQDQKHCCKVLLTFSE >LR134000|2229639:2272322|2247279_2248494_+|VDY68599.1|DBSCAN-SWA MKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQCGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGTLWNW >LR134000|2229639:2272322|2262824_2263136_+|VDY68617.1|DBSCAN-SWA MTQDYELVVKGVRNFENKVTVTVALQDKERFDGEIFDLDVAMDRVEGAALEFYEAAARRSVRQVFLEVAEKLSEKVESYLQHQYSFKIENPANKHERPHHKYL >LR134000|2229639:2272322|2244927_2245674_+|VDY68595.1|DBSCAN-SWA MIENHLYSLVTVVKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY >LR134000|2229639:2272322|2254163_2254343_+|VDY68605.1|DBSCAN-SWA MKNALQFLFVAFWLFASCMPIIFTARYMEKVDVLILIFGYINALFLGVFMAVMCIEYWR >LR134000|2229639:2272322|2246271_2246613_-|VDY68597.1|DBSCAN-SWA MKITLSKRIGLLAILLPCALALSTTVHAETNKLVIESGDSAQSRQHAAMEKEQWNDTRNLRQKVNKRTEKEWDKADAAFDNRDKCEQSANINAYWEPNTLRCLDRRTGRVITP >LR134000|2229639:2272322|2239441_2241838_-|VDY68592.1|DBSCAN-SWA 
MKAEISRRSLMKTSALGSLALASSAFTLPFSQMVRAAEAPVEEKAVWSSCTVNCGSRCLLRLHVKDDTVYWVESDTTGDDVYGNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTISDNLRRILKDYGNEAVHVLYGTGVDGGNITNSNVPYRLMNSCGGFLSRYGSYSTAQISAAMSYMFGANDGNSPDDIANTKLVVMFGNNPAETRMSGGGVTYYVEQARERSNARMIVIDPRYNDTAAGREDEWLPIRPGTDGALACAIAWVLITENMVDQPFLDKYCVGYDEKTLPANAPRNAHYKAYILGEGPDGIAKTPEWAAKITSIPAEKIIQLAREIGSAKPAYICQGWGPQRHSNGEQTSRAIAMLSVLTGNVGINGGNSGVREGSWDLGVEWFPMLENPVKTQISVFTWTDAIDHGTEMTATRDGVRGKEKLDVPIKFLWCYASNTLINQHGDINHTHEVLQDDSKCEMIVGIDHFMTASAKYCDILLPDLMPTEQEDLISHESAGNMGYVILAQPATSAKFERKPIYWMLSEVAKRLGPDVYQTFTEGRSQHEWIKYLHAKTKERNPEMPDYEEMKTTGIFKKKCPEEHYVAFRAFREDPQANPLKTPSGKIEIYSERLAKIADTWELKKDEIIHPLPAYTPGFDGWDDPLRKTYPLQLTGFHYKARTHSSYGNIDVLQQACPQEVWINPIDAQARGIRHGDTVRVFNNNGEMLIAAKVTPRILPGVTAIGQGAWLKADMFGDRVDHGGSINILTSHRPSPLAKGNPSHSNLVQIEKV >LR134000|2229639:2272322|2271731_2272322_-|VDY68625.1|DBSCAN-SWA MSRIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARLKCGDQLIVTKLDRLGCNAMDIRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMQVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQAVFERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI >LR134000|2229639:2272322|2262452_2262572_-|VDY68615.1|DBSCAN-SWA MFEKIGVYGVVLTAHVVLNGAKKTYTLKMWVIILKESHI >LR134000|2229639:2272322|2263132_2263666_+|VDY68618.1|DBSCAN-SWA MNTKIRYGLSAAVLALIGAGASAPQILDQFLDEKEGNHTMAYRDGSGIWTICRGATVVDGKTVFPNMKLSKEKCDQVNAIERDKALAWVERNIKVPLTEPQKAGIASFCPYNIGPGKCFPSTFYKRLNAGDRKGACEAIRWWIKDGGRDCRIRSNNCYGQVIRRDQESALTCWGIEQ >LR134000|2229639:2272322|2266391_2266481_+|VDY68622.1|DBSCAN-SWA MLRVVVLRVLFFVDMRLPGLLSEYIEQSG >LR134000|2229639:2272322|2246747_2247074_+|VDY68598.1|DBSCAN-SWA MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLTLHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLTLYDWTGALIALCGMLIIVAGWGRT >LR134000|2229639:2272322|2235201_2235897_+|VDY68587.1|DBSCAN-SWA MLKRFFITGTDTSVGKTVVSRALLQALASQGKTVAGYKPVAKGSKETPEGLRNKDALVLQSVSTIELPYEAVNPIALSEEESSVAHSCPINYTLISNGLANLTEKVDHVVVEGTGGWRSLMNDLRPLSEWVVQEQLPVLMVVGIQEGCINHALLTAQAIANDGLPLIGWVANRINPGLAHYAEIIDVLGKKLPAPLIGELPYLPRAEQRELGQYIRLAMLRSVLAVDRVTV 
>LR134000|2229639:2272322|2265408_2265582_+|VDY68620.1|DBSCAN-SWA MNIENLKTKAEADISEYITKKIIELKKKTGKEVTSIQFTAREKMTGLESYDVKINLI >LR134000|2229639:2272322|2266455_2269113_+|VDY68623.1|terminase|DBSCAN-SWA MNISNSQVNRLRHFVRAGLRSLFRPEPQTAVEWADANYYLPKESAYQEGRWETLPFQRAIMNAMGSDYIREVNVVKSARVGYSKMLLGVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLALAPWYGKKHRDNTLTMKRFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDDDIEQEGSPTFLGDKRIEGSVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKFGDKETPFGLKWTPDDPSSVFYLCEHNACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEEIEPPDSVTFHIWTAYSPFTTWVQIVKDWMKTKGDTGKRKTFVNTTLGETWEAKIGERPDAEVMAERKEHYSAPVPDRVAYLTAGIDSQLDRYEMRVWGWGPGEESWLIDRQIIMGRHDDEQTLLRVDEAINKTYTRRNGAEMSISRICWDTGGIDPTIVYERSKKHGLFRVIPIKGASVYGKPVASMPRKRNKNGVYPGWMPQKWTWIPRELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSKEQRDKLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLPQRRHPRRTRKRRKPAQNPQKRLPHRPPVRRRHRHHRRQLQKMRRPDRRQQRRAAPRQHPRRRQRQLAVRRRQHRAKVRRNPRQRAPRQQLNGQRILHPPWRLRMQVRRKRG >LR134000|2229639:2272322|2232828_2233722_+|VDY68585.1|DBSCAN-SWA MNIELRHLRYFVAVAEELHFGRAAARLNISQPPLSQQIQALEQQIGARLLARTNRSVLLTAAGKQFLADSRQILSMVDDAAARAERLHQGEAGELRIGFTSSAPFIRAVSDTLSLFRRDYPDVHLQTREMNTREQIAPLIEGTLDMGLLRNTALPESLEHAVIVHEPLMAMIPHDHPLANNPNVTLAELAKEPFVFFDPHVGTGLYDDILGLMRRYHLTPVITQEVGEAMTIIGLVSAGLGVSILPASFKRVQLNEMRWVPIAEEDAVSEMWLVWPKHHEQSPAARNFRIHLLNALR >LR134000|2229639:2272322|2256452_2256785_-|VDY68607.1|DBSCAN-SWA MSVTIQGNTSTVISNNSAPEGTSEIAKITRQIQVLTEKLGKISSEEGMTTQQKKEMAALVQKQIESLWAQLEQLLRQQAEKKNEDATVQPDKKEEKKDDTNTAGTIDIYV >LR134000|2229639:2272322|2231468_2232722_-|VDY68584.1|DBSCAN-SWA 
MSRTTTVDGAPASDTDKQSISQPNQFIKRGTPQFMRVTLALFSAGLATFALLYCVQPILPVLSQEFGLTPANSSISLSISTAMLAIGLLFTGPLSDAIGRKPVMVTALLLASICTLLSTMMTSWHGILIMRALIGLSLSGVAAVGMTYLSEEIHPSFVAFSMGLYISGNSIGGMSGRLISGVFTDFFNWRIALAAIGCFALASALMFWKILPESRHFRPTSLRPKTLFINFRLHWRDRGLPLLFAEGFLLMGSFVTLFNYIGYRLMLSPWHVSQAVVGLLSLAYLTGTWSSPKAGTMTTRYGRGPVMLFSTGVMLFGLLMTLFSSLWLIFAGMLLFSAGFFAAHSVASSWIGPRAKRAKGQASSLYLFSYYLGSSIAGTLGGVFWHNYGWNGVGAFIALMLVIALLVGTRLHRRLHA >LR134000|2229639:2272322|2249712_2250993_-|VDY68601.1|integrase|DBSCAN-SWA MKYPTGVENHGGKLRIWFVYKDVRVRENLGVPDTAKNRRVAGELRSSVCYAIKTGVFDYAKQFPSSRNLEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEKNLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGYLADNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTGVRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAGTNRVIHLIKPAIDALRSQMTLTRLSKEHIIDVHLREYGRTEKQKCTFVFQPEVSARVKNYGDHFTVDSIRQMWDAAIKRAGLRHRKSYQSRHTYACWSLTAGANPAFIANQMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN |
44 | Enterobacteria_phage(33.33%) | tail,terminase,protease,transposase,integrase,lysis | attL 2237215:2237230|attR 2264132:2264147 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
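The keyword rule described above can be illustrated with a minimal Python sketch. The keyword list is taken from the text; the function name `match_phage_keywords` and the case-insensitive substring matching are assumptions made for illustration, not the database's documented implementation.

```python
from typing import List

# Keywords as listed in the page text, in the same order.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def match_phage_keywords(annotation: str) -> List[str]:
    """Return the phage-related keywords found in a protein annotation.

    Assumption: a keyword "matches" when it occurs as a case-insensitive
    substring of the annotation (so 'plate' also matches 'baseplate').
    """
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


if __name__ == "__main__":
    for text in ("Phage terminase large subunit",
                 "phage tail fiber protein",
                 "hypothetical protein"):
        print(text, "->", match_phage_keywords(text))
```

A protein with a non-empty result would be one of the colored proteins under this reading of the rule; substring matching is deliberately permissive and may over-match short keywords such as 'head'.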
DBSCAN-SWA_8 |
2485052 : 2493936
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >LR134000|2485052:2493936|DBSCAN-SWA GTTACCTGGCTATCGGCTTTTTACCCTGAGCAACAAGCATCGCAAACTTAGCCTGCTGGTCAGAAATGCCGAATTCATCGGGACAGAACGAAAATTCCTCCGCGTCGCCCTCATTCAGGCCCGCATCGAATACTGGCTTTTTTTCCTGAGATTTTTCGTTCCGCTTTTGTGCAGTCTGCGCAGATTTTTTCTGCGCACTTTTTTGCGCAGTTTTGCACATTTCTGTCTGCGCATTTTTCGGAGTTTTTTTGATGTAACGACGGGCTGTTGCGTAATTCAGTCCCCTTGCTTCACACCATGCCACCGGAGATATACCGGAGCGGGTGTATTCAGCAATATACTCCTGCTGCAACGCCCCCCAGTCCGGTCTGCTCATCAGTTAGTCCTGATTTTTATCCACCCTGAGTAGTTCGCGCAGAGCAAAGGCATCCCCTTTTCTGGCAAGCTTAAACAATGCCGCTCGTAACTCGGCTTCACCTTTCGCTCTGCCCTTACGGATGGCCGCATAAAAATCTGTCATTGCTTCCCGATTTTCTTTCAGTCGGTTCAGATCAACATCCAGAACGTCAGCGATTTGTTGTGCAGTCATCCGGCACGCTGCCAGAGACTCGACTTTCGAATACGGAATCATTTGTCACCCCCATTGATATGCAGGGTGTCTTCTTCCTGTATTTTTCGTGAAGGATTTTTACTGCAGCGTTGTTCCAGGTGACCTGATGGTGAATGCGTTTATGACTGGCACCCATAAGTGAGATTTTTACGCACGACGGCGCATACATGACGGAGTAAAAACTTTTAACGTAGGTTCCGGAATCCAGATACAGCTCGGTCATTCCGCCGCTGTTTTTCTGCGTCTGTTTCTGCCCTAACTGGACAGCACCGATCGTCATAAACAATTCACCACAGCGACCGAGATTCGTGTACGTATTCACATCCTCGTTAATGCGCCCCATGAATGAGAACGGTCGATCAACCGAACAGATAAAGCTGTTCATTGCCTTGCGTTTCACCCACGAAGCATGGCCGCCATTGTCACCAAGAAAATCCCCGCCCTGCGACATAGCGATGGAAAGAGCAGGTATTGATTCGTAGTACGCAAGCATTTCAGAAAGGATCTCATCCAGTTTCCTTATCGGGAAATAGGCCTGGTCATAGTTGCGATCCACCCGAAACTGGAACTCGTGATAATCATCATCGAGCTGAATGAAGTATTTACACCCGACCAGTTTTGCCTGGTCGAAACAGGCATTACGGGCGTAAAAAATTGAGCGGCGGTCACTAAAATTATCGGCTTCGTCAAAACGACTAGCAATATCGGCTTTGGAAAATACCAGCACCTGTTCACCAAATTCAGCCATGTACTGATGCCGTGTCTTATCTTCATCATCAACAACGATAAAAATTTTCCCGGTATAGCCAGCACGACGCAACGTCCGGTAAGTCAGAACTTTGTCCGGTCGCCCGTGAGTCAGAATAAAGGCGCAAAAATCATCACGCATATTCCTCCTCCCCGCCATGCATGATCTCCACCATGCGCTGCGTCATTCGGACAAATCCATTTTCAATAGCCTGCTGATAATCAATGATCACCAGCGCCGACTCCTCGAAAAGGCACTGAATTTCAGCGGGGGCATGAGCGTAATAGTCCGCAATTCTGCTGAAATTAAACACCGTGTGACGTTCTGCTGCACACAGGAGGAATTTCTCAATATCAGGATCAAGGGACGCCGAACGTATCCGGCTGATCAGCTCCTGAGTTTTCGTATCGTCGTACAGTTCACTGATATCCGGTTTACCGCCTGACGGCTCATAAACAGGCGTATCAATTTTCGTCGTATACGGCTCCTCCTCATTTCCTGTACCGGGCAAAACATCCGTCAACAGTTCATCAATTTCTGCCGGGATGAAGCCTGTCAGGGAGACATC
AAAATCAGCATTGATTAGGTCCGACAGCTCCATCCGCAACAGATCTTCATCCCAGCCAGCATTCATCGGCAGGCGATTATCTGCCAGGCGGTACGCCTTTTTCTGCTCATCCGTCAGGCCAGACAGAACAATGACCGGAACAGAATCCATTTTGAGCATTTCAGCCGCCATAACACGACCGTGACCCGCAATAATTTCGCCCTTTTCGTCAATCAGCACCGGATTAGTCCAGCCTAATTGCTTAATACTTTCTACCAGTTGTGCCACCTGCTCAGTACTGTGCGTCCTGGCGTTGTGCGCATACGGTGACAATTCTTGTAATGGGCGATAGACTATCTTTAATTTCTCGCTCATACAGCCTTGCTTTATGAATAAAACGCACCCCAGCAGCCAGTGCTACTGGGGGCGGAGGTGTTGCTGGTAAAGTTAGGTATTGGATCAATGAGCGAGTCAACATAATATTAAACTCACAATTATAAATCAGCCATATATTAGGAGCGCCAAAAAAAACCTGAAAACAATATAATAACAGGATAAATTTCAAGGCGACCAAGAATCATAGCTATGCACATTAAATATTTTGCAATGTCATTAAGCACTCCGAATGACGATGCAGTAGCCCCAAAACCTAATCCCATATTATTAATACATGCAGCCACTGTTGCAAATGATGTAAGAAAATCATATCCCATACCATTTAACACCAGTATAAAAAACACCGTGAAGAGAGTATAAAGAAAAAAGAAACTCCATACAGACCTCATTACACGATCTGTAACTATCTTCCCTCCTACATTTACACTCAACAACGCTCTGGGATGAGAAAGCTGATTTATCTCGTGTTTGCTTTGTTTGAAAAGTATAAGAAATCGAAGTGACTTAATTCCACCACAGGTTGAACCTATACATCCCCCAAAGAAACTTGACAACAGCAAAAACACTATCGTGTGCGTGGGCCAGTTTGCATAATCCTGCGTAGCTAAACCATTATCAGTGAGCATGGAGCTGGCAAGAAAAAACGAATGAATAAAACTTCCATGCAAGTCATACATACCTATATGCCAGACCTGGAAAGAGGTAACAATGATCACCCCTAAGGCTATTAACAGAAAGAAACGAAGTTCAATATCTCTGATTAAAGGTTTTATCGTTTTTCTGCTAATAACAATATACCAAAGAGTGAAGTTGAAAGCCGATAGCAGGGAAAAAGAACCAGCCACCAGCTCAACCAAATAGTTATTAAAATATCCGATACTCTCGCTATGAGTTGAGAAACCACCAAGCGAAACTGTGGAAATCCCGTGACAAATAGCATCAAACAAAGGCATTCCTGCAAGTCTATAACAGACAATACAAGCAATACCTAATAAAGAATAAGTTATCCACAGTGTCCGTGACGTATCGGCCAGGCGGGGAGTGAGTTTGTCATCCTTAAATGGCCCCGGCATTTCTGACTGATAAAGCTTTGCACCACCAATACCCAATAATGGCAATACTGCAACCGCCAGAACAATAACTCCTAAACCACCTATAAAATTTAACTGTGACCGATAGTACAAATATGCCCGAGGTAATGAACTAACATCATCAATTACAGTTGCTCCTGTTGTTGTTATTCCAGAAACCCCTTCAAACAGAGCATCAATGAACGTTAAATTAAGTTCTGAGTCAATCCATAAAGGGAATGCACTAATAACAGAAAACAAAATCCAAAACATTACAATTATAATAAACCCATCACGGGTACGTAATTGAATGCCAGATTTCTTAGTTGTATACCACGCTCCGCCACCAATGCAAAAAAATATAACGAAAGTTATAAAGAAAACGAACAGGCTTTTTTCTTTATAAAACAATGCTACAACCATTGGTGGCAACATTGAAAGACTATAGAGCCAAACCAGGAACCCACACATATGAGTAACAACTCTTACATGAGATGTATTCATATCTAAATATTCTTTCAATTATAACCACCTTGCTGCAATATTATGATTATA
CTGTATAAAATTTAACTCCTCTTAGATCTTACTTCACTGTTCCTTATGAAACAATCATCAAAATGAATCATATTGTAGTTAAGATTTTACTTTAAACACTGCTCGGTTATGTATTGCTGAGCACCTTCAAGTTGGGCCTGCATCATTACCAGTCGTTCCCGGAGGGTGAAATAATCCCGTTCAGCGGTGTCTGCCAGTCGGGGGGAGGCTGCATTATCCACGCCGGAGGCGGTGGTGGCTTCACGCACTGACTGACAGACTGCTTTGATGTGCAACCGACGACGACCAGCGGCAACATCATCACGCAGAGCATCATTTTCAGCTTTAGCATCAGCTAACTCCTTCGTGTATTTTGCATCGAGCGCAGCAACATCACGCTGACGCATCTGCATGTCAGTAATTGCCGCGTTCGCCAGCTTCAGTTCTCTGGCATTTTTGTCGCGCTGGGCTTTGTAGGTAATGGCGTTATCACGGTAATGATTAACAGCCCATGACAGGCAGACGATGATGCAGATAACCAGAGCGGAGATAATCGCGGTTACTCTGTTCATTGCTGACCCCACAAACAGATTTCACGCTCAATCTCACGACGAGTCATGAGACCTTTCCATTGCTTACCGCCAGCATATGTCCAGCGACGTAGCTGATCACATGCGCCTTTGATATCGCCCTGGTTTATTTTGCGAAGAAGCGTCGATGTTCTGAAATTGCCAGCACCCACGTTGTAAACGAATGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCGACAGTGGCAAGGTCTTTATTCAAGAGTGTTTTGCATTCTGCTTTGGTATACGTTTTACCGAGCATGATGTCTTTTCCTGTATGCCCGTGACATACAGTCCATACACCAACAATATCTTTGTATGGTATGTAGCTGACACCTTCCAGACCATCGTTACCACTTGGGCCAGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACGACTTTTCGTAATGATGGAGGCATTATTCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAGACTACCCAGCACACCGATTGCAGCCCACTGTGACGGAGTTACTCTATCGAGCAACTGTAAAAACCAGTAGCCAGCACTGCCTGCGGAGGTGCCGTAGGCAATGCCTGTTGAAATTTTGTCCATGGATTTCATAGCCTCACCTCCGCACGGAACGGATGGCATAGTTATTATGTGTAGGCTTTCAGACACATCAATCAGAGCCTTAATTGATATATATGCTGGAGACGATGCAATATAAAAAGCTCGCCGTAGCGAGCTAATAAAATGTATTTCTCTGATATTATGTTTATTTGTATTAGCTCAGACTTGACATCACAGGTTTCGTATATAGAACATCATCAAATCTGTCAGTTTGCTATGAATGAGATATAGTAATTGAAGAGCTAACCTCGCATGTCAAAGCCAGATTTCTGAAAATCTCTGTAGACTTCCGGATTGTTGAAGGCCGGAAATTTGGCTTTATGAGCTGCGGACTTTATCGCTTCGCAATAGGCTTTATCACCGTTACTGGTAGATATTTTTAACGCCGTGCCATCCTGAGAGAATTCCATATGCAACCTGCATTTTTTCCCTTTCCAGTTATGCGGCTCATCAAGTTTGGCATTAATTGCAGCTCTGATTCCCCGCGCTTGCGCCCCCCATTCATCCTGATCATCCCAGCGTCCTGAACTGCAACTACCTGTAGCAGTAGTTTTGTGGCAATCTGAAGGGTGTAAAGGTGTGCATCCCGCAACAAAACCGACCCAAAAAGTCAACATAACGATTTTCTTTAATCCCACTTCTTGCTCCTCAATCCATTAAAATCTCAGCAATAGTAGTTGTTACGTCCGCCACTGGCTCAGAGCTGACTATCCGCTAAATTTAGCTCAGTG
CCGTAGCTGTGTCAGAACAAACCTAAGCCGAAACCGTTTATTACAAAACAATAAATATCAGGGTTTAAAATCCAGCACCCCATTTTGAAATACTTTATATACTTCCGGCGAAGGGGGGGCAGGTATATCAGCATTCTTTATCGCATTCATCGCTTCACGACATAAATCGAGGTCTCCACTTTCTCTTTTAACCTCCAGTAGAAGGCCATTCGGGGCCATATGCATTCTCAGTGTACACTCTTTTCCTGAATACTTACTCGCATCCCCGAACTGTTTTTCGATGGCGCTCTTGATTTGATGGGCATACAGACGGATATCCTCACTAACATCAGAAGTACGTTCAGATGAACTCACATACTGTGTCTCTATTGCTTTATCGGAGTAATATGATGTACGGTCATGATAATTTGTCGATACAGCATCAGTGCACCCGATAATAATCCCACTAATAATCAACGTAAGAATTGATGCGCTACGAAAACCCATTTTTCCTCACATATGTCATATAGTAAAGGATTATATATACCGTTGTTTTGGACGCTCAAACAGCGAATCAGATCAAATAAAACGCACATTTGTTAACATTTACACAAAGTCTGCGTGGGATATTCTGAAAGAATATCCATAATGTGGAGAGAATCTATTGAAGTGCATGGTGCCGGGTGCCTCCCGGTGAACAAAATGTTCGTGATACCTGTCGGCGACAGAAAAGGTTAATGGTATCACCCCACCGCACAGGGGGATTCACCATGCAGGAGTTTTCTTAGCAAACTCACTGCGCGCCCGGCAACTCCCAACCACATAAAATGCGGAGTTTGTGGCATTTATGCATATAACTCGCAGGAATTATCTTAAAAAACTGATGTCGATCCGGATTAAAAAGAAGCAGGTCATCATCAGATGACTGGAAAAAAGGAAAACAAAAAATACTCATCATACAGTTTTGATTGCAGGGATGAGCCTGCTATGCACAATATGCAGAATATAAGCAAGATAAAAATATGCAGGCATATTATTTCGGATTTTGTTATTAACACAACCTTTTTAATAATCATTTGGCATACAATAAACCAGCCCAAAAAGAACCGCCTAAACAGGCGGTTGGTCAATACAAAGGATGCTTCGTCTTTATTATAGTAATCTGAGGCGTCGGGTGTCTTGTATCAGACAACATATTGTCCCGCTAAACAGCGAATTACAAACCACCCTGCAATGATCTCTCATCTCATTTTATATGAGTTGACGACATCAGGATAACGCATCATCAGCCCCTGCCAAGAAATATCAAAACTCCCGCCAGCAATGTGTTATCACAATATTGTAAAAAAAACACAGCACCGAAACTATAACTGGTCTCTGTTATAATTTGGAGCAGAAAGACCAGTTGCCCAACTAGCAGCATTCTCCCCTGCTTTCCTGACGTAAAAAAACCGCATTAAGCGGTTTTTTTACGATGTCCATGTCTGCAATCCGCCTCGCGATACAGCTTTGCGAAGCATAGCAAAATTGAAGCAGTTTATACGTAAGAAATCAAGCCATTTTCTCAGCAAATGATTCACGCATGGGAATATATAGGGCATACTCAGCAACAGCTAACCAATTAGCAATCCGTTTTTCGCATGTGCTAAAACACCACTCTGGGTGTGCATCATTTAGCAATTCAGCCATTTTGCGCTTAGTCATCCCCCGCCCTTCATAGCGTTGCCGGAGAATGCAAATCAATCCTGGATGCTCTGCCAGCACCTCACTTATGACTCGATCAATACATAACGCCTCTGCATCAGTACAATGCGCCAGCCAGCTCTTTTGCTTGCCGTTGATCATCTCTCGCAAAAACGCTTCCAGCTCAGCTTTCTCTATTCCCGCTTTTTTCATTCTGCGCAGGGCTTCATTAATGGCTGTTTTCGTCAGTTTTTTGGATGCCAACAACTGATTGAACATATTCCCTGACCTGCCACCGCCAATATACGACCAGCGCCCCCACATACGTAGTTTTCCCT
GAATCCAGACACTTTCCAGCGTGGTGAGGCGAAGGTGTTCTCCGCTTTTTCCTGTATTCGTTGGGTAAATCATAAATGACCTTTCTTTCTCCAGATTTCTTGTGTGCGAAAAACCCCTTCAGCATGCATCAGGCGCAATTCTTCTTTGGTGTAATCGCTGGTTTTTACCCGCCCGTCGATTAGATCGTGGCATGAGCTACAGGCTATCGCCGCCTGCATATCATGTGGTTTTGTCGCTGTTCCGCACGTCCCCGCCAGCCTGTAATGCGCCAGCACAGAGGTTTCGGGATTGTGATTGCAGTAGCCAGGGATTCTGATCTGGCACATCTGGCCTTTAGCCGCTTTACGTAAATTCACCATTACGCAAACTCCAGTAGTTGTGCGGCCACATTTTCAACTTCCTCCTGAGAGGAGAATTTACGGAACAGAATCCAGTTCCACAGCACATTCAGTACAGATTTATAAACCTGCTGAAACTCGACTTCGTCCATATTCGCAAAAGCGATGGATTTTGCCCGACGCCCACGGCTACCGTCCGGATAAATATGCTCGGTGTAAAATCCGGCCTGAATGGTTACCCACTCGCGGAAAGCCTCAAACGACTTTAGCAACGCCGTATCCCGGGTTCTACGAGTCGCAACGGTGTTAAGGTATTGCTCTGCGGCATCACTCAGGGCTGGCGTGTGTTCCCGACCAACTGATTCGCACAGGTAATCAACGAAACCAGACAGCAGTTCTCGTTCGCGAGGCGTGATCGCCCCACCGACCGGAGTCCAGTAATCGAATCCCAGTTGCAGGAGTTTGAAAAAACGCTTGTGGAATGCGTAGTTACGCACACGCTTAAAGTCTGCGTGTATCCACTCACCTATTTTGATTTGATGCAGAAAATCGCAACTCTCCGGCGTCGCCGGGAGAAGTAAACCAGAAGAGGTTTGTTTGACCAGTTGTATATGCGCCAT
Protein sequences of DBSCAN-SWA_8 >LR134000|2485052:2493936|2493336_2493936_-|VDY68815.1|DBSCAN-SWA MAHIQLVKQTSSGLLLPATPESCDFLHQIKIGEWIHADFKRVRNYAFHKRFFKLLQLGFDYWTPVGGAITPRERELLSGFVDYLCESVGREHTPALSDAAEQYLNTVATRRTRDTALLKSFEAFREWVTIQAGFYTEHIYPDGSRGRRAKSIAFANMDEVEFQQVYKSVLNVLWNWILFRKFSSQEEVENVAAQLLEFA >LR134000|2485052:2493936|2487468_2488893_-|VDY68806.1|DBSCAN-SWA MCGFLVWLYSLSMLPPMVVALFYKEKSLFVFFITFVIFFCIGGGAWYTTKKSGIQLRTRDGFIIIVMFWILFSVISAFPLWIDSELNLTFIDALFEGVSGITTTGATVIDDVSSLPRAYLYYRSQLNFIGGLGVIVLAVAVLPLLGIGGAKLYQSEMPGPFKDDKLTPRLADTSRTLWITYSLLGIACIVCYRLAGMPLFDAICHGISTVSLGGFSTHSESIGYFNNYLVELVAGSFSLLSAFNFTLWYIVISRKTIKPLIRDIELRFFLLIALGVIIVTSFQVWHIGMYDLHGSFIHSFFLASSMLTDNGLATQDYANWPTHTIVFLLLSSFFGGCIGSTCGGIKSLRFLILFKQSKHEINQLSHPRALLSVNVGGKIVTDRVMRSVWSFFFLYTLFTVFFILVLNGMGYDFLTSFATVAACINNMGLGFGATASSFGVLNDIAKYLMCIAMILGRLEIYPVIILFSGFFWRS >LR134000|2485052:2493936|2490488_2490884_-|VDY68810.1|DBSCAN-SWA MGLKKIVMLTFWVGFVAGCTPLHPSDCHKTTATGSCSSGRWDDQDEWGAQARGIRAAINAKLDEPHNWKGKKCRLHMEFSQDGTALKISTSNGDKAYCEAIKSAAHKAKFPAFNNPEVYRDFQKSGFDMRG >LR134000|2485052:2493936|2492241_2492379_-|VDY68812.1|DBSCAN-SWA MLLVGQLVFLLQIITETSYSFGAVFFLQYCDNTLLAGVLIFLGRG >LR134000|2485052:2493936|2485617_2486550_-|VDY68804.1|DBSCAN-SWA MRDDFCAFILTHGRPDKVLTYRTLRRAGYTGKIFIVVDDEDKTRHQYMAEFGEQVLVFSKADIASRFDEADNFSDRRSIFYARNACFDQAKLVGCKYFIQLDDDYHEFQFRVDRNYDQAYFPIRKLDEILSEMLAYYESIPALSIAMSQGGDFLGDNGGHASWVKRKAMNSFICSVDRPFSFMGRINEDVNTYTNLGRCGELFMTIGAVQLGQKQTQKNSGGMTELYLDSGTYVKSFYSVMYAPSCVKISLMGASHKRIHHQVTWNNAAVKILHEKYRKKTPCISMGVTNDSVFESRVSGSVPDDCTTNR >LR134000|2485052:2493936|2489063_2489528_-|VDY68807.1|lysis|DBSCAN-SWA MNRVTAIISALVICIIVCLSWAVNHYRDNAITYKAQRDKNARELKLANAAITDMQMRQRDVAALDAKYTKELADAKAENDALRDDVAAGRRRLHIKAVCQSVREATTASGVDNAASPRLADTAERDYFTLRERLVMMQAQLEGAQQYITEQCLK >LR134000|2485052:2493936|2493046_2493337_-|VDY68814.1|DBSCAN-SWA MVNLRKAAKGQMCQIRIPGYCNHNPETSVLAHYRLAGTCGTATKPHDMQAAIACSSCHDLIDGRVKTSDYTKEELRLMHAEGVFRTQEIWRKKGHL >LR134000|2485052:2493936|2485430_2485640_-|VDY68803.1|DBSCAN-SWA 
MTAQQIADVLDVDLNRLKENREAMTDFYAAIRKGRAKGEAELRAALFKLARKGDAFALRELLRVDKNQD >LR134000|2485052:2493936|2491034_2491463_-|VDY68811.1|DBSCAN-SWA MGFRSASILTLIISGIIIGCTDAVSTNYHDRTSYYSDKAIETQYVSSSERTSDVSEDIRLYAHQIKSAIEKQFGDASKYSGKECTLRMHMAPNGLLLEVKRESGDLDLCREAMNAIKNADIPAPPSPEVYKVFQNGVLDFKP >LR134000|2485052:2493936|2485052_2485427_-|VDY68802.1|terminase|DBSCAN-SWA MSRPDWGALQQEYIAEYTRSGISPVAWCEARGLNYATARRYIKKTPKNAQTEMCKTAQKSAQKKSAQTAQKRNEKSQEKKPVFDAGLNEGDAEEFSFCPDEFGISDQQAKFAMLVAQGKKPIAR >LR134000|2485052:2493936|2492507_2493050_-|VDY68813.1|DBSCAN-SWA MIYPTNTGKSGEHLRLTTLESVWIQGKLRMWGRWSYIGGGRSGNMFNQLLASKKLTKTAINEALRRMKKAGIEKAELEAFLREMINGKQKSWLAHCTDAEALCIDRVISEVLAEHPGLICILRQRYEGRGMTKRKMAELLNDAHPEWCFSTCEKRIANWLAVAEYALYIPMRESFAEKMA >LR134000|2485052:2493936|2486542_2487331_-|VDY68805.1|DBSCAN-SWA MSEKLKIVYRPLQELSPYAHNARTHSTEQVAQLVESIKQLGWTNPVLIDEKGEIIAGHGRVMAAEMLKMDSVPVIVLSGLTDEQKKAYRLADNRLPMNAGWDEDLLRMELSDLINADFDVSLTGFIPAEIDELLTDVLPGTGNEEEPYTTKIDTPVYEPSGGKPDISELYDDTKTQELISRIRSASLDPDIEKFLLCAAERHTVFNFSRIADYYAHAPAEIQCLFEESALVIIDYQQAIENGFVRMTQRMVEIMHGGEEEYA >LR134000|2485052:2493936|2489524_2490022_-|VDY68808.1|DBSCAN-SWA MPPSLRKVVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIMLGKTYTKAECKTLLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSTLLRKINQGDIKGACDQLRRWTYAGGKQWKGLMTRREIEREICLWGQQ >LR134000|2485052:2493936|2490021_2490237_-|VDY68809.1|lysis|DBSCAN-SWA MKSMDKISTGIAYGTSAGSAGYWFLQLLDRVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE |
14 | Escherichia_phage(44.44%) | terminase,lysis | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
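The protein FASTA headers in the prophage regions above appear to follow a pipe-delimited layout: contig, region `start:end`, gene `start_end_strand`, protein accession, an optional comma-separated keyword field, and the predicting tool. The sketch below parses one such header in Python. The field interpretation is inferred from the headers shown on this page, and the names `ProteinHeader` and `parse_protein_header` are hypothetical.

```python
from typing import NamedTuple, Tuple


class ProteinHeader(NamedTuple):
    contig: str
    region: Tuple[int, int]      # prophage region start and end
    start: int                   # gene start coordinate
    end: int                     # gene end coordinate
    strand: str                  # '+' or '-'
    protein_id: str
    keywords: Tuple[str, ...]    # empty when the header has no keyword field
    source: str                  # predicting tool, e.g. 'DBSCAN-SWA'


def parse_protein_header(header: str) -> ProteinHeader:
    """Parse a header such as
    '>LR134000|2485052:2493936|2489063_2489528_-|VDY68807.1|lysis|DBSCAN-SWA'.

    Assumption: headers have 5 fields, or 6 when a keyword field is present.
    """
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")
    contig, region, gene, protein_id = fields[:4]
    keywords = tuple(fields[4].split(",")) if len(fields) == 6 else ()
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = gene.split("_")
    return ProteinHeader(contig, (region_start, region_end),
                         int(start), int(end), strand,
                         protein_id, keywords, fields[-1])


if __name__ == "__main__":
    h = parse_protein_header(
        ">LR134000|2485052:2493936|2489063_2489528_-|VDY68807.1|lysis|DBSCAN-SWA")
    print(h.protein_id, h.strand, h.keywords)
```

Parsed coordinates make it straightforward to check that each gene lies inside its region, for example `h.region[0] <= h.start <= h.end <= h.region[1]`.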
DBSCAN-SWA_9 |
2499003 : 2512534
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >LR134000|2499003:2512534|DBSCAN-SWA TCTATAAATACCGAAGATTTCCTTGATAAAATGCCAGTACACGTTTCATAACTTCACTCTTCCGGCACTCGCGACAGATTATGTTCTGACGCCTGTCATAGCGGCGTATTTCTCCGTCTGGTAATGACCAGATAAGGTCAGGATCAACCACAACCGTTTTTTTCACCTTTGCCCTGGATAGTTTTTTGCGGGCGTTTTTCCAATCCTTACGAGCCTGTTCAGACGGGAATAACCCATAGCCAGAGTTGTATACATCGCCACTGGCAACCAACTCTCTGGCAAGAATGCTCATCAGATATCTTGTCGCACCTGTCTTGGCCTCCAGTTGCCTTAACGTCTCGCGCCCACTCTGGCGTACCAACTCAACAACCTGCTCCTTAATTTTTTCACGCTCTTCCTGTGTTGCCGCTTCCTGTACTGGTAACGCAACACCGGCTGGCTGAGGAAAGGCTTTACCATCAGTTTCCGCTACCGATGCAGCTTTCGGCTCTGCTGGTAAATTATCGCCCGGCATGCAGTAACGAAATTTACCGTTCTGGTTTACGCGAATCAGGCGTCCTTTGCTGATTGCCATTGCCAGCGTTGAAGCAACTTTGCGGGATGTTGTACCGAACACCCGGCGTTAAGGGGAGAGATAAGATGGTGCATTACGAAGTAGTTCAGTATTTGATGGATTGTTGCGGTATCACTTACAACCAGGCTGTGCAGGCTTTACGCAGCAACGACTGGGATCTCTGGCAGGCAGAAGTCGCTATACACAGCAACAAGATGTGAGATTCGCAAAATGCAAAAAATCGACCTCGGCAACAACGAATCCCTGGTGTGCGGCGTGTTCCCCAACCAGGATGGAACGTTCACTGCCATGACGTATACCAAAAGCAAAACATTTAAAACCGAAACTGGTGCGCGCCGATGGTTGGAGAAGCACACAGTAAGCTAACGATTAAAACGTCTACTCCTGCTGTTCCAGAATAACTTCATAAAATGGGAGTATTTTTCGGTGACGAGATAATAAGAACAGTTTGCGCTATCACTCTGATGTTGAATGATGCCCTTCCGTTCTAATTTTTTCATAACCGGGTTACGGCAAGGAGAAGTGATAATAAGATTTCCTGTTTTAAGGAAATCTTTAAATACAGCGATTTCTTTCTCAGATAAACGAAGCAATACTCGTTGCTCTGGTAGTAATGAATAATGCTTTTGAATATGTGCTCGCAATCTTGAGAAGGAAATGGCGACCACGAAAGAAAAGGCAAAAACGATAATCTGAAAGAGCCAAGGTATTTCAGTATAAGCATTGAATGCGACAGTAAACTCTTTCGGTATCAGCCAGAGAGTGAGACCAAAAATGATAATCGTATACATAAGTATTTCGAGTGGCTCGTTAGCAAAAAGTTTCAACAATGGAGTAAATACATCCAACATATCAATAACTCTCAACTGTAAGGGTATTGAAATGTTAACACAAGCTCTCGCTGTAGGGGTATAGCCGAGACCACCGAAGCCCGGAGGTGGTGAAATAAAACCGGGCACAACACGAAGGCGCATTTCCGATATCCATAAAGAGTCGGTCTTGTCTGTTAAATTTAAATGGTGGGAGTGCGCCTCCGGTTGTAAATAACGACATTGCTGTGTGTAGTCCTGGCGGCATCAGTTTTTTTCTTGAAGTTCGGCTGATGTCCGCCCTTTTTAAAGTGAATTTTGTGATGCGGTGAATGCGGCTAAGCGCACGTGGCACAGTTAAAAGTCATGTTAGTCCTTATTGGTTTGGGTGGGAAAGCCGACTGTAATTGTTAACTGGTTGCAGTCACCTGGAGGCACCAGGCACCGCATCAACAAAGTTCATTTGTAAAAATGGAGATAATTATGATTGCACATCACTTCGGAACTGATGAAATACCACGTCAGTGTGTGACTCCTGGCGAT
TATGTTCTTCATGAAGGCCGGACATATATTGCCTCGGCAAACAATATTAAAAAGCGAAAACTATATATTCGTAACCTGACCACAAAAACATGCATTACTGACCGCATGATTAAAGTCTTCCTCGGTCGTGATGGTTTACCTGTAAAGGCGGAGTCATGGTGATGACTAAGAAAATAAAATGTGCTTACCACCTTTGCAAAAAAGACGTTGAAGAAAGCAAAGCTATTGAAAGAATGCTTCACTTCATGCACGGGATTTTATCAAAAGACGAACCGAGAAAATATTGCAGTGAAGCTTGTGCCGAAAAAGACCAGATGGCACATGAACTTTAATTAATTGACTATTCGAAACTGAATTTATGCCAGAAATGGCAGGTATTCGCTCAACCTTAATTAAGGAGAAAAACATGATTACCAATTATGAAGCCACTGTTGTAACTACCGATGACATTGTTCACGAGGTGAATCTGGAAGGAAAGCGCATTGGCTACGTAATTAAAACAGAAAATAAAGAAACCCCATTCACTGTGGTTGATATCGATGGTCCATCAGGCAACGTAAAAACACTTGATGAAGGTGTCAAAAAAATGTGCCTGGTGCATATCGGAAAGAATCTGCCCGCAGAAAAAAAAGCCGAATTTCTGGCAACTCTAATTGCAATGAAATTAAAAGGTGAAATCTGAAAGAAATAGCCTGCGTATGGCGCAGGCTATGAACAGTGTGTATCCGGCAAGATCATTCACTGAACAAAACGAATTTTAATCTGAGTTGAGGTTAAAAAACAATGAGCACAAAACCACTCTTCCTGTTACGGAAAGCGAAAAAATCATCCGGTGAACCTGACGTCGTCCTGTGGGCAAGCAACGATTTTGAATCGACCTGTGCCACTCTGGACTACCTGATCGTTAAGTCAGGTAAAAAACTGAGCAGCTATTTTAAAGCTGTTGCCACGAATTTTCCTGTCGTTAATGACCTGCCCGCTGAAGGTGAGATCGATTTTACCTGGAGTGAACGCTATCAACTCAGCAAAGACTCCATGACATGGGAACTAAAACCGGGAGCAGCACCAGACAACGCTCACTATCAAGGCAATACCAACGTCAACGGCGAAGACATGACTGAGATTGAGGAGAATATGCTACTCCCAATTTCTGGCCAGGAACTGCCCATTCGTTGGCTTGCTCAACACGGCAGCGAAAAACCGGTAACGCACGTTTCACGCGACGGACTCCAGGCATTACACATTGCTCGGGCTGAAGAACTACCGGCTGTTACTGCCCTGGCTGTTTCCCACAAAACCAGCCTGCTCGACCCGCTGGAAATTCGCGAACTCCACAAACTGGTTCGTGACACTGACAAAGTTTTCCCTAATCCTGGTAATTCAAACCTGGGACTGATAACTGCTTTTTTCGAAGCATACCTGAACGCTGACTACACCGATCGAGGACTGCTGACAAAAGAGTGGATGAAGGGTAATCGTGTTTCACACATCACTCGCACGGCTTCCGGTGCTAATGCTGGCGGCGGAAACCTCACCGATCGCGGCGAAGGTTTCGTACACGATCTGACGTCACTGGCGCGGGACGTAGCCACTGGCGTACTGGCCCGTTCAATGGATCTGGACATCTATAACCTTCATCCGGCACACGCTAAACGCATTGAGGAAATTATCGCTGAAAATAAACCGCCCTTTTCTGTTTTCCGCGACAAATTCATCACCATGCCTGGCGGGCTGGATTATTCCCGCGCCATCGTGGTTGCGTCCGTAAAAGAAGCACCAATTGGGATCGAGGTCATCCCCGCGCACGTCACTGAATATCTGAACAAAGTACTGACTGAAACCGATCATGCCAACCCTGATCCGGAAATCGTGGATATTGCCTGCGGTCGCTCCTCTGCCCCGATGCCGCAGCGAGTAACAGAAGAAGGAAAACAGGATGATGAAGAAAAACCGCAACCATCTGGAACAACGGCAGTTGAACAGGGAGAGGCTGAAACAAT
GGAACCGGACGCAACTGAACATCATCAGGACACGCAGCCGCTGGATGCTCAGTCACAGGTAAATTCTGTTGATGCGAAATATCAGGAACTGCGGGCAGAACTCCATGAAGCCCGGAAAAACATTCCATCAAAAAATCCTGTCGATGCCGATAAATTGCTTGCTGCATCACGTGGTGAATTTGTTGACGGAATTAGCGACCCGAACGATCCGAAATGGGTAAAGGGGATCCAGACTCGCGATTGTGTGTACCAGAACCAGCCAGAAACGGAAAAAACCAGCCCGGATATGAATCAACCTGAGCCAGTAGTGCAACAGGAACCGGAAATAGCCTGCAATGCCTGCGGCCAGACTGGCGGGGATAACTGCCCTGACTGTGGTGCGGTGATGGGCGACGCAACATACCAGGAAACATTCGATGAAGAGAGTCAGGTTGAAGCTAAGGAAAATGATCCGGAGGAAATGGAAGGCGCTGAACATCCGCACAATGAGAATGCTGGCAGCGATCCGCATCGCGATTGCAGTGATGAAACTGGCGAAGTCGCAGATCCCGTAATCGTAGAAGACATAGAGCCAGGTATTTATTACGGAATTTCGAATGAGAATTACCACGCGGGTCCCGGTGTCAGTAAGTCTCAGCTCGATGACATTGCTGATACTCCGGCACTGTATTTGTGGCGTAAAAATGCCCCCGTGGACACCACAAAGACAAAAACGCTCGATTTAGGAACCGCTTTCCACTGCCGGGTACTTGAGCCGGAAGAATTCAGTAACCGCTTTATCGTAGCACCTGAATTTAACCGCCGGACAAACTCCGGAAAAGAAGAAGAGAAAGCGTTTCTGAGGGAATGCGCAAGCACAGGAAAAACGGTTATCACTGCCGAAGAAGGCCGGAAAATTGAACTCATGTATCAGAGCGTTATGGCTTTGCCGCTGGGGCAATGGCTTGTTGAAAGCGCCGGACACGCTGAATCATCAATTTACTGGGAAGATCCTGAAACAGCAATTTTGTGTCGGTGCCGTCCGGACAAAATTATCCCTGAATTTCACTGGATCATGGACGTGAAAACTACGGCGGATATTCAACGATTCAAAACCGCTTATTACGACTACCGCTATCACGTTCAGGATGCATTCTACAGTGACGGTTATGAAGCACAGTTTGGAGTGCAGCCAACTTTCGTTTTTCTGGTTGCCAGCACAACTATTGAATGCGGACGTTATCCGGTTGAAATTTTCATGATGGGCGAAGAAGCAAAACTGGCAGGTCAGCTGGAATATCACCGCAATCTGCGAACCCTGGCTGACTGCCTCAATACCGATGAATGGCCAGCTATTAAGACGTTATCACTGCCCCGCTGGGCTAAGGAATATGCAAATGACTAAGCAACCACCAATCGCAAAAGCCGATCTGCAAAAAACTCAGGGAAACCGTGCACCAGCAGCAATTAAAAATAACGACGTGATTAGTTTTATTAACCAGCCATCAATGAAAGAGCAACTGGCAGCAGCTCTTCCACGCCATATGACGGCTGAACGTATGATCCGTATCGCCACCACAGAAATTCGTAAAGTTCCGGCGTTAGGAAACTGTGACACTATGAGTTTTGTCAGTGCAATCGTACAGTGTTCACAGCTCGGACTTGAGCCCGGTAGCGCCCTCGGTCATGCATATTTACTGCCTTTTGGTAATAAAAACGAAAAGAGCGGTAAAAAAAACGTTCAGCTAATCATTGGCTATCGCGGCATGATTGATCTGGCTCGCCGTTCAGGTCAAATCGCCAGCCTGTCAGCCCGTGTTGTCCGTGAAGGTGACGAGTTTAATTTCGAATTTGGCCTTGATGAAAAGTTAATACACCGCCCAGGAGAAAACGAAGATGCCCCTGTTACCCACGTCTATGCTGTCGCAAGACTGAAAGACGGAGGTACTCAGTTTGAAGTTATGACGCGCAAACAGATTGAGCTGGTGCGCAGCCAGAGTAAAGCTGGTAATAACGGGCCG
TGGGTAACTCACTGGGAAGAAATGGCAAAGAAAACGGCTATTCGTCGCCTGTTCAAATATCTGCCCGTATCAATTGAGATCCAGCGTGCAGTATCAATGGATGAAAAGGAACCACTGACAATCGATCCTGCAGATTCCTCTGTATTAACCGGGGAATACAGTGTAATCGATAATTCAGAGGAATAATTCAGCCTGGCGGTGTAATGCACCGCCAACTTGAAATATTTTTATGAGAAAAATTATGAGATATGACAATGTTAAACCATGTCCATTTTGTGGTTGTCCATCAGTAACGGTGAAAGCCATTTCAGGATATTACCGAGCGAAGTGTAACGGATGCGAATCCCGAACCGGTTATGGTGGAAGTGAAAAAGAAGCACTCGAACGATGAAATAAACGAACCACTGGAAATAATAATGGAGGTGTTCATGTATAAAATTACCGCCACTATTGAAAAGGAAGGTGGCACTCCTACTAACTGGACAAGATATTCAAAATCTAAACTAACGAAATCAGAATGCGAAAAAATGCTCTCAGGTAAAAAAGAAGCAGGCGTTTCCAGAGAGCAGAAAGTAAAACTGATAAATTTTAATTGCGAGAAACTTCAGTCCTCGTGAATTGCATTGTATTCAAATTAAAACTTCATAACTGATTATTAATAATCAACATCGGGCGTCAATTTAAGTCTAACATTGGCGCCTGCCAGAGGTGATGCGATGGCACAAGTAATCTTTAATGAAGAGTGGATGGTTGAATACGGCCTGATGCTTCGCACTGGTCTGGGGGCCAGACAAATTGAAGCATACCGCCAGAACTGTTGGGTGGAAGGCTTCCACTTCAAACGAGTATCTCCTTTAGGGAAGCCAGACAGTAAGCGAGGGATTATCTGGTATAACTATCCAAAGATAAATCAGTTTATCAAAGACTCATGATATGTCTAAATTACCAACAGGTGTCGAGATTCGAGGTAAATACATTCGCATCTGGTTCATGTTTCGAGGAAAACGATGTCGGGAAACATTGAAAGGCTGGGAGGTTACTAACAGTAACATTAAAAAAGCCGGGAATTTAAGAGCGTTGATAGTTCATGAAATCAATTCCGGTGAGTTTGAGTATTTAAGACGTTTTCCCCAGTCCAGCACTGGGGCAAAAATGGTGACAACGAGGGTCATAAAAACGTTCGGGGAGCTTTGTGATATCTGGACAAAAATTAAAGAAACAGAGTTAACAACAAACACAATGAAGAAAACGAAATCACAATTAAAAACACTCAGGATAATAATTTGTGAGAGTACTCCGATATCGCATATTCGTTATAGCAATATCTTAAACTACCGGAATGAACTGCTGCATGGAGAAACGCTTTACCTGGATAATCCAAGATCCAACAAAAAAGGAAGAACCGTGCGCACAGTTGATAACTATATCGCCCTGCTCTGTTCGTTGTTACGTTTTGCGTATCAGTCGGGATTTATATCAACCAAACCATTTGAAGGAGTAAAGAAATTACAGAGAAACAGAATAAAGCCTGACCCGTTATCTAAAACAGAATTCAATGCATTAATGGAAAGTGAAAAAGGACAGAGCCAGAACTTGTGGAAATTTGCCGTATACTCCGGGCTTCGTCACGGGGAACTGGCTGCTCTGGCGTGGGAGGATGTGGATTTCGAGAAGGGAGTTGTGAATGTCAGAAGAAACCTGACGATACTGGATATGTTCGGTCCCCCAAAAACAAATGCGGGGATCCGGACGGTAACACTACTGCAGCCTGCTCTTGAAGCACTGAAGGAGCAATACAAACTGACCGGGCATCATCGCAAAAGCGAAATCACTTTTTATCATCGGGAGTACGGCAGAACCGAAAAGCAAAAACTGCATTTTGTTTTCATGCCCAGAGTGTGTAACGGAAAACAGAAACCTTATTACTCGGTAAGCAGTTTGGGTGCGAGATGGAATGCAGCAGTAAAACGTGCTGGTATTCGCCGCCGTAATCCGTACCA
TACGCGGCATACTTTTGCCTGCTGGCTGTTGACGGCAGGAGCGAACCCGGCGTTTATAGCCAGCCAGATGGGGCATGAAACTGCGCAGATGGTGTATGAAATTTACGGTATGTGGATTGATGACATGAACGACGAACAGGTAGCCATGTTGAATGCGCGGTTATCGTAGTTGCAAAGTTTGCCCCCAATTTGCCCCATTTAGTACCAGAGAACTGAAATAATGAAAGAAAATCAACAAATTACAAAGAAAGAACAATACAACCTGAACAAATTACAAAAACGTCTGCGTCGTAACGTGGGCGAAGCCATTGCTGACTTCAATATGATTGAAGAAGGCGATCGCATCATGGTTTGCCTCTCCGGGGGTAAAGACAGCTATACCATGCTGGAGATTCTGCGCAATTTGCAGCAAAGCGCGCCAATCAATTTTTCGCTGGTGGCTGTTAACCTCGATCAAAAGCAACCGGGCTTCCCGGAACACGTTCTGCCCGAGTATCTTGAAAAGCTGGGCGTTGAGTACAAGATTGTTGAAGAGAATACTTACGGTATCGTGAAAGAGAAGATTCCAGAGGGCAAAACCACTTGCTCACTGTGTTCTCGCCTTCGTCGCGGTATCCTTTATCGTACCGCAACGGAACTGGGGGCGACGAAGATCGCGTTGGGTCACCATCGTGACGATATCCTGCAAACGTTGTTCTTAAATATGTTCTACGGCGGTAAGATGAAAGGTATGCCTCCGAAACTGATGAGCGATGATGGCAAACATATCGTTATTCGTCCGCTGGCCTACTGCCGAGAGAAAGATATTCAGCGATTTGCCGATGCAAAAGCGTTCCCGATTATTCCGTGCAACCTGTGCGGTTCACAGCCTAACCTGCAACGTCAGGTGATTGCTGACATGTTGCGTGACTGGGATAAACGTTATCCAGGGCGTATCGAGACGATGTTCAGCGCGATGCAGAATGTGGTGCCGTCGCATCTGTGCGATACCAACCTGTTCGATTTCAAAGGCATCACCCACGGTTCTGAAGTGGTTAACGGGGGTGATCTGGCGTTTGATCGCGAAGAGATCCCACTACAACCGGCGGGCTGGCAGCCAGAAGAAGATGAAAATCAGTTGGATGAGTTACGGCTGAATGTGGTTGAAGTGAAATAACCAGGATAGCGCCCGATGCGCAAGCGTATCGGGCTACTCTTATGGAGGCCGGATAAGACGCGGCCAGCGTCGCATCCGGCAATCCCGAATAAGATGTTTACTCTTGCACCCGGCAATTCAACATTTCATTATTTTAATAACCGCACCCGGCACGTTTTTCCTTTAATCTTCCCGCCCTGTAACTGTTTCCATGCTTTATGAGCAACAGCCTGACGGACCGCGACATAGACATGCGCCGGATGCACGGCGATTTTGCCAATATCTGCGCCATCAAGCCCGATATCTCCTGTCAGTGCACCTAATACATCACCCGGGCGCATTTTGGCTTTTTTCCCGCCATCGATACACAACGTTGCCATTTCTGCTTCCAGCGTCGCAATGGAACTATTAGCTGGCGGCGTTTGCCAGTTAAGTTTTATCTGCAACATGTCAGAAATGATATTGGCCCGCTGTGCTTCTTCCGGAGCACAGAAACTGATCGCCAGACCGCTATTTCCTGCACGAGCTGTACGACCGATGCGATGTACATGAACTTCAGGGTCCCACGCCAGCTCAAAGTTCACCACCAGCTCAAGCGATTTAATATCCAGACCACGCGCAGCAACATCAGTCGCGACCAGTACACGGGCGCTACCGTTAGCAAAACGTACCAGGGTCTGATCGCGATCGCGTTGCTCCAAATCGCCGTGTAATGACAATGCACTTTGCCCTACTTCATTCAGCGCGTCGCAGACAGCCTGGCAATCTTTTTTGGTATTGCAAAACACCACGCAAGAGGATGGCTGATGCAAGCTTAATAACCGTTGCAACAGAGGAATTTTGCCTTTGCTGGATGTCTCATAAA
ATTGTTGTTCAATGGGTGGCAAAGCATCTGTTGAGTCAATTTCAATCGCCAAAGGATCGCGTTGCACTCGTCCGCTGATTGCAGCGATGGCTTCCGGCCAGGTTGCCGAAAACAGAAGCGTCTGTCGAGATGCAGGCGCAAAACGGATGACATCATCAATGGCATCGCTAAATCCCATATCCAGCATGCGGTCGGCCTCATCCATCACCAGCGTATTCAACGCATCCAGTGATACCGTGCCTTTTTGCAGGTGATCCAGCAAACGCCCCGGCGTTGCCACGATAATATGCGGCGCATGTTGCAACGAATCACGCTGCATACCGAACGGTTGACCACCGCACAACGTCAAAATTTTGGTATTTGGCAGAAAACGCGCCAGCCGACGCAATTCACCTGCCACCTGATCCGCCAGTTCACGCGTAGGACACAGCACTAAAGCCTGGGTTTGAAATAGCGACGCATCAATTTGCTGTAACAAGCCGAGGCCAAAAGCCGCCGTTTTGCCGCTGCCGGTTTTCGCCTGCACGCGAACATCTTTTCCGGCAAGGATCGCCGGAAGCGCGGCGGCCTGCACCGGCGTCATGGTTAAATAACCCAACTCATTAAGGTTCGTGAGTTGGGCGGGAGGCAAAACATTCAGGGTAGAAAAAGCGGTCACAATCTATTCTCGTGGTCATCGACGCAAAGTTAGCAGGCGCGTATCCTCGCAGATCTACGCTCACGATGCGACAATTTAATCGGTTCTTCATCGGGTGGTGGGTCAGGCATGGGTTGCGGGCGAGGGATCGGATCGGGCACTGGAACAGGATCGCCAGGAATCGGTTCAGGGACAGGAATTTGCAAATAAATAAGTGTCGTCATATTTCCCTCTGGTCATTGGGTGGACTCTTAAAGGGTAGACGCTGATAAATAACAGGCAAAAAAAAGCCGACTCATCAAAGTCGGCGTCGTACGAATCAATTGTGCTATGCAGTAATTCAAAAAAGGAAGTAAGACAATATGGAGCGCAACGCCCATCGCTTGACGTTGCATTCACCTGCAAGAGAGATATTGCCCTGAATGGGTAGAGAGTTTATTGACTTCGCTCAAACTTTGCGGCGTTTTTGTATACAGACAGCCGGAAAAATTGCTTTTGTTACAACCATTTACTACGATGCAACCATAAAGCAACACCACCAATAAGAACAACTAACAGAATACAAAAAATTGAAAATCCGAATTGCCACCCGCCGCCAGGGATCCCACCAAGGTTGACGCCAAATAACCCGGTCAGAAAGGTACTGGGTAAAAAGACCATTGCCATCAACGACATTGTATAGGTACGACGAGCTAAATTTTCCTGCATCACCTGAGCGATTTCATCCGCCATCACGCCAGTCCGTGCTATACAGGCGTCGATTTCGTCAAGGCCGCGCCCAAGGCGATCGGCAATATCCTGCATCCGACGGCGTTGGTCATCGCTCATCCACGGCAAACGTTCACTGGCAAGACGAGCATAAACATCACGTTGCGGTGCCATATAGCGACGCATCACAATTAATTGTTTGCGCAGCAGAGCCAGGAATCCACGCGGTGGAATTTGCTGATCAAGGAGATTATCTTCAAGGTCGATAATTTTATCGTGCAGCTGCTCGATAAATTCACTGGAATGATCGGTCAACGCATCGCACACATCCACCAGCCATCCCCCGCAATCGGTCGGACCCGTGCCCTCTTCCAGATCGCTCACCACATCGTCCAGCGCCAGCACTTTGCGTTGTCGGGTCGAAACAATTAACCGCCCGTCCATATATACACGCATGGCGACCAGTTGATCGGGGCGTTCATCGGTGCTGCCGTTTATACAGCGCAATGTAATCAGCGTGCCTTCACCGAGACGGCTGACTCGGGGACGCGTGCTCTCGCCCGCCAGCGCATCACGTACGTTATTGGGAAGCAGCGGTGTTGTCGCCAGCCATTGGGCGCTATCATGGTGTACATAATTAAGGTGGAGCCAACAG
GGATGCGCTTCATCAATCACATCTGTATTTTCCAGCGGTTTAACGCCGCCTCTACCATCCAGCATCCAGGCAAATACTGCATCCGGGACATTAACGTCCGATCCCTTAATCGCTTCCACAGTGCCTCCATCATCAACGCATTATTTTGTAGTCTAGCCTTCTGGCCCTGTTACGCAACATCTCATCACCCCATTACCCTGAAATGATTAATAAAATTCTGTCTAAATTGAATACAAAAAGCAAAATGCTTTTCCGTATACAAACCGTGTGAAGTGTTAAATAGCGTCTATCATTATCAGAATTATCTGATCATATGACGTGGCTTTTTTGCGATCGGATAGCAACAAAAATTGATAAAAATAACGGGATCTCAATGATTACGCACAACTTCAATACCCTGGACTTACTCACCAGTCCTGTCTGGATCGTTTCGCCCTTTGAGGAACAGTTAATTTATGCCAATAGCGCGGCGAAACTGTTGATGCAAGACCTCACGTTTAGTCAGCTACGAACCGGACCCTATTCCGTCTCCTCACAAAAAGAACTGCCGAAATACCTCTCCGATCTGCAAAACCAACACGATATTATCGAAATCCTCACTGTTCAGCGTAAAGAAGAGGAAACAGCATTGAGCTGTCGGCTTGTTTTGCGAAAGCTGACAGAAACAGAACCGGTGATTATTTTCGAAGGTATCGAAGCGCCGGCAACGCTGGGTTTAAAAGCCAGTCGCTCGGCAAATTATCAGCGCAAAAAACAAGGTTTTTATGCGCGCTTTTTTCTGACTAACTCTGCACCAATGTTGTTGATTGACCCGTCACGAGATGGACAAATCGTCGATGCTAACCTCGCCGCGCTCAATTTCTATGGTTATAACCATGAAACGATGTGCCAGAAACATACCTGGGAAATAAATATGCTCGGGCGTCGCGTCATGCCTATCATGCATGAAATCTCGCATTTACCCGGTGGTCATAAACCTTTGAATTTTGTTCATAAACTGGCGGATGGTTCGACTCGTCATGTGCAGACCTATGCCGGACCGATTGAAATTTATGGCGACAAGCTCATGTTATGTATTGTGCATGATATTACTGAGCAAAAACGGCTGGAGGAGCAGCTGGAACATGCTGCTCACCATGACGCGATGACCGGATTACTGAATCGGCGACAGTTTTATCACATTACGGAACCAGGCCAAATGCAGCATCTCGCCATCGCTCAGGATTACAGCTTGTTGCTCATCGACACCAATCGTTTTAAACACATTAACGATCTCTATGGGCATTCTAAAGGTGATGAGGTGTTATGCGCCCTCGCCCGCACCCTCGAAAGTTGCGCTCGCAAAGGCGATTTGGTGTTTCGTTGGGGAGGCGAAGAGTTTGTCTTATTGCTACCAAGAACCCCACTGGATACCGCGCTTTCGCTGGCTGAAACTATCCGCGTAAGCGTGGCAAAAGTGAGTATTTCGGGCTTACCACGCTTTACCGTCAGCATTGGTGTGGCGCATCACGAAGGAAATGAAAGCATCGATGAACTGTTTAAACGCGTTGATGATGCTTTGTATCGGGCGAAAAATGATGGACGCAACCGCGTGCTGGCGGCATAA
Protein sequences of DBSCAN-SWA_9 >LR134000|2499003:2512534|2505356_2505545_+|VDY68829.1|DBSCAN-SWA MYKITATIEKEGGTPTNWTRYSKSKLTKSECEKMLSGKKEAGVSREQKVKLINFNCEKLQSS >LR134000|2499003:2512534|2499786_2499942_+|VDY68822.1|DBSCAN-SWA MQKIDLGNNESLVCGVFPNQDGTFTAMTYTKSKTFKTETGARRWLEKHTVS >LR134000|2499003:2512534|2509615_2509789_-|VDY68834.1|DBSCAN-SWA MTTLIYLQIPVPEPIPGDPVPVPDPIPRPQPMPDPPPDEEPIKLSHRERRSARIRAC >LR134000|2499003:2512534|2504304_2505114_+|VDY68828.1|DBSCAN-SWA MTKQPPIAKADLQKTQGNRAPAAIKNNDVISFINQPSMKEQLAAALPRHMTAERMIRIATTEIRKVPALGNCDTMSFVSAIVQCSQLGLEPGSALGHAYLLPFGNKNEKSGKKNVQLIIGYRGMIDLARRSGQIASLSARVVREGDEFNFEFGLDEKLIHRPGENEDAPVTHVYAVARLKDGGTQFEVMTRKQIELVRSQSKAGNNGPWVTHWEEMAKKTAIRRLFKYLPVSIEIQRAVSMDEKEPLTIDPADSSVLTGEYSVIDNSEE >LR134000|2499003:2512534|2505861_2507097_+|VDY68831.1|integrase|DBSCAN-SWA MSKLPTGVEIRGKYIRIWFMFRGKRCRETLKGWEVTNSNIKKAGNLRALIVHEINSGEFEYLRRFPQSSTGAKMVTTRVIKTFGELCDIWTKIKETELTTNTMKKTKSQLKTLRIIICESTPISHIRYSNILNYRNELLHGETLYLDNPRSNKKGRTVRTVDNYIALLCSLLRFAYQSGFISTKPFEGVKKLQRNRIKPDPLSKTEFNALMESEKGQSQNLWKFAVYSGLRHGELAALAWEDVDFEKGVVNVRRNLTILDMFGPPKTNAGIRTVTLLQPALEALKEQYKLTGHHRKSEITFYHREYGRTEKQKLHFVFMPRVCNGKQKPYYSVSSLGARWNAAVKRAGIRRRNPYHTRHTFACWLLTAGANPAFIASQMGHETAQMVYEIYGMWIDDMNDEQVAMLNARLS >LR134000|2499003:2512534|2499003_2499576_-|VDY68820.1|DBSCAN-SWA MAISKGRLIRVNQNGKFRYCMPGDNLPAEPKAASVAETDGKAFPQPAGVALPVQEAATQEEREKIKEQVVELVRQSGRETLRQLEAKTGATRYLMSILARELVASGDVYNSGYGLFPSEQARKDWKNARKKLSRAKVKKTVVVDPDLIWSLPDGEIRRYDRRQNIICRECRKSEVMKRVLAFYQGNLRYL >LR134000|2499003:2512534|2507148_2508084_+|VDY68832.1|tRNA|DBSCAN-SWA MKENQQITKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGFPEHVLPEYLEKLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIQRFADAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIETMFSAMQNVVPSHLCDTNLFDFKGITHGSEVVNGGDLAFDREEIPLQPAGWQPEEDENQLDELRLNVVEVK >LR134000|2499003:2512534|2499938_2500550_-|VDY68823.1|DBSCAN-SWA 
MRLRVVPGFISPPPGFGGLGYTPTARACVNISIPLQLRVIDMLDVFTPLLKLFANEPLEILMYTIIIFGLTLWLIPKEFTVAFNAYTEIPWLFQIIVFAFSFVVAISFSRLRAHIQKHYSLLPEQRVLLRLSEKEIAVFKDFLKTGNLIITSPCRNPVMKKLERKGIIQHQSDSANCSYYLVTEKYSHFMKLFWNSRSRRFNR >LR134000|2499003:2512534|2501334_2501610_+|VDY68826.1|DBSCAN-SWA MITNYEATVVTTDDIVHEVNLEGKRIGYVIKTENKETPFTVVDIDGPSGNVKTLDEGVKKMCLVHIGKNLPAEKKAEFLATLIAMKLKGEI >LR134000|2499003:2512534|2505644_2505860_+|VDY68830.1|DBSCAN-SWA MAQVIFNEEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYNYPKINQFIKDS >LR134000|2499003:2512534|2501711_2504312_+|VDY68827.1|DBSCAN-SWA MSTKPLFLLRKAKKSSGEPDVVLWASNDFESTCATLDYLIVKSGKKLSSYFKAVATNFPVVNDLPAEGEIDFTWSERYQLSKDSMTWELKPGAAPDNAHYQGNTNVNGEDMTEIEENMLLPISGQELPIRWLAQHGSEKPVTHVSRDGLQALHIARAEELPAVTALAVSHKTSLLDPLEIRELHKLVRDTDKVFPNPGNSNLGLITAFFEAYLNADYTDRGLLTKEWMKGNRVSHITRTASGANAGGGNLTDRGEGFVHDLTSLARDVATGVLARSMDLDIYNLHPAHAKRIEEIIAENKPPFSVFRDKFITMPGGLDYSRAIVVASVKEAPIGIEVIPAHVTEYLNKVLTETDHANPDPEIVDIACGRSSAPMPQRVTEEGKQDDEEKPQPSGTTAVEQGEAETMEPDATEHHQDTQPLDAQSQVNSVDAKYQELRAELHEARKNIPSKNPVDADKLLAASRGEFVDGISDPNDPKWVKGIQTRDCVYQNQPETEKTSPDMNQPEPVVQQEPEIACNACGQTGGDNCPDCGAVMGDATYQETFDEESQVEAKENDPEEMEGAEHPHNENAGSDPHRDCSDETGEVADPVIVEDIEPGIYYGISNENYHAGPGVSKSQLDDIADTPALYLWRKNAPVDTTKTKTLDLGTAFHCRVLEPEEFSNRFIVAPEFNRRTNSGKEEEKAFLRECASTGKTVITAEEGRKIELMYQSVMALPLGQWLVESAGHAESSIYWEDPETAILCRCRPDKIIPEFHWIMDVKTTADIQRFKTAYYDYRYHVQDAFYSDGYEAQFGVQPTFVFLVASTTIECGRYPVEIFMMGEEAKLAGQLEYHRNLRTLADCLNTDEWPAIKTLSLPRWAKEYAND >LR134000|2499003:2512534|2508212_2509586_-|VDY68833.1|DBSCAN-SWA MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTGSGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLPNTKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTLVMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAAISGRVQRDPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNTKKDCQAVCDALNEVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDVAARGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEAQRANIISDMLQIKLNWQTPPANSSIATLEAEMATLCIDGGKKAKMRPGDVLGALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKTCRVRLLK 
>LR134000|2499003:2512534|2511301_2512534_+|VDY68836.1|DBSCAN-SWA MITHNFNTLDLLTSPVWIVSPFEEQLIYANSAAKLLMQDLTFSQLRTGPYSVSSQKELPKYLSDLQNQHDIIEILTVQRKEEETALSCRLVLRKLTETEPVIIFEGIEAPATLGLKASRSANYQRKKQGFYARFFLTNSAPMLLIDPSRDGQIVDANLAALNFYGYNHETMCQKHTWEINMLGRRVMPIMHEISHLPGGHKPLNFVHKLADGSTRHVQTYAGPIEIYGDKLMLCIVHDITEQKRLEEQLEHAAHHDAMTGLLNRRQFYHITEPGQMQHLAIAQDYSLLLIDTNRFKHINDLYGHSKGDEVLCALARTLESCARKGDLVFRWGGEEFVLLLPRTPLDTALSLAETIRVSVAKVSISGLPRFTVSIGVAHHEGNESIDELFKRVDDALYRAKNDGRNRVLAA >LR134000|2499003:2512534|2501089_2501260_+|VDY68825.1|DBSCAN-SWA MTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHEL >LR134000|2499003:2512534|2499641_2499776_+|VDY68821.1|DBSCAN-SWA MVHYEVVQYLMDCCGITYNQAVQALRSNDWDLWQAEVAIHSNKM >LR134000|2499003:2512534|2510063_2511047_-|VDY68835.1|DBSCAN-SWA MEAIKGSDVNVPDAVFAWMLDGRGGVKPLENTDVIDEAHPCWLHLNYVHHDSAQWLATTPLLPNNVRDALAGESTRPRVSRLGEGTLITLRCINGSTDERPDQLVAMRVYMDGRLIVSTRQRKVLALDDVVSDLEEGTGPTDCGGWLVDVCDALTDHSSEFIEQLHDKIIDLEDNLLDQQIPPRGFLALLRKQLIVMRRYMAPQRDVYARLASERLPWMSDDQRRRMQDIADRLGRGLDEIDACIARTGVMADEIAQVMQENLARRTYTMSLMAMVFLPSTFLTGLFGVNLGGIPGGGWQFGFSIFCILLVVLIGGVALWLHRSKWL >LR134000|2499003:2512534|2500868_2501090_+|VDY68824.1|DBSCAN-SWA MIAHHFGTDEIPRQCVTPGDYVLHEGRTYIASANNIKKRKLYIRNLTTKTCITDRMIKVFLGRDGLPVKAESW |
17 | Escherichia_phage(69.23%) | integrase,tRNA | attL 2493517:2493530|attR 2511836:2511849 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation contains a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
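The keyword rule in the note above can be illustrated with a minimal sketch. It assumes that protein headers follow the pipe-delimited layout seen in the sequence dumps (contig, prophage region, gene coordinates with strand, protein ID, an optional annotation, then the tool name) and that a protein is flagged by a simple case-insensitive substring match. The function names `parse_header` and `matched_keywords` are hypothetical, and the actual matching rule used by the site is not stated in this record.

```python
# Keyword list copied from the note above; the matching rule itself is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def parse_header(header):
    """Split a header such as
    '>LR134000|2499003:2512534|2505861_2507097_+|VDY68831.1|integrase|DBSCAN-SWA'
    into named fields. The field layout is inferred from this record, not documented.
    """
    fields = header.lstrip(">").strip().split("|")
    contig, region, gene, protein_id = fields[:4]
    # The annotation field appears only on some headers (6 fields instead of 5).
    annotation = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return {
        "contig": contig,
        "region": (region_start, region_end),
        "gene": (int(gene_start), int(gene_end)),
        "strand": strand,
        "protein_id": protein_id,
        "annotation": annotation,
        "tool": fields[-1],
    }


def matched_keywords(annotation):
    """Return the phage-related keywords found in an annotation string.

    Uses a case-insensitive substring match, so 'lysin' would also match
    'lysine'; a stricter word-boundary match may be preferable in practice.
    """
    if not annotation:
        return []
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


if __name__ == "__main__":
    record = parse_header(
        ">LR134000|2499003:2512534|2505861_2507097_+|VDY68831.1|integrase|DBSCAN-SWA"
    )
    print(record["protein_id"], matched_keywords(record["annotation"]))
```

Headers without an annotation field, such as the one for VDY68829.1, yield `annotation = None` and therefore no keyword hits under this sketch.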
DBSCAN-SWA_10 |
3563153 : 3585944
Sequences of DBSCAN-SWA_10
Nucleotide sequences of DBSCAN-SWA_10 >LR134000|3563153:3585944|DBSCAN-SWA CATGAATAAATATCAGGCAGTGATTATTGGTTTTGGCAAGGCTGGAAAAACATTAGCCGTCACGCTGGCAAAAGCAGGTTGGCGAGTGGCTCTCATCGAACAATCAAATGCAATGTATGGCGGGACCTGTATTAATATCGGCTGCATCCCAACCAAAACATTGGTTCATGACGCACAGCAGCACACAGATTTTGTCCGTGCCATACAGCGTAAAAATGAAGTGGTTAATTTTTTACGTAATAAGAATTTTCATAATCTTGCGGATATGCCCAATATCGACGTGATCGACGGCCAGGCGGAGTTTATCAATAATCATAGCCTGCGTGTTCATCGGCCTGAGGGAAATCTGGAAATTCATGGCGAGAAAATTTTTATTAATACCGGTGCACAAACCGTGGTTCCGCCAATTCCTGGAATTACCACCACGCCAGGAGTATATGACAGCACCGGATTACTTAATCTAAAAGAATTGCCTGGGCATTTAGGTATTTTGGGCGGCGGATATATTGGCGTTGAGTTCGCCTCTATGTTCGCTAATTTTGGCAGCAAGGTAACCATTTTAGAAGCAGCTTCGCTGTTTTTGCCTCGGGAAGATCGGGATATTGCTGATAATATCGCGACGATTTTACGCGATCAGGGCGTCGATATTATCCTCAATGCCCATGTGGAGCGAATCAGTCACCATGAAAATCAAGTGCAAGTGCATAGCGAGCACGCCCAACTGGCGGTGGATGCACTGTTAATAGCTTCCGGTCGTCAACCGGCTACCGCTTCGTTACATCCAGAAAATGCCGGTATCGCAGTAAACGAGCGCGGGGCAATTGTCGTTGACAAGCGATTACATACCACCGCAGACAATATTTGGGCGATGGGAGATGTTACCGGCGGACTGCAATTTACTTACATATCACTGGATGATTACCGCATTGTACGTGATGAGTTACTGGGTGAAGGCAAACGTAGTACTGATGATCGGAAAAATGTGCCTTATTCCGTATTTATGACACCGCCCCTGTCCAGGGTTGGTATGACAGAAGAACAAGCCAGAGAGAGTGGTGCTGATATTCAGGTGGTGACATTGCCTGTAGCTGCAATTCCGCGTGCCAGAGTGATGAATGATACTCGTGGGGTATTAAAAGCGATTGTTGATAATAAAACCCAACGTATGTTAGGGGCATCACTGCTGTGTGTTGACTCCCACGAGATGATCAATATAGTGAAAATGGTGATGGATGCCGGGCTGCCTTATAGCATATTACGCGATCAGATATTTACTCATCCGTCGATGAGCGAATCACTCAATGATCTATTTTCATTAGTCAAATAAACTCAAAATCAGACGCCAGAACAAATATTCTGGCGTCTCAGAGAAAAGAATCTTATTAATTCTCTGTTACTCTATAACTTCTTAATCACTTCATTGATGGTATTTTATATGTTTAAAAAATCTGTTTTATTTGCAACACTATTATCTGGCGTTATGGCATTTTCCACCAATGCAGATGATAAAATAATTCTGAAACATATCAGCGTCTCGTCAGTATCAGCATCACCGACAGTTCTGGAGGATGCCATTGCTGATATAGCCAGAAAATATAATGCTTCATTCTGGAAAGTCACATCGATGCGAATTGATAATAATTCAACCGCAACAGCAGTATTGTATAAATAAGGATGTTCACAATGGAAAAATACCTGCACCTGTTAAGTCGGGGAGATAAAATTGGCCTGACATTGATTCGTCTGAGTATTGCAATTGTTTTTATGTGGATTGGGTTATTAAAGTTTGTCCCTTACGAGGCAGACAGCATTACACCATTCGTCGCAAACAGTCCACTAATGTCGTTCTTTTATGAACACCCGGAAGACTATAAACAGTATCTGACTCACGAAGGCGAATACAAACCAGAAGCAAGGGCATGGCA
ATCGGCCAATAATACCTATGGTTTTTCCAACGGTCTTGGCGTCGTGGAGGTGATTATTGCTCTGCTGGTTTTGGCTAATCCTGTCAATCGCTGGTTAGGTTTATTGGGAGGGCTGATGGCATTTACCACACCGTTGGTAACACTCTCATTTTTAATCACCACCCCGGAGGCATGGGTACCCGCATTGGGTGACGCTCATCATGGTTTCCCTTATTTATCCGGTGCTGGTCGCCTGGTATTGAAAGATACTCTGATGCTGGCAGGTGCAGTCATGATAATGGCAGATTCGGCGCGGGAAATTCTTAAACAACGCAGTAATGAATCCAGTTCAACGTTAAAAACTGAATATTGATAATCCGAACACTGTCTGTTGTTCACTTTTTCAGGCGTCAGGCCCCCTTTCTGATGATGAAGCCTGACGCCATATTACCGGAGATATGACAGAGTGAAGATCAATATCACTCCCTGTTAAAAGGTTTTTGTCGTCTTAGGTAAAATACATGATTAATATCAATCATGTATTTTTACAGCAAGACAACCACGAACAAATTCAGGATTAAAATGTTTTACCGACTCCCCAACATAACCCAAATCAAGCGAACTAATTTGTGCCATTTCTTTATCTGTCAGTGAGAAATCGATTTTTGATGGGCATCTGCAATACTCTGAAGCAGACTATACGGAAAATTCCACTGTCAGTCCCTCCATTAGGCATGAACAATGAGTCTACGTTAAAACGTAACCTCAAAGTAGTATGTGGATTTTGATATCACTTATGCAAAAAATTCATTAATAATGTAGGACTGAAACCTCTCTATTTTCGGGGACAACGAAGCAGACGCTACCAGTGCTTTTGCCTTCGCCCTTGCTATTTTTGATACACTTAGGGCCCAGGGTATAACGAAAATGTGCGATATGACAAAGAATTAACGGAGAATGAGATGATCAGGCAGAAGATTCTACAGCAGCTCCTGGAGTGGATTGAGTGCAATCTTGAGCACCCTATTTCAATCGAAGATATCGCACAGAAATCTGGCTACAGCAGACGCAACATCCAGCTTCTGTTCCGAAATTTCATGCATGTGCCTTTGGGAGAATACATTCGCAAACGAAGGCTTTGTCGTGCCGCCATTCTTGTCCGGCTCACCGCGAAATCTATGCTTGATATTGCACTCTCTTTGCATTTTGATTCACAGCAGTCATTCAGCCGCGAATTTAAAAAGTTATTCGGCTGCTCTCCCCGTGAATACCGCCACCGTGATTATTGGGATCTCGCAAATATCTTCCCTTCTTTTTTAATACGTCAACAGCAAAAGACGGAGTGTAGATTAATCAACTTTCCTGAGACACCTATTTTTGGCAACTCATTTAAATATGACATTGAAGTGTCGAATAAATCACCGGATGAAGAAGTCAAACTACGACGTCATCATTTAGCCAGATGTATGAAGAATTTTAAGACGGATATCTATTTCGTTTCCACGTTTGAACCGTCAACAAAATCGGTCGATTTGCTCACGGTTGAAACTTTTGCTGGTACGGTATGTGAATATGCTGACATGCCAAAAGAGTGGACAACGACCCGAGGACTTTATGCCTCTTTCCGTTATGAAGGAAACTGGGAAAATTATCCTGACTGGGTGCGTAACATCTATCTGATAGAGTTACCTGCCAGGGGGTTAGCCAGAGTGAACGGCAGCGATATTGAGCGCTTTTATTACAATGAAGATTTCGTAGAAAAGGATGGCAATGATGTTGTTTGCGAAATTTTTATTCCCGTTCGTCCGGTTTAGTTGGTCACTATCTCTTATTGAGTTTTATCCTTGCCTGATACTATTTAATCAGTATCAGGCAAGGTATTCAGTGAATAATGGCGTTGAATATTTCAACACCATTATTCTTTATTAGATCGTAACTTTCATACTATTCAAATTATGGCATCTCCTCCTCGCCATTTTCAGTCTCTGCAGCAACGGCGTTAATTGTCACCTTCACAGGCG
TCATTTCGTACCCGCCATACGTCAGTGCGTTGAATGTGAACGTATAGACACCTGGCTTGCTGGTACTAAATGTACGCGTGATTTTCCCACCTGAGAAATGGTCAGCCTTCGACGGCAGGAATTGATAATCCTTCTCCGTCACTCCTTCCGGAGCCTCAATGTCGACCCACATACTTCCTCCTACGGGATTGACTCCCGCCATCAGGGTAACGCGGGGCTACACTGTAGTAATGCGTTTTACCGTCAATCTCGCGGGTATAAAGTCGCGCCATGCTGTTCATATCCAGTTTACGCGCCAGGTCAAAGGCCAGAATGCACGGCTGCCCCTCGAACTGCTCAAGGGACAGTGATTTATCCTCGCAGCTCTGCCAGCTCACCAGGTTGAAATACGCCGAACGCGCCGACACCCAGATATTGAGGTGTTTTGTTTTAAAGACGTTTGCCAGACGGGCGTTATTTTTCGCGCGCTGTTGCTGACTTAACAAAAACTCACGGTAAACCGACACCCCGATATTCGGGTTGGCTTTTTCCAGCACCTGCGGGTCGGTCCAGTCGTCACCTTCGTCAACGGTATAGATGATCCCGAACAGTTCATCGTTGGGTACCGAACCGTTGAGCATCTCGATGACTTCCCGCCGTTTGTCGTAGCACGGCCCCTCAATGTTGTACCCGGCGGTAGTGATAGCCCACATCAGTGGCTGACGTCGCGCCCCCATCCCGGTAAGCATCGTGGTGTAAAGCGCATCTGTGGCGTGCTCGTGATATTCATCCACCACCGCACAGTGGGGTGATGAACCATCACCGGGGTTACCGATCAGCGGTTCAAAACGCGCACCATCCTCCGGACGGTTCATGTTTGAGGCGTTAACCTCAATCCCGAACGCTTCCGTCAGCATGGGTGTGCGTTTACACATCAGTCTTGCCGGACGAAAGACTTCCCATGCCTGTTTCTCCGTCGTGGCACCGGAATACACTTCCGCGCCGAACTCGTTATCACAGGCAAAACAATACAGAGCGACACCGGCAGAAATTGCCGATTTGCCGTTCTTACGGGGGATTTCGGTATACACCTCACGGAAGCGGCGCAGCCGGGAGCCTTTATTGACCCAGCCAAACGCGCAGCAGATCACAAAGAGCTGCCACGGCTCCAGCGTGATAGGCATCCTCTTGAATGCCCACTCACCCTTGGTGTGCGGCAACAGCTGAATAAATTTGGCGGCCCGTTCAGCCAGGTCCTTGTCGAAGCGGTAACGAAACGACTTACTTTTTTCCGCCATCAGGTCATCAAGATGGCGCTGGCAGGCCTGAATCACAAACTGGCAGGCCACAATCTTTCCGCGCACGACATCACGGGCATACTGATTGGCAGCATTTACGTTGGGGTAAGATTTCCGGCTCATGACTCGATGATTTTCAGAAACGGGTTAGTGGCTTTCTTCTGCCCCGCCAGGCCAATCAGACGCTGGCGGCTGCTGGGGTCGAGTCCGAGCATTGCCCCCGTACTGCTCATCTCGGACTCCTGTTCTTTTTTGGCGGTCAGATCCGGATTTTTGACCATACCGCCCATTGCACCGGTGATGGTGTTGCCCTGTCTGGCAATATTTTTCACGGCACGTCGCCAGAACTCATAGGCCACACACCACCGCTCAAGCACCGCGAGGTCAGTCACGCACAGCAGGCCCTGACCGCAGAGTTCTTTGGTTGTCAGTTGCCACATGATCGTGGCGAGAGGGAGATCTTCTTCAGCGAACCACTCCGGTGGCTCAACACCTTTGATGGGCGTAAAAACAGGTTCATCTTTATTCAGGGCTCGCTTGCCGGGGTTTCCGGCCAGCGCCTTGCGCGCCGTTGGCTTGGGGCGACGCCCGGAACGCCCCGCCGTTCCAGCCATATGCGGCACTCCTGGTTAAATTTCATTTTTCGCGGGTATAAAAAAACGATGGGGCGGGCAGTCCGGAAGACGTCAGGTCACAGAGATTTGACCCTCCCCTCCCCTCAGA
CAGTTGAGAGTTATTATCACTTAAGCCGTTCACGGGCCGTCTTCGCCTTATGACACGGCCAGCACAGACTCTGCAGATTACTGTCGGCATCAGTGCCGCCATGTGCTTTAGGGATGATGTGATCAACGGTTTTCGCTTCACGCACCACACCGGCACGCAGGCACAACTGGCACAGTCCTTTGTCACGCTTCAACACACGCACGCGGATAACATCCCACTTCGAACCATAACCGCGCTGATGACGGGATTGTCCTGGCTTGTATTGCTTCCAGCCTTCGCTTTTGTGGCTTTCACAATAGCCTGACGGGTCTGTGGTGGTATGGCGGCAGCCACGAACACGGCAGGCTTTTGGGATTCGTGATGGCATATGTACTCCAATGAAGAAGCCACCGACATAGCCTCCTCCATTCATCGTGAAACTATTTTCATCTACCCAGTAATGAATTCTTTGTAGAGTTGTGATCAATACAACTCACTAATGGAGAGGCTTGTCCAACACGTTGGACAAGTTTCCTGTTTGATTTACTGGACACTATAGAAGGACAGAATGCCTTCATCACTCGAATAACATCAATTAAGGAGGTTCAACATGTTTCATTCCACAAATCATCAGGCTGTAATTATGGCTGCATCAGCTTGTACCACAGACCTTTTCCGCTTCACTTTGAGCCTGATTCATTTCTACCTGACCGGCTCGCCTCTATCTTTTTAATCCCCGCTTTATCCAAATTGCATTGCCAGAATGCCGACAACAGACTGACATTCAAATCCTGACTACCTCCAATAGTCTGACCGTACACCTATATAGTTTTAATTTTCATCAATCCATTTAACTATCGTTTAATTGTTGTCACATAGAATTCTGCCGTTTTTAACAATGCAGGATAATAAGATGAAAAAAATGTTGTTTTCTGCCGCTCTGGCAATGCTTATTACAGGATGTGCTCAACAGACGTTTACTGTTGGAAACAAACCGACAGCAGTAACACCAAAGGAAACCATCACCCATCATTTCTTCGTTTCGGGAATTGGACAGGAGAAAACTGTTGATGCAGCCAAAATTTGTGGCGGCGCAGAAAATGTGGTTAAAACAGAAACCCAGCAAACATTCGTAAATGGATTGCTCGGTTTTATTACTTTAGGCATTTATACTCCGCTGGAAGCGCGGGTGTATTGCTCACAATAATTGCATGAGTTGCCCATCGATATGGGCAGCGCTATCTGCACTGCTCATTAATATACTTCTGGGTTCCTTCCAGTTGTTTTTGCATAGTGATCAGCCTCTCTCTGAGGGTGAAATAATCCCGTTCAGCGGTGTCTGCCAGTCGGGGGGAGGCTGCATTATCCACGCCGGAGGCGGTGGTGGCTTCACGCACTGACTGACAGACTGCTTTGATGTGCAACCGACGACGACCAGCGGCAACATCATCACGCAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTTGCATCGAGCGCAGCAACATCACGCTGACGCATCTGCATGTCAGTAATTGTCGAGTTCGCCAGCTTCAGTTCTCTGGCATTTTTGTCGCGCTGGGCTTTGTAGGTAATGGCGTTATCGCGGTAATGATTAACAGCCCATGACAAGCAAACGATGATGCAGATAACCAGAGCGGAAATAATCGCGGTTACTCTGCTCATACCTCAATCTCTCTGACCGTTCCGCCTGCTTCTTTGAATTTTGCAATCAGGCTGTCAGCCTTATGCTCGAACTGACCATAACCAGCGCCCGGCAGTGAAGCCCAGATATTGCTGCAACGGTCAATTGCCTGACGAATATCACCGCGATCAATCATCGGTAAAGCGCCACGCTCTTTAATCTGTTGCAATGCCACAGCGTTCTGACTTTTGGGAGAGAAGTCTTTCAGGCCAAGCTGCTTACGGTAGGCATCCCACCAACGGGAAAGAAGCTGGTAACGTCCGGCGGCTGTTGATTTGAGTTTGGGGTTTAGCGTGACAAGT
TTGCGAGGGTGATCGGAGTAATCAGTAAACAGTTCACCACCGACAATAACATCATAACCGTGGTTACGTGTCGGTTGTCGCCCGTTATCCGTTCCTTCTGACCATGCAACCATATCCAGGAAAGCTTTACGCTGGGAATTTAGTACCTGCATAAATTACTCCTTCGAGCTACCAAACTTGTTACCGATTACTCTCATTGCAGCCCCACGAATAGCATCGACACCGATCAGCCCCACCCCACCACCAATGGCAACAGAAAGCGATTTAGGCCATCCGACATACTCAAGCGCGGATGCAAAGGTCAACGTCAGAGCGCCACATAGCAAAATCTCGAGCGTTTTTCGCTTCCAGCCACCACCACCGCCAAAATAGGCGATGCGCAAACCAGCCATAACAATCGACATAATCACTGCGCCCAGCGGCGTATCTCCACGCCACCAGCTCTGAAACAACTCCAGCCAGTCAGGCCAGGTATTTGGGTTATGAGGCATTTCGTCATCTCTCACCTCGCGATATTTGCGGGTGCTGTGTTGGAAATAAAAAGGCCACGCAACGTGGCCACCAGAATTATTTCCCCACCAGTTCACTTACCTCTTTCACCGTCTGATTAAACCGCTCTGACTCAAGTTCAACACCTAACGCCCGACGCCCCAGCGCCATTGCCGCTTTTATTGTGGAACCGGATCCCATAAAGAAATCAGCAACCAGATCGCCTTGTCGACTACTGGCACTGATTATTTGCCGGAGCATATCCGCAGGCTTCTCACACGGATGTTTACCCGGGTAGAACTGAACGGGTTTATGCGTCCAGACGTCGGTATAAGGCACGGAAACTGATACGGAGAAATAGCGCCGGAGAGTTTTAAACTCATCCAGCAATTCAGAATATTTGCGATTCAGTGAATCATAAGATGCCACCAGCTGGTGGTGTGGTTGTTCCAGTTGTTGTTCCTGAAACTTCTCTGCCGCTATACGGGAAAACAGTGCCTGCAACTTCCGGTAGTCAGCCTCATTCGGCAACTGCCACTGACTGGCACCAAACCAGTGGGAAACCATATTTTTCTTACCAGTGGCTTCGGCAATCTGTTTTGCCGTTATACCCAGTTCGGCACGAGCATCCCTGAAATACGAAATCAGCGGTGCCATTATGTGCTGTTTGAGTTCCCTTTCTTTTGCCGCATAGCCGTCACTTTTGCCGCGATATGGTCCCTGGTAATGTTCAGCAAACAAAACGCGTTCTGTTGCGGGAAAATATGCGCGCAGACTTTCTTTATTACACCCGTTCCATCGTCCGGACGGCTTCGCCCAGATAATATGGTTAAGAACGTTGAAACGTTCACGCATCATGATCTCGGTATCAGATGCGAGGCGATGTCCACAGAACAGGTAAAGGCTTCCGGCAGGTTTCAGCACCCGCCAGAACTGAGCCAGACAGTGGTCCAGCCACTTCAGGTAATCTTCGTCCCCTTTCCACTGATTGTCCCAACCGTTGGGTTTCACCTTGAAGTACGGCGGATCGGTAACAATCAGGTCAATGGAATCATCAGGCAGGGACTGAATAAAATGCAGGCAATCAGCGTTGATTAAATCAACACTGTTTATTTTCACAGTATTTTTCATGGATCAGTAAGCGTAACTCTGGTAGGCTCACTCTGCTTTTGCGCTAAAGCAGTGGGCCGTGGTTCGCTTGTGACCAGTAAGCATGAGCGAATGGCTGGCAGGTGCTACCAACACCCACCAGCCGCCCATTTTCACAAATTAAAAGCCCTTCATTGCTGAAGGCGTCTGTAACAGCCGAACTGGTAATCTGCCAGCCCCGCCATAACCAGCTGGGTCAGTATTAACTGACAGCGTTCGCGTGAAAGGTATGTGTTTTGTGCAATCTCCCCGACTGTTGCCGGTTTGCCGTTTAATTCATTAAAAACAACTTTCGCCGTTTCTGTCATATCTAGCTGTTTTAGCATGTCTTTTTACCTTCATGGTTAACATGACATACCAAT
AACTCTTGTCTAAAAAGCCAGCAAGATAAAAAGTCAGTATTCACGACCACCAGCGTGTTTACCGTACTGCACCAGGTTTACAGGTACAAAAAAACCCGCTCGACGGCGGGTTTAAGCTGTGTGGCGAAGTAACCACTCTTAACAGCATAACCAATTTTTTACGTACGTAAACCTCTAAACAATATTTGTGAGAATGATATCGAGTGTTCAAAACACCACCACAATCACATAAGGAAAAATAAACAAATAATCATTAAATAATTTCCGACGTTATTTTCAGTGAATTTAAATTAAAGAGATGATTTATATAACACTTATAAATAACAACCTTTAATATAATTTAGCTACCAAATTTATTTTCTTTCAGAAAATAGTCATGTACAGCATGCAATAGAAAAAGTTCAGATAAAAATAGAGATCTAGATCACAATTTAAATAAGAATCTAAAACTTACATCTTGAATTAATCACATTGATTAGATGAATATTTGTCGCGCAGGGCATCATTTTTTAATAAATGTTCAAAAAAAGGTCTCACGATGAAAAAATTAACAGTGGCAATTTCTGCTGTAGCTGCATCAGTACTGCTGGCGATGTCTGCTCAGGCAGCAGAGATTTATAATAAAGACAGTAACAAGCTGGATCTGTACGGAAAAGTTAATGCTAAGCACTACTTCTCCTCTAACGAGGCAGATGATGGTGACACTACTTATGTTCGTCTGGGCTTCAAAGGCGAAACCCAAATCAACGATCAACTGACTGGTTTCGGTCAGTGGGAATACGAATTCAAAGGCAACCGTGCTGAATCTCAGGGGTCTTCCAAAGATAAAACCCGTCTTGCCTTCGCTGGTCTGAAATTCGGTGACTACGGCTCCATCGATTATGGTCGTAACTACGGTGTAGCATACGACATCGGTGCGTGGACTGACGTACTGCCAGAATTCGGTGGTGATACCTGGACTCAAACCGACGTATTCATGACTCAGCGTGCAACTGGTGTTGCAACTTACCGTAATAACGACTTCTTTGGTCTGGTTGATGGCCTGAACTTTGCTGCTCAGTATCAGGGCAAAAATGACCGTAACGATTTCGAGAACTACACCGAAGGTAACGGTGATGGCTTCGGTTTTTCTGCTACCTATGAATACGAAGGATTCGGCATTGGTGCGACCTATGCAAAATCTGATCGTACCGACACTCAAGTCAATGCAGGGAAAGTTCTTCCTGAAGTATTTGCTTCCGGTAAAAATGCAGAAGTTTGGGCTGCAGGTCTGAAATATGACGCTAACAACATTTACCTGGCCACTACCTATTCTGAAACCCAGAATATGACGGTGTTTGCTGATCACTTCATTGCTAATAAAGCACAGAACTTCGAAGCTGTTGCACAATATCAGTTCGATTTCGGTCTGCGTCCGTCCGTTGCTTACCTGCATTCTAAAGGAAAAGACTTGGGTGTTTGGGGTGATCAGGACCTGGTTGAATATGTTGATGTAGGTGCAACTTATTACTTCAACAAAAACATGTCTACTTTCGTTGATTACAAAATCAACCTGCTTGACAAAAATGACTTCACTAAAGCACTCGGTGTAAGCACTGACAACATCGTTGCTGTAGGATTGGTTTACCAGTTCTAATCTAGTTACTAAAAGATATGTTGCGGGAGGCGTTGCCTCCCCAACATATAAGTGGCTCCCTCAAGCCACTTCCTTTAGAAGCACTACCTTGCTTCTGACTATATAAACCTTCTGTTATATATTACCCTTTATTTTGGGGGCGTTGACACGCCCCATTTTTAATAAAATTAAGTGAACAATTAGCGTATCAATTAGACTTCGTAACAACGATATCCATCTCTAACCGGATATCTAATGCCATTAACATTCCTTCAATTATTCCCTCAGCCTTCTGTAACTTTTTCCCGATATAACCATCAGAGCAACAATGCTTGCTTGCAAGTGACATGAATGTCATACCGCCGACATAATAATCCACTAGTAA
ATCATGCAAATCGCTGTTGTTCTTTTTCAGACGGGCCATACATCCGCAAACGATCATCGCGTCATCATCACAACATTGCGGACGGGATTTTACTTTTGAAGGAAGTAACCCCTTAAAACCAGCAGCAATGGACGACCAGGTCACATCCTCACGGTTATTAGCTGCCCATGCCCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTTCTCCACAAAATCAGGCCAGCACGCCAGTTGCCAGCGCACGATCGATAAAACGAAATATCAACTCCAGCTGGGAGCCATACTTCTCTTCAAATGCCGCGGTATCCGGAGGAAACAACACGATCTCCACTGAAGCAGGTGCCGACGTTGGTTTCGGCAGACGACGTAACTGCTCAACTATTGCTGCGCACGCCGCGCTCTGGAATTTTCGCCCCGCCGCGCTTATCAGACTCTTACCAGCAAACGCCCCTTTGTTGGGGTGTCGCCAGTACGTGTTCACGCTGGGCGGGAAAGGCAAGATCAGCTTCATACTTTCAGGCCTCTCTCATGTAACCAGTGGGCTGCACGCAGCCTGGCGTTTTCCTCACCGGCAAGCAGTGCGCGGATAATCCCGACCGCCTCGCTGTCGTCGTCCTTCACCGCGGTATGAAGCGTTATCCCCCGGGCCACGCCACGCTTTATCGTGATGACGCTTTTTTTCTCCAGTGCACGAAGATGCTCTACCGCTGCATTCACTGAACGGTATCCCAGCATGGTTGCCACCTCCTGATTGGTTGGCGGAAACCGTCCGCAAGTTCTATATGCAGCTTCTGCTGCACGTCCTTTTGGAATTAACTGGCCCGGACGGTTTCGCCACTGATAAACGGCTTCAGTTGTTATGCCGAAAAAAGCAGCAACTTTCTCAATACTGCCGAAGTAGCTTTCGATATCGTCAGTTGTCATACGCCCTCCAAACTAAGTTTTGTTAGATGCTAATTACAAATCTATCTTTGGTCAATAAAAACTAAGATTACTTAGCAATTAAAGAAATGGTGCTCCTATGGAAACGGTTGGTCAGCGTATAAAAGCTCTGAGAAGAGTTACCAGAACGTCCCAGAAAGAATTGGGTAAATTTTGTGGAGTAAGCGACGTTGCTGTGGGGTACTGGGAGAAAGACATCAATACCCCTGGCGGGGAGGCACTTTCGAAATTAGCGAAGTTCTTCAATACGTCAATAGATTACATTCTTTATGGTGCTGAGTTTGAAGGCAAACTCGTCACAAACATGCGCAGAGTTCCTGTAATATCGTGGGTTCAGGCTGGGCAGTTTACTGAGTGCAGGGCAGCAGAAGTGTTTAGTGAAGTGGACAAGTGGGTAGATACATCATTAAAGGTTGGTGATAACTCATTTGCATTAGAGGTTAAAGGTGACTCCATGACTAACCCTAATGGCCTCCCAACAATACCAGAAGGCGCAACAGTGATTGTAGATCCAGATGCAGAACCTCGTCATGGAAAAATAGTCATCGCTCGACTTGATGGAACAAACGAAGCTACAGTAAAAAAATTAGTCATCGATGGCCCTCAAAAGTTTTTAGTGCCATTAAATCCTCGGTATCCCAACATCCCTATCAATGGTAATTGCCTTATCATTGGTGTAGTCAAAGGAGTTCAATACGAACTCTAAGACCTCTCTTCTCTAACTAAGGCACCGAACTAAGAAAAGTTTGGTGTTTTCTCTTGCCATGATAACTAAGTTAAGTTAGATTTTATATCAAAGATAACGAACAGGCAGGACGCCCACGAAGTAGCCGCCTGGGGCATATGAAGTCCAGGATGATTCGTTAGCAACAAAAAAGCGCCCTACAGGACGCTTAGCTCTTTAACAATCTGGTCCCCATCAACAAGTAACTGATAACTTGAGGAGATGTGAAATGCACAAAACAGAACCCAAAATCGTCGCGCCTGGCTACACAGATGAGGAAATTTATGAGTGGATGACAAAGAAGCTGGCAGCTATAAACCAGCT
TCGTGAAGTGCTGTCTTATCGACAGGAAACAATAGACTCCTTAAAAAAACTGGATCAGGAAATCACGGTTTTATCACAGGATGTTACTTTAGATATTGTGCAGACAAATTAGGATCCCATTCATTTTCGTCAAAATCATCAAAGTGATGAATTTGTGATCTCCAGTCTCGATAATCTAAAAATTTCTGGGCGGTTACGCTTATTTTATCAAGTGTGAGTTCATCCTGAATTGAAAGAAGAAGTTCATCAAATTTCATCTCATTAATCTGTTTTGGCATCCAGTGATGCTTCATCAGAATAAGGTGAACCAGAGCCTTTTTCCCATTCAACTGATTATAGGGAGTGCCGAATTTCTTCCGGTGCTCATGTAAGACAAGGTCCAAAAGAGTAAGTAATGTTGCCCTTGATTCAACTTTGCTTATTTCGACTGATGACACTACCCCACTGATTTCAATGCCCCGATACTTTCCAACATTTTCACAGTGGGATTTGTACAGCGTATAGATATTACCGGACATTTCTTTTCCTTTTGCGTTGTTGGGGATAACCAGATTAACCGAATCCTTGTTGTTGGGGAATAACCAGGTCCACCTCGCCTGATGTGGCTAAAAGCAGGCACATAACAGCTAAGTATTTTCAACCAGAGAGAATCCTTAGCGTTGTGGTGAATGCGGCTCAGCGCACGCGGGTTAAGGTTGAGGCTGACAGTCGACCTTCTGTGGATACCCACCCGCCTGGTGTGCAACCTTCGCCAGGCACCGGGAGGCACCCGGCACCACAACTTTATGCTGTGTGTAGTCCTGGCGGTACCAGTTTGTACCCTTGCTTCCGGCTGGTACCGTCCTTTTTACAAAACAGAGAAGAGCATCACCGGACGACGGGCTCATAACCCAATCCATCCGGGCGGCTGCCACTGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACTAATCGGCCTTGCAGGGTCGATATGATGAGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCAGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCACTCAACCTGGAAATCGAACCATTTGATGAGAACCGTGTGAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAAGACCAGGGACAGTTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCGTTAATATCGTTTTTAATTAACTGATTATTTATCTCATCACTGAATATCTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACGCAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTATTCTAAAGATCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCTGATAATATGCGTGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACACTCAAACTGAAAAAGACAGCACCGTTCTCTGCTCTGTTGTCTGTTAACGGCGAGCGTAACTCCCAGAAGTCACTGGCAGAATGGATTGAAGACTGGGCCGACTATCTTGTGGGCTTTGATGCTAATGGTGACGCTATTCAGGCAACAAAAGCGGCTGCGGCTGTCCGTAAAATCACGATTGAAGCAAACCAGACCGCTGATTTTGAAGATAATGACTTCAGCGGCAAACGTTCCCTGATGGAGTCTGTCGAAGCGA
AGACCAAAGACATTATGCCAGTGGCATTTGAATTTAAATGCGTTCCGTTTGAAGGTCTGAAAGAACGTCCGTTTAAATTACGCCTCAGTATTATCACTGGCGATCGTCCTGTACTGGTTCTGCGCATTATTCAGCTGGAGGCGGTGCAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAGAAATTCAAGGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCATTTATGGAAACGTAATTTACTCAATAATCGCCGGATGGTGAGGGATTCTTTTTACCAGAATTCAGCGCGGTGCAGCGCATATACGTGGAGAACAAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAGGATAAATAAAGACGACATCGATATTAACGATATCGCGGTTTCCCTTTCAAATATCTGTCGCTTTGCCGGTCATCTTTCGCACTTCTACAGCGTCGCCCAACATGCGGTTCTTTGCAGCCAGTTGGTGCCGCAGAAATTTGCTTTTGAAGCGTTAATGCATGATGCAACAGAAGCGTATTGCCAGGACATTCCCGCACCACTGAAACGCCTTCTTCCTGACTATAAACAAATGGAAGAAAAAATAGACGCCGTAATCCGTGAGAAATACGGGTTACCCCCAGTTATGAGTACGCCCGTGAAATATGCCGATCTCATCATGCTGGCAACCGAACGCCGCGATCTCGGGCTTGATGATGGCTCTTTCTGGCCTGTACTGGAAGGTATCCCGGCAACAGAGATGTTCAACGTGATTCCACTGACATCGGGCCATGCCTACGGGATGTTTATGGAACGCTTTAACGAGTTATCGGAGTTACGCAAATGCGCATGAATGTTTTCGAAATGGAAGGGTTTCTTCGTGGGAGATGTGTACCGCGAGATCTGAAAGTAAATGAAACAGATGCTGAATACCTGGTGCGTAAATTCGATGCGCTTGAAGCTAAATGTGCAGCACAGGAAAACAAAGTAATACCAGTGTCAACTGAACTGCCACCAGCAAATGAAAGTGTTTTGTTATTCGATGCTAACGGAGAAGGCTGGCTAATTGGCTGGCGTTCTCTCTGGTACACCTGGGGACAAAAAGAAACCGGAGAATGGCAGTGGACATTTCAGGTCGGGGACCTTGAAAACGTCAATATCACTCACTGGGCAGTAATGCCAAAAGCACCGGAGGCTGGAGCATAATGACCACTTTTACCGACAAAGAACTGATTAAAGAAATTAAAGAGCGTATCAGCAGCCTTGACGTGCGAGACGATATTGAGCGCCGTGCTTATGAAATCGCACTCCTATCTCTGGAAGTAGAACCAGATGAACGCGAAGCTTATGAATTATTCATGGAAAAGCGTTTCGGTGACTTAGTAGATCGTCGGAGAGCAAAAAACGGCGATAACGAATACATGGCATGGGATATGACTCTCGGTTGGATCGTCTGGCAGCAACGAGCTGGTATCCATTTTTCAGCAATGTCACAACAAGAGGTGAAATAATGGAGCCATACAGCCTCACACTCGATGAGGCCTGTCATTTTCTCAAGATATCCAGACCGACTGCCATTAACTGGATACGCACAGGGCGTCTTCAGGCAACACGCAAAGATCCCACTAAGAATAAATCTCCTTACCTCACAACACGACAAGCCTGCATTGCGGCTCTTCAGTCTCCGCTGCATACTGTCCAGGTGAGCGCGGGTGATGGCATAACAGAGGAAAGAAAATGTCACTCTTCCGCAGAGGTGAAATATGGTACGCCAGTTTCACATTGCCGAACGGTAAAAGATTTAAACAGTCTCTTGGAACAAAGGACAAAAGGCAGGCGACAGAACTCCATGACAAGCTAAAGGCTGAAGCATGGCGGGTCAGCAAACTTGGTGAAATACCTGATATAACGTTCGAGGAAGCGTGTGTCAGGTGGCTTGAAGAGAAAGCA
CATAAAAAATCACTGGACGATGACAAAAGCCGGATCGGATTCTGGCTTCAACATTTCGCAGGAATGCAACTAAGAGACATTACTGAATCAAAAATTTATTCAGCAATGCAGAAAATGACGAACCGGCGTCATGAGGAAAACTGGAAACTCAGGGCAGAAGCATGCAGAAAAAAAGGGAAACCTGTTCCAGAATACACGCCAAAACCAGCGTCCGTTGCAACGAAGGCTACGCATCTTTCATTTATAAAGGCCCTACTAAGAGCCGCAGAGCGTGAATGGAAAATGCTGGATAAGGCACCAATTATTAAAGTGCCTCAACCAAAGAATAAACGGATCCGCTGGCTGGAGCCCCATGAAGCACAAAGGCTGATTGATGAATGTCCGGAGCCATTAAAGTCTGTTGTTGAATTTGCACTGGCAACAGGCTTAAGACGCTCGAACATCATCAACCTTGAATGGCAACAAATAGATATGCAGCGCCGGGTGGCATGGATAAACCCGGAAGAGAGTAAATCAAACCGCGCAATTGGCGTTGCGCTGAATGATACTGCATGTCGCGTATTGAAAAAACAAATCGGGAATCATCACCGTTGGGTATTTGTGTACAAGGAAAGCTGTACCAAACCAGACGGAACGAAAGCGCCAACAGTAAGGAAGATGCGGTATGACGCAAACACAGCCTGGAAAGCGGCGCTGAGACGGGCTGGTATTGATGATTTCAGATTTCACGACTTGAGACACACCTGGGCAAGTTGGCTGGTTCAAGCCGGAGTCCCGTTGTCAGTTTTACAGGAAATGGGAGGCTGGGAGTCTATCGAAATGGTTCGTCGATATGCTCACCTTGCACCTAATCACCTTACCGAACACGCACGGCAAATAGACTCGATCCTGAACCCATCGGTCCCAAATTTGTCCCAGTCAAAAAATAAGGAAGGTACTAATGATGTGTAACTTATTGATTTTAATGGTGCCGATAATAGGAGTCGAACCTACGACCTTCGCATTACGAATGCGCTGCTCTACCAACTGAGCTATATCGGCCCTGAAAGGACATGTTCACGAACGTGAATCACGGTGGACAAGGTTAAAACTAACCGGGCGATGCGTCAATGGCCTTGTGAATCAAATGGCTACTTTTGCATCACCCGGTTTTATTTACGCACGAATGGTGTAATCACCAATGCCGATCCACTTGTAAGTGGTCAGTGCTTCCAGCCCCATTGGGCCACGCGCGTGGAGTTTTTGTGTGCTTACCGCCACTTCCGCACCCAGACCAAACTGGCCGCCGTCGGTAAAACGCGTAGAGGCGTTAACGTAAACAGCGGACGAATCCACTTCGTTAACAAAACGCTGGGCGTTGCGCATATCGCGGGTCAGGATCGCATCGGAGTGTTGTGTGCCGTGTTCACGAATATGGGCGATGGCATCGTCAAGATCGCTGACGATTTTGACGTTCAAATCTAATGACAGAAACTCATCGTCATACTCTTCGGCTTTAACAGCAACCACCTTCGCAGGGCCTGCCTGCAACTGCGCCAGTGCAGCTGCATCTGCGTGTAATGTCACGCCGCTTTCCGCCATTTGTTTGCTTAATGCGGGCAGGAAGCTATCGGCGATGTTTTTATTCACCAGCAACGTTTCAACCGTATTACATGTGCTCGGACGCTGAGTTTTCGCGTTGACGATCACTTTTAATGCTTCAGCGATCTCTACACTTTCATCAACGTAAATATGGCATACGCCTATACCACCTGTGATCACCGGGATTGTCGACTGTTCACGGCACAGTTTATGCAAACCAGCGCCACCACGCGGGATCAGCATGTCGATGTATTTATCCATACGCAGCATTTCACTGACCAGCGCACGGTCAGGATTATCAATCGCCTGCACGGCACCCGCCGGTAAGCCGCAGGATTTCAGGGCGTCCTGAATCACCGCCACCGTTGCAGCGTTAGTGCGACACGTTTCTTTGCCACCGCGCAGGATCACCGCATTAC
CGGTTTTCAGGCACAGCGAAGCGACATCAACCGTCACGTTCGGGCGCGCTTCATAAATCACGCCAATAACCCCCAGCGGTACGCGACGACGCTCAAGACGCAGGCCGCTGTCCAGTACGCCGCCATCGATTACCTGCCCCACCGGATCGGCGAGGTTGCACACCTGACGTACATCGTCGGCAATGCCTTTCAGCCGTGCGGGCGTCAGTGCCAGACGGTCAAGCATCGCTTCGCTAAGGCCATTGGCTCGCGCGTCAGCAACATCCTGGGCGTTAGCGTTGAGGATGATTTCGCTTTGTGCTTCCAGTTCATCGGCGATTTTTTCCAGCACGCGATTTTTTTCGCGGCTGGAGAGTTGCGCTAATTTATACGAGGCTTGCTTCGCGGCAATGCCCATTTGTTCCAGCATCAGCCTGCTCCTTAACGGGTAATCATGTCATCACGGTGAACGGCAACCGGGCCGTATTCATATCCCAGTATTGCATCAATTTCTTGCGAGTGGTGTCCGGCAATACGGCGTAATGCATCGCTGTTGTAACGACTGACGCCGTGGGCGATATCGCGGCCTTCGAGGTTGCAAATGCGGATGACTTCACCACGCGAGAAATTGCCAGTCACGCTTTTAATGCCTTTCGGCAACAGGGAGCTGCCGCGTTCCAGAATGGCGGCAGTTGCCCCTTCATCTACCGTGATTTCACCCGCCGGCGGCGCACCGAAAATCCAGCGTTTACGGTTTTCAAGCGGAGTCGCCTGGGCATGGAACAGCGTACCGACGGAAATGCCTTCCATCACATCACCAATAACGCCCGGCTTGCTGCCCGCGGCAATAATGGTGTCGATACCCGCACGGCAAGCCACGTCAGCGGCCTGCAATTTGGTACTCATGCCGCCAGTTCCGAGGCCTGAAACGCTGTCACCGGCAATCGCGCGCAGTGCGTCATCAATGCCGTAAACATCTTTAATCAGTTCTGCCTGCGGATTGCTGCGCGGGTCAGCGGTATACAAACCTTTTTGATCGGTCAGCAGCAACAGTTTATCGGCACCCGCCAGAATCGCCGCCAGCGCAGAAAGGTTATCGTTATCGCCGACCTTAATCTCTGCCGTAGCGACAGCATCGTTCTCATTGATTACCGGAACGATATTGTTATCGAGCAACGCACGCAGGGTGTCGCGGGCGTTCAGGAAGCGTTCTCGGTCTTCCATATCAGCACGGGTCAGCAGCATTTGCCCGACGTGAATGCCATAAATCGAAAACAGCTGTTCCCACAGTTGAATCAGTCGACTCTGCCCTACCGCCGCCAGCAGTTGTTTCGAGGCGATAGTCGCTGGCAGTTCCGGGTAACCCAGGTGCTCACGTCCGGCGGCGATCGCGCCCGACGTCACAATAACAATCCGATGCCCGGCGGCATGTAACTGCGCGCACTGGCGAACAAGTTCAACGATATGGGCACGGTTCAGACGGCGCGATCCGCCTGTTAGCACACTGGTGCCGAGTTTTACCACCAGCGTCTGGCTGTCACTCATGATTCTCTGCCATTCAATTTTAGGAAAAATGATATCAAACGAACGTTTTAGCAGGACTGTCGTCGGTTGCCAACCATCTGCGAGCAAAGCATGGCGTTTTGTTGCGCGGGATCAGCAAGCCTAGCGGCAGTTGTTTACGCTTTTATTACAGATTTACTAAATTACCACATTTTAAGAATATTATTAATCTGTAATATATCTTTAACAATCTCAGGTTAAAAACTTTCCTGTTTTCAACGGGACTCTCCCGCTGAATATTCGCGCGTTAATTAAAATCAGGAATGAAAATGAAAAAGAGCACTCTGGCATTAGTGGTGATGGGCATTGTGGCATCTGCATCCGTACAGGCCGCAGAAATATATAACAAAGACGGTAATAAACTGGATGTCTATGGCAAAGTTAAAGCCATGCATTATATGAGTGATAACGACAGTAAAGATGGCGACCAGAGTTATATCCGTTTTGGTTTTAAAGGC
GAAACACAAATTAACGATCAACTGACTGGTTATGGTCGTTGGGAAGCAGAGTTTGCCGGTAATAAAGCAGAGAGTGATACTGCACAGCAAAAAACGCGTCTCGCTTTTGCCGGGTTGAAATATAAAGATTTGGGTTCTTTCGATTATGGTCGTAACCTGGGCGCGTTGTATGACGTGGAAGCCTGGACCGATATGTTCCCGGAATTTGGTGGCGACTCCTCGGCGCAGACCGACAACTTTATGACCAAACGCGCCAGCGGTCTGGCGACGTATCGGAACACCGACTTCTTCGGCGTTATCGATGGCCTGAACTTAACCCTGCAATATCAAGGGAAAAACGAAAACCGCGACGTTAAAAAGCAAAACGGCGATGGCTTCGGCACGTCATTGATATATGACTTTGGCGGCAGCGATTTCGCCATTAGTGGGGCCTATACCAACTCAGATCGCACCAACGAGCAGAACCTGCAAAGCCGTGGCACAGGCAAGCGAGCAGAAGCATGGGCAACAGGTCTGAAATACGATGCCAATAATATTTATCTGGCAACTTTTTATTCTGAAACACGCAAAATGACGCCAATAACTGGCGGCTTTGCCAATAAGACACAGAACTTTGAAGCGGTCGCTCAATACCAGTTTGACTTTGGTCTGCGTCCATCGCTGGGTTATGTCTTATCGAAAGGGAAAGATATTGAAGGTATCGGTGATGAAGATCTGGTCAATTATATCGATGTCGGTGCTACATATTATTTCAACAAAAATATGTCAGCGTTTGTAGATTATAAAATCAACCAACTGGATAGCGATAACAAATTGAATATTAATAATGATGATATTGTCGCGGTTGGCATGACCTATCAGTTTTAA
Protein sequences of DBSCAN-SWA_10 >LR134000|3563153:3585944|3580864_3582028_+|VDY70223.1|integrase|DBSCAN-SWA MSLFRRGEIWYASFTLPNGKRFKQSLGTKDKRQATELHDKLKAEAWRVSKLGEIPDITFEEACVRWLEEKAHKKSLDDDKSRIGFWLQHFAGMQLRDITESKIYSAMQKMTNRRHEENWKLRAEACRKKGKPVPEYTPKPASVATKATHLSFIKALLRAAEREWKMLDKAPIIKVPQPKNKRIRWLEPHEAQRLIDECPEPLKSVVEFALATGLRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVALNDTACRVLKKQIGNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWKAALRRAGIDDFRFHDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHARQIDSILNPSVPNLSQSKNKEGTNDV >LR134000|3563153:3585944|3569093_3569444_-|VDY70185.1|DBSCAN-SWA MPSRIPKACRVRGCRHTTTDPSGYCESHKSEGWKQYKPGQSRHQRGYGSKWDVIRVRVLKRDKGLCQLCLRAGVVREAKTVDHIIPKAHGGTDADSNLQSLCWPCHKAKTARERLK >LR134000|3563153:3585944|3575594_3576011_-|VDY70206.1|DBSCAN-SWA MTTDDIESYFGSIEKVAAFFGITTEAVYQWRNRPGQLIPKGRAAEAAYRTCGRFPPTNQEVATMLGYRSVNAAVEHLRALEKKSVITIKRGVARGITLHTAVKDDDSEAVGIIRALLAGEENARLRAAHWLHERGLKV >LR134000|3563153:3585944|3566018_3566870_+|VDY70175.1|DBSCAN-SWA MIRQKILQQLLEWIECNLEHPISIEDIAQKSGYSRRNIQLLFRNFMHVPLGEYIRKRRLCRAAILVRLTAKSMLDIALSLHFDSQQSFSREFKKLFGCSPREYRHRDYWDLANIFPSFLIRQQQKTECRLINFPETPIFGNSFKYDIEVSNKSPDEEVKLRRHHLARCMKNFKTDIYFVSTFEPSTKSVDLLTVETFAGTVCEYADMPKEWTTTRGLYASFRYEGNWENYPDWVRNIYLIELPARGLARVNGSDIERFYYNEDFVEKDGNDVVCEIFIPVRPV >LR134000|3563153:3585944|3567009_3567249_-|VDY70179.1|DBSCAN-SWA MWVDIEAPEGVTEKDYQFLPSKADHFSGGKITRTFSTSKPGVYTFTFNALTYGGYEMTPVKVTINAVAAETENGEEEMP >LR134000|3563153:3585944|3572846_3573041_-|VDY70198.1|DBSCAN-SWA MLKQLDMTETAKVVFNELNGKPATVGEIAQNTYLSRERCQLILTQLVMAGLADYQFGCYRRLQQ >LR134000|3563153:3585944|3584888_3585944_+|VDY70229.1|DBSCAN-SWA MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNDSKDGDQSYIRFGFKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNENRDVKKQNGDGFGTSLIYDFGGSDFAISGAYTNSDRTNEQNLQSRGTGKRAEAWATGLKYDANNIYLATFYSETRKMTPITGGFANKTQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDSDNKLNINNDDIVAVGMTYQF >LR134000|3563153:3585944|3570294_3570756_-|VDY70189.1|DBSCAN-SWA 
MSRVTAIISALVICIIVCLSWAVNHYRDNAITYKAQRDKNARELKLANSTITDMQMRQRDVAALDAKYTKELADAKAENDALRDDVAAGRRRLHIKAVCQSVREATTASGVDNAASPRLADTAERDYFTLRERLITMQKQLEGTQKYINEQCR >LR134000|3563153:3585944|3578063_3578426_+|VDY70213.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGR >LR134000|3563153:3585944|3568473_3568968_-|VDY70183.1|terminase|DBSCAN-SWA MAGTAGRSGRRPKPTARKALAGNPGKRALNKDEPVFTPIKGVEPPEWFAEEDLPLATIMWQLTTKELCGQGLLCVTDLAVLERWCVAYEFWRRAVKNIARQGNTITGAMGGMVKNPDLTAKKEQESEMSSTGAMLGLDPSSRQRLIGLAGQKKATNPFLKIIES >LR134000|3563153:3585944|3580332_3580638_+|VDY70221.1|DBSCAN-SWA MTTFTDKELIKEIKERISSLDVRDDIERRAYEIALLSLEVEPDEREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIVWQQRAGIHFSAMSQQEVK >LR134000|3563153:3585944|3578491_3579316_+|VDY70215.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAVRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >LR134000|3563153:3585944|3569969_3570263_+|VDY70187.1|DBSCAN-SWA MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQEKTVDAAKICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ >LR134000|3563153:3585944|3576982_3577189_+|VDY70210.1|DBSCAN-SWA MHKTEPKIVAPGYTDEEIYEWMTKKLAAINQLREVLSYRQETIDSLKKLDQEITVLSQDVTLDIVQTN >LR134000|3563153:3585944|3570752_3571229_-|VDY70191.1|DBSCAN-SWA MQVLNSQRKAFLDMVAWSEGTDNGRQPTRNHGYDVIVGGELFTDYSDHPRKLVTLNPKLKSTAAGRYQLLSRWWDAYRKQLGLKDFSPKSQNAVALQQIKERGALPMIDRGDIRQAIDRCSNIWASLPGAGYGQFEHKADSLIAKFKEAGGTVREIEV >LR134000|3563153:3585944|3582232_3583486_-|VDY70225.1|DBSCAN-SWA 
MLEQMGIAAKQASYKLAQLSSREKNRVLEKIADELEAQSEIILNANAQDVADARANGLSEAMLDRLALTPARLKGIADDVRQVCNLADPVGQVIDGGVLDSGLRLERRRVPLGVIGVIYEARPNVTVDVASLCLKTGNAVILRGGKETCRTNAATVAVIQDALKSCGLPAGAVQAIDNPDRALVSEMLRMDKYIDMLIPRGGAGLHKLCREQSTIPVITGGIGVCHIYVDESVEIAEALKVIVNAKTQRPSTCNTVETLLVNKNIADSFLPALSKQMAESGVTLHADAAALAQLQAGPAKVVAVKAEEYDDEFLSLDLNVKIVSDLDDAIAHIREHGTQHSDAILTRDMRNAQRFVNEVDSSAVYVNASTRFTDGGQFGLGAEVAVSTQKLHARGPMGLEALTTYKWIGIGDYTIRA >LR134000|3563153:3585944|3579443_3579980_+|VDY70217.1|DBSCAN-SWA MSFIKTFSGKHFYYDRINKDDIDINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQKFAFEALMHDATEAYCQDIPAPLKRLLPDYKQMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFNVIPLTSGHAYGMFMERFNELSELRKCA >LR134000|3563153:3585944|3576108_3576735_+|VDY70208.1|DBSCAN-SWA METVGQRIKALRRVTRTSQKELGKFCGVSDVAVGYWEKDINTPGGEALSKLAKFFNTSIDYILYGAEFEGKLVTNMRRVPVISWVQAGQFTECRAAEVFSEVDKWVDTSLKVGDNSFALEVKGDSMTNPNGLPTIPEGATVIVDPDAEPRHGKIVIARLDGTNEATVKKLVIDGPQKFLVPLNPRYPNIPINGNCLIIGVVKGVQYEL >LR134000|3563153:3585944|3563153_3564479_+|VDY70171.1|DBSCAN-SWA MNKYQAVIIGFGKAGKTLAVTLAKAGWRVALIEQSNAMYGGTCINIGCIPTKTLVHDAQQHTDFVRAIQRKNEVVNFLRNKNFHNLADMPNIDVIDGQAEFINNHSLRVHRPEGNLEIHGEKIFINTGAQTVVPPIPGITTTPGVYDSTGLLNLKELPGHLGILGGGYIGVEFASMFANFGSKVTILEAASLFLPREDRDIADNIATILRDQGVDIILNAHVERISHHENQVQVHSEHAQLAVDALLIASGRQPATASLHPENAGIAVNERGAIVVDKRLHTTADNIWAMGDVTGGLQFTYISLDDYRIVRDELLGEGKRSTDDRKNVPYSVFMTPPLSRVGMTEEQARESGADIQVVTLPVAAIPRARVMNDTRGVLKAIVDNKTQRMLGASLLCVDSHEMINIVKMVMDAGLPYSILRDQIFTHPSMSESLNDLFSLVK >LR134000|3563153:3585944|3573615_3574713_+|VDY70200.1|DBSCAN-SWA MKKLTVAISAVAASVLLAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNEADDGDTTYVRLGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYGVAYDIGAWTDVLPEFGGDTWTQTDVFMTQRATGVATYRNNDFFGLVDGLNFAAQYQGKNDRNDFENYTEGNGDGFGFSATYEYEGFGIGATYAKSDRTDTQVNAGKVLPEVFASGKNAEVWAAGLKYDANNIYLATTYSETQNMTVFADHFIANKAQNFEAVAQYQFDFGLRPSVAYLHSKGKDLGVWGDQDLVEYVDVGATYYFNKNMSTFVDYKINLLDKNDFTKALGVSTDNIVAVGLVYQF >LR134000|3563153:3585944|3571232_3571568_-|VDY70193.1|DBSCAN-SWA 
MPHNPNTWPDWLELFQSWWRGDTPLGAVIMSIVMAGLRIAYFGGGGGWKRKTLEILLCGALTLTFASALEYVGWPKSLSVAIGGGVGLIGVDAIRGAAMRVIGNKFGSSKE >LR134000|3563153:3585944|3575301_3575598_-|VDY70204.1|DBSCAN-SWA MKLILPFPPSVNTYWRHPNKGAFAGKSLISAAGRKFQSAACAAIVEQLRRLPKPTSAPASVEIVLFPPDTAAFEEKYGSQLELIFRFIDRALATGVLA >LR134000|3563153:3585944|3583497_3584601_-|VDY70227.1|DBSCAN-SWA MSDSQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQLHAAGHRIVIVTSGAIAAGREHLGYPELPATIASKQLLAAVGQSRLIQLWEQLFSIYGIHVGQMLLTRADMEDRERFLNARDTLRALLDNNIVPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLTDQKGLYTADPRSNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMSTKLQAADVACRAGIDTIIAAGSKPGVIGDVMEGISVGTLFHAQATPLENRKRWIFGAPPAGEITVDEGATAAILERGSSLLPKGIKSVTGNFSRGEVIRICNLEGRDIAHGVSRYNSDALRRIAGHHSQEIDAILGYEYGPVAVHRDDMITR >LR134000|3563153:3585944|3564835_3565429_+|VDY70173.1|DBSCAN-SWA MEKYLHLLSRGDKIGLTLIRLSIAIVFMWIGLLKFVPYEADSITPFVANSPLMSFFYEHPEDYKQYLTHEGEYKPEARAWQSANNTYGFSNGLGVVEVIIALLVLANPVNRWLGLLGGLMAFTTPLVTLSFLITTPEAWVPALGDAHHGFPYLSGAGRLVLKDTLMLAGAVMIMADSAREILKQRSNESSSTLKTEY >LR134000|3563153:3585944|3571644_3572697_-|VDY70196.1|DBSCAN-SWA MKNTVKINSVDLINADCLHFIQSLPDDSIDLIVTDPPYFKVKPNGWDNQWKGDEDYLKWLDHCLAQFWRVLKPAGSLYLFCGHRLASDTEIMMRERFNVLNHIIWAKPSGRWNGCNKESLRAYFPATERVLFAEHYQGPYRGKSDGYAAKERELKQHIMAPLISYFRDARAELGITAKQIAEATGKKNMVSHWFGASQWQLPNEADYRKLQALFSRIAAEKFQEQQLEQPHHQLVASYDSLNRKYSELLDEFKTLRRYFSVSVSVPYTDVWTHKPVQFYPGKHPCEKPADMLRQIISASSRQGDLVADFFMGSGSTIKAAMALGRRALGVELESERFNQTVKEVSELVGK >LR134000|3563153:3585944|3567232_3568477_-|VDY70181.1|terminase|DBSCAN-SWA MSRKSYPNVNAANQYARDVVRGKIVACQFVIQACQRHLDDLMAEKSKSFRYRFDKDLAERAAKFIQLLPHTKGEWAFKRMPITLEPWQLFVICCAFGWVNKGSRLRRFREVYTEIPRKNGKSAISAGVALYCFACDNEFGAEVYSGATTEKQAWEVFRPARLMCKRTPMLTEAFGIEVNASNMNRPEDGARFEPLIGNPGDGSSPHCAVVDEYHEHATDALYTTMLTGMGARRQPLMWAITTAGYNIEGPCYDKRREVIEMLNGSVPNDELFGIIYTVDEGDDWTDPQVLEKANPNIGVSVYREFLLSQQQRAKNNARLANVFKTKHLNIWVSARSAYFNLVSWQSCEDKSLSLEQFEGQPCILAFDLARKLDMNSMARLYTREIDGKTHYYSVAPRYPDGGSQSRRRKYVGRH >LR134000|3563153:3585944|3574900_3575284_-|VDY70202.1|DBSCAN-SWA 
MRDIQMVLERWGAWAANNREDVTWSSIAAGFKGLLPSKVKSRPQCCDDDAMIVCGCMARLKKNNSDLHDLLVDYYVGGMTFMSLASKHCCSDGYIGKKLQKAEGIIEGMLMALDIRLEMDIVVTKSN >LR134000|3563153:3585944|3577160_3577595_-|VDY70212.1|DBSCAN-SWA MSGNIYTLYKSHCENVGKYRGIEISGVVSSVEISKVESRATLLTLLDLVLHEHRKKFGTPYNQLNGKKALVHLILMKHHWMPKQINEMKFDELLLSIQDELTLDKISVTAQKFLDYRDWRSQIHHFDDFDENEWDPNLSAQYLK >LR134000|3563153:3585944|3579970_3580333_+|VDY70219.1|DBSCAN-SWA MRMNVFEMEGFLRGRCVPRDLKVNETDAEYLVRKFDALEAKCAAQENKVIPVSTELPPANESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA >LR134000|3563153:3585944|3566826_3566985_+|VDY70177.1|DBSCAN-SWA MMLFAKFLFPFVRFSWSLSLIEFYPCLILFNQYQARYSVNNGVEYFNTIILY |
30 | Shigella_phage(52.0%) | terminase,integrase | attL 3573953:3573969|attR 3585217:3585233 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
DBSCAN-SWA_11 |
3999404 : 4020572
Sequences of DBSCAN-SWA_11
Nucleotide sequences of DBSCAN-SWA_11 >LR134000|3999404:4020572|DBSCAN-SWA TTCACTGATCTCCTTCGGTCAACGGAGATTGTATTTTCCGCATTGATTCTCCTTTAAGTTCGATCTTGATACTGCCATGAACCAGTCGATCTAGGATGGCATCCGCATGTGTGGAGTCTCCGATCATTTTGTACCAGTTTTCCACCGGCAACTGGCTCACTACGATGATTGAGCCTCGTTGGTACATCAGATCCACTATTTCCAGCAGGTCGCTACGCTGTTCTGATGAGAGAGGTTCCAGCCCCCAGTCATCCAGAAGCAGCAGATCGCTATTATTCAGCCTGGTCAAAAGTTTGCTGTATCTTCCATCAGCATGCCCCTGATAGCACTGTTCCATCAGCGCTTTAAGGCGATAATAGTAGACCTTGTATCCCTGTCGGCAGGCATTATGACCAAGTGCACATGCCAGGAACGTTTTACCGCTGCCGGTGGCCCCGGTCAGTAAAATATTTTGTTTCAGGGTTAACCAGTTTCCCTGACTGAGTGAACGGATGAGGGCCCTGTCCAGCCCTCTATTGTTACGATAATCCAGCTTTGATAACTCAGCATTAAGTCTGAACCGTGCATGTTTGATCAGACGCTCTGCCTTCCTGTTTTCACGGCAGGTTAGTTCTTCTGCTGTCAGTAATGACAGGCGTTCTTCGAAGCCCAGCTCCTGGTATGTGCCCGGTTGAGCAAGTTGCTTTTTAAGCGCATCACGGAAGCCGGTGAGTTTAAGTGCGGTCAGTTGTTCGTAAAGATGATTCATCATTGGATCCCGTATCAGTGGTAATACTCACTGCCGCGTACGTTTTCGTGTTCCATCGTGGATAACAGATCTGGTTTTGGATCCTGAACAGGTTGTTTATCCAGACCTTTTTCCAGGATCGATTTAATACCTGACAGACGCCATACCTTTGTTTTCAGAGCTCTTGCACATGCTGCATTAAGTCTGGCTTTACTGTATTTTTTATGAAGGTTCAGGAGTCCAAGGCAGAAGCGATAGCTTTGTTCCGGATGTGGACGAGAGTTCAGTATATGAAGCACATAACTATGAGTTTCACTGCCTATGTGCCCCGCCCATTCCAGAAGACGCTCTGGCGTCCAGGTGGCATGCTGTCGATGAGCTTCAGGCATGTGCTCGTTGCGGGTACTGTAGCCATAAGTACGCTTGCGCGGGTGCACAGCAACCTCCTGCCCCTGATTGAAGAGTCTTACCAGTTCTCCGGAGATCCATGCTTCCAGTTGCTGGCCTAACAGCGAACATGGAACCGAGTAGTAATGTTTGTCGATTTCCACGTGGTAATCGGCATGAACTCTGACTTTCTTCACCAGGGTGTAACTGTAACTGGCTTCAGGAAGAGGCTTCAGTGCGGGTTTATCAAGCTGGATGAAGAGTTCTGCACGTGAATAACCCAACTTCTGCATTATTTTGTTATTCAGTCTTTCCAGCAACTCCCGAATGCGCTGATTAAGCGATGCAAGGCTGTAGAAGATCTCATGCCTGATTCGGGCCATGATCCAGCGTTCAACAACCTGAACGCCAACTTCAGCTTTGGCTTTATCTTTCGGTTTACGTGGCCGCGCAGGCAAAACTGCGACATTATAATGCTCAAGCATCTGCTGGTAGGTAGGGTTAACGTCAGGATCATACTTACATGCCCTGGATGTGGCGCTTTTCAGATTGTCCGGAACAACAAGTTCAGGAACGCCACCCAACCACTGGAAGCAGCGAACATGACTCATCACCCAGTCTTCAAGCTGCTGAGACCAGGTGGCCTCTGCCCATGTGTAACTTGATGCCCCGAGAACAGCTACGATGACCTGAGCAGTTCTTATTTCTCCGGTCTCAGGGTCGGTAACGCCAACGGTAGGTCCACAGTAATCAACGAAAAGTTTTTCGCCAGCTTTATGTACCTGACGCATTGATGGTGAAGTGGTTTTGA
GCCATTCACGGTACATCCGGCAGTAATGGTTATAGCTGTAAAAACCGCCTGGATTACGCTCACAGTATTCTTCCCAGAGTAGCTGCAGCGTCACGCATTTATTACGCAGTTCCCGGTGTACTGTAGCCCAGTCAGGCAGAGAGTGCTTCTTCATCTTAACCTGGGTCTGAAGGAACGCATGTTTTAGTTTTGTATCATCCCATCCTGTAGGTAAGGGCCACTGCTTTATGCCAAGTTGAGCCGCCCGATTAGCATATCTGGATACAACGGAAGGGGAGATTGCAAGACTACGACCAATTTGTCGATGGCTGAGTCCAACACCGTATTTAAGCCTAAGAATTTCTTTAAGTTTTCTCATAGAAATTGGAACTGTTGGCATAGGTATCCTTTACCGGAATGGCAAAAGATACAGATCAACACACCTGTGAAGTTCCAATAACATTGATGGAGATCACTGAATAACAAAATGAGTCAAAAGTGATCTCCATCGATGTTATTCAGCGATCTGTTCAAATGTTATTACCCGATCTCGATGGAAGTTATTGAGTGATCTCCTTTCATGAAAATACGCACCTGGCTTTAACTGTTCCTGATATAAGTTGACAGTTTTCTGTCCGTAAAACGGTTGATGAACTTCATCGGGCGTTTTGTATTTCAATGCCAAATGTGACCGCTCATGGTTATAAATTGCCACGAATTCTTTTACCATTTACCGGCCCTACGTCAGCTCTGCAGGACGCGAGAGTAAAAACTCATTTTCAGGATTCCGTTTATCCACTCAGCCTGCGCATTCTGGTGGCAGTTATAACCATTAGTCATTGAACAGGTTATTGATAAATAATTAAAAAACTCAGTTTTCTGGCTCGCCTGCTCAAGCTGTTCTTCAAGTTCCCTGATTCTTTGCTCGGGTGTCAGCGGGACAGTCGCCAGAGCCACAGGCAGTCTCCTCTTTACCTGGTCAGGAAGTCCGGGGCTCCAGTCAAGCCGGCCATATTTACGCAACCAGACAAGAACGGTGGAGCGCCCCTGAATGCCATAACTGCAGCCGGCCTGTTTATAAGTCATCTCGCCTTTTTCAACCTTCTCAACAACGGCTATTTTAAAGGATAGAGGATAATCACGTTGAGTGCGTTTGAACCCAGACATCATCACGTTCTCCGGGTTTAAGTCAGGAACGTGTCATAGCTATTCAGGACGGGTCACCGACACAAAAAAAGTCGCCCTTGAGCGACTCGATTTGCATACGGTATGGTGCGAAGGCCGGACTCAAACATGCGATTTAACCTCATGATTTAAAATTGTTAATAATAAACAACACCATCCTATACCAACACAGATACCAACACGAAAAATCACTGGTTTTCGGCATCGAACCAGGACAATCAGGTCCTGATTGAGATACCGAGCTCTGTTACTCGTCATCTTCCTCGACAGGCTCCATTGCATCAAGTGGTACTCGCACCATCGCGATAGAACCAATCCGGATAAAAGCATCTTCCCCATCTATATCGGTTAACTGATATCGCTTACTTGATGCCGGATAATTGTGTCGATTGGCAACAGTGTACTCTTTCGTACAGCCAGAATGTGGCAGACTGAGGTGGAATAGATAACGTCCGTATGCCCGCTGACCACCTCCGGGCGGGAGAGTGTGGTATCTGACATCATCATTTTTCCTTTCTGTTTATAAATGAAAATGCCAGCCGTGTTCAGGCTGACGTCAGGGAAGTGAAATCGGGTGAGTGATCTTCACTGGTTCTGGTACAAAAGTTACTGTTGGCGCAGGGTACGGATACCCTCCCTGGCCTGTGCGATACAGGGCAACAGTGCTGCCGAATCTGTTTTATCCTCATCGTTGTCGAAGATAATTCCCGATTCGCAGTCGATATTGTCCTGCAGCCACGTAATCAGAATATCCAGCGCTGTTTCCGTGGTTAATGATTTCATGTTGTGAATTTCCGGATTACCAGTCGAAAGTGGGTAAACCCGGCAGATATCCG
GCACTGGCATCCAGATGAATGAGACTGACACCATAACACCGGATGAGTGTGACGACCAGACGACGGAACGTTTTTGACAGCCCCAGGCGCTTAAGCCGCAGAACCGGGTATGACCACGTATCGGTACGTAACAGATAACCGGTACCGGTAAAATGAATCCATTCTGATTCACCGAAGTCACTGGTCTGGTGTGACAGCGAGTACAGCCAGGCGTTGTCCGTTTCCGTGATATGTGCGGTACTGCAGCGTATGCCGGTAAGGGCAACAAATGGTGGCGTGCAGTCTGCTGAGGTCCGGTCAGATTCATCAACAACACGAAGGGAGTAGCCGTTATCGCAGACCATGTTAATCAGTTCAGCGAGATTAACACCATCAATGTCAACGACAATGCGCCCCATGTTCAGCGCCTGCACGTTAACGCTGTCGGCTTCCAGCGTCAGGGCCAGTTTCATCGTTTCGCCTCCGGATGCTTCCCCAGGGTAATGTTATTTACCGTTCTGTAATTGTCGCGGATCATCAGGCCTGTTGCCCGGCGAGCCCGGAGGATATCGATGCTGTTTATTAACTGAGAGTGGGTACAGGCGCTGAATCCCGGCAGGTCGGTACGTACCAGTGCGTATTTTTCCACGAGAAAGTTCACTGCATCACACAGTGAAATGCCTGCCTCAATATGCTGCTCAATCACACGTTCATCAGCGAACGGTGTGTCATTCAGCGTAAGGCCGTAGTGCTGGTCCAGCAGTCGGGTGAGCAGTGTCTGCCAGATGGTGACGGGAGACGGGCAGCGCGATACCTTCCGTACGTGTGTATCAGGTAATGTTTTCATACTGAAGATTTTCCTGATATGCAGAGATAAAAATGGAAAAGTGGCGTGGTGAAAACACCAGACCGGAGCAGAAGGTTATTCTGGAGAGTTACTTTTTCGTTTCGGATGTTGGATAAAAAGCCAGATAAACGTAACCACAACTGCCGAGGGTGTCGGCTTCGCAGGTCAGTCCTTTCGCATACAGCGTGACGGTATGCTGATGGCGGGGATTCAGTTCACCGCTGGTGAGCATTAATTCCAGTTGTTTCATCAGCAGCGGAAAGGCCTGGTCCGGGTGGTACGCATCCGCATCTCTGAACCGGCCTCTGATACCGGCACGGTCGGCAAGGTAATGCAACCGGTTACCCTCCTGCACCAGACGGGCGCCGAAACAGGGCGTCACTGTGCAGGGTAGTCCCCACCAGGGATGGTCGTGATTGTCATCGGGATGCGTTGTCCCGGAGAGTGTGTCTGACACGATAAAATCCTCACAGAAAATCGGTGAATGATAGTTTAACGATGACGGGTCAGCCGGCGCGCAGACCGGCTTCGGTAAAGGTGTATTCGTTGATGAGCAGGACCTCATCCACAGCGACGTCTGAGCTCAACCAGTCGTACTCGTTTTCCGGGTGACGGAACATCAGTTCAGGGTGACGCTCATCAGCCGGACATACGGACCAAAACTGTCCTTACGGCGTTCGGCAAACACAGCCAGCACGCCGGGAATATCCTGCACTTCACGACCGGTATACGCTTCAGCACTGCCGTGCCAGCGGTATTTACCGGTACAGAACGGAAAAAGACGGGATGTTGGATGTTGTTGGTGAATACGCATGGCTTCACCACGGGTGATAATTTTCATAATGGGATACCTCTGAAGACAGAAGATAAAAGTGAAAACAGGTGTGATGTGGTGACAACGGGTTAAAGCAGACCGTGTTCCGCAAAGGAGAAAACCTGACTGCCACCGACTATCAGATGGTCCGGCACCCGGATATCCACCAGGCCCAGTGCCTGTACCAGACGTTCCGTGATAAGGCGGTCTGCCTTACTGGGGGTGACTTCACCGGACGGGTGATTGTGTGCCAGCACCACGGCAGCGGCATTGTGGTGCAAGGCGCGTTTAATCACTTCCCGGGGATGGACTTCCGTGCGGTTGATGGTGCCGGTGAAGAGGGTTTCACCGGCAATCAGC
TGATTCTGGTTGTTCAGATACAGCACCCGGAACTCTTCACGCTCCAGTCCCGCCATGTTCAGAAGCAGCCATTCCCGTGCCGCACGGGTGGAGGTGAAGGCCACGCCGGGTTCATGAAGATGGCGGTCCAGAGTTTTCAGGGCCCGCTGAATGAGGCTGCGCTCGCCGGGCGTCATCTCTCCGGGCAGAAAGGAAAGCTGCTGCATTGTCCCTGCCTCCATTCAGTCGATGATGCGCATAATGGCGCTGCATTCCGGATGCTGCAGGGCATAATCCCGCAGCCGGTAATAATGAACCGTCATGGCATAATTCTCTGTACGACAGGCGTGATGGCTGTACGCCATCAGACAGGCGGCAATGCCGGCGGCTTCCGGGCTCATTTCAGCGCGGTTACCGTTCATGACGTTAAACAGCACCCATGTTTCGTCATCCGGTTCGGGTGCCATAAATGCGCCACCGTTGTTCAGGGTGTACAGATTCCAGATACCACCGCAGTAGTCTTCACACAGACGGTCCATCCAGCCGAAGACACGGGGCTCCAGGGTCACCCACTGTGGAATGAGGCCAAAATGCTGCGGCCAGAAGCTGATGCGCTGTTCATCAGGGACGAGGGTGGCAACCAGTTGTTGCAGGTCAGTCTCCGGTGCACAGGCCGAAGAGGTGGTGTTCTGAGACAATGTTTTCATGTATGAAATCCTTGTAATGTGAATAAATGAGAGGGAGTAAAGGTCGGGAGCTGGCAACATCACGTTCATCACTGACCGTGATGACTCGGTAATATCAGGGATGAATGATATTCGGCTGAAATTCCGGACAGCCGTACGCGTAACCGTTCCGGTCACAGACGGATGGCCAGATGCCATTGCCTGTATACTCAGTCTGCACAGACTGATTTCTCCGGGACGGTAAAGCAGACGGGTACACCAGGCGCACAGGTCATCATCAGAAATGGAACGACACCCCGGCAGCGCCTCCGGGGCGTCGGGTGATTCTGGTGCCTGAATCACGCCTTCAGGGACATGCTGCATCAGCGCAGGCTCTCCAGCAGCGTTTCTGCCATTACCCACAATGCGCGGTTGAGCTTAATATCGGTGTCGATGTTGTGAATGGCACGGGTGTGGATACGTTTTCCTTTTGCACTGCGGCCGGAAATTCCGCCTTTCAGCATATTCTCCTGGATGGTCTGATATGCGCTCCACAGGTCTTTACCGTAATCCTCCCGGCGTCGTGGTGTCAGAATGTCAGCGGTGGTGACTGGCTGATGTTCGTCACCATAACGGTAAGTCAGTGCCGCCTGTGCCAGCGCCTGGCGTGTCGGTGGCGGCAGGACCAGCGACTGCATGGCATCACGCTTTTCCTCAATCCGGTCAAACACGCCCACCACCTCGTAAGCCCCTTCAATGACTCTGTCCACCACATTTCCCCGATGCGGAACACGCACTTCCCCCGGAGACTGACCACAGACGCATCCGTTCTGACAGACGAACCTGAAGTAACCCGGCAGCATCTGGTAGCTGGAGGTACCGTCATGGGAGTTGAGCAGAATGATTTCAGGGACATGTTCTCCGTTTATCTCTCCGGCCCGCCGCAGACGCAGCATGTGTTTTGTGTATCCCTGGCGGCCCGGGTCGCGCACACGGGTCTGGCAGGCGAAGAATGGCTGAAAGCCTTCCCGCTGCAGGCTTTCCAGTACGGTGATGGTGGGAATGTACGCATATCGTTTACTGCGGGAAGTATGCCGGTCTTCACCGAAAATACTGGGAACATAGTGCATCAGTTCTTCGTGTGTCAGCGGACGGTCACGGCGTATCTGGTTCGTATAACCAAAACGACTGGCTAGTCGCATAATTTGCTCCTTATCGGTGGTTAAGATTTACTGGTGTAATAAATGAAAAAGCCACGTCTCCCGGAGAAGACGCGGCCTGACAGATGAAATGAATGACGTTTATTGTCTGAGAAGCCCTTAACTGGCGAGCTGAGTATTAAGCTGTGTTCCGGCA
TCACCAGCGCAACTGACCTTCAGCATTACGGATAACCAGCCGGGAATATGTTCCCTGGTCATCTTCAGTAAACACATTGCGGTAAGCTGTTTTGACGGCAACAGCCTGTTCGCGTGAGAAAGGGCCTTCGGGCAATAAACGTGATGTACAACCCGGCATATCTGTTGCTCCCTGAAAGTAAAAAGCCCCGGTCATGATGACCGGGGCCTGAAGGAGAGTGACCTGATTATCAGAAAGTCACATTCAGCGTGGCCTGACCGTTATAACCTTCAGCGCTGCTGCCGCTGACGCTGTGGGCATAACCGCCCTGAACGCCCAGGGTGATATTTTCCCGGACACGGGCTTCCAGTCCGGCCTGCAGGTCCAGTGACGTGCCATTCCGGGACGGTGAGAACGTCATGTTACTGCCGGCGGCGGCTGTACCCATGCTCATGTCACCCCGGGAGCTGAAGGTGCGGATAACAGAAGGCTGTACCCACCAGTTCACCGGCAGTTCACGCACACGGTGTTTTGCACTGTCGCGCAGGGTGTCACGGGATGAGGTGCCTTCACCAAAGCTCATATCGTTGTGGCTGCCCAGACGGAAACCGGCACGCACATGTTGTGCACTGCCATGCCCAAACTTCACATAACCGGCGTTATCCTGGCCGTCATCCAGGGAAAGTCCCTGCCAGGTATACTGCAGTTGTGGCTCCAGCATCAGGTTGTCAGTGATACTGAAGGGCAGACCGGTTTCCAGTGAGCCCAGCCAGCCCCAGCCCCGGGCGCGGAAGTCGTTATTGTCCGATGACGCTTTCATGCTGTGGCGGGTTCCCTGTGCCACAATGTCAGCCCACAGGCCGGAGGACGTGTGTGTCAGATTCAGGTATCCGCCCAGGCTGCCGGCATCATCCCGGACCGTGCCGGCGCGGGAACCGTCATCATCCTTAACATCAACGGAAGAATGGCCTGCGGCACCATACACCCCTGTCGTCAGAGACATACCGGCAACCTCTGTTCTGAGCAGGTCACCCTCCAGACGGACGAAGCCATAGCTGCCGCTGCTTTCCGGCGTGGCTCCACGGGCAATACCGCCGTTGTTATCGTGACCGAGATGACCGCCCTGAATGCTGAGACGGACGCTGTTATTTTCACCGTTTACACCGGTCTGATGGCTGCGGGAGCCTGCCAGAATCCGGTCATAGTCCATTGCCTGTGTCAGCATGGATGTATACAGGGGGACTTCAGCACGATAAGCATTTTCACTGCGCAGGTACCAGTCTTCATCGCTGTCACGGTTCAGGGTGTAGTTAAAGGCGCCGGCCTGAAGCGGGCGACTCAGGGCAAACGCACCTTCTTCTGTGGTAGCGCCATTCTGTGCATCCACAACCCGGATACCCTGTCCGGTGGTTGCCACCCCGAGGTTACTGTTTCCGACATTTGTAAACGCAAGCCAGGTTTTGCCGGTTGCCTGACCACCATTAATCACCAGCTGGTCAGAGGCATTGCTGCCATCAAGGCGAACACGCATATTGATAGTGCCACCCTGACCGGTGAGGTTACTGGTTGTCAGTTTATGGAACGTTACCGGGGCGTTGCCTTTTGCAACAGCACGGCTCAGTACGGCATCCGGTGTTGTCTGATTGTCAAAGTAAATCTCCCCTGCGTTAACAATATCGCCGGTACTGACCACATCACCATTCAGTACCATGCTGGCGCCTTTACTGAGCTGTGTTCTGCCACTCAGACTTCCCCCGGCAGCCAGCATCAGTGTTCCACGATGTCCGACAGTGGTGTTGCTGGCCTGTCCCCCGGCATTAACCGTAAAGCTGCCGCCATTTTCCAGCAGCAGGCCGCTGGCCTGACCGGATATGCCATCAATACTGAACTTACCGCTGGCATTGGTCCCCTCAACAGTGGCACCACTGTCTGCAATCAGGGCACCACCGGATGTCATGGTGACACCTGTTGCCTTACCACCGGCAGACACTGCCAGGGTTCCGCCGTCATCCACCCGTGTT
TTCTGCGCTGAATGGCCTTCCAGTACATCCAGGCGACCGCCGGACTCCAGAACAACGCCGTCAGCCTTACCGTTTTCCACCGTGAAATTGCCCAGACGGTTGCTGCCAAGAACCGTTGCCGCAGTACTGGTGACCAGTGCTCCACCCTGTTTCAGGGTGACATTCGTGGCTGTACCACCGGCGTTCACCTGCAGTTTGCCTTTCTGGTTAACGGTGGTGAAATCCGCCAGACCACCTTCCTTGACAATCTGCCAGCCGTCACTGTTTACAACCGTGTCAGAGGCTGTGCCTCCGTTGTGCACATACTGGTAACCGCCATTCAGTGTGGTATCCAGCGCATGCCCGTGTACCGTCTGGTCGCCGCCGGCATAAACCACAGTGGTATTTGCTGTTCCTTCAACAGCCACAATCTGACGACCATTTTTATTGATGGTGGTACGTGCGGCATTTCCGCGAACAAACTGCCCGGTATCACCATTTTCTGCATCCGGCCCCCCTTCCGCGCCGGTATTCACAACCGTGTCGGTTGCCACTGCGCCGGATTTGACGGCCTGCCAGCCCTTCTCATTAATGACGGTACCCGTTGCAATCCCGCCTTCATGTACCCACTGCTCACCGCCGTTCAGAGTGGTATTCACTGCCTGCCCCTGAAGGCTCTGTCCGCCTCCGGCACTGATAACCGTGTCTGAAACGCTTCCTCCGGCATTCACTCTCTGAAGACCACCACCGGTGACAGTAGTGTTGTTGGCGATACCGCCATTTTGTATCCATTGTCCGCCGGTATTGGCCTCGTTATCCGGCCCATACTCCAGACCGGTACTGATGGTCATTCCGTTGGCCGTACCGAAGACAATCTGGTTGTCATGATTTTCCAGTGTTCCACCGTTCACGGTTTCTCCCGCCTGGACAACCGTGTCAGCAGCCAGTGCCGGGAGTGACGTGACAGCAGCAAGAGACAGAGCAACTGCCACACCGGTGCGTTTTCCCCGTGAGCGCGCCAGTTCGGAAGCCACCACCAGGGTGCCCGTAATGTGATTCCATACCAGCCTGTAGCTGGTGTTCAGATGTCGTTTCATCAGCTTTTCCTTACAGAGGGTGAATAAAAAAGCCGGTACCCACAAATGTGGATACCGGCAGGTACAAAACAGCATGGTCAGATAAAGTGGTTTTTCCCGGACATAAATGCTTCCCCTCTCTCATCCGCCAGAGAAGTATCAGTCCGTTGTGTGCAATTCCGTTCTGCATGATGAAGAGATTCAGCTGCATTTTCGCAGACCATTAACCACTATAAACGATCGATAGTAACGATCGATACCTGTTTTATCGATCGTTTTATTCTATTAACAGCAAGGCAGTCACGCAAGAGAAAGAGGTGGGTCTGAGGTGTTTGTGTCATGTTTTCAGTTGGTTAACGCAATGTAATCAGGCATTTCCCCTGGCAGAGTGTTCAGAAGAAAATCCAGTTCCATACAGCGCGGGCAACAGAGACCACCGAGTCACGGACGGCACGCAGAACCGCGCGTGCAACAGAGGCAATACAGACGCTCTCCGCAGTGTCAAATATCCGGTCCACCGCACCGGTAAACTGTTCACGGGCCTGAGACCGGACAGATTCCGTGCGCAGCTCGTCCTGCAGGCGGGTCATCAGGGGACTGGCGGCATGGTCGGGAAGCGCTGTCATGAGCGCACTGACCAGCGTATCCAGTTCCCAGCCGGTGCAGGCCGATACGGCCACAACCGGATGTACGGGCCGGAACAGCCGGAATACCGCATCTGTTTTCTCACGAATGTTCTGTGCCTGTGCGGGAGAAGGCTGAATGCCGGCCATATCCCATTCATGGCAGGGCTCCGTTTTGTCGGCCTGCGTCACCACAAACAGCACCTGCTGATGTCCCCGGTGCAGGATGTGTCGCCAGAAATACTCATCCACAGACAGGGCACGGTCATCGGCTTTAATCAGCCACAGTACCAGGTCAAGTTCAGGCAGAATGTCACGGTAC
AGGGCTTCATACTCTGCATCCCTGTCCCGGCTCTCGCCCACCCCGGGCAGGTCAGTGATAATCATGCTGTGACCGTGGCCACTCAGACGGAAGCGCCGCACTTCCCGGGTGCCGGCGTGAACATCACTGACCGGGGTGACCTCCCCCTGAAACAGTGCATTACAGAGTGAGGATTTACCGGCCCCGCTTTTACCCATAATGCCAATCACGGGTTCGTGACTGGTGAGTTTGCGCAGATGTTCCAGGATGTGACGGGAAAGTGAGTAAGGCAGGGAGGAGAGCGGTTTTTCAATTGCCTCAATGGCATCAGACGGATTCATACATGTATTCCCGGTGAAAAAAGACAAAACCCCCGCACACCGGAAAGGTGGCGGGGGACTCACGGAGATAATATATGTCTGAAATTATCTGGAATGTCGGTACCTTTTATACTCTTCAACAATCAGGTCATGCAGCATTTCGCCGGTACTGAGTCTTCTGACCCTCGATAATTCTTTGAGCTGCGCCCTGGTTTCTGCGTCAAGAAAAAACTGTACATTGACTTTATCTTCCTGCATTTTTTTGTACTTGCGCTGCGCCTTGGCCTTATTGAAACTGTTTCTGAAATTTTTTATATCGTCAGGTGAGGTGAGTGGACAGGTAACCCATATGTCAAATGAAAGGTGAATTGCGTGGTAAATATCAGCAGAACCAGGAAAACTGGCCAGTTTTTCCAGTGCAATATTTTTTCCCTTCAAATAATTCCATGTCCATATGCATTCATGTTCGTTCTCCGGGAGGAGGTAACTGAAAGGACTCTTTGATTTAAATGCCCGCATCCATCTGTCTTTGGTCTGGATTATCCACTCATCCCGCAAATTCTTATCAGAAAGAATATCGAGAACTCTGAGTATTTGTCTGAACTGTGTCGATGAGTCCGACGGGAAGGCATTCCCTTCCAGATACTCTTCTTTAAGATACAGTTTATTCTTTTGAAGCAAGTCTGCGCTTATTTTTGCCCTGTTATTCTGGATAAACTGCATCAACAGATACCAGGAAAATAAGCATGCGCGTTCACTGCCCGCGATCCAGGATATTTTTTCCAGGGGGACCAGGAAGGCCAGGGCCTTCAGACGCATACCAGAAATTATTTGTTCAAGTTCAAAATCTGAAAGTGTGATGTTCAGATTTTCTCTGTTTTTTTTATGTAAGCGATAAAACGTTCCGGGGGATCATAAACATTAAGGGGCAGTACGGAGATATTACGCTCTCCAAGATAATAAGAGTAATACGCATTTGTTCTTTCATTCATAACCTTATCCGACAAAATTTTCTGAACCATCCTCTTTCTCCGGTTGCTTAAAAATTAAACATAATAAGGTGATAATAACATGATAAAAAGAAGTTTATAATAGTATATTAAGGTGGTCAATAAAAAGGCAATCTGCGCGCTGTACAGTCATTCTGACAGGAAATGCAGGAGGAAGGGAAGGTAACCGGCTTCGTTATGCTGACAGGAATACCTGTTATCATGGAAGTGCACTCATGTGGCCGACGTAGCTTAAGAGGATGGAAGAGTGGAGACTGCAGACAGGGGGTATTGAAAGGTAAACATGTTTCGCCCGCCAGCTGCACCGGGGGCCAGCATCACTTTTTTTACGTTGTAACCCGGTATGCATTCCGGCTTTGATGATTACAGTCAGAGGGCAGCAGACTGACGTTATTTAACCGGACAGAAAAATTATGGACACTGAAATATCGTCTGTTAATACCTCGTTTGTCGGTGTGGTATTCCCTGTATTACCGGCGTCACCGATAAAGAGAGTTATAAACGCCAGAAAGGTACCTCCTTTATGCCGGGGAACACTTACGGAAAATGAAGTCTGAAAAGGAGCACAATGAAAAGAGGCGTGTCCCGTAAATATAACGAATCATCGCAGGATATGCATTATCAGGTGATACCAGAGGGCATTAAACTGTCGCCACGGATAAGAATATACAACATTTCTGATAACAAAACA
TCATAACGTGTTTCAGATATGTTCCGGCATCAGTCTGACGGATATTATTTCCCGGGAAAACAAACCTGCTACTGTGACGGTTAACTGTATCTGCATAAATTATTTATGATGAAAGAACAGTATTACGGATTCAAATATCATCCTGTTTGCATTCAGGCGGACATTCCCGGATGCCGGAAAGAATATCCTGCAGGCTGAGTGAGTCTCTTCCACAGTGCCGGGATCGCTGTCGCCACGTACATATTGTCACCAGGATGGGTATGTACCGCAGCATGCCATTTCTGGTGAATAACCGCACCTGCACATTCAGTCTGCTGAGAACCTGGCGTGCATCGGGCTGTAAAGAACATCCCTGTCCGTGGCTCCTCTGGTGGTGTTGCCTGCTCAGCAATGACAGGATGCCCTTAAGACGCCCGTCATGGCTCAGTATGATGACGACGAACACGCTCTGCAGGTGAGCATGCGGGGCCGGGAGCATCATCATTTCCGTATTTTGTCACACCTTCCGGAAGCAACCGGCATCAGTTGCGGTTATCGTCAACCGTTGTGCGGTATGCCAGGAGCACGTACTGATCCATCAGATCCGTCAGTGCTGCGCTCTGCCTCTGACGGTTGATGATCCATTCCTCCAGCTCTTTTTCTGATTTCCGTTTCAGGTTCTCTTTCCAGTCTTCAAGGCTGTTTCCGGTCATTAAATCAAGCCAGCAAAGCCAGACCAGTTTGAGGCGCAACTCCGTGCCACCAAACTGACGGAAGAGGCTAATCAGTTTATTTGCCAGAGAACGCTGCCTCAGACGTTCACATTCAGACAGGAGAATGCCTTCCAGCATCGGATGATGAACCAGCAGATGTTCACTGGTGCATAAATTCACTATCAGATCATCATCCAGGGGCGGCGTGGCCCGTTTGTCAGGGCCATAAATCAGCCGGTGCAGGCTCTGGGGTACGATGGCAATCTCCGGTTTTTCACCATTGAATGACACCCACCCGGGCGTCAGATTATGGAAGTAGTAGATATTCGGATGACACACGCCCCAGGATACGGCGTGCCGGGTGAAGCCGGTCAGCCATTTCTGGTAGCTTTCGTGGAGCAGTTCGTAACGGGTCTGATACTGTTGTTTCATTTTCATATTCCTCACGGGGGATATAACAGGCAGCAGAATGCCGCCGGCAGGCCGGGGATTACGTCTGCTGCGGGTGAACGGATAACGGTCAGAAATTACGGGCGGGGCTGTGCAATACGCGCCTGTAGCCAGGCTTCCACTTCACTTTTCAGCCAGCGGGAGCTGCGGCCCAGCTTGATGGGAGCCGGAAAGGCCCCATCCCTGATGAGCCTGTAAAACCACTTATCTGTCAGGCCGTTCAGTTGAGTGATAAAAGCCATGTCGACCATCTGGTCATCCATCAGTGAAACTGGGGTGGTCATGAGAATGTTCCTCCGGATGTTGTTAATCTCAACCGGCGTTGGTGATGAAAACAACGCACCACCTTCACCGGCATAACAACCGGGGAGAAACGTAAAAAATAATCGGGAATGTGACCGTCAGGAGATAAACGGGGGGATTGCGGTGGCTGAGCCGTACCGGTATGTCGGGTGGTTGTTGCGAAACGATGCTTCCGCACGCGCACGCGATAAAGGAGGGAGGACTTTCGTGAGACAAGGGGAAAACTAGCGTTCTGATAATTGCAGAGTTGTTTTTGAAGGCGGTTGTGGTATATCAGCCTGTGACCTGTGGGCAGGCGACAGTTATCACTGGTGGGATAAGACTGTCTGATTGGTATTATAAATATATAAGAGAGATCAAACACATGAGATCAAATACATGAGTTCATATATGTGAAATAAACACTGTGAGTGCGATCAGTGGGCATGAGAACGCAGTAGCAGCAGATTGCGTTGTACCCACAGGGCTATCTCTGATAGCTGGCGTATAACCTGCGTATATAAAACGTACCTGCAACACACGTGAACAGAGTGCATTAAGCATCCGTCTGAGCGG
TAAACGTATCCGACCTGGTGAGTGACAGTCTGAGAGACCGGTTAAGTGGTGGCACGGGAGTAATGAGTTTGGGGAGGGTACCAGAATGCCAGGCTGGTGCGGTATGCTGACAGCATCCTGTTAACCGCAGCATAAAGTAATGAGAGGCTAATGGAAGTTAAAACCACAGCAGTTCCGTCGTTGTTCATCCCGGGAGACGCTGGAAAAAGTGATTTCCCACACGCGTTATAAACTTACCCCTGCGGAGCTGGAAGCCTTTAACTCTGCCGTTGACAACCGGCTGGCAGAACTGACAATGAACAAACTTTACGATCGCGTGCCGGCTTCCGTCTGGAAATATGTCACCTGAATCTTGAATCCGCGCTACGATGACTCCGGTGTTATCTGTGCACCGGAAGATCGGGAACATTAAGCAGTATAACGGCAACAAAGAATTCGAAATCAAAAGGGAATGGTGTACAGAGGGGCAGGCGTTCAGGCATTACCGGCCTGTATCTTCTGAAGGGGACTCGTATTTGTCGGGGCTGAGTATTCCGCTCGATTCGATCACGGATTGGCGGAATACCTGACTGTGATTGATTCAGGTACCATTTTGCAGGGAGCCGGTGCGTTACTGTCTGGCACAACAAACGCAGCGGGTCGCTCTGCCCCGCCGATTCCTGATAATCCCTCCCCTGTAACCTACTGCAACAGTACAAATACACTGACGCCTAAAACGGTAACCAGAATCAGCACCAGCAGTAACGTCAGTAAACAGAGCAGACGATCGGTCTTCCGGTAATAACCGAAGGGGTCGACAGCGCCACAATACGGGCAACAAACTGTATTATGAGTAATTGCGTTACCACAGTCCCTGCACTTCTTTTTGCGTATCACGATAGCTCCCTCGCCACGCCTTATCCGTAACCGGTTTTTTACATTAAAAAAAACAACGCCGGGAATTCAATCGTCAGTCCGGAGACGACCGTTCGGGTTATCACAGAGTGCCTGAGACAGTGTCCTGCCGGAGGTCACTTCAGGGACTCTGCGTATTTTTTACGACGTGGCTATTCCGTATGAACCATACGGAGATTTAACCATGACCTACAAATACAACCCCTTCTGGCAGCAACGTATTCGTGAGACGGTGCGGCACGCACTGAATGTTCATCCCCGCCTGACGGCATTGCGGGTTGACCTGCGTCTCCCGGATGTACCGGCAGCAACGGACGCAGCTGTGATATCCCGCTTCATCAATGCCCTGAAAGCCCGAATCGACGCTTACCAGAAACGTAAGCATCGGGAAGGTAAACGCGTGCATCCCACAACCCTGCATTACGTCTGGGCCCGGGAGTTTGGGGAGTGCAAAGGTAAAAAACACTATCACCTGATGCTGCTGGTCAACCGGGATACCTGGTGTCGTGCCGGTGATTACCGCGCTCCGGAGTCTCTGGCCGGGATGATTAAACAGGCGTGGTGCAGTGCCCTGGGAGTGGATGTCGGGTGCCATGCCACGCTGGTGCATTTTCCGGCCTGGCCGGCGGTGTGGCTGGCGCGTAATGATGACACCGGCTTTCAGCAAGTGCTGGAACGTGCTGACTATCTGGCGAAGGAGCATACCAAAGCTCACTGCACCGGTGAGCGCAATTTTGGCTGTAGCCGGAGTTGAGCACGACTGGCGCGCCAATATCCGGCAAACCAAAAGTATGGGTTGTTGTCTGGCGGCAACGATGTATCCACTCGCCCACTGACTGTTTTTTACGTCAGTGTTCCATTGTGTGACAACCACAGTGCTGTAACAGCAACCAGCCTGCTTTACCCAAACTCCAGTCCAGATTTTAACTGAACCGCCATTCACACCCTGACCACGCTGCCCGTACCGGGACGGCTGTTGCCTGCGTGTCGTCTGGCGGCTGAGGTATATGACTATGTACGCAAAATCCTTTATCGCTCTTGATGGCAACGGACGTCTGACGGGCGCCCGTACTGCACAGGCCGCACCTTATGCTAACTACACCTGC
CACTTGTGTGGCAGTGCACTCAGATGCCATCCGCAATACGACACTGAACTTCCCTGGTTTGAACACACTGACGACAGGCTGACAGAGCACGGTCAACAGTGCCCTTATGTCAGGCCGGAGCGCAGAGAAATACAGTTGATTAAACGTCTGCAGCAATTCGTACCGGATGCCTTACCCGTGGTGCGTAAAGCCAGCTGGCACTGCAGACAATGTCACCACGATTATTATGGGGAGCAGTACTGCACACACTGCCAGACCGGAGGATTCAGTATTCCCCGGACAACTCAGGAGGAAATATGCGAATTCTGAACTGTTATATGGCGAATGACAGCAAAGGCCATTTTGTTACGGCGAAAGAAGCTGCGAAGCACAACCGACAGGACGTTTTGTGCTGTGTGTCCTGTGGATGCCCGTTAACACTTCAGCGGGGCAATGACGGACAACCACCGTGGTTTGAACATGACCAGATGACTGTCGCTGAAAAAATCCTGCTGCGATGTACCTGGCTTGACCCGGCAGAGAAAGAGGCCCGTCGTTTGCATCTGCAGGGCATGACGGTTCCGGATTATACGGTGAAGGTGAGAAAGTGGTTTTGTGTGATGTGTGACGAAGATTATGAGGGGGAAAAGTGCTGCCCACGCTGCGGTACCGGGGTATACAGCAGGGCGTGGGGGCGGCAGGAGGTGCCGTCGGAAGATGCCAGGGCTGATAATCCGTTACAGAGGCTGTAATGGTTGCCTCCGGAGCAGTTTGCGGTGATGCCTTTCCCGGGAATGGATTGCCAGTGCGGGGACTGTGGAGCTTAATTCCGGTGTTGGTGGCCTTTCCGTTGATTTTATGCCAACAGCCCCCTGTGACTGACAGGCTGCCTCGTCATTCCATTCGTATCTCCAGTAACAGGGGGTTGTATTGTATTTTATTGTGCCGGTTCAGGTGTGCGTTTCCCGGCGCGTCTGCACCGGCTTAACCAAATTCAACAGGGATAAAATAGTGGTAAGCGTACAGCCTGAACCGTCTGGTCAGAATCTGACGAATTAGACAAAGTGGTGTCCACCAAATAAGTAGTGGGAACCAAAGTGTCAGATATGCAGAAAAATGTGACTCCCGGCAGGCGAAAAGGCTGCCCTAATTATCCTCCCGAATTTAAACAGCTGCTCGTTGCTGCCTCCTGTGAACCCGGGATATCCATCTCAAAACTTGCTCTTGAAAATGGCATTAACGCCAATCTGTTGTTCAAATGGCGACAACAATGGCGCGAGGGAAAGCTGCTATTACCTTCTTCAGAGAGCCCCCAGCTACTTCCTGTGACTCTCGATGCAGCTGCCGAACAGCCAGAATCGCTCGCAGAGGACCCGGAAACCCTCAGTATCAGCTGTGAGGTAACGTTCCGGCACGGGACGCTCCGCTTCAATGGCAATGTCAGCGAAAAGCTCCTGACTCTGCTGATACAGGAACTGAAGCGATGATCCCGTTACCTTCCGGGACCAAAATTTGGCTGGTTGCCGGTATCACCGATATGAGAAATAGCTTCAACGGCCTGGCTGCGAAAGTACAAACGGCGCTGAAAGACGATCCCATGTCCGGCCATATTTTCATTTTCCGGGGCCGCAGCGGCAGTCAGGTTAAACTGCTGTGGTCCACCGGTGACGGACTGTGCCTCCTGACCAAACGGCTGGAGCGTGGGCGCTTTGCCTGGCCGTCAGCCCGTGATGGCAAAGTGTTCCTTACGCAGGCGCAGCTGGCGATGCTGCTGGAAGGTATCGACTGGCGACAGCCCAAGCGGCTGCTGACCTCCCTGACCATGCTGTAAATCTCTTTATCCTGGTTGTCACAGAATAAGCCCGGTAAAATACGGGCTTATGAACGACATCTTTCTGACGACATCTTCCTGCTGAAACAGCGCCTGGCCGAACAGGAAGCGCTGATCCACGCCCTGCAGGAAAAGCTGAGCAACCGGGAGCGCGAAATAGACCATCTGCAGGCGCAGCTGGATAAACTCC
GCCGGATGAACTTCGGCAGTCGCTCCGAAAAAGTCTCCCGCCGTATCGCACAAATGGAAGCCGATCTGAACCGGCTTCAGAAAGAGAGCGATACGCTGACTGGTAGGGTGTATGACCCGGCAGTACAGCGTCCGTTGCGTCAGACCCGCACCCGTAAGCCGTTCCCTGAATCACTACCCCGTGACGAAAAGCGACTGTTGCCTGCGGCGCCGTGCTGCCCGAACTGCGGCGGTTCACTGAGCTATCTGGGCGAGGATACCGCCGAACAGCTGGAGTTGATGCGTAGCGCCTTCCGGGTTATCCGGACGGTACGGGAAAAACATGCCTGTACTCAGTGCGATGCCATCGTGCAGGCACCTGCACCTTCGCGGCCCATCGAGCGGGGTATCGCCGGACCGGGGCTGCTGGCCCGCGTGCTGACCTCGAAGTATGCAGAGCACACCCCGCTGTATTGCCAGTCAGAAATATACGGCCGGCAAGGTGTGGAGCTGAGCCGTTCACTGCTGTCGGGCTGGGTGGATGCATGCTGCCGGCTGCTGTCTCCGCTGGAAGAGGTGCTTCATGGCTATGTCATGACTGACGGCAAACTCCATGCCGATGATACCCCGGTCCAGGTACTGCTGCCGGGTAATAAGAAGACGAAGACCGGGCGGTTGTGGGCGTATGTTCGTGATGACCGCAATGCCGGGTCAGCGTTGGCACCTGCAGTGTGGTTCGCTTACAGCCCGGACAGAAAAGGCATCCATCCGCAGACTCATCTTGCTTGCTTCAGCGGTGTGCTGCAAGCGGATGCGTACGCCGGGTTCAACGAGCTGTATCGCAATGGTGGGATAACGGAAGCTGCCTGCTGGGCTCATGCCCGCCGAAAGATCCACGATGTGCACGTCCGCATCCCGTCAGCACTGACGGAAGAAGCCCTAGAGCAGATCGGTCAGTTGTACGCCATAGAGGCGGATATAAGGGGAATGCCGGCAGAGCAGCGGCTTGCTGAACGTCAGCGAAAAACGAAACCGCTGTTGAAATCCCTGGAAAGCTGGTTGCGTGAAAAGATGAAGACCCTGTTCTTCGGCTCTGGCCATGGTGGTGAGCGGGGAGCGCTACTGTACAGCCTGATCGGGACGTGCAAACTGAATGACGTGGATCCAGAAAGCTACCTTCGCCATGTGCTTGGCGTCATAGCAGACTGGCCGGTCAACCGGGTCAGCGAACTGCTTCCGTGGCGCATAGCACTGCCAGCTGAATAA
Protein sequences of DBSCAN-SWA_11 >LR134000|3999404:4020572|4005057_4005534_-|VDY71017.1|DBSCAN-SWA MQQLSFLPGEMTPGERSLIQRALKTLDRHLHEPGVAFTSTRAAREWLLLNMAGLEREEFRVLYLNNQNQLIAGETLFTGTINRTEVHPREVIKRALHHNAAAVVLAHNHPSGEVTPSKADRLITERLVQALGLVDIRVPDHLIVGGSQVFSFAEHGLL >LR134000|3999404:4020572|4007557_4010404_-|VDY71025.1|DBSCAN-SWA MKRHLNTSYRLVWNHITGTLVVASELARSRGKRTGVAVALSLAAVTSLPALAADTVVQAGETVNGGTLENHDNQIVFGTANGMTISTGLEYGPDNEANTGGQWIQNGGIANNTTVTGGGLQRVNAGGSVSDTVISAGGGQSLQGQAVNTTLNGGEQWVHEGGIATGTVINEKGWQAVKSGAVATDTVVNTGAEGGPDAENGDTGQFVRGNAARTTINKNGRQIVAVEGTANTTVVYAGGDQTVHGHALDTTLNGGYQYVHNGGTASDTVVNSDGWQIVKEGGLADFTTVNQKGKLQVNAGGTATNVTLKQGGALVTSTAATVLGSNRLGNFTVENGKADGVVLESGGRLDVLEGHSAQKTRVDDGGTLAVSAGGKATGVTMTSGGALIADSGATVEGTNASGKFSIDGISGQASGLLLENGGSFTVNAGGQASNTTVGHRGTLMLAAGGSLSGRTQLSKGASMVLNGDVVSTGDIVNAGEIYFDNQTTPDAVLSRAVAKGNAPVTFHKLTTSNLTGQGGTINMRVRLDGSNASDQLVINGGQATGKTWLAFTNVGNSNLGVATTGQGIRVVDAQNGATTEEGAFALSRPLQAGAFNYTLNRDSDEDWYLRSENAYRAEVPLYTSMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNLTHTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMSFGEGTSSRDTLRDSAKHRVRELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSRNGTSLDLQAGLEARVRENITLGVQGGYAHSVSGSSAEGYNGQATLNVTF >LR134000|3999404:4020572|4018413_4018794_+|VDY71047.1|transposase|DBSCAN-SWA MQKNVTPGRRKGCPNYPPEFKQLLVAASCEPGISISKLALENGINANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETLSISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR >LR134000|3999404:4020572|4002051_4002477_-|VDY71001.1|transposase|DBSCAN-SWA MMSGFKRTQRDYPLSFKIAVVEKVEKGEMTYKQAGCSYGIQGRSTVLVWLRKYGRLDWSPGLPDQVKRRLPVALATVPLTPEQRIRELEEQLEQASQKTEFFNYLSITCSMTNGYNCHQNAQAEWINGILKMSFYSRVLQS >LR134000|3999404:4020572|4007328_4007487_-|VDY71023.1|DBSCAN-SWA MPGCTSRLLPEGPFSREQAVAVKTAYRNVFTEDDQGTYSRLVIRNAEGQLRW >LR134000|3999404:4020572|4013849_4014452_-|VDY71035.1|DBSCAN-SWA 
MKQQYQTRYELLHESYQKWLTGFTRHAVSWGVCHPNIYYFHNLTPGWVSFNGEKPEIAIVPQSLHRLIYGPDKRATPPLDDDLIVNLCTSEHLLVHHPMLEGILLSECERLRQRSLANKLISLFRQFGGTELRLKLVWLCWLDLMTGNSLEDWKENLKRKSEKELEEWIINRQRQSAALTDLMDQYVLLAYRTTVDDNRN >LR134000|3999404:4020572|4004660_4004774_-|VDY71013.1|DBSCAN-SWA MFRHPENEYDWLSSDVAVDEVLLINEYTFTEAGLRAG >LR134000|3999404:4020572|4003098_4003275_-|VDY71005.1|DBSCAN-SWA MKSLTTETALDILITWLQDNIDCESGIIFDNDEDKTDSAALLPCIAQAREGIRTLRQQ >LR134000|3999404:4020572|4001821_4002037_-|VDY70999.1|DBSCAN-SWA MVKEFVAIYNHERSHLALKYKTPDEVHQPFYGQKTVNLYQEQLKPGAYFHERRSLNNFHRDRVITFEQIAE >LR134000|3999404:4020572|4013205_4013334_+|VDY71033.1|DBSCAN-SWA MKRGVSRKYNESSQDMHYQVIPEGIKLSPRIRIYNISDNKTS >LR134000|3999404:4020572|4004242_4004611_-|VDY71011.1|DBSCAN-SWA MSDTLSGTTHPDDNHDHPWWGLPCTVTPCFGARLVQEGNRLHYLADRAGIRGRFRDADAYHPDQAFPLLMKQLELMLTSGELNPRHQHTVTLYAKGLTCEADTLGSCGYVYLAFYPTSETKK >LR134000|3999404:4020572|4011732_4012446_-|VDY71029.1|DBSCAN-SWA MRLKALAFLVPLEKISWIAGSERACLFSWYLLMQFIQNNRAKISADLLQKNKLYLKEEYLEGNAFPSDSSTQFRQILRVLDILSDKNLRDEWIIQTKDRWMRAFKSKSPFSYLLPENEHECIWTWNYLKGKNIALEKLASFPGSADIYHAIHLSFDIWVTCPLTSPDDIKNFRNSFNKAKAQRKYKKMQEDKVNVQFFLDAETRAQLKELSRVRRLSTGEMLHDLIVEEYKRYRHSR >LR134000|3999404:4020572|4014547_4014754_-|VDY71037.1|DBSCAN-SWA MTTPVSLMDDQMVDMAFITQLNGLTDKWFYRLIRDGAFPAPIKLGRSSRWLKSEVEAWLQARIAQPRP >LR134000|3999404:4020572|4010775_4011648_-|VDY71027.1|DBSCAN-SWA MNPSDAIEAIEKPLSSLPYSLSRHILEHLRKLTSHEPVIGIMGKSGAGKSSLCNALFQGEVTPVSDVHAGTREVRRFRLSGHGHSMIITDLPGVGESRDRDAEYEALYRDILPELDLVLWLIKADDRALSVDEYFWRHILHRGHQQVLFVVTQADKTEPCHEWDMAGIQPSPAQAQNIREKTDAVFRLFRPVHPVVAVSACTGWELDTLVSALMTALPDHAASPLMTRLQDELRTESVRSQAREQFTGAVDRIFDTAESVCIASVARAVLRAVRDSVVSVARAVWNWIFF >LR134000|3999404:4020572|4002840_4002996_-|VDY71003.1|DBSCAN-SWA MMSDTTLSRPEVVSGHTDVIYSTSVCHILAVRKSTLLPIDTIIRHQVSDIS >LR134000|3999404:4020572|4005549_4006014_-|VDY71019.1|DBSCAN-SWA 
MKTLSQNTTSSACAPETDLQQLVATLVPDEQRISFWPQHFGLIPQWVTLEPRVFGWMDRLCEDYCGGIWNLYTLNNGGAFMAPEPDDETWVLFNVMNGNRAEMSPEAAGIAACLMAYSHHACRTENYAMTVHYYRLRDYALQHPECSAIMRIID >LR134000|3999404:4020572|4017624_4018059_+|VDY71043.1|DBSCAN-SWA MRILNCYMANDSKGHFVTAKEAAKHNRQDVLCCVSCGCPLTLQRGNDGQPPWFEHDQMTVAEKILLRCTWLDPAEKEARRLHLQGMTVPDYTVKVRKWFCVMCDEDYEGEKCCPRCGTGVYSRAWGRQEVPSEDARADNPLQRL >LR134000|3999404:4020572|4017235_4017637_+|VDY71041.1|DBSCAN-SWA MYAKSFIALDGNGRLTGARTAQAAPYANYTCHLCGSALRCHPQYDTELPWFEHTDDRLTEHGQQCPYVRPERREIQLIKRLQQFVPDALPVVRKASWHCRQCHHDYYGEQYCTHCQTGGFSIPRTTQEEICEF >LR134000|3999404:4020572|4003291_4003780_-|VDY71007.1|DBSCAN-SWA MKLALTLEADSVNVQALNMGRIVVDIDGVNLAELINMVCDNGYSLRVVDESDRTSADCTPPFVALTGIRCSTAHITETDNAWLYSLSHQTSDFGESEWIHFTGTGYLLRTDTWSYPVLRLKRLGLSKTFRRLVVTLIRCYGVSLIHLDASAGYLPGLPTFDW >LR134000|3999404:4020572|3999404_4000151_-|VDY70995.1|transposase|DBSCAN-SWA MNHLYEQLTALKLTGFRDALKKQLAQPGTYQELGFEERLSLLTAEELTCRENRKAERLIKHARFRLNAELSKLDYRNNRGLDRALIRSLSQGNWLTLKQNILLTGATGSGKTFLACALGHNACRQGYKVYYYRLKALMEQCYQGHADGRYSKLLTRLNNSDLLLLDDWGLEPLSSEQRSDLLEIVDLMYQRGSIIVVSQLPVENWYKMIGDSTHADAILDRLVHGSIKIELKGESMRKIQSPLTEGDQ >LR134000|3999404:4020572|4000165_4001707_-|VDY70997.1|transposase|DBSCAN-SWA MPTVPISMRKLKEILRLKYGVGLSHRQIGRSLAISPSVVSRYANRAAQLGIKQWPLPTGWDDTKLKHAFLQTQVKMKKHSLPDWATVHRELRNKCVTLQLLWEEYCERNPGGFYSYNHYCRMYREWLKTTSPSMRQVHKAGEKLFVDYCGPTVGVTDPETGEIRTAQVIVAVLGASSYTWAEATWSQQLEDWVMSHVRCFQWLGGVPELVVPDNLKSATSRACKYDPDVNPTYQQMLEHYNVAVLPARPRKPKDKAKAEVGVQVVERWIMARIRHEIFYSLASLNQRIRELLERLNNKIMQKLGYSRAELFIQLDKPALKPLPEASYSYTLVKKVRVHADYHVEIDKHYYSVPCSLLGQQLEAWISGELVRLFNQGQEVAVHPRKRTYGYSTRNEHMPEAHRQHATWTPERLLEWAGHIGSETHSYVLHILNSRPHPEQSYRFCLGLLNLHKKYSKARLNAACARALKTKVWRLSGIKSILEKGLDKQPVQDPKPDLLSTMEHENVRGSEYYH >LR134000|3999404:4020572|4004773_4004995_-|VDY71015.1|DBSCAN-SWA MKIITRGEAMRIHQQHPTSRLFPFCTGKYRWHGSAEAYTGREVQDIPGVLAVFAERRKDSFGPYVRLMSVTLN >LR134000|3999404:4020572|4003776_4004154_-|VDY71009.1|DBSCAN-SWA 
MKTLPDTHVRKVSRCPSPVTIWQTLLTRLLDQHYGLTLNDTPFADERVIEQHIEAGISLCDAVNFLVEKYALVRTDLPGFSACTHSQLINSIDILRARRATGLMIRDNYRTVNNITLGKHPEAKR >LR134000|3999404:4020572|4018058_4018295_+|VDY71045.1|DBSCAN-SWA MVASGAVCGDAFPGNGLPVRGLWSLIPVLVAFPLILCQQPPVTDRLPRHSIRISSNRGLYCILLCRFRCAFPGASAPA >LR134000|3999404:4020572|4019153_4020572_+|VDY71051.1|transposase|DBSCAN-SWA MSQNKPGKIRAYERHLSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRMNFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRKPFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYCQSEIYGRQGVELSRSLLSGWVDACCRLLSPLEEVLHGYVMTDGKLHADDTPVQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTHLACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALTEEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKTLFFGSGHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVSELLPWRIALPAE >LR134000|3999404:4020572|4012490_4012649_-|VDY71031.1|DBSCAN-SWA MVQKILSDKVMNERTNAYYSYYLGERNISVLPLNVYDPPERFIAYIKKTEKI >LR134000|3999404:4020572|4018790_4019138_+|VDY71049.1|transposase|DBSCAN-SWA MIPLPSGTKIWLVAGITDMRNSFNGLAAKVQTALKDDPMSGHIFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGIDWRQPKRLLTSLTML >LR134000|3999404:4020572|4006355_4007174_-|VDY71021.1|DBSCAN-SWA MRLASRFGYTNQIRRDRPLTHEELMHYVPSIFGEDRHTSRSKRYAYIPTITVLESLQREGFQPFFACQTRVRDPGRQGYTKHMLRLRRAGEINGEHVPEIILLNSHDGTSSYQMLPGYFRFVCQNGCVCGQSPGEVRVPHRGNVVDRVIEGAYEVVGVFDRIEEKRDAMQSLVLPPPTRQALAQAALTYRYGDEHQPVTTADILTPRRREDYGKDLWSAYQTIQENMLKGGISGRSAKGKRIHTRAIHNIDTDIKLNRALWVMAETLLESLR >LR134000|3999404:4020572|4016406_4016976_+|VDY71039.1|DBSCAN-SWA MTYKYNPFWQQRIRETVRHALNVHPRLTALRVDLRLPDVPAATDAAVISRFINALKARIDAYQKRKHREGKRVHPTTLHYVWAREFGECKGKKHYHLMLLVNRDTWCRAGDYRAPESLAGMIKQAWCSALGVDVGCHATLVHFPAWPAVWLARNDDTGFQQVLERADYLAKEHTKAHCTGERNFGCSRS |
29 | Stx2-converting_phage(37.5%) | transposase | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those whose annotation contains a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_12 |
4027936 : 4041421
Sequences of DBSCAN-SWA_12
Nucleotide sequences of DBSCAN-SWA_12 >LR134000|4027936:4041421|DBSCAN-SWA GTCAACAAAAACTGGCCACCGAGTTAGAGTTTTTCCAGTATCGATTTTCCGATTCGTTTGGGGGTAACCCACCGTTATATTCGTGCGGTCTTAGTGCGCTGTAATATCCAACGATATAGTCCGTTATGGCGTGAGCTGCCTCGCTGAAGCTTACGTAACCCACCACCGGCATCCATTCGTTCTTCAGACTCCTGAAGAAGCGTTCCATTGGGCTGTTATCCCAGCAGTTTCCGCGCCGGCTCATACTCTGTCTGATCTGGTATCGCCACAATAACTGCCGGAACTGCCTGCTCGTATAATGACTGCCCTGATCGCTGTGGAACATCACCCCGCCGGGCTTACCACGGGTTTCCCACGCCATTTCCAGCGCTTTCATGGTGAGCCTGCTGTCCGGCGAGAACGACATGGCCCAGCCCACTGGTTTTCTTGCGAACAGGTCGAGAACAACGGCGAGGTACGCCCAGCGCTTACCCGTCCAGAGTAGAGTAAGAGGCAGGGCGTAGTCGGACTATTTCCCTGCCTCTCCTCCCCGAACCGGACGTGCACCTTTCAGCGCATCCGGCTCTCCATTTAAATGCTGGCGAACGCCATTGCCACTTCTGTAAAGCGCGATGTATACGTGTTTCTGGTCTCCGTCCTCAGATAGGGATTACCTTCGGGTAGCCGCCAGCGGAACAGCTTCTTGCCTTGCCCCACCAACCGGTACAGTATTTCGCCGCTGAGCTTGCCGTGATTGGTTTTACCAAATAAAACCCACGTTTTACTCTGACCCGGTTTCGGTGATTTACACCACCACCTCATCAGGGAAGCGATACCTGTACGGTATTTGCGGGCCAGCCAGTGAGCCAGCTTCCAGAACACGACACGGTCGATATAACTGAAGACTTTGGCCTTAAAATCAACGAACTGATAGAACATGGCCCAGCCTTTCAGTTTTCGGTTGAGTTGTTCAGCCATATCGACTTTGCTTTCACTGTAGTTGCCTGATAACAGTGCTGTCAGCGATGCGGCGAAGTTTCTGGCTTTCTCCTGCGGGATCGTTGAGACCACTCGCATCTCGCCATAACGACTGCGTTTGCGAATGATCCTGTGCCCCAGAAAGATAAAGCCGTCATTAACATGGGTGATTTTAGTCTTATCCATGTTCAGCCTGAGTTTCAGACTGCCTTCGAGCACACCCCGACACTCCTCCCTGATGGCTTCCGCCTGTGCTTTGGTGCCTTTGACGATGAGGACAAAATCATCGGCATAGCGGCAGTACGCCACCGCGGGTTTCCACTGCCAGTTTTCTCTGACCGCCGTACTTCGGCCCCGTTGGATACTGTTATTCCAGTACCACCGATCTTTTCTGGCTTTCCCGCTCAGGTAGCGCTCATGCAGGTATTGATCGAACTCATTCAGCATGATGTTCGATAATAGCGGCGATATAACACCGCCCTGTGGCACACCTTCACTGGCCGCCCGAAAGAGACCGACATCGATATGTCCCGCCTTGATGGTTTTCCACAGCAGAGTCATGAAACGTGCGTCACTGATCCTGCGGCGTACAGCCTTCATCAGCAGTCGATGATGTACGGTGTCGAAGTAACTGGACAGGTCGCCTTCAATCACCCAGCGTCCCCGGGTTTCACCGCAGTCTGTGAGCTGTAATTTCACCGTGCGGATCGCGTGGTGGACACTGCGCTCAGGCCGGAAGCCATATGAGAGCGTATGAAAATCACTCTCCCATATCGGCTCCATCGCCATCAGCATGGCCCGCTGAACAATACGATCCCGCAACGCGGGGATACCCAGTGGTCGCAGTTTGCCGTTGCTTTTAGGGATGTAAACCCGTCTGGCGGGCAAGGGCTGGTAGTGGCCTGAGAGTAATTCATCCCTGAGGATTTGCAGCTCAACAGCCAGTCTGGCCTGTAGCATTGTTTTGT
TCACGCCATCAACGCCGGGGGTATGCGCCCCCTTTGATGAAAGCGTGATCCGCGCCGCTTCAGCCAGCCATTCTGGTTGTGTTATCAGACGCAGCAGCCGTTGAATCCGTAGGGACGGATCGGTGGCTGCCCATGTGGCAAGCTTGCGTTGCATTTCGCTGATTATCAAAGGTCTTCACCTCGTTAGGCCAGTTAATTCACGTCGCAAACACATTCAAACTGCTTCCCTTCGCCATGTAATGGGCTTTCCCCATCGCGGACTACTACGGAAGCTCCGCCAGCCAGCGCGTCATCGGAGCCATGCCCCCTTAACATCCGTCGCTGACCTTCCCCGGTTTACCTGCCTGGACTCAGGCATACTGAGGAGGCTGCCCGTCGCACTCTTTATCCTTGCTTGCCGCAAGTTGGCAGAAGTCAGCAACGCAAGCGTGATAGACGCTGCTGCCCCGGTGTTTCGCATACATGTCAAAACACCTTCGACCGGCAGTGCTTACGTATCACTGCCAGTTCCTCCTGCACGGCCTGTCAGATCACGTAGGCCGTGGTGACGTTTTCAACCCACAGAGGCGGATTAACGGGTTTATGTTCTTCAGCCTTTCAGTACTTAACCTTGAGGATCATCTCGGCTTAGTGATCTCGCCTCAATCCCCGTTGTCAGCGGGTTACATCACCCTGCGGGCATGCCGCAGGTCACTGCCGCTCAGGTTCTCCACCGTCACACCCGGTGGGATTGTTGGGTTTCTCATCGTGAGTTACCGGTTCAATATTCCAGACAGACTCGCGGTTCATTTAAGCATCCATGCCCGCCCTGAACTCCGGGCACACTATAGGTCACATCACCGCACCACACCTGATTTGGCTCGGTCACGGCGAACTGCCGTTCAAGATAGTTAGGGATAGCAACATGTTCATGACCACCACGTTTATACCGGTGAGTCGGCTGCTGACAGCTGACCAGCCCCAGCTCTTTCATGAGCCTGCCAGCAAGCCAGCGTCCCATCTGGTAGCCTCTCCGGGTTGCCATTGTGGCGATGCTTCTTGCTCCGGCCGAACCGTGGCTGATGCCATATAGCTCAAGTACCTGACTGCGTAATACAGTCCGTCTGCCGTCTGGTTTTTCAGGCCGGTTTTTCCAGTATCTGTAGCTGCTGCGATGAACCCCGAACACATGGCAGAGTGTGACCACAGGATAATGCGCTCTGAGTTTCCCGATTATCGAGAACTGTTCAGGGAGTCTGACATCAAGAGCGCGGTAGCCTTCCTGAGAAACTCCCGGTTCACACGCATAACTCGGGATCTTGCTGGAGAGAGTAACCCAGTGATTTTGCCCTTCGTTTAAGCCCGTCGAGTACGCGATTACGGTAGCGCTCTTCATAATAATCCGCGCCAGGATCGACATATTTCTGACCGTATTTAAGGGCATTGTAGAACAGTATTGCCAGTTTTCTTGCCGTTGCAGTGACTGCTTTTGCTTTCCCGATACGGGATGATAATCGACGATAAAAAGCACCCAGCGCTGTATCGCTCCGGCCAATTGTTGTGGCTGCCAGTCTTAGTGCAGCAGCAATTCGACTGGATGAACGCCGTGTTTTCGAGGACAGTACTTTTCCACCGGAGATCTTATTCCCCGGGGACAAACACAGCCACGAGGTAAAATGGCTCGCATCCGGCCAGCGATTCATGTCAGTACCGCATTCGGCAACGAATTTTAGTGCCAGATAAGGTCCGAACCCATGTATCTGTGTCAGGTCTGCTCCGGTAATTTTCCAGAGTAATGGCCTGACATCAAAGGAAAGCTGATTGGGTTGTTTTGTACGATGTCTGGCGGAGGGCAATACTTCTTCAGGCTCTTCAGTTCCGGTGCTGAGCTGCAATAATGAAGTTTCGATTTGATCATCACACTCACGGATTTTTTCCTGGTAAAAATCGAAAAATGCCACCGACTGTTCAAGTGCAAAAAGATGTTCAGGCTGCCAGTTCCCTGTCAGTGCCTGTTGAAG
AACCTCTGGCGTCTTTTTACATCTGACATCACGATACTGGATCAATACTGAAGGATCGCGCTCACCACTGACAATAGCTCTGACGATTGAAAGACCGGTAACGCCAGTAATATCCGATACGGCATGATGCAATTGAACATTCATCTGCATCAACGCCTTCTGCATATGTTGGATATGAGCAGCCCGGTATTCAGTCAGGCGTTCCCGTTGTCTCAAATAAGACCTGAGTACGGAAATGTCGTGTTCAGGTCTGAAACTGGCCCGAAGAAGGCCAAATGAATGCAGCTTTTGCAGCCACTGTGCGTCATTGACATCAGTCTTGCGACCGGGAACATTTTTTGCGTCTCTGGCATTAACAAGGATCACATTGAAACCCGCTGCCTCAAGGATCTCAAAAGCAGGCAACCAGTAAACGCCAGTGGACTCCATGGCAATGGTCGTAATGTGACAGGATTTGAGCCAGTCGGACATCCGATGAAGGTCGCCAGTGAAACTTTGAAAAGTTCTGACCGGCTCGGCGTCAGCATCCGGTGGAACGGCGACTACGTGGAATTCAGCGCCAATATCAACGCCCGCAGCATGAGGATGGATAACAGACATTCGGGAGGATTTTTTCATGGAAATCTCCTTGCTCATTTTTCGATGAACGAAGGAGGGGACGACGGAGAAATCACCTTCCTGAACGGGATCAAAGTCACCATTGCCGGATCCGCCACAGTCCCTGAACCATGTTTTTTTACGGGGTCATACCACCAAAAGGCCGACGGCTGCTTCCTTCGCTAAATTAGCCTGCCAACTTTTCCGGGGAGTTTCTGAGACTCAGGACGGGGTTTCACCCATGGACAGTTTTTTAATATTTCATTCTCCATTTCAATGCGTTGTAGCTTTTTCCTCAGCTCACGTATTTCGATTTGTTCTGGTGTTATCGGAGAGGCTTTTGGTATTTTGCCCTGACGCTCATCACGCAGTTGTTTGACCCATCTTGTCATTGTGGAAAGGCCGACATCCATAGCTTTGGCGGCATCTGCCACCGTGTAGTTCTGGTCAACAACCAGTTGAGCGGATTCGCGTTTAAACTCTGCGCTGAAATTTCTTTTTTTCATTGGAGCACCTGTGTTGTTCTGAGGTGAGCATATCACCTCTGTTCAGGTGGCCAAATTCAGTGTGCCACTTCAAACATCAACCTACGCTCCCCAAGATGTTCCGCCTTATGGATGTAAGCGGCTCGCACAGAGCTACGCTCCTGGTGACTCATCTGCCGCTCTACCGCATCCCTCGACCATAATCCCGACTCAATCAACGAACTACAAGCCATCGTCCTGAAGCCATGCCCACAAACCTCGACCTTCGTGTCATACCCCATTACGCGCAAAGCCTTGTTCACCGTATTCTCACTCATCGGCTTACGTGGATCGTGATCGCCCACAAAAATCAGCTCTCAATTTCCGCTCATGCTTTTGATCTTTTCCAAAATACTTAGGGCCTGCCGCGACAAAGGTACAAGGTGTGGTGTCCGCATCTTCGAGCCACGCTGAGAATGCTTGACTCCTTCCAGTTGCTCACGCTCACCTGGGATTGTCCACATAGCCGTTTCAAAATCTACTTCTGACCAGCGGGCAAAATGCAGCTCGCTCGAACGGATGAAAACCAATAAAGTGAGTTCAACCGCCAGTCGAGTTAACGGTCTGCCAGAATAGTGATCTATGCGATGAAGTAATTCAGGTATACGGTTCAGTTCAAGAGCTGCACGGTGCTGTCTTTTCGCCGTAGCAACCGCACCTGCAATCTCTTGCGCGGGGTTGTAGTCGATTAAGCCGCTCTGAACGGCAAAGCGCATAATCGCGGTAGTACGCTGCTGTAAACGGGCGGCAACTTCGAGCCGCCCGGATGACTCGACTGCTTTAATGGGTACAAGAAGATCCCGCGTCTTCAGTTCCGCAATGTTCCGCTTACCGATAGCAGTAAAGAGATTATCCTCCAGACTTTTCAAATCACGAGCG
CTATGCGATGCCGACCACTTCTGATTGCTGGCGTGCCAGTCTCTGGCGACTACTTCAAACGTTATCGCCTCTTGTTCCTGTTCTACCTTAACAGCTTTCTTGTTTTCACTGGGATCGACACCATTAGCTAAAAGCTTACGAGCCTCATCCCGGCGTGCCCTGGCATCCGCCAACGACACTTCGGGGTATTTCCCCAATGCCAGCATCTTCTCTTTACCGCCAAAGCGATAACGTAGCCGCCAGTATTTGGAGCCGTTAGGGTGAACCAGCAAAACCATGCCTTCACCGTCGGTCAGCTTATAGGCTTTTGCTTCAGGCTTAGCCTAACGAACCTTCACATCACTCAGAGCCATGATGAGTATCCTTTCAAGGGTTCTGTGTGGGTACAAACATTATCGAACCGGGAAATACCCGCAGTTGTACCCGCATCAGTAAGTTGATGTAGATTGAATCAGGTTGACTTAGGTTGAGTGAAAAAGCGAGGAAAGCCTTGCGGATACTGGATTTTAGGCACAAAAAAAGACGTCCGTTGACGTCTATTGATGTTCCGATGGTGCCGAAGGCCGGACTCGAACCGGCACGTATTTCTACGGTTGATTTTGAATCAACTGCGTCTACCGATTTCGCCACTTCGGCACTGAAGAGGTATGCGGAAAACGTTGTGGATTATACCTGTCGCACGCCACCATGCAAGCGCCAGCTTGCACCCACCTCGCTAAGTGCTGAAATTTTCAGCATTACACTCAGCGATATGTAACCTTTGTCACACTCCAGGCACCCCGCCCTGCCATGCTCTACACTTCCCAAACAACACCAGAGAAGGACCCAAAAATGTCGATGATAAAAAGCTATGCCGCAAAAGAAGCGGGCGGCGAACTGGAAGTTTATGAGTACGATCCCGGTGAGCTGAAGCCACAAGATGTTGAAGTGCAGGTGGATTACTGCGGGATTTGTCATTCCGATCTGTCGATGATCGATAACGAATGGGGATTTTCACAATATCCGCTGGTTGCCGGGCATGAGGTGATTGGGCGCGTGGTGGCACTCGGGAGCGCCGCGCAGGATAAAGGTTTGCAGGTCGGTCAGCGTGTCGGGATTGGCTGGACGGCGCGTAGCTGTGGTCACTGCGACGCCTGTATTAGCGGTAATCAGATCAACTGCGAGCAAGGTACGGTGCCGACGATTATGAATCGCGGTGGCTTTGCCGAGAAGTTGCGTGCGGACTGGCAATGGGTGATTCCACTGCCAGAAAATATTGATATTGAGTCCGCCGGGCCGCTGTTGTGCGGCGGTATCACGGTCTTTAAACCACTGTTGATGCACCATATCACTGCTACCAGCCGCGTTGGGGTAATTGGTATTGGCGGGCTGGGGCATATCGCTATAAAACTTCTGCACGCAATGGGATGCGAGGTGACGGCCTTTAGTTCTAATCCGGCGAAAGAGCAGGAAGTACTGGCGATGGGTACCGATAAAGTGGTGAATAGCCGCGATCCGCAGGCACTGAAAGCCCTGTCGGGGCAGTTTGATCTCATTATCAATACTGTGAACGTCAGCCTCGACTGGCAGCCTTATTTTGAGGCGCTGACCTATGGCGGTAATTTCCATACGGTCGGTGCGGTTCTCACGCCGCTGTCTGTTCCGGCCTTTACGTTAATTGCGGGCGACCGCAGCATCTCTGGTTCTGCTACCGGCACGCCTTATGAGCTGCGAAAGCTGATGCGCTTTGCCGCCCGCAGCAAGGTTGCGCCGACAACCGAACTGTTCCCGATGTCGAAAATTAACGACGCCATTCAGCATGTACGCGATGGTAAAGCTCGCTACCGAGTAGTCCTGAAAGCCGACTTCTGATCCCTTTGCCGGGTGCTGTCGCCCGGCACTTAAATCTCTTTGGAAGGCGTCTCGCAATCTGCTGAATTTAGCGGTGAAGAGTGCACAGCCGTTGCCTATACTCCAGGCAGGAAACTGGAGGAAATCTCATGAGCGAACCCCTGTTAATT
GCCCGCACGCCGGACACTGAACTGTTTTTACTGCCGGGAATGGCTAACCGTCACGGGCTGATTACTGGCGCAACGGGGACGGGTAAAACCGTTACGCTGCAAAAACTGGCAGAGTCATTGTCGGAAATCGGCGTGCCGGTGTTTATGGCTGATGTGAAAGGCGATCTGACCGGTATCGCGCAGGCAGGAACGGCGTCGGAAAAACTGCTCACAAGGCTTAAAAATATCGGCGTCAATGACTGGCAACCGCATGCCAATCCGGTGGTGGTGTGGGATATCTTTGGCGAGAAAGGCCATCCGGTGCGGGCGACGGTTTCAGACCTGGGGCCGCTGTTGCTGGCGCGGCTGTTGAATCTCAACGATGTGCAGTCTGGCGTGCTGAATATCATCTTCCGCATTGCTGACGATCAGGGGCTGTTACTGCTCGACTTTAAAGATTTGCGGGCGATTACCCAGTACATCGGCGATAACGCCAAATCTTTCCAGAATCAGTACGGAAATATCAGTAGCGCATCGGTTGGTGCCATCCAGCGCGGATTACTGTCGCTGGAGCAACAAGGTGCGGAGCATTTCTTTGGCGAGCCGATGCTGGATATCAAAGACTGGATGCGCACCGATGCCAACGGTAAAGGCGTTATCAATATCCTCAGCGCCGAAAAGCTTTATCAGATGCCGAAACTATATGCCGCCAGCCTGTTGTGGATGCTCTCGGAGTTGTATGAACAATTGCCGGAAGCAGGCGATCTGGAGAAGCCGAAACTGGTGTTTTTCTTCGACGAAGCACATCTGCTGTTTAATGACGCACCGCAGGTACTGCTGGATAAGATTGAGCAGGTGATAAGGCTTATTCGCTCAAAAGGCGTGGGCGTCTGGTTCGTTTCGCAAAACCCGTCTGATATTCCGGATAACGTGCTCGGGCAGCTCGGTAATCGCGTTCAACACGCTTTGCGTGCTTTTACGCCCAAAGATCAGAAAGCGGTAAAAGCTGCGGCGCAAACCATGCGGGCCAATCCGGCGTTTGATACCGAAAAGGCGATTCAGGAACTGGGCACCGGCGAGGCGTTAATCTCGTTTCTTGATGTGAAAGGAAGTCCTTCAGTGGTGGAGCGGGCGATGGTGATCGCGCCTTGTTCGCGAATGGGGCCGGTGACGGAAGATGAGCGTAATGGCCTGATTAATCACTCCCCGGTGTATGGCAAATATGAGGATGAGGTGGACCGAGAATCCGCCTATGAGATGTTGCAAAAAGGCTTTCAGGCCAGTACCGAGCAGCAAAATAATCCTGCCGCGAAAGGGAAAGAGGTGGCGGTGGATGACGGTATTCTTGGTGGATTGAAGGATATTTTGTTTGGCACTACCGGACCACGCGGCGGGAAGAAAGATGGTGTGGTGCAAACAATGGCCAAAAGCGCCGCTCGCCAGGTGACGAATCAGATTGTACGCGGGATGTTGGGGAGTTTGCTGGGGGGGAGAAGAAGGTAAGTCAGGTGCGGTCTGTGCCCCTCACCCTAACCCTCTCCCCAAAGGGGCGAGGGGACCGATCGAACTCGTATTTGACTTTATCAACAATGTGAAAGGGAGCTTTCGCTCCCTTGCTTGGTTACGATTTTCTCATTAACAGCCACAGGCTGATTAAGAAGAAGCTGGCGCTTGGCAACAGTGCGCCGATGATCGGCGGGATGCCATAAACCAACGTCAGCGGGCCGAAGATCTGGTCCAGTACGTAGAAGACAAAACCGAAGCTGATACCGGTGACCACACGCACGCCCATCGGTACGCTACGCAGTGGGCCAAAGATGAACGACAGCGCCATCAGCATCATCACCGCCACGGATAGCGGCTGGAAGATTTTGCTCCACATGTTGAGCTGATAACGTCCGGCATCCTGACCGCTCGACTTCAGATACTTCACATAGTTGTGCAAACCGCTAATGGAGAGTGCATCCGGGTCCAGCGCCACCACGCCCAGTTTGTCTGGCGTGAGGTTGGTTTTCCAGGTG
CCGCTCACCGTCTGCGAACCGGTAATCTGTTTCGGATTGGTCAGATCAGATTCATCAACCTGCGACAGACGCCAGACTTTATGTTCCGGGTCAAACTTCGCAGTAGCGGCATAGCGTACGGATTGCAGACGACGATTCTCGTTAAAGGCATAAATGCTGATGCCACCTAACTCTTCGTCACCTTTAACCCGCTCAATGTAGACGAAGTTGTTGCCATCTTTCGCCCATAAGCCTTGCTGGGTAGAGAGCAACGAGCCGCCGTACATCGCCTGCGCACGGTAGTTACGCGCCATCTGTTCGCCCTGCGGCGCAACCCATTCGCCAATCGCCATCGTCAGCAAGACCAGCGGAATGGCGGTTTTCATCACCGACAGCGCCACCTGCATACGGGTAAAACCAGAAGCCTGCATCACCACCAGTTCGCTGCGCTGCGCCAGCATCCCAAGACCAAGCAACGCCCCAAGCAGAGCCGCCATCGGGAAGAAGATCTGCACATCTTTCGGCACGCTCAGCAAGGTATACATTCCTGCGCCTAACGCGTCGTAACTCCCCTGCCCGGCTTTTTTCAACTGATCGACAAACTTGATAATGCCCGACAGCGACACCAGCATGAACAGCGTCATCATGATGGTGGTGAAAATAGTTTTACCGATATAGCGGTCAAGTACGCCAAAAGGTTGCATCACACCGCTCCTTTACGCGAAAAACTGGCGCGCAGGCGGCGGACCGGCACGGTGTCCCAAAGGTTGAGAACAATCGCCAAAGCCAGATAAATCAGGTTAACGGTCCACATCCACAGCGTCGGGTCCAGCTTACCTTTACCGCCGTTCGATTTCAGGGAGGTCTGGATCAGGAAGAAAAGTAGATACAGCAGCATGGCTGGCAGCATCGACAGTACGCGTCCCTGACGTGGGTTAACCACGCTCAGCGGTACGACCATAAGTGCCATCATAAACACGGTGAATACCAACGTGATACGCCAGTTCAGTTCTGCGCGAGCACGATCGGTGTCAGTGTTCCACAATGTGCGCATGTCCATCTGGTCGGTATCGTTCGGGTCGAGCGCCACCGCCTGGTGACCAATGATCGCCTGATAATCCTGGAAATCCGTAATGCGGAAATCACGTAACAGTGCAGTGCCTTCGAAGCGCGTTCCCTGGTTGAGAGTGACGACCTGGGAGCCGTCGCGCAGCTGGGTTAAATGTCCGGAATCGGCCACCACCACAGAAGGACGTGCATTACCTTTTGGTCGAATTTGTGCGAGGAACACATCTTTGAAATCGCTGCCGTCAACGCTTTCGATGAACAGCACCGAGCTGCCATTAGTCGCTTGCTGGAATTGCCCTTGCGCCAGCGCCGCCATGCCAGGGTTCGCTTTCGCTTCTGCTAACACTTCATCCTGATGACGCGATGACCACGGTCCCGCCCACATCACGTTAACCGCCGCGACGATTGCCGTGAATACCGCAAGGATCATTGCCGCTTTCACCAGAACCGCTTTGCTCAGGCCGCAGGCATGCATTACCGTAATTTCACTTTCGGTATACAGTTTGCCCAGCGTCATCAGCAGCCCGAGGAACAGGCTTAATGGCAGGATAAGCTGCGCCATTTCCGGCACGCCCAACCCGAGAAGGGAGAGCACCAGATTCGCCGGAATATCGCCGTCAACCGCTGCGCCGAGGATCCTCACTAACTTTTGACAGAAGAAGATCAAAAGCAAGATGAAGAGTATCGCCAGCTGGCTTTTGAGCGTCTCCCGCACCAGATATCTTATGATTATCACTTTAAATACGCCCGTAAAAACTCGTCTTTTGCAGGATTTTAGCTTGTTTCATGGCTTAAACGTCATTTATTCTCTTGAGTCGTCGAAATCGTCGCTAAGATAATTATACTCAACGGATTCACCTCTCAGATTTTGTTCTGACGTGCCAATGCCGTAATAACGTTAAGATTAACACGAAGTCATCGCAACAGCGGACATGAGTTACGAAAGCTTGCAA
TTCTATCTGTAGCCACCGCCGTTGTCTTTAAGATTCAGGAGCGTAGTGCATGGAGTTTAGTGTAAAAAGCGGTAGCCCGGAGAAACAGCGGAGTGCCTGCATCGTCGTGGGCGTCTTCGAACCACGTCGCCTTTCTCCGATTGCAGAACAGCTCGATAAAATCAGCGATGGGTACATCAGCGCCCTGCTACGTCGGGGCGAACTGGAAGGAAAACCGGGGCAGACATTGTTGCTGCACCATGTTCCGAATGTACTTTCCGAGCGAATTCTCCTTATTGGTTGCGGCAAAGAACGTGAGCTGGATGAGCGTCAGTACAAGCAGGTTATTCAGAAAACCATTAATACGCTGAATGATACTGGCTCAATGGAAGCGGTCTGCTTTCTGACTGAACTGCACGTTAAAGGCCGTAACAACTACTGGAAAGTGCGTCAGGCTGTCGAGACGGCAAAAGAGACGCTCTACAGTTTCGATCAGCTGAAAACGAACAAGAGCGAACCGCGTCGTCCGCTGCGTAAAATGGTGTTCAACGTGCCGACCCGCCGTGAACTGACCAGCGGTGAGCGCGCGATCCAGCACGGTCTGGCGATTGCCGCCGGGATTAAAGCAGCAAAGGATCTTGGCAATATGCCGCCGAATATCTGTAACGCCGCTTACCTCGCTTCACAAGCGCGCCAGCTGGCTGACAGCTACAGCAAGAATGTCATCACCCGCGTTATCGGCGAACAGCAGATGAAAGAGCTGGGGATGCATTCCTATCTGGCGGTCGGTCAGGGTTCGCAAAACGAATCGCTGATGTCGGTGATTGAGTACAAAGGCAACGCGTCGGAAGATGCACGCCCAATCGTGCTGGTGGGTAAAGGTTTAACCTTCGACTCCGGCGGTATCTCGATCAAGCCTTCAGAAGGCATGGATGAGATGAAGTACGATATGTGCGGTGCGGCAGCGGTTTACGGCGTGATGCGGATGGTCGCGGAGCTACAACTGCCGATTAACGTTATCGGCGTGTTGGCAGGCTGCGAAAACATGCCTGGCGGACGAGCCTATCGTCCGGGCGATGTGTTAACCACCATGTCCGGTCAAACCGTTGAAGTGCTGAACACCGACGCTGAAGGCCGCCTGGTACTGTGCGACGTGTTAACTTACGTTGAGCGTTTTGAGCCGGAAGCGGTGATTGACGTGGCGACGCTGACCGGTGCCTGCGTGATCGCGCTGGGTCATCACATTACCGGTCTGATGGCGAACCATAATCCGCTGGCCCATGAACTGATTGCCGCGTCTGAACAATCCGGTGACCGCGCATGGCGCTTACCGCTGGGTGACGAGTATCAGGAACAGCTGGAGTCCAATTTTGCCGATATGGCGAACATTGGCGGTCGTCCTGGTGGGGCGATTACCGCAGGTTGCTTCCTGTCGCGCTTTACCCGTAAGTACAACTGGGCGCACCTGGATATCGCCGGAACCGCCTGGCGTTCTGGTAAAGCAAAAGGCGCAACCGGCCGTCCGGTAGCGTTGCTGGCACAGTTCCTGTTAAACCGCGCTGGGTTTAACGGCGAAGAGTAA
Protein sequences of DBSCAN-SWA_12 >LR134000|4027936:4041421|4035839_4037342_+|VDY71079.1|DBSCAN-SWA MSEPLLIARTPDTELFLLPGMANRHGLITGATGTGKTVTLQKLAESLSEIGVPVFMADVKGDLTGIAQAGTASEKLLTRLKNIGVNDWQPHANPVVVWDIFGEKGHPVRATVSDLGPLLLARLLNLNDVQSGVLNIIFRIADDQGLLLLDFKDLRAITQYIGDNAKSFQNQYGNISSASVGAIQRGLLSLEQQGAEHFFGEPMLDIKDWMRTDANGKGVINILSAEKLYQMPKLYAASLLWMLSELYEQLPEAGDLEKPKLVFFFDEAHLLFNDAPQVLLDKIEQVIRLIRSKGVGVWFVSQNPSDIPDNVLGQLGNRVQHALRAFTPKDQKAVKAAAQTMRANPAFDTEKAIQELGTGEALISFLDVKGSPSVVERAMVIAPCSRMGPVTEDERNGLINHSPVYGKYEDEVDRESAYEMLQKGFQASTEQQNNPAAKGKEVAVDDGILGGLKDILFGTTGPRGGKKDGVVQTMAKSAARQVTNQIVRGMLGSLLGGRRR >LR134000|4027936:4041421|4028505_4030029_-|VDY71065.1|DBSCAN-SWA MIISEMQRKLATWAATDPSLRIQRLLRLITQPEWLAEAARITLSSKGAHTPGVDGVNKTMLQARLAVELQILRDELLSGHYQPLPARRVYIPKSNGKLRPLGIPALRDRIVQRAMLMAMEPIWESDFHTLSYGFRPERSVHHAIRTVKLQLTDCGETRGRWVIEGDLSSYFDTVHHRLLMKAVRRRISDARFMTLLWKTIKAGHIDVGLFRAASEGVPQGGVISPLLSNIMLNEFDQYLHERYLSGKARKDRWYWNNSIQRGRSTAVRENWQWKPAVAYCRYADDFVLIVKGTKAQAEAIREECRGVLEGSLKLRLNMDKTKITHVNDGFIFLGHRIIRKRSRYGEMRVVSTIPQEKARNFAASLTALLSGNYSESKVDMAEQLNRKLKGWAMFYQFVDFKAKVFSYIDRVVFWKLAHWLARKYRTGIASLMRWWCKSPKPGQSKTWVLFGKTNHGKLSGEILYRLVGQGKKLFRWRLPEGNPYLRTETRNTYTSRFTEVAMAFASI >LR134000|4027936:4041421|4033019_4033286_-|VDY71073.1|integrase|DBSCAN-SWA MGDHDPRKPMSENTVNKALRVMGYDTKVEVCGHGFRTMACSSLIESGLWSRDAVERQMSHQERSSVRAAYIHKAEHLGERRLMFEVAH >LR134000|4027936:4041421|4031137_4032478_-|VDY71069.1|transposase|DBSCAN-SWA MKKSSRMSVIHPHAAGVDIGAEFHVVAVPPDADAEPVRTFQSFTGDLHRMSDWLKSCHITTIAMESTGVYWLPAFEILEAAGFNVILVNARDAKNVPGRKTDVNDAQWLQKLHSFGLLRASFRPEHDISVLRSYLRQRERLTEYRAAHIQHMQKALMQMNVQLHHAVSDITGVTGLSIVRAIVSGERDPSVLIQYRDVRCKKTPEVLQQALTGNWQPEHLFALEQSVAFFDFYQEKIRECDDQIETSLLQLSTGTEEPEEVLPSARHRTKQPNQLSFDVRPLLWKITGADLTQIHGFGPYLALKFVAECGTDMNRWPDASHFTSWLCLSPGNKISGGKVLSSKTRRSSSRIAAALRLAATTIGRSDTALGAFYRRLSSRIGKAKAVTATARKLAILFYNALKYGQKYVDPGADYYEERYRNRVLDGLKRRAKSLGYSLQQDPELCV >LR134000|4027936:4041421|4032639_4032963_-|VDY71071.1|DBSCAN-SWA 
MKKRNFSAEFKRESAQLVVDQNYTVADAAKAMDVGLSTMTRWVKQLRDERQGKIPKASPITPEQIEIRELRKKLQRIEMENEILKNCPWVKPRPESQKLPGKVGRLI >LR134000|4027936:4041421|4037460_4038543_-|VDY71081.1|DBSCAN-SWA MQPFGVLDRYIGKTIFTTIMMTLFMLVSLSGIIKFVDQLKKAGQGSYDALGAGMYTLLSVPKDVQIFFPMAALLGALLGLGMLAQRSELVVMQASGFTRMQVALSVMKTAIPLVLLTMAIGEWVAPQGEQMARNYRAQAMYGGSLLSTQQGLWAKDGNNFVYIERVKGDEELGGISIYAFNENRRLQSVRYAATAKFDPEHKVWRLSQVDESDLTNPKQITGSQTVSGTWKTNLTPDKLGVVALDPDALSISGLHNYVKYLKSSGQDAGRYQLNMWSKIFQPLSVAVMMLMALSFIFGPLRSVPMGVRVVTGISFGFVFYVLDQIFGPLTLVYGIPPIIGALLPSASFFLISLWLLMRKS >LR134000|4027936:4041421|4038542_4039643_-|VDY71083.1|DBSCAN-SWA MIIIRYLVRETLKSQLAILFILLLIFFCQKLVRILGAAVDGDIPANLVLSLLGLGVPEMAQLILPLSLFLGLLMTLGKLYTESEITVMHACGLSKAVLVKAAMILAVFTAIVAAVNVMWAGPWSSRHQDEVLAEAKANPGMAALAQGQFQQATNGSSVLFIESVDGSDFKDVFLAQIRPKGNARPSVVVADSGHLTQLRDGSQVVTLNQGTRFEGTALLRDFRITDFQDYQAIIGHQAVALDPNDTDQMDMRTLWNTDTDRARAELNWRITLVFTVFMMALMVVPLSVVNPRQGRVLSMLPAMLLYLLFFLIQTSLKSNGGKGKLDPTLWMWTVNLIYLALAIVLNLWDTVPVRRLRASFSRKGAV >LR134000|4027936:4041421|4039909_4041421_+|VDY71085.1|DBSCAN-SWA MEFSVKSGSPEKQRSACIVVGVFEPRRLSPIAEQLDKISDGYISALLRRGELEGKPGQTLLLHHVPNVLSERILLIGCGKERELDERQYKQVIQKTINTLNDTGSMEAVCFLTELHVKGRNNYWKVRQAVETAKETLYSFDQLKTNKSEPRRPLRKMVFNVPTRRELTSGERAIQHGLAIAAGIKAAKDLGNMPPNICNAAYLASQARQLADSYSKNVITRVIGEQQMKELGMHSYLAVGQGSQNESLMSVIEYKGNASEDARPIVLVGKGLTFDSGGISIKPSEGMDEMKYDMCGAAAVYGVMRMVAELQLPINVIGVLAGCENMPGGRAYRPGDVLTTMSGQTVEVLNTDAEGRLVLCDVLTYVERFEPEAVIDVATLTGACVIALGHHITGLMANHNPLAHELIAASEQSGDRAWRLPLGDEYQEQLESNFADMANIGGRPGGAITAGCFLSRFTRKYNWAHLDIAGTAWRSGKAKGATGRPVALLAQFLLNRAGFNGEE >LR134000|4027936:4041421|4033298_4034138_-|VDY71075.1|integrase|DBSCAN-SWA MVLLVHPNGSKYWRLRYRFGGKEKMLALGKYPEVSLADARARRDEARKLLANGVDPSENKKAVKVEQEQEAITFEVVARDWHASNQKWSASHSARDLKSLEDNLFTAIGKRNIAELKTRDLLVPIKAVESSGRLEVAARLQQRTTAIMRFAVQSGLIDYNPAQEIAGAVATAKRQHRAALELNRIPELLHRIDHYSGRPLTRLAVELTLLVFIRSSELHFARWSEVDFETAMWTIPGEREQLEGVKHSQRGSKMRTPHLVPLSRQALSILEKIKSMSGN >LR134000|4027936:4041421|4034691_4035711_+|VDY71077.1|DBSCAN-SWA 
MSMIKSYAAKEAGGELEVYEYDPGELKPQDVEVQVDYCGICHSDLSMIDNEWGFSQYPLVAGHEVIGRVVALGSAAQDKGLQVGQRVGIGWTARSCGHCDACISGNQINCEQGTVPTIMNRGGFAEKLRADWQWVIPLPENIDIESAGPLLCGGITVFKPLLMHHITATSRVGVIGIGGLGHIAIKLLHAMGCEVTAFSSNPAKEQEVLAMGTDKVVNSRDPQALKALSGQFDLIINTVNVSLDWQPYFEALTYGGNFHTVGAVLTPLSVPAFTLIAGDRSISGSATGTPYELRKLMRFAARSKVAPTTELFPMSKINDAIQHVRDGKARYRVVLKADF >LR134000|4027936:4041421|4027936_4028311_-|VDY71063.1|transposase|DBSCAN-SWA MKALEMAWETRGKPGGVMFHSDQGSHYTSRQFRQLLWRYQIRQSMSRRGNCWDNSPMERFFRSLKNEWMPVVGYVSFSEAAHAITDYIVGYYSALRPHEYNGGLPPNESENRYWKNSNSVASFC >LR134000|4027936:4041421|4030620_4030860_-|VDY71067.1|DBSCAN-SWA MGRWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNYLERQFAVTEPNQVWCGDVTYSVPGVQGGHGCLNEPRVCLEY |
12 | Shigella_phage(30.0%) | transposase,integrase | attL 4030819:4030833|attR 4040591:4040605 |
Homologous phage analysis in the prophage region: colored bacterial proteins are those whose annotation contains a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
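The protein records above use pipe-delimited headers such as `>LR134000|4027936:4041421|4033019_4033286_-|VDY71073.1|integrase|DBSCAN-SWA`, and the keyword note lists the terms that mark a protein as phage-related. The following Python sketch shows one way such a header could be split into fields and checked against that keyword list. The field names (`contig`, `region_start`, `gene_start`, and so on), the functions `parse_header` and `phage_keywords`, and the case-insensitive substring match are assumptions made for illustration; they are inferred from the headers shown here and are not a documented DBSCAN-SWA format or API.

```python
from dataclasses import dataclass
from typing import List, Optional

# Keyword list copied from the note on colored proteins above.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


@dataclass
class ProteinHeader:
    """Fields of one header line (names are hypothetical, chosen for this sketch)."""
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    annotation: Optional[str]
    tool: str


def parse_header(header: str) -> ProteinHeader:
    """Split a '>contig|start:end|start_end_strand|accession[|annotation]|tool' line."""
    fields = header.lstrip(">").strip().split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")
    contig, region, gene, accession = fields[:4]
    # The annotation field is only present for some proteins.
    annotation = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(
        contig=contig,
        region_start=region_start,
        region_end=region_end,
        gene_start=int(gene_start),
        gene_end=int(gene_end),
        strand=strand,
        accession=accession,
        annotation=annotation,
        tool=fields[-1],
    )


def phage_keywords(annotation: Optional[str]) -> List[str]:
    """Return the keywords found in an annotation (assumed: case-insensitive substring match)."""
    if not annotation:
        return []
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


if __name__ == "__main__":
    h = parse_header(
        ">LR134000|4027936:4041421|4033019_4033286_-|VDY71073.1|integrase|DBSCAN-SWA"
    )
    print(h.accession, h.strand, phage_keywords(h.annotation))
```

Substring matching is the simplest reading of the note; a short keyword such as 'head' or 'coat' could also match unrelated annotation text, so a word-boundary match may be preferable on real annotations.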
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|