Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
LR134204 | Citrobacter koseri strain NCTC11075 genome assembly, chromosome: 1 | 15 crisprs | DinG,DEDDh,cas3,PD-DExK,RT,csa3 | 2 | 9 | 12 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_1 | 263898-263981 | Orphan |
NA
Consensus repeat of LR134204_1
|
1 spacers
spacers of LR134204_1
>1.1|263921|38|LR134204|CRISPRCasFinder AGGCCTACAGGTTCTGCGTAATGTTGTAGGGGTCGCCG |
CRISPR arrays and Neighbor proteins around LR134204_1
The CRISPR arrays of LR134204_1 >merge|LR134204|1|263898-263981|CRISPRCasFinder GATGGCGGCACAAGCGCCTTATCAGGCCTACAGGTTCTGCGTAATGTTGTAGGGGTCGCCGGATGGCGGCACAAGTGCCTTATC >LR134204|1|1|263898-263981|CRISPRCasFinder GATGGCGGCACAAGCGCCTTATC AGGCCTACAGGTTCTGCGTAATGTTGTAGGGGTCGCCG GATGGCGGCACAAGTGCCTTATC
>LR134204.1|VEB84151.1|263574_263814_+|bifunctional-acetaldehyde-CoA/alcohol-dehydrogenase MKAELGIPKSIREAGVQEADFLAHVDKLSEDAFDDQCTGANPRYPLISELKQILLDTYYGREFTEGEVAAKKEAARTES >LR134204.1|VEB84148.1|262775_263462_+|bifunctional-acetaldehyde-CoA/alcohol-dehydrogenase MIIALGGGSPMDAAKIMWVMYEHPETHFEELALRFMDIRKRIYKFPKMGVKAKMVAITTTSGTGSEVTPFAVVTDDATGQKYPLADYALTPDMAIVDANLVMDMPKSLCAFGGLDAVTHALEAYVSVLASEFSDGQALQALKLLKENLPASYHEGSKNPVARERVHSAATIAGIAFANAFLGVCHSMAHKLGSQFHIPHGLANALLICNVYPLQRERQPDQADCVQPV >LR134204.1|VEB84145.1|262510_262741_+|bifunctional-acetaldehyde-CoA/alcohol-dehydrogenase MLWHKLPKSIYFRRGSLPIALDEVITDGHKRALIVTDRFLFNNGYADQITSVLKAAGVETEVFFEVEADPTLNYRS >LR134204.1|VEB84142.1|261855_262191_+|bifunctional-acetaldehyde-CoA/alcohol-dehydrogenase MSKTFDNGVICASEQSVVVVDSVYDAVRERFASHGGYLLQGKELKAVQDVILKNGALNAAIVGQPAYKIAELAGFTVPETTKILIGEVTAVDDSEPFAHEKLSPTLAMLSR >LR134204.1|VEB84138.1|261302_261875_+|bifunctional-acetaldehyde-CoA/alcohol-dehydrogenase MAVAESGMGIIEDKVIKNHFASEYIYNAYKDEKTCGVLSEDDTFGTITIAEPIGIICGIVPTTNPTSTAIFKSLISLKTRNAIIFSPHPRAKDATNKAADIVLQAAIAAGAPKDLIGWIDQPSVELSNALMHHPDINLILATGGPGMVKAAYSSGKPAIGVGAGNTPVVIDENSRYQTCGGFRTDVQNFR >LR134204.1|VEB84135.1|261150_261348_+|bifunctional-acetaldehyde-CoA/alcohol-dehydrogenase MAVTNVAELNALVERVKKAQREYASFTQEQVTKSSAPPLWLPQMLEFPSLRWPLPNQAWVSSKIK >LR134204.1|VEB84132.1|260025_260763_-|multiple-drug-resistance-protein-MarC MKYADKRRIFSAFFTIFIDIIYPANVSELTVTQTLFDFPVYSKFFIGLFALVNPVGIIPVFISMTSYQTAAARNKTNLTANLSVAIILWISLFLGDGILQLFGISIDSFRIAGGILVVTIAMSMISGKLGEDKQNKQEKSETAIRESIGVVPLALPLMAGPGAISSTIVWGTRYHTIMHLIGFSVAIALFAFCCWGVFRMAPWLVRLLGQTGINVITRIMGLLLMALGIEFIVTGIKAIFPGLLN >LR134204.1|VEB84129.1|258152_259253_-|oligopeptide-ABC-transporter-substrate-binding-protein MSNITKKSLLAAGILTALIGGNVAMAADVPAGVQLSDKQTLVRNNGSEVQSLDPHKIEGVPESNVSRDLFEGLLISDVEGHPSPGVAEKWENKDFKVWTFHLRKNAKWSDGTPVTAHDFVYSWQRLANPNTASPYASYLQYGHIANIDDIIAGKKPATDLGVKAIDDNTFEVTLSEPVPYFYKLLVHPSVSPVPKAAVEKFGEKWTQPANIVTNGAYKLKNWVVNERIVLERNTQYWDNDKTVINQVTYLPISSEVTDVNRYRSGEIDMTYNNMPIELFQKLKKEIPNEVRVDPYLCTYYYEINNQKAPFNDVRVRTALKLAMDRDIIVNKVKNQGDLPAYSYTPPYTDGAKLVEPEWFTWSQEKT >LR134204.1|VEB84126.1|257620_258208_-|oligopeptide-ABC-transporter-substrate-binding-protein MARNWSSLSGSHGHRKKRNEEAKKLLAEAGYTADKPLTFDLLYNTSDLHKKLAIAVASIWKKNLGANVKLENQEWKTFLDTRHQGTFDVARAGWCADYNEPTSFLNTMLSDSSNNTAHYKSPEFDKLIADTLKVTDEAQRTELYAKAEQQLDKDSAIVPLYYYVNARLVKPWVGGYTGKDPMDNIYVKNLYIIKH >LR134204.1|VEB84123.1|256577_257498_-|oligopeptide-transporter-permease MLKFILRRCLEAIPTLFILITISFFMMRLAPGSPFTGERALPPEVLANIEAKYHLNDPIMTQYFSYLKQLAHGDFGPSFKYKDYTVNDLVASSFPVSAKLGAAAFFLAVIIGVSAGVIAALKQNTRWDYTVMGVAMTGVVIPSFVVAPLLVMIFAITLKWLPGGGWNGGALKFMILPMVALSLAYIASIARITRGSMIEVLHSNFIRTARAKGLPMRRIIFRHALKPALLPVLSYMGPAFVGIITGSMVIETIYGLPGIGQLFVNGALNRDYSLVLSLTILVGTLTILFNAIVDVLYAVIDPKIRY >LR134204.1|VEB84154.1|264002_264164_-|thymidine-kinase MVLRLDQSGRPYNEGEQVVIGGNERYVSVCRKHYKEALAEGSLTSIQEKHRHA >LR134204.1|VEB84157.1|264196_264685_-|thymidine-kinase MPVNYDTIPPNLLFQPGEGLLPMAQLYFYYSAMNAGKSTALLQSSYNYQERGMRTVVYTAEIDDRFGAGKVSSRIGLSSPAKLFNQNSSLFEEIRTENAQQTIHCVLVDECQFLTRQQVYDYLRLLTNWIFRCCVTDCVLIFVASYLAAANTYSRGQINSSN >LR134204.1|VEB84160.1|265163_265577_+|global-DNA-binding-transcriptional-dual-regulator-H-NS MSEALKILNNIRTLRAQARECTLETLEEMLEKLEVVVNERREEESAAAAEVEERTRKLQQYREMLIADGIDPNELLNSMAAVKSGTKAKRAARPAKYSYVDENGETKTWTGQGRTPAVIKKAMEEQGKQLDDFLIKG >LR134204.1|VEB84163.1|265702_266611_-|UTP--glucose-1-phosphate-uridylyltransferase-subunit-GalU MAALNSKVKKAVIPVAGLGTRMLPATKAIPKEMLPLVDKPLIQYVVNECIAAGITEIVLVTHSSKNSIENHFDTSFELEAMLEKRVKRQLLEEVQSICPPHVTIMQVRQGLAKGLGHAVLCAHPVVGNEPVAVILPDVILDEYESDLSQDNLAEMIRRFDETGCSQIMVEPVEDVTAYGVVDCKGAALAPGESVPMVGVVEKPKADVAPSNLAVVGRYVLSADIWPLLAKTPPGAGDEIQLTDAIDMLIEKETVEAYHMKGKSHDCGNKLGYMQAFVEYGIRHNSLGAEFKAWLEDELGIKK >LR134204.1|VEB84166.1|266813_267827_-|response-regulator-of-RpoS MTQPLVGKQILIVEDEPVFRSLLDSWFSSLGATTALAGDGLDALELLGSFTPDLMICDLAMPRMNGLKLVEHLRNRGDQTPILVISATENMADIAKALRLGVEDVLLKPVKDLNRLRETVFACLYPNMFNSRVEEEERLFRDWDAMVDNPVAAAKLLQELQPPVQQVISHCRINYRQLVAADKPGLVLDIAPLSDNDLAFYCLDVTRAGDNGVLAALLLRALFNGLLQEQLAHQNQRLPELGALLKQVNHLLRQANLPGQFPLLVGYYHSGLKNLILVSAGLNATLNTGSHQVQISNGVPLGTLGNTYLNQLSQRCESWQCQIWGAGGRLRLMLSAE >LR134204.1|VEB84169.1|267917_268361_-|patatin-like-phospholipase MAPSLTLFPVSLTRAMGADIVIAVDLQHDAHLMQQDLLSLNVSNDNDESDDSLSWHARLKERLSSMTSRRAVTAPTAMEIMTTSIQVLENRLKRNRMAGDPPDILIQPFCPQISTLDFHRASAAIAAGQLAVEKKMDELLPLVRTDV >LR134204.1|VEB84172.1|268936_269284_+|SEC-C-motif MSQLCPCGSAVEYSLCCGPIVSGERVAPDPSHLMRSRYCAFVMKDANYLIRTWHPACGATAFRDDIVAGFAHTEWLGLTIFEESGSDAENTGYVSFVARFIEQGIPARLLNVRVS >LR134204.1|VEB84175.1|269443_270286_+|formyltetrahydrofolate-deformylase MHSLQRKVLRTICPDQKGLIARITNICYKHELNIVQNNEFVDHRTGRFFMRTELEGIFNDVTLLADLDSALPEGSVRELNPAGRRRVVILVTKEAHCLGDLLMKANYGGLDVEISAVIGNHETLRPLVERFDIPFELVSHEGLTRDEHDKQMADAIDAHQPDYVVLAKYMRVLTPEFVSRFPNKIINIHHSFLPAFIGARPYHQAYERGVKIIGATAHYVNDNLDEGPIIMQDVIHVDHTYTAEDMMRAGRDVEKNVLSRALYKVLAQRVFVYGNRTIIL >LR134204.1|VEB84178.1|271303_272665_-|respiratory-nitrate-reductase-1-subunit-gamma MIELVIVSRLLEYPDAALWQHQQELFDALASSENLDKEDAQSLAVFLRDLTAQEMLDVQASYSELFDRGRATSLLLFEHVHGESRDRGQAMVDLMAQYEQHGLQLDSRELPDHLPLYLEYLAQLPKNDALGGLQDIAPILALLGARLQQRESRYAVLFDLLLKLANTVIDSDKVAEKIADEVRDDTPQALDAVWEEEQVKFFAEQGLWRVGDFRSPASFCRGCRSAIFEYHHRRTAIMHFLNMFFFDIYPYIAGSVFLIGSWLRYDYGQYTWRAASSQMLDRKGMNLASNLFHFGILGIFAGHFLGMLTPHWMYEAFLPIEVKQKMAMIAGGACGVMTLVGGVLLLKRRLFSPRVRATTTGADILILSLLVIQCALGLLTIPFSAQHMDGSEMMKLVGWAQSVVTFHGGASEHLEGVAFIFRLHLVLGMTLFLLFPFLAPGSHLERAGRVPDA >LR134204.1|VEB84181.1|272661_274197_-|respiratory-nitrate-reductase-1-subunit-beta MKIRSQVGMVLNLDKCIGCHTCSVTCKNVWTSREGMEYAWFNNVESKPGVGFPNDWENQEKWKGGWVRKINGKLQPRMGNRAMLLGKIFANPHLPGIDDYYEPFDYDYQNLHTAPESKHQPIARPRSLITGQRMNKITSGPNWEEILGGEFEKRAKDQNFENMQKAMYGQFENTFMMYLPRLCEHCLNPACVATCPSGAIYKREEDGIVLIDQDKCRGWRMCITGCPYKKIYFNWKSGKSEKCIFCYPRIEAGQPTVCSETCVGRIRYLGVLLYDADAIESAASTENEKDLYQRQLDVFLDPNDPAVIEQALKDGVPQSVIDAAQQSPVYKMAMDWKLALPLHPEYRTLPMVWYVPPLSPIQSAADAGELGSNGILPDVESLRIPVQYLANLLTAGDTQPVLLALKRMLAMRHYKRAETVDGKLDTRALEEVGLSEAQAQEMYRYLAIANYEDRFVIPSSHRELAREAFPEKSGCGFTFGDGCHGSDTKFNLFNSRRIDAIDVSSKTEPHQ |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_2 | 974610-974707 | Orphan |
NA
Consensus repeat of LR134204_2
|
1 spacers
spacers of LR134204_2
>2.1|974641|36|LR134204|CRISPRCasFinder ACAAAACAGTACACCGTAGGCCTGGCTGAACTCCGC |
CRISPR arrays and Neighbor proteins around LR134204_2
The CRISPR arrays of LR134204_2 >merge|LR134204|2|974610-974707|CRISPRCasFinder CGGATGGCGGTGCAAGCACCTTATCCGGCCTACAAAACAGTACACCGTAGGCCTGGCTGAACTCCGCGCGGATGGCGGGCAAGCACCTTATCCGGCCT >LR134204|2|2|974610-974707|CRISPRCasFinder CGGATGGCGGTGCAAGCACCTTATCCGGCCT ACAAAACAGTACACCGTAGGCCTGGCTGAACTCCGC GCGGATGGCGGGCAAGCACCTTATCCGGCCT
>LR134204.1|VEB86766.1|973964_974453_-|colicin-V-production-protein MVWIDYAIIAVIGFSCLVSLIRGFVREALSLVTWGCAFFVASHYYTYLSVWFTGFEDELVRNGIAIAILFIATLIVGAIVNFVIGQLVEKTGLSGTDRVLGICFGALRGALIVAAILFFLDTFTGLSKSEDWSKSQLIPQFSFIIRWFFDYLQSSSSFLPRT >LR134204.1|VEB86762.1|972411_973929_-|amidophosphoribosyltransferase MCGIVGIAGVMPVNQSIYDALTVLQHRGQDAAGIITIDANNCFRLRKANGLVSDVFEARHMQRMQGNMGIGHVRYPTAGSSSASEAQPFYVNSPYGITLAHNGNLTNAHELRKKLFEEKRRHINTTSDSEILLNIFASELDNFRHYPLEADNIFAAIAATNRQIRGAYACVAMIIGHGMVAFRDPNGIRPLVLGKRDIGDGRTEYMVASESVALDTLGFEFLRDVAPGEAVYITEKGQLFTRQCADNPVSNPCLFEYVYFARPDSFIDKISVYSARVNMGTKLGEKIAREWEDLDIDVVIPIPETSCDIALEIARILGKPYRQGFVKNRYVGRTFIMPGQQLRRKSVRRKLNANRAEFRDKNVLLVDDSIVRGTTSEQIIEMAREAGAKKVYLASAAPEIRFPNVYGIDMPTANELIAHGREVDEIRQIIGADGLIFQDLDDLIEAVRAENPDIQQFECSVFNGVYVTKDVDQQYLDYLDSLRNDDVKAMQRQNEVENLEMHNEG >LR134204.1|VEB86759.1|971744_972314_-|3-octaprenyl-4-hydroxybenzoate-carboxy-lyase MKRLIVGISGASGAIYGVRLLQVLRDVADVETHLVMSPAARQTLALETEFSLREVQALADVTHDARDIAASISSGSFQTAGMVILPCSIKTLSGIVHSYTDGLLTRAADVVLKERRPLVLCVRETPLHLGHLRLMTQAAEIGAVIMPPVPAFYHCPQTLDDVINQTVNRVLDQFDITLPHDLFTRWQGA >LR134204.1|VEB86756.1|971300_971456_-|lysine-arginine-ornithine-binding-periplasmic-protein MKKTVLALSLLVGLSATAASYAALPQTVRIGTDATYAPFSSKDAKGDFVRF >LR134204.1|VEB86753.1|970672_971329_-|lysine-arginine-ornithine-binding-periplasmic-protein MPKGILSGFDIDLGNEMCKRMQIKCTWVGSDFDALIPSLKAKKIDAIISSLSITEKRQQEIAFSDKLYAADSRLIAAKGSSVQPTIDSLKGKHVGVLQGSTQEAYANDNWRSKGVDVVAYANQDLIYSDLTAGRLDAALQDEVAASEGFLKQPAGKDYAFAGPSVKDKKYFGDGTGVGLRKDDTELKAAFDKALSDLRADGTYDKMAKKYFDFNVYGD >LR134204.1|VEB86750.1|969642_970425_-|histidine-ABC-transporter-substrate-binding-periplasmic-protein MKKLVLSLSLVLAFSSATAAFAAIPQKVRIGTDPTYAPFESKNAQGELVGFDIDLAKELCKRINTQCTFVENPLDALIPSLKAKKIDAIMSSLSITEKRQQEIAFTDKFYAADSRLVVAKDSDIQPTVESLKGKRVGVLQGTTQETFGNEHWAPKGIEIVSYQGQDNIYSDLTAGRIDAAFQDEVAASEGFLKQPVGKDYKFGGPSVKDEKLFGVGTGMGLRKEDNELREALNKAFAEMRADGTYEKLAKKYFDFDVYGG >LR134204.1|VEB86747.1|968770_969457_-|histidine-ABC-transporter-permease MLYGFSGVIFQGALVTLELALSSVVLAVLIGLVGAGAKLSQNRVTGLIFEGYTTLIRGVPDLVLMLLIFYGLQIALNAVTDAMGIGQIDIDPMVAGIITLGFIYGAYFTETFRGAFMAVPKGHIEAATAFGFTGSQIFRRIMFPAMMRYALPGIGNNWQVILKATALVSLLGLEDVVKATQLAGKSTWEPFYFAVVCGLIYLVFTTVSNGVLLFLERRYSVGVKRADL >LR134204.1|VEB86744.1|968057_968774_-|histidine-ABC-transporter-permease MIEIIQEYWKSLLWSDGYRFTGVAITLWLLISSVVMGGILALFLAIGRVSSNKFIQFPIWLFTYVFRGTPLYVQLLVFYSGMYTLEIVKGTDLLNAFFRSGLNCTVLALTLNTCAYTTEIFAGAIRSVPHGEIEAARAYGFSSFKMYRCIILPSALRIALPAYSNEVILMLHSTALAFTATVPDLLKIARDINSATYQPFTAFGIAAVLYLIISYVLISLFRKAEKRWLQHVKPSSTH >LR134204.1|VEB86741.1|967276_968050_-|histidine/lysine/arginine/ornithine-transporter-subunit MSENKLNVIDLHKRYGEHEVLKGVSLQANAGDVISIIGSSGSGKSTFLRCINFLEKPSEGSIVVSGQNISLVRDKDGQLKVADKNQLRLLRTRLTMVFQHFNLWSHMTVLENVMEAPIQVLGLSKHEARERAVKYLAKVGIDERAQGKYPVHLSGGQQQRVSIARALAMEPEVLLFDEPTSALDPELVGEVLRIMQQLAEEGKTMVVVTHEMGFARHVSTHVIFLHQGKIEEEGDPEQVFGNPQSPRLQQFLKGSLK >LR134204.1|VEB86738.1|966639_967179_+|acetyltransferase MPDFWNGDFMAIITTSRLTLSLFQPTDWSFFLALRENPDIMRYMADITPEKDTRLLFARRLVSKHTFVIRLHNSDKPLGDIGLQISHHYPQEADIGYTVLPEAQGRGIATEALYAVCDYAFTHTSVKAINAYVLAENHASVRVLEKRGFTRTQVLENAYEVNGIRYDEWVYRLESEMGR >LR134204.1|VEB86769.1|974767_975451_-|peptidoglycan-binding-protein MASKFQNRLVGTIVLVALGVIVLPGLLDGQKKHYQDEFAAIPLVPKPGDRDEPDMMPAATQALPTQPPEGAAEEVRAGDAAAPSLDPSRFAAENNASFDPVTAPVEPPKPKPVEKPKPQPKPQQPVAATPTPAPEPKPAAEEKPAPTGKAYVVQLGALKNADKVNEIVGKLRGAGFRVYTSPSTPVQGKITRILVGPDASRDKLKNSLGELQQISGLSGVVMGYSPN >LR134204.1|VEB86772.1|975440_976847_-|bifunctional-folylpolyglutamate-synthase/-dihydrofolate-synthase MSLHRQVIRVSESSQHTCSLKDDEYKRAGPAGAAFCFSQRNKFTGIMDNKRIPQAASPLAAWLSYLENLHSKTIDLGLERVSRVAARLGVLKPAPFVFTVAGTNGKGTTCRTLESVLTAAGYKVGVYSSPHLVRYTERVRVQGNELPESAHTASFAEIEAAREDISLTYFEYGTLSALWLFKQAQLDVVILEVGLGGRLDATNIVDADVAVVTSIALDHTDWLGPDRESIGREKAGIFRAEKPAIVGEPEMPYTIADVAQETGAQLQRRGVDWHYDVTGDGWSFTDGEGTLANLPLPQVPQPNAATALAALRASGLNISEQAIRDGIARAILPGRFQIVSESPRVILDVAHNPHAAEYLTERLKMLPKNGRILAVIGMLHDKDIAGTLAWLKSVVDDWYCAPLEGPRGATAEQLLEHLGKGTVYDSVALAWHAALAEAKPEDTVLVCGSFHTVAHVMEVMDAGRSGGE >LR134204.1|VEB86775.1|976901_977816_-|acetyl-CoA-carboxylase-subunit-beta MSWIERIKSNITPTRKASIPEGVWTKCDSCGQVLYRAELERNLEVCPKCDHHMRMSARNRLHSLLDEGSLVELGSELEPKDVLKFRDSKKYKDRLASAQKETGEKDALVVMKGTLHDMPVVAAAFEFSFMGGSMGSVVGARFVRAVEQALEDNCPLICFSASGGARMQEALMSLMQMAKTSAALAKMQERGLPYISVLTDPTMGGVSASFAMLGDLNIAEPKALIGFAGPRVIEQTVREKLPPGFQRSEFLIEKGAIDMIVRRPEMRLKLASVLAKLMNLPAPNPDAPREGEVVPPVPDQEPEA >LR134204.1|VEB86778.1|977968_978628_-|SNARE-associated-Golgi-protein MDLIYFLIDFILHIDVHLAELVAEYGVWVYAILFLILFCETGLVVTPFLPGDSLLFVAGALASLETNDLNVHIMVMLMLIAAIVGDAVNYTIGRLFGERLFSNPNSKIFRRSYLDKTHQFYEKHGGKTIILARFVPIVRTFAPFVAGMGHMSYRHFAAYNVIGALLWVLLFTYAGYFFGTIPMIQDNLKLLIVGIIVVSILPGVVEIIRHKRAASRAAK >LR134204.1|VEB86781.1|978651_979464_-|tRNA-pseudouridine-synthase-A MSAVEQPPVYKIALGIEYDGSKYYGWQRQNEVRSVQEKLEKALSQVANEPVNVFCAGRTDAGVHGTGQVVHFETTALRKDAAWTLGVNANLPGDIAVRWVKAVPEDFHARFSATARRYRYIIYNHRLRPAILGHGVTHFYEPLDAERMHRAAQCLIGENDFTSFRAVQCQSRTPWRNVMHINVTRHGAYVVVDIKANAFVHHMVRNIVGSLMEVGAHNQPESWIAELLAAKDRRLSAATAKAEGLYLVAVDYPERFDLPKTPLGPLFLAD >LR134204.1|VEB86784.1|979463_980477_-|putative-semialdehyde-dehydrogenase MSEGWNIAILGATGAVGEALLETLAERQFPVGEIYALARNESAGEHLRYSGKSVIVQDVADFDWTQAQLAFFVAGAEASAAWVEEATNAGCLVIDSSGLFALEPDVPLVVPEVNPSVLADYRNRNVIAVANSLTSQLLASLKPLIDEGGLSRISVTSLLSASAHGKKAVDALAGQSAKLLNGIPIDEDDFFGRQLAFNMLPLLPDSEGSVREERRIVDEVRKILQDDGLMISASCIQSPVFYGHAQMVSFEALRPLAAEEARDAFSRGEDIVLSEETDFPTQVGDASGNPQLSVGCVRNDYGMPEQVQFWSVADNVRFGGALMAVKTAEKLVQEYLY >LR134204.1|VEB86787.1|980538_981675_-|erythronate-4-phosphate-dehydrogenase MKILVDENMPYARELFSRLGEVKAVPGRPIPVAELDDADALMVRSVTKVNEALLGGKSIKFVGTATAGTDHVDEAWLKQAGVGFSAAPGCNAIAVVEYVFSALLMLAERDGFALSDRTVGIVGVGNVGARLQARLEALGIRTLLCDPPRADRGDEGDFRSLDELVQEADILTFHTPLYKEGPYKTLHLADEALIGRLKPGTILINACRGPVVDNTALLACLNAGQSLSVVLDVWEGEPDLNVALLEKIDIGTSHIAGYTLEGKARGTTQVFEAYSTFIGRGQKVALDTLLPAPEFGRITLHGPLDQPTLKRLAHLVYDVRRDDAPLRKVAGIPGEFDKLRKNYLERREWSSLYVMCDDASAATLLHKLGFNAVHHPAH >LR134204.1|VEB86790.1|981774_982779_+|flagella-biosynthesis-regulator MMQPISGTPARPPGEGQTAPSVAGEQPLSTQQRTVLERLITRLISLTQQQSAEVWAGMKHDLGVKSDTPLLSRHFPAAEQNLTQRLSVAQQNHANRQVLSQLTELLSQGNNRQAVSDFIRQQYGQTALSQLTPEQLKNVLTLLQQGQLSIPQPQQRPPTDRPLLPAEHNTLNQLVTKLAAATGESGKLIWQSMLELSGVKSGELIPAKQFTHLVTWLQARQTLSNQSAPTLQTLQAALKLPLEPNEFTALRDYAQQTYQALPHTILTTAQVQDLLNQVFLRRVARERELTEPHHIQPIYSPFAPMIETIKSVTARPGLIFIALAVAFILFWLVS >LR134204.1|VEB86793.1|982775_983954_-|major-facilitator-superfamily-protein MTAVSQKEKTPSANFSLFRIAFAVFLTYMTVGLPLPVIPLFVHHELGYGNTMVGIAVGIQFLATVLTRGYAGRLADQYGAKRSALQGMLACALAGGAWLLAALLPVSVPVKFALLIVGRLILGFGESQLLTGTLTWGMGLVGPSRSGKVMSWNGMAIYGALAAGAPLGLLIHSHYGFAALAGTTMALPLLAWAFNGTVRKVPAHAGERPSLWSVVGLIWKPGLGLALQGVGFAVIGTFISLYFASKGWAMAGFTLTAFGGAFVLMRVLFGWMPDRFGGVKVAVASLLVETAGLVLLWLAPVAWIALLGAALTGAGCSLIFPALGVEVVKRVPSQVRGTALGGYAAFQDISYGVTGPLAGLLATSCGYSSVFLAGAISAVVGIVVTLLSFRRG >LR134204.1|VEB86796.1|984195_984720_+|Protein-of-uncharacterised-function-(DUF3828) MKAFPLITLILLLCGCATPHRDSTQDINQFYLSWMKTFTEDQDTPHDKSALMQRYVAKELINRLTLIDNLYEQEIVASDYFMYVQDYAPEWIAKFHAGTASAFLGGEKVDVWLGDSDTRLIHLMVYTRRENGQWKIYRVRDLTHQFEHPIYDAGAIARARAWSAKVAQEYENKQ |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_3 | 1030565-1030660 | Orphan |
NA
Consensus repeat of LR134204_3
|
1 spacers
spacers of LR134204_3
>3.1|1030600|26|LR134204|CRISPRCasFinder CATAACCGCCGGATGGCGGCGTAAAT |
CRISPR arrays and Neighbor proteins around LR134204_3
The CRISPR arrays of LR134204_3 >merge|LR134204|3|1030565-1030660|CRISPRCasFinder GCCTTATCCGGCCTACATGTAAAACGAATAACCGGCATAACCGCCGGATGGCGGCGTAAATGCCTTATCCGGCCTACATGTAAAACGGATAACCGG >LR134204|3|3|1030565-1030660|CRISPRCasFinder GCCTTATCCGGCCTACATGTAAAACGAATAACCGG CATAACCGCCGGATGGCGGCGTAAAT GCCTTATCCGGCCTACATGTAAAACGGATAACCGG
>LR134204.1|VEB86936.1|1029685_1030543_+|AraC-family-transcriptional-regulator MKEPGLPADQQFFADLFSGLVLNPQRLGRVWFATQHTALPTGSLCIDLPRLDIVLRGEYGNLLEKKQHALTEGEMLFIPARAANLPGSDKPVMLLSLVFAPAWLGLSFYDNRTASLLRPVRRIELAHQQRGEGEAMLTALTHLSRSPQAQDIIQPLVLSLLHLCRSVVNTRPDSRRPRSEFLYHSMCNWVQDNYARPLTRESVAQFFNITPNHLSKLFTQHGTMSFVEYVRWVRMAKARIILQKYHLSISEVAERCGYHDSDYFCRLFRRQFGLTPGEYSARFQG >LR134204.1|VEB86933.1|1028936_1029671_+|two-component-system-response-regulator MKVIIVEDEFLAQQELSWLIKEHSQMEIVGTFDDGLDVLKFLQHNRVDAIFLDINIPSLDGVLLAQNISQFAHKPFIVFITAWKEHAVEAFELEAFDYILKPYQESRIVGMLQKLEAAWQQQQVVAVPGSPVARENDTINLIKDERIIVTPINDIYYAEAHEKMTFVYTRRESYVMPMNITEFCSKLPASHFFRCHRSFCVNLNKIREIEPWFNNTYILRLKDLEFEIPVSRSKVKEFRQLMHL >LR134204.1|VEB86930.1|1027659_1028922_+|two-component-system-sensor-kinase MLCETLTMILVIVWAPTIALGVDIVSKIGIPMILGSVCIGFIVLLVQSVEGEKEASAARQAKLALDIANKTLPLFRQVNSESLRQVCEIIRHDIHADAVAITNTEHVLAYVGVGETNYRNNDDFVSPTTQQAINYGKIIIKNNDEAHRTPEIHSMLVIPLWEKGVVTGTLKIYYCHAHQITSSLQEMAIGLSQIISTQLEVSRAEQLREMANKAELRALQSKINPHFLFNALNAISSSIRLNPDTARQLIFNLSRYLRYNIELKDDEQIDIKKELYQIKDYIAIEQARFGDKLTVIYDIDEEVNCYIPSLLIQPLVENAIVHGIQPCKGKGVVTISVAECGNRVRIAVRDTGHGIDPKVIERVKANEMPGNKIGLLNVHHRVKLLYGEGLHIRRLEPGTEIAFYVPNQRSPAAAPATLLL >LR134204.1|VEB86927.1|1027223_1027670_+|two-component-system-sensor-kinase MHEIFNMLLAVFDRAALMLFCLFFLIRIRLFRELLHKSAHSPKELLAVTAIFSMFALFSTWSGVPVEGSLVNVRIIAVMSGGILFGPWVGIITGVIAGIHPLPYRYWRGHCRTVFYHQYSGGMYRGLDKPENPKSTALARGYFGRDAL >LR134204.1|VEB86924.1|1025593_1026694_-|aminotransferase MGQRLLIIVEKLCTVAQRPDTHGYSTSRGIPRLRRAISRWYQERYDVDIDPETEAIVTIGSKEGLAHLMLATLDHGDTVLVPNPSYPIHIYGAVIAGAQVRSVPLVEGVDFFNELERAIRESYPKPKMMILGFPSNPTAQCVELEFFEKVVALAKRYDVLVVHDLAYADIVYDGWKAPSIMQVPGARDVAVEFFTLSKSYNMAGWRIGFMVGNQTLVSALSRIKSYHDYGTFTPLQVAAIAALEGDQQCVRDIAEQYKRRRDVLVKGLHEAGWMVEMPKASMYVWAKIPEQYAAMGSLEFAKKLLNDAKVCVSPGIGFGDYGDTHVRFALIENRDRIRQAIRGIKTMFRADGLLPAGAKSVTEGTE >LR134204.1|VEB86921.1|1024171_1025092_+|lipid-A-biosynthesis-palmitoleoyl-acyltransferase MFPQCKFSRALLHPRYWLTWFGLTVLWLLVLLPYPVLRFLGTRTGRLARPFLKRRESIARKNLELCFPNLMPEEREKMIVENFHSLGMALIETGMAWFWPDKRVRKWFDVDGLDNLMQAQAQKRGVMVVGVHFMSLELGGRVMGLCQPMMATYRPHNNSLMEWVQTRGRMRSNKAMIGRNNLRGIVGALKKGEAVWFAPDQDYGRKGSSFAPFFAVKDVATTNGTYVLSRLSGAAMLTVTMIRKGDNSGYRLYITPEMEGYPEEENQAAAYMNKIIEKEIMRAPEQYLWIHRRFKTRPLGEASLYV >LR134204.1|VEB86918.1|1023805_1024024_-|Uncharacterised-protein MLGQLMYHRGINISQKLLIKMRKYFLMGVKDDANVLLVMALPTSHSLKYQVLTLHFHSSSYFIHFSVFHAEI >LR134204.1|VEB86915.1|1019209_1023481_-|outer-membrane-autotransporter MQLSKVFLAVVSALCATNAVATNINNNQERTIDYLWSDTSPTNVGYETNGILNIIRGGEVQTDLLKIGRAGSNGIVNVVDGGKLTIRASSYSYPLDIGGTTDVSSTQSKGGTGTLNVSGEGSIVTVSVGTRAMQVGAGAGSGVLNITDGGKFYHEDPSISFGGIWIGGRRATSAETFGIVNVDGKGSELRTASRIIVGTYGSGSLITSNGGVTRAEATIQVGNQAETKVYDNLLKVDGTDSIVSAGGILTVGLAGRGTAVASDNGTLSAPEIRIASSAGSLGELATGARAGENAVAAGMIDAQKIIFGSGNGVLTLNHTSSDFSLGADISGNGAINALSGVSALSGDNSAYQGDFNIDAPATLVISEQKNIGSSDIAVTGGTLAIDTDHDWQFINLLTGQGTLAVNTGGNIFDFNSSSLTDNFSGILALKETLFHLADTNTAALKTMWLKVGQGATVKVGPGQQVIDGLAFDGGTLVFGDIIPGQTTTENTVHTTGILDISGQGTVQVTTGAAFSNDRPTPDTHIPLLEQDNSNILVQLVSSDGGVVGNGGNLTLTDQSSNAISDAVEADIAQNGVVVAKGTYDYRLTGGDSDDGLYVSYGLTQVDLLGKDADALILDANGKSGNAADLSARVTGSGDLAFDSQKGQTVTLSNMDNDYSGVTDVRSGNLAMLNDNVLGNTRELKLAGDTGFDMRGHSQTIGKLTAESGSLTDLNGGHLTLTNGGEASGVLTGDGELIVAGGTLNVSGANTGLKATTTIAQGATAVLDNTLGLGTGDIVAAGLLNLSNATGVLYNSISDAGKVSLDASDVVLAGNNSHFAGTFDIDNDSTLTASSAQQLGTSAIQNAGKFVLNTHENWSLENGVTGSGSVVKNGSGNVTLSDSAQWTGATDINAGGLTLGSADNAFTLASHQVNIGKDGRLSGFGGVAGNMANQGTLLIGDDVSAARRAASSPVSFTVGGNLTNSGDIWTGSKGKDAGNQLVVNGNYQGDGGHLHLNTALNDDNSVTDKLIVKGNTSGTTGVSVTNAGGSGAQTINGIEVIHVDGQSDGEFTQDGRIVAGAYDYSLARGQGDNNGNWYLTSHKTDPNPGPGPKPEPDIRPEPGSWTANLAAANTLFVTRLHDRLGETQYIDALTGEKKVTSMWMRQAGGHNAWRDSSGQLKTQSNRYVMQVGGDIARWSGDELDRWHLGVMAGYGHNSSNTRSSSTGYRSDGSVNGYSAGVYATWYANDETHQGAYLDSWAQYSWFNNSVKGQDIQSESYKSKGITGSLELGYTRKLGEFAGSKGSKNEWFIQPQAQAIWMGVEADDHRESNGTRISGEGDGNVQTRLGVRTFLKGHSAIDEGKAREFQPFVEVNWLHNTRDFGTKMDGVSIRQEGARNLGEIKTGVEGQINPQLNVWGSVGVQLGDKGYNDTSAMIGIKYNFK >LR134204.1|VEB86912.1|1018728_1019112_-|LuxR-family-transcriptional-regulator MIVISENSFFHLGVSALVERMALNTTRNLIVFDTGRDCLYMLDTEEQRCLFFHEPILIFSHCRRIYLTEKNRKFKIETLLSKTKSHPAVRRKREVLTVSELRVIKKVVRDQSTTNCRTVQGQRKDDQ >LR134204.1|VEB86909.1|1018532_1018721_-|LuxR-family-transcriptional-regulator MNALRKLSINKTTKFFVEYLAWLRLWHEYVEYKNNQMRKPVKLSVITPETFTKRRSEVLIEV >LR134204.1|VEB86939.1|1030760_1033256_-|multiphosphoryl-transfer-protein-1-[includes-phosphoenolpyruvate-protein-phosphotransferase;phosphocarrier-protein-Hp;-fructose-like-phosphotransferase-enzyme-IIA-component] MLTIQFLCPLPNGLHARPAWELKEQCSQWQSEVTFINHRQNARADAKSSLALIGTGTLFNDSCSLNISGSDEEQARRSLETYLQHRFIDSDSVQPTSAELAAHPLPRSLVRLNPDLLYGAVLANGVGAGALTLWQNDNLEAYRAIPASAEDSTRLEHSLATLAEQLNQQLRERDGESKTILSAHLSLIQDDEFAGNIRRLMDEQHKGLGAAIIANMEQVCDKLSSSASDYLRERVSDIRDISEQLLHITWPERRPRNTLILEQPTILVAEDLTPSQFLSLDLQHLSGMILEKTGRTSHTLILARASAIPVLSGLPLDALSRYAGQPAVLDAQCGVLAINPNAAVCGYYAIAQQLADKRQQQQAREASLPAFSRDNQRLDIAANIGTALEAPGAFANGAEGIGLFRTEMLFMDRDSAPDEQEQFEAYQQVLLAAGEKPVIFRTMDIGGDKNIRYLNIPQEENPFLGYRAVRIYPEFAGLFRTQLRAILRAATFGQAQLMIPMVHSLDQILWVKSELQKAIVELKRDGLRHAATIPLGIMVEVPSVCYIIDHFCDEVDFFSIGSNDMTQYLYAVDRNNPRVSPLYNPITPSFLRMLQQIVHAAHQHGKWVGICGELGGESRYLPLLLGLELDELSMSSPRIPGVKSQLRQMDSQACRALAAQACECRSAEEIETLLNQFAPQKDVRPLLALENIFVGESLSNKEQVIQFLCGNLGVNGRTEHPFELEEDVWQREEIVTTAVGFGVAIPHTKSQWIRHSSISIARLARPVDWQSEMGEVELVIMLTLGADEGINHVKVFSQLARKLVNKNFRQSLFAAQDAESLLALLEAELTF >LR134204.1|VEB86942.1|1033282_1033954_-|exoaminopeptidase MRVDIGARSYDEVIQAGVRPGDRVTFDSAFQVLPHQRVMGKAFDDRLGCYLLIMLLREWHDAALPAEVWLVASSSEEVGLRGGQTAARAVAPDIAIVLDTACWAKNFDYGAANHRQIGKGPMLVLADKSLIAPPKLTAWVESVASDSGLPLQLDMFSNGGTDGGAVHLTGTGTPTVVIGPATRHGHCAASIADCRDIIHTQQLLSALITRLTRETVVHLTDFR >LR134204.1|VEB86944.1|1033914_1034562_-|exoaminopeptidase MAIIFGHNTGHAIGIEVHEAPRFSPTDTTRLAAGMLLTVEPGIYLPGQGGVRIEDVVLVTEDGAEVLYTIAENSIAHGRSIMDLSLLKALSEADAIASSEQEVRQILLEEADRLHKEVRFDGLGSVLIRLNESDGPKVMICAHMDEVGFIVRSISSEGAIDVLPVGNVRMAAASSSPCVLPRGKTVKFPVCWMASVMEMRLAPCGSILAHAHMMR >LR134204.1|VEB86947.1|1034515_1034737_-|aminopeptidase MTRTFLVSGQDAPVASHPLFAVYQTVLEAQQAAIAAIRPGVCCQAVDAAARRVIEAAGYGDYFRPQHRARYRH >LR134204.1|VEB86950.1|1034785_1035406_-|aminopeptidase MTLRASLRQWLVEQRLDAVLISSRQNKQPHLGISSSAGFVFITHKSAHLLVDGRYYADVEARAHGDQLHLLAGSQTLASIVNQIIDDEQLQVIGFEGAQVSWDCAQRWQERLRAKLVSASPDALRQIKTAQEIALIREACRIADCSAEHIRRFIRPGLREREIAAELEWFMRQQGAEKPLFDTIVASGWRGALPHGKSQRKNRRGG >LR134204.1|VEB86952.1|1035422_1036046_-|fructose-like-specific-PTS-system-EIIC-component-1 MLAMYYVITPFGGWINGGIRTLLTAAGEKGTLMYAMGISAATAIDLGGPINKAAGFVAFSFTTDHVLPVTARSIAIVIPPIGLGLATILDRRLTGKRLFNAQLYPQGKTAMFLAFMGISEGAIPFALESPITAIPSYMVGAIVGSTTAVWLGAVQWFPESAIWAWPLVSNLGVYMAGIVLGAVITALMVVFLRHMMYRRGKLLIESL >LR134204.1|VEB86954.1|1036389_1036668_-|fructose-like-specific-PTS-system-EIIC-component-1 MAIKKRSATVVPGASGAAAAIKTPPASRNSFWGELPQHVMSGISRMVPTLIMGGVILAFSQLIAYSWLKIPADTGIMDALNSGKFDGFNSHY >LR134204.1|VEB86956.1|1037264_1038230_-|glucokinase MTKYALVGDVGGTNARLALCDMTSGEISQAKTYSGLDYPSLEAVVRVYLDEHSARVEDGCIAIACPITGDWVAMTNHTWAFSIAEMKKNLGFSHLEIINDFTAVSMAIPMLRKEHLIQFGGAEPVAGKPIAVYGAGTGLGVAHLVHVDKRWISLPGEGGHVDFAPNSEEEGIILEELRAEIGHVSAERVLSGPGLVNLYRAIVKSDGRLPENLQPKDITERALADTCIDSRRALSLFCVIMGRFGGNLALTLGTFGGVYIAGGIVPRFLEFFKASGFRGGFEDKGRFKAYVQDIPVYLIVHDNPGLLGSGAHLRQTLGQIL >LR134204.1|VEB86959.1|1038434_1039670_+|voltage-gated-chloride-channel-protein MLHPRARTMLLLSLPALIIGIASSLVLIAVMKVASVLQRLLWEQIPIGLGIAHDSPVWIIGMLTFTGIMVGLVIRYSPGHAGPDPATEPLIGAPMSPVALPGLLIALILGLAGGVSLGPEHPIMAVNIALAVAIGARLFPRIGALDWTILASAGTIGALFGTPVAAALIFSQTLNSSDDTPLWDRLFAPLMAAAAGSLTTSLFFHPHFSLPIAHYTQMRLSDIFSGAVVAAIAIAAGMVAVWCLPRLHSLVHRLKNPVLILGIGGFMLGILGVLGGPLTLFKGLDEMQQLAFSQTLGATDYFMLAIVKLGALVIAAACGFRGGRIFPAVFVGVALGLMLHAHVEAVPAAITVSCAILGLMLVVTRDGWLSLFMAAVVVPDTTLLPLLCIVMLPAWLLLAGKPMMAVNRHDR >LR134204.1|VEB86961.1|1039683_1041339_-|decarboxylase MQTPYSVADYLLDRLAGCGIGHLFGVPGDYNLQFLDHVIAHRDVCWVGCANELNAAYAADGYARLAGAGALLTTFGVGELSALNGLAGSYAEYVPVLHIVGAPCSGAQRRGELMHHTLGDGDFQHFYRISQAVTAASAVLNEQNACYEIDRVLREMLTMRRPGYLMLPADVAKRSAVPPVNVLTPDRAEEENDTVAAFRYRARQRLMNSPRVALLADFLALRFGLQPVLQRWMAETPIAHATLLMGKGLFDEQHPNFVGTYSAGASSESVRQAIEEADTVICVGTRFVDTLTAGFTQKLPQERTIEIQPHASRVGNSWFSGLSMEQAVTTLRELCLELSFSPPPTLAHAPGGHIEKGALTQENFWHTVQAYLLPGDIILADQGTAAFGAADLSLPAGADVLVQPLWGSIGYALPAAFGAQTACPERRVILITGDGAAQLTIQELGSMMRDGQSPVILLLNNDGYTVERAIHGANQRYNDIAGWNWTQVPQALSANCQAECWRVTQAIQLEEVLARLKSPQRLSLIEVVLPKADLPELLRTVTRALESRNGE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_4 | 1822206-1822326 | Orphan |
NA
Consensus repeat of LR134204_4
|
1 spacers
spacers of LR134204_4
>4.1|1822244|45|LR134204|CRISPRCasFinder ACCGGGCATTGCGCCGGATGGCGGCTTCGCCTTATCCGGCCTACG |
CRISPR arrays and Neighbor proteins around LR134204_4
The CRISPR arrays of LR134204_4 >merge|LR134204|4|1822206-1822326|CRISPRCasFinder GGGAAGTGCCATTTGTAGGCCCGGTAAGCGCCAGCGCCACCGGGCATTGCGCCGGATGGCGGCTTCGCCTTATCCGGCCTACGGGGAAGTGCCATTTGTAGGCCCAGTAAGCGCCAGCGCC >LR134204|4|4|1822206-1822326|CRISPRCasFinder GGGAAGTGCCATTTGTAGGCCCGGTAAGCGCCAGCGCC ACCGGGCATTGCGCCGGATGGCGGCTTCGCCTTATCCGGCCTACG GGGAAGTGCCATTTGTAGGCCCAGTAAGCGCCAGCGCC
>LR134204.1|VEB89618.1|1821585_1822161_+|phospholipid-binding-protein MKAFSPLAVLISALLLQGCVAAAVVGTAAVGTKAATDPRSVGTQVDDGTLELRVNSALSKDEQIKKEARINVTAYQGKVLLVGQSPNSELSSRAKQIAMGVDGATEVYNEIRQGQPIGLGTASNDTWITTKVRSQLLTSDQVKSSNVKVTTENGEVFLLGLVTERESKAAADIASRVSGVKRVTTAFTFIK >LR134204.1|VEB89615.1|1820985_1821576_+|DnaA-initiator-associating-protein-DiaA MLDRIKVCFTESIQTQIAAAEALPDAISRAAMTLVHSLLNGNKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIANDRLHDEIYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVALTGYDGGELAGLLGPQDVEIRIPSHHSARIQEMHMLTVNCLCDLIDNTLFPHQDD >LR134204.1|VEB89612.1|1820569_1820965_+|Uncharacterised-protein-family-UPF0102 MAQIPARANRPRQLTSKQTGDVWEAAARRWLESKGLRFVAANVRERGGEIDLIMRDGKTTVFVEVRYRRSAQFGGAAASVTWSKQHKLLQTARLWLARHNGSFDTVDCRFDVLAFTGNDVEWFKDAFNDRS >LR134204.1|VEB89609.1|1818680_1820612_+|lipoprotein MQGTAQADSGFYLQQMQQSSDDSKTNWQLLAIRALLKEGKSQHAIELFNQLPQNLNDAQRREQSLLAVEIKLAQKDIAGAQALLEKLTPADLDQNQQARYWQAKIDASQGRPSLSLLRALIAQEPLLSAKDKQKNMDATWQALSSMTQEQAQALVINADENVLQGWLDLQRVWFDNRNDPDMMKAGIADWQKRYPQNPGAKLLPTQLVNVQSFKPASTSKIALLLPLNGQAAIFGRTIQQGFEAAKNLGTQPVDAQPAAAPVAEPTASAEPAQPQAADGVASPSQAAVNDLTGDEQTQPVEQPVSAPAQTATASAPANPSAELKIYDTTSQPLDQILTQVQQDGASIVVGPLLKNNVEELIKSNTPLNVLALNQPEKVQNHANICYFALSPEDEARDAAHHIHDQGKQNPLLLIPRSGLGDRVANAFAQEWQKLGGGTVLQQKFGSTAELRMGVNGGSGIALTGSPVAASQPSQPGVTIGGLTIPAPPTDAQITGGGGRVDAVYILATPEEIGFIKPMIAMRNGSQSGATLYASSRSAQGTSGPDFRLEMEGLQYSEIPMLAGSNMPLMQQALGAVRNDYSLARMYAMGVDAWSLANHFSQMRQVPGFEINGNTGALTATQDCVINRKLSWLKYQQGQIVPAS >LR134204.1|VEB89606.1|1817639_1818503_-|Ribosomal-RNA-small-subunit-methyltransferase-I MKQNESADNSQGQLYIVPTPIGNLADITQRALEVLQAVDLIAAEDTRHTGLLLQHFAINARLFALHDHNEQQKAETLVAKLKEGQNIALVSDAGTPLINDPGYHLVRTCREAGIRVVPLPGPCAAIAALSAAGLPSDRFCYEGFLPAKSKGRRDALKAIEAEPRTLIFYESTHRLLDSLEDMVAVWGESRYVVLARELTKTWETIYGAPVGELLAWVKEDENRRKGEMVLIVEGHKAQNDALPADALRTLALLQAELPLKKAAALAAEIHGVKKNALYKYALEQQGE >LR134204.1|VEB89603.1|1816635_1817637_-|Protein-involved-in-cell-division MSTYQPPFVITPLILNQVIEVGELMGHWAAMAGRTSPLLRKENRIRTIQASLAIEHNSLTTEQVTALMEGKRILAPAKDIQEVRNAILAYEKMSEWKSEKLSDLLQAHRVLMLGLVDNPGQLRTGNVGVYREKQLIHMAPPASQVFRLMNDLLGWVKTTELHPLIAGAVFHYEFEFIHPFADGNGRMGRLWQTLILREWRSELAWLPVETLIHYQQDRYYQVLGQCDRQSSCTPFIEFILENIISALQEGLGKSSALSEEMSEEMSEEMSPDLLDIEKRILQILSNHPGRSAKSIADECELSARTVERYLKALQLKGKLQRVGATKGGMWRVI >LR134204.1|VEB89600.1|1815865_1816639_+|galactitol-utilisation-operon-repressor MNSFERRNKIVDLINTQGSVLVMDLSNTFGISEVTIRADLRLLEEKGLVTRFHGGAAKPGSHLAEGDNQEVILEDRYQLASDPKKRIAQAAASMVEEGMTIILDSGSTTLLIAEALTRKSNITVITNSLPAAFTLSDNKDLTLVVCGGTVRHKTHSMHGTIAERSLHGISADLMFVGADGIDATNGITTFNEGYSISSVMAAAAHKVIAVLDATKFNRRGFNQVLPMEKINCVITDDSISKQDKSALTKTGVELLVV >LR134204.1|VEB89597.1|1814933_1815755_+|zinc-binding-dehydrogenase MQPGDAVACVPLLPCFHCPQCERGYFSLCKQYQFVGSRSEGGNAEYVVVKRANLFRLPSDMPIEDGAFIEPITVGLHAFHLAQGCKGKNVIIVGAGTIGLLAMQCARELGAKSVTAIDINPQKLELAKELGATYTFNSREMTASEIYAALSDIQFDQLILETAGTPQTVSLTIEIAGPRAQLALVGTLHHDLTLTSGVFGQILRKELTILGSWMNYSSPWPGEEWETAARLLTEKRLQLEPLIAHRGDADSFADAVKALNGAPMQGKILLKLS >LR134204.1|VEB89594.1|1814712_1814883_+|Sorbitol-dehydrogenase MKSVVIHAEGDVRVEERPIPQIQADDDVLIKVISSGLCGSDIPRIFAHGRIIIQSR >LR134204.1|VEB89591.1|1814323_1814668_+|component-IIC-of-galactitol-specific-phosphotransferase-system MAVAVHQGNLFRTLISGVIIMGITLWIATQTIGLHTQLAANAGALKTGGMVASMDQGGSPVTWLLIELFTWQNVIGLVVIGAIYFTGVLLTWRRARNFMAAEKAAATQQSQTAS >LR134204.1|VEB89623.1|1822341_1823385_-|permease MLWLVQSSSQAVTPFQWWKPALFFLVVIVGLWYVKWQPYYGKAFTAAETHSIGKSILAQADANPWRAAWDYAMIYFIAVWKAAVLGVILGSLIQVLIPRDWLLRTLGQSRFRGTLFGTLFSLPGMMCTCCAAPVAAGMRRQQVSMGGALAFWMGNPLLNPATLVFMGFVLGWHFAAIRLVAGLAMVLVVATLVQKWVKESPQVDIPQEIAVSEPQGGFFARWGKALWALFWSTIPVYILAVLVLGAARVWLFPHADGAVDNTLMWVIAMAVAGCLFVIPTAAEIPIVQTMMLAGMGTAPALALLMTLPAVSLPSLIMLRKAFPVKALWLTGVLVAISGVIVGSIALV >LR134204.1|VEB89626.1|1823570_1824206_-|Semialdehyde-dehydrogenase,-NAD-binding-domain MTQVLITGATGLVGGHLLRMLLNEPRIHSIAAPTRRPLADMVGVYNPHDPQLTDALAQVTDPVDIVFCCLGTTRREAGSKEVFIHADYTLVVDTALTGRRLGARHMLVVSAMGANARSPFFYNRVKGEMEEALIAQQWPRLTIARPSMLLGDRAKHRMNETLFAPLFRLLPGNWKSIDARDVARAMLEEALAPEQEGVTILTSSQLRERAL >LR134204.1|VEB89629.1|1824238_1824679_-|Uncharacterized-protein-conserved-in-bacteria METLTAISRWLAKQHVVTWCVQHEGELWCANAFYLFDAQKVAFYVLTEEKTRHAQMSGPRAPVAGTVNGQPKTVARIRGIQFKGEIRKLEGEERDAARQAYLRRFPVARMLPAPVWEIRLDEIKFTDNTLGFGKKMYWLRSPIRSL >LR134204.1|VEB89632.1|1824722_1825034_+|GIY-YIG-nuclease-superfamily-protein MVMMTPWYLYLIRTADNALYTGITTDVARRYKQHQSGKGAKALRGKGELTLAFSARVGERSLALRMEYRIKRLTKRQKERLVTEGEGFEALLASLQTPTLKSD >LR134204.1|VEB89635.1|1825020_1825524_-|acetyltransferase MLIRVEIPIDAPGIDALLRRTFESDAEAKLVHDLREDGFLTLGLVATDDEGQVVGYVAFSPVDVQGEDLQWVGMAPLAVDENYRGQGLARQLVYEGLDSLNEFGYAAVVTLGEPALYSRFGFELAAHYDLHCRWPGTESAFQVHRLAEDALDGVTGLVEYHDHFNRF >LR134204.1|VEB89638.1|1825517_1826042_-|Putative-lipid-carrier-protein MLDKLRSRLVHFGPSLMSVPVKLTPFALKRQVLEQVLSWQFRQALADGELEFLEGRWLSITVRDIDLRWYTSVENGKLIVSQNAEADVSFSADASDLLMIAARKQDPDTLFFQRRLVIEGDTELGLYVKNLMDAIELEQMPKALRIMLLQLADFVEAGMKTSPETKQTSVGEPC >LR134204.1|VEB89641.1|1826262_1826898_+|peptidase MELLCPAGNLPALKAAIENGADAVYIGLKDDTNARHFAGLNFTEKKLQEAVNYVHQHRRKLHIAINTFAHPDGYVRWQRAVDMAAQLGADALILADLAMLEYAAERYPHIERHVSVQASATNEEAINFYHRNFDVARVVLPRVLSIHQVKQLARVTPVPLEVFAFGSLCIMAEGRCYLSSYLTGNPPIPSAPALRPASCAGNRPPRGWNRA >LR134204.1|VEB89644.1|1826855_1827257_+|peptidase MRWQQTSQGLESRLNEVLIDRYQDGENAGYPTLCKGRYLVDGERYHALEEPTSLNTLELLPDLLAANIASVKIEGRQRSPAYVSQVAKVWRQAIDRCMADPQNFVPQSAWMETLGSMSEGTQTTLGAYHRKWQ >LR134204.1|VEB89648.1|1827265_1828144_+|peptidase MKYSLGPVLYYWPKETLEAFYQQAATSSADVIYLGEAVCSKRRATKVGDWLDMAKSLAGSGKQVVLSTLALVQASSELGELKRYVENGDFLLEASDLGVVNMCAERKLPFVAGHALNCYNAVTLRLLLKQGMVRWCMPVELSRDWLVNLLNQCDELGIRNQFEVEVLSYGHLPLAYSARCFTARSEDRPKDECETCCIKYPNGRNVLSQENQQVFVLNGIQTMSGYVYNLGNELTSMQGVVDIVRLSPLGTETFAMLDAFRANENGSAPLPLAAHSDCNGYWKRLAGLELQV >LR134204.1|VEB89651.1|1828384_1828540_+|Uncharacterised-protein MTDKTIPFSLLDLAPIPKAPLRKRPSHTHWISPVWQSGAAITATGWRNITT |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_5 | 2142068-2142154 | Orphan |
NA
Consensus repeat of LR134204_5
|
1 spacers
spacers of LR134204_5
>5.1|2142098|27|LR134204|CRISPRCasFinder TTCGCTTGCCCGGCCTACGTCCGTTTC |
CRISPR arrays and Neighbor proteins around LR134204_5
The CRISPR arrays of LR134204_5 >merge|LR134204|5|2142068-2142154|CRISPRCasFinder CGTTGGTAGTTCACAGGCCGGATAAGGCGCTTCGCTTGCCCGGCCTACGTCCGTTTCCGTTGGTAGTTCACAGGCCGGATAAGGCGC >LR134204|5|5|2142068-2142154|CRISPRCasFinder CGTTGGTAGTTCACAGGCCGGATAAGGCGC TTCGCTTGCCCGGCCTACGTCCGTTTC CGTTGGTAGTTCACAGGCCGGATAAGGCGC
>LR134204.1|VEB90792.1|2141613_2142015_+|nickel-responsive-regulator MQRVTITLDDDLLETLDSLSQRRGYNNRSEAIRDILRGALAQEATQEHGTQGFAVLSYVYEHEKRDLASRIVSTQHHHHDLSVATLHVHINHDDCLEIAVLKGDMGDVQHFADDVIAQRGVRHGHLQCLPKEE >LR134204.1|VEB90789.1|2140801_2141608_+|nickel-transporter-ATP-binding-protein-NikE MTLLTVSGISHQYAHANLSGKHQHHTVLKEVSLSLKSGETVALLGRSGCGKSTLARLLVGLESPTQGRVCWQGEPLSTLNRARQKAFRRDIQMVFQDSISAVNPRKTVREILREPMRHLLSLSRAEQIARASEMLRAVDLDDTLLDKRPPQLSGGQLQRVCLARALAVEPKLLILDEAVSNLDLVLQAGVIRLLKKLQQQFGTACLFITHDLRLVERFCHRVMVMDEGQIVETRTVGDTLTFSSDAGRVLQNAVLPAFPVRRRATEKV >LR134204.1|VEB90786.1|2140094_2140805_+|nickel-transporter-ATP-binding-protein-NikD MHGVSLALRRGCVLALVGGSGSGKSLTCAATLGILPAGVRQTAGAILADGQPISPCKLRGIKVATIMQNPRSAFNPLHTMAAHAKETCLAAGKPADDATLIAALEAVGLENAARVLKLYPFEMSGGMLQRMMIAMALLCDAPFIIADEPTTDLDVVAQARILDLLESIMRSRAPGMLLVTHDMGVVARLADDVAVMDNGKIVELGDVETLFRTPQHSVTRNLVSAHLALYGMELAS >LR134204.1|VEB90783.1|2139206_2140040_+|nickel-transporter-permease-NikC MNFFLSARWPVRLALLIIALLAIIALTSQWWLPYDPQAIDLPSRLLSPDSQHWLGTDHLGRDIFSRLLAATRVSLGSVIACLLLVLALGLIVGGSAGLMGGRVDQFTMRVADIFMTFPTSILSFFLVGVLGTGLTNVIIAIALSHWAWYARMVRSLVISLRQREFVLASRLSGAGHVRVFIDHLAGAVIPSLLVLATLDIGHMMLHVAGMSFLGLGVTAPTPEWGVMINDARQYIWTQPLQMFWPGLALFISVMAFNMVGDALRDHLDPHLVTEHAH >LR134204.1|VEB90780.1|2138265_2139210_+|nickel-transporter-permease-NikB MLRYIVRRILLLIPMIFAASVIIFLMLRMGTGDPALDYLRLSNLPPTPEMVASTRVMLGLDQPLVVQYGSWLWKALHLDFGISFATQRPVLEDVLNFLPATLQLAGAALVLILLTSVPLGIWAARHRDRLPDFIVRLIAFLGVSMPNFWLAFLLVMFFSVYLKWLPAMGYGGWQHIILPAVSIAFMSLAINARLLRASMLEVAGQRHVTWARLRGLNAKQTERRHILRNASLPMITAVGMHIGELIGGTMIIENIFAWPGVGRYAVSAIFNRDYPVIQCFTLIMVVVFVVCNLIVDLLNAALDPRIRRHEGAHS >LR134204.1|VEB90777.1|2137732_2138266_+|nickel-ABC-transporter-substrate-binding-protein MPAGKTIREKAGQPLHVELAYIGTDALSKSMAEIIQADMRKVGVDVSLVGEEESSIYARMRDGRFGMIFSRTWGAPYDPHAFMSSMRVPSHADYQAQLGLPDKAQIDKEIGEVLTTTDEAQRKSLYRDILTRLHNDAVYLPISYMSMMVVAKPELGKIPYAPITSEIPFEQINPVKP >LR134204.1|VEB90774.1|2137504_2137600_+|nickel-ABC-transporter-substrate-binding-protein MLALNSAKAPTNELAVREALNYAVNKKIADR >LR134204.1|VEB90772.1|2136850_2137378_+|nickel-ABC-transporter-substrate-binding-protein MVYEPLVKYQADGSVKPWLAKSWTHSADGKTWVFTLRDDVTFSNGEAFDAQAAAENFRAVLDNRQRHAWLELTNQITDVKALSKTELQITLKSAYYPFLQELALPRPFRFIAPSQFKNNETMNGIKAPVGTGPWVLKASKLNQYDVLVRNDHYWGEKPAIRQITIKVIPDPTTRA >LR134204.1|VEB90769.1|2136082_2136517_+|holo-(acyl-carrier-protein)-synthase-2 MYGEQGKPAFSPDTRLWFNLSHSGDDIALLLSDEGEVGCDIEVIRPRDNWRSLANAVFSLGEHAEVEAEHPEQQLTAFWRIWTRKEAIVKQRGGSAWQIVSVDSTLNSALSVSQCQLDTLSLAVCTPTPFTLTADCVQRLESIA >LR134204.1|VEB90766.1|2134807_2135665_+|3-oxoacyl-(acyl-carrier-protein)-synthase-II MPEWQVYDGLHTLLGAPVDDFTLPAHYTRKRIRAMGRVSLMSTRATELALEQAGLIDDAVLTNGETGIAYGSSTGSTGPVSEFATMLTEKHTNNITGTTYVQMMPHTTAVNTGLFFGLRGRVIPTSSACTSGSQAIGYAWEAIRHGYQTVMVAGGAEELCPSEAAVFDTLFATSQRNDEPKTTPSPFDEKRDGLVIGEGAGTLILEELEHAKARGATIYGEIIGFATNCDAAHITQPQRETMQICMETIAENSRLKRTGYRVHFRPRHGDRSGRYRRKSGNGRNL >LR134204.1|VEB90795.1|2142176_2143301_-|ABC-transporter-permease MRRLRNIYNLGIKELRSLLGDKAMLTLIVFAFTISVYSSATVLPGSLHLAPIAIADMDQSQLSNRIVNSFYRPWFLPPEMITADEMDAGLDAGRYTFAVNIPPNFQRDVLAGRQPDIQVNVDATRMSQAFTGNSYIQNIISGEVNSFVARYRDNSEPLVSLETRMRFNPNLDPAWFGGVMAIINNITMLAIVLTGSALIREREHGTIEHLLVMPITPFEIMMAKVWSMGLVVLVVSGLSLMLMVKGVLGVPIEGSIPLFMLGVALSLFATTSIGIFMGTLARSMPQLGLLMILVLLPLQMLSGGSTPRESMPQMVQDIMLTMPTTHFVSLAQAILYRGAGFGIVWPQFLTLLAIGSAFFLIALLRFRKTIGTMA >LR134204.1|VEB90798.1|2143300_2146048_-|ABC-transporter-ATP-binding-protein MTNLARLPIPPVAHLDGVSQHYGATVALNNITLDIPARCMVGLIGPDGVGKSSLLSLISGARVIEQGNVRVLGGDMCDATHRRDVCPRIAWMPQGLGKNLYHTLSVYENVDFFARLFGHDKAEREARITDLLNSTGLAPFRDRPAGKLSGGMKQKLGLCCALIHDPELLILDEPTTGVDPLSRAQFWALIDSIRQRQTNMSVLVATAYMEEAERFDWLVAMNAGEVMATGSAQELRDKTGSATLEQAFIALLPEAQRQAHKPVVIPPYDATQEEIAIEAKDLTMRFGNFVAVDHVNFRIPRGEIFGFLGSNGCGKSTTMKMLTGLLPASEGEAWLFGQSVDPKDIETRRRVGYMSQAFSLYSELTVRQNLELHARLFHIPDAEIPQRVAQMSERFMLSDVEDMLPEALPLGIRQRLSLAVAVIHRPEMLILDEPTSGVDPVARDMFWQLMVDLSRQDKVTIFISTHFMNEAERCDRMSLMRAGKVLASGTPQELVERRGAASLEEAFISWLQEAAGPAPVSPLQPADTPDVHKAGKPPRQGFSLRRLFSYSRREALELRRDPVRSTLALLGTVILMLIMGYGISMDVENLRFAVLDRDRTVSSQAWSLNLAGSRYFIEQSPLTSYDDLDRRMRSGEVAVAIELPPNFGRDIARGTPVQIGVWVDGAMPSRAETVRGYVQAMHQSWLQDVASRQPNASSQGGLMTIETRYRYNPDVKSLPAIVPAVIPLLLMMIPSMLSALSVVREKELGSIINLYVTPTTRSEFLLGKQLPYIALGMLNFLLLCALSVFVFGVPHKGSFLTLTLAALLYVTIATGMGLLISTFMKSQIAAIFGTSIITLIPATQFSGMIDPVASLEGPGRWIGEIYPTSHFLTIARGTFSKALDLMDLWPLFVPLAIAIPLVIGLSVLLLKKQEG >LR134204.1|VEB90801.1|2146044_2147112_-|hemolysin-D MDKIKRHLVWWSVGILVAIAAVAWWMLRPAGVPDGFAASNGRIEATEVDIATKIAGRIDTILVSEGQFVRQGEVLAKMDTRVLQEQRLEAIAQIKEAESAVAAARALLEQRQSETRAAQSVVKQREAELDSVSKRHVRSRSLSQRGAVSAQQLDDDRAAAESARAALESAKAQVSATKAAIEAARTSIIQAQTRVDAAQATERRIVADIDDSELKAPRDGRVQYRVAEPGEVLSAGGRVLNMVDLSDVYMTFFLPTEQAGLLKIGGEARLVLDAAPSLRIPATISFVASVAQFTPKTVETSDERLKLMFRVKARIPPALLQQHLEYVKTGLPGMAWVRLDEQRPWPDDLAVRLPQ >LR134204.1|VEB90804.1|2147765_2148779_+|magnesium-transporter MSSIHVCNAHHPAPAFDGDAIAQYMRTDFITLPDHLSVHEAREFFVSQLTSDDIPGQVFVVAEKQLRGVLSIKKLLQASDTAQSIRALMDSCLFRVKPDDERPHIIAELTERQLDLVPVVDKGELVGCLMEKEIAHLLEDDVTEDVQRQGATLPLEKPYLDISPWTLWKKRSVWLLLLFVAEAYTSSVLQHFEEALESAIALAFFIPLLIGTGGNSGTQITSTLVRAMALGEVRLRDMGRVIRKEASTSLLIAITLGLAGCLRAWMMGIGVEITLIVSLTLVCITLWSAVVSSVIPMVLKRIGIDPAVVSAPFIATLIDGTGLIIYFKIAQYFLGLN >LR134204.1|VEB90807.1|2148816_2150010_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-B MERFDAVIIGAGAAGMFCAAQAGQAGRRVLLIDNGKKPGRKILMSGGGRCNFTNLYVEPAAYLSQNPHFCKSALARYTQWDFIDLVGKHGIAWHEKTLGQLFCDDSAQQIVDMLVAECAKGNVTMRLRSEVLSIARDDEGFTLELNGMTVAAKKVVIATGGLSMPGLGASPLGYKIAEQFGLDVLPTRAGLVPFTLHKPLLEQLQVLSGVSVPAVIAAQDGTVFRENLLFTHRGLSGPAVLQISSFWQPGEFVSINLLPEVDPDDFLNEQRCAHPNQSLKNTLAMQLPKRLVECLQQLGQIPDVSLKQLNAREQQALIETLTEWRVQPNGTEGYRTAEVTLGGVNTDELSSRTMEARKVPGLYFIGEVMDVTGWLGGYNFQWAWSSAWACAQALSEG >LR134204.1|VEB90811.1|2150240_2150924_+|low-affinity-inorganic-phosphate-transporter MLHLFAGLDLHTGLLLLLALAFVLFYEAINGFHDTANAVATVIYTRAMRSQLAVVMAAVFNFFGVLLGGLSVAYAIVHMLPTDLLLNMGSAHGLAMVFSMLLAAIIWNLGTWYFGLPASSSHTLIGAIIGIGLTNAMMTGTSVVDALNIPKVINIFGSLIVSPIVGLVFAGGLIFLLRRYWSGTKKRARIHLTPAEREKKDGKKKAAILDAYRTYPLGYRRGVLSWR >LR134204.1|VEB90814.1|2150901_2151738_+|low-affinity-inorganic-phosphate-transporter MAFSHGANDGQKGIGLVMLVLIGVAPAGFVVNMNASGYEITRTRDAINNVETYFQQHPDLLKKVTGVDQLIPAPEPGATEPAEFHCHPANTINALDRAKGMLANLESYDTLSVDQRSQLRRIMLCISDTTDKVVKLPGVSNDDQRLLKKLKTDMLSTIEYAPIWIIMAVALALGIGTMIGWRRVATTIGEKIGKKGMTYAQGMSAQMTAAVSIGLASYTGMPVSTTHVLSSSVAGTMVVDGGGLQRKTVTSILMAWVFTLPAAIILSGVLYWISLKII >LR134204.1|VEB90817.1|2151853_2152216_-|universal-stress-protein-UspB MISTVALFWALCVVCIVNMARYFSSLRALLVVLRGCDPLLYQYVDGGGFFTSHGQPNKQMRLVWYIYAQRYRDHHDDEFIRRCERVRGQFILTSALCGLVLISMVCAVDLALSKKSGSVS >LR134204.1|VEB90820.1|2152607_2153042_+|universal-stress-protein-A MAYKHILIAVDLSPESKVLVEKAVSMARPYNAKVSLIHVDVNYSDLYTGLIDVNLGDMQKRISEETHHALTELSTNAGYPITETLSGSGDLGQVLVDAIKKYDMDLVVCGHHQDFWSKLMSSARQLINTVHVDMLIVPLRDEEE >LR134204.1|VEB90823.1|2153340_2154813_+|inner-membrane-transporter-YhiP MNTTAPTGLLQQPRPFFMIFFVELWERFGYYGVQGILAVFFVKQLGFSQEQAFITFGAFAALVYGLISIGGYVGDHLLGTKRTLVLGAIVLAIGYFMTGLSLLKPSLIFIALGTIAVGNGLFKANPASLLSKCYLPKDPRLDGAFTLFYMSINIGSLLSLSLAPVIADKFGYAVTYNLCGAGLIVALLVYFACRGMVKDIGSEPDHKPLSFRNLLAVLIGTVAMIFLCAWLMHNVKIANLVLIVLSIVVIIFFFREAFRLDKTGRNKMFVAFILMLEAVLFYILYAQMPTSLNFFAINNVHHDILGFTINPVSFQALNPFWVVVASPVLAAIYTRLGSKGKDLTMPMKFTLGMFLCSLGFLTAAAAGMWFADAQGLTSPWFIVLVYLFQSLGELLISALGLAMVAALVPQHLMGFILGMWFLTQAAAFLLGGYVATFTAVPENITDPLQTLPIYTGVFSKIGLVTLAVTVVMAIMVPWLNRMINTPDSSK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_6 | 2364776-2364918 | Orphan |
NA
Consensus repeat of LR134204_6
|
1 spacers
spacers of LR134204_6
>6.1|2364831|33|LR134204|CRISPRCasFinder CGTAAGCGCCTTATCCGGCCTACGGTGTTGCGT |
CRISPR arrays and Neighbor proteins around LR134204_6
The CRISPR arrays of LR134204_6 >merge|LR134204|6|2364776-2364918|CRISPRCasFinder TTTTGTAGGCCCGGTAAGCACAGCGCCACCGGGCATCATTATTGCCGGATGGCGGCGTAAGCGCCTTATCCGGCCTACGGTGTTGCGTTTTTGTAGGCCCGGTAAGCGCAGCGCCACCGGGCATCATTATTGCCGGATGGCGG >LR134204|6|6|2364776-2364918|CRISPRCasFinder TTTTGTAGGCCCGGTAAGCACAGCGCCACCGGGCATCATTATTGCCGGATGGCGG CGTAAGCGCCTTATCCGGCCTACGGTGTTGCGT TTTTGTAGGCCCGGTAAGCGCAGCGCCACCGGGCATCATTATTGCCGGATGGCGG
>LR134204.1|VEB91467.1|2364284_2364710_-|Inner-membrane-protein-yidH MKISRLGEAPDYRFSLANERTFLAWIRTALGFLAAGVGLDQLAPDFATPVIRELLALLLCLFAGGLAIYGYLRWLRNEKAMRLKEDLPYTHSLLIISLMLMIVCGDCHGAGVVCRIAAKPAVRLIPVYSRSARRWPGFVPC >LR134204.1|VEB91465.1|2364021_2364270_-|Inner-membrane-protein-yidG MALALKHNWHQAGFLFWVSIGVLAIVALILWRYTRSRNLMDVAHSDFSESRAVRDKFLISLAVLSLAILFAVTHIRQLIVFY >LR134204.1|VEB91463.1|2362900_2363917_-|arylsulfatase MDNATLEAFICQHIAAQPVDDVLFAWQGGEPTLCGLDFFHRVVALQKRYGEGKRIQNTFQTNGILLSDEWCRFLRDNGWLVGISLDGPADLHDAYRVSRSGKPTHHKVVEAIARLVAHRVDFNLLVVVNRLNSQQPARMYRYLRQLGTPFLQFIPLVERDDSGKLTADSVTSEDWGRFLNGVFDLWVREDIGRVFVQLFDSMLGVWSGHPAQMCALSETCGHAFALEANGDLYQCDHYVYPAFRLGNIHQTPLHMLNASPQASEFGQQKKSTLGADCLDCALLRFCNGDCPKHRDLSGKSVLCGGYKAFINYTSPHMRVMRDLIKQHRSPMELMAMLR >LR134204.1|VEB91461.1|2362519_2362888_+|D-serine-dehydratase MMPFPSRKIGVDNLTAADGLAVGRASGFVGRAMERLLDGLYTLDDRTMYDMLGWLAQEEGIRLEPSALAGMAGPQRVCRSTDYQQMHAFSAEQLNHATHLVWATGGGMVPEEEMAQYLAKGR >LR134204.1|VEB91459.1|2361558_2362557_+|D-serine-dehydratase MENAKMTSLIAQYPLVEDLIALKETTWFNPGTTSLAEGLPYVGLTAQDVQDAHARLARFAPYLAKAFPETAATGGIIESELAIIPAMQQRLEKEYGQKISGELLLKKDSHLPISGSIKARGGIYEVLAHAEKLALKAGLLTTEDDYSVMLSPEFRQFFSQYSIAVGSTGNLGLSIGIMSACIGFKVTVHMSADARAWKKAKLRSHGVTVVEYEEDYGVAVEQGRKAAQSDPNCFFIDDENSRTLFLGYAVAGQRLKAQFAQQGRVVDADHPLFVYLPCGVGGGPGGVAFGLKLAFGDNVHCFFAEPTHSPCMLLGVYTGLHDAISVQEDWRR >LR134204.1|VEB91457.1|2361073_2361541_+|permease-DsdX MGTLLTHTENGFGSIANILLIIGAGGAFNAILKSSGLADTLAVILSNMHMHPILLAWLVALILHAAVGSATVAMMGATAIVAPMLPLYPNVSPEIIAIAIGSGAIGCTIVTDSLFWLVKQYCGATLNETFKYYTTATFIASVVALAGTFLLSFII >LR134204.1|VEB91455.1|2360202_2361123_+|permease-DsdX MHSQIWVVSTLLVSIVLIVLTIVKFRFHPFLALLLASFFVGAMMGMGPLEMVNAIESGIGGTLGFLAAVIGLGTILGKMMEVSGAAERIGLTLQRCRWLSADVIMVLVGLICGITLFVEVGVVLLIPLAFSIAKKTNTSLLKLAIPLCTALMAVHCVVPPHPAALFVANKLGADIGSVIVYGLLVGLVASLVGGPLFLKFLGNRLPFKPVPTEFADLEVRDENTLPSLRATLFTVLLPIGLMLVKTVAELNMTQGGTLYTLLEFIGQPDHRNVYRRFRGLLHSRPASAYRYGYVADPYRKRLWLDC >LR134204.1|VEB91453.1|2359048_2359975_-|DNA-binding-transcriptional-regulator-DsdC MEPLRDGRNRLLNGWQLSKLYTFEVAARHQSFALAADELSLSPSAVSHRINQLEEELGIQLFVRSHRKVELTHEGKRVFWALKSSLDTLNQEILDIKNQELSGTLTVYSRPSIAQCWLVPALGDFTRRYPSISLTMLTGNDNVNLQRAGIDLAIYFDDAPSAQLTHHFLMDEAILPVCSPEYARHHDLTNARVNLRHCTLLHDRQAWSNDSGTDEWHSWAQHFGVNLPTSSGIGFDRSDLAVIAAMNHIGVAMGRKRLVQKRLDSGELIAPFGEMTLKCHQHYYVTTLPGRQWPKIEAFIGWLQEQAT >LR134204.1|VEB91451.1|2357897_2358959_-|Cellulase-(glycosyl-hydrolase-family-5) MLKPLASLFLLVSATVCAAQPPLTASRYAQQLGVGMDVDWARTERGIREFDPLVVRDFQAKGFKHVRIRVAGEPTEARLIHLRKLVEACEQYGVIPIIAYQADEYKNDPSPGNEQEVINWWVAVAHYFAQRSPLLGFDLIYEPAEKLNHSQSSLNRVYDKTIRTLHAIDPQRMIFVAPRLRAAPEALSALKLPPQSQNYVLAEWHIFPWGPLKNNGKYPWTSARRRRKRRFAPRINAAARWQQKTGHVSWVGGWRRANQSKNSPSASQLAFARFMACELQRVNIPYAINADTQFYDGEEGAWRPALEPLLTTMISPVCEKPGGKPGHHSIKSSAPDAAHATPAAASTVKSAAP >LR134204.1|VEB91449.1|2356729_2357971_+|multidrug-resistance-protein-D MGMCSTLFLFFSFFKSNQRNEKAEKRQFVVDVGITGGRRADGANHLYPRHCDMARELNVREGAVQSVMAAYLLTYGVSQLFYGPLSDRVGRRPVILVGMSIFMLATLVAITTHSLTVLIAASALQGMGTGVGGVMARTLPRDLYEGTQLRHANSLLNMGILVSPLLAPLIGGLLDTVWNWRACYVFLLVLCAGVTFSMARWMPETRPTGAPRTRLIASYKTLFGNGAFTCYLLMLIGGLAGIAVFEACSGVLMGAVLGLSSMVVSILFILPIPAAFFGAWFAGRPNKRFATLMWQSVFSCLLAGVMMWVPGLFGVMNVWTLLIPAALFFFGAGMLFPLATSGAMEPFPFLAGTAGALVGGLQNIGSGVLAWLSAMLPQTGQGSLGFLMTLMGLLILLCWLPLASRVPHQGQTI >LR134204.1|VEB91469.1|2364978_2366472_-|sulfatase MTRPNFLFIMTDTQATNMVGCYSGKPLNTQHIDSLAAEGIRFNSAYTCSPVCTPARAGLFTGIYANQSGPWTNNVAPGKNISTMGRYFKDAGYHTCYIGKWHLDGHDYFGTGECPPEWDADYWYDGANYLSELTENEISLWRNGLNSIEDLQTNNIDETFTWAYRISNRAVDFLHQPARDDEPFLMVVSYDEPHHPFTCPVEYLEKYQDFYYDLGAKAQDSLSNKPEHHRLWAQAMPSPVGDDGCYHHPLYFACNDFVDDQIGRVINALTPEQRENTWVIYTSDHGEMMGAHKLISKGAAMYDDITRIPLIMRPPQGAPQQIDTPVSHIDLLPTMMALAGIDKPDILPGENILAVEAPRGVMVEFNRYEIEHDSFGGFIPVRCWVTDNWKLVLNLFTSDELYDRRNDPDELHNLIAAPEFAGVRSQMHDALLDYMDKIRDPFRTYQWSLRPWRKDARPRWMGAFRPRPQDGYSPVVRDYDTGLPTQGVKVEEKKQKF >LR134204.1|VEB91471.1|2366468_2368205_-|putative-symporter-YidK MKNKGYEMNTLQILSFVGFTLLVAVITWWKLRKADTGSQQGYFLAGRSLKAPVIAASLMLTNLSTEQLVGLTGQAYRSGMSVMAWEVISAIPLIFLALIFLPRYLQRGIATIPDFIEERYDKTTRIIIDCCFLIATGICFLPIILYSGALAFNSMFHVTETFGISQHTAIWLMAVVLGISGILYAVVGGMRAIAIADVINGIGLVIGGLMVPLFGLMAMGKGSLSEGITRLTQDHSQMLNSIGGSNDPVPIGAALTGLILVNTFYWCTNQGIVQRTLASKSLSEGQKGALLTAVLKMLDPLILVLPGVIAFHLFQDLPKADMAYPALVNKVMPLPLIGFFSAVLFGAIISAFCGFLNSASTLFSLGIYRRLINEQASPDKLVTVGRRFGFIVAVISVLVAPWIAYAPQGLYSWMKQLNGIYNVPLVTIVIMGFFFPRIPALAAKVAMGLGIVSYITINYLVKFDFHFLYVLACTFCINVVVMLLIGVIKPRATPFKFHDAFAVDMKPWKNVKIAAVGVLFAMIGVYSGLAQFGGYQTRWLTILSYAITAAVVVYLIYSSWQTRHSAPVVYVSDAKDKA >LR134204.1|VEB91473.1|2368243_2369245_+|AraC-family-transcriptional-regulator MPGQAVRFLNYKHRLPEAFTFCYVQVVGIILRRENKMNGNLLSSHVKNETTYNIPLLINENVISSGISLISLWHTYADENYRVIWPRDKKKPLIANSWVAVYTVQGCGKILLKDGEQITLNGNCIIFLKPTDIQSYHCEGLLWEQYWMEFTPTSIMDIPIRQQSIIYNGDVYNQELAEVSQLITRPEPIKNNLAVAFLTKIIYQWICLIDSNGKKDPQRIQVEKLIAALHASLQRRWSVADMAATIPCSEAWLRRLFLRYTGKTPKEYYLDARLELALSLLKQEGNTVGQVADMLNFFDSFHFSKAFKHKFGYAPSAVLKHPDRHPLDAGQQD >LR134204.1|VEB91475.1|2369241_2369502_-|6-phospho-alpha-glucosidase MVEIPCLVGHNGPEPLTVGDIPQFQKGLMSQQVAVEKLVVDAWEHRSYQRLWQAITLSKTVPSASVAKAILDDLIEANKEYWPELH >LR134204.1|VEB91477.1|2369512_2370565_-|6-phospho-alpha-glucosidase MKKFSVVIAGGGSTFTPGIVLMLLANQDRFPLRALKFYDNDGARQEVIAEACKIILKEQAPEIEFSYTTDPEAAFTDVDFVMAHIRVGKYPMREKDEKIPLRHGVLGQETCGPGGIAYGMRSIGGVLELVDYMEKYSPNAWMLNYSNPAAIVAEATRRLRPNAKILNICDMPIGIEGRMAQIVGLQDRKQMRVRYYGLNHFGWWTSIEDLQGNDLMPKLREYVAKHGYVPPSNDPHTEASWNDTFAKAKDVQALDPDTMPNTYLKYYLFPDYVVQHSNPARTRANEVMDHREKQVFGSCRAIIEAGHSSAGELEIDEHASYIVDLATAIAFNTQERMLLIVPNNGAIHNF >LR134204.1|VEB91479.1|2370561_2372181_-|PTS-system-EIIBC-component MLSQIQRFGGAMFTPVLLFPFAGIVVGIAIMLRNPLFVGEALTDPNSLFAQIVHIIEEGGWTVFRNMPLIFAVGLPIGLAKQAQGRACLAVLVSFLTWNYFINAMGMTWGHFFGVDFSVDPTAGSGLTMMAGIKTLDTSIIGAIMISGIVTALHNRYFDKPLPVFLGIFQGTSFVVIIAFLVMIPCAWLTLLGWPKVQMGIESLQAFLRSAGALGVWVYTFLERILIPTGLHHFVYGPFIFGPAAVEGGIQVYWAQHMQAFSQSTESLKTLFPEGGFALHGNSKIFGSVGIALALYYTASPENRVKVAGLLIPATLTAVLVGITEPLEFTFLFISPLLFAVHAVLAASMATVMYICGVVGNMGGGLLDQFLPQNWIPMFHNHAGMMFTQIGIGLAFTALYFVIFRTLILRFNLKTPGREESEIRLYSKADYKAARGQTTAPAPDAKLGQAAGFLQALGGAGNIESINNCATRLRIALHDMAQTQSDDVFKALGAHGVVRRGNGIQVIVGLHVPQVRDQLETLMKSPLSTEQIPMTEAVS >LR134204.1|VEB91481.1|2372403_2373126_+|GntR-family-transcriptional-regulator MIYKSIAERLRIRLNSSDYNIGSPLPGEKKLAEEFAVSRMTIRKAVDMLVGWGLVVRRHGSGTYVARKDVHHETTNLTGLAEVLRRQGKEVTSQVLVFEVMPAPPAIASQLRIQINERIYFSRRVRYVDGKPLMLEDSYMPVRLFRNLSLIHLEGSKFDYIEKECGITISGNYESLTPVLADKQLSAQMNVPEQTPLLRITSLSYSDSGEFLNYSVMFRNASEYQVDYHLRRVHPDANLT >LR134204.1|VEB91483.1|2373122_2374784_-|transport-protein MSDIALTVSVLALVAVVGLWIGNIKVRGVGFGIGGVLFGGIIVGHFVDQAGMTLSSDMLHFIQEFGLILFVYTIGIQVGPGFFASLRVSGLRLNLFAILIVIIGGLVTAILHKIFAIPLPVVLGIFSGAVTNTPALGAGQQILRDLGTPMEAVDQMGMSYAMAYPFGICGILLTMWLMRMIFRVNVEAEAKQHEDTLSNGHSLIQTMNIRVENPNLNNMAIQDVPILNSDKIICSRLKRDETLMVPSPGTIIQSGDLLHLVGQPADLHNAQLVIGQEVDTSLSTRGTDLRVERVVVTNEKVLGKRIRDLHFKERYDVVISRLNRAGVELVASSDASLQFGDILNLVGRPSSIDAVANVVGNAQQKLQQVQMLPVFIGIGLGVLLGSIPLFVPGFPVALKLGLAGGPLIMALILGRIGSIGKLYWFMPPSANLALRELGIVLFLAVVGLKSGGDFIDTLTQGDGLSWIGYGIFITAIPLITVGLLARIFAKMNYLTLCGMLAGSMTDPPALAFANNLHATSGAAALSYATVYPLVMFLRIITPQLLAVLFWGLG >LR134204.1|VEB91485.1|2375018_2375447_-|heat-shock-chaperone-IbpB MRNYDLSPLLRQWIGFDKLANALQNTGESQSFPPYNIEKSDDNHYRITLALAGFRQEDLDIQLEGTRLTVKGTPAQPEKETKWLHQGLVTQPFSLSFTLAENMEVSGATFTNGLLHIDLTRNEPETIAPQRIAISERPALNS >LR134204.1|VEB91487.1|2376279_2376612_+|lipoprotein MIKNVLLTLMMCSGLVLLGGCSSVMSHTGGKEGTYPGTRASAEMIGSSDTNWGTKSLAILDMPFTAVMDTILLPWDFFRTDSSVKSRVEKSEEKTKMTNAVIPPARMPAH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_7 | 2757928-2758029 | Orphan |
NA
Consensus repeat of LR134204_7
|
1 spacers
spacers of LR134204_7
>7.1|2757952|54|LR134204|CRISPRCasFinder CACCGGGCGACCTGCCGGATGGCGGCGTTGCCTTATCCGGCCTACAAAGCAACT |
CRISPR arrays and Neighbor proteins around LR134204_7
The CRISPR arrays of LR134204_7 >merge|LR134204|7|2757928-2758029|CRISPRCasFinder CGTAGGCCCGGTAAGCGGTAGCGCCACCGGGCGACCTGCCGGATGGCGGCGTTGCCTTATCCGGCCTACAAAGCAACTCGTAGGCCCGGTAAGCGGTAGCGC >LR134204|7|7|2757928-2758029|CRISPRCasFinder CGTAGGCCCGGTAAGCGGTAGCGC CACCGGGCGACCTGCCGGATGGCGGCGTTGCCTTATCCGGCCTACAAAGCAACT CGTAGGCCCGGTAAGCGGTAGCGC
>LR134204.1|VEB92299.1|2757624_2757876_+|aromatic-amino-acid-aminotransferase MRTRILAMRQELVNVLNAEIPGRNFDYLLQQRGMFSYTGLSVAQVDRLRDEFGVYLIASGRMCVAGLNSGNVQRVAKAFAAVM >LR134204.1|VEB92297.1|2757167_2757587_+|aromatic-amino-acid-aminotransferase MLATLNTLPARSIVLLHPCCHNPTGADLTHSQWDAVIEILKARELIPFLDIAYQGFGAGMEDDAYAIRAIASAGLPALVSNSFSKIFSLYGERVGGLSVVCEDAEAAGRVLGQLKATVRRNYSSPPNFGAQVGCRRTER >LR134204.1|VEB92295.1|2756680_2757064_+|aromatic-amino-acid-aminotransferase MFQKVDAYAGDPILSLMERFKEDSRSDKVNLSIGLYYNEEGIIPQLKAVAEAEARINAQPHGASLYLPMEGLNTYRHTIAPLLFGADHPVLQQQRVATIQTLGGSGALKVGADFLKRYFPESAVWGQ >LR134204.1|VEB92293.1|2756271_2756532_+|alanine-racemase MAMVIRVRRRPGTPVLVNGREVTIVGRVAMDMICVDLGPQAQDKAGDPVILWGEGLPVERIAEITKVSAYELITRLTSRVAMKYID >LR134204.1|VEB92291.1|2755450_2756068_+|alanine-racemase MQAATVVINRRALRHNLQRLRELAPASKLVAVVKANAYGHGLLETARTLPDADAFGVARLEEALRLRAGGITQPILLLEGFFEATDLPVISAQHLHTAVHNEEQLAALEAAGLDEPVTVWMKLDTGMHRLGVRPEKAEAFYQRLTQCKNVRQPVNIVSHFARADEPECGATEQQLDIFNTFCEGKPGQRSIAASGGILLWPQSHF >LR134204.1|VEB92289.1|2754129_2755332_+|replicative-DNA-helicase MHRLQEMGKPIDLITLAESLEIQGQLDSVGGFAYLAELSKNTPSAANISAYADIVRERAVVRDMISVAHEIADAGYDPQGRNSDELLDLAESRVFQIAENRANKDEGPKSIDQILDATVARIETLFQQPHDGVTGVDTGYQDLNKKTAGLQRSDLIIVAARPSMGKTTFAMNLCENAAMLQEKPVLIFSLEMPGEQIMMRMLASLSRVDQTRIRTGQLDDEDWARISGTMGILLEKRNMYIDDSSGLTPTEVRSRARRIYREHGGLSLIMIDYLQLMRVPSLSDNRTLEIAEISRSLKALAKELQVPVVALSQLNRSLEQRADKRPVNSDLRESGSIEQDADLIMFIYRDEVYHENSDLKGIAEIIIGKQRNGPIGTVRLTFNGQWSRFDNYAGPQYDDE >LR134204.1|VEB92287.1|2753915_2754164_+|replicative-DNA-helicase MAGNKPFNKQQTDARDRDLQVDGLKVPPHSIEAEQSVLGGLMLDNERWDDVAERVVADDFLYPAAPAYFYRNAPFAGDGETH >LR134204.1|VEB92285.1|2752847_2753534_-|quinone-oxidoreductase,-NADPH-dependent MFPPIKAAILPKAISFEQAAASFLKGLTVFYLLRKTYEVKPDEPFLFHAAAGGVGLIACQWAKALGAKLIGTVGSAQKAQIALQAGAWKVINYREENIAERVKEITGGKKVRVVYDSVGKDTWEASLDCLQRRGLMVSFGNASGPVTGVNLGILNQKGSLYVTRPSLQGYITSRDELTEASNELFSLIASGVIKVEVAENQKYALKDAQRAHEVLESRATQGSSLLIP >LR134204.1|VEB92283.1|2752428_2752671_+|phage-shock-protein-G MLELLFVMGFFIMLMATGVSLLGILAALMVATAVMFLGGMFALVIKLLPWLLLAVAVVWVIKAVKTPKIPQYQRNNRRFY >LR134204.1|VEB92281.1|2751228_2752266_+|tRNA-dihydrouridine-synthase-A MHQNKESQNTLQTNVMPEKLALYPAHRFSIAPMLDWTDRHCRYFLRLLSRQTLLYTEMVTTGAIIHGKGDYLAYSEEEHPVALQLGGSDPAALAQCAKLAQARGYDEINLNVGCPSDRVQNGMFGACLMGNAQLVADCVKAMRDVVSIPVTVKTRIGIDDQDSYEFLCDFINTVAGKGECEMFIIHARKAWLSGLSPKENREIPPLDYPRVYQLKRDFPHLTMSINGGITSLEEAKAHLEHMDGVMIGREAYQNPGILASVDREIFGATTADADPVAVVRAMYPYIERELSQGTYLGHITRHMLGLFQGIPGARQWRRYLSENAHKAGADIDVLEHALKRVADKR >LR134204.1|VEB92301.1|2758182_2758896_+|acid-phosphatase/phosphotransferase MRKITLALSAFCLLFTLNNSAIALASSPSPLNPGTNVAKLAEQSPIHWVSVAQIENSLAGRPPIAVGFDIDDTVLFSSPGFWRGKKVYSPESEDYLKNPAFWEKMNNGWDEFSIPKEVARLLIDMHVRRGDSIFFVTGRSQTKTETVSKTLADDFHIPAANMNPVIFAGDKPGQNTKTQWLQEKNIRIFYGDSDNDITAARDAGIRGIRILRASNSTYKPLPQAGAFGEEVIVNSEY >LR134204.1|VEB92303.1|2759145_2759502_+|Uncharacterized-protein-conserved-in-bacteria MTNSELLQYCMAKTGAEQSVHSDWKATQIKVEDVLFAMVKEVENRPAVSLKTSPELAELLRQQHSDVRPSRHLNKAHWSTVYLDGSLPDSQIYYLVDASYQLAVSTLPEDKRKLLVQP >LR134204.1|VEB92305.1|2759597_2762078_-|excinuclease-ABC-subunit-A MGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLMLLAPIIKERKGEHAKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTIEVVIDRFKVRSDLSQRLAESFETALELSGGTAIVADMDDTKAEELLFSANFACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPELSLAGGAIRGWDRRNFYYFQMLKSLAEHYKFDVEAPWGSLSQNVHKVVLYGSGKESIEFKYINDRGDTSVRRHPFEGVLHNMERRYKETESSAVREELAKFISNRPCASCEGTRLRREARHVFVENTPLPAISDMSIGHAMDFFNNLKLAGQRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRNLGNTVIVVEHDEDAIRAADHVIDIGPGAGVHGGQVVAEGPLEAIMAVPESLTGQYMSGKRKIEVPKQRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLINDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSNPATYTGVFTPVRELFAGVPESRSRGYTPGRFSFNVRGGRCEACQGDGVIKVEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFFDAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTGQTLYILDEPTTGLHFADIQQLLDVLHQLRDQGNTIVVIEHNLDVIKTADWIVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML >LR134204.1|VEB92307.1|2762190_2762421_-|excinuclease-ABC-subunit-A MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQRRYVESLSAYARQFLSLMEKTRCRPY >LR134204.1|VEB92309.1|2762671_2763208_+|single-stranded-DNA-binding-protein MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKQTGEMKEQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDKYTTEVVVNVGGTMQMLGGRQGGGAPAGGNMGGGQQQGGWGQPQQPQGGNQFSGGAQSRPQQQSAPAPSNEPPMDFDDDIPF >LR134204.1|VEB92311.1|2763467_2763665_+|universal-stress-protein-g MNKVILMPVDILAMDLSDKAIAYADHMVDKENGVIHLLHVFPKTGYLPNAWFCLGYQKIRGIHDE >LR134204.1|VEB92313.1|2763654_2763957_+|universal-stress-protein-g MTNDAQEKMLNLARKFKTPLENIRFEVRHGNIRDEVNNAVEAHNAEMIIIGSRKPGIATHLLGSAAANIFAIRENSSDGHSLICHARWRFAYRADKIPLP >LR134204.1|VEB92315.1|2763989_2764466_-|Uncharacterised-protein MPMGEGFCRPGKRSATGHGDGWRHKRLTRPTGEGFCRPGKRSATGHNAGWRRKRLIRPTGERFCRPGKQSATGHDAGWRHKRLIRPTGEGFCRPGKQSATGHGDGWRHKRLTRPTGEGFCRPGKRSATGHNAGWRRKRLIRPTGERFCRPGKPKRHRA >LR134204.1|VEB92317.1|2764516_2765020_-|cation-transporting-P-type-ATPase MATSATLSFGLAFEAGERNIMRRPPRQSNENVMDAFAIWRVGFVGTLIAACAFALEAWLQPRGHSPEFIRTVLLQTLVTAQWVYMLNCRVSDGFSLGRGLLMNKGIWMVSGILLLLQMAIIYMPFMQMLFGTEALPLRYWGITFAIGAVLFCIVEIEKPLTRKFRRK >LR134204.1|VEB92319.1|2764979_2766677_-|cation-transporting-P-type-ATPase MSIKTADTLTGELPLGDRKNLLFSGTTISAGAGLGVVIATGEETELGHINQMMTGIEKHRTPLLVQMDKLGKAIFAIIVAMMAGLFVFSLLLRDMPMGELLLSLISLAVAAVPEGLPAIISIILSLGVQTMARKRAIIRKLPTVETLGAMSVICSDKTGTLTMNEMTVKAIITADKNYRVQGNSYEPMGEIHTEESDAAVEILPGSLLENYLRTIDLCNDSQLIQDENGHWGITGGPTEGALKVLAAKARLPAIETELRSKIPFDSQYKYMATHHRIGNEERVLVTGAPDVLFKLCQLQQTATGTEALTQNHWEAEIARYAKEGLRMVAAAWKPERAEAASLTHECLNDGLIFLGIAGMMDPPRPEAIDAIQVCQQAGIRVKMITGDHPQTAMSIGGMLGIHNSTHAVTGYELEQMDDAALAEAAVTYDIFARTSPEHKLRLVKALQNKGEIVGMTGDGVNDAPALKQADVGIAMGIKGTEVTKEAADMVLTDDNFATIASAVQEGRPRLRQPEENDSVHHAHQPGTGSADHHCPVGWEPDPAHAGTDSVDEHGDLRHAVVRAGI |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_8 | 2897241-2897335 | Orphan |
NA
Consensus repeat of LR134204_8
|
1 spacers
spacers of LR134204_8
>8.1|2897266|45|LR134204|CRISPRCasFinder TGAGCAGAACTACTGGGCGCAATAGTGGTTAAAAAATGACCTTGT |
CRISPR arrays and Neighbor proteins around LR134204_8
The CRISPR arrays of LR134204_8 >merge|LR134204|8|2897241-2897335|CRISPRCasFinder TGCGGGGCGCAGCAAGTACAAAAAATGAGCAGAACTACTGGGCGCAATAGTGGTTAAAAAATGACCTTGTTGCGGGGCGCAACAAGTACAAAAAA >LR134204|8|8|2897241-2897335|CRISPRCasFinder TGCGGGGCGCAGCAAGTACAAAAAA TGAGCAGAACTACTGGGCGCAATAGTGGTTAAAAAATGACCTTGT TGCGGGGCGCAACAAGTACAAAAAA
>LR134204.1|VEB92600.1|2896468_2897197_+|Uncharacterised-protein MTALAVVNDILHDPGMRHRHSRLAIIESRLSAPATPEQMARQRQAVQPSPKSGLSKKRIKNSPNSLSPENGLSSLKPDNSQNPENGLSKKSRTYAGVRNSDCNVRSFTHSVNKKTYVPCGEQFLPDDFLNALNSEDRQMLSTQMASLPETMAKALADMLREQQALGRLNNPVGWMFSMLRRARSGELKISEDTATASSGGTISHPVESRTVPTANVPVQPAKRPSQGQVRAMIAAIRQQVQK >LR134204.1|VEB92598.1|2895963_2896434_+|Uncharacterised-protein MTIPADSIVAHTLSRMQRSLERRTDTGDVGQERSGLLFMGNVHDAFPRQLFLDTRLSPLDKTAWVMIRLYAQQNEGAVFPTYDELQVQLASAHSEKASRDTVSRVLLMLRLTGWLSLCKRVRDDHGRVRGNIYAQHDEPLGYRDAETFDPGWLTLC >LR134204.1|VEB92596.1|2895654_2895885_+|Uncharacterised-protein MQMTDFSSRQYVSPSVRRRAQRLFSAVCCGEKVFRRLTMNGYLKIDVGPFWRILSKDGGKHWWLMDHETYNREIRR >LR134204.1|VEB92594.1|2895058_2895655_+|Protein-of-uncharacterised-function-(DUF2857) MSQHNLSQATTAILSSLLMDVKNGNIRRCESLGMSLEEIRALSQLSLDELHYLSQSHVSVLDVSLNHENFWLMLNQARNEQKRMLMIDRALELGGSMELIDKYFGLSPSEVSARRRLMGIESRQGRTQSLTEPQDSALWIEWKAAGLSSPDSHQALEAMMLAAEQHGLSLTAVWSRVCQWCRETTPSRKSESQQRKEA >LR134204.1|VEB92592.1|2894194_2894974_+|Protein-of-uncharacterised-function-(DUF2786) MFTVLPAAGGYTMSETTKVIARIRRLMALGTRNSNPHEAARAVALAQRLMQRHGLTPDMLSLHDIQESVCQNLTSNAEKVPAWLSSLATVVCMATGCRCWFGWHIHTSCSGVKSVRRSLHFYGFSGRAEVALYIYTVLQRQLRADTDGQIAGYRKRRILKNTLRRRADQFREGWVSGVWQVLQSFAPSECENSMLQRWLAQRHTGDTLGTVNVRVAGKCRGDRAARVAGWLAGRDTEVHQGLTGAQAQQQITAGGCTHE >LR134204.1|VEB92590.1|2893983_2894238_+|conjugal-transfer-protein-TraR MSEDLNDEMAQASTAIFTQRGIDAIRAKVHSLRPSKEICECCGADIPAARQLAVPGVELCAACQDVKEKQDVHRAAGCRGLHHV >LR134204.1|VEB92588.1|2892275_2893991_+|integrating-conjugative-element,-PFGI_1-class,-ParB-family-protein MSNVRSLNLGDAMLQRGKSPAQAADNNTPTVTQPVNEMAMVLTLDQLRPNPDNPRKGRNPRFEEIKASVRARGLDSIPKVTRDPDGDDVYIFSDGGNTRYQILCDLWQETGDERFYRVHTIFKPWPGRLKCLVGHLAENEVRGDLTYIEKAFGVHKARVIYEEQLGRSVTLRELSDLLGKEGYPIDNSSVSRMEDTLNYLYPHMPQLLESGMSRGQATPLLSLRSSAMKVWKSFSGDVNPECSLDDVFGAVCRCFDEPESYALDIFRDEFIGQLVKALPHPSLNYDRWLIELDPREQKRREQFGEPPPLPAPTPASEQSGGRERTTPPVTTLPADTSSQPPVTGLRSGTLTTPGRPEAVPELQDNTALPGGMSVGSEPRREVQNDLFGGNPVITGQTENPDNDGEWQFKSDVNANALVSAFGEQDETSTPSVSSVAFAATGLEPVNDIWHISALQDDIEHLQDMAYRLTFEIAEAMGCADCVREDKDHHSAGFSISENASEFTLFLAGLSGSLPNKQFNMFMFCLNFFGSQEPADTAVFDDVTVVKTMRLIRVIRRLRELQRLAVKGGENV >LR134204.1|VEB92586.1|2890911_2892279_+|replicative-DNA-helicase MTSNNRRNVSGLFSQDAEQSVIGGLMLDNDCWDEVALRLDTDDFYFKVHQVIFHEMARLVAAGRPIDLITLAESIENRGKDALEQLGGFAYLAELSKNTPSAANIVAYCDIVARYSQGRQLVAIGAEITETVKASGADIGAVMETAEQKITRLAERSEPQQAVTLLDGMEKLLTELERRCNVPDGITGTPTGFEDFDAMTSGLQAADLILIAARPSMGKTAFLISLILNSLLKKADTHAQFYSLEQPTEQILMRMVASLGSVDLTHLKNGQMDDEDWARVSNASSLLMGDLKDRLIIDDTGSLTPAMLRIRARRNARRYGHPSVIGLDYLQLMRCPDQENRTQEIAEISRSLKALAKEMGCPVVALSQLNRQLESRADKRPNNGDLRDSGALEQDADVIVFIYRDEVYHENTEDKGVAEIIVSKQRQGPIGTIRLQYEGRYTRFSANPGHTGRFA >LR134204.1|VEB92584.1|2889961_2890915_+|partitioning-protein-A MLNRILINFQVLKCIYNQSIIQRWVLSLVLYLLFPLRGGEGKSTQSANLAGFLADAGIKTLLIDGDHAQPTASSIFPLEYEAPGGLYELLMQTVDLSNPDNLISRTSINNLDIIVSNDPRNFLPTAMLNAPDGRVRLRNALSHPLFNSYGVIIVDSQGSRSVMSELIILASTGTMVGIAKPILPDVREFMRGTVALMEELLPYCAFGIQLPVTKLLINCMEYDNLSVETLAEVKAIVEDKRYSAHADKIHIDLLETCIYDLTVYVLGHVKGVPVHRLEKNTRRKSDSAFTSMYQLACELFPEWKTNFDALANAGGEE >LR134204.1|VEB92581.1|2887166_2888819_-|chromosome-segregation-protein MVITKLIIEGFKGFKDRFCIEFNESVNIIVGDNEAGKSTILEAINLGLTGLYAGKPVKNNISQYLFNDQLVKEYLASLATAARLPPPALFVELHFQKTPDTVFLEGDGNCLKSSSVGVVFKIKFNEDYKAAYEALLKVGDIRTLPIEYYDIEWKSFSRDALLSRNIPLKSAFIDSSSARGNNGSDVYISRIIKDLLTDDEVISVSQSHRKMKEAFISDKSIETINKKIISASKITSKKVTISVELSSQKAWESSLITCLDDVPFNYIGKGEQSIVKTNLALSHNKAKESNVILIEEPENNLSHTKLNLLIKTIKDNCEGKQIIITTHSSFVANKLGLGDLIFLSNRKTTNLASLTPGTQEFFEKIAGYDTLRMVLCKKAILVEGDSDELVVQKAYMDANQGRLPIEDGIEVISVGISFLRFLEIAAIIDKPTRVVTDNDGDIAAIQKKYNTYLGANIKPHIDICVDPIVDTGTLLVGTKKYNYNTLEPKLLKANSRASLNAVLGTTHTSDDDLRIYMHANKTDCALKIFNYTNTAALSIQYPQYILDAIK >LR134204.1|VEB92602.1|2897538_2898261_+|integrating-conjugative-element-protein,-PFL_4669-family MAEKNEKRTGALQSEMTIALHTNYAIGLWQGRQPEKQDGEKPARHGIIGMPQFFHRATLINQDSLRNNPWADVAMFNLEEKIKVASDSMTALIQQLDAEMSMLPPGITLTEVASVEPLDIQVFTRTPLGYRCVFLLVGFDQFAKKALQASHYGLITRSRRDHHLSEGGRLIRQIYGSVLSYRRIDATRFDAVENNETWKQACEVAGEPDLSVLLGEKRSAFSPPVNESSVNLLRMRYRAA >LR134204.1|VEB92604.1|2898400_2899975_+|DNA-topoisomerase-III MLETAPPEAYGQQYGKPWSLSALPILPSPWQVVVKKETQSQFKVIERLLRQVDDVVISTDADREGEVIARELLEYCRWDGPVQRLWLSALDEMSIRAALQDLRPGAETLGMYHAGLGRARADWLIGMNLSRLYTLLAVQSGFDCVISVGRVQTPTLALVVRRDREIATFVPKPFWQVKALLSAGGRTFPAQWVPAKVYTDEEKRCVHQNIAQQVAHLCRQAGAATVTECETKREKAAAPLAFSLGTLQQACGLLWDMSPQQVLDTAQSLYEKHKATTYPRTDCGYLPESMREEIPSVLSAIGLSDPECSSLLSGLNTSFVSRIWNDKKITAHHGIIPTRNAFKFSALSEAERRVYTLIRRNYLAQFLPLHESDITRLQFDIGGQLFRTTGRTEIVMGWKVLFSKEEEDSAGDDDLSIELPPLRKGDRCGVNGAEVVQQMTRPPAHYTFATLIGAMMNASAFVTDIALRKVLKDNAGLGTEATRAGIVEQLLNRRFIVRKGKQLRATDLGADVIDALPSQLTDPE >LR134204.1|VEB92606.1|2900910_2901402_+|Protein-of-uncharacterised-function-(DUF3577) MTATTTASSSTTSSSASQKKEYFNLNVSGIGYLSSIRRVQGQNGEFTCAVINALTGPTDNASYTRFDVTVAGPEATKLINRCQKAVDEEEKVLLGFVLSGLKADVFTLQSGEHSGESRPSLKARLIKVEFIKKGQEVVYTAPNASQTPEQGSAAPKDYDPNSF >LR134204.1|VEB92608.1|2901474_2901678_+|Uncharacterised-protein MGNPFGLLASEAGYQLQIQVLSSQRGFYIGTANEMGPVSRESVEYYKTSLQADIALEKGEWTQREYD >LR134204.1|VEB92610.1|2901695_2902244_+|single-stranded-DNA-binding-protein MASRGINKVILVGHLGQDPEVRYMPNSTAVTGFSIATSESWRDKQSGEMKENTEWHRVVLFGKLAEIAGEYLKKGSQVYIEGHLRTRKWTDNGGVERWTTEVVVGVNGTMQMLGSRNAPAGTGNAQQPGSAQGGWGQPQQPQPQQSAAPHNPANEPPMDFDDDIPFLGHGYGICRKAIYAIS >LR134204.1|VEB92612.1|2902322_2902565_+|Uncharacterised-protein MPVILRINGFKFFFYSNEGNPLEPAHIHVRNADGEAKFWLENEVKLSRNDGFDARTLKELTKMVQHNQTMFVEAWNDYFS >LR134204.1|VEB92614.1|2902548_2902797_+|Protein-of-uncharacterised-function-(DUF3532) MTISAKNVRFDETTMWVELNDGRVMGVPIAWFPRLLNATDKERNDYELSKRGIHWDNLDEDISVDGLLAGRGDVTHVPHRVA >LR134204.1|VEB92616.1|2903063_2904413_+|integrating-conjugative-element-protein-PilL,-PFGI-1-class MVKINLQTRYRLGAITLCLLAAGCTDFSSRSEPMSRLQGSPRVQDLYQNRSPEVVRYDRYILASTRPVDAQRDPLNQIIDIRMPLQMVNTIGEGMRYVMLESGYSLCSGEPGVFSELFVKPLPAVQRSIGPVKLGDALQILAGPAWRMRVDDLNREICFVLRDEYRHLALSTTGPLQASVASHGTGKSASTGTLLNSSKGLSGTGNNSYPLPPARPESSILSPGTPKVTSQAVSPLSVATTAKTPRNPFSASEPGEKSTVPVQKTQAGPAAKLTSGKVKPSTELAPAPAPSALSVASTPLNKAALGVPLTPSGAVKPGGTVQNSNPPSTVISPTAPVSGKTVFTPGALLSSLPDGQAWTAQAGTTLKETLIQWATSVRCESGSSPTWVVIWPTPVNYRIDVPFTLRGNFESIIVQLFRLYRPAEKPLYAAPNRLQCLVFVDDKPIQDGK >LR134204.1|VEB92618.1|2904414_2904858_+|PilM MGYALPVFALCLVIILIAGDAQHHSSEHVRLALQQTQPEQLAADMLRTADVVNNWRHGRSITDGPVSTALTGMLPLPDSRIKSIIQNGRLWIWVPETDGVYAALRARSVTSALALTVSGGHLRMADGTDMNLSLPSGVTEGSIVYLN >LR134204.1|VEB92620.1|2904877_2906551_+|Bundle-forming-pilus-B MPRIPFCLTVLTAAFVLSSCSLNEISKIDKEAVGQADTAQRVLQSRQSISQPTVVWMDKPWVNLQPVTPVVSTPDEKNLPPCQITINRPDGISLPELGQRITALCGIRVSITPDAFAALSNVSTGSVVTSQMSGQLPAPDDNGRVPLAQMGATSAQPVSVQPSPALMRGLKFQGDVRDLLDVEASGYGLSWRSDGNGVYFYRQDTRTFQLVILNTKVNSSASINSGSGNQLGSGGGTSGGTSGDISSNQKTDYGMNSDLYDDIRKTIEQMLTPKSGRFWLSAATGTLSVTDTPDVLDRIGRYIEYQNKVLSRQVQLNIQIVSVNQTRNEQLGLDWGLVYKSLHNFGATLTGSMANASTSAGSAGISILDTATGNAAKFSGSSLLIKALSEQGNVSMALNQTDPTANLTPVAYQLSNQQGVLTSSSSTATANVGVTSSQTVTTITTGLFMTMLPFIQENGDVQLQFAFSYTSPPQIEKFISRDGNTRNDIPNTSTQGLARKVNLRSGQTLVLTGSEQQNLSANKQGTFTPDNFILGGGQNGTRGRNTLVIMITPVLLR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_9 | 3401899-3401983 | Orphan |
NA
Consensus repeat of LR134204_9
|
1 spacers
spacers of LR134204_9
>9.1|3401928|27|LR134204|CRISPRCasFinder TATCGGCAGGGCGTTGCATCGTTTGTA |
CRISPR arrays and Neighbor proteins around LR134204_9
The CRISPR arrays of LR134204_9 >merge|LR134204|9|3401899-3401983|CRISPRCasFinder GGCCCGATAAGCAAAGCGCCATCGGGCATTATCGGCAGGGCGTTGCATCGTTTGTAGGCCCGATAAGCAAAGCGCCATCGGGCAT >LR134204|9|9|3401899-3401983|CRISPRCasFinder GGCCCGATAAGCAAAGCGCCATCGGGCAT TATCGGCAGGGCGTTGCATCGTTTGTA GGCCCGATAAGCAAAGCGCCATCGGGCAT
>LR134204.1|VEB93291.1|3401258_3401696_-|putative-major-pilin-subunit MDKQRGFTLIELMVVIGIIAILSAIGIPAYQNYLRKAALTDMLQTFVPYRTAVELCALEHGGIATCDASSNGIPSPTTTRYVSGMSIEKGVVMLTGQESLNGLSITMTPNWDNANGITGWTRSCNIQSDSALQQACEDVFRFDAS >LR134204.1|VEB93290.1|3399862_3401248_-|protein-transport-protein-HofB MNITQLTALCLRHQGILLDADNEVVHVAVVDSPSHELLDALHFATTKRIDIVCWTRQQMEGHMHKPRKMLPAVIADDSRTAAELLQQTLESALAKRASDIHIEPAENHYRIRLRVDGVLHALPGVSSVTGAALTARLKVLGNLDIAEHRLPQDGQFTVTLAGNAVSFRVAVLPCRDGEKVVLRLLHQVEQALDLNALGMPDTQLALFMQALRQPQGLVLVTGPTGSGKTITLYSALQARNTQDVNLCSVEDPVEIPVEGLNQTQIHPRAGLTFQSVLRALLRQDPDIIMVGEIRDGETAEIAIKAAQTGHLVLSTLHTNSTSETLIRLQQMGVARWMISSALTLVVAQRLVRKLCPYCRHLAAESVSLPAALWPTTLPRWQAPGCERCYHGFYGRTALFEVLPVTPALRQLLINNATVEALESCARQEGMSTLFENGCHAVEQGLTTFEELLRVLGLPHVS >LR134204.1|VEB93289.1|3398670_3399906_-|type-IV-pilin-biogenesis-protein MKNSCAFWGYLMSAKQLWRWQGINATGHSEEGALWADDRAALLVALEHQHIMPIRAKRLSVKAAHWRAEQSAEIVHQLATLLKAGLTLSEGLTLLAGQHPSRQWQALLQTLAHNLEQGTPLSSALAQWPQVFPPLYQAMIRTGELTGKLDECCLELARQQNAQKQLADKVKKALRYPIIILTMAVMVVFAMLHFVLPEFAAIYRTFNTPLPALTQGIITLAQWSAQWGWLMALPGVAMGVIYGLIIKKPYWQIQCQRLLLRCPVIARLVRGQKLTQIFTILALTQRAGIPFSQGLESVAETLNCPYWSQRLAQVHRDIHAGKPVWQALKNAGEFSPLCIQLVMTGETSGALDAMLHNLARHHGENTLSLADNLAALLEPALLVITGVIIGTLVVAMYLPVFHLGDAMSGMG >LR134204.1|VEB93288.1|3397593_3398637_+|guanosine-5'-monophosphate-oxidoreductase MRIEEDLKLGFKDVLIRPKRSTLKSRSDVELERQFTFKHSGQTWSGVPIIAANMDTVGTFEMASALASFDILTAVHKHYSVEDWNAFTSTASEDVLRHVMVSTGTSDADFEKTKQILAQSPALNFVCIDVANGYSEHFVQFVSKAREAWPTKTICAGNVVTGEMCEELVLSGADIVKVGIGPGSVCTTRVKTGVGYPQLSAVIECADAAHGLGGMIVSDGGCTMPGDVAKAFGGGADFVMLGGMLAGHEESGGKIVEENGEKFMLFYGMSSESAMTRHVGGVAQYRAAEGKTVKLPLRGPVEYTARDILGGLRSACTYVGASRLKELTKRTTFIRVQEQENRVFNSL >LR134204.1|VEB93287.1|3396747_3397368_-|dephospho-CoA-kinase MRYTVALTGGIGSGKSTVANAFADLGINVIDADIIARQVVEPGQPALMAIAEHFGSALIAPDGSLQRRMLRERIFASPEEKSWLNALLHPLIQQETRRQFQQATSPYVLWVVPLLVENALYKQANRVLVVDVTPETQLLRTMQRDDVTREHVEQILAAQATREARLAVADDVIDNNGAPDAIASDVARLHARYLQLASQFVSQEKP >LR134204.1|VEB93286.1|3396004_3396748_-|Protein-of-uncharacterised-function-(DUF1342) MHTQVLFEHPLNEKMRTWLRIEFLIQQLSVNLPLADHAGALHFFRNIGDLLDVFERGEVRTELLKELERQQRKLQAWVEVPGVDQSRIEALRQQLKMAGSILISAPRIGQQLREDRLIALVRQRLSIPGGCCSFDLPTLHIWLHLPQEQRDAQVETWIASLNPLNQALTLILDLIRNSSPFRKQTSLNGFYQDNGDDADLLRLQLALDSQLYPQISGHKSRFAIRFMPLDSENGLVPERLDFELACC >LR134204.1|VEB93285.1|3395800_3395995_-|zinc-binding-protein MSETITVNCPTCGKTVVWGEVSPFRPFCSKRCQLIDLGEWAAEEKRIPSSGDLSESDDWSEEQK >LR134204.1|VEB93284.1|3395322_3395712_+|nucleoside-triphosphate-pyrophosphohydrolase MKKLQIAVGIIRNPNHEIFITQRAADAHMANKLEFPGGKIEAGETPEQALIRELQEEVGITPREATLFEKLEYQFPDRHITLWFWLVDHWEGEPWGKEGQPGRWIAQDALNAEDFPPANAPVIEKLIAG >LR134204.1|VEB93283.1|3391990_3392497_+|secretion-monitor-precursor MSGILTRWRQLGRRYFWPHLLLGMVAASFGLPALSNAAEPNTPAKATASNHDQSVKVNFSQLALLEASNRRPNFTVDYWHQHAIRTVIRHLSFAMAPQTLPVVEESSPLQAHHIALLNTLSAMLTQEGTPPTIVRRVARAHFTPQASFSVPAWISQAQGIRAGPQRLS >LR134204.1|VEB93282.1|3390671_3391589_+|UDP-3-O-[3-hydroxymyristoyl]-N-acetylglucosamine-deacetylase MIKQRTLKRIVQATGVGLHTGKKVTLTLRPAPANTGVIYRRTDLNPPVDFPADAKSVRDTMLCTCLVNEHDVRISTVEHLNAALAGLGIDNIVIEVDAPEIPIMDGSAAPFVYLLLDAGIDELNCAKKFVRIKETVRVEDGDKWAEFKPYNGFTLDFTIDFNHPAIDSSSQRYAMNFSADAFMRQISRARTFGFMRDIEYLQSRGLCLGGSFDCAIVVDDYRVLNEDGLRFEDEFVRHKMLDAIGDLFMCGHNIIGAFTAYKSGHALNNKLLQAVLAKQEAWEYVTFQDDAELPLAFKAPSTVLA >LR134204.1|VEB93292.1|3401995_3402889_-|nicotinate-nucleotide-pyrophosphorylase-[carboxylating] MPPRRYNPDYRRDALLERINLDIPTAVAQALREDLGGEVDANNDITAQLLPENTRSHATVITREDGVFCGKRWVEEVFIQLAGDDVSITWRVQDGDSIKANQPLFELDGPSRILLTGERTALNFVQTLSGVASEVRKYVDLLAGTKTQLLDTRKTLPGLRTALKYAVLCGGGANHRLGLSDAFLIKENHIIASGSVRQAVEKAFWLHPDVPVEVEVENLDELDDALKAGADIIMLDNFETDQMREAVKRTNGQARLEVSGNVTDETLREFAETGVDFISVGALTKHVRALDLSMRFR >LR134204.1|VEB93293.1|3402976_3403540_+|N-acetyl-anhydromuranmyl-L-alanine-amidase MLLDKGWLAEARRVPSPHFDCRPDDETPSLLVVHNISLPPGEFGGPWIDALFTGTLDPAAHPFFAEIAHLRVSAHCLIRRDGEIVQYVPFDKRAWHAGVSNYQGRERCNDFSIGIELEGTDTLAYTDAQYQQLAAVTRTLIALYPAIADNMTGHCNIAPERKTDPGPSFDWAKFRALVTASSHKEMT >LR134204.1|VEB93294.1|3403536_3403875_+|regulatory-protein-AmpE MTLFTTLLVLIVERLFKLGEHWHLDHRLEVLFRRTTHFSMVRTLGMTIVAMAVTFLLLRALEGLLFNAPTLVVWILIGVLCIGAGKVRLHYHAYLNAASRNDAHAVKRWLTN >LR134204.1|VEB93295.1|3403859_3404015_+|regulatory-protein-AmpE MANELTLIHGVPPDCNEREFLRELQNALLWINFRFYLAPLFWFIVGGRGGR >LR134204.1|VEB93296.1|3404011_3404290_+|regulatory-protein-AmpE MTLVGYAFLRAWQSWLARYQTPHQRLQSGIDAILHVLDWIPVRLAGVVYALLGHGEKALPAWFVSLADLRTSQYQVLTRLATVLAGPRAAHG >LR134204.1|VEB93297.1|3404479_3404773_-|glycosyl-hydrolase MGLLWIDIHADPQNPANWHKSPRPVFTPAMKTASMGQGTTALPKPPEGDDVLVYHARNYTEIEGDPLYDPNRHTRLKLIRWTKNGMPDFGIPPADTV >LR134204.1|VEB93298.1|3404846_3405236_-|glycosyl-hydrolase MCELIWAPEIHRIDGKWYIYFAAAHTQALDKLGMFQHRMFVLECTDADPLSGVWEEKGQIKTHLIPSRWMPRLFSHQGKQWYLWAQKAPDIAGTPISILPGWKSVDDQSAPVMLSKPEYGLGVSGFSRQ >LR134204.1|VEB93299.1|3405252_3405855_-|symporter MLYFLLNILHQIPSPLHWSLMADVDDYGEWKTGKRITGISFSGNLFFLKLGLAIAGAMVGFLLSWYGYDAGAKAQSDTAIGGIVLLFTVIPGVGYLITAGVVRLLKVDRELMKQIQKDLEKRRASYRELNDDPQFNVAETVRKPDAKLAQPFIEQRADPFILRDGNHYYFIASVPEYDRLEIRRATRWKTYAPPPCRCLA >LR134204.1|VEB93300.1|3405794_3406322_-|symporter MVEWFGGDNKAKGYQMAMTVLAIIGMCMFLFCFATVRERIRPAVPTHDDMKNDFKDVWKNDQWVRILLLTLCNVCPGFIRMAATMYYVTWVMGQSTHFATLFISLGVVGMMIGSMLAKVLTDRWCKLKVFFWTNIALALFSCAFYFFDPKPLRLSWCSTSCLTSCIRSLPRCTGR >LR134204.1|VEB93301.1|3406333_3406825_-|symporter MEKGKLSIKEKIGYGMGDAGCNIIFGAIMLFVNYFYTDIFGLAPALVGVLLLSVRVIDAVTDPIMGAIADRTQSKYGRFRPWLLWIAFPYALFSILMFTTPEWTYNSKVIYAFVTYFLLSITYTAINIPYCSLGGVITNDPQERVACQSYRFVLVGLPLCCCH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_10 | 3718973-3719104 | Orphan |
NA
Consensus repeat of LR134204_10
|
1 spacers
spacers of LR134204_10
>10.1|3719023|32|LR134204|CRISPRCasFinder TTTACGCCGCCATCCGGCGTAATGCCCGGTGG |
CRISPR arrays and Neighbor proteins around LR134204_10
The CRISPR arrays of LR134204_10 >merge|LR134204|10|3718973-3719104|CRISPRCasFinder CGCTATGCTTACCGGGCCTACAATGTGCTCCCTGTAGGCCGGATAAGGCGTTTACGCCGCCATCCGGCGTAATGCCCGGTGGCGCTACGCTTACCGGGCCTACAATGTGCTCCCTGTAGGCCGGATAAGGCG >LR134204|10|10|3718973-3719104|CRISPRCasFinder CGCTATGCTTACCGGGCCTACAATGTGCTCCCTGTAGGCCGGATAAGGCG TTTACGCCGCCATCCGGCGTAATGCCCGGTGG CGCTACGCTTACCGGGCCTACAATGTGCTCCCTGTAGGCCGGATAAGGCG
>LR134204.1|VEB93659.1|3718027_3718936_+|manno(fructo)kinase MRIGIDLGGTKTEVIALGDAGEQLFRHRLPTPRDDYRQTIETIASLVEMAEKATGQTGTVGMGIPGSISPYTGMVKNANSTWLNGQPFDKDLSARLQREVRLANDANCLAVSEAVGWRGGRSADRFAVIIGTGCGAGVALNGRAHTGGNGTAGEWGHNPLPWMDEDELRYREEVPCYCGKQGCIETFISGTGFATDYHRLSGHPLKGNDIIRLVNEQDALAERALSRYELRLAKSLAHVVNILDPDVIVLGGGMSNVDRLYNTVPSLIKPFVFGGECETPVRKALHGDSSGVRGAAWLWPQE >LR134204.1|VEB93658.1|3717605_3717902_-|recombination-associated-protein MLWFKNLMVYRLSRDISLRAEEMEKQLALDDVYPLRQPGYGKNGLVPPMGSHSDALTHTANGQIIICARKEEKILPSPVIKQRWKRKSPNWRLTRGAS >LR134204.1|VEB93657.1|3716988_3717522_-|recombination-associated-protein MMWIDTVNGLIMVDCASAKKAEDTLALLRKSLGSLPVVPLALENPIELTLTEWVRSGTVAQGFQLLDEAELKAMLEDGGVIRAKKQDLVSDEIAVHIEAGKVVTKLALDWQQRIQFLICDDGSIKRLKFSDELRDQNEDIDREDYAQRFDADFILMTGELAALIQSLVEGLGGEAQR >LR134204.1|VEB93656.1|3716118_3716592_+|chorismate-biosynthesis-protein MFSRQKVERDLQSIIEVLDNQGYDVILLMNTAAINSMTARNTILLEPLRIIPPLVASIVDGHQVGVIVPVEELLDVQARKWQVLQRPPVFSLANPVQGSEQQLIDAGKDLLEQGADVIMLDSIGFNQRHRDLLQRALDVPVLLSNVLIARLASELLG >LR134204.1|VEB93655.1|3715913_3716177_+|chorismate-biosynthesis-protein MSASLAILTIGVVPMSEVLPLLTEYIDEQHITHHSLLGEISREEVLAEYAVEAGDDPLLTLLSDNQIVHVFAPESGARPAKYYRSAG >LR134204.1|VEB93654.1|3715458_3715650_+|Uncharacterised-protein MPTKPPYPREAYIVTVEKGTPGQTVTWYQLRADYPEPNALISEHPSAQEAMDAKTRYEDPDKS >LR134204.1|VEB93653.1|3715105_3715432_+|shikimate-kinase-II MVATGGGIILTEYNRRYMRENGIVIYLSAPVSTLVNRLEAAPEEGLRPTLTGKPLSEEVQEVLEQRDALYREAAHYIIDASHTPDQVVSEILAALSQAAQRLQGGVYN >LR134204.1|VEB93652.1|3714243_3714669_+|Uncharacterized-BCR,-YaiI/YqxD-family-COG1671 MTIWVDADACPNVIKEILYRASERMQMPLILVANQNLRVPPSRFIRTLRVPAGFDVADNEIVRQCEAGDLVITADIPLASEVLEKGAAALNPRGERYTESTIRERLTMRDFMDTLRASGFRRAAPDSLSPRDRQHFRRRAG >LR134204.1|VEB93651.1|3713295_3714105_-|pyrroline-5-carboxylate-reductase MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDNVAALHDRYGINAAQSAQDVAQVADIVFGAVKPGIMIKVLSEISSSLNKDSLVVSIAAGVTLDQLARALGHDRKIVRAMPNTPSLVNAGMTSITPNALVTPEDTADVLNIFRCFGEAEVIAESMIHPVVGVSGSAPAYVFMFIEAMADAAVLGGMPRAQAYKFAAQAVMGSAKMVLETGKHPGELKDMVCSPGGTTIEAVRVLEERGFRASIIEAMAKCMEKSEKLSKS >LR134204.1|VEB93650.1|3712180_3713236_+|diguanylate-cyclase-AdrA MNDENFYKKAVAQSKPPHSPQNDHQRSGLRFARRVRLPRAVGLGWMFLPIAAVLASQPIAGGWWLFLVGWSFVWPHLAWQLASKAIDPLSREIYNLKADAILAGVWVGVMGVNVLPSTALLMMMCMNLMGAGGLRLFIAGMVLMVVSCLVTLQLTGITVAFRSAPLEWWFSLPVIVIYPLLFAWVSYQTATKLAEHKRRLQVMSMRDGMTGVYNRRHWEILLRNEFDNCRRYHRDATLLIIDIDHFKSINDTWGHDVGDEAIIALTRQLQMTLRGSDVIGRFGGDEFAVIMCGTPADNAIAAMSRVHERLNALRLPCAPQVILRISVGVAPLTTQIGHYREMAEIGGYGAL >LR134204.1|VEB93660.1|3719188_3719452_-|exonuclease-subunit-SbcC MPFAIPATLSGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDALDALNATGKTIGVISHVEAMKNAFLCKLR >LR134204.1|VEB93661.1|3719802_3722277_-|exonuclease-subunit-SbcC MKILSLRLKNLNSLKGEWKVDFTAEPFASNGLFAITGPTGAGKTTLLDAICLALYHETPRLSTVSQSQNDLMTRDTAECLAEVEFEVKGESYRAFWSQNRARNQPDGNLQVPRVELARCADGKILADKVKDKLEMTASLTGLDYGRFTRSMLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISALVFEKHKTARTDLEKLQAQASGVSLLAPEQQHSLNESLQALTDEEKQLLARQQLHQQHLHWLTRQNELQAEMNRRQQALRVAQEEQENAQPQLAALKLAHPARQLRPHWERIEEQTAAAGRTRQQIQEVNTRLQSVLHLRSRIRHSAQQQSAELRATMQTLTGWLTEHERFRLWSSELAGWRALFAQQSSDKAQLLKWQQQCASDIRKRDALPPNPLTLTPEEATAALAQHTQQQPLRQRLASLHGQIAPKQKRREQLQTAIQNSQQELARRSAALEDKRQKYKEKNQQFMDVKTICEQEARIKDLESQRALLQSGQPCPLCGSTSHPAIASYQALEPGVNQARRDALEKEVKTLAEEGAALRGQLETLTQQLHRDESEAQALVKEEQALTQEWQTLCDALKVTLHPQDDISPWLTARQDYEQQLYQLSQRHMLQAQIAAHTGQVTQFQQQIDQRQTTLLSELRCYGLSLPADGEEASWLNARADDAQTWQQRQTELGELQTRIAQLVPLLETLPETDILPESDERVALDNWRQVHDDCVSLQSQWQTLQQQEAQEMQRVAQAQAHFDAALKASVFDDRAAFLAALLDDETIARLEQHRPGAGKPVTAGAGLSRSGEPGAGRTSTTSARGAGSHAHT >LR134204.1|VEB93662.1|3722325_3722730_-|exonuclease-subunit-SbcD MRQSKCVHLVSFANGKLHQVENLTVPVTQPLAVLKGDLASITEQLEQWRDAHQEPPVWLDIEITTDEYLHDMQRKIQALTESLPVEVLLVRRSRENRERIMANERRETLSELRVEDVFTAVWRWKSWMNPSVNV >LR134204.1|VEB93663.1|3722897_3723293_-|exonuclease-subunit-SbcD MYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDILAFLNTTVVASAGHAPQYLYRRDGTPGAVLCPIPFLRPRDIITSQAGLSGNEKQQHLLGAITDYYQQQYQEACKLRGDGDQTLPVIATGHLTTSAQ >LR134204.1|VEB93664.1|3723939_3724356_+|transcriptional-regulator-PhoB MRGLETGADDYVTKPFSPKELVARIKAVMRRISPMAVEEVIEMQGLSLDPTSHRVMTGENPLDMGPTEFKLLHFFMTHPERVYSREQLLNHVWGTNVYVEDRTVDVHIRRLRKALEHSGHDRMVQTVRGTGYRFSTRF >LR134204.1|VEB93665.1|3724425_3724764_+|phosphate-regulon-two-component-system,-sensor-kinase MLERLSWKRLVLELILCCIPAFILSAFFGYLPWFLLASVTGLLIWHFWNLLRLSWWLWVDMSMTPPPGRGSWEPLLYGLHQMQLRNKKRRRELGNLIKRFAAGRNPSPTRWC >LR134204.1|VEB93666.1|3724754_3725351_+|phosphate-regulon-two-component-system,-sensor-kinase MVLTTEDGGIFWCNGLAQQVLGLRWPDDNGQNILNLLRYPEFTQYLKTRDFIRPLNLVLNTGRHLEIRVMPYTDQQLLMVARDVTQMHQLEGARRNFFANVSHELRTPLTVLQGYLEMMQEQPLEGATREKALHTMREQTHRMEGLVKQLLTLSKIEAAPALLLNERVDVPMMLRVVEREAQALSQQKHTFTFEVDAG >LR134204.1|VEB93667.1|3725374_3725719_+|phosphate-regulon-two-component-system,-sensor-kinase MRSAISNLVYNAVNHTPAGTHITVRWQHVTHGAEFSVEDNGPGIASEHIPRLTERFYRVDKARSRQTGGSGLGLAIVKHALNHHESRLIIDSSPGKGTRFSFVLPERLIAKNCA >LR134204.1|VEB93668.1|3726128_3726527_+|branched-chain-amino-acid-transport-system-II-carrier-protein MTHQLKSRDIIALGFMTFALFVGAGNIIFPPMVGLQAGEHVWTAAFGFLITAVGLPVLTVVALAKVGGGVDSLSTPIGKVAGVLLATVCYLAVGPLFATPRTATVSFEVGIAPLTGDSHCRCLSTAWCISPL >LR134204.1|VEB93669.1|3726523_3727447_+|branched-chain-amino-acid-transport-system-II-carrier-protein MILVSLYPGKLLDTVGNFLAPLKIIALIILSVAAIIWPAGPISDAMDAYQNAAFSNGFVNGYLTMDTLGAMVFGIVIVNAARSRGVKEARLLTRYTVWAGLMAGVGLTLLYLALFRLGSDSATLVDQSANGAAILHAYVQHTFGGAGSLLLAALIFIACLVTAVGLTCACAEFFAQYVPLSYRTLVFILGGFSMVVSNLGLSHLIQISIPVLTAIYPPCIALVVLSFTRSWWHNSSRVIAPAMFISLVFGILDGIKASAIGDILPAWTQRLPLAEQGLAWLMPTVVMVVLAIIWDRAAGRQVTSSAH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_11 | 3894431-3894519 | Orphan |
NA
Consensus repeat of LR134204_11
|
1 spacers
spacers of LR134204_11
>11.1|3894454|43|LR134204|CRISPRCasFinder ATATGCCCGGTGGCGCTGCGCTTACCGGGCCTACAATAGCGTA |
CRISPR arrays and Neighbor proteins around LR134204_11
The CRISPR arrays of LR134204_11 >merge|LR134204|11|3894431-3894519|CRISPRCasFinder TGCTCCGTAGGCCGGATAAGGCGATATGCCCGGTGGCGCTGCGCTTACCGGGCCTACAATAGCGTATGCTCCGTAGGCCGGATAAGGCG >LR134204|11|11|3894431-3894519|CRISPRCasFinder TGCTCCGTAGGCCGGATAAGGCG ATATGCCCGGTGGCGCTGCGCTTACCGGGCCTACAATAGCGTA TGCTCCGTAGGCCGGATAAGGCG
>LR134204.1|VEB93872.1|3893439_3894306_+|Uncharacterised-protein MTVMTRREPTVMQPADNRKQLGAFLRARRESLDPQRMGLPRVGRRRTPGLRREEVAMLAEVGVTWYTWLEQGREVNPSESVLVGVANALQCSPLETRHLFVLAGLTPPESTQITVCEGISAGTRRMLDSLMPQPASIQKPNFDIVAWNESFCRLMGVDFATIPDDDRNCIYLYLTNETWRSRIENRDVLPTFVSYFRAAMAEHRGDPGWENKLARFFAASSEFEALWHQRYEVRGVENQVKNFLHPRLGRFSLQQMYWYSAPRNGSRLLVYLPMDEAGEQALAWLDKQ >LR134204.1|VEB93871.1|3891894_3893076_-|methyl-viologen-resistance-protein-SmvA MASPPCFAVWRNSIHMLLGARILQGAGAALIVPQILATLHVTLKGSAHAKAISLYGGIGGIAFIVGQMGGGWLVSADIAGLGWRNAFFINVPVCLLVLAFSRRHVPETASETRADIDWLGTVCLAAILCCLLFPMALGPETGWPWQAQAALFAILPLGWWMRKNALRQELRHQHPLLPPRLLRLTSIRFGILIALLFFSAWSGFMFCMALTLQMGLGMAPWESGNSFIALGAAYFISACYAPRLIVRYSMGRILLMGLAIQIVGLAGLMMTFWHSGMQTSTFALAPATALIGYGQALIVNSFYRIGMRDISTQDAGAGSAILSTLQQATLGLGPAVLGALFLHLQHNSHGDYTQSVIGFLAVEAAMMLSLVLATLWFRRALNINATRPCMASK >LR134204.1|VEB93870.1|3890575_3891820_-|mechanosensitive-ion-channel-protein MQELISQVEELGIEMNHTTSLVMIFGIIFLTAIIVHIILHWVVLRAFEKRASASSRLWLQIITQNKLFHRLAFTLQGIIVNIQAVLWLQKGTEAAEILTTCAQLWIMMYALLSVFSLLDVILNLSQKFPAASQLPLKGIFQGIKLIGAIIIGILMISLLIGQSPAILISGLGAMAAVLMLVFKDPILGLVAGIQLSANDMLKLGDWLEMPKYGADGAVIDIGLTTVKVRNWDNTITTIPTWSLVSDSFKNWSGMSASGGRRIKRSISIDATSIHFLDEDEQQRLYKAHLLKPYLTTRHQEIQEWNQQQTAPESVLNHRRMTNIGTFRAYLNEYLRHHPRIRKDMTMMVRQLAPDDQGLPIEIYAFTNTVVWLEYESIQADIFDHIFAVVEEFGLRIHQSPTGNDIRALSGAFQR >LR134204.1|VEB93869.1|3889592_3890573_+|ribose-operon-repressor MSIQKIAQLAGVSVATVSRVLNNSDTVKAKNRERVLQAIKESNYQPNLLARQLRTARSSMILVMVSNIANPFCAEVVKGIEEEAEKNGYRILLCNSGSDMARSKSALKLLSGKIVDGIITMDAFSKLPELTTMIGDAPWVQCAEYADKGAVSCVGINDVDAAQSVISHLVEKGYQRIALINHDLSYKYARLRERGYKAALHAWSLDYQAVEYASELSSSAGMAAMDVLLAAETRPDAVFAVSDTLAAGAMRAIANAGLRIPQDIAIVGFDGTELAEMVSPQLTTIEQPSRDIGRKAVGLLLNRIEHPDAPTERVMMDWRYISRAST >LR134204.1|VEB93868.1|3888363_3889515_+|oxidoreductase MIQVGIIGTGFIGPAHIEALRRPGNVSVVALCDSSLEKAQEKARALNVPHAYGSVEALLAHPGLQVVHNCTPNYLHADINRQALRAGLSVFSEKPLCMTPDEARELAALAEQAGVVHGVSFVYRQFAMVQQAASMVRAGSVGRQFSVQGSYLQDWMLLETDYNWRVDPALGGDSRAVADIGSHWCDTVQFVTGKRIVEVMADLATVWPTRKASVEGGSTFTQPLTERQWVDKPVSTEDRGAVLVRFDDGSKGCFSVSQVSAGRKNQLTFEINGSHCSLAWDQEVPQQLWIGHRQQANQILTDDPGLMNADVAGSAHFPGGHIEGWPDAFKNMMQQFYLAVQAGKMPAPEVRRFASFADGADVMYIIDAIVKSHQHQRWVSVMR >LR134204.1|VEB93867.1|3887338_3888367_+|Inosose-dehydratase MRTIKGPGIFLAQFIGGQPPFNTLEGLAGWAAGLGYKALQIPCNHKALFDVEQAAVSQTYCDDIRGMLADHGLVISELSTHLEGQLIAVHPAYDEAFAGFAPPSVRGNPQARQAWATQMLQQAAAASQKLGLTAHATFSGSLAWPYFYPWPPHNRQRFDEAFSELARRWRPILDCFDEHGVDVCYEIHPSEDLHDGVTFERFLALVDDHPRCHILYDPSHLLLQQMDYLGFVDVYHSRIKAFHVKDAEYRASSRSGVYGGYQPWIERAGRFRSPGDGQVDFKTLFSKFAQYDYPGWAVLEWECCLKEAATGAREGSEFIRRHIIPVSEHAFDDFAAGDEVRK >LR134204.1|VEB93866.1|3886081_3887323_+|major-facilitator-superfamily-protein MVSTTESSGKQTVQYRLLVPRLSLMMFLQFFIWGSWSVTLGLVMTQYNMSLLIGDAFSAGPIASILSPFVLGMLVDRFFASQKVMAVMHVAGAAILWFVPQALVAQNGALLIGLLFGYTLCYMPTLALTNNIAFHSLSDKDKTFPVVRVFGTIGWIVAGIFIGVTGISDTTGIFTLAALCSVILALYSLTLPHTPAPAKGLPVKIRDLFCADAFALLKTRHFFVFSLCATLISVPLGTYYAYTASFLADAGVGDVSTAMSFGQMSEIVFMLVIPFLFRRLGVKYMLLIGMCAWFVRYAFFALGISEEGRFLLYLGILLHGVCYDFFFVVGFIYTDRIAGEKVKGQAQSMIVMFTYGIGMLLGSQISGALYNRLVAGQAVPQAWVTFWWLPAVAAAGIAAIFLFAFKYDEKEQA >LR134204.1|VEB93865.1|3884489_3885884_+|phenylalanine-transporter MKNASTASGNSLSDAASNGEPTLQRGLQNRHIQLIALGGAIGTGLFLGIGPAIQMAGPAVLLGYGVAGIIAFLIMRQLGEMVVEEPVSGSFAHFAYKYWGPFAGFLSGWNYWVMFVLVGMAELTAAGIYMQYWLPDVPTWIWAAAFFIIINAVNLVNVRLYGETEFWFALIKVLAIIGMIGFGLWMLFSGNGGEHASIENLWRYNGFFATGWNGLILSLAVIMFSFGGLELIGITAAEARDPHKSIPKAVNQVVYRILLFYIGSLVVLLALYPWVEVQSNSSPFVMIFHNLDSNVVASALNFVILVASLSVYNSGVYSNSRMLFGLSVQGNAPKFLTRVSRRGVPVNSLVLSGAITSLVVLINYLLPQKAFGLLMALVVATLLLNWIMICLAHLRFRAAMRRKGRDTQFRALLYPAGNYLCIAFLAMILVLMCTIDDMRMSAILLPVWILFLFVAFKLLRRKTR >LR134204.1|VEB93864.1|3883278_3884364_+|Gluconolactonase MKKTTSALALLIAVALSGCAATSGSSTPVAPQATPTAPAVATDVQKRDLADGLYEMALSPSGDALYVASAEGFKDVQGGVVYKLDPKTLKTIGASHTDLKNFGMAISPDGKTVYVTNSLDGGVSALNTADGKVKNRVLFPERNPEGFPYGARQVLLHNGLLYIGAVADPAVIWVVDADTLKLKTRIKNTGKWMTGLHYSDSTQRIYAANGGGEILVINPRTQRVEKRWKPLGDKPALLLNMAEDTQTGRLFVTDNSKAKTTLVLDIHTGKLIKQLEVGDSLSVKFNPKRNELYISQRESGKLLSLNATDYSVKKSWDLPPNPNSLLLSADGQTLYVTVKQKFNKDHSTDAPDSVVRIDLNK >LR134204.1|VEB93863.1|3881605_3882871_-|Uncharacterised-protein MNKIIKRLEIVKSAIELEDEEIIHQQLAHLKDASLDAAIGTIALAIEERRFGDAMREIAAWLQSQRAVSTWQDPGIAASKLELKALETQLRELIDKRNARIQILDDFNDLYHLRLGPLMGRILELRKQLAASAQRRQEAELRRREKDYQSCQQYISQAIDQLVKLKQHWVSLSSTSREAVETRERIQQQTELITSLLAEIRELENDFSRQDDSATRQAREEADHEYEEYQEQHQDAQHRYARDQRLSADERNELKRLWRQASRLCHPDVVADELKEKAHQMMVQLNQARQNADLATIRSLLTQLQSGLEPMLASDRLNNLAHLRSKIEQLRTQINALLKEISELETENAWRLATSVTDKEAYFAEQERALAEIRDTLEVQVNMRNKRSSPGKAERQGGCSQHHHDHPLPFIAVELYYPLKN >LR134204.1|VEB93873.1|3894545_3895199_-|dihydropteridine-reductase MDIVSVALKRYSTKAFDPSKQLTADEAEKLKTLLQYSPSSTNSQPWHFIVASTEEGKARVAKSAAGNFVFNERKMLDASHVVVFCAKTAMDDAWLDRVVDQEDADGRFATPEAKAANNKGRRFFADLHRRDLKDDDQWMAKQVYLNVGNFLLGVAAMGLDAVPIEGFDAAVLDAEFGLKEKGYTSLVVVPVGHHSVEDFNAALPKSRLPQETTLTEV >LR134204.1|VEB93874.1|3895319_3895688_-|Uncharacterized-protein-conserved-in-bacteria MDGQTLHRCAKRIALELPFTEHCWPFGPEFDVFKVGGKIFMIVSEQRGRRFVNLKSDPQKSLLNQQIYRSIEPGYHMNKKHWISVYAGDDITPSLLADLIGDSWNLVVDGLAKKDQKRLRPT >LR134204.1|VEB93875.1|3895687_3896269_-|TetR-family-transcriptional-regulator MARPKSEDKKLALLEAATKAIAQSGIAASTAVIARNAGVAEGTLFRYFATKDDLINELYLHLKQDLCQSMMANLDRSITDTRTMTHFIWNSYINWGLNNTNGHRTIRQLAVSEKITKETEQLADDMFPELRDLCHRSVLPIFMSDEYRAFGDALFLTLAETTMEFAARDPAHANEYISLGFEAMWRALTREEK >LR134204.1|VEB93876.1|3896459_3897560_+|putative-L-ascorbate-6-phosphate-lactonase MKRLTICVVVIMLIASTASLPFVLNAGFGQVPQGAQLSLVEQSPHYRDGQFHNQLPTPGYTGDKGMLAAWWEFLVAKRENARPAQPLPLVNTDLASVPRDRDTLIWLGHSSWYLQLAGKRILIDPVFSSYAAPFSFLNKAFAGDYPWTAQNMPEIDLLIISHDHYDHLDYATIKALMPKIKRVITPLGVGSHLRYWGMNGGIIDERDWNQSVRIDDALLIHVLPARHFSGRGLKRNQTLWASFMFETPEQKVYYSGDSGYGPHFKAIGEQFGSVDLAIMENGQYDRDWRYIHMMPEETAQAAQDLRAKAILPGHAGRFVLAKHSWDDPYKRLALASRHKRYRLLTPTLGEPVMLADPTQQFTAWWE >LR134204.1|VEB93877.1|3897589_3897931_+|transcriptional-activator-RamA MTISAQVIDTIVEWIDDNLNQPLRIDDIARHAGYSKWHLQRLFMQYKGESLGRYIRERKLRLAARDLRDTDQRVYDICLKYGFDSQQTFTRIFTRTFNQPPGAYRKENHSRTH >LR134204.1|VEB93878.1|3897967_3898768_-|ferrichrome-iron-TonB-dependent-receptor MDWTEVRTTDYIDSEKTQQNDNKFTWRTGLLYAFDFGLSPYISYSTSYEPNLQTNRAPGSAPFKPTTGKQTEVGVKYQPVDNTLMSLALYDLKQSNVSTYNSTLGWFENAGEVRSKGVEAEIHSSLWDSVNLIGSYTYTDAETVNTTVAGTEGKTPARIPAHMASAFASYTFPGGPLKSLTTGVGVRYIGTSYGDAKNTFKVPAVDLYDAMVSYELGELNSSLKGAAVQFNVNNIADTKYVASCASDTACFYGVGRTVTATVSYSW >LR134204.1|VEB93879.1|3898724_3900116_-|ferrichrome-iron-TonB-dependent-receptor MIFKNNKIMQLWVLSLATVSTTTFAKTQEETILVTQGVSQEPTAPVKGMVATKTLSATKTSAELVKTPQSVSVVTRDQMDALDATSVSQALRYTAGAFTEYRGSSNRNDEVFVRGFSYVPKFLDGLSFGATASSQTGTVDPWLLERVELVRGPASVLFGQVNPGGLISMTSKRPTSEPIHKVQFSTGNRDLAEGAFDFGGSLSDDGRVLYRLNGIARTQHNQVEDYKDSRVAIAPAITWYPNDQTRFTLLTSYQKDPDAGYRNFLPAYGTVTSANGKYIPLDFNVSDPDYDQSWREQTMVGYEFEHQFNDMMTFRQNARYASIKQKYRYLVYFNSKPESTLLSRRAQHEERTTNEFGIDNQLEAQFATAQMNHTLLGGLDYKSSNDKQLLMRGSGSQYDMDWTHPVYGVNVDESTFSPASHEQQNLDQMGLYLQDQMSWNNWELLLSGRYGLDGSSHHRLHRQ >LR134204.1|VEB93880.1|3900376_3900589_+|Uncharacterised-protein MPAGEICFVLPALSENYVSATLSGITDARLLDGQNRRIRTLLEGGPADGEHQLLFSLPVQQATSLVLHGK >LR134204.1|VEB93881.1|3900617_3901187_+|enterobactin/ferric-enterobactin-esterase MKETTPLPKIQRVAPVSPTLQQLEKALAAGAGTAHFWQDLQRNGTPLVEPVDDSHKRVTFLWRGAKQNVFILGSPAGDHDPLFRLGDSDVWFRSYVVPADTVMQYKLAPDVPLVNGSPRDQRRAILVSAQRDPLNPLTLGEKYADRWNQFSLLDLSPARFLLCASYRAACSLRFAYAQNIVQRTSGQQS >LR134204.1|VEB93882.1|3901238_3901670_+|enterochelin-esterase-(ferric-enterobactin-esterase) MLFDGKTYLDDYHIDRVLDGLIARHQLPPINVVFIDTLDHARRAKELPPNPDFADFMAHELLPWLRQQGITTQRQKTVLAGSSYGGLASSWVALRYPRLFGNVLSLSGSYWWAPKDEEASWLTRQYQNSPRYPVPLLVAGWPL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_12 | 4513217-4513308 | Unclear |
NA
Consensus repeat of LR134204_12
|
1 spacers
spacers of LR134204_12
>12.1|4513244|38|LR134204|CRISPRCasFinder ATGTGCCCGGTGGCGCTGCGCTTACCGGGCCTACAGGA |
cas3 |
CRISPR arrays and Neighbor proteins around LR134204_12
The CRISPR arrays of LR134204_12 >merge|LR134204|12|4513217-4513308|CRISPRCasFinder TATGCTTCCCGTAGGTCGGATAAGGCGATGTGCCCGGTGGCGCTGCGCTTACCGGGCCTACAGGATATGCTTCCCGTAGGCCGGATAAGGCG >LR134204|12|12|4513217-4513308|CRISPRCasFinder TATGCTTCCCGTAGGTCGGATAAGGCG ATGTGCCCGGTGGCGCTGCGCTTACCGGGCCTACAGGA TATGCTTCCCGTAGGCCGGATAAGGCG
>LR134204.1|VEB94575.1|4512082_4513051_-|peptidoglycan-binding-protein MIMNNMRLSRWLAFFTLAASVALAIPAQANTWPLPPPGSKLVGENTFHVVENNGGSLEAIAKKYNVGFLALLQANPGVDPYVPRAGSVLTIPLQTLLPDAPREGIVINLAELRLYYYPPGKNSVTVYPIGIGQLGGDTLTPTMVTTVSDKRANPTWTPTANIRARYKAQGIDLPAVVPAGPDNPMGHHAIRLAAYGGVYLLHGTNADFGIGMRVSSGCIRLRDGDIETLFRQVTPGTKVNIINTPIKASVEPGGMRLVEVHQPLSKNIDDDPQILPIVLNGQMQAFKDAAQTDAAVMEHVMEVRSGMPVDVTRHPGITQQSM >LR134204.1|VEB94574.1|4511746_4512004_+|Multiple-stress-resistance-protein-BhsA-precursor MKNVKTLIAAAVLSSLSFASFAAVEVQATPVDQQKVGTISATAGTNLGSLEDQLAQKADEMGAKSFRITSVTGPNTLHGTAVIYK >LR134204.1|VEB94573.1|4510870_4511506_-|TetR-family-transcriptional-regulator MTTDSTGCVKKSRGRPKVFDREAALDKAMTLFWQHGYEATSLADLVEATGAKAPTLYAEFTNKEGLFRAVLDRYISRFAAKHEAQLFCEEKSVESALEDYFTAVATCFTSKDTPAGCFMINTSATLAASSREIAHTVKSRHAMQEQTLTQFLRQRQERGEIPAHCNPQTLAEYLNCILQGMSISAREGATFDKLMQITRTTLRLWPEMLKA >LR134204.1|VEB94572.1|4510257_4510776_+|Uncharacterized-protein-conserved-in-bacteria MNKSMLAGIGIGVAAALGVAAVASLNVFDRSPKYAQVVSATPIKETVKTPRQECRNVTVTHRKPVQDENRIAGSVLGAVAGGVIGHQFGGGRGKDVATVVGALGGGYAGNQIQGSMQENDTYTSTQQRCKTVYDKSEKMLGYDVTYKIGDQQGKIRMDKDPGTQIRWIITAN >LR134204.1|VEB94571.1|4508666_4509971_+|NADH-dehydrogenase MTTPLKKIVIVGGGAGGLEMATQLGKKLGRKKKAKITLVDRNHSHLWKPLLHEVATGSLDEGVDALSYLAHARNHGFQFQLGSVMDIDREAKTITIAELRDEKGELLVPERKIAYDTLVMALGSTSNDFNTPGVKEHCIFLDNPHQARRFHQEMLNLFLKYSSSMGVSGKVNIAIVGGGATGVELSAELHNAVKQLHSYGYKGLTNEALNVTLVEAGERILPALPPRISAAAHSELTKLGVRVLTQTMVTSADEGGLHTKDGEHIKADLMVWAAGIKAPDFLKDIGGLETNRINQLVVEPTLQTTRDPDVYAIGDCASCARPEGGFVPPRAQAAHQMASCALNNILAQMKGKPMKDYVYKDHGSLVSLSNFSTVGSLMGNLMRGSMMVEGRIARFVYISLYRMHQIALHGYFKTGLMMLVGSINRVIRPRLKLH >LR134204.1|VEB94570.1|4507875_4508520_+|Predicted-esterase MIIYLHGFDSNSPGNHEKVLQLQFIDSDVRLISYSTRHPKHDMQHLLKEVDKMLQLNVDDRPLICGVGLGGYWAERIGFLCDIRQVVFNPNLFPYENMEGKIDRPEEYADIATKCVTNFREKNRDRCLVILSRHDEALDSHRSAKELHHFYEIVWDEEQTHKFKNISPHLQRIKAFKTLGLSRCNPITPLRNQPSGWFFYEHSGRVVYLFSEKT >LR134204.1|VEB94569.1|4506826_4507852_+|beta-hexosaminidase MGPVMLDVEGFELDAEEREILAHPLVGGLILFTRNYHDPEQLRELVRQIRAASRNHLVVAVDQEGGRVQRFREGFTRLPAAQSFAALHGLEEGGKLAQDAGWLMASEMIAMDIDISFAPVLDVGHISAAIGERSYHADPLKALAMATRFIDGMHAAGMKTTGKHFPGHGAVTADSHKETPTDPRPEAEIRARDMSVFRSLITGNKLDAIMPAHVIYSDVDPRPASGSPHWLKTVLREELGFDGVIFSDDLSMEGAAIMGSYAERGQASLDAGCDMILVCNNRKGAVSVLDNLSPINAERVTRLYHKGSFSRQELMDSARWKTVSAQLNQLHERWQEEKAGH >LR134204.1|VEB94568.1|4505991_4506816_+|thiamine-kinase MQSSNNNLLTRDDVLSRYFPQYHPVAACHNGLSGGSCIIEHDARRLVLRCHHDPFAPESDFLRQYHALSRLPETLAPRPRFYIPGWMAVDYLQGEVKSGLPDTDELAGLLYHLHQQPRFGWRISLLPLLMRYWQGSDPARRTPYWLRMLKRLRAAREPRPLRLAPLHMDVHGDNLVQTASGLRLIDWEYAGDGDIALELAAVWVEDERQHQRLVRSYAAVAHICPQTLWQQVRLWRPWVMMLKAGWFEYRWQQTGEQQFIRLADDAWRQLKKKG >LR134204.1|VEB94567.1|4505456_4506011_+|lipoprotein MKSKPAPEQPAEPQQPVPVVPSVPTIPQQPGPIEHEDQTAQPAPRVRHYDWNSAMQPMVGKMLQADGVTAGNVLLVDSVNNRTNGSLNAGEATETLRNALANNGKFTLVSAQQLAMAKQQLGLSPQDSLGTRSKAIGIARNVGAQYVLYSSAAGNVNAPSLQMQLMLVQTGEIIWSGKGAVQQQ >LR134204.1|VEB94566.1|4505368_4505491_+|lipoprotein MMKMNRYALVAALAIFLSGCVGQREPAPVDEVKTGAGTAS >LR134204.1|VEB94576.1|4513337_4513727_-|transcription-repair-coupling-factor MNTRLSFYKRIASAKNENELEELKVELIDRFGLLPDPARNLLDIARLRQQAQKLGIRKLEGNEKGGTIEFAEKNHVNPTWLIGLLQKQPQHFRLDGPTRLKFIQDLAERKTRMDWVRQFMAQLEENAAA >LR134204.1|VEB94577.1|4513740_4515510_-|transcription-repair-coupling-factor MIGAAEHGFIDTQRNLALICESDLLGERVARRRQDSRRTINPDTLIRNLAELHPGQPVVHLEHGVGRYAGMTTLEAGGIKGEYLMLTYANDARLYVPVSSLHLISRYAGGAEENAPLHKLGGDAWSRARQKAAEKVRDVAAELLDIYAQRAAKEGFAFKHDREQYQLFCDSFPFETTPDQAQAINAVLSDMCQPLAMDRLVCGDVGFGKTEVAMRAAFLAVENHKQVAVLVPTTLLAQQHFDNFRDRFANWPVRIEMLSRFRSAKEQAQILEQVAEGKIDILIGTHKLLQSDVKLKDLGLLIVDEEHRFGVRHKERIKAMRADVDILTLTATPIPRTLNMAMSGMRDLSIIATPPARRLAVKTFVREYDNLVVREAILREVLRGGQVYYLYNDVENIQKAADRLAELVPEARIAIGHGQMRERELERVMNDFHHQRFNVLVCTTIIETGIDIPTANTIIIERADHFGLAQLHQLRGRVGRSHHQAYAWLLTPHPKAMTTDAQKRLEAIASLEDLGAGFALATHDLEIRGAGELLGEDQSGQMETIGFSLYMELLENAVDALKAGREPSLEDLTSQQTEVELRMPSAAAG >LR134204.1|VEB94578.1|4515745_4516786_-|transcription-repair-coupling-factor MPEQQRYTLPTKPGDQRQLGELTGAACATLVAEIAERHAGPIVLIAPDMQNALRLHDEVRQFTDQLVMNLADWETLPYDSFSPHQEIISSRLSTLYQLPSMQRGVLIVPVNTLMQRVCPHSYLHGHALVMKKGQRLSRDALRAQLDSAGYRHVDQVMEHGEYATRGALLDLFPMGSEQPYRLDFFDDEIDSLRLFDADTQRTLEEVDAINLLPAHEFPTDKTAIELFRSQWRDTFEVKRDAEHIYQQVSKGTLPAGIEYWQPLFFNEPLPPLFSYFPANTLLVNTGALENSAERFQADTLARFENRGVDPMRPLLPPESLWLRVDELFSELKRWPPRTVKNRTSAR >LR134204.1|VEB94579.1|4516915_4517989_-|acyltransferase MKQKELWINQIKGLCICLVVIYHSVITFYPHLTAFQYPLSEILTKCWIYFNLYLAPFRMPVFFFISGYLIRRYIDSVPWGTCIDKRIWSIVWVLALWGVAQWLALSWLNHWLAPERDINNASNAAYAGSVGEFIHGMLTASTSLWYLYALVVYFVLCKVFSQWAKPLFVLFVLLSVTINFVPTPWWGMNSVIRNLPYYSLGAWFGATLMEWIKHVPLRRYAIAFAGIALLAVIAWLANVSLLLSLVSIVLIMKLFYQYEQRFGMRSSSLLNVIGSNTIAIYTTHRILVEAFSLTLIPKINGAAWSPQLEFALLLVYPFASLLICTLTGLTVRKISQRLFADLFFSPPSLPAVTSYSR >LR134204.1|VEB94580.1|4518142_4519450_+|outer-membrane-specific-lipoprotein-transporter-subunit-LolC MILFRITLTNTYGSVIYPFRFRLYTQGFANSNQTDFMYQPVALFIGLRYMRGRAADRFGRFVSWLSTIGITLGVMALVTVLSVMNGFERELQNNILGLMPQAILSSEQGSLNPQQIPEKAVTLNGVNRIAPLTTGDVVLQSARSVAVGVMLGIDPAQKDPLTPYLVNVKQTSLEAGKYNVILGEQLAGQLGVNRGDQIRVMVPSASQFTPMGRLPSQRLFTVIGTFAANSEVDGYQMLTNIQDASRLMRYPAGNITGWRLWLNEPLKVDTLSQQTLPEGTKWQDWRERKGELFQAVRMEKNMMGLLLSLIVAVAAFNIITSLGLMVMEKQGEVAILQTQGLTPRQIMMVFMVQGASAGIIGALLGAVLGALLASQLNNLMPIIGAFLDGAALPVAIEPLQVVVIALVAMAIALLSTLYPSWRAAATQPAEALRYE >LR134204.1|VEB94581.1|4519442_4520144_+|lipoprotein-transporter-ATP-binding-subunit MNKILLQCDNLCKRYQEGSVQTDVLHDVSFSIGEGEMMAIVGSSGSGKSTLLHLLGGLDTPTSGDVIFSGQPMSKLSSAAKAELRNQKLGFIYQFHHLLPDFTSLENVAMPLLIGKKKPAEITERAREMLQAVGLDHRANHRPSELSGGERQRVAIARALVNNPRLVLADEPTGNLDARNADSIFQLLGELNRAQGTAFLVVTHDLQLAKRMSRQLEMRDGLLTAELSLMGAE >LR134204.1|VEB94582.1|4520143_4521388_+|lipoprotein-releasing-system-transmembrane-protein MASPLSLLIGLRFSRGRRRGGMVSLISVISTIGIALGVAVLIVGLSAMNGFERELNNRILAVVPHGEIEAVNQPWNHWREALEKVQHVQGIAAAAPYINFTGLVESGANLRAIQVKGVDPKQEQRLSALPSFVQNHAWDNFRAGEQQIIIGKGVADALKVKQGDWVSIMIPNASADHKLQQPKRVRLHVTGILQLSGQLDHSFAMIPMEDAQQYLDMGASVSGIAIKVNDVFNANKLVRDAGEVTDSYVYIKSWIGTYGYMYHDIQMIRAIMYLAMVLVIGVACFNIVSTLVMAVKDKSGDIAVLRTLGAKDGLIRAIFVWYGLLAGLLGSLCGVAIGVVVSLQLTPIIEGIEKLIGHQFLSGDIYFIDFLPSELHWLDVIYVLVTALLLSLLASWYPARRASNIDPARVLSGQ >LR134204.1|VEB94583.1|4521420_4522332_+|N-acetyl-D-glucosamine-kinase MYYGFDIGGTKIALGVFDNERRLRWEKRVPTPREGYEAFLTAVCDLVAEADQRFDVKGSVGIGIPGMPETEDGTLYAANVPAASGKPLRADLSARLDRDVRLDNDANCFALSEAWDDEFTQYPLVMGLILGTGVGGGLVLNGKPITGRSYITGEFGHMRLPVDALTLMGFDFPLRRCGCGQLGCIENYLSGRGFAWLYQHYYHQPLQAPEIIALWEQGDERARAHVERYLDLLAVCLGNILTIVDPDLVVIGGGLSNFTAITTQLADRLPRHLLPVARVPRIERARHGDAGGMRGAAFLHLTD >LR134204.1|VEB94584.1|4522347_4523169_+|NAD-dependent-deacetylase-(regulatory-protein-Sir-homolog) MQSRRAHRLSRFRKNKRHLRERLRQRIFFRDRVVPEMMEKPRVLVLTGAGISAESGIRTFRAADGLWEEHRVEDVATPEGFARDPELVQSFYNARRQQLQRPEIQPNPAHVALAKLEEVLGDRFLLVTQNIDNLHERAGNKNIIHMHGELLKVRCSQSGQILDWTGDVTSDDKCHCCQFPASLRPHVVWFGEMPLGMDEIYMALAMADVFIAIGTSGHVYPAAGFVHEARLHGAHTVELNLEPSQVGSEFEEKYYGPASQVVPAFIEKLLKGL >LR134204.1|VEB94585.1|4523270_4524317_-|spermidine/putrescine-ABC-transporter-periplasmic-substrate-binding-protein MKKWSRHLLAAGALALGMSAAHADDNNTLYFYNWTEYVPPGLLEQFTKETGIKVIYSTYESNETMYAKLKTYKDGAYDLVVPSTYYVDKMRKEGMIQKIDKTKLTNFHNLDPEMLNKPFDPNNDYSIPYIWGATAIGVNSDEIDPKTVTSWADLWKPEYKGSLLLTDDAREVFQMALRKLGYSGNTTDPKEIEAAYSELKKLMPNVAAFNSDNPANPYMEGEVNLGMVWNGSAYVARQAGTPLEVIWPKEGGIFWMDSLSIPANAKNKEGALKLINFLLRPDVAKEVAETIGYPTPNLAARKLLSPEVANDKSLYPDADTINKGEWQNDVGSASAIYEEYYQKLKAGR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_13 | 4578232-4578322 | Orphan |
NA
Consensus repeat of LR134204_13
|
1 spacers
spacers of LR134204_13
>13.1|4578262|31|LR134204|CRISPRCasFinder ATCAGGCATCAGTATGGCACGACGCTTTCCC |
CRISPR arrays and Neighbor proteins around LR134204_13
The CRISPR arrays of LR134204_13 >merge|LR134204|13|4578232-4578322|CRISPRCasFinder GTAGGCCTGATAAGCGTAGCGCCATCAGGCATCAGGCATCAGTATGGCACGACGCTTTCCCGTAGGCCTGATAAGCGCAGCGCCATCAGGC >LR134204|13|13|4578232-4578322|CRISPRCasFinder GTAGGCCTGATAAGCGTAGCGCCATCAGGC ATCAGGCATCAGTATGGCACGACGCTTTCCC GTAGGCCTGATAAGCGCAGCGCCATCAGGC
>LR134204.1|VEB94653.1|4577037_4577607_+|decarboxylase MNDLQPWVSPLTHIPSSLKPLVATQKKHYGDVLHPTRWWGRMPFLFWLVALFVGFLERKRARVTPVMRALLMTRVSQVCHCAFCVDANSLRLAERSGALDKVQAVAGWQSSTLFSEEERVALAYAEAVTATPPQVDEALKAMMKRYFTDDAITEMTALIAFQNLSARFNAALDIPSQGLCDALKGAPHV >LR134204.1|VEB94652.1|4576544_4576967_-|pyrimidine-(deoxy)nucleoside-triphosphate-pyrophosphohydrolase MNMMKTIDVVAAIIERDDKILLAQRPEHADQPGMWEFAGGKVESSETQPQALIRELREELGIEAVVGRYIASHQREVSGRLIHLHAWHVPAFTGTVTAHYHQNMIWCSPKEALRYPLAPADIPLLEAFMALRDARPTDSY >LR134204.1|VEB94651.1|4576309_4576585_+|Protein-of-uncharacterised-function-(DUF1496) MHRLFIAAAAAMVSFVALANQHYRPDVEVNVPPEVFSSSGQRAQPCNQCCVYQDQNYSEGAVIKAEGVLLQCQRDEKTLSTNPLVWRRVKP >LR134204.1|VEB94650.1|4575892_4576072_-|glutamate-dehydrogenase MDQTCSLESFLNHVQQRDPNQTEFAQAVREVMTTLWPFLEQNPRYRQMSLLNVWLSQNA >LR134204.1|VEB94649.1|4575202_4575790_-|glutamate-dehydrogenase MRFHPSVNLSILKFLGFEQTFKNALTTLPMGGGKGGSDFDPKGKSEGEVMRFCQALMTELYRHLGPDTDVPAGDIGVGGREVGFMAGMMRKLSNNSACVFTGKGLSFGGSLIRPEATGYGLVYFTEAMLKRHGLGFEGMRVAVSGSGNVAQYAIEKAMAFGARVVTASDSSGTVVDESGFTEEKTGAPVRNQSQP >LR134204.1|VEB94648.1|4574703_4575264_-|glutamate-dehydrogenase MRAASPKKKLARLCEIKASRDGRVADYAREFGLPYMEGKQPWSVPVDIALPCATQNELDVDAARVLIANGVKAVAEGANMPTTIEATDLFLEAGVLFAPGKAANAGGVATSGLEMAQKRRANELESRESGCASAPHHAGYSSCLRGIRRRKQTYQLCSRGEHRRLRQSGRRDAGAGRDLIFRGMQA >LR134204.1|VEB94647.1|4574369_4574624_+|DNA-topoisomerase-III MAARPDMTAHWESVLTQISEKQCRYQDFMQPLVGTLFQLIDQAKRTPVRQFRGIVAPGGGGEKKKSAPRKRAAKKSPPSEEASV >LR134204.1|VEB94646.1|4574026_4574428_+|DNA-topoisomerase-III MGNKERDEENDGTPLPVVAKDDELLCEKGEVVERQTQPPRHFTDATLLSAMTGIARFVQDKDLKKILRATDGLGTEATRAGIIELLFRRGFLTKKGRYIHSNGCRESPDSLVTGDGCPTGYDGALGVCPDANQ >LR134204.1|VEB94645.1|4572915_4573653_+|DNA-topoisomerase-III MTKQLNVIKRFLHDASEVIHAGDPDREGQLLVDEVLDYLQLSPEKRQQVQRCLINDLNPQAVERAISRLRANSEFVPLCVSALARARADWLYGINMTRAYTLLGRNAGYQGVLSVGRVQTPVLGLVVRRDEEIENFVAKDFFEVKAHIVTPAEERFTAIWQPSEACEPYQDEEGRLLHRPLAEHVVNRINGQPAIVTSYNDKRESESAPLPFSLSALQIEAAKRFWSERAERGLISARSCTKPTN >LR134204.1|VEB94644.1|4572676_4572919_+|DNA-topoisomerase-III MRLFIAEKPSLGRAIADVLPKPHRKGDGFIECGNGQVVTWCIGHLLEQAQPDAYDVVMRAGIWPICRLCRKSGSSSHVLP >LR134204.1|VEB94655.1|4578331_4579564_-|thiosulfate-sulfurtransferase MAQSLTFSQLQQKQGAAIDTRQSAFYNGWPQTLNGPSGHEPSALNLSASWLDKMSDEQLAAWLQRHQLRKDAPLALYGNDSGVQAVKSRLQKAGFSQIATLSDALTDPSRLQKLPHFEQLAYPQWLHALQQNQPVTAKPAGDWKVIEAGWGAPKYYLLSHIPGAGYIDTNEVESEPLWNKVSDAQLKAMLAKHGIRHDTTVILYGRDVYAAARVAQIMLYAGVKDVRLLDGGWQTWSDAGLPVERGMPADVKQEPDFGVAIPAQPQLMLDMEQARALLHRQDASLVSIRSWPEFIGTTSGYSYIKPKGEIAGARWGHAGSDSTHMEDFHNPDGTMRSAEDIAAMWKQWDILPDQHVAFYCGTGWRASETFMYARAMGWKNVAVYDGGWYEWSSDPKNPVSTGERGPDSSK >LR134204.1|VEB94656.1|4579884_4580361_-|ABC-transporter-ATP-binding-protein MLIVQHLTLRLGNHRLLNQVAFQVKKGDIVTLMGPSGSGKSSLFSWMVGALPSQFQASGELWLNERRIDTLPTAQRQIGILFQDALLFDHFSVGQNLLLALPAAIKGAARQTEVRHALERAGLDGFSRRDPASLSGGQRARVALLRALLGATAGITAG >LR134204.1|VEB94657.1|4580360_4581041_-|Inner-membrane-ABC-transporter-permease-protein-ynjC MALTLPLSGILCVALLACMAEPASVNREALVNSLQMGLASAVLGLMTLFLWLEWGPQTGHRWVWLPILLPALPLVAGQYTLALLTGQDGQYATVIWDTCCGLSRGCYSSSKPAWQRIDPRLVLIAQTLGWTRVRIFWRIKCPLLLRPALFAFAVGFSVSIAQYMPTLWLGAGRYPTLTTEAVALSSGGSTTILASQALWQLLLPLLVFALTALCSRVIGHYRQGLR >LR134204.1|VEB94658.1|4580979_4581897_-|Inner-membrane-ABC-transporter-permease-protein-ynjC MATPLRYAVTLLVWGVMAVIWLPLVPAAFTLITPALSATHWLALFSDPQLPQALRATLVSVTLAATGALAIALTIVVALWPGAKWAHLCARLPWLLAIPHVAFAASALLLFAEGGMLYRLFPSLTPQMDRYGIGLGLTLAVKESAFLLWVLSALLSEKQLSQQVIVLDTLGYSRLQCLSWLLLPAVAPGLGAVMLAIVAWSLSAVDVAIILGPGNPPTLAVLSWQWLSQGDADQQAKGALASLLLVALLMLFALFGYLIWRGWQRTPSCHQRFSPPALIRQKRKNVGANAAVKRHTMRGAAGLYG >LR134204.1|VEB94659.1|4581869_4583033_-|putative-ABC-transporter-solute-binding-protein MRRTLLLAGLLLTGHVQAVENWQAVKDEAKGQTVWFNAWGGDNAVNQYLGWVSGEMKTHYAINLRIVRLADAADAVKRIQTESAAGRKTDGSVDLLWVNGENFRTLKEAGLLQTHWAETLPNWRYVDTRKPVREDFSMPTQGAESPWGGAQLTFIANRDITPQPPQSPQALLAFAKTHPGTVTYPRPPDFTGTAFLEQLLLALTPQPDALKVAPDDATFDSVTAPLWAYLDALHPSLWRKGKDFPPSPARMDALLQAGSLRLSLTFNPAHAQQKIASGELPKTSYSFGFSQGMIGNVHFVTIPANARASAGAKVVANFLLSPDAQLRKADPAFWGDPSVLDPHKLPAGQREALRSRIPDGLPPTLAEPHAAWVNALEQAWLRRYGTQ >LR134204.1|VEB94660.1|4583047_4583722_-|TVP38/TMEM64-family-inner-membrane-protein-ydjZ MNAKKRLLSGFLIICIAGVVWALPPGFLSLDTLKRYHTMLAAWQQQSPFLSAGLYFLVYTLVAALSIPGAALLTLLGGALFGLWQGTLLVSFASTLGATLAMLASRYLLREWISRRFARQMQTVNHGMARDGAFYLFALRVMPLFPFVLVNLLAGLTSIRVRQYWWISQAGMFPATVIYLNAGRQLSQLTSIRDIISPGMLAAFALLGLLPLASRWLVKRFLRS >LR134204.1|VEB94661.1|4583731_4584538_-|exonuclease-III MKFVSFNINGLRARPHQLSAIVDKHQPDVIGLQETKVHDDMFPLEEVAKLGYNVFYHGQKGHYGVALLTKETPVSVRRGFPDDGEEAQRRIIMAEIPSPLGSITVINGYFPQGESRDHPLKFPAKAQFYQNLQNYLDNELKRDNPVLIMGDMNISPTDLDIGIGEENRKRWLRTGKCSFLPEEREWMDRLISWGLVDTFRHAHPETVDKFSWFDYRSKGFDDNRGLRIDLLLASNPLAEHCVETGIDYEIRSMEKPSDHAPVWAKFRV >LR134204.1|VEB94662.1|4584983_4586204_+|bifunctional-succinylornithine-transaminase/acetylornithine-transaminase MSLSITRENFDEWMMPVYAPAPFIPVRGEGSRLWDQQGKEYIDFAGGIAVNALGHAHPALREALNEQASKFWHTGNGYTNEPVLRLAKMLIDATFAERVFFCNSGAEANEAALKLARKYAHDHFGTHKSGIVAFKNAFHGRTLFTVSAGGQPAYSQDFAPLPPDIRHAVYNDLNAASELIDDSTCAVIVEPMQGEGGVLPATKAFLQGLRELCDRHNALLIFDEVQTGVGRTGELYAYMHYGVTPDLLTTAKALGGGFPIGALLATEKCASVMTVGTHGTTYGGNPLASAVAGKLLEIVNTPEMLNGVKQRHDGFVERLNAINERFGLFSEIRGLGLLIGCVLEAEFAGKAKLISQEAAKAGVMVLIAGANVVRFAPALNVSKEEVATGLDRFALACERIKAGGSS >LR134204.1|VEB94663.1|4586200_4586431_+|arginine-succinyltransferase MMVIRPVERRDVSALMQLASKTGGGLTSLPADEATLTSRIERALKTWRGELPKKRTGLRVRAGGQRYGKRCGNLCD >LR134204.1|VEB94664.1|4586375_4587236_+|arginine-succinyltransferase MFVLEDSDTGSVAGICAIEVAVGLNDPWYNYRVGTLVHASKELNVYNALPTLFLSNDHTGSSELCTLFLDPDWRKEGNGYLLSKSRFMFMAAFRDKFNEKVVAEMRGVIDEHGYSPFWESLGERFFSMEFSRADYLCGTGQKAFIAELMPKHPIYTHFLSEEAQAVIGEVHPQTAPARTVLEKEGFRYRNYIDIFDGGPTLECDIDRVRAIRKSRLLDVVEGQPAPGEFPACLVANENYHHFRAMLIRTDPDTQRLVLTAAQLDALKCHAGDRVRLVRLCAEEKTA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_14 | 4592581-4592695 | Orphan |
NA
Consensus repeat of LR134204_14
|
1 spacers
spacers of LR134204_14
>14.1|4592622|33|LR134204|CRISPRCasFinder ACAGCGATGTAGGCCGGATAAGGCGGCACGTTC |
CRISPR arrays and Neighbor proteins around LR134204_14
The CRISPR arrays of LR134204_14 >merge|LR134204|14|4592581-4592695|CRISPRCasFinder GCGCCCGATGGCGCTTCGCTTATCGGGCCTACGGGCGTTATACAGCGATGTAGGCCGGATAAGGCGGCACGTTCGCGCCCGATGGCGCTTCGCTTATCGGGCCTACGGGAGTTAT >LR134204|14|14|4592581-4592695|CRISPRCasFinder GCGCCCGATGGCGCTTCGCTTATCGGGCCTACGGGCGTTAT ACAGCGATGTAGGCCGGATAAGGCGGCACGTTC GCGCCCGATGGCGCTTCGCTTATCGGGCCTACGGGAGTTAT
>LR134204.1|VEB94673.1|4591991_4592567_+|Various-environmental-stresses-induced-protein MEYFDIRKMPVNLWRNGAGETREICCFPPATRDFHWRASIASIAGNGEFSLFPGVERVITLLEGGEVMLEGVNTFTHTLKRHQPFTFAGDSVVKATLSEGQMSMDLNIMTRRDRCKAKVRVADRTFTTFGSRGGVVFVISGAWQLGDKLLTADQGACWHDGKHTLRLLKPEGQLLFSEITWLPGYTPDTVQ >LR134204.1|VEB94672.1|4591534_4591819_+|periplasmic-protein MKGQRDQMKRPPLEERRAMHDIIASDSFDKAKAEAQIAKMEEQRKANMLAHMETQNKIYNILTPEQKKQFNANFEKRLTERPAPEGKMPTPPAE >LR134204.1|VEB94671.1|4591333_4591477_+|Uncharacterised-protein MRKLTALFVASTLALALRTWLTRLIPLPLLRQMRSDDAPQRKIRSAS >LR134204.1|VEB94670.1|4590764_4590917_+|succinylglutamate-desuccinylase MEGGTRPIRYRVVAQITRRSDAFVLCMDNQTLNFTPFKEERYWRRMAMSR >LR134204.1|VEB94669.1|4590040_4590289_+|succinylglutamate-desuccinylase MDNFLAQTLAGISPDITQGKRRAFTGAGRCLVCLNLRLPFEVDRALVLSAGIHGNETAPVEMLDALLSALFEGKIPCAGDCW >LR134204.1|VEB94668.1|4589841_4590048_+|succinylarginine-dihydrolase MNPAVMMNDTLFSTLNDWVDRYYRDRLTAADLADPQLLHEGREALDTLTQLLNLGSVYPFQQEGAGNG >LR134204.1|VEB94667.1|4588706_4589831_+|succinylarginine-dihydrolase MKAHEVNFDGLVGLTHHYAGLSFGNEASIQHRFQVSNPRLAAKQGLLKMKALADAGFPQAVIPPHERPFIPVLRQLGFSGRTSRCWRKWRARAPHWLSSVSSASPMWVANAATVCPSADALDGKVHLTVANLNNKFHRALEAPTTASLLRAIFRDAQFFAVHDALPQVALLGDEGAANHNRLGGDYGAPGIQLFVYGREEGVDTRPVRYPARQTREASEAVARLNQVNPHQVIFARQNPDVIDLGVFHNDVIAVSNRQVLFCHEQAFAKQGELMRQLRSRVAGFMPLEVPAREVSVQDAVATYLFNSQLLSRDDGSMALVLPQECREHAGVWRYLNVLLAADNPSAICACSTCAKVWRTARPGLFTLARGIDRG >LR134204.1|VEB94666.1|4588413_4588710_+|N-succinylglutamate-5-semialdehyde-dehydrogenase MANNTRFGLSCGLVSPDRKQFDQLLLEARAGIVNWNKPLTGAASTAPFGGVGASGNHRPSAWYAADYCAWPMASLESPTLVLPDSLSPGLDFSREEPV >LR134204.1|VEB94665.1|4587232_4588375_+|N-succinylglutamate-5-semialdehyde-dehydrogenase MTLWINGDWVTGQGERRVKTNPVNGEVLWQGSDADATQVAEAGRAARAAFPAWARLPFAARQAIVEKFSALLEVHKAELTAIIARETGKPRWEAVTEVTAMINKIAISVKAYHARTGEQQSELPDGAATLRHRPHGVLAVFGPYNFPGHLPNGHIVPALLAGNTLIFKPSELTPWTGEAVMKLWQQAGVPPGVLNLVQGGRETGQALSALDALDGLLFTGSANTGYQLHRQLSGQPEKILALEMGGNNPLIIENPEDIDAAVHLTIQSAFVTAGQRCTCARRLLVKRGEQGDAFLARLLEVSQRLTPGEWDDEPQPFIGGLISEPAARHVYEAWQRLEAMGGRTLLAPRMLKAGTSLLTPGIIEMTGVKQVPDDEVFGRY >LR134204.1|VEB94664.1|4586375_4587236_+|arginine-succinyltransferase MFVLEDSDTGSVAGICAIEVAVGLNDPWYNYRVGTLVHASKELNVYNALPTLFLSNDHTGSSELCTLFLDPDWRKEGNGYLLSKSRFMFMAAFRDKFNEKVVAEMRGVIDEHGYSPFWESLGERFFSMEFSRADYLCGTGQKAFIAELMPKHPIYTHFLSEEAQAVIGEVHPQTAPARTVLEKEGFRYRNYIDIFDGGPTLECDIDRVRAIRKSRLLDVVEGQPAPGEFPACLVANENYHHFRAMLIRTDPDTQRLVLTAAQLDALKCHAGDRVRLVRLCAEEKTA >LR134204.1|VEB94674.1|4592739_4592976_-|NAD-synthetase MGCPEHLYKKAPTADLEDDRPSLPDEAALGVTYDNIDDYLEGKTLDASIAKIIEGWYIRTEHKRRTPITVFDDFWKKS >LR134204.1|VEB94675.1|4593008_4593569_-|NAD-synthetase MTLQQEIIRALGAKPHINAEEEIRRSVDFLKAYLKTYPFLKSLVLGISGGQDSTLTGKLCQTAITELREETGNDALQFIAVRLPFGVQADEQDCQDAIAFIQPDRVLTVNIKGAVLASEQALREAGIELSDFVRGNEKARERMKAQYSIAGMTNGVVVGTDHAAEAVTASSLNMAMAVPILTRFSV >LR134204.1|VEB94676.1|4593796_4594138_+|DNA-binding-transcriptional-activator-OsmE MNKNIAGILSAAAVMTMLAGCTAYDRTKDQFVQPVVKDVKKGMSRSQVEQIAGKPSSEVSMIHARGTCQTYILGQRDGKAETYFVALDDTGHVINSGYQSCAEYDTDPQAPKQ >LR134204.1|VEB94677.1|4594438_4594759_+|PTS-system-N,N'-diacetylchitobiose-specific-transporter-subunit-IIB MEKKHIYLFCSAGMSTSLLVSKMRAQAEKYEVPVIIEAFPETLAGEKGLSADVVLLGPQIAYMLPEIQRLLPNKPVEVIDSLLYGKVDGLGVLKAAVAAIKKAAAN >LR134204.1|VEB94678.1|4594845_4595037_+|PTS-system-N,N'-diacetylchitobiose-specific-transporter-subunit-IIC MSNVIASLEKVLLPFAVKIGKQPHVNAIKNGFIRLMPLTLAGAMFVLINNVFLSFGEGSFFIL >LR134204.1|VEB94679.1|4595069_4596203_+|PTS-system-N,N'-diacetylchitobiose-specific-transporter-subunit-IIC MNGLKGIGGNVYNGTLGIMSLMAPFFIGMALAEERKVDALAAGLLSVAAFMTVTPYSVGEAYAVGANWLGGANIISGIIIGLVVAEMFTFIVRRNWVIRLPDSVPASVSRSFSALIPGFIILSIMGIIAWALANYGSNFHQIIMDTISTPLASLGSVVGWAYVLFVPLLWFFGIHGSLALTALDSGIMTPWALENIAIYQQFGSVDAALEAGKTFHVWAKPMLDSYIFLGGSGATLGLIIAIFLASRRADYRQVAKLALPSGIFQINEPILFGLPIIMNPVMFIPFILVQPILAAITLIAYYLGIIPPITNIAPWTMPTGLGAFFNTNGSVAALLVALFNLGVATLIYLPFVVVANKAQNAIEQEESEEEIANALKF >LR134204.1|VEB94680.1|4596253_4596604_+|PTS-system-N,N'-diacetylchitobiose-specific-transporter-subunit-IIA MLDLENVAGTPSEAEELEEVVMGLIINSGQARSLAYAALKQAKQGDFAAAKTMMEQSRMALNEAHLVQTKLIEGDQGEGKMKVSLVLVHAQDHLMTSMLARELVTELIELHEKLEQ >LR134204.1|VEB94681.1|4596611_4596917_+|DNA-binding-transcriptional-regulator-ChbR MMQLQVNAPEIATAREQQLFNGKNFHVFIYNKTESISGLHQHDYYEFTLVLTGRYYQEINGKRVLLERGDFVFIPLGSHHQSFYEFGATRILNVGISKRFF >LR134204.1|VEB94682.1|4596861_4597455_+|DNA-binding-transcriptional-regulator-ChbR MSSAPRVFLTLVSVSVFFEQHYHPLLPFCFVASQVYRVNSAFLTYIETVIASLNFRGNGLDEFVEVVTFYIINRLRHHREEQVIDDIPQWLKATVETMHDKTQFGENALENMVHLSAKSQEYLTRATQRYYSKTPMQIINEIRINFAKKQLEMTNYSVTDIAYEAGYSSPSLFIKTFKKLTSFTPNSYRKKLTEFNQ >LR134204.1|VEB94683.1|4597578_4598934_+|6-phospho-beta-glucosidase MSQKLKVVTIGGGSSYTPELLEGFIKRYHELPVSELWLVDVEDGKEKLDIIFDLCQRMIDKAGVPLKLYKTLDRREALQGADFVTTQLRVGQLKAREQDERIPLSHGYLGQETNGAGGLFKGLRTIPVIFDIVKDVEELCPNAWVINFTNPAGMVTEAVYRHTHFKKFIGVCNIPVGMKMFIHDVLALQDSDDLSIDLFGLNHMVFIKDVLVNGESRFAELLDGVASGQLKASTVKNIFDLPFSEGLIRSLNMLPCSYLLYYFKQKEMLAIEMGEYYKGGARAQVVQKVEKQLFDLYKNPELNIKPKELEQRGGAYYSDAACEVINAIYNDKQTEHYVNIPHHGHVDNIPADWAVEMTCTLGRNGATPHPRITHFDEKVLGLIYTIKGFEVAASKAALSGEFNDVLLALNLSPLVHSDRDAETLARELILAHEKWLPNFAECIEKLKGAQH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134204_15 | 4665330-4665453 | Orphan |
NA
Consensus repeat of LR134204_15
|
1 spacers
spacers of LR134204_15
>15.1|4665373|38|LR134204|CRISPRCasFinder CAGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around LR134204_15
The CRISPR arrays of LR134204_15 >merge|LR134204|15|4665330-4665453|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACAGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA >LR134204|15|15|4665330-4665453|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CAGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>LR134204.1|VEB94759.1|4662704_4665005_-|outer-membrane-receptor-FepA MIKKAYKLSLLSSLIAGSLAIPAVQADDIKTAANQKAEQTNASNPPEKQAARDKEINSGETMVVTAKEQTLQAPGVSIIDSEAIKKRPIQRDVAEIIRTMPGVNLTGNSNSGQRGNNRQIDIRGMGPENTLILIDGMPVSSRNSVRYGWSDERDTRGDTNWVPPEMIERIEVIRGPAAAMYGSGAMGGVVNIITKPATDEWHGSLNTYYNFPQHKSEGSTKRYNGSLSGALADNLTLRVYGNWNKTQADAQDINENHTTPRIGSYAGSYPAGREGFINKDINSALRWEFMPMQALELGAGYSRHGNLYAGDTQNTNSNALVKKYYGKETNVLYRQTLSLKYTGAWDSGVTTNNYVQFEKTRNTRLNEGLSGGISGIFSPSNEGFSTISLYDTNLHSEVNIPLDLWVSQTLTFGAEFNHQAMKDPSSNTESTTGGGSVPGIADTGRSKYSSADIYGIFAEDNLELTDSTRLTPAIRFNHQSISGSNWSPSLNLSQELGGDFTLKLGIARAWKAPNLYQTNPNYLLFSKGQGCYGGGACYLQGNRDLKAETSVNKEVGIEYNHDEVQAGLTWYRNDYHDKIAAGDSVAGKATSANIYQWENVPKAVVQGLEGTFNFPLADTLKMKNNFTYIIENKNKKSGDYLSVIPKYTINSTLEWQAANDLSVQGTLTWYGRQKPKKYTYQGNPTSGSETRQVSPYALVGMSATYAVTKYVDVTAGIDNLFDKRHFREGNAQTTGNATTGAYLWGAGANTYNEPGRTYYMQVGLHF >LR134204.1|VEB94758.1|4661749_4662472_+|Ferri-bacillibactin-esterase-BesA MHYFAVWARKGTPGRIWEPLGPNIADRGSAFYHFRVENFDSADGKRHYKVWTGIPDKTPPASGYPVLYMLDGNAVMDRLTEDLLKQLAEKTPPVIVAIGYQTHLPFDLNGRAYDYTPALEVKSGAEGRYRRPGGGSGDFRRLLETRIAPQTEQGINIDPERRGVWGHSYGGLFVLDSWLSSSFFRFYYSASPSLGRDNSSLLTRLTTMDAAKNCHKRLFFMEGSASPGEKRADSGFRYPE >LR134204.1|VEB94757.1|4660544_4661447_-|endonuclease/exonuclease/phosphatase MKNIFTLTFLALSCLSTHAIASEKLAGNEILAVQKGGVPDKIYENNKPHLRIATYNIGKNEASENVADFTSLNAAIKKIAADIIAVPEVDNKTARSQKIDQLKTIADANNYHYAFGKALDFDGGEYGLGILSKYKIQHTQVINLPSGDAEQRVALLAQIEVPGFDTPVLVMVTHLDWQKDPTMRIEQVRHLLDVSIGDASSDFKDIASSIKILAGDFNSTRDEQPLKEIGYFFNPVEKQGTDYRTWPAVNPAIDIDHIFTFKGQKWDVKKIEIPHNSPAFTWSSASDHLPFIADMELTEQ >LR134204.1|VEB94756.1|4660268_4660409_-|Uncharacterised-protein MDIDLDNLVFDGLEEAQERNAERVEDADKKAQAVIADDDCGDACKI >LR134204.1|VEB94755.1|4659992_4660172_+|Uncharacterised-protein MNGFKRLWHTVVKARPQTRQYDAEPHLQRIVESVLPVASLYGVDIANIDPEWFRDKTER >LR134204.1|VEB94754.1|4658926_4659688_-|ABC-transporter-substrate-binding-protein MMATTIHSTKKTLLALSALFASGIAALPAQADQLADIKAAGVVKVATFDANPPFGSVDPQTHKIVGYDVDFAEALAKSLGVKLELVATNPANRIPLLQSGKADLIVADITITPERAQVIDFSVPYFVTGQQFLVPATSPDKLDEYSRARIGAVKGTTGEQALHQRFPQSRVLAYDDIPLALTALRNGNVQAITQDSTILAGLLSGAPDKANFKILPDLLSKEEIGVGVKKGEPALLKAVNDETAEARGHRTGG >LR134204.1|VEB94753.1|4658865_4658967_-|ABC-transporter-substrate-binding-protein MKLLKLEATGQAAKIYDVWFGPETKIHSRAPLK >LR134204.1|VEB94752.1|4658454_4658853_-|ATP-binding-protein-of-ABC-transporter MLSGLFTRSAASAADFTHLRQARVELRDVIKQYDGHRVLNGVNLTVEPGEVVTILGPSGSGKSTLIRLINQLESLSGGDIFIDDKPIGQLKGAALRQLRSRIGFVFQQFNLYAHLTAQAEHYAGAGVCAWLE >LR134204.1|VEB94751.1|4658031_4658217_-|ATP-binding-protein-of-ABC-transporter MIVVTHEMHFAREIADRVVFIDGGDILEVASPEAFFTRPQHPRTQRFLKKVLDPLHLESSL >LR134204.1|VEB94750.1|4657768_4658032_-|Uncharacterised-protein MLTLDWQGVLSGQPLQWVISGFLTTLWVTLAGVLLSTLLAIILLALRLSGNAFGSPSGRCLGLAVSQYASACAVDVLVLRSVEFSAP >LR134204.1|VEB94760.1|4665611_4666505_-|multidrug-efflux-protein MVMGFLGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFIAMASYVKRARSMRDIRNERGFSKPDTAVVKRLVQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLGVGVCMAVITAIFTVTLREPIALLYNDNPEVVTLAAQLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFFITFTAYWVLGLPSGYILALTDLVVDRMGPAGFWMGFIIGLTSAAIMMMLRMRYLQRQSSAIILQRAAR >LR134204.1|VEB94761.1|4666462_4666984_-|multidrug-efflux-protein MQKYISEARQLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGCVSVLIMFVLWNAGYIIRSMHNIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKPNPAWSWASLACWSTFR >LR134204.1|VEB94762.1|4667201_4667843_+|riboflavin-synthase-subunit-alpha MFTGIVQGTAKLVSIDEKPNFRTHVVELPEHMLDALETGASVAHNGCCLTVTEINGNQISFDLMKETLRITNLGDLSVGDLVNVERAAKFSDEIGGHLMSGHIMTTAEISKILTSENNRQVWFKVQDPQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARESAANLHAGDA >LR134204.1|VEB94763.1|4667882_4669031_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDDWYRITNELLGRAGIAINGTAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIAGARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPYMQYSCAYWKDADTLESAQQAKLKLICEKLQLQPGMRVLDIGCGWGGLSQFMASNYDVSVVGVTISAEQQKMAQERCAGLDVSIRLQDYRDLNDQFDRIVSVGMFEHVGPKNYKTYFEVVDRNLKPDGIFLLHTIGSKKTDNNVDPWINKYIFPNGCLPSVRQIANASESHFVMEDWHNFGADYDTTLMAWYSRFINGWPEIADNYTERFKRMFSYYLNACAGAFRARDIQLWQVVFSRGVEHGLRIPR >LR134204.1|VEB94764.1|4669321_4670527_-|inner-membrane-transport-protein-YdhC MQPGKGFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPAAAVSASLSLFLAGFAVAQLLWGPLSDRYGRKPILLLGLSIFALGSLGMMWVESATGLLVLRFIQAVGVCAATVIWQALVTDYYPTQKINRIFATIMPLVGLSPALAPLLGSWILTHFSWQAIFATLFIITLILMLPALRLKPTTKAHDHSQEKLTFASLLRSQTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLVGGYGCRAALQKWQGQQLLPWLLVLYALSVVATWGASFISHVSLTEILIPFCVMAIANGAIYPIVVAQALRPFPQATGRAAALQNTLQLGLCFLASLVVSWLISTPLLTTTSVMLSTVALAALGYWLQLQAQEPAARAANTEVAHSESH >LR134204.1|VEB94765.1|4670639_4671572_+|putative-DNA-binding-transcriptional-regulator MWSEYSLEVVDAVARNGSFSSAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTPAGAWFLKEGRSVIKKMQITRQQCQQIANGWRGQLAIAVDNIVKPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATQAIPVGGRYTFRDMGMLSWSCVVARHHPLASMAGPLSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRVVVPDWESSATCLSAGLCVGMVPTHFAKPWIDSGKWVALQLENPFPDAACCLTWQQNDTSPALAWLLDYLGDSETLNKEWLREPEEAPAEGD >LR134204.1|VEB94766.1|4671568_4672594_-|DNA-binding-transcriptional-repressor-PurR MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPESLLSMLEEYRHIPMVVMDWGEAKADFTDTVIDNAFAGGYMAGRYLVERGHREIGVIPGPMERNTGAGRLAGFMKAMEEALINVPENWIVQGDFEPESGYRAMQQMISQSHRPTAVFCGGDIMAMGALCAADEMGLRVPQDISVIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREESQSIEVHPRLIERRSVADGPFRDYRR >LR134204.1|VEB94767.1|4673155_4674322_+|major-facilitator-superfamily-protein MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRNALIFLMAIFTLGNVLSAISPDYTTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTIANIGGVPAATWLGETIGWRMSFMATAGLGVISMISLFFSLPKGGAGERPEVRKELAVLLRPQVLSALLTTVLGAGAMFTLYTYIAPVLHSITHATPAFVTGMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEVGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISGGLGYSFVPVMGAIVAGLALLLVFVSARRQPEDVCVAN >LR134204.1|VEB94768.1|4674368_4674950_-|superoxide-dismutase MSFELPALPYAKDALAPHISAETLEYHYGKHHQTYVTNLNNLIKGTPFEGKSLEEIVRSAEGGVFNNAAQVWNHTFYWNCLAPDAGGEPTGKLADAIVASFGSVADFKAQFTDAAIKNFGSGWTWLVKGTDGKLAIVSTSNAGTPLTTNATPLMTVDVWEHAYYIDYRNARPGYLEHFWALVNWEFVAKNFAA >LR134204.1|VEB94769.1|4675075_4675702_-|lipoprotein MSKTQSKTASTSKKRIASPAKTSKTASRRSKPATTQTAAVTWTEKCTPRKGRKPHCVKVKGAPLAIADAHKAKMQKATNTAMNKLMNQIGKPYRWGGTSPRTGFDCSGLVYYAYKDLVKFRIPRTANEMYHLRDAAPIERGELKNGDLVFFRTQGRGTADHVGVYVGNGKFIQSPRSGQEIQITSLSEDYWQRHYVGARRVMTPKTIR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
LR134204_4 | 4.1|1822244|45|LR134204|CRISPRCasFinder | 1822244-1822288 | 45 | LR134204.1 | 1409844-1409888 | 2 | 0.956 |
LR134204_10 | 10.1|3719023|32|LR134204|CRISPRCasFinder | 3719023-3719054 | 32 | LR134204.1 | 2056023-2056054 | 2 | 0.938 |
LR134204_10 | 10.1|3719023|32|LR134204|CRISPRCasFinder | 3719023-3719054 | 32 | LR134204.1 | 2056104-2056135 | 2 | 0.938 |
1. spacer 4.1|1822244|45|LR134204|CRISPRCasFinder matches to position: 1409844-1409888, mismatch: 2, identity: 0.956
accgggcattgcgccggatggcggcttcgccttatccggcctacg CRISPR spacer accgggcattgtgccggatggcggctacgccttatccggcctacg Protospacer ***********.************** ******************
2. spacer 10.1|3719023|32|LR134204|CRISPRCasFinder matches to position: 2056023-2056054, mismatch: 2, identity: 0.938
tttacgccgccatccggcgtaatgcccggtgg CRISPR spacer tttacgccgccatccggcgttctgcccggtgg Protospacer ******************** **********
3. spacer 10.1|3719023|32|LR134204|CRISPRCasFinder matches to position: 2056104-2056135, mismatch: 2, identity: 0.938
tttacgccgccatccggcgtaatgcccggtgg CRISPR spacer tttacgccgccatccggcgttttgcccggtgg Protospacer ******************** **********
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
LR134204_15 | 15.1|4665373|38|LR134204|CRISPRCasFinder | 4665373-4665410 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 1 | 0.974 |
LR134204_9 | 9.1|3401928|27|LR134204|CRISPRCasFinder | 3401928-3401954 | 27 | NC_012586 | Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence | 1014718-1014744 | 3 | 0.889 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 396224-396261 | 3 | 0.921 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 229404-229436 | 4 | 0.879 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 239898-239930 | 4 | 0.879 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 230810-230842 | 4 | 0.879 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 211780-211812 | 4 | 0.879 |
LR134204_3 | 3.1|1030600|26|LR134204|CRISPRCasFinder | 1030600-1030625 | 26 | LR134132 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 12 | 88504-88529 | 5 | 0.808 |
LR134204_5 | 5.1|2142098|27|LR134204|CRISPRCasFinder | 2142098-2142124 | 27 | KU160673 | Arthrobacter phage Wilde, complete genome | 40224-40250 | 5 | 0.815 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_AP023208 | Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence | 8886-8918 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 40399-40431 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP019447 | Kosakonia cowanii JCM 10956 = DSM 18146 strain 888-76 plasmid p888-76-2, complete sequence | 9850-9882 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 229101-229133 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 229202-229234 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 229303-229335 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 194300-194332 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 239595-239627 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 239696-239728 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 239797-239829 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 204794-204826 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 230507-230539 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 230608-230640 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 230709-230741 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 195628-195660 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 211477-211509 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 211578-211610 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 211679-211711 | 5 | 0.848 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 176583-176615 | 5 | 0.848 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 42135-42172 | 5 | 0.868 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP019203 | Salmonella enterica subsp. enterica serovar Infantis strain CFSAN003307 plasmid pCFSAN003307, complete sequence | 16046-16083 | 5 | 0.868 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_LN868945 | Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 3, complete sequence | 12928-12965 | 5 | 0.868 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | CP022016 | Salmonella enterica subsp. enterica serovar India str. SA20085604 plasmid unnamed1, complete sequence | 156616-156653 | 5 | 0.868 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | LR134125 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 5 | 355130-355167 | 5 | 0.868 |
LR134204_4 | 4.1|1822244|45|LR134204|CRISPRCasFinder | 1822244-1822288 | 45 | LR134122 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 2 | 272303-272347 | 6 | 0.867 |
LR134204_5 | 5.1|2142098|27|LR134204|CRISPRCasFinder | 2142098-2142124 | 27 | NZ_CP053575 | Citrobacter sp. TSA-1 plasmid unnamed2, complete sequence | 78070-78096 | 6 | 0.778 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP019447 | Kosakonia cowanii JCM 10956 = DSM 18146 strain 888-76 plasmid p888-76-2, complete sequence | 9766-9798 | 6 | 0.818 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | LR134132 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 12 | 103323-103355 | 6 | 0.818 |
LR134204_4 | 4.1|1822244|45|LR134204|CRISPRCasFinder | 1822244-1822288 | 45 | NZ_CP019203 | Salmonella enterica subsp. enterica serovar Infantis strain CFSAN003307 plasmid pCFSAN003307, complete sequence | 5142-5186 | 7 | 0.844 |
LR134204_4 | 4.1|1822244|45|LR134204|CRISPRCasFinder | 1822244-1822288 | 45 | NZ_LN868945 | Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 3, complete sequence | 2024-2068 | 7 | 0.844 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 194021-194053 | 7 | 0.788 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 194114-194146 | 7 | 0.788 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 204515-204547 | 7 | 0.788 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 204608-204640 | 7 | 0.788 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 195349-195381 | 7 | 0.788 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 195442-195474 | 7 | 0.788 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 176304-176336 | 7 | 0.788 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 176397-176429 | 7 | 0.788 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP044147 | Escherichia coli O157 strain AR-0428 plasmid pAR-0428-2 | 7440-7472 | 7 | 0.788 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | CP044351 | Escherichia coli strain 194195 plasmid p194195_1, complete sequence | 84264-84296 | 7 | 0.788 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_AP023209 | Escherichia coli strain TUM18781 plasmid pMTY18781-4, complete sequence | 19387-19419 | 7 | 0.788 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | CP022016 | Salmonella enterica subsp. enterica serovar India str. SA20085604 plasmid unnamed1, complete sequence | 156616-156658 | 7 | 0.837 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 42130-42172 | 7 | 0.837 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | NZ_CP019203 | Salmonella enterica subsp. enterica serovar Infantis strain CFSAN003307 plasmid pCFSAN003307, complete sequence | 16046-16088 | 7 | 0.837 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | NZ_LN868945 | Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 3, complete sequence | 12928-12970 | 7 | 0.837 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 263676-263713 | 7 | 0.816 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 263130-263167 | 7 | 0.816 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 422319-422356 | 7 | 0.816 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | LR134127 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 7 | 197043-197080 | 7 | 0.816 |
LR134204_4 | 4.1|1822244|45|LR134204|CRISPRCasFinder | 1822244-1822288 | 45 | LR134132 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 12 | 116999-117043 | 8 | 0.822 |
LR134204_4 | 4.1|1822244|45|LR134204|CRISPRCasFinder | 1822244-1822288 | 45 | NZ_CP019447 | Kosakonia cowanii JCM 10956 = DSM 18146 strain 888-76 plasmid p888-76-2, complete sequence | 32612-32656 | 8 | 0.822 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 386506-386538 | 8 | 0.758 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP019447 | Kosakonia cowanii JCM 10956 = DSM 18146 strain 888-76 plasmid p888-76-2, complete sequence | 9599-9631 | 8 | 0.758 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 27300-27332 | 8 | 0.758 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 27399-27431 | 8 | 0.758 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 289943-289980 | 8 | 0.789 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 73707-73744 | 8 | 0.789 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 197685-197722 | 8 | 0.789 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | LR134125 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 5 | 47050-47087 | 8 | 0.789 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | LR134125 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 5 | 734204-734241 | 8 | 0.789 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | LR134122 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 2 | 171110-171147 | 8 | 0.789 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP053575 | Citrobacter sp. TSA-1 plasmid unnamed2, complete sequence | 37066-37103 | 8 | 0.789 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_LN868946 | Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 4, complete sequence | 40894-40931 | 8 | 0.789 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 211748-211780 | 9 | 0.727 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 211948-211980 | 9 | 0.727 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 222242-222274 | 9 | 0.727 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 222442-222474 | 9 | 0.727 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 213076-213108 | 9 | 0.727 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 213276-213308 | 9 | 0.727 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 194031-194063 | 9 | 0.727 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 194231-194263 | 9 | 0.727 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 30231-30263 | 9 | 0.727 |
LR134204_10 | 10.1|3719023|32|LR134204|CRISPRCasFinder | 3719023-3719054 | 32 | NC_020276 | Mycobacterium intracellulare subsp. yongonense 05-1390 plasmid pMyong2, complete sequence | 15438-15469 | 9 | 0.719 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 263676-263718 | 9 | 0.791 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 396224-396266 | 9 | 0.791 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | LR134125 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 5 | 47050-47092 | 9 | 0.791 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | LR134125 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 5 | 355130-355172 | 9 | 0.791 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | LR134122 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 2 | 208409-208446 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP019910 | Escherichia coli strain MDR_56 plasmid unnamed4, complete sequence | 25418-25455 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | CP045556 | Citrobacter sp. S39 plasmid pS39-1, complete sequence | 107062-107099 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP045203 | Citrobacter sp. NMI7904_11 plasmid pCTEL-2, complete sequence | 229936-229973 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP022152 | Citrobacter freundii strain 705SK3 plasmid p705SK3_1, complete sequence | 64063-64100 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP019987 | Citrobacter werkmanii strain BF-6 plasmid unnamed, complete sequence | 124429-124466 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP030224 | Salmonella enterica strain SA20083039 plasmid pSA20083039.1, complete sequence | 86484-86521 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP026213 | Citrobacter sp. CFNIH10 plasmid pKPC-933d, complete sequence | 51137-51174 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_MG387191 | Citrobacter freundii strain 2262 plasmid pTEM-2262, complete sequence | 109979-110016 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP048383 | Citrobacter freundii strain 62 plasmid p6_A, complete sequence | 127034-127071 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP026282 | Klebsiella oxytoca strain KONIH2 plasmid pKOR-e3cb, complete sequence | 172735-172772 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP042535 | Citrobacter freundii strain E51 plasmid pE51_001, complete sequence | 112017-112054 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP007734 | Klebsiella pneumoniae subsp. pneumoniae KPNIH27 plasmid pKPN-262, complete sequence | 179832-179869 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP026058 | Citrobacter freundii strain FDAARGOS_73 plasmid unnamed2, complete sequence | 283606-283643 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP029436 | Klebsiella quasipneumoniae strain CAV2013 plasmid pKPC_CAV2013, complete sequence | 290568-290605 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP029431 | Klebsiella quasipneumoniae strain CAV2018 plasmid pKPC_CAV2018-435, complete sequence | 411223-411260 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP018017 | Kosakonia radicincitans DSM 16656 plasmid pKrDSM16656L, complete sequence | 130829-130866 | 9 | 0.763 |
LR134204_12 | 12.1|4513244|38|LR134204|CRISPRCasFinder | 4513244-4513281 | 38 | NZ_CP029442 | Klebsiella quasipneumoniae strain CAV1947 plasmid pKPC_CAV1947-412, complete sequence | 382422-382459 | 9 | 0.763 |
LR134204_4 | 4.1|1822244|45|LR134204|CRISPRCasFinder | 1822244-1822288 | 45 | NZ_LN868946 | Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 4, complete sequence | 56784-56828 | 10 | 0.778 |
LR134204_6 | 6.1|2364831|33|LR134204|CRISPRCasFinder | 2364831-2364863 | 33 | NZ_AP023207 | Escherichia coli strain TUM18781 plasmid pMTY18781-2, complete sequence | 27992-28024 | 10 | 0.697 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 197685-197727 | 10 | 0.767 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | NZ_CP019910 | Escherichia coli strain MDR_56 plasmid unnamed4, complete sequence | 25413-25455 | 10 | 0.767 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | NZ_LN868946 | Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 4, complete sequence | 40889-40931 | 10 | 0.767 |
LR134204_4 | 4.1|1822244|45|LR134204|CRISPRCasFinder | 1822244-1822288 | 45 | NZ_LN868946 | Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 4, complete sequence | 100270-100314 | 11 | 0.756 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | LR134122 | Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 2 | 171105-171147 | 11 | 0.744 |
LR134204_11 | 11.1|3894454|43|LR134204|CRISPRCasFinder | 3894454-3894496 | 43 | NZ_CP053575 | Citrobacter sp. TSA-1 plasmid unnamed2, complete sequence | 37061-37103 | 12 | 0.721 |
1. spacer 15.1|4665373|38|LR134204|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 1, identity: 0.974
cagacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *********.****************************
2. spacer 9.1|3401928|27|LR134204|CRISPRCasFinder matches to NC_012586 (Sinorhizobium fredii NGR234 plasmid pNGR234b, complete sequence) position: , mismatch: 3, identity: 0.889
tatcggcagggcgttgcatcgtt-tgta CRISPR spacer tatcgccagggcgttgcatcgctatgt- Protospacer ***** ***************.* ***
3. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 3, identity: 0.921
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer ttttgcccggtggcgctgcgcttaccgggcctacagaa Protospacer * *********************************.*
4. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 4, identity: 0.879
cgtaagcgccttatccggcctacgg-tgttgcgt CRISPR spacer cgtaaacgccttatccggcctacggatggcgcg- Protospacer *****.******************* ** .***
5. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 4, identity: 0.879
cgtaagcgccttatccggcctacgg-tgttgcgt CRISPR spacer cgtaaacgccttatccggcctacggatggcgcg- Protospacer *****.******************* ** .***
6. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 4, identity: 0.879
cgtaagcgccttatccggcctacgg-tgttgcgt CRISPR spacer cgtaaacgccttatccggcctacggatggcgcg- Protospacer *****.******************* ** .***
7. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 4, identity: 0.879
cgtaagcgccttatccggcctacgg-tgttgcgt CRISPR spacer cgtaaacgccttatccggcctacggatggcgcg- Protospacer *****.******************* ** .***
8. spacer 3.1|1030600|26|LR134204|CRISPRCasFinder matches to LR134132 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 12) position: , mismatch: 5, identity: 0.808
cataaccgccggatggcggcgtaaat CRISPR spacer tgaaagcgccggatggcggcgtaaac Protospacer .. ** *******************.
9. spacer 5.1|2142098|27|LR134204|CRISPRCasFinder matches to KU160673 (Arthrobacter phage Wilde, complete genome) position: , mismatch: 5, identity: 0.815
ttcgcttgcccggcctacgtccgtttc CRISPR spacer ttcgcttgcccgggctacgtcaggtcg Protospacer ************* ******* * *.
10. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_AP023208 (Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacggtgttgcgt CRISPR spacer cgtaaacgccttatccggcctacgatgtgcggt Protospacer *****.******************.*** **
11. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacggtgttgcgt CRISPR spacer cgtaaacgccttatccggcctacggtttggtgc Protospacer *****.******************** * *.*.
12. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP019447 (Kosakonia cowanii JCM 10956 = DSM 18146 strain 888-76 plasmid p888-76-2, complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacggtgttgcgt CRISPR spacer cgcaagcgccttatccggcctacgggcgtgtgt Protospacer **.********************** **.**
13. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacgg-atggcgtg Protospacer ***.*.******************* .* ****
14. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacgg-tgttgcgt CRISPR spacer cgtgaacgccttatccggcctacggatggcgcg- Protospacer ***.*.******************* ** .***
15. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacgg-gtggcgcg Protospacer ***.*.******************* ** ***.
16. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacg-gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgagtggcgcg- Protospacer ***.*.****************** *** .***
17. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacgg-atggcgtg Protospacer ***.*.******************* .* ****
18. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacgg-tgttgcgt CRISPR spacer cgtgaacgccttatccggcctacggatggcgcg- Protospacer ***.*.******************* ** .***
19. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacgg-gtggcgcg Protospacer ***.*.******************* ** ***.
20. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacg-gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgagtggcgcg- Protospacer ***.*.****************** *** .***
21. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacgg-atggcgtg Protospacer ***.*.******************* .* ****
22. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacgg-tgttgcgt CRISPR spacer cgtgaacgccttatccggcctacggatggcgcg- Protospacer ***.*.******************* ** .***
23. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacgg-gtggcgcg Protospacer ***.*.******************* ** ***.
24. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacg-gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgagtggcgcg- Protospacer ***.*.****************** *** .***
25. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacgg-atggcgtg Protospacer ***.*.******************* .* ****
26. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacgg-tgttgcgt CRISPR spacer cgtgaacgccttatccggcctacggatggcgcg- Protospacer ***.*.******************* ** .***
27. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacgg-gtggcgcg Protospacer ***.*.******************* ** ***.
28. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 5, identity: 0.848
cgtaagcgccttatccggcctacg-gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgagtggcgcg- Protospacer ***.*.****************** *** .***
29. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 5, identity: 0.868
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer gattgcccggtggcgctgcgcttaccgggcctacaaaa Protospacer . ********************************..*
30. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP019203 (Salmonella enterica subsp. enterica serovar Infantis strain CFSAN003307 plasmid pCFSAN003307, complete sequence) position: , mismatch: 5, identity: 0.868
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer ggttgcccggtggcgctgcgcttaccgggcctacacgt Protospacer . ******************************** *
31. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_LN868945 (Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 3, complete sequence) position: , mismatch: 5, identity: 0.868
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer ggttgcccggtggcgctgcgcttaccgggcctacacgt Protospacer . ******************************** *
32. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to CP022016 (Salmonella enterica subsp. enterica serovar India str. SA20085604 plasmid unnamed1, complete sequence) position: , mismatch: 5, identity: 0.868
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer tgttgcccggtggcgctgcgcttaccgggcctacaaaa Protospacer ********************************..*
33. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to LR134125 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 5) position: , mismatch: 5, identity: 0.868
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer ttttgcccggtggcgctacgcttaccgggcctacggta Protospacer * **************.****************.* *
34. spacer 4.1|1822244|45|LR134204|CRISPRCasFinder matches to LR134122 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 2) position: , mismatch: 6, identity: 0.867
accgggcattgcgccggatggcggcttcgccttatccggcctacg CRISPR spacer caccgcttttgcgccggatggcggcttcgccttatccggcctacg Protospacer * * . *************************************
35. spacer 5.1|2142098|27|LR134204|CRISPRCasFinder matches to NZ_CP053575 (Citrobacter sp. TSA-1 plasmid unnamed2, complete sequence) position: , mismatch: 6, identity: 0.778
ttcgcttgcccggcctacgtccgtttc CRISPR spacer ttcgcttgcccggcctacgaaatgtta Protospacer ******************* **
36. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP019447 (Kosakonia cowanii JCM 10956 = DSM 18146 strain 888-76 plasmid p888-76-2, complete sequence) position: , mismatch: 6, identity: 0.818
cgtaagcgccttatccggcctacggtgttgcgt CRISPR spacer cgcaagcgccttatccggcctacgggcgtgtgc Protospacer **.********************** **.*.
37. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to LR134132 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 12) position: , mismatch: 6, identity: 0.818
cgtaagcgccttatccggcctacggtgt-tgcgt CRISPR spacer cataaatgccttatccggcctacggtccgtgcg- Protospacer *.***..******************* . ****
38. spacer 4.1|1822244|45|LR134204|CRISPRCasFinder matches to NZ_CP019203 (Salmonella enterica subsp. enterica serovar Infantis strain CFSAN003307 plasmid pCFSAN003307, complete sequence) position: , mismatch: 7, identity: 0.844
--accgggcattgcgccggatggcggcttcgccttatccggcctacg CRISPR spacer gcaaccggcac--cgccggatggcagcttcgccttatccggcctaca Protospacer * * ****. ***********.*********************.
39. spacer 4.1|1822244|45|LR134204|CRISPRCasFinder matches to NZ_LN868945 (Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 3, complete sequence) position: , mismatch: 7, identity: 0.844
--accgggcattgcgccggatggcggcttcgccttatccggcctacg CRISPR spacer gcaaccggcac--cgccggatggcggctccgccttatccggcctaca Protospacer * * ****. ***************.*****************.
40. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 7, identity: 0.788
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacga-atggcgca Protospacer ***.*.******************. .* ***.
41. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 7, identity: 0.788
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacga-atggcgca Protospacer ***.*.******************. .* ***.
42. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 7, identity: 0.788
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacga-atggcgca Protospacer ***.*.******************. .* ***.
43. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 7, identity: 0.788
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacga-atggcgca Protospacer ***.*.******************. .* ***.
44. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 7, identity: 0.788
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacga-atggcgca Protospacer ***.*.******************. .* ***.
45. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 7, identity: 0.788
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacga-atggcgca Protospacer ***.*.******************. .* ***.
46. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 7, identity: 0.788
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacga-atggcgca Protospacer ***.*.******************. .* ***.
47. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 7, identity: 0.788
cgtaagcgccttatccggcctacggtgttgcgt- CRISPR spacer cgtgaacgccttatccggcctacga-atggcgca Protospacer ***.*.******************. .* ***.
48. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP044147 (Escherichia coli O157 strain AR-0428 plasmid pAR-0428-2) position: , mismatch: 7, identity: 0.788
cgtaagcgccttatccggcctacggtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacggttgagtgc Protospacer ***.*.******************** *.*.
49. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to CP044351 (Escherichia coli strain 194195 plasmid p194195_1, complete sequence) position: , mismatch: 7, identity: 0.788
cgtaagcgccttatccggcctacggtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacggttgagtgc Protospacer ***.*.******************** *.*.
50. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_AP023209 (Escherichia coli strain TUM18781 plasmid pMTY18781-4, complete sequence) position: , mismatch: 7, identity: 0.788
cgtaagcgccttatccggcctacgg-tgttgcgt CRISPR spacer tgtgaacgccttatccggcctacggatggcccg- Protospacer .**.*.******************* ** . **
51. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to CP022016 (Salmonella enterica subsp. enterica serovar India str. SA20085604 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.837
atatgcccggtggcgctgcgcttaccgggcctacaatagcgta CRISPR spacer tgttgcccggtggcgctgcgcttaccgggcctacaaaatcgcc Protospacer ********************************* * **.
52. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 7, identity: 0.837
atatgcccggtggcgctgcgcttaccgggcctacaatagcgta CRISPR spacer gattgcccggtggcgctgcgcttaccgggcctacaaaacccca Protospacer . ********************************* * * .*
53. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to NZ_CP019203 (Salmonella enterica subsp. enterica serovar Infantis strain CFSAN003307 plasmid pCFSAN003307, complete sequence) position: , mismatch: 7, identity: 0.837
atatgcccggtggcgctgcgcttaccgggcctaca-atagcgta CRISPR spacer ggttgcccggtggcgctgcgcttaccgggcctacacgttgcga- Protospacer . ******************************** .* ***
54. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to NZ_LN868945 (Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 3, complete sequence) position: , mismatch: 7, identity: 0.837
atatgcccggtggcgctgcgcttaccgggcctaca-atagcgta CRISPR spacer ggttgcccggtggcgctgcgcttaccgggcctacacgttgcga- Protospacer . ******************************** .* ***
55. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 7, identity: 0.816
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cattgcccggtggcgctgcgcttaccgggcctacgatc Protospacer *******************************..
56. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 7, identity: 0.816
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer caaagcccggtggcgcttcgcttaccgggcctacgggt Protospacer . ************* ****************.**
57. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 7, identity: 0.816
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer aaaagcccggtggcgctacgcttaccgggcctacgtgg Protospacer * . *************.****************. *.
58. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to LR134127 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 7) position: , mismatch: 7, identity: 0.816
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer ttttgcccggtggcgctttgcttaccgggcctacgaaa Protospacer * ************** .***************...*
59. spacer 4.1|1822244|45|LR134204|CRISPRCasFinder matches to LR134132 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 12) position: , mismatch: 8, identity: 0.822
accgggcattgcgccggatggcggcttcgccttatccggcctacg CRISPR spacer cgcgacttttgcgccggatggcggctacgccttatccggcctaca Protospacer **. . ****************** *****************.
60. spacer 4.1|1822244|45|LR134204|CRISPRCasFinder matches to NZ_CP019447 (Kosakonia cowanii JCM 10956 = DSM 18146 strain 888-76 plasmid p888-76-2, complete sequence) position: , mismatch: 8, identity: 0.822
accgggcattgcgccggatggcggcttcgccttatccggcctacg CRISPR spacer tgcggtctgtttgccggatggcggctgcgccttatccggcctacg Protospacer *** * * .************** ******************
61. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 8, identity: 0.758
cgtaagcgccttatccggcctacggtg---ttgcgt CRISPR spacer cgtaaacgccttatccggcctacgccagactta--- Protospacer *****.****************** .. **.
62. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP019447 (Kosakonia cowanii JCM 10956 = DSM 18146 strain 888-76 plasmid p888-76-2, complete sequence) position: , mismatch: 8, identity: 0.758
cgtaagcgccttatccggcctacggtgttgcgt CRISPR spacer cgcaagcgccttatccagcctacgggtgcgtgc Protospacer **.*************.******** .*.*.
63. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 8, identity: 0.758
cgtaagcgccttatccggcctacggtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgggtgagcac Protospacer ***.*.******************* **..
64. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 8, identity: 0.758
cgtaagcgccttatccggcctacggtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgggtgagtgc Protospacer ***.*.******************* *.*.
65. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 8, identity: 0.789
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer agaagcccggtggcgctgcgcttaccggccctacccat Protospacer * . ************************ ***** .
66. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 8, identity: 0.789
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer tactgccgggtggcgctgcgcttacccggcctacgaaa Protospacer **** ****************** *******...*
67. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 8, identity: 0.789
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer aaaagccgggtggcgctgcgcttacccggcctacattt Protospacer * . *** ****************** ********
68. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to LR134125 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 5) position: , mismatch: 8, identity: 0.789
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cgctccccggtggcgctgcgcttaccggggctacaatg Protospacer * ************************ *****. .
69. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to LR134125 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 5) position: , mismatch: 8, identity: 0.789
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer acctccccggtggcgcttcgcttaccggggctacggat Protospacer *. * ************ *********** ****.*.
70. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to LR134122 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 2) position: , mismatch: 8, identity: 0.789
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer tattgccgggtggcgctgcgcttacccggcctacaaat Protospacer **** ****************** ********..
71. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP053575 (Citrobacter sp. TSA-1 plasmid unnamed2, complete sequence) position: , mismatch: 8, identity: 0.789
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer tcttgcccggtgacgcttcgcttaccgggcctaccgtt Protospacer . *********.**** **************** *
72. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_LN868946 (Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 4, complete sequence) position: , mismatch: 8, identity: 0.789
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer tcttgcccggtggcgcttcgcttaccgggcttaccgcc Protospacer . ************** ************.*** *
73. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 9, identity: 0.727
cgtaagcgccttatccggcctacg----gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgtagagcact---- Protospacer ***.*.****************** *...*
74. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 9, identity: 0.727
cgtaagcgccttatccggcctacg----gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgtagagcact---- Protospacer ***.*.****************** *...*
75. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 9, identity: 0.727
cgtaagcgccttatccggcctacg----gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgtagagcact---- Protospacer ***.*.****************** *...*
76. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 9, identity: 0.727
cgtaagcgccttatccggcctacg----gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgtagagcact---- Protospacer ***.*.****************** *...*
77. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 9, identity: 0.727
cgtaagcgccttatccggcctacg----gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgtagagcact---- Protospacer ***.*.****************** *...*
78. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 9, identity: 0.727
cgtaagcgccttatccggcctacg----gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgtagagcact---- Protospacer ***.*.****************** *...*
79. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 9, identity: 0.727
cgtaagcgccttatccggcctacg----gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgtagagcact---- Protospacer ***.*.****************** *...*
80. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 9, identity: 0.727
cgtaagcgccttatccggcctacg----gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgtagagcact---- Protospacer ***.*.****************** *...*
81. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 9, identity: 0.727
cgtaagcgccttatccggcctacggtg---ttgcgt CRISPR spacer cgtaaacgccttatctggcctacgccagactta--- Protospacer *****.*********.******** .. **.
82. spacer 10.1|3719023|32|LR134204|CRISPRCasFinder matches to NC_020276 (Mycobacterium intracellulare subsp. yongonense 05-1390 plasmid pMyong2, complete sequence) position: , mismatch: 9, identity: 0.719
tttacgccgccatccggcgtaatgcccggtgg CRISPR spacer ccgaggccaccatccggcgcaatgcccgccag Protospacer .. * ***.**********.******** ..*
83. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 9, identity: 0.791
atatgcccggtggcgctgcgcttaccgggcctacaatagcgta CRISPR spacer cattgcccggtggcgctgcgcttaccgggcctacgatcgaaat Protospacer *******************************.** * .
84. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 9, identity: 0.791
atatgcccggtggcgctgcgcttaccgggcctacaatagcgta CRISPR spacer ttttgcccggtggcgctgcgcttaccgggcctacagaatttgc Protospacer * ********************************. * .
85. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to LR134125 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 5) position: , mismatch: 9, identity: 0.791
atatgcccggtggcgctgcgcttaccgggcctacaatagcgta CRISPR spacer cgctccccggtggcgctgcgcttaccggggctacaatggcacg Protospacer * ************************ *******.**...
86. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to LR134125 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 5) position: , mismatch: 9, identity: 0.791
atatgcccggtggcgctgcgcttaccgggcctacaatagcgta CRISPR spacer ttttgcccggtggcgctacgcttaccgggcctacggtaagttt Protospacer * **************.****************..**. *
87. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to LR134122 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 2) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer tcttccccggtggcgctacgcttaccggggctaccgat Protospacer . * ************.*********** **** *.
88. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP019910 (Escherichia coli strain MDR_56 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer gaatgccgggtggcgctgcgcttacccggcctactttt Protospacer . .**** ****************** *******
89. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to CP045556 (Citrobacter sp. S39 plasmid pS39-1, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
90. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP045203 (Citrobacter sp. NMI7904_11 plasmid pCTEL-2, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
91. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP022152 (Citrobacter freundii strain 705SK3 plasmid p705SK3_1, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
92. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP019987 (Citrobacter werkmanii strain BF-6 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
93. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP030224 (Salmonella enterica strain SA20083039 plasmid pSA20083039.1, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
94. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP026213 (Citrobacter sp. CFNIH10 plasmid pKPC-933d, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
95. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_MG387191 (Citrobacter freundii strain 2262 plasmid pTEM-2262, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
96. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP048383 (Citrobacter freundii strain 62 plasmid p6_A, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
97. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP026282 (Klebsiella oxytoca strain KONIH2 plasmid pKOR-e3cb, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
98. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP042535 (Citrobacter freundii strain E51 plasmid pE51_001, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
99. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP007734 (Klebsiella pneumoniae subsp. pneumoniae KPNIH27 plasmid pKPN-262, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
100. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP026058 (Citrobacter freundii strain FDAARGOS_73 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
101. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP029436 (Klebsiella quasipneumoniae strain CAV2013 plasmid pKPC_CAV2013, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
102. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP029431 (Klebsiella quasipneumoniae strain CAV2018 plasmid pKPC_CAV2018-435, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
103. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP018017 (Kosakonia radicincitans DSM 16656 plasmid pKrDSM16656L, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
104. spacer 12.1|4513244|38|LR134204|CRISPRCasFinder matches to NZ_CP029442 (Klebsiella quasipneumoniae strain CAV1947 plasmid pKPC_CAV1947-412, complete sequence) position: , mismatch: 9, identity: 0.763
atgtgcccggtggcgctgcgcttaccgggcctacagga CRISPR spacer cctcacccggtggcgctgcgcttaccggggcgacataa Protospacer . ..************************ * *** .*
105. spacer 4.1|1822244|45|LR134204|CRISPRCasFinder matches to NZ_LN868946 (Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 4, complete sequence) position: , mismatch: 10, identity: 0.778
accgggcattgcgccggatggcggcttcgccttatccggcctacg CRISPR spacer catcaatatatcgccggatggcggcttcgccttatccggcctaca Protospacer . ...** *********************************.
106. spacer 6.1|2364831|33|LR134204|CRISPRCasFinder matches to NZ_AP023207 (Escherichia coli strain TUM18781 plasmid pMTY18781-2, complete sequence) position: , mismatch: 10, identity: 0.697
cgtaagcgccttatccggcctacg-gtgttgcgt CRISPR spacer cgtgaacgccttatccggcctacgcacactata- Protospacer ***.*.****************** ....*...
107. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 10, identity: 0.767
atatgcccggtggcgctgcgcttaccgggcctacaatagcgta CRISPR spacer aaaagccgggtggcgctgcgcttacccggcctacatttcgatt Protospacer * * *** ****************** ******** * .*
108. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to NZ_CP019910 (Escherichia coli strain MDR_56 plasmid unnamed4, complete sequence) position: , mismatch: 10, identity: 0.767
atatgcccggtggcgctgcgcttaccgggcctacaatagcgta CRISPR spacer gaatgccgggtggcgctgcgcttacccggcctacttttccttt Protospacer . ***** ****************** ******* * * *
109. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to NZ_LN868946 (Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 4, complete sequence) position: , mismatch: 10, identity: 0.767
atatgcccggtggcgctgcgcttaccgggcctacaatagcgta--- CRISPR spacer tcttgcccggtggcgcttcgcttaccgggcttac---cgcctacgc Protospacer . ************** ************.*** ** **
110. spacer 4.1|1822244|45|LR134204|CRISPRCasFinder matches to NZ_LN868946 (Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 4, complete sequence) position: , mismatch: 11, identity: 0.756
accgggcattgcgccggatggcggcttcgccttatccggcctacg CRISPR spacer tatctgtagtctgccggatggcggctccgccttatccggcctaca Protospacer . *.* * .**************.*****************.
111. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to LR134122 (Klebsiella aerogenes strain NCTC10006 genome assembly, plasmid: 2) position: , mismatch: 11, identity: 0.744
atatgcccggtggcgctgcgcttaccgggcctacaatagcgta CRISPR spacer tattgccgggtggcgctgcgcttacccggcctacaaatccccg Protospacer **** ****************** ********* * ..
112. spacer 11.1|3894454|43|LR134204|CRISPRCasFinder matches to NZ_CP053575 (Citrobacter sp. TSA-1 plasmid unnamed2, complete sequence) position: , mismatch: 12, identity: 0.721
atatgcccggtggcgctgcgcttaccgggcctacaatagcgta CRISPR spacer tcttgcccggtgacgcttcgcttaccgggcctaccgttatttc Protospacer . *********.**** **************** .* .. *
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
65 : 22768
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >LR134204|65:22768|DBSCAN-SWA AGTGGAGGCCGCTAGCGATGATGCTCGCGACATTGATATTGCTGATGCTGTTCGTAACCTCATCACCCAGCCACAAATTCCGGAGCTGCTTTTTGATCTGCTTGACGGTCTGGGGAAAGGCGTGGGCGTCTGTGAAATCCTCTGGGATACAAGCAACCAGTTCTGGCAACCCCGCGATTATGAATGGGTGGATCCTCGTTTCCTGAAGCCGGACCGTGAGACCTTACGCGACTTCCGTCTGTTAACGGATGCCAGTCCCATTGAAGGTGAACCCCTGACGCCCGGTAAATATGTTGTGCATCAGCCACGCCTTAAATCGGGTCTCCCGTTACGCAACGGCCTGGCGCGTCTGGTGGCGGTGATGTACATGCTCAAGTCCTATACCGTCAGGGACTGGTGGGCGTTTGCCGAGAAGTTCGGTATTCCCGTTGTGGTCGGTAAGTACGGTAATAACGCCAGCCCGGAGCAGATCCAGACGCTGCTGGAAGCCATCGCGTCACTGGCATCAGATGCCGGTTGCGCCATTCCGGCTTCGATGACCCTTGAAATGCAGGAAACCGCCAGCCGTAATAACGGTGGTGCGCTCTTTAAAGAGATGGCCGTCTGGTGCGATGAGCAAATCAGCAAGGCCGTGCTGGGTCAGACCATGACCACCGATAACGGAAGCTCCCGGTCACAGGCGGACGTGCATGACCGGGTGCGCATGGATATTGCCCGCTGGGATGCCCGGCAGCTTGAGAACACACTCAATGAATTTCTGGTGCGCCCCTTCGTTATCATGAACTACGGGCCACAGGATTCTTATCCCCGCGTCGTGCTGCGCCTGAGTGAACCCGAAGACCTGAAGATGCTGGTTGATGCGCTGACCCTCTGATTGATCGGGGTATGGAGGTGCAGATGTCAGAGGTGCGTGACAAGTTCGGCCTGTCAGAGCCTGAGAAAGGCGCGGCATTGCTGACACCCTCCAGCCAGGCGCTCCAGCCTGCGCTGGCGATAAACCGGGAGCGCCTTGCCCTGAACCGTAACCAGCAGGACGATATTGATCTGATGGTTGCTGATGCCATGCAGGACTGGCAGCGTACCGGCGATGCGTTTACCAGCCCGGTGCTGCAACTGGCGAAGGATGCTGACAGCTTCGACGCGTTTCTGGCCGGTCTGCCGGAACTTCAGAAAACGCTTGAACCAGACGAGTTCGTCACGCAACTGGCGCAGCTCTGCTTTAAGGCCCGTGCTCTGGGAGATGTAAACGATGCGTAAACCTGAACGCAAACCGGACATTATCCCGAAGGAAGCGCTGGAGTGGCTGAAGGCGAAAAAGCTGAAGCCGGGCTTCGATTATCGAGACGTGTGGCAGGAAGAGCACCGTTACGGTTACACCGTCGCGAAGATGACGCAGCTTGATTTGCTGGCCGACGTTCGCCAGCTCGTCGAGGATGCGCTGGAGAACGGCCAGACATTCGCGCAGTTCCGCGAGCTGTTGCGCCCTTTACTGGTAAAGCGCGGATGGTGGGGACAGGCGCTGATGGATGACCCGCTGACGGGGGAGACCCGGCAGGTGCAGCTCGGCAGCGAACGGCGAATGCGCGTTATCTATGACACCAATATGCGCACTGCCCGCGCAGCCGGTCAGTGGGAACGCATCCAGCGAACAAAGCGGGCGATGCCGTATCTTGTTTACACGCTGGGACCGTCACGGGAGCACCGGGCGGAGCATCTGAAATGGGCGAATACCTGTCTACCGGTTGACCATCAGTTCTGGATAACACATATGGTGCCGAACGGCTGGGGTTGTAAATGCAATGTTCGTCAGGTCAGCCTCTATGAGTTTGAGCAAATGCAGCAGAACGGCACCATCACGACAACCGCACCGGATGTCCGTTACGTGAAGTGGGTCAATAAGCGCACGGGCGAGGAAGAATCCATTCCAGAAGGGGTTGATCCGGGTTGGGCATACAATCCGGGTATTTCCCGCAGCGATGCACTCAGCCTACAGTTGAAGCAGAAACAACAGGCATTTGACAGTTATACATCATCGCAGAAATAATCCCCCTCAGACGCGCCCGGTGACGATATCGTGATTTATGCTACGATGGCGCGTGGGAATTTTATCTAACGTCTGTACGCGCTTTTAAACGGGTTTTAAACGGGGTTCCCCGCGTCGTTTACAGTAAAGCCCGTTAATCCATCTTCCCTTTCTTTCTCCCGCATGCTGTCCGGAGTCTTTTACGACAACGGACAACACCATGAAACCGACCAACACGGAACTGCTGGCACTCTGCTTTCAGCTTCCTGAACTTGCTGATGATGCGCTGCCGGAATGGCTGCCAATGATACCGGCCGGAACCTTCACCGGCCGCGATGGCCGTTCGTGGATCAACAACCAGCCGGAATCCGTCATTCGCGCCACCCTGAGCTATCCCAAACTGCCGTTTGATATTGAGCATGCCACCGAACTGAAAGGCCCCAAAGGCGAAGAAGCTCCGGCCTTTGCATGGCTGGACGATTACCGCATCCGACGATGGCGTAATTGAGGCGCATGTCGAATGGACTGCTGACGGTGCCGCACTGGTTCGTGGCAAAAAATACCGCTATTACAGCCCGGCATTTCGATTCACTGCGGATGGTCAGGTGACCCGCCTGTCCAGCGCCGGGCTGACTAACAAACCCAACCTTGATTTACCCGCACTCAACTCAGAGGAAAACACGATGACAGTACCTGTCCAGATTGTGACGGTGCTCGGCCTCGCGGCTACGGCCAGCGCGGACGATGCAGTCAAAGCCATTCAGCAGCTCAAAACCAGCGAGCAGGTTGCCCTTAACCGCGCGGAGAACCCGGACCTGACGAAGTTCATTCCGGTTGAGACTCACCAGTTAGCACTTAACCGGGCTGAAACGGCTGAAGCACAGCTCAGTGCAATCGCCATCAAAGAAGCAGAAGCACTGGTTGATGGCGCTATCGAGGCCGGAAAAGTTGCCCCGGCCAACCGTGAAATGTACCTCGCCACCTGTCGCTCTGAAGATGGCCGCAAGCAGTTTGCGGAGTTCGTTAAAGGTGCGCCAGTCATTGTCAGCAAGGACCCGTCCGACAAAAAAGATCCAGGCGGTGATGGCAACACCACACTTTCCGATGAAGACCTCGCGATGTGTCGCCAGATGGGCATCACCCAGGAAGAATTCCTTTCCGTTCGTAAGCAGGAGAAATAATCCATGCAGGTATCCGCAGAAGTATTGCATGCGCTGACCACCGCCCTGAGCGCCGCCTTTACCAAAGGTGTCGGGCGGGTCAACCCGCAGTATCGCTCCATCGCCACGGTCATTCCCAGTACCGGCGCATCTAACACTTATGGCTGGGTTGAAGATTTCCCGACGATCAAAGAGTGGATCGGGGAACGTCAGCTGAAAGAACTGGCTCAGGCCGGGTATACCATTACCAACAAGACCTGGGAAAACTCGGTCAAAGTTAAGCGCGAAAAAATCGAAGACGATCAGATTGGTCAGTATTCCGTGATTGCTGAACAGCTTGGCCGCGATACCACCATTTTCCCGGACAAGCTGTCGTTTGAGTTGCTGTGTAAAGGTTTTGATACGCTGTGCTGGGACGGCCAGTATTTCTTCGATACCGATCACCCGGTCGGCACATCCACCGCCTCAAACGTGATCGGCGACCCGACAACCGACGCAGGCGAACCGTGGTTCCTGGTGGATGCGACACATGCCCTGTTACCCATCATTTACCAGGAGCGCCGTCCATTTAATTTCACCGCCCTTGATGATCTCACCAGTGAGCGCGTGTTCCTTCAGAACGAATTCGCCTATGGCACCGATGGCCGCAGTAACGTCGGCTTTGGCTTCTGGCAGACCTGTGTCGGGTCGAAAGCAGCGCTGAACAAAGCTAATTATGAAGCCGCCGTCTCCGCAATGATGGATATCACTGACTCCAACAGCGAACCACTGGGCATGAACCCGACACTACTGGTTGTCGGTAAAAACAATCGCGGTGCCGCGAAGTCGCTTATCGAAGCGCTCATGGCAGATGGTGGCGGTTCCAACATCTATTACAAAGACGTTGATCTGCTGATTTCACCTTACGTAAAAGCCTGAGTCCGTTACGTAAAAAATAACGTAACCCTGCATTACAGGAGGGGTTAACCCTCCTGTAAATCACCGCTAACAGAGGTTTAAAAAGTGAGTGGAAAAACTGAAAAGAAATCTGTCAAACCCGCAGCCGGTAGCGACACAACCACGGCACAGAAAGACATCAAAGGCACGCAGGACACAACTGTCAAGACGGACCCGGCATCGCTGGCACCAGTTACTTCTGCTGACGGTCAGACATCACAATCAACGGTTACAGCGACAGATGGTGAAGCTGGCCCGGAATCTGATGCTACCGCCACCATTGCAGGAGTAACACCGTCCCCGGAGACGGACCATTCTCATCTTCAGGATGCGGGGGTTAATACAGTCATGTCCGCACTATGCCAGTCAGTTTCGGATGGTGTTCATATTTCTGGTGACGTGACCGTGCTGGAAGTTCGCGCAATCCCTGAAGGTGGATTTCACCGCGCGGGACGTTTCTGGCCGCATGACCCGGTGCATGTCTTCGTCAGTGATGATCCGGACGAGCAGGTTCTGGAAGATGGCAGCGGTCAGCCGCTTTATGGCTGTGTTATCAGCACCGCAGATGCCAGCCGTCTAAACCGGGAAAAGATGCTGGTCGTTACTGAGCTGAAACCAAAGGCGGAGGAAAGCTGATGGGTATCTACGTGACGCGGGACGATTTGCTGGCAACGGATGCGGAACGCGTCTGGAACATGGCGCTGAACAAAGCTACCCAGCAGCTTGATGAGGAGAAGATCCAGCGTGCAATTGATGATACTGACGCGGAAATCAATTCCTTTCTGGCGAAGCGTTATCACCTGCCGCTGAATCTCCCGACACTCCCGAGTCCGTTACGTCGTGCGGCGGTCTCCATCGCGTTCTACTGGCTGTCTGAACGTGACAGCCAGATCACCGACGAGATCCAGAAGCGCTACGACGATGCCCTGCGTACGCTCCGGGAAATCGCCAACGGCACCCGTGACCTTGGCGTACCGTCTGATACGCCGGTGCCTGAAACTGACACCGGCAAGCTGATTATTGTCAGCGATAACCGACGTCTGTTCACCCGTAACAACCTTAAGGGTGTGCTCTGATGGGAATAACCGTTGAGGTGACAGGGGCAGAGAAGCTCCAGACTATCCGCAAGGCAATGGAAAAGCTGGCCGACAGTTCGCTTCGTCAGGAACTGCTTGAAAGCATCGGCGCGGTGGCGGAGTCCCAGACACGTCGCCGTATCGCCAGCGAAAAAAGCAGCCCGGCAGGTGCAAAGTGGCAGGACTGGTCTGACAACTATGCAAAGACCCGCCACGGTAACCAGAGCCTGTTGCAGGGTAATGGCGACCTGCTGGACAGCATCCAGTATTTCGTCAGCGGCGAGCGGGTGCATATCGGTACGCCACTGCCTTACGGCAAAACGCACCAGGAAGGCTTTTCCGGTAGCGTCGCTGTGTCTTCCCACAAGCGCCTTATCACACAGGCATTCGGCCGGGCGCTGAAGCACGGCGTCTGGCAGACCGTGGGGCGCATCAGCGCCAGATGGACATCCCGCAGCGCGAGTTCCTCGGTCTGTCTGCGGATAACAGTAACGAGCTGACCAGTGTGATCGGCGATTTCTGGAGTGAGGTTCTGAAATGAGTGAACGTCCTGCGTTCGTGACCCTCGGCAGTACGGTCAGTGCCGCTGAAAACATCGTGGCCTGGCTGAAGACGGAGCTGGAAGGCAACACGCCAGACCGTGTGGAGATAGTGGAGCGTCATGTCGGCCAGTTCAGTACGCCGGATGAGGTGAAGCGTTACCTTTCCGGGCGTTCCGGCTGCGTGCGTCTTGCGGCCCTGCGCGTACGTAATATCAGTAACCGCAACGGTATGACGGGGCTGGTGACGTGGGCGGCCTACGTCATGACCTCCGACTCATGGGGCTATGCCCGTGATGCCCGCTGCGAGGTTCTGGCCGGGAAAATCGCCCGCCGCATTTCAGTCCGGGAGGCTCCCCGCGCCATGAAGGCTGAGCGCATGGCGGAAAATATCGGCGCTGAAAATATCTACTCCGGCCGCCTGGATAACTTTGGCGTCAGTCTCTGGGCGGTGACGTGGGAACAGGTGTTTCGTCTGGATGACGAGATTGATATGGCCGCACTGCCGGAGTTCCTGCGACTTGGCGCATCGTTTGTCGTGAACGGCCAGCCGGTTACTGAAGAGCCGGACATCATTAATGTAAGAGAAGGTCAGACTGATGAATAAAAAACTGATTAAGCCCGCCCGCCCCGGTCTCCGGGTACGTAAGGCAGATGGCAGCCTGCTGAATGCTGATGGCGAAATACTTGCTGTCGCTGCGTACTGGCGACGTCGTGAATCCGAAGGTGATGTGGTTATCACCGCGCCATCCAAACCCAAATCCGGCAAAGCTGATAAGGAGGCATGATGGCTCTGGGTAACATTCCTGATGATATCCGCGTCCCGCTGGTGTGGATCGATATCGATAACTCAATGGCGATGAGCGGTGCGCCAGCTCAGTCACGCAAAATTCTGGTGATCGGCCAGCAGGTGGAGAGCGCCAGTGCTGAACCACTAACGCTCAATCGCATCACCGGCGACAGTATGGCGGATGAATACCACGGCCGGGGATCCATGCTGGCGGAGATGCTGAAAACCCTGCGTAAGGCAAACAGCTATACCGAGACTTATGCAATGGGACTGGCTGACATCATCACCGGTGCTGCTGCGACAGCCAGTATTACTGTCGTGGGAGACGCTCTCGCTGCCGGTACGCTTGCCCTTCTGATTAACGGTGTGTCCGTACAGGTCGGCGTCGCTCAGGGGGATTCTGCTGAAACCGTGGTGCAGTCCGTCATTACGGCCGTCACAGCGAAAACCGCCACGCAGGTCAGCGCAGTCGTTGACGGTGAGAATGCCGCCTCAGCGGTGCTGACGGTGAACTGGAAAGGTGTGACAGGCAACGACTGCGACGTACGCCTGAACTACTACTCAGGTGAGAAAACACCGTCCGGTATCAGCGTTACCGTGACACCGTTCACGGGTGGTGCCGGTACGCCGGATATTCAGGCTGTTGTCGCCGCGCTGGGTGATGACTGGTACACCGACATCGTTTTCCCCTACAACGACACGCAGAGCCTCAACACTATCCGCGACGAGCTGCTGGAACGCTGGGGGCCGCTGAAGATGATGGAGGCGCAGCTGTGGACTGCATTCCGTGGAACACATGCGCAGACCGGCACGTTCGGCAGCGCCCGCAACGACTGGCTGATTTCCTGTATCGGCACCAACATTTCCCCGGAGCCGGTCTGGTTATGGGCGGCAAGCTACGGCGGAACGGCAGCTTATCAGCTTGCCATCGACCCGGCCCGTCCGCTTCAGACGCTGGTACTGACAGGTATCAAGTCGCCAGCCCGCGCCGTTTCGCTGGGACATGCCGGAGCGTAATCTGTTGCTGCACGATGGTATCGCTACCCACTTTGTGGATGCCGGGGACAATGTCTGCATCGAGCGTGAAATCACCATGTACCGCGTGAACAGCTTCGGTGACACCGACATCTCGTACCTCGATGTGCAGTCACCGGCAACGCTGGGACGTATCCGCTATGTCATCAAAAACCGTTTCACCAGCCGTTATCCGCGTCACAAACTGGCCGGGGATGATGTCCTTGATCTGCTCGATGCCGGACAGCCAGTGATGACGCCGAAAATCTGTCGTGCCGAGCTGCTGGATATTGCGCTGACCGGAGCTTATCCCGGCCGGGCTTGTGGAAGATTTCGACGATTACAAAGACACGCTGGACGTCTCCATCGATTCCAGCGATCCAAACCGCCTGAACTTCATCTGTCACCCAAACCTGGTGAATCAGCTGCGCGTTCTGGCCGGTCTCATCCAGTACAAACTCTAAGGGGAAGCTATGTCGAGCATTCTGGGAATGGCGGCCATCCGTATCAACGGCCGCGAAATCAAGACCGAGGGAAAATCCACCCTCAATCCGGGCGGGTATCAGCGCCAGCAACATATGGGCGCTGGCAAAATCTGGGGGATTTCCCGTAAGACCGCCGCCCCGTCCATCAAACTGACCATTGCGGCAGACCAGGACGTTGATGTCATCGAGATAAGTCAATGGGAGGACGTCACCGTGATGTTCTACGGCGACAACGGCCTGAACTACATGATGACCAAAGCGGCAACGGACAACCCGGCCGAACTGGACGAAGACGCCGGAACCGTGACGGCAAACTTCAATCGGCGTTCAGTGTGGAAGGTGTAAGACATGGCAACGATTGATTTGATCTGATTCATGGCCTGCGCACCGGCGCAGGCACCACCGATGAGGCGATGCACAAAACTGTCAGGCTGCGCGAGCTGACCACGGATGACATTGTGGACTCGCAGCTGGCGGCCGAACGTGTCGTGATTGGTGAGAACGGTAAGGCGGTTGCCTACTGTTCTGAAGTCCTTGTCGGGCTGGAGATGCTGCGCCGTCAGATTGCCAGTATTGGCTTTTATCCCCGGCCCGCTGGATATGAAACAGTTACGTCGTCTGCACCCGGACGACCTGAATCTCATCAACGAAAAAGCCGCCGCACTGGATGACATGCTCCGCGAGGTGGCTGAACGGGGCGAGCTGATGCCGCTGGCAGCGGCACTGACCCATCTGCTGATTAATCTTTCACAGCGCTTTGATATTCAGCGTCTCGGACAGCTGCCCCTGCGGCAGCTGTTAACACTGGTGCGGCAAACTGGAGAAACAATATGGCCGGAAATCGCCTTAGTACGGAAATTCTGATTAATCTCGCCGGTAACCTTCAGGCCAAGGCCCGCCAGTACGGAGCCAGCATGTCCGAGTTTTGCCAGCCGTAATCAGCGGCGATGTCCATTGTTCGGGCGACGTCTGAAGCGGCCGGACGTGGACTGGACAGGTTGGGCAACCGTTACACCGCCTGATTGCCAGCGTGGCCGGGGGCGCAGCCCTGCGGGAGTTTGCAAAAACGGATCGCATGTTGACCGAACTGGGGATTGCCGCCGGGAAGACGCGCGAGGAGATGCGCAAGATTTTTTCTGATACCCAAGATGCGTCCATCAAATTCAGGGTGGACGATTCGGAAGTGATGGCGGCAATTCTAATGTCAACAAAATGACCGGCGATCTGGATTTTGGTGTCAGTAATAAGGACATGATGGCGGCTTCTATCGCAGCATCCCGGATCTGATGGTGAATCAATCGGTGGGTTGTTCGCCAGTTTCCAGAAATTCAAAACCAAAAATGAACATGAAAACCTTCTGGCGATGGATCTGCTGAACCAGTTGGGTAAGGAAGGTGGTTTTGAGCTTAAAGATTTTGCCGAGAAAGGTACCAAGATCTTTTCCGCTTATGCCGGAACCGGGAGGACTGGCCCTCAAGCACTCAAAGAAATGGGCGTGGTTATGGAGTCGGCAATGGATGCCGTGGGGGATAAAGACCTAGCAGCCACTGCGTCCTTTAACTTACTTAACGATCTACGTAACCCGAAAATTGCTAAGGTACTGGAAGCCAGTGGCGTCAGACTACGTGATATGCAAGGGAACATGCTCCCCATCAATAACATAGTTAAAGATATCGCTCAGCGCTCCGGTAAGGATGGCTCCAAGCGTCAGGATGAGCGGCTTGCCAAAGCGGGGTTTACCGATTACAGCCGATTACTCATTTCCAGCGTTACTACTGGTAAAGGGGCCGAAAACTTTGCTCGCTATAACGCGGTTGTCGCTGATGGTTCGGGCATTATGACCGACGCCAAGTATGCGGCGCAGGATTTCACCTCTGCAATGAGCAGTCTCAACGTCACCTGGAAACAGTTCGCTAACAACAATCTGGCAAAACCGGTTCAGGAACTGGCTGATGCTATCAACAGCATGGAGCCAGCAGCCGTTCAGCGCTGGCTGGAAGTAGGTAAATACCTCGCTATTGCTGTTGGAGGTGTTATAGCTGCGCGTAAAGCCTTCCAGATTGGGAAAGGCACCTGGGACTTTTTCAACACCGTCCGGGGGAAAAACGGCAAAGGTGGCGTAGCCGGTGGAATCGCTGATGTGTTCGGCTCTGGCGTGATGCCTGTCTATGTCGTGAACATGGGGGCCGGTGGTATGGGGGGCGGTATCACTGACGCGCTGGGTGAAGCCGGAGGTCGTGGTGGTGGACTCCCCGGACGGTTTGGTCGCCTTGCCCGTGGTGCGGGAAAATTTGCTGGCATAGCAGGTGCAGGGGTTGCGCTTTATGACCACCTTGAAAGCAACTACAGGCTCGATGGCCGGGTTGATAACCTGACCAAACAGGTTGTGGAAGATAAAAATGCTTCCGTGCAGGAAAGAGCCTTTGCTGAAGAAAGCCAGCGTAACCGTCAGGCTCTGGCGAATAAATGGAAGCAATGGTTTGGCGGTGACGATACACCACGAACAAAAGTTGTTGATCCGCGCCCGTGGGCCTCGATGGCTCCTGTCATCCCCGCCGTAAATTTCGCATCGGTGCCTGCGCCATCAGACCCGAAAGGTCCCACCATTCCACAGCTTAAGAGTGATGAGCACTCATTGTGGGCGACTATCGCCGACTTTTTTAAGGGGGCCAATACCACCATTGAAAGCGGTATGCCTTCGGTAGCTCAGGAGGACAAAGCGCCACAGCTTCCGCCTGTTCCTCCGAAGCTTCAGGGTGAAATCCGGGTGATTGTTGAAGGCGATGCGCGGGTTAAAAGCGTGAAAATGGATCAGCCAGGTGTCACGCTCAGTGCCTTCGCAGGCGTCTCTAATGTGGAGCAAAACTGATGGGTACGACGAAATGGGAAGACCTGCGCGAAGCGTCGTTTCGGGCGTGGCGTTTTATCTGGTGGATAACGAAGGCACCAGCGGCCGTCGTGCCATCCCCCGCGCATACCCGAAAAAAGAAGTGGGATGGACCGAAGACAACGGCGCTGTACTGACACAGCAGCAGATTAACGGGAAGTTAATTGGCAGCAGTTACCAGTCACAACTGGAAGATCTGCTGCGTGCACTCAATACACCGGGACCGGGGGAACTTGTTCACCCGTGGTTCGGGATCCAGAAAGTCCAGGTGGGTAAAGTGAATCACCGCCTGAGCACACAGGAAGGCGGTATTGCGTATATTTCCTTTGAGGTCTCTGAAGCTGGCGAACGCCTGTTCCCCGCAGCGGCTGAAAATACCAGCCTGACCGTACTTAGAGGCGTGGACAAAGTGAAAGCAGCGCTTGAGAACGGTGATTTCTTTGCTGTGCTCGATGGGCTGGGCGAGATGGTCGATACCTTTCTGGATGACATGGAAGGAATGGTCGTTAACCTGCTCACCCTGCCATCGGCCATCACCGAGTGGATGGATCGGCTGGTCGTTTCCGTGGTCTGGTTGATGTCATCGTTGCGAAGCCTGCGAACTTCATCAACGAAATTCTGGGGCTGGTCAGTGGTGTGCATGAGACCGTGACTGAACCGCTCTGGTCAATGCGTCTCTATGACCGGTTACGCAGCCGCTGGGAAGGTGCACAGTCAGAAGGTTCCGGGGCGGGTATCAGTCGTGCAGAAGCGGCGGCCACCCGCCAGTTACCACAGTGTTATGTCCGTCACTCCGGGGTCAGTGGAAAGGAGGTGTCGGGGGATTTGCCAGCAGTATTCCCACAGTGGCAACCACACCATCGCCAGCGATGCAGGCCAACATCACCGGGTTTACGCAGGTGGTCGTACTGGCAACCCTGCTGGCACAGGCGGAAACCATCGCGCAGACGACGTTCCGCACCAGCGAAGAGGCTGTCAGTACCGGTGATGCTCTGGCCGTTCTGCTGGCTGAGCAGGCCGTCATTGCCGTTGAAAGTGGTCAGCGTGAACTCTGGCGCACGCTTCGCGATCTGCGTTTTGCCGTGGTGAATGACGTGCGCATCCGTAGCGCCAGACTGCCGCAGACGCGTCTGCTGTCTCCGACGATCACCTCTTCCGTCTCACTGATAGCCTGGAGGGAAACCGGCAACACAGAGAACCGGGACACCATCACACTGAGAAACCGGCTGCGCGACCCTTCCTTTATCCTGCCGGGGTAAAACTATAGAGGTAACTGAATAATGGAATCTGTTGTTCTGACGGTGGATAGTCAGCAGTGGGACGGCTGGACGGAAATGTCGATCACGTCCTCACTGGAGGCCATCGCCGGAGAGTTCGATCTGACTGTCACCACGCAGTGGTCTGAAGCATCCCCCCGCGTTATCCGGCAGGGTATGCCCTGCACGGTGGCTCTTGGCAGCGATACGGTAGTCACTGGTTATATTGATGATTTTATTCCGAGTTATGACGCTGAGAACGTGAGTATCCGGGTCACCGGCCGTGACAAAACCGGCGATCTCGTTGACAGTTCTGTTGTTCACAAATCCGGGCAGTGGAAAGGCGTTCGCCTGGAGAAACTGGCGGAAGAAATCTGTAAACCCTACGGTGTCGCCGTCATTAATGAGACCGACACTGGTGAAGCGTTCCCCTCCGTTGCTCTTGAACAGGGTGAAACCGCCTTTGACTTGCTTGATCGGCTGGCGAAGCAACGCGGCGTTCTGCTGACCGCTGACGGGCTTGGGCGTCTTGTTATCACCCGTGCATCAACAAAACGCGCTGGCGTTGCTCTGGTGCTGGGGAAAAATATCCTTGCAGCGCGTGGCCGCTTCAGCTGGCGTGAACGCAACAGCCAGTACATCGTCAAAGGCACCACCAGTGCCGGTGGCAGTACATGGGACGAACAGCCCGCAAAAGTGACCGGCGGGCGTCAGACCATCGTTGATGACAACGAGATCAACCGCTACCGCCCTAAAATCCTCGTAAACGAAGACAGCCTGACCGTGGGCGGCGCAAACACGCGCGGTGAGTGGTTCAAGGCCCGAATGCTGGGCGAAGCAAACAGCACTGAAATAACGCTGGCAGGCTGGCGCGAGAACGGCGACAGCGGCCCGCTCTGGCAGAAAAATCAACTGGTCGATATTGATGACCCGGTGCAGAACCTGAAGACCACCTGGCTGATAAAAACCGTCACGTTCACCGAAGGTGACAACGGCCGAATCTGTGTACTGACGCTGGTGCCTCCGGAATCAATGGATCTTCCTCTGACCGATGCGAAGAAGAAAGGTAAGAAGGCGAAGAAAGGTAAAACGGTGACGACATGGGACTGAATCCGGCAAATATTGGTCGCACGCTGACGGGTCTGGGGCGACGTCTTCGCCTGATGGTTGATCGGGCTGTTGTGCGAATTGTCACCGACAGTCTGGGGCGTCAGAACCTTCAGATCCAGTCGCTGGCTGACGCCACTAACGACGATGTTGAACGCTTCCAGAACTACGGTTTTACGTCAGTTCCGCCAGTGGGTTCTGAAGCCATCGTGCTTGCAGTCGGAGGGCGTCGTGAAGGTCTGGTGGCAGTTGCCGTCGAAGATAAACGCTGTCGTCCGAAAGGACTGAAGGACGGGTGATGTCTGCATCTATCACGCAGACGGTCAGGCTCTGGTTATTCTGAAGAAAGACGGCGTGGCAGAAGTAAGAGTAAAAACGGTTAATTACACCGCCACTGACTTATTCGAGATAACTACAGCTCAGTTCAAAGTAAACGGGCCGTCAGAATTCTCGGAGGATATTGTGGTAGGTGAAAAATCCTTCCTTGAGCATTTCCATATAGACGGTGACGGCGAAAAAACATCGGAGCCGAAATGGACTATCGGGATCACCTGGAATAACCAGCTGTCGCGCGGCGAGCTGACGGTGACGCATGATGGCCTCACTCTGGATGAGGGGCTGGTCACACTGGTGTTGATATGCCTGTTCACCGATACCCGAGCTGATGACGATGATGTCATTCCAGATAACACCGGCGACCCGCGCGGGTGGCCGGGGGACACCTTCAGTGCGTATCCGTGGGGTTCAAAGCTCTGGTTACTTGACCGCGAAAAGCTGACAGAGACGGTGCGCCAGCGTGTTGAGGATTATGCCAGTCTTGCCATGCAACCCTTATTGCGTTCGGGTTATGCCAGAACAGCCAGTGTGACGGCGGTAATCAGTGGTGCTGACCGCATCAATTTTATTGTCATCCTTAGCCGCCCGGACAAGACGCAGTTGCGTATTGAAATCAGTAAACGTTGGGAGGCGACAGAGCATGCCCTTTGATATTCCGGCGCTTCGTAAGCTTATCGCCGACGGTGAGAAAGACATTGCGATTGAGCTGGGTCTGCAAACACTCCCGCCAGTGGGTGTGGAGAAGGCACTGAATGTGACGTTCAGCAGTCAGGTACGCGACCTTTATGACCATCAGAGCTGGATAAAAGACCAGATCATCCCGTCAGTAAAAGCGGATGACGACACAATTATTGAGATTGCAGCCAGTGAGGGTGTGATCCGTAAGCAGGCGACATTCTCTGGTGGCCCGGTGATATTCCCCGGACTGGCGAGTATTCCGGAAGACACCGAGATGCAGACATCATCCGGTGTGCTGTATCTGGTCGTTGCATCCGGGATGCCGCAGAACGGCCAGGTTATGGTCACAGTGCAGGCCAGCGACGCTGGTGTTGCCGGTAATCTTCCTGAAGGCGAGACCATGACGCTGCCTCTCTCCTGTTCCCGGCGTGGAAAGTGATGGCATTGTGGGTTCTGGCGGGCTGACCGGAGGCGCTGATATTGAGCCTGTAGCCGAGGTGCTTGACCGCCTGCTGTACCGTAAGCGCAATCCTCCTGTTGGTGGTGCACTGCATGATTACGTTATCTGGGCGCGTGAAATGGCGGGCGTCAGCCGCGCATGGTCGTGGGACGTCTGGCATGGTCCGGGTACTGTTGGTCTGGCATGGGTATACGACGGCCGTGAAGATATCACCCCAACGTTTTCAGGACAGAGCCGATATGGAAGCTTATCTGTTTCGTCACGCTGATCCGGCAACGGGTAACTTCGTCGGCAAGCCTGGCGGTATTGAAGTCTGGCCGGTTGAACTTCATCTGAAGCCGGTGCCGCTTGCTATCCGGCTGACGCCGGACACTCAGGCCACACGTCAGTCAGTAGAGGCCCGGTTGCTGATCCTCCAGCAGACAATGGCACCGGGTCAGACAATGGGCGTTTCTGCACTGCGTACTGCAATTGGTACGGCTTCAGGCGTAACGGATTACACACTTTGATATTGATGGAGATATTACCTGCGATCAGAACGAACTGATAACCGTTGGGGTGATTACATGGCTCACAGCGTAGATGAATGGCTGGGAGCGCTATGGCAGGTCATGCCACGAGGCAAGGCATGGTCGCGTGATAATGACAACGATTTAACGCGCTTTTTACGGGCATTAGCCAGGCGTTTAAGCCAGGCTGAATTTGATGCGGAACGGCTGCTGCCGGAGATGCGGCCAGAAACCACCTTTTTATTGCTGGAGGAATGGGAAGAATATCTGGAGTTACCTGAATGTGAGCAGGCATCCGGCACAATAGAGGATCGCCGTCGCGCTGTAGTGGAGAAATACCACCGTAAAGGCGGGCTGGCCCCGTGGCAGATTGAAGCGGTTGCTGCGGCTCTTGGGTTTACTATTCGCGTGAATGTGATCCTTCCTCACCACTGCCTGCGAAGTTGCATGTATCCACTTTATCCGGCGCGTTATCGCTGGATTTTACAGATTGATGTGCTCGGTATTAGTGGCGGGCGTTTTACGTGCATTGATAACGTTATGACGCCTCTGCTGAGTGATCGTGCCCGCGAACTGGAGTGTGTGATGACGAAATACCGGCTGGCCGGAACGGCCTACGATTATATTTATTATGCAGGAGATAACTGATGTTTTATGTCGATAACCCGACAGGCGTTCCGGTCATGCCAGAACCGTCGCCAGTCAGCAGCCTGACCGATTTGTTCTTTACTGAGGGTGGTAACGGCGTACCTCCGACTTATCCGGGGCCTGACTGGTTCAATATCATTCAGAGTGAGCTGATTAATATTGTCAGAGCCGCAGGGCTTGATCCTGACAAAATGGACAATACGCAAATTCTGGCTGCACTTAAAAAGCTGTTCCTGCAACGTCAGAATCCGTTCGGTGATATTAAGTCCGATGGCGCAGTTGCAACGGCTCTCGCAAACCTTGGTTTGGGAGATTTAGCCAAAGCGGGTGTTGGCAATGGGCTGATCGCCACAAATGGATACGCCACACTTCCCGTGATTATTGGCGGAGAAAAAAAGGTTCTTATCATTCAATGGGGAACTACGAGCACAACAGGTAGTGACGGAAAAGCTACTGCTACGTTTCCTGTTTCATTTACGCGAACCCCTTTTTATGTAGGCCTGACAGAAACAACCGGGGAAGCATCCGGCATTGGTTCTGTATGCGTGTGGTCAAAAGAGATCACATCCACCACCACAACCGGTTTTGCGGCGCTGGCGTCAAAACCCTGGGCATCGGCGTTTACAGCCGGGGAAGCTGCAAATTATCTCGCGGTAGGATTCTAATAATGAGCAAATATATATACAGCCCTTCCCATAACGCGTTTTATCTGACTGTATTAAAAACAGAGTATGAACTGTCTGGTAACTGGCCCGAAGATGGTGTGGAAATTAGCGATAATTTATTTATCGAATACACATCGACACCACCTGACGGGAAAGAACGTGGAGTTGGAGACGATGGCATGCCGTGCTGGGTAGATTTACCAGAACCGACAACCGAGGAGTTAATTGCTGCCGCAGAAAGCAAACGACAGCAATTAATCAATGAGGCTAATACCTGCATCAATAACAAACAATGGCCCGGTAAAGCTGCAATTGGCCGCCTGAAAGGCGAAGAACTGTCGCAATACAATCTGTGGCTGGATTACCTTGATGTGCTGGAAGCCGTTGATACATCCAGCGCACCGGATATTAACTGGCCTGTTCCCCCGGAACTGTAGGCCATACGGGTTTTTGTTGTATCAACGCGCATCAGCAAAACCCGGTATTTTATCCATTCAGCCAGTTCTGTGGTTTCCCGTTCTGTCGCGATTCCGGCGTCAACAGCGTCCTGCCTCCAGATTATTTCGGCATCTGCGGTTGCGCGTAACTGGCTTCTTATTGCTTCGGCCTCCGCTATTTCTTCTTCAGGTGATGGAGGGGGAGTGTCTGTTAATACCGGATGATTGCTGGCATCGTTCGACCACACTTTACCAGCAGGTAAATCTCCCGTTTTAAACGGGTGCTCATCATCATCAATCTCAATACAACCAGATAAGTCATGTATGTCCGGCAAAGCCTCATTACAGTCAACCGGATTCCAGTACCACATATTAATATCCCTCCGCAGTGTAATACCCTGTAAATGACGTTCCGTAGGCCGTTGATGGTGATGAGCCAAGCGTTATGTCCATAGTTGCCATAGAAAAGCCCGTCATGCCGATTTGGGCGGGGTTAGCAAACTCAAAATTGTTATTCGTATTCCCGTTCTCAGTGACATAGATCCCGTTAAATTGCGTCGTAAAACTGATCGGATAGTTAACTCTTGCTCCCGTTGCAGGGGTGCTGATTAAAAAGCGGGTTTGAGCTATCCCACGTTTTCCCTACTCGTTTAAAGCCGTCGCTATAAATCTCATACCACCCGTTTGCGTTCTGCCCCTTACTTACGAGGAAACGGGGCGTTTTTGCCAGCTCCCCCAAACCAACGTTTTCGATAATGATCGTTTCTCGCTCAAATGGCATTATTCACGCCATTTAAAGCGAGTTTAACCATGCTTATTGGCTATGTACGTGTGTCAACAAATGACCAGAACACTGCGTTGCAGCGTAACGCGCTGGAGTGTTCAGGATGTGAGCTGATTTTTGAGGACAAGATAAGTGGCAGAACATCGGACAGACCAGGACTCAAACGCGTACTCAGAACGCTATCTGAAGGTGATACTCTCGTGGTCTGGAAGCTTGATCGCCTCGGACGCAGCATGCGGCACCTTGTCATTCTGGTGGAGGAGTTGCGGGAACGGGGAATAAATTTTCGCAGCCTGACCGACAGCATAGACACATCATCACCTATGGGGCGCTTCTTCTTCCATGTGATGGGTGCTCTGGCCGAGATGGAACGTGAACTGATTGTTGAACGTACCCGTGCAGGACTGGCGGCAGCACGAGCGGAGGGGCGTGTTGGAGGTCGAAGACCCAAGTTAACACCTGAGCAGTGGGCGCAGGCTGGAAGGTTGCTAGCGGCCGGTGAGACCCGCCAGAGGGTGGCTCTCATTTATGATGTCGGAATCTCAACTCTGTACAAACGATTCCCGGCGTCTGACAGATAAATCAGGATCGCACGGGCGATCCAATTTTAATTGTCCAAGTGTCGCGATTTGTTTGGCGCGCGGCAGTAGGTGCCATAGTGGCTAAGGAAACCGCCGTAACAGTTCATCAGGCGGGCCACCAGAGAGGCGTAAGGCGAGGAGCGGGTGATGTTGCCGCCGACGATACCGGAGGTGTAGTTAATGTACACCGCCTCGTTACCGTATTTATCGACCGTGTTTTTCAGGCTGTTGCTGATGGTATCCAGCGCTTCTTCCCAACTGATACGCTCAAATTTCCCCTCGCCGCGTTTGCCTGTGCGTTTCATCGGATAGTTGAGGCGATCAGGATGGTTAATTCGTCGACGGATTGAACGTCCGCGCAGGCAGGCGCGCACCTGGTGATTACCATAGACATCTTCGCCGGTGTTGTCGGTTTCCACCCACCAGACTTCATCATCTTTGACGTGGAGACGCAGGGCGCAGCGGCTACCGCAGTTGACCGAACATGCTCCCCAGACCACTTTGTCTTCAACGGGCTGGATTGCCTGCTGCACGGCAGCCGCCGCGGTACGCATCCCGAAAGGCAAGGAGACCCCTCCGGCAGCGAGCGCCAGCGATCCTATCGCGGTGGATTTCACTAACGTCCTGCGGCTGATGCCTCCCTGTTGTTCAACATCGGACATAATTCACTCCATTATAGTTATCGTTATATAATTATGTTTATAACGAAATTAATAGGATACTAATGAAAGAGTGATTTTAACCCCTTATAGGTAGGGGTTATTAATCCATGTCAAAATTGAGGGAATCTTTCTAAGAGGGAGTAAGAGCCTGGTATATCAGGCTCTTATCTGTTACTGAGGTTCGACAGCATGCCCGTCCTGAGAAGGACTGGTACTGGCGGGGGCGTTTCCGCTGCTGGTGCGGGTATACAAAATTTTATGGGTATCGTTTGCACAGTGGCCTACGACCTGGGAATCGGGCTGATCCGCCTGGTCATTAGGAACAATAGTTAACGTAAAACTGGACTCCGCCACACCATTGTTAATGATGCGCTGTTCGATGTCGCTTTTGACTCTTTCACATGACTCCGGCGCCGCCAGTACCTGCGGGGAGGCGACGATGAGCAGACACGCTGCAATTCCTGCTGAGATTTTCATCATTCACTCCTTTTATGAACACGATAATGAAAGCTTAGCATTTCTAGTGTAAACAACTGTATTTGCTAATATGATTGAGAATCATCTCCTTATCCTGGTGAATAACGTGAAGAAACAGACCCTGGTGAGCTTTCTGTGTGTGCTGCTTGTTGGATGTGATAACGCCACTGTTCTGGTCTCCTTTACCCCGGAAATGGCCAGCTTTTCCAATGAGTTCGATTTTGATCCGTTGCGCGGCCCGGTTAAAGATTTCAGCCAGACGTTGATGAACGACAAAGGGGAAGTGACGAAGCGCATCAGCGGAACGCTTTCGCAGGAAGGCTGCTTTGATACGCTGGAACTGCACGATCTGGAAAACAATACCGGGCTGGCGTTGGTACTGGATGCAAACTACTACCGCGATGCCGAGACGATGGAGAAAAGAGTGCGCTTGCAGGGGAAATGTCAGTTGGCGGAGTTGCCCTCGGCAGGCGTGGTCTGGGATACGGACGATAACGGTTTTGTTGTTTCCGCAACGGGAAGAGAGACGAAGGTACAATACCGCTATGATGCGGAAGGGTATCCGTTGGGTAAGACCACCATCAGTAAAGACAAAACGTTATCCATTGATGCAAAACCGTCGGCGGACCCGCTTAAAAAGCTCGACTATACGGCGGTCAGCCTGCTGAACGATCGCCCGTTAGGCAATGTGAAGCAGACGTGTGAATATGATAATTACGCCAATCCGATCAACTGCCAGCTTGTGATCGTTGATGAAAGCGTCGAACCCGCCGTTGCGCGGAGCTATACCATTAAAAATACGATTGATTATTACCCGGCTGCGTCTGACACAGCCGGGTAGCGGCTTACTGTGCGGTAGGCTTCAGCAGGCTTGAGCCAGGCGTTTTGTGATCGTCCAGATATTGCTGCTGGAAAATGCACATACGAATGGTGTTGCGGTATTGGCCATTGATAAAGAACTCATGAATCAGTTCACCTTCAACCATAAAACCTAACTTACGGTAAATGTGAATGGCTTTTTCGTTTTCTTTATCAACGATCAGATACAGCTTATAAAGGTTGAGTACGGTGAATCCGTAGTCCATCGCCAGCTTCGCTGCGCGCGAAGCCAGCCCTTTTCCCTGGTATTCCGGCGAGATGATGATTTGAAATTCAGCCCGGCGGTGAACGTGATTAATTTCTACCAGTTCGACCAGACCGGCTTTTTCGCCGTTGCACTCCACGACAAAGCGCCGTTCGCTCTGGTCATGAATGTGCTTATCATACAGGTCAGACAGCTCGACAAACGCTTCGTACGGCTCTTCAAACCAGTAACGCATTACGCTGGCGTTGTTATCAAGCTGGTGGACGAATCGTAAATCTTCGCGCTCCAGCGGGCGCAACTTAACACTGTTGGCGTGGGTCATTTTGTGTCCTTACCGTCTGTCGCTCAACGCACTGTTATGGCGCAACGGTACGGCCAGTACGACGATCCAGGCAACGCAGCGTATTCGGTTCCCAGTACGCGTTAAGGTTGGCGCTCTGCTCGCACTTGTCGCGGTTATCAAACGCGGCGTCGGCTTTGTCCCACTCTTTCTCCGTGCGCGTGTTCACTTTCTGGCGCAGGCTGCGGGTATCATTCCATTGTTCCTTTTCCATGGCTGCGTGTTGGCGGCTTTGCGCGCTGTCTCCAGATTCAATGATCAGTTTGCTGGTATTCGCGAATGCGGAGGTGGTATACACGACAGCACCCAGCGCAAGCATTGCCGTCAGGCACAGGCGTTTGCTAAGTGTATTGTTCATGGTTAATTCCTTTCATGATAGAGGGATAAAACTTGTCTAAATTCTACACCATTCCTGGCGCAGGGCATACCCACCCTGCGGGTATGGATACGAAGGCGCTATTCTGGCGTATCATGCTTAAACCTGACGTAACGCAATGATGATGATGCTAAAAACAACACTTCTTTTTTTCATAACCGCACTGTGCGAAATCGTGGGCTGTTTTCTGCCCTGGCTATGGTTGAAGCGGGGCGCAACGGCCTGGCTGCTGGTTCCGGCGGGAGTCTCACTGGCGTTATTCGTCTGGCTGCTGACGTTGCATCCGGCCGCCAGCGGCCGCGTCTATGCGGCTTATGGCGGGGTCTATGTCTGTACTGCGCTGTTATGGTTACGCTTTGTTGACGGCGTCAGACTCAGTCTTTATGACTGGTCAGGCGCACTGATCGCACTCTGTGGAATGCTGATTATCGTTGCCGGGTGGGGGCGCGCTTAAGCGCCTTATTTTGTGATCGATGAGCGATTTTTTGATCATTATACTTGTATGGCAGTAGTTCAGGTGTGTAAATTTCCTGCATCGCACAGAAGAGATGTAAGGAACAACAATGAAGATTGTAGGGGCTGAAGTTTTTGTCACATGTCCGGGACGTAACTTTGTCACGCTGAAAATCACCACCGAGGACGGTATCACCGGCCTGGGCGACGCTACCCTGAATGGCCGTGAGCTGTCGGTGGCATCCTATCTGAAAGATCACCTTTGCCCGCAGCTTATTGGCCGCGATGCGCACCGTATCGAAGATATCTGGCAGTTCTTTTATAAAGGCGCCTACTGGCGTCGCGGGCCGGTCACGATGTCTGCCATTTCAGCCGTTGACATGGCGCTGTGGGATATCAAAGCGAAAGCCGCCAACATGCCGCTGTATCAGTTGCTGGGCGGCGCATCCCGCGAAGGCGTGATGGTCTATTGCCACACGACGGGTCACACCATCGATGATGTGCTGGAAGATTATGCGCGTCACAAAGAGATGGGGTTTAAAGCGATTCGCGTGCAGTGCGGCGTGCCGGGTATGCAAACCACCTACGGCATGTCTAAAGGCAAAGGCCTGGCCTACGAACCCGCTACGAAAGGCCAGTGGCCGGAAGAGCAACTCTGTCAACCGAGAAATACCTCGATTTCACGCCGAAACTGTTTGACGCCGTACGTGCTGAATTTGGTTTTTGAAGAGCATTTGCTTCACGACATGCACCACCGCCTGACGCCGATTGAAGCCGCCCGTTTTGGTAAGCGCGTTGAGGATTATCGTCCTGTTCTGGATGGAAGACCCGACGCCTGCGGAAAACCAGGAATGCTTCCGTCTGATTCGTCAGCATACGGTGACGCCGATTGCCGTCGGGGAAGTGTTCAACAGCATCTGGGACTGCAAGCAGCTCATTGA
Protein sequences of DBSCAN-SWA_1 >LR134204|65:22768|16635_17070_+|VEB83094.1|tail|DBSCAN-SWA MSKYIYSPSHNAFYLTVLKTEYELSGNWPEDGVEISDNLFIEYTSTPPDGKERGVGDDGMPCWVDLPEPTTEELIAAAESKRQQLINEANTCINNKQWPGKAAIGRLKGEELSQYNLWLDYLDVLEAVDTSSAPDINWPVPPEL >LR134204|65:22768|12254_13340_+|VEB83066.1|plate|DBSCAN-SWA MESVVLTVDSQQWDGWTEMSITSSLEAIAGEFDLTVTTQWSEASPRVIRQGMPCTVALGSDTVVTGYIDDFIPSYDAENVSIRVTGRDKTGDLVDSSVVHKSGQWKGVRLEKLAEEICKPYGVAVINETDTGEAFPSVALEQGETAFDLLDRLAKQRGVLLTADGLGRLVITRASTKRAGVALVLGKNILAARGRFSWRERNSQYIVKGTTSAGGSTWDEQPAKVTGGRQTIVDDNEINRYRPKILVNEDSLTVGGANTRGEWFKARMLGEANSTEITLAGWRENGDSGPLWQKNQLVDIDDPVQNLKTTWLIKTVTFTEGDNGRICVLTLVPPESMDLPLTDAKKKGKKAKKGKTVTTWD >LR134204|65:22768|8687_8951_+|VEB83046.1|DBSCAN-SWA MKQLRRLHPDDLNLINEKAAALDDMLREVAERGELMPLAAALTHLLINLSQRFDIQRLGQLPLRQLLTLVRQTGETIWPEIALVRKF >LR134204|65:22768|3271_4165_+|VEB83017.1|head|DBSCAN-SWA MQVSAEVLHALTTALSAAFTKGVGRVNPQYRSIATVIPSTGASNTYGWVEDFPTIKEWIGERQLKELAQAGYTITNKTWENSVKVKREKIEDDQIGQYSVIAEQLGRDTTIFPDKLSFELLCKGFDTLCWDGQYFFDTDHPVGTSTASNVIGDPTTDAGEPWFLVDATHALLPIIYQERRPFNFTALDDLTSERVFLQNEFAYGTDGRSNVGFGFWQTCVGSKAALNKANYEAAVSAMMDITDSNSEPLGMNPTLLVVGKNNRGAAKSLIEALMADGGGSNIYYKDVDLLISPYVKA >LR134204|65:22768|13330_13636_+|VEB83070.1|plate|DBSCAN-SWA MGLNPANIGRTLTGLGRRLRLMVDRAVVRIVTDSLGRQNLQIQSLADATNDDVERFQNYGFTSVPPVGSEAIVLAVGGRREGLVAVAVEDKRCRPKGLKDG >LR134204|65:22768|15377_15968_+|VEB83086.1|tail|DBSCAN-SWA MAHSVDEWLGALWQVMPRGKAWSRDNDNDLTRFLRALARRLSQAEFDAERLLPEMRPETTFLLLEEWEEYLELPECEQASGTIEDRRRAVVEKYHRKGGLAPWQIEAVAAALGFTIRVNVILPHHCLRSCMYPLYPARYRWILQIDVLGISGGRFTCIDNVMTPLLSDRARELECVMTKYRLAGTAYDYIYYAGDN >LR134204|65:22768|2307_2595_+|VEB83008.1|DBSCAN-SWA MKPTNTELLALCFQLPELADDALPEWLPMIPAGTFTGRDGRSWINNQPESVIRATLSYPKLPFDIEHATELKGPKGEEAPAFAWLDDYRIRRWRN >LR134204|65:22768|6585_7605_+|VEB83038.1|tail|DBSCAN-SWA MALGNIPDDIRVPLVWIDIDNSMAMSGAPAQSRKILVIGQQVESASAEPLTLNRITGDSMADEYHGRGSMLAEMLKTLRKANSYTETYAMGLADIITGAAATASITVVGDALAAGTLALLINGVSVQVGVAQGDSAETVVQSVITAVTAKTATQVSAVVDGENAASAVLTVNWKGVTGNDCDVRLNYYSGEKTPSGISVTVTPFTGGAGTPDIQAVVAALGDDWYTDIVFPYNDTQSLNTIRDELLERWGPLKMMEAQLWTAFRGTHAQTGTFGSARNDWLISCIGTNISPEPVWLWAASYGGTAAYQLAIDPARPLQTLVLTGIKSPARAVSLGHAGA >LR134204|65:22768|4818_5259_+|VEB83024.1|DBSCAN-SWA MGIYVTRDDLLATDAERVWNMALNKATQQLDEEKIQRAIDDTDAEINSFLAKRYHLPLNLPTLPSPLRRAAVSIAFYWLSERDSQITDEIQKRYDDALRTLREIANGTRDLGVPSDTPVPETDTGKLIIVSDNRRLFTRNNLKGVL >LR134204|65:22768|9450_10956_+|VEB83054.1|tail|DBSCAN-SWA MDLLNQLGKEGGFELKDFAEKGTKIFSAYAGTGRTGPQALKEMGVVMESAMDAVGDKDLAATASFNLLNDLRNPKIAKVLEASGVRLRDMQGNMLPINNIVKDIAQRSGKDGSKRQDERLAKAGFTDYSRLLISSVTTGKGAENFARYNAVVADGSGIMTDAKYAAQDFTSAMSSLNVTWKQFANNNLAKPVQELADAINSMEPAAVQRWLEVGKYLAIAVGGVIAARKAFQIGKGTWDFFNTVRGKNGKGGVAGGIADVFGSGVMPVYVVNMGAGGMGGGITDALGEAGGRGGGLPGRFGRLARGAGKFAGIAGAGVALYDHLESNYRLDGRVDNLTKQVVEDKNASVQERAFAEESQRNRQALANKWKQWFGGDDTPRTKVVDPRPWASMAPVIPAVNFASVPAPSDPKGPTIPQLKSDEHSLWATIADFFKGANTTIESGMPSVAQEDKAPQLPPVPPKLQGEIRVIVEGDARVKSVKMDQPGVTLSAFAGVSNVEQN >LR134204|65:22768|15967_16633_+|VEB83090.1|tail|DBSCAN-SWA MFYVDNPTGVPVMPEPSPVSSLTDLFFTEGGNGVPPTYPGPDWFNIIQSELINIVRAAGLDPDKMDNTQILAALKKLFLQRQNPFGDIKSDGAVATALANLGLGDLAKAGVGNGLIATNGYATLPVIIGGEKKVLIIQWGTTSTTGSDGKATATFPVSFTRTPFYVGLTETTGEASGIGSVCVWSKEITSTTTTGFAALASKPWASAFTAGEAANYLAVGF >LR134204|65:22768|14313_14790_+|VEB83078.1|DBSCAN-SWA MPFDIPALRKLIADGEKDIAIELGLQTLPPVGVEKALNVTFSSQVRDLYDHQSWIKDQIIPSVKADDDTIIEIAASEGVIRKQATFSGGPVIFPGLASIPEDTEMQTSSGVLYLVVASGMPQNGQVMVTVQASDAGVAGNLPEGETMTLPLSCSRRGK >LR134204|65:22768|20419_20980_-|VEB83121.1|DBSCAN-SWA MTHANSVKLRPLEREDLRFVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECNGEKAGLVELVEINHVHRRAEFQIIISPEYQGKGLASRAAKLAMDYGFTVLNLYKLYLIVDKENEKAIHIYRKLGFMVEGELIHEFFINGQYRNTIRMCIFQQQYLDDHKTPGSSLLKPTAQ >LR134204|65:22768|16994_17441_-|VEB83098.1|tail|DBSCAN-SWA MWYWNPVDCNEALPDIHDLSGCIEIDDDEHPFKTGDLPAGKVWSNDASNHPVLTDTPPPSPEEEIAEAEAIRSQLRATADAEIIWRQDAVDAGIATERETTELAEWIKYRVLLMRVDTTKTRMAYSSGGTGQLISGALDVSTASSTSR >LR134204|65:22768|1313_2108_+|VEB83004.1|head|DBSCAN-SWA MRKPERKPDIIPKEALEWLKAKKLKPGFDYRDVWQEEHRYGYTVAKMTQLDLLADVRQLVEDALENGQTFAQFRELLRPLLVKRGWWGQALMDDPLTGETRQVQLGSERRMRVIYDTNMRTARAAGQWERIQRTKRAMPYLVYTLGPSREHRAEHLKWANTCLPVDHQFWITHMVPNGWGCKCNVRQVSLYEFEQMQQNGTITTTAPDVRYVKWVNKRTGEEESIPEGVDPGWAYNPGISRSDALSLQLKQKQQAFDSYTSSQK >LR134204|65:22768|15050_15320_+|VEB83082.1|DBSCAN-SWA MEAYLFRHADPATGNFVGKPGGIEVWPVELHLKPVPLAIRLTPDTQATRQSVEARLLILQQTMAPGQTMGVSALRTAIGTASGVTDYTL >LR134204|65:22768|11843_12233_+|VEB83062.1|DBSCAN-SWA MQANITGFTQVVVLATLLAQAETIAQTTFRTSEEAVSTGDALAVLLAEQAVIAVESGQRELWRTLRDLRFAVVNDVRIRSARLPQTRLLSPTITSSVSLIAWRETGNTENRDTITLRNRLRDPSFILPG >LR134204|65:22768|10939_11626_+|VEB83058.1|DBSCAN-SWA MWSKTDGYDEMGRPARSVVSGVAFYLVDNEGTSGRRAIPRAYPKKEVGWTEDNGAVLTQQQINGKLIGSSYQSQLEDLLRALNTPGPGELVHPWFGIQKVQVGKVNHRLSTQEGGIAYISFEVSEAGERLFPAAAENTSLTVLRGVDKVKAALENGDFFAVLDGLGEMVDTFLDDMEGMVVNLLTLPSAITEWMDRLVVSVVWLMSSLRSLRTSSTKFWGWSVVCMRP >LR134204|65:22768|17880_18435_+|VEB83105.1|DBSCAN-SWA MLIGYVRVSTNDQNTALQRNALECSGCELIFEDKISGRTSDRPGLKRVLRTLSEGDTLVVWKLDRLGRSMRHLVILVEELRERGINFRSLTDSIDTSSPMGRFFFHVMGALAEMERELIVERTRAGLAAARAEGRVGGRRPKLTPEQWAQAGRLLAAGETRQRVALIYDVGISTLYKRFPASDR >LR134204|65:22768|2551_3268_+|VEB83012.1|DBSCAN-SWA MHGWTITASDDGVIEAHVEWTADGAALVRGKKYRYYSPAFRFTADGQVTRLSSAGLTNKPNLDLPALNSEENTMTVPVQIVTVLGLAATASADDAVKAIQQLKTSEQVALNRAENPDLTKFIPVETHQLALNRAETAEAQLSAIAIKEAEALVDGAIEAGKVAPANREMYLATCRSEDGRKQFAEFVKGAPVIVSKDPSDKKDPGGDGNTTLSDEDLAMCRQMGITQEEFLSVRKQEK >LR134204|65:22768|21937_22768_+|VEB83133.1|DBSCAN-SWA MKIVGAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLKDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHTIDDVLEDYARHKEMGFKAIRVQCGVPGMQTTYGMSKGKGLAYEPATKGQWPEEQLCQPRNTSISRRNCLTPYVLNLVFEEHLLHDMHHRLTPIEAARFGKRVEDYRPVLDGRPDACGKPGMLPSDSSAYGDADCRRGSVQQHLGLQAAH >LR134204|65:22768|65_938_+|VEB82996.1|portal|DBSCAN-SWA MEAASDDARDIDIADAVRNLITQPQIPELLFDLLDGLGKGVGVCEILWDTSNQFWQPRDYEWVDPRFLKPDRETLRDFRLLTDASPIEGEPLTPGKYVVHQPRLKSGLPLRNGLARLVAVMYMLKSYTVRDWWAFAEKFGIPVVVGKYGNNASPEQIQTLLEAIASLASDAGCAIPASMTLEMQETASRNNGGALFKEMAVWCDEQISKAVLGQTMTTDNGSSRSQADVHDRVRMDIARWDARQLENTLNEFLVRPFVIMNYGPQDSYPRVVLRLSEPEDLKMLVDALTL >LR134204|65:22768|19268_19574_-|VEB83113.1|DBSCAN-SWA MKISAGIAACLLIVASPQVLAAPESCERVKSDIEQRIINNGVAESSFTLTIVPNDQADQPDSQVVGHCANDTHKILYTRTSSGNAPASTSPSQDGHAVEPQ >LR134204|65:22768|17647_17851_-|VEB83102.1|DBSCAN-SWA MPFERETIIIENVGLGELAKTPRFLVSKGQNANGWYEIYSDGFKRVGKTWDSSNPLFNQHPCNGSKS >LR134204|65:22768|4249_4819_+|VEB83020.1|DBSCAN-SWA MSGKTEKKSVKPAAGSDTTTAQKDIKGTQDTTVKTDPASLAPVTSADGQTSQSTVTATDGEAGPESDATATIAGVTPSPETDHSHLQDAGVNTVMSALCQSVSDGVHISGDVTVLEVRAIPEGGFHRAGRFWPHDPVHVFVSDDPDEQVLEDGSGQPLYGCVISTADASRLNREKMLVVTELKPKAEES >LR134204|65:22768|19644_20415_+|VEB83117.1|DBSCAN-SWA MIENHLLILVNNVKKQTLVSFLCVLLVGCDNATVLVSFTPEMASFSNEFDFDPLRGPVKDFSQTLMNDKGEVTKRISGTLSQEGCFDTLELHDLENNTGLALVLDANYYRDAETMEKRVRLQGKCQLAELPSAGVVWDTDDNGFVVSATGRETKVQYRYDAEGYPLGKTTISKDKTLSIDAKPSADPLKKLDYTAVSLLNDRPLGNVKQTCEYDNYANPINCQLVIVDESVEPAVARSYTIKNTIDYYPAASDTAG >LR134204|65:22768|21014_21356_-|VEB83125.1|DBSCAN-SWA MNNTLSKRLCLTAMLALGAVVYTTSAFANTSKLIIESGDSAQSRQHAAMEKEQWNDTRSLRQKVNTRTEKEWDKADAAFDNRDKCEQSANLNAYWEPNTLRCLDRRTGRTVAP >LR134204|65:22768|5796_6405_+|VEB83031.1|DBSCAN-SWA MSERPAFVTLGSTVSAAENIVAWLKTELEGNTPDRVEIVERHVGQFSTPDEVKRYLSGRSGCVRLAALRVRNISNRNGMTGLVTWAAYVMTSDSWGYARDARCEVLAGKIARRISVREAPRAMKAERMAENIGAENIYSGRLDNFGVSLWAVTWEQVFRLDDEIDMAALPEFLRLGASFVVNGQPVTEEPDIINVREGQTDE >LR134204|65:22768|5258_5759_+|VEB83027.1|DBSCAN-SWA MGITVEVTGAEKLQTIRKAMEKLADSSLRQELLESIGAVAESQTRRRIASEKSSPAGAKWQDWSDNYAKTRHGNQSLLQGNGDLLDSIQYFVSGERVHIGTPLPYGKTHQEGFSGSVAVSSHKRLITQAFGRALKHGVWQTVGRISARWTSRSASSSVCLRITVTS >LR134204|65:22768|18461_19097_-|VEB83109.1|DBSCAN-SWA MSDVEQQGGISRRTLVKSTAIGSLALAAGGVSLPFGMRTAAAAVQQAIQPVEDKVVWGACSVNCGSRCALRLHVKDDEVWWVETDNTGEDVYGNHQVRACLRGRSIRRRINHPDRLNYPMKRTGKRGEGKFERISWEEALDTISNSLKNTVDKYGNEAVYINYTSGIVGGNITRSSPYASLVARLMNCYGGFLSHYGTYCRAPNKSRHLDN >LR134204|65:22768|961_1321_+|VEB83000.1|DBSCAN-SWA MSEVRDKFGLSEPEKGAALLTPSSQALQPALAINRERLALNRNQQDDIDLMVADAMQDWQRTGDAFTSPVLQLAKDADSFDAFLAGLPELQKTLEPDEFVTQLAQLCFKARALGDVNDA >LR134204|65:22768|8917_9025_+|VEB83050.1|DBSCAN-SWA MAGNRLSTEILINLAGNLQAKARQYGASMSEFCQP >LR134204|65:22768|13799_14324_+|VEB83074.1|DBSCAN-SWA MVGEKSFLEHFHIDGDGEKTSEPKWTIGITWNNQLSRGELTVTHDGLTLDEGLVTLVLICLFTDTRADDDDVIPDNTGDPRGWPGDTFSAYPWGSKLWLLDREKLTETVRQRVEDYASLAMQPLLRSGYARTASVTAVISGADRINFIVILSRPDKTQLRIEISKRWEATEHAL >LR134204|65:22768|6397_6586_+|VEB83034.1|DBSCAN-SWA MNKKLIKPARPGLRVRKADGSLLNADGEILAVAAYWRRRESEGDVVITAPSKPKSGKADKEA >LR134204|65:22768|21492_21828_+|VEB83129.1|DBSCAN-SWA MMMMLKTTLLFFITALCEIVGCFLPWLWLKRGATAWLLVPAGVSLALFVWLLTLHPAASGRVYAAYGGVYVCTALLWLRFVDGVRLSLYDWSGALIALCGMLIIVAGWGRA >LR134204|65:22768|8075_8432_+|VEB83042.1|tail|DBSCAN-SWA MSSILGMAAIRINGREIKTEGKSTLNPGGYQRQQHMGAGKIWGISRKTAAPSIKLTIAADQDVDVIEISQWEDVTVMFYGDNGLNYMMTKAATDNPAELDEDAGTVTANFNRRSVWKV |
36 | Vibrio_phage(72.0%) | head,tail,plate,portal | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
174908 : 182759
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >LR134204|174908:182759|DBSCAN-SWA AATGAAAAGAAAAGTACTGGCTCTCGTAATCCCTGCCCTGTTAGCCGCAGGCGCGGCCCACGCCGCTGAAGTTTATAATAAAGACGGCAATAAATTAGATCTCTATGGCAAAGTAGACGGTCTGCATTATTTCTCTGATGACGCAAATTCTGACGGCGATCAAACATACATGCGTATGGGTTTCAAAGGTGAAACTCAGGTTAACGATATGATCACGGGTTACGGCCAGTGGGAATATCAGGTTTCAGGCTAACGGTACCGAAGGTGATAAAGGTGACTCCTGGACTCGTCTGGCGTTCGCAGGTATTAAAGTCGGTGACTATGGTTCATTCGACTACGGCCGTAACTACGGCGTACTGTACGACGTGGAAGGCTGGACCGATATGCTGCCAGAATTCGGCGGCGACTCTTATACCAAAGCAGACAACTTCATGACCGGTCGTGCTAACGGCGTAGCAACCTACCGTAATACCGACTTCTTCGGTCTGGTGGATGGTCTGAGCTTTGCGCTGCAATATCAGGGTGCGAATGAAAACCAGGTTACCAACGAGCAGGAAGGTACGGGCAACGGCGGCGACCGCGATGTGAAAAACTCCAACGGTGACGGCTTCGGTATCTCCTCAACTTACGATCTGGGTATGGGCGTCAGCTTCGGTGCGGCATATACCACTTCCGACCGAACGAATCAGCAAGCAAATCACTCTACTGCTGGCGGTGATAAAGCAGATGCGTGGACTGCTGGCCTGAAATATGACGCTAACAACATCTATCTGGCGACCATGTATTCTGAAACACGTAACATGACGCCATATGGTGGAAAAGATAGCTGGAGTGATAGCACCATTGCCAACAAAACCCAGAACTTTGAAGTTACAGCGCAGTACCAGTTCGACTTCGGTCTGCGTCCTGCCGTATCTTTCCTGATGTCTAAAGGTAAGGATCTGGCCGGCGGTGAAGATAAAGACCTGGTTAAATATGCTGACGTAGGCGCGACCTACTACTTCAACAAAAACTTCTCCACTTATGTTGATTACAAAATCAACCTGCTGGATGAAGATGACAGCTTCTACACTCGCAACGATATGTCTACTGACGACGTCGTTGCTCTGGGTATGGTTTACCAGTTCTAAGTCCGTCAGCCCGCTGGCAACGGCGGGCTTTTCTCATAACAACAGTGTTTTATCTATACTAAAACAATATATTTGTTGCGCTAAAACTCTCCATTGATAGATGATATATTGCATAAAGATCTGCTGTAGAGGACGCCGGCAATGAACCATCATCCCGTTAAATCATCCCGTATTGCATCCGTCGGCTATGACGAAACCTCCAGCACGCTGGAAATTCGTTTTCATCGAACCGGTACTCTGCAATATCTCGGCGTCCCGTCGCGTATTTTTCGTGATTTTCTTGTCGTGGTTTCTAAAGGCCGCTTTTATGACGGCGTGATCAAAGGGAAATTTCCAGAGCAGAAACCAAACTCACGCTAAAATGTGACTCCTGTCATTGTACATAAAGTGCCATTAAGGGTTAAACTTTAATCGGATACTTAAACAGGAGGTTTTATGAGCAGAACGATTCTTGTGCCCATCGATATTTCAGATTCAGAATTAACTCAACGCGTTATCACCCATGTTGAAGCCGAGGCAAAAATAGATGACGCAGAGGTCCATTTCCTGACCGTTATTCCGTCACTGCCCTATTACGCCTCTTTAGGCCTGGCCTACTCAGCAGAGTTACCGGCTATGGACGATCTGAAAGCCGAAGCAAAATCTCAACTGGAAGAGATTATCAAGAAGTTCAGTATTCCTGCCGACAGAGTACAGATTCATATCGCGGAAGGTTCTCCGAAAGATAAAATTCTGGAAATGGCTAAAAAATTACCGGCAGACATGGTGATTATTGCTTCTCACCGCCCGGACATCACCACGTATCTGCTGGGCTCCAACGCCGCAGCGGTCGTACGCCATGCCGAATGTTCCGTCCTGGTGGTTCGCTAAACGCATCAGCCCGCACGCCGTTGCGGGCTTTCATATCCACTTTTCAACCCTGCACGCGCTTACCTGCAACCTGAGGTATAACGGGTATAAATGTGCTGACATTCTTCTCGCCAATCCGTACCATACACGCCACAGTTTTTATATCAGATTTCTTTTGCCGGTCTTTCCGGCGTGATGCATACTTTTCTGATGCCACACAATGAATTGAGCCTCTATTAAATGTCGCAAAATCAAGATATTAGCAAGAAAGAACAGTACAACCTGAACAAATTACAAAAACGTCTGCGTCGTAACGTGGGCGAAGCGATTGCTGACTTCAATATGATCGAAGAAGGCGATCGCATCATGGTGTGCCTTTCCGGCGGGAAAGACAGCTATACCATGCTGGAAATTCTGCGTAATTTGCAGCAAAGCGCACCCGTCAACTTTTCACTGGTCGCCGTTAACCTTGACCAGAAGCAACCGGGTTTTCCCGAACACATTCTGCCGGAATATCTTGAGAACCTGGGCGTAGAGTATAAGATCGTTGAAGAAAACACCTATGGGATCGTTAAAGAGAAAATTCCTGAAGGCAAAACGACCTGCTCTCTGTGCTCCCGACTGCGCCGCGGTATTCTCTACCGCACCGCAACAGAACTGGGTGCAACCAAAATTGCGTTAGGCCACCACCGCGACGATATCCTGCAAACGTTATTCCTGAACATGTTCTACGGCGGGAAAATGAAAGGGATGCCGCCGAAGCTGATGAGCGATGACGGCAAACACATCGTTATCCGCCCGCTGGCCTACTGTCGCGAAAAAGATATTGAGCGTTTTTCAGAAGCCAAAGCGTTCCCGATCATTCCGTGCAACCTGTGCGGGTCGCAGCCAAACCTGCAACGCCAGGTCATCGCCGACATGCTGCGCGACTGGGATAAGCGTTATCCAGGGCGTATTGAAACCATGTTCAGCGCGATGCAGAACGTGGTGCCTTCTCATCTGAGCGATATTAATCTTTTTGATTTCAAAGGCATTAATCATGATTCCGACGTAGTAGACGGTGGTGATTTAGCCTTTGACCGCGAAGAGATCCCTCTGCAACCCGCAGGCTGGCAGCCAGAGGAAGACGATAATCAGCTGGATGAACTGCGTCTGAACGTCGTGGAAGTGAAGTAATTTCTGACTCAGCCGGATGGCGACGTAAACATCTTATCCGGCCTACGATGCGCATAAGCACCTGTAGGCCGGAGCGGGCATTTATTTCAATAAACGAACCCGACAGTTCTTCCCTTTTATCTTTCCGTTCTGCAATTGCTTCCAGGCTTTCTGCGCCACTGGCTGACGAACCGCCACATAGACATGGGCCGGATGTACGGTAATTTTGCCAATATCAGCGCTATCCAGACCAATATCTCCCGTTAATGCTCCCAGCACGTCGCCCGGGCGCATTTTTGCCTTTTTACCGCCATCAATACACAGCGTCGCCATTTCAGCTTCCAGCGGTGCAACCGGAACGTTGCCCGGCGGCGTCAGCCAGTTCAGTTTAAGCTGCAACATTTCAGACAGGATATTGGCGCGCTGCGCCTCTTCCGGCGCGCAAAAACTGATCGCCAGACCGCTGTTGCCTGCACGGGCGGTACGACCAATACGGTGTACGTGAACTTCCGGGTCCCACGCCAGCTCGAAGTTTACGACCAGCTCCAGCGACTTGATATCCAGCCCACGCGCGGCGACGTCCGTCGCTACCAGTACGCGCGCGCTGCCGTTTGCAAAACGAACCAGCGTCTGGTCACGATCGCGCTGCTCAAGGTCGCCATGCAGCGATAATGCGCTCTGCCCGACAGAATTGAGCGCATCACAAACCGCCTGACAATCTTTTTTGGTATTACAGAAAACAACGCATGACGCAGGCTGATGCTGGCTCAGCAGCTTTTGCAGCAGCGGGATTTTCCCCTGCGAAGATGTTTCAAAAAACTGCTGCTCGATAGATGGCAGCGCATCCACCGAATCAATTTCAATCGTCAGTGGATTTTGCTGAACCCGACCGCTGATCGCCGCGATGGCCTCAGGCCAGGTGGCGGAAAACAGCAGAGTCTGACGGCGTGACGGCGCAAAGCGAATCACCTCGTCAATGGCGTCGCTAAAGCCCATATCCAGCATCCTGTCGGCTTCATCCATCACCAGCGTTTGCAGGGCATCCAGAGAAACGGTGCCTTTTTGCAGATGGTCAAGCAGGCGACCCGGCGTCGCGACAATAATATGCGGCGCATGCTGGAGAGAGTCTCGCTGCGCGCCGAAAGGTTGCCCGCCGCACAACGTCAGGATTTTGGTATTAGGCAGAAAACGCGCCAGACGACGCAGCTCCCCTGCCACCTGATCGGCCAGTTCGCGCGTAGGACAAAGCACCAGCGACTGGGTCTGGAACAGACTGGCATCAATATGCTGTAAAAGCCCGAGACCAAATGCAGCCGTTTTTCCGCTGCCGGTTTTTGCCTGCACGCGCACATCTTTTCCGGCCAGGATCGCCGGTAACGCAGCGGCCTGGACAGGCGTCATCTCAAGATAACCCAACTCGTTAAGATTCTCGAGTTGGGCGGCAGGCAGGACATTCAGAGTAGAAAAAGCGGTCACAATTTAATCTCGCGGTAAAAGACACACAGTCAGCAGGCGCGTATCCTCGCAGATCTCCGCTCTTGATGCGACAATTTAATCGGTTCTTCATCCGGCGGCGGATCTGGCATAGGTTGTGGGCGCGGGATAGGATCGGGAATCGGCACCGGATCGGTTGGTACGGAATCCGACTGGCGCGTTTGCATTTGCAACAACGTGAACAGGCTGGTCATATTACCCTCCGGTCATACAGATTGACTCTTTTAGGGTAGACGCTGATATCCCGCAGGCAAAAAAAAGCCGACTAATCTAAGTCGGCGTCGTACGAATCAATTGTGCTATGCAGTAATTCAAAAAAGGAAGTAAGACAATATGGAGCGCAACGCCCATCGCTTGACGTTGCATTCACCTGCGAGAGAGATATTGCCCCGAATGGGTAGATTGTTTATTGACTTCGCTCAAATTATGCGGCGTTTTTCTGCTCAAAGGACGATGAAAAGCGTTGTTGTTACAACCATTTACTACGATGCAACCATAAGGTAACACCACCAATCAGGACAACTAACAGAATACAGAACATAGAAAATCCAAAATGCCATGCGCCGCCCGGAATTCCCCCAAGATTCACGCCGAACAAGCCGGTTAAAAATGTACTGGGCAGAAAGACCATCGCCATCAGCGACATCGTATAGGTTCTGCGCGCTAAGGACTCCTGCATAACTTGCGCGATTTCATCGGCCATCACGCCCGTGCGCGCAATACACGCGTCGATCTCATCCAGCCCCCTTCCCAGACGATCCGCGATATCCTGCATTCGACGCCGCTGATCATCATTCATCCAGGCCAGACGCTCGCTTGCCAGTCGCGCATAAACATCACGCTGAGGCGCCATGTATCGACGCATCACAATCAGTTGCTTACGCAGTAACGCCAGGAATCCACGCGGCGGGATCTGCTGATCGAGCAGATTGTCTTCCAGGTCGATAATTTTATCGTGTAATTGCTCGATAAATTCGCTGGCGTGATCCGTCAGGGCGTCACACACGTCGACCAGCCAGCCGCCGCAATCGGACGGGCCCGTCCCTTCCTGCAAGTCGCTAACCACCTCATCAAGCGCCAGAACTTTTCGCTGTCGCGTTGAGACGATAAAACGTTCATCCATATATACACGCATCGCCACTAGCTGATCAGGACGTTCATCGGTACTGCCGTTGATGCAGCGTAAGGTGATCAGCGTGCCATCCCCCATCCGACTGACGCGTGGGCGCGAACTCTCGCCAGCCAGCGCATCACGTACAGAATTCGGTAACAAAGGCGTCGACGCCAGCCACTGCGCGCTGTCAGGGTGGGTGTAATTGAGATGCAACCAGCAGGGATGCTGGCTGTCGATAATATCGTCATTTTCAAGCGGCTTAATGCCGCCGCGACCATCCAGTAGCCAGGCAAAAACGGCATCCGGCACATTGACTTCCGATCCCTTGATGGCTTCCACAACGCCTCCACATTCATTCAACGCTTTTTCGTTAGTCTAGCCTTCGGCGTATGTTAAGCAATACCTCATTATCCCATTGCGCTGAAATGATTCAGAAAAGTAAGGTATGGCAGGAAAGGCGCCGGCATCACCGGCGCAAATGTGTTGTGCGAGACAATATGGGGTGTTCAACGGATGATAATTTACTTAATCGAAATTTATGAATTTTAGTCTGTAGATCGATAACCTTGCGTTGCATCATCAGCTTTAATATGTCGTACGGGAAAGGGTTAACCTCTTTCAGAATGATTAGCCTGAGTAATGTTTATATAAGTCGCCATCAACCTTTCTTCCTTTTTACTCGTTCACACTTATATCGACCATTGAAAAAAAACTTATCATTCGTGGATTTGTCCTGGTTCCCTGCTAGATTTATCTGCACGTCTATTGTTGTCTCCCGATTTGTGAAAAAAACGCTCCATATCACTGTAAGCTGTTAATAAAAAAGCCATTCACCAATAAACCGGCAACACAGCATGTGAAGATAACGGGATTATCGATGATTGCTCACGAACTCAATGCTCTGGATTTATTAAGTTTTCCTGTCTGGATTGTTTTACCACACACAGAAGAGTTAGTTTTCGCTAATACCGCCGCACGCGAACTGACACAGGAACAGACTTTCAGCCGTCTCAGAAAGGGGATATTTTCAACTTATGCGCAAAATGAACTGCGCATGTACGTTGCTGATTTACACCACCATCACGATATTGTTGAAATTCTGACGGTCTGTCGTGATGGTAAAGAAATCGCCTTAACCTGCCGCCTCTCGGTTAAAACCCTGTCGACGCAGGGAGACGTTATTATTTTTGAAGGCATAGAGACACCTACGGCGCAAGGTCTCAAGGCCAGTCGTTCGGCAAACTATCAGCGTAAAAAACAAGGGTTTTACGCCCGTTTTTTCCTGACGAACTCCGCGCCGATGCTGTTGATAGACCCTTCGCGGGACGGTCTGATCGTCGATGCCAATCTGGCCGCACTCAATTTCTATGGCTACAGCCATGAAGCGATGTGCCAGAAGCATACATGGGAAATCAACGTATTAGGTCGCCAGATCTTACCCGTTATGCATGAGATTGCCCGTCTGCCCGGCGGTCATAAGCCCTTAAACTTCGTCCATAAACTCTCTGACGGCACCACCCGGCACGTACAAACCTATGCGGGCCCCCTTGAGATATACGGCGATAAGCTCATGTTATGCATCATTCATGACATTACCGAGCAAAAACGGCTGGAGCAGGAGCTGGAACGCGCTGCCCTGCATGATGCGCTGACTGGCTTACTGAACCGCAGGCAATTTTATCTTCTTACTGAGCAAAACCATACGCCCCATCTGTCGATGACAATGGATTACAGCCTGCTGCTAATCGATACCGATCGCTTCAAAAGCATTAACGATTTGTACGGTCATTTGAAAGGCGATGAAGTGCTGTGCGCTCTGGCGCGAACGCTGGAAGCCCGCGCACGTAAAGGCGACCTGGTCTTTCGCTGGGGCGGTGAAGAATTTGTCTTGCTGCTGCCGCGAACCTCGCTGGAAGTTGCACTTAATCTTGCAGAATCAATACGCGCCGCCGTGGCGAAAGTCTGTATTCCTGGCCTGCCCCGTTTTACCGTCAGCATCGGCGTCGCACGGCACGAATCGAATGAAAGCATTGATGAATTATTTAAGCGGGTGGACGATGCCCTGTACAAAGCCAAAAACGACGGCAGAAATCGGGTGCTGGCGGCGTAA
Protein sequences of DBSCAN-SWA_2 >LR134204|174908:182759|177126_178062_+|VEB83819.1|tRNA|DBSCAN-SWA MSQNQDISKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPVNFSLVAVNLDQKQPGFPEHILPEYLENLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIERFSEAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIETMFSAMQNVVPSHLSDINLFDFKGINHDSDVVDGGDLAFDREEIPLQPAGWQPEEDDNQLDELRLNVVEVK >LR134204|174908:182759|175290_176037_+|VEB83807.1|DBSCAN-SWA MLPEFGGDSYTKADNFMTGRANGVATYRNTDFFGLVDGLSFALQYQGANENQVTNEQEGTGNGGDRDVKNSNGDGFGISSTYDLGMGVSFGAAYTTSDRTNQQANHSTAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPYGGKDSWSDSTIANKTQNFEVTAQYQFDFGLRPAVSFLMSKGKDLAGGEDKDLVKYADVGATYYFNKNFSTYVDYKINLLDEDDSFYTRNDMSTDDVVALGMVYQF >LR134204|174908:182759|179546_179729_-|VEB83827.1|DBSCAN-SWA MTSLFTLLQMQTRQSDSVPTDPVPIPDPIPRPQPMPDPPPDEEPIKLSHQERRSARIRAC >LR134204|174908:182759|176178_176397_+|VEB83811.1|DBSCAN-SWA MNHHPVKSSRIASVGYDETSSTLEIRFHRTGTLQYLGVPSRIFRDFLVVVSKGRFYDGVIKGKFPEQKPNSR >LR134204|174908:182759|174908_175160_+|VEB83803.1|DBSCAN-SWA MKRKVLALVIPALLAAGAAHAAEVYNKDGNKLDLYGKVDGLHYFSDDANSDGDQTYMRMGFKGETQVNDMITGYGQWEYQVSG >LR134204|174908:182759|176472_176907_+|VEB83815.1|DBSCAN-SWA MSRTILVPIDISDSELTQRVITHVEAEAKIDDAEVHFLTVIPSLPYYASLGLAYSAELPAMDDLKAEAKSQLEEIIKKFSIPADRVQIHIAEGSPKDKILEMAKKLPADMVIIASHRPDITTYLLGSNAAAVVRHAECSVLVVR >LR134204|174908:182759|180004_180988_-|VEB83831.1|DBSCAN-SWA MEAIKGSEVNVPDAVFAWLLDGRGGIKPLENDDIIDSQHPCWLHLNYTHPDSAQWLASTPLLPNSVRDALAGESSRPRVSRMGDGTLITLRCINGSTDERPDQLVAMRVYMDERFIVSTRQRKVLALDEVVSDLQEGTGPSDCGGWLVDVCDALTDHASEFIEQLHDKIIDLEDNLLDQQIPPRGFLALLRKQLIVMRRYMAPQRDVYARLASERLAWMNDDQRRRMQDIADRLGRGLDEIDACIARTGVMADEIAQVMQESLARRTYTMSLMAMVFLPSTFLTGLFGVNLGGIPGGAWHFGFSMFCILLVVLIGGVTLWLHRSKWL >LR134204|174908:182759|178143_179517_-|VEB83823.1|DBSCAN-SWA MTAFSTLNVLPAAQLENLNELGYLEMTPVQAAALPAILAGKDVRVQAKTGSGKTAAFGLGLLQHIDASLFQTQSLVLCPTRELADQVAGELRRLARFLPNTKILTLCGGQPFGAQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALQTLVMDEADRMLDMGFSDAIDEVIRFAPSRRQTLLFSATWPEAIAAISGRVQQNPLTIEIDSVDALPSIEQQFFETSSQGKIPLLQKLLSQHQPASCVVFCNTKKDCQAVCDALNSVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDVAARGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEAQRANILSEMLQLKLNWLTPPGNVPVAPLEAEMATLCIDGGKKAKMRPGDVLGALTGDIGLDSADIGKITVHPAHVYVAVRQPVAQKAWKQLQNGKIKGKNCRVRLLK >LR134204|174908:182759|181526_182759_+|VEB83835.1|DBSCAN-SWA MIAHELNALDLLSFPVWIVLPHTEELVFANTAARELTQEQTFSRLRKGIFSTYAQNELRMYVADLHHHHDIVEILTVCRDGKEIALTCRLSVKTLSTQGDVIIFEGIETPTAQGLKASRSANYQRKKQGFYARFFLTNSAPMLLIDPSRDGLIVDANLAALNFYGYSHEAMCQKHTWEINVLGRQILPVMHEIARLPGGHKPLNFVHKLSDGTTRHVQTYAGPLEIYGDKLMLCIIHDITEQKRLEQELERAALHDALTGLLNRRQFYLLTEQNHTPHLSMTMDYSLLLIDTDRFKSINDLYGHLKGDEVLCALARTLEARARKGDLVFRWGGEEFVLLLPRTSLEVALNLAESIRAAVAKVCIPGLPRFTVSIGVARHESNESIDELFKRVDDALYKAKNDGRNRVLAA |
9 | Enterobacteria_phage(33.33%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1359487 : 1366959
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >LR134204|1359487:1366959|DBSCAN-SWA CTCACAATAGTCGAACGTCCACGGCATCCATTCCCGCCGCGCGTGCTGCCTGCAAGCCGAAATCCGCATCCTCAAACACGACGCATTGCGTCGGCACAACGCCCATACGCTCCGCACACAGCAGGAACGTGTCCGGCGCGGGTTTGTGGTGCAGAACATGATCCGCAGCGACAACTGCATCAAAATAACGACGCAGCCCCAGATGGGTCAATAGCGCTTCGGCAATTGCGCTTTCGCTGCCCGTTCCCACCGACAGAGGTCGACGGCCATACCAGGCTTTGACCACCTCAACCAGCGGCAGAGGCTCTACGCTATCCAGCAACATGCTTTTCACTGCATCAGTTTTTTCTTGCGCCAGCGCATGAGGGTCAAGATCGGCGTGATTCAGTTCGATAATCGACTGCGCGATACGCCAGGTTGGAGAACCGTTAAGCGCCACCATCGCTTGCAGATCAAAACGAAGGCCATAGCGCCCTAATACCTCGTCCCACGCCTTACGGTGCGTCGGCTCGGTATCCAGGATGGTGCCATCCATATCAAAAATCAAACCCGCATAACGTTCGTACATCGTGTTCTCACAAACCAAATGACCAGATGTTACTTTATCGTAATTGACTGATTTTGTCGCTGACAGAGAAATGAGGATTGTTCAGGGTAGAGAGGTAATGGAGGTATTCAGGAGAAGATGGTGCATCCGGGAGGATTACTCGGCTGCGCCTCGCCCTACGGGCCGTTGCTGAAGCAACGTTATCCTCCCTGGTGCTTGCGATAGCTTCGCAGACCATGAAACAACAGCATAGCTGTTATAACGGAGAAGATGGTGCATCCGGGAGGATTCGAACCTCCGACCGCTCGGTTCGTAGCCGAGTACTCTATCCAGCTGAGCTACGGATGCATCGGGAAACTTATTTACTGCCAGTATTGATACCGCTACAAAAGCCGTATCAAATAAGAGATGGTGCATCCGGGAGGATTACTCGGCTGCGCCTCGCCCTACGGGCCGTTGCTGAAGCAACGTTATCCTCCCTGGTGCTTGCGATAGCTTCGCAGACCATGAAACAACAGCGTAACTGTTATACCGGGGAAGATGGTGCATCCGGGAGGATTCGAACCTCCGACCGCTCGGTTCGTAGCCGAGTACTCTATCCAGCTGAGCTACGGATGCATTGGGAAACTTATTTACTGCCGATATTGATACCGCTACAAAAGCCATATCAAGTAAGAGATGGTGCATCCGGGAGGATTACTCGGCTGCGCCTCTCCCTACGGGCCGTTGCTGAAGCAACGTTATCCTCCCTGGTGCTTGCGATAACTTCGCAGACCATGAAACAACAGCGTAGCTGTTATATCGGGGAAGATGGTGCATCCGGGAGGATTCGAACCTCCGACCGCTCGGTTCGTAGCCGAGTACTCTATCCAGCTGAGCTACGGATGCATCGGGAAACTTACTTTATTGCAGATTTTGATACCGCTACTAAAGCCATATCAAGTAAGAGATGGTGCATCCGGGAGGATTCGAACCTCCGACCGCTCGGTTCGTAGCCGAGTACTCTATCCAGCTGAGCTACGGATGCAAATGGCGGTGAGGCGGGGATTCGAACCCCGGATGCAGCTTTTGACCGCATACTCCCTTAGCAGGGGAGCGCCTTCAGCCTCTCGGCCACCTCACCACGCGCCTCTTACGAGTGCTTCGAAGAACTTGTTTCTGCTCATCGTCGCTGCGTGCGCACATATTACTTTCTGGGACTTATAAGTCAAACAATTTTTCCCGCGCTTTTATCGTTTGCACACTTCACGCTCAATTAGTCTGCAAAAACGGCAAAAAGGGTGTTTTATCAACAGATAAATCGGCGTGTTACACCGCCTGAGCGCAGGCAGAAAAACAGAATAACCAGGAAAGAGCCTGCCGGCACGGGAAGCGATACGGAAAAAGAGTGAAGAAAAATGAAAAAGAGAAAGGACGCGAGGAGCAGGTTGCGCCTCATCGGAAGAGATGAGACGCGAAAACCTTAGTAACTGGACTGCTGGGATTTTTCAGCCTGGATACGCTGGTAGATCTCCTCACGATGGACAGAGACTTCTTTCGGGGCGTTTACGCCAATGCGCACCTGGTTGCCCTTTACCCCTAAAACTGTCACGGTGACCTCATCTCCAATCATGAGGGTCTCACCAACTCGACGAGTCAGAATCAGCATTCTTTGCTCCTTGAAAGATTAAAAGAGTCGGGTCTCTCTGTATCCCGGCATTATCCATCATATAACGCCAAAAAGTAAGCGATGACAAACACGTAAAGTGTAAGCAGTCACGGCATCACATTCTGTTAAACCTAAGTTTAGCCGATATACACAACTTCAACCTGACTTTATCGTTGTCAATAACGTTGATGCAAACGCCGCAGACCGGGGCCTGCGGCGTTAGATAACGCTTATAGTTATTACAGTTTTGCGCTAACCCAGCCTTTCACACTGGCTAACGCTGCTGGCAGAGCAGCCGCATCCGTACCACCGGCTTGCGCCATGTCTGGACGTCCGCCGCCCTTGCCGCCCACCTGCTGAGCGACCATGCCGACCAGCTCCCCTGCTTTCACGCGGTCGGTCACATCCTTCGACACGCCCGCAATCAGAGAAACCTTACCTTCCGCTACCGTTGCAAGAACGATAACGGTAGAACCCAGTTGATTTTTCAGATCATCAACCATCGTACGCAGCATTTTTGGCTCAACGCCCGCAAGCTCGCTAACCAGCAGCTTCACGCCGTTGATATCAACCGCTTTGCTGGAAAGGTTCGCACTTTCCTGCGCGGCAGCCTGTTCTTTCAACTGCTGCAACTCTTTTTCAAGCTGACGCGTACGTTCCAGTACAGAACGGACTTTTTCGCCCAGATTCTGGCTGTCGCCCTTCAGCAACTGTGCGATATCGCTTAAGCGATCGCTTTCAGCGTGCAGCGTAGCAATCGCGCCTTCGCCCGTAACCGCTTCAATACGACGAACGCCTGCCGCAGTACCAGATTCAGAAACGATGCGGAACAGACCGATATCCCCCGTACGGCTGGCGTGCGTACCGCCACACAGTTCGGTGGAGAAATCGCCCATGCTCAGCACACGAACGCGATCGTCATACTTCTCGCCAAACAGCGCCATTGCGCCTTTAGCTTTCGCCGCTTCAAGGTCCATGATGTTGGTTTCGACCGGCAGGTTGCGACGGATTTGCGCATTCACCAGATCTTCCACCGCACGGATTTCAGACGGTTTCATCGCTTCATTATGCGAGAAGTCGAAACGCAGGACTTTGTCGTTAACCAGCGAGCCTTTCTGCGCTACGTGCGTGCCCAGAATCTGGCGCAGCGCCGCGTGCATCAGGTGCGTCGCAGAGTGATTCAGGCGAATACGCGCCCGGCGAGCGTCATCCACATCAGCCTGCACCGCATCACCCACTTTCAGGGAACCGGTCGAAAGCGTACCGATATGTCCAATCGCCTGGCCATATTTTTGCGTATCGCTAACCGCAAACGCGAAGCCTGCGCCTTTCAGTTCGCCTTTATCGCCAACCTGACCGCCGGATTCCGCATAGAAACGGCGTCTGATCCAGCACGACAACGGCTTCCTGACCCGCGCTAATCGCATCTACCGCTTTACCATCCACAAACAGCGCGGTCACTTTACCGTTCAGTTCCAGATGGTCATAACCTTTAAATTCTGACGCGCCGTCAACACGGATCATCGCGTTATAGTCTGCGCCAAAACCGCTGGCCTCACGCGCACGACGACGCTGTTCTTCCATCGCGGCTTCAAATCCCGCTTCATCAACTTTGATGTTGCGCTCGCGGCAAACGTCCGCCGTCAGGTCAACCGGGAAGCCGTAGGTGTCATACAGACGGAAAGCGGTTTCGCCATCCAGCGTGTCGCCGGAAAGCTTCGCCAGCTCTTCGTCCAGCAGCGCCAGACCACGTTCCAGCGTACGGGCAAACTGCTCTTCTTCCGTTTTCAGAACCTGCTCAACCTGCGCCTGCTGGCGCTTCAGCTCTTCACCGGCAGAGCCCATAACGTCAATCAGCGGCCCAACCAGCTTGTAGAAGAAGGTCTCTTTCGCGCCCAGCATGTTGCCATGACGAACCGCACGACGAATGATACGGCGCAGCACATAGCCACGGTTTTCATTCGACGGAACCACGCCGTCGGCGATCAGGAAAGCGCAGGAACGGATGTGGTCGGCAATCACGCGCAGCGACTTGTTGCTCAGGTCGGTCGCGCCAGTCACCTTCGCAACGGCTTCAATCAACGTGCGGAACAGGTCAATTTCATAGTTGGAGTTAACGTGTTGCAACACAGCGGCGATACGCTCCAGACCCATACCGGTATCGACGGACGGTTTCGGCAGCGGTTCCATCGTACCGTCAGCCTGACGGTTGAACTGCATGAAGACGATGTTCCAGATCTCAATATAGCGATCGCCATCTTCTTCCGGGCTTCCCGGAGGGCCGCCCCAGATGTGATCGCCGTGGTCGTAGAAGATCTCGGTACATGGACCGCACGGCCCTGTATCGCCCATTTGCCAGAAGTTGTCGGACGCGTATGCCGCGCCTTTGTTATCACCAATTCGGATGATACGTTCGCGCGGAATGCCGACTTCTTTTTCCCAGATCTCGTAGGCTTCATCGTCAGTTTCATAGACGGTCACCCACAGACGCTCTTTTGGCAGGGCAAACCAGTTTTCACCGGTCAGCAGTTCCCACGCAAACTGAATCGCGTCGTGTTTGAAATAGTCGCCGAAGCTGAAGTTACCCAGCATTTCGAAGAAGGTGTGGTGACGTGCAGTGTAACCGACGTTTTCCAGATCGTTGTGTTTACCGCCCGCACGTACGCAACGCTGCGAAGTGGTGGCGCGGGAATAATTACGCTTGTCGAGCCCAAGGAAAACATCCTTGAACTGGTTCATCCCGGCGTTGGTAAACAACAAAGTTGGGTCATTGTTTGGCACCAGGGAGCTGCTGGCAACTACCTGATGTCCCTTACTGTGGAAAAAATCGAGAAACGCCTGACGGATCTCAGCGGTGCTCTTGCTCATAATTATCCTGAAATCAAGCTAACGAAATATCATTACCCGCCTCAGCGCTCATATGCCCTGCCCGGCTGGCAATTGAAAAAAGTGGGAATAAGATAAGTTTTCTTTACGGGGAAGTAAAATCCCGTATGCAGTCAATCGTCAAAATTTTCGCCATATCTCCTGAATATCCTCCATCAGATAGCCACGATAAAGCAGGAAACGCTGGACTTTGACCTTTTCCGAAAATGCGGCTGGTAAGGGTTCGCCATATTTGCGGATCGCCTGTTCGCGAGCAAGAAGCAACCACTCAATCTCGCATTCACGCATTGCGCGTTCAATCGCCTCACGCGCAATGCCTTTTTGATTCAGCTCCTGCCGAATACGTGCCGGGCCATAACCTTTACGGCTGCGGCTGGCGATAAAGCGCGAAACAAAACGATGATCATCAAGGTAGTTGTGCTCATGGCACCAGGCGATAACCCGTTCATAGTCTTCCGGCGCCGCATCGATCTCTTCCGGCCCGTTTTTTCCCATAACCGGTGCGGAGAGTTTACGCCGCAGTTCCTGTTCACTGTGATCGCGTACCGCCAGAATACGTACCGCGCGATCCAACAGACGGTTATACGCGGGGCGTCGCGGTGTGGATTCACTCATAACAAACCTTCGGAAAAAGAAATGCAAAAAGGGCCGCAAGATATGCAGCCCTTCAAAGTATAAGCCGATAAGACGCCATAGCGCCGCCACCGGGCAATGCAACGATTAGAAATCTTCGTTGGTTTCTTTCACACCTTCACCGTTGTCATCAACGGTGAAGTCTGGCGTCGAGTCCTGATTGCTCAGCAGCAGCTCACGCACTTTTTTCTCGATCTCTTTGGCGGTCGCCGGGTTCTCTTTCAGCCAGGCAGTGGCATTCGCTTTACCCCTGACCGATTTTCTCGCCGTTATAGCTGTACCATGCACCGGCTTTCTCGATCAGCTTCTCTTTCACGCCCAGGTCAACCAGTTCGCCATAGAAGTTAATGCCTTCGCCGTAGAGGATCTGGGAACTCTGCCTGTTTAAACGGCGCGGCGATTTTGTTCTTCACAACCTTTCACGCGGGGTTTCGCTACCCACGACGTTTTCGCCCTCTTTCACCGCGCCGATACGGCGGATATCCAGACGAACGGACCGCGTAGAATTTCAGCGCGTTACCCCCGGTAGTGGTTTCCGGGTTGCCGAACATGACGCCAATTTTCATACGGATCTGGTTGATGAAGATCAGCAGGGTATTGGACTGTTTCAGGTTACCGGCCAGCTTACGCATCGCCTGGCTCATCATACGCGCCGCGAGGCCATATGAGAGTCGCCGATTTTCACCTTCGATTTCCGCTTTCGGCGTCAACGCCGCCACGGAGTCCAACAACGATAACGTCTACAGCGCCTGAACGCGCCAGCGCATCACAGATTTCCAGCGCCTGCTCGCCGGTGTCCGGCTGGGAGCACAGCAGGTTGTCGATATCGACGCCCAGTTTGCGGGCATAGATAGGGTCCAGCGCATGTTCGGCGTCGATAAAACGCACAGGTTTTTGCCTTCACGCTGTGCAGCGGCAATCACCTGAAGCGTCAGGGTGGTTTTACCGGAGGATTCCGGCCCGTAGATTTCTACGATACGCCCCATCGGCAGGCCACCCGCGCCCAAGCGCGATATCCAGAGAAAGCGAACCGGTGGAGATGGTTTCCACATCCATGGAACGGTCTTCACCCAGGCGCATGATGGAGCCTTTACCGAATTGTTTTTTCGATCTGGCCCAGCGCTGCCGCCAACGCTTTCTGTTTGTTTTCGTCGATAGCCATTAACTACTCCTGTCATGAGCCGGGGTCCGCTTGCGCTTGTACCGGATAGTGAATTCTGTACTGTTAGGTGCGATTATACTGTACAGTCATACAGTATCAAGTGTTTTGTAGAAATTGTTGCCAGAGCGTTTTGAGCGCATATTCCGTCGCCTGGCGACGCACGCTGTCACGGTCGCCGCTAAAGCACTCCCGGCGGGTGATCCCCTCGCCCTGTGCGGTTGCGAAACCAAACCAGACAGTGCCGACGGGCTTCACGTCGCTGCCGCCGTCCGGCCCGGCAATACCGCTGATAGAAACGGCATAATCCGCGCGGGCCGCTCTGAGCGCGCCAATCGCCATCTCAATCACCACCGGTTCGCTGACCGCGCCGTGCTGCTCAAGCGTCGGTTCGCGCACGCCAATCATTTGCGCTTTCGCTTCATTACTGTAGGTAACGAAACCGCGTTCAAACCAGGCGGAACTCCCGGCGATATCGGTAATCACTTTCGCTACCCAACCGCCCGTGCAGGACTCCGCTGTTGTTAGCGTTGCGCCGCGCTGTTTCAGCGCCAGCCCTACCACTTCGCTTAACTGCATCAATTCATGGTCAGTCATCAT
Protein sequences of DBSCAN-SWA_3 >LR134204|1359487:1366959|1364687_1365182_-|VEB88021.1|DBSCAN-SWA MSESTPRRPAYNRLLDRAVRILAVRDHSEQELRRKLSAPVMGKNGPEEIDAAPEDYERVIAWCHEHNYLDDHRFVSRFIASRSRKGYGPARIRQELNQKGIAREAIERAMRECEIEWLLLAREQAIRKYGEPLPAAFSEKVKVQRFLLYRGYLMEDIQEIWRKF >LR134204|1359487:1366959|1365881_1366088_-|VEB88030.1|DBSCAN-SWA MRFIDAEHALDPIYARKLGVDIDNLLCSQPDTGEQALEICDALARSGAVDVIVVGLRGGVDAESGNRR >LR134204|1359487:1366959|1361495_1361681_-|VEB88012.1|DBSCAN-SWA MLILTRRVGETLMIGDEVTVTVLGVKGNQVRIGVNAPKEVSVHREEIYQRIQAEKSQQSSY >LR134204|1359487:1366959|1359487_1360054_-|VEB88009.1|DBSCAN-SWA MYERYAGLIFDMDGTILDTEPTHRKAWDEVLGRYGLRFDLQAMVALNGSPTWRIAQSIIELNHADLDPHALAQEKTDAVKSMLLDSVEPLPLVEVVKAWYGRRPLSVGTGSESAIAEALLTHLGLRRYFDAVVAADHVLHHKPAPDTFLLCAERMGVVPTQCVVFEDADFGLQAARAAGMDAVDVRLL >LR134204|1359487:1366959|1361920_1363081_-|VEB88015.1|tRNA|DBSCAN-SWA MSCWIRRRFYAESGGQVGDKGELKGAGFAFAVSDTQKYGQAIGHIGTLSTGSLKVGDAVQADVDDARRARIRLNHSATHLMHAALRQILGTHVAQKGSLVNDKVLRFDFSHNEAMKPSEIRAVEDLVNAQIRRNLPVETNIMDLEAAKAKGAMALFGEKYDDRVRVLSMGDFSTELCGGTHASRTGDIGLFRIVSESGTAAGVRRIEAVTGEGAIATLHAESDRLSDIAQLLKGDSQNLGEKVRSVLERTRQLEKELQQLKEQAAAQESANLSSKAVDINGVKLLVSELAGVEPKMLRTMVDDLKNQLGSTVIVLATVAEGKVSLIAGVSKDVTDRVKAGELVGMVAQQVGGKGGGRPDMAQAGGTDAAALPAALASVKGWVSAKL >LR134204|1359487:1366959|1365287_1365488_-|VEB88024.1|DBSCAN-SWA MHGTAITARKSVRGKANATAWLKENPATAKEIEKKVRELLLSNQDSTPDFTVDDNGEGVKETNEDF >LR134204|1359487:1366959|1365634_1365919_-|VEB88027.1|DBSCAN-SWA MAALTPKAEIEGENRRLSYGLAARMMSQAMRKLAGNLKQSNTLLIFINQIRMKIGVMFGNPETTTGGNALKFYAVRSSGYPPYRRGERGRKRRG >LR134204|1359487:1366959|1366458_1366959_-|VEB88036.1|DBSCAN-SWA MMTDHELMQLSEVVGLALKQRGATLTTAESCTGGWVAKVITDIAGSSAWFERGFVTYSNEAKAQMIGVREPTLEQHGAVSEPVVIEMAIGALRAARADYAVSISGIAGPDGGSDVKPVGTVWFGFATAQGEGITRRECFSGDRDSVRRQATEYALKTLWQQFLQNT >LR134204|1359487:1366959|1366142_1366280_-|VEB88033.1|DBSCAN-SWA MRLGEDRSMDVETISTGSLSLDIALGRGWPADGAYRRNLRAGILR >LR134204|1359487:1366959|1363022_1364549_-|VEB88018.1|tRNA|DBSCAN-SWA MSKSTAEIRQAFLDFFHSKGHQVVASSSLVPNNDPTLLFTNAGMNQFKDVFLGLDKRNYSRATTSQRCVRAGGKHNDLENVGYTARHHTFFEMLGNFSFGDYFKHDAIQFAWELLTGENWFALPKERLWVTVYETDDEAYEIWEKEVGIPRERIIRIGDNKGAAYASDNFWQMGDTGPCGPCTEIFYDHGDHIWGGPPGSPEEDGDRYIEIWNIVFMQFNRQADGTMEPLPKPSVDTGMGLERIAAVLQHVNSNYEIDLFRTLIEAVAKVTGATDLSNKSLRVIADHIRSCAFLIADGVVPSNENRGYVLRRIIRRAVRHGNMLGAKETFFYKLVGPLIDVMGSAGEELKRQQAQVEQVLKTEEEQFARTLERGLALLDEELAKLSGDTLDGETAFRLYDTYGFPVDLTADVCRERNIKVDEAGFEAAMEEQRRRAREASGFGADYNAMIRVDGASEFKGYDHLELNGKVTALFVDGKAVDAISAGQEAVVVLDQTPFLCGIRRSGWR |
10 | Acanthocystis_turfacea_Chlorella_virus(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1848055 : 1896534
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >LR134204|1848055:1896534|DBSCAN-SWA CTTATTTGTCGCCTAACTGCTCTGACATGGTGTTGCCTGGGTTCGGCGTACGCGGTTCATCAACCGGGCGCGGCGCACGAGGCGTGCCATTGTTGTCAGAGTTGTTGGTGCCGTTCGGATCTTCCCAGCCAGCCGGCGGGCGAACTTCGCGGCGCGCCATCAGGTCATCAATCTGCGGCGCATCGATGGTCTCATACTTCATGAGCGCATCTTTCATTGCATGCAGGATGTCCATATTGTCATTCAGGATCTGACGAGCACGATTGTAGTTACGTTCAATCAGCGCTTTCACTTCCTGGTCGATGATACGCGCGGTTTCATCAGACATGTGTTTGGCTTTCGCGACGCTACGTCCCAGGAACACTTCACCCTCTTCCTCTGCATACAGCAGCGGACCGAGCTTATCGGAGAAGCCCCACTGGGTGACCATGTTACGCGCCAGGTTAGTCGCGACTTTAATGTCGTTCGACGCACCGGTAGAAACATGCTCTACGCCATAGATGATCTCTTCCGCCAGACGACCGCCGTACAGCGTGGAGATCTGGCTTTCCAGTTTCTGACGGCTGGCGCTAATCGCGTCGCCTTCAGGCAGGAAGAAGGTCACACCCAGCGCACGACCGCGCGGAATAATCGTCACTTTATGCACCGGATCGTGCTCAGGAACAAGGCGACCAATAATCGCATGGCCTGCTTCGTGGTAAGCGGTAGATTCTTTCTGCGCTTCCGTCATCACCATGGAGCGGCGTTCCGCACCCATCATGATTTTGTCTTTCGCTTTCTCGAATTCAACCATGGATACCACACGCTTGTTGCCACGGGCAGCAAACAGCGCCGCTTCGTTCACAAGGTTCGCCAGATCGGCGCCAGAAGAAGCCCGGCGTGCCGCGTGCGATGATTGCCGCATCGATATCCGGCGCCAGCGGTACGCGACGCATATGCACTTTCAGGATCTGCTCACGACCACGCACGTCTGGCAGACCTACCACGACCTGACGGTCGAAACGGCCAGGACGCAGCAGCGCAGGGTCAAGTACGTCTGGACGGTTAGTCGCCGCGATAACGATGATACCTTCGTTACCTTCGAAGCCGTCCATCTCAACCAGCATCTGGTTCAATGTCTGTTCACGTTCATCATGACCACCGCCCAGACCCGCGCCACGCTGGCGGCCTACGGCGTCGATTTCATCGATGAAGATAATGCACGGCGCTGCTTTCTTAGCCTGCTCGAACATGTCACGCACACGAGATGCGCCAACACCGACGAACATTTCGACGAAGTCAGAACCGGAAATCGTAAAGAACGGGACTTTCGCTTCGCCTGCAATAGCTTTCGCCAGCAGGGTTTTACCCGTACCCGGAGGGCCGACCATCAGGACGCCTTTCGGGATTTTACCGCCCAGCTTCTGGAAGCGGCTTGGTTCACGCAGATACTCGACCAGCTCAGCCACTTCTTCTTTCGCTTCGTCACAACCTGCGACATCGGCAAAAGTGGTTTTGATCTGATCCTCCGTCAGCATGCGCGCTTTACTCTTACCGAACGACATGGCGCCTTTGCCACCGCCGCCCTGCATCTGACGCATGAAGAAGATCCAGACGCCGATCAGCAACAGCATCGGGAACCAGGAAATAAAGATTGAAGCCAGCAGGCTCGGCTCTTCAGGTGGTTCGCCAACAACCTTGACGTTTTTGGTCAGCAGGTTATCAAGCAGCTTCGGATCGTTAACCGGAATGTAGGTCGTGTAACGGTTACTATCTTTCTTGGTAACGTTGATCTCACGTCCGTTGATACGCGCTTCACGAACCTGGTCCTGGTTGACCTCTTGCAGGAAGGTAGAGTAATCCACCTTACGGCCATTAGACTCGCTGGGCCCAAAGCTCTGGAATACTGACATCAGCACAACGGCAATGACCAGCCAGAGTATTAGGTTTTTCGCCATGTCACTCAAGGGATTAACCTCTTATTACAACTGTGTTAAAAAACAGCGTCAGGATACTCTATATCCAGCTTCTTTCAAACTTTCGTCTGAAATCCCCCGGTTATGGTTTTCGCCCGGTCGCTACAATATACACTTCGCGTGAACGCGCACGCGAAGAGTCCGGCTTACGAACTTTAACCTTCGTAAACAGGGAGCGAATTTCCCTTAGGTACTCATCGAAACCTTCGCCCTGGAACACCTTCACTACAAAACTTCCACCTGGCGCCAGCACATCACGACACATTTCTAACGCCAGTTCCACCAGATACATGGCGCGGGGGATATCCACCGCCGGTGTTCCGCTCATGTTTGGTGCCATATCTGACATGACAACTTGTACTTTACTGTCACCCACGCGTTCCAGCAGCGCTTTCATAACAAGTTCATCACGAAAATCGCCCTGAAGAAAGTCCACACCAACGATAGGATCCATAGGTAAAAGATCGCAAGCGATGATACGGCCTTTGCCGCCAATCTGCGTGACCACATACTGTGACCAGCCTCCTGGCGCAGCACCGAGGTCGACAACCGTCATCCCCGGCTTAAAAAAGTTTGTCACTTTGCTGTATTTCATCAAGTTTAAACCAGGCACGGGAACGTAACCCCTTTTTCTGCGCCTGTTGAACATATTTATCGCTAAAGTGTTCCTGAAGCCAGCGACTCGAGCTGGCAGAACGCTTTTTTACCTGTCATTTAACTTTCCCGTCGGGGGCAATTCATCGTTACCCGTAGCGTAAATTTTTACGCGCTCATTTGGTGATATATGGGAGATGGCGGTAGAATGACCCGTTTTCAATCCCAACGTAAGCAAAAATATACGATGAATCTGAGTACTAAACAAAAACAGGCACCTGAAAGGTCTGGCACATCCGCTCAAGCCGGTAGTTATGCTTGGCAACAATGGTTTGACCCGAAGGGGTACTGGCCGAGATTTGAACAAGCGTTAGAGCACCATGAACTTATCAAGGTGAAGATCGCCTCGGAAGATCGCGAAACCAAAACCTTGATCGTGGAAGCTATCGTACGCGAAACCGGCGCCTGTAACGTACAGGTCATCGGTAAAACGCTGGTACTTTATCGCCCAACTAAAGAACGTAAAATCTCGCTGCCACGCTAAGAATATCCTAAAGTCGAACACAAAATCTGTGTAAAACGAGGGGTTTTCCGCAAGCAGGAGAGCAAAATGCCACGCTCTCTTCGTTGATAAAAGGCCGCATAGCGGCCTTTTTCCTTTCTTTACAATACATCAACATCTTGAGTATTGGGTAATTCTTACAGGTATTCCCACCTTAATCACTTCGTATTCCACTTCGCCGCCAGGGTGTTGATGGTCACGACGTCATCCTGCTCTTTGCCAACCAGGCCGCGCGCGATAGGTGGAATTCACGGAAATCAGGTTCTGTTTAAAATCCGCTTCGTCATCGCCGACAATACGCCAGGTCTGCTCTTCAATCGTTATCGAGATTCAGAACAGTTACCGTTGGCGCCGAAAAACAACACGTCCGTTGTTCGGCATCTTCGTCACGTCGATCACCTGCGCATTCGACAAGCTTCGCCTCGATATCCTTGATACGTCCTTCGCAGAAACCCTGCTGCTCACCGCGCAGCGTGATACTCGGCGTTCTCTTTCAGATCGCCATGCTCACGCGCTTCCGCGATAGCGCGATGATTTCAGGACGGCGCACAGATTTCAGAAATCCAGCTCTTCGCGCAGTTTTTCGGCACCACGTAAAGTCATCGGAATAGCTTGCATTTGTTATACCTCTTGAACATTCCTGTAGGGGGTGATTGACCCCGCCTCGGCTGCTAAGCCTGCTCCGGTATGCGTTCTACCAAAGCCAGAGCAAAAAAATACCGACCCGGGTACAAAGCCCCAGGTCAGCTACAATTCTCAATTTGATACGTATTTTACCCTGGAGTTCCCTATGGGTCATCGGTTTACTTTCCAGGGCATTGCGCCGTAGTATGACGGCTTGTTTCCAGGTTGTTAGCGCGAGATTATGCGATTTTCCAGATTTATCATCGGATTGACCACCAGTATAGCGTTCAGCGTCCAGGCCGCGAATGTTGATGAATACATCAATCAGCTCCCTGATGGGGCCAACCTTGCTTTCATGGCTCAGAAGGTCGGCGCGTCCACGCCCGCGATTGATTACCATAGCCAGCAGATGGCGCTGCCCGCCAGTACGCAAAAAGTGATCACGGCCCTCGCGGCATTGATTCAGCTCGGCCCCGATTTTCGTTTTACCACCACGCTGGAGACAAAAGGCAACGTGGACAATGGCGTCTTAAAAGGGGACTTAGTGGCGCGATTTGGTGCCGATCCGACGCTAAAACGCCAGGATATTCGCAATATGGTCGCGATGCTGAAAAAATCCGGCGTGACGCAAATTGCGGGAAATGTGCTCATCGACACCTCCATTTTCGCCAGCCATGACAAAGCGCCAGGCTGGCCCTGGAATGACATGACGCAATGTTTTAGCGCACCCCCCGCAGCCGCCATTGTCGATCGCAACTGCTTTTCCATCTCCCTCTACAGCGCGCAAAAACCGAATGACTTAGCGTATATTCGCGTGGCGTCCTATTACCCGGTGACCATGTTCAGTCAGGTTCGCACTCTCCCACGCGGCTCTGCGGAAGCGCAATACTGTGAGCTGGATGTCGTTCCGGGGGATTTAAACCGCTTCACGCTGACAGGCTGCCTGCCGCAGCGCGCCGAACCGTTACCGCTGGCCTTCGCCATTCAGGACGGCGCCAGCTATGCCGGAGCGATCCTGAAAGACGAGTTAAAACAAGCAGGTATTACCTACAGCGGTACGCTGTTACGCCAGACGCAGGTCAACGAACCCGGAACGGTGGTTGCCAGCAAGCAGTCTGCGCCGCTGCATGATTTGCTTAAGATTATGCTGAAAAAATCGGACAACATGATTGCCGATACCGTCTTTCGCATGATCGGCCACGCACGCTTCAACGTCCCCGGCACATGGCGGCGGGATCTGATGCCGTACGCCAGATCCTGCGTCAGCAGGCTGGCGTTGATATTGGCAATACCATCATCGCTGACGGTTCGGGCCTCTCTCGCCACAACCTGATTGCCCCGGCCACCATGATGCAGGTGCTGCAATACATCGCCAACACGACAACGAGCTGAACTTTATCTCCATGCTGCCGCTGGCGGGATATGACGCTCTCTGCAATACCGTGCAGGTCTGCATCAGGCTGGGGTCGATGGTAAAGTTTCGGCGAAAACAGGCTCGCTGCAAGGGGTCTATAATCTGGCGGGATTTATTACAACGGCAAGTGGGCAACGAATGGCGTTTGTTCAGTATCTTTCCGGGTATGCCGTTACGCCAGCTGACCAGCGCAATCGTCGAATTCCGTTAGTACGTTTTGAAAGCCGGTTGTACAAAGATCTTTATCAGAACAACTAGGTGAGGATAGTGCCGGATGGCGCTTACGCTTATCCGGCCTACACACTAGATGGGTCATTTAATTGCCGGATGGCGGCGCTTTGCGCCTTATCCGGCCTACGTACCCGTAGGCCCGGTAAGCGTAAGCGCCACCGGGCATTTGAACCAGATTAACGTTTGTAGATGAACTCGACGCCTTCTTCGTCGTCTTCGTCCCAGTCGTCATCCCAGTCATCTTCCGCTTCGTCTTCGACTTCAGCGAGCTGCTGACGGTGATAATCGTCCACATGAACTCCACTTTCTCCGGCTGCTTCGCTTCATCAGCCTGAACGATAGGGTTCTCGATGATAAAGGTCATCACATCCCAGCAGAGATCTTTCACGCCAGTCTGGCTGGCAGCGGAAAATCATGTAGTACTTATCTTCCCAGCCCAGCGCCTGCGCGATAGCTTTCGCTTTTTCTTCCGCTTCGGCCTGTCCAGCAAATCAATTTTGTTGAACACTAACCAGCGCGGCTTAGCAGCCAGATCCTGACTGTATTTTTCCAGTTCGCCGATAATGATACGGGCGTTTTCCACCGGATCGGAACCGTCGATAGGATCGATATCAATAAGGTGCAGCAGTACGCGGCAACGTTCCAGGTGCTTCAGGAAGCGAATCCCCAGACCCGCGCCTTCCGCCGCGCCTTCAATCAGCCCCGGAATATCGGCAACCACAAAGCTCTTTTCGTTGTCCATACGCACAACGCCAAGGCTCGGCACCAGAGTGGTAAACGGATAGTCCGCCACTTTCGGTTTTGCCGCAGATACCGCGCGGATAAAGGTTGATTTACCGGCGTTTGGCATACCCAGCATACCCACGTCCGCCAGCAGCATCAGCTCCAGCATCAAATCGCGCTTATCGCCCGGCGTGCCCATCGTTTTCTGACGCGGAGTACGGTTAACAGAGGACTTAAAGCGGGTATTGCCCAGGCCGTGCCAGCCGCCTTTGGCAACCATCAGACGCTGACCGTGTTTGGTCATATCGCCCATGGTTTCGCCGGTGCCCTGATCGATAACACGCGTACCGACAGGCACTTTGATCGTGACATCTTTCCCACGTTTACCGGTGCAGTCACGGCTTGCGCCATTCTGACCACGTTCAGCGCGGAACGATTTTTCAAAGCGGTAATCGATCAGCGTGTTCAGGTTTTCGTCGGCTTCCAGCCAGGACGTCGCCACCATCGCCGCCATCACCGCCATCCGGCCGCCTTTCGGGATGTATTTTTCGCGGCGGAAGCTCACGCAACCATTACCGCCATCCGCCAGCAACGACCAGAATCGATGCTTCATCAACAAACTTCATTTTACTCTCCGTAATCATTCGCCTGAGCGGGGTTGCGAAACCACCGTTTCATGCTTGCGTAAAACCGCCCAAATACGATGACCAATGGCGGAATACATCGCGCCGCCAACCACGACAAACGCACCGAGATAACCTAAAAGGTTTAACATCGGTCTGGCGAAGAAATCGGGCCAGGCCATAGATAATAGATCTGAAAATAACAGCGTAAACAGCGGAGTGAGCGTGATTAACGCACTTACCTGCGCTGCCTGCCAACGCGCCATCGCTTCCGCCAGGGCGCCATATCCTACCAGCGTATTCAGCCCACAGAAAATCAAACACGCGAGCTGCCAGTCGCTTAACTGCGAGATAACCCCCGGCTTCGCCAGCGGCAATAACGCAATCGTACATAAAGTGTACAGTAAAAACAGGATCTGCTGTGAGGCCAGACGACGCAATAACACCTTTTGCGCGACGCCATAACTCACCCAGACCGTCGCCGCACCGACCCCAAAGATCACACCCCAGGTGTAATCGGTCAGTTTGGTAAAAATCTCGACCAGACTGGTATTAAAGAACATCACCAGCCCACAAAGCAGCATGAGCGCGCCAATCACCTGCGTGCCGCGCATCTTTTCTTTCAAAATAAAGACGCTGGCGACCATCATGCCAACCGGTGACAGTTGACCGATAACCTGCGATGCCGTCGGACTCAAATATTGCAGGGAAGAGCTGAACAGGATGAAATTCCCGAAAAGTCCACCTGTTGCAATCGCCAGCAACACCAGCCAGCGCGGTTTGCGAAAAATACGCAATGGCGGGAGCTTTCTTTTAACTGCCAGGATCGCGCCGAGGCCGATACTCGCCATTAAAAAGCGATAGAAGACAATGGTTGGAGGTTCCATCACCTCCAGCACCTGCTTCATTGCAATTGGCAAGGCGCCCCAACACATTGCAGTGATGAGTGCTAAAAGGATGCCAATGCCAGCCTGCTGCTTCATGCCCGTTTTCCCTACAAGAAAATTTTCCGGGTTTTCAATGTAAAAAGCCCCGCAACACGTTGCGGGGCTTTAATCCGTTACCGGACCACGAAGAACTTACTCAGCAACGATGCTGATGTATTTACGGTTGTTCGGGCCTTTAACTTCGAATTTCACTTTACCGTCTGCTTTAGCAAACAGAGTGTGGTCACGACCGCAACCTACGTTAGTGCCAGCGTGGAATTTGGTGCCACGTTGACGAACGATGATGCTACCCGCCAGAACGGTTTCGCCACCGAAACGCTTAACGCCCAGGCGTTTAGCTTCTGAATCGCGACCGTTACGAGTTGAGCCGCCAGCCTTTTTATGTGCCATTTAAATCTCTCCTCAGGTCTTAGGCGCTGATGCCAGTAATTTTCACATCAGTGAACCACTGACGGTGGCCTTGCTGCTTACGATAGTGTTTACGACGACGAAACTTAACGATTTTAACTTTCTCGCCACGACCGTGAGCAACAACTTCAGCTTTGATTACGCCGCCATCAACGAAAGGAACGCCGATTTTGACTTCTTCACCGTTTGCGATCATCAGAACTTCAGCGAACTCAATAGTTTCGCCAGTTGCGATGTCCAGCTTTTCCAGGCGAACGGTCTGACCTTCGCTTACTCGGTGTTGTTTACCACCACTTTGGAAAACCGCGTACATAAAAAACTCCGCTTCCGCGCACGCCTTTTCAATGATTCAGAGTGCGCTATAAATATTCACAATAGGGCGCGAATATTACGCAAAACGCGAGCCTTTGACAAGTGCTACCGTCAATACATGAAGAAAAAAAACACAACGTGTACGGTAACGTTTATCTGCGCCGTTTTTTCAGTACAATCAGCATATATTCCTAACCCTAAACCCTTAGTTATCTCATCCCATGATGAGTTAATCGGGGCGATAAGCCCGGCTTTTGCGATGAATTTAGAAAAAATCAATGAGTTAACCGCGCAAGATATGGCGGGTGTCAATGCGACAATCCTTGAACAGCTCAATTCCGACGTCCAACTGATCAATCAGTTAGGGTATTACATCGTTAGCGGCGGCGGAAAACGTATCCGCCCGATGATTGCCGTACTCGCTGCGCGCGCCGTTGGTTATCAGGAAAATGCGCACGTCACGATCGCTGCCTTAATCGAGTTTATCCACACCGCCACCCTGCTACACGACGATGTTGTGGATGAATCCGATATGCGTCGTGGGAAGGCAACGGCGAACGCGGCATTTGGCAACGCCGCCAGCGTCCTGGTCGGTGATTTTATCTACACCCGCGCGTTCCAGATGATGACCAGCCTTGGCTCGCTGAAAGTGCTGGAAGTAATGTCAGAAGCGGTGAACGTCATCGCCGAAGGCGAAGTGCTGCAACTGATGAACGTCAACGACCCGGACATCACCGAAGAAAATTACATGCGCGTGATTTACAGCAAAACGGCGCGCCTGTTTGAAGCTGCGGCGCAGTGTTCCGGTATTCTTGCTGGCTGTACTCCTGAGCAGGAAAAAGGGTTGCAGGATTATGGTCGTTATCTCGGTACGGCCTTTCAGCTGATTGACGATTTACTCGATTACAGCGCTGATGGCGAGCAGTTAGGTAAAAATGTCGGCGATGACCTCAACGAAGGAAAACCAACGCTGCCGCTGCTGCACGCTATGCGTAACGGTACGCCAGAGCAGGCGCAAATGATCCGTACCGCCATTGAACAAGGCAATGGTCGCCACCTTCTGGAACCCGTTCTGGAAGCCATGACCGCCTGCGGTTCACTGGAATGGACGCGTCAGCGTGCAGAAGACGAGGCCGATAAAGCCATCGCCGCCCTACAGATTCTGCCTGATACGCCGTGGCGCGAAGCCCTGATCGGTCTGGCGCACATCGCCGTACAACGCGATCGTTAACCCGCCCTTCTCATCCCGTGCTCTGCGCGGGATGAATCTTTAAGTTCCAGTAAATTCCAGTTACTCATTTCTTTTCCAGCCTTTACCCATCGCACGTATGCTGTCACATTTTATGAATATGCATACTCTTTACAGGAACATATTAGAGCAATACTATCACCTGGCGTATAGTATTTCTCACAGGCTGATTATCCAGGCCGCAAGCGACTGACTGACCGAAGGAGTGAGGGAATAATGGAAAATAAACTCATTGACTGGCATCCCGCAGACATCATTGCCGGATTACGTAAAAAAGGGACATCTATGGCGGCGGAATCACGCAAGAATGGATTAAGTTCCTCGACGCTGGCAAACGCCCTGACCCGACCGTGGCCGAAAGGAGAGCTCATCATTGCGAAGGCGCTGGGTACAGAACCCTGGGTGATCTGGCCGTCGCGCTATCATGACCCAGAAACCCATGAGTTCATTGACAGAACGCGTCTGATGCGGGCTCGAAAAGGGAAATAGAGAAGTAAGGAACGGATAACGCTCTGAACGGTAGCCCCCGAACAGGGCTACCAGTGCATGCGGAAAAAATTACTCGCCTTTCACACGCTCAATATTCGCGCCCAGCGCGCGCAGTTTGTCTTCAATGCGCTCATAGCCACGATCGATGTGATAAATACGATCGACCAGCGTCGTGCCTTCTGCAATGCATCCCGCCAGCACCAGGCTCGCAGACGCACGCAAGTCGGTCGCCATCACCTGCGCCCCGGAGAGTTTTTCAACGCCGTGGCAAATCACGGTGTTGCTCTCGATTTCCGCGTGCGCGCCCATACGGATCAGTTCAGGCACATGCATAAAGCGGTTTTCGAAAATGGTTTCCGTGATGAAACCCGTCCCTTCCGCGACAAGGTTCAACAGCGTGAACTGCGCCTGCATATCGGTCGGGAAAGCCGGGTGCGGCGCGGTACGCACATTCACAGCTTTCGGACGCTGGCCATGCATGTCCAGGCTGATCCAGTCTTCACCGGTTTCAATATCCGCTCCGGCGTCACGCAGTTTAGCCAGTACGGCATCCAGCGTATCCGGCTGCGCGTTACGGCAGACGATTTTACCGCCGGAAATCGCCGCCGCGACCAGGAAGGTGCCGGTTTCAATGCGATCGGGCAGAACACGGTATACGCCGCCGCCCAGACGCTCAACGCCCTCAATGGTAATACGATCGGTGCCCTGACCGCTGATCTTCGCTCCCAACGTCACCAGGAAGTTCGCGGTATCGACGATCTCCGGCTCACGCGCGGCGTTTTCAATCACGGTGGTCCCTTCGGCCAGCGTTGCCGCAGACATAATGGTCACCGTCGCACCGACGCTGACTTTGTCCATCACGATATGCGCGCCTTTCAGACGACCATTAACAGACGCCTTCACATAGCCTTCTTCCAGCTTGATCTCAGCGCCCAGTTGTTCCAGGCCGGTGATGTGCAGATCGACCGGACGCGCGCCGATGGCGCAGCCGCCTGGCAGAGAAACCTGCCCCTGACCAAAACGCGCAACCAGTGGCCCCAGCGCCAAATCGACGCGCGCATGGTTTTTACCAGTTCATAAGGCGCGCAGAAGATGTTGACCTGGCTGGCGTCGATCCAGACAGAACCGTTACGTTCCACTTTCGTTCCAAGCTGGCTGAGCAGTTTCATCGTCGTATCGATGTCTTTCAGCTTTGGAACGTTCTGGATCTCTACCGGCTCTTCTGCCAGCAGCGCCGCGAAAAGGATCGGCAATGCGGCATTTTTAGCGCCTGAAATTGTGACTTCGCCCTGGAGCGTTGTCGGCCCCTGTACACGAAATTTATCCATTATTCTGTTCTCTGTTAAGAATTCATATCTGCTACCGGCGTGTCGCCCGTCGCTCAAAAACCGTTAAGTTTGCGATCGCGTTCCCATTCCGCTGGGGTATACGCTTTGATCGATACGGCATGGATACGGTTATCCGCAATAAACTCCATCAACGGGCCGTAAACAGACTGCTGCTTCTTGACCCGGCTCATGCCGTCCAAACATCTCACCCACGGCGATAACCTGAAAGTGACTGCCATCGCCAGAGACGTGGACTTCCTGGAGGGAGAGTGCGCTCATCAGCACGCTCTGAATTTCATGATTTTCCATGGGCTCTTCATTCGTCAAATAGGGAAAACAGCCCAACATCTTAGAGCAAAGTGGCGCTGTCATAAATAAGCAAAAAGCCCAGCTGATAAATCAGGCAAGGCTTTGGTGGCAAATCCCACTTAGGGGTTTTGCCGGATGGCGGCACAAGGCCTTATCCGGCCTACAAAAATTAACGTGGGAGCACGTCAGCAGGCAAATTATAGAGCTTCGCCAGCGTGTAAACCTTCTCGTTTACCCCTTGCAGCGAAACGCTGTTGCCCTGTCGCTTCGCCTGATCGACAAGATGCACCAGTAACGCCAGACCACCGCTATCCACCCGGGAGACCTGGCTCAGATCGATACAGGTTACGCCTTTCATCGCATCAACGCGCGCATCCCACAGAGGGTTCAGCACGTCCTGGTCCAGCTCTCCCGCTAATGCCAGCGTGTCACCTTCGCGCGTCCAGCTAAGTGACTGCGTCATTATTTTTTCTCTTCCAGCGAAATTTTCTGACGAGAAATTGACTGCAACTGCGCGGTCAGGCCGTCGATGCCTTTGGTACGCAGCAGATCGCTCCACTCATTCTGTTTTGTGGTGATCATACTGACGCCTTCCGCGATCATGTCATAAGCCTGCAGTTACCGGTCTGGGTGTTTTTACGCCACTGGAAATCCAGTCGAACCGGAGGACGGCCATTCGGATCGATGATCGTCACGCGGATCGGCACGATGGTCGCATTGCCCAGCGGTTGTTCCGGCGCAATCTGGTAAGTCTGGCCGTGGTACATTGCCAGCGCCTGACCATAAGCCTGTTTCAGGTATTCACGGAACGCCGCGAAATAGGCGTCACGCTGCGCCGGCGTCGCCTCCCGGTAGTAACGACCCAGTACCAGCGCGCCCGCGTATTTCACCTGCACATACGGCCAGCAGTTCCTGGTCAACGACGTCGCGCAGGTAATCCGGGTTGGCACGGATTTTCGGCTGTTCGTTTTTCAGACGATCGAAGGTTTTCTGCGCCGCCTCATTCATCAATTTATACGGGTTAGTCTGGTCCGCCGCGGTTGCCGCGCTCAACGGTGCAATGACCAGCAGCGCTACCATCATTAATCGCTTAAACATACATCGATTCTCCTGTGATTAATTTGTTGCGCCCGCAGGCGGTGTCGCTTCATTATTGCCTTCAGTTGGCGCTGGCGCATCGCCAGAATTCTTATTGTCGTCCCCTTTACTGTTGTAAAGGAACTGACCAATCATGTCTTCAAGCACCATCGCGGACTTGGTGTCCTGAATAGTGTCGCCATCTTTGAGGATAGACGTTCCCAGTTCAGGATCTTCAAACCCGACGTTTAACGCCAGATATTGCTCCCCCAGCAGGCCAGAGGTGCGGATACTCAGCGAACTGGTATCAGGAATGTGGTTATAGCGCTCTTCAATATCCAGCGTGACGCGCGGCAGGTAGGTTTTCGGATCCAGCGAGATATCCGCAACCCGGCCGACAACCACCCCGCCGATGCGCACCGGAGAGCGCGCTTTCAGGCCGCCAATATTGTCGAAGGTCGCATAAATCGTGTAGGTCGGCTCCGTGCGCATTGAGGTGACATTGGCCGCTTTCAGGCAAACAAACAGCGCCGCCAGCAACGCAACCAGCAAGAAGATCCCTACCCAAATTTCACTTTTTTTCGTTTGCATGAACTCAATTCCCAAACATCAGTGCGGTCAACACAAAATCCAGCCCCAGAACGGCCAAAGACGAGTGCACAACGGTGCGCGTGGTTGCGCGGCTAATCCCAGCAGAGGTCGGAATCGCGTCATACCCATTGAACAGCGCAATCCACGTTACCGTGATGGCAAACACCACGCTCTTAATCAAACAGTTCACGAGATCCAGACGCCAGTCGACGGCATCCTGCATCGCCGACCAGAAGAAACCGGCATCGATCCCCTTCCAGCTAACGCCAACCAGCGATCCTCCCCAGATGCCAACGGCCACGAAGATAATCGTCAGCAGCGGCAGCGAAATCACCCCCGCCCACAGGCGCGGAGAGATCACCCGACGCAGCGGATCGACGGCCATCATTTCCATACTGGAGAGCTGCTCAGTCGCTCGCATAAGACCAATTTCTGCGGTCAGCGCAGACCCTGCGCGCCCGGCAAACAGCAACGCCGCGACAACCGGCCCCAGTTCACGCAGCAATGAAAGCGCCACCAGCATACCCAAGCTGGTTTCCGCGCTGTAAGTCGTCAGAACCAGATAGCCCTGCAACCCGAGCACCATGCCGATAAAAACGCCAGAGACGATAATAATGAGCATCGACAGGACGCCCACATTATAGAGCTGACGCACCAGCAGTGGCGCATGCTTGCGAAATTCCGGCTTACCAATCACCGCATTGAATAACATTAACCCGGCACGCCCGAACGTTCTGAGGGTTTTGATCCCCCGGTGTCCAAGCGATGCCAGCGCATTTAACAGCATGAGTGGCTTAACTCCCTGTTTCGAGTAAATCGGAGTGATAATCGCCCGCCGGATAGCGGAACGGAACCGGCCCGTCGGCAATACCGTCCAGGAACTGGCGGACGCGCGGGTCGGCATTTTCTTGCAGTTCCTTCGCGCTACCGTGCGCCACGATTTTTTTGTCCGCCATAATCCAGGCGTGATCGGCGATGCTCAGCACCTCCGGCACATCGTGGGACACGACAACGCAGGTTACGCCCAGCGTGCTGTTCAGTTCTGAAATCAGCTTCACCAGTACGCCCATCGTAATCGGGTCCTGACCCACAAACGGCTCATCGAACATAATGAGGTCGGGTTCCAGCGCAATAGCACGGGCTAATGCCGCACGCCGCGCCATCCCCCCGGACAGTTCAGAAGGCATGAGCTTTGCCGCGCCGCGCAGGCCAACGGCTTCCAGTTTCATCATCACCGTGCTTTTCAACAGCGGCGCGGGTAAATTCGTGTGCTCCCGCAGCGGGTAGGCTACGTTATCAAACACGTTCATGTCGGTGAACAGCGCCCCCGACTGAAACAGCATGCTCATCCGTTTTCGTACCGTGTACAACCGCGAGCGAGACATAGCAGGTACGTTTTCGCCATCAAACAGTATTTCCCCGCTATCCGGCGGGATCTGTCCGCCAATCAGTCGTAACAGCGTGGTTTTACCGATCCCCGATGGCCCCATGATCGCCGTGATCTTCCCGCGTGGCACGGTCAGGGAAATATTATCGAAAATGCAGCGGTCGCCACGCGAAAAGCTGACGTCGCGCATATCGACTAAATTCGCCACAGACTGACCCATTGATTCATCCTTTGTATCGCCTTGTTGATCTAAGCATGGCGCTGAATTTAGCCATGAACCCAACATATTTACAGAATATTACCTGCCCTGGTTAGCTAAAGCTGGCATTTGTTTTACTTTTTAGCCGCATAAAGTCAAAATTAAGACTTCGTTACGGCTTCCAGAAGATCCCTCCAGTGGACCGGCGAGTATACCTGAAGAAAGGACTTTAGATGCTTTTAGCTACGGCGCTGTTAATTATTGGTTTACTTCTGGTGGTCTACAGCGCCGATCGTCTGGTGTTTGCCGCATCGATTTTGTGCCGGGCAGTGGGGATTCCTCCGCTCATTATCGGGATGACAGTCGTCAGTATCGGAACATCATTACCGGAGATAATTGTCTCGGTTGCCGCGTCGCTTCACGGGCAACTGGACTTAGCCGTAGGCACCGCGCTCGGCTCCAACATCATCAATATCTTGCTGATTCTGGGCCTGGCGGCGCTGTTTCACCCTTTTACCGTGCATTCTGATGTTTTGCGTCGCGAATTGCCGCTAATGTTATTTGTCAGTGTGCTGGCGGGTTCCGTCCTGCACGATGGCGAGCTGAGCCGTAGCGACGGAATCTTTCTTCTGCTGCTGGCCGTGCTATGGCTGCTGTTCATTGTTAAGATCGCGCGCCTGGCGGAGCGGCAGGGAAATGACAGCCTCACCCGGGAGCAGGTGGCGGAGTTGCCGCGCGAAGGCGGCTTACCCGTCGCGTTTTTGTGGCTGGGTATCGCGCTGATTATTATGCCGATGGCGACGCGGATGGTGGTTGATAACGCCACCGTACTGGCGAACTACTTCGCTATGAGCGAACTGACTATCGGCCTGACGGTGATTGCGATTGGCACCAGCCTGCCAGAACTGGCGACGGCGATCGCGGGCATCCGCAAAGGTGAAAATGATATCGCCATTGGCAACATTATCGGCGCCAATATTTCCAATATCGCCATCGTTCTGGGGCTGCCTGCATTGATTACGCCAGGCGACGTCAACCCGCTGGCGTTTGGGCGTGACTATAGCGTGATGCTGTTAGTGAGCATTGTTTTGGCATTGCTGTGCTGGCGGCGTCCGCGCCAGATTGGTCGGGGCGCAGGCGTGCTGCTGACCGGCGGTTTTATCGTATGGCTGGCGATGCTGTACTGGCTATCGCCGCTTCTTATTGGATAACGGGAAAACGGACTATGTCGCACTTAGCGTTACAACCGGGTTTTGACTTTCAGAAAGCAGGCAAAGACGTCCTGGAGATTGAACGTGAAGGCCTGGCGGAGCTTGATCAATACATCGACCAGAACTTCACGCTCGCCTGTGAAAAAATCTTCTCGTGCACAGGCAAAGTCGTTGTCATGGGGATGGGAAAATCAGGTCATATCGGGCGGAAAATGGCCGCCACCTTTGCCAGCACCGGCACTTCCGCCTTTTTTGTTCACCCGGGCGAGGCCGCGCATGGCGATCTCGGCATGGTCACCTCGCAGGATGTGGTGATCGCGATTTCCAACTCTGGCGAATCCAATGAGATTGCCGCATTAATCCCGGTTCTCAAACGCTTGCAGGTGCCGCTGATTTGTATCACCGGGCGGCCAGAAAGCAGCATGGCGCGTGCGGCGGATGTGCATCTGTGTGTTAAAGTGCCGAAAGAAGCCTGCCCGTTGGGCCTGGCGCCGACCAGCAGCACTACGGCTACGCTGGTCATGGGCGATGCGCTCGCGGTCGCGTTATTAAAAGCCCGCGGCTTTACCGCCGAAGATTTTGCGTTGTCGCATCCGGGCGGCGCACTTGGCCGTAAACTTCTGCTGCGCGTTAACGATATCATGCACACCGGCGATGAAATCCCGCATGTGAATAAAAACGCCAGCCTGCGTGATGCCCTGCTGGAAATTACGCGTAAAAATCTCGGCATGACCGTCATTTGTGATGACACAATGAAGATTGACGGCATCTTTACCGATGGCGACTTGCGTCGTGTTTTCGATATGGGTGTGGATGTTCGCCAGTTAGGGATTGCCGACGTCATGACGCCTGGGGGAATTCGCGTTCGCCCCGGTATTCTCGCCGTTAACGCCCTGAACTTAATGCAGTCCCGTCATATCACCTCCGTTTTGGTTGCTGATGGCGACCAGTTACTGGGTGTGTTACATATGCATGATTTACTGCGTGCAGGCGTAGTGTAGAAACTCAAGGATAAAAGAAATGAGTAAAGCAGGTGCGTCGCTTGCGACCTGTTATGGCCCCGTCAGTACACAGGTTATCGCCCAGGCGGAAAACATCCGTCTGCTCATCCTCGATGTGGATGGCGTGCTGTCTGATGGTCTGATTTATATGGGCAACAACGGTGAAGAGCTGAAGGCCTTTAACGTCCGTGATGGCTACGGCATTCGCTGTGCGCTCACTTCCGGGATTGAGGTCGCTATTATTACCGGGCGAAAGGCTAAACTTGTAGAAGATCGCTGTGCCACCCTGGGGATCACGCATCTCTATCAGGGGCAGTCGGACAAACTTATCGCGTTTGGCGACCTGCTCGATAAGCTGGGCGTTGCGCCTGAAAATGTGGCTTACGTCGGTGACGATCTGATCGACTGGCCGGTGATGGAAAAAGTGGGTCTGAGCGTTGCGGTCGCTGATGCGCATCCGCTCCTGATCCCCCGCGCCGACTATGTGACGCATATCGCCGGTGGACGCGGCGCGGTGCGGGAAGTGTGCGATCTACTCTTGCTGGCGCAGGGTAAGCTTGATGAGGCCAAAGGGCAATCGATATGAGTAAAACCAGACGTTGGGTTATCATTCTACTGTCGTTGGCCGTACTGGTGTTGATCGGAATTAATCTGGCGGATAAAGACGATCCTGCTCAGGTTGTGGTGAACACCAGCGATCCGACTTATAAGAGCGAGCATACCGATACGGTCGTCTATAGCCCGGAAGGCGCGTTGAACTATCGTCTTATTGCGCAGCACGTTGAATATTATTCAGAACAGGCGCTTTCGTGGTTTACCCAACCGGTATTAACCACCTTTGATAAAGATAAAGTCCCGACATGGTCGATTAAAGCGGATAAGGCGAAATTGACCGAAGACCGGATGCTCTATCTGTATGGTCACGTTGAAGTCAATGCGCTGGCGCCTGACGCTCAACTACGCAGAATCACGACGGATAACGCGACGATCAACCTGGTGACGCAGGATGTCACCTCTGAAGATCTGGTCACGTTATACGGAACAACATTTAACTCCAGCGGACTGAAAATGCGCGGCAACTTACGCAGCAAGAACGCCGAGCTGATTGAAAAGGTTAGAACCTCCTATGAAATTCAAAACAAACAAACTCAGCCTTAATCTTGTGCTTGCCAGCACACTTCTGGCCGCCAGTCTTCCGGCGTTCGCGGTCACCGGCGATACCGAACAACCGATTCATATTGAATCGGACCAGCAGTCACTGGATATGCAGGGTAACGTGGTGACGTTTACCGGAAATGTCGTCGTGACTCAGGGCACCATCAAAATCAATGCCGATAAAGTGGTCGTCACCCGTCCGGGCGGCGAGCAAGGCAAAGAGGTGATCGACGGCTTCGGCAACCCGGCGACGTTTTATCAGATGCAGGACAACGGCAAGCCGGTAAAAGGCCATGCCCAAAAAATGCATTATGAGCTGGCGAAAGATTTCGTCGTCCTGACCGGTAACGCCTACCTGGAGCAGCTCGACAGCAACATCACCGGCGATAAGATCACTTACCTGGTGAAAGAGCAGAAAATGCAGGCGTTCAGCGAAAAAGGCAAACGTGTAACGACGGTTCTGGTGCCGTCGCAATTGCAGGACAAAAACAAAGACCAGGCCCCGGCACAGAAGAAGGGTAACTAATTCGTTATGGCAACATTAACTGCAAAGAATCTTGCGAAAGCCTATAAAGGCCGTCGCGTGGTGGAAGATGTCAGCCTGACCGTGAATTCCGGGGAGATCGTCGGTCTGCTCGGGCCGAACGGCGCAGGTAAAACCACTACCTTCTACATGGTGGTGGGCATTGTGCCGCGCGACGCTGGCAACATTATTATTGACGATGAGGACATCAGCCTGCTGCCGCTGCATGCCCGCGCGCGTCGCGGTATTGGCTATTTACCGCAGGAAGCCTCCATTTTCCGCCGTCTGAGCGTGTTTGATAACCTGATGGCGGTGCTGCAAATTCGTGACGATCTCACGACCGAACAGCGTGAAGACCGCGCCAATGAACTGATGGAAGAGTTCCACATCGAGCATCTGCGCGACAGCCTCGGACAAGCGTTGTCCGGCGGTGAGCGCCGCCGTGTGGAGATTGCCCGCGCGCTGGCCGCAAATCCGAAATTTATCCTGCTGGATGAACCGTTTGCAGGCGTTGACCCGATTTCCGTCATCGACATCAAACGCATTATCGAGCACCTGCGCGACAGCGGTCTCGGCGTTCTGATTACCGACCACAACGTTCGTGAGACGCTGGCGGTGTGTGAACGCGCGTATATTGTCAGCCAGGGGCATCTGATCGCACACGGCACACCGACAGAAATCCTGCAAGACGAGCACGTTAAACGCGTATACCTTGGGGAAGACTTCAGACTCTGATAGGGTAGAAGATTGCGACGTCACAGCGGGAGAAAACGACTCTGAACATGAAGCAAGGTTTGCAACTCAGGCTCAGCCAACAACTGGCGATGACGCCCCAGCTACAACAGGCCATCCGTCTGTTGCAGTTGTCTACGCTGGAACTTCAGCAGGAACTTCAGCAAGCGCTGGAGAGTAATCCGCTGCTTGAGCAAACCGATCTTCACGATGAAATCGACACGCGCGAAACCCAGGACAACGAAGCCCTGGATACCGCAGACGCGCTCGAACAGAAAGAGATGCCCGAAGAGCTGCCGCTTGACGCCAGCTGGGATGAGATTTACACCGCAGGTACGCCGTCCGGCACCAGCGGCGACTACATCGACGACGAGCTGCCGGTGTACCAGGGCGAAACCACGCAGTCCCTCCAGGACTACCTGATGTGGCAAGTCGGGCTGACTCCGTTCTCGGATACCGATCGCGCGATTGCCACCTCAATTGTCGATGCCGTTGATGATACCGGCTATCTCACCATTTCGCTGGATGACATTCTCGAAAGCATCGGCGATGAAGAGATCGGCCTCGATGAAATAGAAGCGGTACTCAAACGCGTTCAGCGCTTCGATCCCATCGGCGTGGCGGCGAAAGATCTCCGTGACTGCCTGCTGATCCAGCTCTCACAGTTCGATAAATCCACGCCCTGGCTGGAAGACGCCCGGTTGATCATCAGCGATCACCTCGATCTGCTGGCCAATCATGACTTCCGTACCCTGATGCGCGTTACGCGGCTGAAAGAAGAGGTGCTGAAAGAGGCGGTCAATCTTATCCAGTCGCTCGATCCCAGACCGGGTCAGTCTATCCAGACCGGCGAACCTGAGTATGTGATCCCGGATGTGCTGGTGCGTAAGCACAACGGTTACTGGATGGTGGAGCTGAACGGCGACAGCATTCCCCGTCTACAGATCAATCAACATTATGCCGCCCTGTGCAACGGCGCGCGTAATGATGCCGACAGCCAGTTCATTCGCAGCAACCTTCAGGATGCGAAGTGGCTGATCAAAAGTCTGGAAAGCCGCAACGACACCCTGTTACGCGTGAGCCGCTGTATTGTTGAGCAGCAGCAGGCCTTCTTCGAGCAAGGTGAAGAGTTTATGAAACCGATGGTGCTGGCGGACATCGCCCAGGCCGTCGAGATGCACGAATCGACGATTTCCCGTGTGACCACGCAGAAGTATCTGCACAGTCCACGCGGCATTTTTGAACTGAAGTATTTCTTTTCCAGCCATGTGAATACCGAAGGCGGAGGCGAAGCCTCTTCCACGGCGATACGTGCGCTGGTGAAGAAATTAATTGCTGCGGAGAACCCCGCGAAGCCGCTGAGCGACAGCAAGTTAACCTCTATGTTGTCGGATCAGGGTATCATGGTGGCGCGCCGCACCGTTGCGAAGTACCGAGAGTCTTTATCCATTCCGCCGTCCAATCAGCGCAAACAGCTGGTTTGAACCAACCGATAAGGAAGACACTATGCAGCTCAACATCACTGGAAATAACGTCGAAATTACTGAAGCCCTGCGCGACTTTGTCTCAACCAAATTCGCCAAACTTGAGCAGTATTTTGACAGGATTAATCAGGTCTACGTTGTTTTGAAAGTGGAGAAGGTAACACACATCTCGGATGCGACACTGCATGTAAACGGTGGCGAAATTCACGCCAGCGCGGAAGGTCAGGATATGTATGCCGCCATTGACGGTTTGATCGATAAACTGGCAAGGCAGCTAACGAAACACAAAGATAAACTGAAACAACACTAATTGTCCGGGCAGTTAGCGAGTGCAGGACGGCCTGTTGTGACGCACAACAGGCCATTTGTACAGTTAGCGCTCCGAATCTGCCTCATCAGGAATCATTCTGATGGAACAGGTTCTTAGGTGAAATTATGACAAATAACGATACGACTCTACAACTGAGCAGCGTACTTAACCAGGAATGTACGCGCAGCGGCGTTCACTGCCAGAGCAAAAAACGTGCGCTGGAAATTATCAGTGAACTGGCGGCAAAACAGCTCAGCCTGCCTCCGCAGGTGGTATTTGAAGCAATCCTGACGCGTGAGAAAATGGGCAGTACCGGCATTGGCAATGGTATCGCCATCCCGCACGGCAAGCTGGAAGAAGATACGCTGCGCGCCGTCGGCGTGTTCGTCCAACTCGAAACGCCTATCGCTTTCGATGCCATTGATAATCAACCGGTCGATCTGCTTTTCGCCCTGCTGGTGCCCGCAGATCAGACCAAAACGCATCTGCATACACTGTCGCTGGTCGCTAAACGTCTGGCGGATAAAACCATCTGCCGCCGCCTGCGCGCCGCGCAGAGTGACGAAGAGCTGTATCAAATCATCACTGACACCGAAGGTGGAAAGGATGAGGCATAACCACCCAATGGCATCTGTTGTGAGGAGAAACGGTACATGGTACTGATGATCGTCAGCGGTCGTTCAGGGTCAGGGAAATCTGTCGCCCTGCGTGCGCTGGAAGATATGGGTTTTTACTGCGTGGACAACCTCCCCGTGGTGCTGTTGCCCGATCTGGCTCGTACGCTGGCCGATCGCCAGATTTCTGCGGCCGTCAGCATTGACGTGCGTAACATGCCTGAGTCCCCTGAAATTTTCGAGCAGGCGATGAACAACCTGCCCGATGCGTTTTCACCACAGCTTCTGTTCCTTGATGCCGATCGCAACACGCTGATTCGCCGTTACAGCGATACGCGCCGTCTGCATCCACTTTCCAGCAAAAATCTCTCCCTGGAGAGCGCCATCGACCAGGAAAGCGATCTGCTGGAACCGCTGCGTTCCCGCGCCGATCTGATTGTCGATACCTCTGAAATGTCCGTGCATGAACTGGCGGAAATGCTGCGTACCCGTCTGCTGGGCAAGCGCGAACGCGAGCTGACAATGGTGTTCGAGTCCTTCGGCTTCAAGCACGGTATTCCAATCGATGCCGATTATGTTTTTGACGTGCGCTTCCTGCCTAACCCGCACTGGGACCCGAAACTGCGTCCAATGACCGGCCTCGATAAACCGGTTGCGGCATTCCTCGACAGACACACAGAAGTACACAATTTTATCTACCAGACTCGCAGCTATCTTGAGTTATGGTTACCCATGCTGGAGACAAACAACCGTAGCTATCTCACCGTAGCCATCGGCTGTACCGGCGGGAAACACCGTTCGGTGTATATTGCAGAACAGCTGGCAGACTACTTCCGCTCACGCGGTAAGAACGTGCAGTCACGCCATAGAACGCTGGAAAAACGCAAAACATGACCGTGAAGCAGACTGTAGAAGTCACGAACAAGCTGGGTATGCATGCCCGGCCTGCCATGAAACTGTTTGAATTAATGCAGGGTTTTGAGGCCGAAGTGCTGTTACGCAATGATGAAGGCACCGAAGCCGAAGCCAATAGCGTCATCGCCCTGCTGATGCTGGACTCTGCTAAAGGTCGCCAGATAGAAATTGAAGCTACCGGCCCGCAGGAGGTCGAAGCGCTGGCGGCGGTGATCGCGCTGTTCAATTCAGGATTCGACGAAGATTAGGCCATTTCGCTTTCGTCTCCTCCTTTCCCCCAGAAAACGCCATCCGGCGTTTTTTTTATTCCTTAGTCTTTCTGCCTGCGTCGATATTGTACATTCCGTGCGCTTTTCCGTCCTGATTAAACCTTAAGAAAACCTTAAGAAGGGTTGTTTACGTTCTATTTAAGTTGAGCGTTCATACTGGTTCTATTAGCAACATTACCAGGAGATCTCATGATACTTTTATCTGAACAAAATCCCCTGGGCACCGGACGCCATCGTAAATGTTATGCGCATCCGGGCGATGCCCAACGCTGCATTAAGATCATTTACAATGCCGACCGCAGCGGCAAGAAAGAGATCCGCCGTGAGCTGAAATATTACGCGCATCTGTCACGTTACCTGCAAGACTGGAGCGGCATCCCGCGCTATCACGGTACGGTAGAGACCGATTGTGGTACGGGCTACGTCTATGATGTGATAACCGATTTCGATGGCAAACCGTCCGTCACGCTGACTGAATTTGTGGCGCAGTGCCGCAATGACGAAGATGCCGCCGTCCTGCGCCAGTTGCTGAAAACGCTCAAGCGTTATATCTACGATAACCGCATCGTCACCATGACCCTGAAGCCGCAAAATATTCTGTGTCACCGCATTAGCGAATCCGAAGTGATCCCGGTGATCTGCGACAACATCGGCGAGAGCACGCTGATCCCGCTGGCAAGCTGGTCAGCCTGGTTTTGCCATCGCAAACAGGAAAGACAGTGGGAACGGTTGATCGCCCAACCGGGTCTGGTGGCGGCGCTGAAGATGAACCGCCAGGAAGAAGACAAAAGCGCGCTGCCGCTCCCCTCTCTTGAGGCCGCGCGTCAGCGCAATCTTAATACAGATGGTTACGCGTCATAAACGATTCGCCGCCCAACTGGCGCATCTGACGCAAAATCCACGCCTGACGGCTACGCACGTAGCCTGAAGGCGCGTTCGCTTTAAAGCGTAAAGGGTTCGGCAGCACCGCCGCCAATAGCGCGGCTTCTGACATACTTAACCGGCTGGCAGGTTTATTAAAATAGCGCTGCGACGCCGCTTCCACGCCAAACACGCCGTCGCCAAACTCCGCAATATTCAGGTAGACGGTCAGAATGCGTTTTTTACTCCAGACGGTTTCCAGCCCGACGGTGAGCCCGGCCTCCAGCCCTTTGCGCAGCCAACTGCGGCTATCCCATAAAAAGAGATTCTTCGCCGTTTGCTGAGAAAGCGTGGACGCCCCACGAATTCGGTTTTCATTGCGCTCATTGTGCGCCAGCGCCTTCTCAATGGCCGAAACATCAAAGCCCCAGTGCTCCGGGAATTTCTGATCTTCCGCCGCAATCACCGCCAGCCCCATCCACGGCGAGATCGCATCCATGCCAGCCCAGTCGGAATGCGCAACATAGCCGAAATCGCCTTGCAGCCACGCGCTAATCTGCCGTTCCACCATCACTGCGGAAAAAGGCACCGGAACGACGCTGAACAGCGCGATGCCGCCGCCCAAAAAACCGCCAGCACCACAAGGATGCGCAGGAGCAAACGGCGCAGGGACGCCAGCGGAGTGAAACGCCCTTTACTCATTCAGCCAGAACCAGTACGCGAGACACCAGTTTTTCAATGCCGGTCGCCGCCTGTGCAATATCCTGCGCCAGCATGTAAGCAGGCGTGGTGACAATTTTGTTATCTTCATCCACCACAATATCGTCAACCGGACATGGCACATGCTCGGCCCCCATATCTTCCAGCACTTCCGCCGTATCGATATCGGTGCCAATCGTCAGACGAAGCGGGAAACCAAAAATCCGGGGCAGCATTGCCGGGGCAATGCACATAAACCCTAACGGCTTACCCGACTGGTGCATGGCGATTGCCAGCGCGGCCAGGTCGCTATCTACCCGGCACTCGCTGCCCTGGCTGGCAAAATTACTCAGGTTTTTCGCCGCGCCAAATCCACCCGGAACAATCAGCGCGTCCAGATCGCTGGAGACCGCCTGAGCGAGTGGACGAATTTCCCCTCGCGTAATGCGCGCGGCTTCGATCAGGACATTTCGCGTCTCCGCCATCGCTTCGCCCGTCAGGTGATTAATCACATCAGCCTGCTGTTTATCCGGCGCAAAGCAGATGGCCTGCGCGCCGCTGCGGGCAATCGCCAGAAGGGTCAGAACGGCTTCATGAATCTCGGCGCCGTCGTATACGCCACATCCACTGAGCACTACGCCAATTTTTTTCATCGTGATGATCCTTTTCGCAACTTACTGAAGCGTATTAATAATTCTGATTAAAATGCTGCGCTTCACACATTTAACTGATTCATGTAACAAAACATTTAAGATTTGCTATCTTAACTGCGTGCGGCCTGAAAAGCAGAGCTGCGCCTGTGTAAAAAACAATCATAACTTACGGCGCAGCCACGATTTCCCTGGTGTTGGCGCAGTATTCGCGCACCCCGGTCAATCCGGGGTCATTTTTTTTCCGCGTTTGCCACCCAGGCTTTCAGCACCGCGACGTCGTGCTGCCACTCCTGCTTCATCTCTTCAACCCATTCACCCACATTATCTTCCCAGGCAGGAAGATCCGGCGACTGAATTTGCTGACCCAGTTGCTGTAGATGACGCAACCCGACGGAGCCCGCCGCGCCTTTAATTTTGTGGCCTTCTTCAACAACGCCTTTGGTATCTCGCGCCGTCAGGTTGGACTCCAGAATGCTTAAGTAACCCGGCATCATTTTTTCGAACACCGCCAGCCCATCCGTAATTAATTTCGGGCCGACCAGTTCGATATACTGCTCTAACATCGGGATATCTAACAACGCTTGCGATTTACTGCTCTCTTCAGATGTCACAGTGCTCTCCTCTTCGTTACGGGTATCCCAGTATTTTTTAATCATGGCGGTTAACGCCGGAACCGACAGCGGCTTGCTCAACACATCGTCCATTCCCGCGTCCAGATACTCTTTCTTGTCTTTCAGGACGTTAGCAGTGAGTGCGACCAGCGGCGGTAATTCTTCCCGCGCATAGCGGCGGGTAAGTTTCTCTGGAGATATCCAGACCCGTCATATCCGGCAGTTGAATATCCAGCAGCAGCAGATCGTATTCACCCGGGGTGAACATTTCCAGCGCGGCTTTCCCGGTCATCGCCACGTCAACGCTGTTGCCCAGTTTTTCCAGCACTGAACGCGCCACAATGACGTTCAGCTCAATGTCCTCGACCAGCAGCACATGCAGCGCAGGCAGCGGCATGTCATTCTCTTCCAGCGTGTCCTCGACCTCTTCCGCCACGGCTGGCGCATGTACGGTTAACGTAAAGACGGAACCTTTACCCGGCTGGCTGGAAACGGTGATGTCGCCGCCCATATTTTTTGCCAGGCGGCGCGATACCGCCAGGCCAATCCCGGTCCCCGTCGCGGGTTTACCGCCATGACTGTCTTTCACCTGATAATACATGGCGAAGATTTTATCCTGCTCATCCTGCGGAATACCGATGCCGGAATCTTCCACGTCAAAATGCAGCACATCGCCTTCGTCATAGCGTACGCGCACCGTCACCTGCCCTTGCTGGGTAAACTTCACGGCGTTGCTGATCAGGTTCCACAGAATTTGGCGCAAACGCGTGCCGTCGGTGATCACCTTATGCGGCAGCGGCAACGTCGGATCGAGCACAAAGCGTAGCCCTTTTTGCTGCGCCTGCAGGCCGGAGAGATTCTCCAGATCCGCCATGAAACTGGTGAAATCTACCGGCTGGTTATCCAGCTGGACTTTACGCCGTTCCATCTTATCCATATCAATGATATCGTTGAAAATATTCCCCAACGTGACCGCAGAGACATGGATGGTTTTCAGGTATTTTTCCTGCTCGCTGGTAAGGTCGGTGTCCAGCAGAATGCGGCTTAAGCCGACAATGCCATTCAGCGGCGTACGCAGCTCGTGGCTGATCGTAGAGATAAACGTGGTCTTGTCGCGGCTGGCGCGTTCAAGCGCGTCCTGATAACGCTTACGTTCCGTAATATCGCGCCCAAAGCCCATCAGGCCATGACGTTTACCAACGCGGTCGTAATAGGGCACCTTGCGAATTTCAAAGCACGCTTTGCGCCCGTCCGGGTAATCGAGCCACTGTTCATAGGTCAGCGAGACGTTATGGCGGAAGACTTTTTCGTCGGTCTCAATCACTTTTTCCGCCGCTTCGGGAGAATAGACGTCAGCCGGTTTTAAATGGACGAGCTGTTTTTCACTCTTGCCGGTCAGCAGTTCCATCGCGCGGTTACAGCCGGAAAACTCTTTATCCTCATTGCGATAAAAAACCAAATCCGGCGACGCATCGAGGAAAGAGCGTAAGAAAGAGGATTGCTGTTCAAGCTGAATCTGCGTCTCTTCACGCTCCTTGATTTCCACTTTTAGCTGTTCGAAGGTCGACTGGCGCTCAGCCTCCGCTTTTTCGCGATCGGCAATTTCCTGATTAAGCTGAGCAATATTGTCTTTCAGTTGTACCGTGAGTTTGAGATCGCGCTCGCGCATCTCTTCCAACTTTTGTACCAGCCGTGACAGGCGCTGCCTGGACTCCTCCAGTTGCTCCACCACCACCGACAAGAAATAGACGGCCCAGGGCGTGATTAACAGGCCAAAAAAGATGGAGCGGATGACATCAATACTTTCGACCTGACCATGCAACACCATGGTGACCGCCATCTGCACCACTATCGCCAGCACCACCAGCGCTAACGCCAACAGCATCGAAAAGCGCACCAGACCCAGCTTCATCATCAGGTCGACATAGTATTGCGCCAGCATACGAATTTGCTTCATAGGGGATTCCTTCACGACAACTTCGCACAATAATACTCAATTCTGAGCAGTACGTTGAAAATTGTGCAATAAATGCGGAGGTTATCGGTGAAATGACCGGGTAGGCGGTAGGCCGGATAAGGCATAGCCGCCATCCGGCAATGTGCGCCAGGCTGCCTGATGGCGCTTCGCTTATCAGGCCTACACGCCTACGGATGGTTAGCTCTCCGGCTGAATCCACGGAGTGCCAAGCGCGGAGCCTTGCGCGCCACGTTCGTTAAGATAGCGGTCCAGCTCCACCATGCCCGTCCAGCGATTCTCACACCATAGCGGGGCGAGCAACGTCGGACGACGGGCGCTGGCGGAAATGCGATGATAAATAACCTCCGGCGGCGTATGGCGGATCATCTCCCCCGCCGTCAGCGTGTACGCCTCCAGCGCAATACCGTTCAGGCGTCCCGCCTCCCAGGCTTTCGCCATAATGCTGCCTTTTACAATATGCAGCGGGTGCAGCTTAATGCCATCCACGCCCGTTTCCACCACGCGATCCAGCGTTTGCAGGCATTCCGTCTGCCCTTCGCCGGGTAAACCCACAATCAGATGCGAGCACACTTTCAGCCCACGCTCACGCGCCAGTTGGGTGGTTCGCTGATAACAGGCAAAATCATGGCCTCGATTAATACGGCGCAGCGTTTTGTCATGCGCGGTTTGCAATCCCAGTTCCAGCCACACTTCATAGCCCTGGTCTTTATATTCGCAGAGCAGATCGAGCACCGCATCCGGCACACAGTCGGGGCGCGTACCGACGCACAGCCCGACAATGCTGGCCTGGCTTACCGCCTGCTGATACATAGAACGCAGCACCTGCACTTCGGCAAAGGTACTGGTGTACGCCTGGAAATAGGCCAGATAACGTTTGGCGCGATTGACCAGATGCGCCTGATGCGCCAGTTGTTCGGCAATAGAGCGATACTGCTGCGCCTCATCGGCGAAAGAGGCGACATTACAGAAAGTACACCCGCCACGTCCGATGGTGCCATCGCGGTTTGGACAGCTAAATCCGCCGTGCAACGTCAGTTTATGAACTTTTTGCCCATAACGACGAGAAAGATCCCCACCAAACATATTGACTAATTTCTGTAACTGCATAATCTGATAGACCGCGCCTTGAAAAGAGGCCAAAGCCTGCCATTTTTAGCCTTTGTCGGCGATGACCTGGATCAATCGCTCCGGCAGGCTTTTATTTATTGCATAATCAAGCAAAATTACCGCAATTTCATTCACTCCATAGCTAATTACTTTTCCTTATAACTGCACACTTTTTTTATCTTCTGATGACACTGGCTAAAAAACAGTATCATGAAATGCCTGCGCCGCTTAAACCAGCATAAAATGCTAAGCATTACGGATAATCAAGGTGAGATAAGGGTTAACAGGCCGTTATGATAGTGAGACAGATCACACTCTCCCCCGCGCTTATGCCGCATCAAATCAATGTAAAAAAGTCATTATTTTCAATACATTCATAAAGTTAATCGACCCATAGCACATTTTTGCATAAATAAGATTGCCATTTGACCTGTGTGTGGATTCCCGATAAGTTGGAAATCCGCTGGAAGCTTTCTGGATGAGCAGCCTGCTCATCATATTTATGCAGTAATTGAGATTCCCTCTTAAACGTATTTACCGATGCGAAAAGGATAAAAGAGGGCGAATGCGAGGTAAGCGTATGATACGCAACCCCCGTCGCCACGCTCTTTCTGTGCCCGTGCGCAATCGGTATCGGATGGGAGTCCCGCAGAGCCTGGGGAGGTTCACTGATATGTTGTACGATAAATCCCTTGAGAGGGATAACTGTGGTTTCGGCCTGATCGCCCACATAGAAGGCGAACCTAGCCACAAGGTAGTGCGTACCGCCATACACGCACTGGCCCGTATGCAGCACCGTGGCGCGATCCTCGCTGACGGTAAAACCGGCGACGGTTGCGGTTTGCTGCTGCAAAAACCCGATCGCTTTTTTCGCATCGTTGCAGAAGAGCGCGGCTGGCGTTTAGCCAAAAAACTACGCTGTCGGCATGCTCTTCCTGAATAAGGACCCTGAGCTGGCGGCGGCATCACGTCGTATCGTCGAAGAAGAATTACAGCGGGAAACCCTGTCGATTGTGGGCTGGCGCGATGTGCCGACCAACGAAGGTGTCCTCGGTGAAATCGCCCTCTCCTCCCTGCCACGTATTGAGCAAATTTTTGTTAACGCGCCTGCGGGCTGGCGTCCCCGGGATATGGAACGCCGCCTGTTTATCGCCCGCCGCCGCATTGAGAAACGCCTTCAGGACGATAAAGACTTCTACGTTTGTAGCCTGTCGAACCTGGTGAACATCTATAAAGGTCTGTGTATGCCAGCGGATCTGCCGCGCTTCTACCTGGACCTGGCGGATTTGCGTCTGGAATCGGCCATTTGCCTGTTCCACCAGCGCTTCTCCACCAATACCGTGCCGCGCTGGCCGCTGGCGCAGCCGTTCCGCTACCTGGCGCACAACGGCGAAATCAACACCATTACCGGCAACCGTCAGTGGGCGCGCGCCCGTACTTATAAATTCCAGACGCCGCTGATCCCGGATTTACAATCCGCCGCACCGTTCGTCAACGAGACCGGTTCCGACTCCAGTTCGCTGGATAACATGCTGGAACTGCTGCTGGCAGGCGGGATGGACATCGTGCGCGCCATGCGCCTGCTGGTGCCGCCCGCATGGCAGAACAACCCGGATATGGACCCGGACCTGCGCGCCTTCTTTGACTTTAACTCCATGCACATGGAGCCGTGGGATGGCCCGGCGGGCATCGTGATGTCCGACGGTCGCTTCGCTGCCTGTAACCTCGATCGTAACGGTCTGCGTCCGGCGCGCTACGTCATCACCAAAGATAAACTGATCACCTGCGCCTCTGAGGTGGGGATCTGGGACTACCAGCCTGACGAAGTCGTGGAAAAGGGCCGCGTCGGCCCCGGTGAACTGATGGTGATCGACACCCGTGGCGGGCGCATCCTGCACTCTGCGGAAACCGATGACGATCTGAAAAGCCGCCATCCGTATAAAGAGTGGATGGAGAAGAACGTCCGCCGCCTGGTGCCGTTTGAAGATCTGGCCGACGATCAGGTCGGAAACCGCGAGCTGGACGACGACATGCTCGCCAGCTATCAGAAACAGTTTAACTACAGCGCCGAAGAGCTGGATTCCGTCATCCGCGTACTGGGCGAAAACGGCCAGGAAGCGGTCGGTTCAATGGGTGATGACACCCCCTTCGCCGTGCTTTCCAGCCAGCCGCGCATTATTTACGACTACTTCCGCCAGCAGTTTGCGCAGGTGACCAACCCGCCAATCGACCCGCTGCGTGAAGCGCACGTCATGTCACTGGCCACCAGCATTGGCCGCGAGATGAACGTCTTTTGCGAAGCGGAAGGACAGGCGCACCGTCTGAGCTTTAAATCGCCAATTCTGCTGTACTCCGATTTCAAACAGCTCACCACCATGAAAGAGGAGCATTACCGCGCCGATACGCTGGATATCACCTTCGATGTGACGGAAACCTCGCTCGAAGAAACGGTGAAAGCGCTGTGCGATAAGGCTGAACAGATGGTGCGTAACGGCACCGTCTTGCTGGTGCTCTCTGACCGTAACATCGCCAAAAACCGTTTACCGGTTCCAGCGCCGATGGCTGTCGGGGCGGTACAGACTCGTCTGGTGGATAAGAGCCTGCGCTGCGATGCCAACATCATCGTAGAAACCGCAAGCGCGCGCGACCCGCACCACTTTGCCGTGCTGCTGGGCTTTGGCGCCACGGCGATCTATCCGTATCTCGCCTATGAGACGCTGGGCCGTCTGATCGACACCCAGGCCATCGCCAAAGACTACCGTACCGTGATGCTGAACTACCGTAACGGCATCAATAAAGGTCTGTACAAGATCATGTCCAAAATGGGCATTTCGACCATCGCCTCATACCGCTGCTCCAAACTGTTTGAAGCGGTCGGTCTGCATGACGACGTGGTGAGCCACTGCTTCCAGGGCGTGGTCAGCCGCATTGGCGGCGCCGGTTTTGCCGACTTCCAGCAGGATCTGCTGAACCTCTCGAAACGTGCCTGGCTGGCGCGTAAACCGCTGGATCAGGGCGGTCTGCTGAAATACGTGCACGGCGGCGAATACCACGCCTATAACCCGGATGTGGTGCGCACCCTGCAACAGGCGGTGCAGAGCGGCGAGTACCGCGATTACCAGGCGTATGCCGCGCTTGTTAACGAACGTCCGGCGGCAACGCTGCGCGATCTGCTGGCGCTCAATCCTGGCGATAACGCCATCAGCATCAACGATGTTGAACCGGCGAATGAGCTGTTTAAACGCTTCGATACCGCCGCGATGTCCATCGGCGCACTGAGCCCGGAAGCGCACGAAGCGCTGGCGGAAGCGATGAACAGCATCGGCGGTAACTCCAACTCCGGCGAAGGCGGCGAAGACCCGGCGCGCTACGGCACCAACAAAGTGTCGCGCATCAAACAAGTCGCTTCCGGCCGCTTCGGCGTAACGCCAGCGTATCTGGTTAATGCTGACGTCATTCAGATTAAAGTCGCTCAGGGCGCGAAGCCGGGCGAAGGCGGCCAGTTGCCGGGTGACAAAGTGACCCCGTACATCGCCAAACTGCGCTATTCCGTACCGGGCGTGACGCTGATTTCTCCGCCGCCGCACCACGATATCTACTCTATCGAGGATCTGGCGCAGCTGATTTTCGACCTGAAACAGGTCAACCCGAAAGCGATGATCTCCGTGAAGCTGGTTTCTGAACCGGGCGTCGGCACCATCGCTACCGGCGTGGCGAAAGCCTATGCGGATCTGATCACTATCGCCGGATATGACGGCGGCACCGGCGCCAGCCCGCTCTCCTCCGTGAAATATGCGGGCTGTCCGTGGGAGCTGGGCCTGGTGGAAACCCAGCAGGCGCTGGTGGCTAACGGTCTGCGTCACAAGATCCGTTTGCAGGTGGACGGCGGCCTGAAAACCGGCCTCGACATCATCAAAGCGGCGATTCTCGGCGCGGAGAGCTTTGGCTTCGGCACCGGCCCGATGGTGGCGCTGGGCTGTAAATACCTGCGTATTTGCCACCTGAACAACTGCGCGACGGGTGTGGCGACTCAGGATGACAAACTGCGTAAGAACCACTACCACGGCCTGCCGTTCAAAGTGACCAACTACTTTGAGTTTATCGCCCGCGAAACCCGCGAGCTGATGGCGCAGCTTGGCGTGAAGCGTCTGGTGGATCTGATTGGCCGTACCGACCTGCTGAAAGAGCTGGAAGGGTTTACCGCTAAGCAGCAGAAGCTGGAACTGTCGCGACTGCTGGAAACCGCTGAGCCTCATCCGGGTAAAGCGCTGTACTGCACCGAGCATAACCCGCCGTTTGATAACGGCGTGCTGAACGCGCAACTGCTGCAACAGGCGAAACCGTTCGTCGATGAGCGCCAGAGCAAAACCTTCTGGTTCGACATTCGCAACACCGACCGCTCCGTGGGCGCGTCGCTCTCTGGCTATATCGCTCAGACGCATGGCGATCAGGGGCTGGCCTCTGACCCGATCAAGGCGCACTTCAGCGGCACCGCAGGTCAGAGCTTCGGCGTCTGGAACGCGGGCGGTGTGGAGCTGTATTTGACGGGCGATGCCAACGACTACGTCGGTAAAGGCATGGCGGGCGGTCTGCTGGCGGTGCGTCCTCCGGTGGGTTCAGCGTTTCGCAGTCATGAGGCCAGCATCATCGGCAATACCTGTCTGTACGGCGCGACCGGCGGTCGTCTGTATGCGGCGGGCCGCGCGGGCGAACGTTTCGCAGTACGTAACTCCGGGGCCATCACCGTGGTAGAAGGCATTGGCGACAACGGCTGTGAATACATGACGGGCGGTATTGTCTGTGTACTGGGCAAAACGGGCGTTAACTTCGGTGCTGGCATGACGGGCGGCTTCGCCTACGTGCTGGATGAAGACGGCGAGTTCCGCAAACGCGTGAACCCGGAGCTGGTGGAAGTGCTGAACGTCGATGACCTGGCCATCCACGAAGAACATCTGCGCGGTCTGATTACCGAGCATGTGCAGCATACCGGTTCCCAGCGCGGTGAAGAGATCCTGGCTAACTGGTCAGTATTCTCAACCAAATTCGCGCTGGTTAAGCCGAAGTCCAGTGATGTGAAAGCACTGTTGGGTCACCGTAGTCGTAGCGCAGCTGAGCTGCGTGTGCAGGCGCAGTAAGGGGTAGAGCAATGAGTCAGAACGTATACCAATTTATCGACCTGCAACGTGTTGATCCGCCGAAGAAGGCGCTGAAGATCCGCAAAATCGAGTTTGTTGAAATTTACGAACCGTTTTCCGAAGGCCAGGCCAAAGCGCAGGCGGATCGCTGCCTCTCCTGCGGCAACCCGTACTGCGAGTGGAAGTGTCCGGTACATAACTACATCCCGAACTGGCTGAAGCTGGCGAACGAAGGACGTATTTTCGAAGCGGCGGAGTTGTCTCACCAGACCAACACGCTGCCGGAAGTGTGCGGCCGCGTCTGCCCGCAGGATCGTTTGTGCGAGGGCTCCTGTACGCTCAACGACGAGTTCGGCGCAGTGACTATCGGCAACATCGAGCGTTACATCAACGATAAAGCGTTCGAAATGGGCTGGCGCCCGGACATGACTGGCGTGCGTCAGACAGACAAACGCGTGGCGATTATCGGCGCGGGCCCGGCAGGGCTGGCCTGTGCCGACGTACTCACCCGTAACGGCGTGAAGGCGGTGGTCTTTGACCGTCACCCGGAAATCGGCGGTTTGCTGACGTTCGGCATCCCGGCTTTCAAGCTGGAAAAAGAGGTGATGACCCGCCGTCGTGAGATCTTCACCGGCATGGGCATTGAGTTCAAACTGAATACCGAAGTGGGTCGCGACGTGCAGCTGGACGACTTGCTGGCAGAGTACGATGCGGTGTTCCTCGGCGTTGGCACCTATCAATCAATGCGCGGCGGGCTGGAAAACGAAGATGCCGACGGCGTGTTCGATGCCCTGCCGTTCCTGATTGCCAACACCAAACAGATCATGGGCTTTGGCGAAACCACTGACGAACCGTTCGTCAGTATGGAAGGCAAACGCGTCGTGGTACTGGGCGGCGGCGACACCGCGATGGACTGCGTGCGTACCTCGGTTCGACAGGGCGCAACGCACGTCACCTGCGCCTATCGTCGTGACGAAGAGAACATGCCTGGCTCCAGACGCGAAGTAAAAAACGCGCGCGAAGAAGGGGTCGAATTCCAGTTCAACGTGCAGCCGCTGGGTGTAGAAGTGAATGCCAACGGTAAAGTAAGCGGCGTGAAGATGGTGCGTACCGAAATGGGCGCGCCGGACGCGAAAGGTCGTCGTCGTGCGGAAATCGTCGCCGGTTCCGAGCATGTGATCCCGGCAGATGCGGTGGTAATGGCGTTTGGTTTCCGCCCGCACAGCATGGCGTGGCTGGCGAAACACAGCGTGGAGCTGGACTCCCAGGGGCGGATTATCGCGCCGGAAGGCAATGAAAACGCCTTCCAGACCAGCAACCCGAAAATCTTCGCCGGTGGCGACATCGTGCGCGGCTCGGATCTGGTGGTCACGGCAATTGCCGAAGGTCGTAAAGCGGCAGACGGTATCCTGAACTACCTGGAAGTGTAATACTGCTGGCTCCCCTGCCACAGCCTGACGCTGATGGATGGGGAGCCAATATTTCTCTGCTAAATACATGCGAATTCATTCTTCGAAAGCCGCTCGCAAATACAGCTGTAACTCAATGTAATACACCTCTCCTGGATTGACCGGGATATGCCTGAAGCGTATTGGTAGCGATATATTTTTCATTTCGCAACTATCGCGCCAGGGATGACATGACTGTACCAACACCACTTCAACACGCTGACGATTATCAGCAAATCCATGACGGTATTATTCGATTAGTGGACAATGCCCATACCGAGGCCGTCCGCAGTATTAATGCTTTGATGACGGCAACCTACTGGGAAATCGGCAGGAGAATTGTCGAGTTTGAACAAGGCGGCGAAGCACGAGCGGCTTATGGTATTCAACTCATTGAGCGGTTATCCGCAGATTTAAGCCAGCGCTATAAGCGTGGGTTCTCCACCGCAAACCTACGACAAATGCGCATGTTTTATCTTTATTTTCAAGATATTGAAATTCAGCAGACACTGTCTGGTGAATCTTCAAATCTCATTTACCTCGCTAAAGTTTTTCCGCTTCCCTGGTCAGCTTACGTGCGTTTACTATCGGTAAAAAATCCCGATGCCCGCACTTTTTACGAGAAAGAGACGCTACGTAACGGCTGGTCTGTGCGGCAACTCGATCGGCAAATCTCCACCCAGTTCTACGAACGGACGCTACTGTCCCATGACAAATCCGCCATGTTGCAGCAGCCTGCGCCCGCCGAGCCAACAGTCTTGCCGGAGCGAGCCATACGCGACCCGTTCATTCTGGAATTTCTTAACCTGAGGGACGAGTATTCCGAGTCGGATCTCGAAGAAGCGCTGCTCAGTCACCTGATGGACTTTATGCTGGAGCTTGGTGACGATTTTGCTTTTGTCGGCCGCCAGCGCCGATTACGCATAGACGATAGCTGGTTCCGCGTCGATCTGTTGTTCTTCCATCGCCGCTTGCGTTGCCTGTTTGCTCGTTGACCTGAAGGTCGGCAAATTCAGTTATGCGGATGCCGGGCAGATGAATATGTACCTGAACTATGCCAAAGAGCACTGGACAATGCCGGGAGAAAACCCGCCCGTCGGGTTGGTTTTGTGCGCAGGAAAAGGCGCGGGGGAAGCGCATTACGCGCTGAACGGCCTGCCAAATACCATCATGGCGAGCGAGTACAAAGTGCAGTTGCCTGACGAAAAACTACTGGCTGATGAGCTCATTCGTTCACAAATAGCACTGACTGCACAGCAAGTTGAATCAACTGAATAACCAATTTACTTTGCACATCAGGAAAAGCACTGCGTCACGAGCTGAATTTTCAACAACTCGACTGACGGGATTATGCACTATCGGGAGATATAAAGCGTTACACCGGATGGCTATAAACCATCCGGTATTTTCATACCTCTAATAAGCTGGCGCGCACCTTCACCACAGCTTTCTTAATCTCCCCTGACTCGCCAGCCATACACCCGGGCTTATGGGGTTCCCCCGGCATAAACACGGAGAACATCCCCGGTGTCAGCGTAATGGACTGCTGTTCATGGATACGGCGGCAAAGCTGGTAATCCTCTTCACCATGCAGCTCCTCGCATTCGCACGCTGACCCGACTACGCCGAACAGAATACGCTCCTCACCCATCAATAACAGCTGAATGTCGATGTACTGCACATGCAGCTCCGCTTTTTTCTCCTGCGGCGACTGCGTGGCAAACTGCATCACATTCATAAACACATCGTCGCCCTGCAATTCGTAGCGCCCGGGCGTTTTCTCCTGCGGCCTTGCCGCCAGCGCCAGGGTTAACGCATCCAGCAATACCGGATGCAGCCCGGCAGACGGTAACGACTGAATTTCTCCCATCAACATCATTTATCTCCCTGAGCTAACAACGCGGCGCCCAGTAGCCCCGCGTCATGTCGGTAATGCGCTGCGCTCAACGCCACCTGATAAACCGCCGGTTCCTGCGCCAGAAAGTGGCGTACCTGCGCCAGATATCCCTCAGCTAACCCTACGCTGCCGCCAATAACCACCTGCTGGCAATCCGTTGTCGCTTTAACATCGGCAATTAACCGCGCGACAACCTGCGCGGAGTGCTGAACCAGACGCACAGCCTGTTCGTTCCCGGCAGCCGCATGTGCAAAGACAGTTCTGGCGTCGCACCCGGCCAGGGAACCCTGCGCCGCTGCCGCAATGCCGCGTCCTGAGGCGATAGCCTCCACACAGGCCGCGTCGTCCGCAACCGCAAACCGGCCCCTGCGGATCGGCCAGCGTATGCCCCAGATGCCCGGCCAGACCGCTCATCCCGGTGAGCAATCTGCCGTCGCTCACCACGCCGCCGCCAACGCCGGTCGACACGGTAATAAACACCATATCGCGGACATCAGACGTAAGGGCATGGTATTCCGCCCATGCCGCCGCCTGCGCGTCATTCACCGCCAGCGTCGGCAGACCGGTAAGATCTTCTAACGTCTGCACCAACGGAAAATGCAGCAGACCACCCAGGTTATGCGGATTTATCGCCAGTAGCGCGCCTTCTCTGATAATGCCGGTAGAGGCTATCGCCACCCGCTGCGCGGTCGTTTGCAGCGGTTCAACCAGCACTTTTAGCGCCTCTCGCAGCGCATCCGGCGTTTTGCTGGCGGGGGTCGGCAATTCACGTCGTTCGCGGATGCGTAAATCATCATCCACCCGCGCGGCAGCCAGCTTGGTGCCGCCAATATCGATCGCCAGCGTGCTCATGACACCGCCTTTTTCAATGCCGTGTTGTACCACTGACAAATATGTTCCAGCCGCGTAATGGCAGATCCCACCGTCACCGCCCACGCGCCGTGACGCATAGCTTCAGCGGCCTGTGCCGGCGTGTTGTAACGCCCCTCGGCAATCACCCGGCATCCAGCATCATGCAGGACTTTGACCAGCGCGAGGTCAGGTTCGTCTGGCGTGTCCGGCGTGGTGTATCCCGACAGGCGTCGTGCCGATAAGTTCCGCGCCACGCCGCTGACAGGCTAAGCCATCCTCCAGTGACGAGCAGTCGGTCATCGCCAGCAACTGGTGCTGATGAATACGCGCCAGCAGCGCATCGACGGTCGCCGGACGTACACGGTCAGTACCGTCAACCGCAATAATATCCGCCCCGGCCTGTGCCAGCGCATCCACATCCTCAAGGAATGGCGTGATGCGAACCGGGGAGTCGTCAAGGTCACGCTTGAGAATACCGATAATCGGTACGGAGACCAGCGCACGGGTCGCCCGCAAATTGTCCACGCCTTCAATGCGTAGCGCGACGGCGCCAGCCTGCTCCGCCGCCAGCGCCATCGCAGCCACGATGTCCGGTTTATCGAGCGGGCTGCCAGGAACCGGCTGGCAGGAGACAATCAACCCGCCATTGGCGGCAATGTTTTTATCCAGTTGTTCAAGTAACGACATATCCACTTCCTTACGCAGGTCGCCCGGCAAACTGGCCGGGCAAACCTCATCAGCTTTTGGTTTTTACAAAGGCGCTTTTGTCGCCGCCAAACGGCACGGCCCCGCTGAATGGTTTTCCGTCGATAGCATCATGCATACGTAATGCTTCAGGACGCAGCCAGCGCTGTACGCGAGACGGCATATCCAACCCAATCAACAGGATCACCACAAAGGTCAGGCTGAAGGAGAGCGATCCCAGCGCGGTGCCCAGATCCAGACGCTGTGCGATCAGCGCCCCGAGGATAGGGGCCAGCGCGCCGCCCAGTGCGCCGACGTTATAGGTAAAGCCCAGACCTGCCGCTCGTTGGTCGGTATCAAAATAACCGCCAATGAGTTTCGGTAAGATCCCGGAGATCCCCTGCCCCAGCATCTGCTGGAAAAACAGTAGCAGACCCAGTACCCAGACGTTTGCGCCGCCAATCGCAAATACCGGAATGATCAGCACCTGGGAAGCCAGCAAACTGCAAACGTAGGCTTTGCGCGTACCCAGCCAGTCGCCGAGGAAACCGCCCACGCAGCAGCCGACTGCCGCCCCAAAGCCGCTAAAGAAAAGCACCTGCGCAACGGTGTGAGGATCGTAAGCCAGTTCAGTTTTCAGATACGTTGGCAGCAGCGCCTGAATCGGCCATGAATAGAGGAAAGCAAAGAGCACCACGATCATCAGCATCACGCCCGTTGGCCAGCGCTTGCCGCTGCTTTGCACCATAAAACTGATGAAGATAACGGCGCACAGTAACCCTAACACCGCCACAATCGCCGCGTTTTGCAGGTCGCCCGCAAAGCAGAACCACAGTGCGGTTGCGGCGGCGATCGTCATCAGCACATTGACGACGCGATGTTCGCCCCGGTAGAGGATGTCCACCATCGTGCGCACCGGCGCTTTACCTTCGTGTTTCTCTTTCCAGTCTTCCGCTTCAGGGATGTTTTTACGCAGCCAGAGCGCGAAAATGATCGCAGAATGCCAATGAAGAACAGCGCGCGCCAGCCCCAGATCGGAACCACCAGGCTGTAAACCTGCGCCGCCACGACCGCCCCGACGGAGAAACCGGAAATCAGGAATCCACTGGCTTTATTACGTAAGTGCTTCGGCCAGCTTTCAATGACATAGGTGGCACTGGAGCCATATTCGCCCGCCATTCCCATACCGATCACCAGACGGGCGATAAACATGGTGGTATAGCCGGGCGCAAAGCCGCAGGCCAGCGTCCCCATAGAGAAAAGAATGATACTGGTGACCATCGCCAGACGTCGGCCATAGCGGTCACCCATTGCGCCCAACATCAGCCCGCCAAACCAGCGGGAAATAAACGCCGCCGAGATCAGGCTGGCCGCCTGAACCGTCGTTAACCCAAACTCGCCCTGCACTTCCGTCAGAACAAGAGCAATCAACACAAAATCAAAACCATCAAGCAAATATCCCAGCCAGGCGGCGGAAAATGCCCGCCACTGCGCCCGGTTGAGATGGCGATACCACGGGATATTCTGGGTAGAAGTACTCATTTTTCATTCTCCCGACGATTTTTATTGTCGCTTTGCCGGATGGCGGCTTCGCCGCGCCTTACCCGGCCTACGCTTCATATTTTGTAGGCCGGATAAGCGTTAGCGCTATCCGGCAACCGGTCAACCGCGTTCCTGCATTAACTGCTGCGCCAGCGCTTTCAGTTCAGGCAGATACTTCTCATCAACCGGCGCAAACGGCTTACGGCACAGCGGTACGGAGACCACGTCCATGTAGTGCAGCACCGTTTTCAGCCCACGGAACACGCCGGTTTTAATGAGCAGATCGATGACCTTATTGCACTCGGTTTGCAGATGCTGCGCTTTCGCTACATCGCCTTCCTGCAAGGCCTTAACAATCCCCTGATAACGCCAGCCCATAATGTTGTAAGTGCTGCCGATTCCCCCGTCTGCGCCTGCCAGCAGACCGGAAGCGAAGATTTCGTCATAACCGTTGTAGAGCACCAGATCAGGATGCGCACGGCGGATCTGCTCCATCTGATAGAGATCGCCGGAGGTTTGTTTCAGCGCGCCAACGCCGCGGCAGCGTAACCAGCGTGTTGATTTGCTCCAGCGTCAGCTTCACGCCGCTCAGCGCAGGAATGTTGTACACCACCATCCGGTAATCCTTCAGCGGAATCGATAATGGCGCGGTAGTGATCGCAATGCTCTTCAAAGCTAAACGGATAATAAAACGGCGTCACGGCGGAAACGGCATCAAAACCGTAACGATGCGCCGCGCTGGCCAGTTGCTGGCTCTCTTCGGTACTGACCGTACCGACATGGGCAATCAGCGTCACCTTGCCCTTCGCCTCTTCCGCCACAATTTCCAGGACCTGCTCTCTTTCTGCGCGGCTCTGAACGAACGCTTCCCCGGTCGAGCCGCCGACGTACAGCCCATCAATGCCCTGTGCAATGTTGAAACGCACCAGACGGCGCAGGCTCTCAATATCCAGTTTCTGCTGGTTATCAAAAGGAGTTAACAACGCTGGCATTACGCCTCGTAAATCTTTTGCCATAACGACCTCTGTCGATGATTAGTGTTATCTGACTACATACCTTTATACCTGTTATACCAGATCAAATAAGCAGCACCCCAGTACAGAACGCACAGAGTGCGATCTACTTCACTAAACGATCGTCAGGATCGCGATCGCTAACGCAATTTTTTTCTTTTGATCGAAGGCGTGCCAGGTGGCGGATACGCTATTGAGGTGGGTTTGCAGCGCGCGGTCAGCTTCATCTGGATCGCGCCGACGAATGGCATCGACAATAGCGATATGCTCTTGATAGCTCACGCTATTATGCTCATGTAACTCCTGCCCGGAAACCGCAGGACGCGCAGCGATCAGCCAGTCCAGCAGCGCGACATGAATCGCCATGAAGATCGGATTGCCGGGGATTTCCGCCAGCACGCGGTGAAACTCGACGTCGGAACGAATGAAGAGCGCGTTATCGTCCAGCGACTGGCTGTTGATCTCCAGGGCTTTCGCCAGCAGATCAATTTGCTCATCCGTCGCGTGTTCCGCCGCATAGCGCACAAGGCTTGATTCAAAGAACAGGCGCAGTTGCTCAAAGTGGGCGATACCGCCGGGGTGCGAGAGGAAATCTTTCGCCATACCGGAAAGTTCGCTGATGATGGTATCAGCAGAAGGCCGCGACACGCGGGCGCGTTCGCCGTTATTGATTTGCACCAGACCTTTACGCTTGAGCGCGGCAAGCGCTTCACGCACCGACGGGCGGCCCACGTTGAAGAAGGTCATCAACTCCCGCTCAGACGGCAGCTGCTCCCCTTCGCCGAATTCACCGCGACGAATCATCTGCTCCAGTTCTTCTTCTACCATTTCCGACAGTTTTTTACGCGCCAGCGGACGGCTGCGCAAACTACGGCCGATGGTTGGCGGAGAATTTTCTGCTTGCGAATCAAATGCGTTCATAGCGTCCATTATGTAAATCGTCGAGGAAAACAGCAACTAGGGCCATCCTATCACAGGATCGAAAGTGGGGTGAAATCGTGGCAGGCGTGAAAAAAAACAGGCCCTCAGGGCCTGTCTGTCTTGGTTACTTCACTACGCGTAATGCGGGTCGACCACCACGCGGCGGCGGCGGATCGTCATCAGGATTGTTGTCATCGTCATGATCGGGCTTATCACCATCAATCACGGACATGACCGTTTCGCTCTCCGCGCCCGGCGTTACATCGTCATCATTTAAGCTGGCGACGTCTTCATCGTAAGCGGCTTCAGGTTCAAACATTGTGCCTGCGCCGTTTTCACGCGCATAGATCGCCAGTACCGCAGCCAGCGGTACGGAGACCTGGCGTGGGACGCCGCCAAAGCGCGCGTTGAAACGCACTTCGTCGTTGGCCAACTCCAGATTGCCAACCGCACGGGGAGCAATGTTCAGAACAATTTGCCCGTCGCGCGCATATTCCATAGGGACATGCACGCCAGGCAGCGTCACATCCACCACCAGGTGCGGCGTGAGCTGGTTATCCAGCAACCATTCATAGAATGCACGCAGCAAATATGGACGGCGTGGGGATAGCTGTGACAAATCCATACTGATTAACCCCGACCGAGGCGCATTTCACGTTCAGCTTCAGTTAAAGAAGCGAGGAACGAGTCACGTTCAAAGACGCGGGTCATATAGCCTTTCAGCTCTTTAGCGCCCGCACCGCTGAACTCAATACCCAGTTGCGGCAAACGCCACAGCAGCGGCGCCAGATAGCAATCAACCAGGCTGAACTCATCGCTCAGGAAATACGGTTTTTGACCAAACACAGGCGCGATCGCCTGCAACTCTTCACGCAGTTGTTTACGCGCAACGTCAGCTTCAGAAGCGGACCCGTTCACGATGACGTTCATCAACGTGTACCAGTCTTTTTCAATACGGTGCATGTACAGGCGGCTTTCGCCACGCGCAACCGGATAAACCGGCATCAGCGGCGGATGAGGGAAACGCTCATCCAGGTATTCCATAATGATGCGAGATTCCCACAGGGTCAGCTCACGATCCACCAGGGTCGGTACGCTCTGACTCGGGTTGAGGTCAATCAGATCCTGAGGTGGGTTATCCTTCTCCACATGCTCGATCTCGAAACTAACACCTTTCTCCGCCAGCACAATGCGGACCTGATGGCTATAGATGTCAGTAGGACCAGAAAACAGCGTCATTACCGAACGTTTGTTGGCAGCGACAGCCATGAAAACCTCCAGGTATATTCAGAATTTTTACTGCTACCAGCCACCGCGTGGCCAGCCAGAAGTTATGTCACCCGCCTGCGGAAAAGTCATTATTGTTCAAAAAGCAAACAAAAAATGAACAATACCCGTTATTTGGGCAGAAAATTGGATGATAGTTTACCAGATTTTGTGACCTTTGTGGTGAGTCGATTCTGGAAATGGGGAAAAAGAGGTGGCAATCAGAGGATTACTTTGGACACACGCGCATTTTTTCCATAAAAAAACCCGCCGAAGCGGGTTTTTTCGCCAATGGTAATCTGCCGGAGCAGAAACCCATTAACGTTTGGAGAACTGCGGACGACGACGTGCTTTACGCAGACCGACTTTCTTACGTTCAACCTGACGAGCGTCACGAGTAACGAAGCCAGCTTTACGCAGTTCAGAACGCAGAGACTCGTCGTACTCCATCAGAGCGCGGGTGATACCGTGACGGATCGCACCAGCCTGACCAGAGATACCACCACCTTTAACAGTGATGTACAGATCCAGTTTCTCAACCATGTCGACCAGTTCCAGCGGCTGACGAACTACCATGCGGGCAGTTTCACGACCGAAGTACTGTTCCAGAGAACGTTGGTTGATAACGATTTTGCCGTTGCCCGGTTTGATGAACACGCGAGCTGCGGAACTTTTGCGGCGACCAGTGCCGTAGTATTGATTTTCAGCCATTGCCTATAATCCCGATTAGATGTCAAGAACTTGCGGTTGCTGTGCCGCGTGGTTGTGCTCGTTGCCAGCGTAAACTTTCAGTTTACGGAACATAGCACGACCCAGCGGGCCTTTTGGCAGCATGCCTTTAACCGCGATTTCAATCACACGCTCAGGACGGCGAGCAATCATCTCTTCAAAGGTCGCTTGTTTGATACCACCGATGTGACCGGTGTGGTGATAGTACACTTTGTCAGTACGCTTGTTGCCGGTTACAGCAACTTTGTCAGCGTTCAGAACGATGATGTAATCACCGGTATCGACGTGCGGAGTGTATTCCGCTTTGTGCTTACCGCGCAGGCGAAGAGCCAGTTCAGTAGCCAGACGGCCCAGAGTTTTACCGGTCGCGTCAACAACGTACCAGTCGCGTTTTACGGTTTCTGGTTTAGCTGTAAAAGTTTTCATTAAAAGCTTACCCAATAGATAGTTACACGTTGGTGAACACCCAAACGTTTTCAATTGTTGAGGTTCACACGACAAAGTCCGGCAAACCTACCCCTTCGAATAGCCAATGCCAGCACACAAAAAGTTTTGGGAAAAAAACTTTCTTGTAACGTGGGGTCGCAGGATTATAGAGAAGTCAGGGGCAAAGATCGACCCCTTTTTGTGATTTGCAATGGGTTTTGTGCGGGGCATTTCTGCCGGTCTGATAAGCGAAGCGTCACCAGGCAACCTGCGCCGGATGGCGGCGAAACGCCTTATCCGGCCTACGGCATATGCGAGCGTTTCAGATACTCCTCGCTCTGCATCTCCTGCAAACGCGACAAACAGCGCTGGAACTCAAACTTCAGGCGTTCGCCCTGATAGATCTCATATAAAGGCGCGTCGGCGCTCACCACCAGCTTCACATGGCGCTCGTAGAATTCATCCACCAGCGCGATAAAACGGCGCGCTTCGCTTTCCATCAGCGGCGTCATGACCGGTACATCAAACAGCAGAACCGTGTGGAACAGGCGCGACAGCGCAATATAGTCATGCTGGCTGCGAGCGTCCACGCATAAGGTGATGAAGGAGACCGCCAGCGTCTGGTTTTCCACCCCCATCGTCGGTAACGGACGATGATTCACCTCCAGCGTTGGCGCATTCTCACGTTTCGCACCGGCCAGCGCCAGCCAGAGTTTATCCATCTGTTGCTGCGTTTCGTCATTACGCGGCGAGAGCCACAAATGCGCCTGCGTCAGCGTGCGCAGACGATAATCCACCCCGGCATCCACATTCATAATGTCGCAATGCGCTTTGATAGCGTCGATAGCCGGAAGGAAGCGGGTACGTTGCAGGCCATTGCGGTACAGCTCATCCGGCGGGATGTTCGAGGTCGCCACCAGGGTAATGCCGCGGGCAAACAGCGCTTTCATTAAGCCGCCAAGCAGCATCGCGTCGGTGATGTCGGAAACAAAAAACTCATCAAAGCAGAGCACATCCGTTTCGGCTTTGAAACGGTCGGCAATAATCTCCAGCGGGTCGGTCTGCCCCTGTAGCGCCGTTAGCTCCTCATGCACGCGCAGCATAAAGCGGTGGAAATGCAGACGCTGTTTCCGCTCGCCCGGCAGACTATGATAGAAGAGATCCATCAGCCAGGTTTTCCCTCGCCCGACCCCGCCCCACATATACAGACCACGCACCGGCGCATTCGCTGACGGTTCACGTTTGCCAAGCAGTTTACCGAAACGGGCCATCAGTCCGCCGGTTTGCGGCGCCGGAGGCGTTTTGGCGATCAGCGCCTGGTAAATCGACTCCAGTCGGTTGACCGCATCTTTTTGCACGTCGTCGGGTTGATGGCTGCCATCGTTGAGGGCCTGAAGGTAACGCGATGTGGGGGAAATGCTTTGCATGATGTTATTGTTATTCCTTGAAAATCGATGTGCCGACGTTCACGGCTGACGAAAAAAAGGCCGTTCTACATTACGTGATATAAACACGGGATTCCACTTCTACGGATTAGCGGTTATAGTGGCATAATCAGGCGCAGGCATGGAGCCTAAAGCCAAACACCCTACGGAAACAAAAGACAACGGGAGATGTTCATGACCTGGGAATATGCGCTAATTGGGTTAGTCGTCGGTATCATCATTGGTGCCGTAGCCATGCGTTTTGGTAATCGTAAATTACGCCAACAGCAGGCATTGCAGTACGAACTGGAAAAGAATAAAGCTGAGCTGGAAGAGTATCGTGAAGAGCTGGTCAGCCACTTTGCCCGCAGCGCCGAGCTGCTGGATACCATGGCGCATGATTATCGTCAGCTGTATCAGCACATGGCGAAAAGCTCCAGCAGCCTGCTGCCGGAACTGTCTGCGGAATCGAACCCGTTCCGTAACCGTCTGGCGGAGTCTGAAGCCAGTAACGATCAGGCGCCGGTCCAGATGCCGCGTGACTATTCCGAAGGCGCTTCCGGCCTGCTGCGTAGTGGCGCAAAACGCGACTAATCGCACATCGTTAAGAAATTTTTCGGGCGCAGCGGTTGCGCCCGTCTCTTCTCTCCTGTCCCAGCTATTATTTAGTCATTAACCTCATTGTTACCAGGCCGAAAATTCAATAACATCAAACTGTTTTGAATCGTCTTTCCGTTCACTCAAGGTACGAGAGCAGGCTCCATGAAAAAACAAATCCCGTTGTTAAGTGCATTAGCGTTAAGTGTCGGGTTAACGCTCTCGGCACCGTTTCAGGCCGTCGCATCGATACCCGCGCAGATTCCCGGTCAGGCGGCGCTCCCTAGTCTCGCGCCAATGCTGGAAAAAGTGCTGCCGGCCGTGATTAGCGTGAAGGTTGAGGGAACCGCGACCCAAGCCAGAAGGTTCCAGAAGAGTTTAAAAGTTTTTTGGCGACGAGTTGCCTGATCAACCCTCTCAGCCGTTCGAAGGGTTGGGGTCGGGCGTCATCATTGACGCCGCCAAAGGCTACGTTCTGACCAATAACCATGTCATTAATCAGGCACAGAAGATCAGCGTTCAGTTAAATGACGGACGCGAGTTTGACGCTAAACTGATCGGCAGCGACGACCAGAGCGACATCGCGCTGCTGCAAATCCAGAACGCCAGCAACCTGACGCAAATCGCCATCGCCGACTCCGACAAACTGCGCGTCGGCGACTTTGCCGTCGCGGTAGGCAACCCGTTTGGTCTGGGGCAAACCGCCACCTCCGGCATTGTCTCCGCGCTTTGGCCGGAGCGGGCTGAATCTGGAAGGACTGGAAAACTTTATTCAAACCGACGCCTCGATTAACCGCGGCAACTCCGGCGGCGCGCTGCTGAATCTCAACGGCGAGCTGATCGGCATCAACACCCGCTATTCTGGCACCTGGCGGCGGTAGCATCGGCATTGGGTTTGCGATTCCGAGTAATATGGCGCGCACGCTGGCGCAACAGCTGATTCAGTTTGGTGAAATTAAACGCGGCCTGCTGGGAATTAAAGGCATGGAATGACCGCCGATATCGCCAAAGCTTCAAGCTGGATGTACAGCGCGGGGCGTTCATCAGCGAAGTGCTCCCGGGTTCAGGTTCCGCGAAAGCGGTGTGAAATCCGGCGACGTGATCACCAGTCTCAACGGCAAGCCGCTGAGCAGCTTTGCCGAACTGCGCTCACGCATCGCAACGACGGAACCCGGCACGAAAGTGAAGCTCGGTCTGTTACGCGACGGCAAACCGCTGGAAGTCGAAGTCACCCTCGACACCAGCACGTCATCCTCCGCCAGCGCGGAGATGATCGCCCCGGCATTGCAGGGCGCAACGCTGAGCGATGGTCAGTTAAAAGACGGCAATAAAGGCATCAAAATTGATAACGTTGAAAAAAGCAGTCCCGCCGCCAGGCCGGTTTGCACAAAGATGACGTGA
Protein sequences of DBSCAN-SWA_4 >LR134204|1848055:1896534|1894731_1895130_+|VEB89875.1|DBSCAN-SWA MTWEYALIGLVVGIIIGAVAMRFGNRKLRQQQALQYELEKNKAELEEYREELVSHFARSAELLDTMAHDYRQLYQHMAKSSSSLLPELSAESNPFRNRLAESEASNDQAPVQMPRDYSEGASGLLRSGAKRD >LR134204|1848055:1896534|1852030_1853122_+|VEB89715.1|DBSCAN-SWA MRFSRFIIGLTTSIAFSVQAANVDEYINQLPDGANLAFMAQKVGASTPAIDYHSQQMALPASTQKVITALAALIQLGPDFRFTTTLETKGNVDNGVLKGDLVARFGADPTLKRQDIRNMVAMLKKSGVTQIAGNVLIDTSIFASHDKAPGWPWNDMTQCFSAPPAAAIVDRNCFSISLYSAQKPNDLAYIRVASYYPVTMFSQVRTLPRGSAEAQYCELDVVPGDLNRFTLTGCLPQRAEPLPLAFAIQDGASYAGAILKDELKQAGITYSGTLLRQTQVNEPGTVVASKQSAPLHDLLKIMLKKSDNMIADTVFRMIGHARFNVPGTWRRDLMPYARSCVSRLALILAIPSSLTVRASLATT >LR134204|1848055:1896534|1858254_1859232_-|VEB89738.1|DBSCAN-SWA MALGPLVARFGQGQVSLPGGCAIGARPVDLHITGLEQLGAEIKLEEGYVKASVNGRLKGAHIVMDKVSVGATVTIMSAATLAEGTTVIENAAREPEIVDTANFLVTLGAKISGQGTDRITIEGVERLGGGVYRVLPDRIETGTFLVAAAISGGKIVCRNAQPDTLDAVLAKLRDAGADIETGEDWISLDMHGQRPKAVNVRTAPHPAFPTDMQAQFTLLNLVAEGTGFITETIFENRFMHVPELIRMGAHAEIESNTVICHGVEKLSGAQVMATDLRASASLVLAGCIAEGTTLVDRIYHIDRGYERIEDKLRALGANIERVKGE >LR134204|1848055:1896534|1895298_1895541_+|VEB89878.1|protease|DBSCAN-SWA MKKQIPLLSALALSVGLTLSAPFQAVASIPAQIPGQAALPSLAPMLEKVLPAVISVKVEGTATQARRFQKSLKVFWRRVA >LR134204|1848055:1896534|1872125_1872776_-|VEB89799.1|DBSCAN-SWA MGGGIALFSVVPVPFSAVMVERQISAWLQGDFGYVAHSDWAGMDAISPWMGLAVIAAEDQKFPEHWGFDVSAIEKALAHNERNENRIRGASTLSQQTAKNLFLWDSRSWLRKGLEAGLTVGLETVWSKKRILTVYLNIAEFGDGVFGVEAASQRYFNKPASRLSMSEAALLAAVLPNPLRFKANAPSGYVRSRQAWILRQMRQLGGESFMTRNHLY >LR134204|1848055:1896534|1861494_1862277_-|VEB89756.1|DBSCAN-SWA MLLNALASLGHRGIKTLRTFGRAGLMLFNAVIGKPEFRKHAPLLVRQLYNVGVLSMLIIIVSGVFIGMVLGLQGYLVLTTYSAETSLGMLVALSLLRELGPVVAALLFAGRAGSALTAEIGLMRATEQLSSMEMMAVDPLRRVISPRLWAGVISLPLLTIIFVAVGIWGGSLVGVSWKGIDAGFFWSAMQDAVDWRLDLVNCLIKSVVFAITVTWIALFNGYDAIPTSAGISRATTRTVVHSSLAVLGLDFVLTALMFGN >LR134204|1848055:1896534|1869204_1869492_+|VEB89784.1|DBSCAN-SWA MQLNITGNNVEITEALRDFVSTKFAKLEQYFDRINQVYVVLKVEKVTHISDATLHVNGGEIHASAEGQDMYAAIDGLIDKLARQLTKHKDKLKQH >LR134204|1848055:1896534|1853846_1854770_-|VEB89722.1|DBSCAN-SWA MKHRFWSLLADGGNGCVSFRREKYIPKGGRMAVMAAMVATSWLEADENLNTLIDYRFEKSFRAERGQNGASRDCTGKRGKDVTIKVPVGTRVIDQGTGETMGDMTKHGQRLMVAKGGWHGLGNTRFKSSVNRTPRQKTMGTPGDKRDLMLELMLLADVGMLGMPNAGKSTFIRAVSAAKPKVADYPFTTLVPSLGVVRMDNEKSFVVADIPGLIEGAAEGAGLGIRFLKHLERCRVLLHLIDIDPIDGSDPVENARIIIGELEKYSQDLAAKPRWLVFNKIDLLDRPKRKKKRKLSRRRWAGKISTT >LR134204|1848055:1896534|1889884_1890727_-|VEB89856.1|DBSCAN-SWA MLFSSTIYIMDAMNAFDSQAENSPPTIGRSLRSRPLARKKLSEMVEEELEQMIRRGEFGEGEQLPSERELMTFFNVGRPSVREALAALKRKGLVQINNGERARVSRPSADTIISELSGMAKDFLSHPGGIAHFEQLRLFFESSLVRYAAEHATDEQIDLLAKALEINSQSLDDNALFIRSDVEFHRVLAEIPGNPIFMAIHVALLDWLIAARPAVSGQELHEHNSVSYQEHIAIVDAIRRRDPDEADRALQTHLNSVSATWHAFDQKKKIALAIAILTIV >LR134204|1848055:1896534|1865307_1865874_+|VEB89767.1|DBSCAN-SWA MSKAGASLATCYGPVSTQVIAQAENIRLLILDVDGVLSDGLIYMGNNGEELKAFNVRDGYGIRCALTSGIEVAIITGRKAKLVEDRCATLGITHLYQGQSDKLIAFGDLLDKLGVAPENVAYVGDDLIDWPVMEKVGLSVAVADAHPLLIPRADYVTHIAGGRGAVREVCDLLLLAQGKLDEAKGQSI >LR134204|1848055:1896534|1896216_1896534_+|VEB89883.1|protease|DBSCAN-SWA MKSGDVITSLNGKPLSSFAELRSRIATTEPGTKVKLGLLRDGKPLEVEVTLDTSTSSSASAEMIAPALQGATLSDGQLKDGNKGIKIDNVEKSSPAARPVCTKMT >LR134204|1848055:1896534|1871479_1872151_+|VEB89796.1|DBSCAN-SWA MILLSEQNPLGTGRHRKCYAHPGDAQRCIKIIYNADRSGKKEIRRELKYYAHLSRYLQDWSGIPRYHGTVETDCGTGYVYDVITDFDGKPSVTLTEFVAQCRNDEDAAVLRQLLKTLKRYIYDNRIVTMTLKPQNILCHRISESEVIPVICDNIGESTLIPLASWSAWFCHRKQERQWERLIAQPGLVAALKMNRQEEDKSALPLPSLEAARQRNLNTDGYAS >LR134204|1848055:1896534|1862284_1863097_-|VEB89758.1|DBSCAN-SWA MGQSVANLVDMRDVSFSRGDRCIFDNISLTVPRGKITAIMGPSGIGKTTLLRLIGGQIPPDSGEILFDGENVPAMSRSRLYTVRKRMSMLFQSGALFTDMNVFDNVAYPLREHTNLPAPLLKSTVMMKLEAVGLRGAAKLMPSELSGGMARRAALARAIALEPDLIMFDEPFVGQDPITMGVLVKLISELNSTLGVTCVVVSHDVPEVLSIADHAWIMADKKIVAHGSAKELQENADPRVRQFLDGIADGPVPFRYPAGDYHSDLLETGS >LR134204|1848055:1896534|1854797_1855763_-|VEB89725.1|DBSCAN-SWA MKQQAGIGILLALITAMCWGALPIAMKQVLEVMEPPTIVFYRFLMASIGLGAILAVKRKLPPLRIFRKPRWLVLLAIATGGLFGNFILFSSSLQYLSPTASQVIGQLSPVGMMVASVFILKEKMRGTQVIGALMLLCGLVMFFNTSLVEIFTKLTDYTWGVIFGVGAATVWVSYGVAQKVLLRRLASQQILFLLYTLCTIALLPLAKPGVISQLSDWQLACLIFCGLNTLVGYGALAEAMARWQAAQVSALITLTPLFTLLFSDLLSMAWPDFFARPMLNLLGYLGAFVVVGGAMYSAIGHRIWAVLRKHETVVSQPRSGE >LR134204|1848055:1896534|1888073_1888757_-|VEB89847.1|DBSCAN-SWA MSTSTQNIPWYRHLNRAQWRAFSAAWLGYLLDGFDFVLIALVLTEVQGEFGLTTVQAASLISAAFISRWFGGLMLGAMGDRYGRRLAMVTSIILFSMGTLACGFAPGYTTMFIARLVIGMGMAGEYGSSATYVIESWPKHLRNKASGFLISGFSVGAVVAAQVYSLVVPIWGWRALFFIGILRSFSRSGCVKTSLKRKTGKRNTKVKRRCARWWTSSTGANIASSMC >LR134204|1848055:1896534|1866975_1867701_+|VEB89777.1|DBSCAN-SWA MATLTAKNLAKAYKGRRVVEDVSLTVNSGEIVGLLGPNGAGKTTTFYMVVGIVPRDAGNIIIDDEDISLLPLHARARRGIGYLPQEASIFRRLSVFDNLMAVLQIRDDLTTEQREDRANELMEEFHIEHLRDSLGQALSGGERRRVEIARALAANPKFILLDEPFAGVDPISVIDIKRIIEHLRDSGLGVLITDHNVRETLAVCERAYIVSQGHLIAHGTPTEILQDEHVKRVYLGEDFRL >LR134204|1848055:1896534|1853610_1853727_-|VEB89719.1|DBSCAN-SWA MDDYHRQQLAEVEDEAEDDWDDDWDEDDEEGVEFIYKR >LR134204|1848055:1896534|1848055_1848898_-|VEB89703.1|protease|DBSCAN-SWA MNEAALFAARGNKRVVSMVEFEKAKDKIMMGAERRSMVMTEAQKESTAYHEAGHAIIGRLVPEHDPVHKVTIIPRGRALGVTFFLPEGDAISASRQKLESQISTLYGGRLAEEIIYGVEHVSTGASNDIKVATNLARNMVTQWGFSDKLGPLLYAEEEGEVFLGRSVAKAKHMSDETARIIDQEVKALIERNYNRARQILNDNMDILHAMKDALMKYETIDAPQIDDLMARREVRPPAGWEDPNGTNNSDNNGTPRAPRPVDEPRTPNPGNTMSEQLGDK >LR134204|1848055:1896534|1892275_1892668_-|VEB89865.1|DBSCAN-SWA MAENQYYGTGRRKSSAARVFIKPGNGKIVINQRSLEQYFGRETARMVVRQPLELVDMVEKLDLYITVKGGGISGQAGAIRHGITRALMEYDESLRSELRKAGFVTRDARQVERKKVGLRKARRRPQFSKR >LR134204|1848055:1896534|1859277_1859580_+|VEB89741.1|DBSCAN-SWA MLTWLASIQTEPLRSTFVPSWLSSFIVVSMSFSFGTFWISTGSSASSAAKRIGNAAFLAPEIVTSPWSVVGPCTRNLSIILFSVKNSYLLPACRPSLKNR >LR134204|1848055:1896534|1863309_1864287_+|VEB89761.1|DBSCAN-SWA MLLATALLIIGLLLVVYSADRLVFAASILCRAVGIPPLIIGMTVVSIGTSLPEIIVSVAASLHGQLDLAVGTALGSNIINILLILGLAALFHPFTVHSDVLRRELPLMLFVSVLAGSVLHDGELSRSDGIFLLLLAVLWLLFIVKIARLAERQGNDSLTREQVAELPREGGLPVAFLWLGIALIIMPMATRMVVDNATVLANYFAMSELTIGLTVIAIGTSLPELATAIAGIRKGENDIAIGNIIGANISNIAIVLGLPALITPGDVNPLAFGRDYSVMLLVSIVLALLCWRRPRQIGRGAGVLLTGGFIVWLAMLYWLSPLLIG >LR134204|1848055:1896534|1878125_1882334_+|VEB89819.1|DBSCAN-SWA MLFLNKDPELAAASRRIVEEELQRETLSIVGWRDVPTNEGVLGEIALSSLPRIEQIFVNAPAGWRPRDMERRLFIARRRIEKRLQDDKDFYVCSLSNLVNIYKGLCMPADLPRFYLDLADLRLESAICLFHQRFSTNTVPRWPLAQPFRYLAHNGEINTITGNRQWARARTYKFQTPLIPDLQSAAPFVNETGSDSSSLDNMLELLLAGGMDIVRAMRLLVPPAWQNNPDMDPDLRAFFDFNSMHMEPWDGPAGIVMSDGRFAACNLDRNGLRPARYVITKDKLITCASEVGIWDYQPDEVVEKGRVGPGELMVIDTRGGRILHSAETDDDLKSRHPYKEWMEKNVRRLVPFEDLADDQVGNRELDDDMLASYQKQFNYSAEELDSVIRVLGENGQEAVGSMGDDTPFAVLSSQPRIIYDYFRQQFAQVTNPPIDPLREAHVMSLATSIGREMNVFCEAEGQAHRLSFKSPILLYSDFKQLTTMKEEHYRADTLDITFDVTETSLEETVKALCDKAEQMVRNGTVLLVLSDRNIAKNRLPVPAPMAVGAVQTRLVDKSLRCDANIIVETASARDPHHFAVLLGFGATAIYPYLAYETLGRLIDTQAIAKDYRTVMLNYRNGINKGLYKIMSKMGISTIASYRCSKLFEAVGLHDDVVSHCFQGVVSRIGGAGFADFQQDLLNLSKRAWLARKPLDQGGLLKYVHGGEYHAYNPDVVRTLQQAVQSGEYRDYQAYAALVNERPAATLRDLLALNPGDNAISINDVEPANELFKRFDTAAMSIGALSPEAHEALAEAMNSIGGNSNSGEGGEDPARYGTNKVSRIKQVASGRFGVTPAYLVNADVIQIKVAQGAKPGEGGQLPGDKVTPYIAKLRYSVPGVTLISPPPHHDIYSIEDLAQLIFDLKQVNPKAMISVKLVSEPGVGTIATGVAKAYADLITIAGYDGGTGASPLSSVKYAGCPWELGLVETQQALVANGLRHKIRLQVDGGLKTGLDIIKAAILGAESFGFGTGPMVALGCKYLRICHLNNCATGVATQDDKLRKNHYHGLPFKVTNYFEFIARETRELMAQLGVKRLVDLIGRTDLLKELEGFTAKQQKLELSRLLETAEPHPGKALYCTEHNPPFDNGVLNAQLLQQAKPFVDERQSKTFWFDIRNTDRSVGASLSGYIAQTHGDQGLASDPIKAHFSGTAGQSFGVWNAGGVELYLTGDANDYVGKGMAGGLLAVRPPVGSAFRSHEASIIGNTCLYGATGGRLYAAGRAGERFAVRNSGAITVVEGIGDNGCEYMTGGIVCVLGKTGVNFGAGMTGGFAYVLDEDGEFRKRVNPELVEVLNVDDLAIHEEHLRGLITEHVQHTGSQRGEEILANWSVFSTKFALVKPKSSDVKALLGHRSRSAAELRVQAQ >LR134204|1848055:1896534|1848839_1850000_-|VEB89706.1|protease|DBSCAN-SWA MSDMAKNLILWLVIAVVLMSVFQSFGPSESNGRKVDYSTFLQEVNQDQVREARINGREINVTKKDSNRYTTYIPVNDPKLLDNLLTKNVKVVGEPPEEPSLLASIFISWFPMLLLIGVWIFFMRQMQGGGGKGAMSFGKSKARMLTEDQIKTTFADVAGCDEAKEEVAELVEYLREPSRFQKLGGKIPKGVLMVGPPGTGKTLLAKAIAGEAKVPFFTISGSDFVEMFVGVGASRVRDMFEQAKKAAPCIIFIDEIDAVGRQRGAGLGGGHDEREQTLNQMLVEMDGFEGNEGIIVIAATNRPDVLDPALLRPGRFDRQVVVGLPDVRGREQILKVHMRRVPLAPDIDAAIIARGTPGFFWRRSGEPCERSGAVCCPWQQACGIHG >LR134204|1848055:1896534|1855859_1856117_-|VEB89728.1|DBSCAN-SWA MAHKKAGGSTRNGRDSEAKRLGVKRFGGETVLAGSIIVRQRGTKFHAGTNVGCGRDHTLFAKADGKVKFEVKGPNNRKYISIVAE >LR134204|1848055:1896534|1873733_1874207_-|VEB89805.1|DBSCAN-SWA MDDVLSKPLSVPALTAMIKKYWDTRNEEESTVTSEESSKSQALLDIPMLEQYIELVGPKLITDGLAVFEKMMPGYLSILESNLTARDTKGVVEEGHKIKGAAGSVGLRHLQQLGQQIQSPDLPAWEDNVGEWVEEMKQEWQHDVAVLKAWVANAEKK >LR134204|1848055:1896534|1886717_1887218_-|VEB89841.1|DBSCAN-SWA MSLLEQLDKNIAANGGLIVSCQPVPGSPLDKPDIVAAMALAAEQAGAVALRIEGVDNLRATRALVSVPIIGILKRDLDDSPVRITPFLEDVDALAQAGADIIAVDGTDRVRPATVDALLARIHQHQLLAMTDCSSLEDGLACQRRGAELIGTTPVGIHHAGHARRT >LR134204|1848055:1896534|1883973_1884777_+|VEB89826.1|DBSCAN-SWA MTVPTPLQHADDYQQIHDGIIRLVDNAHTEAVRSINALMTATYWEIGRRIVEFEQGGEARAAYGIQLIERLSADLSQRYKRGFSTANLRQMRMFYLYFQDIEIQQTLSGESSNLIYLAKVFPLPWSAYVRLLSVKNPDARTFYEKETLRNGWSVRQLDRQISTQFYERTLLSHDKSAMLQQPAPAEPTVLPERAIRDPFILEFLNLRDEYSESDLEEALLSHLMDFMLELGDDFAFVGRQRRLRIDDSWFRVDLLFFHRRLRCLFAR >LR134204|1848055:1896534|1884817_1885060_+|VEB89829.1|DBSCAN-SWA MNMYLNYAKEHWTMPGENPPVGLVLCAGKGAGEAHYALNGLPNTIMASEYKVQLPDEKLLADELIRSQIALTAQQVESTE >LR134204|1848055:1896534|1870145_1871000_+|VEB89790.1|DBSCAN-SWA MVLMIVSGRSGSGKSVALRALEDMGFYCVDNLPVVLLPDLARTLADRQISAAVSIDVRNMPESPEIFEQAMNNLPDAFSPQLLFLDADRNTLIRRYSDTRRLHPLSSKNLSLESAIDQESDLLEPLRSRADLIVDTSEMSVHELAEMLRTRLLGKRERELTMVFESFGFKHGIPIDADYVFDVRFLPNPHWDPKLRPMTGLDKPVAAFLDRHTEVHNFIYQTRSYLELWLPMLETNNRSYLTVAIGCTGGKHRSVYIAEQLADYFRSRGKNVQSRHRTLEKRKT >LR134204|1848055:1896534|1870996_1871269_+|VEB89793.1|DBSCAN-SWA MTVKQTVEVTNKLGMHARPAMKLFELMQGFEAEVLLRNDEGTEAEANSVIALLMLDSAKGRQIEIEATGPQEVEALAAVIALFNSGFDED >LR134204|1848055:1896534|1889278_1889773_-|VEB89853.1|DBSCAN-SWA MAKDLRGVMPALLTPFDNQQKLDIESLRRLVRFNIAQGIDGLYVGGSTGEAFVQSRAEREQVLEIVAEEAKGKVTLIAHVGTVSTEESQQLASAAHRYGFDAVSAVTPFYYPFSFEEHCDHYRAIIDSAEGLPDGGVQHSCAERREADAGANQHAGYAAAALAR >LR134204|1848055:1896534|1865870_1866446_+|VEB89770.1|DBSCAN-SWA MSKTRRWVIILLSLAVLVLIGINLADKDDPAQVVVNTSDPTYKSEHTDTVVYSPEGALNYRLIAQHVEYYSEQALSWFTQPVLTTFDKDKVPTWSIKADKAKLTEDRMLYLYGHVEVNALAPDAQLRRITTDNATINLVTQDVTSEDLVTLYGTTFNSSGLKMRGNLRSKNAELIEKVRTSYEIQNKQTQP >LR134204|1848055:1896534|1859991_1860285_-|VEB89744.1|DBSCAN-SWA MTQSLSWTREGDTLALAGELDQDVLNPLWDARVDAMKGVTCIDLSQVSRVDSGGLALLVHLVDQAKRQGNSVSLQGVNEKVYTLAKLYNLPADVLPR >LR134204|1848055:1896534|1869617_1870109_+|VEB89787.1|DBSCAN-SWA MTNNDTTLQLSSVLNQECTRSGVHCQSKKRALEIISELAAKQLSLPPQVVFEAILTREKMGSTGIGNGIAIPHGKLEEDTLRAVGVFVQLETPIAFDAIDNQPVDLLFALLVPADQTKTHLHTLSLVAKRLADKTICRRLRAAQSDEELYQIITDTEGGKDEA >LR134204|1848055:1896534|1885190_1885658_-|VEB89832.1|DBSCAN-SWA MLMGEIQSLPSAGLHPVLLDALTLALAARPQEKTPGRYELQGDDVFMNVMQFATQSPQEKKAELHVQYIDIQLLLMGEERILFGVVGSACECEELHGEEDYQLCRRIHEQQSITLTPGMFSVFMPGEPHKPGCMAGESGEIKKAVVKVRASLLEV >LR134204|1848055:1896534|1891321_1891960_-|VEB89862.1|DBSCAN-SWA MAVAANKRSVMTLFSGPTDIYSHQVRIVLAEKGVSFEIEHVEKDNPPQDLIDLNPSQSVPTLVDRELTLWESRIIMEYLDERFPHPPLMPVYPVARGESRLYMHRIEKDWYTLMNVIVNGSASEADVARKQLREELQAIAPVFGQKPYFLSDEFSLVDCYLAPLLWRLPQLGIEFSGAGAKELKGYMTRVFERDSFLASLTEAEREMRLGRG >LR134204|1848055:1896534|1888877_1889252_-|VEB89850.1|DBSCAN-SWA MEQIRRAHPDLVLYNGYDEIFASGLLAGADGGIGSTYNIMGWRYQGIVKALQEGDVAKAQHLQTECNKVIDLLIKTGVFRGLKTVLHYMDVVSVPLCRKPFAPVDEKYLPELKALAQQLMQERG >LR134204|1848055:1896534|1866414_1866969_+|VEB89773.1|DBSCAN-SWA MKFKTNKLSLNLVLASTLLAASLPAFAVTGDTEQPIHIESDQQSLDMQGNVVTFTGNVVVTQGTIKINADKVVVTRPGGEQGKEVIDGFGNPATFYQMQDNGKPVKGHAQKMHYELAKDFVVLTGNAYLEQLDSNITGDKITYLVKEQKMQAFSEKGKRVTTVLVPSQLQDKNKDQAPAQKKGN >LR134204|1848055:1896534|1893414_1894539_-|VEB89871.1|DBSCAN-SWA MQSISPTSRYLQALNDGSHQPDDVQKDAVNRLESIYQALIAKTPPAPQTGGLMARFGKLLGKREPSANAPVRGLYMWGGVGRGKTWLMDLFYHSLPGERKQRLHFHRFMLRVHEELTALQGQTDPLEIIADRFKAETDVLCFDEFFVSDITDAMLLGGLMKALFARGITLVATSNIPPDELYRNGLQRTRFLPAIDAIKAHCDIMNVDAGVDYRLRTLTQAHLWLSPRNDETQQQMDKLWLALAGAKRENAPTLEVNHRPLPTMGVENQTLAVSFITLCVDARSQHDYIALSRLFHTVLLFDVPVMTPLMESEARRFIALVDEFYERHVKLVVSADAPLYEIYQGERLKFEFQRCLSRLQEMQSEEYLKRSHMP >LR134204|1848055:1896534|1876269_1877199_-|VEB89812.1|DBSCAN-SWA MQLQKLVNMFGGDLSRRYGQKVHKLTLHGGFSCPNRDGTIGRGGCTFCNVASFADEAQQYRSIAEQLAHQAHLVNRAKRYLAYFQAYTSTFAEVQVLRSMYQQAVSQASIVGLCVGTRPDCVPDAVLDLLCEYKDQGYEVWLELGLQTAHDKTLRRINRGHDFACYQRTTQLARERGLKVCSHLIVGLPGEGQTECLQTLDRVVETGVDGIKLHPLHIVKGSIMAKAWEAGRLNGIALEAYTLTAGEMIRHTPPEVIYHRISASARRPTLLAPLWCENRWTGMVELDRYLNERGAQGSALGTPWIQPES >LR134204|1848055:1896534|1860647_1860920_-|VEB89750.1|DBSCAN-SWA MFKRLMMVALLVIAPLSAATAADQTNPYKLMNEAAQKTFDRLKNEQPKIRANPDYLRDVVDQELLAVCAGEIRGRAGTGSLLPGGDAGAA >LR134204|1848055:1896534|1850811_1851147_+|VEB89712.1|DBSCAN-SWA MTRFQSQRKQKYTMNLSTKQKQAPERSGTSAQAGSYAWQQWFDPKGYWPRFEQALEHHELIKVKIASEDRETKTLIVEAIVRETGACNVQVIGKTLVLYRPTKERKISLPR >LR134204|1848055:1896534|1874244_1876071_-|VEB89809.1|DBSCAN-SWA MKQIRMLAQYYVDLMMKLGLVRFSMLLALALVVLAIVVQMAVTMVLHGQVESIDVIRSIFFGLLITPWAVYFLSVVVEQLEESRQRLSRLVQKLEEMRERDLKLTVQLKDNIAQLNQEIADREKAEAERQSTFEQLKVEIKEREETQIQLEQQSSFLRSFLDASPDLVFYRNEDKEFSGCNRAMELLTGKSEKQLVHLKPADVYSPEAAEKVIETDEKVFRHNVSLTYEQWLDYPDGRKACFEIRKVPYYDRVGKRHGLMGFGRDITERKRYQDALERASRDKTTFISTISHELRTPLNGIVGLSRILLDTDLTSEQEKYLKTIHVSAVTLGNIFNDIIDMDKMERRKVQLDNQPVDFTSFMADLENLSGLQAQQKGLRFVLDPTLPLPHKVITDGTRLRQILWNLISNAVKFTQQGQVTVRVRYDEGDVLHFDVEDSGIGIPQDEQDKIFAMYYQVKDSHGGKPATGTGIGLAVSRRLAKNMGGDITVSSQPGKGSVFTLTVHAPAVAEEVEDTLEENDMPLPALHVLLVEDIELNVIVARSVLEKLGNSVDVAMTGKAALEMFTPGEYDLLLLDIQLPDMTGLDISRETYPPLCAGRITAAGRTHC >LR134204|1848055:1896534|1895533_1895926_+|VEB89880.1|protease|DBSCAN-SWA MPDQPSQPFEGLGSGVIIDAAKGYVLTNNHVINQAQKISVQLNDGREFDAKLIGSDDQSDIALLQIQNASNLTQIAIADSDKLRVGDFAVAVGNPFGLGQTATSGIVSALWPERAESGRTGKLYSNRRLD >LR134204|1848055:1896534|1885844_1886531_-|VEB89838.1|DBSCAN-SWA MSTLAIDIGGTKLAAARVDDDLRIRERRELPTPASKTPDALREALKVLVEPLQTTAQRVAIASTGIIREGALLAINPHNLGGLLHFPLVQTLEDLTGLPTLAVNDAQAAAWAEYHALTSDVRDMVFITVSTGVGGGVVSDGRLLTGMSGLAGHLGHTLADPQGPVCGCGRRGLCGGYRLRTRHCGSGAGFPGRVRRQNCLCTCGCRERTGCASGSALRAGCRAVNCRC >LR134204|1848055:1896534|1890815_1891316_-|VEB89859.1|protease|DBSCAN-SWA MDLSQLSPRRPYLLRAFYEWLLDNQLTPHLVVDVTLPGVHVPMEYARDGQIVLNIAPRAVGNLELANDEVRFNARFGGVPRQVSVPLAAVLAIYARENGAGTMFEPEAAYDEDVASLNDDDVTPGAESETVMSVIDGDKPDHDDDNNPDDDPPPPRGGRPALRVVK >LR134204|1848055:1896534|1877779_1878142_+|VEB89815.1|DBSCAN-SWA MIRNPRRHALSVPVRNRYRMGVPQSLGRFTDMLYDKSLERDNCGFGLIAHIEGEPSHKVVRTAIHALARMQHRGAILADGKTGDGCGLLLQKPDRFFRIVAEERGWRLAKKLRCRHALPE >LR134204|1848055:1896534|1872849_1873503_-|VEB89802.1|DBSCAN-SWA MKKIGVVLSGCGVYDGAEIHEAVLTLLAIARSGAQAICFAPDKQQADVINHLTGEAMAETRNVLIEAARITRGEIRPLAQAVSSDLDALIVPGGFGAAKNLSNFASQGSECRVDSDLAALAIAMHQSGKPLGFMCIAPAMLPRIFGFPLRLTIGTDIDTAEVLEDMGAEHVPCPVDDIVVDEDNKIVTTPAYMLAQDIAQAATGIEKLVSRVLVLAE >LR134204|1848055:1896534|1857912_1858185_+|VEB89735.1|DBSCAN-SWA MENKLIDWHPADIIAGLRKKGTSMAAESRKNGLSSSTLANALTRPWPKGELIIAKALGTEPWVIWPSRYHDPETHEFIDRTRLMRARKGK >LR134204|1848055:1896534|1850091_1850622_-|VEB89709.1|DBSCAN-SWA MPGLNLMKYSKVTNFFKPGMTVVDLGAAPGGWSQYVVTQIGGKGRIIACDLLPMDPIVGVDFLQGDFRDELVMKALLERVGDSKVQVVMSDMAPNMSGTPAVDIPRAMYLVELALEMCRDVLAPGGSFVVKVFQGEGFDEYLREIRSLFTKVKVRKPDSSRARSREVYIVATGRKP >LR134204|1848055:1896534|1860938_1861490_-|VEB89752.1|DBSCAN-SWA MQTKKSEIWVGIFLLVALLAALFVCLKAANVTSMRTEPTYTIYATFDNIGGLKARSPVRIGGVVVGRVADISLDPKTYLPRVTLDIEERYNHIPDTSSLSIRTSGLLGEQYLALNVGFEDPELGTSILKDGDTIQDTKSAMVLEDMIGQFLYNSKGDDNKNSGDAPAPTEGNNEATPPAGATN >LR134204|1848055:1896534|1882345_1883764_+|VEB89823.1|DBSCAN-SWA MSQNVYQFIDLQRVDPPKKALKIRKIEFVEIYEPFSEGQAKAQADRCLSCGNPYCEWKCPVHNYIPNWLKLANEGRIFEAAELSHQTNTLPEVCGRVCPQDRLCEGSCTLNDEFGAVTIGNIERYINDKAFEMGWRPDMTGVRQTDKRVAIIGAGPAGLACADVLTRNGVKAVVFDRHPEIGGLLTFGIPAFKLEKEVMTRRREIFTGMGIEFKLNTEVGRDVQLDDLLAEYDAVFLGVGTYQSMRGGLENEDADGVFDALPFLIANTKQIMGFGETTDEPFVSMEGKRVVVLGGGDTAMDCVRTSVRQGATHVTCAYRRDEENMPGSRREVKNAREEGVEFQFNVQPLGVEVNANGKVSGVKMVRTEMGAPDAKGRRRAEIVAGSEHVIPADAVVMAFGFRPHSMAWLAKHSVELDSQGRIIAPEGNENAFQTSNPKIFAGGDIVRGSDLVVTAIAEGRKAADGILNYLEV >LR134204|1848055:1896534|1867748_1869182_+|VEB89780.1|DBSCAN-SWA MKQGLQLRLSQQLAMTPQLQQAIRLLQLSTLELQQELQQALESNPLLEQTDLHDEIDTRETQDNEALDTADALEQKEMPEELPLDASWDEIYTAGTPSGTSGDYIDDELPVYQGETTQSLQDYLMWQVGLTPFSDTDRAIATSIVDAVDDTGYLTISLDDILESIGDEEIGLDEIEAVLKRVQRFDPIGVAAKDLRDCLLIQLSQFDKSTPWLEDARLIISDHLDLLANHDFRTLMRVTRLKEEVLKEAVNLIQSLDPRPGQSIQTGEPEYVIPDVLVRKHNGYWMVELNGDSIPRLQINQHYAALCNGARNDADSQFIRSNLQDAKWLIKSLESRNDTLLRVSRCIVEQQQAFFEQGEEFMKPMVLADIAQAVEMHESTISRVTTQKYLHSPRGIFELKYFFSSHVNTEGGGEASSTAIRALVKKLIAAENPAKPLSDSKLTSMLSDQGIMVARRTVAKYRESLSIPPSNQRKQLV >LR134204|1848055:1896534|1856181_1857678_+|VEB89732.1|DBSCAN-SWA MALLLTIVFTTTKLNDFNFLATTVSNNFSFDYAAINERNADFDFFTVCDHQNFSELNSFASCDVQLFQANGLTFAYSVLFTTTLENRVHKKLRFRARLFNDSECAINIHNRARILRKTRAFDKCYRQYMKKKNTTCTVTFICAVFSVQSAYIPNPKPLVISSHDELIGAISPAFAMNLEKINELTAQDMAGVNATILEQLNSDVQLINQLGYYIVSGGGKRIRPMIAVLAARAVGYQENAHVTIAALIEFIHTATLLHDDVVDESDMRRGKATANAAFGNAASVLVGDFIYTRAFQMMTSLGSLKVLEVMSEAVNVIAEGEVLQLMNVNDPDITEENYMRVIYSKTARLFEAAAQCSGILAGCTPEQEKGLQDYGRYLGTAFQLIDDLLDYSADGEQLGKNVGDDLNEGKPTLPLLHAMRNGTPEQAQMIRTAIEQGNGRHLLEPVLEAMTACGSLEWTRQRAEDEADKAIAALQILPDTPWREALIGLAHIAVQRDR >LR134204|1848055:1896534|1885657_1885900_-|VEB89835.1|DBSCAN-SWA MRLVQHSAQVVARLIADVKATTDCQQVVIGGSVGLAEGYLAQVRHFLAQEPAVYQVALSAAHYRHDAGLLGAALLAQGDK >LR134204|1848055:1896534|1887267_1888122_-|VEB89844.1|DBSCAN-SWA MVDILYRGEHRVVNVLMTIAAATALWFCFAGDLQNAAIVAVLGLLCAVIFISFMVQSSGKRWPTGVMLMIVVLFAFLYSWPIQALLPTYLKTELAYDPHTVAQVLFFSGFGAAVGCCVGGFLGDWLGTRKAYVCSLLASQVLIIPVFAIGGANVWVLGLLLFFQQMLGQGISGILPKLIGGYFDTDQRAAGLGFTYNVGALGGALAPILGALIAQRLDLGTALGSLSFSLTFVVILLIGLDMPSRVQRWLRPEALRMHDAIDGKPFSGAVPFGGDKSAFVKTKS >LR134204|1848055:1896534|1864301_1865288_+|VEB89764.1|DBSCAN-SWA MSHLALQPGFDFQKAGKDVLEIEREGLAELDQYIDQNFTLACEKIFSCTGKVVVMGMGKSGHIGRKMAATFASTGTSAFFVHPGEAAHGDLGMVTSQDVVIAISNSGESNEIAALIPVLKRLQVPLICITGRPESSMARAADVHLCVKVPKEACPLGLAPTSSTTATLVMGDALAVALLKARGFTAEDFALSHPGGALGRKLLLRVNDIMHTGDEIPHVNKNASLRDALLEITRKNLGMTVICDDTMKIDGIFTDGDLRRVFDMGVDVRQLGIADVMTPGGIRVRPGILAVNALNLMQSRHITSVLVADGDQLLGVLHMHDLLRAGVV >LR134204|1848055:1896534|1892683_1893112_-|VEB89868.1|DBSCAN-SWA MKTFTAKPETVKRDWYVVDATGKTLGRLATELALRLRGKHKAEYTPHVDTGDYIIVLNADKVAVTGNKRTDKVYYHHTGHIGGIKQATFEEMIARRPERVIEIAVKGMLPKGPLGRAMFRKLKVYAGNEHNHAAQQPQVLDI >LR134204|1848055:1896534|1860284_1860425_-|VEB89747.1|DBSCAN-SWA MIAEGVSMITTKQNEWSDLLRTKGIDGLTAQLQSISRQKISLEEKK |
59 | Micromonas_pusilla_virus(12.5%) | protease | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2276061 : 2283635
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >LR134204|2276061:2283635|DBSCAN-SWA CATGAGGGGAAAGGCGTTTAATTCCATTACACGGGCCGTGAAACCACGCAGGTTTTCAGTCAGGCCCGTGATTTACGCCAGCGTGTTGAGCGCTGGCGTATTGTTGTGCGCCTTTTCCGCCCATGCGGATGACCGCGATCAGCTCAAATCCATCCAGGCGGATATCGCCGCCAAAGAGCGCGCCGTACGCCAGCAACAGCAGCAACGTTCCAGTTTGCTCGCACAGCTCAAAACGCAGGAAGAGGCCATCTCCGCCGCCGCGCGTAAACTGCGTGAAACCCAGAATACCCTTGCCCAGTTGAATAAGCAGATCGACGCGATGAACGCGTCTATCGCGAAGCTGGAACAACAAAAAGCCATTCAGGAACGCAGCCTTGCCGCACAGCTGGATGCGGCGTTTCGCCAGGGCGAGCATACGGGGATTCAGCTCATTCTTAGCGGTGAAGAGAGCCAGCGCGGGCAGCGTTTGCAGGCCTATTTCGGTTATCTGAACCAGGCGCGCCAGGACACCATTGCCGAACTGAAACAGACGCGTGAAGAGGTGGCGTCTCAGAAAGCGGCGCTGGAAGAGAAACAGAGCCAGCAACAGACGCTGCTGTATGAACAACGCGCCCAACAGGCGAAGCTGGAGCAGGCGCGTAATGAGCGTCAGAAAACGATCGCCGGTCTTGAGTCGTCAATCCAGCAGGGGCAACAGCAGTTAAGTGAATTGCGAGCGAACGAATCCCGCCTGCGCAACAGCATCGCGCGCGCAGAGGCGGCCGCCAAAGCCCGGGCGGAGCGAGAAGCCCGCGAAGCGCAGGCGGTGCGCAATCGTCAGCAGGAAGCCACCCGCAAAGGCACAACCTACAAACCAACCGAAAGCGAAAAATCGTTGATGTCCCGTACCGGCGGTCTGGGTTCACCGCGCGGTCAGGCATTCTGGCCCGTTCGCGGCCCAACGCTGCATCGTTATGGCGAACAGCTGCAAGGTGAGCTACGTTGGAAAGGGATGGTTATCGGTGCGTCTGAAGGCACTGAAGTCAAAGCAATCGCGGACGGTAGGGTGATTCTGGCGGACTGGTTGCAGGGCTACGGTCTGGTCGTCGTCGTGGAACACGGTAAAGGCGATATGAGCCTTTACGGGTATAACCAGAGCGCACTGGTCAGCGTTGGCACACAGGTTCGCGCAGGCCAGCCTATTGCGCTTGTAGGCAGCAGCGGCGGTCAGGGCCGACCGTCACTCTATTTCGAAATTCGTCGCCAGGGTCAGGCCGTCAATCCACAGCCGTGGTTGGGAAGATAAGTTTTGCCTCACTTTCGTCGCACAATACTCTCACTCGCCAGCCTACTGGCATTCGCCACCCCTGTTTTTGCTGGCAAACTCGCCATCGTAATCGACGATTTTGGTTATCGTCCGCACAATGAAAATCAGGTGCTGGCGATGCCGTCCGCCCTCTCCGTTGCCGTCCTGCCCAATGCCCCACATGCCCGTGAAATGGCAACCAAAGCGCACAACAGCGGGCATGAGGTGCTGATCCATCTGCCGATGGCGCCGCTGAGCAAGCAGCCGCTGGAGAAAGATACGCTGCGCCCGGAGATGAGCAGCGACGAAATTGAACGCATTATTCGCGATGCGGTGAATAAGGTGCCGTATGCCGTGGGTCTCAACAACCACATGGGCAGCGCGATGACCTCCAGCCTGTTTGGCATGCAAAAAGTGATGCAGGCGCTGGAGCGTTATAATCTTTACTTCCTCGACAGCATGACCATTGGCAACAGTCAGGCGATGCGTGCCGCCTCCGGGACGAGCGTGAAGGTGATTAAGCGCAAAGTGTTCCTTGATGATACGCAGAACGAAGCGGATATTCGTCGCCAGTTTAATCGCGCCATTGAGTTAGCCCGTCGCAACGGGTCAGCTATCGCGATTGGGCACCCGCATCCATCAACGGTGCGAGTCCTTCAGCAGATGGTCTATAACCTGCCTGCGGATATTACGCTGGTGCGTCCGAGCAGCCTGCTCGACGAGCCGCAGACAGATACCTCTACGCCAAATCTGACGCCACCGAAAAACGGTACGCCGGATGCACCGCGAAATCCGTTCCGCGGCGTGAAGGTGTGTAAATCGAAGAAACCGTTAGAACCTGTCTACGCCAGCCGGTTCTTTAGCGTGTTAAGCGAAAGCATTACGCAGAGCACGATGGTGAACTACTTCCGGCATCAGTGGCAAGGCTGGGGCAAGATCGCAGCCCCCCAGAACGCTAACGCGGATTAATCGCTTTACGGGCAACGCGGTGGTGCGAGGCCGTTTTATCACGCCACCGGTATAACCGACCCGACCACAACAGCGCCTGATAAGCCGCCTTCGCGCTTCGTATGTTAAGCATCATGCGCTTATACATGCCAGAGGCAAATATTTCGGCGATCATCCGCTGGCGAATAAGCGTGTCCTGCTCCTTACGCACCGCATGACACACCCGCAGCGCTTCCCAGGTAATTTGTTGCTTAAATTCAGGATAGACGGGAATGCGATCCGCATAATCACTATTCAGCTTTTCTAATAAACGCGTGATTTTAATGTAGTGTCGCTGGTAATGCAGATTACGCTCACCTTGTCGTTTCAGGCGGCTAACAGACTGATCGTGAAGAAAATATTTATACAACGCCTGTTCGGTATAGCGTACGCGCGTGGCATTGAACATCACTTCCGTGGTCCACAGAATATCCTGATGATGTAATCCAGGAACAAAGCTAATCGCGTGTTTTTCAATAAGGGCGCGCCGATAAACGCCCATCCAGACAACATGCGTCCAGCGGCGTGAAGCCAGCCCCATACGCAGCCAGTCCGGGCCGGTTAATACCCCTGTCGACCGAATGCGATCCGTGGGGATGGATGGCCAGGTATGCCCCGTCTCAAGAATGCACCAGTCTGCATTACACTGCGCCACATCCAGGTCATCCTGTAGCGCCATGGTCATCAGCGTTTCATACATGTTCGGATACACCAAATCATCGGCATCGACAAAAGCAACGTAGTCGCCGGTCGCCGCGGCCAACCCCCGGTTGCGGGCCACGGACGCGCCTGCATTATCCTGATGCAACAGGTGAACATGTGAATGGGTATCGGCAAAATACCTTGCGATATCAACGGAGTTGTCGGTAGACCCGTCGTTAACAATGATGATCTCCAGCGCATTCCATGTTTGTGCAATCAAAGATTCCATGCATGCCTGGAAGCTATTACCCGCATTGTATAACGGAATAATAACACTCAGTGTACTTGTGCTGCTTTTCATAGAATGGCCCGTCAGTCTTTAATACATTGGCCTTTTCTACCGTGCTTTCATTAAGAAATTATTAAGTCCAGTGGAGATAATGGTTAAGGCGCACAAAACACCCTACAGATGAGAAGCCTCACCATGTATACTACGCCTTTGTATTATACGTATTTAGAATACTTATCTTGAGGGAAGTGATGACCATCAGTAAGACACATGAAAAGGGCTATACGGTCTATTACAAAGAGGAAAATAAAGACCTTAAGTCCCTGATGGATAAGTATATGAATAACGAAATCAGTGGCAAACCGCTGAACAGCGGCAACGAATTCCGCAGCGTAGAACTTGTTGAATACCAGTCACGGAAGTTCATCATCAAAAACGATCGCGAAATAGACCCACGATTTGAGAAAAAGATTCAGAATTTTCTCTCCGGGCCTTTTTATTCTCAGCTGATCCAAAAACTGGACAGCCTGGCGCCACAAGTGCGTGCCTGTACCGCCGATCTCTACTGTGTGGCCGAGAAAACACGTTTTCGCCAGTGCTATGATGTGTATACCCTTCATGAATATATTGAAGGGGAGCCATTAAATGATATTAATGAAAGCAATAAAGAAGATATTAAGGCGTGTATTCAGCAACTGCACCGGGCAGGTCTGGCCTCCAATGATATTCATGCCGGAAACTTTATCCGCACACCTTCCGGGGAATTGCGCATCATCGATTTATCCTGCAAAGGAAGTCTGAAAATTTGCCAGGCAAATGATATTTTAGTATTACAAAATAAATACCATATGAATATTGAAGGTCAGGGTCTGGTTTATAAGCTTATTCAGCTTAAAGAGAAATTCAGACGCTTATCCCGCAAGATGCGCGGAAAATAAAAACGCCATTTTACGGCCAGGTTCGGGAAAACCTGGCCGCCAGCAGGATGCTATCAGTCCCAGCTCAGAATCACTTTTCCTGACTGGCCTGAACGCATCGCATCAAAGCCTTTCTGGAACTCATCGATGGAGAAACGGTGGGTGATGATCGGAGACAGATCCAGACCAGACTGAATCAGCGCCGCCATCTTATACCAGGTTTCAAACATCTCACGACCGTAAATACCTTTAATAAACAGCCCCTTGAAGATCACCTTCGTCCAGTCGATAGACATATCCGAAGGCGGAATCCCCAGCATCGCGATACGACCGCCGTGGTTCATGGTATCCAGCATGGCGCGAAACGCAGGCGGCGCACCGGACATCTCCAGGCCGACATCAAACCCTTCAGTCATGCCCAGATTAGCCATCACGTCGGTCAGGTTCTCTTTCGCAACGTTCACCGTGTGGATGATGCCCATTTTACGCGCCAGTTTCCAGACGGTACTCATTCACATCCGTGATCACGACATGACGCGCGCCCACATGTTTCGCTACCGCCGCCGCCATGATGCCGATCGGGCCGGCACCGGAAACCAGCACATCTTCACCGACCAGATCGAACGACAGCGCCGTGTGTACGGCATTGCCGAAGGGGTCGAAGATAGAGGCTAAATCATCGGAGATGTTATCCGGGATCTTAAAGGCGTTAAACGCCGGAAGTACCAGATATTCCGCAAAACAACCGGGACGGTTAACACCGACACCCAGAGTATTGCGGCACAAATGCGTACGGCCACCGCGACAGTTACGACAATGGCCGCAGGTAATATGGCCTTCGCCGGAAACGCGATCGCCAATCCTGAAGCCTTTAACTTCCTGACCGATACCGACCACCTCGCCGACATATTCATGCCCTACCACCATCGGAACTGGGATGGTTTTTTGCGACCATTCATCCCAGTTATAGATGTGAACGTCAGTGCCGCAGATGGCTGTTTTACGGATTTTAATCAGCAGATCGTTATGCCCGACTTCCGGTACCGGAACGTCAGTCATCCAGATGCCCTCTTCCGCTTTCAGTTTGGATAACGCTTTCATCTCACATCCTCAGGCAATAACGCCCAGTTGTTTACCAATACGTGTGAACGCCTCAACCGCACGCGTGATTTGCTCAGGGGTATGCGCCGCAGACATCTGGGTACGAATACGCGCCTGACCTTTCGGAACAACCGGATAGAAGAATCCGGTCACGTAAATCCCCTCTTTTTGCAGTTCGCGGGCAAATTGCTGCGCAACCACCGCATCACCCAGCATCACTGGAATAATAGCGTGATCGGCGCCAGCCAGCGTGAAGCCCGCAGCGGACATTTGCTCACGGAACTGACGCGCATTCGCCCACAGGCGATCGCGCAGTTCGCTCCCCGCCTCAACCATCTCCAGCACTTTAAGGGAGGCCGCCACAATGGCAGGCGCCAGGGAGTTAGAGAACAGATACGGACGGGAACGCTGACGCAGCCACTCCACCACCTCTTTACGCGCCGCCGTATAACCGCCAGATGCCCCACCCAGCGCTTTGCCTAACGTACCGGTAATGATGTCCACGCGTCCCATTACGTCACAGTATTCGTGAGAACCGCGACCGTTCTCCCCGACAAAACCGACAGCATGGGAGTCATCAACCATCACCAGCGCGCCGTAAATGTCTGCCAGGTCACAGACGCCTTTCAGGTTGGCGATAACGCCGTCCATCGAGAACACGCCGTCGGTGGCGATCAGCACATGACGAGCGCCAGCTTCACGGGCCTCTTTCAGACGCGCTTCCAGCTCCTGCATATCGTTGTTGGCATAGCGAAAACGCTTCGCTTTACACAGGCGAACGCCATCAATAATCGAGGCGTGGTTTAACGCATCCGAGATAATCGCATCTTCTGCGCCCAGCAGGGTTTCGAACAAGCCGCCGTTGGCGTCGAAGCAGGAAGAGTACAGAATAGCGTCTTCCATACCGAGGAAGTCCGCCAGTTTTTGCTCCAGCTGTTTATGGCTGTCCTGAGTGCCGCAGATAAAGCGCACTGAAGCCATGCCAAACCCGTGAGAATCCATTCCCGCTTTTGCTGCCGCAATCAGTTCGGGATGGTTAGCCAGCCCCAGATAGTTGTTGGCGCAAAAGTTGATGACGTGGCTTCCGTCTGCCACGGTGATGTCCGCCTGCTGCGCAGACGTGATAATGCGCTCTTCTTTAAACAATCCTTCCGCACGGGCGGTTTCCAGATCGTTGGTTAACTGCTTATAAAAATCCCCACGCATTGCGATTCTCCAGACTGGGCAAATTTCAGCACATATTACCCAAAGCTATACATTGATACGAGATGACGCATCATCACTTCTTGAAAATGCAGCATAAATCACGCGTACCCGCACAGCTTATCGGTGAATGGCTGAAGCCTGGGCCGTGTAGTGATATGATAAGAAGTATTCGTGTCTGAGATTGTCTCTGACTCCATAATTCGAAGGTTACAGTTATGATCATCGTTACCGGCGGCGCGGGCTTTATCGGCAGCAACATCGTTAAGTCCCTGAATGATAAAGGCATCACCGATATCCTGGTGGTGGACAACCTGAAAGACGGCACCAAGTTCGTCAACCTGGTGGACCTGAATATTGCTGACTATATGGATAAGGAAGACTTCCTGATCCAGATTATGGCCGGAGAAGAGTTCGGCGATATCGAAGCTGTTTTCCACGAAGGCGCCTGCTCTTCCACCACCGAGTGGGACGGCAAGTATATGATGGACAACAACTATCAATACTCCAAAGAGCTGCTGCACTACTGCCTGGAACGTGACATTCCGTTCCTGTATGCCTCTTCTGCGGCAACTTACGGCGGTCGCACCTCTGATTTCATTGAGTCTCGTGAATATGAGAAACCGCTGAACGTTTACGGCTACTCTAAGTTCCTGTTCGACGAGTATGTACGCCAGATCCTGCCTGAAGCCAGTTCGCAGATTGTAGGTTTCCGCTATTTCAACGTTTACGGACCGCGTGAAGGCCATAAAGGCAGCATGGCGAGCGTCGCTTTCCATCTGAATACTCAGCTCAACAACGGTGAAACGCCGAAACTGTTTGAAGGCAGCGAAAACTTCAAACGCGATTTTGTCTACGTCGGCGACGTGGCTGATGTAAACCTGTGGTTCTGGGAAAACGGCGTGTCCGGCATCTTTAACCTGGGCACCGGGCGCGCGGAATCCTTCCAGGCCGTGGCGGATGCCGCGCTGGCTTATCACAAGAAAAGCGATCTTGAGTACATTCCGTTCCCGGAAAAACTGAAAGGCCGTTACCAGGCGTTCACGCAGGCGGATCTGACCAATCTGCGCGCCGCAGGCTATGACAAACCGTTCAAGACCGTTGCCGAAGGCGTAACGGAGTATATGGCCTGGCTGAATCGCGACGCGTAA
Protein sequences of DBSCAN-SWA_5 >LR134204|2276061:2283635|2281290_2282487_-|VEB91259.1|DBSCAN-SWA MRGDFYKQLTNDLETARAEGLFKEERIITSAQQADITVADGSHVINFCANNYLGLANHPELIAAAKAGMDSHGFGMASVRFICGTQDSHKQLEQKLADFLGMEDAILYSSCFDANGGLFETLLGAEDAIISDALNHASIIDGVRLCKAKRFRYANNDMQELEARLKEAREAGARHVLIATDGVFSMDGVIANLKGVCDLADIYGALVMVDDSHAVGFVGENGRGSHEYCDVMGRVDIITGTLGKALGGASGGYTAARKEVVEWLRQRSRPYLFSNSLAPAIVAASLKVLEMVEAGSELRDRLWANARQFREQMSAAGFTLAGADHAIIPVMLGDAVVAQQFARELQKEGIYVTGFFYPVVPKGQARIRTQMSAAHTPEQITRAVEAFTRIGKQLGVIA >LR134204|2276061:2283635|2282702_2283635_+|VEB91262.1|DBSCAN-SWA MIIVTGGAGFIGSNIVKSLNDKGITDILVVDNLKDGTKFVNLVDLNIADYMDKEDFLIQIMAGEEFGDIEAVFHEGACSSTTEWDGKYMMDNNYQYSKELLHYCLERDIPFLYASSAATYGGRTSDFIESREYEKPLNVYGYSKFLFDEYVRQILPEASSQIVGFRYFNVYGPREGHKGSMASVAFHLNTQLNNGETPKLFEGSENFKRDFVYVGDVADVNLWFWENGVSGIFNLGTGRAESFQAVADAALAYHKKSDLEYIPFPEKLKGRYQAFTQADLTNLRAAGYDKPFKTVAEGVTEYMAWLNRDA >LR134204|2276061:2283635|2280663_2281281_-|VEB91256.1|DBSCAN-SWA MKALSKLKAEEGIWMTDVPVPEVGHNDLLIKIRKTAICGTDVHIYNWDEWSQKTIPVPMVVGHEYVGEVVGIGQEVKGFRIGDRVSGEGHITCGHCRNCRGGRTHLCRNTLGVGVNRPGCFAEYLVLPAFNAFKIPDNISDDLASIFDPFGNAVHTALSFDLVGEDVLVSGAGPIGIMAAAVAKHVGARHVVITDVNEYRLETGA >LR134204|2276061:2283635|2280254_2280692_-|VEB91253.1|DBSCAN-SWA MSTVWKLARKMGIIHTVNVAKENLTDVMANLGMTEGFDVGLEMSGAPPAFRAMLDTMNHGGRIAMLGIPPSDMSIDWTKVIFKGLFIKGIYGREMFETWYKMAALIQSGLDLSPIITHRFSIDEFQKGFDAMRSGQSGKVILSWD >LR134204|2276061:2283635|2278300_2279263_-|VEB91247.1|DBSCAN-SWA MESLIAQTWNALEIIIVNDGSTDNSVDIARYFADTHSHVHLLHQDNAGASVARNRGLAAATGDYVAFVDADDLVYPNMYETLMTMALQDDLDVAQCNADWCILETGHTWPSIPTDRIRSTGVLTGPDWLRMGLASRRWTHVVWMGVYRRALIEKHAISFVPGLHHQDILWTTEVMFNATRVRYTEQALYKYFLHDQSVSRLKRQGERNLHYQRHYIKITRLLEKLNSDYADRIPVYPEFKQQITWEALRVCHAVRKEQDTLIRQRMIAEIFASGMYKRMMLNIRSAKAAYQALLWSGRLYRWRDKTASHHRVARKAINPR >LR134204|2276061:2283635|2277348_2278314_+|VEB91244.1|DBSCAN-SWA MPHFRRTILSLASLLAFATPVFAGKLAIVIDDFGYRPHNENQVLAMPSALSVAVLPNAPHAREMATKAHNSGHEVLIHLPMAPLSKQPLEKDTLRPEMSSDEIERIIRDAVNKVPYAVGLNNHMGSAMTSSLFGMQKVMQALERYNLYFLDSMTIGNSQAMRAASGTSVKVIKRKVFLDDTQNEADIRRQFNRAIELARRNGSAIAIGHPHPSTVRVLQQMVYNLPADITLVRPSSLLDEPQTDTSTPNLTPPKNGTPDAPRNPFRGVKVCKSKKPLEPVYASRFFSVLSESITQSTMVNYFRHQWQGWGKIAAPQNANAD >LR134204|2276061:2283635|2276061_2277345_+|VEB91241.1|DBSCAN-SWA MRGKAFNSITRAVKPRRFSVRPVIYASVLSAGVLLCAFSAHADDRDQLKSIQADIAAKERAVRQQQQQRSSLLAQLKTQEEAISAAARKLRETQNTLAQLNKQIDAMNASIAKLEQQKAIQERSLAAQLDAAFRQGEHTGIQLILSGEESQRGQRLQAYFGYLNQARQDTIAELKQTREEVASQKAALEEKQSQQQTLLYEQRAQQAKLEQARNERQKTIAGLESSIQQGQQQLSELRANESRLRNSIARAEAAAKARAEREAREAQAVRNRQQEATRKGTTYKPTESEKSLMSRTGGLGSPRGQAFWPVRGPTLHRYGEQLQGELRWKGMVIGASEGTEVKAIADGRVILADWLQGYGLVVVVEHGKGDMSLYGYNQSALVSVGTQVRAGQPIALVGSSGGQGRPSLYFEIRRQGQAVNPQPWLGR >LR134204|2276061:2283635|2279514_2280201_+|VEB91250.1|DBSCAN-SWA MTISKTHEKGYTVYYKEENKDLKSLMDKYMNNEISGKPLNSGNEFRSVELVEYQSRKFIIKNDREIDPRFEKKIQNFLSGPFYSQLIQKLDSLAPQVRACTADLYCVAEKTRFRQCYDVYTLHEYIEGEPLNDINESNKEDIKACIQQLHRAGLASNDIHAGNFIRTPSGELRIIDLSCKGSLKICQANDILVLQNKYHMNIEGQGLVYKLIQLKEKFRRLSRKMRGK |
8 | Planktothrix_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2295709 : 2300299
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >LR134204|2295709:2300299|DBSCAN-SWA CATGCAAAAACGGGCGATTTATCCAGGAACCTTCGATCCGATCACCAACGGCCATATTGATATCGTGACGCGCGCGACGCAGATGTTCGATCACGTTATCCTCGCCATTGCCGCCAGCCCCAGCAAAAAACCGATGTTTACCCTTGAAGAGCGTGTAGCGCTGGCGCAACAGGCGACGGCGCACCTCGGCAACGTGGAAGTCATGGGATTCAGCGACTTAATGGCCAACTTCGCGCGCATTCAGCAGGCCAATATTTTGATTCGCGGCTTGCGCGCCGTGGCTGATTTTGAATATGAAATGCAACTGGCGCACATGAATCGCCATTTAATGCCGCAGCTTGAGAGCGTCTTCCTGATGCCTTCAAAAGAGTGGTCTTTTATCTCTTCTTCGCTGGTAAAAGAAGTGGCGCGCCATCAGGGCGACGTTACCCACTTCTTACCGGAGAACGTTCATCAGGCCCTGATGGATAAGCTGAAGTAAAACGCTTTTGCCGGATGGCGCGTCAGCGCGTTATCCGGCCTGCAATCCATCTTATTTCTGACACTGACGGCAGTAGAACGTCGCCCGTTGCGCGTGTTTAGCCGCAACGATCGGCGTACCGCAAACCCGGCACGGTTCCCCTTTACGTCCGTAAACCTGCAACTCCCTGAGCAAAATAGCCCGGCTTACCGTCGCTTTTGCAGAAAATCTTTCAACGTTGTACCACCCTGCTCGATGGAACGCAGTAAAACGGCCTTAATCACGCGGGCCAGCAGCTCACACTCAGCCTGCGATAATGACGACGCCAGACGGTCAGGATGGATGCCAGCCGCAAACAACGACTCGCTGGCGTAGATATTTCCCACGCCCACCACCAGTTTGTTATCCATCAGCCAGGGTTTAATTGCCGTTTTCTTCTTCGCACATTTCTGCTGCAAATAGGCCCCGTTAAAATCGTCGCTCAGCGGCTTCCGGGCCGAGATGCGCCAGCACGTTATGCCCTTCCAGCTCTTTCGTCCACAGCCAGGGCGCCAAAGCGGCGAGGATCGGTGTAGCGCAGGACTTTGCCATTGCTCATCACCAGATCAACGTGGTCATGTTTTTCAGCGGGAAGTTCTTCAGGAAGGATACGCAAACTGCCCGACATTCCCAGATGAATGATGATCCAGCCATTCCGGTAACTCCAGCAGCAAATATTTCGCGCGGCGTTGAACGCTGAGTACGGGTTTATCGCTCAGGCGGTAGATCTCTTCCGATACCGGCCAGCGCAGACGTCCATTGCGCACATTGGCATGCAGAATGGTTGCCCCAACCAGATGCGGTTCAATACCGCGGCGGCTGGTTTCAACTTCAGGTAATTCAGGCATAACGTCTCCGGCTCTGTTTCAGGTCTCATTAATATGGGGACGGCCAGAAAACAAAAAACCCCGCCGAAGCGAGGTTTTTTCACTACATCAAAGCGAGAATTATTTGATTTTCGCTTCTTTGTAGATAACGTGCTGACGGACAACTGGATCGAACTTTTTCAGTTCCAGTTTTTCCGGCTTAGTACGTTTGTTCTTCGTGGTGGTATAGAAGTGACCAGTACCAGCAGAAGAAACCAGCTTGATTTTCTCACGAATACCTTTAGCCATGATTTATTTCCTCTAAGTACTTAGTACTTTTCGCCACGGGCACGCAGTTCGGACCAGAAACTGTTTCGATGCCTTTCTTATCAATTACACGCATACCTTTAGCAGATACACGCAGGGTGACAAAACGCTTCTCGCTCTCAACCCAGAAACGGTGAGAGTGCAGGTTAGGCAGGAAACGGCGTTTAGTCGCGTTCAGTGCGTGGGAACGGTTGTTACCGGTCACCGGACGCTTGCCAGTAACTTGGCAGACTCGGGACATGTCTATTCTCCAAAAATCAAATTAGCTCGAGCTTCGTATGGGGTATTGGCGCCTCGTCAGGCTTCTCAGCCTGGTTATCGCAGTTCATTGTGAACTCTCGATTGCCAGGCCCAAATGCCAAACCCGAGATTCTCAAAGGTGGCGTAGTATACGCTGTGTAAGCGATGTGCTCAAGTCCCGAACAGACAAAGATCCCGATGGATCGCGCGAAGTGTTCTAAATCCAACCACGTTCGGCAAAAGAAACGTACTCTCCGCGCCCAATCACAAGGTGATCCAGTACTCGTATATCCATGAACTGACAACATTTTACGACACGTTCGGTGATGAGTTTATCCGCTTTGCTCGGTTCTGCACACCCCGAGGGGTGATTATGCGCAAGGATCACCGCAGAAGCATTTAATTTTATCCGCCTCGCGGATAATCTCGCGCGGATGCACCTCGACATGATTCAGCGTGCCGGAAAAAAGACGGCTGTGTTTAATCACCCGGTGCCGGGGCATCCAGAAAGATCACAAGAAAGATCTCCCGCTCTTCCCCCCGCCAACTGGCTTTGCAGAAATTCACGCGTCATTTCCGGGCTCAACAACGCGTTTTCCTTCATCATTGCGCGCCTTGTAATAGCGCCTTGCCAGTTCGGCAATCCCCTTAAGCTGCGCATACTTAGCGATACCGATACCTGTACCGCGAAATTGCGCGAAATCCGCAGACAGCAACCCATAGAGCGACCCACTGCGTTGCAGCGTTTCATGCGCCAGAGCCATCACATCTTTTCCTGGCACTCCTGTACGTAAAAAGAGCGCCAGCAGTTCAACATCCGTCAGCGAACCAATTCCTGACTTCAACATTTTTTCCCGGGGCATTAAACGTTCCGGCGCATCCATACGTTCACCTCCTTAAGCACGCCTCATGGTGGCATAGCTCCTGGGGTGAATCGACGCCCGTATTTCACTCCTGCGTAGTGCCTCGCAAAGTGAATTACGGGCAACTGCACGACGGGATTCAGGATTGTGATAAAATGTCCGCCTTCTGGTGCAATCCAACAGGAAAGATCATGATGAGCCTGGCTGGCAAAAAAATCGTTCTCGGCGTGAGCGGCGGCATTGCTGCCTATAAAGCCCCTGAACTGGTGCGTCGTTTACGCGAACGCGGCGCCGACGTCCGCGTGGCGATGACCGAAGCGGCAAAAGCCTTTATCACGCCCCTGAGCCTACAGGCGGTGTCGGGTTATCCCGTCTCTGACAGTCTTGCTTGACCCCTGCCGCAGAAGCCGCGATGGGCCATATTGAGCTGGGGAAATGGGCGGATTTAGTTATCCTCGCGCCAGCCACGGCAGATCTGATCGCTCGCGTCGCTGCCGGTATGGCCAACGATCTGGTTTCAACGATTTGTCTGGCGACACCCGCCCCGATTGCCGTACTTCCGGCCATGAACCAGCAAATGTATCGCGTCGCCGCCACGCAACATAATCTGGATGTACTCGCGTCACGCGGAATGCTCATCTGGGGGCCGGACAGCGGCAGCCAGGCGTGTGGAGACGTGGGGCCAGGACGCATGCTCGACCCCGTTGACCATTGTGGATATGGCGGCAGCACATTTATCCCCTGTCAACGATCTGCAACATCTCAACATCATGATTACGGCGGGCCCGACGCGCGAGCCGCTCGATCCGGTGCGTTATATCTCTAACCACAGCTCCGGTAAGATGGGATTTGCCATTGCCGCCGCCGCCGCACGACGCGGCGCCAACGTTACGCTGGTTTCCGGCCCCGTTGCGCTGCCCACGCCACCTTTTGTTCAGCGCATTGATGTGATGACCGCGCTGGAAATGGAAGCGGCAGTCCAGGGCAGCAGTTCAGAAGCAGCATATTTTCATCGGCTGTGCTGCGGTTGCTGACTACCGTGCAGCCGCCGTCGCCAGTGAAAAAATTAAGAAACAGGCGACGCAGGGCGATGAATTAACAGTAAAAATGGTCAAGAACCCTGATATTGTCGCCGGGGTCGCCGCACTCAAAGACAAGCGTCCTTATGTCGTTGGGTTTGCCGCGGAAACAAATAATGTGGAAGAATACGCCCGGCAAAAACGTATCCGCAAAAACCTTGATCTGATCTGCGCGAACGATGTTTCGCTTTCAACTCAAGGATTTAATAGCGACAGCAACGCATTACACCTTTTCTGGCAGGATGGAGATAAAGCCTTACCGCTCGAACGGAAAGCGCTCCTGGGCCAATTATTACTCGACGAGATCGTGACCCGTTATGATGAAAAAAATCGACGTTAAGATTCTGGACCCGCGCGTTGGGCAGCAATTTCCGCTTCCAACTTATGCCACCTCCGGCTCCGCCGGACTTGACCTGCGCGCCTGTCTCGATGACGCCGTAGAACTGGCGCCTGGCGCAACAACGCTGGTGCCGACCGGTCTGGCGATTCATATTGCCGATCCGTCTCTGGCGGCGGTAATGCTGCCACGTTCCGGCCTGGGCCATAAGCATGGTATTGTGCTCCGGCAATCTGGTCGGCCTGATCGACTCTGACTATCAGGGTCAGTTAATGGTTTCCATCTGGAACCGTGGCCAGGACAGCTTTACCATTGAGCCAGGCGAACGTATTGCTCAGATGGTTTTCGTACCGGTTGTACAAGCCGAATTTAATCTGGTGGAAGAGTTTGAAGCCACCGACCGTGGTGAAGGCGGCTTCGGCCATTCTGGTCGCAAGTAA
Protein sequences of DBSCAN-SWA_6 >LR134204|2295709:2300299|2298143_2298464_-|VEB91302.1|DBSCAN-SWA MDAPERLMPREKMLKSGIGSLTDVELLALFLRTGVPGKDVMALAHETLQRSGSLYGLLSADFAQFRGTGIGIAKYAQLKGIAELARRYYKARNDEGKRVVEPGNDA >LR134204|2295709:2300299|2296374_2296578_-|VEB91294.1|DBSCAN-SWA MDNKLVVGVGNIYASESLFAAGIHPDRLASSLSQAECELLARVIKAVLLRSIEQGGTTLKDFLQKRR >LR134204|2295709:2300299|2299839_2300115_+|VEB91312.1|DBSCAN-SWA MMKKIDVKILDPRVGQQFPLPTYATSGSAGLDLRACLDDAVELAPGATTLVPTGLAIHIADPSLAAVMLPRSGLGHKHGIVLRQSGRPDRL >LR134204|2295709:2300299|2299550_2299862_+|VEB91310.1|DBSCAN-SWA MVKNPDIVAGVAALKDKRPYVVGFAAETNNVEEYARQKRIRKNLDLICANDVSLSTQGFNSDSNALHLFWQDGDKALPLERKALLGQLLLDEIVTRYDEKNRR >LR134204|2295709:2300299|2296806_2297055_-|VEB91296.1|DBSCAN-SWA MPELPEVETSRRGIEPHLVGATILHANVRNGRLRWPVSEEIYRLSDKPVLSVQRRAKYLLLELPEWLDHHSSGNVGQFAYPS >LR134204|2295709:2300299|2298855_2299269_+|VEB91306.1|DBSCAN-SWA MGHIELGKWADLVILAPATADLIARVAAGMANDLVSTICLATPAPIAVLPAMNQQMYRVAATQHNLDVLASRGMLIWGPDSGSQACGDVGPGRMLDPVDHCGYGGSTFIPCQRSATSQHHDYGGPDARAARSGALYL >LR134204|2295709:2300299|2297154_2297322_-|VEB91298.1|DBSCAN-SWA MAKGIREKIKLVSSAGTGHFYTTTKNKRTKPEKLELKKFDPVVRQHVIYKEAKIK >LR134204|2295709:2300299|2299213_2299477_+|VEB91308.1|DBSCAN-SWA MITAGPTREPLDPVRYISNHSSGKMGFAIAAAAARRGANVTLVSGPVALPTPPFVQRIDVMTALEMEAAVQGSSSEAAYFHRLCCGC >LR134204|2295709:2300299|2300071_2300299_+|VEB91314.1|DBSCAN-SWA MVLCSGNLVGLIDSDYQGQLMVSIWNRGQDSFTIEPGERIAQMVFVPVVQAEFNLVEEFEATDRGEGGFGHSGRK >LR134204|2295709:2300299|2297314_2297581_-|VEB91300.1|DBSCAN-SWA MSRVCQVTGKRPVTGNNRSHALNATKRRFLPNLHSHRFWVESEKRFVTLRVSAKGMRVIDKKGIETVSGPNCVPVAKSTKYLEEINHG >LR134204|2295709:2300299|2295709_2296189_+|VEB91292.1|DBSCAN-SWA MQKRAIYPGTFDPITNGHIDIVTRATQMFDHVILAIAASPSKKPMFTLEERVALAQQATAHLGNVEVMGFSDLMANFARIQQANILIRGLRAVADFEYEMQLAHMNRHLMPQLESVFLMPSKEWSFISSSLVKEVARHQGDVTHFLPENVHQALMDKLK >LR134204|2295709:2300299|2298634_2298835_+|VEB91304.1|DBSCAN-SWA MMSLAGKKIVLGVSGGIAAYKAPELVRRLRERGADVRVAMTEAAKAFITPLSLQAVSGYPVSDSLA |
12 | uncultured_Mediterranean_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2759597 : 2770398
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >LR134204|2759597:2770398|DBSCAN-SWA ATTACAGCATTGGTTTGAGGAAACGGGCAGTATGTGATGCTTCACACTCTGCGACCGTCTCCGGCGTCCCGGAAACAAGAATTTCGCCGCCGCCGCTGCCGCCTTCCGGGCCGAGATCGACAATCCAGTCCGCCGTCTTAATGACGTCCAGGTTGTGCTCAATCACCACAATGGTGTTGCCCTGGTCGCGCAACTGGTGCAGCACGTCCAGCAGTTGCTGAATATCGGCAAAATGCAGACCGGTGGTCGGTTCATCCAGAATGTACAGCGTCTGCCCGGTGCCGCGTTTCGACAATTCACGCGCCAGCTTCACGCGCTGTGCTTCACCGCCGGAGAGCGTGGTGGCGGACTGACCGAGACGGATGTAGGTCAGACCCACGTCCATCAACGTTTGCAGCTTACGCGCCAGCGCCGGAACGGCGTCAAAAAATTCGCGCGCTTCTTCGATGGTCATATCCAGCACTTCGTGGATAGTCTTGCCCTTGTACTTAATCTCCAGCGTTTCGCGGTTATAGCGCTTGCCTTTGCACTGGTCGCACGGTACGTAAATATCCGGCAGGAAGTGCATTTCAACCTTGATGACGCCGTCGCCCTGACAGGCTTCGCAACGCCCGCCGCGCACGTTAAAGCTGAAACGTCCCGGCGTATAGCCGCGTGAGCGTGATTCCGGCACGCCAGCGAACAGCTCACGTACGGGCGTAAACACGCCGGTATACGTTGCCGGGTTAGAGCGCGGCGTACGGCCAATCGGACTCTGGTCGATGTCGATGACTTTGTCGAAATGCTCCAGCCCCTGAATATCACGATACGGCGCAGGTTCCGCAATGGTCGCCCCGTTTAACTGGCGCTGGGCAATCGGGAACAGCGTGTCGTTGATAAGCGTAGACTTACCCGACCCGGATACGCCGGTAATACAGGTAAACAGGCCAACCGGCAGCGTCAGGGTGACGTCTTTCAGGTTGTTGCCGCGCGCGCCGGTCAGCTTCAGCACTTTCTCCGGGTTCGCGGGAACGCGCTGTTTCGGCACTTCAATCTTGCGTTTGCCGCTCATGTACTGGCCGGTGAGGGATTCCGGCACCGCCATAATCGCCTCCAGCGGCCCTTCGGCCACGACCTGACCGCCGTGTACGCCAGCGCCGGGGCCGATATCGATCACATGGTCGGCGGCGCGGATGGCGTCTTCGTCATGCTCCACCACAATCACGGTATTGCCGAGGTTACGCAGGTGAATCAGCGTTCCCAGCAAACGTTCGTTATCGCGCTGATGGAGACCGATAGACGGTTCATCCAGCACATACATCACGCCTACCAGCCCGGCGCCAATCTGGCTGGCCAGACGGATACGCTGGGCTTCGCCGCCGGACAAGGTCTCTGCCGAACGGGACAGCGTCAGGTAGTTGAGGCCGACGTTTACCAGGAACTTGAGGCGGTCGCCAATCTCCTTGAGGATCTTCTCCGCAATCTTTGCACGCTGGCCTGCGAGCTTGAGATTGTTGAAGAAGTCCATCGCATGGCCGATGCTCATGTCGGAGATCGCAGGCAGCGGAGTATTCTCGACGAAGACATGACGTGCTTCACGGCGCAGACGCGTGCCTTCACAACTGGCGCACGGACGGTTGCTGATGAATTTAGCTAACTCTTCACGTACCGCGCTGGACTCCGTCTCTTTGTAACGGCGCTCCATGTTGTGCAGCACGCCTTCGAACGGATGGCGACGCACGGAGGTGTCGCCACGATCGTTGATGTATTTGAACTCGATGCTCTCTTTGCCGGAACCGTACAGAACGACTTTATGCACATTCTGACTCAGACTTCCCCACGGCGCTTCCACGTCGAACTTATAGTGTTCCGCCAGCGATTTCAGCATCTGGAAATAATAGAAGTTACGGCGATCCCAGCCGCGAATCGCGCCGCCAGCCAGAGAAAGCTCCGGGTTCTGGATCACGCGATCGGGATCGAAATATTGCTGTACGCCGAGGCCGTCGCAGGTCGGGCAGGCTCCCGCCGGGTTGTTAAACGAGAACAGACGCGGTTCCAGCTCGCGCATACTGTAGCCGCAAATTGGGCAGGCGAAGTTGGCGGAAAAGAGCAGCTCTTCCGCTTTCGTATCGTCCATATCCGCGACGATCGCGGTGCCACCGGAGAGTTCCAGCGCGGTTTCAAAAGATTCCGCCAGGCGCTGGGACAGATCGCTACGGACTTTGAAACGATCGATCACGACCTCAATGGTGTGTTTCTTTTGCAGCTCCAGCTTCGGCGGATCGGAGAGATCGCAGACCTCGCCATCAATACGGGCGCGGATGTAGCCCTGGCTGGCCAGGTTTTCCAGCGTTTTGGCGTGCTCGCCTTTGCGCTCTTTAATAATGGGCGCCAGCAGCATCAGGCGTTTGCCTTCCGGCTGAGACAGCACGTTATCCACCATCTGGCTGACGGTTTGCGCCGCCAGCGGGACGTCATGATCCGGGCAGCGCGGTTCACCCACGCGGGCGAACAGCAGACGCAGATAGTCATGTATTTCGGTAATTGTACCTACCGTAGAGCGCGGGTTATGGGATGTCGATTTCTGCTCAATTGAGATAGCGGGCGACAGCCCCTCAATATGGTCGACATCTGGTTTTTTCCATCAAAGACAGAAACTGACGCGCATACGCGGAAAGGGATTCAACGTAACGACGCTGCCCTTCGGCATACAAGGTGTCAAAAGCGAGCGAGGATTTGCCAGAACCCGAAAGCCCGGTGACGACAATCAGTTTGTCGCGCGGGATGACGAGGTTGATGTTTTTGAGATTATGGGTGCGGGCGCCCCGAACTTCGATCTTATCCATTCACCTTTCCCGGATTAAAACGCTTTTTCCCGGCCGCATGGCGCTACCGGCGATCACAAACGGTTAATTATGACACAACTCAACCTGAATGGATATACAGTATTGGAATGCAAAACACAGGCTACTGTGTAACAATGTCTGCCCAAGGTTGTTTACTGGAATCAGCCTCGCATCGTAGTAAAAACGCTATTGGTAATGCTACAATCGCGCGTTTACACTTATTCAGAAACGTTTTTCAGGAGACTCGATCATGGCCAGCAGAGGCGTAAACAAGGTGATTCTCGTCGGTAATCTGGGCCAGGACCCGGAAGTACGCTACATGCCAAATGGTGGCGCAGTTGCCAACATTACGCTGGCTACTTCCGAATCCTGGCGTGATAAGCAGACCGGTGAGATGAAAGAGCAGACGGAATGGCACCGCGTTGTGCTGTTCGGCAAACTGGCGGAAGTGGCCAGCGAATATCTGCGTAAAGGTTCTCAGGTTTACATTGAAGGTCAGCTGCGTACCCGTAAATGGACCGATCAGTCCGGTCAGGACAAATACACCACTGAAGTGGTGGTCAACGTTGGCGGCACCATGCAGATGCTGGGTGGCCGTCAGGGCGGTGGCGCTCCGGCTGGCGGCAATATGGGCGGCGGTCAGCAGCAGGGCGGTTGGGGTCAGCCTCAGCAACCGCAGGGCGGCAACCAGTTCAGCGGCGGCGCGCAGTCCCGTCCGCAGCAACAATCCGCTCCGGCACCGTCTAACGAACCGCCAATGGATTTCGACGACGACATCCCGTTCTGATGAATTAAAAAGGCTCCTTCACGGGAGCTTTTTTTTGGCTGAAATAGCGTGCAATTAAATAGCAGGCTTGTCAGTTTTGTCACCGGACGTAAATATATCTACAGACTTTATGGGTTATTTCCTTTCTTGCTTATGATCAAAAAAGACAGGGTATTATTGATTATTATTCCCCCGTTATTTTGGTCAAATGAATGCTGAATCTTGCCAGGTCTTTTCAGGGATATGTGATTGGCCTGTCAGCTAACTTAAGGAAATCATCATGAATAAAGTCATTTTAATGCCCGTTGATATTCTGGCGATGGATCTCTCTGACAAAGCCATTGCCTACGCTGATCATATGGTGGACAAAGAAAACGGCGTGATCCATCTGCTGCATGTGTTCCCTAAAACTGGGTACCTCCCCAATGCGTGGTTTTGCCTCGGATATCAGAAAATACGAGGAATACATGACGAATGACGCGCAAGAAAAAATGCTTAACCTTGCGAGAAAATTCAAAACGCCGTTGGAGAATATTCGTTTTGAAGTCCGTCATGGCAATATTCGTGATGAAGTAAATAACGCGGTAGAAGCGCATAACGCCGAGATGATTATCATCGGTTCGCGCAAGCCGGGTATTGCAACGCATTTGCTCGGTTCGGCTGCGGCGAATATTTTTGCGATACGCGAAAATTCCAGTGATGGTCATTCGTTAATCTGTCATGCCCGGTGGCGCTTTGCTTACCGGGCTGACAAAATCCCTCTCCCGTAGGCCGGATAAGGCGTTTGTGCCGCCATCCGGCGTCATGCCCGGTGGCGCTTTGGCTTACCGGGCCGACAAAACCTCTCTCCCGTAGGCCGGATAAGGCGCTTGCGCCGCCATCCGGCGTTATGCCCGGTGGCGCTTCGCTTACCGGGCCTACAAAATCCCTCTCCCGTAGGCCGGGTAAGGCGCTTGTGCCGCCATCCGTCGCCATGCCCGGTGGCGCTTTGCTTACCGGGCCTACAAAATCCCTCTCCCGTAGGCCGGATAAGGCGTTTGTGCCGCCATCCGGCGTCATGCCCGGTGGCGCTTTGCTTACCGGGCCGACAAAACCTCTCTCCCGTAGGCCGGATAAGGCGCTTGCGCCGCCATCCGGCGTTATGCCCGGTGGCGCTTCGCTTACCGGGCCTACAAAATCCCTCTCCCGTAGGCCGGGTAAGGCGCTTGTGCCGCCATCCGTCGCCATGCCCGGTGGCGCTTCGCTTACCGGGCCTACAAAACCCTTCCCCCATCGGCAAAAAAAAAGCCTCCGCTACGGGAGGCTTTTTGCTTCGCGGTGAACCGCTGGCTATTTCCGGCGGAATTTACGGGTGAGCGGTTTCTCTATCTCCACAATGCAGAACAGCACTGCGCCGATAGCGAAGGTTATGCCCCAGTAACGCAGCGGCAGCGCTTCTGTGCCAAACAGCATCTGCATGAAAGGCATATAGATAATCGCCATTTGTAACAGCAACAGAATGCCGCTGACCATCCAGATGCCTTTATTCATCAGCAGCCCGCGTCCCAGCGAGAAACCATCCGAAACCCGACAGTTGAGCATATATACCCACTGTGCGGTGACCAGCGTCTGCAACAGCACGGTACGGATAAACTCCGGGCTGTGGCCGCGAGGCTGCAACCAGGCTTCCAGCGCAAAGGCGCAGGCTGCAATTAACGTTCCGACAAAACCGACGCGCCAGATAGCAAAGGCATCCATGACGTTCTCATTACTCTGGCGTGGCGGGCGTCGCATGATGTTGCGCTCACCGGCTTCAAATGCCAGCCCGAACGACAGCGTGGCGGAGGTCGCCATGTTCATCCACAGAATCAGTACCGGCGTGAGCGGGATCAGGTTCCCAGCCAACAGGGCAATGATGATCAGCAGACCCTGTGCCAGGTTGGTGGGCATGATGAACAGAATCGTTTTCTTCAGGTTGTCGTAAACGCGGACGCCCCTCCTGCACGGCGCTGGCGATGGTGGCGAAGTTATCGTCCGTCAGTACCATATCAGCGGCTTCTTTTGTCACTTCCGTGCCTTTGATGCCCATCGCGATGCCGACGTCCGCCTGCTTCAACGCGGGGGCGTCGTTGACTCCGTCGCCGGTCATACCGACAATCTCGCCTTTGTTTTGCAGCGCCTTCACCAGACGCAGCTTATGTTCAGGGCTGGTGCGGGCAAAGATGTCGTAGGTGACAGCGGCTTCGGCTAATGCCGCATCGTCCATCTGCTCCAGTTCGTAGCCCGTTACCGCGTGGGTGCTGTTATGAATCCCCAGCATGCCGCCGATACTCATGGCCGTTTGCGGGTGATCGCCGGTGATCATCTTCACGCGAATCCCCGCCTGCTGGCAGACCTGGATTGCGTCAATGGCCTCTGGACGCGGTGGGTCCATCATGCCTGCGATGCCCAGGAAGATCAGGCCGTCATTCAGGCACTCATGGGTGAGAGAGGCGGCTTCTGCACGCTCTGGCTTCCAGGCTGCCGCAACCATGCGCAGACCTTCTTTCGCATAGCGCGCGATCTCCGCTTCCCAGTGATTTTGCGTAAGGGCTTCAGTACCGGTAGCGGTTTGCTGTAGCTGGCACAGTTTAAACAACACATCTGGTGCGCCGGTCACCAGTACGCGCTCTTCATTTCCAATGCGATGGTGCGTCGCCATGTATTTGTACTGCGAATCGAACGGGATTTTGCTGCGCAGTTCAGTTTCAATGGCGGGAAGGCGCGCTTTTGCCGCCAGCACTTTTAAGGCCCCTTCCGTTGGCCCGCCGGTGATGCCCCAGTGACCGTTTTCGTCCTGAATCAATTGGCTGTCATTACACAGGTCGATAGTGCGAAGGTAGTTCTCCAGCAGGCTGCCGGGCAGGATCTCTACCGCAGCGTCGCTCTCCTCAGTATGGATCTCGCCCATTGGCTCATAGCTGTTGCCCTGAACGCGGTAATTTTTGTCCGCCGTAATAATGGCTTTTACCGTCATCTCATTCATGGTCAGGGTGCCCGTTTTATCGGAACAGATCACGGACATCGCACCCAGCGTCTCGACCGTTGGCAGCTTGCGGATAATGGCGCGTTTGCGGGCCATCGTCTGTACGCCGAGCGACAGAATAATTGAGATGATCGCGGGCAGCCCTTCCGGTACGGCCGCGACGGCAAGGCTAATCAGAGAAAGCAGCAGTTCGCCCATCGGCATATCGCGCAGCAGCAGGCTGAACACAAATAGTCCTGCCATCATGGCGACGATAATGGCGAAAATGGCTTTGCCGAGTTTATCCATCTGAACCAGCAGCGGGGTACGGTGCTTCTCAATGCCCGTCATCATCTGGTTGATGTGGCCGAGTTCAGTCTCTTCCCCGGTAGCGATAACCACGCCCAGTCCGGCGCCTGCGCTGATGGTGGTGCCGGAGAACAGCAGATTTTTACGATCGCCTAACGGCAGTTCGCCGGTAAGGGTGTCGGCTGTTTTTATCGACAACGGTGGACTCGCCCGTCAGGATGGCCTCTTCAACCCGCAGGTTATGGGCTTCCATAACGCGAAGATCTGCCGGGATGCGATCGCCTGCGCGCAGCACGACGATATCGCCCACCACCAGTTCGGTTGTGGGGACGGTTTCATGTTGACCGTTGCGGATCGCGACCGCTGAGTTCGACAGCATATTGCGGATGCTTTTCAGCGACTTTTCCGCGTTATTTTCCTGAATGTGGCCAATCAGCGCGTTGATAACGGCAACACCCAAAATGACGGCCGTATCCACCCAATGACCCATGACCGCAGTCAGCACCGCCGCAGCGATCAGGACATAAATCAGAACATCGTGGAAATGCGCCAGGAAGCGTAACCATGCCGGTTTGCCTGCTTTTTCTGGCAGCGCGTTAGGCCCATGCTGCGCGAGGCGCGCGCTGGCTTCATTTTGCGTCAGCCCGGAAGGCTGACTCTGCTTTTTTTCAAGTACCTCGTTAACAGATTGCTGGTACGGTTTTTTTTCAACGCATGGCGCAGAATGGAGACGATTTTTATTCGGCATTCTGGTCAATTTAAAATTCCTCGAAGTATAAAATGCATAATGTTTTGAATCGCGCTTATACTATTAAAATAAAGAGTCTGAATTATTGACATTAATCAATGGCGTTTCGTTAAAGATTATTTTATCTTGATGGTTAGATAACGCTTGTAGCGTATATTTTCAGTGTTTTGATGGAAGTGTAAATTTAGTAAAGGAGGTAATTATGTTAGGAATATATAGCGGTAGGCGATTGATGTTACTGGTTATTTTTGCAATTATCATCGCGGTAATTAGCGGGATTTCAACCTACGCCTTCGTTAAATATCTGGGTTGAAAAACACTAAATATAGTGTTCCTTACATAACTTTTAAATATTTAAAGTTAATGACAGAGAATAAAAATAAAGGGCATAAGAATATATTTTTGATGAGCAGTCTGACAGGATGAACAGACTGCTCTGGACAGGTAAAAATATGGCTAGCAGACTAACGAATAATACACATTGGTTTTCCTGTCAGCGGTTCAGGGTGAATTTCCGCATCAACCTGAAATACCTTGCGTAATAATTTTGGTTGCATAACATCAACAGGTTGCCCTTGCGCAATCACATTCCCCTGCGCCAGCACGACAAGATGATCGCAATAGCGACTTGCCTGATTCAGGTCGTGCAGTACCGTCACGACCGTTTTTCCCGCCTGTTTTAGCATCTGCATGAGCGCCATCAACTCCACCTGATGATTAATGTCGAGCCAGGTGGTGGGTTCATCCAGTAATACAATGGGCGTATCCTGCGCTAACACCATCGCCAGAAACACGCGCTGCCGCTGACCGCCGGACAGCTCCGTCAGCCGCCTTTCTGCCAGCGCGTCAGTGTGCGTCTGCGCCATTGCCCGGTTGACATGCTGATGGTCGTCGCGAGAAAGCCGCCCCCACAGCGGCAGCCACGGGCTGCGGCCATAGCTGACCAGTTCCCGCACGGTGATCCCTTCCGGCGTCATGTGCTGTTGCGGCAGTAGCGCCAGGTGACGCGAAAGCGCGCGGGGTGAGAATGCCGACAGCGGCTTACCCGCAATGTGCAAGGTGCCGGATTGCGGTATGAGCAGGCGGGCAAGGCATTTCAGCAGCGTCGATTTGCCGCAGCCATTAGGCCCCAGTAATGCGGTGATCTGCCCGGCGGGTAGCGTAACCGACAGGCCGTCAAGAATGCGTTTATCACCGTATCCGGCAGTCAGGTTTTCACATCTGATATCCATTTAGCGCATCCTCACAAGCAACCAGATAAACCACGGCGCGCCGATAATGGCGGTGAGCATGCCTGCCGGAAGCTCCGTGGGAGGGTTAATGATCCTTGCCAGCAAATCGGCCAGCGTTAAGACAAATGCGCCGATTAGCGCAGCGCCGGGCAGCAGCCAGTGATGGCGCCCTCCCAGCAGGCGTCGGGTCAGGTGGGCACGACCAGGCTGACGAAAGCTATCGGCCCGCAAACTGCCACGCTGGTGGCTGCCAGCGCGACGGCGAGGAGCAATCCCAGGCGCTGTGCCCGCCTGACCGGGACGCCGAGCGTGCTGGCGCGATCATCGCCCAGCGCCAGCAGATCCAGATCGCGGCAAAACCAGGCGCTGAGTGGAATAAGCGCCAGCATTACCGGCAGCGCCACATACACAAACGACCAGCCGCGTCCCCATAAACTGCCGGTCAGCCACAGCAGCGCGTTGTTAACGTCCTGCGGGCGGGAGAGCATCAGGTAATCCGTCAGGCTGGCCCAGGTTGCCGACAACGCCACGCCGATTAACGCCAGACGTAGGGGGGAGGAGCGACCCGCAATCGCACGCAGCAAGATCAGCGCACAAAGCCCGCCCACAAATGCCAGCAGCGGCAGCCAGACGACCGACAGCGATGGCAGCAGCATGAGTGCGCTTACCGTCGCCAGACTGGCGGCATGGTTCACCCCTAAAATATCCGGCGAGGCCAGCGGATTACGCACGATGCCCTGCACCAGTACGCCGGAAATTGCCAGGGCAGCGCCGACGAATATCGCCAGCAGCAGTCGCGGCAGACGATACTGAGTCAGAACATAGTGGTGCTCGTGATCGGTATGCCAGCCATCGATCAGCGCGGACCAGGGCAGAGGAACCACGCCCAGACGCAATGACAGCAGCGCGGTGATACTCAGCAGGAGCAGCGTGATGAGCAGAATAAAGGCTTTCATCCACGCCTCCTTGCCAGCCAGACAAAACACGGCGCGCCAATCAGCGCCAGCACTGCGCCTGCCGGAAGCTCGCCCGGCCAGGCCAGCGCTCGCGCCAGAATATCCGCCAGCAGCATAAACGTTGCGCCCATCAGCAGGCTCATCGGCAGCAACTTACGCTGGTCGTGACCCGACCACGCCCTGGCCAGATGCGGCATTAACAGGCCGATAAACGCCACCGGCCCCGCCACGCTGACGCAGGCGCCGACCAGCACCAGCACGCCGATATTGACGGTAAGCCGCAGCCGAAACAGGTTTACGCCGAGCGTGTGTGCAGCGACATCGCTGACGTTGAGCAGATTAAGGGCGTTCGCCAGCAGCAGCATGACGGGAATAATCACCAGAACAAAAGGAAAGAGCTGCCAGAACTCGGCCCAGCGCACATGGGCAATACCGCCCGCCAGCCATGAAAAAATGCCGTAAGCGTGGTCTTCCGCCAATAGCAGCGTGATGCGGGTCAGCGCCATACACAGCGCGGAGAGCGCGATTCCGGCGAGAATCAGTTTGTTGCGGTCGGGCTCCTGTCGCCAGCCGCCGCCAGCCAGCATCACCAGAAGCCAGGCGAGGCCTCCGCCTGCGGCGGCGATAAACGCGATGGAATACCCGCTGAGCAGCATGGGGCTGAATGCGCTGGTTAACGCTATCGCCAGTGCCGCACCGCTGTTAATGCCGGTCAGAGAAGGAGAGGCTAGTGGGTTATGGGTCAGGGTTTGCAGCAATGTTCCCGCCAGGGCAAGGCTGGCGCCAATCAGCAATGCCACCAGGCTGCGCGGCAGACGTAAATTACGCACCAACGCCTCCGAAAGCGGAGGCGTTGAAGAAGGAAACAGGGCATGCAGAGCACTGACGGGTGAAATCGGGATCGCCGAGTAGCAGAACAGGCTCAGCCAGAACGCCGCCAGGAGCGCCAGTACGGGCGCTCCCCAAATGATCAATGTCCTCAT
Protein sequences of DBSCAN-SWA_7 >LR134204|2759597:2770398|2769411_2770398_-|VEB92329.1|DBSCAN-SWA MRTLIIWGAPVLALLAAFWLSLFCYSAIPISPVSALHALFPSSTPPLSEALVRNLRLPRSLVALLIGASLALAGTLLQTLTHNPLASPSLTGINSGAALAIALTSAFSPMLLSGYSIAFIAAAGGGLAWLLVMLAGGGWRQEPDRNKLILAGIALSALCMALTRITLLLAEDHAYGIFSWLAGGIAHVRWAEFWQLFPFVLVIIPVMLLLANALNLLNVSDVAAHTLGVNLFRLRLTVNIGVLVLVGACVSVAGPVAFIGLLMPHLARAWSGHDQRKLLPMSLLMGATFMLLADILARALAWPGELPAGAVLALIGAPCFVWLARRRG >LR134204|2759597:2770398|2767429_2767540_+|VEB92323.1|DBSCAN-SWA MLGIYSGRRLMLLVIFAIIIAVISGISTYAFVKYLG >LR134204|2759597:2770398|2762671_2763208_+|VEB92309.1|DBSCAN-SWA MASRGVNKVILVGNLGQDPEVRYMPNGGAVANITLATSESWRDKQTGEMKEQTEWHRVVLFGKLAEVASEYLRKGSQVYIEGQLRTRKWTDQSGQDKYTTEVVVNVGGTMQMLGGRQGGGAPAGGNMGGGQQQGGWGQPQQPQGGNQFSGGAQSRPQQQSAPAPSNEPPMDFDDDIPF >LR134204|2759597:2770398|2764979_2766677_-|VEB92319.1|DBSCAN-SWA MSIKTADTLTGELPLGDRKNLLFSGTTISAGAGLGVVIATGEETELGHINQMMTGIEKHRTPLLVQMDKLGKAIFAIIVAMMAGLFVFSLLLRDMPMGELLLSLISLAVAAVPEGLPAIISIILSLGVQTMARKRAIIRKLPTVETLGAMSVICSDKTGTLTMNEMTVKAIITADKNYRVQGNSYEPMGEIHTEESDAAVEILPGSLLENYLRTIDLCNDSQLIQDENGHWGITGGPTEGALKVLAAKARLPAIETELRSKIPFDSQYKYMATHHRIGNEERVLVTGAPDVLFKLCQLQQTATGTEALTQNHWEAEIARYAKEGLRMVAAAWKPERAEAASLTHECLNDGLIFLGIAGMMDPPRPEAIDAIQVCQQAGIRVKMITGDHPQTAMSIGGMLGIHNSTHAVTGYELEQMDDAALAEAAVTYDIFARTSPEHKLRLVKALQNKGEIVGMTGDGVNDAPALKQADVGIAMGIKGTEVTKEAADMVLTDDNFATIASAVQEGRPRLRQPEENDSVHHAHQPGTGSADHHCPVGWEPDPAHAGTDSVDEHGDLRHAVVRAGI >LR134204|2759597:2770398|2759597_2762078_-|VEB92305.1|DBSCAN-SWA MGEPRCPDHDVPLAAQTVSQMVDNVLSQPEGKRLMLLAPIIKERKGEHAKTLENLASQGYIRARIDGEVCDLSDPPKLELQKKHTIEVVIDRFKVRSDLSQRLAESFETALELSGGTAIVADMDDTKAEELLFSANFACPICGYSMRELEPRLFSFNNPAGACPTCDGLGVQQYFDPDRVIQNPELSLAGGAIRGWDRRNFYYFQMLKSLAEHYKFDVEAPWGSLSQNVHKVVLYGSGKESIEFKYINDRGDTSVRRHPFEGVLHNMERRYKETESSAVREELAKFISNRPCASCEGTRLRREARHVFVENTPLPAISDMSIGHAMDFFNNLKLAGQRAKIAEKILKEIGDRLKFLVNVGLNYLTLSRSAETLSGGEAQRIRLASQIGAGLVGVMYVLDEPSIGLHQRDNERLLGTLIHLRNLGNTVIVVEHDEDAIRAADHVIDIGPGAGVHGGQVVAEGPLEAIMAVPESLTGQYMSGKRKIEVPKQRVPANPEKVLKLTGARGNNLKDVTLTLPVGLFTCITGVSGSGKSTLINDTLFPIAQRQLNGATIAEPAPYRDIQGLEHFDKVIDIDQSPIGRTPRSNPATYTGVFTPVRELFAGVPESRSRGYTPGRFSFNVRGGRCEACQGDGVIKVEMHFLPDIYVPCDQCKGKRYNRETLEIKYKGKTIHEVLDMTIEEAREFFDAVPALARKLQTLMDVGLTYIRLGQSATTLSGGEAQRVKLARELSKRGTGQTLYILDEPTTGLHFADIQQLLDVLHQLRDQGNTIVVIEHNLDVIKTADWIVDLGPEGGSGGGEILVSGTPETVAECEASHTARFLKPML >LR134204|2759597:2770398|2764516_2765020_-|VEB92317.1|DBSCAN-SWA MATSATLSFGLAFEAGERNIMRRPPRQSNENVMDAFAIWRVGFVGTLIAACAFALEAWLQPRGHSPEFIRTVLLQTLVTAQWVYMLNCRVSDGFSLGRGLLMNKGIWMVSGILLLLQMAIIYMPFMQMLFGTEALPLRYWGITFAIGAVLFCIVEIEKPLTRKFRRK >LR134204|2759597:2770398|2762190_2762421_-|VEB92307.1|DBSCAN-SWA MDKIEVRGARTHNLKNINLVIPRDKLIVVTGLSGSGKSSLAFDTLYAEGQRRYVESLSAYARQFLSLMEKTRCRPY >LR134204|2759597:2770398|2763989_2764466_-|VEB92315.1|DBSCAN-SWA MPMGEGFCRPGKRSATGHGDGWRHKRLTRPTGEGFCRPGKRSATGHNAGWRRKRLIRPTGERFCRPGKQSATGHDAGWRHKRLIRPTGEGFCRPGKQSATGHGDGWRHKRLTRPTGEGFCRPGKRSATGHNAGWRRKRLIRPTGERFCRPGKPKRHRA >LR134204|2759597:2770398|2763654_2763957_+|VEB92313.1|DBSCAN-SWA MTNDAQEKMLNLARKFKTPLENIRFEVRHGNIRDEVNNAVEAHNAEMIIIGSRKPGIATHLLGSAAANIFAIRENSSDGHSLICHARWRFAYRADKIPLP >LR134204|2759597:2770398|2763467_2763665_+|VEB92311.1|DBSCAN-SWA MNKVILMPVDILAMDLSDKAIAYADHMVDKENGVIHLLHVFPKTGYLPNAWFCLGYQKIRGIHDE >LR134204|2759597:2770398|2766621_2767236_-|VEB92321.1|DBSCAN-SWA MTRMPNKNRLHSAPCVEKKPYQQSVNEVLEKKQSQPSGLTQNEASARLAQHGPNALPEKAGKPAWLRFLAHFHDVLIYVLIAAAVLTAVMGHWVDTAVILGVAVINALIGHIQENNAEKSLKSIRNMLSNSAVAIRNGQHETVPTTELVVGDIVVLRAGDRIPADLRVMEAHNLRVEEAILTGESTVVDKNSRHPYRRTAVRRS >LR134204|2759597:2770398|2767691_2768459_-|VEB92325.1|DBSCAN-SWA MDIRCENLTAGYGDKRILDGLSVTLPAGQITALLGPNGCGKSTLLKCLARLLIPQSGTLHIAGKPLSAFSPRALSRHLALLPQQHMTPEGITVRELVSYGRSPWLPLWGRLSRDDHQHVNRAMAQTHTDALAERRLTELSGGQRQRVFLAMVLAQDTPIVLLDEPTTWLDINHQVELMALMQMLKQAGKTVVTVLHDLNQASRYCDHLVVLAQGNVIAQGQPVDVMQPKLLRKVFQVDAEIHPEPLTGKPMCIIR >LR134204|2759597:2770398|2768647_2769415_-|VEB92327.1|DBSCAN-SWA MKAFILLITLLLLSITALLSLRLGVVPLPWSALIDGWHTDHEHHYVLTQYRLPRLLLAIFVGAALAISGVLVQGIVRNPLASPDILGVNHAASLATVSALMLLPSLSVVWLPLLAFVGGLCALILLRAIAGRSSPLRLALIGVALSATWASLTDYLMLSRPQDVNNALLWLTGSLWGRGWSFVYVALPVMLALIPLSAWFCRDLDLLALGDDRASTLGVPVRRAQRLGLLLAVALAATSVAVCGPIAFVSLVVPT |
13 | uncultured_Mediterranean_phage(28.57%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
2978903 : 2986788
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >LR134204|2978903:2986788|DBSCAN-SWA AATGAAAATCGTTAACGCAGAAGAAAAGCACATCCCCGCCATATGCGATATCTACGCCCATCATGTTATTCATGGTACCGCCAGCTTTGAAACTGAACCACCGGATACCCATGAGATGCTTGCCCGCCTGAAAAAGATCCGCAATCAGGCGCTGCCGTGGGTTGTCGCGTTAGAGGAAGAAAAGGTCATCGGGTATTGTTATCTCACTCGTTACCGTGAACGCTACGCCTATCGCCACACCCTCGAAGATTCAATTTATATTCACCCGGATGCGCAGCGTCAGGGGACAGGAAAAGCGTTGCTGCGCCACGTCATTGCCTGGGCAGAAACACACGGCTACCGGCAGATGATAGCCATTATCGGCGACAGCAATAATGAGGGTTCGCTGAAAGTCCATCAGCAGGTCGGATTTACAGAAAAAGGCACGCTGAAAGACATTGGTTTCAAGCATGGTCGCTGGCTGGACACGGTTTTGCTGCAACGAAATCTGGGGGAAGGGAACAGCTCGCTGCCATAAGATTCAGGCCTTATGCTCTGAAACAGGAAAAAGGGGGCATTCGCCCTCCTTCATATTTATTTCCGTCTTTAACTATTTCTGACGCCGATATTGAGAGAGTTTTCTTCTACCTACCGGAAATATCATCCGTTTTTCTCTTTCTGAAACCAATGCTTAACAGTTGAAAGGTAAGTTTGTATTCGAGCACATAGGTTGCTGGAGGGGAATAAATCCCGTCTCCCCCGAACTAAATTGTGACGTTTGCAACCGGGATACCGGAACCTGAGGAGATGCAAAAACGGTGAACACACCTCCACTTCTACTTAGCAGCAAGATTACCTGGTGCGTTGGTTTGGCCCAGTTAATTAACTGGGGGATTACCTACTACCTGTTGGGTGCTTTTGGTAGCGCCATTGCTAATGACACGTCGTGGGGGCAACCTCTTATCTTCTCAGGGTTGACGCTGGCTATGGGCATCATGGGGCTCATTTCACCGATATCCGGGCGATTATTAGCATCGATGGGGGGAAGAAAAGTCCTGCAGCTTGGTGCTCTCCTGAACGGCCTTGGATGCCTGCTCCTGGCAACGAGTCACTCCCTATACATTTACTTAATGGCCTGGCTGGTGATGGGCATAGGTATGCGCCTTTCACTTTATGATGCAGCTTTCGCTGTTCTCGTAAATCTTGCAGGGCCGACAGCCAGAAAATCTATTACACAGGTGACCCTGCTCGGAGGTCTGGCTTCTGTTGCCTTTTGGCCAATTGGCGAGGGCTTACTGAGCCTGCTGGGCTGGCGATGGGGTGTTGGCTGTTACAGCCTGTTCGCCCTGATAAGCATTTTGCTTTGTATTTCTTTGCCAGATAACAAAAGAGAACGAGGTGTAGCCCCCCGCGAATCGACGGAACAGGGCATCCAGCATCTTGAGCGGGACTATCTGAAATGCGGGTTGTATGCAGCATTGATGGCATTGCTCAGTTTCCTGTCCACAGGAATTTCAACCCATTTACCTCTGATTCTGGCCAGTGCTGGTGTCCCGGTACTTGTTGGTTCCTTATGGGGGGTGGGACAGGTCAGTGCACGTCTTGGCGATATAGCGATGGGGAGTCGAGGATCTGCTCTGGGGCTGAACCTGATAGTGGGGATTTTGCTTCCTTGTTGTTTCATGATCAGCCTGCTGGGACAGACATCCATCGTGGGGACCGGCTTGTTCGTTCTTGGATATGGTGCCGTCAACGGTCTATCTACTCTGGTGAGAGCAACATTACCATTAACTCTGTTTGACCCGCGCCTGTACGCCAGTCGCATGGGAACATTGCTCATGCCGAGTTTTTTTCTGTCGGCACTGGCCCCTTCACTCTATGCCAGCTTTCGCGAACGATTTGGGGATACTGGCATGCTGATCATTTCACTGGTATTTGCGTGTGTGGCAGCAATCATAGCCATATGGATCTATTTGATTGGAAAAGATAAACAGGCACTACAAAGAGTGCCTGTTGATATAACATCGAACGTTAATGAGTAGGCAAGCGCGTACGAGTCTCACAACTGCACTACACGCCTTTATCCAGTTGTTCCAGTTGCTGTTGTAAAGAAAGGCGATCTAGCTTCTCAAACGGCAAACTCAGAAACAGTCTGATGCGACTGGCTATTTGCTTCTCGACGTCCACAAATGCCTGGTACATTGCTTCCGAATCGCCTCTGCGAGTGCGGGATCTTTGAATCCCCAATGGGCCGTCAGCGGTTGGCCTGGAAATACCGGGCATCCTTCTCCAGCCATTTTGTCACACAAGGTGAAAATGAAATCCATTTTTGGTGAGTCGACTTGCTGATACTGCAGGAGACTTTTACTTGAAAGGCTGGAGATGTCAAACCCCTGACTTTTCAATACTTTTAGCGTCAAAGGGTGCGGTGATGAAGCGGGATGGCTGCCGGCGCTAAACGCCTTGAAAATCCCACCACCGAGCTGATTCATTAAAGCCTCAGCAAAAATACTTCTGGCAGAGTTGCCTGTGCACAGGAAAAGGACATGGTACGTTTTATCCATCATTTCAGCCTCTGAGTAAAAATCGCATAAGCTCTGTCACCAGGCAGCAATAAATAGAGCTGCCTGGTGAAATGGAACTACTTGATTACACGCTGCCCATGTGCGTCGATCACCTTTTCACCGTCTTCTTTAGTAAATTCACCTCTCTGTGCATCAGGCAAAATATCCAGTACCACTTCCGATGGGCGGCACAGTCGGGTTCCCAGTGGTGATACAACAATGGGTCGGTTAATCAGGATCGGATGCTGCAACATAAAGTCGATTAACTGATCATCAGTAAATTTATCTTCGGCGAGTCCCAATTCTTCATAAGGCTCAACGTTTTTACGAAGCAGGGCTCGTACTGAAATACCCATATCTGCAATGAGTTTGACCAGCTCATCGCGTGAAGGTGGATTCTCAAGATAGAGAATTACGGTCGGCTCAGTACCGCTGTTTCGGATCATCTCCAGCGTATTACGCGATGTGCCGCAGGCCGGGTTGTGATAAATGGTGATGTTGCTCATATCAGTATCTCATTACAAAGTGAAAGAGAGACGTAGCGCCAGCGCGGCCAGCGTTACAAACAGCACAGGCAGAGTCATGACGATCCCGGTGCGGAAATAGTATCCCCAGGTGATGGTCATATTCTTCTGTGAAAGCACATGCAGCCAGAGTAGCGTTGCCAGGCTGCCGATAGGTGTTATTTTCGGTCCCAGATCGCAGCCAATCACGTTGGCATAAATCATCGCCTCTTTGATAACGCCTGTTGCGGTGCTGCCATCAATAGAGAGAGCGCCAACCAGCACGGTAGGCATGTTATTCATGATGGAAGACAGGAATGCAGTCAGGAAGCCAGTACCTAAGGTAGCAGCCCAAAGACCTTTATCAGCAAGTACGTTCAGCACGTTAGACAAGTATTCTGTCAGCCCTGCGTTGCGTAATCCGTAGACCACCAGATACATCCCCAGTGAGAAGATCACGATCTGCCAGGGCGCACCGCGCAGCACTTTACCGGTGTTAATCCCATGACCTTTTTTGGCCACCGCAAACAGGATTGCGGCTCCCACTGCCGCGATTGCACTCACAGGAATTCCGAGTGGCTCGAGTACAAAAAAACCGACTAACAAAAAGGACTAAAACAATCCAGCCCGTTTTGAATGTTGCTGGATCTTTGATGGCTTTTGCGGGTTCTTTCAGTCGCGCCAGCTCATAAGTGGGCGGGATATCTTTGCGGAAAAAAAGATGCAGCATCACCAGCGTGGCGACAATGGCGGCGATATCCACTGGCACCATCACCGACGCATATTCTGTGAATCCCAGACGGAAGAAATCAGCCGAAACGATATTGACCAGGTTCGATACGATAAGTGGCAAGCTGGCGGTATCGGCAATAAACCCTGCGGCCATTACAAACGCCAGCGTGGTGCTTTTACTGAACCCCAGCGCTAGTAACATGGCGATAACGATCGGCGTCAGAATTAACGCCGCGCCATCGTTGGCAAACAATGCCGCCACCGCCGCGCCGAGCAGAACAATATAGGTAAATAGCAGACGTCCACGCCCATTGCCCCAGCGGGAAACGTGCAGGGCAGCCCATTCAAAAAAGCCGGACTCATCCAGCAACAGACTGATGATAATGACGGCAATAAAGGTGGCCGTGGCGTTCCAGACGATATTCCACACCACCGGAATATCGGCGATATGAATCACACCCGATGCCAGAGCCAGTACGGCACCCAGCGTGGCGCTCCAGCCAATGCCTAATCCCTTTGGCTGCCAGATAACCAACACGATGGTCAGGATAAAAATGGCTCCTGCCAGTAACATATAACCTCCCGAAAGGGCAGCAAAGCTGCCCCGAATAATGACAATAATTAACCAGCGAGTTGTTTGAGTTTGTCGATGCCGGTCGGTTCTGACGCCAGTACAGGAACAAGAGCTACGCGACTGGCATGCTGGAATTTAACGCTCTCGATCTGCGGAAGTTCCTGTTGTGCCCGCAGACGAAGCAGCGGTGAACGGGTATCGGCAATGGAAAGGCTGTTATTGATAATCCAGCCCCAGGGATGAATCCCTGCGCGTTCAAGGTCGGCCTGCAAATTTGCCGCCTCAAGCACAGGCGTGGTTTCCGGCAGCGTGACCAGTAACACTTTGGTTCGCTCCGGGTCCTGAAGCTGCATCATTGGCGTGGTGAAATGGCCTTTATCGCCCATTTTCTTAGCAATTTCGCGGTGATACGCCCCGGTGGCATCCAGCAGTAGTAGCGTATGCCCGGTGGGTGCCGTATCCATGACCACGAAGCGCTTACCCGCTTCACGAATCACCCGTGAAAAGGCCTGGAAGACCGCAATCTCTTCGGTGCAAGGAGAGCGTAAATCTTCTTCCAGCAGGCGTTTTCCTGCTTCGTCCAGTTCGCCTCCCTTAGTGTCAAGAACATGCTGACGGTAGCGTTCCGTTTCCTCGTGAGGATCGATCCTGCTGACCTGCAGATTCTTAAGGCTGCCATTGAGGGTTGTGCTGAGATGCGCCGCAGGATCAGACGTTGTCAGATGGACATCAAATCCCATTTCGGCCAGTCTGACGGCAATGGCAGCAGCCATCGTGGTTTTCCCCACGCCACCTTTGCCCATCAGCATGATCAAACCATGTTCATTACGGGCAATGTCATCAACTAGCGCAGAGAGCGATGGAATATCAGGACGTTGCTGAATGTAGTCTGCAGAAGATAATACTGCTTCTGGCTGAGTGGAGAGAAGCCCGCTGAGTGCAGATACACCGACCATATTAACCGGTTGAAGGAATAGCGTGTCAGTTGGAAGGCCAGAAAGATCAGAGGGAAGATGAGCTAGCGCTTCCTGTTCACGTTCCCATATTGCTGCAGCCAGTGTATCGGTAGCGGCTTCAGTTTTTGGAAGAACGCCATTAATGACCAGATACTGATTTTTAAGACCGATGGCGGCAAGTTCCAGATGAGTTCTGGCGACTTCCAGTAGCGTTGATTTCTGCAACCGCGCGACTAAAACCAGTCGGGTGCGTGTTGGGTCAGACAACGCTTCAACGGCATGAGCATATTGTTCACGCTGTTTTTTCCAGTCCCGCCATTGGGCCCAGACAAGATGCGCCATCTGGATTACTCTCTATAAAACTGCTCCATGCACCAGGCAACTGGAGAAGGCGGATAGTATGGCCAGTAGGTGCTGTATCAAAAATGATATGATCAAAGCGGGTCAGCAGAGAGGCGTCAGTCAGCAATCCAGTAAACTCATCAAATGCAGCAATCTCGGTTGTGCATGCACCCGACAGTTGTTCGTTGATGCTGCTAACGACGTCATAAGGCAGGACGCCTTTAATTGGGTCAACGATTCTGGCCCGGTACTGCTGTGCAGCTGCCTGAGGATCAATCTCAAGAGCCGATAATCCCGGAACAGAGGCTATTGGTTGAATGGTGTTGCCTATCGTCTGGCCAAACACCTGGCCAACGTTTGAGGCCGGGTCGGTACTGACCAGCAATACTCGTTTACCCTGTTCTGCAAGACGGATCGCTGTGGCGCAGGAAATAGAGGTTTTACCTACACCACCTTTACCAGTAAAAAACAGATAAGGGGGTATATTTTCTAAAAATTTCATTTGTCCTCCTGGCATACTTAACAACAAGAAGTATTACCACCGCAGCAGCCTGAAGGCGCTAATCCTACTTTCTCCAGCGGAATCCCAAACCACCGTGCAAGTTCAGCCCGTTTCGGATAACGTCCGGCCATCACTGTTTCGCCATCCAGTAACAGTAGTGGCAGTCCTTCCGCACCGGAAGCCTCAATGAATGCTTTAACTTTCTCGTTCTGGACAAAGCTCATCGGTTGCTGTGCCAGATTGTAACGATCTACCTGAACACCTCGTTGTTTAAGCCATTGTACATCCGCAGAAAAATCAACCAGAGCCTGATCGACATCAGTACCGCAGACACCGGTGCTACAACACATCGCTGGGTCGAATACCGTTAGCGTTTTCATCCTTAACACCTCACATGTGGAAAATCATATATATTCAGGCAAATTTTTAAATACAAATAGCCTTACTGCTGTTAGTGCAGTTGACTGAAGCCAGTTTACGGGCGATGGCCTGAAGATCGTCCTGTTGGCTTAACCTGGCCTGCTCAATAATCTGCGCAGCCCATGAAGGAATGTGTGGGGATAAGCGGTAATGAACCCATTTTCCGTGCTTACTATCAAGCAAAATCCCGCTTTCCCGCAACATTGCCAGATGCCGGGAGATCTTAGGCTGTGATTGATCAAGTGCAGTGCAAAGATCACAGACGCACAACTCCCCCATGGTTCTGAGTATGAGAACAATACCCAGACGTGTTTCATCAGACAGATTCTTAAAGAGTTGTAGGGAAGTAAGCGACATCATTCCTCCTGTTCGTTTGGGTAACGATATCTCCAGCGGAGATGAAAATCAACAACATATACGCATAAGCGAATGTATTTGATAAATGTCAAAACAATGACAAGTTCGTACACCCATACTACCTCTGTTTTCTTTTGTTATGTCTTTTTATTCATATTTATGTAAACAATAAATTGAAGGGTAACTATGAATACCGATGATGACGACTATCCAAAAGATTCCCAAAAACGGAAACTTATATCTACATTAGCCATTGCGACTGGAGCTGCGCTTGTTATAGCTCCTACTCAGGCAGCCCCCAGACTTACTGAAAAAAAACAGCACTGGTGTATGGTCATAGATCTGCGTAAATGCGTGGGTTGTCAGGCATGTACTGTCAGTTGCAAAATCGAGAATCACGCACCGCCGGGGCAATTCAGAACCTGGGTCGCCGATATTGAAGTTGGTCAATATCCAAAAACCAAAAGACAGTTTCTTCCGCGTCTATGTAACCATTGTGAAAACCCTAGCTGTGTCCCTGTTTGTCCAACAGGTGCTACGTTTAAACGCAATGATGGCATTGTTCTTATCAATCAGGATATTTGTTGGGGCTGTGGTGCCTGTGTTACTGCCTGCCCTTATGATGCTCGTTTTATTAATACTGAAACCAAAACAGCTGATAAATGTACCTTTTGCGCGCATCGCATAGAACAGGGGCTCGTCCCAGCCTGTGTCGAAACTTGTGTCGGTGCAGCAAGGGTTTTTGGTGATCTGAATGACAAAGAAAGTGAAGTGCATAAATTATATTCAGAGCATATAACAAATGTTTTGAATCCAGCAACGGGTAACAAGCCTCAGGTATTTTATATTGGGTTAAACAATGAAATGACTGTAGGAGAAGAAATAAAAAGTAAGACTTGGACGAAAGAGTTTTCTGATGTTTTCGATCAGGAACTCCCCTGGGTAAATCAGGAGTAA
Protein sequences of DBSCAN-SWA_8 >LR134204|2978903:2986788|2985480_2985831_-|VEB92793.1|DBSCAN-SWA MSLTSLQLFKNLSDETRLGIVLILRTMGELCVCDLCTALDQSQPKISRHLAMLRESGILLDSKHGKWVHYRLSPHIPSWAAQIIEQARLSQQDDLQAIARKLASVNCTNSSKAICI >LR134204|2978903:2986788|2984502_2985054_-|VEB92791.1|DBSCAN-SWA MKFLENIPPYLFFTGKGGVGKTSISCATAIRLAEQGKRVLLVSTDPASNVGQVFGQTIGNTIQPIASVPGLSALEIDPQAAAQQYRARIVDPIKGVLPYDVVSSINEQLSGACTTEIAAFDEFTGLLTDASLLTRFDHIIFDTAPTGHTIRLLQLPGAWSSFIESNPDGASCLGPMAGLEKTA >LR134204|2978903:2986788|2982444_2983254_-|VEB92789.1|DBSCAN-SWA MLLAGAIFILTIVLVIWQPKGLGIGWSATLGAVLALASGVIHIADIPVVWNIVWNATATFIAVIIISLLLDESGFFEWAALHVSRWGNGRGRLLFTYIVLLGAAVAALFANDGAALILTPIVIAMLLALGFSKSTTLAFVMAAGFIADTASLPLIVSNLVNIVSADFFRLGFTEYASVMVPVDIAAIVATLVMLHLFFRKDIPPTYELARLKEPAKAIKDPATFKTGWIVLVLFVSRFFCTRATRNSCECNRGSGSRNPVCGGQKRSWD >LR134204|2978903:2986788|2985071_2985434_-|VEB92792.1|DBSCAN-SWA MKTLTVFDPAMCCSTGVCGTDVDQALVDFSADVQWLKQRGVQVDRYNLAQQPMSFVQNEKVKAFIEASGAEGLPLLLLDGETVMAGRYPKRAELARWFGIPLEKVGLAPSGCCGGNTSCC >LR134204|2978903:2986788|2981522_2981951_-|VEB92787.1|DBSCAN-SWA MSNITIYHNPACGTSRNTLEMIRNSGTEPTVILYLENPPSRDELVKLIADMGISVRALLRKNVEPYEELGLAEDKFTDDQLIDFMLQHPILINRPIVVSPLGTRLCRPSEVVLDILPDAQRGEFTKEDGEKVIDAHGQRVIK >LR134204|2978903:2986788|2979699_2980923_+|VEB92785.1|DBSCAN-SWA MNTPPLLLSSKITWCVGLAQLINWGITYYLLGAFGSAIANDTSWGQPLIFSGLTLAMGIMGLISPISGRLLASMGGRKVLQLGALLNGLGCLLLATSHSLYIYLMAWLVMGIGMRLSLYDAAFAVLVNLAGPTARKSITQVTLLGGLASVAFWPIGEGLLSLLGWRWGVGCYSLFALISILLCISLPDNKRERGVAPRESTEQGIQHLERDYLKCGLYAALMALLSFLSTGISTHLPLILASAGVPVLVGSLWGVGQVSARLGDIAMGSRGSALGLNLIVGILLPCCFMISLLGQTSIVGTGLFVLGYGAVNGLSTLVRATLPLTLFDPRLYASRMGTLLMPSFFLSALAPSLYASFRERFGDTGMLIISLVFACVAAIIAIWIYLIGKDKQALQRVPVDITSNVNE >LR134204|2978903:2986788|2981963_2982494_-|VEB92788.1|DBSCAN-SWA MGAAILFAVAKKGHGINTGKVLRGAPWQIVIFSLGMYLVVYGLRNAGLTEYLSNVLNVLADKGLWAATLGTGFLTAFLSSIMNNMPTVLVGALSIDGSTATGVIKEAMIYANVIGCDLGPKITPIGSLATLLWLHVLSQKNMTITWGYYFRTGIVMTLPVLFVTLAALALRLSFTL >LR134204|2978903:2986788|2981046_2981445_-|VEB92786.1|DBSCAN-SWA MDKTYHVLFLCTGNSARSIFAEALMNQLGGGIFKAFSAGSHPASSPHPLTLKVLKSQGFDISSLSSKSLLQYQQVDSPKMDFIFTLCDKMAGEGCPVFPGQPLTAHWGFKDPALAEAIRKQCTRHLWTSRSK >LR134204|2978903:2986788|2978903_2979419_+|VEB92784.1|DBSCAN-SWA MKIVNAEEKHIPAICDIYAHHVIHGTASFETEPPDTHEMLARLKKIRNQALPWVVALEEEKVIGYCYLTRYRERYAYRHTLEDSIYIHPDAQRQGTGKALLRHVIAWAETHGYRQMIAIIGDSNNEGSLKVHQQVGFTEKGTLKDIGFKHGRWLDTVLLQRNLGEGNSSLP >LR134204|2978903:2986788|2986017_2986788_+|VEB92794.1|DBSCAN-SWA MNTDDDDYPKDSQKRKLISTLAIATGAALVIAPTQAAPRLTEKKQHWCMVIDLRKCVGCQACTVSCKIENHAPPGQFRTWVADIEVGQYPKTKRQFLPRLCNHCENPSCVPVCPTGATFKRNDGIVLINQDICWGCGACVTACPYDARFINTETKTADKCTFCAHRIEQGLVPACVETCVGAARVFGDLNDKESEVHKLYSEHITNVLNPATGNKPQVFYIGLNNEMTVGEEIKSKTWTKEFSDVFDQELPWVNQE >LR134204|2978903:2986788|2983301_2984552_-|VEB92790.1|DBSCAN-SWA MAHLVWAQWRDWKKQREQYAHAVEALSDPTRTRLVLVARLQKSTLLEVARTHLELAAIGLKNQYLVINGVLPKTEAATDTLAAAIWEREQEALAHLPSDLSGLPTDTLFLQPVNMVGVSALSGLLSTQPEAVLSSADYIQQRPDIPSLSALVDDIARNEHGLIMLMGKGGVGKTTMAAAIAVRLAEMGFDVHLTTSDPAAHLSTTLNGSLKNLQVSRIDPHEETERYRQHVLDTKGGELDEAGKRLLEEDLRSPCTEEIAVFQAFSRVIREAGKRFVVMDTAPTGHTLLLLDATGAYHREIAKKMGDKGHFTTPMMQLQDPERTKVLLVTLPETTPVLEAANLQADLERAGIHPWGWIINNSLSIADTRSPLLRLRAQQELPQIESVKFQHASRVALVPVLASEPTGIDKLKQLAG |
11 | uncultured_Caudovirales_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
3542904 : 3554998
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >LR134204|3542904:3554998|DBSCAN-SWA GATGCACAATATTTCTGTTTCTGCGCCAGCTCCGGCTGCGCCATTATTTCCTGTACCGACGAACCAGCATGAACGATTTTTACGCCTGCCTGAGGTGATGCACTTGTGCGGATTGTCTCGCTCGACCATTTACGACCTGATCAGCCGCGATGCGTTTCCGAAGCAGATCCCGCTTGGCGGTAAAAATGTCGCTTGGGCACAGTCTGAGGTCAGCGCATGGATGGCGGACCGTATCAGCGCCCGTGGGCGGGGATGTGATGCATGATGATACCTGAATACAATAAATACCTTTTTTCTGGCTTGCTTACTGTCGTCATTTCCAGGTATAGTTTTTCCGCTGTCGCAAAATCGGCAGCCGGAATTGGCGTTCCGCGAAACTCAATGGCGACACCAGACGCGCCAGGCGTCTTTTTTTTCGTCGTAGCTCAGGCACACCCATTTTCCGGGCTGTGGTGTTTATGCTTACACCGTGGTTCTTTCGAGATAATGGCAGTCCGGGCGGGGCAGCCTTCGGGCTGGCCGGTATCCATTGAGGCCGGTTACGCCAACCCCGTTCGGGCTGCCACCAGTGAAATTGGCGTTTCCGGTGGTAGCAATAACCGCTACTCAATGGAGGCTGCCATCATGGCTACAATCCTCACCCCGTCATACCCGCAATATGTCTTTGTGTTTGCCGCAATCCGCCGTGCTGACACCCAACCCCGCATTTGTATGCTTCGTACTGTAGCCTGCGATGAGCGTTCCGCGCGTCTTTCGCTGGTCCGTGATTATGTGCTCTCCCTTTCTGCACGCCTGCCTGCCGGGGAGGTGACGCTATGAACCAGTTTGAGATTTCTTACGACGATGTTGTAAGGCTGAAACATTTACGCAATGTGGGTGAGTATGTAACTGGCATGGCAGCCCTGCAGGACTGTTATGAAAAGCCAGCAGGTGCCCAGTGCGAACAACTGGTTTCCCTCATTTATCTGATGACAGAGCAACTGGATGGAGTGGTGCAAACGCTGCCATGACGACCTGATGAATGCGGAGGTGGCCTGATGAAATGCTGTGAATCTCATCTGGCACTTCGTGCCGCTCTGTATCGCCGTGCCGTTGCCTGTGCCTGGCTGGCTCTCAGCAATCATCAGGAACGTTATTCCGGCCTGACGCTCGCTGAACTTGAAGATGCCATAGCCCGCGAGCTGGAAGGGTTCTATCTGCGCCAGCATGGGCAGCAAAGAGGGCTGGAAATTGCCTGCGCGTTGCTTTCGGATCTGATGGAGTCGGGGCCGCTTAAAGCCTGTCCGGTTCTCTCACTGCTCGGAATGACGGTCATGGATGAACTTTGTTCCCGTCACCTCAACAAACCAGCACTGCACTAAGGAGGGCCGCACAATGTCAGGAATGAAAGTTAGCCAGGCTGAGAAAGCAGCTCGTGGTCACTGGTCAAGAATTTTACCCGCGCTGGGGCGTAAAAGTACTGAAAAAATCGGCACCAGCCCTGCCCGGTCTGTGCCGGGAAAGACCGCTTTCGATTTGATGACCTGGAAGGGCGGGGAACGTGGTTCTGTAACCAGTGCGGGGCAGGTGATGGCCTGGCGCTAGTAAGTAAAGTACTGGATGTAGGCATTAGTGAAGCGGCAGACAGAATAAACGGTATTACCGGAAACCTGCCGCCAGTATCTCAGGAAATGCATGAATCTGGTTCTCCTGAAAAAGAGGATGGGAAAAAAGCTGCAGCAGTGCTGGCTGCCCGTTTGTTTGATAAGTCCCGTCAGACCACTGGCAATGCCTATCTGACGAGTAAAGGGTTTCCTGCACTGCCTTGCCGGGAATTAACCGCCATGCATAAAGTCGGTGGTGTGGCATTTCGCGCGGGAGATCTTATCGTTCCATTGTATGCAGATGGAGAGCTGGTAAATCTGCAGTTAATCAACGCTGATGGGGGCAAATGCTTCCTTAAAGGCGGTCAGGTTAAGAATGCCTTTTACCTGATTGAAGGTACTGCCAAAGCAGCCAAACGGCTCTGGATAGCGGAAGGATATGCCACCGCACTCACTATCAACCATCTGACTGGCGATGCTGTCATGGTGGCCTTTTCATCCGTCAATTTCCTTTCCCTGGCGAGCATTGCCTGCAGTCAGTACCCAACGCACCAGATAATTATTGCTGCTGACCGCGATCTCAACGGTGCGGGGCAAACAAGGGGTGAAGCTGCTGCCATGGCCTGCAATTGCACAATGGCGCTCCCACCTGTGTTTGGTGACTGGAACGATGCATTCACGCAGAACGGCGAAGAAGCCACCCGGCTGGCAATTTATGAAGTAATAAAACCAGCTGTTGCCAGCCCCTTCGACACAATGAGCGAAGCTGAATTTACCGCTCTGAGTGTCAGCGAAAAAGCGCAGAGGGTAGTGGATCACTATAAAAATTCACTGGCAGTAGACCCGAACGGGCAGCTCCTTTCACGTTATGAGGCGGGGGCCTGGAAAGTTATCTATTACGCCGATTTTGCCCGTGATGTCGCTGCGCTGTTTCAGCGCCTCGACGCACCTTTTTCATCCGCGAAAATTGCGTCTCTCGTGGAAACCCTCAAAACTGATCGTTCCGCAACAGCAGAATCCGGCGCGGCAACTTATCGGATTTCGCAACGGTGTGCTCGATACCCGGACAGGATTGTTCAGCCCGCACGATAAGAAGCACTGGTTACGTACGCTGTGCGAGGTGGATTACACGCAGCCCGTTGACGGCGAGTCACTGGAAACCTATGCCCCGGCATTCTGGCGCTGGCTGGATCGTGCCGCAGGTTTTAATCCTGAAAAGCGGGACATTATTCTGGCTGCATTGTTTATGGTGCTGGCTAACCGTTATGACTGGCAGCTGTTTCTGGAGGTCACTGGCCCTGGCGGAAGTGGAAAGAGTATTCTTGCTGAAATAGCAACCATGCTGGCGGGTGAAGATAACGCTACCTCGGCAACCATTGAAATGCTTGAGTCGCCAAGAGAACGAGCTGCGTTAATAGGTTTTTCACTGATTCGACTTCCCGACCAGGAAAAGTGGAGCGGTGACGGGGCCGGACTAAAAGCCATCACTGGCGGCGATGCGGTATCCGTTGATCCCAAATATCAGAACGCCTATTCAACCCACATCCCGGCGGTTATCCTGGCTGTGAACAATAATCCGATGCGTTTCACTGATCGTAGTGGTGGAGTTTCACGCCGAAGGGTGATCCTGCATTTCCCTGACCAGATAGCCCCGGAGGAACGTGATACCCAGCTCAAAGAGAAGATTGCCAGCGAGCTGGCAGTGATTGTTCGCCAGCTTATGCAGCGTTTCAGTGACCCTATGAGTGCCAGGACATTACTTCAGTCGCAGCAGAATTCCGATGAAGCACTCACTATCAAACGTGATGCTGATTCAGCGTTTGATTTTTGCGGCTACCTTAAAGTCCTGCCTGACACCACGGCATGTTTATGGGTAATGCTAACATTGTCCCACGTCAGCCCCGTACATACCTTTACCATGCCTATCTGGTCTATATGGAGGCTAACGGCTATAAAAACACGCTCAGTCTGACCATGTTTGGCAAGGGCTGCCGTTAATGCTGAAAGAGTATGGGCTGCAGTACGAGAAGCGGCGGACCAATCAGGGAATGCAGACCAATCTGGCTCTGAGAGAGGAAAGCAATTCTGACTGGCTACCTAAGTGCGATGATCCTGTAGCAACTTAACGATTCATCGACCCGGCGAATGCCGGGTTTTTTTATGCCTCAAGGCTAAATGTAGAGTTTGGTGTTCACTCTACACTATCTTGTTAACTTCTATTTTATTGATATTAATAGTAATAACCTAGTGGTGAACAGTATGAATGGTTTGCTCAAGAAAAAGTATTTTGTTGGATATTGAGATGAGTAATTTTTGAACGCCTTTGCTAGTGTGTATAGCTATGTGTATAGGATTGTTTTTCGATAAAAATAAATCATTTAAAAACAATATCTTGAGTTATATTTTTCATTCCTATTATCGCTTCAAACATCTCCTAACGTCTACTAAAGTTCACTCAAAGCCCTTATACTCTGCATTGTGTCGCTCCTGATAGTATTTTTACGTCTACTGACATACCCCAAAATCTACATGCTTCTGGGGGTACATTCGGGGGTATGTGCTGTTCGGTCTTGTGGAGATACCCCCAATGAAGCTCAACGCCCGCCAGGTTGATACCGCTAAACCAAAAGATAAACCCTATAAACTTGCTGATGGTGGTGGGCTTTATCTTCTGCGCACGATACTGGCGGCTTAAGTATCGTGTAGCAGGCAAAGAAAAGCTGCTGGCTCTGGGAGTATATCCAGACGTTACCCTTGCTGATGCGCGAGCTAAACGTGATGAGGCAAAAAGGGGTATCGCTGGGGGTATCGATCCCAACGAAGCGAAACGGGAAGAAAAGATAGCCCGAGAGGCAAATGTCAGGAATACGTATCAGGAAATTGCCTGTGAATGGCACTCCAGCAAACTATACAAATGGTCTGAGGGGTACGCCTCAGACATTATGGAAGCATTCAATAAAGATGTTTTTCCGTACATTGGTAAAAAACCAATCGCGGAAATCAAGCCGCTCGAATTGCTTAATGTGCTTCGCCGAATGGAGGGGCGAGGGGCAACGGAGAAGGCTAAGAAGGTTAGGCAGCGCTGCGGTGAAGTCTTCCGCTATGCCATTGTCACCGGCCGGGCTGAATACAACCCAGCGCCAGATCTCACCAGCGCTATGCAGGGGCATGAATCCAAGCACTATCCTTTCCTGAATACGTCTGAGCTACCGACGTTCTTTGAGGCTCTTTCTGGGTATTCCGGCAGTATGCTGGTGGTATTGGCAGCACGTTTGCTAATCATTACCGGTTTAAGAACTGGCGAACTGCGTGGGGCTATGTGGCAGGAAATCGATGCTGATGCTGCGGTGTGGGAAATCCCAGCAGAGCGAATGAAGATGCGCCGCCCACATATAGTGCCTTTGTCATCACAGGCCCAAGCCATCATCGCGCGTATCCGGGAAATGACAGGCCGCTATCCTCACATGTTTCCCGGGCGTAATGATCCGCGTAAAACCATGAGCGAGGCTAGCGTTAACCAGGTTTTCAAACGCATTGGATATGCAGGGAAAGTGACAGGTCACGGCTTCCGCCATACGATGAGCACCATTCTTCACGAGCAGGGCTACAGCACCGCATGGATTGAAACACAGCTCGCCCATGTTGATAAAAACTCTATTCGTGGGACATATAACCATGCTCAGTATCTGGATGGTCGCCGGGAAATGCTTCAATGGTACGCCGACTATATGGATGCGCTGGAACACCGTGAAAATGTGATTCATGGGTGTTTTGGGCAATCCTCATGACTGGATGAATAGACAGTGATTCTACACCTGGGTAGACTTCTGTAGACGAATAAAGGATAGGCTATGTCTAGGTTGATCCCCGAAAACCCGTACACCTCTGCGGGCTGGCATAGCCGCCAAAATCAGGGGGCGCGAGGTGGCGATATATGGCATTACCAAAGAGGGAGTATTATACACTTCAGCAAGCAGCTAAAAAATCAGGTTGTGAAGTGGAAGACCTTCTGCACTATGCAGCAATTGGCGTATTACAACTCTGCGTACATTATGAGGAGAGCAAAAAATCAGATAGCGTTTGTTATTTTTATGCCTCTCTATCTGATGGTTTATTGGATGAGTTAAACGACAATCCAGAAGGTTTTACTATGCATTATTCCTCTAAATATAATCTTATATCTATGGACTCTAATGCTTATTTCTTCACTGCCGAGGATGATCAGCCATGCTGGGCCGATAATGTTAAGGGATGGTTTGCTATCCCACATACCGAGCTAACATTACCTGCATTCGAGAATTCTAAGAAAGCTGAGGTTTTTCAACTTATACATCCGCGAAACAATTTGAATAAAAATACCGAGGGATGGGGGTTTTCAACGAAAGGATTTGATGTGAGTGGACCTTGTTTTTATGAGGCGAGAGCTTTCTCATCAGACGATTTTGTTATAATGGCGGATGAGTTGGATATTCTAATTAATGGTGGGATGAAGATTGATCTCTTTGGCCTAGCTGATGAATCAGCTCGTATTAAAAATGTCATAACGGAAAATGTTGGAAATAAAACATTAACATCGATGGCTAAACTAATTAAGTCATTGCTTTATCTTTGTTATAAAGATGAGGATGTTATAAATAATCCCCGAAAACACTTTGATAACAGTCAAAGTGAAATCAATAAGGATTTTGATACGCTCGGGCTTAAACTTCCATCAGGAAAAACCATTGACAAATGGCTCCGAGGTGTCGATCTCGATAAGAAATGAAAAGTGGAATATTCCAAGAGTGTGTTGGAATATTCCAAACTCGTGATATCTACAAAGGTAACCTTCACCCGTTGTCTATCAGCGTCTAACAAGGGCGCATTAGTTAAACCGGGGGTTACATGTCACAACCGTCTCTAATCCGTTTTCATGAAGTACAAAAGCGTACTGGCTATAGTAAGGCGTGGTTATACCGTCTTATGAGCGAGAAACGTTTTCCCGCAGCAATTAAAATTGGTTCCCGCTCTATCGCTTTTATTGAAAGTGAAATTGACGAGTGGATTAATCAGCGCATCGCTGAGTCGCGTGGAGAGGTGGCGTAATGGAGAAGAAAAACCGCCCCGTACAGCAGGCGGCTAACTCAGATATTTGCACGTCTGATATTACGCCGACCACCAGCCCTGTACAAGTACCTAAGCGTACCCCAAAGAAACACCGCGCCCGCGTCTATATGCTGCGCACTGGTGTAGAGGGATGGACAGAAAATGATATTCTGCGTTATTGCCGTCTTTCATCCGGGCGTAATTATGCGTCGGAGTTGGAGCGCCAGCTTGATATTCAACTGGATCGTATCGACGAGAAAAACCCGGATGGTATCGGCGCACACCTGCGCTACCGATTTTCCTGCTGTGGCGACGTTCTCAAAGTTATTCAGCTTGTGAACCATAACGCCGTTATTGGCCTTTATAGCGGCCTTTCACAGCAGGATATTGCCGACATTCTAAACCTCTACCCGGACGCGTTTAACGCCGCATAACGGAGCCGAAAACATGACTATCGAAAAAAGCCGATTCAATTCGGAGGCCGCCCCACAATCCAACGCTTGCCAAGGCGAACATAGCAATAATGACTTTGCCGCGATTGTTCCCGTTATTTCCGGTCAAATTGGCGGGCGTGAGGCCAGTATTGTTAGCGCTAAGGCACTGCATAAAGCGCTTGGAGTAGGTAACGACTTTTCAACCTGGATCAAACTACGCATTGATGAATATGGCTTTAGCCTGAGTGCTGATTACGTGGTTTTTGATTCCTCAGATTTCAGGAATCAAAGTTCAAATTTTGAACAGGGTAGCCCCGGATGGGTGACAAAGCGTGGCGGCGACCGTCGCAGTAAAGATTATGGCCTTTCTCTGGGAATGGCTAAAGAACTGGCAATGGTTGAGCGAAACGAACAAGGTCGCGCCGTTCGCCGTTACTTTATTCAGTGTGAAGAAGCCCTACAGCGCAGCGTGCCAGAGATTGCCGCTCAGTATCGCCGCCAGCTCAAAGCCCGTATCGGTGCCGCCAGCCTGTTTAAACCGATGTGCGTAGCACTGGAAAGCGCCCGCGCAGAACAGGGTAAACAGACGCAGGCTCGCCACTACAGCAACGAGAGCAACATGATCGCCCGCATTGTTTTGGGTGGCATGACGGCGAAGCAGTGGGCGCAGGCGAATAGCATTACTGGCGAACCACGCGACAGCATGAACGCCGGGCAACTGGAACACCTCACCTACCTGGAGAGTACCAACATTACGCTGATTGATATGGGTATGGGATATGACCAGCGTAAAGCCGAGCTAATCCGCCTTTCTCAGCGCTGGTTAGTTAAACATCTGGGGGCTAATCATGCTTAATGCCGCCGTCCAGAATAAAGTCTTTCCCTTGGCTGGCCTGATGTGTACTTTGAGCGGGATCGCGCACGTGCATGAGGGTAAAGCTGGCATAACAGTGCCAGCTAATGCCAGCTCCCCGTTTTCTCGCCGTGGTGACGAGGAGAGTTTCGAGACCAGTACGCGCACGCGTGAAGGTGGGTTTTCGGAGTCCATAAAAAAGAGATTCCATACTGAATTTCTTCAGTCCATAAAAAAGGACTTGCCGTCAATCGCAAGCCCGGTTTATGGTTATAGTGCACCAGCAAAATCTGGTGCCGGGATTGGCGTCCTGGTAATTCGAATGGCGACATACGACGCGCCTAGCGTCTTTTTTTTGTGTCGTTAACTCAGTACACCTCTTTTTCAGCGTTGCGGGTATAATCCGTGCCGCTCACAAAATTATGGTGGGCTGCGTGGGGGCTTCTTCGGAAGCGCCGGTGGCCATTCGAGCCGGTTACGCCAACCCTGCGCAGCTCACCACCAGCAAGATTGGCGTCTTCGGTGGTGGGGTTATCCCAAATCGAATGGAGGCTGCCGATATGCTGGCTACTACCCCTACCCAAAATCCGCAATTCATCTGGTTAATCGCTGCCGTTCGTCGCGATATGCCGTCAATTACCGCAAAAATCCATCATGTTGCTGCCGAAAATGAATGCGAAGCTCGCCGTACCTTAGCGCGGGATCACGTCTGCTTCTTTGCTGGCCGCATCCGCCTGGAGGTGAGCCATGCATGAAGCCACTGTTGAAGTGATCACCCATGCCGGGCAAGCGCTCGACTACAGCAATCAGGCGCTTGCTGTTCTGGATATGTGGAGGGATGTACTGACGTCCGATGAGGAGATGGAATGCCGCTGTGTATCAGCAGTTTACAGCCTTGTGCGTGAAGCCATCTCTTATCTGGAGAAAGCACAGGAGGTGACAGCATGAGCCAGTTACAACTGATTGATGCTGCCTGCCAGATTGAGCAGGCACAGGCTGTTTTATCTATGTGGTTAGAAAGTACGACTAAAGATACCGATCCAGACCTGCCGCGCCTTATCGGTTCAATTATCACACTGTTGCATGGTGTCCCGGAGGCAATGGAAGAAGCTGAAAGCAAGCTGGCTGACTATGTGATGCGTGAATATCGGGATGGTAAAGCATGCGCGACATTTACGATCTGGTGCGCCGTACTGATGGTGAAATAGTTTTCAGCTTCCCGCTGGTGGGTGCTATCAGGTATACACCACCAATGGTATTGCCTCTATACGTCAAATGCTTGATGACGAAATCACTGTGACACCAGCAACGCTGGTACAGCTCCTTATCCGTCTGGGGTATCGCATTACGCCGCCAGAGTCTGACAGAAAATCAGGGGGCGATAGTGCGTAATATCGACCTTATCCGCGAAGTTTACTCATGCTGCTGCCGGGCAGTGGCCTTCGGTGCTGGCAGGCCTGAGTATCGAGGTTGCCAACTCACCTCGTAGACATACTGCTTGCCCGCGTGCGGGGGAAAGGATCGCTTTCGTTTCGACGACACGGGCGCGGTAGCTTTATCTGCAATCAGTGCGGCGCTGGCGATGGTTTAGACCTGATCCGGAAGGTTAAAACGCTGCGATGCCGCAGAAGCTGCAAGAATGGTAGCTGATGTGCTGGGGTATTGATTGTCGGGCAGCACAAACCAGCCAGCAAGCAGCCAGCCAGAGACGGGTGCTGGAAACCGAACGCCAGCAGCGTGAGCTGGAGCGCCAGCAGAAAGCTACGGCAGATGCGGAGCATCGACGCAAAACTTTCATTGCTAAATACCAGCCGCTGCGGAGCAGGTCGAGCTGGGCGAAAGCGAATACCTTGTTGCTAAAGGGCTGGGCGGTTTCACCTTCCCGGTTCTCGCCGATGGCACGCTGTTATTACCGCTGGTGGATGAATCCGGAACAGTCCACTGTTGCGCAGACCATTACCCCAATGGGGGCAAAGCGACTTTTAACCGGCTCTGCAAAGAAAGGCGCATACCACGCAGTCAACACATCGGAAGCGCCGGAGATCATTATCATCGCCGAGGGTCTAGCTACCGTCCTATCTGCTCACATGATGCGGCCAGATGCGTTAGCAGTGGCCGCAATTGATGCCTGGAACCTCACCCCGGTTGTGCAGGTAATGCGAAGGAAATACCCGGACGCGAAAATAATCATCGCTGCTGATAACGACCAGCTCGATGATAAACCTAATACCGGAGCCGACACGGCTAACAAGGCGGCCATCTCCGTATCTGGCTGGGTATCGTTACCACCGACAGACTACAAGGCCGACTGGAACGACTACCACCAGCAAAAACGGGCTGGAAGTCGCCACACGCGCATTTAATGAATGTATGTATCAGCCGCAGGGGGAATGCGTGAAACCACAACTACAGGCCATTGAGGGCGGTAAAACTGACCAGCCAGAGAAAGACCCGCTAAAACCCCGCATTGAGAGCCGTAAGGATGGTGTTTTCTGGGTTACACCGAAGGTGGACAAAGAAAGCGGCGACATTATTAACAATGAAAGCTGGCTTGCCTCACCGATGGAGGTCATCGGCACCGGGCGAGATGATAAGGATCAGTATCTGATACTGCGCTGGCTAGCCTTTGGCGCAGACATACCGACAACAGCGGCTATCCCCCTGGCTGATATTGGCGAGCGCGAAGGATGGCGCACCTTGAAAGCGGGCGGGGTTAACGTCACCACCAAAAGCAGCTTGCGGGCGATTCTGGCTGACTGGCTACAGCGTAGCGGTTCGCGTGAATTGTGGCGTATTGCTCACGCTACTGGCTGGCAGTGCGGGGCATACATCATGCCGGACGGGGAGATTATCGGCGCACCAGCACAACCCGTTTTATTCAGTGGCCGCAGTTCTGCTGCTGCCGGGTACACCATGCAAGGCACTACTGAGAGTTGGCGTAACAGCGTTGCCCGGCTGGCATACGGTAACTATTCAATGATGACTGGCACCGCCGCCGCACTGTCAGCGCCGTTAATCGGGCTGACTGGCGCTGATGGATTCGGCATTCATTTCTATGAGCAATCGAGCGCGGGTAAGACCACCACGGCGAACGTTGCCAGCAGCCTTTACGGCAACCCGGACTTATTGCGCTTGACGTGGTACGGCACGGCGTTGGGGCTGGCGAACGAAGCCGCCGCACATAATGATGGGCTGATGCCACTGGATGAAGTCGGACAAGGGGCAGACCCCGTAAGCGTGTCGCAGTCTGCTTATGCGCTGTTTAACGGCGTGGGGAAATTACAAGGAGCAAAGGAGGGGGGCAACCGCGATTTAAAGCGCTGGCGAACCGTGGCGATCAGTACCGGGGAAATGGACTTAGAAACCTTCATCGCCACCGCCGGACGCAAGACAAAAGCCGGGCAACTGGTGCGCCTGCTGAATATCCCGCTGAGTAAGGCGGTGCGCTTCCACGACCAGCAGGACGGCAAACAGCACGCCGACGCGCTGAAAGATTCTTACCAGCACCATCACGGCGCTGCCGGGCGGGAGTGGATCAAGTGGTTGGCAGGCCACCAGCAGCAGGCCATTGATACCGTTCGTGAGTGTGAAGCCCGCTGGCGCAGTCTGATTCCTGCCGACTATGGCGAGCAGGTTCACCGCGTGGCCGCGCGTTTTGCCATTCTGGAGGCGGCGTTATTGCTGGGCGGCGTTGTCACCGGATGGGATGCTCAAACGTGCCGTGATGCCCTTCAGCACAGCTACAATGCATGGTTGCGTGAGTTCGGTACGGGGAACAAAGAGCACCAGCAGATTATTGAGCAGACGGAGGCGTTTTTGAATGCGCACGGCCTGAGTCGCTACGCGCCGCTGGGTTATGACCCCCGTGATTTACCCATCCGTGATTTAGCCGGGTACAGGAAGAAGGGGAACCACGACAGCGACCCGATAATTTTCTACACCTTCCCGGCGACCTTCGAGCAGGAGATAGCGCGGGGATTCAATGCCAAACAGTTTGCCGAAGTGCTTAAAGGCGTCGGAATGCTCACCCCGCCGACCAGCGGCAGAGGTTATCAGGGACGCGTGCGCGAGGATGGCAGGCAAATCCGTGTTTATGTGCTCAACTTCATGGCAGAGGAAAGCAGCCAGCCAGAGGAGTGA
Protein sequences of DBSCAN-SWA_9 >LR134204|3542904:3554998|3548420_3549251_+|VEB93460.1|DBSCAN-SWA MALPKREYYTLQQAAKKSGCEVEDLLHYAAIGVLQLCVHYEESKKSDSVCYFYASLSDGLLDELNDNPEGFTMHYSSKYNLISMDSNAYFFTAEDDQPCWADNVKGWFAIPHTELTLPAFENSKKAEVFQLIHPRNNLNKNTEGWGFSTKGFDVSGPCFYEARAFSSDDFVIMADELDILINGGMKIDLFGLADESARIKNVITENVGNKTLTSMAKLIKSLLYLCYKDEDVINNPRKHFDNSQSEINKDFDTLGLKLPSGKTIDKWLRGVDLDKK >LR134204|3542904:3554998|3551605_3551806_+|VEB93464.1|DBSCAN-SWA MHEATVEVITHAGQALDYSNQALAVLDMWRDVLTSDEEMECRCVSAVYSLVREAISYLEKAQEVTA >LR134204|3542904:3554998|3544277_3544487_+|VEB93455.1|DBSCAN-SWA MSGMKVSQAEKAARGHWSRILPALGRKSTEKIGTSPARSVPGKTAFDLMTWKGGERGSVTSAGQVMAWR >LR134204|3542904:3554998|3553228_3554998_+|VEB93467.1|DBSCAN-SWA MYQPQGECVKPQLQAIEGGKTDQPEKDPLKPRIESRKDGVFWVTPKVDKESGDIINNESWLASPMEVIGTGRDDKDQYLILRWLAFGADIPTTAAIPLADIGEREGWRTLKAGGVNVTTKSSLRAILADWLQRSGSRELWRIAHATGWQCGAYIMPDGEIIGAPAQPVLFSGRSSAAAGYTMQGTTESWRNSVARLAYGNYSMMTGTAAALSAPLIGLTGADGFGIHFYEQSSAGKTTTANVASSLYGNPDLLRLTWYGTALGLANEAAAHNDGLMPLDEVGQGADPVSVSQSAYALFNGVGKLQGAKEGGNRDLKRWRTVAISTGEMDLETFIATAGRKTKAGQLVRLLNIPLSKAVRFHDQQDGKQHADALKDSYQHHHGAAGREWIKWLAGHQQQAIDTVRECEARWRSLIPADYGEQVHRVAARFAILEAALLLGGVVTGWDAQTCRDALQHSYNAWLREFGTGNKEHQQIIEQTEAFLNAHGLSRYAPLGYDPRDLPIRDLAGYRKKGNHDSDPIIFYTFPATFEQEIARGFNAKQFAEVLKGVGMLTPPTSGRGYQGRVREDGRQIRVYVLNFMAEESSQPEE >LR134204|3542904:3554998|3544573_3545578_+|VEB93456.1|DBSCAN-SWA MHESGSPEKEDGKKAAAVLAARLFDKSRQTTGNAYLTSKGFPALPCRELTAMHKVGGVAFRAGDLIVPLYADGELVNLQLINADGGKCFLKGGQVKNAFYLIEGTAKAAKRLWIAEGYATALTINHLTGDAVMVAFSSVNFLSLASIACSQYPTHQIIIAADRDLNGAGQTRGEAAAMACNCTMALPPVFGDWNDAFTQNGEEATRLAIYEVIKPAVASPFDTMSEAEFTALSVSEKAQRVVDHYKNSLAVDPNGQLLSRYEAGAWKVIYYADFARDVAALFQRLDAPFSSAKIASLVETLKTDRSATAESGAATYRISQRCARYPDRIVQPAR >LR134204|3542904:3554998|3549570_3550005_+|VEB93462.1|DBSCAN-SWA MEKKNRPVQQAANSDICTSDITPTTSPVQVPKRTPKKHRARVYMLRTGVEGWTENDILRYCRLSSGRNYASELERQLDIQLDRIDEKNPDGIGAHLRYRFSCCGDVLKVIQLVNHNAVIGLYSGLSQQDIADILNLYPDAFNAA >LR134204|3542904:3554998|3545537_3546458_+|VEB93457.1|DBSCAN-SWA MLDTRTGLFSPHDKKHWLRTLCEVDYTQPVDGESLETYAPAFWRWLDRAAGFNPEKRDIILAALFMVLANRYDWQLFLEVTGPGGSGKSILAEIATMLAGEDNATSATIEMLESPRERAALIGFSLIRLPDQEKWSGDGAGLKAITGGDAVSVDPKYQNAYSTHIPAVILAVNNNPMRFTDRSGGVSRRRVILHFPDQIAPEERDTQLKEKIASELAVIVRQLMQRFSDPMSARTLLQSQQNSDEALTIKRDADSAFDFCGYLKVLPDTTACLWVMLTLSHVSPVHTFTMPIWSIWRLTAIKTRSV >LR134204|3542904:3554998|3547137_3548274_+|VEB93459.1|integrase|DBSCAN-SWA MVVGFIFCARYWRLKYRVAGKEKLLALGVYPDVTLADARAKRDEAKRGIAGGIDPNEAKREEKIAREANVRNTYQEIACEWHSSKLYKWSEGYASDIMEAFNKDVFPYIGKKPIAEIKPLELLNVLRRMEGRGATEKAKKVRQRCGEVFRYAIVTGRAEYNPAPDLTSAMQGHESKHYPFLNTSELPTFFEALSGYSGSMLVVLAARLLIITGLRTGELRGAMWQEIDADAAVWEIPAERMKMRRPHIVPLSSQAQAIIARIREMTGRYPHMFPGRNDPRKTMSEASVNQVFKRIGYAGKVTGHGFRHTMSTILHEQGYSTAWIETQLAHVDKNSIRGTYNHAQYLDGRREMLQWYADYMDALEHRENVIHGCFGQSS >LR134204|3542904:3554998|3549370_3549571_+|VEB93461.1|DBSCAN-SWA MSQPSLIRFHEVQKRTGYSKAWLYRLMSEKRFPAAIKIGSRSIAFIESEIDEWINQRIAESRGEVA >LR134204|3542904:3554998|3552507_3553221_+|VEB93466.1|DBSCAN-SWA MCWGIDCRAAQTSQQAASQRRVLETERQQRELERQQKATADAEHRRKTFIAKYQPLRSRSSWAKANTLLLKGWAVSPSRFSPMARCYYRWWMNPEQSTVAQTITPMGAKRLLTGSAKKGAYHAVNTSEAPEIIIIAEGLATVLSAHMMRPDALAVAAIDAWNLTPVVQVMRRKYPDAKIIIAADNDQLDDKPNTGADTANKAAISVSGWVSLPPTDYKADWNDYHQQKRAGSRHTRI >LR134204|3542904:3554998|3551802_3552066_+|VEB93465.1|DBSCAN-SWA MSQLQLIDAACQIEQAQAVLSMWLESTTKDTDPDLPRLIGSIITLLHGVPEAMEEAESKLADYVMREYRDGKACATFTIWCAVLMVK >LR134204|3542904:3554998|3543940_3544264_+|VEB93454.1|DBSCAN-SWA MKCCESHLALRAALYRRAVACAWLALSNHQERYSGLTLAELEDAIARELEGFYLRQHGQQRGLEIACALLSDLMESGPLKACPVLSLLGMTVMDELCSRHLNKPALH >LR134204|3542904:3554998|3543718_3543913_+|VEB93453.1|DBSCAN-SWA MNQFEISYDDVVRLKHLRNVGEYVTGMAALQDCYEKPAGAQCEQLVSLIYLMTEQLDGVVQTLP >LR134204|3542904:3554998|3546483_3546612_+|VEB93458.1|DBSCAN-SWA MLKEYGLQYEKRRTNQGMQTNLALREESNSDWLPKCDDPVAT >LR134204|3542904:3554998|3550018_3550861_+|VEB93463.1|DBSCAN-SWA MTIEKSRFNSEAAPQSNACQGEHSNNDFAAIVPVISGQIGGREASIVSAKALHKALGVGNDFSTWIKLRIDEYGFSLSADYVVFDSSDFRNQSSNFEQGSPGWVTKRGGDRRSKDYGLSLGMAKELAMVERNEQGRAVRRYFIQCEEALQRSVPEIAAQYRRQLKARIGAASLFKPMCVALESARAEQGKQTQARHYSNESNMIARIVLGGMTAKQWAQANSITGEPRDSMNAGQLEHLTYLESTNITLIDMGMGYDQRKAELIRLSQRWLVKHLGANHA >LR134204|3542904:3554998|3542904_3543168_+|VEB93452.1|DBSCAN-SWA MHNISVSAPAPAAPLFPVPTNQHERFLRLPEVMHLCGLSRSTIYDLISRDAFPKQIPLGGKNVAWAQSEVSAWMADRISARGRGCDA |
16 | Enterobacteria_phage(54.55%) | integrase | attL 3543438:3543454|attR 3551329:3551345 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_10 |
4200693 : 4206302
Sequences of DBSCAN-SWA_10
Nucleotide sequences of DBSCAN-SWA_10 >LR134204|4200693:4206302|DBSCAN-SWA GATGAACAGCTATCAGAAGATGATGAAGGTACTAACGAAAGCGGCGAAATCGTTATTACCTCCGAAGAAACAGAAACCCGCCGATTGGGTGGAGAATAATTTAGTATTCCCCGATGGTGAGTTACAAGGCCAGAAAGTAAATCTATTCGAATTTCAAAAGAAACCAATCAACGACATTGTTAATCCAAGAGTACGAAAGATTGTACTCATGAGCAGTGCTCAGTTATTAAAAACCACAGTACTACAAAATTCGATGTACTACTTCCTTGCCAATGATCCTAGTAATCAGATCTTTGCCGGGCAACGGCAGGAACTACAAGTAAATTCAGGACGGGTAAATGGCAATCAGTTATTGAAGCGTGTCCAGTACTGAAGAATCTCGTTAGTAATAAGAATGATAAGAACTATACGAATAACGATAAAACACAACAAAACTTAGATGGTACGTTTACCTACTTTCTTTACTCTTGGTAGTAGTGCACAACTACGTGGCCTTACAGCCCCCAGAGTGTTCTTAGATGAAGTCAGTAATGTTGATGCAGAAAGGGGGGGAAGGTAATCCGCTCAAACTCGCAGAGCAGCGTACAAAAGCATTCAGTACACCGTTAATCATGATTTGTTCTACGCCACTGGATGAAAATGACCTAATCACTCAGCAGTACGAACAGAGTAATAAACAGAAGTTTTATGTGCCTTGTCCCCATTGTGAGCATTCACATGAGTTAGTTTTTGAAAGCGTAAAATTTGACTGGAAAGTTATTGATGGCGGTCGCCGTCGCATACCAGACGCAGAAACAGCAAAATTACTATGTCCGAAATGTAGCAATGAAATTAGTGAAGCTCAACGTGTACGGATGATAAAAAAAGGACATTGGATAATAACTAATCCAGAAGTAACGGATATCATGGGCTATCACATTAGCCGCTTATACTCACCGATTAACTCAATTCGTTCAATAGTACAGGACTTTGCCGAAGCACATTATACTTCGATCTTGCCAGCTTTTACAATAACGTACTTGGACTTCCTTATATCGATAAAGAGAATACTGATCACGATTTGGTACTACTTGAAAATCTACGTGATTCATCAATTGATATTGATAATATTCCTGATGATGTATTGGGTATCGTACTTGGTGTGGATCAGCAATTAGATCGCCTTGAGGTAACTACTTTAGGAGTTAGTGAAAAGAATCTTTATGTACTGGATCACCGTTCAATTCATGCTATCGATTGTACAAAGATAGAATCACCTGCATGGAGTAAATTAACTGCATTTAGTCAAACAGCTTTCAAAACTCAGTCAGGTAAACCACTTAAAGTGTTGACGGGATTTGTTGATAGTTCTAACGGTAATGCAACGAATACAGTATACCGTTATTGTGGAGCATCGCAGATCTTTAGACCTATCAAGGGTGCAGCCAGTACTACAACGCCACTATTCAAACAATCAAGGGCTGGTGGTCATACTCTCATTAACCTAAACGTTAACTTAGGTAAAAGTAACATACGCCAGCTACTAAACAGGGCTGTATCTGATAGTGATAATTCTAAGGAAGTACAACTACATTTCTCACACTCACTACCAGATGATTACTTTATCCAACTTACAAGTGAAAAGAGGGTTATTAAGGCTGGCAATTGGATGTGGGTTAAGAAGGTATCGAGTCAGAGAAATGAAGCATTGGATTGTTTAAATTACTCTTTGATTTGCTTTCAGTGGTATCTATCTAAACTTGGTAATCAGCCATTCAGACAATTACGTGAATTTAATGCAAAACAAAAAGCAAAAGCCATAAATACAGACATAAATAAGGAAGAATCCAGACCACTTACTAAACAGCGTGTAATAAGACCAAGTAGACGCAGTGGGGGATTTTTTAAATAAGGAAGAAATACAATGGCAATTGTTCGCAAAGTTGATTTGAAAGGCGATATTATCAAAGGCGAGACGATTACTTTTAACTATCCGGTTGGTACTGATATCGATATTATCAGTCCAGAAGGTACAAAAACATCATACACCTATCCCTTCCCCGATATTGATACTAATACGTGGTTGCCTGGTATCTGGACGGCAATTATTGAAAGTCCTAATGTATATGGTGTACAGCAGTTTGAAGTAACCGATCCGACCGCAAAAGCCTCAGAATATAATGATCTCATTCAGATCATAAAAGATATTGACCAAATAACATTAGACCGTATTAAGGGTAATGGTGTCATTAGTCAAACAATTCAAAATAAATCGTTAACATATGAAAGTAGTGAAGTGGCTATTACGTCTACGCTCAATCTATGTAAAACGTGCCAATGATTTAATTACAGATATGAAAGGACTAAACGCAGGTAGCCCTATTAAATCAATTACTACATTTAACAGGGGGCGTTAATGTTCAGATGGAACAAAGAAAAGAAACCAGTAAAAGAAAAACTAAAAGTTGAAAAAACACCACTAATGAAAAACAACCCAATGAAACGTGCTATGACTTCCCTAAACCTGGGTTCAAATAGTCCTGTAATTTCATTTGGTTTTAGTGCTGGTAATCAGGCTGGCAACATTAATGCAATCATCAATCGTACTCTCCCGATAATGGTAGCGGCAAGCCGTGAGCTATCAATTAAAAACGGTATTGTTAAAAAGTATGTTGCTACTAATTCAGCAGGTGTAACGGGTGCTGATGGCTTGTACATACGCCCTTGTTCTCACAGTTCTGATGATGATGCAGTTAATCAGGAAATTAATAAGCAATTAGAAGATGCGTTTTATAAATGGGCTGAAGATCCAAAAGCGTTTTCACGTTGCGGTACATTAGATATCAGTACTTTCCAACGTCTGGTTGAACGTACACGTAGTATAGATGGTGATTGCTTTGTAAGAATTCATAGTGGTAACGATGGTATGCCACAAGTAGAGATCATTGATTCAATGCGTATTGCTACATATGACAAACCAGTTATTACCGTCAGGTAACTTTATCAGTAATGGAATTGAATATGATGTTGGTAGTAATTGCCCGGTTGCATACTGGATTACTAAATATAACCCAATTACATATCAGTACTTAATCGGTGAACGTGAACGAGTACCAGCAGATGAGATACTACATCTATTCCAACAGGACTACCCGACACAACAGCGTGGTATTCCAGATGTACACGCAGGGACTGATAAACTCAAAGAACTTGAGGAATTTATGAGTGCTGCAATTACTTCCCGTAAAGTGGCTGCGTCAGCAATGGCATTTATTACAAATCCAGATTCTGATGATATAGATTTAGTTAAGGGTGATGATATTTCATATTATGAACAGGATTATTTGAATCCGGCAGCTATTGTAGAACTACAGGCTGGTCAGGACATAAAAACTGTCAACCCCACTCAGACTACTGATGGTATCAACGAATTTGTAGATAATCAATTAATGATGATCGCAATGGGGCTTGATATTACTAAGCAATCTCTAACCAGTGATACAAGTAATGCCTCTTTTAGTGCTGCAAAGTTAGTAGATAAATTACAACAATCAACATTCAAAACTCGTACTAATGCGTTAATCGTATCAGTACTTAAACCTCTTTATATTAAGTGGTTAAAGGCTGCAATGATAAATAATAGTGCATTAAGTAATTTAAATTTTAGTGACTTTGACAAACTTACACACGCCCAATATGTACCAACTCGTCAAATTTCTTTAGATCCATATAAAGATCTTCAAACAGAAGTTTTGGCAATTCAAAACGGAATTAAGAGTAAAGCAATGGTTATTAGTGAAATGGGTTATGACCCGGCGGTAGTCATGGAAGAAATAGAAAAGGAAAAAATGGAAAATGGAATTAAAACAAACTCAACAAGTGAGGGCGATCAACTTAAGTAATGCTATTGATGAATCATCCCGAACAGTAGAACTTTCCTTTGCCTCAGAAACTCCGTAGAACGTGAAATTAATGGATCTCTATATAATGAGATCCTTCTATGTAATCCAGAGAATGTAGTTTTAAGCCGTCTGAATGACGGCTCTCCCGTTCTCGTTGAACATGACCCATTGCGGCAGGTTGGAATTGTAGAAAATGCTCGCGTAGATATGGATAAGGTATGCCGGGCAACAGTTCGATTTAGTACTTTGGGTAGTGCTCAAACTATCTTTGGAATGATTGTAGAAGGTATCCGACCTAAGATCTCAGTTGGCTATAACATTCGTGATTATTATTTCGAAGGTAATAACTTAATTGTTACCCGATGGGAACCTTATGAAATTAGTTCAGTAAGTACTCCAGCAGATATTAGTGTGGGAATTGGCAGATCACTAAATAGTAATAATGAAATCAATTTTAGAGGATCACCAGTTCATGGAAGAACAAGAAAATAAAGAAATCGTGGAAGAACAACCAGAAGCGATTGAAGAAGAAGTAATTACTCAGGAAGTAAAAGACGTTGATACAGAAGCCGAGCGTTCACTATCAATTGAAACAATTGTTGATGCTGTAAAAGAATCACTAAATAAAGATGTGGAAGAAAATCGTGTACGCGAGCTTCAGTCTATTTCGGGTGTGTTAGGGATTAACACCGAAGAAGCAATTAAAAACGGTGTAAGCGTAGAGGAATTTAAACGCTCACTAAATAAAGAAACACAATCAATTGATAAGGATATCAAAATGACTCAAAAATCTCTAATTGAGCAGGGACTACGCTCACTAAAAGGTGAAGTGAATGAATTAGATTCTTTCGAAAAGGGAACTCGTGGTTATAACGCCAATATGAATGAAATGGTGCGTTCTACTGCTAATACTACATCTACTGTTACCGCAGCAGGTCTTGTTAAAGAACAACTATCTGATTCATATATTCGCGAACTATTGGCACGTACTGTACTTGGTCAGCTACCCGTTACCGTATTTGGTGGTCTTGCAGGTCTTGGCAACTTCTCTATTCCAGTTGCTAACGGTATGACTCCCGGAGCACGTTTCTATGGTGAAGATGAAGCAGTGGAAGATGGTTTCGAATCATTCACCAAAATCACCCTAAAACCAAAAATGTTTGCAGCGGGTATCAAGATCACTAAAGCAATGCTACTAAGCAACGCTGCCACCGAACGTTATGTTACCGATGAACTACTACGCCAGTGTGCTGATGGCCTTGAAAAAGCAGTATTCGCACAACTATCTACTACCGTTCCTGTAGTTGAAACTGAAGAGGTAGGTGTCATTACCGAAGCTGATGTACAGGCGGCAATTGAAGTATTGGGTACTGCTAACGTGGATGTAAACCGTTGTGTGGCTATCGTACATCCGGCAATGTTAGCAAAACTACGCCAGTACGCTGTACTTGGTAACACCGCAGCCGTTAGTGCTGTCGCTGGTCATCGCTATGAAATGTGGCTATGTGATGAAGTCAAAGTAATTGAGTCTACTTTTGTTGACCAGGATACCGTACTAATTGGCGACTTCTCAGAGCTAATTTTCGCAAACTGGAATGACGGACAAGAATTGGATTTTGACGATACTACATATCGTTCAGCACAAACTATCGCTATTCGTTCCTTCCAGTACTTGGATACTGCGATTGCACATGAAGAATCATTTGTACAAATCAAATTAAAAGCATAA
Protein sequences of DBSCAN-SWA_10 >LR134204|4200693:4206302|4202595_4203012_+|VEB94230.1|DBSCAN-SWA MAIVRKVDLKGDIIKGETITFNYPVGTDIDIISPEGTKTSYTYPFPDIDTNTWLPGIWTAIIESPNVYGVQQFEVTDPTAKASEYNDLIQIIKDIDQITLDRIKGNGVISQTIQNKSLTYESSEVAITSTLNLCKTCQ >LR134204|4200693:4206302|4201833_4202583_+|VEB94229.1|terminase|DBSCAN-SWA MDQQLDRLEVTTLGVSEKNLYVLDHRSIHAIDCTKIESPAWSKLTAFSQTAFKTQSGKPLKVLTGFVDSSNGNATNTVYRYCGASQIFRPIKGAASTTTPLFKQSRAGGHTLINLNVNLGKSNIRQLLNRAVSDSDNSKEVQLHFSHSLPDDYFIQLTSEKRVIKAGNWMWVKKVSSQRNEALDCLNYSLICFQWYLSKLGNQPFRQLREFNAKQKAKAINTDINKEESRPLTKQRVIRPSRRSGGFFK >LR134204|4200693:4206302|4203640_4204573_+|VEB94232.1|portal|DBSCAN-SWA MTNQLLPSGNFISNGIEYDVGSNCPVAYWITKYNPITYQYLIGERERVPADEILHLFQQDYPTQQRGIPDVHAGTDKLKELEEFMSAAITSRKVAASAMAFITNPDSDDIDLVKGDDISYYEQDYLNPAAIVELQAGQDIKTVNPTQTTDGINEFVDNQLMMIAMGLDITKQSLTSDTSNASFSAAKLVDKLQQSTFKTRTNALIVSVLKPLYIKWLKAAMINNSALSNLNFSDFDKLTHAQYVPTRQISLDPYKDLQTEVLAIQNGIKSKAMVISEMGYDPAVVMEEIEKEKMENGIKTNSTSEGDQLK >LR134204|4200693:4206302|4200693_4201065_+|VEB94227.1|terminase|DBSCAN-SWA MNSYQKMMKVLTKAAKSLLPPKKQKPADWVENNLVFPDGELQGQKVNLFEFQKKPINDIVNPRVRKIVLMSSAQLLKTTVLQNSMYYFLANDPSNQIFAGQRQELQVNSGRVNGNQLLKRVQY >LR134204|4200693:4206302|4205045_4206302_+|VEB94233.1|capsid|DBSCAN-SWA MEEQENKEIVEEQPEAIEEEVITQEVKDVDTEAERSLSIETIVDAVKESLNKDVEENRVRELQSISGVLGINTEEAIKNGVSVEEFKRSLNKETQSIDKDIKMTQKSLIEQGLRSLKGEVNELDSFEKGTRGYNANMNEMVRSTANTTSTVTAAGLVKEQLSDSYIRELLARTVLGQLPVTVFGGLAGLGNFSIPVANGMTPGARFYGEDEAVEDGFESFTKITLKPKMFAAGIKITKAMLLSNAATERYVTDELLRQCADGLEKAVFAQLSTTVPVVETEEVGVITEADVQAAIEVLGTANVDVNRCVAIVHPAMLAKLRQYAVLGNTAAVSAVAGHRYEMWLCDEVKVIESTFVDQDTVLIGDFSELIFANWNDGQELDFDDTTYRSAQTIAIRSFQYLDTAIAHEESFVQIKLKA >LR134204|4200693:4206302|4203087_4203669_+|VEB94231.1|portal|DBSCAN-SWA MFRWNKEKKPVKEKLKVEKTPLMKNNPMKRAMTSLNLGSNSPVISFGFSAGNQAGNINAIINRTLPIMVAASRELSIKNGIVKKYVATNSAGVTGADGLYIRPCSHSSDDDAVNQEINKQLEDAFYKWAEDPKAFSRCGTLDISTFQRLVERTRSIDGDCFVRIHSGNDGMPQVEIIDSMRIATYDKPVITVR >LR134204|4200693:4206302|4201210_4201849_+|VEB94228.1|terminase|DBSCAN-SWA MKSVMLMQKGGEGNPLKLAEQRTKAFSTPLIMICSTPLDENDLITQQYEQSNKQKFYVPCPHCEHSHELVFESVKFDWKVIDGGRRRIPDAETAKLLCPKCSNEISEAQRVRMIKKGHWIITNPEVTDIMGYHISRLYSPINSIRSIVQDFAEAHYTSILPAFTITYLDFLISIKRILITIWYYLKIYVIHQLILIIFLMMYWVSYLVWISN |
7 | Vibrio_phage(50.0%) | portal,terminase,capsid | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_11 |
4298242 : 4312946
Sequences of DBSCAN-SWA_11
Nucleotide sequences of DBSCAN-SWA_11 >LR134204|4298242:4312946|DBSCAN-SWA TTTAAATGTCCTCCTGACGATGAGCCAATAGTGTCGCAAACGCGCCGTTGGCGCTGCTGAGTTGGGCATAATCACCTTGCTCGATGATTTGCCCGTCCTGCATAACCCAGATCACGTCCCAGTCGGCAAGATCTTCTAACTGATGCGTCACCATCAGCGTGGTCTGGCGTTTTGATGCGGCATTCAGCGCCTGCATTACACGCTGCTCGCTGTGGGCATCCAGACTGGCGGCGGGTTCATCCAGCAACAACAATTTGCAGGGATTCAGCAGGGCGCGGGCAACCGCAACCCGCTGCGCCTGACCGACGGAAAGGCGACCAGCGTGATCGCCAGTCGGCGTGTCTACGCCTTGCGGCAACAGAGGTAAAAACTCACTGACCCAGGCGCTGTCGAGAGCCGCCTGTAACGCTTGTTCACTGGCATCCGGGCGGGCAAGCAGGACGTTTTCACGGAGCGTGCCAGCCGGTAATTGTGGGTTTTGCCCCACCCAGGAGAGATGTTTTCGCCAGGATTCCGGGGATAAATCACGAAGTTCAACGCCGTTAATGCACAGGGAGCCCTGATAAGAGAGAAAGCCAGACAGCGCGTTTAACAGTGAACTCTTTCCTGAACCGCTGCGACCAACCAGTACGGCGCGCTGACCTGCGGCCAGTGTGAAATTCAGCGGTCCTGCAAGCGTTTTACCTTCCGGCGAGGTAATTATCAGATCCTGAGCTTCAAGGGTTACCGGTTCTTTCCCGGAAAGTTCGACATCGCCGCGTTCCGGGTGAGCGAGAGGTGTTTCCATAAAGGTTTTCAGGCTGTCAGCCGCACCTACCGCCTGCGCTTTGGCATGGTAAAAGGTGCCCAGATCGCGCAGTGGCTGGAAAAACTCCGGCGCCAGAATCAGCGCCAGGAAGCCGGCGGCAAGCGTGACGCCCGTCCCGTAATGGCCGAAATTAAGTTCGCCGAGGTAGGAGAAACCGAAATAAACCGCGACCAGCGCAATGGACAGCGACGTGAAGAACTCCAGTACCCCGGAGGATAAAAAGGCGAGGCGCAGGACTTCCATTGTGCGCTGGCGAAAATCTTGTGAAGCGACACGAATGCTTTCCGTTTCTGCTTCACCGCGCCCGAAGATGCGCAGCGTCTCCATACCGCGTAAGCGGTCGAGGAAATGCCCGCTCAGGCGGGCAAGCGCCTGGAAATTACGGCGGTTAGCGTCGGCGGCGCCCATTCCCACCATCGCCATAAATAATGGGATCAGCGGTGCGGTTCCGAGCAAAATCAGCGCGGCGGCCCAGTTTGACGGGAAAATCGCCGCCACAATCAACAAGGGAACGCAGACAGCAAGCGCCATTTGCGGCAGATAGCGGGCATAGTAATCATGCATATCGTCAATCTGCTCCAGCACCAGCGTCGCCCAGCTACCAGCTGGTTTACCCTGAATCCAGGCTGGCCCTGCCTGTTGCAGGCGATCCAGCACCTGGCGGCGGATCTCAAATCGGATATGCTGCCCGGCATGAAAACCCACCCGCTCGCGCAGCCAGACCACCCAGGCGCGCAGGACAAACACCAACATCAGTACGATGAAGGGGAGCAGGAGGGCTTCGCGGGGAATATTGTCCATGATCATATGCTGGAGAATGCGAGCCATGATCCAGGCCTGAGCGACAATCAGTACGCCACTCACGAAACCCAACAGGCGTGAAATGTTCAGCCAACGTTGGGAAATTACACTTTGCTGTTTTAACCAGCGGGTCAATTCTTTTTGACGGGTTTTATTCATTGCACGCTTAGCAGGTGAGTTATCGGAATTTTTGGCACGGCAATGTTACAACGGGGCAAAAATAAAGGCGACTTATAGTCGCCTTTTTTACTTTTGTTACTGATTTGTAAAAACTATTTGCACGCGTCAGCCAGACCGTCAAGGTAGCGTTCAGCGTCCAGCGCCGCCATGCAACCTGTGCCCGCTGAGGTGATTGCCTGACGATAAATATGGTCCATCACGTCGCCTGCGGCGAACACGCCGGGAACGCTGGTCTGCGTGGCGTTACCGTGGATGCCGGACTGGACTTTGATATAGCCGTTTTCCAGTTCGAGCTGACCTTCAAAAATCGCGGTGTTCGGGCTGTGGCCGATGGCGACAAACAGGCCTGCAACGTCAAGGGATTCAATGTTGTCCGGGTTTTGCGTATCACGCAGACGCAAACCGGTAACGCCCATCTGATCGCCGGTCACCTCTTCCAGAGTGCGGTGGGTGTGCAGCACGATGTTGCCGTTTTCTACTTTATCCATCAGGCGCTTGATCAGAATTTTTTCGGCGCGGAAGCCGTCGCGACGGTGAATCAAATGCACTTCAGCGGCGATGTTCGCCAGGTATAACGCTTCTTCAACCGCCGTGTTGCCGCCGCCGATGACCGCGACCTTCTGGTTGCGGTAGAAGAAACCGTCGCAGGTGGCGCAGGCAGATACGCCACGGCCTTTAAATGCTTCTTCAGACGGCAGGCCGAGGTAGCGTGCGGAAGCGCCGGTTGCGATGATCAGCGCATCGCAGGTGTATTCGCCGCTGTCGCCGGTCAGGCGGAAAGGACGGTTTTGCAGATCCACGCTATTGATGTGATCGAAGATGATTTCAGTATCAAATTTCGTCGCATGTTCGTGCATGCGCTCCATCAACAGCGGCCCGGTCAGATCGTTCGGATCGCCAGGCCAGTTTTCCACTTCCGTCGTGGTGGTCAGTTGACCGCCTTTCTCCATACCGGTAATCAGAACCGGTTGCAGGTTTGCGCGCGCAGCGTAGACCGCAGCGGTATATCCCGCCGGTCCAGAACCCAGGATAAGCAGTTTACTGTGTTTGGTCGTGCCCATGAGATCCCCATAGTTGTTGGCAGACAATGGGCAGGATTGTAGGGAATTTGCAGACGTAAAAAAAGAGTATAGCGATTTTGTTAACAATATGTGTAATAGCAGGAACCGATGAATGGGCGGATTTAGGGCTGCGATATAACGCTTTCTACTTAAAATCGGCCAGTTATAAGGTGATAAAATGACGATTTAGCGCCGCTCTTAATGAATAACTGGCATGTTGTACTAAAAATCGATGTTTTGCTTTGACAATCCCCTGCTGTTTTGCGAAAACATCTTCTGGAAGAAAAAAACAGTGTTATGTGTGTGCTGCATAATCATGCATATAAGCACCATGTTTACCGGGCTAGTGAAATCTACGCATGGCGTGGACAGACGCCATTCGTGATGTCGATAGCTGCCATCCGGCAACGGTCTTCTCACCATAGGCCCTGGCATTGCGCGCCGTTAATCCCTCTGGGTTCGGTCTATTGTGATGGGCATCGACTCTGAACAGTGATGTTAGTAGAGTCAGGCAGGAGTAGGGAAGGAATACAGAGAGACAATAATAATGGTAGATAGCAAGAAGCGCCCTGGCAAAGATCTCGACCGTATCGATCGTAACATTCTTAATGAGTTGCAAAAGGATGGGCGTATTTCTAACGTCGAGCTTTCTAAACGAGTGGGACTTTCACCGACGCCTTGCCTTGAGCGTGTCCGTCGGCTGGAAAGACAGGGTTTTATTCAGGGCTATACGGCGCTGTTAAACCCGCATTATCTGGATGCATCACTTCTGGTTTTTGTTGAGATTACTCTGAATCGTGGCGCTCCAGATGTGTTTGAACAATTTAACGCCGCAGTACAAAAACTTGAAGAAATTCAAGAGTGTCATTTAGTTTCCGGTGATTTCGACTACCTGTTGAAAACACGCGTGCCGGATATGTCAGCGTACCGTAAGCTGCTGGGTGAAACCCTGCTGCGTCTGCCTGGCGTGAATGACACACGTACTTACGTGGTAATGGAAGAAGTCAAACAGAGCAATCGTCTGGTAATTAAGACGCGCTAACACGGAACAGGTGCAAAATCGACGTAGTTTGATTACACTCCTGTTAATCCATACAGCAACAGTGCCGGGGCAACCCGGTGCTGTTGTCCGTTTTAGCATCGGGCAGGAAAAGTCCGAAACCTGGAGAGCCTTTTTTGAGCCAGGAATACACTGAAGACAAAGAAGTCACATTGAAAAAGTTAAGCAGCGGGCGCCGACTTCTGGAAGCGTTGCTGATCCTTATTGCCCTTTTTGCCGTCTGGTTGATGGCAGCCTTACTTAGCTTTAACCCTTCAGATCCCAGTTGGTCGCAAACCGCATGGCATGAGCCTATCCATAATTTAGGCGGCGCGCCGGGCGCGTGGCTGGCTGACACGCTGTTCTTTATCTTTGGCGTGATGGCTTACACCATCCCGGTGATTATCGTTGGCGGCTGCTGGTTTGCCTGGCTGCACCAGAGCAACGACGATTACATCGACTATTTTGCCGTGTCGCTGCGCATTATCGGCGTCCTGGCGTTGATTCTTACCTCCTGCGGATTAGCCGCAATTAATGCTGATGATATCTGGTACTTCGCGTCCGGCGGCGTCATTGGCAGCCTGTTAAGCACCACGCTTCAGCCTCTGCTGCACAGCAGCGGGGGAACGATTACGCTGCTGTGCGTCTGGGCAGCCGGTCTGACGCTCTTCACAGGCTGGTCCTGGGTGAGCATTGCAGAAAAACTGGGTGGCTGGCTGCTGAATATTCTCACCTTTGCCAGCAACCGTACTCGCCGCGATGACACCTGGGTCGATGACGACGAGTATGAAGATGATGACGAATACGAAGATGAAGCGGTGGGCGCACAGCGCGAATCCCGACGTGCGCGCATCCTTCGTGGCGCACTGGCGCGCCGTAAGCGGCTGGCAGAGAAGTTCAGCAATCCGCGTGGGCGACACACGGATGCGGCGCTTTTCTCCGGCAAGCGTATGGATGATGAAGACGATATCGAATACAGCGCGCGTGGCGTGGCCGCCGATCCTGATGATGTGCTGTTCTCTGGCCATCGTGCGACGCAGCCGGAATATGAGGAATACGATCCTCTGCTGAATGGACATTCGGTCACGGAGCCTGTTGCCTCCGCCGCTGCGGCGACAACGGCAACGCAAGCCTGGGCGGCGCCTGTCGATCCAGTTATGTCTGCACCGTCGGTGCCGGGTGCTGAAGCCGCGCCTGCGCAACCGGTGGTGGAATGGCAGCCTGTTCCGGGGCCGCAAACCGGCGAGCCGGTCATTGCGCCCGCGCCGGAAAGTTATCCGCCGCAACCACAATATGCGCAGCCGCAAGCGCCACATCATGAACCGTGGCAGCAACCTGCGCCGACCGAGCCGCAAGCGCAGTATTACGCTGAACACGCGTATGAACAGCCTGTGCCGCAGGTGCAGGAACCTGCGGCGGAACAACCCTGGCACCCTGAGCCGGTTTATCCGCAAGAGCCTGTTTATCAACACGAGCCGACTTTCGAGCCGCAACCGGCTTACCCGCAGGAACCGGTTCAGGATACGTATTACCAGCAGCAGCCTGCTGTTGAGCCACCGTCTGCGGTGGAGCCTGAGCCTGTAGCAGAAGAAATTAAACCCGCCCGCCCGCCGCTCTACTACTTTGAAGAAGTGGAAGAGAAACGCGCGCGTGAGCGTGAGCAACTGGCTGCGTGGTATCAGCCGATTCCTGAGCCAGCGAAAGCGCCGGAGCCGGTTAAACCATCGGCACCGACAGCAGCTTCTGTGCCGCCTGTTGAATCCGTTGCCACGGTTGCGCCGTTGGCGGCAGGCGTCAAAGATGCAACGCTGGCCGCAGGCGCGGCGGCGGCATCCGCTGCTCCTGCGTTTAGCCCGGTAAGCGGTGGTGCGCCACGCCCACAGGTTAAAGAGGGGATTGGCCCGCAACTGCCTCGCCCTAACCGCGTTCGCGTTCCGACCCGCCGTGAGCTGGCCTCGTATGGCATAAAATTACCGTCTCAGCGTATCGCGGAAGAGAAAGCGCGTGCGGCGGGGCGTCATCAGTATGATGCCGAAACGCAGTATACCGATGATGAAATTGATGAGATGCAGCAGGATGAGCTGGCACGTCAGTTCGCCCAGTCTCAGCAGCATCGTTATGGTTCCGAATATCAACATGACGCTCCTCAGGCAGAAGATGAGGATGCAGCAGAAGCTGAACTGGCTCGCCAGTTTGCCTCTTCACAGCAGCAGCGCTATGCCGGGGAACAGCCTGCGGGAGCCCATCCGTTCTCGCTGGATGATTTTGAATTCTCGCCGATGAAAGCGCTGGTGGATGAAGGCCCGCATGAGCCGTTGTTTACTCCCGGCGTGATGCCTGAGCAGCCTGTTGCGCCGCAGCCACAGTATCAGCAGCCGCAACAGCCGGTTGCGCCACAGCAACAGTATCAGCAGCCGCAGCAGCCGGTTGCGCCGCAGCAACAGTATCAACAACCGCAACAACCGGCCGCGCCACAGCCGCAGGAAAGCCTGATCCATCCGTTGTTGATGCGTAATGGCGACAGTCGTCCGTTGCAGAAACCTACCACGCCGTTGCCGTCGCTGGATCTGCTGACGCCGCCGCCGAGCGCAGTGGAACCGGTCGATACTTTCGCGCTGGAACAAATGGCGCGCCTGGTCGAAGCGCGTCTGGCGGATTTCCGCATTAAAGCTGACGTGGTGAACTATTCCCCAGGGCCTGTTATTACCCGTTTTGAGCTGAATCTGGCGCCGGGCGTGAAAGCCGCGCGTATTTCTAACCTGTCACGCGATCTCGCGCGTTCACTGTCAACCGTTGCGGTACGCGTGGTAGAAGTGATCCCTGGTAAGCCTTACGTGGGCCTCGAACTGCCGAACAAAAAACGTCAGACCGTCTACCTGCGTGAAGTGCTGGATAACGCTAAATTCCGCGAGAATCCGTCGCCGCTCACCGTGGTGCTGGTAAAGATATCGCTGGCGATCCGGTTGTGGCCGATCTGGCGAAAATGCCTCATCTGCTGGTTGCGGGGACGACGGGTTCCGGTAAATCGGTTGGGGGTCAATGCCATGATCCTCAGCATGCTGTATAAAGCGCAGCCGGAAGACGTGCGTTTCATCATGATTGACCCGAAAATGCTGGAACTTTCGGTCTATGAAGGCATTCCACATCTGCTGACCGAAGTGGTCACCGATATGAAAGACGCCGCCAACGCATTGCGCTGGAGCGTTAATGAAATGGAACGCCGCTATAAGCTGATGTCGGCGCTTGGCGTGCGTAATCTGGCAGGCTATAACGAGAAAATTGCCGAAGCGGCACGGATGGGGCGTCCGATTCCAGACCCGTACTGGAAGCCGGGCGACAGCATGGATGCCCAGCATCCGGTGCTGGAAAAAACTGCCGTATATCGTGGTTCTGGTCGATGAATTCGCCGATCTGATGATGACCGTCGGTAAGAAAGTTGAAGAGCTGATTGCGCGTCTGGCGCAGAAAGCGCGTGCCGCAGGTATTCACCTGGTGCTGGCGACTCAACGCCCGTCGGTTGACGTTATTACCGGTCTGATTAAGGCCAACATTCCGACGCGTATCGCCTTTACCGTATCCAGTAAGATCGACTCACGCACGATCCTCGATCAGGGCGGCGCGGAGTCCCTGCTGGGGATGGGGGATATGCTTTATTCCGGCCCGAACTCGACCATGCCGGTACGTGTTCATGGCGCGTTTGTCCGCGACCAGGAAGTGCATGCGGTGGTGCAGGACTGGAAAGCGCGTGGTCGCCCGCAATACATTGACGGCATCACCTCGGATAGCGAAAGCGAAGGGGGCGGCGGCGGCTTTGACGGCGGCCGAGGAGCTGGACCCGTTATTCGATCAGGCGGTGAGCTTTTGTGACCGAAAAACGCAAAGCGTCCATTTCTGGCGTTCAGCGCCAGTTCCGTATCGGTTACAACCGTGCGGCGCGCATTGTCGAGCAGATGGAAGCCCAGGGCATTGTCAGCGAACAAGGCCATAACGGGAATCGCGAGGTGCTGGCGCCACCGCCGTTCGAATGATTGCAAAGATCGGTAAATAAATAAGAATCAGTATTTTCTTCTTTCCTCATGCTGATTTTTGGCCTGGAATAGAGAGCAGAGGGAACTCCCCATCGGGAGTGACGTATTTTTGAGGAATAATGATGAAAAAAATCCGCAATCACCTGCGCATTACTCTCCGGCTTTGTGGTGAGCAGCGTATGGGCGGATGCCGCCAGCGATCTGAAAAGCCGTCTGGATAAAGTCAGCAGCTTCCACGCCAGCTTTACGCAAAAAGTGACTGACGGCAGCGGCGCCGCCGTTCAGGAAGGCCAGGGCGACTTATGGGTTAAGCGTCCTAATCTGTTTAACTGGCATATGACGCAGCCCGACGAGAGCATTCTGGTCTCTGACGGGAAAACGCTATGGTTCTTTAACCCGTTTGTCGAGCAGGCGACGGCGACCTGGTTAAAAGATGCGACGGGCAATACGCCGTTTATGCTGATAGCCCGTAACCAGTCCAGCGACTGGCAGCAGTACAACATCAAACAAAATGGCGATGACTTTGTGCTGACGCCAAAAAGCAGCAGCGGTAACCTGAAGCAGTTCACCATTAATGTGGGTCGTGACGGGACTATTCATCAGTTCAGCGCAGTGGAACAAGACGATCAGCGCAGTAGCTATCAGCTGAAATCTCAGCAGAATAGCGCTGTCGATGCATCAAAATTTACCTTTACTCCGCCGCAAGGTGTAACGGTAGACGACCAACGTAAGTAGAGGCGCATGAGTGGAGCAATCTGTCGCTCGATTTTTCTGATAATACCTTTCAGCCACTGGCCGCGCGTATGCGGCCAGAAAATTTAGCGCAGTATATTGGGCAGCAGCATCTGCTGGCTGCGGGTAAGCCTTTGCCTCGTGCAATTGAGGCCGGGCACCTGCACTCGATGATTTTGTGGGGGCCGCCCGGTACGGGCAAAACCACGCTGGCCGAAGTGATTGCCCGCTATGCCAATGCTGACGTCGAACGCATTTCCGCCGTGACTTCTGGCGTAAAAGAGATCCGTGAAGCCATTGAACGCGCCCGCCAGAACCGTAATGCCGGGCGCCGCACTATTCTGTTTGTCGATGAAGTCCATCGCTTCAACAAGAGCCAGCAGGATGCGTTTCTGCCGCATATCGAAGACGGCACGATTACGTTTATTGGCGCGACGACCGAAAACCCGTCCTTCGAGCTGAACTCGGCTTTGCTGTCACGCGCCCGCGTGTATCTCCTCAAATCCTTAACGACTGAAGATATTGAGCAGGTGCTGAATCAGGCGATGGATGACAAAGCGCGTGGTTATGGTGGCCAGGATATCGTTCTGCCGGACGAGACGCGCCGGGCGATCGCCGAACTGGTGAACGGAGATGCGCGTCGGGCGTTAAATACGCTGGAAATGATGGCAGACATGGCGGAAGTGGATGACAGCGGTAAGCGAGTATTATTACCCGCATTACTGACCGAAATCGCCGGGGAGCGCAGCGCGCGCTTTGATAATAAAGGCGACCGCTTTTACGATCTCATCTCCGCATTGCATAAATCGGTGCGTGGCAGCGCCCCGGACGCGGCGCTTTACTGGTATGCCCGGATCATTACCGCTGGCGGCGACCCGCTGTATGTCGCCCGACGTTGCCTGGCGATTGCTTCTGAAGACGTCGGCAATGCCGATCCTCGCGCAATGCAGGTGGCGATTTCTGCGTGGGATTGCTTTACCCGCGTCGGCCCTGCGGAAGGGGAACGGGCGATAGCCCAGGCGATTGTTTATTTAGCCTGTGCGCCGAAAAGTAACGCGGTTTATACCGCGTTCAAAGCGGCGCTGGCCGATGCCCGTGAACGCCCGGATTATGACGTGCCTGTGTATCTGCGCAATGCGCCGACGAAACTGATGAAAGAAATGGGCTATGGCCAGGAATACCGCTACGCGCATGATGAGCCAAACGCCTACGCCGCAGGCGAGGTTTATTTCCCGCCGGAAATAGCGCAAACACGCTATTATCATCCCACAAACAGGGGTCTTGAAGGCAAGATTGGCGAAAAGCTCGCCTGGCTGGCTGAACAGGATCAAAATAGCCCCACAAAACGCTACCGTTAGCGCGATCGTTGCGGTAATGTTGGCACTGTATCCCTGTGACTGCAGGCTGTGGTCACGGTTCCTATTTTAATTCGATAAGCACAGGATAAGCATGCTCGATCCCAATCTGCTGCGTAATGAGCCAGACGCAGTCGCTGAAAAACTGGCACGCCGGGGCTTTAAGCTGGATGTAGATAAGCTGCGCGCTCTTGAAGAGCGTCGTAAAGTTTTGCAGGTCAACACGGAAAACCTGCAAGCAGAGCGTAACTCTCGATCGAAATCCATCGGCCAGGCGAAAGCGCGCGGGGAAGATATCGAGCCTTTACGTCTGGAAGTGAACAAGCTGGGCGAAGAGCTGGATGCGGCAAAAGCTGAGCTGGAGAGCTTACAGGCTGAAATTCGCGATATCGCGCTGACCATTCCTAACTTACCGGCTGATGATGTACCGGTAGGTAAAGATGAAAACGACAACGTTGAAGTCAGCCGTTGGGGAACCCCGCGTGAGTTTGATTTCGACGTCCGCGATCATGTGACGCTGGGTGAAATGCACGCCGGTCTCGACTTCGCGGCTGCGGTTAAACTGACCGGTTCTCGTTTTGTGGTAATGAAAGGGCAGATTGCCCGTATGCACCGCGCGCTGTCGCAGTTCATGCTGGATCTGCACACCGAACAGCATGGTTACAGTGAAAACTACGTGCCGTATCTGGTGAACCACGACACGCTGTACGGTACAGGGCAGTTGCCGAAATTTGCGGGCGATCTGTTCCATACTCGCCCGCTGGAAGAAGAGGCTGACAGCAGCAACTATGCGCTGATCCCGACGGCGGAAGTGCCGCTGACTAACCTGGTGCGTGATGAAATCATCGACGAAGATGCGCTGCCGATCAAAATGACTGCGCATACGCCATGCTTCCGTTCTGAAGCGGGTTCTTACGGTCGTGACACGCGCGGTCTGATCCGTATGCACCAGTTCGATAAAGTTGAGATGGTGCAGATTGTGCGTCCAGAAGAGTCAATGGATGCGCTGGAAGAGATGACCGGCCATGCTGAGAAAGTGCTTCAGTTGCTGGGTCTGCCGTACCGTAAAATCGTTCTGTGTACTGGTGACATGGGCTTTGGCGCATGTAAAACCTACGATCTCGAAGTATGGATCCCGGCGCAGAACACCTATCGTGAAATCTCTTCCTGCTCGAACGTATGGGATTTCCAGGCGCGTCGTATGCAGGCACGCTGCCGCAGTAAAGTCCGACAAGAAAACCCGTCTGGTTCATACTCTGAACGGTTCCGGTCTGGCGGTTGGGCGTACTTTAGTTGCCGTGATGGAAAACTACCAGCAGGCTGATGGCCGCATCGAAGTTCCAGAAGTATTACGTCCGTATATGAACGGGCTGGAATATATCGGCTAATCCCCGCTCCTCTCTGCTTAAAAAGCGCCTCCGGGCGCTTTTTTTTATGCCTGTTTGACACCGGACAAGAATGGTTACCCCTTTGGGGTAATACTACCTTCGAATGAAGATTAGCATTTATCGCTGATTTTCTCATATTCATTATATACAAAACTATATAGCGATTTATTCGTTGTATAAATAACTACATAACGAAGTGTCTGGCTTTTATCAATCGTGAGCAAGCAAAGTGAGTCATCATGAAAACGAATATCCCCGATGCTGTACTGGCAGCTGAGGTAACTCGTCGAGGCCTGGTAAAAACGACAGCAATAGGCGGGCTGGCAATCGCCAGTAGCGCATTGACTCTGCCATTTACCCGGATCGCCAATGCGGCTGACGCTATCAGCCCCAACGTATCCAGCGAAAAAATTATCTGGAGCGCTTGCACCGTTAACTGCGGAAGCCGCTGCCCGCTGCGCATGCATGTGGTGGACGGTGAAATCAAATATGTTGAAACAGACAATACGGGCGATGATAACTATGACGGTCTGCATCAGATTCGCGCCTGTCTGCGCGGGCGTTCTATGCGCCGCCGGGTTTATAACCCCGATCGTCTGAAATATCCAATGAAGCGCGTTGGCAAGCGTGGTGAAGGTAAGTTTGAGCGCATCAGCTGGGACGAAGCCTATGACATTATCACCACCAATATGCAGCGTCTTATCAAAGACTACGGCAACGAATCCATCTATTTAAACTATGGTACGGGTACGCTTGGCGGCACGATGACGCGCTCCTGGCCACCAGGAAAAACACTGGTTGCCCGGCTGATGAACTGCTGCGGCGGTTACCTCAACCACTACGGCGATTACTCATCCGCGCAAATTGCCGCCGGGTTAAATTACACCTACGGCGGCTGGGCGGACGGTAATAGTCCTTCCGACATTGAAAATAGTAAGCTGGTTGTGCTGTTTGGTAACAATCCGGGGGAAACCCGTATGAGCGGCGGCGGGGTAACCTATTACCTCGAACAGGCACGGCAAAAATCAAATGCGCGGATGATTATTATCGATCCGCGTTATACCGATACCGGCGCGGGTCGTGAGGATGAGTGGATTCCGATTCGCCCGGGTACGGATGCGGCGCTGGTAAATGCGCTGGCATACGTCATGATCACCGAAGATATGGTCGATCAGCCGTTCCTCGACAAATACTGTGTGGGTTATGACGAAAAAACCTTACCCGCCAGCGCGCCGAAAAATGGTCACTACAAAGCCTATATTCTGGGGCAGGGTAGAGACGGTATTGCGAAGACGCCGGAGTGGGCGTCGCAGATCACCGGTATCCCGGCGGCGCGCATTGTAAAACTGGCGCGTGAAATCGGTAGCGCCAAACCCGCTTATATCAGCCAGGGCTGGGGGCCGCAGCGTCATGCCAACGGCGAAATCGTTACTCGCGCAATCTCTATGCTGGCGATCCTGACCGGCAACGTCGGCATTAACGGCGGCAACAGCGGTGCGCGTGAGGGATCTTACGATTTGCCGTTTGAGCGTATGCCAACGCTGGAAAACCCGGTCGAGACCAGCATTTCGATGTTCTTATGGACCGATGCCATTGAGCGCGGGCCGGAAATGACGGCGCTGCGTGACGGGGTTCGCGGCAAAGATAAACTCGACGTTCCCATCAAAATGATCTGGAACTATGCAGGTAACTGCCTGATTAACCAGCATTCCGAAATTAACCGCACGCATGAAATTTTGCAGGATGATAAGAAGTGCGAAATGATCGTCGTTATCGACTGTCATATGACCTCTTCAGCGAAATATGCCGATATTCTGCTGCCGGACTGCACCGCTTCTGAGCAGATGGACTTCGCGCTGGATGCCTCCTGCGGGAACATGTCCTACGTTATTTTCACCGACCAGGCCATTAAACCGCGCTTTGAGTGCAAGACTATCTATCAGATGACCAGCGAGCTGGCGAAGCGTCTCGGCGTGGAGCAGCAGTTTACCGAAGGGCGTACGCAAGAAGAGTGGATGCGTCATCTGTACGAACAGTCGCGTCAGGCGATTCCCGAACTGCCGACATTTGAAGCATTCCGCAAGCAGGGCATCTTTAAGCAGCGCGACCCTGAAGGTCATCACGTTGCCTATAAAGCGTTCCGTGACGATCCGCAGGCTAATCCGCTGACAACGCCGTCGGGCAAAATCGAGATCTATTCGCAGGCGCTGGCAGAGATTGCGGCAACCTGGGAACTGCCGGAAGGCGATGTGATCGATCCGTTGCCGATCTATACGCCGGGCTTTGAAAACTACATCGATCCGTTGGACCAAAGACTATCCGTTGCAATTAACGGGCTTCCACTACAAATCTCGCGTGCACTCTACCTATGGCAACGTGGATGTTCTGAAAGCTTCCTGCCGCCAGGAGATGTGGATTAACCCGATGGACGCGCAAAAACGCGGCATCAGCAACGGCGACAAAGTGCGTATCTTCAACGGTCGCGGCGAGTTGCATATTGAGGCGAAAGTGACGCCGCGCATGATGCCGGGCGTCGTTGCGTTGGGCGAAGGCGCCTGGTATGACCCGGATGCAAAACGCGTGGATCAGGGGGGATGTATTAACGTCCTGACGACCCAGCGTCCTTCTCCTCTCGCTAAGGGGAATCCGTCCCATACGAACCTCGTTCAGGTTGAGAAGGTATAAGGAGTAACCGATGACAACCCAATATGGATTTTTTATTGACTCCAGTCGTTGCACCGGTTGCAAAACCTGCGAACTGGCCTGCAAAGACTACAAAGATTTAACCCCGGACGTCAGCTTCCGCCGCATCTATGAATATGCGGGCGGCGACTGGCAGGAGGATAACGGCGTCTGGCACCAGAATGTCTTTGCCTACTATCTTTCCATTTCCTGCAACCACTGCGAAGACCCGGCCTGCACGAAAGTGTGCCCGAGCGGCGCGATGCACAAACGCGAGGATGGCTTTGTGGTGGTGGATGAGGACGTCTGCATCGGCTGTCGCTACTGCCATATGGCCTGCCCGTACGGTGCGCCGCAGTATAACGCCGCCAAAGGCCACATGACCAAATGCGACGGTTGTCATGACCGCGTTGCCGACGGCAAAAAACCGATCTGCGTCGAGTCCTGCCCGCTGCGCGCGCTGGACTTTGGCCCGATAGAAGAGCTGCGGAAGAAACATGGCACGCTGGCGGCCGTTGCGCCGCTGCCGGGCGCGCACTTCACCAGACCGAACATTGTGATTAAACCCAACGCCAACAGCCGCCCGACCGGGGATACCACGGGTTATCTGGCAAATCCGAAGGAGGTGTAA
Protein sequences of DBSCAN-SWA_11 >LR134204|4298242:4312946|4302267_4305531_+|VEB94345.1|DBSCAN-SWA MSQEYTEDKEVTLKKLSSGRRLLEALLILIALFAVWLMAALLSFNPSDPSWSQTAWHEPIHNLGGAPGAWLADTLFFIFGVMAYTIPVIIVGGCWFAWLHQSNDDYIDYFAVSLRIIGVLALILTSCGLAAINADDIWYFASGGVIGSLLSTTLQPLLHSSGGTITLLCVWAAGLTLFTGWSWVSIAEKLGGWLLNILTFASNRTRRDDTWVDDDEYEDDDEYEDEAVGAQRESRRARILRGALARRKRLAEKFSNPRGRHTDAALFSGKRMDDEDDIEYSARGVAADPDDVLFSGHRATQPEYEEYDPLLNGHSVTEPVASAAAATTATQAWAAPVDPVMSAPSVPGAEAAPAQPVVEWQPVPGPQTGEPVIAPAPESYPPQPQYAQPQAPHHEPWQQPAPTEPQAQYYAEHAYEQPVPQVQEPAAEQPWHPEPVYPQEPVYQHEPTFEPQPAYPQEPVQDTYYQQQPAVEPPSAVEPEPVAEEIKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPAKAPEPVKPSAPTAASVPPVESVATVAPLAAGVKDATLAAGAAAASAAPAFSPVSGGAPRPQVKEGIGPQLPRPNRVRVPTRRELASYGIKLPSQRIAEEKARAAGRHQYDAETQYTDDEIDEMQQDELARQFAQSQQHRYGSEYQHDAPQAEDEDAAEAELARQFASSQQQRYAGEQPAGAHPFSLDDFEFSPMKALVDEGPHEPLFTPGVMPEQPVAPQPQYQQPQQPVAPQQQYQQPQQPVAPQQQYQQPQQPAAPQPQESLIHPLLMRNGDSRPLQKPTTPLPSLDLLTPPPSAVEPVDTFALEQMARLVEARLADFRIKADVVNYSPGPVITRFELNLAPGVKAARISNLSRDLARSLSTVAVRVVEVIPGKPYVGLELPNKKRQTVYLREVLDNAKFRENPSPLTVVLVKISLAIRLWPIWRKCLICWLRGRRVPVNRLGVNAMILSMLYKAQPEDVRFIMIDPKMLELSVYEGIPHLLTEVVTDMKDAANALRWSVNEMERRYKLMSALGVRNLAGYNEKIAEAARMGRPIPDPYWKPGDSMDAQHPVLEKTAVYRGSGR >LR134204|4298242:4312946|4305993_4306158_+|VEB94347.1|DBSCAN-SWA MTEKRKASISGVQRQFRIGYNRAARIVEQMEAQGIVSEQGHNGNREVLAPPPFE >LR134204|4298242:4312946|4306961_4308248_+|VEB94349.1|DBSCAN-SWA MRPENLAQYIGQQHLLAAGKPLPRAIEAGHLHSMILWGPPGTGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQNRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELNSALLSRARVYLLKSLTTEDIEQVLNQAMDDKARGYGGQDIVLPDETRRAIAELVNGDARRALNTLEMMADMAEVDDSGKRVLLPALLTEIAGERSARFDNKGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIASEDVGNADPRAMQVAISAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVYTAFKAALADARERPDYDVPVYLRNAPTKLMKEMGYGQEYRYAHDEPNAYAAGEVYFPPEIAQTRYYHPTNRGLEGKIGEKLAWLAEQDQNSPTKRYR >LR134204|4298242:4312946|4312328_4312946_+|VEB94353.1|DBSCAN-SWA MTTQYGFFIDSSRCTGCKTCELACKDYKDLTPDVSFRRIYEYAGGDWQEDNGVWHQNVFAYYLSISCNHCEDPACTKVCPSGAMHKREDGFVVVDEDVCIGCRYCHMACPYGAPQYNAAKGHMTKCDGCHDRVADGKKPICVESCPLRALDFGPIEELRKKHGTLAAVAPLPGAHFTRPNIVIKPNANSRPTGDTTGYLANPKEV >LR134204|4298242:4312946|4312057_4312318_+|VEB94352.1|DBSCAN-SWA MDAQKRGISNGDKVRIFNGRGELHIEAKVTPRMMPGVVALGEGAWYDPDAKRVDQGGCINVLTTQRPSPLAKGNPSHTNLVQVEKV >LR134204|4298242:4312946|4309872_4312053_+|VEB94351.1|DBSCAN-SWA MKTNIPDAVLAAEVTRRGLVKTTAIGGLAIASSALTLPFTRIANAADAISPNVSSEKIIWSACTVNCGSRCPLRMHVVDGEIKYVETDNTGDDNYDGLHQIRACLRGRSMRRRVYNPDRLKYPMKRVGKRGEGKFERISWDEAYDIITTNMQRLIKDYGNESIYLNYGTGTLGGTMTRSWPPGKTLVARLMNCCGGYLNHYGDYSSAQIAAGLNYTYGGWADGNSPSDIENSKLVVLFGNNPGETRMSGGGVTYYLEQARQKSNARMIIIDPRYTDTGAGREDEWIPIRPGTDAALVNALAYVMITEDMVDQPFLDKYCVGYDEKTLPASAPKNGHYKAYILGQGRDGIAKTPEWASQITGIPAARIVKLAREIGSAKPAYISQGWGPQRHANGEIVTRAISMLAILTGNVGINGGNSGAREGSYDLPFERMPTLENPVETSISMFLWTDAIERGPEMTALRDGVRGKDKLDVPIKMIWNYAGNCLINQHSEINRTHEILQDDKKCEMIVVIDCHMTSSAKYADILLPDCTASEQMDFALDASCGNMSYVIFTDQAIKPRFECKTIYQMTSELAKRLGVEQQFTEGRTQEEWMRHLYEQSRQAIPELPTFEAFRKQGIFKQRDPEGHHVAYKAFRDDPQANPLTTPSGKIEIYSQALAEIAATWELPEGDVIDPLPIYTPGFENYIDPLDQRLSVAINGLPLQISRALYLWQRGCSESFLPPGDVD >LR134204|4298242:4312946|4300122_4301091_-|VEB94343.1|DBSCAN-SWA MGTTKHSKLLILGSGPAGYTAAVYAARANLQPVLITGMEKGGQLTTTTEVENWPGDPNDLTGPLLMERMHEHATKFDTEIIFDHINSVDLQNRPFRLTGDSGEYTCDALIIATGASARYLGLPSEEAFKGRGVSACATCDGFFYRNQKVAVIGGGNTAVEEALYLANIAAEVHLIHRRDGFRAEKILIKRLMDKVENGNIVLHTHRTLEEVTGDQMGVTGLRLRDTQNPDNIESLDVAGLFVAIGHSPNTAIFEGQLELENGYIKVQSGIHGNATQTSVPGVFAAGDVMDHIYRQAITSAGTGCMAALDAERYLDGLADACK >LR134204|4298242:4312946|4305475_4305997_+|VEB94346.1|DBSCAN-SWA MPSIRCWKKLPYIVVLVDEFADLMMTVGKKVEELIARLAQKARAAGIHLVLATQRPSVDVITGLIKANIPTRIAFTVSSKIDSRTILDQGGAESLLGMGDMLYSGPNSTMPVRVHGAFVRDQEVHAVVQDWKARGRPQYIDGITSDSESEGGGGGFDGGRGAGPVIRSGGELL >LR134204|4298242:4312946|4298242_4300009_-|VEB94342.1|DBSCAN-SWA MNKTRQKELTRWLKQQSVISQRWLNISRLLGFVSGVLIVAQAWIMARILQHMIMDNIPREALLLPFIVLMLVFVLRAWVVWLRERVGFHAGQHIRFEIRRQVLDRLQQAGPAWIQGKPAGSWATLVLEQIDDMHDYYARYLPQMALAVCVPLLIVAAIFPSNWAAALILLGTAPLIPLFMAMVGMGAADANRRNFQALARLSGHFLDRLRGMETLRIFGRGEAETESIRVASQDFRQRTMEVLRLAFLSSGVLEFFTSLSIALVAVYFGFSYLGELNFGHYGTGVTLAAGFLALILAPEFFQPLRDLGTFYHAKAQAVGAADSLKTFMETPLAHPERGDVELSGKEPVTLEAQDLIITSPEGKTLAGPLNFTLAAGQRAVLVGRSGSGKSSLLNALSGFLSYQGSLCINGVELRDLSPESWRKHLSWVGQNPQLPAGTLRENVLLARPDASEQALQAALDSAWVSEFLPLLPQGVDTPTGDHAGRLSVGQAQRVAVARALLNPCKLLLLDEPAASLDAHSEQRVMQALNAASKRQTTLMVTHQLEDLADWDVIWVMQDGQIIEQGDYAQLSSANGAFATLLAHRQEDI >LR134204|4298242:4312946|4301638_4302133_+|VEB94344.1|DBSCAN-SWA MVDSKKRPGKDLDRIDRNILNELQKDGRISNVELSKRVGLSPTPCLERVRRLERQGFIQGYTALLNPHYLDASLLVFVEITLNRGAPDVFEQFNAAVQKLEEIQECHLVSGDFDYLLKTRVPDMSAYRKLLGETLLRLPGVNDTRTYVVMEEVKQSNRLVIKTR >LR134204|4298242:4312946|4308339_4309569_+|VEB94350.1|tRNA|DBSCAN-SWA MLDPNLLRNEPDAVAEKLARRGFKLDVDKLRALEERRKVLQVNTENLQAERNSRSKSIGQAKARGEDIEPLRLEVNKLGEELDAAKAELESLQAEIRDIALTIPNLPADDVPVGKDENDNVEVSRWGTPREFDFDVRDHVTLGEMHAGLDFAAAVKLTGSRFVVMKGQIARMHRALSQFMLDLHTEQHGYSENYVPYLVNHDTLYGTGQLPKFAGDLFHTRPLEEEADSSNYALIPTAEVPLTNLVRDEIIDEDALPIKMTAHTPCFRSEAGSYGRDTRGLIRMHQFDKVEMVQIVRPEESMDALEEMTGHAEKVLQLLGLPYRKIVLCTGDMGFGACKTYDLEVWIPAQNTYREISSCSNVWDFQARRMQARCRSKVRQENPSGSYSERFRSGGWAYFSCRDGKLPAG >LR134204|4298242:4312946|4306323_4306893_+|VEB94348.1|DBSCAN-SWA MVSSVWADAASDLKSRLDKVSSFHASFTQKVTDGSGAAVQEGQGDLWVKRPNLFNWHMTQPDESILVSDGKTLWFFNPFVEQATATWLKDATGNTPFMLIARNQSSDWQQYNIKQNGDDFVLTPKSSSGNLKQFTINVGRDGTIHQFSAVEQDDQRSSYQLKSQQNSAVDASKFTFTPPQGVTVDDQRK |
12 | Escherichia_phage(30.0%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_12 |
4726740 : 4784003
Sequences of DBSCAN-SWA_12
Nucleotide sequences of DBSCAN-SWA_12 >LR134204|4726740:4784003|DBSCAN-SWA TCTATTCCTGAGCCAGCGCTTCCAGCTTGTCCCGAAAGCCGGTGACGGAAATCGCGCGGTTGTCAGCCCGCCAGCGATCTTTTGCCGCCGGAGCGGAACTTTGTACGCCAATCAACTGCCAGCCAGCGTCGGTATGCAGCATTAACGGAGAACCGCTGTCGCCGGGCAGCGTGTCGCACTGGTGAGAGAGCACCGCGTTTTGCGCCCAGCCGGTAATCAGGCAATCCTGATGCGAGTAAAGCGTATCAAGATGATCTTCCGGGTAACCTGACTGCGTCACTTTGCGGTCGGCGGTTTTCAGGGCAGCGGTTAGCGCGGCTTTATCACCTTCAAATAAGGGCAGTGGGGTAATGCCGGACGGCGGATAACGCAACACAATCAAACCAAAATCCCACGGCGCGGCGGCCGGAGGGACAATCCAGCCATCGCCATCAGGCTTTAAGCGCTTTCCCAGCGACGGATCGACCCGACCTTCAATACCGTGAATTTCATAACGCCAGAGACCTTTTTGCGATACGAAACGCAAAGCCACGGCTTTATCTGGCTTTCCCTTCGGCGGCGTTAACAAGCAGTGTCCGGCGGTGAGCGCCAGATGAGATGAAATGAGCGTGGCGGTACACAGGTTGCCGCTGGCGGTTTCCAGTTGACCAATAGCGTCCCACGGCGACTGCGTCGGATCTTCAACCTTAACCCGATCGTCATGGTCAAAAAATAACGTTTTGGCTTCACGGGTATCCGTGGTGGTATCAGCCTCATCCGCATGAGCAACGACAGGGGAAAGACAAATTACGCCCAGTAATACCGCAATGGTTTTATGCATATCACACTCTGTGTGGGTAATTATGATTATTAAAAGCAAACCCTCAGTGAATACTATAGACGTGACGGCAGGAAAGTGGGAGTAAAATCAGATAACTACATTCAGGAAATATAAAAATGCGTGGCAATAAGGGCGCATAATATAAGCATGGTGAGAATCAACTCAAAGCGGTAACGAAACACGTTAACCCTCCGAAGAAACGGCGCTAAACCCGATGATTTAGCGCCGTGTGTTATGGCGCGGGGGAGCGCCATAGAACTCGCCAGTGCCTTATGCCGCCGGTTTAGCTGCCGGTTTAGCCGCAGTATGTTTTACCGCTTTTTTGGTGTGTTTTTTCGCTGCCTGCGCTTTTTGTTCAGGCGCTTTCTGAGCCGCTGCTTTGTGGTGTTTTTTGTGATGCGTGGTTTTTGCCGGTGCGGTTTTAGCGGGTGCCGCAGCCGGAGCCGTCGTGGACGTTGCCGTTTCAGCAGCAAATGCAGCAGAAGACAGACCCATAGCAGCGGCAACAACCAGAGCTAATACTTTTTTCATTCTGATACCCTCGATTTGGTTTTTTATTCAACCCCACTGCGGGGCCGGTGAAATCACTATATCGCTGTAAATCGAGGGCTTCCGTGAGTCATTGGTATCGGCGTGTAACGGTATGTACAACCGGCCCGACAGATTGACGCTATCGGGCAAACAAGAGACATCATGACAAATAGCGTGATTCCAGATGCGCCCGGAAGTAACGGCTGCTCAACGGTTCTCCGGTGGCCTGAGCGATCAGCTGCGCGGTAGTAAAGCGACTGCCGTGCTGCCAGATATTTTGTCGCAGCCAGTCAAACAAGGCGCTGAGATCGCCTTGCGCAATCTGATTATGCAGATCCGGCAATGCGCTATGAGCGGCGCTAAACAGCTGTGCGGCATACATGGCGCCCAGCGTATAAGACGGGAAGTAGCCAAAGCCGCCGTCGGTCCAGTGGATGTCCTGCATACAGCCGTTACGGTAATCTCCCGCGGTTGACAGCCCCAGCCAGGCCTGCATTTTTTCATCCCACAGTGCGGGAATATCATCCACTTCGATGTCGCCGTTAATCAGCGCCTGTTCAATGTCATAGCGCAAGATAACATGGGCAGGATAACTCACTTCGTCGGCATCCACGCGGATAAAGCCCGGTTTCACCCGCTGATTCCAGGCAATGAAATTCGCTTCTTCAAAAGCGGGCTGATCGCCAAAATGACGTTTAACGGCGGGAATGAGTCGTTTGAGGAACGCAGCGCTGCGACCCAGTTGCATCTCGAAAAACAGGCTCTGGGATTCATGGATGGCCGTGGAGCGGGCAAATGCGATCGGCTGACCCTGCCAGGCGTGGGGCAGATTCTGTTCATAGCGGGCGTGCCCCCGTTTCGTGGATCACGCCAAAGAGCGCGCTGAGTAATTCATTTTCATCATAGCGCGTGGTGATGCGCACATCTTCCGGCACGCCGCCGCAGAACGGGTGGGCGCTGATGTCCAGCCGACCGACGTTAAAATCAAACCCTAACTGCGACATGGTTTCCAGTCCCAGTTCACGCTGTAGCGCCGTCGGGAACGGGCCGACCGGCGCAATCAGCGGCTGATGCGACTGTTTTGACACCGCGCGGTTTAACAGATCGGGAAGCCAGGTTTTGACATCCGCAAACAGAACATCCAGTTGGGCGCTGGTCATATCGGGCTCAAAAATATCCAGCAGGGCATCGTAGGGCGAGCATCCTTTAGCTTCGGCGCGAAGTTTTGCCTCTTCACGGCTGTATTTCACAACCTCTTTCAGATTCTGCGCAAATCCTTGCCAGTCATTCGCCGGACGCTGACTCCGCCATGCGTGTTCACACCGGCTCCCCGCCAGCGATTTCGCCTCCACCAGTGACTCAGGCAGCAGGGCCGCCTGATGGTGCCGCCTCGCCATTTCACGCAGGTTCGCCTGCTCAACATCGTTCAAATCTTCCTGCCCGGCGGCGGCTATCCACTGCGCAACCTTCGGGTCAGTCAATAACTGATGTTCCAGCACGTTCAGCTCGGCCAGCGCTTCGCCTCGCGCGGTGCTTCCACCGGGAGGCATCATGGTGAACATATCCCAACTGGCGATAGCGGAGAGATGCGAAAAGCGGGAGAGGCGCTGGAAAGTGCGGGTGAGTTGTTGGTAGTGGCTCTTGTTGGACATTGTTGTGTTCCCGTCGTGTTCTGCGAATGTGAGGAGTTTACCCTGATTCATACATGTTTATGCGCTGATGGCAACGATGGCAGAAATGTTGGTAGGTTGTAAGTGATAATGATACTTCTTATCATAAATGGCGGTGTCGTGAAGCACACTGGTGAAATATTAACAGGGAACAGGGATGCAACAGGAAATCGATAACGCCTATATTGTCAGGACAAAAGCAGTGTCAGGCTGCTTTTCTATGTTTATGCTGATGTTTGGTATTACATTTACGCCACTGTTTTTTACCCGCAGTGCGCAATTAATGGAAAGTGGGTTACTTTTACCGCTAATGTTTGTGCTGGAGTTTCTTATTCTGATCCCGCTGTATTATCTCTTTTTTAGAAAAAGAGACGGTCTGGGAAAAGGCACATTGAGCGCAAAGTGGTTTGTTATTTTGTTTGGCGCCATTCTGATCATCCAGTTTCTTTTACCTGCTATGTTGGGGATGAGAAAAACAGAAGCGTGGGTCATGACGCAGGTTTCATTGCATAACTACGCTTTCTGGCTGACCAACCTGTCACTGATTTTCCTCGTCCCGGTCTACGAGGAAATCGTCTTCAGAGGGTGTCTGTTCAACGCCTTTCAGTACTGGTTTAATGATAAGGTGTGGGCGACGTCGCTTGTCGTTTCAACGCTGTTTGCGCTGATGCATACGCAGTACGCCGATATCAGAACATTATTGATGTTATTTTTAATATCGCAGGTGCTTATTGTCGCCAGAGTCAAATCTAAAGGGTTATTAATGCCTGTTACCCTGCATATATTAATGAATGCAACGGTGATCGGGATACAATATGGCGTTCAGGTCTTATTATCCTCCGGGTGATATTTACGGCATTCGGAAGCGATATCGTCAGTGTTGCCCACCATTACAGGTAGTTTTATCTTCCGGTCTCGCTTTTGGCTATCGCGCGGGGTAAAGTCAAACCTCAACAATAAAAATAAATACACACAGCATGATGAATCCAATTATCTGGGTTATCTTTGCTCTTCTGGCACTGGATGCCGTCAGGGAATTGATGGGGGCTTCTTCCATCCTGGGATTGTGGTAATCCAAAGCGCTGGTTGAGCTGTAAAGAATGCCCGGGCTCGACCCGGGCCTTCTCTTTATCAAACGTGCAGACGATGATGCAGGCGGGAGCCAACCAGCAGCGCCAGAAGCAGCATCAGCGCGATAAAACCGCCCACACCGTTCCAGCCATAGCTGTGCCAGAATACCCCGCCTAACGTCCCGGCAATGCTGGAACCCAGGTAATAGCTGAAGAGATACAGGGAAGATGCCTGGCCTTTGGCACGTTTCGCGCGTGGGCCAATCCAGCTGCTTGCCACCGAATGCGCGGCAAAGAATCCGGCGGAGAAGAGCAGCATTCCGGCGAAAATCAGCCATAGCGACGTGAACAGCGTCATTAACAGCCCGAACAACATCACGCCGGTTGAGAACAACATCACCGGGCCGCGACCATAACGCGTCGTCATGGCGCCTGCTTTAGGCGAGCTCCAGGTGCCCGTCAGGTAAGCCACCGATAAAAGCCCCACAATGGCCTGGCTGAGTTCCCAGGGCGAAAGCATCAACCGATAGCCGATATAGTTAAATAGCGTGACAAACGACCCCATTAACAAAAAGCCTTCTGCGAATAACAGCGGCAGACCCCTGTCGCGCCAGTGAAGGCGGAAGTTGATGAACAGCGTTTTGGGGCGTAATGAGGTCGGTCGGAAATGGCGGGATTCCGGCAAGATTTTCCAGAACATCAATGCGGACGCCAGCGCGAAACAGCCGATCGCCGCCAGTGCGATGCGCCAGTTAAAAAAAGTCAGTAAATACGCCGCTGATCAGACGCCCGCTCATGCCGCCAATCGAGTTTCCGCTAATGTACAACCCCATTGAGAAGGCGACGAAACTGGGATGGATCTCTTCACTTAAATAGGTCATGCCTACCGCGGCGACGCCGCTCAACGACAGGCCAATCAGCGCGCGCATGATTAAAATGCCATGCCAACTGGTCATCATGGTGGAAAGCAGCGTACAGCAGGAGGCCAGCAGTAGCGCCGTCACCATTACAGGTTTGCGGCCGATCGCGTCGGACAACGGGCCGGTAAACAGCAGCCCGATGGCTAACATGCCGGTGGAGATGGAAAGCGAGATACTGCTGCTGGCGGGAGAGACGCCAAATTCATGGGACAGCACAGGCAGTATCGGTTGTACGCAATAGAGAAGGGCGAACGTCGCCAGCCCGGCGGAGAATAGCGCCAGCGTAACGCGCATAAACTGGGATGTACCACGTTTTATATATTGAACCGGCTGGGAAATAATGTGTTCATCAACATCGCTCGCCGTTGCGGTGTCGACAGCAGTAATACGACTCACTTGAAATCCTTGCTCAACATCCCCGTGATTATGGTGACGGGTATGACCTCAGTGCTAAGAGTAGGAAAATCAAAATATTCTGTCTAATATATTAATAATCTCAAATAATATTTATAAAATATGAATATTGAACTTCGTCATCTACGTTACTTTGTGGCGGTTGCGGAAGAGTTGCATTTTGGCCGGGCGGCTGCGCGGCTGAATATCTCTCAACCCCCGCTAAGCCAGCAGATCCAGACCCTTGAGCAGCAGATTGGCGCGCGGCTGCTGGCGCGAACTAACCGGAGCGTTGCGCTGACGGCGGCGGGTAAGCAGTTTCTGGCAGACAGTCGGCAGATCCTTGGGCTGGTGAATGAGGCCGCCGCCCGCGCGGAACGCTTGCATCAGGGGGAGACCGGCGAGCTGCGCATTGGCTTCACCTCATCCGCGCCGTTTATTAAAGCCGTCTCGGATACGCTGTCTCTGTTTCGCCAGGACTACCCGGATGTGCATATGCAGACCCGTGAAATGAACACTCGCGAGCAGATTGCGCCGCTTAATGAAGGCGCGTTGGATATGGGGTTACTGCGCAATACCTCATTGCCGGATACGCTGGACTGGGAAGTGATTCTGCATGAACCGTTGCTGGCGATGATCCCCCGTGAACATCCGCTGGCGCAAAAGCCGGTCGTGACGCTGGCGGAGCTGGCGAAAGAACCGTTTGTCTTTTTTGATCCGCATGTCGGAACGGGGTTGTATGACGATATTCTCGGCCTGATGCGCCGTTACCATCTCACTCCGGCGATCACTCAGGAGGTTGGAGAGGCGATGACGATTATCGGGCTGGTAGCCGCAGGACTGGGGGTGTCGATCCTGCCCGCCTCGTTTAAACGGGTACAGCTTGATGAAATGCGTTGGGTGCCCATTGCCGAAGCGGACGCCGTGTCGGAAATGTGGCTGGTATGGCCGAAACATCATGAACAAAGTCACGCTGCGCAACGATTTCACCATCAACTTTTAACGGCCGCAAAGCGCGCTTAATTTAGTTTAAAAAGGGATGAAAATGTGCAGTAAATCACACGGCTAAGTAAAAAGTTGACGACAACTATTGAAGTCATTCACCATAGCCCACAGATTATTTCGGAGCGCGAAAATAAAGGGAGTCAGAGGTGGTTGCTGATAGTCAGCCAGGGCATATCGATCAAATCAAGCAGACCAATGCTGGCGCGGTATATCGCCTGATTGATCAGCTGGGCCCCGTGTCGCGAATTGATCTTTCTCGTCTGGCGCAACTGGCGCCTGCCAGTATCACTAAAATTGTTCGCGAAATGCTCGAAGCGCATTTAGTTCAGGAGCTGGAGATCAAAGAGGCGGGAAGCCGAGGTCGCCCGGCGGTCGGGTTGGTGGTGGAAACCGAGGCCTGGCACTATTTGTCTATTCGTATCAGCCGTGGCGAGATTTTTCTCGCCCTGCGTGACCTGAGCAGCAAGCTGGTGGTGGAGGAGTGCCTTGAGCTACCGCTGGTTTCTGAGACTCCGCTGCTTGAACGCATCATTTTTATGGTCGATCAGTTTTTTATCCGTCACCAGCAAAAACTCGAACGCCTGACCTCTATCGCCATCACCTTACCCGGTATTATTGATACTGAGAATGGCATTGTGCACCGCATGCCGTTTTACGACGATGTCAAAGAGATGCCGCTGGGGGAAACCCTCGAACACCACACCGGCGTACCGGTCTATATTCAGCATGATATCAGCGCCTGGACGATGGCCGAGGCGCTTTTTGGCGCATCGCGCGGCGCGCGAGATGTTATTCAGGTGGTGATTGATCACAACGTTGGGGCTGGCGTGATTACCGACGGCCACTTACTGCATGCGGGCAGCAGCAGTCTGGTGGAGATCGGCCATACGCAGGTCGACCCCTACGGTAAACGTTGTTACTGCGGGAACCACGGCTGTCTGGAAACGATCGCCAGCGTGGACAGCGTGCTTGAACTGGCGCAACTGCGACTCAATCAGTCAATGAGTTCTTCGTTGCACGGGCAGCCGCTGACGGTGGATTCGCTGTGCCAGGCGGCGGTGCAGGGGGATTTGCTGGCGAAAGATATCATTAGCGGCGTCGGGATGCATGTTGGCCGCATTCTGGCGATCATGGTGAATTTATTTAATCCGCAAAAAATTCTGATCGGTTCGCCGCTGAGCAAAGCGGCGGAAATCCTTTTTCCTGCGATTGCGGACAGCATCCGTCAACAGGCGCTGCCGGCGTATAGCAAACATATCGCCGTTGAAAGCACGCAGTTTTCCAATCAAGGGACGATGGCGGGCGCCGCACTGGTAAAAGACGCGATGTATAACGGTTCTTTGTTGATTCGTCTATTACAGGGTTAACATTTTTTAACTGTCGTACGAAAATTTGCGCTAACGCAAGCTGATCGCGATTGTCATACCTTAGACTTTCTCCACTGTATTATTTTCCTGGCTTATATTTTCGAAGCATAACGGTGGAGTTAGTGATGCTGAAGCGTTTCTTTATTACAGGTACAGACACTTCTGTTGGGAAGACAGTGGTTTCCCGCGCATTACTACAAGCGTTAGCGTCAGGGAACAAAAGCGTCGCAGGTTATAAACCCGTTGCGAAAGGCAGCAAAGAGACGCCGGAAGGCATGCGCAATAAAGATGCGCTGGTGTTGCAGAGCGTCTCTTCTATTGAGCTGCCTTATGAGGCGATCAATCCGATTGCGCTGAGCGAAGACGAAAGCAGCGTGGCGCATAGCTGTCCGATCAATTACACCCTGTTATCTAACGGGCTTGCCAGCCTGAGCGAAAAGGTTGACCACGTCGTGGTGGAAGGCACCGGCGGCTGGCGCAGCCTGATGAACGACCTGCGTCCGTTATCTGAATGGGTCGTGCAGGAGCAGTTACCGGTGTTGATGGTGGTGGGGATTCAGGAAGGCTGCATTAACCATGCGCTGCTGACCGCACAGGCTATCGCCAATGACGGATTACCGCTGATCGGCTGGGTGGCGAACCGCATCAATCCCGGACTGGCGCATTATGCCGAAATCATTGATGTGTTAAGCAAAAAACTGCCTGCGCCGCTGATTGGCGAACTGCCTTATTTGCCGCGTGCCGAACAACGCGAGCTGGGGCAATATATCCGTCTGTCAATGCTCGGCAGTGTGCTGTCGGTAGACCGAGTCCTCGCGTAACGTTCGCGATAAGACCGACGCCACTACGCAGGCAATCAGTAATCCGGGGAGTAGTCGATACTCCCCGGTCATTTCACAGATCATCAGCGTCGACATCATCGGCGCGTGTGTTGTTGCCGCCAGCAGCGTCGCCATTCCTGTTAATCCCAGCAATATTGCCATCTCATCCGTTCCCGGAAGCCAGAATCCCCACATCCGACCGTATAACATACCAATCGACAAGCCGATGAACAGCGTGGGGGTAAAGACGCCGCCGGGCGCACCGGAACCGCTGCTGGCGAGTACGGCGAGTAATTTGCAGATGAAGATCCCGGCGATCACGGAGAGCAGCGGCGGGGAGAGCAAAAATGACTGCACGACGCTGTAGCCATTTCCCCAGACGGCCGGCGTCAGCAGGGATAGTAAGCCAACGATGAATCCGCCCAGCGCCAGTTGCCACGGAGGCGATAATTTCAGCCGAATAAAACCGTTATGAGTGGTGGTCATTAACCACATAAACAGCGGCCCGCAGACGCCAGCCACCAGCCCGGTGCTGATAATCATCGCATATTCCCGCACATGTAAATCCAGTGACAGGTGAACCGTATAGAGCAGGGCATTACCGCCGCTGAGCAAATGGGTGGTCAGTAATGCCACGACGGCGGAAATCACAACGGGGCCCAGCGACGCCAGCATCAGCGTGCCAAACAGGATTTCCGCGATAAAGAGGCTGCCCGCCAGCGGGGCGTGATACGCGCTGGCCATCCCTGCCGCAGCGCCGCAGGCGATCCACAATTTCCACTCTTCACGCGGGGTGCAACGCTGAGCAAAACAGGATGCCGCCAGCGCTGCCAGTAAGATCATGGCCCCTTCGCGCCCGATGGCGCTACCGCTGGCGACGACCAGCAGCGAAGCAAGGGACTTCACCAGACTTGCCCCGTAATCGAATTGCCCATCTGTTTGCAGGGCTTCCATGTAGTCGGTGGGGGCATGGGGGCGTTGTTGATTCATTTTATGCCAGCCCCAGAGCAGCGCACCTGCCGCGAGGCCGCCCACGGCGGGGGTGATCAGCCGTCGCCAGGGCGAAAGGTTTGTCGCGGCATTCACCAGGCTTCCCGTATCGTTACGTAAGAAAAGCCATTCCAGCACCAGCATGGCGTGACGAAACCCGGCTACCGCCAGTGCGGCAAGGATGCCAATTATTGTCGCGATTAACAGGCGGCGAAACATCGCTCGCAGGTCAGGGTATGTATGTAGGCGGTGCATGGGCAGACCACACAGGCATTTTCAGGAGTGCCTATTTTGCGGGTAAACAGCGCGGGACGCAAAGGCACTGCGTGCGCAGGGGGAACTGCCCGATGGCGCTTCGCTTACCGGCCCGACAAGAATGTGCTCCGTAGGCCGGGTAAAGCGTAGCTGCCATTCGGCGTTTAGCTACTTTCGTGAATATTCAGCGCCCGACGGGTACGGCCTGAACTGAGATAATCGGCAATGTAATCCTGCGAGATTTCGCCGTTATAGCGTCCTTCCTCATCGACAATAGGCATCCAGCTGGTGTTACTCTCGTACAGTTTCGACAGCACAATACGCAGGTTGTCTTCCGCTTTTCCGGTGACGCGAAACGGGTGGATTATATCCGCGCAGGTGCCGTTGGCGTTTCTGGCTTCACGACGCTTCACAAAGCCAAGCGGCTTGCCGTCGCTGTCGACAACGGTGATCGCGCGAATATCATTATCGTCCATGATGCCGAACGCCTCTGCCAGCGAGGTGGCGGTGCGGGCGGTCAGGGTGGGTTGTTGGTCGGTTACGTCGCCCGCCGAAACCAGCAGAAGACGTTTTAGCGTACGATCCTGACCGACAAATGAGCCGACGAATTCATTCGCCGGTTTTGCCAGCAATTCATCCGGGCTGGCGCACTGTATAATGCGGCCCTGGCGGAATACGGCAATCCTGTCGCCCAGCTTCAGCGCTTCATCGATATCATGGCTGACCAGCATCACCGTCTTTTTCAGCGCTCGCTGCATATCAAGAAACTGATTCTGAATCACCTCGCGGTTAATCGGGTCCACCGCACCGAAAGGTTCATCCATCAACAGCACCGGGGGATCGGCCGCCAGCGCGCGGATCACGCCGATGCGCTGTTGTTGACCGCCAGACATCTCTTTCGGGTAGCGATGCAGAAACTTTTTCGCATCCATTGCCACCATGTCCATTAACTCTTCCGCGCGACGTTTGCAGCGGGTTTTATCCCAGCCCAGCATACGCGGGACAACCGTGATGTTCTCTTCGATGGTCATGTTGGGAAACAGGCCAATTTGCTGGATCACGTAGCCAATATTGCGCCGTAGCGTGACGGTATCCATCGCACTGGTGTTTTGTCCATTAATGAAAATATTGCCGCTGCTGGGCGCGATCAGCCGGTTGATCATCTTCAGGGTGGTGGTCTTGCCGCAGCCTGACGGCCCAAGCAGCACGCACATTTCACCTTCCGGCACGTTCAGATTGATGTTATCAACGGCGTTAAGCGGTTGGCCGTTCTTCTGTGAAAAGCGTTTGGTGAGGTTTTCCAGTTTTATCATTATCGTATCCCCTTAGGTGTCAGTACGACTTGCAGGCGATGCAGCAACCAGTCGAGCACAATGGCCAACAGACAAATCATCAGCGCACCGGCGATTAACATGCGGATGTCACTCCCGCCGATGCCGTTCAGTAACAGTAACCCCAGGCCGCCAGCGCCAATGACGGCGGCGATGGCCATCACGCCGATATTCATGACGACGGCAGTACGAATCCCGCCAAAAATCACCGGCAGGGCCATCGGAATTTCTACCCAGCGCAAACGCTGCCAGAACGTCATGCCAATTCCGCGTCCCGCTTCGCGCAAGCCCGGCGGTAAACTGTCGAGCGCGGTGTGCGTGTTGCGCACAATCGGCAGCAACGAGTAAAGAAAAACGGCGGTAATTGCGGGAAGGGCGCCAATCCCCTGACCGATCAGCGAAAACAGCGGGATCATCAAACCAAATAATGCGATGGACGGGATAGTGAGCAGCAGCGTCGCCGCCCCTAACACCGGCGTAGCCAGCCATTTATGACGAACAATCAGAATGCCCAGCGGCACGCCAATGATAATCGCCAGACCGACGGCCAGCGCAACCAGCCACAAATGCTGAAGAGTGAGGCTCGTCAGATAACCCGCGTTATCCATCATGTAATGAATCGTATCCATAAGCGCTCCTTACAGCAGACCTTTTTGTTGCAGGAATGTGCGCGCAACCTGCTGGGGTGACTGGTGATCGATGTCGACCTGCGCATTAAGCGTCGAGATGACGTCGTTATTGAGCAGCCCGGAAAGGGTATTCAGCGCATCATCCAGCCCCGGATGGGCATCCAGTATCTCTTTACGGACGACAGGCGTGACAGCGTAACCTGGGAAGAAGCCTTTATCATCTTCCAGCACCTTGAGGTCAAAACCCTTCACCCGGCCATCCGTGGTGTACACCAGACCGGCGTCCACAAAGCCGTCGCGCACGGCGTTATAGACCAGACCCGGGTCCATTTGCCGAATCTGCGGACGGTCAAGCTGCATCTGGTAGGCATCCTGCAACGGCTTCATCCCGTCGCTGCGCCCGGAAAACTCAAGGTCGAGACCTAACATCCAGTTATTGTCAGGGTCGGTCTGGCGGATGTGTTCGATTTTCGCCACCATTTCCGACAGGGTGTTGATGTTCTCTGCTTCGGCGCGTTTGCGCTGCATGGCGAACGCGTAGGTGTTATTCATATTCGCCGGGTTTAGCCATACCAGTCCGAGCTTCGCATCCAGGCGTTTGACGGTATCGTAGGATTCCTGCGGGCTCATCCGTTTATCAATGTGGTTAAAAATGATCAGGGACGTGCCGGTGTATTCCCAGGTGATATCAATCTGCTTATTCACCATGGCATTACGTGAAATCACCGCCGCAATGTTGGTGCGCGGCTGAACCTGAAAACCCTTTTTTTGCAGGTACTGCACGGTCATGGCGGAAAGGATGTGTTGTTCGGTAAAACCTTTGGTTGCAAGGATCAACGGGGCCGCCTGAGCCTGTCCGGTAACGAGCAACGCGGCGGCAAGCCATCCCAGCAGATGTTTTTTTAATGTCATTCAGCGCTCCTTATTCTTATCGTTGTATCAGGCCGTATGCGGACTGAGGATGCGGCCAAGCAGGGCAAGCAGGCTGTCGAGAATCAGCGCCACCAGAGCGGTTGCCGTTGCGCCCAGAATCAGCGTCGGGAAGTCATTGAGGTAAATACCGGGGAAAATCAGCTCGCCGTAGCTGCTGGCCCCAATCAGGAACGCCAGCGGGGCCGTACCCACATTAATCGCCGTCGCGATCCGAATCCCTGCAAGCATCACCGGCCAGGCATTGGGCAGCTCAACCTGACGCAGACGCTGCCATTTCGTCATACCGATGCCGTTTGCGGCTTCAATCAGGGAGGCGGGAACAGAACAAAGCCCGGCGTAGGTATTGCGCACGACAGGAAGAAGCGAGGCGAGGAACAGCGCGAAAATCGCCGGAACGTCGCCAATGCCGATAATCACCATCGCCAGTGCCAGTACGGCCAGAGGCGGCAGAGTATTGCCTACATTAAAGAGCTGCATTACGTATTCGGCGAATCCTTTGGCTGCCGGACGGCTCAGTAAAATGCCGCTGGGAATACCGATAACTAAGGCGAAGAACATGGAGGTAAAGACGAGGAGTAAATGCTGTTTGCCCAAATAAAGCAGATCAACCTGACGCGCTTTAATGGTTTCCAGACCCACTCCCCAGACCAGCAGCGCCAGCACAAGAGCGATCGCGCCAGTAAACCCGAGCACGCGTTTTAACGTAAAGGTGTGCATTGCGGTGTGTCTCCCTGTTTGCATGCGTTATGGCAACGAAAGAGACGTTGCGTGTTGTTATGCCATGTAGCGGCAGGGTGTTTTGAGCTATAGCAACACGTCGGGAAGGATTCCAGCGAGACAGGTATTTAAATTAGCAGGCTATGCATGCCCGTTGCTCATTTGCCCTTACAGGGCGAAGGATGTGCGCGAATGTGAAGAATTGTCTTAAAAATGGAAAGTTAAAGTGACGTTGTCACTTATCGAGAAGAGGAAACAACCGGTTTTTGACGATAAAGGCAGGGCGTACCCTGCCTGAAAAAGCGTTAGCGAAACAGTGGTTTTACGGCGACGGGGATCAGCAACTGAGACTGCCATTGCGCCAGCGTTTGCTGTGCCAGCTCACCTGCCGCCTGGTAAAATGGATGGTTTGCGCCTTCGATGAAAACCGCCAGGAAGCGCGCAGACCACGGCAACAGATGCCAGGCCAGCAGTTGTTCGCATTCGCTATGACGGCCATTTTCCGCAAGCCAGGCAGCCAACAGCAACAGGGAACCAAAATGATCTTCCGGTTCATTTTGCTGGATCTCAAACTGGATGCCGTTGTCCCGCATCCACTGGCGTAATGCCAGCGTAGACTCGCCAAACAACACGCTTTCACGGTCCAGCCAGACCGATCCCCAGGGAGGCGACGGAAGCGCCCAGGGACCGACAAACAGACGTTGCCAGGCTTGCGGGAGCGGCTCCTCGATGTCGGCCTGAAAAACCGCCGCAATTGGCGCCAGAACCTCCGGTGAAAGTGGCCACTGTGCCTGCCAGCCGTCGGTTTTTAACGCTGAAACCAACGGGGCGGCTTCGGTACTGTCTGGCGCATAATAAAACAACGCGCCCAGCACGCGTGCAGCCATAGAAAAATCGTCACGTTGTGAAAAAGAAGTCATGAAACCATCCTGAAAGGTGCGGGTTTCCCCGCACGCGTAAGTTAACCTGCAACGGCCATTCCGACGGTCATATGCAGGCCGTAAAACAATCCGCGACCTATGATTTCGCCGCCAAGCACCAGGATTAACCCCAGCAACAATCCGGCGACGTGCGGTTCTTTGCGGCGCACTAACGGGCATATCCAGCAACCCAGCCCGGCAGCAAGCAGCACTACGCGCCAGACCTGTAAGCGGCCATAGTCTGGAACCAGCGCGCTCGCCTGTTGCACGGAGCTGTGGATCGCCCCAAGCGACATGCCTTGCATGATGATTACGGCCGCGCAGACTAACAGTGCCAGCACGCTGATGCTTGCGAACGTCGTGCCGTTGAATCTGGCGCGCGCGGCGCGCAATATCAGCGCCGCAAATAGCGGCCCGCTCAGGAACACCGTCAGGAAGAATGCCAGCGTCGTGTAGCCATTATACCAGGTAGGAACGGTATCGATCTGATAGACCCGCGTCATTGCCCAGACAAATACCACCCCGAGTACCATGCTGACCAGCAGCCAGATTTTGCCCAGTGCTTCCGGCATTTTGCCGAGCACAGCCACCAGCCACCAGAAACCGCCTACGGCAAAAAAGACAGAGCCAGCGGCAATTTCATTACTCAGCGCGGAAGCGCCCACCCGGTTAAGCGAGTTAAACGCGCGGAACGGCGAGCCGAGGTGCATGATCGACGCCAGAAAACCCAGCCCCATGACGACCCATAAAAAGAACATACCGCGCACAATACGCTGTCTGCTGGCCGCATCGTCTTTCGCCGCAAACCAGCCCAGACCGCTGACAATCAATGCGCCGACAACGCATTGACCGAACACCGTAAACAGCACCAGCGGCCATTCATGCCATCCATTTCCCATTTTATACCTCCTTCGGATTTGCCAGATAACCCGTGGTATCCCCGGTCGGGCGGCTGTTGGCGTTGGGTTTAATCACAATGTTCGGTCTGGTGAAGTGCGCGCCCGGCAGCGGCGCAACGGCCGCCAGCGTGCCATGTTTCTTCCGCAGCTCTTCTATCGGGCCAAAGTCCAGCGCGCGCAGCGGGCAGGACTCGACGCAGATCGGCTTTTTGCCGTCGGCAACGCGGTCATGACAGCCGTCGCATTTGGTCATGTGGCCTTTGGCGGCGTTGTACTGCGGCGCACCGTACGGGCAGGCCATATGGCAGTAGCGACAGCCGATGCAGACATCCTCATCCACCACCACAAAACCGTCCTCGCGTTTGTGCATCGCGCCGCTCGGGCACACTTTCGTGCAGGCCGGGTCTTCGCAGTGGTTGCAGGAAATGGAAAGATAGTAGGCAAAGACGTTTCTGGTGCCAGACGCCGTTATCCTCCTGCCAGTCGCCGCCCGCATATTCATAGATGCGGCGGAAGCTGACGTCCGGGGTTAAATCTTTGTAGTCTTTGCAGGCCAGTTCGCAGGTTTTGCAACCGGTGCAACGACTGGAGTCAATAAAAAATCCATATTGGGTTGTCATCGGTTACTCCTTATACCTTCTCAACCTGAACGAGATTGCTGTGAGACGGGTTCCCCTTCGCCAGCGGAGAAGGACGATGTGATGTCAGAATGTTGATGGAACCGCCGTGATCGACCTGGTCGCCAAACATATCCGCCTTGAGCCAGGCGCCTTGCCCGATGGCGGTAACGCCCGGCAAAATGCGCGGCGTGACTTTTGCCGGGATCAGCATCTGCCCGTTATTGTTAAATACGCGCACCGTGTCACCCTGTTGGATGTTGCGCGCTTTCGCGTCAATGGGGTTAATCCAGATCTCTTGCGGGCACGCTTGCTGCAACACGTCAATATTGCCGTAGCTGGAGTGGGTGCGGGCTTTATAGTGGAAGCCGGTCAATTGCAGCGGATAGGTTATTCGGATGGGATCGTCCCAGCCATCAAACCCTGGCGCATAGGCGGGCAGCGGATGAACGATCTCATCGTTTTCCAGCTCCCAGGTGTCGGCGATGGTCGCCAGCCGCTCGGAGTAGATTTCAATCTTCCCTGATGGCGTTTTCAGCGGACTGGAGGCAGGATCTTCGCGGAATGCGCGGAAGGCGACATAGTGTTCCTCCGGGCATTTTTTCTTGAAGATGCCGGTGGTTTTCATCTCTTCGTAGTCCGGCATTTCCGGGTTACGCGCTTTGGTTTTGGCATGCAGGTATTTCACCCACTCGTGCTGGGTACGGCCTTCCGTAAATGTCTGGTAGACGTCCGGGCCGAGGCGTCTGGCGATTTCGCTCAGCGTCCAGTAGATCGGCTTTCTTTCAAATTTTGGTGAGGTGGCCGGTTGCCCCAGGATGACATAACCCATGGTTGCCTGCGGATTCGTGCGAAATAAGATCTTCCTGTTCCGTTGGCATCAGATCCGGCAGCAGAATGTCGCAGTATTTGGCGGACGCCGTCATGAAGTGCTCAATCCCGACGATCATTTCGCACTTGCTGTCGTCCTGTAATACGTCATGGGTGTGGGCGATGTTGCCATGCTGGTTAATCAGCGTATTGCTGGCGTAGCACCACATAAATTTGATCGGGACATCAAGCTTATCTTTACCGCGCACGCCGTCGCGTGTGGCGGTCATTTCCGCGCCATGATCGATGGCGTCTGTCCAGGTAAAAACGGAGATCTGCGTTTTTACCGGGTTATCCAGCATAGAAAACCACTCTACGCCGAGGTCCCAGGTGCCTTCACGCACGCCGGAGTTGCCGCCGTTGATCCCGACGTTGCCGGTCAGGATAGAGAGCATGGCGATCGCGCGCGCAGTTTGTTCGCCATTGGAGTGACGCTGCGGCCCCCAGCCCTGACAGATATATGCCGGTTTTGCCGAGCCGATTTCCCGTGCCAGCTGGATGATTTTTTCTGCCGGGATACTGGTGATATGCGCGGCCCATTCCGGTGTTTTGGCAATACCGTCCGGGCCATCGCCCAGAATATAGGCTTTATAGTGCGCATTACGCGGCGCGCTGGCGGGTAAGGTCGTTTCGTCGTAACCGACGCAGTATTTATCGAGGAAAGGCTTGTCGATCAGATTTTCGGTGATGAGTACCCAGGCGATCCCCGCCGCCAGCGCGGCATCAGTACCGGGACGAATAGGGAGCCATTCATCTTCGCGACCCGCTGCCGTATCGTTATAACGCGGATCGATGACGATCATTCTGGCGTTGGAGCGTTCGCGAGCCTGTTCCACATAGTAGGTCACGCCGCCGCCGCTCATGCGCGTTTCTGCCGGGTTGTTGCCGAACATTACCACCAGTTTACTGTTGGCGATGTCATCCGGGCTGTTGCCGTCATTCGTGCCGAACATATAACTCATTGCGGCGCTGATTTGTGCGGTACTGTAGCTGCCGTAACGGGCTGAGATAACCACCACATGAGTTCATCAGACGATAAGGCACGTTGGAGTTCGTGATGTTCCCGCCATCCACGCCGGTGCCATACAGCACATGTACGGCTTCGTTGCCGTAGTCTTTCAGAATACGCTTCAGGTTGTCGGCAATGGTATCCAGCGCTTCATCCCAGCTGATACGTTCAAATTTGCCTTCGCCGCGTTTACCGACGCGTTTCATCGGGTATTTAAGGCGATCCGGGTGATTCATACGGCGACGGATCGAGCGTCCGCGCAGGCAGGCCCGCACCTGATGATTGCCGTAGACGTCATCGCCGGTGGTGTCGGACTCGACCCAGTACACCGCATCATCTTTCACATGCAGCCGTAGCAGACAGCGGCTCCCGCAGTTGACGGTACAGGAACTCCAGACCGCTTTTTCTTCCACTGGCGCAGGCGAAAGGCCGTCAGCGGCGCGGGCGATTCGGGAGAAAGGAAGCGTGAAGGCGCTACTGGCGAGCGCGAGACCACTAATGGCGGATGTTTTCACCAGACTACGACGGCTGATAGTGGCCGTCATGAGCGCCTCAGGCGTGGTGATTTTCATAGTTACTCACTTTGCTTATTGATTAAAAAAAAGAAACGCGATTTATAAAATTTGTTTTTACAGGACGTTGGTTTGTTCACATGACCGATGGACGGGCGAACACATCCCCCGAAACGCCTGTAATCGTTTCAGGGGATGGGGGTTACACTTTCTCGATTTCGACCAGATTGGTGTGCTGCGGATTTCCTTTCGCCAGCGGAGAGGGGCGCTGCGTCGTTAACGTATTCACGCAGGCGCCATGATCGATGCGATCGCCTGACATATCGGCATCGTGCCAGGCGCCCTGGCCCATCGCGCTCACGCCGGGGAGAATGCGTGGCGTAACTTTCGCCGGAATGCGGACTTCGCCGCGATCGTTAAATACCCGCACCATATCCCCGTTGGCGATACCGCGTTTTTGCGCATCCACGGGGTTAATCCAGACTTCCTGACGGCAGGCGGATTTGAGAACATCAATGTTGCCGTAAGAGGAGTGCGTGCGTGATTTATAATGAAAACCAAAGAGTTGCAGTGGGAATACGCTGCGCTTAGGATCGTCCCATCCTTCGAACGTTGAGGCATAAATGGGAAGCGGGCTGATGACGTCGCCTTTTTCCAGCTCCCAGGTTTGGGCGATTTTAGCCAGGCGGCTGGAGTAAATTTCTATCTTGCCGGACGGCGTTTTCAGCGGGTTTGCCTGTGGGTCTTCACGGAATTTTTTATAGGCCACGAAATGGCCGTTGGGATCTTTACGCTTGTAGATCCCCATTTTTTTCAGCTCGTCATAGCCTGGTAACTCCGGGTCTTTAGCCAGCATTTTGGCGTACAGATATTGCAGCCACTGTTCCTGGGTTCGCCCCTCGGTAAACCTCTGGTGCACGTCCGGGCCCAGCCGCTTCGCGACTTCGCTCATTATCCAGTAAATGGGCTTACGCTCGAATTTAGGGGCGGTCGCGGGCTGGATGAAGAATTAAATAACCCATATTGCCCGCATAGTCGTTAGGAATGATGTCTTCCTGTTCTACCGTCATCAGGTCCGGCAACAGAATATCGGCGTATTTCGCGGAAGAGGTCATAAAGTTGTCGATGACCACGATCATTTCGCACTTCGATTCGTCTTGCAGGATCGCGTGTGTTTTATTGATATCCGAATGCTGGTTGGTGATGGTATTGCCTGCGTAATTCCAGATGAACTTGATCGGTACATCCAGCTTTTCTTTACCGCGCACGCCGTCGCGCAGGGCGGTCATTTCCGGGCCGCGGGCAATGGCGTCCGTCCAGGTAAACACGGAGATCTGCGTTTTTACCGGGTTTTCCGGTAACGGCATGCGCTCAATGGTGATGGTGTAGGTGGATTCGCGCGCGCCGCTGTTCCCGCCGTGAATACCGACATTGCCCGTCAGAATCGGCAGCATCGCAATCGCGCGTGCGGTGAGTTCACCGTTCGCCTGACGCTGCGGGCCCCATCCCTGACAAATACAGGCCGGTTTAGCGGCGCCAATCTCTCTGGCGAGCTTAATGATGCGATCCGCCGGAATACCGGTAATACGCGACGCCCACTCAGGCGTTTTGGCCGTGCCGTCTTCGCCCTGACCCAGAATATAGGCTTTATAGTGGCCATTCGCTGGCGCATCGGCCGGTAGCGTTTTTTCGTCGTAACCCACGCAGTAGTTGTCGAGGAAAGGCTGGTCGACAAGATCTTCGGTGATGAGCACCCAGGCCAGTCCGGCGACCAGCGCGGCATCGGTGCCGGGACGAATAGGTATCCATTCGTCTTCACGTCCTGCCGCGGTATCGGTATAGCGTGGATCGATAACGATCATGCGCGCATTTGAACGTTCCCTGGCCTGTTCCAGGAACCAGGTAATTCCGCCGCCGCTCATACGGGTTTCTGCCGGGTTATTACCGAACATCACCACCAGTTTGGTGTTCTCGATATCGGAGGTGCTGTTACCCTCGTTCGAACCATAGGTGTAGGGCATGGCCACCGCGATTTGCGCGGTGCTGTAGGTGACTAGGGACAGCTATGTATGGATCAAACAGGGAACCCGGAAACAACCGGAAATTCGTGGGAATAAACGGGAGGAGGCCTTGAGTCGCAAGGCTTGGCGCTACGCCATGTTAGCTAACTGGTAGAAAATTAGCCACGGTGACAATTTTGGATCATTCGATACCAAAGCTAATAAATTGTCATATAGACTTGGATCGCACTGCCGATTATTTAAAAGTGTTTAATGGGATTTAAAATCGGCTTAAAACGTTATTACATGGAGGTACAAAACTAAAGCCAGCCAATTGCGCATACGAGGAATCAGCCAGCTTTGCTTTGCGTGCTTACACTCGACGCTGGGCTAACCTGTTCATCGGAATTACGTTGTTCGAAAATCTCTTTCCGCTGCACCTGATTTGAGTCGGATAGCTCAAGCACCGCAGGCATAAGCATCTTTAAGCCGCCAGTGATAAATATTTCAACAACAGCTTCACGCTGGGCTTCTGTCATGCCGTGATAAATCATCATCCAGAGGGCTTTCTTCTCATCATTTGCTAAACCACTTTTTGCCTCTCCAAGCCCTTCAGATGGCTGTCCTGCGTCGCTATATCCAAGATAATTTTTGGTCCGTGCAGGCAGTATTGATATGTGGTATTCCGTAGCTTTAGTTCCGCCCCGTTTCCGCCGAACTCCCTCCAACCCTTCCGAGAGTTTTTCTAACTGTTTACGTACATTGGAGACTCCGCTTGGAAAGCCAGGCATCCCCGCACACTCTTGAGCTGTAAACCAAGTTTGTACATCCATCATCATCTCCTATTGTCGATTTTTTCTTGGTGTATAAAAAATTCGATTCGATTTATTCAGATTCAAAAAGACTGAATTAACTAAAAACAAAGACTTAAATTCGATTTATTTAGAGTTAATCAAATTCAGAAAAGATTGATAAAAAGATTGAATTCGATTTTTTGTGTGTTATATTTTTACACGTAAGGGTGAACTGTACGGTTCACCGCAACGAGTAAATGTTTAAGGATCGCAGAATGATGACCAAAAGCCAAGACTGGCACCCTGAAGACATCAAAGCAGCGATTAGAAAACGCGGAATGACGACCAGTCAGTTATCCCGGAGTCATGGGTTAGCGGAGTCCACATTACGTAATGTGTTCCGCCATCACTGGCCTAAAGGGGAAAAGATTATTGCTGACTTCCTCGGTATGAAGCCATGTGACATCTGGCCTTCGCGTTATCACGATCTGACTGTTAAAGAGGTTGCATGATGGATTTTTGGGTATCAGTAAAAGAATGTATTGGCGTCTGTGGTTTCCCGCAGGCTGAGTCTAACGCCCGTAAAAAATTAGAGGATCTGGTTTGTGGCCGTAGCGAACTGCGCCGCAAACGTGCTGGCACAAAAGCATTTGAATACCACATCTCTGTGTTACCACCGGAAGTCCGTGCTGAGCTGCTGGCTGGCCGTGGTCTGATTGAAACCTCATCCGGTCTGATTACACTGCCGCAGGAGCCTGAGCGTGTGGCGGCTGACGACCTGGAGCGCCAGCGCCTGTGGTCTGCATGGGAGAAAGCTACCGGTGAACAGCGTCTGCATGCAGAACGCCGCACTAAAGCCGCCGCGCTGGTGGCTGAACTGATGGCTTCCGGCGTCGGCAATCGCAAGGCGATTGCTCTGGCTGCGAAACAACTGCAAATCAGTGAAGGTACGCTGCGCAACCTGTACTACAAAGTGAAAGACCATAGCCCCGACCTCTGGGGACCCGTGTTGCTTGATCGCCGTGTACGCGAAAAACGCCAGACCGGACGGACAGCGGATATCTCTGAGGAAGCATGGCAGTTTTTCCTGGGTGATTACCTGCGCAATGAAGCGCCGTTCTTTTCCAAGTGCTATGAACGCTTGGAAAAAGCCGCTGACGCGCATGGCTGGATTATTCCCGCAGAGCGCACTCTACGCCGTAAGCTGGAGCGTGAAGTTGACCCGCGTATCGTTGTCGCCACCCGCGAGGGTGAAAATGCACTGGCCCAGATGTATCCGTCTCAGCAGCGTACCGTTGCACATCTTCACGCGATGGAGTGGATCAACGGCGATGGTTACCAGCACAACGTGTTTGTGCGCTGGTTCAACGGCGAGATTATCAGGCCCAAGACGTGGTTCTGGCAGGACGTCCACAGCCGCAAAATTATCGGCTGGCGTACTGATGTGTCCGAGAACAGCGACAGTATCCGTTTATCCCTGATGGATACCATCCGCGCTTACGGCAAGCCGAAGCATGTGACCATCGACAACACCCGCGCCGCAGCCAATAAGTGGTTGTCCGGTGGTGTGCCTAACCGCTATCGCTTCAAAGTCAAACCTGATGACCCGATGGGGATCATTCCTCTGCTGGGAATGAAGCTGCACTGGACGGGGGTAATTGGTGGTAAAGGCTGGGGACAGGCTAAACCCGTGGAACGTGCTTTCGGGGTGGGTGGTCTGGGTGAGTATATCGACAAACACCCCGGACTGGCGGGTGCATTTGCCGGTGAAAACGTCAGCGCCAAACCGGAGAACTACGGCAGCCGGGCGGTGGATGTGGAAGAGTTCCTGGAAACCATCAGCGAAGGTGTCGCCATGTTCAACGCGAAGACCGAACGCGAAACTGAGATGTGCCGGGGTGAGCTGTCCTTTGACCAGGCATTTGAGCGCAGCTACAGCCAGTCAGTTATCACCCGTATGACAGAAGAGCAAATCCGTCAACTGATGCTGCCAGCGGAAGCGGTTCGTGTGAAACCGACGGGCGAATTCACTATGGAATGCGGCGGTTCGTTGTTTGGCCGCAAAAACACCTACTGGAGTGAACAGCTTGTCAGTCACCGTTCCCACAAAATCACTGTTCGTTTTGATCCGCGCAACCTGCACGGTGAAGTGGCGTGTTATGACCTCGATGGCCGTTTCCTCTGCATGGCTGAATGTCGCGCCGCCGTTGCCTTTGGCGATACAGAGGCCGGACGTGAACATAACCGTGCCCGCCGCGAGATGATCAAAAGTACGAAGAAAGCCACCAAAGCACTGAACCGTATGACGGCGATTGAAGTAAATGACTTGCTACCAAAGACAGAGCATGCGGAATTACCGGAACGGCATGTGGTGGAGCGTGTGTTTACTCTGGGTAACACCGTCAAACGTGTGGAGGACATACAGGAATCACAAAGCGAAAACGACGTTATTTTCCAGCAGTTTGTTAATAAGGCTAAACAGGCGCGGAAATAAAAAAGCGACGTAGTGAGCGCCGCTTTTGAATTAAGTGAAACAGTTTTAATACCTGATTAAGTACAGGCCATTAAAAAAATACAGGATTAATAATCATGACGCAAATTAACCATGATGTTGTGCGTAGTGCCATTCGTGAATTAATTGACAGCAAAGCGATTTCAGGCGCAGCTCTGGCGCGGGAAACCGGCACCTCAACGGCTACGGTCTCTCAGTTTCTGAACGGGAAATACAAGGGAGATAACGATACTGTTGCCGCAAGCCTGAATACCTGGCTGGAAAGTCATAACGCCGCGAAAACCTCGCTGCCGGTGGCTCCGGATTTCGTTGAAACGCCGACCTCACAAAAAATCCTCGCCACCCTGACGTGGGCGCAACTGGCCGGGACAATTGTACTGGTTTACGGCAATCCGGGCGTCGGCAAGACCAAGGCTATCCGGCAGTATGCCGCAGGTGGTAACAACGTCTGGCACATCACCGCCAGCAAGTCCCGCAGTAATGAGCTGGAAACCCTTTACGAACTGGCTCTGAAAATGGGTATCAGCGATGCACCATACCGCCGTGGTGCTCTCTCACGTCTGCTGCGCCAGCGCCTGCCGGACACGCGTGGCCTGATAGTGGTTGATGAAGCCGACTGGCTGAGTCTGGATGCGGTGGAAGAGCTGCGTATTCTTCAGGAGGAATGTGGCGTGGGGCTGGCGCTGGTGGGTAACCACAAGGTTTATGACCGTCTGACGGGCGGTCAGCGCAGCGTGGACTTTGCCCGCCTGTTTTCCCGCGTGTCCAAGAAGTTCGTCATTAACACCGTTTCAGCAGGTGACGTGGACAGCTTCTGCGATGCCTGGCATGTCACCGGCGCTGAAGAACGCAAGCTACTGAAGGCTATTGCCCGTCGTCCCGGAGCGCTGCGTTCCCTGTCCCATATTCTGCCGCTGGCCGGGATTTACGCTCAGGGCAAGGGCGAGACCATCGGCACGGCGCACATCCAGTCCGCCATGCTGGAGCTGGGCCACAACGGTATCAACGAGGACTGACATTATGATCGCCGAACGTATTGCAGAACACATCAGCATGGCAGAAGCGGCGCAGAACTGGCTGCGTGCGCGTGGCAGTCGCGTGACTGACGTTCGGGTGTTTATGCGTCGCCCGATGCTGGAAATTGCCTGTCCGCCAGTGGAGCTGGTTAACAGTGCGGAGCGTATTGCCGAATCACACAACGGCGGCACCCGTTCAGTCTGGGTTGCCAGTCTGGAAGGTTGCCGGATTATCTGGCGTTAAATACAGAGGAAATGAAATGGCAAAAATTATCGCTTATGCGTGGGCTTCAGGACTGATTGAGCTAGGCCCAGAGTTGCCTGATGGAGCCCTGCCGATTATCACCGGCGAGGAAAATAGAATCAGAGATTTAATTAATATCTGGGCCAGACATTCCCGCACCGGTGAACAGCTCCTGGTTCCGGGGGTGCCGGAGGCTCAGAACCAACATGAGGGCTGCAACGCTCTGATGACATTTACAGAAACTATTACCCGTGAATATCTTGAAAAATAAAACTGAGGTTTACATGAAAGCACCTAAAAAGACTCGTGCAAAATCTGCTGCGGCAGTCGCCGTCCCTCAATCTCGCGAGGACGTAATCAGTGATATTCGCAAGATTGGTGATATTACCCGCGTCATTTTGCGTCGTGAAACAGAGCTGAATGACAAAATAGCCGCGCTGACGAATGATGTTGCACCAGGTATTGAAGCGCTTAAAAAAGAGCTTGAGCGTCTGCAAAAAGGCGTCCAGACATGGTGTGAAGCCAACCGTGCAGAACTGACGAAAGACGGTAAGACCAAGACGGCCAACCTCACTACCGGCGAAGTCCGCTGGCGTAAACGTCCACCCAGCGTCACTATTCGCAAGGTTGAGGATGTTATTGCGATGCTTAAGAAATTCAGTCTGGGTAAATTTCTTCGCAATAAAGAGGAGATTAATAAAGAGGCAATTCTTGCATCACCGAATGAAGTTAAGGGGATTGCAGGAATATCCATAAAATCAGATGTTGAGGATTTCGAAATAATCCCATTTGAACAAAGTGTAACTGATTAAATAACTCTCTTTAAATCTCGGCTATTTAATACGGCATGCCTGCCGGGGCCTTGTGCGTCCGCAGGCAGCTAAAGAAGGAGTTTAAACAAATGTCTAAATCACTAAATGCACGCTGCATCCGCCGCTGGGAAGTCGAGTTTAAACCGCTCTGCGATTCAAAGGTGAATCCGTACTGGCGCAAGCGCGATCTGCGCGGGTATATCCGCGAGGCGGCGCTTACTACCGCTTACAGCATGGTCGATAGCATGGCTGAACGTAACGCCAAATTTGACTTCGACGGTTCCACTATTGGTTGGTCGCCTGAGTTCTCATCCTGGTACCACGAACGCCGAGAAAAGTATCTCAAGGAGGCGCGAGACTATCTGAACGAAGACGCAACCAACGACGAAATCGACGAGGAAATTCAGAACGAGCTGGAGGCATGGAATGACTAACCGTAGAGCCATCAATATCCAGCAGATTCGTGGTGATATTGCCAGACGTAAAGCGATGCCTGGGTTTGGCCCGGACACCAGTATCGAACGCCTCAGAACAGTTCAGGAGACGCAGCGTAGTTTCACACCAGAGATTGTGGAAGCACTGCTGGATGAACTGAAAGTTGTTACGCACGCCGCCGCAGTAGACCACGAAGCGGCCTGCTCTCTGGCTGAGGAAAACGAAGAACTGAAGCGCAAACTGGCATCGCTGGGCACTGAGCCACGGCGAAATCCCGTGCTGGCCTATGCAGACAGCTATCGCGACATGGCTAAGCAGGGGGTGGAATCTATTCCGGTTTGGAATGTGATTACTGACCTGGAAAGAAATATTGCGCCACTTTACACCGCGCCGGTAGAACTTACCGATGAGCAGATCGACGCAGTGCTTGATTCTCACGGAAACATAGCATACGTCATTGCCGACAAGCGGGAACGCCTGCGTATGTTTGCTCGTGAAATTCTCCGCGCAGCCATGCTTCAGCCCGTAGACATTCAGGAAATTCCATCCGGATGGCGCTGGACGCTGCTTCCACCGGATTCACAGGAGTGAATGAGAATGGTGATCAAGTTCCCTGAAAAAGTTTTTGAGCTGGGTCAGTGGTGGTATCGCACCGCTGTTGAGTTCGAGATCGACGGCATCAGAAATACCGCATTTTTCTATTCACTAAACCGATATGACGCCGCTTGCAGGCTGGCAGCTATAAAAGAAAATGGCGTGCTGGGCGGCGAAGATACTGCATTTATTGATACTGGTAACGGGGGATAAATTTATGAGCCGTGCCAATCTAATAAAACTGATCCATGTTGCCCGTCGCAAGCTCCAGTTGGATGACGATACCTACCGTTACGCGCTTCATCGTGTAACAGGGAAGACCAGTTGCCGTGAGCTGAAGGTCGCCCAGCTTGAGGCCGTTCTGAAATCACTGGAGGATAAAGGTTTCAGGCGCACCCGTCCCCGTTCTCCGGCACGTCGTCATCGTGAGACGGATGTTTCTGCAAAGGTTCGTGGTATCTGGCAGCAGATGCATAAGGACGGCTTTACCCATGATGGCAGTGATACCGCGCTGGACGCGTTCGTGGCGAAGATGACAACCAGAACCAACGACGGTCAGGGTATTGCCAGTCTTGCCTGGTGCCGTGGAGATGATCTGCTGATGGTGCTGGAGAGCCTTAAACAGTGGCATATCCGTGAAATGAAGGTTGCTCTGGCGAACAACGGTTGTTTTCCGGTGAAACGTGGCTATGACGCTATTAACGATGTATACACCCGCAAGGTCAGAAAGGGGGCATCATGACGGAAAAACAGGATGATCTCTTCGGTGATATCCGCGACGACAGCATCCTTGAGCAACTGGATGATGATTCAGCAGAGTCACGCCGCTTTCCGGCGCTGCTGGCTCAGCTTAATGCCCTGTTGAGAACGGAACTGGAAAAACTCGGCCATGACCCGCGAATCTCACTCGATCTGATATATGCCATCAGCAAGTCAATAGGCGGGATGCAACTCTACTTTCCACGGGGAACTGCGCTGGAATCCCTGATTCGGGATATGAAGATCTGGCGTGATTTCAATGGCAAAAATATCCCTGAACTGGTTGAGCGATACCACGTCACATTCAATACGGTGTATGCTGCAATCCGGCGAATGCGTAAACTTGAACAACGTAAATATCAACTGGACCTGTTCAGCAAGGACTAACTCATGAAATTCATTGGTACTCTCTGTGTCATTTTTACCATTGGGTGGGCCTTTATCACACTGACAAAAAACGACAAACCATTATCCGTTAATGAAAAGCTCATTGCTCAATTTGCTAATCAACGCCTGCTTAAGAAGGACGAGTGGAATACAGGTAATGGTGTTGTTGATGGTGTTCAGGTGTATACCGCCAGAAAGGAATACCCGGCATTTGCAAGCCTCTGGAGACTCGGAGTAAAAGATGCAGGTGTAACCGTACTGACTTCAGGAACGTATCCTGAATTTGAACCAGTGCTGGCAATGGGACAATGTAAAAACCTTGCCATAGCGGTTTTTGACAGCGACAGCAAACCAGTTAACGATGCTGTTTCAACCATATTCACAACAGCGACAGAGACCTATAAAAAAGAAGGGAAAAAAGTACAGGCAACCGGCGATATAGGAAACCTCCCCTTCAGGGTGACCGTTCAGAATATCGACTCGGTGTTAACTTTCTCCTGCGATATTGACCTCAGCCACTACACTTCCTCTATTTGAATAAAGCCGGTTAATCCGGCTTTTTTTCTACCGGGCACATGATGGAAAGGTTCCATAACTTTTCAACAGGTGCCCTAAATGAACAACTCTCCAGAGTCCCCTGCATTCCGCCATGCGCTGCTGTTCGTGCTCCAGTTCGAAGGCGGCTACGTTAACGATCCTTCTGACCGTGGTGGCGAAACCAACTTCGGTATTTCTGATAAGCGTGATGGTGTCGCCGATGGCATGACTGACGTCGATGGTGATGGTAAGCCTGATACACGCATCCGCGATCTGACCGTTGACCAGGCGGGTCAGATTTACTTCCGCGATTACTGGTTTCCCGCTTACTGCCCTGAATGGGCTGACGGCATTTCTCTTTTTTTGTTTGATTCTGCCGTCCAGCATGGCGTCAAAAAAGCGGTTCAGATGTTGCAGGAAGCCGCTGGTGTGGCTGCGGATGGCATCGTGGGCGCGAAGACCCGAGCGGCCATCGCATCATATGACCCTCAGTATCTTCTGGCCCGGCTGTTCCTTCGTCGCTCCCGCTATTACGCCGACATCATTAAATCCAATGCATCACAGGGTAAATACCTGAACGGCTGGTTCAATCGCCTCGATGCGCTGGTAAATGCCTGCATGGAAGTTCTCGACGACGGCAACCTCGATGTTATTGCTTCTCCACGGAGCTGACGATGGGTAAAGGCTGGGAAGGCTCAATGCGTCAGGGTCGCCGTGATCGCCTGCGTCAGGAGGTCCTGCACCGGGTGGCCGGAGGGCCACCACCCGTGCCCCTCAGTTATCAGGGGCATGACGGGACTCATGGCAGCTTCTATATGCGTGGCTGGGAGTCCGTTGATACCCGCGATATTTTCTGGCAATGCCAGCGTTATAAGGAAAAACACAGTGTTTAAATCAATAAATACCGACTGGTTAAAACCGGCATTACTGCGGGTTTACCAGTCCGGATGGACCCTTTTGATTTTTGTCGGTCTGTTGCTGGTGTTCTGCAACTTCCACGGCCGACACGCTTTTCTGGTCTGGTGGCTGGCTTTCACAGGCGTTGTGCTTGTCGGGTTCAGTATCTTTCTCGGCAATCTGCCATACCGGCTTATAAAACCCGAAATGCATGTCAGCAGGTTCGCCAGTTTCTGGTCGTGGGTTATCTGGGGGGTGGGATTCATCCTGATTTGCCTGAGTCCGCTGTATGCCGAACCGTTATACCTTCTGTTTCTTGATCCTGCTGGTGCAGCGCTGGGTGTTCTGTTCTGCCAGTGGGTTCGTCGTAAGGGGCTGCTCGCATGGATCCAGTAACTATCTCCACTGTTGCCTCCGTTCTGATGAAGGCCGGACCGTCGCTGCTGCGTACTGTGGGCGGCTGGTTTGGTGGTGACACGGCCAGAACGGCCGATTCTGTGGCGGGTATCGTTGAGAACGTCAACAGCGTCATCAACCCGCAGGACCAGCAGCGCGTGCTTGAGCAGAAGATTGCAGCACTTCCGCCGGAACAGTTCGTCCAGCTCCAGTCCCTGAAAGTCCAGGTTGAGCAGTTCCAGCTTGAGCGGGACAAGGCGGTACTGGCTGACCGTCAGGCTGCTCACCATGAGCAACAGGAAACCATCCGTAACGGTGACAACGCCACGGACGAATATGTGCGCCAGACCCGTCCACTGATGGCCCGTCTGTCGCTCTACAGCAGCATTGCTTATGTGATGCTGATGTCTGTGGGTCAGCAGGCTGGCGCGGTATCCGGTGCTTTTGGTCATACGTTCGCCATGCCGTCACCGGACTGGGATATCGCGCTGATGCTGGCAACACCGGCGCTGGGGTATCTCGGTTTTCGCACCCTTGATGGTTTCGCGCGGTACAGCAAATCCAGCAAACACAAAGTGATGGTGGGCGGTAAATGACGGACGAACTGGACAGGGCCAGCGGCCTTGAAATGGCAGACCGTGAACGGGCGTTAAACGCCCGGTTAAACCGCGTTAAAGAAGTCCCGGAGGAATCCGGGTTCTGTAACGACTGCGGGGACGCTATCGACCCGGCCCGGATTGCCGTTCTTCCGGATGCGGTGACCTGTATTGACTGTCAGACACTCAGAGAACGGAGCGTGTAATGGAGTGGGAAACCGTCAGAAGTAACTGGGCCGTCATCTGGGCGGTACTGATGTCCGGGGTCAATATCGTCCAGCTTCTGCTGGCGAAAACCTATGCCCGCCGTGAAGAACTGGAGAAGGTCAGCAGCCGCCTGAACATTCTTGAGCATGCGGTTGACGGACTGCCAACCCGGCAGGAACTGCACCAGTTGCAGCTTGAGATGAGTAACCTGCGCGGGGAGCTGCGCGAACTGGCTCCGTCCATCAGACAGGTCAGCCGCATCAGCGATCTGTTGCTTGAGAATGAACTGAAGGAAAAAAACTAAGAGGCTATGAGCATGCAAGAAATCCTCAACAGCGACCAGCGTCTGGTCATTCTGCGCTCACTGGTGGAGTGCGGAGACAGCGCAAACGAATCCATTCTACAGACCTGCCTTCAGACCTACGGTCACCGGGTTTCCCGCGACACGGTACGCACTCACCTTGCGTGGCTGCGCGAACAGGGGCTGGTTACTCTGTCGGATGTCTCAGGGTGTTACGTTGCCGCCATCACCGGGCGCGGTGAAGATGTTGCTTTCGGGCTTGCAACGGTGCCGGGTGTCAAAAAACCTCGTGCGCGGGAGTGACTATGGAGCAACTCAGGATCCGACAAATGTTAGAGACATGCCGCCAGCAGGCGGAACAGCTGCGCCGTCTGGCTCGTCTGGCAAAACTTCGGGAATCCGGTGAAATCGGCATGTCCGGGAATGCCCTCTTTCAGGCGGCGGTGGTTATTGAGTCCCTTGTTGGCGCGAATGAAAAAGCGCTGGAAGGCATCGAACGGCTTGACCGGTCTGAAACCCAGCTTATCGGGGAACGCGATCAGGTTATCGCGGCGCTGGATGGTATGTACGAAGCCGTGACTGGCGCACCGCCGGAGTGGAGCAGCGCCTTTGGTTTTACGGATGCGATTAACGAAGTGACTGAACGCATTTTCGAGATGGAGAACGCAGGACATGACTAAATCACTTAAGGCGCTCAGCTGCGGACAGCGCGACATCATCAGAAAAATGGCGGCAATTCTCGTCTGTGCGGAAATCGAAGTCAGAGCCATTGCGCCACAGTATGAAAAAACGACGGGTAAAAAGTACGACTCCAAATCGGCTGACTCGTATCTGAATACCTTTCTCAACAACAATCCGGAATACAAACGCGTCTGGAAACTGCTGCTGAAGGACAAGACCAGTCACGAACGCGACTTCCTTGCCCGGATAAGGGGGGAGAATGGCAAGTGAGCAGCGCCCGACCCGTGGACGTCCGTCAAAGATTGATCTGCTGCCGGACAGCATCCGCGATCAACTGCATCAGATGCTGCGTGACAAACGGCATACGCAGGAGGAAATCCGCGAAGCCATTAACGAGCTGATTGACAGTCACGATCTGCCGGAAGACATGCAAATCAGCCGTACCGGTCTGAACCGCTACGCCAGCCGCATGGAAGAGTTCGGGGCAAAGATTCGCGCCTCGCGCGAGATGGCCGAAATCTGGGCGGCAAAGCTCGGCTCTGCGCCGACGTCTGACGTCGGTAAACTGTTGCTGGAGTTCGTCAAAACGCTGGCCTTCGAAACCTCAATGGACATGGCGGACAGCGGTAAAACGGTTGAGCCAAAAGCGCTGGGACAGCTTGCCCTTGTCGCCCAGCGACTTGAGGCTGCTGCGATGGCGAGCCATAAACGGGAGAAGGAGATCCAGCAGGAATTTGCGAAAAAAGCTGCTGCGGCCGCAGAAAGCATCACCCGTTCAGCCGGGTTATCTGCGGAGACCGCTGCTGATATTAAACGTCAGATTCTGGGGATTGCTGAATGACCGCCATACCGCCCGCCAGCGCAGTGACCAGCCTGTCCGCCGCCGCGATCCTTTCCGGCGAGTTCGACAAAAGCCAGCTGTTGCTGCCCTATCAGAAACGCTGGATTGCGGACAGCGCCCAGCTCAAGATAGCGGAGAAGTCCCGTCGTACCGGTCTGACCTGGGCGGAAGCCGCTGACGCTGCACTGAACGGGTCAATGTCGGTTGAAGCAGGAGGCTGTGATACGTTCTACGTCGGCACCACAAAGGACATGGCGCGTGAATTTATTGACGCCTGTGCCATGTGGGCCAAAGCCTATGACCGCGCGGCATCCGATATTGGCGAGGAGGTGCTGAAGGACGAAGACAAGGACATTCTGGTCTACGTCATCCAGTTTGCCAGTGGTTATAAAATCAAGGCCCTGTCATCAAACCCGTCAAACCTGCGTGGTATGCAGGGTAACGTCATCATTGATGAAGCCGCCTTTCAGGCTGACCTTGCCGCTGTACTGAAAGCCGCACTGGCACTGACCATGTGGGGCAATAATGTCCGACTGATCTCCACCCACAATGGTATTGATAACCTGTTCAATACCATTATCACCGACAGCCGTGCAGGTAAAAAACGCTACTCCGTTCACCGCGTGGATATTGAAACCGCCATTGCGGAAGGTCTGTATAAGCGCATCTGCCAGGTAACGAAAAAAGAGTGGTCATCAGAGGCAGAGGCCGAATGGCTGGCGAACCTGCTGAGTGATACGGCAACCGTTGAAGATGCCCGCGAGGAATACTACTGCGAGCCGAAGAACGGCGGCGGTGTGTATATCGCACGTTCCCTGCGTGAACGTGCGGCCAGAGGGCCATCTGTTGTACTGCGGTTTACCGGAACGCCGGAGTTTAATGCCCTGCCTGAAGGTCTGCGCCGTCTGGATATGCAGGAGTGGCTTGAGACGGTTGTCCGGCCCGAACTGGAGAAACTCCCCCAAAACCTGCGTCATTGCCTCGGCGAAGACTTTGCCCGCAACGGCGACCTCACCGTCTTTGCGCCGGTGACGGTGAACGATGACACCACCCGCGAAGTGCCGTTTCTGGTTGAACTGAGCAATGTGCCGTTCAAACAGCAGGAACAGGCGCTGTTTTTCATCTGCGATCTTCTTCCCCGTCGCGACGGTATCAAGCTGGATGCACGCGGCAACGGTCAGTATCTGGCAGAGCAGGCCGCAGAACGTTACGGTGATGAAGTTGAGCAGGTACAGCTCTCTGTTCCGTACTACCGGGAAAACATGCCCCGGTTTCGCGCGGCATTTGAAGACAATGAACTGGTGCTGCCAAAGCACGAGGATGTTATTACCGACCTCGGCGCTATTCAGCTTTATCGCGGCGTACCGGGCATTGATGATGCCCGTACAACCGGCACCGATGGGCGCAAACGTCACGGTGACTCCGCCGTTGCGATCTTCCTGGGCTTCCTCGCCAGTCGGGAAGACTGCCGCCGTTATGAAGTCCACAAATTAAAGAAACCCTCCCGCCCCGATGAACGTAATGAGCACCGGCAGGTTCGCATCACGCGGGGTCTTAAAAACCAGCGGGGATTACTCTGATGTTTAAACAGTTAACCGGGGCCGTTCGTCGGCTCTTCAGTTCTTCCACAGGCCAGACTGTCACCGTTCTTCAGGAAGAGCTTCAGCAGCCGCAGGCCCGCGCCAGCGTCGTCAGCGTGCGTACACCCTCGCCGGGTATCAGCGTCGCCAGCACTCTGTCACCGGGCAGACTGGCGGGTATTCTGCGCGGCGCAGCGGACGGTAACGCCCGCGATTTTTTTATCATGGCCGAAGAGCTGGAAGAACGCGATCTCCACTATGCCAGTGTACTGCGTACCCGTAAGCTGACCGTCGCCGGGATCGAACCTTCAGTGGAGGCCGCTAGCGATGATGCTCGCGACATTGATATTGCTGATGCTGTTCGTAACCTCATCACCCAGCCACAAATTCCGGAGCTGCTTTTTGATCTGCTTGACGGTCTGGGGAAAGGCGTGGGCGTCTGTGAAATCCTCTGGGATACAAGCAACCAGTTCTGGCAACCCCGCGATTATGAATGGGTGGATCCTCGTTTCCTGAAGCCGGACCGTGAGACCTTACGCGACTTCCGTCTGTTAACGGATGCCAGTCCCATTGAAGGTGAACCCCTGACGCCCGGTAAATATGTTGTGCATCAGCCACGCCTTAAATCGGGTCTCCCGTTACGCAACGGCCTGGCGCGTCTGGTGGCGGTGATGTACATGCTCAAGTCCTATACCGTCAGGGACTGGTGGGCGTTTGCCGAGAAGTTCGGTATTCCCGTTGTGGTCGGTAAGTACGGTAATAACGCCAGCCCGGAGCAGATCCAGACGCTGCTGGAAGCCATCGCGTCACTGGCATCAGATGCCGGTTGCGCCATTCCGGCTTCGATGACCCTTGAAATGCAGGAAACCGCCAGCCGTAATAACGGTGGTGCGCTCTTTAAAGAGATGGCCGTCTGGTGCGATGAGCAAATCAGCAAGGCCGTGCTGGGTCAGACCATGACCACCGATAACGGAAGCTCCCGGTCACAGGCGGACGTGCATGACCGGGTGCGCATGGATATTGCCCGCTGGGATGCCCGGCAGCTTGAGAACACACTCAATGAATTTCTGGTGCGCCCCTTCGTTATCATGAACTACGGGCCACAGGATTCTTATCCCCGCGTCGTGCTGCGCCTGAGTGAACCCGAAGACCTGAAGATGCTGGTTGATGCGCTGACCCCTCTGATTGATCGGGGTATGGAGGTGCAGATGTCAGAGGTGCGTGACAAGTTCGGCCTGTCAGAGCCTGAGAAAGGCGCGGCATTGCTGACACCCTCCAGCCAGGCGCTCCAGCCTGCGCTGGCGATAAACCGGGAGCGCCTTGCCCTGAACCGTAACCAGCAGGACGATATTGATCTGATGGTTGCTGATGCCATGCAGGACTGGCAGCGTACCGGCGATGCGTTTACCAGCCCGGTGCTGCAACTGGCGAAGGATGCTGACAGCTTCGACGCGTTTCTGGCCGGTCTGCCGGAACTTCAGAAAACGCTTGAACCAGACGAGTTCGTCACGCAACTGGCGCAGCTCTGCTTTAAGGCCCGTGCTCTGGGAGATGTAAACGATGCGTAAACCTGAACGCAAACCGGACATTATCCCGAAGGAAGCGCTGGAGTGGCTGAAGGCGAAAAAGCTGAAGCCGGGCTTCGATTATCGAGACGTGTGGCAGGAAGAGCACCGTTACGGTTACACCGTCGCGAAGATGACGCAGCTTGATTTGCTGGCCGACGTTCGCCAGCTCGTCGAGGATGCGCTGGAGAACGGCCAGACATTCGCGCAGTTCCGCGAGCTGTTGCGCCCTTTACTGGTAAAGCGCGGATGGTGGGGACAGGCGCTGATGGATGACCCGCTGACGGGGGAGACCCGGCAGGTGCAGCTCGGCAGCGAACGGCGAATGCGCGTTATCTATGACACCAATATGCGCACTGCCCGCGCAGCCGGTCAGTGGGAACGCATCCAGCGAACAAAGCGGGCGATGCCGTATCTTGTTTACACGCTGGGACCGTCACGGGAGCACCGGGCGGAGCATCTGAAATGGGCGAATACCTGTCTACCGGTTGACCATCAGTTCTGGATAACACATATGGTGCCGAACGGCTGGGGTTGTAAATGCAATGTTCGTCAGGTCAGCCTCTATGAGTTTGAGCAAATGCAGCAGAACGGCACCATCACGACAACCGCACCGGATGTCCGTTACGTGAAGTGGGTCAATAAGCGCACGGGCGAGGAAGAATCCATTCCAGAAGGGGTTGATCCGGGTTGGGCATACAATCCGGGTATTTCCCGCAGCGATGCACTCAGCCTACAGTTGAAGCAGAAACAACAGGCATTTGACAGTTATACATCATCGCAGAAATAATCCCCCTCAGACGCGCCCGGTGACGATATCGTGATTTATGCTACGATGGCGCGTGGGAATTTTATCTAACGTCTGTACGCGCTTTTAAACGGGTTTTAAACGGGGTTCCCCGCGTCGTTTACAGTAAAGCCCGTTAATCCATCTTCCCTTTCTTTCTCCCGCATGCTGTCCGGAGTCTTTTACGACAACGGACAACACCATGAAACCGACCAACACGGAACTGCTGGCACTCTGCTTTCAGCTTCCTGAACTTGCTGATGATGCGCTGCCGGAATGGCTGCCAATGATACCGGCCGGAACCTTCACCGGCCGCGATGGCCGTTCGTGGATCAACAACCAGCCGGAATCCGTCATTCGCGCCACCCTGAGCTATCCCAAACTGCCGTTTGATATTGAGCATGCCACCGAACTGAAAGGCCCCAAAGGCGAAGAAGCTCCGGCCTTTGCATGGCTGGACGATTACCGCATCCGCGACGATGGCGTAATTGAGGCGCATGTCGAATGGACTGCTGACGGTGCCGCACTGGTTCGTGGCAAAAAATACCGCTATTACAGCCCGGCATTTCGATTCACTGCGGATGGTCAGGTGACCCGCCTGTCCAGCGCCGGGCTGACTAACAAACCCAACCTTGATTTACCCGCACTCAACTCAGAGGAAAACACGATGACAGTACCTGTCCAGATTGTGACGGTGCTCGGCCTCGCGGCTACGGCCAGCGCGGACGATGCAGTCAAAGCCATTCAGCAGCTCAAAACCAGCGAGCAGGTTGCCCTTAACCGCGCGGAGAACCCGGACCTGACGAAGTTCATTCCGGTTGAGACTCACCAGTTAGCACTTAACCGGGCTGAAACGGCTGAAGCACAGCTCAGTGCAATCGCCATCAAAGAAGCAGAAGCACTGGTTGATGGCGCTATCGAGGCCGGAAAAGTTGCCCCGGCCAACCGTGAAATGTACCTCGCCACCTGTCGCTCTGAAGATGGCCGCAAGCAGTTTGCGGAGTTCGTTAAAGGTGCGCCAGTCATTGTCAGCAAGGACCCGTCCGACAAAAAAGATCCAGGCGGTGATGGCAACACCACACTTTCCGATGAAGACCTCGCGATGTGTCGCCAGATGGGCATCACCCAGGAAGAATTCCTTTCCGTTCGTAAGCAGGAGAAATAATCCATGCAGGTATCCGCAGAAGTATTGCATGCGCTGACCACCGCCCTGAGCGCCGCCTTTACCAAAGGTGTCGGGCGGGTCAACCCGCAGTATCGCTCCATCGCCACGGTCATTCCCAGTACCGGCGCATCTAACACTTATGGCTGGGTTGAAGATTTCCCGACGATCAAAGAGTGGATCGGGGAACGTCAGCTGAAAGAACTGGCTCAGGCCGGGTATACCATTACCAACAAGACCTGGGAAAACTCGGTCAAAGTTAAGCGCGAAAAAATCGAAGACGATCAGATTGGTCAGTATTCCGTGATTGCTGAACAGCTTGGCCGCGATACCACCATTTTCCCGGACAAGCTGTCGTTTGAGTTGCTGTGTAAAGGTTTTGATACGCTGTGCTGGGACGGCCAGTATTTCTTCGATACCGATCACCCGGTCGGCACATCCACCGCCTCAAACGTGATCGGCGACCCGACAACCGACGCAGGCGAACCGTGGTTCCTGGTGGATGCGACACATGCCCTGTTACCCATCATTTACCAGGAGCGCCGTCCATTTAATTTCACCGCCCTTGATGATCTCACCAGTGAGCGCGTGTTCCTTCAGAACGAATTCGCCTATGGCACCGATGGCCGCAGTAACGTCGGCTTTGGCTTCTGGCAGACCTGTGTCGGGTCGAAAGCAGCGCTGAACAAAGCTAATTATGAAGCCGCCGTCTCCGCAATGATGGATATCACTGACTCCAACAGCGAACCACTGGGCATGAACCCGACACTACTGGTTGTCGGTAAAAACAATCGCGGTGCCGCGAAGTCGCTTATCGAAGCGCTCATGGCAGATGGTGGCGGTTCCAACATCTATTACAAAGACGTTGATCTGCTGATTTCACCTTACGTAAAAGCCTGAGTCCGTTACGTAAAAAATAACGTAACCCTGCATTACAGGAGGGGTTAACCCTCCTGTAAATCACCGCTAACAGAGGTTTAAAAAGTGAGTGGAAAAACTGAAAAGAAATCTGTCAAACCCGCAGCCGGTAGCGACACAACCACGGCACAGAAAGACATCAAAGGCACGCAGGACACAACTGTCAAGACGGACCCGGCATCGCTGGCACCAGTTACTTCTGCTGACGGTCAGACATCACAATCAACGGTTACAGCGACAGATGGTGAAGCTGGCCCGGAATCTGATGCTACCGCCACCATTGCAGGAGTAACACCGTCCCCGGAGACGGACCATTCTCATCTTCAGGATGCGGGGGTTAATACAGTCATGTCCGCACTATGCCAGTCAGTTTCGGATGGTGTTCATATTTCTGGTGACGTGACCGTGCTGGAAGTTCGCGCAATCCCTGAAGGTGGATTTCACCGCGCGGGACGTTTCTGGCCGCATGACCCGGTGCATGTCTTCGTCAGTGATGATCCGGACGAGCAGGTTCTGGAAGATGGCAGCGGTCAGCCGCTTTATGGCTGTGTTATCAGCACCGCAGATGCCAGCCGTCTAAACCGGGAAAAGATGCTGGTCGTTACTGAGCTGAAACCAAAGGCGGAGGAAAGCTGATGGGTATCTACGTGACGCGGGACGATTTGCTGGCAACGGATGCGGAACGCGTCTGGAACATGGCGCTGAACAAAGCTACCCAGCAGCTTGATGAGGAGAAGATCCAGCGTGCAATTGATGATACTGACGCGGAAATCAATTCCTTTCTGGCGAAGCGTTATCACCTGCCGCTGAATCTCCCGACACTCCCGAGTCCGTTACGTCGTGCGGCGGTCTCCATCGCGTTCTACTGGCTGTCTGAACGTGACAGCCAGATCACCGACGAGATCCAGAAGCGCTACGACGATGCCCTGCGTACGCTCCGGGAAATCGCCAACGGCACCCGTGACCTTGGCGTACCGTCTGATACGCCGGTGCCTGAAACTGACACCGGCAAGCTGATTATTGTCAGCGATAACCGACGTCTGTTCACCCGTAACAACCTTAAGGGTGTGCTCTGATGGGAATAACCGTTGAGGTGACAGGGGCAGAGAAGCTCCAGACTATCCGCAAGGCAATGGAAAAGCTGGCCGACAGTTCGCTTCGTCAGGAACTGCTTGAAAGCATCGGCGCGGTGGCGGAGTCCCAGACACGTCGCCGTATCGCCAGCGAAAAAAGCAGCCCGGCAGGTGCAAAGTGGCAGGACTGGTCTGACAACTATGCAAAGACCCGCCACGGTAACCAGAGCCTGTTGCAGGGTAATGGCGACCTGCTGGACAGCATCCAGTATTTCGTCAGCGGCGAGCGGGTGCATATCGGTACGCCACTGCCTTACGGCAAAACGCACCAGGAAGGCTTTTCCGGTAGCGTCGCTGTGTCTTCCCACAAGCGCCTTATCACACAGGCATTCGGCCGGGCGCTGAAGCACGGCGTCTGGCAGACCGTGGGGGCGCATCAGCGCCAGATGGACATCCCGCAGCGCGAGTTCCTCGGTCTGTCTGCGGATAACAGTAACGAGCTGACCAGTGTGATCGGCGATTTCTGGAGTGAGGTTCTGAAATGAGTGAACGTCCTGCGTTCGTGACCCTCGGCAGTACGGTCAGTGCCGCTGAAAACATCGTGGCCTGGCTGAAGACGGAGCTGGAAGGCAACACGCCAGACCGTGTGGAGATAGTGGAGCGTCATGTCGGCCAGTTCAGTACGCCGGATGAGGTGAAGCGTTACCTTTCCGGGCGTTCCGGCTGCGTGCGTCTTGCGGCCCTGCGCGTACGTAATATCAGTAACCGCAACGGTATGACGGGGCTGGTGACGTGGGCGGCCTACGTCATGACCTCCGACTCATGGGGCTATGCCCGTGATGCCCGCTGCGAGGTTCTGGCCGGGAAAATCGCCCGCCGCATTTCAGTCCGGGAGGCTCCCCGCGCCATGAAGGCTGAGCGCATGGCGGAAAATATCGGCGCTGAAAATATCTACTCCGGCCGCCTGGATAACTTTGGCGTCAGTCTCTGGGCGGTGACGTGGGAACAGGTGTTTCGTCTGGATGACGAGATTGATATGGCCGCACTGCCGGAGTTCCTGCGACTTGGCGCATCGTTTGTCGTGAACGGCCAGCCGGTTACTGAAGAGCCGGACATCATTAATGTAAGAGAAGGTCAGACTGATGAATAAAAAACTGATTAAGCCCGCCCGCCCCGGTCTCCGGGTACGTAAGGCAGATGGCAGCCTGCTGAATGCTGATGGCGAAATACTTGCTGTCGCTGCGTACTGGCGACGTCGTGAATCCGAAGGTGATGTGGTTATCACCGCGCCATCCAAACCCAAATCCGGCAAAGCTGATAAGGAGGCATGATGGCTCTGGGTAACATTCCTGATGATATCCGCGTCCCGCTGGTGTGGATCGATATCGATAACTCAATGGCGATGAGCGGTGCGCCAGCTCAGTCACGCAAAATTCTGGTGATCGGCCAGCAGGTGGAGAGCGCCAGTGCTGAACCACTAACGCTCAATCGCATCACCGGCGACAGTATGGCGGATGAATACCACGGCCGGGGATCCATGCTGGCGGAGATGCTGAAAACCCTGCGTAAGGCAAACAGCTATACCGAGACTTATGCAATGGGACTGGCTGACATCATCACCGGTGCTGCTGCGACAGCCAGTATTACTGTCGTGGGAGACGCTCTCGCTGCCGGTACGCTTGCCCTTCTGATTAACGGTGTGTCCGTACAGGTCGGCGTCGCTCAGGGGGATTCTGCTGAAACCGTGGTGCAGTCCGTCATTACGGCCGTCACAGCGAAAACCGCCACGCAGGTCAGCGCAGTCGTTGACGGTGAGAATGCCGCCTCAGCGGTGCTGACGGTGAACTGGAAAGGTGTGACAGGCAACGACTGCGACGTACGCCTGAACTACTACTCAGGTGAGAAAACACCGTCCGGTATCAGCGTTACCGTGACACCGTTCACGGGTGGTGCCGGTACGCCGGATATTCAGGCTGTTGTCGCCGCGCTGGGTGATGACTGGTACACCGACATCGTTTTCCCCTACAACGACACGCAGAGCCTCAACACTATCCGCGACGAGCTGCTGGAACGCTGGGGGCCGCTGAAGATGATGGAGGCGCAGCTGTGGACTGCATTCCGTGGAACACATGCGCAGACCGGCACGTTCGGCAGCGCCCGCAACGACTGGCTGATTTCCTGTATCGGCACCAACATTTCCCCGGAGCCGGTCTGGTTATGGGCGGCAAGCTACGGCGGAACGGCAGCTTATCAGCTTGCCATCGACCCGGCCCGTCCGCTTCAGACGCTGGTACTGACAGGTATCAAGTCGCCAGCCCGCGCCGTTCGCTGGGACATGCCGGAGCGTAATCTGTTGCTGCACGATGGTATCGCTACCCACTTTGTGGATGCCGGGGACAATGTCTGCATCGAGCGTGAAATCACCATGTACCGCGTGAACAGCTTCGGTGACACCGACATCTCGTACCTCGATGTGCAGTCACCGGCAACGCTGGGACGTATCCGCTATGTCATCAAAAACCGTTTCACCAGCCGTTATCCGCGTCACAAACTGGCCGGGGATGATGTCCTTGATCTGCTCGATGCCGGACAGCCAGTGATGACGCCGAAAATCTGTCGTGCCGAGCTGCTGGATATTGCGCTGACCGAGCTTATCCCGGCCGGGCTTGTGGAAGATTTCGACGATTACAAAGACACGCTGGACGTCTCCATCGATTCCAGCGATCCAAACCGCCTGAACTTCATCTGTCACCCAAACCTGGTGAATCAGCTGCGCGTTCTGGCCGGTCTCATCCAGTACAAACTCTAAGGGGAAGCTATGTCGAGCATTCTGGGAATGGCGGCCATCCGTATCAACGGCCGCGAAATCAAGACCGAGGGAAAATCCACCCTCAATCCGGGCGGGTATCAGCGCCAGCAACATATGGGCGCTGGCAAAATCTGGGGGATTTCCCGTAAGACCGCCGCCCCGTCCATCAAACTGACCATTGCGGCAGACCAGGACGTTGATGTCATCGAGATAAGTCAATGGGAGGACGTCACCGTGATGTTCTACGGCGACAACGGCCTGAACTACATGATGACCAAAGCGGCAACGGACAACCCGGCCGAACTGGACGAAGACGCCGGAACCGTGACGGCAAACTTCATCGGCGTTCAGTGTGTGAAGGTGTAAGACATGGCAACGATTGAATTTGATCTGATTCATGGCCTGCGCACCGGCGCAGGCACCACCGATGAGGCGATGCACAAAACTGTCAGGCTGCGCGAGCTGACCACGGATGACATTGTGGACTCGCAGCTGGCGGCCGAACGTGTCGTGATTGGTGAGAACGGTAAGGCGGTTGCCTACTGTTCTGAAGTCCTTGTCGGGCTGGAGATGCTGCGCCGTCAGATTGCCAGTATTGGCTTTATCCCCGGCCCGCTGGATATGAAACAGTTACGTCGTCTGCACCCGGACGACCTGAATCTCATCAACGAAAAAGCCGCCGCACTGGATGACATGCTCCGCGAGGTGGCTGAACGGGGGCGAGCTGATGCCGCTGGCAGCGGCACTGACCCATCTGCTGATTAATCTTTCACAGCGCTTTGATATTCAGCGTCTCGGACAGCTGCCCCTGCGGCAGCTGTTAACACTGGTGCGGCAACTGGAGAAACAATATGGCCGGAAATCGCCTTAGTACGGAAATTCTGATTAATCTCGCCGGTAACCTTCAGGCCAAGGCCCGCCAGTACGGAGCCAGCATGTCCGAGTTTGCCAGCCGTAATCAGCGGGCGATGTCCATTGTTCGGGCGACGTCTGAAGCGGCCGGACGTGGACTGGACAGGTTGGGCAACCGTTACACCGGCCTGATTGCCAGCGTGGCCGGGGGCGCAGCCCTGCGGGAGTTTGCAAAAACGGATCGCATGTTGACCGAACTGGGGATTGCCGCCGGGAAGACGCGCGAGGAGATGCGCAAGATTTTTTCTGATACCCAAGATGCGTCCATCAAATTCAGGGTGGACGATTCGGAAGTGATGGCGGCAATTTCTAATGTCAACAAAATGACCGGCGATCTGGATTTTGGTGTCAGTAATAAGGACATGATGGCGGCTTCTATCGCAGCATCCGGATCTGATGGTGAATCAATCGGTGGGTTGTTCGCCAGTTTCCAGAAATTCAAAACCAAAAATGAACATGAAAACCTTCTGGCGATGGATCTGCTGAACCAGTTGGGTAAGGAAGGTGGTTTTGAGCTTAAAGATTTTGCCGAGAAAGGTACCAAGATCTTTTCCGCTTATGCCGGAACCGGGAGGACTGGCCCTCAAGCACTCAAAGAAATGGGCGTGGTTATGGAGTCGGCAATGGATGCCGTGGGGGATAAAGACCTAGCAGCCACTGCGTCCTTTAACTTACTTAACGATCTACGTAACCCGAAAATTGCTAAGGTACTGGAAGCCAGTGGCGTCAGACTACGTGATATGCAAGGGAACATGCTCCCCATCAATAACATAGTTAAAGATATCGCTCAGCGCTCCGGTAAGGATGGCTCCAAGCGTCAGGATGAGCGGCTTGCCAAAGCGGGGTTTACCGATTACAGCCGATTACTCATTTCCAGCGTTACTACTGGTAAAGGGGCCGAAAACTTTGCTCGCTATAACGCGGTTGTCGCTGATGGTTCGGGCATTATGACCGACGCCAAGTATGCGGCGCAGGATTTCACCTCTGCAATGAGCAGTCTCAACGTCACCTGGAAACAGTTCGCTAACAACAATCTGGCAAAACCGGTTCAGGAACTGGCTGATGCTATCAACAGCATGGAGCCAGCAGCCGTTCAGCGCTGGCTGGAAGTAGGTAAATACCTCGCTATTGCTGTTGGAGGTGTTATAGCTGCGCGTAAAGCCTTCCAGATTGGGAAAGGCACCTGGGACTTTTTCAACACCGTCCGGGGGAAAAACGGCAAAGGTGGCGTAGCCGGTGGAATCGCTGATGTGTTCGGCTCTGGCGTGATGCCTGTCTATGTCGTGAACATGGGGGCCGGTGGTATGGGGGGCGGTATCACTGACGCGCTGGGTGAAGCCGGAGGTCGTGGTGGTGGACTCCCCGGACGGTTTGGTCGCCTTGCCCGTGGTGCGGGAAAATTTGCTGGCATAGCAGGTGCAGGGGTTGCGCTTTATGACCACCTTGAAAGCAACTACAGGCTCGATGGCCGGGTTGATAACCTGACCAAACAGGTTGTGGAAGATAAAAATGCTTCCGTGCAGGAAAGAGCCTTTGCTGAAGAAAGCCAGCGTAACCGTCAGGCTCTGGCGAATAAATGGAAGCAATGGTTTGGCGGTGACGATACACCACGAACAAAAGTTGTTGATCCGCGCCCGTGGGCCTCGATGGCTCCTGTCATCCCCGCCGTAAATTTCGCATCGGTGCCTGCGCCATCAGACCCGAAAGGTCCCACCATTCCACAGCTTAAGAGTGATGAGCACTCATTGTGGGCGACTATCGCCGACTTTTTTAAGGGGGCCAATACCACCATTGAAAGCGGTATGCCTTCGGTAGCTCAGGAGGACAAAGCGCCACAGCTTCCGCCTGTTCCTCCGAAGCTTCAGGGTGAAATCCGGGTGATTGTTGAAGGCGATGCGCGGGTTAAAAGCGTGAAAATGGATCAGCCAGGTGTCACGCTCAGTGCCTTCGCAGGCGTCTCTAATGTGGAGCAAAACTGATGGGTACGACGAAATGGGAAGACCTGCGCGAAGCGTCGTTTCGGGGCGTGGCGTTTTATCTGGTGGATAACGAAGGCACCAGCGGCCGTCGTGCCATCCCCCGCGCATACCCGAAAAAAGAAGTGGGATGGACCGAAGACAACGGCGCTGTACTGACACAGCAGCAGATTAACGGGAAGTTAATTGGCAGCAGTTACCAGTCACAACTGGAAGATCTGCTGCGTGCACTCAATACACCGGGACCGGGGGAACTTGTTCACCCGTGGTTCGGGATCCAGAAAGTCCAGGTGGGTAAAGTGAATCACCGCCTGAGCACACAGGAAGGCGGTATTGCGTATATTTCCTTTGAGGTCTCTGAAGCTGGCGAACGCCTGTTCCCCGCAGCGGCTGAAAATACCAGCCTGACCGTACTTAGAGGCGTGGACAAAGTGAAAGCAGCGCTTGAGAACGGTGATTTCTTTGCTGTGCTCGATGGGCTGGGCGAGATGGTCGATACCTTTCTGGATGACATGGAAGGAATGGTCGTTAACCTGCTCACCCTGCCATCGGCCATCACCGAGTGGATGGATCGGCTGGGTCGTTTCCGTGGTCTGGTTGATGTCATCGTTGCGAAGCCTGCGAACTTCATCAACGAAATTCTGGGGCTGGTCAGTGGTGTGCATGAGACCGTGACTGAACCGCTCTGGTCAATGCGTCTCTATGACCGGTTACGCAGCCGCTGGGAAGGTGCACAGTCAGAAGGTTCCGGGGCGGGTATCAGTCGTGCAGAAGCGGCGGCCACCCGCCAGTTACCACAGTTTATGTCCGTCACTCCGGGGTCAGTGGAAGGAGGTGTCGGGGGATTTGCCAGCAGTATTCCCACAGTGGCAACCACACCATCGCCAGCGATGCAGGCCAACATCACCGGGTTTACGCAGGTGGTCGTACTGGCAACCCTGCTGGCACAGGCGGAAACCATCGCGCAGACGACGTTCCGCACCAGCGAAGAGGCTGTCAGTACCGGTGATGCTCTGGCCGTTCTGCTGGCTGAGCAGGCCGTCATTGCCGTTGAAAGTGGTCAGCGTGAACTCTGGCGCACGCTTCGCGATCTGCGTTTTGCCGTGGTGAATGACGTGCGCATCCGTAGCGCCAGACTGCCGCAGACGCGTCTGCTGTCTCCGACGATCACCTCTTCCGTCTCACTGATAGCCTGGAGGGAAACCGGCAACACAGAGAACCGGGACACCATCACACTGAGAAACCGGCTGCGCGACCCTTCCTTTATCCTGCCGGGTAAAACTATAGAGGTAACTGAATAATGGAATCTGTTGTTCTGACGGTGGATAGTCAGCAGTGGGACGGCTGGACGGAAATGTCGATCACGTCCTCACTGGAGGCCATCGCCGGAGAGTTCGATCTGACTGTCACCACGCAGTGGTCTGAAGCATCCCCCCGCGTTATCCGGCAGGGTATGCCCTGCACGGTGGCTCTTGGCAGCGATACGGTAGTCACTGGTTATATTGATGATTTTATTCCGAGTTATGACGCTGAGAACGTGAGTATCCGGGTCACCGGCCGTGACAAAACCGGCGATCTCGTTGACAGTTCTGTTGTTCACAAATCCGGGCAGTGGAAAGGCGTTCGCCTGGAGAAACTGGCGGAAGAAATCTGTAAACCCTACGGTGTCGCCGTCATTAATGAGACCGACACTGGTGAAGCGTTCCCCTCCGTTGCTCTTGAACAGGGTGAAACCGCCTTTGACTTGCTTGATCGGCTGGCGAAGCAACGCGGCGTTCTGCTGACCGCTGACGGGCTTGGGCGTCTTGTTATCACCCGTGCATCAACAAAACGCGCTGGCGTTGCTCTGGTGCTGGGGAAAAATATCCTTGCAGCGCGTGGCCGCTTCAGCTGGCGTGAACGCAACAGCCAGTACATCGTCAAAGGCACCACCAGTGCCGGTGGCAGTACATGGGACGAACAGCCCGCAAAAGTGACCGGCGGGCGTCAGACCATCGTTGATGACAACGAGATCAACCGCTACCGCCCTAAAATCCTCGTAAACGAAGACAGCCTGACCGTGGGCGGCGCAAACACGCGCGGTGAGTGGTTCAAGGCCCGAATGCTGGGCGAAGCAAACAGCACTGAAATAACGCTGGCAGGCTGGCGCGAGAACGGCGACAGCGGCCCGCTCTGGCAGAAAAAATCAACTGGTCGATATTGATGACCCGGTGCAGAACCTGAAGACCACCTGGCTGATAAAAACCGTCACGTTCACCGAAGGTGACAACGGCCGAATCTGTGTACTGACGCTGGTGCCTCCGGAATCAATGGATCTTCCTCTGACCGATGCGAAGAAGAAAGGTAAGAAGGCGAAGAAAGGTAAAACGGTGACGACATGGGACTGAATCCGGCAAATATTGGTCGCACGCTGACGGGTCTGGGGCGACGTCTTCGCCTGATGGTTGATCGGGCTGTTGTGCGAATTGTCACCGACAGTCTGGGGCGTCAGAACCTTCAGATCCAGTCGCTGGCTGACGCCACTAACGACGATGTTGAACGCTTCCAGAACTACGGTTTTACGTCAGTTCCGCCAGTGGGTTCTGAAGCCATCGTGCTTGCAGTCGGAGGGCGTCGTGAAGGTCTGGTGGCAGTTGCCGTCGAAGATAAACGCTGTCGTCCGAAAGGACTGAAGGACGGTGATGTCTGCATCTATCACGCAGACGGTCAGGCTCTGGTTATTCTGAAGAAAGACGGCGTGGCAGAAGTAAGAGTAAAAACGGTTAATTACACCGCCACTGACTTATTCGAGATAACTACAGCTCAGTTCAAAGTAAACGGGCCGTCAGAATTCTCGGAGGATATTGTGGTAGGTGAAAAATCCTTCCTTGAGCATTTCCATATAGACGGTGACGGCGAAAAAACATCGGAGCCGAAATGACTATCGGGATCACCTGGAATAACCAGCTGTCGCGCGGCGAGCTGACGGTGACGCATGATGGCCTCACTCTGGATGAGGGGCTGGTCACACTGGTGTTGATATGCCTGTTCACCGATACCCGAGCTGATGACGATGATGTCATTCCAGATAACACCGGCGACCCGCGCGGGTGGCCGGGGGACACCTTCAGTGCGTATCCGTGGGGTTCAAAGCTCTGGTTACTTGACCGCGAAAAGCTGACAGAGACGGTGCGCCAGCGTGTTGAGGATTATGCCAGTCTTGCCATGCAACCCTTATTGCGTTCGGGTTATGCCAGAACAGCCAGTGTGACGGCGGTAATCAGTGGTGCTGACCGCATCAATTTTATTGTCATCCTTAGCCGCCCGGACAAGACGCAGTTGCGTATTGAAATCAGTAAACGTTGGGAGGCGACAGAGCATGCCCTTTGATATTCCGGCGCTTCGTAAGCTTATCGCCGACGGTGAGAAAGACATTGCGATTGAGCTGGGTCTGCAAACACTCCCGCCAGTGGGTGTGGAGAAGGCACTGAATGTGACGTTCAGCAGTCAGGTACGCGACCTTTATGACCATCAGAGCTGGATAAAAGACCAGATCATCCCGTCAGTAAAAGCGGATGACGACACAATTATTGAGATTGCAGCCAGTGAGGGTGTGATCCGTAAGCAGGCGACATTCTCTGGTGGCCCGGTGATATTCCCCGGACTGGCGAGTATTCCGGAAGACACCGAGATGCAGACATCATCCGGTGTGCTGTATCTGGTCGTTGCATCCGGGATGCCGCAGAACGGCCAGGTTATGGTCACAGTGCAGGCCAGCGACGCTGGTGTTGCCGGTAATCTTCCTGAAGGCGAGACCATGACGCTGCTCTCTCCTGTTCCCGGCGTGGAAAGTGATGGCATTGTGGGTTCTGGCGGGCTGACCGGAGGCGCTGATATTGAGCCTGTAGCCGAGGTGCTTGACCGCCTGCTGTACCGTAAGCGCAATCCTCCTGTTGGTGGTGCACTGCATGATTACGTTATCTGGGCGCGTGAAATGGCGGGCGTCAGCCGCGCATGGTCGTGGGACGTCTGGCATGGTCCGGGTACTGTTGGTCTGGCATGGGTATACGACGGCCGTGAAGATATCACCCCAACGTTTCAGGACAGAGCCGATATGGAAGCTTATCTGTTTCGTCACGCTGATCCGGCAACGGGTAACTTCGTCGGCAAGCCTGGCGGTATTGAAGTCTGGCCGGTTGAACTTCATCTGAAGCCGGTGCCGCTTGCTATCCGGCTGACGCCGGACACTCAGGCCACACGTCAGTCAGTAGAGGCCCGGTTGCTGATCCTCCAGCAGACAATGGCACCGGGTCAGACAATGGGCGTTTCTGCACTGCGTACTGCAATTGGTACGGCTTCAGGCGTAACGGATTACACACTTGATATTGATGGAGATATTACCTGCGATCAGAACGAACTGATAACCGTTGGGGTGATTACATGGCTCACAGCGTAGATGAATGGCTGGGAGCGCTATGGCAGGTCATGCCACGAGGCAAGGCATGGTCGCGTGATAATGACAACGATTTAACGCGCTTTTTACGGGCATTAGCCAGGCGTTTAAGCCAGGCTGAATTTGATGCGGAACGGCTGCTGCCGGAGATGCGGCCAGAAACCACCTTTTTATTGCTGGAGGAATGGGAAGAATATCTGGAGTTACCTGAATGTGAGCAGGCATCCGGCACAATAGAGGATCGCCGTCGCGCTGTAGTGGAGAAATACCACCGTAAAGGCGGGCTGGCCCCGTGGCAGATTGAAGCGGTTGCTGCGGCTCTTGGGTTTACTATTCGCGTGAATGTGATCCTTCCTCACCACTGCCTGCGAAGTTGCATGTATCCACTTTATCCGGCGCGTTATCGCTGGATTTTACAGATTGATGTGCTCGGTATTAGTGGCGGGCGTTTTACGTGCATTGATAACGTTATGACGCCTCTGCTGAGTGATCGTGCCCGCGAACTGGAGTGTGTGATGACGAAATACCGGCTGGCCGGAACGGCCTACGATTATATTTATTATGCAGGAGATAACTGATGTTTTATGTCGATAACCCGACAGGCGTTCCGGTCATGCCAGAACCGTCGCCAGTCAGCAGCCTGACCGATTTGTTCTTTACTGAGGGTGGTAACGGCGTACCTCCGACTTATCCGGGGCCTGACTGGTTCAATATCATTCAGAGTGAGCTGATTAATATTGTCAGAGCCGCAGGGCTTGATCCTGACAAAATGGACAATACGCAAATTCTGGCTGCACTTAAAAAGCTGTTCCTGCAACGTCAGAATCCGTTCGGTGATATTAAGTCCGATGGCGCAGTTGCAACGGCTCTCGCAAACCTTGGTTTGGGGGAGCTGGCAAAAACGCCCCGTTTCCTCGTAAGTAAGGGGCAGAACGCAAACGGGTGGTATGAGATTTATAGCGACGGCTTTAAACGAGTAGGGAAAACGTGGGATAGCTCAAACCCGCTTTTAATCAGCACCCCTGCAACGGGAGCAAGAGTTAACTATCCGATCAGTTTTACGACGCAATTTAACGGGATCTATGTCACTGAGAACGGGAATACGAATAACAATTTTGAGTTTGCTAACCCCGCCCAAATCGGCATGACGGGCTTTTCTATGGCAACTATGGACATAACGCTTGGCTCATCACCATCAACGGCCTACGGAACGTCATTTACAGGGTATTACACTGCGGAGGGATATTAATATGTGGTACTGGAATCCGGTTGACTGTAATGAGGCTTTGCCGGACATACATGACTTATCTGGTTGTATTGAGATTGATGATGATGAGCACCCGTTTAAAACGGGAGATTTACCTGCTGGTAAAGTGTGGTCGAACGATGCCAGCAATCATCCGGTATTAACAGACACTCCCCCTCCATCACCTGAAGAAGAAATAGCGGAGGCCGAAGCAATAAGAAGCCAGTTACGCGCAACCGCAGATGCCGAAATAATCTGGAGGCAGGACGCTGTTGACGCCGGAATCGCGACAGAACGGGAAACCACAGAACTGGCTGAATGGATAAAATACCGGGTTTTGCTGATGCGCGTTGATACAACAAAACCCGTATGGCCTACAGTTCCGGGGGAACAGGCCAGTTAATATCCGGTGCGCTGGATGTATCAACGGCTTCCAGCACATCAAGGTAATCCAGCCACAGATTGTATTGCGACAGTTCTTCGCCTTTCAGGCGGCCAATTGCAGCTTTACCGGGCCATTGTTTGTTATTGATGCAGGTATTAGCCTCATTGATTAATTGCTGTCGTTTGCTTTCTGCGGCAGCAATTAACTCCTCGGTTGTCGGTTCTGGTAAATCTACCCAGCACGGCATGCCATCGTCTCCAACTCCACGTTCTTTCCCGTCAGGTGGTGTCGATGTGTATTCGATAAATAAATTATCGCTAATTTCCACACCATCTTCGGGCCAGTTACCAGACAGTTCATACTCTGTTTTTAATACAGTCAGATAAAACGCGTTATGGGAAGGGCTGTATATATATTTGCTCATTATTAGAATCCTACCGCGAGATAATTTGCAGCTTCCCCGGCTGTAAACGCCGATGCCCAGGGTTTTGACGCCAGCGCCGCAAAACCGGTTGTGGTGGTGGATGTGATCTCTTTTGACCACACGCATACAGAACCAATGCCGGATGCTTCCCCGGTTGTTTCTGTCAGGCCTACATAAAAAGGGGTTCGCGTAAATGAAACAGGAAACGTAGCAGTAGCTTTTCCGTCACTACCTGTTGTGCTCGTAGTTCCCCATTGAATGATAAGAACCTTTTTTTCTCCGCCAATAATCACGGGAAGTGTGGCGTATCCATTTGTGGCGATCAGCCCATTGCCAACACCCGCTTTGGCTAAATCTCCCAAACCAACGTTTTCGATAATGATCGTTTCTCGCTCAAATGGCATTATTCACGCCATTTAAAGCGAGTTTAACCATGCTTATTGGCTATGTACGTGTGTCAACAAATGACCAGAACACTGCGTTGCAGCGTAACGCGCTGGAGTGTTCAGGATGTGAGCTGATTTTTGAGGACAAGATAAGTGGCAGAACATCGGACAGACCAGGACTCAAACGCGTACTCAGAACGCTATCTGAAGGTGATACTCTCGTGGTCTGGAAGCTTGATCGCCTCGGACGCAGCATGCGGCACCTTGTCATTCTGGTGGAGGAGTTGCGGGAACGGGGAATAAATTTTCGCAGCCTGACCGACAGCATAGACACATCATCACCTATGGGGCGCTTCTTCTTCCATGTGATGGGTGCTCTGGCCGAGATGGAACGTGAACTGATTGTTGAACGTACCCGTGCAGGACTGGCGGCAGCACGAGCGGAGGGGCGTGTTGGAGGTCGAAGACCCAAGTTAACACCTGAGCAGTGGGCGCAGGCTGGAAGGTTGCTAGCGGCCGGTGAGACCCGCCAGAGGGTGGCTCTCATTTATGATGTCGGAATCTCAACTCTGTACAAACGATTCCCGGCGTCTGACAGATAAATCAGGATCGCACGGGCGATCCAATTTTAATTGTCCAAGTGTCGCGATTTGTTTGGCGCGCGGCAGTAGGTGCCATAGTGGCTAAGGAAACCGCCGTAACAGTTCATCAGGCGGGCCACCAGAGAGGCGTAAGGCGAGGAGCGGGTGATGTTGCCGCCGACGATACCGGAGGTGTAGTTAATGTACACCGCCTCGTTACCGTATTTATCGACCGTGTTTTTCAGGCTGTTGCTGATGGTATCCAGCGCTTCTTCCCAACTGATACGCTCAAATTTCCCCTCGCCGCGTTTGCCTGTGCGTTTCATCGGATAGTTGAGGCGATCAGGATGGTTAATTCGTCGACGGATTGAACGTCCGCGCAGGCAGGCGCGCACCTGGTGATTACCATAGACATCTTCGCCGGTGTTGTCGGTTTCCACCCACCAGACTTCATCATCTTTGACGTGGAGACGCAGGGCGCAGCGGCTACCGCAGTTGACCGAACATGCTCCCCAGACCACTTTGTCTTCAACGGGCTGGATTGCCTGCTGCACGGCAGCCGCCGCGGTACGCATCCCGAAAGGCAAGGAGACCCCTCCGGCAGCGAGCGCCAGCGATCCTATCGCGGTGGATTTCACTAACGTCCTGCGGCTGATGCCTCCCTGTTGTTCAACATCGGACATAATTCACTCCATTATAGTTATCGTTATATAATTATGTTTATAACGAAATTAATAGGATACTAATGAAAGAGTGATTTTAACCCCTTATAGGTAGGGGTTATTAATCCATGTCAAAATTGAGGGAATCTTTCTAAGAGGGAGTAAGAGCCTGGTATATCAGGCTCTTATCTGTTACTGAGGTTCGACAGCATGCCCGTCCTGAGAAGGACTGGTACTGGCGGGGGCGTTTCCGCTGCTGGTGCGGGTATACAAAATTTTATGGGTATCGTTTGCACAGTGGCCTACGACCTGGGAATCGGGCTGATCCGCCTGGTCATTAGGAACAATAGTTAACGTAAAACTGGACTCCGCCACACCATTGTTAATGATGCGCTGTTCGATGTCGCTTTTGACTCTTTCACATGACTCCGGCGCCGCCAGTACCTGCGGGGAGGCGACGATGAGCAGACACGCTGCAATTCCTGCTGAGATTTTCATCATTCACTCCTTTTATGAACACGATAATGAAAGCTTAGCATTTCTAGTGTAAACAACTGTATTTGCTAATATGATTGAGAATCATCTCCTTATCCTGGTGAATAACGTGAAGAAACAGACCCTGGTGAGCTTTCTGTGTGTGCTGCTTGTTGGATGTGATAACGCCACTGTTCTGGTCTCCTTTACCCCGGAAATGGCCAGCTTTTCCAATGAGTTCGATTTTGATCCGTTGCGCGGCCCGGTTAAAGATTTCAGCCAGACGTTGATGAACGACAAAGGGGAAGTGACGAAGCGCATCAGCGGAACGCTTTCGCAGGAAGGCTGCTTTGATACGCTGGAACTGCACGATCTGGAAAACAATACCGGGCTGGCGTTGGTACTGGATGCAAACTACTACCGCGATGCCGAGACGATGGAGAAAAGAGTGCGCTTGCAGGGGAAATGTCAGTTGGCGGAGTTGCCCTCGGCAGGCGTGGTCTGGGATACGGACGATAACGGTTTTGTTGTTTCCGCAACGGGAAGAGAGACGAAGGTACAATACCGCTATGATGCGGAAAGGTATCCGTTGGGTAAGACCACCATCAGTAAAGACAAAACGTTATCCATTGATGCAAAACCGTCGGCGGACCCGCTTAAAAAGCTCGACTATACGGCGGTCAGCCTGCTGAACGATCGCCCGTTAGGCAATGTGAAGCAGACGTGTGAATATGATAATTACGCCAATCCGATCAACTGCCAGCTTGTGATCGTTGATGAAAGCGTCGAACCCGCCGTTGCGCGGAGCTATACCATTAAAAATACGATTGATTATTTACCCGGCTGCGTCTGACACAGCCGGGTAAGCGGCTTACTGTGCGGTAGGCTTCAGCAGGCTTGAGCCAGGCGTTTTGTGATCGTCCAGATATTGCTGCTGGAAAATGCACATACGAATGGTGTTGCGGTATTGGGCCATTGATAAAGAACTCATGAATCAGTTCACCTTCAACCATAAACCTAACTTACGGTAAATGTGAATGGCTTTTTCGTTTTCTTTATCAACGATCAGATACAGCTTATAAAGGTTGAGTACGGTGAATCCGTAGTCCATCGCCAGCTTCGCTGCGCGCGAAGCCAGCCCTTTTCCCTGGTATTCCGGCGAGATGATGATTTGAAATTCAGCCCGGCGGTGAACGTGATTAATTTCTACCAGTTCGACCAGACCGGCTTTTTCGCCGTTGCACTCCACGACAAAGCGCCGTTCGCTCTGGTCATGAATGTGCTTATCATACAGGTCAGACAGCTCGACAAACGCTTCGTACGGCTCTTCAAACCAGTAACGCATTACGCTGGCGTTGTTATCAAGCTGGTGGACGAATCGTAAATCTTCGCGCTCCAGCGGGCGCAACTTAACACTGTTGGCGTGGGTCATTTTGTGTCCTTACCGTCTGTCGCTCAACGCACTGTTATGGCGCAACGGTACGGCCAGTACGACGATCCAGGCAACGCAGCGTATTCGGTTCCCAGTACGCGTTAAGGTTGGCGCTCTGCTCGCACTTGTCGCGGTTATCAAACGCGGCGTCGGCTTTGTCCCACTCTTTCTCCGTGCGCGTGTTCACTTTCTGGCGCAGGCTGCGGGTATCATTCCATTGTTCCTTTTCCATGGCTGCGTGTTGGCGGCTTTGCGCGCTGTCTCCAGATTCAATGATCAGTTTGCTGGTATTCGCGAATGCGGAGGTGGTATACACGACAGCACCCAGCGCAAGCATTGCCGTCAGGCACAGGCGTTTGCTAAGTGTATTGTTCATGGTTAATTCCTTTCATGATAGAGGGATAAAACTTGTCTAAATTCTACACCATTCCTGGCGCAGGGCATACCCACCCTGCGGGTATGGATACGAAGGCGCTATTCTGGCGTATCATGCTTAAACCTGACGTAACGCAATGATGATGATGCTAAAAACAACACTTCTTTTTTTCATAACCGCACTGTGCGAAATCGTGGGCTGTTTTCTGCCCTGGCTATGGTTGAAGCGGGCGCAACGGCCTGGCTGCTGGTTCCGGCGGGAGTCTCACTGGCGTTATTCGTCTGGCTGCTGACGTTGCATCCGGCCGCCAGCGGCCGCGTCTATGCGGCTTATGGCGGGGTCTATGTCTGTACTGCGCTGTTATGGTTACGCTTTGTTGACGGCGTCAGACTCAGTCTTTATGACTGGTCAGGCGCACTGATCGCACTCTGTGGAATGCTGATTATCGTTGCCGGGTGGGGGGCGCGCTTAAGCGCCTTATTTTGTGATCGATGAGCGATTTTTTGATCATTATACTTGTATGGCAGTAGTTCAGGTGTGTAAATTTCCTGCATCGCACAGAAGAGATGTAAGGAACAACAATGAAGATTGTAGGGGCTGAAGTTTTTGTCACATGTCCGGGACGTAACTTTGTCACGCTGAAAATCACCACCGAGGACGGTATCACCGGCCTGGGCGACGCTACCCTGAATGGCCGTGAGCTGTCGGTGGCATCCTATCTGAAAGATCACCTTTGCCCGCAGCTTATTGGCCGCGATGCGCACCGTATCGAAGATATCTGGCAGTTCTTTTATAAAGGCGCCTACTGGCGTCGCGGGCCGGTCACGATGTCTGCCATTTCAGCCGTTGACATGGCGCTGTGGGATATCAAAGCGAAAGCCGCCAACATGCCGCTGTATCAGTTGCTGGGCGGCGCATCCCGCGAAGGCGTGATGGTCTATTGCCACACGACGGGTCACACCATCGATGATGTGCTGGAAGATTATGCGCGTCACAAAGAGATGGGGTTTAAAGCGATTCGCGTGCAGTGCGGCGTGCCGGGTATGCAAACCACCTACGGCATGTCTAAAGGCAAAGGCCTGGCCTACGAACCCGCTACGAAAGGCCAGTGGCCGGAAGAGCAACTCTGGTCAACCGAGAAATACCTCGATTTCACGCCGAAACTGTTTGACGCCGTACGTGCTGAATTTGGTTTTGAAGAGCATTTGCTTCACGACATGCACCACCGCCTGACGCCGATTGAAGCGCCCCGTTTTGGTAAGCGCGTTGAGGATTATCGTCTGTTCTGGATGGAAGACCCGACGCCTGCGGAAAACCAGGAATGCTTCCGTCTGATTCGTCAGCATACGGTGACGCCGATTGCCGTCGGGGAAGTGTTCAACAGCATCTGGGACTGCAAGCAGCTCATTGAAGAGCAACTGATTGACTATATCCGCACCACCATTACCCATGCCGGCGGCATTACGGGGATGCGCCGCATTGCGGATTTTGCCTCGCTTTACCAGGTGCGCACGGGGTCTCATGGGCCTTCTGATTTGTCGCCGGTTTGCATGGCCGCAGCGCTTCACTTTGATCTGTGGGTGCCTAACTTTGGCGTTCAGGAATACATGGGGTACTCCGAACAAATGCTTGATGTCTTCCCGCATAGCTGGACGTTCGATAACGGCTATATGCACCCAGGCGACAAGCCGGGTCTTGGTATTGAGTTTGATGAAAAACTGGCGGCGAAATATCCGTATGACCCTGCGTATTTGCCGGTCGCCCGTCTGGAAGATGGCACGTTGTGGAACTGGTAA
Protein sequences of DBSCAN-SWA_12 >LR134204|4726740:4784003|4757520_4757808_+|VEB94876.1|DBSCAN-SWA MQEILNSDQRLVILRSLVECGDSANESILQTCLQTYGHRVSRDTVRTHLAWLREQGLVTLSDVSGCYVAAITGRGEDVAFGLATVPGVKKPRARE >LR134204|4726740:4784003|4781270_4781375_-|VEB94907.1|DBSCAN-SWA MAQYRNTIRMCIFQQQYLDDHKTPGSSLLKPTAQ >LR134204|4726740:4784003|4729917_4730607_+|VEB94837.1|DBSCAN-SWA MQQEIDNAYIVRTKAVSGCFSMFMLMFGITFTPLFFTRSAQLMESGLLLPLMFVLEFLILIPLYYLFFRKRDGLGKGTLSAKWFVILFGAILIIQFLLPAMLGMRKTEAWVMTQVSLHNYAFWLTNLSLIFLVPVYEEIVFRGCLFNAFQYWFNDKVWATSLVVSTLFALMHTQYADIRTLLMLFLISQVLIVARVKSKGLLMPVTLHILMNATVIGIQYGVQVLLSSG >LR134204|4726740:4784003|4769772_4771812_+|VEB94893.1|tail|DBSCAN-SWA MAGNRLSTEILINLAGNLQAKARQYGASMSEFASRNQRAMSIVRATSEAAGRGLDRLGNRYTGLIASVAGGAALREFAKTDRMLTELGIAAGKTREEMRKIFSDTQDASIKFRVDDSEVMAAISNVNKMTGDLDFGVSNKDMMAASIAASGSDGESIGGLFASFQKFKTKNEHENLLAMDLLNQLGKEGGFELKDFAEKGTKIFSAYAGTGRTGPQALKEMGVVMESAMDAVGDKDLAATASFNLLNDLRNPKIAKVLEASGVRLRDMQGNMLPINNIVKDIAQRSGKDGSKRQDERLAKAGFTDYSRLLISSVTTGKGAENFARYNAVVADGSGIMTDAKYAAQDFTSAMSSLNVTWKQFANNNLAKPVQELADAINSMEPAAVQRWLEVGKYLAIAVGGVIAARKAFQIGKGTWDFFNTVRGKNGKGGVAGGIADVFGSGVMPVYVVNMGAGGMGGGITDALGEAGGRGGGLPGRFGRLARGAGKFAGIAGAGVALYDHLESNYRLDGRVDNLTKQVVEDKNASVQERAFAEESQRNRQALANKWKQWFGGDDTPRTKVVDPRPWASMAPVIPAVNFASVPAPSDPKGPTIPQLKSDEHSLWATIADFFKGANTTIESGMPSVAQEDKAPQLPPVPPKLQGEIRVIVEGDARVKSVKMDQPGVTLSAFAGVSNVEQN >LR134204|4726740:4784003|4759023_4760607_+|VEB94880.1|terminase|DBSCAN-SWA MTAIPPASAVTSLSAAAILSGEFDKSQLLLPYQKRWIADSAQLKIAEKSRRTGLTWAEAADAALNGSMSVEAGGCDTFYVGTTKDMAREFIDACAMWAKAYDRAASDIGEEVLKDEDKDILVYVIQFASGYKIKALSSNPSNLRGMQGNVIIDEAAFQADLAAVLKAALALTMWGNNVRLISTHNGIDNLFNTIITDSRAGKKRYSVHRVDIETAIAEGLYKRICQVTKKEWSSEAEAEWLANLLSDTATVEDAREEYYCEPKNGGGVYIARSLRERAARGPSVVLRFTGTPEFNALPEGLRRLDMQEWLETVVRPELEKLPQNLRHCLGEDFARNGDLTVFAPVTVNDDTTREVPFLVELSNVPFKQQEQALFFICDLLPRRDGIKLDARGNGQYLAEQAAERYGDEVEQVQLSVPYYRENMPRFRAAFEDNELVLPKHEDVITDLGAIQLYRGVPGIDDARTTGTDGRKRHGDSAVAIFLGFLASREDCRRYEVHKLKKPSRPDERNEHRQVRITRGLKNQRGLL >LR134204|4726740:4784003|4769290_4769686_+|VEB94892.1|DBSCAN-SWA MATIEFDLIHGLRTGAGTTDEAMHKTVRLRELTTDDIVDSQLAAERVVIGENGKAVAYCSEVLVGLEMLRRQIASIGFIPGPLDMKQLRRLHPDDLNLINEKAAALDDMLREVAERGRADAAGSGTDPSAD >LR134204|4726740:4784003|4732266_4733166_+|VEB94840.1|DBSCAN-SWA MNIELRHLRYFVAVAEELHFGRAAARLNISQPPLSQQIQTLEQQIGARLLARTNRSVALTAAGKQFLADSRQILGLVNEAAARAERLHQGETGELRIGFTSSAPFIKAVSDTLSLFRQDYPDVHMQTREMNTREQIAPLNEGALDMGLLRNTSLPDTLDWEVILHEPLLAMIPREHPLAQKPVVTLAELAKEPFVFFDPHVGTGLYDDILGLMRRYHLTPAITQEVGEAMTIIGLVAAGLGVSILPASFKRVQLDEMRWVPIAEADAVSEMWLVWPKHHEQSHAAQRFHHQLLTAAKRA >LR134204|4726740:4784003|4728926_4729742_-|VEB94836.1|DBSCAN-SWA MSNKSHYQQLTRTFQRLSRFSHLSAIASWDMFTMMPPGGSTARGEALAELNVLEHQLLTDPKVAQWIAAAGQEDLNDVEQANLREMARRHHQAALLPESLVEAKSLAGSRCEHAWRSQRPANDWQGFAQNLKEVVKYSREEAKLRAEAKGCSPYDALLDIFEPDMTSAQLDVLFADVKTWLPDLLNRAVSKQSHQPLIAPVGPFPTALQRELGLETMSQLGFDFNVGRLDISAHPFCGGVPEDVRITTRYDENELLSALFGVIHETGARPL >LR134204|4726740:4784003|4780493_4781252_+|VEB94906.1|DBSCAN-SWA MIENHLLILVNNVKKQTLVSFLCVLLVGCDNATVLVSFTPEMASFSNEFDFDPLRGPVKDFSQTLMNDKGEVTKRISGTLSQEGCFDTLELHDLENNTGLALVLDANYYRDAETMEKRVRLQGKCQLAELPSAGVVWDTDDNGFVVSATGRETKVQYRYDAERYPLGKTTISKDKTLSIDAKPSADPLKKLDYTAVSLLNDRPLGNVKQTCEYDNYANPINCQLVIVDESVEPAVARSYTIKNTIDYLPGCV >LR134204|4726740:4784003|4755993_4756401_+|VEB94872.1|DBSCAN-SWA MFKSINTDWLKPALLRVYQSGWTLLIFVGLLLVFCNFHGRHAFLVWWLAFTGVVLVGFSIFLGNLPYRLIKPEMHVSRFASFWSWVIWGVGFILICLSPLYAEPLYLLFLDPAGAALGVLFCQWVRRKGLLAWIQ >LR134204|4726740:4784003|4775167_4776244_+|VEB94898.1|tail|DBSCAN-SWA MPFDIPALRKLIADGEKDIAIELGLQTLPPVGVEKALNVTFSSQVRDLYDHQSWIKDQIIPSVKADDDTIIEIAASEGVIRKQATFSGGPVIFPGLASIPEDTEMQTSSGVLYLVVASGMPQNGQVMVTVQASDAGVAGNLPEGETMTLLSPVPGVESDGIVGSGGLTGGADIEPVAEVLDRLLYRKRNPPVGGALHDYVIWAREMAGVSRAWSWDVWHGPGTVGLAWVYDGREDITPTFQDRADMEAYLFRHADPATGNFVGKPGGIEVWPVELHLKPVPLAIRLTPDTQATRQSVEARLLILQQTMAPGQTMGVSALRTAIGTASGVTDYTLDIDGDITCDQNELITVGVITWLTA >LR134204|4726740:4784003|4754574_4755108_+|VEB94869.1|DBSCAN-SWA MKFIGTLCVIFTIGWAFITLTKNDKPLSVNEKLIAQFANQRLLKKDEWNTGNGVVDGVQVYTARKEYPAFASLWRLGVKDAGVTVLTSGTYPEFEPVLAMGQCKNLAIAVFDSDSKPVNDAVSTIFTTATETYKKEGKKVQATGDIGNLPFRVTVQNIDSVLTFSCDIDLSHYTSSI >LR134204|4726740:4784003|4755186_4755780_+|VEB94870.1|DBSCAN-SWA MNNSPESPAFRHALLFVLQFEGGYVNDPSDRGGETNFGISDKRDGVADGMTDVDGDGKPDTRIRDLTVDQAGQIYFRDYWFPAYCPEWADGISLFLFDSAVQHGVKKAVQMLQEAAGVAADGIVGAKTRAAIASYDPQYLLARLFLRRSRYYADIIKSNASQGKYLNGWFNRLDALVNACMEVLDDGNLDVIASPRS >LR134204|4726740:4784003|4758445_4759027_+|VEB94879.1|terminase|DBSCAN-SWA MASEQRPTRGRPSKIDLLPDSIRDQLHQMLRDKRHTQEEIREAINELIDSHDLPEDMQISRTGLNRYASRMEEFGAKIRASREMAEIWAAKLGSAPTSDVGKLLLEFVKTLAFETSMDMADSGKTVEPKALGQLALVAQRLEAAAMASHKREKEIQQEFAKKAAAAAESITRSAGLSAETAADIKRQILGIAE >LR134204|4726740:4784003|4748323_4750306_+|VEB94859.1|transposase|DBSCAN-SWA MDFWVSVKECIGVCGFPQAESNARKKLEDLVCGRSELRRKRAGTKAFEYHISVLPPEVRAELLAGRGLIETSSGLITLPQEPERVAADDLERQRLWSAWEKATGEQRLHAERRTKAAALVAELMASGVGNRKAIALAAKQLQISEGTLRNLYYKVKDHSPDLWGPVLLDRRVREKRQTGRTADISEEAWQFFLGDYLRNEAPFFSKCYERLEKAADAHGWIIPAERTLRRKLEREVDPRIVVATREGENALAQMYPSQQRTVAHLHAMEWINGDGYQHNVFVRWFNGEIIRPKTWFWQDVHSRKIIGWRTDVSENSDSIRLSLMDTIRAYGKPKHVTIDNTRAAANKWLSGGVPNRYRFKVKPDDPMGIIPLLGMKLHWTGVIGGKGWGQAKPVERAFGVGGLGEYIDKHPGLAGAFAGENVSAKPENYGSRAVDVEEFLETISEGVAMFNAKTERETEMCRGELSFDQAFERSYSQSVITRMTEEQIRQLMLPAEAVRVKPTGEFTMECGGSLFGRKNTYWSEQLVSHRSHKITVRFDPRNLHGEVACYDLDGRFLCMAECRAAVAFGDTEAGREHNRARREMIKSTKKATKALNRMTAIEVNDLLPKTEHAELPERHVVERVFTLGNTVKRVEDIQESQSENDVIFQQFVNKAKQARK >LR134204|4726740:4784003|4781393_4781831_-|VEB94908.1|DBSCAN-SWA MTHANSVKLRPLEREDLRFVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECNGEKAGLVELVEINHVHRRAEFQIIISPEYQGKGLASRAAKLAMDYGFTVLNLYKLYLIVDKENEKAIHIYRKLGLWLKVN >LR134204|4726740:4784003|4734640_4735336_+|VEB94842.1|DBSCAN-SWA MLKRFFITGTDTSVGKTVVSRALLQALASGNKSVAGYKPVAKGSKETPEGMRNKDALVLQSVSSIELPYEAINPIALSEDESSVAHSCPINYTLLSNGLASLSEKVDHVVVEGTGGWRSLMNDLRPLSEWVVQEQLPVLMVVGIQEGCINHALLTAQAIANDGLPLIGWVANRINPGLAHYAEIIDVLSKKLPAPLIGELPYLPRAEQRELGQYIRLSMLGSVLSVDRVLA >LR134204|4726740:4784003|4753428_4753638_+|VEB94866.1|DBSCAN-SWA MVIKFPEKVFELGQWWYRTAVEFEIDGIRNTAFFYSLNRYDAACRLAAIKENGVLGGEDTAFIDTGNGG >LR134204|4726740:4784003|4751600_4751855_+|VEB94862.1|DBSCAN-SWA MAKIIAYAWASGLIELGPELPDGALPIITGEENRIRDLINIWARHSRTGEQLLVPGVPEAQNQHEGCNALMTFTETITREYLEK >LR134204|4726740:4784003|4782423_4782702_+|VEB94910.1|DBSCAN-SWA MVEAGATAWLLVPAGVSLALFVWLLTLHPAASGRVYAAYGGVYVCTALLWLRFVDGVRLSLYDWSGALIALCGMLIIVAGWGARLSALFCDR >LR134204|4726740:4784003|4780117_4780423_-|VEB94905.1|DBSCAN-SWA MKISAGIAACLLIVASPQVLAAPESCERVKSDIEQRIINNGVAESSFTLTIVPNDQADQPDSQVVGHCANDTHKILYTRTSSGNAPASTSPSQDGHAVEPQ >LR134204|4726740:4784003|4766114_4766657_+|VEB94887.1|DBSCAN-SWA MGITVEVTGAEKLQTIRKAMEKLADSSLRQELLESIGAVAESQTRRRIASEKSSPAGAKWQDWSDNYAKTRHGNQSLLQGNGDLLDSIQYFVSGERVHIGTPLPYGKTHQEGFSGSVAVSSHKRLITQAFGRALKHGVWQTVGAHQRQMDIPQREFLGLSADNSNELTSVIGDFWSEVLK >LR134204|4726740:4784003|4753642_4754167_+|VEB94867.1|DBSCAN-SWA MSRANLIKLIHVARRKLQLDDDTYRYALHRVTGKTSCRELKVAQLEAVLKSLEDKGFRRTRPRSPARRHRETDVSAKVRGIWQQMHKDGFTHDGSDTALDAFVAKMTTRTNDGQGIASLAWCRGDDLLMVLESLKQWHIREMKVALANNGCFPVKRGYDAINDVYTRKVRKGAS >LR134204|4726740:4784003|4757810_4758185_+|VEB94877.1|DBSCAN-SWA MEQLRIRQMLETCRQQAEQLRRLARLAKLRESGEIGMSGNALFQAAVVIESLVGANEKALEGIERLDRSETQLIGERDQVIAALDGMYEAVTGAPPEWSSAFGFTDAINEVTERIFEMENAGHD >LR134204|4726740:4784003|4767254_4767443_+|VEB94889.1|DBSCAN-SWA MNKKLIKPARPGLRVRKADGSLLNADGEILAVAAYWRRRESEGDVVITAPSKPKSGKADKEA >LR134204|4726740:4784003|4744433_4745078_-|VEB94854.1|DBSCAN-SWA MKITTPEALMTATISRRSLVKTSAISGLALASSAFTLPFSRIARAADGLSPAPVEEKAVWSSCTVNCGSRCLLRLHVKDDAVYWVESDTTGDDVYGNHQVRACLRGRSIRRRMNHPDRLKYPMKRVGKRGEGKFERISWDEALDTIADNLKRILKDYGNEAVHVLYGTGVDGGNITNSNVPYRLMNSCGGYLSPLRQLQYRTNQRRNELYVRHE >LR134204|4726740:4784003|4731582_4732146_-|VEB94839.1|DBSCAN-SWA MSRITAVDTATASDVDEHIISQPVQYIKRGTSQFMRVTLALFSAGLATFALLYCVQPILPVLSHEFGVSPASSSISLSISTGMLAIGLLFTGPLSDAIGRKPVMVTALLLASCCTLLSTMMTSWHGILIMRALIGLSLSGVAAVGMTYLSEEIHPSFVAFSMGLYISGNSIGGMSGRLISGVFTDFF >LR134204|4726740:4784003|4736745_4737894_-|VEB94844.1|DBSCAN-SWA MIKLENLTKRFSQKNGQPLNAVDNINLNVPEGEMCVLLGPSGCGKTTTLKMINRLIAPSSGNIFINGQNTSAMDTVTLRRNIGYVIQQIGLFPNMTIEENITVVPRMLGWDKTRCKRRAEELMDMVAMDAKKFLHRYPKEMSGGQQQRIGVIRALAADPPVLLMDEPFGAVDPINREVIQNQFLDMQRALKKTVMLVSHDIDEALKLGDRIAVFRQGRIIQCASPDELLAKPANEFVGSFVGQDRTLKRLLLVSAGDVTDQQPTLTARTATSLAEAFGIMDDNDIRAITVVDSDGKPLGFVKRREARNANGTCADIIHPFRVTGKAEDNLRIVLSKLYESNTSWMPIVDEEGRYNGEISQDYIADYLSSGRTRRALNIHESS >LR134204|4726740:4784003|4774186_4774729_+|VEB94896.1|plate|DBSCAN-SWA MGLNPANIGRTLTGLGRRLRLMVDRAVVRIVTDSLGRQNLQIQSLADATNDDVERFQNYGFTSVPPVGSEAIVLAVGGRREGLVAVAVEDKRCRPKGLKDGDVCIYHADGQALVILKKDGVAEVRVKTVNYTATDLFEITTAQFKVNGPSEFSEDIVVGEKSFLEHFHIDGDGEKTSEPK >LR134204|4726740:4784003|4738550_4739453_-|VEB94846.1|DBSCAN-SWA MTLKKHLLGWLAAALLVTGQAQAAPLILATKGFTEQHILSAMTVQYLQKKGFQVQPRTNIAAVISRNAMVNKQIDITWEYTGTSLIIFNHIDKRMSPQESYDTVKRLDAKLGLVWLNPANMNNTYAFAMQRKRAEAENINTLSEMVAKIEHIRQTDPDNNWMLGLDLEFSGRSDGMKPLQDAYQMQLDRPQIRQMDPGLVYNAVRDGFVDAGLVYTTDGRVKGFDLKVLEDDKGFFPGYAVTPVVRKEILDAHPGLDDALNTLSGLLNNDVISTLNAQVDIDHQSPQQVARTFLQQKGLL >LR134204|4726740:4784003|4765674_4766115_+|VEB94886.1|DBSCAN-SWA MGIYVTRDDLLATDAERVWNMALNKATQQLDEEKIQRAIDDTDAEINSFLAKRYHLPLNLPTLPSPLRRAAVSIAFYWLSERDSQITDEIQKRYDDALRTLREIANGTRDLGVPSDTPVPETDTGKLIIVSDNRRLFTRNNLKGVL >LR134204|4726740:4784003|4737893_4738541_-|VEB94845.1|DBSCAN-SWA MDTIHYMMDNAGYLTSLTLQHLWLVALAVGLAIIIGVPLGILIVRHKWLATPVLGAATLLLTIPSIALFGLMIPLFSLIGQGIGALPAITAVFLYSLLPIVRNTHTALDSLPPGLREAGRGIGMTFWQRLRWVEIPMALPVIFGGIRTAVVMNIGVMAIAAVIGAGGLGLLLLNGIGGSDIRMLIAGALMICLLAIVLDWLLHRLQVVLTPKGIR >LR134204|4726740:4784003|4748087_4748324_+|VEB94858.1|DBSCAN-SWA MMTKSQDWHPEDIKAAIRKRGMTTSQLSRSHGLAESTLRNVFRHHWPKGEKIIADFLGMKPCDIWPSRYHDLTVKEVA >LR134204|4726740:4784003|4779310_4779946_-|VEB94904.1|DBSCAN-SWA MSDVEQQGGISRRTLVKSTAIGSLALAAGGVSLPFGMRTAAAAVQQAIQPVEDKVVWGACSVNCGSRCALRLHVKDDEVWWVETDNTGEDVYGNHQVRACLRGRSIRRRINHPDRLNYPMKRTGKRGEGKFERISWEEALDTISNSLKNTVDKYGNEAVYINYTSGIVGGNITRSSPYASLVARLMNCYGGFLSHYGTYCRAPNKSRHLDN >LR134204|4726740:4784003|4745220_4745970_-|VEB94855.1|DBSCAN-SWA MSEVAKRLGPDVHQRFTEGRTQEQWLQYLYAKMLAKDPELPGYDELKKMGIYKRKDPNGHFVAYKKFREDPQANPLKTPSGKIEIYSSRLAKIAQTWELEKGDVISPLPIYASTFEGWDDPKRSVFPLQLFGFHYKSRTHSSYGNIDVLKSACRQEVWINPVDAQKRGIANGDMVRVFNDRGEVRIPAKVTPRILPGVSAMGQGAWHDADMSGDRIDHGACVNTLTTQRPSPLAKGNPQHTNLVEIEKV >LR134204|4726740:4784003|4747368_4747851_-|VEB94857.1|transposase|DBSCAN-SWA MDVQTWFTAQECAGMPGFPSGVSNVRKQLEKLSEGLEGVRRKRGGTKATEYHISILPARTKNYLGYSDAGQPSEGLGEAKSGLANDEKKALWMMIYHGMTEAQREAVVEIFITGGLKMLMPAVLELSDSNQVQRKEIFEQRNSDEQVSPASSVSTQSKAG >LR134204|4726740:4784003|4757202_4757508_+|VEB94875.1|DBSCAN-SWA MEWETVRSNWAVIWAVLMSGVNIVQLLLAKTYARREELEKVSSRLNILEHAVDGLPTRQELHQLQLEMSNLRGELRELAPSIRQVSRISDLLLENELKEKN >LR134204|4726740:4784003|4777491_4777890_+|VEB94901.1|tail|DBSCAN-SWA MWYWNPVDCNEALPDIHDLSGCIEIDDDEHPFKTGDLPAGKVWSNDASNHPVLTDTPPPSPEEEIAEAEAIRSQLRATADAEIIWRQDAVDAGIATERETTELAEWIKYRVLLMRVDTTKPVWPTVPGEQAS >LR134204|4726740:4784003|4727829_4728090_-|VEB94834.1|DBSCAN-SWA MKKVLALVVAAAMGLSSAAFAAETATSTTAPAAAPAKTAPAKTTHHKKHHKAAAQKAPEQKAQAAKKHTKKAVKHTAAKPAAKPAA >LR134204|4726740:4784003|4739480_4740191_-|VEB94847.1|DBSCAN-SWA MHTFTLKRVLGFTGAIALVLALLVWGVGLETIKARQVDLLYLGKQHLLLVFTSMFFALVIGIPSGILLSRPAAKGFAEYVMQLFNVGNTLPPLAVLALAMVIIGIGDVPAIFALFLASLLPVVRNTYAGLCSVPASLIEAANGIGMTKWQRLRQVELPNAWPVMLAGIRIATAINVGTAPLAFLIGASSYGELIFPGIYLNDFPTLILGATATALVALILDSLLALLGRILSPHTA >LR134204|4726740:4784003|4760606_4762175_+|VEB94881.1|portal|DBSCAN-SWA MFKQLTGAVRRLFSSSTGQTVTVLQEELQQPQARASVVSVRTPSPGISVASTLSPGRLAGILRGAADGNARDFFIMAEELEERDLHYASVLRTRKLTVAGIEPSVEAASDDARDIDIADAVRNLITQPQIPELLFDLLDGLGKGVGVCEILWDTSNQFWQPRDYEWVDPRFLKPDRETLRDFRLLTDASPIEGEPLTPGKYVVHQPRLKSGLPLRNGLARLVAVMYMLKSYTVRDWWAFAEKFGIPVVVGKYGNNASPEQIQTLLEAIASLASDAGCAIPASMTLEMQETASRNNGGALFKEMAVWCDEQISKAVLGQTMTTDNGSSRSQADVHDRVRMDIARWDARQLENTLNEFLVRPFVIMNYGPQDSYPRVVLRLSEPEDLKMLVDALTPLIDRGMEVQMSEVRDKFGLSEPEKGAALLTPSSQALQPALAINRERLALNRNQQDDIDLMVADAMQDWQRTGDAFTSPVLQLAKDADSFDAFLAGLPELQKTLEPDEFVTQLAQLCFKARALGDVNDA >LR134204|4726740:4784003|4782788_4784003_+|VEB94911.1|DBSCAN-SWA MKIVGAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLKDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHTIDDVLEDYARHKEMGFKAIRVQCGVPGMQTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFTPKLFDAVRAEFGFEEHLLHDMHHRLTPIEAPRFGKRVEDYRLFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTITHAGGITGMRRIADFASLYQVRTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLDVFPHSWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYDPAYLPVARLEDGTLWNW >LR134204|4726740:4784003|4773109_4774012_+|VEB94895.1|plate|DBSCAN-SWA MESVVLTVDSQQWDGWTEMSITSSLEAIAGEFDLTVTTQWSEASPRVIRQGMPCTVALGSDTVVTGYIDDFIPSYDAENVSIRVTGRDKTGDLVDSSVVHKSGQWKGVRLEKLAEEICKPYGVAVINETDTGEAFPSVALEQGETAFDLLDRLAKQRGVLLTADGLGRLVITRASTKRAGVALVLGKNILAARGRFSWRERNSQYIVKGTTSAGGSTWDEQPAKVTGGRQTIVDDNEINRYRPKILVNEDSLTVGGANTRGEWFKARMLGEANSTEITLAGWRENGDSGPLWQKKSTGRY >LR134204|4726740:4784003|4740496_4741111_-|VEB94848.1|DBSCAN-SWA MTSFSQRDDFSMAARVLGALFYYAPDSTEAAPLVSALKTDGWQAQWPLSPEVLAPIAAVFQADIEEPLPQAWQRLFVGPWALPSPPWGSVWLDRESVLFGESTLALRQWMRDNGIQFEIQQNEPEDHFGSLLLLAAWLAENGRHSECEQLLAWHLLPWSARFLAVFIEGANHPFYQAAGELAQQTLAQWQSQLLIPVAVKPLFR >LR134204|4726740:4784003|4778729_4779284_+|VEB94903.1|DBSCAN-SWA MLIGYVRVSTNDQNTALQRNALECSGCELIFEDKISGRTSDRPGLKRVLRTLSEGDTLVVWKLDRLGRSMRHLVILVEELRERGINFRSLTDSIDTSSPMGRFFFHVMGALAEMERELIVERTRAGLAAARAEGRVGGRRPKLTPEQWAQAGRLLAAGETRQRVALIYDVGISTLYKRFPASDR >LR134204|4726740:4784003|4742011_4742380_-|VEB94850.1|DBSCAN-SWA MHKREDGFVVVDEDVCIGCRYCHMACPYGAPQYNAAKGHMTKCDGCHDRVADGKKPICVESCPLRALDFGPIEELRKKHGTLAAVAPLPGAHFTRPNIVIKPNANSRPTGDTTGYLANPKEV >LR134204|4726740:4784003|4762167_4762962_+|VEB94882.1|head|DBSCAN-SWA MRKPERKPDIIPKEALEWLKAKKLKPGFDYRDVWQEEHRYGYTVAKMTQLDLLADVRQLVEDALENGQTFAQFRELLRPLLVKRGWWGQALMDDPLTGETRQVQLGSERRMRVIYDTNMRTARAAGQWERIQRTKRAMPYLVYTLGPSREHRAEHLKWANTCLPVDHQFWITHMVPNGWGCKCNVRQVSLYEFEQMQQNGTITTTAPDVRYVKWVNKRTGEEESIPEGVDPGWAYNPGISRSDALSLQLKQKQQAFDSYTSSQK >LR134204|4726740:4784003|4751344_4751584_+|VEB94861.1|DBSCAN-SWA MIAERIAEHISMAEAAQNWLRARGSRVTDVRVFMRRPMLEIACPPVELVNSAERIAESHNGGTRSVWVASLEGCRIIWR >LR134204|4726740:4784003|4742342_4742630_-|VEB94851.1|DBSCAN-SWA MTTQYGFFIDSSRCTGCKTCELACKDYKDLTPDVSFRRIYEYAGGDWQEDNGVWHQKRLCLLSFHFLQPLRRPGLHESVPERRDAQTRGRFCGGG >LR134204|4726740:4784003|4766653_4767262_+|VEB94888.1|DBSCAN-SWA MSERPAFVTLGSTVSAAENIVAWLKTELEGNTPDRVEIVERHVGQFSTPDEVKRYLSGRSGCVRLAALRVRNISNRNGMTGLVTWAAYVMTSDSWGYARDARCEVLAGKIARRISVREAPRAMKAERMAENIGAENIYSGRLDNFGVSLWAVTWEQVFRLDDEIDMAALPEFLRLGASFVVNGQPVTEEPDIINVREGQTDE >LR134204|4726740:4784003|4726740_4727559_-|VEB94833.1|protease|DBSCAN-SWA MHKTIAVLLGVICLSPVVAHADEADTTTDTREAKTLFFDHDDRVKVEDPTQSPWDAIGQLETASGNLCTATLISSHLALTAGHCLLTPPKGKPDKAVALRFVSQKGLWRYEIHGIEGRVDPSLGKRLKPDGDGWIVPPAAAPWDFGLIVLRYPPSGITPLPLFEGDKAALTAALKTADRKVTQSGYPEDHLDTLYSHQDCLITGWAQNAVLSHQCDTLPGDSGSPLMLHTDAGWQLIGVQSSAPAAKDRWRADNRAISVTGFRDKLEALAQE >LR134204|4726740:4784003|4777861_4778296_-|VEB94902.1|tail|DBSCAN-SWA MSKYIYSPSHNAFYLTVLKTEYELSGNWPEDGVEISDNLFIEYTSTPPDGKERGVGDDGMPCWVDLPEPTTEELIAAAESKRQQLINEANTCINNKQWPGKAAIGRLKGEELSQYNLWLDYLDVLEAVDTSSAPDINWPVPPEL >LR134204|4726740:4784003|4768930_4769287_+|VEB94891.1|tail|DBSCAN-SWA MSSILGMAAIRINGREIKTEGKSTLNPGGYQRQQHMGAGKIWGISRKTAAPSIKLTIAADQDVDVIEISQWEDVTVMFYGDNGLNYMMTKAATDNPAELDEDAGTVTANFIGVQCVKV >LR134204|4726740:4784003|4765105_4765675_+|VEB94885.1|DBSCAN-SWA MSGKTEKKSVKPAAGSDTTTAQKDIKGTQDTTVKTDPASLAPVTSADGQTSQSTVTATDGEAGPESDATATIAGVTPSPETDHSHLQDAGVNTVMSALCQSVSDGVHISGDVTVLEVRAIPEGGFHRAGRFWPHDPVHVFVSDDPDEQVLEDGSGQPLYGCVISTADASRLNREKMLVVTELKPKAEES >LR134204|4726740:4784003|4774725_4775178_+|VEB94897.1|DBSCAN-SWA MTIGITWNNQLSRGELTVTHDGLTLDEGLVTLVLICLFTDTRADDDDVIPDNTGDPRGWPGDTFSAYPWGSKLWLLDREKLTETVRQRVEDYASLAMQPLLRSGYARTASVTAVISGADRINFIVILSRPDKTQLRIEISKRWEATEHAL >LR134204|4726740:4784003|4767442_4768921_+|VEB94890.1|tail|DBSCAN-SWA MALGNIPDDIRVPLVWIDIDNSMAMSGAPAQSRKILVIGQQVESASAEPLTLNRITGDSMADEYHGRGSMLAEMLKTLRKANSYTETYAMGLADIITGAAATASITVVGDALAAGTLALLINGVSVQVGVAQGDSAETVVQSVITAVTAKTATQVSAVVDGENAASAVLTVNWKGVTGNDCDVRLNYYSGEKTPSGISVTVTPFTGGAGTPDIQAVVAALGDDWYTDIVFPYNDTQSLNTIRDELLERWGPLKMMEAQLWTAFRGTHAQTGTFGSARNDWLISCIGTNISPEPVWLWAASYGGTAAYQLAIDPARPLQTLVLTGIKSPARAVRWDMPERNLLLHDGIATHFVDAGDNVCIEREITMYRVNSFGDTDISYLDVQSPATLGRIRYVIKNRFTSRYPRHKLAGDDVLDLLDAGQPVMTPKICRAELLDIALTELIPAGLVEDFDDYKDTLDVSIDSSDPNRLNFICHPNLVNQLRVLAGLIQYKL >LR134204|4726740:4784003|4745998_4747039_-|VEB94856.1|DBSCAN-SWA MPYTYGSNEGNSTSDIENTKLVVMFGNNPAETRMSGGGITWFLEQARERSNARMIVIDPRYTDTAAGREDEWIPIRPGTDAALVAGLAWVLITEDLVDQPFLDNYCVGYDEKTLPADAPANGHYKAYILGQGEDGTAKTPEWASRITGIPADRIIKLAREIGAAKPACICQGWGPQRQANGELTARAIAMLPILTGNVGIHGGNSGARESTYTITIERMPLPENPVKTQISVFTWTDAIARGPEMTALRDGVRGKEKLDVPIKFIWNYAGNTITNQHSDINKTHAILQDESKCEMIVVIDNFMTSSAKYADILLPDLMTVEQEDIIPNDYAGNMGYLILHPARDRP >LR134204|4726740:4784003|4750401_4751340_+|VEB94860.1|DBSCAN-SWA MTQINHDVVRSAIRELIDSKAISGAALARETGTSTATVSQFLNGKYKGDNDTVAASLNTWLESHNAAKTSLPVAPDFVETPTSQKILATLTWAQLAGTIVLVYGNPGVGKTKAIRQYAAGGNNVWHITASKSRSNELETLYELALKMGISDAPYRRGALSRLLRQRLPDTRGLIVVDEADWLSLDAVEELRILQEECGVGLALVGNHKVYDRLTGGQRSVDFARLFSRVSKKFVINTVSAGDVDSFCDAWHVTGAEERKLLKAIARRPGALRSLSHILPLAGIYAQGKGETIGTAHIQSAMLELGHNGINED >LR134204|4726740:4784003|4728250_4728838_-|VEB94835.1|DBSCAN-SWA MQLGRSAAFLKRLIPAVKRHFGDQPAFEEANFIAWNQRVKPGFIRVDADEVSYPAHVILRYDIEQALINGDIEVDDIPALWDEKMQAWLGLSTAGDYRNGCMQDIHWTDGGFGYFPSYTLGAMYAAQLFSAAHSALPDLHNQIAQGDLSALFDWLRQNIWQHGSRFTTAQLIAQATGEPLSSRYFRAHLESRYLS >LR134204|4726740:4784003|4752822_4753422_+|VEB94865.1|DBSCAN-SWA MTNRRAINIQQIRGDIARRKAMPGFGPDTSIERLRTVQETQRSFTPEIVEALLDELKVVTHAAAVDHEAACSLAEENEELKRKLASLGTEPRRNPVLAYADSYRDMAKQGVESIPVWNVITDLERNIAPLYTAPVELTDEQIDAVLDSHGNIAYVIADKRERLRMFAREILRAAMLQPVDIQEIPSGWRWTLLPPDSQE >LR134204|4726740:4784003|4758177_4758456_+|VEB94878.1|DBSCAN-SWA MTKSLKALSCGQRDIIRKMAAILVCAEIEVRAIAPQYEKTTGKKYDSKSADSYLNTFLNNNPEYKRVWKLLLKDKTSHERDFLARIRGENGK >LR134204|4726740:4784003|4742640_4743459_-|VEB94852.1|DBSCAN-SWA MGYVILGQPATSPKFERKPIYWTLSEIARRLGPDVYQTFTEGRTQHEWVKYLHAKTKARNPEMPDYEEMKTTGIFKKKCPEEHYVAFRAFREDPASSPLKTPSGKIEIYSERLATIADTWELENDEIVHPLPAYAPGFDGWDDPIRITYPLQLTGFHYKARTHSSYGNIDVLQQACPQEIWINPIDAKARNIQQGDTVRVFNNNGQMLIPAKVTPRILPGVTAIGQGAWLKADMFGDQVDHGGSINILTSHRPSPLAKGNPSHSNLVQVEKV >LR134204|4726740:4784003|4751868_4752396_+|VEB94863.1|DBSCAN-SWA MKAPKKTRAKSAAAVAVPQSREDVISDIRKIGDITRVILRRETELNDKIAALTNDVAPGIEALKKELERLQKGVQTWCEANRAELTKDGKTKTANLTTGEVRWRKRPPSVTIRKVEDVIAMLKKFSLGKFLRNKEEINKEAILASPNEVKGIAGISIKSDVEDFEIIPFEQSVTD >LR134204|4726740:4784003|4763161_4764124_+|VEB94883.1|protease|DBSCAN-SWA MKPTNTELLALCFQLPELADDALPEWLPMIPAGTFTGRDGRSWINNQPESVIRATLSYPKLPFDIEHATELKGPKGEEAPAFAWLDDYRIRDDGVIEAHVEWTADGAALVRGKKYRYYSPAFRFTADGQVTRLSSAGLTNKPNLDLPALNSEENTMTVPVQIVTVLGLAATASADDAVKAIQQLKTSEQVALNRAENPDLTKFIPVETHQLALNRAETAEAQLSAIAIKEAEALVDGAIEAGKVAPANREMYLATCRSEDGRKQFAEFVKGAPVIVSKDPSDKKDPGGDGNTTLSDEDLAMCRQMGITQEEFLSVRKQEK >LR134204|4726740:4784003|4741152_4742010_-|VEB94849.1|DBSCAN-SWA MGNGWHEWPLVLFTVFGQCVVGALIVSGLGWFAAKDDAASRQRIVRGMFFLWVVMGLGFLASIMHLGSPFRAFNSLNRVGASALSNEIAAGSVFFAVGGFWWLVAVLGKMPEALGKIWLLVSMVLGVVFVWAMTRVYQIDTVPTWYNGYTTLAFFLTVFLSGPLFAALILRAARARFNGTTFASISVLALLVCAAVIIMQGMSLGAIHSSVQQASALVPDYGRLQVWRVVLLAAGLGCWICPLVRRKEPHVAGLLLGLILVLGGEIIGRGLFYGLHMTVGMAVAG >LR134204|4726740:4784003|4776818_4777490_+|VEB94900.1|tail|DBSCAN-SWA MFYVDNPTGVPVMPEPSPVSSLTDLFFTEGGNGVPPTYPGPDWFNIIQSELINIVRAAGLDPDKMDNTQILAALKKLFLQRQNPFGDIKSDGAVATALANLGLGELAKTPRFLVSKGQNANGWYEIYSDGFKRVGKTWDSSNPLLISTPATGARVNYPISFTTQFNGIYVTENGNTNNNFEFANPAQIGMTGFSMATMDITLGSSPSTAYGTSFTGYYTAEGY >LR134204|4726740:4784003|4756388_4756997_+|VEB94873.1|DBSCAN-SWA MDPVTISTVASVLMKAGPSLLRTVGGWFGGDTARTADSVAGIVENVNSVINPQDQQRVLEQKIAALPPEQFVQLQSLKVQVEQFQLERDKAVLADRQAAHHEQQETIRNGDNATDEYVRQTRPLMARLSLYSSIAYVMLMSVGQQAGAVSGAFGHTFAMPSPDWDIALMLATPALGYLGFRTLDGFARYSKSSKHKVMVGGK >LR134204|4726740:4784003|4735291_4736545_-|VEB94843.1|DBSCAN-SWA MFRRLLIATIIGILAALAVAGFRHAMLVLEWLFLRNDTGSLVNAATNLSPWRRLITPAVGGLAAGALLWGWHKMNQQRPHAPTDYMEALQTDGQFDYGASLVKSLASLLVVASGSAIGREGAMILLAALAASCFAQRCTPREEWKLWIACGAAAGMASAYHAPLAGSLFIAEILFGTLMLASLGPVVISAVVALLTTHLLSGGNALLYTVHLSLDLHVREYAMIISTGLVAGVCGPLFMWLMTTTHNGFIRLKLSPPWQLALGGFIVGLLSLLTPAVWGNGYSVVQSFLLSPPLLSVIAGIFICKLLAVLASSGSGAPGGVFTPTLFIGLSIGMLYGRMWGFWLPGTDEMAILLGLTGMATLLAATTHAPMMSTLMICEMTGEYRLLPGLLIACVVASVLSRTLREDSVYRQHTAEH >LR134204|4726740:4784003|4730891_4731530_-|VEB94838.1|DBSCAN-SWA MFWKILPESRHFRPTSLRPKTLFINFRLHWRDRGLPLLFAEGFLLMGSFVTLFNYIGYRLMLSPWELSQAIVGLLSVAYLTGTWSSPKAGAMTTRYGRGPVMLFSTGVMLFGLLMTLFTSLWLIFAGMLLFSAGFFAAHSVASSWIGPRAKRAKGQASSLYLFSYYLGSSIAGTLGGVFWHSYGWNGVGGFIALMLLLALLVGSRLHHRLHV >LR134204|4726740:4784003|4755782_4756001_+|VEB94871.1|DBSCAN-SWA MGKGWEGSMRQGRRDRLRQEVLHRVAGGPPPVPLSYQGHDGTHGSFYMRGWESVDTRDIFWQCQRYKEKHSV >LR134204|4726740:4784003|4764127_4765021_+|VEB94884.1|head|DBSCAN-SWA MQVSAEVLHALTTALSAAFTKGVGRVNPQYRSIATVIPSTGASNTYGWVEDFPTIKEWIGERQLKELAQAGYTITNKTWENSVKVKREKIEDDQIGQYSVIAEQLGRDTTIFPDKLSFELLCKGFDTLCWDGQYFFDTDHPVGTSTASNVIGDPTTDAGEPWFLVDATHALLPIIYQERRPFNFTALDDLTSERVFLQNEFAYGTDGRSNVGFGFWQTCVGSKAALNKANYEAAVSAMMDITDSNSEPLGMNPTLLVVGKNNRGAAKSLIEALMADGGGSNIYYKDVDLLISPYVKA >LR134204|4726740:4784003|4754163_4754571_+|VEB94868.1|DBSCAN-SWA MTEKQDDLFGDIRDDSILEQLDDDSAESRRFPALLAQLNALLRTELEKLGHDPRISLDLIYAISKSIGGMQLYFPRGTALESLIRDMKIWRDFNGKNIPELVERYHVTFNTVYAAIRRMRKLEQRKYQLDLFSKD >LR134204|4726740:4784003|4756993_4757203_+|VEB94874.1|DBSCAN-SWA MTDELDRASGLEMADRERALNARLNRVKEVPEESGFCNDCGDAIDPARIAVLPDAVTCIDCQTLRERSV >LR134204|4726740:4784003|4776228_4776819_+|VEB94899.1|tail|DBSCAN-SWA MAHSVDEWLGALWQVMPRGKAWSRDNDNDLTRFLRALARRLSQAEFDAERLLPEMRPETTFLLLEEWEEYLELPECEQASGTIEDRRRAVVEKYHRKGGLAPWQIEAVAAALGFTIRVNVILPHHCLRSCMYPLYPARYRWILQIDVLGISGGRFTCIDNVMTPLLSDRARELECVMTKYRLAGTAYDYIYYAGDN >LR134204|4726740:4784003|4771811_4773110_+|VEB94894.1|DBSCAN-SWA MGTTKWEDLREASFRGVAFYLVDNEGTSGRRAIPRAYPKKEVGWTEDNGAVLTQQQINGKLIGSSYQSQLEDLLRALNTPGPGELVHPWFGIQKVQVGKVNHRLSTQEGGIAYISFEVSEAGERLFPAAAENTSLTVLRGVDKVKAALENGDFFAVLDGLGEMVDTFLDDMEGMVVNLLTLPSAITEWMDRLGRFRGLVDVIVAKPANFINEILGLVSGVHETVTEPLWSMRLYDRLRSRWEGAQSEGSGAGISRAEAAATRQLPQFMSVTPGSVEGGVGGFASSIPTVATTPSPAMQANITGFTQVVVLATLLAQAETIAQTTFRTSEEAVSTGDALAVLLAEQAVIAVESGQRELWRTLRDLRFAVVNDVRIRSARLPQTRLLSPTITSSVSLIAWRETGNTENRDTITLRNRLRDPSFILPGKTIEVTE >LR134204|4726740:4784003|4781865_4782207_-|VEB94909.1|DBSCAN-SWA MNNTLSKRLCLTAMLALGAVVYTTSAFANTSKLIIESGDSAQSRQHAAMEKEQWNDTRSLRQKVNTRTEKEWDKADAAFDNRDKCEQSANLNAYWEPNTLRCLDRRTGRTVAP >LR134204|4726740:4784003|4743412_4744450_-|VEB94853.1|DBSCAN-SWA MFGTNDGNSPDDIANSKLVVMFGNNPAETRMSGGGVTYYVEQARERSNARMIVIDPRYNDTAAGREDEWLPIRPGTDAALAAGIAWVLITENLIDKPFLDKYCVGYDETTLPASAPRNAHYKAYILGDGPDGIAKTPEWAAHITSIPAEKIIQLAREIGSAKPAYICQGWGPQRHSNGEQTARAIAMLSILTGNVGINGGNSGVREGTWDLGVEWFSMLDNPVKTQISVFTWTDAIDHGAEMTATRDGVRGKDKLDVPIKFMWCYASNTLINQHGNIAHTHDVLQDDSKCEMIVGIEHFMTASAKYCDILLPDLMPTEQEDLISHESAGNHGLCHPGATGHLTKI >LR134204|4726740:4784003|4733294_4734515_+|VEB94841.1|DBSCAN-SWA MVADSQPGHIDQIKQTNAGAVYRLIDQLGPVSRIDLSRLAQLAPASITKIVREMLEAHLVQELEIKEAGSRGRPAVGLVVETEAWHYLSIRISRGEIFLALRDLSSKLVVEECLELPLVSETPLLERIIFMVDQFFIRHQQKLERLTSIAITLPGIIDTENGIVHRMPFYDDVKEMPLGETLEHHTGVPVYIQHDISAWTMAEALFGASRGARDVIQVVIDHNVGAGVITDGHLLHAGSSSLVEIGHTQVDPYGKRCYCGNHGCLETIASVDSVLELAQLRLNQSMSSSLHGQPLTVDSLCQAAVQGDLLAKDIISGVGMHVGRILAIMVNLFNPQKILIGSPLSKAAEILFPAIADSIRQQALPAYSKHIAVESTQFSNQGTMAGAALVKDAMYNGSLLIRLLQG >LR134204|4726740:4784003|4752485_4752830_+|VEB94864.1|DBSCAN-SWA MSKSLNARCIRRWEVEFKPLCDSKVNPYWRKRDLRGYIREAALTTAYSMVDSMAERNAKFDFDGSTIGWSPEFSSWYHERREKYLKEARDYLNEDATNDEIDEEIQNELEAWND |
79 | Vibrio_phage(45.83%) | protease,transposase,plate,tail,terminase,head,portal | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|