Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
LR134301 | Stenotrophomonas maltophilia strain NCTC13014 genome assembly, chromosome: 1 | 5 crisprs | Cas9_archaeal,csa3,DEDDh,cas3,WYL,DinG | 0 | 1 | 6 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134301_1 | 615163-615264 | Orphan |
NA
Consensus repeat of LR134301_1
|
1 spacers
spacers of LR134301_1
>1.1|615188|52|LR134301|CRISPRCasFinder TGCCATGCATGGGCGCCCACCAAGGTGGGCATCTACCAGAGCGCTGTACCGC |
csa3,DEDDh |
CRISPR arrays and Neighbor proteins around LR134301_1
The CRISPR arrays of LR134301_1 >merge|LR134301|1|615163-615264|CRISPRCasFinder GGTAGTTGCCAACCTTGGTTGGCGCTGCCATGCATGGGCGCCCACCAAGGTGGGCATCTACCAGAGCGCTGTACCGCGGTAGTCGCCAACCTTGGTTGGCGC >LR134301|1|1|615163-615264|CRISPRCasFinder GGTAGTTGCCAACCTTGGTTGGCGC TGCCATGCATGGGCGCCCACCAAGGTGGGCATCTACCAGAGCGCTGTACCGC GGTAGTCGCCAACCTTGGTTGGCGC
>LR134301.1|VEE50990.1|614212_615157_+|5'-nucleotidase MGDNSPRLLTVAVTSRALFDLEESHALFESDGVAAYAEFQRQHEDDILGPGVAFPVVRKLLALNQGASPENPRVEVILLSRNSADTGLRIFNSIQHYGLGIIRATFTAGEPTWPYVKPFGTDLFLSANPESVRSALRHGIAAATILPKPPGETAAAAADQIDIGRPAGQLRIAFDGDAVIFGDESERISREQGVEAFGRHERERAREPLSGGPFRGFLSALHTLQEVFPAGESAPIRTALVTARSAPAHERVIRTLREWGVRLDEALFLGGRHKGPFLQAFGADIFFDDSQHNIDSAREHVAAGHVPHGVANEG >LR134301.1|VEE50989.1|613391_614165_+|NAD-kinase MSDTPRIAFLASTTEPAQMARAAMVSRYGDYTPEQADVLCPLGGDGFMLQTLHRHGHLGKPVFGMKLGTVGFLMNQYRGDDDVQARIARAEPANLRPLEMVALTESGTSTGSLAYNDVSLLRQTRQAAHIGIDLNGQERVAELIGDGVLVATPAGSTAYNYSAHGPVLPLGSHTIALTPLAPYRPRRWRGAILKADTEVRFRVLDPYKRPISVTADSHETRDVVEVTIRESKDRRVTLLFDPEHNLEDRILSEQFVF >LR134301.1|VEE50988.1|608206_613183_+|NAD-glutamate-dehydrogenase MKPQKTAAKTVKNKTKPAEAEVAVTAGFSLEPVYTALRKRYPAAAQAEAVAFATDFYKRMESDEFPHHSAEEWAALAAETLEFARARKAGKANVRVFNPTAKANGWESPHTVLQIVNDDMPFLVDTVTMSLAEQGVGVHVLGHPVLRFTRDKAGKLVKVGEGDAESVMLLEIDRQPAEAMAAIEQAINKALDEVRAIVRDWQPMKDKALALADDLGSRQLPVDAASRKEAQEFLRWAADNHFTFFGYREYRVEKQGKEEVLAPLNDTGLGLMRGKDKSAARPVKTLAAQGLNATSGLKDALILTKTNARSRVHRAGYMDYIGVLEFDAKGKIIGEQRFLGLFTSSAYNRRPWEIPLVRQRHEHVMKQSGLAPASHSGKALRHILETLPREELFQSSEDELLRTAMGVLGLQERVRSRLFLRRDKYSRFISALVYLPRERFNTDVRLRIEAMLKDALHGEYVDSSVVLGESPLAQVHLIVRPKPGEMLDVDTAELEQKLAQVLRNWQDDLREALVARHGETEGLRIAARIGKALPAGYIEDNSTAVAANDVSQLDALTGPDDLRLSLQAVPRESGDGLRLKLYRQLDDIPLSDALPMMENMGLRVIAERPYRLSVDNAPVYVQDFEVESTAGAIDAASVDEAFGETFARVWHGDAENDGFNRLVLAAGLHWRQVAMLRGYCKYLLQTGVPFSQAYVEGTFARYPLLARLLVELFEARFDPATGHESKDDIAAGQAQLKAHFDVLAAGDDATLKVLKTVVDARKGDRDAQMQAARDALLKLMDRVSSLDEDRILRSFMGVIDATLRTSYYQTDANGQHGHVISFKFDSALVPDLPKPRPYREIFVYGPRVEGTHLRFGAVARGGLRWSDRREDFRTEVLGLVKAQMVKNTVIVPVGAKGGFFAKTPPVNGDRDAIFANGVACYKLFIQGLLDITDNIVNNKIVPPVDVVRHDMDDPYLVVAADKGTATFSDIANGLAIAHGFWMGDAFASGGSVGYDHKGMGITARGAWESVKRHFRALGRDSQTQDFTAVGVGDMSGDVFGNGMLLSRHIRLVAAFDHRHIFLDPNPDAATTFVERERLFTVPRSSWADYDAKLISKGGGVYPRSLKSIEITPQVREALGLDDNVKALSPNDLMSAILKAPVDLLWNGGIGTYVKAASEQHSDVGDRANNALRVNGGELRCKVVGEGGNLGMTQLGRIEAAQAGVLLNTDFIDNSAGVDTSDHEVNIKILLNDVVRAKKLTVEQRNKLLASMTDEVAELVLNDNYRQNQALSLMERMAVKRLGSKQHFIRTLEQQGLLDRQIEFLPSDAELSQRKARGQGLTRPELSVLLSYSKLVAFAQLLDSDIPEDPYLSKELQRYFPTPLQKKYADAMERHRLKREIIATAVTNQTINRMGATFLMRMQEDTGRSIAEVAKAYTISRETLDARALWAQIDALDGKVPESVQIDALEVIWKLQRSFVRWLLSRPGAMPGITEAVNRYQGPFNDIRVASGVLPDSQRPTYEALVDEWKEKGLPSALAQQLAELHFLEPAFDIIELARTRKLKPVDVSKVHFRLGDALQLPWLFEQVDALEVNGRWHAVARGVLRDELAANHRNLAGQVLGTKGSSAEAKVAAWMGRDDNSLRFTLAMLAELAEQKTLDYPTVSVAVQRLGQLAAHGA >LR134301.1|VEE50987.1|607299_607956_-|TetR-family-transcriptional-regulator MNPAYLPVALDARDERVFDAVRELLAQQGMQMSMDAVAQHAGCSKQTLYSRYGSKQELLRRVMQRHVGHATGAMVRALRSDDLRASLLQFAIDFLEHFNQPHVGQACRLIAADASQFPEEARTLYRHGAGALTLHLAEWIETVCMQGQLRHDDPHFMAELLLSMIAGQDFDKQRFHTPHRDDALLRRRWAEFSVDSFLRAFAPQPSAAPSTNQPRSSS >LR134301.1|VEE50986.1|606031_607300_-|RND-family-acriflavine-resistance-protein-A MIAPLRTLALTCAVAVALAACKKPEQQTPPPPEVGVIDAKPQTLPLQRELVGRLSPFRSADVRARVPGVLLKRVYQEGSQVKQGQTLFLIDPAPLRASLNASEAQLASARATYANAKVAADRARSLAPQQFVSKSDLDNAESAERTALAAVKQAEAAVTSSRINLGYTEVTAPISGVANKQQVTEGALVGQGDVTLLTTVDQLDPLYVNFSLSVDELTQLRAQQAKGALALSGDGKATVNVKLADGSTYSEPGTLDFSSTTVDPATGAVSLRALLPNPQQILLPGAFVSFQANLGERNNAYLVPQQALLRDTTGGYVMVVGADGKVVRKNVKTDGAQNGNWLVSDGLAAGDKVIVAGVQKVKEGAPAVAKPWTPGQDANGKPAAGGAAPAGAAPAAGKAPADAAKPEQADAAKPAATDSNKQ >LR134301.1|VEE50985.1|602842_606016_-|RND-efflux-system,-inner-membrane-transporter-CmeB MPKFFIEHPVFAWVVAILISLSGVIAILNLGVESYPNIAPPQVTVSATYPGASADTTEKSVTQVIEQQLTGIDHLLYFSSSSASNGRAQITLTFETGTDPDIAQVQVQNKVSLATPRLPSEVTQQGVVVAKANAGFLMVIALQSDTPAINRDALNDIVGSRVLDQVSRIPGVGSTQQFGSEYAMNIWLNPEKMQGYGLSASQVLAAVRAQNVQFAAGALGSDPSPEGQHFTATVSAEGRFSSPQEFENIILRANADGSRVLLKDIARVAFGANNYGFDTQYNGKPTGAFAIQLLPGANALNVADAVRGKMDELQPSFPSGVTWFSPYDSTTFVKISIQEVVKTLFEAVFLVFLVMLIFLQNFRATLIPTLVIPVALLGTFLGMWMIGFTINQLTLFAMVLAIGIVVDDAIVVIENVERIMTEEGLAPKPATQKAMTQITGAVVAITVVLAAVFIPSALQGGAAGEIYKQFALTIAISMAFSAFLALGFTPALCATFLKPTHNDNPNIVYRTFNKYYDKISHTYVGHITSAVRHAPRWMILFVVLTALCGFLFTRMPGSFLPEEDQGYALAIVQLPPGSTKGQTNEVFGQMRGILEKQDGYEGMLQVAGFSFVGSGENVGMGFIRLKPWEERKFTAPEFIQNMNGAFYGIKEAQIFVVNLPTVQGLGQFGGFDMWLQDRSGAGYEQLTQARNILLGQAAQKPDHLVGVRPNGLENAPQLQLHVDRVQAQSMGMSVSDVYSTIQLMLAPVYVNDFFYEGRIKRVTMQADGPYRTGQESLKSFYSPSSLTQNADGTNSMIPLNTVVKSEWVSAPPSLSRYNGYSAINIVGSQAPGTSSGEAMQTMESIVNDDLPAGFGYDWSGMSYQEILAGNAATLLLVLSIVVVFLCLAALYESWSIPVAVLLVVPLGVLGALGLSMLRGLPNDLFFKIGLITVIGLAAKNAILIVEFAVEQRAAGKNLRDATIEAARLRFRPILMTSFAFIMGVIPMAISTGAGANSRHAIGTGVIGGMLFATLLGLLMIPVFFVVVRRMLGDKLDEPSKEFMERQRDADAAHRPDR >LR134301.1|VEE50984.1|602376_602667_+|transmembrane-protein MMIYRGWGFMTLVTPIAAILLLAYFFPHEGSRGNTPLTQVLLGAGIGAAVNVMLGLWFNRAPRQKGEAAPHHFFFVPMQWAALALVVVCVAVALLR >LR134301.1|VEE50983.1|601097_602246_+|acyl-CoA-dehydrogenase MDFSFTEEQLMLQDVARRIAQEKIAPSAEHHDRTGEFPLDNIRLLGENGLMGIEVPTEYGGAGMDPVAYVLAMVEVAAADAAHSTIMSVNNSLFCNGILTHGTEEQKQKYVRAIAEGEAIGAFALTEPQSGSDATAMRCRAVKQADGTFIINGKKSWITSGPVAKYIVLFAMSEPDKGARGITAFLIDTDKAGFGRGKTEPKLGIRASATCEIEFNDYVAQAEDVLGQEGEGFKIAMSVLDAGRIGIASQAIGIARAAYEATLEYVKERKAFGAAIGTFQMTQAKIADMKCKLDAALLLTLRAAWVKGQGKRFSTEAAVAKLTASEAAMWITHQAVQIHGGMGYSKEMPLERYFRDAKITEIYEGTSEIQRLVIARNETGLR >LR134301.1|VEE50982.1|600014_600944_-|ArsR-family-transcriptional-regulator-/-Methyltransferase MDLEDWSTRLKVFADATRVRLLALLEQEELTVAELSAITRLAQPRVSTHLARLKEAGLVRDRRAGVSAYYRFDEAQLDPAQRALWHALSNGSDDPLLRQDAERVAAVLAHRASDQNWADSVAGDMERHYSPGRTWEALARTALPLLETGDVLDIASGDGVLAELVAPHAKRYICIDTSARVVAAASERLRRLPNVEVREGDMHALPFKDGSFDLVVLMHALTYASKPAQAVTEAARVLRPGGRLLLCSLARHEHKAAVEAYGHVNLGFSDKELRKFVDKAGLQVSSLETVTREKRPPHFEVISLIANKP >LR134301.1|VEE50981.1|598906_600001_-|5-methyltetrahydrofolate--homocysteine-methyltransferase MSALPWLHPDRVNALLDALRQRILIIDGAMGTMIQRHGLQEDDYRGERFADGYDHAHGPGCDHGTPEGHDLKGNNDLLLLTRPQVIADIHTAYLEAGADLVETNTFNATSVSQADYHLEHLVYELNKAGAAVARACCDAATASTPDKPRFVIGVLGPTSRTASISPDVNDPGFRNTSFDELRDTYREAIDGLIDGGADTIMVETIFDTLNAKAALYAIEEAFDARGARLPIMISGTITDASGRTLSGQTAEAFHASLAHARPLSIGLNCALGAEAMRPHVETLSQVSNCHVSAHPNAGLPNAFGEYDETPEEMATTLRGFAEDGLLNLVGGCCGSTPDHIRAIAQAVAGLPPRALPGSQEQQAA >LR134301.1|VEE50991.1|615331_615883_-|Protein-of-uncharacterised-function-(DUF2939) MKKLTALAVLLVLSLAAWWFGGPYMAVHGLSKAIEERDTARLQRYVDFPRVRSSLRAQLNDYLVRQAGPDVAASAFGALLYGLGDQLGGAAVETMVTPTGIGAMLQGHVLWKRGRNELQGGDAFGATEPARPLKNAEHHFEALDRFVIDVDRGPDQPPMKVVLEPQGLRWKVVDLQLGMSGSP >LR134301.1|VEE50992.1|615938_616625_-|Uncharacterised-protein MSTYFSDASFKFLRSLARHNDKAWFNDHRQQYEDHVRQPFLRLLGDLQPALAEVSEHFRADTRGVGGSLFRIHRDARFSNDKSPYKTWQGARLFHERRREVAAPSFYVHLQPGESFVGAGLWHPEPETQRRVRHFILDNPGSWKAAAHAPALRKRFDFEESEKLVRPPRGFPADFEFIDDLKHRNWVMWRSLDDATMTGPRLLSTLGKDLAGLGPFVDYLCAALDLEF >LR134301.1|VEE50993.1|616621_618061_-|Exodeoxyribonuclease-I MADSFLFYDLETFGQDPRRTRISQFAAIRTDADLNEIDTPVSFFVRPADDLLPSPMATLVTGITPQQALAEGISEAEAFDRINEQLSRPGTCALGYNTLRFDDEFVRYGLFRNFHDPYEREWRNGNSRWDLLDMLRLMRAMRPDGIRWPLREDGATSFKLEHLAEANHVREGDAHEALSDVRATIGMARLFKQSQPRLWDYALKLRDKRFVGGLLDVAAMKPVLHISMRYPASRLCAAPVLPLAVHPTINNRVIVFDLEGEIDDLLELPAEVIAQRLYMRASELPEGAARVPLKEVHLNKVPALIAWNHLRADDHARLGLDVAAIEAKVERLRAFAPQLAEKARQVYNQPRAATVADVDASLYDGFLGNGDKPLLALARTTAPEQLAALEGRFRDPRLPELLFRYRARNHPDSLAPPERQRWQDYRRQRLLGDGGLGELNLPQYQQQLDALAAEAPEDTRRQALLQSLRDWGQHLQETL >LR134301.1|VEE50994.1|618060_619428_-|kynurenine-3-monooxygenase MIAHASRSLSIIGAGLAGSLLAILLSRQGWRITLYERRGDPRVADYESGRSINLALAERGRNALRQAGVEDEVMARAVMMRGRMVHPRDGEPQLQRYGRDDSEVIWSIHRSDLNTTLLELAEQAGATVHFHRRLHTVDFDAGYARFIDDRDDSPHDIHFDTLIGADGAGSALRAAMNRRAPLGEDIAFLDHSYKELEIPPAADGSFRIERNALHIWPRGHYMCIALPNHEGTFTVTLFLPNQGNPSFATVNTGAQAEALFAREFADTLPLIPNLRADWEQHPPGLLGTLTLERWHQQGRAVLIGDAAHAMVPFHGQGMNCAFEDCVALARHLMEADDLEGAFAAFETERKPNARAIQQMALENYLEMRDRVADPAFLLQRELEQELQRRWPTRFVPHYTMVTFLHTPYAEALRRTELQRDMLVAATTGHDSLDNIDWAALEAQIHAQLPVLEGAH >LR134301.1|VEE50995.1|619459_620734_-|Kynureninase MSDLLSRTHAIALDAADPLRPLRNEFLIPRHGGGEQTYFVGNSLGLQPRGAQAAVQEVMKQWGELAVEGHFTGPTQWLSYHRLVSAQLARVVGALPSEVVAMNTLSVNLHLMMVSFYRPTAERPVILMEAGAFPTDRHAVEAQIRFHGFDPAECLVEVQPDEVNGTISLAAIERAIAEHGPRLALVLWPGVQYRTGQAFDLDAITRAARLQGARIGFDLAHSVGNLPLRLHDVAPDFAVWCHYKYLNSGPGAVAGAFVHERHHRDTTLPRFAGWWGHEEATRFQMAPQFTPAIGAEGWQLSNPPILGLAPLRASLDLFERAGMEALRSKSLALTGMLEALVRARLSSVLDIITPAEPQRRGCQLSLRVIGGRERGRALFEHLRGIGVLGDWREPDVIRISPTPLYNRYLDVHHFVEEVEAWAGL >LR134301.1|VEE50996.1|620872_621394_-|3-hydroxyanthranilate-3,4-dioxygenase MLASPINLHAWIEENRHLLKPPVGNKMIDNGDFIVMVVGGPNSRTDYHYDEGPEWFYQLEGEMVLKVQEDGAVRDIPIRAGEIFLLPAKVPHSPRRPPGGIGLVVERKRLPHEMDGVIWHCERCNHKLHEEYFALLNIETDLPKVFARYHASLELRTCGQCGHVDPLPAPAAG >LR134301.1|VEE50997.1|621414_622077_-|carbonic-anhydrase MKDIHRLLQNNRDWADRIAKEDPEFFQQLSKQQHPEYLWIGCSDSRVPANQIIGMAPGEVFVHRNVANVVAHTDLNCLSVVQYAVDQLKVKHILIVGHYGCGGVHACLHNTRVGLADNWLRHVGDVVQKHQGILDAIEDDELKHARLCELNVIEQVANLCRSTIVEDAWARGQKLMVHGWVYSLKNGRVSEMGIDVGGPEELKPAYEKALSYVPRQGRRD >LR134301.1|VEE50998.1|622151_622790_-|transcriptional-regulator-protein-Pai2 MFTPRAFAETDLLWLDRLLARDPFVTVLTVGSDGLPELTRMPVLHRRDGDQIELRGHWARANPQSRHSGAAKVLVDGPHGYVSASWYPDKEPAARVPTWNYASAELRGQLQTFDDADALAELVGAISDRFEASVGQAWQFDATRAEHGPELRAIVGFRFQVEHVQLKLKLSQNHPDANQQAVIAALDALASPSSHELAQWMRWHREQSASSG >LR134301.1|VEE50999.1|622797_623112_-|Uncharacterised-protein MNDITASRDSWWLASLGNTLIWARLRVRPAGTAEVLDSDGNTLSYDSEDTARSQLFDAEFVEYDGLDEEDALVRGFSLHEVQPPQADSDEGLRGRMIQSLGGRA >LR134301.1|VEE51000.1|623119_623419_-|transmembrane-protein MTLFFALCFVGVAVAGFSAFVIFWPLTLVHVRDRHPALAERFGSGAFLKPDALAWLLRRDYRQQPDRSLSGLATPAWVSLLTLLAGLGMAALLWLASLW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134301_2 | 2482609-2482698 | Orphan |
NA
Consensus repeat of LR134301_2
|
1 spacers
spacers of LR134301_2
>2.1|2482632|44|LR134301|CRISPRCasFinder CAGCAGCAATACCAACCAGGTTGGCACCTACCACACCAGCAATA |
CRISPR arrays and Neighbor proteins around LR134301_2
The CRISPR arrays of LR134301_2 >merge|LR134301|2|2482609-2482698|CRISPRCasFinder CCAACCAAGGTTGGCACCCACCACAGCAGCAATACCAACCAGGTTGGCACCTACCACACCAGCAATACCAACCAAGGTTGGTACCCACCA >LR134301|2|2|2482609-2482698|CRISPRCasFinder CCAACCAAGGTTGGCACCCACCA CAGCAGCAATACCAACCAGGTTGGCACCTACCACACCAGCAATA CCAACCAAGGTTGGTACCCACCA
>LR134301.1|VEE52717.1|2482047_2482551_-|Transposase-and-inactivated-derivatives MLFCMNSTRLLLGRHSSIGQSYILTTNVHGRAALFANDTVAVTAMQEFQRLDLGGLTHSIAYVVMPDHVHWLMQLRAASLDVVMKRFKSRTAVHANRVLGRVGRFWQSCYHDHAIRSDESLFRHAMYVMANPIRAGLATQLGEYPHAWCEWELEGSCDKVEFMGADR >LR134301.1|VEE52716.1|2481180_2481978_+|Poly(3-hydroxybutyrate)-depolymerase MVRSMSRWLPLLAFLMMTGCTSLPDSARGRFEARAVKVEGETAYYQVFIPAGVEATAPTKLPVILFLHGSGERGADGVKQTHAGLGPYLRAHPDFPALVVFPQVPGHEEWSGRNNRAAVAALDATIAEFGADPARQYLTGMSMGGYGSWNIALDDPHRFAAIVPVCGAVLAPRAKRPTLFVEQVAQEADPYTVIAKRLQHTPIWIFHGALDDVVPPEDDRRLHAAFQSAAARDVRYTEYPEGNHNAWDATYADPAMWEWLFAQKR >LR134301.1|VEE52715.1|2480842_2481109_+|Uncharacterised-protein MATTIENYFQPGWRDQQHTCPACEWKGCSRAMVMELDEDATEYDCPVCENPMLVVLHPDMTQVQAAAAEGNAEAQEQLDIIASFPRPQ >LR134301.1|VEE52714.1|2479417_2480779_+|magnesium-transporter MAEAVRHDKTARQLRMLSDALDSGRLGPVRRLVNTLAPAEIGNLLESLPPGKRAIVWGLVDPEDDGEVLVHVGEEVRESLLADMDPDEIIAAVEDLDIDDLADLVEDLPDTVIDEVLKSMDRENRERLEQVLSYPEDSAGRLMNPDVVTVRADVNVDVVLRYLRLRGELPDHTDHLFVVSRRHQYLGRVSLAALVTHEDTTPINRLIDDEQPAIDVGESDQEVARQFSDHDWISAPVVDDNNILIGRITIDDVVDIIRGQAEHQALGAAGLDEDEDLFSPVWRAMRRRLMWLSVNLCTAFLASSVVGHFEGTIDKLVALAVLMPIVAGLGGNAGTQVLALMVRGLALGQVGASNARTLLWKEVRVALLNGVMLGSVLGLIVLAWFHSPGLSAVIAIALTCNLLFAALAGVLVPLTLKRFGFDPALASGIFLTAVTDSMGFFTFLGLATLVLLH >LR134301.1|VEE52713.1|2477514_2479284_+|PTS-system-phosphoenolpyruvate-protein-phosphotransferase MPRARAGQVSSGQRPVQLLAGHGAARGTAMGRARVRLPHALEVAEQRVAAHQVEAELARLHRAVDAARMEMHELRQRLQGALNQEVGEFLDLHALLLDDPELLFGLDELIRSGPYSAGYALRLQRDRLAKVFDGMDDAYLKSRMDDLDHVIGRIHAFLQPERPPAVKGLAGEILVCDNIAPSELAQLQTQGVVGIVTAAGSALSHSAILARSLHLPLIVNVPQLLSRIADGDVLIIDGADGSITANPQADNLRDYRARLKEHAREQRELGRLRSKPSRTRDQVDIALLANAESLEDVTQAHALGAQGLGLYRTEFLFLQRNELPDEEEQFQTYRDAALGMSGRPVTIRTLDLGADKADRTGLTLSNEENPALGLRGVRLSLARPKVADTQLRAILRASAYGKLRVLVPMVSTREELLAVRRRMLKLTEQLRGEGHMVSDHVPLGAMIEVPAAALALESFIDLVDFLSIGTNDLVQYLLAADRNNEALGELYSPLHPAVLRLLQMVIETGARHRIPVAVCGEIAGDARMTPLLLALGLTEFSLHPGTLLEVRRAVRDADLGALRARAPKLLAARDRRAIERWLALVADTP >LR134301.1|VEE52712.1|2477242_2477512_+|phosphocarrier-protein,-nitrogen-regulation-associated MLERELTVSNRLGLHARATAKLVQTLAPFRCNVTMAAKGREINAKSIMGVMLLAAGQGTPVTVRINGEDEAAAMEAVVGLFERRFDEDN >LR134301.1|VEE52711.1|2476857_2477250_+|PTS-system-fructose-subfamily-transporter-subunit-IIA MTCGILLVTHPGVGTALLDVATRLLRQLPLKTEAFEVPFDADLDALLPLASAALRRVDGGEGVLILTDLYGASPANLAGQLARLGTPVRRVSALSLPMLLRVMNYPEQGLDQLPATAAAGTRNGAIVDDA >LR134301.1|VEE52710.1|2475551_2476436_+|ATP-binding-protein MSTATPSAPTLIIVSGLSGSGKSVALKTFEDQDYYCSDNLPINLLPDFVRSLLANHDGSAPRRLAVGIDVRGQSDLSQLGNWRQLATDAGVEVKVLFFEASDEAVLKRYADTRRRHPLSQLGLSLPEAIARERELTAPLRREADAVIDTSNLNVHQLRRRIITEFALDHATRLSLLFESFAYKRGVPAEADFVFDARVLPNPHWDPDLRALSGREPGVRDYLEAQPDVQRYLAQLMDFLDTWLPKLGDGTRSYVTVAFGCTGGKHRSVFLAERMARHAREMGWEDVATYHREQD >LR134301.1|VEE52709.1|2474573_2475524_+|HPr-kinase/phosphorylase MNTSITARELFEQQRERLGLRWAAGKSGEKRELEAGNTVSRRPSLAGYLNAIYPNKVQILGTEELSWLDALEPRQRWETIEKIMQSHPLALVLTRNQACPEDLRAAADESGTPLWLSPKRGHELLNHLSYHLARTLAPRVILHGVFMEIYSIGVLITGEAGSGKSELALELLSRGHRLVADDAPEFTQIAPDVLDGTCPELLQDLLEVRGLGVLNVREMFGDTAVKKNKYLRLIVHLTKPMTEPTPHGYERLTGDSGTRHVLDLDVPLITLPVMPGRNLAVLTEAATRLHILRTKGIDPAAMFIARHSNLLERRTP >LR134301.1|VEE52708.1|2474124_2474577_+|PTS-transporter-subunit-IIA-like-nitrogen-regulatory-protein-PtsN MPLTDLLAAVQTQLCTATDRDSVLQAAAGLLACRQANAEQIYLNLCQREALGSTAIGHGIAIPHGRAPALDRPRGALLRLATPVDFGGDEPVDLVFAMAVPAHYTHQHLMLLSELAELFSAPDIRQALRAAGDARALREALDMTPPASAA >LR134301.1|VEE52718.1|2482704_2483655_-|aspartate-carbamoyltransferase MTAQQIDASGRLRHLLTLEGLPRETLLQLLDRAGQIRDAAVGRVGNKRHVLAGSAVCTLFFEPSTRTRSSFQLAAQRLGADVLNFDASTSSTRKGETASDTLRNLEAMGVRGFVVRHPDDGAVAALAEAAGEGTALINAGDGRSAHPTQGLLDMLTLRQAKGPDFSKMKVVIVGDVKHSRVARTDLHALRTLGVGEIRVCGPQSLLPDDETLKGCVVGDDFDAMLEGVDALMMLRLQRERMEEGLVPSLEQYHAQYGLNAARLARAGKDAAVLHPGPINRGVEVTDEVADGPQSWVLRQVANGVAVRLAVLETLLG >LR134301.1|VEE52719.1|2483671_2484166_-|Holliday-junction-resolvase MSEPITPAPDSAAPAIRRDGTVLGFDVGSRRIGVAIGSAFAAHARAVAVVDVHGNGPDWTAIERLLKEWKPDGLVVGDPLTLDGQDQPNRKRAQGFARQLRERFKLPVVMIDERSSSVEAARRFAVERAEGRKRRRDAAALDAVAAAVIIDRWLSSPDDATPIP >LR134301.1|VEE52720.1|2484158_2484725_-|Uncharacterized-ACR,-COG1678 MPVTPTSLADHLLVALPSLLDATFARSVALICQHDENGAMGVLVNQPSEYTLGEVLAQMDITTGDGDLQARMVLNGGPVHPERGFVIHDDARAWDSSLIVGDGLYLTTSRDILEAMARGEGPANAVVTLGCAGWGAGQLESELSENSWLTVPADAELVFQVPLEQRWQGAASRIGVDLFRLTDYSGHV >LR134301.1|VEE52721.1|2484816_2486604_+|transmembrane-protein MPSSSPLTLPARSAIVLIALLQGLMLYSAQELSDAWPFRDIGWRYCWYAWVLAIPSAVALSLVELGQRRLWLQAALGSAVVLALAAWTGWNLNGETALDSGALQIPLTLGMAVAVFVALPWWQFQLQHGHWRASYPELFERAWQNGLTLALAALFTGLTWLLLWLWAALFQLLEVTVFRDLFRQDAFIALATGSLVGFGVLIGRTQHRAIQITRQVLFAICRGLLPLLSFIAVLFVLSLPLTGLEPLWKTRSAASLLLVLSLLLVTFTNAVYQQGDDTAPYPLLLRRLVEASLLALPVYAGLALYALALRVAQYGWTVDRFWAVLIALAVAGYAVGYAVAVLRRQSRWLQTLEPVNRWMCWAVLALALLGNSPLLDPVRLTLPSQLARLRADPPAITSSDVNVLRFDLGRRGVQALRDLQRDPAITADANAPQVIAAALARTSRWDDGQRLDKGPQDVAALQRALKLAKGSSSPPDDWWQALATRAIDGESCAQSERDCLIAHRDLDGDGNTDVLLCELYTIRGPDCVLYARGRDSQWRRAGSLFGTVSGQAEAINQALRDGKLTLEPPRWPMLSIGGHPAVAIDPEPESNESSP >LR134301.1|VEE52722.1|2486600_2487143_+|DNA-3-methyladenine-glycosylase MSGYCLIAPGHPVHDYYHANEYGFPQREERELFERLVLEINQAGLSWETILKKREGFRAAYDGFDVDRVAAYAEQDIERLLSDAGIIRNRLKVLAAIHNAQVIQQLRASHGSFAAWLDAHHPRSKADWVKLFKKTFRFTGGEITGEFLMSLGYLPGAHAEDCPVHARLLTLSPPWLQVSR >LR134301.1|VEE52723.1|2487216_2487972_-|transporter MQIVSNTAPHPLLPTRMSLPSHGPAPCDDADGHLVTAGPLTPPPDSAGTTADEKALRHSIAEDVQGMVLATMVASLGLAIFAKGGLMIGGMAGMAFLLHYAMGWNFGLVFVLVNLPFYWVALRRMGWEFTLKTFAAVTACGVLTDLLPRWIDFSHINPLYSAIVGGALSGLGILFFIRHRASLGGIGILAVYLQRTRGWSAGKVQMSYDACLMVAAFFVLSPSKVLYSAIGAVVLSLVLMFNHRPGRYMGV >LR134301.1|VEE52724.1|2488034_2488247_-|Uncharacterised-protein MSTDTKPKGPASYFPSIEKTYGQPVAHWLGLLAGKKGLKHMELVSFLKSEHGLGHGHANALVAHHLTGKG >LR134301.1|VEE52725.1|2488447_2489473_+|LysR-family-transcriptional-regulator MDVLAPSPATSATPPPTGSQLLTLPLTLIRQVPGAARPYHPLMPRENLNDLQAFVHVAREGSFTKAAAQLGVSQSALSHAMRGLEQRLGVRLLTRTTRSVSTTEAGARLLDTLGPRLAEIEDGLAALAEYRERPAGTIRINATGHAAEYIAWPRLAPLLQQYPDLKVELAADYGLADIVAERYDIGIRLGERLARDMVAVPISPPLRMRVVGAPSYFRQHVVPRHPDELADHNCVTLRLPTHGGLMPWDFGQDGNELSVRVTGQWTFNTMGMTRAAALAGSGLAWLPEDQVQPMLGDGRLQSVLDDWCPHFDGYYAYYPSRRHVTVAMRTVLDALRGPMKG >LR134301.1|VEE52726.1|2489535_2490519_-|aldo-keto-reductase MQTRELGRSGLKVSALGLGCMGLTHAYGQPVERSQGIALLHAAVERGVTFFDTAEVYGPYTNEDLLGEALAPYRDKRVIATKFGFKDARTDAGLDSRPENIRAVAEASLKRLRTDHIDLFYQHRVDPNVPIEDVAGTVRDLIAEGKVGHFGLSEASAATVRRAHAVQPVTAVQSEYSLWWREPERELLPTLQELGIGFVPFSPLGRGFLTGTINADTTFDANDFRNSVPRFEVEARRANQALVDRISTIAAARGATPAQVALAWLLAQAPWIVPIPGTTKVHRLEENLAAADLQLAPEELQRIAQALDEVSIVGERYNAQRAAQAKG >LR134301.1|VEE52727.1|2490726_2491890_-|twitching-motility-protein MPHWEPAVNTTATTIDFTSFLKLMAHQRASDLFITAGMPPAMKVNGKISPITQTPLTPQQSRDLVLNVMTPAQREEFEKTHECNFAIGLSGVGRFRVSCFYQRNQVGMVLRRIETRIPTVEELSLPPIIKTLAMTKRGIILFVGATGTGKSTSLAAMIGYRNQNSTGHIITIEDPIEFVHKHEGCIITQREVGIDTDSWEAALKNTLRQAPDVIMIGEVRTREGMDHAIAFAETGHLVLCTLHANNANQAMDRIVNFFPEDRRNQLLMDLSLNLKGVVAQQLIPSPDGRSRKVAMEILLGTPLVQDYIRDGEIHKLKEVMKDSVQLGMKTFDQSLFELYQAGEISYEDALRYADSQNEVRLRIKLSQGGDARTLSQGLDGVEISEIR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134301_3 | 3039799-3039942 | Orphan |
NA
Consensus repeat of LR134301_3
|
1 spacers
spacers of LR134301_3
>3.1|3039846|50|LR134301|CRISPRCasFinder GTGCCAAGCCCCCGGTAGGTGTCGACCTTGGTCGACACGCATTACACGGC |
CRISPR arrays and Neighbor proteins around LR134301_3
The CRISPR arrays of LR134301_3 >merge|LR134301|3|3039799-3039942|CRISPRCasFinder GCCGACCAAGGTCGGCCGCTACCAGAGCCGCGCATGGCGACCGCTGCGTGCCAAGCCCCCGGTAGGTGTCGACCTTGGTCGACACGCATTACACGGCGCCGACCAAGGTCGGCCGCTACGAGAGCCGCGCATGGCGACCGCTGC >LR134301|3|3|3039799-3039942|CRISPRCasFinder GCCGACCAAGGTCGGCCGCTACCAGAGCCGCGCATGGCGACCGCTGC GTGCCAAGCCCCCGGTAGGTGTCGACCTTGGTCGACACGCATTACACGGC GCCGACCAAGGTCGGCCGCTACGAGAGCCGCGCATGGCGACCGCTGC
>LR134301.1|VEE53218.1|3039192_3039756_+|transmembrane-anchor-protein MKRSLAIALLAGALVAGGASAAPRTVTDADAPRALQADGPVSVKWEDPAKFTEIRQSTNRFEAERGDWVQQLARYLQTTAAKPLQPGQTLDVTLVDIKRAGDYEPWHGPRGRDIRIMRDIYPPRISLQYTLKDASGRIVSEGDARLSDTGYLHNLGLRSDSDPLRYEKRLIDDWVKRQLASQATAAR >LR134301.1|VEE53217.1|3036959_3038948_-|PAS/PAC-sensor-hybrid-histidine-kinase MMVPAPEPDLHHGVLERINAGFCVIQVLFEGDRAVDYRFIEVNDAFERHTGLKDACGQRMSLLEPNHEEDWFRIYGEVARSGRPAQFEMEARALGRSFAVDAVRVGRPGEDKVGILFFDITARKQMEVELGESEARFSALADGLPMPVWVLDERGHARFVNSAFSEFFGGDETRVPEDVWRGLVHPDDASVFEYELQEALKAQRSMHALVRARRADGEWRWLEMNARPRFSRMGRFIGLAGSSPDVTERREIELAREELLQSERAARSAAENMARLKDEFLATLSHELRTPLTTILGWSELLLQRVDNTSPLYKGLNVIANSAGAQKRLISDMLDLSSMLLGKVQLEVEILDLRDLLGEAIGAQELVAEGKALDVSLLLPEQPCLVLGDATRLQQVLWNLLSNAIKFTPAEGRIDLQLRADGNHWVITVRDTGDGIAPEFLNHLFSRFRQADGTTTRRHGGLGLGLAIVQQLVELHGGTVTAASEGHGHGATFTVRLPKHIPDAERRPLREVISGPILEPVIVEPYPLRGMHVLAVEDQPEVLEYLRRMLEEQGASVSAASSAGEALALLADTGHLQYHVMLTDIGMPGMDGYGLVRTLREDMGVDAMTLPAVAVTALARADDRRRALASGFQEHVAKPYSVAQLVSAVRKVQVPRVDSALH >LR134301.1|VEE53216.1|3036645_3036939_-|Protein-of-uncharacterised-function-(DUF3247) MSRTAPRIHTDPAQIARLEALLPQLEGETQVQLTLHDGRRLLGTVAVKPTVQQYRNEAGDEGSNGQLRLDDYDTPVQQHHVWLDEIASVNRLPPKAP >LR134301.1|VEE53215.1|3036311_3036575_+|Uncharacterised-protein MRALFVGGVVDNSEMDLEGSHPPVHYPEDTGGGHSRYRLHQVGHGADGSVAYAVYGAPDLADDEVARVAEERAYARRFEATPTLFEH >LR134301.1|VEE53214.1|3035090_3036155_-|methyltransferase-small MASSDDAPLQTLFLPFSQGALRWPEGPVAFLRARDGWPLREAAGNREVHCEQSFAPFAQPLQQAAGWTVSGQLDDVAGKGRYPLVLVLPPRQREEARALFARALALVADGGRIVACQSNNEGARSGEGDLKQLTGLGGSLTKNHCRVYWTAPMQGQHDADLAKRWSALDAVRPIVGGRFLSRPGVFAWDRIDPASALLAEHLPADLAGRAADLGAGYGYLSRELLERCPKITALDLYEAEQRALALAELNLAPPPRPLPLRFLWRDVTAGIEPGYDVIISNPPFHTPSRADRPDIGQRFIAVAAQALRPGGRLYVVANRHLPYEYTLNESFGAVRVVAERDGFKLVEAVKGKGK >LR134301.1|VEE53213.1|3034392_3035094_-|ribosomal-small-subunit-pseudouridine-synthase-A MKLVKLIANLGYGSRKQVQWMFREGRVTDADGEVLYADDQVPHEAIRVDGEPLDPPVGLSIALHKPAGYTCSTKDTGRLIYDLLPPRFRDRDPVLSTVGRLDRETSGLLLLTDDGSLLHRIISPKSKLPKVYEVELNDDLRGDEVALFASGTLMLESEKTPLLPAELEVLDARRARLVLHEGRYHQVRRMFAATGNHVQALHRSRVGGLDLQGLDEGQWRQLTSTDLDTLFAP >LR134301.1|VEE53212.1|3033712_3034396_-|hydrolase MTTIAALPFLPDAVIFDMDGLMIDSERVSLACWSEAADEFGLGLDEAVFLRMVGLGDRDTHALLRAQGIEDSVIDAVAARCHELYEARTQTGLPLRPGILELLELLKAHAVPRAVATTTRQPRANRKLAAAGLLPYFDAVITSGDVARPKPAPDIYLLAAQRLGQAPERCLALEDSPAGTRAALAAGMTVIQVPDLVHPDEELRALGHRIVGSLVDAHALLLPLLPR >LR134301.1|VEE53211.1|3032299_3033508_+|3-oxoacyl-ACP-synthase MRRVVITGMGITSCLGNDLDTVSAALREGRSGITALADHAEAGLRSQVGGRVDLDLDALIDRKQKRFMSDAAAFAYLSMRDAIADAGLSPEQVSNLRTGLIAGSGGGSSEWQIGAVDLLRERGVRKVGPYMVPRTMCSTVSACLATAYQIKGVSYSLSAACATSAHCIGAAADMIRHGAQDIMFAGGGEDLHWSMSVMFDAMGALSTSFNETPASASRPYDKDRDGFVIAGGGGVLVLEDYDHAVARGAHIHAELIGYGVTSDGADMVAPSGEGAVRCMKMALQGVDRPLDYLNTHGTSTPLGDVTELNAIREVFGDAVPPLSSTKALSGHSLGAASVHEAIYCLLMMRDGFVAGSANIGELDPKVESFPILRESREQKLDTVMSNSFGFGGTNAALVFGRV >LR134301.1|VEE53210.1|3031784_3032300_+|3-hydroxydecanoyl-ACP-dehydratase MTRLHAFNREQLLASARGELFGAAAGRLPNDPMLMFDRITDIREDGGPHGKGMVRAELDIRPDLWFFGCHFIGDPVMPGCLGLDAMWQLTGFFLTWLGAPGKGRALGCGEVKFTGQVLPEAKRVRYEIDISRVINRKLVMAQSDARMYVDDREIYSARDLRVGLFTETGSF >LR134301.1|VEE53209.1|3030498_3031593_-|DNA-polymerase-IV MTRLRKIIHVDMDAFYASVEQRDDPSLRGKPVVVAWRGARSVVCAASYEARVFGVRSAMPALRAERLCPDAIFVPPDFARYKAVSRHVREIFLRHTDLVEPLSLDEAYLDVTEPKSGIELATDIARTIRTQIREETNLTASAGIAPNKFLAKIASDWRKPDGQFVIPPQRVDAFLLPLPVNRVPGVGKVMEGKLAARGIVTCGDLRQWALIDLEEAFGSFGRSLYNRARGIDERPVEPDQQVQSISSEDTFAEDLPLEDLGEAIVQLAEKTWNATRKTERVGHTVVLKLKTAQFRILTRSFTPERPPESMEELRDIALALRARVDLPAETRYRLVGVGLGGFREKEPVVQGELFEHDMNNPTGT >LR134301.1|VEE53219.1|3040037_3041105_-|alanine-racemase MRPARALIDLGALRSNYRLARELGGGKALAIIKADAYGHGAVRCAQALEGEADGFGVATIEEALELRQVGIRAPILLLEGIFEPSDMALVAEHDFWFAVGSPWQLEAVAAFDSPRPLTVWLKLDSGMHRLGLDADSFRAAHARLSALPQVERIVLMTHLARADELDSERTHQQAATFARAIEGLHGETSVCNSPALLGWPDVRSDWVRPGLMLYGANPLPDNTALTGRLRPVMTMQSKVIAERWIEAGEPVGYGARFVAKARTRVGVVALGYADGYPQFAPNGTPVLIDGQPGALIGRVSMDMLTVDLTAHAQAGIGSVVELWGSAPTLSELAPRCGVSAYQLPCAVKRVAKVYV >LR134301.1|VEE53220.1|3041083_3042388_-|D-amino-acid-dehydrogenase-small-subunit MRVLVLGSGVIGTTSAWYLRQAGFEVTVIDRQPGPALETSFANAGQLSFGYTSPWAAPGVPKKAIGWLFEKHAPLAIKPGMDLAQYRWLWQMLRNCTHERYAINKARMVRMSEYSRDCLNELRAQIGIEFEGRDLGTTQLFRTQQQLDASAQDIEILAQYGVPYEVLDRAGIIQAEPALAHVDGLVGALRLPRDQTGDCQLFTRRLAQMCVDAGVEFRFDQDITGLEFDGDRITGVRIDGKLETADRFVVALGSYSPALVAPLGMRLPVYPLKGYSLTLPITDPAMAPTSTILDESYKVAVTRFDDRIRVGGMAEVAGFDLSLSQRRRETLELVVSDLYPKGGDLSRAQFWTGLRPATPDGTPVIGATPFRNLYLNTGHGTLGWTMACGSGRYLADLMSARQPQISTEGLDIFRYGQYGHAPQQENRTCVLPAR >LR134301.1|VEE53221.1|3042537_3043017_+|AsnC-family-transcriptional-regulator MATRIRELDKIDRKILRILQAEGRISFTELGERVGLSTTPCTERVRRLEREGVITGYHAHLDPAAVKASLLVFVEISLAYKSGDIFEEFRRAALKLPNVLECHLVSGDFDYLLKARISEMASYRKLLGSTLLTLPHVRESKSYIVMEEVKETLSLPIPD >LR134301.1|VEE53222.1|3043072_3043228_-|Uncharacterised-protein MKNQQNRHPGKEPQGRNPQQQQQQQQQQMEPQQHKGGKQQEQRSQKHPQQR >LR134301.1|VEE53223.1|3043399_3043774_-|Uncharacterised-protein MKPTLSLLLMALLPTLALAQVPTADPTATRSATTVAPQPVVPPPQAARPQPQVLPSPQPAQPIKSTGPARIAPAPVPKPADKVYDRNGRIVPGVRPAGPNRVFDSRTGRYYDSVPAGDGQQIKR >LR134301.1|VEE53224.1|3043914_3044382_+|peptide-methionine-sulfoxide-reductase-MsrB MTAFDLTPPTATQTEALVAGLSSEERRVLLQHGTEAPFCGVFLDNKREGVYCCRLCALPLFRSSTKFDSGTGWPSFFAPFDPAHVREIRDTSHGMVRTEITCARCGSHLGHVFPDGPPPTYERHCLNSVSLSFTGNGEPWPDPLQRGGAESGVAG >LR134301.1|VEE53225.1|3044473_3045328_+|flagellar-motor-rotation-protein-MotA MLIIVGFLVVIISVIGGYLGAHGRLGALWQPYELVIIGGAALGAFLVGTPAKTVKQTLQAMVGVFKGPRYKQQDYIDVLSLLYELLNKARREGFMALEDHVERPAESALFGNYPKVQADHHLIDFITDCLRLMIGSNIEPHELEPLLELELEKHHAEAMAPSQVLTKVADGLPGFGIVAAVLGIVITMGSIGGDIVEVGGHVAGALVGTFLGILLGYGFVGPMAAAMEARAEQDSRIYESVKTALLACLRGYNPKIALEFARKTLPSNVRPAFSDFEQHLKTVK >LR134301.1|VEE53226.1|3045331_3046270_+|flagellar-motor-protein-MotB MAETKPTVIVRRVKKAGHAAHHGGSWKVAYADFVTAMMAFFLVLWLMATTNKNDRAAISEYFRNPSPLSGQNATPAPGMAGPGGASTSMIKLGGATDISRGSSNDPFQNQKEAVPQPVDQQQRDKQQLEALMKELQEAISKSQALEPFKDQLLLDLTPEGLRIQIVDKQNRPMFDLGSATLKPYTQQILHELANYLNHVPNRISLTGHTDITAYSAARGYGNWELSADRANAARRALVDGGLEDSKITRVVGLSSSVLFDKADPQNPINRRISIVVMTQAAEAAALAGAGPQVGLSAPTADPDVQAAQGEAK >LR134301.1|VEE53227.1|3046365_3046854_+|Uncharacterised-protein MSQNSKAKRDKRKKQQAKRPFLRLNAQQQVQNHAVLTNEDGQVVAAIGLQGREWLLAIGGQTMGNAENPVPMLAMLKHLANVQEKEGRKVNLEYSELLQKLLDTLAAESEQTADEYLDKLVAEFEGVDAAEGEEGEAVEGDAAEEAAAEAAPAADSDSKPQA >LR134301.1|VEE53228.1|3046867_3047143_+|Uncharacterised-protein MTVYVDDAVHPWRGQRWAHLMADTLAELHAMAAQLGIPPRAFQNKASGAHYDVTAELRAQAIALGARAISRHTDRDLVKSVIANARAQYRP |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134301_4 | 3423848-3423954 | Orphan |
NA
Consensus repeat of LR134301_4
|
1 spacers
spacers of LR134301_4
>4.1|3423873|57|LR134301|CRISPRCasFinder CCGAGCAGGAGCAGGAGCAGGAGCAGTGCCAGGCGATCCTGGCACCCACCAGTAGAA |
CRISPR arrays and Neighbor proteins around LR134301_4
The CRISPR arrays of LR134301_4 >merge|LR134301|4|3423848-3423954|CRISPRCasFinder GTGCCGACCGACGGTCGGCACCCACCCGAGCAGGAGCAGGAGCAGGAGCAGTGCCAGGCGATCCTGGCACCCACCAGTAGAAGTGCCGACCAACGGTCGGCACCCAC >LR134301|4|4|3423848-3423954|CRISPRCasFinder GTGCCGACCGACGGTCGGCACCCAC CCGAGCAGGAGCAGGAGCAGGAGCAGTGCCAGGCGATCCTGGCACCCACCAGTAGAA GTGCCGACCAACGGTCGGCACCCAC
>LR134301.1|VEE53565.1|3422987_3423734_-|N-acetylmuramoyl-L-alanine-amidase MHPIRTALMLSACLLLAACASTPQETRNPLATWVPSPNQNARTPVIIVIHHTEQKSVQQSLHTLRTANSGGPVSAHYLIGADGHRYQLVADERRAWHAGAGRWGTITDLNSASIGIELDNDGRSPFSAAQIESLIVLLRDLTTRLNIPPRQVIGHADLAPTRKQDPSRFFPWQQLAEAGFGIWPRAADGAAPGGFDAWNALARFGYPLDNREATVAAFHRRFRGRDDLPKTLDAEDARILHSLLLQMP >LR134301.1|VEE53564.1|3421744_3422920_-|MltA-domain-containing-protein MIDSTTFFKRRHWTLLAAAVLAGCSTTAPRTGSEVPTPATPPAATYAKAAWSALPPVSDSDLQAGFVAWRSSCTRLKNDAVWARPCATAATVVDKDPAAIRQFLQRDLDVYALRAGGHQADGLITGYYEPIYAGSLTRTEKATVPVYGTPDDLVVVQLESLYPELKGKRLRGRVEGKVLKPYDDAGTIAAKGAKAPVLAWLTDPMDLQLLQIQGSGRVRLGDGKQVRLAYAEQNGHPYRAIGRWLVDQGQLKKEDVTMDAIRAWARANPARVPELLRSNPSYVFFVRNPDSPEGPRGSLNVPLTAGYSVAVDRTVVPLGSLLWLSTTRPDGSPVVRPVAAQDTGGAIAGEVRADLYWGSGDAAGKLAGDMKQKGNIWMLWPKGVALPVPAQ >LR134301.1|VEE53563.1|3421345_3421660_-|Uncharacterised-protein MTYAKHNAKPGTKSAAKKYYKHWAESGLGKGNVLMEARGEKIVRQVEIYGTKMVWADEHAHSDDRFLLADQPVSFLDFDEDDEISAREFESAWKKAREATGYPV >LR134301.1|VEE53562.1|3420704_3420986_-|Uncharacterised-protein MSVVVDGFQRVADSMHEIKAEQAKLSDIQKQQSKAAYHAFTGCGKARYVGMLVEIESQTNPRGARARVAQTLDLTPARITQLLNSDKNRKNGK >LR134301.1|VEE53561.1|3420084_3420474_+|plasmid-stabilization-protein MGVALSVSLSSDHKGLIAKLLGAVDFDMADHFQPTADNYLGRISKPFVVAALTEAKQVNGNANKEGLLAMKKGVLAAEAEKRLAGTGWLPKPIWGPKAETKPAAKGKAPTVKKVRAKSAKTKPPAAFAS >LR134301.1|VEE53560.1|3418892_3419774_+|Uncharacterised-protein MGELHEACQVLQALQCSNLGRPASDASDRLHSRILWNAWSLVEGGEEELRARYVLRRQSLDGALMMRSDEFSAYRQGLRSLSLRRRKHLALQYPQLEAVLKWPFAALSHREWDMRTVVGWNRLFTSSAGLLGVRYSFPGDTASTDENRVPVMAYDLDGLYQRGDIYGFLMLACAYRLFYLQRLADRQWYAASHMIRALSGACRDPRVRPYAHELIAQTKQLLRLLPDTSFPIHVNDDVIWDQIRNDVHEPSYLMRRAAERRGVHIPEPVSPIISYRYQKCAPHQRPLLFARNG >LR134301.1|VEE53559.1|3415866_3418287_-|integrase-catalytic-subunit MSTDPQFGGLSFEDRHQLYTRIKASELSIRQIESMVAGSGTKTVGKAALRNVRVRHNSIKNGSARTVESHTCELVFAYELELDPEVLGYYVQVPCRRVQRTTASGRNHISTAHVDFLVFRRDRVELVECKPISWLESQLALPDSLWVKQGVVWTYSPYAEWAKQHGLVFRVWVSPTPTGVYLQNLEACYAVAGDALLRHECTAVSKVTGLIAKRPFLLASLLAEVPGFTPRLALWMLGKAAAFGPWRSTPAARTDRFHLYGTREQAEEADELLLGAIVKAQSQPVVDDPLLVATATDLQRAHQRLGRLALIEAGSLPQTRRMGALQRQVRKAVAEGRSALSACLTKYAFSGNRASRLLPEHEQAIETVIALHWNTGRVHRPKDLLYVFEEECLRLGVEPCGRARLDARRRSESPTRHALSTGGFRAYHASRTRTDPRNRSLPPIGYGHTLHVDSSDLDVRCAPNLIHYFPASKAKFYIGIDGATGYPMAHALVFGSARTDALALLMREYVKRQGFLPKLIHLDRGPENTSRWLQDFCHGEISLRFSPTAGSAWNGIAENAIKQVNEQVAQRFPGSTRPDQMGRKVDGKFKSRNNARTDFVRVHEEFLNFVYHDLPMTPGPDDTTPSENRLEALAAYGAVGTQCEWGDEFLIRTSIKVQTRKRIDRQRGVRTADGWFTSDELLGALRTENASEIRSDCCDPNVIYVKVRGAWFKAFHNRVQSDALLSDQERLFHLLWSPALRSQSARRKEEVARARYSRRVQAQAARPATEHLAPETMTSTNESLDIETVPPTIAQDIVPFDEREDY >LR134301.1|VEE53558.1|3414865_3415867_-|Uncharacterised-protein MKMVQHPAYEGPRKRLRMALDQHRPGHMVFIIGPSGVGKTTMRRSVMQEMFGRPACWGLGRIPLVETFATLPHGAYFSSRQLAVSILQELHAPTLAWLLDGSYLGEDAKLEIRRELATAASEWESLARTRQTEGEYWGMVQRSLRARGCKYVSIDQVTALLVNHRDKSPADHTLHLMAIAEATGVMFVMTGVHKATQLWSINSELRRRVTTVWVPPYSDKRRDDKLPFLRLLKSLSARYELSQDDLLIRMANDILAATGGVFAEVVELLGRAEIAAKQEGCNRILKRHIEISYYGSEDLRNLWRDIDAFEMSMVAGNVTERSEHVKARWGSPT >LR134301.1|VEE53557.1|3413487_3414798_-|Uncharacterised-protein MGLGTGLVESAEHYVARLAWTVGVTVRALCPPIRSAGGILQRAQAMGASGFCGPGRQFKRRVEHLERLAGVEHIRHGTFWVVDDLLAVTGVGRDTKRQRWCPQCFLEWDEEHSYEPLIWMVDVQQSCPVHRCALEAACRACSSFQPTGRDYRRRRHCYRCGQGLAGLGKLAALPSHHIWAEHALAGLIALCATPGQPQIPYEQYERFVRGLIEISLDQPNQPATLRAAMARLRSNAIRGRVTLRTLVNLSALQGITIPQMLLDPVAAASRPLIDLWSGYQALEFPGGRHGTKVVAFRQCIREILSKCGERYVPPMRFVLRSMKINRDFAREMCVDVYEDYEAAYQRQGGYQSRLHRDRAFVLALRMIGQNNQSPFAPCDARKVARHAAKAARVSLADAEAATRSAIHSSRALERAKGAMLCSTRDYSSSRASRQKT >LR134301.1|VEE53556.1|3410037_3411504_+|Predicted-P-loop-ATPase MSLGKELDPIVSESEPWDRDPLSREAEGKILANLISGLGDSPFVISLKGGWGTGKSVFLKRLGYHLERFHKIPVVRIDAWQSDYLDDPLLAMTSALTDRLASTENRVSTVVDSVITGLAGSAGKIALPVLSAIAGLAMPGGSQVVQLASNLPDLANNFLEWDKSRKTAEEKFRSSLSEAREKLKASLESDDESPIVIIVDELDRCRPDFAIKFLERVKHFFNVSGICFLIATDHQNLPQAVKTVYGDQVDGELYLRKFFDFEFNLPRPSLKDHAYQIFQSFPGTDPARDASAIRKRLLELRDPEGYEQFYDNTPEELERAEYSIYFGHIASHFEMQLRDSLQAHTLLMAFVRSFPKSSVRFPFVDCYISCLRFAAPGEYMKLISNTGPGLPDILRASNKASLSTIGALNAFLGIKPDTDAEEFKAATRRWMNNNSATRGIGFLAYASLFVRGLEHEARGTYPRLTFNADDYLGSVLRLTAAFTDTEEP >LR134301.1|VEE53566.1|3423983_3425414_-|Ammonium-transporter MKMRLLTGWQARFHLVCLLMLLSALAAGAWPGNAHAQAQVSPLPSESVAVEPLQDPVAAAAAPAVAEAAAAYDRGDVAWMLTSTLLVLLMVVPGLALFYGGLVRSKNVLSVLSQILVVFSLVLLLWVAYGYSAVFSAGNPFFGSFTEFAFLKGFTPDSVGNTPIKGLPDYLFVAFQSTFAGITTALIVGAFAERIKFRAVLLFSALWFTLSYIPMAHIVWGGGYLGELGAIDFAGGTVVHINAGVAGLVAAWFVGKRLGYGQTALKPHNVPFTYIGAMLLWVGWFGFNAGSAAAADTVASLAFLNTVLATAAAVLGWTLVEAIGKGKPSALGAASGAVAGLVGITPACGTVGPLGAIVIGLVAGVVCVWGVTGLKRLLKVDDTADVFGVHGVGGIVGAILTGVFSAQSLGGTKADLDIAHQVWVQVVSVGLTVVWSAVVTTLILLVVRSVVGLRVTEEAERTGLDVTSHGESAYEA >LR134301.1|VEE53567.1|3425436_3425775_-|nitrogen-regulatory-protein-P-II MKLISAIIRPFKLDEVREALSDAGVSGITVTEVKGFGRQKGHTELYRGAEYVVDFLPKIKIETVVTDERADAVIEAIQSSAGTGKIGDGKIFVTAVEQVIRIRTGEIGADAL >LR134301.1|VEE53568.1|3426122_3427532_-|glutamine-synthetase MSVENVEKLIKDNQIEFVDLRFVDMRGVEQHVTFPVSIVEPSLFEEGKMFDGSSIAGWKGINESDMVLLPDTSSAYVDPFYADPTIVISCDILDPATMQPYGRCPRGIAKRAEAYLKSSGIAETAFFGPEPEFFIFDSVRFANEMGNTFFKVDSEEAAWNSGAKYDGANSGYRPGVKGGYFPVPPTDTLHDLRAEMCKTLEQVGIEVEVQHHEVATAGQCEIGTKFSTLVQKADELLRMKYVIKNVAHRNGKTVTFMPKPIVGDNGSGMHVHQSLSKGGTNLFSGDGYGGLSQLALWYIGGIFKHAKAINAFANSGTNSYKRLVPGFEAPVMLAYSARNRSASCRIPWVSNPKARRIEMRFPDPIQSGYLTFTALMMAGLDGIKNQIDPGAPSDKDLYDLPPEEEKLIPQVCSSLDQALEALDKDREFLKAGGVMSDDFIDGYIALKMQEVTKFRAATHPLEYQLYYAS >LR134301.1|VEE53569.1|3427756_3428551_+|UDP-diphosphatase MSDLLSALLLGILEGLTEFLPISSTGHLLIAQHWLGARSDFFNIVIQAGAIVAVVLVFRQRLLQLATGFNQRENREYVFKLGAAFLVTAVVGLVVRKAGWSLPETVSPVAWALIIGGVWMLLVEAYTARLPDRDQVTWTVAIGVGLAQVVAGVFPGTSRSASAIFLAMLLGLSRRAAAAEFVFLVGIPTMFAASAYTFLEMAKAGQLGSENWTDVGVAFLAAAVTGFVVVKWLMGYIKSHKFTAFAIYRIALGAALLLWLPSGS >LR134301.1|VEE53570.1|3428624_3429449_+|heat-inducible-protein MNRKLTLLLPLALMAACSQTPAPAGTGGDDVAPAAAKAADQQTLAHLDAQRLQSQHWLLQQATAADGKRIDALFAREDKPVTLDFADGRLSVSNTCNRMGGGYTFDAGKLSVSAMASTMMACTDKALMALDEAVSSRLQGKLKAEQDADGTLTLTNAKGEKLVFTPEPTAETRYGGAGETVFLEVAAKTEKCSHPLIPDYQCLQVREVKFDDKGLKQGEPGKFENFYGNIEGYTHEDGVRNVVRVKRYEVKNPPADAPSQAYVLDMVVESAIEK >LR134301.1|VEE53571.1|3429601_3430213_-|phospholipid-binding-protein MQLSSNSLTNGAPIDREFAAGDAAGFAPDRNPHLAWSGAPAGTRSFLLVCVDPDVPTVPETVGRNDMTVPRDQPRCDFVHWVMADIPASVQEIAAGSCSDGFVVKGKPAPAGPAGSRQGLNDFTGWFAGNPDMAGDYLGYDGPYPPFNDERVHRYFFRVFALDVASLELPARFTAADAYRAMHGHVLAEAALHGTYTLNPALA >LR134301.1|VEE53572.1|3430223_3432293_-|TPR-repeat-containing-protein MQDQIIQALRQNQADQAVQLAQAWTRDEPGRADAHRWLALALQQQGNAEAAMEALQQALRLAPDDAQLHLQHAGLLLALRQFEGADEALVRTTGLDPNSFSAYLMQAHLAIGRNDFDEAQRISTLASRVEPEHPELLTIDGMVALRRGEADRALALLSAASKALPDDTRVLYALGFAYLGKDMLAFAEQSFRRVLELNPSLSSLHGLVVQLALRQGNVPAAAEAVQVALRQPELDVPAMRRLAGELALRNGQPLQALDYLLPLLETQPEDRQVLQLLLMSWQRLGREEEARARLDAVLDAHDQLHDVWLARLAIEQVGSESAVAAVERWMAAMPTHVPALEARLRLHDMAGEHAQAEAIAERIVSLEPGRVSGESRLVEGLLQRDPAAAVARVQALIEQAPEAHRADLRTWMGEIQDRAGQPQEALRTWMSLQTDQAPQRLPLPPQAKSPPSWPDKGSIEGDASSAPIFLWGAPGSGVERVATGLAAASPVLRSDRYTNTPPDDAFQNYNTLQDLASGVLTPERLVQRWREQLPARGLQSDTVIDWLLWWDNALLWALRPQLPQGRLVLVLRDPRDMLLDWVAYGAVAPLAMTSLAEASEWLTRALTQIATLHEEDLYPHVLLRIDQIGNDPHAMAELLGRLFERPMPPAAQLGAPRLPAGHWRNYRDVMSAAFAQLTPVAVRLGYPEE >LR134301.1|VEE53573.1|3432349_3433384_-|putative-hydrolase MALIPPPVLDAGTPTLHAADYQPPRWLRNPHLQSMLSSSRMRLQRGLLLLAATGAVSEELILDGGDGVRLQGWHSHVEGRQPKGIALLLHGWEGSAESSYMRMAAARMIEQGFDVVRLNFRDHGNTHHLNPGIFHSNLIDEVVHAAGDIAQRWPQLPLVAAGYSLGGNFVLRLALRAPAAGVPLLRVASVCPVLDPALTMDSIENGPAMYDWYFRRKWAGSLRRKRDLFPELSDCDDRVLKLDIRALTAWLVERHTSFGSLQAYFDGYSIAGDRLSALQVPADILMAQDDPVIPYATFSDWQLPRQARLETACWGGHCGFIENWRGDGFSERWVAQRLQRVLQA >LR134301.1|VEE53574.1|3433391_3434183_-|transmembrane-acetyltransferase MTSPTPAPRGGLARVCRYLYRVPLLLVHISVFLPLILIGMLPPWGELRVGEDTFGAKVVNWWQGGLMWIFGFRLSQIGKPLPGAVLFVANHVSWVDISILHSQRMMGFVAKREIASWPLVGWLAARGQTIFHQRGNTESLGGVMQVMAERLRAGKAVGVFPEGRTRGGHEVGPFHARIFQAAVETGVPVQPVALVYGVKGDAQTIVAFGPGESFAANFLRLLGEPARHTEVHFLEPIGTQDLEGRRRIAETSRARIVAAMSTQ >LR134301.1|VEE53575.1|3434304_3434739_-|ComA-operon-protein-2 MTQVFREAVSIEQLNALSRNTAIESLGIVFSAVGEDWLQATMPVDERTRQPYGILHGGASVVLAETLGSSAGNLCVDPAKQICVGLEINANHVRAVRSGTVTGTARALHVGRSTQLWEIRIEDEQGRLVCISRLTLAVVAAGHG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
LR134301_5 | 4267685-4267762 | Orphan |
NA
Consensus repeat of LR134301_5
|
1 spacers
spacers of LR134301_5
>5.1|4267709|30|LR134301|CRISPRCasFinder TGGCACAACTCCCACCGACGCCATGACCTT |
CRISPR arrays and Neighbor proteins around LR134301_5
The CRISPR arrays of LR134301_5 >merge|LR134301|5|4267685-4267762|CRISPRCasFinder GCTCTGGTGGGTGCCGACCTTGGTTGGCACAACTCCCACCGACGCCATGACCTTGCTCTGGTAGGTGCCGACCTTGGT >LR134301|5|5|4267685-4267762|CRISPRCasFinder GCTCTGGTGGGTGCCGACCTTGGT TGGCACAACTCCCACCGACGCCATGACCTT GCTCTGGTAGGTGCCGACCTTGGT
>LR134301.1|VEE54297.1|4264633_4267660_+|autotransporter-protein MSRAVPRATRMRRLTASVLQALAVPSVMLLSAGAAWAGCDNATPTAGQTVTCDANAPNPQTVPITANGPAGITVNVASGAQLQQSGGASAISLVGAGGHLLSNLGTISSVGGVAVQLGGGSRVENAGSISTGNNTALQFVGAGDSVLVNRGTISGRSGVQFGAGNDRLEMQAGSISGGVLQGDGNDVLLLGNGTIDSVDQGSGDDQMTVTGGTVTGVVAQGSGRDDFVMSGGMIGALQQGDNIDTFRMSGGRIIGAFEDGDQAWMTAGRIGRVNMKLDKNLWDQSGGTVDGNVVTGFDTDTIIISGTAYIGGNISVSGGNDSVTITDGTVRGQVLLSTGNDTFNWNGGGIVYGAIDLGPDDDVANLSNLNQGNLGAVPLFDGGAGNDRLSFNNVKTTGVGRFQNWETISLANSTELSFDGDLVLGDSATGTGTLTVDDTSTVYAGSGGHAIRPFNSGALVEMANAGRIDLTGTGAGDVFTVRGNYRGDGGGLYLRTVLGADNSTSDRLVIDGGAATGTTGIGILNAGGSGAATLADGILVVQALNGGRTAPGAFSLFAPVAAGAYEYFLFKGGVSAGTGENWYLRSTLVSGPTPAPNGGGTAVPPPLTPPVAPPPPITPAPPPPPEGATDPDLTAGETAPPPPPAEPAPVAPPSDPAVPDVPVAGGALPGTGTPPTPGARPAEGAVVPLYRVETATYAVVPPLLRETSLASLGTFHERQGEQRLLYNQGTFRTAWGRLVGQSSEIHWKGDAQPGFDGDVMGLQAGLDVWAAASDNHRNQIGVFVGRTRAQGKTTGLALGWENVQVGQNRLDDKHVGLYWTFTGSSGGYIDAVAMQSRYDGRVRSSRGLGFGLSGDGTSVSVEAGKPLLRFGQSAWWLEPQVQVIWQRTSLDDRRDEVSSVRFDNDNAWTGRIGLRLAGDYQLADNGWQPYFKLNYWHGRSGEDRIRFDNDVIVNSQRSRALEAGVGVVGRFNRTISAYAVADYTRELGGDRNEKRRIIEGNIGLRADW >LR134301.1|VEE54296.1|4263291_4264359_+|transmembrane-protein MSSTQPRRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMAFSVAPRARDAAARPALARGVLERALRILLAGALLHLLIWWALHTHHFRIWGVLQRIAVCAALVGVLAVYARPRVQAGALVTLLVGYTALLLGIGDLAPWTNPASRLDTTLFAPWIYQWHADTGLGHDPEGLLSTLGALASTVLGLLAGGLLRSGRTAALAGLGTVTAVLGLLLAVVLPLNKQLWTPSYVLWTGGLAALALWLGHVLIDQKGWPALGRRFGVNAITAYLGASVMSVALMATGAWGWIWQQLAAVMPQALELASMLQALAFVALWWGVAWWLDRRKIYLKI >LR134301.1|VEE54295.1|4262143_4263292_+|N-acetylglucosamine-6-phosphate-deacetylase MATVLRNARILAGDEFRDDLAVVIEEGRISALLPDAAPQLGHAAEQVDLGGGWLLPGFIDVQVNGGGGALFNNTPDVAALRTIAQAHRRFGTTAMLPTLISDDVGVMREAIAAMREAIAQGVPGVIGIHLEGPYIAPARKGTHDASKFRVPDEAEIALAASLDNGVTLLTLAPERVPLETIRALVERGVIVAAGHTAGTYEEIRAGLDAGVRGFTHLYNAMSPLQGREPGAVGAALEDRDSWIGIIVDGVHVHPASLRVALAAKPRGRLLLVTDAMPPVGADDPSYVLYGETITAIDGVVRNAAGSLAGSALDMATAVRNTVQLLGQPLAEAARMASTYPAQFLNVDDRLGHIAEGYQADLVLLDDALQVRGTWIAGQYEAA >LR134301.1|VEE54294.1|4261115_4262144_+|glucosamine-6-phosphate-deaminase-[isomerizing],alternative MSLSDPTSTLMFAEAAEAADVVARQFSRNHATMETLAASLRAAPPPFVVTCARGSSDHAATYGKYLLETQLGLVVASASPSVGSVYAAPLQLRGALFIVISQSGKSPDLLRNAEAAKAAGARVIALVNVEDSPLAQLADTVIPLHAGAEKSVAATKSYLASLAALLQLAAYWKQDSSLRAALDLLPDAMREAWQCDWTPVTEGLVEATNLFVLGRGLGLGAAQEAALKFKETCSLHAEAYSSAEVKHGPMALVDRGFPVLAFAQPDETGAGTRAVVEEFTSRGAQVWMAGAGGNLPVAATPHPLCAPLLTVQSFYRAINALALRRGFNPDLPPHLNKVTETV >LR134301.1|VEE54293.1|4260040_4261108_+|LacI-family-transcriptional-regulator MRRATIKDVAEKAKVSLKTVSRVINNEPSVMQATRARVLRAIAELDYEPDPSARNLRSGTTFVIGLVYDNPNPYHIIGVQNGVLAACRETGFGLQIHPCDSSSPLLADELADWVQRSRLAGLVLTAPMSERRDLIQALTARGIKLVRIIAATEDPADGACVFVDDREAAYEITEHLIQLGHQRIGFLWGGSSHRSSGERYAGYEAALKDYGMTVDKHLVVQGDYTFDDGFRGARRLLALREPPTAIFGSNDEIAAGVLAAAKSAGMNVPYDLSIAGFEDSPFSRQSWPPLTTAKQATEDIARHAARLLIAQLRSDAYDDAPAPLHNQGFVPQLVVRGSTAPMRPAGARPLPSDPA >LR134301.1|VEE54292.1|4258711_4260001_+|N-acetyl-D-glucosamine-permease MSAVPAASARPNVATSIAIVGVLFFLIGFFTWLNGPLITFVKLAFELSEVGAFLVLMVFYLSYFFLALPASWILRRTGMKKGLSLSLLVMAGGAALFGEFATQRWYPGALGGLFVIGSGLALLQTAINPYISILGPIETAARRIALMGICNKIAGILAPIVIGTVVLHGIGDLSATVAVADEATKAQLLNEFAAKIHAPYLAMAGLLVLLAVGVLFSPLPEIKSSEANATPVAAGAAERRSIFQFPHLWLGVLCLFVYVGVEVMAGDAIGTYGHGFDLPLDQTKMFTAFTLFAMLIGYVVGLLVIPNVVSQSRYLTFSAVLGVVFCLGAWATHGYVSVAFVALLGFANAMMWPAIFPLAIRGLGRFTETGSALLVMGIAGGAIIPQLFAVLKQHIDFQLVFVLLMVPCYLYILFYSVVGHRAGLPQDKV >LR134301.1|VEE54291.1|4257316_4258336_-|N-acetylglucosamine-kinase MTASHPAPAHVLSRIAPSFLAADVGGTHVRVARVQASGDAAHPVQVLEYRKYRNADHGGLSAILSDFLGEGPRPTHCVVASAGYAREDGTVITANLPWPLSARQVEADVGLQRVYIVNDFEAVAYAAAQVDASGVLHLCGPDTAARGPTLVVGPGTGLGAALWIPTAHGPVVLPTEAGQPTLAASTELEMAIVRHMQRDRAHVSIEHAISGPGLMNLYRAVCALQGQAPTLASPDAVTAAAMADTDASARQALDVFCGLLGSTIGDMALFYGAHGGVYLAGGILPQIREYLHASTFVERYLQKGPMGEALARIPVKVVEHGQLGVVGAASWYLQQQSAA >LR134301.1|VEE54290.1|4254476_4257140_-|N-acetylglucosamine-regulated-TonB-dependent-outer-membrane-receptor MNTRKTLLSAAIVSCIAFSAHAQQAAQTATDLDTVTVTGIRGSMEKSLDTKREANARVEVVTAEDVGKLPAHNVADTLQRLPGVNISSSSADEGGFDEADRVSLRGTSPSLTQTLINGHTVGSADWFVLSQGNNVGRSVSYSLLPSELVSSVEVNKSSQAKLQDGGTTGTVNIITRKPLEFSKQFTAEGSIGMVRSDQAKSNDPQYSALFNYKNDEGTFGVMVQGFSQKRELRREAQEIPGGFFKIGAGDPVAKTNPDLVGVNVPGLLGSTLFEQTRERKGGLVSLQFKPSDNLTLGLNGFSSELKANNYNRNFMMFGNSFAKSQAPDPGYVVKDGVLTNANYKGVPGTDYAVYDMIYRESKAKSSYVTFDADWQINDSLTAKFQAGSTKGTGETPRQYIAEVTLARGGGASWATHGNGSPIDWNVGGDISPNGVTSFGTWGNQQVTAEDKEKWATLDFNQYFNDGGVLSSIDFGLRFADHKREALSPEGATPGDIWSALKNGATANYPSGFAGDIGGTFPRNLWYFTPGALKDAVTNNSTWLAGNDGPTGRHNYGAEWKVKEKNFAGYVQANFRGDWWSGNVGLRYVNIKQDIDTYNAVSKAADADVSSLFGMWERLAFQNKRNRVLPSANIKFDLDDSLVLRVAASQTQTLPDYSALGASSYGSDLNRTGGGGNPNLKPTLSTNLDANLEWYFMPRGLLSVGAYHMNLKDYIAFDVVSRQLYSELTNQLETYQISTPINADGKVTGVEVAYEQPIGEYFGINANYTYANGTTSHTWSDGSHNLLGTSKNTYNVGAYFENERFGARVSYTRRSSFLISLSGTNPYYQDDFGTLSASLSYKATDWLSISLDGLNLNNPTYKYYQTAAIPTSFYSNGRQYYLNFRFKY >LR134301.1|VEE54289.1|4252007_4254365_-|beta-hexosaminidase MVKPSRARMRSAVLLGSLLALLPALPALAADPTPAAEAPGPQLRAGSLMLIPAPATVQPGQGSGITVRADTVLHAEGEAAQRVATQFADLLARSGGPHLALAKGKIAAHSGGIRFQIVPTFRDSGEGYTLESTAQGVLVQAGNETGLFYGATTLAQLATAGSNGVLPAVQIQDAPRFSWRGFMLDSARHFQSLDEIKRVLDAMAAHKLNTFHWHLTDDQGWRMEIKRYPKLTEVGSCRLPAGDGGTDPVSGKEHPYCGFYTQDQIREVIAYAAKLHIQVIPEIDVPGHATAAIAAYPELGSINTPLKPISEWGVFPNLFNVEDSTVTFLENVLEEVIALFPAKYVHVGGDEAVKDQWKASRQVQQRMRALGIKDEMAMQSHIIKRLETFLEEHDRRLIGWDEILEGGLPPQATVMSWQGTEGGLAAASAGHDVIMSPVGYLYLDYLQTASPNEPPGRPTQVNLGKLYNFEPVPAELAADKRGHILGLQANMFTEHTRSYARLQHNLFPRLAAVAETGWSTPEHRDFRDFLARLPAQLQRYRAWGLAYAQTPFEVGVDYTDDRAKNTVTVSLANPLGYEVRYSTDGQPVSAQSPLYQQPLTATLPAMVQAGAFYQGQMLAAKPTVAAFSAQSLLSRRSDELLSCVGKKGLVLRLEDDGPREGTRAVFNVDIFQPCWRWPQAQLDGVGSVEVRAGRIPYYFQLAHDEPKRRFEKARRAHGEMQVRRGDCSGKVLAEAALPAKPDADGFVTLRAALPKGTQGTADLCINFTGDTRPAMWVLDEVTLGK >LR134301.1|VEE54288.1|4251515_4251890_+|VanZ-family-protein MIKPLRRPRLWATLWATAVLLVIVVCLIPPPPIPLPENSDKGEHFLAYFILAGSAVQLFRRGRPLLWVGVGLVLMGIGIEFAQGALTSNRTADPMDAIANTIGVLAGMATALTPLRDLLLRWRG >LR134301.1|VEE54298.1|4267858_4268290_-|Uncharacterised-protein MNKPFPTYPRLQALLGAALPGLQLSNAVAEALEDALTEANEQAPPSAFFARLRGITHSHAADGQAWRERQLSDVRGRELAEATRCLAALSACGGVLLAAQSAREMDDAAAQCPPQVEEGLLHAVMVLADHAGALVEPDTHAAV >LR134301.1|VEE54299.1|4268701_4269745_+|nisin-resistance-protein MQVRKGVRHIGALVIALVAANAMADTPKAIEVPSPEAEAEILNLLERQALYRDRVDWPGTRTRLQSVQGDPVKRLALLREAIALSTGNHGVWTTTQRQSESLARAQQVGAVAVERAKAADAVDARIGWLVIEGYASTPGATLQEAFRQNIQRAARWQQVIRSKDDGMRCGWVVDLRDNSGGNMWPMLLGMAPLLRTSVVNNEDVGSFETAQGPERWTLTATAVQRAGKSVLDFGQSGYVLRQPGAPVAVLFGPRTGSSGEASALAWRGRAQTRSFGQPTAGVSTGNVVHTLADGSRLLLTTTVMRDRNDRGDGLKIEPDQRIEGDAATLAAAQAWLLAQPACQGTRS >LR134301.1|VEE54300.1|4269741_4270080_+|Uncharacterised-protein MILIGQQERVLIVGPTEAHHCLRCQTETEFAPQLRYRMARIDLLFGFTYQRRYELACSRCGHGWVLDTETMDQQLGGVPIPWRHRFGLPLMLVVVAGLAALGWLWRHGYIVH >LR134301.1|VEE54301.1|4270129_4270432_+|mRNA-interferase-HigB MKVVALAALKRFWERHPDSEMALRSWYDEVRHAVWATPHDVRQRYASASFVANSRVVFNIKGNSYRLIVAVGYRFQVVYIKFIGTHAEYDRIDADTVELL >LR134301.1|VEE54302.1|4270428_4270788_+|Antitoxin-HigA MNIQPIRTESDYENALREISAYVDNEPEPGSEEGDRFEILVTLVEAYEAKHYPIEPPDPIDAIRFRMEQGGLTVKDLVPSIGQLNRVYEVLNRKRGLTLEMIRKLHRNLGIPAESLIGR >LR134301.1|VEE54303.1|4270875_4271310_+|Uncharacterised-protein MDHDIPFEPLNDLEVRLLQAQDGTLTAAQFLDGLLTSTAFVLLDKAIGEDGAWDESISPLVLTSESGEPMFAVFTAPDRAGLWHEQLPQFAHAMPIAVHALLAGIGDGVGLVLNPGLDVGMEMIPDAVAQLKQRAAAITRGMAH >LR134301.1|VEE54304.1|4271657_4272878_-|tRNA-nucleotidyltransferase MKIYLVGGAVRDRLLQRPAGDRDWVVVGATPAQMEAQGYTAVGRDFPVFLHPKTGEEYALARTERKSGRGYRGFVVDADPAVTLEEDLQRRDFTINAIACDEDSGALVDPYGGVRDIEQRVLRHVGPAFVEDPLRVLRAARFMARFASLGFTVAEETMALMREVAASGELDALVPERVWQELRKALVSERPSAFLRTLHDAQALGPILPELEALYGVPQRAEFHPEVDTGIHQEMVSDMAAKLAPGDDLVGFAALTHDLGKGLTPPEEWPRHIMHEQRGIKPLKALCARLKIPTEHQQLAEAVCREHLNVHRIDELRDATVLELLGRCDALRRPERVARIALCCEADKRGRLGFEDADYPQGETLKRLHQAALSVQARDLDTTHLKGPAIGEALAKARVKAIAAAR >LR134301.1|VEE54305.1|4272950_4273187_-|Uncharacterised-protein MSLLAWIGIFAAWSLLATWVLRWGGAAWMEGWKSLAFVDSWGSLWDEAQIKLYFLCLWIVYGLWFLAGLFVPEWRGLP >LR134301.1|VEE54306.1|4273253_4275227_-|lytic-murein-transglycosylase MTRRTSTALSLLPLATTLLIGCANAQSLDAQNAQLKAAIAAAERGQFDPGQAAALSRHPAYGWLEYANLRRNIDTVDSAQAQAFLKRYDGQAVANTFRSVWLPSVARRQDWPTLLANWVPTDNAGLRCAQLTARQVTGKVDPQWIGEAQDLWRKNGKSLPDGCDAVFAVLQAQGGLSDALRWERIDAAADAQQPAVMRSAARGLPATDLALANNYAAFVDKPNASALNWPRNERSRRIATDGLAKLAKADPGATEQQLPQYAQALGLSAEQQGQVLYQIALWTVASYLPDSARRLNAVPESAYDERLHEWRVREAMSRGDWPAALTAIRKMASKQRSDPRWRYFEGRMLEKTGQAQQAQPLFRDAARAPTFHGFLAADKLQQGYTLCPWKPNDSAQAQAVIARDPAIQRAMALYQIDRAGWAVAEWNSALSRFDDTQRRLAVRVAQDNGWFDRAVFALGKQPQEQRLYDLRFPLHHDATIRRESARNAIDPAWVAAEIRAESTFTPRARSPANAMGLMQVLPATGAGVAKSIGLTGYGGADSLYDPDTNIAIGTAYLRQLMNKYDGLPYVTIAAYNAGPTPTARWQGQRPGFDPDLWIETISYKETREYVARVLAFSVIYDWRLNGDALPLSDRLMGRLVDKRKSFSCAANADQGGD >LR134301.1|VEE54307.1|4275390_4275798_+|biopolymer-transport-protein-ExbD/TolR MAFSSAGRSGPLADINVTPLVDVMLVLLIIFIVTAPIVARPIAVQLPQATDRVVDRPEPPPPIELRLDASNQLSWDGQPMAIGDLQARLQAQAGEHAGNLPELRIATDPSAEYEGMARILAAAEATGMERIAFVQ |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
LR134301_5 | 5.1|4267709|30|LR134301|CRISPRCasFinder | 4267709-4267738 | 30 | NZ_CP018064 | Rhodococcus sp. 2G plasmid p1, complete sequence | 160126-160155 | 9 | 0.7 |
LR134301_5 | 5.1|4267709|30|LR134301|CRISPRCasFinder | 4267709-4267738 | 30 | NZ_CP038031 | Rhodococcus ruber strain R1 plasmid unnamed1, complete sequence | 76075-76104 | 9 | 0.7 |
LR134301_5 | 5.1|4267709|30|LR134301|CRISPRCasFinder | 4267709-4267738 | 30 | NZ_CP040720 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed1, complete sequence | 8378-8407 | 9 | 0.7 |
LR134301_5 | 5.1|4267709|30|LR134301|CRISPRCasFinder | 4267709-4267738 | 30 | NZ_CP016821 | Rhodococcus sp. p52 plasmid pDF01, complete sequence | 62305-62334 | 9 | 0.7 |
1. spacer 5.1|4267709|30|LR134301|CRISPRCasFinder matches to NZ_CP018064 (Rhodococcus sp. 2G plasmid p1, complete sequence) position: , mismatch: 9, identity: 0.7
tggcacaactcccaccgacgccatgacctt CRISPR spacer cggcaccactcccaccgaggccatcctgcg Protospacer .***** *********** ***** . .
2. spacer 5.1|4267709|30|LR134301|CRISPRCasFinder matches to NZ_CP038031 (Rhodococcus ruber strain R1 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.7
tggcacaactcccaccgacgccatgacctt CRISPR spacer cggcaccactcccaccgaggccatcctgcg Protospacer .***** *********** ***** . .
3. spacer 5.1|4267709|30|LR134301|CRISPRCasFinder matches to NZ_CP040720 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.7
tggcacaactcccaccgacgccatgacctt CRISPR spacer cggcaccactcccaccgaggccatcctgcg Protospacer .***** *********** ***** . .
4. spacer 5.1|4267709|30|LR134301|CRISPRCasFinder matches to NZ_CP016821 (Rhodococcus sp. p52 plasmid pDF01, complete sequence) position: , mismatch: 9, identity: 0.7
tggcacaactcccaccgacgccatgacctt CRISPR spacer cggcaccactcccaccgaggccatcctgcg Protospacer .***** *********** ***** . .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1222032 : 1229594
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >LR134301|1222032:1229594|DBSCAN-SWA CATGACCGAGCCCCTTCTTGCCTTTGCGCTGCTCGGCGCCATTTCGGCCATCTCCATCGGTGCCGCTCGCATCGTTTCGTGGTACCTCGACCGTCGCGGGGAGCCTGCCCGTCGTAGCGCACGCGAAGCGGCCTTTGAAGCCGAGGCACGCGCCGAACTGGCCGCCACAGGCTGGACCGCTGAGGAAGAAGCCGCGTTCCAGGCAGTGCGATCGCAGCAAGGCAAGCATTTGCGCGCACTTCGCCCGGAATTCCGGCAACTGATTGAGGAGACTCACCGTGCGTAAGTTCGATTGGTGGGGTCTTCCGTTCGTCCTCTTCATGGTCTGCGGGGCGGCGCTTGGCTTCTGGATTGGCACCGCGTTCGCTGCTGACTGGAACGCAGTCGCTGCCGAGCGCGGCGAAATTCGCAATGCCTGCATCGCCGGTAACGACCGCGCCTGCCGCATGTACGAGGTCGAATATGGCCGCTGATCAGGTCTGCGAGTTCTTCCGCGATCCCTTTGTTGTCGCCCTCATCGGCGGTGCGCTGCTTACCGGTCTGTATTGGTCGCTGGTGTTCGTACTGCGTGGAAAGGGGGCAGGCAATGGCCGTTGATCGCGCTCGTTTCAGGATGGCTGTAGAGGGCGGGGTAGGGGGCTTTTCCCCGCTTTCGCCCGGTGAAAAGGGGCAGCGGGCGGCGGCGGAGATTGGCCCGGGGAGTAACACGGGCCAAAAGGGTCAGCAGGACGCAATTATCGACTACCTGACCATTGTGGTCCCGCTCTCTGTCCTTGAGGAAGTGAACTGCAAGAAGCTCGACCTCTTGCTGTTCCGCATCTTCGGTTTCCGTGGCGAAGTTGTTGCCGGTGCGATTCGTGAGAAGAACTGGAACTTCTACGAGCAGTCGGCGGTGCTGATCGACCGGGAAAACGAGGTGGTTGGTCGTGTTGGCATCGGTGGCAAGAAAAGCACCGTATGCCTGAGCCTCACCGGGATGGGTTGCAAATGGATTCGTGACTGGGCGCGCGTCTACAAGCAGTGCTCCATGCTCGACGCCAAGATTACCCGTGTTGACTGCGCGCACGACGACTACGAAGGCGAACGCCTGGACGTGCATGCGCTCCGCGAGGTTGCCGCGCAAGGTGGCTTCACCGAAGGCGGCTGCCCGCCGCGTCATCGCTTCATCTCCGATGAAGGCCACAACACCGGCTGCACGCTGTACGTCGGCGGCAAAGGCCACAAGGAACTGTGCGTGTACGAGAAGGGCAAGGCCGAGGGCCTGCCGTCCTCGCGCTGGGTGCGCGCCGAAGTCCGCCTGTACGGCAAGCACATGGAAATCCCGCTGGATGTGCTGTTGAACCCGGGCGCGTACCTGCGCGGCTCGTACAGCGCATTGCATGACCTCATCAAGGGCGTGTGCACGCGACTGCGCACGATCCGCAAGCATGTCGAAGTATCTGCCGAGGCGATGGTGCTCTGGATGGAGCGTCAGGTAGGCCCGGCCCTCAGTGTTCTGCGCGGAGCGTTCGGAGATTCATGGTCCGACTTCTGCGAGGCCCGCATCGTCCGTGACGGTCACCCCGGACGTTTTCGCGGTATTGCCAAGGGTGACGCACTCCATCGTTTCGTGAGGGAAGAACTATGCCCATCTGCCGCGTGAAGTCCGCTGCCGTCGAAGAGCGGCACAACAGCAAGACCAACACCATCAATCGTTCGCAGACCGCTGGCCTCGACCTGGGCAACGGCTTCGAACTGCCGTTCCGCGTCGGCCTCGGCCAGCGTCCGCCGTACCCGGCTGGCGAGTACGACATTGATCCGCAGTCCTTCGGACTGAGCGACTACGGTGATCTGGTGCTGAAGCGCTACGTGGACTTGATCCCCATCGGGTTCAAGGCTGCGCCGGCCGCATCGAAGACCTAAGTCATGAGCCTCTGCGTTGCTTTAGGGGAGAACGGAACGCTGATCCCAACCGGTCAGCCCGTCGATCAGTGCACGGGGTATGTGCTGATGAGCAGCGCAGAGGCTTCCTCTGTCGCCATGTTCGCCGAGGCGTTCAAGGTGCCGGACAAAGACGTACTCGCAGGATGGGCATCGGGGCCGTTCATTCTGATCATGACCTTGTACTTGGCTGCGCACATCGGTGGCCGTGTTGCAGCTGTGTTCGACAAATCGTAGGCCGCCATCAACTCAATCAATGAAAGGGGATTTACATGGATTTCGAATCGATTCTGACCGGCCTGTCGGTCGCTGCCGCGCTGACCGCCATCGGTGGTGGTCTGGCCCTGATCGCCGTGGTCGGCTTCTCCCTGTGGGGTGGCCGCAAGGTGGCGGGTCTGTTCGGTAAGTCGTAAGCCGAGCAGTGATGGGGTAGGGGAGGCCATGCCTCCCCTTTCTATTTCAGGGGAGCGATATGGACTATCAGATGATCATCGGCGGCCTGCAGGTCGGCATGGTGGCTCTTGCAGTGTTGGGCGGCTGCGCCGTCATTGCGCAGTTGAAGTTCGGCCTGTGGGCGGGCCCGAAGGTGGCGCGACTGTTCCTAATGCGGGGCGGCAAATGATCCTTTGCCTATTCGCCGGATTCATCAGCGGGCTGTGCGGCATCGCTGTCGTGATGGGGATTCGCGGGTGATTCGTCGCCTCAGCCATTTCGGAGTTGCCGCATGCCTCCTGCTTGCGCCCGGCATCGCCCAGGCGGCGGCGTTTCAGGATCAAGGTGCGGCGTATACGAAGTGCATGGCCGATATCGCGACTGGGCCGAGCTACTGGCAACCCATGCGCTGCAATGAGTCCCCCAGCAACGACGGCTCCGGCTGGTACGACGCATGGGATCGTACCGGCCGCAGCGTGGGCTGGGGCCTCTACAGCTGGCCTAAGGCGACCAGCTGCAAGACGCGGGGAGATGAAATGGGCTGGCAGGGTGGCGAGACGGCTGCAACCGTCAACGTCTGTCACACCGGCTGCATGTACAGCAGCAGCCTTGATCCATCGAGCCCGTCGGGCTTCACGTATTACCCAACCGGTGGCTCTTGCACTGAAAGCGACGCCCCTGCGCCGACGCCCGCCGGTGATGGTGGCGATCCCGGTGGTGGTGATGGCGGCGGTACTGACCCGGGAGGTGGTGACGGTGGTGGCGACAACGGGGGCGGGGATGGCGGAGGCGGTACCGATCCCGGTGGCGGTGATGGTGGCGGCGGTACCGGGCCGGGGGATGGCGATGGTGGTGGCGACGACGGGGGAGGGGATGGTGGCGGTACTGGTCCCGGTCCGGGCCCAGGTCCCGGCGAAGGTGATGGCGATGGTCCGGGGCAACCCGGGGGCGACGGTGATGCGCTATATGAGTCGGAAGGCAAGACCGTCGAGAAGCTCTACGACGACTTCGCCGAGCGCGTCAGCAAGGCCCCGATCATTGACGCTACCAAGAGCTTTTTTGAGATCAGCGTCAGCGCCTCTTGCCCGATCTTCACTTTCCCGGCGACGGCATATTGGGACGCCATGACGTTCGACTTCCTGTGCAAGCCTGAGATCGTCGCCATCCTTCAGCTGCTGGGCTGGCTGCTGCTCGCGTTCGCCGCCTTCCACGCAATCAAGATCGCGCTCACATGATCAACCTAATCGCATTCGTGACGCTGCAAGCCGGGTGGCTCAACGATCTGACCGAGTACATCCGCAGGCAGGTGGAACGCCTGTGGACGGCCATCGTTGAGTTCTTCCGTGATCTTGTGCTGTACGCGATCGAACAAGTCCTGGACTTAGCCGCACATGCGCTGGAAAAGCTGCCCGTGCCTGAATTCATGACCGAGTACAAGCTTGGCACCCTGTTCGCAAACGCAGGGCCGACCATCGCGTGGTTCGTCAACATCTTCAAGATTCCCGAGTGCATGACCGTGGTATCGCTCGGGATCGTGTTCTTCATCACCCGTAAAATTCTGACCTTGGGGAAGTGGTGACATGCTAGTTTTCAACGAGGGCGTACCGCGCGCAGGCAAGAGCTACGATGCGGTCAAGAATCACATCCTGCCCACGCTAAAGAAGGGACGCCGGGTGTTCGCGCGCCTCAATGGCTTGCACCATGAGCGCATCGCCGAATACTTGAACATGCCCGTGGATGAGGTCCACAAGCTGCTGACGCTGGTAGAAACCAAGGACGTTGCAACGACGTTCGTCTGCTCCAGGCATCCGGAGACCGGTCAGTGGCGCATCCCCGATGAGTTCAAAGATGCACTGGTGGTGATTGATGAGGTGCACGAGTTCTACGTTGCACAGCGCAACCAGTTGCCGGAAGAAGTGGAGAACTTCTTTGCGCTGATCGGCCAGAACGGCGGCGACGTGCTGATCATGACGCAGTGGATCAACCGCGTGCATCAGGCGGTGAGGGCGCGTATCGAGCGCAAGAACGTCTTCCAGAAGCTCACCGCCGTAGGCCTGAAATCGCGCTACCGGGTCACGTATTACCACACCACCAGCCCGGGAAAATTCGAGGTCGTCGGTGGCAAGACGCTGAAGTACGATCCGGCCATCTATCCGCTGTATCACGGCTATGCGGTCGGCGCAGAGAATGCAGAGGTGTACGAGGAAGGCGGTACCAACATCTGGAAGCAGCTTGCGCCCAAGATCGCCATTGCTGCCGTGGGCCTTGTGGTTGGCATCTGGGCGTTCGGCGGCTACTTCATCAGGATGATGGGCGACGATGAGCCGGAAGCTGCTGTAGAAGCCCCGGCAGGCGCCAAGGCAACGCAGGGGCAGGGCGCCCATAAGCCCATCAGCACCGGTGCTGTGCCGGCCGCGGCCGTTGCAGTGCCCGCAGCCGATCCACTGGCCGGGATGACCGTCGAGCAGCGCTATGTGGCCGCCATGACGCAGGCGAATCGGATTCGGTTGGCCTTTACCGCCCAGTTCGGGGAGCGCTCGGTGGGAATGGTCGAATGGGTCGACGGCTCAGGGAACACCGTCGATCAGTTGACCTTTGATGCCCTCATTGCGATGGGCTACCGGCTGAGGGTTGCCGTATACGGGGTGCGCCTGACGGCGGGTTCGTTTGAAACGGTCGCAACAGCGTGGCCGAGGGAAGCACCCCGGCGCGAGGAAGAGCCAACGTTGTACCGCCTGGACAGTGACCGTGCTGCCGCTGATTCTGCGAGCGTAGCGAGTGGAAGCGGCGGCGGCGCGGGCGCGGTCATGGCGGCAAGAAGCGGGGGCACCATCGTGCGCGTAGGTGAGCGCCCCATGGGCACGTTCCCGGAATCCAAGCCCTACCCGCCGAGCTTCTGACGTGATGCGTCACGTTTATCGAGTACCATCCTCTATAGACAAGGGGGGAATATGGATATTCGACTTGGGGCGCTGTGCGTACTGTTGGCCGTTGCCACCTCAGCTTCAGCGCAGCAGATACATTCGGCGAGAGGCCCCGCGCCAAAGCCCATCCCAGCAGCCCCGAAGGCCGCGCACAATTCGATGGCAAAGACCACGACGCCATTCAACTGCGAGCAGTATCGTTGGCCGAATCATCCTCACCCAGGCATGAAGCTGTACTGCGACGGTATTGAGGCAAGCACTCTCCAGGACGAAGCACGTCGCGCCGGGCGCCCCGGGCCATCCGGTAGCGTCGTGTCGCTGCCTTCACTCGGGAGCGATGCCGCCAAGCGCTCGGGATTCGCCTGCATCGGCGGCCAAGCGTTTCGGAAGCTCCGAAATGGTTGGGAACAAGTCTCCGCGCCTGCCGGTGGCTGGCAACGTTGCCGCGAGCAGTGATTTCGGGGTGTAGGGGCAGCGCCCCTACGGAAGCGCCTCACACGCGCTGGCGGGGCCTTGGCCCACGACTCAGATAGACCACATTGGAGGGCTCGGCGTCGGAACCCGGTCCACATCCGGCCAGCCGCCGTTCTCGGCGAATTCTGAGGACTTCGGCAAGGTAGATCACGCCGGATTTCGCCGTCGCGGATCCCGTGGAGGCCGGAGCCGATCGTTCGGCCAAGTCTGTGCGAGCCTCGGCCATCATCAGCCGCCATTCCCGCGCAATGTTGCAGGTCAGAGACCACCAGGCCATATCGCAGGGTTCCAGCTGGTGACCTTCAGGGGTGAACATATGCCCAGCTTGGAAGCCGAAACCGGCCCAAGGGCCGGTTAGGTCAGTTCGGTCATACGGATCGATCTCGATCATGCTGCAAGCTCATCCTTGTCGGGGGAGCCAACAGGGAGGCAAGAGCCAAGCCAGAGGCGGAGCCATTGCCAAGCCGAGCCCACGAACCGGATCACGCCCGATACAGCATTTCGCATAATGTATATTATGTTACAAAGAGGTCTGCCTAGACGAACCCCAAGCAGTCCAGTCCTGGCCGTATGCTGCCCCCTTCAGTAACCCTCCCAACGCCCCCAGAACCAGGCTCCACGGCGCCAACCTTCTCACCCAGCATGCAGGCCCACCGGTGTAGGCTCGACGCACCACTTAAGGACGGGTTTGGCGATGCGCCTGGCTGATTTTATCGAACGGAATGCGCGGGAAATCCTGGAAGACGCGGTGGCATTCGCCGAAACGCAGGCGCCCGATACTGTCGAGTTCAGCGTAAAGCAACTTCGGAATCATCTTCCCCAGATTCTTCAGGCAGTTGTAGACGATCTCAGATCGCCGCAGACAGCCTCCCAGCAGATCGCAAAGTCCCACGGGCTTGCACCGTTGAAGCCCGGCCCCGAGTCCGCCGCGTCCTATCATGGCCGAACCCGCGCCATCGCCGGCTTTGGCCTCAACCAGATGGTGGCCGAGTATCGAGCACTCCGGGCTTCGGTACTTCGGCGATGGGCATCCGACCAGCAGCTGATCACTTCATCGATCGATGACATCCTTCGTTTCAATGAGGCCATCGATCAGGCGGTCGCCGAATCCCTTGCTCAGTTCTCTGCCGAGGTCGAGTCCTGGAGGCAGATCTTCCTGGCAGCGCTCGGGCATGATCTCCGAGGTCCTTTGGCAGCGGTCATGTTCTCGGCGGACACTCTGGCCAGTGGACTGCAGGATCCGGCATTGTCCAGGCAGGCCGAGCGGATCCTCAATGGCAGCATGCGCATGAACAAGCTGCTCGACGACCTGCTGGCTTACAGCCGCAGCAAGCTGGGCGATGGCATGGCCATTCATCCGGTGGACTGCGATCTTGCACAATCGCTTGGCGAAGAAGTCGAATTGCTTCGCGCTTCCCTGCCCCACGTCCCCATCAAGTATGAGGCTGAAGGCGATGCCCGCGGTTGCTTCGACGCCTCTTCGTTGCGCGAGGCGGTTCACAACCTCACGACCAACGCTGCAAAGTACGGCGAACACGGCACCGATGTGCGGATCAGCCTTGAGGGCCTCGTCGACCAGATCATGATCACGGTGAGCAACACCGGGACCGAACTATCGGACGAAGCGTTCAACAGTCTGTTCGATCCCTTGCGCAGAGGCTCGCACAACGCGTCACAAGGCGAACACGCGAGCTTGGGCCTCGGCCTGTTCCTGGTGCGGGAAATCTGCCACGCGCACCGCGGCACTGTCCATGGACGATGGCGGGATGGTCGCACTTCATTCGTCATCACGCTCCCAAAGAACGCTGATTGA
Protein sequences of DBSCAN-SWA_1 >LR134301|1222032:1229594|1224787_1225618_+|VEE51541.1|DBSCAN-SWA MRCNESPSNDGSGWYDAWDRTGRSVGWGLYSWPKATSCKTRGDEMGWQGGETAATVNVCHTGCMYSSSLDPSSPSGFTYYPTGGSCTESDAPAPTPAGDGGDPGGGDGGGTDPGGGDGGGDNGGGDGGGGTDPGGGDGGGGTGPGDGDGGGDDGGGDGGGTGPGPGPGPGEGDGDGPGQPGGDGDALYESEGKTVEKLYDDFAERVSKAPIIDATKSFFEISVSASCPIFTFPATAYWDAMTFDFLCKPEIVAILQLLGWLLLAFAAFHAIKIALT >LR134301|1222032:1229594|1224223_1224364_+|VEE51539.1|DBSCAN-SWA MDFESILTGLSVAAALTAIGGGLALIAVVGFSLWGGRKVAGLFGKS >LR134301|1222032:1229594|1222617_1223673_+|VEE51536.1|DBSCAN-SWA MAVDRARFRMAVEGGVGGFSPLSPGEKGQRAAAEIGPGSNTGQKGQQDAIIDYLTIVVPLSVLEEVNCKKLDLLLFRIFGFRGEVVAGAIREKNWNFYEQSAVLIDRENEVVGRVGIGGKKSTVCLSLTGMGCKWIRDWARVYKQCSMLDAKITRVDCAHDDYEGERLDVHALREVAAQGGFTEGGCPPRHRFISDEGHNTGCTLYVGGKGHKELCVYEKGKAEGLPSSRWVRAEVRLYGKHMEIPLDVLLNPGAYLRGSYSALHDLIKGVCTRLRTIRKHVEVSAEAMVLWMERQVGPALSVLRGAFGDSWSDFCEARIVRDGHPGRFRGIAKGDALHRFVREELCPSAA >LR134301|1222032:1229594|1224423_1224573_+|VEE51540.1|DBSCAN-SWA MDYQMIIGGLQVGMVALAVLGGCAVIAQLKFGLWAGPKVARLFLMRGGK >LR134301|1222032:1229594|1225614_1225962_+|VEE51542.1|DBSCAN-SWA MINLIAFVTLQAGWLNDLTEYIRRQVERLWTAIVEFFRDLVLYAIEQVLDLAAHALEKLPVPEFMTEYKLGTLFANAGPTIAWFVNIFKIPECMTVVSLGIVFFITRKILTLGKW >LR134301|1222032:1229594|1223936_1224188_+|VEE51538.1|DBSCAN-SWA MSLCVALGENGTLIPTGQPVDQCTGYVLMSSAEASSVAMFAEAFKVPDKDVLAGWASGPFILIMTLYLAAHIGGRVAAVFDKS >LR134301|1222032:1229594|1222493_1222628_+|VEE51535.1|DBSCAN-SWA MAADQVCEFFRDPFVVALIGGALLTGLYWSLVFVLRGKGAGNGR >LR134301|1222032:1229594|1222309_1222504_+|VEE51534.1|DBSCAN-SWA MRKFDWWGLPFVLFMVCGAALGFWIGTAFAADWNAVAAERGEIRNACIAGNDRACRMYEVEYGR >LR134301|1222032:1229594|1225963_1227283_+|VEE51543.1|DBSCAN-SWA MLVFNEGVPRAGKSYDAVKNHILPTLKKGRRVFARLNGLHHERIAEYLNMPVDEVHKLLTLVETKDVATTFVCSRHPETGQWRIPDEFKDALVVIDEVHEFYVAQRNQLPEEVENFFALIGQNGGDVLIMTQWINRVHQAVRARIERKNVFQKLTAVGLKSRYRVTYYHTTSPGKFEVVGGKTLKYDPAIYPLYHGYAVGAENAEVYEEGGTNIWKQLAPKIAIAAVGLVVGIWAFGGYFIRMMGDDEPEAAVEAPAGAKATQGQGAHKPISTGAVPAAAVAVPAADPLAGMTVEQRYVAAMTQANRIRLAFTAQFGERSVGMVEWVDGSGNTVDQLTFDALIAMGYRLRVAVYGVRLTAGSFETVATAWPREAPRREEEPTLYRLDSDRAAADSASVASGSGGGAGAVMAARSGGTIVRVGERPMGTFPESKPYPPSF >LR134301|1222032:1229594|1223654_1223933_+|VEE51537.1|DBSCAN-SWA MPICRVKSAAVEERHNSKTNTINRSQTAGLDLGNGFELPFRVGLGQRPPYPAGEYDIDPQSFGLSDYGDLVLKRYVDLIPIGFKAAPAASKT >LR134301|1222032:1229594|1228475_1229594_+|VEE51544.1|DBSCAN-SWA MRLADFIERNAREILEDAVAFAETQAPDTVEFSVKQLRNHLPQILQAVVDDLRSPQTASQQIAKSHGLAPLKPGPESAASYHGRTRAIAGFGLNQMVAEYRALRASVLRRWASDQQLITSSIDDILRFNEAIDQAVAESLAQFSAEVESWRQIFLAALGHDLRGPLAAVMFSADTLASGLQDPALSRQAERILNGSMRMNKLLDDLLAYSRSKLGDGMAIHPVDCDLAQSLGEEVELLRASLPHVPIKYEAEGDARGCFDASSLREAVHNLTTNAAKYGEHGTDVRISLEGLVDQIMITVSNTGTELSDEAFNSLFDPLRRGSHNASQGEHASLGLGLFLVREICHAHRGTVHGRWRDGRTSFVITLPKNAD >LR134301|1222032:1229594|1222032_1222317_+|VEE51533.1|DBSCAN-SWA MTEPLLAFALLGAISAISIGAARIVSWYLDRRGEPARRSAREAAFEAEARAELAATGWTAEEEAAFQAVRSQQGKHLRALRPEFRQLIEETHRA |
12 | Stenotrophomonas_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1861149 : 1874257
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >LR134301|1861149:1874257|DBSCAN-SWA ATCAACGGTAGTAGCGCGCTGTCTTCTCGTCCTCGTCGCGGTCGGCGGCGTCGCGCTGCTTGAACCACTTCTCAGTGAACGGCTCCGGCGGCGCCGGCTTGGCCTCGGGCGGGAGGCTGCCGTCAGCCATGAGGTCGATGCCACGGCCGAACAGGCTGCACACGTCGACCATATCGTCCCGGCGGCCGTCCTCGCCCGTGAAGGCGCACAGCTGGTCGATTAGCCTGTCGCCCCATTCGGTGTTCGGGATGTGCACTGACCCGGTGGCCGCGCGGGCAGCGAAGCCCAGAGCGCGGTCTGCCTTGCTGCCGGCACTGGCCAGCGGCACGCGGTGCACGAACGTCTGCGTGGCCTTGGCCGCTCGGCGGATGGCGCCGTCGGTGGTGCGCAGGATGACGCCCATCTCCTCGAACGCCATCACCGGCTTGTTGCGCCTGCCCATTTGCATCAGGGCCGCGATCCAAACGGACGGGTCTTCCTGGCCGCTCCACCAGTCCACGAACCACATGTCGCCGATGTGGTCGAGGCCAGCACAGCCATGCTCGGTCCAGTCGGGATCGGCCTCGGGGTCGTCCGGGTCCGGCGCGCCGGCGTAGTCGCTGGCCAGGTACTTGCGTAGGCCCTTCGGCTCATCGCCCAGGTTGAAGCGCTTGAACCAGTGCCGCTTGAACAGGATGCCGGCCTTGGCCCGGGGCTGGCCGCCGAAGATGTGGTCGTGCAGGTCCTGCGACACAGCCAACGTCTTCAGCCGCTCCGTCTCCATGGCGGGGTTCCACCACGGGTTATCCAGCCAGTTGATCTGGATGACGATCGCGTCCGGGTCATCGCCCAGCACCCAGCGCTTGTAGGCGTAGTCGTCCTGCTGGTCCGGGTTGAAGGTCACCCAGATCTCGGCGCCAGTGGTGCGGACAATGGTCGGGATCAGCTTGTTCCAGCTGTTGGCCGAGACGTTGGACGCCTCCTCCACCCATACGATGGTCGCTCCTTCGAACGACTTGATGCTGTCGGCGGTGTGGTCCTGCAGGCCCGTGAAGCTGAACGTAGACCCGGTCAGGATGCAGGTGATCTGGTCCTCGCCCTGCTTGTTGATCTTGAAGTAGGCCGACAGGCCCATCCGGTTTATGTAGTCCTCGATGACCCGCTTGGAGGACTGGGCGATCGACTTCTGGATCTCGCGCACGCACAGGATGCGGTGCTTGGCCTGCATCGACAACATCACCAGGATCTGCGCCACGGTGTGCGACTTCGCCGAGCCGCGGCCGCCGTACAGCACCTTGAACTGCTTCGGCTTCAGCACTGGCAGCAGCTTGGCCGGGATGTGCACCGGAGTGTGCGGTGCCAGTGGGTTCGGCTGGGCCGACACGCTCACTCGCCCTGCTTGGCGGGGACCACGCCCATGACGTAGAACGGCGGGGGGGCCGGCAGCTTCTCGCCATCAGCATCGGCCAGCTGGACCTTCTCGCCGTAGCGCTTGGCGTTCTCCTTGCCAGACTGCCACTTGATGGCGTCCATCATCACGCGGGCAGCGGCCGGGTCCAGCCGGCCGTCCTCGACCTTCTGCATGATCTCGTCCAGCCGCTCGAACCGGGCATCAGCCCGACTCGCGCGCGCGCGCATGTACTGCTGTCGGAAGGAGGCGGGATCATCCTCATCCCCACCAGCGTTGGAAGCCAGCCAGCGGAAGATGGTCCGGGCGTCAGGCATGCCCTCGGTCTCGCCGATCTTGGCGATGCTGTCCCCTTGGGCGATCAGGACGCACACCCGCTCGGCCAGGTCCTGCGTGTACTTGCTGGGGCGGCCGATAGGCTTGCCCGGCTTCTTGGCCGTGCTGGCAGTAGACTTACGCTGCGCTGCCATGGTTCTCCCACACATGGATGACACGCACAGGGACGCCGCGCCAGTAGATGCCATCAGCAGGCATCTCATCGAGGGCGTAGGGCACAGTCAGATGCGCCATCTCAAGGCACAGCTGCCTCCCCTGCAGGTTTGTGATGACCAGTTCCACCGGATATTCCCCTGTCCGCAGCATGGAATCTTCCAGCTGCCGGTCCAGTAGAGTGGCGACGGTATCCCAGTTCGTCTCCTGAATGCTCATGCGCGCTCCTTGAGTGCCTGTAGCCGCACCTGCCCCTGCCGGGTGATGCCGAAGCGCTCGCCCTGCTCCTGCGCGTAGCCGTGGCTCACCAGCGAGTCGAGCAGAGAGTCGCCGCCGCGGTGGTGGTCGCGCCACTCCTGCCGCGTCAGGCTGAACTGGCCAGCCAGGTGCTGCAGGCCCTGCGTGATCGGGTCCAGGCTCACGACAGGGCCTCCTCGGTCCCGCAGGGGAAAGTGGTCACGGGCCCGGCGTGGCGTCCGGTTCCGCCGCCGCAACGCGGCTCCCTGCTCCCTCGGTGCTGGCTGTTACCCACCTGCCAGCTGGGGCTACCGATACCTGCCTCGGTAGAGGGCCGCATGTACGCGGTCAAGCGGCGCGCCCAGACAGGTCGAACAGGTCGAGCTGCACAGGCATGGGCCGCTTCCGTGGCGCCGGCGTGGCGATGCCCAGATGCTCCAGCATGTCCTCCAGCACGTTCGCTGCGGCCTCGGCGGTCACACGGGGGAAGCTGTACTTGCCCACCAACCACGGCCAGTACGGAGACTTCTCGCCCTTGCGGGCCATCTCCACGGCCACCGGCTTGTCCTCGGCCAGGACGAACGCCTGCGAGGTCTCCGGGTTGATCAGTAGGAAGCTGGCTACCGTGCATCCGCGCTGGTTCTCTGCGATTCGTGGAAGGATGGCGTTCAGGGCATCTGCCGGGTTGGTCGGGTCAACCACGCAGACCACGCGCGGCTTCCAGACCTGCCGGAACGGAACACCCTCAGTTGTACCGGTGCGGCGCGGAGCTTCAACGGATGCGGCCATGTTTCGACCTCCCCTGCTGGGTGGGCTGACCTGTTTCATGCGCTGCCCTCAGTTGGGCGGTAGTAGGCGTTTCGAGCCTTGCGGTACTTCATGAGGGTGGCGCGGCCAGTTGCTTGGAGCAGGCCGATGCCAACCATGTTGCTGAGGCTGCGGGCGTACCTCGCTCTCTCGGCGGGGTCCTCGACTTGCATCCCATCGCAGATGTCAGCGGCGAAATGCCAGCCCGGATTGCCGTCCAGCCACAGTCGGATGCGCTTGGCTCGGCTCGGTTCGAAGGTGTGTCCGGGCCTCAGCGAATACAGCCGCTGCCCCCTCTGCCCGGTCGCCGTCAGGACGCCGTCGCGAGCCAAGCCGCAGACCGTGGCCGCCACGAGCGTCTTCTCCCTGCCGTGCGCGCGCATTCCCTGCAACACATCGGACAAGCGGTGCTCACCGCCGTGGCGCTCGAACCAAGCGCGGATTTCTCCAGTGCGGCTCATGCCTCGGGCCCTCCCGTGATCCGCACCACCACCTGGCCACCCTTGCGGACCTCGGTGCTGACCAGCGGGTGGCTGATGAACCGCTTGTCATCGATGCCGAGCGCGTCGGCGATGCCATCCCGGTACGGCTTGAACCGGGCGAGCATGTTGTCGTCGTCGGGCAGCAGCTTGGTCGGCGGGTAGAACGTGACGTGCAGGTGGATCCTCAGCTCCGGGAATACCGTCTGACGCCACCCAGCGTGGAACGCCAAGCAAGCCGCGTCAGATCGAGCCCGCTTTGCCGCCTTCGCCTTGCGCGCCCAGTGCACGCGCCCATTCGGCGACAGGTCCTTGTGCGGCCACGGCAGGATCAGCTCCTTCATGCCGCGCTCCTCTGGCCCAGCCGCTGCAGGGTCACGTTCAAGGCGGCCAACTCGTCCATCTTCATGACGAGCCACATGCGCCGCTCCCCATGGATCCCGTTGAAGCTGCCCTGATGGCAGTCCTTGCATAGGGCCACAGTGGTGTAGTGCTGCCCCTGGTTGATGTGGTGGGCGTCGCTCGGGCCCGGCAGGCCACACACGCTGCAGGGCAGATCCTTGACCGCCTGCAGGTGCGCGCGCTCTGCGGCGGTGATCTCCTTGGAGTTCTTCGTCCTCACGCAACCCTCCTGCTTGGCGGCGCGAACTGCGCCAGTTTCTCTACTGAGCTGTCCGACCAGCGCACCCGGTCCGAGAACTCGGCGTGGATGAAGGTGAGGAAGTCGCCCATCTTCCGGCGGCCGTACTTGCTGGTCCGGGCGCCGAGCATCACCACGCCGCCGCGCAGGCCGGGGGCCCACTCGGTCTCCTCCTCGAACGCGGCGGTCAGCACGTCCTTCCAGTCGTAGGGCGTGGCCTGCCTGGTACTGCCGTCGCGGCGGGTGATCACCAGCGGCACCTGCTTGGCGATGTCGCTGAGCGCCGGCCACATTGCCGCGTTCTGGTCCAGCGTCCGCTTCGGCTCGTCCAGGGTGATCTGCACCGGGCCGCCCTTGAGCCATTCGTTGATGGCGCGCACGACGCTGGAGATCACCTGCGGCCAGTTACGGTTGTTCGGCGGATCGATCAGGAAGGTGCGCTTCATGCTCGGCTCTCCTTCTTCCCGATGTAGCCGAACCCCTCACAGCGGTCGCAGCGCTCCCCCCATTCACTGCCGTCAGGGTTGTGCCAACGGTCGCCACCATGTCCGTGACACGTGGGGCATTCAACGACCTCCGGGAACATTTCGCTGAATTTCTCTTGCGAAATCTCCCCCATGGCCGCGAAGTTCTCTTTCTCGCTCATGGCCTTTCCTCCGGCGGAGCGGGCAGCGGCTGCCAGTCAGTGGCATCATTGAGAGGCTCACCATCCCACGTGGCCTCCCATCGCTCGCGCTCGTGATCGAAAACAGCAGCCCATGCCACCGGCTCGCCATTCGGACCGGTCATGTGCGCAGATGTGGCGAGTACAGCAGTACCATCCTTCGGCGCCGACTCAATCGGCCGCCACTGCGGTGCGAGCGCGGCGATAACCTCGTCAGCGATCTCCTCGTACTCGTCCTCTGGAATCCCGGGGTTTGAGCTGAAGTAGGCGTAGCCCTCCATCTCACGGGTGTGACCGGACAGGATGCTGATGATCCGCGCCTTCAATTCGATCTTGCTCACGGCCTTTCCTCCAGCGTGTCGAACGCAAAGTCCCAATCGATGCCTGATTCATCGGGGATGCCGGTGGCGTACTCGGCTTCGGCATGCTCCGCCCCGGGCGCATCGATTGAGATGAACCAGTCCTTGGCCTGCTCATCCCACTCCGGTTTGATCTCCCACCACGACCACAGACCGTCGCCGTCCATCGACAGCCACTGCGCCCATTCGGGAGCGTCTTTCCAATTCGGCTTGCTCATCCCCGCTTCCTCCTGATCTGCTCGTCTCGTTCGTCGTAGCCGGCCAGCCAAGCCCGGCGCAGCGCCAGCCCGTCCTCGCCCATGGCGTAGAGCGGGACCGAATTCCTGTCCTTGTGCGCGTCGCGCATCCACCGGCCAGTCTGGCGGGCGCGTTCCAGTTCGTATTGCGGGACCATCAGAACTCCCTCCCCAAGTCGGCGAAGCGCATGGTTTCGCTGAGGAAGGCGACCTTCTTGAATCCGGTCGGCCCGTGGCGGTTCTTCTCGATCAGGATCTCGGCGATGCCCCGGTCCTGCGTCTCGCGGTTGTAGACCTCATCCCGGTACAGCATCAGGATCTGGTCGGCCTCGCGGGTAAGCTCGTCGCTGTTGGCGAGGTCGCCGGCATTCGGGCGCTTGTCGCCTTGGCGCTGGTCAACTGCCTTCACAACCTGGGCAAGAGAGATGACCGGGATGTCCAAGTCGCGGGCTAGGTTCTTCATCCCGCGTGCCACTTGGGAGACTTCCGTGGTTCGGTCCGCACGCGGCACCGTGATGCGCTGGGCGTAATCGATAAACAGGCAGCCGATGCCGTGGGTGTGCTTCCACTTACGAGCGATCCCTACCAACTCGTCCAGCGTCACGGCCGAGCGGTCGTAGATCCACATGTCGCGATCTAGCGCCTTGGCCATACCCGCCTGCAGGAGCGACCAATCCTCGTCCTCCAAGTTGCCAGTACGAAGCTGGGTTGCTGAGACCTGGGACACCGCGGAGAGTCGGCGTAACGCCAGCTGTACTGCGGGTTGCTCGGCACTGATCACACCGGGCCGCCACTTCGCGTCTGCGGCAGCTTCGATCAGTCCACCGAGAAACGCAGTCTTGCCCATCGCAGGGCGGCCGCCGATGATCGTTAGATCGCCAGGGTGCCACCCGCCCAGAATGTCGTTTAGCGCATTCAACCCTGTCGGGATACCAGGTAGCGCACCGCCCGATGCGTGATTCCGCGCCACTTCTCGCCACGCTTCCTGCATCGCCTGCTTTCCGGTGTATTCGCAGGAGGTAACCACGGCATTGAGCGCCAGCAGCCGGCCAGCGGCCACGTCCACCGCGTCCTCCTCGCCGGCGCGCGCCGCCGCCACCAGCTGCAGTCCGACCGCCACAGCCTCGCGGCGACGCCAGTTCTCGCGCACCAGATCGGCGTAGGCGATAACCGCCGAGGAGCCAGGGACGTTCGCGGCCAGGTGCACCACGTAGTCGAAGTCGTCCGGCGATGCCTCGCCGATGGTCACCGCGTCGGCGGGCTCGCCGGCCAGCACGCGGTCGCGGATCAGGCCGAACACCCGTGCGCGCTGCGGGCTGGTGAAGTGGTCCGCGCCGATCAGCGGTGCCACGTCGTGGAAGCGCTGATTGTCCTGCAGCAAGCCGCCGATCACGGCTTCCTCGGCGAAGGATGGGGTCACGTTGCTCACAGCGGCCTCCTCCCGCCGCCAGCGGCGACCGGCTGCTGGATCTGCACGACCTGCGCTGGCGCCCGCTCAGTCGCCCCGCCCATCCAGCTGTTCAGGAACTTCGGCGTGCCGCGCCGCGTCTTGCGCTTCGGCGGATTCGCCAATGCCCACGCCCTGGCCTTGCGGATCTCGCCCACCACGTCGACACGGGGGAAGGCAGCGCGCAGCTCGGCCACCTCGGCGTCGGTCACGGCGTACTCGGAACCGTCGGCCAGCGGGATGGCCAGAACCTCCCCTTGCATCCCCTCCGGAGTAGAAACAGGAACAGGAGCAGGAACAGGAGAGTTTCCTAACGGTTTGGAAACCGTTTCGGAAACTCCGATCGATGCGAGGATTTCATCCTTGAAAGCGAGCGAGTCAGGCAGAGCCTGGGCCAGCTTGGCGATGGACTTCTGCTGGTTCGGATTGTCCGGGCGGTTCCACTTCACGAACTTGACGATCCAGACGACCTTGGTCGCGCGGTCGTACTTCACGAACCCCGCTTCAGAAAGGGTTTCCAAACCGTTCTGGAACCGTTCCGAATCCCAGCCAAGATCCTCACAGGCATAGGCATCGGGCAGCCGGAATGCACCCAGCATCGTCGTGTGCTGGCTCGTCATCAGGTAGATCGCCAGCAGGCGCGCATCGGACTCCAGCCCCAGCATCGTCTCGCTGGCCCAGAACCCGGTGTGGATCTTTCCGTAGTCGCGCATTACGCCGCTCCCAACAGGTCAGGCTGCGGCGTAGCCGTGCGCCGGGCTTCCGCCCTGGCCTGCTCGGCCGCGCATCGGGACAGGTGCGCGACGATCTCCTCGCGCGTCATGGGCGGGCTGGACTCGATAACCCGTATGCACTCGTCCAGTCGCTGCAGCAGCTCGCGGTTCTTCATTGGCGGACCTCATCAGGTCGGCCAGGCGCTCGACCTCGCAGATGCCGTCGAGGGCGGCCTGCAGGCTGAGGAACTGGCGGAGAAGGTTGGAGCCGGTCGCGGCACACAGCGGGCCGACCAGCTTCTCGGGGATCGGGCGGGCGCCGTTCTGCATCCGCGAGACGTAGGACTTCGACTTGCCGATGCAGGCCGCCACGTACTCCAGCTTGTGGTGGCCGGCGCGGATCATCACGGCCAGCGCGCGCGCCGCCGATTCGATCTGCCGGACGATCTGGGAGGGCGCATCCTTCGGGGCGTGGTGTACGCCAAATGCGAGCGGTAGAGCCTTTTGGTTGCCAGGGGTTGCCATGCGTTGCCTATCGTTGCCAAGCCCTCTCGGGCGGAATAAAGGCCCAACCCACAACGGATTGAGCCAAGTGAATTCAGAGGTGACGCGGTCGAGCGGTGTCGTTGAACTGGTGCCAGTTGCAGGCCGGCTGTTCGTGCTGAGGCGTTACGGCGATCGGGTGCGAATCACCCAGGTGGAGCGAAAGAACCCGCCCGTTCCCGGCAGCGGCACCGTGGTGCCCTTCCCTGCCCGGGGTCGGTGAGGTGGTCATGTCAGGGGGAGCGCTTACGGCGCGACTTGGGAGCCGAAGCACCAAACACGTCGGGCCTCAGTTCGTGGCGGGTGACGCCGGTATCGGCCTCGATCACGAGGACGTGCCTGGGTGGGACCGGGCGCAGCCCCTTGACCCACTGGTTGACTGCCTGGGGGGTCACCTGCAGGTGGCGCGCAACGGCGGCCTGAGGCATCTCGTATCGGTCGATGGCTGTCTGGATTGCGCTCATGGATAAACTAAAGCACCGCTTTAGTTCAAAGTCAAGCGTTGCCTTATTACGGACGCCGGTAGCCCGCCGCTACCATCAACCCATGCTTGATAACGATGAAATGGCGCGGCGCGTCCGCTACGCGTTCGACAATGCCGGCCGAGGCACACAGGCCGCCGTTGCGCGCGAACTTGGCATAAGCGCACAGGCGGTAACTGGCTGGGTGAAAACGGGGAAGATTGAGAAGTCGAACCTACCGGCCCTGGCCCGTCTCACTGGCCGCCGCGTGGACTACTTCTTGGACTCCACTGTGACGGATGAAGAATCAGGGGCCGACGTCGCACAGGCTGCGAATGACGACGACACCGTAGACATCCTTGGCTACAGCCAGGCTGCCGGCCTTGGCGATGGCATGGAGGCGATCGAATACGCCGAGACTCATCGCCTCAAGTTTCGAGCGTCGTCCCTGGCCCGTAAGCGCCTAATCCCAACGCGGCTTGCCGTCTTCTATGGCAAGGGCGACTCAATGCTCCCCCGCATCCAGTCAGGGGACGCGATCCTTTTCGATACCAGCGACACGCGCCCAGCCGACGACAAGCTGTTCGTCATCATGGCTGCGGGTATCGCAGGCAGCGAGTACAGCGTGAAGCGCTGCCGGACATTTGGCGATGACATCTATTTCGATGCCCTGAACGAAAAGGGCGACCACAACTGGCGCAAGCCGCGGAAGATGGACGACCCGCGCCACCCCATCACCATCATAGGGAGAGTGCGATGGATCGGAAGCTGGGAGGACTGATCCTGCTGGCGCTGGCTGCGCCGGCAGTAGCTTGGGCCCAGGATGGGTATAGACGAGCTGGCACCGCTCTTGGGCAGGCAATCTTCGGGACCAGCCAAGAGGCATATGACCGCGAGTACGAGCGACGCGTCCTTCTGGAGGGCGCGCGCCAAGACGCGATACGCGCTCGGCAAGAGGCCGATGCTGCAGAGCTTCAGGTGAAGGCAGTCCAGCGGCTACAGTCGATCTGGGTACAGTTGGGGCTACCCGATGAGGAGGCCCGCTCTGTTGCCTCCGCCTTCACCTGGGACGCCCAGATGGAAGCGATTTGCCAGCGCGCAGCCCGCGACGGGTACAAGACAACGATGGACGCCGGGATCAAGGCCTACAGGGACTACCAGTACCCGCTGGCCAACCAGCTTGTCCTCGCCGCCTATCTCCTGCCGGTCGAACCCACGCCCTAAAGCCTTCGCCGCGCCAATAAGAGCCCGCCCTGAGCGGGCTTTTTTGTGCCCGGCCCAAGGGCATCGAAAAAATTCTCAACTTTTTCTAAAGCGCCGCTTGACTTTGAACTAAAGCGGCGCTTTACTTACTCCAACGCCGCGACACGCCCACTCCCGGGAGCGGCTTGGAGACGAAGATGGGTCTGCACACTGCAACCGACATCGCCCGGAACGCCCAGCGGAGCCTTGATGGCCTGCTGCCGCCGGATACCGAAGAGGCGTTCCAGGACGCCTGCAACTCCCTGGCACGCACCTACGACCGTGAGGGCAGCACCGCCGAGCTGATCTCGGTGTTGGTGACCTCCGAGCGCGCCTTGGACTTCCTGATTTCCGAGGTCGAGGTTCCGGCGCACCTGCTCCATGACCTGCGCGAGCTGATGGACCTGCAGGCCCGGGTCGTCCGGCAGGTCGAGCGCTCGATGCGCGCTGGCGGTGCCACATGACCGCCGCCGACCGAGCCCTGCACTTCCAGGCGCTGAAGCTGGCTTCGGGCTACCTGCTGGCCTTCTGCATGGGCGTCGCTTTCGCCGTGGTGGTGCAGGCGGTGCTGTCGTGAGCCGGGCCGTGAATGTTCTGGCGGTTCTGGCCGAGCAGATCGCAAGCAGCCGAAACGCGCTCCCGATCCTTCCTGAGCACCTGCGATCCAAGGCCGAGGAAGCGATCAAAGAGGTAGAGCAGGTGCATGCCCGCGTTTCGGTCGTCTTCGACGCAGCCCGCGCCGTGCTGAGTGCCAGCGACTTCACCGACCTGCTCCACGCCGAGGAGTGCCTGCGCGAGGCGCTGGCCGCCTGCGAGCCGGAGACGCCGCATGAACCCGTTTGACCACCTCGACGCAGCGTTCGCCGCGCAGTTTGGCGCGCTGCCGCCCATCACCCCGCCGATGTCGCTGGCGGAAGCCCGGGAACAACGCAACCGCGAGGCCGTGGACGGCCTGTGCGTGGAGGAAAACGACGATGAGTAAGCACACCCCGGGGCCGTGGGTCGTAACCCCGCATCCAATGACGAATGTGGACGTGTTTGGCGTCGGCGTGATTATGGACGACAAGGAGATGCAGTACGCGCTTTCGCACACCATGTGCTATCAGAACGCGGAAGCCAACGCCCGCCTGATCGCCGCCGCGCCTGATCTGCTGGAAGCGCTTAGCAACTTTCCATCTGATGCCGACTATTCGTCATCCGACGAGTACTGGAAAGCGACTGTCCATTGGTGGCTAACCGCAGCAGAGCCCGCCATCGCCAAGGCCAAAGGCGGTGAGCAATGAGCCTCCGCCTCCGCATCGCCTGGGCCGCAGTCGCGCTGTTCGCCGCCGTCGTCGTGCCGCTGCGCATCGCCGAGATCCACCAGGCCCACACCGACCGTGACGCTGCCAAAGCCCGATGGGCTGCAACCAGCAGCGTGCGTGGCTGAATTCCCCCGCCCTCACGGGCCCCGCGCCGGCCGGGATTCCACGACGCCGGCATCTATTCCTGAGGAGATGACATGTTCTTCCGCAACCTGACGTTCTTCCGCTTCCCGACCACCACCGACTTTTCCGAAGTCGACACTTTGCTGCCGCACGCACTGCTGAAGCCGGTCGGCGCACTGGAAATGAATTCGCGCGGCTTCATCTCGCCGTTCGGCCGCGAGGAGAAGGAACTGCTCTCCCACCGTTGCGGCGACTTCCTGTGGCTGGCAGTTGGCGGCGAGGACAAGATTCTGCCGACGTCGGTGGTGAACGGCGAACTCGAAAAGAAGCTGGCGCACATCGAGGAAACCGAGGGCCGCAGGCCAGGCGGCCGCGAGCGCAAGCGCATGAAGGACGACCTGCTGCATGAGCTGCTGCCGCGCGCCTTCGTGAAGTCCTCGCGCAACGATGCCTTCATCGATTTGCAGCACGGCTACGTCGCGGTCGATACCTCCAGCCGCAAGACCGGCGAGTACGTCATCTCCGATATCCGTGGCCTCCTCGGTAGCTTCCCAGCGATGCCGCTGAACGCCGAAGTCGCTCCGCGCTCGGTCCTCACCGGCTGGATTGCTGGCGAACCGATGCCAGTCGGCCTGTCGCTGGGCGAGGAAGCGGAATTGCGCGACCCCGTGCAGGGCGGAGCTGTCGTGAAGTGCAGCTATCAGGAGCTGAAAAGCGATGAGATCGACAAGCACCTGGACGCGGGCAAGCAGGTAACCAAGCTGGCGCTTGTGTTCGAGGACAGCCTGTCCTTCGTACTTGGCGAAGACCTGATCGTCCGCAAGCTGCGCTTCCTCGACGGCGCACTGGACAAGCTGGAACACGCGGACCAGGACGGCCGCCGTGCCGAGTTCGATGCCCGTTTCGCCCTGCAGAGCGCCGAGATCCGCCGCCTGTTCCTGGTGCTGGAAGAAGCCTTCCGCATCTCCAGCGCCAACTGACCCCGCAAGGACGCACCCGCCCGCCCGGCGGCGGCTCCGAGAGCCGGGCACCTATTCCACCTACCAGCAGAGCAGCCATGAACCAGATCGTCACCATCGAGGATTCGGTCTACGGCACCAAGGACTCTTTCGCCTCGGTGCTGACTGATCGGTCCATCAACTTCGACCGTGAGGCCGAGTTCGCCCTGCAGACGCTGTACGGCAACGACTACGCGATGAAAATCGCGATGCAGAACCGGGCGTCGGTCATCGCTGCCGTGGTCAACATCGCGGCCATCGGCATCAGCTTGAACCCGGCGAAGAAGCAGGCCTACCTGGTGCCGCGCGACGGCAAGATCTGCCTCGACATCAGCTACATGGGCCTGATGGATCTGGCCATCGACTCCGGGTCGGTCCGCTGGGGCCAGGCCGAGCTGGTCTACGAGAGCGACCTGTTCGAGCTGGTCGGGGTGGACAAGGAACCAATCCACAAGCGGGCCCCGTTCAGCCGAAACCGCGGAGAGATTGTCGGCGCCTACGTGGTGGTGAAGACGCCCGAAGGTGATTACCTGACCACCGCAATGTCTGTGGATGAAATCAACGACATCCGCGACCGTTCGTCGGCATGGAAGGCGTGGGTATCGAAGAAGAAGTCCTGCCCGTGGGTCACCGACTGGGGCGAGATGGCGAAGAAGACGGTGGTGAAGCGCGCCTACAAGTATTGGCCGAAGACCGAGCGCCTGGAAACCGCGATTCACCACCTGAACACCGATGGCGGCGAAGGGCTGGCGGTGATCGAGCAGCAGGCGCAGAACCGCACCGCGCTGCCGCCCCCGGAGGACACCGAGGAACGCATCGAGCTCTACGCCAGCCTGCAGGACATCGCGACCGCCGGTGTCGAGGCGCTCGGTGAAGCGTGGTCGAAGTTGACGAAGGCGCAGCGCACGATGATCGGTCAGGCTGGTCTGGAGGCGCTGAAGGCTGAGGCCGAGAAGGCAGACGCCGAGGTGGTCGAATGA
Protein sequences of DBSCAN-SWA_2 >LR134301|1861149:1874257|1868624_1869131_-|VEE52132.1|DBSCAN-SWA MATPGNQKALPLAFGVHHAPKDAPSQIVRQIESAARALAVMIRAGHHKLEYVAACIGKSKSYVSRMQNGARPIPEKLVGPLCAATGSNLLRQFLSLQAALDGICEVERLADLMRSANEEPRAAAATGRVHTGYRVQPAHDARGDRRAPVPMRGRAGQGGSPAHGYAAA >LR134301|1861149:1874257|1865165_1865633_-|VEE52125.1|DBSCAN-SWA MKRTFLIDPPNNRNWPQVISSVVRAINEWLKGGPVQITLDEPKRTLDQNAAMWPALSDIAKQVPLVITRRDGSTRQATPYDWKDVLTAAFEEETEWAPGLRGGVVMLGARTSKYGRRKMGDFLTFIHAEFSDRVRWSDSSVEKLAQFAPPSRRVA >LR134301|1861149:1874257|1865629_1865833_-|VEE52126.1|DBSCAN-SWA MSEKENFAAMGEISQEKFSEMFPEVVECPTCHGHGGDRWHNPDGSEWGERCDRCEGFGYIGKKESRA >LR134301|1861149:1874257|1867871_1868606_-|VEE52131.1|DBSCAN-SWA MRDYGKIHTGFWASETMLGLESDARLLAIYLMTSQHTTMLGAFRLPDAYACEDLGWDSERFQNGLETLSEAGFVKYDRATKVVWIVKFVKWNRPDNPNQQKSIAKLAQALPDSLAFKDEILASIGVSETVSKPLGNSPVPAPVPVSTPEGMQGEVLAIPLADGSEYAVTDAEVAELRAAFPRVDVVGEIRKARAWALANPPKRKTRRGTPKFLNSWMGGATERAPAQVVQIQQPVAAGGGRRPL >LR134301|1861149:1874257|1871437_1871695_+|VEE52138.1|DBSCAN-SWA MNVLAVLAEQIASSRNALPILPEHLRSKAEEAIKEVEQVHARVSVVFDAARAVLSASDFTDLLHAEECLREALAACEPETPHEPV >LR134301|1861149:1874257|1871312_1871429_+|VEE52137.1|DBSCAN-SWA MTAADRALHFQALKLASGYLLAFCMGVAFAVVVQAVLS >LR134301|1861149:1874257|1864085_1864529_-|VEE52122.1|DBSCAN-SWA MSRTGEIRAWFERHGGEHRLSDVLQGMRAHGREKTLVAATVCGLARDGVLTATGQRGQRLYSLRPGHTFEPSRAKRIRLWLDGNPGWHFAADICDGMQVEDPAERARYARSLSNMVGIGLLQATGRATLMKYRKARNAYYRPTEGSA >LR134301|1861149:1874257|1872131_1872281_+|VEE52141.1|DBSCAN-SWA MSLRLRIAWAAVALFAAVVVPLRIAEIHQAHTDRDAAKARWAATSSVRG >LR134301|1861149:1874257|1863239_1863446_-|VEE52120.1|DBSCAN-SWA MSLDPITQGLQHLAGQFSLTRQEWRDHHRGGDSLLDSLVSHGYAQEQGERFGITRQGQVRLQALKERA >LR134301|1861149:1874257|1870366_1870834_+|VEE52135.1|DBSCAN-SWA MDRKLGGLILLALAAPAVAWAQDGYRRAGTALGQAIFGTSQEAYDREYERRVLLEGARQDAIRARQEADAAELQVKAVQRLQSIWVQLGLPDEEARSVASAFTWDAQMEAICQRAARDGYKTTMDAGIKAYRDYQYPLANQLVLAAYLLPVEPTP >LR134301|1861149:1874257|1871681_1871834_+|VEE52139.1|DBSCAN-SWA MNPFDHLDAAFAAQFGALPPITPPMSLAEAREQRNREAVDGLCVEENDDE >LR134301|1861149:1874257|1869695_1870391_+|VEE52134.1|DBSCAN-SWA MLDNDEMARRVRYAFDNAGRGTQAAVARELGISAQAVTGWVKTGKIEKSNLPALARLTGRRVDYFLDSTVTDEESGADVAQAANDDDTVDILGYSQAAGLGDGMEAIEYAETHRLKFRASSLARKRLIPTRLAVFYGKGDSMLPRIQSGDAILFDTSDTRPADDKLFVIMAAGIAGSEYSVKRCRTFGDDIYFDALNEKGDHNWRKPRKMDDPRHPITIIGRVRWIGSWED >LR134301|1861149:1874257|1873336_1874257_+|VEE52143.1|DBSCAN-SWA MNQIVTIEDSVYGTKDSFASVLTDRSINFDREAEFALQTLYGNDYAMKIAMQNRASVIAAVVNIAAIGISLNPAKKQAYLVPRDGKICLDISYMGLMDLAIDSGSVRWGQAELVYESDLFELVGVDKEPIHKRAPFSRNRGEIVGAYVVVKTPEGDYLTTAMSVDEINDIRDRSSAWKAWVSKKKSCPWVTDWGEMAKKTVVKRAYKYWPKTERLETAIHHLNTDGGEGLAVIEQQAQNRTALPPPEDTEERIELYASLQDIATAGVEALGEAWSKLTKAQRTMIGQAGLEALKAEAEKADAEVVE >LR134301|1861149:1874257|1861149_1862517_-|VEE52117.1|DBSCAN-SWA MSVSAQPNPLAPHTPVHIPAKLLPVLKPKQFKVLYGGRGSAKSHTVAQILVMLSMQAKHRILCVREIQKSIAQSSKRVIEDYINRMGLSAYFKINKQGEDQITCILTGSTFSFTGLQDHTADSIKSFEGATIVWVEEASNVSANSWNKLIPTIVRTTGAEIWVTFNPDQQDDYAYKRWVLGDDPDAIVIQINWLDNPWWNPAMETERLKTLAVSQDLHDHIFGGQPRAKAGILFKRHWFKRFNLGDEPKGLRKYLASDYAGAPDPDDPEADPDWTEHGCAGLDHIGDMWFVDWWSGQEDPSVWIAALMQMGRRNKPVMAFEEMGVILRTTDGAIRRAAKATQTFVHRVPLASAGSKADRALGFAARAATGSVHIPNTEWGDRLIDQLCAFTGEDGRRDDMVDVCSLFGRGIDLMADGSLPPEAKPAPPEPFTEKWFKQRDAADRDEDEKTARYYR >LR134301|1861149:1874257|1871010_1871316_+|VEE52136.1|DBSCAN-SWA MGLHTATDIARNAQRSLDGLLPPDTEEAFQDACNSLARTYDREGSTAELISVLVTSERALDFLISEVEVPAHLLHDLRELMDLQARVVRQVERSMRAGGAT >LR134301|1861149:1874257|1866603_1867875_-|VEE52130.1|DBSCAN-SWA MSNVTPSFAEEAVIGGLLQDNQRFHDVAPLIGADHFTSPQRARVFGLIRDRVLAGEPADAVTIGEASPDDFDYVVHLAANVPGSSAVIAYADLVRENWRRREAVAVGLQLVAAARAGEEDAVDVAAGRLLALNAVVTSCEYTGKQAMQEAWREVARNHASGGALPGIPTGLNALNDILGGWHPGDLTIIGGRPAMGKTAFLGGLIEAAADAKWRPGVISAEQPAVQLALRRLSAVSQVSATQLRTGNLEDEDWSLLQAGMAKALDRDMWIYDRSAVTLDELVGIARKWKHTHGIGCLFIDYAQRITVPRADRTTEVSQVARGMKNLARDLDIPVISLAQVVKAVDQRQGDKRPNAGDLANSDELTREADQILMLYRDEVYNRETQDRGIAEILIEKNRHGPTGFKKVAFLSETMRFADLGREF >LR134301|1861149:1874257|1872353_1873259_+|VEE52142.1|DBSCAN-SWA MFFRNLTFFRFPTTTDFSEVDTLLPHALLKPVGALEMNSRGFISPFGREEKELLSHRCGDFLWLAVGGEDKILPTSVVNGELEKKLAHIEETEGRRPGGRERKRMKDDLLHELLPRAFVKSSRNDAFIDLQHGYVAVDTSSRKTGEYVISDIRGLLGSFPAMPLNAEVAPRSVLTGWIAGEPMPVGLSLGEEAELRDPVQGGAVVKCSYQELKSDEIDKHLDAGKQVTKLALVFEDSLSFVLGEDLIVRKLRFLDGALDKLEHADQDGRRAEFDARFALQSAEIRRLFLVLEEAFRISSAN >LR134301|1861149:1874257|1862513_1863005_-|VEE52118.1|DBSCAN-SWA MAAQRKSTASTAKKPGKPIGRPSKYTQDLAERVCVLIAQGDSIAKIGETEGMPDARTIFRWLASNAGGDEDDPASFRQQYMRARASRADARFERLDEIMQKVEDGRLDPAAARVMMDAIKWQSGKENAKRYGEKVQLADADGEKLPAPPPFYVMGVVPAKQGE >LR134301|1861149:1874257|1865829_1866192_-|VEE52127.1|DBSCAN-SWA MSKIELKARIISILSGHTREMEGYAYFSSNPGIPEDEYEEIADEVIAALAPQWRPIESAPKDGTAVLATSAHMTGPNGEPVAWAAVFDHERERWEATWDGEPLNDATDWQPLPAPPEERP >LR134301|1861149:1874257|1864525_1864891_-|VEE52123.1|DBSCAN-SWA MKELILPWPHKDLSPNGRVHWARKAKAAKRARSDAACLAFHAGWRQTVFPELRIHLHVTFYPPTKLLPDDDNMLARFKPYRDGIADALGIDDKRFISHPLVSTEVRKGGQVVVRITGGPEA >LR134301|1861149:1874257|1871826_1872135_+|VEE52140.1|DBSCAN-SWA MSKHTPGPWVVTPHPMTNVDVFGVGVIMDDKEMQYALSHTMCYQNAEANARLIAAAPDLLEALSNFPSDADYSSSDEYWKATVHWWLTAAEPAIAKAKGGEQ >LR134301|1861149:1874257|1866424_1866604_-|VEE52129.1|DBSCAN-SWA MVPQYELERARQTGRWMRDAHKDRNSVPLYAMGEDGLALRRAWLAGYDERDEQIRRKRG >LR134301|1861149:1874257|1866188_1866428_-|VEE52128.1|DBSCAN-SWA MSKPNWKDAPEWAQWLSMDGDGLWSWWEIKPEWDEQAKDWFISIDAPGAEHAEAEYATGIPDESGIDWDFAFDTLEERP >LR134301|1861149:1874257|1864887_1865169_-|VEE52124.1|DBSCAN-SWA MRTKNSKEITAAERAHLQAVKDLPCSVCGLPGPSDAHHINQGQHYTTVALCKDCHQGSFNGIHGERRMWLVMKMDELAALNVTLQRLGQRSAA >LR134301|1861149:1874257|1862988_1863243_-|VEE52119.1|DBSCAN-SWA MSIQETNWDTVATLLDRQLEDSMLRTGEYPVELVITNLQGRQLCLEMAHLTVPYALDEMPADGIYWRGVPVRVIHVWENHGSAA >LR134301|1861149:1874257|1869382_1869613_-|VEE52133.1|DBSCAN-SWA MSAIQTAIDRYEMPQAAVARHLQVTPQAVNQWVKGLRPVPPRHVLVIEADTGVTRHELRPDVFGASAPKSRRKRSP >LR134301|1861149:1874257|1863609_1864050_-|VEE52121.1|DBSCAN-SWA MAASVEAPRRTGTTEGVPFRQVWKPRVVCVVDPTNPADALNAILPRIAENQRGCTVASFLLINPETSQAFVLAEDKPVAVEMARKGEKSPYWPWLVGKYSFPRVTAEAAANVLEDMLEHLGIATPAPRKRPMPVQLDLFDLSGRAA |
27 | Pseudomonas_phage(20.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1901755 : 1922998
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >LR134301|1901755:1922998|DBSCAN-SWA CTCATGCGCCCTCCCCTGCAATGTTACCGATGACCTGGAAGCGAGTTGAATCGGACGCGCCTTCCGGCGTGCCCGGCAGCGTTGTCCGCATCAGCCAAACTGGAGCCAACCCGCCGATCGTGTTGAATCGCACTGCGTTGTTGGTTGACCAGCCGGTTCCCCACCCCTCTTTCTTCATCACAAAGTACGGCTGCCCAGTGCGTGGGTTGATGGGCGCGAGATCCGTGGTCGTGTTTCCGGCAGAGATCGTGCCGACCGTCTCGCCGATCACCTCAAACTGCGTTGCACTGGTAAACCGCACCGCCCAACGCTCAGTGATCGCGTCCTTGTTCGTCACGGCGAGCGGATAGTCCGTGTCGTTGTAGGTACCGGGCGCGACACTGCCGCTCGGGGAATCGGCCCATACGTTGGTCCATGCGGCCTGGTCGAACAGACTCACCACACGCGCCTGCAGGTCCAGCGATCCATTGGCTTCGCCCAACCGCAGTGCGGTGCTGATCATCGACTCCCCCACCGGATAGTCGTGCGTCAGGGAGGTGTTGATCTCGATCTCACCGGTGATCTGAGGCTGAACCACCAAGCGCCGGTCCTCGACGCGCTGGCTGATCGTCAGTGGCAGCGTGTACGCGGCCAGGTTCAGCGGGTCACTGAAGCGCAGATTGCCAACGTCCAGATCCGCCGTGAACCATGCGGCGTCGACCGGACGCCCGGCCGCGTCACGTACTTCGATGCCTGCAATCCTGCCGCGACCAAAGGTGACCAGCTGTCCCGCTTGTGGCGAGGGCACCACGTGCTTGGCCGTGTGGTGGATCAGCACGGTCTGCCCCGGTTTGAACGCTGGCGCACGTCCGTCGCTGGGCAGTCGCACCGACGACAAGCCGATCACTACCTCCGACAGCGGAATCGATCGATACACGACCGCGCCCATGTAGATTGTGCCCGGCAGCACGAGCGTCGGCCTCCATACCTGATCCCCCTCAACCTGATCGGGGTCAAACCAGGGCTTGCCCTCATTCCCGGCCACGGGCACCAGCTGGCCGAACTGAACCTTGACGACACCACTTTCCCAATCGACCTTGCCACGAATTTCCGAAGCTGACAGCACACCATTGATATCCGCAGTCACCGTGAGCAGCTCACCATCGATGCGGGTCGCGCGCAGCGTGAACATGCCTGGCCGCAGCGGTGAACCAGGCGCACGGAAGAAACTGACCGCCACACCCGGATCGGCGATCCGTGTCAGGAGGGATTGGATCTGGACAGTGTTCTCGCCGCCTGCCAGCCACTGCACCAGGTTCACCACACCGGCCGCGTAGTCGATCGTCCCTGCGTAGATGCCCGAGCCAGTGGTCGGATCGATGGAGTGATACAGCCCGCCGCTACGGTCAACGTAGGTGCGCCCACGGAATGCAAAGCGGACGCTGCCCGGAACGATGCTGTCGCTGATCGTCGGGGTGAGCTGCAGCTGGACCGCCGGCAGCGGCATGACCTCCTGCGCAGACTGGGCGGCCTCGCCAGCGAGCATCCAACCGGCTGAAACGATACTGCCGGCGGAGAACTGCGCCAGCACGTCCAGACGCTCGTACCCAACCACCTTCAGACGACCAGAGCGGATCTCGTACACTGGATAGGAGACCTGGCGGACCATGAACTTTCCGACCTGCAGCGAGACCGCGCCCGTGCTGTAGTTCACCGACCCCAGCACCGTACTGAAGGCCGTGTCCCCCACCGACACGCCGACCAGGTTGCCTGCGCCATCATCCTTGGCGATGACCCGCATCGCCTGAGGGGCCGACGACAGGTCATCGCGATCGCGCCGCACACTCACAAGCCAGTCGAGAAGCACTGTGCCCTGCCTCACCGGTCCCTGCGGCAGGGTGAACGAGACGATCCCGGCCGCGTCCGGAACAGGCTGCGGCGCTGCGTGTAGCGGCTGCCCCCAGTCGTAGGCGATCGAGAGCTGACTGTCCGCATCAGGCAGACTCAGCGGCCGGAGCGAGATCTCACCGGTCGAGTAGCTGATCTGTCCACGAACCTGCCCGCCAATCAGCAAGCCGCCGTTCCCGTTATCTGTGACCGCCACATTGGAGCCGCCAACACGCAGCGTCAGTTGCACCGTCGCCGGTACCGCCGAGCCCTCGCCCAGCACAAAGCGGAGCGCCGGCGGCAGGATGGCGGTATCACCAACCCGGGCCTCCGCGATGATGGGAGTGCCCCAGTTGGAGATGATGCTGCTCTTCAGATCCGGCAGCGCGCCAGCGGTCACCACGACGGAGCCGGTCTGGTAGTTGATCGTCCCGGTGCCTTGGCCCGGCTTGCCTATCAGACGGCCGCGGCCGTTGTCGGTCAACCTGATCCAGCGCCCAAGGGCGCGGTAGTCGACCACCACCGTTCCTGGGGCCGGCAACGGGGTCAGCTGGAACAGCCAGACCATGCCCTGGTTGTTCTGCGTCACCTCAATCTCGTCGGTGAAGCCCTGCATGGGGATGGAGCCAGCCGGGGACGCGGTGATGCTGATACTGGTACTGCCGACGCCAGTCGCATGCGTCAAGGCTACGACCCCACCCTGATAGTCGACCGTACCGCTCCAAGGCGTCGCCACGGCGGAAGCGAGCCCGCCAGCACCGTCGTCGGTGAGTTCGACGCTGCCTGCGACAACCCTTACTGCGCCCACCACCAACCCAGTACCGAGGTATCGGCTGACCGGCACGCCAGCAGCAAAGCTGGCACTGAAGTTCTGGCCGAGGCTGCCGGCAGGGCCGGATGGGACCTGGCTGATGGTGCCCATGCCGGCCAGAACGTCACTGACCGGGGTTTCAGCCGTGGAGGTAGGAACGATGGACACGTACGGGGTATCGACTTGCACAGCCAGATCGCCAGGCTTCCCAGCGGCGGTGAGGCGTTTGACGCTGTGATAGCTCGTGGCCTCCACCACGTTGGTTTCGTAGATCCTGGTCGCGGGCTTGGTGGCCGAGTACCGCACCACCTCCTGGCCGTAGAAATCGCGCAGCAACGCGTTCACCAGCTCGATCACCAAGACATCCCGTTCAAATGCCCCTTGGTCATCGGTAAACGTCCGTGTCGTACGGGACAGCACGCCTTTGACCCGCACGTACTGTTCTGCCGGATCGTGCCCGGAGCTGGCGAGGGTCAGCAGCGAGAAGTTGTCATTGATGTCGGGGCTCGGCGCGTCCTTCATCGCGTAGACCTGGATAGTCATCTGGCCGCTGAAGTGGTTCCCCAGTAGGACAAAGCGGGACTCTGTGCCTCGCGTTATGTAGCTCTCGACGCGGTTCTTGGCGTCGAGGCGCACGTCACTGTAAGAGCCCGTGGCAAACATCGTGACGGTCACGCGCGGGTCTGCCGGAGGATCGATCAGTACCGCGATCGCGTCCTTCAGCACGTCAGGCGCCGGTGTATCCACATGCACAAACATCTTGCGGAGGGTGGTTCGGCCAGTGGTCCGCTCCTCATCGCCGATATCCGGAAAGAGGTTGTTCATCGCCCCATCAATGATTTCGGTCTGAACCATGCGGCCACCGCCGTCAGGGTTATCTGTCAAGCGCTGCGACTGTCGCAGCTTGATATCGGTAGCAAGAATCGTCATGGATTACACCGTCATGAGGCGAAGGGTGATGGAGAAAAGGTCCGCATCAAGCGCGGGGACAGCAAAGCGGGTGGGATCGACTTCGATGGCCGGCCCATCCGTGCGGCGCCACCGGACCTGGAATGACCGCTCGCCGCTGTTGTGGGCCGGCATGACCAGATCCAGCGGCGCCAGGCGAGCCTCGCTCTCACTGGCCTGCAGCGCCCGTAGAACAGGAAGACTGACGACGCCGACGTAGGCAGTACCGTCCCGGGTTGTCTGAAGCGTGATCGGCCGCCCGGCCTGTCGCGCAGACTCCTGGACGATCAAGGCACCCGTCAGGCTTGTACGCGCCTGCTGCCCAACCTTCCAGGCCGTGAACTCATCGGTCCACTGGAGGTCAGCCGGCAGTTCGATTCCAGCAAGAACGATGCGGGTCATCAGCTACGCCCCCGAACGGAAACGGCCCTGCTTTGCTGAACCTGCCGCAGGACCAACGGAGCGACGAGACCCGCGAGGCGCTGCGCTTGCTGCACTTCGGCTGCGGTGGCGCCAGCCACCACTTCCTTCGAAGGCAGCTTCCAGTCAATGACGATGACCTGCTCATTGCTGCCGTTGCTGCCGATACGGGCGGCATCCGCCCTCGCCTGCGCCTCTGCCTCAGCTTCAGCCTGCTTGCGGCGCTCGGCGAGCGCGGCGGCGGCCTCCTGGTCTCGCTGCTGCCGCTGCCGCATGACCTGGGCCTCCAGCTGCGCAACCTCTGCGATTTCGCCCTTGCCCACGTAGTCGAACTGCCCGGCCAGTCGCTCCTTCGCTGCTTTGGAAAGCTCGTCCTCCGCCTCTGCGGTCGCTTGGAGCTCTGCCTTGTATTCAGCCAGCTGCTTGCGCTGGTCAGTGACCCGGTTCAGCGCGTTGGCAAACTGCACAAGCGGGTTGGGGCCGCTGAGCTTGCGCATCGCCTGCAACGCCGATTCGGAGACCTCGCCAATGCTGAACGCCATTCCTTGCGCAGCGGCCCCGGCCTGCCCCATCTGCTTGCCCATTCGTTCGGAGCTACTCCCTACCCGGTCAACCTGGTCCGCCGCACCACCCGCGCTCTCACGCACCTCCTCCAGCCGCTCGCGGCTGCGGCTGGCACCGTCCTGCAGCTGGCGCATTGCCACGTCGCTCACATCGCCCAGCCGCTGCATGCTGCGCTCGGTGTCGTAGATCGATTCCTGCACTGCGAGCTGGCTATCGACGTTGTCCCGGCGCCACTGATCACTGTCTGCGGCTGCGGCTCGCGTTGCGTCGGCATATGCCCGGAAGGCTCGACGCACGTCCTCCACGGATGCCTTGCCCCTGGCCGCGCCGTCACGGATCGCATCGAAAGCGCTCTTGGCGGCGTCGCGCGTCGCATTGAGCGACGCCTGCGACTGGATGCCAAGCCTGGCAAATTCGTCGGCCAGCGGATCGATGGCGACCTGAATCTCGCGGATTCGAGCGTTCAGCGCTGCGGCTGACCGCGCGGCCGCATCGAAGCCGACCTTACCCTGCCGCCCTGCCGATTCCAGCAGCGCGCCCAGCGTGCGCGCTTCGTCCAGCGTCGCGACCTTGCCAAGCGCTGCCTTGAATGCGGTCTCGATCTGGACACCGGTCGCGATCGCGCTTTCGGCCACCGCACCGAACGCGGCGATCGCATCCTTGCCACCAGCACTGAAGCCGACGCCCATGCGCGACGCGGAAACACCAAGCCGCTCCATTGCTGCGACAAGCGTGGTCTGCAGCACGGCGGCGGCATTGGTTGCGCCTTGCGGCATAGCCTCAAACGCGGATTGGGCCGACGCCTGGAAGCGGAGCAGCTCGTCACCGGACAGCTTCTTCAGCGTCTCCAGCAAACCATCACGCACATTGCGCTCTGCGGCGGCGCCCTGGGAGGCGATGTAGCCCAACGCCGCTCCGACCGCCTCCAGGCTCGCCGTGTCTGCGAAGTTGAGCCCCTCAAACACCTTGCCGATCGATTCCTTCGCGAGCTTTGCGTTCCCGTCGATCCCAGCCAACTGCTGCACCACCAGCTGGGCAGCCGGGCCGATTCCATTCATGAGCGCGTCCGCAGCCGTCTGGACGCCACCCCGGAGCGACGCAAAGCCCGTCGACACCTCCAGCAACTTCTGTGTGACCAGGCCCAGCTGCTGCAGCTGCTCCGCCGTCGCGACGCCGGCCTTCTGCTGCATCAGCAGGAAGCCTTCCTGCGCTGTAAGGTACTGCTCCAAGCCGGACAGACGCTTCTCGTATGCCTGCCGCTCCGCCTCGCCCAGCTTGGCGACTTCCTCGGCCGACTTGATGACTACGTCACGGTACGCAACGAAGGACACTGCCTGTTCACGCAGCTGCAGCGCCGAATCGCGTACCTGGCTGATGTAGGCCCGCTGCGCTTCACCGGCACGCTTCAGCGCCGGATCGTGCTGCTTCCAGATGTCCTGCGCCACGGTCTTGAGGACATCCAGACCGCCCATCGCCGCTTCCAATCCCAGCACGGCCACCGTGATGGGGACTGCCTTCGGCAACCCGCGCAGCAACGCGCCGAACCGACCGATACCCCGGCTGCCGCTTGCGACCGCTGCATTGTTGGCAAGCTGCGCATTCGTGGTCGCGATCAGTGCTGCGCGCCATGCGTTCAGCTGGAGCAGGGCCCCCACGATCTTGAACTGCGCATATGCGGCAGCCATCAGGCCGATCACGCGGGCATGGTCCACAACCCACTGCGTCGTGCCTTTAACCGCCTCGGCCATGGTGATGATGGCCTGGGCGGTCTGCTTGGCCCAGCGGGACAGGCTGCCATCGGCGGCCAGTCGATCCAGCGTGGTCAGCAGGGTGGTGAGTTGTTCCTTGAAGTAGGTCAGCACGCCTTGGTCGGCGACTTCCTGTTTCCAGTCCTTAAAGCGATCGGTGGCCGTCTTCCACAGGCCGGCGATGGTGCCAACCTTTGCGGCGGCGGCTGCGCCACCATAGGATTCGGCCAGCAGATCGAGGATGATGGCCTGCGCCTTTGCCACCTGGCCGGTGGCTTCCAGGCTCTTGATCAGCGCCTTCTGGCTGTCATCCAGTGTGAAGCCCTGCTTGCTCAGGCTTTCCATCGCTTTCGACGGCGTCTGCAGTGCCTTGCCAACGACCTCGGCAGACCCCTCCAGCGACATGCCCAGCCGCTGAGCCTGATCGATGGTGATCTGCATTGCTGCCGGGAACTGCTCTCCGACGATATTGGTGTATGACAGCAGGCGTACCTGGGCCGCGCTGATCTGTCCATCGTCGAAGAGCCCGCTCTGGAGCTGCTTGCGCATTGCGGCCAGGCTTTGCGCGGTGAACTCACCGCTTCGGCCGGTCGCCTGCAGAGCGGCCTCCAGTTGCGACAGCTCTTGCTCAGCGTCACTGCCTTCCTTCACGATGGCCTTGATGCCATCGACCACCCGGTTCAGGCCGACAAACGCGAGCGCGCCAGCCGCCACAGCCTTCAGCTTACCGAACCAGCTGACAGTGCTCTCGGTGGCCGACGCCAGGTCGGCACTGCCGGCGGTGGCGTCGTCGGCACGCTCGCGATACTCCGCCAACGACTTTGCCGCAGCCCTGCTGGTGTTGGCCTGCTTGCGGAAGGCCGTCTCTGCCTCATCGATCTGCTGCTTGCGACGACGGCCCGCCTCGGCCTCTGCCGCTGCGGCCCTGGCTTGTTCGGTGAGCGCTGCTGCACTACGGGCGACCTCGACGCGCAGGCGCTGCTGGTGGTCGGCCAGGTTCGCGGTGTTGACGCCGAGCGAAGATAGCTCGTCGTCGGCCTTGCCGACGGCTTCCCATTGCTCGTTGAGCGCCTTCTTCAGGCGCTCGCCTTCCTTACGCAGATCCCGCTGGGTTGCCAGCACCTCGCGGGAGGGCTTGTCCATCTCGCCGATGCTGAGGCTGAGCGCCAGTGCGGCCTTCTGGTTGTCGTCAAACTGCTGCTCCAGCTGCGCGAGCTCGGTCAGCATGCCGTCAAAGGCATCCGCTTTCGCCGCCGCCTCGTTCAGCCCAGTGAGCGAGTCGAGCAGCTTCGTCGCCTTGCCAGCGGTCTCGACCGACACATCCCCCAGATCGCCGAACGCCGCGCGCAGTTCGTCCACACCTTCGCGGCCCTGCGTTTCGATGACGACCCGAATTGCTTCTTCCAGCCGATCAGCCATTGGAGCTTCCGTTGACGCGCCACTGGCGGCGCAGTTCTGTCAGATAGGTGGTGTGGAAGCGCTCGATCAGCCGGCGCCGGGCCTCCAGGGCGCGGCTGTTGCCATCAGCACCCGAGAGCATCTCGAACGGGCTGGGCCCTCGCAGGATGCGAACCGGGCCTCGACCGTGGCGTTTCTGCTGTGCCCGATCCCAGCCGCGCACACGGATGGCCCTGCGGCCCTTGATCGTCGCGATGAAGGCGCCGTCGTAGGTCTTCGACTCGCCCAGGCCAATGCCGGCCGTGGCACCTCGGGATTTGCGACCGGCCCAGCGACCACCGAACTCGATCAGCGAGATCTGCCGCGTGCTGGCCCAAATCGAAAGGAAGTCGTCCCTGCCACGCTTGCCGGTGCTGTAGCCGCGCTCGCCCGTCTCCACGCGATACTTCCCCCGCAGAGCAGAAGCGCGAATGTTGTAGGAGCCACGTACTTCTTGCGCAGTAGCCGGCCCAGCCCGACGCTGCAGACCAATAAACGCCCGCTGCACCGACAGGTCGTATCGATTCAGCACCTCGCCAGCCAGGTCGGTCAGACCATGGAAGCCCTTTGCCCGCCGGCCGCTGACGTAGTACTTGAGCAAGTTGTTGTTGCGATTGGACGCCACAATGCCCTTCCTGTTTCAAACCGGGAGGGCGCCCTCGCGGCGCCCTCCCCTCGCCGGGGTCACCGGCACGCTCAGCCCGCCGACTGCGCTGCGATCTTGAAGGTGTAGAGATCGGCCTCGCCGGCCTGGAAGATGACCGGGCCAGTCAAAGTCACCTGGATCGGTTCGTCGCTGAACCAGTCAACGTCGCCATCGACCGTCAGGTCGACATTGGGGATCGACAGCAGGCCCTCGTCGCCACTGATGCGGTCCTGCATGTCGCCCAGGACCTGGAACGACTTGCTCGGCGTGGTGCCGCCGCTGATTGCGGTTTCCAGGTAGGCATCGAAGCTGTAATCGGCCGTCACAACATCACCAGCCTGCAGCGCGCCACCCTTCTTGGGGATCAGCAGACCATGACGGGGCTCGAGGTCATAGTCGGTGCCCTTCACCAGGTCCACTGCGCCCTTCTTGAAGGCCGGCGCCGGCGTTGCTTCGATGAAGTTATGCGGCAGCTTCACGGGAGTATCCACGCTGCCAACAGTGACCGACACAGCGCTGGCCGAGCCGGCTGCCACAGTGGTGTTGACCAAGGTGCCGTACAGCATGCGAGCCAGGAAGGCCGTCGGCACCTCCAGCGCGGTAATCGAAACGTTGGTGACGCCCGGATTCGAATCCTTGTGGATGATCTGCTGATAACGAGCATCGCGGCGCTTGCTCTTGATCTCCACCGAATCGCCGGCTTCGTAGCTGAAGGTCAGCGAGGACTGCTCCAGGGGCTGGTTACCGAACTTGTCGGCGGGCTCGGGAATGACGGGGACGCGAACGCCCTCGGCGCCGTGCTCCCAGAAGCGCAGGTCACCTGCGAATTTGCGGACTTTTGGCTGTGCCATGGTTCTGGTTCCTTTACGGGTTGGACACGGGCTCAAAGGTCTCGGTCAGACCAGCCCGCGCGGTGATCTGAGCGACTACGGCGGTATGCCCGGCATCGTCCTCCAGGGTCGCAAGCTGGGTTTCGATCAGCTCGAAGCTGGTCACCCCCAGCGGCAGCGACTTCTCTTTGAACGTCAGGGCGCGGATCAGGTCATGGCGCGCGCGATGAACGAGCAGCCTAGGATTCGCTTCATCGCTGCCACGCGGGACTTCGAACTCGATGGTGATGGCCGCATCGGAGCTGGACTGGGCAACACCGCCGCCACTGCGCGAAAGCTGGCGGACAGAGATGATCGTTGCCGGCTCGGTGCTGTCCTCGCTGATCTCGGTTTCATCGATGATCACGGCGCCCGCACCGATGTCGGTGCGGAAGCCACTGCTGCGCGAGATCAGGCGGACGCGAGCAGCCAGGAACTCCACCAGCTGCCACGACAGCGGCTCTGCCAGGTCATCCACGGCTCACCAGCCAGCGGCTCTGCGAGCCGTCATCGCTGATCTTCTTGCTGTTGACGAAGACCTCAGTGCCGAACGCGCTCGAAACCACCTCGAAGCGATCGCCCTGGTCAGGCGCTACATCCGATCGCAGGTACGCGATCTCCACACGGCCCGCCCTGAACTGGCGCAGTTCTCCGATGGTCTCAACGTCGCGCTCCACGTAGGCACGCACGTCCTCGGTGGCCGGACCATCCTTGGCCGTGTACCGACCTCGCGACGCCATGCCCGCCAGCGCAAAGGCGGCGTGCAAAGTGCCGTCCAGATCGCGGAGGAAGTCCACCTCGCTCACTTCCCACCTCCGGCGCATAGCAGGGCAAAGGATTGCAGCCCCCTCACCTGTGCATCGCACTGGGCTGCGGCGCCAACAGCTCGGCCCGCACTCTCAATTCGGTCGTCGGCTCGACCATCAGGCTGGCTGGCGGCAGCGGCGGCTGCGGACAGCTCGGCGGTTGCGACGGTCGCTTGCCAGCGCTGGTGCAGGCGCTGGTTGCCAGCGCGAAGATCAGCGATAAGGCGATCAGAGGCTTTCTGTGCATCGTCCTTTTCCTTTTCATATGTAGCGGCCAAAGCGTTCGCAGCGGCTGCGCTGTCACGTTCGGCCGCCAGCGTGGCGGCCGTTGCATTCGCCTCGGCGCGGGCCGCATCACGTTCGCGCTCCATCACGTCACGGGCGGCAGCTGCCTGGTCGGTCGCGCGGTGCGCGATAGAAACCGACCCTCGCTGCCAGACAACCACGCCCAGCAGCAGAAGAGTGGCGAGGATGAGGGCACGGATCATGCGGACACCGCCGGGTCTTCAGGCGGAATGACTGCACCGAGCCTACGCAGCGTGGATTCAAGCTGCCGGACTCGCATCCGCAATGCGCTGGCCTCCTCCTGTGCCTTGAGCCGCATCATCATCTCTTGCTGCAGCCGCTCGTCCTGGACCGAGACCCTCTGTTCCAGCGACCCGATACGCTCGGAAAGACCCTTGATGAGATCCACGTTCGCATCAGTCTCGGTGCGCTCCTTCTTGCGCGACAGAAGTGCAGCCCAGGTCTCGCGCAGAATCCACACGGCTACGACGCTGCCCGCAGCCCACCAGGGAGCCGTGGCGGTGACACCGCCGCCGACCATCAGCTGAGCGCCTCGGCGACACCAGCACTGACCACATCGGCATTCCAGTACACGCCACCGTTCTCGTGCTTTGCGATGGCCGTCGCGAGACGCCCCAGCGTCACCGGGTTATCCAACCGGATGACTTCGGAAGGTGCGACGCCCACCGCCGCCGCAACCTGCCGGACGTAGGCGCCCGTATCGTTCTCCACCGGTGGTGCCCAGCGTCCGATGATCTCCTTCACGGTGCGGAGGCCGTGCTTGCGCTGATAGGTGAGCAGTGTCTTGCCCAGCGCCCTGAAACCCGCCTGCGGGGTCAGGAAGACGCAGAAGCGAGCCTCGCGGGCGATGGCAGCTGCCGACCGATCTTCGCCCTGCCAAGGCGTGCTGGTGCGGTCGATGTTGCCAGGATTGTTGTTGCGTACGCCGCGCGGCGTGCTGGTGGTGCCCATGCGATCCCCCGTTGTCGCTGTGGAAGAACCGGCACCGCTCACGCCACCCGGGCTTCTGTGAGCGGTGCCGGCCAGTTTGGTTACGCCTTGGTGGCGTTGCCGGGCGAAAGCCGGACCTCGGCGGTCGTCTGGCCGCTGGAGCCGGCGGCCCAGGCGAACGCTGCGCCGGTGATGTCGCCGGCCACGACAGTCGCGGAACTACCATCGACTGCCTTGGCGCTTGCGCTCCACACGAGCTTTTCACCCTGCTCGAAGACCGCCGTCGGCACCTTCGGCAGCGTGAACACGCCGCCCAGCGCTACGCTGCCCGTCGCGCCGGCGGCGATGTTGACCAGGGCAACCCCCAGCTGATGACCGACGACAACCGCCTGACCCGATGCAACCTGCTGTTCGGTGGTGTTAGTCCAGGGGATCACGTCCCCATCGGATACGAAGTTCTGTGCCATGTCTCAGTGCTCCAGTTGGGGATCAGCCGCAGCGCTGCACGCCGCGATAGTCGAGGGCGGCAATGCCGAAATCGAGGCGGGCCTTCCAGCGCACACCGTCGACGGTGAAGCCTTCCTCGTAGTCCAGGAAGGGTTCGGTGATCCCATCAAGGAATGCGACCTCGATGGCCGGACAGTCGTTCGGATCGGCGAACAGGTACCACTTGTCGTCCTTGATGCGCGCGGTGTCGACGATGTCGCGGAAGAGACCCTGCACCGCGTTCGGGCGCTGCAGCTTTCCTTCAGCGTCCGGGTCGTACTCGGCCTTGTTGGTGACACGCGCGGCACTGCCGTATTTGGTCGGACCGAGCCAGAGTGCCGGCGACAGATCCAGCACATCATTCCCGCCGACGTCCTTCTGCTGGGCCAGCTGGACGCGCATCGCGTCGACCGAGGTGACGCTCGGCACTGCTGCCGCCAGGATGTTGCCGTGGTCGGCGTGGAACAGCGCCTTGTTGGAATCCAGCTTCGGATTGCTGGCGAGGAACGCATACGCGTCGGCCTCAATGGTCCGCTTTGCGGCACGACCGAAGGCGGTCGCCAAGCCGAGGAACGCGCCCAGGTCGTCGTTGATGATCGCCTGACGCGTCAGGTTGATGGTGTTGCCCTTGGTGCCAGCGGTGATGGTTGCCTTCTCGCCGTCCGGGATCTTCTTGTTCTTGAACTCGCCAGCCTCGGTCAGCTTGTCCAGGTTGCCAATGCTGCCCACGCGGTAGCGCGAATGCTCGCGGAAGTCACTGACGGTGCCGGTGACGCACCAGCGGGACCAGGTGTCCGGCGCAACGGCGTAGGCCGCCTGCAGCGCCTTGTGCATCGTGCTTTCGAGCAGCACCGGGAAGTCACTGCCGCTCTGCGTGAACGCGCGGCCGACCAGCTCCAGCTTCGCCATGCCATCGGTGCGTACGCCGCAGCGCTCCAGGCTACGACGGGCCAGGTCCATCAGGGTCAGGCCGCGCACTGGATTGTCACCGGTGAGCGCGAAGATCCGCTTGGTTGCGGGATCGATCACCTGGGCTCGGTGCAGCAGCGCATGCGTCACAGCGGAGCGCTGCAGATCCTGCTCGTCCTCGGTGACGCTGATGCGGTTGATGTTGCCGCCAGCAGCGGCGTCGCGCTGCTCCAGCGTGGTCAAGATCAGGCCACGCACGTGGTCGACCGAGTGACCAGCGCGAATCCAGCCAGCTGCATGCTCGGTCTGGCCGTGGCGGGTTGCCAGCTCCACGATGTCGGCTGCGCGCGTGTCACCTTCCGGAGCCTGAGCTGCGGCCGGCGCTGCCGGAGCCGGGGTATTGTTGATGGGTTCCTGCTGGACCGCCGATTCGGCGGCGCGGGCGGCGGGCTGAGGCATGGTGTGCTCCTGCGACGATGCGCTACGGGTGAATACACAGGGGGTCCCCTGTGCGGGTTGATTGCTGCGGGTACCTGCTGCCGGGTCGGCCGGCACAGTGACGAAGCTGATCTCGCTCGGCGTCCACTCGACCGCGCGGTAGATCGGCAAATCGCCGGGGTTGACGGCGCGCTCGATCTCATAGCGCTGCACGGTGTAGCCAACCGAGATATTGCGAATGATTCCGGCACCGATATCGGCGACTACGCCGGCCAGCTCCTCGCGACCGGAGAGACGGATAAGGGCGTGGCCTTCACCATTGGAGAGCCAGGCGCGATCAACCACTCCCATCTGTGAGCCGATACCCCAGGTGTTATGGCTGTCCAGGACCGGCGCAGCGCCAGACGACAGACGCTCCATGTTGCAGGCAGCCTCATCAACGACCAGCTCCTCCCAGTAGTACGTGTCATTCCACCAGTCGTAGCGGCGCACCCGGGTACCGGCGGTCCACTGGAGTTCGATCGTGCGTGCCTCGCTATCGAAGCTGGTCGGCTGCAGCTCGGCCTCACGCAACTGCGGGGGCATGAGACGTGTCGTACCGTCCTGCGTCGGAGCCTGGATTGGCTGGGGCATGGTCATTCCTCGTTGGTTGTTGAGGCGTCGACCAGGCCGGTCCGGGCGCCACTGGATTGAAGGAAAGTCATCAGACCGAGGGCGCCGGTCTCTTTCATCCGCTTGAAGTCCTTGCCCATCTCGACGTAGACCGCATCCGGGTCGTAGCCACGCCGACGCAGCGCTTCACTGGGCGAGTTGAGACCGGCGCCCATCGCTGCGATTTCCGATTCGATGTCCTGCTTGGGGTTGACGTAGTCCCAGCGCGGCGTGCTCCAGTCGGCAGTGCTTCCCGTGGAACGCACCCCACCGCCGAGCGCAGCTGCTTCGTCAAACCAGCGCCAGATCGGCTTACACATCTGCGGGACTAGCACCAGCCACTGCATCTGCTCGCAATCACGACGGAACTCCATCTGCCGGATGCGGGCACTAGAGAAGTTCACCTCACGCATATCACCGGTGGCCGACTCGTACGGGACACCGATGCCAGCAGTAATGATGTGCGCGTTAAACTTGCAATACTCGACGTAGCCCCCTGCCGGCTTTGGCTCGACAGTCTGGAAGGCTGTGGCACCAGTGATGTGGGTGACGCCACCGCTGGGCAGTGGCCCAAGGTCGGTGACCTGGTTGCGATCTGAGCCGAGCTGCGAAGGGCCGTCATCGTCCGCGTTGGACATCGAGTCGATGTCGCCACTGACGATCACACCAAGCCGCGCTTCCAGGTTCTTCCGCGCCAGCTCGGCGTCTTCGTACAGCATCAGGTCGCGCACTCGCGCGATCACCGGGGCGAAGCGCGTAATGCCGCGTCCCTGCCCGGGGCGGACGGGGTTGTACAGGTGGATGATGTCGGATGCCGGCACCAACGAACTGCTCAAGCGTACGGAGCCACGCACAGCCTCACCGGGATGCGCTCCGAATAACCAGTAGCCGCGAATCCGACCGATCGCGTCGTACTCAATGCCGTTGATGATCTGACCACCGCCCGACGCAGACCCGTTCTTGTTCCCGTCCAGCCAGTCGATCTCCAGTACCTGAAGCTGCAGCGGGACTGCGAGACCGTCCGACTGATGCCTGGTGCGGCGGCGAATCATGACCTCACCGTCCTGCTCCATCGCGCGATACGCGGTAGCCATCAGGCCGTAGATGTCCGACTTTCCATCAGCATCCGCCACATCGGCCCAGCGACCCCACAGGACGTCCAGCGCGGAAGCGTTTGGGCCTTCGGCCTTGGGAGTAATGCCGGTCCCGATCGTTGCGCTCACCAGCACCTGGAGGGACCGCGCGCAGTACGGAACGTTCTGCACCAGCGCCCGAGCTCGATTGCGCAGCTCGCGGGCGTCTGCCAGGTGATCGGTGTTCGCGCTGGCCCCCGCCCTACGAACACGCCAACCGTCAGTGCGCGAAGCGCCCTCGTAGGCACGTACCGCATCCAGCGTTGCCCTGGCGCGGTGACGCTTCAGGGCTGCCTGCGGAGAAATGGCGCCGATGACCCTGTCCAGCAGCGAGGCCGCCATGTCAGAAGCCCCTCAGCGTCGTGAAGCGGTAGCGGCGTGTGGCCGACTTCCGTCGGCCAGCCGTCGTGGCGGCAACCTCAGCTTCCATGCGGTCCAAAGCCGTCAACATGGCTTCGACGGACTGATACGTGATCTGACGATCACCATGCCGAACGGACAACTGGCCGCTGGCGATGGCGGCCTTCAGCCTCTGCACATCGTCTGTGGTCCAGCTCATCAGTGAGGCATCCGCGAGATCATGGATGCAAGTTTCCTGATCAAGTGCGGGTGAGTCTCGGGGAACTCACCCGCACCCCCTACTCATTGGCGCAAGGCTCATCAAGCAAGCGGTACAGCGTCCGTCTGTCAATCCTGAACCTCCTGCACAGCGACCGTACAGACTCCTGTTCCTGCATCCCTTTTCGGATCTCATCCACAGGGTAGGCACTGATTTGCATGCTGGCGGGGATGTACAGATCCTGTGCTGGGTACTCTTCGACAAGGTAGGCCACAACTGCCTCCACAACGCTGCGTATATCGTCGCTATCACATCGCAGGCGCAGCGCGGCGCCGACCGCGAGCTCCTCCGTCAGCTCACTGATTCGTACTTTGTTTCGGACTGTGTTCCTGCTCACCACTGCCTCCCCGTGCTGCGAGGCTGCGCGGGACGTTGCCGGCGAGGCACACTCGATGTTTCACGGGAATCAGTGGACGTGTCGGCCTCCGACGCAGCACCTTGCGTTTCGCGTGAAACGTTCACGCTTGGCGGCGCAGCCAGCCTTTGTTCCAAAAGGTCCCAATCGGAACGCGTGAAGCGGTTGATGCGGACCTCTGGGTGATGGGTCGCCGCGTAGGCATACACCCATGTGTCCAACGGCTCGTTTCGGGTCACCTTCTTCTCAAAGCGATTCTTGACAGGGTTGTAGACCTCCGACACCAGGCCTGGGAAGAACTCATCCGGCAGCTGGTCACTGAGGTGGACCATGCGATTCTCGGGCTTACGCTCGGCGTCAGCCGACAGACGGCTGTAGAGGTAGTGCTTGGCTGCAACGGTACCCACGTGGTTGATGGTGATGCCGCGCTTGTCAGTCTTGCCCTTCCAGGTGACGTCAGCCAGCTTGCCCTTGGACAGCACGGGAGCATTGTTGGGTACAGCACCAAAGATGCACATTGGTCGGGTGATGCGCCGCTGACGGACGTAGTTTTTGACGGCCTCGGTGCGGTGGCCACCAGCGTCGATGGCCACTGCCATCGGCCGGAGCAGCGCGCCATCTTCGCGCTCGATTGCGCGGTTGAGCAAATCGGTCAGGGCTACCCAAACCGCTTCCTCGGCCGGATCGCCCTGCAGCTCCACATAGTCCAGCGTCCATGCGGTCATACCCCGTCCCCAGCCGACGACATGAACAGCAAGACGGTTGTCCTGCGTATCCACACCTACGGTGATAGCCAGCACACCTTGCGGAGCCGAACGCAGCGCATAGGGCTCGGCACGATCTTTGATGACGTTGTGCTTCACTGCCCGCATTGCCGGGTCTTCCCACGTCTCGGCCAGGCGATCATTGACGAAGGTCTTTAGGGAAGCGGGATCGCCCTGCGCCTCCAGCCATTCCTTCACCAGGTCCAACCAGCGCGGCCCCAGGCCGAACTGGTAGTAGAGGCAGTTGATGGTGTAGCCGCGAATGGGCGAGTCAGGGTTGGCCGCGACCCAGCGCCCGTTGGCGATCATCTCGGTCTTGAAGTGTTCCTCGATGGCGACACCGCACTCGCAACAGGCGTACCACGCGTGGGTCTTATCGGGCGACCACACCAGGCCACTCCACTGCAGCGCCTGGTAATGGCCGCAGTGGGGGCACGGCACGTGATAGCGGCGCTGGTCGCTCTTGTCGTACAGCTTCGTGATCCGGCTGAGTCCGGCGATGCCAGGCGTGCTGATGTACTGGCGCTTGTAGGTGGTCGGGAAGGACGACGTGCGGCCGTCCAGCATCTTCACCGGATCGTCGCCGGTGGAGAGCTGCTGCGGCGCCTCATCGATCTCATCCACCTGCAGGTACTTCACCGTCGAGGACTTCAGGCGCTGCGGGCTACCCATGTGCTCCACGAACAGCTGGCCGCCAGCGAAGTCCTTGAACGTGCGCTGGTTCGCGCTGTCGCGGCTGGCGGTGCTGGTCAGCGCCTTCTTGACTGCTGCGCAGACCTCGATCATCGGGTTCAGCTTCTGGGCGATCCACTTGTTCATGGACACCTCACCCGGCAGCGCATACATCATCGGGCCCGGCGCATAGTCCATCCAGTAGGCCATGGAATTGGTCGCCAGCTGGCTCTTGCCGAACTGGATCGGGAACATGCAGACCTGGTCATGCACCGGGCTACGGGCGGACATGTTGTCCATCGGCTCACGCAGTGGCGGGTTGCGGTCCGTCACCCAGCGCCCGGGCTTGCTGCCGCTCTTGGTGGACAGGCGCATGTGTTCGTCGCACCACTGCGAAACGCTCATGGGCCGCCGCGGCTGCAGCGAGCGCGCCAGCACCGACGCCAGGCAGCTCTGTGCCTCCATCATTCCGCAGCCTCCGCTGCCTTGGCCGCCAACGTGCGGAAGCCCTGGCTGAGTTCTTCCAGGGCGTGGCTCACCTCATCCCAGACCAGCCGCCGGCAACCGGCCTCATCCAGCGTTGCGGCGAGCTGCGGCGCCAGCGTGTCGGCCAGGCGCTCCATCGCACCCCGGAACGTCGTTGCATGCTCAGCGAGGAATGCCTCCACGTCCGCACGCGGCAGCAGTAGCCCTAGCTCCTTCTGCAGCGCGATGTGGGCCATGTGCGCGTCGGTCTCTGCCTTGTCGGCCAGCGCCTTGGCCTTGCGCGCGGAGTCTGGGGTCTGGGGCCGACCTACCCGTGAAGGCTTGGCATCGTCGTCGTCGCCATCCTCTTCGTCGTCATCGACGTCGGCGTCTACAGCATCGCCCCCCTCCCCGCTCCCCACCAGCGCGCTACCGCGCTCATCTGCGTGGCGCTGGGCGACACCGGCATAGACCGGGTCTGCGGTGCGAGCGTAGAGCTCCAGGGAGGCGGCCTTCAGGAATCCCTTGCCGCCCTCACCGACCACCACCCTGCCCTTCTTCCTCAGTTCTACCACGTAGGACGGCTTACAGCCGATCAGCGAGGCCAGCTCTTTGCCAGTGATCGTCACGTCTTCCTCAGCCATTGATTCCCCCTACTCCATTTCCTTCGAAGATCGTTAAAGCGGAAAAACGCGCGCGCGTGAGCATGTGCGGGCTGTGCGGTGGCGTGTGCGGGATGCGATAGCCGCCGAATCGACGTGGCACAAGGCGTGTGCTGTGTGTGCGGGATGTGCGGTCACCCACATACGCACGCGAGACGCATTGCGGTTTTGCAGCGCGATACCCGTTCGCACCCGCGCCCGCCCATGTAGGCCGATGCCCGCACGTCCCGCACACTCCTACTGCCGCAAGCGATTCACGGCAATTCAATGCCCGCACATCTGCCCGCACGTCCCGCACTCCCCGCACGTCAATGGGCATAGTGATCACGCACGCCCCTTGTAGTCGGAGTACATGCGGCGGAAGGACACGACCTGGTCGCCCAGCCATGCTGCCTCTGTCTTCCCGTCAGGCACCGTGCAATCGCCGAGCATCAGGAAGCCATGTGGCCCGTTCACGGTCTGCTCGATCTGGTAGCGCTTCCGGGCCCGATCGGGGTGGATGATCTGGCGCTTGCGCACCAGTGCGTTGATGAACTTTGGTGACGGTGCCGGGCGCGGCAAGCCCTCACGCGCGCACCAGGCCTTGTAGACCTCGTACCACTCTTTCGAGAGCGCCGGCATGGGCTTCAGCCCGGGAATGTCATCGCCGTAGAGCTCGTCCAGGAAGCGCTGCGGGCTGTCCTGGCTCAAGCCGATCAGCTCTTCCTTCGCATGGGTCATCGGCGGATGGGTGCCGTTGGTGAACCCGGTCAGGTCAACCTGCAGCAGGTAGTGGTGCAACGCTGCCGTTGCGCCGTTGCGGATATCGGCCAGCACCTCGGTGTAGAACTCCAGGCTCAGCTTCTCCGGCGTCCAGATCACCGCGTGGCGCCGGTCGTCCTCTTCCAGAACGACCGGCATCGCTTCATTCGAGAGGAACACCAGGTTCGCGTGATTGTCCTCCTCGTAGGCCTGGATGTTCTTCGGGTTGATGCGGATGCGGTCGCCCGTGATCAGCGCCTTGAGCTTGTTCTTGAGGTGGTACACCTCGGTGCGTGCAACCACTTCATCGGCCAGCAGGAACAGCTTACGGCTTGCCCAGTCGTTGAATTTGTCTTCCAGCGCCGCCTGGTCAAGCACGCGACCGTAGTCACCGTAGAGCTTCATGTACTCATCGAAGAACATGTTCTTGCCGGTGCCCTGCGGACCATGAATGACGATGGTCGATTTCATCTTGGCGCCAGGATGCTGCAGCGGGTACGCGAGCCACTTGACCACCCAGTCGTACAGCGCCTTCTGGTTGGCCTCGTTTCCGCACATGTGCCAGAGCAGCTGCAACAGCCGGTCGCAGTTGCCCTCCTGCGGTACGGTCGGCCACCCGGCGAAGAGATTGCACGTCACCCCGGGTTTCTCGCACGACGGGTCAAAGTCCACTTCCCGCACACGCACGATGGACCGATCCGAGTGCTCCATCCACGCCCGATGCAGTTCCTTGCGCACGCAGGCATCGCGCATGTCGCCCAAGGCAACCAGCATGTGCTCTTTGTGATCGAACACCGTGCCGCCCTGACCATAAACAAGCGCGAAGCGCTCAAGCAACTCGCTCAACGAGTGGATCGGTGCCAGGCGATCATTCCCCGCGCCCCCATCGCTGGTGATGGAAGGCGCGCGTTTTTCTGCAGGCACCCGCCATGAAAGCTCCGTGAGGCGAGCCTCGACCTGCGCCCGCACGACGTGAAGGCCCTCTTGGGCGTGCAAATCATTGAAGTCGCTGACCTTACGTCCGCTGTCGATGAAGCGCTCACGCCTGGCAGACTCATCGGCAAATACTGGGTGCAGCACCGCTCCGCCCACGTCCAGTGCTGCGGCCTCGGCACCGAGCAGGCCGGCATTCGACGCGCTATGCGGCTGCGCGCACGATGGGCAGAACTGCGGATGGTCGGCTAGCACCAGGCGGCTCTTGCAGTGCCGGCACTTCTGCAGCACATCGTCGTCGGCGCACAGCAGCATCTTGATGCTGCGATAGCGTTTCGCCAGAGCCGAGGCGACGGCCAGCATGTTGCCAGCATCGAATGCCACGGCTACCGGATAACCCGTCGCCATGTGCAGCGTGGCCGCAGTGGCATAGCCCTCGGCCACCAGCAAGATCCACTGCGGGCTTCCGCCGATCAGGTGGAAGTGGCCCTTCTTGACCATGCCTGCCGGCCAGTACTCCTTGGCCGGCTTGCGTCCTGCGGCGGCCAGCTTCGCGCTGCGTAGCACCTGCAGGCCATGCACCTGGCCGTTCACATCGAGCAACGGCACAAGTGCGGCACCGGTGGTGCCGTAGCGCAGACCGAACCCTTGCACGCCCTTGCTGACCAGGTAGTCAGCCTCGCCGACTGCATTCGCCTTTGCCCAGGCCGACGATGCACGCTCGGCCGCCCGCTTCGCCTGGGACTGGCGAGCAGACTCGGCCCTGCGACGATCCTCGGCCAGCCGGTTGCGCAGCGCTTCGCGCTGTTCATCGGAGAAGGTCTTATCGCGCTTGCGCAGATCAACCTTGGTCGCGCCGTTCTCGTTTCCGTGCCAGACGCCGTAGGTGCCGACGACTAGCACTTCGCCGGCCGAGGTGTTCAGTTCGTGGAGCGCGTACCAGCCCCGGCGCTCGCGTGAACCTTCGACGCGGCACCGGACCATGCGCCCGGTGGTGTCCAGTTCGGTGACCAGCAGGCCGGCAGACTGCAGCTGCTGCAGCACATCCCCATAGTTCTCAGACAT
Protein sequences of DBSCAN-SWA_3 >LR134301|1901755:1922998|1915573_1917064_-|VEE52195.1|capsid|DBSCAN-SWA MAASLLDRVIGAISPQAALKRHRARATLDAVRAYEGASRTDGWRVRRAGASANTDHLADARELRNRARALVQNVPYCARSLQVLVSATIGTGITPKAEGPNASALDVLWGRWADVADADGKSDIYGLMATAYRAMEQDGEVMIRRRTRHQSDGLAVPLQLQVLEIDWLDGNKNGSASGGGQIINGIEYDAIGRIRGYWLFGAHPGEAVRGSVRLSSSLVPASDIIHLYNPVRPGQGRGITRFAPVIARVRDLMLYEDAELARKNLEARLGVIVSGDIDSMSNADDDGPSQLGSDRNQVTDLGPLPSGGVTHITGATAFQTVEPKPAGGYVEYCKFNAHIITAGIGVPYESATGDMREVNFSSARIRQMEFRRDCEQMQWLVLVPQMCKPIWRWFDEAAALGGGVRSTGSTADWSTPRWDYVNPKQDIESEIAAMGAGLNSPSEALRRRGYDPDAVYVEMGKDFKRMKETGALGLMTFLQSSGARTGLVDASTTNEE >LR134301|1901755:1922998|1901755_1905361_-|VEE52183.1|DBSCAN-SWA MTILATDIKLRQSQRLTDNPDGGGRMVQTEIIDGAMNNLFPDIGDEERTTGRTTLRKMFVHVDTPAPDVLKDAIAVLIDPPADPRVTVTMFATGSYSDVRLDAKNRVESYITRGTESRFVLLGNHFSGQMTIQVYAMKDAPSPDINDNFSLLTLASSGHDPAEQYVRVKGVLSRTTRTFTDDQGAFERDVLVIELVNALLRDFYGQEVVRYSATKPATRIYETNVVEATSYHSVKRLTAAGKPGDLAVQVDTPYVSIVPTSTAETPVSDVLAGMGTISQVPSGPAGSLGQNFSASFAAGVPVSRYLGTGLVVGAVRVVAGSVELTDDGAGGLASAVATPWSGTVDYQGGVVALTHATGVGSTSISITASPAGSIPMQGFTDEIEVTQNNQGMVWLFQLTPLPAPGTVVVDYRALGRWIRLTDNGRGRLIGKPGQGTGTINYQTGSVVVTAGALPDLKSSIISNWGTPIIAEARVGDTAILPPALRFVLGEGSAVPATVQLTLRVGGSNVAVTDNGNGGLLIGGQVRGQISYSTGEISLRPLSLPDADSQLSIAYDWGQPLHAAPQPVPDAAGIVSFTLPQGPVRQGTVLLDWLVSVRRDRDDLSSAPQAMRVIAKDDGAGNLVGVSVGDTAFSTVLGSVNYSTGAVSLQVGKFMVRQVSYPVYEIRSGRLKVVGYERLDVLAQFSAGSIVSAGWMLAGEAAQSAQEVMPLPAVQLQLTPTISDSIVPGSVRFAFRGRTYVDRSGGLYHSIDPTTGSGIYAGTIDYAAGVVNLVQWLAGGENTVQIQSLLTRIADPGVAVSFFRAPGSPLRPGMFTLRATRIDGELLTVTADINGVLSASEIRGKVDWESGVVKVQFGQLVPVAGNEGKPWFDPDQVEGDQVWRPTLVLPGTIYMGAVVYRSIPLSEVVIGLSSVRLPSDGRAPAFKPGQTVLIHHTAKHVVPSPQAGQLVTFGRGRIAGIEVRDAAGRPVDAAWFTADLDVGNLRFSDPLNLAAYTLPLTISQRVEDRRLVVQPQITGEIEINTSLTHDYPVGESMISTALRLGEANGSLDLQARVVSLFDQAAWTNVWADSPSGSVAPGTYNDTDYPLAVTNKDAITERWAVRFTSATQFEVIGETVGTISAGNTTTDLAPINPRTGQPYFVMKKEGWGTGWSTNNAVRFNTIGGLAPVWLMRTTLPGTPEGASDSTRFQVIGNIAGEGA >LR134301|1901755:1922998|1911912_1912116_-|VEE52190.1|DBSCAN-SWA MHRKPLIALSLIFALATSACTSAGKRPSQPPSCPQPPLPPASLMVEPTTELRVRAELLAPQPSAMHR >LR134301|1901755:1922998|1909522_1910173_-|VEE52186.1|DBSCAN-SWA MASNRNNNLLKYYVSGRRAKGFHGLTDLAGEVLNRYDLSVQRAFIGLQRRAGPATAQEVRGSYNIRASALRGKYRVETGERGYSTGKRGRDDFLSIWASTRQISLIEFGGRWAGRKSRGATAGIGLGESKTYDGAFIATIKGRRAIRVRGWDRAQQKRHGRGPVRILRGPSPFEMLSGADGNSRALEARRRLIERFHTTYLTELRRQWRVNGSSNG >LR134301|1901755:1922998|1913594_1915571_-|VEE52194.1|DBSCAN-SWA MPQPIQAPTQDGTTRLMPPQLREAELQPTSFDSEARTIELQWTAGTRVRRYDWWNDTYYWEELVVDEAACNMERLSSGAAPVLDSHNTWGIGSQMGVVDRAWLSNGEGHALIRLSGREELAGVVADIGAGIIRNISVGYTVQRYEIERAVNPGDLPIYRAVEWTPSEISFVTVPADPAAGTRSNQPAQGTPCVFTRSASSQEHTMPQPAARAAESAVQQEPINNTPAPAAPAAAQAPEGDTRAADIVELATRHGQTEHAAGWIRAGHSVDHVRGLILTTLEQRDAAAGGNINRISVTEDEQDLQRSAVTHALLHRAQVIDPATKRIFALTGDNPVRGLTLMDLARRSLERCGVRTDGMAKLELVGRAFTQSGSDFPVLLESTMHKALQAAYAVAPDTWSRWCVTGTVSDFREHSRYRVGSIGNLDKLTEAGEFKNKKIPDGEKATITAGTKGNTINLTRQAIINDDLGAFLGLATAFGRAAKRTIEADAYAFLASNPKLDSNKALFHADHGNILAAAVPSVTSVDAMRVQLAQQKDVGGNDVLDLSPALWLGPTKYGSAARVTNKAEYDPDAEGKLQRPNAVQGLFRDIVDTARIKDDKWYLFADPNDCPAIEVAFLDGITEPFLDYEEGFTVDGVRWKARLDFGIAALDYRGVQRCG >LR134301|1901755:1922998|1912353_1912695_-|VEE52191.1|DBSCAN-SWA MVGGGVTATAPWWAAGSVVAVWILRETWAALLSRKKERTETDANVDLIKGLSERIGSLEQRVSVQDERLQQEMMMRLKAQEEASALRMRVRQLESTLRRLGAVIPPEDPAVSA >LR134301|1901755:1922998|1917360_1917678_-|VEE52197.1|DBSCAN-SWA MSRNTVRNKVRISELTEELAVGAALRLRCDSDDIRSVVEAVVAYLVEEYPAQDLYIPASMQISAYPVDEIRKGMQEQESVRSLCRRFRIDRRTLYRLLDEPCANE >LR134301|1901755:1922998|1905364_1905781_-|VEE52184.1|DBSCAN-SWA MTRIVLAGIELPADLQWTDEFTAWKVGQQARTSLTGALIVQESARQAGRPITLQTTRDGTAYVGVVSLPVLRALQASESEARLAPLDLVMPAHNSGERSFQVRWRRTDGPAIEVDPTRFAVPALDADLFSITLRLMTV >LR134301|1901755:1922998|1919596_1920241_-|VEE52199.1|DBSCAN-SWA MAEEDVTITGKELASLIGCKPSYVVELRKKGRVVVGEGGKGFLKAASLELYARTADPVYAGVAQRHADERGSALVGSGEGGDAVDADVDDDEEDGDDDDAKPSRVGRPQTPDSARKAKALADKAETDAHMAHIALQKELGLLLPRADVEAFLAEHATTFRGAMERLADTLAPQLAATLDEAGCRRLVWDEVSHALEELSQGFRTLAAKAAEAAE >LR134301|1901755:1922998|1917674_1919600_-|VEE52198.1|terminase|DBSCAN-SWA MMEAQSCLASVLARSLQPRRPMSVSQWCDEHMRLSTKSGSKPGRWVTDRNPPLREPMDNMSARSPVHDQVCMFPIQFGKSQLATNSMAYWMDYAPGPMMYALPGEVSMNKWIAQKLNPMIEVCAAVKKALTSTASRDSANQRTFKDFAGGQLFVEHMGSPQRLKSSTVKYLQVDEIDEAPQQLSTGDDPVKMLDGRTSSFPTTYKRQYISTPGIAGLSRITKLYDKSDQRRYHVPCPHCGHYQALQWSGLVWSPDKTHAWYACCECGVAIEEHFKTEMIANGRWVAANPDSPIRGYTINCLYYQFGLGPRWLDLVKEWLEAQGDPASLKTFVNDRLAETWEDPAMRAVKHNVIKDRAEPYALRSAPQGVLAITVGVDTQDNRLAVHVVGWGRGMTAWTLDYVELQGDPAEEAVWVALTDLLNRAIEREDGALLRPMAVAIDAGGHRTEAVKNYVRQRRITRPMCIFGAVPNNAPVLSKGKLADVTWKGKTDKRGITINHVGTVAAKHYLYSRLSADAERKPENRMVHLSDQLPDEFFPGLVSEVYNPVKNRFEKKVTRNEPLDTWVYAYAATHHPEVRINRFTRSDWDLLEQRLAAPPSVNVSRETQGAASEADTSTDSRETSSVPRRQRPAQPRSTGRQW >LR134301|1901755:1922998|1905780_1909530_-|VEE52185.1|DBSCAN-SWA MADRLEEAIRVVIETQGREGVDELRAAFGDLGDVSVETAGKATKLLDSLTGLNEAAAKADAFDGMLTELAQLEQQFDDNQKAALALSLSIGEMDKPSREVLATQRDLRKEGERLKKALNEQWEAVGKADDELSSLGVNTANLADHQQRLRVEVARSAAALTEQARAAAAEAEAGRRRKQQIDEAETAFRKQANTSRAAAKSLAEYRERADDATAGSADLASATESTVSWFGKLKAVAAGALAFVGLNRVVDGIKAIVKEGSDAEQELSQLEAALQATGRSGEFTAQSLAAMRKQLQSGLFDDGQISAAQVRLLSYTNIVGEQFPAAMQITIDQAQRLGMSLEGSAEVVGKALQTPSKAMESLSKQGFTLDDSQKALIKSLEATGQVAKAQAIILDLLAESYGGAAAAAKVGTIAGLWKTATDRFKDWKQEVADQGVLTYFKEQLTTLLTTLDRLAADGSLSRWAKQTAQAIITMAEAVKGTTQWVVDHARVIGLMAAAYAQFKIVGALLQLNAWRAALIATTNAQLANNAAVASGSRGIGRFGALLRGLPKAVPITVAVLGLEAAMGGLDVLKTVAQDIWKQHDPALKRAGEAQRAYISQVRDSALQLREQAVSFVAYRDVVIKSAEEVAKLGEAERQAYEKRLSGLEQYLTAQEGFLLMQQKAGVATAEQLQQLGLVTQKLLEVSTGFASLRGGVQTAADALMNGIGPAAQLVVQQLAGIDGNAKLAKESIGKVFEGLNFADTASLEAVGAALGYIASQGAAAERNVRDGLLETLKKLSGDELLRFQASAQSAFEAMPQGATNAAAVLQTTLVAAMERLGVSASRMGVGFSAGGKDAIAAFGAVAESAIATGVQIETAFKAALGKVATLDEARTLGALLESAGRQGKVGFDAAARSAAALNARIREIQVAIDPLADEFARLGIQSQASLNATRDAAKSAFDAIRDGAARGKASVEDVRRAFRAYADATRAAAADSDQWRRDNVDSQLAVQESIYDTERSMQRLGDVSDVAMRQLQDGASRSRERLEEVRESAGGAADQVDRVGSSSERMGKQMGQAGAAAQGMAFSIGEVSESALQAMRKLSGPNPLVQFANALNRVTDQRKQLAEYKAELQATAEAEDELSKAAKERLAGQFDYVGKGEIAEVAQLEAQVMRQRQQRDQEAAAALAERRKQAEAEAEAQARADAARIGSNGSNEQVIVIDWKLPSKEVVAGATAAEVQQAQRLAGLVAPLVLRQVQQSRAVSVRGRS >LR134301|1901755:1922998|1917065_1917281_-|VEE52196.1|DBSCAN-SWA MSWTTDDVQRLKAAIASGQLSVRHGDRQITYQSVEAMLTALDRMEAEVAATTAGRRKSATRRYRFTTLRGF >LR134301|1901755:1922998|1910244_1911045_-|VEE52187.1|DBSCAN-SWA MAQPKVRKFAGDLRFWEHGAEGVRVPVIPEPADKFGNQPLEQSSLTFSYEAGDSVEIKSKRRDARYQQIIHKDSNPGVTNVSITALEVPTAFLARMLYGTLVNTTVAAGSASAVSVTVGSVDTPVKLPHNFIEATPAPAFKKGAVDLVKGTDYDLEPRHGLLIPKKGGALQAGDVVTADYSFDAYLETAISGGTTPSKSFQVLGDMQDRISGDEGLLSIPNVDLTVDGDVDWFSDEPIQVTLTGPVIFQAGEADLYTFKIAAQSAG >LR134301|1901755:1922998|1920583_1922998_-|VEE52200.1|DBSCAN-SWA MSENYGDVLQQLQSAGLLVTELDTTGRMVRCRVEGSRERRGWYALHELNTSAGEVLVVGTYGVWHGNENGATKVDLRKRDKTFSDEQREALRNRLAEDRRRAESARQSQAKRAAERASSAWAKANAVGEADYLVSKGVQGFGLRYGTTGAALVPLLDVNGQVHGLQVLRSAKLAAAGRKPAKEYWPAGMVKKGHFHLIGGSPQWILLVAEGYATAATLHMATGYPVAVAFDAGNMLAVASALAKRYRSIKMLLCADDDVLQKCRHCKSRLVLADHPQFCPSCAQPHSASNAGLLGAEAAALDVGGAVLHPVFADESARRERFIDSGRKVSDFNDLHAQEGLHVVRAQVEARLTELSWRVPAEKRAPSITSDGGAGNDRLAPIHSLSELLERFALVYGQGGTVFDHKEHMLVALGDMRDACVRKELHRAWMEHSDRSIVRVREVDFDPSCEKPGVTCNLFAGWPTVPQEGNCDRLLQLLWHMCGNEANQKALYDWVVKWLAYPLQHPGAKMKSTIVIHGPQGTGKNMFFDEYMKLYGDYGRVLDQAALEDKFNDWASRKLFLLADEVVARTEVYHLKNKLKALITGDRIRINPKNIQAYEEDNHANLVFLSNEAMPVVLEEDDRRHAVIWTPEKLSLEFYTEVLADIRNGATAALHHYLLQVDLTGFTNGTHPPMTHAKEELIGLSQDSPQRFLDELYGDDIPGLKPMPALSKEWYEVYKAWCAREGLPRPAPSPKFINALVRKRQIIHPDRARKRYQIEQTVNGPHGFLMLGDCTVPDGKTEAAWLGDQVVSFRRMYSDYKGRA >LR134301|1901755:1922998|1911533_1911869_-|VEE52189.1|DBSCAN-SWA MSEVDFLRDLDGTLHAAFALAGMASRGRYTAKDGPATEDVRAYVERDVETIGELRQFRAGRVEIAYLRSDVAPDQGDRFEVVSSAFGTEVFVNSKKISDDGSQSRWLVSRG >LR134301|1901755:1922998|1912694_1913126_-|VEE52192.1|capsid|DBSCAN-SWA MGTTSTPRGVRNNNPGNIDRTSTPWQGEDRSAAAIAREARFCVFLTPQAGFRALGKTLLTYQRKHGLRTVKEIIGRWAPPVENDTGAYVRQVAAAVGVAPSEVIRLDNPVTLGRLATAIAKHENGGVYWNADVVSAGVAEALS >LR134301|1901755:1922998|1913206_1913572_-|VEE52193.1|DBSCAN-SWA MAQNFVSDGDVIPWTNTTEQQVASGQAVVVGHQLGVALVNIAAGATGSVALGGVFTLPKVPTAVFEQGEKLVWSASAKAVDGSSATVVAGDITGAAFAWAAGSSGQTTAEVRLSPGNATKA >LR134301|1901755:1922998|1911058_1911541_-|VEE52188.1|DBSCAN-SWA MDDLAEPLSWQLVEFLAARVRLISRSSGFRTDIGAGAVIIDETEISEDSTEPATIISVRQLSRSGGGVAQSSSDAAITIEFEVPRGSDEANPRLLVHRARHDLIRALTFKEKSLPLGVTSFELIETQLATLEDDAGHTAVVAQITARAGLTETFEPVSNP |
18 | Stenotrophomonas_phage(100.0%) | terminase,capsid | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1927636 : 1933682
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >LR134301|1927636:1933682|DBSCAN-SWA CATGAGCCGTTCTGTTGTCATTTACGGGCCGCAGCGCTGCGGCAAGACCACCAACGCGCAGGATCTGCGCGCACACTTCGGCCTGCAGAAGGTCCTGGACGACTGGGACGGACACACTGCGTATCCGTTGGAGGACACGCTCGTCCTGACCAACAATCCGCACGCGATCGCGCACCATTCGTCGCGCGTCCTCGATCACGGCTGCGCTATGCGCGACATGTGCACGGGAGCGCAGGCATGAGCGCCCGTCCACAGCAGACCGGCCGCGCTGCCGAAGTGCGCAGGGTCCTGTCTATGTTCCCGCAAGGCGCCACGGTCGAGCAGATCAAGACCGCTGGCCGCATCAACAGCACCCACCAGGCCATCGGCTACACGCTGAAGGGGCTGGCGCGCAGCGGCCAGGCCATCTGCCACCGCTCCGGCGTGCGTGGCATCTGGCGCCTCTCCAGCCACACGCAACATGCGATCGCTCCGCTGCGCGCTGCACCTACCCGGGTGCAGCCGACCTGCACGCCAGGCCCGCTTACAGGTGTCAGTGACGCGGCGACCACGATCCGACACCGGGAACTCGACCGGCAGCAGTTGGCCGACGACCTGGACGCTTTCCTCGCAGCGGGCGGGCTGATCGAGGTGCTGGGGCACACCCCGCTTCGCCCGTTGATGAACCGTCACGTAGCCAACCACGGCAGCTATGCAGAGCGCACGGCAGCCCATGACATCGAATGAGGCCGCCATGAGCATCGAATCGCACACAGCGGCCGCCACTGAACCCGGCAGGCCGGGTAGCACCTATTCCGATGGCCCAGCATGGCATGCCTTCGGCCTCAGTCGCGCCGCCTACCTGGTGCTGCCGCGCCGCACCTTGCAATCCATGCCCTTGGATTGGCAAGCGCGTTTTGTCGCGCTGATCCAAGAGGCGCGCCAGGCATTGCCTGATGAGGCTTTCCCACAGTACCAGGCTGTCCTACTCAGAGATGGGCGGTACGCCGCCGATCCCAACTGCCGCTACCGCCGCATGCCGCCGTTTCCGCATCGCCCGACCGAGAACGCACCCGCCAAATGCGATGCGCCGCTCGCTGGCGCTTTCGTGAACACCTTCACTCAATTCGATCAGGCCACCCAATGACCGCGAATCTCACGGTTCTTCCAACGAACTGCCCCGTTCTGCGCGACGCATTCGAAACGATCAGTGCGATCGCTGTCGAGGCTGTGTGGCTGCCCAACCAGGCGAAGGCAATCACCCTCGCGCAGGCCCAGACCGCGCTGCGGGATCTGCACAACCGCCTGCCGCGCTTGCAGGATCTGCGCGTATTCGAAGCCGCTGTAACCGCCTACGTTTCGACTCTGCGCAGCAGCATGCAGGACGGCGACACGCCACTCTGCGACACCACCCGCGCCCGGCTGGCGCAGGCGACCGAGCTGCTGGAGCTGGTCAGGAATCAGACCAGGACCGTCGTCGATCCGGCGGACCCATGGCGCGGCCTGTATCACCCGAGTCGACTCCCGGCGCGCAATGCCGACGGCGAGATCCCGTGCCATCCGGACGTGCCGGCGTGGGCGGACGGTCGCGAGGTATCGCTGCGACCGTTGTTGCTTGCGCAGGGTTTCGACCTGCAGATTACGTTTGGCGACTTCACCGAAGAAGCCGTGGAGACCGGGGACCATCGCTACTGGGATGAAATGCGCGCGTGGCAGCCCACTGGCCCCGGCGCAGATTGGCGCTTGGTCTGGCTGGGTGATACCGAAGATGGCCCAGCGGCATGGTTCGTGCGGCCGCTTGCAGCTGAGGCGCTTGTCGCCTGGGAGGCAGCACATGGCTGACGGCTCCGGCGGTTTCCGCTTCCCGTTGCACGACCGCAAATCACGGCTGCGCTCGGACGAGATCGTAGTTGATCTGTTCGCAGGAGGAGGCGGCGCCAGCCACGCGATGGAGACCGCGCTGGCTCGGGCGGTCGACATTGCGATCAACCACAACCCCTGGGCTGTGGGGCTGCACTCCGCCAACCACCCGTTTACCCGCCACCTGTGCCAGGACGTGTGGGAAGCAGACCCGCGCGTCGAGTGCGGCGGTCGCCCTGTGGGCGCACTACATGCCAGCCCGGACTGCACGCACTTCAGCCAGGCCAAGGGCGGTCAGCCGCGCAGCCGAGCAACCCGCTCGCTTTCGTGGGTTGTGCCCCGCTGGGCCGGCACCGTGCGTCCGCGCATCATCACGCTGGAGAACGTCAAGCAGATCCTGAAGTGGGGCCCGCTGATCGCCAAGCGCGACAAGGCGACCGGCCGCGTGATCAAGCTGGACGGCACAGTCGCAGCTGTCGGCGAGCGCGTCCCGCTGGACCGGCAGTTCCTGATCCCAGACAAGAAGCGCGAGGGCAGCACCTGGCGTCGCTTCGTGGCTGTGCTGCGGGCGCTCGGGTATCAGGTGGAGTGGCGTGTGCTGCGGGCTTGCGATTACGGCGCAGGTACGACGAGGGAGCGGCTTTACATGGTCGCCCGCTGCGATGGTGAGGCGATCGTCTGGCCCGAACCGAGCCACGGTCCTGACCGCGCACAGCCGCACGTGTCGGCGGCATCCAGTATCGACTGGTCCATTCCATGCCCAAGCATCTTTGGCCGCAAGAAGCCGCTCGCCGCCGCGACTGAGGCGCGCATCTGTCGAGGCATCAAGCGCTTTGTGCTGGACACCGCCGAGCCGTTCATCGTCCACGCCACCCACGGCGGGGAACGCAGGCCGCACGGGATCGGTGAGCCGATGCCGACGATCACTGCCGCCAACCGTGGCGAGATGATGATCGTGTCGCCCACCATCGTGCAGTGCGCGAACGCGTCCGCCAACGGGGTCGCTTCCGGAGGCGATCCGCTGGGCACCATCACGGCTTGGCCGCGTGGCGGCTCACACGCTGTCGTGGCGCCGATGCTGGTGCAGGCCGCACATGGCGAAGGCAAGCCCGGAGGCGTCAAGCGGTGGGGCGCTGGGGTTCGACCCGCCGATGAACCTGCGGGCACAATTACCGCCAGCGGCAGCGGTGGCTATGCGCTCGCCGCAGCCTCCCTGGTCAAGTTCCGGGGTACTAGCGATGGAGCGGACGCAGGTCAGCCGATGCCGACCATCACCAGCGGCGCTGGCGCGGCGCGGCCGGCCGGTGCCGCCCATGCCATGGGCGTCATGGCGGCGTTTCTGGAACAGGCCAACGGCGGGTTCTACCAGGGCGCCGGCAGTGCAGCCGATGAGCCCATGCCGACCATCTGCGCCAACGGAAGCCACCAGCGCCTCACGACCGCGCACCTGGTTACGCTCCGCAGGAACCTGGACGGCCAGACAACGGCCGATCCGCTCAGCACCATTTGCGCTGGTGCAACTCACCACGGGGTGATCGAGTGCGTCCTGAGCCCTGATCAGGAAGCCGGCGCACTGCGCGTGGCCGCGTTCCTGATGCGCTACTACGGCACAGGCGGACAGCATGGAGCACTAGACGAGCCGCTCGCGACCATCACCACGAAGGACCGGTTGGCGCTGGTCACGGTGCATCTGAGCGGCGTGCCGTATGTGATCGTGGATATCGGGCTACGCATGCTCAAGCCACACGAGCTGTTCCGCGCGCAGGGTTTCCCGGCCAGCTACATCATCGATCGCACGCAGGATGGTCGACAGGTCAGCAACAGCCGCGCCGTTGCCATGGTTGGCAACAGCGTGAGCCCGCCGCCGCTTTGCGCAATTCTGAACGCAAATATCTGTTCGACCGCAACCCCCATTCTCGTAGACGGCCAGGAGAGATGCGGAAAGACTGCCAGCAAGGCCGGATCGAAATCCGGCGACAAGATGCGCGGGAGTGTTGCTGATGTGCAATGACGCGAACGAATCACCCCTAGGAGGGCTCTATCGTGTGATCAAGGCTGGGCGGATTGTGCATTCGGAATCTGATCGATTCATCGGATGCACGCACATTTGCATCTTTGACTCCGACGTCAACGAAGTTCGTCAGATCCTCGAAATCCGTCTCAGACCACGTGCAAAGGTCCTTCCTGAATTCATTGAGCTCCTTTGCGCAGCTGACGTAGTACACGCGATCGCCTTCCATGCGAACCATCTCGGATGGACGCGTTTTCTCGACTTTCCAAAGAGATCGCTCGAACGCTCTCAGAAGAGCAACGCACTTCGCCAGTCGCTTCTCCAGTTCGCTAGTAATGCTGTGCACGTGTGGAATCAGAGGACTAACACTGTCTGCCGATATCTGAGCTATCAGTTTCCTAACTTCTTTGTGCTGCCAATGGTCAACGAGGTCATTGGACAACTCGATGACCTTGGCAGCAACAAGGAGCCGGTGCTCCATAAGCGCCAGCTCACCCGCCGAAATTTTCGCGACCATCTTGGCATGCCGCTTACGATGACAGCGATCTCGAATTACTGGCGCCACCCCAACACCAAGAGCTACCACAGCAGCAAGCCCTGTTGTTACCGCCGCCCAATCTACAAAGCATGTGGAGCTCAGAGGCGAACACCGAATCAAACCGTCCCAAACACTCATTCACTCCCTCCCCTTTTGATTGAGCGGCATTCTGCCACGCCCATGTGCGTTCTGGAGGCGATCTGTGGGTCAACCATCTAGAACGCACATTCTGGACAAGCCAGATGACACCCTTGGTTTAGAGGATGCCGCCAGAATGCTACGGCTCGGGCTAGAGGCAATGAAAGACCTGGTGGACAAGGGCGAAGTGCCGGCAGTGCGCTTGAATCAGAAGCACACGGTCATGCTGCGCGAGGACCTGATCGAGTTCCTACGCTCGGAAGGGCGCAGGCAGGCCGCCGAGCGAAAGAAATCGATGATCGGAAACCGACCTGCAGCCAACACGCCTGAGTCAGGGTCGACAAGACGTGCAAGCCAGTCGCGTCGCACAAAGCTGCCCGATCTGCGCGCCTACGAGCAGGCCGATCACCAGAGCTGATCGGCCAAATCGGAAGCGCGGAGGTTGGCGTACCGCTTTAGCTGGCGCGGATCGCGATGCCCAGTGATGCTCGCGATCTTGATGTCTGTCAGCGTGGTCTTTTCGTACAGCCGGCTCGTCGCTTCGTGACGTAGATCGTGGAAGCCTAGATCCGCGCATCCGGCCGCGACGAAGATGCGCTCGAACTGGCGCGACAGCTTGCTCGATACACGCCGCAGGGCCAGCGGGGTACGCTCGCCAGCCCAGAACGGGAACAGCCGGCCTTCGTAATCACCCTCATACGCGGCGAGCTTAGCCAGCAGTACCGAGGTCATGGGTACCTGGCGCTTGCTCCCATTCTTTGTCTTGTCCAAGAAGATCGTGCGCCGCGCCACATCGAGCTGACTGCGCTCTAGCGTGTAGATCTCCCGCATGCGCATGGCCGTTTCCAGCGCCATGTCGAACATCAGAATCAAAGCCTCCCGCTGCGGCAGATCGAGCGGACGCTGCCGCCCCGGCGGCTTTGCGCCGGCTAGGATCTCGCGGATGCGTTCTTCCTCACCAGGCTCCAGACGGCGATCACGCTCCTGGTCAGTTTTCGCCTCACCGTCGATGCGCTTCACAGCTACTTTGTCGTCGGCCGTGTACGTCGAATAGCCCCGTGGCAGAAGCCGCAGTGGATTCATTGGCAGCGCGCCGTGCGCAGCCAGCCAGTCCAGGGCGCGCGACAGGGCACCCACGTAGTGCCGGATGGTCGAAGGCGCGAGGTTCTGCTCACGCTTCATGGTGGTGACCCACTCGGTCGCCCACGTAAAAGTCAGCTGCGGCAAGGTGATGCCGATTGGCAGCCGAGAGAGCAGGACGGGCAGCAGCTGCTCGTCATCGACCGAGATGTGCTGCGCGCTCCGATACTCGCTGACCTGGCTGCGTAGATCCTTCGCGGCTGCCTTGGTGTTGGCCAGCTCCTCCGGCACCACCCCACGGTCGAGCAGCGCCTCGAGGCGGCGCACGTACTCGTCGCCCTCTGCCTCCGAGGCGAAGCTCAGATAGACAGGCTGGGGCAGCAGCCCCGCCCGCTTGATCGTGTACTGCCAGGAGTCGCCCCGGCGACGCTTCGTTGCCAT
Protein sequences of DBSCAN-SWA_4 >LR134301|1927636:1933682|1931474_1932242_+|VEE52217.1|DBSCAN-SWA MCNDANESPLGGLYRVIKAGRIVHSESDRFIGCTHICIFDSDVNEVRQILEIRLRPRAKVLPEFIELLCAADVVHAIAFHANHLGWTRFLDFPKRSLERSQKSNALRQSLLQFASNAVHVWNQRTNTVCRYLSYQFPNFFVLPMVNEVIGQLDDLGSNKEPVLHKRQLTRRNFRDHLGMPLTMTAISNYWRHPNTKSYHSSKPCCYRRPIYKACGAQRRTPNQTVPNTHSLPPLLIERHSATPMCVLEAICGSTI >LR134301|1927636:1933682|1932297_1932579_+|VEE52218.1|DBSCAN-SWA MLRLGLEAMKDLVDKGEVPAVRLNQKHTVMLREDLIEFLRSEGRRQAAERKKSMIGNRPAANTPESGSTRRASQSRRTKLPDLRAYEQADHQS >LR134301|1927636:1933682|1928757_1929456_+|VEE52215.1|DBSCAN-SWA MTANLTVLPTNCPVLRDAFETISAIAVEAVWLPNQAKAITLAQAQTALRDLHNRLPRLQDLRVFEAAVTAYVSTLRSSMQDGDTPLCDTTRARLAQATELLELVRNQTRTVVDPADPWRGLYHPSRLPARNADGEIPCHPDVPAWADGREVSLRPLLLAQGFDLQITFGDFTEEAVETGDHRYWDEMRAWQPTGPGADWRLVWLGDTEDGPAAWFVRPLAAEALVAWEAAHG >LR134301|1927636:1933682|1929448_1931485_+|VEE52216.1|DBSCAN-SWA MADGSGGFRFPLHDRKSRLRSDEIVVDLFAGGGGASHAMETALARAVDIAINHNPWAVGLHSANHPFTRHLCQDVWEADPRVECGGRPVGALHASPDCTHFSQAKGGQPRSRATRSLSWVVPRWAGTVRPRIITLENVKQILKWGPLIAKRDKATGRVIKLDGTVAAVGERVPLDRQFLIPDKKREGSTWRRFVAVLRALGYQVEWRVLRACDYGAGTTRERLYMVARCDGEAIVWPEPSHGPDRAQPHVSAASSIDWSIPCPSIFGRKKPLAAATEARICRGIKRFVLDTAEPFIVHATHGGERRPHGIGEPMPTITAANRGEMMIVSPTIVQCANASANGVASGGDPLGTITAWPRGGSHAVVAPMLVQAAHGEGKPGGVKRWGAGVRPADEPAGTITASGSGGYALAAASLVKFRGTSDGADAGQPMPTITSGAGAARPAGAAHAMGVMAAFLEQANGGFYQGAGSAADEPMPTICANGSHQRLTTAHLVTLRRNLDGQTTADPLSTICAGATHHGVIECVLSPDQEAGALRVAAFLMRYYGTGGQHGALDEPLATITTKDRLALVTVHLSGVPYVIVDIGLRMLKPHELFRAQGFPASYIIDRTQDGRQVSNSRAVAMVGNSVSPPPLCAILNANICSTATPILVDGQERCGKTASKAGSKSGDKMRGSVADVQ >LR134301|1927636:1933682|1927636_1927876_+|VEE52212.1|DBSCAN-SWA MSRSVVIYGPQRCGKTTNAQDLRAHFGLQKVLDDWDGHTAYPLEDTLVLTNNPHAIAHHSSRVLDHGCAMRDMCTGAQA >LR134301|1927636:1933682|1928368_1928761_+|VEE52214.1|DBSCAN-SWA MSIESHTAAATEPGRPGSTYSDGPAWHAFGLSRAAYLVLPRRTLQSMPLDWQARFVALIQEARQALPDEAFPQYQAVLLRDGRYAADPNCRYRRMPPFPHRPTENAPAKCDAPLAGAFVNTFTQFDQATQ >LR134301|1927636:1933682|1927872_1928361_+|VEE52213.1|DBSCAN-SWA MSARPQQTGRAAEVRRVLSMFPQGATVEQIKTAGRINSTHQAIGYTLKGLARSGQAICHRSGVRGIWRLSSHTQHAIAPLRAAPTRVQPTCTPGPLTGVSDAATTIRHRELDRQQLADDLDAFLAAGGLIEVLGHTPLRPLMNRHVANHGSYAERTAAHDIE >LR134301|1927636:1933682|1932566_1933682_-|VEE52219.1|DBSCAN-SWA MATKRRRGDSWQYTIKRAGLLPQPVYLSFASEAEGDEYVRRLEALLDRGVVPEELANTKAAAKDLRSQVSEYRSAQHISVDDEQLLPVLLSRLPIGITLPQLTFTWATEWVTTMKREQNLAPSTIRHYVGALSRALDWLAAHGALPMNPLRLLPRGYSTYTADDKVAVKRIDGEAKTDQERDRRLEPGEEERIREILAGAKPPGRQRPLDLPQREALILMFDMALETAMRMREIYTLERSQLDVARRTIFLDKTKNGSKRQVPMTSVLLAKLAAYEGDYEGRLFPFWAGERTPLALRRVSSKLSRQFERIFVAAGCADLGFHDLRHEATSRLYEKTTLTDIKIASITGHRDPRQLKRYANLRASDLADQLW |
8 | Stenotrophomonas_phage(66.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2517645 : 2535719
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >LR134301|2517645:2535719|DBSCAN-SWA AATGCACGCACCGCACCCAGCGCAGGACATCGTTCCGGCGTTCCTGCAGGCCATGCACGCGCACGGCATCGTGCCGGATGCACGCGGCCGCGACGCGATCAACGCCGATGGCGCGCTGGTGCGCTTCCATGTGGAAGGCGACCGTCGTGGCACGCGCAATGGCTGGGCCGTATTGTTCGGAGACCACGTGCCGGCCGGTGAGTTCGGCAGCTGGCGCACCGGCAGCCGCCATGCCTGGTGTGCGAAGCCGGAAACCACACTGAGTGCTGCCGAACAGCGCACGATCCGGCAACGCCAGGAAGTCGCCCGTACCGAACGGGAACGGCAGCAGCGCGAACGTGAGGAGGCCGCAGCCAAGGCCGCCAATGTATTGTGGAACCGCGCCCTTCCCGCAGATGCGCACCATCCCTACCTGGTACGCAAGGGCATCCACGCGCATGCGCTGCGTGTGGCGGCATGGCCCGTGCGCAACAGCGACGGCCTGGTCTTCCGCTACATCGACAATGCCCTGCTGGTGCCGGTGATGAACGCCACGGGCCGGATCGTCTCGCTGCAGGCGATCTTTCCGCGCACCGATCCAGCCCTCGGCCGCGACAAGGACTTTCTCAGCGGAGGCCGCAAACAGGGTTGCTTCCACGTCATCGGCAAGCCGCTGCCTGGACAGCCGATTGCCATTGCGGAGGGTTACGCCACGGCGGCGTCAATCCACCAGGCCACGGGCGGGTGCGTGGTGGTGGCCTGGGACGCAGGCAATCTCGCTGCGGTCGCCCGTACCTGGCGCAGCGCGGTTCCCGATGCCTCCTTCGTCATCTGTGCCGACAACGACCAATGGACCCGGCAACCGTTGGACAATCCTGGTGTCACCCACGCCACTCGCGCCGCCGCAGAGATCGACGCACGTGTGGTGTGGCCCGAGTTCGCCGCGCTGCATGACGACGATGACCGCCCGACCGACTTCAACGATCTGCACCTGCGCGAAGGACTGGAGGCAGTGCGCACCCAGCTGCTCCCTCGGCCGCCTGCCATAGCCGAAGAAGACACTGCGGAAGAGGATGCGCCATCGTCGTCCAACGCACGCTACCAGGTGCCCGGCAACCTGTCCGCATTCGATGCCTTTACTCCGTTTCCAGACACCAGCGCACGCGGACGGCCGCTGCCGACCGCGCGCAATCTGGCCGAGCTGTGCCGGCGCACCGGCGTCACCGTGCGCTACAACGTCATCCGCAAGGATCTGGAGATCCTGGTCCCGGGGCTGCAGAGCACGGTCGACAACGCCAAGGAGGTGGCCGCCGGCGAAGTGATGGACTGCATGCACCGCGCCGGCATGGCCATCACCAGCTTCGAGACCAACCTGTGCCAGGTAGCCGAGGCCAATCCGTACAACCCCGTCGCAAGCTGGATCACCTCACGGCCCTGGGATGGCCAGTCCCGCCTGCAGGCGTTCTTCGACACCGTGCAGGAGGCCCAACCCACGCGCATGGGCGACGGACGCATCCTGAAGGAAGTGCTGATGCGGCGATGGCTGATCTCCGGTGTGGCTGCTGCGTTTGAACCGGACGGCGTAGTGGCGCGCGGTGTGCTGACCTTCGTTTCGAAGCAGAACCTGGGCAAGACACGCTGGGCACGGCAACTGGCACCGGCCGAGCTGCAGCTGATCGCCGATGGCGTGGTGCTCGATCCGGCCAACAAGGACAGCGTCAAACAGGTCATCTCCAAGTGGATCGTGGAGCTGGGCGAAGTCGACGCCACGTTCCGTCGCACCGACATCGCTGCGCTGAAGTCGTTCATCTCGCGCAGCCACGACGAGATCCGCCGCCCGTATGCGCGCACGGAATCCCGCTATGCGCGGCGCACCATCCTGTTCGCCAGCGTGAATGACGAGCGCTTCCTGCGCGACGCCACCGGCAACACCCGCTGGTGGACCGTGCATGCCGTGGGCCTGGGCGAACCGGCACGGATCGACATGCAGCAGGTGTGGGCCGAAGCCCATGCGCTCTACTGCGACGGCGAGACCTGGCACCTGTCCAGCGAGGAACTGGATGCGCTGAACGCCACCAACGGTGAGCACGAACCGATCTCGCCGATCGCCGAGCTGATCGACCGCCATTTCGACTGGTCCGTCCCCGCCGCGCAGTGGAGCGCACAGTACCGTGCCACCGAGATCGTCATCGCAGTAGGCATCGACAAGCCGAACCGCCGCGAGGTCAACGAGGCCGCGGCCTATGTGGTCAAGCGCCATGATGTGCGTACGAAGGTGGTCGGCAAGGAGCGGGCCAAGGTCTGGTTGATGCCACCGCGAAAGCTCAGCCTCGCCGGGCACGCAGCCGGTCCGTTCTGATGCCATGACTCCATGATGGCCATCAGATCAGCGGCGACAGCTCTGAGTGTCACCCCAACCCGTCGGGATGATAGTAGATGAAAGGTGTTGACTTGAGAGGTCTGCAGAGGCAAGATATCCGTATCGGCTTCGCCACCTTCCAGGAGCGCTTCCGCCATGCCTCGTCCCCAACTGCATGCCTTCGAAGGTGAGCAGCTGACCGTGCAGCAGATCCACCAGCGGGTACCGGTGCTTTCAGAACGCACCATCCGCGACCACCTCGCTGCCGGCCGCCGTACACGTTCCGCCATGCTGTGCTTCGACCCGGTCGCGGCGGCTGCTCGCGGTGGCCGCATCACCCAGCGCATCCTGCGCGCGCGCAACACCGCCGGCCGCGATTCCTGACCCGCCCGCACTGCCGAATTCCTCCAGGAGTAGATTCCGCATGATTCCCGCCTCCCTCGACAGCGGCCATCGCATGATTGCCGACACGCTGGCCGCGTTCCGTGCCGGCCCCGCTCTGGGCAGCATCGCGCTACGGGCGGCGCCGCAAGCGCGTCCTCCGCTCTATATCGGCATCGCCGGCAGCAGGCGGGCGGGCAAGGACACCCTCGCCAACGGACTGGCCTCGGCACTGGCGCTGCCCTGCGACAGCTTCGCCGCCCCGCTGCGGCAGTTCGTCGCCTCCCTCCTCGGGCTGTCGCTGCGTGAACTGGACAGCCGCAAGGAGGACGCCATCGACTGGCTGGCCGAGCTCACGCCGCGTCACCTGATGCAGACCGCCGGCAGCGAATGGGGGCGCGATCGCATTCACCCCGAGCTGTGGGTCCGTTCGCTGTTCGCGCGCCTCCCGGCGGGCGGCCTGGTACCTGATGTCCGCTTCGCCAATGAAGCCCATGCGATCCACCGCCGCGGCGGTGTCGTGATCCGCGTCAACCGCCCGGGCCATGGAGCCCATGACACCCGCGCCAGCGAGCAGTCACTGCCCGATGAACTGGTCGACATCGAGGTGAGCAACAACGGCAGCCAGGCCGATCTGGTACGCAGGACGCTGGACCAGCTGCTGTCGCGCGGCTTGATCTGAGCCTGTCTCAGGGCCGCGCTGCCCCGTCTCTTCCACTGACTCGATCGCAGGTTCCGGCCCATCATCGGCGAAAATGATGTAAGGTCGTCCCCCATCACCGACGACACCTGTCGTCACCCTGCATCCCCAACCCATTTCGCAGAGAGGAAACGCCATGGAGGTCGAACAGTTCACGTCGATGCGCCTGAAGGCCATCGAGTTGTTCAAGTCCCAGCCCAAGGGCAGCAAGGACACCGTGAGCCTGGACGCGATCTTCATCTCGCTGTGCTCGCAGGCCAATGTCCAGGCCGACGCCGGCGGTGCCAGGCAGGTGCGGACCGCCGCACGCCCTGGCAACCAACAACCGCCCGCACCGTGGTTCACCGAAACCCTCGCTGCCTTGAAGGGCAAGGGCGAGTCGATCACCGTGGCGCGCTTTCTGATGTTCGCCAACCGCTTCCCGGTCAAGCGCATGGACCAGGTCAACGCAGCCCGCTGGCTGCGCGACGCGGGTTACATCCCGCGCAAGACCGGCGGCAATCTGGTATTCGATCTTTGATCCAGCGCCTCCTGCAGTCCTGAAGCCCCGGCATCGTCCGGGGCTTTTTCGTTTCAGGGCCCGCTCGCCTGCATGAGGATGGCCAACGGGGCGTCGTCCTCACCCTGTACCTGCCGCATCTCCTTGATCTGCCCTTCCTTCCACCCTGTGAGGACCACGAGTACGAAAGGACAGGGAAACATCGGTGGCGACGGCCAGCAGTCGCGAGGCATCACACCGCGGCCTGCTGCACTCCTCACCCGTCCTCGCGGCGCCGATGAATGCCTTCAGCACAAATAATCGCAAGTAGATGAAAGGTGTTGACCTAAGTGACGGGATGGCAACAGTGGAGATCAATGCCACTGACGACACCCTTCATGAACGCCCTGCCCGACAGCATCCAGACCCTGGCCGAGGTCATCGGCGAATCCGCAGCCCTCACGCTGGTGCGTGCGTGGCCGCCGACCACCTCCAGCACCACCGGCCGTCACCGCGTCATCGTCTACGTCCCCTCCACCCTGCCCGACCAGCATCGGCTGATCGACATCCTTGGCCACGACGTCGCCCGGCGGCTGGTCGCGCACTTCGGCGGCGAACTGTTGTTCCTGGCGTCCTGCTTCGCTGCCGGCGCGCACGAACGCCGTGAGCAGATCGCCCGCGCGGTCGCCAGCGGCATGCCGCGCGAACATGTGGCACGTGAGTTCGGTGTCTCGCAGACCACCATCAAGCGCGCCCTGCGCGGTGCCCGCTCCGCGCCACCACCGGCGGTCCATCCGGCCCTGCTCAAGGGTTACGCGCGCGCATGAACGAAAGTGACCTGCTGGCCGGTGTACCGGACTGGGCCAAGTACCTGGGGGGAACCTCGGGCGTGCTGATCGCGGTATCGCTGTGGCTGCGCCAATGGTTGTCGTCAGCCAAAGTCGACCGTACCGCCGACGAAGCCACCAGCAACACGCTGCGCACCCTGCAGGAGCAGCTGGCCGCCGAACGCACCCGCGCCGACGGCCTGATGCACGAACGCGAGGCAATGGCGCAGGAGATCGGCCAGCTGCGCGGCGAGGTCAGCGCCCTGCGCGCACAGATCGTCCAGCAGAGCGTGCAGATAGACGCGCTGCTGGCGCTGGTGCGCAAACAGCCGGGAGCCGCTGCATGACCGCCGCCACAGCCATCGCCCTCGGCGGCGCCAATGTCGCCGCATTCCTGGACATGCTGGCCGTGTCCGAAGGCACCGACTTTCCCGGCCAGCGCTCGCGTGACCGTGGCTACGACGTGATCGTCGGTGGCCAGCTGTTCAGCGACTACCGCGACCATCCCCGGGTGCTGGTGTCGCTGCCGCGCTATGGCATCAAATCCAGCGCCGCCGGTCGCTACCAGTTCCTGCGCAGTACCTGGGATGACCTGCGTGCACGGCTGGACCTGCCCGACTTCGGCCCGGTTTCGCAGGATCGCGCAGCGGTTGCGCTGCTCAAGCAGTGCGGTGCCTACGAGCTGATCCGACTGGGACGTTTCGATGCCGCCGTCACCGCGGCCCGGCGCATCTGGGCGTCGCTGCCCGGCGCCGGCTATGGGCAGAAGGAACATGCGCTGGAAACACTGCGCGTGGCCTACCGCACCGCCGGTGGAGCCCTGCAGTGACACCCCTCGCCCTGCGGTTCCGAATCGGCCTGCTGGTCCTTGCCGGAAGCCACGCAGGCTGCGCCTGGCTGGGGTGGACGCTACGGGACCGCAGCGCAGACCTCGCCGATGCCAGCGCCCAGGCCGCACAGCAGGCATCGCGCGCTGACTCGGTGCAGGCCGCACACCAGCAGAACCTCGCCAATCTCCGTACCAGCGCTCAGGCCGAATCACGGCGCCTGGCCACGCAGGCCGAACGCACCCGGCAATTCACCAACCTGCAACAGGACATCGAGACCCATGCCAAGACGCCTGGCCGTGATCGCGGCGACGCTGATGCTGAGTTCGTGCGCATCTGGCGCGAAGCCAACGCCGGCCGCGCCCTGCCTCGTTGATCTCAGCATCGCACCGGCGCAGTTGCGTGCGCCGGACGAGCTGCCGGACCTGCAGGCTGCCACCGATGATGCACTGCTGCGCAACCACGTTGCGGTCGCGCGGCAGTACCACGCGCTGGCCGATCAACTACGAGCGCTGCTGTGCAGCCTCGGCAGTCAGCGCGGCATCACCCTCAACGGCGCGGTTCCGGTAACGCCTCCAGGCTGCGGCGTCGCTGCCGGCCACAACCACGGCGCTGTGCCGGCCTTCTGATTCCGCCGCGGGCGGCCACGCCTGCAATGCCGTGCCGCTCAGGTGATGGCAACACTGGCTGCAGACCGCGCGTGCCACAAGGCCACGTGCTTCCACCATTCACTACATGAGCTGATATGGCGACTGACTCCCCCTCCCCGACGCTCGACACACTGCACGGCGCCATCGAAACTGCCATCCGCGAACGCTTTCCGGACTTCGCAACCGTCGAGTTCTATCGCGAGACCAGCACCGAAGGCATGCCCACACCGGCCTGCCTGCTGGCACTGACCCGCTGTGACCGCAGCCGGGAAGGCAATGACGGCAGCGGACTGCTGCAGGCGGTGCTGCGTTTCGAGGCACGCGTCGTCGTAGCGGCCGATAGCACCGCAGGCGCGCTGCACCTGCGCAATGCCGCGATTGCACTGGCCACCTGGCTGCACCAGATAGGCCGCTTCCACGGCGCCTCCAGCGGCGCGATCGACGTGATCGCTGCCCTGCCCGAAGACACCGCAACCGCACAGCCCGGCCTGCACAGCTGGATCGTCGAATGGTCACTGCCGGTCGCGCTGGGCGACAACGCCTGGGACGATGAAAGGGGCCCAGTGCCGCAGGCATCCTATAGCTTCGCGCCAGAGATCGGCCTTGACCATGCGACGCGCTACCAGCCCCTGCCGGAGCGCGCGCCATGAGCGCCGAACACGCACGTTTGATCGGCAATCTGCTGATGATCGGTGTCGTGCGCGAGCTGGACGAAGCAGGCGGCCGCGTGCGCGTAGATGCCGATGGCATGCTCACCGACTGGATTCCCTGGCTGGAACGCCGGGCCGGACCCGGCGTGCGCAGCTGGTGCGCGCCCGAGCCGGGCGAGCAGGTCGTACTGGCGTGCCCCTATGGTGACCCCGGCCAGGCGCTGGTACTCGGCAGCCTGTACCAGGACCGCTTTCCGCCACCGGCCGACTCGCGCCTGCGGCAACGCACCGAATTCGCCGACGGCAGCACCATCGAATACGACCAGGAAACCACCACCCTCAACGTCCATGTCGGCAGCGGCAAGGTCATCGTCACCTGCGCGAATGCACAGGTGATCGCCAGCGAATCGATCGTGCTCGATACACCGTCGATCAAGGCGACCGGCAACCTGGACGTCAGCGGCGCGATCAATGCGGGCAAGGACATCAGCACACCCGCCGAGATCAAGGCAGGTGCCATCGGGCTGAAGGCACACAAGCACACCGCGCAGGGGCCGACCGCTCCGACCACGCCGGCGCAGGCCTGATCGGCCACGCCTGCAATGCCCTGAAAAACCGCACTTCACGACGATAGAGGCCATGCGAGGAATCGACGCCAACACCGGCAAATCATTGGATGGGCTCGCCCATCTGCACCAGTCCGTGCGTGACATTCTCACCACGCCCCTTGGCTCCCGCGTACTGCGCCGCGAATACGGCTCGCGCGTGTTCGAACTGATCGATGCGCCGACCAACCGCTCGTTGCGCATGGACCTGATCGCGGCCACCGTCGACGCGCTCGCGCGATGGGAACCGCGTCTCCACGTCGAGAACGTCGACGTCTCCCTCCCCGCCCCCGGCGTGATGATCCTGGCAGTGACCGGAATCCATCTGCCCGACGGCGAGGCCATCACCATCGAAGGAATCGAGGTTCGCTAACCGTGGCATCCGGCTCGTTCACCAGTGTCAATCTCTCCCAGCTGCCTGCACCGGCGGTCATCGAAGTGCTCGATTTCGAAGCCATGTTCGATGAGTCGCTGACTGCGCTCCAGGCCCTGGATCCCACCTTCGACGCGCTGTTGCCGTCGGACCCGGCATTCAAGATCCTCGAGGTCTGCACCTACCTGCGCCTGCTCGATCGCCAGCGCGTCAACGACGCAGCACGTGGCGTGATGTTGGCATATGCGGTCGGCAGCGACCTGGATCATCTCGCCGCGATCTTCGGCATCGCCCGCCAAGTGCTTGACCCGGGCAAGCCACAGCAGGGCGTCCCACCGCGCTACGAAAGTGACGAGGATTTCCGACGCCGTATCCAGCTGGGGCCTGAAGGTTTCAGCGTGGCCGGACCGGAGGGCGCGTATGTCTTCCATGCACTAAGCGCCGATCCTCGGGTACTCGACGCAAGCGCCACCAGCCCCACGCCGGGCGAGGTCGTGGTCTCGGTGCTGTCGCGCGAGGCGGATGGAACCGCCACCCAGGGCCTGCTCGATATCGTCGAGGCGAAGCTGAGTGCGGATGACGTGCGTCCGCTGACCGATCACGTAGTGGTGAAACCCGCTGCCATCGTCAACTACTCGGTTGACGCTGCACTTTTCACCTTTGCAGGCCCGGATTCCCAAGTTGTGCTGGCCGAAGCGCGCACCCGCCTTGACCGCTACATCAGCGAATCGCATCGGCTCGGTCGTGACGTCACCCGTTCGGGACTGTTCGCCGCACTGCATGCCGAGGGTGTGCAGCGCGTGGAGATCAGCAGCCCGGCGAAAGACATCGTGGTCGATCGTACCCAGGCCACGCACTGCACCAGTGTCACGCTGACCCATGGCGGCAACGATGAGTGACGCTGCCACCCGCCTGATCGGCGCGCGTCTGCGCGGTGCCATCGATGGGCGAAACCGTACCTTCCGTCATCCGGGTGGTGCACTGGCGACCTTGCAGGCGGTGTACCGTACCGACCAGCAAGGACGGCAGCGGTTGCGCGATGTTGCCATCAGCGGCGCCACGGTCACCCTGTCGGTCGCCCCGGCACCCGGCACGCTGATCGAGGGTGACGCGCAGATCGTGGTTCCGCATGCCTCCAGCCTGCTGCCACCCAATGCCACCCACGCCGAACGTGGACTGACCCGCGCCATCGTCGCCCGTCCGCTTCCGGTGGACATCACCGCACTGTGGGACGCCGACCGCTGCCCGACCGCGTTGCTGCCCTGGCTGGCCTGGGCGCTGTCGGTTGACGAATGGAAAGCGTACTGGCCCGAAGCCGTGAAGCGCGCCCGGGTGCGCACGGCCATTGCCATCCAGCGCCGCAAGGGCACATGGGGCAGTGTGCGCGACGTCGTCGCAGCGTTCGGCGGATCGATCCTGATCCGCGAATGGTGGGAGATCCAACCGAGAGGTACGCCCTACACCTTCGAAGCCGTGATGACCATCGCCAACCAGGGTGGCGAAACCGCAACAGGCAAGTTCGTCGATGACGTCATAGGCGAGATCAACCGAACAAAGCCTGTGCGCTCGCACTTCACCTTCACCCAGGGCATGCAGGCCGACACCGCTGTCGGCGTACTTGCAGGCGCCCACGCCACGGCGTTCCGCCGCATCCAACTGACCGGAGAGTAACCCCCGCATGCGTTTGAAAATCACTGATGCCGGCTTTGCCAAGCTGGTCAATCCGCCGAACACCGGTACCAACGCGGTGCTCGTGACCCAGATCGGCCTGACTTCCACTGCATTCACGCCATCGGCAGGCATGACCACACTTCCTGGCGAGATCAAGCGGATCGCAACCTTCGGTGGACAAGCCGTGGGTGACGACACTCTGCACCTCACCATCCGCGACGACAGTACGTCCGCCTACAGCCTGCGAGGCTTCGGCCTCTATCTGGCAGACGGCACATTGTTTGCCACCTTTGGCCAGGCCGATCCGATCATGGAAAAGACGGCAGAGTCAATGCTGCTGCTGGCCACCGACACGCGCTTTTCCGAGATCGATACCACGCAGATTCAGTTCGGCAATGCGGAGTTCATCTACCCTCCCGCCACCACTGAGGTGCAGGGCGTGGTCGAGCTGGCCACCAGCAGCGAAGCCGAAGACGGTAGCGACACCCAGCGCGCGATCACGCCACGCGGCCTTCGTGCATTCATCGACAAGCGTTTCGGCAGCACTGCGCCGACGACGTTCGTTCGCACGCTGCTGTCGATCGCTACGGATACGGCGTTCCGCTCCGCGCTGGGCCTCAAATCCGCAGCATTGAAGGATGAGGGTGCCGACAAGGGCCTTGATGCCGATCTTCTCGACGGTCGCCACGGTAACCATTACCTGGACTGGCGCAACATGACCGGCGTTCCTTCCAGCGTTCATGTTCCCGGTCAGGTGATCCTGTTCGCCGGCGCGACGGCCCCCAACGGCATGCTTCTGTGCAATGGCGCGGCCGTTCCACGCGCTTCGTATCCAGCACTTTTTGCTGCCATCGGCACCCGCTATGGCTCCGGCGATGGCACAACCACGTTCAATCTACCGCTGATGCGCGAAGGCACGGTGGTCGCGCACACCACTGATCCGCAATCCGTCGGCACATTCACGGCCGGCGCAGTCATCGCGCACGCGCATACAGGCACGACAGAGAACGCCGGGCTGCACGGTCACGCGGTAAGCATCGGCAATGCCGGCAGCCATGCCCATGGCGCAAGCGCCAGCGCTGTCGGGGACCATGCCCATGGTGCCTGGACCGATGCGCAAGGGAATCACAACCACGGCGTAAACGATCCCGGCCATTCCCACACATGGAACGGTCCAGCGTCCGGCGGCAGCGGTGGCTGGGCCGCGGCAACCGGTGCACGCCCCAATCCCACCGGCACCAGCCACAACGGCACGGGTATCTGGCTCAACGACGCAGGCAATCACGGGCACAACATCGGCATGAACGGCGCCGGTGGGCACACCCACTCGATCTCGATCGCGGCGGTCGGAGACCACAGCCATCCCGCCTCTCTCTCCAACGACGGCGAACACAGCCATGCCGTGAACGTGAAGCCTACCGGTGGCGACGCCAACCTGCCCGCCGGCCTCCGAATGATCTATTGCATCGCCTACTGAGGACACACGATTGACCACTGAACCACGATTTGCGTACTCCTACGATCCGCAGACCAGGGCCTATATGGGCACGGTGAAGCTGCAGCCATCGCCAGACGGACGGTGGCATCTTCCGGACTACACCGTTGAGGCCGCACCTCAACGCGCCGCTGGCGACTACCAATCACTACGCCTGAGCCAGGATGGCAGCCACTGGGAGCTGGTCGATGACTTCCGTAACCGGATGCTCTGGGACACGGTCACTTCTTCGGTGGTGCCCAATCGACTGGCGCTGGGCGAGAAGCTGCCGCCGGGCGTGACCCTGTCCGCTCCCTATCCGCTTACGGGAGGCGATGCGTACTTCAACGCCTGGAACGCTGACGCAGGTCGATGGGAGTTGAAGCCCGACTACAGCAACCGTCCACTGTGGAATCGTGCCGATGGCAGCCTCGCCGCACCGCTGGCCCGCGGCCAGGCACTCCCAGCCAGCGTCATCGACCAGGCACCGCCCGCAGAGCGTAAAGGTCCAGTCACCTATGACGAAGCCAGCGCAGCCTGGGTGAGTGTGATTGCGCCCGGGGACGAACCAGCGACTCCGCAGCAGCTGTGACATCGGGCCACGGCTGCAATTAAGCCAGCCGCGGCCAGATACGAAGATGTACCCATGCGGCGCAGATCGCGACCGCAGCACCCACCCAACCAAGGAAAAAACAACGCATGGCCGAATTTCTGCATGGCGTGCAGGTCGTCAACATCGATGGTGGTTCCCGCTCGATCGCTGTTGCCTCGACCAGCGTCATCGGCATCGTGGGCACCGCGCCCCGGGCCGACAAGATCGCCTTCCCATACAACACCCCGGTCCTGGTGACCTCGCGTTCGCAGGCTGCCAAACTGCTCGACGGCACCGCCACCGAAGTCGATGAGGGCACCCTGCCGGGCCAGCTCGACGCCATCTTCGACCAGTCCAACGCGGTCGTCGTCGTTGTCCGCGTCGAGAAGGGCGCCACCGAGAACGACACCCTGGCCAACGTGCTGGGCGGCGTGAACGCGCAGACCGGTGCCTACACCGGCGTGCATGCACTGCTGGCGGCAAAGTCGGTAGTGGGCATCAAGCCACGCATCCTGGCGGTGCCGGGCTTCACCCACACCCACGAAAAGCGCGACACCGAACTGCTGGCCAACCCGGTCGTGGCCGAACTGCTCGGCATCGCCGACAAGCTGCGCGCGGTGATCATCAAGGATGGCCCGAACAGCACCGACGACGCTGCCAGGAGCACCACCGCCCTGACCGGCTCCAAGCGCGTCTACGTGGTCGACCCGGCGCTGCTGGTGCAGTCCGGTGATGCCATCGTCACGCGCTACGCCTCCGGTGCAGTGGCCGGTGCCATCGCCCGCAGCGACAACGAACGCGGCTGGTGGGCTTCGCCATCGAACCTGGAACTCAACGGCGTGGTCGGTACCGCGCGTGCGATCGACTTCGGCCTGTCCGACGCGACCAGCCGCGCCAACCTGCTCAACCAGTCGAACGTGGCCACGGTCATCCGCGAAGGAGGCTTCCGCCTGTGGGGCAACCGCACCGCCAGCAGTGACCAGAAGTGGCAGTTCCTGTGCGTGGTACGTACTGCCGACATCATTGCCGACAGCCTCGAAGCTGCCCATCTGTGGGCCGTCGATCGCGGCATCAGCAAGACCTACGTCGACGACGTGCGTGAGGGTGTCAATGCCTTCCTGCGCGGCCTGAAGACCCAGGGCGCGATCCTCGGCGGCAACTGCTGGATCGACCCGGAACTGAACGCAGCGGACAGCGTGGCCCAGGGCCGCTTCTTCTGGGACTTCGACTTCACCCCGACCTACCCGGGTGAGCAGCTGACCTTCCGCATGCACATGAACAACAACTACGTCTCGGAGATCTTCTAAGCATGGCGCGCAAGATCCGTAAAAACTTCAACTTCTACGTCGACGGCAAGGGCTATGCCGGCAGCGTGATGTCCTTCACCGCACCGAAGCTGTCGCTGAAGACCGAGGACTTCCAGGCCGGCGGCATGCTCGCCCCGACCGAGATCGTGCTCGGCCATGAAAAGCTCACTGCCGATGTCGAGTTTGCCTCGGACGACGCGGAGATCATGAGCAAGTTCCACGTCGTTGAAAGCAAGGAATACGGCTTCACGGCCCGCGAGGCCCTGGAAGGCGATGACGGCGAAGTGACCCAGGTCGTGCACAACATGCGCGGCAAGGTGAAGCTGCTGGACCGCGGCGAAACCAAGGTCGGCGAGAAAGGCACGATCAAGGTCAGCCTGGCACTGAGCTACTACAAGCTGACCCATGGCGCCCAGGTCGTGCAGGAGATCGACGTGGTCAACATGATCGCCCGCCAGGGTGGCGTGGACGTGCTGGCCGGCATCCGCGGCGCGCTGGGTATCTAAGCCCGCACCTCACCGAGAAAACCGGGGGCGCCTCGCGCCCCCGCATCCATCGCACCGCATCGAATTCCAGGAACGCACCCATGTCCAGCAAGACCAAGACCCCCACCGACACCGTCATCGAGCGCGATGGCTTTGCCGAGATCACCCTCACCCGCCCGCGCCAGGTCAATGGCATGGAGACCGCCGTGCTGCGCATGCGCGAACCGACCGTGGAAGACATGGAGCGCTACCAGGACGACAAGGGCAGTGATGCACAGCGCGAAGTACGGATGATCGCCAACCTGTGCGAGATCTCGCCGGACGACGTGCGCAAGATGCCGTTGCGCGACTACGCCCGACTGCAGGCAGGCGTCGCGCTTTTTACCACCTGACCCTGCCGCAGATCAGGCAGGGAGTGCTCGCCCTGGCCGGTCATACCGGCTGGGGCCTGCGCGAGATCATGACACTGCGGGTGTCGAAGTTCATCTGGTGGATTCAGGGATTGCCGGTACATGGCCAATAACGTTCAAACGACAACGATCACGATCGGCGGCTCGGTGTCCAAGTCGCTGAAGGACGCATTGTCCTTCGCCAACGATGGCATCAAGCACATCGGCACCGAGTTGACATTGCTGGACCGCAGGCTTGCCCGCATGAGTACGACCAGCAAGGAATACGCCCGTATGCGTGCGCAGGTCGACGCGTTGCGTGCCTCGCAGGAGGGACTGGAGAGCATCGAGGCAAAGCGCACCGCCAACCTGGAGAAGCGCGAGAAGCTCGGCGCCTCGTTCAAGGCCGCACGCGGCACACTCGGCACCGCCGTCACCGCGTTGGCCACGCCGGTTGAGAACGCTTCCGGATTCGCCCGCCAGAACCAGCAGATCGGCGTAGCGGCCAACCTCAGCCGCGCCCAGGTCAGCGCACTTGGCCAGGCGATCCTGGAACAATCGCGTGCGACCAACCAGGGCGCCGATGCGCTGCAGCGCTCGATCAAGCTGATGGTCGCTGCCGGCATGGATGCGCAGTCGGCCCAGGCCAGCCTGGGTGCTGTCGGGCGAACCACTACCGTCACCGGTGCCAGCATCGATGATGTTGCCCAGGCCGCGGCCGCCCTGCAGCAGTCTTTCGATATCGATCCCTCGCGCATGCAGAGCGCACTGGATGTGCTGGTCGTCAACAGCCGGCAGGGCGGCCTGGGCCTGAAGGACATGGCCGAAGTGCTGCCTACCTTGGGTTCGTCGTTCGAAGCGATGAAGCTGCAGGGCACCTCGGCGGCCGCCACCGTCGGCGCCGCCCTGCAGGCGACGCTGGAATCGGCCGGCGGCGCCGACAAGGCCGCCAGCAACATGAAGAGCTTCATGTCCGAGGTGCTCTCGCCGGACATTCAGGAGAAGGCCAAGAAGAGCCTGAACCTGGATCTGCGCAAGATCATCGGCGATGCACAAACCAGCGGCGGCAATCCCTTCGATGCCGCGATGCAGGGGATCATCCAGGCGACTGCGGGCGACCAGAAGAAGATTGGCGCCCTGTTCAGCGATGCACAGGCGAAGAACTTCGTCCAGCCGATGATCGAGAACTGGGATACCTACATCCGCATCCGCGACACGGCGTTGAATGGATCGGCGGGTGCCACCGATGCAGCCTATGCCGATGCGATGCAGACCGATCCGCAGAAGATCGAAGGCGCCAAGATCGCCGTGGACAACCTGTCCAAGGTCTTCGGTGCGGCACTGCTGCCCGCGGTGGGCGAAGCCGCAGTCAAGCTGACCGAGCTGTTGAACGGGGTCACCTCGTTCGTGCAGGAAAACCCGAAGCTGATCGCCAACACCACGCAGATCGTGGTCGGCATGCTGGGCATGCGCACGGCGGTACTCGGCGCTCGCTATGCCTGGACCTTCCTGCAGGGCCCGATCCTGGGCGTGCAGAAGGCCTTGCAGCTGTTCCGGGGTGGCAGCCTGTTGGCCCAGATGGGGCGCTTCGGGCCAATGGCCATGCGCCTGGCATCGGGCTTCCGCATCGTCGCCACTGCCGTGGCCGCCATCGGTGGCGGGCCGATCACGATCGCCATCGCAGCGATCACCGCAGGTGCCATCCTGGTGCGCAAATACTGGGAACCGGTCAAGGCATTCCTGGGCGGCGTCTGGGAAGGTCTCAGCGGTGCAGGCACTGCGGCAATGGGTGAACTGATGCGTGCGATTGAACCGCTGCGCCCCGCCTGGGAAGTCATGAGTGGGTTGATCGGCCAGGCCTGGGATTGGCTGTCGAAGATGCTTGCACCTGCGCAGTACACCGGCAACGAGCTGTCTCGGGTTGCCCAGATTGGCAGCTTCCTCGGTACCGTTCTGATGGAAGGCCTGAGAATGAACATCCAGCTCATCAGCGGCCTGGTGCAGTACGTGGTCTGGATGGGCAATGTATACACGACCGTCGCCAGCGCGATCGGTAGCGGGATGAGCATGATGTGGACCGCGATCAAGTCCGGTGCCGAATCCCTGTTCGACTGGCTTGTCCAGAAGCTGGATTTCCTCATGCCCTACGTCGAGAAGCTGATGGGGTTCGTCGAAGGGGGCATGGGCAAGGTCAGTGCACTGGTCGGCAAGGGCCTGGACTTCGGGAAGGAAGTGCTTGCCGGTGGCGCGGAAGCGGTCGGCAACGGCATGGTCGGGTATACCAACATGCGGGCCGGCGGCCGGGGTGGCCTGAATGACGCCGTGGGCCTGGTCGGAGACGTCGCCACGTTGGACGCGGCGGGGGCGCGCAAGCGATGGGGAGGCATCAGCGAGGCCGCTCGCGGTCGCACCGCGCCCGACATGCCATCGCCCTCTTCGCGCGGCGTCACCACCGTGCAGCAACAGCAGACCAACAACATCACCATCCACCAGCAACCCGGTGAGTCCAGCGAATCGGTGGCACGTCGCACGGCCGATGAACTGCAGCGTCGCAACGCGGTTGCTGCCCGTGGTGGCCTGGCAGACAGGAACTAAGCATGAAGCGCGAGTTCGTAACCGCATCCATCGACAAGCTGTTGTCCAGTTTCCAGAGCAACGACTCCGGCAACGCCCCGGTGCTGCTGATGCTGGGTGGCTTCAAGTTCAGCCTCAACACCGCGGTGTTCCAGGAGATCCAGCAGAGCAACGAATATGGCTGGGCTGCGCAGGAGCGCATCGGCCAGATGGCCGCCCTGCAGTACACCGGCCCCGGCAAGGCCAGCATGACGCTGCCCGGCATCATCCACTATCAGTTCCGGGGCGCCGGCGATGAGCTCTCGCAGCTGCGCAAGCTGGCAGCGCAAGGCAAGCCCCAGCGGCTGCTGACCGGCAAGGGCGGGAACCTGGGGCTTTGGGTCATCGACAAGATCGACGCCACTGCCTCCGGCTTCACCGTCGATGCGGGAATCCAGCGGCACGAATTCACCCTCTCCCTGCGGAAGCACAGTGATGGCACGAACGTATAACACCCGCGACGGCGACGTCGTTGACCGCATCGCGTATGCGCACTATGGCGAGCAATCACCGGCCATTCTGCGCGCGGTGTTCGATGCCAATCCGGGCCTCGCCGCACGTGGCCCGGTGCTGGCCGCAGGCCTGGCGATCACCCTGCCCGAAGTGCAGCGCCCCGCCGGCGAACGCAAGGGGATAGCACTGTGGGACTGAACATCACACCGGCATTCCGCGTGGTGGCCAACAGCCAGGACATCACTGACAAGATCATGTCGCGCTTCAAGTCGCTGCGCATCACCGACGAGACCGACAACAACTCGGACATGCTGGAGCTGCAGCTGGCCGACCATGATCCGTCCGATCCGATCCAGCTGCCGCCGGCAGGCGCGGAGCTGGAAGCCTTCATCGGCTACGACGGCGAAGTACGGCGCATGGGCCTGTACATCTGCGATGAGGTGGAGATTTCCGGCTTCCCGGGCAGCATGACCCTGCGCGCGCGCGCCGCACCGTTCGAGGCCAGCAAGGGCGGCAAGAACGATCTGCAGACGCAGAAGACGCGCACCTGGAAGAAGGGCACGACGATCGGTGGCATGGTGCAGCGCATGGCCGCCGAGCACGGACTGAGCGCCGCCGTGAGTGGACCGCTGGCGTCGATCGTGCTGCCGCTGACGGTGCAGTCGCAGGAGTCGGACATGAACCTGCTGCTGCGCCTGGCCAAGCAGCATGATGCCATCGCCAAGCCGGGCGGCGGCCGCCTGATGTTCGTCAAACGCGGCGAATCCACCAGTGCCAGCGGTGAGCGTATTCCCGACGTTACCCTTACCCCCGCCGATGGCAGTGGCTACAAAGTGAGCATCGTCTCGCGCGAGAAGACCGGCACCACCATCGCCTATTACCGTGATGTACGCGTTGCCAAGCGCCAGGAGGTGAAGGTGGGCAGCGGTGAACCGATCGTGCGCCTGCGCATGGCCTACGCCGACCGCGAAGCCGCCGAAGCTGCAGCGCGCGCCAAGCATCAGGAACAAGCCCGACAGACGCGTACGCTCAGCTACACCCTGCCCGGCCGCGAGACCCTCATGGCCGAAGCCACGGTGGTGATGCAGGGCTTCCGCGATGGCGTGGATGGGCAGTGGCTGGTCAAGCGCGCCGAGCACACCATCAGCCACGACGGCTACATGACCAGCATCGAGTGCGAACAAACCAACAGTGCCGATGCAGTGAAGGCGGCCAGCAGTGCGGCAGCTACCGAAAGTGTGCAGGTTGGCAGCGAGGTGTAG
Protein sequences of DBSCAN-SWA_5 >LR134301|2517645:2535719|2525475_2526372_+|VEE52763.1|plate|DBSCAN-SWA MASGSFTSVNLSQLPAPAVIEVLDFEAMFDESLTALQALDPTFDALLPSDPAFKILEVCTYLRLLDRQRVNDAARGVMLAYAVGSDLDHLAAIFGIARQVLDPGKPQQGVPPRYESDEDFRRRIQLGPEGFSVAGPEGAYVFHALSADPRVLDASATSPTPGEVVVSVLSREADGTATQGLLDIVEAKLSADDVRPLTDHVVVKPAAIVNYSVDAALFTFAGPDSQVVLAEARTRLDRYISESHRLGRDVTRSGLFAALHAEGVQRVEISSPAKDIVVDRTQATHCTSVTLTHGGNDE >LR134301|2517645:2535719|2524491_2525082_+|VEE52761.1|plate|DBSCAN-SWA MSAEHARLIGNLLMIGVVRELDEAGGRVRVDADGMLTDWIPWLERRAGPGVRSWCAPEPGEQVVLACPYGDPGQALVLGSLYQDRFPPPADSRLRQRTEFADGSTIEYDQETTTLNVHVGSGKVIVTCANAQVIASESIVLDTPSIKATGNLDVSGAINAGKDISTPAEIKAGAIGLKAHKHTAQGPTAPTTPAQA >LR134301|2517645:2535719|2520406_2521045_+|VEE52754.1|DBSCAN-SWA MIPASLDSGHRMIADTLAAFRAGPALGSIALRAAPQARPPLYIGIAGSRRAGKDTLANGLASALALPCDSFAAPLRQFVASLLGLSLRELDSRKEDAIDWLAELTPRHLMQTAGSEWGRDRIHPELWVRSLFARLPAGGLVPDVRFANEAHAIHRRGGVVIRVNRPGHGAHDTRASEQSLPDELVDIEVSNNGSQADLVRRTLDQLLSRGLI >LR134301|2517645:2535719|2521918_2522368_+|VEE52756.1|DBSCAN-SWA MPLTTPFMNALPDSIQTLAEVIGESAALTLVRAWPPTTSSTTGRHRVIVYVPSTLPDQHRLIDILGHDVARRLVAHFGGELLFLASCFAAGAHERREQIARAVASGMPREHVAREFGVSQTTIKRALRGARSAPPPAVHPALLKGYARA >LR134301|2517645:2535719|2523940_2524495_+|VEE52760.1|DBSCAN-SWA MATDSPSPTLDTLHGAIETAIRERFPDFATVEFYRETSTEGMPTPACLLALTRCDRSREGNDGSGLLQAVLRFEARVVVAADSTAGALHLRNAAIALATWLHQIGRFHGASSGAIDVIAALPEDTATAQPGLHSWIVEWSLPVALGDNAWDDERGPVPQASYSFAPEIGLDHATRYQPLPERAP >LR134301|2517645:2535719|2522364_2522715_+|VEE52757.1|DBSCAN-SWA MNESDLLAGVPDWAKYLGGTSGVLIAVSLWLRQWLSSAKVDRTADEATSNTLRTLQEQLAAERTRADGLMHEREAMAQEIGQLRGEVSALRAQIVQQSVQIDALLALVRKQPGAAA >LR134301|2517645:2535719|2528631_2529210_+|VEE52766.1|DBSCAN-SWA MTTEPRFAYSYDPQTRAYMGTVKLQPSPDGRWHLPDYTVEAAPQRAAGDYQSLRLSQDGSHWELVDDFRNRMLWDTVTSSVVPNRLALGEKLPPGVTLSAPYPLTGGDAYFNAWNADAGRWELKPDYSNRPLWNRADGSLAAPLARGQALPASVIDQAPPAERKGPVTYDEASAAWVSVIAPGDEPATPQQL >LR134301|2517645:2535719|2533988_2534456_+|VEE52771.1|DBSCAN-SWA MKREFVTASIDKLLSSFQSNDSGNAPVLLMLGGFKFSLNTAVFQEIQQSNEYGWAAQERIGQMAALQYTGPGKASMTLPGIIHYQFRGAGDELSQLRKLAAQGKPQRLLTGKGGNLGLWVIDKIDATASGFTVDAGIQRHEFTLSLRKHSDGTNV >LR134301|2517645:2535719|2517645_2519982_+|VEE52752.1|integrase|DBSCAN-SWA MHAPHPAQDIVPAFLQAMHAHGIVPDARGRDAINADGALVRFHVEGDRRGTRNGWAVLFGDHVPAGEFGSWRTGSRHAWCAKPETTLSAAEQRTIRQRQEVARTERERQQREREEAAAKAANVLWNRALPADAHHPYLVRKGIHAHALRVAAWPVRNSDGLVFRYIDNALLVPVMNATGRIVSLQAIFPRTDPALGRDKDFLSGGRKQGCFHVIGKPLPGQPIAIAEGYATAASIHQATGGCVVVAWDAGNLAAVARTWRSAVPDASFVICADNDQWTRQPLDNPGVTHATRAAAEIDARVVWPEFAALHDDDDRPTDFNDLHLREGLEAVRTQLLPRPPAIAEEDTAEEDAPSSSNARYQVPGNLSAFDAFTPFPDTSARGRPLPTARNLAELCRRTGVTVRYNVIRKDLEILVPGLQSTVDNAKEVAAGEVMDCMHRAGMAITSFETNLCQVAEANPYNPVASWITSRPWDGQSRLQAFFDTVQEAQPTRMGDGRILKEVLMRRWLISGVAAAFEPDGVVARGVLTFVSKQNLGKTRWARQLAPAELQLIADGVVLDPANKDSVKQVISKWIVELGEVDATFRRTDIAALKSFISRSHDEIRRPYARTESRYARRTILFASVNDERFLRDATGNTRWWTVHAVGLGEPARIDMQQVWAEAHALYCDGETWHLSSEELDALNATNGEHEPISPIAELIDRHFDWSVPAAQWSAQYRATEIVIAVGIDKPNRREVNEAAAYVVKRHDVRTKVVGKERAKVWLMPPRKLSLAGHAAGPF >LR134301|2517645:2535719|2520138_2520366_+|VEE52753.1|DBSCAN-SWA MPRPQLHAFEGEQLTVQQIHQRVPVLSERTIRDHLAAGRRTRSAMLCFDPVAAAARGGRITQRILRARNTAGRDS >LR134301|2517645:2535719|2531514_2533986_+|VEE52770.1|tail|DBSCAN-SWA MANNVQTTTITIGGSVSKSLKDALSFANDGIKHIGTELTLLDRRLARMSTTSKEYARMRAQVDALRASQEGLESIEAKRTANLEKREKLGASFKAARGTLGTAVTALATPVENASGFARQNQQIGVAANLSRAQVSALGQAILEQSRATNQGADALQRSIKLMVAAGMDAQSAQASLGAVGRTTTVTGASIDDVAQAAAALQQSFDIDPSRMQSALDVLVVNSRQGGLGLKDMAEVLPTLGSSFEAMKLQGTSAAATVGAALQATLESAGGADKAASNMKSFMSEVLSPDIQEKAKKSLNLDLRKIIGDAQTSGGNPFDAAMQGIIQATAGDQKKIGALFSDAQAKNFVQPMIENWDTYIRIRDTALNGSAGATDAAYADAMQTDPQKIEGAKIAVDNLSKVFGAALLPAVGEAAVKLTELLNGVTSFVQENPKLIANTTQIVVGMLGMRTAVLGARYAWTFLQGPILGVQKALQLFRGGSLLAQMGRFGPMAMRLASGFRIVATAVAAIGGGPITIAIAAITAGAILVRKYWEPVKAFLGGVWEGLSGAGTAAMGELMRAIEPLRPAWEVMSGLIGQAWDWLSKMLAPAQYTGNELSRVAQIGSFLGTVLMEGLRMNIQLISGLVQYVVWMGNVYTTVASAIGSGMSMMWTAIKSGAESLFDWLVQKLDFLMPYVEKLMGFVEGGMGKVSALVGKGLDFGKEVLAGGAEAVGNGMVGYTNMRAGGRGGLNDAVGLVGDVATLDAAGARKRWGGISEAARGRTAPDMPSPSSRGVTTVQQQQTNNITIHQQPGESSESVARRTADELQRRNAVAARGGLADRN >LR134301|2517645:2535719|2534439_2534655_+|VEE52772.1|tail|DBSCAN-SWA MARTYNTRDGDVVDRIAYAHYGEQSPAILRAVFDANPGLAARGPVLAAGLAITLPEVQRPAGERKGIALWD >LR134301|2517645:2535719|2523193_2523571_+|VEE52759.1|DBSCAN-SWA MTPLALRFRIGLLVLAGSHAGCAWLGWTLRDRSADLADASAQAAQQASRADSVQAAHQQNLANLRTSAQAESRRLATQAERTRQFTNLQQDIETHAKTPGRDRGDADAEFVRIWREANAGRALPR >LR134301|2517645:2535719|2525134_2525473_+|VEE52762.1|plate|DBSCAN-SWA MRGIDANTGKSLDGLAHLHQSVRDILTTPLGSRVLRREYGSRVFELIDAPTNRSLRMDLIAATVDALARWEPRLHVENVDVSLPAPGVMILAVTGIHLPDGEAITIEGIEVR >LR134301|2517645:2535719|2534645_2535719_+|VEE52773.1|DBSCAN-SWA MGLNITPAFRVVANSQDITDKIMSRFKSLRITDETDNNSDMLELQLADHDPSDPIQLPPAGAELEAFIGYDGEVRRMGLYICDEVEISGFPGSMTLRARAAPFEASKGGKNDLQTQKTRTWKKGTTIGGMVQRMAAEHGLSAAVSGPLASIVLPLTVQSQESDMNLLLRLAKQHDAIAKPGGGRLMFVKRGESTSASGERIPDVTLTPADGSGYKVSIVSREKTGTTIAYYRDVRVAKRQEVKVGSGEPIVRLRMAYADREAAEAAARAKHQEQARQTRTLSYTLPGRETLMAEATVVMQGFRDGVDGQWLVKRAEHTISHDGYMTSIECEQTNSADAVKAASSAAATESVQVGSEV >LR134301|2517645:2535719|2526364_2527144_+|VEE52764.1|tail|DBSCAN-SWA MSDAATRLIGARLRGAIDGRNRTFRHPGGALATLQAVYRTDQQGRQRLRDVAISGATVTLSVAPAPGTLIEGDAQIVVPHASSLLPPNATHAERGLTRAIVARPLPVDITALWDADRCPTALLPWLAWALSVDEWKAYWPEAVKRARVRTAIAIQRRKGTWGSVRDVVAAFGGSILIREWWEIQPRGTPYTFEAVMTIANQGGETATGKFVDDVIGEINRTKPVRSHFTFTQGMQADTAVGVLAGAHATAFRRIQLTGE >LR134301|2517645:2535719|2521199_2521583_+|VEE52755.1|DBSCAN-SWA MEVEQFTSMRLKAIELFKSQPKGSKDTVSLDAIFISLCSQANVQADAGGARQVRTAARPGNQQPPAPWFTETLAALKGKGESITVARFLMFANRFPVKRMDQVNAARWLRDAGYIPRKTGGNLVFDL >LR134301|2517645:2535719|2530519_2531023_+|VEE52768.1|tail|DBSCAN-SWA MARKIRKNFNFYVDGKGYAGSVMSFTAPKLSLKTEDFQAGGMLAPTEIVLGHEKLTADVEFASDDAEIMSKFHVVESKEYGFTAREALEGDDGEVTQVVHNMRGKVKLLDRGETKVGEKGTIKVSLALSYYKLTHGAQVVQEIDVVNMIARQGGVDVLAGIRGALGI >LR134301|2517645:2535719|2531103_2531394_+|VEE52769.1|DBSCAN-SWA MSSKTKTPTDTVIERDGFAEITLTRPRQVNGMETAVLRMREPTVEDMERYQDDKGSDAQREVRMIANLCEISPDDVRKMPLRDYARLQAGVALFTT >LR134301|2517645:2535719|2527151_2528621_+|VEE52765.1|tail|DBSCAN-SWA MRLKITDAGFAKLVNPPNTGTNAVLVTQIGLTSTAFTPSAGMTTLPGEIKRIATFGGQAVGDDTLHLTIRDDSTSAYSLRGFGLYLADGTLFATFGQADPIMEKTAESMLLLATDTRFSEIDTTQIQFGNAEFIYPPATTEVQGVVELATSSEAEDGSDTQRAITPRGLRAFIDKRFGSTAPTTFVRTLLSIATDTAFRSALGLKSAALKDEGADKGLDADLLDGRHGNHYLDWRNMTGVPSSVHVPGQVILFAGATAPNGMLLCNGAAVPRASYPALFAAIGTRYGSGDGTTTFNLPLMREGTVVAHTTDPQSVGTFTAGAVIAHAHTGTTENAGLHGHAVSIGNAGSHAHGASASAVGDHAHGAWTDAQGNHNHGVNDPGHSHTWNGPASGGSGGWAAATGARPNPTGTSHNGTGIWLNDAGNHGHNIGMNGAGGHTHSISIAAVGDHSHPASLSNDGEHSHAVNVKPTGGDANLPAGLRMIYCIAY >LR134301|2517645:2535719|2522711_2523197_+|VEE52758.1|DBSCAN-SWA MTAATAIALGGANVAAFLDMLAVSEGTDFPGQRSRDRGYDVIVGGQLFSDYRDHPRVLVSLPRYGIKSSAAGRYQFLRSTWDDLRARLDLPDFGPVSQDRAAVALLKQCGAYELIRLGRFDAAVTAARRIWASLPGAGYGQKEHALETLRVAYRTAGGALQ >LR134301|2517645:2535719|2529317_2530517_+|VEE52767.1|tail|DBSCAN-SWA MAEFLHGVQVVNIDGGSRSIAVASTSVIGIVGTAPRADKIAFPYNTPVLVTSRSQAAKLLDGTATEVDEGTLPGQLDAIFDQSNAVVVVVRVEKGATENDTLANVLGGVNAQTGAYTGVHALLAAKSVVGIKPRILAVPGFTHTHEKRDTELLANPVVAELLGIADKLRAVIIKDGPNSTDDAARSTTALTGSKRVYVVDPALLVQSGDAIVTRYASGAVAGAIARSDNERGWWASPSNLELNGVVGTARAIDFGLSDATSRANLLNQSNVATVIREGGFRLWGNRTASSDQKWQFLCVVRTADIIADSLEAAHLWAVDRGISKTYVDDVREGVNAFLRGLKTQGAILGGNCWIDPELNAADSVAQGRFFWDFDFTPTYPGEQLTFRMHMNNNYVSEIF |
22 | Enterobacter_phage(31.25%) | tail,integrase,plate | attL 2508575:2508591|attR 2523328:2523344 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2956175 : 2966615
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >LR134301|2956175:2966615|DBSCAN-SWA CATGGCCGTAGTATTCACCGGAGGGGAACTGTCCCGCTGGCTGGATGCCGGGACACTGGAAAGAGGCAGGCAACGCGCAGGTGCCGTGCACGCGCTGCACTACAGGGCGCCGCTGCTGACCGCGCAGGTGGAGACCTGGCAGGTGACCATCGATCTGGACGCCAGCGCCCGTGCTCCGCGCAGCCCTCTGCAGAGTCGCTGCACCTGCCCGGCGGGAGGCTCCTGCGAGCATGTCGCTTCGGTCCTGTTGGCGGCACTGGGCACGTCACTGGAGGGCGACACGGGCCCGCTACGTCCCGCTGCGCATCCGACGCTGCCAACCCTGCCGATCCAGGCGATGACACCGATCCTCCATCTGCACAGCCTGGAGTACCGACCCGACTGGCTCACCGAGCGCGATCCTTCGCAGTGGCTGGACGTTGCCACCCTCGCGTTCGAGTACGACGAGCACGTACGCTTTCTCGACGACCCGTCGCCGTTCGTGCATGGCCCGCAGGGCCAGCTCTACCTGCTGCCGCGTGACCTCCACGAGGAGGCACGGCGCGAAGATGAACTGCGCAGCGTGCATCTGCACCGTGAGCGGCAGCCACGCATCGTCCTGGACGGCGGCGGACCATTGTTCGAACTGCGCAGTTTCGACTGGACCCGCTTCCTGCTGGATGACGTGCCGCGGCTGGAGGCCCTGGGCTGGAAAGTGGAGGCAGACGACGATTTCCGCCATCGCATCACCCGGGTGGATGAAATCGATCTGGACATCCAGGCGGACCCGCAGGATGCGGGCTGGTTCAATCTGGGCCTGGAGATCCAGGTGGACGGCCACAACGTGTCCATGGTTCCGCTGCTGCAGCAGGTACTGAATGCCGATCCTCGCTGGTCGCGCGGCCAGCTGGACGCCATTGGCGATGACGAGAACATCCTGCTGACTGCCGGTGGCAATACGCGCCTGGCGCTTCGGGCGTCGCGATTGAAACCGGTCATTGCCCTGCTGGCCGACCTGTTCACCCCGCGCGATGTGCCGTTGCGGCTGTCGATGCACGATCGCGGTCGGCTGCAAGCCCTGAAGGACGACGCACACCTGCAGCTGCGGGGCCACGGGGACACCCAGGCACTCGTGCAGCGCCTGCTGCAGGCGCCTGCCCCGGAAGACGTGGCAACGCCTGCCGGATTGCAGGCGACGCTGCGTGGCTATCAGCGCGATGGCCTCTCGTGGCTGCAGTACCTGCGGCAGCAGCGCCTGGGCGGCGTACTGGCCGATGACATGGGGCTGGGCAAGACCCTGCAGACCCTGGCCCATCTGCTGGTCGAAAAGGAAAGCGGCCGCCTGGACCGGCCGGCACTGCTGGTGGTGCCGACCTCACTGCTGCACAACTGGCAAAGCGAAGCGGCACGCTTCACCCCGGACCTGCGCGTGCTGACCCTGCACGGATCGACGCGCGAGACGCTGTTCGACGCGATTCCCGCGCATGATCTCGTACTGACCACGTACCCCCTGCTGTGGCGCGACGAGCAGGCGCTGCAGGCACATGCCTACCATCTGCTGATTCTGGATGAAGCCCAGCAGGTGAAGAATCCGAAATCACGCGCTGCCGTCACCCTGCGCACGTTGCAGGCCCACCATCGCCTGTGTTTGACCGGTACGCCGCTGGAAAACCACCTGGGCGAACTGTGGACGCAGTTCGATTTCCTGCTGCCCGGCCTGCTCGGCACCGAGAAGCAGTTCAACCAGCATTGGCGCCACCCCATCGAACGCGGCAGCGACCATCGCCGTGCCCAGTTGCTGGCGCAGCGGCTGCGCCCGTTCATCCTGCGCCGACGCAAGGACCAGGTCGCGTCGGAGCTTCCGCCCAAGACCCTCATTACCCGTTCGGTCGAAATGGAGGGCGGCCAGCGTGATCTCTATGAAACCGTGCGTGCAGCGATGGAAAAGCAGGTGCGTGAGGCCATCAGCGGCAGCGGGCTGGCGCGCAGCCATATCGTGGTGCTGGATGCATTGCTGAAACTGCGCCAGGTCTGCTGCGATCCGCGCCTGCTGCCGGGCAACACACCGGCACGGGCTGCAGGCTCGGCCAAACTGGAGCTGTTGCGCGAGATGCTGCCGCCGATGATCGAGGAAGGTCGTCGCATCCTGTTGTTCTCGCAGTTCACTGGCATGCTGGCGCTGATCGCGCAGGCGCTGGACGATCTGGGACTGGCCTATGTGACCCTGACCGGTGATACCCAGGACCGGGCCACGCCGGTGCAGCGTTTCATGCAGGGTGAAGTCCCGGTGTTCCTGATCAGCCTCAAGGCAGGTGGCGTGGGGTTGAACCTGACCGCGGCCGACACCGTCATCCATTTCGATCCCTGGTGGAATCCCGCCGCCGAGAACCAGGCCAGTGACCGCGCTCACCGGATCGGCCAGCAGCAGCCGGTGTTCGTCTACCGGTTGATCGCCGCAGGCAGCATCGAAGAGCGGATTGCCGAGCTGCAGGAACGCAAGGCCCTGCTGGCCGAATCGATTCTCGAAGGCGGTGGCAGCGCCGGGCCGCGCTTCAGCGAAGAAGACGTGCAGGCACTGCTGGCGCCACTGCCGGGCCTGCCGGCCAGACGCACCCGCAGGCCGGGCCGACGGCCGCCGCAGCACTGACCGGAACACCCAGTCAGGTGGCGTTTCCCGATGGAAACGATTCCCATACGGGACTGGGACTACAATTGAGGGCTGAGCTCACAAAGGATCTGATCGATGCCCCTGCCCGCTTTCAAGGCCTACGATATCCGCGGCCGTGTGCCGGAAGAACTGAACGAGGACCTGGCCCGCCGCATCGGCGTCGCCTTGGCCGCACAGCTCGCTCCCGGCCCGGTGGTCCTCGGCCATGATGTGCGCCTGACCAGTCCGGCACTGCAGGATGCGCTGGCCGCAGGCCTGCGTGGGACCGGCCGTGAAGTGATCGACATCGGCCTGTGCGGCACCGAAGAGGTCTACTTCCAGACCGATCACCTCGGCGCGGCCGGCGGCGTGATGGTGACCGCCAGCCACAACCCGATGGACTACAACGGTATGAAGCTGGTCAAGGAGAACGCGCGGCCGATCAGTTCGGATACCGGCCTGTTCGCGATTTCCGACGCGGTCGCCGCCGATACGTCCGAAGCCCGGCCGCTGCGTGCCGGGCAGACCGCCCAGCATGACAAGCACGCCTACATCCAGCATCTGCTGAGCTATGTCGACGCCGGCAAGCTCAAGCCGCTGAAGCTGGTGGTCAACGCCGGCAACGGCGGCGCCGGCGCCATCGTCGACCTGCTCGCACCGCACCTGCCGTTCGAGTTCATCCGCATCTGCCACGAACCGGACGGCAGCTTCCCCAACGGCATTCCCAACCCGCTGCTGCCGGAAAACCGCGCTGCCACCGCCGACGCGGTGCGCGAACACGGCGCCGACTTCGGCATTGCCTGGGACGGTGACTTCGACCGCTGCTTCTTCTTCGACCACACCGGCCGCTTCATCGAGGGCTATTACCTGGTCGGCCTGCTGGCCAAGGCGATCCTGGCGCGCAACCCGGGTGGCAAGATCGTGCATGACCCGCGCCTGGTCTGGAACACCGTGGACATGGTCGAGCAGGCCGGCGGCGTGCCGGTGCAGTGCAAGAGCGGCCATGCCTTCATCAAGGAAAAGATGCGCGCCGAAGATGCAGTGTACGGCGGCGAGATGAGCGCCCATCACTACTTCCGCGAATTCGCCTATGCCGACTCCGGCATGATCCCGTGGCTGCTGATCGCCCAGCTGGTTTCCGAGAGCGGCCGCTCGCTGGCCGACTGGGTCGAAGACCGCATGGCGGCCTACCCGTGCAGTGGCGAAATCAACTTCAAGGTTGCCGATGCCAAGGCCGCCGTCGCGCGCGTGATGGAGCACTTCGCGGCGCAGTCGCCCGCGCTGGACCATACCGATGGCATCAGCGCCGACTTCGGCGACTGGCGCTTCAACCTGCGCAGCTCCAACACCGAGCCGCTGCTGCGCTTGAACGTCGAAGCACGTGGTGACGCCGCATTGATGCAGGCGCGCACCGATGATATCGCCCGCCTCATCCAGCAGTAATCAAGGAAGATCATGAGCCGCATCCAGCCTGTCATCCTTTCCGGTGGTTCCGGTACCCGCCTGTGGCCGCTTTCGCGCGAGGCGTATCCGAAGCAGTTCCTGCCACTGGCAGGCGAACTGACCATGCTGCAGGCCACCTGGCAGCGCGTCGCTCCCATCGCCGCGCACGGTCCGCTGGTCATCGCCAACGAGGAGCACCGTTTCGTCGCCGCCGAACAGCTGCAGCAGGTCGGGGCTGAACCGGCCGCGATCATCCTGGAGCCGGTGGGTCGCAATACCGCTCCGGCCATCGCCGTGGCCGCGCTGGAGGCCACCCGTGACGGTGCGGATGCATTGCTGCTGGTACTGCCTTCGGACCATGTGATCACCAACGAAGCTGCCTTTCGCGATGCGGTACAGGCTGCAGCCAGTGCCGCCGAATCCGGCAAGCTGGTGACCTTCGGCATCGTGCCGACAGGCCCGGAAACGGGCTACGGCTATATCAAGGCCGCCGACGGCCAGGGTCTGCGCGCGGTCGAGCGTTTCGTCGAGAAGCCCGACCTGGACACGGCCACCGGCTACGTCAGCAGTGGCCAGTACTACTGGAACAGCGGCATGTTCCTGTTCAAGGCATCGCGCTACCTGCAGGAACTGGAACGCTTCCAGCCGGCAATGCTGGCTGGCAGCCGCCAGGCCTGGCAGCAGGCCCGCCGCGATGCCGACTTCACCCGCCTGGACAAGGACGCCTTCAGCGCTGTGGCGTCCGACTCGATCGACTACGCGGTGATGGAAAAAACCGCAGACGCGGTGGTGATCCCGCTGGACGCGGGCTGGAACGACGTCGGTTCCTGGACCGCACTGCGCGATGTCTCGCAGCAGGACAGTGACGGCAACGCCCACCAGGGTGATGTGATCGCCATCGACTGCCGCAACACCTACGCCTATGCCCAGCGGCTGGTCGCCCTGGTTGGGCTGGATGATGTGATCGTGGTCGAGACCGATGACGCGGTGCTGGTCGGCAAGGCCGACCGCATGCAGGAGGTCAAGACGGTGGTGGCGAAGCTGAAGGCCGAAGGCCGCAGCGAAGCCACCTGGCACCGCAAGGTCTACCGCCCCTGGGGTGCCTACGACTCGATCGACAATGGTGAGCGCTTCCAGGTCAAGCGCATCACCGTCAAGCCCGGCGGCACGCTCAGCCTGCAGATGCATCACCATCGCGCCGAGCACTGGATCGTGGTCAGCGGCACCGCCGAGGTGACCCGCGGCAACGACGTGATCCTGCTCAGCGAGAACCAGAGCACCTACATTCCGCTGGGCGTCACCCATCGCCTGCGCAACCCCGGAAAACTGCCGCTGGAACTGATCGAAGTACAGTCGGGCAGCTACCTGGGCGAGGATGACATCGTGCGGTTCGAGGATACCTACGGGCGCAACTGACGCAGTGGATCCTTCGGCGGGCAGTCAGACATCGACGGCCCGCCGGCCCTGCTCATCGGTCGCCGGATACCGACGGATCGACTTTCACCGGCAGAATCGCGCGCAGCTTCGGATCATCCAGCTGGGCCTTCAGCTTCGCCGCATCCGGGTACGGCAGCTCGAACTCGCCGACGTCCAGCGCAGCCGGATCGCCGGTGACCAGATAACGCACCGCATTGGCCTGCTGTACCCGGCAGTCCGCAGCGAACTCACGGATCTTGCGGACGTCCTTGCCCACCTGGCTCAGCGGCGCCACCACCACCAGCAGCGTGAACAGGACCACTGCGGTACGCGCTGCGTGGCGCATCCGCGACGCGGGCAGCAGGTTCCAGAGCTGCAGTGCAAACCACGCATTGGCGAACAGCCCAGGTACGAACAGTTCGGTGTAGCGCGATGCAGGAACACGCATCTCATGGCCGCGACCGTAGGCAATCGCCACGCCCTGCAGCGCTGTCCAGATGCACAGGCCAGCCATCACCAGATCGGTCCTGCTCGCCTGCCTGCGCAGCAGCATGCGCGCGATCATCACCACACCGGGCAACCATACGATGATGACCGCCCAGTTCGAGCGTGCCGGCCACGCCAGGGTATGCGTTGCCGCCAGCAACAGCTCCGCCAGCGACTGTGCACGTAGCATGCCGTGACCATCGATGACGGGAATCGAGCGATAGGCAACGATCGCCAACACCGCCAGTACCGCCGTCGCACAGATTGCCGGAGTGCGCTGGCCAGGCAGGCACAGGCAGGCGATCACGCAGGTGATGCCGACCGCGACGGCCGTCAGCATGCCCGAGGCCATGGTGGTGGCGGCCAGAACGCTCAGCGCGATGGCCGCCAGCAGCGCAGGAATGCTCTGATGATGGCGCGCAGCCAGTCCAACGGCGGCAATGCTGGACAGAATGAGGAAGTAGAACTGGCTCTGGAACCCGACCAGGAAGTTCTGCCAGGCAAACGGCAGGACCGACATCAGCAGTGCAAAGGCGACCAGCAGCCGGCGTCCGTTGGCCGCTGCGGCGTCCCGCAGCGCGTACCAGACCAGCAGCGCGGGAATGAAGCAGAACACCAGCGCGCTGATGCGTGCTTCGTAGACGTTGTTCCACTGTCCGGTCAGCAGATAGGACAGCAGTGCCACCAGACGGGTCGGCAGGATGCGATGCTCGTTATGCGGCGTCAGCAACGTGCTCCACTGCAGCGTCCCGTTCAACCAGGGCTGCAGTGCAGTGGCGCCTTCTCCGTCCCACTGGTCCCAGAAGGGCATCGGCGTGCTGAAGAACTGCACGTAGAGCAGGCGCACCAACAGCGCCATCACTGCTACACCGGCGGCCACAGCCAGTGGGGACCATCCATTCCTTGCGGCGGGCGCAGTGCACGCTCTGCTCATGAATTCCTTCTCGTCGACGATGAATGCGCCGCGCGATGGTGGAGCCATTTGCAGGAGCTGTCAAAGAAAAAGGCGCGTCGCGAACCCTGTTCACGGACACGCCCCTTTGCCTGCGCTACCTCAGCCGCGGGCCGCAGCGACCTCGCCGATCACGCGCGACAGGCCGTCTTGCCACGTCGGCAGCACGATGCCGAAATCCTGCTGCAACCGACGGTTGTCCAGCACCGACCAGGCCGGGCGCTTGGCGGGCGTCGGATACTCTGAGCTGGGAATCGCCTCGACGGCGGGCACCTTCGTCAACACGCCCGTCGCCAGCGCCTCGGCAAAAATCGCCTCGGCAAAGCCATGCCAGCTGGTCTGGCCACTGGCGGTCAGGTGCCAGGTGCCCGACAACTGGCCCGGATGCTGCAGTGCCTGCGCGGTGACATCGGCGATCAGCGCCGCCGGCGTCGGCGTACCCACCTGGTCAGCTACCACACGCAACTGCTCCCGCTCGGCGCCGACACGCAGCATGGTGCGGAGGAAATTCGCACCATGCGAGGCATACACCCATGCCGTGCGGAAGATCAGGTGGCGTCCTGCCGCCGCCCGTACGGCGTCCTCACCATCACGCTTGCTGATGCCGTAGACACCCAGCGGCGCGGTCGGTTCGTCCTCACGGTACGGGGCGGTGCCCTGTCCATCGAACACATAATCGGTGGAGTAATGCACGAACGGCACGCCATGCGCCGCACACCAGCGTGCGATCACGCCCGGCGCCTGCGCGTTGGCCGCAAAGGCGGCGTCGACCTCCTGTTCGGCACGGTCGACGGCGGTGTAGGCGGCCGCATTGACCACGATTGACGGCTGCAGCCGGTCAAGCAGGGCCGGTAAGCTGTCCGGCTGGCCGAAATCGGCGGTTTCGCAGGCGCTGCCATCGGGCAGGACGCCGCTGCGGGTGGTCGCGACCACCTTGCCCAGCGGCGCCAGCGCGCGCAGCAGCTCCTGGCCGACCTGGCCGTTGCCGCCGAATACCAGAACCGTCATGGCACGTAGACCGGCAGGCGGTCTTCAGCGATGTCCTTCAGGAAGGGGGCGTTCTCGTCCTTGGCCGACAGCGTCGGTGCACTGACCGGCCAGTCCACGGCGATGTCTGCATCGTTCCAGCGCACGCCGGCATCGAAGTCCTTGAGGTAGACCTCAGTACACAGGTAGCTGAACACCGCGCGTTCGGACAGCACAGCAAAGCCGTGGGCGAAGCCCTCCGGAATCCAGAACTGCTTCTTGTTCTCCGCGCTCAGCACCACCGCTTCCCACTGGCCGAAGGTCGGCGAACCGCGACGGATGTCGACGGCCACGTCATAGACCTCGCCTTCCAGCACGCTGACCAGCTTGCCCTGCGGCCGCGGCCACTGGTAGTGCAGGCCGCGCAGCACGCCCTGCGCCGAGGTGGAAACATTGCTCTGCACAAAGCGATCCGGCAGGCCCAGCGCAGCGAAGCGCTCGGCGTTCCAGGTTTCGAAGAAGTAACCACGGGCATCGCCGAACACGGCCGGCTCGATCACCACACAGCCGGGCAACTTGGTTTCAATCACTTTCACGGAACGACTCCACGCAGGGCGAGCTTGTGCAGGTACTGGCCGTAGCCATTCTTGATCAGCGGTGCAGCCAGTGCTTCCAGCTGCTCGGCGGTGATCCATCCCTTGCCAAACGCGATTTCTTCCGGGCAGCAGACCTGCAGGCCCTGACGGGTCTGGATGGTCTCGATGAAGTTGGATGCCTCCAGCAGCGACTGGTGGGTGCCGGTATCGAGCCAGGCGTAGCCACGGCCGAGCGCCTCCAGATGCAGGTGGCCTTCACGCAGGTAGCGCTGGTTCAGATCGGTGATCTCCAGCTCGCCACGCGGCGACGGCTTGAGCTCAGCCGCATAGTCACTGGCATTGCCGTCGTAGAAGTACAGGCCGGTGACCGCGTAGTTCGATCGCGGATTCTCCGGTTTCTCGACCAGGTCGATGACCTTGCCGTCCTTGTCGAACTCGGCCACGCCGTAGCGTTCCGGGTCATTGACCCAGTAACCGAACACGGTCGCGCCCTGCTCGCGCTGGTCAGCGTTGCGCAGCACTTCGCGCAGGCCATGGCCATGGAAGATGTTGTCACCCAGCACCAGGCAGCTCGGCTTGCCGGCGACGAAATCACGACCGATCAGGTAGGCCTGGGCCAGGCCATCGGGGCTGGGCTGCACGGCGTACTGGATGTCCATGCCCCACTGCGAGCCGTCGCCGAGCAGCTGCTGGAACAGCGCCTGCTCGTGCGGCGTGTTGATGATCAGCACTTCACGGATGCCCGCCAGCATCAGCACGCTGAGCGGGTAGTAGATCATGGGTTTGTCATACACCGGCAGCAGCTGTTTGCTGACGCCCTTGGTGATCGGATACAGCCGGGTGCCGGAGCCGCCGGCGAGGATGATGCCCTTGCGCTGGGTCATGAGGTCTCCTGGGTTTTCAGGCCGCGGTGCCGATGCGCTGCAGACGGTAGCTGCCGTCGAGCACGCCGTTGACCCATTCCTGGTTGTCCAGGTACCAGTCAACGGTGAAGGTGATGCCCTGCTCGAAGGTGTAGGCCGGTTCCCAGCCCAGGTCGTTCTTCAGCTTGGAGGCATCGATCGCGTAACGGCGGTCATGGCCCGGGCGGTCGGTGACGTAGGTGATCTGGCTGCTGCGCGGCTGGCCGTCCGCGCGCGGGCGGCGTTGGTCCAGCAGTGCGCAGATGGCCTGCACCACTTCGATGTTCTGCTTTTCCGAATTGCCGCCGACGTTGTAGGTCTCGCCGACCTGGCCCTTGGCCAGCACGGTGCGGATCGCTTCGCAGTGGTCGGACACGAACAGCCAGTCGCGCACCTGCTTGCCATCGCCGTACACCGGCAGCGGCTCGCCGGCCAGCGCCTTGGCGATCACCAGCGGGATCAGCTTCTCGGGGAAATGATACGGGCCGTAGTTGTTGGAGCAGTTGGTGGTCAGCACCGGCAGCCCGTAGGTGTGGTGGAAGGCACGGACCAGATGGTCCGAAGCGGCCTTCGACGCCGAATATGGGGAATTCGGAGCATACGGCGTGGTTTCGCTGAACTTGCCGGTCTCACCCAGGGTGCCGTACACCTCGTCGGTGGACACGTGCAGGAAGCGGAAGGCCGCGCCCTGTTCGGCCGGCAGCGCCTTCCAGTAGTCGCGCACCGCTTCCAGCAGGCCCAGGGTGCCCACCACGTTGGTCTGGATGAACGCTCCCGGGCCGTCGATGGAGCGATCCACGTGGCTCTCGGCGGCAAAGTTGAGCACGGCGTCCGGCCGGTGCTCGGCCAGCAGGCGGGTGACGAGCGCCTGGTCGCCGATGTCCCCTTGCACGAACACATGATTCGGGTTGCCGTCCAGGCTGGACAGGGTCTTCAGGTTGCCGGCGTAGGTCAGCGCATCGAGGTTGATGACCTTGACGCCGCGCGCGACGGCCTCGAGAACGAAGTTACCGCCAATGAATCCGGCGCCGCCGGTGACAAGCCATGTGGGCAC
Protein sequences of DBSCAN-SWA_6 >LR134301|2956175:2966615|2965559_2966615_-|VEE53152.1|DBSCAN-SWA MPTWLVTGGAGFIGGNFVLEAVARGVKVINLDALTYAGNLKTLSSLDGNPNHVFVQGDIGDQALVTRLLAEHRPDAVLNFAAESHVDRSIDGPGAFIQTNVVGTLGLLEAVRDYWKALPAEQGAAFRFLHVSTDEVYGTLGETGKFSETTPYAPNSPYSASKAASDHLVRAFHHTYGLPVLTTNCSNNYGPYHFPEKLIPLVIAKALAGEPLPVYGDGKQVRDWLFVSDHCEAIRTVLAKGQVGETYNVGGNSEKQNIEVVQAICALLDQRRPRADGQPRSSQITYVTDRPGHDRRYAIDASKLKNDLGWEPAYTFEQGITFTVDWYLDNQEWVNGVLDGSYRLQRIGTAA >LR134301|2956175:2966615|2961711_2963127_-|VEE53148.1|DBSCAN-SWA MAPPSRGAFIVDEKEFMSRACTAPAARNGWSPLAVAAGVAVMALLVRLLYVQFFSTPMPFWDQWDGEGATALQPWLNGTLQWSTLLTPHNEHRILPTRLVALLSYLLTGQWNNVYEARISALVFCFIPALLVWYALRDAAAANGRRLLVAFALLMSVLPFAWQNFLVGFQSQFYFLILSSIAAVGLAARHHQSIPALLAAIALSVLAATTMASGMLTAVAVGITCVIACLCLPGQRTPAICATAVLAVLAIVAYRSIPVIDGHGMLRAQSLAELLLAATHTLAWPARSNWAVIIVWLPGVVMIARMLLRRQASRTDLVMAGLCIWTALQGVAIAYGRGHEMRVPASRYTELFVPGLFANAWFALQLWNLLPASRMRHAARTAVVLFTLLVVVAPLSQVGKDVRKIREFAADCRVQQANAVRYLVTGDPAALDVGEFELPYPDAAKLKAQLDDPKLRAILPVKVDPSVSGDR >LR134301|2956175:2966615|2956175_2958800_+|VEE53145.1|DBSCAN-SWA MAVVFTGGELSRWLDAGTLERGRQRAGAVHALHYRAPLLTAQVETWQVTIDLDASARAPRSPLQSRCTCPAGGSCEHVASVLLAALGTSLEGDTGPLRPAAHPTLPTLPIQAMTPILHLHSLEYRPDWLTERDPSQWLDVATLAFEYDEHVRFLDDPSPFVHGPQGQLYLLPRDLHEEARREDELRSVHLHRERQPRIVLDGGGPLFELRSFDWTRFLLDDVPRLEALGWKVEADDDFRHRITRVDEIDLDIQADPQDAGWFNLGLEIQVDGHNVSMVPLLQQVLNADPRWSRGQLDAIGDDENILLTAGGNTRLALRASRLKPVIALLADLFTPRDVPLRLSMHDRGRLQALKDDAHLQLRGHGDTQALVQRLLQAPAPEDVATPAGLQATLRGYQRDGLSWLQYLRQQRLGGVLADDMGLGKTLQTLAHLLVEKESGRLDRPALLVVPTSLLHNWQSEAARFTPDLRVLTLHGSTRETLFDAIPAHDLVLTTYPLLWRDEQALQAHAYHLLILDEAQQVKNPKSRAAVTLRTLQAHHRLCLTGTPLENHLGELWTQFDFLLPGLLGTEKQFNQHWRHPIERGSDHRRAQLLAQRLRPFILRRRKDQVASELPPKTLITRSVEMEGGQRDLYETVRAAMEKQVREAISGSGLARSHIVVLDALLKLRQVCCDPRLLPGNTPARAAGSAKLELLREMLPPMIEEGRRILLFSQFTGMLALIAQALDDLGLAYVTLTGDTQDRATPVQRFMQGEVPVFLISLKAGGVGLNLTAADTVIHFDPWWNPAAENQASDRAHRIGQQQPVFVYRLIAAGSIEERIAELQERKALLAESILEGGGSAGPRFSEEDVQALLAPLPGLPARRTRRPGRRPPQH >LR134301|2956175:2966615|2958896_2960243_+|VEE53146.1|DBSCAN-SWA MPLPAFKAYDIRGRVPEELNEDLARRIGVALAAQLAPGPVVLGHDVRLTSPALQDALAAGLRGTGREVIDIGLCGTEEVYFQTDHLGAAGGVMVTASHNPMDYNGMKLVKENARPISSDTGLFAISDAVAADTSEARPLRAGQTAQHDKHAYIQHLLSYVDAGKLKPLKLVVNAGNGGAGAIVDLLAPHLPFEFIRICHEPDGSFPNGIPNPLLPENRAATADAVREHGADFGIAWDGDFDRCFFFDHTGRFIEGYYLVGLLAKAILARNPGGKIVHDPRLVWNTVDMVEQAGGVPVQCKSGHAFIKEKMRAEDAVYGGEMSAHHYFREFAYADSGMIPWLLIAQLVSESGRSLADWVEDRMAAYPCSGEINFKVADAKAAVARVMEHFAAQSPALDHTDGISADFGDWRFNLRSSNTEPLLRLNVEARGDAALMQARTDDIARLIQQ >LR134301|2956175:2966615|2964101_2964659_-|VEE53150.1|DBSCAN-SWA MKVIETKLPGCVVIEPAVFGDARGYFFETWNAERFAALGLPDRFVQSNVSTSAQGVLRGLHYQWPRPQGKLVSVLEGEVYDVAVDIRRGSPTFGQWEAVVLSAENKKQFWIPEGFAHGFAVLSERAVFSYLCTEVYLKDFDAGVRWNDADIAVDWPVSAPTLSAKDENAPFLKDIAEDRLPVYVP >LR134301|2956175:2966615|2963199_2964105_-|VEE53149.1|DBSCAN-SWA MTVLVFGGNGQVGQELLRALAPLGKVVATTRSGVLPDGSACETADFGQPDSLPALLDRLQPSIVVNAAAYTAVDRAEQEVDAAFAANAQAPGVIARWCAAHGVPFVHYSTDYVFDGQGTAPYREDEPTAPLGVYGISKRDGEDAVRAAAGRHLIFRTAWVYASHGANFLRTMLRVGAEREQLRVVADQVGTPTPAALIADVTAQALQHPGQLSGTWHLTASGQTSWHGFAEAIFAEALATGVLTKVPAVEAIPSSEYPTPAKRPAWSVLDNRRLQQDFGIVLPTWQDGLSRVIGEVAAARG >LR134301|2956175:2966615|2960255_2961659_+|VEE53147.1|DBSCAN-SWA MSRIQPVILSGGSGTRLWPLSREAYPKQFLPLAGELTMLQATWQRVAPIAAHGPLVIANEEHRFVAAEQLQQVGAEPAAIILEPVGRNTAPAIAVAALEATRDGADALLLVLPSDHVITNEAAFRDAVQAAASAAESGKLVTFGIVPTGPETGYGYIKAADGQGLRAVERFVEKPDLDTATGYVSSGQYYWNSGMFLFKASRYLQELERFQPAMLAGSRQAWQQARRDADFTRLDKDAFSAVASDSIDYAVMEKTADAVVIPLDAGWNDVGSWTALRDVSQQDSDGNAHQGDVIAIDCRNTYAYAQRLVALVGLDDVIVVETDDAVLVGKADRMQEVKTVVAKLKAEGRSEATWHRKVYRPWGAYDSIDNGERFQVKRITVKPGGTLSLQMHHHRAEHWIVVSGTAEVTRGNDVILLSENQSTYIPLGVTHRLRNPGKLPLELIEVQSGSYLGEDDIVRFEDTYGRN >LR134301|2956175:2966615|2964655_2965543_-|VEE53151.1|DBSCAN-SWA MTQRKGIILAGGSGTRLYPITKGVSKQLLPVYDKPMIYYPLSVLMLAGIREVLIINTPHEQALFQQLLGDGSQWGMDIQYAVQPSPDGLAQAYLIGRDFVAGKPSCLVLGDNIFHGHGLREVLRNADQREQGATVFGYWVNDPERYGVAEFDKDGKVIDLVEKPENPRSNYAVTGLYFYDGNASDYAAELKPSPRGELEITDLNQRYLREGHLHLEALGRGYAWLDTGTHQSLLEASNFIETIQTRQGLQVCCPEEIAFGKGWITAEQLEALAAPLIKNGYGQYLHKLALRGVVP |
8 | Enterobacteria_phage(42.86%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|