Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
LR134531 | Pragia fontium strain NCTC12284 genome assembly, chromosome: 1 | 4 crisprs | DEDDh,RT,cas3,cas3-cas2,cas8f,cas5f,cas7f,cas6f,WYL,csa3,DinG,cas3f,cas1 | 0 | 7 | 5 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134531_1 | 1539865-1540192 | TypeI-F |
I-F
Consensus repeat of LR134531_1
|
5 spacers
spacers of LR134531_1
>1.1|1539893|32|LR134531|PILER-CR,CRISPRCasFinder,CRT GCAAATAGCTCAAATCGGCGAGGATGGGGAAG >1.2|1539953|32|LR134531|PILER-CR,CRISPRCasFinder,CRT AACAAGTCGCTCAAATATTCCGATGTGACCAG >1.3|1540013|32|LR134531|PILER-CR,CRISPRCasFinder,CRT AAGCCCAGCCCGCTTAACTGCGTTCCTCCATC >1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT TCTTAATTTTGAATATAAAAATCCCTGCCAAT >1.5|1540133|32|LR134531|CRISPRCasFinder,CRT TCAGCTACTCGTCACTGGCAACATTATCGATA |
cas6f,cas7f,cas5f,cas8f,cas3-cas2 |
CRISPR arrays and Neighbor proteins around LR134531_1
The CRISPR arrays of LR134531_1 >merge|LR134531|1|1539865-1540192|PILER-CR,CRISPRCasFinder,CRT CTGAACCGCCGTATAGGCGGCTTAGAAAGCAAATAGCTCAAATCGGCGAGGATGGGGAAGCTGAACCGCCGTATAGGCGGCTTAGAAAAACAAGTCGCTCAAATATTCCGATGTGACCAGCTGAACCGCCGTATAGGCGGCTTAGAAAAAGCCCAGCCCGCTTAACTGCGTTCCTCCATCCTGAACCGCCGTATAGGCGGCTTAGAAATCTTAATTTTGAATATAAAAATCCCTGCCAATCTGAACCGCCGTATAGGCGGCTTAGAAATCAGCTACTCGTCACTGGCAACATTATCGATACTGAACCGCCGCATAGGCAGCCCAACCC >LR134531|1|1|1539865-1540132|PILER-CR CTGAACCGCCGTATAGGCGGCTTAGAAA GCAAATAGCTCAAATCGGCGAGGATGGGGAAG CTGAACCGCCGTATAGGCGGCTTAGAAA AACAAGTCGCTCAAATATTCCGATGTGACCAG CTGAACCGCCGTATAGGCGGCTTAGAAA AAGCCCAGCCCGCTTAACTGCGTTCCTCCATC CTGAACCGCCGTATAGGCGGCTTAGAAA TCTTAATTTTGAATATAAAAATCCCTGCCAAT CTGAACCGCCGTATAGGCGGCTTAGAAA >LR134531|1|1|1539865-1540192|CRISPRCasFinder CTGAACCGCCGTATAGGCGGCTTAGAAA GCAAATAGCTCAAATCGGCGAGGATGGGGAAG CTGAACCGCCGTATAGGCGGCTTAGAAA AACAAGTCGCTCAAATATTCCGATGTGACCAG CTGAACCGCCGTATAGGCGGCTTAGAAA AAGCCCAGCCCGCTTAACTGCGTTCCTCCATC CTGAACCGCCGTATAGGCGGCTTAGAAA TCTTAATTTTGAATATAAAAATCCCTGCCAAT CTGAACCGCCGTATAGGCGGCTTAGAAA TCAGCTACTCGTCACTGGCAACATTATCGATA CTGAACCGCCGCATAGGCAGCCCAACCC >LR134531|1|1|1539865-1540192|CRT CTGAACCGCCGTATAGGCGGCTTAGAAA GCAAATAGCTCAAATCGGCGAGGATGGGGAAG CTGAACCGCCGTATAGGCGGCTTAGAAA AACAAGTCGCTCAAATATTCCGATGTGACCAG CTGAACCGCCGTATAGGCGGCTTAGAAA AAGCCCAGCCCGCTTAACTGCGTTCCTCCATC CTGAACCGCCGTATAGGCGGCTTAGAAA TCTTAATTTTGAATATAAAAATCCCTGCCAAT CTGAACCGCCGTATAGGCGGCTTAGAAA TCAGCTACTCGTCACTGGCAACATTATCGATA CTGAACCGCCGCATAGGCAGCCCAACCC
>LR134531.1|VEJ54888.1|1539062_1539677_+|CRISPR-associated-protein-Cas6/Csy4,-subtype-I-F/YPEST MDSYIDIQILHNPEIASALVMNQLFYQLHLTLVENESPVAVSFPDYHVDADNHRNNTLGNRLRLHGSATALNSLNIQQYLVDISNYALLGEIKPVPDVVQGYAAFVRKHSKSPRDIDRKRQYLLKKAGGEWNDMAEKSLSRFIEKLRCQLPFIHLVSHSSKPAEDGGKNYFHLFINKETGEVQQGKLTKYGLSDVQHRTTVPIF >LR134531.1|VEJ54886.1|1538063_1539059_+|CRISPR-type-I-F/YPEST-associated-protein-Csy3 MSKTTLKTASVLAFERNFDVSDGVFFQTNWQQRDGFNKVKISEKSVRGTISNRLKNAIASDPTKLDSEIQKANLQTVDMAALSQENDTLVVKWSLKVLPFTGKPCVCNDANYQNKLEQTLKTYIDKVGFDELAKRYASNIANGRFLWRNRMGAEFIDVVVKAEGEKPMTFKAKDYSLMSFDKPDGELEKLANWIKQGLSGEKFVLLHIEAYAKVGSGQEVYPSQELILDKGRKSKTLYQIDEAAAMHSQKISNALRTIDTWYPDAQFPIAVEPYGSVTSMGTAFRQPKAKADFYTLFDAWVLKDQELTLEQRHYVAAILIRGGVFGESGKE >LR134531.1|VEJ54884.1|1537053_1538025_+|CRISPR-type-I-F/YPEST-associated-protein-Csy2 MKYYLVIPHIKIQNANCISSPLTYGFPAITAFTGAVHALSRKLTETFDVKLEGVAIAAHRCDVQRSRPNNYASDWSFIQSRHPIKKDGNSPSIIEEGYAHLDVSLVIEVIGDIDWNPQEKQLFCDAVYLQLMQQRLAGGSVLSIGTVLRRIHLHNDPKGDNEAIKAIKIQISPSFLLMDSQQALIKHTKKLQAKHPDKTALDALIDLCSIHVHPESVGDARDDISKGNVTWTMSSEKNGWLVPIPVGYKGIAPELGANQVADIRAPQYPTHFVETVYSVGEWRFTYKVNDFNTIFWRHHTDLDNQLFLVSQSSANDENLQTLF >LR134531.1|VEJ54882.1|1535755_1537057_+|CRISPR-type-I-F/YPEST-associated-protein-Csy1 MPVQRAVHAFINERLQAKLDKAKSPQEINDLKLKYHPHEWIADAARRASQIQFATHTVKGIHPDSKGTNINAQLIKITDGLVSSADLNTHALDIVGNAAALDVYKFLSVSVDENSTILDLVLKEDSQLLSVFSESPQQAKQYLAAFKSITQSSMENAKVYELMKQLLWPVNIDAQQQDNYLNIVPLYPTSFVHQVYQNLQEIRFSDEVKLAKDARKNEKGHEVGYTIVPDIAYVKLGGTKPQNISQLNSERGGRNYLLPSLPPTNQPEDTSAWRVKPPVLESSIFDRVFNYFAKKHIKRFCQNVIDESRNNASVREKRQLIIKDILEVLMRYGEQVRGFDGGWSKHGGLKASQAFWLDPKRADIDAEWAELRETSDWKMELAHDFGVWLNHRLNDAYRDNKYIHLADTEQKEWKYDMADLIKQSIREGWGIFE >LR134531.1|VEJ54880.1|1532268_1535424_+|CRISPR-associated-helicase-Cas3,-subtype-I-F/YPEST 
MDDFSPSDLKIILHSQKANLYGLEYLLRKTGEDSWQFAQSLQILTVLAALFHDVGKSSVGFQNKLKGIDSGLKGDPFRHEWVSMRIFQAIVGKAETDEQWLTFLKDFNQFEQDYPQWQQDIFKDGIDNDDKQDARYYAWQLLPPLARAVGWLITSHHRMPFYADGEENSERSIDEAFFKHLSPVNGWVQNRYEPVAKKYIQPFWTFESMVTESHSWQKSIARWADKALSDRYLAQQNNFSFFDPLLMHLSRMSLMVGDHTYSGLPRHRQYGDKDFSLYANTEKRDEKASQVKMRQRLDEHLLGVARQSEKFSQLLPKIADNLPSIADHKAFKKHTAEPRFSWQNQAYELACRLRTKSERGGFFGINIASTGCGKTLGNGRIMYGLSDPDKGARFTMALGLRVLTLQTGREYRQRMSLTKEDLAILVGGQAVIDVFNLRHDDKEKRMVLNDETESHQAQINHYGIGSESAQDFQDDYVHFDSPIEDHHLGSVLRDEKAKKLLYAPIVTCTIDHLVGATEIARGGKFIVPMLRLLSSDLILDEPDDFNLEDLPALTRLVNMAGMLGSRILLSSATLPPDMLAGLFEAYQAGRMIWNRHHAIQASPVVCAWFDETEVKSVDCVDKAQFTQENTQFVANRVIRLNQSPILRQGAILPLSVKPKKSLQFNQLAQDLLQGIQRLHSDHQQTCLQTGKRVSFGLIRLSNISPLVELAQALYQSDGLADADIRLCVYHARQLFALRSDLENQLDQILHRKEQQNIFRHPAVASVLAESASPNVIFIVLATPVAEVGRDHDYDWAIVEPSSMRSIIQLAGRIRRHRQEPVSTPNMLMMSTNIRALQQGKDLGAGHTPVFCLPGFESTSNKLSGPELILRSHNSADILLTALDNINSVSRISKPKPQIPLIALEHAAIARVMNKPDGKLNLVNAWWRPSPNSAVQSANPYLFHLQKQTPFRKSVRQTHYICLINDEEKFEFRDMKYITQTEQAKFEVDINIVHNHKVTPWLNQQYKTVLEGLADDIGEKDLNKLGIKFGFVDLDERQLSWKFHPLLGFWPK >LR134531.1|VEJ54878.1|1531922_1532195_+|Uncharacterized-conserved-protein-(DUF2132) MQHQSKDPLHGITLERLLTTLVEKYGWESLAREVSINCFKNDPSIKSSLKFLRRTPWARKQVEDLYIHSLNDNLDNKRDEVNNPWGKWVK >LR134531.1|VEJ54876.1|1530708_1531680_-|Uncharacterised-protein MLNKAVPLSIALLSLVLTSTQPVLAAKTISSSGQITYSEHTQPSRHYTVISRAPVTFYPPQLDSLQGHQVTGRLAVSFNRGMATNKHENSASQRLYGVIWFSALIESNMLAGTTRLYDMIIQQTNLPGSASDRANYLAQINTALQQKTLMANLNQFANTMPEGVHQLISNQSDVSIWYRPFGNLWGNGPVVMLNNLDTGQGNTLDQTTRIVWSDSAINHDHIGSSGYHKITGNKQRGTVSYSSSHAQNDLDSGVYRNAPPPQSIDSNSVLPAQRVYAGKDGNVYYYDKSQGWQLIQSNGQLKNIEPTANSGVEQDRLIRNHRQ >LR134531.1|VEJ54874.1|1528629_1530651_+|Excinuclease-ABC-subunit-B 
MSKTFKLNSAFQPAGDQPEAIRRLSEGLEDGLAHQTLLGVTGSGKTFTIANVIAQLDRPTMILAPNKTLAAQLYGEMKAFFPENAVEYFVSYYDYYQPEAYVPSSDTFIEKDASINEHIEQMRLSATKALLERRDVVLVASVSAIYGLGDPDAYLKMMLHLTRGMIIDQRSILKRLSELQYTRNDQVFQRSTFRVRGEVIDVFPAESDEYALRIELFDDEVERLSVFDPLTGQVQQVVPRFTIYPKTHYVTPRERILEAIEGIKVELADRRRVLLANDKLVEEQRLTQRTQFDIEMMNELGYCSGIENYSRYLSGRGEGYPPPTLFDYLPADGLLVIDESHVTVPQIGGMYRGDRSRKETLVEYGFRLPSALDNRPLRFEEFEQLAPQTIYVSATPGNYELEKSGGEIIEQVVRPTGLLDPEIEVRPVGTQVDDLLSEIRIRVEKNERVLVTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERVEIIRDLRLGEFDVLVGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNLNGKAILYGDRVTNSMAKAIGETERRRARQIAYNAENGIVPQGLNKSVEDILELGQGLGTLKNNKGRGKKKAAEPAADYTALTPQALDKKIRELEGKMYEHAQNLEFEEAAHLRDQLQKLREQFIALS >LR134531.1|VEJ54872.1|1526915_1527734_-|Uncharacterized-phosphatase-YwpJ MKYQIIALDLDGTLLNSQKQILPESLEALTQARKQGVKVIIVTGRHHVAIHPFYQALQLDTPAICCNGTYSYDYQKQQVLSGTPLTKTQAIRVAQLLRDYPIQSLMYIDNAMTFEQRDDNITRWYAWSERLPENQRPNILHVDTFETAINNAENVWKFATSSEDTDALNHFSALIEDELGLSCERSWHNQIDLAQQGNSKGNRLREWVESQGISMDKVIAFGDNLNDVSMLTQVGLGVAMGNSNQDVKVHADMVIGENETPAIANTIRQYVL >LR134531.1|VEJ54870.1|1525802_1526870_+|Sulfate/thiosulfate-import-ATP-binding-protein-CysA MLTLNFKQRLGDLGLDVSVQVPGNGITAVFGLSGAGKTSLINAVSGLTHPDSGEIILNQRVLVDCAKGYFLPPEKRHIGYVFQEARLFPHYSTKGNLQYGMKPSMAAQFDTIVELLGIGHLLKRFPMTLSGGEKQRVAIGRALLTAPELLLMDEPLASLDVPRKRELIPYLERLAHDVNIPILYVTHSLEEILRLAKQVIILDAGKVRASGELETVWASEAMSPWLHQEERSSILNLTLLKQHDNYPMTGLALGDDCLWVSKVDARAGEKLRVRINAADVSLVLALPQQSSIRNILAAEVVDVYTSEDKVDVKLSVGQHYLWARITPWARDELNIVVGQRLYAQIKSVSMSREAW >LR134531.1|VEJ54890.1|1540435_1540903_+|PAL-cross-reacting-lipoprotein MKSFILKSIGAACIALSMAGCASNISSDSYSDQQVGQASRTFAGTIVSSRVVNVEGNNEVGGLVGSVAGGVAGSAIGGGFRANALGAIGGALLGGVLGSSVEKGVSQQKAIEYVIQTERDGLITVAQGIDNPLGNGQKVLVIQGKTTRVIADTRP >LR134531.1|VEJ54892.1|1540961_1541873_-|LPPG:FO-2-phospho-L-lactate-transferase 
MRNRTLSDLEHVVALGGGHGLGRVMSALSSLDSRLTGIVTTTDNGGSTGRIRQSEGGIAWGDLRNCLNQLITEPSIASTMFEYRFSGNGELAGHNLGNLMLKALDNLSVRPLDAINLIRNMLKVRAYLIPMSEHPVDLAAIDVEGNLIHGEVNVDSLKQMPQQLMLEPHVAATYEAVEAIDRAELILVGPGSFMTSLMPPLLLKEIAEAMQRSKARIIYIGNLAKELSPAAASLTLQIKMGIIEQAIGGRKIDGIIIGPHTSIEGIQDKVIVQQPLEAEDIPYRHDRDMLHKALECALRKFGE >LR134531.1|VEJ54894.1|1543306_1543822_+|Molybdenum-cofactor-biosynthesis-protein-B MGHAASEFIPVSLAVLTVSDSRGEAEDTSGHYLVEAAAAVGHQVVDKRIIKDNIYQIRAVVSGWIADPSVQAIVITGGTGFTARDNTPEAIQPLFDRQVEGFGELFRMLSYEDIGTSTIQSRALAGIANQTIIFAVPGSTNACKMAWERIIVEQLDARHRPCNFLPHLSKK >LR134531.1|VEJ54896.1|1543836_1544316_+|Molybdenum-cofactor-biosynthesis-protein-C MSQLTHINASGEAHMVDVSAKSETVREARAEAYVEMSEQTLAMIMQGSHHKGDVFATARIAGIQAAKRTWELIPLCHPLLLSKVEVSLEAQPENNRVRIESCCRLTGKTGVEMEALTAASVAALTIYDMCKAVQKDMVIGPVRLLTKSGGKSGDFQAEA >LR134531.1|VEJ54898.1|1544315_1544561_+|Sulfur-carrier-protein-moaD MIKVIFFAQVRELVGTDQLALPAEYPTVEALRQALCSRGNKWPLALEPGKLLMAVNQTLVTAEHPIADGDEVAFFPPVTGG >LR134531.1|VEJ54900.1|1544562_1545024_+|Molybdopterin-synthase-catalytic-subunit MSEDKNTRITVSPAAFNVGDEYQWLSQCDDDGAVVTFTGKVRNHNLGDSVSALTLEHYPGMTEKSLAEIIIAARQRWPLQRVNVIHRIGELFPGDEIVFVGVTSAHRSSAFEAAEFIMDYLKTKAPFWKREATQEGDRWVDSRESDKQAAERW >LR134531.1|VEJ54902.1|1545232_1545808_+|Trp-repressor-binding-protein MTRVAIVYHSGYGHTAKVAEAVAKGLMEVSDTLVDMVPIDAEGNLPEQAWDALAVADGIIFGSPTYMSGPSWQFKKFADGSSAPWVAQQWKDKLFAGFTNSASMNGDKLSTLDYMFHLSQQHGGIWVGMGMLPSNTKAATRNDVNYIAGVSGLMTVSPADASVEEAPLPGELETARLFGQRIAQLTCRWVR >LR134531.1|VEJ54904.1|1545975_1546677_+|Inner-membrane-protein-YbhL MDPRHNGTIVEHANTGLQAFMAQVYGWMTCGLLLTAFTAWFVTGTALQSFIFSSNITFFGLIIVQLGVVFFLSGMLNRISGSLATTLFMLYSVLTGLTMSSIFVAYTSSSIASTFFVTAGTFGAMSLYGYTTKRDLSGLGSMMIMGLIGILLASLVNIFLKSPALTWAITYIGVIVFVGLTAYDTQKLKQIGEQVSLDDREGYRKSSIMGALTLYLDFINLFLMLLRILGDRR >LR134531.1|VEJ54906.1|1546817_1548209_-|Succinate-semialdehyde-dehydrogenase-[NADP(+)]-1 
MSYKTVNPYTGETLKTFPDATDAEVAGAIDKAHNAFLSWKEKPIAERLALLQRVADGLRKEKKEIAKLLTIEMGKLYSEAQGEVELSAQIFEYYVKNAEELLKPEKLPVADPAEGQAILVCDPLGVLLAIEPWNFPYYQIARILAPQLAVGNTLLLKHASNVPQSAAAFERVVLEAGLPEGLLQNLYATRNQVETIINDPRVHGVALTGSEGAGSVIASQAGKALKKCTMELGGADAFVVLADAPLKETVRWAVFGRHWNGGQVCVSSKRMIIDAKIYDDFLSQYQEGVAALKAGDPFDETTTLAPLSSQQAADEVKDKVRQAVAHGATAIEVGPKVPSQGAFVQPTILTNVTPDNPAYYWEFFGPVSMLFKAKDEDDAVRIANDTPFGLGGSVFTARPEHGAEVAKRISTGMIFVNHPTMVKADLPFGGVRRSGFGRELIGLGLKEFANHKLIDIVDINAPF >LR134531.1|VEJ54908.1|1548766_1550170_+|ATP-dependent-RNA-helicase-rhlE MSFDSLGLNAEILRAVEEQGYREPTPIQQQAIPVVLSGRDLMACAQTGTGKTAGFTLPLLQLLTANNDSPRGRRPVRALILTPTRELAAQVGENVQEYSRHLNIRSLVVFGGVSINPQMMKLRGGVDVLVATPGRLLDLEHQNAVDLSKVEILVLDEADRMLDMGFIHDIRRVLAKLPVKRQNLLFSATFSDEIKGLASKLLHNPASVEVAKRNTPSELVDQKVHLVDKKRKRELLSQMIGQGQWQQVLVFTRTKHGANHLAEQLNKDGITASAIHGNKSQGARTRALDEFKNGKIRVLVATDIAARGLDIDHLPHVVNYELPNVPEDYVHRIGRTGRAECTGEAVSLVCVDEHKLLRDIERLLKLEIPRIAFEGYDPDPTIKAEPIINGRQGRGQGQGPRQNSGRGRSSSQGNGHKDGGPWGNRNGQAAEGRADSKPRSQKPRSDKPAQQKRASFSGARRSGSGGE |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134531_3 | 2376857-2376994 | Orphan |
NA
Consensus repeat of LR134531_3
|
1 spacer
spacers of LR134531_3
>3.1|2376906|40|LR134531|CRISPRCasFinder GGATTCCTCAGCGCTTTCGCAGGGATGACGGTATAAATGC |
CRISPR arrays and Neighbor proteins around LR134531_3
The CRISPR arrays of LR134531_3 >merge|LR134531|3|2376857-2376994|CRISPRCasFinder GGGAATGACAATATAAATCATAGTTAAGAATTCATGTTTATTTTTCTGGGGATTCCTCAGCGCTTTCGCAGGGATGACGGTATAAATGCGGGAATGACAATATAAATCATAGTTAAGAATTCATGTTTATTTTTCTGG >LR134531|3|2|2376857-2376994|CRISPRCasFinder GGGAATGACAATATAAATCATAGTTAAGAATTCATGTTTATTTTTCTGG GGATTCCTCAGCGCTTTCGCAGGGATGACGGTATAAATGC GGGAATGACAATATAAATCATAGTTAAGAATTCATGTTTATTTTTCTGG
>LR134531.1|VEJ55773.1|2375663_2375942_-|Propanediol-utilization-protein-PduA MGDALGLIETKGLVACIEAADAMCKAANVELIGYENVGSGLVTAMVKGDVGAVKAAVDSGVESAQRVGEVVTSLVIARPHNDINKIVIKHKA >LR134531.1|VEJ55772.1|2375354_2375639_-|Propanediol-utilization-protein-PduA MGDALGLIETKGLVACIEAADAMCKAANVELIGYENVGSGLVTAMVKGDVGAVKAAVESGTESAKRVGEVVTSLVIARPHNDIQKIVSQYKVIE >LR134531.1|VEJ55771.1|2375063_2375342_-|Propanediol-utilization-protein-PduA MKEALGLIETKGLVPAIEAADSMCKAANIQLIGYENVGSGLVTVMVKGDVGAVNAAVESGVEAAGKIGVVVSSRVIARPHNDIEKIASQHKA >LR134531.1|VEJ55770.1|2373381_2375001_-|Aldehyde-alcohol-dehydrogenase MIALDNDLQSVQNARELIRNAKKAQSIYAKFSQEKIDSIVKHIAREAYLHAEELAKLANEETGFGNWQDKVLKNVFASSRVYEHIKNLKTVGIINDDRVNKVMEVGVPLGIVTALVPSTNPTSTIFYKTLIALKAGNAIIFSPHPNAQQCSRRAIDIVRKAAEEAGAPAGIVDGLSLLTLEATKALMHSKDVSLILATGGEGMVRAAYSSGTPTISGGPGNGPAFIERSADIKQAVNDIITSKTFDNGVICASEQSIIVERCIYNDVHRELLSQGAYFMNDDESNKLAGILLRPNGMINPAIVGKTAISLSERAGFSVPADTRVLISLQDSVSHKNPYSREKLCPVLGLYIEEDWMKACDRVVELLTNEGLGHTLVIHTQNQDVIRQFALEKPVHRILINTPAALGGIGATTNLIPALTLGCGAVGGGSTSDNVGPLNLINVRKVGYGVRSINDLRHPVVESANEPVVATAPACSSNTSIFDDHRFTSGAAIPATSQPAALYANGDDRFGACAISEAKGEITEESVERIIKEVLGRIGR >LR134531.1|VEJ55769.1|2373082_2373340_-|Carbon-dioxide-concentrating-mechanism-protein-CcmL MILAKVTGHVVATQKSDELRGCNLLLVTAIGDDQTLIKDKTYVAVDSVGAGTGDLVLVEEYFALNKDRYKAMSIVAIVEKVQRDC >LR134531.1|VEJ55768.1|2371918_2373067_-|Aldehyde-alcohol-dehydrogenase MNEFLIKPKIYFGANALECIETLDGRNAFIVTDKAMVKFGLTDKVCAILQRKQINYRLYDDVASDPDISAIVKGMKLMDENFPDLVIALGGGSVIDAAKAVIYALWHCRKDTGLTKPCFIAIPTTSGTGSEVTSFSVIKSRTEKLVIVDEFMLPDIAILDPSLVQSVPPAITADTGMDVLCHALEAYVSRQANDFTDALAEKTVKLVFGHLIDCYNNGNNALAREKMHNASCIAGMSFTNASLGITHSLAHALGGMFAIPHGKANALLMSYVVAFNADFYGSCDNQAARKYAELARMLALPAETTRQGVNSLIVAINVLKEEMNIPTSIKAVGISESAFSERLSSLVHQALKDNCTPTNPRDVSAWDMETLYQQAYAGVAMS >LR134531.1|VEJ55767.1|2368371_2371800_-|4-hydroxyphenylacetate-decarboxylase-large-subunit 
MATFSLTPRVKLLAERLLTKNSSISTERASVIAALANEISGMPQVIKNARLLNEFVKKFPTYIGQDELIVGSQSSNPRGVIFHSEAELKNPSVFNFLCHNASVISPDYMEVITKGFLAIEARMEDRVRSLGSAVSRSALDEVNQCKAAIYACSAAMNLALAFANKAASQATMESNPYRKAELLETAEILNKVPANPAGSFKEACQAFYLFQLVLHLENGSFAVNPMGFDKALYPLYQQDINSGKLTPQQAYEIIECLWLKLCELSEVRATREVDGYPMFDWLLHGGSLNDKEVVQNELSAMLLAARQNIASYNNGLQVRLFSSHQSGQPQFVESMSQSAPAATYQSSTSEMEGLTPRMKRLRDNYLQARPSVSIYRAVTFTEVTKKNLGLPPILLRAKAFRASCETAPILIQNEELIVGHPCGKPRAGAFSPDIAWRWVKDELDTMGTRPQDPFVISEEDKKVIREEIVPFWEGRSLDEICEAQYREAGIWAFSGETYVSDLSYHQINGGGDTCPGYDVLLFTKGMNGIKADAQERLAALSMENPGDIEKIYFYKAAIETCEGVVAYAKRIADRARELAAVENDPVRRAELLTIAQTNENVPANPPKTLQEALQSVWTVESLFEVEENQTGLSLGRLDQYCYPMYQADIESGRITQDQALELLQAFILKCAELMWMSSELGAKYFAGYQPFINLTVGGQKRTGGDATNDLTYLIMDAVRFVKVYQPSLACRIHNQSPQQYMEKIVDVVKAGMGFPACHFDDSHIKMMLGKGFSFEDARDYCLMGCVEPQKSGRIYQWTSTGYTQWPIAIEFVLNRGRMILFDSHQGIDTGELSSLKTFEDFERAVKTQISHIVKLSAIGTVISQRVHKEVAPKPLMSLLVEGCMEKGLDVAAGGAVRNYGPGLIFSGLATYVDSMAAIRKLVYEDKKYTLEYIRDGLLANFEGYDELKRDCLNAPKYGNDDDTVDLFALDITEWTEKECGKYQMLYSKLSHGTLSISNNTPIGELTAATPNGRLAWMPLSDGISPTQGADKQGPTAIIKSVSKMNVETMNIGMVHNFKFLKGLLDTPEGRNGLITLLRTASILGNGQMQFSYVDNEVLKQAQKEPEKYRDLIVRVAGYSAYFVELCKEVQNEIISRTVIEKF >LR134531.1|VEJ55766.1|2367321_2368272_-|4-hydroxyphenylacetate-decarboxylase-activating-enzyme MDDILEKKGRIFNVQKYSIYDGDGIRTLIFLKGCNIRCDWCANPEGLSSAFQVMFSQDRCVNCGKCVEVCPTGVHYRQSDAAGNPVHRIDRTAECIGCRKCEEVCITNALDIVGKDVTVREMMDVIMQDYDFYQASGGGVTLGGGELSLQADFAAALLTQCKKMMINTAIETNGTTNLANYEKLAECTDLFLFDVKHIDTEQHKALFGVGNEGVIRNLERLVELNANIVVRMPLVRGYNDSYDAITGAIHYVMELAKRGKIQRIDILPFHQLGKTKYEKLDMIYPVKGDPSYSDEELDRLADFFTQFDFDIRLVRH >LR134531.1|VEJ55765.1|2366867_2367308_-|Propanediol-utilization-protein-PduA MNMSLGLIETRGLTATISAADAACKSASVEIIGYKKVGSGLVTVCFQGEVSAVQAAIENGVAAVRPADLVVSSLVIARPDSSVVKLLNQFGRKKGSMDVEKKPVAVKAPVEVAPVIEPVLASQPVEPPVVNHPVKDKKPEGKKSKK >LR134531.1|VEJ55764.1|2366176_2366818_-|Phosphate-propanoyltransferase 
MINQSLLAKIHQKLPGFTANVNSASASIPIGVSNRHVHLSAGDIEALFGPGYQLTPFKELKQPGQYAAKECVMIVGPKGNITNVRVLGPARDKTQLEISKADCFVLGVKAPVRESGDLPDSADALLVGPAGHVHLKSQVICAQRHIHMNERDAQMLNVTNGQTVRVKTAGQRSLIFDEVVVRVKPSFALEFHIDTDEANAAGLKSNDSVFIVA >LR134531.1|VEJ55774.1|2377163_2377679_-|Uncharacterised-protein MSVKHHCGKLFFFAALFMLIFTSLPIAWRMLPIRDGAMSCSTKAIMHFEDTKMQSVNANIHFSFFGKGKGSIVVEGYTTSTEGYLYLQRYVQFDYTSERISPLERYYRVKSWQASKSSIDQSPDVVFDYFMREMSDSHDGLLLKAQKMNDKTLLLSSLNSPLYICALKPIR >LR134531.1|VEJ55775.1|2377682_2378495_-|Transcriptional-regulatory-protein,-C-terminal MKYKINAFLYYDATEGSLKLDDNGTSDTTLSITANALLYVLIQNPGVMTRDSVMKQVWDDNGLVSSNSNLNQYISLLRKTFRNYGIENIIVTIPKGRLEINPTLTIEVIDNNILHPVLQQQLIHDELADKQQETSDKAKQEKSTEISVDKNWGYAGLFIFIFACVMFFISHLSESASPQGIKLTAVDHDRCELLSIEKMINTAVKEGFIKSFDAVRTRLSINCGEDERFLFYYGDKLQTNGLGRTFLAHCAKHEDNPFSYCENYFYYAWK >LR134531.1|VEJ55776.1|2378525_2379131_-|transcriptional-regulator-BetI MIPRMNNNYQRKKDPERMQEQLLQAASVIAARDGIAALSLNAVALQAGVSKGGLLHHFPGKQELIHALFSNLLGRMEKKIVALEQEDPLPEGRFSRAYLMYIAGLGETDESRELALLSLAMPKETVLRKCWRDWMLEHLRHGDEIDRSYIGNLVRYAADGLWLSELTEGPTMTPIERQALVSRLADMTRGGNENIVLPDHK >LR134531.1|VEJ55777.1|2379410_2379827_-|acetyl-coenzyme-A-synthetase-(ADP-forming),-alpha-domain MEDRDIWTILREVKTIALVGASDKPDRPSYVVMEFLLSQGYDVIPVSPKLAGQMLLGQPVFANLKDIPRPVDMVDVFRHSDAAYEIAQEAIAIKASVLWMQIGVINHEAEALAQSAGLKVVMNLCPKIEIKRLGMNQS >LR134531.1|VEJ55778.1|2380011_2380470_+|Methylglyoxal-synthase MQLTTRIISVEKNIALVAHDHCKQYLLKWAEDNKNTLAKHKLFATGTTGNLIQRATEIPVHSMLSGPMGGDQQIGAMISEGKIDVLIFFWDPLNAVPHDPDVKALLRLATVWNIPVATNRSTADMLINSGLFEKEIEIAIPDYQKYLQERLK >LR134531.1|VEJ55779.1|2380521_2382576_-|Helicase-IV 
MELKSTSVGQHLAQHPYNKVKMLHAGVEVSGPKHTYTIPFNQLIAIRCKRGLVWGELEFELPQQKVVRLHGTEWHETQRFYHHLMQVWQLWGLEMSQIAADVLLKQMEKIDQRLQQERWLTHQDLVDIQAEIHTSLVSLPLPKERLHEFPNCHPHYQRCVEWIDSGTDIIDATNQRWVQRMLQQHQAFFQHIESQPLNPTQCRAVISGEKNVLVLAGAGSGKTSVLVARAGWLLYRKEAKAEDILLLAFGRKAADEMNARIQQRLATTEIQAKTFHALALSIISQSSKKVPVISKLETSSKARQHFLIEQWHSQCAEKKSQASGWRTWLSQDMGWSLDEGAFWKNSGLSERLATRLDYWIGLMRNHGGSQAEMLALAPEDLRTDFQKKLRLLAPLLKAWKKALKEEGAVDFAGLLHQAVNLIEKGKFISPWRHILVDEFQDISPQRAALVNALRRQQQDSSLFAVGDDWQSIYRFSGSEQSLVADFNRIFGMGEQCALDLTYRFNQGINDITSRYIQQNPAQIKRKIGSLYQGDKNAITILPDSQLEALLNKLSGYVRPEEKVLILGRYHYSKPEVLANASTRWPKLNMEFMTMHASKGQQAEYVIILGLSADRDGFPATENHSVIEQVLLPEVDDFPYAEERRLLYVALTRAKHQVWLMQDTASPSVFVKQLADIGVAVKRKP >LR134531.1|VEJ55780.1|2382798_2383248_+|Inner-membrane-protein-yccF MRSILNILNFILGGVFTTLGWLFATVLTVVLIFTLPLTRSCWEITKLSLFPFGNEAIHVDDLNPNERNALMNAGGTALNVFWLVFFGWWLCLAHICAGIAQCITIIGIPVGIANFKIAVIALWPVGRRVVPVEVAQQARIAKAKRQFQQ >LR134531.1|VEJ55781.1|2383291_2385433_+|Inner-membrane-protein-yccS MSFILGLRRYFYNSNLLYSIRIFIALSGVVFVPWWYGESLFTIPLTLGVVAAALTDLDDRLTGRLRNLVITLACFFIASVSIELLFPYPWLFALGLLLSTASFILLGSLGQRYATIAFGALLIAIYTMLGVSMYDVWYVQPLLLLIGAIWYNLITLLGHLILPIRPLQDNLARCFEDLARYLDAKSMLFDPDEEHQFTDQLIHATMANGNLVNTLNQTKVSLLTRLKGDRGQRNTRRMLRYYFVAQDIHEQASSSHAQYQQLSKKLRYTDILFRFQRLMTLQSKACLSIAYSIRFHQKYVYDTYQTQALAHLEKSLSNLQANRDLPSSLIQSLHHLLKNLKGIDTQLSNIESEQTVNTPSPQDNTLADDKLTGIKDIWSRIAQNLTPQSELFRHAVRMSIVLFIGYAIIQLAHLQNGYWIMLTSLFVCQPNYNATRARLTLRVLGTIAGILLGLPILYFVPSQEGQLILIVISGVLFFAFRNVRYAYATMFITLLVLFCFNLLGEGFRVAIPRVIDTLIGCGIAWAAVSFIWPDWKFRHISLVLKKAIDANCRYLDAILEQYHQGKNNSVDYRIVRRNAHNRDAELASVISSMAAEPRKDTQQLDQGFRLLCLNHSLLGYISALGAHREKLTESPAILSLLDDAVCYICDALQVNSDDTQSTMLALEKINQQITEMNSASSKNPLVTQQLALIIGLLPEFITLSDKIYTDIKQ >LR134531.1|VEJ55782.1|2385584_2386229_-|Regulator-of-competence-specific-genes MNREDTKSLVKDVIGYFSELGELTSRSMFGGYGICKNKVMFGLVSDDKFYLRANKYLESVFISYGMSQFIYNKRGVPVLMKYYHVNESLWQNEEILKRFVTYALSSAMTDMEERLIQEYLRLKDLPNLNIGIERLLRQVGVRTREDLMHLGALRTYIKLREFKRDVKLDLLFSLAGAIKGCHVAALPNSLRNELIEQLRTHDQTKYKELVNYSV 
>LR134531.1|VEJ55783.1|2386464_2386959_+|Cell-division-inhibitor-SulA MRSTQQTYSNALHTLINPMDKSSLGRYGANLVSEIVYDPQNPFTLHLLLPFLQQLGQQPRWQLWLSPERRLNRYWINSLGLPEKKTVALNTTSIEMSVEMIEKALKSGNFSSVIAWLPSMAPNVKEKLRQAAVDGDCYCFILQPLSANQGQFYQPDLFNRAHWH |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134531_4 | 3682253-3682880 | TypeI-F |
I-F
Consensus repeat of LR134531_4
|
10 spacers
spacers of LR134531_4
>4.1|3682281|32|LR134531|PILER-CR,CRISPRCasFinder,CRT TGGCCAGTGATTATGTTCAACATAAACAGCAC >4.2|3682341|32|LR134531|PILER-CR,CRISPRCasFinder,CRT TTGGTTTTGATTCAACTAATTGCGCTGAATAT >4.3|3682401|32|LR134531|PILER-CR,CRISPRCasFinder,CRT CGACCACTCTCAACAGGTGATGTCTGCGCCAA >4.4|3682461|32|LR134531|PILER-CR,CRISPRCasFinder,CRT GCCAGCTTTAAACATTGACAATATCCGCTTGC >4.5|3682521|32|LR134531|PILER-CR,CRISPRCasFinder,CRT CTTACGGCGCTTAACCACATTAGAGGGATAGA >4.6|3682581|32|LR134531|PILER-CR,CRISPRCasFinder,CRT CACGGCCGCAGAGATTTCCAGTTCTTGATGCT >4.7|3682641|32|LR134531|PILER-CR,CRISPRCasFinder,CRT GTGATAAAAATATCGCAAAAGGGACTCGCTTT >4.8|3682701|32|LR134531|PILER-CR,CRISPRCasFinder,CRT ATAATGCTCAGTTAATAACCGTTTCTTGGAAC >4.9|3682761|32|LR134531|PILER-CR,CRISPRCasFinder,CRT AAACGTGCGGGAAAAATACGGCTCAACGCTAT >4.10|3682821|32|LR134531|PILER-CR,CRISPRCasFinder,CRT AAGCCAAACCTATCGAGCGGGCGTTCTCGCAT |
cas6f,cas7f,cas5f,cas8f,cas3f,cas1,WYL |
CRISPR arrays and Neighbor proteins around LR134531_4
The CRISPR arrays of LR134531_4 >merge|LR134531|4|3682253-3682880|PILER-CR,CRISPRCasFinder,CRT TTTCTAAGCTGCCTGTATGGCAGTGAACTGGCCAGTGATTATGTTCAACATAAACAGCACTTTCTAAGCTGCCTGTATGGCAGTGAACTTGGTTTTGATTCAACTAATTGCGCTGAATATTTTCTAAGCTGCCTGTACGGCAGTGAACCGACCACTCTCAACAGGTGATGTCTGCGCCAATTTCTAAGCTGCCTGTACGGCAGTGAACGCCAGCTTTAAACATTGACAATATCCGCTTGCTTTCTAAGCTGCCTGTACGGCAGTGAACCTTACGGCGCTTAACCACATTAGAGGGATAGATTTCTAAGCTGCCTGTACGGCAGTGAACCACGGCCGCAGAGATTTCCAGTTCTTGATGCTTTTCTAAGCTGCCTGTACGGCAGTGAACGTGATAAAAATATCGCAAAAGGGACTCGCTTTTTTCTAAGCTGCCTGTACGGCAGTGAACATAATGCTCAGTTAATAACCGTTTCTTGGAACTTTCTAAGCTGCCTGTACGGCAGTGAACAAACGTGCGGGAAAAATACGGCTCAACGCTATTTTCTAAGCTGCCTGTACGGCAGTGAACAAGCCAAACCTATCGAGCGGGCGTTCTCGCATTTTCTAAGCTGCCTGTACGGCAGTGAAC >LR134531|4|3|3682253-3682880|PILER-CR TTTCTAAGCTGCCTGTATGGCAGTGAAC TGGCCAGTGATTATGTTCAACATAAACAGCAC TTTCTAAGCTGCCTGTATGGCAGTGAAC TTGGTTTTGATTCAACTAATTGCGCTGAATAT TTTCTAAGCTGCCTGTACGGCAGTGAAC CGACCACTCTCAACAGGTGATGTCTGCGCCAA TTTCTAAGCTGCCTGTACGGCAGTGAAC GCCAGCTTTAAACATTGACAATATCCGCTTGC TTTCTAAGCTGCCTGTACGGCAGTGAAC CTTACGGCGCTTAACCACATTAGAGGGATAGA TTTCTAAGCTGCCTGTACGGCAGTGAAC CACGGCCGCAGAGATTTCCAGTTCTTGATGCT TTTCTAAGCTGCCTGTACGGCAGTGAAC GTGATAAAAATATCGCAAAAGGGACTCGCTTT TTTCTAAGCTGCCTGTACGGCAGTGAAC ATAATGCTCAGTTAATAACCGTTTCTTGGAAC TTTCTAAGCTGCCTGTACGGCAGTGAAC AAACGTGCGGGAAAAATACGGCTCAACGCTAT TTTCTAAGCTGCCTGTACGGCAGTGAAC AAGCCAAACCTATCGAGCGGGCGTTCTCGCAT TTTCTAAGCTGCCTGTACGGCAGTGAAC >LR134531|4|3|3682253-3682880|CRISPRCasFinder TTTCTAAGCTGCCTGTATGGCAGTGAAC TGGCCAGTGATTATGTTCAACATAAACAGCAC TTTCTAAGCTGCCTGTATGGCAGTGAAC TTGGTTTTGATTCAACTAATTGCGCTGAATAT TTTCTAAGCTGCCTGTACGGCAGTGAAC CGACCACTCTCAACAGGTGATGTCTGCGCCAA TTTCTAAGCTGCCTGTACGGCAGTGAAC GCCAGCTTTAAACATTGACAATATCCGCTTGC TTTCTAAGCTGCCTGTACGGCAGTGAAC CTTACGGCGCTTAACCACATTAGAGGGATAGA TTTCTAAGCTGCCTGTACGGCAGTGAAC CACGGCCGCAGAGATTTCCAGTTCTTGATGCT TTTCTAAGCTGCCTGTACGGCAGTGAAC GTGATAAAAATATCGCAAAAGGGACTCGCTTT TTTCTAAGCTGCCTGTACGGCAGTGAAC ATAATGCTCAGTTAATAACCGTTTCTTGGAAC TTTCTAAGCTGCCTGTACGGCAGTGAAC 
AAACGTGCGGGAAAAATACGGCTCAACGCTAT TTTCTAAGCTGCCTGTACGGCAGTGAAC AAGCCAAACCTATCGAGCGGGCGTTCTCGCAT TTTCTAAGCTGCCTGTACGGCAGTGAAC >LR134531|4|2|3682253-3682880|CRT TTTCTAAGCTGCCTGTATGGCAGTGAAC TGGCCAGTGATTATGTTCAACATAAACAGCAC TTTCTAAGCTGCCTGTATGGCAGTGAAC TTGGTTTTGATTCAACTAATTGCGCTGAATAT TTTCTAAGCTGCCTGTACGGCAGTGAAC CGACCACTCTCAACAGGTGATGTCTGCGCCAA TTTCTAAGCTGCCTGTACGGCAGTGAAC GCCAGCTTTAAACATTGACAATATCCGCTTGC TTTCTAAGCTGCCTGTACGGCAGTGAAC CTTACGGCGCTTAACCACATTAGAGGGATAGA TTTCTAAGCTGCCTGTACGGCAGTGAAC CACGGCCGCAGAGATTTCCAGTTCTTGATGCT TTTCTAAGCTGCCTGTACGGCAGTGAAC GTGATAAAAATATCGCAAAAGGGACTCGCTTT TTTCTAAGCTGCCTGTACGGCAGTGAAC ATAATGCTCAGTTAATAACCGTTTCTTGGAAC TTTCTAAGCTGCCTGTACGGCAGTGAAC AAACGTGCGGGAAAAATACGGCTCAACGCTAT TTTCTAAGCTGCCTGTACGGCAGTGAAC AAGCCAAACCTATCGAGCGGGCGTTCTCGCAT TTTCTAAGCTGCCTGTACGGCAGTGAAC
>LR134531.1|VEJ56864.1|3680932_3682036_+|tRNA-(uracil(54)-C(5))-methyltransferase MTPDSLPTDQYNAQLTEKKNRLMSMMAPFSAPEPEVFRSPLSHYRMRAEFRIWHDGDDLYHIMFDQATKQRIRVETFPAASELINRLMPELLAGVKPYKTLRYKLFQIDYLSTMSGEIVVSLLYHRQLDDEWLQHATQLRDALRERGFNLQLIGRASKQKICLDRDYADERLMVAGREMIYRQTENSFTQPNAAVNIHMLEWALSVTEGSKGDLLELYCGNGNFSLALARNFNRVLATEIAKPSVAAAQFNIAANKIENVQIIRMAAEEFTQAMQGVREFNRLKGIDLTSYQCETIFVDPPRSGLDPQTVTMVQDYPRILYISCNPETLCENLTVLNQTHRVSRLALFDQFPYTHHMESGVLLERIK >LR134531.1|VEJ56863.1|3678676_3680524_-|Outer-membrane-cobalamin-translocator MSTKKIVLITSLSAAAFSAWAQEDAEKMVVTANRFPQPVSTVLAPMDVVDRDQIDLWQSKSLTDVLRRLPGVDIAQNGGRGQASSLYIRGTESRHVLVLVDGIRVPISGIMGVADFNQIPISLVQRVEYIRGPRSAVYGSEAIGGVINIITNAEKDGGKIEAGLGSNHYQLYDGSVRQTVGEKTTITAAGAFEDTRGFNIQPGSSYPPDSDKDGFRSKSLWAGIEQQFSSQFSGFLRAYGYSNNTEYDGSYGDERQLYSRNYDTGLRFLQGDFSSQLIASYQTYKDYNYNSAKGMYKDGTTLDDMTQRNIQWGNSYKLDRGVISSGIDWRQEKLESSNNYVSDSYKRDNTGFYITGQKSLSDFTFEGAVRTDKNQQFGWHETWQTAAAWEFIPAYRLTLSYGTGFLAPTLGQLYGSDRFNISSNHDLKPEESRQWEAGLEGDTGPLNWRLAAYRNKITNLIGYESDPITWQGGYYNIESATIKGVEWTGSFATGPLDHRITLEYLDPRRDKDNEVLGRRSKQKAKYQLDWNMFGLDMDVSYQYYGKRYDNNTNEYANTQRRLSSYSTVDVSAAYPITENFTVRGKVANLLDKDYQTAYRYETAEREYTLTASYTF >LR134531.1|VEJ56862.1|3677860_3678652_-|Glutamate-racemase MPTALVFDSGVGGLSVYQEIRQLLPDLSIIYAFDSAGFPYGEKSEQVIAERVVKLVGAICQQHQIDIIVIACNTASVVSLPALRDSFDIPVVGVVPAVKPAASLTRNGIVGLLATKVTVNRPYTHELIAKFAQDCQIKLMGSSRLVELAEEKLHGQPVMAEELKQILSPWLNTPQAPDTIVLGCTHFPLLKEELAAVLPKETLLVDSGAAVARRVATLLGDVDWQKVKKAPNIAYCSRLDAEAEKLLPILHQFGFESLTEQSI >LR134531.1|VEJ56861.1|3671539_3671800_-|Transposase MIRNSRRNFSPEFRLEAAQLVLDQHYTVTAAATAMNVGKSTMDKWVRQLKEERAGKSPKASPMTPEQLRIRELEKRLQRIEMETIY >LR134531.1|VEJ56860.1|3670623_3671562_-|Integrase-core-domain MKWKRYIKKGYRALDVRLPEQFSLVERLRTRFPVAFICNVFGIHRSSYRYWISRPQKPDAKHIVILSLVREVHHASNGSAGARSIADMVSGRGVPLSRWRASKLMKELNIVSCQQPKYRCRKATKERVDIPNHLDRQFAVTEPNQTWCGDVTYIWTGKRWAYLAVVLDLFSRKPIGWAMSFSPDSVLTGKALTMAWESRRKPAGVMYHSDQGSHYTSREFRRLLWRYRIKQSMSRRGNCWDNSPMERFFRSLKSEWVPNCGYANFSEANMAITNYIIGYYSQLRPHQYNGGLTPNESERLFWENSKTVANFC 
>LR134531.1|VEJ56859.1|3669636_3670569_-|Predicted-esterase MILRLLFSISLLLPLSSFSAELLNREDGSAITYYLERNSTSDKSPVLLVIIQGSDCNSVSKNRLIKNQLKYVWPQADVLTVEKYGIDSTLVYSDSVEREDCPKQYMQRDNLEQRVTDIKQVMDVSRDKYHYGSVIVLGGSEGAVVANILASKVDYIDATVSFSGGGRWFKDDVLHSMSVDTDNPSKVKEDAANFGEMVKYILSAKPFELEMSNHGYGWWRSVLSVDQQVVLSSVTSPVLIIQGGKDTSVPPKKVSQMIEILQGAGKNNIDYLFYPELDHVLQDKEGNSQMSEVAKEINIWLKKVMRNSGN >LR134531.1|VEJ56858.1|3668882_3669584_+|Bacterial-nodulin-like-intrinsic-protein MFKRLSAEFFGTFWLVFGGCGSAVLAAAFPGLGIGFAGVALAFGLTVLTMAYAVGHISGGHFNPAVTLGLFAGGRFPAKDVVPYIIAQVIGAIAAAAVLWVIADGKAGFDASASGFASNGYGEHSPGGFSLQAAIVAELVLTAFFLIIIHGATDKRAPAGFAPLAIGLGLTLIHLISIPVTNTSVNPARSTGVAIFQGGWALEQLWMFWLVPLIGGVIGGLIYRFLLESKKAD >LR134531.1|VEJ56857.1|3667923_3668475_-|Uncharacterized-protein-conserved-in-bacteria MQCPKCKTGVLQPTRLDNLIPVHTCNACGGNWLILEDYLRYKGQLPESAELPNDVVVKSEETKSALLCPVTGSLMLKFRISKDTEHRLDLSPKVNGIWLDKGEWELLNAHGLADKLNAIFTDVWQRQIREAHTQAQFDEMYLSQFGQEDYDKVKDLREWLQQHPQKDRLKAFLLADNPWSAVK >LR134531.1|VEJ56856.1|3666215_3667871_+|Acetate-transporter-ActP MKMRPLSLLPLLAISPMVFAEGISGEVKRQPLNIEAIIMFVLFVGATLYITYWASKRTRSRSDYYTAGGKITGLQNGLAIAGDFMSAASFLGISALVYTSGYDGLIYSIGFLIGWPIILFLIAERLRNLGKYTFADVASYRLKQKEIRTLSACGSLVVVALYLIAQMVGAGKLIQLLFGLNYHVAVILVGILMVLYVLFGGMLATTWVQIIKAVLLLAGASFMAIMVMKSVNFDFNELFVQAIKTSPKGVAIMSPGGLVSDPISALSLGLALMFGTAGLPHILMRFFTVSDAKEARKSVFYATGFIGYFYILTFIIGFGAIFLVGGNPIFKDAAGALIGGNNMAAVHLADAVGGSFFLGFISAVAFATILAVVAGLTLAGASAVSHDLYANVIKQGKATERDELKVSKVTTIVLGFVAIGLGILFEKQNIAFMVGLAFSIAASCNFPIIILSMYWRRLTTRGAMIGGWLGLISAVTMMVLGPTIWVQILGHAKPIYPYEYPALFSMTIAFFGTWFFSITDSSQSAIQERQLFYPQFIRSQTGYGASESVSH >LR134531.1|VEJ56855.1|3665907_3666219_+|Inner-membrane-protein-yjcH MNEHIYQRIESNPRFKDLVRKRDRFAWSLSFITLALYVGFILLIAFEPQWLGTPIAEGTSITRGIPVGIGLIITSFLLTGIYVYRANREFDEINAKILDEAHQ >LR134531.1|VEJ56865.1|3683036_3683675_-|CRISPR-associated-protein-Cas6/Csy4,-subtype-I-F/YPEST 
MNYYQEITLLPDADISLGFLWQNVFQQVHIALVEHKVDTNQSAVAVGFPDYRQAQFPLGSKLRLFAKEQATLEKIAINQWLARLKDYVHIKGIKPVPSDVTYVSFVRKQVKSPERIERDMQQKSALWAAKSGKSLAECLIELEKSKPTDLCRLPFIYLHSQQTKQRSPDKNSKFPLFIEMHPQSASLDGVFDCYGLSAKASGKPAFATVPHF >LR134531.1|VEJ56866.1|3683683_3684727_-|CRISPR-type-I-F/YPEST-associated-protein-Csy3 MAKNNDTASVLAFEKKLVPSDGYLFGTNWETKEQTTPLALQEKSVRGTISNRFNKKDVGEFTKDPAKLDAKVESPNLQRVDACALGQDQDTLKLHFTLKVLGGLAQPSACNNALFKQSYSAAVGQYIAKHGCLELAKRYATNLANARFLWRNRVGAEEIEVQVKALNKGAEQTWTFDAKQFSTRHFEHNDAQINSLADRIAQALASESGHLMLQIDCYANVGKAQEVYPSEELVLDKGNSKTKKSKILYAVNEHAAMHSQKIGNALRSIDTWYPEYVSEEQSAGAIAIEPYGAVTNLGKAFRTPKDKQDFYTFFDKWARGEALAREEDEHYLVAVLVRGGVFGESDK >LR134531.1|VEJ56867.1|3684738_3685632_-|CRISPR-type-I-F/YPEST-associated-protein-Csy2 MNVLILPHINIHNANALSSSFTIGFPAMTAWLGFVHALERKLNKAGLPELMLHSAAVVSHRCDVQTHKGEGDFVHSIIGTGNPLDKDGSRSAFIEEARCHLDVSLVIEWGGNEDQVQHADFAEQLQAVIATMKVAGGDVLSMHRPLNQSVDIDNPQETRALLRKLMPGYVLIERRDLMTEAMAQGSDALDALLSYLTVNHRCEQLEDGSVIWRSQRKASGWIVPIATGFQGISPLGEAKNQRDPSVPHRFAESVVTLGEFVMAHKIQHLDDMLWHYHNDLENDLYLCQQVNAINEHQ >LR134531.1|VEJ56868.1|3685628_3686906_-|CRISPR-type-I-F/YPEST-associated-protein-Csy1 MEEISVLDPAIATFFAERKEAWLKKNISAAMQASEVYEKQQECEQNFLLVNWLPDAARRAGQISVASHPCTFSHPSARKNKNGYVSSIIAKNKPRTDGFLRSGNVSVEPDALGNAAALDVYKFLSLAMSDQRSLLVHIEQESELARQLLNVPTCDYQTLRDGFLKMINTDQDSVSSSKIKQVYFPIADGEYHLLSLLTHSGHLFELRKRLDALRFGEEVKKVRECKKSNHFHPTGYQEIFGLTTIGFGGTKPQNISVLNNQNAGKAHLLASIPPDLKPRDIRLPKTDFFKESFTAWQSKEVLESLHRLFITDYNNIHIREGRDYRIQQYVDLVIEKMWQVRLFLAEYQGELPDELLQEQKIWLYPEFEQQREQEDEWLDKITRQIARSLILHYSRSKVIANPVLLADQELLAIEKVVSSNKENLR >LR134531.1|VEJ56869.1|3686905_3690280_-|CRISPR-associated-helicase-Cas3,-subtype-I-F/YPEST 
MMVTFISQCEKNALKKTRRVLDAFANRIGDNTWQTLITEEGLLTVKKMLRQTASRSTAVSCHWIRSRSRSQFLWVVGNKKKFNAEGVVPVNSTEKDLLNSEYESDWKYLPLIKALAAMAALLHDWGKSSLLFQAKLNPEIKTKYKGDPLRHEWVSCLLFHQFVTNHTNENHDRAWLNSLINQGIDEPSFNSNSLLREKALAELPSAAALIAWLIVSHHRLPLPKEQELCKAQRENSNASLADLLAKITPSWGYENRFDEYNSLLPKCFEFPLGLLSNAQTWLAELKHRAKDLLHHLPLLEQAMNDGSWRVILHHARLCLMLGDHYYSSQANDPQWHSSSELYANTDPSTKALKQKLDEHLVNVAKVTVNTVKLLPFFESEPLKATELTELAPKACTPKAFRWQDKAVRKIIEWREHTEDKSQGYFVVNMASTGCGKTMANAKIMQALSEDGESLRFILALGLRTLTLQTGDEYKQRLKLQDSDIAVLIGSKAIYELHQSGKQVDKEEIELNQAELGSESMESLQEETDELHWQGELPKEELTTVLTKEKDRKLLYAPVLACTIDHIMAATETKRGGRYILPCLRLMSSDLVIDEIDDFTEDDLIAIGRLVHLAGMLGRKVMISSATIPPDLALGLYNAYRQGWAVFAASRDRTTSINCVYVDEFTAHTELVGSSDENLDAYQAFQQGFIIKRVEKLKQQTAKRKAEIIPCLKQPGLPLEKQYFETVKQAVLDKHQHYFTLDPESQTQVSFGVVRVANIQPCVELTKHLLSSDWPEDTEVRCMAYHSQQVLLLRHEQEKHLDEVLKRKEKAGELPAAFAHPTIRGHLDTCGAKNLIFILVATPVEEVGRDHDFDWAVIEPSSYRSIIQMAGRVRRHRDGEIIAPNIGLLQYNVKGFKGGEERVFNHPGYETDRTTQLVTHDLTQLVDEKTLLQSVNAIARIQKRTTLEPQKKLADLEHFATAKTLGTDQIGKPEVTTASRQERYSRNRRDRQPHPYWSEHLHGHLHGYWWLTALPQYFKRFRKSEPTVQIYLVKKTRSIEFCLREEQGGLCPIERVLNIQHQPLAPEQQQKLWLQRDYCELIGQYSSSAEQEFATSVRYGEISFIYREGNQQYSYNDQLGLVKVK >LR134531.1|VEJ56870.1|3690276_3691266_-|CRISPR-associated-endonuclease-Cas1,-subtype-I-F/YPEST MDDLSPSDLKVILHSKRANMYYLEYCRVMQKDGRVLYLTEADKENLYFNIPIANTTVLMMGNGTSITQAAMRMLSQAGVLVGFCGGGGTPLHMATEVEWFTPQSEYRPTEYLQGWLSFWFDDEKRLAAAKQFQNARIEYLQQVWSSDRELALEKFNVQDEVIKQSLDTFHQRTDAASKQSDLLLTEAQLTKALYKYAANNTGKEGFSRQHQPDKKCTDKANGFLNHGNYLAYGLAASCLWVLGIPHGLAVMHGKTRRGALVFDVADLIKDAIVLPWAFVCAKENASEQEFRQQILQAFIEHKALDFMFDTVKRVALQGQSEEESEDRPL >LR134531.1|VEJ56871.1|3691385_3692222_-|Uncharacterised-protein MTIHTLQERHLFLEMLALWQGYIRNKDLVDQFVITRQQAYQDIRAYQERYPERLNKMASGPYQFSAQYIYQAPKHSLEHYLQWLSTGQFYAPQPALNSVLGEQCSVPQRYVSPQVIAVLTDAIRQQKRVELGYVSLSNPEWQGRIFHPHSFIKTGLRWHMRGYCEKSQDYRDLVLSRCRGEAELLDASEHTKDDDQVWNTQVDLIFAPDPRLNEAQREVIVHDYQMDNGQLHITTRAALVDYLLKEMQVKTQYLEGTPEAQQLILLNPRDVKPWLFDR >LR134531.1|VEJ56872.1|3692432_3692807_-|Inner-membrane-protein-yijD 
MTEQVNKEYGILLLAFIAGLSVNGSFNALFDSVIAFSIFPLIALGFSIYCLHQRYVTHPMPAGTPMLAASCFLLGIFLYSAVIRAEYAGMGSNFLLTVICVALVFWIGYKLNITARHKVASQTD >LR134531.1|VEJ56873.1|3692825_3693464_-|HTH-type-transcriptional-repressor-fabR MIGVRAQQKERTRRSLIEAAFSQLSAERSFASLSLREVAREAGIAPTSFYRHFRDVDELGLTMVDESGLMLRQLMRQARQRIAKGGSVIRTSVSTFMEFIGDNPNAFRLLLRERSGTSAAFRAAVAREIQHFIAELADYLELANQMPRSFSEAQAEAMVTIVFSAGAEALDIDAAQRKQLEERLVLQLRMISKGVYYWYRREQEKGVVPISL >LR134531.1|VEJ56874.1|3693613_3694531_-|Morphology-and-auto-aggregation-control-protein MNIRDLEYLVALSEHCHFRRAADSCHVSQPTLSGQIRKLEDELGVMLLERTSRKVLFTQAGMLLVDQAKTVLREVKVLKEMASQQGETMSGPLHIGLIPTVAPYLLPHIIPMLHQEFPKLEMYLHEAQTKDLLAQLDSGKLDCAILALVKETESFIEIPLYDEPMRLAIYSDHPWAGRDKIMMSELAGEKLLMLEDGHCLRDQALGFCFQAGADEDSHFRATSLETLRNMVAAASGITLLPLLAAPPERERDGIYYLPCYKPEPKRTIGLVYRPGSPLRARYEQLADTIAGHMPGVIDSEQKRTS |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
LR134531_5 | 3737382-3737502 | Orphan |
NA
Consensus repeat of LR134531_5
|
1 spacer
spacers of LR134531_5
>5.1|3737416|53|LR134531|CRISPRCasFinder GGCTCACACGGCATAAAACCCGGGAAATGGATTAATTAGGTTATTTTGCAAAG |
CRISPR arrays and Neighbor proteins around LR134531_5
The CRISPR arrays of LR134531_5 >merge|LR134531|5|3737382-3737502|CRISPRCasFinder TTACCCTTGCTTTTTAATGTTAAAATTATAACATGGCTCACACGGCATAAAACCCGGGAAATGGATTAATTAGGTTATTTTGCAAAGTTACACTTGCTTTTTAATGTTAAAATTATAACAT >LR134531|5|4|3737382-3737502|CRISPRCasFinder TTACCCTTGCTTTTTAATGTTAAAATTATAACAT GGCTCACACGGCATAAAACCCGGGAAATGGATTAATTAGGTTATTTTGCAAAG TTACACTTGCTTTTTAATGTTAAAATTATAACAT
>LR134531.1|VEJ56909.1|3735978_3737157_-|Nucleoside-transport-system-protein-nupC MKYIIGVLSIIIICGLAALVSKKRKEIRLRPIFVMLATQFIMAMLLLKTNVGNTIIMTFAKGFGYLLSYAQEGVNFAFGNLVNQGEMSFFVSVLLPIVFISALIGILQHWHILGFIVKYIGLGLSKINGMGKLESYNAVASAILGQSEVFISIKKQLGLLPEHRLYTLCTSAMSTISMAIVGAYMQLLDPRYVVTAMVLNLLGGFIIASIINPYQVQPEEDILTIQEERQSFFEMLSEYILDGFKVAVIVAAMLIGFIALITMANSIFTAVLGISFQEILGYIFAPLAFLVGIPWEEAVQAGGIMATKIVSNEFVAMHLLMNSEFKFSEHSIAVISVFLVSFANFSSIGIIVGAIKALDSRQGNTAARFGLKLLYGASLVSFLSATVVGLLY >LR134531.1|VEJ56908.1|3733946_3735899_-|2',3'-cyclic-nucleotide-2'-phosphodiesterase/3'-nucleotidase-precursor MLLRPSLPLSFLAVLIAGTVNAATVDLRIMETTDVHGNMMDYDYYKDKMTDKYGLVRTAKLIEQARKEATNSVLVDNGDLIQGSPMADYAAASGLKKGEIHPVYKVMNNLDYTVGSLGNHEFNYGLDYLKRALSGARFPYINANIFDAETGKPYFKQYLIVDTPVKDRDGKEHTVRVGYIGFVPPQVTLWDKSNLNGKVVTKDITETAKQLVPEMRKQGADVVVAIAHSGLSTDPYKVLAENSVYYLSEVEGIDAIAFGHAHGIFPGKDFADVKGADIEKGTLNGIPAVMPGHWGDHLGVIDLVLEGETNKWKITSGTAKARPVFDKEGKQVIKAEDKLVDIIKHDHQATRDFVNKPIGKIDTNVNSYLALVQDDPLIQIVRDAQKAYVEHYIQGDPDLDGYKVISAAAPFKAGGRKNDPAAFVNINKGELTFRNAADIYQYSNTLAAVKITGQDVREWLECSAGMYNQIDVNSAKPQHLFNWDGFRTYNFDMFDGVSYQIDVTKPARYDRDCNLVNPNSHRITNLTYEGKPLDAKQKFLAAVNNYRAYSGTFSGTGEKNVAFSAPDEVRSVVANYISSETKKHGVLTPATKYGWSIAPIAAPQQLDIRVETSPSETAAAFIKDHAQYPMTFVGHDDIGFAVYKIDLQKK >LR134531.1|VEJ56907.1|3732750_3733935_-|Phosphopentomutase MLTPFKRIHVVVIDSVGIGAAPDAAQFGDVGADTLGHIAESVSQLNVPNMAGLGLANIRPLKGVSKAEQPSGYYGKMQEISASKDTLTGHWEMMGIYVDQPFSVFPDGFPAELIEKIEAFSGRKVIGNKPASGTDVLDELGEQQLASGDLIVYTSADSVLQIAANEAVIPLEELYRICEYCREITRDEPYRIGRIIARPYVGDCRENFTRTTNRHDYALKPFTPTVLDALKSAGNDVVALGKISDIFDGEGITRSIRTSGNMDGMDKFISLLSEPFHGISFLNLVDFDALYGHRRDVQGYAQALEEFDARLPAVFAGMGQDDLLIITADHGNDPTAEGTDHTREFVPLLAYSNRFKQGGDLAIRQAFADLGATIAENFGLEKPNYGDSFLNKLI >LR134531.1|VEJ56906.1|3731393_3732695_-|Pyrimidine-nucleoside-phosphorylase 
MRMVDLIAKKRDGKELTTEEINFFINGYTDSSIPDYQASALAMAIYFQDMNDRERADLTMAMVNSGEVIDLSEIDGVKVDKHSTGGVGDTTTLVLAPLVAALDIPVAKMSGRGLGHTGGTIDKLEAVEGFHVEITKEQFIDLVNRNKVAVIGQTGNLTPADKKLYALRDVTGTVNSIPLIAGSIMSKKIAAGADAIVLDVKTGAGAFMKTNEDAIELAKAMVRIGNNVGRKTMAVISDMSQPLGYAIGNSLEVQEAIDTLRGQGPEDLTELVMALGCQMVVLAGKADTLEGARVLLEEVIANGKALEKFRTFLESQGGNGAVIDDPNALPQAAHLIEVPAKTSGVVSNIVADELGVAAMLLGAGRATKEDTIDLSVGLMLRKKVGDKVEAGESLVTVYANRPDVAAVIDKIYNNITISDRAEVPTLIHTIITE >LR134531.1|VEJ56905.1|3729821_3731171_-|putative-outer-membrane-porin-protein MRINTLHKLSVALLTCALLPLSAYAESQKNDTTPEFLADFVNNSQLDFSLRNVFKNLNTSDYGERSVQTAWGQGFTLDYRSGYLADMIGVDASYYSVLKLANSDEFYGRSVLYNDNGEAKGFNKMGQIYGKFRLGDDDTYFHLYSGWKELRKWGALNISTRAIPSSYLGWSAEAGTGPLRLRGAYVTRYTDRDSPEKVHFRSADRKRQISNISTADVKYQLQDYSALYFWGESHDYMRRQGLELEWKPSTMAGNKLRVTSLFYLNQGLDNWTEMSTSHKTFNKNAYHYALMAQWQADRWKHKVGASYTVAKLDDGLGRFEWHLAKNSRGTFNSMADSWGNDYVGHKEKMIAWTPGYAVTPEIEVGAVTNYGWGMKYKGVSIDRGETLLYTRWAPVDGKLKNLSVQLSGGPSWNFQSKRNKPILNESQDKALLAQNHSIELQIDYKFKIF >LR134531.1|VEJ56904.1|3728913_3729678_+|Deoxyribose-operon-repressor MDTPRGERLQKLIQTLKEGDKIHLRDAAQSLGVSEMTIRRDLNDNPYSLIVLGGYIVIDPKNNNVNHYFLSEQKSKNIDEKRYIGKLAAQQIEENDTVFFDSGTTLPFIIENIPDELRFTGVCYSLNTFLSLKNKPNCTVILCGGEFKTSSYIFTPVGANNELDFIRPNKAFISAAGVSLEHGVTSFILDEVRMKMQAMSASKQKILIADAYKFDKTKPGRFGTLNQFDKIITDQQPDKSYLDYCYQHDIGVIY >LR134531.1|VEJ56903.1|3728155_3728515_-|Uncharacterised-protein MKKMSILVLVVLAVAGCASKYSIMTKYHKQCDAQNPEANAYVGYVDCMNALLAADHKVSQGSGAINIMATANKYKQQVIDGKMTGKEAKLAFQNQYKRFIFNFGNPESTESVAAAPAVN >LR134531.1|VEJ56902.1|3727187_3727904_-|Ribonuclease-PH MRPEGRSAQQVRPITFTRHYTKHAEGSVLVEFGETKVLCTATIEEGVPRFLKGQGQGWITAEYGMLPRSTHTRNPREAAKGKQGGRTLEIQRLIARSLRAAVDLTLLGEFTITLDCDVLQADGGTRTASISGACVALADALNSLVAKGKLKKNPMKCMVAAVSVGIVGGEALCDLEYVEDSAAETDMNVVMTDDGRMIEVQGTAEGEPFTHEELLELLALARQGIDTIIQAQIAVLAQ >LR134531.1|VEJ56901.1|3726476_3727118_-|Orotate-phosphoribosyltransferase 
MKPYQRQFIEFALNKQVLKFGEFHLKSGRVSPYFFNAGLFNTGRDLALLGRFYAAALVDSGIAFDLLFGPAYKGIPIATTTAVALSEHHDIDIPYCFNRKETKDHGEGGSLVGSPLTGRVMLVDDVITAGTAIRESMEVIRQHDATLSGVLISLDRQERGRGELSAIQEVKRDYQCEVITIVTLDDLVEYLTEKPEMVEQLAAIQAYRSEFGV >LR134531.1|VEJ56900.1|3725394_3726405_+|Fructose-1,6-bisphosphatase-1-class-2 MKRELAIEFSRVTEAAALAGYRYLGRGDKNKADGAAVEAMRIVLNQVNIDGEIVIGEGEIDEAPMLYIGEKVGTGHGDAVDIAVDPIEGTRMTAMGQANALSVLAVAEKGAFLHAPDMYMEKLVVGPGAKDTIDLNLPLRENLIRIAAKLEKPLESLTVITLAKPRHDGVIAEMQQLGVKVFAIPDGDVAASILTCMPDSEVDVMYGIGGAPEGVISAAVIRALDGDMQSRLLARHQVKGDSEENRRIGEQELERCRQMGIEAGKVLKLGDMARNDNIIFSATGITKGDLLNGIQRKGNMATTETLMIRGKSRTIRRIQSIHYLDRKDPALCDILL >LR134531.1|VEJ56910.1|3737568_3738249_+|Deoxyribose-phosphate-aldolase-1 MNQQQIDYANYIDHTLLNMDATEAQITKLCEEAITHHFYAVCVNSGYVPLAAKCLKDTNVTVCSVIGFPLGAGLTSSKVFEAQAAIEAGAQEIDMVINVGWLKSGKFDEVKHDIQMVQNACGNVPLKVILETCLLTDEEIVQVCHMCKEIGTAFVKTSTGFSKSGASVHATKLMRESVGPVMGVKSSGGVRDRETAKAMIEVGATRIGTSSGVFIVGADSQDKSSY >LR134531.1|VEJ56911.1|3738490_3739354_+|YicC-like-family,-N-terminal-region MIRSMTAYARHETKGEWGSATWELRSVNQRYLETYIRLPEQFRSLEPVIRERLRTRLTRGKIECNLRFDPDPSAQTELMLNKELAAQLVQAANWVKMQSDEGEIDPVDILRWPGVMSAKSQDLDAISSELMASLEIALTDFIDAREREGAALKALIEQRLAGVSTEVQKVRAHMPEVLVWQRERLLSKLEEAQVQLDNNRLEQELVMLAQRVDVAEELDRLEAHVKETYVILKKPEAVGRRLDFMMQEFNRESNTLASKSINSEVTTSAIELKVLIEQMREQIQNIE >LR134531.1|VEJ56912.1|3740028_3740775_+|Ferredoxin--NADP-reductase MADWVSGKVTRVDNWTDGLFSLIIQAPVRSFTAGQFAKLALDVDGERVQRAYSYVNSPDSSELEFYLVKVPEGKLSPKLHQLKVGDDILVTQDAAGFFVLEEVPVADTLWMLSTGTAIGPFLSILQLGQDLERFNHIVLVHAVRYNQDLNYLPLMRELEKRYAGKLRIQTIVSREKTTDSLHGRIPALIESGELESSVGLTINPENSHVMLCGNPQMAKDTQQLLKQQRGMSKHLRRKPGHITTEQYW >LR134531.1|VEJ56913.1|3740935_3741601_+|Protein-of-uncharacterised-function-(DUF1454) MGHDTLTKAGIFSATLLTLMLSNTAFAVEPNASSSESSSAKHSETVVAPYLREDAPTFNLTVAQFREKFNTTYPHMVLAEYKTIKTLEVKSPLIRAASRINNTLYSSVAIDKSQRTIRSLQLTYLPPPPSEEKPEKPENIAKAEKADRAVLANYMAAFISLFEPTSTLEKCQNKANELLEKGKGSPFYQQKEGTLRFVIADHNEKGITFAIEPIKLSLSDK >LR134531.1|VEJ56914.1|3741766_3742537_+|Triosephosphate-isomerase 
MRHPLVMGNWKLNGSKHMVNELIAGLRHELSNVDGCDVAIAPPFIYLDQAKQAISDSHIALGAQNVDINLSGAFTGETSANMLKDIGAKYVIIGHSERRTYHKESDELIAEKFAVLKEAGLIPVLCIGETEAENEAGQTQAVCARQLDAVLKTMGAPALKGSVIAYEPVWAIGTGKSATPAQAQAVHKFIRDHIAKHDADVAAKIIIQYGGSVNDSNAAELFSQPDIDGALVGGASLKADAFAIIVKAAAKAKKAK >LR134531.1|VEJ56915.1|3742610_3743573_-|6-phosphofructokinase-isozyme-1 MIKRIGVLTSGGDAPGMNAAIRGVVRAALSQGLEVFGIYDGYQGLCEDRMEQLDRYSVSDVINRGGTFLGSVRYPEFRDPAAREKAIENLKRRNIDALVVIGGDGSYMGAKLLTERGFPCIGLPGTIDNDVAGTDYTIGFFTALDTVVEAIDRLRDTSSSHQRISIVEVMGRHCGDLTLAAAIAGGCEFIVLPEIEFNRDDLVAEIMAGISKGKKHAIVAITEHICDINELAKYIQQKTHRDTRATVLGHIQRGGSPVAYDRILASRMGAYAIDLLLQGYGGRCVGIQNEKMVHHDIIDAIENMKRPFKGDWLKTAKELF >LR134531.1|VEJ56916.1|3743917_3744796_-|Ferrous-iron-efflux-pump-FieF MSDQYGRLVKTAALCATTVATALLIIKVVAWWLTGSVSLLAALVDSLVDIAASLTNLLVVRYALQPADHQHTFGHGKAESLAALAQSMFISGSAIFLFLTGFQHLVNPQPLKAAGVGIGVTIVSLITTLCLVAFQRWVVRKTQSQAVRADMLHYQSDIFMNGAILAALGFSLYGFLHADAIFALAIGCYILYSALRMAYDAVQTLLDRALPDEERQEIMNIALSWPGIQGAHDLRTRQSGPTRFIQLHVEMDDNLPLIEAHKIADNVEQDILRRFPNSEVIIHQDPCSVVIS >LR134531.1|VEJ56917.1|3744944_3745478_-|Periplasmic-protein-CpxP-precursor MHKVNILIMASMLALSTSTVLAADGKNVEPENCSETGACLGSVSTTNSSEAVKAQIMHQHSLFDGLNLTVKQRQQMRDLIRQNYHDAMPKMYMDNMEAMHNLVIADHFDENAARAQAEFIAKAQVERQVALAKVHHQFYSLLTPEQRIVFNRNHSERMEKMQQHLDQMRKYEELDPQ >LR134531.1|VEJ56918.1|3745624_3746323_+|Transcriptional-regulatory-protein-YycF MSKILLVDDDREITSLLEELLELEGFDVIVAYDGEQAINMMDNSVDLLLLDIMMPKKNGIDTLKELRQNYQTPVIMLTARGSELDRVLGLELGADDYLPKPFNDRELIARIRAILRRSNWSEQQQEPEVSSNTLQVDKLRLNPGRQEASFDGQTLDLTGTEFTLLYLLAQRLGQVVSREYLSQEVLGKRLTPFDRAIDMHISNLRRKLPERHDGLPWFKTLRGRGYLMVTVT >LR134531.1|VEJ56919.1|3746319_3747699_+|Sensor-protein-CpxA 
MITFNSLTTRIFAIFWLTLTLVVMVVLMVPKLDSRQLSQVMDSERRQGTMLEQHVEAELASTPNSDLMWWRRLLQSIEKWSPPGQRMLLVTSEGRIVGVLKRNEMQMVRNFIGQADNADYPMKKKYGLQEMIGPFSVRDREEHYQLYLIRPANSPQSDFINLLFDRPLLLLAVTMVISIPLLLWLAWSLAKPARKLKQAADEVARGNLRESPELEFGPLEFRAAGASFNQMVNGLDRMVKAQQRLISDISHELRTPLTRLQLATALMRRRHGESKELLRIETEAQRLDNMINHLLALSRNQYKSEISRERLMANELWADVLDNAKFEAEHTDKKLEIAVPPGPWPIFGNRTALDSALENIIRNAFRYSDKHIVVTFSCDNQGIVIHVDDDGPGVAEEDREQIFRPFYRTDEARDRNSGGSGLGLAIVESAISQHGGWAKAEKSPLGGLRLTIWLPLYLR |
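The spacer header for LR134531_5 appears to pack five pipe-separated fields: spacer ID, start coordinate, spacer length, contig accession, and the tools that reported the spacer. That reading is inferred from the listing rather than from a documented format, and the names `SpacerHeader` and `parse_spacer_header` are hypothetical. Under those assumptions, a minimal Python sketch can parse the header and check that repeat + spacer + repeat spans the reported array coordinates:

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SpacerHeader:
    spacer_id: str    # e.g. "5.1" (array number, spacer index); assumed meaning
    start: int        # start coordinate on the contig
    length: int       # spacer length in bp
    contig: str       # contig accession
    tools: List[str]  # tools that reported the spacer

    @property
    def end(self) -> int:
        # Inclusive end coordinate, assuming inclusive positions.
        return self.start + self.length - 1


def parse_spacer_header(header: str) -> SpacerHeader:
    """Parse a header such as '>5.1|3737416|53|LR134531|CRISPRCasFinder'."""
    spacer_id, start, length, contig, tools = header.lstrip(">").split("|")
    return SpacerHeader(spacer_id, int(start), int(length), contig, tools.split(","))


# Values copied from the LR134531_5 entry above.
header = parse_spacer_header(">5.1|3737416|53|LR134531|CRISPRCasFinder")
spacer = "GGCTCACACGGCATAAAACCCGGGAAATGGATTAATTAGGTTATTTTGCAAAG"
left_repeat = "TTACCCTTGCTTTTTAATGTTAAAATTATAACAT"
right_repeat = "TTACACTTGCTTTTTAATGTTAAAATTATAACAT"
array_start, array_end = 3737382, 3737502

array_length = array_end - array_start + 1
# The spacer should be as long as the header says and start right after the first repeat.
assert len(spacer) == header.length
assert header.start == array_start + len(left_repeat)
# Repeat + spacer + repeat should account for the whole array.
assert len(left_repeat) + len(spacer) + len(right_repeat) == array_length
print(header.spacer_id, header.start, header.end, array_length)
```

The same parser should also accept headers that list several tools, such as `4.10|3682821|32|LR134531|PILER-CR,CRISPRCasFinder,CRT`, because the last field is split on commas.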
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
LR134531_4 | 4.10|3682821|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 3682821-3682852 | 32 | NC_031059 | Rhodovulum phage vB_RhkS_P1, complete genome | 5963-5994 | 7 | 0.781 |
LR134531_1 | 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540073-1540104 | 32 | NZ_CP031397 | Borreliella burgdorferi strain MM1 plasmid plsm_lp54, complete sequence | 39566-39597 | 8 | 0.75 |
LR134531_1 | 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540073-1540104 | 32 | NZ_CP019854 | Borreliella burgdorferi plasmid lp54, complete sequence | 39468-39499 | 8 | 0.75 |
LR134531_1 | 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540073-1540104 | 32 | NZ_CP019765 | Borreliella burgdorferi strain B31_NRZ isolate B31 plasmid p_lp54, complete sequence | 39468-39499 | 8 | 0.75 |
LR134531_1 | 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540073-1540104 | 32 | NC_011784 | Borreliella burgdorferi ZS7 plasmid ZS7_lp54, complete sequence | 39440-39471 | 8 | 0.75 |
LR134531_1 | 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540073-1540104 | 32 | NZ_CP017218 | Borreliella burgdorferi strain B331 plasmid B331_lp54, complete sequence | 14296-14327 | 8 | 0.75 |
LR134531_1 | 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540073-1540104 | 32 | NC_013130 | Borreliella burgdorferi N40 plasmid N40_lp54, complete sequence | 39478-39509 | 8 | 0.75 |
LR134531_1 | 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540073-1540104 | 32 | NC_013128 | Borrelia burgdorferi 297 plasmid 297_lp54, complete sequence | 37228-37259 | 8 | 0.75 |
LR134531_1 | 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540073-1540104 | 32 | NC_013129 | Borreliella burgdorferi JD1 plasmid JD1_lp54, complete sequence | 39419-39450 | 8 | 0.75 |
LR134531_1 | 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540073-1540104 | 32 | NC_001857 | Borreliella burgdorferi B31 plasmid lp54, complete sequence | 39468-39499 | 8 | 0.75 |
LR134531_4 | 4.4|3682461|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 3682461-3682492 | 32 | CP046512 | Bacillus cereus strain JHU plasmid p1, complete sequence | 493358-493389 | 8 | 0.75 |
LR134531_4 | 4.9|3682761|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 3682761-3682792 | 32 | NZ_CP048619 | Legionella pneumophila strain ERS1305867 plasmid unnamed, complete sequence | 58838-58869 | 8 | 0.75 |
LR134531_1 | 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540073-1540104 | 32 | NZ_CP020746 | Bacillus mycoides strain Gnyt1 plasmid unnamed3, complete sequence | 49366-49397 | 9 | 0.719 |
LR134531_4 | 4.8|3682701|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 3682701-3682732 | 32 | NZ_CP017672 | Providencia rettgeri strain RB151 plasmid pRB151-NDM, complete sequence | 70934-70965 | 9 | 0.719 |
LR134531_1 | 1.3|1540013|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540013-1540044 | 32 | NZ_CP028957 | Morganella morganii strain AR_0133 plasmid unnamed1, complete sequence | 5233-5264 | 10 | 0.688 |
LR134531_1 | 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540073-1540104 | 32 | NZ_CP016932 | Yersinia enterocolitica strain YE165 plasmid unnamed1, complete sequence | 68575-68606 | 10 | 0.688 |
LR134531_1 | 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 1540073-1540104 | 32 | NZ_CP025801 | Yersinia ruckeri strain SC09 plasmid pWKY, complete sequence | 2234-2265 | 10 | 0.688 |
LR134531_4 | 4.1|3682281|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 3682281-3682312 | 32 | NC_048080 | Acinetobacter phage vB_AbaM_B09_Aci05, complete genome | 58769-58800 | 10 | 0.688 |
LR134531_4 | 4.1|3682281|32|LR134531|PILER-CR,CRISPRCasFinder,CRT | 3682281-3682312 | 32 | NC_048074 | Acinetobacter phage vB_AbaM_B09_Aci01-1, complete genome | 58347-58378 | 10 | 0.688 |
1. spacer 4.10|3682821|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NC_031059 (Rhodovulum phage vB_RhkS_P1, complete genome) position: , mismatch: 7, identity: 0.781
aagccaaacctatcgagcgggcgttctcgcat CRISPR spacer aggccaagcccatcgagcgggcgttccgcgat Protospacer *.*****.**.***************. **
2. spacer 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP031397 (Borreliella burgdorferi strain MM1 plasmid plsm_lp54, complete sequence) position: , mismatch: 8, identity: 0.75
tcttaattttgaatataaaaatccctgccaat CRISPR spacer ttttatttttaaatataaaaatcccctttaag Protospacer *.*** ****.**************. ..**
3. spacer 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019854 (Borreliella burgdorferi plasmid lp54, complete sequence) position: , mismatch: 8, identity: 0.75
tcttaattttgaatataaaaatccctgccaat CRISPR spacer ttttatttttaaatataaaaatcccctttaag Protospacer *.*** ****.**************. ..**
4. spacer 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019765 (Borreliella burgdorferi strain B31_NRZ isolate B31 plasmid p_lp54, complete sequence) position: , mismatch: 8, identity: 0.75
tcttaattttgaatataaaaatccctgccaat CRISPR spacer ttttatttttaaatataaaaatcccctttaag Protospacer *.*** ****.**************. ..**
5. spacer 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NC_011784 (Borreliella burgdorferi ZS7 plasmid ZS7_lp54, complete sequence) position: , mismatch: 8, identity: 0.75
tcttaattttgaatataaaaatccctgccaat CRISPR spacer ttttatttttaaatataaaaatcccctttaag Protospacer *.*** ****.**************. ..**
6. spacer 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017218 (Borreliella burgdorferi strain B331 plasmid B331_lp54, complete sequence) position: , mismatch: 8, identity: 0.75
tcttaattttgaatataaaaatccctgccaat CRISPR spacer ttttatttttaaatataaaaatcccctttaag Protospacer *.*** ****.**************. ..**
7. spacer 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NC_013130 (Borreliella burgdorferi N40 plasmid N40_lp54, complete sequence) position: , mismatch: 8, identity: 0.75
tcttaattttgaatataaaaatccctgccaat CRISPR spacer ttttatttttaaatataaaaatcccctttaag Protospacer *.*** ****.**************. ..**
8. spacer 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NC_013128 (Borrelia burgdorferi 297 plasmid 297_lp54, complete sequence) position: , mismatch: 8, identity: 0.75
tcttaattttgaatataaaaatccctgccaat CRISPR spacer ttttatttttaaatataaaaatcccctttaag Protospacer *.*** ****.**************. ..**
9. spacer 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NC_013129 (Borreliella burgdorferi JD1 plasmid JD1_lp54, complete sequence) position: , mismatch: 8, identity: 0.75
tcttaattttgaatataaaaatccctgccaat CRISPR spacer ttttatttttaaatataaaaatcccctttaag Protospacer *.*** ****.**************. ..**
10. spacer 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NC_001857 (Borreliella burgdorferi B31 plasmid lp54, complete sequence) position: , mismatch: 8, identity: 0.75
tcttaattttgaatataaaaatccctgccaat CRISPR spacer ttttatttttaaatataaaaatcccctttaag Protospacer *.*** ****.**************. ..**
11. spacer 4.4|3682461|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to CP046512 (Bacillus cereus strain JHU plasmid p1, complete sequence) position: , mismatch: 8, identity: 0.75
gccagct--ttaaacattgacaatatccgcttgc CRISPR spacer --tagttaacaaaacattaacaatatgcgcttgc Protospacer .**.* . *******.******* *******
12. spacer 4.9|3682761|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP048619 (Legionella pneumophila strain ERS1305867 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
aaacgtgcgggaaaaatacggctcaacgctat CRISPR spacer gaatctaaggaaaaaatacgactcaacgctac Protospacer .**. *. **.*********.**********.
13. spacer 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP020746 (Bacillus mycoides strain Gnyt1 plasmid unnamed3, complete sequence) position: , mismatch: 9, identity: 0.719
tcttaattttgaatataaaaatccctgccaat CRISPR spacer tgttaattttgaatattaaaaccccaattgac Protospacer * ************** ****.*** ....*.
14. spacer 4.8|3682701|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP017672 (Providencia rettgeri strain RB151 plasmid pRB151-NDM, complete sequence) position: , mismatch: 9, identity: 0.719
ataatgctcagttaataaccgtttcttggaac CRISPR spacer caattgctgagttaatcaccgtttcttttgag Protospacer * **** ******* ********** .*
15. spacer 1.3|1540013|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP028957 (Morganella morganii strain AR_0133 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
aagcccagcccgcttaactgcgttcctccatc CRISPR spacer tcttaccttccgctttactgcgtttctccatc Protospacer . * .****** ********.*******
16. spacer 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP016932 (Yersinia enterocolitica strain YE165 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
tcttaattttgaatataaaaatccctgccaat CRISPR spacer tcgtaattttcaatataaaaatcagtaatgcc Protospacer ** ******* ************ *. .. .
17. spacer 1.4|1540073|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP025801 (Yersinia ruckeri strain SC09 plasmid pWKY, complete sequence) position: , mismatch: 10, identity: 0.688
tcttaattttgaatataaaaatccctgccaat CRISPR spacer tcgtaattttcaatataaaaatcagtaatgcc Protospacer ** ******* ************ *. .. .
18. spacer 4.1|3682281|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NC_048080 (Acinetobacter phage vB_AbaM_B09_Aci05, complete genome) position: , mismatch: 10, identity: 0.688
tggccagtgattatgttcaacataaacagcac CRISPR spacer cttcttaccattatgtgcaacataaacatcac Protospacer . *. .. ******* *********** ***
19. spacer 4.1|3682281|32|LR134531|PILER-CR,CRISPRCasFinder,CRT matches to NC_048074 (Acinetobacter phage vB_AbaM_B09_Aci01-1, complete genome) position: , mismatch: 10, identity: 0.688
tggccagtgattatgttcaacataaacagcac CRISPR spacer cttcttaccattatgtgcaacataaacatcac Protospacer . *. .. ******* *********** ***
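The Mismatch and Identity values listed above are consistent with counting the positions where spacer and protospacer differ and dividing the remaining matches by the spacer length (for example, 7 mismatches over 32 nt gives 25/32, reported as 0.781). The scoring method actually used by the database is not stated here, so the Python sketch below is an assumption that reproduces the listed values for ungapped alignments only; `count_mismatches` and `identity` are hypothetical names.

```python
def count_mismatches(spacer: str, protospacer: str) -> int:
    """Count positions where two equal-length, ungapped sequences differ."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))


def identity(spacer: str, protospacer: str) -> float:
    """Fraction of matching positions, rounded to three decimals."""
    mismatches = count_mismatches(spacer, protospacer)
    return round((len(spacer) - mismatches) / len(spacer), 3)


# Alignment 1: spacer 4.10 against NC_031059 (sequences copied from the listing).
spacer = "aagccaaacctatcgagcgggcgttctcgcat"
protospacer = "aggccaagcccatcgagcgggcgttccgcgat"
print(count_mismatches(spacer, protospacer), identity(spacer, protospacer))
```

Alignment 11 contains gap characters (`--`) and is outside the scope of this sketch, which raises an error or miscounts if the two strings are not aligned position by position.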
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1065846 : 1082230
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >LR134531|1065846:1082230|DBSCAN-SWA CATGCCGATGAAAGAACCTCAAAACTACCTTTCTCTGAGCTATATCCTGGTATTGGTCATGACATTACTGGGCGCCATCGCCAGTTACGCTTACCGCATCCTGAACGGTGAAGAGTTTCGCTGGTCGATTCTGCTATTGCAGGCGACGGTGGCCATTTTTGCCGGAGCGTTAGTGCTACTGGCTGCCAGCTATTATCACTGGGCAGCAGAACTTGCTGGCGGTATTGCTGGCTTGGCGGGATGGTCCGGTGCTGAATTTATTAAGATTCTGGAAAAGCGTTTTTTGAAGCGTATAGATGGAGGACGTGATGATAACTAGCCAAAGTGGCGTAGAGCACATTAAATCCTTCGAATCTTGCCAGTTAAAAGCTTATCTCTGTCCGGCAAAAGTTTGGACGATTGGCTATGGCCATACCTCTGGGGTTAAGCCAAACGATCAAATTAGTGCCATGCAGGCGGAACGTTATCTTAAAGCGGATTTGGTACGCGTTGAACAGGATGTGTTACGGATTGTTAAGGTTCCACTAACCCAAGGGCAGTTCGATGCACTGGTTTCTTTTGCTTTTAACTGTGGCGCTCGAGCGTTAAGTACTTCGACGTTGTTACGCAAATTAAATCAGCGGGATTACAACGGTGCTGCCGATGAGTTTGGTCGCTGGGTTTATGCCAACGGTAAACGGTTTGCCGGTTTAGAACGGCGTCGCCGTTTAGAAAAACGGATGTTTGAATCATGACATTTTCTGCACGAATGATGACTACCGCTTTAACCGGTGTATGCATCGTTTTGCTGCTGCTTTATGCCCGCTGGCTTGGTAATCAACTGAGCCAGTTAAGAAATGAGAAGCAACAGGCAGTGGTGGCGTTAGCAGAGGAACGGGCTTATTCAGCCAAGATCAGGGCCCAGTATAGGCAAATCCAGGAGGTAATGGATGATGTTGCAGAGCAAAAACAAGAGAGCGAAAAACGCACTGTGGCGTTGCAGCGAGCATTGGCTCAAAGCCAACTGGCGAGTCCTTGCGTTGCTGAGCCCGTTCCTGATGCTGTCACTCAGCGGCTGCGCGAACGAGTCGCCGAGGTCAACGCCACCGCTGCCGGTGCCAAAAATGCTGTTCCGCCCGTGCCAGGCACCTGACTATCAGGTCCGCTATTACGGTGATTATCCTGGATATGTCGCTGAACTGCTGGCGGTGATTGAACGGTGTAATGGGCAATTAGAGGGGGTGAGGAAGGTGGTTGAGTCTTATACTTCGTTTGATAACGAAATTTTATCGCCAAATAACAACAAGAATAAAGAAAAACAATTAAAAGTTAAATAAACAACTAAAAGGGGTTGTTTTTATCTGATTATCAGGTATAGTAACTCCCATAAAGTCACAAGATTGTATCCAAACAAAGCATAACAAAAGGCCTGCATTTGAATGCGGGCCTTTTGCGTTTTATTGGCTAACTATCTGGTTTATTGATTTTATTTGATGCTCACTACTTTGTGGTTTGTGGTTTGTGGGCGTTAGTTATGGATTAATAGTGTTTAATGATGATCAATCACAGCTGAGTTCTTGAGCACTAAAAGTTGTTAATTGACGATTCGTTTATCTATTTCCCTATTTAAGCCCCGCTTTGCGGGGCTTTTTTGTTTCTCATATTCATTGAACCGCTATTTTGAGGTTTTTATGGCAACAATCATTGAACAGCAGCTGAAAAGTCTGCTGCAACCCCTGCTGGACTCCAACCTGATTATCAAAGGGGAAACGGTACCGCAGGAACCTTATGCCGAGTTAAGTACGGTTTCCTGTACTGCGCTGGGGATGAGTGATGAAGTTGATCGAAACGTCAGTGATGACGGTTATTTGATCGTTCGTGGCCAGCGTCGGGCTGAAATTGCTATTCACTATTACGGCGGTGATGTGGTTGAGCAATTAAGCAAAA
TCAACGATGGGCTGCGAAAAGTATCTGTGTCGGAGAAGTTTCAACTGGCACAAATTGGAATAGAGGGAAGCGCTCAATTAAAAACAGAAATGAAAGAGGATGCCCAATGGACACCGAATTCAGAAACCTATCTGAGCTTTTTTATCCACTATTCCGTGATTATTAAAGACGCTGTAAGCGTCATTGATAATGTTCAGGCTACCTCTGATGGTAGCAAAACTATTTTTAACTTAACGAGGTAAAACAATGGGTTCTCTGAATCAGATTGTGAACGTAAATATTGCATTAAGCACTACCAGTGTACCGCGCGGCGTGTTTGGCGTGCCAATGATTATTGCTCCGCTAACAACGTTTACCGAGCGCGTTCGCGTTTATTTTGACTATAACGCAGCTCAAGAAGATGGCCTGCCAGCCGATGTGCTGAAAGCGCTGAGCGCGGTATTTAGTCAGACTCCACGTCCGCAAATGTGTAAGGTGGGTCGTCTGGAAGCTGATACAGAAGGTAAAGTTGTTGCTACTACGCTGGCTGCTCAGCTAACGGCGATTCAGGCTGAAGATGCCAACTGGTACGGTTTTGCCTTGACTGAACGAACGGCTGCACTGCAAATGGCGGCGGCTGAGTGGGCTGAAACTCAAACTAAAATGTTCTTCACTTCCTGCGCAGAAGCAGCGGTAACTGATGCCAGCAGCAAAGAAGATACTCTGTCTCAACTGGCGGCGAAAAATTACCTGCGTACCGCCGTTATTGTTGATAAACATGCGGCAGAACAGTATCTGGAAATGGCTTGGATGGGACGCTGCTTTACCATTGCTCCGGGTGGTGAAACCTGGGCACTGAAACAACTTTCTGGTGTACAAGCCTCTGACTGGAGTGCTACTGAGCAGCAAACCATTCTGAAGAAAGGCGGGAATACCTTCGAACGTTTTGCTCCGCAAATTTATCTGACCACGCCAGGCAAAGTAGTTAGCGGTGAATGGGTTGACGTTATCCGCTTCCGCGACTGGTTAGCTGATGCTATTCAGACCAGCCTGAGTACGATGATGATCAATCGTAATAAAGTTCCTTATACCGACGGTGGTATTGCTCTGATCGTGAATAACCTGACCGGCTGCCTGATTGAAGGCCAGCGCGTAGGAGGTATTGCGCCGGATGAAATCGATGCTGAAGGTAACAACGTTAAGGGCTTTGTGGTGACTTATCCACGTAGTGTTGATGTCTCTTTCCAGGATAAAGCCGATCGTATTTTGAATTTATCCTTCTCCGCCCGTTTAGCCGGTGCGATTCACTTAACCAATATCAACGGCAATCTGTCGTATGAATTACAATAATAAGGAAAAATATCATTATGGCTGCTGAATTAACGGGTACTTATAAAGGCGATCAGGTTTTTGTTACTGTTGGCCCGGTTCTTATTTCTGGTTTTAGTGACGGTGATGCAATTACGGTAAAACGTGCTGAAGGTCTATACACTTCAAAAGTCGGTATTGATGGTGGTGTGGCGCGTGTACGTAATGCGAATAAGTCTGGTTCGATCGAAATTAAATTATTGCAAACCAGCAAAGTTAACGATGAACTTTCAGATTTGTTTTATGTGGACAACTTTAATGAAGATGGTTCTCCGGTGTTGCCGATCAGTGTGACTGATGGAAATGGCCGCACATTATGCTCCGCTGGTCACGCATGGTTGAAAGCGGTACCTGATGTCTCTTTTGGTAAAGATGTTTCCGACCGCCTATGGGCATTTGAGTGTGCTGATTTAAAAATTTATGTTGGCGGCAACTAATTTTATTCATTAGTTAATCAAAAACTATCGCAAAGTTAAATTAATTTTATTTTTTAGCTGAATAGTCAGACATATTCGGGAAGCTTCGGCTTCCCTTTTTATAAGGAATAGATATGCAAGTTGAAACTTTTATTATTGGCAACAGAGAATATACAGCAAAGAAGATGAATGCATTTGATGCTGGTCGTTTCTTATTGAAAATTAAAAATATAGCGG
CACCTGCATTAACTGCATTTGCTCAGGCTGATAATGCGAATGATGTTAATTTATTTGAGCTTTTTTCTGGGCTGGATGAAGAAGCACATGAAAAAGTGATTTTCCCAATTCTTGCGGCTTCAGCCGTTTATTCTATTGAAGATAAAAGAAAAGTCGCATCTATCACTGATATGAATATGTGCTTTAACGTTGATACATTGCTCGATTTTTATCTTCTGGTTTGGGAGGTGTTAAAGATGAATTTTGCCCCTTTTATCGAGCGCGCCAGTACCCACTTTGGTTATCAAGATATAGAGCAAGCGGCCAGCGAGTAGCCAGCGATAGTGCAGGAACGTTGAGAGAAGATCTACAACAGGAGCTTTGGATTTGGCGCCCAATTATGGCACGCCTGGTGACTTTACAAGAAGTGAAACAAGGTGAAGTCGATACTGATGATTTATTAAAGCTCAATGCACTGTTGGATATGCGCGAAACGATGCAGTTACAAGCGCAGTTGGAGAACCAAACATGAGTAATCAGAATATTGGGGCTATGCAAGTAGCCAATTCATTAGATGGATTAAGAGCATCGTTACCACCACTTAATAAAGTTAATGAAAAAATAGAAGACACGGTTAAAAAAATAGGGGTACAGGTTAATATTAATCAGAGTATTGCTGAAAAAAATATTAGGCATATATCTAATAATGTTAACAATCTATTGAGTGGATTAAGTAAAGTATCGTCAGCGGTATTAACGCGTAATGCTCCCAGTATCAATCAAGCGGTAAAAAAATTCTTTTTTATTTCGAATATTACTGAACGAATAAAAAATGATATTTCAATTAAGATAAACGTTGCGCCCTCTCAATCTAATGAGAAATCTTCTTGCCCAAATGATGGAGAGCGTGGTGGTGATAAAAAAGATATCTGGGACACCATCAAGTTTTTTACTGAATTGGCTGCCGACCTTCTCACTATACTCGATATATTAGGCGGGTTTCGTAAACTGAAGAGAATCTTCAGGTTAATGGGGGGAGCATTTTCAAAGGTATTTTCTCGAATAGGTTCTTTTGCCAAGGGGGTTGGTAGCGCTGTTGGTAAGGCTCTTAAGTCAGCTGCCGGATCTATTACTCGACAAGCCAAGCGTCTTTTTTCTGGTGTCAAACAGATTGCGAGCAGTGTTAAGAGTAAGGTCGGCTCTGGTATTAGAGGCGGATACACTAAAGCCAAGAATTTTCTTTCCTCTGGTGGTAATAAGGTAAAAACGCTTTTTTTAGCCGGGAAGGATAAAGCTAAAAACCTGTTTACTTCTGGGCGAGAAAAGGCAAAAAATTTGCTTTCTTCAGGCAAGGAGAAGGCGAGGAATTTACTTTCTTCAGGGAAAAATAGGGTTAAATCAGTTGCCAACGTTGGAAAACAGCTATTTGGCCGGTTAAGCAATTTTATTAAGCCCGCAGTTTCAAAAGGAAAAGACTTACTCTCTTCAGGGAAAAACAGAGTTAAGTCAGTTGGTAACGTTGGAAAACAGTTATTTAGCCGGTTAGGCAATTTTATTAAGCCTGCGGTTTCAAAAGGAAAAGATTTACTCTCTTCAGGGAAAAACAGAGTTAAGTCAGTTGCTAACGTTGGAAAGCAGTTATTTGGACGATTGGGCAATTTTCTCAAACCTGCGGTTTCCAAAGGAAAAGATTTACTCTCAAAAGGTAAAAACACAGCTAAATCATTGGCTGGAAGAGCTTCTGGCTTAGGTAGCGATATTCTTAAGAGAGGTGGGAGCTTATTTACTAAGATGGCTGGCCCAATGCTAAACATTGGCAAAAGCCTGATGAGTAAAGTTCCTGGTGGCATTAAGTCAGCCATGAAAACAACGTTTACTGCAGGAAAAAGCCTGCTGAGTAATCCGAAGGTACTCCGAACGTTGGGTAGGGTGGCTTCTGTCGGTTTACGCGTTGGTCGGATGGCGACACCGATTGGTTGGGCAAGTCTGGCTGCTGAGGGCGCTATTCGATTAGGCTACCA
TGCTTATAAAAAGTATCAGGCATCAAAAGAGAATGAAGGAACACCAGGGAGTGGAGAAGGCCCAGGTAAGGATAACTCTGATGTTTATCCACAAACCGTTACCCCAGGAATGGCTGCGGCAGCCAGTGCTGCTAATGCAATGAATTACATGATTAACAGTAATCCTCAGTTTCAAATCAATATATTACCGGGAACTCCTGAACAGCAGATTAATGATATTCGTCAGGCCGCTTTGGATGGCACTAAGCAAGGCGAAAAACAGTTAACTGCAACGATTAATGGGGGTATGTAATGAGTAGTGTTTGGGGGATATTAACAAAACTTTTAAATGATAAGTCATCTGGTGTTGTGCGGTTTCAAAATGCCTATTCAAACATGGAGTTTGATGTCGTTACCAGTGAAAGCCATACATGGACGGCGGATGTGACAACCAATCCAGTGGAAACTGGGGCGGTTATTACAGACCATGTACAGCTAAAACCGGACTCGCTTGAAATCTCGGGGATCATCAGCAATTCTTCTATTGAACGCTGGCGTGGCCGTTTTCTGGAAACGCTGTCTGATTTATTCAATAAAGAATCGAATATACAAAAAGCGTTCGATCAATTAAGGATGCTGTTAGAAAATCGCCAACCGGTGATGGTGTATACCAAATACCGCAATTACCCCGATATGGTGCTAACCCGATTATCCATTCCGCGTAAAGCAGGGGAAGGGGATTCCATTGAGTTTTCTGCGACCTTTACCCATATTCGTCGAGTATCTACGCTCATTGTTGATGCAGAAGAAGCTGGAATTAATCCTAAACAGGCTGATTCTCCGGCGACATCACGTAAGTCGTCTCCAAAAAATAATAAAGGGCCAATACACCCGTCAACCGCAAATGACGAAACAGTTACCCATATTGAAAATAAAGAGACAACGGTTGAAACGAATGTGTCGGTAGAGGTGAATACTTAGAAAATAGCTATTTTCTATTGTAAGGTTTAATAGGTAATTATATGGCAACTGTATTAAAAATACCGCTGGATGCGGGTATTGCCGATCAACAAATGGATATTACTCTGGATAAAAAACCGTTAACACTAAGAGTAACATGGAATGAATACGGGCAATATTGGTATTTTTCATTAAGCGAACGTAATGGCGAGTCTATTATTGATGGGATTAAGATGGTTAAAGATACCTTTCTTCTTAAGCGGTATCAATTATCTTCACCAGAAGGCGATTTCATTTTTATGGATAATTATAGCGGGAAAGAACGTCCTGATTTTTACTCGCTAGGCAATGACCATTTATTGCTTTATCGCACAAAATATTAATTTGCTGATAAACACTTTTTTCCTTATTTAAATAAAAGGGAGCGATATGTTATTTAATCGTGTCGCAGAATTAGTGGTGGGTGAAGCTAATGGTAAAGCGGTTATTATTAACGATCTGCGTTTTTCTTTTGAAATTTCAAAAGATAATGACAAAACCACTAATAAACTAGCGCTTAAAATTTACAATATGAATAACCAAACGCGTAGTTTGGTAGAACGTGTAAATAATAATGTCATCTTAAAGGCCGGTTATGAAGATGATATTGGAGCTGTAACAATCTTTACCGGAACCGTTGTCAGTGCGTGGACGACTCGCGAAGGTAATGACACGCTGACTGAGCTTACCGTTCGCGATGGCGCATTACCTTTGAGACAAACCAAGCTGTCACTGAGTTATGCGCCAAATACATCGGCCTTGGATATCCTGGCCGATATATCCAAAGCATTTGGCCTACCGGTAAAACCCATGCCAGAAAAGATAATGGATAAACCTTATCGTCGTGGATTTGCCTTCTGTGGTAAGGCTGAAGTCGCGATGAAGGATGTGTGCCAATATCTTGGTTTAACCTGGTCAATACAAAACAACGAAATTCAAATCTTGAGTAAAGATAATCCGATCACGGATGAATTGGTTGTTTTAACCCCAGATAACGGATTAATTGGACTACCAACCAGAAT
TATCGATTCGACCAGAAATAAGTCCCAGGGAGAATCGTCTCCACCTTCAAAGTTAGTTCTTTCTGAAAGTCTTGGCGATAAGAGCCAGTATCAGATCGAGGGATATAACGTGAAGTGTTTATTGCAACCACGTCTGTATCCGGGCTGCTATGTCGGTCTGGAAAGCGACATGTTGCTACTCGATCCGAGTGCTGATAAGGAACGTAGCGAGCGCCCTCGAGCATTCTTCCGTGCCGAAGTCGTAACTCACAGCGGTGATACTTTCGAAGGTGAGTGGATTACCGAGTGTGAACTCAAGGCGATTCCTCAAGGAGGCAAGCGTGGCGGAAAATAATAATCTGATGCAGGCACTGCAAACCCTGATTCAGTCGGAGACCAGTCAGATCAATACCGCAGTGGATGGCATTATTGAAAGCTACGATGCCGGTATTGCCAGCGTTAAACCTATTCCTAAACAGCGTTTTGTTGATGGGACTTCACTGGATTATCCGGTGATACCTAACGTGCCGGTGATGTGGCCACGGTTTGCTGGCGATGTAGCAGGGGTAAAAGGCCCGGTTCGGTCGGGAGATAAGTGCCTGCTGGTGTTTTGTCAGCAAGCGGCGGATGACAGTGATGATGAACGTCGATTCTCCCTGACGGATGCTTACTGCATTGTAGGCGGTTTTGGCGCAGCGAAAGATCGCGCAGCGGAGAACGATCAGATGCAGCTCTATTTTGGTGAGGCGTATATCGCCCTGACCGAAGAGGGCAAACTGCTGATCAATGCCCCTGCCGGCGTTGAGATCACCACGCCAGAAACCCTGAATAAAGGGTTATTAACCACCGAAGGGCAGCTTAACTATCAGTCGGGCATGTCAGGAAAAGGTGGGGCAACTATCAACGGCACGGTAAAAGCAACCGGTGACGTTCAGGGAAGCGGGATCTCATTAACTCAACATACCCATCAGGAACACGATGGACCTTCGACAGGACCCGCAAAATGATTGATTTAAAACTCGATGCAACGGGAGATCTCGATCTGCAGCGTAATGATTTGCTGTGGATTGATGGTGCAGAACGTGTTCATCAGCAGCTACAAATCAAGCTAAAGCTATGGAAGGGAGAGTGGTTTCTCAATACTCAATTTGGTACGCCTTATCTGCAACAAATTCTGGGTAAGCAAATCACCCTGAATGGTGCACTGGCGGCGCTGAAAAACAGCATTAATGAAGTTGATGGCGTGTTGGAGATTGAGCAATTTAACTATGACTTTGATCGGCAAACGCGGCAGTTAAGCGTGCAGTTTGCTGTTAAAACCCCTTATGGCTTAGTCAAATATAAAGGTAATCAATAATGGCTTTAACTAAAGATGGCTACACAATCAAACGGTTAGCCGAACTTAAAAAAGAGTATGACCGACTGTTGATTAACCGTTTTGGGCCAATTAATACCCAGCCTGATTCGGTTATCGGTCAGTTGGAAGGCATTTGGGCCGAGGCATTGGCCAATATCTACGAACAGGCTCAGGATACCTATCATGCGATGTATCCTTTTAGCGCTGAGGGTGTATCCCTTGATGGTGCCGTTTCTTATGTGGGGATTACTCGCTTTGCCGCTTCCGCTACACAGGTTATCGCTGCGGTTTATGGTAAAGAGTCTACCTTATTGAAAAGCGGCGCGCAGGCAACGAACGGCGGCCAACGTTATCAAAGTATGTTTGATGCGGTGATTAGTCGGGCTAATGCGGTGGATACGTCGATTGAAATGAATGTGCGGGATAAAACCCAATATTCCATCAATATTAATGGCACCCTCTTTAACACTATCTCCGTCGAGGGAGAGGACGCCACACAGATTATGCAAAAAATCGCCGATCAGCTTAATCCTAATGTTCTGTCCTATGAGATAAAAGATAAGGTATTACGTATTTTTGCCGTTGATGGCATTACACCTTTTGCATTGTCAGTCGGTGAACACCTCAAGCTGGTGCGTATTGGTTCTCCGGCTAGATTTATAGC
GATGCAGGAAGGGCGTCATGTCCTCCCATTGGGAGCATTGACTGAAATTGTTACACCACGTAGTGGTTGGGATGCGGTATGCAACTTAGCGGAAGGCGTGGTTGGCAGAGAAAGGGAAAGCGATGCACAACTGAGAGTTCGCTTTGAACAATCCCGTCAGTCTACCGGCTCAGCAACGGTGAAAGCGATTAGAGCCAGATTAATTCAGGAGGTCGCCGGGGTTAGTGAAGTTCATATTTTTGAAAATCGAACCGGTTCCATATCTGAAGAGGGGATGCCTCCTCATGCTTTTGAAGCGCTAGTGGTGGGGGGCGATAATCAAAGCGTTGCTAATGCATTATGGCGTCATAAACCCGCTGGCATCGAGACTTATGGCGCTCATGCCGTGCTGGTGAAAGATGAGAATGGCGACGGCCAACAAATCAATTTTTCGCGGCCAACCCAGAAGCTGGCATGGATTAAAATTAATGTCACAGGGCTATATAACGAAGAAAACCTGCCACAGAACGTTATCAGCAATATTAAAAAAGCGGTGTTGAAATATGGTTCAACGCTGAAGATTGGCGACGATATTATTTTACAGCGCATGTTAGGCCCTATTTATAGCAATACCAGTGGATTGGCTGAAATTACCATTGAAGCGGCTATGACCGAGGGCCCTGATGATGAGCCAGTTTATCAGTTGGAAAATGTTGCCATTGATAAACGTAGCGTGGCCTTGTTTGATGAGTCGCGGCTGGAGGTGATTGGGCTATGACATTTCCCTATAGAAAAACAGCCCTTTCACGGTTGGTAGGGCAATTTCAAGATAAGCCAAAAGTAAAAGCGTTAGCCGAATCTATGGTGGCTCCGCTGGAGGCCATATCTGCCGATTTTAACGATCTGAAAAACGGTCGTTGGGTGGATAGTGCAACAGGCGCTCAGCTTGATGGCTGCGGTTATATTGTAGGTGTTCCACGTTTTAGTCGAGAGGATGACGAGTACCGTATTGCCATTAAGTCGCGAATTTTAAGCAATATTTCTCGGGCGCGACCGCAGGATTTAATTGAAGGTGTGCGCTTTCTAACTAAACCAACGGAAGTTCAATATCTGGAGAGCTATCCGAGCTGCGCCATGTTATTTACCGATGGTCGACAGATTCCGGAAGGTAGTCAGGCCATTTTGCAGGATATAGCACCGGTGGCGATAGAAAATGTACCGCTGATGGTGAGCTATGGGCGAGCGAAACCTTTTAGAGCTGGGCGTTTGGCCAATCCGATGAATATGCGGGTTGATATGACGGAGAACGGCAATAGTTCGCTGTTAACTAATGATAAACATCTGGTTATTAGTGGCAGTAATCCTGTTGGCTCCGCGCGCTTATCCGGTATGGCCCCAGATAAAACACCGTTAACGGCTAATGGCACAAAAATTAGCGTCGGCGCCGGCGTTTTATCCGTAAGCACCACCTATCAGGTATTCGATAACGGATATCACTTACCAGGAGTCTTTCAATGACCACTTTTGCAGAAAATCATATAGTTTTTCCTGACGGGCAGCTAAACGTAGCGCCGTTGCCGGAAGCCGTGATGCATAACGGCTTTACCCCAGAAACCCGCGATGCGCCGGGTATGCCATTACCGGCTCAATATCTGAACTGGCTATTTCGTGATATTTACCGGAGATCGCAGGCACTAGAAGAAAAAAACCAAACCTTAAATAGCCGGGTTATTCCTGCCTGGATGCCGATTGCCTGTCCGATGGAAGAACCGCCACTGGGATATCTGAAATGTAATGGCGCTAAATTTGATAAAGAAAAGTACCCGGAATTGGCTATGGGGTATCCCTCAGGCGTGTTACCCGATTTGCGAGGGGAATTTATTCGTGGCTGGGATGATGGGCGAGGGGTAGATTCAGGAAGGGAGATATTGAGGGCTCAAGGAGATGCTATCAGAGATATTACAGGAACTTTGTTAGGGGTATTATCAAATGCGCCGGATCAGGAAAGTGTGAATTCATAT
GTAAAGGGGGCGTTTTCAGCAAAGCTTCTATTTGGTGGCGCAGCCATGGCGGGAACAGCTTATCGTTATGATGCGACGTTCGCTTCATCTAGAGTTGTTCCAACAGCTAATGAAAATCGTCCCCGTAATATCGCATTCTTATACATAGTGAGGGCTGCATAATGAAAACAACTTTAGATAATTATGGCTTTGCCATTTCCGACGGTTATGTTCAGGTTTATAACGTCGATCCAAAAAACGGTGAATTTATCAGTGAATCTGAAGAGTTCCTCAATCAAGGGGTAGGGCTACCGGCCTATAGCTATCTTGATAAACCGCTACCAGAAAAACAAGGGTTTGTCGTTTGCCGTCAAGGTGATGATTGGATATATCAGGAAGATCACCGAGAGGCGTTAGTTTATTCTACTGAAACCGGTGAGCAGGTTACCATTACCGAACCAGGGGCGCTGCCAGATAATCTTACTCCGTCTCAACCACAGACGCCGTTTGATAAGTGGAATGGTTCAGCATGGGTAATGGATCTTGATGCTAAGCAGAAAGCTGAGATAGATAAGGTAACAGCCGATAAACAGCGCTTGGAATTACAAGCCAAAGAGATGATTGCCACGCTATCTGATGCCATTGAGCTGGAGTTAGCTTTACCTGGGGATGATTTGAAACTGCTGGCATGGCGTAAATATCGGGTGTTATTAAATCAGCTTGATATGAAAAACGCTCAGGATATCGAATGGCCAGCCTTACCTGATTGTTAAACACCTCTATTTAATCGTTTTATAAACCTCCCATTTTTAATTTTTGATGATGCAAAGGAATATTGCTATTCCTTTGGCAGAGCTCTGCATTTTTTCGAGCAGAGTTATATTTATAAGGAAACCCAAATGGCTTTCACGATTGAACAGGAACAGGCGCTATTAGCACTGCTCAATGAAAAGAAAATTACCCTATCAGAATTACCCGCCGCCACAGAACTAGCTGCTGACGATCTGCTGCTAATGCGTCAGGGAATTCTCGATAAATCGGTGAATAATGATGTATTAAAGGCATACTTTACTCCTCCGGAGTCATCATTGATTGAATCGGGTATTGTTAAGTTAAGTAATGCCATCAACAGTGATGATGAATATATCGCGGCCACCTCTAAAGCGGTTAAATCTGCCCATAATATCGCCCTAAATGCTAGCCAAAGCGTAGCGGAGCAGGCATTAAGTAAAAGTGCTAATTTATCGGATGTAGCAGATAAGGAACAGGTTCTTAAAAACCTTGGTTTATTGGAAGCAGCAAGTCGAAGTGGAATCACAGCGTTAAATGATAAAAATGGCTGGCTATCTATTCCTGTAAAAGCTATGGGGAAATTAAGTAATGTTATTATTCAATGGGGAACGGTTACATTACCTTTAATTTCAGACGGTGGTATTGGTGAATGGCGTCAAAGCTCTACCAGTTTTTCGTTTCCCATATCATTTCCTAACGCATGCTTTTCTACTGGCTGTAGTCTGCTACATGGAGGTTCATCAACTCAATGGGTAGCTATAGCTAGTGCATCCCCATCAACCAGAACAACGGGTAGGGCATTAATACAGACGCGTTATCCTCTAGAAACTAATCCATCAATTACTTGGATAGCAATTGGATATTGAGGTTTTTTATGTCATATTTTTATAGTGCTCACGAAAATGCATTTTTTCCTGCCAAATTCAAAGAGGCATATATAGCTGCAAATACATGGCCAGAGGATCTTCTTGAAGTCTCAGTTGAGATATTTAATCAATACTCAATACAGCCACCTGAAGGTAAAATTCGTGTGGCTGGTGTAGATGGACTGCCAACTTGGATAAGCATCCCTGAACCAATAAAAACGCAATATGTGGCTACTGCTGAAATAGAAAAATATAACCGAATATCAGAAGCGAATAGAATTACTCAAACTTGGCAGACTCAATTAATATTAGGAATTATTAGCGAGCAGGATAGAGAAAAATTAACGAACTGGATGTTATATT
TACAAGCGGTGCAAGCCATAGAGACCCAAGCCGCCCCTGATATTGACTGGCCTAAAGCTCCGGCGGCATAGGTAAATAAACATGCTAAGAACAGTGATATATATGACATCATTTTTGATAGGTAGATGAAGTCATATCAGACGAGCTATATTTATGGCCCGTCTGATATTTGCACACTGTAATTCTTATCGTATTGATTTAATTAACATATTATTGAGTATGATTATGAAGCTTATTAAGAGGCTCTTACAATATAGTTAAAAGATATATTGCGCGGGCGAGTACTGAACCACCATGATACTCCTGTAGATGAAGCTAACGCAGCGGTAGAAACTGTCCCTGGTATACCAGAATCGGCATTCCAACCAAGTAGTTGCTCATTATTAGCCGATTTCGCTGTGCTTGGAAAAGATTTTGTTGTCATATCAGCATTATCAAATGGAATCCCTATAGCTGCAGCAGGTACACCTTCAGCGTCGTTACCAACATAATCAAGAATAGCCGTTCTGATTAAGGTCGGTGATTGATAGCTAATTAATGCTCGTCCGGTATCAATCCCGCGTCCATCATCCCAGCCACGAATAAACTCCCCACGTAAATCTGGTAATTTTAATAATGGATAAGCTTGTGCCAACTTAGGATATTGCGCAGCAGTAAACGCCGCACCATTACATTTTAGCCACCCTTTTGGTGGTATAGCTGACGGCCAAGGGATGGGAACACCTACCGGTAAAACTGAACCTTCTTCTAAACCAAGATTTTTAAGAACCCCCTATTTATCCCAATAGTTCTTTGAGACTATCTACTCTAAAAAGAGAAGGGGGAAACGGGGTGTTAATTGGCTATGTTAGGGTGTCAACAAATGACCAGCAGACGGATTTACAGCGAGAATCGCTTAGTCGTGCAAATTGTGAGCTAATTTTTGAGGATAAAATCAGTGGAGTGAAGTCCGAAAGGCCAGGGTTAAAACGGTTGTTAAAAACCGCTAAAGCCGGGGATACCGTTGTGGTGTGGAAGCTCGATCGCTTGGGGCGCAGCGTTAGACATTTGATAACGTTGGTTTCTGATTTGAAAGCCAGAGGGATTCATTTTAGAAGCATTACCGACAGCATTGATACCAGTTCTGCTGCCGGGCGTTTCTTTTTTCATGTGATGAGTGCGCTGGCTGAAATGGAACGTGAATTAATCGTCGAACGCACCCGGGCGGGTTTGGCAGTAGCGCGAGCACAAGGGCGGATTGGTGGACGCCCGAGGAGATTAACTGATGAACAGATACAGCAGTCCATCAGATTATTAAGCAAGGGGCATAGCCGAAAAGAATTAGCGTTAATTTATAATGTGAGTCTGGCGACTATCTATAAATATATGCCGGCCAGCTCAGTCGTCTCAGATTAACCACATATCGACATTAGACGAGCTATAAGTGGATATAGCTCGTTTTAATATTGATTAACACTCTCACCAAAACGCCTATTTTTAATTTTATTTTGCTAAGGAGTATCTGTAACTCCTTCGGCAGGGCTCTGCATTTTTTGAGCGGGCATATTTATAAGGAAAATCAAATGGCTTTAACTAATGAACAGGAGCTGACGCTATTAGCGCTGCTCAATGAAAAGAAAATTACCCTATCGGAATTACCTGCTGCTACAGACCTGGTTGCGGATGATTTGATGCTGATTCGTCAGGGAATTATCGATAAGTCGGTGAATAGTAGTGTGCTGAAAAAACACTTTACTCCACCAGCTTCATCATTAACTGGTGCTGGCATTGTTAAGTTAAGTAATGCGATTAATAGCAATGATGAATTAATGGCAGCTACGCCTAAAGCAGTGAGGCAGGCACTCGAATTAGCGATCACCAGAAGTATTGATGCAGTATATCCTGTCGGCGTTGTAATGTTTTTTGCTGAAAATAAAGATCCTAATATGCTATTTCAGGGTACAAAATGGGAATATTTAGGAGAGGAAAGAACAATCCGTTTGGCTAAAAAGGATGCATCT
GATCTTAAAGAGCTTGGTGGTGCTGATATGGCCACGCTATCTGTGGAAAATATGCCGGCGCATACACACTCATTTAGTGGCTCTACATCTAATTTTGATTATGGAACTAAGACGGTAAGTACGTTCGATTATGGGACGAAAACGACAAATGCAGCCGGGGAGCATATACATGATACAGCTGTGGGAGGGAATGGCTCTTATACACCATTTACGAACCACAGGCGTGAAGCAGGCATATCTATTAGAGGAGCGGATTATACATATAATTTAATGACTGCATATACAAGCTCAGGAGGAAACCATGCCCATGTTGTTGGCATTGGTGCACACAACCATACCGTAGGGATTGGTTCTCACAGTCACAGCATTAATGGAAATACAAACTCTGTTGGTTCAGGCAGTCCTGTACCCGTTGTAAATGCTTATATTAAGTTAATGGGATGGTATCGTTCAGCTTAA
Protein sequences of DBSCAN-SWA_1 >LR134531|1065846:1082230|1067515_1068013_+|VEJ54278.1|DBSCAN-SWA MATIIEQQLKSLLQPLLDSNLIIKGETVPQEPYAELSTVSCTALGMSDEVDRNVSDDGYLIVRGQRRAEIAIHYYGGDVVEQLSKINDGLRKVSVSEKFQLAQIGIEGSAQLKTEMKEDAQWTPNSETYLSFFIHYSVIIKDAVSVIDNVQATSDGSKTIFNLTR >LR134531|1065846:1082230|1065846_1066164_+|VEJ54275.1|DBSCAN-SWA MPMKEPQNYLSLSYILVLVMTLLGAIASYAYRILNGEEFRWSILLLQATVAIFAGALVLLAASYYHWAAELAGGIAGLAGWSGAEFIKILEKRFLKRIDGGRDDN >LR134531|1065846:1082230|1081333_1082230_+|VEJ54302.1|tail|DBSCAN-SWA MALTNEQELTLLALLNEKKITLSELPAATDLVADDLMLIRQGIIDKSVNSSVLKKHFTPPASSLTGAGIVKLSNAINSNDELMAATPKAVRQALELAITRSIDAVYPVGVVMFFAENKDPNMLFQGTKWEYLGEERTIRLAKKDASDLKELGGADMATLSVENMPAHTHSFSGSTSNFDYGTKTVSTFDYGTKTTNAAGEHIHDTAVGGNGSYTPFTNHRREAGISIRGADYTYNLMTAYTSSGGNHAHVVGIGAHNHTVGIGSHSHSINGNTNSVGSGSPVPVVNAYIKLMGWYRSA >LR134531|1065846:1082230|1080602_1081166_+|VEJ54301.1|DBSCAN-SWA MLIGYVRVSTNDQQTDLQRESLSRANCELIFEDKISGVKSERPGLKRLLKTAKAGDTVVVWKLDRLGRSVRHLITLVSDLKARGIHFRSITDSIDTSSAAGRFFFHVMSALAEMERELIVERTRAGLAVARAQGRIGGRPRRLTDEQIQQSIRLLSKGHSRKELALIYNVSLATIYKYMPASSVVSD >LR134531|1065846:1082230|1074091_1074757_+|VEJ54289.1|DBSCAN-SWA MAENNNLMQALQTLIQSETSQINTAVDGIIESYDAGIASVKPIPKQRFVDGTSLDYPVIPNVPVMWPRFAGDVAGVKGPVRSGDKCLLVFCQQAADDSDDERRFSLTDAYCIVGGFGAAKDRAAENDQMQLYFGEAYIALTEEGKLLINAPAGVEITTPETLNKGLLTTEGQLNYQSGMSGKGGATINGTVKATGDVQGSGISLTQHTHQEHDGPSTGPAK >LR134531|1065846:1082230|1068017_1069100_+|VEJ54279.1|DBSCAN-SWA MGSLNQIVNVNIALSTTSVPRGVFGVPMIIAPLTTFTERVRVYFDYNAAQEDGLPADVLKALSAVFSQTPRPQMCKVGRLEADTEGKVVATTLAAQLTAIQAEDANWYGFALTERTAALQMAAAEWAETQTKMFFTSCAEAAVTDASSKEDTLSQLAAKNYLRTAVIVDKHAAEQYLEMAWMGRCFTIAPGGETWALKQLSGVQASDWSATEQQTILKKGGNTFERFAPQIYLTTPGKVVSGEWVDVIRFRDWLADAIQTSLSTMMINRNKVPYTDGGIALIVNNLTGCLIEGQRVGGIAPDEIDAEGNNVKGFVVTYPRSVDVSFQDKADRILNLSFSARLAGAIHLTNINGNLSYELQ >LR134531|1065846:1082230|1077264_1077933_+|VEJ54294.1|DBSCAN-SWA 
MTTFAENHIVFPDGQLNVAPLPEAVMHNGFTPETRDAPGMPLPAQYLNWLFRDIYRRSQALEEKNQTLNSRVIPAWMPIACPMEEPPLGYLKCNGAKFDKEKYPELAMGYPSGVLPDLRGEFIRGWDDGRGVDSGREILRAQGDAIRDITGTLLGVLSNAPDQESVNSYVKGAFSAKLLFGGAAMAGTAYRYDATFASSRVVPTANENRPRNIAFLYIVRAA >LR134531|1065846:1082230|1077932_1078523_+|VEJ54296.1|tail|DBSCAN-SWA MKTTLDNYGFAISDGYVQVYNVDPKNGEFISESEEFLNQGVGLPAYSYLDKPLPEKQGFVVCRQGDDWIYQEDHREALVYSTETGEQVTITEPGALPDNLTPSQPQTPFDKWNGSAWVMDLDAKQKAEIDKVTADKQRLELQAKEMIATLSDAIELELALPGDDLKLLAWRKYRVLLNQLDMKNAQDIEWPALPDC >LR134531|1065846:1082230|1069117_1069555_+|VEJ54281.1|DBSCAN-SWA MAAELTGTYKGDQVFVTVGPVLISGFSDGDAITVKRAEGLYTSKVGIDGGVARVRNANKSGSIEIKLLQTSKVNDELSDLFYVDNFNEDGSPVLPISVTDGNGRTLCSAGHAWLKAVPDVSFGKDVSDRLWAFECADLKIYVGGN >LR134531|1065846:1082230|1072062_1072731_+|VEJ54286.1|DBSCAN-SWA MSSVWGILTKLLNDKSSGVVRFQNAYSNMEFDVVTSESHTWTADVTTNPVETGAVITDHVQLKPDSLEISGIISNSSIERWRGRFLETLSDLFNKESNIQKAFDQLRMLLENRQPVMVYTKYRNYPDMVLTRLSIPRKAGEGDSIEFSATFTHIRRVSTLIVDAEEAGINPKQADSPATSRKSSPKNNKGPIHPSTANDETVTHIENKETTVETNVSVEVNT >LR134531|1065846:1082230|1075106_1076528_+|VEJ54292.1|DBSCAN-SWA MALTKDGYTIKRLAELKKEYDRLLINRFGPINTQPDSVIGQLEGIWAEALANIYEQAQDTYHAMYPFSAEGVSLDGAVSYVGITRFAASATQVIAAVYGKESTLLKSGAQATNGGQRYQSMFDAVISRANAVDTSIEMNVRDKTQYSININGTLFNTISVEGEDATQIMQKIADQLNPNVLSYEIKDKVLRIFAVDGITPFALSVGEHLKLVRIGSPARFIAMQEGRHVLPLGALTEIVTPRSGWDAVCNLAEGVVGRERESDAQLRVRFEQSRQSTGSATVKAIRARLIQEVAGVSEVHIFENRTGSISEEGMPPHAFEALVVGGDNQSVANALWRHKPAGIETYGAHAVLVKDENGDGQQINFSRPTQKLAWIKINVTGLYNEENLPQNVISNIKKAVLKYGSTLKIGDDIILQRMLGPIYSNTSGLAEITIEAAMTEGPDDEPVYQLENVAIDKRSVALFDESRLEVIGL >LR134531|1065846:1082230|1070150_1070282_+|VEJ54283.1|DBSCAN-SWA MARLVTLQEVKQGEVDTDDLLKLNALLDMRETMQLQAQLENQT >LR134531|1065846:1082230|1072772_1073093_+|VEJ54287.1|DBSCAN-SWA MATVLKIPLDAGIADQQMDITLDKKPLTLRVTWNEYGQYWYFSLSERNGESIIDGIKMVKDTFLLKRYQLSSPEGDFIFMDNYSGKERPDFYSLGNDHLLLYRTKY >LR134531|1065846:1082230|1074753_1075107_+|VEJ54291.1|DBSCAN-SWA MIDLKLDATGDLDLQRNDLLWIDGAERVHQQLQIKLKLWKGEWFLNTQFGTPYLQQILGKQITLNGALAALKNSINEVDGVLEIEQFNYDFDRQTRQLSVQFAVKTPYGLVKYKGNQ 
>LR134531|1065846:1082230|1073139_1074105_+|VEJ54288.1|DBSCAN-SWA MLFNRVAELVVGEANGKAVIINDLRFSFEISKDNDKTTNKLALKIYNMNNQTRSLVERVNNNVILKAGYEDDIGAVTIFTGTVVSAWTTREGNDTLTELTVRDGALPLRQTKLSLSYAPNTSALDILADISKAFGLPVKPMPEKIMDKPYRRGFAFCGKAEVAMKDVCQYLGLTWSIQNNEIQILSKDNPITDELVVLTPDNGLIGLPTRIIDSTRNKSQGESSPPSKLVLSESLGDKSQYQIEGYNVKCLLQPRLYPGCYVGLESDMLLLDPSADKERSERPRAFFRAEVVTHSGDTFEGEWITECELKAIPQGGKRGGK >LR134531|1065846:1082230|1076524_1077268_+|VEJ54293.1|DBSCAN-SWA MTFPYRKTALSRLVGQFQDKPKVKALAESMVAPLEAISADFNDLKNGRWVDSATGAQLDGCGYIVGVPRFSREDDEYRIAIKSRILSNISRARPQDLIEGVRFLTKPTEVQYLESYPSCAMLFTDGRQIPEGSQAILQDIAPVAIENVPLMVSYGRAKPFRAGRLANPMNMRVDMTENGNSSLLTNDKHLVISGSNPVGSARLSGMAPDKTPLTANGTKISVGAGVLSVSTTYQVFDNGYHLPGVFQ >LR134531|1065846:1082230|1069668_1070085_+|VEJ54282.1|DBSCAN-SWA MQVETFIIGNREYTAKKMNAFDAGRFLLKIKNIAAPALTAFAQADNANDVNLFELFSGLDEEAHEKVIFPILAASAVYSIEDKRKVASITDMNMCFNVDTLLDFYLLVWEVLKMNFAPFIERASTHFGYQDIEQAASE >LR134531|1065846:1082230|1078649_1079408_+|VEJ54297.1|tail|DBSCAN-SWA MAFTIEQEQALLALLNEKKITLSELPAATELAADDLLLMRQGILDKSVNNDVLKAYFTPPESSLIESGIVKLSNAINSDDEYIAATSKAVKSAHNIALNASQSVAEQALSKSANLSDVADKEQVLKNLGLLEAASRSGITALNDKNGWLSIPVKAMGKLSNVIIQWGTVTLPLISDGGIGEWRQSSTSFSFPISFPNACFSTGCSLLHGGSSTQWVAIASASPSTRTTGRALIQTRYPLETNPSITWIAIGY >LR134531|1065846:1082230|1079416_1079842_+|VEJ54298.1|tail|DBSCAN-SWA MSYFYSAHENAFFPAKFKEAYIAANTWPEDLLEVSVEIFNQYSIQPPEGKIRVAGVDGLPTWISIPEPIKTQYVATAEIEKYNRISEANRITQTWQTQLILGIISEQDREKLTNWMLYLQAVQAIETQAAPDIDWPKAPAA >LR134531|1065846:1082230|1066575_1066977_+|VEJ54277.1|DBSCAN-SWA MTFSARMMTTALTGVCIVLLLLYARWLGNQLSQLRNEKQQAVVALAEERAYSAKIRAQYRQIQEVMDDVAEQKQESEKRTVALQRALAQSQLASPCVAEPVPDAVTQRLRERVAEVNATAAGAKNAVPPVPGT >LR134531|1065846:1082230|1079924_1080032_+|VEJ54299.1|DBSCAN-SWA MARLIFAHCNSYRIDLINILLSMIMKLIKRLLQYS >LR134531|1065846:1082230|1066153_1066579_+|VEJ54276.1|DBSCAN-SWA MITSQSGVEHIKSFESCQLKAYLCPAKVWTIGYGHTSGVKPNDQISAMQAERYLKADLVRVEQDVLRIVKVPLTQGQFDALVSFAFNCGARALSTSTLLRKLNQRDYNGAADEFGRWVYANGKRFAGLERRRRLEKRMFES 
>LR134531|1065846:1082230|1070278_1072063_+|VEJ54285.1|DBSCAN-SWA MSNQNIGAMQVANSLDGLRASLPPLNKVNEKIEDTVKKIGVQVNINQSIAEKNIRHISNNVNNLLSGLSKVSSAVLTRNAPSINQAVKKFFFISNITERIKNDISIKINVAPSQSNEKSSCPNDGERGGDKKDIWDTIKFFTELAADLLTILDILGGFRKLKRIFRLMGGAFSKVFSRIGSFAKGVGSAVGKALKSAAGSITRQAKRLFSGVKQIASSVKSKVGSGIRGGYTKAKNFLSSGGNKVKTLFLAGKDKAKNLFTSGREKAKNLLSSGKEKARNLLSSGKNRVKSVANVGKQLFGRLSNFIKPAVSKGKDLLSSGKNRVKSVGNVGKQLFSRLGNFIKPAVSKGKDLLSSGKNRVKSVANVGKQLFGRLGNFLKPAVSKGKDLLSKGKNTAKSLAGRASGLGSDILKRGGSLFTKMAGPMLNIGKSLMSKVPGGIKSAMKTTFTAGKSLLSNPKVLRTLGRVASVGLRVGRMATPIGWASLAAEGAIRLGYHAYKKYQASKENEGTPGSGEGPGKDNSDVYPQTVTPGMAAAASAANAMNYMINSNPQFQINILPGTPEQQINDIRQAALDGTKQGEKQLTATINGGM |
23 | Erwinia_phage(31.25%) | tail | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1326529 : 1345327
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >LR134531|1326529:1345327|DBSCAN-SWA AATGCTAAAGATTTTTAATACCCTGAGCAGACAAAAAGAAGAATTTAAACCCATTCATGCTGATAAAGTAGGTATGTATGTGTGCGGGGTAACCATTTATGACCTGTGTCATATCGGGCATGGACGGACTTTTGTTGCTTTTGATGTTGTTGCCCGCTATCTGCGTTTTTTAGGTTATAACCTGACCTACGTGAGAAACGTGACCGATGTGGATGATAAGATTATCCGTAGAGCAGCAGAGAATAATGAAAGCTGTGAGCAACTGACCGAGCGTATGCTGGCTGAAATGCATGCGGATTTTGATTCACTGAATATTGCTCGTCCGGATATAGAGCCACGTGCTACCCACCATATTGCTGAAATTATTGAAATAGTCGAACTGTTGCTACAGCGTAACCATGCTTATATTGCAGATAATGGCGATGTCATGTTCTCGGTAGAAACTGATGCGGATTACGGTTTGTTATCTCGCCAAAATCTGGAGCAGTTACAGGCTGGGGCAAGGGTCGAGATTGCTGATGTGAAGCGCAATCCTATGGACTTTGTGCTATGGAAAATGTCTAAGCCAGGTGAACCAAGCTGGGAATCACCGTGGGGGGCAGGGCGCCCGGGCTGGCATATTGAATGCTCTGCGATGAACGGCAAGCAGTTGGGCGATCATTTTGATATCCACGGCGGTGGTTCTGACCTGATGTTCCCCCATCACGAAAATGAAATTGCACAGTCAACCTGCGCCCATGATGGCCCGTATGTGAACTACTGGATGCATTCCGGTATGGTGATGATTGACCGTGAGAAAATGTCGAAGTCATTAGGTAACTTTTTTACCATCCGTGATGTGCTACAGCATTATGATGCAGAAACGGTGCGCTACTTTTTGATGTCCGGTCATTACCGCAGTCAGTTGAACTACAGTGAAGAGAATTTAAAGCAGGCAAGAGCGGCGCTGGAACGTTTTTATACCGCGTTACGTGGTACTGATGAGTCGGTTAAGGCGCAAGGTAATACTGAGTTTGAAGCGCGTTTTCGTGAAGCGATGGACGATGATTTCAATACGCCAGAAGCTTACTCAATCCTGTTTGATATGGCGCGAGAAGTTAACCGCTTGAAGTCAGAAGATATGGTTGCTGCTAATCAATTGGCGGCAGAGTTACGTCGTCTGGCGGCAGTATTAGGGCTGCTTGAACAGTCGCCGGAAGCCTTTTTACAAGGGGCTGGGCAAAATGCTGCGGACGGTGACGAAGTCGCTGAAATTGAAGCACTGATTAAACAGCGTAATGATGCACGTCAGGCTAAAGATTGGGCACAGGCTGATGTTGCCCGTAATCGATTGACCGAGATGGGAATTATCCTGGAAGATAGTTCTGAGGGGACGACTTGGCGTCGTAAATAGTATTGAGTCAGAGATAAAAAAAGCGCCATTAGGCGCTTTTTTTATTTACGTTGACCCGTCCTGAATAGAGTTATAGCGGGTCGTGGTAATCCTGACAGGCTTGCAGAGTATTTTGCATTAAAGTTGCTACGGTCATCGGGCCGACACCACCGGGTACTGGCGTGATCCAACTTGCGCGTTGAGCAGCAACGTCATATTCCACATCACCTACAACTTTACCGTTTTCTAACCGGTTAATACCCACATCAATCACAATTGCGCCGGGTTTTATCCATTCCCCACGAACAAAACCAGGCCGACCTACGGCGGCAACAATTAAATCAGCAGCCCTGACTTTCTGTTCCATGTTTTGCGTAAAGCGGTGAGTGATGGTGGTTGTACAGCCTGCGAGCAGCAGTTCTAATCCCATTGGACGACCAACGATATTCGATGCACCAATGACTAATGCATCCAGACCATAGGTATTGATTTCACAGCGCTGTAGCATGGTGATAATACCACGCGGCGTACAAGGACGCAGTTTAGGCGTGCGC
TGGCACAAACGGCCAATATTATAGGGATGGAAGCCATCGACGTCTTTCTCTGGCGAGATGCGCTCCAGAATTTTGATATTATCAAAGCCTTGAGGCAGCGGCAGCTGAACCAGAATGCCATCAATCAGCGGATCGTTATTCAGTGAATCGATCAGTTCGAGCAGTTCATTCTCAGTACAGGACATGTCCAGATTAAACGAACGTGATAGGAATCCGACTTCCTCACAAGCGCGGCGTTTACTACTGACATAAATTTGCGATGCTGGATTCTCGCCCACCAGAACAACGGCTAAGCCAGGGGCGCGTTTGCCCGCCTGAATACGTTGCTGAACACTTGCGGCAATTTCAGTTCTGAGTTGCTGAGCAATCGCCTTACCATCAATAATCTTTGCTGACATCTGAGAATAGATCCATTTTTTAACGACGGGGCTCATATTCTGTCAGAAGCAAAGCGTTCTGTCAGGGGTAAGTTATGATTAAATATCAATTAGGCAGCTTTTTGAGTAAAAAAACGAATGCAGGCTTAATAAGCGGTGGAAGCAGAGCTTAAATTGCGTTGTGAATAGGGCAACAAAAAGCGATATTCAATAATGAAAGTAATAACAAGTGCTTAAGTTATGTCCAGTGAAGCACTGAAATACATGTGTCAGCCTAATTAACACGTAAAAGCCATTGACTGAGGACAGAAGAGCCGTATAATTCGACCCCATCAAGAAATCCCGCGCTAAGCCGGTATCTTGGTATCACTTCAACGCTCCCATAGCTCAGCTGGATAGAGCAACGGCCTTCTAAGCCGTAGGTCGCAGGTTCGAATCCTGCTGGGGGCGCCATAAATATCAATCACTTGCATTATTTAACCCCCGATAATTTTCCAGCTTGGGACATATTTGGGACGGACTCGTCGAAAATGCTATCAATTTGCTTAGCGTGTTCAGTTAAATGATTGGGTGCTAAGTGAGCATAACGACGTACCATTTCTATACTTTCCCATCCGCCCATTTCTTGCAACACTGACAGTGGGACACCAGACTGAATCAGCCAACTTGCCCATGTGTGTCTTAAGTCATGGAACCTGAAGTTTTCTATGCCCGCACGTTTTAAAGCACTATTCCAAGCCGAGTTTGAATCAACCCTCATCTTTCTGACCGAGGGTGTTTTTGTGCCATCTGAACGATTCGCCGGTTCAGTATGAACAAAGACACATGTATCATGCTTGCCGAGTTGGTCGCGCAATACTTTGCAAGCAGTATCATTCAGAGCTACGCCAATAGCTTTGCCTGATTTGCTGTCCTCTGGGTTAATCCACGCAACCTTGCGAATCATATCGACTTGCTGCCATTCAAGATTAAGAATGTTTGATCTCCTTAATCCTGTTGCCAGAGCAAATTTAACAATGGATTTTAGGGGCTCAGGGCATTCGTCAATTAGTCGTTTTGCCTCATGAGCCTCAAGCCACCTGAACCGTTTATTTCTAACTTGGGGCACCTTAATTACAGGCGCCTTTTCTAGCCACTTCCAGTCTCTTTCAGCAGCTCTCAGTAACGACTTTATCAAAGCCAAATGTTTAGCTTTAGTTGATGTAGATACTGGCTTTTGTTGATACGTCGGTATTTCCTTTCCCTTTTTCGCTAATGATATTGCTCTAGATTTAAAGTTACCAGCAGCCTTTCGATTGACCATTTTGCTGATAGCTAGGTAGATTTTAGCTTCCGTAATTTCTCTAAGTTTTACCCCCTCAAAATGCAAAAGCCAGAATCCCATCCGGCTTTTATCATCATCAAGTGATTTCTTTTCAGCCTTCTCCTCCAACCAGCGCAAACAAGCTTCGTCAAAAGTAACATCAGGAAAGTCGCCTAATTTGTCGATCCTCCACAATTCGGATTTTCGTTTGTCGTGCAACTCCTGAGCTTCCCGCTTGTCCTCTGTGCCAAGAGATTCTTTAATTCTTTTTCCACCCGGTAACGAGTAGCTCGCGTAGTAGGTTTTACCTCTGCGGAAGA
TTGACATGGTTTATTCTCCATCCGTTCATCCGCGCTCACTTGCACAGCGTAAACTGGATTTTGTAGCGCGGCAATACATGCCTGTCTAGTTGTTTGATAAGGTGATTTAGGTTTGTTCGGATCTTTTCGTGATGCTTGGAGCCTTCCCGTTTTTACCCACTCATTAAATGTCGGGCCAGAGATACCGAGAAAACAACATGCCTCAGCTTTGCTTAAGCTGTATTTATCCATCATTCATCTCTCTTTAAATTGTCCGTTAATAACCCCGATCGCCCACGTAAGCTCAGTTAGCTTGATGCCTTTAATTTCCAATGGTGCGTAATGCTTTAAGATGATCGGGCGGGTTATTGTGTCTGGGGGGGTATTACGTGGTTTTTTTTGGATTGCTTCGGCTAGTTCTTTGGCGCATCGTCGTGCGACTGATTTAACAGAGTGTCTCGTATGCCATTGTTTACCTCTGTAAACACGATATTTGGAACAGCACCGGCTTGAATATCTCTAACCAAGTCATTAGCTAATTGATGCGCCAATGCTGGTGTAATGCCGCGATAGAGATTGTCATAAGCTAAGTATTCTTTGATGTATTCGATTGTTTCTTTCATGCTGCTTTATCCTTACTGATCAGTTCAGATCCGATACTCATTAATTTTTCACGCTCAATAAACGATATCCGGCACCGGCCTATTTCGCCGTAGCGCCAGATTACGAGCATTGAGCCTTTATTGTTTCCTGTCTGCCTACGCCCACTCTCTGCGTGAACAAATGCCAGGCGGCCGTCAGTGATGATTCGAACCTCTGAGCAAGTCTCTCGAATCCAACGAAACCAACCAACGCTAGTATCAGACGGGATTAGCATAACAACTGTGTGACCGCTGACCGCTGCTTCTAGTGACTTTCTAACCCACGGAAGAATTGAGCTGTATGGGGGATTTAGCCAGATTGATCCGGCTGTATTCCACGGTACTGTTAATGCATCTGTTTGCTCATCAATGAAGTCAGGGCAGAGGTGATTGTATTGGCTTGCTGCTGCGTCTAGTGTGAAGTTGAATTCTTGATTAAGTGCGGCGAATATTTCGGGTGGAGTTTGCCATAAGTCTCTAAATTCCGGCGCTGTATTACTGCCGGCAAAGTCAGTCATCGTGAGCTCTCCCATTCACTGCGATTAACTACCGATGTTTTGTAGTCACTAGCAGCCTGAAATACAAGGTTACTGATGTGAATATACTCGCAACCCGTTACCCATATATCAGCAACATCGGACCATCTAGGTAATTCAACTGATAAATAAACGGCGCCATCACTACCCTGAACGGCGTGAGGATATCGATCAGGCCAGCGCTCAAGTTTTGCTATCAAAATTTCTAACAGCGTCATAAATCCTCCGGGCATTAAATTTCAACTACACATTTTTCGCCGTTTGCCCAACCTTCCTCGTAATCCGTTTCCCCTTTAGAAATTAATGAGGCATGTTTCTGTACGCTACTCCAGTTCATGTTATTTGCTGCCCAATCCTTAATTTCGAATGGATCAGAGTTAAATAGAATCATAGTGTCCTCTGCAAGACTTCTACCAATATCACCGCTGAACTCATGAGCGTAGTATTCCGCACGATGTCTAGCAATAATGCTAACTGGCACTGCCCAAACACTATTATCAGGCATTGTAACTTCTAATTGTTTAGTCATATTCACTCCATTAAAAAACCCTCACTTGGAGGGCTCTTAAAGTTGACTTAATAATTAATCTTCAAATGGGCTGGGTTTGAACTTGAATATCGCCACCCCAACATCAGCTAAATTTGCCGGCTCATGAACTGCGAGCAAATAAGACTTTCCATCATCATGCGGGTGTTTGATGACTTTAAAGGTAACGTTTTTTACAGCTTGATACTTTAATTCTTTCCCATGCTTTAAATGTTGCTGATCATCTTTAGCAATGTATTCGATAGTCTCTGGGGAATATTCAAATTTTCTTAAATCCCCACTATATCCAGCGCCATA
TAACAAATAATCTACTATCATAAGAATCCCCCAAGTAAAGTTACTTAAATTAGTAACACTTAAAGTTTAGAATTACTATTCTTTTGGCTCATACGGCAGCTTACCGAGAGTTTTCGACCGCTCTCGATAAAATGCTACACGTTCTGTGAGATAATCACGTAGGTGAGTAGGGATTAAGTTCTCAATCTCACGCATACTAAACGGCATTCCATATCGCTCTTTATAAGCAACGCCGGACGCGTGTAAATCAACGTCGATTTTTTCGCGGTCTTCTTTCGGCAGGTCGGCTAAGTTGTGGCTCATGATAATGTCTCCTTTGTGTGGACATTATACAGAGCGAGGAGGGGGATGCTAATGTGTTAGGTCTTAGGGTTAGCAATACATGAATAAACTAACCACAAAAGCAAAGCAGGTATTGTAAGTGCAATAGATACTGGCCATACGCTCGACATGGTAAATGCGCCAGCGAAACCTATGGGGTCGCCTTTGCCAGCGCATCTATCAACGAAAGCGCACATTAAGAAGGTCACAGTAAACCCCGCCAAATAGATAAAGCACGCCCATTCTTTGATGTTTTCCATATCACTTCACCGTTAGCCCTTGGGCCTCGATAGTTTCACGGCATAATTCAATGCCAATATTGAACCCCATGCGCTGGTCATTCCATGCATCATCGTAGACGCTAGGCAACTCCTCATAGGCACCGCGGGTAAATCGAACAGCCCCGATCGCCTCAAGAATTCGCCCAACCTCTGCATTTACCTGTTTGTCGACACTATAGTGAAATTCAAAACTGTTAGAATTGGTTGTTTTCGGCTCGACTAACTCATTATGATAGCCTCGATACGTAGTTGTTCTAACTCGTTTCATTTCCCCCAAAGCCCGTAACGCGACAAAAGGCAGGGGTCGCTGCATCATCACAAAATCCTTTAACTTCTTTTTAGGTTTCTGCCTGAATTCAGCGGGAATAGCTGCGGGATACATGCTGGCCAGCACACCATTTAGACGCCATGCCATATCAGGATGAACTTCGAAATGCGCGGTACCTTTTTTATAAGTCCGAATCCTCATGGAACCGCCATCGATATTCATCCACTGCACCGACTGACGATCTGCCGCCTTTAATAATTGCTCGGTACTTTGCCAATTAGGCTCGTCTTCGCGTCCCATGAACTTGGCTACTACCTGCCGCAAATCAGTAATATGCTCCTTTCTGTATTCCACCCAAATGATCATTCTCTTACCGAAACCTTGCGGCTGGTTTGTAACATGCTCACCAGATAAGGCATGAAAAATACCATCGACACGTTCACCAAAGAATTGCTGTCGGCTATTCAGTAAATCAGTCAAAGTTGAGCGGACAGTATCCTCTTCAAAATCTGGGGTTTTCATTTCCTCAATTTGTTTATCCCACTCTGAACGACGACTAGCTGGCATATATTCATAAACATCTGTCAACCGCATAGCCATTTGCCAGTAATCGGCGTTAAGTCGCGCTATGGCTCCCTCTTGTTTAAATATGTCATTAACTTTACGGATATAACGCTCACTCTCACGGTTTCCGTGTAAGAAGTACGAAATAGCCTTGGCGTTACTACCGTTATTAATAAACTCGAACACGCTCTCAATATTGAGTTTCATTTGTCGATATCTACCAATTAAGTTATCAACAAGATCCGCTGACACTGGCGCGAAGAATTCCTCTGTTTCAATAAGATGATTCATGTTTTCTCCGGTATCGCTCTAAATCGAATCTCAAGCCTAATCCTCGACCCATTGCCCTAATATCATCAATGTCTCGTTCTAGCTTTTCAGCGATAACATTGATGTGCATTTCTCCGGCAACTTGCATGATGAAATCCTTTTCATATTGCTGCCATTGAGTGAGTAGCTTTGTGTTGAGTTGCTCGGCGAAGTATTGTGCTTTGATTGGGTTTTTAATGACTTGATGCTCTGGCGTTATCCAGCCTTTTGATATTTGGGAATAGGGCAGAGTTACC
CGCCCTACGGTTATAGAATTGTGGGCGCTGTTCATAATCAGCCCCTAGCTTGTTGCGTTTTTGATTTTCTCTGAGCAGGCTTTGTAAACCTCACCGGCTGCTTTTCTATCTGCATCAGTAATAGTGATGGAATTAAGCTGCTCACCAACTTCTTTAAGCTCTTTGGCTGTCTGGCATAATCCCATCTGATTGATGATATCGTCGTAAGTTGCGCTATCCTCTGGAGTCATAGTGACAGCGGGTTCAATAACCAGCTTTCTAACTCTATGCTCTGCGCGGTTTCCGCGAGACACTGAAAGCATCATAGAAAAATCACAATCAACATGGCTCATTGCCTGAATCTTTATGCCACCCACAGCAACACCGCCGAACTTAACATTAGGGTCACCAACCAACGTTAGCGACTTGCCAGCCCACGAATGACCATCTGAACCCCAGCCGCCGATAAGAACGCGACGCATAGACTTAGATGGCTTGTACGGACGACCATCAAAGCCAATTAAATCAATCCATACCGGTTGCTCTTTATTGCCTGACCGTACTGATTTTATTTCAGCGGTAATACTGGATGATTGAACATCCTCAAAGTTAATTTGGTCTGATTTAGGGATAACAGTTCTTGATAAGTCCATTACAGGAATACCTCTTCATCTTCATCGTCATCAAATAGATAGTTAGGGACGTTTATTTCAGCCGATGCCAAAACAACCCCCTCGGTTTTTGGTATGTCGCCATCAACACAGGCTTTAAGCTTGTATAGTGCTGTCATCATTGTTTTTCGTCCCAGCTCTAGCGAATCTTCGCCGATGTAGTACATGCAGTTTCGATATGGCGGCTTGTTCTCTTGAGCGAAGAAGCAAAATTGATTTAAGTCTCTTCCGGTTACCAGCTTCAATACGTACAAGTAAAAAGCCGCTTGAACGTGATAGTGAAATTTGCCAAACGCAACGCTAAACCCGCGTTCAGAAGCATCATTACAGCTCTTTACATCTAAGGGATAAGATAGGTGATCAGATATCCGATCGAATCGGCATTTAAGCTTTAGTCCCGTTTCAGGGCAGGTGGTGAACATAGATACTTCTGACTTGCCGGGCGTGTTCATGTAGTCCATGAAGTCTTCATTCAGTTTTGACGCTTCGATCATCCGGTTGATTGTTTCCACTTCGTTACCGATGAATATAAATTCCTCATTAGCAACTTCGGCTAGCTCTTTGTATGCTTTCGATGCTCTGGAGTTTATTGTTTCATCAAGAATGAAATCACGCTCGAAAACGTTAGGTTCAAGCAGGGCGGCATGAATCGCTGTTCCTATCTTTGCCTGCTTACTGCCTTTGAATGGGTTGAAATAAAGGTTTGCAGGACTAACGCTGATAGCTTTTACACTCGTGGAGCCTATGGCTTCATCAGCGTGATAGCCCTCATTCGATATGTCGTAATATATGCCGGGTTCCATTAAGCAACCTCGTCAAATTCGTGGCTCTTATGTTCCGCTTCCTTTGCTAGTGCTAATTCACTGACGCGGTAGAGAAGGGCGTGAAGCGCATAATTTAAGCTCCCTTCATCTTTCATTGCGTCCTCTTCAATAGCTTGCTCGAACAGTTCAGCGTCGAAACCGGGTAAGTGTTTAAGCATCTTACGAATGTTTTCGTTAGTGATGTTTCCGTGCCAGCTTTCAGCTTGTGCGTTTACTTCGTCAATATCGCAGTCAGTGAATGCGGCGATTCGTTTATCTAATTTGTCGATATTCATACAGCCTCCGCCGCGCTGAGCGCGATAATTAAGAAGAAGGTGAAGAGTAGAGCGCCAGTAACATCAGCAGCCCATCTAAGCGCTATTGAGAAGCCTTTTTTCGTATTGCTGAACTCACGCGAACGTCCGCAGACGAACACGTTAATTCGTTTTCCCGTTGGAACGATTCGCATAAGAACCTCGAATTTAGGTAACAAAAAAGGCCGCGATTGCGACCTTATGGTGACTGGTTGGACTTATTAATCAGCTGT
TAAAATATGTGGGTTAAATTTCCCGCTTTAGAGCGATACTCGACTTGTTCACCGCGGCAGATTTTCAATTCAGTATTAAATTTAAGCCCCTGACCTGTAATATCCTCGTGTTGCGATATGTTTGATTCGCCGTATTTTTCCAGCGATTCCATATGATGCTTTGCGATGTTATGAGAGTAGCGTTCGCCCCACTCCTTTCCGTTTCTATTCTCGAATCTTGATGGTTTGTAATGGGACTTAATGAATTCGTATAACTCTTGATATGTCGGCATCCTCTATCCCTCTAATGTGGTTAACAGGCCTCTAATCACCTTAATTGCTGACTTTATTCGCGCCCTGCAAACAGGGTGGTTTCGATACTTAAAATTCAGTCTTATTGATTTTTTCATTGCCGCTATCGCGTTTTCAGCAATTCCGTAATTAAATCGAAGCACGTCGATAGGGTCATACGATGTAAGTAATGTCATTTCTCACCCGTCAATTGGTTTTCAAGCTCTTCAACCCGCTTGCTTAGCGCTGCGTTTTGATTTTTTTCGCTGCCGAGCTCACTTAATAGTGCTGATACTTGAGCCTTTAGCTTTTCATTTTGCTCAGTTGCTATTTCATAACCGACCCTATAATTCATCGCCTCCCCTCCAATTAGGTTGTCATTATCAATGGCTCTCGTAAGAACCATTTGTAATAACTCTTTGCGCTTTAATCACCGTTCCAGCCTGCTTATCCTTTTCGCGCCTATTCTCACTCTCGTGTAGGGCTAATCTTCTCAGCGACCGGGTCGCGCTCGCTGATACGTCGTATGACTCGTCAGGGGCTAGGCTGTGCGGGTGATTAATTCCGACTTTCCATCAATGTTAAAGAGCGGTTCGAGGCGGTGGGTTGCTTTCGATGTAATGAGTTTATTAAACACTAAACTCATCGTCAAGTGAAAAATAAACAAATGGTTTAGTAATTAGTTAACTTTATGATTTTAAAATGACTTTAGTTTGAAAAATAATTTGATGAGTATGATTTGATAAGGGGGGAGATAGAGCTGTGATTTATGGGTATAAAAAAGCCCGCACTGTGGGCGGGCCATACTGTTTTAACTGACTCTTATGGGAGGTTAGCGAGCTTGGCATCTACAACAACGCCAATAATTTTACAATTTTCATTGATTTCTACCATCCTGTACTGAGGATTAAGCGGCTTAAGATAATGAACCCCTGATTCGACTATATACCGCCTGAATGTCGCCTTATTTTCATCTTGGAGTTTTGCCACCACAAGCTTTTCGCTTTTCGGTTCAACTGAAGGGTCTACTAAAATTATCATACCTTCAGGGATGCTCAGGCCGGATGGCGCTGTCATTGAGTCACCAGTCACAGTAAGCCAAAATGCATTATCCCCAACATTTTCTGTTGTAGCAGGCCATTCATCAATGTCATTTTCATTATATGGCTCTACAGCTTCCAGCCAGCTCCCCGCACTCACCCAGCTAACAAGAGGGTATGCTCGGCCATTGGTCGTTCGCTCGACGTACTTAACGTTCCCGGCATCATCATACAGATCCTGATCAAGATACTTATCTGGCATCCCGTAAGTATTCTCAAGCCTTCTTGCGGCTCTTTCTCCGAACGAAGCTTTCCCGTTTATCAACTGAGATAAATAACTTTTCTCTTTTTGCGGGATTGCTTTATCAGCAAACCAATGCTTAAGGCGTTTTTGCCTGATCTTTTTTATGTCCATATCTCTATTTTGATTAGCAATTAATAAACAAGCAAATACTTGACCTTCGGTTTAGCATTTAATAAACTCAACTAAACAATCAGGAGTAAATTATGGAACTAAAAATTTATATCGAAGGACTTGAGCGCGGCGGAGCCAAGCAGTTAGCCAGCCAACTAAGCGTTTCAAGTTCATTTTTATCTCAAATGGCCTCTGGCGCAGCAGCTATTTCACCGGAGCGTTGCGTTCTTATTGAATCGTTAACCGGTGGAGCTGTGACTAGAAAAGATTTAAAG
CCGAGTGACTGGCAAAAAATCTGGCCTGAACTTCAAGCTACATAAGAGATAAGGCAACAGTACCGCATTGAAGTAATTAAGAAAACTACTAAGTGGAGGTGAGGATGTTAGATAAGAAAGAAATGGTTAAGCAAATGATAGCAGCGTTTCCCGGTGGAAAGTCTGCGCTGGCGGGAGCGATGGGAATCGATGTCGGATCGTTCAATAACTCTCTTTACGAGAAGAACGGAACTAAGTTTTTTGATTTCGATGAATTGGAAGCAATGCAGGACTTAACAAAGACAGCTTATGTCGCTGATTACTTTGCTCAGAGAGCGGGATTTGTAATTTCAAAAATACCCAAAACTGAGGTTTTAGATAATGTTGAATTGTACAACTTAAGTCTATCCGCTGATGTTAAACGAGGCGAAACAGACAGGTTGATGTACGAATCGATAGCTAATGACGGCGTTATTGATAAACGAGAGGCTAAAAGCATTAGAGATGGGCTGTATGTCGAGTTTGCGGCTCGCGACGCTGAAGTTAGAGCAACAATTCGCGTTAACTCAAGAGATGAGTGATATGAACCCTACAGAATTTATCCGAAAAAATATTGTAACGAAGCTATTAGCTGAAAACTTCACGTTATCAGTGGCACAGGGTGCGGCTCATAAAGGTGTTGAGTTTTATCTGAAAGCCTGCGGTGCAAGCCGTAGGGGGGGGCTGTTTGATGATTGCTACCAACACGCTAAAGCATGGGCGATTAAGAACACAACAACGCTTGATAAGCCAATCAAGCAAAAGAAAACCCGAACAGTTGCAGCTGGCCGGGCTATCAGTCTTTTCTAACTCTACTAGCGGATTTAATTATGACTAAAAAACCTAAAAAGTTCAACCAAAAAAACGTTACCCAGCGCTCTGATAAGCCTGACGAGTTGCTCATGATCTGCGTAGATAAACCGGTATTCGGTAAGCGTCTCGCGTCAGAGTTTCGCAAATTACAGGAGCAGCGTAATGGGTAACTTAGCTTATGACAACGTAACACCGATTAGACCTGATATTAAGGTCGTGGAGAGTCGCGTGGCAAATACCGATGATGGTTATACGCGATTAGCTAATGAGCTGTATGAAGAGCTAATAGGAGCAAATCTAACGCGTAATCAGGCAAAGGTTGCTCATGCCATTTGCCGCAAAACATATGGCTATAACAAAAAAATGGATCGCATTTCAGATAGCCAAATTTCAAAGCTAACAAAATTACCAAGACAGAAAGTCAATAAGGCGAAAAACCAACTAATTACCATGAACGTAATGATTAGTGATGGTCGCATGATTGGACCGAATAAAGACTTATCGAAATGGAATTTACCCGAGTGTAACCAATTCGGTGACAGTGACACCAAGACGGTGACAGAAAATGTCACCAAAGTAGTGACAGCGATGTCACCAAAGCAGGGACACACAAAAGACACTATTACAAAAGACAAGAAAGACAATAAAAACACTTTGCCCGAACAAGTTCAGGTCGAGGGTAAAACTCCTTCACACGAAAACCTAGTCGAGAAAGCATTCGAGGAAATCTTCTGGATTGCAGGAATGGTTAAGTCTGGCAAGACCAAGGCTAAATCAGCGTTCAAAACTCAGTTCAAAGAGTACCGGAAAGAAACCAGCGCTACGCCTGAGCAGTTCGCCTCTGTGCTGGCTCAAGATATTAAATCACGCCTAGGCAAACAATTCGGATTCGAGAAGCTTCACCCAACGACCTATCTGAACGGTAAGCGATGGGAGGACGAGAAGCCTCAACAGGGTTCAGAACCAACGACGAATAAATCCACGATCACCGTGAGCGGGTCTGGTTATGTTTTTTACTAAATCCGCTATGAAGTCACGCATTAAAGCGCTTCTAGTCGCTGGGTATAACCACGGACTAATTAGCGATCGCGTAGTGAGTTTCTGGTTTGAGTTTTTAAAGTTGAGGTCTGTATGAGCCCTAGTGAATTGTCTGAAATGTTATGGAATCA
GGTCGATCGCGTGGCTAAGTTTTTACTGCCAAATGGCAAGAAAGACGGCCATGAGTGGGTAGCCGGTTCCGTTGGTGGTGAAGCTGGTAAAAGCTTGAAGGTTAATCTTGCGGGTAAGCGGGTATGGTCAGATTTTGCCGAAGGTTCTGCCGGTGACTTACTTGATTTATGGGTAGCCGTGAATGACTGCTCTCTACATCAGGCCATGAGTGAGGCTAAGGCGTTCTTAGGGATCAAGGATGATGATCACCACTTTCAAGCCAAACAAAAGAAATTCTCACGACCCAACAAACAAGAGATTAAGAAGCACGTAAGCAAGGCTAACTATTGTTACGAGTACCTTTCAAGCCGAGGTATCAGCAAAGAGACAGTAGATGTTTTCAAGGCGACAGACGCTACGGTGTGGAGCAATGACGAGAGACGTGAGCTCAAGGCTGTAGCATTCCCATACATCAGAGATGGCGAGCTGTTGCAAGTGAAGCGGATCAGCTCAGAACGCCCGAGCGGTAAGAAAGTCATCATGGCCGAAAAGGATTGTGAACCGAGCTTGTTTGGCTGGCAGGCCATGCCAAAGAATCTCCGTATCGTCGTTATCTGCGAAGGTGAGATCGACTGCATGAGTTACTACGAACTTGGCTTGCCAGCGCTATCAGTTCCATTTGGTGGCGGGAAAGGAGCTAAGCAGCAATGGATTGAATACGAATATCACAACCTAGACCGTTTTGATGAAATCTGGCTAAGTCTTGATAACGACGATGTAGGGCAGGAAGCCGCAAAAGAAATCGCCAGCAGATTAGGAGAGCATCGTTGTCGCCTAGTTAAGCTACCACAGAAAGACATCAACGAATGTCTTCAAGCAGGGATGACCAGCGATCAGCTTATCGATATTCTTGAAAGCTCTGAATACTTCGATCCTGAAGAGCTATACAGCGCAAGAGAGTACCAACAGCAAACTATTGACGCTTTCTACTCAAAAGAAAAAGGACTGTTCTATAGCCCGTGGGAGCCGTTAAATCATAACTTCGTTTTTCGTGATTCTGAACTGTCTTTAGTCAACGGCGTTAACGGGCATGGAAAAACAGAAGTCGTTGGACATATGGCACTTGAAGCCATGCGCCAAGGGGTTAGAACCTGCATTGCTTCACTGGAAATCAAGCCTCCGATCCTGCTCAAACGTTTAACTCGTCAAGCCTGCTGCGCCACTAAACCGCCGGTCATAGAAATCGAATCAGCCTTTAAATTTTATGATGATCGGTTGTGGTTATTTGGCCTCACGGGTACCGCCAAGGCCGATCGGTTGTTAGAGATATTCCAGTACGCTCGCCGCCGGTACGGAATAAAGCTTTTCATCATCGATAGCTTGATGAAGTGCGGCCTTGGTGAAGATGACTACAACGGCCAGAAGGCTTTCATCGACTCACTCTGTGACTTTAAGAACAAAACAAGCTCACACATCATCCTTGTTACCCATAGCCGCAAAGGTGACAGCGAAGATAAACCAACAGGAAAAATGGACGTTAAGGGTACCGGCGCTATAACCGACCTGACAGATAACCTATTCATTATTTGGCGCAATAAGGCCAGAGAAAAAGCCATTCAAAAGCAACAGGCTGGCGAGCAGCTAAACGATAAAGAACTATCTGCATTAAATGGCCCTGCATCGGTTTTAAGCCTAGAGAAACAGCGAAATGGTGAGGGGTGGGAGGGCGGCATTCCTCTTTATCTTGAGCCGGTATCGCATCAGTTTTTGCAGACCGAAACAGCATCATCATTTAGCTATATCGCAAACATGCCAATCGACGAATACAACGAAGAATTCCGCAATAAATACGTGACGGAGGGTTACTAATGGACGATTTCTGCTTACACAAAGACACCTTTCAACAGTTGGGCGTAACACTACAGACGCTTATCAGTACCGGTAAACGCTATCGAGTGAGAGTTTGTGAGTGGAAAGAGAAGCGTAGCTTAGGCCAGAACGATTTAAGCCACGTTTGGTA
TGACGTTCTGAGCAAATACCTAATCAGTAAAGGTCGCACAGAGTGTTCTCCAAAGTGGGTTAAACGAGCAATGAAACACACTTATTTGGGTTATGAAGATATTGAAATGGTGGACGTTGTAACCGGCGAGAGAACAACGCGACAGGAGCTACGGCACACAGCAGATTTAGATACTGGAGCAATGCATTTCTACCTGACTCAGGTCGAAGGTTGGGCGCTCAATGTCGGTTGCATGTTAGCTGTTCCGGCCGGGTGTGAATATCAGCAATTACAACAAAAGCAGGTGGCGTAAATGGAACTGACTCGTGACGACGAAGTAGTGATAACAGAGTACCTTCGCGGTCTCCATGAGCACTATGACGGACCAGTGTTAATCAACATGCAGCGTTTGATAGAGCTACATATGGTCATGAGTAAACTCATTGCCGCGTATCTGTTCTGCGCAAAGAAATTGAATGACGTCAGATCAGCGCGGGATATGGTTATTAAGTATCAAGCCATGAAACAGGAACTAGCAGAAATGAAAAAGGCGGTGCGTCATGACTAATTTACGTAAAGAGGCGAGAGGCCGAGAGTGCCAAGTCAGGATTCCTGGTATATGCAACGGCAACCCGGAGACCTCCGTTTTAGCTCATTTAAGATTGGCGGGGACATGCGGTACCGGAATCAAGCCAGTTGATACGCAGGCGGCAATTAGCTGTAACTGCTGTCATGACACTATCGATGGTCGTACTAAAACGGCATATACGCATGATGAACTCAGATTAATGCATGCAGAAGGTGTTTTTAGAACGCAGGCAATATGGATTCGGGAGGGTTTTATAAAGATATGAGTACTTACAATATAACCCTGCCATGGCCTCCAACGAATAACAATCTATTCACTGTAGCCCGTGGTCGCAAGATTAAAAGTCAGAAAGGGAGAAACTATCTGTCTGAAGTGGCGGCTTACGTTCTGATTAATCGCATGGCGCTAATGCTGTCATCAAGCTTATCCGTAAACATAACAGCATACCCACCAACGCGAGCTAAGCGCGATTTAGACAACTTATTCAAAGCTCCTCTGGATGCTTTGACGCAATGCGGCGTAATAGCCGATGACAGCCTGATTGACGACCTTCGTATCGTACGCGGTGAAGTAGTAAAGGGCGGGCGATTGGAGATAACAATTACGGAGATAGAGTGATGGATATTCAGATTTCAACAATTCCCGATCTGTTAGTTAAGACCAGAGGAAATCAATCGGCCGTTGCAAATATTTTAAGTGTTCAGCGGTATACCGTTAAAAAATACTCACGGGATTTTAAAGCGGAAGGTCACGCGGTAATTAACGGTCGATTAATGGTTAAAACATCAGGCAAACGACGTAAAGATAAGGCGGCAGCATGATTATTAACATAGCAACATTAAAGCTAAACAAAGAACAATTGGACTGGGTTGATAGCTGGTTGTCGCTATGGGGTGCATGGGTGTACAGCGGGAGACTGGATAAACGGCAGAGCAGTATAATTGCAGAATACATGGCGACAGTAGAACCACAAAAATATCCGGACCGGCCGACATGCAATGACGATGACGGAATGCTGATGAGTGCAGCTATAGACTCAGTAATGAAGATTGACAGGAAAGCGTTCGGAATGCTGCTTAGCTACTACGCTAACAACTCGTCACGACACGCTATAGCAACTTACATGCAGAAGGTTGCCCCGGCGCGGAGAATGGATACAAGAGGTGGTAATAGACTGAAGAAACCATCGCTATCAACTTGCCGTCGTGAAGTTGATGAAATACTCGATGCAAGTTTGCACATGGTATATAACCCGCTGGCATCTGCATTCAATAACCGCAAACGTGTCGGTAAAATTAAGAAAGTCGCAAATTTGGCTTGACAGCAATAAGCAAATGAGCAATGATTTACGTATAAGCTGCCGAAGTTATACGACATGGCACTACAGATTAAAAAGAACCCGCTCACAAGGCGGGTTTTTTGTTTCCCTAAT
TCAGCCACCGCACACGCTACCAGACAATATTTAGCATCTGGATAGTAGTTCGGTGGCTTTTTCACATCTAGCCCCAGCCAAACGTCAACACTCACTCAAACGCAGACAGTGACTACGGCTGCGGGCTATTTCTCTCAATCTAAAAATGGACATCTCCACGGGAGGTGCAAATGCGTATGCCAATCAAAGAGCCTCAAAACATCGGCCTGCTGAGCCACTTTCTAGCATTGTTTATGACACTGCTTGGGGCTCTGGCTAGCTATTCATACAAAGTATTAAACGGTGAGCAATTCCGGTGGAAAGTTTTCGTGCTGCAAGTGATTGTAGCTATTTTCGCTGGGTTCCTGATTATTTTGGCCGCCAATTACTACCACTGGGCCGCTGAGTTTGCTGGTGGTGCTGCTGGGTTGGCTGGATGGTCAGGGGCTGAGTTCATTAAAGCAATTGAAAAGCGCTTCCTGAGAAAAGCGGAGGGTGGGCATGAGTAACTTTAAATTTAGCCAGCGCAGCGAAACTAATCTTCGCGGAGTTAATCCTGATTTGGTAAGGGTGGTTCGTCGTGCGCTAGAGATTACTAAACGTGACTTTACTGTCATTGAGGGTAAGCGCACTGAAGCCCGCCAGAGGCAATTAGTGTTGAACGGCAAATCAAAGACAATGAACTCTCGTCACTTGAGCGGGAACGCTGTTGATTTGTTACCGGTAGGCGCTGACTGGAATAACTATAAAGGCTGGTTGCCGGTTCTTGATGCTGTATATAAAGCTGGTCGGGAACTGAATATCAAGCTGCGCTTTGGTATTACTTGGACTGATTGTCCGAACGATACGCCAGCTAAGTTTCTTGATGCCCCGCATGTTGAGATACCGGCATGA
Protein sequences of DBSCAN-SWA_2 >LR134531|1326529:1345327|1334498_1335086_-|VEJ54586.1|DBSCAN-SWA MDLSRTVIPKSDQINFEDVQSSSITAEIKSVRSGNKEQPVWIDLIGFDGRPYKPSKSMRRVLIGGWGSDGHSWAGKSLTLVGDPNVKFGGVAVGGIKIQAMSHVDCDFSMMLSVSRGNRAEHRVRKLVIEPAVTMTPEDSATYDDIINQMGLCQTAKELKEVGEQLNSITITDADRKAAGEVYKACSEKIKNATS >LR134531|1326529:1345327|1339447_1340305_+|VEJ54599.1|DBSCAN-SWA MGNLAYDNVTPIRPDIKVVESRVANTDDGYTRLANELYEELIGANLTRNQAKVAHAICRKTYGYNKKMDRISDSQISKLTKLPRQKVNKAKNQLITMNVMISDGRMIGPNKDLSKWNLPECNQFGDSDTKTVTENVTKVVTAMSPKQGHTKDTITKDKKDNKNTLPEQVQVEGKTPSHENLVEKAFEEIFWIAGMVKSGKTKAKSAFKTQFKEYRKETSATPEQFASVLAQDIKSRLGKQFGFEKLHPTTYLNGKRWEDEKPQQGSEPTTNKSTITVSGSGYVFY >LR134531|1326529:1345327|1342746_1343001_+|VEJ54603.1|DBSCAN-SWA MELTRDDEVVITEYLRGLHEHYDGPVLINMQRLIELHMVMSKLIAAYLFCAKKLNDVRSARDMVIKYQAMKQELAEMKKAVRHD >LR134531|1326529:1345327|1331588_1331828_-|VEJ54579.1|DBSCAN-SWA MTLLEILIAKLERWPDRYPHAVQGSDGAVYLSVELPRWSDVADIWVTGCEYIHISNLVFQAASDYKTSVVNRSEWESSR >LR134531|1326529:1345327|1335085_1335907_-|VEJ54587.1|DBSCAN-SWA MEPGIYYDISNEGYHADEAIGSTSVKAISVSPANLYFNPFKGSKQAKIGTAIHAALLEPNVFERDFILDETINSRASKAYKELAEVANEEFIFIGNEVETINRMIEASKLNEDFMDYMNTPGKSEVSMFTTCPETGLKLKCRFDRISDHLSYPLDVKSCNDASERGFSVAFGKFHYHVQAAFYLYVLKLVTGRDLNQFCFFAQENKPPYRNCMYYIGEDSLELGRKTMMTALYKLKACVDGDIPKTEGVVLASAEINVPNYLFDDDEDEEVFL >LR134531|1326529:1345327|1339015_1339282_+|VEJ54596.1|DBSCAN-SWA MNPTEFIRKNIVTKLLAENFTLSVAQGAAHKGVEFYLKACGASRRGGLFDDCYQHAKAWAIKNTTTLDKPIKQKKTRTVAAGRAISLF >LR134531|1326529:1345327|1332193_1332475_-|VEJ54581.1|DBSCAN-SWA MIVDYLLYGAGYSGDLRKFEYSPETIEYIAKDDQQHLKHGKELKYQAVKNVTFKVIKHPHDDGKSYLLAVHEPANLADVGVAIFKFKPSPFED >LR134531|1326529:1345327|1332529_1332757_-|VEJ54582.1|DBSCAN-SWA MSHNLADLPKEDREKIDVDLHASGVAYKERYGMPFSMREIENLIPTHLRDYLTERVAFYRERSKTLGKLPYEPKE >LR134531|1326529:1345327|1331842_1332139_-|VEJ54580.1|DBSCAN-SWA MTKQLEVTMPDNSVWAVPVSIIARHRAEYYAHEFSGDIGRSLAEDTMILFNSDPFEIKDWAANNMNWSSVQKHASLISKGETDYEEGWANGEKCVVEI >LR134531|1326529:1345327|1335906_1336203_-|VEJ54588.1|DBSCAN-SWA 
MNIDKLDKRIAAFTDCDIDEVNAQAESWHGNITNENIRKMLKHLPGFDAELFEQAIEEDAMKDEGSLNYALHALLYRVSELALAKEAEHKSHEFDEVA >LR134531|1326529:1345327|1327991_1328852_-|VEJ54574.1|DBSCAN-SWA MSAKIIDGKAIAQQLRTEIAASVQQRIQAGKRAPGLAVVLVGENPASQIYVSSKRRACEEVGFLSRSFNLDMSCTENELLELIDSLNNDPLIDGILVQLPLPQGFDNIKILERISPEKDVDGFHPYNIGRLCQRTPKLRPCTPRGIITMLQRCEINTYGLDALVIGASNIVGRPMGLELLLAGCTTTITHRFTQNMEQKVRAADLIVAAVGRPGFVRGEWIKPGAIVIDVGINRLENGKVVGDVEYDVAAQRASWITPVPGGVGPMTVATLMQNTLQACQDYHDPL >LR134531|1326529:1345327|1336453_1336726_-|VEJ54590.1|DBSCAN-SWA MPTYQELYEFIKSHYKPSRFENRNGKEWGERYSHNIAKHHMESLEKYGESNISQHEDITGQGLKFNTELKICRGEQVEYRSKAGNLTHIF >LR134531|1326529:1345327|1333036_1334179_-|VEJ54584.1|DBSCAN-SWA MNHLIETEEFFAPVSADLVDNLIGRYRQMKLNIESVFEFINNGSNAKAISYFLHGNRESERYIRKVNDIFKQEGAIARLNADYWQMAMRLTDVYEYMPASRRSEWDKQIEEMKTPDFEEDTVRSTLTDLLNSRQQFFGERVDGIFHALSGEHVTNQPQGFGKRMIIWVEYRKEHITDLRQVVAKFMGREDEPNWQSTEQLLKAADRQSVQWMNIDGGSMRIRTYKKGTAHFEVHPDMAWRLNGVLASMYPAAIPAEFRQKPKKKLKDFVMMQRPLPFVALRALGEMKRVRTTTYRGYHNELVEPKTTNSNSFEFHYSVDKQVNAEVGRILEAIGAVRFTRGAYEELPSVYDDAWNDQRMGFNIGIELCRETIEAQGLTVK >LR134531|1326529:1345327|1337546_1338179_-|VEJ54593.1|DBSCAN-SWA MDIKKIRQKRLKHWFADKAIPQKEKSYLSQLINGKASFGERAARRLENTYGMPDKYLDQDLYDDAGNVKYVERTTNGRAYPLVSWVSAGSWLEAVEPYNENDIDEWPATTENVGDNAFWLTVTGDSMTAPSGLSIPEGMIILVDPSVEPKSEKLVVAKLQDENKATFRRYIVESGVHYLKPLNPQYRMVEINENCKIIGVVVDAKLANLP >LR134531|1326529:1345327|1329303_1330461_-|VEJ54575.1|DBSCAN-SWA MSIFRRGKTYYASYSLPGGKRIKESLGTEDKREAQELHDKRKSELWRIDKLGDFPDVTFDEACLRWLEEKAEKKSLDDDKSRMGFWLLHFEGVKLREITEAKIYLAISKMVNRKAAGNFKSRAISLAKKGKEIPTYQQKPVSTSTKAKHLALIKSLLRAAERDWKWLEKAPVIKVPQVRNKRFRWLEAHEAKRLIDECPEPLKSIVKFALATGLRRSNILNLEWQQVDMIRKVAWINPEDSKSGKAIGVALNDTACKVLRDQLGKHDTCVFVHTEPANRSDGTKTPSVRKMRVDSNSAWNSALKRAGIENFRFHDLRHTWASWLIQSGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHAKQIDSIFDESVPNMSQAGKLSGVK >LR134531|1326529:1345327|1339302_1339455_+|VEJ54597.1|DBSCAN-SWA MTKKPKKFNQKNVTQRSDKPDELLMICVDKPVFGKRLASEFRKLQEQRNG >LR134531|1326529:1345327|1343283_1343640_+|VEJ54605.1|DBSCAN-SWA 
MSTYNITLPWPPTNNNLFTVARGRKIKSQKGRNYLSEVAAYVLINRMALMLSSSLSVNITAYPPTRAKRDLDNLFKAPLDALTQCGVIADDSLIDDLRIVRGEVVKGGRLEITITEIE >LR134531|1326529:1345327|1344624_1344942_+|VEJ54609.1|DBSCAN-SWA MRMPIKEPQNIGLLSHFLALFMTLLGALASYSYKVLNGEQFRWKVFVLQVIVAIFAGFLIILAANYYHWAAEFAGGAAGLAGWSGAEFIKAIEKRFLRKAEGGHE >LR134531|1326529:1345327|1344934_1345327_+|VEJ54610.1|DBSCAN-SWA MSNFKFSQRSETNLRGVNPDLVRVVRRALEITKRDFTVIEGKRTEARQRQLVLNGKSKTMNSRHLSGNAVDLLPVGADWNNYKGWLPVLDAVYKAGRELNIKLRFGITWTDCPNDTPAKFLDAPHVEIPA >LR134531|1326529:1345327|1343639_1343843_+|VEJ54607.1|DBSCAN-SWA MDIQISTIPDLLVKTRGNQSAVANILSVQRYTVKKYSRDFKAEGHAVINGRLMVKTSGKRRKDKAAA >LR134531|1326529:1345327|1342302_1342746_+|VEJ54602.1|DBSCAN-SWA MDDFCLHKDTFQQLGVTLQTLISTGKRYRVRVCEWKEKRSLGQNDLSHVWYDVLSKYLISKGRTECSPKWVKRAMKHTYLGYEDIEMVDVVTGERTTRQELRHTADLDTGAMHFYLTQVEGWALNVGCMLAVPAGCEYQQLQQKQVA >LR134531|1326529:1345327|1326529_1327921_+|VEJ54573.1|tRNA|DBSCAN-SWA MLKIFNTLSRQKEEFKPIHADKVGMYVCGVTIYDLCHIGHGRTFVAFDVVARYLRFLGYNLTYVRNVTDVDDKIIRRAAENNESCEQLTERMLAEMHADFDSLNIARPDIEPRATHHIAEIIEIVELLLQRNHAYIADNGDVMFSVETDADYGLLSRQNLEQLQAGARVEIADVKRNPMDFVLWKMSKPGEPSWESPWGAGRPGWHIECSAMNGKQLGDHFDIHGGGSDLMFPHHENEIAQSTCAHDGPYVNYWMHSGMVMIDREKMSKSLGNFFTIRDVLQHYDAETVRYFLMSGHYRSQLNYSEENLKQARAALERFYTALRGTDESVKAQGNTEFEARFREAMDDDFNTPEAYSILFDMAREVNRLKSEDMVAANQLAAELRRLAAVLGLLEQSPEAFLQGAGQNAADGDEVAEIEALIKQRNDARQAKDWAQADVARNRLTEMGIILEDSSEGTTWRRK >LR134531|1326529:1345327|1338271_1338499_+|VEJ54594.1|DBSCAN-SWA MELKIYIEGLERGGAKQLASQLSVSSSFLSQMASGAAAISPERCVLIESLTGGAVTRKDLKPSDWQKIWPELQAT >LR134531|1326529:1345327|1340416_1342303_+|VEJ54600.1|DBSCAN-SWA 
MSPSELSEMLWNQVDRVAKFLLPNGKKDGHEWVAGSVGGEAGKSLKVNLAGKRVWSDFAEGSAGDLLDLWVAVNDCSLHQAMSEAKAFLGIKDDDHHFQAKQKKFSRPNKQEIKKHVSKANYCYEYLSSRGISKETVDVFKATDATVWSNDERRELKAVAFPYIRDGELLQVKRISSERPSGKKVIMAEKDCEPSLFGWQAMPKNLRIVVICEGEIDCMSYYELGLPALSVPFGGGKGAKQQWIEYEYHNLDRFDEIWLSLDNDDVGQEAAKEIASRLGEHRCRLVKLPQKDINECLQAGMTSDQLIDILESSEYFDPEELYSAREYQQQTIDAFYSKEKGLFYSPWEPLNHNFVFRDSELSLVNGVNGHGKTEVVGHMALEAMRQGVRTCIASLEIKPPILLKRLTRQACCATKPPVIEIESAFKFYDDRLWLFGLTGTAKADRLLEIFQYARRRYGIKLFIIDSLMKCGLGEDDYNGQKAFIDSLCDFKNKTSSHIILVTHSRKGDSEDKPTGKMDVKGTGAITDLTDNLFIIWRNKAREKAIQKQQAGEQLNDKELSALNGPASVLSLEKQRNGEGWEGGIPLYLEPVSHQFLQTETASSFSYIANMPIDEYNEEFRNKYVTEGY >LR134531|1326529:1345327|1331052_1331592_-|VEJ54578.1|DBSCAN-SWA MTDFAGSNTAPEFRDLWQTPPEIFAALNQEFNFTLDAAASQYNHLCPDFIDEQTDALTVPWNTAGSIWLNPPYSSILPWVRKSLEAAVSGHTVVMLIPSDTSVGWFRWIRETCSEVRIITDGRLAFVHAESGRRQTGNNKGSMLVIWRYGEIGRCRISFIEREKLMSIGSELISKDKAA >LR134531|1326529:1345327|1338558_1339014_+|VEJ54595.1|DBSCAN-SWA MLDKKEMVKQMIAAFPGGKSALAGAMGIDVGSFNNSLYEKNGTKFFDFDELEAMQDLTKTAYVADYFAQRAGFVISKIPKTEVLDNVELYNLSLSADVKRGETDRLMYESIANDGVIDKREAKSIRDGLYVEFAARDAEVRATIRVNSRDE >LR134531|1326529:1345327|1334162_1334489_-|VEJ54585.1|DBSCAN-SWA MNSAHNSITVGRVTLPYSQISKGWITPEHQVIKNPIKAQYFAEQLNTKLLTQWQQYEKDFIMQVAGEMHINVIAEKLERDIDDIRAMGRGLGLRFDLERYRRKHESSY >LR134531|1326529:1345327|1342993_1343287_+|VEJ54604.1|DBSCAN-SWA MTNLRKEARGRECQVRIPGICNGNPETSVLAHLRLAGTCGTGIKPVDTQAAISCNCCHDTIDGRTKTAYTHDELRLMHAEGVFRTQAIWIREGFIKI >LR134531|1326529:1345327|1343839_1344343_+|VEJ54608.1|DBSCAN-SWA MIINIATLKLNKEQLDWVDSWLSLWGAWVYSGRLDKRQSSIIAEYMATVEPQKYPDRPTCNDDDGMLMSAAIDSVMKIDRKAFGMLLSYYANNSSRHAIATYMQKVAPARRMDTRGGNRLKKPSLSTCRREVDEILDASLHMVYNPLASAFNNRKRVGKIKKVANLA >LR134531|1326529:1345327|1336917_1337130_-|VEJ54591.1|DBSCAN-SWA MVLTRAIDNDNLIGGEAMNYRVGYEIATEQNEKLKAQVSALLSELGSEKNQNAALSKRVEELENQLTGEK >LR134531|1326529:1345327|1330846_1331056_-|VEJ54576.1|DBSCAN-SWA MKETIEYIKEYLAYDNLYRGITPALAHQLANDLVRDIQAGAVPNIVFTEVNNGIRDTLLNQSHDDAPKN |
31 | Salmonella_phage(19.05%) | tRNA | NA |
Homologous phage analysis in the prophage region: proteins shown in color are bacterial proteins whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
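The protein records on this page carry the keyword as an optional field in the header line (for example `|tail|` or `|tRNA|` before the trailing `DBSCAN-SWA`). The following is a minimal sketch of how such headers could be split and the keyword recovered. The field layout is inferred from the records shown here rather than from any documented format, and the function name `parse_protein_header` is hypothetical.

```python
# Keyword list copied from the legend above; treating the tag as an exact,
# case-insensitive match against this list is an assumption.
PHAGE_KEYWORDS = {
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
}


def parse_protein_header(header):
    """Parse one protein header line into a dict.

    Assumed layout (inferred from the records on this page, not documented):
    >contig|region_start:region_end|start_end_strand|protein_id[|keyword]|tool
    """
    fields = header.strip().lstrip(">").split("|")
    contig, region, location, protein_id = fields[:4]
    tool = fields[-1]
    # Any fields between the protein ID and the tool name are treated as tags.
    tags = fields[4:-1]
    # The location field looks like "1357882_1358542_+": start, end, strand.
    coords, strand = location.rsplit("_", 1)
    start, end = (int(value) for value in coords.split("_"))
    keyword = next((t for t in tags if t.lower() in PHAGE_KEYWORDS), None)
    return {
        "contig": contig,
        "region": region,
        "start": start,
        "end": end,
        "strand": strand,
        "protein_id": protein_id,
        "keyword": keyword,  # None when the record has no keyword tag
        "tool": tool,
    }


if __name__ == "__main__":
    record = parse_protein_header(
        ">LR134531|1349048:1369172|1357882_1358542_+|VEJ54631.1|tail|DBSCAN-SWA"
    )
    print(record["protein_id"], record["keyword"], record["strand"])
```

Records without a keyword field simply yield `keyword = None`, so the same function can be used to separate highlighted proteins from the rest.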
DBSCAN-SWA_3 |
1349048 : 1369172
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >LR134531|1349048:1369172|DBSCAN-SWA AGTGGCAAAGCTCAACGACAAACAAGAGCTGTTTTGCCGTGAGTATATTGTCGATTTAAACGCGACGCAGGCAGCTATACGAGCGGGATACAGCGAAAAGACCGCGCAGGAGCAATCTAGCCGTCTGTTATCAAATGTTATGATTCAGTCTCGGATTGCAGAACTTAATGCCGATAGGCTAGAGAGAGTCAGGATTGATGCTGACTACGTACTTAATCGCTTGGTTGAAATAGACCAAATGGATGTTTTGGACATCCTAACTGACTCTGGAGACCTAAAGCCGGTTGCTGATTGGCCTAAAGCATGGCGTACAACGCTATCAGGTTTAGAAGTAACGGCAATGATGGGGGATGGTGATTCAGCGGCGTTACTGAAGAAGATTAAGTGGCCTGACAAAGTTAAGAACCTTGAATTGCTAGGCAAGCACATTAGTGTGCAGGCATTCAAAGAACAGGTTGAACAGAAAGTCACAGCAACACATAACATCATGCCGGTTCCAACATGCAGCAGCGCTGACGAATGGGAGGCAGCCGCACAGCAACAACAGGGCGAGGTATTAGGTCAATGAACTACAACGTAGTTTGGAAGCCATTACCGGGCTCTCAGTCGCTTTCATTAAGCTGCCCCTGTGATGAGATATTATTCGAAGGTACGCGCGGACCCGGTAAGACAGCAGCGCAATTGGCAAGGTATAGAAGTCGAGTAGGTTTGGGGTACGGCACATTCTGGCGCGGTGTCATCTTCGATACTGAATATAAGAACCTTGCTGACATCATCACTCAATCAAAGCGTATGTATCGCCTGTTTAACGACGGAGCTAGATTCTTAGCGTCAGCATCGGAATTAAGGTGGATATGGCCTACCGGTGAAGAGCTGCTATTCAGATTTGGTAAAGAAGAGAGCGACTATTGGGACTATCACGGTCAAGAATTCCCTTTTATCGGCTTTAACGAACTAACAAAGCAACCTAGCGCCGACTTTTACGAATCAATGTTCTCATGTCGCCGGTCGTCATTCAGGCCACAAGATTACCCGCGTGAAGATGGCTCGTTATTACCGAATATCCCTCTGGAGACGTTCAACACCACTAACCCGTTTGGCATTGGTCATACATGGGTTAAGAAAAGATTCATCGACCCGGCTCCGCGTGGAACTATCATCCGAGATACTCAGATGGTTCCCAATCCACAAACGCAGCAGGAAGAAGAAATAACACTAACCCGTGTAGCTATTCACGGTTCATTCAAAGAGAACCCTTATTTAGACCCTGTATACATTGCTACGCTGATGAATATTAAAGACCCAAATAAGCGCAAGGCTTGGGTTGAGGGTTCATGGGATGTTACCAGTGGCGGGCGATTTGACCACCTTTGGAATGAATCATTGCATGTTATCAAACCATTCAATATTCCCGATAGCTGGACTGTTGACCGCTCCCATGACTGGGGAGAATCAAAGCCGTTTTCTAACTTGTGGTGGGCTCAAGCTGATGGAACCGAAGCAATGCTTCCTGATGGTTCTAAATTCTGCCCGCCATCCGGATCATTAATCCTGATCGGCGAGTGGTACGGCTGTCCGCCTGATGAGCTAAACAAAGGCTTAAATATGTCATCGACAAACGTGGCTAAAGGCGTTAAGTGGGTTGATAGGCGGCTTGCCGGTGAAGAGGTAGAAGAGCCAGAAGAAACTAAAGGGCAGGGCCAGATGCATATTATGCCGGGGATCTGTAAGAAGGTGATTCGCGGCCCTGCTGATGGTTCTATTTTCAACACGGCTGATAACGAATTATCGATAGCGCAAAAAATGGAAGCGCAGGGCGTTAAATGGCTGGAGGCAAACAAAAAGCCCGGCTCCCGCATTAATGGCGCGTCGCTATTTGCTGACATGCTTGAAGCTGTCGTGGAAGGTAAGAAAACAGAGTCAGGA
ATGCCAGAAAAACCGGCGTTCTATGTATTTGACTACTGCCGAGGCTGGATTAGTCGCATTCCTACATTAGTCAGAGACGAAAAGAACCCTGATGATGTAGACACAAAACAAGAAGACCACGACTGGGATGCTACACGATACCGTGTTTTACATTCTCCTTGTCGCGAGGCATTCAGCATAAACATGAGAACAACATACTAATGGCTAATAACGATATTACATTTATCCGGCCTGAGCACAAAGCCGCAAGCCTAATATGGGAAACGATTCGCGATGTTTGTCGCGGGCCATATGCAGTAAAGCAAAGGAAGCACAAGTATTTACCAAAGCTAGATCCAGCCAATACTAGTGAGGAGAATAATCGGCGTAATGATGATTATATTCGCCGTGCAGTTTTCTATGCTATTACAGGACATACAAAAAACGGGCTGATTGGTATGGCGTTTAGACGCGATCCTACTGTGACTATCATCGATAAGATTGAGTATCTAAAAACCAACGCTAATGGCGCAGGTATCAGCATCTATCAGCAGGCTCAGTCAGTGCTTGAATCTGTGTTAGAAGTCGGCAGGGATGGGCTGTACGTTGATTACAGTTCTGATATGAAAGGCGCAATTATCCTGAGCTATCGTGCCGAGGATATTATTAATTGGCGTACAGAGCGGATAAATGGGCGCGACAAGCTGGTTATGGTTGTCTTGCGTGAGTGTGTGGAAGAGCCAGATGGTTTTGGTTTCAAAGATCGCATTCAATACCGCGAATTAGCAATGGATGAGGGGCGGTTTGTTTGCCGCGTCTGGCGTAATACTGGACCTAAAAACACAGGAGTCTATGTTGTTGATAGCGAATATTACCCTGTAGTTCAGTTCGGCGGGGCATGGGATGAAATTCCGTTTACCTTTGTTGGAGCTCAAAATAATGATCCAACAATCGATGAATCACCACTGTTAGCACTGACTGAAATCAATCTAGGTCATTATAGAAACTCGGCAGATTATGAAGATAGCCTGTTTTTCTGTGGACAGGTTCAGCCTTGGATCAGTGGGTTATCGGAAGAATGGCGTGATTGGTTACAAAAGAATGGTGTCGTATTAGGTTCCCGCTCTCCAATATTACTGCCAAAAGAAGGCGCTGCGGGGTTTAACCAAGCGGAGCCGAATATGATTGCTAAAGAAGGTATGGACTCTAAGCGAGATTACATGGTTTCTCTTGGAGCTCAATTAGTTGAGCAAAATGGCGCTGTTAAGACAGCGACTCAGGCAAATGGTGATCAAGCTGCATCTACATCTGTTTTGGGTATTTGTTGCGCCAACGTTTCAGAGGCATATACCCAAGCCCTGAACTGGTGCGGCAAGTATCTTGGTGTAAACGATGCAAAAGCATCATATTCAATTAGTCAGGAGTTCATTCAGAAAGTAGCAGACTCGGGAATGCTGACGGCAATTGTTGCTGCATGGCAAAGTGGTGCAATTAGGGATGGTGACATGATCCGCGCAATGCAGAAGTTAGACATTATTGACCCTGAATCAAACCCCAATGAGATTTTGGACGAATTAAAGAATCAGGGGCCAAATCTGACCGGTGACTAAAATGGCAACGATTAACGAAGTGCTTAGTGATGAAATTATTGCTCATAGCCTGTTTCAGTCTCGATACGGCACTGGTGTAGCTCGTAAAATGGTTAAACTGCTCAATGAAAGCGATTCTGAGCTATCTGCGCGCTTAATTATGGCATTGAGTGAGGTTAATCCAAACACCATTACTGTGAAGCGCTTAGAGAGTCTTTTGTTTAGTGTTCGTGAAGTTAATAAGCGAGTGATTGATGCAATGTATATGTCACTGACTGATGAACTGATGGGATTTGCTAAGCATGAAGCCATTTATCAATTCAACTTGTTTGACTCTCTGTTGCCACACCCTGTATTGCAGCGTTTCCCGCTTGCTTCAATCACTCAAGAGCAGGTTTATGCCGCAGCTATAACCCAACCATTTCAGGGAAG
ATTGTTGCGTGATTGGGCTGAGAATATCGAAGTTGACCGGATGACGCGAATAACAAACACTGTGAAAAACGGTTATCTTGCCGGTGATACTGTGGAGCAGATGGCTAGAAAGGTTAGGGGGACTAAGGCTAGGAACTATCAGGACGGCATAATTGAAGCAAGTAGAAGAAATGTCACTTCCGTAGTAAAGACCGCTGTAACTCATTTTGCCGCAGTGGCCAGAAATAAGTTTGCTGACAACAATAGAGACATTATCAAAGCAAAGCAGTGGCTTAGTACGTTAGATAATCGCACATCCCACATTTGTATTATCCGGGATCGCTTAAAGTACACGTTAGACGGAAAACCAATAGGCCATAAAATTCCATACCTGCAAGGCCCCGGGCGAATTCATTTTTGCTGTCGTTCAATGGAAACCTTTATCACCAAATCATGGCGTGAAATGGGTATTGATATAGAGGATATAAGGGAAGGGACTCGGGCTAGCATGGATGGTCAGGTACCCGCTGAAACCACTTATAGCGAATGGCTACAGAGGCAATCGTACAGTCGTCAGGTTCAGGTACTGGGTGAAACACGGGCCAGACTAATGAAGGATGGAGGAATGAGAACCGATGAGTTTTTCAATGATAAAGGTGAATGGCTGACGTTACAGCAACTTCGGGATATTGATGAGCGATCTTTTAATGAGGCTGGACTATGAATGCTGGAGAAATAAAAGAACAGAAAGAGAGGGAACTCGCTGAGATAATTACACAAAAAGCTAAAGAGTTGACGAGTGAGACTGGATTAAAGGTATTCAATATAGAAGTTACGCAAATATATCCACTTAGAGTTCGATTTATTATCCAGTAATCACATCAATGAACTGTCGATTTTCCGGCATTTAAATATCAAAACTCAGCCTCGCCAATGTGCGGGGCTTTTTTATGGGCTAGGCCCGGCAACACATCCCAAGGGGACTATATGTTATTCCGAAATATCGAGCGTAAATATTACTCAGCAGCAGGTGAAGGCGGCGATGCTGGCGGCGGTTCTACTTTAGAGATTACCCCAGAAATCCAAGCTCTCATTGATGCGCGAGTCAACGAATCAGTAACAGGTCTGAAAACCAAAAATAGCGAACTGCTCGGTAAGCTCAAAGAGCAAGGCGACAACTTAAAGCGCTATGACGGTATTGACCCTGATGCGGTTAAAACCATTCTGCAACGATTCTCTGATGACGAAGAAGCAAAGCTGATCGCCGCCGGAAAAATTGACGAGGTACTCGATAAGCGCACTGAGCGTTTACGTGCTGACGTAGATAAGAAACTCAAAAGTGCAAATGACCGCGCAGAAAAGGCTGAGCTCTTTAGCAAGAAATTCAGCGACCGAGTACTTGGCGATGCCATCCGTTCTGCTGCATTGAAAACGGGCGCTTTACCCGGTGCAGCTGATGACATCATTCTGCGTGCTAAAGGTGTATTTACTCTCAGCGAGGAAGGCGAGGCCATAGCCGTTGATAAAGATGGAGCTGCGTTACTGGGGAAAGATGGCAGAACGCCACTTACACCGCAGGAATGGGCCGAATCACTTAAAGATATAGCCCCTCATCTTTGGCCTCAGGCTGAAGGTACTGGCGCTGGTGGTCATAAGCCATCCAGTGGTGGGGCTTTGAAACGATCTGAAATGTCTGCATCACAGAAAGCGGAGTACATACGCGCTAATGGGCAACAGGCATTCTTAAAACTTCCAAAAGAATAAGGATTTATAATTTATGAATACAACTGTTAACTCTGACCTGATCATCTATAACGATTTGGCACAGACATCATATCTTGAGCGCCGGCAGGATAATCTCGACGTGTTCAATGCTTCATCTAATGGCGCTATCGTTTTAGATAACGTTTTAATTGAAGGTGACTTCCGTAAGCGTGCCTTTTATCAACTGGGCGGCAGCATTGAGCATCGCGATGTTGATTCTACTAACAAAGTAACAGGCAAGAAGATTGGTGCCGGGG
AGTCTGTAGGCGTGAAAGCGCCGTGGAAATATGGTCCTTATCAAACCACCGAAGAAGCATTTAAACGCCGCGGTCGTGACGTATCTGAATTCTCTGAAATTGTTGGCGTGGATGTTGCTGATGCTTCACTAGAAGGATTTATTAAGTATGGCATTCAAGCGTTAAGTGCATCCATTGGCGCGAACCCAGACATGGTAGTTACTGCCAACATCGAAACTGATGGTAAAAAGACACTGACCAAGGGGATGCGTAAATACGGTGATAAGTTTGGGCGAATCGCACTGTTCGTGATGCATTCTGCAACCTACTTCGACATTGTCGATCAAGCTATTGCTGAGAAGATTTATGAAGAAGCCGGTGTTGTGGTATACGGTGGTCAGCCGGGCACATTAGGTAAACCGGTTCTTGTAACTGATACAGCTCCGATTGATGCTATTTTTGGCTTGTTGCCAAATGCTGTGGTAATCACTGAATCGCAGGCTCCGGGATTTCGCTCATACCCAATCAATGACGAAGAGAACCTTGGTGTTGGTTATCGAGCCGAGGGAACGATTAACATCGATTTGCTGGGTTACAGTTGGGATGAAAGCAAAGGTGGCGCAAACCCGGATCTGACTAAAGTTGGTGCTGCTGATAGCTGGAAGAAACATGCAACGAGCAACAAGGTCACTGCTGGTGTGATGATTAAGCTGGAAGTCGAACCGGTTGCAGTAACTGGCGTAGCACTTGACCAGACCACTGCATCCGTTGTTGTTGGGGCAACTATGTCACTGGCGGCAATTGTTGCTCCGGCTGATGCGACGAACAAGAAGGTTACATTCTCATCATCTTCTGTCAGCAAGGCTACAGTTGACGCAGACACCGGTGTAGTCACTGGTGTTGCAGCAGGTACGGCGAAGATTACAGCTACTTCCGCTGATGGCGCTAAAACCGCTGTTTGTGATGTAACGGTAACAGCCGCTTAATGCCAAATAATAAGGGGCTTTGGCCCCTTTGTTAGCGGAGGTTGGATGATTATCACAAACCCTCAATCACCTGACTTTAATAGTTATGCATCGACAGTAGATTTAGCCAAGTTCGCCTCATCAAGAAACCTTGAGTTACCGGAAAATACAGAGTCACTGCTGATTAAGGCTATGGATTACCTAAATGGGCTTAATTGGTATGGTAACCGCACAGCAGTTACTCAACCATTGCCTTGGCCTCGCAAGGGTGTAGTCTTTGATGGCGTTAGGTTGCCTGCCAACACTATCCCGGCTCAGCTTGCTACAGCACAGTGCATGCTTGCAGTTGAAGCCATTGATGGTGAGTTGCTTGCATCCAGTCGTGAGGCATCGGTGAAATCAGAGCGTGTAGAGGGTGCGGTCACTGTTACATATGCAGTTACTGATGGTGAGTGCTTCACTGCTTCATATCCGTTGGTGATGGGCATTCTTAATGGCTTAGTTGCTGGGCGAGGATTCGCCATTAATGCAATAGTGAGGCGTGACTAATGGCTATCAACCATCAAAGAATGAGATCAACGGCCACTCGATTGATTACACAGAATGGAGCTACTTATACATTAACCCGTGGCGGTGGTGTTGAAGTCATTGGCGGCGTTGAGGTAGATATACAACCAGAGACACACATCATCACTGCCATTAAGTCTGATTATGCATCAGGTGAGATTGACGGAACGCTGGTATTGAATGGTGATGTGAAAATGTCGGCAACGGCTGAAGTAGAAATACGTATCGGTGACCTTATTCTGATTGATGGTAAGTCTCACCGCGTAATAAAGCCTAACCCAGTCAAGCCCGCATCATTGCTACTTTGCTATAAACCACAGTTGAGGGCTTGATATGTCTGACAACTCTCAATTCATGCAAAGCATTAACCTATTTATTGATAAAGCCAAGTCTAATCAGGAAGAGGTTGTACGTGCTACAGGGCTTCGTATTTTGGCGCAGCTAGTAAATATGTCCCCCGTTGGAAATCCCGAAATATGGGAAATAAAC
TCAACAGCAGGAGCATATAACCAAGCGGTTTTTGACCACAACGAAATGCAGAGACAGGATCCGGATAACTTAACCAAGGCCGGGCGGCTTAAAAAACGTGCCCGTGTTAACGACAGTATGGAGATAAAAGCCCCCGCAGGGTATACCGGTGGACGCTTTCGCGGTAACTGGCAAGTTTCTTTTAATCAGCCTGCTGACGGGGTTATTGACCGAGACAAGAAAAGTGATCCTGGTGGTAATGTCACATTAGGTGAAGGCAAGGCGGTCATAGAAATGTTTAAAGTCGGTATAAATGCAGTCTATTTCGCCAATAATGTACCTTATTCTTATCGGCTTGAAATGGGGCATTCAACTCAAGCCCCGGCAGGAATGATCCGGGTTACCGCAGCCGAGATACAGCAATATGTAGATCAGTCTGCTAGGGAAGTTAATAAGTGAATACGTTAACTATAACAGAACTGCTTGAATCGCGCCTTGCAACCGTAGCGACGGGGCTAGGGTTGAAAGTATCATATGAAAACGTTCAGTTCGAACCATTCGATGATATTTATCTCGAATCTCATGTTATTCCGGCTAAAACAAACAGTATCGATTTGGCGGGCGATAGTCGGGTTTATGTTGGTGTCTATCAGGTCAATGTCGTTGTTAAAGCAGGCTCAGGCAAGAGTAAAGCAGGAAAGATAGCTAACGATATCATTAACGCCTTTCCGCTTAACCTTGAGCTATCAAGAGGGGATTTTACTGTCTATATCAATTCGGTTCCTAGCGCCTTTCCAGCAGTTCAAGGTAATGCAACTTATAGCATCCCGGTGAGCATGAATTACCGGTCTGACAATAGTTAATAGTAGCAACAATCAAATATGACCACCTTAGGGTGGTTTTTTATTATCAAAAATCGGAGAGAACACAATGGGATTCGCTTTGCCAAATGGTGCACACGTCTATCTGGCGGCAACTTACGAAGCTGAGCAGGTTATTTCGGGGATTTCCAATGCAACAGAGGCAGTTGTGACGCTTAGCGCGGCTCATGACATCGCTGTTGGGGATATCGTTCAATTAAAATCAGGGTGGAGTGCGCTCGACAATCTGGTCGCTAAAGTGTCAAAAGTTGATTCAAGCACAATCACTTTAGGTAATGTAAATACGTCCAACGTAGGTAAGTTTGCGGCAGGAGGTGGCGTTGGTTCACTCAGAAAGGTCCTTACTTGGATTGAGTTACCGCAGATCACCGAAGTTTCTAATAGCGGTGGTGATCAGCAAAACGTACAGATTCAATTCCTGTCTGATGACAACCAGCGAAACTTAAATACTTTCAAGGCTGCAATGTCGCAGACCTACACCATTGCTCATGATTCTAGCCTGCCTGTATATGATTTGCTTCGTGATGCGGACGAGTCAGGAGACACTATTGCCGCATATATGTATGTGCCTAAAGCGAAGGAGAATCGCTATTGGTCATCTACTGTTTCATTTAATGGAATGCCAACAACAGCGGTAAACGCGGTTGAGACGGTGGCTGTTGTACTGAATCTTCAATCGGCTGGCATGACATTCTATAAAGCTGAATAAGCAATAACGACTTATTTTCTATGATATTTAATTTAATGGCTGGTTTTTTCCGGCCATTTTCATTTTTATAAAGGCGCTCACTTATGGTTACTAAATTTCAATTACAACCCAAACCCACGTTTAAAGCTGATGTGACGATACCACGGGCCGGGGAAGATGATGGGGTTCTGACGTTCACGTTTAAGCACCACCCGCTAAAGCAGTTATCAGAAATCGAAGCTATCGAAGGTACAACCGCTGTTGATTTTGTCATGCAAATTGCAGAGGGGTGGGCTCTTTCTGACGATTTTACGCATGGAAATGTAGTAACTCTGCTAGAGAACTACCCGCAAGCAGGAACTGCGATCGTGAAAAAATACTATGCAGAAATGGTGGGGAATCGCGAAAAAAACTAATTGCGGTTGCCTCATCTCTCTACACGCCTGAACCTA
CAGAGGAAGAGCTAGCCAGCTCTGGATTAACGCCTGATGATTTTGATGAGGTTGTAATTGAAATCCTACCTTGCATATGGAACTCGTTCCTAGTCTTTAAAGCTATGTCCACCCAGTGGCGAGTTGGGATGAGTGGTGCTACCGGGCTGGATTATGGCTGCTTACCTCAGGTTATGGACTTCATTGGGGTTAGTGAAGATAAGGCAACCGTTTTTGCAGACATTCAAGTAATGGAAAGTGTTGCTTTATCCATCATAAATAAGAGGTAAGCATGGCAGATGTTGCAACAATTACGCTGCAAGTAAATACCGCGCCGCTTGCTGAGGGGAATAAGGCGTTAGATCAGTTTGGCGCTAAAGCAGGTCACGCAGCGAAGTCCGCGGATTCATTTGGTGACTCAAATAAAAAAGCCGCAAAATCAACTGAGGAAATGGCAAAGTCAGTCGATGAATCTCATCGGCGGATTGCTGAGTTTGCTAAAAAACTGAGAGACTCAGGAAACTCTGCACATCAAGCAGCTGATAGGCAAGATAAATTGGCTGATTCATTTTTTAAGCAAATTGATGCTATAAAAGAAGTTGATGGAGCTAGTGCCAAGTTAAAGCGGATTCAAGATGAGTTAGGACGAGCCAACCGAGCAGGTGAATTAGGGCAAGGGAACTACGTCACACTTCTTGGCGAAACAACCGATAAGCTAAATCAGGTTGAAAAAGCAGAGAGCACACTCTCTAGAACTAAAGCTGAATTTCTAAGGAGACTGAAAGAGCAAGTCGCGACTCAGAAGATGAGTAGTGAGGAGCTTTTACGATACCGCGCCGCTCAGCTGGGCGTAGGTTCTTCGGCTGAGGTTTATATCAAAAGCCTGTCCGCATCAAATAAAGAGCTGACAAACTTAAGAACTAACTCCGTATCTGCTCGCCGTGAGTTTGGAATGATGGCCGCTCAATTGGCCAGAGGTGACATTAGCGGTTTAAGAGGGACGGCACTAACTACTGCTGGCCGCTCTGGATTAGTTGAGCAGCTAACGACTTTGAGAGGGTTGGCTGTTGGCGGGCTTTTAGCTGGTGTTGCTGTTGGTTTGGTTGCTGTTACAAAAGCGTACTTTCAGGGGGCTAAAGAGTCTCAGGAGTTCAATAAACAGCTAATTTTAACCGGTGGATATGCGGGACAAACCGCAGGCCAGCTTCAAGCATTAGCAGCGTCATTGTCTGGTAATGGTGTAACACAATCGAATATGGCCGAATCTCTGGCTAAAGTTGTTGGTACCGGTTCGTTTTCTGGTGGTGACATTTCCATGATCGCGGGTGTCGCTGCAAGAATGGAGGATGCGGTAGGTCAATCAATTGATGAGACAGTAAAGCATTTCCAAAAGCTGAAAGATGAACCCGCTAAAGCAGCTCAGGAATTGGATAAATCCCTTCATTTCCTGACAGCAACTCAGCTTGAAAATATTTTAGCTTTAGAAAATCAGGGGAGAGCCTCAGAAGCTGCAAAAGTAGCCATGGATGCCTACGCATCTGCAATGAATGATCGCGCAGTCCAAATTGTTGATAATTTGGGTTGGCTTGAAAGCGCATGGAAGTATGTAGGTGATGCGGCCTCTGGTGCATGGGATAAGATGCTAAATATTGGGCGAATGGAGAGTGCGACCGCTCAATTAGCAGATGTCCGCAGAAAGATTTTAGAGTTACAACTTGACCCTAAAAAACAGGCTATCGCAGCTTACAACGGCGACGCAGGGCTTGATGAACTGCGCCTGCAAGAATCTTTGCTTAGTGAACAGGCTTACCAAGAACAAATGTCAGCCGGGCAGGGTAAGGCTATTGCGGATGAGCAGGAGCGACAAAAACTACGGATTAGTCAAAACGATGCATTAAACCGTAAGTATGAAAATAGCCAAGAAAAACACCTTCGTACCATTAATGAGATTAATAACAAATACGCACATGCAGACCAATCTGTACGTGAAAAAGCTGTGCAAGCTGAAAATAAACGGTTCA
AAGAAGAGCTGGATCGTGAGACTAAAAAGGGGCGAAGTAGCGGAGCCAAATCATATACTGATGATTCAGCGACCAAAATGCTATTGGATTCCAAGCAGCGTTTAGCTTCTTTGCAAGAGCAAATGGGGGCGACATCACAACTGACAGAGCAGGAAAAACGGCTTGCTGAGTTCACCCAACAGATTGCTGAGTTGAAGGAAAAACGAATACTAACGGCTGACCAGAAGAGTATTCTGACCCGAAGCGATGAGATTAAAGCGTCATTAGAACTTGAATCAAGCGAATCCCGGCGATTGAAGAACGCGCAGGAACTAGCAAAAGGCCACGAAACCGCGCTTAAGTTTATTCAGCAACAGGAGGCTGCAATCTCGGCAATGCAAAACAGTGCCGGGATGAGTAATCGGGAAGCGCAACGAAACCGAGAGCGTGAGCAAACAAGGATGATTAAAGCATCACCTGAAGATAGAGATTTGGCATTAGCTAAATTAGAGGAACGCTATCAAAGGGAAGATGAGCTGCGCGGTGACTGGCTGGCGGGTGCTAAGAAAGGATTCTCTGAGTATCTCGATAGTGCTACTGATACTTATTCAGCTATGAGCAGTGTTGCTCAATCTGCCCTCGGCGGTATGTCAAATATGCTGACTGACTTAGTGACCACCGGCACAACGAGTTTTAAGCAGTTCACTGTTTCTATTCTAAAAATGATAGTCGATATCATTAATCGCATGATGATTGCTTATGCTCTTCAGCAGGCGATGGGGTGGATAGCTGGAGGAGCTAGCGGCGGAGTAGCTAACGGACAAACGGTTCCTACAACACCCGGTAACAGCATTGTTGTTGGCTATGATTCTGGTGGCTATACCGGAGACGGTGGAAAGTATGAGCCAGCAGGAGTCGTACACCGCGGTGAGTTCGTCATGACCAAGGAAGCTACGGCGCGGATCGGCGTTAATAATCTGTACTCGATGATGCGTGGATATGCTGACGGTGGAGTGGTTGGCAAAGCACCAATGCATGGACTGCAATCATCTGGTAGTAGCTCTGGTGGTGTAACGGTCAATATGGGCGGCATCGTTATGAATACTGGCAGTGATGGAAATCAGCAATCCGGCAGCGGTAATACAAATACGGGGGCAGCTATCATGAAGCAGTTAAAACCAGTGATTATCGCTGCAATTACCGAACAGGCTCAAAGTCCGGGAACTCCGCTATGGAATGCTATCAGCGGGGGGCGATAATCCTCCTGATGAGTAAACTTGATTAGCCCTGCTTTTGCGGGGCTTTTTTTATGTCCCAAGGAAATCATATGGCGATAGATACTTTTTTTTGGCGAACTCAGATACAGAGTGGAATGGAAGGAAGTTTTTCATATAAAACCAGAACGGCGCAATTTGGTGATGGTTATAAGCAAATTGTTGGGGATGGACTTAATCCTGAATCGCAAAGCTGGCCTATCACATTAACCGGATTAAAAAGCGAGATGCTCCCGGCGTTAGATTTTATCCGCAGCCATGTAACTAAATCATTTATCTGGACTTCTCCACTTGATGAAGCTGGGCTATATCGTGTAGCTGCGGACTCTGTGAAGTATTACCCATTGTCGGCACAGGCAATAACTATTAATGCGACGTTTGAAAGGGCCTTTGCACCATGAGCTTAAACAGCGATTATCAAAAGCTTGAACCCGGCAATGCCGTAAGGCTAATTGATGTTGATGGCACTAAATTTGGGGTATCTGATGTCATGCGGTTTCATGCGCATAACATCCCTCATACACCTGAAGAGATTGAGGCGGCAGGCGGTGATGAGAGCAAGTTACCGGCTAAATCGATCTGGTGGCAAGGTAATGAATATAGTGCGTGGCCGTATGAGATTGAGGGGATTGAAACTTCCACCAATGGTGGCAGTGCATCACCAAAGCTGTCAGTGGCAAACATAGATAGTTCTATTACTGCACTGTGCTTGCATTATGACGACATGCTTAAAGCTAAAGTGACTAT
TCACGATACTTTGACGCAGTATTTGGATGCTAATAATTTTCCTGATGGTAATGCTTCTGCTGACCCAACACAGGAAAAGATGAAAGTCTTCTATATCGATGCCAAAAGTAGCGAAACTAATGAGGTTGTTGAGTTTACGCTAGCGTCACCGATGGATTTACAGGGGGTATTGATTCCTACGCGTCAGCTTCATTCTGTCTGCACGTGGTGTATTCGCGGAAAATATCGATCGGGGGATGGGTGTGATTACTCGGGAACAAAGTATTTCGATAAATACGGAAATCCTGTAGACGATCCATCACTTGATGAGTGCTCAGGAATGCTAGCAACCGGATGCAAGCCCCGCTTTGGCGAAAATAACGAATTGCCGTTCGGTGGGTTCCCCGGCACGTCTTTACTGAGGACATAGCGATGAGAAAGAAAACGCTTAGCGCGATGATGGCTCACGCTGAAGCTGAGCATCCGAGAGAGAGTTGTGGCTTGATAGCTCAGAAGGGGCGAGTAGAGCGTTATTTCCCTTGTCGTAATTTAGCCGCAGACCCGACAGAACAGTTTTATATGTCACCTGAGGATTATGCAGATGCGGAGGACTGGGGAACGATTACGGGCATTGTTCATAGTCATCCTGACGCAACAACACAGCCGAGTGAATTAGACAAGGCTCAGTGCGACGCTACAGGACTGATATGGCACATCATTAGTTGGCCTGAGGGGGATTTACGCACGATTAAACCCCGTGGTGAATTACCCCTGCTAGGTCGCCCGTTTGTCCTTGGTCATACGGATTGCGGCGGTTTGATCATTGACTACTACAAACAAGTCCATGGGATCGATATCCCTGACCATAGAACTGAGCGGGTATGGTGGGAATCCGGTGAAGAAAATATCTATATGGATAATTGGTATGCCGCTGGATTTCGAGAGTTCTCTGGTGATGCGAAGCCGGGTGATATGGTAATCATGCAGGTATCCGCCCCAGTAGCTAATCACGCCGGGATCCTTCTTGAAGATAATATGCTACTTCATCATTTATACGGGCAGTTAAGTCAGCGAGTGCCGTATGGCGGTTACTGGCGAGAACGAACAATAAAAGTACTACGGCACAAATCATTAAAGTAAAAAGTATATCAACCAAGCCCTGTCACTCGACGGGGCTTTTTTATATCTGAATTTCACCGCGCATCGCAGCGCATTTAAACACAGAACCTTGCAGAAAGTGAGCCTGAGAAAACCGTTATTGGTGCTTCTGTGGGCGGCTTTTCTGTGCGAGCAGGTTCACTTTCTAGAAGGTAAAACACCATGAGTAAAAACATTTCCGTAGTACCTGCGTTCGACTTTACTAATATGGTTTCAATTGCAGGCAGTCAGGTAGTTACAACCTCGGCAAAAATAGCTGAGTATTTCGGTAAAAAACATAAGACAGTATTAAGGTCTATTAGAAACCTGAAATGTTCGGAAGATTTTACCCGGCGCAATTTTGTGCCCACTGATTTCATTGATAAAAATGGCGATATTCAGCCAATGTACAATATGACAAAAGACGGATGTATGTTCCTGATTATGGGATTTACTGGCGAGGCCGCCGCAGCTATTAAAGAGTGCTATATCAATGCCTTCAACTGGATGGCTGAAACACTTAATCGCCGACAGATGATGGGAGAGCAAGCCCAGCATCAATACGTCATCAAGGATAAGGTATCAAAAGTCAAAGCATCTATGGGTAGCCGATTAATGAATCAACGTAAAAAGGAAATTCCATTGCTTAAGCAGGAGCTTGAAAGAGTGAAGGCATTAACAACGCCTGATCTGTTTGGTGGGTTGTTGTCATAACCATGCTGATATTTAAATAGAACAAAGGCGCATCCGTGCGCCTTGGTGGTTAATTATCTATTCGTCATTATCCTTTGGTGCTTGAGAAATATAATCTTTAACTGCTTTTTCTTTCAACTTCTCAAGATATTTATCAGTAATTTTACTTATTGATTTTTGCGCTATTTTTT
CCATTTCTTCTCTTAGCATATCTAGCGCTACACCTTTTATATGGGTCACTGGGCTATTTTCTTCGGCATAGTTCATTAAATTACTTGGATCTTCAGCCATCTTCAAAGCAATATCTAGCCGCAAAACAATCTCTGCGTTCATTGAGCGTTTGTTTTTTTTAGCCAAAGATTCTATTTTATCTTTTATTTCAGCAGGTAAGCGAATCCTAAGCTGTGGGTCATCTCTGCTCATGCGATAGCCTTTATTAAATATCAATATTGATATTTAAATTATGCCCCACCGTGGGGTTGACTTCAATGACGCACCGTGTGACAATGAATTTATGCCTCAATGTGGGGCATTAAGAAAGGAGGGTAGAAGTGCAAAAAGCAAAAGATATGTATCAAAAGAAAATGAGGATTCCTGAGGATGTGCGGCTGGCGGTGATAAGCAACGGGGAAAAAGAGAGTCGTAAATTTAATACAGAAGTGATTCATCAACTAAGAAAGGCGTATGGGCTTATTGAGAGCAAACAAAATGCAGAAGTCCAATAAAGCGAAAACCCCAACTGCTGTAACAGTTGAGGCTTCTAAAAATTTGAGTCTTAGGGAAACAAATCTTATGAACAATATTACTAGTTCAAGTAACGCATGTCCAGCCGTAACGGTTCCATTTCATGGTGCATCATTATTTTTAACTAAATATAATGACGAGCCTTATGTGCCAATGAAGCCCGTTGTCGAAGGTATGGGTTTGGATTGGAAAAGTCAGCACGCAAAGATTTCTCAGCGTTTTTCTAAAGGTATGGTGGAAATCACCATACCTACAAAAGGTGGGGATCAATTAATGTCTTGCCTCCCACTTCGTAAACTTCCCGCTTGGTTGTACTCCATCCAACCAAATAAAGTTAAAGCAGAGATCCGCGATAAGGTACTCCAGTATCAAGAAGAGTGCGACGAGGTGCTTTGGCAGTACTGGACTAAAGGCGAAGCTAAGAAACCCGCTGTCCGCGCTGCTGTTAAATCCTATCTTCCAGAGTACAGAAAAGCTAGGGCGGCAAAGATGGCTGCTGAAACGATGGCTCTAGCCCTGTCATTCATGCCTAATCTGAGTGATATCTCCAAACAAACAGCAATGGCCAAAGCCGTCAATGACGCTGCTGGTGTTGAATTGTTACCTCTACCAGCCATTGAAGAGCATTATTATACCGCTGGTGAAATAGGGGAGCTGGCTGGTATTAGTGCTCAGAAAGTTGGTCGCCTCGCTAACGCTAACAACCTAAAAACGGAAGAGTTTGGCAAGTATTTCATGGATAAGTCCGCTTACTCATCAAAACAAGTCGAAGCGTTCCGCTATAACCAGAAAGGCGTCGAAGCTATCACTATGTTAGCTAGGGCGGCAGCATGAACCCAGCCCTTAAAGCCAAAGAGTATGAATCTAAGTTAGAGATGGCGGGTACCATAACCGCGCAGTTGCAATCCTATCTGTCTTTATTGCTAGAGATAGAGAGGGACGAAGATAATATCAACTTACTATCCGTGGCATTAACTGTAGTGAGTGAGTTAAAAAGTACACTTGAGCATTAATAATAAAGCCTCCCGAAGGAGGCTTTTTATATCTAAAGGAAAATGAAGAATAAATGTTTTTTAACAACTGGGTCTGCTTGTGATTTCGCTTTATAATTGACTGGTAATAATTACCTTAGGAGTATTTGGTATGTCATTAAATAGAAATGAACTAGCTGGTTATTATTCTAGTAAATTCAGTCAACTTGCCGTAAAAAAACATAACGGTACATATGATGAAGAGGCGCTTGAGGTTTTAATTAAAGAAGCAATGGATCACATGAAAGTTTGGGTGCAAATTCCTTATGAAGAGAATCTCACAAACTTTATTGCTCAGGTAAAATATTTTGGTGGTGCTAAGGATGTGGCTTACGAAAGTAGGCAGGAGTTACTAAAAGACGCACTCCCCCTAATAGAAAAGTATAAATAGCACAAACCCTCTCCGGAGGGTTTTT
TATTTCCAGTAACAATGACAGGTGATTGATTGCTTTCAGGTAGGTATGATGAGCAGCATGGTGTTAAATCATTCGGATTTTTAGATGGGAGGGGGGATGAAAGGTTTCGGAATATTAATTTTAATTACTGGCCTTATTGCAGCGATTGCTTCATTGAGCATGGATGTAAGTGTGGTAACAGGATATGGCAATAGAGTAAACAATTTAGGATTAATGGCGCAGCGACAGAATTTCATTTTAATCAGTTGTTTTGCTGTTTTTTGTGGGTTGCTCATGGTTATATTTAGTAAGAAAAAAACATCAGATAATAACAATTATATTAAATGCTCATTTTGTGCGGAAGAAATCCTTCCAGAAGCGATAAAATGTAAGCACTGCGGTAGTGATGTAAAGAAGGAAAGTATAGAGTTAGACAATACGATGGATTCAAGTGAGATTATCGATTTTGATTGTAATAAGCTTGTTATAAAAAAGAAGGTTGGATTCATGATTGATGATGAGTCTGTTATGGAGTTAGCCGACCTACTAAAGTCAAAATCAAAACATGTCGATAAACACCATTTATACAATATGTTCGAAAAAAATATCTTAGAAATTAAATTACGTCTTCCAGATGAGATACATGAAACATTTATGAGAAGAATAAAGTACTGGATAGAAATGTGATAAAAAAGATATATCAACAGGGTAATTTAATGAAAAAACTATTTTTACTACTAGCCGCGGTGGTGTTGTCGGGGTGCTCTACAACTCCAATGCCTGCAAATTTAGCTAAAGAAGTTAAAGCATCTACCTCTTTTCAGGTTGAGCAAGGAAAAGTACCAGTAACCATAGTTCGTGATAAGGGGCATGTTGGAAGTTATTGTTTAATAACGGCATTCATCAACGGTGATCCTGTTGCAGAATTAGACGCAGGAGAGAAAGTTATTGCATATGTTAGCCCTGGTGAAGTAATCGTAGGCGCTGGTTTTATGGGGGCGGGGTTATGTAGCGGAGATCCTAAAAAAGAAAGAGAGTTCATAATTAAAGATAAGCAACCAAGGACGCTCCGCATATTTACGGATCAGAATGCAAATGTAGATATTTTGCCAACAACAGAAAATTAAAAAACACTTAGTCCTCTGCTCCTGTACTTATTAAGGAATGGCTAATGAAAAAGCTACTCTTATTATTGGCTGTTGTGATGGTAATGGCTTGTACCGCTAGACCATACATGTCCGCTCATGAAACTTATGATTCTAGATATCTTGTTCAGAACAAAAGCAAGACACACGTAAGGATACATCGAGTAAATCAAATATCAGGATCCGGTCTCCCTGATGATTGTCCTTTGGTCCTTAAAGTTGATGGTATTGATGTGGTTGGATTGCAGCAAAATCAGTTTGTAGATCTGTATTTACCAAACGGTGAGTATTCTTTATCTGTAAGATTTAAATGTGCGTTAACTGAATGGAAAAAGTCGGTCACACTTGTAGCTGATGGCGTTCCTCAGGTGTATGAAACTGAAACTGGAGCCTCAGGGCAATATCGCATGTGGCGAGCAAAGTAATTCAATTAATTAAAACAATTAAACCTCCTTTTTAGGAGGTTTTTTATTTGGAGTAAATCATGCAAGAAGTAATGACACAAATAGAGTTACACGGTTCACTAGGTAAAACCTTTGGTAAGATTCACCATCGGCTAATCAGCATAACTAAGGAATCCGTCAGAGCTTTAGCTAAGACCATCCCGGGCTTTGAAGCATATATGATTAGCAGCGAAGCTAGAGGGATAACTTATGCTGTATTCAAAGGCAAAAAGAATATCGGTGAAGATGACTTGGGTTACCCAGTGACAGGAGAGGTAATTAAAATTGTGCCTGTAATAATTGGGAGCAAAAGGGCTGGTATGTTTCAGACTATTTTAGGGGCGGTTCTTATTGCTGTCGGTGCTATTTTAAACTTTACACCCGCAGCAGCAGCCTCACCTTTTCTATACCAGATGGGAGGGGCAATGATGCT
TGGCGGCGTAATCCAGATGCTATCCCCACAACCCGGAGGGTTAGCAATGAAGAACGATGCGGATAATAAGCCATCTTATGCGTTTGGTGGAGTAACTAATACGGCGTCACAAGGCTACCCTGTTCCGGTGTTATATGGTAAGCGTCGTATAGGCGGAGCTATTATTTCTGCTGGGATCTATGTCGAGGATCAGCAGTGA
Protein sequences of DBSCAN-SWA_3 >LR134531|1349048:1369172|1363384_1364092_+|VEJ54638.1|DBSCAN-SWA MRKKTLSAMMAHAEAEHPRESCGLIAQKGRVERYFPCRNLAADPTEQFYMSPEDYADAEDWGTITGIVHSHPDATTQPSELDKAQCDATGLIWHIISWPEGDLRTIKPRGELPLLGRPFVLGHTDCGGLIIDYYKQVHGIDIPDHRTERVWWESGEENIYMDNWYAAGFREFSGDAKPGDMVIMQVSAPVANHAGILLEDNMLLHHLYGQLSQRVPYGGYWRERTIKVLRHKSLK >LR134531|1349048:1369172|1351173_1352562_+|VEJ54620.1|DBSCAN-SWA MANNDITFIRPEHKAASLIWETIRDVCRGPYAVKQRKHKYLPKLDPANTSEENNRRNDDYIRRAVFYAITGHTKNGLIGMAFRRDPTVTIIDKIEYLKTNANGAGISIYQQAQSVLESVLEVGRDGLYVDYSSDMKGAIILSYRAEDIINWRTERINGRDKLVMVVLRECVEEPDGFGFKDRIQYRELAMDEGRFVCRVWRNTGPKNTGVYVVDSEYYPVVQFGGAWDEIPFTFVGAQNNDPTIDESPLLALTEINLGHYRNSADYEDSLFFCGQVQPWISGLSEEWRDWLQKNGVVLGSRSPILLPKEGAAGFNQAEPNMIAKEGMDSKRDYMVSLGAQLVEQNGAVKTATQANGDQAASTSVLGICCANVSEAYTQALNWCGKYLGVNDAKASYSISQEFIQKVADSGMLTAIVAAWQSGAIRDGDMIRAMQKLDIIDPESNPNEILDELKNQGPNLTGD >LR134531|1349048:1369172|1357882_1358542_+|VEJ54631.1|tail|DBSCAN-SWA MGFALPNGAHVYLAATYEAEQVISGISNATEAVVTLSAAHDIAVGDIVQLKSGWSALDNLVAKVSKVDSSTITLGNVNTSNVGKFAAGGGVGSLRKVLTWIELPQITEVSNSGGDQQNVQIQFLSDDNQRNLNTFKAAMSQTYTIAHDSSLPVYDLLRDADESGDTIAAYMYVPKAKENRYWSSTVSFNGMPTTAVNAVETVAVVLNLQSAGMTFYKAE >LR134531|1349048:1369172|1353672_1353828_+|VEJ54622.1|DBSCAN-SWA MNAGEIKEQKERELAEIITQKAKELTSETGLKVFNIEVTQIYPLRVRFIIQ >LR134531|1349048:1369172|1349048_1349615_+|VEJ54617.1|DBSCAN-SWA MAKLNDKQELFCREYIVDLNATQAAIRAGYSEKTAQEQSSRLLSNVMIQSRIAELNADRLERVRIDADYVLNRLVEIDQMDVLDILTDSGDLKPVADWPKAWRTTLSGLEVTAMMGDGDSAALLKKIKWPDKVKNLELLGKHISVQAFKEQVEQKVTATHNIMPVPTCSSADEWEAAAQQQQGEVLGQ >LR134531|1349048:1369172|1364272_1364803_+|VEJ54639.1|DBSCAN-SWA MSKNISVVPAFDFTNMVSIAGSQVVTTSAKIAEYFGKKHKTVLRSIRNLKCSEDFTRRNFVPTDFIDKNGDIQPMYNMTKDGCMFLIMGFTGEAAAAIKECYINAFNWMAETLNRRQMMGEQAQHQYVIKDKVSKVKASMGSRLMNQRKKEIPLLKQELERVKALTTPDLFGGLLS >LR134531|1349048:1369172|1362623_1363382_+|VEJ54637.1|tail|DBSCAN-SWA 
MSLNSDYQKLEPGNAVRLIDVDGTKFGVSDVMRFHAHNIPHTPEEIEAAGGDESKLPAKSIWWQGNEYSAWPYEIEGIETSTNGGSASPKLSVANIDSSITALCLHYDDMLKAKVTIHDTLTQYLDANNFPDGNASADPTQEKMKVFYIDAKSSETNEVVEFTLASPMDLQGVLIPTRQLHSVCTWCIRGKYRSGDGCDYSGTKYFDKYGNPVDDPSLDECSGMLATGCKPRFGENNELPFGGFPGTSLLRT >LR134531|1349048:1369172|1368123_1368522_+|VEJ54649.1|DBSCAN-SWA MKKLLLLLAVVMVMACTARPYMSAHETYDSRYLVQNKSKTHVRIHRVNQISGSGLPDDCPLVLKVDGIDVVGLQQNQFVDLYLPNGEYSLSVRFKCALTEWKKSVTLVADGVPQVYETETGASGQYRMWRAK >LR134531|1349048:1369172|1353939_1354716_+|VEJ54624.1|DBSCAN-SWA MLFRNIERKYYSAAGEGGDAGGGSTLEITPEIQALIDARVNESVTGLKTKNSELLGKLKEQGDNLKRYDGIDPDAVKTILQRFSDDEEAKLIAAGKIDEVLDKRTERLRADVDKKLKSANDRAEKAELFSKKFSDRVLGDAIRSAALKTGALPGAADDIILRAKGVFTLSEEGEAIAVDKDGAALLGKDGRTPLTPQEWAESLKDIAPHLWPQAEGTGAGGHKPSSGGALKRSEMSASQKAEYIRANGQQAFLKLPKE >LR134531|1349048:1369172|1359244_1362211_+|VEJ54635.1|tail|DBSCAN-SWA MADVATITLQVNTAPLAEGNKALDQFGAKAGHAAKSADSFGDSNKKAAKSTEEMAKSVDESHRRIAEFAKKLRDSGNSAHQAADRQDKLADSFFKQIDAIKEVDGASAKLKRIQDELGRANRAGELGQGNYVTLLGETTDKLNQVEKAESTLSRTKAEFLRRLKEQVATQKMSSEELLRYRAAQLGVGSSAEVYIKSLSASNKELTNLRTNSVSARREFGMMAAQLARGDISGLRGTALTTAGRSGLVEQLTTLRGLAVGGLLAGVAVGLVAVTKAYFQGAKESQEFNKQLILTGGYAGQTAGQLQALAASLSGNGVTQSNMAESLAKVVGTGSFSGGDISMIAGVAARMEDAVGQSIDETVKHFQKLKDEPAKAAQELDKSLHFLTATQLENILALENQGRASEAAKVAMDAYASAMNDRAVQIVDNLGWLESAWKYVGDAASGAWDKMLNIGRMESATAQLADVRRKILELQLDPKKQAIAAYNGDAGLDELRLQESLLSEQAYQEQMSAGQGKAIADEQERQKLRISQNDALNRKYENSQEKHLRTINEINNKYAHADQSVREKAVQAENKRFKEELDRETKKGRSSGAKSYTDDSATKMLLDSKQRLASLQEQMGATSQLTEQEKRLAEFTQQIAELKEKRILTADQKSILTRSDEIKASLELESSESRRLKNAQELAKGHETALKFIQQQEAAISAMQNSAGMSNREAQRNREREQTRMIKASPEDRDLALAKLEERYQREDELRGDWLAGAKKGFSEYLDSATDTYSAMSSVAQSALGGMSNMLTDLVTTGTTSFKQFTVSILKMIVDIINRMMIAYALQQAMGWIAGGASGGVANGQTVPTTPGNSIVVGYDSGGYTGDGGKYEPAGVVHRGEFVMTKEATARIGVNNLYSMMRGYADGGVVGKAPMHGLQSSGSSSGGVTVNMGGIVMNTGSDGNQQSGSGNTNTGAAIMKQLKPVIIAAITEQAQSPGTPLWNAISGGR >LR134531|1349048:1369172|1368581_1369172_+|VEJ54650.1|tail|DBSCAN-SWA 
MQEVMTQIELHGSLGKTFGKIHHRLISITKESVRALAKTIPGFEAYMISSEARGITYAVFKGKKNIGEDDLGYPVTGEVIKIVPVIIGSKRAGMFQTILGAVLIAVGAILNFTPAAAASPFLYQMGGAMMLGGVIQMLSPQPGGLAMKNDADNKPSYAFGGVTNTASQGYPVPVLYGKRRIGGAIISAGIYVEDQQ >LR134531|1349048:1369172|1365333_1365507_+|VEJ54642.1|DBSCAN-SWA MQKAKDMYQKKMRIPEDVRLAVISNGEKESRKFNTEVIHQLRKAYGLIESKQNAEVQ >LR134531|1349048:1369172|1365490_1366360_+|VEJ54643.1|DBSCAN-SWA MQKSNKAKTPTAVTVEASKNLSLRETNLMNNITSSSNACPAVTVPFHGASLFLTKYNDEPYVPMKPVVEGMGLDWKSQHAKISQRFSKGMVEITIPTKGGDQLMSCLPLRKLPAWLYSIQPNKVKAEIRDKVLQYQEECDEVLWQYWTKGEAKKPAVRAAVKSYLPEYRKARAAKMAAETMALALSFMPNLSDISKQTAMAKAVNDAAGVELLPLPAIEEHYYTAGEIGELAGISAQKVGRLANANNLKTEEFGKYFMDKSAYSSKQVEAFRYNQKGVEAITMLARAAA >LR134531|1349048:1369172|1356465_1356816_+|VEJ54627.1|DBSCAN-SWA MAINHQRMRSTATRLITQNGATYTLTRGGGVEVIGGVEVDIQPETHIITAIKSDYASGEIDGTLVLNGDVKMSATAEVEIRIGDLILIDGKSHRVIKPNPVKPASLLLCYKPQLRA >LR134531|1349048:1369172|1354729_1355938_+|VEJ54625.1|DBSCAN-SWA MNTTVNSDLIIYNDLAQTSYLERRQDNLDVFNASSNGAIVLDNVLIEGDFRKRAFYQLGGSIEHRDVDSTNKVTGKKIGAGESVGVKAPWKYGPYQTTEEAFKRRGRDVSEFSEIVGVDVADASLEGFIKYGIQALSASIGANPDMVVTANIETDGKKTLTKGMRKYGDKFGRIALFVMHSATYFDIVDQAIAEKIYEEAGVVVYGGQPGTLGKPVLVTDTAPIDAIFGLLPNAVVITESQAPGFRSYPINDEENLGVGYRAEGTINIDLLGYSWDESKGGANPDLTKVGAADSWKKHATSNKVTAGVMIKLEVEPVAVTGVALDQTTASVVVGATMSLAAIVAPADATNKKVTFSSSSVSKATVDADTGVVTGVAAGTAKITATSADGAKTAVCDVTVTAA >LR134531|1349048:1369172|1349611_1351174_+|VEJ54619.1|DBSCAN-SWA MNYNVVWKPLPGSQSLSLSCPCDEILFEGTRGPGKTAAQLARYRSRVGLGYGTFWRGVIFDTEYKNLADIITQSKRMYRLFNDGARFLASASELRWIWPTGEELLFRFGKEESDYWDYHGQEFPFIGFNELTKQPSADFYESMFSCRRSSFRPQDYPREDGSLLPNIPLETFNTTNPFGIGHTWVKKRFIDPAPRGTIIRDTQMVPNPQTQQEEEITLTRVAIHGSFKENPYLDPVYIATLMNIKDPNKRKAWVEGSWDVTSGGRFDHLWNESLHVIKPFNIPDSWTVDRSHDWGESKPFSNLWWAQADGTEAMLPDGSKFCPPSGSLILIGEWYGCPPDELNKGLNMSSTNVAKGVKWVDRRLAGEEVEEPEETKGQGQMHIMPGICKKVIRGPADGSIFNTADNELSIAQKMEAQGVKWLEANKKPGSRINGASLFADMLEAVVEGKKTESGMPEKPAFYVFDYCRGWISRIPTLVRDEKNPDDVDTKQEDHDWDATRYRVLHSPCREAFSINMRTTY >LR134531|1349048:1369172|1366669_1366948_+|VEJ54645.1|DBSCAN-SWA 
MSLNRNELAGYYSSKFSQLAVKKHNGTYDEEALEVLIKEAMDHMKVWVQIPYEENLTNFIAQVKYFGGAKDVAYESRQELLKDALPLIEKYK >LR134531|1349048:1369172|1355983_1356466_+|VEJ54626.1|DBSCAN-SWA MIITNPQSPDFNSYASTVDLAKFASSRNLELPENTESLLIKAMDYLNGLNWYGNRTAVTQPLPWPRKGVVFDGVRLPANTIPAQLATAQCMLAVEAIDGELLASSREASVKSERVEGAVTVTYAVTDGECFTASYPLVMGILNGLVAGRGFAINAIVRRD >LR134531|1349048:1369172|1367668_1368079_+|VEJ54648.1|DBSCAN-SWA MKKLFLLLAAVVLSGCSTTPMPANLAKEVKASTSFQVEQGKVPVTIVRDKGHVGSYCLITAFINGDPVAELDAGEKVIAYVSPGEVIVGAGFMGAGLCSGDPKKEREFIIKDKQPRTLRIFTDQNANVDILPTTEN >LR134531|1349048:1369172|1366356_1366539_+|VEJ54644.1|DBSCAN-SWA MNPALKAKEYESKLEMAGTITAQLQSYLSLLLEIERDEDNINLLSVALTVVSELKSTLEH >LR134531|1349048:1369172|1356817_1357411_+|VEJ54629.1|DBSCAN-SWA MSDNSQFMQSINLFIDKAKSNQEEVVRATGLRILAQLVNMSPVGNPEIWEINSTAGAYNQAVFDHNEMQRQDPDNLTKAGRLKKRARVNDSMEIKAPAGYTGGRFRGNWQVSFNQPADGVIDRDKKSDPGGNVTLGEGKAVIEMFKVGINAVYFANNVPYSYRLEMGHSTQAPAGMIRVTAAEIQQYVDQSAREVNK >LR134531|1349048:1369172|1357407_1357815_+|VEJ54630.1|DBSCAN-SWA MNTLTITELLESRLATVATGLGLKVSYENVQFEPFDDIYLESHVIPAKTNSIDLAGDSRVYVGVYQVNVVVKAGSGKSKAGKIANDIINAFPLNLELSRGDFTVYINSVPSAFPAVQGNATYSIPVSMNYRSDNS >LR134531|1349048:1369172|1359101_1359242_+|VEJ54633.1|DBSCAN-SWA MSGATGLDYGCLPQVMDFIGVSEDKATVFADIQVMESVALSIINKR >LR134531|1349048:1369172|1352563_1353676_+|VEJ54621.1|DBSCAN-SWA MATINEVLSDEIIAHSLFQSRYGTGVARKMVKLLNESDSELSARLIMALSEVNPNTITVKRLESLLFSVREVNKRVIDAMYMSLTDELMGFAKHEAIYQFNLFDSLLPHPVLQRFPLASITQEQVYAAAITQPFQGRLLRDWAENIEVDRMTRITNTVKNGYLAGDTVEQMARKVRGTKARNYQDGIIEASRRNVTSVVKTAVTHFAAVARNKFADNNRDIIKAKQWLSTLDNRTSHICIIRDRLKYTLDGKPIGHKIPYLQGPGRIHFCCRSMETFITKSWREMGIDIEDIREGTRASMDGQVPAETTYSEWLQRQSYSRQVQVLGETRARLMKDGGMRTDEFFNDKGEWLTLQQLRDIDERSFNEAGL >LR134531|1349048:1369172|1367069_1367639_+|VEJ54647.1|DBSCAN-SWA MKGFGILILITGLIAAIASLSMDVSVVTGYGNRVNNLGLMAQRQNFILISCFAVFCGLLMVIFSKKKTSDNNNYIKCSFCAEEILPEAIKCKHCGSDVKKESIELDNTMDSSEIIDFDCNKLVIKKKVGFMIDDESVMELADLLKSKSKHVDKHHLYNMFEKNILEIKLRLPDEIHETFMRRIKYWIEM >LR134531|1349048:1369172|1358625_1358937_+|VEJ54632.1|DBSCAN-SWA 
MVTKFQLQPKPTFKADVTIPRAGEDDGVLTFTFKHHPLKQLSEIEAIEGTTAVDFVMQIAEGWALSDDFTHGNVVTLLENYPQAGTAIVKKYYAEMVGNREKN >LR134531|1349048:1369172|1364860_1365205_-|VEJ54641.1|DBSCAN-SWA MSRDDPQLRIRLPAEIKDKIESLAKKNKRSMNAEIVLRLDIALKMAEDPSNLMNYAEENSPVTHIKGVALDMLREEMEKIAQKSISKITDKYLEKLKEKAVKDYISQAPKDNDE >LR134531|1349048:1369172|1362279_1362627_+|VEJ54636.1|DBSCAN-SWA MAIDTFFWRTQIQSGMEGSFSYKTRTAQFGDGYKQIVGDGLNPESQSWPITLTGLKSEMLPALDFIRSHVTKSFIWTSPLDEAGLYRVAADSVKYYPLSAQAITINATFERAFAP |
28 | Cronobacter_phage(28.57%) | tail | NA |
Homologous phage analysis in the prophage region
Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
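The keyword rule quoted in the prophage rows can be illustrated with a short sketch. It assumes a case-insensitive substring match against a protein's annotation text; the database's real matching rule is not stated on this page, and the function name `matched_keywords` is hypothetical.

```python
# Keywords exactly as listed in the prophage-region note on this page.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation):
    """Return the phage-related keywords found in an annotation string.

    Assumption: a case-insensitive substring match. The actual rule used
    by the database may differ (for example, whole-word matching).
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def is_phage_related(annotation):
    """True when at least one keyword occurs in the annotation."""
    return bool(matched_keywords(annotation))
```

Under this assumption, `matched_keywords("tail fiber protein")` returns both `tail` and `fiber`, while an annotation such as `hypothetical protein` matches nothing and would be left uncolored.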
DBSCAN-SWA_4 |
2413794 : 2423607
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >LR134531|2413794:2423607|DBSCAN-SWA CTTAAATAACGGGATTAACGGAGGCTCCGCGGCCGATACCATAATAGGTAAACCCACGTTGGTTCATTCTTTCCGGTTCGTACAGATTACGCCCGTCAAAAACAACAGGATGTTTAAGCGTAGCTTTAATCAAGTCAAAATCTGGCGCACGGAAATTTTGCCATTCCGTACAAATAATTAATGCATCAGCGTTATGCAATGTTGCCTCTTTCGTGCCCATTAAGAGCAAATCGTCACGATTGCCATATATGCGCTGGGCTTCTTCCATGGCTTCTGGATCGAAAGCCTGAACTTTAGCTCCGGCTTGCCATAACGCTTCCATAAGAACCCGACTGGATGACTCTCTCATATCGTCAGTATTTGGTTTAAATGCCAGTCCCCACAGCGCAAACGTCCGACCTTTTAGATCGCCATTAAAATGGCGCTGTACGAAACTAAATAGTTTATGTTTTTGTCTGGCATTAACGGCTTCAACAGCCTGAAGTAGTTCTGGCTGATATCCGATATGTTGAGCAGTACGGATTAACGCTTGTACGTCTTTAGGGAAGCAAGAACCGCCGTAGCCACAACCAGGATAAATAAAATGGTAGCCAATACGAGAATCCGAACCAATACCTTGACGAACTTTCTCGATATCAGCACCCAGCATCTCGGCTAAATTAGACATCTCATTCATAAAGCTGATTTTAGTCGCCAGCATACAGTTGGCGGCATATTTAGTCAGCTCGGCGCTGCGGATATCCATAATCACCATTCTGTCATGATTACGGTTAAAAGGACTGTACAGCTCACGAATCAAATCCAATACTTGCTCATTATCCGTACCGATAACAATGCGCTCCGGTTTCATACAGTCAGAAACAGCGGCACCTTCTTTGAGGAATTCAGGATTTGACACAACGTCAAACGATAGTTCTGACTGACGTTTTTTCAAGGTTTCACTAACAACTTCATGGACTTTGTCTGCGGTTCCTACCGGCACGGTAGATTTGTCAATAATCACTTTATAACTTGTCATATGCTCAGCAATGGTGCGAGCAACAGCGGTGACATATTTTAAGTCAGCGGAACCATCTTCATCCGGTGGAGTACCAACAGCGATAAATTGAATTTCGCCATGAGCAACGCCTTGTTTAGCATCGGTGGTGAAAATCAGCCGTCCTGCTTCATAATTTTCTTTAACCAAAGGGGTTAGCCCTGGTTCAAAAATAGGAATAATACCTTTTTTAAGATTGGCGACTTTAGTTTCATCAATGTCAACGCACATCACATCATGGCCAACTTCAGCCAATACGGCGGCCTGAACCAAACCAACATAACCAATACCAAAAACTGTTACTTTCATCTTATGATATCCACATATACCGTTAATTAGTTCTTCGCAGACAGTTGCAGCTGTTGCAACCAAGAATGAAATTCGCTACCCAGACTTGCATGACGTAAGCCATACTCCACAAAAGCCTGCATATACCCCATCTTGTTGCCGCAATCGTGGCTGCGCCCTCTAATGTGATAAGCATCCACCGGCTGGGACTCCATTAATAATGCAATCGCATCAGTTAACTGGATTTCATCACCTGCTCCCGGTGGAATTTTAGATAACAGCGGCCAAATATCAGCAGACAGGACATAGCGCCCCACAATCGACAGATTGGATGGAGCCTCAGTGACGCTTGGTTTCTCTACCACTTTAATAATTGGCGCACTTTCTCCCGCTGTCAGGGCATAACCATCACAATCAACAATACCGTAATCACTCACTTCGTTAATCGGAACAGGTTCAACCATAATTTGGCTATGTCCAGTTTCTTCATAACGAGAAAGCATCTCACTGAGATTATCTTTCTTCGGATTAGAGTCATACTCATCAATAATCACGTCAGGCAGCACGACGGCAAAAGGCTCATGTCCTACTAGCGGATATGC
GCACATAATGGCATGACCTAACCCTTTTGCCAGCCCTTGACGTACATGCATAATGGTCACATTACTGGGGCAAATAGCCTGTACTTCATCTAGTAATTGGCGCTTAACTCGCTTTTCCAAAATAGCTTCCAACTCAAAGCTGGTATCAAAATGGTTTTCTATCGAGTTTTTAGAAGAATGGGTTACCAATACAATATGATCAATTCCGGCCGCAATACATTCATTCACCACATACTGAATAAGCGGCTTATCAACCAGCGGTAACATCTCTTTCGGAATGGCTTTGGTCGCGGGTAGCATCCTGGTTCCTAATCCAGCAACCGGTATGATGGCTTTTTTGATCTTAGGCTGTGACATAGAGTAAATTCTCACTATTCAAAATATCAAGGGCCTGTTCAGAGATAAGACGCTCTGGCGTTCATAACATGAATCCGGCCTTGTTCAGCATTGTCCTGCCCCATATTTAGACATTCAGTGTCATAAGATTATGGAGTATAGCTATTCAGAACAAATAGTTGCCTCAGTATATCAGGTAAGGTGATATTATCCGATAAACTTCCTTATCTTAATAATGCGATAAATCATCAATATCTATGATTAAGGCGTGACATTTAGCTGTAGGCGGCCGCCATTTCCCCATATACGACAACTCCAACGTTCGGCCACCAAACTGGTTTGGCTGGGATAGAGGGTTTGTAATGTTCCCATAGGAACACTATCATTTAATTCGTATTTTCGATCGTTAGCATCAATAGAAATATGCAATCCCGCGGAAACAACCAGTAGCTTTTGCTTCGGCACATGGTAATAGCCAGCTAAAAATGGAAACTGCCCTTCCAGATAAGATTGATGTAATAACTTATTGATACGATTAAGTACCGTCGACATTTGCGGCAAGGCTTCATTCTGCTCGGCTAAATGTTCTTGCAATAAATTATTGAATAAGGCGCGTAACATTAATGCCGCCAGTACCCCTTTATTACCAACTTTAGAGATGTCCAGACAGTAAAAAGCGAAGTCATCGTTAGATAGCGCTGCGATGTCAAAAACAATACCTGGACGATCTACCAACGTTAATTGACGATAGTTGATATTACAGCCAGAAACGACCTGTTGTGCCGGAGGTTGTAACTGCTGCAGCAAGTGCAATGCTTCTTGCGGTTTTTCGCTTAAATCTGCCCAATCTTCCAACATGGCGATTTCATCGCTGGCAGGAGACGAGAAGAGATAGGGATAAAAACAGGCTAATAGTGCCTCTCTCAGGTGATTTAAATCAGAGATTGGCTTGAGTAAAACATCATGAACACCGAGGCGCAGCATTTTATCGACATCGGTGATTTTCTCCGTGGCCGAAATGACAATAACCGGGATCTGACATCCCTGATGTCTCAGGCGGTCAACTAAGGTAATACCGTCCATATTCGGCATATTTAAATCACAAATCATCAATTCCGGTTGAAACTGTTCAACATAGACTAATGCTTGCTCACCATCTTCCGCTTCACCAATGACGGCCCCAAGTGAAGTAAGGTATCCAGCCAGCATTGAGCGGAATACAGCATCATCTTCAACAAGCAATATGTGTTTATTAACTAAAGGGGGTGTCATATCAACTTCCATGCTTCACGGTTTATTATATTTTGGTCCAAAATTGGCACTTATGCCTGCTAAATTCATTATTTATTCTTGATAATTATTATTGTTAGGCTTTTCAATCTTCATCATGAGATTCCTCATTGCAGTTTTCCTGCCACTCTGTTTAAGTTAAAGTTCTGGCTAATCATCTTTCTTATGACACAAAGGAGTTCATATTGTTCAACCCGTGCCCTTGCGATAGTTTAAAAGAGTTCAGTTTATGCTGTCAGCCTCTTATATTAGGACAAACTATCGCATTAACCACGCTGGATTTAATGAAATCCCGTTACAGTGCCTATGTCAAACATGACGTTAATTATTTAATCTCCACATGGCATCCGGACTATCGTCATCCTGAGTTAGCG
GAGTCCCTCACCCACAGCTTTGAGAATACTGAATGGCTGGGATTAACCATCATCTCAACCGCAGACGGCTTAAATGAAGGATACGTTGAGTTTATCGCTAAATTTCGTGATACCCAGACACAAGAGCATCATGCCATTCATGAACGTTCTCATTTTATAAAACAGAAAGACAGTTGGTACTATACCTCGGGTGTTAAACCTTCGTTTGGACGTAATGAGCTATGCCCTTGCCATTCTGGAAAAAAATATAAAAAATGTTGCGGAAAATAATTGAATCAAACCATCGGCTATTGAATCGCGGTATGCCACAATGAACGCCGGTTTATACCTATAACCCCCCTTATAATAAAATAAGTAACAGCACACATCCAAAGGCCAGAAAAAACATGCAAAAGAAAATACTACGTACCATTTGCCCTGATGCTAAAGGACTTATCGCCAAGATAACCAACATTTGTTACAAGCATCAATTGAACATTGTGCAAAATAATGAATTCGTCGACCATCGCACTGGCCACTTTTTTATGCGTACAGAGCTTGAGGGATATTTTAACGATCAAACTCTGTTGGCCGATTTAGACGGTGCCTTACCTGAAGGCTCCATCCGCGAGTTGAATGAGGCCGGTAAACAAAAAATCGTTATTTTAGTCACCAAAGAAGCCCACTGTTTGGGCGATTTACTGATGAAGAGCACTTACGGTGGTTTGGATGTTGAAATTGCTGCGGTTATCGGCAACCATGATACATTGCGCACTTTGGTGGAACGCTTTGATATTCCATTTCATCTGATTAGCCATGAAGGCTTAACCCGTGATGAACATGACAAATTAGTGGTTGAGGCGATTGACGAGATTGCTCCTGACTATGTGGTTCTAGCCAAGTATATGCGTATCCTGACCCCTGACTTTGTGCGCCACTATCCTAATCGTATCATTAATATTCACCATTCATTTCTACCAGCCTTTATCGGTGCGCGTCCTTATCACCAAGCTTATGAACGTGGCGTGAAAATTATTGGTGCAACGGCTCATTTTGTTAATGATAATCTTGATGAAGGACCAATTATCATGCAGGACGTTATCCACGTCGACCACACATACAGCGCAGAAGATATGATGCGTGCCGGTCGGGATGTTGAAAAGAATGTATTAAGCCACGGCTTACATCATGTGCTCACTCAGCGAGTTTTTGTTTACGGTAATAGAACCGTCATTCTTTAATTACAATCCTCACTGAAAGTCAGGCGGCATGTCTTGTTCATGCCGCTATAATCAAAAGTACAGTAAACCACCGCTTCACGTTGGAAAAATCAGCAATCACTTATTTTTTCCATCAGAATGCTTTACAGTCCGCCGGACTTTGGTATGATGCGCGTCCCTGCTGTAAAGCAGAATAATAAATATCGTTATTGGTGGGATTCCCGAGCGGCCAAAGGGAGCAGACTGTAAATCTGCCGTCACAGACTTCGAAGGTTCGAATCCTTCTCCCACCACCATCTTATTTGATTCTTATCTTTACCCCATTACTACTCTTGCGCCAATCATTTCTCCTTAGGAGAAGGTTGAGAAGCTTCGACCAAGGTTCGAGCCGAGCAACGCGAGACAACGTTGCGATCAGCAACGGCCCGTAGGGCGAATCACAACGTGATGAGTCATCCTTCTCCCACCACCATCCTTATTCAAGCATCTGGTTTACTCCGCAACTTCTCTGGTGATTATTACTCCTCGGGAGAAGGTGAGAAGCTTCGATTAAGGTTCGAGCCGAGCAACGCGAGACAACGTTGCGATCAGCAACGGCCCATAGGGCGAATCACAACGTGATGAGTCATCCTTCTCCCACCACCATCTTATTTGATTCTTAGTTTTACCTCACCACTCTTATTCCAAAGTGATGCATTAATAGCATCAACAATTCATCGATTTTTTCATCCCTAACTTGAAGCGAGAGATTAAATAAGCTATACCAAGCCCCTTATATTCATTTATCAGGGTGTCTCATGC
GTAAAAATATTTGTGTTTATTGTGGTGCTAGCACCGGGAATAATCCTGTCTATGCTGAAGCGGCTCAACAACTTGGAAAAGCCATTGCACAACAAGGTCGTCGCTTAATTTATGGTGGTGGTAACAAAGGTCTGATGGGCATTTTGGCTAATTCAGTATTAGAGGCTGGCGGTGAAGTAACCGGCGTTATTCCTGAGCTTTTGGTTGAGGCCGAAACCGCGCACCATGGTTTAACCAAGCTGGAAATTGTGGCCGATATGCACATTCGCAAAGCACGCATGAGTGAGTTGGCCGATGGGTTTATTGCCCTTCCTGGTGGCATTGGTACCTTTGAAGAGCTATTTGAGGTGTGGACCTGGAGCCAAATTGGCTACCATAAAAATCCGGTTGGCTTATTGAATATTCAAGATTTTTACTCACCGATGAGTCAATTCTTGCAGCATGTTGTTGATGAAGGATTTGTACGCCAGAGCTATCTGAATACGCTATTAGTCAGTGATTCGGCTCATCAGCTGCTGGAACAGTTTGATCAATACCAGCCACATAACCTCGATCGCTGGGCTAAATAATTAGCCTCTGATCTGGCCGATGTAGTCAACAATACTACATCGGTTAATGTAACAATAAGACTCAATCTCTGTTACTATTGTTCAACCTTCTTGGCAGATAAAAGACGATACTCATGAAATTTATATCATTTAATATCAATGGATTAAGAGCCCGCCCGCACCAGCTTGCTGCGATTATTGAACAGCACCAGCCGGATGTGATCGGATTACAGGAAACCAAAGTCCATGATGATATGTTTCCGCTTGAGGAAGTGAGTCAACACGGTTATCACGTCTTTTACCATGGTCAAAAAGGCCATTATGGCGTAGCTCTGTTAACCAAAGCAGAACCACTCGCGGTAAGAAAAGGGTTTCCAACCGATGATGATGAAGCCCAGCGCCGTATTATCATGGCCGATCTCGCTACCCCAATGGGTACATTAACCGTTATCAATGGTTATTTCCCTCAGGGTGAGAGCCGCGATCACGCCACAAAATTCCCTGCTAAACAGAAATTCTACGCCGACTTACAGTCCTATCTGGAACAACAGTTAGATCCGGCTAATCCAGTAGTGATTATGGGCGATATGAACATTAGCCCAACCGATAACGATATTGGCATCGGTGAAGATAGCCGTAAACGTTGGTTACGTACTGGAAAATGCTCATTCCTACCGGAAGAGCGGGAGTGGATGGCTAAACTATTGGACTGGGGATTAGTGGATACTTTCCGTCATGCCCACCAAGAAACAACCGACCGATATTCATGGTTTGACTATCGCTCTCGTGGATTTGATGAAAACCGCGGACTGCGTATCGATCTGGTATTAGCCAGTCAGGCCTTAGCTCCCCACTGCATCGCTACCGGTATTGATTACGATATTCGAGCCATGGAAAAACCTTCCGACCACGCTCCTATATGGGCTGAGTTTAAGCTCTGAACTCCTCAATAGACTAACTGCCTTTCCCTCCCGGGAAGGGCATACCTTTGCTGACCATCATGAAAATCATTAATGTTGTCGCTGCAATTATTGAGCATCAAGATCATATCTTAATCGCCCAACGGGATAACCAAAGCGATCTTGCTGGCTACTGGGAATTCCCGGGTGGCAAAATTGAAGCTAATGAAACACCTCAGCAGGCACTCTGCCGTGAGCTGTATGAGGAGTTGAATATTGAACAGGTAAAGGTCACCGATTACATCGCAACCAGCCACATTCGGCTGCCTGAACGCATAATACAGCTACAGGCATGGAAAGTTGTCGCCTATCAGGGAAATATCCAGCTACATTGCCATACTGACTATCGTTGGGTAACCCCCGATGAGGCTCGACATTACCTGCTGGCTCCGGCAGATATTCCCTTGTTAAACGCCTATTGTCTTACGTTGCTAAAAAAATAGGCTGAATTACTCTTCACTCTTTTTTTTGCTGGTTTTAGCGCGTTTCTTTT
TCGGCTTACTTTCCGCTTCGTCTACCGTTGTCTGTATGCCTTTAAACACCTTTGGCATGCGATTACGCTTAGCCTGATAGATCAGTTCCTGCAATGTGATGACCATCGGCTGCATAAAATCCTGATAGCGGCACTGCTTTTCACTAATTTGGGTCAAGGTAGATTCCCAGTGTGCTGTCATATCAGGCCGGGCAGCGGTTTCTGGTAAACAATGGATCAATGCACGTCCCGCTTCCGTAGCGGTAATCGCTCGTCCTTTTTTCTGCAAAAAGCGGCGCTTAAACAACAATTCAATAATACCTGCCCGCGTTGCTTCAGTACCAAGTCCATCCGTTGCTCGCAGGATCTTCTTTAAATCTTTATCCTGAACAAAACGGGCAATTCCCGTCATAGCTGAAAGCAATGATGCATCGGTAAAAGGTCGAGGTGGCTGAGTCTGTTTCTCAACCACTTCGCCATTTTCACACAATAACTCATCACCCTTAGCCACCACTGGTAACGGGGTACCTTCATTCTCTTCATCCCGTTCTTTACTGCCTAATAACACTCGCCAGCCGGCTTCTGCCAGAAAACGGGCTTTGGCAATGAATTTTCCACCGGCGATATCCAGCTCAATCACACACTTGCGGAAAATAGCATCCGGACAAAACTGCATCAGATACTGTCGAGAAACTAACTCATAAATTTTCTGTTCATCCGCTGACAAAGAGACTGATGAACTGCGGGCGGTAGGAATGATCGCATGGTGGGCATCCACTTTTTTGTCATCCCAGCAGCGATTACGTTTTTCAGTATCCATCACCGACAGGGGGAAAAGCTGCGGCTGATGAATTTCAATAGCAGCCAATACCGCATGACGATCGGCAAAATGCTCTTCCGGTAAATAGCGGCTATCGGAACGAGGATAAGTAATTAATTTATGCGTTTCATACAAGCGCTGACACAGATCCAACACTTGCTGGGCGCTTAATCCGTGACGTTTTGCCGCTTCAATTTGTAAGGCAGAAAGAGAGAATGGCAACGGTGCAATATCTGACTCTCGCTTATCATTGTATTCTGTCACATATGCTGGCTGGCCAGTGATGCGTTTAACCACATGCTCCGCCAGAGCACGATTTAATAACCGCCCTTCTTCATCCTGATGAGACTCACAGGATTCACTGGGTTGCCAAATCGCAGTAAAGCGTTCTTCAGCCGGGGTAACAATATGCGCTTTGACCTCAAAAAAATCTTTCGGTACAAAGTTCTCAATTTCTTCATCGCGGCGCACCACCAGCCCAAGCACCGGCGTTTGTACCCGACCTACGGATAACACACCATCATACCCAGCATTGCGCCCTAACAGCGTGTAAGCCCGGGTCATATTAATACCGTAGAGCCAGTCAGCGCGGGCGCGAGCCAAAGCAGAAACACACAAGGGAATAAATTCACGATTATAGCGAAGCTTATCTATCGCCCGGGTAACCGCTTGCGGGTTCAAGTCATTAATCAAACAGCGCTTAACCTGTTGACGTTTTTCCGAATCCAGCTCAAGAAAGTCGAGTACTTCATCGACCAATAGTTGCCCTTCTCTATCCGGGTCACCGGCATGTACCACTTCGCTAGCTTCGGTTAGCAACCGTTTGATCACATTAAGCTGTTTTGCGACCGACTCTCTCGGTTTTAATTGCCATTTTTCAGGAATAATCGGTAGGTCGGCCAGCGACCAACGCGCGTAGCGGGCATCATATTGATCCGGCTGAGCCTGTTCCAGCAAATGACCCACGCACCAGGTTACAACCTGATCGCTGCCACAAACGATATAGCCGTCCTGACGCTTATGCGGTTTAGGTAATACATCTGCAATAGCCCGTGCCAAACTCGGTTTTTCGGCAATAAACAAACGCAT
Protein sequences of DBSCAN-SWA_4 >LR134531|2413794:2423607|2417530_2417989_+|VEJ55810.1|DBSCAN-SWA MFNPCPCDSLKEFSLCCQPLILGQTIALTTLDLMKSRYSAYVKHDVNYLISTWHPDYRHPELAESLTHSFENTEWLGLTIISTADGLNEGYVEFIAKFRDTQTQEHHAIHERSHFIKQKDSWYYTSGVKPSFGRNELCPCHSGKKYKKCCGK >LR134531|2413794:2423607|2413794_2415138_-|VEJ55807.1|DBSCAN-SWA MKVTVFGIGYVGLVQAAVLAEVGHDVMCVDIDETKVANLKKGIIPIFEPGLTPLVKENYEAGRLIFTTDAKQGVAHGEIQFIAVGTPPDEDGSADLKYVTAVARTIAEHMTSYKVIIDKSTVPVGTADKVHEVVSETLKKRQSELSFDVVSNPEFLKEGAAVSDCMKPERIVIGTDNEQVLDLIRELYSPFNRNHDRMVIMDIRSAELTKYAANCMLATKISFMNEMSNLAEMLGADIEKVRQGIGSDSRIGYHFIYPGCGYGGSCFPKDVQALIRTAQHIGYQPELLQAVEAVNARQKHKLFSFVQRHFNGDLKGRTFALWGLAFKPNTDDMRESSSRVLMEALWQAGAKVQAFDPEAMEEAQRIYGNRDDLLLMGTKEATLHNADALIICTEWQNFRAPDFDLIKATLKHPVVFDGRNLYEPERMNQRGFTYYGIGRGASVNPVI >LR134531|2413794:2423607|2421675_2423607_-|VEJ55815.1|DBSCAN-SWA MRLFIAEKPSLARAIADVLPKPHKRQDGYIVCGSDQVVTWCVGHLLEQAQPDQYDARYARWSLADLPIIPEKWQLKPRESVAKQLNVIKRLLTEASEVVHAGDPDREGQLLVDEVLDFLELDSEKRQQVKRCLINDLNPQAVTRAIDKLRYNREFIPLCVSALARARADWLYGINMTRAYTLLGRNAGYDGVLSVGRVQTPVLGLVVRRDEEIENFVPKDFFEVKAHIVTPAEERFTAIWQPSESCESHQDEEGRLLNRALAEHVVKRITGQPAYVTEYNDKRESDIAPLPFSLSALQIEAAKRHGLSAQQVLDLCQRLYETHKLITYPRSDSRYLPEEHFADRHAVLAAIEIHQPQLFPLSVMDTEKRNRCWDDKKVDAHHAIIPTARSSSVSLSADEQKIYELVSRQYLMQFCPDAIFRKCVIELDIAGGKFIAKARFLAEAGWRVLLGSKERDEENEGTPLPVVAKGDELLCENGEVVEKQTQPPRPFTDASLLSAMTGIARFVQDKDLKKILRATDGLGTEATRAGIIELLFKRRFLQKKGRAITATEAGRALIHCLPETAARPDMTAHWESTLTQISEKQCRYQDFMQPMVITLQELIYQAKRNRMPKVFKGIQTTVDEAESKPKKKRAKTSKKKSEE >LR134531|2413794:2423607|2421267_2421669_+|VEJ55814.1|DBSCAN-SWA MKIINVVAAIIEHQDHILIAQRDNQSDLAGYWEFPGGKIEANETPQQALCRELYEELNIEQVKVTDYIATSHIRLPERIIQLQAWKVVAYQGNIQLHCHTDYRWVTPDEARHYLLAPADIPLLNAYCLTLLKK >LR134531|2413794:2423607|2419715_2420288_+|VEJ55812.1|DBSCAN-SWA MRKNICVYCGASTGNNPVYAEAAQQLGKAIAQQGRRLIYGGGNKGLMGILANSVLEAGGEVTGVIPELLVEAETAHHGLTKLEIVADMHIRKARMSELADGFIALPGGIGTFEELFEVWTWSQIGYHKNPVGLLNIQDFYSPMSQFLQHVVDEGFVRQSYLNTLLVSDSAHQLLEQFDQYQPHNLDRWAK >LR134531|2413794:2423607|2418105_2418939_+|VEJ55811.1|DBSCAN-SWA 
MQKKILRTICPDAKGLIAKITNICYKHQLNIVQNNEFVDHRTGHFFMRTELEGYFNDQTLLADLDGALPEGSIRELNEAGKQKIVILVTKEAHCLGDLLMKSTYGGLDVEIAAVIGNHDTLRTLVERFDIPFHLISHEGLTRDEHDKLVVEAIDEIAPDYVVLAKYMRILTPDFVRHYPNRIINIHHSFLPAFIGARPYHQAYERGVKIIGATAHFVNDNLDEGPIIMQDVIHVDHTYSAEDMMRAGRDVEKNVLSHGLHHVLTQRVFVYGNRTVIL >LR134531|2413794:2423607|2416316_2417327_-|VEJ55809.1|DBSCAN-SWA MTPPLVNKHILLVEDDAVFRSMLAGYLTSLGAVIGEAEDGEQALVYVEQFQPELMICDLNMPNMDGITLVDRLRHQGCQIPVIVISATEKITDVDKMLRLGVHDVLLKPISDLNHLREALLACFYPYLFSSPASDEIAMLEDWADLSEKPQEALHLLQQLQPPAQQVVSGCNINYRQLTLVDRPGIVFDIAALSNDDFAFYCLDISKVGNKGVLAALMLRALFNNLLQEHLAEQNEALPQMSTVLNRINKLLHQSYLEGQFPFLAGYYHVPKQKLLVVSAGLHISIDANDRKYELNDSVPMGTLQTLYPSQTSLVAERWSCRIWGNGGRLQLNVTP >LR134531|2413794:2423607|2415164_2416019_-|VEJ55808.1|DBSCAN-SWA MLPATKAIPKEMLPLVDKPLIQYVVNECIAAGIDHIVLVTHSSKNSIENHFDTSFELEAILEKRVKRQLLDEVQAICPSNVTIMHVRQGLAKGLGHAIMCAYPLVGHEPFAVVLPDVIIDEYDSNPKKDNLSEMLSRYEETGHSQIMVEPVPINEVSDYGIVDCDGYALTAGESAPIIKVVEKPSVTEAPSNLSIVGRYVLSADIWPLLSKIPPGAGDEIQLTDAIALLMESQPVDAYHIRGRSHDCGNKMGYMQAFVEYGLRHASLGSEFHSWLQQLQLSAKN >LR134531|2413794:2423607|2420401_2421208_+|VEJ55813.1|DBSCAN-SWA MKFISFNINGLRARPHQLAAIIEQHQPDVIGLQETKVHDDMFPLEEVSQHGYHVFYHGQKGHYGVALLTKAEPLAVRKGFPTDDDEAQRRIIMADLATPMGTLTVINGYFPQGESRDHATKFPAKQKFYADLQSYLEQQLDPANPVVIMGDMNISPTDNDIGIGEDSRKRWLRTGKCSFLPEEREWMAKLLDWGLVDTFRHAHQETTDRYSWFDYRSRGFDENRGLRIDLVLASQALAPHCIATGIDYDIRAMEKPSDHAPIWAEFKL |
9 | Bacillus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage region
Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
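The protein records in the prophage rows use pipe-delimited FASTA-style headers such as `>LR134531|2413794:2423607|2417530_2417989_+|VEJ55810.1|DBSCAN-SWA`. The sketch below parses one header into named fields. The field meanings (contig, region start:end, gene start_end_strand, protein accession, optional keyword tag, prediction tool) are inferred from the records on this page and are not a documented format; `ProteinHeader` and `parse_protein_header` are hypothetical names.

```python
from typing import NamedTuple, Optional


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    protein_id: str
    keyword: Optional[str]  # e.g. "tail"; absent in most records
    tool: str


def parse_protein_header(header):
    """Parse one '>'-prefixed, pipe-delimited protein header.

    Assumed layout (inferred, not documented):
    contig|start:end|geneStart_geneEnd_strand|protein_id[|keyword]|tool
    """
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected field count: {len(fields)}")
    contig, region, gene, protein_id = fields[:4]
    keyword = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(gene_start), int(gene_end), strand,
                         protein_id, keyword, fields[-1])
```

For example, parsing `>LR134531|1349048:1369172|1359244_1362211_+|VEJ54635.1|tail|DBSCAN-SWA` yields `keyword == "tail"`, whereas headers without the extra field yield `keyword is None`.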
DBSCAN-SWA_5 |
2764750 : 2786211
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >LR134531|2764750:2786211|DBSCAN-SWA GCTAAATGCCTAAATAAATTGCAGGCGTCATTCCCATATTTAAAGGGCGATTTTCTTCAGCTGTTGGAACAGTAAATGATGCATCAAATGTTAATTCAGAAACTTGCGTATAACCAGACGTACCATTCTGATGACCTTTTATCGCTTTTGTCGTTTTAAATGCACCAGAACTGATAACATTACTCCCTCCTCGGCCATCACTGACATTATTAATAACACCATTAATATTCCGAATAGCATCATCCTGAATATGACCAACCTGTCTTGAAATGCCATCAACCGATCTCAAAAATACACCTCGCCCATCTGAATGAAAGAGATTAGGTAAATTAATGGTATTATTTATTACTTTAATACCCCAGTCAGTCTTAAATGATATAGATAAAGAGTTAAGCGCCTTCCCTTGAACAGATGTTAACGAATACTTATCACCATTAGTAAAATACCAACCAACAGGTAATTCATTCTTTCTAAAAGGCAATAATTGAATAGAACCAACTGTTGTTTTCTCTATATGGCGGCTAATATTATCCGTTGTAATAATTTCCTTCCAAACTAGACGACTATCGGTATTCATTTTATAACCATACCAAAGTCTAGAACTCTGATAAGAATAGAACATACAACCAAATCCCTGCCCAATATCTCTATTACATATCTTTTCAATATAGCCATAAGATATCGGCAATTGATACCCACCCGATTGAACACACATAAACTTTGCCCCAATAGGCAAAGATAGAATATCATCATCTGATGAAATAGATTTCATTGGTGAGCCATTGGTTCTACCAATATTAGTTGAATTATCAAGCATTGAATTAGTATTAATACCATGATTAAACGTTACCTGACCACTTTGCCAATTTAGTCTAATCGGCCTTAAGGAATTCCAGGTGCCATGGGGATCGTTATTATTAGTAAACAATATATAATAATCATAACCATCAAACCTTGAAATTGCCGCTCGATTCTTAACCATAATTCGGTGAGCATCTAAGTTAGTTGTTTTAAGCTCGCCAGACATGGTATCACCAGATTTACTTACTGCTCCAACATCACTAGCGTTTAATACCAAACTCCCCGTTTTACCATTAACACTATTTACCGGAATAAGTTTTTTAATCTGTTCAATGGTTAACGCAGTTTCTTTCCACACGGTATTAGCTTTTGGTTCTATTCCTTTACTATTTCGTTCTGCTAGCCAAACATCTTTGTTATAAGAAACTAGCGCCCCAGCGGGATAGTCTTCCGTTTTTGACCATTCGGCAATACCTCGTTGTAACAGGTAACGAATGGCTTCATCGGTACGTTGGCCTAAGGCATTAAACCACTCCATCGGTGGAATACCGCCGGTTTTATCAAAAGCCACGCCCCAGCCGCGAGAAATATCGGGGAATGGTTTTACTTCATCTTGCTTTGATGATTCAGCAAGAATTTTTTCATCTGGCCGCTGATAGATAGTCATAATTTTTATCCATATATTTTTATTTAATAAGTACGCAAAAAATCCGACGAATAATCGGTATTAATATATCTGTAAGTTAATCTTATTAACTTCATTTATAGAAAGAAGGAGGTTAAGGAAAAACTGAAGCTATGTTTTTACTGAGTTTTAGGTGCCGCTACAACTGACGCACCGTCAAATGGGAATTTGTTCGCACTGAATTGGGATTGTACTTCAGCGTAAAATTGAAAATACATGCCTCCCGGCATAATCTGTTCTAAATAAACAGCATCTTCATTTTTTACTTTCTGAATAAGTTCATTGCTTTCGGTTTTAAAGTCATATGATTTCCAGGTTTCTGGATTTCGGTTATATTTATTTTCTTCAAGAGAAACGATATCACAAGTAAATCCGATTTTTGCATTGCCATTACTCTTCTCGTCTAAATCATAGGTTTTTCTTGCCTGTTCACCGCATCG
TAATCTGACCAAATAATCAGGAAATTCCGTGCGAAAAAGTTTAACACCACTCCAATGAAGATAAGGTAGTGTATATTCAGAAGTGATATCACTAGAAGAACAGTAGCTATAATAACCAGAAGAACCATCTTTTCTTGCAATTAAATTCCCTCTTCCATCCGAAACTTTAATCGTCATTTTACAATTATTATAACCAACAATCCTGGATCCATAGTTTTCCTGAGCCCTCCTGATTGCCTGAATATCGCAATCCATTTGAGTATCAACCCCCTTATTAGGAATATCATCAGTTCCTGAACCACTGCAATATTGGGACCATAATGATATATTCTGTTTTCTTTCACTGAGATCAATATCCGTCCCATTGCCACATCCTGAAATCTCAAAATAAATAGGATATACCTCAGCGGAAGATAGATAAGGTATCATCTTACCCTCAACACATATTGATGGTTCTTCTTCATCCCAATATTCTGGTGAATCAAAGTAAGAGTTCTTATCCACCTGTAATATAAAAGGATAGTCATTACGCATTTTTGTACATGTCCATTCGGACTCATCAGTACCACTAACTTCACTCTGCCAGTAGAGATAAAGACAAAAACCCTTTTTAGCCTGCCAAATATGGCGGCCAGACGTTAATGTTTTTAACAATGGAATATCAAAGACAACGTGTAAGTCCTCGTCTTTATCTGATCGATGCTCAACAATATTCTCTGAATTACTTTGAGTTGCTTCCTCTTTTACCCATTGTCCGGCACATCTTACGTAACGTTCGCCATCTTTTGGCGCTTCGATAATAAAATTGGCTGAATCAATCGTTAAAGCGCTTTCCCTCCATACGGTATTCATTTTTGGTTCTATGCCTTTATTGTTTTTTTCCGCCAGCCATATCCTTTTATCATAAGAAACAACGGCGCCCACTGGGTAGTCTTCCGTTTTTGACCATTCAGCAATTCCACGCTGTAATAGGTAACGAATAGCCTCATCGGTACGTTTACCTAAAGCATTAAACCACTCCATCGGAGGAATACCGCCGGTTTTATCAAAAACGATTCCCCAACCGCGAGAAATATCGGGGAAGTCTTTTACTTCCCCCTGTTTTGATGACTCAGCGAGGATTTTTTCATCTGGACGTTGATAGATTGTCATTGAGTGTTACTCATAAAAAATAGCCAGCGAATAAATCAATAGCCGCTAACTACTGTAATTAATGAAATATAAAATTTATTTGGGAATAATTTATTGAGGCTGAATAGGTCTAAATGAAGGATCGGGATAATTTATATTTTGCTCAAAACGCCATTTCCTTAATGCCTTTCGATATTCTATCCAAGCTGCTCTTGTAGCTATTGCGGAATCATCGTCATCATCATGTTTTAATAATTCATCAGCAATTACGCGCATTTGCTCAGAAATAAAAGCGTCCTCAATGGCCGGATCGCCCTTATGCCATAAACCATCTAATTTGGCATGCCAATCTCCGTCTGCTGGGCGCTGAGATAACATCTTAATGTAACCGTCAAATGGCAGGGAGTCGCCTGTACTGCCATCAATCAGTTGCTCTGAAGTATTTAATTTTGCATATACTTTCATTATGTTACCTTCCAAACTTTAACACGCAGCGGAGCTGAAGTTATAGATGCTGAAGCATGGGGAGTCCCCGTATTATGTGCCGCATTTAGAACTTCTTTATTCCCTGTCTGTACAATGATTTTGTCATCATCATATTGATGCGCTACCGTGCCATAGCCTCCAGGGCCACCTGGAGAATAAATAAATCCAGTTGGTCCCCATTTATCCTGAAATTTAACTTCTGCAATACACATAACACAATGACCGGGAAATGGATTATCAACAACTAATCGGGTATTTACACCAATTGTCATCGGATTCTGTGCCGTGCCATTAGGATAAAGAATAGTAAAATTATTAGTAACGGTAGTCTCTTTCCAGGTTAATTTGCCTTTATAATATATACCAACCCAAGATCTGACATT
ATCAGCCTGAGCGGCATGCTGATGAACAATAAAGAAAGGGACTTGATTACCCGCCTTTTGCACAATGACATAAAGCACGCTATTTTTTGTATTCGGCGGTAAATTTTTAGTAGGTGTTGCAGTACCTGCGCCAAGTGCCCGATAAAAACCTGATGGCAAATTGTCGGTAAAATCTGAAATTAATGGAGCTAACTCACTACCAAGACCATAATCACCTTTCATGATTACCGGATCATTTGATGATATTGTTTTGGATGGCTCAACATAAACAATCTTTTCAGGTTCAATATCTTGCCAAATAAGTTTATCGCAAACGCTGATATAAGATGGATGATATCCACTTAATCTAGTTAAGCCTAAAAAACCACCGTAAGATGTGCTTTTCAACCAAAGTTCTACCGTATTATCCACGGCATTGAATACCCAGCCATAACGGTTTTGATGATCTGGATGAAATGATGGGGTAATAATTTGGTGAATAATAGTCGCTTTATTACTCTCTGTACGGCAGCTAAACGATACATACTCTAAATTATTATTTGGGCTACCAAAGTTATTAGCGCCAGAAACTAAAAATGAAACATTTCCTATATCAATCCTATTTATTCTACCTGTTGCTATTTTTGCATATCTAACTTGCTTACTAGTGGAATCAGCAACTAATTGATTAGCAACTTGATTAACCGTCGGGATATTAACGCCATTTTTTGTTACGTTACAATTTTCGAAATTCCAAACACCTGAAATTTTACTTGCATCAGGAGTAAACCGAAATCTAGTTGTTGTCCCGCCGGCTCCATTATCTGTTGCTATACCAAATGTATGATTACAAAATAAATCGGCCTTAAATTTATTTTGATGATTCGTGACTAAACGTACCATTGCAAGAGCACTTTTTTGAACGTGTAGATCACCATCCAAAACTCCACCTAACTTATCTAACGCCCCAATATCCCCCGCATTTAACACCACATTCCCCATTTTTCCATTCACACTATTAACAGGAATTAACTTACGGATCTGTTCAATGGTTAATGCAGTTTCCTTCCATAACGAATTGGTTTTCGGTTCTACACCTTTGCTGCTTTGCTCCGCCAGCCATATCCCTTGATTATGAGAAACGAGCGCTCCTACCGGGTAATCTTCTGTTTTTGACCATTCCGCAATACCTCGTTGTAATAGGTAACGAACAGCCTCATCGGTACGTTTACCTAAGGCATTAAACCACTCCATTGGTGGAATACCACCAGTTTTATCAAAAGCTACACCCCAACCGCGAGAAATATCGGGAAAGTCTTTTACTTCCCCCTGCTTTGATGACTCAGCAAGGATTTTTTCATCTGGACGTTGATAGATTGTCATATTAATGTATGCCTACAAAAAGTCTGTTTATACATTCGCATAAACAAACGGTTACCTGAGTAATAAAAAATGAATTGGAAATAATTTATTGGCTGAGAATAATTATAAATTTCTAATTGAAATAATATATTAAGAAAATTTAAAATCTAAATACTATCCTTTGATTAAGGAGGATTTATAAGAACACTATCATTTTCCTACCTATCTAAAATAGCTTAGGATAGCTTAGTGACACGAACTCGATAAGGCGCTGATAAGACATTTAGTGCCCTGTTCCAAATATTTTTAATAGCTACGCCCCCAATAACAACTGCATGCCCACCGGTTATAACAACAATTTTATTATCACTTCTTGGATAAGCATCGACCCCACCACTATAATCTCGGCCAGAATACCAATAACATGTTGAGCCAGATGGTCCCCAATAATAGTTACCGTCATCTGCCTGTATGTACAATTCAACAATACAATGTATATTCATGCTCTTATAAGGATTATCTATAATTATCCTCTGATTAGTTAAAAGCGTTGAGGGATTAGCTTCCGTTCCTTCTGGATATAGAACCGTAGTTTTCGGAAGATAATTAGCAATATTAGCCGTAGTTATTGCCTCTACCGGTTTCCCCCAACCGTTTAA
AGCATATTCTTGAAAAAATACCGCCGGTGTTTTACTTGCTACACCATAGGGATCTGCAGAAAATAATATTCTATAAGAATATGGTAGGCTGCTTGTCCTATAAGGAATAGTTAATCCTGCGCATCCCAACATACCTGATCCTTTCATTGATTCAGGTAATTCGTTTGGTGGCGAAGCTTGAAATGCACCATTTTGCTTTGTTATATCATCAGTAAACTTGCAATCAGCAACAAGATTTTTGGAGCCAAGGCCGTATTCACCATCAACTAATAAGTTTTTGCCATGTTTAGTGACGTTACAGCTTTCAAAATTCCATGTATTCTTATCAGAAATCCATCTTAATCGGGTTTTATCACCATCCTGACCAGTAATATAGTGAATAAATGTATCATCACCACTAAAAGCATTTCTATATACACGATTTTTCTTTTTAAATGTAAACGTTGGTGCAATATCTTTATCTATAATTACCTCACCGGAAAAGGTCCCTCCCGATGATAAAATAATACTTTCCCAATTACTCCATATCCAAGAACCAGTACTTTTATTACCATGCTTTATATAGTGAGTCCCCTCATGAAATAAAATTAAAGTACACGCACAATTAGCATCATACAGTCGACGTTTGACGGTTAAAGTAGCCCCCCACACTTTATTATCATCCGGAGTTCCAATATATTTTCCTGATAAAGTGAATTCATTATTTGACGATGTCACCAAATCGCTAGAAAAATTAGTACCGGCAGGCAATAAAGGTGATCTTGTACCTAAACCAAAGTCTCCTCGTTTTAACGCCCCAACATCCCCCGCATTTAACACCACATTCCCCATTTTTCCATTCACACTATTAACAGGAATTAACTTACGGATCTGTTCAATGGTTAATGCAGTTTCCTTCCATAACGAATTGGTTTTCGGTTCTACACCTTTGCTGCTTTGCTCCGCCAGCCATATCCCTTGATTATGAGAAACGAGCGCTCCTACCGGGTAATCTTCTGTTTTTGACCATTCCGCAATACCTCGTTGTAATAGGTAACGAACAGCCTCATCGGTACGTTTACCTAAGGCATTAAACCACTCCATTGGTGGAATACCGCCGGTTTTATCGAAAGCCACGCCCCAGCCGCGAGAAATATCGGGAAAGTCTTTTACTTCCCCCTGCTTTGATGACTCAGCGAGGATTTTTTCATCTGGACGTTGGTAGATTGTCATGAGTTATATCCATATTCACTTGATTTTTTTAAATAAAAAAGCCGCGATAATTTTTCTATGATTATCGCGGCTTTGATTAAGCAGACTATTCTGGTTGTTCTGGCCATTGAATATTTGGTGCCATCTCAATATCAACGCGGTTGAGTAAAATACGATATTTACGCCAGGTCGTTAATCTGTCCCGATCGTCATCAAGTTCAATACCAAACTCAATGGCATCAGATAAACGCACGATTTCATCTTTGGCTTGCTCATCCCGCTGCTGTTTTTCTATTTTTGCAGTATCAACTTCAGCCTGATGCTCAGCGTTCTTATCTGTTACCCACCGATCATCATGCCATTTATCATAAATCGTCATCGGTTTTAATAGTGTCAGCTCTGCCGGAATAGGCCCTAACTCAGACATAATCTGCTCAGCACCGTTATGTTTTACATAAGCTGTTTCCCCCCGATGGTCTTCAATATATTGCCATTCACCATCTCTGCGACAAACAGCAAACCCAGCTCGACTCACAAAAGGCTCATCAAGATAACTGTATTGAGGCAATCCTATACCGGCAACTAAAAGCTCATCGGTTGAATTTAAATATTCGCCAGTTATTGAATCAACGTTATAAACGCGAATGATCCCTTCATTATTGGCTAATCCATACTTATCAAATTTAATATTATTCATTATGCCGCCCTGACTATATATAAAAATGCAATATTACGGGGACGGGTTTCTTCTGCTGTTCTCACCTCTCGGGATGAATCAAACATGACATTTGTCCCCCAACCATCACCAG
AACCACTTTTAATGCTGGTACTCCAGCGGCTATGTAAATAAGCACTTCCGCCAATCCAGTCATAATTCCTGTCGATAGAACGAAAGTACCCTGTGATTTTTTGCATCGCATCATTCTGATCCGTCAATATCCCTCGCGCCTCATCAACACCACGCCCATCATCCCAGCCTCGAATAAACTCACCGCGTAAATCAGGCAAAGCACCAGCGGGGTAAGCTTTCGCTAAAAGAGGAAAACGATTTTTATCAAAGGATGCGCCATTACACTTTAACCAGCCGGAAGGTGGATTTATATTCGGCCAAGGCATGGGGACGCCCACCGGGGTTGTAATCGCAAAGCTATTCACATCGGTGATAACCAATTGCCAGTTAGACCAGCTATTGCCTCGCTTAATCCGAGAAAATTTTTCACCGCTATTGGCTACCGTTAGGTCCTGATAAACCGTATTTACCCCGACATGAACCAGAATATAGCTATCGTTTAGTTTGGGGATATCTTTCACCGTACTGGCGACAAAATAGAACCCTGTAGTAAGCAAGTTATTTGCCGTTGTGACATTGGTATACTGGGTCTGCTGATTGCCTAAACCAAATGCCCCAACTTGCATCAGTTGATTATTGGCGACACCAACATCACGCAGTGCTGCGGCTCCTAGTCTCAGGTTCTTACGCACAATCACCGGGTCAGCATCAGCCAGATTTTTGCTTCGTTCAAAACTGCCCACATCCTGAGCATTAAGAACCACATGATTTAGCTTACCATTCACACTTTTGACTGGCACATCCGGAATTAACCGTTTAATTTCTTCAATGGTTAATGCGGTTTGTATCCAGGCGCTGCTCATTAAGGGCTGTTCATTGATATTCGCTCTAACAGCAACCCACATTTTATTTTTGTGCTGAACATAAGCATTAAGGGGATAGTCTTCCGTTGCTGACCAGTCTGATATCCCTTTTTGTAGCAGGTAACGAATTGACTCATCCGTCCTTTTACCTAACCCATTAAACCATTCCATTGGTGGGATGCCATCTGATTGTTCAAAAGCAACTCCCCATCCTCGTTCAACATCAGGAAATGACTCAACCTCGCCTACTTTGGCTGATTGAGCAAAAACCTTCTCATCCGGTCTTGTATATTGACTCATAGTATTCTCGCAAATTGTCCTTCATTAAAAGCATACGCACCAACGCCTCCCAATAGGCCAAAGGGCTTATTCGAGATAGCTGTATAGAAATAGATATTGACGCCAGCGGGTCTGGGTAAAATATCCAGCTGGGTGATGGCATAACGCTTAAATGTTGAAATATCATCGGATTTAATCACAACAGAAAGCGACATATCATATTGGTCGTAGGCGATCGATTCAGAATTAAAAATAAAACTTAATAGTTCAGTAATATTTTCAATAGTGCCGGTCATATAATTTTTAGCGATCCGACAACGAATTAAAAAACGATAGTCTTCATCATCCAGTATTGCTGACTCGGCTAAAATACTGCCTTTTTTATACCACCTGCCGCCTGACTGCCTTTTTTGACTAAAGCCCTTACTATTTAAAGAAGCCCAAAAGCCAAATAGCTCTTTTGACATCGCATGATTTAATATGCGTGACTGTCCAACGTGTTTCCCAATCAAATCAAGATTTTTACCCTCTGCGCGATCAATATTGAGTGCTTCAACCAAGCTAATTAAACCCTGAGCCGTTTTTTGATATTGTTCGCTAACCAGCTCAATGGTTGCTTTAGCCTTTGGCTTATCTTTATATTGCCAAATCAGTAACTCACTATATTTCATCGCACAATCACCTCAATATCATCAATATGAATGTTGGCAATTTCACGAATATTAATATCAATATTCTGCGCTTTTAGAGGTTCGCCCTTACGGGCAATAAATAACTTATCAACCCAAAAACCCTGAGTCTGATTAATCGGCGTATATAAGCGTGAAATATTCACTGACTGACCAATAGAGAATGTCATTTGACTAATCGCCACTTT
AATTGCATCAATATCAATGCTGGTAAAATCTTGATTACGGCACAGTTCTAAATAAACTTGGCAGTTTGCCCACTTAGGCCGATCAAAGCAGATATCTCTGGCAATATTTTGTGTATCACGTACTACAACGGATGTTTGACCGAATAATCCGGTTCCTGCAGTTTTTCTATTAAATACGGTTTCTGCAATCAAATAATCATCGCCACCGTCAATAATGACATTGATAGAATGCGCCGGAACCTGATTTTTATCCGGTTCACTGGTCGAGTTTTCCAGAATCATCACTTCTTTAACGTCAGGTAATTGATTTAAGCTGGCCACTAAACCATCAACGTTATTTTTCGCCTGTTTAGCCCGTGAACGAAAAAAGCGTTTTCTAAACACCGGATCGCTTTCTTCTTCCTGACCAATCTCCGCAGCATAACTGCTCAATGCATATCGCCAGCCTAGAACAACGATTTCAATTTTTAGTTCTGTTTTTTCGCTAACCGGGTAGCTGCCTAAATATTCACTGCGAAAATCGGCCACTGCCGACCCCTGAGCATTTAGCGTGACATCATTGACCAGCAACCAGCGATTTTTATTTGGATCGCTAACAACAGCGCCTGAATAAATTTTGGTAAACGGATCGCCAGTAAGTACCACATTACGTAAGTAGCTATAGCTTGCTTTTCGTCTAATTAACCCTGCGTAGGCAGCCCTTTGTTCTAGCCACACGCCTGTTGCATGATCCGGATCCAGAGCTTGATAAATCACTTCTGCCAGTTCTTCTAAATCAGCTTTAATCTGTCCAATTAAGCCAATCATTTGACCATCTGGAGAATCAGGATCCAGATTAATATCATTACCATAAATACGTTTGAAACCGGACTGCAGCTCTTCAATTATTGTGTTTAGCCTTTCAGGAACATAGCCTGTATGTGTAAGTTGTCCCATATTTCACCTATAAAAAAACCTGCTTCCGCAGGTTTAAGTTTTTACTATTAATTGTTGGTCATATTGGTCGGTTAAATAAACGGACACCGTCATTTGCCGACTATCGGGATCTAAGCGCATGGTGAACTCGTCTAATGATTTAACCCCTTCCGTTTCCAATATCTGTAATTTAATTTCTCGTTCGAGTCGGGATAAATCAAAGCTTCGTTCAAAGTGCGGCAACCAGGGAATACCATGTTCCAGATCGTGTACCCAATCACCTTTAAAGGACAGTAATCGGGTCTTGACTCGCTGTGCAATAGATTCGCTATCCTGCGCATAATTAAAACGTCCCAAACCAAATGACCAATCATGGTTTTTATCTAATCGTCTAACTCGCATTCGGTGCTCCTGTTGTTCCTCCTGAATCGCCAGGGTGACTGTGATTGGTTAAACTCACTCCATTGGCAATCACATCACCTGCAACTTTAACCGTGCCAGAAATACTGGCCGATGCGCCGCCACTCCCTCCTGAACCAGCCATTCCGCCCTGATATGTCAAAAGTTGCTCAACAAACAGAGGACATTTGATAATCATCTGCTGGCCATCGACCTCAATGCTGCCCGAGTCATCAATTTTGACATAAGCTGAATTATCCACTTTACGTATCGCAATACAGTCCATATCAAACGCCGGTATCGCTTTTGGTACTGAACTAATCCCGGGAATAAATGACGCATCAGATAAATCAAATAAACGATAATCTAAAGGTTTACTGTGATCCCCTGATTGAAACCAACCATCAATACAGCGTTCATTAATAATCACCTGCCCTTCATCACCAGGTTTAACTGGAAACGTTAAAGCAAATTCACCTGCTCGAGGAAACTGAACGGGAACATCTAGCAGTACCGGTAAAGAAATAACCGTATCATCAATGAGAACTTGATCGATCATCGGTTGAATTTGGGCTGTTTGTAACGCTGGGTTAAATGCAACAACTTTGCCAGGAAAAGCGGTGTGTATTTGTAATAACGTTGCCTTGATTGTGCTGGCAATCGCATTATCCTGTGAGGGCATACTG
GTTGATAGCTCACTCATTCTTTTCTTCCCTTTTATTGATTAACCGTAATGTGGTTTTCCAGTCACCGCCAATGGCATCCCCCTGATGCAATACACTAATCACTTTATAATCACCGTTATGAATACTTTCGATGGATTCAATACGGACCTTGCTCCCCACCTGTATTTGCGGATTAAGTAAACAACAGACTTCAAGATCTTTATCTGACGTTGCCGGTGAGCCAATCATGCCGGTATCTTGTGATAACACCGTAATTTCTTTATTCAGAAAACGATCGTTAGGTAAGAAAACCAACTCTCCATCCTGAATCGACCAATCGGCATTATGCTTTATCGCAATATCACTCAGTAAATCACGTGAGGCTCCATGCAAAACCTTACCGCGCGGCAAGCTTAACTCGGATTGAATATCAATCGAGCCGGGTTGGGTATTTTTCATTGAGTCTGACAGCGCCTGAATAATATGCTGATCGGTACACCCTGATGCTAACGATCTTGAAATAAATGAATTTCTATAGTCAACATCGCCGTCAGCACATTCCAATTCAATAAGAATATCAAGCCCATCACGTACAACTTTCGTAGTCTTAATGTTGCCTGCATAAATCATGTTCAGCCGATCGTAGCCGACAAATAATTTGAGATATTTAAATTGTAATGAAATTAATTGCTCACGATGGCTTTGATTAAGATTCCAAATTTTAATCTGGGCCGGATTAGGTTTTTTATCAATGGTTTTACTGATAGAAAATTCTACGCGTAAGTTATTAATGGAGATCCCATCATTTTCATCACCAATATCAAGCCGAAACTGGCGATTAAATTGCCTCATCCAATACCTCTGATTTTAAACCAACGTGCAATACACAGCGAGAATCCAGATCGTCCATAAAAATAGGATCTAAACCCAATTGCTCCTCATCAGTTAAATAAAAATAGTATTCAGTCGGTGAGCGCCAAAGAATTGGGGTACCCACCACCAATGCAGCCCCTTGTACAATAACCTTTTGCTTTTTCACATCATAAACATCCATAAACCATTGGGTACCAATACTGTTATAGCGTAAAGTCACCCGTAATTTACAGTCTCTAATCGTGAAGTGAAATTCTTGATAAGGCATTGATTTTAATTTTATTTTATACATATCAGCCTCTAATTGAAAACTACTTAGGAGCAACAGAACCCACTGACGTCTTTGCACAGCCTTGACTGGCAGCCCGTCCTGATAATTGCTTTTCATTTTTTAGAATAATACCAACGCCATTTATTACTTTTGATTGCACAATAAATACCTCTCTGGCAGTAATTGAAAAATCAGCGAATCCTTCAGAACTCTGACTAACGGATAAAGATTCAATTAACATATTTTTATAGTGCTTAATGCCGGTTGTAATCTCTATTGTCTGACCTAGCTTTTGTAACGCCAGAAAGTCATCATAAATACGTTCAATACGATTACTTTTCACTGAGCGATCTTTCAGCCAAGCTTGAGAGTTATCCGGCGACCATGGCGTAATGGCATGTTTTGCTTCTTTGGCGCTATCGTTATTTAAACTGGCATTTTTCTTGATCCTTTTTTGACACAGTTTTTCTGCATAATCAGTTCTATTTTTTAATCCCGGAATAGAAACGTCATTCACAAAATCCTTATTACCGCGAATGGTTGGCGAAGCAGTATTAGAAAAAGCCTCCTGCTCACTGTCATAATCAACAACAAGGCCAGAGATACTAATTGATATTGGTGTCACCATAGAATGGTCAGTAACCGATGCACCAAATTCAACCGGATTTTCGGTAATTTTCAGCTGTGAACCATGTTGTTCTGACTTGGTCACATCTAACGTTAAATGCCCCACTCTACGGTTATACAAGCTCACCCGTTGCTCTTGATTAGAAATAGAAGCGCGGGATTTTCCTCGTTTATTTGATATTGGCATGGTTTATACCTTTAGTTGTGCCCGGTTATCTTCATAGGTTGACACATCCCTTTGTTTTAATTT
TTTATCAATAATTTCAGCAACTAAGTGGGGATTATCCGATGTGATATTCATATGAACCGTCTGCTCACTATGATTATTAACCGCAATATCTCTATCTTTAGCACTTCTTATTTGATGGGAAGCCCTTGTTCGAGCGGAATCAACGGCTTTGGGAACTTCACTAACCGTCTTCAGCGTTTTATCGACAGTGGCACTAATATCATCATCGGAACCAAAGCCAAAGAAGCTACCGATAGCAGAAAAAATACTGGTTACTGCCGAAAACTTATCTTTTATCCAGCCGATGACCGCATCAATACCACTTAAGACCCAATCCCATGCGGTCTCAAAAGCATCGCTAATAATGTCAGGAATATCCGCACATGCGTCTTGAATAAATGCGATAAAATCTTTAAATGCCTGTAATGGATTGGAAACAATGCGCCAAATTAAACTGAAATAACCAACAATAAGGTTTTTAAAAAAATCAAACAGCCCCATTACAAACTGTTTAACCGCATTAAACCCCTTGACTAAGGGCTCCCAAAATCCTGCCAGACTTGACTCACCGCCATCAAGGTAATTGCAGAAGTCTTGAAATAAAGCGTATAAGCCAATTAACGCAGCAATAATCAATCCAATTGGGTTTGAAATAAACGCAAGCCTTAGATACTTAGCAAATAAAATCACTGTTGCAATGGCGGTAATCAGGATCTCATTGAATCCAGCGGTCCCATTAATAACGGAAGAAATCGCTTTCACAACCATTGTTAGTATATTTGCTAAGAAAGAAAATACTGAACCTAATGCAGCAATAAATTGATCAACAGATTCTTTATTCTCTTTCAAAAATTTAAGTAGTTGCTCACAAATATCAAGTAATATCGGCAAAAAAGCCGCTAATAATTCTTGAAATACTTTATCTGTGACTTCAGATATTTTCTTCTGCACTTTTTCATACTTTTCCGCTAATCCCTCCATGCCCGGAATTTTGGACATATCGCCCAATATATTAAAAGCATGAGAAAAAACCCCTTTGAGTTTGGAGAACGTAGAAAAAATGGTTAGCCCTCCTTTTTTTATCCCGGAAAAAGCTTTATCCATACCAGAACTTAACTTACTCATTCCCTGAGAAAGTCCGGCCGGAATTTTTGACAATAGAGGAAAAGATTGCTGACATTTTTCAGCCAACTGACTAAAAAATTGTTTACTTGATACTATTGCCTGACTAAACGCCCCATCGACCTTATCCGCCATATTCTGGCTAAAAACCGCCAGTAGACTAATAAAGCTGGTTCGAACCCCTTTAAAGCTGGTCATTAACTCATTTTTCTTCTCCGACACAAAGTCAATGAATTGGCCAAAATGGTTACGTAGCTCATCGAATTGCTGCTTTATCGCTGCAAATTTACTAATATACTGAACAAGCAGGTTCAACTTTTCTGACAGTAATACCGCTTGTTTTTCGATCCCTTCCAGACCTGATAACTTAGCGATTTCAGTAAGACGAGAAGAAAAACCGATCACCCCATTTTCTAATCGAGTAAAAATCGACGTGGTATGGCTTGATTGTGCGGTAATATTTTTAAAAACATTCGATTGAGACTTATCTACCCCACTGAATAATTGGTTAATTTTAGAAGCACTACTCTCAATATTATTAACAATATCTTTTGTCATTCTGGAGTGTGATTGATTAAATTTCTTAAGCACAGCTTGTTGCTGAACTTCCAATACTCCTGAAAGTTTGAGTAGCGCAGCATTGAGTTTATCCATTCCAGATGATAAATCTGATAATTGCCCTACTGCACTCATGTTATTGGCTTCTCATTATGCTGGCGTTGTATCTCATCCCATTCCGTTATGGCTTCATGAAATGAGCATAAATCACCAATTGAATAAATTGTTTTTAGCTCATTCAAAGTACAAAGGCCCCGCATAACGGGGCCAAAAATGAACCAATCAGTAGAGCCTGTTACTCTGTCTTCGGTAGAAGACCCGATAACAGACTCTCGCCGCCA
GAGAAAAAATCAGAAAATTGATATTTCACCCCCTCAACAAGCACATTAATCATATGAGAACGGTATTTATTAAAGTGTTCATTAACTTTATTGCTTAGCTTAAAAGCATTTCCTTCAATGGAAACCGAAGTATGCTCAAACACCAATTTTTCAATGTCATTAACCACTGGATCGCCTAAATTAGACAACAGCGTACCGATTGAAACTGGTTGCTGACCATGACCAAACTCTAAACCACGCAATAACGAAGCGGCTTTTTTTAATGCACTCCAGGCTGCAACGGCATTGGCTGGCGTCATAACATAAGTCACATCATCAACTGAAAATTCTTGTTTCATTTCCATTAGTTTGCATATCCTTTTTCAAAGCGGAAATCCATTTTTTCAAATACGATGGTCCACGTTTGGGCATTATGACCAGCACCGCGAACGTAACCAGGTAATGTGGTTAAATAACCTTTCGTGGCCGTCACTTGGTCTTCATTAAGTAAATCGCGAATTTCTAATGTGAAAGGAACAAATGTTTTTAGATTTTCTTGCTGACGAGCCCATTCACTAAACTGTTTGTTATCTGCTGAATGCTGTTTAATTTTTAATGCTAACGTACCCGATTTATTTGGCTCCCGGATAAAAATACCTTTACCATCTGCACCAATTGTCATCGCCCCCATATCTGTTGCCCGAGTAGCGCTAATCACATCTGAGCCATCGGCCCAATCACTCATCACGACACCATTTAAAATTACTACTGTTTGTTGTGGATCAAAAACTGCCATCTTATTTTCCTTATAAAAAAAGCCCTAAATGGGCCTAAAATTAATTAATAATAAATAGATTGAAATATTAACGATTGAAATTCACCATCACATCGACAGAATGAATGGCTCCTGCTAACTTAACTGCAACTTGGATTGGTGGTGATTTACGTTGTTCACGGTCGGATATTGATAAATTATCAACGGTATCAGCCCATACATAGAAACCATCGTCTAAGCGATCGCCTGAATTTAATGTACCAAAGCTATCTCCATTCCATACCCCAGGTGCAAAAGCACCGTTTTTAATACCTTCTAAACAGACTTTCTTCACCGCAGCAATTAATCGAGCGGTTCCAGCGTCAGTTAAAGGAACTTTGGTTGGTGACTGATAAAGCACAGCAAAGGTTTCTTTTTCAACTGCATCAATAAACCAGTCAAGAATATGTACTTCATCAAAGAAACGTTTGCCAAGCACAGTGCCTTCCGCCACCATGGCCGCCTGATCAAAATAGGTGTAATAGTTCAGACCTAATGCTTTACATTTATTTGCTTCGGTCAGTGTTAAATCATCCGCAGAAACGCCCGGTAATGACTTAAATTTCATGGTAATGGTTGAATTATTGGCAGAAAAATTAACGGCTAATGCTCGAGCTAACCAAGAGTTAATCGCATATTGATCGTTCTTATCATATAAAACCACTGTGCGATCGTTCTGTTTATCCACCAGCTTTTTGAAAATATTGCCATTACTATTTTCAATATGAGAGGATTTTAACGTTGTTAAACCTAACACCTTTTTATCAGAGGCTTGAATCCAGTCGGACGCGGCTTGAATTTCACTATCGGTTAATGCCGCAACAATACTTGCTGCATACCAATTTGGATTTTTATTTTGCAATGCGGTAAACGCTTCCGGCAATGATTGCACGGAGATTGTTGCCGCATCGCCACCTTTCAGTTTTTCAGCCTGCCCATTTTCCAACTGTAGCAGTTTGCCAATATAGGTGCCGGATAATGCTGAATCAACGACATAATCAACAGAATGTTGCTGCCCGGCATTTTTAGCTTTAATAATAAACCGCTGCCCAACTTCATCAAAGATGACGTTAATTGGGCTGGTTTTTGGCAACAATGAATTAATCATGGTAGCAATACTGGTGAAGCTTGATACGCCGCTTGATAAATCCAAGCCTGAAATATCAAACTTCTGACCATCAGCAACCAATGATAAATAGCCATCGGT
AATATAGGCCATATCGGCTAATGTTGTGCTAACAGGATATCCTTCAACCGCCGCTTCAACCGCCGAAATCGTCTGAGTCGTGTTATTCCAACGCGCAATAATAAGCTGTTTTGGTTTTGGCGTTTGGGAAAAAAACAGACGTGATGCCGCCGCAGTTTCAGAACGTGAGCCAAATGCTTTTTCCACATCATTTTGTGTGGCACATTCAATAAATACGGTATTGGGATCGGTAAAAACATGACCATATTCTGGTGTGAATAGCGCTAACGTGCCAAAGTCGCGTCGAAATGGCGCTACTGGTGATGAGTTAAGCTGTACATTAACAATTTGGGATAAAGGTAGTGACATATTTCTCTCTTTCTTATTTTAAGTTATTGGTTTCTGTATGAACAACCAGACCGCAAGCGTCCATGCGTCGCTGTTCTACATCCACATTGATTTTATGAGTTAAGATCAATTCCATTGATACAGATTGTTCTGGTATTGACCCTTCATTTTGTTGCCTTATCGGCGACGTTTTAAGAAGGCTTAAATGTTTTCTTTTGAATTCACTTTTACCTTGGCTGGAATGAATTACACAATAAGCGCGCATCATTAATTCATAGGCTTGTTCCCCATAGGTTGTTAAACGAATAGTTGATGAACAATCAACAGTGACACGTTCAATTTCATCATGATGAAGAAATTCCAATTTTGCTCCTCCTAAAGGAGCTACCTCTAATAATGCCAATGCAGCAAGAGGACTGGTCAAATTGGGTTGAAATGAAGCTGTAATTTGTTCATCTGTTAAACGTAAAAGAGACTTGAGAATATCTTTTACACTGTAGATGGATAATGTTTCTGTTGGCATATTAATTACTCTTATCACCAATGAAATAGGCAATATCTGTCCTCTCACTATAAAAATAGCAAGAGGCATGTTTAAATCCCCTAAATGAATGATTAATATAATCCGCTGTCAGTTAAAAATAATAAAAAACAAATTACATATGCAATATTTCGTTCTAATAACCTTTAACTTACCCAACGTTATTATAATGAATAAAACTATAACACCAGCTTAATTATAATTAGCTGATATAATTTTTAAATGTATAAAATGACCTATCACGCATTTCTACATTCAGCCATATTTAGATAATAAAAAACCAGCCGAAGCTGGTTATGATAATTACCATTAACTCAAATAAAGTTATTAGATAGACATAAAAAATAAAGTCTGATATTTCATTTAAATACTTTAAGCAGCGACGTCCATTGCATCATCAATCTTAGCTAAAACATATTTTTCAGCATAATTTAATGTTTCAGAAATAGAATGAGGCCGTTTGCCCATCACCTTAGCCTGACGGTTACATGAAATCCTATTTAAATAATAAGAAGTTAATACCGCATAAGCGAACGGATCCGAACTCCTTAATCTTAATACAGATTTATCAACCAACTCGGCTTCATCTTCAGTTAAAAAAGGACGATAATCAAACTGTTGATTGACCGATATAAACCCCGAAGAATGAGAAGGAAACTCAGTACCTACTCGTTTAGTGCCTCTAACATTTGCCCAGCCCTTTAAGACTGTTTTAATATCTCTCATCACTATTTTCCCTGTAGTCATCCTAAGCTATACACAAAATGTTAAAGCACATGCGCAAACATGTCAACAAAATAAACAAAATGTTTATTGCAATTTTAAACCGTTTGTTTAAAAATAGTTTATATAAATTAAAGAAAAGGGAGATATCATGGGCTCATTATCAGAAAGACTAAAACAAATAATGACAGAAAAAAATTTGAATCAAACAGAATTAGCCAAACTTGCTGGTGCTTCACCACAATCCGTCACTAACTGGTTAAAACGAAATTCAATGAGTAAAAATGTTGCACGCATTATTTGTGAGAAGACCGGTTACTCTATTGATTGGCTATTATTCGGTCTTGGTCAAGATGCACATACTGATACTAATTTAAAACCATCACGCCTACAGCCCATTACCTGGGAA
GAGATATCGACAACAAATACTGAGTTTGTTGAAGTTCCAGTTTTAGACATTCATCTCTCAGCCGGCGATGGGTACTACAATACAGAAGAAAATGAAATCTACACGCTACCTTTTAGGGCTCACACACTCAGGCGTGAAGGTATTCATGTTGAAAACTTAAAGGTCGTCAGAGTAACGGGTGACAGTATGGAGCCGAAACTTTCTGATAACGATACTGTTTCTATTGATACCGCAGATAAAATTGTGCGCGATGGAAAGATGTATGCTATCCGTATTGGCGACGTGCAAAAAATTAAAACATTGATACAGAACTCAGATGGCAGCATCACCATGCGCTCTTATAATGCAAACTATAAAGACGAGATTGTTGCTAAAGAACAAATAGAATGTGGGGAATTTACCGTTATAGGACGAGTATGGTGGATCTCATCAATCGTTTAATGCCATCTTCCCGACAATATTGATTGAATTGATAGCGGGCTAAAATAATAAATCAAACATTTAATCGATCAAATAATGAACAAAAATATCATGCGCATAAAAAAATACCAGACACTAAGGTCTGGTATCTTAAAGTAGTGGCGGAACGGACGGGACTCGAACCCGCGACCCCCTGCGTGACAGGCAGGTATTCTAACCAACTGAACTACCGCTCCACTCGACTTTCCTTAGAAAGCGGGATGATAGTAAGGTTCCGTGCAGCGGGAGTCAATGAAAATTTACTCAAACAGAATCATTTGCACACAAATTCAGCGAACCGAATAAAAACAGCTAATTAACGGTTTCTCCGTCAACGGGTGTGCGCCAGAGACAACTGCCCCCTTTCTTGGCAATAAGATCGAGACGTGATTCGTGGGCTTCCACCTCTTCATCAGAGGCATAAACAATTTTGAGAGCACTTTCGGCACGAACAAGACGCTGAATTTCGGCACCTCGGGCTTGAGTCTGCGTCTCACCATCAACAGAAAATGATAATGAGGTTTGGCCACCGGTCATCGTTAGATAAACTTCGGCCAAAATTTCGGCATCGAGTAAAGCGCCGTGCAGCGTTCGGCGGCTGTTATCAATATCTAAACGGGTACATAGCGCATCCAAACTATTACGCTTGCCCGGGAACATCTTACGCGCCATTAGCAATGAGTCAGTTATCGTACAGAATGACTCAACTTTCGGATAATCGCTACGTAACATGCGAAACTCATGATCCATAAAGCCCACGTCAAACGACGCGTTGTGAATCACTAATTCCGCACCATCAATAAACTGATAAAAGTCATCGGCGATTTCCGCAAACGTTGGTTTATCCGCCAAAAATTCGTCGCTAATCCCATGTACCGCATAAGCATCCGGATCAACCAAGCGGTCGGGCTTAATATACACATGAAAATGCCTGCCCGTCAGGCGACGGTTCACCACTTCAACAGCACCGATTTCAATAATTCGGTGTCCCTCATAGTGAACCCCAACCATATTCATACCGGTGGTTTCAGTATCGAGAACAATCTGTCGTGTAATTTCAGTGCTTATTGGTTCCAT
Protein sequences of DBSCAN-SWA_5 >LR134531|2764750:2786211|2771972_2772563_-|VEJ56103.1|tail|DBSCAN-SWA MNNIKFDKYGLANNEGIIRVYNVDSITGEYLNSTDELLVAGIGLPQYSYLDEPFVSRAGFAVCRRDGEWQYIEDHRGETAYVKHNGAEQIMSELGPIPAELTLLKPMTIYDKWHDDRWVTDKNAEHQAEVDTAKIEKQQRDEQAKDEIVRLSDAIEFGIELDDDRDRLTTWRKYRILLNRVDIEMAPNIQWPEQPE >LR134531|2764750:2786211|2766354_2767827_-|VEJ56099.1|DBSCAN-SWA MTIYQRPDEKILAESSKQGEVKDFPDISRGWGIVFDKTGGIPPMEWFNALGKRTDEAIRYLLQRGIAEWSKTEDYPVGAVVSYDKRIWLAEKNNKGIEPKMNTVWRESALTIDSANFIIEAPKDGERYVRCAGQWVKEEATQSNSENIVEHRSDKDEDLHVVFDIPLLKTLTSGRHIWQAKKGFCLYLYWQSEVSGTDESEWTCTKMRNDYPFILQVDKNSYFDSPEYWDEEEPSICVEGKMIPYLSSAEVYPIYFEISGCGNGTDIDLSERKQNISLWSQYCSGSGTDDIPNKGVDTQMDCDIQAIRRAQENYGSRIVGYNNCKMTIKVSDGRGNLIARKDGSSGYYSYCSSSDITSEYTLPYLHWSGVKLFRTEFPDYLVRLRCGEQARKTYDLDEKSNGNAKIGFTCDIVSLEENKYNRNPETWKSYDFKTESNELIQKVKNEDAVYLEQIMPGGMYFQFYAEVQSQFSANKFPFDGASVVAAPKTQ >LR134531|2764750:2786211|2770252_2771887_-|VEJ56102.1|DBSCAN-SWA MTIYQRPDEKILAESSKQGEVKDFPDISRGWGVAFDKTGGIPPMEWFNALGKRTDEAVRYLLQRGIAEWSKTEDYPVGALVSHNQGIWLAEQSSKGVEPKTNSLWKETALTIEQIRKLIPVNSVNGKMGNVVLNAGDVGALKRGDFGLGTRSPLLPAGTNFSSDLVTSSNNEFTLSGKYIGTPDDNKVWGATLTVKRRLYDANCACTLILFHEGTHYIKHGNKSTGSWIWSNWESIILSSGGTFSGEVIIDKDIAPTFTFKKKNRVYRNAFSGDDTFIHYITGQDGDKTRLRWISDKNTWNFESCNVTKHGKNLLVDGEYGLGSKNLVADCKFTDDITKQNGAFQASPPNELPESMKGSGMLGCAGLTIPYRTSSLPYSYRILFSADPYGVASKTPAVFFQEYALNGWGKPVEAITTANIANYLPKTTVLYPEGTEANPSTLLTNQRIIIDNPYKSMNIHCIVELYIQADDGNYYWGPSGSTCYWYSGRDYSGGVDAYPRSDNKIVVITGGHAVVIGGVAIKNIWNRALNVLSAPYRVRVTKLS >LR134531|2764750:2786211|2774465_2775611_-|VEJ56106.1|DBSCAN-SWA MGQLTHTGYVPERLNTIIEELQSGFKRIYGNDINLDPDSPDGQMIGLIGQIKADLEELAEVIYQALDPDHATGVWLEQRAAYAGLIRRKASYSYLRNVVLTGDPFTKIYSGAVVSDPNKNRWLLVNDVTLNAQGSAVADFRSEYLGSYPVSEKTELKIEIVVLGWRYALSSYAAEIGQEEESDPVFRKRFFRSRAKQAKNNVDGLVASLNQLPDVKEVMILENSTSEPDKNQVPAHSINVIIDGGDDYLIAETVFNRKTAGTGLFGQTSVVVRDTQNIARDICFDRPKWANCQVYLELCRNQDFTSIDIDAIKVAISQMTFSIGQSVNISRLYTPINQTQGFWVDKLFIARKGEPLKAQNIDINIREIANIHIDDIEVIVR >LR134531|2764750:2786211|2780629_2781019_-|VEJ56113.1|DBSCAN-SWA 
MEMKQEFSVDDVTYVMTPANAVAAWSALKKAASLLRGLEFGHGQQPVSIGTLLSNLGDPVVNDIEKLVFEHTSVSIEGNAFKLSNKVNEHFNKYRSHMINVLVEGVKYQFSDFFSGGESLLSGLLPKTE >LR134531|2764750:2786211|2784417_2785116_+|VEJ56118.1|DBSCAN-SWA MGSLSERLKQIMTEKNLNQTELAKLAGASPQSVTNWLKRNSMSKNVARIICEKTGYSIDWLLFGLGQDAHTDTNLKPSRLQPITWEEISTTNTEFVEVPVLDIHLSAGDGYYNTEENEIYTLPFRAHTLRREGIHVENLKVVRVTGDSMEPKLSDNDTVSIDTADKIVRDGKMYAIRIGDVQKIKTLIQNSDGSITMRSYNANYKDEIVAKEQIECGEFTVIGRVWWISSIV >LR134531|2764750:2786211|2775644_2775992_-|VEJ56107.1|DBSCAN-SWA MRVRRLDKNHDWSFGLGRFNYAQDSESIAQRVKTRLLSFKGDWVHDLEHGIPWLPHFERSFDLSRLEREIKLQILETEGVKSLDEFTMRLDPDSRQMTVSVYLTDQYDQQLIVKT >LR134531|2764750:2786211|2783915_2784269_-|VEJ56117.1|DBSCAN-SWA MRDIKTVLKGWANVRGTKRVGTEFPSHSSGFISVNQQFDYRPFLTEDEAELVDKSVLRLRSSDPFAYAVLTSYYLNRISCNRQAKVMGKRPHSISETLNYAEKYVLAKIDDAMDVAA >LR134531|2764750:2786211|2776684_2777506_-|VEJ56109.1|DBSCAN-SWA MRQFNRQFRLDIGDENDGISINNLRVEFSISKTIDKKPNPAQIKIWNLNQSHREQLISLQFKYLKLFVGYDRLNMIYAGNIKTTKVVRDGLDILIELECADGDVDYRNSFISRSLASGCTDQHIIQALSDSMKNTQPGSIDIQSELSLPRGKVLHGASRDLLSDIAIKHNADWSIQDGELVFLPNDRFLNKEITVLSQDTGMIGSPATSDKDLEVCCLLNPQIQVGSKVRIESIESIHNGDYKVISVLHQGDAIGGDWKTTLRLINKREEKNE >LR134531|2764750:2786211|2778615_2780469_-|VEJ56112.1|DBSCAN-SWA MSAVGQLSDLSSGMDKLNAALLKLSGVLEVQQQAVLKKFNQSHSRMTKDIVNNIESSASKINQLFSGVDKSQSNVFKNITAQSSHTTSIFTRLENGVIGFSSRLTEIAKLSGLEGIEKQAVLLSEKLNLLVQYISKFAAIKQQFDELRNHFGQFIDFVSEKKNELMTSFKGVRTSFISLLAVFSQNMADKVDGAFSQAIVSSKQFFSQLAEKCQQSFPLLSKIPAGLSQGMSKLSSGMDKAFSGIKKGGLTIFSTFSKLKGVFSHAFNILGDMSKIPGMEGLAEKYEKVQKKISEVTDKVFQELLAAFLPILLDICEQLLKFLKENKESVDQFIAALGSVFSFLANILTMVVKAISSVINGTAGFNEILITAIATVILFAKYLRLAFISNPIGLIIAALIGLYALFQDFCNYLDGGESSLAGFWEPLVKGFNAVKQFVMGLFDFFKNLIVGYFSLIWRIVSNPLQAFKDFIAFIQDACADIPDIISDAFETAWDWVLSGIDAVIGWIKDKFSAVTSIFSAIGSFFGFGSDDDISATVDKTLKTVSEVPKAVDSARTRASHQIRSAKDRDIAVNNHSEQTVHMNITSDNPHLVAEIIDKKLKQRDVSTYEDNRAQLKV >LR134531|2764750:2786211|2767917_2768271_-|VEJ56100.1|DBSCAN-SWA 
MKVYAKLNTSEQLIDGSTGDSLPFDGYIKMLSQRPADGDWHAKLDGLWHKGDPAIEDAFISEQMRVIADELLKHDDDDDSAIATRAAWIEYRKALRKWRFEQNINYPDPSFRPIQPQ >LR134531|2764750:2786211|2772562_2773819_-|VEJ56104.1|DBSCAN-SWA MSQYTRPDEKVFAQSAKVGEVESFPDVERGWGVAFEQSDGIPPMEWFNGLGKRTDESIRYLLQKGISDWSATEDYPLNAYVQHKNKMWVAVRANINEQPLMSSAWIQTALTIEEIKRLIPDVPVKSVNGKLNHVVLNAQDVGSFERSKNLADADPVIVRKNLRLGAAALRDVGVANNQLMQVGAFGLGNQQTQYTNVTTANNLLTTGFYFVASTVKDIPKLNDSYILVHVGVNTVYQDLTVANSGEKFSRIKRGNSWSNWQLVITDVNSFAITTPVGVPMPWPNINPPSGWLKCNGASFDKNRFPLLAKAYPAGALPDLRGEFIRGWDDGRGVDEARGILTDQNDAMQKITGYFRSIDRNYDWIGGSAYLHSRWSTSIKSGSGDGWGTNVMFDSSREVRTAEETRPRNIAFLYIVRAA >LR134531|2764750:2786211|2775981_2776692_-|VEJ56108.1|plate|DBSCAN-SWA MSELSTSMPSQDNAIASTIKATLLQIHTAFPGKVVAFNPALQTAQIQPMIDQVLIDDTVISLPVLLDVPVQFPRAGEFALTFPVKPGDEGQVIINERCIDGWFQSGDHSKPLDYRLFDLSDASFIPGISSVPKAIPAFDMDCIAIRKVDNSAYVKIDDSGSIEVDGQQMIIKCPLFVEQLLTYQGGMAGSGGSGGASASISGTVKVAGDVIANGVSLTNHSHPGDSGGTTGAPNAS >LR134531|2764750:2786211|2785446_2786211_-|VEJ56119.1|DBSCAN-SWA MEPISTEITRQIVLDTETTGMNMVGVHYEGHRIIEIGAVEVVNRRLTGRHFHVYIKPDRLVDPDAYAVHGISDEFLADKPTFAEIADDFYQFIDGAELVIHNASFDVGFMDHEFRMLRSDYPKVESFCTITDSLLMARKMFPGKRNSLDALCTRLDIDNSRRTLHGALLDAEILAEVYLTMTGGQTSLSFSVDGETQTQARGAEIQRLVRAESALKIVYASDEEVEAHESRLDLIAKKGGSCLWRTPVDGETVN >LR134531|2764750:2786211|2773815_2774469_-|VEJ56105.1|DBSCAN-SWA MKYSELLIWQYKDKPKAKATIELVSEQYQKTAQGLISLVEALNIDRAEGKNLDLIGKHVGQSRILNHAMSKELFGFWASLNSKGFSQKRQSGGRWYKKGSILAESAILDDEDYRFLIRCRIAKNYMTGTIENITELLSFIFNSESIAYDQYDMSLSVVIKSDDISTFKRYAITQLDILPRPAGVNIYFYTAISNKPFGLLGGVGAYAFNEGQFARIL >LR134531|2764750:2786211|2768270_2770037_-|VEJ56101.1|DBSCAN-SWA 
MTIYQRPDEKILAESSKQGEVKDFPDISRGWGVAFDKTGGIPPMEWFNALGKRTDEAVRYLLQRGIAEWSKTEDYPVGALVSHNQGIWLAEQSSKGVEPKTNSLWKETALTIEQIRKLIPVNSVNGKMGNVVLNAGDIGALDKLGGVLDGDLHVQKSALAMVRLVTNHQNKFKADLFCNHTFGIATDNGAGGTTTRFRFTPDASKISGVWNFENCNVTKNGVNIPTVNQVANQLVADSTSKQVRYAKIATGRINRIDIGNVSFLVSGANNFGSPNNNLEYVSFSCRTESNKATIIHQIITPSFHPDHQNRYGWVFNAVDNTVELWLKSTSYGGFLGLTRLSGYHPSYISVCDKLIWQDIEPEKIVYVEPSKTISSNDPVIMKGDYGLGSELAPLISDFTDNLPSGFYRALGAGTATPTKNLPPNTKNSVLYVIVQKAGNQVPFFIVHQHAAQADNVRSWVGIYYKGKLTWKETTVTNNFTILYPNGTAQNPMTIGVNTRLVVDNPFPGHCVMCIAEVKFQDKWGPTGFIYSPGGPGGYGTVAHQYDDDKIIVQTGNKEVLNAAHNTGTPHASASITSAPLRVKVWKVT >LR134531|2764750:2786211|2777492_2777819_-|VEJ56110.1|DBSCAN-SWA MYKIKLKSMPYQEFHFTIRDCKLRVTLRYNSIGTQWFMDVYDVKKQKVIVQGAALVVGTPILWRSPTEYYFYLTDEEQLGLDPIFMDDLDSRCVLHVGLKSEVLDEAI >LR134531|2764750:2786211|2781517_2783023_-|VEJ56115.1|DBSCAN-SWA MSLPLSQIVNVQLNSSPVAPFRRDFGTLALFTPEYGHVFTDPNTVFIECATQNDVEKAFGSRSETAAASRLFFSQTPKPKQLIIARWNNTTQTISAVEAAVEGYPVSTTLADMAYITDGYLSLVADGQKFDISGLDLSSGVSSFTSIATMINSLLPKTSPINVIFDEVGQRFIIKAKNAGQQHSVDYVVDSALSGTYIGKLLQLENGQAEKLKGGDAATISVQSLPEAFTALQNKNPNWYAASIVAALTDSEIQAASDWIQASDKKVLGLTTLKSSHIENSNGNIFKKLVDKQNDRTVVLYDKNDQYAINSWLARALAVNFSANNSTITMKFKSLPGVSADDLTLTEANKCKALGLNYYTYFDQAAMVAEGTVLGKRFFDEVHILDWFIDAVEKETFAVLYQSPTKVPLTDAGTARLIAAVKKVCLEGIKNGAFAPGVWNGDSFGTLNSGDRLDDGFYVWADTVDNLSISDREQRKSPPIQVAVKLAGAIHSVDVMVNFNR >LR134531|2764750:2786211|2764750_2766217_-|VEJ56098.1|DBSCAN-SWA MTIYQRPDEKILAESSKQDEVKPFPDISRGWGVAFDKTGGIPPMEWFNALGQRTDEAIRYLLQRGIAEWSKTEDYPAGALVSYNKDVWLAERNSKGIEPKANTVWKETALTIEQIKKLIPVNSVNGKTGSLVLNASDVGAVSKSGDTMSGELKTTNLDAHRIMVKNRAAISRFDGYDYYILFTNNNDPHGTWNSLRPIRLNWQSGQVTFNHGINTNSMLDNSTNIGRTNGSPMKSISSDDDILSLPIGAKFMCVQSGGYQLPISYGYIEKICNRDIGQGFGCMFYSYQSSRLWYGYKMNTDSRLVWKEIITTDNISRHIEKTTVGSIQLLPFRKNELPVGWYFTNGDKYSLTSVQGKALNSLSISFKTDWGIKVINNTINLPNLFHSDGRGVFLRSVDGISRQVGHIQDDAIRNINGVINNVSDGRGGSNVISSGAFKTTKAIKGHQNGTSGYTQVSELTFDASFTVPTAEENRPLNMGMTPAIYLGI >LR134531|2764750:2786211|2777838_2778612_-|VEJ56111.1|DBSCAN-SWA 
MPISNKRGKSRASISNQEQRVSLYNRRVGHLTLDVTKSEQHGSQLKITENPVEFGASVTDHSMVTPISISISGLVVDYDSEQEAFSNTASPTIRGNKDFVNDVSIPGLKNRTDYAEKLCQKRIKKNASLNNDSAKEAKHAITPWSPDNSQAWLKDRSVKSNRIERIYDDFLALQKLGQTIEITTGIKHYKNMLIESLSVSQSSEGFADFSITAREVFIVQSKVINGVGIILKNEKQLSGRAASQGCAKTSVGSVAPK >LR134531|2764750:2786211|2781018_2781450_-|VEJ56114.1|DBSCAN-SWA MAVFDPQQTVVILNGVVMSDWADGSDVISATRATDMGAMTIGADGKGIFIREPNKSGTLALKIKQHSADNKQFSEWARQQENLKTFVPFTLEIRDLLNEDQVTATKGYLTTLPGYVRGAGHNAQTWTIVFEKMDFRFEKGYAN >LR134531|2764750:2786211|2783036_2783558_-|VEJ56116.1|DBSCAN-SWA MPISLVIRVINMPTETLSIYSVKDILKSLLRLTDEQITASFQPNLTSPLAALALLEVAPLGGAKLEFLHHDEIERVTVDCSSTIRLTTYGEQAYELMMRAYCVIHSSQGKSEFKRKHLSLLKTSPIRQQNEGSIPEQSVSMELILTHKINVDVEQRRMDACGLVVHTETNNLK |
22 | Haemophilus_phage(57.89%) | plate,tail | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
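The keyword rule described above can be checked against the protein headers listed for this prophage region. The following is a minimal sketch, not the database's own code. It assumes that each header is laid out as `contig|region|start_end_strand|protein_id|[keyword|]tool`, which is inferred from the records shown here and not documented by the source. The function name `parse_header` and the returned field names are hypothetical.

```python
# Phage-related keywords quoted in the note on colored proteins.
PHAGE_KEYWORDS = {
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
}


def parse_header(header):
    """Split one protein header into its parts (field layout is an assumption)."""
    fields = header.lstrip(">").strip().split("|")
    contig, region, gene, protein_id = fields[:4]
    # Gene coordinates look like "2771972_2772563_-": start, end, strand.
    start, end, strand = gene.split("_")
    # Anything between the protein ID and the trailing tool name is treated
    # as an optional keyword annotation.
    keywords = [f for f in fields[4:-1] if f in PHAGE_KEYWORDS]
    return {
        "contig": contig,
        "region": region,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "protein_id": protein_id,
        "keywords": keywords,
        "tool": fields[-1],
    }


# Two headers copied from the protein list of this region.
headers = [
    ">LR134531|2764750:2786211|2771972_2772563_-|VEJ56103.1|tail|DBSCAN-SWA",
    ">LR134531|2764750:2786211|2766354_2767827_-|VEJ56099.1|DBSCAN-SWA",
]

for h in headers:
    record = parse_header(h)
    print(record["protein_id"], record["keywords"])
```

Under this assumed layout, a protein counts as keyword-annotated when its `keywords` list is non-empty, which for this region would apply to the records tagged `tail` and `plate`.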
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|