Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP028702 | Escherichia coli strain J53 chromosome, complete genome | 6 crisprs | DEDDh,DinG,cas3,c2c9_V-U4,WYL,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,PD-DExK | 0 | 12 | 9 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP028702_1 | 438569-438700 | Orphan |
NA
Consensus repeat of CP028702_1
|
2 spacers
spacers of CP028702_1
>1.1|438586|40|CP028702|PILER-CR CCATAAAACAATATTGAAAATTTCTTTTTGCTACGCCGTG >1.2|438643|42|CP028702|PILER-CR GAACTTAACAATATTGAAAGTTGGATTTATCTGCGTGTGACA |
CRISPR arrays and Neighbor proteins around CP028702_1
The CRISPR arrays of CP028702_1 >merge|CP028702|1|438569-438700|PILER-CR TTTCAATATTGGTGATCCATAAAACAATATTGAAAATTTCTTTTTGCTACGCCGTGTTTTCAATATTGGTGAGGAACTTAACAATATTGAAAGTTGGATTTATCTGCGTGTGACATTTTCAATATTGGTGAT >CP028702|1|1|438569-438700|PILER-CR TTTCAATATTGGTGATC CATAAAACAATATTGAAAATTTCTTTTTGCTACGCCGTGT TTTCAATATTGGTGAGG AACTTAACAATATTGAAAGTTGGATTTATCTGCGTGTGACAT TTTCAATATTGGTGAT
>CP028702.1|AVZ47607.1|436804_438319_-|L-carnitine/gamma-butyrobetaine-antiporter MKNEKRKTGIEPKVFFPPLIIVGILCWLTVRDLDAANVVINAVFSYVTNVWGWAFEWYMVVMLFGWFWLVFGPYAKKRLGNEPPEFSTASWIFMMFASCTSAAVLFWGSIEIYYYISTPPFGLEPNSTGAKELGLAYSLFHWGPLPWATYSFLSVAFAYFFFVRKMEVIRPSSTLVPLVGEKHAKGLFGTIVDNFYLVALIFAMGTSLGLATPLVTECMQWLFGIPHTLQLDAIIITCWIILNAICVACGLQKGVRIASDVRSYLSFLMLGWVFIVSGASFIMNYFTDSVGMLLMYLPRMLFYTDPIAKGGFPQGWTVFYWAWWVIYAIQMSIFLARISRGRTVRELCFGMVLGLTASTWILWTVLGSNTLLLIDKNIINIPNLIEQYGVARAIIETWAALPLSTATMWGFFILCFIATVTLVNACSYTLAMSTCREVRDGEEPPLLVRIGWSILVGIIGIVLLALGGLKPIQTAIIAGGCPLFFVNIMVTLSFIKDAKQNWKD >CP028702.1|AVZ47606.1|435631_436774_-|crotonobetainyl-CoA-dehydrogenase MDFNLNDEQELFVAGIRELMASENWEAYFAECDRDSVYPERFVKALADMGIDSLLIPEEHGGLDAGFVTLAAVWMELGRLGAPTYVLYQLPGGFNTFLREGTQEQIDKIMAFRGTGKQMWNSAITEPGAGSDVGSLKTTYTRRNGKIYLNGSKCFITSSAYTPYIVVMARDGASPDKPVYTEWFVDMSKPGIKVTKLEKLGLRMDSCCEITFDDVELDEKDMFGREGNGFNRVKEEFDHERFLVALTNYGTAMCAFEDAARYANQRVQFGEAIGRFQLIQEKFAHMAIKLNSMKNMLYEAAWKADNGTITSGDAAMCKYFCANAAFEVVDSAMQVLGGVGIAGNHRISRFWRDLRVDRVSGGSDEMQILTLGRAVLKQYR >CP028702.1|AVZ47605.1|434285_435503_-|L-carnitine-CoA-transferase MDHLPMPKFGPLAGLRVVFSGIEIAGPFAGQMFAEWGAEVIWIENVAWADTIRVQPNYPQLSRRNLHALSLNIFKDEGREAFLKLMETTDIFIEASKGPAFARRGITDEVLWQHNPKLVIAHLSGFGQYGTEEYTNLPAYNTIAQAFSGYLIQNGDVDQPMPAFPYTADYFSGLTATTAALAALHKVRETGKGESIDIAMYEVMLRMGQYFMMDYFNGGEMCPRMSKGKDPYYAGCGLYKCADGYIVMELVGITQIEECFKDIGLAHLLGTPEIPEGTQLIHRIECPYGPLVEEKLDAWLATHTIAEVKERFAELNIACAKVLTVPELESNPQYVARESITQWQTMDGRTCKGPNIMPKFKNNPGQIWRGMPSHGMDTAAILKNIGYSENDIQELVSKGLAKVED >CP028702.1|AVZ47604.1|432658_434212_-|ATP-dependent-acyl-CoA-ligase MDIIGGQHLRQMWDDLADVYGHKTALICESSGGVVNRYSYLELNQEINRTANLFYTLGIRKGDKVALHLDNCPEFIFCWFGLAKIGAIMVPINARLLCEESAWILQNSQACLLVTSAQFYPMYQQIQQEDATQLRHICLTDVALPADDGVSSFTQLKNQQPATLCYAPPLSTDDTAEILFTSGTTSRPKGVVITHYNLRFAGYYSAWQCALRDDDVYLTVMPAFHIDCQCTAAMAAFSAGATFVLVEKYSARAFWGQVQKYRATVTECIPMMIRTLMVQPPSANDQQHRLREVMFYLNLSEQEKDAFCERFGVRLLTSYGMTETIVGIIGDRPGDKRRWPSIGRVGFCYEAEIRDDHNRPLPAGEIGEICIKGIPGKTIFKEYFLNPQATAKVLEADGWLHTGDTGYRDEEDFFYFVDRRCNMIKRGGENVSCVELENIIAAHPKIQDIVVVGIKDSIRDEAIKAFVVLNEGETLSEEEFFRFCEQNMAKFKVPSYLEIRKDLPRNCSGKIIRKNLK >CP028702.1|AVZ47603.1|431764_432550_-|carnitinyl-CoA-dehydratase MSESLHLTRNGSILEITLDRPKANAIDAKTSFEMGEVFLNFRDDPQLRVAIITGAGEKFFSAGWDLKAAAEGEAPDADFGPGGFAGLTEIFNLDKPVIAAVNGYAFGGGFELALAADFIVCADNASFALPEAKLGIVPDSGGVLRLPKILPPAIVNEMVMTGRRMGAEEALRWGIVNRVVSQAELMDNARELAQQLVNSAPLAIAALKEIYRTTSEMPVEEAYRYIRSGVLKHYPSVLHSEDAIEGPLAFAEKRDPVWKGR >CP028702.1|AVZ47602.1|431168_431759_-|carnitine-operon-protein-CaiE MSYYAFEGLIPVVHPTAFVHPSAVLIGDVIVGAGVYIGPLASLRGDYGRLIVQAGANIQDGCIMHGYCDTDTIVGENGHIGHGAILHGCLIGRDALVGMNSVIMDGAVIGEESIVAAMSFVKAGFRGEKRQLLMGTPARAVRNVSDDELHWKRLNTKEYQDLVGRCHVSLHETQPLRQMEENRPRLQGTTDVTPKR >CP028702.1|AVZ47601.1|430687_431083_+|transcriptional-activatory-protein-CaiF MCEGYVEKPLYLLIAEWMMAENRWVIAREISIHFDIEHSKAVNTLTYILSEVTEISCEVKMIPNKLEGRGCQCQRLVKVVDIDEQIYARLRNNSREKLVGVRKTPRIPAVPLTELNREQKWQMMLSKSMRR >CP028702.1|AVZ47600.1|430473_430653_-|hypothetical-protein MTRFEAIKQGHIKIVDISIVCNFTVDKCELNPAYVIKNIDSPKDLLNGQKKTVLIREPY >CP028702.1|AVZ47599.1|427204_430426_+|carbamoyl-phosphate-synthase-large-chain MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEMADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCALELERQGVLEEFGVTMIGATADAIDKAEDRRRFDVAMKKIGLETARSGIAHTMEEALAVAADVGFPCIIRPSFTMGGSGGGIAYNREEFEEICARGLDLSPTKELLIDESLIGWKEYEMEVVRDKNDNCIIVCSIENFDAMGIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIGVETGGSNVQFAVNPKNGRLIVIEMNPRVSRSSALASKATGFPIAKVAAKLAVGYTLDELMNDITGGRTPASFEPSIDYVVTKIPRFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQESLQKALRGLEVGATGFDPKVSLDDPEALTKIRRELKDAGADRIWYIADAFRAGLSVDGVFNLTNIDRWFLVQIEELVRLEEKVAEVGITGLNADFLRQLKRKGFADARLAKLAGVREAEIRKLRDQYDLHPVYKRVDTCAAEFATDTAYMYSTYEEECEANPSTDREKIMVLGGGPNRIGQGIEFDYCCVHASLALREDGYETIMVNCNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVPVIGTSPDAIDRAEDRERFQHAVERLKLKQPANATVTAIEMAVEKAKEIGYPLVVRPSYVLGGRAMEIVYDEADLRRYFQTAVSVSNDAPVLLDHFLDDAVEVDVDAICDGEMVLIGGIMEHIEQAGVHSGDSACSLPAYTLSQEIQDVMRQQVQKLAFELQVRGLMNVQFAVKNNEVYLIEVNPRAARTVPFVSKATGVPLAKVAARVMAGKSLAEQGVTKEVIPPYYSVKEVVLPFNKFPGVDPLLGPEMRSTGEVMGVGRTFAEAFAKAQLGSNSTMKKHGRALLSVREGDKERVVDLAAKLLKQGFELDATHGTAIVLGEAGINPRLVNKVHEGRPHIQDRIKNGEYTYIINTTSGRRAIEDSRVIRRSALQYKVHYDTTLNGGFATAMALNADATEKVISVQEMHAQIK >CP028702.1|AVZ47598.1|426038_427187_+|carbamoyl-phosphate-synthase-small-subunit MIKSALLVLEDGTQFHGRAIGATGSAVGEVVFNTSMTGYQEILTDPSYSRQIVTLTYPHIGNVGTNDADEESSQVHAQGLVIRDLPLIASNFRNTEDLSSYLKRHNIVAIADIDTRKLTRLLREKGAQNGCIIAGDNPDAALALEKARAFPGLNGMDLAKEVTTAEAYSWTQGSWTLTGGLPEAKKEDELPFHVVAYDFGAKRNILRMLVDRGCRLTIVPAQTSAEDVLKMNPDGIFLSNGPGDPAPCDYAITAIQKFLETDIPVFGICLGHQLLALASGAKTVKMKFGHHGGNHPVKDVEKNVVMITAQNHGFAVDEATLPANLRVTHKSLFDGTLQGIHRTDKPAFSFQGHPEASPGPHDAAPLFDHFIELIEQYRKTAK >CP028702.1|AVZ47608.1|438790_439561_+|protein-FixA MKIITCYKCVPDEQDIAVNNADGSLDFSKADAKISQYDLNAIEAACQLKQQAAEAQVTALSVGGKALTNAKGRKDVLSRGPDELIVVIDDQFEQALPQQTASALAAAAQKAGFDLILCGDGSSDLYAQQVGLLVGEILNIPAVNGVSKIISLTADTLTVERELEDETETLSIPLPAVVAVSTDINSPQIPSMKAILGAAKKPVQVWSAADIGFNAEAAWSEQQVAAPKQRERQRIVIEGDGEEQIAAFAENLRKVI >CP028702.1|AVZ47609.1|439575_440517_+|protein-FixB MNTFSQVWVFSDTPSRLPELMNGAQALANQINTFVLNDADGAQAIQLGANHVWKLNGKPDDRMIEDYAGVMADTIRQHGADGLVLLPNTRRGKLLAAKLGYRLKAAVSNDASTVSVQDGKATVKHMVYGGLAIGEERIATPYAVLTISSGTFDAAQPDASRTGETHTVEWQAPAVAITRTATQARQSNSVDLDKARLVVSVGRGIGSKENIALAEQLCKAIGAELACSRPVAENEKWMEHERYVGISNLMLKPELYLAVGISGQIQHMVGANASQTIFAINKDKNAPIFQYADYGIVGDAVKILPALTAALAR >CP028702.1|AVZ47610.1|440567_441854_+|FAD-dependent-oxidoreductase MSEDIFDAIIVGAGLAGSVAALVLAREGAQVLVIERGNSAGAKNVTGGRLYAHSLEHIIPGFADSAPVERLITHEKLAFMTEKSAMTMDYCNGDETSPSQRSYSVLRSKFDAWLMEQAEEAGAQLITGIRVDNLVQRDGKVVGVEADGDVIEAKTVILADGVNSILAEKLGMAKRVKPTDVAVGVKELIELPKSVIEDRFQLQGNQGAACLFAGSPTDGLMGGGFLYTNENTLSLGLVCGLHHLHDAKKSVPQMLEDFKQHPAVAPLIAGGKLVEYSAHVVPEAGINMLPELVGDGVLIAGDAAGMCMNLGFTIRGMDLAIAAGEAAAKTVLSAMKSDDFSKQKLAEYRQHLESGPLRDMRMYQKLPAFLDNPRMFSGYPELAVGVARDLFTIDGSAPELMRKKILRHGKKVGFINLIKDGMKGVTVL >CP028702.1|AVZ47611.1|441850_442138_+|ferredoxin-like-protein-FixX MTSPVNVDVKLGVNKFNVDEEHPHIVVKADADKQALELLVKACPAGLYKKQDDGSVRFDYAGCLECGTCRILGLGSALEQWEYPRGTFGVEFRYG >CP028702.1|AVZ47612.1|442194_443526_+|MFS-transporter MQPSRNFDDLKFSSIHRRILLWGSGGPFLDGYVLVMIGVALEQLTPALKLDADWIGLLGAGTLAGLFVGTSLFGYISDKVGRRKMFLIDIIAIGVISVATMFVSSPVELLVMRVLIGIVIGADYPIATSMITEFSSTRQRAFSISFIAAMWYVGATCADLVGYWLYDVEGGWRWMLGSAAIPCLLILIGRFELPESPRWLLRKGRVKECEEMMIKLFGEPVAFDEEQPQQTRFRDLFNRRHFPFVLFVAAIWTCQVIPMFAIYTFGPQIVGLLGLGVGKNAALGNVVISLFFMLGCIPPMLWLNTAGRRPLLIGSFAMMTLALAVLGLIPDMGIWLVVMAFAVYAFFSGGPGNLQWLYPNELFPTDIRASAVGVIMSLSRIGTIVSTWALPIFINNYGISNTMLMGAGISLFGLLISVAFAPETRGMSLAQTSNMTIRGQRMG >CP028702.1|AVZ47613.1|443633_444164_+|glutathione-regulated-potassium-efflux-system-ancillary-protein-KefF MILIIYAHPYPHHSHANKRMLEQARTLEGVEIRSLYQLYPDFNIDIAAEQEALSRADLIVWQHPMQWYSIPPLLKLWIDKVFSHGWAYGHGGTALHGKHLLWAVTTGGGESHFEIGAHPGFDVLSQPLQATAIYCGLNWLPPFAMHCTFICDDETLEGQARHYKQRLLEWQEAHHG >CP028702.1|AVZ47614.1|444156_446019_+|glutathione-regulated-potassium-efflux-system-protein-KefC MDSHTLIQALIYLGSAALIVPIAVRLGLGSVLGYLIAGCIIGPWGLRLVTDAESILHFAEIGVVLMLFIIGLELDPQRLWKLRAAVFGCGALQMVICGGLLGLFCMLLGLRWQVAELIGMTLALSSTAIAMQAMNERNLMVTQMGRSAFAVLLFQDIAAIPLVAMIPLLATSSASTTMGAFALSALKVAGALVLVVLLGRYVTRPALRFVARSGLREVFSAVALFLVFGFGLLLEEVGLSMAMGAFLAGVLLASSEYRHALESDIEPFKGLLLGLFFIGVGMSIDFGTLLENPLRIVILLLGFLIIKIAMLWLIARPLQVPNKQRRWFAVLLGQGSEFAFVVFGAAQMANVLEPEWAKSLTLAVALSMAATPILLVILNRLEQSSTEEAREADEIDEEQPRVIIAGFGRFGQITGRLLLSSGVKMVVLDHDPDHIETLRKFGMKVFYGDATRMDLLESAGAAKAEVLINAIDDPQTNLQLTEMVKEHFPHLQIIARARDVDHYIRLRQAGVEKPERETFEGALKTGRLALESLGLGPYEARERADVFRRFNIQMVEEMAMVENDTKARAAVYKRTSAMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKPSS >CP028702.1|AVZ47615.1|446210_446690_+|type-3-dihydrofolate-reductase MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLNKPVIMGRHTWESIGRPLPGRKNIILSSQPGTDDRVTWVKSVDEAIAACGDVPEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVEGDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR >CP028702.1|AVZ47616.1|446767_447610_-|diadenosine-tetraphosphatase MATYLIGDVHGCYDELIALLHKVEFTPGKDTLWLTGDLVARGPGSLDVLRYVKSLGDSVRLVLGNHDLHLLAVFAGISRNKPKDRLTPLLEAPDADELLNWLRRQPLLQIDEEKKLVMAHAGITPQWDLQTAKECARDVEAVLSSDSYPFFLDAMYGDMPNNWSPELRGLGRLRFITNAFTRMRFCFPNGQLDMYSKESPEEAPAPLKPWFAIPGPVAEEYSIAFGHWASLEGKGTPEGIYALDTGCCWGGTLTCLRWEDKQYFVQPSNRHKDLGEAAAS >CP028702.1|AVZ47617.1|447616_447994_-|Co2+/Mg2+-efflux-protein-ApaG MINSPRVCIQVQSVYIEAQSSPDNERYVFAYTVTIRNLGRAPVQLLGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMIDENGVPFSIDIPVFRLAVPTLIH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP028702_2 | 792905-793049 | Orphan |
NA
Consensus repeat of CP028702_2
|
1 spacers
spacers of CP028702_2
>2.1|792948|59|CP028702|CRISPRCasFinder GGTGCCAGAACCGTAGGCCGGATAAGGCGTTCACGCCGCATCCGGCAATAAGTGCTCCG |
CRISPR arrays and Neighbor proteins around CP028702_2
The CRISPR arrays of CP028702_2 >merge|CP028702|2|792905-793049|CRISPRCasFinder ATGCCTGATGCGACGCTTGCCGCGTCTTATCAGGCCTACAAAAGGTGCCAGAACCGTAGGCCGGATAAGGCGTTCACGCCGCATCCGGCAATAAGTGCTCCGATGCCTGATGCGACGCTTGCCGCGTCTTATCAGGCCTGCAAAA >CP028702|2|1|792905-793049|CRISPRCasFinder ATGCCTGATGCGACGCTTGCCGCGTCTTATCAGGCCTACAAAA GGTGCCAGAACCGTAGGCCGGATAAGGCGTTCACGCCGCATCCGGCAATAAGTGCTCCG ATGCCTGATGCGACGCTTGCCGCGTCTTATCAGGCCTGCAAAA
>CP028702.1|AVZ47937.1|791972_792881_+|fructokinase MRIGIDLGGTKTEVIALGDAGEQLYRHRLPTPRDDYRQTIETIATLVDMAEQATGQRGTVGMGIPGSISPYTGVVKNANSTWLNGQPFDKDLSARLQREVRLANDANCLAVSEAVDGAAAGAQTVFAVIIGTGCGAGVAFNGRAHIGGNGTAGEWGHNPLPWMDEDELRYREEVPCYCGKQGCIETFISGTGFAMDYRRLSGHALKGSEIIRLVEESDPVAELALRRYELRLAKSLAHVVNILDPDVIVLGGGMSNVDRLYQTVGQLIKQFVFGGECETPVRKAKHGDSSGVRGAAWLWPQE >CP028702.1|AVZ51601.1|790936_791848_-|recombination-associated-protein-RdgC MLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMGSHSDALTHVANGQIVICARKEEKILPSPVIKQALEAKIAKLEAEQARKLKKTEKDSLKDEVLHSLLPRAFSRFSQTMMWIDTVNGLIMVDCASAKKAEDTLALLRKSLGSLPVVPLSMENPIELTLTEWVRSGSAAQGFQLLDEAELKSLLEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRIQFVMCDDGSLKRLKFCDELRDQNEDIDREDFAQRFDADFILMTGELAALIQNLIEGLGGEAQR >CP028702.1|AVZ47936.1|790497_790779_+|DUF2773-domain-containing-protein MLQSRNDHLRQTALRNAHTPALLLTTLTEPQDRSLAINNPQLAADVKTAWLKEDPSLLLFVEQPDLSLLRDLVKTGATRKIRSEARHRLEEKQ >CP028702.1|AVZ47935.1|790005_790290_+|pyrimidine/purine-nucleoside-phosphorylase MLQSNEYFSGKVKSIGFSSSSTGRASVGVMVEGEYTFSTAEPEEMTVISGALNVLLPDATDWQVYEAGSVFNVPGHSEFHLQVAEPTSYLCRYL >CP028702.1|AVZ47934.1|789256_789934_+|protein-AroM MSASLAILTIGIVPMQEVLPLLTEYIDEDNISHHSLLGKLSREEVMAEYAPEAGEDTILTLLNDNQLAHVSRRKVERDLQGVVEVLDNQGYDVILLMSTANISSMTARNTIFLEPSRILPPLVSSIVEDHQVGVIVPVEEMLPVQAQKWQILQKSPVFSLGNPIHDSEQKIIDAGKELLAKGADVIMLDCLGFHQRHRDLLQKQLDVPVLLSNVLIARLAAELLV >CP028702.1|AVZ47933.1|789029_789227_+|hypothetical-protein MPLQGICVTFFIIHVFLIIRILNNNNPLLESFGIFTLCRARLLLRFLSFVAQVSVSSGASHLPGN >CP028702.1|AVZ47932.1|788807_788999_+|protein-YaiA MPTKPPYPREAYIVTIEKGKPGQTVTWYQLRADHPKPDSLISEHPTAQEAMDAKKRYEDPDKE >CP028702.1|AVZ47931.1|788233_788758_+|shikimate-kinase MTQPLFLIGPRGCGKTTVGMALADSLNRRFVDTDQWLQSQLNMTVAEIVEREEWAGFRARETAALEAVTAPSTVIATGGGIILTEFNRHFMQNNGIVVYLCAPVSVLVNRLQAAPEEDLRPTLTGKPLSEEVQEVLEERDALYREVAHIIIDATNEPSQVISEIRSALAQTINC >CP028702.1|AVZ47930.1|787592_788051_+|YaiI/YqxD-family-protein MTIWVDADACPNVIKEILYRAAERMQMPLVLVANQSLRVPPSRFIRTLRVAAGFDVADNEIVRQCEAGDLVITADIPLAAEAIEKGAAALNPRGERYTPATIRERLTMRDFMDTLRASGIQTGGPDSLSQRDRQAFAAELEKWWLEVQRSRG >CP028702.1|AVZ47929.1|786663_787473_-|pyrroline-5-carboxylate-reductase MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDKVAALHDQFGINAAESAQEVAQIADIIFAAVKPGIMIKVLSEITSSLNKDSLVVSIAAGVTLDQLARALGHDRKIIRAMPNTPALVNAGMTSVTPNALVTPEDTADVLNIFRCFGEAEVIAEPMIHPVVGVSGSSPAYVFMFIEAMADAAVLGGMPRAQAYKFAAQAVMGSAKMVLETGEHPGALKDMVCSPGGTTIEAVRVLEEKGFRAAVIEAMTKCMEKSEKLSKS >CP028702.1|AVZ51602.1|793125_794310_-|MFS-transporter MKKVILSLALGTFGLGMAEFGIMGVLTELAHNVGISIPAAGHMISYYALGVVVGAPIIALFSSRYSLKHILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIKPGKVTAAVAGMVSGMTVANLLGIPLGTYLSQEFSWRYTFLLIAVFNIAVMASVYFWVPDIRDEAKGNLREQFHFLRSPAPWLIFAATMFGNAGVFAWFSYVKPYMMFISGFSETAMTFIMMLVGLGMVLGNMLSGRISGRYSPLRIAAVTDFIIVLALLMLFFCGGMKTTSLIFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAFNLGSAVGAYCGGMMLTLGLAYNYVALPAALLSFAAMSSLLLYGRYKRQQAADTPVLAKPLG >CP028702.1|AVZ47938.1|794435_797582_-|nuclease-SbcCD-subunit-C MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAICLALYHETPRLSNVSQSQNDLMTRDTAECLAEVEFEVKGEAYRAFWSQNRARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRSMLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTELEKLQAQASGVTLLTPEQVQSLTASLQVLTDEEKQLITAQQQEQQSLNWLTRQDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIAEHSAALAHIRQQIEEVNTRLQSTMALRASIRHHAAKQSAELQQQQQSLNTWLQEHDRFRQWNNEPAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALAAITLTLTADEVATALAQHAEQRPLRQHLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGATLRGQLDAITKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPLDDIQPWLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQLLLTTLTGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTALQNRIQQLTPILETLPQSDELPHCEETVVLENWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKAQAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLVTQTAETLAQHQQHRPDDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQQLKQDADNRQQQQTLMQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQGLTLDNLVHLANQQLTRLHGRYLLQRKASEALEVEVVDTWQADAVRDTRTLSGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDALDALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESTFAVK >CP028702.1|AVZ47939.1|797578_798781_-|exonuclease-sbcCD-subunit-D MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETAQTHQVDAIIVAGDVFDTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFLNTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIITSQAGLNGIEKQQHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIYIGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECGKSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVSQEPPVWLDIEITTDEYLHDIQRKIQALTESLPVEVLLVRRSREQRERVLASQQRETLSELSVEEVFNRRLALEELDESQQQRLQHLFTTTLHTLAGEHEA >CP028702.1|AVZ47940.1|798970_799660_+|DNA-binding-response-regulator MARRILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGIQFIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRISPMAVEEVIEMQGLSLDPTSHRVMAGEEPLEMGPTEFKLLHFFMTHPERVYSREQLLNHVWGTNVYVEDRTVDVHIRRLRKALEPGGHDRMVQTVRGTGYRFSTRF >CP028702.1|AVZ47941.1|799717_801013_+|PAS-domain-containing-sensor-histidine-kinase MLERLSWKRLVLELLLCCLPAFILGAFFGYLPWFLLASVTGLLIWHFWNLLRLSWWLWVDRSMTPPPGRGSWEPLLYGLHQMQLRNKKRRRELGNLIKRFRSGAESLPDAVVLTTEEGGIFWCNGLAQQILGLRWPEDNGQNILNLLRYPEFTQYLKTRDFSRPLNLVLNTGRHLEIRVMPYTHKQLLMVARDVTQMHQLEGARRNFFANVSHELRTPLTVLQGYLEMMNEQPLEGAVREKALHTMREQTQRMEGLVKQLLTLSKIEAAPTHLLNEKVDVPMMLRVVEREAQTLSQKKQTFTFEIDNGLKVSGNEDQLRSAISNLVYNAVNHTPEGTHITVRWQRVPHGAEFSVEDNGPGIAPEHIPRLTERFYRVDKARSRQTGGSGLGLAIVKHAVNHHESRLNIESTVGKGTRFSFVIPERLIAKNSD >CP028702.1|AVZ47942.1|801419_802739_+|branched-chain-amino-acid-transport-system-2-carrier-protein MTHQLRSRDIIALGFMTFALFVGAGNIIFPPMVGLQAGEHVWTAAFGFLITAVGLPVLTVVALAKVGGGVDSLSTPIGKVAGVLLATVCYLAVGPLFATPRTATVSFEVGIAPLTGDSALPLFIYSLVYFAIVILVSLYPGKLLDTVGNFLAPLKIIALVILSVAAIVWPAGSISTATEAYQNAAFSNGFVNGYLTMDTLGAMVFGIVIVNAARSRGVTEARLLTRYTVWAGLMAGVGLTLLYLALFRLGSDSASLVDQSANGAAILHAYVQHTFGGGGSFLLAALIFIACLVTAVGLTCACAEFFAQYVPLSYRTLVFILGGFSMVVSNLGLSQLIQISVPVLTAIYPPCIALVVLSFTRSWWHNSSRVIAPPMFISLLFGILDGIKASAFSDILPSWAQRLPLAEQGLAWLMPTVVMVVLAIIWDRAAGRQVTSSAH >CP028702.1|AVZ47943.1|802814_804188_+|proline-specific-permease-ProY MESKNKLKRGLSTRHIRFMALGSAIGTGLFYGSADAIKMAGPSVLLAYIIGGIAAYIIMRALGEMSVHNPAASSFSRYAQENLGPLAGYITGWTYCFEILIVAIADVTAFGIYMGVWFPTVPHWIWVLSVVLIICAVNLMSVKVFGELEFWFSFFKVATIIIMIVAGFGIIIWGIGNGGQPTGIHNLWSNGGFFSNGWLGMVMSLQMVMFAYGGIEIIGITAGEAKDPEKSIPRAINSVPMRILVFYVGTLFVIMSIYPWNQVGTAGSPFVLTFQHMGITFAASILNFVVLTASLSAINSDVFGVGRMLHGMAEQGSAPKIFSKTSRRGIPWVTVLVMTTALLFAVYLNYIMPENVFLVIASLATFATVWVWIMILLSQIAFRRRLPPEEVKALKFKVPGGVATTIGGLIFLLFIIGLIGYHPDTRISLYVGFAWIVVLLIGWMFKRRHDRQLAENQ >CP028702.1|AVZ47944.1|804346_806161_+|alpha-glycosidase MLNAWHLPVPPFVKQSKDQLLITLWLTGEDPPQRIMLRTEHDNEEMSVPMHKQRSQPQPGVTAWRAAIDLSSGQPRRRYSFKLLWHDRQRWFTPQGFSRMPPARLEQFAVDVPDIGPQWAADQIFYQIFPDRFARSLPREAEQDHVYYHHAAGQEIILRDWDEPVTAQAGGSTFYGGDLDGISEKLPYLKKLGVTALYLNPVFKAPSVHKYDTEDYRHVDPQFGGDGALLRLRHNTQQLGMRLVLDGVFNHSGDSHAWFDRHNRGTGGACHNPESPWRDWYSFSDDGTALDWLGYASLPKLDYQSESLVNEIYRGEDSIVRHWLKAPWNMDGWRLDVVHMLGEAGGARNNMQHVAGITEAAKETQPEAYIVGEHFGDARQWLQADVEDAAMNYRGFTFPLWGFLANTDISYDPQQIDAQTCMAWMDNYRAGLSHQQQLRMFNQLDSHDTARFKTLLGRDIARLPLAVVWLFTWPGVPCIYYGDEVGLDGKNDPFCRKPFPWQVEKQDTALFALYQRMIALRKKSQALRHGGCQVLYAEDNVVVFVRVLNQQRVLVAINRGEACEVVLPASPFLNAVQWQCKEGHGQLTDGILALPAISATVWMN >CP028702.1|AVZ47945.1|806165_806747_-|ACP-phosphodiesterase MNFLAHLHLAHLAESSLSGNLLADFVRGNPEESFPPDVVAGIHMHRRIDVLTDNLPEVREAREWFRSETRRVAPITLDVMWDHFLSRHWSQLSPDFPLQEFVCYAREQVMTILPDSPPRFINLNNYLWSEQWLVRYRDMDFIQNVLNGMASRRPRLDALRDSWYDLDAHYDALETRFWQFYPRMMAQASRKAL >CP028702.1|AVZ47946.1|806839_807910_+|tRNA-preQ1(34)-S-adenosylmethionine-ribosyltransferase-isomerase-QueA MRVTDFSFELPESLIAHYPMPERSSCRLLSLDGPTGALTHGTFTDLLDKLNPGDLLVFNNTRVIPARLFGRKASGGKIEVLVERMLDDKRILAHIRASKAPKPGAELLLGDDESINATMTARHGALFEVEFNDERSVLDILNSIGHMPLPPYIDRPDEDADRELYQTVYSEKPGAVAAPTAGLHFDEPLLEKLRAKGVEMAFVTLHVGAGTFQPVRVDTIEDHIMHSEYAEVPQDVVDAVLAAKARGNRVIAVGTTSVRSLESAAQAAKNDLIEPFFDDTQIFIYPGFQYKVVDALVTNFHLPESTLIMLVSAFAGYQHTMNAYKAAVEEKYRFFSYGDAMFITYNPQAINERVGE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP028702_3 | 934533-934629 | Orphan |
NA
Consensus repeat of CP028702_3
|
1 spacers
spacers of CP028702_3
>3.1|934560|43|CP028702|CRISPRCasFinder CCGGTGCCGCATCCGGCAATTGGTGCACAATGCCTGATGCGAT |
CRISPR arrays and Neighbor proteins around CP028702_3
The CRISPR arrays of CP028702_3 >merge|CP028702|3|934533-934629|CRISPRCasFinder GCTTGACGCGTCTTATCAGGCCTACAACCGGTGCCGCATCCGGCAATTGGTGCACAATGCCTGATGCGATGCTTGACGCATCTTATCAGGCCTACAA >CP028702|3|2|934533-934629|CRISPRCasFinder GCTTGACGCGTCTTATCAGGCCTACAA CCGGTGCCGCATCCGGCAATTGGTGCACAATGCCTGATGCGAT GCTTGACGCATCTTATCAGGCCTACAA
>CP028702.1|AVZ48064.1|933605_934499_+|carbamate-kinase MKTLVVALGGNALLQRGEALTAENQYRNIASAVPALARLARSYRLAIVHGNGPQVGLLALQNLAWKEVEPYPLDVLVAESQGMIGYMLAQSLSAQPQMPPVTTVLTRIEVSPDDPAFLQPEKFIGPVYQPEEQEALEAAYGWQMKRDGKYLRRVVASPQPRKILDSEAIELLLKEGHVVICSGGGGVPVTDDGAGSEAVIDKDLAAALLAEQINADGLVILTDADAVYENWGTPQQRAIRHATPDELAPFAKADGSMGPNVTAVSGYVRSRGKPAWIGALSRIEETLAGEAGTCISL >CP028702.1|AVZ48063.1|932793_933609_+|DUF2877-domain-containing-protein MTIIHPLLASSSAPNYRQSWRLAGVWRRAINLMTESGELLTLHRQGSGFGPGGWVLRRAQFDALCGGLCGNERPQVVAQGIRLGRFTVKQPQRYCLLRITPPAHPQPLAAAWMQRAEETGLFGPLALAASDPLPAELRQFRHCFQAALNGVKTDWRHWLGKGPGLTPSHDDTLSGMLLAAWYYGALDARSGRPFFACSDNLQLVTTAVSVSYLRYAAQGYFASPLLHFVHALSCPKRTAVAIDSLLALGHTSGADTLLGFWLGQQLLQGKP >CP028702.1|AVZ48062.1|931523_932783_+|DUF1116-domain-containing-protein MFTSVAQANAAVIEQIRRARPHWLDVQPASSLISELNEGKTLLHAGPPMRWQEMTGPMKGACVGACLFEGWAKDEAQALAILEQGEVNFIPCHHVNAVGPMGGITSASMPMLVVENVTDGNRAYCNLNEGIGKVMRFGAYGEDVLTRHRWMRDVLMPVLSAALGRMERGIDLTAMMAQGITMGDEFHQRNIASSALLMRALAPQIARLDHDKQHIAEVMDFLSVTDQFFLNLAMAYCKAAMDAGAMIRAGSIVTAMTRNGNMFGIRVSGLGERWFTAPVNTPQGLFFTGFSQEQANPDMGDSAITETFGIGGAAMIAAPGVTRFVGAGGMEAARAVSEEMAEIYLERNMQLQIPSWDFQGACLGLDIRRVVETGITPLINTGIAHKEAGIGQIGAGTVRAPLACFEQALEALAESMGIG >CP028702.1|AVZ48061.1|929846_931514_+|protein-FdrA MIHAFIKKGCFQDSVSLMIISRKLSESENVDDVSVMMGTPANKALLDTTGFWHDDFNNATPNDICVAIRSEAADAGIAQAIMQQLEEALKQLAQGSGSSQALTQVRRWDSACQKLPDANLALISVAGEYAAELANQALDRNLNVMMFSDNVTLEDEIQLKTRAREKGLLVMGPDCGTSMIAGTPLAFANVMPEGNIGVIGASGTGIQELCSQIALAGEGITHAIGLGGRDLSREVGGISALTALEMLSADEKSEVLAFVSKPPAEAVRLKIVNAMKATGKPTVALFLGYTPAVARDENVWFASSLDEAARLACLLSRVTARRNAIAPVSSGFICGLYTGGTLAAEAAGLLAGHLGVEADDTHQHGMMLDADSHQIIDLGDDFYTVGRPHPMIDPTLRNQLIADLGAKPQVRVLLLDVVIGFGATADPAASLVSAWQKACAARLDNQPLYAIATVTGTERDPQCRSQQIATLEDAGIAVVSSLPEATLLAAALIHPLSPAAQQHTPSLLENVAVINIGLRSFALELQSASKPVVHYQWSPVAGGNKKLARLLERLQ >CP028702.1|AVZ48060.1|928480_929530_-|ureidoglycolate-dehydrogenase-(NAD(+)) MKISRETLHQLIENKLCQAGLKREHAATVAEVLVYADARGIHSHGAVRVEYYAERISKGGTNREPEFRLEETGPCSAILHADNAAGQVAAKMGMEHAIKTAQQNGVAVVGISRMGHSGAISYFVQQAARAGFIGISMCQSDPMVVPFGGAEIYYGTNPLAFAAPGEGDEILTFDMATTVQAWGKVLDARSRNMSIPDTWAVDKNGVPTTDPFAVHALLPAAGPKGYGLMMMIDVLSGVLLGLPFGRQVSSMYDDLHAGRNLGQLHIVINPNFFSSSELFRQHLSQTMRELNAITPAPGFNQVYYPGQDQDIKQRKAAVEGIEIVDDIYQYLISDALYNTSYETKNPFAQ >CP028702.1|AVZ48059.1|927223_928459_-|Zn-dependent-hydrolase MITHFRQAIEETLPWLSSFGADPAGGMTRLLYSPEWLETQQQFKKRMAASGLETRFDEVGNLYGRLNGTEYPQEVVLSGSHIDTVVNGGNLDGQFGALAAWLAIDWLKTQYGAPLRTVEVVAMAEEEGSRFPYVFWGSKNIFGLANPDDVRNICDAKGNSFVDAMKACGFTLPNAPLTPRQDIKAFVELHIEQGCVLESNGQSIGVVNAIVGQRRYTVTLNGESNHAGTTPMGYRRDTVYAFSRICHQSVEKAKRMGDPLVLTFGKVEPRPNTVNVVPGKTTFTIDCRHTDAAVLRDFTQQLENDMRAICDEMDIGIDIDLWMDEEPVPMNKELVATLTELCEREKLNYRVMHSGAGHDAQIFAPRVPTCMIFIPSINGISHNPAERTNITDLAEGVKTLALMLYQLAWQK >CP028702.1|AVZ48058.1|926427_927213_-|(S)-ureidoglycine-aminohydrolase MGYLNNVTGYREDLLANRAIVKHGNFALLTPDGLVKNIIPGFENCDATILSTPKLGASFVDYLVTLHQNGGNQQGFGGEGIETFLYVISGNITAKAEGKTFALSEGGYLYCPPGSLMTFVNAQAEDSQIFLYKRRYVPVEGYAPWLVSGNASELERIHYEGMDDVILLDFLPKELGFDMNMHILSFAPGASHGYIETHVQEHGAYILSGQGVYNLDNNWIPVKKGDYIFMGAYSLQAGYGVGRGEAFSYIYSKDCNRDVEI >CP028702.1|AVZ48057.1|925054_926200_+|glycerate-kinase MKIVIAPDSFKESLSAEKCCQAIKAGFSTLFPDANYICLPIADGGEGTVDAMVAATGGNIVTLEVCGPMGEKVNAFYGLTGDGKTAVIEMAAASGLMLVAPEKRNPLLASSFGTGELIRHALDNDIRHIILGIGGSATVDGGMGMAQALGVRFLDADGQALAANGGNLARVASIEMDECDPRLANCHIEVACDVDNPLVGARGAAAVFGPQKGATPEMVEELEQGLQNYARVLQQQTEINVCQMAGGGAAGGMGIAAAVFLNADIKPGIEIVLNAVNLAQAVQGAALVITGEGRIDSQTAGGKAPLGVASVAKQFNVPVIGIAGVLGDGVEVVHQYGIDAVFSILPRLAPLAEVLASGETNLFNSARNIACAIKIGQGIKN >CP028702.1|AVZ48056.1|923731_925033_+|uracil/xanthine-transporter MFNFAVSRESLLSGFQWFFFIFCNTVVVPPTLLSAFQLPQSSLLTLTQYAFLATALACFAQAFCGHRRAIMEGPGGLWWGTILTITLGEASRGTPINDIATSLAVGIALSGVLTMLIGFSGLGHRLARLFTPSVMVLFMLMLGAQLTTIFFKGMLGLPFGIADPNFKIQLPPFALSVAVMCLVLAMIIFLPQRFARYGLLVGTITGWLLWYFCFPSSHSLSGELHWQWFPLGSGGALSPGIILTAVITGLVNISNTYGAIRGTDVFYPQQGAGNTRYRRSFVATGFMTLITVPLAVIPFSPFVSSIGLLTQTGDYTRRSFIYGSVICLLVALVPALTRLFCSIPLPVSSAVMLVSYLPLLFSALVFSQQITFTARNIYRLALPLFVGIFLMALPPVYLQDLPLTLRPLLSNGLLVGILLAVLMDNLIPWERIE >CP028702.1|AVZ48055.1|922313_923675_+|cyclic-amidohydrolase MSFDLIIKNGTVILENEARVVDIAVKGGKIAAIGQDLGDAKEVMDASGLVVSPGMVDAHTHISEPGRSHWEGYETGTRAAAKGGITTMIEMPLNQLPATVDRASIELKFDAAKGKLTIDAAQLGGLVSYNIDRLHELDEVGVVGFKCFVATCGDRGIDNDFRDVNDWQFFKGAQKLGELGQPVLVHCENALICDELGEEAKREGRVTAHDYVASRPVFTEVEAIRRVLYLAKVAGCRLHVCHVSSPEGVEEVTRARQEGQDVTCESCPHYFVLDTDQFEEIGTLAKCSPPIRDLENQKGMWEKLFNGEIDCLVSDHSPCPPEMKAGNIMKAWGGIAGLQSCMDVMFDEAVQKRGMSLPMFGKLMATNAADIFGLQQKGRIAPGKDADFVFIQPNSSYVLTNDDLEYRHKVSPYVGRTIGARITKTILRGDVIYDIEQGFPVAPKGQFILKHQQ >CP028702.1|AVZ48065.1|934693_935761_-|5-(carboxyamino)imidazole-ribonucleotide-synthase MKQVCVLGNGQLGRMLRQAGEPLGIAVWPVGLDAEPAAVPFQQSVITAEIERWPETALTRELARHPAFVNRDVFPIIADRLTQKQLFDKLHLPTAPWQLLAERSEWPAVFDRLGELAIVKRRTGGYDGRGQWRLRANETEQLPAECYGECIVEQGINFSGEVSLVGARGFDGSTVFYPLTHNLHQDGILRTSVAFPQANAQQQAQAEEMLSAIMQELGYVGVMAMECFVTPQGLLINELAPRVHNSGHWTQNGASISQFELHLRAITDLPLPQPVVNNPSVMINLIGSDVNYDWLKLPLVHLHWYDKEVRPGRKVGHLNLTDSDTSRLTATLEALIPLLPPEYASGVIWAQSKFG >CP028702.1|AVZ48066.1|935757_936267_-|5-(carboxyamino)imidazole-ribonucleotide-mutase MSSRNNPARVAIVMGSKSDWATMQFAAEIFEILNVPHHVEVVSAHRTPDKLFSFAESAEENGYQVIIAGAGGAAHLPGMIAAKTLVPVLGVPVQSAALSGVDSLYSIVQMPRGIPVGTLAIGKAGAANAALLAAQILATHDKELHQRLNDWRKAQTDEVLENPDPRGAA >CP028702.1|AVZ48067.1|936384_937107_-|UDP-2,3-diacylglucosamine-hydrolase MATLFIADLHLCVEEPAITAGFLRFLAGEARKADALYILGDLFEAWIGDDDPNPLHRKMAAAIKAVSDSGVPCYFIHGNRDFLLGKRFARESGMTLLPEEKVLELYGRRVLIMHGDTLCTDDAGYQAFRAKVHKPWLQTLFLALPLFVRKRIAARMRANSKEANSSKSLAIMDVNQNAVVSAMEKHQVQWLIHGHTHRPAVHELIANQQPAFRVVLGAWHTEGSMVKVTADDVELIHFPF >CP028702.1|AVZ48068.1|937109_937604_-|peptidyl-prolyl-cis-trans-isomerase MVTFHTNHGDIVIKTFDDKAPETVKNFLDYCREGFYNNTIFHRVINGFMIQGGGFEPGMKQKATKEPIKNEANNGLKNTRGTLAMARTQAPHSATAQFFINVVDNDFLNFSGESLQGWGYCVFAEVVDGMDVVDKIKGVATGRSGMHQDVPKEDVIIESVTVSE >CP028702.1|AVZ48069.1|937777_939163_+|cysteine--tRNA-ligase MLKIFNTLTRQKEEFKPIHAGEVGMYVCGITVYDLCHIGHGRTFVAFDVVARYLRFLGYKLKYVRNITDIDDKIIKRANENGESFVAMVDRMIAEMHKDFDALNILRPDMEPRATHHIAEIIELTEQLIAKGHAYVADNGDVMFDVPTDPTYGVLSRQDLDQLQAGARVDVVDDKRNPMDFVLWKMSKEGEPSWPSPWGAGRPGWHIECSAMNCKQLGNHFDIHGGGSDLMFPHHENEIAQSTCAHDGQYVNYWMHSGMVMVDREKMSKSLGNFFTVRDVLKYYDAETVRYFLMSGHYRSQLNYSEENLKQARAALERLYTALRGTDKTVAPAGGEAFEARFIEAMDDDFNTPEAYSVLFDMAREVNRLKAEDMAAANAMASHLRKLSAVLGLLEQEPEAFLQSGAQADDSEVAEIEALIQQRLDARKAKDWAAADAARDRLNEMGIVLEDGPQGTTWRRK >CP028702.1|AVZ48070.1|939198_939720_-|hypothetical-protein MPTVITHAAVPLCIGLGLGSKVIPPRLLFAGIILAMLPDADVLSFKFGVAYGNVFGHRGFTHSLVFAFVVPLLCVFIGRRWFRAGLIRCWLFLTVSLLSHSLLDSVTTGGKGVGWLWPWSDERFFAPWQVIKVAPFALSRYTTPYGHQVIISELMWVWLPGMLLMGMLWWRRR >CP028702.1|AVZ48071.1|939827_940040_-|ribosome-associated-protein MATFSLGKHPHVELCDLLKLEGWSESGAQAKIAIAEGQVKVDGAVETRKRCKIVAGQTVSFAGHSVQVVA >CP028702.1|AVZ48072.1|940041_940908_-|bifunctional-methylenetetrahydrofolate-dehydrogenase/methenyltetrahydrofolate-cyclohydrolase MAAKIIDGKTIAQQVRSEVAQKVQARIAAGLRAPGLAVVLVGSNPASQIYVASKRKACEEVGFVSRSYDLPETTSEAELLELIDTLNADNTIDGILVQLPLPAGIDNVKVLERIHPDKDVDGFHPYNVGRLCQRAPRLRPCTPRGIVTLLERYNIDTFGLNAVVIGASNIVGRPMSMELLLAGCTTTVTHRFTKNLRHHVENADLLIVAVGKPGFIPGDWIKEGAIVIDVGINRLENGKVVGDVVFEDAAKRASYITPVPGGVGPMTVATLIENTLQACVEYHDPQDE >CP028702.1|AVZ48073.1|941378_941921_+|type-1-fimbrial-protein-subunit-FimA MKLRFISSALAAALFAATGSYAAVVDGGTIHFEGELVNAACSVNTDSADQVVTLGQYRTDIFNAVGNTSALIPFTIQLNDCDPVVAANAAVAFSGQADAINDNLLAIASSTNTTTATGVGIEILDNTSAILKPDGNSFSTNQNLIPGTNVLHFSARYKGTGTSASAGQANADATFIMRYE >CP028702.1|AVZ48074.1|942140_942833_+|fimbrial-chaperone-SfmC MMTKIKLLMLIIFYLIISASAHAAGGIALGATRIIYPADAKQTAVWIRNSHTNERFLVNSWIENSSGVKEKSFIITPPLFVSEPKSENTLRIIYTGPPLAADRESLFWMNVKTIPSVDKNALNGRNVLQLAILSRMKLFLRPIQLQELPAEAPDTLKFSRSGNYINVHNPSPFYVTLVNLQVGSQKLGNAMAAPRVNSQIPLPSGVQGKLKFQTVNDYGSVTPVREVNLN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP028702_4 | 2781089-2781215 | Orphan |
NA
Consensus repeat of CP028702_4
|
1 spacers
spacers of CP028702_4
>4.1|2781128|49|CP028702|CRISPRCasFinder TCCGGGTGCCGGATGCAGCGTGAACGCCTTATCCGGCCTACGGCTCGGA |
CRISPR arrays and Neighbor proteins around CP028702_4
The CRISPR arrays of CP028702_4 >merge|CP028702|4|2781089-2781215|CRISPRCasFinder TTTGTAGGCCTGATAAGACGCGCCAGCGTCGCATCAGGCTCCGGGTGCCGGATGCAGCGTGAACGCCTTATCCGGCCTACGGCTCGGATTTGTAGGCCTGATAAGACGCGCCAGCGTCGCATCAGGC >CP028702|4|3|2781089-2781215|CRISPRCasFinder TTTGTAGGCCTGATAAGACGCGCCAGCGTCGCATCAGGC TCCGGGTGCCGGATGCAGCGTGAACGCCTTATCCGGCCTACGGCTCGGA TTTGTAGGCCTGATAAGACGCGCCAGCGTCGCATCAGGC
>CP028702.1|AVZ49820.1|2778752_2781038_+|ribonucleoside-diphosphate-reductase-1-subunit-alpha MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEELAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI >CP028702.1|AVZ49819.1|2774304_2778057_-|AIDA-I-family-autotransporter-YfaL MRIIFLRKEYLSLLPSMIASLFSANGVAAVTDSCQGYDVKASCQASRQSLSGITQDWSIADGQWLVFSDMTNNASGGAVFLQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNAMFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYGGAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVDSIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRSNSLMNVGDTHCQDDPQDCYGLTIGSIDQYQNQAELNVGSTQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQSMALTGDIVVDDGAVLSLEGDAADLTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQVSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAGVDTQWGALMADSSGQHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDIQSIDAISSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSGAVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAKVGGYLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSGDLFSGRWGTDGEWMLGIVGGYSDNQGDSRSNMTGTRADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHSSGIIASLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW >CP028702.1|AVZ49818.1|2773454_2774177_+|bifunctional-3-demethylubiquinone-3-O-methyltransferase/2-octaprenyl-6-hydroxy-phenol-methylase MNAEKSPVNHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGATVTGLDMGFEPLQVAKLHALESGIQVDYVQETVEEHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTLNRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNTFKLGPGVDVNYMLHTQNK >CP028702.1|AVZ49817.1|2770680_2773308_-|DNA-gyrase-subunit-A MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE >CP028702.1|AVZ49816.1|2768843_2770532_-|DUF2138-domain-containing-protein MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNNLQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLGIEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETVPVYQLRYNGNNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIAGDLLSGKKRWQASFGLEERTAEKTPVRQRIVVSARWLGFGYQRLMPSFAGVRFEMGNDGWHSFVALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYINPQGIAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGAAWQWLPITWQPL >CP028702.1|AVZ49815.1|2768223_2768847_-|DUF1175-domain-containing-protein MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFLAR >CP028702.1|AVZ49814.1|2764186_2768200_-|alpha-2-macroglobulin-family-protein MGTGLANADDSLPSSNYAPPAGGTFFLLADSSFSSSEEAKVRLEAPGRDYRRYQMEEYGGVDVRLYRIPDPMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSSQSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVKQFRYPLWQAKPFEPQQGVKLEGASSNFISPQPGNIYIPLGQQEPGLYLVEAMVGGYRATTVVFVSDTVALSKVSGKELLVWTAGKKQGEAKPGSEILWTDGLGVMTRGVTDDSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTDRPLYRAGDRVDVKVIGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVNVTLDARNGGQGSFRLPENAVAGGYELRLAYRNQVYSSSFRVANYIKPHFEIGLALAKKEFKTGEAVSGKLQLLYPDGEPVKNARVQLSLRAQQLSMVGNDLRYAGRFPVSLEGSETVSDASGHVALNLPAADKPSRYLLTVSASDGAAYRVTTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWLRLEDRTSHSGELPSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSGKGSTAHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQSLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQNAGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVDEMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGATNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWRITARGMNGDGLVGQGRAYLRSEKNLYMKWSMPTVYRVGDKPAAGLFIFSQQDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNGQVQDSISTKLSFVDNSWPVEQQKNVMLGGGDNALMLPEQASNIRLQSSETPQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQMIQDNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQAIGVTQQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAIARRGTKTEDFSEEDTRDINDSLILDTPESPLADAVANVLTMTLLKKAQLKSTVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQTAAILSGLTAEQSTIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEDWRWVGQGVPDILSFGDELSPQNVQVR >CP028702.1|AVZ49813.1|2763685_2764171_-|alpha-2-macroglobulin MAQQSNIPVTVERQLYRLIPGEEEMSFILQPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTWGISVNKPNAAKQQGQLLEKARNEMGELAYMVPVKELTGTVTFRHLLRFSQKGQFVLPPARYVRSYAPAQQSVAAGSEWTGMQVK >CP028702.1|AVZ49812.1|2762035_2763685_-|DUF2300-domain-containing-protein MNWRRIVWLLALVTLPTLAEETPLQLVLRGAQHDQLYQLSSSGVTKVSALPDSLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESITRDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSVTVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWFADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQVASGQCVEVELFARYPLKKITAEKSTTAVNPGVLNGRYRVTFTNGNHITFVSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMTAWTQDLIYAGDPVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRSTCQLLPKAKAWLAKKMPQWRRILQAETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >CP028702.1|AVZ49811.1|2761254_2762031_-|DUF2135-domain-containing-protein MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPAEGEDASFSQSINYPASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDGSFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWDTDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPIHGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGELTLVKSFDW >CP028702.1|AVZ51683.1|2781271_2782402_+|ribonucleoside-diphosphate-reductase-1-subunit-beta MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDSEVDTDDLSNFQL >CP028702.1|AVZ49821.1|2782401_2782656_+|ferredoxin MARVTLRITGTQLLCQDEHPSLLAALESHNVAVEYQCREGYCGSCRTRLVAGQVDWIAEPLAFIQPGEILPCCCRAKGDIEIEM >CP028702.1|AVZ49822.1|2782709_2783360_-|protein-InaA MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIYVKTEGNAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM >CP028702.1|AVZ49823.1|2783574_2783781_+|hypothetical-protein MNFIRQGLGIALQPELTLKSIAGELCSVPLEPTFYRQISLLAKEKPVEGSPLFLLQMCMEQLVAIGKI >CP028702.1|AVZ49824.1|2783822_2784899_-|glycerophosphoryl-diester-phosphodiesterase MKLTLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDNLVVLHDHYLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMELNLVQLIAYTDWNETQQKQPDGSWVNYNYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTPDVNQLYDALYNKAGVNGLFTDFPDKAVKFLNKE >CP028702.1|AVZ49825.1|2784903_2786262_-|glycerol-3-phosphate-transporter MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSKFIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELTAKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFLYEYAGIPGTLLCGWMSDKVFRGNRGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAASAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQERNGG >CP028702.1|AVZ49826.1|2786534_2788163_+|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-A MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQEPAEVTLRKVISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL >CP028702.1|AVZ49827.1|2788152_2789412_+|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-B MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSHLPDGQPVTDIHSGLESLRQQAPAHPYSLLEPQRVLDLACQAQALIAESGAQLQGSVELAHQRVTPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDFQAHLAAASLRELGLAVETAEIELPELDVLRNNATEFRAVNIARFLDNEENWPLLLDALIPVANTCEMILMPACFGLADDKLWRWLNEKLPCSLMLLPTLPPSVLGIRLQNQLQRQFVRQGGVWMPGDEVKKVTCKNGVVNEIWTRNHADIPLRPRFAVLASGSFFSGGLVAERNGIREPILGLDVLQTATRGEWYKGDFFAPQPWQQFGVTTDETLRPSQAGQTIENLFAIGSVLGGFDPIAQGCGGGVCAVSALHAAQQIAQRAGGQQ >CP028702.1|AVZ49828.1|2789408_2790599_+|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-C MNDTSFENCIKCTVCTTACPVSRVNPGYPGPKQAGPDGERLRLKDGALYDEALKYCINCKRCEVACPSDVKIGDIIQRARAKYDTTRPSLRNFVLSHTDLMGSVSTPFAPIVNTATSLKPVRQLLDAALKIDHRRTLPKYSFGTFRRWYRSVAAQQAQYKDQVAFFHGCFVNYNHPQLGKDLIKVLNAMGTGVQLLSKEKCCGVPLIANGFTDKARKQAITNVESIREAVGVKGIPVIATSSTCTFALRDEYPEVLNVDNKGLRDHIELATRWLWRKLDEGKTLPLKPLPLKVVYHTPCHMEKMGWTLYTLELLRNIPGLELTVLDSQCCGIAGTYGFKKENYPTSQAIGAPLFRQIEESGADLVVTDCETCKWQIEMSTSLRCEHPITLLAQALA >CP028702.1|AVZ49829.1|2790791_2791691_+|hypothetical-protein MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYSDVLWSVETSDGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHQDKGYDRVPLVVPLLFYHGETSPYPYSLNWLDEFDDPQLARQLYTEAFLLVDITIVPDDEIMQHRRIALLELIQKHIRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIEEIAERSPLQKERLMTIAERLRQEGHQIGWQEGMHEQAIKIALRMLEQGFEREIVLATTQLTDADIPNCH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP028702_5 | 3312807-3313569 | TypeI-E |
I-E
Consensus repeat of CP028702_5
|
12 spacers
spacers of CP028702_5
>5.1|3312836|32|CP028702|CRISPRCasFinder,CRT CAGCGTCAGGCGTGAAATCTCACCGTCGTTGC >5.2|3312897|32|CP028702|CRISPRCasFinder,CRT TCGGTTCAGGCGTTGCAAACCTGGCTACCGGG >5.3|3312958|32|CP028702|CRISPRCasFinder,CRT GTAGTCCATCATTCCACCTATGTCTGAACTCC >5.4|3313019|32|CP028702|CRISPRCasFinder,CRT,PILER-CR CCGGGGGATAATGTTTACGGTCATGCGCCCCC >5.5|3313080|32|CP028702|CRISPRCasFinder,CRT,PILER-CR TGGGCGGCTTGCCTTGCAGCCAGCTCCAGCAG >5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR AAGCTGGCTGGCAATCTCTTTCGGGGTGAGTC >5.7|3313202|32|CP028702|CRISPRCasFinder,CRT,PILER-CR TAGTTTCCGTATCTCCGGATTTATAAAGCTGA >5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR GCAGGCGGCGACGCGCAGGGTATGCGCGATTCG >5.9|3313325|33|CP028702|CRISPRCasFinder,CRT,PILER-CR GCGACCGCTCAGAAATTCCAGACCCGATCCAAA >5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR TCAACATTATCAATTACAACCGACAGGGAGCC >5.11|3313448|32|CP028702|CRISPRCasFinder,CRT,PILER-CR AGCGTGTTCGGCATCACCTTTGGCTTCGGCTG >5.12|3313509|32|CP028702|CRISPRCasFinder,CRT,PILER-CR TGCGTGAGCGTATCGCCGCGCGTCTGCGAAAG |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around CP028702_5
The CRISPR arrays of CP028702_5 >merge|CP028702|5|3312807-3313569|CRISPRCasFinder,CRT,PILER-CR CGGTTTATCCCCGCTGATGCGGGGAACACCAGCGTCAGGCGTGAAATCTCACCGTCGTTGCCGGTTTATCCCTGCTGGCGCGGGGAACTCTCGGTTCAGGCGTTGCAAACCTGGCTACCGGGCGGTTTATCCCCGCTAACGCGGGGAACTCGTAGTCCATCATTCCACCTATGTCTGAACTCCCGGTTTATCCCCGCTGGCGCGGGGAACTCCCGGGGGATAATGTTTACGGTCATGCGCCCCCCGGTTTATCCCCGCTGGCGCGGGGAACTCTGGGCGGCTTGCCTTGCAGCCAGCTCCAGCAGCGGTTTATCCCCGCTGGCGCGGGGAACTCAAGCTGGCTGGCAATCTCTTTCGGGGTGAGTCCGGTTTATCCCCGCTGGCGCGGGGAACTCTAGTTTCCGTATCTCCGGATTTATAAAGCTGACGGTTTATCCCCGCTGGCGCGGGGAACTCGCAGGCGGCGACGCGCAGGGTATGCGCGATTCGCGGTTTATCCCCGCTGGCGCGGGGAACTCGCGACCGCTCAGAAATTCCAGACCCGATCCAAACGGTTTATCCCCGCTGGCGCGGGGAACTCTCAACATTATCAATTACAACCGACAGGGAGCCCGGTTTATCCCCGCTGGCGCGGGGAACTCAGCGTGTTCGGCATCACCTTTGGCTTCGGCTGCGGTTTATCCCCGCTGGCGCGGGGAACTCTGCGTGAGCGTATCGCCGCGCGTCTGCGAAAGCGGTTTATCCCCGCTGGCGCGGGGAACTC >CP028702|5|4|3312807-3313569|CRISPRCasFinder CGGTTTATCCCCGCTGATGCGGGGAACAC CAGCGTCAGGCGTGAAATCTCACCGTCGTTGC CGGTTTATCCCTGCTGGCGCGGGGAACTC TCGGTTCAGGCGTTGCAAACCTGGCTACCGGG CGGTTTATCCCCGCTAACGCGGGGAACTC GTAGTCCATCATTCCACCTATGTCTGAACTCC CGGTTTATCCCCGCTGGCGCGGGGAACTC CCGGGGGATAATGTTTACGGTCATGCGCCCCC CGGTTTATCCCCGCTGGCGCGGGGAACTC TGGGCGGCTTGCCTTGCAGCCAGCTCCAGCAG CGGTTTATCCCCGCTGGCGCGGGGAACTC AAGCTGGCTGGCAATCTCTTTCGGGGTGAGTC CGGTTTATCCCCGCTGGCGCGGGGAACTC TAGTTTCCGTATCTCCGGATTTATAAAGCTGA CGGTTTATCCCCGCTGGCGCGGGGAACTC GCAGGCGGCGACGCGCAGGGTATGCGCGATTCG CGGTTTATCCCCGCTGGCGCGGGGAACTC GCGACCGCTCAGAAATTCCAGACCCGATCCAAA CGGTTTATCCCCGCTGGCGCGGGGAACTC TCAACATTATCAATTACAACCGACAGGGAGCC CGGTTTATCCCCGCTGGCGCGGGGAACTC AGCGTGTTCGGCATCACCTTTGGCTTCGGCTG CGGTTTATCCCCGCTGGCGCGGGGAACTC TGCGTGAGCGTATCGCCGCGCGTCTGCGAAAG CGGTTTATCCCCGCTGGCGCGGGGAACTC >CP028702|5|1|3312807-3313569|CRT CGGTTTATCCCCGCTGATGCGGGGAACAC CAGCGTCAGGCGTGAAATCTCACCGTCGTTGC CGGTTTATCCCTGCTGGCGCGGGGAACTC TCGGTTCAGGCGTTGCAAACCTGGCTACCGGG CGGTTTATCCCCGCTAACGCGGGGAACTC GTAGTCCATCATTCCACCTATGTCTGAACTCC CGGTTTATCCCCGCTGGCGCGGGGAACTC CCGGGGGATAATGTTTACGGTCATGCGCCCCC CGGTTTATCCCCGCTGGCGCGGGGAACTC TGGGCGGCTTGCCTTGCAGCCAGCTCCAGCAG CGGTTTATCCCCGCTGGCGCGGGGAACTC AAGCTGGCTGGCAATCTCTTTCGGGGTGAGTC CGGTTTATCCCCGCTGGCGCGGGGAACTC TAGTTTCCGTATCTCCGGATTTATAAAGCTGA CGGTTTATCCCCGCTGGCGCGGGGAACTC GCAGGCGGCGACGCGCAGGGTATGCGCGATTCG CGGTTTATCCCCGCTGGCGCGGGGAACTC GCGACCGCTCAGAAATTCCAGACCCGATCCAAA CGGTTTATCCCCGCTGGCGCGGGGAACTC TCAACATTATCAATTACAACCGACAGGGAGCC CGGTTTATCCCCGCTGGCGCGGGGAACTC AGCGTGTTCGGCATCACCTTTGGCTTCGGCTG CGGTTTATCCCCGCTGGCGCGGGGAACTC TGCGTGAGCGTATCGCCGCGCGTCTGCGAAAG CGGTTTATCCCCGCTGGCGCGGGGAACTC >CP028702|5|2|3312990-3313569|PILER-CR CGGTTTATCCCCGCTGGCGCGGGGAACTC CCGGGGGATAATGTTTACGGTCATGCGCCCCC CGGTTTATCCCCGCTGGCGCGGGGAACTC TGGGCGGCTTGCCTTGCAGCCAGCTCCAGCAG CGGTTTATCCCCGCTGGCGCGGGGAACTC AAGCTGGCTGGCAATCTCTTTCGGGGTGAGTC CGGTTTATCCCCGCTGGCGCGGGGAACTC TAGTTTCCGTATCTCCGGATTTATAAAGCTGA CGGTTTATCCCCGCTGGCGCGGGGAACTC GCAGGCGGCGACGCGCAGGGTATGCGCGATTCG CGGTTTATCCCCGCTGGCGCGGGGAACTC GCGACCGCTCAGAAATTCCAGACCCGATCCAAA CGGTTTATCCCCGCTGGCGCGGGGAACTC TCAACATTATCAATTACAACCGACAGGGAGCC CGGTTTATCCCCGCTGGCGCGGGGAACTC AGCGTGTTCGGCATCACCTTTGGCTTCGGCTG CGGTTTATCCCCGCTGGCGCGGGGAACTC TGCGTGAGCGTATCGCCGCGCGTCTGCGAAAG CGGTTTATCCCCGCTGGCGCGGGGAACTC
>CP028702.1|AVZ50315.1|3311686_3312724_+|Zn-dependent-exopeptidase-M28 MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEIFDKAGIAVLSVEATNWNLGNKDGYQQRAKTPAFPAGNSWHDVRLDNHQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >CP028702.1|AVZ50314.1|3310526_3311435_-|sulfate-adenylyltransferase-subunit-2 MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >CP028702.1|AVZ50313.1|3309097_3310525_-|sulfate-adenylyltransferase MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEETFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMPWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >CP028702.1|AVZ50312.1|3308492_3309098_-|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >CP028702.1|AVZ50311.1|3308119_3308443_-|hypothetical-protein MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >CP028702.1|AVZ50310.1|3307614_3307926_-|cell-division-protein-FtsB MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR >CP028702.1|AVZ50309.1|3306885_3307596_-|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >CP028702.1|AVZ50308.1|3306406_3306886_-|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >CP028702.1|AVZ50307.1|3305360_3306410_-|tRNA-pseudouridine(13)-synthase-TruD MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >CP028702.1|AVZ50306.1|3304618_3305380_-|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVTCSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW >CP028702.1|AVZ51699.1|3313674_3313959_-|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLPV >CP028702.1|AVZ50316.1|3313960_3314878_-|subtype-I-E-CRISPR-associated-endonuclease-Cas1 MTWLPLNPIPLKDRVSMIFLQYGQIDVIDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVRLAAQVGTLLVWVGEAGVRVYASGQPGGARSDKLLYQAKLALDEDLRLKVVRKMFELRFGEPAPARRSVEQLRGIEGSRVRATYALLAKQYGVTWNGRRYDPKDWEKGDTINQCISAATSCLYGVTEAAILAAGYAPAIGFVHTGKPLSFVYDIADIIKFDTVVPKAFEIARRNPGEPDREVRLACRDIFRSSKTLAKLIPLIEDVLAAGEIQPPAPPEDAQPVAIPLPVSLGDAGHRSS >CP028702.1|AVZ50317.1|3314893_3315493_-|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL >CP028702.1|AVZ50318.1|3315479_3316154_-|type-I-E-CRISPR-associated-protein-Cas5/CasD MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ >CP028702.1|AVZ50319.1|3316156_3317248_-|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA >CP028702.1|AVZ50320.1|3317260_3317743_-|type-I-E-CRISPR-associated-protein-Cse2/CasB MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRLLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA >CP028702.1|AVZ50321.1|3317735_3319244_-|type-I-E-CRISPR-associated-protein-Cse1/CasA MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGGPSNG >CP028702.1|AVZ50322.1|3319658_3322325_-|CRISPR-associated-helicase/endonuclease-Cas3 MEPFKYICHYWGKSSKSLTKGNDIHLLIYHCLDVAAVADCWWDQSVVLQNTFCRNEMLSKQRVKAWLLFFIALHDIGKFDIRFQYKSAESWLKLNPATPSLNGPSTQMCRKFNHGAAGLYWFNQDSLSEQSLGDFFSFFDAAPHPYESWFPWVEAVTGHHGFILHSQDQDKSRWEMPASLASYAAQDKQAREEWISVLEALFLTPAGLSINDIPPDCSSLLAGFCSLADWLGSWTTTNTFLFNEDAPSDINALRTYFQDRQQDASRVLELSGLVSNKRCYEGVHALLDNGYQPRQLQVLVDALPVAPGLTVIEAPTGSGKTETALAYAWKLIDQQIADSVIFALPTQATANAMLTRMEASASHLFSSPNLILAHGNSRFNHLFQSIKSRAITEQGQEEAWVQCCQWLSQSNKKVFLGQIGVCTIDQVLISVLPVKHRFIRGLGIGRSVLIVDEVHAYDTYMNGLLEAVLKAQADVGGSVILLSATLPMKQKQKLLDTYGLHTDPVENNSAYPLINWRGVNGAQRFDLLAHPEQLPPRFSIQPEPICLADMLPDLTMLERMIAAANAGAQVCLICNLVDVAQVCYQRLKELNNTQVDIDLFHARFTLNDRREKENRVISNFGKNGKRNVGRILVATQVVEQSLDVDFDWLITQHCPADLLFQRLGRLHRHHRKYRPAGFEIPVATILLPDGEGYGRHEHIYSNVRVMWRTQQHIEELNGASLFFPDAYRQWLDSIYDDAEMDEPEWVGNGMDKFESAECEKRFKARKVLQWAEEYSLQDNDETILAVTRDGEMSLPLLPYVQTSSGKQLLDGQVYEDLSHEQQYEALALNRVNVPFTWKRSFSEVVDEDGLLWLEGKQNLDGWVWQGNSIVITYTGDEGMTRVIPANPK >CP028702.1|AVZ50323.1|3322683_3323418_-|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >CP028702.1|AVZ50324.1|3323492_3325205_-|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLTDAERMKHESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPARPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP028702_6 | 3339120-3339513 | Unclear |
I-E
Consensus repeat of CP028702_6
|
6 spacers
spacers of CP028702_6
>6.1|3339148|33|CP028702|PILER-CR,CRISPRCasFinder,CRT GACAGAACGGCCTCAGTAGTCTCGTCAGGCTCC >6.2|3339209|33|CP028702|PILER-CR,CRISPRCasFinder,CRT CTGTTTTCGCAAATCTATGGACTATTGCTATTC >6.3|3339270|33|CP028702|PILER-CR,CRISPRCasFinder,CRT GGGCGCACGGAATACAAAGCCGTGTATCTGCTC >6.4|3339331|33|CP028702|PILER-CR,CRISPRCasFinder,CRT TGGCTCTGCAACAGCAGCACCCATGACCACGTC >6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT GAAATGCTGGTGAGCGTTAATGCCGCAAACACA >6.6|3339453|33|CP028702|PILER-CR,CRISPRCasFinder,CRT ATTACGCCTTTTTGCGATTGCCCGGTTTTTGCC |
CRISPR arrays and Neighbor proteins around CP028702_6
The CRISPR arrays of CP028702_6 >merge|CP028702|6|3339120-3339513|PILER-CR,CRISPRCasFinder,CRT GGTTTATCCCCGCTGGCGCGGGGAACTCGACAGAACGGCCTCAGTAGTCTCGTCAGGCTCCGGTTTATCCCCGCTGGCGCGGGGAACACCTGTTTTCGCAAATCTATGGACTATTGCTATTCGGTTTATCCCCGCTGGCGCGGGGAACACGGGCGCACGGAATACAAAGCCGTGTATCTGCTCGGTTTATCCCCGCTGGCGCGGGGAACACTGGCTCTGCAACAGCAGCACCCATGACCACGTCGGTTTATCCCCGCTGGCGCGGGGAACACGAAATGCTGGTGAGCGTTAATGCCGCAAACACAGGTTTATCCCCGCTGGCGCGGGGAACACATTACGCCTTTTTGCGATTGCCCGGTTTTTGCCGGTTTATCCCCGCTGGCGCGGGGAACAC >CP028702|6|3|3339120-3339513|PILER-CR GGTTTATCCCCGCTGGCGCGGGGAACTC GACAGAACGGCCTCAGTAGTCTCGTCAGGCTCC GGTTTATCCCCGCTGGCGCGGGGAACAC CTGTTTTCGCAAATCTATGGACTATTGCTATTC GGTTTATCCCCGCTGGCGCGGGGAACAC GGGCGCACGGAATACAAAGCCGTGTATCTGCTC GGTTTATCCCCGCTGGCGCGGGGAACAC TGGCTCTGCAACAGCAGCACCCATGACCACGTC GGTTTATCCCCGCTGGCGCGGGGAACAC GAAATGCTGGTGAGCGTTAATGCCGCAAACACA GGTTTATCCCCGCTGGCGCGGGGAACAC ATTACGCCTTTTTGCGATTGCCCGGTTTTTGCC GGTTTATCCCCGCTGGCGCGGGGAACAC >CP028702|6|5|3339120-3339513|CRISPRCasFinder GGTTTATCCCCGCTGGCGCGGGGAACTC GACAGAACGGCCTCAGTAGTCTCGTCAGGCTCC GGTTTATCCCCGCTGGCGCGGGGAACAC CTGTTTTCGCAAATCTATGGACTATTGCTATTC GGTTTATCCCCGCTGGCGCGGGGAACAC GGGCGCACGGAATACAAAGCCGTGTATCTGCTC GGTTTATCCCCGCTGGCGCGGGGAACAC TGGCTCTGCAACAGCAGCACCCATGACCACGTC GGTTTATCCCCGCTGGCGCGGGGAACAC GAAATGCTGGTGAGCGTTAATGCCGCAAACACA GGTTTATCCCCGCTGGCGCGGGGAACAC ATTACGCCTTTTTGCGATTGCCCGGTTTTTGCC GGTTTATCCCCGCTGGCGCGGGGAACAC >CP028702|6|2|3339120-3339513|CRT GGTTTATCCCCGCTGGCGCGGGGAACTC GACAGAACGGCCTCAGTAGTCTCGTCAGGCTCC GGTTTATCCCCGCTGGCGCGGGGAACAC CTGTTTTCGCAAATCTATGGACTATTGCTATTC GGTTTATCCCCGCTGGCGCGGGGAACAC GGGCGCACGGAATACAAAGCCGTGTATCTGCTC GGTTTATCCCCGCTGGCGCGGGGAACAC TGGCTCTGCAACAGCAGCACCCATGACCACGTC GGTTTATCCCCGCTGGCGCGGGGAACAC GAAATGCTGGTGAGCGTTAATGCCGCAAACACA GGTTTATCCCCGCTGGCGCGGGGAACAC ATTACGCCTTTTTGCGATTGCCCGGTTTTTGCC GGTTTATCCCCGCTGGCGCGGGGAACAC
>CP028702.1|AVZ50336.1|3337001_3338480_+|sugar-kinase MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDARAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNHFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDNMVRVKDIFIPIESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNVDSIQSWSNA >CP028702.1|AVZ50335.1|3335697_3336975_+|MFS-transporter MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQIAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSVLGLLTLTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFTFLLFQKIRTADSAPAMASSK >CP028702.1|AVZ50334.1|3334593_3335379_-|oxidoreductase MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVGITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASPASNYVNGHLLVVDGGYLVR >CP028702.1|AVZ50333.1|3333069_3334524_-|FAD-binding-oxidoreductase MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREIMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKVTGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >CP028702.1|AVZ50332.1|3331638_3332976_-|MFS-transporter MNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >CP028702.1|AVZ50331.1|3330881_3331661_-|electron-transfer-flavoprotein MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWQDYLRQRMQP >CP028702.1|AVZ50330.1|3330024_3330885_-|electron-transfer-flavoprotein MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLNIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >CP028702.1|AVZ50329.1|3329301_3329877_+|glycerol-3-phosphate-responsive-antiterminator MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >CP028702.1|AVZ50328.1|3329024_3329285_+|ferredoxin-like-protein-YgcO MSVARNLWRVADAPHIVPADSVERQTAERLINACPAGLFSLTPEGNLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >CP028702.1|AVZ50327.1|3327762_3329034_+|electron-transfer-flavoprotein-quinone-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERRITHESLSLLTPDGVTTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGRICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL >CP028702.1|AVZ50337.1|3339852_3340524_-|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVIGRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >CP028702.1|AVZ50338.1|3340662_3340803_+|hypothetical-protein MSEENKENGFNHVKTFTKIIFIFSVLVFNDNEYKITDAAVNLFIQI >CP028702.1|AVZ50339.1|3340816_3341689_+|TPM-domain-protein-phosphatase MRYFILMFTFVCSFVAAQPTIVPQLQQQVTDLTSSLNSQEKKELTHKLESIFNNTQVQIAVLIVPTTKDETIEQYATRVFDNWRLGDAKRNDGILIVVAWSDRTVRIQVGYGLEEKVTDALAGDIIRSNMIPAFKQQKLAKGLELAINALNNQLTSQHQYPTNPSESESASSSDHYYFAIFWVFAVMFFPFWFFHQGSNFCRACKSGVCISAIYLLDLFLFSDKIFSIAVFSFFFTFTIFMVFTCLCVLQKRASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASGRW >CP028702.1|AVZ50340.1|3341748_3343047_-|enolase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >CP028702.1|AVZ50341.1|3343134_3344772_-|CTP-synthetase MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >CP028702.1|AVZ50342.1|3344999_3345791_-|nucleoside-triphosphate-pyrophosphohydrolase MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >CP028702.1|AVZ50343.1|3345861_3346197_-|mRNA-interferase-MazF MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >CP028702.1|AVZ50344.1|3346196_3346445_-|antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >CP028702.1|AVZ50345.1|3346522_3348757_-|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >CP028702.1|AVZ50346.1|3348804_3350106_-|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHDVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILAPQLEALLPKVRACLGSLQAMRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP028702_1 | 1.2|438643|42|CP028702|PILER-CR | 438643-438684 | 42 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 141085-141126 | 0 | 1.0 |
CP028702_6 | 6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT | 3339392-3339424 | 33 | NZ_LR134258 | Klebsiella aerogenes strain NCTC9644 plasmid 5, complete sequence | 3574-3606 | 4 | 0.879 |
CP028702_6 | 6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT | 3339392-3339424 | 33 | LR134281 | Klebsiella aerogenes strain NCTC9793 genome assembly, plasmid: 6 | 3567-3599 | 4 | 0.879 |
CP028702_6 | 6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT | 3339392-3339424 | 33 | KY271401 | Klebsiella phage 1 LV-2017, complete genome | 21043-21075 | 4 | 0.879 |
CP028702_5 | 5.5|3313080|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313080-3313111 | 32 | NC_021229 | Arthrobacter nicotinovorans pAO1 megaplasmid sequence, strain ATCC 49919 | 65474-65505 | 5 | 0.844 |
CP028702_5 | 5.5|3313080|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313080-3313111 | 32 | NZ_CP017422 | Arthrobacter sp. ZXY-2 plasmid pZXY21, complete sequence | 208287-208318 | 6 | 0.812 |
CP028702_5 | 5.11|3313448|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313448-3313479 | 32 | KY883647 | Vibrio phage JSF33, complete genome | 9760-9791 | 6 | 0.812 |
CP028702_5 | 5.12|3313509|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313509-3313540 | 32 | NZ_CP009293 | Novosphingobium pentaromativorans US6-1 plasmid pLA4, complete sequence | 152196-152227 | 6 | 0.812 |
CP028702_6 | 6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT | 3339392-3339424 | 33 | KY653119 | Morganella phage IME1369_02, complete genome | 18216-18248 | 6 | 0.818 |
CP028702_5 | 5.1|3312836|32|CP028702|CRISPRCasFinder,CRT | 3312836-3312867 | 32 | NZ_AP018516 | Acetobacter orientalis strain FAN1 plasmid pAOF1, complete sequence | 48296-48327 | 8 | 0.75 |
CP028702_5 | 5.5|3313080|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313080-3313111 | 32 | MK113951 | Phage 5P_3, complete genome | 11967-11998 | 8 | 0.75 |
CP028702_5 | 5.5|3313080|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313080-3313111 | 32 | AP017924 | Ralstonia phage RP12 DNA, complete genome | 11643-11674 | 8 | 0.75 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NC_013856 | Azospirillum sp. B510 plasmid pAB510b, complete sequence | 375744-375776 | 8 | 0.758 |
CP028702_5 | 5.11|3313448|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313448-3313479 | 32 | MN855762 | Bacteriophage sp. isolate 505, complete genome | 4840-4871 | 8 | 0.75 |
CP028702_5 | 5.11|3313448|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313448-3313479 | 32 | NC_020548 | Azoarcus sp. KH32C plasmid pAZKH, complete sequence | 224460-224491 | 8 | 0.75 |
CP028702_5 | 5.12|3313509|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313509-3313540 | 32 | NZ_CP007130 | Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence | 750410-750441 | 8 | 0.75 |
CP028702_6 | 6.4|3339331|33|CP028702|PILER-CR,CRISPRCasFinder,CRT | 3339331-3339363 | 33 | NZ_CP007129 | Gemmatirosa kalamazoonesis strain KBS708 plasmid 1, complete sequence | 755172-755204 | 8 | 0.758 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 229083-229131 | 9 | 0.816 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 229184-229232 | 9 | 0.816 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 229285-229333 | 9 | 0.816 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 239577-239625 | 9 | 0.816 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 239678-239726 | 9 | 0.816 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 239779-239827 | 9 | 0.816 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 230489-230537 | 9 | 0.816 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 230590-230638 | 9 | 0.816 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 230691-230739 | 9 | 0.816 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 211459-211507 | 9 | 0.816 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 211560-211608 | 9 | 0.816 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 211661-211709 | 9 | 0.816 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NZ_CP010957 | Sphingobium sp. YBL2 plasmid 3pYBL2-3, complete sequence | 26182-26214 | 9 | 0.727 |
CP028702_5 | 5.11|3313448|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313448-3313479 | 32 | NZ_CP015585 | Roseomonas gilardii strain U14-5 plasmid 1, complete sequence | 104261-104292 | 9 | 0.719 |
CP028702_5 | 5.11|3313448|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313448-3313479 | 32 | NZ_CP054618 | Azospirillum oryzae strain KACC 14407 plasmid unnamed4, complete sequence | 142898-142929 | 9 | 0.719 |
CP028702_5 | 5.12|3313509|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313509-3313540 | 32 | MN234174 | Mycobacterium phage Efra2, complete genome | 35614-35645 | 9 | 0.719 |
CP028702_5 | 5.12|3313509|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313509-3313540 | 32 | MN234165 | Mycobacterium phage Yunkel11, complete genome | 35570-35601 | 9 | 0.719 |
CP028702_5 | 5.12|3313509|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313509-3313540 | 32 | MN234201 | Mycobacterium phage Guanica15, complete genome | 35571-35602 | 9 | 0.719 |
CP028702_2 | 2.1|792948|59|CP028702|CRISPRCasFinder | 792948-793006 | 59 | MT230312 | Escherichia coli strain DH5alpha plasmid pESBL31, complete sequence | 97-155 | 10 | 0.831 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 229386-229434 | 10 | 0.796 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 239880-239928 | 10 | 0.796 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 230792-230840 | 10 | 0.796 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 211762-211810 | 10 | 0.796 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP044147 | Escherichia coli O157 strain AR-0428 plasmid pAR-0428-2 | 7442-7490 | 10 | 0.796 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | CP044351 | Escherichia coli strain 194195 plasmid p194195_1, complete sequence | 84266-84314 | 10 | 0.796 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 386508-386556 | 10 | 0.796 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 14196-14244 | 10 | 0.796 |
CP028702_5 | 5.5|3313080|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313080-3313111 | 32 | NC_002580 | Propionibacterium freudenreichii plasmid p545, complete sequence | 2898-2929 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | NZ_CP028970 | Aminobacter sp. MSH1 plasmid pUSP2, complete sequence | 156123-156154 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | NZ_CP053984 | Achromobacter pestifer strain FDAARGOS_790 plasmid unnamed, complete sequence | 21888-21919 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | NC_010935 | Comamonas testosteroni CNB-1 plasmid pCNB, complete sequence | 28766-28797 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | JX469826 | Uncultured bacterium plasmid pB12, complete sequence | 11283-11314 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | JN106171 | Uncultured bacterium plasmid pAKD26, complete sequence | 11289-11320 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | NC_016968 | Comamonas testosteroni plasmid pTB30, complete sequence | 11287-11318 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | NC_016978 | Comamonas testosteroni plasmid pI2, complete sequence | 11272-11303 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | NZ_CP017760 | Cupriavidus necator strain NH9 plasmid pENH91, complete sequence | 67078-67109 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | NZ_CP053554 | Diaphorobacter sp. JS3050 plasmid pDCNB, complete sequence | 4235-4266 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | NC_019263 | Delftia acidovorans plasmid pLME1, complete sequence | 11288-11319 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | NC_019264 | Delftia acidovorans plasmid pNB8c, complete sequence | 11288-11319 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | NC_019283 | Delftia acidovorans plasmid pC1-1, complete sequence | 11288-11319 | 10 | 0.688 |
CP028702_5 | 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313141-3313172 | 32 | NC_006830 | Achromobacter xylosoxidans A8 plasmid pA81, complete sequence | 11350-11381 | 10 | 0.688 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | CP046443 | Pseudomonas coronafaciens pv. coronafaciens strain B19001 plasmid unnamed2, complete sequence | 31933-31965 | 10 | 0.697 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NZ_LT963392 | Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP1, complete sequence | 103013-103045 | 10 | 0.697 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NZ_LT963392 | Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP1, complete sequence | 110510-110542 | 10 | 0.697 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NZ_CP034079 | Pseudomonas syringae pv. pisi str. PP1 plasmid pPP1-1, complete sequence | 48454-48486 | 10 | 0.697 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NZ_CP034080 | Pseudomonas syringae pv. pisi str. PP1 plasmid pPP1-2, complete sequence | 39480-39512 | 10 | 0.697 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NC_005918 | Pseudomonas syringae pv. maculicola strain ES4326 plasmid pPMA4326A, complete sequence | 31117-31149 | 10 | 0.697 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NZ_CP047262 | Pseudomonas syringae pv. maculicola str. ES4326 plasmid pPma4326A, complete sequence | 30966-30998 | 10 | 0.697 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NZ_CP026560 | Pseudomonas amygdali pv. morsprunorum strain R15244 plasmid p3_tig5, complete sequence | 19118-19150 | 10 | 0.697 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NZ_LT963406 | Pseudomonas syringae pv. avii isolate CFBP3846 plasmid PP4, complete sequence | 54820-54852 | 10 | 0.697 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | LT985193 | Pseudomonas syringae strain CFBP 2116 genome assembly, plasmid: PP2 | 32077-32109 | 10 | 0.697 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NZ_LT963393 | Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP2, complete sequence | 50597-50629 | 10 | 0.697 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NZ_LT985210 | Pseudomonas syringae pv. cerasicola strain CFBP 6110 plasmid PP1, complete sequence | 105842-105874 | 10 | 0.697 |
CP028702_5 | 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313263-3313295 | 33 | NZ_LT985211 | Pseudomonas syringae pv. cerasicola strain CFBP 6110 plasmid PP2, complete sequence | 84272-84304 | 10 | 0.697 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052797 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N18S2039 plasmid pN18S2039, complete sequence | 45808-45839 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052795 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0125 plasmid pN19S0125, complete sequence | 282589-282620 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | NZ_CP047882 | Salmonella enterica subsp. enterica serovar Infantis strain 119944 plasmid pESI, complete sequence | 94965-94996 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052804 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S973 plasmid pN17S0973, complete sequence | 304288-304319 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP038508 | Salmonella enterica subsp. enterica serovar Infantis strain FARPER-219 plasmid p-F219, complete sequence | 112376-112407 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052802 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S976 plasmid pN17S0976, complete sequence | 315682-315713 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052788 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0611 plasmid pN19S0611, complete sequence | 203378-203409 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052840 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S024 plasmid pN16S024, complete sequence | 127648-127679 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052786 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0641 plasmid pN19S0641, complete sequence | 215302-215333 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052838 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S097 plasmid pN16S097, complete sequence | 214483-214514 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | NZ_CP028316 | Salmonella enterica subsp. enterica serovar Typhimurium var. 5- strain CFSAN067217 plasmid pSC-31-2, complete sequence | 108893-108924 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP051676 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1234 plasmid pN16S1234, complete sequence | 83669-83700 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052783 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0679 plasmid pN19S0679-1, complete sequence | 194119-194150 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052836 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S103 plasmid pN16S103, complete sequence | 18410-18441 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | NZ_CP022063 | Salmonella enterica strain FDAARGOS_312 plasmid unnamed3, complete sequence | 64615-64646 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052781 | Salmonella enterica strain CVM N19S0949 plasmid pN19S0949, complete sequence | 169480-169511 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052834 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S041 plasmid pN17S0041, complete sequence | 6457-6488 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052793 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0388 plasmid pN19S0388, complete sequence | 25758-25789 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052779 | Salmonella enterica strain 19TN07GT06K-S plasmid pN19S1233, complete sequence | 140403-140434 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052832 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1040 plasmid pN17S1040, complete sequence | 160727-160758 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | NZ_CP031362 | Salmonella enterica subsp. enterica serovar Heidelberg strain 5 plasmid p3, complete sequence | 140152-140183 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052830 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1105 plasmid pN17S1105, complete sequence | 193709-193740 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052828 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1126 plasmid pN17S1126, complete sequence | 126974-127005 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052826 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1245 plasmid pN17S0637, complete sequence | 110984-111015 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | NZ_CP016409 | Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502916 plasmid pFSIS1502916, complete sequence | 94916-94947 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052824 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1265 plasmid pN17S1265, complete sequence | 91497-91528 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052822 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1349 plasmid pN17S1349, complete sequence | 110984-111015 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | NZ_CP016407 | Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502169 plasmid pFSIS1502169, complete sequence | 94916-94947 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052820 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1442 plasmid pN17S1442, complete sequence | 94916-94947 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | NZ_CP016413 | Salmonella enterica subsp. enterica serovar Infantis strain CVM44454 plasmid pCVM44454, complete sequence | 94916-94947 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | NZ_CP016411 | Salmonella enterica subsp. enterica serovar Infantis strain N55391 plasmid pN55391, complete sequence | 94916-94947 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052816 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1598 plasmid pN17S1598 | 165317-165348 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052814 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S349 plasmid pN17S0349, complete sequence | 99109-99140 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | NZ_CP022662 | Salmonella enterica subsp. enterica strain RM11065 plasmid pRM11065-2, complete sequence | 54379-54410 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052812 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S376 plasmid pN17S0376, complete sequence | 1671-1702 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052810 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S535 plasmid pN17S0535, complete sequence | 212751-212782 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052808 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S637 plasmid pN17S0637, complete sequence | 306376-306407 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052806 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S816 plasmid pN17S0816, complete sequence | 164579-164610 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052791 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0552 plasmid pN17S0637, complete sequence | 168074-168105 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052818 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1509 plasmid pN17S1509, complete sequence | 190524-190555 | 10 | 0.688 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | CP052799 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S990 plasmid pN17S0990-1, complete sequence | 6457-6488 | 10 | 0.688 |
CP028702_2 | 2.1|792948|59|CP028702|CRISPRCasFinder | 792948-793006 | 59 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 40375-40433 | 11 | 0.814 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 194003-194051 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 194096-194144 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 194282-194330 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 204497-204545 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 204590-204638 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 204776-204824 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 195331-195379 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 195424-195472 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 195610-195658 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 176286-176334 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 176379-176427 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 176565-176613 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 27282-27330 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 27381-27429 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 51834-51882 | 11 | 0.776 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_AP023209 | Escherichia coli strain TUM18781 plasmid pMTY18781-4, complete sequence | 19389-19437 | 11 | 0.776 |
CP028702_5 | 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR | 3313387-3313418 | 32 | NZ_CP026128 | Acinetobacter baumannii strain ABNIH28 plasmid pABA-1fe1, complete sequence | 49165-49196 | 11 | 0.656 |
CP028702_6 | 6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT | 3339392-3339424 | 33 | MF158039 | Shigella phage Sf12, complete genome | 4974-5006 | 11 | 0.667 |
CP028702_6 | 6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT | 3339392-3339424 | 33 | MF158042 | Shigella phage Sd1, complete genome | 937-969 | 11 | 0.667 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 211730-211778 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 211930-211978 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 194189-194237 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 222224-222272 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 222424-222472 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 204683-204731 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 213058-213106 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 213258-213306 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 195517-195565 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 194013-194061 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 194213-194261 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 176472-176520 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 11511-11559 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 397088-397136 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 404133-404181 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_AP023207 | Escherichia coli strain TUM18781 plasmid pMTY18781-2, complete sequence | 27994-28042 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_AP023207 | Escherichia coli strain TUM18781 plasmid pMTY18781-2, complete sequence | 31299-31347 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | MG065691 | UNVERIFIED: Campylobacter phage A11a, complete genome | 33059-33107 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | MG065686 | UNVERIFIED: Campylobacter phage A18a, complete genome | 141016-141064 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | MF374379 | Escherichia phage DN1, complete genome | 31158-31206 | 12 | 0.755 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 14105-14153 | 13 | 0.735 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_AP023208 | Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence | 6859-6907 | 13 | 0.735 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NC_049343 | Escherichia phage 500465-2, complete genome | 31797-31845 | 13 | 0.735 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | MT230112 | Escherichia coli strain DH5alpha plasmid pESBL112, complete sequence | 94-142 | 13 | 0.735 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 82842-82890 | 14 | 0.714 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 313592-313640 | 14 | 0.714 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | MG065691 | UNVERIFIED: Campylobacter phage A11a, complete genome | 63889-63937 | 14 | 0.714 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | MG065686 | UNVERIFIED: Campylobacter phage A18a, complete genome | 110186-110234 | 14 | 0.714 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP053721 | Escherichia coli strain CP131_Sichuan plasmid pCP131-IncHI1, complete sequence | 198909-198957 | 14 | 0.714 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP019246 | Escherichia coli strain Combat13F7 plasmid pCombat13F7-1, complete sequence | 40305-40353 | 14 | 0.714 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP044299 | Escherichia coli strain P59A plasmid pP59A-CTX-M-55, complete sequence | 91418-91466 | 14 | 0.714 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 102544-102592 | 15 | 0.694 |
CP028702_4 | 4.1|2781128|49|CP028702|CRISPRCasFinder | 2781128-2781176 | 49 | NZ_CP044308 | Escherichia coli strain C27A plasmid pC27A-3, complete sequence | 56031-56079 | 15 | 0.694 |
1. spacer 1.2|438643|42|CP028702|PILER-CR matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 0, identity: 1.0
gaacttaacaatattgaaagttggatttatctgcgtgtgaca CRISPR spacer gaacttaacaatattgaaagttggatttatctgcgtgtgaca Protospacer ******************************************
2. spacer 6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LR134258 (Klebsiella aerogenes strain NCTC9644 plasmid 5, complete sequence) position: , mismatch: 4, identity: 0.879
gaaatgctggtgagcgttaatgccgcaaacaca CRISPR spacer gaaatgctggtgagcgttaacgccgcgaacccc Protospacer ********************.*****.*** *
3. spacer 6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT matches to LR134281 (Klebsiella aerogenes strain NCTC9793 genome assembly, plasmid: 6) position: , mismatch: 4, identity: 0.879
gaaatgctggtgagcgttaatgccgcaaacaca CRISPR spacer gaaatgctggtgagcgttaacgccgcgaacccc Protospacer ********************.*****.*** *
4. spacer 6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT matches to KY271401 (Klebsiella phage 1 LV-2017, complete genome) position: , mismatch: 4, identity: 0.879
gaaatgctggtgagcgttaatgccgcaaacaca CRISPR spacer gaaatgctggtgagcgttaacgccgcgaacccc Protospacer ********************.*****.*** *
5. spacer 5.5|3313080|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NC_021229 (Arthrobacter nicotinovorans pAO1 megaplasmid sequence, strain ATCC 49919) position: , mismatch: 5, identity: 0.844
tgggcggcttgccttgcagccagctccagcag- CRISPR spacer tgggcggcttgcgttgcagcctgc-cgagcgga Protospacer ************ ******** ** * ***.*
6. spacer 5.5|3313080|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP017422 (Arthrobacter sp. ZXY-2 plasmid pZXY21, complete sequence) position: , mismatch: 6, identity: 0.812
tgggcggcttgccttgcagccagctccagcag- CRISPR spacer ggggcggcttgcgttgcagcctgc-cgagcgga Protospacer *********** ******** ** * ***.*
7. spacer 5.11|3313448|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to KY883647 (Vibrio phage JSF33, complete genome) position: , mismatch: 6, identity: 0.812
agcgtgttcggcatcacctttggcttcggctg CRISPR spacer agcagtttcggcatcagctttggctttggctt Protospacer ***. ********** *********.****
8. spacer 5.12|3313509|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP009293 (Novosphingobium pentaromativorans US6-1 plasmid pLA4, complete sequence) position: , mismatch: 6, identity: 0.812
tgcgtgagcgtatcgccgcgcgtctgcgaaag CRISPR spacer agaatgagcgtgtcgccgcgcgtctgcgtgag Protospacer * .*******.**************** .**
9. spacer 6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT matches to KY653119 (Morganella phage IME1369_02, complete genome) position: , mismatch: 6, identity: 0.818
gaaatgctggtgagcgttaatgccgcaaacaca CRISPR spacer gaaatgctggtcagcgttaacgccgcacaacct Protospacer *********** ********.****** * *
10. spacer 5.1|3312836|32|CP028702|CRISPRCasFinder,CRT matches to NZ_AP018516 (Acetobacter orientalis strain FAN1 plasmid pAOF1, complete sequence) position: , mismatch: 8, identity: 0.75
cagcgtcaggcgtgaaatctcaccgtcgttgc CRISPR spacer attctttaggcgtgacatcttaccgtcgttga Protospacer * *.******** ****.**********
11. spacer 5.5|3313080|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to MK113951 (Phage 5P_3, complete genome) position: , mismatch: 8, identity: 0.75
tgggcggcttgccttgcagccagctccagcag CRISPR spacer ggggcagcttgccttgcagccagccgatgctc Protospacer ****.******************. **
12. spacer 5.5|3313080|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to AP017924 (Ralstonia phage RP12 DNA, complete genome) position: , mismatch: 8, identity: 0.75
tgggcggcttgccttgcagccagctccagcag CRISPR spacer tgggccgcttgccgtgcagccagcgcttccgc Protospacer ***** ******* ********** *. *.
13. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NC_013856 (Azospirillum sp. B510 plasmid pAB510b, complete sequence) position: , mismatch: 8, identity: 0.758
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer cgcgtcggcgacgcgcaggtaatgcgcgatcag Protospacer * ************** *********. *
14. spacer 5.11|3313448|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to MN855762 (Bacteriophage sp. isolate 505, complete genome) position: , mismatch: 8, identity: 0.75
agcgtgttcggcatcacctttggcttcggctg CRISPR spacer gaccagctcgaaatcacctttggcttcggctt Protospacer ..* *.***. *******************
15. spacer 5.11|3313448|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NC_020548 (Azoarcus sp. KH32C plasmid pAZKH, complete sequence) position: , mismatch: 8, identity: 0.75
agcgtgtt---cggcatcacctttggcttcggctg CRISPR spacer ---ctgctcgccggcatcaccttcggcttctgcta Protospacer **.* ************.****** ***.
16. spacer 5.12|3313509|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP007130 (Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence) position: , mismatch: 8, identity: 0.75
tgcgtgagcgtatcgccgcgcgtctgcgaaag- CRISPR spacer agcgagagcgtatcgccgcgc-ttcgtgaagcc Protospacer *** **************** *..*.***.
17. spacer 6.4|3339331|33|CP028702|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP007129 (Gemmatirosa kalamazoonesis strain KBS708 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.758
tggctctgcaacagcagcacccatgaccacgtc CRISPR spacer cgctccagcaacagcagcacccacgaccacgga Protospacer .* ..* ****************.*******
18. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 9, identity: 0.816
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtgaacgccttatccggcctacggatggcg Protospacer . **. **********.************************** * * .
19. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 9, identity: 0.816
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtgaacgccttatccggcctacggatggcg Protospacer . **. **********.************************** * * .
20. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 9, identity: 0.816
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtgaacgccttatccggcctacgggtggcg Protospacer . **. **********.************************** * * .
21. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 9, identity: 0.816
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtgaacgccttatccggcctacggatggcg Protospacer . **. **********.************************** * * .
22. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 9, identity: 0.816
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtgaacgccttatccggcctacggatggcg Protospacer . **. **********.************************** * * .
23. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 9, identity: 0.816
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtgaacgccttatccggcctacgggtggcg Protospacer . **. **********.************************** * * .
24. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 9, identity: 0.816
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtgaacgccttatccggcctacggatggcg Protospacer . **. **********.************************** * * .
25. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 9, identity: 0.816
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtgaacgccttatccggcctacggatggcg Protospacer . **. **********.************************** * * .
26. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 9, identity: 0.816
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtgaacgccttatccggcctacgggtggcg Protospacer . **. **********.************************** * * .
27. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 9, identity: 0.816
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtgaacgccttatccggcctacggatggcg Protospacer . **. **********.************************** * * .
28. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 9, identity: 0.816
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtgaacgccttatccggcctacggatggcg Protospacer . **. **********.************************** * * .
29. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 9, identity: 0.816
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtgaacgccttatccggcctacgggtggcg Protospacer . **. **********.************************** * * .
30. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP010957 (Sphingobium sp. YBL2 plasmid 3pYBL2-3, complete sequence) position: , mismatch: 9, identity: 0.727
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer cgaggcggcgacacgcaaggtatgcgggtcgag Protospacer **********.****.******** * . *
31. spacer 5.11|3313448|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP015585 (Roseomonas gilardii strain U14-5 plasmid 1, complete sequence) position: , mismatch: 9, identity: 0.719
agcgtgttcggcatcacctttggcttcggctg CRISPR spacer atccgcacgggcatcacctttggctccagctg Protospacer * * . ****************.*.****
32. spacer 5.11|3313448|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP054618 (Azospirillum oryzae strain KACC 14407 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.719
agcgtgttcggcatcacctttggcttcggctg CRISPR spacer ctcggcctcggcaacacctttgccttcggcgc Protospacer ** .****** ******** *******
33. spacer 5.12|3313509|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to MN234174 (Mycobacterium phage Efra2, complete genome) position: , mismatch: 9, identity: 0.719
tgcgtgagcgtatcgccgcgcgtctgcgaaag CRISPR spacer gccgtgagcgtgacgccgcgcgtctggtgatc Protospacer *********. ************* .*
34. spacer 5.12|3313509|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to MN234165 (Mycobacterium phage Yunkel11, complete genome) position: , mismatch: 9, identity: 0.719
tgcgtgagcgtatcgccgcgcgtctgcgaaag CRISPR spacer gccgtgagcgtgacgccgcgcgtctggtgatc Protospacer *********. ************* .*
35. spacer 5.12|3313509|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to MN234201 (Mycobacterium phage Guanica15, complete genome) position: , mismatch: 9, identity: 0.719
tgcgtgagcgtatcgccgcgcgtctgcgaaag CRISPR spacer gccgtgagcgtgacgccgcgcgtctggtgatc Protospacer *********. ************* .*
36. spacer 2.1|792948|59|CP028702|CRISPRCasFinder matches to MT230312 (Escherichia coli strain DH5alpha plasmid pESBL31, complete sequence) position: , mismatch: 10, identity: 0.831
ggtgccagaaccgtaggccggataaggcgttcacgccgcatccggcaataagtgctccg- CRISPR spacer gagcacagaaccgtaggacggataaggcgttcacgccgcatccggcgat-cgtgcactga Protospacer *. ************ ****************************.** **** *.*
37. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 10, identity: 0.796
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtaaacgccttatccggcctacggatggcg Protospacer . **. **********.****.********************* * * .
38. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 10, identity: 0.796
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtaaacgccttatccggcctacggatggcg Protospacer . **. **********.****.********************* * * .
39. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 10, identity: 0.796
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtaaacgccttatccggcctacggatggcg Protospacer . **. **********.****.********************* * * .
40. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 10, identity: 0.796
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacgactgccggatgcggcgtaaacgccttatccggcctacggatggcg Protospacer . **. **********.****.********************* * * .
41. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP044147 (Escherichia coli O157 strain AR-0428 plasmid pAR-0428-2) position: , mismatch: 10, identity: 0.796
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacaattgccggatgcggcgtgaacgccttatccggcctacggttgagt Protospacer . *.. **********.**************************.* .*
42. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to CP044351 (Escherichia coli strain 194195 plasmid p194195_1, complete sequence) position: , mismatch: 10, identity: 0.796
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacaattgccggatgcggcgtgaacgccttatccggcctacggttgagt Protospacer . *.. **********.**************************.* .*
43. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 10, identity: 0.796
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga-- CRISPR spacer cacaagtgccggatgcggcgtaaacgccttatccggcctacg--ccagact Protospacer . *..***********.****.******************** .*.**
44. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 10, identity: 0.796
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer gctggttgccggatgcggcgtgaacgccttatccggcctacattcggca Protospacer *.** **********.************************. .. * *
45. spacer 5.5|3313080|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NC_002580 (Propionibacterium freudenreichii plasmid p545, complete sequence) position: , mismatch: 10, identity: 0.688
tgggcggcttgccttgcagccagctccagcag CRISPR spacer ccagcggcttgcgtggcagccagctctcaggg Protospacer . .********* * ***********. . .*
46. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP028970 (Aminobacter sp. MSH1 plasmid pUSP2, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer gcgtgtgctggcaatcgcttccggggtgacgt Protospacer . *. ********** ***.******** .
47. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP053984 (Achromobacter pestifer strain FDAARGOS_790 plasmid unnamed, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
48. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NC_010935 (Comamonas testosteroni CNB-1 plasmid pCNB, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
49. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to JX469826 (Uncultured bacterium plasmid pB12, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
50. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to JN106171 (Uncultured bacterium plasmid pAKD26, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
51. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NC_016968 (Comamonas testosteroni plasmid pTB30, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
52. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NC_016978 (Comamonas testosteroni plasmid pI2, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
53. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP017760 (Cupriavidus necator strain NH9 plasmid pENH91, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
54. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP053554 (Diaphorobacter sp. JS3050 plasmid pDCNB, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
55. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NC_019263 (Delftia acidovorans plasmid pLME1, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
56. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NC_019264 (Delftia acidovorans plasmid pNB8c, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
57. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NC_019283 (Delftia acidovorans plasmid pC1-1, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
58. spacer 5.6|3313141|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NC_006830 (Achromobacter xylosoxidans A8 plasmid pA81, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
59. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP046443 (Pseudomonas coronafaciens pv. coronafaciens strain B19001 plasmid unnamed2, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
60. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LT963392 (Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP1, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
61. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LT963392 (Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP1, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
62. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP034079 (Pseudomonas syringae pv. pisi str. PP1 plasmid pPP1-1, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
63. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP034080 (Pseudomonas syringae pv. pisi str. PP1 plasmid pPP1-2, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
64. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NC_005918 (Pseudomonas syringae pv. maculicola strain ES4326 plasmid pPMA4326A, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
65. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP047262 (Pseudomonas syringae pv. maculicola str. ES4326 plasmid pPma4326A, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
66. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP026560 (Pseudomonas amygdali pv. morsprunorum strain R15244 plasmid p3_tig5, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
67. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LT963406 (Pseudomonas syringae pv. avii isolate CFBP3846 plasmid PP4, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
68. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to LT985193 (Pseudomonas syringae strain CFBP 2116 genome assembly, plasmid: PP2) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
69. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LT963393 (Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP2, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
70. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LT985210 (Pseudomonas syringae pv. cerasicola strain CFBP 6110 plasmid PP1, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
71. spacer 5.8|3313263|33|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LT985211 (Pseudomonas syringae pv. cerasicola strain CFBP 6110 plasmid PP2, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
72. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052797 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N18S2039 plasmid pN18S2039, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
73. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052795 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0125 plasmid pN19S0125, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
74. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP047882 (Salmonella enterica subsp. enterica serovar Infantis strain 119944 plasmid pESI, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
75. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052804 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S973 plasmid pN17S0973, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
76. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP038508 (Salmonella enterica subsp. enterica serovar Infantis strain FARPER-219 plasmid p-F219, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
77. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052802 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S976 plasmid pN17S0976, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
78. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052788 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0611 plasmid pN19S0611, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
79. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052840 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S024 plasmid pN16S024, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
80. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052786 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0641 plasmid pN19S0641, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
81. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052838 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S097 plasmid pN16S097, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
82. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP028316 (Salmonella enterica subsp. enterica serovar Typhimurium var. 5- strain CFSAN067217 plasmid pSC-31-2, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
83. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP051676 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1234 plasmid pN16S1234, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
84. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052783 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0679 plasmid pN19S0679-1, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
85. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052836 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S103 plasmid pN16S103, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
86. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022063 (Salmonella enterica strain FDAARGOS_312 plasmid unnamed3, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
87. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052781 (Salmonella enterica strain CVM N19S0949 plasmid pN19S0949, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
88. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052834 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S041 plasmid pN17S0041, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
89. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052793 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0388 plasmid pN19S0388, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
90. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052779 (Salmonella enterica strain 19TN07GT06K-S plasmid pN19S1233, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
91. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052832 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1040 plasmid pN17S1040, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
92. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP031362 (Salmonella enterica subsp. enterica serovar Heidelberg strain 5 plasmid p3, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
93. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052830 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1105 plasmid pN17S1105, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
94. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052828 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1126 plasmid pN17S1126, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
95. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052826 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1245 plasmid pN17S0637, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
96. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016409 (Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502916 plasmid pFSIS1502916, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
97. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052824 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1265 plasmid pN17S1265, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
98. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052822 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1349 plasmid pN17S1349, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
99. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016407 (Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502169 plasmid pFSIS1502169, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
100. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052820 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1442 plasmid pN17S1442, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
101. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016413 (Salmonella enterica subsp. enterica serovar Infantis strain CVM44454 plasmid pCVM44454, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
102. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016411 (Salmonella enterica subsp. enterica serovar Infantis strain N55391 plasmid pN55391, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
103. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052816 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1598 plasmid pN17S1598) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
104. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052814 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S349 plasmid pN17S0349, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
105. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022662 (Salmonella enterica subsp. enterica strain RM11065 plasmid pRM11065-2, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
106. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052812 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S376 plasmid pN17S0376, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
107. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052810 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S535 plasmid pN17S0535, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
108. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052808 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S637 plasmid pN17S0637, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
109. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052806 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S816 plasmid pN17S0816, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
110. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052791 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0552 plasmid pN17S0637, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
111. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052818 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1509 plasmid pN17S1509, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
112. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to CP052799 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S990 plasmid pN17S0990-1, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
113. spacer 2.1|792948|59|CP028702|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 11, identity: 0.814
-ggtgccagaaccgtaggccggataaggcgttcacgccgcatccggcaataagtgctccg CRISPR spacer tcgcacca-aaccgtaggccggataaggcgtttacgccgcatccggcaaaaagccgtacc Protospacer *..*** ***********************.**************** ***. * *
114. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer tgttcatgccggatgcggcgtgaacgccttatccggcctacgaatggcg Protospacer * . .**********.*************************. * * .
115. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccattgccggatgcggcgtgaacgccttatccggcctacgaatggcg Protospacer . * . **********.*************************. * * .
116. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccattgccggatgcggcgtgaacgccttatccggcctacgagtggcg Protospacer . * . **********.*************************. * * .
117. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer tgttcatgccggatgcggcgtgaacgccttatccggcctacgaatggcg Protospacer * . .**********.*************************. * * .
118. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccattgccggatgcggcgtgaacgccttatccggcctacgaatggcg Protospacer . * . **********.*************************. * * .
119. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccattgccggatgcggcgtgaacgccttatccggcctacgagtggcg Protospacer . * . **********.*************************. * * .
120. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer tgttcatgccggatgcggcgtgaacgccttatccggcctacgaatggcg Protospacer * . .**********.*************************. * * .
121. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccattgccggatgcggcgtgaacgccttatccggcctacgaatggcg Protospacer . * . **********.*************************. * * .
122. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccattgccggatgcggcgtgaacgccttatccggcctacgagtggcg Protospacer . * . **********.*************************. * * .
123. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer tgttcatgccggatgcggcgtgaacgccttatccggcctacgaatggcg Protospacer * . .**********.*************************. * * .
124. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccattgccggatgcggcgtgaacgccttatccggcctacgaatggcg Protospacer . * . **********.*************************. * * .
125. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccattgccggatgcggcgtgaacgccttatccggcctacgagtggcg Protospacer . * . **********.*************************. * * .
126. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacaaccgccggatgcggcgtgaacgccttatccggcctacgggtgagc Protospacer . *.. .*********.************************** * .*
127. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer cacaaccgccggatgcggcgtgaacgccttatccggcctacgggtgagt Protospacer . *.. .*********.************************** * .*
128. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer agctggtgccggatgcggcgtaaacgccttatccggcctacaaatgcgc Protospacer * ************.****.*******************.. * *
129. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_AP023209 (Escherichia coli strain TUM18781 plasmid pMTY18781-4, complete sequence) position: , mismatch: 11, identity: 0.776
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer tgtttatgccggatgcggtgtgaacgccttatccggcctacggatggcc Protospacer * . .**********.*.************************ * *
130. spacer 5.10|3313387|32|CP028702|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP026128 (Acinetobacter baumannii strain ABNIH28 plasmid pABA-1fe1, complete sequence) position: , mismatch: 11, identity: 0.656
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gatacattgccaattacaaccgacagttcaaa Protospacer *****..**************** .
131. spacer 6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT matches to MF158039 (Shigella phage Sf12, complete genome) position: , mismatch: 11, identity: 0.667
gaaatgctggtgagcgttaatgccgcaaacaca CRISPR spacer cggcacttggggagcgttaatgctgcaaacaat Protospacer .. .*** ************.*******
132. spacer 6.5|3339392|33|CP028702|PILER-CR,CRISPRCasFinder,CRT matches to MF158042 (Shigella phage Sd1, complete genome) position: , mismatch: 11, identity: 0.667
gaaatgctggtgagcgttaatgccgcaaacaca CRISPR spacer cggcacttggagagcgttaatgctgcaaacaat Protospacer .. .*** ************.*******
133. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer agtttatgccggatgcggcgtgaacgccttatccggcctacgtagagca Protospacer . .**********.************************* * *
134. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer agtttatgccggatgcggcgtgaacgccttatccggcctacgtagagca Protospacer . .**********.************************* * *
135. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccactgccggatgcggcgtggacgccttatccggcctacgagtggcg Protospacer . * . **********.*****.*******************. * * .
136. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer agtttatgccggatgcggcgtgaacgccttatccggcctacgtagagca Protospacer . .**********.************************* * *
137. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer agtttatgccggatgcggcgtgaacgccttatccggcctacgtagagca Protospacer . .**********.************************* * *
138. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccactgccggatgcggcgtggacgccttatccggcctacgagtggcg Protospacer . * . **********.*****.*******************. * * .
139. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer agtttatgccggatgcggcgtgaacgccttatccggcctacgtagagca Protospacer . .**********.************************* * *
140. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer agtttatgccggatgcggcgtgaacgccttatccggcctacgtagagca Protospacer . .**********.************************* * *
141. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccactgccggatgcggcgtggacgccttatccggcctacgagtggcg Protospacer . * . **********.*****.*******************. * * .
142. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer agtttatgccggatgcggcgtgaacgccttatccggcctacgtagagca Protospacer . .**********.************************* * *
143. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer agtttatgccggatgcggcgtgaacgccttatccggcctacgtagagca Protospacer . .**********.************************* * *
144. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccactgccggatgcggcgtggacgccttatccggcctacgagtggcg Protospacer . * . **********.*****.*******************. * * .
145. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer aatttttgccggatgcggcgtgaacgccttatccggcctacaacgggca Protospacer . **********.************************..* * *
146. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga--- CRISPR spacer cacaattgccggatgcggcgtaaacgccttatccggcctaca---tggataa Protospacer . *.. **********.****.*******************. .***
147. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer ttttcttgccggatgcggcgtaaacgccttatccggcctacaggacgtg Protospacer *.. **********.****.*******************.* ** .
148. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_AP023207 (Escherichia coli strain TUM18781 plasmid pMTY18781-2, complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer ctcaaatgccggatgcggcgtgaacgccttatccggcctacgcacacta Protospacer ..*...**********.************************* . *
149. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_AP023207 (Escherichia coli strain TUM18781 plasmid pMTY18781-2, complete sequence) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer agcaaatgccggatgcggcgtaaacgccttatccggcctacatttggca Protospacer *...**********.****.*******************. .* * *
150. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to MG065691 (UNVERIFIED: Campylobacter phage A11a, complete genome) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer acactttgccggatgcggcgtgaacgcctgatccggcctacggtaagcc Protospacer * **********.************ *************. *
151. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to MG065686 (UNVERIFIED: Campylobacter phage A18a, complete genome) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer acactttgccggatgcggcgtgaacgcctgatccggcctacggtaagcc Protospacer * **********.************ *************. *
152. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to MF374379 (Escherichia phage DN1, complete genome) position: , mismatch: 12, identity: 0.755
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer tgtttatgccggatgcggcgtgaacgccttatccggcctacaaaccgcg Protospacer * . .**********.************************.. .** .
153. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 13, identity: 0.735
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer gttgattgccggatgcggcgtaaacgccttatccggcctacattcggca Protospacer ..*. **********.****.*******************. .. * *
154. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_AP023208 (Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence) position: , mismatch: 13, identity: 0.735
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer ttgttttgccggatgcggcgtgaacgccttatccggcctacaaaaccat Protospacer *. **********.************************.. * .
155. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NC_049343 (Escherichia phage 500465-2, complete genome) position: , mismatch: 13, identity: 0.735
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer agtgtttgccggatgcggcgtgaacgccttatccgacctacgtgtgacg Protospacer .* **********.******************.****** * . .
156. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to MT230112 (Escherichia coli strain DH5alpha plasmid pESBL112, complete sequence) position: , mismatch: 13, identity: 0.735
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer ttatgttgccggatgcggcgtaaacgccttatccggcctacaaaagcaa Protospacer *. * **********.****.*******************.. .*
157. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 14, identity: 0.714
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer gtaaaatgccggatgcggcgtgaacgccttatccggcctacaaaccaag Protospacer . ...**********.************************.. .*...
158. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 14, identity: 0.714
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer acaaaatgccggatgcggcgtaaacgccttatccggcctacaaaatcgt Protospacer * ...**********.****.*******************.. . *
159. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to MG065691 (UNVERIFIED: Campylobacter phage A11a, complete genome) position: , mismatch: 14, identity: 0.714
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer gcttcttgccggatgcggcgtgaacgccttatccggcctacaaaatcat Protospacer *. **********.************************.. . .
160. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to MG065686 (UNVERIFIED: Campylobacter phage A18a, complete genome) position: , mismatch: 14, identity: 0.714
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer gcttcttgccggatgcggcgtgaacgccttatccggcctacaaaatcat Protospacer *. **********.************************.. . .
161. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP053721 (Escherichia coli strain CP131_Sichuan plasmid pCP131-IncHI1, complete sequence) position: , mismatch: 14, identity: 0.714
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer tgttcatgccggatgcggcgtgaacgccttatccagcctacaaaattgt Protospacer * . .**********.*****************.******.. . *
162. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP019246 (Escherichia coli strain Combat13F7 plasmid pCombat13F7-1, complete sequence) position: , mismatch: 14, identity: 0.714
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer tgtttatgccggatgcggcgtaaacgccttatccggcctacaaaaagcg Protospacer * . .**********.****.*******************.. * .
163. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP044299 (Escherichia coli strain P59A plasmid pP59A-CTX-M-55, complete sequence) position: , mismatch: 14, identity: 0.714
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer tgttcatgccggatgcggcgtgaacgccttatccagcctacaaaattgt Protospacer * . .**********.*****************.******.. . *
164. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 15, identity: 0.694
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer agtcaatgccggatgcggcgtgaacgccttatccggcctacaaaagcat Protospacer . ..**********.************************.. .
165. spacer 4.1|2781128|49|CP028702|CRISPRCasFinder matches to NZ_CP044308 (Escherichia coli strain C27A plasmid pC27A-3, complete sequence) position: , mismatch: 15, identity: 0.694
tccgggtgccggatgcagcgtgaacgccttatccggcctacggctcgga CRISPR spacer caccattgccggatgcggcgtaaacgccttatccggcctacaaaaaacg Protospacer . * . **********.****.*******************.. . .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
911960 : 965264
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP028702|911960:965264|DBSCAN-SWA ATTACCGCGCCTTAACCCATTCCGCCACTTCCGCCCACTCACCGCGAAAGACAACTTTTTCCGCTTTTTTCTCAAGCTGATAGCGATACATCGGGTCGTAATATTCTTCAAGTAACGGCACCAGCCAGGCCAGATGACCGTCGGTGCTGCCGGTGGTGAGTTGCGTTGTCAGTGCTGCATCCAGCCTTGCAGCCAGTTCGTTATAGCGCTGTAGCCCCAGCCGACGCTTAATCGCCGAAAGTCCGTGATGCAGGTATTCGCAATACTCCTGCCAGCCCTGTTCGTCGCCGTACGCGTGGGTAAAATCATGATGCATACGCAAGAAATACTCTTCGTTCAGGCGCTCAAGACGGATCTCAAACGGATCTTCTACCACCGCAATCGCCGCCTGAGTCATTCGCTCGCGCAGGCATTCCGGCAGGTGATTCGAACCGATCATCCGGCTTTCGTCTTCCAGCACCCACAGGCGCAAATTCTGACGGGCGTCGGTTTTTAGCATTTCGGCAGCCAGCAGGTTTTCAAAACTCGCCTGGCTAAGTTGTGGTTGTAACGTGCGACCAAACGCCGAACCGCGATGACGCGCCAACCCTTCCAGATCAACACCGTTCGGCTGTTGCTGCACTAACAGCGTTTTACCGCTGCCGGTACAACCGCCAATCAGCACTATCGGTTTTTGTGCCAGTTCAATAGTCGCCTGAATCGCGGTCTGGCGCAGTGCCTTATAACCGCCTTCCACCAGCGGATAATCAATCCCCGCTGCATGCAACCAGCTTTGCACAATATGTGAGCGCTGACCGCCACGGGCGCAGCAGAGAATACCTTGCGGATTTTGCAGGCACGCTGCCCGCCAGGCGTCCATGCGCTGCTGACGAATTTCACCCGCCACCAGTTTATGTCCCAGCGCCAGCGCTGCGTCTGAGCCTTGCTGTTTATAGCAGGTGCCAACGGCGGCGCGTTCATCGTTATTCATTAACGGCAGATTGATAGCGGCGGGCATTGCGCCGTGCTCAAACTCGATAGGGGCGCGAACATCAATAATGGGCGTATCAGCAATCAGCAGGGCACGATAGTCCTGTTCCGTGTGTCTCTCTTGCATAGTTAAAAGTGAACCTCAAATCAGCTTGCGCGCTATTTTACGCGCCAACGCGCAAGGAAACTTGATTTTTAACTGCGTGGGTTGCCGAAAATTTCTAAAAATCCGCTGATTTCCGGCCTGCGCTGGGTAAACAAGGTCACAATATCTTCTACCGCTTTGCCGCTGCCAAATTTGCGCCATGCCAGACTCAATGGCGAAGGAGGGCGCATCGTTGGGATTACCCGGCTGACCAGTTGTTGATTATCGATCATTGACTGGCAAAGCGATTTTGGCAAAAAACCAATGCCAACGCCCGCCAGATGGGCGGCGATTTTCGTTTCCATATCGGGAACAATAATCTCTTTTTGCCCTGGCAATCGCCAGGCGACGCGTTTGGTTAAGGTGCGGGCGCTGTCTTCAATATTGACCGCCGGAAAGCGCCGCAACTGCGCTTCTGTTAGCGGCTCTTCAACGTTCGCCAGCGGATGATCCGCCGCCATGACAAAGCGCCATTGCACCGATCCTAAGGGATCAAGACTAAAGGTATTTGCCAGCGCCTCAGTTCCCGTGACGCCGATAGCCAGCGAAAAACCTTCGTACAATAGCGAGTCCCAGACGCCCATATAGATTTGTCGGGAGATGTGAAACTGGGTAAAGGGGTAACGCTCATTCAGCCACGCCAGCAACTGGGCGACGGCCTGGGGGTTGTAGAGCAGGTTGTTGATGACAATATTCACCTGGCGTTCCACGCCATCATTCACCTGTTGCAGCTCGCTTGGCATACTTTCCAGCCAGCTCAGCCAGTCTCTGGCCTGGGAAAGTAGATGCTCGCCAGCCGCTGTCAACGTCACGCTGCGAGTCGTACGGAAAAACAGCGCTACTCCGGTATTCTCTTCCAGAAGTTTAATGCGATAACTGATCGTCGCCGTGGTTTTACATAATCGTTCTGCCGCTTTTGAAAAACTTCCTGTTTCAGCAACCGCAATGAAAGTCCGCAAGGTTTCTGGATCGAACATCTTCAGGTATCCCCTTTTAAATCCGCAAGTTGCGTGATTTTCTTATCCTCTGATTTATCAGTATTTTTACATGATAACCCTGTTCAATTTGTGGACTAAATCTAGTTTTGGAAAAATATTCCAACTTTTGTATTGATGTTGTTCTCTTAAGGTTTTAGATTGCCTGTTATTGAAACCAAGCTGACCGGTCGGCGGTGGTTGAACGGAATTATGTTACAAGGACAAAAAGATGAAACTTCAGGTATTACCGTTAAGTCAGGAAGCCTTTAGTGCTTATGGCGACGTAATCGAAACGCAGCAACGGGATTTTTTCCATATTAACAATGGCCTGGTGGAGCGTTACCACGATTTGGCGCTGGTTGAGATTCTTGAGCAAGACTGTACGCTTATCAGCATTAACCGCGCGCAACCGGCGAATCTGCCGCTGACCATTCACGAACTCGAACGTCATCCGCTGGGTACTCAGGCCTTTATCCCGATGAAAGGTGAGGTGTTTGTGGTGGTCGTGGCGTTAGGTGACGACAAACCAGACCTGTCAACGCTGCGGGCGTTTATCACCAACGGCGAACAGGGAGTGAATTACCATCGTAACGTCTGGCATCACCCACTTTTCGCCTGGCAGCGCGTCACCGATTTTCTGACCATCGATCGCGGCGGCAGTGACAACTGTGATGTTGAAAGTATTCCTGAACAGGAACTCTGTTTTGCGTGACGCCTGCAACCGACTTGCATAAGATAAACTAATTGTTCATTGTTTATGCTCACTTGTAGGTCGGAGTTAACGTAGGTATGACGGAAGTTAGACGGCGCGGCAGGCCAGGACAGGCGGAGCCTGTGGCACAGAAGGGCGCACAGGCGTTAGAGCGGGGAATTGCGATTCTGCAATATTTGGAAAAAAGTGGGGGAAGTTCGTCGGTTAGCGATATTTCTCTCAATCTGGATTTGCCGCTCTCCACGACCTTTCGCTTGCTGAAGGTTTTACAGGCAGCGGATTTTGTCTATCAGGACAGTCAATTAGGCTGGTGGCATATAGGATTAGGTGTCTTTAACGTCGGTGCGGCGTACATCCATAACCGCGATGTCCTCTCCGTCGCCGGGCCGTTTATGCGCCGCCTGATGTTACTTTCCGGCGAAACGGTCAATGTCGCGATCCGTAACGGCAATGAAGCGGTATTAATTGGTCAGTTAGAGTGTAAATCGATGGTCAGGATGTGTGCGCCACTGGGCAGTCGTCTGCCACTGCATGCTTCCGGTGCGGGCAAAGCGCTGCTTTATCCGCTGGCGGAAGAGGAGTTGATGAGCATCATTCTGCAAACCGGTTTGCAGCAGTTTACGCCAACTACGCTTGTGGATATGCCCACCTTGCTGAAGGACCTGGAACAAGCGCGTGAACTGGGCTATACCGTAGATAAAGAAGAGCATGTTGTAGGTCTGAATTGCATAGCTTCAGCAATTTACGATGATGTCGGTAGTGTTGTTGCCGCTATCTCCATCTCCGGGCCTTCATCAAGACTGACAGAAGATCGTTTTGTCAGTCAGGGTGAGCTGGTCAGAGACACCGCCCGCGATATCAGCACGGCGTTGGGACTGAAAGCACATCCATAATGTCTGTCGCATCCCGCTCTGCGGAGCGGGTTTTTTTGACAAAATTTGAAAGTTGGAAAAATTTTCCAATAAATAGAGGTAGGAATAAAATGGCAAAAATGAGAGCCGTTGACGCGGCAATGTATGTGCTGGAGAAAGAAGGTATCACTACCGCCTTCGGTGTTCCGGGAGCTGCAATCAATCCGTTCTACTCAGCGATGCGTAAGCACGGCGGTATTCGTCACATTCTGGCGCGTCATGTGGAAGGTGCTTCGCACATGGCGGAAGGTTATACCCGCGCAACGGCAGGGAATATCGGCGTATGTCTGGGGACTTCCGGTCCTGCGGGCACGGACATGATCACCGCGCTCTATTCCGCTTCTGCTGATTCCATTCCTATTCTGTGCATTACCGGCCAGGCACCGCGCGCCCGTCTGCATAAAGAAGATTTTCAGGCCGTAGATATTGAAGCAATTGCTAAACCGGTCAGCAAAATGGCGGTTACAGTTCGTGAAGCGGCGCTGGTGCCTCGCGTGCTGCAACAGGCATTTCACCTGATGCGTTCTGGTCGTCCGGGTCCGGTACTGGTGGATTTACCGTTCGACGTTCAGGTTGCGGAAATCGAGTTTGATCCTGACATGTACGAACCGCTGCCGGTCTACAAACCTGCTGCCAGCCGTATGCAGATCGAAAAAGCTGTAGAAATGTTAATCCAGGCCGAACGTCCGGTGATTGTTGCCGGGGGCGGGGTAATTAATGCTGACGCAGCTGCACTGTTACAACAGTTTGCTGAACTGACCAGCGTTCCGGTGATCCCAACGCTAATGGGCTGGGGCTGTATCCCGGACGATCATGAACTGATGGCCGGGATGGTGGGTCTGCAAACCGCGCATCGTTACGGTAACGCAACGCTGCTGGCGTCTGACATGGTGTTTGGTATCGGTAACCGTTTTGCTAACCGTCATACCGGCTCGGTAGAGAAATACACCGAAGGGCGCAAAATCGTTCATATTGATATTGAGCCGACGCAAATTGGTCGCGTGCTGTGTCCGGATCTCGGTATTGTCTCTGATGCTAAAGCGGCGCTGACACTGCTGGTTGAAGTGGCGCAGGAGATGCAAAAAGCGGGTCGTCTGCCGTGTCGTAAAGAATGGGTCGCCGACTGCCAGCAGCGTAAACGCACTTTGCTGCGCAAAACCCACTTCGACAACGTGCCGGTGAAACCGCAGCGCGTGTATGAAGAGATGAACAAAGCCTTTGGTCGCGATGTTTGTTATGTCACCACCATTGGTCTGTCACAAATCGCTGCGGCACAAATGCTGCATGTCTTTAAAGACCGCCACTGGATCAACTGTGGTCAGGCTGGTCCGTTAGGCTGGACGATTCCGGCTGCGCTAGGGGTTTGTGCCGCTGATCCGAAACGCAATGTGGTGGCGATTTCTGGCGACTTTGACTTCCAGTTCCTGATTGAAGAGTTAGCTGTTGGCGCGCAGTTCAACATTCCGTACATCCATGTGCTGGTCAACAACGCTTATCTGGGGCTGATTCGTCAGTCACAACGCGCTTTTGACATGGACTACTGCGTGCAACTCGCTTTCGAGAATATCAACTCCAGTGAAGTGAATGGCTACGGTGTTGACCACGTAAAAGTAGCGGAAGGTTTAGGTTGTAAAGCTATTCGGGTCTTCAAACCGGAAGATATTGCGCCAGCCTTTGAACAGGCGAAAGCCTTAATGGCGCAATATCGGGTACCGGTAGTCGTGGAAGTTATTCTCGAGCGTGTGACCAATATTTCGATGGGCAGCGAACTGGATAACGTCATGGAATTTGAAGATATCGCCGATAACGCAGCGGACGCACCGACTGAAACCTGCTTCATGCACTATGAATAAGGGAGATAAATAATGTTACGTTTCTCTGCTAATTTATCGATGTTATTTGGAGAATATGATTTTCTCGCCCGTTTTGAGAAAGCTGCGCAGTGTGGTTTTCGCGGCGTTGAATTTATGTTTCCTTATGACTACGACATTGAAGAATTAAAACATGTGCTGGCGAGTAATAAACTCGAACATACGCTGCACAATTTACCGGCGGGTGACTGGGCGGCGGGGGAGCGCGGTATTGCCTGTATTCCTGGCCGTGAAGAAGAGTTTCGGGATGGCGTAGCAGCAGCGATTCGTTATGCCCGTGCGCTGGGTAATAAAAAAATTAACTGTCTGGTCGGTAAAACGCCGGCTGGTTTCAGCAGTGAACAGATTCACGCAACGCTTGTAGAAAACCTGCGTTATGCCGCGAATATGCTGATGAAAGAAGATATTTTATTACTGATTGAACCTATTAACCATTTTGATATTCCTGGTTTCCATCTCACCGGAACTCGGCAGGCGCTGAAATTGATTGATGATGTTGGTTGCTGCAATTTGAAAATTCAGTATGACATTTATCATATGCAGCGGATGGAAGGTGAATTAACCAACACCATGACTCAGTGGGCTGATAAAATTGGTCACCTGCAAATTGCCGATAATCCGCATCGCGGCGAACCGGGAACCGGAGAAATTAATTATGATTATCTCTTTAAGGTAATCGAAAATTCTGACTACAACGGTTGGGTTGGGTGTGAATATAAACCCCAAACCACCACGGAAGCCGGTTTACGCTGGATGGATCCGTACCGTTAAAACGTAACGCTATTCAGACAATGCTTTTTTAGGCCGCTAAGTTGGCAGGGGATCGTGTTGTCTGAATTCAGGAAAAGCGAAATTTAAAAGAGGTTAATTATGAAACTGGGATTTATTGGCTTAGGCATTATGGGTACACCGATGGCCATTAATCTGGCGCGTGCCGGTCATCAATTACATGTCACGACCATTGGACCGGTTGCTGATGAATTACTGTCACTGGGTGCCGTCAGTGTTGAAACTGCTCGCCAGGTAACGGAAGCATCGGACATCATTTTTATTATGGTGCCGGACACACCTCAGGTTGAAGAAGTTCTGTTCGGTGAAAATGGTTGTACCAAAGCCTCGCTGAAGGGCAAAACCATTGTTGATATGAGCTCCATTTCCCCGATTGAAACTAAGCGTTTCGCTCGTCAGGTGAATGAACTGGGCGGCGATTATCTCGATGCGCCAGTCTCCGGCGGTGAAATCGGTGCGCGTGAAGGGACGTTGTCGATTATGGTTGGCGGTGATGAAGCGGTATTTGAACGTGTTAAACCGCTGTTTGAACTGCTCGGTAAAAATATCACCCTCGTGGGCGGTAACGGCGATGGTCAAACCTGCAAAGTGGCAAATCAGATTATCGTGGCGCTCAATATTGAAGCGGTTTCTGAAGCCCTGCTATTTGCTTCAAAAGCCGGTGCGGACCCGGTACGTGTGCGCCAGGCGCTGATGGGCGGCTTTGCTTCCTCACGTATTCTGGAAGTTCATGGCGAGCGTATGATTAAACGCACCTTTAATCCGGGCTTCAAAATCGCTCTGCACCAGAAAGATCTCAACCTGGCACTGCAAAGTGCGAAAGCACTTGCGCTGAACCTGCCAAACACTGCGACCTGCCAGGAGTTATTTAATACCTGTGCGGCAAACGGTGGCAGCCAGTTGGATCACTCTGCGTTAGTGCAGGCGCTGGAATTAATGGCTAACCATAAACTGGCCTGATACCCGCAATAAAAATGGCCGATATCAGAAAATGAATCGGCCAGCAATATTAAAAAAGAAAGCAGCCAAAGATGTTGCTTCAGTATTAAAAATAATATTTTTATTTTATTTGTTCCTCATAGCTAGATTAAAACAACGTTATTCGATACGTGAAATTAAGAGGGATTTATGGAACATCAGAGAAAACTATTCCAGCAACGCGGCTATAGCGAAGATCTATTGCCGAAAACGCAAAGCCAGCGGACCTGGAAAACATTTAACTATTTTACCTTATGGATGGGTTCGGTTCATAACGTTCCCAATTATGTGATGGTCGGCGGCTTTTTTATTCTCGGCTTGTCTACCTTTAGTATTATGCTGGCAATTATCCTCAGCGCCTTTTTCATTGCCGCGGTAATGGTATTAAACGGTGCTGCGGGCAGTAAATACGGTGTGCCTTTTGCCATGATCCTGCGTGCTTCTTACGGTGTACGTGGTGCACTGTTTCCCGGATTATTAAGGGGCGGAATTGCCGCCATCATGTGGTTTGGTTTGCAATGTTACGCGGGGTCACTGGCCTGCTTGATTCTGATTGGCAAAATCTGGCCGGGATTTTTAACTCTCGGTGGTGATTTCACTCTGTTAGGCCTTTCTCTACCGGGCTTAATTACTTTCTTAATCTTCTGGCTGGTCAACGTTGGTATAGGTTTTGGCGGTGGCAAAGTTTTAAATAAATTCACTGCCATTCTTAACCCGTGCATCTATATCGTTTTCGGCGGTATGGCGATTTGGGCGATTTCACTGGTCGGGATCGGTCCAATCTTTGACTACATTCCGAGCGGTATTCAGAAAGCAGAAAACGGTGGCTTCCTGTTCCTGGTGGTGATTAACGCGGTAGTTGCGGTCTGGGCGGCACCGGCGGTGAGCGCATCCGACTTTACGCAAAACGCCCACTCGTTTCGTGAGCAGGCGCTGGGCTGATGAATCCCCTAATGATTTTGGTAAAAATCATTAAGTTAAGGTGGATACACATCTTGTCATATGATCAAATGGTTTCGCGAAAAATCAATAATCAGACAACAAGATGTGCGAACTCGATATTTTACACGACTCTCTTTACCAATTCTGCCCCGAATTACACTTAAAACGACTCAACAGCTTAACGTTGGCTTGCCACGCATTACTTGACTGTAAAACTCTCACTCTTACCGAACTTGGCCGTAACCTGCCAACCAAAGCGAGAACAAAACATAACATCAAACGAATCGACCGATTGTTAGGTAATCGTCACCTCCACAAAGAGCGACTCGCTGTATACCGTTGGCATGCTAGCTTTATCTGTTCGGGCAATACGATGCCCATTGTACTTGTTGACTGGTCTGATATTCGTGAGCAAAAACGACTTATGGTATTGCGAGCTTCAGTCGCACTACACGGTCGTTCTGTTACTCTTTATGAGAAAGCGTTCCCGCTTTCAGAGCAATGTTCAAAGAAAGCTCATGACCAATTTCTAGCCGACCTTGCGAGCATTCTACCGAGTAACACCACACCGCTCATTGTCAGTGATGCTGGCTTTAAAGTGCCATGGTATAAATCCGTTGAGAAGCTGGGTTGGTACTGGTTAAGTCGAGTAAGAGGAAAAGTACAATATGCAGACCTAGGAGCGGAAAACTGGAAACCTATCAGCAACTTACATGATATGTCATCTAGTCACTCAAAGACTTTAGGCTATAAGAGGCTGACTAAAAGCAATCCAATCTCATGCCAAATTCTATTGTATAAATCTCGCTCTAAAGGCCGAAAAAATCAGCGCTCGACACGGACTCATTGTCACCACCCGTCACCTAAAATCTACTCAGCGTCGGCAAAGGAGCCATGGGTTCTAGCAACTAACTTACCTGTTGAAATTCGAACACCCAAACAACTTGTTAATATCTATTCGAAGCGAATGCAGATTGAAGAAACCTTCCGAGACTTGAAAAGTCCTGCCTACGGACTAGGCCTACGCCATAGCCGAACGAGCAGCTCAGAGCGTTTTGATATCATGCTGCTAATCGCCCTGATGCTTCAACTAACATGTTGGCTTGCGGGCGTTCATGCTCAGAAACAAGGTTGGGACAAGCACTTCCAGGCTAACACAGTCAGAAATCGAAACGTACTCTCAACAGTTCGCTTAGGCATGGAAGTTTTGCGGCATTCTGGCTACACAATAACAAGGGAAGACTTACTCGTGGCTGCAACCCTACTAGCTCAAAATTTATTCACACATGGTTACGCTTTGGGGAAATTATGAGGGGATCTCTCAGGGCGCTGGGGCAAACGCTGGGTTTAGTTGTGGCCTATATTCTGTTTGCGGTCGCCGGGGTATGTATTATTGCCGGAGCCAGTATTCACTACGGCGCTGATACCTGGAACGTGCTGGATATTGTTCAGCGTTGGGACAGCCTGTTCGCCTCGTTCTTTGCGGTACTGGTTATTCTGATGACAACTATCTCCACTAACGCGACCGGTAATATTATTCCAGCCGGTTATCAGATTGCCGCCATTGCACCGACAAAACTGACCTATAAAAACGGCGTACTGATTGCCAGTATTATCAGCTTGCTGATCTGCCCGTGGAAATTAATGGAAAATCAGGACAGCATTTATCTTTTCCTCGATATTATCGGCGGAATGCTTGGTCCGGTAATTGGTGTCATGATGGCGCATTATTTTGTGGTGATGCGCGGACAAATTAATCTTGATGAACTGTATACCGCACCTGGCGATTATAAATATTACGATAACGGTTTTAACCTCACTGCGTTTTCAGTAACTCTGGTGGCCGTTATTTTATCTCTTGGCGGTAAGTTTATTCACTTTATGGAACCGTTATCGCGTGTTTCATGGTTTGTCGGCGTCATCGTCGCCTTTGCGGCCTACGCCTTATTAAAGAAACGTACAACAGCAGAAAAAACAGGAGAGCAAAAAACCATAGGTTAATTAATCCCGATATTGAACATTGAGTTAAAAACCAATCTGTATTTTACAAGGAGTTTGTTATGTCTTTTGATTTAATCATTAAAAACGGCACCGTTATTTTAGAAAACGAAGCTCGCGTTGTAGATATCGCCGTTAAAGGCGGAAAAATTGCTGCTATCGGTCAGGATCTGGGCGATGCAAAAGAAGTTATGGATGCGTCTGGTCTGGTGGTTTCGCCGGGCATGGTTGATGCGCACACCCATATTTCTGAACCGGGTCGTAGCCACTGGGAAGGTTATGAAACCGGTACTCGCGCAGCGGCAAAAGGTGGTATCACCACCATGATCGAAATGCCGCTCAACCAGCTGCCTGCAACGGTTGACCGCGCTTCAATTGAACTGAAGTTCGATGCCGCTAAAGGCAAGCTGACTATTGATGCGGCACAACTCGGTGGCCTGGTGTCTTACAACATCGACCGTCTGCATGAGCTGGATGAAGTGGGCGTTGTCGGCTTCAAATGCTTCGTTGCGACCTGTGGCGATCGCGGTATCGACAACGACTTCCGTGATGTAAACGACTGGCAGTTCTTCAAAGGTGCGCAGAAGCTGGGCGAACTGGGTCAGCCGGTGCTGGTGCACTGCGAAAACGCGCTGATTTGTGACGAACTGGGCGAAGAAGCGAAGCGTGAAGGTCGCGTAACCGCTCATGACTATGTGGCTTCGCGTCCGGTATTTACCGAAGTGGAAGCAATTCGCCGCGTACTGTATCTGGCGAAAGTTGCTGGTTGCCGTCTGCACGTTTGCCACGTCAGCAGCCCGGAAGGTGTTGAGGAAGTGACTCGTGCACGTCAGGAAGGTCAGGACGTTACTTGTGAATCCTGCCCGCATTACTTTGTACTGGATACCGATCAGTTCGAAGAAATCGGTACTCTGGCGAAGTGTTCACCGCCGATCCGCGATCTGGAAAACCAGAAAGGCATGTGGGAAAAACTGTTTAACGGTGAAATCGACTGCCTGGTTTCCGACCACTCTCCATGCCCGCCGGAAATGAAAGCCGGTAACATCATGAAAGCATGGGGCGGTATCGCCGGTCTGCAAAGCTGCATGGACGTGATGTTCGATGAAGCGGTACAGAAACGCGGTATGTCTCTGCCAATGTTCGGCAAATTAATGGCGACTAACGCAGCAGATATTTTCGGTCTGCAGCAAAAAGGCCGTATCGCCCCAGGAAAAGATGCCGACTTCGTCTTCATTCAGCCGAATAGCAGCTATGTTCTTACCAATGACGATCTGGAATATCGCCACAAAGTCAGCCCGTATGTTGGCCGTACCATTGGCGCGCGTATCACGAAAACCATCTTACGTGGTGATGTGATTTACGACATTGAACAGGGCTTCCCTGTTGCGCCGAAAGGTCAATTTATCCTTAAACATCAGCAGTAATCTGGCCCCTGCAATGCCCGTCCTTGCGGCGGGCATTCTCCGGTTAAGGTGTGTTTATGTTCAATTTTGCAGTCAGCCGCGAAAGCCTGTTATCAGGATTTCAGTGGTTTTTCTTTATTTTTTGCAACACGGTTGTGGTTCCTCCTACGCTACTTTCTGCTTTTCAGTTGCCGCAAAGTAGCCTGCTTACGCTCACGCAATATGCTTTTCTTGCTACCGCACTGGCCTGCTTCGCTCAGGCGTTTTGCGGTCATCGTCGCGCTATTATGGAAGGGCCAGGTGGCCTGTGGTGGGGAACCATCCTTACTATCACCCTTGGTGAAGCATCGCGCGGGACACCGATCAACGATATCGCCACCAGCCTGGCAGTGGGGATTGCACTCTCCGGCGTGCTGACGATGTTGATTGGTTTTAGCGGATTAGGCCATCGCCTGGCACGGTTATTTACGCCGTCGGTGATGGTCTTGTTTATGTTGATGCTGGGCGCGCAGCTGACCACTATCTTTTTCAAAGGTATGCTCGGGCTGCCGTTTGGCATAGCCGACCCGAATTTTAAAATTCAGTTACCGCCGTTCGCGCTCTCGGTGGCGGTGATGTGCCTGGTACTGGCGATGATTATCTTCCTGCCGCAACGTTTTGCCCGTTATGGCCTGCTGGTCGGCACCATAACCGGCTGGTTGTTGTGGTACTTTTGCTTTCCTTCTTCGCACTCGCTCTCCGGTGAGTTGCACTGGCAGTGGTTCCCGCTCGGCAGTGGCGGTGCTTTGTCGCCGGGAATTATTCTGACGGCGGTGATTACAGGTCTGGTAAATATCAGCAATACCTACGGTGCGATTCGGGGCACGGATGTTTTTTATCCGCAGCAGGGCGCAGGGAATACGCGTTATCGTCGTAGCTTTGTGGCGACCGGATTTATGACGCTGATAACCGTACCGCTGGCGGTAATTCCATTTTCACCGTTTGTTTCATCCATTGGTTTATTAACCCAGACTGGCGATTACACGCGGCGTTCGTTTATTTATGGCAGCGTTATTTGCCTGCTGGTGGCGCTGGTTCCTGCACTCACGCGACTGTTTTGCAGTATCCCTTTACCCGTGAGTAGTGCGGTCATGCTGGTTTCTTATCTGCCTTTACTCTTTTCCGCGCTGGTGTTTAGCCAGCAAATAACGTTTACCGCTCGCAATATTTATCGACTCGCATTGCCGTTATTTGTCGGCATATTTTTAATGGCATTACCGCCTGTGTATCTGCAAGACCTTCCATTAACGCTTCGTCCTCTGCTCAGTAACGGCTTATTGGTCGGGATTTTACTGGCTGTTCTTATGGATAACCTTATTCCGTGGGAACGCATCGAATAATTTGTTGAAAAAGGATTGATAATGAAGATTGTCATTGCGCCAGACTCTTTTAAAGAGAGCTTAAGTGCAGAAAAATGTTGTCAGGCAATTAAAGCCGGGTTTTCGACCCTCTTTCCCGATGCGAACTATATCTGTTTGCCGATAGCGGATGGCGGCGAAGGGACGGTGGATGCGATGGTCGCCGCGACGGGCGGCAACATCGTGACGCTTGAAGTCTGCGGGCCGATGGGCGAAAAAGTGAATGCTTTTTATGGCCTTACCGGCGACGGGAAAACGGCGGTGATTGAGATGGCGGCAGCAAGTGGCCTGATGCTGGTCGCGCCTGAAAAGCGTAATCCGTTGCTGGCCTCCAGTTTTGGTACGGGGGAGTTAATTCGTCATGCGCTGGATAACGACATTCGCCATATTATTCTCGGCATTGGCGGCAGTGCGACGGTCGACGGCGGTATGGGCATGGCGCAGGCGCTCGGTGTGCGTTTCCTTGATGCCGACGGTCAGGCGCTGGCGGCAAACGGTGGTAATTTAGCGCGCGTGGCAAGCATTGAGATGGATGAATGCGATCCGCGTCTGGCGAATTGCCATATTGAAGTAGCATGTGACGTTGATAACCCGCTGGTAGGGGCACGCGGCGCGGCGGCGGTGTTTGGCCCGCAAAAAGGGGCAACGCCGGAGATGGTCGAAGAACTTGAACAGGGGCTGCAAAATTACGCCCGTGTTTTACAACAGCAAACTGAAATTAATGTCTGCCAGATGGCGGGCGGCGGCGCTGCGGGCGGTATGGGTATTGCGGCGGCGGTATTTCTCAATGCGGATATTAAACCGGGCATTGAAATTGTGTTGAATGCGGTCAATCTTGCGCAGGCAGTGCAGGGCGCAGCACTGGTGATTACCGGGGAAGGGCGCATCGACTCGCAAACGGCAGGCGGTAAAGCGCCGCTGGGTGTGGCGTCGGTGGCGAAGCAGTTTAATGTACCGGTGATTGGGATTGCTGGCGTATTGGGTGATGGCGTGGAAGTGGTGCACCAGTACGGCATTGACGCGGTATTCAGCATTTTGCCTCGTCTGGCACCTTTAGCCGAAGTGCTCGCCAGCGGTGAAACCAATCTCTTCAACAGCGCGCGAAATATTGCCTGCGCCATTAAAATAGGTCAGGGAATTAAAAACTAACCCTTACCTTTAAAGCGGATGCGATTTATATCGCATAAGAGTGCAGTACTCATGCCGGATGCGGCATGAGTACCATATCCTTCCTGAAAATCGCGCAAATTCTATATATTGCAGAGATCATGTAGGCCTGATAAGCGAAGCGCATCAGGCAATGTTACAAAAAAAGCCACGGTATAAACCGTGGCAAAATCCAACATAGCTAAAAATAATCAGGCGAGTGGTATGACTTAAATCTCTACGTCGCGGTTACAATCTTTCGAGTAAATATAGCTGAACGCTTCACCACGCCCTACACCATAACCAGCCTGTAAAGAATAAGCGCCCATAAAGATGTAATCGCCTTTTTTCACCGGGATCCAGTTATTGTCGAGGTTATAAACCCCCTGACCGGAAAGAATATAGGCACCGTGTTCCTGAACGTGTGTTTCGATATAACCGTGGCTGGCACCTGGTGCAAAAGAGAGGATATGCATGTTCATATCAAAACCTAACTCTTTGGGCAGAAAATCCAGCAGAATAACATCGTCCATGCCTTCATAATGAATGCGTTCCAGTTCGCTGGCATTGCCAGAAACCAGCCACGGTGCATAGCCTTCTACCGGAACATAGCGGCGCTTATATAAAAAGATTTGGCTGTCTTCGGCCTGGGCGTTAACAAACGTCATTAAGGAGCCTGGCGGGCAATAAAGATAGCCACCTTCGCTTAAGGCAAATGTTTTGCCTTCGGCTTTGGCAGTGATATTTCCAGAGATCACATACAGGAACGTTTCAATGCCTTCGCCACCGAAGCCCTGTTGGTTGCCACCGTTTTGATGCAGTGTGACCAGATAATCAACAAAAGAGGCACCCAGCTTTGGCGTGGAGAGGATTGTCGCGTCACAATTTTCAAAGCCCGGAATAATATTTTTTACCAGACCATCCGGGGTTAACAGTGCGAAATTACCGTGTTTAACAATCGCACGGTTAGCCAGTAAATCTTCGCGGTAACCGGTGACGTTATTTAAATATCCCATTTATGACTCCTTATTTCTGCCAGGCAAGTTGATAAAGCATGAGTGCCAACGTTTTGACCCCTTCGGCAAGGTCGGTAATATTGGTGCGTTCCGCCGGGTTATGGCTGATCCCGTTGATGCTGGGGATAAAAATCATGCAGGTTGGTACGCGAGGCGCGAAAATTTGCGCGTCGTGCCCGGCACCACTGTGCATCACCCGGTAATTCAGTTTTTCTCTTTCACACAATTCTGTCAGGGTGGCGACCAGCTCCTTATTCATCGGCACGGGTTCTTCGTCCATCCATAAATCGATATCAATACCAATGTCCATTTCATCGCAAATCGCCCGCATGTCGTTTTCTAACTGTTGGGTGAAATCGCGCAGCACGGCAGCGTCGGTATGACGACAATCAATGGTGAACGTGGTTTTACCCGGCACCACATTTACCGTATTCGGGCGCGGCTCTACTTTGCCAAAGGTCAGAACCAGCGGATCGCCCATCCTTTTCGCTTTTTCGACCGACTGATGGCAAATGCGACTGAAAGCGTAAACTGTATCACGACGATAACCCATCGGCGTGGTGCCTGCATGGTTTGATTCGCCGTTCAGCGTTACCGTATAACGACGCTGCCCGACAATTGCATTCACCACGCCAATTGATTGCCCATTACTTTCCAGCACACAGCCCTGTTCAATATGCAGTTCAACAAAGGCTTTAATATCCTGACGCGGAGTTAGTGGGGCGTTCGGAAGAGTAAATCCGCAAGCCTTCATCGCATCGACAAAACTATTTCCTTTGGCATCACAGATATTCCGCACGTCGTCAGGATTCGCCAGCCCAAAAATATTTTTACTGCCCCAGAAGACATACGGGAAGCGGCTGCCTTCTTCTTCTGCCATCGCCACCACTTCGACCGTACGTAGCGGCGCGCCGTATTGCGTTTTCAGCCAGTCAATTGCCAGCCACGCCGCCAGCGCGCCGAATTGCCCGTCAAGGTTACCGCCGTTAACCACGGTATCGATATGCGAACCGCTCAGAACCACTTCCTGTGGATATTCGGTGCCATTCAGGCGACCGTATAAATTCCCCACTTCATCGAAACGTGTTTCCAGCCCGCTTGCTGCCATTCTTTTTTTAAATTGCTGCTGGGTTTCCAGCCATTCCGGCGAATAAAGTAAACGGGTCATCCCACCCGCTGGGTCAGCGCCAAAAGAGGAAAGCCAGGGCAGCGTTTCTTCTATAGCTTGACGGAAATGTGTAATCATAAGAAAGTCCTGTCTCAATAATTATTGCGCAAAGGGATTTTTCGTTTCGTATGACGTGTTATAAAGCGCGTCGGAAATTAAATACTGGTAAATATCATCAACAATTTCGATGCCTTCGACGGCGGCTTTGCGTTGTTTAATATCCTGATCCTGTCCGGGATAATAAACCTGATTAAAACCGGGCGCGGGGGTAATGGCATTTAATTCGCGCATGGTCTGGCTAAGATGTTGACGGAATAATTCGCTGGAGGAGAAAAAGTTCGGATTAATAACTATATGTAATTGCCCCAAATTACGCCCTGCGTGTAAATCGTCATACATCGAACTAACCTGTCGCCCGAACGGTAAGCCGAGTAAGACGCCTGAGAGGACGTCAATCATCATCATCAGGCCATACCCTTTTGGCCCGGCGGCGGGGAGCAGAGCATGTACCGCGAACGGATCGGTTGTTGGTACACCGTTTTTATCGACCGCCCAGGTATCCGGGATAGACATATTACGCGAGCGGGCGTCGAGCACTTTTCCCCATGCCTGTACGGTAGTCGCCATATCAAAGGTAAGGATCTCGTCGCCTTCTCCCGGCGCGGCAAAGGCCAGGGGGTTAGTACCGTAGTAAATTTCCGCGCCGCCAAACGGCACCACCATTGGATCGGACTGGCACATCGAAATGCCAATGAATCCGGCGCGGGCTGCCTGCTGCACAAAATAAGAGATTGCGCCGCTGTGACCCATCCGGCTGATACCGACCACCGCAACGCCATTTTGCTGGGCGGTTTTGATGGCATGTTCCATACCCATTTTCGCCGCGACCTGTCCGGCGGCATTGTCGGCATGTAAAATTGCCGAGCACGGCCCGGTTTCCTCAAGACGAAACTCCGGTTCGCGGTTGGTGCCGCCTTTTGAAATGCGTTCCGCGTAGTATTCCACGCGCACCGCGCCATGAGAGTGGATCCCTCTGGCATCGGCGTAAACCAATACTTCAGCCACGGTTGCAGCGTGCTCACGTTTTAACCCAGCCTGGCAGAGTTTATTCTCAATTAGCTGGTGGAGTGTTTCCCGACTGATTTTCATCTGTCTTCCTTTTTAACGACGGTGTGAAGCATGACTGCAATTAACATACAGGGAAAATATCTGGATTATGTGATCCAGACAGGCAAAAAAATATAGTTAGAATTTATTTGATAATCCGCTCACTTTTAACCTGATTTTTAAAACAACAACGCTTATTAAAAAATAATGAGTAATAGCCTGGTGGTTATTTGAATTCTTTTGTTAATAATTCCTGTGTGATATTCATCACCTTATTTACTCGTTGTCATCGATACCGTAATCGCCACATTAACACTGCTCGTGCAATTGCCATGGGTGCAATTTTTAAGGAGTTGTTATGATCCACGCCTTTATTAAAAAAGGGTGTTTTCAGGATTCGGTCAGTTTAATGATTATTTCACGAAAACTCAGCGAATCAGAAAATGTTGATGATGTTTCCGTAATGATGGGTACGCCCGCCAATAAAGCGTTATTAGATACCACAGGTTTCTGGCATGACGATTTTAATAACGCCACGCCGAACGATATTTGCGTGGCAATTCGTAGCGAAGCGGCGGATGCGGGGATCGCGCAGGCGATTATGCAGCAGCTTGAAGAGGCGCTAAAACAACTGGCGCAGGGGTCAGGCAGCAGCCAGGCGTTGACGCAGGTGCGTCGCTGGGACAGTGCCTGTCAGAAATTACCCGATGCCAATCTGGCGCTGATTTCAGTGGCTGGCGAGTATGCGGCGGAGCTGGCAAACCAGGCGCTGGATCGCAACCTCAACGTGATGATGTTCTCCGATAACGTCACGCTGGAAGATGAAATCCAACTTAAAACCCGCGCGCGGGAAAAAGGCTTGCTGGTGATGGGGCCGGACTGCGGTACGTCGATGATTGCCGGCACACCGCTGGCTTTTGCTAACGTGATGCCGGAAGGCAATATTGGCGTCATTGGCGCTTCCGGTACCGGGATTCAGGAGCTGTGTTCGCAGATTGCGCTGGCAGGGGAGGGAATTACTCACGCGATTGGCCTTGGCGGGCGCGACCTCAGCCGTGAAGTGGGCGGCATCAGTGCGCTAACAGCGCTGGAAATGCTCAGTGCAGACGAGAAAAGCGAAGTGCTGGCATTTGTTTCAAAACCACCTGCCGAAGCTGTGCGTCTGAAAATTGTTAATGCCATGAAAGCAACCGGCAAACCGACGGTGGCGCTGTTTTTAGGTTATACCCCGGCGGTGGCCCGCGACGAGAATGTCTGGTTTGCCTCCTCGCTGGATGAGGCCGCACGCCTGGCTTGCCTGCTTTCACGCGTCACGGCGCGACGTAACGCAATAGCGCCTGTCAGCAGCGGATTTATTTGCGGTTTGTATACCGGCGGTACGCTGGCTGCCGAAGCGGCGGGATTACTTGCCGGACACCTTGGCGTGGAAGCCGACGATACCCATCAACATGGCATGATGCTGGACGCCGATAGCCACCAGATTATTGACCTCGGCGATGATTTCTACACCGTCGGGCGTCCCCATCCGATGATCGACCCAACCTTACGCAACCAGTTAATTGCCGATCTCGGCGCTAAACCGCAAGTGCGCGTGTTGCTGCTTGATGTCGTGATTGGCTTCGGTGCGACCGCCGATCCTGCCGCCTCGCTGGTGAGCGCCTGGCAAAAAGCCTGTGCCGCGCGTTTAGATAATCAACCACTGTATGCCATTGCCACGGTGACAGGCACTGAACGTGACCCGCAATGCCGCTCGCAGCAAATCGCCACGCTGGAAGATGCGGGGATTGCGGTCGTGAGTTCGCTACCGGAAGCCACCTTGCTGGCGGCAGCGTTAATTCATCCGCTCTCGCCTGCCGCACAGCAACACACACCGTCATTACTGGAAAACGTCGCCGTGATTAACATCGGATTACGCAGCTTTGCGCTGGAGCTACAAAGCGCCAGCAAACCGGTTGTGCATTACCAATGGTCGCCAGTCGCCGGTGGCAATAAAAAACTGGCTCGTTTATTAGAACGTTTGCAATAAGGGGTTCCCATGTTTACATCAGTGGCGCAAGCCAATGCTGCGGTTATCGAACAAATTCGTCGCGCTCGTCCACACTGGCTGGATGTGCAACCGGCTTCTTCACTTATCAGCGAACTAAACGAGGGCAAAACACTGCTTCACGCCGGGCCGCCAATGCGCTGGCAGGAGATGACCGGACCCATGAAAGGGGCGTGCGTGGGCGCATGTCTGTTCGAAGGTTGGGCGAAAGATGAAGCGCAGGCGCTGGCAATACTGGAGCAGGGGGAAGTGAACTTCATTCCTTGTCACCATGTGAATGCCGTCGGGCCAATGGGCGGTATTACTTCTGCCAGTATGCCGATGCTGGTGGTTGAGAACGTGACCGACGGCAACCGGGCGTACTGCAACCTCAACGAAGGTATCGGCAAAGTGATGCGTTTTGGCGCTTACGGCGAAGATGTCCTGACTCGCCATCGCTGGATGCGCGATGTGTTAATGCCAGTATTAAGCGCGGCGCTGGGGCGCATGGAGCGCGGTATCGATCTCACGGCGATGATGGCGCAGGGCATTACGATGGGCGATGAGTTCCATCAACGCAATATTGCTTCCTCTGCACTGTTAATGCGTGCGCTGGCCCCACAAATTGCTCGCCTCGATCATGATAAACAGCACATCGCCGAAGTGATGGATTTCCTCAGCGTGACCGATCAGTTCTTCCTCAACCTCGCGATGGCTTACTGCAAGGCGGCGATGGATGCTGGCGCGATGATCCGCGCAGGCAGCATCGTCACGGCAATGACCCGCAACGGCAATATGTTCGGGATTCGGGTAAGCGGGCTGGGCGAACGCTGGTTTACTGCGCCTGTAAACACTCCGCAAGGTCTGTTTTTCACCGGCTTCTCGCAGGAGCAGGCGAACCCGGATATGGGCGATAGCGCGATTACCGAAACCTTTGGTATCGGAGGTGCGGCAATGATCGCAGCGCCTGGCGTAACGCGCTTTGTCGGTGCGGGTGGCATGGAAGCGGCAAGAGCGGTATCTGAAGAGATGGCGGAAATTTACCTTGAACGCAATATGCAGTTGCAGATCCCAAGCTGGGATTTTCAGGGCGCGTGCCTGGGGCTGGACATTCGTCGCGTGGTAGAAACCGGCATTACGCCACTCATCAATACCGGTATCGCCCATAAAGAGGCGGGGATCGGGCAGATTGGCGCAGGCACCGTGCGGGCACCGCTGGCGTGCTTTGAACAGGCGCTGGAAGCACTGGCTGAAAGCATGGGTATTGGTTGAGGAACGCGCAATGACGATCATCCATCCTCTGCTTGCCAGTAGTAGCGCACCGAATTATCGCCAGTCCTGGCGGTTAGCGGGAGTGTGGCGGCGGGCGATTAACCTGATGACGGAAAGCGGCGAACTGTTAACGTTGCATCGTCAGGGTAGTGGTTTCGGCCCCGGAGGATGGGTGCTTCGCCGTGCGCAATTCGATGCGTTATGCGGTGGATTATGCGGCAATGAACGACCACAGGTTGTGGCTCAAGGGATTCGCCTCGGGCGTTTCACGGTTAAACAGCCACAGCGTTATTGTTTGCTGCGTATTACGCCGCCTGCGCATCCTCAACCACTTGCAGCTGCATGGATGCAACGCGCGGAGGAAACCGGGCTTTTCGGGCCACTGGCGTTGGCGGCAAGCGATCCGCTGCCTGCTGAGTTACGCCAGTTTCGTCACTGTTTTCAGGCCGCGCTCAATGGCGTTAAGACCGACTGGCGGCACTGGCTGGGTAAAGGCCCCGGATTAACGCCGAGTCATGATGACACGCTGAGCGGAATGCTGCTGGCGGCCTGGTATTATGGCGCTTTAGATGCGCGCTCCGGTCGTCCGTTTTTTGCCTGTTCCGACAATCTGCAACTCGTTACCACAGCGGTGAGCGTCAGTTATTTACGTTATGCCGCGCAAGGATATTTCGCCTCGCCACTCCTGCACTTTGTTCATGCTCTGAGTTGCCCGAAACGTACCGCTGTTGCGATTGATTCGCTGCTGGCGCTGGGGCATACGTCAGGGGCAGATACGCTGCTGGGGTTCTGGCTTGGCCAACAATTATTACAAGGAAAACCATGAAAACACTGGTTGTGGCTCTTGGGGGCAACGCCTTACTCCAGCGCGGTGAGGCGCTGACGGCAGAAAATCAATATCGCAATATCGCCAGTGCTGTACCCGCGCTGGCACGCCTGGCCCGTTCTTATCGGTTGGCGATTGTTCACGGCAACGGGCCGCAGGTGGGGCTGCTGGCATTGCAGAATCTGGCGTGGAAAGAGGTAGAACCGTATCCGCTGGATGTGCTGGTTGCGGAAAGCCAGGGGATGATTGGCTATATGCTGGCGCAGAGTTTGAGCGCACAGCCGCAGATGCCGCCCGTGACGACGGTGCTGACGCGCATTGAGGTTTCGCCTGATGATCCGGCGTTTTTGCAGCCAGAGAAATTTATTGGTCCGGTTTATCAGCCAGAAGAACAAGAGGCACTGGAAGCGGCTTACGGCTGGCAGATGAAACGTGATGGTAAATATTTGCGCCGGGTGGTGGCGTCTCCGCAACCGCGTAAAATTCTCGACAGCGAAGCCATCGAGTTGTTGCTCAAAGAGGGGCATGTGGTGATTTGCAGTGGCGGCGGCGGTGTGCCTGTGACGGATGACGGAGCAGGGAGTGAAGCAGTGATTGATAAAGATCTCGCCGCTGCGTTGCTCGCCGAGCAGATTAATGCAGATGGACTGGTGATCCTCACCGATGCTGATGCGGTATATGAAAACTGGGGAACGCCGCAGCAACGTGCCATTCGCCATGCCACACCGGATGAGTTAGCGCCATTTGCCAAAGCCGATGGTTCGATGGGGCCGAATGTAACGGCGGTGAGTGGTTATGTCAGAAGCCGTGGTAAACCCGCGTGGATTGGGGCGTTATCGCGAATTGAAGAGACGCTGGCGGGCGAAGCGGGGACCTGTATTTCGCTGTAGTCGTAGGCATTAGACATTTGTGCCTGATGCGACGCTTGACGCGTCTTATCAGGCCTACAACCGGTGCCGCATCCGGCAATTGGTGCACAATGCCTGATGCGATGCTTGACGCATCTTATCAGGCCTACAATGGGTACCGGATCGGTAGGCCGGATAAGGCGTTTACGCCGCATCCGGCAAGAATAGAGCACCAGTTAACCGAACTTACTCTGCGCCCAAATCACGCCGCTGGCATATTCCGGCGGCAGCAGCGGGATTAAGGCTTCCAGCGTCGCAGTCAGACGCGATGTGTCGCTGTCGGTCAAATTCAGATGCCCCACTTTACGCCCCGGACGGACTTCTTTGTCGTACCAGTGCAGATGCACCAGCGGCAGTTTCAGCCAGTCATAATTCACATCGCTACCAATCAGATTGATCATCACCGACGGATTATTCACCACTGGTTGCGGTAACGGCAGATCGGTAATCGCCCGCAGATGCAGCTCAAACTGGCTGATGCTGGCACCGTTTTGTGTCCAGTGACCGCTGTTATGCACACGCGGTGCCAGTTCGTTGATCAACAGACCTTGCGGGGTGACAAAACACTCCATCGCCATCACGCCCACATAGCCCAGCTCCTGCATAATCGCCGACAGCATCTCTTCGGCTTGCGCCTGCTGCTGTGCGTTGGCCTGCGGAAAAGCGACGCTGGTGCGCAAAATACCGTCCTGATGCAGGTTATGCGTCAGCGGATAAAACACGGTGCTGCCATCAAAGCCGCGCGCGCCAACCAGCGACACTTCACCAGAGAAGTTAATGCCCTGCTCGACAATACATTCGCCGTAACACTCTGCCGGTAACTGTTCGGTTTCATTTGCGCGTAAACGCCATTGACCGCGACCGTCATAACCACCAGTGCGACGCTTAACAATCGCCAGCTCACCTAAACGATCAAACACCGCAGGCCACTCGCTGCGTTCGGCAAGTAACTGCCACGGTGCAGTCGGCAGGTGGAGCTTATCGAAAAGCTGCTTCTGAGTCAGACGGTCAGCAATAATCGGGAACACATCGCGGTTCACAAAGGCCGGATGGCGCGCCAGCTCGCGGGTTAATGCGGTTTCCGGCCAGCGTTCTATCTCAGCGGTAATCACGCTTTGTTGAAAAGGCACCGCCGCCGGTTCAGCGTCCAGCCCGACTGGCCAGACAGCAATGCCTAACGGTTCGCCTGCCTGACGCAGCATACGGCCTAACTGCCCGTTACCGAGGACGCAAACCTGTTTCATGCCGCACCTCGCGGGTCCGGGTTTTCCAGCACTTCGTCGGTCTGGGCTTTGCGCCAGTCATTCAGACGCTGGTGCAGTTCTTTATCATGAGTCGCAAGAATTTGTGCTGCCAGTAACGCCGCGTTTGCCGCGCCAGCTTTACCAATCGCCAGCGTACCCACCGGAATGCCGCGCGGCATTTGTACGATGGAGTAGAGGCTATCGACACCGCTCAGTGCGGCGCTCTGTACTGGCACGCCCAGCACCGGCACCAGCGTTTTGGCGGCAATCATGCCTGGCAGATGCGCTGCGCCGCCTGCGCCCGCAATAATCACCTGATAACCGTTCTCTTCGGCGCTTTCGGCGAAGCTGAACAGTTTATCGGGGGTGCGGTGAGCAGAAACCACTTCAACGTGGTGCGGGACATTCAGGATTTCGAAGATTTCGGCGGCGAACTGCATGGTAGCCCAGTCGCTTTTGGACCCCATCACGATGGCGACACGCGCCGGATTATTGCGGGAAGACATGCGTCTTAAAACTCCTGTGGTGCACAACTCTCGGCTTTAGAGGGCACAGAGAATAGCACGGAAAGAGAGCAAGGAAAACGGTTGCGTGGCTGTGAAATCAGCAAAGTTGCGGGTTTTTTAAAACGGAAAATGAATCAGCTCAACGTCATCCGCCGTGACTTTCACCATTGAACCTTCCGTATGCCAGGCACCCAGTACCACGCGAAAAGCAGGTTGCTGATTGGCGATAAGTTCATGCACCGCCGGGCGATGGGTATGCCCGTGGATCAGCCATTGCACCTGATGTTTTTCCATCGCACTGACCACCGCGTTTTGGTTAACGTCCATGATCGCCAGCGATTTACTGCTGTTGGCTTCTTTGCTGTTCGCGCGCATTCGCGCGGCAATGCGTTTGCGCACAAACAACGGCAGGGCGAGGAATAGCGTCTGCAGCCAGGGTTTGTGGACCTTGGCGCGAAAAGCCTGATAACCCGCGTCATCGGTGCACAGCGTGTCGCCATGCATAATCAACACCCGGCGACCATAAAGTTCGAGCACCTTTTCTTCCGGCAATAACGTCATGCCACTTTCACGGGCAAAGCGTTTGCCGAGCAGAAAATCACGGTTGCCATGAATGAAATAACAGGGAACGCCGGAATCGGACACCGCTTTGATCGCCGCCGCCATCTTGCGATGGAGTGGGTTGGGATCGTCGTCGCCAATCCATGCTTCAAACAGATCGCCAAGAATATACAGCGCGTCGGCCTTGCGGGCTTCCCCCGCTAAAAAACGCAGAAAACCGGCGGTGATCGCCGGTTCTTCCACGCAGAGATGAAGATCTGCAATAAAGAGTGTCGCCACGATTACTCGCTAACGGTCACGCTTTCAATGATAACGTCTTCTTTTGGCACGTCCTGGTGCATACCGCTACGACCGGTTGCAACACCTTTGATTTTGTCTACCACGTCCATGCCGTCAACCACTTCAGCAAACACGCAGTAGCCCCAACCTTGCAGGCTTTCGCCAGAGAAGTTCAGGAAGTCGTTATCAACCACGTTGATGAAGAACTGTGCAGTTGCAGAGTGCGGAGCCTGAGTACGTGCCATTGCCAGCGTACCACGGGTATTTTTCAGGCCGTTGTTGGCTTCGTTTTTGATCGGTTCTTTGGTGGCTTTTTGTTTCATGCCCGGTTCAAAACCGCCGCCCTGAATCATAAAGCCGTTGATAACACGGTGGAAAATGGTGTTGTTGTAAAAACCTTCGCGGCAGTAGTCCAGGAAGTTTTTAACTGTTTCAGGTGCTTTATCGTCAAAAGTTTTGATGACAATATCGCCGTGATTGGTGTGGAAAGTAACCATTTTTGCATCCTGTTCCGTTTGATTGGTGCTTCAACCCAGTTCGGGTCATATATAGGGTGGTGTTATAGCATAACCGCACGATCGGATCATCACGCAATGTATGCTGATTCGCGCGGGAAATATGGGTATTATACGCAACTCAATTACCCACACATGTCTAAACGGAATCTTCGATGCTAAAAATCTTCAATACTCTGACACGCCAAAAAGAGGAATTTAAGCCTATTCACGCCGGGGAAGTCGGCATGTACGTGTGTGGAATCACCGTTTACGATCTCTGTCATATCGGTCACGGGCGTACCTTTGTTGCTTTTGACGTGGTTGCGCGCTATCTGCGTTTCCTCGGCTATAAACTGAAGTATGTGCGCAACATTACCGATATCGACGACAAAATCATCAAACGCGCCAATGAAAATGGCGAAAGCTTTGTGGCGATGGTGGATCGCATGATCGCCGAAATGCACAAAGATTTTGATGCTTTGAACATTCTGCGCCCGGATATGGAGCCGCGCGCGACGCACCATATCGCAGAAATTATTGAACTCACTGAACAACTGATCGCCAAAGGTCACGCTTATGTGGCGGACAACGGCGACGTGATGTTCGACGTCCCGACCGATCCAACTTATGGCGTGCTGTCGCGTCAGGATCTCGACCAGCTGCAGGCAGGCGCGCGCGTTGACGTGGTCGACGACAAACGCAACCCAATGGACTTCGTTCTGTGGAAGATGTCGAAAGAGGGCGAACCGAGCTGGCCGTCTCCGTGGGGCGCGGGTCGTCCTGGCTGGCACATTGAATGTTCGGCAATGAACTGCAAGCAGCTGGGTAACCACTTTGATATCCACGGCGGCGGTTCAGACCTGATGTTCCCGCACCACGAAAACGAAATCGCGCAGTCCACCTGTGCCCATGATGGTCAGTATGTGAACTACTGGATGCACTCGGGGATGGTGATGGTTGACCGCGAGAAGATGTCCAAATCGCTGGGTAACTTCTTTACCGTGCGCGATGTGCTGAAATACTACGACGCGGAAACCGTGCGTTACTTCCTGATGTCGGGCCACTATCGCAGCCAGTTGAACTACAGCGAAGAGAACCTGAAGCAGGCGCGTGCGGCGCTGGAGCGTCTCTACACTGCGCTGCGCGGCACAGATAAAACCGTTGCGCCTGCCGGTGGCGAAGCGTTTGAAGCGCGCTTTATTGAAGCGATGGACGACGATTTCAACACCCCGGAAGCCTATTCCGTACTGTTTGATATGGCGCGTGAAGTAAACCGTCTGAAAGCAGAAGATATGGCAGCGGCGAATGCAATGGCATCTCACCTGCGTAAACTTTCCGCTGTATTGGGCCTGCTGGAGCAAGAACCGGAAGCGTTCCTGCAAAGCGGCGCGCAGGCAGACGACAGCGAAGTGGCTGAGATTGAAGCGTTAATTCAACAGCGTCTGGATGCCCGTAAAGCGAAAGACTGGGCGGCGGCGGATGCGGCGCGTGATCGTCTTAACGAGATGGGGATCGTGCTGGAAGATGGCCCGCAAGGGACCACCTGGCGTCGTAAGTAATTGCGCTATTGCCGGATGCGAGTTTTCGCATCCGGTTATCGTCTGCGCCACCACAACATTCCCATCAGTAGCATCCCCGGCAACCACACCCACATCAATTCAGAAATAATCACCTGATGCCCGTACGGCGTGGTGTAACGAGACAATGCAAACGGCGCGACTTTTATCACCTGCCAGGGAGCGAAAAAGCGTTCATCTGACCACGGCCACAGCCAGCCAACGCCTTTACCGCCAGTGGTTACCGAATCCAGCAAGCTGTGCGATAGCAACGAGACGGTTAAAAACAGCCAGCAGCGAATCAGCCCAGCCCTGAACCATCGGCGTCCAATAAACACACATAACAGCGGGACAACAAACGCAAACACCAGCGAATGGGTAAACCCGCGATGACCAAAAACATTGCCGTAAGCAACGCCAAATTTAAACGACAATACGTCGGCGTCGGGCAGCATCGCCAGGATGATTCCGGCAAATAACAGACGCGGAGGGATGACTTTCGAACCCAACCCTAAACCAATGCATAGGGGAACGGCGGCGTGCGTAATAACGGTTGGCATGATGGTCGCTTCGGCAAAATGTCGATGCTATCAGCATGGATGAACGGGGCGTAGAGGGCAAAAGTCTGAAAAGAGAACCGGCCTGTTGATACAGGCCGGGAAAGGGATCAGGCAACAACCTGTACGCTGTGACCTGCAAAACTCACTGTCTGACCGGCGACGATTTTGCAGCGTTTGCGCGTTTCAACCGCACCGTCGACTTTCACCTGGCCTTCGGCAATCGCGATTTTCGCCTGCGCGCCGCTTTCGCTCCAGCCTTCCAGTTTCAGCAAGTCGCACAGCTCAACGTGCGGATGTTTACCTAAAGAAAATGTCGCCATGTTACTCATCCTGTGGATCATGATATTCAACGCACGCCTGTAGCGTGTTTTCAATCAGCGTGGCAACCGTCATCGGGCCAACGCCGCCGGGAACAGGCGTAATGTATGAGGCGCGTTTAGCCGCGTCTTCAAACACGACGTCGCCCACAACTTTGCCATTTTCCAGACGGTTGATGCCGACATCAATCACAATTGCGCCTTCTTTGATCCAGTCACCGGGAATAAAGCCTGGCTTGCCAACGGCAACGATCAATAGATCGGCATTTTCTACGTGATGACGCAGATTTTTAGTGAAGCGGTGAGTCACTGTAGTGGTGCAACCTGCCAGCAGCAGTTCCATGCTCATCGGGCGGCCAACGATATTCGATGCGCCAATCACCACGGCGTTGAGGCCGAAGGTATCAATGTTGTAACGCTCAAGCAGCGTGACGATACCGCGCGGGGTGCAGGGACGCAGACGCGGCGCGCGCTGGCACAGACGACCGACGTTGTAAGGATGGAAACCGTCCACGTCTTTGTCCGGATGAATACGTTCCAGCACTTTGACGTTATCAATACCCGCCGGTAACGGCAGTTGAACCAGAATGCCATCGATGGTGTTGTCGGCATTCAGCGTATCGATAAGCTCCAGCAGCTCCGCTTCGCTGGTGGTTTCCGGGAGGTCATAAGAGCGGGAGACGAACCCGACTTCTTCACAAGCCTTGCGTTTGCTTGCGACATAAATTTGCGATGCAGGGTTACTACCCACCAGCACAACGGCCAGTCCTGGTGCCCGCAGTCCGGCTGCAATACGCGCCTGAACTTTTTGAGCAACTTCAGAGCGCACCTGCTGCGCAATCGTTTTACCGTCAATAATCTTTGCTGCCATCAGAGAGAGGATTCCATCTGTTACGTAGATCGAAGGGGATGCGCCTATTTTGTCAGAAGCGGGGCGCGCTGTCAGGTTTCGTTTCAGATTTATCGCGTGAAGCGACCTCTTGCGAAGGTGAGGCGCACCGTCGCTGAGACTGAAAGCTTCATTTTTCGTCCATGATGGCGTTGTAAATCTGGAACTGATTTATTTCCTTGTCTAAGGATTAAGATAATTTAAGAAATACCTGACAATATAAAAAGAATTTTCAGCCTGGTAATTTACCGCTTCAGGTCTATATTTGTGTTGAATATATTTTGCGCGGAAGTATTCATCTAACGGGGCTCTCTATTTTTTAGAATAGAGTGCATATTTTCAATTAAGACATTCTTAGAGGATAAAAAGGAATTTACTACTATCAGTGTCTTAAATAAAGTAATCGGTTATATACGGATGTGGAGTCGATAAATGAGATTGAAGGAATATATATGAAATTAAGATTTATTTCGTCTGCGCTGGCTGCCGCACTATTCGCCGCTACGGGTAGTTATGCTGCCGTTGTAGATGGCGGTACAATTCACTTTGAAGGCGAACTGGTGAATGCTGCCTGTTCAGTGAATACTGACTCGGCAGACCAGGTTGTCACACTCGGTCAATATCGTACCGATATTTTCAATGCTGTTGGTAATACCTCTGCATTAATTCCATTCACCATTCAGTTGAACGACTGCGATCCTGTTGTTGCCGCTAATGCTGCCGTTGCATTTTCTGGTCAGGCTGATGCAATCAATGATAATTTATTGGCCATTGCATCCAGTACCAATACAACAACAGCAACGGGTGTCGGTATTGAAATACTTGATAATACATCCGCAATTCTCAAACCTGATGGGAATAGCTTCTCAACCAACCAGAACTTGATCCCCGGGACCAACGTTCTTCATTTTTCTGCACGTTATAAAGGCACCGGTACAAGTGCATCAGCAGGGCAAGCAAATGCTGACGCGACTTTTATTATGAGATATGAATAATCAAAACCACGTTGTTTTGAATTATATATCACGTCTTATAACAAAGTAATGTACCGGTTGTCTGAAGCGGTATGGTGGCAATGTAAATCGAAATCATGTTCACTTTGTATCATGCCGCTTTATTAAATGAAAAGGGAATGATGTGTTGTAAGAAACCAAAGCAATCATTTCTTTATATTCCTTATTTTTGCCGTCAGGAATACACAAGGCGTATTAACTATGATGACTAAAATAAAGTTATTGATGCTCATTATATTTTATTTAATCATTTCGGCCAGCGCCCATGCTGCCGGAGGGATCGCATTAGGTGCCACGCGTATTATTTATCCCGCTGATGCTAAACAGACTGCGGTATGGATTAGAAATAGCCATACCAATGAGCGCTTTCTGGTCAATTCGTGGATTGAAAACAGCAGCGGTGTAAAAGAAAAGTCATTCATCATTACACCGCCACTGTTTGTTAGTGAACCCAAAAGCGAAAATACTTTGCGTATTATTTACACCGGTCCACCGCTGGCAGCAGATCGTGAGTCTCTGTTCTGGATGAATGTTAAGACGATCCCTTCGGTAGATAAAAATGCATTGAACGGCAGGAATGTTTTGCAACTGGCGATTTTATCGCGCATGAAATTATTTCTCCGTCCAATTCAATTACAAGAATTACCCGCAGAAGCGCCGGACACACTCAAGTTTTCGCGATCCGGTAACTATATCAATGTTCATAATCCATCACCTTTTTATGTCACCCTGGTTAACTTACAAGTGGGCAGCCAAAAGTTGGGGAATGCTATGGCTGCACCCAGAGTTAATTCACAAATTCCCTTACCCTCAGGAGTGCAGGGAAAGCTGAAATTTCAGACCGTTAATGATTATGGTTCAGTAACTCCGGTCAGAGAAGTGAACTTAAACTAACCGAATCATCTGACAATATCAGAGCTAATTATGAAAATACCCACTACTACGGATATTCCGCAGAGGTATACCTGGTGTCTGGCCGGAATTTGTTATTCATCTCTTGCCATTTTACCCTCCTTTTTAAGCTATGCGGAAAGTTATTTCAACCCGGCATTTTTATTAGAGAATGGCACATCCGTTGCTGATTTATCGCGCTTTGAGAGAGGTAATCATCAACCTGCGGGCGTGTATCGGGTGGATCTCTGGCGTAATGATGAGTTCATTGGTTCGCAGGATATCGTATTTGAATCGACAACAGAAAATACAGGTGATAAATCAGGTGGGTTAATGCCCTGTTTTAACCAGGTACTTCTTGAACGAATTGGCCTTAATAGCAGTGCATTTCCCGAGTTAGCCCAGCAGCAAAACAATAAATGCATCAATTTACTGAAAGCTGTACCTGATGCCACAATTAACTTTGATTTTGCAGCGATGCGCCTGAACATCACTATTCCTCAGATAGCGTTGTTGAGTAGCGCTCACGGTTACATTCCGCCTGAAGAGTGGGATGAAGGTATTCCTGCTTTACTCCTGAATTATAATTTCACCGGTAACAGAGGTAATGGTAACGATAGCTATTTTTTTAGTGAGCTCAGCGGGATTAATATTGGCCCGTGGCGTTTACGCAACAATGGTTCCTGGAACTATTTTCGCGGAAATGGATATCATTCAGAACAGTGGAATAATATTGGCACCTGGGTACAGCGCGCCATTATTCCGCTGAAAAGTGAACTGGTAATGGGAGACGGCAATACAGGAAGTGATATTTTCGATGGCGTTGGATTTCGTGGTGTACGGCTTTATTCTTCTGATAATATGTATCCTGATAGCCAGCAAGGGTTTGCCCCAACGGTACGTGGGATTGCCCGTACGGCGGCCCAGCTAACGATTCGGCAAAATGGTTTTATTATCTATCAAAGCTATGTTTCCCCCGGCGCTTTTGAAATTACAGATTTGCACCCGACATCTTCAAATGGCGATCTGGACGTCACCATCGACGAGCGCGATGGCAATCAGCAGAATTACACAATTCCGTATTCAACAGTGCCAATTTTACAACGCGAAGGGCGTTTCAAATTTGACCTGACGGCGGGCGATTTTCGTAGCGGTAATAGTCAGCAATCATCGCCTTTCTTTTTTCAGGGTACGGCACTCGGCGGTTTACCACAGGAATTTACTGCCTACGGCGGGACGCAATTATCTGCCAATTACACCGCCTTTTTATTAGGGCTGGGGCGCAATCTCGGGAACTGGGGCGCAGTGTCGCTGGATGTAACGCATGCGCGCAGTCAGTTAGCCGACGCCAGTCGTCATGAGGGGGATTCTATTCGCTTCCTCTATGCGAAATCGATGAACACCTTCGGCACCAATTTTCAGTTAATGGGTTACCGCTATTCGACACAAGGTTTTTATACCCTTGATGATGTTGCGTATCGTCGAATGGAGGGGTACGAATATGATTACGACGGTGAGCATCGCGATGAACCGATAATCGTGAATTACCACAATTTACGCTTTAGCCGTAAAGACCGTTTGCAGTTAAATGTTTCACAATCACTTAATGACTTTGGCTCGCTTTATATTTCTGGTACCCATCAAAAATACTGGAATACTTCGGATTCAGATACGTGGTATCAGGTGGGGTATACCAGCAGCTGGGTTGGCATCAGTTATTCGCTCTCATTTTCGTGGAATGAATCTGTAGGGATCCCCGATAACGAACGTATTGTCGGACTTAATGTTTCAGTGCCTTTCAATGTTTTGACCAAACGTCGCTACACCCGGGAAAATGCGCTCGACCGCGCTTATGCCTCCTTTAACGCCAACCGTAACAGCAACGGGCAAAATAGCTGGCTGGCAGGTGTAGGTGGGACCTTACTGGAAGGCCACAACCTGAGTTATCACGTAAGCCAGGGTGATACCTCGAATAATGGGTACACGGGCAGCGCCACGGCAAACTGGCAGGCCGCTTACGGTACGCTGGGGGGCGGGTATAACTACGACCGCGATCAACATGACGTTAACTGGCAGCTGTCTGGCGGTGTGGTCGGGCATGAAAATGGCATAACGCTGAGCCAGCCTTTAGGGGATACCAATGTTTTGATTAAAGCGCCTGGCGCAGGCGGTGTACGCATTGAAAATCAAACTGGCATTTTAACCGACTGGCGCGGCTATGCGGTGATGCTGTATGCCACGGTTTATCGGTATAACCGTATCGCGCTTGATACCAATACGATGGGGAATTCCATCGATGTTGAAAAAAATATTAGCAGCGTTGTGCCGACGCAAGGCGCGTTGGTTCGTGCCAATTTTGATACCCGCATAGGCGTGCGGGCGCTCATTACCGTTACCCAGGGCGGAAAACCGGTGCCGTTTGGATCACTGGTACGGGAAAACAGTACCGGAATAACCAGTATGGTGGGTGATGACGGGCAAGTTTATTTAAGTGGTGCGCCATTGTCTGGTGAATTACTGGTTCAGTGGGGAGACGGCGCGAACTCACGCTGCATTGCGCACTATGTATTGCCGAAGCAAAGCTTACAGCAAGCCGTCACTGTTATTTCGGCAGTTTGCACACATCCTGGCTCATAAAGGAAATTATCAATAAGATAATCTGCAGATTATTATTGGCGATGGCATGTTTGTGTCTGGCAAACATATCCTGGGCTACTGTTTGTGCAAATAGTACTGGCGTAGCAGAAGATGAACACTATGATCTCTCAAATATCTTTAATAGCACCAATAACCAGCCAGGGCAGATTGTTGTTTTACCGGAAAAATCCGGCTGGGTAGGTGTCTCAGCAATTTGTCCACCCGGTACGCTGGTGAATTATACATACCGTAGTTATGTCACCAACTTTATTGTTCAGGAAACTATCGATAATTATAAATATATGCAATTACATGATTATCTATTAGGTGCGATGAGTCTGGTTGATAGTGTGATGGATATTCAGTTCCCCCCGCAAAATTATATTCGGATGGGAACAGATCCTAACGTTTCGCAAAACCTTCCATTCGGGGTGATGGATTCTCGTTTAATATTTCGTTTAAAGGTTATTCGTCCCTTTATTAACATGGTGGAGATCCCCAGACAGGTGATGTTTACCGTGTATGTGACATCAACGCCTTACGATCCGTTGGTTACACCTGTTTATACCATTAGTTTTGGTGGCCGGGTTGAAGTACCGCAAAACTGCGAATTAAATGCCGGGCAGATTGTTGAATTTGATTTTGGTGATATCGGCGCATCGTTATTTAGTGCGGCAGGGCCGGGTAATCGACCTGCTGGTGTCATGCCGCAAACCAAGAGCATTGCGGTCAAATGTACGAATGTTGCTGCGCAGGCTTATTTAACAATGCGTCTGGAAGCCAGTGCCGTTTCTGGTCAGGCGATGGTGTCGGACAATCAGGATTTAGGTTTTATTGTCGCCGATCAGAACGATACGCCGATCACGCCTAACGATCTCAATAGCGTTATTCCTTTCCGTCTGGATGCAGCTGCGGCAGCCAATGTCACACTTCGCGCCTGGCCTATCAGTATTACCGGTCAAAAACCGACCGAAGGGCCGTTTAGCGCGCTGGGGTATTTACGCGTCGATTATCAATGAGGTACGGAGAATGAGAAGAGTACTCTTTAGCTGTTTCTGCGGGCTACTGTGGAGTTCCAGTGGATGGGCAGTTGACCCTTTAGGAACGATTAATATCAATTTGCACGGTAACGTTGTTGATTTCTCCTGTACCGTAAACACAGCGGATATTGATAAGACGGTAGATTTAGGCAGATGGCCTACGACACAACTACTGAACGCTGGCGATACCACGGCACTCGTCCCTTTTAGCCTGCGGCTGGAGGGATGTCCTCCGGGTTCAGTTGCGATTTTATTTACGGGAACGCCGGCATCCGATACCAACCTGCTGGCTCTGGATGATCCCGCAATGGCACAAACCGTCGCCATCGAATTACGTAATAGCGATCGCTCCCGGCTCGCACTGGGGGAGGCGAGCCCGACTGAGGAAGTAGATGCAAATGGCAATGTCACACTAAACTTTTTTGCCAATTATCGAGCGTTAGCCAGCGGTGTTCGGCCAGGTGTGGCGAAAGCGGATGCGATATTTATGATCAATTATAATTAATATTATATTAATTCGTATAATTTGGCGTAGTCGATAAGCTCTACAATTGAATGCAAACCTAGCTTGCCATAAATATTAGATTTATGCGCACTAACTGTTTTATTGCTAAGTAATAACTTATCGGCAATTTCTTTATTAGATAATCCGCTAACCAGATAACGTAATATGGTCACTTCACGATTAGATAGCACAGTGACCGTTGAACTATTCGTACTACATTTATTGCTTTTTATATAGTTAAGCGTTTCGCTGGGAAAAAACGTGTATCCGGAGAGGATCATCTGAACGGCATGAAAAATATCATTCTGATCATTGCATTTACTGACAAAACCGTTAGCACCAGCTTGTATCGCTCTGCCAGCATAAAAGCATTCTGATTTCGATGATAAAAATAACACTTTCACTGTGCTCTGGATTTGTTTGATCCTTTTCAGGAAGGTAAAACCGTCTGTTCCGGGCAAGTCTATATCCATAATGATTAAATCAACAGGACGGGTTCGGAGATAATCGATGGTTATGCGATAATCATCCGTTTTCAGGACAATCTGCAATTCACTGTTTTTTTGCAACAGAACTTCAATAGACATTCTGATGATAGGATGAGTATCCATAATGATCACCGACGTTGGTTTCATAGTTACCAGTCTCATAGGAGCGGACAATTTTCCGTTAGGTCGGGAAATTGTACTTTGATACATGAAAATACGGGTTTTCTTGATTCAGACGCGCAGCGGTGTGCGTTTGTTTGCCGCTATAGCGAAATAAATCAGAAAATCAGACGCGGTCGTTCACTTGTTCAGCAACCAGATCAAAAGCCATTGACTCAGCAAGGGTTGACCGTATAATTCACGCGATTACACCGCATTGCGGTATCAACGCGCCCTTAGCTCAGTTGGATAGAGCAACGACCTTCTAAGTCGTGGGCCGCAGGTTCGAATCCTGCAGGGCGCGCCATTACAATTCAATCAGTTACGCCTTCTTTATATCCTCCATAATTTCAGAGTGGGACATATTTGGGACATTATCACCAAAAATGTCGTCTATTTTCCTCGCATGCTCTGTCAAATGATTAGGCGCAAGGTGAGCATACCTACGAACCATTTCTATGGACTCCCATCCGCCCATTTCCTGAAGCACTGATAATGGGACGCCTGACTGAATCAGCCAGCTTGCCCAGGTGTGTCTGAGGTCATGGAAACGGAAATCTTCAATTCCTGCACGACGACAAGCTGATAGCCATGATGTCTTGCTGTCGATGCGCATCTTCCTGACCGCAGGCGTTGATGTTCCATCTGCTCGCTTAGCCGCCTTGGTATGTACAAACACCCATTTGTGATGCTTGCCTATTTGATCACGCAACACTTTACAGGCGGTATCGTTCAGCGCCACACCAATGGCGCGGTTTGATTTGCTCTCTTCTGGATTCACCCAGGCAACTCGTCGCTGCATGTCGATTTGTTGCCATTCCAGATTTATGATGTTCGACTTTCTCAGACCAGTTGCCAGCGCAAACTTGACGACAGATTTCAGTGGTTCGGGGCACTCATCAATAAGGCGTTTTGCTTCCTCCTTTTCCAGCCATCTGACTCGCTTGTTTCTGACCGCTGGTATCTTGATGACAGGCGCTTTTTCCAGCCACTTCCAGTCGCGTTCTGCAGCACGGAGAATGGCCTTTATCATGGCAAGATGCTTTGCCTTTGTCTGAGTTGATACTGGCTTTGGTTCATAAACAGGCAGTTCTTTACCTTTCCTGATGGCGGCCTGAACTTTCTGTTTCCATATTTCTTTCGTCTTTCTGTTATGCATTCTGCTTACAGCAGAGTAAATCTTTGCCTCCGAGATATCTTTAAGCCTTATACCCTCAAAATGTTCAAGCCAGAACTCAATCCGGCTTTTATCTGAATCGAGAGATTTTTTATCAGCTTTTTCCTCAAGCCATCTTAGGCAGGCCTCTTCAAAAGTGACATCAGGTAAATCCCCTAGCTTTTCTACTCGCCAGAGTTCTGCTTTTCGCTTGTCGTGCAACTCCTGAGCTTGCCGTTTGTCCTTTGTGCCAAGAGATTCCTTAATTCGTTTCCCGCCCGGGAGCGAATACGAGGCATACCATATTTCATTTCTGCGGAAGAGTGACATTTTCTTTCCTCTGTTATGCCATCACCCGCGCTCACCTGGACAGTATGCAGCGGAGACTGAAGCGCCGCAATGCAGGCTTGCCGTGTTGTGAGGTACCCGGATATTATCGTGAGGATGCGTCATCGCCATTGCTCCCCAAATACAAAACCAATTTCAGCCAGTGCCTCGTCCATTTTTTCGATGAACTCCGGCACCATCTCGTCAAAACTCGCCATGTACTTTTCATTCCGCTCAATCACGACATAATGCAGGCCTTCACGCTTCATGCGCGGGTCATAGTTGGCAAAGTACCAGGCATCTTTTCGCGTCACCCACATGCTGTACTGCACCTGGGCCATGTAAGCCGATTTTATTGCCTCGAAACCACCGAGCCGGAATTTCATGAAATCCCGGGAGGTACGAGTATTGCCGGAAGCGTGGCCTGTATCCGGATGCAGAGTCTTATCCGTGGAAATCGAACGCGCATTACTGGTTGGTTACCAACTTGTACCAGAACATGCGGGCCAATGCGCTGGCTGACGCGGAATTACGGCGCAAGGCTGCCGATGAACTGACCTGTATGACAGCGCGAATTAACCGTGGTGAGACGATACCTGAACCAGTAAAACAACTTCCTGTTATGGGCGGTAGACCTCTAAATCGTGCACAGGCTCTGGCGAAGATCGCAGAAATTAAAGCTAAGTTCGGACTGAAAGGAGCAAGTGTATGACGGGCAAAGAGGCAATTATTCATTACCTGGGGACGCATAAGAGCTTCTGTGCACAGGACGTTGCCGCGGTAACAGGCGCAACCGTAATCTGATCTTACCCAGCAATAGTGGACACGCGGCTAAGTGAGTAAACTCTCAGTCAGAGGTGACTCACATGACAAAAACAGTATCAACCAGTAAAAAACCCCGTAAACAGCATTCGCCTGAATTTCGCAGTGAAGCCCTGAAGCTTGCTGAACGCATCGGTGTTACTGCCGCAGCCCGTGAACTCAGCCTGTATGAATCACAACTCTACAACTGGCGCAGTAAACAGCAAAATCAGCAGACGTCTTCTGAACGTGAACTGGAGATGTCTACCGAGATTGCACGTCTCAAACGCCAGCTGGCAGAACGGGATGAAGAGCTGGCTATCCTCCAAAAGGCCGCGACATACTTCGCGAAGCGCCTGAAATGAAGTATGTCTTTATTGAAAAACATCAGGCTGAGTTCAGCATCAAAGCAATGTGCCGCGTGCTCCGGGTGGCCCGCAGCGGCTGGTATACGTGGTGTCAGCGGCGGACAAGGATAAGCACGCGTCAGCAGTTCCGCCAACACTGCGACAGCGTTGTCCTCGCGGCTTTTACCCGGTCAAAACAGCGTTACGGTGCCCCACGCCTGACGGATGAACTGCGTGCTCAGGGTTACCCCTTTAACGTAAAAACCGTGGCGGCAAGCCTGCGCCGTCAGGGACTGAGGGCAAAGGCCTCCCGGAAGTTCAGCCCGGTCAGCTACCGCGCACACGGCCTGCCTGTGTCAGAAAATCTGTTGGAGCAGGATTTTTACGCCAGTGGCCCGAACCAGAAGTGGGCAGGAGACATCACGTACTTACGTACAGATGAAGGCTGGCTGTATCTGGCAGTGGTCATTGACCTGTGGTCACGTGCCGTTATTGGCTGGTCAATGTCGCCACGCATGACGGCGCAACTGGCCTGCGATGCCCTGCAGATGGCGCTGTGGCGGCGTAAGAGGCCCCGGAACGTTATCGTTCACACGGACCGTGGAGGCCAGTACTGTTCAGCAGATTATCAGGCGCAACTGAAGCGGCATAATCTGCGTGGAAGTATGAGCGCAAAAGGTTGCTGCTACGATAATGCCTGCGTGGAAAGCTTCTTTCATTCGCTGAAAGTGGAATGTATCCATGGAGAACACTTTATCAGCCGGGAAATAATGCGGGCAACGGTGTTTAATTATATCGAATGTGATTACAATCGGTGGCGGCGGCACAGTTGGTGTGGCGGCCTCAGTCCGGAACAATTTGAAAACAAGAACCTCGCTTAGGCCTGTGTCCATATTACGTGGGTAGGATCAACCAGCATAAATCAGGCTGCGGCTAAAATGGCGCGGGCAGGAATCCTGGTCGTTGATGGTAAGGTCTGGCGAACGGTGTATTACCGGTTCGCTACCAGAGAAGAATGGGAAGGAAAGGTGAGCACGAATCTGATTTTTAAGGAGTGTCGCCAGAGTGCCGCGATGAAACGGGTATTGAGGGTATATAAAAGAACATCAATGGGAACACAATGATGAAACAGGTGAGTTGAGTTCAAACTGTAGTACAATTCTCTCCAGTTTGAACAGGAAAGAATATGCTATGAACCCTTATATTTATCTTGGTGGTGCAATACTTGCAGAGGTCATTGGTACAACCTTAATGAAGTTTTCAGAAGGTTTTACACGGTTATGGCCATCTGTTGGTACAATTATTTGTTATTGTGCATCATTCTGGTTATTAGCTCAGACGCTGGCTTATATTCCTACAGGGATTGCTTATGCTATCTGGTCAGGAGTCGGTATTGTCCTGATTAGCTTACTGTCATGGGGATTTTTCGGCCAACGGCTGGACCTGCCAGCCATTATAGGCATGATGTTGATTTGTGCCGGTGTGTTGATTATTAATTTATTGTCACGAAGCACACCACATTAAAATAATTTGTTTCTAAACGACTAAAATATGGAGGCTCTTATATTTATATGAGCCTCGTTTTATGCTTTTTGTTAATGTCTTTATTTTTTATGTATTCTTTTGTGCTTTCAAGATTATGGCGTAAGAAAATTGCAATACGATTATTGTTGTATATTCAAGATAATGTGACCTTAATTGTCTTTTTAAATAAAAAATAAACAAAAATTATATCCCACCACTAAGGTTTATAAAAGCATACGTTAGCAGGTGTCACCATGAAAAAAGCCATAGCATATATGCGATTTTCATCACCAGGTCAGATGTCTGGCGACTCATTAAACCGACAGAGAAGACTTATTGCTGAATGGTTAAAGGTAAATAGTGATTATTATCTTGATACCATAACATATGAAGATTTAGGATTAAGTGCATTCAAAGGAAAGCATGCACAATCAGGAGCTTTTTCGGAATTTTTAGATGCTATAGAGCATGGTTATATATTGCCAGGAACTACATTGTTAGTTGAAAGTCTGGACAGACTTTCAAGAGAAAAAGTCGGTGAAGCGATTGAACGTCTGAAATTGATTTTGAATCACGGTATTGATGTTATAACTCTTTGCGACAATACAGTCTATAATATTGACTCTTTGAATGAGCCATATTCATTAATAAAAGCCATACTTATAGCACAAAGGGCAAATGAAGAAAGCGAGATAAAGTCAAGTCGGGTTAAATTATCATGGAAGAAAAAACGGCAGGATGCACTGGAATCAGGTACGATTATGACGGCGTCTTGTCCGAGATGGCTCTCCTTAGATGACAAAAGAACGGCTTTTGTTCCAGACCCCGACAGGGTGAAAACTATTGAGCTAATTTTTAAACTCAGGATGGAAAGGCGCTCATTGAATGCAATAGCCAAGTATTTAAATGATCATGCTGTAAAGAATTTCTCAGGAAAAGAAAGTGCATGGGGACCTTCTGTAATTGAAAAATTATTAGCGAATAAAGCTCTGATAGGTATTTGCGTACCTTCATATCGTGCAAGAGGGAAAGGGATAAGTGAAATCGCTGGCTATTATCCCAGAGTCATATCAGATGATTTGTTTTACGCTGTACAGGAAATTCGGTTGGCACCTTTTGGTATTAGCAATAGTAGCAAGAATCCTATGCTAATAAATCTACTTCGAACAGTTATGAAGTGTGAGGCTTGTGGTAATACCATGATTGTTCATGCGGTATCTGGAAGTTTGCATGGCTATTATGTTTGTCCGATGAGAAGATTACATCGATGTGACAGGCCATCAATAAAAAGAGATTTGGTTGATTATAATATCATTAATGAATTGCTTTTTAATTGTAGCAAAATTCAACCAGTTGAAAACAAGAAAGATGCTAATGAAACTTTAGAGTTAAAAATTATTGAGCTTCAGATGAAAATTAATAATTTAATCGTTGCATTGTCTGTCGCGCCTGAAGTTACCGCTATAGCAGAGAAAATAAGACTATTAGATAAGGAATTACGAAGGGCTTCGGTATCATTGAAAACTTTGAAGAGTAAAGGTGTAAATTCATTCAGTGATTTTTATGCTATTGACTTAACCAGTAAAAATGGACGAGAGTTATGCCGTACACTTGCCTATAAAACATTCGAAAAAATCATAATTAATACGGATAATAAAACCTGTGATATCTATTTTATGAATGGCATTGTTTTTAAACACTATCCTTTAATGAAAGTAATATCCGCCCAGCAGGCGATAAGTGCTCTCAAATATATGGTTGATGGTGAGATTTATTTCTAAATAATGATCTCGGATTTTAAGTTATGCTATGGTGATAAAGTGCAAGACAGAATTAATTATCTTTGACGAAACTTAATGGGTAATTACTTTGTTTGCTCCCACAAGCGAGTTTTGTACGGCTGTATTGGGGTAGTAAATGAGCTATACAATCTTAATCATTTGTTAGGTGAGAACTCTTGGTCGCAGATTCAAATACTGAAAATACGTGACAAATTATTATGAGCAAAATGGTGTATGTCACGTATTTTGAATGGTAGGTTAAAAAATAACACCGACTTTCGTAGGTGTTACTAATAATAAAGCAGAGTTTTTAGATAGTATCAATGTGCTTTGTGTATATTGTGGCAAATAATTGGGTTGGGGGTACAATTGTGATTGCTTTTGCATGAACATTGCGCCTTTATGCATAATGAGATAAAGGAATATCAAATAAAATAACGATAGGTCATAACAAAGAGGTTTTTATGAAAACACTTATCGTTTCAACTGTATTGGCATTCATAACATTTTCTGCGCAGGCTGCAGCATTTCAGGTCACTAGTAATGAAATAAAAACAGGAGAGCAACTTACAACGTCTCATGTCTTTTCTGGATTTGGGTGTGAAGGTGGTAATACATCGCCCTCATTAACCTGGTCTGGTGTTCCTGAAGGTACCAAAAGCTTTGCCGTAACTGTATATGATCCAGATGCACCTACAGGCAGTGGTTGGTGGCATTGGACTGTTGTTAATATTCCAGCAACAGTAACATATTTGCCCGTTGATGCAGGGAGACGTGATGGAACAAAACTGCCGACTGGTGCTGTTCAAGGCCGAAATGATTTTGGCTATGCTGGGTTTGGTGGCGCATGTCCTCCTAAAGGAGATAAACCACATCATTACCAGTTTAAAGTATGGGCTCTAAAAACTGAAAAGATTCCTGTAGATTCTAACTCCAGCGGAGCGTTAGTTGGTTATATGCTTAATGCTAATAAAATCGCAACCGCTGAGATAACACCAGTTTATGAGATAAAGTAGGGTGAGAGTATGCTGGCAAGAGGTAAGACTAACTTAAAGATCGAAGAAATACGGATGCATAAACATCATGAGATTCATAGGGTTAAGCCTCTTATGCCAGCTTTGTGTCGTATCCGTCAGGGAAAGAAAGTTATCAATTGGGAGACGCATACTTTAACTGTTGATAATAATCAAATAATATTATTTCCTTGTGGTTATGAATTTTATATTGAGAATTATCCTGAAGCAGGGCTTTATCTTGCAGAAATGCTTTACTTACCCATTGATTTAATTGAGAGTTTCCAAAAACTTTATACGGTAACTGATCAAATACGTAACAAAACAAGTTTCTTTTTACCTCAGAATCCTGAGTTAATATATTGTTGGGAGCAACTAAAAACATCTGTTTCCCGAGGCTTCTCAACTAAAATTCAGGAGCACTTAGCAATGGGCGTTCTACTTTCGTTAGGAGTGAATCATGTTAATCATTTACTTTTATCATATAGTAAACAATCATTGATAAGTCGTTGTTATAACCTGCTGCTATCCGAACCCGGCACAAAATGGACAGCAAACAAGGTTGCTCGATATCTCTACATTTCTGTTTCTACATTACATCGCCGTCTAGCAAGCGAGGGGGTAAGTTTCCAAAGTATACTGGACGATGTGAGGTTAAATAATGCGTTGTCTGCTATACAAACGACGGTAAAACCTATAAGCGAGATTGCCAGAGAAAATGGTTATAAGTGTCCTTCTCGTTTTACTGAAAGATTTCATAATCGTTTTAATATAACACCAAGAGAGATAAGAAAAGCTTCCAGAGAGTAAAAGTGTTTTAAGAAGGAGCAATTCTATCGATTTTGATTTTGGGAAATCAACACGGCATAATTATGTCACCGGAGCCTGAACAACTCCGGTGACTTCTGCGCTAAACGGGGACGTTTATGCGCACATACAATCCAAACTCTCTTCTCCCTTCACAGATGCAGAAATGCACCTGCAATTCTTTGCATCTAGCGTTTGACCTCTGCGGAGGGGAAGCGTGAACCTCTCACAAGACGGCATCAAATTACATCGCGGCAACTTCACCGCTATCGGTCGGCAGATCCAGCCTTATCTGGAGGAGGGCAAATGCTTTCGCATGGTGCTTAAACCGTGGCGTGAGAAACGCAGTCTTTCCCAGAATGCACTCAGCCACATGTGGTACAGCGAAATCAGTGAATACCTCATCAGCAGGGGTAAAACGTTCGCCACTCCAGCTTGGGTAAAAGATGCTCTCAAACACACATATCTCGGTTATGAAACCAAAGACCTGGTTGATGTCGTAACCGGTGATATCACCACTATCCAGTCGTTACGCCATACCTCCGATCTTGATACCGGAGAGATGTATGTCTTCCTGTGTAAGGTTGAAGCCTGGGCGATGAATATTGGTTGCCACCTGACTATTCCACAGAGCTGCGAGTTCCAGCTGCTGCGCGACAAGCAGGAGGCGTAATGGCTACACCGCTTATTCGTGTCATGAACGGACACATCTACAGAGTACCAAATCGTCGTAAGCGTAAACCTGAGCTGAAGCCATCCGAAATACCAACACTGCTCGGATATACCGCCAGCTTGGTTGATAAAAAATGGTTGCGACTGGCAGCAAGGAGGAGTCATGGCTGATTTGAGAAAAGCAGCGCGTGGTCGGGAATGCCAGGTAAGAATCCCTGGCGTATGTAATGGCAACCCTGAAACGTCTGTACTGGCACATATCCGGCTGACTGGATTGTGCGGCACCGGTACGAAACCGCCAGACCTGATTGCCACCATTGCATGTTCTGCCTGCCACGACGAAATCGACCGCCGCACGCATTTTGTTGACGCTGGATATGCAAAAGAATGCGCGCTGGAAGGTATGGCGAGAACACAGGTTATCTGGCTGAAAGAGGGGGTTATTAAGGCGTGAATACCTACAGCATCACATTACCCTGGCCTCCGAGCAATAATCGCTATTACCGCCATAATCGCGGGCGCACGCACGTCAGCGCAGAGGGGCAGGCATACCGCGATAACGTCGCCCGAATCATTAAAAACGCAATGCTGGATATCGGCCTGGCTATGCCTGTGAAAATCCGCATTGAGTGCCACATGCCGGATCGCCGTCGCCGTGACCTGGATAATCTGCAAAAAGCCGCTTTTGACGCACTCACTAAAGCAGGTTTCTGGCTGGATGATGCTCAGGTCGTTGATTACCGCGTTGTGAAGATGCCTGTTACCAAAGGTGGGAGGCTGGAACTGACCATCACCGAAATGGGGAATGAATGATGTTTGAGTTTAATATGGCAGAACTTCTTCGCCACCGCTGGGGGCGTCTGCGCTTATATCGTTTCCCCGGTTCTGTTTTGACCGATTACCGAATACTGAAGAATTACGCCAAAACCCTGACAGGAGCAGGAGTATGAAGTCAGAGATAACAATCAACTAATACTGTTTTGTTGATTTTTGCTTGTAATTGGCGTTCTGGTCTGATTTTTGTGGAGTAAGTTGATGCGTGATATTCAGATGGTTCTTGAGCGTTGGGGAGCGTGGGCGGCTAATAATCATGAAGATGTGACCTGGTCGTCCATTGCCGCCGGTTTTAAGGGATTAATTACTTCAAAAGTAAAATCTCGCCCGCAATGTTGTGACGATGACGCGATGATCATTTGCGGGTGCATGGCCCGTCTGAAAAAGAACAACAGCGATTTGCACGATTTATTAGTAGATTATTATGTAGTCGGTATGACATTCATGTCACTGGCAGGTAAGCATTGCTGCTCTGATGGTTATATCGGGAAAAGGTTACAGAAGGCTGAGGGCATAATTGAAGGGATGTTAATGGCATTAGATATCCGGTTAGAGATGGATATCGTTGTTAATAACTCTAATTAATATGCCAATTGTTTACTAAAAATTATTAAAAATGGGGCGTTGAGACGCCCCCAAAAATAAAGGGTAATATATAACAGAAGGTTTATATAGTTAGAAGCAAGGTTGTGCTTCTAAAGGAAGTGGCTTGAGGGAGCCACTTATATGTTGGGGAGGCAACGCCTCCCGCAACATATCTTTTTCGTAATCAGATTAGAACTGGTAAACCAGACCTACAGCAACGATGTCATCAGTGCTTACACCGAGTGCTTTAGGGAAGGTGCGAATAAGCGGGGAAATTCTTCTCGGCTGACTCAGTCATTTCATTTCTTCATGTTTGAGCCGATTTTTTCTCCCGTAAATGCCTTGAATCAGCCTATTTAGACCGTTTCTTCGCCATTTAAGGCGTTATCCCCAGTTTTTAGTGAGATCTCTCCCACTGACGTATCATTTGGTCCGCCCGAAACAGGTTGGCCAGCGTGAATAACATCGCCAGTTGGTTATCGTTTTTCAGCAACCCCTTGTATCTGGCTTTCACGAAGCCGAACTGTCGCTTGATGATGCGAAATGGGTGCTCCACCCTGGCCCGGATGCTGGCTTTCATGTATTCGATGTTGATGGCCGTTTTGTTCTTGCGTGGATGCTGTTTCAAGGTTCTTACCTTGCCGGGGCGCTCGGCGATCAGCCAGTCCACATCCACCTCGGCCAGCTCCTCGCGCTGTGGCGCCCCTTGGTAGCCGGCATCGGCTGAGACAAATTGCTCCTCTCCATGCAGCAGATTACCCAGCTGATTGAGGTCATGCTCGTTGGCCGCGGTGGTGACCAGGCTGTGGGTCAGGCCACTCTTGGCATCGACACCAATGTGGGCCTTCATGCCAAAGTGCCACTGATTGCCTTTCTTGGTCTGATGCATCTCCGGATCGCGTTGCTGCTCTTTGTTCTTGGTCGAGCTGGGTGCCTCAATGATGGTGGCATCGACCAAGGTGCCTTGAGTCATCATGACGCCTGCTTCGGCCAGCCAGCGATTGATGGTCTTGAACAATTGGCGGGCCAGTTGATGCTGCTCCAGCAGGTGGCGGAAATTCATGATGGTGGTGCGGTCCGGCAAGGCGCTATCCAGGGATAACCGGGCAAACAGACGCATGGAGGCGATTTCGTACAGAGCATCTTCCATCGCGCCATCGCTCAGGTTGTACCAATGCTGCATGCAGTGAATGCGTAGCATGGTTTCCAGCGGATAAGGTCGCCGGCCATTACCAGCCTTGGGGTAAAACGGCTCGATGACTTCCACCATGTTTTGCCATGGCAGAATCTGCTCCATGCGGGACAAGAAAATCTCTTTTCTGGTCTGACGGCGCTTACTGCTGAATTCACTGTCGGCGAAGGTAAGTTGATGACTCATGATGAACCCTGTTCTATGGCTCCAGATGACAAACATGATCTCATATCAGGGACTTGTTCGCACCTTCCTTAGTGAAGTCATTTTTGTCAAGCAGGTTGATTTTGTAATCAACGAAAGTAGACATATTTTTGTTGAAGTAATAGGTTGCACCTACATCAACATATTTGACTAAGTCCTGATCGCCCCATACTCCAAGATCCTTACCTTTAGATTGCAGGTAAGCAACGGACGGACGCAGACCGAAATCGAACTGATATTGTGCAACAGCTTCGAAGTTTTGGGCTTTATTAGCAACGAAGTGATCAGCAAATACAGTCATATTCTGGGTTTCAGAATAGGTAGTGGCCAGGTAAATGTTGTTAGCGTCATATTTCAGACCTGCGGCCCAAACTTCTGCATTTTTACCGGAAGCAAATACTTCAGGAAGAACTTTCCCTGCATTAACTTGAGTGTCGGTACGATCAGATTTCGCATAAGTTGCACCGATACCGAATCCTTCGTATTCATAGGTAGCAGAGAAACCGAAGCCATCACCGTTACCTTCAGTGTAGTTATCGAAATCGCTACGATCGTTTTTGCCTTGGTACTGAGCAGCAAAGTTCAGACCATCAACCAGACCAAAGAAGTCGTTGTTACGATAGGTTGCAACACCAGTTGCACGTTGAGTCATGAACACGTCGGTTTGAGTCCAAGTGTCACCACCGAATTCTGGCAGGACGTCAGTCCACGCACCGATGTCGTATGCTACACCGTAGTTACGGCCGTAATCGATGGAGCCGTAGTCACCGAATTTCAGGCCAGCGAAGGCAAGACGGGTTTTATCTTTGGAGGAACCTTGAGATTCAGCGCGGTTGCCTTTGAATTCATATTCCCACTGACCGAAACCAGTCAGTTGATCGTTGATTTGGGTTTCACCTTTGAAGCCAAGACGGGCATAAGTAGTATCACCATCATCTGCATCATTAGAGGAGAAGTAGTGCTTAGCATTAACTTTCCCGTACAGATCCAGCTTGTTACTGTCTTTATTATAAATTTCAGCTGCCTGAGCAGACATCGCCATTAGTACTGATGCAGCTACAGCAGAAATTGCCACTGTTAATTTTTTCATCGTGAGCCCTTTTTTTGAACTATTATTAAAAAATGATGTCACTGCGCGATAAATATTCATCTAATCAATGTGATTATTTCAAGATGTAAGTTTTGGTTTCTCGTTTGATTTGTGAAGTAGATCTCTATTTTTATCTGAACTTTTTTCTATCGAATCCTATTCATGGCTCTTGGCTGAATAAAAATAAATCTATTAGCCAATTTATATTAACGGCTGTTATTTATAAGTGCTCTATAATTTGAAGGTTCAATTTAAACCGGCTAAAAATAACACTGGAAATTATTTTTTGGTTATTTGTTGAGATTTGCTTATGTATTTGTAGTGGTGTTTTCAATACTCGGTAGCATTCTCTCAAATATCATTTAGTGGTTTACGTACGTAAAAAATTGGTTATGCTGTTAAGAGTGGTTACTTCGTCACACAGCTTAAACCCGCCGTCGAGCGGGTTTTTCCATTTTTTGAGTCTCGATATTAGCTGATAACCCAATACCTGAGTTATTCACTGACTCCGAGTCTGTTACGTTTCGTAGTATTCCCTCAATTTACACCCGCTTTGTCTGCGAGGTGGGGTTATGAAATCCATGGATAAGTTAACAACGGGTGTCGCCTATGGCACCTCAGCAGGTAGTGCCGGGTACTGGTTTTTACAGCTGCTAGATAAAGTCACTCCCTCACAGTGGGCAGCAATAGGTGTGCTGGGTAGCCTGGTATTTGGCCTGCTGACGTACCTGACAAACCTTTATTTCAAGATTAAAGAAGATAAGCGCAAGGCTGCGAGAGGTGAATAATGCCTCCATCATTACGAAAAGCCGTTGCTGCTGCTATTGGTGGCGGAGCAATTGCTATAGCATCAGTGTTAATCACTGGCCCAAGTGGTAACGATGGTCTGGAAGGTGTCAGCTACATACCATACAAAGATATTGTTGGTGTATGGACTGTATGTCACGGACACACCGGAAAAGACATCATGCTCGGTAAAACGTATACCAAAGCAGAATGCAAAGCACTCTTGAATAAAGACCTTGCCACTGTCGCCAGACAAATTAACCCGTATATCAAAGTCGATATACCGGAAACAACGCGCGGCGCTCTTTACTCATTCGTTTACAACGTGGGTGCTGGCAATTTTAGAACATCGACGCTTCTTCGCAAAATAAACCAGGGCGATATCAAAGGCGCATGTGATCAGCTGCGTCGCTGGACATACGCTGGCGGTAAGCAATGGAAAGGCCTGATGACTCGTCGTGAGATTGAGCGTGAAGTCTGTTTGTGGGGGCAACAGTGAGCAGAGTAACCGCGATTATATCCGCTCTGATTATCTGCATCATCGTCAGCCTGTCATGGGCGGTCAATCATTACCGTGATAACGCAATCGCCTACAAAGTCCAGCGCGACAAAAATGCCAGAGAACTGAAGCTAGCGAACGCGGCAATTACTGACATGCAGATGCGTCAGCGTGATGTTGCTGCGCTCGATGCAAAATACACGAAGGAGTTAGCTGATGCGAAAGCTGAAAATGATGCTCTGCGTGATGATGTTGCCGCTGGTCGTCGTCGGTTGCACATCAAAGCAGTCTGTCAGTCAGTGCGTGAAGCCACCACGGCCTCCGGCGTGGATAATGCAGCCTCCCCCCGACTGGCAGACACCGCTGAACGGGATTATTTCACCCTCAGAGAGAGGCTGATCACTATGCAAAAACAACTGGAAGGAACCCAGAAGTATATTAATGAGCAGTGCAGATAGAGCTGACCATATCGATGGGCAACTCATGCAATTATTTTGAGCAATACACACGCGCTTCCAGCGGAGTATAAATGCCTAAAGTAATAAAACCGAGCAATCCATTTACGAATGTTTGCTGGGTTTCTGTTTTAACAACATTTTCTGCGCCGCCACAAATTTTAGCTGCATCGACAGTTTTCTTCTGCCCAATTCCAGAAACGAAGAAATGATGGGTGATGGTTTCCTTTGGTGCTACTGCTGTCTGTTTGTTTTGAACAGTAAATGTCTGTTGAGCACATCCTGTAATAAGCAGGGCCAGCGCAGTAGCGAGTAGCATTTTTTTCATGGTGTTATTCCCGATGCTTTTTGAAGTTCGCAGAATCGTATGTGTAGAAAATTAAACAAACCCTAAACAATGAGTTGAAATTTCATATTGTTAATATTTATTAATGTATGCCAGGTGCGATGAATCGTCATTGTATTCCCGGATTAACTATGTCCACAGCCCTGACGGGGAACTTCTCTGCGGGAGTGTCCGGGAATAATTAAAAACGATGCACACAGGGTTTAGCGCGTACATGTATTGTATTATGCCAACACCCCGGTGCTGACACGGAAGAAACCGGACGTTATGATTTAGCGTGGAAAGATTTGTGTAGTGTTCTGAATGCTCTCAGTAAATAGTAATGAATTATCAAAGGTATAGTAATATCTTTTATGTTCGTGGATATTTGTAATCCATCGGAAAACTCCTGCTTTAGCAAGATTTTCCCTGTATTGCTGAAATGTGATTTCTCTTGATTTCAACCTATCATAGGACGTTTCTATAAGATGCGTATTTCTTGAGAATTTAACATTTACAACCTTTTTAAGTCCTTTTATTAACACGGTGTTATCGTTTTCTAACACAATGTGAATATTATCTGTGGCTAGATAGTAAATATAATGTGAGACATTGTGACGTTTTAGTTCAGAATAAAACAATTCACAGTTTAAATCTTTTCGCACTTGATCGAATATTTCTTTAAAAATGGCAACCTGAGCCATTGGTAAAACCTTCCATGTGATACGAGGGCGCGTAGTTTGCATTATCGTTTTTATCGCTTCAATCTGGTCTGACCTCTTTGTGTTTTGTTGATGATTTATGTCAAATATTAGGAATGTTTTCAATTAATAGTATTGGTTGCGTAACAAAGTGCGGTCCTGCTGGCATTCTGGAGGGAAATACAACCGACAGATGTATGTAAGGCCAACGTGCTCAAACCTTCATACAGAAAGATTTGAAGTAATATTTTAACCGCTAGATGAAGAGCAAGCGCATGGAGCGACAAAATGAATAAAGAACAATCTGCTGATGATCCCTCCGTGGATCTGATTCGTGTAAAAAATATGCTTAATAGCACCATTTCTATGAGTTACCCTGATGTTGTAATTGCATGTATAGAACATAAGGTGTCTCTGGAAGCATTCAGGGCAATTGAGGCAGCGTTGGTGAAGCACGATAATAATATGAAGGATTATTCCCTGGTGGTTGACTGATCACCATAACTGCTAATCATTCAAACTACTTAACCTGTGACAGAGCCAACACGCAGTCTGTCACTGTCAGGAAAGTGGTAAAACTGCAACTCAATTACTGCAATGCCCTCGTAATTAAGTGAATTTACAATATCGTCCTGTTCGGAGGGAAGAACGCGGGATGTTCATTCTTCATCACTTTTAATTGATGTATATGCTCTCTTTTCTGACGTTAGCCTCCGACGGCAGGCTTCAATGACCCAGGCTGAGAAATTCCCGGACCCTTTTTGCTCAAGAGCGATGTTAATTTGTTCAATCATTTGGTTAGGAAAGCGGATGTTGCGGGTTGTTGTTCTGCGGGTTCTGTTCTTAGTTGACATGAGGTTGCCCCGTATTCAGTGTCGCTGATTTGTATTGTCTGAAGTTGTTTTTACGTTAAGTTGATGCAGATCAATTAATACGATACCTGCGTCATAATTGATTATTTGACGTGGTTTGATGGCGTAGATGCACGTTGTGACATGTAGATGATAATTATTATCATTTTGTGGGTCCTTTCCGGCGATCCGACAGGTTACGGGGCGGCGACCTCGCGGGTTTTCGCTATTTATGAAAATTTTCCGGTTTAAGGTGTTTCCGTTCTTCTTCGTCGTAACTTAATGTATTTATTTAAAATACCCCCTGAAAAGAAAGGAAACGACAGGTGCTGAAAGCGAGCTTTTTGGCCTCTGTCGTTTCCTTTCTCTGTTTTTGTCCGTGGAATGTGCAATGGAAGTCAACAAAAAGCAGCTGGCTGACATTTTCGGTGCGAGTATCCGTACCATTCAGAACTGGCAGGAACAGGGAATGCCCGTTCTGCGAGGCGGTGGCAAGGGTAATGAGGTGCTTTATGACTCTGCCGCCGTCATAAAATGGTATGCCGAAAGGGATGCTGAAATTGAGAACGAAAAGCTGCGCCGGGAGGTTGAAGAACTGCTGCAGGCCAGCGAGACAGATCTCCAGCCAGGGACTATTGAGTACGAACGCCATCGACTTACGCGTGCGCAGGCCGATGCACAGGAGCTGAAAAATGCCAGAGACTCCGCTGAAGTGGTGGAAACCGCATTCTGTACTTTCGTGCTGTCGCGGATCGCAGGTGAAATTGCCAGTATTCTCGACGGGATCCCCCTGTCGGTGCAGCGGCGTTTTCCGGAACTGGAAAACCGACATGTTGATTTCCTGAAACGGGATATCATCAAAGCCATGAACAAAGCAGCCGCGCTGGATGAACTGATACCGGGGTTGCTGAGTGAATATATCGAACAGTCAGGTTAACAGGCTGCGGCATTTTGTCCGCGCCGGGCTTCGCTCACTGTTCAGGCCGGAGCCACAGACCGCCGTTGAATGGGCGGATGCTAATTACTATCTCCCGAAAGAATCCGCATACCAGGAAGGGCGCTGGGAAACACTGCCCTTTCAGCGGGCCATCATGAATGCGATGGGCAGCGACTACATCCGTGAGGTGAATGTGGTGAAGTCTGCCCGTGTCGGTTATTCCAAAATGCTGCTGGGTGTTTATGCCTACTTTATAGAGCATAAGCAGCGCAACACCCTTATTCCAGCTGGCTTCGTGGCTGTTTTCAACAGTGATGAGTCATCGTGGCATCTCGTTGAAGATCATCGGGGTAAAACGGTTTATGACGTAGCGTCAGGGGACGCGTTATTTATTTCTGAACTCGGTCCGTTACCGGAAAATGTTACCTGGTTATCGCCGGAAGGGGAGTTTCAGAAGTGGAACGGTACAGCCTGGGTGAAAGATGCAGAAGCAGAAAAACTGTTCCGGATTCGGGAGGCGGAAGAAACAAAAAACAGCCTGATGCAGGTAGCCAGTGAGCATATTGCGCCACTTCAGGATGCTGTAGATCTGGAAATCGCAACGGAGGAAGAAACCTCATTGCTGGAAGCCTGGAAAAAATATCGGGTGTTGCTGAACCGTGTTGATACATCAACTGCACCTGATATTGAGTGGCCTACGAACCCTGTCAGGGAGTAA
Protein sequences of DBSCAN-SWA_1 >CP028702|911960:965264|956250_956541_+|AVZ48090.1|DBSCAN-SWA MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIKA >CP028702|911960:965264|935757_936267_-|AVZ48066.1|DBSCAN-SWA MSSRNNPARVAIVMGSKSDWATMQFAAEIFEILNVPHHVEVVSAHRTPDKLFSFAESAEENGYQVIIAGAGGAAHLPGMIAAKTLVPVLGVPVQSAALSGVDSLYSIVQMPRGIPVGTLAIGKAGAANAALLAAQILATHDKELHQRLNDWRKAQTDEVLENPDPRGAA >CP028702|911960:965264|949264_949528_-|AVZ48080.1|DBSCAN-SWA MKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVIERNEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP028702|911960:965264|945445_946486_+|AVZ48076.1|DBSCAN-SWA MHTSWLIKEIINKIICRLLLAMACLCLANISWATVCANSTGVAEDEHYDLSNIFNSTNNQPGQIVVLPEKSGWVGVSAICPPGTLVNYTYRSYVTNFIVQETIDNYKYMQLHDYLLGAMSLVDSVMDIQFPPQNYIRMGTDPNVSQNLPFGVMDSRLIFRLKVIRPFINMVEIPRQVMFTVYVTSTPYDPLVTPVYTISFGGRVEVPQNCELNAGQIVEFDFGDIGASLFSAAGPGNRPAGVMPQTKSIAVKCTNVAAQAYLTMRLEASAVSGQAMVSDNQDLGFIVADQNDTPITPNDLNSVIPFRLDAAAAANVTLRAWPISITGQKPTEGPFSALGYLRVDYQ >CP028702|911960:965264|918414_919293_+|AVZ48053.1|DBSCAN-SWA MKLGFIGLGIMGTPMAINLARAGHQLHVTTIGPVADELLSLGAVSVETARQVTEASDIIFIMVPDTPQVEEVLFGENGCTKASLKGKTIVDMSSISPIETKRFARQVNELGGDYLDAPVSGGEIGAREGTLSIMVGGDEAVFERVKPLFELLGKNITLVGGNGDGQTCKVANQIIVALNIEAVSEALLFASKAGADPVRVRQALMGGFASSRILEVHGERMIKRTFNPGFKIALHQKDLNLALQSAKALALNLPNTATCQELFNTCAANGGSQLDHSALVQALELMANHKLA >CP028702|911960:965264|939198_939720_-|AVZ48070.1|DBSCAN-SWA MPTVITHAAVPLCIGLGLGSKVIPPRLLFAGIILAMLPDADVLSFKFGVAYGNVFGHRGFTHSLVFAFVVPLLCVFIGRRWFRAGLIRCWLFLTVSLLSHSLLDSVTTGGKGVGWLWPWSDERFFAPWQVIKVAPFALSRYTTPYGHQVIISELMWVWLPGMLLMGMLWWRRR >CP028702|911960:965264|961273_961735_+|AVZ48099.1|lysis|DBSCAN-SWA MSRVTAIISALIICIIVSLSWAVNHYRDNAIAYKVQRDKNARELKLANAAITDMQMRQRDVAALDAKYTKELADAKAENDALRDDVAAGRRRLHIKAVCQSVREATTASGVDNAASPRLADTAERDYFTLRERLITMQKQLEGTQKYINEQCR >CP028702|911960:965264|958924_959992_-|AVZ48096.1|DBSCAN-SWA MKKLTVAISAVAASVLMAMSAQAAEIYNKDSNKLDLYGKVNAKHYFSSNDADDGDTTYARLGFKGETQINDQLTGFGQWEYEFKGNRAESQGSSKDKTRLAFAGLKFGDYGSIDYGRNYGVAYDIGAWTDVLPEFGGDTWTQTDVFMTQRATGVATYRNNDFFGLVDGLNFAAQYQGKNDRSDFDNYTEGNGDGFGFSATYEYEGFGIGATYAKSDRTDTQVNAGKVLPEVFASGKNAEVWAAGLKYDANNIYLATTYSETQNMTVFADHFVANKAQNFEAVAQYQFDFGLRPSVAYLQSKGKDLGVWGDQDLVKYVDVGATYYFNKNMSTFVDYKINLLDKNDFTKEGANKSLI >CP028702|911960:965264|961766_962060_-|AVZ48100.1|DBSCAN-SWA MKKMLLATALALLITGCAQQTFTVQNKQTAVAPKETITHHFFVSGIGQKKTVDAAKICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSK >CP028702|911960:965264|947014_947647_-|AVZ48078.1|DBSCAN-SWA MKPTSVIIMDTHPIIRMSIEVLLQKNSELQIVLKTDDYRITIDYLRTRPVDLIIMDIDLPGTDGFTFLKRIKQIQSTVKVLFLSSKSECFYAGRAIQAGANGFVSKCNDQNDIFHAVQMILSGYTFFPSETLNYIKSNKCSTNSSTVTVLSNREVTILRYLVSGLSNKEIADKLLLSNKTVSAHKSNIYGKLGLHSIVELIDYAKLYELI >CP028702|911960:965264|914839_915655_+|AVZ48050.1|DBSCAN-SWA MTEVRRRGRPGQAEPVAQKGAQALERGIAILQYLEKSGGSSSVSDISLNLDLPLSTTFRLLKVLQAADFVYQDSQLGWWHIGLGVFNVGAAYIHNRDVLSVAGPFMRRLMLLSGETVNVAIRNGNEAVLIGQLECKSMVRMCAPLGSRLPLHASGAGKALLYPLAEEELMSIILQTGLQQFTPTTLVDMPTLLKDLEQARELGYTVDKEEHVVGLNCIASAIYDDVGSVVAAISISGPSSRLTEDRFVSQGELVRDTARDISTALGLKAHP >CP028702|911960:965264|957903_958920_-|AVZ48095.1|transposase|DBSCAN-SWA MFVIWSHRTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMVEVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMRLFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQGTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSGLTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVDWLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGFVKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH >CP028702|911960:965264|954059_954611_+|AVZ48085.1|DBSCAN-SWA MKTLIVSTVLAFITFSAQAAAFQVTSNEIKTGEQLTTSHVFSGFGCEGGNTSPSLTWSGVPEGTKSFAVTVYDPDAPTGSGWWHWTVVNIPATVTYLPVDAGRRDGTKLPTGAVQGRNDFGYAGFGGACPPKGDKPHHYQFKVWALKTEKIPVDSNSSGALVGYMLNANKIATAEITPVYEIK >CP028702|911960:965264|963759_963861_+|AVZ48104.1|DBSCAN-SWA MIIIIILWVLSGDPTGYGAATSRVFAIYENFPV >CP028702|911960:965264|956537_956900_+|AVZ48091.1|DBSCAN-SWA MNTYSITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE >CP028702|911960:965264|922313_923675_+|AVZ48055.1|DBSCAN-SWA MSFDLIIKNGTVILENEARVVDIAVKGGKIAAIGQDLGDAKEVMDASGLVVSPGMVDAHTHISEPGRSHWEGYETGTRAAAKGGITTMIEMPLNQLPATVDRASIELKFDAAKGKLTIDAAQLGGLVSYNIDRLHELDEVGVVGFKCFVATCGDRGIDNDFRDVNDWQFFKGAQKLGELGQPVLVHCENALICDELGEEAKREGRVTAHDYVASRPVFTEVEAIRRVLYLAKVAGCRLHVCHVSSPEGVEEVTRARQEGQDVTCESCPHYFVLDTDQFEEIGTLAKCSPPIRDLENQKGMWEKLFNGEIDCLVSDHSPCPPEMKAGNIMKAWGGIAGLQSCMDVMFDEAVQKRGMSLPMFGKLMATNAADIFGLQQKGRIAPGKDADFVFIQPNSSYVLTNDDLEYRHKVSPYVGRTIGARITKTILRGDVIYDIEQGFPVAPKGQFILKHQQ >CP028702|911960:965264|955534_955636_+|AVZ48087.1|DBSCAN-SWA MRTYNPNSLLPSQMQKCTCNSLHLAFDLCGGEA >CP028702|911960:965264|915744_917526_+|AVZ48051.1|DBSCAN-SWA MAKMRAVDAAMYVLEKEGITTAFGVPGAAINPFYSAMRKHGGIRHILARHVEGASHMAEGYTRATAGNIGVCLGTSGPAGTDMITALYSASADSIPILCITGQAPRARLHKEDFQAVDIEAIAKPVSKMAVTVREAALVPRVLQQAFHLMRSGRPGPVLVDLPFDVQVAEIEFDPDMYEPLPVYKPAASRMQIEKAVEMLIQAERPVIVAGGGVINADAAALLQQFAELTSVPVIPTLMGWGCIPDDHELMAGMVGLQTAHRYGNATLLASDMVFGIGNRFANRHTGSVEKYTEGRKIVHIDIEPTQIGRVLCPDLGIVSDAKAALTLLVEVAQEMQKAGRLPCRKEWVADCQQRKRTLLRKTHFDNVPVKPQRVYEEMNKAFGRDVCYVTTIGLSQIAAAQMLHVFKDRHWINCGQAGPLGWTIPAALGVCAADPKRNVVAISGDFDFQFLIEELAVGAQFNIPYIHVLVNNAYLGLIRQSQRAFDMDYCVQLAFENINSSEVNGYGVDHVKVAEGLGCKAIRVFKPEDIAPAFEQAKALMAQYRVPVVVEVILERVTNISMGSELDNVMEFEDIADNAADAPTETCFMHYE >CP028702|911960:965264|964520_965264_+|AVZ48106.1|terminase|DBSCAN-SWA MNISNSQVNRLRHFVRAGLRSLFRPEPQTAVEWADANYYLPKESAYQEGRWETLPFQRAIMNAMGSDYIREVNVVKSARVGYSKMLLGVYAYFIEHKQRNTLIPAGFVAVFNSDESSWHLVEDHRGKTVYDVASGDALFISELGPLPENVTWLSPEGEFQKWNGTAWVKDAEAEKLFRIREAEETKNSLMQVASEHIAPLQDAVDLEIATEEETSLLEAWKKYRVLLNRVDTSTAPDIEWPTNPVRE >CP028702|911960:965264|956896_957037_+|AVZ48092.1|DBSCAN-SWA MMFEFNMAELLRHRWGRLRLYRFPGSVLTDYRILKNYAKTLTGAGV >CP028702|911960:965264|950008_951170_+|AVZ48082.1|transposase|DBSCAN-SWA MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWRSKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILPKGRDILREAPEMKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCDSVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAVVIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQYCSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISREIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENKNLA >CP028702|911960:965264|929846_931514_+|AVZ48061.1|DBSCAN-SWA MIHAFIKKGCFQDSVSLMIISRKLSESENVDDVSVMMGTPANKALLDTTGFWHDDFNNATPNDICVAIRSEAADAGIAQAIMQQLEEALKQLAQGSGSSQALTQVRRWDSACQKLPDANLALISVAGEYAAELANQALDRNLNVMMFSDNVTLEDEIQLKTRAREKGLLVMGPDCGTSMIAGTPLAFANVMPEGNIGVIGASGTGIQELCSQIALAGEGITHAIGLGGRDLSREVGGISALTALEMLSADEKSEVLAFVSKPPAEAVRLKIVNAMKATGKPTVALFLGYTPAVARDENVWFASSLDEAARLACLLSRVTARRNAIAPVSSGFICGLYTGGTLAAEAAGLLAGHLGVEADDTHQHGMMLDADSHQIIDLGDDFYTVGRPHPMIDPTLRNQLIADLGAKPQVRVLLLDVVIGFGATADPAASLVSAWQKACAARLDNQPLYAIATVTGTERDPQCRSQQIATLEDAGIAVVSSLPEATLLAAALIHPLSPAAQQHTPSLLENVAVINIGLRSFALELQSASKPVVHYQWSPVAGGNKKLARLLERLQ >CP028702|911960:965264|923731_925033_+|AVZ48056.1|DBSCAN-SWA MFNFAVSRESLLSGFQWFFFIFCNTVVVPPTLLSAFQLPQSSLLTLTQYAFLATALACFAQAFCGHRRAIMEGPGGLWWGTILTITLGEASRGTPINDIATSLAVGIALSGVLTMLIGFSGLGHRLARLFTPSVMVLFMLMLGAQLTTIFFKGMLGLPFGIADPNFKIQLPPFALSVAVMCLVLAMIIFLPQRFARYGLLVGTITGWLLWYFCFPSSHSLSGELHWQWFPLGSGGALSPGIILTAVITGLVNISNTYGAIRGTDVFYPQQGAGNTRYRRSFVATGFMTLITVPLAVIPFSPFVSSIGLLTQTGDYTRRSFIYGSVICLLVALVPALTRLFCSIPLPVSSAVMLVSYLPLLFSALVFSQQITFTARNIYRLALPLFVGIFLMALPPVYLQDLPLTLRPLLSNGLLVGILLAVLMDNLIPWERIE >CP028702|911960:965264|956087_956258_+|AVZ48089.1|DBSCAN-SWA MATPLIRVMNGHIYRVPNRRKRKPELKPSEIPTLLGYTASLVDKKWLRLAARRSHG >CP028702|911960:965264|913123_914050_-|AVZ48048.1|DBSCAN-SWA MFDPETLRTFIAVAETGSFSKAAERLCKTTATISYRIKLLEENTGVALFFRTTRSVTLTAAGEHLLSQARDWLSWLESMPSELQQVNDGVERQVNIVINNLLYNPQAVAQLLAWLNERYPFTQFHISRQIYMGVWDSLLYEGFSLAIGVTGTEALANTFSLDPLGSVQWRFVMAADHPLANVEEPLTEAQLRRFPAVNIEDSARTLTKRVAWRLPGQKEIIVPDMETKIAAHLAGVGIGFLPKSLCQSMIDNQQLVSRVIPTMRPPSPLSLAWRKFGSGKAVEDIVTLFTQRRPEISGFLEIFGNPRS >CP028702|911960:965264|927223_928459_-|AVZ48059.1|DBSCAN-SWA MITHFRQAIEETLPWLSSFGADPAGGMTRLLYSPEWLETQQQFKKRMAASGLETRFDEVGNLYGRLNGTEYPQEVVLSGSHIDTVVNGGNLDGQFGALAAWLAIDWLKTQYGAPLRTVEVVAMAEEEGSRFPYVFWGSKNIFGLANPDDVRNICDAKGNSFVDAMKACGFTLPNAPLTPRQDIKAFVELHIEQGCVLESNGQSIGVVNAIVGQRRYTVTLNGESNHAGTTPMGYRRDTVYAFSRICHQSVEKAKRMGDPLVLTFGKVEPRPNTVNVVPGKTTFTIDCRHTDAAVLRDFTQQLENDMRAICDEMDIGIDIDLWMDEEPVPMNKELVATLTELCEREKLNYRVMHSGAGHDAQIFAPRVPTCMIFIPSINGISHNPAERTNITDLAEGVKTLALMLYQLAWQK >CP028702|911960:965264|932793_933609_+|AVZ48063.1|DBSCAN-SWA MTIIHPLLASSSAPNYRQSWRLAGVWRRAINLMTESGELLTLHRQGSGFGPGGWVLRRAQFDALCGGLCGNERPQVVAQGIRLGRFTVKQPQRYCLLRITPPAHPQPLAAAWMQRAEETGLFGPLALAASDPLPAELRQFRHCFQAALNGVKTDWRHWLGKGPGLTPSHDDTLSGMLLAAWYYGALDARSGRPFFACSDNLQLVTTAVSVSYLRYAAQGYFASPLLHFVHALSCPKRTAVAIDSLLALGHTSGADTLLGFWLGQQLLQGKP >CP028702|911960:965264|951481_951814_+|AVZ48083.1|DBSCAN-SWA MNPYIYLGGAILAEVIGTTLMKFSEGFTRLWPSVGTIICYCASFWLLAQTLAYIPTGIAYAIWSGVGIVLISLLSWGFFGQRLDLPAIIGMMLICAGVLIINLLSRSTPH >CP028702|911960:965264|963046_963253_+|AVZ48102.1|DBSCAN-SWA MNKEQSADDPSVDLIRVKNMLNSTISMSYPDVVIACIEHKVSLEAFRAIEAALVKHDNNMKDYSLVVD >CP028702|911960:965264|942140_942833_+|AVZ48074.1|DBSCAN-SWA MMTKIKLLMLIIFYLIISASAHAAGGIALGATRIIYPADAKQTAVWIRNSHTNERFLVNSWIENSSGVKEKSFIITPPLFVSEPKSENTLRIIYTGPPLAADRESLFWMNVKTIPSVDKNALNGRNVLQLAILSRMKLFLRPIQLQELPAEAPDTLKFSRSGNYINVHNPSPFYVTLVNLQVGSQKLGNAMAAPRVNSQIPLPSGVQGKLKFQTVNDYGSVTPVREVNLN >CP028702|911960:965264|917538_918315_+|AVZ48052.1|DBSCAN-SWA MLRFSANLSMLFGEYDFLARFEKAAQCGFRGVEFMFPYDYDIEELKHVLASNKLEHTLHNLPAGDWAAGERGIACIPGREEEFRDGVAAAIRYARALGNKKINCLVGKTPAGFSSEQIHATLVENLRYAANMLMKEDILLLIEPINHFDIPGFHLTGTRQALKLIDDVGCCNLKIQYDIYHMQRMEGELTNTMTQWADKIGHLQIADNPHRGEPGTGEINYDYLFKVIENSDYNGWVGCEYKPQTTTEAGLRWMDPYR >CP028702|911960:965264|925054_926200_+|AVZ48057.1|DBSCAN-SWA MKIVIAPDSFKESLSAEKCCQAIKAGFSTLFPDANYICLPIADGGEGTVDAMVAATGGNIVTLEVCGPMGEKVNAFYGLTGDGKTAVIEMAAASGLMLVAPEKRNPLLASSFGTGELIRHALDNDIRHIILGIGGSATVDGGMGMAQALGVRFLDADGQALAANGGNLARVASIEMDECDPRLANCHIEVACDVDNPLVGARGAAAVFGPQKGATPEMVEELEQGLQNYARVLQQQTEINVCQMAGGGAAGGMGIAAAVFLNADIKPGIEIVLNAVNLAQAVQGAALVITGEGRIDSQTAGGKAPLGVASVAKQFNVPVIGIAGVLGDGVEVVHQYGIDAVFSILPRLAPLAEVLASGETNLFNSARNIACAIKIGQGIKN >CP028702|911960:965264|962350_962761_-|AVZ48101.1|DBSCAN-SWA MAQVAIFKEIFDQVRKDLNCELFYSELKRHNVSHYIYYLATDNIHIVLENDNTVLIKGLKKVVNVKFSRNTHLIETSYDRLKSREITFQQYRENLAKAGVFRWITNIHEHKRYYYTFDNSLLFTESIQNTTQIFPR >CP028702|911960:965264|955632_956088_+|AVZ48088.1|DBSCAN-SWA MNLSQDGIKLHRGNFTAIGRQIQPYLEEGKCFRMVLKPWREKRSLSQNALSHMWYSEISEYLISRGKTFATPAWVKDALKHTYLGYETKDLVDVVTGDITTIQSLRHTSDLDTGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQEA >CP028702|911960:965264|957122_957506_+|AVZ48093.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLITSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVVGMTFMSLAGKHCCSDGYIGKRLQKAEGIIEGMLMALDIRLEMDIVVNNSN >CP028702|911960:965264|960564_960780_+|AVZ48097.1|lysis|DBSCAN-SWA MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >CP028702|911960:965264|936384_937107_-|AVZ48067.1|DBSCAN-SWA MATLFIADLHLCVEEPAITAGFLRFLAGEARKADALYILGDLFEAWIGDDDPNPLHRKMAAAIKAVSDSGVPCYFIHGNRDFLLGKRFARESGMTLLPEEKVLELYGRRVLIMHGDTLCTDDAGYQAFRAKVHKPWLQTLFLALPLFVRKRIAARMRANSKEANSSKSLAIMDVNQNAVVSAMEKHQVQWLIHGHTHRPAVHELIANQQPAFRVVLGAWHTEGSMVKVTADDVELIHFPF >CP028702|911960:965264|941378_941921_+|AVZ48073.1|DBSCAN-SWA MKLRFISSALAAALFAATGSYAAVVDGGTIHFEGELVNAACSVNTDSADQVVTLGQYRTDIFNAVGNTSALIPFTIQLNDCDPVVAANAAVAFSGQADAINDNLLAIASSTNTTTATGVGIEILDNTSAILKPDGNSFSTNQNLIPGTNVLHFSARYKGTGTSASAGQANADATFIMRYE >CP028702|911960:965264|920356_921565_+|AVZ48054.1|transposase|DBSCAN-SWA MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWVLATNLPVEIRTPKQLVNIYSKRMQIEETFRDLKSPAYGLGLRHSRTSSSERFDIMLLIALMLQLTCWLAGVHAQKQGWDKHFQANTVRNRNVLSTVRLGMEVLRHSGYTITREDLLVAATLLAQNLFTHGYALGKL >CP028702|911960:965264|947981_949145_-|AVZ48079.1|integrase|DBSCAN-SWA MSLFRRNEIWYASYSLPGGKRIKESLGTKDKRQAQELHDKRKAELWRVEKLGDLPDVTFEEACLRWLEEKADKKSLDSDKSRIEFWLEHFEGIRLKDISEAKIYSAVSRMHNRKTKEIWKQKVQAAIRKGKELPVYEPKPVSTQTKAKHLAMIKAILRAAERDWKWLEKAPVIKIPAVRNKRVRWLEKEEAKRLIDECPEPLKSVVKFALATGLRKSNIINLEWQQIDMQRRVAWVNPEESKSNRAIGVALNDTACKVLRDQIGKHHKWVFVHTKAAKRADGTSTPAVRKMRIDSKTSWLSACRRAGIEDFRFHDLRHTWASWLIQSGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHARKIDDIFGDNVPNMSHSEIMEDIKKA >CP028702|911960:965264|946496_947012_+|AVZ48077.1|DBSCAN-SWA MRRVLFSCFCGLLWSSSGWAVDPLGTININLHGNVVDFSCTVNTADIDKTVDLGRWPTTQLLNAGDTTALVPFSLRLEGCPPGSVAILFTGTPASDTNLLALDDPAMAQTVAIELRNSDRSRLALGEASPTEEVDANGNVTLNFFANYRALASGVRPGVAKADAIFMINYN >CP028702|911960:965264|937777_939163_+|AVZ48069.1|tRNA|DBSCAN-SWA MLKIFNTLTRQKEEFKPIHAGEVGMYVCGITVYDLCHIGHGRTFVAFDVVARYLRFLGYKLKYVRNITDIDDKIIKRANENGESFVAMVDRMIAEMHKDFDALNILRPDMEPRATHHIAEIIELTEQLIAKGHAYVADNGDVMFDVPTDPTYGVLSRQDLDQLQAGARVDVVDDKRNPMDFVLWKMSKEGEPSWPSPWGAGRPGWHIECSAMNCKQLGNHFDIHGGGSDLMFPHHENEIAQSTCAHDGQYVNYWMHSGMVMVDREKMSKSLGNFFTVRDVLKYYDAETVRYFLMSGHYRSQLNYSEENLKQARAALERLYTALRGTDKTVAPAGGEAFEARFIEAMDDDFNTPEAYSVLFDMAREVNRLKAEDMAAANAMASHLRKLSAVLGLLEQEPEAFLQSGAQADDSEVAEIEALIQQRLDARKAKDWAAADAARDRLNEMGIVLEDGPQGTTWRRK >CP028702|911960:965264|963417_963612_-|AVZ48103.1|DBSCAN-SWA MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSEKRAYTSIKSDEE >CP028702|911960:965264|940041_940908_-|AVZ48072.1|DBSCAN-SWA MAAKIIDGKTIAQQVRSEVAQKVQARIAAGLRAPGLAVVLVGSNPASQIYVASKRKACEEVGFVSRSYDLPETTSEAELLELIDTLNADNTIDGILVQLPLPAGIDNVKVLERIHPDKDVDGFHPYNVGRLCQRAPRLRPCTPRGIVTLLERYNIDTFGLNAVVIGASNIVGRPMSMELLLAGCTTTVTHRFTKNLRHHVENADLLIVAVGKPGFIPGDWIKEGAIVIDVGINRLENGKVVGDVVFEDAAKRASYITPVPGGVGPMTVATLIENTLQACVEYHDPQDE >CP028702|911960:965264|949850_949946_+|AVZ48081.1|DBSCAN-SWA MTGKEAIIHYLGTHKSFCAQDVAAVTGATVI >CP028702|911960:965264|960779_961277_+|AVZ48098.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIMLGKTYTKAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSTLLRKINQGDIKGACDQLRRWTYAGGKQWKGLMTRREIEREVCLWGQQ >CP028702|911960:965264|933605_934499_+|AVZ48064.1|DBSCAN-SWA MKTLVVALGGNALLQRGEALTAENQYRNIASAVPALARLARSYRLAIVHGNGPQVGLLALQNLAWKEVEPYPLDVLVAESQGMIGYMLAQSLSAQPQMPPVTTVLTRIEVSPDDPAFLQPEKFIGPVYQPEEQEALEAAYGWQMKRDGKYLRRVVASPQPRKILDSEAIELLLKEGHVVICSGGGGVPVTDDGAGSEAVIDKDLAAALLAEQINADGLVILTDADAVYENWGTPQQRAIRHATPDELAPFAKADGSMGPNVTAVSGYVRSRGKPAWIGALSRIEETLAGEAGTCISL >CP028702|911960:965264|934693_935761_-|AVZ48065.1|DBSCAN-SWA MKQVCVLGNGQLGRMLRQAGEPLGIAVWPVGLDAEPAAVPFQQSVITAEIERWPETALTRELARHPAFVNRDVFPIIADRLTQKQLFDKLHLPTAPWQLLAERSEWPAVFDRLGELAIVKRRTGGYDGRGQWRLRANETEQLPAECYGECIVEQGINFSGEVSLVGARGFDGSTVFYPLTHNLHQDGILRTSVAFPQANAQQQAQAEEMLSAIMQELGYVGVMAMECFVTPQGLLINELAPRVHNSGHWTQNGASISQFELHLRAITDLPLPQPVVNNPSVMINLIGSDVNYDWLKLPLVHLHWYDKEVRPGRKVGHLNLTDSDTSRLTATLEALIPLLPPEYASGVIWAQSKFG >CP028702|911960:965264|937109_937604_-|AVZ48068.1|DBSCAN-SWA MVTFHTNHGDIVIKTFDDKAPETVKNFLDYCREGFYNNTIFHRVINGFMIQGGGFEPGMKQKATKEPIKNEANNGLKNTRGTLAMARTQAPHSATAQFFINVVDNDFLNFSGESLQGWGYCVFAEVVDGMDVVDKIKGVATGRSGMHQDVPKEDVIIESVTVSE >CP028702|911960:965264|926427_927213_-|AVZ48058.1|DBSCAN-SWA MGYLNNVTGYREDLLANRAIVKHGNFALLTPDGLVKNIIPGFENCDATILSTPKLGASFVDYLVTLHQNGGNQQGFGGEGIETFLYVISGNITAKAEGKTFALSEGGYLYCPPGSLMTFVNAQAEDSQIFLYKRRYVPVEGYAPWLVSGNASELERIHYEGMDDVILLDFLPKELGFDMNMHILSFAPGASHGYIETHVQEHGAYILSGQGVYNLDNNWIPVKKGDYIFMGAYSLQAGYGVGRGEAFSYIYSKDCNRDVEI >CP028702|911960:965264|964000_964546_+|AVZ48105.1|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELLQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIPGLLSEYIEQSG >CP028702|911960:965264|931523_932783_+|AVZ48062.1|DBSCAN-SWA MFTSVAQANAAVIEQIRRARPHWLDVQPASSLISELNEGKTLLHAGPPMRWQEMTGPMKGACVGACLFEGWAKDEAQALAILEQGEVNFIPCHHVNAVGPMGGITSASMPMLVVENVTDGNRAYCNLNEGIGKVMRFGAYGEDVLTRHRWMRDVLMPVLSAALGRMERGIDLTAMMAQGITMGDEFHQRNIASSALLMRALAPQIARLDHDKQHIAEVMDFLSVTDQFFLNLAMAYCKAAMDAGAMIRAGSIVTAMTRNGNMFGIRVSGLGERWFTAPVNTPQGLFFTGFSQEQANPDMGDSAITETFGIGGAAMIAAPGVTRFVGAGGMEAARAVSEEMAEIYLERNMQLQIPSWDFQGACLGLDIRRVVETGITPLINTGIAHKEAGIGQIGAGTVRAPLACFEQALEALAESMGIG >CP028702|911960:965264|942863_945467_+|AVZ48075.1|DBSCAN-SWA MKIPTTTDIPQRYTWCLAGICYSSLAILPSFLSYAESYFNPAFLLENGTSVADLSRFERGNHQPAGVYRVDLWRNDEFIGSQDIVFESTTENTGDKSGGLMPCFNQVLLERIGLNSSAFPELAQQQNNKCINLLKAVPDATINFDFAAMRLNITIPQIALLSSAHGYIPPEEWDEGIPALLLNYNFTGNRGNGNDSYFFSELSGINIGPWRLRNNGSWNYFRGNGYHSEQWNNIGTWVQRAIIPLKSELVMGDGNTGSDIFDGVGFRGVRLYSSDNMYPDSQQGFAPTVRGIARTAAQLTIRQNGFIIYQSYVSPGAFEITDLHPTSSNGDLDVTIDERDGNQQNYTIPYSTVPILQREGRFKFDLTAGDFRSGNSQQSSPFFFQGTALGGLPQEFTAYGGTQLSANYTAFLLGLGRNLGNWGAVSLDVTHARSQLADASRHEGDSIRFLYAKSMNTFGTNFQLMGYRYSTQGFYTLDDVAYRRMEGYEYDYDGEHRDEPIIVNYHNLRFSRKDRLQLNVSQSLNDFGSLYISGTHQKYWNTSDSDTWYQVGYTSSWVGISYSLSFSWNESVGIPDNERIVGLNVSVPFNVLTKRRYTRENALDRAYASFNANRNSNGQNSWLAGVGGTLLEGHNLSYHVSQGDTSNNGYTGSATANWQAAYGTLGGGYNYDRDQHDVNWQLSGGVVGHENGITLSQPLGDTNVLIKAPGAGGVRIENQTGILTDWRGYAVMLYATVYRYNRIALDTNTMGNSIDVEKNISSVVPTQGALVRANFDTRIGVRALITVTQGGKPVPFGSLVRENSTGITSMVGDDGQVYLSGAPLSGELLVQWGDGANSRCIAHYVLPKQSLQQAVTVISAVCTHPGS >CP028702|911960:965264|911960_913055_-|AVZ48047.1|tRNA|DBSCAN-SWA MQERHTEQDYRALLIADTPIIDVRAPIEFEHGAMPAAINLPLMNNDERAAVGTCYKQQGSDAALALGHKLVAGEIRQQRMDAWRAACLQNPQGILCCARGGQRSHIVQSWLHAAGIDYPLVEGGYKALRQTAIQATIELAQKPIVLIGGCTGSGKTLLVQQQPNGVDLEGLARHRGSAFGRTLQPQLSQASFENLLAAEMLKTDARQNLRLWVLEDESRMIGSNHLPECLRERMTQAAIAVVEDPFEIRLERLNEEYFLRMHHDFTHAYGDEQGWQEYCEYLHHGLSAIKRRLGLQRYNELAARLDAALTTQLTTGSTDGHLAWLVPLLEEYYDPMYRYQLEKKAEKVVFRGEWAEVAEWVKAR >CP028702|911960:965264|954620_955418_+|AVZ48086.1|DBSCAN-SWA MLARGKTNLKIEEIRMHKHHEIHRVKPLMPALCRIRQGKKVINWETHTLTVDNNQIILFPCGYEFYIENYPEAGLYLAEMLYLPIDLIESFQKLYTVTDQIRNKTSFFLPQNPELIYCWEQLKTSVSRGFSTKIQEHLAMGVLLSLGVNHVNHLLLSYSKQSLISRCYNLLLSEPGTKWTANKVARYLYISVSTLHRRLASEGVSFQSILDDVRLNNALSAIQTTVKPISEIARENGYKCPSRFTERFHNRFNITPREIRKASRE >CP028702|911960:965264|914279_914762_+|AVZ48049.1|DBSCAN-SWA MKLQVLPLSQEAFSAYGDVIETQQRDFFHINNGLVERYHDLALVEILEQDCTLISINRAQPANLPLTIHELERHPLGTQAFIPMKGEVFVVVVALGDDKPDLSTLRAFITNGEQGVNYHRNVWHHPLFAWQRVTDFLTIDRGGSDNCDVESIPEQELCFA >CP028702|911960:965264|952068_953595_+|AVZ48084.1|DBSCAN-SWA MKKAIAYMRFSSPGQMSGDSLNRQRRLIAEWLKVNSDYYLDTITYEDLGLSAFKGKHAQSGAFSEFLDAIEHGYILPGTTLLVESLDRLSREKVGEAIERLKLILNHGIDVITLCDNTVYNIDSLNEPYSLIKAILIAQRANEESEIKSSRVKLSWKKKRQDALESGTIMTASCPRWLSLDDKRTAFVPDPDRVKTIELIFKLRMERRSLNAIAKYLNDHAVKNFSGKESAWGPSVIEKLLANKALIGICVPSYRARGKGISEIAGYYPRVISDDLFYAVQEIRLAPFGISNSSKNPMLINLLRTVMKCEACGNTMIVHAVSGSLHGYYVCPMRRLHRCDRPSIKRDLVDYNIINELLFNCSKIQPVENKKDANETLELKIIELQMKINNLIVALSVAPEVTAIAEKIRLLDKELRRASVSLKTLKSKGVNSFSDFYAIDLTSKNGRELCRTLAYKTFEKIIINTDNKTCDIYFMNGIVFKHYPLMKVISAQQAISALKYMVDGEIYF >CP028702|911960:965264|928480_929530_-|AVZ48060.1|DBSCAN-SWA MKISRETLHQLIENKLCQAGLKREHAATVAEVLVYADARGIHSHGAVRVEYYAERISKGGTNREPEFRLEETGPCSAILHADNAAGQVAAKMGMEHAIKTAQQNGVAVVGISRMGHSGAISYFVQQAARAGFIGISMCQSDPMVVPFGGAEIYYGTNPLAFAAPGEGDEILTFDMATTVQAWGKVLDARSRNMSIPDTWAVDKNGVPTTDPFAVHALLPAAGPKGYGLMMMIDVLSGVLLGLPFGRQVSSMYDDLHAGRNLGQLHIVINPNFFSSSELFRQHLSQTMRELNAITPAPGFNQVYYPGQDQDIKQRKAAVEGIEIVDDIYQYLISDALYNTSYETKNPFAQ >CP028702|911960:965264|957647_957866_+|AVZ48094.1|DBSCAN-SWA MLGRQRLPQHIFFVIRLELVNQTYSNDVISAYTECFREGANKRGNSSRLTQSFHFFMFEPIFSPVNALNQPI >CP028702|911960:965264|939827_940040_-|AVZ48071.1|DBSCAN-SWA MATFSLGKHPHVELCDLLKLEGWSESGAQAKIAIAEGQVKVDGAVETRKRCKIVAGQTVSFAGHSVQVVA |
60 | Enterobacteria_phage(53.57%) | integrase,lysis,terminase,transposase,tRNA | attL 947922:947968|attR 969224:969270 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1190582 : 1240927
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP028702|1190582:1240927|DBSCAN-SWA ATTATTTGATTTCAATTTTGTCCCACTCCCTGCCTCTGTCATCACGATACTGTGATGCCATGGTGTCCGACTTATGCCCGAGAAGATGTTGAGCAAACTTATCGCTTATCTGCTTCTCATAGAGTCTTGCAGACAAACTGCGCAACTCGTGAAAGGTAGGCGGATCCCCTTCGAAGGAAAGACCTGATGCTTTTCGTGCGCGCATAAAATACCTTGATACTGTGCCGGATGAAAGCGGTTCGCGACGAGTAGATGCAATTATGGTTTCTCCGCCAAGAATCTCTTTGCATTTATCAAGTGTTTCCTTCATTGATATTCCGAGAGCATCAATATGCAATGCTGTTGGGATGGCAATTTTTACGCCTGTTTTGCTTTGCTCGACATAAAGATATCCATCTACGATATCAGACCACTTCATTTCGCATAAATCACCAACTCGTTGCCCGGTAACAACAGCCAGTTCCATTGCAAGTCTGAGCCAACATGGTGATGATTCTGCTGCTTGATAAATTTTCAGGTATTCGTCAGCCGTAAGTCTTGATCTCCTTACCTCTGATTTTGCTGCGCGAGTGGCAGCGACATGGTTTGTTGTTATATGGCCTTCAGCTATTGCCTCTCGGAATGCATCGCTCAGTGTTGATCTGATTAACTTGGCTGACGCCGCCTTGCCCTCGTCTATGTATCCATTGAGCATTGCCGCAATTTCTTTTGTGGTGATGTCTTCAAGTGGAGCATCAGGCAGACCCCTCCTTATTGCTTTAATTTTGCTCATGTAATTTATGAGTGTCTTCTGCTTGATTCCTCTGCTGGCCAGGATTTTTTCGTAGCGATCAAGCCATGAATGTAACGTAACGGAATTATCACTGTTGATTCTCGCTGTCAGAGGCTTGTGTTTGTGTCCTGAAAATAACTCAATGTTGGCCTGTATAGCTTCAGTGATTGCGATTCGCCTGTCTCTGCCTAATCCAAACTCTTTACCCGTCCTTGGGTCCCTGTAGCAGTAATATCCATTGTTTCTTATATAAAGGTTAGGGGGTAAATCCCGGCGCTCATGACTTCGCCTTCTTCCCATTTCTGATCCTCTTCAAAAGGCCACCTGTTACTGGTCGATTTAAGTCAACCTTTACCGCTGATTCGTGGAACAGATACTCTCTTCCATCCTTAACCGGAGGTGGGAATATCCTGCATTCCCGAACCCATCGACGAACTGTTTCAAGGCTTCTTGGACGTCGCTGGCGTGCGTTCCACTCCTGAAGTGTCAAGTACATCGCAAAGTCTCCGCAATTACACGCAAGAAAAAACCGCCATCAGGCGGCTTGGTGTTCTTTCAGTTCTTCAATTCGAATATTGGTTACGTCTGCATGTGCTATCTGCGCCCATATCATCCAGTGGTCGTAGCAGTCGTTGATGTTCTCCGCTTCGATAACTCTGTTGAATGGCTCTCCATTCCATTCTCCTGTGACTCGGAAGTGCATTTATCATCTCCATAAAACAAAACCCGCCGTAGCGAGTTCAGATAAAATAAATCCCCGCGAGTGCGAGGATTGTTATGTAATATTGGGTTTAATCATCTATATGTTTTGTACAGAGAGGGCAAGTATCGTTTCCACCGTACTCGTGATAATAATTTTGCACGGTATCAGTCATTTCTCGCACATTGCAGAATGGGGATTTGTCTTCATTAGACTTATAAACCTTCATGGAATATTTGTATGCCGACTCTATATCTATACCTTCATCTACATAAACACCTTCGTGATGTCTGCATGGAGACAAGACACCGGATCTGCACAACATTGATAACGCCCAATCTTTTTGCTCAGACTCTAACTCATTGATACTCATTTATAAACTCCTTGCAATGTATGTCGTTTCAGCTAAACGGTATCAGCAATGTTTATGTAAAGAAACAGTAAGATAATACTCAACCCGATGTTTGAGTACGGTCATCATCTGACACTACAGACTCTGGCATCGCTGTGAAGACGACGCGAAATTCAGCATTTTCACAAGCGTTATCTTTTACAAAACCGATCTCACTCTCCTTTGATGCGAATGCCAGCGTCAGACATCATATGCAGATACTCACCTGCATCCTGAACCCATTGACCTCCAACCCCGTAATAGCGATGCGTAATGATGTCGATAGTTACTAACGGGTCTTGTTCGATTAACTGCCGCAGAAACTCTTCCAGGTCACCAGTGCAGTGCTTGATAACAGGAGTCTTCCCAGGATGGCGAACAACAAGAAACTGGTTTCCGTCTTCACGGACTTCGTTGCTTTCCAGTTTAGCAATACGCTTACTCCCATCCGAGATAACACCTTCGTAATACTCACGCTGCTCGTTGAGTTTTGATTTTGCTGTTTCAAGCTCAACACGCAGTTTCCCTACTGTTAGCGCAATATCCTCGTTCTCCTGGTCGCGGCGTTTGATGTATTGCTGGTTTCTTTCCCGTTCATCCAGCAGTTCCAGCACAATCGATGGTGTTACCAATTCATGGAAAAGGTCTGCGTCAAATCCCCAGTCGTCATGCATTGCCTGCTCTGCCGCTTCACGCAGTGCCTGAGAGTTAATTTCGCTCACTTCGAACCTCTCTGTTTACTGATAAGTTCCAGATCCTCCTGGCAACTTGCACAAGTCCGACAACCCTGAACGACCAGGCGTCTTCGTTCATCTATCGGATCGCCACACTCACAACAATGAGTGGCAGATATAGCCTGGTGGTTCAGGCGGCGCATTTTTATTGCTGTGTTGCGCTGTAATTCTTCTATTTCTGATGCTGAATCAATGATGTCTGCCATCTTTCATTAATCCCTGAACTGTTGGTTAATACGCTTGAGGGTGAATGCGAATAATAAAAAAGGAGCCTGTAGCTCCCTGATGATTTTGCTTTTCATGTTCATCGTTCCTTAAAGACGCCGTTTAACATGCCGATTGCCAGGCTTAAATGAGTCGGTGTGAATCCCATCAGCGTTACCGTTTCGCGGTGCTTCTTCAGTACGCTACGGCAAATGTCATCGACGTTTTTATCCGGAAACTGCTGTCTGGCTTTTTTTGATTTCAGAATTAGCCTGACGGGCAATGCTGCGAAGGGCGTTTTCCTGCTGAGGTGTCATTGAACAAGTCCCATGTCGGCAAGCATAAGCACACAGAATATGAAGCCCGCTGCCAGAAAAATGCATTCCGTGGTTGTCATACCTGGTCTCTCTCATCTGCTTCTGCTTTCGCCACCATCATTTCCAGCTTTTGTGAAAGGGATGCGGCTAACGTATGAAATTCTTCGTCTGTTTCTACTGGTATTGGCACAAACCTGATTCCAATTTGAGCAAGGCTATGTGCCATCTCGATACTCGTTCTTAACTCAACAGAAGATGCTTTGTGCATACAGCCCCTCGTTTATTATTTATCTCCTCAGCCAGCCGCTGTGCTTTCAGTGGATTTCGGATAACAGAAAGGCCGGGAAATACCCAGCCTCGCTTTGTAACGGAGTAGACGAAAGTGATTGCGCCTACCCGGATATTATCGTGAGGATGCGTCATCGCCATTGCTCCCCAAATACAAAACCAATTTCAGCCAGTGCCTCGTCCATTTTTTCGATGAACTCCGGCACGATCTCGTCAAAACTCGCCATGTACTTTTCATCCCGCTCAATCACGACATAATGCAGGCCTTCACGCTTCATACGCGGGTCATAGTTGGCAAAGTACCAGGCATTTTTTCGCGTCACCCACATGCTGTACTGCACCTGGGCCATGTAAGCTGACTTTATGGCCTCGAAACCACCGAGCCGGAACTTCATGAAATCCCGGGAGGTAAACGGGCATTTCAGTTCAAGGCCGTTGCCGTCACTGCATAAACCATCGGGAGAGCAGGCGGTACGCATACTTTCGTCGCGATAGATGATCGGGGATTCAGTAACATTCACGCCGGAAGTGAATTCAAACAGGGTTCTGGCGTCGTTCTCGTACTGTTTTCCCCAGGCCAGTGCTTTAGCGTTAACTTCCGGAGCCACACCGGTGCAAACCTCAGCAAGCAGGGTGTGGAAGTAGGACATTTTCATGTCAGGCCACTTCTTTCCGGAGCGGGGTTTTGCTATCACGTTGTGAACTTCTGAAGCGGTGATGACGCCGAGCCGTAATTTGTGCCACGCATCATCCCCCTGTTCGACAGCTCTCACATCGATCCCGGTACGCTGCAGGATAATGTCCGGTGTCATGCTGCCACCTTCTGCTCTGCGGCTTTCTGTTTCAGGAATCCAAGAGCTTTTACTGCTTCGGCCTGTGTCAGTTCTGACGATGCACGAATGTCGCGGCGAAATATCTGGGAACAGAGCGGCAATAAGTCGTCATCCCATGTTTTATCCAGGGCGATCAGCAGAGTGTTAATCTCCTGCATGGTTTCATCGTTAACCGGAGTGATGTCGCGTTCCGGCTGACGTTCTGCAGTGTATGCAGTATTTTCGACAATGCGCTCGGCTTCATCCTTGTCATAGATACCAGCAAATCCGAAGGCCAGACGGGCACACTGAATCATGGCTTTATGACGTAACATCCGTTTGGGATGCGACTGCCACGGCCCCGTGATTTCTCTGCCTTCGCGAGTTTTGAATGGTTCGCGGCGGCATTCATCCATCCATTCGGTAACGCAGATCGGATGATTACGGTCCTTGCGGTAAATCCGGCATGTACAGGATTCATTGTCCTGCTCAAAGTCCATGCCATCAAACTGCTGGTTTTCATTGATGATGCGGGACCAGCCATCAACGCCCACCACCGGAACGATGCCATTCTGCTTATCAGGAAAGGCGTAAATTTCTTTCGTCCACGGATTAAGGCCGTACTGGTTGGCAACGATCAGTAATGCGATGAACTGCGCATCGCTGGCATCACCTTTAAATGCCGTCTGGCGAAGAGTGGTGATCAGTTCCTGTGGGTCGACAGAATCCATGCCGACACGTTCAGCCAGCTTCCCAGCCAGCGTTGCGAGTGCAGTACTCATTCGTTTTATACCTCTGAATCAATATCAACCTGGTGGTGAGCAATGGTTTCAACCATGTACCGGATGTGTTCTGCCATGCGCTCCTGAAACTCAACATCGTCATCAAACGCACGGGTAATGGATTTTTTGCTGGCCCCGTGGCGTTGCAAATGATCGATGCATAGCGATTCAAACAGGTGCTGGGGCAGGCCTTTTTCCATGTCGTCTGCCAGTTCTGCCTCTTTCTCTTCACGGGCGAGCTGCTGGTAGTGACGCGCCCAGCTCTGAGCCTCAAGACGATCCTGAATGTAATAAGCGTTCATGGCTGAACTCCTGAAATAGCTGTGAAAATATCGCCCGCGAAATGCCGGGCTGATTAGGAAAACAGGAAAGGGGGTTAGTGAATGCTTTTGCTTGATCTCAGTTTCAGTATTAATATCCATTTTTTATAAGCGTCGACGGCTTCACGAAACATCTTTTCATCGCCAATAAAAGTGGCGATAGTGAATTTAGTCTGGATAGCCATAAGTGTTTGATCCATTCTTTGGGACTCCTGGCTGATTAAGTATGTCGATAAGGCGTTTCCATCCGTCACGTAATTTACGGGTGATTCGTTCAAGTAAAGATTCGGAAGGGCAGCCAGCAACAGGCCACCCTGCAATGGCATATTGCATGGTGTGCTCCTTATTTATACATAACGAAAAACGCCTCGAGTGAAGCGTTATTGGTATGCGGTAAAACCGCACTCAGGCGGCCTTGATAGTCATATCATCTGAATCAAATATTCCTGATGTATCGATATCGGTAATTCTTATTCCTTCGCTACCATCCATTGGAGGCCATCCTTCCTGACCATTTCCATCATTCCAGTCGAACTCACACACAACACCATATGCATTTAAGTCGCTTGAAATTGCTATAAGCAGAGCATGTTGCGCCAGCATGATTAATACAGCATTTAATACAGAGCCGTGTTTATTGAGTCGGTATTCAGAGTCTGACCAGAAATTATTAATCTGGTGAAGTTTTTCCTCTGTCATTACGTCATGGTCGATTTCAATTTCTATTGATGCTTTCCAGTCGTAATCAATGATGTATTTTTTGATGTTTGACATCTGTTCATATCCTCACAGATAAAAAATCGCCCTCACACTGGAGGGCAAAGAAGATTTCCAATAATCAGAACAAGTCGGCTCCTGTTTAGTTACGAGCGACATTGCTCCGTGTATTCACTCGTTGGAATGAATACACAGTGCAGTGTTTATTCTGTTATTTATGCCAAAAATAAAGGCCACTATCAGGCAGCTTTGTTGTTCTGTTTACCAAGTTCTCTGGCAATCATTGCCGTCGTTCGTATTGCCCATTTATCGACATATTTCCCATCTTCCATTACAGGAAACATTTCTTCAGGCTTAACCATGCATTCCGATTGCAGCTTGCATCCATTGCATCGCTTGAATTGTCCACACCATTGATTTTTATCAATAGTCGTAGTCATACGGATAGTCCTGGTATTGTTCCATCACATCCTGAGGATGCTCTTCGAACTCTTCAAATTCTTCTTCCATATATCACCTTAAATAGTGGATTGCGGTAGTAAAGATTGTGCCTGTCTTTTAACCACATCAGGCTCGGTGGTTCTCGTGTACCCCTACAGCGAGAAATCGGATAAACTATTACAACCCCTACAGTTTGATGAGTATAGAAATGGATCCACTCGTTATTCTCGGACGAGTGTTCAGTAATGAACCTCTGGAGAGAACCATGTATATGATCGTTATCTGGGTTGGACTTCTGCTTTTAAGCCCAGATAACTGGCCTGAATATGTTAATGAGAGAATCGGTATTCCTCATGTGTGGCATGTTTTCGTCTTTGCTCTTGCATTTTCGCTAGCAATTAATGTGCATCGATTATCAGCTATTGCCAGCGCCAGATATAAGCGATTTAAGCTAAGAAAACGCATTAAGATGCAAAACGATAAAGTGCGATCAGTAATTCAAAACCTTACAGAAGAGCAATCTATGGTTTTGTGCGCAGCCCTTAATGAAGGCAGGAAGTATGTGGTTACATCAAAACAATTCCCATACATTAGTGAGTTGATTGAGCTTGGTGTGTTGAACAAAACTTTTTCCCGATGGAATGGGAAGCATATATTATTCCCTATTGAGGATATTTACTGGACTGAATTAGTTGCCAGCTATGATCCATATAATATTGAGATAAAGCCAAGGCCAATATCTAAGTAACTAGATAAGAGGAATCGATTTTCCCTTAATTTTCTGGCGTCCACTGCATGTTATGCCGCGTTCGCCAGGCTTGCTGTACCATGTGCGCTGATTCTTGCGCTCAATACGTTGCAGGTTGCTTTCAATCTGTTTGTGGTATTCAGCCAGCACTGTAAGGTCTATCGGATTTAGTGCGCTTTCTACTCGTGATTTCGGTTTGCGATTCAGCGAGAGAATAGGGCGGTTAACTGGTTTTGCGCTTACCCCAACCAACAGGGGATTTGCTGCTTTCCATTGAGCCTGTTTCTCTGCGCGACGTTCGCGGCGGCGTGTTTGTGCATCCATCTGGATTCTCCTGTCAGTTAGCTTTGGTGGTGTGTGGCAGTTGTAGTCCTGAACGAAAACCCCCCGCGATTGGCACATTGGCAGCTAATCCGGAATCGCACTTACGGCCAATGCTTCGTTTCGTATCACACACCCCAAAGCCTTCTGCTTTGAATGCTGCCCTTCTTCAGGGCTTAATTTTTAAGAGCGTCACCTTCATGGTGGTCAGTGCGTCCTGCTGATGTGCTCAGTATCACCGCCAGTGGTATTTATGTCAACACCGCCAGAGATAATTTATCACCGCAGATGGTTATCTGTATGTTTTTTATATGAATTTATTTTTTGCAGGGGGGCATTGTTTGGTAGGTGAGAGATCTGAATTGCTATGTTTAGTGAGTTGTATCTATTTATTTTTCAATAAATACAATTGGTTATGTGTTTTGGGGGCGATCGTGAGGCAAAGAAAACCCGGCGCTGAGGCCGGGTTATTCTTGTTCTCTGGTCAAATTATATAGTTGGAAAACAAGGATGCATATATGAATGAACGATGCAGAGGCAATGCCGATGGCGATAGTGGGTATCATGTAGCCGCTTATGCTGGAAAGAAGCAATAACCCGCAGAAAAACAAAGCTCCAAGCTCAACAAAACTAAGGGCATAGACAATAACTACCGATGTCATATACCCATACTCTCTAATCTTGGCCAGTCGGCGCGTTCTGCTTCCGATTAGAAACGTCAAGGCAGCAATCAGGATTGCAATCATGGTTCCTGCATATGATGACAATGTCGCCCCAAGACCATCTCTATGAGCTGAAAAAGAAACACCAGGAATGTAGTGGCGGAAAAGGAGATAGCAAATGCTTACGATAACGTAAGGAATTATTACTATGTAAACACCAGGCATGATTCTGTTCCGCATAATTACTCCTGATAATTAATCCTTAACTTTGCCCACCTGCCTTTTAAAACATTCCAGTATATCACTTTTCATTCTTGCGTAGCAATATGCCATCTCTTCAGCTATCTCAGCATTGGTGACCTTGTTCAGAGGCGCTGAGAGATGGCCTTTTTCTGATAGATAATGTTCTGTTAAAATATCTCCGGCCTCATCTTTTGCCCGCAGGCTAATGTCTGAAAATTGAGGTGACGGGTTAAAAATAATATCCTTGGCAACCTTTTTTATATCCCTTTTAAATTTTGGCTTAATGACTATATCCAATGAGTCAAAAAGCTCCCCTTCAATATCTGTTGCCCCTAAGACCTTTAATATATCGCCAAATACAGGTAGCTTGGCTTCTACCTTCACCGTTGTTCGGCCGATGAAATGCATATGCATAACATCGTCTTTGGTGGTTCCCCTCATCAGTGGCTCTATCTGAACGCGCTCTCCACTGCTTAATGACATTCCTTTCCCGATTAAAAAATCTGTCAGATCGGATGTGGTCGGCCCGAAAACAGTTCTGGCAAAACCAATGGTGTCGCCTTCAACAAACAAAAAAGATGGGAATCCCAATGATTCGTCATCTGCGAGGCTGTTCTTAATATCTTCAACTGAAGCTTTAGAGCGATTTATCTTCTGAACCAGACTCTTGTCATTTGTTTTGGTAAAGAGAAAAGTTTTTCCATCGATTTTATGAATATACAAATAATTGGAGCCAACCTGCAGGTGATGATTATCAGCCAGCAGAGAATTAAGGAAAACAGACAGGTTTATTGAGCGCTTATCTTTCCCTTTATTTTTGCTGCGGTAAGTCGCATAAAAACCATTCTTCATAATTCAATCCATTTACTATGTTATGTTCTGAGGGGAGTGAAAATTCCCCTAATTCGATGAAGATTCTTGCTCAATTGTTATCAGCTATGCGCCGACCAGAACACCTTGCCGATCAGCCAAACGTCTCTTCAGGCCACTGACTAGCGATAACTTTCCCCACAACGGAACAACTCTCATTGCATGGGATCATTGGGTACTGTGGGTTTAGTGGTTGTAAAAACACCTGACCGCTATCCCTGATCAGTTTCTTGAAGGTAAACTCATCACCCCCAAGTCTGGCTATGCAGAAATCACCTGGCTCAACAGCCTGCTCAGGGTCAACGAGAATTAACATTCCGTCAGGAAAGCTTGGCTTGGAGCCTGTTGGTGCGGTCATGGAATTACCTTCAACCTCAAGCCAGAATGCAGAATCACTGGCTTTTTTGGTTGTGCTTACCCATCTCTCCGCATCACCTTTGGTAAAGGTTCTAAGCTCAGGTGAGAACATCCCTGCCTGAACATGAGAAAAAACAGGGTACTCATACTCACTTCTAAGTGACGGCTGCATACTAACCGCTTCATACATCTCGTAGATTTCTCTGGCGATTGAAGGGCTAAATTCTTCAACGCTAACTTTGAGAATTTTTGCAAGCAATGCGGCGTTATAAGCATTTAATGCATTGATGCCATTAAATAAAGCACCAACGCCTGACTGCCCCATCCCCATCTTGTCTGCGACAGATTCCTGGGATAAGCCAAGTTCATTTTTCTTTTTTTCATAAATTGCTTTAAGGCGACGTGCGTCCTCAAGCTGCTCTTGTGTTAATGGTTTCTTTTTTGTGCTCATACGTTAAATCTATCACCGCAAGGGATAAATATCTAACACCGTGCGTGTTGACTATTTTACCTCTGGCGGTGATAATGGTTGCATGTACTAAGGAGGTTGTATGGAACAACGCATAACCCTGAAAGATTATGCAATGCGCTTTGGGCAAACCAAGACAGCTAAAGATCTCGGCGTATATCAAAGCGCGATCAACAAGGCCATTCATGCAGGCCGAAAGATTTTTTTAACTATAAACGCTGATGGAAGCGTTTATGCGGAAGAGGTAAAGCCCTTCCCGAGTAACAAAAAAACAACAGCATAAATAACCCCGCTCTTACACATTCCAGCCCTGAAAAAGGGCATCAAATTAAACCACACCTATGGTGTATGCATTTATTTGCATACATTCAATCAATTGTTATCTAAGGAAATACTTACATATGGTTCGTGCAAACAAACGCAACGAGGCTCTACGAATCGAGAGTGCGTTGCTTAACAAAATCGCAATGCTTGGAACTGAGAAGACAGCGGAAGCTGTGGGCGTTGATAAGTCGCAGATCAGCAGGTGGAAGAGGGACTGGATTCCAAAGTTCTCAATGCTGCTTGCTGTTCTTGAATGGGGGGTCGTTGACGACGACATGGCTCGATTGGCGCGACAAGTTGCTGCGATTCTCACCAATAAAAAACGCCCGGCGGCAACCGAGCGTTCTGAACAAATCCAGATGGAGTTCTGAGGTCATTACTGGATCTATCAACAGGAGTCATTATGACAAATACAGCAAAAATACTCAACTTCGGCAGAGGTAACTTTGCCGGACAGGAGCGTAATGTGGCAGATCTCGATGATGGTTACGCCAGACTATCAAATATGCTGCTTGAGGCTTATTCGGGCGCAGATCTGACCAAGCGACAGTTTAAAGTGCTGCTTGCCATTCTGCGTAAAACCTATGGGTGGAATAAACCAATGGACAGAATCACCGATTCTCAACTTAGCGAGATTACAAAGTTACCTGTCAAACGGTGCAATGAAGCCAAGTTAGAACTCGTCAGAATGAATATTATCAAGCAGCAAGGCGGCATGTTTGGACCAAATAAAAACATCTCAGAATGGTGCATCCCTCAAAACGAGGGAAAATCCCCTAAAACGAGGGATAAAACATCCCTCAAATTGGGGGATTGCTATCCCTCAAAACAGGGGGACACAAAAGACACTATTACAAAAGAAAAAAGAAAAGATTATTCGTCAGAGAATTCTGGCGAATCCTCTGACCAGCCAGAAAACGACCTTTCTGTGGTGAAACCGGATGCTGCAATTCAGAGCGGCAGCAAGTGGGGGACAGCAGAAGACCTGACCGCCGCAGAGTGGATGTTTGACATGGTGAAGACTATCGCACCATCAGCCAGAAAACCGAATTTTGCTGGGTGGGCTAACGATATCCGCCTGATGCGTGAACGTGACGGACGTAACCACCGCGACATGTGTGTGCTGTTCCGCTGGGCATGCCAGGACAACTTCTGGTCCGGTAACGTGCTGAGCCCGGCCAAACTCCGCGATAAGTGGACCCAACTCGAAATCAACCGTAACAAGCAACAGGCAGGCGTGACAGCCAGCAAACCAAAACTCGACCTGACAAACACAGACTGGATTTACGGGGTGGATCTATGAAAAACATCGCCGCACAGATGGTTAACTTTGACCGTGAGCAGATGCGTCGGATCGCCAACAACATGCCGGAACAGTACGACGAAAAGCCGCAGGTACAGCAGGTAGCGCAGATCATCAACGGTGTGTTCAGCCAGTTACTGGCAACTTTCCCGGCGAGCCTGGCTAACCGTGACCAGAACGAAGTGAACGAAATCCGTCGCCAGTGGGTTCTGGCTTTTCGGGAAAACGGGATCACCACGATGGAACAGGTTAACGCAGGAATGCGCGTAGCCCGTCGGCAGAATCGACCATTTCTGCCATCACCCGGGCAGTTTGTTGCATGGTGCCGGGAAGAAGCATCCGTTACCGCCGGACTGCCAAACGTCAGCGAGCTGGTTGATATGGTTTACGAGTATTGCCGGAAGCGAGGCCTGTATCCGGATGCGGAGTCTTATCCGTGGAAATCAAACGCGCACTACTGGCTGGTTACCAACCTGTATCAGAACATGCGGGCCAATGCGCTTACTGATGCGGAATTACGCCGTAAGGCCGCAGATGAGCTTGTCCATATGACTGCGAGAATTAACCGTGGTGAGGCGATCCCTGAACCAGTAAAACAACTTCCTGTCATGGGCGGTAGACCTCTAAATCGTGCACAGGCTCTGGCGAAGATCGCAGAAATCAAAGCTAAGTTCGGACTGAAAGGAGCAAGTGTATGACGGGCAAAGAGGCAATTATTCATTACCTGGGGACGCATAATAGCTTCTGTGCGCCGGACGTTGCCGCGCTAACAGGCGCAACAGTAACCAGCATAAATCAGGCCGCGGCTAAAATGGCACGGGCAGGTCTTCTGGTTATCGAAGGTAAGGTCTGGCGAACGGTGTATTACCGGTTTGCTACCAGGGAAGAACGGGAAGGAAAGATGAGCACGAACCTGGTTTTTAAGGAGTGTCGCCAGAGTGCCGCGATGAAACGGGTATTGGCGGTATATGGAGTTAAAAGATGACCATCTACATTACTGAGCTAATAACAGGCCTGCTGGTAATCGCAGGCCTTTTTATTTGGGGGAGAGGGAAGTCATGAAAAAACTAACCTTTGAAATTCGATCTCCAGCACATCAGCAAAACGCTATTCACGCAGTACAGCAAATCCTTCCAGACCCAACCAAACCAATCGTAGTAACCATTCAGGAACGCAACCGCAGCTTAGACCAAAACAGGAAGCTATGGGCCTGCTTAGGTGACGTCTCTCGTCAGGTTGAATGGCATGGTCGCTGGCTGGATGCAGAAAGCTGGAAGTGTGTGTTTACCGCAGCATTAAAGCAGCAGGATGTTGTTCCTAACCTTGCCGGGAATGGCTTTGTGGTAATAGGCCAGTCAACCAGCAGGATGCGTGTAGGCGAATTTGCGGAGCTATTAGAGCTTATACAGGCATTCGGTACAGAGCGTGGCGTTAAGTGGTCAGACGAAGCGAGACTGGCTCTGGAGTGGAAAGCGAGATGGGGAGACAGGGCTGCATGATAAATGTCGTTAGTTTCTCCGGTGGCAGGACGTCAGCATATTTGCTCTGGCTAATGGAGCAAAAGCGACGGGCAGGTAAAGACGTGCATTACGTTTTCATGGATACAGGTTGTGAACATCCAATGACATATCGGTTTGTCAGGGAAGTTGTGAAGTTCTGGGATATACCGCTCACCGTATTGCAGGTTGATATCAACCCGGAGCTTGGACAGCCAAATGGTTATACGGTATGGGAACCAAAGGATATTCAGACGCGAATGCCTGTTCTGAAGCCATTTATCGATATGGTAAAGAAATATGGCACTCCATACGTCGGCGGCGCGTTCTGCACTGACAGATTAAAACTCGTTCCCTTCACCAAATACTGTGATGACCATTTCGGGCGAGGGAATTACACCACGTGGATTGGCATCAGAGCTGATGAACCGAAGCGGCTAAAGCCAAAGCCTGGAATCAGATATCTTGCTGAACTGTCAGACTTTGAGAAGGAAGATATCCTCGCATGGTGGAAGCAACAACCATTCGATTTGCAAATACCGGAACATCTCGGTAACTGCATATTCTGCATTAAAAAATCAACGCAAAAAATCGGACTTGCCTGCAAAGATGAGGAGGGATTGCAGCGTGTTTTTAATGAGGTCATCACGGGATCCCATGTGCGTGACGGACATCGGGAAACGCCAAAGGAGATTATGTACCGAGGAAGAATGTCGCTGGACGGTATCGCGAAAATGTATTCAGAAAATGATTATCAAGCCCTGTATCAGGACATGGTACGAGCTAAAAGATTCGATACCGGCTCTTGTTCTGAGTCATGCGAAATATTTGGAGGGCAGCTTGATTTCGACTTCGGGAGGGAAGCTGCATGATGCGATGTTATCGGTGCGGTGAATGCAAAGAAGATAACCGCTTCCGACCAAATCAACCTTACTGGAATCGATGGTGTCTCCGGTGTGAAAGAACACCAACAGGGGTGTTACCACTACCGCAGGAAAAGGAGGACGTGTGGCGAGACAGCGACGAAGTATCACCGACATAATCTGCGAAAACTGCAAATACCTTCCAACGAAACGCACCAGAAATAAACCCAAGCCAATCCCAAAAGAATCTGACGTAAAAACCTTCAACTACACGGCTCACCTGTGGGATATCCGGTGGCTAAGACGTCGTGCGAGGAAAACAAGGTGATTGACCAAAATCGAAGTTACGAACAAGAAAGCGTCGAGCGAGCTTTAACGTGCGCTAACTGCGGTCAGAAGCTGCATGTGCTGGAAGTTCACGTGTGTGAGCACTGCTGCGCAGAACTGATGAGCGATCCGAATAGCTCGATGCACGAGGAAGAAGATGATGGCTAAACCAGCGCGAAGACGATGTAAAAACGATGAATGCCGGGAATGGTTTCACCCTGCATTCGCTAATCAGTGGTGGTGCTCTCCAGAGTGTGGAACCAAGATAGCACTCGAACGACGAAGTAAAGAACGCGAAAAAGCGGAAAAAGCAGCAGAGAAGAAACGACGACGAGAGGAGCAGAAACAGAAAGATAAACTTAAGATTCGAAAACTCGCCTTAAAGCCCCGCAGTTACTGGATTAAACAAGCCCAACAAGCCGTAAACGCCTTCATCAGAGAAAGAGACCGCGACTTACCATGTATCTCGTGCGGAACGCTCACGTCTGCTCAGTGGGATGCCGGACATTACCGGACAACTGCTGCGGCACCTCAACTCCGATTTAATGAACGCAATATTCACAAGCAATGCGTGGTGTGCAACCAGCACAAAAGCGGAAATCTCGTTCCGTATCGCGTCGAACTGATTAGCCGCATCGGGCAGGAAGCAGTAGACGAAATCGAATCAAACCATAACCGCCATCGCTGGACTATCGAAGAGTGCAAGGCGATCAAGGCAGAGTACCAACAGAAACTCAAAGACCTGCGAAATAGCAGAAGTGAGGCCGCATGACGTTCTCAGTAAAAACCATTCCAGACATGCTCGTTGAAGCATACGGAAATCAGACAGAAGTAGCACGCAGACTGAAATGTAGTCGCGGTACGGTCAGAAAATACGTTGATGATAAAGACGGGAAAATGCACGCCATCGTCAACGACGTTCTCATGGTTCATCGCGGATGGAGTGAAAGAGATGCGCTATTACGAAAAAATTGATGGCAGCAAATACCGAAATATTTGGGTAGTTGGCGATCTGCACGGATGCTACACGAACCTGATGAACAAACTGGATACGATTGGATTCGACAACAAAAAAGACCTGCTTATCTCGGTGGGCGATTTGGTTGATCGTGGTGCAGAGAACGTTGAATGCCTGGAATTAATCACATTCCCCTGGTTCAGAGCTGTACGTGGAAACCATGAGCAAATGATGATTGATGGCTTATCAGAGCGTGGAAACGTTAATCACTGGCTGCTTAATGGCGGTGGCTGGTTCTTTAATCTCGATTACGACAAAGAAATTCTGGCTAAAGCTCTTGCCCATAAAGCAGATGAACTTCCGTTAATCATCGAACTGGTGAGCAAAGATAAAAAATATGTTATCTGCCACGCCGATTATCCCTTTGACGAATACGAGTTTGGAAAGCCAGTTGATCATCAGCAGGTAATCTGGAACCGCGAACGAATCAGCAACTCACAAAACGGGATCGTGAAAGAAATCAAAGGCGCGGACACGTTCATCTTTGGTCATACGCCAGCAGTGAAACCACTCAAGTTTGCCAACCAAATGTATATCGATACCGGCGCAGTGTTCTGCGGAAACCTAACATTGATTCAGGTACAGGGAGAAGGCGCATGAGACTCGAAAGCGTAGCTAAATTTCATTCGCCAAAAAGCCCGATGATGAGCGACTCACCACGGGCCACGGCTTCTGACTCTCTTTCCGGTACTGATGTGATGGCTGCTATGGGGATGGCGCAATCACAAGCCGGATTCGGTATGGCTGCATTCTGCGGTAAGCACGAACTCAGCCAGAACGACAAACAAAAGGCTATCAACTATCTGATGCAATTTGCACACAAGGTATCGGGGAAATACCGTGGTGTGGCAAAGCTTGAAGGAAATACTAAGGCAAAGGTACTGCAAGTGCTCGCAACATTCGCTTATGCGGATTATTGCCGTAGTGCCGCGACGCCGGGGGCAAGATGCAGAGATTGCCATGGTACAGGCCGTGCGGTTGATATTGCCAAAACAGAGCTGTGGGGGAGAGTTGTCGAGAAAGAGTGCGGAAGATGCAAAGGCGTCGGCTATTCAAGGATGCCAGCAAGCGCAGCATATCGCGCTGTGACGATGCTAATCCCAAACCTTACCCAACCCACCTGGTCACGCACTGTTAAGCCGCTGTATGACGCTCTGGTGGTGCAATGCCACAAAGAAGAGTCAATCGCAGACAACATTTTGAATGCGGTCACACGTTAGCAGCATGATTGCCACGGATGGCAACATATTAACGGCATGATATTGACTTATTGAATAAAATTGGGTAAATTTGACTCAACGATGGGTTAATTCGCTCGTTGTGGTAGTGAGATGAAAAGAGGCGGCGCTTACTACCGATTCCGCCTAGTTGGTCACTTCGACGTATCGTCTGGAACTCCAACCATCGCAGGCAGAGAGGTCTGCAAAATGCAATCCCGAAACAGTTCGCAGGTAATAGTTAGAGCCTGCATAACGGTTTCGGGATTTTTTATATCTGCACAACAGGTAAGAGCATTGAGTCGATAATCGTGAAGAGTCGGCGAGCCTGGTTAGCCAGTGCTCTTTCCGTTGTGCTGAATTAAGCGAATACCGGAAGCAGAACCGGATCACCAAATGCGTACAGGCGTCATCGCCGCCCAGCAACAGCACAACCCAAACTGAGCCGTAGCCACTGTCTGTCCTGAATTCATTAGTAATAGTTACGCTGCGGCCTTTTACACATGACCTTCGTGAAAGCGGGTGGCAGGAGGTCGCGCTAACAACCTCCTGCCGTTTTGCCCGTGCATATCGGTCACGAACAAATCTGATTACTAAACACAGTAGCCTGGATTTGTTCTATCAGTAATCGACCTTATTCCTAATTAAATAGAGCAAATCCCCTTATTGGGGGTAAGACATGAAGATGCCAGAAAAACATGACCTGTTGGCCGCCATTCTCGCGGCAAAGGAACAAGGCATCGGGGCAATCCTTGCGTTTGCAATGGCGTACCTTCGCGGCAGATATAATGGCGGTGCGTTTACAAAAACAGTAATCGACGCAACGATGTGCGCCATTATCGCCTGGTTCATTCGTGACCTTCTCGACTTCGCCGGACTAAGTAGCAATCTCGCTTATATAACGAGCGTGTTTATCGGCTACATCGGTACTGACTCGATTGGTTCGCTTATCAAACGCTTCGCTGCTAAAAAAGCCGGAGTAGAAGATGGTAGAAATCAATAATCAACGTAAGGCGTTCCTCGATATGCTGGCGTGGTCGGAGGGAACTGATAACGGACGTCAGAAAACCAGAAATCATGGTTATGACGTCATTGTAGGCGGAGAGCTATTCACTGATTACTCCGATCACCCTCGCAAACTTGTCACGCTAAACCCAAAACTCAAATCAACAGGCGCCGGACGCTACCAGCTTCTTTCCCGTTGGTGGGATGCCTACCGCAAGCAGCTTGGCCTGAAAGACTTCTCTCCGAAAAGTCAGGACGCTGTGGCATTGCAGCAGATTAAGGAGCGTGGCGCTTTACCTATGATTGATCGTGGTGATATCCGTCAGGCAATCGACCGTTGCAGCAATATCTGGGCTTCACTGCCGGGCGCTGGTTATGGTCAGTTCGAGCATAAGGCTGACAGCCTGATTGCAAAATTCAAAGAAGCGGGCGGAACGGTCAGAGAGATTGATGTATGAGCAGAGTCACCGCGATTATCTCCGCTCTGGTTATCTGCATCATCGTCTGCCTGTCATGGGCTGTTAATCATTACCGTGATAACGCCATTACCTACAAAGCCCAGCGCGACAAAAATGCCAGAGAACTGAAGCTGGCGAACGCGGCAATTACTGACATGCAGATGCGTCAGCGTGATGTTGCTGCGCTCGATGCAAAATACACGAAGGAGTTAGCTGATGCTAAAGCTGAAAATGATGCTCTGCGTGATGATGTTGCCGCTGGTCGTCGTCGGTTGCACATCAAAGCAGTCTGTCAGTCAGTGCGTGAAGCCACCACCGCCTCCGGCGTGGATAATGCAGCCTCCCCCCGACTGGCAGACACCGCTGAACGGGATTATTTCACCCTCAGAGAGAGGCTGATCACTATGCAAAAACAACTGGAAGGAACCCAGAAGTATATTAATGAGCAGTGCAGATAGAGTTGCCCATATCGATGGGCAACTCATGCAATTATTGTGAGCAATACACACGCGCTTCCAGCGGAGTATAAATGCCTAAAGTAATAAAACCGAGCAATCCATTTACGAATGTTTGCTGGGTTTCTGTTTTAACAACATTTTCTGCGCCGCCACAAATTTTGGCTGCATCGACAGTTTTCTTCTGCCCAATTCCAGAAACGAAGAAATGATGGGTGATGGTTTCCTTTGGTGCTACTGCTGCCGGTTTGTTTTGAACAGTAAACGTCTGTTGAGCACATCCTGTAATAAGCAGGGCCAGCGCAGTAGCGAGTAGCATTTTTTTCATGGTGTTATTCCCGATGCTTTTTGAAGTTCGCAGAATCGTATGTGTAGAAAATTAAACAAACCCTAAACAATGAGTTGAAATTTCATATTGTTAATATTTATTAATGTATGTCAGGTGCGATGAATCGTCATTGTATTCCCGGATTAACTATGTCCACAGCCCTGACGGGGAACTTCTCTGCGGGAGTGTCCGGGAATAATTAAAACGATGCACACAGGGTTTAGCGCGTACACGTATTGCATTATGCCAACGCCCCGGTGCTGACACGGAAGAAACCGGACGTTATGATTTAGCGTGGAAAGATTTGTGTAGTGTTCTGAATGCTCTCAGTAAATAGTAATGAATTATCAAAGGTATAGTAATATCTTTTATGTTCATGGATATTTGTAACCCATCGGAAAACTCCTGCTTTAGCAAGATTTTCCCTGTATTGCTGAAATGTGATTTCTCTTGATTTCAACCTATCATAGGACGTTTCTATAAGATGCGTGTTTCTTGAGAATTTAACATTTACAACCTTTTTAAGTCCTTTTATTAACACGGTGTTATCGTTTTCTAACACGATGTGAATATTATCTGTGGCTAGATAGTAAATATAATGTGAGACGTTGTGACGTTTTAGTTCAGAATAAAACAATTCACAGTCTAAATCTTTTCGCACTTGATCGAATATTTCTTTAAAAATGGCAACCTGAGCCATTGGTAAAACCTTCCATGTGATACGAGGGCGCGTAGTTTGCATTATCGTTTTTATCGTTTCAATCTGGTCTGACCTCCTTGTGTTTTGTTGATGATTTATGTCAAATATTAGGAATGTTTTCACTTAATAGTATTGGTTGCGTAACAAAGTGCGGTCCTGCTGGCATTCTGGAGGGAAATACAACCGACAGATGTATGTAAGGCCAACGTGCTCAAATCTTCATACAGAAAGATTTGAAGTAATATTTTAACCGCTAGATGAAGAGCAAGCGCATGGAGCGACAAAATGAATAAAGAACAATCTGCTGATGATCCCTCCGTGGATCTGATTCGTGTAAAAAATATGCTTAATAGCACCATTTCTATGAGTTACCCTGATGTTGTAATTGCATGTATAGAACATAAGGTGTCTCTGGAAGCATTCAGAGCAATTGAGGCAGCGTTGGTGAAGCACGATAATAATATGAAGGATTATTCCCTGGTGGTTGACTGATCACCATAACTGCTAATCATTCAAACTATTTAGTCTGTGACAGAGCCAACACGCAGTCTGTCACTGTCAGGAAAGTGGTAAAACTGCAACTCAATTACTGCAATGCCCTCGTAATTAAGTGAATTTACAATATCGTCCTGTTCGGAGGGAAGAACGCGGGATGTTCATTCTTCATCACTTTTAATTGATGTATATGCTCTCTTTTCTGACGTTAGTCTCCGACGGCAGGCTTCAATGACCCAGGCTGAGAAATTCCCGGACCCTTTTTGCTCAAGAGCGATGTTAATTTGTTCAATCATTTGGTTAGGAAAGCGGATGTTGCGGGTTGTTGTTCTGCGGGTTCTGTTCTTCGTTGACATGAGGTTGCCCCGTATTCAGTGTCGCTGATTTGTATTGTCTGAAGTTGTTTTTACGTTAAGTTGATGCAGATCAATTAATACGATACCTGCGTCATAATTGATTATTTGACGTGGTTTGATGGCCTCCACGCACGTTGTGATATGTAGATGATAATCATTATCACTTTACGGGTCCTTTCCGGTGATCCGACAGGTTACGGGGCGGCGACCTCGCGGGTTTTCGCTATTTATGAAAATTTTCCGGTTTAAGGCGTTTCCGTTCTTCTTCGTCATAACTTAATGTTTTTATTTAAAATACCCTCTGAAAAGAAAGGAAACGACAGGTGCTGAAAGCGAGCTTTTTGGCCTCTGTCGTTTCCTTTCTCTGTTTTTGTCCGTGGAATGAACAATGGAAGTCAACAAAAAGCAGCTGGCTGACATTTTCGGTGCGAGTATCCGTACCATTCAGAACTGGCAGGAACAGGGAATGCCCGTTCTGCGAGGCGGTGGCAAGGGTAATGAGGTGCTTTATGACTCTGCCGCCGTCATAAAATGGTATGCCGAAAGGGATGCTGAAATTGAGAACGAAAAGCTGCGCCGGGAGGTTGAAGAACTGCGGCAGGCCAGCGAGGCAGATCTCCAGCCAGGAACTATTGAGTACGAACGCCATCGACTTACGCGTGCGCAGGCCGACGCACAGGAACTGAAGAATGCCAGAGACTCCGCTGAAGTGGTGGAAACCGCATTCTGTACTTTCGTGTTGTCGCGGATCGCAGGTGAAATTGCCAGTATTCTCGACGGGCTCCCCCTGTCGGTGCAGCGGCGTTTTCCGGAACTGGAAAACCGACATGTTGATTTCCTGAAACGGGATATCATCAAAGCCATGAACAAAGCAGCCGCGCTGGATGAACTGATACCGGGGTTGCTGAGTGAATATATCGAACAGTCAGGTTAACAGGCTGCGGCATTTTGTCCGCGCCGGGCTTCGCTCACTGTTCAGGCCGGAGCCACAGACCGCCGTTGAATGGGCGGATGCTAATTACTATCTCCCGAAAGAATCCGCATACCAGGAAGGGCGCTGGGAAACACTGCCCTTTCAGCGGGCCATCATGAATGCGATGGGCAGCGACTACATCCGTGAGGTGAATGTGGTGAAGTCTGCCCGTGTCGGTTATTCCAAAATGCTGCTGGGTGTTTATGCCTACTTTATAGAGCATAAGCAGCGCAACACCCTTATCTGGTTGCCGACGGATGGTGATGCCGAGAACTTTATGAAAACCCACGTTGAGCCGACTATTCGTGATATTCCGTCGCTGCTGGCGCTGGCCCCGTGGTATGGCAAAAAGCACCGGGATAACACGCTCACCATGAAGCGTTTCACTAATGGGCGTGGCTTCTGGTGCCTGGGCGGTAAAGCGGCAAAAAACTACCGTGAAAAGTCGGTGGATGTGGCGGGTTATGATGAACTTGCTGCTTTTGATGATGATATTGAACAGGAAGGCTCTCCGACGTTCCTGGGTGACAAGCGTATTGAAGGCTCGGTCTGGCCAAAGTCCATCCGTGGCTCCACGCCAAAAGTGAGAGGCACCTGTCAGATTGAGCGTGCAGCCAGTGAATCCCCGCATTTTATGCGTTTTCATGTTGCCTGCCCGCATTGCGGGGAGGAGCAGTATCTTAAATTTGGCGACAAAGAGACGCCGTTTGGCCTCAAATGGACGCCGGATGACCCCTCCAGCGTGTTTTATCTCTGCGAGCATAATGCCTGCGTCATCCGCCAGCAGGAGCTGGACTTTACTGATGCCCGTTATATCTGCGAAAAGACCGGGATCTGGACCCGTGATGGCATTCTCTGGTTTTCGTCATCCGGTGAAGAGATTGAGCCACCTGACAGTGTGACCTTTCACATCTGGACAGCGTACAGCCCGTTCACCACCTGGGTGCAGATTGTCAAAGACTGGATGAAAACGAAAGGGGATACGGGAAAACGTAAAACCTTCGTAAACACCACGCTCGGTGAGACGTGGGAGGCGAAAATTGGCGAACGTCCGGATGCTGAAGTGATGGCAGAGCGGAAAGAGCATTATTCAGCGCCCGTTCCTGACCGTGTGGCTTACCTGACCGCCGGTATCGACTCCCAGCTGGACCGCTACGAAATGCGCGTATGGGGATGGGGGCCGGGTGAGGAAAGCTGGCTGATTGACCGGCAGATTATTATGGGCCGCCACGACGATGAACAGACGCTGCTGCGTGTGGATGAGGCCATCAATAAAACCTATACCCGCCGGAATGGTGCAGAAATGTCGATATCCCGTATCTGCTGGGATACTGGCGGGATTGACCCGACCATTGTGTATGAACGCTCGAAAAAACATGGGCTGTTCCGGGTGATCCCCATTAAAGGGGCATCCGTCTACGGAAAGCCGGTGGCCAGCATGCCACGTAAGCGAAACAAAAACGGGGTTTACCTTACCGAAATCGGTACGGATACCGCGAAAGAGCAGATTTATAACCGCTTCACACTGACGCCGGAAGGGGATGAACCGCTTCCCGGTGCCGTTCACTTCCCGAATAACCCGGATATTTTTGATCTGACCGAAGCGCAGCAGCTGACTGCTGAAGAGCAGGTCGAAAAATGGGTGGATGGCAGGAAAAAAATACTGTGGGACAGCAAAAAGCGACGCAATGAGGCACTCGACTGCTTCGTTTATGCGCTGGCGGCGCTGCGCATCAGTATTTCCCGCTGGCAGCTGGATCTCAGTGCGCTGCTGGCGAGCCTGCAGGAAGAGGATGGTGCAGCAACCAACAAGAAAACACTGGCAGATTACGCCCGTGCCTTATCCGGAGAGGATGAATGACGCGACAGGAAGAACTTGCCGCTGCCCGTGCGGCACTGCATGACCTGATGACAGGTAAACGGGTGGCAACAGTACAGAAAGACGGACGAAGGGTGGAGTTTACGGCCACTTCCGTGTCTGACCTGAAAAAATATATTGCAGAGCTGGAAGTGCAGACCGGCATGACACAGCGACGCAGGGGACCTGCAGGATTTTATGTATGAAAACGCCCACCATTCCCACCCTTCTGGGGCCGGACGGCATGACATCGCTGCGCGAATATGCCGGTTATCACGGCGGTGGCAGCGGATTTGGAGGGCAGTTGCGGTCGTGGAACCCACCGAGTGAAAGTGTGGATGCAGCCCTGTTGCCCAACTTTACCCGTGGCAATGCCCGCGCAGACGATCTGGTACGCAATAACGGCTATGCCGCCAACGCCATCCAGCTGCATCAGGATCATATCGTCGGGTCTTTTTTCCGGCTCAGTCATCGCCCAAGCTGGCGCTATCTGGGCATCGGGGAGGAAGAAGCCCGTGCCTTTTCCCGCGAGGTTGAAGCGGCATGGAAAGAGTTTGCCGAGGATGACTGCTGCTGCATTGACGTTGAGCGAAAACGCACGTTTACCATGATGATTCGGGAAGGTGTGGCCATGCACGCCTTTAACGGTGAACTGTTCGTTCAGGCCACCTGGGATACCAGTTCGTCGCGGCTTTTCCGGACACAGTTCCGGATGGTCAGCCCGAAGCGCATCAGCAACCCGAACAATACCGGCGACAGCCGGAACTGCCGTGCCGGTGTGCAGATTAATGACAGCGGTGCGGCGCTGGGATATTACGTCAGCGAGGACGGGTATCCTGGCTGGATGCCGCAGAAATGGACATGGATACCCCGTGAGTTACCCGGCGGGCGCGCCTCGTTCATTCACGTTTTTGAACCCGTGGAGGACGGGCAGACTCGCGGTGCAAATGTGTTTTACAGCGTGATGGAGCAGATGAAGATGCTCGACACGCTGCAGAACACGCAGCTGCAGAGCGCCATTGTGAAGGCGATGTATGCCGCCACCATTGAGAGTGAGCTGGATACGCAGTCAGCGATGGATTTTATTCTGGGCGCGAACAGTCAGGAGCAGCGGGAAAGGCTGACCGGCTGGATTGGTGAAATTGCCGCGTATTACGCCGCAGCGCCGGTCCGGCTGGGAGGCGCAAAAGTACCGCACCTGATGCCGGGTGACTCACTGAACCTGCAGACGGCTCAGGATACGGATAACGGCTACTCCGTGTTTGAGCAGTCACTGCTGCGGTATATCGCTGCCGGGCTGGGTGTCTCGTATGAGCAGCTTTCCCGGAATTACGCCCAGATGAGCTACTCCACGGCACGGGCCAGTGCGAACGAGTCGTGGGCGTACTTTATGGGGCGGCGAAAATTCGTCGCATCCCGTCAGGCGAGCCAGATGTTTCTGTGCTGGCTGGAAGAGGCCATCGTTCGCCGCGTGGTGACGTTACCTTCAAAAGCGCGCTTCAGTTTTCAGGAAGCCCGCAGTGCCTGGGGGAACTGCGACTGGATAGGCTCCGGTCGTATGGCCATCGATGGTCTGAAAGAAGTTCAGGAAGCGGTGATGCTGATAGAAGCCGGACTGAGTACCTACGAGAAAGAGTGCGCAAAACGCGGTGACGACTATCAGGAAATTTTTGCCCAGCAGGTCCGTGAAACGATGGAGCGCCGTGCAGCCGGTCTTAAACCGCCCGCCTGGGCGGCTGCAGCATTTGAATCCGGGCTGCGACAATCAACAGAGGAGGAGAAGAGTGACAGCAGAGCTGCGTAATCTCCCGCATATTGCCAGCATGGCCTTTAATGAGCCGCTGATGCTTGAACCCGCCTATGCGCGGGTTTTCTTTTGTGCGCTTGCAGGCCAGCTTGGGATCAGCAGCCTGACGGATGCGGTGTCCGGCGACAGCCTGACTGCCCAGGAGGCACTCGCGACGCTGGCATTATCCGGTGATGATGACGGACCACGACAGGCCCGCAGTTATCAGGTCATGAACGGCATCGCCGTGCTGCCGGTGTCCGGCACGCTGGTCAGCCGGACGCGGGCGCTGCAGCCGTACTCGGGGATGACCGGTTACAACGGCATTATCGCCCGTCTGCAACAGGCTGCCAGCGATCCGATGGTGGACGGCATTCTGCTCGATATGGACACGCCCGGCGGGATGGTGGCGGGGGCATTTGACTGCGCTGACATCATCGCCCGTGTGCGTGACATAAAACCGGTATGGGCGCTTGCCAACGACATGAACTGCAGTGCAGGTCAGTTGCTTGCCAGTGCCGCCTCCCGGCGTCTGGTCACGCAGACCGCCCGGACAGGCTCCATCGGCGTCATGATGGCTCACAGTAATTACGGTGCTGCGCTGGAGAAACAGGGTGTGGAAATCACGCTGATTTACAGCGGCAGCCATAAGGTGGATGGCAACCCCTACAGCCATCTTCCGGATGACGTCCGGGAGACACTGCAGTCCCGGATGGACGCAACCCGCCAGATGTTTGCGCAGAAGGTGTCGGCATATACCGGCCTGTCCGTGCAGGTTGTGCTGGATACCGAGGCTGCAGTGTACAGCGGTCAGGAGGCCATTGATGCCGGACTGGCTGATGAACTTGTTAACAGCACCGATGCGATCACCGTCATGCGTGATGCACTGGATGCACGTAAATCCCGTCTCTCAGGAGGGCGAATGACCAAAGAGACTCAATCAACAACTGTTTCAGCCACTGCTTCGCAGGCTGACGTTACTGACGTGGTGCCAGCGACGGAGGGCGAGAACGCCAGCGCGGCGCAGCCGGACGTGAACGCGCAGATCACCGCAGCGGTTGCGGCAGAAAACAGCCGCATTATGGGGATCCTCAACTGTGAGGAGGCTCACGGACGCGAAGAACAGGCACGCGTGCTGGCAGAAACCCCCGGTATGACCGTGAAAACGGCCCGCCGCATTCTGGCCGCAGCACCACAGAGTGCACAGGCGCGCAGTGACACTGCGCTGGATCGTCTGATGCAGGGGGCACCGGCACCGCTGGCTGCAGGTAACCCGGCATCTGATGCCGTTAACGATTTGCTGAACACACCAGTGTAAGGGATGTTTATGACGAGCAAAGAAACCTTTACCCATTACCAGCCGCAGGGCAACAGTGACCCGGCTCATACCGCAACCGCGCCCGGCGGATTGAGTGCGAAAGCGCCTGCAATGACCCCGCTGATGCTGGACACCTCCAGCCGTAAGCTGGTTGCGTGGGATGGCACCACCGACGGTGCTGCCGTTGGCATTCTTGCGGTTGCTGCTGACCAGACCAGCACCACGCTGACGTTCTACAAGTCCGGCACGTTCCGTTATGAGGATGTGCTCTGGCCGGAGGCTGCCAGCGACGAGACGAAAAAACGGACCGCGTTTGCCGGAACGGCAATCAGCATCGTTTAACTTTACCCTTCATCACTAAAGGCCGCCTGTGCGGCTTTTTTTACGGGATTTTTTTATGTCGATGTACACAACCGCCCAACTGCTGGCGGCAAATGAGCAGAAATTTAAGTTTGATCCGCTGTTTCTGCGTCTCTTTTTCCGTGAGAGCTATCCCTTCACCACGGAGAAAGTCTATCTCTCACAAATTCCGGGACTGGTAAACATGGCGCTGTACGTTTCGCCGATTGTTTCCGGTGAGGTTATCCGTTCCCGTGGCGGCTCCACCTCTGAATTTACGCCGGGATATGTCAAGCCGAAGCATGAAGTGAATCCGCAGATGACCCTGCGTCGCCTGCCGGATGAAGATCCGCAGAATCTGGCGGACCCGGCTTACCGCCGCCGTCGCATCATCATGCAGAACATGCGTGACGAAGAGCTGGCCATTGCTCAGGTCGAAGAGATGCAGGCAGTTTCTGCCGTGCTTAAGGGCAAATACACCATGACCGGTGAAGCCTTCGATCCGGTTGAGGTGGATATGGGCCGCAGTGAGGAGAATAACATCACGCAGTCCGGCGGCACGGAGTGGAGCAAGCGTGACAAGTCCACGTATGACCCGACCGACGATATCGAAGCCTACGCGCTGAACGCCAGCGGTGTGGTGAATATCATCGTGTTCGATCCGAAAGGCTGGGCGCTGTTCCGTTCCTTCAAAGCCGTCAAGGAGAAGCTGGATACCCGTCGTGGCTCTAATTCCGAGCTGGAGACAGCGGTGAAAGACCTGGGCAAAGCGGTGTCCTATAAGGGGATGTATGGCGATGTGGCCATCGTCGTGTATTCCGGACAGTACGTGGAAAACGGCGTCAAAAAGAACTTCCTGCCGGACAACACGATGGTGCTGGGGAACACTCAGGCACGCGGTCTGCGCACCTATGGCTGCATTCAGGATGCGGACGCACAGCGCGAAGGCATTAACGCCTCTGCCCGTTACCCGAAAAACTGGGTGACCACCGGCGATCCGGCGCGTGAGTTCACCATGATTCAGTCAGCACCGCTGATGCTGCTGGCTGACCCTGATGAGTTCGTGTCCGTACAACTGGCGTAATCATGGCCCTTCGGGGCCATTGTTTCTCTGTGGAGGAGTCCATGACGAAAGATGAACTGATTGCCCGTCTCCGCTCGCTGGGTGAACAACTGAACCGTGATGTCAGCCTGACGGGGACGAAAGAAGAACTGGCGCTCCGTGTGGCAGAGCTGAAAGAGGAGCTTGATGACACGGATGAAACTGCCGGTCAGGACACCCCTCTCAGCCGGGAAAATGTGCTGACCGGACATGAAAATGAGGTGGGATCAGCGCAGCCGGATACCGTGATTCTGGATACGTCTGAACTGGTCACGGTCGTGGCACTGGTGAAGCTGCATACTGATGCACTTCACGCCACGCGGGATGAACCTGTGGCATTTGTGCTGCCGGGAACGGCGTTTCGTGTCTCTGCCGGTGTGGCAGCCGAAATGACAGAGCGCGGCCTGGCCAGAATGCAATAACGGGAGGCGCTGTGGCTGATTTCGATAACCTGTTCGATGCTGCCATTGCCCGCGCCGATGAAACGATACGCGGGTACATGGGAACGTCAGCCACCATTACATCCGGTGAGCAGTCAGGTGCGGTGATACGTGGTGTTTTTGATGACCCTGAAAATATCAGCTATGCCGGACAGGGCGTGCGCGTTGAAGGCTCCAGCCCGTCCCTGTTTGTCCGGACTGATGAGGTGCGGCAGCTGCGGCGTGGAGACACGCTGACCATCGGTGAGGAAAATTTCTGGGTAGATCGGGTTTCGCCGGATGATGGCGGAAGTTGTCATCTCTGGCTTGGACGGGGCGTACCGCCTGCCGTTAACCGTCGCCGCTGAAAGGGGGATGTATGGCCATAAAAGGTCTTGAGCAGGCCGTTGAAAACCTCAGCCGTATCAGCAAAACGGCGGTGCCTGGTGCCGCCGCAATGGCCATTAACCGCGTTGCTTCATCCGCGATATCGCAGTCGGCGTCACAGGTTGCCCGTGAGACAAAGGTACGCCGGAAACTGGTAAAGGAAAGGGCCAGGCTGAAAAGGGCCACGGTCAAAAATCCGCAGGCCAGAATCAAAGTTAACCGGGGGGATTTGCCCGTAATCAAGCTGGGTAATGCGCGGGTTGTCCTTTCGCGCCGCAGGCGTCGTAAAAAGGGGCAGCGTTCATCCCTGAAAGGTGGCGGCAGCGTGCTTGTGGTGGGTAACCGTCGTATTCCCGGCGCGTTTATTCAGCAACTGAAAAATGGCCGGTGGCATGTCATGCAGCGTGTGGCTGGGAAAAACCGTTACCCCATTGATGTGGTGAAAATCCCGATGGCGGTGCCGCTGACCACGGCGTTTAAACAAAATATTGAGCGGATACGGCGTGAACGTCTTCCGAAAGAGCTGGGCTATGCGCTGCAGCATCAACTGAGGATGGTAATAAAGCGATGAAACATACTGAACTCCGTGCAGCCGTACTGGATGCACTGGAGAAGCATGACACCGGGGCGACGTTTTTTGATGGTCGCCCCGCTGTTTTTGATGAGGCGGATTTTCCGGCAGTTGCCGTTTATCTCACCGGCGCTGAATACACGGGCGAAGAGCTGGACAGCGATACCTGGCAGGCGGAGCTGCATATCGAAGTTTTCCTGCCTGCTCAGGTGCCGGATTCAGAGCTGGATGCGTGGATGGAGTCCCGGATTTATCCGGTGATGAGCGATATCCCGGCACTGTCAGATTTGATCACCAGTATGGTGGCCAGCGGCTATGACTACCGGCGCGACGATGATGCGGGCTTGTGGAGTTCAGCCGATCTGACTTATGTCATTACCTATGAAATGTGAGGACGCTATGCCTGTACCAAATCCTACAATGCCGGTGAAAGGTGCCGGGACCACCCTGTGGGTTTATAAGGGGAGCGGTGACCCTTACGCGAATCCGCTTTCAGACGTTGACTGGTCGCGTCTGGCAAAAGTTAAAGACCTGACGCCCGGCGAACTGACCGCTGAGTCCTATGACGACAGCTATCTCGATGATGAAGATGCAGACTGGACTGCGACCGGGCAGGGGCAGAAATCTGCCGGAGATACCAGCTTCACGCTGGCGTGGATGCCCGGAGAGCAGGGGCAGCAGGCGCTGCTGGCGTGGTTTAATGAAGGCGATACCCGTGCCTATAAAATCCGCTTCCCGAACGGCACGGTCGATGTGTTCCGTGGCTGGGTCAGCAGTATCGGTAAGGCGGTGACGGCGAAGGAAGTGATCACCCGCACGGTGAAAGTCACCAATGTGGGACGTCCGTCGATGGCAGAAGATCGCAGCACGGTAACAGCGGCAACCGGCATGACCGTGACGCCTGCCAGCACCTCGGTGGTGAAAGGGCAGAGCACCACGCTGACCGTGGCCTTCCAGCCGGAGGGCGTAACCGACAAGAGCTTTCGTGCGGTGTCTGCGGATAAAACAAAAGCCACCGTGTCGGTCAGTGGTATGACCATCACCGTGAACGGCGTTGCTGCAGGCAAGGTCAACATTCCGGTTGTATCCGGTAATGGTGAGTTTGCTGCGGTTGCAGAAATTACCGTCACCGCCAGTTAATCCGGAGAGTCAGCGATGTTCCTGAAAACCGAATCATTTGAACATAACGGTGTGACCGTCACGCTTTCTGAACTGTCAGCCCTGCAGCGCATTGAGCATCTCGCCCTGATGAAACGGCAGGCAGAACAGGCGGAGTCAGACAGCAACCGGAAGTTTACTGTGGAAGACGCCATCAGAACCGGCGCGTTTCTGGTGGCGATGTCCCTGTGGCATAACCATCCGCAGAAGACGCAGATGCCGTCCATGAATGAAGCCGTTAAACAGATTGAGCAGGAAGTGCTTACCACCTGGCCCACGGAGGCAATTTCTCATGCTGAAAACGTGGTGTACCGGCTGTCTGGTATGTATGAGTTTGTGGTGAATAATGCCCCTGAACAGACAGAGGACGCCGGGCCCGCAGAGCCTGTTTCTGCGGGAAAGTGTTCGACGGTGAGCTGAGTTTTGCCCTGAAACTGGCGCGTGAGATGGGGCGACCCGACTGGCGTGCCATGCTTGCCGGGATGTCATCCACGGAGTATGCCGACTGGCACCGCTTTTACAGTACCCATTATTTTCATGATGTTCTGCTGGATATGCACTTTTCCGGGCTGACGTACACCGTGCTCAGCCTGTTTTTCAGCGATCCGGATATGCATCCGCTGGATTTCAGTCTGCTGAACCGGCGCGAGGCTGACGAAGAGCCTGAAGATGATGTGCTGATGCAGAAAGCGGCAGGGCTTGCCGGAGGTGTCCGCTTTGGCCCGGACGGGAATGAAGTTATCCCCGCTTCCCCGGATGTGGCGGACATGACGGAGGATGACGTAATGCTGATGACAGTATCAGAAGGGATCGCAGGAGGAGTCCGGTATGGCTGAACCGGTAGGCGATCTGGTCGTTGATTTGAGTCTGGATGCGGCCAGATTTGACGAGCAGATGGCCAGAGTCAGGCGTCATTTTTCTGGTACGGAAAGTGATGCGAAAAAAACAGCGGCAGTCGTTGAACAGTCGCTGAGCCGACAGGCGCTGGCTGCACAGAAAGCGGGGATTTCCGTCGGGCAGTATAAAGCCGCCATGCGTATGCTGCCTGCACAGTTCACCGACGTGGCCACGCAGCTTGCAGGCGGGCAAAGTCCGTGGCTGATCCTGCTGCAACAGGGGGGGCAGGTGAAGGACTCCTTCGGCGGGATGATCCCCATGTTCAGGGGGCTTGCCGGTGCGATCACCCTGCCGATGGTGGGGGCCACCTCGCTGGCGGTGGCGACCGGTGCGCTGGCGTATGCCTGGTATCAGGGCAACTCAACCCTGTCCGATTTCAACAAAACGCTGGTCCTTTCCGGCAATCAGGCGGGACTGACGGCAGATCGTATGCTGGTCCTGTCCAGAGCCGGGCAGGCGGCAGGGCTGACGTTTAACCAGACCAGCGAGTCACTCAGCGCACTGGTTAAGGCGGGGGTAAGCGGTGAGGCTCAGATTGCGTCCATCAGCCAGAGTGTGGCGCGTTTCTCCTCTGCATCCGGCGTGGAGGTGGACAAGGTCGCTGAAGCCTTCGGGAAGCTGACCACAGACCCGACGTCGGGGCTGACGGCGATGGCTCGCCAGTTCCATAACGTGTCGGCGGAGCAGATTGCGTATGTTGCTCAGTTGCAGCGTTCCGGCGATGAAGCCGGGGCATTGCAGGCGGCGAACGAGGCCGCAACGAAAGGGTTTGATGACCAGACCCGCCGCCTGAAAGAGAACATGGGCACGCTGGAGACCTGGGCAGACAGGACTGCGCGGGCATTCAAATCCATGTGGGATGCGGTGCTGGATATTGGTCGTCCTGATACCGCGCAGGAGATGCTGATTAAGGCAGAGGCTGCGTATAAGAAAGCAGACGACATCTGGAATCTGCGCAAGGATGATTATTTTGTTAACGATGAAGCGCGGGCGCGTTACTGGGATGATCGTGAAAAGGCCCGTCTTGCGCTTGAAGCCGCCCGAAAGAAGGCTGAGCAGCAGACTCAACAGGACAAAAATGCGCAGCAGCAGAGCGATACCGAAGCGTCACGGCTGAAATATACCGAAGAGGCGCAGAAGGCTTACGAACGGCTGCAGACGCCGCTGGAGAAATATACCGCCCGTCAGGAAGAACTGAACAAGGCACTGAAAGACGGGAAAATCCTGCAGGCGGATTACAACACGCTGATGGCGGCGGCGAAAAAGGATTATGAAGCGACGCTGAAAAAGCCGAAACAGTCCAGCGTGAAGGTGTCTGCGGGCGATCGTCAGGAAGACAGTGCTCATGCTGCCCTGCTGACGCTTCAGGCAGAACTCCGGACGCTGGAGAAGCATGCCGGAGCAAATGAGAAAATCAGCCAGCAGCGCCGGGATTTGTGGAAGGCGGAGAGTCAGTTCGCGGTACTGGAGGAGGCGGCGCAACGTCGCCAGCTGTCTGCACAGGAGAAATCCCTGCTGGCGCATAAAGATGAGACGCTGGAGTACAAACGCCAGCTGGCTGCACTTGGCGACAAGGTTACGTATCAGGAGCGCCTGAACGCGCTGGCGCAGCAGGCGGATAAATTCGCACAGCAGCAACGGGCAAAACGGGCCGCCATTGATGCGAAAAGCCGGGGGCTGACTGACCGGCAGGCAGAACGGGAAGCCACGGAACAGCGCCTGAAGGAACAGTATGGCGATAATCCGCTGGCGCTGAATAACGTCATGTCAGAGCAGAAAAAGACCTGGGCGGCTGAAGACCAGCTTCGCGGGAACTGGATGGCAGGCCTGAAGTCCGGCTGGAGTGAGTGGGAAGAGAGCGCCACGGACAGTATGTCGCAGGTAAAAAGTGCAGCCACGCAGACCTTTGATGGTATTGCACAGAATATGGCGGCGATGCTGACCGGCAGTGAGCAGAACTGGCGCAGCTTCACCCGTTCCGTGCTGTCCATGATGACAGAAATTCTGCTTAAGCAGGCAATGGTGGGGATTGTCGGGAGTATCGGCAGCGCCATTGGCGGGGCTGTTGGTGGCGGCGCATCCGCGTCAGGCGGTACAGCCATTCAGGCCGCTGCGGCGAAATTCCATTTTGCAACCGGAGGATTTACGGGAACCGGCGGCAAATATGAGCCAGCGGGGATTGTTCACCGTGGTGAGTTTGTCTTCACGAAGGAGGCAACCAGCCGGATTGGCGTGGGGAATCTTTACCGGCTGATGCGCGGCTATGCCACCGGCGGTTATGTCGGTACACCGGGCAGCATGGCAGACAGCCGGTCGCAGGCGTCCGGGACGTTTGAGCAGAATAACCATGTGGTGATTAACAACGACGGCACGAACGGGCAGATAGGTCCGGCTGCTCTGAAGGCGGTGTATGACATGGCCCGCAAGGGTGCCCGTGATGAAATTCAGACACAGATGCGTGATGGTGGCCTGTTCTCCGGAGGTGGACGATGAAGACCTTCCGCTGGAAAGTGAAACCCGGTATGGATGTGGCTTCGGTCCCTTCTGTAAGAAAGGTGCGCTTTGGTGATGGCTATTCTCAGCGAGCGCCTGCCGGGCTGAATGCCAACCTGAAAACGTACAGCGTGACGCTTTCTGTCCCCCGTGAGGAGGCCACGGTACTGGAGTCGTTTCTGGAAGAGCACGGGGGCTGGAAATCCTTTCTGTGGACGCCGCCTTATGAGTGGCGGCAGATAAAGGTGACCTGCGCAAAATGGTCGTCGCGGGTCAGTATGCTGCGTGTTGAGTTCAGCGCAGAGTTTGAACAGGTGGTGAACTGATGCAGGATATCCGGCAGGAAACACTGAATGAATGCACCCGTGCGGAGCAGTCGGCCAGCGTGGTGCTCTGGGAAATCGACCTGACAGAGGTCGGTGGAGAACGTTATTTTTTCTGTAATGAGCAGAACGAAAAAGGTGAGCCGGTCACCTGGCAGGGGCGACAGTATCAGCCGTATCCCATTCAGGGGAGCGGTTTTGAACTGAATGGCAAAGGCACCAGTACGCGCCCCACGCTGACGGTTTCTAACCTGTACGGTATGGTCACCGGGATGGCGGAAGATATGCAGAGTCTGGTCGGCGGAACGGTGGTCCGGCGTAAGGTTTACGCCCGTTTTCTGGATGCGGTGAACTTCGTCAACGGAAACAGTTACGCCGATCCGGAGCAGGAGGTGATCAGCCGCTGGCGCATTGAGCAGTGCAGCGAACTGAGCGCGGTGAGTGCCTCCTTTGTACTGTCCACGCCGACGGAAACGGATGGCGCTGTTTTTCCGGGACGTATCATGCTGGCCAACACCTGCACCTGGACCTATCGCGGTGACGAGTGCGGTTATAGCGGTCCGGCTGTCGCGGATGAATATGACCAGCCAACGTCCGATATCACGAAGGATAAATGCAGCAAATGCCTGAGCGGTTGTAAGTTCCGCAATAACGTCGGCAACTTTGGCGGCTTCCTTTCCATTAACAAACTTTCGCAGTAAATCCCATGACACAGACAGAATCAGCGATTCTGGCGCACGCCCGGCGATGTGCGCCAGCGGAGTCGTGCGGCTTCGTGGTAAGCACGCCGGAGGGGGAAAGATATTTCCCCTGCGTGAATATCTCCGGTGAGCCGGAGGCGTATTTCCGTATGTCGCCGGAAGACTGGCTGCAGGCAGAAATGCAGGGTGAGATTGTGGCGCTGGTCCACAGCCACCCCGGTGGTCTGCCCTGGCTGAGTGAGGCCGACCGGCGGCTGCAGGTGCAGAGTGATTTGCCGTGGTGGCTGGTCTGCCGGGGGACGATTCATAAGTTCCGCTGTGTGCCGCATCTCACCGGGCGGCGCTTTGAGCACGGTGTGACGGACTGTTACACACTGTTCCGGGATGCTTATCATCTGGCGGGGATTGAGATGCCGGACTTTCATCGTGAGGATGACTGGTGGCGTAACGGCCAGAATCTCTATCTGGATAATCTGGAGGCGACGGGGCTGTATCAGGTGCCGTTGTCAGCGGCACAGCCGGGCGATGTGCTGCTGTGCTGTTTTGGTTCATCAGTGCCGAATCACGCCGCAATTTACTGCGGCGACGGCGAGCTGCTGCACCATATTCCTGAACAACTGAGCAAACGAGAGAGGTACACCGACAAATGGCAGCGACGCACACACTCCCTCTGGCGTCACCGGGCATGGCGCGCATCTGCCTTTACGGGGATTTACAACGATTTGGTCGCCGCATCGACCTTCGTGTGAAAACGGGGGCTGAAGCCATCCGGGCACTGGCCACACAGCTCCCGGCGTTTCGTCAGAAACTGAGCGACGGCTGGTATCAGGTACGGATTGCCGGGCGGGACGTCAGCACGTCCGGGTTAACGGCGCAGTTACATGAGACTCTGCCTGATGGCGCTGTAATTCATATTGTTCCCAGAGTCGCCGGGGCCAAGTCAGGTGGCGTATTCCAGATTGTCCTGGGGGCTGCCGCCATTGCCGGATCATTCTTTACCGCCGGAGCCACCCTTGCAGCATGGGGGGCAGCCATTGGGGCCGGTGGTATGACCGGCATCCTGTTTTCTCTCGGTGCCAGTATGGTGCTCGGTGGTGTGGCGCAGATGCTGGCACCGAAAGCCAGAACTCCCCGTATACAGACAACGGATAACGGTAAGCAGAACACCTATTTCTCCTCACTGGATAACATGGTTGCCCAGGGCAATGTTCTGCCTGTTCTGTACGGGGAAATGCGCGTGGGGTCACGCGTGGTTTCTCAGGAGATCAGCACGGCAGACGAAGGGGACGGTGGTCAGGTTGTGGTGATTGGTCGCTGATGCAAAATGTTTTATGTGAAACCGCCTGCGGGCGGTTTTGTCATTTATGGAGCGTGAGGAATGGGTAAAGGAAGCAGTAAGGGGCATACCCCGCGCGAAGCGAAGGACAACCTGAAGTCCACGCAGTTGCTGAGTGTGATCGATGCCATCAGCGAAGGGCCGATTGAAGGTCCGGTGGATGGCTTAAAAAGCGTGCTGCTGAACAGTACGCCGGTGCTGGACACTGAGGGGAATACCAACATATCCGGTGTCACGGTGGTGTTCCGGGCTGGTGAGCAGGAGCAGACTCCGCCGGAGGGATTTGAATCCTCCGGCTCCGAGACGGTGCTGGGTACGGAAGTGAAATATGACACGCCGATCACCCGCACCATTACGTCTGCAAACATCGACCGTCTGCGCTTTACCTTCGGTGTACAGGCACTGGTGGAAACCACCTCAAAGGGTGACAGGAATCCGTCGGAAGTCCGCCTGCTGGTTCAGATACAACGTAACGGTGGCTGGGTGACGGAAAAAGACATCACCATTAAGGGCAAAACCACCTCGCAGTATCTGGCCTCGGTGGTGATGGGTAACCTGCCGCCGCGCCCGTTTAATATCCGGATGCGCAGGATGACGCCGGACAGCACCACAGACCAGCTGCAGAACAAAACGCTCTGGTCGTCATACACTGAAATCATCGATGTGAAACAGTGCTACCCGAACACGGCACTGGTCGGCGTGCAGGTGGACTCGGAGCAGTTCGGCAGCCAGCAGGTGAGCCGTAATTATCATCTGCGCGGGCGTATTCTGCAGGTGCCGTCGAACTATAACCCGCAGACGCGGCAATACAGCGGTATCTGGGACGGAACGTTTAAACCGGCATACAGCAACAACATGGCCTGGTGTCTGTGGGATATGCTGACCCATCCGCGCTACGGCATGGGGAAACGTCTTGGTGCGGCGGATGTGGATAAATGGGCGCTGTATGTCATCGGCCAGTACTGCGACCAGTCAGTGCCGGACGGCTTTGGCGGCACGGAGCCGCGCATCACCTGTAATGCGTACCTGACCACACAGCGTAAGGCGTGGGATGTGCTCAGCGATTTCTGCTCGGCGATGCGCTGTATGCCGGTATGGAACGGGCAGACGCTGACGTTCGTGCAGGACCGACCGTCGGATAAGACGTGGACCTATAACCGCAGTAATGTGGTGATGCCGGATGATGGCGCGCCGTTCCGCTACAGCTTCAGCGCCCTGAAGGACCGCCATAATGCCGTTGAGGTGAACTGGATTGACCCGAACAACGGCTGGGAGACGGCGACAGAGCTTGTTGAAGATACGCAGGCCATTGCCCGTTACGGTCGTAATGTTACGAAGATGGATGCCTTTGGCTGTACCAGCCGGGGGCAGGCACACCGCGCCGGGCTGTGGCTGATTAAAACAGAACTGCTGGAAACGCAGACCGTGGATTTCAGCGTCGGCGCAGAAGGGCTTCGCCATGTACCGGGCGATGTTATTGAAATCTGCGATGATGACTATGCCGGTATCAGCACCGGTGGTCGTGTGCTGGCGGTGAACAGCCAGACCCGGACGCTGACGCTCGACCGTGAAATCACGCTGCCATCCTCCGGTACCGCGCTGATAAGCCTGGTTGACGGAAGTGGCAATCCGGTCAGCGTGGAGGTTCAGTCCGTCACCGACGGCGTGAAGGTAAAAGTGAGCCGTGTTCCTGACGGTGTTGCTGAATACAGCGTATGGGGGCTGAAGCTGCCGACGCTGCGCCAGCGACTGTTCCGCTGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTGCCGGAAAAAGAGGCCATCGTGGATAACGGGGCGCACTTTGACGGCGAACAGAGTGGCACGGTGAATGGTGTCACGCCGCCAGCGGTGCAGCACCTGACCGCAGAAGTCACTGCAGACAGCGGGGAATATCAGGTGCTGGCGCGATGGGACACACCGAAGGTGGTGAAGGGCGTGAGTTTCCTGCTCCGTCTGACCGTAACAGCGGACGACGGCAGTGAGCGGCTGGTCAGCACGGCCCGGACGACGGAAACCACATACCGCTTCACGCAACTGGCGCTGGGGAACTACAGGCTGACAGTCCGGGCGGTAAATGCGTGGGGGCAGCAGGGCGATCCGGCGTCGGTATCGTTCCGGATTGCCGCACCGGCAGCACCGTCGAGGATTGAGCTGACGCCGGGCTATTTTCAGATAACCGCCACGCCGCATCTTGCCGTTTATGACCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAAAGCAGATTGCGGATATCAGACAGGTTGAAACCAGCACGCGTTATCTTGGTACGGCGCTGTACTGGATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTACTTTTATATCCGCAGTGTGAACACCGTTGGCAAATCGGCATTCGTGGAGGCCGTCGGTCGGGCGAGCGATGATGCGGAAGGTTACCTGGATTTTTTCAAAGGCAAGATAACCGAATCCCATCTCGGCAAGGAGCTGCTGGAAAAAGTCGAGCTGACGGAGGATAACGCCAGCAGACTGGAGGAGTTTTCGAAAGAGTGGAAGGATGCCAGTGATAAGTGGAATGCCATGTGGGCTGTCAAAATTGAGCAGACCAAAGACGGCAAACATTATGTCGCGGGTATTGGCCTCAGCATGGAGGACACGGAGGAAGGCAAACTGAGCCAGTTTCTGGTTGCCGCCAATCGTATCGCATTTATTGACCCGGCAAACGGGAATGAAACGCCGATGTTTGTGGCGCAGGGCAACCAGATATTCATGAACGACGTGTTCCTGAAGCGCCTGACGGCCCCCACCATTACCAGCGGCGGCAATCCTCCGGCCTTTTCCCTGACACCGGACGGAAAGCTGACCGCTAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACTCCGGGACGCTCAGTAATGTGACGATAGCTGAAAACTGTACGATAAACGGTACGCTGAGGGCGGAAAAAATCGTCGGGGACATTGTAAAGGCGGCGAGCGCGGCTTTTCCGCGCCAGCGTGAAAGCAGTGTGGACTGGCCGTCAGGTACCCGTACTGTCACCGTGACCGATGACCATCCTTTTGATCGCCAGATAGTGGTGCTTCCGCTGACGTTTCGCGGAAGTAAGCGTACTGTCAGCGGCAGGACAACGTATTCGATGTGTTATCTGAAAGTACTGATGAACGGTGCGGTGATTTATGATGGCGCGGCGAACGAGGCGGTACAGGTGTTCTCCCGTATTGTTGACATGCCAGCGGGTCGGGGAAACGTGATCCTGACGTTCACGCTTACGTCCACACGGCATTCGGCAGATATTCCGCCGGATACGTTTGCCAGCGATGTGCAGGTTATGGTGATTAAGAAACAGGCGCTGGGCATCAGCGTGGTCTGAGTGTGTTACAGAGGTTCGTCCGGGAACGGGCGTTTTATTATAAAACAGTGAGAGGTGAACGATGCGTAATGTGTGTATTGCCGTTGCTGTCTTTGCCGCACTTGCGGTGACAGTCACTCCGGCCCGTGCGGAAGGTGGACATGGTACGTTTACGGTGGGCTATTTTCAAGTGAAACCGGGTACATTGCCGTCGTTGTCGGGCGGGGATACCGGTGTGAGTCATCTGAAAGGGATTAACGTGAAGTACCGTTATGAGCTGACGGACAGTGTGGGGGTGATGGCTTCCCTGGGGTTCGCCGCGTCGAAAAAGAGCAGCACAGTGATGACCGGGGAGGATACGTTTCACTATGAGAGCCTGCGTGGACGTTATGTGAGCGTGATGGCCGGACCGGTTTTACAAATCAGTAAGCAGGTCAGTGCGTACGCCATGGCCGGAGTGGCTCACAGTCGGTGGTCCGGCAGTACAATGGATTACCGTAAGACGGAAATCACTCCCGGGTATATGAAAGAGACGACCACTGCCAGGGACGAAAGTGCAATGCGGCATACCTCAGTGGCGTGGAGTGCAGGTATACAGATTAATCCGGCAGCGTCCGTCGTTGTTGATATTGCTTATGAAGGCTCCGGCAGTGGCGACTGGCGTACTGACGGATTCATCGTTGGGGTCGGTTATAAATTCTGATTAGCCAGGTAACACAGTGTTATGACAGCCCGCCGGAACCGGTGGGCTTTTTTGTGGGGTGAATATGGCAGTAAAGATTTCAGGAGTCCTGAAAGACGGCACAGGAAAACCGGTACAGAACTGCACCATTCAGCTGAAAGCCAGACGTAACAGCACCACGGTGGTGGTGAACACGGTGGGCTCAGAGAATCCGGATGAAGCCGGGCGTTACAGCATGGATGTGGAGTACGGTCAGTACAGTGTCATCCTGCAGGTTGACGGTTTTCCACCATCGCACGCCGGGACCATCACCGTGTATGAAGATTCACAACCGGGGACGCTGAATGATTTTCTCTGTGCCATGACGGAGGATGATGCCCGGCCGGAGGTGCTGCGTCGTCTTGAACTGATGGTGGAAGAGGTGGCGCGTAACGCGTCCGTGGTGGCACAGAGTACGGCAGACGCGAAGAAATCAGCCGGCGATGCCAGTGCATCAGCTGCTCAGGTCGCGGCCCTTGTGACTGATGCAACTGACTCAGCACGCGCCGCCAGCACGTCCGCCGGACAGGCTGCATCGTCAGCTCAGGAAGCGTCCTCCGGCGCAGAAGCGGCATCAGCAAAGGCCACTGAAGCGGAAAAAAGTGCCGCAGCCGCAGAGTCCTCAAAAAACGCGGCGGCCACCAGTGCCGGTGCGGCGAAAACGTCAGAAACGAATGCTGCAGCGTCACAACAATCAGCCGCCACGTCTGCCTCCACCGCGGCCACGAAAGCGTCAGAGGCCGCCACTTCAGCACGAGATGCGGTGGCCTCAAAAGAGGCAGCAAAATCATCAGAAACGAACGCATCATCAAGTGCCGGTCGTGCAGCTTCCTCGGCAACGGCGGCAGAAAATTCTGCCAGGGCGGCAAAAACGTCCGAGACGAATGCCAGGTCATCTGAAACAGCAGCGGAACGGAGCGCCTCTGCCGCGGCAGACGCAAAAACAGCGGCGGCGGGGAGTGCGTCAACGGCATCCACGAAGGCGACAGAGGCTGCGGGAAGTGCGGTATCAGCATCGCAGAGCAAAAGTGCGGCAGAAGCGGCGGCAATACGTGCAGAAAATTCGGCAAAACGTGCAGAAGATATAGCTTCAGCTGTCGCGCTTGAGGATGCGGACACAACGAGAAAGGGGATAGTGCAGCTCAGCAGTGCAACCAACAGCACGTCTGAAACGCTTGCTGCAACGCCAAAGGCGGTTAAGGTGGTAATGGATGAAACGAACAGAAAAGCCCCACTGGACAGTCCGGCACTGACCGGAACGCCAACAGCACCAACCGCGCTCAGGGGAACAAACAATACCCAGATTGCGAACACCGCTTTTGTACTGGCCGCGATTGCAGATGTTATCGACGCGTCACCTGACGCACTGAATACGCTGAATGAACTGGCCGCAGCGCTCGGGAATGATCCAGATTTTGCTACCACCATGACTAACGCGCTTGCGGGTAAACAACCGAAGAATGCGACACTGACGGCGCTGGCAGGGCTTTCCACGGCGAAAAATAAATTACCGTATTTTGCGGAAAATGATGCCGCCAGCCTGACTGAACTGACTCAGGTTGGCAGGGATATTCTGGCAAAAAATTCCGTTGCAGATGTTCTTGAATACCTTGGGGCCGGTGAGAATTCGGCCTTTCCGGCAGGTGCGCCGATCCCGTGGCCATCAGATATCGTTCCGTCTGGCTACGTCCTGATGCAGGGGCAGGCGTTTGACAAATCAGCCTACCCAAAACTTGCTGTCGCGTATCCATCGGGTGTGCTTCCTGATATGCGAGGCTGGACAATCAAGGGGAAACCCGCCAGCGGTCGTGCTGTATTGTCTCAGGAACAGGATGGAATTAAGTCGCACACCCACAGTGCCAGTGCATCCGGTACGGATTTGGGGACGAAAACCACATCGTCGTTTGATTACGGGACGAAAACAACAGGCAGTTTCGATTACGGCACCAAATCGACGAATAACACGGGGGCTCATGCTCACAGTCTGAGCGGTTCAACAGGGGCCGCGGGTGCTCATGCCCACACAAGTGGTTTAAGGATGAACAGTTCTGGCTGGAGTCAGTATGGAACAGCAACCATTACAGGAAGTTTATCCACAGTTAAAGGAACCAACACACAGGGTATTGCTTATTTATCGAAAACGGACAGTCAGGGCAGCCACAGTCACTCATTGTCCGGTACAGCCGTGAGTGCCGGTGCACATGCGCATACAGTTGGTATTGGTGCGCACCAGCATCCGGTTGTTATCGGTGCTCATGCCCATTCTTTCAGTATTGGTTCACACGGACACACCATCACCGTTAACGCTGCGGGTAACGCGGAAAACACCGTCAAAAACATTGCATTTAACTATATTGTGAGGCTTGCATAATGGCATTCAGAATGAGTGAACAACCACGGACCATAAAAATTTATAATCTGCTGGCCGGAACTAATGAATTTATTGGTGAAGGTGACGCATATATTCCGCCTCATACCGGTCTGCCTGCAAACAGTACCGATATTGCACCGCCAGATATTCCGGCTGGCTTTGTGGCTGTTTTCAACAGTGATGAGGCATCGTGGCATCTCGTTGAAGACCATCGGGGTAAAACCGTCTATGACGTGGCTTCCGGCGACGCGTTATTTATTTCTGAACTCGGTCCGTTACCGGAAAATTTTACCTGGTTATCGCCGGGAGGGGAATATCAGAAGTGGAACGGCACAGCCTGGGTGAAGGATACGGAAGCAGAAAAACTGTTCCGGATCCGGGAGGCGGAAGAAACAAAAAAAAGCCTGATGCAGGTAGCCAGTGAGCATATTGCGCCGCTTCAGGATGCTGCAGATCTGGAAATTGCAACGGAGGAAGAAACCTCGTTGCTGGAAGCCTGGAAGAAGTATCGGGTGTTGCTGAACCGTGTTGATACATCAACTGCACCTGATATTGAGTGGCCTGCTGTCCCTGTTATGGAGTAATCGTTTTGTGATATGCCGCAGAAACGTTGTATGAAATAACGTTCTGCGGTTAGTTAGTATATTGTAAAGCTGAGTATTGGTTTATTTGGCGATTATTATCTTCAGGAGAATAATGGAAGTTCTATGACTCAATTGTTCATAGTGTTTACATCACCGCCAATTGCTTTTAAGACTGAACGCATGAAATATGGTTTTTCGTCATGTTTTGAGTCTGCTGTTGATATTTCTAAAGTCGGTTTTTTTTCTTCGTTTTCTCTAACTATTTTCCATGAAATACATTTTTGATTATTATTTGAATCAATTCCAATTACCTGAAGTCTTTCATCTATAATTGGCATTGTATGTATTGGTTTATTGGAGTAGATGCTTGCTTTTCTGAGCCATAGCTCTGATATCCAAATGAAGCCATAGGCATTTGTTATTTTGGCTCTGTCAGCTGCATAACGCCAAAAAATATATTTATCTGCTTGATCTTCAAATGTTGTATTGATTAAATCAATTGGATGGAATTGTTTATCATAAAAAATTAATGTTTGAATGTGATAACCGTCCTTTAAAAAAGTCGTTTCTGCAAGCTTGGCTGTATAGTCAACTAACTCTTCTGTCGAAGTGATATTTTTAGGCTTATCTACCAGTTTTAGACGCTCTTTAATATCTTCAGGAATTATTTTATTGTCATATTGTATCATGCTAAATGACAATTTGCTTATGGAGTAATCTTTTAATTTTAAATAAGTTATTCTCCTGGCTTCATCAAATAAAGAGTCGAATGATGTTGGCGAAATCACATCGTCACCCATTGGATTGTTTATTTGTATGCCAAGAGAGTTACAGCAGTTATACATTCTGCCATAGATTATAGCTAAGGCATGTAATAATTCGTAATCTTTTAGCGTATTAGCGACCCATCGTCTTTCTGATTTAATAATAGATGATTCAGTTAAATATGAAGGTAATTTCTTTTGTGCAAGTCTGACTAACTTTTTTATACCAATGTTTAACATACTTTCATTTGTAATAAACTCAATGTCATTTTCTTCAATGTAAGATGAAATAAGAGTAGCCTTTGCCTCGCTATACATTTCTAAATCGCCTTGTTTTTCTATCGTATTGCGAGAATTTTTAGCCCAAGCCATTAATGGATCATTTTTCCATTTTTCAATAACATTATTGTTATACCAAATGTCATATCCTATAATCTGGTTTTTGTTTTTTTGAATAATAAATGTTACTGTTCTTGCGGTTTGGAGGAATTGATTCAAATTCAAGCGAAATAATTCAGGGTCAAAATATGTATCAATGCAGCATTTGAGCAAGTGCGATAAATCTTTAAGTCTTCTTTCCCATGGTTTTTTAGTCATAAAACTCTCCATTTTGATAGGTTGCATGCTAGATGCTGATATATTTTAGAGGTGATAAAATTAACTGCTTAACTGTCAATGTAATACAAGTTGTTTGATCTTTGCAATGATTCTTATCAGAAACCATATAGTAAATTAGTTACACAGGAAATTTTTAATATTATTATTATCATTCATTATGTATTAAAATTAGAGTTGTGGCTTGGCTCTGCTAACACGTTGCTCATAGGAGATATGGTAGAGCCGCAGACACGTCGTATGCAGGAACGTGCTGCGGCTGGCTGGTGAACTTCCGATAGTGCGGGTGTTGAATGATTTCCAGTTGCTACCGATTTTACATATTTTTTGCATGAGAGAATTTGTACCACCTCCCACCGACCATCTATGACTGTACGCCACTGTCCCTAGGACTGCTATGTGCCGGAGCGGACATTACAAACGTCCTTCTCGGTGCATGCCACTGTTGCCAATGACCTGCCTAGGAATTGGTTAGCAAGTTACTACCGGATTTTGTAAAAACAGCCCTCCTCATATAAAAAGTATTCGTTCACTTCCGATAAGCGTCGTAATTTTCTATCTTTCATCATATTCTAGATCCCTCTGAAAAAATCTTCCGAGTTTGCTAGGCACTGATACATAACTCTTTTCCAATAATTGGGGAAGTCATTCAAATCTATAATAGGTTTCAGATTTGCTTCAATAAATTCTGACTGTAGCTGCTGAAACGTTGCGGTTGAACTATATTTCCTTATAACTTTTACGAAAGAGTTTCTTTGAGTAATCACTTCACTCAAGTGCTTCCCTGCCTCCAAACGATACCTGTTAGCAATATTTAATAGCTTGAAATGATGAAGAGCTCTGTGTTTGTCTTCCTGCCTCCAGTTCGCCGGGCATTCAACATAAAAACTGATAGCACCCGGAGTTCCGGAAACGAAATTTGCATATACCCATTGCTCACGAAAAAAAATGTCCTTGTCGATATAGGGATGAATCGCTTGGTGTACCTCATCTACTGCGAAAACTTGACCTTTCTCTCCCATATTGCAGTCGCGGCACGATGGAACTAAATTAATAGGCATCACCGAAAATTCAGGATAATGTGCAATAGGAAGAAAATGATCTATATTTTTTGTCTGTCCTATATCACCACAAAATGGACATTTTTCACCTGATGAAACAAGCATGTCATCGTAATATGTTCTAGCGGGTTTGTTTTTATCTCGGAGATTATTTTCATAAAGCTTTTCTAATTTAACCTTTGTCAGGTTACCAACTACTAAGGTTGTAGGCTCAAGAGGGTGTGTCCTGTCGTAGGTAAATAACTGACCTGTCGAGCTTAATATTCTATATTGTTGTTCTTTCTGCAAAAAAGTGGGGAAGTGAGTAATGAAATTATTTCTAACATTTATCTGCATCATACCTTCCGAGCATTTATTAAGCATTTCGCTATAAGTTCTCGCTGGAAGAGGTAGTTTTTTCATTGTACTTTACCTTCATCTCTGTTCATTATCATCGCTTTTAAAACGGTTCGACCTTCTAATCCTATCTGACCATTATAATTTTTTAGAATGGTTTCATAAGAAAGCTCTGAATCAACGGACTGCGATAATAAGTGGTGGTATCCAGAATTTGTCACTTCAAGTAAAAACACCTCACGAGTTAAAACACCTAAGTTCTCACCGAATGTCTCAATATCCGGACGGATAATATTTATTGCTTCTCTTGACCGTAGGACTTTCCACACGCAGGATTTTGGAACCTCTTGCAGTACTACTGGGGAATGAGTTGCAATTATTGCTACACCATTGCGTGCATCGAGTAAGTCGCTTAATGTTCGTAAAAAAGCAGAGAGCAAAGGTGGATGCAGATGAACCTCTGGTTCATCGAATAAAACTAATGACTTTTCGCCAACGACATCTACTAATCTTGTGATAGTAAATAAAACAATTGCATGTCCAGAGCTCATTCGAAGCAGATATTTCTGGATATTGTCATAAAACAATTTAGTGAATTTATCATCGTCCACTTGAATCTGTGGTTCATTACGTCTTAACTCTTCATATTTAGAAATGAGGCTGATGAGTTCCATATTTGAAAAGTTTTCATCACTACTTAGTTTTTTGATAGCTTCAAGCCAGAGTTGTCTTTTTCTATCTACTCTCATACAACCAATAAATGCTGAAATGAATTCTAAGCGGAGATCGCCTAGTGATTTTAAACTATTGCTGGCAGCATTCTTGAGTCCAATATAAAAGTATTGTGTACCTTTTGCTGGGTCAGGTTGTTCTTTAGGAGGAGTAAAAGGATCAAATGCACTAAACGAAACTGAAACAAGCGATCGAAAATATCCCTTTGGGATTCTTGACTCGATAAGTCTATTATTTTCAGAGAAAAAATATTCATTGTTTTCTGGGTTGGTGATTGCACCAATCATTCCATTCAAAATTGTTGTTTTACCACACCCATTCCGCCCGATAAAAGCATGAATGTTCGTGCTGGGCATAGAATTAACCGTCACCTCAAAAGGTATAGTTAAATCACTGAATCCGGGAGCACTTTTTCTATTAAATGAAAAGTGGAAATCTGACAATTCTGGCAAACCATTTAACACACGTGCGAACTGTCCATGAATTTCTGAAAGAGTTACCCCTCTAAGTAATGAGGTGTTAAGGACGCTTTCATTTTCAATGTCGGCTAATCGATTTGGCCATACTACTAAATCCTGAATAGCTTTAAGAAGGTTATGTTTAAAACCATCGCTTAATTTGCTGAGATTAACATAGTAGTCAATGCTTTCACCTAAGGAAAAAAACATTTCAGGGAGTTGACTGAATTTTTTATCTATTAATGAATAAGTGCTTACTTCTTCTTTTTGACCTACAAAACCAATTTTAACATTTCCGATATCGCATTTTTCACCATGCTCATCAAAGACAGTAAGATAAAACATTGTAACAAAGGAATAGTCATTCCAACCATCTGCTCGTAGGAATGCCTTATTTTTTTCTACTGCAGGAATATACCCGCCTCTTTCAATAACACTAAACTCCAACATATAGTAACCCTTAATTTTATTAAAATAACCGCAATTTATTTGGCGGCAACACAGGATCTCTCTTTTAAGTTACTCTCTATTACATACGTTTTCCATCTAAAAATTAGTAGTATTGAACTTAACGGGGCATCGTATTGTAGTTTTCCATATTTAGCTTTCTGCTTCCTTTTGGATAACCCACTGTTATTCATGTTGCATGGTGCACTGTTTATACCAACGATATAGTCTATTAATGCATATATAGTATCGCCGAACGATTAGCTCTTCAGGCTTCTGAAGAAGCGTTTCAAGTACTAATAAGCCGATAGATAGCCACGGACTTCGTAGCCATTTTTCATAAGTGTTAACTTCCGCTCCTCGCTCATAACAGACATTCACTACAGTTATGGCGGAAAGGTATGCATGCTGGGTGTGGGGAAGTCGTGAAAGAAAAGAAGTCAGCTGCGTCGTTTGACATCACTGCTATCTTCTTACTGGTTATGCAGGTCGTAGTGGGTGGCACACAAAGCTTTGCACTGGATTGCGAGGCTTTGTGCTTCTCTGGAGTGCGACAGGTTTGATGACAAAAAATTAGCGCAAGAAGACAAAAATCACCTTGCGCTAATGCTCTGTTACAGGTCACTAATACCATCTAAGTAGTTGATTCATAGTGACTGCATATGTTGTGTTTTACAGTATTATGTAGTCTGTTTTTTATGCAAAATCTAATTTAATATATTGATATTTATATCATTTTACGTTTCTCGTTCAGCTTTTTTATACTAACTTGAGCGAAACGGGAAGGTAAAAAGACAAAAAGTTGTTTTTAATACCTTTAAGTGATACCAGATGGCATTGCGCCATCTGGCAGAGTGATTAACTAAACATCGCAGTAATCGAGGCGCTTGCCAGAGAGTGGAAATGAACGTTAAACCCGACCATCGCGCCGCTGGCACCTTCATCGACATCAATACGTTCTATATCCAGCGCGTGAACGGTAAAAATGTAGCGATGAGTTTCGCCTTTCGGCGGTGCTGCGCCATCGTACCCGGTTTTACCAAAGTCGGTACGCGTCTGCAAAACGCCGTCTGGCATTGCTACCAGACCAGAGCCAAACCCTTGCGGTAATACGCGGGTATCAGCGGGTAAGTTAACAACTACCCAGTGCCACCAGCCGGAGCCGGTTGGCGCATCCGGGTCGTAGCAGGTGACAACAAAACTTTTCGTTCCCGCAGGAACATCATCCCACGCCAGATGCGGTGAAATATTATCGCCATCGTAACCCATGCCGTTAAAGACATGACGATGCGGCAATTTATCGCCATCGCGCAGATCGTTACTGATGAGTTTCATGAACCCTCCTTTCTTGTTTGCAGAAAGTGTAGCCAGAAACCCTCACGCGGACTTCTCGTTATTGGCAAAAAAATGTTTCATCCTGTACCGCGCGGTTAACCGCTGCGGTCAGACGCTGCAACTGTTGCGGGAGAATAATATAGGGCGGCATCAGGTAAATCAGTTTGCCAAAAGGCCGGATCCAGACACCCTGTTCGACAAAGAATTTTTGCAGCGCCGCCATATTCACCGGATGAGTGGTTTCGACCACGCCAATGGCCCCCAGTACGCGCACATCGGCAACCATTTCGGCATCACGGGCGGGGGCAAGTTGCTCGCGCAGCTGTACTTCAATATCCGCCACCTGTTGCTGCCAGTCGCCAGATTCGAGAATCGCCAGGCTGGCGTTTGCTGCCGCGCAGGCCAGCGGATTGCCCATAAAAGTTGGCCCATGCATAAAGCAACCGGCTTCACCGTTACTGATGGTTTCTGCAACCTCGCGCGTGGTGAGTGTGGCGGAAAGGGTCATTGTGCCGCCGGTTAAGGCTTTACCGAGGCACAAAATGTCCGGCGCGATTTCTGCATGTTCACAGGCAAACAGTTTCCCGGTACGACCAAATCCAGTGGCGATCTCGTCGGCAATCAGCAAGATACCTTCGCGATCGCATATTTTGCGGATTCGTTTTAACCATTCCGGATGGTACATGCGCATCCCGCCTGCGCCCTGGACAATCGGCTCAATGATCACCGCCGCGATTTCATGACGATGCGCCGCCATCAGGCGGGCAAAGCCCACCATATCGCGCTCATCCCATTCGCCATCCATGCGGCTTTGCGGGGCGGGAGCAAACAGGTTTTCTGGCAGGTAGCCTTTCCACAGACTGTGCATTGAGTTATCCGGATCGCACACCGACATCGCGCCAAAGGTATCGCCATGATAACCATTGCGGAAGGTCAGAAAACGCTGGCGCGCTTCGCCTTTGGCTTGCCAGTACTGCAACGCCATTTTCATCGCCACTTCCACCGCTACGGAACCGGAGTCCGCGAGAAAAACGCACTCCAGCGGTTGCGGCGTCATCGCCACCAGTTTGCGGCACAGCTCAATGGCTGGCGCATGGGTGATACCGCCAAACATCACATGCGACATGGCATCAATTTGCGACTTCATCGCCGCATTAAGCTGCGGGTGATTGTAGCCGTGGATCGCCGCCCACCAGGACGACATACCGTCAACCAGGCGTCTGCCGTCAGACAAAATCAGCTCGCAACCTTCGGCGCTCACCACCGGATAAACCGGCAGAGGGGAGGTCATGGATGTGTATGGGTGCCAGATATGGCGTTGGTCAAAGGCAAGATCGTCCGTTGTCAT
Protein sequences of DBSCAN-SWA_2 >CP028702|1190582:1240927|1220982_1221405_+|AVZ48359.1|tail|DBSCAN-SWA MFLKTESFEHNGVTVTLSELSALQRIEHLALMKRQAEQAESDSNRKFTVEDAIRTGAFLVAMSLWHNHPQKTQMPSMNEAVKQIEQEVLTTWPTEAISHAENVVYRLSGMYEFVVNNAPEQTEDAGPAEPVSAGKCSTVS >CP028702|1190582:1240927|1215689_1217009_+|AVZ48352.1|DBSCAN-SWA MTAELRNLPHIASMAFNEPLMLEPAYARVFFCALAGQLGISSLTDAVSGDSLTAQEALATLALSGDDDGPRQARSYQVMNGIAVLPVSGTLVSRTRALQPYSGMTGYNGIIARLQQAASDPMVDGILLDMDTPGGMVAGAFDCADIIARVRDIKPVWALANDMNCSAGQLLASAASRRLVTQTARTGSIGVMMAHSNYGAALEKQGVEITLIYSGSHKVDGNPYSHLPDDVRETLQSRMDATRQMFAQKVSAYTGLSVQVVLDTEAAVYSGQEAIDAGLADELVNSTDAITVMRDALDARKSRLSGGRMTKETQSTTVSATASQADVTDVVPATEGENASAAQPDVNAQITAAVAAENSRIMGILNCEEAHGREEQARVLAETPGMTVKTARRILAAAPQSAQARSDTALDRLMQGAPAPLAAGNPASDAVNDLLNTPV >CP028702|1190582:1240927|1207391_1207586_+|AVZ48339.1|DBSCAN-SWA MKRGGAYYRFRLVGHFDVSSGTPTIAGREVCKMQSRNSSQVIVRACITVSGFFISAQQVRALSR >CP028702|1190582:1240927|1190582_1191653_-|AVZ48308.1|integrase|DBSCAN-SWA MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNHVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTALHIDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP028702|1190582:1240927|1200811_1201012_+|AVZ48325.1|DBSCAN-SWA MEQRITLKDYAMRFGQTKTAKDLGVYQSAINKAIHAGRKIFLTINADGSVYAEEVKPFPSNKKTTA >CP028702|1190582:1240927|1206656_1207280_+|AVZ48338.1|DBSCAN-SWA MRLESVAKFHSPKSPMMSDSPRATASDSLSGTDVMAAMGMAQSQAGFGMAAFCGKHELSQNDKQKAINYLMQFAHKVSGKYRGVAKLEGNTKAKVLQVLATFAYADYCRSAATPGARCRDCHGTGRAVDIAKTELWGRVVEKECGRCKGVGYSRMPASAAYRAVTMLIPNLTQPTWSRTVKPLYDALVVQCHKEESIADNILNAVTR >CP028702|1190582:1240927|1224700_1225399_+|AVZ48363.1|tail|DBSCAN-SWA MQDIRQETLNECTRAEQSASVVLWEIDLTEVGGERYFFCNEQNEKGEPVTWQGRQYQPYPIQGSGFELNGKGTSTRPTLTVSNLYGMVTGMAEDMQSLVGGTVVRRKVYARFLDAVNFVNGNSYADPEQEVISRWRIEQCSELSAVSASFVLSTPTETDGAVFPGRIMLANTCTWTYRGDECGYSGPAVADEYDQPTSDITKDKCSKCLSGCKFRNNVGNFGGFLSINKLSQ >CP028702|1190582:1240927|1217406_1218432_+|AVZ48354.1|capsid|DBSCAN-SWA MSMYTTAQLLAANEQKFKFDPLFLRLFFRESYPFTTEKVYLSQIPGLVNMALYVSPIVSGEVIRSRGGSTSEFTPGYVKPKHEVNPQMTLRRLPDEDPQNLADPAYRRRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFDPVEVDMGRSEENNITQSGGTEWSKRDKSTYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETAVKDLGKAVSYKGMYGDVAIVVYSGQYVENGVKKNFLPDNTMVLGNTQARGLRTYGCIQDADAQREGINASARYPKNWVTTGDPAREFTMIQSAPLMLLADPDEFVSVQLA >CP028702|1190582:1240927|1196306_1196675_-|AVZ48319.1|DBSCAN-SWA MSNIKKYIIDYDWKASIEIEIDHDVMTEEKLHQINNFWSDSEYRLNKHGSVLNAVLIMLAQHALLIAISSDLNAYGVVCEFDWNDGNGQEGWPPMDGSEGIRITDIDTSGIFDSDDMTIKAA >CP028702|1190582:1240927|1239637_1240927_-|AVZ48375.1|DBSCAN-SWA MTTDDLAFDQRHIWHPYTSMTSPLPVYPVVSAEGCELILSDGRRLVDGMSSWWAAIHGYNHPQLNAAMKSQIDAMSHVMFGGITHAPAIELCRKLVAMTPQPLECVFLADSGSVAVEVAMKMALQYWQAKGEARQRFLTFRNGYHGDTFGAMSVCDPDNSMHSLWKGYLPENLFAPAPQSRMDGEWDERDMVGFARLMAAHRHEIAAVIIEPIVQGAGGMRMYHPEWLKRIRKICDREGILLIADEIATGFGRTGKLFACEHAEIAPDILCLGKALTGGTMTLSATLTTREVAETISNGEAGCFMHGPTFMGNPLACAAANASLAILESGDWQQQVADIEVQLREQLAPARDAEMVADVRVLGAIGVVETTHPVNMAALQKFFVEQGVWIRPFGKLIYLMPPYIILPQQLQRLTAAVNRAVQDETFFCQ >CP028702|1190582:1240927|1235782_1236673_-|AVZ48371.1|DBSCAN-SWA MKKLPLPARTYSEMLNKCSEGMMQINVRNNFITHFPTFLQKEQQYRILSSTGQLFTYDRTHPLEPTTLVVGNLTKVKLEKLYENNLRDKNKPARTYYDDMLVSSGEKCPFCGDIGQTKNIDHFLPIAHYPEFSVMPINLVPSCRDCNMGEKGQVFAVDEVHQAIHPYIDKDIFFREQWVYANFVSGTPGAISFYVECPANWRQEDKHRALHHFKLLNIANRYRLEAGKHLSEVITQRNSFVKVIRKYSSTATFQQLQSEFIEANLKPIIDLNDFPNYWKRVMYQCLANSEDFFRGI >CP028702|1190582:1240927|1220226_1220967_+|AVZ51612.1|tail|DBSCAN-SWA MPVPNPTMPVKGAGTTLWVYKGSGDPYANPLSDVDWSRLAKVKDLTPGELTAESYDDSYLDDEDADWTATGQGQKSAGDTSFTLAWMPGEQGQQALLAWFNEGDTRAYKIRFPNGTVDVFRGWVSSIGKAVTAKEVITRTVKVTNVGRPSMAEDRSTVTAATGMTVTPASTSVVKGQSTTLTVAFQPEGVTDKSFRAVSADKTKATVSVSGMTITVNGVAAGKVNIPVVSGNGEFAAVAEITVTAS >CP028702|1190582:1240927|1195957_1196227_-|AVZ48318.1|DBSCAN-SWA MPLQGGLLLAALPNLYLNESPVNYVTDGNALSTYLISQESQRMDQTLMAIQTKFTIATFIGDEKMFREAVDAYKKWILILKLRSSKSIH >CP028702|1190582:1240927|1208263_1208740_+|AVZ48341.1|DBSCAN-SWA MVEINNQRKAFLDMLAWSEGTDNGRQKTRNHGYDVIVGGELFTDYSDHPRKLVTLNPKLKSTGAGRYQLLSRWWDAYRKQLGLKDFSPKSQDAVALQQIKERGALPMIDRGDIRQAIDRCSNIWASLPGAGYGQFEHKADSLIAKFKEAGGTVREIDV >CP028702|1190582:1240927|1204860_1205043_+|AVZ48333.1|DBSCAN-SWA MARQRRSITDIICENCKYLPTKRTRNKPKPIPKESDVKTFNYTAHLWDIRWLRRRARKTR >CP028702|1190582:1240927|1202352_1203054_+|AVZ48328.1|DBSCAN-SWA MKNIAAQMVNFDREQMRRIANNMPEQYDEKPQVQQVAQIINGVFSQLLATFPASLANRDQNEVNEIRRQWVLAFRENGITTMEQVNAGMRVARRQNRPFLPSPGQFVAWCREEASVTAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKSNAHYWLVTNLYQNMRANALTDAELRRKAADELVHMTARINRGEAIPEPVKQLPVMGGRPLNRAQALAKIAEIKAKFGLKGASV >CP028702|1190582:1240927|1217018_1217351_+|AVZ48353.1|head|DBSCAN-SWA MTSKETFTHYQPQGNSDPAHTATAPGGLSAKAPAMTPLMLDTSSRKLVAWDGTTDGAAVGILAVAADQTSTTLTFYKSGTFRYEDVLWPEAASDETKKRTAFAGTAISIV >CP028702|1190582:1240927|1201130_1201424_+|AVZ48326.1|DBSCAN-SWA MVRANKRNEALRIESALLNKIAMLGTEKTAEAVGVDKSQISRWKRDWIPKFSMLLAVLEWGVVDDDMARLARQVAAILTNKKRPAATERSEQIQMEF >CP028702|1190582:1240927|1211462_1212008_+|AVZ48348.1|terminase|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELRQASEADLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGLPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIPGLLSEYIEQSG >CP028702|1190582:1240927|1191888_1192056_-|AVZ48310.1|DBSCAN-SWA MHFRVTGEWNGEPFNRVIEAENINDCYDHWMIWAQIAHADVTNIRIEELKEHQAA >CP028702|1190582:1240927|1192144_1192426_-|AVZ48311.1|DBSCAN-SWA MSINELESEQKDWALSMLCRSGVLSPCRHHEGVYVDEGIDIESAYKYSMKVYKSNEDKSPFCNVREMTDTVQNYYHEYGGNDTCPLCTKHIDD >CP028702|1190582:1240927|1208736_1209198_+|AVZ48342.1|lysis|DBSCAN-SWA MSRVTAIISALVICIIVCLSWAVNHYRDNAITYKAQRDKNARELKLANAAITDMQMRQRDVAALDAKYTKELADAKAENDALRDDVAAGRRRLHIKAVCQSVREATTASGVDNAASPRLADTAERDYFTLRERLITMQKQLEGTQKYINEQCR >CP028702|1190582:1240927|1225404_1226148_+|AVZ48364.1|tail|DBSCAN-SWA MTQTESAILAHARRCAPAESCGFVVSTPEGERYFPCVNISGEPEAYFRMSPEDWLQAEMQGEIVALVHSHPGGLPWLSEADRRLQVQSDLPWWLVCRGTIHKFRCVPHLTGRRFEHGVTDCYTLFRDAYHLAGIEMPDFHREDDWWRNGQNLYLDNLEATGLYQVPLSAAQPGDVLLCCFGSSVPNHAAIYCGDGELLHHIPEQLSKRERYTDKWQRRTHSLWRHRAWRASAFTGIYNDLVAASTFV >CP028702|1190582:1240927|1210879_1211074_-|AVZ48346.1|DBSCAN-SWA MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSEKRAYTSIKSDEE >CP028702|1190582:1240927|1218473_1218872_+|AVZ48355.1|DBSCAN-SWA MTKDELIARLRSLGEQLNRDVSLTGTKEELALRVAELKEELDDTDETAGQDTPLSRENVLTGHENEVGSAQPDTVILDTSELVTVVALVKLHTDALHATRDEPVAFVLPGTAFRVSAGVAAEMTERGLARMQ >CP028702|1190582:1240927|1218883_1219237_+|AVZ48356.1|tail|DBSCAN-SWA MADFDNLFDAAIARADETIRGYMGTSATITSGEQSGAVIRGVFDDPENISYAGQGVRVEGSSPSLFVRTDEVRQLRRGDTLTIGEENFWVDRVSPDDGGSCHLWLGRGVPPAVNRRR >CP028702|1190582:1240927|1211221_1211323_+|AVZ48347.1|DBSCAN-SWA MIIIITLRVLSGDPTGYGAATSRVFAIYENFPV >CP028702|1190582:1240927|1201456_1202356_+|AVZ48327.1|DBSCAN-SWA MTNTAKILNFGRGNFAGQERNVADLDDGYARLSNMLLEAYSGADLTKRQFKVLLAILRKTYGWNKPMDRITDSQLSEITKLPVKRCNEAKLELVRMNIIKQQGGMFGPNKNISEWCIPQNEGKSPKTRDKTSLKLGDCYPSKQGDTKDTITKEKRKDYSSENSGESSDQPENDLSVVKPDAAIQSGSKWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIRLMRERDGRNHRDMCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQQAGVTASKPKLDLTNTDWIYGVDL >CP028702|1190582:1240927|1205202_1205814_+|AVZ48335.1|DBSCAN-SWA MAKPARRRCKNDECREWFHPAFANQWWCSPECGTKIALERRSKEREKAEKAAEKKRRREEQKQKDKLKIRKLALKPRSYWIKQAQQAVNAFIRERDRDLPCISCGTLTSAQWDAGHYRTTAAAPQLRFNERNIHKQCVVCNQHKSGNLVPYRVELISRIGQEAVDEIESNHNRHRWTIEECKAIKAEYQQKLKDLRNSRSEAA >CP028702|1190582:1240927|1192617_1193166_-|AVZ48312.1|DBSCAN-SWA MSEINSQALREAAEQAMHDDWGFDADLFHELVTPSIVLELLDERERNQQYIKRRDQENEDIALTVGKLRVELETAKSKLNEQREYYEGVISDGSKRIAKLESNEVREDGNQFLVVRHPGKTPVIKHCTGDLEEFLRQLIEQDPLVTIDIITHRYYGVGGQWVQDAGEYLHMMSDAGIRIKGE >CP028702|1190582:1240927|1209812_1210223_-|AVZ48344.1|DBSCAN-SWA MAQVAIFKEIFDQVRKDLDCELFYSELKRHNVSHYIYYLATDNIHIVLENDNTVLIKGLKKVVNVKFSRNTHLIETSYDRLKSREITFQQYRENLAKAGVFRWVTNIHEHKRYYYTFDNSLLFTESIQNTTQIFPR >CP028702|1190582:1240927|1199045_1199885_-|AVZ48323.1|DBSCAN-SWA MKNGFYATYRSKNKGKDKRSINLSVFLNSLLADNHHLQVGSNYLYIHKIDGKTFLFTKTNDKSLVQKINRSKASVEDIKNSLADDESLGFPSFLFVEGDTIGFARTVFGPTTSDLTDFLIGKGMSLSSGERVQIEPLMRGTTKDDVMHMHFIGRTTVKVEAKLPVFGDILKVLGATDIEGELFDSLDIVIKPKFKRDIKKVAKDIIFNPSPQFSDISLRAKDEAGDILTEHYLSEKGHLSAPLNKVTNAEIAEEMAYCYARMKSDILECFKRQVGKVKD >CP028702|1190582:1240927|1233959_1235192_-|AVZ48370.1|DBSCAN-SWA MTKKPWERRLKDLSHLLKCCIDTYFDPELFRLNLNQFLQTARTVTFIIQKNKNQIIGYDIWYNNNVIEKWKNDPLMAWAKNSRNTIEKQGDLEMYSEAKATLISSYIEENDIEFITNESMLNIGIKKLVRLAQKKLPSYLTESSIIKSERRWVANTLKDYELLHALAIIYGRMYNCCNSLGIQINNPMGDDVISPTSFDSLFDEARRITYLKLKDYSISKLSFSMIQYDNKIIPEDIKERLKLVDKPKNITSTEELVDYTAKLAETTFLKDGYHIQTLIFYDKQFHPIDLINTTFEDQADKYIFWRYAADRAKITNAYGFIWISELWLRKASIYSNKPIHTMPIIDERLQVIGIDSNNNQKCISWKIVRENEEKKPTLEISTADSKHDEKPYFMRSVLKAIGGDVNTMNN >CP028702|1190582:1240927|1214107_1215709_+|AVZ48351.1|portal|DBSCAN-SWA MKTPTIPTLLGPDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSRAA >CP028702|1190582:1240927|1193939_1194122_-|AVZ51610.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLLLS >CP028702|1190582:1240927|1197324_1197807_+|AVZ48321.1|DBSCAN-SWA MYMIVIWVGLLLLSPDNWPEYVNERIGIPHVWHVFVFALAFSLAINVHRLSAIASARYKRFKLRKRIKMQNDKVRSVIQNLTEEQSMVLCAALNEGRKYVVTSKQFPYISELIELGVLNKTFSRWNGKHILFPIEDIYWTELVASYDPYNIEIKPRPISK >CP028702|1190582:1240927|1198595_1199030_-|AVZ48322.1|DBSCAN-SWA MRNRIMPGVYIVIIPYVIVSICYLLFRHYIPGVSFSAHRDGLGATLSSYAGTMIAILIAALTFLIGSRTRRLAKIREYGYMTSVVIVYALSFVELGALFFCGLLLLSSISGYMIPTIAIGIASASFIHICILVFQLYNLTREQE >CP028702|1190582:1240927|1194118_1194799_-|AVZ48315.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKNAWYFANYDPRMKREGLHYVVIERDEKYMASFDEIVPEFIEKMDEALAEIGFVFGEQWR >CP028702|1190582:1240927|1195586_1196003_-|AVZ48317.1|DBSCAN-SWA MDINTETEIKQKHSLTPFPVFLISPAFRGRYFHSYFRSSAMNAYYIQDRLEAQSWARHYQQLAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKSITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >CP028702|1190582:1240927|1203396_1203855_+|AVZ48330.1|DBSCAN-SWA MGEREVMKKLTFEIRSPAHQQNAIHAVQQILPDPTKPIVVTIQERNRSLDQNRKLWACLGDVSRQVEWHGRWLDAESWKCVFTAALKQQDVVPNLAGNGFVVIGQSTSRMRVGEFAELLELIQAFGTERGVKWSDEARLALEWKARWGDRAA >CP028702|1190582:1240927|1205039_1205210_+|AVZ48334.1|DBSCAN-SWA MIDQNRSYEQESVERALTCANCGQKLHVLEVHVCEHCCAELMSDPNSSMHEEEDDG >CP028702|1190582:1240927|1213904_1214111_+|AVZ48350.1|head,tail|DBSCAN-SWA MTRQEELAAARAALHDLMTGKRVATVQKDGRRVEFTATSVSDLKKYIAELEVQTGMTQRRRGPAGFYV >CP028702|1190582:1240927|1233246_1233831_+|AVZ48369.1|tail|DBSCAN-SWA MAFRMSEQPRTIKIYNLLAGTNEFIGEGDAYIPPHTGLPANSTDIAPPDIPAGFVAVFNSDEASWHLVEDHRGKTVYDVASGDALFISELGPLPENFTWLSPGGEYQKWNGTAWVKDTEAEKLFRIREAEETKKSLMQVASEHIAPLQDAADLEIATEEETSLLEAWKKYRVLLNRVDTSTAPDIEWPAVPVME >CP028702|1190582:1240927|1219823_1220219_+|AVZ48358.1|tail|DBSCAN-SWA MKHTELRAAVLDALEKHDTGATFFDGRPAVFDEADFPAVAVYLTGAEYTGEELDSDTWQAELHIEVFLPAQVPDSELDAWMESRIYPVMSDIPALSDLITSMVASGYDYRRDDDAGLWSSADLTYVITYEM >CP028702|1190582:1240927|1205994_1206660_+|AVZ48337.1|DBSCAN-SWA MRYYEKIDGSKYRNIWVVGDLHGCYTNLMNKLDTIGFDNKKDLLISVGDLVDRGAENVECLELITFPWFRAVRGNHEQMMIDGLSERGNVNHWLLNGGGWFFNLDYDKEILAKALAHKADELPLIIELVSKDKKYVICHADYPFDEYEFGKPVDHQQVIWNRERISNSQNGIVKEIKGADTFIFGHTPAVKPLKFANQMYIDTGAVFCGNLTLIQVQGEGA >CP028702|1190582:1240927|1209229_1209523_-|AVZ48343.1|DBSCAN-SWA MKKMLLATALALLITGCAQQTFTVQNKPAAVAPKETITHHFFVSGIGQKKTVDAAKICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ >CP028702|1190582:1240927|1226777_1230176_+|AVZ48366.1|DBSCAN-SWA MGKGSSKGHTPREAKDNLKSTQLLSVIDAISEGPIEGPVDGLKSVLLNSTPVLDTEGNTNISGVTVVFRAGEQEQTPPEGFESSGSETVLGTEVKYDTPITRTITSANIDRLRFTFGVQALVETTSKGDRNPSEVRLLVQIQRNGGWVTEKDITIKGKTTSQYLASVVMGNLPPRPFNIRMRRMTPDSTTDQLQNKTLWSSYTEIIDVKQCYPNTALVGVQVDSEQFGSQQVSRNYHLRGRILQVPSNYNPQTRQYSGIWDGTFKPAYSNNMAWCLWDMLTHPRYGMGKRLGAADVDKWALYVIGQYCDQSVPDGFGGTEPRITCNAYLTTQRKAWDVLSDFCSAMRCMPVWNGQTLTFVQDRPSDKTWTYNRSNVVMPDDGAPFRYSFSALKDRHNAVEVNWIDPNNGWETATELVEDTQAIARYGRNVTKMDAFGCTSRGQAHRAGLWLIKTELLETQTVDFSVGAEGLRHVPGDVIEICDDDYAGISTGGRVLAVNSQTRTLTLDREITLPSSGTALISLVDGSGNPVSVEVQSVTDGVKVKVSRVPDGVAEYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGEQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFLLRLTVTADDGSERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQQGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKQIADIRQVETSTRYLGTALYWIAASINIKPGHDYYFYIRSVNTVGKSAFVEAVGRASDDAEGYLDFFKGKITESHLGKELLEKVELTEDNASRLEEFSKEWKDASDKWNAMWAVKIEQTKDGKHYVAGIGLSMEDTEEGKLSQFLVAANRIAFIDPANGNETPMFVAQGNQIFMNDVFLKRLTAPTITSGGNPPAFSLTPDGKLTAKNADISGSVNANSGTLSNVTIAENCTINGTLRAEKIVGDIVKAASAAFPRQRESSVDWPSGTRTVTVTDDHPFDRQIVVLPLTFRGSKRTVSGRTTYSMCYLKVLMNGAVIYDGAANEAVQVFSRIVDMPAGRGNVILTFTLTSTRHSADIPPDTFASDVQVMVIKKQALGISVV >CP028702|1190582:1240927|1211982_1213908_+|AVZ48349.1|terminase|DBSCAN-SWA MNISNSQVNRLRHFVRAGLRSLFRPEPQTAVEWADANYYLPKESAYQEGRWETLPFQRAIMNAMGSDYIREVNVVKSARVGYSKMLLGVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLALAPWYGKKHRDNTLTMKRFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDDDIEQEGSPTFLGDKRIEGSVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKFGDKETPFGLKWTPDDPSSVFYLCEHNACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEEIEPPDSVTFHIWTAYSPFTTWVQIVKDWMKTKGDTGKRKTFVNTTLGETWEAKIGERPDAEVMAERKEHYSAPVPDRVAYLTAGIDSQLDRYEMRVWGWGPGEESWLIDRQIIMGRHDDEQTLLRVDEAINKTYTRRNGAEMSISRICWDTGGIDPTIVYERSKKHGLFRVIPIKGASVYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYNRFTLTPEGDEPLPGAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKRRNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNKKTLADYARALSGEDE >CP028702|1190582:1240927|1236669_1238247_-|AVZ48372.1|DBSCAN-SWA MLEFSVIERGGYIPAVEKNKAFLRADGWNDYSFVTMFYLTVFDEHGEKCDIGNVKIGFVGQKEEVSTYSLIDKKFSQLPEMFFSLGESIDYYVNLSKLSDGFKHNLLKAIQDLVVWPNRLADIENESVLNTSLLRGVTLSEIHGQFARVLNGLPELSDFHFSFNRKSAPGFSDLTIPFEVTVNSMPSTNIHAFIGRNGCGKTTILNGMIGAITNPENNEYFFSENNRLIESRIPKGYFRSLVSVSFSAFDPFTPPKEQPDPAKGTQYFYIGLKNAASNSLKSLGDLRLEFISAFIGCMRVDRKRQLWLEAIKKLSSDENFSNMELISLISKYEELRRNEPQIQVDDDKFTKLFYDNIQKYLLRMSSGHAIVLFTITRLVDVVGEKSLVLFDEPEVHLHPPLLSAFLRTLSDLLDARNGVAIIATHSPVVLQEVPKSCVWKVLRSREAINIIRPDIETFGENLGVLTREVFLLEVTNSGYHHLLSQSVDSELSYETILKNYNGQIGLEGRTVLKAMIMNRDEGKVQ >CP028702|1190582:1240927|1191630_1191849_-|AVZ48309.1|DBSCAN-SWA MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGGLLKRIRNGKKAKS >CP028702|1190582:1240927|1230922_1233247_+|AVZ48368.1|DBSCAN-SWA MAVKISGVLKDGTGKPVQNCTIQLKARRNSTTVVVNTVGSENPDEAGRYSMDVEYGQYSVILQVDGFPPSHAGTITVYEDSQPGTLNDFLCAMTEDDARPEVLRRLELMVEEVARNASVVAQSTADAKKSAGDASASAAQVAALVTDATDSARAASTSAGQAASSAQEASSGAEAASAKATEAEKSAAAAESSKNAAATSAGAAKTSETNAAASQQSAATSASTAATKASEAATSARDAVASKEAAKSSETNASSSAGRAASSATAAENSARAAKTSETNARSSETAAERSASAAADAKTAAAGSASTASTKATEAAGSAVSASQSKSAAEAAAIRAENSAKRAEDIASAVALEDADTTRKGIVQLSSATNSTSETLAATPKAVKVVMDETNRKAPLDSPALTGTPTAPTALRGTNNTQIANTAFVLAAIADVIDASPDALNTLNELAAALGNDPDFATTMTNALAGKQPKNATLTALAGLSTAKNKLPYFAENDAASLTELTQVGRDILAKNSVADVLEYLGAGENSAFPAGAPIPWPSDIVPSGYVLMQGQAFDKSAYPKLAVAYPSGVLPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASGTDLGTKTTSSFDYGTKTTGSFDYGTKSTNNTGAHAHSLSGSTGAAGAHAHTSGLRMNSSGWSQYGTATITGSLSTVKGTNTQGIAYLSKTDSQGSHSHSLSGTAVSAGAHAHTVGIGAHQHPVVIGAHAHSFSIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA >CP028702|1190582:1240927|1210508_1210715_+|AVZ48345.1|DBSCAN-SWA MNKEQSADDPSVDLIRVKNMLNSTISMSYPDVVIACIEHKVSLEAFRAIEAALVKHDNNMKDYSLVVD >CP028702|1190582:1240927|1199997_1200711_-|AVZ48324.1|DBSCAN-SWA MSTKKKPLTQEQLEDARRLKAIYEKKKNELGLSQESVADKMGMGQSGVGALFNGINALNAYNAALLAKILKVSVEEFSPSIAREIYEMYEAVSMQPSLRSEYEYPVFSHVQAGMFSPELRTFTKGDAERWVSTTKKASDSAFWLEVEGNSMTAPTGSKPSFPDGMLILVDPEQAVEPGDFCIARLGGDEFTFKKLIRDSGQVFLQPLNPQYPMIPCNESCSVVGKVIASQWPEETFG >CP028702|1190582:1240927|1196857_1197058_-|AVZ48320.1|DBSCAN-SWA MTTTIDKNQWCGQFKRCNGCKLQSECMVKPEEMFPVMEDGKYVDKWAIRTTAMIARELGKQNNKAA >CP028702|1190582:1240927|1221813_1224375_+|AVZ48361.1|tail|DBSCAN-SWA MAEPVGDLVVDLSLDAARFDEQMARVRRHFSGTESDAKKTAAVVEQSLSRQALAAQKAGISVGQYKAAMRMLPAQFTDVATQLAGGQSPWLILLQQGGQVKDSFGGMIPMFRGLAGAITLPMVGATSLAVATGALAYAWYQGNSTLSDFNKTLVLSGNQAGLTADRMLVLSRAGQAAGLTFNQTSESLSALVKAGVSGEAQIASISQSVARFSSASGVEVDKVAEAFGKLTTDPTSGLTAMARQFHNVSAEQIAYVAQLQRSGDEAGALQAANEAATKGFDDQTRRLKENMGTLETWADRTARAFKSMWDAVLDIGRPDTAQEMLIKAEAAYKKADDIWNLRKDDYFVNDEARARYWDDREKARLALEAARKKAEQQTQQDKNAQQQSDTEASRLKYTEEAQKAYERLQTPLEKYTARQEELNKALKDGKILQADYNTLMAAAKKDYEATLKKPKQSSVKVSAGDRQEDSAHAALLTLQAELRTLEKHAGANEKISQQRRDLWKAESQFAVLEEAAQRRQLSAQEKSLLAHKDETLEYKRQLAALGDKVTYQERLNALAQQADKFAQQQRAKRAAIDAKSRGLTDRQAEREATEQRLKEQYGDNPLALNNVMSEQKKTWAAEDQLRGNWMAGLKSGWSEWEESATDSMSQVKSAATQTFDGIAQNMAAMLTGSEQNWRSFTRSVLSMMTEILLKQAMVGIVGSIGSAIGGAVGGGASASGGTAIQAAAAKFHFATGGFTGTGGKYEPAGIVHRGEFVFTKEATSRIGVGNLYRLMRGYATGGYVGTPGSMADSRSQASGTFEQNNHVVINNDGTNGQIGPAALKAVYDMARKGARDEIQTQMRDGGLFSGGGR >CP028702|1190582:1240927|1224371_1224701_+|AVZ48362.1|tail|DBSCAN-SWA MKTFRWKVKPGMDVASVPSVRKVRFGDGYSQRAPAGLNANLKTYSVTLSVPREEATVLESFLEEHGGWKSFLWTPPYEWRQIKVTCAKWSSRVSMLRVEFSAEFEQVVN >CP028702|1190582:1240927|1221386_1221821_+|AVZ48360.1|tail|DBSCAN-SWA MFDGELSFALKLAREMGRPDWRAMLAGMSSTEYADWHRFYSTHYFHDVLLDMHFSGLTYTVLSLFFSDPDMHPLDFSLLNRREADEEPEDDVLMQKAAGLAGGVRFGPDGNEVIPASPDVADMTEDDVMLMTVSEGIAGGVRYG >CP028702|1190582:1240927|1239102_1239579_-|AVZ48374.1|DBSCAN-SWA MKLISNDLRDGDKLPHRHVFNGMGYDGDNISPHLAWDDVPAGTKSFVVTCYDPDAPTGSGWWHWVVVNLPADTRVLPQGFGSGLVAMPDGVLQTRTDFGKTGYDGAAPPKGETHRYIFTVHALDIERIDVDEGASGAMVGFNVHFHSLASASITAMFS >CP028702|1190582:1240927|1203833_1204724_+|AVZ48331.1|DBSCAN-SWA MGRQGCMINVVSFSGGRTSAYLLWLMEQKRRAGKDVHYVFMDTGCEHPMTYRFVREVVKFWDIPLTVLQVDINPELGQPNGYTVWEPKDIQTRMPVLKPFIDMVKKYGTPYVGGAFCTDRLKLVPFTKYCDDHFGRGNYTTWIGIRADEPKRLKPKPGIRYLAELSDFEKEDILAWWKQQPFDLQIPEHLGNCIFCIKKSTQKIGLACKDEEGLQRVFNEVITGSHVRDGHRETPKEIMYRGRMSLDGIAKMYSENDYQALYQDMVRAKRFDTGSCSESCEIFGGQLDFDFGREAA >CP028702|1190582:1240927|1226045_1226717_+|AVZ48365.1|tail|DBSCAN-SWA MAATHTLPLASPGMARICLYGDLQRFGRRIDLRVKTGAEAIRALATQLPAFRQKLSDGWYQVRIAGRDVSTSGLTAQLHETLPDGAVIHIVPRVAGAKSGGVFQIVLGAAAIAGSFFTAGATLAAWGAAIGAGGMTGILFSLGASMVLGGVAQMLAPKARTPRIQTTDNGKQNTYFSSLDNMVAQGNVLPVLYGEMRVGSRVVSQEISTADEGDGGQVVVIGR >CP028702|1190582:1240927|1203050_1203341_+|AVZ48329.1|DBSCAN-SWA MTGKEAIIHYLGTHNSFCAPDVAALTGATVTSINQAAAKMARAGLLVIEGKVWRTVYYRFATREEREGKMSTNLVFKECRQSAAMKRVLAVYGVKR >CP028702|1190582:1240927|1239008_1239188_-|AVZ48373.1|DBSCAN-SWA MKVPAARWSGLTFISTLWQAPRLLRCLVNHSARWRNAIWYHLKVLKTTFCLFTFPFRSS >CP028702|1190582:1240927|1207956_1208280_+|AVZ48340.1|holin|DBSCAN-SWA MKMPEKHDLLAAILAAKEQGIGAILAFAMAYLRGRYNGGAFTKTVIDATMCAIIAWFIRDLLDFAGLSSNLAYITSVFIGYIGTDSIGSLIKRFAAKKAGVEDGRNQ >CP028702|1190582:1240927|1230237_1230858_+|AVZ48367.1|DBSCAN-SWA MRNVCIAVAVFAALAVTVTPARAEGGHGTFTVGYFQVKPGTLPSLSGGDTGVSHLKGINVKYRYELTDSVGVMASLGFAASKKSSTVMTGEDTFHYESLRGRYVSVMAGPVLQISKQVSAYAMAGVAHSRWSGSTMDYRKTEITPGYMKETTTARDESAMRHTSVAWSAGIQINPAASVVVDIAYEGSGSGDWRTDGFIVGVGYKF >CP028702|1190582:1240927|1205810_1206017_+|AVZ48336.1|DBSCAN-SWA MTFSVKTIPDMLVEAYGNQTEVARRLKCSRGTVRKYVDDKDGKMHAIVNDVLMVHRGWSERDALLRKN >CP028702|1190582:1240927|1197807_1198131_-|AVZ51611.1|DBSCAN-SWA MDAQTRRRERRAEKQAQWKAANPLLVGVSAKPVNRPILSLNRKPKSRVESALNPIDLTVLAEYHKQIESNLQRIERKNQRTWYSKPGERGITCSGRQKIKGKSIPLI >CP028702|1190582:1240927|1204720_1204894_+|AVZ48332.1|DBSCAN-SWA MMRCYRCGECKEDNRFRPNQPYWNRWCLRCERTPTGVLPLPQEKEDVWRDSDEVSPT >CP028702|1190582:1240927|1193775_1193967_-|AVZ48314.1|DBSCAN-SWA MHKASSVELRTSIEMAHSLAQIGIRFVPIPVETDEEFHTLAASLSQKLEMMVAKAEADERDQV >CP028702|1190582:1240927|1219248_1219827_+|AVZ48357.1|tail|DBSCAN-SWA MAIKGLEQAVENLSRISKTAVPGAAAMAINRVASSAISQSASQVARETKVRRKLVKERARLKRATVKNPQARIKVNRGDLPVIKLGNARVVLSRRRRRKKGQRSSLKGGGSVLVVGNRRIPGAFIQQLKNGRWHVMQRVAGKNRYPIDVVKIPMAVPLTTAFKQNIERIRRERLPKELGYALQHQLRMVIKR >CP028702|1190582:1240927|1193162_1193384_-|AVZ48313.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLVVQGCRTCASCQEDLELISKQRGSK >CP028702|1190582:1240927|1194795_1195581_-|AVZ48316.1|DBSCAN-SWA MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKAAEQKVAA |
71 | Enterobacteria_phage(75.0%) | integrase,head,holin,lysis,terminase,tail,capsid,portal | attL 1188860:1188875|attR 1213660:1213675 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1632867 : 1647248
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP028702|1632867:1647248|DBSCAN-SWA TTCACAACGCTACTTTGCTCCATCCTTTACCTCGATCATCATGATAACGATCGGTTTGTTGTTGTGTTTTATGACCAAGTAGTTTTTGTGTGTCTAACCCCTGTTCTTTATACAGACGTTCAGATAAAGACCTTTGCTCATGGAATGTCGCAGGTGAACCCTCTCCCCAGTCAATTCTTGCTAAATCTCTCGCTTTACTAAAATTCATCGTCAATGTATTGGCTTTAACCTGCGCTCCGCGCTCTGCTTGTGAAGTTGAACGAAAAAAATGCACTAAGTATGCACTGACTGCATAGTCACGGCAGCGGGCTACTACATCGCGTAAACTCCAGTTAATCGCATTGAGGCGAAGAGAAAGAGGAATTGCGATTTTGCTCCCGGTCTTTTCCTGAATGACATGAAGATGATCATCCCAAATATCGCTAAATTTCATACGCGAAATATCACCTAACCGCTGACCAGTAACCAGCGCTAACAGCATGGCATTTCCCATGTAACGATGAGTAGCGTCTGCGATATCGAAGATTTTTTTCCATTCTTCAAGGCTCAGCCGTTGTCGGGTAATTTTTCTTCTTGGTTGTTTAGTGGCTAATGCTGGGTTATAGCCAGGAGGTACTTCTCCGTAGTGCTGCGCCTCTTTGAAAACATCAATCAGGACGGAGCGAACTACTTGTGCCATTCTTGGCCGCCCAGCGGCGATATACTCATCAAGCAATTGTGCTATATCTCTGACATCAACGGCTGAGATCAACTTCATTCCTGCTCGTTCTCTGAGCAAGGATACTGGTTTAGCTTTTTGTTTATAGGTGTTGAGTCTTATATCACCACTTTTAAGCCTGTCATCCTGGATCGCTTGATAGCGATCTAACCAGGTTGACGTTGTGATAGCCTTTCCTTTGCTGGTTGCGATCCTGTCACTGATAGCCAGAATCTGCCGGGTTCTTTGTTCAGCCAGGCGAGTGTTGGCCTCAGTGGCAATAGCGATAGCTTCAGCTTCGTTTGTTCCCAAAGCATGGAATTTTCCTGTCACTGGATGCTTATACCGCCAATAGACTTTATTTACCTTCCTACTATAAAGCGGATATAAGTTAGGGACTGAAACATTATTCTTACGCGGTCTGGCTGCCATTACTCAAAATCCGTTGCAAAAGTAATGAGTCATTTTTCTTGATTACAGGTGTTACCAACTCCCCAACTAACTCGGCGTCCTCACGCACTCGCCATAACCGGCCTTGTTTCATGGCCGGTGGACAAAATAAATTCTGCTTAGCATAACGACGCAATGTGGACACACTTGGAGGATTACTTCTGTATTTTTCAGCAGCCCATTCTTCAAGAGTTAACATTTGAAGCATATGCGATCACCTTATTACTACACTAACTGCTTAGTCTCAGCATATCGACCCTGCACGGTCGGTTAGTTTCTCCACAAAACAGAGAAGAGCACCTGTGGCCACAGCTATCAGGATGGGTCGGGTTATTAACCCGTCATCCGGGGATACTCTTCTCTGTTTTGTAAAAAGGGCGGTACCAGAAAGGACTAAGGAAAAAACTGGTACCGCCAAGACTACACACAGCATAAAGTTGTGGTGTCGGGTGCCCCCGGTGCCTGGCGAAGGTTGCACACCAGGCGGGTGGGTATCCACAGAAGGTCGATTGTCAGCCTCAACCTTAACCCGCGTGCGCTGAGCCGCATTCACCACAACGCTAAGGATTCTCTCTGGTTGAAAATACTTAGCTGTTATGTGCCTGTCTTTTCACCACTTCAGGCTCGGTGGTATGCTGGAGTTCTCACACAGCCAGCAAGCAAGGAAACTTAATGAACCAGTTTTATGTTCACGTTCGTCTATTTGAAGACACAGCCGAACAGACCAAAAAATTTGAAGAATTAATGCTTAACTTTCTGTACCAGAAAACAGTTAAAGAGTCTGACGATAGCTGCTGCAGACTGATTCCAGAGGGATATATCCTCAAAAGTACAATGAACTGCCAACAAATCCTTGATCAAACATTTTCAATTGCTAACAGTGCCGGTGTTGACGCAAATATATTTGTCTGTAAATTTGAACAAAGCGCATGCTTACTTCCGTCTGCTTCCTTAGTTGGTAACGATTTCGTTCATTACGATCTTACGCCTAAGCCCATCAAGCTCGATTCTTAAAGCCTTAACCATTGTGTCGTGATAAACACGGCTCACCTTCTCTCCATTGCATGGCAGAGGGGTGAGTGTGTTAGCCATGAAATTCATGAACTCGGTTCGACCAGGGGCTTGCGCCCCGCAAGTCTTTAATGCCTGTTTTGCTAACAAAATGCGGGCCTCAGTGCCTGCATTTGGCTCTATCTGCTGCAAACGTTTAGCGTCTTCCAGCAACAATGCGATCACATGCTTCAAATTCTGCTCATTCATCTATTCTCTCCACTGAAATCATCCGCTAACGAATCATCCCGGTCTTCGTACGTACCGGGCGGGCTACTTCGTGGGCGTCCTGCCTGTTTGTTGTTTCTCTTGGGTACATTATGTATCTCAAAGGTACATTGTCAAGTATAAAAAAACCTGCCGAAGCAGGTTCATAAACATTGATTAGGCTTTGATTTTGTATCTTCTTGGTTTTCCTGAGAAAATCACAGTACCAATTATAGAGCAATTACCGTTGATCTTAATGTAAGGCTCAGGCCAGTTTGGGTTTAACGCTTTGAGATAACGCTGTGTCCCATCTTCTATCAACCTTTTGAAGGTGGTTTCACCTGTATCGTGCATCAATGCAATAACGTCGTCACCGTGGCAGGCAGGTACTTCAGGATCGACAAAAATCATGTCTCCCGGGCGGTACTCATCAATCATTGAATCACCTATTACCCGCAAGATATAAGTCATTTCCCCACAGGGTACAGGGCAGGGATACGTTTCTGCTGTGCTCAAATCAACCTCAGAATATCCAACTTCTTTCCATGCTCCGGCCTGTACCCATGATATGACAGGGACTAATGTGATTTGTTTATTAGTGATTGAAACATCAGGTTTTTTTGTGATGTTCGTTGTCTGGTGTTCTTGATCGAGCCATCCTACAGGCAGGTCGAAACATTTTTCGATGTGTCGTGCCATGCTGTCACCGATATTTTTAGTAGCACCATCTCCCATAAACCTGCTGGTCTGGGTTGGCTCGCGATCAATCATAGTGGCAAAGGAAGAATTCCCGCCAACACCATCTCTCAGTTTTCTGGCGTTAGACCGCCGGATGTCATGGATTGTTTTCATAACGAAATTAAAACCCTTGTACCGTTAAGGTACAAGTATCTTGAAGGTTCATTTCAATCATGTAATATGTACACCGGAGGTACATATTGTATGAAAGCGTATTGGGACTCTTTAACCAAAGAACAGCAGGGCGAGTTGGCCGGAAAAGTTGGCTCAACACCTGGCTACTTACGGCTGGTTTTCAATGGCTATAAAAAAGCCAGTTTTGTGCTGGCTAAAAAACTTGAGCAATACACATCAGGTGCAATTACGAAATCTGACTTAAGACCGGATATCTATCCGAAAGATTAGCAGAACACTTTCAATTTTTAACCACAGAACGATGAGGCTAATCGTGGGTAAGCATCACTGGAAAATAGAAAAACAGCCTGAGTGGTACGTGAAAGCTGTCAGAAAAACTATCGCGGCGTTGCCGAGTGGTTACGCTGAAGCGGCTGACTGGCTCGATGTAACAGAAAACGCTTTATTCAACCGCCTTCGTGCAGATGGCGATCAGATTTTCCCGCTGGGATGGGCAATGGTTTTACAGCGTGCTGGTGGCACTCACTTCATTGCTGATGCTGTGGCGCAGTCTGCAAATGGCGTCTTTGTGTCTCTTCCTGACGTCGAGGATGTGGACAACGCCGATATTAACCAGCGTCTGCTGGAAGTCATTGAACAGATCGGCAGTTATTCAAAACAGATTCGTTCAGCAATCGAAGACGGTGTAGTGGAACCGCATGAGAAGACAGCAATTAACGACGAGCTGTATCTCTCAATTTCGAAGCTGCAGGAGCATGCAGCACTTGTCTACAAAATTTTTTGCATTTCAGAAAGTAATGACGCCCGCGAGTGTGCAGCTCCGGGCGTCGTGGCGTCGATTGCTTCTGGTTGTGGAGAAACTAACGCATGAACAGTTTAACAACACACTACCGTCGCTCGCAACTGATTGCGCTTCCTGTACCGGGTGGAAAAGCGAAGGTGGAATATTGCTATGCAGTGAATGTACCAGGTGACAGGGAAATTGTAACCCACAGCTTTGCAGAGTGGGCTGTGGGTGATTTCAACCGGCAGAAGGAGACAGTCCTTTGCGACAAGTTAACCGCTGGTTCAAAGATCACTACGGAGTGCCCGTCAGAGTCATTCGTTGGGAGCCGGAAACACAACGGGTTATCTACCTCCGCGAAGGTTATGAGCATGAATGCTTCAGCCCGCTCGAACAGTTTCGTCGTAAATTCAGGGAAATAGAGGTCGGTCATGAGCACTAAATTAACCGGCTATGTATGGGATGGTTGCGCTGCATCAGGCATGAAGTTATCCAGCGTGGCAATTATGGCCCGCCTGGCTGATTTCAGTAATGACGAAGGTGTGTGCTGGCCATCAATTGAAACCATTGCCCGTCAGATTGGCGCGGGGATGAGTACCGTCAGAACGGCTATCGCACGGCTGGAAGCAGAAGGCTGGTTAACGCGTAAGGCGCGTCGCCAGGGTGATGGTTCATCACCCCACTGTGCCGTGGTGGATGAATATCACGAGCACGCCACAGATGCGCTTTACACCACGATGCTTACCGGGATGGGGGCGCGACGCCAGCCACTGATGTGGGCCATTACCACCGCCGGGTACAACATTGAGGGGCCGTGCTACGACAAACGGCGGGAAGTCATCGAGATGCTCAACGGCTCGGTGCCAAACGATGAACTGTTCGGGATCATCTATACCGTTGATGAAGGTGACGACTGGACCGACCCGCAGGTGCTGGAAAAAGCCAATCCAAATATTGGCGTGTCGGTTTATCGCGAATTTTTGTTAAGTCAGCAGCAGCGTGCGAAAAATAACGCCCGTCTGGCAAACGTCTTTAAAACAAAACACCTCAATATCTGGGCGTCGGCGCGTTCGGCGTATTTCAACCTGGTGAGCTGGCAGAGCTGCGAGGATAAATCACTGACCCTTGAGCAGTTCGAGGGGCAGCCGTGCATTCTGGCCTTTGACCTGGCGCGTAAGCTGGATATGAACAGCATGGCGCGACTTTATACCCGCGAGATTGACGGTAAAACGCATTACTACAGTGTGGCCCCGCGTTTCTGGGTACCGTATGACACGGTGTACAGCGTCGAGAAAAATGAAGATCGCCGGACAGCCGAACGCTTTCAGAAATGGGTGGAAATGGGCGTTCTGACCGTTACCGATGGTGCGGAGGTGGATTATCGCTACATCCTCGAAGAGGCCAAAGCGGCGAACAAAATCAGCCCGGTCAGTGAGTCACCCATCGACCCCTTCGGGGCGACCGGGCTGTCACATGACCTTGCTGATGAAGACCTGAACCCCGTCACCATCATTCAGAACTACACCAACATGTCCGATCCGATGAAAGAGCTGGAAGCGGCGATTGAATCGGGGCGCTTTCATCATGACGGCAATCCCATCATGACCTGGTGTATCGGCAACGTGGTCGGCAAAACCATTCCGGGTAACGATGATGTGGTGAAGCCCGTCAAGGAGCAGGCGGAAAACAAAATCGATGGTGCAGTTGCGCTGATTATGGCGGTTGGCAGAGCCATGCTGTACGAGAAAGAAGACACGCTGTCTGATCACATTGAGTCCTACGGGATCCGCTCGCTTTAACTGAGGTAATTATGATCATGCTGATTCTCGCGCCTCTGGTGGGCGTGCTGGGTGCGCTTTTGCTGGCGTATGGTGCCTGGCTGATTTATCCCCCGGCGGGTTTTGTTGTTGCCGGGGCGCTGTGCCTGTTCTGGTCGTGGCTGGTGGCGCGATATCTCGACCGTACACAGTCGTCTGTCGGCGGAGGTAAATAGTGTTCTTTTCGGGATTATTTCAACGAAAAAGTGACGCACCGGTGACCACGCCAGCAGAGCTGGCGGATGCCATCGGGCTGTCGTATGACACCTATACCGGAAAGCAGATCAGCAGTCAGCGGGCTATGCGACTGACGGCGGTTTTTTCCTGCGTCAGAGTGCTGGCAGAGTCGGTCGGGATGTTGCCCTGCAATCTGTATCACCTGAACGGCAGCCTGAAGCAGAGAGCCACCGGCGAACGTCTGCATAAACTGATCTCCACGCATCCCAATGGCTATATGACGCCGCAGGAGTTCTGGGAGCTGGTGGTCACCTGTCTGTGCCTGAGGGGAAACTTTTACGCCTACAAAGTGAAAGCATTTGGCGAAGTGGCTGAACTGCTGCCCGTCGATCCCGGCTGTGTGGTATATGCGCTGGGAAGGTGTCAGCGATGGCCTGAAGGTGACCGCCGGGAGTGTTATTCAGCGCGATGACCTGGTGCAGTACACGACAACTGACGATGCAACCAGCTCCGGTGGTGTCCTGCGCGTGCCGATCGCCTGCTCAAGTGCAGGTGCGGTCGGTAACGCTGACGACGGTACGGCATTAATCCTGGTCACGCCGGTGAATGGTCTGCCGTCTTCCGGTGTGGCTGACACCCTGACAGGCGGATTTGATACTGAAGAGCTGGAAACGTGGCGCGCCCGCGTCATTGAGCGGTATTACTGGACGCCGCAGGGCGGGGCTGACGGGGACTATGTCGTCTGGGCTAAAGAAGTGCCCGGCATTACCCGCGCATGGACATACCGTCACTTGATGGGAACGGGAACTGTCGGTGTGATGATTGCCAGCAGTGACCTGATTAATCCCATTCCGGAAGAATCAACGGAAACGGCGGCAAGACAACATATCGGGCCACTGGCCCCGGTGGCAGGCTCTGATTTGTATGTGTTCAGGCCGGTGGCACATACGGTGGATTTTCATATCCGCGTGACGCCGGACACACCAGAAATACGGGCTGCCATTACCGCGGAGTTGCGTTCGTTCCTGCTGCGTGATGGTTATCCGCAGGGAGAACTCAAGGTATCGCGTATCAGTGAGGCGATTTCCGGTGCGAACGGGGAATACAGCCATCAGTTGCTTGCACCGGTGGACAATATCTCCATTGCGAAAAACGAACTGGCGGTACTGGGGACGATTTCATGGACGTGACAAACGATGATTACATCCGCCTGTTATCGGCACTGTTGCCGCCCGGTCCGGTGTGGTCAGCCAGCGATCCGGCGATTGCCGGTGCGGCACCGTCATTAACCCGTGTTCATCAGCGTGCGGATGCCCTGATGCGGGAGCTGGATCCGCGCACCACCACTGAACTGATAAACCGCTGGGAGCGTCTGTGCGGTCTGCCGGATGAATGTATTCCGGCGGGAACGCAGACCCTTCGCCAGCGTCAGCAACGGCTGGATGCGAAGGTTAACCTGGCGGGCGGCATCAACGAGGATTTTTATCTTGCACAGCTTGCTGCCCTGGGCAGACCAGATGCCACCATCACGCGATACGACAAAAGCACTTTCACCTGCTCATCGGCCTGTACTGACGCGGTGAATGCGCCGGAATGGCGGTATTACTGGCAGGTCAACATGCCAGCCACCACCAACTCCACCTGGATGACATGTGGCGATCCCTGTGATTCCGCACTGCGTATCTGGGGTGACACCGTTGTCGAGTGTGTGCTTAACAAACTCTGCCCGTCGCATACCTACGTAATTTTTAAATATCCGGAGTAATCCATGCATCGTATAGACACGAAAACCGCGCAGAAGGATAAGTTCGGCGCGGGTAAGAACGGTTTTACCCGTGGTAACCCCCAGACCGGCACGCCTGCCACCGATCTGGATGATGACTACTTTGACATGTTGCAGGAGGAACTTTGCAGCGTGGTGGAGGCATCCGGTGCCAGCCTGGAGAAGGGGCGGCACGACCAGTTACTTACCGCACTTCGCGCGCTGCTGTTAAGCCGCAAGAATCCGTTTGGCGATATCAAATCGGATGGCACTGTGCAAACGGCTCTCGAAAACCTTGGTTTGGGAGAAGGAGCAAAACTCAATGCAGCAACGGCTACATTAGGACGCACCGGTTTCATAGCTATACCGGTTATGATTGGTGGTATTGAGCAATCAGTAATCATTCAGTGGGGGTGGAATGCCGCAAAAGCATCTGCCTCTGGGGGGGATGGAAATACAGTTGTATTCCCGGTTGCGTTTAATAATGCCTGTGTTGCCGTTGTTGCAAATTATGACAATGTCAGCGCACCTATCAATGCAGTGGCAACGGGGGGATATACAACCACTTCGTTTTTATTACGGTGCGCAGCTCAAACGGGTAGTTATTACTATAACTGGATTGCTATTGGGTATTAAGATGAAAATATACTGTTGCTTAAATACCGTTGGTTTTTTTATGGATGGCTGTGGCGTCATTCCGCCAGATTCTAAAGAAATAACGGCAGAACACTGGCAGTCATTATTAAAATCTCAAGCTGAAGGAGGCGTGATCGATTTTTCTGTTTTTCCTCCTTCTATTAAAGAGGTTATCCGTACTCATGATGATGAAGTCGCAGATGCGAACTTTCAAAAGCAGATGCTTATCTCTGATGCAACTGATTTTATCAATAGCAGACAGTGGCAGGGTAAGGCTGCATTGGGAAGACTTAAAGAAGATGAGCTGAAACAATATAATTTGTGGCTGGATTATCTGGAAGCACTGGAACTGGTTGATACATCCAGTGCGCCAGATATTGAATGGCCTACGCCTCCGGCAGTTCAGGCCAGATGACATCCGGCGCGGTGCTGGTATCTGTTGCCGTCACCGCGTCAATGTAATCCAGCACAGCGTTAAGTCTGGTTGTTTCTGCCTGCGTCAGTTTACGTCCGGCCTGCAATTTCAGTTGAATCAGACTAATGGAAGCCATTGCAGCATCAATCAGTGACTGGCGCTGTGCTTCTGCCGCGTCTACTGCGGCGCTATGCTGTGCTTCAGTATCGGTCACCCATTTCTCACCATCCCATTTATCGTATGGAGATAAAGGGGCGATAGTGGTTGTATTTTCAGGGTAATCACCCGGAGCTTTGATTTCTTTTGATTCTCCAGTTTTGGTGCTATAGACCGTTTCACCGCGATGGTCTGGCACATATTCCCATGAGTTAAAATCTGCAGAACGGCAGATTGCATAACCAGCCTTATGTGTACCAGGGGCATCTAAACAGGAACATGCCGGAATGCCGACACCAACGGCAAGATATTCATTTGAAGTGGAAATATATTCCCGTGTTTCACCATCGTAGTTATAAACGGTAACATCCCCTGCCTTTGTTGCAATAAGGTCACTATTTAATATTGCTTTATGCATCAGGCTGCCCTCACGATATAGTTAAATGCAATATTACGCGGACGCGTTTCTGAGGCTGCGGCACCTAAACCATCCACTGATTGTTTATATGTTTTAAAGGTTCCATAATCCGGGGCTGGTAATCCGGCATCGTTTGTGTTTCCTCTTTTGATAATGTCAGTGCCACTATTTACCCATATTTCATCAAAATAGAAATTAATCGTTGCATCAGTCACAATCGTGGATCTTGACGGTAATCCATGAGCATGATCCTCCGTTGCATACCCCTGAATACTTAAAATAGAGCGACCTGTATCAATCCCCCGCCCGTCATCCCAGCCACGAATAAACTCACCACGTAAATCAGGCAATTTATTTGTCGGATAAGCCTTTGCCAGTTCCGGGTATTCTTCAGCAGAAAAAGCGGCACCATTGCATTTCAGCCAGCCTGTTGGCGGAGTGGCTGAAGGCCACGGAACCGGGACACCAACAGGTAATGCAGAGCCTTCTCCCAAACCAACGTTTATGAAAATGAAGAAATAACAAGCAAATGGCATCATTCCTGCTTTTACCAGGGGGATTTAACATGCTTATTGGCTATGTACGCGTATCAACAAATGACCAGAACACAGATCTACAACGTAATGCGCTGAACTGTGCAGGATGCGAGCTGATTTTTGAAGACAAGATAAGCGGCACAAAGTCCGAAAGGCCGGGACTGAAAAAACTGCTCAGGACATTATCGGCAGGTGACACTCTGGTTGTCTGGAAGCTGGATCGGCTGGGGCGTAGTATGCGGCATCTTGTCGTGCTGGTGGAGGAGTTGCGCGAACGAGGCATCAACTTTCGTAGTCTGACGGATTCAATTGATACCAGCACACCAATGGGACGCTTTTTCTTTCATGTGATGGGTGCCCTGGCTGAAATGGAGCGTGAACTGATTGTTGAACGAACAAAAGCTGGACTGGAAACTGCTCGTGCACAGGGACGAATTGGTGGACGTCGTCCCAAACTTACACCAGAACAATGGGCACAAGCTGGACGATTAATTGCAGCAGGAACTCCTCGCCAGAAGGTGGCGATTATCTATGATGTTGGTGTGTCAACTTTGTATAAGAGGTTTCCTGCAGGGGATAAATAAAGTTAAAGACACTTTGTGTACAAAAGAAAGTAAAACAACAGCAACTTGTTGCAATTTTATCAATAAAAGTAGTATTGTCGTGAAAAATTGATTAAAGATTAATATTATGCATGTTTTTGATAATAATGGAATTGAACTGAAAGCTGAGTGTTCGATAGGTGAAGAGGATGGTGTTTATGGTCTAATCCTTGAGTCGTGGGGGCCGGGTGACAGAAACAAAGATTACAATATCGCTCTTGATTATATCATTGAACGGTTGGTTGATTCTGGTGTATCCCAAGTCGTAGTATATCTGGCGTCATCATCAGTCAGAAAACATATGCATTCTTTGGATGAAAGAAAAATCCATCCTGGTGAATATTTTACTTTGATTGGTAATAGCCCCCGCGATATACGCTTGAAGATGTGTGGTTATCAGGCTTATTTTAGTCGTACGGGGAGAAAGGAAATTCCTTCCGGCAATAGAACGAAACGAATATTGATAAATGTTCCAGGTATTTATAGTGACAGTTTTTGGGCGTCTATAATACGTGGAGAACTATCAGAGCTTTCACAGCCTACAGATGATGAATCGCTTCTGAATATGAGGGTTAGTAAATTAATTAAGAAAACGTTGAGTCAACCCGAGGGCTCCAGGAAACCAGTTGAGGTAGAAAGACTACAAAAAGTTTATGTCCGAGACCCGATGGTAAAAGCTTGGATTTTACAGCAAAGTAAAGGTATATGTGAAAACTGTGGTAAAAATGCTCCGTTTTATTTAAATGATGGAAACCCATATTTGGAAGTACATCATGTAATTCCCCTGTCTTCAGGTGGTGCTGATACAACAGATAACTGTGTTGCCCTTTGTCCGAATTGCCATAGAGAATTGCACTATAGTAAAAATGCAAAAGAACTAATCGAGATGCTTTACGTTAATATAAACCGATTACAGAAATAAAATTATTTATTAAAGTCACATTTAAGACGTAATACCCTACAGGGTAAAAATTTTCTCTGATCTTAACTTCTGCAAATGTTAACTGCTATTTTTATGCTAAAAATGGTTATCAAAACTCAAAAACACATGTTTATAATCAATGAGTTATAGAAATGCTAAGGGCTAATGAGTTATATGCAAATTAGTAAAATTATGTTGCTATGTCAGATAGTTACGATTTAGTCATCTAACTAATGCTGCGCCATATGGGTTGGACTGAAGCGGCTGACCTGATTGTTAAAGGTATGGAAGGCGCAATCAATGCCAAGACCGTAACTTATGACTTCGAACGTCTGATGGAAGGCGCTAAGCTGCTGAAATGTTCAGAGTTTGGTGAAGCGATCATCGAAAACATGTAATCTCTCCATGTGTTAAATATTGAAACGGGCGTATAACACGCCCGTTGTTTTATTTATGTGGATATTATTAATAGCATATCGAGCATATTTATATGAAGCCCATTACTTGAGCCCATATGGGCATATTTTTATAATGCAACTATTATGTAAACATTTATTTGTTATTTTGCTTTCTCCTGGAGGACACTCTTGACTGCTTTTGAGTAAACTCCATAAATCCTTGTTGAATGGTGCGATGTGATAAATAGTAATAGGATATTCTTTATCCTTAAGGATAATACCAGACTTAACCGGTGTAAATATACTGCCAGGAGGGAGAAATATAGTAGATTGATACCAGATGATCATTTTCATATTACCCCATATGGCTGAAAAAGATATACCACATGTAGGTTGAATTACCGTGTCAATTACTATCCACTTCATTTGTTATGTCTTATCCCACGGTATTTAATATGGTTCATTAGGATGTTTATTTCTTGATTTTGCATATGAGTATATTACCCCCCCCTCAAAAAAATAAATTAATTAAAATGATGGCTTATATAAAATAAAATTTAAAGCAAGGAATCTCAATGGATGTTAAACAAAATGAGATTTTGTGAAAGCAATAAATTATTGACTTCGTTTTAGATTTGTTTAGCTATAATGTTATACATTCAAATGACTGAACATCCTGTAATTAAAACATAGCCTTTATGCTACTTTGTGCCAATTTGCTAAACATTATGGTTGCCTTTTTATATAACGATAATAATGAATATAAGCATGACATGAGAATAAGGTTTCAATTTTTGAGTTATATAGGAATGATTTAACCTGTTCCTGGCTAAAATACATATAACCGGATGATGACTAAACCAAAATACATGTGCGTTAAGTATTGAAACGGACGTGTGGCACGGCCGTTGTTTTTATAAATATGTTAACCGTTATAAAATAACGTATCAAAAGTCAAGTGATCACATTTCAAATATCAAGTTGATAGTATTAGTCTGGTGATTATTTATGGGTGACAATAAAAAGACAGTATTAATCATCCATAGAGATAGTCTCTGCACTTTTATTTCCATTATGCTAATGCCTTACTGAATTATGAAGCATTTCTTAAGTATCCAACTTTAGCTAGATTAATGGTTTATTATTTTCTACATCTTCAATATATAAAAGCGTATTATCAATGGCGTAGTAACTGCGTTTGTTATGATTAACATCAGTAACCCACCGGAAAACGCCCGCGCCTGCCAGTGTTGAACAGTATTCCCGAAATGTAGATTTTCCGCAAATATGAAGCAATGCGGCCTCTTTTATTTTAGCAGGGTTCTTGGTCGTACTAACTTTTAACAGGTTCCTGGTTCCTCTTAATAACAAAACCGTATCATCGTGAGTAATAATTCTGATGTTATCCGTAGCCAGATAATAAATGTAATGTGCAATACGGTGATGTTTTAATTCTGAATAAAACCAGGAGAAGTTTTGCTCTTTTCTCACTTGCTCAAACATCTTTTGAAAAACAACGACCTGATCCATCAGGATAATAACCTCTTGTTAGTTGTGAGACTGCGTAGTGTGCACGATCGGTTTTACCACTTCAATCTGGTCTGTCCTTTGGCTGTGATATGTACAGAGTGTGATAGAGGGAATATCTGAATTCTCCCGGTGAGCATTTTGCAACGGACCAGCTCCGGTACAAACGCTGTTGTGGGTTCAGATTATAACATTCTGTCTAAGGGGCGGGATAAAGGTGAAATTAGGGGGCATGAAAGATGACTTTATAACCTTGCTCACCCCAGTGTTGTAAAAGTTCGTTTTGCCTTCTCGTTGGTGCCATGCCTGTCCAGACAATCAATGTTTGCGTCGGGAACAGTTCCGGGCGCGGCGATTCAATGGGATCAGCAAGAACAGAAATGTGCCATCCTGACAATGATAAACGCCATGCTTCCAGCCACAATCGTGCTCTCTCTTCAACATTCCACGCCATCAGAATGGCTTCTTTACCGGGCTTACGGCGCATTTCGAAAAGCGAGGTTGCTGCGTATTCAATTAATGCGCCGTCAAACATACTGCTCATAATGCGGGAGGTGTTGTGATCAAGCACGAGACGCTGGCGAACAGGAAGGTAAACATGATTAATCAATTGATCTACTGGGTACTCTCGACCCAGTGAAATAATTCTCGCGCGTAGTTTGGCAGGATTAGCCATGCGAAGAATTGACATCATCTCTTCTTGCAGGCGGCTCCAGTCATCTTCCGTATCCTGGCTGGTGGTTTCCAGTAATGCTTTAACTTTGCCTACAGGGACGCCATTACTTATCCAACGCTTGATCTCTTCGATGCGTTGTATGTCTTCTTCATCAAAGAGTCGGTGTCCGCCTTCACTGCGCTGTGGTTTTAACAAACCGTAGCGGCGTTGCCAGGCCCGGAGAGTGACAGGATTAATCCCGCAACGTTCAGCAACATCACCAATGCTGTAATAAGCCAC
Protein sequences of DBSCAN-SWA_3 >CP028702|1632867:1647248|1634685_1635027_+|AVZ48736.1|DBSCAN-SWA MNQFYVHVRLFEDTAEQTKKFEELMLNFLYQKTVKESDDSCCRLIPEGYILKSTMNCQQILDQTFSIANSAGVDANIFVCKFEQSACLLPSASLVGNDFVHYDLTPKPIKLDS >CP028702|1632867:1647248|1633975_1634221_-|AVZ48734.1|DBSCAN-SWA MLQMLTLEEWAAEKYRSNPPSVSTLRRYAKQNLFCPPAMKQGRLWRVREDAELVGELVTPVIKKNDSLLLQRILSNGSQTA >CP028702|1632867:1647248|1640101_1640686_+|AVZ48746.1|DBSCAN-SWA MDVTNDDYIRLLSALLPPGPVWSASDPAIAGAAPSLTRVHQRADALMRELDPRTTTELINRWERLCGLPDECIPAGTQTLRQRQQRLDAKVNLAGGINEDFYLAQLAALGRPDATITRYDKSTFTCSSACTDAVNAPEWRYYWQVNMPATTNSTWMTCGDPCDSALRIWGDTVVECVLNKLCPSHTYVIFKYPE >CP028702|1632867:1647248|1644601_1644766_+|AVZ48752.1|DBSCAN-SWA MLRHMGWTEAADLIVKGMEGAINAKTVTYDFERLMEGAKLLKCSEFGEAIIENM >CP028702|1632867:1647248|1637358_1638726_+|AVZ48742.1|DBSCAN-SWA MSTKLTGYVWDGCAASGMKLSSVAIMARLADFSNDEGVCWPSIETIARQIGAGMSTVRTAIARLEAEGWLTRKARRQGDGSSPHCAVVDEYHEHATDALYTTMLTGMGARRQPLMWAITTAGYNIEGPCYDKRREVIEMLNGSVPNDELFGIIYTVDEGDDWTDPQVLEKANPNIGVSVYREFLLSQQQRAKNNARLANVFKTKHLNIWASARSAYFNLVSWQSCEDKSLTLEQFEGQPCILAFDLARKLDMNSMARLYTREIDGKTHYYSVAPRFWVPYDTVYSVEKNEDRRTAERFQKWVEMGVLTVTDGAEVDYRYILEEAKAANKISPVSESPIDPFGATGLSHDLADEDLNPVTIIQNYTNMSDPMKELEAAIESGRFHHDGNPIMTWCIGNVVGKTIPGNDDVVKPVKEQAENKIDGAVALIMAVGRAMLYEKEDTLSDHIESYGIRSL >CP028702|1632867:1647248|1634485_1634686_+|AVZ48735.1|DBSCAN-SWA MHTRRVGIHRRSIVSLNLNPRALSRIHHNAKDSLWLKILSCYVPVFSPLQARWYAGVLTQPASKET >CP028702|1632867:1647248|1641705_1642308_-|AVZ48749.1|tail|DBSCAN-SWA MHKAILNSDLIATKAGDVTVYNYDGETREYISTSNEYLAVGVGIPACSCLDAPGTHKAGYAICRSADFNSWEYVPDHRGETVYSTKTGESKEIKAPGDYPENTTTIAPLSPYDKWDGEKWVTDTEAQHSAAVDAAEAQRQSLIDAAMASISLIQLKLQAGRKLTQAETTRLNAVLDYIDAVTATDTSTAPDVIWPELPEA >CP028702|1632867:1647248|1641320_1641734_+|AVZ48748.1|tail|DBSCAN-SWA MKIYCCLNTVGFFMDGCGVIPPDSKEITAEHWQSLLKSQAEGGVIDFSVFPPSIKEVIRTHDDEVADANFQKQMLISDATDFINSRQWQGKAALGRLKEDELKQYNLWLDYLEALELVDTSSAPDIEWPTPPAVQAR >CP028702|1632867:1647248|1638737_1638920_+|AVZ48743.1|DBSCAN-SWA MIMLILAPLVGVLGALLLAYGAWLIYPPAGFVVAGALCLFWSWLVARYLDRTQSSVGGGK >CP028702|1632867:1647248|1634964_1635273_-|AVZ48737.1|DBSCAN-SWA MNEQNLKHVIALLLEDAKRLQQIEPNAGTEARILLAKQALKTCGAQAPGRTEFMNFMANTLTPLPCNGEKVSRVYHDTMVKALRIELDGLRRKIVMNEIVTN >CP028702|1632867:1647248|1635447_1636122_-|AVZ48738.1|DBSCAN-SWA MKTIHDIRRSNARKLRDGVGGNSSFATMIDREPTQTSRFMGDGATKNIGDSMARHIEKCFDLPVGWLDQEHQTTNITKKPDVSITNKQITLVPVISWVQAGAWKEVGYSEVDLSTAETYPCPVPCGEMTYILRVIGDSMIDEYRPGDMIFVDPEVPACHGDDVIALMHDTGETTFKRLIEDGTQRYLKALNPNWPEPYIKINGNCSIIGTVIFSGKPRRYKIKA >CP028702|1632867:1647248|1644868_1645192_-|AVZ48753.1|DBSCAN-SWA MKWIVIDTVIQPTCGISFSAIWGNMKMIIWYQSTIFLPPGSIFTPVKSGIILKDKEYPITIYHIAPFNKDLWSLLKSSQECPPGESKITNKCLHNSCIIKICPYGLK >CP028702|1632867:1647248|1636212_1636413_+|AVZ48739.1|DBSCAN-SWA MKAYWDSLTKEQQGELAGKVGSTPGYLRLVFNGYKKASFVLAKKLEQYTSGAITKSDLRPDIYPKD >CP028702|1632867:1647248|1643534_1644368_+|AVZ48751.1|DBSCAN-SWA MHVFDNNGIELKAECSIGEEDGVYGLILESWGPGDRNKDYNIALDYIIERLVDSGVSQVVVYLASSSVRKHMHSLDERKIHPGEYFTLIGNSPRDIRLKMCGYQAYFSRTGRKEIPSGNRTKRILINVPGIYSDSFWASIIRGELSELSQPTDDESLLNMRVSKLIKKTLSQPEGSRKPVEVERLQKVYVRDPMVKAWILQQSKGICENCGKNAPFYLNDGNPYLEVHHVIPLSSGGADTTDNCVALCPNCHRELHYSKNAKELIEMLYVNINRLQK >CP028702|1632867:1647248|1645891_1646296_-|AVZ48754.1|DBSCAN-SWA MDQVVVFQKMFEQVRKEQNFSWFYSELKHHRIAHYIYYLATDNIRIITHDDTVLLLRGTRNLLKVSTTKNPAKIKEAALLHICGKSTFREYCSTLAGAGVFRWVTDVNHNKRSYYAIDNTLLYIEDVENNKPLI >CP028702|1632867:1647248|1639319_1640111_+|AVZ48745.1|DBSCAN-SWA MWYMRWEGVSDGLKVTAGSVIQRDDLVQYTTTDDATSSGGVLRVPIACSSAGAVGNADDGTALILVTPVNGLPSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYVVWAKEVPGITRAWTYRHLMGTGTVGVMIASSDLINPIPEESTETAARQHIGPLAPVAGSDLYVFRPVAHTVDFHIRVTPDTPEIRAAITAELRSFLLRDGYPQGELKVSRISEAISGANGEYSHQLLAPVDNISIAKNELAVLGTISWT >CP028702|1632867:1647248|1638919_1639393_+|AVZ48744.1|DBSCAN-SWA MFFSGLFQRKSDAPVTTPAELADAIGLSYDTYTGKQISSQRAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSLKQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKAFGEVAELLPVDPGCVVYALGRCQRWPEGDRRECYSAR >CP028702|1632867:1647248|1640689_1641319_+|AVZ48747.1|DBSCAN-SWA MHRIDTKTAQKDKFGAGKNGFTRGNPQTGTPATDLDDDYFDMLQEELCSVVEASGASLEKGRHDQLLTALRALLLSRKNPFGDIKSDGTVQTALENLGLGEGAKLNAATATLGRTGFIAIPVMIGGIEQSVIIQWGWNAAKASASGGDGNTVVFPVAFNNACVAVVANYDNVSAPINAVATGGYTTTSFLLRCAAQTGSYYYNWIAIGY >CP028702|1632867:1647248|1646516_1647248_-|AVZ48755.1|DBSCAN-SWA MAYYSIGDVAERCGINPVTLRAWQRRYGLLKPQRSEGGHRLFDEEDIQRIEEIKRWISNGVPVGKVKALLETTSQDTEDDWSRLQEEMMSILRMANPAKLRARIISLGREYPVDQLINHVYLPVRQRLVLDHNTSRIMSSMFDGALIEYAATSLFEMRRKPGKEAILMAWNVEERARLWLEAWRLSLSGWHISVLADPIESPRPELFPTQTLIVWTGMAPTRRQNELLQHWGEQGYKVIFHAP >CP028702|1632867:1647248|1632867_1633995_-|AVZ48733.1|integrase|DBSCAN-SWA MAARPRKNNVSVPNLYPLYSRKVNKVYWRYKHPVTGKFHALGTNEAEAIAIATEANTRLAEQRTRQILAISDRIATSKGKAITTSTWLDRYQAIQDDRLKSGDIRLNTYKQKAKPVSLLRERAGMKLISAVDVRDIAQLLDEYIAAGRPRMAQVVRSVLIDVFKEAQHYGEVPPGYNPALATKQPRRKITRQRLSLEEWKKIFDIADATHRYMGNAMLLALVTGQRLGDISRMKFSDIWDDHLHVIQEKTGSKIAIPLSLRLNAINWSLRDVVARCRDYAVSAYLVHFFRSTSQAERGAQVKANTLTMNFSKARDLARIDWGEGSPATFHEQRSLSERLYKEQGLDTQKLLGHKTQQQTDRYHDDRGKGWSKVAL >CP028702|1632867:1647248|1637189_1637369_+|AVZ48741.1|DBSCAN-SWA MRQVNRWFKDHYGVPVRVIRWEPETQRVIYLREGYEHECFSPLEQFRRKFREIEVGHEH >CP028702|1632867:1647248|1636456_1637014_+|AVZ48740.1|DBSCAN-SWA MGKHHWKIEKQPEWYVKAVRKTIAALPSGYAEAADWLDVTENALFNRLRADGDQIFPLGWAMVLQRAGGTHFIADAVAQSANGVFVSLPDVEDVDNADINQRLLEVIEQIGSYSKQIRSAIEDGVVEPHEKTAINDELYLSISKLQEHAALVYKIFCISESNDARECAAPGVVASIASGCGETNA >CP028702|1632867:1647248|1642873_1643428_+|AVZ48750.1|DBSCAN-SWA MLIGYVRVSTNDQNTDLQRNALNCAGCELIFEDKISGTKSERPGLKKLLRTLSAGDTLVVWKLDRLGRSMRHLVVLVEELRERGINFRSLTDSIDTSTPMGRFFFHVMGALAEMERELIVERTKAGLETARAQGRIGGRRPKLTPEQWAQAGRLIAAGTPRQKVAIIYDVGVSTLYKRFPAGDK >CP028702|1632867:1647248|1642307_1642802_-|AVZ51628.1|tail|DBSCAN-SWA MGEGSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDTGRSILSIQGYATEDHAHGLPSRSTIVTDATINFYFDEIWVNSGTDIIKRGNTNDAGLPAPDYGTFKTYKQSVDGLGAAASETRPRNIAFNYIVRAA |
24 | Shigella_phage(30.0%) | integrase,tail | attL 1631829:1631842|attR 1649860:1649873 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1839890 : 1870221
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP028702|1839890:1870221|DBSCAN-SWA CTTATGCCGCCAGCACGCGGTTGCGTCCATCATTTTTCGCCCGATACAAAGCATCATCAACGCGTTTAAACAGTTCATCGATGCTTTCATTTCCTTCGTGATGCGCCACACCAATGCTGACGGTAAAGCGTGGTAAGCCCGAAATACTCACTTTTGCCACGCTTACGCGGATAGTTTCAGCCAGCGAAAGCGCGGTATCCAGTGGGGTTCTTGGTAGCAATAAGACAAACTCTTCGCCTCCCCAACGAAACACCAAATCGCCTTTGCGAGCGCAACTTTCGAGGGTGCGGGCGAGGGCGCATAACACCTCATCACCTTTAGAATGCCCATAGAGATCGTTAATGTGTTTAAAACGATCGGTGTCGATGAGCAACAAGCTGTAATCCTGAGCGATGGCGAGATGCTGCATTTGGCCTGGTTCCGTAATGTGATAAAACTGTCGCCGATTCAGTAATCCGGTCATCGCGTCATGGTGAGCAGCATGTTCCAGCTGCTCCTCCAGCCGTTTTTGCTCAGTAATATCATGCACAATACATAACATGAGCTTGTCGCCATAAATTTCAATCGGTCCGGCATAGGTCTGCACATGACGAGTCGAACCATCCGCCAGTTTATGAACAAAATTCAAAGGTTTATGACCACCGGGTAAATGCGAGATTTCATGCATGATAGGCATGACGCGACGCCCGAGCATATTTATTTCCCAGGTATGTTTCTGGCACATCGTTTCATGGTTATAACCATAGAAATTGAGCGCGGCGAGGTTAGCATCGACGATTTGTCCATCTCGTGACGGGTCAATCAACAACATTGGTGCAGAGTTAGTCAGAAAAAAGCGCGCATAAAAACCTTGTTTTTTGCGCTGATAATTTGCCGAGCGACTGGCTTTTAAACCCAGCGTTGCCGGCGCTTCGATACCTTCGAAAATAATCACCGGTTCTGTTTCTGTCAGCTTTCGCAAAACAAGCCGACAGCTCAATGCTGTTTCCTCTTCTTTACGCTGAACAGTGAGGATTTCGATAATATCGTGTTGGTTTTGCAGATCGGAGAGGTATTTCGGCAGTTCTTTTTGTGAGGAGACGGAATAGGGTCCGGTTCGTAGCTGACTAAACGTGAGGTCTTGCATCAACAGTTTCGCCGCGCTATTGGCATAAATTAACTGTTCCTCAAAGGGCGAAACGATCCAGACAGGACTGGTGAGTAAGTCCAGGGTATTGAAGTTGTGCGTAATCATTGAGATCCCGTTATTTTTATCAATTTTTGTTGCTATCCGATCGCAAAAAAGCCACGTCATATGATCAGATAATTCTGATAATGATAGACGCTATTTAACACTTCACACGGTTTGTATACGGAAAAGCATTTTGCTTTTTGTATTCAATTTAGACAGAATTTTATTAATCATTTCAGGGTAATGGGGTGATGAGATGTTGCGTAACAGGGCCAGAAGGCTAGACTACAAAATAATGCGTTGATGATGGAGGCACTGTGGAAGCGATTAAGGGATCGGACGTTAATGTCCCGGATGCAGTATTTGCCTGGATGCTGGATGGTAGAGGCGGCGTTAAACCGCTGGAAAATACAGATGTGATTGATGAAGCGCATCCCTGTTGGCTCCACCTTAATTATGTACACCATGATAGCGCCCAATGGCTGGCGACAACACCGCTGCTTCCCAATAACGTACGTGATGCGCTGGCGGGCGAGAGCACGCGTCCCCGAGTCAGCCGTCTCGGTGAAGGCACGCTGATTACATTGCGCTGTATAAACGGCAGCACCGATGAACGCCCCGATCAACTGGTCGCCATGCGTGTATATATGGACGGGCGGTTAATTGTTTCGACCCGACAACGCAAAGTGCTGGCGCTGGACGATGTGGTGAGCGATCTGGAAGAGGGCACGGGTCCGACCGATTGCGGGGGATGGCTGGTGGATGTGTGCGATGCGTTGACCGATCATTCCAGTGAATTTATCGAGCAGCTGCACGATAAAATTATCGACCTTGAAGATAATCTCCTTGATCAGCAAATTCCACCGCGTGGATTCCTGGCTCTGCTGCGCAAACAATTAATTGTGATGCGTCGCTATATGGCACCGCAACGTGATGTTTATGCTCGTCTTGCCAGTGAACGTTTGCCGTGGATGAGCGATGACCAACGCCGTCGGATGCAGGATATTGCCGATCGCCTTGGGCGCGGCCTTGACGAAATCGACGCCTGTATAGCACGGACTGGCGTGATGGCGGATGAAATCGCTCAGGTGATGCAGGAAAATTTAGCTCGTCGTACCTATACAATGTCGTTGATGGCAATGGTCTTTTTACCCAGTACCTTTCTGACCGGGTTATTTGGCGTCAACCTTGGTGGGATCCCTGGCGGCGGGTGGCAATTCGGATTTTCAATTTTTTGTATTCTGTTAGTTGTTCTTATTGGTGGTGTTGCTTTATGGTTGCATCGTAGTAAATGGTTGTAACAAAAGCAATTTTTCCGGCTGTCTGTATACAAAAACGCCGCAAAGTTTGAGCGAAGTCAATAAACTCTCTACCCATTCAGGGCAATATCTCTCTTGCAGGTGAATGCAACGTCAAGCGATGGGCGTTGCGCTCCATATTGTCTTACTTCCTTTTTTGAATTACTGCATAGCACAATTGATTCGTACGACGCCGACTTTGATGAGTCGGCTTTTTTTTGCCTGTTATTTATCAGCGTCTACCCTTTAAGAGTCCACCCAATGACCAGAGGGAAATATGACGACACTTATTTATTTGCAAATTCCTGTCCCTGAACCGATTCCTGGCGATCCTGTTCCAGTGCCCGATCCGATCCCTCGCCCGCAACCCATGCCTGACCCACCACCCGATGAAGAACCGATTAAATTGTCGCATCGTGAGCGTAGATCTGCGAGGATACGCGCCTGCTAACTTTGCGTCGATGACCACGAGAATAGATTGTGACCGCTTTTTCTACCCTGAATGTTTTGCCTCCCGCCCAACTCACGAACCTTAATGAGTTGGGTTATTTAACCATGACGCCGGTGCAGGCCGCCGCGCTTCCGGCGATCCTTGCCGGAAAAGATGTTCGCGTGCAGGCGAAAACCGGCAGCGGCAAAACGGCGGCTTTTGGCCTCGGCTTGTTACAGCAAATTGATGCGTCGCTATTTCAAACCCAGGCTTTAGTGCTGTGTCCTACGCGTGAACTGGCGGATCAGGTGGCAGGTGAATTGCGTCGGCTGGCGCGTTTTCTGCCAAATACCAAAATTTTGACGTTGTGCGGTGGTCAACCGTTCGGTATGCAGCGTGATTCGTTGCAACATGCGCCGCATATTATCGTGGCAACGCCGGGGCGTTTGCTGGATCACCTGCAAAAAGGCACGGTATCACTGGATGCGTTGAATACGCTGGTGATGGATGAGGCCGACCGCATGCTGGATATGGGATTTAGCGATGCCATTGATGATGTCATCCGTTTTGCGCCTGCATCTCGACAGACGCTTCTGTTTTCGGCAACCTGGCCGGAAGCCATCGCTGCAATCAGCGGACGAGTGCAACGCGATCCTTTGGCGATTGAAATTGACTCAACAGATGCTTTGCCACCCATTGAACAACAATTTTATGAGACATCCAGCAAAGGCAAAATTCCTCTGTTGCAACGGTTATTAAGCTTGCATCAGCCATCCTCTTGCGTGGTGTTTTGCAATACCAAAAAAGATTGCCAGGCTGTCTGCGACGCGCTGAATGAAGTAGGGCAAAGTGCATTGTCATTACACGGCGATTTGGAGCAACGCGATCGCGATCAGACCCTGGTACGTTTTGCTAACGGTAGCGCCCGTGTACTGGTCGCGACTGATGTTGCTGCGCGTGGTCTGGATATTAAATCGCTTGAGCTGGTGGTGAACTTTGAGCTGGCGTGGGACCCTGAAGTTCATGTACATCGCATCGGTCGTACAGCTCGTGCAGGAAATAGCGGTCTGGCGATCAGTTTCTGTGCTCCGGAAGAAGCACAGCGGGCCAATATCATTTCTGACATGTTGCAGATAAAACTTAACTGGCAAACGCCGCCAGCTAATAGTTCCATTGCGACGCTGGAAGCAGAAATGGCAACGTTGTGTATCGATGGCGGGAAAAAAGCCAAAATGCGCCCGGGTGATGTATTAGGTGCACTGACAGGAGATATCGGGCTTGATGGCGCAGATATTGGCAAAATCGCCGTGCATCCGGCGCATGTCTATGTCGCGGTCCGTCAGGCTGTTGCTCATAAAGCATGGAAACAGTTACAGGGCGGGAAGATTAAAGGAAAAACGTGCCGGGTGCGGTTATTAAAATAATGAAATGTTGAATTGCCGGGTGCAAGAGTAAACATCTTATTCGGGATTGCCGGATGCGACGCTGGCCGCGTCTTATCCGGCCTCCATAAGAGTAGCCCGATACGCTTGCGCATCGGGCGCTATCCTGGTTATTTCACTTCAACCACATTCAGCCGTAACTCATCCAACTGATTTTCATCTTCTTCTGGCTGCCAGCACGCCGGTTGTAGTGGGATCTCTTCGCGATCAAACGCCAGATCACCCCCGTTAACCACTTCAGAACCGTGGGTGATGCCTTTGAAATCGAACAGGTTGGTATCGCACAGATGCGACGGCACCACATTCTGCATCGCGCTGAACATCGTCTCGATACGCCCTGGATAACGTTTATCCCAGTCACGCAACATGTCAGCAATCACCTGACGTTGCAGGTTAGGCTGTGAACCGCACAGGTTGCACGGAATAATCGGGAACGCTTTTGCATCGGCAAATCGCTGAATATCTTTCTCGCGGCAGTAGGCCAGCGGACGAATAACGATATGTTTGCCATCATCGCTCATCAGTTTCGGAGGCATACCTTTCATCTTACCGCCGTAGAACATATTTAAGAACAACGTTTGCAGGATATCGTCACGATGGTGACCCAACGCGATCTTCGTCGCCCCCAGTTCCGTTGCGGTACGATAAAGGATACCGCGACGAAGGCGAGAACACAGTGAGCAAGTGGTTTTGCCCTCTGGAATCTTCTCTTTCACGATACCGTAAGTATTCTCTTCAACAATCTTGTACTCAACGCCCAGCTTTTCAAGATACTCGGGCAGAACGTGTTCCGGGAAGCCCGGTTGCTTTTGATCGAGGTTAACAGCCACCAGCGAAAAATTGATTGGCGCGCTTTGCTGCAAATTGCGCAGAATCTCCAGCATGGTATAGCTGTCTTTACCCCCGGAGAGGCAAACCATGATGCGATCGCCTTCTTCAATCATATTGAAGTCAGCAATGGCTTCGCCCACGTTACGACGCAGACGTTTTTGTAATTTGTTCAGGTTGTATTGTTCTTTCTTTGTAATTTGTTGATTTTCTTGCATTATTTCAGTTCTCTGGTACTAAATGGGGCAAATTGGGGGCAAACTTTGCAACTACGATAACCGCGCATTCAACATGGCTATCTGTTCGTCGTTCATGTCATCAATCCACATACCGTAAATTTCATACACCATCTGCGCAGTTTCATGCCCCATTTGGCTGGCTATAAATGCCGGGTTCGCTCCTGCCGTCAACAGCCAGCAGGCAAAAGTATGCCGCGTATGGTACGGATTACGGCGGCGAATACCAGCACGTTTTACTGCTGCATTCCACCTTGCCCCCAAACTGCTTACCGAGTAATAAGGTTTTTGTTTTCCGTTACACACCCTGGGCATGAAAACAAAATGCAGTTTTTGCTTTTCGGTTCTGCCGTACTCCCGATGATAAAAGGTGATTTCGCTTTTGCGATGATGCCCGGTCAGTTTGTATTGCTCCTTCAGTGCTTCAAGAGCAGGCTGCAGTAGTGTTACTGTTCGGATCCCGGCATTTGTTTTTGGGGGACCGAACATATCAAGTATCGTCAGGTTTCTTCTGACATTCACTATTCCCTTTTCGAGATCCACATCCTCCCACGCCAGAGCTGCCAGTTCCCCGTGACGAAGTCCTGAGTAAACGGCAAATTTCCACAAGTTCTGGCTCTGTCCTTTTTCACTTTCCATTAATGCATTGAATTCTGTTTTAGATAACGGATCAGGCTTTATTCTGTTTCGCTGTAATTTTTTTACTCCTTCAAATGGTTTGGTTGATATAAATCCCGACTGATACGCAAAACGCAACAGCGAACAGAGCAGGGCGATATAGTTATCAACTGTGCGCACGGTTCTTCCTTTTTTGTTGGATCTTGGATTATCCAGGTAAAGCGTTTCTCCATGCAGCAGTTCATTCCGGTAGTTTAAGATATCGCTATAACGAATATGTGATATCGGGGTACTTTCACAAATTATTATTCTGAGTGTTTTTAATTGTGATTTCGTTTTCTTCATTGTGTTTGTTGTTAACTCTGTCTCTTTAATTTTTGTCCAGATATCACAAAGCTCTCCGAACGTTTTTATGACTCTCGTTGTCACCATTTTTGCCCCAGTGCTGGACTGGGGAAAACGTCTTAAATACTCAAATTCACCGGAGTTTATTTCATGAACTATCAGCGCTCTTAAATTTCCGGCCTTTTTAATATTACTGTTTGTAATCTCCCAGCCTTTTAATGTTTCCCGACATCGTTTTCCTCGAAACATGAACCAGATGCGAATGTATCTACCTCTAATCTCGACACCTGTTGGTAATTTAGACATATCATGAGTCTTTGATAAACTGATTTATCTTTGGATAGTTGTACCAGATAATCCCTCGTTTGCTGTCTGGCTTACCTAAAGGAGATACTCGTTTGAAGTGGAAGCCCTCCACCCAACAGTTCTGGCGGTATGCTTCAATTTGTCTGGCCCCCAGACCAGTGCGAAGCATCAGGCCGTATTCAACCATCCACTCTTCATTAAAGATTACTTGTGCCATCGCATCACCTCTGGCAGGCGCCAATGTTAGACTGAAATTGACGCCCGATGTTGATTATTAATAATCAGCTATGAAGTTTTAATTTGAATACAATGCAATTCTCGAGGACTGAAGTTTCTCGCAATTAAAATTTATCAGTTTTACTTTCTGCTCTCTGGAAACGCCTGCTTCTTTTTTACCTGAGAGCATTTTTTCGCATTCTGATTTCGTTAGTTTAGATTTTGAATATCTTGTCCAGTTAGTAGGAGTGCCACCTTCCTTTTCAATAGTGGCGGTAATTTTATACATGAACACCTCCATTATTATTTCCAGTGGTTCGTTTATTCCATCTTTCGAGTGCTTCTTTTTCACTTCCACCATAACCGGTTCGGGATTCGCATCCGTTACACTTCGCTCGGTAATATCCTGAAATGGCTTTCACCGTTACTGATGGACAACCACAAAATGGACATGGTTTAACATTGTCATATCTCATAATTTTTCTCATAAAAAATATTTCAAGTTGGCGGTGCATTACACCGCCAGGCTGAATTATTCCTCTGAATTATCGATTACACTGTATTCCCCGGTTAATACAGAGGAATCTGCAGGATCGATTGTCAGTGGTTCCTTTTCATCCATTGATACTGCACGCTGGATCTCAATTGATACGGGCAAATATTTGAACAGGCGACGAATAGCCGTTTTCTTTGCCATTTCTTCCCAGTGAGTTACCCACGGCCCGTTATTACCAGCTTTACTCAGGCTGCGCACCAGCTCAATCTGTTTGCGCGTCATAACTTCAAACTGAGTACCTCCGTCTTTCAGTCTTGCGACAGCATAGACGTGGGTAACCGGGGCATCTTCGTTTTCTCCCGGGCGGTGTATTAACTTTTCATCAAGGCCAAATTCGAAGCTAAACTCGTCACCTTCACGGACAACACGGGCTGACAGGCTGGCGATTTGACCAGAACGGCGAGCCAGATCAATCATGCCGCGATAGCCAATGATTAGCTGAACGTTCTTTTTACCGCTCTTTTCGTTTTTATTACCAAAAGGCAGTAAATATGCATGACCGAGGGCGCTACCTGGCTCAAGTCCGAGCTGTGAACACTGTACGATCGCACTGACAAAACTCATAGTGTCACAGTTTCCTAACGCCGGAACTTTACGAATTTCTGTGGTGGCGATACGGATCATACGTTCAGCCGTCATATGGCGTGGAAGAGCTGCTGCCAGTTGCTCTTTCATTGATGGCTGGTTAATAAAACTAATCACGTCGCTATTTTTAACTGCTGCTGGTGCACGGTTTCCCTGAGTTTTTTGCAGATCGGCTTTTGCGATTGGTGGTTGCTTAGTCATTTGCATATTCCTTAGCCCAGCGGGGCAGTGATAATGTCTTAATAGCTGGCCATTCATCGGTATTCAGGCAGTCAGACAGGGTTCGCAGATTGCGGTGATATTCCTGTTGACCTGCCAGTTTTGCTTCTTCGCCCATCATGAAAATTTCAACCGGATAACGTCCGCATTCAATAGTTGTGCTGGCAACCAGAAAAACGAAAGTTGGCTGCACTCCAAACTGTGCTTCATAACCGTCACTGTAGAATGCATCCTGAACGTGATAGCGGTAGTCGTAATAAGCGGTTTTGAATCGTTGAATATCCGCCGTAGTTTTCACGTCCATGATCCAGTGAAATTCAGGGATAATTTTGTCCGGACGGCACCGACACAAAATTCCTGTTTCAGGATCTTCCCAGTAAATTGATGATTCAGCGTGTCCGGCGCTTTCAACAAGCCATTGCCCCAGCGGCAAAGCCATAACGCTTTGATACATGAGTTCAATTTTCCGGCCTTCTTCCGCAGTGATAACCGTTTTTCCTGTGCTTGCGCATTCCATCAGAAACGCTTTCTCTTCTTCTTTTCCGGCGTTTGTACGGCGGTTAAATTCAGGTGCTACGATAAAGCGGTTACTGAATTCTTCCGGTTCAAGTACCCGGCAGTGGAAAGCAGTTCCTAAATCGAGCGTTTTTGTCTTTGTGGTGTCCACGGGGGCATTTTTACGCCACAAATATAGTGCCGGAGTATCAGCAATGTCATCGAGCTGAGACTTACTGATACCGGGACCCGCGTGGTAATTCTCATTCGAAATTCCGTAATAAATACCTGGCTCTATGTCTTCTACGATTACGGGATCTGCGACTTCGCCAGTTTCATCACTGCAATCGCGATGCGGATCGCTGCCAGCATTCTCATTGTGCGGATGTTCAGCGCCTTCCATTTCCTCCGGATCATTTTCCTTAGCTTCAACCTGACTCTCTTCATCGAATGTTTCCTGGTATGTTGCGTCGCCCATCACCGCACCACAGTCAGGGCAGTTATCCCCGCCAGTCTGGCCGCAGGCATTGCAGGCTATTTCCGGTTCCTGTTGCACTACTGGCTCAGGTTGATTCATATCTGGGCTGGTTTTTTCCGTTTCTGGCTGGTTCTGGTACACACAATCGCGAGTCTGGATCCCCTTTACCCATTTCGGATCGTTCGGGTCGCTAATTCCGTCAACAAATTCACCACGTGATGCAGCAAGCAATTTATCGTCATCGACAGGATTTTTTGATGGAATGTTTTTCCGGGCTTCATGGAGTTCTGCCCGCAGTTCCTGATATTTCGCATCAACAGAATTTACCTGTGACTGAGCATCCAGCGGCTGCGTGTCCTGATGATGTTCAGTTGCGTCCGGTTCCATTGTTTCAGCCTCTCCCTGTTCAACTGCCGTTGTTCCAGATGGTTGCGGTTTTTCTTCATCATCCTGTTTTCCTTCTTCTGTTACTCGCTGCGGCATCGGGGCAGAGGAGCGACCGCAGGCAATATCCACGATTTCCGGATCAGGGTTGGCATGATCGGTTTCAGTCAGTACTTTGTTCAGATATTCAGTGACGTGCGCGGGGATGACCTCGATCCCAATTGGTGCTTCTTTTACGGACGCAACCACGATGGCGCGGGAATAATCCAGCCCGCCAGGCATGGTGATGAATTTGTCGCGGAAAACAGAAAAGGGCGGTTTATTTTCAGCGATAATTTCCTCAATGCGTTTAGCGTGTGCCGGATGAAGGTTATAGATGTCCAGATCCATTGAACGGGCCAGTACGCCAGTGGCTACGTCGCGCGCCAGTGACGTCAGATCGTGTACGAAACCTTCGCCGCGATCGGTGAGGTTTCCGCCGCCAGCATTAGCACCGGAAGCCGTGCGAGTGATGTGTGAAACACGATTACCCTTCATCCACTCTTTTGTCAGCAGTCCTCGATCGGTGTAGTCAGCGTTCAGGTATGCTTCGAAAAAAGCAGTTATCAGTCCCAGGTTTGAATTACCAGGATTAGGGAAAACTTTGTCAGTGTCACGAACCAGTTTGTGGAGTTCGCGAATTTCCAGCGGGTCGAGCAGGCTGGTTTTGTGGGAAACAGCCAGGGCAGTAACAGCCGGTAGTTCTTCAGCCCGAGCAATGTGTAATGCCTGGAGTCCGTCGCGTGAAACGTGCGTTACCGGTTTTTCGCTGCCGTGTTGAGCAAGCCAACGAATGGGCAGTTCCTGGCCAGAAATTGGGAGTAGCATATTCTCCTCAATCTCAGTCATGTCTTCGCCGTTGACGTTGGTATTGCCTTGATAGTGAGCGTTGTCTGGTGCTGCTCCCGGTTTTAGTTCCCATGTCATGGAGTCTTTGCTGAGTTGATAGCGTTCACTCCAGGTAAAATCGATCTCACCTTCAGCGGGCAGGTCATTAACGACAGGAAAATTCGTGGCAACAGCTTTAAAATAGCTGCTCAGTTTTTTACCTGACTTAACGATCAGGTAGTCCAGAGTGGCACAGGTCGATTCAAAATCGTTGCTTGCCCACAGGACGACGTCAGGTTCACCGGATGATTTTTTCGCTTTCCGTAACAGGAAGAGTGGTTTTGTGCTCATTGTTTTTTAACCTCAACTCAGATTAAAATTCGTTTTGTTCAGTGAATGATCTTGCCGGATACACACTGTTCATAGCCTGCGCCATACGCAGGCTATTTCTTTCAGATTTCACCTTTTAATTTCATTGCAATTAGAGTTGCCAGAAATTCGGCTTTTTTTTCTGCGGGCAGATTCTTTCCGATATGCACCAGGCACATTTTTTTGACACCTTCATCAAGTGTTTTTACGTTGCCTGATGGACCATCGATATCAACCACAGTGAATGGGGTTTCTTTATTTTCTGTTTTAATTACGTAGCCAATGCGCTTTCCTTCCAGATTCACCTCGTGAACAATGTCATCGGTAGTTACAACAGTGGCTTCATAATTGGTAATCATGTTTTTCTCCTTAATTAAGGTTGAGCGAATACCTGCCATTTCTGGCATAAATTCAGTTTCGAATAGTCAATTAATTAAAGTTCATGTGCCATCTGGTCTTTTTCGGCACAAGCTTCACTGCAATATTTTCTCGGTTCGTCTTTTGATAAAATCCCGTGCATGAAGTGAAGCATTCTTTCAATAGCTTTGCTTTCTTCAACGTCTTTTTTGCAAAGGTGGTAAGCACATTTTATTTTCTTAGTCATCACCATGACTCCGCCTTTACAGGTAAACCATCACGACCGAGGAAGACTTTAATCATGCGGTCAGTAATGAATGTTTTTGTGGTCAGGTTACGAATATATAGTTTTCGCTTTTTAATATTGTTTGCCGAGGCAATATATGTCCGGCCTTCATGAAGAACATAATCGCCAGGAGTCACACACTGACGTGGTATTTCATCAGTTCCGAAGTGATGTGCAATCATAATTATCTCCATTTTTACAAATGAACTTTGTTGATGCGGTGTCTGGTGCCTCCAGGTGACTGCAACCAGTTAACAATTACAGTCGGCTTTCCCACCCAAACCAATAAGGACTAACATGACTTTTAACTGTGCCACGTGCGCTTAGCCGCATTCACCGCATCACAAAATTCACTTTAAAAAGGGCGGACATCAGCCGAACTTCAAGAAAAAAACTGATGCCGCCAGGACTACACACAGCAATGTCGTTATTTACAACCGGAGGCGCACTCCCACCATTTAAATTTAACAGACAAGACCGACTCTTTATGGATATCGGAAATGCGCCTTCGTGTTGTGCCCGGTTTTATTTCACCACCTCCGGGCTTCGGTGGTCTCGGCTATACCCCTACAGCGAGAGCTTGTGTTAACATTTCAATACCCTTACAGTTGAGAGTTATTGATATGTTGGATGTATTTACTCCATTGTTGAAACTTTTTGCTAACGAGCCACTCGAAAGACTTATGTATACGATTATCATTTTTGGTCTCACTCTCTGGCTGATACCGAAAGAGTTTACTGTCGCATTCAATGCTTATACTGAAATACCTTGGCTCTTTCAGATTATCGTTTTTGCCTTTTCTTTCGTGGTCGCCATTTCCTTCTCAAGATTGCGAGCACATATTCAAAAGCATTATTCATTACTACCAGAGCAACGAGTATTGCTTCGTTTATCTGAGAAAGAAATCGCTGTATTTAAAGATTTCCTTAAAACAGGAAATCTTATTATCACTTCTCCTTGCCGTAACCCGGTTATGAAAAAATTAGAACGGAAGGGCATCATTCAACATCAGAGTGATAGCGCAAACTGTTCTTATTATCTCGTCACCGAAAAATACTCCCATTTTATGAAGTTATTCTGGAACAGCAGGAGTAGACGTTTTAATCGTTAGCTTACTGTGTGCTTCTCCAACCATCGGCGCGCACCAGTTTCGGTTTTAAATGTTTTGCTTTTGGTATACGTCATGGCAGTGAACGTTCCATCCTGGTTGGGGAACACGCCGCACACCAGGGATTCGTTGTTGCCGAGGTCGATTTTTTGCATTTTGCGAATCTCACATCTTGTTGCTACGTATAGCGACTTCTGCCTGCCAGAGATCCCAGTCGTTGCTGCGTAAAGCCTGCACAGCCTGGTTGTAAGTGATACCGCAACAATCCATCAAATACTGAACTACTTCGTAATGCACCATCTTATCTCTCCCCTTAACGCCGGGTGGCGGAACTAACTGCTGCACTGCAAAATTTGAATCCCGCCGTCATGTTCATACGCCTCGGGCTGGCTACTTAACCCCTTACCACTGCCTGGTAACTCGAAGTATTGCCCGGCGTTCTGTGGGGCGGGGTGGGTGGTATGCTGGAACTATAGGTAATGCCTAATTGATTGTCAATAGGCTATGCCTAATGTTTTGAGCGTAACCTAATAGGTGATGGCGACAGCAGAAAGTGATGGGGGGGTTAAATAACGGAATCCAGGAGTTTTCCGTCAGACCATATAAGTTTAAGTTCCAGTTTTTGTGATGTTCTGGCTTTTCCGTTCAGATTCAAGAGCTTTCAGATACTTACCCACTTTCATTTCCATCGCTGCTATGTAGGCGCGAACATCGTGGTCAACCCAATCTGGTTCTGTAGCATTTCCAGATAACAGGAAAGCTACAATCGCTCTTATTTCATCAGAGGCTGCTTGATAAAGGTTGTTTATATCTAAAAGTTCACTTTTTGTATCTGAATTGGTGGGGGTTGGTATGGGGTATTCGTTAAGCCCCCAATGCTCTGGACCAACAACATCAGAAAAGAAACGCCATAATTCTGGAAGTTTATCTTTACTTATAGAGCCTTTCTTAATCCAGTCATAAATTGATGGTGGTTGGACTTTAAAGTGGCGTGCGACCTCCGCCTTTGATTTGACGGATCCCGATGCGATTTTTTTGTTAATGGCCTGCTCTATCGCTCGGCCTAAGTCTTTACCACTAAGCATTGCTTAATATTCTCCTATGCGCATTACATTAGGCAATCCCTACCCTTACTGCATTAGGCACAGCCTATTGACAATTGCGTTAGGCGTCGCCTAATATTTCTGTGTGTTTTTGGAGTTCATTCGATGAAAAAAGAGAACTATTCATTCAAGCAAGCTTGTGCTGTTGTCGGTGGGCAATCAGCAATGGCTAGGCTTTTAGGTGTATCACCTCCAAGCGTAAATCAATGGATCAAAGGGGTACGTCAATTGCCTGCCGAGAGATGTCCAGCAATTGAACGTGCAACAAGAGGTGAGGTTCTGTGCGAAGAACTTCGTCCTGATATTGACTGGTCATATTTACGACGTTCGGCATGTTGTTCGCAGAATATGTCAGTGAAGCAACTAAATGACAGTAACAAATCCTCATTTGATCATACCTGAAACATCAAGAGGCAAATGATTCATGAAAATCAAGCATGAGCACATCGAATCAGTGTTGTTTGCCCTAGCAGCCGAAAAAGGGCAGGCATGGGTAGCCAATGCAATTACTGAAGAATATCTGCGCCAGGGGGGCGGCGAATTGCCCCTGGTTCCAGGCAAGGACTGGAACAATCAGCAGAATATCTATCACCGTTGGTTGAAAGGTGAAACGAAAACGCAAAGAGAAAAAATTCAGAAGCTGATCCCAGCAATTCTGGCAATCCTTCCGCGCGAGCTGCGTCACCGACTCTGCATCTTCGATACCCTGGAACGCCGTGCATTACTGGCGGCGCAGGAAGCGTTAAGTACGGCAATTGATGCGCATGATGATGCAGTCCAAGCCGTTTACCGGAAAGCGCATTTCAGCGGCGGCGGTTCTTCCGACGATTCTGTCATTGTTCATTAAGCAAAAGTTTCCATGCTGTTTGTGCTTATTCTAAGCCACCGGGCAGCATCATACGGGGCAATTATGGCCGCATTACCATACATGCAACTGTACATAGCTGATTACCTGGCTGACACCATGCATTTGTCAGCAGAGGAGCATGGTGCGTATTTGTTGCTGATGTTCAATTACTGGCAAACAGGAAAGCCAATACCTAAAAACAGGCTGGCAAAAATTGCCCGTCTGACTAACGAGCGATGGGCTGATGTTGAACCATCCTTGCAGGAGTTTTTTTGCGATAACGGCGAGGAATGGGTGCATCTTCGGATTGAGGAAGATCTGGCATCAGTCAGGGAAAAATTAACCAAAAAATCAGCCGCAGGAAAAGCATCTGTTCAGGCCAGAAGAAGCAGAAAGGAAGCAGATGTTCAAACAAAACAAGAGAGAAATTTAACAGGTGTTCAAACAGATGTTGAAGTGGTGTTTGAACATGATGTCAACACAAAGGCAACTAATAAAGATACAGATAAAGATCTAAAAACAGATCCCCCCCTAAATCCCCCCCGGGGGAATCGAGGTGTCAAAAAGTTTGACCCTCTGGATATTACTTTGCCGAACTGGATTTCTGTCTCGCTTTGGCGTGAGTGGGTTGAATTTCGCCAGGCATTGCGAAAACCGATTCGAACGGAGCAGGGCGCTAACGGGGCGATACGGGAGCTGGAAAAATTCCGCCAGCAGGGTTTTTCACCTGAGCAGGTGATTCGACACAGCATCGCCAATGAATACCAGGGCTTGTTCGCGCCGAAAGGTGTTCGACCTGAGACGTTACTCCGACAGGTTAACACCGTCTCGTTACCGGATAGTGCGATCCCGCCAGGCTTCAGGGGGTAACTGACCATGAAAAATATTGCGACAGGCGATGTTCTTGAACGTATCCGCAGACTGGCCCCGTCACATGTAACCGCGCCATTCAAGACGGTAGCGGAGTGGCGCGAGTGGCAACTTTCCGAAGGCCAGAAACGTTGTGAGGAGATCAACCGTCAGAATCGTCAGTTGCGGGTGGAAAAAATTCTGAATCGCTCTGGCATCCAGCCATTGCACCGCAAATGCTCGTTTTCGAATTACCAGGTGCAGAACGAAGGGCAGCGATACGCGTTGAGTCAGGCGAAATCCATCGCTGATGAACTGATGACCGGGTGTACAAATTTTGCGTTCAGCGGAAAACCTGGTACCGGGAAGAACCACTTAGCGGCAGCTATCGGGAATCGCCTGCTGAAAGACGGTCAGACAGTGATTGTGGTTACCGTGGCTGATGTTATGAGTGCCCTGCACGCCAGCTATGACGATGGGCAGTCAGGCGAAAAATTTTTGCGGGAACTGTGCGAAGTGGATCTGCTGGTTCTTGATGAAATTGGCATTCAGCGCGAGACGAAAAACGAGCAGGTGGTACTGCACCAGATTGTTGATCGCCGGACAGCGTCGATGCGCAGCGTGGGGATGCTGACAAACCTGAACTATGAGGCCATGAAAACATTGCTCGGCGAGCGGATTATGGATCGCATGACCATGAACGGCGGGCGATGGGTGAATTTTAACTGGGAGAGCTGGCGTCCGAATGTCGTCCAGCCAGGAATTGCGAAGTAATTTTTACCGGGAGAAAAATTTAATGGAGACTGTTTTTGACGCACTGAAAGCAATGGGAAAAGCCACATCCATAGAACTTGCTGCGCGACTTGATATCAGTCGTGAAGAAGTGCTGAACGAACTATGGGAACTGAAAAAGGCTGGTTTTGTTGATAAAAGCGCGTACACCTGGCGTGTGGCTGATAACAATGTTCAGCAGGAACAGCCAGCGCAGGCAGAACTGCCGGAAGAAATCACCACAGCAACAGTAGCGAAAATCTCAGAGTGCGATTTAACCGCGACGATTGAACAACGAGGACCACAAACGGCTGATGAGCTGGCTACATTGTTTGGTACCACATCACGCAAAGTGGCTTCAACGCTGGCAATGGCAATCAGCAAAGGTCGTCTGATTCGCGTAAATCAGGGCGGTAAATTTCGTTACTGCATACCGGGCGATAATTTACCAGCAGAGCCGAAAGCAGCATCGGTATCTCCGCTCTGGTTATCTGCATCGTCGTCTGCCTGTCATGGGGTGTTAATCATTACCGTGATAACGCCATCGCCTACAAAGAACAGCGCGACAAAAATGCCAGAGAACTGAAGCTGGCGAACGCGGCAATTACTGAGATGCAGATGCGTCAGCGTGATGTTGCTGCGCTCGATGCAAAATACACGAAGGAGTTAGCTGATGCGAAAGCTGAAAATGATGCTCTGCGTGATGATGTTGCCGCTGGTCGTCGTCGGTTGCACATCAAAGCAGTCTGTCAGTCAGTGCGTGAAGCCACCACCGCCTCCGGCGTGGATAATGCAGCCTCCCCCCGACTGGCAGACACCGCTGAACGGGATTATTTCACCCTCCGGGAACGACTGGTAATGATGCAGGCCCAACTTGAAGGTGCTCAGCAATACATAACCGAGCAGTGTTTAAAGTAAAATCTTAACTACAATATGATTCATTTTGATGATTGTTTCATAAGGAACAGTGAAGTAAGATCTAAGAGGAGTTAAATTTTATACAGTATAATCATAATATTGCAGCAAGGTGGTTATAATTGAAAGAATATTTAGATATGAATACATCTCATGTAAGAGTTGTTACTCATATGTGTGGGTTCCTGGTTTGGCTCTATAGTCTTTCAATGTTGCCACCAATGGTTGTAGCATTGTTTTATAAAGAAAAAAGCCTGTTTGTTTTCTTTATAACTTTCGTTATATTTTTTTGCATTGGTGGCGGAGCGTGGTATACAACTAAGAAATCTGGCATTCAATTACGTACCCGTGATGGGTTTATTATAATTGTAATGTTTTGGATTTTGTTTTCTGTTATTAGTGCATTCCCTTTATGGATTGACTCAGAACTTAATTTAACGTTCATTGATGCTCTGTTTGAAGGGGTTTCTGGAATAACAACAACAGGAGCAACTGTAATTGATGATGTTAGTTCATTACCTCGGGCATATTTGTACTATCGGTCACAGTTAAATTTTATAGGTGGTTTAGGAGTTATTGTTCTGGCGGTTGCTGTATTGCCATTATTGGGTATTGGTGGTGCAAAGCTTTATCAGTCAGAAATGCCGGGGCCATTTAAGGATGACAAACTCACTCCCCGCCTGGCCGATACGTCACGGACACTGTGGATAACTTATTCTTTATTAGGTATTGCTTGTATTGTCTGTTATAGACTTGCAGGAATGCCTTTGTTTGATGCTATTTGTCACGGGATTTCCACAGTTTCGCTTGGTGGTTTCTCAACTCATAGCGAGAGTATCGGATATTTTAATAACTATTTGGTTGAGCTGGTGGCTGGTTCTTTTTCCCTGCTATCGGCTTTCAACTTCACTCTTTGGTATATTGTTATTAGCAGGAAAACGATAAAACCTTTAATCAGAGATATTGAACTTCGTTTCTTTCTGTTAATAGCCTTAGGGGTGATCATTGTTACCTCTTTCCAGGTCTGGCATATAGGTATGTATGACTTGCATGGAAGTTTTATTCATTCGTTTTTTCTTGCCAGCTCCATGCTCACTGATAATGGTTTAGCTACGCAGGATTATGCAAGTTGGCCCACGCACACGATAGTGTTTTTGCTGTTGTCAAGTTTCTTTGGGGGATGTATAGGTTCAACTTGTGGTGGAATTAAGTCACTTCGATTTCTTATACTTTTCAAACAAAGCAAACACGAGATAAATCAGCTTTCTCATCCCAGAGCGTTGTTGAGTGTAAATGTAGGAGGGAAGATAGTTACAGATCGTGTAATGAGGTCTGTATGGAGTTTCTTTTTTCTTTATACTCTCTTCACGGTGTTTTTTATACTGGTGTTAAATGGTATGGGATATGATTTTCTTACATCATTTGCAACAGTGGCTGCATGTATTAATAATATGGGATTAGGTTTTGGGGCTACTGCATCGTCATTCGGAGTGCTTAATGACATTGCAAAATGCTTAATGTGCATAGCTATGATTCTTGGTCGCCTTGAAATTTATCCTGTTATTATATTGTTTTCAGGTTTTTTTTGGCGCTCCTAATATATGGCTGATTTATAATTGTGAGTTTAATATTATGTTGACTCACTCATTGATCCAATACCTAACTTTACCAGCAACACCTCCGTCCCCAGTAGCACTGGCTGCTGGGGTGCGTTTTATTCATAAAGCAAGGCTGTATGAGCGAGAAATTAAAGATAGTCTATCGCCCATTACAAGAATTGTCACCGTATGCGCACAACGCCAGGACGCACAGTACTGAGCAGGTGGCACAACTGGTAGAAAGTATTAAGCAATTCGGCTGGACTAATCCGGTGCTGATTGACGAAAAGGGCGAAATTATTGCGGGTCACGGTCGTGTTATGGCGGCTGAAATGCTCAAAATGGATTCTGTTCCGGTCATTGTTCTGTCTGGCCTGACGGATGAGCAGAAGCAGCGATAACGATCAGTATCGCTCCCGTAATGCATTAATCCGTCGCCACATTGAGAAAATGGATGCCAGTTTGCACGTCGGAACGAAGGAGTTTGATATTTCAAAGGTTTCCGAGGTGGATTCTGTTGATGATTTACTCATTGATAATGCCGCTCGTTATCTGCTGAAAGACTGGAAAGGGGTTGGTGAACTGGTTAATGGTGTTGAGGTTGCACTGGAATATACGGCAGAACGAGGGATCGCGCTGCTTAAGCAGAATCCAGAGTTGTACTGGCAGATCCTTGCAGAAGCAGCCAGCATCGCCCAGGGTAAAGAGCAGCAGAAGCAGGATACGATAAAAAAGCCATAGCTGCCCAGCGGTGGTTATCGGAGTTCGGGGGAGAAAGGGGGGAAAAGGCAAGATGGAAGCGAGAAAAACTCAGGTTGCCACCGATACCGGAACCAGAAATAGACCCGGTGCTTAAGGAGTTGTTGTACGCCTATTCGGTAATATCCCGTGCCCGACGTTATGCTGGAATGGCTGGGGTGCCTTTGCCTTTATCTCTGACAGAGATAAATGAATATTTAGCCACTCATCCGGTATTGATTGAGCGCGATGAATTTGAAGCAGTGATCTTTGCACTGGATGACCAGTATTTTCAGGAGCAGTGTGTGTAGTTGTTAATTACGTACACTCTGTTACAGAGATGTGATGGTGTCTTTAATTAAATCGATGATGCTCCTGGAGAAAAGCATTGCGTGGCCTCGTAATCGCTATATCTACTATTATGTCGCCTGAAACCCACTTCGCGGTGGGTTTTTTGTTGTCAGGAGTTTTAATAAATGGCAGAGCAAACCTCGCGTCTCGCAATAATTATTGATAGCACTGGAGCGAAAAATAATGCTGACAATCTGACCTCCTCATTAGTCAAAATGACGCAGGCTGGGGAAACTGCTGCAAATAGCGCAGGGAAAGTGACTAAGGCAACAGAAGATGAGAAGAACGCGCTCGCAAAATTAAAAGCAGCTATTGATCCAGTTGGTGCCGCAATTGATACTGTCGGTCGACGCTATTCTGAATTAAAGAAATTTTTCGATAAAGGGCTTATTGATAAAGAAGAATATGAATTTCTTGTCCGTAAACTTAATGAAACCACAGAGGAATTGAGCGGGGTTGCGCAAGCGCAGAGAGAAGCCGAGAAGGCCGGAAAACTTGCTGCCGCTCAGCAGGAAGCGCAGGCTCAGGCCTTTCAAAGAATGCTGGACAAGATCGACCCTCTGGCTGCGGCGCTAAGAAATCTTGAACAACAGCATGATGAACTTAATGCTGCGTTTGCATCCGGGAAAATAAATGGTTCTCAGTTTGAGAATTATAGCCGAAAAATACAGGAAACACGGCGAGAGCTTACCGGAGAGGCTCAGGCAGAGCGAGAAGCAGCAAAAGCGCATGATGAACAGGTTGTTGCTTTGCAACGTCTGATTGCTCAACTTGATCCTGTCGGAACTGCTTTTAATCGTCTGGTAGAACAACAGAAACAGCTCAATGAAGCAAAAGCTAAGGGGATGCTTTCTCCTGAAATGTATGAGGAGCTTTCTGGAAAACTTCGTGCTATGCGGAGTGAGCTTGAGGTTACTCAATCACAATTAAGCAAAACCGGAATGTCGGCAAAACAAACGGCTTTTGCTATGCGCATGTTGCCTGCACAAATGACGGATATTGTTGTTGGGTTGTCCACTGGTCAGTCGCCATTTATGGTGTTAATGCAGCAGGGCGGCCATTCAGCTATGCAGGAGACAAGAATCGCCAGCTTACCCGTTACAGCGATACCCGCTGGCATGAAGATTCCGTGCGTAACCGCTGGTTCAGCGTGATGGTGGGGCCGTCTGTGCGCGTGAATGAATGGTTCAGCGCGTATGCGATGGCGGGTATGGCTTACAGCCGTGTGTCGACTTTCTCCGGGGATTATCTCCGCGTAACTGACAACAAGGGAAGGTGCGAATAAGCAGGTCATTTCTTCCCAAGCTGACTCGCTGATTAAAATTTCGCGGATCTGGGCCGATTTTTTTCCCGCAAACACATCGAATCAGCCTATTTAGGCTATTTTTTCCACCATTTCTGGCGTTATTTCCGGTTTTTACTGAGATCTCTCCCACTGACGTATCATTTGGTCCACCCGAAACAGGTTGGCCAGGGTGAATAACATCGCCAGTTGGTTATCGTTTTTCAGCAGCCCCTTGTATCTGGCTTTCACGAAGCCGAACTGCCGCTTGATGATGCGAAACGGGTGCTCCACCCTGGCACGGATGCTGGCTTTCATGTATTCGATGTTGATGGCCGTTTTGTTCTTGCGCGGATTCTGCTTCAAGGTTTTTACCTTGCCGGGACGCTCGGCGATCAGCCAGTCCACATCCACCTCGGCCAGCTCCTCGCGCTGTGGCGCTCCTTGGTAGCCGGCATCGGCTGAGACAAATTGCTCCTCTCCATGAAGCAGATTACCCAGCTGATTGAGGTCATGCTCGTTGGCCGCGGTGGTGACCAGGCTGTGGGTCAGGCCACTCTTGGCATCGACACCAATGTGGGCCTTCATGCCAAAGTGCCACTGATTGCCTTTCTTGGTCTGATGCATCTCCGGATCGCGTTGCTGCTCTTTGTTCTTGGTAGAGCTGGGTGCCTCAATGATGGTGGCATCCACCAAAGTGCCTTGGGTCATCATGACGCCTGCTTCGGCCAGCCAGCGATTGATGGTCTTGAACAATTGACGGGCCAGTTGATGCTGCTCGAGCAGGTGGCGGAAATTCATGATGGTGGTGCGATCCGGCAGGGCGCTATCCAGGGATAATCGGGCAAACAGGCGCATGGAGGCGATTTCGTACAGGGCATCTTCCATGGCACCGTCGCTCAGGTTGTACCAATGCTGCATGCAGTGAATACGCAGCATGGTCTCCAGCGGATAGGGCCGTCGGCCATTGCCCGCCTTGGGATAAAACGGCTCGATGACAGCGGTCATATTCTGCCATGGCAGAATCTGCTCCATGCGGGAGAGGAAAATCTCTTTTCGGGTCTGACGGCGCTTAGTGCTGAATTCACTATCGGCGAAGGTGAGTTGATGGCTCATGATGTCCCTCTGGGATGCGCTCCGGATGAATATGATGATCTCATATCAGGAACTTGTTCGCACCTTCCCAAGGGGAAAACGCACGACGTGCTGACCGGAAGTGATGACGGTCGCCACAGCAACACGTCTCTGGCGTGGGGAGCTGGCGTGCAGTTTAACCCGACCGAATCCGTGGCCATTGATATTGCTTATGAAGGCCCCGGCAGTGGCGACTGGCGCACTGACGGTTTCATCGTGGGTGTCGGTTATAAGTTCTGATTAGCCAGGTAACACAGTGTTATGACAGCCCGCCGGTTCAGGCGGGCTTTTTTGTGGGGTGAATATGGCAGTAAAGATTTCAGGTGTACTGAAAGACGGCACAGGAAAACCGGTACAGAACTGCACAATCCAGCTGAAAGCAAAACGTAACAGCACCACGGTGGTGGTGAACACGCTGGCCTCAGAAAATCCGGATGAAGCCGGGCGTTACAGCATGGACGTTGAGTACGGTCAGTACAGCGTTATTCTGTTGGTGGAAGGATTCCCGCCGTCACATGCCGGGACCATTACCGTGTATGAAGATTCTCAACCCGGTACGCTGAATGATTTTCTCGGTGCCATGACGGAGGATGATGCCCGTCCGGAGGCACTGCGCCGTTTTGAACTGATGGTGGAAGAGGTGGCGCGTAACGCGTCCGCGGTGGCACAGAACACGGCAGCCGCGAAGAAGTCAGCCAGTGATGCCAGCACATCAGCCCGTGAGGCGGCAACCCATGCGGCTGATGCTGCGGACTCAGCACGCGCAGCCAGCACGTCAGCCGGACAGGCCGCGTCGTCGGCTCAGTCAGCGTCTTCCAGCGCAGGAACGGCATCAACAAAGGCCACTGAAGCATCAAAAAGTGCTGCCGCTGCAGAGTCCTCAAAAAGCGCGGCGGCCACCAGTGCCGGTGCGGCGAAAACGTCAGAAACGAATGCTTCAGCGTCACTACAATCAGCAGCCACATCTGCATCCACCGCGACCACGAAGGCATCAGAAGCTGCGACCTCGGCCCGGGATGCGGCGGCCTCAAAAGAAGCGGCAAAATCATCAGAAACGAACGCATCATCAAGCGCCAGTAGTGCAGCTTCCTCGGCAACGGCGGCAGGAAATTCCGCGAAGGCGGCAAAAACGTCCGAGACGAACGCCAGGTCTTCTGAAACGGCAGCGGGACAGAGCGCCTCGGCTGCGGCAGGCTCAAAAACAGCGGCTGCGTCGTCTGCCAGTGCAGCGTCAACAAGTGCCGGGCAGGCCTCAGCCAGTGCCACCGCCGCCGGAAAATCGGCAGAAAGCGCCGCATCGTCTGCTTCAACAGCCACAACGAAGGCTGGCGAAGCCACTGAACAGGCCAGCGCAGCAGCGAGGTCTGCTTCCGCAGCGAAGACATCCGAAACGAACGCGAAAGCGTCGGAAACAAGCGCAGAATCCTCAAAAACGGCTGCCGCATCGTCAGCCAGTTCGGCGGCGTCATCGGCATCATCGGCGTCTGCTTCAAAAGATGAGGCGACCAGACAAGCGTCAGCAGCGAAGAGCAGCGCCACGACGGCATCCACGAAGGCGACAGAGGCTGCTGGCAGTGCGACGGCGGCAGCTCAGAGCAAAAGTACGGCGGAATCCGCGGCAACGCGCGCCGAGACAGCAGCTAAACGGGCAGAGGATATTGCATCCGCCGTGGCGCTTGAGGATGCAAGTACGACGAAAAAGGGGATAGTACAGCTCAGCAGTGCGACCAACAGTACGTCTGAAACGCTGGCGGCAACGCCAAAGGCAGTAAAATCAGCCTATGACAATGCAGAGAAACGTCTGCAGAAAGACCAGAACGGCGCTGATATACCCGATAAGGGATGCTTCCTGAACAACATTAACGCGGTCAGTAAAACAGACTTTGCTGATAAGCGTGGTATGCGTTATGTGCGGGTTAACGCTCCTGCAGGTGCAACATCTGGAAAATATTACCCTGTTGTTGTTATGCGTTCTGCTGGCTCAGTAAGCGAACTGGCATCAAGAGTCATTATCACCACGGCAACGCGAACCGCAGGCGATCCGATGAATAACTGCGAGTTTAACGGATTTGTTATGCCTGGTGGCTGGACTGACAGGGGGCGTTATGCTTATGGCATGTTCTGGCAATATCAAAACAATGAACGAGCCATTCACTCAATAATGATGAGTAATAAGGGCGATGATTTGCGCTCTGTGTTCTATGTTGATGGCGCTGCTTTCCCTGTTTTTGCGTTTATTGAAGATGGCCTGTCAATATCCGCACCTGGTGCTGATCTCGTTGTTAATGATACGACCTATAAGTTTGGGGCAACAAATCCGGCGACTGAATGTATCGCGGCGGACGTTATCCTTGATTTTAAGAGTGGGCGTGGTTTTTATGAGTCTCATTCGTTAATCGTTAACGATAACTTGTCGTGCAAAAAACTTTTTGCCACAGACGAAATTGTAGCGCGTGGTGGTAATCAGATTCGAATGATAGGTGGGGAGTATGGTGCATTATGGCGTAATGATGGCGCTAAAACTTACCTGCTGCTTACCAATCAAGGTGATGTTTATGGTGGCTGGAATACATTAAGACCGTTTGCTATTGATAACGCAACCGGCGAACTGGTTATTGGAACCAAACTGTCCGCAAGTCTGAACGGTAATGCATTAACAGCAACAAAGCTGCAAACGCCAAGACGGGTTTCTGGTGTTGAGTTTGATGGTTCCAAAGATATTACTTTAACCGCCGCGCATGTGGCTGCTTTTGCCAGAAGGGCAACGGATACATATGCCGATGCGGATGGTGGCGTTCCATGGAATGCCGAATCTGGCGCTTACAATGTCACCCGCTCTGGCGACAGCTATATTCTGGTTAACTTCTATACCGGAGTCGGAAGTTGCCGGACCCTGCAGATGAAGGCGCATTACAGAAATGGTGGTCTGTTCTACCGTTCTTCAAGAGACGGTTATGGTTTTGAGGAAGACTGGGCAGAAGTTTATACCTCGAAAAATCTTCCACCAGAAAGCTACCCAGTCGGCGCACCAATCCCGTGGCCATCAGATACCGTTCCGTCTGGTTATGCCCTGATGCAGGGGCAGGCTTTTGACAAATCTGCTTACCCGAAACTTGCAGCCGCTTATCCGTCAGGCGTGATCCCTGATATGCGTGGCTGGACGATTAAGGGCAAACCTGCCAGTGGTCGGGCCGTATTGTCTCAGGAACAGGACGGCATTAAATCGCATACCCACAGCGCCAGCGCATCCAGTACAGATTTGGGGACGAAAACCACATCGTCGTTTGATTACGGCACTAAATCCACGAATAACACCGGGGCACATACACACAGTGTGAGCGGCTCTACAAACTCGGCTGGAGCACACACACACTCACTAGCCAACGTGAACACGGCTAGTGCTAACTCCGGTGCTGGTAGTGCATCAACAAGATTGTCTGTTGTGCATAATCAAAACTATGCAACATCATCTGCTGGCGCACATACCCACTCACTGTCCGGCACTGCTGCAAGCGCAGGTGCACACGCGCATACTGTCGGTATTGGTGCTCATACGCACTCCGTTGCGATTGGTTCACATGGACACACCATCACCGTTAACGCTGCTGGTAACGCGGAAAACACCGTCAAAAACATCGCATTTAACTATATTGTGAGGCTTGCATAATGGCATTCAGAATGAGTGAACAACCACGGACCATAAAAATTTATAATCTGCTGGCCGGAACTAATGAATTTATTGGTGAAGGTGATGCATATATTCCGCCTCATACAGGTCTGCCAGCAAACAGTACCGATATTGCACCGCCAGATATTCCGGCTGGCTTCGTGGCTGTTTTCAACAGTGATGAGTCATCGTGGCATCTCGTTGAAGATCATCGGGGTAAAACGGTTTATGACGTGGCTTCCGGCAACGCGTTATTTATTTCTGAACTCGGTCCGTTACCGGAAAATGTTACCTGGTTATCGCCGGAAGGGGAGTTTCAGAAGTGGAACGGCACAGCCTGGGTGAAGGATACGGAAGCAGAAAAACTGTTCCGGATCCGGGAGGCGGAAGAAACAAAAAACAACCTGATGCAGGTAGCCAGTGAGCATATTGCGCCGCTTCAGGATGCTGCAGATCTGGAAATTGCAACGGAGGAAGAAATCTCGTTGCTGGAAGCATGGAAAAAGTATCGGGTATTGCTGAACCGTGTTGATACGTCAACTGCACAGGATATTGAATGGCCAGCACTGCCGTAGGGTAAAACATATAAATTCTATAATTAGATGTATCTTTCCATTTACGGCAAGGAAGGGGGCTTGGAAGACGTAAAGCATCTCACACCGAGATTATTTTTTATATGTCAGGTGTCTGAAGTTTTGCTTTGGCTCTTAAAATGGTTTGCCGCGAGGTTTTGAATTCCCGGGCAATGGCACTTATACTTACACCTGACTTAATTCGTTCGAATACCGCCTGTTTCTGTTCTTCATTTAACACAGGTGGTCGACCAAAACGTTTCCCTGCGCCGCGGGCTCTTACTATCCCGGAATGAGTGCGTTCAAGTAAAAGGTCTCGTTCAAATTCAGCGACTGCTGAAATTACTTGCATCATCATTTTTCCTGTTGGACTGGTCAGGTCAATGCCACCCAATGCTAAGCAATGCACTCTGATACCTGTTTCGGTCAGTTGTTCCACTGTTTTCCTGATATCCATTGCATTACAACCAAGGCGATCCAGTTTTGTCACAATCAATTGATCACCACATTTCAGGCGAGCAAGCAACCGGTTAAAACCAGGACGCTCACTGGTTGCTGCTGAGCCGCTAATGTGTTCTTCGATTATTTGCTGAGGTTTGATTTTAAAACCTGCACTTTCGATTTCCCGGCGTTGATTTTCGGTGGTCTGATCCAGCGTTGATATCCGACAGTAAGCAAAAATTTGAGACATAGTGAGACTCTATACGAAATTGGTGTTCATATCATAATGCATCTCAGAAAATAATTATGATTATTTTTGTGCATATTTGTATGTACACGTTCGAAAATAAACGAATGCGTATGCAACCCCGTAATTTTGGTGAGACCCAAAATCGATTTTGTGAAAAATGGCTTTAACTCGGTTTGTTTTTCGAGTTCCGGGCGGACTCAAGGAAGAAGAATAGTGTTGCGTGTTATTTTAACCAGATTTCAAGTTGTTTGGTCGTGGAAAAGTGGAGCAAAATGTTGTTAAAGTGGAAAAATGATAAAAAAGTAAGTTTATTATATTACATTTTACCATTTAAATTTTGGTTGTCTTTAAGAACTGATATCGCTGTTTGTAATAATTCTTTGTTATCCAGCCATGATTTTTTCTTTATGTTTCCTTCAATGTAATCAAGCAATGTTCTGGTATTGATAGGTCTTCCCTGTTTTGCTACTTCCACTACAGCATCCCCTAGGATAATTCTTACTTCAGGAAGCTGCGCAGGGAACCACTTTAGGGTGTCTTTTGATTTCATGAAGATATTCCTTAAAATATTATTGATTTTCATTGCGATATTGTATGTCTGATTCAGGATATGTTGACTTATACATCGGTTTTGTCTGGGTTATTGGATATGCCAATCCCTAATTTTATTAGAGCATGACTAAAAATGCTGAATATGATAAGGAGCGAAGTGATTATCAGTATGCTGTTCATATAGCCTCGAATTAGTAATGTGTTATATATGATATAGTTGACAATTTTTATCCTGGGTGTTCTTAAAGTTCGTAGATAAACATTGTCGTTTCAGGTATACAGGAATGCTAACAGGTGGCGGCAAAAATCAGGCGGTTTATGGCGCAAGCTGAAGTGGCAACTGCAAACTATCTTATGCAGAGACTCTACACGGATTGGGTTTAAAAGTATACATAGATAACAGTTTTTATCTGAAGAAGAAAAATATCAAGGTGATATAGCCTATATGCCTTTGATGCGGAGGAATGAATGTGATGGGAGTGATGTATCTGAATAGTTGAAAAACCGCAGTCACGTCGTATGCAAGAACGTGCTGCGGTTGGTTTGACTTTGATTGAGACGTTTTGGAATTTTTTTCGGTGGCAAAAATGGGGCAAAACGCTGCAAAAGGGGCAAAAAAGGGGCAAAAAAAGAGTGGATTATCGTAGCTTATTGTTGTCGCTGATGATATTTAACACATTGAAAAATAAGTAAAATACTTATGAGTCAGAGAGTTGTGATTTTTGCCCTTACTTGTTCAGGTTGTATTGTTCTTTCTTACTAATTTCTTGATTTTGCGACATTTAAAAGCGACTCAATTCGTTATATGGCATCAGAAGAGTATGCGTCATGCCGGAACGCCCAGCATAAGAAATCTGATATAAAAAACTGTGGCGTGTATGGTACGGATTAGAGGGGAAAATGTCAGCACATTTGCGAAATGAATCAAAAAGCCCGCAGCAATGTGCGGGCGTTAGTGTCAGCGCACAACCAGCACGGAGCACTCTGCGTGACGCACTACAGCTGCGGCGTTGGAACCGAGCAGATAAGTGGTGATATCCGGTCGATGGGAAGCAATGATGATCATATGAGCGGGGATCTTCTTCGCCAATTCCAGAATGCGGTCTTTGGGCGAGCCTTCCTCAACATGGACATGCACTCTGTCGGTTGGCAGTTTAAATTTTTTAATGATCTCTTCCAGTTGCGATTTGGCTTCCGCTTTCAGGTCATCCATTGCCGGTAATTCTGCGGAATACGCTAAACCCAGAGAGGCATAGTAGGGCAGTGAAGGTATTACCGTCAGGAAATGAACCTCTGCATCATCAATCTTTGCCTCTTCCTCAACGTGGCTAATCACGCGTTGAGTTAATTCTGAATCGGAAATATCGATAGGGACAAGAATCGTTCTGTTCATAAAACCTCCTGTTTTAGTATCCGCATAAAGTGTAACGCCAGATGACACTTTTTGTGTAATGACGGAGTTCACATTTTTAATTTAGATCAAAGGAGGAAGAATAAGCAGAAAAAGCCCGCCATAACAGCGGGCAGGAGGATTTAGAACTGATAAACCAGACCTAAAGCGACAATATCATCGGTAGAGATGCCATTGGCAGCGTAGAAGCTGTCATCTTCATCCAACAGGTTGATTTTATAGTCAACGTAGGTGGACATGTTTTTATTGAAATAGTAAGTCGCGCCAATATCGGCGTATTTAACCAGATCTTTATCATCAACACCTGCCGGGTTGTCTGCACCACCCGCAGCGTGCAGGTCACGGCCTTTAGACATCAGGAAAGAGACTGCCGGACGCAGACCAAAATCAAACTGGTACTGTGCAGTGACTTCAAAATTCTGGGTTTTGTTTGCCACAGCATAATCGCTGTCGCCAAACGGGGTCATATTACGCGTTTCTGAATACATGGTTGCCAGGTAAATATTGTTAGCATCGTATTTTAGCCCAGCAGTCCACGCGTCTGCTTTATCACCACCCGCCGCAGTATGGTTAACCTGGTCATTGGTGCGGTCAGAAGAGGTGTATGCCGCACCAGCGCTAAAGCCCATGCCTAAATCATATGTTGTGGAAAGACCCCAGCCGTCACCGTTTTCATGGCGAACATCACGTCCGTTGTTGGTGCCTTCCTGACCATTACTGGCTCCTTCGTTGTTACCTTGATACTGCACCGCGAAGTTCAGACCATTTACCAGACCGAAGAAATCAGTATTACGATAAGTCGCGACGCCATTGGCTCGACCAGTCATAAAGTTGTCTGCATTGGTATAAGAGTCACCGCCAAATTCAGGCAGCATATCGGTCCAGCCTTCGATGTCGTACATTACGCCATAATTACGTCCGTAATCGAAAGAACCGTAATCTGCAAATTTCAGCCCGGCAAATGCCAGACGGGTCCATGACTGGTTTTTTGAAGATTCAGTGTTGTTTGCCTGAATATTGTATTCCCATTGACCGTAGCCAGTGAGTTGATCGTTAATTTGGGTTTCGCCTTTAAAACCCAGACGCGCATAGCTCTGGTCGCCATCTTTCGCTGAATTATCAGAAAAATAATGCAGGCCATCAACTTTGCCATACAGATCTAATTTGTTGCCGTCTTTATTATAAACTTCGGCTGCATGTGCAGCACCTGCGGCGAGCAGGGCAGGAATTAAAAGTGCCAGTACTTTGCTTTTCAT
Protein sequences of DBSCAN-SWA_4 >CP028702|1839890:1870221|1858704_1858968_+|AVZ48961.1|DBSCAN-SWA MSEKLKIVYRPLQELSPYAHNARTHSTEQVAQLVESIKQFGWTNPVLIDEKGEIIAGHGRVMAAEMLKMDSVPVIVLSGLTDEQKQR >CP028702|1839890:1870221|1842838_1844212_+|AVZ48941.1|DBSCAN-SWA MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTGSGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLPNTKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTLVMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAAISGRVQRDPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNTKKDCQAVCDALNEVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDVAARGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEAQRANIISDMLQIKLNWQTPPANSSIATLEAEMATLCIDGGKKAKMRPGDVLGALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKTCRVRLLK >CP028702|1839890:1870221|1852649_1852829_-|AVZ48952.1|DBSCAN-SWA MQQLVPPPGVKGRDKMVHYEVVQYLMDCCGITYNQAVQALRSNDWDLWQAEVAIRSNKM >CP028702|1839890:1870221|1847060_1847255_-|AVZ51642.1|DBSCAN-SWA MRYDNVKPCPFCGCPSVTVKAISGYYRAKCNGCESRTGYGGSEKEALERWNKRTTGNNNGGVHV >CP028702|1839890:1870221|1860915_1861035_+|AVZ51646.1|DBSCAN-SWA MTTREGANKQVISSQADSLIKISRIWADFFPANTSNQPI >CP028702|1839890:1870221|1858505_1858787_+|AVZ48960.1|DBSCAN-SWA MVALKFILLLYCFQVFFGAPNIWLIYNCEFNIMLTHSLIQYLTLPATPPSPVALAAGVRFIHKARLYEREIKDSLSPITRIVTVCAQRQDAQY >CP028702|1839890:1870221|1850815_1851091_-|AVZ48947.1|DBSCAN-SWA MITNYEATVVTTDDIVHEVNLEGKRIGYVIKTENKETPFTVVDIDGPSGNVKTLDEGVKKMCLVHIGKNLPAEKKAEFLATLIAMKLKGEI >CP028702|1839890:1870221|1844340_1845276_-|AVZ48942.1|tRNA|DBSCAN-SWA MQENQQITKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGFPEHVLPEYLEKLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIQRFADAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIETMFSAMQNVVPSHLCDTNLFDFKGITHGSEVVNGGDLAFDREEIPLQPACWQPEEDENQLDELRLNVVEVK >CP028702|1839890:1870221|1862096_1862312_+|AVZ51645.1|DBSCAN-SWA MSGTCSHLPKGKTHDVLTGSDDGRHSNTSLAWGAGVQFNPTESVAIDIAYEGPGSGDWRTDGFIVGVGYKF >CP028702|1839890:1870221|1848113_1850714_-|AVZ48946.1|DBSCAN-SWA MSTKPLFLLRKAKKSSGEPDVVLWASNDFESTCATLDYLIVKSGKKLSSYFKAVATNFPVVNDLPAEGEIDFTWSERYQLSKDSMTWELKPGAAPDNAHYQGNTNVNGEDMTEIEENMLLPISGQELPIRWLAQHGSEKPVTHVSRDGLQALHIARAEELPAVTALAVSHKTSLLDPLEIRELHKLVRDTDKVFPNPGNSNLGLITAFFEAYLNADYTDRGLLTKEWMKGNRVSHITRTASGANAGGGNLTDRGEGFVHDLTSLARDVATGVLARSMDLDIYNLHPAHAKRIEEIIAENKPPFSVFRDKFITMPGGLDYSRAIVVASVKEAPIGIEVIPAHVTEYLNKVLTETDHANPDPEIVDIACGRSSAPMPQRVTEEGKQDDEEKPQPSGTTAVEQGEAETMEPDATEHHQDTQPLDAQSQVNSVDAKYQELRAELHEARKNIPSKNPVDDDKLLAASRGEFVDGISDPNDPKWVKGIQTRDCVYQNQPETEKTSPDMNQPEPVVQQEPEIACNACGQTGGDNCPDCGAVMGDATYQETFDEESQVEAKENDPEEMEGAEHPHNENAGSDPHRDCSDETGEVADPVIVEDIEPGIYYGISNENYHAGPGISKSQLDDIADTPALYLWRKNAPVDTTKTKTLDLGTAFHCRVLEPEEFSNRFIVAPEFNRRTNAGKEEEKAFLMECASTGKTVITAEEGRKIELMYQSVMALPLGQWLVESAGHAESSIYWEDPETGILCRCRPDKIIPEFHWIMDVKTTADIQRFKTAYYDYRYHVQDAFYSDGYEAQFGVQPTFVFLVASTTIECGRYPVEIFMMGEEAKLAGQQEYHRNLRTLSDCLNTDEWPAIKTLSLPRWAKEYAND >CP028702|1839890:1870221|1854446_1855304_+|AVZ48956.1|DBSCAN-SWA MLFVLILSHRAASYGAIMAALPYMQLYIADYLADTMHLSAEEHGAYLLLMFNYWQTGKPIPKNRLAKIARLTNERWADVEPSLQEFFCDNGEEWVHLRIEEDLASVREKLTKKSAAGKASVQARRSRKEADVQTKQERNLTGVQTDVEVVFEHDVNTKATNKDTDKDLKTDPPLNPPRGNRGVKKFDPLDITLPNWISVSLWREWVEFRQALRKPIRTEQGANGAIRELEKFRQQGFSPEQVIRHSIANEYQGLFAPKGVRPETLLRQVNTVSLPDSAIPPGFRG >CP028702|1839890:1870221|1861073_1862054_-|AVZ48965.1|transposase|DBSCAN-SWA MSHQLTFADSEFSTKRRQTRKEIFLSRMEQILPWQNMTAVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMRLFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQGTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSGLTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVDWLIAERPGKVKTLKQNPRKNKTAINIEYMKASIRARVEHPFRIIKRQFGFVKARYKGLLKNDNQLAMLFTLANLFRVDQMIRQWERSQ >CP028702|1839890:1870221|1842635_1842809_+|AVZ48940.1|DBSCAN-SWA MTTLIYLQIPVPEPIPGDPVPVPDPIPRPQPMPDPPPDEEPIKLSHRERRSARIRAC >CP028702|1839890:1870221|1853092_1853569_-|AVZ48953.1|DBSCAN-SWA MLSGKDLGRAIEQAINKKIASGSVKSKAEVARHFKVQPPSIYDWIKKGSISKDKLPELWRFFSDVVGPEHWGLNEYPIPTPTNSDTKSELLDINNLYQAASDEIRAIVAFLLSGNATEPDWVDHDVRAYIAAMEMKVGKYLKALESERKSQNITKTGT >CP028702|1839890:1870221|1868512_1868947_-|AVZ48969.1|DBSCAN-SWA MNRTILVPIDISDSELTQRVISHVEEEAKIDDAEVHFLTVIPSLPYYASLGLAYSAELPAMDDLKAEAKSQLEEIIKKFKLPTDRVHVHVEEGSPKDRILELAKKIPAHMIIIASHRPDITTYLLGSNAAAVVRHAECSVLVVR >CP028702|1839890:1870221|1851165_1851336_-|AVZ51644.1|DBSCAN-SWA MTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHEL >CP028702|1839890:1870221|1846564_1846780_-|AVZ48944.1|DBSCAN-SWA MAQVIFNEEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYNYPKINQFIKDS >CP028702|1839890:1870221|1839890_1841123_-|AVZ51641.1|DBSCAN-SWA MITHNFNTLDLLTSPVWIVSPFEEQLIYANSAAKLLMQDLTFSQLRTGPYSVSSQKELPKYLSDLQNQHDIIEILTVQRKEEETALSCRLVLRKLTETEPVIIFEGIEAPATLGLKASRSANYQRKKQGFYARFFLTNSAPMLLIDPSRDGQIVDANLAALNFYGYNHETMCQKHTWEINMLGRRVMPIMHEISHLPGGHKPLNFVHKLADGSTRHVQTYAGPIEIYGDKLMLCIVHDITEQKRLEEQLEHAAHHDAMTGLLNRRQFYHITEPGQMQHLAIAQDYSLLLIDTDRFKHINDLYGHSKGDEVLCALARTLESCARKGDLVFRWGGEEFVLLLPRTPLDTALSLAETIRVSVAKVSISGLPRFTVSIGVAHHEGNESIDELFKRVDDALYRAKNDGRNRVLAA >CP028702|1839890:1870221|1865738_1866314_+|AVZ48966.1|tail|DBSCAN-SWA MAFRMSEQPRTIKIYNLLAGTNEFIGEGDAYIPPHTGLPANSTDIAPPDIPAGFVAVFNSDESSWHLVEDHRGKTVYDVASGNALFISELGPLPENVTWLSPEGEFQKWNGTAWVKDTEAEKLFRIREAEETKNNLMQVASEHIAPLQDAADLEIATEEEISLLEAWKKYRVLLNRVDTSTAQDIEWPALP >CP028702|1839890:1870221|1856727_1856913_+|AVZ48958.1|DBSCAN-SWA MRKLKMMLCVMMLPLVVVGCTSKQSVSQCVKPPPPPAWIMQPPPDWQTPLNGIISPSGNDW >CP028702|1839890:1870221|1866411_1867002_-|AVZ48967.1|DBSCAN-SWA MSQIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARLKCGDQLIVTKLDRLGCNAMDIRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMQVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQAVFERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI >CP028702|1839890:1870221|1851685_1851964_+|AVZ48949.1|DBSCAN-SWA MPRALSRIHRITKFTLKRADISRTSRKKLMPPGLHTAMSLFTTGGALPPFKFNRQDRLFMDIGNAPSCCARFYFTTSGLRWSRLYPYSESLC >CP028702|1839890:1870221|1857109_1858567_+|AVZ48959.1|DBSCAN-SWA MNTSHVRVVTHMCGFLVWLYSLSMLPPMVVALFYKEKSLFVFFITFVIFFCIGGGAWYTTKKSGIQLRTRDGFIIIVMFWILFSVISAFPLWIDSELNLTFIDALFEGVSGITTTGATVIDDVSSLPRAYLYYRSQLNFIGGLGVIVLAVAVLPLLGIGGAKLYQSEMPGPFKDDKLTPRLADTSRTLWITYSLLGIACIVCYRLAGMPLFDAICHGISTVSLGGFSTHSESIGYFNNYLVELVAGSFSLLSAFNFTLWYIVISRKTIKPLIRDIELRFFLLIALGVIIVTSFQVWHIGMYDLHGSFIHSFFLASSMLTDNGLATQDYASWPTHTIVFLLLSSFFGGCIGSTCGGIKSLRFLILFKQSKHEINQLSHPRALLSVNVGGKIVTDRVMRSVWSFFFLYTLFTVFFILVLNGMGYDFLTSFATVAACINNMGLGFGATASSFGVLNDIAKCLMCIAMILGRLEIYPVIILFSGFFWRS >CP028702|1839890:1870221|1869087_1870221_-|AVZ48970.1|DBSCAN-SWA MKSKVLALLIPALLAAGAAHAAEVYNKDGNKLDLYGKVDGLHYFSDNSAKDGDQSYARLGFKGETQINDQLTGYGQWEYNIQANNTESSKNQSWTRLAFAGLKFADYGSFDYGRNYGVMYDIEGWTDMLPEFGGDSYTNADNFMTGRANGVATYRNTDFFGLVNGLNFAVQYQGNNEGASNGQEGTNNGRDVRHENGDGWGLSTTYDLGMGFSAGAAYTSSDRTNDQVNHTAAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPFGDSDYAVANKTQNFEVTAQYQFDFGLRPAVSFLMSKGRDLHAAGGADNPAGVDDKDLVKYADIGATYYFNKNMSTYVDYKINLLDEDDSFYAANGISTDDIVALGLVYQF >CP028702|1839890:1870221|1846858_1847068_-|AVZ51643.1|DBSCAN-SWA MYKITATIEKEGGTPTNWTRYSKSKLTKSECEKMLSGKKEAGVSREQKVKLINFNCEKLQSSRIALYSN >CP028702|1839890:1870221|1860686_1860941_+|AVZ48964.1|DBSCAN-SWA MYGVNAAGRPFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVMVGPSVRVNEWFSAYAMAGMAYSRVSTFSGDYLRVTDNKGRCE >CP028702|1839890:1870221|1845327_1846563_-|AVZ48943.1|integrase|DBSCAN-SWA MSKLPTGVEIRGRYIRIWFMFRGKRCRETLKGWEITNSNIKKAGNLRALIVHEINSGEFEYLRRFPQSSTGAKMVTTRVIKTFGELCDIWTKIKETELTTNTMKKTKSQLKTLRIIICESTPISHIRYSDILNYRNELLHGETLYLDNPRSNKKGRTVRTVDNYIALLCSLLRFAYQSGFISTKPFEGVKKLQRNRIKPDPLSKTEFNALMESEKGQSQNLWKFAVYSGLRHGELAALAWEDVDLEKGIVNVRRNLTILDMFGPPKTNAGIRTVTLLQPALEALKEQYKLTGHHRKSEITFYHREYGRTEKQKLHFVFMPRVCNGKQKPYYSVSSLGARWNAAVKRAGIRRRNPYHTRHTFACWLLTAGANPAFIASQMGHETAQMVYEIYGMWIDDMNDEQIAMLNARLS >CP028702|1839890:1870221|1851998_1852487_+|AVZ48950.1|DBSCAN-SWA MLDVFTPLLKLFANEPLERLMYTIIIFGLTLWLIPKEFTVAFNAYTEIPWLFQIIVFAFSFVVAISFSRLRAHIQKHYSLLPEQRVLLRLSEKEIAVFKDFLKTGNLIITSPCRNPVMKKLERKGIIQHQSDSANCSYYLVTEKYSHFMKLFWNSRSRRFNR >CP028702|1839890:1870221|1841377_1842361_+|AVZ48939.1|DBSCAN-SWA MEAIKGSDVNVPDAVFAWMLDGRGGVKPLENTDVIDEAHPCWLHLNYVHHDSAQWLATTPLLPNNVRDALAGESTRPRVSRLGEGTLITLRCINGSTDERPDQLVAMRVYMDGRLIVSTRQRKVLALDDVVSDLEEGTGPTDCGGWLVDVCDALTDHSSEFIEQLHDKIIDLEDNLLDQQIPPRGFLALLRKQLIVMRRYMAPQRDVYARLASERLPWMSDDQRRRMQDIADRLGRGLDEIDACIARTGVMADEIAQVMQENLARRTYTMSLMAMVFLPSTFLTGLFGVNLGGIPGGGWQFGFSIFCILLVVLIGGVALWLHRSKWL >CP028702|1839890:1870221|1855310_1856057_+|AVZ48957.1|DBSCAN-SWA MKNIATGDVLERIRRLAPSHVTAPFKTVAEWREWQLSEGQKRCEEINRQNRQLRVEKILNRSGIQPLHRKCSFSNYQVQNEGQRYALSQAKSIADELMTGCTNFAFSGKPGTGKNHLAAAIGNRLLKDGQTVIVVTVADVMSALHASYDDGQSGEKFLRELCEVDLLVLDEIGIQRETKNEQVVLHQIVDRRTASMRSVGMLTNLNYEAMKTLLGERIMDRMTMNGGRWVNFNWESWRPNVVQPGIAK >CP028702|1839890:1870221|1858948_1859308_+|AVZ48962.1|DBSCAN-SWA MSRSSDNDQYRSRNALIRRHIEKMDASLHVGTKEFDISKVSEVDSVDDLLIDNAARYLLKDWKGVGELVNGVEVALEYTAERGIALLKQNPELYWQILAEAASIAQGKEQQKQDTIKKP >CP028702|1839890:1870221|1851335_1851557_-|AVZ48948.1|DBSCAN-SWA MIAHHFGTDEIPRQCVTPGDYVLHEGRTYIASANNIKKRKLYIRNLTTKTFITDRMIKVFLGRDGLPVKAESW >CP028702|1839890:1870221|1854011_1854434_+|AVZ48955.1|DBSCAN-SWA MKIKHEHIESVLFALAAEKGQAWVANAITEEYLRQGGGELPLVPGKDWNNQQNIYHRWLKGETKTQREKIQKLIPAILAILPRELRHRLCIFDTLERRALLAAQEALSTAIDAHDDAVQAVYRKAHFSGGGSSDDSVIVH >CP028702|1839890:1870221|1859415_1859616_+|AVZ48963.1|DBSCAN-SWA MLKELLYAYSVISRARRYAGMAGVPLPLSLTEINEYLATHPVLIERDEFEAVIFALDDQYFQEQCV >CP028702|1839890:1870221|1867318_1867552_-|AVZ48968.1|DBSCAN-SWA MKSKDTLKWFPAQLPEVRIILGDAVVEVAKQGRPINTRTLLDYIEGNIKKKSWLDNKELLQTAISVLKDNQNLNGKM >CP028702|1839890:1870221|1853692_1853989_+|AVZ48954.1|DBSCAN-SWA MKKENYSFKQACAVVGGQSAMARLLGVSPPSVNQWIKGVRQLPAERCPAIERATRGEVLCEELRPDIDWSYLRRSACCSQNMSVKQLNDSNKSSFDHT >CP028702|1839890:1870221|1847311_1848121_-|AVZ48945.1|DBSCAN-SWA MTKQPPIAKADLQKTQGNRAPAAVKNSDVISFINQPSMKEQLAAALPRHMTAERMIRIATTEIRKVPALGNCDTMSFVSAIVQCSQLGLEPGSALGHAYLLPFGNKNEKSGKKNVQLIIGYRGMIDLARRSGQIASLSARVVREGDEFSFEFGLDEKLIHRPGENEDAPVTHVYAVARLKDGGTQFEVMTRKQIELVRSLSKAGNNGPWVTHWEEMAKKTAIRRLFKYLPVSIEIQRAVSMDEKEPLTIDPADSSVLTGEYSVIDNSEE >CP028702|1839890:1870221|1852483_1852639_-|AVZ48951.1|DBSCAN-SWA MQKIDLGNNESLVCGVFPNQDGTFTAMTYTKSKTFKTETGARRWLEKHTVS >CP028702|1839890:1870221|1862376_1865739_+|AVZ51647.1|tail|DBSCAN-SWA MAVKISGVLKDGTGKPVQNCTIQLKAKRNSTTVVVNTLASENPDEAGRYSMDVEYGQYSVILLVEGFPPSHAGTITVYEDSQPGTLNDFLGAMTEDDARPEALRRFELMVEEVARNASAVAQNTAAAKKSASDASTSAREAATHAADAADSARAASTSAGQAASSAQSASSSAGTASTKATEASKSAAAAESSKSAAATSAGAAKTSETNASASLQSAATSASTATTKASEAATSARDAAASKEAAKSSETNASSSASSAASSATAAGNSAKAAKTSETNARSSETAAGQSASAAAGSKTAAASSASAASTSAGQASASATAAGKSAESAASSASTATTKAGEATEQASAAARSASAAKTSETNAKASETSAESSKTAAASSASSAASSASSASASKDEATRQASAAKSSATTASTKATEAAGSATAAAQSKSTAESAATRAETAAKRAEDIASAVALEDASTTKKGIVQLSSATNSTSETLAATPKAVKSAYDNAEKRLQKDQNGADIPDKGCFLNNINAVSKTDFADKRGMRYVRVNAPAGATSGKYYPVVVMRSAGSVSELASRVIITTATRTAGDPMNNCEFNGFVMPGGWTDRGRYAYGMFWQYQNNERAIHSIMMSNKGDDLRSVFYVDGAAFPVFAFIEDGLSISAPGADLVVNDTTYKFGATNPATECIAADVILDFKSGRGFYESHSLIVNDNLSCKKLFATDEIVARGGNQIRMIGGEYGALWRNDGAKTYLLLTNQGDVYGGWNTLRPFAIDNATGELVIGTKLSASLNGNALTATKLQTPRRVSGVEFDGSKDITLTAAHVAAFARRATDTYADADGGVPWNAESGAYNVTRSGDSYILVNFYTGVGSCRTLQMKAHYRNGGLFYRSSRDGYGFEEDWAEVYTSKNLPPESYPVGAPIPWPSDTVPSGYALMQGQAFDKSAYPKLAAAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTKTTSSFDYGTKSTNNTGAHTHSVSGSTNSAGAHTHSLANVNTASANSGAGSASTRLSVVHNQNYATSSAGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA |
39 | Escherichia_phage(41.38%) | integrase,tail,transposase,tRNA | attL 1840716:1840730|attR 1872496:1872510 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2062780 : 2081991
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP028702|2062780:2081991|DBSCAN-SWA TTTACTTCACGGAATATTTTGCCACGGTCGCTTTCGCGCCATGCGCTAATAAAGACAAGTACGTTTCCGTCACTCTTGCAGTAAACAAACTATTGTCTGGCAAATCATCACCAAAGATCGCCTTAATCGCCAGCAATGACTGGACGCGCGCTTTCCCTTCGGCACTACTTTGTACAGCCTTCTGAATAACAGGTAACAGTGGGTCACTGATTTCTATCGGATTTCCCTGTTCATCAACACCACCGACATAACGCATCCAACCCGCGACGCCCAGCGCCAGCAGATCGAACTTGCTGTCATGCGCCAGATGCCAGCGAACAGAATCCAACATCCGCTGTGGCAATTTCTGGCTACCATCCATCGCAATCTGCCAGGTTCGATGACGTAACGCCGGGTTGCTATAGCGTGCAATTAATCGGTTAGCGTAATCTTGCAAATCAACGCCCTGCACTTTCAACGTCGGCGCTTGTTCCTGCAACATCAAGCCATACGCCGCATAACGATAATGTTCATCTTCCATACAGTCATTAATGTGCTGATATCCTGCAAGATACCCCAGATACGCCAGGAATGAATGACTGCCGTTGAGCATGCGCAACTTCATCTCTTCATAAGGCAGCACATCGCTAACCAGTTCGGCTCCCGCTTTTTCCCATTCCGGACGTCCGGCAACAAAGTTATCTTCTATTACCCACTGGCGGAAAGGTTCACAGGCAACGCCCGCAGGATCGCGCACACCGGTAAGTTGTTCGATTTTCGCCAGCGTATCCTCTGTCACTGCGGGCACAATACGGTCCACCATTGTTGATGGGAAAGTCACGTTATCTTCGATCCATTGTGCCAGTTTTACATCAACGGCTTGTGCGTAGGAAGTGACAACGTCACGCATAACATGACCGTTTTCTGGCATGTTGTCACATGACATGACGGTAAATGCGGGAAGTCCTGCCGCTTTACGGCGAGCCAGCGCCTCAACAATCACCCCTGTTGCTGTTTTCGGCTGGTGGGGATTTTGCACGTCGGCAGCTACCATCGGGTGATCGAGCATTAACTGTCCGGTCGCCGGAGAGTGGAAATACCCTTTTTCGGTGATTGTCAGAGAGACAATCGCGATTTGCGGTTCACACATCGCTGCCAACACGGTTTCTAAGCCATCTATCTGTACGTGCAAGGCTTTTTTAACGACGCCAACGACGCGAGCCGTCCACACATCGGCCGACATTTCCGCAACGGTATAAAGATTATCTTGCTGTTGTAAATCGGCAATTTGCTGTTCGCCGCCGATTAAGTTGACCTCATAATATCCCCAGTCACTGAAATGTTCCGTAGCAAGAATATCGGCATACACACCCTGATGCGCACGGTGAAATGCACCAAAGCCTAAATGAACAATTCTTGGAGCCAGGTTATTAAGATCATAAACAGGGAGTGTCGCTTTTGCTGATAACAAATTATTTCCCATAACAATTCCTTAAATATAAATATGGCAAGCTATATGTTTTGTTATATGAATAAAAATCCCCTCTCCGGTAAGAGAAGGGATTAAGGGTTTACAGACTTCTGGAAGGTTGCGCAGCTCTTACAACACGCGGTTGATCTTCCGCAGCGTCTTCCAGCGCACTTAAATCACGGTCTTTCACCTCTGGCATTTTCAGCGCAGAGATTAAACCAATCACTGAATATGCCATGATCATAATGGCGATCGGATACCAGGATTCCGTCATGGTGCAGAAAATACCCGCCAGGATAGGACCAAAACCGGAAGCGATAAGACCACCAATTTCTTTAGAAATAGCCATCCGGGTAAAGCGGTTTTTACAGCCGAACATTTCTGCCATGGTAATGTTTTCCAGAGCAAATAATCCCAGCACCGCACAGTTATGAATCACAATCAGTGCAACCATAATGGTGCTCGGGGCATAGCTTTTATCTACAATGATAGAAAGCATTGGCCATGCCAGCACAATCGCGGAGGTATTCATAATAATATACGGGATCCGGCGACCAATTTTATCGGATAACCAACCAAGGAACGGAATGGTCATAAAGCCGAGAATCGAACTGATCATCAATGCATCTGTTGGAATTGCTTTGTTAAACAATAACGTCTGCACTAAATAGCCTGCAAGGAAAGTCTGAATTAACCCGGAGTTACCCGCCTGACCAAAACGCAGCCCTGTTGCCAGCCAGAAGGATTTGCTCTGGAACATGCTACCAGCAGGTGCAGGTTTTGCTGTCGGTTGGTTACTGTCGTTAACCTTCTCAAAGACCGGGCTTTCTTTCAGATTCATACGCAACCAGATAGCAAAGACCATCACGACAACACTCGCCAGGAACGGTATACGCCATCCCCACGCCAGCAGTTCCTCTTTACTGAGAATGAAGAACATAAAGGCCCAGATTGCCGTTGCGCTCAAGGTTCCGCAGTTAGTTCCCATAGCCACAAATGAGGAGATAATTCCGCGCTTACCTTTTGGCGCATATTCCGCCAGCATCGTACCGGCACCGGAAATTTCCGCACCTGCACCCAACCCCTGAATAATACGCAACGTCACCAGCAAGATGGGTGCAAAAACACCAATCTGTGCATAGGTCGGTAACACACCAATTAAGGTGGTACAGATCCCCATCATGGTGATGGTAATAAAGAGCACTTTTTTACGCCCTATTCTGTCGCCCATTTTGCCGAAAATAAATGCTCCGACAATACGCGCCACATAACCTGCACCGTAGGTTCCCATTGCCAGAATTAACGCCATTGCCGTTGATGATTCAGGAAAAAATATTTCATGAAACACTAACGCTGCGCCGAGCGAATATAACTGGAAATCCATAATTCACAGGTGTTTTTTCCCATCCTGTGGTTTCCTTGGCGTTTTCTAGGTTTTTTCAGATAGTTGCATTTTTTTAAAAAGCATCCTAAGTTCGATCTCAGTGTCTATCTGGGGCCTATTTCTGTCCCATATATGCCCCAAAAAAACTCCCCAACAGATAAGTAGTTTTTTCATGGATTTATGCGTAAAATCAAGAACGGCTGGAAATCATTCAATACTCACACTATCGAAAAATTTACCAGCCAATCGCAGCACGTTCTTGCATAAGGTGTGTCTGCGGTTTTTCAACTATTCAGATACATCACTCCCATCACATTCATTCCTCCGCATCAAAGGCATATAGGCTATATCACCTTGATATTTTTCTTTTTCAGATAAAAACTGTTATCTATGTATACTTTTAAACCCAATCCGTGTAGAGTCTCTACATAAGATAGTTTGCAGTTGCCGCTTCAGCTTGCGCCATAAACCGCCTGATTTTTGCTGCCACCTGTTAGCATTCCTGTATACCTGAAACGACAATGTTTATCTACGAACTTTAAGAACACCCAAGATAAAAATTGTCAACTATATCATATATAACACATTACTAATTCGAGGCTATATGAACAGCATACTGATAATCACATCTCTCCTTATCATATTCAGCATTTTTAGTCATGCCCTAATAAAATTAGGGATTGGCATATCCAATAACCCAGACAAAACCGATGTATAAGTCAACATATCCTGAATCAGACATACAATATCGCAATGAAAATCAATAATATTTTAAGGAATATCTTCATGAAATCAAAAGACACCCTAAAGTGGTTCCCTGCGCAGCTTCCTGAAGTAAGAATTATCCTAGGGGATGCTGTAGTGGAAGTAGCAAAACAGGGAAGACCTATCAATACCAGAACATTGCTTGATTACATTGAAGGAAACATAAAGAAAAAATCATGGCTGGATAACAAAGAATTATTACAAACAGCGATATCAGTTCTTAAAGACAACCAAAATTTAAATGGTAAAATGTAATATAATAAACTTACTTTTTTATCATTTTTCCACTTTAACAACATTTTGCTCCACTTTTCCACGACCAAACAACTTGAAATCTGGTTAAAATAACACGCAACACTATTCTTCTTCCTTGAGTCCGCCCGGAACTCGAAAAACAAACCGAGTTAAAGCCATTTTTCACAAAATCGATTTTGGGTCTCACCAAAATTACGGGGTTGCATACGCATTCGTTTATTTTCGAACGTGTACATACAAATATGCACAAAAATAATCATAATTATTTTCTGAGATGCATTATGATATGAACACCAATTTCGTATAGAGTCTCACTATGTCTCAAATTTTTGCTTACTGTCGGATATCAACGCTGGATCAGACCACCGAAAATCAACGCCGGGAAATCGAAAGTGCAGGTTTTAAAATCAAACCTCAGCAAATAATCGAAGAACACATTAGCGGCTCAGCAGCAACCAGTGAGCGTCCTGGTTTTAACCGGTTGCTTGCTCGCCTGAAATGTGGTGATCAATTGATTGTGACAAAACTGGATCGCCTTGGTTGTAATGCAATGGATATCAGGAAAACAGTGGAACAACTGACCGAAACAGGTATCAGAGTGCATTGCTTAGCATTGGGTGGCATTGACCTGACCAGTCCAACAGGAAAAATGATGATGCAAGTAATTTCAGCAGTCGCTGAATTTGAACGAGACCTTTTACTTGAACGCACTCATTCCGGGATAGTAAGAGCCCGCGGCGCAGGGAAACGTTTTGGTCGACCACCTGTGTTAAATGAAGAACAGAAACAGGCGGTATTCGAACGAATTAAGTCAGGTGTAAGTATAAGTGCCATTGCCCGGGAATTCAAAACCTCGCGGCAAACCATTTTAAGAGCCAAAGCAAAACTTCAGACACCTGACATATAAAAAATAATCTCGGTGTGAGATGCTTTACGTCTTCCAAGCCCCCTTCCTTGCCGTAAATGGAAAGATACATCTAATTATAGAATTTATATGTTTTACCCTACGGCAGTGCTGGCCATTCAATATCCTGTGCAGTTGACGTATCAACACGGTTCAGCAATACCCGATACTTTTTCCATGCTTCCAGCAACGAGATTTCTTCCTCCGTTGCAATTTCCAGATCTGCAGCATCCTGAAGCGGCGCAATATGCTCACTGGCTACCTGCATCAGGTTGTTTTTTGTTTCTTCCGCCTCCCGGATCCGGAACAGTTTTTCTGCTTCCGTATCCTTCACCCAGGCTGTGCCGTTCCACTTCTGAAACTCCCCTTCCGGCGATAACCAGGTAACATTTTCCGGTAACGGACCGAGTTCAGAAATAAATAACGCGTTGCCGGAAGCCACGTCATAAACCGTTTTACCCCGATGATCTTCAACGAGATGCCACGATGACTCATCACTGTTGAAAACAGCCACGAAGCCAGCCGGAATATCTGGCGGTGCAATATCGGTACTGTTTGCTGGCAGACCTGTATGAGGCGGAATATATGCATCACCTTCACCAATAAATTCATTAGTTCCGGCCAGCAGATTATAAATTTTTATGGTCCGTGGTTGTTCACTCATTCTGAATGCCATTATGCAAGCCTCACAATATAGTTAAATGCGATGTTTTTGACGGTGTTTTCCGCGTTACCAGCAGCGTTAACGGTGATGGTGTGTCCATGTGAACCAATCGCAACGGAGTGCGTATGAGCACCAATACCGACAGTATGTGCATGCGCGCCTGCGCTTGCAGCAGTGCCGGACAGCGAGTGGGTATGAGCACCATCTGATGATGTCTTCCCTGCATTACGAGTCTGGCCACTACCGCTTGTTGTGCTCATAATCCCCGCGCTTAGATTTGAAATCGCGGTATAACCATTAGGGAAAATGCTCGTGTTCGTGCCACCAAATGCACCGGAACTCTTGTGTTGGTGCGCACCGGCACTATTTGCGGTCCCGCTAATACTATGGGTATGCGCCCCGGTGTTATTCGTGGATTTGGTTCCGTAATCAAACGACGATGTGGTTTCCGTCCCCAAATCCGTACTGGATGCGCTGGCGCTGTGGGTGTGCGATTTAATGCCGTCCTGTTCCTGAGATAATACGGCCCGACCACTGGCGGGCTTGCCCTTAATCGTCCAGCCACGCATATCAGGGATCACGCCTGACGGATAAGCAACTGCAAGTTTCGGGTATGCAGATTTGTCAAAAGTCTGCCCCTGCATCAGGGCATAACCAGACGGAACGGTATCTGATGGCCACGGGATTGGTGCACCGACTGGATAAAACTCTGCAGGAGGATGAGCCGAGGTGTAAAGCTGCGCCCACGGCGACCAGTTTGCGTCGGTCGTATCCCGTCGTGAACGAATAAATGCCGGAGCATGAGCACCGCTTGTACCACTCCAGCCGATGAGTAACTCACCTTCGCCAACGGCTGTCATCCCTTTCAGGTGAATGATATTTCCATACGCTGTTGGATATCCGTTGTTATACACCTCGTATAACTCAAGACCTGCTGCCCCCTGCGTATTGTCTGTCAGCGCGGTTATATTCACTCAGCAACCCCGGTATCAGTTCATCCAGCGCGGCTGCTTTGTTCATGGCTTTGATGATATCCCGTTTCAGGAAATCAACATGTCGGTTTTCCAGTTCCGGAAAACGCCGCTGCACCGACAGGGGGATCCCGTCGAGAATACTGGCAATTTCACCTGCGATCCGCGACAGCACGAAAGTACAGAATGCGGTTTCCACCACTTCAGCGGAGTCTCTGGCATTTTTCAGCTCCTGTGCGTCGGCCTGCGCACGCGTAAGTCGATGGCGTTCGTACTCAATAGTCCCTGGCTGGAGATCTGTCTCGCTGGCCTGCCGCAGTTCTTCAACTTCCCGGCGCAGCTTTTCGTTCTCAATTTCAGCATCCCTTTCGGCATACCATCTTATAACGGCGGCAGAGTCATAAAGCACCTCATTACCCTTGCCACCGCCTCGCAGAACGGGCATTCCCTGTTCCTGCCAGTTCTGAATGGTACGGATACTCGCACCGAAAATGTCAGCCAGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAAAGCTCGCTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGAGGGTATTTTAAATAAAAACATTAAGTTATGACGAAGAAGAACGGAAACGCCTTAAACCGGAAAATTTTCATAAATAGCGAAAACCCGCGAGGTCGCCGCCCCGTAACCTGTCGGATCACCGGAAAGGACCCGTAAAGTGATAATGATTATCATCTACATATCACAACGTGCGTGGAGGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATCGTATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGACACTGAATACGGGGCAACCTCATGTCAACGAAGAACAGAACCCGCAGAACAACAACCCGCAACATCCGCTTTCCTAACCAAATGATTGAACAAATTAACATCGCTCTTGAGCAAAAAGGGTCCGGGAATTTCTCAGCCTGGGTCATTGAAGCCTGCCGCCGGAGACTGTGCTCAGAAAAAAGAGTTTCTTCTGAAGCAAACAAAGAAAAGAGTGACATTACTGAATTGCTCAGAAAACAGGTCAGACCAGATTGAAGCAATTTAGATAATCGTGCAGACTACGCCCCCTCATATCACATGGAAGGTTTATCTATGGATCAGGTAGTCATTTTTAAACAAATATTTGATAAAGTTCGAAACGATTTAAACTATCAATGGTTTTATTCTGAGCTAAAACGTCACAATGTCTCACATTACATTTACTATTTAGCCACAGAGAATGTTCATATTGTATTAAAAAATGATAATACAGTGTTATTAAAGGGCCTAAAAAACATTGTGTCTGTCAAATTTTCAAAGGATAGGCATCTTATAGAAACGACCTCTAATAAGCTGAAATCCAGAGAGATCACATTTCAGGAATACAGAAGAAACCTTGCTAAAGCAGGAGTTTTTCGGTGGGTTACAAATATCCACGAACAAAAAAGATATTACTATACCTTTGATAATTCATTACTATTTACTGAAAGCATCCAGAAAACTACACAGATCTTACCACGCTAAACCATAACGTCCGGCTTCTCTCACTCCTGAGCCGGACTGCATTGGTTTAATAAAAACCATCAACAATTGTGATTTAGATATTCGGAACCATTCAAATATAACAAAACCCCGTAAAAACGAGGTTTATGGATAAATTTTATTATTGAATACATCAGATTAAATTAATCTTGACATCATAGCTTTCAAGACCCGTCATTTTTTCCCGTGCGGTAAACTGAATACTGGTAACTTCTTTCCCGGTCTTTTTCTTAAGTTCAATAATTTTTTTTGTTATATATTCAGAAATATCTGCTTCTGCTTTTGTTTTTAAGTTTTCAATATTCATCATTTCCTCTTTTAGTCTGTTATGACTTTCCAGTTACACAGTAAGTCGATTATATGGTGCAAACGTGTAAAAGATAAGATGAAACATCGCAATAATCAACATACGATAGTCTAAATTTTACACAAACAGACAAAGAGAATTTTCCTGAATTATCAATGCAATAGCATCAAATCAACTCAAGAGCCTTATTGCTGCTTCCAGAATTTCTTCTGAAGTAACATGTCGATCCGCGGCTACATAAATGACTTTATGATCTCCGGTCAGAGATGGAAACCCTGCGGCCATTACAGTAAGGTGTGTTTTTTCGCCATTTGGATATTCACGCATGATGGTGTTAACTCCAGTCATCGCTGGCACTACCACTGCTGGTTCAGAGTTAAAAAAACTATGATTTTTTTCATGATGTTACCGTAGTATGTGAGTATCCATCGAATAGACACCAAGCAAAAAAGCTCCCGAAGGAGCCTTCATTTTCACTTTTTTAAATCCAACGACAGACGGCTGGCATTTAAGTATTGTGAAATATTATCAAATGTAATCATCATTGATTTACAAAAGATACATTTTGCCCCGAAAGGATTCATGTCAGAAACATCAAAAGATGATGTTCTATACTGGGAACCATGACAACACGGGCATCTAAAGTGAATATGGTTTGTAATATTGTCTACCTCAAAGCGCCACTACATGAACAGCGGCAGGACCTTTAGGTCCGTTCTCAATACCAAATTCAACTTCCTGATTCTCAGTTAATGTTTTGAAATCGTTGCTCTGAATTGCTGAGAAATGGACAAACACATCTTTGCTGCCATCTTTCGGCGTGATGAAACCAAAACCTTTTTCAGGGTTAAACCATTTCACTAAACCAGTCATTTTGTTAGACATAATTATTACCTTTTGAAGAAATTAGCCCTTGGGCAGAATGGTCCGAAAAAAAATATCAGAGAGAAAAACCAACAAGGAAATCTCAAGAGGTACAAATAATAAAATTATAACAATGACTGCTTCAGATAAATTTGTAACAAACCAGAACACCATTAACGCATGATTAACCACCCATAGCAAGGATTACTTTTGTAAAGAAAAACACAGCAATGAAAGAATAGCTTTATTTATTAATAAAACGTGTCATTCTGATTAAGACCTTTTATCTTACCCTTAAGATTTCAGGAATTTTGGCTCATGGAAGAGTCCTTTTTATTTAAATTTTACATTCCGCGATGTAAATGTTCCGATTTAATATTACCCTACATTTGATGCTTTTTATCTCTTAAAGATTCATAGATCTGTTGACAAGTCACTCCTGCGATGTAGCGTTCGTCAGCAATTTCAGCATAAAGCTGAGCTTCTGCTGCAATATCTCCGAGCATGTTGGTGAGCATTCCTTCGGCGGTTTTGGTTGTTTTGCCTCTGACGGCAGCGGCAAGATCTGCGGTATGCTTCGCTGCGTCAAGGCGTATGGCATATTTTTTTGCTTCGGCACGCAACTGGTTAACACTATCAGACAGATAAGCAGCCCTGGCAGAAATTTCAGCAGATTTCTGTTGCGCATCTTTAACAGCCTCATCACGGGCTATAGTTCGCCCCTGTTCAATTATTCGAGCAGCAAATTGAGCATTTACCTCTTGTGATAATGCGGCAGCATCACGTTCCGCCCATTTTTTTTGCCATCCTCGGTCGCTCCAGACATTTCCGACGATAAATCCTGACAACACGAGAAAAATCACCATGAATATCTGATTCACTGTTCTATCCCCCAGCAGGTTAATGCGCTCTCCTGGTCACGACGAATAACCTGACCGTAACAGTTATTTGAACGAATGCGGCAATCGCGTCCGCCATCCTTAATCCACCAGCGAATCGCTTCGCATGCACCTTTACGATCACCAGCATTCAGCCGCTTATAAAACGTCGACGGGAAACACTTACCGGGGCCAATGTTATAGGGACAAAATGACGCGATACCCGCTTTTTGTGGTTCGGTCAGTGGTACTTTAATATTGCGCTCCACCCATGCCAGCGCCTTATCACGCTCAATGGCGTTGACCTGGTCGCATTTTTCCTTCGACAGTTTCATATTGGGAAAAACGGTTTTTCCATCCACCACTGTGGCACCCCGACAGATGGTCCATATGCCAGAACCATCGCGGTATGCCATTGTGTGGTTACCTTCTTTTTCGTCCAGAAACTGGTCAAGTATCTGAGGAGCAGATGCGCCAGCACCAATCAGCGCCAGAACGGCAGCCGACAGGCCGTATCTGATTTTTGTGTTCATAGATATTTATGATGAGGACGCTCGTGCTTATTGGCAGGATTTTCAATCTTAAAGGAGTACTGATGCTGCAGATAAGACTCAACTTTTTCTGACAATTTTTCTGCTACTTCCAGGAAGACTTGCCGGACGCTCCTTCTGGCTGCTGCCTCATAAAACTCCAGCGCAGCTCCTTCAACACGGTCCATGGCGACATCCAGGTCAAAAATTTCACCGTCAAAGCGTTCTTTGTCCTGTAAGGCTACAGTTACCGTAACTTTATTCTCAAAATTACGGACTCCTTTCACAACCAGTTCATAGTCTTGAGTCATTGGATTACTCTCCTCTCGCAGCCTTACGCCTGTCTTCTTTAATCTTGAAATAAAGATTTGTCAGATACGTCAGCAGGCCAAAAACCAGGCTACCCAGCACACCGATTGCAGCCCACTGTGACGGAGTTACTTTATCGAGTAACTGCAATGCCCAGAAACCAGCATTACCCGCCGATGTGCCATAGGCAACACCTGTTGTTAACTTATCCATTGATTTCATATCCTCACCCCGATGTACACGGATGGTGCAATATGTTTGAAAAGATCGGAGTCTACGGGGTAGTTTTGACAGCACACGTTGTTCTCAACGGCGCTAAAAAAACATACACATTAAAAATGTGGGTAATTATTTTGAAAGAAAGTCATATATAAAATAATAATACGAGAAATGTTTTCATATTTAGTGTACTGTATACGGCCATTTATACAGGAAAAGCCTATGTCAGAACGTAAAAACTCAAAATCACGCCGTAATTATCTCGTTAAATGTTCCTGCCCAAACTGCACCCAAGAGTCAGAACACAGTTTTTCAAGAGTACAAAAAGGTGCCCTTTTGATCTGCCCTCATTGCAACAAAGTATTCCAGACAAATCTTAAAGCTGTAGCCTGATTGATTTTATTAGTAACAAGTATTTTTTATATTTTAATAATATATTTAAAGCAGATAATAAAAAACCCGCCTGAGCGGGTTTGAGATTGTGGTGCTTTTTGTGGGAGTCATCCACTTACGCACTTTGTTTTGCCATGCCAGCAGTTAGCTTCTGCTGTAAAACTATTCATGCAGCAAACCTGCACTTCACCACAATGGTTAGCATACTTTTCCTGATTAAGATTTTGCCAAATATGCTAGCCATTGTTTCATGTATTGGACCTCCTTACTTTTTATTAAAGAGATCCAATATTCACTACTCTGTCCGTATCTCTACTCAGGCATCAGCCTTCTTCGTTATCGTATACAGACGAGCGATGAATTTTAATCAGTAATGATGACATTTGCTGCTGCAGGACCTTTAGCACCACTCTCTATAGAGAAGGTAACCTTTTGACCTTCAAATAAGGTTCGATAATTATCATTCTGAATCGCAGAAAAATGCACAAACACATCTTTACTACCATCAACAGGAGAAATAAAGCCGAAACCTTTATCAGCGTTAAACCATTTTACTAAACCAGTCATTTTATTTGACATTCTACATTCCTTAACTTGAGCCTTTCGGCATAAATGGTTTGCATAACAGAAACGACTTCGTACTTAATTGGAGAGACTCAAAGAAGGAATAAGTGAATAACACCTGAAATGAGAACTGCTTTAGTAAACTACTTCGTATATCGTCTGTTCTTCAAACCGACGCAATCATTAACGCATAGTTGAACATATGAAGCAATGTTTATTTTAGACATCCAGCCATCTTCAACCCCATCAAAAAACTATAGCTTTCTTCAGGAACGTGTGTATAGTGCGCCAAGTTATCAGTATTAAGGAATTTTTTTGTCCCGTAAAATGACAGGAATTGTCAAAACCTTTGACGGCAAAAGCGGCAAGGGTCTTATCACCCCATCCGATGGTCGTATCGATGTCCAGCTTCATGTTTCAGCGCTCAATCTCCGCGATGCAGAAGAAATTACCACCGGATTACGCGTGGAATTTTGCCGGATAAATGGTCTGCGTGGCCCTTCAGCTGCCAATGTTTACCTTTCATGAGCTATATTAAAGCTTTAATTTCAGGCCCCATCGGATCACACATGGAGAGTTTTTATGAATAACCCCGTCTGTCTTGATGACTGGTTGATTGGCTTTAAAAGCTTATGCTGTACTTTGGCCGTAATAGCTCTGCTAATAATATAATAAGCAGACTCATTGTGTTTAGGGACATTGTACTGGAAGAAAACATTTTAAACATCAGGCAAATAACCAAGTCACCAGCTAAATAATAAGTTAACAGACATGAGTCCCGGGATGAGATTCAACATTACCATTGCCCCATTTAAAGCACAAAACCCGCTCATCAGCGGGTTTTCTACTTTTTCTTAACGTCGGGTATACAAAGCCCATCGTTGAAAAAATTTTATCCATATTTTTTGAAAAATGCAAACATCATGTCGCCATCTTCAGCAAAAATCATTTATCTCGTCACCTTCCTCAATTGCGCTTCCGCGTATGCTTCTTCCTGCCAGCACTTTGTTACCAGTTTACCAATGACGTCCGCATACCCCTTATACCACTGATAATCGGTCAGGTCTGGTACCAGCTTCTGGACATGACGTCGTGCCAGCGTGGTCGGTAAACGACTAAACCGGTTTCCATTACAACGCCCACAAATCTTATATACCGGTACGCCATGAAACCGGGTTCTTTTTTCATCCAGAACAATCCCTTTACCCTTACACCCTCTGCACGCTGTGCTGGCTTCGCCCTTACCATGGCAATGCTGACATAGTTCCTTCACCCATTCTTCCTTGATTACAGATTCCCCGCGTCTGTAGTGTTTCACCACTTCGCGCAATACATTATAAAATCCCGTACCTGAACAATGCTCACAGCGAGCCTTACTTGCCGCAGACCTGGAGTAATCAGCAAAGGCAAAACTCACGAGGTAAGGAATAATCTGTAACCGGATTTCTTCACTCAATTTGTTCAATGTCGGGTTATCCAGTGCCATCGCGTAATTTAGCAGGCCTTCAATCGCAAACTGAGGGTCCTGAACACCAACTTTTGCCAGAAATAAGGCCAACCCAAGTGGTGCTTTCGACTGCACCATCCCCTGCGCTGCCATTACATCCGTAATTGTTAAACAACCGGTGCCTGTCGCTGGAGCGTCATCGCTCAATTTTGGAGATTTTGGGGAGTAATATTTTGGTAAGGCTTCAAGGTTCATGCTCGTTCTCCACTTACGCCAGTACGCCAATTGCCAGCGCGCGATCGATAAAACGAAATATCAGCTCCAGTTGGGAGCCATACTTATCTTCAAATGCCACTGTATCCGTATGCAGCTCGTTGTGATGCTTTCTGCACAAAGGCAACACAAAGAGATCATGTGCTTTTGTTCCCATTCCGCCCTGCCCGTGACCAATCAGATGATGCGGATCGTCGGCTGGCATACCGCAGCAAGCACACGGCTGTGTCTTAACCCAACGTGTGTATTTCTCCTTAACCCAGCGGCGACGTTTAGGCAGCTTCATGAAAGATTCCGGAGACTCTGGATCAACGGTGATGCTTACCACCGTCTTTTCCTGTGGTGATTTTTGTTGCTGGTGGGCGTAAGGCAACGGTGCAAGATTTTTTGTGCGTTGTTTCAATATGCTGGTGGCGGTCTGCTCTCCCGGTACGATGTCGCTTTCGCGGTACACCGAGCAGATTTTTTCCGCTGGTAATCCCAGCGAACGACGCGATACAGCCTCAGGTAGTGCATCCACCACCTGATTGCAGACCGCCCACCAGGATAATTCAGCCAAAGATAATTCCCGCTCCTGCGTACCGCTTATTGCGTGACGGATGACGTCAATCACCCATGCTGTCAGATTTTGTTGAGCAAGCAGCTCCAGTGATTCCGATGTCTGGTCACGCAGTTGGTTGTCGCAGTGCCAGCACAACACCATTGCGCCGGTACCATAACGGTGAATGACTGTTTCAGTGTGATGGTAATCGCCATTAGGCCACTGGCAGGATGTAACATGACGTAATAGCCAGTCGGACAATGCGCCAACGCCGCCAGCAGCACGAATCACCCGTTCGTTACTAAAAAACGGCAGCAATGTTTTGTCTTCCGCCAGCGGCTGGCGAACGGCAGGAACGACTCCGGATGGCAGATTACGCATGCTTTTTGGTTCCGGTTCCACCAGCACTCGAGGATTATGAAATATCTGTATGGATTCACGGCCCGGCTTAAGGACCACCAGCCCAAGCTCAGGCACCAGAACAGGTCTAAGTAATACCCGCACGTTACCTCCAGATCCGTTGCTGGAAAGTGCGGGACGCACGTGGTGGGCGTTCGGAATAAGGCAGCCTGACAGAGATTATCCAGTGCCGATAGTCGAGACTGAGAGCTTTCTTAACCTCGAACCCGCGCCTGCGGTAAGAATGAATCAGCCATTCGGCCTGTTCTGCAGTGCATGGAGGGTGCTGGAACCATTCAGACTTGAATGCGTGAGAATACCGCCCGTGCGTGCAGGCAAGAACGGGCGAATTATCAGAATTGTAATATTTTGCGTTGCGTGCCATCGGTTTTCTCCGGTGGCACGGTGTTACTCAGCGGGAGTTCAGCCCCGCGCAAGATTGTAGATGAGTTTATTCTCCTGAAAAAGCAGAAAAGCCAGCTTTTATTCCGATCTCTTTCAATGCCTGTAATGAAGTGACAAACTCACCTTCGCGCAAGATAAATCCGTCCGTGACCCGAGCATCCACAAAATTAATTAACGCAGCCCCATTCTTTCGCAAACACATAATGCGGTAATGACTAACAAGATTTCCATTTTCAACGCACACAGCATAGAGGCCATCTTCACAAAAAATTTTACGCAGTTCTTCGATGTTCATCATCAGAATCCTTCCGGATAATTAGCTCTCCCCTTTAAGGGACCATCCCTCTTATCCCTGCGCGCTACTTAAGTATTTTTGATTCTATTCCGGCACCGTCCAGAACTTCAAACGCGTTGAAAATAAAAACAAAAACCCGCCGAAGCGGGTTAAGTGCGGGTGCGTTGAGGATGCCTGCCACATCAGAGGTGGCGAGGGATTTCTCCCCCGCCGGGTCTCTTACTCCTCAGGTTCGTAAGCTGTGAAGACAGCGACCTCCGTCTGGCCGGTTCGGATTCGTACCTCGCAGAGGTCTTTCCTCGTTACCAGTGCCGTCACTATGACGGTTAAACAGATGACGATCAGGGCGATTAACATCGCCTTTTGCTGCTTCATAGCCTGCTTCTCCTGTCAACGCAAAGCAGAAGTGTCACCTTCGGTGCGAAACAGAGATGTCATGCTTTGGTTCAGAGAATGCGTTTGACCGCCTCGCTATATACTTCCGAGCGTTCTCTTTTCCCAACAGAAATCACGAAAACGACAACTTTCTCGTCTATAACCTGGTATACAAGGCGATAGCCTGAAGACCGGAGCTTAATCTTGTAACAATCAGGCATACCACGGAGCTTGTTTGCTTCAATCCGGGGTGACTCAAGTACTTCAACCAGCTTCTTTTTCAACTGTTCACGTACCGTCGAGCCCAGCTTTCGCCATTCCTTTAGTGCCCGCTCGTCAAAATCCAGAAAATACGCCATCAGAGTTCATCCAGCGTCACACGTACTGGCTTAGGATTACGAAGCCGTTCTTTCACTATCTCCACAAGTTCAGCATCTTCATCACTCAGGAGTGTCTGTTTGAACGGCAAGCGTTCATTGTCAGCGATATACTCGAGCATGAGACGAAGCGCTTCAGAAGGAGTTACACCCATTTTTTCAAGCGCGGCGTAAGAACGCGCTTTAAGTTCATCGTCAATACGCAGGTTAATGCTACCCATGTCTTACACCTCTTGTAATTACAAATGTCATTACAAGTATCGCACTACAACATGCTTAGGGCAAGTCACGAAGGAAGTCAGAAAGTAGTCGTAAGAACGGTGATCACTGTCCGCTTTGTGCCAGGAGCAGCCATTGCTAAGTCCATCCTGTATTGTGCAGGTCAGCTCGTTTTTAAAGAGTCCGGCCATCATCTTACTGGTACAGACACCATATACTTTGTGACGGTCAGGCTACATATGCACAACTCAACTTATTCATCTATTTTTTGCTTTAGCATGTCAGTGTTGCTTTCTCGTCGGCGGGTGAGCGGTGACCTGACCTGTCGATAAAGGAACGTAACACGTTTTATGCAACACCCGCATGCGGCAGAAAATTATTGCCGAACGTTTACCCCTGTCAACAAGCTTTACTTTCTGAGGCGCGCCAGCCCGCGAGGAAAACAATCTGAACATCAAACAATTAATGACACAAGAAATACGATTAAAGATTTTTTTGTGCATGCCGATAGTGCTTTTTTAAAAGGAGAAATCTATGTCTGTCACAATTCAGGGAAATACCTCAACCGTTATTTCAAACAACTCCGCCCCGGAAGGAACATCAGAAATAGCCAAAATCACAAGACAAATTCAGGTGCTGACTGAAAAGCTTGGGAAAATCTCATCGGAAGAGGGGATGACGACACAGCAGAAAAAAGAAATGGCTGCATTGGTACAGAAGCAAATTGAAAGCCTCTGGGCTCAACTGGAGCAGTTGTTAAGGCAGCAGGCAGAGAAAAAGAATGAAGACGCGACAGTTCAGCCTGATAAAAAAGAAGAGAAAAAAGACGATACAAATACCGCTGGCACCATTGATATTTACGTCTAAGTGACAGCCGTATTGTGGCCCTCATCGGGCCACTTTTCGCCATCAGCCTTTTCTTTAAAGACATATTATCTTTGTATCATTTCTGATAGTTAACATTACAAGATATAAGTAATGGACGCACTCCCAATTAGTCTATTTAAATCGCCACGAGTTTAACTGACAACCCATGATCAATTATGAATTGCAACTATTTCTGTAGTCACTTTTGTGGGGACAGTCCACAAAACTGCCAACTTCCGCTTCTTGCTCTTAGCGGACATTAGCATAGGCTATTTACCATAACGCCTCATTACGCGCACCGCCCAGACTGACTCAGCGCGTTTCTGGCATATCCCCGGTAAAACAAGTAACAAACCACCCGAAAATGAACACCAGAAACGCGACTTAAGAATCTACCCTATGAATGGATATGCACTCAACCGAATCGATCTTGGTTTCAATCTTTTTTATCGGGATCAGGCTTCTTTTTAGGTAACTTCGGGGGCTTAACTTGCTGATGACTTTGCGTTCGGCGCGTAAGCCAGGGATGGTCAGCTTTAGGTTTAACATAGTATTTTGAGCGTAAATCAATACGGGCATTATCCACTCGTTCATGGACACTCTTTTCATCATCCAGTGGTAGCCTCCATAATTGCAGGCACTAGCGCCGTGAACTTTTCACGCTTATCCCTGGTGTCGATAGCCTTCCAGCGTTCAAATATCTTCACTCGATTAACGCCAAGCGCTCGCTGATCAATCGCGCCACCTTCATATGTGACACGCTGAACATCGATGTTCGGGCGCTCTTTCAAAGCCCAGAATGCTTCAGTGATTAATATCGTCGCCTGCTCCTGTGTCATTCCTGGTCGACATATCCAGGCATCCAGAGCCTCACGAGCCTGTTCAGGAGTGATTTTCATTGTTCAACCGCCCCGCCCGCTTCGTCTTACGATATTCATCATAAACTTTGGGATCATACTGAAGCTCCCCGCCAGATGCCTCCTGTAGACGCATCGCGCGACCTTCGGGAACTAAATCCCCTTTCCAGCTATAAAGCGAAGCCAAACGAATACCTGCTGCTTGTGCAAGTTTTGTTTTTGAACCGAAATACAAAAGAGCGTCAGTTTTAAGCATTTAAAACACCTTTATTGTTAGTCATAACTAACAAGATAGATGTTAACAAAAACATAGTCAATACGATTTAGCATTAGCTAACTATGGAAACAAAAAATTTAACTATCGGCGAACGCATCAGGTATCGTCGGAAAAACCTCAAACACACCCAAAGGTCTCTTGCTAAAGCCCTGAAAATCTCCCATGTGTCTGTATCACAATGGGAACGGGGTGATAGTGAACCTACAGGGAAGAACCTTTTTGCCCTCAGTAAAGTATTGCAATGCTCACCAACATGGATTCTATTTGGCGATGAAGACAAGCAACCAACACCACCTGTTGAGAAGCCAGTTGCCTTATCCCCCAAAGAACTAGAGCTCCTTGAGCTGTTTAATGCACTGCCAGAATCAGAACAGGATACCCAGCTCGCCGAAATGCGAGCTCGAGTAAAAAACTTCAATAAACTCTTTGAAGAATTACTAAAAGCCCGTCAGCGGACAAATAAAAGATAACATCATCAATGAGTTATCTTTTACCACATCAATTATGTTAGCTATAGCATACAAAATCACTTGACCGATATGTTAGTCATGGCTAATCTTGTTTGCATCAACACACCGCACGGTGTTCTCAGCAAACAGTTCCGCTACCCCAGCGTTAAGGGGAAATGAGGTCAGCATGGATACTATCGATCTTGGCAACAACGAATCTCTGGTGTACGGCGTGTTTCCAAACCAGGACGGCACGTTCACCGCAATGACGTATACCAAAAGCAAAACGTTTAAAACCGAAAATGGTGCCCGTCGCTGGCTGGAAAGAAACTCAGGTGAGTGA
Protein sequences of DBSCAN-SWA_5 >CP028702|2062780:2081991|2080334_2080457_-|AVZ49162.1|DBSCAN-SWA MLKSRFWCSFSGGLLLVLPGICQKRAESVWAVRVMRRYGK >CP028702|2062780:2081991|2072357_2072855_-|AVZ49147.1|DBSCAN-SWA MNQIFMVIFLVLSGFIVGNVWSDRGWQKKWAERDAAALSQEVNAQFAARIIEQGRTIARDEAVKDAQQKSAEISARAAYLSDSVNQLRAEAKKYAIRLDAAKHTADLAAAVRGKTTKTAEGMLTNMLGDIAAEAQLYAEIADERYIAGVTCQQIYESLRDKKHQM >CP028702|2062780:2081991|2071583_2071772_-|AVZ49145.1|DBSCAN-SWA MTNHIHFRCPCCHGSQYRTSSFDVSDMNPFGAKCIFCKSMMITFDNISQYLNASRLSLDLKK >CP028702|2062780:2081991|2073381_2073693_-|AVZ49149.1|DBSCAN-SWA MTQDYELVVKGVRNFENKVTVTVALQDKERFDGEIFDLDVAMDRVEGAALEFYEAAARRSVRQVFLEVAEKLSEKVESYLQHQYSFKIENPANKHERPHHKYL >CP028702|2062780:2081991|2079732_2080065_+|AVZ49161.1|DBSCAN-SWA MSVTIQGNTSTVISNNSAPEGTSEIAKITRQIQVLTEKLGKISSEEGMTTQQKKEMAALVQKQIESLWAQLEQLLRQQAEKKNEDATVQPDKKEEKKDDTNTAGTIDIYV >CP028702|2062780:2081991|2072851_2073385_-|AVZ49148.1|DBSCAN-SWA MNTKIRYGLSAAVLALIGAGASAPQILDQFLDEKEGNHTMAYRDGSGIWTICRGATVVDGKTVFPNMKLSKEKCDQVNAIERDKALAWVERNIKVPLTEPQKAGIASFCPYNIGPGKCFPSTFYKRLNAGDRKGACEAIRWWIKDGGRDCRIRSNNCYGQVIRRDQESALTCWGIEQ >CP028702|2062780:2081991|2071281_2071437_-|AVZ49144.1|DBSCAN-SWA MREYPNGEKTHLTVMAAGFPSLTGDHKVIYVAADRHVTSEEILEAAIRLLS >CP028702|2062780:2081991|2079224_2079530_+|AVZ49160.1|DBSCAN-SWA MSLQVSHYNMLRASHEGSQKVVVRTVITVRFVPGAAIAKSILYCAGQLVFKESGHHLTGTDTIYFVTVRLHMHNSTYSSIFCFSMSVLLSRRRVSGDLTCR >CP028702|2062780:2081991|2064329_2065613_-|AVZ49134.1|DBSCAN-SWA MDFQLYSLGAALVFHEIFFPESSTAMALILAMGTYGAGYVARIVGAFIFGKMGDRIGRKKVLFITITMMGICTTLIGVLPTYAQIGVFAPILLVTLRIIQGLGAGAEISGAGTMLAEYAPKGKRGIISSFVAMGTNCGTLSATAIWAFMFFILSKEELLAWGWRIPFLASVVVMVFAIWLRMNLKESPVFEKVNDSNQPTAKPAPAGSMFQSKSFWLATGLRFGQAGNSGLIQTFLAGYLVQTLLFNKAIPTDALMISSILGFMTIPFLGWLSDKIGRRIPYIIMNTSAIVLAWPMLSIIVDKSYAPSTIMVALIVIHNCAVLGLFALENITMAEMFGCKNRFTRMAISKEIGGLIASGFGPILAGIFCTMTESWYPIAIMIMAYSVIGLISALKMPEVKDRDLSALEDAAEDQPRVVRAAQPSRSL >CP028702|2062780:2081991|2062780_2064241_-|AVZ49133.1|DBSCAN-SWA MGNNLLSAKATLPVYDLNNLAPRIVHLGFGAFHRAHQGVYADILATEHFSDWGYYEVNLIGGEQQIADLQQQDNLYTVAEMSADVWTARVVGVVKKALHVQIDGLETVLAAMCEPQIAIVSLTITEKGYFHSPATGQLMLDHPMVAADVQNPHQPKTATGVIVEALARRKAAGLPAFTVMSCDNMPENGHVMRDVVTSYAQAVDVKLAQWIEDNVTFPSTMVDRIVPAVTEDTLAKIEQLTGVRDPAGVACEPFRQWVIEDNFVAGRPEWEKAGAELVSDVLPYEEMKLRMLNGSHSFLAYLGYLAGYQHINDCMEDEHYRYAAYGLMLQEQAPTLKVQGVDLQDYANRLIARYSNPALRHRTWQIAMDGSQKLPQRMLDSVRWHLAHDSKFDLLALGVAGWMRYVGGVDEQGNPIEISDPLLPVIQKAVQSSAEGKARVQSLLAIKAIFGDDLPDNSLFTARVTETYLSLLAHGAKATVAKYSVK >CP028702|2062780:2081991|2075182_2075395_+|AVZ49152.1|DBSCAN-SWA MSRKMTGIVKTFDGKSGKGLITPSDGRIDVQLHVSALNLRDAEEITTGLRVEFCRINGLRGPSAANVYLS >CP028702|2062780:2081991|2075816_2076569_-|AVZ49153.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATGTGCLTITDVMAAQGMVQSKAPLGLALFLAKVGVQDPQFAIEGLLNYAMALDNPTLNKLSEEIRLQIIPYLVSFAFADYSRSAASKARCEHCSGTGFYNVLREVVKHYRRGESVIKEEWVKELCQHCHGKGEASTACRGCKGKGIVLDEKRTRFHGVPVYKICGRCNGNRFSRLPTTLARRHVQKLVPDLTDYQWYKGYADVIGKLVTKCWQEEAYAEAQLRKVTR >CP028702|2062780:2081991|2081261_2081669_+|AVZ49164.1|DBSCAN-SWA METKNLTIGERIRYRRKNLKHTQRSLAKALKISHVSVSQWERGDSEPTGKNLFALSKVLQCSPTWILFGDEDKQPTPPVEKPVALSPKELELLELFNALPESEQDTQLAEMRARVKNFNKLFEELLKARQRTNKR >CP028702|2062780:2081991|2066399_2066633_+|AVZ49135.1|DBSCAN-SWA MKSKDTLKWFPAQLPEVRIILGDAVVEVAKQGRPINTRTLLDYIEGNIKKKSWLDNKELLQTAISVLKDNQNLNGKM >CP028702|2062780:2081991|2071782_2071995_-|AVZ49146.1|DBSCAN-SWA MSNKMTGLVKWFNPEKGFGFITPKDGSKDVFVHFSAIQSNDFKTLTENQEVEFGIENGPKGPAAVHVVAL >CP028702|2062780:2081991|2078673_2078961_-|AVZ49158.1|DBSCAN-SWA MAYFLDFDERALKEWRKLGSTVREQLKKKLVEVLESPRIEANKLRGMPDCYKIKLRSSGYRLVYQVIDEKVVVFVISVGKRERSEVYSEAVKRIL >CP028702|2062780:2081991|2070936_2071110_-|AVZ49143.1|DBSCAN-SWA MNIENLKTKAEADISEYITKKIIELKKKTGKEVTSIQFTAREKMTGLESYDVKINLI >CP028702|2062780:2081991|2069834_2069936_-|AVZ49140.1|DBSCAN-SWA MIIIITLRVLSGDPTGYGAATSRVFAIYENFPV >CP028702|2062780:2081991|2076582_2077632_-|AVZ49154.1|DBSCAN-SWA MRVLLRPVLVPELGLVVLKPGRESIQIFHNPRVLVEPEPKSMRNLPSGVVPAVRQPLAEDKTLLPFFSNERVIRAAGGVGALSDWLLRHVTSCQWPNGDYHHTETVIHRYGTGAMVLCWHCDNQLRDQTSESLELLAQQNLTAWVIDVIRHAISGTQERELSLAELSWWAVCNQVVDALPEAVSRRSLGLPAEKICSVYRESDIVPGEQTATSILKQRTKNLAPLPYAHQQQKSPQEKTVVSITVDPESPESFMKLPKRRRWVKEKYTRWVKTQPCACCGMPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHNELHTDTVAFEDKYGSQLELIFRFIDRALAIGVLA >CP028702|2062780:2081991|2069125_2069695_-|AVZ49139.1|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIRWYAERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIPGLLSEYNRADRQYAGGSRS >CP028702|2062780:2081991|2068212_2069175_-|AVZ49138.1|tail|DBSCAN-SWA MNITALTDNTQGAAGLELYEVYNNGYPTAYGNIIHLKGMTAVGEGELLIGWSGTSGAHAPAFIRSRRDTTDANWSPWAQLYTSAHPPAEFYPVGAPIPWPSDTVPSGYALMQGQTFDKSAYPKLAVAYPSGVIPDMRGWTIKGKPASGRAVLSQEQDGIKSHTHSASASSTDLGTETTSSFDYGTKSTNNTGAHTHSISGTANSAGAHQHKSSGAFGGTNTSIFPNGYTAISNLSAGIMSTTSGSGQTRNAGKTSSDGAHTHSLSGTAASAGAHAHTVGIGAHTHSVAIGSHGHTITVNAAGNAENTVKNIAFNYIVRLA >CP028702|2062780:2081991|2070374_2070785_+|AVZ49142.1|DBSCAN-SWA MDQVVIFKQIFDKVRNDLNYQWFYSELKRHNVSHYIYYLATENVHIVLKNDNTVLLKGLKNIVSVKFSKDRHLIETTSNKLKSREITFQEYRRNLAKAGVFRWVTNIHEQKRYYYTFDNSLLFTESIQKTTQILPR >CP028702|2062780:2081991|2067637_2068213_-|AVZ49137.1|tail|DBSCAN-SWA MAFRMSEQPRTIKIYNLLAGTNEFIGEGDAYIPPHTGLPANSTDIAPPDIPAGFVAVFNSDESSWHLVEDHRGKTVYDVASGNALFISELGPLPENVTWLSPEGEFQKWNGTAWVKDTEAEKLFRIREAEETKNNLMQVASEHIAPLQDAADLEIATEEEISLLEAWKKYRVLLNRVDTSTAQDIEWPALP >CP028702|2062780:2081991|2078446_2078602_-|AVZ49157.1|DBSCAN-SWA MKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >CP028702|2062780:2081991|2077978_2078230_-|AVZ49156.1|DBSCAN-SWA MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDARVTDGFILREGEFVTSLQALKEIGIKAGFSAFSGE >CP028702|2062780:2081991|2081835_2081991_+|AVZ49165.1|DBSCAN-SWA MDTIDLGNNESLVYGVFPNQDGTFTAMTYTKSKTFKTENGARRWLERNSGE >CP028702|2062780:2081991|2080947_2081178_-|AVZ49163.1|DBSCAN-SWA MLKTDALLYFGSKTKLAQAAGIRLASLYSWKGDLVPEGRAMRLQEASGGELQYDPKVYDEYRKTKRAGRLNNENHS >CP028702|2062780:2081991|2074666_2074882_-|AVZ49151.1|DBSCAN-SWA MSNKMTGLVKWFNADKGFGFISPVDGSKDVFVHFSAIQNDNYRTLFEGQKVTFSIESGAKGPAAANVIITD >CP028702|2062780:2081991|2066949_2067540_+|AVZ49136.1|DBSCAN-SWA MSQIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARLKCGDQLIVTKLDRLGCNAMDIRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMQVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQAVFERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI >CP028702|2062780:2081991|2078960_2079200_-|AVZ49159.1|DBSCAN-SWA MGSINLRIDDELKARSYAALEKMGVTPSEALRLMLEYIADNERLPFKQTLLSDEDAELVEIVKERLRNPKPVRVTLDEL >CP028702|2062780:2081991|2070083_2070317_+|AVZ49141.1|DBSCAN-SWA MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLCSEKRVSSEANKEKSDITELLRKQVRPD >CP028702|2062780:2081991|2077633_2077912_-|AVZ49155.1|DBSCAN-SWA MARNAKYYNSDNSPVLACTHGRYSHAFKSEWFQHPPCTAEQAEWLIHSYRRRGFEVKKALSLDYRHWIISVRLPYSERPPRASRTFQQRIWR >CP028702|2062780:2081991|2073697_2073913_-|AVZ49150.1|lysis|DBSCAN-SWA MKSMDKLTTGVAYGTSAGNAGFWALQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDRRKAARGE |
33 | Enterobacteria_phage(42.86%) | tail,lysis | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2539777 : 2548448
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP028702|2539777:2548448|DBSCAN-SWA ATTAATCCGTACTCATTATATTTTTCACTTGATAAAGAGCGGCAGATATCACTTGATGCATATCATAATATTTATACTCGGCCAAACGCCCGCCAAATATAACCTTGTCTTCTCTGCTAGCTAACTCTCTATATTTCTTAAAAAGCTCCATGTTTTTATTATCATTAACTGGATAGTAGGGTTCGTCGCCAACTTTCCACTCTAATGGATATTCTTTTGTAACAACCGTATGCTTTGTCTCAACATAGTCAAAATGTTTATGCTCAATTATTCTGGTATATGGTACATTAGCATCAGTGAAATTTATTACTGCATTCCCTTGGAAGTTTGGAAATTCATGGCGTTCCGTCTCAAATTTTAAAGAGCGATATTCTAACGCTCCAAACCTATAGTCGAAGTACTGATCAATGGGTCCAGTGTAGATGATTCTATGGGCTTTACTCGCTAGAGAATCTTTGTCTTTCAAAAAATCAATGCCTAATTTTACGTCCACACCTTCAAGCATTTTTTCAATAAGCTTAGTGTAGCCTCCCACCGGAATACCTTGATAGCGATCGGAAAAATAATTGTTATCAAACGTAAATCTCACTGGGATTCGCTTAATAATAAATGCAGGCAATTCTTTTGCACTTCTTCCCCACTGCTTCTCCGTATAACCCTTTATCAATGCTTGGTATAAGTCCTCCCCAACTAATGAAATCGCCTGCTCCTCCAAATTTTCAGGTACCTTGTCACCATACTTTTTTTTCTGAGCATTAATGATATTTTGAGCTTCTTGAGGATCTTTAACTCCCCACATTTGGTGGAAAGTATTCATATTAAAAGGAAGGTTGAATAATTTGTCTTTATAAATCGCCAGTGGAGAATTAGTAAAACGATTAAATTCTACTAAATCATTAACGTAATCCCATATATATTTATCATTGGTATGAAAAATATGTGCACCATATTTATGAATCTGGATACCCTCACAGTCCTCTGTGTACGCATTTCCACCGATATGATTTCTTTTCTCAATCACTAAAACTTTTTTGTTTAGCTTTTTTAACTCATTCGCACAAACGGCACCAAACAAACCAGAACCAACAATGATATAATCGTACATAAAATCCTCAGCAAACCAGTAATTTATTATTTCTTACGAACATCAGCATGAGTGACGTAACTAAGCACTCTGTTGCAAGCAATGTTATTGCTGCACCAATCTCTTTAAAAAGAGTTGTTAGCGGAAAAATCAACAACAAACTCAACAAACCCGCAGCGATTAAAATCTTACTGAATTCTTTCTTATAATTATGGGTCAGCATAACTTGAATGCCATAGACATTACTTAATGAAATAAGAAAAGGCAGAGGCGATATAATCATTAGCACAATCACTGCATTATCATATCCCGGCCCTATACTTATTTTTACTAGTATAGATGCACCCAAGAGCAGAATTAATGAAAAAGCACCACCAATCAAACTCAAGCAGGTCAATGATTTTTTAATTAAAATCACACCCTTCACACGATTAAGAACAAGCGTACTTGATATTCTTGGGTATATTGCTTGGGTGATAGGATTTAATAGCCCTTGAAGCGCGTTTCTTATAGTATTGGCCGCATTAAAATTCCCTACGGACGTTGGTCCAGATATAAATCCCAGGATAATAACTATTCCCGTAGAATATAAACTAATAGCAGATGTGGAAATAAAAACATGAAAACCGTCTGCTAAAGATCGACGCACATTATGTAATGATAGCGTAACTTTACCAATCCAACCTTCATGAACAACGATAGCTAGTGCAATAATTCCAGCAACCAGATTTGCACTTGACTGAATAAAACCGGCAATTGCTATATCTGACTTTGTGTTCACAAAAATAAATGTTAGAGGGATAATAGCCAAGCGGGATAAAATACTACTTAAAGTCAGCCATTTCATTTTTTCTTTTCCCTGAAACAGCCAGATAGGGTAGATTAAATTCCCGACTAATGCAGGAACAAACGACCATATAATTACGGCATGCTTGTTATATTCAGGAACAAGCAAGGTCATCGACGTTAAGAAAATCAATGTAATGACGATAAGAACTATTTTTGAAAATATCACCGCCCAAAAAATAGACGTTACTTTATCTTTACTATCTGCTGCTTTGGCAATACTCTGAGTTGCTGTGAGATTGAAACCATATTCAACAAACATTATCATATATAGCATAGTCGCTTGGCAAAAACCGAATATACCGAAATTTTCAGGACCAAGTGTTCTTACAAGATATGGAAATGTAAGCAATGGTAAAAGATAATTGCTACCTTGAACGACAGCCAGATATATAACGTTTCTTCTTAAAGATAATTTATTCGTATTCATGCAATTAATTTTAATCTGATAAGCTCATCTAACGTAAAGAGCCTTTCATCTTTTGGCGAAAGGATTAACCCTGATGTTTGGGGCCAATCAATTGCAATGCGTTCATCATTCCAACATATTCCACAATCGCTTTCAGGATGATAATAGTTTGTAGTTTTATATTGAAATTCAGCGATATCAGACAGAACCAAAAAGCCATGAGCAAACCCTTTTGGTATCCACAACTGCTGCTTATTATCAGCTGAAAGCAGAACACCAACCCATTTACCAAAGGATACCGAATTGGGTCGAATATCAACAGCAACATCAAAAACTGCTCCATGAGTGCAGCGTACAAGTTTATCTTGTGCGTACTCGCCGCGTTGAAAGTGAAGGCCTCTGAGTACATTTTTTGATGAACGTGAGTGATTGTCTTGAACAAAGCTGACCGGATAGCCTAGAATATGTTCAAATGCTGATTGATTAAAGCTCTCATAAAAGAAACCTCTATCATCACCAAATACTCTTGGCTCCAGAATTAGCACATCTTCAATTTCAGTTCTAATCACATTCATTAATTTGAATCCTTCGTCATTTTATAAAGATACTGCCCATAATTATTCTTTATTAGTGGTACAGCTAATTTTCTTACTTGCTCAACATCAATAAAACCTTTACGAAATGCAATCTCTTCAGGACAGGAAACCTTCAATCCCTGGCGCTCTTCAATTGTCGCAATAAAATTACTTGCTTCTATCAGACTCTGATGAGTCCCCGTGTCCAGCCACGCGTAGCCACGCCCCATCATCGCGACAGACAGACGTCCCTGCTCAAGATAAATACGGTTAATATCTGTAATTTCTAACTCACCACGTGCAGACGGCTTCAAGTTTTTCGCCATCTGAACCACGTCGTTATCATAAAAGTACAGACCTGTAACGGCGTAATTACTCTTTGGTTCTAACGGTTTTTCTTCCAGACTGATTGCCGTACCGTTTTTATCAAACTCAACGACACCATAGCGTTCTGGATCATTAACGTGATAGGCAAATACCGTTGCACCACTTTCTTTGTTAACAGCGGCCTCCATTAGCTTCGGCAGATCGTGACCGTAAAAGATATTATCACCAAGAACCAAAGCACAATCATCACCACCAATAAACTCTTCACCGATGATAAATGCCTGCGCGAGGCCATCTGGGCTAGGTTGCACTTTGTACTGAAGATTCAGGCCCCACTGGCTACCGTCACCCAGCAATTGTTGAAAACGAGGAGTATCCTGAGGTGTACTGATAATCAAAATATCGCGAATACCCGCCAACATCAGTGTAGAGAGCGGGTAATAGATCATCGGTTTATCATAAATAGGTAATAGCTGTTTACTGACAGCCATAGTCACAGGATAAAGACGTGTACCAGAACCACCCGCTAAAATAATACCTTTACGCATTTTCATTTCATCATTCCTTTTAATTCATCTTGCTCCACCATCACGAACAAGATGCAAAAACTATTAAATTGCTGTAGTCGTAAATAATTCATTGAGCATTCGTTTCACGCCAACCTGCCAGTCAGGCAAGACAAGCGCAAAGTTCTGCTGAAATTTTTCTGTATTAAGGCGAGAGTTATGTGGACGACGAGCTGGTGTAGGATAGGCTGTTGTTGGTACTGCGTTGAGCTTGTTGAGTGCAAGGGGAATGCCTGCTTTGCGCGCCTCTTCAAAAACCAGCGCAGCATAATCGTACCAGGTTGTGGTACCACTGGCTACCAAATGGTACAAGCCTGCGACATCCGGTTTATTCAGTGCGACACGAATGGCATGTGCTGTACAATCAGCCAGCAGTTCAGCACCTGTTGGCGCACCAAACTGATCGTTAATAACCGCTAATTCTTCACGCTCTTTTGCCAGACGTAACATCGTTTTGGCGAAGTTATTTCCTTTTCCTGCATAGACCCAGCTGGTCCGGAAAATAAGATGCTTCGCGCAATATTCCTGTAACGCTTTTTCTCCGGCTAACTTGGTTTCACCGTAAACATTTAGTGGTGCGGTTGCATCCGTCTCCAGCCATGGCATATCGCCATTTCCAGGGAAGACGTAATCAGTCGAGTAATGGATAACCCAGGCTCCAACTTCATTTGCTGCTTTCGCAATCGCTTCGACACTTGTTGCGTTAATTAATTGTGCAAACTCCGGTTCTGATTCTGCTTTGTCTACTGCGGTGTGAGCGGCTGCATTGACAATAATATCCGGCCGAATGCTTCTTACGGTTTCAGCTACACCTTCAGGATTACTAAAATCACCGCAATAATCAGTAGAGTGAACATCAAAAGCAATCAAATTACCCAAAGGTGCCAGAGCACGCTGTAGTTCCCAACCTACCTGCCCTGTTTTGCCAAAAAGGAGGATATTCATTACTGGCGGCCCTCATAGTTCTGTTCAATCCACGATTGATAGGCACCACTTTTCACATTATCAACCCATTTTGTATTGGACAGGTACCATTCCACCGTTTTACGAATCCCGCTCTCAAACGTTTCCTGTGGTTTCCATCCCAATGCGCGACCAATCTTCTCAGCATCAATAGCATAGCGGCGATCGTGTCCCGGACGATCAGCAACATAAGTGATTTGCTCACGATAAGATTTCTCTTTCGGTACAATCTCATCCAGCAAATCACAAATAGTGAGCACTACATCGATGTTTTTCTTTTCGTTGTGCCCACCAATGTTATAAGTTTCACCCGCTTTACCTTCGGTTACGACGGTATATAACGCACGCGCATGATCTTCAACATACAACCAGTCGCGGATCTGATCTCCTTTGCCATAAATAGGTAATGCCTTACCTTCCAGTGCATTAAGAATAACCAGTGGAATAAGCTTTTCCGGGAAATGATAAGGACCATAGTTGTTCGAGCAATTAGTCACAATTGTCGGTAAACCATATGTACGTTTCCACGCGCGGACTAAATGATCGCTGGATGCTTTGGATGCGGAATAAGGGCTGCTTGGCGCGTAAGCTGTCGTCTCAGTAAATAAGGGTAATTCTTCTGTATTATTTACTTCATCTGGATGAGGCAAATCACCATAGACTTCGTCAGTAGAAATATGATGAAAACGGAAGCTATTTTTCTTGTCGCTATCAAGAGCAGACCAGTAATTGCGAGCGGCTTCCAAAAGGACATAAGTACCAACAATATTGGTTTCAATAAATGCCGCAGGGCCTGTAATTGAACGGTCAACATGGCTTTCAGCAGCCAGGTGCATCACTGCATCCGGCTGATGCTGAGCAAAAATCCGTGCCATTGCAGGTGCATCGCAAATATCCGCATGTTCAAAAACATAGCGTTCAGAATCAGAAACATCAGCAAGTGATTCCCGGTTTCCGGCGTACGTTAATTTATCGACATTAACAACACTATCCTGCGTATTATTTATAATGTGACGAACTACAGCTGAACCAATAAATCCTGCGCCACCAGTAACAAGTATTTTCACTTAATTTATTCCATATTACTTCAGAGCATGCTGTGAAATAAGCGGCTCTCAGTTTGATTAATAGAGGTATTAATGCACGCTACCGCCCCTGGCTTTACAGCTACCAGAGCACTGCATGCATGCCTATGATGTGACGAGCGTTACCCACTCGCGCTAAACCCGAAAAATTCAAACGCTAATTGTCTTACCAATCCGCTCTGGAAACAAGGAAAATCCTGGAAAACTTTGAATAAAACCCTACTGCTAACTCGTTGTTATTCTGATGGTTTATATAAAACAACGGCAGGAAGATTCGCAACAAATTACTTTTGCTGCGAATTTTCACTGCCGTTATAATTTTCTTATCAACCGTTACATCCGGTCAGATTTTCATTATTCGCTTAACAGCTTCTCAATACCTTTACGGAACTTCGCCCCTTCTTTCAGGTTGCGTAGGCCATACTTCACAAACGCCTGCATATAGCCCATTTTTTTGCCGCAGTCGTAACTGTCGCCGGTCATCAGCATTGCATCAACGGATTGTTTTTTCGCCAGCTCGGCAATAGCATCAGTCAGCTGAATACGTCCCCATGCACCAGGCTGAGTACGTTCCAGTTCCGGCCAAATATCGGCAGAAAGCACATAGCGACCTACGGCCATGATGTCTGAGTCCAGCGTCTGCGGCTGATCCGGTTTTTCGATAAATTCAACAATGCGGCTGACTTTACCCTCACGGTCCAGCGGCTCTTTAGTCTGGATGACGGAGTATTCAGAGAGGTCACCCGGCATACGTTTTGCCAGCACCTGGCTGCGGCCCGTTTCGTTGAAACGTGCAATCATGGCAGCAAGGTTGTAACGTAGCGGGTCGGCGCTGGCATCGTCGATCACAACGTCTGGCAGTACCACGACAAATGGGTTGTCACCAATGGCAGGTCGCGCACACAAAATGGAGTGGCCTAAACCTAAAGGTTCGCCCTGACGCACGTTCATAATGGTCACGCCCGGCGGACAGATGGACTGTACTTCCGCCAGCAGTTGACGCTTCACGCGCTGCTCAAGGAGTGATTCTAACTCATAAGAGGTGTCGAAGTGGTTTTCGACCGCGTTCTTGGACGCGTGAGTTACCAGGAGGATTTCTTTGATCCCTGCAGCCACAATCTCGTCAACAATGTACTGAATCATTGGCTTGTCGACGATTGGTAGCATCTCTTTGGGTATCGCCTTAGTGGCAGGCAACATATGCATCCCGAGACCCGCTACAGGAATAACTGCTTTTAAATTCGTCATTATTTCATCCACCTGTAAAATGGTTGCTGAATTATAGCTTGTTCGATTTTTTTCGCCAGCATCAATTACCCTGAATTGATTACTGAATTACTTGTGATGTTACGCCGCTTCGTTGTGGATTGCAGTAGCATTGTTCCTAAGTATGACTCCATTTTTCCAGGAATGGTCGCAAATCTACTCCCTCAGTTCCGGCAATCTAAAGTTAATCTTCTCCACATTAACAATATGGTGATTAATCCTGTCGATATCGACGGAGCTTTGTCCTTTTTCATTCACCGCATGAACATTTGCAAGAGACAGCAGTGTTTCTTTTTTCGCCATAAAAACACCACGAACGTCTTTGCGCATGTCAAAGTTCATGCTCAATGCGGGTCCAACTGAGGATTCCTGCATCACATTGATATTTCGCATAAAAAAATGTTGCGGTTTGTTGTGTAACTCCAGCGACGCACGCTTCATCTCAATGTTAGTTAGCGCCACAAAGGAAACAGCATTCCCGGCGGAGATTTGAATGCCGCGCAATTTATAAGCAAGATGAGTATTATCCAGTTGAATATCATTCACCCGGAAATTTTGCGGTATCGAGAGATATTTGCCTTTAATTACCCCATAGCCGATTAACATCCCGGCGCTATTAATCATTTCAATATTATCAATCACGAAATTGTCACAACCGTAAATAGCGACTGTCGCGTTATCAATGCCCGCTTTCTTACTGAAATCCGGCGTGATATTGCGGGCTTTGATATTACGAATAACAAAATGTTTACCATTTTCAACATGTATCAACTGCCGACAATCCGATCCCGTGATATTCGCCACGACAAAGTTTTTCACTGCCTGGTCTTCCGGGTAGTTGTTATCATAAGTGCTTCCCGCAAGGCCTATGCCGATGCCCCAGTTGATTTTGCCGTTGGTACAGTTGATGCGCTCGATGACATGGTCAGATATCAAAATATCACGGTCGTTAATTGCCACGTTCCATTCAATGGCGTCGCCTTGTAAGTCGCTGAACTTACAATTGGTGATGTTGGCACCGATAATCTGGTTATGAAATCCCTGGCGTAAGATGGCGTAATTAGCGTGGCTAACGGTCAGGTTATCGATGATCAGGTTGCGCATGACCCGTTTGTTTTTGCCGCCGATATAAATCTGCGTCACCGGGCCAAAGCCGCTCATAGTCAGCCCTTTGATGGTGCAGTCAGAACCACGCACATCCAGGGTGATGTTATGCATACTGCCGCCATCCTCCCCTGTCACCTGGCTGCCGTCCTGTAAGACAAATCGCCCTCTGCCGTTGCCGCGCAAGCTTCCAAGAATGTGTAACGTTTTACCGGGAGGGATAAAGATGCCGGTGTTGATATTGTCACAAACCAATCCGGCAGGCACGACGACTGTTTGCCCTTCGCTGAAGGCTTGTTTAAATGAGGCGATCCAGTCGTGTGGGTTGTAGTCGTTAATGTTAACGCTTTGTCGGGCGGGAAGCGCGCGGGCGAAAGGGGTATGGAGGAAGGCAAGCGCCGAGCTTGCCGTCAGGAACGTGCGTCGGGAGAGTTTTTTAAATGGCAT
Protein sequences of DBSCAN-SWA_6 >CP028702|2539777:2548448|2542132_2542690_-|AVZ49620.1|DBSCAN-SWA MNVIRTEIEDVLILEPRVFGDDRGFFYESFNQSAFEHILGYPVSFVQDNHSRSSKNVLRGLHFQRGEYAQDKLVRCTHGAVFDVAVDIRPNSVSFGKWVGVLLSADNKQQLWIPKGFAHGFLVLSDIAEFQYKTTNYYHPESDCGICWNDERIAIDWPQTSGLILSPKDERLFTLDELIRLKLIA >CP028702|2539777:2548448|2544527_2545613_-|AVZ49623.1|DBSCAN-SWA MKILVTGGAGFIGSAVVRHIINNTQDSVVNVDKLTYAGNRESLADVSDSERYVFEHADICDAPAMARIFAQHQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEAARNYWSALDSDKKNSFRFHHISTDEVYGDLPHPDEVNNTEELPLFTETTAYAPSSPYSASKASSDHLVRAWKRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKALPIYGKGDQIRDWLYVEDHARALYTVVTEGKAGETYNIGGHNEKKNIDVVLTICDLLDEIVPKEKSYREQITYVADRPGHDRRYAIDAEKIGRALGWKPQETFESGIRKTVEWYLSNTKWVDNVKSGAYQSWIEQNYEGRQ >CP028702|2539777:2548448|2539777_2540881_-|AVZ49618.1|DBSCAN-SWA MYDYIIVGSGLFGAVCANELKKLNKKVLVIEKRNHIGGNAYTEDCEGIQIHKYGAHIFHTNDKYIWDYVNDLVEFNRFTNSPLAIYKDKLFNLPFNMNTFHQMWGVKDPQEAQNIINAQKKKYGDKVPENLEEQAISLVGEDLYQALIKGYTEKQWGRSAKELPAFIIKRIPVRFTFDNNYFSDRYQGIPVGGYTKLIEKMLEGVDVKLGIDFLKDKDSLASKAHRIIYTGPIDQYFDYRFGALEYRSLKFETERHEFPNFQGNAVINFTDANVPYTRIIEHKHFDYVETKHTVVTKEYPLEWKVGDEPYYPVNDNKNMELFKKYRELASREDKVIFGGRLAEYKYYDMHQVISAALYQVKNIMSTD >CP028702|2539777:2548448|2540888_2542136_-|AVZ49619.1|DBSCAN-SWA MNTNKLSLRRNVIYLAVVQGSNYLLPLLTFPYLVRTLGPENFGIFGFCQATMLYMIMFVEYGFNLTATQSIAKAADSKDKVTSIFWAVIFSKIVLIVITLIFLTSMTLLVPEYNKHAVIIWSFVPALVGNLIYPIWLFQGKEKMKWLTLSSILSRLAIIPLTFIFVNTKSDIAIAGFIQSSANLVAGIIALAIVVHEGWIGKVTLSLHNVRRSLADGFHVFISTSAISLYSTGIVIILGFISGPTSVGNFNAANTIRNALQGLLNPITQAIYPRISSTLVLNRVKGVILIKKSLTCLSLIGGAFSLILLLGASILVKISIGPGYDNAVIVLMIISPLPFLISLSNVYGIQVMLTHNYKKEFSKILIAAGLLSLLLIFPLTTLFKEIGAAITLLATECLVTSLMLMFVRNNKLLVC >CP028702|2539777:2548448|2545985_2546879_-|AVZ49624.1|DBSCAN-SWA MTNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEILLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQGEPLGLGHSILCARPAIGDNPFVVVLPDVVIDDASADPLRYNLAAMIARFNETGRSQVLAKRMPGDLSEYSVIQTKEPLDREGKVSRIVEFIEKPDQPQTLDSDIMAVGRYVLSADIWPELERTQPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKGIEKLLSE >CP028702|2539777:2548448|2542689_2543571_-|AVZ49621.1|DBSCAN-SWA MKMRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGGDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDKNGTAISLEEKPLEPKSNYAVTGLYFYDNDVVQMAKNLKPSARGELEITDINRIYLEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAFRKGFIDVEQVRKLAVPLIKNNYGQYLYKMTKDSN >CP028702|2539777:2548448|2547053_2548448_-|AVZ49625.1|DBSCAN-SWA MPFKKLSRRTFLTASSALAFLHTPFARALPARQSVNINDYNPHDWIASFKQAFSEGQTVVVPAGLVCDNINTGIFIPPGKTLHILGSLRGNGRGRFVLQDGSQVTGEDGGSMHNITLDVRGSDCTIKGLTMSGFGPVTQIYIGGKNKRVMRNLIIDNLTVSHANYAILRQGFHNQIIGANITNCKFSDLQGDAIEWNVAINDRDILISDHVIERINCTNGKINWGIGIGLAGSTYDNNYPEDQAVKNFVVANITGSDCRQLIHVENGKHFVIRNIKARNITPDFSKKAGIDNATVAIYGCDNFVIDNIEMINSAGMLIGYGVIKGKYLSIPQNFRVNDIQLDNTHLAYKLRGIQISAGNAVSFVALTNIEMKRASLELHNKPQHFFMRNINVMQESSVGPALSMNFDMRKDVRGVFMAKKETLLSLANVHAVNEKGQSSVDIDRINHHIVNVEKINFRLPELRE >CP028702|2539777:2548448|2543628_2544528_-|AVZ49622.1|DBSCAN-SWA MNILLFGKTGQVGWELQRALAPLGNLIAFDVHSTDYCGDFSNPEGVAETVRSIRPDIIVNAAAHTAVDKAESEPEFAQLINATSVEAIAKAANEVGAWVIHYSTDYVFPGNGDMPWLETDATAPLNVYGETKLAGEKALQEYCAKHLIFRTSWVYAGKGNNFAKTMLRLAKEREELAVINDQFGAPTGAELLADCTAHAIRVALNKPDVAGLYHLVASGTTTWYDYAALVFEEARKAGIPLALNKLNAVPTTAYPTPARRPHNSRLNTEKFQQNFALVLPDWQVGVKRMLNELFTTTAI |
8 | Escherichia_phage(28.57%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2641854 : 2651295
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >CP028702|2641854:2651295|DBSCAN-SWA CATGTCTGAACTGAACGATCTTCTGACCACCCGTGAGCTACAACGCTGGCGATTAATTCTTGGCGAAGCGGCAGAAACGACGCTTTGTGGGCTGGATGACAACGCCCGGCAGATAGACCACGCGCTGGAGTGGCTGTATGGGCGCGATCCTGAACGGCTCCAGCGTGGTGAACGTTCCGGTGGATTAGGTGGCTCAAATCTCACCACCCCTGAGTGGATCAACAGTATTCACACGCTGTTTCCGCAGCAGGTGATTGAGCGGCTGGAAAGCGATGCCGTGCTGCGCTACGGCATTGAAGATGTGGTGACGAATCTCGACGTGCTGGAACGTATGCAGCCTTCTGAAAGCCTGCTACGCGCTGTTTTGCACACCAAACATCTGATGAACCCCGAAGTACTGGCTGCCGCCCGCCGGATAGTGTGCCAGGTTGTTGAAGAAATTATGGCTCGACTGGCAAAGGAAGTTCGTCAGGCTTTTTCTGGTGTCCGCGATCGCCGTCGCCGTTCATTTATTCCACTGGCGCGAAACTTTGATTTCAAAAGTACTCTGCGCGCCAACCTGCAACACTGGCACCCGCAACACGGCAAGTTGTATATCGAATCCCCCCGCTTTAACAGCCGCATTAAACGCCAAAGCGAACAATGGCAACTGGTCTTACTGGTTGATCAAAGCGGATCGATGGTCGATTCGGTGATCCACTCTGCGGTGATGGCGGCCTGTTTGTGGCAGTTACCCGGCATTCGTACCCATCTGGTGGCGTTTGACACAAGCGTCGTTGATCTCACGGCAGACGTTGCCGATCCGGTAGAGTTATTAATGAAAGTACAGTTGGGCGGCGGGACCAATATCGCCAGTGCCGTGGAGTATGGTCGGCAACTTATTGAACAACCAGCGAAAAGCGTCATTATCCTCGTGAGCGATTTTTACGAAGGGGGTTCATCATCATTACTGACGCATCAGGTGAAAAAGTGTGTCCAGAGCGGCATCAAAGTGCTGGGACTGGCAGCGCTCGATAGCACCGCAACACCTTGCTATGACCGCGATACGGCCCAGGCGCTGGTTAATGTCGGCGCACAAATAGCCGCCATGACGCCGGGCGAGCTGGCATCATGGCTTGCGGAGAATCTTCAGTCATGAATTCACTACGTCCGGAATTATTAGAACTGACACCGCAGGCCCTGACGGCGTTAAGCAATGCCGGTTTTGTTAAGCGCAGTCTTAAGGAACTGGAAAATGGCAACGTCCCGGAGATCAGCCATGAGAACGACGCTTTAATCGCCACCTTCAGTGACGGTGTCCGTACCCAGCTGGCGAACGGCCAGGCACTGAAAGAGGCTCAGTGCAGTTGCGGGGCCAACGGTATGTGCCGTCATCGCGTGATGCTGGTGTTAAGTTATCAACGACTTTGTGCCACCACTCAGTCTACGGAAAAAGAAGAAGAGTGGGATCCGGCAATCTGGCTGGAAGAACTGGCTACCCTTCCCGATGCTACCCGCAAACGCGCACAGGCGCTGGTCGCTAAAGGCATCACCATTGAGTTGTTCTGTGCGCCGGGTGAAATTCCCTCTGCCCGCTTACCGATGAGCGATGTGCGTTTTTATTCCCGCAGCAGTATTCGTTTCGCCCGTTGTGATTGTATTGAAGGCACACTTTGCGAACATGTCGTACTGGCGGTACAGGCCTTCGTCGAGGCCAAAGCGCAGCAAGCAGAATTTAACCATTTAATCTGGCAGATGCGCAGCGAACACGTCACATCATCTGACGATCCGTTTGCCAGCGAAGAAGGCAACGCGTGTCGTCAATATGTTCAGCAATTAAGCCAGACATTATGGCTTGGCGGCATCAGCCAGCCGCTCATCCATTACGAGGCAGCATTCAACCGCGCATTGCAGGCGGCAGAGACCTGCAACTGGCGCTGGGTGAGTGAATCGCTACGGCAACTGCGCGCCAGCGTTGATGCCTTCCACGCCCGCGCCAGCCACTATAATGCCGGAGAATGCTTACATCAGCTTGCGGCATTAAACAGTCGATTAAATTGCGCACAAGAGATGGCCCGGCGCGACAGTATTGGTGAAGTTCCTCCTGTGCCGTGGCGCACGGTCGTTGGCTCTGGCATTGCCGGAGAAGCAAAGCTTGATCATCTGCGGCTGGTGTCTTTAGGTATGCGTTGCTGGCAGGATATTGAGCATTATGGTTTACGCATCTGGTTTACCGATCCCGACACCGGCAGTATTTTGCACCTTTCGCGCAGTTGGCCGCGAAGTGAACAGGAAAACTCACCGGCAGCTACGCGTCGGCTGTTTAGTTTTCAGGCTGGCGCACTGGCGGGCGGGCAAATTGTTTCACAAGCAGCAAAACGCAGTGCCGATGGCGAGCTGCTGTTAGCTACCCGCAACCGCTTAAGCAGCGTTGTGCCGCTGTCGCCTGATGCCTGGCAAATGTTGAGCGCGCCGTTACGCCAGCCGGGCATTGTGGCTTTGCGGGAATATTTACGCCAGCGTCCCCCCGCCTGCATACGGCCTCTTAATCAGGTCGATAACTTATTTATTCTGCCGGTCGCTGAGTGTATTTCGCTCGGTTGGGACAGCAGCCGCCAGACGCTGGATGCGCAGGTCATTAGCGGCGAAGGGGAAGATAATGTGCTGACGTTATCATTACCAGCCTCAGCCAGCGCACCTTATGCCGTTGAACGCATGGCGGCGCTTTTGCAACAAACAGACGACCCCGTGTGTCTGGTTTCTGGCTTTGTCAGTTTTGTTGAAGGGCAATTGACACTGGAACCACGGGTGATGATGACAAAAACCCGTGCCTGGGCGCTGGACGCAGAAACTACGCCTGTGGCACCGCTACCTTCTGCCAGCGTTTTGCCTGTGCCGTCTACTGCTCATCAGTTGCTGATACGCTGCCAGGCGTTACTTATTCAACTGCTCCATAACGGCTGGCGCTATCAGGAACAGAGTGCTATTGGTCAGGCATAGTTGCTGGCGAATGACCTCACCGCGGTGGGTTTTTATCGGCTGGCACATGTGTTGGGACAATTTCGTAATACAGAAAGCGAGGCACGGGTAGAAGCAATGAATAACGGTGTTTTGCTTTGCGAACAATTATTCCCCATGCTTCAGCAACAAGGATGAAATAGTGCTTTTTACTAAGAGTTCTACTCCAGTTCCGGACTGCTCACGCCACGGTATTAGGCATATCCTATATAGCCCCTGGTGAGAGTCACCAGTTCCTTGATTAAATAAAATGGAGTTTTACATGAAGGCTTTCAATAAGCTGTTTTCCCTCGTTGTTGCATCTGTTCTGGTTTTCTCTCTTGCTGGCTGCGGTGACAAAGAAGAATCGAAGAAATTCAGCGCCAATCTGAACGGCACTGAAATTGCCATTACCTATGTCTACAAAGGTGACAAGGTGCTTAAGCAATCTTCTGAAACCAAAATTCAATTTGCCTCCATTGGTGCAACCACCAAAGAAGATGCTGCCAAGACACTTGAGCCGTTAAGCGCCAAATACAAAAACATCGCGGGTGTTGAAGAAAAATTAACCTATACCGATACCTACGCGCAGGAAAACGTGACTATCGATATGGAAAAAGTGGATTTTAAAGCCCTGCAGGGTATTTCAGGAATCAACGTTTCTGCTGAAGATGCCAAAAAAGGTATCACTATGGCGCAAATGGAACTGGTGATGAAAGCCGCTGGTTTTAAAGAAGTGAAATAATCGGTTGGCGGTCATGCTCTAAACATGACCGCCAATTTTTTAGCCTTTTTTCACATGCTGGCGCGCTGCCAGTCCACGCAGAAAATAACGTAAAAATTGATCGCCGCATTCGCGGAAGTTTTTATGATCCGGTGCGCGCATCATCGCTGTAATTTCCGGCATCGAAACGCGGAACTGCTGTTCGGTGAGGATAGCCAGAATGTCATCGGTTTTCAGCGAAAACGCGATGCGTAATTTTTTCAGCACGATGTTGTTATTAATGCGACGTTCCGGCTCCAGTGCCGGAGCAGACTCATCCTTGCCGCGTTTTTCATAAATCAGGCCATTGAGGAATGACGACAAAACAATGTCCGGACAACGCTGAAAACCCTCTTCGTCTTCTTTACGTAGCCAGACGGCGATCTGTTCCGCGGTGGCTTCGACATTACCCAGCGCCAGAATACGCACCAGGTCATTATTATTGGCTTTCAAAATGTAGCGCACGCTGCGCAGAATATCGTTACTTAGCATGAGGCCTTCAGGTGTTGATGAGGCAAAAAGCCATTTTAGCAGTCTTTTACAGGCCAATCGCCTCTTTTAAGCTTTTCAGATAACGGCGGCTGACCGGCACGGTTAAGCCATTACGCAAAATCAACTCGGCCTGGCCGTTATCTTCCAGACGAATCTCCTGTAAATGCGCGAGGTTAACCAGATACTGACGATGGCAGCGCAGTAGTGGTGTACGACTTTCCAGGGTACGTAATGTCAATTCGGTAAAGCCCTCTTTCCCTTCGTGGCTGGTAACGTAGACACCGCTCATCCGACTGCTGACAAATGCCACATCTTTCATTTGCAGCAAATAAATCCGACTATGCCCCGTACAAGGGATAAATTTCAGCGCCTGTTGATTTTCCGGTAACAGCGAAACATCCTGCTTGCTGCGCTCCTGACGCAATCGCGCCAGCGTTTTCTCCAGTCGCGCTTCATCAATTGGCTTCAGCAGATAATCAAAGGCATGTTCTTCAAAGGCTTTAATTGCGTATTCGTCAAACGCAGTGAGAAAAACAATATACGGGCGATGTTCCGGGTCAAGCATCCCCACCATTTCCAGACCACTGATGCGCGGCATCTGGATATCGAGAAACAGCACATCCGGGCGCAGTTTATGCACCGCGCCGATCCCTTCCACGGCGTTTGAACACTCTCCAACGATTTCAATATCGCTCTGCTCCTGCAAAAATACACGCAGGTTCTCCCGTGCTAACGGTTCATCATCGACAATTAAGACTTTAATCATGCCTCGTCCCTCCATGGTAGTCGTAACGTTATTCGGGTGTAACTATCAGGCTCACAGGCGACGCTTATTCCATAGTCATCGCCAAACCGTTCACGTAAACGCTTATCCACCAGATTCATCCCCAGCCCACTGGCATTGGTTACCGGTTGATACAAACCGGCATTGTCTTCGATCTCCAGCATCAAATGTTGCCCCTCACGTCGGGCGCTGATTGCCACTCGCCCTGTATCCAGCAGTTGTGATGTCCCATGTTTAATGGCGTTTTCCACTATCGGTTGCAGGGTAAACGCGGGCAATTGCTGCTGGGATAATTCTTGCGGAATAGCAATGTTGACCTGCAACCGCGACTGGAAGCGCGCCTTTTCAATTTGCAGATAAGCATTCACATGTTCAATTTCGTCGGCGAGAGTAACAAACTCCGAAGGCCGCTTTAAGTTTTTGCGGAAAAAAGTGGAAAGATACTGCACCAGCTGGCTGGCCTGTTCGCTGTCGCGGCGGATCACCGCTTTAATGGTGTTAAGCGCATTAAACAAAAAATGGGGATTCACCTGGGCGTGAAGCAGTTTGATCTCTGACTGGGTGAGCATCGCTTTTTGCCGCTCATATTGCCCGGCAAGGATCTGCGCCGAAAGCAATTGCGCAATCCCCTCACCCAACGTGCGGTTGATTGAACTGAATAAACGGTTTTTGGCTTCATACAATTTGATGGTGCCCATCACCCGCTGATTTTCACCACGCAACGGAATTACCAGCGTCGACCCCAGTTTGCATTGCGGATGCAAAGAGCAACGGTAAGGTACTTCGTTGCCATCAGCGTAGACCACTTCACCGGTTTCAATCGCTTTTAAAGTATAAGTTGAAGAAATCGGTTTGCCGGGTAAATGGTGGTCGTCACCAATTCCGGTAAAGGCCAGCAATTTCTCTCGATCGGTAATCGCGACTGCACCAATATCCAGTTCCTGATACAGCACCTGAGCCACTTTCATGCTGTTCACTTCGTTAAACCCCTGGCGCAAAATGCCTTCCGTCGAGGCTGCCACTTTCAGCGCAGTGGCAGAAAAAGCCGAAGTGTATTTTTCAAACATCGCGCGTTTATCGAGCAATATACGCATAAACAGCGCCGCGCCGACGGTATTGGTGACCATCATTGGCGCAGCAATATTACTCACCAGACGCACCGCATCTTCATAAGGTCGGGCGATCGCAAGGATGATCAGCATTTGCACCATTTCAGCGACGAACGTGACGGCACCGGCGGTAATGGGGTTAAAGACTTTATCAGTGCGCCCGCGGCGGATCAGGATGCTGTGTACCAGGCCACCGAGTAATCCTTCAACGATGGTCGAGATCATGCAACTTAACGCGGTCATGCCCCCCATCGAATATCGATGTAAGCCGCCGGTCAGCCCAACCAGCCCACCGACGACCGGACCGCCGAGTAAGCCGCCCATTACCGCGCCTATCGCACGGGTATTGGCAATAGAATCGTCAATGTGCAACCCAAACCAGGTGCCCATGATGCAGAAGATGGAAAAGACGATGTAGCAGAGAAATTTATGCGGCAGACGAACCGTGACCTGCATTAACGGTATGAATAATGGCGTTTTACTCATTAACCATGCAATGACTAAAAAAACGCACATCTGCTGAAGCAGCAGCAACACCAGATTAAAATCGTACATACCCGCAAACCACACTTCCCTTTAAAACGCGTAACATACATTGCCTGCGTTTAACTTTCTTTGAACTCTTGCAGAAAAATGAGAATTCGTGAGTACGATCACTCAAAATCGCCTGGCAAAAATAAAATCACCCTATAGATGCACAAAAAACGGGCAAAACTACCTGGTTCGCAAAACTGCGTCTAAAGTTAAACCGGGACCTCGCGAGCAAGGGTGAGACGATGGCGCTTTACACAATTGGTGAAGTGGCGTTGCTTTGTGATATTAATCCTGTCACGTTACGCGCGTGGCAGAGGCGTTACGGATTGCTGAAACCGCAACGGACAGACGGCGGTCATCGGCTGTTCAACGATGCCGATATTGACCGGATCCGCGAGATCAAACGCTGGATCGACAACGGCGTGCAGGTCAGCAAAGTTAAAATGCTGCTCAGTAATGAAAATGTTGATGTGCAGAACGGCTGGCGCGATCAGCAAGAAACATTACTGACTTACCTGCAAAGCGGCAATCTACATAGCCTGCGAACGTGGATCAAAGAGCGCGGTCAGGATTACCCCGCCCAGACACTCACCACACATCTGTTTATTCCTCTGCGCCGACGGCTTCAGTGCCAACAACCGACTCTCCAGGCGCTGCTGGCGATCCTCGACGGCGTACTGATCAACTACATCGCCATTTGTCTGGCTTCGGCACGTAAAAAACAGGGTAAAGATGCGCTGGTGGTTGGCTGGAATATTCAGGATACCACCCGTCTGTGGCTGGAGGGCTGGATTGCCAGTCAACAAGGATGGCGCATTGATGTCCTCGCCCACTCGCTCAATCAACTACGCCCTGAACTATTCGAAGGCCGTACATTGCTGGTGTGGTGCGGTGAAAATCGAACCTCCGCCCAACAGCAGCAACTCACCAGTTGGCAAGAACAAGGCCATGATATTTTCCCACTCGGCATTTAATGATTCGTTAACAAATGCGCTTTACTGTACAATCCTTTCGTTAACATAAGGAGTGCATTATGCGCATAGCTAAAATTGGGGTCATCGCCCTGTTCCTGTTTATGGCGTTAGGCGGAATTGGTGGCGTCATGCTCGCAGGTTATACCTTTATTTTGCGTGCTGGCTAAGCGCCTGCACCAGCCTTTCAAACAGGCGGTCTGCGATGATCGCCGCCAGTGCCACCAGTAACGCCCCCTGGATCACATACGCGGTATTAAATCCGCTAAGCCCGATGATGATGGGCGTACCCAGCGTGCTGGCCCCTACCGTTGAGGCGATCGTCGCCGTACCAATGTTGATAATCACCGAAGTTCGCACGCCCGCCAGAATCACCGGAGCCGCCAGCGGTAGCTCGACCTTACGCACTCGCTGACCACGACTCATTCCCATACCTTTCGCAACTTCTGTCACGCTGGCATCAATCGCTCCCAGCCCGGCAAGTGTCGCCTGCAGGACGGGCAGCACACCGTAAAGGATCAAGGCGATAATCGCTGGTTGCAGACCAAAGCCGATCACCGGAACGGCGATCGCCAGCACTGCGACGGGCGGAAAAGTCTGTCCAACGGCGGCAATAGTTTCCACCAGTGGGCGAAATTCCGCGCCCCACGGGCGAGTGACAGCAATTCCGGCACCAGTGCCAATGATCACCGCAAACAAACTCGAAATTCCCACCAGCCAGAAATGAGCCAGTGCCAGAGCTGCAAAACTTTCTTGCTGATAAACGGGTCGTGGCAGTTGTGGGAACAAGGCAGCAAACAGCGGCTGGCTGTAAGGCAGCCAGAAAATCAGCGCCACAAACAGAGCAATGAGCCAGAACAGCGGATCGCGCAACATCTTCATACGCTTACGCCTCCACCAGCAGATCCTGAAAATGCAGCGTGCCGCAAGGCTGGCCCTGCATGTTCACCACCGGCAGCACCTCGCATCCCCGCGCAACAAACAGAGAGAGCGCATCGCGTAGCGTCATCTCTTCTGCCAGTGCCTCACCATCTGCTCGTTCTTCGCGACGCACGTAATCCGCCACACTACGTAACGAAAGCAGGCGCACACCCAGTTCACTACGTCCAAAAAACTGGCGGACAAAATCATTCGCCGGACGAGTCAGCATCGTCAGCGGATTGCCCTGCTGCACTACTTCACCGTGATCCATCAATACCAGATGTTCTGCCAGCCGTAGCGCCTCATCAATATCATGAGTGACCAGCACAATGGTACGCCCCAGCAAACGGTGAATGCGCGTCATCTCTTGTTGCAACGCGCCGCGCGTTACCGGGTCCAGTGCGCCAAAAGGTTCATCCATTAGTAAGACTTGCGGATCGGCAGCCAGTGCACGCGCCACTCCCACACGTTGCTGCTGACCACCGGAAAGCTGATGCGGATAACGCTCACGCAAATTTGACTCCAGCCCCAGTAGCGCCATTAATTCGTCGATACGATCGTCAATCCGCGCCCGCGACCATTTTTGTAATTGCGGCACGGTAGCGATGTTTTGCGCCACGCTCCAGTGGGGGAACAGGCCAATAGATTGAATGGCATAGCCCATCCGGCGGCGCAACTCCAGTACTGGCAGCGAGCGAATTTCTTCTCCGGCAAAGCGGATCTCTCCGCTGTCATGCTCCACCAGGCGGTTAATCATTTTCAGGGTGGTGGATTTGCCGGAGCCAGATGTGCCAATCAGCACCGAAAAACTCCCTTCCTGAAAATTGAGATTGAGATCGTTAACGGCTTTTTGTGCGCCGAACAGTTTGCTGACATGGCTAAATTCAATCAT
Protein sequences of DBSCAN-SWA_7 >CP028702|2641854:2651295|2641854_2642991_+|AVZ49700.1|DBSCAN-SWA MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARRIVCQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKSTLRANLQHWHPQHGKLYIESPRFNSRIKRQSEQWQLVLLVDQSGSMVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIAAMTPGELASWLAENLQS >CP028702|2641854:2651295|2650368_2651295_-|AVZ49709.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGEIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMALLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSELGVRLLSLRSVADYVRREERADGEALAEEMTLRDALSLFVARGCEVLPVVNMQGQPCGTLHFQDLLVEA >CP028702|2641854:2651295|2649544_2649652_+|AVZ49707.1|DBSCAN-SWA MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG >CP028702|2641854:2651295|2646130_2646850_-|AVZ49704.1|DBSCAN-SWA MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >CP028702|2641854:2651295|2648753_2649485_+|AVZ49706.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI >CP028702|2641854:2651295|2649632_2650364_-|AVZ49708.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRVRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK >CP028702|2641854:2651295|2645112_2645574_+|AVZ49702.1|DBSCAN-SWA MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLTYTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK >CP028702|2641854:2651295|2646846_2648532_-|AVZ49705.1|DBSCAN-SWA MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >CP028702|2641854:2651295|2645613_2646084_-|AVZ49703.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKG >CP028702|2641854:2651295|2642987_2644832_+|AVZ49701.1|DBSCAN-SWA MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENDALIATFSDGVRTQLANGQALKEAQCSCGANGMCRHRVMLVLSYQRLCATTQSTEKEEEWDPAIWLEELATLPDATRKRAQALVAKGITIELFCAPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKAQQAEFNHLIWQMRSEHVTSSDDPFASEEGNACRQYVQQLSQTLWLGGISQPLIHYEAAFNRALQAAETCNWRWVSESLRQLRASVDAFHARASHYNAGECLHQLAALNSRLNCAQEMARRDSIGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTDPDTGSILHLSRSWPRSEQENSPAATRRLFSFQAGALAGGQIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWQMLSAPLRQPGIVALREYLRQRPPACIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGEDNVLTLSLPASASAPYAVERMAALLQQTDDPVCLVSGFVSFVEGQLTLEPRVMMTKTRAWALDAETTPVAPLPSASVLPVPSTAHQLLIRCQALLIQLLHNGWRYQEQSAIGQA |
10 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
2899188 : 2910398
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >CP028702|2899188:2910398|DBSCAN-SWA CATGGACAACGACAAAATTGATCAACACAGCGACGAAATTGAAGTTGAGAGCGAAGAAAAAGAGCGCGGCAAAAAAATAGAAATAGATGAAGACCGACTCCCCTCCCGGGCGATGGCAATTCATGAGCATATCCGCCAGGATGGTGAAAAAGAGCTGGAACGCGACGCAATGGCGCTACTGTGGTCAGCCATTGCGGCGGGTCTGTCGATGGGCGCTTCGTTACTGGCAAAAGGGATATTTCAAGTCGAACTGGAAGGTGTGCCGGGCAGCTTCTTGCTGGAGAATCTCGGTTATACCTTTGGTTTTATTATCGTCATTATGGCCCGCCAGCAATTATTTACCGAAAATACCGTGACTGCGGTACTACCCGTCATGCAAAAACCGACAATGAGCAACGTCGGCTTACTTATACGGTTATGGGGCGTCGTGCTGCTGGGTAATATTCTCGGGACAGGTATTGCGGCGTGGGCATTTGAATATATGCCTATCTTCAATGAAGAAACTCGCGATGCATTTGTCAAAATCGGCATGGATGTGATGAAGAACACCCCCAGCGAGATGTTTGCCAACGCGATCATTTCCGGCTGGCTGATCGCCACTATGGTTTGGATGTTTCCTGCAGCGGGTGCGGCAAAGATTGTGGTGATTATATTGATGACCTGGCTTATTGCCCTGGGTGACACCACCCATATCGTGGTCGGTTCTGTTGAAATCCTCTATCTGGTGTTTAACGGTACGCTGCACTGGAGCGATTTCATCTGGCCCTTCGCACTACCTACTTTAGCGGGGAACATCTGCGGCGGCACCTTTATCTTCGCGTTAATGAGTCATGCACAGATTCGTAACGACATGAGCAATAAGCGTAAAGCAGAAGCACGCCAAAAAGCAGAACGTGCGGAAAACATTAAGAAAAATTATAAAAACCCGGCATAAATGGCGAGGGTTTAAGCAATCGAGCGGCAGCGTACTTACCCCGCACTCCATTAGCGGGTATACTCATGCCGCATTGTCCTCTTAGTTAAATGGATATAACGAGCCCCTCCTAAGGGCTAATTGCAGGTTCGATTCCTGCAGGGGACACCATTTATCAGTTCGCTCCCATCCGTACCAGTCCGCAAAATCCCCTGAATATCAAGCATTCCGTAGATTTACAGTTCGTCATGGTTCGCTTCAGATCGTTGACAGCCGCACTCCATGACGGGTAAAAAGTGGATAAAATAATTTTACCCACCGGATTTTTACCCATGCTCACCGTTAAGCAGATTGAAGCAGCAAAGCCGAAAGAAAAACCATACCGCCTTCTCGATGGTAATGGCCTGTACCTTTATGTCCCTGTGTCAGGGAAAAAGGTATGGCAGCTTCGCTACAAGATTGACGGTAAGGAGAAAATCCTGACCGTCGGAAAATATCCGCTTATGACTTTGCAGGAGGCAAGGGATAAAGCATGGACTGCGAGGAAAGACATCTCGGTTGGCATCGATCCTGTAAAGGCGAAAAAGGCTTCGTCTAACAACAATTCCTTTAGTGCGATTTACAAGGAATGGTACGAGCACAAGAAGCAAGTATGGTCAGTAGGGTATGCAACTGAACTTGCCAAAATGTTTGACGACGACATTTTACCTATCATTGGCGGCCTTGAAATTCAGGATATTGAGCCGATGCAACTGCTGGAAGTAATCCGCAGGTTTGAAGATCGCGGTGCAATGGAACGAGCCAACAAAGCACGCAGAAGATGCGGCGAGGTTTTCCGTTACGCTATTGTCACCGGAAGGGCTAAATATAACCCGGCACCTGACCTTGCTGACGCCATGAAGGGATACCGCAAGAAGAACTTCCCGTTTCTTCCTGCAGACCAGATCCCGGCATTCAACAAAGCACTGGCAACATTTTCAGGAAGTATCGTATCGCTCATTGCGACCAAAGTTTTACGCTACACAGCCCTAAGAACGAAAGAGCTTCGTTCCATGCTATGGAAGAACGTCGATTTTGAAAATAGGATTATCACCATCGACGCCAGTGTGATGAAAGGACGCAAAATTCATGTGGTTCCTATGTCAGACCAGGTAGTTGAACTTCTCACTACGCTAAGCTCCATCACCAAACCAGTCTCAGAGTTTGTTTTTGCCGGGCGCAACGATAAGAAGAAGCCAATCTGCGAGAACGCGGTACTGCTTGTGATCAAACAAATCGGCTATGAGGGTCTGGAAAGCGGTCACGGATTCAGGCATGAATTCAGCACGATTATGAACGAGCACGAATGGCCTGCTGACGCTATTGAAGTGCAACTGGCACATGCAAACGGCGGATCTGTGCGTGGGATTTACAACCATGCTCAGTATCTCGATAAACGCAGAGAAATGATGCAATGGTGGGCGGACTGGCTTGATGAGAAGGTGGAGTGAGCGACCTTAACAACTATCGAATAGCACAAAGTCTTGCAATCCAGTGCAAAGCTTTGTGTGTATAAGTTTTGTCTCATCAACCACAGCAAGTATCGATCGATTAAGACTTGGATGATAGACTTCATTCCTTTGATTATTAGCTGATAGAAGAAATGTTAAAGCTATTTGCAAAGTACACCTCTATTGGTGTGCTGAACACCCTTATACACTGGGTGGTTTTTGGTGTTTGTATCTATGTCGCGCATACAAACCAAGCTCTTGCAAACTTCGCAGGTTTCGTTGTGGCTGTGAGCTTTAGCTTCTTCGCGAATGCAAAATTCACATTCAAGGCATCGACTACAACGATGCGCTACATGCTATATGTTGGGTTCATGGGGACACTGAGTGCTACTGTTGGATGGGCTGCTGATAGATGCGCACTTCCCCCGATGATAACTCTTGTCACCTTCTCCGCCATCAGCCTGGTGTGCGGTTTCGTCTATTCAAAGTTCATTGTCTTTAGGGATGCGAAATGAAGATATCTCTTGTAGTTCCTGTCTTCAATGAAGAAGAAGCGATACCAATTTTTTATAAAACGGTACGTGAATTCGAAGAATTGAAGTCATATGAAGTGGAAATCGTTTTCATAAATGACGGCAGCAAAGACGCTACGGAGTCAATCATTAATGCTCTGGCTGTTTCAGATCCTCTAGTTGTTCCGCTGTCATTTACACGCAACTTTGGTAAAGAACCAGCATTGTTTGCAGGGTTAGACCATGCAACCGGGGATGCGATAATCCCAATTGATGTTGACCTGCAAGACCCGATTGAGGTTATTCCTCATCTTATTGAAAAATGGCAAGCAGGTGCTGATATGGTTCTTGCTAAAAGATCTGACCGCTCAACTGATGGACGCCTGAAGCGAAAAACGGCTGAGTGGTTCTATAAGCTCCACAATAAAATAAGCAATCCTAAAATTGAAGAGAATGTTGGTGATTTCAGGCTGATGAGCCGTGATGTTGTCGAAAATATTAAACTTATGCCAGAACGAAACCTTTTCATGAAAGGTATTCTGAGCTGGGTAGGAGGAAAGACAGATATTGTTGAATACGTGCGAGCGGAAAGAATTGCTGGAGATACAAAATTTAATGGATGGAAACTTTGGAATTTAGCACTTGAGGGTATTACAAGCTTTTCCACATTCCCTCTTCGCATCTGGACATACATAGGGTTAGTGGTAGCCAGTGTAGCATTTATTTATGGGGCGTGGATGATTTTAGATACTATCATATTTGGAAATGCTGTTAGGGGATATCCTTCACTACTTGTTTCAATACTGTTTTTAGGTGGAATTCAGATGATTGGAATAGGAGTATTAGGTGAATATATTGGACGCACATACATTGAAACCAAAAAACGCCCGAAATACATCATCAAGAGAGTCAAAAAATGAATAAAGCAATAAAAGTATCATTGTATATATCTTTTGTTTTGATTATTTGCGCCTTATCTAAAAACATAATGATGTTAAATACATCTGATTTCGGAAGAGCCATTAAGCCATTAATTGAAGACATACCAGCATTTACATATGACTTACCTTTATTGTATAAATTGAAAGGTCATATTGATTCAATTGATAGCTATGAGTATATAAGTTCATATAGTTATATTTTGTATACATACGTCCTGTTTATTAGCATTTTTACTGAATATCTTGATGCTAGGGTGTTATCGTTATTTCTAAAAGTAATATATATTTATTCATTATATGCGATATTTACTTCATATATAAAAACAGAAAGGTATGTAACTTTATTTACATTCTTTATTTTAGCTTTTCTTATGTGTTCTTCATCAACACTGTCAATGTTTGCATCATTCTATCAAGAGCAAATAGTTATAATTTTCCTTCCATTTTTGGTGTATTCATTAACATGCAAAAACAATAAATCTATGCTTTTGCTATTTTTTTCGTTGCTAATAATATCTACTGCTAAAAATCAATTTATATTAACCCCACTAATAGTGTATTCATATTATATTTTTTTTGATAGACACAAACTAATTATTAAATCTGTAATATGCGTGGTGTGCTTGCTTGCGTCAATATTTGCAATATCTTATTCAAAAGGTGTTGTTGAATTAAATAAGTACCATGCAACATACTTCGGTAGTTATCTTTATATGAAAAACAACGGGTATAAAATGCCATCGTATGTTGATGATAAGTGTGTTGGGTTAGATGCCTGGGGTAATAAATTCGACATATCATTTGGCGCAACCCCAACAGAAGTTGGAACGGAATGTTTCGAATCTCATAAAGATGAAACGTTTTCGAATGCACTCTTTTTATTGGTTAGCAAACCAAGCACCATCTTCAAACTTCCATTTGATGATGGTGTGATGTCTCAGTATAAAGAAAATTATTTCCATGTATATAAAAAACTACACGTAATATATGGAGAATCAAACATACTAACGACTATTACTAACATAAAAGACAATATATTTAAAAACATTAGATTTATATCATTGTTATTATTTTTTATTGCTTCTATTTTTATTAGAAATAATAAAATAAAGGCATCTTTATTTGTAGTATCTCTTTTTGGAATATCTCAATTTTATGTGTCATTTTTCGGGGAAGGATATAGAGATTTAAGCAAGCATTTATTTGGAATGTATTTTTCGTTCGACCTTTGCTTATACATAACAGTCGTTTTTTTAATTTATAAAATAATTCAAAGAAATCAAGACAATAGCGATGTAAAGCACTAAGTTTAAATTGCGCGCCAATCATGGCGCGCACAAGCTATAATACCAACCTAATTTCTCCTCCTCTTAGAGTGACTATATCTCCTGATAGAATTGCGGTATTGACTATCAAATGCCCTGATTCGTTGTTTATTGTAATATCTCCTCTATCTGCAGACGATAACTTAAATGCATCATTGCCCACAACAAACCCCCTCCAGAACCAAGTGCTGATATTATCATCAACAGTGATAGATACATATACTAACTGATTATCGTTATAAGTGATTCCTGTCTTATACTTAACATAAGGACTTCCACTTTGATTCTCGATAGACACATAACATCCAGGGGTTATGTTTGTATGCGTCCCGCGACTATCGCCCCATTAACGCCATACGATAAATGGGATGGTGAGAAATGGGTGACGGATACCGAGGCACAGCATAGCGTCGCAGTAGATGCAGCAGAAGCACAGCGCCAGTCGCTGATTGATACTGCAATGGCTTCCATTAGTCTGATTCAACTGAAATTACAGGCTGGGCGGAAGCTGATGCAGGCAGAGACCTCCCGACTTAACACTGTGCTGGATTACATTGACGCGGTGACGGCAACAGATACCAGCACCGCGCCGGATGTCATCTGGCCTGAACTGCCGGAGGAGTAGGCCATTCAATATCTGGCGCACTGGAAGTATCGACCAGCTCCAGTGCGTCCAGATAATCCAGCCACAAATTATATTGCGCCAGTTCCTCACCTTTCAGACGACCAATAGCCGCTTTACCAGCCCATTGTTTACTGTTCATATAATCGTTGGCCTGATTAATCAATTGCTGCTTTTTCAGTTCGGCTGCAGCAATCTGTTCCTCATGTGTTGGTGGTGGAATTTCAGACCATGCAGGAAAACCATTTTCTCCAGCGATACGGATTTTTCCTTTCGGCGGTAATCCGGAAAACTCAATATACACTTGCTCATCAACTTCAACAGCATCATCTGGCCATGAGCCAGCTTGCGTGTAATCCTCTTTCATCTCCAAGGGATAGAAAGAGTTTGTAGTCGCGGAATATATGTAATTCATTTTTCACTCCATAAAGTTAAAAGAAATTAACACCCTAATGCGAAAAATGAAGCACCGATACCGGGTACGCCTGCTCTGGAAATAAATTTCACCGGGTCCTGGTTATAACCGGCACAAGCTATATAGCCAACATTTGCACTGCCGGGAGTGTAATCCTGAGTCGCAAATACCCGCAGACATCTATTCGGAAATGCAATCGGAAAATAGGTTACTGTGTCCTGAGACGTCAGCGGAACATCAATTGGCCCCCATTGAATAATTAAACCGGATGGCAATTTTTGATATCCAGGAACTGAAGCAGAAAGCATAAAACTACCCATATCAGGTATCTGATTCGCCCCTGTCCCTACATTTCTTTTAGCCGCTTCTCCCAAACCAAGGTTTTCGAGAGCCTTTTGCACCGTGCCGTCCAATTTGATATCGCCAAACGGATTCTTGCGGCTTAACAGCAGCGCACGAAGCGCGGTAAGCAGCTGGTCATGCCGCCCCTTCTCCAGGCTGGCACCGGAGGCCTCCACAACGCTGCAAAGCTCCTCCTGCAACATGTCAAAGTAGTCATCATCCAGATCGGTGGCAGGCGTGCCGGTCTGGGGGTTACCACGGGTACGGGGGATTACCAAAGGCAGCACCTTTAAGCTCCGCAAGACGTTCTGACCAGTCATGCGCCAGCGCGTTGTCTTCCGCAGTGTAATACGCGGCACATTTGGCGTTATCACCATCAGTAAACAGATCCAGAACAAACGGGCCAAACAGGGTGTTAATTCCCCAGAAAATGTTGTCCGGCGTGCGCCACTGATCGCCCACTTCCTTCAGTTCATGGGCTGGTTTGTTCCGCAGCTCCACCAGCGCCTGGCAATATTTATTACTCATTAAGCCCCCACGTAATTCCCTGAGAGATACCACTCTTCACCTGATGCAGCCCGCTTACTGCTTTTCCGTAAACACCGTTCACGACGCGCCAGAAAATTGTTTCGTTCTGGCTGGGAGTGGCTTTCACGGAATGCCGCCATCCACACCGTTGCAGCACGACGGTATAAGCCCCTGGACTCCAGTTCTTCCGCCTGGCGGGTCAGGCACAAAATCACCCGCGGGTCGTTAGTGCCGACATAGAAATTGCGCACAGGTCTGGTTTCACGAACTGGTTGTGGTTCCGGATCCTGCGCTCTCTCAGTCAGGCGCGGGAAATGTCTGTGTGTATCTCCTTCACAACGGTGAGCCACACGCCCACTCTGACGTAACTTGCTTGCTGACTGCAGAACGCGCTGCCGTGAGTAACCTGCAAAAGCATCCGCAATGTCTCCGGAAGTACAGCCCGGATGGGCTTCAATGAATTTCTGAACGTCATTCAAAAGACTCATGCTCACCCCCTGAATCCTGCCGGGATCTGGCTGTAGTCCACATTGTCGTAACTGGCTTTGAAGTACGGGTCTTCGCGTTTTTCTGTGTACGTGCTGACGGACGGCGATAAGCGCAGGGAAAGCTCATCCCATTTTTCCCGCAGCTTCGACGGGCTGAGCACGTTACGGCACCAGAACGGATCGCGGCTGACGCGGCTGTACATCTCGCAGATTTGTTTGTGAGTACGACCATCCTGCACACACATCAGGCGAATTTCGTTTGCCCAGGCTGTCCAGTTCGGTTCTTTGGGACGAACCACCTCGCCGTCACATTCGGCGGCCTGCTCGTACAGGGCGATGATTTTTTTCCAGAGCCACTGTGCGCAGGTCAAATCATCCTGCGTTCCCCACTGGCGCTTTTTAGGGCTGAATACAACCGCATCAGGATGGCGAGTTAAAAAATCCTGTTCATCCGTCTGCGTGTCCGGTTGCGAAGCGTCCGGACGAGAAGGTTTTTTATCTGACGGATCATGTTTTGATTTTACTGACGGATCCCCGCCAGATTCTGACGGGTGAAAACCCGATTTTTTGCCAGATTTCGACGCATCAAATTTTGACGGGTCAGATTTTGATGCGTCAGATTTTGACGGGTCAGAGTCTGACAGTTGAGAAAATGCCGCTGCCTGAAGCTTCGCAACGTTAAGCTGATAAACATTCGACGCATTGCGGTTATGATGAGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCAGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCACTCAACCTGGAAATCGAACCATTTGATGAGAACCGTCTGAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGTGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAAGACCAGGGACAATTACTGACTCTTGCCGGTGAGCCTGACGGAAAACTCCGCGCAGCAGGTCATTAATATCATTCTTAATTAACTAATTATTTATCTCATCACTGAATATCTTAATATAGTGAGGACTTATTATGTCTCAGAACTTAGACGCAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTATTCTAAAGATCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCTGATAATATGCGTGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACTCTCAAACTGAAAAAGACAGCACCGTTCTCTGCCCTGTTGTCTGTTAACGGCGAGCGTAACTCCCAGAAATCACTGGCAGAATGGATTGAAGACTGGGCCGACTACCTTGTGGGCTTTGATGCTAATGGTGACGCCATTCAGGCAACAAAAGCGGCTGCGGCAATCCGTAAAATCACGATTGAAGCAAACCAGACCGCTGATTTTGAAGATAATGACTTCAGCGGCAAACGCTCCCTGATGGAATCTGTCGAAGCGAAGACCAAAGACATTATGCCAGTGGCATTTGAATTTAAATGCGTTCCGTTTGAAGGTCTGAAAGAACGTCCGTTTAAATTACGCCTCAGCATTATCACTGGCGATCGTCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCGGTGCAGGAAGATATGGCTAACGAATTTCGTGATCTGCTTGTTGAGAAATTCAAAGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCATTTATGGAAACGTAATTAACTCAATAATCACCGGATGGTGAGGGCTTCCTTTTACCCAAACTCAGCGCGGTGCAGCGCATATACGTGGAGAACAAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAAGATAAATAAAGACGACATCGTGATTAACGATATCGCGGTTTCCCTTTCAAATATCTGCCGCTTTGCCGGTCATCTTTCTCACTTCTACAGTGTCGCCCAACATGCGGTGCTTTGCAGCCAGCTGGTGCCGCAGGAATTTGCTTTTGAAGCATTAATGCATGATGCAACAGAAGCGTATTGCCAGGACATCCCCGCACCACTGAAACGCCTTCTTCCTGACTATAAACGGATGGAAGAAAAAATAGATGCAGTAATCCGTGAGAAATACGGGTTACCTCCTGTTATGAGCACGCCAGTGAAATATGCCGATCTCATTATGCTGGCAACCGAACGCCGTGATCTCGGGCTTGATGATGGCTCTTTCTGGCCTGTACTGGAAGGCATCCCGGCAACAGAGATGTTCAACGTGATTCCACTGGCACCGGGTCATGCCTACGGGATGTTTATGGAACGTTTTAACGATTTATCGGAGTTACGCAAATGCGCATGAATGTTTTCGAAATGGAAGGGTTTCTTCGCGGGAAATGTGTACCGCGAGATCTGAAAGTGAACGAAACAAATGCTGAGTACCTGGTACGTAAATTCGACGCGCTTGAAGCTAAATGTGCGGCACTGGAAAACAAAATAATACCAGTGTCAGCTGAACTGCCACCAGCAAATGAAAGTGTTCTGTTATTTGATGCTAACGGAGAAGGCTGGCTGATTGGCTGGCGTTCTCTCTGGTACACCTGGGGACAAAAAGAAACCGGAGAATGGCAGTGGACATTTCAGGTTGGGGACCTTGAAAACGTCAATATCACTCACTGGGCAGTAATGCCAAAAGCACCGGAGGCTGGAGCATAATGACCACATTTACCAATAAAGAACTGATTAAAGAAATCAAAGAACGAATCAGCAGCCTAGAGGTTCGAGACGATATTGAGCGCCGTGCTTATGAAATCGCACTCGTATCTCTGGAAGTAGAGCCAGATGAACGCGAAGCCTATGAATTATTCATGGAAAAGCGTTTCGGTGACTTAGTAGATCGTCGGAGAGCAAAAAACGGCGATAACGAATACATGGCATGGGATATGACTCTCGGTTGGATCATCTGGCAGCAACGAGCTGGTATCCATTTTTCAACAATGTCACAACAAGAGGTGAAATAATAGAGCCATACAGCCTCACACTCGATGAGGCCTGTCAGTTTCTTAAAATATCCTGATCTACCATCGCCGTCATAGAGCGTATTTTTATTACCTGATTTGCAGGTTCGATTCCCTATTCGGAGATAGCACTCATGCAACACGAACTACAGCCTGATTCACTGGTTGATTTGAAATTCATCATGGCTGATACTGGCTTTGGTAAAACCTTCATCTATGACCGGATTAAGTCAGGCGACCTGCCAAAAGCCAAAGTTATCCACGGGCGAGCAAGATGGTTATATCGTGACCATTGTGAATTCAAAAATAAGCTCTTAAGCCGCGCCAATGGGTAA
Protein sequences of DBSCAN-SWA_8 >CP028702|2899188:2910398|2900432_2901590_+|AVZ49935.1|integrase|DBSCAN-SWA MLTVKQIEAAKPKEKPYRLLDGNGLYLYVPVSGKKVWQLRYKIDGKEKILTVGKYPLMTLQEARDKAWTARKDISVGIDPVKAKKASSNNNSFSAIYKEWYEHKKQVWSVGYATELAKMFDDDILPIIGGLEIQDIEPMQLLEVIRRFEDRGAMERANKARRRCGEVFRYAIVTGRAKYNPAPDLADAMKGYRKKNFPFLPADQIPAFNKALATFSGSIVSLIATKVLRYTALRTKELRSMLWKNVDFENRIITIDASVMKGRKIHVVPMSDQVVELLTTLSSITKPVSEFVFAGRNDKKKPICENAVLLVIKQIGYEGLESGHGFRHEFSTIMNEHEWPADAIEVQLAHANGGSVRGIYNHAQYLDKRREMMQWWADWLDEKVE >CP028702|2899188:2910398|2907919_2908744_+|AVZ49944.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAIRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEDMANEFRDLLVEKFKDSKVETFIGTFTA >CP028702|2899188:2910398|2902101_2903022_+|AVZ49937.1|DBSCAN-SWA MKISLVVPVFNEEEAIPIFYKTVREFEELKSYEVEIVFINDGSKDATESIINALAVSDPLVVPLSFTRNFGKEPALFAGLDHATGDAIIPIDVDLQDPIEVIPHLIEKWQAGADMVLAKRSDRSTDGRLKRKTAEWFYKLHNKISNPKIEENVGDFRLMSRDVVENIKLMPERNLFMKGILSWVGGKTDIVEYVRAERIAGDTKFNGWKLWNLALEGITSFSTFPLRIWTYIGLVVASVAFIYGAWMILDTIIFGNAVRGYPSLLVSILFLGGIQMIGIGVLGEYIGRTYIETKKRPKYIIKRVKK >CP028702|2899188:2910398|2907491_2907854_+|AVZ49943.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRLKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGH >CP028702|2899188:2910398|2901742_2902105_+|AVZ49936.1|DBSCAN-SWA MLKLFAKYTSIGVLNTLIHWVVFGVCIYVAHTNQALANFAGFVVAVSFSFFANAKFTFKASTTTMRYMLYVGFMGTLSATVGWAADRCALPPMITLVTFSAISLVCGFVYSKFIVFRDAK >CP028702|2899188:2910398|2905999_2906275_-|AVZ49941.1|DBSCAN-SWA MSNKYCQALVELRNKPAHELKEVGDQWRTPDNIFWGINTLFGPFVLDLFTDGDNAKCAAYYTAEDNALAHDWSERLAELKGAAFGNPPYPW >CP028702|2899188:2910398|2903018_2904350_+|AVZ49938.1|DBSCAN-SWA MNKAIKVSLYISFVLIICALSKNIMMLNTSDFGRAIKPLIEDIPAFTYDLPLLYKLKGHIDSIDSYEYISSYSYILYTYVLFISIFTEYLDARVLSLFLKVIYIYSLYAIFTSYIKTERYVTLFTFFILAFLMCSSSTLSMFASFYQEQIVIIFLPFLVYSLTCKNNKSMLLLFFSLLIISTAKNQFILTPLIVYSYYIFFDRHKLIIKSVICVVCLLASIFAISYSKGVVELNKYHATYFGSYLYMKNNGYKMPSYVDDKCVGLDAWGNKFDISFGATPTEVGTECFESHKDETFSNALFLLVSKPSTIFKLPFDDGVMSQYKENYFHVYKKLHVIYGESNILTTITNIKDNIFKNIRFISLLLFFIASIFIRNNKIKASLFVVSLFGISQFYVSFFGEGYRDLSKHLFGMYFSFDLCLYITVVFLIYKIIQRNQDNSDVKH >CP028702|2899188:2910398|2904964_2905405_-|AVZ49940.1|tail|DBSCAN-SWA MNYIYSATTNSFYPLEMKEDYTQAGSWPDDAVEVDEQVYIEFSGLPPKGKIRIAGENGFPAWSEIPPPTHEEQIAAAELKKQQLINQANDYMNSKQWAGKAAIGRLKGEELAQYNLWLDYLDALELVDTSSAPDIEWPTPPAVQAR >CP028702|2899188:2910398|2904648_2904993_+|AVZ49939.1|tail|DBSCAN-SWA MILDRHITSRGYVCMRPATIAPLTPYDKWDGEKWVTDTEAQHSVAVDAAEAQRQSLIDTAMASISLIQLKLQAGRKLMQAETSRLNTVLDYIDAVTATDTSTAPDVIWPELPEE >CP028702|2899188:2910398|2906274_2906769_-|AVZ49942.1|DBSCAN-SWA MSMSLLNDVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTHRHFPRLTERAQDPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA >CP028702|2899188:2910398|2909760_2910066_+|AVZ49947.1|DBSCAN-SWA MTTFTNKELIKEIKERISSLEVRDDIERRAYEIALVSLEVEPDEREAYELFMEKRFGDLVDRRRAKNGDNEYMAWDMTLGWIIWQQRAGIHFSTMSQQEVK >CP028702|2899188:2910398|2905431_2905950_-|AVZ51686.1|DBSCAN-SWA MLQEELCSVVEASGASLEKGRHDQLLTALRALLLSRKNPFGDIKLDGTVQKALENLGLGEAAKRNVGTGANQIPDMGSFMLSASVPGYQKLPSGLIIQWGPIDVPLTSQDTVTYFPIAFPNRCLRVFATQDYTPGSANVGYIACAGYNQDPVKFISRAGVPGIGASFFALGC >CP028702|2899188:2910398|2909398_2909761_+|AVZ49946.1|DBSCAN-SWA MRMNVFEMEGFLRGKCVPRDLKVNETNAEYLVRKFDALEAKCAALENKIIPVSAELPPANESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA >CP028702|2899188:2910398|2908871_2909408_+|AVZ49945.1|DBSCAN-SWA MSFIKTFSGKHFYYDKINKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFNVIPLAPGHAYGMFMERFNDLSELRKCA >CP028702|2899188:2910398|2899188_2900121_+|AVZ49934.1|DBSCAN-SWA MDNDKIDQHSDEIEVESEEKERGKKIEIDEDRLPSRAMAIHEHIRQDGEKELERDAMALLWSAIAAGLSMGASLLAKGIFQVELEGVPGSFLLENLGYTFGFIIVIMARQQLFTENTVTAVLPVMQKPTMSNVGLLIRLWGVVLLGNILGTGIAAWAFEYMPIFNEETRDAFVKIGMDVMKNTPSEMFANAIISGWLIATMVWMFPAAGAAKIVVIILMTWLIALGDTTHIVVGSVEILYLVFNGTLHWSDFIWPFALPTLAGNICGGTFIFALMSHAQIRNDMSNKRKAEARQKAERAENIKKNYKNPA >CP028702|2899188:2910398|2910197_2910398_+|AVZ49948.1|DBSCAN-SWA MQHELQPDSLVDLKFIMADTGFGKTFIYDRIKSGDLPKAKVIHGRARWLYRDHCEFKNKLLSRANG |
16 | Enterobacteria_phage(56.25%) | integrase,tail | attL 2897163:2897179|attR 2914073:2914089 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
3292198 : 3299337
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >CP028702|3292198:3299337|DBSCAN-SWA CATGAGTGCAATAGAAAATTTCGACGCCCATACGCCCATGATGCAGCAGTATCTCAGGCTGAAAGCCCAGCATCCCGAGATCCTGCTGTTTTACCGGATGGGTGATTTTTATGAACTGTTTTATGACGACGCAAAACGCGCGTCGCAACTGCTGGATATTTCACTGACCAAACGCGGTGCTTCGGCGGGAGAGCCGATCCCGATGGCGGGGATTCCCTACCATGCGGTGGAAAACTATCTCGCCAAACTGGTGAATCAGGGAGAGTCCGTTGCCATCTGCGAACAAATTGGCGATCCGGCGACCAGCAAAGGTCCGGTTGAGCGCAAAGTTGTGCGTATCGTTACGCCAGGCACCATCAGCGATGAAGCCCTGTTGCAGGAGCGTCAGGACAACCTGCTGGCGGCTATCTGGCAGGACAGCAAAGGTTTCGGCTACGCGACGCTGGATATCAGTTCCGGGCGTTTTCGCCTGAGCGAACCGGCTGACCGCGAAACGATGGCGGCAGAACTGCAACGCACTAATCCTGCGGAACTGCTGTATGCAGAAGATTTTGCTGAAATGTCGTTAATTGAAGGCCGTCGCGGCCTGCGCCGTCGCCCGCTGTGGGAGTTTGAAATCGACACCGCGCGCCAGCAGTTGAATCTGCAATTTGGGACCCGCGATCTGGTCGGTTTTGGCGTCGAGAACGCGCCGCGCGGACTTTGTGCTGCCGGTTGTCTGTTGCAGTATGCGAAAGATACCCAACGTACGACTCTGCCGCATATTCGTTCCATCACCATGGAACGTGAGCAGGACAGCATCATTATGGATGCCGCGACGCGTCGTAATCTGGAAATCACCCAGAACCTGGCGGGTGGTGCGGAAAATACGCTGGCTTCTGTGCTCGACTGCACCGTCACGCCGATGGGCAGCCGTATGCTGAAACGCTGGCTGCATATGCCAGTGCGCGATACCCGCGTGTTGCTTGAGCGCCAGCAAACTATTGGCGCATTGCAGGATTTCACCGCCGGGCTACAGCCGGTACTGCGTCAGGTCGGCGACCTGGAACGTATTCTGGCACGTCTGGCTTTACGAACTGCTCGCCCACGCGATCTGGCCCGTATGCGCCACGCTTTCCAGCAACTGCCGGAGCTGCGTGCGCAGTTAGAAACTGTCGATAGTGCACCGGTACAGGCGCTACGTGAGAAGATGGGCGAGTTTGCCGAGCTGCGCGATCTGCTGGAGCGAGCAATCATCGACACACCGCCGGTGCTGGTACGCGACGGTGGTGTTATCGCATCGGGCTATAACGAAGAGCTGGATGAGTGGCGCGCGCTGGCTGACGGCGCGACCGATTATCTGGAGCGTCTGGAAGTCCGCGAGCGTGAACGTACCGGCCTGGACACGCTGAAAGTTGGCTTTAATGCGGTGCACGGCTACTACATTCAAATCAGCCGTGGGCAAAGCCATCTGGCACCCATCAACTACATGCGTCGCCAGACGCTGAAAAACGCCGAGCGCTACATCATTCCAGAGCTAAAAGAGTACGAAGATAAAGTTCTCACCTCAAAAGGCAAAGCACTGGCACTGGAAAAACAGCTTTATGAAGAGCTGTTCGACCTGCTGTTGCCGCATCTGGAAGCGTTGCAACAGAGCGCGAGCGCGCTGGCGGAACTCGACGTGCTGGTTAACCTGGCGGAACGGGCCTATACCCTGAACTACACCTGCCCGACCTTCATTGATAAACCGGGCATTCGCATTACCGAAGGTCGCCATCCGGTAGTTGAACAAGTACTGAATGAGCCATTTATCGCCAACCCGCTGAATCTGTCGCCGCAGCGCCGCATGTTGATCATCACCGGTCCGAACATGGGCGGTAAAAGTACCTATATGCGCCAGACCGCACTGATTGCGCTGATGGCCTACATCGGCAGCTATGTACCGGCACAAAAAGTCGAGATTGGACCTATCGATCGCATCTTTACCCGCGTAGGCGCGGCAGATGACCTGGCGTCCGGGCGCTCAACCTTTATGGTGGAGATGACTGAAACCGCCAATATTTTACATAACGCCACCGAATACAGTCTGGTGTTAATGGATGAGATCGGGCGTGGAACGTCCACCTACGATGGTCTGTCGCTGGCGTGGGCGTGCGCGGAAAATCTGGCGAATAAGATTAAGGCATTGACGTTATTTGCTACCCACTATTTCGAGCTGACCCAGTTACCGGAGAAAATGGAAGGCGTCGCTAACGTGCATCTCGATGCACTGGAGCACGGCGACACCATTGCCTTTATGCACAGCGTGCAGGATGGCGCGGCGAGCAAAAGCTACGGCCTGGCGGTTGCAGCTCTGGCAGGCGTGCCAAAAGAGGTTATTAAGCGCGCACGGCAAAAGCTGCGTGAGCTGGAAAGCATTTCGCCGAACGCCGCCGCTACGCAAGTGGATGGTACGCAAATGTCTTTGCTGTCAGTACCAGAAGAAACTTCGCCTGCGGTCGAAGCTCTGGAAAATCTTGATCCGGATTCACTCACCCCGCGTCAGGCGCTGGAGTGGATTTATCGCTTGAAGAGCCTGGTGTAATAACAATTCCCGATAGTCTTTTGCTATCGGGAATATTAACGACAACTGACGAATAAAATAAAAACACCCTGTATAATAGGAAAGCTTATTTTACAGGGTAAAACCATGCCATCTACACGCTATCAAAAAATCAATGCCCATCACTATCGCCATATATGGGTCGTTGGTGATATTCATGGTGAATATCAGTTATTACAATCCCGCTTACATCAACTCTCTTTTTTCCCCAAAATCGACTTACTTATTTCTGTCGGCGATAATATTGATCGTGGACCGGAGAGTCTTGACGTCCTGCGCCTGCTAAACCAACCCTGGTTTACGTCGGTTAAAGGCAACCACGAAGCGATGGCGCTTGAGGCATTCGAAACTGGCGATGGCAATATGTGGCTTGCCAGCGGTGGTGACTGGTTTTTCGATTTAAATGATTCAGAGCAACAAGAGGCAATAGATCTGTTGCTGAAATTCCATCACCTTCCACATATTATTGAAATCACTAACGACAACATAAAATATGCCATCGCACATGCAGATTATCCGGGGAGTGAATATCTCTTTGGTAAAGAAATAGCGGAGAGCGAATTACTCTGGCCTGTTGATCGTGTGCAGAAATCGCTTAATGGCGAGTTACAACAAATAAACGGCGCTGATTATTTTATATTTGGACATATGATGTTTGATAACATTCAGACGTTCGCTAACCAGATTTATATTGATACCGGATCGCCGAACAGCGGGCGGCTGTCATTTTATAAAATAAAGTAGTCTCATGCTTCTTCTGTGAAGCATGAGTAACCCGGTGTTATTGCAGGCCATTATTCATTTTTCGCTACCAGCAAAGAGAGATCCTGCTTCACCAGCGCGCGACTGGCACTCTCCGGCAAACCGTCGTCTGTAATAATCTGATCAAACTCGCTTAATGGTAACGCCAGCCATGTCGCCACCTGACCATATTTCGTCGCATCACAGACCAAAACTCGCTGGCGGCTGGCACTGGCAATCGCCCGTTTCACCGTGACTTTATCTTCCGCTGGCGTAGAAATCCCCCGCACACTCCATGACGATGCAGAAATAAAAGCCTGATCAATCATCAGGCTGCGCAGCATGGTCGCAGCGGCTTCCCCGACACAGGAACGGTTTTCCCGACACACTGCACCGCCAGTGTGAATAATTGTGCAATTACTGTTGTCGAGCAAGTAGTCCGCAATAACGAAATCGTTTGTGACCACAGTCAGTGACTCCATGTGAATCAGATGCTGTGCTATCGCTAACGTGGTCGTTCCCGCATCCAGATAGATACAACTTCCCGGCTGAACAAGACTTGCCGCCAGCTTGCCAATAGCCGCTTTTTGCGTCATTGCCAGCGCAGTTTTTACCTGATGAGAAGGTTCATGCGCCACGCGTCCCGGAGACTGGACGCCTCCGGACACCAGCACAACGGCTCCCTGCTGCTCCAGTTTTTGTAAATCCCGACGAATGGTCATATGTGACACATTCATTCTGTCCGTTAGTTCAGCAATACTGACAATGCCTTTTTCAGCTACCATCTCAAGGATGATTTGGCGACGCTCTACGGGTATCAACTTTTGCTCCTTCCTTTGTCCTGCTGACATTCTACGCTATTTGCCTGCGAAACGTGCGCGGCGCAACTAACGCTTAGTTCACATAAAATAACACACAATGTTAATTTATGTGAATCAGATCACCATACCGTTATCTTCCAGCGCTTATATTCACAATATCAAACAAAATATCACTTAAATTAACAAGGAGAGCAGATGAAAACGGGATCTGAGTTTCATGTCGGTATCGTTGGCTTAGGGTCAATGGGAATGGGAGCAGCACTGTCATATGTCCGCGCAGGTCTTTCTACCTGGGGCGCAGACCTGAACAGCAATGCCTGCGCTACGTTGAAAGAGGCAGGTGCTTGCGGGGTTTCTGATAACGCCGCGACGTTTGCCGAAAAACTGGACGCACTGCTGGTGCTGGTGGTCAATGCGGCCCAGGTTAAACAGGTGCTGTTTGGTGAAACAGGCGTTGCACAACATCTGAAACCCGGTACGGCAGTAATGGTTTCTTCCACTATCGCTAGTGCTGATGCGCAAGAAATTGCTACCGCTCTGGCTGGATTCGATCTGGAAATGCTGGATGCGCCAGTTTCTGGTGGTGCAGTAAAAGCCGCTAACGGTGAAATGACTGTCATGGCCTCCGGTAGCGATATTGCCTTTGAACGACTGGCACCCGTGCTGGAAGCCGTTGCCGGAAAAGTTTATCGCATAGGTGCAGAACCGGGACTAGGTTCGACCGTAAAAATTATTCACCAGTTGTTAGCGGGCGTACATATTGCTGCCGGAGCCGAAGCGATGGCACTTGCAGCCCGTGCGGGGATCCCGCTGGATGTGATGTATGACGTCGTGACCAATGCCGCCGGAAATTCCTGGATGTTCGAAAACCGGATGCGTCATGTGGTGGATGGCGATTACACCCCGCATTCAGCCGTCGATATTTTTGTTAAGGATCTTGGTCTGGTTGCCGATACAGCCAAAGCCCTGCACTTCCCGCTGCCATTGGCCTCAACAGCATTGAATATGTTCACCAGCGCCAGTAACGCGGGTTACGGGAAAGAAGACGATAGCGCAGTTATCAAGATTTTCTCTGGCATCACTCTACCGGGAGCGAAATCATGATCAAGATTGGCGTTATCGCCGATGATTTTACCGGCGCGACGGATATCGCCAGTTTTCTGGTGGAAAACGGTCTACCAACGGTACAAATTAACGGTGTTCCAACAGGTAAAATGCCGGAAGCAATCGACGCACTGGTGATCAGCCTGAAAACGCGCTCCTGTCCAGTGGTTGAAGCCACACAGCAATCGCTGGCGGCTCTGAGCTGGTTGCAACAGCAAGGTTGCAAACAGATCTATTTCAAATACTGCTCTACTTTCGACAGTACGGCGAAAGGTAATATTGGCCCGGTTACCGATGCCTTAATGGATGCTCTCGACACGCCGTTTACGGTCTTCTCTCCGGCCCTGCCGGTCAACGGACGTACGGTTTATCAGGGGTATTTGTTCGTAATGAATCAACTGCTGGCCGAATCCGGGATGCGCCATCACCCGGTAAATCCCATGACCGACAGCTATCTTCCCCGTCTGGTTGAAGCGCAATCCACAGGGCGCTGCGGCGTCGTTTCGGCACATGTTTTCGAACAAGGTGTGGATGCCGTTCGTCAAGAGCTGGCTCGCTTACAGCAAGAGGGCTACCGCTACGCGGTGCTTGATGCGCTGACCGAACACCATCTGGAAATTCAGGGAGAAGCCTTGCGCGATGCCCCACTGGTAACGGGCGGTTCTGGTCTGGCGATTGGCCTGGCCCGGCAGTGGGCGCAAGAAAACGGTAACCAGGCTCGCAAAGCAGGGCGTCCGCTCGCTGGGCGCGGCGTAGTGCTCTCCGGTTCATGCTCTCAAATGACCAACCGCCAGGTAGCACATTACCGTCAAATTGCACCAGCCCGTGAAGTTGATGTGGCACGCTGCCTCTCAATTGAAACTCTGGCCGCTTATGCACACGAACTGGCAGAGTGGGTTCTGGGCCAGGAAAGTGTACTTGCTCCACTGGTTTTTGCCACCGCCAGCACTGACGCATTGGCAGCAATTCAACAGCAATACGGTGCACAAAAAGCCAGTCAGGCAGTAGAAACACTGTTTTCTCAACTAGCGGCGCGGTTAGCAGCGGAAGGCGTGACACGCTTTATTGTCGCAGGCGGTGAGACCTCCGGCGTAGTCACACAGAGCCTGGGAATAAAAGGGTTTCATATTGGCCCAACCATTTCCCCGGCGTGCCGTGGGTAAACGCACTGGATAAGCCTGTCTCACTCGCCCTTAAATCTGGCAACTTCGGTGATGACGCCTTTTTTTCACGAGCCCAAAGAGAGTTTTTATCATGAGCGATTTCGCAAAAGTAGAGCAGTCTTTGCGAGAGGAGATGACGCGGATTGCCAGTTCATTCTTTCAGCGCGGCTATGCAACCGGTTCGGCTGGCAATCTGTCGCTGCTTTTACCTGACGGGAATTTACTGGCGACACCGACAGGTTCATGCCTGGGCAATCTCGATCCGCAGCGGCTTTCCAAAGTCGCCGCGGATGGCGAATGGTTAAGTGGTGACAAACCCTCGAAAGAGGTGCTCTTTCATCTGGCGCTGTATCGCAACAATCCGCGCTGTAAAGCGGTGGTGCATTTGCACAGCACATGGTCGACGGCGCTTTCCTGCCTGCAAGGGCTGGACAGCAGCAACGTTATTCGTCCGTTCACACCATACGTGGTGATGCGGATGGGAAATGTCCCGCTGGTGCCTTATTACCGACCGGGCGATAAACGCATCGCACAGGATCTGGCGGAACTGGCAGCAGACAATCAGGCTTTTTTACTGGCAAATCATGGCCCAGTGGTTTGCGGTGAAAGCCTGCAAGAAGCCGCCAACAATATGGAAGAGCTGGAGGAAACGGCAAAGCTGATTTTTATTCTCGGTGACCGCCCGATCCGTTATCTGACCGCAGGTGAAATTGCGGAATTAAGGAGTTAA
Protein sequences of DBSCAN-SWA_9 >CP028702|3292198:3299337|3296535_3297444_+|AVZ50299.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSYVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNAAQVKQVLFGETGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGAEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS >CP028702|3292198:3299337|3292198_3294760_+|AVZ50296.1|DBSCAN-SWA MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAGLQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >CP028702|3292198:3299337|3295572_3296340_-|AVZ50298.1|DBSCAN-SWA MIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGSCIYLDAGTTTLAIAQHLIHMESLTVVTNDFVIADYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALVKQDLSLLVAKNE >CP028702|3292198:3299337|3297440_3298607_+|AVZ50300.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQARKAGRPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSIETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPACRG >CP028702|3292198:3299337|3294865_3295522_+|AVZ50297.1|DBSCAN-SWA MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFFPKIDLLISVGDNIDRGPESLDVLRLLNQPWFTSVKGNHEAMALEAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDNIKYAIAHADYPGSEYLFGKEIAESELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGSPNSGRLSFYKIK >CP028702|3292198:3299337|3298698_3299337_+|AVZ50301.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS |
6 | Escherichia_phage(83.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|