Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP029122 | Escherichia coli strain AR434 chromosome, complete genome | 10 crisprs | RT,csa3,PD-DExK,cas3,cas7,cas5,cas6e,cas1,cas2,DEDDh,c2c9_V-U4,DinG | 0 | 24 | 12 | 0 |
CP029123 | Escherichia coli strain AR434 plasmid unnamed1, complete sequence | 0 crisprs | RT,DEDDh | 0 | 0 | 1 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029122_1 | 892263-892402 | Orphan |
NA
Consensus repeat of CP029122_1
|
1 spacers
spacers of CP029122_1
>1.1|892312|42|CP029122|CRISPRCasFinder ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT |
CRISPR arrays and Neighbor proteins around CP029122_1
The CRISPR arrays of CP029122_1 >merge|CP029122|1|892263-892402|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGTTTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA >CP029122|1|1|892263-892402|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT TTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA
>CP029122.1|AWF25016.1|891170_892211_+|putative-permease-family-protein MTGQSSSQAATPIQWWKPALFFLVVIAGLWYVKWEPYYGKAFTAAETHSIGKSILAQADANPWQAALDYAMIYFLAVWKAAVLGVILGSLIQVLIPRDWLLRTLGQSRFRGTLLGTLFSLPGMMCTCCAAPVAAGMRRQQVSMGGALAFWMGNPVLNPATLVFMGFVLSWGFAAIRLVAGLVMVLLIATLVQKWVRETPQTQAPVEIDIPEAQGGFFSRWGRALWTLFWSTIPVYILAVLVLGAARVWLFPHADGTVDNSLMWVVAMAVAGCLFVIPTAAEIPIVQTMMLAGMGTAPALALLMTLPAVSLPSLIMLRKAFPAKALWLTGAMVAVSGVIVGGLALLF >CP029122.1|AWF25217.1|890462_891098_+|NADH(P)-binding-family-protein MSQVLITGATGLVGGHLLRMLINEPKVNAIAAPTRRPLGDMPGVFNPHDPQLTDALAQVTDPIDIVFCCLGTTRREAGSKEAFIHADYTLVVDTALTGRRLGAQHMLVVSAMGANAHSPFFYNRVKGEMEEALIAQNWPKLTIARPSMLLGDRSKQRMNETLFAPLFRLLPGNWKSIDARDVARVMLAESMRPEHEGVTILSSSELRKRAE >CP029122.1|AWF24733.1|889816_890335_-|intracellular-protease,-PfpI-family-protein MSKKIAVLITDEFEDSEFTSPADEFRKAGHEVITIEKQAGKTVKGKKGEASVTIDKSIDEVTPAEFDALLLPGGHSPDYLRGDNRFVTFTRDFVNSGKPVFAICHGPQLLISADVIRGRKLTAVKPIIIDVKNAGAEFYDQEVVVDKDQLVTSRTPDDLPAFNREALRLLGA >CP029122.1|AWF28008.1|889393_889837_+|hypothetical-protein METLIAISRWLAKQHVVTWCVQQEGELWCANAFYLFDAQKVAFYILTEEKTRHAQMSGPQAAVAGTVNGQPKTVALIRGVQFKGEIRRLEGEESDLARKAYNRRFPVARMLSAPVWEIRLDEIKFTDNTLGFGKKMIWLRDSGTEQA >CP029122.1|AWF24205.1|889040_889343_-|GIY-YIG-catalytic-domain-protein MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAFSAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD >CP029122.1|AWF24601.1|888550_889054_+|acetyltransferase-domain-protein MLIRVEIPIDAPGIDALLRRSFESDAEAKLVHDLREDGFLTLGLVATDDEGQVIGYVAFSPVDVQGEDLQWVGMAPLAVDEKYRGQGLARQLVYEGLDSLNEFGYAAVVTLGDPALYSRFGFELAAHHDLRCRWPGTESAFQVHRLADDALNGVTGLVEYHEHFNRF >CP029122.1|AWF25863.1|888032_888557_+|SCP-2-sterol-transfer-family-protein MLDKLRSRIVHLGPSLLSVPVKLTPFALKRQVLEQVLSWQFRQALDDGELEFLEGRWLSIHVRDIDLQWFTSVVNGKLVVSQNAQADVSFSADASDLLMIAARKQDPDTLFFQRRLVIEGDTELGLYVKNLMDAIELEQMPKALRMMLLQLADFVEAGMKNAPETKQTSVGEPC >CP029122.1|AWF25547.1|886828_887824_-|hypothetical-protein MELLCPAGNLPALKAAIENGADAVYIGLKDDTNARHFAGLNFTEKKLQEAVSFVHQHRRKLHIAINTFAHPDGYARWQRAVDMAAQLGADALILADLAMLEYAAERYPHIERHVSVQASATNEEAINFYHRHFDVARVVLPRVLSIHQVKQLARVTPVPLEVFAFGSLCIMSEGRCYLSSYLTGESPNTVGACSPARFVRWQQTPQGLESRLNEVLIDRYQDGENAGYPTLCKGRYLVDGERYHALEEPTSLNTLELLPELMAANIASVKIEGRQRSPAYVSQVAKVWRQAIDRCKADPQNFVPQSAWMETLGSMSEGTQTTLGAYHRKWQ >CP029122.1|AWF25010.1|885941_886820_-|peptidase-U32-family-protein MKYSLGPVLWYWPKETLEEFYQQAATSSADVIYLGEAVCSKRRATKVGDWLEMAKSLAGSGKQIVLSTLALVQASSELGELKRYVENGEFLIEASDLGVVNMCAERKLPFVAGHALNCYNAVTLKILLKQGMMRWCMPVELSRDWLVNLLNQCDELGIRNQFEVEVLSYGHLPLAYSARCFTARSEDRPKDECETCCIKYPNGRNVLSQENQQVFVLNGIQTMSGYVYNLGNELASMQGLVDVVRLSPQGTDTFAMLDAFRANENGAAPLPLTANSDCNGYWRRLAGLELQA >CP029122.1|AWF28294.1|884728_885736_-|luciferase-oxidoreductase,-group-1-family-protein MTDKTIAFSLLDLAPIPEGSSAREAFSHSLDLARLAEKRGYHRYWLAEHHNMTGIASAATSVLIGYLAANTTTLHLGSGGVMLPNHSPLVIAEQFGTLNTLYPGRIDLGLGRAPGSDQRTMMALRRHMSGDIDNFPRDVAELVDWFDARDPNPNVRPVPGYGEKIPVWLLGSSLYSAQLAAQLGLPFAFASHFAPDMLFQALHLYRSNFKPSARLEKPYAMVCINIIAADSNRDAEFLFTSMQQAFVKLRRGETGQLPPPIQNMDQFWSPSEQYGVQQALSMSLVGDKAKVRHGLQSILRETDADEIMVNGQIFDHQARLHSFELAMDVKEELLG >CP029122.1|AWF24274.1|892415_892991_-|BON-domain-protein MKALSPIAVLISALLLQGCVAAAVVGTAAVGTKAATDPRSVGTQVDDGTLEVRVNSALSKDEQIKKEARINVTAYQGKVLLVGQSPNAELSARAKQIAMGVDGANEVYNEIRQGQPIGLGEASNDTWITTKVRSQLLTSDLVKSSNVKVTTENGEVFLMGLVTEREAKAAADIASRVSGVKRVTTAFTFIK >CP029122.1|AWF26441.1|893000_893591_-|SIS-domain-protein MQERIKACFTESIQTQIAAAEALPDAISRAAMTLVQSLLNGNKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIANDRLHDEVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVALTGYDGGELAGLLGPQDVEIRIPSHRSARIQEMHMLTVNCLCDLIDNTLFPHQDD >CP029122.1|AWF25011.1|893610_894006_-|hypothetical-protein MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS >CP029122.1|AWF28395.1|893963_896000_-|lppC-lipofamily-protein MVPSTFSRLKAARCLPVVLAALIFAGCGTHTPDQSTAYMQGTAQADSAFYLQQMQQSSDDTRINWQLLAIRALVKEGKTGQAVELFNQLPQELNDSQRREKTLLAVEIKLAQKDFAGAQNLLAKITPADLEQNQQARYWQAKIDASQGRPSIDLLRALIAQEPLLGAKEKQQNIDATWQALSSMTQEQANTLVINADENILQGWLDLQRVWFDNRNDPDMMKAGIADWQKRYPNNPGAKMLPTQLVNVKAFKPASTNKIALLLPLNGQAAVFGRTIQQGFEAAKNIGTQPVAAQVAAAPAADVAEQPQPQTVDGVASPAQASVSDLTGEQPAAQPVPVSAPATSTAAVSAPANPSAELKIYDTSSQPLSQILSQVQQDGASIVVGPLLKNNVEELLKSNTPLNVLALNQPENIENRVNICYFALSPEDEARDAARHIRDQGKQAPLVLIPRSSLGDRVANAFAQEWQKLGGGTVLQQKFGSTSELRAGVNGGSGIALTGSPITPRATTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGEIAFIKPMIAMRNGSQSGATLYASSRSAQGTAGPDFRLEMEGLQYSEIPMLAGGNLPLMQQALSAVNNDYSLARMYAMGVDAWSLANHFSQMRQVQGFEINGNTGSLTANPDCVINRKLSWLQYQQGQVVPAS >CP029122.1|AWF26485.1|896064_896925_+|ribosomal-RNA-small-subunit-methyltransferase-I MKQHQSADNSQGQLYIVPTPIGNLADITQRALEVLQAVDLIAAEDTRHTGLLLQHFGINARLFALHDHNEQQKAETLLAKLQEGQNIALVSDAGTPLINDPGYHLVRTCREAGIRVVPLPGPCAAITALSAAGLPSDRFCYEGFLPAKSKGRRDALKAIEAEPRTLIFYESTHRLLDSLEDIVAVLGESRYVVLARELTKTWETIHGAPVGELLAWVKEDENRRKGEMVLIVEGHKAQEEDLPADALRTLALLQAELPLKKAAALAAEIHGVKKNALYKYALEQQG >CP029122.1|AWF25757.1|896967_898059_-|fimbrial-family-protein MKRAPLITGLLLISTSCAYASSGGCGADSTSGATNYSSVVDDVTVNQTDNVTGREFTSATLSSTNWQYACSCSAGKAVKLVYMVSPVLTTTGHQAGYYKLNDSLDIKTTLKANDIPGLVTDQTVSVNTRFTQIKSNTVYSAATQTGVCQGDTSRYGPVNIGANTTFTLYVTKPFLGSMTIPKTDIAVIKGAWVDGMGSPSTGDFHDLVKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQKFTTRNAMPDGFTPVDFDITYDCGDTSKIKNSLQMRIDGTTGVVDQYNLVARRRSSDNAPDVGIRIENLGGGVANIPFQNGILPVDPSGHGTVNMRAWPVNLVGGELETGKFQGTATITVIVR >CP029122.1|AWF26748.1|898069_900235_-|papC-N-terminal-domain-protein MDEIPALAEKDDDSVINSLEQIIPGTAAEFDFNHQRLNLSIPQIALYRDARGYVSPSRWDDGIPTLFTNYSFTGSDNRYRQGNRSQRQYLNMQNGANFGPWRLRNYSTWTRNDQTSSWNTISSYLQRDIKALKSQLLLGESATSGSIFSSYTFTGVQLASDDNMLPNSQRGFAPTVRGIANSSAIVTIRQNGYVIYQSNVPAGAFEINDLYPSSNSGDLEVTIEESDGTQRRFIQPYSSLPMMQRPGHLKYSATAGRYRADANSDSKEPEFAEATAIYGLNNTFTLYSGLLGSEDYYALGIGIGGTLGALGALSMDINRADTQFDNQHSFHGYQWRTQYIKDIPETNTNIAVSYYRYTNDGYFSFDEANTRNWDYNSRQKSEIQFNISQTIFDGVSLYASGSQQDYWGNNEKNRNISVGVSGQQWGIGYSLNYQYSRYTDQNNDRALSLNLSIPLERWLPRSRVSYQMTSQKDRPTQHEMRLDGSLLDDGRLSYSLEQSLDDDNNHNSSVNASYRSPYGTFSAGYSYGNDSSQYNYGVTGGVVIHPHGVTLSQYLGNAFALIDANGASGVRIQNYPGIATDPFGYAVVPYLTTYQENRLSVDTTQLPDNVDLEQTTQFVVPNRGAMVAARFNANIGYRVLVTVSDRNGKPLPFGALASNDETGQQSIVDEGGILYLSGISSKSQSWTVRWGNQADQQCQFAFSTPDSEPTTSVLQGTAQCH >CP029122.1|AWF26428.1|900732_901236_+|putative-transposase MPGNRPHYGRWPQHDFPPFKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRLRKTVVAHVFGERTLATLERLLSLLSAFEVVVWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ >CP029122.1|AWF25099.1|901256_901370_+|hypothetical-protein MQSMPLPVSTIGKQRLIMSANCFGVRLCPWWCRCGIP >CP029122.1|AWF27466.1|901392_901992_-|gram-negative-pili-assembly-chaperone,-C-terminal-domain-protein MIYDASRKEAALPVANKGAETPYLLQSWVDNIDGTSRAPFIITPPLFRLEAGDDSSLRIIKTADNLPENKESLFYINVRAIPAKKKSDNVNANELTLVFKTRIKMFYRPAHLKGRVNDAWKSLEFKRSDHSLNIYNPTEYYVVFAGLAVDKTDLTSKIEYIAPGEHKQLPLPASGGKNVKWAAINDYGGSSGTETRPLQ |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029122_2 | 937163-937280 | Orphan |
NA
Consensus repeat of CP029122_2
|
1 spacers
spacers of CP029122_2
>2.1|937203|38|CP029122|CRISPRCasFinder GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA |
CRISPR arrays and Neighbor proteins around CP029122_2
The CRISPR arrays of CP029122_2 >merge|CP029122|2|937163-937280|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGGGTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGATGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG >CP029122|2|2|937163-937280|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGG GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA TGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG
>CP029122.1|AWF27156.1|935831_937142_+|serine-dehydratase-alpha-chain-family-protein MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPGTGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFSRAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTQQACVAEGEQESPLTVLSRTTLAEILKFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVIRTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAIYIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR >CP029122.1|AWF25770.1|934472_935804_+|inner-membrane-transport-protein-YhaO MEIASNKGVIADASTPAGRAGMSESEWREAIKFDSTDTGWVIMSIGMAIGAGIVFLPVQVGLMGLWVFLLSSVIGYPAMYLFQRLFINTLAESPECKDYPSVISGYLGKNWGILLGALYFVMLVIWMFVYSTAITNDSASYLHTFGVTEGLLSDSPFYGLVLICILVAISSRGEKLLFKISTGMVLTKLLVVAALGVSMVGMWHLYNVGSLPPLGLLVKNAIITLPFTLTSILFIQTLSPMVISYRSREKSIEVARHKALRAMNIAFGILFVTVFFYAVSFTLAMGHDEAVKAYEQNISALAIAAQFISGDGAAWVKVVSVILNIFAVMTAFFGVYLGFREATQGIVMNILRRKMPAEKINENLVQRGIMIFAILLAWSAIVLNAPVLSFTSICSPIFGMVGCLIPAWLVYKVPALHKYKGMSLYLIIVTGLLLCVSPFLAFS >CP029122.1|AWF26063.1|932899_934198_+|L-serine-ammonia-lyase MNAGKSFIDRLESSGLLTATSHIVVDLYGSLSLTGKGHATDVAIIMGLAGNSPQDVVIDEIPAFIELVTRSGRLPVASGAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEALLSKTYYSVGGGFIVEEEHFGLSHDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALRSKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRAVALRRQLVSSDNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYYDKFRRPVNERSIARYFLAAGAIGALYKMNASISGAEVGCQGEIGVACSMAAAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINAVKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIKVVCG >CP029122.1|AWF25115.1|932372_932762_+|putative-reactive-intermediate-deaminase-TdcF MKKIIETQRAPGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINEVYKQFFDEHQATYPTRSYVQVARLPKDVKLEIEAIAVRSA >CP029122.1|AWF28077.1|930064_932359_+|formate-acetyltransferase MKVDIDTSDKLYADAWLGFKGTDWKNEINVRDFIQHNYTPYEGDESFLAEATPATTELWEKVMEGIRIENATHAPVDFDTNIATTITAHDAGYINQPLEKIVGLQTDAPLKRALHPFGGINMIKSSFHAYGREMDSEFEYLFTDLRKTHNQGVFDVYSPDMLRCRKSGVLTGLPDGYGRGRIIGDYRRVALYGISYLVRERELQFADLQSRLEKGEDLEATIRLREELAEHRHALLQIQEMAAKYGFDISRPAQNAQEAVQWLYFAYLAAVKSQNGGAMSLGRTASFLDIYIERDFKAGVLNEQQAQELIDHFIMKIRMVRFLRTPEFDSLFSGDPIWATEVIGGMGLDGRTLVTKNSFRYLHTLHTMGPAPEPNLTILWSEELPIAFKKYAAQVSIVTSSLQYENDDLMRTDFNSDDYAIACCVSPMVIGKQMQFFGARANLAKTLLYAINGGVDEKLKIQVGPKTAPLMDDVLDYDKVMDSLDHFMDWLAVQYISALNIIHYMHDKYSYEASLMALHDRDVYRTMACGIAGLSVATDSLSAIKYARVKPIRDENGLAVDFEIDGEYPQYGNNDERVDSIACDLVERFMKKIKALPTYRNAVPTQSILTITSNVVYGQKTGNTPDGRRAGTPFAPGANPMHGRDRKGAVASLTSVAKLPFTYAKDGISYTFSIVPAALGKEDPVRKTNLVGLLDGYFHHEADVEGGQHLNVNVMNREMLLDAIEHPEKYPNLTIRVSGYAVRFNALTREQQQDVISRTFTQAL >CP029122.1|AWF24125.1|928822_930031_+|acetate-kinase MNEFPVVLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVNGGEPAPLAHHSYEGALKAIAFELEKRNLNDSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLHNYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTSHRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERAQLAIKTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRSNSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGKVNAPAEFA >CP029122.1|AWF25528.1|927465_928797_+|serine-transporter-family-protein MSTSDSIVSSQTKQSSWRKSDTTWTLGLFGTAIGAGVLFFPIRAGFGGLIPILLMLVLAYPIAFYCHRALARLCLSGSNPSGNITETVEEHFGKTGGVVITFLYFFAICPLLWIYGVTITNTFMTFWENQLGFAPLNRGFVALFLLLLMAFVIWFGKDLMVKVMSYLVWPFIASLVLISLSLIPYWNSAVIDQVDLGSLSLTGHDGILITVWLGISIMVFSFNFSPIVSSFVVSKREEYEKDFGRDFTERKCSQIISRASMLMVAVVMFFAFSCLFTLSPANMAEAKAQNIPVLSYLANHFASMTGTKTTFAITLEYAASIIALVAIFKSFFGHYLGTLEGLNGLILKFGYKGDKTKVSLGKLNTISMIFIMGSTWVVAYANPNILDLIEAMGAPIIASLLCLLPMYAIRKAPSLAKYRGRLDNVFVTVIGLLTILNIVYKLF >CP029122.1|AWF27371.1|926454_927444_+|threonine-ammonia-lyase MHITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTDAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRTTGTLADGCDVSRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTVSIISGGNIDLSRVSQITGFVDA >CP029122.1|AWF24233.1|925417_926356_+|lysR-substrate-binding-domain-protein MSTILLPKTQHLVVFQEVIRSGSIGSAAKELGLTQPAVSKIINDIEDYFGVELVVRKNTGVTLTPAGQLLLSRSESITREMKNMVNEISGMSSEAVVEVSFGFPSLIGFTFMSGMINKFKEVFPKAQVSMYEAQLSSFLPAIRDGRLDFAIGTLSAEMKLQDLHVEPLFESEFVLVASKSRTCTGTTTLESLKNEQWVLPQTNMGYYSELLTTLQRNGISIENIVKTDSVVTIYNLVLNADFLTVIPCDMTSPFGSNQFITIPVEETLPVAQYAAVWSKNYRIKKAASVLVELAKEYSSYNGCRRRQLIEVG >CP029122.1|AWF25751.1|924884_925103_-|putative-dNA-binding-transcriptional-activator-TdcR MSKFSNFIINKPFSAINTAARHIFSRYLLENKHLFYQYFKISNTGIDHLEQLINVNFFSSDRTSFCECNRFP >CP029122.1|AWF24852.1|937353_937521_-|hypothetical-protein MMSKKSAKKRQPVKPVVAKEPARTAKNFGYEEMLSELEAIVADAETRLAEDEATA >CP029122.1|AWF27569.1|937540_938143_-|pirin-like-protein-YhaK MLGYASLRVLNQEVLAPGAAFQPRTYPKVDILNVILDGEAEYRDSEGNHVQASAGEALLLSTQPGVSYSEHNLSKDKPLTRMQLWLDACPQRENPLIQKLALNMGKQQLIASPEGTMGSLQLRQQVWLHHIVLDKGESANFQLHGPRAYLQSIHGKFHALTHHEEKAALTCGDGAFIRDEANITLVADSPLRALLIDLPV >CP029122.1|AWF26491.1|938346_939243_+|lysR-substrate-binding-domain-protein MAKERALTLEALRVMDAIDRRGSFAAAADELGRVPSALSYTMQKLEEELDVVLFDRSGHRTKFTNVGRMLLERGRVLLEAADKLTTDAEALARGWETHLTIVTEALVPTPAFFPLIDKLAAKANTQLAIITEVLAGAWERLEQGRADIVIAPDMHFRSSSEINSRKLYTLMNVYVAAPDHPIHQEPEPLSEVTRVKYRGIAVADTARERPVLTVQLLDKQPRLTVSTIEDKRQALLAGLGVATMPYPMVEKDIAEGRLRVVSPESTSEIDIIMAWRRDSMGEAKSWCLREIPKLFSGK >CP029122.1|AWF24986.1|939293_939650_-|inner-membrane-protein-YhaI MQWYLAVLKNYVGFSGRARRKEYWMFTLINAIVGAIINVIQLILGLEFPFLSLIYLAATIIPVIALCVRRLHDTDRSGAWALLYLVPIIGWLVLFVFACLEGNSGSNRYGNDPKFGSN >CP029122.1|AWF24586.1|939891_940257_-|inner-membrane-protein-YhaH MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAGGEGILTTIYGILVFLPWWAVQFRRLHDTDRSAWWALLFLIPFIGWLIIIVFNCQAGTPGENRFGPDPKLEP >CP029122.1|AWF26813.1|940549_941536_-|glutathionyl-hydroquinone-reductase-YqjG MGQLIDGVWHDTWYDTKSTGGKFQRSASAFRNWLTADGAPGPTGTGGFIAEKDRYHLYVSLACPWAHRTLIMRKLKGLEPFISVSVVNPLMLENGWTFDDSFPGATGDTLYQHEFLYQLYLHADPHYSGRVTVPVLWDKKNHTIVSNESAEIIRMFNTAFDALGAKAGDYYPPALQTKIDELNGWIYDTVNNGVYKAGFATSQQAYDEAVAKVFESLARLEQILGQHRYLTGNQLTEADIRLWTTLVRFDPVYVTHFKCDKHRISNYLNLYGFLRDIYQMPGIAETVNFDHIRNHYFRSHKTINPTGIISIGPWQDLDEPHGRDVRFG >CP029122.1|AWF27378.1|941605_941998_-|inner-membrane-protein-YqjF MKKLEDVGVLVARILMPILFITAGWGKITGYAGTQQYMEAMGVPGFMLPLVILLEFGGGLAILFGFLTRTTALFTAGFTLLTAFLFHSNFAEGVNSLMFMKNLTISGGFLLLAITGPGAYSIDRLLNKKW >CP029122.1|AWF26445.1|942183_942483_-|yqjK-like-family-protein MSSKVERERRKAQLLSQIQQQRLDLSASRREWLEATGAYDRRWNMLLSLRSWALVGSSVMAIWTIRHPNMLVRWARRGFGVWSAWRLVKTTLKQQQLRG >CP029122.1|AWF26296.1|942472_942877_-|inner-membrane-protein-YqjE MADTHHAQGPGKSVLGIGQRIVSIMVEMVETRLRLAVVELEEEKANLFQLLLMLGLTMLFAAFGLMSLMVLIIWAVDPQYRLNAMIATTVVLLLLALIGGIWTLRKSRKSTLLRHTRHELANDRQLLEEESREQ >CP029122.1|AWF24518.1|942879_943185_-|hypothetical-protein MSKEHTTEHLRAELKSLSDTLEEVLSSSGEKSKEELSKIRSKAEQALKQSRYRLGETGDAIAKQTRVAAARADEYVRENPWTGVGIGAAIGVVLGVLLSRR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029122_3 | 1310603-1311119 | Unclear |
I-E
Consensus repeat of CP029122_3
|
8 spacers
spacers of CP029122_3
>3.1|1310632|32|CP029122|PILER-CR,CRISPRCasFinder,CRT TCCACGCTGTAACGGCCATCATTAAGTTTAGT >3.2|1310693|32|CP029122|PILER-CR,CRISPRCasFinder,CRT GCTGATGGTCTGGGAGTGTCCATCGGGCAACT >3.3|1310754|32|CP029122|PILER-CR,CRISPRCasFinder,CRT GAAGTAGGCCTGACAGTGATTGAACGCATACT >3.4|1310815|32|CP029122|PILER-CR,CRISPRCasFinder,CRT AGTTGGGGCGGCGCAATAACGAGACGATACGC >3.5|1310876|32|CP029122|PILER-CR,CRISPRCasFinder,CRT GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT >3.6|1310937|32|CP029122|PILER-CR,CRISPRCasFinder,CRT TCAACGCGCTCAGACGTTGCGTGAGTGAACCA >3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT AAATATCCAGGGCTGGGCTGGAGGCAGACGGC >3.8|1311059|32|CP029122|PILER-CR,CRISPRCasFinder,CRT CCCGGAATGCATTCTGAAGGTTTGCTGTATAT |
CRISPR arrays and Neighbor proteins around CP029122_3
The CRISPR arrays of CP029122_3 >merge|CP029122|3|1310603-1311119|PILER-CR,CRISPRCasFinder,CRT GAGTTCCCCGCGCCAGCGGGGATAAACCGTCCACGCTGTAACGGCCATCATTAAGTTTAGTGAGTTCCCCGCGCCAGCGGGGATAAACCGGCTGATGGTCTGGGAGTGTCCATCGGGCAACTGAGTTCCCCGCGCCAGCGGGGATAAACCGGAAGTAGGCCTGACAGTGATTGAACGCATACTGAGTTCCCCGCGCCAGCGGGGATAAACCGAGTTGGGGCGGCGCAATAACGAGACGATACGCGAGTTCCCCGCGCCAGCGGGGATAAACCGGGGAGTGGCACTTCTGGGGTAGCGGCGGCCCTGAGTTCCCCGCGCCAGCGGGGATAAACCGTCAACGCGCTCAGACGTTGCGTGAGTGAACCAGAGTTCCCCGCGCCAGCGGGGATAAACCGAAATATCCAGGGCTGGGCTGGAGGCAGACGGCGAGTTCCCCGCGCCAGCGGGGATAAACCGCCCGGAATGCATTCTGAAGGTTTGCTGTATATGAGTTCCCCGCGCCAGCGGGGATAAACCA >CP029122|3|1|1310603-1311119|PILER-CR GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >CP029122|3|3|1310603-1311119|CRISPRCasFinder GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >CP029122|3|1|1310603-1311119|CRT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA
>CP029122.1|AWF26340.1|1309591_1310263_+|putative-7-cyano-7-deazaguanosine-(preQ0)-biosynthesis-protein-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVISRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >CP029122.1|AWF25207.1|1309312_1309453_-|hypothetical-protein MSEENKENGFNHVKTFTKIIFIFSVLVFNDNESKITDAAVNLFIQI >CP029122.1|AWF27497.1|1308426_1309299_-|repair-family-protein MRYFILMFTFVCSFVAAQPTIVPQLQQQVTDLTSSLNSQEKKELTHKLESIFNNTQVQIAVLIVPTTKDETIEQYATRVFDNWRLGDAKRNDGILIIVAWSDRTVRIKVGYGLEEKVTDALAGDIIRSNMIPAFKQQKLAQGLELAINALNNQLTSQHQYPTNPSESESASSSDHYYFAIFWVFAVMFFPFWFFHQCSNFCRACKSGVCISAIYLLDLFLFSDKIFSIAVFSFFFTFTIFMVFTCLCVLQKRASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASGRW >CP029122.1|AWF26008.1|1307068_1308367_+|phosphopyruvate-hydratase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >CP029122.1|AWF26348.1|1305343_1306981_+|CTP-synthase MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >CP029122.1|AWF27755.1|1304324_1305116_+|nucleoside-triphosphate-pyrophosphohydrolase MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >CP029122.1|AWF24269.1|1304029_1304254_+|mRNA-interferase-MazF MYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >CP029122.1|AWF25600.1|1303670_1303919_+|antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >CP029122.1|AWF27774.1|1301358_1303593_+|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >CP029122.1|AWF24832.1|1300009_1301311_+|23S-rRNA-(uracil-5-)-methyltransferase-RumA MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHDVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILAPQLEALLPKVRACLGSLQAMRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK >CP029122.1|AWF26142.1|1311756_1313235_-|FGGY-family-of-carbohydrate-kinase,-N-terminal-domain-protein MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDAKAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNYFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDKMVRVKDIFMPVESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNADSIQSWSNA >CP029122.1|AWF26216.1|1313261_1314539_-|inner-membrane-protein-YqcE MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSIIGLLALTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFTFLLFQKIRTADSAPAMASSK >CP029122.1|AWF24640.1|1314857_1315643_+|KR-domain-protein MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVDITAEGAPQKIIAASCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASPASNYVNGHLLVVDGGYLVR >CP029122.1|AWF27450.1|1315712_1317167_+|FAD-binding-domain-protein MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREVMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKATGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >CP029122.1|AWF24110.1|1317281_1318598_+|inner-membrane-metabolite-transport-protein-YgcS MDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >CP029122.1|AWF24489.1|1318575_1319355_+|electron-transfer-flavodomain-protein MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWRDYLRQRMQP >CP029122.1|AWF26443.1|1319351_1320212_+|electron-transfer-flavodomain-protein MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLIIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >CP029122.1|AWF26087.1|1320359_1320902_-|hypothetical-protein MIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >CP029122.1|AWF26021.1|1320951_1321212_-|putative-4Fe-4S-binding-protein MSVARNLWRVADAPHIVPADSVERQTAERLISACPAGLFSLTPEGDLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >CP029122.1|AWF27670.1|1321202_1322474_-|HI0933-like-family-protein MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029122_4 | 1333504-1334082 | Unclear |
I-E
Consensus repeat of CP029122_4
|
9 spacers
spacers of CP029122_4
>4.1|1333534|31|CP029122|CRISPRCasFinder TTGCCCGCGCAATTCCGGGAGCATCCGCAAT >4.2|1333595|31|CP029122|CRISPRCasFinder ACGGACAAAATATATATTGATTTGCGAATTA >4.3|1333656|31|CP029122|CRISPRCasFinder GTAAAGAAACTGCCGACAAATCCCTGTTCGT >4.4|1333717|31|CP029122|CRISPRCasFinder CCCGTCACCGACGCGCAGTGGCGCTACCGTG >4.5|1333778|31|CP029122|CRISPRCasFinder GGATCTAACGCGCTGTAAAAATTCCGTGCTT >4.6|1333839|31|CP029122|CRISPRCasFinder TGCGGATTACCGGCAAAACATGGGAGCAAAC >4.7|1333900|31|CP029122|CRISPRCasFinder CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT >4.8|1333961|31|CP029122|CRISPRCasFinder GTTTACCGCCCCGCAGAGGCGCTGGCAGATC >4.9|1334022|31|CP029122|CRISPRCasFinder GGATGACCTGTCGCTAAAACTCGCCGCGTAC >4.10|1333533|33|CP029122|PILER-CR GTTGCCCGCGCAATTCCGGGAGCATCCGCAATT >4.11|1333594|33|CP029122|PILER-CR GACGGACAAAATATATATTGATTTGCGAATTAT >4.12|1333655|33|CP029122|PILER-CR GGTAAAGAAACTGCCGACAAATCCCTGTTCGTT >4.13|1333716|33|CP029122|PILER-CR GCCCGTCACCGACGCGCAGTGGCGCTACCGTGA >4.14|1333777|33|CP029122|PILER-CR GGGATCTAACGCGCTGTAAAAATTCCGTGCTTT >4.15|1333838|33|CP029122|PILER-CR ATGCGGATTACCGGCAAAACATGGGAGCAAACC >4.16|1333899|33|CP029122|PILER-CR GCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA >4.17|1333960|33|CP029122|PILER-CR GGTTTACCGCCCCGCAGAGGCGCTGGCAGATCC >4.18|1334021|33|CP029122|PILER-CR GGGATGACCTGTCGCTAAAACTCGCCGCGTACA >4.19|1333534|32|CP029122|CRT TTGCCCGCGCAATTCCGGGAGCATCCGCAATT >4.20|1333595|32|CP029122|CRT ACGGACAAAATATATATTGATTTGCGAATTAT >4.21|1333656|32|CP029122|CRT GTAAAGAAACTGCCGACAAATCCCTGTTCGTT >4.22|1333717|32|CP029122|CRT CCCGTCACCGACGCGCAGTGGCGCTACCGTGA >4.23|1333778|32|CP029122|CRT GGATCTAACGCGCTGTAAAAATTCCGTGCTTT >4.24|1333839|32|CP029122|CRT TGCGGATTACCGGCAAAACATGGGAGCAAACC >4.25|1333900|32|CP029122|CRT CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA >4.26|1333961|32|CP029122|CRT GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC >4.27|1334022|32|CP029122|CRT GGATGACCTGTCGCTAAAACTCGCCGCGTACA |
cas2,cas1,cas6e,cas5,cas7,cas3 |
CRISPR arrays and Neighbor proteins around CP029122_4
The CRISPR arrays of CP029122_4 >merge|CP029122|4|1333504-1334082|CRISPRCasFinder,PILER-CR,CRT TGTGTTCCCCGCGCCAGCGGGGATAAACCGTTGCCCGCGCAATTCCGGGAGCATCCGCAATTGTGTTCCCCGCGCCAGCGGGGATAAACCGACGGACAAAATATATATTGATTTGCGAATTATGTGTTCCCCGCGCCAGCGGGGATAAACCGGTAAAGAAACTGCCGACAAATCCCTGTTCGTTGTGTTCCCCGCGCCAGCGGGGATAAACCGCCCGTCACCGACGCGCAGTGGCGCTACCGTGAGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATCTAACGCGCTGTAAAAATTCCGTGCTTTGTGTTCCCCGCGCCAGCGGGGATAAACCATGCGGATTACCGGCAAAACATGGGAGCAAACCGTGTTCCCCGCGCCAGCGGGGATAAACCGCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTAGTGTTCCCCGCGCCAGCGGGGATAAACCGGTTTACCGCCCCGCAGAGGCGCTGGCAGATCCGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATGACCTGTCGCTAAAACTCGCCGCGTACAGTGTTCCCCGCGCCAGCGGGGATAAACCG >CP029122|4|4|1333504-1334082|CRISPRCasFinder TGTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAAT TGTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTA TGTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGT TGTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTG AGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTT TGTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAAC CGTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT AGTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATC CGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTAC AGTGTTCCCCGCGCCAGCGGGGATAAACCG >CP029122|4|2|1333505-1334081|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACC GTTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACC GACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACC GGTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACC GCCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACC GGGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACC ATGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACC GCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACC GGTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACC GGGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACC >CP029122|4|2|1333505-1334082|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG
>CP029122.1|AWF27411.1|1333120_1333408_+|CRISPR-associated-endoribonuclease-Cas2,-subtype-I-E MVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQITQLAGCGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ >CP029122.1|AWF26969.1|1332194_1333118_+|CRISPR-associated-endonuclease-Cas1 MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLAATVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALTEDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRQTYALLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGYAPAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLACRDIFRSTKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPETLGDSGHRGRGG >CP029122.1|AWF26270.1|1331547_1332198_+|CRISPR-associated-protein-Cas6/Cse3/CasE,-subtype-I-E MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQERPAESDTFTIECRSFAPELRTGQQLCFNLRANPTICKSGKRHDLLMEAKRQVRGQAEGSDVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFSSVDYTGMLTVTDPGLFLQRLSQGYGKSRAFGCGLMLIKPGAEA >CP029122.1|AWF27567.1|1330819_1331566_+|CRISPR-associated-protein-Cas5/CasD,-subtype-I-E MSQYLIFQLHGPMASWGVDAPGEVRHTHELPSRSALLGLLAAGVGIRRDDTERLNAFNRHYSLVVCASRNPRWARDYHTIQMPKEVRKARYFSRREELSDPDLLSAIISRRDYYTDAWWMVAVATTADAPYSLEQLQDGLRHPVFPLYLGRKSHPLALPLAPLLLEGNACDALCNAYQQYQDHFHKLKVSLPKLQDECWWEGEHDGLVASKILRRRDVPLNRQQWLFGERTINQGPWLSKEEPCTSQE >CP029122.1|AWF25183.1|1329924_1330809_+|CRISPR-associated-protein-Cas7/Cse4/CasC,-subtype-I-E MAGHIGIRSGRIAREAATILIEKGIEEKKAIEWAAKIADYLGKAKNDKKPKDPLTNAETEQLVHISPAEFDAVKALAHQLAEEKRAPKEEDLALLRKDRMAVDIAMFGRMLANKPEFNVEAACQVAHAFGVSETIVEDDFFTAVDDLRQASEDAGAGHLGETGFGSALFYTYICIDKDLLVENLGGDEALANQTIRAFTEAALKVSPTGKQNSFASRAYASWAMAEKGTEQPRSLAAAFYEPINGTRQLEVAVQRITTLRENMNTVYEQKTECASFDVMNKQGSMKDVLDFICA >CP029122.1|AWF24139.1|1328162_1329950_+|CRISPR-associated-endonuclease-Cas3-HD MRKYPLSLLKDKNIVTFFDFWGKTRRGEKEGGDDYHLLCWHSLDVAAMGYLMVKRNCFGLADYFRQLGISDKEQAAQFFAWLLCWHDIGKFARSFQQLYLPPELKIQEGARKNYEKISHSTLGYWLWNHYLSECQELLPSSSLSPRKLRRVIEMWMPVTTGHHGRPPDRMDELDNFLPEDKAAARDFLLAIKALFPRIEIPAFWDDDEGIELIKHLSWYISATVVLADWTGSSTRFFPRVAQAMDIKHYWQKALVQAQNALTVFPPKAETAPFTGINTLFPFIENPTPLQQKVLDLDISQPGPQLFILEDVTGAGKTEAALILAHRLIAAGKAQGLFFGLPTMATANAMYDRLVKTWLAFYSPESRPSLVLAHSARTLMDRFNESLWSGDLVGSEEPDEQTFSQGCAAWFADSNKKALLAEIGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAYMSCILEGLIERQARGGNSVILLSATLSQQQRDKLVAAFARGAEGQQEAPLLGKDDYPWLTHVTKTDVHSHRVATRKEVERSVSVGWLHSEQECIARIESAVSQGKCIAWIRNSVDDAIKVYRQLLGGPYWYS >CP029122.1|AWF28474.1|1327816_1327969_+|hok/gef-family-protein MLTKYALVAIIVLCCTVLGFTLMVGDSLCELSIRERGMEFKAVLAYESKK >CP029122.1|AWF24728.1|1326817_1327552_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIHPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMLEEETRFFGLKRECGLHEG >CP029122.1|AWF24209.1|1325031_1326744_+|sulfite-reductase-(NADPH)-hemoprotein,-beta-component MSEKHPGPLVVEGKLTDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD >CP029122.1|AWF28283.1|1323232_1325032_+|sulfite-reductase-[NADPH]-flavoprotein,-alpha-component MTTQVPPSALLPLNPEQLVRLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAEMPGITIISASQTGNARRVAEALRDDLLAAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGAVNEIHTSPYSKDAPLVASLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFELTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQAEVENEVHVTVGVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAPGKNWLFFGNPHFTEDFLYQVEWQRYVKDGVLTRIDLAWSRDQKEKVYVQDKLREQGAELWRWINDGAHIYVCGDANRMAKDVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY >CP029122.1|AWF26634.1|1334163_1335201_-|M42-glutamyl-aminopeptidase-family-protein MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEIFDKAGIAVLSVEATNWNLGNKDGYQQRAKTAAFPAGNSWHDVRLDNQQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >CP029122.1|AWF26941.1|1335452_1336361_+|sulfate-adenylyltransferase-subunit-2 MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >CP029122.1|AWF26563.1|1336362_1337790_+|sulfate-adenylyltransferase,-large-subunit MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEKTFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMAWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >CP029122.1|AWF28399.1|1337789_1338395_+|adenylylsulfate-kinase MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >CP029122.1|AWF27592.1|1338444_1338768_+|inner-membrane-protein-YgbE MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >CP029122.1|AWF24547.1|1338854_1338974_+|hypothetical-protein MDKCGTFARVVAVSPTEDVESSSDAWDDDAVFQGAGWVN >CP029122.1|AWF26583.1|1338961_1339273_+|cell-division-protein-FtsB MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR >CP029122.1|AWF27312.1|1339291_1340002_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >CP029122.1|AWF28219.1|1340001_1340481_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYERGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >CP029122.1|AWF25134.1|1340477_1341527_+|tRNA-pseudouridine-synthase-D MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029122_5 | 1836729-1836846 | Orphan |
NA
Consensus repeat of CP029122_5
|
1 spacers
spacers of CP029122_5
>5.1|1836760|56|CP029122|CRISPRCasFinder TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA |
CRISPR arrays and Neighbor proteins around CP029122_5
The CRISPR arrays of CP029122_5 >merge|CP029122|5|1836729-1836846|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGCTGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAACCGAGCCGTAGGCCGGATAAGGCGTTTACGC >CP029122|5|5|1836729-1836846|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGC TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA CCGAGCCGTAGGCCGGATAAGGCGTTTACGC
>CP029122.1|AWF26966.1|1835501_1836632_-|ribonucleoside-diphosphate-reductase-1-subunit-beta MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDSEVDTDDLSNFQL >CP029122.1|AWF28396.1|1835247_1835502_-|2Fe-2S-iron-sulfur-cluster-binding-domain-protein MARVTLRITGTQLLCQDEHPSLLAALESHNVAVEYQCREGYCGSCRTRLVAGQVDWIAEPLAFIQPGEILPCCCRAKGDIEIEM >CP029122.1|AWF27321.1|1834543_1835194_+|lipopolysaccharide-kinase-family-protein MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIYVKTEGKAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM >CP029122.1|AWF25543.1|1834122_1834329_-|putative-yfaH MNFIRQGLGIALQPELTLKSIAGELCSVPLEPTFYRQISLLAKEKPVEGSPLFLLQTCTEQLVVSGKI >CP029122.1|AWF24787.1|1833835_1833967_-|hypothetical-protein MTPGRNRYEAPDSLSKPIVESNDTPANSTQFTLEPFSIFKWDT >CP029122.1|AWF25497.1|1832367_1832871_+|putative-transposase MPGNCPHYGRWPQHDFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDSLRKTVVAHVFGERTMATLGRLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ >CP029122.1|AWF27846.1|1831809_1831950_-|hypothetical-protein MIQHSNGQLVITGKYCPEGVQTAFTQEQYDKLIRYINIFFTFPKCE >CP029122.1|AWF25556.1|1830514_1831591_+|glycerophosphoryl-diester-phosphodiesterase MKLKLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDHLVVLHDHYLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMDLNLVQLIAYTDWNETQQKQPDGSWVNYSYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTTDVNQLYDVLYNKAGVNGLFTDFPDKAVKFLNKE >CP029122.1|AWF26892.1|1829151_1830510_+|glycerol-3-phosphate-transporter MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSKFIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELTAKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFLYEYAGIPGTLLCGWMSDKVFRGNRGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAASAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQKRNGG >CP029122.1|AWF27887.1|1827250_1828879_-|glycerol-3-phosphate-dehydrogenase,-anaerobic,-A-subunit MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQDPAEVTLRKVISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL >CP029122.1|AWF28253.1|1836865_1839151_-|ribonucleoside-diphosphate-reductase,-alpha-subunit MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEELAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI >CP029122.1|AWF24495.1|1839894_1843599_+|outer-membrane-autotransporter-barrel-domain-protein MIASLFSANGVAAVTDSCQGYDVKASCQASRQSLSGITQDWSIADGQWLVFSDMTNNASGGAVFLQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNAMFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYGGAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVDSIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRSNSLMNVGDTHCQDDPQDCYGLTIGSIDQYQNQAELNVGSTQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQSMALTGDIVVDDGAVLSLEGDAADLTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQVSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAGVDTQWGALMADSSGQHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDIQSIDAISSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSGAVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKMVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAKVGGYLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSGDLFSGRWGTDGEWMLGIVGGYSDNQGDSRSNMTGTRADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHSSGIIASLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW >CP029122.1|AWF25697.1|1843726_1844449_-|3-demethylubiquinone-9-3-O-methyltransferase MNAEKSPENHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGATVTGLDMGFEPLQVAKLHALESGIQVDYVQETVEKHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTLNRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNSFKLGPGVDVNYMLHTQNK >CP029122.1|AWF27241.1|1844595_1847223_+|DNA-gyrase,-A-subunit MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDLAVYNTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE >CP029122.1|AWF24465.1|1847371_1849060_+|hypothetical-protein MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNNLQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLGIEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETVPVYQLRYNGNNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIAGDLLSGKKRWQASFGLEERTAEKTPVRQRIVVSARWLGFGYQRLMPSFAGVRFEMGNDGWHSFVALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYINPQGIAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGAAWQWLPITWQPL >CP029122.1|AWF24591.1|1849056_1849680_+|hypothetical-protein MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFLAR >CP029122.1|AWF26719.1|1849703_1854218_+|PKD-domain-protein MGTGLANADDSLPSSNYAPPAGGTFFLLADSSFSSSEEAKVRLEAPGRDYRRYQMEEYGGVDVRLYRIPDPMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSSQSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVEQFRYPLWQAKPFEPQQGVKLEGASSNFISPQPGNIYIPLGQQEPGLYLVEAMVGGYRATTVVFVSDTVALSKVSGNELLVWTAGKKQGEAKPGSEILWTDGLGVMTRGVTDDSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTDRPLYRAGDRVDVKVMGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVDVTLDARNGGQGSFRLPENAVAGGYELRLAYRNQVYSSSFRVANYIKPHFEIGLALAKKEFKTGEAVSGKLQLLYPDGEPVKNARVQLSLRAQQLSMVGNDLRYAGRFPVSLEGSETVSDASGHVALNLPAADKPSRYLLTVSASDGAAYRVTTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWLRLEDRTSHSGELPSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSGKGSTAHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQSLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQNAGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVDEMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGATNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWRITARGMNGDGLVGQGRAYLRSEKNLYMKWSMPTVYRVGDKPAAGLFIFSQQDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNGQVQDSISTKLSFVDNSWPVEQQKNVMLGGGDNALMLPEQASNIRLQSSETPQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQMIQVNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQAIGVTQQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAIARRGTKTEDFSEEDTRDINDSLILDTPESPLADAVANVLTMTLLKKAQLKSTVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQAAAILSGLTAEQSTIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEYWRWVGQGVPDILSFGDELSPQNVQVRWREPAKTAQQSNIPVTVERQLYRLITGEEEMSFTLQPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTWGISVNKPNAAKQQGQLLEIARNEMGELAYMVPVKELTGTVTFRHLLRFSQKGQFVLPPARYMRSYAPAQQSVAAGSEWTRMQVK >CP029122.1|AWF27753.1|1854218_1855868_+|hypothetical-protein MNWRRIVWLLALVTLPTLAEEPPLQLALRGAQHDQLYKLSSSGVTNVSTLPDTLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESITRDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSVTVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWFADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQVASGQCVEVELFARYPLKKITAEKSTTAVKPGVLNGRYRVTFTNGNHITFVSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMAAWTQDLIYAGDPVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRSTCQLLPKAKAWLAKKMPQWRRILQAETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >CP029122.1|AWF25975.1|1855872_1856649_+|hypothetical-protein MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPVEGEDASFSQSINYPASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDGSFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWDTDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPVHGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGELTLVKSFDW >CP029122.1|AWF24263.1|1856722_1857907_-|acetyl-CoA-C-acetyltransferase-family-protein MKNCVIVSAVRTAIGSFNGSLASTSAIDLGATVIKAAIERAKIDSQHVDEVIMGNVLQAGLGQNPARQALLKSGLAETVCGFTVNKVCGSGLKSVALAAQAIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLMCATHGYHMGITAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEIVPVNVVTRKKTFVFSQDEFPKANSTAEALGALRPAFDKAGTVTAGNASGINDGAAALVIMEESAALAAGLTPLARIKSYASGGVPPALMGMGPVPATQKALQLAGLQLADIDLIEANEAFAAQFLAVGKTLGFDPEKVNVNGGAIALGHPIGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029122_6 | 2457166-2457289 | Orphan |
NA
Consensus repeat of CP029122_6
|
1 spacers
spacers of CP029122_6
>6.1|2457209|38|CP029122|CRISPRCasFinder CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around CP029122_6
The CRISPR arrays of CP029122_6 >merge|CP029122|6|2457166-2457289|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA >CP029122|6|6|2457166-2457289|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>CP029122.1|AWF24523.1|2456725_2457031_-|putative-monooxygenase-YdhR MATLLQLHFAFNGPFGDAMAEQLKPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKSALAYLEKHTARLKNLGVEEVVAKVFDVNEPLSQINQAKLA >CP029122.1|AWF24746.1|2454995_2456600_-|FAD-NAD(P)-binding-family-protein MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENSKMMLANIASIEIPPINCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRILLGEYFRDQFLRLVDQARQQKFAVAVYESCQVTDLQITNAGVMLATNQDLPSETFDLVVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTSLSGLDAAMAVAIQHGSFIEDDKQHVVFNRDNASEKLNITLMSRTGILPEADFYCPIPYEPLHIVTDQALNAEIQKGEEGLLDRVFRLIVEEIKFADPDWSQRIALESLNVDSFAQAWFAERKQRDPFDWAEKNLQEVERNKREKHTVPWRYVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLALREAGIIHILALGEDYEMEINESRTVLKTEDNSYSFDVFIDARGQRPLKVKDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEDIRGRVAFGALPWLMHDQPFVQGLTACAEIGEAMARAVVKPASRARRRLSFD >CP029122.1|AWF28361.1|2454219_2454984_+|hypothetical-protein MYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPGEIPFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLAQIASFGANARIANSGDNVHIIASGENSTVVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGENNIRAGVRYRLNEQHQFVEC >CP029122.1|AWF26018.1|2453382_2454168_+|prokaryotic-cytochrome-b561-family-protein MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALLRARGVKKSVTDYGEKIYLYCKAVRLWHWSNALLFVLLLASGLINHFALVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGGNGHHYRIRRQGWLERAAKQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHETFKSMVDGYHRH >CP029122.1|AWF25859.1|2452831_2453386_+|4Fe-4S-dicluster-domain-protein MIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDNDNETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLAKGFPPICVSACPEHALIFGREDSPEIQAWLQQNKYYQYQLPGAGKPHLYRRFGQHLIKKENV >CP029122.1|AWF24611.1|2452015_2452654_+|putative-oxidoreductase MNHRDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYSVAEEELSVLLESIKQNGDYADIACMTGSQDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQIEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP >CP029122.1|AWF28041.1|2449900_2452003_+|aldehyde-ferredoxin-oxidoreductase,-N-terminal-domain-protein MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITSLSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLKIKDDKVSLEKADFLWGKGTRATTEEICRLTSPETCVAAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTPQSWAEYSDPKSRWTARKELFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNIPRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVLPAEEYAEIHWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQVGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNWVWPMTVSPLKSRNYRGDLALEAKFFKAITGEEMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQIPVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRETLQRLGLEDIAADLAAHNLLPV >CP029122.1|AWF25327.1|2449430_2449880_+|hypothetical-protein MLVTQRARCTGCHRCEISCTNFNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCSACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV >CP029122.1|AWF24807.1|2449088_2449226_-|hypothetical-protein MLTIQQPDIKHSTNTLLTLSPTPDIFPNPIVYEKRFNADKLILYG >CP029122.1|AWF26275.1|2448651_2448798_+|hypothetical-protein MLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK >CP029122.1|AWF25756.1|2457603_2458860_+|pertactin-family-protein MGSDAKNLMSDGNVQIVKTGEVIGATQLTEGELIVEAGGRAENTVVTGAGWLKVATGGIAKCTQYGNNGTLSVSDGAIATDIVQSEGGAISLSTLATVNGRHPEGEFSVDQGYACGLLLENGGNLRVLEGHRAEKIILDQEGGLLVNGTTSAVVVDEGGELLVYPGGEASNCEINQGGVFMLAGKASDTLLAGGTMNNLGGEDSDTIVENGSIYRLGTDGLQLYSSGKTQNLSVNVGGRAEVHAGTLENAVIQGGTVILLSPTSADENFVVEEDRAPVELTGSVALLDGASMIIGYGADLQQSTITVQQGGVLILDGSTVKGDGVTFIVGNINLNGGKLWLITGAATHVQLKVKRLRGEGAICLQTSAKEISPDFINVKGEVTGDIHVEITDASRQTLCNALKLQPDEDGIGATLQPA >CP029122.1|AWF28479.1|2458900_2460274_-|multidrug-resistance-protein-MdtK MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPGMVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLMVGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGLPSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRLPSVIILQRASR >CP029122.1|AWF28271.1|2460488_2461130_+|riboflavin-synthase,-alpha-subunit MFTGIVQGTVKLVSIDEKPNFRTHVVELPDHMLDGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA >CP029122.1|AWF24185.1|2461169_2462318_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIASARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMICEKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEHVGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSEPHFVMEDWHNFGADYDTTLMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR >CP029122.1|AWF24414.1|2462608_2463820_-|inner-membrane-transport-protein-YdhC MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTIFALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFATIMPLVGLSPALAPLLGSWLLVHFSWQAIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVVAQALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVGCQNHGNAEVAHSESH >CP029122.1|AWF27455.1|2463932_2464865_+|lysR-substrate-binding-domain-protein MWSEYSLEVVDAVARNGSFSAAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTAAGAWFLKEGRSVVKKMQITRQQCQQIANGWRGQLAIAVDNIVRPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATRAIPVGGRYAFRDMGMLSWSCVVASHHPLALMDGPFSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRVVVPDWESSATCISAGLCIGMVPTHFAKPWLNEGKWVALELENPFPDSACCLTWQQNDMSPALTWLLEYLGDSETLNKEWLREPEETPATGD >CP029122.1|AWF25836.1|2464861_2465887_-|periplasmic-binding-and-sugar-binding-domain-of-LacI-family-protein MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR >CP029122.1|AWF25755.1|2466155_2466275_+|hypothetical-protein MCGNKKGENLMSTDLKFSLVTTIIVLGLIVAVGLTAALH >CP029122.1|AWF26799.1|2466440_2467610_+|inner-membrane-transport-protein-YdhP MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIFTLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRMSFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVTAMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPETVCVANS >CP029122.1|AWF27650.1|2467755_2468337_-|superoxide-dismutase MSFELPALPYAKDALAPHISAETIEYHYGKHHQTYVTNLNNLIKGTAFEGKSLEEIIRSSEGGVFNNAAQVWNHTFYWNCLAPNAGGEPTGKVAEAIAASFGSFADFKAQFTDAAIKNFGSGWTWLVKNSDGKLAIVSTSNAGTPLTTDATPLLTVDVWEHAYYIDYRNARPGYLEHFWALVNWEFVAKNLAA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029122_8 | 3414320-3414464 | Orphan |
NA
Consensus repeat of CP029122_8
|
1 spacers
spacers of CP029122_8
>8.1|3414372|41|CP029122|CRISPRCasFinder TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC |
CRISPR arrays and Neighbor proteins around CP029122_8
The CRISPR arrays of CP029122_8 >merge|CP029122|8|3414320-3414464|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGCTGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTCGTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC >CP029122|8|8|3414320-3414464|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC
>CP029122.1|AWF26692.1|3412970_3414254_+|putative-acyl-CoA-thioester-hydrolase-YbhC MNTFSVSRLALALAFGVTLTACSSTPPDQRPSDQTAPGTSSRPILSAKEAQNFDAQHYFASLTPGAAAWNPSPITLPAQPDFVVGPAGTQGVTHTTIQAAVDAAIIKRTNKRQYIAVMPGEYQGTVYVPAAPGGITLYGTGEKPIDVKIGLSLDGGMSPADWRHDVNPRGKYMPGKPAWYMYDSCQSKRSDSIGVLCSAVFWSQNNGLQLQNLTIENTLGDSVDAGNHPAVALRTDGDQVQINNVNILGRQNTFFVTNSGVQNRLETNRQPRTLVTNSYIEGDVDIVSGRGAVVFDNTEFRVVNSRTQQEAYVFAPATLSNIYYGFLAVNSRFNAFGDGVAQLGRSLDVDANTNGQVVIRDSAINEGFNTAKPWADAVISNRPFAGNTGSVDDNDEIQRNLNDTNYNRMWEYNNRGVGSKVVAEAKK >CP029122.1|AWF27425.1|3411765_3412836_+|integrase MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP029122.1|AWF27947.1|3411362_3411530_+|hypothetical-protein MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >CP029122.1|AWF26951.1|3410517_3410667_-|putative-transmembrane-protein MDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >CP029122.1|AWF24944.1|3409771_3409987_+|putative-ybl17 MTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >CP029122.1|AWF25179.1|3409581_3409695_+|hypothetical-protein MPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >CP029122.1|AWF24920.1|3408671_3409352_+|exonuclease MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSSVNITESPIIYRDENMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP029122.1|AWF24834.1|3407889_3408675_+|phage-recombination-protein-Bet MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKATEQKVAA >CP029122.1|AWF27140.1|3407587_3407884_+|host-nuclease-inhibitor-protein-gam MNAYYIQDRLEAQSWARHYQQIAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >CP029122.1|AWF25981.1|3407305_3407512_+|putative-phage-encoded-cell-division-inhibitor-protein MVHQHYGTQTVNRGAVMPGMLVKHKDGTWTASANLRGRLYLHRGIERTYTRDLLVEVFLDGRGNGLNH >CP029122.1|AWF27996.1|3414487_3416749_-|aconitase-family-protein MIKLSEKGVFLASNNEIIAEEHFTGEIKKEEAQKGTIAWSILSSHNTSGNMDKLKIKFDSLASHDITFVGIVQTAKASGMERFPLPYVLTNCHNSLCAVGGTINGDDHVFGLSAAQRYGGIFVPPHIAVIHQYMREMMAGGGKMILGSDSHTRYGALGTMAVGEGGGELVKQLLNDTWDIDYPGVVAVHLTGKPAPYVGPQDVALAIIGAVFKNGYVKNKVMEFVGPGVSALSTDFRNSVDVMTTETTCLSSVWQTDEEVHNWLALHGRGQDYCQLNPQPMAYYDGCISVDLSAIKPMIALPFHPSNVYKIDTLNQNLTDILREIEIESERVAHGKAKLSLLDKVENGRLKVQQGIIAGCSGGNYENVIAAANALRGQSCGNDTFSLAVYPSSQPVFMDLAQKGVVADLIGAGAIIRTAFCGPCFGAGDTPINNGLSIRHTTRNFPNREGSKPANGQMSAVALMDARSIAATAANGGYLTSASELDCWDNVPEYAFDVTPYKNRVYQGFVKGATQQPLIYGPNIKDWPELGALTDNIVLKVCSKILDEVTTTDELIPSGETSSYRSNPIGLAEFTLSRRDPGYVGRSKATAELENQRLAGNVSELTEVFARIKQIAGQEHIDPLQTEIGSMVYAVKPGDGSAREQAASCQRVIGGLANIAEEYATKRYRSNVINWGMLPLQMAEVPTFEVGDYIYIPGIKAALDNPGTTFKGYVIHEDAPVTEITLYMGSLTAEEREIIKAGSLINFNKNRQM >CP029122.1|AWF25495.1|3416931_3418365_-|inner-membrane-protein-YbhI MNKKSLWKLILILAIPCIIGFMPAPAGLSELAWVLFGIYLAAIVGLVIKPFPEPVVLLIAVAASMVVVGNLSDGAFKTTAVLSGYSSGTTWLVFSAFTLSAAFVTTGLGKRIAYLLIGKIGNTTLGLGYVTVFLDLVLAPATPSNTARAGGIVLPIINSVAVALGSEPEKSPRRVGHYLMMSIYMVTKTTSYMFFTAMAGNILALKMINDILHLQISWGGWALAAGLPGIIMLLVTPLVIYTMYPPEIKKVDNKTIAKAGLAELGPMKIREKMLLGVFVLALLGWIFSKSLGVDESTVAIVVMATMLLLGIVTWEDVVKNKGGWNTLIWYGGIIGLSSLLSKVKFFEWLAEVFKNNLAFDGHGNVAFFVIIFLSIIVRYFFASGSAYIVAMLPVFAMLANVSGAPLMLTALALLFSNSYGGMVTHYGGAAGPVIFGVGYNDIKSWWLVGAVLTILTFLVHITLGVWWWNMLIGWNML >CP029122.1|AWF26703.1|3418440_3419493_-|proline-racemase-family-protein MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGIGGGNPLTSKVAIISRSSDLRADVDYLFAQVIVHEQRVDTTPNCGNMLSGVGAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARIDGVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVIIPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGLGDVSNMVIPKPVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIVPSVGYGNINIEHPSGALDVHLSNEGQDATTLRASVIRTTRKIFSGEVYLP >CP029122.1|AWF28142.1|3419578_3419707_-|hypothetical-protein MPSYLIIHASLLLRKMEATILGGWSLKILTAFSQLKLMKKCY >CP029122.1|AWF24385.1|3419676_3420630_+|lysR-substrate-binding-domain-protein MKHELSSMKAFVILAESSSFNNAAKLLNITQPALTRRIKKMEEDLHIQLFERTTRKVTLTKAGKRLLPEARELIKKFDETLFNIRDMNAYHRGMVTLACIPTAVFYFLPLAIGKFNELYPNIKVRILEQGTNNCMESVLCNESDFGINMNNVTNSSIDFTPLVNEPFVLACRRDHPLAKKQLVEWQELVGYKMIGVRSSSGNRLLIEQQLADKPWKLDWFYEVRHLSTSLGLVEAGLGISALPGLAMPHAPYSSIIGIPLVEPVIRRTLGIIRRKDAVLSPAAERFFALLINLWTDDKDNLWTNIVERQRHALQEIG >CP029122.1|AWF27537.1|3420670_3421666_-|arylesterase-family-protein MKQTVYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSPDKRYLYVGVRPEFRVLAYSIAPDDGALTFAAESALPGSPTHISTDHQGQFVFVGSYNAGNVSVTRLEDGLPVGVVDVVEGLDGCHSANISPDNRTLWVPALKQDRICLFTVSDDGHLVAQDPAEVTTVEGAGPRHMVFHPNEQYAYCVNELNSSVDVWELKDPHGNIECVQTLDMMPENFSDTRWAADIHITPDGRHLYACDRTASLITVFSVSEDGSVLSKEGFQPTETQPRGFNVDHSGKYLIAAGQKSHHISVYEIVGEQGLLHEKGRYAVGQGPMWVVVNAH >CP029122.1|AWF27735.1|3421820_3422639_+|pyridoxal-phosphate-phosphatase-YbhA MTTRVIALDLDGTLLTPKKTLLPSSIEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYHAKTVLEADPMPVNKALQLIEMLNEHHIHGLMYVDDAMVYEHPTGHVIRTSNWAQTLPPEQRPTFTQVASLAETAQQVNAVWKFALTHDDLPQLQHFGKHVEHELGLECEWSWHDQVDIARGGNSKGKRLTKWVEAQGWSMENVVAFGDNFNDISMLEAAGTGVAMGNADDAVKARANIVIGDNTTDSIAQFIYSHLI >CP029122.1|AWF25961.1|3422639_3423698_-|molybdate-ABC-transporter,-ATP-binding-protein MLELNFSQTLGNHCLTINETLPANGITAIFGVSGAGKTSLINAISGLTRPQKGRIVLNGRVLNDAEKGICLTPEKRRVGYVFQDARLFPHYKVRGNLRYGMSKSMVDQFDKLVALLGIEPLLDRLPGSLSGGEKQRVAIGRALLTAPELLLLDEPLASLDIPRKRELLPYLQRLTREINIPMLYVSHSLDEILHLADRVMVLENGQVKAFGALEEVWGSSVMNPWLPKEQQSSILKVTVLEHHPHYAMTALALGDQHLWVNKLDEPLQAALRIRIQASDVSLVLQPPQQTSIRNVLRAKVVNSYDDNGQVEVELEVGGKTLWARISPWARDELAIKPGLWLYAQIKSVSITA >CP029122.1|AWF25716.1|3423700_3424390_-|molybdate-ABC-transporter,-permease-protein MILTDPEWQAVLLSLKVSSLAVLFSLPFGIFFAWLLVRCTFPGKALLDSVLHLPLVLPPVVVGYLLLVSMGRRGFIGERLYDWFGITFAFSWRGAVLAAAVMSFPLMVRAIRLALEGVDVKLEQAARTLGAGRWRVFFTITLPLTLPGIIVGTVLAFARSLGEFGATITFVSNIPGETRTIPSAMYTLIQTPGGESGAARLCIISIALAMISLLISEWLARISRERAGR >CP029122.1|AWF24923.1|3424389_3425163_-|molybdate-ABC-transporter,-periplasmic-molybdate-binding-protein MARKWLNLFAGAALSFAVAGNALADEGKITVFAAASLTNAMQDIATQYKKEKGVDVVSSFASSSTLARQIEAGAPADLFISADQKWMDYAVDKKAIDTATRQTLLGNSLVVVAPKASEQKDFTIDSKTNWTSLLNGGRLAVGDPEHVPAGIYAKEALQKLGAWDTLSPKLAPAEDVRGALALVERNEAPLGIVYGSDAVASKGVKVVAIFPEDSHKKVEYPVAVVEGHNNATVKAFYDYLKGPQAAEIFKRYGFTTK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029122_9 | 3910465-3910618 | Orphan |
NA
Consensus repeat of CP029122_9
|
1 spacers
spacers of CP029122_9
>9.1|3910518|48|CP029122|CRISPRCasFinder TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA |
CRISPR arrays and Neighbor proteins around CP029122_9
The CRISPR arrays of CP029122_9 >merge|CP029122|9|3910465-3910618|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCGTCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAACGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG >CP029122|9|9|3910465-3910618|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCG TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA CGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG
>CP029122.1|AWF24681.1|3908590_3910330_+|FHIPEP-family-protein MLSRSDLLTLLTINFIVVTKGAERISEVSARFTLDAMPGKQMAIDADLNAGLINQAQAQTRRKDVASEADFYGAMDGASKFVRGDAIAGMMILAINLIGGVCIGIFKYNLSADAAFQQYVLMTIGDGLVAQIPSLLLSTAAAIIVTRISDNGDITHDVRHQLLASPSVLYTATGIMFVLAVVPGMPHLPFLLFSALLGFTGWRMSKRPQAAEAEEKSLETLTRTITETSEQQVSWETIPLIEPISLSLGYKLVALVDKAQGNPLTQRIRGVRQVISDGNGVLLPEIRIRENFRLKPSQYAIFINGIKADEADIPADKLMALPSSETYGEIDGVLGNDPAYGMPVTWIQPAQKAKALNMGYQVIDSASVIATHVNKIVRSYIPDLFSYDDITQLHNRLSSMAPRLAEDLSAALNYSQLLKVYRALLTEGVSLRDIVTIATVLVASSAVTKDHILLAADVRLALRRSITHPFVRKQELTVYTLNNELENLLTNVVNQAQQGGKVMLDSVPVDPNMLNQFQSTMPQVKEQMKAAGKDPVLLVPPQLRPLLARYARLFAPGLHVLSYNEVPDELELKIMGALM >CP029122.1|AWF25924.1|3907860_3908631_-|ompA-family-protein MIVNSVSKSERESIIAALHGQSIFSGGGLSPLNKISPSHPPKPATVAVPEETEKKARDVNEKTALLKKKSATELGELATSINTIARDAHMEANLEMEIVPQGLRVLIKDDQNRNMFECGSAQIMPFFKTLLVELAPVFDSLDNKIIITGHTDAMAYKNNIYNNWNLSGDRALSARRVLEEAGMPEDKVMQVSAMADQMLLDAKNPQSAGNRRIEIMVLTKSASDTLYQYFGQHGDKVVQPLVQKLDKQQVLSQRMR >CP029122.1|AWF24203.1|3906734_3907790_-|DNA-polymerase-IV MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARKFGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPLSLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELHLTASAGVAPVKFLAKIASDMNKPNGQFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTCGDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMAEDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQEHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLGL >CP029122.1|AWF25398.1|3906285_3906738_-|acetyltransferase-family-protein MNNIQIRNYQPGDFQQLCAIFIRAVMMTASQHYSPQQIAAWAQIDESRWKEKLAKSQVRVAVINAQPVGFISRIERHIDMLFVDPEYTRRGVASALLKPLIKSESELTVDASITAKPFFERYGFQIVKQQHVECRGAWFTNFYMRYKPQH >CP029122.1|AWF26868.1|3905712_3905967_-|hypothetical-protein MGKYIRPLSDAVFTIASDDLWIESLAIQQLHTTANLPNMQRVVGMPDLHPGRGYPIGAAFFSVGRFYPARRRGNGAGNRNGPLL >CP029122.1|AWF28495.1|3905546_3905744_-|hypothetical-protein MLETETGRYSDTLRSALVSLDGDNAWALSESWCGTIQWICPSPYRSHHGRKNWFLGMLTNGWRDC >CP029122.1|AWF25188.1|3905412_3905553_-|hypothetical-protein MLIAWKLEQQQQENSAALKSQRRMFHHQIERGNPRRTFTGMAFIEG >CP029122.1|AWF26214.1|3903898_3905356_+|cytosol-non-specific-dipeptidase MSELSQLSPQPLWDIFAKICSIPHPSYHEEQLAEYIVGWAKEKGFHVERDQVGNILIRKPATAGMENRKPVVLQAHLDMVPQKNNDTVHDFTKDPIQPYIDGEWVKARGTTLGADNGIGMASALAVLADENVVHGPLEVLLTMTEEAGMDGAFGLQSNWLQADILINTDSEEEGEIYMGCAGGIDFTSNLHLDREAVPAGFETFKLTLKGLKGGHSGGEIHVGLGNANKLLVRFLAGHAEELDLRLIDFNGGTLRNAIPREAFATIAVAADKVDALKSLVNTYQDILKNELAEKEKNLALLLDSVANDKAALIAKSRDTFIRLLNATPNGVIRNSDVAKGVVETSLNVGVVTMTDNNVEIHCLIRSLIDSGKDYVVSMLDSLGKLAGAKTEAKGAYPGWQPDANSPVMHLVRETYQRLFNKTPNIQIIHAGLECGLFKKPYPEMDMVSIGPTITGPHSPDEQVHIKSVGHYWTLLTELLKEIPAK >CP029122.1|AWF24378.1|3903179_3903638_-|xanthine-phosphoribosyltransferase MSEKYIVTWDMLQIHARKLASRLMPSEQWKGIIAVSRGGLVPGALLARELGIRHVDTVCISSYDHDNQRELKVLKRAEGDGEGFIVIDDLVDTGGTAVAIREMYPKAHFVTIFAKPAGRPLVDNYVVDIPQDTWIEQPWDMGVVFVPPISGR >CP029122.1|AWF25853.1|3901843_3903088_-|hypothetical-protein MTQANLSETLFKPRFKHPETSTLVRRFNHGAQPPVQSALDGKTIPHWYRMINRLMWIWRGIDPREILDVQARIVMSDAERTDDDLYDTVIGYRGGNWIYEWATQAMVWQQKACAEEDPQLSGRHWLHAATLYNIAAYPHLKGDDLAEQAQALSNRAYEEAAQRLPGTMRQMEFTVPGGAPITGFLHMPKGDGPFPTVLMCGGLDAMQTDYYSLYERYFAPRGIAMLTIDMPSVGFSSKWKLTQDSSLLHQHVLKALPNVPWVDHTRVAAFGFRFGANVAVRLAYLESPRLKAVACLGPVVHTLLSDFKCQQQVPEMYLDVLASRLGMHDASDDALRVELNRYSLKVQGLLGRRCPTPMLSGYWKNDPFSPEEDSRLITSSSADGKLLEIPFNPVYRNFDKGLQEITGWIEKRLC >CP029122.1|AWF25621.1|3910647_3911145_-|hypothetical-protein MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRHAIIKVKRDRPFEINAWVVLPEHMHCIWTLPEGDDDFSSRWREIKKQFTHACGLKNIWQPRFWEHAIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVARGLYPIDWAGDVTDINAGERIIL >CP029122.1|AWF28098.1|3911320_3911947_-|putative-endopeptidase-YafL MQNRARLLKQYQTHLKKQASYIVEGNAESRRALRQHNREQIKQHPEWFPAPLKASDRRWQALAENNHFLSSDHLHNITEVAIHRLEQQLGKPYVWGGTRPDQGFDCSGLVFYAYNKILEAKLPRTANEMYHYHRATIVANNDLRRGDLLFFHIHSREIADHMGVYLGDGQFIESPRTGENIRVSRLAEPFWQDHFLGARRILTEETIL >CP029122.1|AWF24988.1|3911984_3912125_+|hypothetical-protein MVALANKGSKTHNENTPGCRNLPSKNEDMKDINRLNEGENFKRPEE >CP029122.1|AWF25155.1|3912370_3913111_+|L,D-transpeptidase-catalytic-domain-protein MRKIALILAMLLIPCVSFAGLLGSSSSTTPVSKEYKQQLMGSPVYIQIFKEERTLDLYVKMGEQYQLLDSYKICKYSGGLGPKQRQGDFKSPEGFYSVQRNQLKPDSRYYKAINIGFPNAYDRAHGYEGKYLMIHGDCVSIGCYAMTNQGIDEIFQFVTGALVFGQPSVQVSIYPFRMTDANMKRHKYSNFKDFWEQLKPGYDYFEQTRKPPTVSVVNGRYVVSKPLSHEVVQPQLASNYTLPEAK >CP029122.1|AWF27894.1|3913081_3913849_-|glutamine-amidotransferases-class-II-family-protein MCELLGMSANVPTDICFSFTGLVQRGGGTGPHKDGWGITFYEGKGCRTFKDPQPSFNSPIAKLVQDYPIKSCSVVAHIRQANRGEVALENTHPFTRELWGRNWTYAHNGQLTGYKSLETGNFRPVGETDSEKAFCWLLHKLTQRYPRTPGNMAAVFKYIASLADELRQKGVFNMLLSDGRYVMAYCSTNLHWITRRAPFGVATLLDQDVEIDFSSQTTPNDVVTVIATQPLTGNETWQKIMPGEWRLFCLGERVV >CP029122.1|AWF27000.1|3914054_3914633_-|phosphoheptose-isomerase MYQDLIRNELNEAAETLANFLKDDANIHAIQRAAVLLADSFKAGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPAIAISDVSHISCVGNDFGFNDIFSRYVEAVGREGDVLLGISTSGNSANVIKAIAAAREKGMKVITLTGKDGGKMAGTADIEIRVPHFGYADRIQEIHIKVIHILIQLIEKEMVK >CP029122.1|AWF26976.1|3914872_3917317_+|acyl-CoA-dehydrogenase,-C-terminal-domain-protein MMILSILATVVLLGALFYHRVSLFISSLILLAWTAALGVAGLWSAWVLVPLAIILVPFNFAPMRKSMISAPVFRGFRKVMPPMSRTEKEAIDAGTTWWEGDLFQGKPDWKKLHNYPQPRLTAEEQAFLDGPVEEACRMANDFQITHELADLPPELWAYLKEHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAITVGVPNSLGPGELLQHYGTDEQKNHYLPRLARGQEIPCFALTSPEAGSDAGAIPDTGIVCMGEWQGQQVLGMRLTWNKRYITLAPIATVLGLAFKLSDPEKLLGGAEDLGITCALIPTTTPGVEIGRRHFPLNVPFQNGPTRGKDVFVPIDYIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYAHIRRQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAIVKYHCTHRGQQSIIDAMDITGGKGIMLGQSNFLARAYQGAPIAITVEGANILTRSMMIFGQGAIRCHPYVLEEMEAAKNNDVNAFDKLLFKHIGHVGSNKVRSFWLGLTRGLTSSTPTGDATKRYYQHLNRLSANLALLSDVSMAVLGGSLKRRERISARLGDILSQLYLASAVLKRYDDEGRNEADLPLVHWGVQDALYQAEQAMDDLLQNFPNRVVAGLLNVVIFPTGRHYLAPSDKLDHKVAKILQVPNATRSRIGRGQYLTPSEHNPVGLLEEALVDVIAADPIHQRICKELGKNLPFTRLDELAHNALAKGLIDKDEAAILVKAEESRLCSINVDDFDPEELATKPVKLPEKVRKVEAA >CP029122.1|AWF24480.1|3917359_3917806_-|inhibitor-of-vertebrate-lysozyme MFKAITTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYTPAQTVTLGDETYQVMSACKPHDCGSQRIAVMWSEKSNQMTGLFSTIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENHPDGFNFK >CP029122.1|AWF24735.1|3917986_3918757_+|carbon-nitrogen-hydrolase-family-protein MPGLKITLLQQPLVWMDGPANLRHFDRQLEGITGRDVIVLPEMFTSGFAMEAAASSLAQNDVVNWMTAKAQQCNALIAGSVALQTESGSVNRFLLVEPGGTVHFYDKRHLFRMADEHLHYKAGNARVIVEWRGWRILPLVCYDLRFPVWSRNLNDYDLAIYVANWPAPRSLHWQALLTARAIENQAYVAGCNRVGSDGNGCHYRGDSRVINPQGEIIATADAHQATRIDAELSMVALREYREKFPAWQDADEFRLR >CP029122.1|AWF25970.1|3918798_3919842_-|DDE_Tnp_1-associated-family-protein MTIFAVISGAESWEDIEDFGETHLDFLKQYGDFENGIPVHDTIARVVSCISPAKFHECFINWMRDCHSSDDKDVIAIDGKTLRHSYDKSRRRGAIHVISAFSTMHSLVIGQIRTDEKSNEITAIPELLNMLDIKGKIITTDAMGCQKDIAEKIQKQGGDYLFAVKGNQGRLNKAFEEKFPLKELNNPEHDSYAMSEKSHGREEIRLHIVCDVPDELIDFTFEWKGLKKLCVAVSFRSIIAEQKKEPEMTVRYYISSADLTAEKFATAIRNHWHVENKLHWRLDVVMNEDDCKIRRGNAAELFSGIRHIAINILTNDKVFKAGLRRKMRKAAMDRNYLASVLAESGLS |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029122_10 | 4100634-4100749 | Orphan |
NA
Consensus repeat of CP029122_10
|
1 spacers
spacers of CP029122_10
>10.1|4100665|54|CP029122|CRISPRCasFinder TGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTC |
CRISPR arrays and Neighbor proteins around CP029122_10
The CRISPR arrays of CP029122_10 >merge|CP029122|10|4100634-4100749|CRISPRCasFinder AACGCCTGATGCGACGCTGACGCGTCTTATCTGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTCAACGCCTGATGCGACGCTGGCGCGTCTTATC >CP029122|10|10|4100634-4100749|CRISPRCasFinder AACGCCTGATGCGACGCTGACGCGTCTTATC TGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTC AACGCCTGATGCGACGCTGGCGCGTCTTATC
>CP029122.1|AWF28370.1|4099111_4100614_+|L-arabinose-isomerase MTIFDNYEVWFVIGSQHLYGPETLRQVTQHAEHVVNALNTEAKLPCKLVLKPLGTTPDEITAICRDANYDDRCAGLVVWLHTFSPAKMWINGLTMLNKPLLQFHTQFNAALPWDSIDMDFMNLNQTAHGGREFGFIGARMRQQHAVVTGHWQDKQAHERIGSWMRQAVSKQDTRHLKVCRFGDNMREVAVTDGDKVAAQIKFGFSVNTWAVGDLVQVVNSISDGDVNALVDEYESCYTMTPATQIHGEKRQNVLEAARIELGMKRFLEQGGFHAFTTTFEDLHGLKQLPGLAVQRLMQQGYGFAGEGDWKTAALLRIMKVMSTGLQGGTSFMEDYTYHFEKGNDLVLGSHMLEVCPSIAVEEKPILDVQHLGIGGKDDPARLIFNTQTGPAIVASLIDLGDRYRLLVNCIDTVKTPHSLPKLPVANALWKAQPDLPTASEAWILAGGAHHTVFSHALNLNDMRQFAEMHDIEITVIDNDTRLPAFKDALRWNEVYYGFRR >CP029122.1|AWF24275.1|4097400_4099101_+|L-ribulokinase MAIAIGLDFGSDSVRALAVDCASGEEIATSVEWYPRWQKGQFCDAPNNQFRHHPRDYIESMEAALKTVLAELSVEQRAAVVGIGVDTTGSTPAPIDADGNVLALRPEFAENPNAMFVLWKDHTAVEEAEEITRLCHAPGNVDYSRYIGGIYSSEWFWAKILHVTRQDSAVAQSAASWIELCDWVPALLSGTTGPQDIRRGRCSAGHKSLWHESWGGLPPASFFDELDPILNRHLPSPLFTDTWTADIPVGTLCPEWAQRLGLPESVVISGGAFDCHMGAVGAGAQPNALVKVIGTSTCDILIADKQSVGERAVKGICGQVDGSVVPGFIGLEAGQSAFGDIYAWFGRVLGWPLEQLAAQHPELKAQINASQKQLLPALTEAWAKNPSLDHLPVVLDWFNGRRTPNANQRLKGVITDLNLATDAPLLFGGLIAATAFGARAIMECFTDQGIAVNNVMALGGIARKNQVIMQACCDVLNRPLQIVASDQCCALGAAIFAAVAAKVHADIPSAQQKMASAVEKTLQPCSEQAQRFEQLYRRYQQWAMSAEQHYLPTSAPAQAAQAVPTL >CP029122.1|AWF28188.1|4096183_4097062_-|arabinose-operon-regulatory-protein MAEAQNDPLLPGYSFNAHLVAGLTPIEANGYLDFFIDRPLGMKGYILNLTIRGQGVVKNQGREFVCRPGDILLFPPGEIHHYGRHPEAREWYHQWVYFRPRAYWHEWLNWPSIFANTGFFRPDEAHQPHFSDLFGQIINAGQGEGRYSELLAINLLEQLLLRRMEAINESLHPPMDNRVREACQYISDHLADSNFDIASVAQHVCLSPSRLSHLFRQQLGISVLSWREDQRISQAKLLLSTTRMPIATVGRNVGFDDQLYFSRVFKKCTGASPSEFRAGCEEKVNDVAVKLS >CP029122.1|AWF26926.1|4095333_4096098_-|inner-membrane-protein-YabI MQALLEHFITQSTVYSLMAVVLVAFLESLALVGLILPGTVLMAGLGALIGSGELSFWHAWLAGIVGCLLGDWISFWLGWRFKKPLHRWSFLKKNKALLDKTEHALHQHSMFTILVGRFVGPTRPLVPMVAGMLDLPVAKFITPNIIGCLLWPPFYFLPGILAGAAIDIPAGMQSGEFKWLLLATAVFLWVGGWLCWRLWRSGKATDRLSHYLSRGRLLWLTPLISAIGVVALVVLIRHPLMPVYIDILRKVVGG >CP029122.1|AWF27297.1|4094521_4095220_+|thiamine-ABC-transporter,-ATP-binding-protein MLKLTDITWLYHHLPMRFSLTVERGEQVAILGPSGAGKSTLLNLIAGFLTPASGSLTIDGVDHTTTPPSRRPVSMLFQENNLFSHLTVAQNIGLGLNPGLKLNAAQQEKMHAIARQMGIDNLMARLPGELSGGQRQRVALARCLVREQPILLLDEPFSALDPALRQEMLTLVSTSCQQQKMTLLMVSHSVEDAARIATRSVVVADGRIAWQGKTNELLSGKASASALLGITG >CP029122.1|AWF27932.1|4092927_4094538_+|thiamine/thiamine-pyrophosphate-ABC-transporter,-permease-protein MATRRQPLIPGWLIPGVSAATLVVAVALAAFLALWWNAPQGNWVAVWQDSYLWHVVRFSFWQAFLSALLSVVPAIFLARALYRRRFPGRLALLRLCAMTLILPVLVAVFGILSVYGRQGWLASLCQSLGLEWTFSPYGLQGILLAHVFFNLPMASRLLLQALENIPGEQRQLAAQLGMRGWHFFRFVEWPWLRRQIPPVAALIFMLCFASFATVLSLGGGPQATTIELAIYQALSYDYDPARAAMLALIQMVCCLGLVLLSQRLSKAIAPGTTLLQGWRDPDDRLHSRICDTVLIVLALLLLLPPLLAVIVDGVNRQLPEVLAQPVLWQALWTSLRIALAAGVLCVVLTMMLLWSSRELRARQKMLAGQALEMSGMLILAMPGIVLATGFFLLLNNTIGLPQSADGIVIFTNALMAIPYALKVLENPMRDITARYSMLCQSLGIEGWSRLKVVELRALKRPLAQALAFACVLSIGDFGVVALFGNDDFRTLPFYLYQQIGSYRSQDGAVTALILLLLCFLLFTVIEKLPGRNVKTD >CP029122.1|AWF25534.1|4091968_4092952_+|thiamine/thiaminee-pyrophosphate-ABC-transporter,-thiamine/thiaminee-pyrophospate-binding-protein MLKKCLPLLLLCTAPVFAKPVLIVYTYDSFAADWGPGPKIKKAFEADCNCELKLVALEDGVSLLNRLRMEGKNSKADVVLGLDNNLLDAASKTGLFAKSGVAADAVNVPGGWNNDTFVPFDYGYFAFVYDKNKLKNPPQSLKELVESDQNWRVIYQDPRTSTPGLGLLLWMQKVYGDDAPQAWQKLAKKTVTVTKGWSEAYGLFLKGESDLVLSYTTSPAYHILEEKKDNYAAANFSEGHYLQVEVAARTAASKQPELAQKFLQFMVSPAFQNAIPTGNWMYPVANVTLPAGFEQLTKPATTLEFTPAEVAAQRQAWISEWQRAVSR >CP029122.1|AWF28539.1|4090149_4091805_+|bacterial-extracellular-solute-binding,-5-Middle-family-protein MPSARLQQQFIRLWQCCEGKSQDTTLNELAALLSCSRRHMRTLLNTMQDRGWLTWEAEVGRGKRSRLTFLYTGLALQQQRAEDLLEQDRIDQLVQLVGDKATVRQMLVSHLGRSFRQGRHILRVLYYRPLRNLLPGSALRRSETHIARQIFSSLTRINEENGELEADIAHHWQQISPLHWRFFLRPGVHFHHGRELEMDDVIASLKRINTLPLYSHIADIVSPTPWTLDIHLTQPDRWLPLLLGQVPAMILPREWETLSNFASHPIGTGPYAVIRNTTNQLKIQAFDDFFGYRALIDEVNVWVLPEIADEPAGGLMLKGPQGEEKEIESRLEEGCYYLLFDSRTHRGANQQVRDWVSYVLSPTNLVYFAEEQYQQLWFPAYGLLPRWHHARTIKSEKPAGLESLTLTFYQDHSEHRVIAGIMQQILASHQVTLEIKEISYDQWHEGEIESDIWLNSANFTLPLDFSLFAHLCEVPLLQHCIPIDWQADAARWRNGEMNLANWCQQLVASKAMVPLIHHWLIIQGQRSMRGLRMNTLGWFDFKSAWFAPPDP >CP029122.1|AWF25911.1|4089929_4090061_-|putative-inhibitor-of-glucose-uptake-transporter-SgrT MRQFYQHYFTATAKLCWLRWLSVPQRLTMLEGLMQWDDRNSES >CP029122.1|AWF25648.1|4088649_4089828_-|sugar-efflux-transporter-A MIWIMTMARRMNGVYAAFMLVAFMMGVAGALQAPTLSLFLSREVGAQPFWIGLFYTVNAIAGIGVSLWLAKRSDSQGDRRKLIIFCCLMAIGNALLFAFNRHYLTLITCGVLLASLANTAMPQLFALAREYADNSAREVVMFSSVMRAQLSLAWVIGPPLAFMLALNYGFTVMFSIAAGIFTLSLVLIAFMLPSVARVELPSENALSMQGGWQDSNVRMLFVASTLMWTCNTMYIIDMPLWISSELGLPDKLAGFLMGTAAGLEIPAMILAGYYVKRYGKRRMMVIAVAAGVLFYTGLIFFHSRMALMTLQLFNAVFIGIVAGIGMLWFQDLMPGRAGAATTLFTNSISTGVILAGVIQGAIAQSWGHFAVYWVIAVISVVALFLTAKVKDV >CP029122.1|AWF27895.1|4100813_4101509_+|L-ribulose-5-phosphate-4-epimerase MLEDLKRLVLEANLALPKHNLVTLTWGNVSAVDRERGVFVIKPSGVDYSVMTADDMVVVSIATGEVVEGTKKPSSDTPTHRLLYQAFPSIGGIVHTHSRHATIWAQAGQSIPATGTTHADYFYGTIPCTRKMTDAEINGEYEWETGNVIVETFEKQGIDAAQMPGVLVHSHGPFAWGKNAEDAVHNAIVLEEVAYMGIFCRQLAPQLPDMQQTLLDKHYLRKHGAKAYYGQ >CP029122.1|AWF27586.1|4101583_4103935_+|DNA-polymerase-II MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQVPRAQHILQGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGGVTVYEADVRPPERYLMERFITSPVWVEGDIRNGAIVNARLKPHPDYRPPLKWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASALDFELEYVASRPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERYRIPLRLGRDNSELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQELLGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMPFLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASPGGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHSTEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFYGVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTFVWLKGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCRFLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQELYLRIFRNEPYQEYIRETIDKLMAGELDARLVYRKRLRRPLSEYQRNVPPHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYEHYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF >CP029122.1|AWF28506.1|4104246_4107006_+|RNA-polymerase-associated-protein-RapA MTRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREVFLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQRTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAAERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQLVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIEQLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYRPVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSARQELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKVSGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTSHRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWFAEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRIGQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDLINYLASPDQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPDFPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSSTISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGNNLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSARALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESLDQAGWRLDALRLIVVTHQ >CP029122.1|AWF25762.1|4107017_4107677_+|pseudouridine-synthase,-RluA-family-protein MGMENYNPPQEPWLVILYQDDHIMVVNKPSGLLSVPGRLEEHKDSVMTRIQRDYPQAESVHRLDMATSGVIVVALTKAAERELKRQFREREPKKQYVARVWGHPSPAEGLVDLPLICDWPNRPKQKVCYETGKPAQTEYEVVEYAADNTARVVLKPITGRSHQLRVHMLALGHPILGDRFYASPEARAMAPRLLLHAEMLTITHPAYGNSMTFKAPADF >CP029122.1|AWF28478.1|4107793_4108609_-|dnaJ-like-protein-DjlA MQYWGKIIGVAVALLMGGGFWGVVLGLLIGHMFDKARSRKMAWFANQRERQALFFATTFEVMGHLTKSKGRVTEADIHIASQLMDRMNLHGASRTAAQNAFRVGKSDNYPLREKMRQFRSVCFGRFDLIRMFLEIQIQAAFADGSLHPNERAVLYVIAEELGISRAQFDQFLRMMQGGAQFGGGYQQQTGGGNWQQAQRGPTLEDACNVLGVKPTDDATTIKRAYRKLMSEHHPDKLVAKGLPPEMMEMAKQKAQEIQQAYELIKQQKGFK >CP029122.1|AWF26362.1|4108863_4111218_+|LPS-assembly-protein-LptD MKKRIPTLLATMIATALYSQQGLAADLASQCMLGVPSYDRPLVQGDTNDLPVTINADHAKGDYPDDAVFTGSVDIMQGNSRLQADEVQLHQKEAPGQPEPVRTVDALGNVHYDDNQVILKGPKGWANLNTKDTNVWEGDYQMVGRQGRGKADLMKQRGENRYTILDNGSFTSCLPGSDTWSVVGSEIIHDREEQVAEIWNARFKVGPVPIFYSPYLQLPVGDKRRSGFLIPNAKYTTTNYFEFYLPYYWNIAPNMDATITPHYMHRRGNIMWENEFRYLSQAGAGLMELDYLPSDKVYEDEHPNDDSSRRWLFYWNHSGVMDQVWRFNVDYTKVSDPSYFNDFDNKYGSSTDGYATQKFSVGYAVQNFNATVSTKQFQVFSEQNTSSYSAEPQLDVNYYQNDVGPFDTRIYGQAVHFVNTRDDMPEATRVHLEPTINLPLSNNWGSINTEAKLLATHYQQTNLDWYNSRNTTKLDESVNRVMPQFKVDGKMVFERDMEMLAPGYTQTLEPRAQYLYVPYRDQSDIYNYDSSLLQSDYSGLFRDRTYGGLDRIASANQVTTGVTSRIYDDAAVERFNISVGQIYYFTESRTGDDNITWENDDKTGSLVWAGDTYWRISERWGLRGGIQYDTRLDNVATSNSSIEYRRDEDRLVQLNYRYASPEYIQATLPKYYSTAEQYKNGISQVGAVASWPIADRWSIVGAYYYDTNANKQADSMLGVQYSSCCYAIRVGYERKLNGWDNDKQHAVYDNAIGFNIELRGLSSNYGLGTQEMLRSNILPYQNSL >CP029122.1|AWF26110.1|4111270_4112557_+|chaperone-SurA MKNWKTLLLGIAMIANTSFAAPQVVDKVAAVVNNGVVLESDVDGLMQSVKLNAAQARQQLPDDATLRHQIMERLIMDQIILQMGQKMGVKISDEQLDQAIANIAKQNNMTLDQMRSRLAYDGLNYNTYRNQIRKEMIISEVRNNEVRRRITILPQEVESLAQQVGNQNDASTELNLSHILIPLPENPTSDQVNEAESQARAIVDQARNGADFGKLAIAHSADQQALNGGQMGWGRIQELPGIFAQALSTAKKGDIVGPIRSGVGFHILKVNDLRGESKNISVTEVHARHILLKPSPIMTDEQARVKLEQIAADIKSGKTTFAAAAKEFSQDPGSANQGGDLGWATADIFDPAFRDALTRLNKGQMSAPVHSSFGWHLIELLDTRNVDKTDAAQKDRAYRMLMNRKFSEEAASWMQEQRASAYVKILSN >CP029122.1|AWF27582.1|4112556_4113546_+|4-hydroxythreonine-4-phosphate-dehydrogenase-PdxA MVKTQRVVITPGEPAGIGPDLVVQLAQREWPVELVVCADATLLTDRAAMLGLPLTLRTYSPNSPAQPQTAGTLTLLPVALRESVTAGQLAVENGHYVVETLARACDGCLNGEFAALITGPVHKGVINDAGIPFTGHTEFFEERSQAKKVVMMLATEELRVALATTHLPLRDIADAITPALLHEVIAILHHDLRTKFGIAEPRILVCGLNPHAGEGGHMGTEEIDTIIPLLDELRAQGMKLNGPLPADTLFQPKYLDNADAVLAMYHDQGLPVLKYQGFGRGVNITLGLPFIRTSVDHGTALELAGRGEADVGSFITALNLAIKMIVNTQ >CP029122.1|AWF25771.1|4113542_4114364_+|dimethyladenosine-transferase MNNRVHQGHLARKRFGQNFLNDQFVIDSIVSAINPQKGQAMVEIGPGLAALTEPVGERLDQLTVIELDRDLAARLQTHPFLGPKLTIYQQDAMTFNFGELAEKMGQPLRVFGNLPYNISTPLMFHLFSYTDAIADMHFMLQKEVVNRLVAGPNSKAYGRLSVMAQYYCNVIPVLEVPPSAFTPPPKVDSAVVRLVPHATMPHPVKDVRVLSRITTEAFNQRRKTIRNSLGNLFSVEVLTGMGIDPAMRAENISVAQYCQMANYLAENAPLQES >CP029122.1|AWF26568.1|4114492_4114744_+|hypothetical-protein MQLLGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMIDENGVPFSIDIPVFRLAVPTLIH |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CP029122_11 | 4123662-4123794 | Orphan |
NA
Consensus repeat of CP029122_11
|
2 spacers
spacers of CP029122_11
>11.1|4123679|42|CP029122|PILER-CR TGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTC >11.2|4123738|40|CP029122|PILER-CR CATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGG |
CRISPR arrays and Neighbor proteins around CP029122_11
The CRISPR arrays of CP029122_11 >merge|CP029122|11|4123662-4123794|PILER-CR ATCACCAATATTGAAAATGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTCCTCACCAATATTGAAAACATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGGATCACCAATATTGAAAG >CP029122|11|3|4123662-4123794|PILER-CR ATCACCAATATTGAAAA TGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTC CTCACCAATATTGAAAA CATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGG ATCACCAATATTGAAAG
>CP029122.1|AWF27765.1|4122800_4123571_-|electron-transfer-flavodomain-protein MKIITCYKCVPDEQDIAVNNADGSLDFSKADAKISQYDLNAIEAACQLKQQAAEAQVTALSVGGKALTNAKGRKDVLSRGPDELIVVIDDQFEQALPQQTASALAAAAQKAGFDLILCGDGSSDLYAQQVGLLVGEILNIPAVNGVSKIISLTADTLTVERELEDETETLSIPLPAVVAVSTDINSPQIPSMKAILGAAKKPVQVWSAADIGFNAEAAWSEQQVAAPKQRERQRIVIEGDGEEQIAAFAENLRKVI >CP029122.1|AWF25998.1|4121844_4122786_-|electron-transfer-flavodomain-protein MNTFSQVWVFSDTPSRLPELMNGAQALANQINTFVLNDADGAQAIQLGANHVWKLNGKPDDRMIEDYAGVMADTIRQHGADGLVLLPNTRRGKLLAAKLGYRLKAAVSNDASTVSVQDGKATVKHMVYGGLAIGEERIATPYAVLTISSGTFDAAQPDASRTGETHTVEWQAPAVAITRTATQARQSNSVDLDKARLVVSVGRGIGSKENIALAEQLCKAIGAELACSRPVAENEKWMEHERYVGISNLMLKPELYLAVGISGQIQHMVGANASQTIFAINKDKNAPIFQYADYGIVGDAVKILPALTAALAR >CP029122.1|AWF27848.1|4120507_4121794_-|GDP-dissociation-inhibitor-family-protein MSEDIFDAIIVGAGLAGSVAALVLAREGAQVLVIERGNSAGAKNVTGGRLYAHSLEHIIPGFADSAPVERLITHEKLAFMTEKSAMTMDYCNGDETSPSQRSYSVLRSKFDAWLMEQAEEAGAQLITGIRVDNLVQRDGKVVGVEADGDVIEAKTVILADGVNSILAEKLGMAKRVKPTDVAVGVKELIELPKSVIEDRFQLQGNQGAACLFAGSPTDGLMGGGFLYTNENTLSLGLVCGLHHLHDAKKSVPQMLEDFKQHPAVAPLIAGGKLVEYSAHVVPEAGINMLPELVGDGVLIAGDAAGMCMNLGFTIRGMDLAIAAGEAAAKTVLSAMKSDDFSKQKLAEYRQHLESGPLRDMRMYQKLPAFLDNPRMFSGYPELAVGVARDLFTIDGSAPELMRKKILRHGKKVGFINLIKDGMKGVTVL >CP029122.1|AWF26117.1|4120223_4120511_-|putative-ferredoxin-like-protein-FixX MTSPVNVDVKLGVNKFNVDEEHPHIVVKADADKQVLELLVKACPAGLYKKQDDGSVRFDYAGCLECGTCRILGLGSALEQWEYPRGTFGVEFRYG >CP029122.1|AWF25329.1|4118834_4120166_-|sugar-(and-other)-transporter-family-protein MQPSRNFDDLKFSSIHRRILLWGSGGPFLDGYVLVMIGVALEQLTPALKLDADWIGLLGAGTLAGLFVGTSLFGYISDKVGRRKMFLIDIIAIGVISVATMFVSSPVELLVMRVLIGIVIGADYPIATSMITEFSSTRQRAFSISFIAAMWYVGATCADLVGYWLYDVEGGWRWMLGSAAIPCLLILIGRFELPESPRWLLRKGRVKECEEMMIKLFGEPVAFDEEQPQQTRFRDLFNRRHFPFVLFVAAIWTCQVIPMFAIYTFGPQIVGLLGLGVGKNAALGNVVISLFFMLGCIPPMLWLNTAGRRPLLIGSFAMMTLALAVLGLIPDMGIWLVVMAFAVYAFFSGGPGNLQWLYPNELFPTDIRASAVGVIMSLSRIGTIVSTWALPIFINNYGISNTMLMGAGISLFGLLISVAFAPETRGMSLAQTSNMTIRGQRMG >CP029122.1|AWF24271.1|4118196_4118670_-|NADPH-dependent-FMN-reductase-family-protein MLEQARTLEGVEIRSLYQLYPDFNIDIAAEQEALSRADLIVWQHPMQWYSIPPLLKLWIDKVFSHGWAYGHGGTALHGKHLLWAVTTGGGESHFEIGAHPGFDVLSQPLQATAIYCGLNWLPPFAMHCTFICDDETLEGQARHYKQRLLEWQEAHHG >CP029122.1|AWF26368.1|4116341_4118204_-|proton-antiporter-2-family-protein MDSHTLIQALIYLGSAALIVPIAVRLGLGSVLGYLIAGCIIGPWGLRLVTDAESILHFAEIGVVLMLFIIGLELDPQRLWKLRAAVFGGGALQMVICGGLLGLFCMLLGLRWQVAELIGMTLALSSTAIAMQAMNERNLMVTQMGRSAFAVLLFQDIAAIPLVAMIPLLATSSASTTMGAFALSALKVAGALVLVVLLGRYVTRPALRFVARSGLREVFSAVALFLVFGFGLLLEEVGLSMAMGAFLAGVLLASSEYRHALESDIEPFKGLLLGLFFIGVGMSIDFGTLLENPLRIVILLLGFLIIKIAMLWLIARPLQVPNKQRRWFAVLLGQGSEFAFVVFGAAQMANVLEPEWAKSLTLAVALSMAATPILLVILNRLEQSSTEEAREADEIDEEQPRVIIAGFGRFGQITGRLLLSSGVKMVVLDHDPDHIETLRKFGMKVFYGDATRMDLLESAGAAKAEVLINAIDDPQTNLQLTEMVKEHFPHLQIIARARDVDHYIRLRQAGVEKPERETFEGALKTGRLALESLGLGPYEARERADVFRRFNIQMVEEMAMVENDTKARAAVYKRTSAMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKPSS >CP029122.1|AWF27943.1|4115670_4116150_-|dihydrofolate-reductase MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLNKPVIMGRHTWESIGRPLPGRKNIILSSQPGTDDRVTWVKSVDEAIAACGDVPEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVEGDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR >CP029122.1|AWF25925.1|4114750_4115593_+|bis(5'-nucleosyl)-tetraphosphatase MATYLIGDVHGCYDELIALLHKVEFTPGKDTLWLTGDLVARGPGSLDVLRYVKSLGDSVRLVLGNHDLHLLAVFAGISRNKPKDRLTPLLEAPDADELLNWLRRQPLLQIDEEKKLVMAHAGITPQWDLQTAKECARDVEAVLSSDSYPFFLDAMYGDMPNNWSPELRGLGRLRFITNAFTRMRFCFPNGQLDMYSKESPEEAPAPLKPWFAIPGPVAEEYSIAFGHWASLEGKGTPEGIYALDTGCCWGGTLTCLRWEDKQYFVQPSNRHKDLGEAAAS >CP029122.1|AWF26568.1|4114492_4114744_+|hypothetical-protein MQLLGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMIDENGVPFSIDIPVFRLAVPTLIH >CP029122.1|AWF24747.1|4124044_4125559_+|L-carnitine/gamma-butyrobetaine-antiporter MKNEKRKTGIEPKVFFPPLIIVGILCWLTVRDLDAANVVINAVFSYVTNVWGWAFEWYMVVMLFGWFWLVFGPYAKKRLGNEPPEFSTASWIFMMFASCTSAAVLFWGSIEIYYYISTPPFGLEPNSTGAKELGLAYSLFHWGPLPWATYSFLSVAFAYFFFVRKMEVIRPSSTLVPLVGEKHAKGLFGTIVDNFYLVALIFAMGTSLGLATPLVTECMQWLFGIPHTLQLDAIIITCWIILNAICVACGLQKGVRIASDVRSYLSFLMLGWVFIVSGASFIMNYFTDSVGMLLMYLPRMLFYTDPIAKGGFPQGWTVFYWAWWVIYAIQMSIFLARISRGRTVRELCFGMVLGLTASTWILWTVLGSNTLLLIDKNIINIPNLIEQYGVARAIIETWAALPLSTATMWGFFILCFIATVTLVNACSYTLAMSTCREVRDGEEPPLLVRIGWSILVGIIGIVLLALGGLKPIQTAIIAGGCPLFFVNIMVTLSFIKDAKQNWKD >CP029122.1|AWF25198.1|4125589_4126732_+|crotonobetainyl-CoA-dehydrogenase MDFNLNDEQELFVAGIRELMASENWEAYFAECDRDSVYPERFVKALADMGIDSLLIPEEHGGLDAGFVTLAAVWMELGRLGAPTYVLYQLPGGFNTFLREGTQEQIDKIMAFRGTGKQMWNSAITEPGAGSDVGSLKTTYTRRNGKIYLNGSKCFITSSAYTPYIVVMARDGASPDKPVYTEWFVDMSKPGIKVTKLEKLGLRMDSCCEITFDDVELDEKDMFGREGNGFNRVKEEFDHERFLVALTNYGTAMCAFEDAARYANQRVQFGEAIGRFQLIQEKFAHMAIKLNSMKNMLYEAAWKADNGTITSGDAAMCKYFCANAAFEVVDSAMQVLGGVGIAGNHRISRFWRDLRVDRVSGGSDEMQILTLGRAVLKQYR >CP029122.1|AWF25724.1|4126860_4128078_+|carnitine-CoA-transferase MDHLPMPKFGPLAGLRVVFSGIEIAGPFAGQMFAEWGAEVIWIENVAWADTIRVQPNYPQLSRRNLHALSLNIFKDEGREAFLKLMETTDIFIEASKGPAFARRGITDEVLWQHNPKLVIAHLSGFGQYGTEEYTNLPAYNTIAQAFSGYLIQNGDVDQPMPAFPYTADYFSGLTATTAALAALHKARETGKGESIDIAMYEVMLRMGQYFMMDYFNGGEMCPRMSKGKDPYYAGCGLYKCADGYIVMELVGITQIEECFKDIGLAHLLSTPEIPEGTQLIHRIECPYGPLVEEKLDAWLAAHTIAEVKERFAELNIACAKVLTVPELESNPQYVARESITQWQTMDGRTCKGPNIMPKFKNNPGQIWRGMPSHGMDTAAILKNIGYSENDIQELVSKGLAKVED >CP029122.1|AWF26342.1|4128151_4129705_+|hypothetical-protein MDIIGGQHLRQMWDDLADVYGHKTALICESSGGVVNRYSYLELNQEINRTANLFYTLGIRKGDKVALHLDNCPEFIFCWFGLAKIGAIMVPINARLLREESAWILQNSQACLLVTSAQFYPMYQQIQQEDATQLRHICLTDVALPADDGVSSFTQLKNQQPATLCYAPPLLTDDTAEILFTSGTTSRPKGVVITHYNLRFAGYYSAWQCALRDDDVYLTVMPAFHIDCQCTAAMAAFSAGATFVLVEKYSARAFWGQVQKYRATITECIPMMIRTLMVQPPSANDRQHRLREVMFYLNLSEQEKDAFCERFGVRLLTSYGMTETIVGIIGDRPGDKRRWPSIGRAGFCYEAEIRDDHNRPLPAGEIGEICIKGVPGKTIFKEYFLNPKATAKVLEADGWLHTGDTGYCDEEGFFYFVDRRCNMIKRGGENVSCVELENIIATHPKIQDIVVVGIKDSIRDEAIKAFVVLNEGETLSEEEFFRFCEQNMAKFKVPSYLEIRKDLPRNCSGKIIRKNLK >CP029122.1|AWF27246.1|4129813_4130599_+|carnitinyl-CoA-dehydratase MSESLHLTRNGSILEITLDRPKANAIDAKTSFEMGEVFLNFRDDPQLRVAIITGAGEKFFSAGWDLKAAAEGEAPDADFGPGGFAGLTEIFNLDKPVIAAVNGYAFGGGFELALAADFIVCADNASFALPEAKLGIVPDSGGVLRLPKILPPAIVNEMVMTGRRMGTEEALRWGIVNRVVSQAELMDNARELAQQLVNSAPLAIAALKEIYRTTSEMPVEEAYRYIRSGVLKHYPSVLHSEDAVEGPLAFAEKRDPVWKGR >CP029122.1|AWF26433.1|4130604_4131195_+|bacterial-transferase-hexapeptide-family-protein MSYYAFEGLIPVVHPTAFVHPSAVLIGDVIVGAGVYIGPLASLRGDYGRLIVQAGANIQDGCIMHGYCDTDTIVGENGHIGHGAILHGCVIGRDALVGMNSVIMDGAVIGEESIVAAMSFVKAGFHGEKRQLLMGTPARAVRSVSDDELHWKRLNTKEYQDLVGRCHASLHETQPLRQMEENRPRLQGTTDVTPKR >CP029122.1|AWF28209.1|4131280_4131676_-|hypothetical-protein MCEGYVEKPLYLLIAEWMMAENRWVIAREISIHFDIEHSKAVNTLTYILSEVAEISCEVKMIPNKLEGRGCQCQRLVKVVDIDEQIYARLRNNSRDKLVGVRKTPRIPAVPLTELNREQKWQMMLSKSMRR >CP029122.1|AWF26097.1|4131710_4131929_+|hypothetical-protein MTRFEAIKQGHIKIVDISIVCNFTVDKCELNPAYVIKNIDSPKDLLNGQKKRSSSENRITYSIKLADEKYPP >CP029122.1|AWF28616.1|4131936_4135158_-|carbamoyl-phosphate-synthase-large-chain MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEMADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCALELERQGVLEEFGVTMIGATADAIDKAEDRRRFDVAMKKIGLETARSGIAHTMEEALAVAADVGFPCIIRPSFTMGGSGGGIAYNREEFEEICARGLDLSPTKELLIDESLIGWKEYEMEVVRDKNDNCIIVCSIENFDAMGIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIGVETGGSNVQFAVNPKNGRLIVIEMNPRVSRSSALASKATGFPIAKVAAKLAVGYTLDELMNDITGGRTPASFEPSIDYVVTKIPRFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQESLQKALRGLEVGATGFDPKVSLDDPEALTKIRRELKDAGAERIWYIADAFRAGLSVDGVFNLTNIDRWFLVQIEELVRLEEKVAEVGITGLNAEFLRQLKRKGFADARLAKLAGVREAEIRKLRDQYDLHPVYKRVDTCAAEFATDTAYMYSTYEEECEANPSTDREKIMVLGGGPNRIGQGIEFDYCCVHASLALREDGYETIMVNCNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVPVIGTSPDAIDRAEDRERFQHAVERLKLKQPANATVTAIEMAVEKAKEIGYPLVVRPSYVLGGRAMEIVYDEADLRRYFQTAVSVSNDAPVLLDHFLDDAVEVDVDAICDGEMVLIGGIMEHIEQAGVHSGDSACSLPAYTLSQEIQDVMRQQVQKLAFELQVRGLMNVQFAVKNNEVYLIEVNPRAARTVPFVSKATGVPLAKVAARVMAGKSLAEQGVTKEVIPPYYSVKEVVLPFNKFPGVDPLLGPEMRSTGEVMGVGRTFAEAFAKAQLGSNSTMKKHGRALLSVREGDKERVVDLAAKLLKQGFELDATHGTAIVLGEAGINPRLVNKVHEGRPHIQDRIKNGEYTYIINTTSGRRAIEDSRVIRRSALQYKVHYDTTLNGGFATAMALNADATEKVISVQEMHAQIK >CP029122.1|AWF24749.1|4135175_4136324_-|carbamoyl-phosphate-synthase-small-chain MIKSALLVLEDGTQFHGRAIGATGSAVGEVVFNTSMTGYQEILTDPSYSRQIVTLTYPHIGNVGTNDADEESSQVHAQGLVIRDLPLIASNFRNTEDLSSYLKRHNIVAIADIDTRKLTRLLREKGAQNGCIIAGDNPDAALALEKARAFPGLNGMDLAKEVTTAEAYSWTQGSWTLTGGLPEAKKEDELPFHVVAYDFGAKRNILRMLVDRGCRLTIVPAQTSAEDVLKMNPDGIFLSNGPGDPAPCDYAITAIQKFLETDIPVFGICLGHQLLALASGAKTVKMKFGHHGGNHPVKDVEKNVVMITAQNHGFAVDEATLPANLRVTHKSLFDGTLQGIHRTDKPAFSFQGHPEASPGPHDAAPLFDHFIELIEQYRKTAK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP029122_7 | 7.1|3110451|40|CP029122|CRISPRCasFinder | 3110451-3110490 | 40 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 47951-47990 | 0 | 1.0 |
CP029122_11 | 11.1|4123679|42|CP029122|PILER-CR | 4123679-4123720 | 42 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 141085-141126 | 0 | 1.0 |
CP029122_11 | 11.2|4123738|40|CP029122|PILER-CR | 4123738-4123777 | 40 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 141028-141067 | 1 | 0.975 |
CP029122_6 | 6.1|2457209|38|CP029122|CRISPRCasFinder | 2457209-2457246 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 2 | 0.947 |
CP029122_9 | 9.1|3910518|48|CP029122|CRISPRCasFinder | 3910518-3910565 | 48 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 4089-4136 | 3 | 0.938 |
CP029122_9 | 9.1|3910518|48|CP029122|CRISPRCasFinder | 3910518-3910565 | 48 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 4088-4135 | 3 | 0.938 |
CP029122_9 | 9.1|3910518|48|CP029122|CRISPRCasFinder | 3910518-3910565 | 48 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 4088-4135 | 3 | 0.938 |
CP029122_9 | 9.1|3910518|48|CP029122|CRISPRCasFinder | 3910518-3910565 | 48 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 4088-4135 | 3 | 0.938 |
CP029122_1 | 1.1|892312|42|CP029122|CRISPRCasFinder | 892312-892353 | 42 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 30214-30255 | 7 | 0.833 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MG299151 | Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_KY471628 | Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence | 45716-45747 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MG299131 | Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_KY471629 | Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence | 45716-45747 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MG299133 | Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MG299128 | Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MG299147 | Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NC_018995 | Escherichia coli plasmid pHUSEC41-1, complete sequence | 29015-29046 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_CP053235 | Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence | 78292-78323 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_CP005999 | Escherichia coli B7A plasmid pEB1, complete sequence | 39563-39594 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | KU932021 | Escherichia coli plasmid pEC3I, complete sequence | 51902-51933 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_CP024154 | Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence | 18560-18591 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NC_011754 | Escherichia coli ED1a plasmid pECOED, complete sequence | 49240-49271 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_CP015141 | Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence | 81434-81465 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_LR213460 | Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3 | 28916-28947 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MH287044 | Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence | 36182-36213 | 7 | 0.781 |
CP029122_3 | 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310998-1311029 | 32 | NZ_MH618673 | Escherichia coli strain 838B plasmid p838B-R, complete sequence | 32230-32261 | 7 | 0.781 |
CP029122_4 | 4.1|1333534|31|CP029122|CRISPRCasFinder | 1333534-1333564 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62712 | 7 | 0.774 |
CP029122_4 | 4.1|1333534|31|CP029122|CRISPRCasFinder | 1333534-1333564 | 31 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222136 | 7 | 0.774 |
CP029122_4 | 4.1|1333534|31|CP029122|CRISPRCasFinder | 1333534-1333564 | 31 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467672-2467702 | 7 | 0.774 |
CP029122_4 | 4.4|1333717|31|CP029122|CRISPRCasFinder | 1333717-1333747 | 31 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18007 | 7 | 0.774 |
CP029122_4 | 4.7|1333900|31|CP029122|CRISPRCasFinder | 1333900-1333930 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530641-530671 | 7 | 0.774 |
CP029122_1 | 1.1|892312|42|CP029122|CRISPRCasFinder | 892312-892353 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24899-24940 | 8 | 0.81 |
CP029122_3 | 3.6|1310937|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310937-1310968 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1417960-1417991 | 8 | 0.75 |
CP029122_4 | 4.4|1333717|31|CP029122|CRISPRCasFinder | 1333717-1333747 | 31 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97498-97528 | 8 | 0.742 |
CP029122_4 | 4.7|1333900|31|CP029122|CRISPRCasFinder | 1333900-1333930 | 31 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14983 | 8 | 0.742 |
CP029122_4 | 4.7|1333900|31|CP029122|CRISPRCasFinder | 1333900-1333930 | 31 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15013 | 8 | 0.742 |
CP029122_4 | 4.7|1333900|31|CP029122|CRISPRCasFinder | 1333900-1333930 | 31 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3484 | 8 | 0.742 |
CP029122_4 | 4.7|1333900|31|CP029122|CRISPRCasFinder | 1333900-1333930 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148992-149022 | 8 | 0.742 |
CP029122_4 | 4.10|1333533|33|CP029122|PILER-CR | 1333533-1333565 | 33 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62681-62713 | 8 | 0.758 |
CP029122_4 | 4.16|1333899|33|CP029122|PILER-CR | 1333899-1333931 | 33 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530640-530672 | 8 | 0.758 |
CP029122_4 | 4.19|1333534|32|CP029122|CRT | 1333534-1333565 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62713 | 8 | 0.75 |
CP029122_4 | 4.19|1333534|32|CP029122|CRT | 1333534-1333565 | 32 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222137 | 8 | 0.75 |
CP029122_4 | 4.19|1333534|32|CP029122|CRT | 1333534-1333565 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467671-2467702 | 8 | 0.75 |
CP029122_4 | 4.19|1333534|32|CP029122|CRT | 1333534-1333565 | 32 | NC_008759 | Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence | 12670-12701 | 8 | 0.75 |
CP029122_4 | 4.22|1333717|32|CP029122|CRT | 1333717-1333748 | 32 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18008 | 8 | 0.75 |
CP029122_4 | 4.22|1333717|32|CP029122|CRT | 1333717-1333748 | 32 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97497-97528 | 8 | 0.75 |
CP029122_4 | 4.25|1333900|32|CP029122|CRT | 1333900-1333931 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148991-149022 | 8 | 0.75 |
CP029122_4 | 4.25|1333900|32|CP029122|CRT | 1333900-1333931 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530640-530671 | 8 | 0.75 |
CP029122_4 | 4.26|1333961|32|CP029122|CRT | 1333961-1333992 | 32 | NZ_CP006991 | Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence | 532343-532374 | 8 | 0.75 |
CP029122_1 | 1.1|892312|42|CP029122|CRISPRCasFinder | 892312-892353 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24786-24827 | 9 | 0.786 |
CP029122_4 | 4.1|1333534|31|CP029122|CRISPRCasFinder | 1333534-1333564 | 31 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86182-86212 | 9 | 0.71 |
CP029122_4 | 4.2|1333595|31|CP029122|CRISPRCasFinder | 1333595-1333625 | 31 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244716 | 9 | 0.71 |
CP029122_4 | 4.2|1333595|31|CP029122|CRISPRCasFinder | 1333595-1333625 | 31 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78566 | 9 | 0.71 |
CP029122_4 | 4.4|1333717|31|CP029122|CRISPRCasFinder | 1333717-1333747 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405905 | 9 | 0.71 |
CP029122_4 | 4.4|1333717|31|CP029122|CRISPRCasFinder | 1333717-1333747 | 31 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248363-2248393 | 9 | 0.71 |
CP029122_4 | 4.8|1333961|31|CP029122|CRISPRCasFinder | 1333961-1333991 | 31 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35770 | 9 | 0.71 |
CP029122_4 | 4.10|1333533|33|CP029122|PILER-CR | 1333533-1333565 | 33 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86181-86213 | 9 | 0.727 |
CP029122_4 | 4.13|1333716|33|CP029122|PILER-CR | 1333716-1333748 | 33 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17976-18008 | 9 | 0.727 |
CP029122_4 | 4.22|1333717|32|CP029122|CRT | 1333717-1333748 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405906 | 9 | 0.719 |
CP029122_4 | 4.25|1333900|32|CP029122|CRT | 1333900-1333931 | 32 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14984 | 9 | 0.719 |
CP029122_4 | 4.25|1333900|32|CP029122|CRT | 1333900-1333931 | 32 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15014 | 9 | 0.719 |
CP029122_4 | 4.25|1333900|32|CP029122|CRT | 1333900-1333931 | 32 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3485 | 9 | 0.719 |
CP029122_4 | 4.26|1333961|32|CP029122|CRT | 1333961-1333992 | 32 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35771 | 9 | 0.719 |
CP029122_3 | 3.1|1310632|32|CP029122|PILER-CR,CRISPRCasFinder,CRT | 1310632-1310663 | 32 | NZ_CP030933 | Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence | 51062-51093 | 10 | 0.688 |
CP029122_4 | 4.11|1333594|33|CP029122|PILER-CR | 1333594-1333626 | 33 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78535-78567 | 10 | 0.697 |
CP029122_4 | 4.16|1333899|33|CP029122|PILER-CR | 1333899-1333931 | 33 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14952-14984 | 10 | 0.697 |
CP029122_4 | 4.16|1333899|33|CP029122|PILER-CR | 1333899-1333931 | 33 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14982-15014 | 10 | 0.697 |
CP029122_4 | 4.17|1333960|33|CP029122|PILER-CR | 1333960-1333992 | 33 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35739-35771 | 10 | 0.697 |
CP029122_4 | 4.19|1333534|32|CP029122|CRT | 1333534-1333565 | 32 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86181-86212 | 10 | 0.688 |
CP029122_4 | 4.20|1333595|32|CP029122|CRT | 1333595-1333626 | 32 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244717 | 10 | 0.688 |
CP029122_4 | 4.20|1333595|32|CP029122|CRT | 1333595-1333626 | 32 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78567 | 10 | 0.688 |
CP029122_4 | 4.22|1333717|32|CP029122|CRT | 1333717-1333748 | 32 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248362-2248393 | 10 | 0.688 |
1. spacer 7.1|3110451|40|CP029122|CRISPRCasFinder matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: , mismatch: 0, identity: 1.0
gcgctgcgggtcattcttgaaattacccccgctgtgctgt CRISPR spacer gcgctgcgggtcattcttgaaattacccccgctgtgctgt Protospacer ****************************************
2. spacer 11.1|4123679|42|CP029122|PILER-CR matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 0, identity: 1.0
tgtcacacgcagataaatccaactttcaatattgttaagttc CRISPR spacer tgtcacacgcagataaatccaactttcaatattgttaagttc Protospacer ******************************************
3. spacer 11.2|4123738|40|CP029122|PILER-CR matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 1, identity: 0.975
catggcgtagcaaaaagaaattttcaatattgctttatgg CRISPR spacer catggcgtagaaaaaagaaattttcaatattgctttatgg Protospacer ********** *****************************
4. spacer 6.1|2457209|38|CP029122|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 2, identity: 0.947
cggacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *.*******.****************************
5. spacer 9.1|3910518|48|CP029122|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
6. spacer 9.1|3910518|48|CP029122|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
7. spacer 9.1|3910518|48|CP029122|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
8. spacer 9.1|3910518|48|CP029122|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
9. spacer 1.1|892312|42|CP029122|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 7, identity: 0.833
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer acaaatgccggatgcggcgtaaacgccttatctggcctacgc Protospacer ***. *.****************.*********.******.
10. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299151 (Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
11. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY471628 (Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
12. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299131 (Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
13. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY471629 (Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
14. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299133 (Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
15. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299128 (Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
16. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299147 (Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
17. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NC_018995 (Escherichia coli plasmid pHUSEC41-1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
18. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP053235 (Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
19. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP005999 (Escherichia coli B7A plasmid pEB1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
20. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to KU932021 (Escherichia coli plasmid pEC3I, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
21. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024154 (Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
22. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NC_011754 (Escherichia coli ED1a plasmid pECOED, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
23. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015141 (Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
24. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LR213460 (Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
25. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH287044 (Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
26. spacer 3.7|1310998|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH618673 (Escherichia coli strain 838B plasmid p838B-R, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
27. spacer 4.1|1333534|31|CP029122|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer tccctatcgcaatgccggcagcatccgcaat Protospacer *. *. ****** **** ************
28. spacer 4.1|1333534|31|CP029122|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
29. spacer 4.1|1333534|31|CP029122|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
30. spacer 4.4|1333717|31|CP029122|CRISPRCasFinder matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.774
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer agcgtcaccgacgcgcagggccgctaccaac Protospacer **************** * *******.
31. spacer 4.7|1333900|31|CP029122|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ccgaacaggtggcgaagcaggtgatgggcca Protospacer ******.* **************.. ***
32. spacer 1.1|892312|42|CP029122|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 8, identity: 0.81
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer attgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer *. * ******************.*******.*******.
33. spacer 3.6|1310937|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
tcaacgcgctcagacgttgcgtgagtgaacca CRISPR spacer acaacgcggtcggacgttgcgtgattaccccg Protospacer ******* **.************ *. **.
34. spacer 4.4|1333717|31|CP029122|CRISPRCasFinder matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttca Protospacer ***************** ***** *. ..
35. spacer 4.7|1333900|31|CP029122|CRISPRCasFinder matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
36. spacer 4.7|1333900|31|CP029122|CRISPRCasFinder matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
37. spacer 4.7|1333900|31|CP029122|CRISPRCasFinder matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ttgcgcagctggcgcagcaggtggctgccga Protospacer ..* .*.******* ************ **
38. spacer 4.7|1333900|31|CP029122|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer gggtacggctggcgaaggaggcggctgcgga Protospacer * ************* ***.***** *
39. spacer 4.10|1333533|33|CP029122|PILER-CR matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.758
gttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer gtccctatcgcaatgccggcagcatccgcaatc Protospacer **. *. ****** **** ************.
40. spacer 4.16|1333899|33|CP029122|PILER-CR matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.758
gccgaacggctggcgaagcaggtggctggcgta CRISPR spacer gccgaacaggtggcgaagcaggtgatgggccag Protospacer *******.* **************.. *** .
41. spacer 4.19|1333534|32|CP029122|CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer tccctatcgcaatgccggcagcatccgcaatc Protospacer *. *. ****** **** ************.
42. spacer 4.19|1333534|32|CP029122|CRT matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
43. spacer 4.19|1333534|32|CP029122|CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
44. spacer 4.19|1333534|32|CP029122|CRT matches to NC_008759 (Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcg-----caattccgggagcatccgcaatt CRISPR spacer -----cgtgaaactcatttccgggagcatccgcattt Protospacer **.* ** ***************** **
45. spacer 4.22|1333717|32|CP029122|CRT matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer agcgtcaccgacgcgcagggccgctaccaact Protospacer **************** * *******.
46. spacer 4.22|1333717|32|CP029122|CRT matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttcaa Protospacer ***************** ***** *. ..*
47. spacer 4.25|1333900|32|CP029122|CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer gggtacggctggcgaaggaggcggctgcggaa Protospacer * ************* ***.***** * *
48. spacer 4.25|1333900|32|CP029122|CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ccgaacaggtggcgaagcaggtgatgggccag Protospacer ******.* **************.. *** .
49. spacer 4.26|1333961|32|CP029122|CRT matches to NZ_CP006991 (Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence) position: , mismatch: 8, identity: 0.75
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer catcatcctcccgcagatgcgctggccgatcc Protospacer *.*.* .******** ******** *****
50. spacer 1.1|892312|42|CP029122|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 9, identity: 0.786
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer gttgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer .. * ******************.*******.*******.
51. spacer 4.1|1333534|31|CP029122|CRISPRCasFinder matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 9, identity: 0.71
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer gctaccgcgcaattcgaggagcatccgctgg Protospacer . *********** .*********** .
52. spacer 4.2|1333595|31|CP029122|CRISPRCasFinder matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer tgaggcaaaatatagattgatttccgaaaat Protospacer .*.********* ******** ****
53. spacer 4.2|1333595|31|CP029122|CRISPRCasFinder matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer acggaaaaattatatattgattttacttctg Protospacer ***** *** ************* .*.
54. spacer 4.4|1333717|31|CP029122|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttca Protospacer ******.********** ***** *. ..
55. spacer 4.4|1333717|31|CP029122|CRISPRCasFinder matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 9, identity: 0.71
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacatcaccgacgcccagtggcgcgacgtcc Protospacer *.********** ********* ** .
56. spacer 4.8|1333961|31|CP029122|CRISPRCasFinder matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.71
gtttaccgccccgcagaggcgctggcagatc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcga Protospacer ******.*** *************
57. spacer 4.10|1333533|33|CP029122|PILER-CR matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 9, identity: 0.727
-gttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer cgcta-ccgcgcaattcgaggagcatccgctggg Protospacer *.*. *********** .*********** .
58. spacer 4.13|1333716|33|CP029122|PILER-CR matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
gcccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer cagcgtcaccgacgcgcagggccgctaccaact Protospacer **************** * *******.
59. spacer 4.22|1333717|32|CP029122|CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttcaa Protospacer ******.********** ***** *. ..*
60. spacer 4.25|1333900|32|CP029122|CRT matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
61. spacer 4.25|1333900|32|CP029122|CRT matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
62. spacer 4.25|1333900|32|CP029122|CRT matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ttgcgcagctggcgcagcaggtggctgccgag Protospacer ..* .*.******* ************ ** .
63. spacer 4.26|1333961|32|CP029122|CRT matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.719
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcgac Protospacer ******.*** ************* *
64. spacer 3.1|1310632|32|CP029122|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030933 (Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence) position: , mismatch: 10, identity: 0.688
tccacgctgtaacggccatcattaagtttagt CRISPR spacer ccgctgctgtgacgcccatcattaagttactc Protospacer .* .*****.*** ************* .
65. spacer 4.11|1333594|33|CP029122|PILER-CR matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 10, identity: 0.697
gacggacaaaatatatattgatttgcgaattat CRISPR spacer gacggaaaaattatatattgattttacttctgg Protospacer ****** *** ************* .*.
66. spacer 4.16|1333899|33|CP029122|PILER-CR matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 10, identity: 0.697
gccgaacggctggcgaagcaggtggctggcgta CRISPR spacer cagcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
67. spacer 4.16|1333899|33|CP029122|PILER-CR matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 10, identity: 0.697
gccgaacggctggcgaagcaggtggctggcgta CRISPR spacer cagcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
68. spacer 4.17|1333960|33|CP029122|PILER-CR matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 10, identity: 0.697
ggtttaccgccccgcagaggcgctggcagatcc CRISPR spacer ccgagaccgcctcgccgaggcgctggcagcgac Protospacer ******.*** ************* *
69. spacer 4.19|1333534|32|CP029122|CRT matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 10, identity: 0.688
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer gctaccgcgcaattcgaggagcatccgctggg Protospacer . *********** .*********** .
70. spacer 4.20|1333595|32|CP029122|CRT matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer tgaggcaaaatatagattgatttccgaaaata Protospacer .*.********* ******** ****
71. spacer 4.20|1333595|32|CP029122|CRT matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer acggaaaaattatatattgattttacttctgg Protospacer ***** *** ************* .*.
72. spacer 4.22|1333717|32|CP029122|CRT matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 10, identity: 0.688
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacatcaccgacgcccagtggcgcgacgtccc Protospacer *.********** ********* ** .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1341507 : 1354689
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP029122|1341507:1354689|DBSCAN-SWA TATGCGCATATTGCTGAGTAATGATGACGGGGTACATGCACCCGGTATACAAACGCTGGCGAAAGCCTTGCGTGAGTTTGCTGACGTTCAGGTGGTCGCCCCCGATCGTAACCGCAGCGGCGCTTCAAATTCTCTGACACTGGAATCCTCCCTGCGCACGTTTACCTTTGAAAATGGTGATATTGCTGTGCAAATGGGAACCCCGACCGATTGCGTCTATCTTGGCGTGAATGCTCTGATGCGTCCGCGCCCGGACATTGTTGTGTCCGGAATTAACGCCGGGCCGAATCTGGGGGATGATGTTATTTATTCCGGTACGGTAGCCGCCGCGATGGAAGGCCGTCATTTAGGTTTTCCGGCGCTTGCCGTCTCGCTTGACGGGCATAAACATTACGACACTGCCGCGGCGGTAATCTGTTCAATTTTGCGCGCACTGTGTAAAGAGCCGCTGCGCACCGGGCGTATTCTTAATATTAACGTTCCGGATTTACCCTTGGATCAAATCAAAGGTATTCGCGTGACGCGCTGCGGTACACGACATCCGGCAGATCAGGTGATCCCGCAGCAAGATCCGCGCGGCAATACGCTGTACTGGATTGGCCCGCCGGGCGGTAAATGTGATGCTGGTCCGGGGACCGATTTTGCTGCGGTAGATGAGGGCTATGTCTCCATCACGCCGCTGCATGTGGATTTAACTGCGCATAGCGCGCAAGATGTGGTTTCAGACTGGTTAAACAGCGTGGGAGTTGGCACGCAATGGTAAGCAGACGCGTACAAGCACTTCTGGATCAATTACGTGCGCAAGGTATTCAGGATGAGCAGGTGCTGAATGCACTTGCCGCCGTGCCGCGTGAAAAATTCGTTGATGAAGCGTTTGAACAAAAAGCCTGGGACAATATCGCTTTGCCGATAGGTCAGGGGCAGACAATTTCGCAGCCATATATGGTGGCGCGAATGACCGAATTACTCGAGCTGACGCCGCAGTCGCGGGTGCTGGAAATTGGCACCGGTTCGGGATATCAAACGGCAATCCTGGCGCATCTTGTCCAGCATGTTTGCTCGGTTGAACGGATTAAAGGCTTGCAGTGGCAGGCACGTCGCCGCCTGAAAAATCTTGATTTACATAATGTTTCAACCCGTCATGGCGATGGATGGCAAGGTTGGCAGGCACGTGCGCCGTTTGACGCTATCATTGTTACGGCGGCACCGCCGGAAATTCCAACTGCGCTAATGACGCAGCTGGACGAAGGCGGGATTCTCGTCTTACCCGTAGGGGAGGAGCACCAGTATTTGAAACGGGTGCGTCGTCGGGGAGGCGAATTTATTATCGATACCGTGGAGGCCGTGCGCTTTGTCCCTTTAGTGAAGGGTGAGCTGGCTTAAAACGTGAGGAAATACCTGGATTTTTCCTGGTTATTTTGCCGCAGGTCAGCGTATCGTGAACATCTTTTCCAGTGTTCAGTAGGGTGCCTTGCACGGTAATTATGTCACTGGTTATTAACCAATTTTTCCTGGGGGATAAATGAGCGCGGGAAGCCCAAAATTCACCGTTCGCCGCATTGCGGCTTTGTCACTGGTTTCGCTATGGCTGGCAGGCTGTTCTGACACTTCAAATCCACCGGCACCGGTCAGCTCCGTTAATGGCAATGCGCCTGCAAATACTAATTCTGGTATGTTGATTACGCCGCCGCCGAAAATGGGGACGACGTCTACAGCGCAGCAACCGCAAATTCAGCCGGTGCAGCAGCCACAAATTCAGGCTACTCAACAACCGCAAATCCAGCCAGTGCAGCCAGTAGCTCAGCAGCCGGTACAGATGGAAAACGGACGCATCGTCTATAACCGTCAGTATGGGAACATTCCGAAAGGCAGTTATAGCGGCAGTACCTATACCGTGAAAAAAGGCGACACACTTTTCTATATCGCCTGGATTACTGGCAACGATTTCCGTGACCTTGCTCAGCGCAACAATATTCAGGCACCATACGCGCTGAACGTTGGTCAGACCTTGCAGGTGGGTAATGCTTCCGGTACGCCAATCACTGGCGGAAATGCCATTACCCAGGCCGACGCAGCAGAGCAAGGAGTTGTGATCAAGCCTGCACAAAATTCCACCGTTGCTGTTGCGTCGCAACCGACAATTACGTATTCTGAGTCTTCGGGTGAACAGAGTGCTAACAAAATGTTGCCGAACAACAAGCCAACTGCGACCACGGTCACAGCGCCTGTAACGGTACCAACAGCAAGCACAACCGAGCCGACTGTCAGCAGTACATCAACCAGTACGCCTATCTCCACCTGGCGCTGGCCGACTGAGGGCAAAGTGATCGAAACCTTTGGCGCTTCTGAGGGGGGCAACAAGGGGATTGATATCGCAGGCAGCAAAGGACAGGCAATTATCGCGACCGCAGATGGCCGCGTTGTTTATGCTGGTAACGCGCTGCGCGGCTACGGTAATCTGATTATCATCAAACATAATGATGATTACCTGAGTGCCTACGCCCATAACGACACAATGCTGGTCCGGGAACAACAAGAAGTTAAGGCGGGGCAAAAAATAGCGACCATGGGTAGCACCGGAACCAGTTCAACACGCTTGCATTTTGAAATTCGTTACAAGGGGAAATCCGTAAACCCGCTGCGTTATTTGCCGCAGCGATAAATCGACGGAACCAGGCTTTTGCTTGAATGTTCCGTCAAGGGATCACGGGTAGGAGCCACCTTATGAGTCAGAATACGCTGAAAGTTCATGATTTAAATGAAGATGCGGAATTTGATGAGAACGGAGTTGAGGTTTTTGACGAAAAGGCCTTAGTAGAAGAGGAACCCAGTGATAACGATTTGGCCGAAGAGGAACTGTTATCGCAGGGAGCCACACAGCGTGTGTTGGACGCGACTCAGCTTTACCTTGGTGAGATTGGTTATTCACCACTGTTAACGGCCGAAGAAGAAGTTTATTTTGCGCGTCGCGCACTGCGTGGAGATGTCGCCTCTCGCCGCCGGATGATCGAGAGTAACTTGCGTCTGGTGGTAAAAATTGCCCGCCGTTATGGCAATCGTGGTCTGGCGTTGCTGGACCTTATCGAAGAGGGCAACCTGGGGCGATCCGCGCGGTAGAGAAGTTTGACCCGGAACGTGGTTTCCGCTTCTCAACATACGCAACCTGGTGGATTCGCCAGACGATTGAACGGGCGATTATGAACCAAACCCGTACTATTCGTTTGCCGATTCACATCGTAAAGGAGCTGAACGTTTACCTGCGAACCGCACGTGAGTTGTCCCATAAGCTGGACCATGAACCAAGTGCGGAAGAGATCGCAGAGCAACTGGATAAGCCAGTTGATGACGTCAGCCGTATGCTTCGTCTTAACGAGCGCATTACCTCGGTAGACACCCCGCTGGGTGGTGATTCCGAAAAAGCGTTGCTGGACATCCTGGCCGATGAAAAAGAGAACGGTCCGGAAGATACCACGCAAGATGACGATATGAAGCAGAGCATCGTCAAATGGCTGTTCGAGCTGAACGCCAAACAGCGTGAAGTGCTGGCACGTCGATTCGGTTTGCTGGGGTACGAAGCGGCAACACTGGAAGATGTAGGTCGTGAAATTGGCCTCACCCGTGAACGTGTTCGCCAGATTCAGGTTGAAGGCCTGCGCCGTTTGCGTGAAATCCTGCAAACGCAGGGGCTGAATATCGAAGCGCTGTTCCGCGAGTAAGTAAGCATCTGTCAGAAAGGCCAGTCTCAAGCGAGGCTGGCCTTTTCTGTGCACAATAAAAGGTCCGATGCCCATCGGACCTTTTTATTAAGGTCAAATTACCGCCCATACGCACCAGGTAATTAAGAATCCGGTAAAACCGAGAATGGTCGTTAACACTGTCCAGGTTTTCAGACCGTCTGCTACCGACAACCCCAGATATTTGGTCACAATCCAGAACCCTGAGTCATTAATATGTGACGCGCCAAGCCCACCAAAGCAGGCTGCCAGCGTCACCAATACGCACTGAATCGGATTCAATCCCATCACCGCTTCTGAGAGTAACCCGCCGGTTGTCAGGATTGCTACGGTTGCTGACCCCTGCGATGCACGCAGAGCCAGTGAAATAATAAATGCGGCTGGTAACAGAGGCAGGTCAATCATTTGTAGCATATTGGCAAGGGCTTTGCCGACGCCCGATTCCACCAGCACTTTGCCAAATACCCCTCCAGCACCAGTAACCAAAATCACTACCGCCGCAGTGGGAAGGGCTGAGCCCATAATGTCGCTGGTGTGTTGTAAGCTCCAGCCGCGACGTAAAGCCAATAACCAGAATGCCAGCACCAGCGCAATCATTAGAGCTACCATTGGTGAGCCGATCAGCTGTAGCGTACCAAGCAGGGGATGCGAAGGCGGCATCAGTGTTGCGGAAACCGTACCCGCCATGATAATCGCGATAGGAATAACAATTAGCGAGGTGACCAGCGCGACGCCCGGTGGATTTATTTTATCGCTTAATTTTGTCGCGCCTTCCTCACTGGCCGGAGCCAGTTGCATCTGTTCCAGTACTTCCACCGACATCGCATATTGGTGCTTATTGATTATTTTCGCTGCAAAGTAGCCAACAATCCCTACGGGAATAGAAATCGCAATACCGATGATGGTTAGCCAGCCGATGTCTGCGTGGAGTAACCCCGCTGCGGCGACAGGGCCTGGATGCGGCGGTACCGCCACATGTACAGTTAGCATGATCCCAGCGACAGGCAGGCCAAATTTGAGTGGCGATATTTTGGCAACCTTGGCAAAACCGTAAATGATTGGCGCAAGAATAATAAAGCCGACATCAAAGAAGACGGGAATACCGAGGAAGAACGCTGCCAGAGTCAGCGCAGCGATAGTTCGTTTGTCACCTAACTTGCGACTGAAATAATTAGCCAGTGACTCTGCACCACCAGAGTGTTCGATCATACGCCCCAGCATAGCGCCCAGACCAATAATAATAGTGACGGAACCAAGCACACCGCCCATCCCGGCGATCATCACTTTACCCACTTCGCCCGCCGGTATACCCGCCGCAAGTGCGACTAACAGGCTGACGAGGAGCAGAGCAACGAATGGTTGTACCTTTGCCTTGATGACCAGCAGCAACAGCATGATTACGCCAGTTAACGCAATGCATAACAATGTAATTGTGGACATGGGAAACCCTGTCTGAAAGTTATAGTTAACCTACCCCATCCGTAGATGGGGGGATGTATGGGTACGTTGTAATTAGGGATTTAACGAATTAGCGCCAGGCGTCAAACCAGCCAAGCCCTTCTTCGGTGAGGCCACGAGGTTTATATTCACAACCGATCCAGCCCTGATATCCCACCTCATCGAACAGGCGGAACAGCCACGGATAGTTGATTTCTCCATCGTCCGGTTCATGTCGATCAGGTAGTCCGGCAATTTGTACGTGCGCATATTTCCCGGCGTAGTCGCGGATTAAATGCGTCAGGTTGCCATCTACTTTTTGCGCATGAAAAGTATCTAGTTGAATAAACACGTTATCTCGCGCAACCTCTTCAACAATAGCCAGTGCCTGATACTGGCTGGAGAAGAGATAATGAGGCTTAACGTCGGGGCTGAGTGCTTCAACTAATATTCGCTTGCCGTGTAGCGCAAAGCGGTCGGCAGCGTAGCGGAGATTATCGATAAATACTGCCCGGTACCGTTCAGCGTCTTCGCCAGCGGGCACGACGCCTGCCATCACATGGACTTGTTCACAATTGAGCGCCAATGCATATTCCAGTGCCAGGTCGATGTCTGCGTGTGCTTCGTGCTCACGTCCGGGAAGGGCGGATAATCCCCATTCCCCCGCATTAATATCTCCGGGAGCGGTATTGAACAGCGCCAGTGTCAGATGGTTTTGCTCCAGTTGCTTTTGGATTTGCAGGGTGGAGTAGTCATAGGGAAACAGAAATTCCACAGCATCGAACCCGGCTTTTCGCGCTGCGGCGAAGCGTTCAATAAAAGGCACTTCGGTGAACATCATGGATAAATTAGCTGCAAAACGAGGCATTGCATTAACTCCTTAATTCCGCAATTTCACCTGCGGTCAGATAACGGATCGGGCGGTCACCGAGAATAAAAATCAGCTTTGCCGTTTCCTCCAGCTCTTCCATATTGTTGGCGGCTTCTTGCAGGCTTTCACCGCAAACCACTGGGCCATGATTTGCCAGTAAAAAAGCCTGATTGTCTGCTGCCAGTTCCGCCAGATCCTGTGCGATGCGTTTATCGCCCGGTCGGTAATAAGGCACCAGCGGGACATTTCCCATCCGCATCACCACGTATGGTGTGAACGGACGAATAACGTTGCTGCTGTCCAGCCCTTGCAGGCAGGAAAGCGCCGTCGACCATGTGCTGTGCAAATGCACCACCGCTTTACAGCGCGGATTGTTGCGATACAGCGCCAGATGAAAGAGCACCTCTTTCGAGGGTTTGTCACCACTTAACCATTCGCCATCCGCGGCGACTTTGGAAAGCCGCTGCGGATCGAGATTGCCCAGGCATGAACCTGTCGGTGTCGCCAGTAAATTCCCGTCAGGTAAAAGCAGCGACAGATTGCCAGCCGAACCGGTTGCATAGCCGCGCTGAAAGAATGAACTGGCAATCCGCGTCATCTCCTCTCGCAAAGACTGCTCTACTTTTGCGAAATCGCTCATGATAAAAACTCTCTTTGGGCTCGTGAAAAAAAGGCTTCATCACCGAAGTTGCCAGATTTAAGGGCGAGTGAGACAGGCTTATCCAGTGCGTTTACCCACGGCACGCCGGGGGAAATGGTTGGGCCAATATGAAACCCTTTTATGCCCAGGCTCTGTGTGACTACGCCGGAGGTCTCACCGCCTGCGACAATAAAGCGTGTCACGCCTTCCGCTGCTAACCGCGCCGCTAGTTGAGAAAACAGAGTTTCTACTGCCTGACTGGCTTTTTGTGCACCGTATTGCTGTTGAATTGCTGCCAATGCGTCAGTGCTGGCGGTGGCAAAAACCAGTGGAGCAAGTACACTTTCCTGGCCCAGAACCCACTCTGCCAGTTCGTGTGCATAAGCGGCCAGAGTTTCGGTTGAGAGGCAGCGTGCCACATCAACTTCACGGGCTGGTGCAATTTGACGGTAATGTGCCACCTGGCGGTTGGTCATTTGAGAGCATGAACCGGAGAGCACTACGCCGCGCCCAGCGAGCGGATGCCCTGCTTCGCGAGCCTGGTTACCGTTTTCTTGCGCCCACTGCCGGGCCAGGCCAATCGCCAGACCAGAACCGCCCGTTACCAGTGGGGCATCGCGCAAGGCTTCTCCCTGAATTTCCAGATGGTGTTCGGTCAGCGCATCAAGCACCGCGTAGCGGTAGCCCTCTTGCTGTAAGCGAGCCAGCTCTTGACGAACGGCATCCACACCTTGTTCGAAAACATGTGCCGAAACGACGCCGCAGCGCCCTGTGGATTGCGCTTCAACCAGACGGGGAAGATAGCTGTCGGTCATGGGATTTACCGGGTGATGGCGCATCCCGGATTCGGCCAGCAGTTGATTCATTACGAACAAATACCCCTGATAAACCGTACGTCCGTTGACCGGCAGGGCCGGAGAGAAGACCGTAAACGGCGTGTCGAGAGCATCCATTAATGCATCGGTAACCGGGCCGATATTACCTTTCGCCGTACTGTCGAAAGTAGAGCAGTATTTGAAATAGATCTGTTTGCAACCTTGCTGTTGCAACCAGCTCAGAGCCGCCAGCGATTGCTGTGTGGCTTCAACCACCGGACAGGAGCGCGTTTTCAGGCTGATCACCAGTGCGTCGATTGCTTCCGGCATTTTACCTGTTGGAACACCGTTAATTTGTACCGTTGGTAGACCGTTTTCCACCAGAAAACTGGCGATATCCGTCGCGCCGGTAAAATCATCGGCGATAACGCCAATCTTGATCATGATTTCGCTCCCGGTAGAGTGATGCCAGAGAAAATCTTGATAACTGCGCTATCGTCTTCTTTCCCGTAACCTGCGTTACTGGCGCTGGTGAACATATTCAATGCTGTTGAGGCCAATGGCAGCGGGAAGTGCAGGGCTTTGGCTGTATCGGCAACCAGACCAAGATCCTTAACAAAAATATCGACGGCTGAATGCGGGGTGTAATCGCCATCCACCACATGACGCATCCGGTTTTCGAACATCCAGGAATTTCCGGCGGCATTGGTCACGACGTCATACATCACATCCAGCGGGATCCCCGCACGGGCTGCAAGTGCCATCGCTTCGGCTCCGGCAGCAATATGTACGCCCGCTAACAACTGGTGAATAATTTTTACGGTCGAACCTAGTCCCGGTTCTGCACCTATGCGATAAACTTTTCCGGCAACGGCTTCCAGCACGGGTGCCAGTCGTTCAAAGGCAATATCGCTACCGGAGGCCATGACAGTCATTTCACCGTTAGCGGCTTTTACTGCACCACCAGAAACTGGCGCATCCAGCATTTCCAGATCGAATCCAGCCAGAGCGGTAGCAATTTCTTGCGCATCAGCACTAGCGATAGTGGAAGAAACCATTACTGCCGTACCGGGTTTCAGATGTTGTGCAACGCCTGTTTCACCAAACAGCACCTGTTTAACCTGGGCCGCATTGACCACCAGCACCAGCAGAGCGTCCAGTTTTTCGGCAAACGTCGCGGCGTTATCAGAAACCCCGCAAGCACCTGCCTCTTTCAACGTAGCGCAGGCATTGCTGTTCAGGTCTGCGCCCCAGGTAGAAAGACCTGCGCGGACACATGACAGTGCTGCTCCCATTCCCATTGACCCTAAGCCAACGATACCGACATGAAACTCAGATCCCGTTTTCATCTGCTCTCCTTGTTAATTTAAGTGATATTTTGTTTGATATTGTGAATATAAGCGCTGGAAGATAACGATATGGTGAGCTGATTCACATAAATTAACATTGTGTGTTATTTTATGTGAACTAAGCGTTAGTTGCGCCGCGCACGTTTCGCAGGCAAATAGCGTAGAATGTCAGCAGGACAAAGGAAGGAGCAAAAGTTGATACCCGTAGAGCGTCGCCAAATCATCCTTGAGATGGTAGCTGAAAAAGGCATTGTCAGTATTGCTGAACTAACGGACAGAATGAATGTGTCACATATGACCATTCGTCGGGATTTACAAAAACTGGAGCAGCAGGGAGCCGTTGTGCTGGTGTCCGGAGGCGTCCAGTCTCCGGGACGCGTGGCGCATGAACCTTCTCATCAGGTAAAAACTGCGCTGGCAATGACGCAAAAAGCGGCTATTGGCAAGCTGGCGGCAAGTCTTGTTCAGCCGGGAAGCTGTATCTATCTGGATGCGGGAACGACCACGTTAGCGATAGCACAGCATCTGATTCACATGGAGTCACTGACTGTGGTCACAAACGATTTCGTTATTGCGAACTACTTGCTCGACAACAGTAATTGCACAATTATTCACACTGGCGGTGCAGTGTGTCGGGAAAACCGTTCCTGTGTCGGGGAAGCCGCTGCGACCATGCTGCGCAGCCTGATGATTGATCAGGCTTTTATTTCTGCATCGTCATGGAGTGTGCGGGGGATTTCTACGCCAGCGGAAGATAAAGTCACGGTGAAACGGGCGATTGCCAGTGCCAGCCGCCAGCGAGTTTTGGTCTGTGATGCGACGAAATATGGTCAGGTGGCGACATGGCTGGCGTTACCATTAAGCGAGTTTGATCAGATTATCACAGACGACGGTTTGCCGGAGAGTGCCAGTCGCGCGCTGGCGAAGCAGGATCTCTCTTTGCTGGTAGCGAAAAATGAATAATGGCCTGCAATAACATTTGGTTACTCATGCTTCACAGAAGAAGCATGAGACTACTTTATTTTATAAAATGACAGCCGCCCGCTTTTCGGCGTGCCGGTATCAATATAAATCTGGTTAGCGAACGTCTGAATGTTATCAAACATCATATGTCCAAATATAAAATAATCAGCGCCGTTTATTTGTTGTAACTCGCCATTAAGCGATTTCTGCACACGATCAACAGGCCAGAGTAATTCGCTCTCCGCTATTTCTTTACCAAAGAGATATTCACTCCCCGGATAATCTGCATGTGCGATGACATATTTTATGTTGTCGTTAGTGATTTCAATAATATGTGGAAGGTGATGGAATTTCAGCAACAGATCTATTGCCTCTTGTTGCTCTGAATCATTTAAATCGAAAAACCAGTCACCACCGCTGGCAAGCCACATATTGCCATCGCCAGTTTCGAATGCATCCAACGCCATCGCTTCGTGGTTGCCTTTAACCGACGTAAACCAGGGTTGGTTTAGCAGGCGCAGCACGTTAAGACTCTTCGGCCCACGATCAATATTATCGCCTGTGGAAATAAGTAAGTCGGTTTCAGGGTAAAAAGAGAGTTGATGTAAACGGGATTGTAATAACTGATATTCACCATGAATATCACCAACGACCCATATATGGCGATAGTGATGGGCATTGATTTTTTGATAGCGTGTAGATGGCATGGTTTTACCCTGTAAAATAAGCTTTCCTATTATACAGGGTATTTTTATTTGATTCGTCAGTTGTCGTTAATATTCCCGATAGCAAAAGACTATCGGGAATTGTTATTACACCAGACTCTTCAAGCGATAAATCCACTCCAGCGCCTGACGCGGAGTCAGTGAATCCGGGTCGAGATTTTCCAGTGCTTCGACCGCAGGCGAAGTTTCTTCTGGCACTGACAGCAAAGACATCTGCGTACCATCCACTTGCGTCGCGGCGGCGTTCGGCGAAATGCTTTCCAGCTCACGCAGTTTTTGCCGTGCGCGCTTAATCACCTCTTTTGGCACGCCCGCCAGAGCTGCAACCGCCAGGCCGTAGCTTTTGCTCGCCGCGCCATCCTGCACGCTATGCATAAAGGCAATGGTGTCGCCGTGCTCCAGCGCATCGAGATGCACGTTGGCGACGCCTTCCATTTTCTCCGGTAACTGGGTCAGCTCGAAATAGTGGGTAGCAAATAACGTCAATGCCTTAATCTTATTCGCCAGATTTTCCGCGCACGCCCACGCCAGCGACAGACCATCGTAGGTGGACGTTCCACGCCCGATCTCATCCATTAACACCAGACTGTATTCGGTGGCGTTATGTAAAATATTGGCGGTTTCAGTCATCTCCACCATAAAGGTTGAGCGCCCGGAAGCCAGGTCATCTGCCGCGCCTACGCGGGTAAAGATGCGATCGATAGGTCCAATCTCGACTTTTTGTGCCGGTACATAGCTGCCGATGTAGGCCATCAGCGCAATCAGTGCGGTCTGGCGCATATAGGTACTTTTACCGCCCATGTTCGGGCCGGTAATAATCAACATACGGCGCTGCGGCGACAGATTCAGCGGGTTAGCGATAAATGGCTCATTCAGTACTTGTTCAACTACCGGATGGCGACCTTCGGTAATGCGAATGCCCGGTTTATCAATGAAGGTCGGGCAGGTGTAGTTCAGGGTATAGGCCCGTTCCGCCAGGTTAACCAGCACGTCGAGTTCCGCCAGCGCGCTCGCGCTCTGTTGCAACGCTTCCAGATGCGGCAACAGCAGGTCGAACAGCTCTTCATAAAGCTGTTTTTCCAGTGCCAGTGCTTTGCCTTTTGAGGTGAGAACTTTATCTTCGTACTCTTTTAGCTCTGGAATGATGTAGCGCTCGGCGTTTTTCAGCGTCTGGCGACGCATGTAGTTGATGGGTGCCAGATGGCTTTGCCCACGGCTGATTTGAATGTAGTAGCCGTGCACCGCATTAAAGCCAACTTTCAGCGTGTCCAGGCCGGTACGTTCACGCTCGCGGACTTCCAGACGCTCCAGATAATCGGTCGCGCCGTCAGCCAGCGCGCGCCACTCATCCAGCTCTTCGTTATAGCCCGATGCGATAACACCACCGTCGCGTACCAGCACCGGCGGTGTGTCGATGATTGCTCGCTCCAGCAGATCGCGCAGCTCGGCAAACTCGCCCATCTTCTCACGTAGCGCCTGTACCGGTGCACTATCGACAGTTTCTAACTGCGCACGCAGCTCCGGCAGTTGCTGGAAAGCGTGGCGCATACGGGCCAGATCGCGTGGGCGAGCAGTTCGTAAAGCCAGACGTGCCAGAATACGTTCCAGGTCGCCGACCTGACGCAGTACCGGCTGCAACTCGGCGGTGAAATCCTGCAATGCGCCAATAGTTTGCTGGCGCTCAAGCAACACGCGGGTATCGCGCACTGGCATATGCAGCCAGCGTTTCAGCATACGACTACCCATCGGCGTGACGGTGCAGTCGAGCACAGAAGCCAGCGTATTTTCCGCACCGCCCGCCAGGTTCTGAGTAATTTCCAGGTTACGACGTGTCGCGGCATCCATAATGATGCTGTCCTGCTCACGTTCCATGGTGATAGAACGAATATGCGGCAGGGTCGTGCGTTGGGTATCTTTCGCATACTGCAACAGACAACCGGCAGCACAAAGTCCGCGCGGCGCGTTCTCGACGCCAAAACCGACCAGATCGCGGGTGCCAAATTGCAGATTCAACTGCTGGCGCGCGGTGTCGATTTCAAACTCCCACAGCGGGCGACGGCGCAGGCCGCGACGGCCTTCAATCAGCGACATCTCGGCGAAATCTTCTGCATACAGCAGTTCCGCCGGATTAGTGCGTTGCAGCTCTGCCGCCATCGTTTCGCGGTCGGCCGGTTCGCTCAGACGAAAACGCCCGGAGCTGATATCCAGCGTCGCGTAGCCGAAACCTTTGCTGTCCTGCCAGATAGCCGCCAGCAGGTTGTCCTGACGCTCCTGTAACAGGGCTTCATCGCTGATGGTGCCTGGCGTAACGATACGCACAACTTTGCGCTCAACCGGACCTTTGCTGGTCGCCGGATCGCCAATTTGTTCGCAGATGGCAACGGACTCTCCCTGATTCACCAGTTTGGCGAGATAGTTTTCCACCGCATGGTAGGGAATCCCCGCCATCGGGATCGGCTCTCCCGCCGAAGCACCGCGTTTGGTCAGTGAAATATCCAGCAGTTGCGACGCGCGTTTTGCGTCGTCATAAAACAGTTCATAAAAATCACCCATCCGGTAAAACAGCAGGATCTCGGGATGCTGGGCTTTCAGCCTGAGATACTGCTGCATCATGGGCGTATGGGCGTCGAAATTTTCTATTGCACTCAT
Protein sequences of DBSCAN-SWA_1 >CP029122|1341507:1354689|1346768_1347545_-|AWF26366.1|DBSCAN-SWA MPRFAANLSMMFTEVPFIERFAAARKAGFDAVEFLFPYDYSTLQIQKQLEQNHLTLALFNTAPGDINAGEWGLSALPGREHEAHADIDLALEYALALNCEQVHVMAGVVPAGEDAERYRAVFIDNLRYAADRFALHGKRILVEALSPDVKPHYLFSSQYQALAIVEEVARDNVFIQLDTFHAQKVDGNLTHLIRDYAGKYAHVQIAGLPDRHEPDDGEINYPWLFRLFDEVGYQGWIGCEYKPRGLTEEGLGWFDAWR >CP029122|1341507:1354689|1349443_1350352_-|AWF28531.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNAAQVKQVLFGETGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGAEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS >CP029122|1341507:1354689|1351365_1352022_-|AWF28527.1|DBSCAN-SWA MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFYPETDLLISTGDNIDRGPKSLNVLRLLNQPWFTSVKGNHEAMALDAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDNIKYVIAHADYPGSEYLFGKEIAESELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGTPKSGRLSFYKIK >CP029122|1341507:1354689|1350547_1351315_+|AWF24379.1|DBSCAN-SWA MIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGSCIYLDAGTTTLAIAQHLIHMESLTVVTNDFVIANYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALAKQDLSLLVAKNE >CP029122|1341507:1354689|1341507_1342269_+|AWF28097.1|DBSCAN-SWA MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW >CP029122|1341507:1354689|1352127_1354689_-|AWF24777.1|DBSCAN-SWA MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >CP029122|1341507:1354689|1344703_1345222_+|AWF26964.1|DBSCAN-SWA MNQTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILADEKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQGLNIEALFRE >CP029122|1341507:1354689|1347549_1348188_-|AWF25622.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS >CP029122|1341507:1354689|1343028_1344168_+|AWF26807.1|DBSCAN-SWA MSAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMGTTSTAQQPQIQPVQQPQIQATQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSGSTYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVGQTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPTASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRVVYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR >CP029122|1341507:1354689|1344230_1344623_+|AWF24798.1|DBSCAN-SWA MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGRSAR >CP029122|1341507:1354689|1342262_1342889_+|AWF25711.1|DBSCAN-SWA MVSRRVQALLDQLRAQGIQDEQVLNALAAVPREKFVDEAFEQKAWDNIALPIGQGQTISQPYMVARMTELLELTPQSRVLEIGTGSGYQTAILAHLVQHVCSVERIKGLQWQARRRLKNLDLHNVSTRHGDGWQGWQARAPFDAIIVTAAPPEIPTALMTQLDEGGILVLPVGEEHQYLKRVRRRGGEFIIDTVEAVRFVPLVKGELA >CP029122|1341507:1354689|1348184_1349447_-|AWF28154.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQAREAGHPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSTETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPGVPWVNALDKPVSLALKSGNFGDEAFFSRAQREFLS >CP029122|1341507:1354689|1345315_1346635_-|AWF27904.1|DBSCAN-SWA MLLLLVIKAKVQPFVALLLVSLLVALAAGIPAGEVGKVMIAGMGGVLGSVTIIIGLGAMLGRMIEHSGGAESLANYFSRKLGDKRTIAALTLAAFFLGIPVFFDVGFIILAPIIYGFAKVAKISPLKFGLPVAGIMLTVHVAVPPHPGPVAAAGLLHADIGWLTIIGIAISIPVGIVGYFAAKIINKHQYAMSVEVLEQMQLAPASEEGATKLSDKINPPGVALVTSLIVIPIAIIMAGTVSATLMPPSHPLLGTLQLIGSPMVALMIALVLAFWLLALRRGWSLQHTSDIMGSALPTAAVVILVTGAGGVFGKVLVESGVGKALANMLQMIDLPLLPAAFIISLALRASQGSATVAILTTGGLLSEAVMGLNPIQCVLVTLAACFGGLGASHINDSGFWIVTKYLGLSVADGLKTWTVLTTILGFTGFLITWCVWAVI |
13 | Escherichia_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1427570 : 1436904
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP029122|1427570:1436904|DBSCAN-SWA GATGATTGATGTCTTAGGGCCGGAGAAACGCAGACGGCGTACCACACAGGAAAAGATCGCAATTGTTCAGCAGAGCTTTGAACCGGGGATGACGGTCTCCCTCGTTGCCCGGCAACATGGTGTAGCAGCCAGCCAGTTATTTCTCTGGCGTAAGCAATACCAGGAAGGAAGTCTTACTGCTGTCGCCGCCGGAGAACAGGTTGTTCCTGCCTCTGAACTTGCTGCCGCCATGAAGCAGATTAAAGAACTCCAGCGCCTGCTCGGCAAGAAAACGATGGAAAATGAACTCCTCAAAGAAGCCGTTGAATATGGACGGGCAAAAAAGTGGATAGCGCACGCGCCCTTATTGCCCGGGGATGGGGAGTAAGCTTAGTCAGCCGTTGTCTCCGGGTGTCGCGTGCGCAGTTGCACGTCATTCTCAGACGAACCGATGACTGGATGGATGGCCGCCGCAGTCGTCACACTGATGATACGGATGTGCTTCTCCGTATACACCATGTTATCGGAGAGCTGCCCACGTATGGTTATCGTCGGGTATGGGCGCTGCTTCGCAGACAGGCAGAACTTGATGGTATGCCTGCGATCAATGCCAAACGTGTTTACCGGATCATGCGCCAGAATGCGCTGTTGCTTGAGCGAAAACCTGCTGTACCGCCATCGAAACGGGCACATACAGGCAGAGTGGCCGTGAAAGAAAGCAATCAGCGATGGTGCTCTGACGGGTTCGAGTTCTGCTGTGATAACGGAGAGAGACTGCGTGTCACGTTCGCGCTGGACTGCTGTGATCGTGAGGCACTGCACTGGGCGGTCACTACCGGCGGCTTCAACAGTGAAACAGTACAGGACGTCATGCTGGGAGCGGTGGAACGCCGCTTCGGCAACGATCTTCCGTCGTCTCCAGTGGAGTGGCTGACGGATAATGGTTCATGCTACCGGGCTAATGAAACACGCCAGTTCGCCCGGATGTTGGGACTTGAACCGAAGAACACGGCGGTGCGGAGTCCGGAGAGTAACGGAATAGCAGAGAGCTTCGTGAAAACGATAAAGCGTGACTACATCAGTATCATGCCCAAACCAGACGGGTTAACGGCAGCAAAGAACCTTGCAGAGGCGTTCGAGCATTATAACGAATGGCATCCGCATAGTGCGCTGGGTTATCGCTCGCCACGGGAATATCTGCGGCAGCGGGCTTGTAATGGGTTAAGTGATAACAGATGTCTGGAAATATAGGGGCAAATCCAATTTAATTAGCAGTCTTCCTGCATAGTGCGTCAGTAGATATGTACTGCTTATGTAAAACCAAACATGACTAATGTCTGTCAGCTTCTTTAGTTGTAAAATTAATTTTTTAGCGTCGGCATTTGTTGGTGTGATAATGCACTTATCACTTATCTCAAATTCATTTTTTTGCTTTATAAGTAACGACAACAGCTCAATTGATTTAAATGTAATGTTCGTGAATACAATTTTTGAACTGTCTTTAAGATGAATATTACGGATAAGAAAACGAATGTAGTTATCGTTTTTAGATTCCCAGTGCGTATATTTACCAACTATGCTATTATTTTCTGACTGCCAGTGATCGAATGCTGTCTGTCCACTTATATTTTTATGATTAATGTCGCAACCAGCCTCAATTAGTGCATCGTACACATATCCTTCAAATGGAATGAAAAGAAAGGAATTGCCATAGCTATCTTTGGTATGGATATCTAGTCCTGCTTTGATAAGATATTTTACTAACCCTGGTGAAACATCAATATGTTGAATCGCAAGGGTGCCTGACTTATTCCGGTGGTTAACATTAATACCATTGCTGACAAGAAGTTCCACTGTTTCAACATTTTGAGCATAGAATAATGCATTATTACCGTCATTATCCAAGTGGTGAATGTCAATGTCAGCGTCAATCATTGCCTGTATAGCTTCAGGTACATTACAGGTAAAAAGAGCAGTTTGACCTTTACTATTGCACATATTTATATCCATGCCAGCCTTAAGGCAACGAAGGACACCATTTTTAGACTTTACACGGAATAGGTTATTTTTATTATTTTCTCTTGATATCATTGTTTATCTCCTTTTTTGTTGTTTTCAAGATATTAATGACACCAGATTTTACGGTATTCAAATTTATTTTTGTTTTATTAGGGGGTATCTGATTTGGTTTGCTTAAGGTTATTCTTAACTTAATGGAACATAGGGACGTTAAGGTTATCCTATGTGATGTGGATTAACCATTTTCACAAGAATATTTATCCATATTGGAACTAATCTGTATAAAGGAGATAATTAGAGGCTCAGATATTAATACTATAGAGAGTAATGCTATAGGATATGGAAAAGATCATGTAATATAATGGTGGTTATTAATGTTCATCTTAATTTTGGAAATTAAATGAGTAGTGTTTTTTATGAGGTTTTTTGCTGTTGATAATTCTTATTAAAATTTGAAGAGCATCCTTTAAGTTATTCGGCTTGAGGATGCTCATTTAAAATTTAAGCAGCGTAGTCTTTTTCAACAGCCAGGAAGGGGTGATGCTGCCAACTTACTGATTTAGTGTATGATGGTGTTTTTGAGGTGCTCCAGTGGCTTCTGTTTATATCAGCTGTCCCTCCTGTTCAGCTACTGACGGGGTGGTGCGTAACGGTAAAAGTACTGCCGGACATCAGCGCTATCTCTGCTCTCACTGCCGTAAAACATGGCAACTGCAGTTCACTTACACCGCTTCTCAACCCGGTACGCACCAGAAAATCATTGATATGGCCATGAATGGCGTTGGCCTCAACACGATTTTACGTCACTTAAAAAACTCAGGCCGCAGTCGGCAACCTCGCGCATACAGCCGGGCAGTGACATCATCGTCTGCGCGGAAATGGACGAACAGTGGGGCTACGTCGGGGCTAAATCGCGGCAGCGCTGGCTGTTTTACGCGTATGACAGGCTCTGGAAGACGGTTGTTGCGCACGTATTCGGTGAACGCACTATAGCGACGCTGGAGCGTCTTATGAGCCTGCTGTCACCCTTTGACGTGGTGATATGGATGACGGATGGCTGGCCGCTGTATGAATCCCGCCTGAAGGGAAAGCTGCATGTAATCAGCAAGCGATATACGCAGCGAATTGAGCGGCATAACCTGAATCTGAGGCAGCATCTGGCACGGCTGGGACGGAAGTCGCTGTCGTTCTCAAAATCGGTGGAGCTGCATGACAAAGTAATCGGGCATTATCTGAACATAAAACACTATCAATAAGTTGGGACCATTACCAAATAGTTTATATGCTAAAAATTTAGGGAATCATTTTCCTAATTCTCCAGCTAAAAGTTCAGCCTGCTTCAAAACAGTCTCTGTGGCGAGCAATGACATATCAGGCGGATAACCATATTTTTTGAGCAGACGTTTTACCACCACGCGAATTTTTGCTCTTGCTGCTTCTTTCACTGTCCAGTCAAGACTGACATTGTTGCGGATAGCCTCCGTAAGTACGACTGCCAGTTCTCGTAGTTTTTCCTTTTCCATTAACTCGCGAGCACTGTCGTTATCTGCAACAGCAGAATAGAAGGCATATTCATAAGCGCTGAGGTTTAACTGGCTGGCAAGGCTGTCAGATTCCTGGATAGTCTTGGCAAGCTTGATTAGCTCATCAATTACCTCCGCTGCGGTCAGTACCTTGTTCTGGTAGCCATTGATTGCAGAGGTCAGCATATCAATCAGTTTTTTGCCTTGGGTGATACTTTGATTCGAGCGAACCTTGATCTCGTCAGAAAGCAGTTTTTTAAGCGTTTCCAAAGCAATATTTCTGTGTTGGTAATCCTTCATTTCCTGAAGGAATTCTTCAGAAAGAACGGAAATATCAGGTTTTTGTATCCCAGCAGCGTCAAAAATATCAACAACTTTATCGGTAACCAGAGCCTGATCGATAGTCTGTTTTACCCGAACTTCGAGACTGTCATTGTGTTCTTCTTCTGATCCGTCTGAGTTTTCAGTAAATTTATTCAGTCTGGCTTTTACTGCCTGGAAGAATGCTACTTCGGGTGCCGCTTCCATTGCTTTATCGTGCGGTGTCGCCAATGCAAATGCCTGCGATAAGGCTGCAACCGCAGCGAGGAAACGCATTTTACCTTTACCGTTATCCAGCCCTAAAATATGGTTTTCTGATTCAAGAATAATTGTCAGGCGTTGTGAGGTAGTTGCGGCAAAGTAAGCTTTGTAATCGTATCCATGCATCATGCCTTCCAGGATTTCCAGCTTTTCCTGCATGAGCGTTACGGCTTCTTCCTGAACCTCAGCAGGATCTCCACGTCCACCTGCATCAGAGTAAAAGGAGAGGGCTTCTTTTAAATCAGAGGCAATGCCCAGATAGTCAACAACCAGACCGCCTATCTTATCTTTATACACACGGTTCACACGGGCAATTGCCTGCATAAGGTTGTGGCCTTTCATTGGTTTGTCGATATACAGCGTATGCATGCTGGGAGCGTCGAAGCCGGTTAACCACATATCACGCACTATCACCAGTTTCAGCTTGTCGTCGTCATCCTTCATACGGTTAGCCAGAACCTGACGTTCTTTTTTTGTGGTGTGGTGTTTGGCAATTTCTGGCCCGTCAGCAGCAGAAGAGGTCATGACGACTTTTATTATACCGTCATTTAAATCATCGCTGTGCCATTCAGGTTTTAAGGCTATGATTTCTTTATATAGTTCAGCAGCAATACGGCGGGACATGGTAACAATCATGCCCTTACCATGATCGGCATTGGATTTTAAGCGCTGCTCAAAGTGCAGAACCATATCCGCCGCAATCGCTTTAATACGTTTTGAACTGCCAATCAAACCTTCGATTCTGGCCCATTTAGAACGTTCTTTTTGCGTGAGCGTCAGTTCGTCCTCGTTAAACTCATCATCAAAGTCTTCAATAAGCTGACGACCTTCATCACTGATGGCAATTTTGGCAAGACGGCTTTCATAAAAGATACGAACTGTTGCACCATCTTCAACGGCCTGCGAGATATCGTAGATATCAACATAGTTACCAAAAACAGCAGGCGTGTTGACGTCCGTTTTTTCTATGGGGGTTCCGGTAAAGCCGAGATAGGTCGCATTAGGTAAGGCATCACGCATATATTTGGCAAAGCCGTAAACGGTGCGTTTACCTGTTACGTTGCCTTCGCTGTCTTTCACGTCAACTTCTTTGGCGCTGAAACCGTACTGGGAACGGTGTGCTTCATCAGCGATAACGACAATATTGGTTCTGTCTGACAACAACTCATAGATATTGCTGCCATCATCAGGCTGGAATTTTTGAATAGTAGTAAACACCACACCGCCAGAGGCGACACGCAAATATTCTTTGAGTTCTTCCCGGTTGTTGGCCTGTTTTGGTGTCTGACGAAGTAGCTGGGTCGCGGAAGAGAACGTACCGAATAGCTGATCATCTAAGTCGTTACGGTCAGTAATCACAACAACTGTCGGATTATCCAGCGCCAGCACAATTTTCCCGGTATAAAACACCATCGAAAGCGATTTACCGGAACCCTGAGTATGCCAGACTACGCCCGCTTTGCGATCTCCGGTTTTTTGCGCATTAACAAGATCTTTACTGTTACGTCCCTGCTGGCGCAGTGCCACTTCGGCAGAGGGGGAGTCCGCATTCACCGCTGAGGCACGAATAGTGGAAAGAACTGCTGCATTGACCGCGTAGTACTGGTGATAAGCCGCCATTTTTTTAACAGTACGGATACTGATAATCCCTTTGCTGTCTTCATGTTTGCTGGCCTCAAACACGATAAAGTGGCGGATTATATCCAGCAGCGTTACAGGATTAAGCAACCCCTGTAATAAAACTTCAAGCTGTGGTTGGGTACTGGTGGCTTGCGTTTTACCGTTAGCAGTTTTCCACGTCATATAACGACTGAAATCGGCAGAAACCGTCCCCGCTTTGGCTTCCAGCCCGTCAGATATCACATTAAAAGCATTGTAGTTAAACAGACCGGGGATTTGGTTCTGATAGGTTTTAATCTGATTATAGGCACCTGTTACCGTTGCATTTTCGTCAGCGGCATTTTTAAGTTCGATAACGACTAATGGCAGGCCATTGATGAATAAAATAAGATCAGGTCGGCGGGTATGGTTGCCTTCTTTGATGGTGAGCTGATTAATGACCAGAAATGCATTATTAGTGGGATCGTTAAAGTCGATCAGGCTGGCCAGTTCTCCTTGTGTATTACCATCCTTGCTGACTTCGATATTGATTCCTTCGGTCAAAAGGCGATGAAAGGTTAGATTGTTTGCCATTAGGTCAGGTGAGCTGATCTGCATCACCTGTTTTAGAGCTTCTTCACACTTCTGCTCACTAAGGTGCGGGTTAATTCGTTGCAATGCTGTACGCACTTTATCTTCAAGGATAACCTGCTGATAGCTACGTAACGGATTGATTCCACTGGGTTCAATGTCCGGGCCGTAAACATACTCATAGCCCAGCCCTTGCAGGTGCTCAATTGCCATTACTTCAATATCGGATTCGGTCATCTTTGCCATGCGTACTGTCCTTATTGCCGCTTAACCAGAGAGCGGGCTGTACACCCGCATAATTATTTTATGCTACTGATGCGATTGCTTCTTCCGCATACTGAACCCGTACTTCGCCGCTCATCAGTTTGGGAAGGAGAGTGTCGCGCAGAGATTCTATGTTTTTAATTTGTGAGTTGTTAATACTTATTTTTTTCCAGAAGAAATCAATCTGTGTAGTGAATTCTTTCACCATACACTCCGGGGGGAGTTTGAATTCTGGGCGTTGTAATCCGTCAGCCGACAGCTGATTTACAGTTGAACCGTTAGTTGCAGCAACAACTTGCTCATGCATATCTTTGGAGCAAAGTAAATAGTACAGAAAAGCATTGGTTAAGTAACTATCGTCATTGATGCTGATTTTAAATAAATGATGCGTATATATGGATAATTCGCTTTCTGTTTTTGGAATTATAGCAGGATACCCTATGAGTCGATATTCATGGCCCTGTTCTGTATTGGCTACAATGATATCCCCAGGCTTTATAATATGCCTTTCTTTAAAATCACCATTGTAGTATTTTATACCTGCACTCTTATAACCACCGCCCTCAAGTACTGAGTTCAAACTAAAAAGTGGAATGCCGTTATCTGACGTGGTAAGGCCGGCTCCCTTGTAGCTTAATCCCTTAGCTACTGTAATATGACAATCAAGAGTCGTTATTTCCCAATCCGCATGCGTCTCTTCAATAAACCACTGTCTAAATAGGGTTTCTGCCATGGATTCCAGAGTTTTATTCTGGCGATGAAGCAGGTCTATTTTGTCTTTAAGAGAGGACAAAATAGAGGCGATGGCTTTTTGTTCAGATTCATCAATTGGTAAGTAAAGAGGAATCAAATTTAGCGATGCTTTATTTAATTTTGGTTGAGCAGCACCTGTTAAATAAGGAGATAGATCAAGTTGTGAAAAATATTGGATTATCAATGCCGTTAAATGGGGCTTTTTAGCTTTAAGGACATGAGCGTGATTATTAACCCAAAATTTTCCAGTAGCTTCAAATGCTATAGGTGTTTTTCTGGACTTTAAGTTTTCTCCATCTTCAGAAATAAGGATGTAGCTGCCATCAAAAATATAATTGTCGATATAATCGACAATTCCTGACGCTCCATAATATGGAAAATTTCCTTTCTTATGGGCTCGTTCCATAGCACTCAAAGGAATTCTTTGTGCGTTGCAAAAATCTGCTAATTCAGTTATACAAAAATTTCTCCACCTACTCATAAAACCACCTTCGCCAGATTATCAACGATGGCTCTGTTCAGACGCGCCTCTTCTTCCAGTTGCGCTTCAAACTCAGCTTTAAGAGCCGTAAAGCGTTCTTTAAAGTCGAAATCGTCTTCTTCATCAGCAAGACCAACATAACGGCCAGGTGTCAGCACATAATCGAGTTTAGCAACTTCAGCAATGTCTACTGACGCACAGAAACCAGCGACGTCTTCGTAGTCGCCACCTTTGTTACGCCAGTTATGATAGGTATCAGCGATGGTTTTTATATCGTCGTCAGAAAGTACTTTGGTACGACGATTAATTAAATGACCGAGATTACGGGCATCAATAAACAGAATTTCTTTACTACGATCACGATACTGGTTGCTGTTATGACGGTCACGGCGCATAAACCATAATGCAGCAGGGATCTGAGTATTCAGGAACAGCTTTGCCGGTAAGTTGACGATACAGTCAATCACATTGGCATCTTTTACCAGTGCAGCACGAATATCACCTTCACCAGAGCTTTTAGAAGTTAATGCCCCTTTTGCCAGAACAACACCCGCTTGCCCTTTAGGAGACAAGTGATACAAAAAATGTTGCATCCATGCAAAGTTAGCGTTGCCAGCAGGTGGAATGCCATATTGCCAGCGGGCATCACCACGAAGCTGCTCACCAGACCAGTCGGAAACGTTAAACGGTGGGTTAGCTATGATAAAATCAGATTTCAAATCTTTGTGAGCATCGTTAAGAAATGAACCTTCATTATTCCAGCGAACGTGTTCAGAATTAATCCCACGAATTGCCAGGTTCATTTTTGCCAGACGCCAAGTGGTCTGGTTGGACTCCTGCCCATAGATCGAAATATCGTCAATATTTCCCTGATGTGCTTCTACAAATTTTTCTGACTGAACGAACATACCACCAGAACCACAGCAGGGATCAAAGACACGGCCTTTATAGGGTTCCAGCATGTTAACCAGCAGGCTTACAATAGATTTTGGCGTATAGAACTGGCCGCCTTGTTTGCCTTCTGCCAGTGCAAATTCACCTAGGAAGTATTCGAATACGTGACCTAAAACGTCAGCAGAGCGTGCTTTGGCATCTCCTAGTGCAATGTTGCCAATCAGATCTATCAGCTCACCCAGAACAGTGGCGTCGAGGTTTTGTCGGGCATAGACTTTCGGCAACACACCTTTTAACTGTGGATTGCCCGCTTCGATAAGCTCCATCGCATCATCAACCATCTTACCGATTTCAGGAAGCTTGGCCTTAGAAATCAGATAGTTCCAACGTGCAAGCTCAGGGACAAAGAAAACGTTGTAAGCGGTGTACTCGTCTTTGTCTTCCGGATCAGCACCTGCGAACTCGCCTTTACCGGCTTTCAGCAACTCATAATGAGATTCAAAAGAATCAGAAATGTACTTAAGGAAAATGAGGCCTAGTACGACGTGCTTGTACTCGGCTGCATCAATGTTTTTACGCAGTTTGTCTGCCGCTTTCCACAGGATGACCTCTAACGGGTCTGTTTTGATTTCTTTAGGTTTTCTGGCCAT
Protein sequences of DBSCAN-SWA_2 >CP029122|1427570:1436904|1428769_1429639_-|AWF28486.1|DBSCAN-SWA MISRENNKNNLFRVKSKNGVLRCLKAGMDINMCNSKGQTALFTCNVPEAIQAMIDADIDIHHLDNDGNNALFYAQNVETVELLVSNGINVNHRNKSGTLAIQHIDVSPGLVKYLIKAGLDIHTKDSYGNSFLFIPFEGYVYDALIEAGCDINHKNISGQTAFDHWQSENNSIVGKYTHWESKNDNYIRFLIRNIHLKDSSKIVFTNITFKSIELLSLLIKQKNEFEISDKCIITPTNADAKKLILQLKKLTDISHVWFYISSTYLLTHYAGRLLIKLDLPLYFQTSVIT >CP029122|1427570:1436904|1427959_1428799_+|AWF27205.1|DBSCAN-SWA MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRACNGLSDNRCLEI >CP029122|1427570:1436904|1430445_1430823_+|AWF25723.1|transposase|DBSCAN-SWA MDEQWGYVGAKSRQRWLFYAYDRLWKTVVAHVFGERTIATLERLMSLLSPFDVVIWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ >CP029122|1427570:1436904|1435356_1436904_-|AWF25820.1|DBSCAN-SWA MARKPKEIKTDPLEVILWKAADKLRKNIDAAEYKHVVLGLIFLKYISDSFESHYELLKAGKGEFAGADPEDKDEYTAYNVFFVPELARWNYLISKAKLPEIGKMVDDAMELIEAGNPQLKGVLPKVYARQNLDATVLGELIDLIGNIALGDAKARSADVLGHVFEYFLGEFALAEGKQGGQFYTPKSIVSLLVNMLEPYKGRVFDPCCGSGGMFVQSEKFVEAHQGNIDDISIYGQESNQTTWRLAKMNLAIRGINSEHVRWNNEGSFLNDAHKDLKSDFIIANPPFNVSDWSGEQLRGDARWQYGIPPAGNANFAWMQHFLYHLSPKGQAGVVLAKGALTSKSSGEGDIRAALVKDANVIDCIVNLPAKLFLNTQIPAALWFMRRDRHNSNQYRDRSKEILFIDARNLGHLINRRTKVLSDDDIKTIADTYHNWRNKGGDYEDVAGFCASVDIAEVAKLDYVLTPGRYVGLADEEDDFDFKERFTALKAEFEAQLEEEARLNRAIVDNLAKVVL >CP029122|1427570:1436904|1427570_1427936_+|AWF27722.1|transposase|DBSCAN-SWA MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >CP029122|1427570:1436904|1434166_1435360_-|AWF24823.1|DBSCAN-SWA MSRWRNFCITELADFCNAQRIPLSAMERAHKKGNFPYYGASGIVDYIDNYIFDGSYILISEDGENLKSRKTPIAFEATGKFWVNNHAHVLKAKKPHLTALIIQYFSQLDLSPYLTGAAQPKLNKASLNLIPLYLPIDESEQKAIASILSSLKDKIDLLHRQNKTLESMAETLFRQWFIEETHADWEITTLDCHITVAKGLSYKGAGLTTSDNGIPLFSLNSVLEGGGYKSAGIKYYNGDFKERHIIKPGDIIVANTEQGHEYRLIGYPAIIPKTESELSIYTHHLFKISINDDSYLTNAFLYYLLCSKDMHEQVVAATNGSTVNQLSADGLQRPEFKLPPECMVKEFTTQIDFFWKKISINNSQIKNIESLRDTLLPKLMSGEVRVQYAEEAIASVA >CP029122|1427570:1436904|1430868_1434099_-|AWF25396.1|DBSCAN-SWA MTESDIEVMAIEHLQGLGYEYVYGPDIEPSGINPLRSYQQVILEDKVRTALQRINPHLSEQKCEEALKQVMQISSPDLMANNLTFHRLLTEGINIEVSKDGNTQGELASLIDFNDPTNNAFLVINQLTIKEGNHTRRPDLILFINGLPLVVIELKNAADENATVTGAYNQIKTYQNQIPGLFNYNAFNVISDGLEAKAGTVSADFSRYMTWKTANGKTQATSTQPQLEVLLQGLLNPVTLLDIIRHFIVFEASKHEDSKGIISIRTVKKMAAYHQYYAVNAAVLSTIRASAVNADSPSAEVALRQQGRNSKDLVNAQKTGDRKAGVVWHTQGSGKSLSMVFYTGKIVLALDNPTVVVITDRNDLDDQLFGTFSSATQLLRQTPKQANNREELKEYLRVASGGVVFTTIQKFQPDDGSNIYELLSDRTNIVVIADEAHRSQYGFSAKEVDVKDSEGNVTGKRTVYGFAKYMRDALPNATYLGFTGTPIEKTDVNTPAVFGNYVDIYDISQAVEDGATVRIFYESRLAKIAISDEGRQLIEDFDDEFNEDELTLTQKERSKWARIEGLIGSSKRIKAIAADMVLHFEQRLKSNADHGKGMIVTMSRRIAAELYKEIIALKPEWHSDDLNDGIIKVVMTSSAADGPEIAKHHTTKKERQVLANRMKDDDDKLKLVIVRDMWLTGFDAPSMHTLYIDKPMKGHNLMQAIARVNRVYKDKIGGLVVDYLGIASDLKEALSFYSDAGGRGDPAEVQEEAVTLMQEKLEILEGMMHGYDYKAYFAATTSQRLTIILESENHILGLDNGKGKMRFLAAVAALSQAFALATPHDKAMEAAPEVAFFQAVKARLNKFTENSDGSEEEHNDSLEVRVKQTIDQALVTDKVVDIFDAAGIQKPDISVLSEEFLQEMKDYQHRNIALETLKKLLSDEIKVRSNQSITQGKKLIDMLTSAINGYQNKVLTAAEVIDELIKLAKTIQESDSLASQLNLSAYEYAFYSAVADNDSARELMEKEKLRELAVVLTEAIRNNVSLDWTVKEAARAKIRVVVKRLLKKYGYPPDMSLLATETVLKQAELLAGELGK |
7 | Shigella_phage(33.33%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1965040 : 1974482
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP029122|1965040:1974482|DBSCAN-SWA AATGATTGAATTTAGCCATGTCAGCAAACTGTTTGGCGCACAAAAAGCCGTTAACGATCTTAATCTCAATTTTCAGGAAGGGAGTTTTTCGGTGCTGATTGGCACATCTGGCTCCGGCAAATCCACCACCTTGAAAATGATTAACCGCCTGGTGGAACATGACAGCGGAGAGATCCGCTTTGCCGGAGAAGAAATTCGCTCGCTGCCAGTACTGGAGTTGCGCCGCCGGATGGGCTATGCCATTCAATCTATTGGCCTGTTCCCCCACTGGAGCGTGGCGCAAAACATCGCTACCGTGCCGCAATTACAAAAATGGTCGCGGGCACGGATTGACGATCGTATCGACGAATTAATGACGCTACTGGGGCTGGAGTCAAATTTGCGTGAGCGTTATCCGCATCAGCTTTCCGGTGGTCAGCAGCAACGTGTGGGAGTGGCGCGCGCACTGGCTGCCGATCCGCAAGTCTTACTGATGGATGAACCTTTTGGCGCACTGGACCCGGTAACGCGCGGCGCGTTGCAACAAGAGATGACGCGCATTCACCGTTTGCTGGGGCGCACTATTGTGCTGGTCACTCATGATATTGATGAGGCGCTAAGGCTGGCAGAACATCTGGTATTGATGGATCACGGTGAAGTGGTGCAGCAGGGGAATCCGCTGACGATGCTGACTCGTCCGGCGAATGATTTTGTCCGCCAGTTTTTTGGACGTAGTGAACTGGGGGTGCGCCTGCTTTCGTTACGTAGTGTGGCGGATTACGTGCGTCGCGAAGAACGGGCAGAAGGTGAGGCACTGGCAGAAGAGATGACGCTACGCGATGCGCTCTCCCTGTTTGTCGCGCGGGGATGCGAGGTGCTGCCGGTGATGAACACGCAGGGCCAGCCTTGCGGCACGCTGCATTTTCAGGATCTGCTGGAGGAGGCGTAAGCGTATGAAAATGTTGCGCGATCCGCTGTTCTGGCTCATTGCTTTGTTTGTGGCACTGATTTTCTGGCTGCCTTACAGCCAGCCGCTGTTTGCTGCCTTGTTCCCACAACTGCCACGACCCGTTTATCAGCAAGAAAGTTTTGCAGCTCTGGCACTGGCTCATTTCTGGCTGGTGGGAATTTCGAGTTTGTTTGCGGTGATCATTGGCACTGGTGCCGGAATTGCTGTCACTCGCCCGTGGGGCGCGGAATTTCGCCCACTGGTGGAAACTATTGCCGCCGTTGGACAGACTTTTCCGCCTGTCGCAGTGCTGGCGATTGCCGTTCCGGTGATCGGCTTTGGTCTGCAACCAGCGATTATCGCCTTGATCCTTTACGGTGTGCTGCCCGTCCTGCAGGCGACACTTGCCGGGCTGGGAGCGATTGATGCCAGCGTGACAGAAGTTGCGAAAGGTATGGGAATGAGTCGTGGTCAGCGACTGCGTAAGGTCGAGCTACCGCTGGCGGCTCCGGTGATTCTGGCGGGCGTGCGAACTTCGGTGATTATCAACATTGGTACGGCGACGATCGCCTCAACGGTAGGGGCCAGCACGCTGGGTACGCCCATCATCATCGGGCTTAGCGGATTTAATACCGCGTATGTGATCCAGGGGGCGTTACTGGTGGCACTGGCGGCGATCATCGCAGACCGCCTGTTTGAAAGGCTGGTGCAGGCGCTTAGCCAGCACGCAAAATAAAGGTATAACCTGCGAGCATGACGCCACCAATTCCGCCTAACGCCATAAACAGGAACAGGGCGATGACCCCAATTTTAGCTATGCGCATATTGCACTCCTTATGTTAACGAAAGGATTGTACAGTAAAGCGCATTTGTTAACGAATCATTAAATGCCGAGTGGGAAAATATCATGGCCTTGTTCTTGCCAACTGGTGAGTTGCTGCTGTTGGGCGGAGGTTCGATTTTCACCGCACCACACCAGCAATGTACGGCCTTCGAATAGTTCAGGGCGTAGTTGATTGAGCGAGTGGGCGAGGACATCAATGCGCCATCCTTGTTGACTGGCAATCCAGCCCTCCAGCCACAGACGGGTGGTATCCTGAATATTCCAGCCAACCACCAGCGCATCTTTACCCTGTTTTTTACGTGCCGAAGCCAGACAAATGGCGATGTAGTTGATCAGTACGCCGTCGAGGATCGCCAGCAGCGCCTGGAGAGTCGGTTGTTGGCACTGAAGCCGTCGGCGCAGAGGAATAAACAGATGTGTGGTGAGTGTCTGGGCGGGGTAATCCTGACCGCGCTCTTTGATCCACGTTCGCAGGCTATGCAGATTGCCGCTTTGCAGGTAGGTCAGTAATGTTTCTTGCTGATCGCGCCAGCCGTTCTGCACATCAACATTTTCATTACTGAGCAGCATTTTAACTTTGCTGACCTGCACGCCGTTGTCGATCCAGCGTTTGATCTCGCGGATACGGTCAATATCGGCATCGTTGAACAGTCGATGACCGCCGTCTGTCCGTTGCGGTTTCAGCAACCCGTAACGCCTCTGCCACGCGCGTAACGTGACAGGGTTAATATCACAAAGCAACGCCACTTCACCAATTGTGTAAAGCGCCATCGTCTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCAGTTTTGCGAACCAGGTAGTTTTGCCCGTTTTTTGTGCATCTATAGGGTGATTTTATTTTTGCCAGGCGATTTTGAGTGATCGTACTCACGAATTCTCATTTTTCTGCAAGAGTTCAAAGAAAGTTAAACGCAGGCAATGTATGTTACGCGTTTTAAAGGGAAGTGTGGTTTGCGGGTATGTACGATTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTTTTTTTAGTCATTGCGTGGTTAATGAGTAAAACGCCATTATTCATACCGTTAATGCAGGTCACGGTTCGTCTGCCGCATAAATTTCTCTGCTACATCGTCTTTTCCATCTTCTGCATCATGGGCACCTGGTTTGGGTTGCACATTGACGATTCTATTGCCAATACCCGTGCGATAGGCGCGGTAATGGGCGGCTTACTCGGCGGTCCGGTCGTCGGTGGGCTGGTTGGTCTGACCGGCGGCTTACATCGATATTCGATGGGGGGCATGACCGCGCTAAGTTGCATGATCTCAACCATCGTTGAAGGATTGCTCGGCGGCCTGGTACACAGCATCCTGATCCGTCGCGGGCGCACTGATAAAGTCTTTAACCCCATTACCGCCGGTGCCGTCACGTTCGTCGCTGAAATGGTGCAAATGCTGATCATCCTTGCGATCGCCCGACCTTATGAAGATGCGGTGCGTCTGGTGAGTAATATTGCTGCACCAATGATGGTCACCAATACCGTCGGCGCGGCGCTGTTTATGCGTATATTGCTCGATAAACGCGCGATGTTTGAAAAATACACTTCTGCTTTTTCTGCCACTGCGCTGAAAGTGGCAGCCTCGACGGAAGGCATTTTGCGACAGGGGTTTAACGAAGTGAACAGCATGAAAGTGGCTCAGGTGCTGTATCAGGAGCTGGATATTGGTGCAGTCGCGATTACCGATCGAGAGAAATTGCTGGCCTTTACCGGAATTGGTGACGACCACCATTTACCCGGCAAACCGATTTCTTCGACTTACACCTTAAAAGCGATTGAAACCGGTGAAGTGGTCTACGCTGATGGCAACGAAGTACCTTATCGTTGCTCTTTGCATCCGCAATGCAAACTGGGGTCGACGCTGGTAATTCCGTTGCGTGGCGAAAATCAGCGCGTGATGGGCACCATCAAATTGTATGAAGCCAAAAACCGTTTATTCAGTTCAATAAACCGCACGCTGGGCGAGGGGATTGCGCAACTGCTTTCGGCGCAGATTCTTGCCGGGCAATATGAGCGGCAAAAAGCGATGCTCACCCAGTCAGAAATCAAACTGCTTCACGCCCAGGTGAATCCTCATTTTTTGTTTAATGCGCTTAACACCATTAAAGCGGTGATCCGCCGCGACAGCGAACAGGCCAGCCAGCTGGTGCAGTATCTTTCCACTTTTTTCCGCAAAAACTTAAAGCGGCCTTCGGAGTTTGTTACTCTCGCCGACGAAATTGAACATGTGAACGCTTATCTGCAAATTGAAAAGGCGCGCTTCCAGTCGCGGTTGCAGGTCAACATTGCTATTCCGCAAGAATTATCCCAGCAGCAATTGCCCGCGTTTACCCTGCAACCGATAGTGGAAAACGCCATTAAACATGGGACATCACAACTGCTGGATACAGGGCGAGTGGCAATCAGCGCCCGACGTGAGGGGCAACATTTGATGCTGGAGATCGAAGACAATGCCGGTTTGTATCAACCGGTAACCAATGCCAGTGGGCTGGGGATGAATCTGGTGGATAAGCGTTTACGTGAACGGTTTGGCGATGACTATGGGATAAGCGTCGCCTGTGAGCCTGATAGTTACACCCGAATAACGTTACGACTACCATGGAGGGACGAGGCATGATTAAAGTCTTAATTGTCGATGATGAACCGTTAGCACGGGAGAACCTGCGCGTATTTTTGCAGGAGCAGAGCGATATTGAAATCGTTGGAGAGTGTTCAAACGCCGTGGAAGGGATCGGCGCGGTGCATAAACTGCGCCCGGATGTGCTGTTTCTCGATATCCAGATGCCGCGCATCAGTGGTCTGGAAATGGTGGGGATGCTCGATCCGGAACATCGTCCGTATATTGTTTTTCTCACCGCGTTTGACGAATACGCTATTAAAGCCTTTGAAGAACATGCTTTTGATTATCTGCTGAAGCCAATTGATGAAGCGCGACTGGAGAAAACGCTGGCGCGATTGCGTCAGGAGCGCAGCAAGCAGGATGTTTCGCTGTTACCGGAAAATCAACAGGCGCTGAAATTTATCCCTTGTACGGGGCATAGTCGGATTTATTTGCTGCAAATGAAAGATGTGGCATTTGTCAGCAGTCGGATGAGCGGTGTCTACGTTACCAGCCACGAAGGGAAAGAGGGCTTTACCGAATTGACATTACGTACCCTGGAAAGTCGTACACCACTACTGCGCTGCCATCGTCAGTATCTGGTTAATCTCGCGCATTTACAGGAGATTCGTCTGGAAGATAACGGCCAGGCCGAGTTGATTTTGCGTAATGGCTTAACCGTGCCGGTCAGCCGCCGTTATCTGAAAAGCTTAAAAGAGGCGATTGGCCTGTAAAAGACTGCTAAAATGGCTTTTTGCCTCATCAACACCTGAAGGCCTCATGCTAAGTAACGATATTCTGCGCAGCGTGCGCTACATTTTGAAAGCCAATAATAATGACCTGGTGCGTATTCTGGCGCTGGGTAATGTCGAAGCCACCGCGGAACAGATCGCCGTCTGGCTACGTAAAGAAGACGAAGAGGGTTTTCAGCGTTGTCCGGACATTGTTTTGTCGTCATTCCTCAACGGCCTGATTTATGAAAAACGCGGCAAGGATGAGTCTGCTCCGGCACTGGAGCCGGAACGTCGCATTAATAACAACATCGTGCTGAAAAAATTACGCATCGCGTTTTCGCTGAAAACCGATGACATTCTGGCTATCCTCACCGAACAGCAGTTCCGCGTTTCGATGCCGGAAATTACAGCGATGATGCGCGCACCGGATCATAAAAACTTCCGCGAATGCGGCGATCAATTTTTACGTTATTTTCTGCGTGGACTGGCAGCGCGCCAGCATGTGAAGAAAAGCTAAGACGGGTATGGCGGCCATGCGAAACATGGCCGCCGACAGATTATTTCACTTCTTTAAAACCAGCGGCTTTCATCACCAGTTCCATTTGCGCCATAGTGATACCTTTTTTGGCATCTTCAGCAGAAACGTTGATTCCTGAAATACCCTGCAGGGCTTTAAAATCCACTTTTTCCATATCGATAGTCACGTTTTCCTGCACGTAGGTATCTGTATAGGTTAATTTTTCTTCAACACCCGCGATGTTTTTGTATTTGGCGCTTAACGGCTCAAGTGTCTTGGCAGCGTCTTCTTTGGTGGTTGCACCAATAGAAGCAAATTGAATTTTGGTTTCAGAAGATTGCTTAAGCACCTTGTCACCTTTGTAGACATAGGTAATGGCAATTTCAGTGCCGTTCAGATTGGCGCTGAATTTCTTCGATTCTTCTTTGTCACCGCAGCCAGCAAGAGAGAAAACCAGAACAGATGCAACAACGAGGGAAAACAGCTTATTGAAAGCCTTCATGTAAAACTCCATTTTATTTAATCAAGAAACTGGTGACTCTCACCAGGGGCTATATAGAATATGCCTAATACCGTGACGTGAGCAGTCCGGAACTGGAGTAGAACTCTTAGTAAAAAGCACTATTTCATCCTTGTTGCTGAAGCATGGGGAATAATTGTTCGCAAAGTAAAACACCGTTATTCATTGCTTCTACCCGTGCCTCGCTTTCTGTATTACGAAATTGTCCCAACACATGTGCCAGCCGATAAAAACCGACCGCGGAGAGGTCATTCGCCAGCAACTCTGCCTGACTAATAGCGCTCTGTTCCTGATAGCGCCAGCCGTTATGGAGCAGTTGAATAAGTAACGCCTGGCAGCGCATCAGCAACTGATGAGCGGTAGACGGCACAGGCAAAACGCTGGCAGAAGGTAGCGGTGCCACAGGCGCAGTTTCCGCGTCCAGCGCCCAGGCGCGGGTTTTTGTCATCATCACCCGTGGTTCCAGTGTCAATTGCCCATCAACAAAACTGACAAAGCCAGAAACCAGACTCACGGGGTCGTCTGTTTGTTGCAAAAGCGCCGCCATGCGTTCAACGGCAAAAGGAGAACAGGCAGATGCCGGAAGGGATAACGTCAGGAGATTATCTTCACCTTCGCCGCTGATTACCTGCGCATCCAGCGTCTGGCGGCTGCTATCCCAACCGAGCGAAATACACTCAGCGACCGGCAGAATAAATAAGTTATCGACCTGATTAAGAGGCCGTATGCAGGCGGGGGGACGCTGGCGTAAATATTCCCGCAAAGCCACAATGCCCGGCTGGCGTAACGGCGCGCTCAACATTTGCCAGGCATCAGGCGACAGCGGCACAACGCTGCTTAAGCGGTTGCGGGTAGCTAACAGCAGCTCGCCATCGGCACTGCGTTTTGCTGCTTGTGAAACAATTTGCCCACCCGCCAGTGCGCCAGCCTGAAAACTAAACAGCCGACGCGTAGCTGCCGGTGAGTTTTCCTGTTCACTTCGCGGCCAACTGCGCGAAAGGTGCAAAATACTGCCGGTGTCGGGATCGGTAAACCAGATGCGTAAACCATAATGCTCAATATCCTGCCAGCAACGCATACCTAAAGACACCAGCCGCAGATGATCAAGCTTTGCTTCTCCGGCAATGCCAGAGCCAACGACCGTGCGCCACGGCACAGGAGGAACTTCACCAACACTGTCGCGCCGGGCCATCTCTTGTGCGCAATTTAATCGACTGTTTAATGCCGCAAGCTGACGTAAGCATTCTCCGGCATGATAGTGGCTGGCGCGAGTGTGGAAGGCATCAACGCTTGCCCGTAGCTGTCGTAGTGATTCACTCACCCAGCGCCAGTTGCAGGCCTCTGCCGCCTGCAATGCGCGGTTGAATGCTGCCTCGTAGTGAATAAGCGGCTGGCTGATGCCGCCAAGCCATAATGCCTGGCTTAATTGCTGAACATATTGACGACACGTTTGGCCTTCTTCGCTGGCAAACGGATCGTCAGATGATGTGACGTGTTCGCTGCGCATCTGCCAGATTAAATGGTTAAATTCTGCTTGCTGCGCTTTGGCCTCGACGAAGGCCTGCACAGCCAGTACGACATGTTCGCAAAGTGTGCCTTCAATACAATCACAACGGGCGAAACGAATACTGCTACGGGAATAAAAACGCACATCGCTCATCGGTAAGCGGGCAGAGGGAATTTCGCCCGGCGCACAGAACAACTCAATGGTGATGCCTTTAGCGACCAGCGCCTGCGCGCGTTTGCGGGTGGCATCGGGCAGGGTAGCCAGTTCTTCCAGCCAGATTGCCGGATCCCACTCTTCTTCTTTTTCCGTGGGCTGGACGGTGGCACAAAGTCGTTGATAACTTAACACCAGCATCACGCGATGACGGCACATGCCGCTGGCCCCGCAGCTGCACTGAGCCTCTTTCAGTGCCTGGCTGTTCGCCAGCTGGGTACGGACACCGTCACTGAAGGTGGCGATTAAAGCGCCGTTCTCATGGCTGATTTCCGGGACGTTGCCATTTTCCAGTTCCTTAAGACTGCGCTTAACAAAACCGGCATTGCTTAACGCCGTCAGTGCCTGCGGTGTCAGTTCTAATAATTCCGGACGTAGTGAATTCATGACTGAAGATTCTCCGCAAGCCATGATGCCAGCTCGCCCGGCGTCATGGCGGCTATTTGTGCGCCGACATTAACCAGCGCCTGGGCCGTATCGCGGTCATAGCAAGGCGTTGCGGTGCTATCGAGCGCTGCCAGTCCCAGCACTTTGATGCCGCTCTGGACACACTTTTTCACCTGATGCGTCAGCAATGATGATGAACCCCCTTCGTAAAAATCGCTCACGAGGATAATGACGCTTTTTGCTGGTTGTTCAATAAGTTGCCGACCATACTCCACGGCACTGGCGATATTGGTCCCGCCGCCCAACTGTACTTTCATTAATAACTCTACCGGATCGGCAACGTCTGCCGTGAGATCAACCACGCTTGTGTCAAACGCCACCAGATGGGTACGAATGCCGGGTAACTGCCACAAACAGGCCGCCATCACCGCAGAGTGGATCACCGAGTCCACCATCGATCCGCTTTGATCAACCAGTAAGACCAGTTGCCATTGTTCGCTTTGGCGTTTAATGCGGCTGTTAAAGCGGGGGGATTCGATATACAACTTGCCGTGTTGCGGGTGCCAGTGTTGCAGGTTGGCGCGCAGAGTACTTTTGAAATCAAAGTTTCGCGCCAGTGGAATTAATGAGCGGCGACGGCGATCGCGGACACCAGAAAAAGCCTGACGAACTTCCTTTGCCAGTCGCGCCATAATTTCTTCAACAACCTGGCGCACTATCTGGCGGGCGGCAGCCAGTACTTCGGGATTCATCAGATGTTTGGTGTGCAAAACGGCGCGTAGCAGGCTTTCCGAAGGCTGCATACGTTCCAGCACGTCGAGATTCGTCACCACATCTTCAATGCCGTAGCGCAGTACGGCATCGCTTTCCAGTCGCTCAATCACCTGTTGCGGAAACAGAGTGTGAATACTGTTGATCCACTCAGGGGTGGTGAGATTTGAGCCACCTAATCCACCAGAGCGTTCACCACGCTGGAGCCGTTCAGGATCGCGCCCATACAGCCACTCCAGCGCGTGGTCTATCTGCCGGGCGTTGTCATCCAGCCCACAAAGCGTCGTTTCTGCCGCTTCGCCAAGAATTAATCGCCAGCGTTGTAGCTCACGGGTGGTCAGAAGATCGTTCAGTTCAGACAT
Protein sequences of DBSCAN-SWA_3 >CP029122|1965040:1974482|1973345_1974482_-|AWF25798.1|DBSCAN-SWA MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSLIPLARNFDFKSTLRANLQHWHPQHGKLYIESPRFNSRIKRQSEQWQLVLLVDQSGSMVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIAAMTPGELASWLAENLQS >CP029122|1965040:1974482|1971348_1973349_-|AWF27236.1|DBSCAN-SWA MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENGALIATFSDGVRTQLANSQALKEAQCSCGASGMCRHRVMLVLSYQRLCATVQPTEKEEEWDPAIWLEELATLPDATRKRAQALVAKGITIELFCAPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKAQQAEFNHLIWQMRSEHVTSSDDPFASEEGQTCRQYVQQLSQALWLGGISQPLIHYEAAFNRALQAAEACNWRWVSESLRQLRASVDAFHTRASHYHAGECLRQLAALNSRLNCAQEMARRDSVGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTDPDTGSILHLSRSWPRSEQENSPAATRRLFSFQAGALAGGQIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWQMLSAPLRQPGIVALREYLRQRPPACIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGEDNLLTLSLPASACSPFAVERMAALLQQTDDPVSLVSGFVSFVDGQLTLEPRVMMTKTRAWALDAETAPVAPLPSASVLPVPSTAHQLLMRCQALLIQLLHNGWRYQEQSAISQAELLANDLSAVGFYRLAHVLGQFRNTESEARVEAMNNGVLLCEQLFPMLQQQG >CP029122|1965040:1974482|1966850_1967582_-|AWF24892.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI >CP029122|1965040:1974482|1970251_1970722_+|AWF28565.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKS >CP029122|1965040:1974482|1970762_1971224_-|AWF26958.1|DBSCAN-SWA MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLTYTDTYVQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK >CP029122|1965040:1974482|1965971_1966703_+|AWF24647.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRLRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK >CP029122|1965040:1974482|1969485_1970205_+|AWF24375.1|DBSCAN-SWA MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >CP029122|1965040:1974482|1967803_1969489_+|AWF24186.1|DBSCAN-SWA MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >CP029122|1965040:1974482|1965040_1965967_+|AWF26082.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGEIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMTLLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSELGVRLLSLRSVADYVRREERAEGEALAEEMTLRDALSLFVARGCEVLPVMNTQGQPCGTLHFQDLLEEA |
9 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2543045 : 2557148
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP029122|2543045:2557148|DBSCAN-SWA GTTATATTTTTTCGATCTCGACCAGATTAGTGTGCTGCGGGTTTCCCTTCGCCAGTGGTGAAGGGCGCAGAGTGGTTAGCGTATTCACACAGCCGCCATGGTCGATTTTATCGCCAGACATATTGGCCTCGTGCCAGGCTCCCTGGCCCATAGCGCTAACCCCAGGGAGAATGCGTGGTGTCACTTTGGCTGGTAGCCGAACTTCGCCACGATGGTTAAACACCCGCACCATATCGCCGTTGGCAATCCCACGTTTCTGCGCATCTATAGGGTTGATCCACACCTCCTGACGGCAGGCAGCCTTCAGGAGATCAATATTGCCGTAGGTCGAGTGAGTACGGGATTTGTAATGGAAACCAAACAGTTGAAGTGGGAAGGTTCTACGTTCAGGGGAGTCCCAGCCTTCAAAGGTAGAGGCATAAACTGGCAGTGGGCTTATCACTTCATCTTTTTCCAGTTCCCAGGTACGGGCAATTTCCGCCAGCTTGCTGGAATAAATTTCAATCTTACCGGAAGGTGTTTTAAGTGGATTTGCCTCGGGGTCGTCACGAAATGCTTTGTAGGCGACAAAATGACCATTGGGATCTTTACGCTTATAGATACCCATTTTTTTCAGTTCGTCGTAAGACGGTAACGCCGGATCTTTGGCAAGCATTTTGGCGTACAGATGTTGTAACCATTGTTCCTGCGTGCGACCTTCAGTGAACTTTTGATAGACGTCAGGTCCAAGACGTTTCGCGACTTCACTCAGGATCCAGTAAATCGGTTTACGTTCGAATTTTTCGCTGGTGACTGGCTGGAGGAAAATGAGATATCCCATGTTACCGGCGTAGTCGTTAGGAATAATATCTTCCTGCTCAACGGTCATCAGGTCTGGCAGCAGAATGTCGGCATATTTTGCCGATGAGGTCATAAAGTTTTCGATGACCACAATCATTTCGCATTTCGATTCGTCTTGCAGAATTTCATGCGTTTTGTTGATGTCAGAATGCTGATTAACGAGGGTATTTCCCGCGTAGTTCCAGATGAACTTAATGGGCACATCCAGTTTATCTTTGCCGCGGACGCCGTCGCGGATTGCCGTCATTTGCGGACCATGATCGATAGCATCTGTCCAGCTGAAGCAGGAGATTGACGTTTTGACCGGATTATCCAGCACCGGCAGGCGTTCTATGGTAATGGTATAGGTCGATTCACGCGCGCCACTATTTCCGCCGCTGATGCCGACATTGCCCGTCAAAATAGGTAACATAGCAATAGCGCGTGCAGTCAGTTCGCCGTTTGCCTGGCGTTGCGGCCCCCAGCCCTGGCAGATATAAGCGGGTTTTGCTGTGCCAATTTCACGCGCCAGTTTGATGATACGGTCTACCGGGATACCGGTAATTTGCGAAGCCCACTGCGGCGTTTTCGCTGTGTTATCATCACCTTCACCAAGAATATAGGCTTTATAGTGACCATTTTTGGGTGCATCTGCGGGTAAGGTTTTTTCGTCATAGCCGACGCAGTATTTATCGAGAAAAGGTTGATCAACGAGATTTTCGTTAATCAATACCCAGGCAATACCCGCAACCAGCGCGGCATCGGTGCCCGGGCGAATAGGGAGCCATTCATCTTCACGACCAGCAGCCGTATCGGTATATCGCGGATCGATAACAATCATTTTGGCGTTCGATTTCTCGCGCGCTTTTTCAAGAAGATAAGTGATGCCACCGCCGCTCATGCGGGTTTCTGCCGGGTTGTTACCAAACATCACGACCAGCTTGCTGTTTTCAATATCCGTGGTGCTGTTGCCATCATTACTGCCGTAGGTGTAGGGCATGGCACAGGAAATTTGCGCGGTGCTGTAGGAGCCATACTGATTGAGTGAACCGCCGTAGCAGTTCATCAGGCGTTTGACCGCCGAGGCTGATGGCGAAGAGCGGGTCATATTGCCGCCAACAATCCCCGAAGAGTACTGAATATATACAGCCTCATTGCCATATTGTTCGACGGTTTTTTTCAGGCTACTGGCGATAGTATCCAGGGCTTCATCCCAGCTAATCCGTTCGAATTTGCCTTCACCGCGTGTACCCACGCGTTTCATTGGGTAATTCAAGCGATCGGGATGATTAATACGCCGGCGGATGGAGCGACCGCGCAAACAGGCGCGTACCTGATGGTTGCCGTACTCATCGCTGCCGGTATTGTCAGTTTCCACCCAGGTCACTTCATTATCTTTAACATGTAGACGAAGTGCACAGCGGCTACCACAGTTGACGGAACAGGCACCCCAGATCACTTTTTCGCTGGCCTGTTGTACCGTTGCCGCTGCACTGCGCAGGGTGAACGGTAAAGAAAAACCGCCTGCAGCCAGCGCCAGAGAACCTATCGCGGTTGATTTAACGAGTGTTCTGCGGCTGATGCCCACCATTCGGTCATTTTTGGACATAACTCACTCCCTGTTCTTTATCGTTATATAAATGTTTATATATTGAATATTTAGCGCGCTAACAATAGAGGGAGTCTACCCATTTTGGGTTAAGAATTATTAATCCATATCAATAGAAGGGTATGAGTAATAAGGTGGGATTATGTTGTATGTTCAAATCGCCGGATGTGTCGTATCCGGCGTTCAGTCGATAATGTATTACTGCGGTTCGGCAGGCGCGCCATCCTGGGTAGACTGCGCGGGAGCAGAGACGTTACCGCTGGTGGTGCGGGTATAGAGAATTTTATGCGTATCATTAGCGCAATGGCCGACGACCTGGGAATCAGGCTGATCAACCTGGTCATTGGGTACAATACTTAACGTGAAGCTGCTTTCGGGTACGCCATTATTGATAATGCGCTGTGATATATCGCTCTGTATGCGCTCACAGGATCCCGGCGCGGCGAGTACCACGGGTGAGGCGAGGGCGAGCAGAAGCGCGGCACAGCAGGTTGAGAGTTTCATCATAAGCTCCTTACGCGAAGATAACTTCTTTAAGCATAGCATTTAACGTGTAAAGTACTGTATTTGCTACTATGATTGAGAATCATCTCTACTCTCTGGTGACTGTTGTGAAATACAAATTACTACCATGCTTACTCGCGATATTCCTCACAGGATGTGACCGCACAGAGGTAACACTTTCATTTACCCCTGAGATGGCCAGTTTCTCTAATGAATTCGATTTTGATCCGCTGCGTGGTCCGGTAAAAGATTTCACTCAGACATTAATGGATGAGCAAGGTGAAGTGACGAAACGTGTTTCTGGGACTTTGTCGGAAGAAGGCTGTTTTGATTCACTCGAATTACTGGATCTGGAAAATAATACCGTGGTCGCTCTGGTACTGGACGCCAATTATTACCGTGATGCCGAGACGCTGGAGAAGAGAGTACGTTTACAGGGAAAATGCCAGCTAGCAGAATTACCTTCTGCCGGGGTGAGTTGGGAAACCGATGATAATGGCTTCGTGATTAAAGCCAGCAGCAAACAAATGCAGATGGAATATCGCTATGATGATCAGGGTTATCCGCTGGGTAAAACCACGAAAAGTAACGACAAAACATTATCTGTCAGCGCCACGCCATCAACGGATCCGATCAAAAAATTAGATTACACAGCGGTTACTTTACTGAATAATCAACGGGTTGGTAATGTAAAACAGAGCTGTGAATATGACAGTCACGCTAATCCGGTGGACTGTCAGCTAATCATTGTTGATGAAGGAGTAAAACCCGCCGTCGAACGGGTTTACACCATCAAAAATACGATCGATTATTATTAATGCTATTGTGCGGTCGGCTTCAGGAGAGTCTGACCCGGTGTTTTGTGCTCTGCCAGATACTGATGCTGGAATATACACATGCGAATGGCATTACGATATTGACCATTAATAAAGAACTCGTGCATCAATTCACCTTCAACCGAAAAGCCAAGCTTGCGGTAAATGTGAATCGCTTTTTCATTCTCTTTATCAACGATCAGATACAGCTTATAGAGATTGAGAACGGTAAAGCCATAGTCCATTGCTAATTTGGCGGCACGGGTTGCCAGACCTTTCCCCTGATACTCCGGGGAGATAATTATCTGAAATTCTGCGCGGCGATGAACATGGTTAATTTCCACCAGCTCCACCAGACCGGCTTTTTCGCCGTCACATTCCACCACAAAGCGCCGTTCGCTCTGATCGTGAATATGCTTATCATACAGATCAGAGAGTTCAACAAAGGCTTCGTAGGGTTCCTCAAACCAGTAACGCATCACACTGGCGTTGTTGTCGAGTTGATGTACATAGCGTAAATCTTCACGCTCCAGCGGGCGTAGCTTAACACTGTGGGCGCTTGGCATAACGTGTCCTTACATTCCTTAAATCAATAACAGGTTAGGGGGTAATAACGCGGCCAGTTCGACGGTCCAGGCAGCGCAAAGTATTGGGCTCCCAGTAGGCATTGATGTTGGCGCTTTGCTCACATTTATCGCGGTTATCAAAAGCGGCGTCGGCTTTATCCCACTCTTTTTCAGTGCGTTTATTCACTTTCTGGCGCAGATTGCGCGTGTCATTCCATTGCTCTTTTTCCATAGCGGCGTGCTGGCGGCTTTGTGCACTGTCGCCAGACTCAATCACCAGTTTGTTAGTTTCGGCATGAACAGTTGTGCTCAATGCCAGTGCGCAAGGCAGCAGAATAGCGAGCAGGCCGATTCGTTTGCTGAGAGTGATTTTCATAATTCATTCCCTGTATGAATGATTAAAGGTGATTCTACACCATCCACTGCGGACGCAAAACGTACCAGGAGGGTGTTTATATTGATGATATTATGTCGCCCTATAACTATACATGATGTCAATAAGAGACAAAGATGATTAAAACAACGTTACTATTTTTTGCTACTGCGCTGTGTGAAATTATTGGATGCTTTCTGCCCTGGTTGTGGTTAAAACGAAACGCCAGTATCTGGCTGTTGCTTCCGGCGGGGATTTCACTGGCGCTGTTTGTCTGGTTGTTAACGTTGCATCCAGCGGCGAGTGGGCGTGTTTACGCGGCTTATGGTGGCGTTTATGTCTGCACGGCGTTGATGTGGCTGCGCGTTGTGGATGGCGTGAAACTGACTCTTTATGACTGGACGGGTGCGTTGATTGCGCTTTGCGGCATGTTGATCATTGTTGCGGGCTGGGGGCGCACGTAGGAACATAAATCCATTTTATCAATAAGATAAGAGGAAGTGTCAGCTGACAAAAGGTATTCTATTTCATCTTTTGTCAACCATTCACAGCGCAAATATACGCCTTTTTTTGTGATCACTCCGGCTTTTTTCGATCTTTATACTTGTATGGTAGTAGCTCAGTTGCGTAGATTTCATGCATCACGACAAGCGATGCAAGGAATCGAACATGAAGATCGTAAAGGCTGAAGTTTTTGTTACCTGTCCGGGGCGTAATTTCGTCACATTAAAAATCACCACTGAGGACGGTATTACGGGCCTTGGGGATGCCACCCTCAATGGACGTGAGCTTTCCGTGGCCTCTTATTTGCAGGATCACCTTTGTCCGCAGCTTATTGGTCGCGATGCGCACCGTATCGAAGATATCTGGCAGTTTTTCTATAAAGGTGCTTACTGGCGTCGCGGTCCGGTTACGATGTCGGCCATTTCAGCGGTTGATATGGCGCTGTGGGATATTAAAGCCAAAGCTGCCAACATGCCGCTTTACCAGTTACTCGGCGGCGCGTCTCGTGAAGGGGTGATGGTTTATTGCCATACCACCGGTCACAGTATTGATGAAGCTCTGGATGATTATGCCCGTCATCAGGAGCTGGGATTCAAAGCCATCCGCGTGCAGTGCGGAATCCCTGGTATGAAAACCACCTACGGCATGTCGAAAGGTAAAGGTCTGGCTTATGAACCCGCAACCAAAGGACAGTGGCCGGAAGAGCAGCTGTGGTCGACGGAGAAATACCTCGATTTCATGCCGAAATTGTTTGACGCGGTACGTAACAAGTTTGGTTTTAATGAACATTTGCTGCATGACATGCACCATCGCTTAACGCCTATTGAAGCGGCGCGCTTTGGTAAAAGCATTGAAGATTATCGCATGTTCTGGATGGAAGACCCGACGCCTGCAGAAAACCAGGAATGCTTCCGTCTCATTCGCCAACATACCGTCACACCCATCGCAGTGGGTGAAGTCTTCAACAGCATCTGGGACTGCAAACAACTGATTGAAGAGCAACTCATCGATTATATCCGCACCACGCTGACCCATGCAGGCGGAATTACCGGTATGCGCCGGATTGCCGATTTTGCTTCGCTGTATCAGGTACGTACTGGCTCACACGGTCCTTCCGATTTGTCACCAGTCTGCATGGCTGCGGCGCTGCACTTTGATCTGTGGGTCCCCAATTTCGGTGTCCAGGAATACATGGGTTATTCCGAACAAATGCTCGAAGTCTTCCCGCACAACTGGACTTTCGATAACGGCTATATGCATCCGGGAGACAAACCGGGTCTTGGCATCGAATTCGATGAAAAGCTGGCGGCGAAATATCCCTATGAACCTGCTTATCTACCAGTCGCACGTCTGGAAGATGGCACGCTGTGGAACTGGTAAGGAGTAAGATAATGAAAAGCATATTAATTGAAAAACCGAATCAACTGGCGATTGTCGAACGTGAAATACCCACCCCGTCAGCGGGTGAAGTACGAGTAAAAGTGAAACTTGCCGGAATTTGTGGTTCAGATAGCCATATTTATCGTGGGCATAATCCTTTTGCGAAATATCCGCGCGTCATTGGACATGAATTCTTTGGCGTCATTGATGCGGTGGGTGACGGCGTGGAAAGCGCCAGAGTCGGTGAACGTGTTGCTGTCGATCCGGTGGTCAGCTGTGGGCATTGCTATCCGTGCTCTATAGGTAAACCGAACGTTTGTACGACACTGGCTGTTTTAGGTGTGCACGCTGACGGTGGTTTCAGTGAATATGCCGTGGTTCCGGCAAAAAATGCGTGGAAAATTCCTGAAGCAGTGGCCGATCAATATGCGGTAATGATCGAACCTTTTACCATTGCGGCTAACGTAACCGGACATGGTCAACCGACTGAAAATGATACCGTTCTGGTTTATGGTGCCGGTCCAATCGGCCTGACGATCGTTCAGGTATTAAAAGGCGTCTATAACGTTAAAAATGTGATTGTTGCCGATCGCATTGATGAACGACTGGAAAAAGCGAAAGAGAGCGGGGCAGACTGGGCGATCAATAACAGCCAGACACCGCTTGGCGAGATTTTCGCTGAAAAAGGCATCAAGCCGACATTAATTATCGATGCGGCTTGTCATCCTTCTATCCTGAAAGAGGCCGTAACGCTGGCTTCTCCAGCGGCACGTATTGTATTGATGGGCTTCTCCAGTGAACCGTCTGAAGTGATTCAGCAAGGAATTACCGGAAAAGAACTCTCTATTTTCTCTTCACGCTTAAATGCAAATAAATTTCCGGTTGTTATCGACTGGTTAAGTAAAGGGTTAATTAAACCAGAAAAATTAATTACCCATACGTTTGATTTCCAGCATGTTGCTGATGCCATTAGTTTATTTGAACAGGATCAAAAACATTGCTGCAAAGTCTTACTCACTTTTTCTGAATAATACCAATAACGGCGAGTAAGTAGTACGCATCTTACCTCTTTTTTAGAGATAACCATTATGACAATAGAAAAACATGAAAGAAGCACTAAGGATTTGGTGAAAGCAGCAGTATCGGGATGGCTGGGCACTGCGCTTGAATTTATGGATTTCAAGAGTCATGCGTGTTAACTATTTGATAAATATTAAATTAATTTTTCATTGCTTCGTTATGGGGCATGGTTGGGGCAAACTCGCTTAACTGTGTATTTAACAAAGCTACCTGTGCATTATTGTTTTCAGACATCCATTTTCCGTATACCTGAAATACCATTTGCGCATCTGCATGGCCCATCTGGTTTGCTATAAATGCCGGGTTAGCACCAGCTGTCAGCGACCAGCAGGCATAAGTATGTCTCGACTGATATGATTTTCGATGGCGGAGTCCGGCACGTTTTATCGCTGCGTCCCACATCTGCCTTATTGAGTCAACGGTAAAATGGTCACCATAATTTTTTACTCTCGCTGACACTTCAGGTTGAAAAACAAAGGTGCATTTTTGTTTTTCTGTTCTGCCATACTCTCTGAGGTGAACATCAATGATATGCTCTTTGCTCAGTCTCGTTAATGTCATCTGACTCCGGAGAGCGTCGATTGCTGGCTTAATAAGATGAATGACCCGATTGGTTCCCGCCTGTGTTTTTGGTACCGTGAAACGGTCTTTTGCTAAATTTCTCCTGATCATCATTGTTCCATTTTTCAGATCTATGTCCTCCCATCCAAGTGCACACAGCTCACCAGGGCGAACGCCAGTATAAACAGAAACACACCATAAATTTTTTGCTTGCTGATTTCTGCACGCATCGATAAGACGGATAAATTCTTCCCGCGAAAGAGGATCCGGAATGGTTCTTGATTCCTTTAATGGCGAGATCCCCTTAAACGGATTATCTGCCAGGTAACCGTTATCAACACCAAACTGGAACACGGCGTTAAGATTTGTCATGTAATTATTTACAGTTACAGCCGATCTCCCTGGTTGTGTAACAATATAGTTACTTTTGGGGATCTGGTATCCAGTCAGTAACTCTTTACGAACCTCCAGTAATTTTTCTTTATTAATCGATGAGGCAAGATTTTTTTCACCGATTATGCTCAGGATATTTTTGATGACGGCACGGTATGTGTTGAGTGATGTTTTGGCGACTTCAGTTTCTTTCAGTGCCAGAAATTTTTCAGCCAGTTCTTTTATGGTTAAATCTTGTCGGGCCTCACCAAATTTTTCCAGATTGCGTGAGGAGGGAAACTGTTTTGCATAGTCGAAAACACCAGTTTTTATTGCGTAACAAACAGAGGAGCGTAGTTCACCTGCAACGCGCCTGTTTTTTGCTGTGTCAGGAACCCCCAGGTTTTCCCTGACTCTTACGCCTTTATAAACAAACCAGATACGTAATTTCCCTCCATGGTTTTCCACGCCTGTCGGATATTTCATTTCAACTTCTCTCATTAGTTAGTGTGGCTTTTAGTCAAGTAAGATGACGTCTTGGTCTCGCTGATGCCTGGCGCTCAATCCAGCGATCAATTTCTTCCAGGTTGTAAAAGCATGGACTGTTATCCCATGGCATACCGTCATGAGCGACATGCTTATATTCCCTTCCTTCCATAAACGATTTTTCCCGGGCCTTTTTTAACGTACCTTTTTTTATTCCTTTCAGCGCAATTAACTGCTCTTCGGATACCCATTTGCCGGGAGAGACAATCATGATTACTTCGCTCATCGATTTCTTTATCTCTTACATTAGACGAGCGCCGGTTGCAGAATACCAGTCACAACCGGCGACAGTTGAACATTAAGAATCAGCCTGACTCGGGATCAGTTTTTGCCAGATAACTGAAACGTATTTTGCCTGGTAACGGGCGTCATCAAGTGCATTATGGCGCTCACCTTCGAATTGAATAGCCGTTCTGGCATCGAAGTCTATGACTTTCCCCAGCTCAACGATTGTGCGTACATCGCGATCGTTGTAGTAACGCCACGGGCAGGGGATCCCCTGCCGTTCGTATGAACGGCGCAAAATCGTGTTGTCGAAGTTGGCTCCATTCCCCCAAACCTGAACAAAAAATTCACCGGAGTTTTCGTCGATAAATTCCCGCAATTGTAACAGTGCATCATCTAACGGGATTTCATCGGTCATAATGGCAGATTGCGCTTCGCGTGATTGCTTAAGCCACCATTTAATGGTGTCCCGATCAATGACTCCGCCAGCAGTTTCCAGATCGATAGTCTTACTAAATTCCGGTCCCATATCTCCGGTTTGCGGATCGAAAAATATTGCACCTATTGAGGTGATCGGGGCATCAGGATTTTTTCCCATGGTTTCAAGGTCGATCATTAGATGGTCACACGTCCTGCTGGTGGATGTGATTTCGTGATGACCGTTCACCTTAATTGGGTGATCTGCCGTCTCGCCAGTTTCATTATCGCTATTGTGATGCTGATTGCCGCCAGTGTTCTCCTTGTGTGGATGTTCAGCGCCTTCCATTTCCTCCGGATCATCTTCCTGAACTTCAACCTGATACTCTTCATCGAATGTTTCTTGGTATGTTGCGTCGCCCATCACCGCGCCACAATCAGGGCAGTTGCCGCCGCCGGTCTGACCGCAGGCGGTGCAGACTTTTTCCACTTCCTGTTGCGCTACTGGTTCAGGCTGTTTCGTTTCTGGCTCGTTTTGTAACGCATTTGGACTGTTTTGTTCCGCTTTTTGGTAGTTCCGTTCCGATTCATGCTGGTTCTGGTTCACAGAATCGCGGGTCTGGATCCCCTTAACCCATTTCGGATCATTCGGGTCACTAATCCCTTCAACAAATTCACCACGTGATGCAGCAAGCAACTTATCAGCGTCAGGCTGGCTGATATTGGCTGCCTGCATAATTTTGTTTACTTCGTCAGCGGTAACTTTTATCGGCTCTGGTTGTTCTGAATCTTCAGCGGTATCTACATTTTGCGGTAAGCCCGTGTATGTGCCATTTTTTCGGGCAAAATATTCTTCTTTTGTGATTTCAGTGGCGCCAGCAGCCAGTGCCTTATCCAGACCAGAAAGTTTGTTTGCGCGACCGTATTTTTCTCCGTCCTTATCTGCGAAGAGGAAATAGAACGGCCCCTCACGCTCTACAGATGGTTCAGCTTCCGGCGCGGTTTCATTTTTTGGGATATCAGATACCTCAGTTTCCACTGCATCAGTTTGTGTTTCTGATGACTGGAGAACATCAACAGTGCCCAGGTCTGTTTCTTCATTCTCAAACACGCCCTTTGTCGTCAGGTATTCGCAGATATATTTGTTCAGTGCTACGGGATCTTTGTGAATGTCGATCGGACGCTCACGGACAAGGCCAAAAATAGTTTGGCGGTCGTAGCGAAGGGCATCAGGCTGTTTGCGCATTGATGCCGAGATACGCTTCCAGTCTTCGCGGTCGTTGTCGATAACTTCTTTTTTTGCCCAGCGATGGATGCTGCCGTCAATGTTTCCGGCATCCACATCACCAGGCCAGAGAGCGTAGGCCAGTTCGTCATCCAGTGTTTTCCATGTCTGCTTGTATTCGCGATGAATGGCAGCAATGACCGGGCTGATTTTTCCTGTTGAATTTTCAGTGTACTGTTGATTGGCTCTGGCGCGTGCGAGATCAACAACAGACGTGTATTTTCCGGTTTCCTTGCGTTCACCTTCGCGACGTTTTTTCCAGATGCGCATCTCTGCCTGAATTTCGGGCCATTTAGCACCAGGAATACATTTATGCTTAACCCACCCAATGGCGTGCAACTTAAGCTCCGGATACATGGCGTTAACTTCTGGCATTTTCATCAACGCTTCAACGATATGTCCGTCGAATGTTGCCATGTCTTCCTGCAACAATTCCTGTGCGCTAATAACCATATCAACGGTGATGTTTTCACATGTGTCGAACTTAACCATGACAGCGTTCTGTACTTCAGGGGCCAGCTTGTCAAAAGTGACGTTCATCGGATCGGATTCAGTCTCAACCGGGACAAAAGAAGCAGACTCCTCATCCCAGCGGTTTTCCTGCATATATTCAGCATCCCATGAATCGAGGGCAGGGCGGGGTATGCCAGGTTTATCCTCGCAGACAATAAATTTATAAGCGCAGTCCTGAGCAGCCGGATAATGTTCCAGGAATTGCCAGTGAAATTTTGCTCGAGCACGGCGTTCGTCGCCAGCTTCAATGGCTGTGGCTACAGCCACAGCGCCTTCTTCCCTTGTTGCCAGTTCGTCAGGAATAGCGGCGCAAATAAAGACTTTACTCATTTTGTTTTAACCTCATGACAGATTTAAGGATGAACAAATCCCTGCCATTGCTGGCATATAAGAATGAAACCGGATATTTATTACGGAACTGTTTTAAAGACCTGCCGGGATTTCGATATTATCCTGGTTAATAACTTTATCGACCGGGTAACAGTTACCGGGAATTTTCTGTTCGGTTGCTGCAGTCATACACTCCTGCATTGTCCTGTGAACACTGACTGCAATATCAACTGGCACTCCGGAAACAAGAAAAACTGTCAGAACAAGCGCAAATGCTGAATTCATTGTGCACATCCTTTTGGCATCAGACGTAAACGAGCCAGCATTGAAACAATGCATATTTTATTTAATAGCTCCCGTTCTTGTTTTCTCTTGTTAATGGCATCTTCAGTAAATACAGGGTTACTGATAGTGACACCAATTTCAAAACAACCTTCAGACGTATTAACGTTTGGTAATAACGTTTTCATTATCGCGTCCTCAACAATGAATTTTGTGATGCAGTGCCTGGTGCCTCCAGGTGACGTTAACCAGTTAACAATTAACGTCGGATATCCGGATTAGTGATTTCAGGTTGTATCGTGAGATCAGTGATGGAAAAAGTATTACGTACATGATCGCCGGGTTAAATAAAGAATATGGCGATGTGGTGGAATCCGGACTGCTTTTTGCAGATCCTGCCGTTGTAGATCGTGAAACTGACGAACTTATAGAAAAAGCAATTGCTTTCAAGCTTGCGTATCGACAGCAATACCAACAAAAAGCTGGATGGAATTATGAGTCTTCTTTTTGCTGAACGCCCACTGGTTATAAACACGCAGCTGGCAATGAAAATTGGCTTAAACGAAGCCATTGTTTTGCAACAACTGCACTACTGGTTGAGAGATACCAACTCCGGCATGGAATGTGATGGTGTTCGCTGGATTTATAACACAACGGAACAATGGCTGGAACAGTTCCCATTCTGGTCAGAGTCAACGTTAAAGCGCGCGTTTGCAAGTCTGAAAACGCTGGGGCTTTTGCGTTGTGAAAAGCTCAATAAATCAAAGCGCGATATGACCAATTTCTACACGATCAACTACGGGAGCGAGCTTTTAGATGGTGGCAAATTGAGCGAATCCATCGGTTTAAAATGCGCCGCTCCATCAGGTCAAAATGACACGATGGAAGAGGTCAAAATGAAACGCTCCATTGGTTCAAAACGACTCAATGTCATCGGGTCAAAATGGCCTGATGATCTTACAGAGAATACAACAGAGATTACTACAGAGAATAAAAAGACTTCTCGTCCGGAAGCTTCGCAACCGGACCCGCAGACGGTTGAACAGGATTTTTTAACCCGACACCCTGACGCGGTTGTGTTCAGTGCGAAAAAACGCCAGTGGGGCAGCCAGGAAGATTTGGCGTGTGCGCAGTGGATCTGGGGGCGAATCGTGAGTCTTTACGAGCAGGCCGCCAGCGATGATGGCGAGATTTCGCGACCGAAAGAACCCAACTGGACCGCATGGGCCAACGACGTGCGCACAATGCGGATGCTGGATGGCAGAACTCACAGACAAATTTGTGAAATGTTTGGTCGGGTGCAGCGGGATCCATTCTGGGTAAAAAATATCATGAGTCCGTCAAAGCTTCGCGAAAAATGGGATGAACTGGTTATCCGCCTGGGGCGTTCGTCTGTACAGCGTTGTGTGAATCATATTTCTGAGCCGGATACCGAAATTCCGCCGGGCTTCAGGGGGTAAGTGTTAATTTCTGGTCATGAGGTAATTTTCAGGAGGGCTTGTGGCAAAAGTTTTTACACAAGAAGAGCGGGAAAAAATTAAAGGGCAGGTTGTTGAACTCGTACGCCGGAGTGGGCGTGAGACGTTACGGCAACTGGAAGTCAAGACAGGTGCGACAAGATATCTGATGAGCGTTCTCGCAAGAGAGCTGGTTGCCAGCGGCGATGTATACAACTCTGGTTACGGGTTATTCCCGTCTGAACAGGCGCGTAAGGACTGGCAAAATGCCCGTAAAAAGCTCTCAAGGGCAAAGCTGAAGGAACCATCTGCGGTTGATCCGGACCTTATCTGGTCATTACCTGACGGAGAAATACGTCGTTACGACAGGCGTCATAATATGATTTGTACTGAGTGTCGTAAAAGCGAAGTTATGCAGCGCATATTGTCGTTTTATCAGGGGGATGTCCGGTATTTATTGAAGTGA
Protein sequences of DBSCAN-SWA_4 >CP029122|2543045:2557148|2548399_2549614_+|AWF24144.1|DBSCAN-SWA MKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQCGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGTLWNW >CP029122|2543045:2557148|2556725_2557148_+|AWF27487.1|DBSCAN-SWA MAKVFTQEEREKIKGQVVELVRRSGRETLRQLEVKTGATRYLMSVLARELVASGDVYNSGYGLFPSEQARKDWQNARKKLSRAKLKEPSAVDPDLIWSLPDGEIRRYDRRHNMICTECRKSEVMQRILSFYQGDVRYLLK >CP029122|2543045:2557148|2543045_2545472_-|AWF25900.1|DBSCAN-SWA MSKNDRMVGISRRTLVKSTAIGSLALAAGGFSLPFTLRSAAATVQQASEKVIWGACSVNCGSRCALRLHVKDNEVTWVETDNTGSDEYGNHQVRACLRGRSIRRRINHPDRLNYPMKRVGTRGEGKFERISWDEALDTIASSLKKTVEQYGNEAVYIQYSSGIVGGNMTRSSPSASAVKRLMNCYGGSLNQYGSYSTAQISCAMPYTYGSNDGNSTTDIENSKLVVMFGNNPAETRMSGGGITYLLEKAREKSNAKMIVIDPRYTDTAAGREDEWLPIRPGTDAALVAGIAWVLINENLVDQPFLDKYCVGYDEKTLPADAPKNGHYKAYILGEGDDNTAKTPQWASQITGIPVDRIIKLAREIGTAKPAYICQGWGPQRQANGELTARAIAMLPILTGNVGISGGNSGARESTYTITIERLPVLDNPVKTSISCFSWTDAIDHGPQMTAIRDGVRGKDKLDVPIKFIWNYAGNTLVNQHSDINKTHEILQDESKCEMIVVIENFMTSSAKYADILLPDLMTVEQEDIIPNDYAGNMGYLIFLQPVTSEKFERKPIYWILSEVAKRLGPDVYQKFTEGRTQEQWLQHLYAKMLAKDPALPSYDELKKMGIYKRKDPNGHFVAYKAFRDDPEANPLKTPSGKIEIYSSKLAEIARTWELEKDEVISPLPVYASTFEGWDSPERRTFPLQLFGFHYKSRTHSTYGNIDLLKAACRQEVWINPIDAQKRGIANGDMVRVFNHRGEVRLPAKVTPRILPGVSAMGQGAWHEANMSGDKIDHGGCVNTLTTLRPSPLAKGNPQHTNLVEIEKI >CP029122|2543045:2557148|2546796_2547357_-|AWF24528.1|DBSCAN-SWA MPSAHSVKLRPLEREDLRYVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECDGEKAGLVELVEINHVHRRAEFQIIISPEYQGKGLATRAAKLAMDYGFTVLNLYKLYLIVDKENEKAIHIYRKLGFSVEGELMHEFFINGQYRNAIRMCIFQHQYLAEHKTPGQTLLKPTAQ >CP029122|2543045:2557148|2555556_2555739_+|AWF28329.1|DBSCAN-SWA MIAGLNKEYGDVVESGLLFADPAVVDRETDELIEKAIAFKLAYRQQYQQKAGWNYESSFC >CP029122|2543045:2557148|2546170_2546794_+|AWF24715.1|DBSCAN-SWA MASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY >CP029122|2543045:2557148|2547867_2548194_+|AWF25359.1|DBSCAN-SWA MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLTLHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLTLYDWTGALIALCGMLIIVAGWGRT >CP029122|2543045:2557148|2545670_2545976_-|AWF27806.1|DBSCAN-SWA MKLSTCCAALLLALASPVVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTRTTSGNVSAPAQSTQDGAPAEPQ >CP029122|2543045:2557148|2555719_2556685_+|AWF27864.1|DBSCAN-SWA MSLLFAERPLVINTQLAMKIGLNEAIVLQQLHYWLRDTNSGMECDGVRWIYNTTEQWLEQFPFWSESTLKRAFASLKTLGLLRCEKLNKSKRDMTNFYTINYGSELLDGGKLSESIGLKCAAPSGQNDTMEEVKMKRSIGSKRLNVIGSKWPDDLTENTTEITTENKKTSRPEASQPDPQTVEQDFLTRHPDAVVFSAKKRQWGSQEDLACAQWIWGRIVSLYEQAASDDGEISRPKEPNWTAWANDVRTMRMLDGRTHRQICEMFGRVQRDPFWVKNIMSPSKLREKWDELVIRLGRSSVQRCVNHISEPDTEIPPGFRG >CP029122|2543045:2557148|2547391_2547733_-|AWF25060.1|DBSCAN-SWA MKITLSKRIGLLAILLPCALALSTTVHAETNKLVIESGDSAQSRQHAAMEKEQWNDTRNLRQKVNKRTEKEWDKADAAFDNRDKCEQSANINAYWEPNTLRCLDRRTGRVITP >CP029122|2543045:2557148|2552471_2554943_-|AWF27263.1|DBSCAN-SWA MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHREYKQTWKTLDDELAYALWPGDVDAGNIDGSIHRWAKKEVIDNDREDWKRISASMRKQPDALRYDRQTIFGLVRERPIDIHKDPVALNKYICEYLTTKGVFENEETDLGTVDVLQSSETQTDAVETEVSDIPKNETAPEAEPSVEREGPFYFLFADKDGEKYGRANKLSGLDKALAAGATEITKEEYFARKNGTYTGLPQNVDTAEDSEQPEPIKVTADEVNKIMQAANISQPDADKLLAASRGEFVEGISDPNDPKWVKGIQTRDSVNQNQHESERNYQKAEQNSPNALQNEPETKQPEPVAQQEVEKVCTACGQTGGGNCPDCGAVMGDATYQETFDEEYQVEVQEDDPEEMEGAEHPHKENTGGNQHHNSDNETGETADHPIKVNGHHEITSTSRTCDHLMIDLETMGKNPDAPITSIGAIFFDPQTGDMGPEFSKTIDLETAGGVIDRDTIKWWLKQSREAQSAIMTDEIPLDDALLQLREFIDENSGEFFVQVWGNGANFDNTILRRSYERQGIPCPWRYYNDRDVRTIVELGKVIDFDARTAIQFEGERHNALDDARYQAKYVSVIWQKLIPSQADS >CP029122|2543045:2557148|2549625_2550645_+|AWF24829.1|DBSCAN-SWA MKSILIEKPNQLAIVEREIPTPSAGEVRVKVKLAGICGSDSHIYRGHNPFAKYPRVIGHEFFGVIDAVGDGVESARVGERVAVDPVVSCGHCYPCSIGKPNVCTTLAVLGVHADGGFSEYAVVPAKNAWKIPEAVADQYAVMIEPFTIAANVTGHGQPTENDTVLVYGAGPIGLTIVQVLKGVYNVKNVIVADRIDERLEKAKESGADWAINNSQTPLGEIFAEKGIKPTLIIDAACHPSILKEAVTLASPAARIVLMGFSSEPSEVIQQGITGKELSIFSSRLNANKFPVVIDWLSKGLIKPEKLITHTFDFQHVADAISLFEQDQKHCCKVLLTFSE >CP029122|2543045:2557148|2555036_2555228_-|AWF24420.1|DBSCAN-SWA MNSAFALVLTVFLVSGVPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVINQDNIEIPAGL >CP029122|2543045:2557148|2550832_2552113_-|AWF26206.1|integrase|DBSCAN-SWA MKYPTGVENHGGKLRIWFVYKGVRVRENLGVPDTAKNRRVAGELRSSVCYAIKTGVFDYAKQFPSSRNLEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEKNLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGYLADNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTGVRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAGTNRVIHLIKPAIDALRSQMTLTRLSKEHIIDVHLREYGRTEKQKCTFVFQPEVSARVKNYGDHFTVDSIRQMWDAAIKRAGLRHRKSYQSRHTYACWSLTAGANPAFIANQMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN >CP029122|2543045:2557148|2552147_2552384_-|AWF28350.1|DBSCAN-SWA MIVSPGKWVSEEQLIALKGIKKGTLKKAREKSFMEGREYKHVAHDGMPWDNSPCFYNLEEIDRWIERQASARPRRHLT >CP029122|2543045:2557148|2555224_2555413_-|AWF25794.1|DBSCAN-SWA MKTLLPNVNTSEGCFEIGVTISNPVFTEDAINKRKQERELLNKICIVSMLARLRLMPKGCAQ |
16 | Escherichia_phage(22.22%) | integrase | attL 2540749:2540762|attR 2553114:2553127 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2564076 : 2571468
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP029122|2564076:2571468|DBSCAN-SWA CGTGCGGGTATTACTTCGACCTGTTCTGGTACCGGAACTCGGGCTGGTGGTCGTTAAGCCGGGCCGTGAATCCATGCCGGTATTCCACAATACCCGGGTACTGGTGGAGCCGGAACCGAAAAGCATGCGTAATCTGCCGTCCGGGGTCGTTCCTGCCGTTCGCCAGCCGCTGGTGGAAGACAAAACATTGCTGCCGTTTTTCAGTAACGCACGGGTAATTCGTGCTGCTGGTGGTGCTGGTGCATTGTCTGACTGGCTGTTGCGCCATATTAAATCCTGCCAGTGGCCACACGGCGATTATCATCACAGCGAAACCGTCATTCACCGTTATGGTACCGGCGCAATGGTGTTGTGCTGGCACTGCGACAACCAGCTGCGTGACCAGACATCCGAATCACTCGAGCAACTTGCTCATCAAAACCTGTCAGCATGGATGATTGACGTCATCGGTCACGCAATAAGCGGTACGCAGGAGCGTGAATTATCTTTGGCTGAATTATCCTGGTGGGCGGTCCGCAATCAGGTGGCGGACGCGCTACCGGAAGCGGTATTACGTGGTTCGCTGGGGTTGCGTGCGGAAAAAATCCGCTCAATGTACCGTGAAAGCGACATCGTACCGGGAGAGCAGACCGCCAACAGCATACTGAAACAGCGCACAAAAAATCTTGCGCCGCTGCCTCACGCCCACCAGCAACAGAACCCACCACAGGAAAAGACGGTGGTCAGCATTGCCGTTGATCCTGAGTCTCCGGAATCTTTCATGAAACGACCTAAACGTCGCCGCTGGGTTAACGAGAAATACACACGCTGGGTGAAGACACAGCCGTGTGCGTGTTGTGGTAAGCCAGCCGACGATCCCCATCACCTGATTGGTCATGGTCAGGGCGGAATGGGGACAAAATCTCACGATATTTTCACGCTACCGCTGTGTCGGGAGCATCACAACGAGCTTCATGCGGATCCTCTGGCGTTCGAAGAAAAGCATGGTTCTCAGGTTGATTTAATTTTTCGTTTTCTTGATCACGCCTTTGCAACTGGCGTGCTTGGGTAAAAGAGGTGACTGATGCTCATAGATTTGGTTTTACCTTACCCGCCGACGGTGAACACTTACTGGCGACGCCGTGGCAGCACATATTTTATCTCGGAGGAGGGAAAGCGTTATCGCCGGGCTGTGGCGCTTATTGTTCGCCAGCAGCGGCTGAAATTAAGCCTGTCCGGAAGGCTGGCGATAAAGGTGATTGCAGAGCCACCGGATAAGCGTCGTCGCGACCTGGACAATATCCTGAAAGCACCGCTGGATGCGCTGACGCATGCGGGAGTGTTAATGGACGATGAGCAGTTTGATGAAATCAATATCGTTCGTGGTCAGCCAGTATCTGGTGGACGTCTGGGGGTGAAGATTTACCCCATAATGCATGAAGAGCAGGTCAAAAAATGAAACTGGAAGATTTACCGAAATACTACTCCCCAAAATCCCCTGGCCTGACCGATGCATCGGCCTCAACGTCAAAAGATGCGCTGAGTATCACTGATGTGATGGCCGCGCAGGGCATGACACAGAATCGGGCTGAGATGGGTTTTTCTGCGTTCCTGGGGAAAATGGGCATCAGTATGAATGACAGGGCGCGGGCAACAGAATTACTGGCAGATTATGCACTCAGTCGGTGCGATCGTGTGGCGGCGTTGAGAAAGCTTCCGGCAGAAATAAAACCGGTAGTGATGCGCATTATGGCTTCGTACGCTTTTGAGGATTATGCCCGCAGCGCAGCGAGTAAAAAGCAGTGCCCTTGTTGCTATGGGGAAAAATTTATTGAAAGCGTAGTTTTTACAAACAAGGTCCAGTATCCGGATGGTAAGCCGCCGGTATGGGCAAAGTGTACGAAAGGTGTGTATCCGTCTTACTGGGAAGAATGGAAAAAAGTCAGGGAGGTGGTAAAAGTTGCCTGTCCGGAGTGTGGCGGAAAGGGTGAGGTTTCCACCGCCTGTAAGGATTGCCGTGGGCGTGGTGTCGCCATTCATCGTGAAGAGTCGGTAAAACGTGGTATGCCTGTTATCAGAGACTGCCAGCGTTGTGGTGGTCGTGGCTATGAAAGACTACCATCAACGGAGGCATTTAATGCTATATGCGAGGTGACAAACCAGATAACACGCGCGTCATGGGAAAAAACAGTTAAGAAATTCTATGATGCGCTGGTGACCCGGTTTGATATTGAAGAAGCATGGGCTGAGCGGCAGTTAAAAAAGGTAACTAGGTAACAAGGTTGATTTTTCCGGAATCTGTGGTAAATTCGTCATAACGATGGGCGTTTTATGCCTGACGTTAGAAGAGTTTCTACAACCCGCCGCCGAGCGGGTTTTTTATTGCGGAATTAATTATGGACCGTTATTATTCTGCTCCCGGCCCTTTAGCTCAGTGGTGAGAGCGAGCGACTCATAATCGACAGGTCGCTGGGTCAAATCCAGCAAGGGCCACCAACCGTCACCAGTTCATCAGGAAAGAGCGTCAACCCTTTAAGTTGAGTGTGCGAGGTTCGAGTCCCCGGTGGCGGTCCAGTGCCGACTTCGCTCAGTAGGTAGAGCAACTGACTTGTAATCAGTAGGTCACCAGTTCGATTCCGGTAGTCGGCACCATATGCGGGCATCGCATAATGGCTATTACCTCAGCCTTCCAAGCTGATGATGCGGGTTCGATTCCCGCTGCCCGCTCCAGTTAGAGTCTTTCAGTCTGCGATGATGGGAAATCCCGGAGTGACTGAAAGACGTTTAAGTTATGAATGATCGCTTTTTTTTGCAAAATTGCTGTGCAGAAATACTAACCTTCGGGCAGGCGATCATTCATAAGCACTCTGCTTTTATTCCGATTAACTGTGGGTGGTTTGTTGGATAGAGTGCTTTCCTTACTGTATATATTGTTTCGCCCGCTTTTGCGGGCTTTTCTTTTCAAATCCCTTTCATTTCTCAGTGTAAAACTACGCCATCCGTTATTTGCGGAGGTGAGGCTATGAAATCCATGGACAAAATTTCAACGGGCATTGCCTATGGCACCTCCGCAGGCAGTGCTGGCTACTGGTTTTTACAGCTGCTCGATAAAGTCACGCCCTCACAGTGGGCAGCAATAGGTGTGCTGGGTATCCGCACCGGCGGGCGCGTGCTGGCGGTAAACAGCCAGACCCGGACGCTGACGCTCGACCGTGAAATCACGCTGCCATCTTCCGGCACCACGCTGATAAGCCTGGTTGACGGGCAGGGGAGTCCGGTCAGCGTGGAGGTTCAGTCCGTCACCGACGGCGTGAAGGTGAAAGTGAGCCGTGTTCCTGACGGCGTTGCTGAATACAGCGTATGGGGGCTGAAGCTGCCGACGCTGCGCCAGCGCCTGTTCCGCTGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTACCGGAAAAAGAGGCCATCGTGGATAACGGGGCGCACTTTGACGGCGACCAGAGCGGCACGGTAAATGGTGTCACGCCGCCAGCGGTGCAGCACCTGACCGCAGAAGTCACCGCAGACAGCGGGGAATACCAGGTGCTGGCCCGCTGGGACACGCCGAAGGTGGTGAAGGGCGTGAGCTTCCTGCTTCGCCTGACCGTGGCAGCGGATGACGGCCGTGAGCGGCTGGTCAGCACGGCCCGGACGACGGAAACCACTTACCGCTTCACACAACTGGCTCTGGGGAACTACAGGCTGACAGTCCGGGCAGTAAATGCGCGGGGGCAGCAGGGCGATCCGGCGTCGGTATCGTTCCGGATTGCCGCACCGGCAGCGCCGTCGCGGATTGAGCTGACGCCGGGCTATTTTCAGATAACCGCCACGCCGCATCTTGCGGTTTATGACCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAAAACGGATTGCGGATATCAGGCAGGTTGAAACCAGCGCGCGTTATCTTGGTACGGCACTGTACTGGATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTATTTTTACGTTCGCAGTGTGAACACCGTTGGCAAATCGGCATTCGTGGAGGCTGTCGGTCAGCCGAGTGATGATGCATCAGGCTATCTGGATTTTTTCAAAGGCGAGATAGGGAAAACCCATCTGGCTCAGGAGCTGTGGACGCAGATTGATAACGGTCAGCTTGCGCCTGACCTGACTGAAATCAGGACGTCCATAACGGATGTCAGCAATGAAATAACACAGACCGTCAATAAGAAACTGGAAGACCAGAGTGCAGCGATCCAGCAGATACAGAAGGTTCAGGTTGATACAAATAATAACCTGAACAGCATGTGGGCAGTGAAGCTGCAGCAGATGCAGGACGGACGCCTTTATATTGCGGGTATCGGTGCCGGTATTGAGAACACCCCTGACGGCATGCAGAGTCAGGTGCTGCTGGCAGCAGACAGGATTGCGATGATTAATCCTGCGAATGGCAACACAAAGCCGATGTTTGTTGGTCAGGGCGATCAGATATTCATGAATGAAGTGTTCCTGAAACGCCTGACGGCCCCCACCATTACCAGCGGCGGTAATCCTCCGGCATTTTCCCTGACATCAGACGGGAGACTGACGGCGAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACTCAGGAACGCTCAACAATGTCACGATTAACCAGAACTGTACGATTAAGGGCATGCTGGAGGCGACCCAGGTCAGAGGAGATTTCGTTAAAGCTGTATCAAAAGCCTTCCCGAAAAAAGTCGGTACGTGGGGTAACACGGAAACACCAAACGGTACGGTTACAGTAACCATCAGCGATGATCATAACTTTGACCGCCAGATTATTATTCCGCCCATTATTTTTAACGGTATAGCGTATGACGATCCGGGGAGCGGAAATAACCCAGGAGGCACGCGATACACGGGTTATGGTTTTGAAGTTCGCAAAAACGGCGTATTAATCGCATCCAGAGAAACTAAAGGGGCCATTCCCGGTAGTTACAGTGCAGTTATTGATATGCCGAGTGGCAGGGGAAGCGTCACTCTGGAGTTTAAGATTTTCCAGAAAGGCAATCAGGGGGCAGGCAATATCACCGACTGTACGGTGATTGTGACCAAAAAAGCGGCTTCCGGCATCAGTATTCGTTGAAATATTTATAACCCCAATAAAGGGCGTCAGGAATGACGCCTTTTTTATTGCAGAAAAGCGAGAGGTAATTATGCGTAAAGTTTGTGCAGCAATTTTGTCCGCAGCCATTTGTCTGGCCGTATCCGGTGCGCCTGCATGGGCGTCTGAACATCAGTCCACGCTGAGCGCGGGGTATCTTCATGCCCGGACCAACGTTCCCGGCAGTGATGATCTGAACGGGATTAACGTGAAATATCGTTATGAGTTTACGGATACGCTGGGGCTGGTGACGTCATTCAGCTATGCAGGAGACAAGAATCGCCAGCTGACCCGTTACAGCGATACCCGCTGGCATGAAGATTCCGTGCGTAACCGCTGGTTCAGCGTGATGGCGGGGCCGTCTGTGCGCGTGAATGAATGGTTCAGCGCGTATGCGATGGTACTGGTGGAAGAACTATCGAGCAAGCACGTGCGAACTTGCGGGTAATGTATGAGCAAAAAGCTGGCCTTGCTAATACTGACCTAAACACCCTTACCGGTGAATATTCTGGTTTCTATCAACAACCAACGAGCGCTTACGCAACAGAAGAGTTAAATTACCCAATCGGTCTGGCGGGCGCTTTAATAGTGCTCCAAACGAGAGCCAACACTGCTTCTTCCTGCGTTCAGGTGTACCACCCTTATAATAATCCGGGAATTACTTATAGACGAATATATGAAGGAGGTAGCGGTACCTGGTCTGAATGGAAGAGAGATGTATCAACAGAAAGGGTTGAAGAGGGAAAAGAAACAACTTACGTATATTCTACGTATTCTTCAGGCGCACCACGCTTACAGGTTTCCAAATCTGGTTTGTGGGGTTGTCATAATGGCACTGGCTGGTTGCCATTAGCTGTTGGGCAAGGAGGTACAGGTGCGACAACAGTAGAAGATGCGCGAAACAACTTAAGTCTTGGCGAAAGTAGCGCAGTTAAATTTAAAAACCTTACTTTAACCGAAGCGCTCGACACGACATTAGGACTGCTTACAAAAACAGGACGAGACTGGAACACGCAGCATACTGATAACATTAATAAATTTATACCAATTGCAGGCAGTACAAACGGCCCGGCAGGCTCTATGGTTCTTGGCGGCATTCATGTTCAATTTAGTAAAAATTATGCTGTGCAGTTCGGAGGCCGCAATTCCGGTTTTTGGGGAAGAACAATTGAAAATGGAACGACACAGGAATGGAAGAAATTACTAACAGTAGACGATCTCAATTCATCTACCGATCTTGCTGTCAGGTCATTAACCACATCTAACCCGGTAAAATCTGGCGGAGGGCGAATTGATGTCCTTGGAAGCACGTCAGACTATAGCAAAATGGATTGCTTTGTACGTGGGTTTGATAGCACCGGTAATTCTCTCGTGTGGGCGTTGGGTTCATCAGTCGGCGTAAGTAAGATGCTATCGCTAAAAAATTTCTTTAGCGGAGCTGAGATACTGTTAAATGGTAATGACGGCGCGGTTCAACTCAAAACAGGTGCTGTTAACGGGGCTACAGCGCAGACGCTCACTATCAACAAGAATGAGGTTAACTCAACCGTTGATTTAACCCTTACAAAGCAATCAGGGACTGGTAATCGTTTTGTTTTACAGAACTCAGGTAATGCAGAACTACCGTTTTCTGTCAGGGTGTGGGGTTCCAGTACTCGACAAAACGTTTTTGAGGTTGGAACGTCTGCTGCGTATCTGTTTTATGCGCAAAAAACGTCAGCAGGCCAGTTGTTTGATGTAAATGGCGCTATTAATTGCACAACGCTGAATCAGTCATCAGACCGCGACCTTAAAGACGATATTCTCGTTATCAGCGACGCGACGAAAGCAATCCGTAAAATGAACGGATACACCTACACGCTCAAGGAAAACGGGATGCCTTATGCTGGCGTTATTGCACAGGAAGTAATGGAGGCGATACCAGAAGCTGTGGGATCGTTTACTCATTATGGTGAAGAGTTGCAAGGTCCGACCGTTGACGGCAACGAGCTACGCGAAGAAACTCGCTATCTTAATGTTGACTACTCCGCCGTGACGGGTTTACTTGTTCAGGTCGCCCGTGAAACAGATGATCGCGTTACCGCGCTGGAAGAGGAAAACACAACGCTACGTCAAAATCTGGCAACAGCAGGCACCCGGATCAGCACTCTGGAAAATCAGGTAAGCGAACTGGTTGCACTTGTCCGGCAGTTAACAGGAAGCGAACATTGA
Protein sequences of DBSCAN-SWA_5 >CP029122|2564076:2571468|2565509_2566331_+|AWF28268.1|DBSCAN-SWA MKLEDLPKYYSPKSPGLTDASASTSKDALSITDVMAAQGMTQNRAEMGFSAFLGKMGISMNDRARATELLADYALSRCDRVAALRKLPAEIKPVVMRIMASYAFEDYARSAASKKQCPCCYGEKFIESVVFTNKVQYPDGKPPVWAKCTKGVYPSYWEEWKKVREVVKVACPECGGKGEVSTACKDCRGRGVAIHREESVKRGMPVIRDCQRCGGRGYERLPSTEAFNAICEVTNQITRASWEKTVKKFYDALVTRFDIEEAWAERQLKKVTR >CP029122|2564076:2571468|2566845_2566962_+|AWF25515.1|DBSCAN-SWA MNDRFFLQNCCAEILTFGQAIIHKHSAFIPINCGWFVG >CP029122|2564076:2571468|2567223_2569239_+|AWF25967.1|DBSCAN-SWA MLAVNSQTRTLTLDREITLPSSGTTLISLVDGQGSPVSVEVQSVTDGVKVKVSRVPDGVAEYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFLLRLTVAADDGRERLVSTARTTETTYRFTQLALGNYRLTVRAVNARGQQGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRIADIRQVETSARYLGTALYWIAASINIKPGHDYYFYVRSVNTVGKSAFVEAVGQPSDDASGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLTEIRTSITDVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDGRLYIAGIGAGIENTPDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQIFMNEVFLKRLTAPTITSGGNPPAFSLTSDGRLTAKNADISGSVNANSGTLNNVTINQNCTIKGMLEATQVRGDFVKAVSKAFPKKVGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYDDPGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRGSVTLEFKIFQKGNQGAGNITDCTVIVTKKAASGISIR >CP029122|2564076:2571468|2569309_2569705_+|AWF26592.1|DBSCAN-SWA MRKVCAAILSAAICLAVSGAPAWASEHQSTLSAGYLHARTNVPGSDDLNGINVKYRYEFTDTLGLVTSFSYAGDKNRQLTRYSDTRWHEDSVRNRWFSVMAGPSVRVNEWFSAYAMVLVEELSSKHVRTCG >CP029122|2564076:2571468|2564076_2565126_+|AWF26497.1|DBSCAN-SWA MRVLLRPVLVPELGLVVVKPGRESMPVFHNTRVLVEPEPKSMRNLPSGVVPAVRQPLVEDKTLLPFFSNARVIRAAGGAGALSDWLLRHIKSCQWPHGDYHHSETVIHRYGTGAMVLCWHCDNQLRDQTSESLEQLAHQNLSAWMIDVIGHAISGTQERELSLAELSWWAVRNQVADALPEAVLRGSLGLRAEKIRSMYRESDIVPGEQTANSILKQRTKNLAPLPHAHQQQNPPQEKTVVSIAVDPESPESFMKRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKSHDIFTLPLCREHHNELHADPLAFEEKHGSQVDLIFRFLDHAFATGVLG >CP029122|2564076:2571468|2565246_2565513_+|AWF27083.1|DBSCAN-SWA MALIVRQQRLKLSLSGRLAIKVIAEPPDKRRRDLDNILKAPLDALTHAGVLMDDEQFDEINIVRGQPVSGGRLGVKIYPIMHEEQVKK >CP029122|2564076:2571468|2569704_2571468_+|AWF25712.1|DBSCAN-SWA MYEQKAGLANTDLNTLTGEYSGFYQQPTSAYATEELNYPIGLAGALIVLQTRANTASSCVQVYHPYNNPGITYRRIYEGGSGTWSEWKRDVSTERVEEGKETTYVYSTYSSGAPRLQVSKSGLWGCHNGTGWLPLAVGQGGTGATTVEDARNNLSLGESSAVKFKNLTLTEALDTTLGLLTKTGRDWNTQHTDNINKFIPIAGSTNGPAGSMVLGGIHVQFSKNYAVQFGGRNSGFWGRTIENGTTQEWKKLLTVDDLNSSTDLAVRSLTTSNPVKSGGGRIDVLGSTSDYSKMDCFVRGFDSTGNSLVWALGSSVGVSKMLSLKNFFSGAEILLNGNDGAVQLKTGAVNGATAQTLTINKNEVNSTVDLTLTKQSGTGNRFVLQNSGNAELPFSVRVWGSSTRQNVFEVGTSAAYLFYAQKTSAGQLFDVNGAINCTTLNQSSDRDLKDDILVISDATKAIRKMNGYTYTLKENGMPYAGVIAQEVMEAIPEAVGSFTHYGEELQGPTVDGNELREETRYLNVDYSAVTGLLVQVARETDDRVTALEEENTTLRQNLATAGTRISTLENQVSELVALVRQLTGSEH |
7 | Enterobacteria_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2973073 : 2983851
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP029122|2973073:2983851|DBSCAN-SWA GTCATTTTGCTAATAACACTTTTCTGGCTTCATCGACTTTATTTGGAGCATCGAACCAATATTCGGCGAGCAAAGGGCAGATCTCTGTATCAACGACTTGTTCATACCAAGCCTGGGCATCGTTGATTTTTTGGCCGATAGCTGGGGTGACATAGCTGTGACCGATACAAAACTGTGGTCCTAAGGTAACATCTTTTGCCAGCATATCGTTAAGTACCGTCAGTCGAGATTTGATGAATGCTAACATGTCAAAATCAATTGCGTAATTATAATTTACCCAGTTTCTCCACGCGTCATTAAAAGCTGGCTTTAGATCGATAAACGCGAAACGACGGCGCAGCGCTAGGTCAAGTAGTGCGAGTGAGCGGTCTGCAATATTCATCGTACCGATGATATATAGATTCTCAGGAATGTATATTTTTTCATCATCATTTTTAGGATAGGAAAGAGATAATGCCTCAGTCGGCGTGCGTTTATCTGCTTCCATCAACGTGAGTGTTTCACCGAAGATTTGCGCCGGATTGCCACGATTAATCTCTTCAATTATCACCACATATTTCGAGGTAGGATTATTGACTGCAGTTTTGATTGCATTTACAAAAGGTCCATCAATTAGCGTCAATTGCCCTTCTTTACCTGGACGCCAGCCGCGAATAAAGTCTTCGTAAGAGAGGTTCGGGTGAAATTGCACCGCGCTAATACGCTCAGGTGCTTTTTCTCCCATCAAGCAGTACGCCAGACGTCGCGCTAACCAGGTTTTTCCAGTTCCGGGTGGTCCTTGTAATATCAGGTTTTTCTTGTCGATCAGGCGCTGAAGTGTGAGTTGGATCTTAGCCTCTTCCAAGAAACAGCCATCCTGCACCAGATGACTGATGTCATAAGGAACGTGAGTGAGTTTTGGCAACGGCGCACTCTCTTCGACAGTTTCCTCAGTTATCGTCTGGGATTCATTTTTCTCAAAATTCAGGTATTCATAATTTCCGGACTCAAGGGACTTTAACTCATCATCATTGCATAGTTTCTGTAGATAAAAGTAGATTGAGCTTGTAACTGTTTTGCTCTCAGGATGCTCTACCTTGAATACGTCCAAGTAGTTTTCTTCCAACTCCGAAGCAGTGAAATAAGGGCCATTTTTTTTCAGGCATAACGCCTTAATTTTATTCAGTAAAGAGGCTTTCCACGTTCTATTTGCCACCTCATCTTTAGACTGGCTAAGATCTGTATTCCACGCTGATAAAGAAAGTTCTGGGAAAGAATGAACCGGATAGTTTGGTTGGGTAAAGACCTCGTTCAGCGCCCGCATAAGACTCAGGTAACTTTGTCCACTACAGCGACCTTTTGCCCCGTTTTTAATGATCTTAATGTTTAACACTGTCTGAATGTAATACTGCGACTGGCTATCTAAAGTAGGGTAGAACCAAGGACGGGTCCAGTACAACCCCATGGTGAGATTCCAACCGACATTCATTACTGTAGAAGCAATGTCATATGCGGCAGTGAAGTCTGCAGAGTTAGTATTCTGGTTATCCGCAAAGGTCATTGCCTGCGAGAACATTTCCCACAAGCATTCAATGTCATTAGGGTCGCGTGACTTTTCATAACCAAAGAACCAAGATTTTTGGTTATTCAACAGCGGGATTCCGGCAAAGGAGTCAGGAATCGGTTCGTTCACGCCCAACAAATTCGCTAGCTTGGCAGCAATAATTTTGCGATTGCTGTCGGTCAAGTTACGATTGAACAAGCCCATAGTAGTAAACGGACAAATGTCTTTTAAGGGAAAGATCTCTCCCATAATAGATTTGTCCTGCAGATGGGACATTCCTTCCACACCTGAGGCAATTAGATGAATACCTTTGACTAATTCATCTCTACGATTTCGCCAAGTCAGTAACGCGTTGGCAAAAGCCTCATAAAAACTAGCCCAAGCAAATTTGCCATCATGTTCTGCTGTATCCACGGGAACTCCATTATACTTGTTGAGCAATGATAATTTATCTGTGTGAGTTTTAACATATTACTATCAGTTATAGAAAAATTTAACTACCCGATACAGAGAGCGGCATGCTGAATTTGACCTGACTTGCTTCCAACTAATTAAAATCAACTTATTTATCAATTGGTTATTTTGGCGCATAGCGGTCATCAAGGGAATATCGCGTTGTCATAAGGTGTCGAGGCTCGGAGGTTCAAATCCTCTCATGCAAAAAATAAATAAAATTAATGACGGTTGGAAATTATTCAATACATACACTCTCGAAAGTGCATCAGCCAACCGCAGCACGTCTTGCATACGGCGTGTCTGCAGTTTTATATAATCCTGGCTGGAAACCTCTTATACAAAGTAGATACACCAATATCATAGATGATCGCCACCTTCTGACGCGGGACTTTTGATGCAATTAATCGCCCGGCCTGCGCCCATTGTTCTGATGTAAGTTTAGGACGACGTCCACCAATTCCTCCCTGTGCGCGAGCATCTTCCAGTCCAACTTTTGTCAGTCATCAATCAGTTCACGCTTCATTTCAACCAGGGCTCCATCACATGGAAAAATAAAAAAAACCATCGATGTTGACGTGTCAATGTTACCTGTCAGGCTACGGAAATCCCCCCCTTACCCGCAGCTACTCGGTAAGTATAATAAGATGTTTCCTGCTCCTTCCTAACCTGTCCAGTTTCCAGACAATTAAAGTATCACCTTATATAAGAGACTGCAGAGCGCACTTTAACCTAGGTCGTTCTGAAATAGTTGCAGATTCGATTAATCAACGATTATACTCCCCGGCACTCCAGAGGATCTGGTAAACATAAAGTCAGAGCTTGGGTATACTGGCACACAAATGGCAGATCTTGCAGGTGCAGCCAGTCATAGCCAGTGGCGAAAATACACGAGTGGTTCTGAGCCCCGCGCCATGTCATCACATATCTTGTTTTTTTATTGCTACCAATCTGACTTTGAGTACTAATGAGCTAGATAGAATTGTTGGTAATGACTCCAACTTACTGATAGTGTTTTATGTTCAGATAATGCCCGATGACCTTGTCATGCAGCTCCACCGATTTTGAGAACGACAGTGACTTCCGTCCCAGCCTTGCCAGATGTTGTCTCAGATTCAGGTTATGTCGCTCAATGCGCTGAGTGTAACGCTTGCTGATAACGTGCAGCTTTCCCTTCAGGCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATACCACGACCTCAAAGGCCGACAGCAGGCTCAGAAGACACTCCAGTGTGGCCAGAGTGCGTTCACCGAAGACGTGCGCCACAACCGTCCTCCGTATCCTGTCATACGCGTAAAACAGCCAGCGCTGACGTGATTTAGCACCGACGTAGCCCCACTGTTCGTCCATTTCAGCGCAGACAATCACATCACTGCCCGGTTGTATGCGCGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAACCGTGTTGAGGCCAACGCCCATAATGCGTGCACTGGCGCGACATCCGACGCCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGCTGAGAGGCGGTGTAAGTGAACTGTAGTTGCCATGTTTTACGGCAATGAGAGCAGAGATAGCGCTGATGCCCGGCAGTGCTTTTGCCGTTACGCACCACGCCTTCAGTAGCGGAGCAGGAAGGACATCTGATGGAAATGGAAGCCACGCAAGCACCTTAAAATCACCATCATACACTAAATCAGTAAGTTGGCAGCATTACCTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAACGCCTTTACCTGATTTAGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCCGATTCGGGTAATGTTGACCATTCACTGACCACATTATTGATGCCGATATTCCGCCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTTAATTCGGAAAGTTGCTCGTTGCTAACCCAATCCAGTTTTTTATTAAAGCCATATGTTTTGCGCATGACAGCCAGAAAGACCAGAAGCTGGTGCTGTGTTAATCCGGCCAGCATCACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCCGCTCCTTTTGTGCCACATCCGGCACAGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCCGCAGACCGACTTTCGCCATTTTTGAACCTGTCATATTGCCCCCAGCATGGTGGTGACCATCGCCATCAATGGACCAGCCAGATCCGGGTCCACTCGAAACATCGACACAATGCCTTCACTCATCTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACCGCCTGCTTTGCCTCACAGAGTTCCTTTTCCATTTCAGCCAGCCGAGCCATGAAGCTATCCTGCTCAACCAGGTGGCCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATAGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAGGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTAGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACCGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATTGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTGAAACAGAAAGCCTCTGAGCAGAAGGTGGCTGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGAGATGATGCGTGGAACAAATTACGACTCGGCGTCATCACGGCTTCAGAAGTTCACAATGTGATAGCAAAACCCCGCTCCGGTAAAAAGTGGCCTGACATGAAAATGTCCTACTTTCACACCCTGCTGGCTGAGATTTGCACCGGTGTGGCTCCGGAAGTTAACGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAATGACGCCAGAGCCCTGTTTGAGTTTACTTCCGGCGTGAATGTTACTGAATCCCCGATCATCTATCGCGACGAAAGTATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAGCTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAGTTCCGGCTCGGTGGTTTCGAGGCCATAAAGTCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACACGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGTATGAAGCGTGAAGGACTGCATTATGTCGTGGTTGAGCGGGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGCGCGATCACTTTCGTCTACTCCATTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCACTGAAAGCACAGCGGCTGGCTGAGAAGATAAATAATAAACAGGAGGATATATGAGTCAGGTTGGTAATCATTCATTCGAATTTCCGGCATCGCAAGGTGTACAGGGTGGTACTGTTACACTCTTCCTTACCATACCAGGAAGATCGCTGGCTCGTTTCCTCGCTTCAGATAATTACGGCCATACACTGGAACGCTCTCAGCGAGAAATTAATCCAAATCGAGTACGAAAATTTTTAAATTATCTCACTAACGCAGACTCAAGAAATGAGTCTTTTATCATTCCCCCTCTCGTAGGTAACTGTGATTCGAATATAGAATTTGTACCGTTTGGCAACACAAATGTTGGTATAGCCAGAATTCCCCTCGACGCCGAAATAAAACTTTTTGATGGCCAACATCGTGCAGCTGGCATTGAGATATTTTGCCGAAGTTCCCCATCAACGCTCATGGTTCCCATGATGCTTACAATGAATCTGCCGCTAAAAACCCGGCAGCAGTTCTTTTCGGACATAAATAACAACGTTTCTAAGCCATCAGCGACCATCAATATGGCGTATAACGGCCGGGATGATATTGCTCAGGGAATGATATCCTTCCTGACCCAACATACTGTATTTGCCGATATAACCGATTTTGAACACAACGTAGTGCCATTAAAAAGTAATATGTGGGTGAGTTTCAAGGCACTCACTGATGCAACGTCAAAGTTTGCTAGGAACGGCAATCAACAACTTGAAATGGGATATATAGAATCTGTCTGGGAGGCATGGATTACACTAACTCAGATTGACTCAATCCGACATGGTGTACACCACGCTACGTACAAGCGCGATTATATTCAGTTCCATGGAGTAATGATTAACGCTTTCGGTTTTGCGGTTCAACAGATGATGGTTAATCATTCCATCGCAGAAATAACTTCTATGATCGAAAAACTCTGTGCAACTACCAGCTCTGCAGAAAGAGAGGATTTTTTTCTGATGGATAACTGGGCGGGGATCTGCACGAAAGCCAGCCAGGAAAAACTATCGGTTATTGCCAATGTGGCAGCGCAGAAAGCAGCAGCAAACAGACTGATACAAGCTTTTACCAAAGGAAGTCTGGAAACAACTTAATGAATCAACATTGTCTCATATCAGCATGCTGTACGGCGTCTTTAAGGAACGGTGAGCATGAAAAACAAAATCATCATGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATCATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGCTAGATGAACGAAGACGCCTGGCCGTTCAGGGTTGTCGGACTTGTGCAAGTTGCCAGGAGGAGATCGAACTTAAGAACAAACAATGGGGACTGTGATGGCCTCAAAGCAGCAAATTTCAACATCGTCCAACTGAGGTGTAAAAATGTTCAGAATCATTTTTCCTAACACCTGGTACGTCGACCACCACGGCACTCCCTGCAAAATCCTGCGTTCTACCCACAACAAAGTTCACTACATCCGAAAAGGCAGAACATGTATCGCCAGCATGTTCCGCTTTAATCATGACTTTGAACCTGTGAATAAAGCTGATGCAGATCGGATAGCAGAAGAGATCGAAACGGCAGAACACATTAAGAAGTTACGTGCCATACGCAGGAAATAGAAAAATTGATAAATTCAATACTGCATTTCTCAGCATTAAATTTATCTCTATGACCAGTCAAGAGATGTACCTGCCATGAGCTTAATATCATGTCAGATATATCGGTCACAAACTCCCTCAGCAGCTAAGAGGAGGACAAATGTCTCGACTAATCACTTTACAGGACTGGGCTAAAGAAGAATTTGGGGACTTAGCACCAAGTGAGCGAGTTCTGAAAAAATACGCGCAAGGGAAAATGATGGCCCCACCCGCTATAAAAGTTGGTCGCTACTGGATGATTGACCGAAATTCCCGTTTTGTAGGAACGCTGGCAGAACCGCAACTCCCAATAAACGCAAACCCAAAACTCCAACGGATAATCGCTGATGGCTGCTAGACCCCGATCTCACAAAATCTCTATACCCAATTTATATTGCAAATTAGATAAGCGAACCGGAAAGGTATATTGGCAATACAAACATCCACTATCCGGTCGTTTTCATAGCTTAGGAACTGATGAGAATGAAGCAAAACAAGTTGCTACTGAAGCAAATACCATTATTGCTGAACAACGTACCAGACAAATATTAAGCGTCAATGAGCGTCTGGAAAGAATGAAAGGCAGGCGCTCAGACATTACGGTGACAGAATGGCTTGATAAATATATTTCTATCCAGGAGGACAGGCTGCAACATAATGAACTAAGACCCAACTCCTATCGGCAAAAAGGCAAACCCATTCGTCTTTTCCGTGAGCATTGTGGAATGCAACACCTCAAGGATATTACCGCACTTGATATTGCCGAAATAATTGATGCTGTAAAGGCTGAAGGTCATAACAGGATGGCGCAAGTCGTGAGAATGGTGTTGATCGACGTCTTCAAAGAAGCACAACACGCAGGACATGTTCCGCCAGGATTTAACCCAGCGCAGGCAACAAAACAACCGCGAAATCGAGTAAACCGCCAAAGATTGTCACTGCCCGAATGGCAGGCAATATTTGAAAGCGTAAGCAGACGGCAGCCCTATTTAAAATGCGGCATGCTACTTGCTCTTGTTACTGGACAACGTTTAGGCGATATCTGCAATTTGAAATTCTCTGATATATGGGACGACATGTTGCACATTACTCAGGAAAAAACCGGTTCAAAACTTGCTATTCCGCTTAACCTGAAATGCGATGCTCTGAATATTACCCTTCGTGAAGTTATATCTCAGTGCAGGGATGCTGTTGTTAGTAAATATCTGGTCCATTACCGTCACACTACCTCTCAAGCAAACAGAGGAGACCAGGTTTCTGCAAATACTCTGACAACGGCTTTTAAAAAGGCCAGGGAAAAATGTGGCATAAAATGGGAGCAAGGAACTGCGCCCACATTTCATGAGCAGCGATCTCTGTCAGAACGGTTATATCGGGAACAGGGTCTGGATACGCAAAAGTTGTTAGGCCATAAATCCAGAAAAATGACCGACCGATACAATGATGATCGTGGTAAAGACTGGGTTATCGTAGATATCAAAACAGCATAGAAAATAGCCAGTTTTGGGGAAGGGTTTTGGGGAAAGTTTTGGGGAAGATTTTACATCATCATAAAACAACGGGCGTATAACACGCCCGTTTCAATATTTAACACATGTAGAGATTACATGTTCTTGATGATCGCATCACCAAACTCTGAACATTTCAACAGTTTAGCGCCTTCCATCAGACGTTCGAAGTCATAGGTTACGGTCTTCGCATTGATTGCGCCTTCCATACCTTTAACAATCAGGTCTGCGGCTTCGGTCCAACCCATGTGGCGTAACATCATCTCAGCAGAGAGAATAATAGAGCCTGGGTTTACTTTGTCCTGACCGGCATATTTCGGCGCAGTACCGTGGGTGGCTTCAAACAGGGCGCATTCGTCACCGATGTTTGCACCTGGGGCGATACCGATACCGCCAACCTGCGCTGCCAGGGCGTCAGAAATGTAGTCACCGTTCAGGTTCATACAGGCGATAACATCATATTCAGCCGGACGCAGCAGGATCTGTTGCAGGAATGCATCAGCAATCACGTCTTTAATGACGATCTCTTTGCCGGTGTTCGGGTTTTTAACTTTCAGCCACGGGCCGCCGTCGATCAGTTCACCGCCAAACTCTTCACGCGCCAGCTGGTAGCCCCAGTCTTTAAACGCTCCTTCGGTGAACTTCATGATGTTGCCTTTGTGCACCAGAGTCACAGAGTCACGATCGTTAGCAATTGCGTATTCGATCGCTGCACGAACCAGACGTTTGGTGCCTTCTTCCGAACACGGCTTAATACCGATACCACAATGTTCCGGGAAGCGAATTTTCTTCACCCCCATCTCTTCACGCAGGAATTTAATCACTTTCTCGGCGTCGGCAGAGTCTGCTTTCCATTCGATACCCGCATAAATGTCTTCCGAGTTTTCACGGAAGATAACCATATCGGTCAGTTCAGGGTGTTTAACCGGGCTTGGAGTGCCCTGATAGTAACGTACCGGACGCAGGCAGATGTAGAGATCCAGTTCCTGGCGCAGGGCAACGTTCAGAGAGCGAATACCGCCACCAACAGGAGTGGTCAGCGGACCTTTAATGGCAACGCGATATTCACGAATCAGATCAAGGGTTTCAGCAGGCAGCCAGACATCCTGACCATAAACCTGTGTGGATTTTTCACCGGTGTAAATTTCCATCCAGGAGATTTTACGCTCGCCTTTATAGGCTTTCTCGACTGCAGCGTCGACCACTTTCAGCATGGCTGGGGTTACATCTACACCGATTCCATCACCTTCAATGTAAGGGATAATCGGATTTTCAGGAACGTTGAGTTTGCCGTTTTGCAGGGTGATCTTCTTGCCTTGTGCCGGAACAACTACTTTACTTTCCAT
Protein sequences of DBSCAN-SWA_6 >CP029122|2973073:2983851|2977393_2977933_-|AWF28369.1|DBSCAN-SWA MQLLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGHLVEQDSFMARLAEMEKELCEAKQAVILNAPRHQKLKEMSEGIVSMFRVDPDLAGPLMAMVTTMLGAI >CP029122|2973073:2983851|2976818_2977397_-|AWF25357.1|DBSCAN-SWA MTGSKMAKVGLREQNRLSGANRNTLIAGGIMANTAEIFNFPVPDVAQKERRVADLDDGYTRIANELLEAVMLAGLTQHQLLVFLAVMRKTYGFNKKLDWVSNEQLSELTGILPHKCSAAKSVLVKRGIFIQSGRNIGINNVVSEWSTLPESGKKNKVYLKEVNLPESGKKSLPKSGKGVYPNQVNTKDKLTR >CP029122|2973073:2983851|2979094_2979259_+|AWF25441.1|DBSCAN-SWA MAMTHPHDNIRVGAITFVYSITKRGWVFPGLSVIRNPLKAQRLAEKINNKQEDI >CP029122|2973073:2983851|2982600_2983851_-|AWF26132.1|DBSCAN-SWA MESKVVVPAQGKKITLQNGKLNVPENPIIPYIEGDGIGVDVTPAMLKVVDAAVEKAYKGERKISWMEIYTGEKSTQVYGQDVWLPAETLDLIREYRVAIKGPLTTPVGGGIRSLNVALRQELDLYICLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIYAGIEWKADSADAEKVIKFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAIANDRDSVTLVHKGNIMKFTEGAFKDWGYQLAREEFGGELIDGGPWLKVKNPNTGKEIVIKDVIADAFLQQILLRPAEYDVIACMNLNGDYISDALAAQVGGIGIAPGANIGDECALFEATHGTAPKYAGQDKVNPGSIILSAEMMLRHMGWTEAADLIVKGMEGAINAKTVTYDFERLMEGAKLLKCSEFGDAIIKNM >CP029122|2973073:2983851|2981344_2982487_+|AWF27149.1|integrase|DBSCAN-SWA MAARPRSHKISIPNLYCKLDKRTGKVYWQYKHPLSGRFHSLGTDENEAKQVATEANTIIAEQRTRQILSVNERLERMKGRRSDITVTEWLDKYISIQEDRLQHNELRPNSYRQKGKPIRLFREHCGMQHLKDITALDIAEIIDAVKAEGHNRMAQVVRMVLIDVFKEAQHAGHVPPGFNPAQATKQPRNRVNRQRLSLPEWQAIFESVSRRQPYLKCGMLLALVTGQRLGDICNLKFSDIWDDMLHITQEKTGSKLAIPLNLKCDALNITLREVISQCRDAVVSKYLVHYRHTTSQANRGDQVSANTLTTAFKKAREKCGIKWEQGTAPTFHEQRSLSERLYREQGLDTQKLLGHKSRKMTDRYNDDRGKDWVIVDIKTA >CP029122|2973073:2983851|2976070_2976574_-|AWF26299.1|transposase|DBSCAN-SWA MSRQCTHYGRWPQHGFTSLKKLRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLECLLSLLSAFEVVVWMTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKSLSFSKSVELHDKVIGHYLNIKHYQ >CP029122|2973073:2983851|2978121_2978427_+|AWF24152.1|DBSCAN-SWA MLCCEATYIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKASEQKVAA >CP029122|2973073:2983851|2981118_2981355_+|AWF28621.1|DBSCAN-SWA MSRLITLQDWAKEEFGDLAPSERVLKKYAQGKMMAPPAIKVGRYWMIDRNSRFVGTLAEPQLPINANPKLQRIIADGC >CP029122|2973073:2983851|2980739_2980979_+|AWF25950.1|DBSCAN-SWA MFRIIFPNTWYVDHHGTPCKILRSTHNKVHYIRKGRTCIASMFRFNHDFEPVNKADADRIAEEIETAEHIKKLRAIRRK >CP029122|2973073:2983851|2979255_2980320_+|AWF25064.1|DBSCAN-SWA MSQVGNHSFEFPASQGVQGGTVTLFLTIPGRSLARFLASDNYGHTLERSQREINPNRVRKFLNYLTNADSRNESFIIPPLVGNCDSNIEFVPFGNTNVGIARIPLDAEIKLFDGQHRAAGIEIFCRSSPSTLMVPMMLTMNLPLKTRQQFFSDINNNVSKPSATINMAYNGRDDIAQGMISFLTQHTVFADITDFEHNVVPLKSNMWVSFKALTDATSKFARNGNQQLEMGYIESVWEAWITLTQIDSIRHGVHHATYKRDYIQFHGVMINAFGFAVQQMMVNHSIAEITSMIEKLCATTSSAEREDFFLMDNWAGICTKASQEKLSVIANVAAQKAAANRLIQAFTKGSLETT >CP029122|2973073:2983851|2978423_2979104_+|AWF28302.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWNKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEICTGVAPEVNAKALAWGKQYENDARALFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP029122|2973073:2983851|2973073_2974891_-|AWF26446.1|DBSCAN-SWA MSHLQDKSIMGEIFPLKDICPFTTMGLFNRNLTDSNRKIIAAKLANLLGVNEPIPDSFAGIPLLNNQKSWFFGYEKSRDPNDIECLWEMFSQAMTFADNQNTNSADFTAAYDIASTVMNVGWNLTMGLYWTRPWFYPTLDSQSQYYIQTVLNIKIIKNGAKGRCSGQSYLSLMRALNEVFTQPNYPVHSFPELSLSAWNTDLSQSKDEVANRTWKASLLNKIKALCLKKNGPYFTASELEENYLDVFKVEHPESKTVTSSIYFYLQKLCNDDELKSLESGNYEYLNFEKNESQTITEETVEESAPLPKLTHVPYDISHLVQDGCFLEEAKIQLTLQRLIDKKNLILQGPPGTGKTWLARRLAYCLMGEKAPERISAVQFHPNLSYEDFIRGWRPGKEGQLTLIDGPFVNAIKTAVNNPTSKYVVIIEEINRGNPAQIFGETLTLMEADKRTPTEALSLSYPKNDDEKIYIPENLYIIGTMNIADRSLALLDLALRRRFAFIDLKPAFNDAWRNWVNYNYAIDFDMLAFIKSRLTVLNDMLAKDVTLGPQFCIGHSYVTPAIGQKINDAQAWYEQVVDTEICPLLAEYWFDAPNKVDEARKVLLAK |
12 | Enterobacteria_phage(27.27%) | transposase,integrase | attL 2971046:2971069|attR 2982554:2982577 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
3298227 : 3306997
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >CP029122|3298227:3306997|DBSCAN-SWA TTTAACTGGAAACTTCCGTTGACCGGCTGTATTCATGGCTGCGAATTTTCGCCATCAGCTCGTCTGTCAGTTCGGACACCCACTGGATAGCCAGCCGCTTTTCTTCGTCGCTGCACTCACTAGCCGCTACAAGCTTGATAAAAAAATCAATACGCTGAAGCTTCAATGACTCCAAAAGATAGTCCTGCATCTTCCCTCCTATCATTACACGGATACACAAAAACTGTATATACACCCACTGTTTATATAAACAGTATAATAGGAACAGAAAAATGTAAAACTGTTTTTTGTCAGTTAATTGGATGTACTGATGTCGGTCAATAAAGCACAAAATGTTAAACAGCAGCCTTAGTACCATTGACGCCATTTGTCATCTTCCTGCAGCCTCTGGTTACGGTAAAAAATACGTAAACCGGCACCGGATGGAATGCTGCCGCCACGCAGAAGCAAATCAATCTCAGATGCACTACCTTCAAACCCCCTGGTTGTCAGTTCTGCCTCAAGCTGCAGGCGCTGCTGCTCCAAAATACTCTGTATGTATGCTTTTTTCCGCTTCGGTTTTACCAGTCTCAACCTGGCTGTCAGCTCCCGCCGTTCCTTCTGGCCCATGTTGTGGAGATATTCCTGCAGCTCCTTCTCATCCATGGTTTTAATATCGGGTAAATCACCCCCTGATTTGTTCAGATTTTCAACAGGGGGACAGTTATTGCCACGAGTCCAAGGGGCGCAAGCGCCCTGGTCGGCTGCCGCCTCCTGAACGTCAACGGCTTTACGAACCATTTTCCACTTCACTGCATGAGTGCAGATCTTGCCCTCTGCAATGGGTGACCAGATGCCATAAATACGAATGCCATGATCGCCATAGGCGGTCGGCTCTTCGTTGATTTCATAAGCGGTTCTGATGAGGTGATATTTACGGGGAACCAGTACGCCGCCCTGCTTCATGATGTAGGTGGCAAAACAACCAGCATCAGCAGCAGCCAGAATGGCATCAAGACGCGGGTTATCCAGTACCGGCGCACCTGCTTTTTTGTCACCCTGTTGCCTTGCCGCCTGACCAGCCAGCAAGCGAAGTTCACGGTAAGCCTGACGCCCCGGAATACCAAAGAAGCGGAATTGCTGAACACGATGCAGAGACGCCCAGGCATTCACGTATTCAGCGTTATCACGCAGAGATTTACCCGTTTCCTTGCTGATCTCGCCAGCCAGACCACGACCGTCAATGTTCTTACTGATATATTTCGCGATGTAGCTTGTCGGCGTTCCTTTGCGCGGGTTAATCAACTCAGACTTAAAGCGCGGCCCAGTGTTATTGCCCAGCTCCTCGCGGTCTTCACGGATGGCAAACTTACGCAGTAATGCAGTGATGGCACGGCGGTCTTTTTTGCGCATGAAACACAACAGGTGCCAGTGAACTGTGCCATCATGATGCGGCTCAGCCACCCGCACGCCATACCAGCGCAACCCGGCTTTGTGCATCGCCTTACGAAATGCAGCAAACATACCGACCAGATAATCGCTGCTTTGTCTTACCGTCGCGTTTGTCCAGGTCGGGTTTGGTCTGCCGTTATTTAGCGTGGAATGGAAACGCGACGGACAGGTGATAGTGTAGAAAACGGCGCAGTCACCGCGCATTTCCGCGATAAGCTCCAGACCTTTAACACAGGCCATCATCTCATTGCGGCGATGCGCAGGGTTGCTGCTGCTGGCGTTTACCACATCCTCCATGTCCAGCGTGTCGCCGTCTTCGTTCACCAGTTCATGAGAACGGAAAAACTCCAGCGACTTACGACGCTGCTCACGTTTATGCATCACGGCTTCATAGCTGACATAAGGAGATGCTTTTTTGCTGACCAGGCAAACAGCGCGCAACTGCTCTTCCCGCCATTCGCAACGCATCTTCCATAATTTCCGATACCACCAGTCGGCGCACAACATACGCGCCAGCGAACCCGGAATGAGTTCATAGGGCACGGGTTTACGGCGGTTTCTTTTCCGACGGAGTTGCTCAAACGCAGGTGGGATGACATCCAGACGCAGGGTTTCCGCTGCCACCTTTTCCCATGTCTTGCGGATTTCTTCTGGCTTAACGTCATCGGTGGCATACAAATCACCACAAGCTGCATCAAGGCACATACTCATATGCGCAGCTACCAGGGTGGAAAGGCGTTTCACCTGATCCTGACTCATTTCAGGCAGGATCAGCAGGCCGTCCAGCCCTTCATGGCTTGCCATAAAGCGAAAAGATGCAGATAGCTGACTGTCGCGTACATGCTCCAGTCGTTCCAGACATGGCTTAATCGTCTCACGTAAATAGCGGGAATAAGCCTTTGGCCTGCCCAGGCTGCTGAAGTATTCAATACGTTGCATCAGCGGCTTGCTGATATGGGAAGGCTGGGCGTTGACGTCCGCCAGAATGACCATGTCTGAATTAAAACGCTGCTGCTCATGCGCCAGCTTTGCCCGGCTAATGAGCTTATCCTGCTCCATTTCGCGCTGGACAGGATCACGTGATTCATTAAAGAAATAACGCTCCCAGACCTGATCACTCAGTGCCTCGCGGCGCAACTGTTCCTGCTCGTTATCGGCAGCGTACAGAGTGATCAGGTTTGAAAGCGTAGAAACCGGCGCAACTTCCGCCGGGTCCAGATAAGGGTTAATGGCCTTTTTCGGGCTGTTCCATGAGAATGCTGCGGCAGCCTCGTTAAAGCCGCAGCAGTTGTTCATATCGGCATGACTCATGCACGTACTCCGTACACGGCAGAACTATCCACGCCACGCGAATAATCAAATCCCATCCAGCAGCGCGGCCCGGAAACAGCAATGATTTCTGTTGCTGATTTACCCTCGCCAGCTGCCACACCGATGCTGCGTTTTACCTTGATATAGTGGTGAGTAAAATTGCGATACAGCGAACGGATCAGGGATGTGTCACTGTTAGAAACAATGACCGGATGTCCTTCTGATGACCGATGTTCAAGAACGGATGCCAGGTGATACTGGTCATCTTCAGTGAAACCATCAGTGTGATAGCCGGAAAACGTACCGTCATATGGCGGATCGCAATACACCACATCTCCCGCCTTCAACATCGCCAGCGTTTCATCAAAGCTGGCGCAGATAAACGTCGCTCGCTGGGCTTTTTCTGCAAATGTGCGAAGTTCTTTTTCAGGGAAATACGGATTTTTATAATTACCGTAGGGAATGTTGAAATGCCCGCTCTTGTTATAGCGACATAAACCACGGTAACCGTGACGATTGAGATACAGGAAATATACCGCTTTCATGAAATCAGTAATTTCAGTTGAGCAGTTAAACTCCTGCCTTATGTTGTAATAAGCCACCTCCCTGTTTGCGATCTCAAATAAAACTCTGGCGCGAGATATAAACGATTCACAATCAGCGGCAACCTTTTTATAGAGGTTGATTAAATCAGGATTAATATCCGCAACCAGATAGCTGGGATAATCCGTTTCCATCATCACAGCACAGGAACCCGCGAAAGGTTCAACCAGTCGCGGGCCAGCAGGAAGGTATTTTTTCAGTTCTGGCATAATGGCGGTTTTATTTCCCGCCCATTTCAGGATGGTGCTCATACAGCACCTCCGTTGTAATGTTTGCCTTTCAGTTCTGCGATTTCCTGACAGGTAATGCAAAGCTGCACTCCCGGAATGGCGCGGCGTCGTGCTGGCGGAATTGGCGCTTCACATTCAATGCAAAGTACGCGTGACACGCCCGGTGATTTGGCACGGGCTGCACGGATATGGCGCTGGCGTTCTTCTTCAACGCGCTGCTGTACGAGATCCATTGCATCAGCCATTAGTGGATCTCCTGCGCTTCGTTCTGGATTGCTTCAGCAGTCACGCGCAGCAGTTCTGCCGCTTCCACGTGGTTTAGCTGGCGGGATGAGATATGACACGCCAGGCTATCAAGGCGAGCAGCCATTGCTTCAGCCCTTGCCCGGCGTTCTTCCAGACGAGCCTCTGTCAGTAAAATATTAAGCCCTGCGTCATCCGGTCCGGTTTTAGTCGTGAGGGTTTCAATATTACGCATAATCAATTCTCCTGAATTTAGATAAAGGGATGCCCGGCGGGTTTACGCCATTAATTTCATTAGTTGGTTAATTCGGCATGGTTAGCCGTCTGGGAAATAAGCTCACCACTGCACGAAAATGATTCATTGCTTTAATCAGCTCCCGCTTTTCGTCAGTGGTCAGCTCATTAATGCTGATGCTATGACGTTCAGCTGGAATTTTTGCCATAAAGAATATGGCAGCCAGTGCCCGTTTATTTTGTTCATTATTGATATCCCGTGGATCACGCATATCTTTAATAAACCGCTCAAGCTCTGACTCAATATTCAGGCCAAAAACTTTCGCCCTTAACTCCGCAATGTGATTAAGTCCATTCAGGCGTTCACCGGGGCTTAATGGAACAGTTGCTGCAGCGCCATTAATTGCCATACTTCATATCCCCCAAACGCAGCTATCGTTCTTTGTTCTTACGGTAACGCTCAAGAGGAGATACATTTTTTCGTATCGTCTCTTTAACCTGCTCTCCCCGTAAAAACGTCCCATCCTTTAACGTGAAAAAGTAACTGCCATCGCCCGACAATGACGGATAGCAACAGAGCAAATCATCTTCAGGTACTGAATAACTCTCCCCTCTGTAACGAAACTGATAAACCACTTCACTTTCTGCCGCATACATTTGGACTTTCTCCGTTTCCTCGTGGTCAATTCAGACAGCAATTCATCTTGTGAATGACATGGATGCCAGCGTTTTCCATCCTCACCCGTGATCCAGCCGTGACCGTAGTGCATTGCCGGGCTTTGTTTTACCAGCAGCGATGCAAATGATGGTTCTTTCGTCAGCATAAACACCTCACAGCAAACCGAATGAAGCACCAAGGCCAGTCATGGTATCAACTGCACTCGCCATCGCAGGATTAGCCTGTAAACGGGCTTGCAATGAAACAGCAGCCAGCGCCATCAGTCGTGTTACAGAGTTAATGCTGCTGATAGCATCACGACGACCTGCACTGGTTTTTACATCGCCAGATACCGCACCTGCAGCAACACGCCCGATCTCTGCGGTTGCACTCATGACGTAATGCGGTAGTTTCTCTTTTGCCACCTCATTAATCGGAACACATGGCAGACAATGAATCTGTGCCAGAAAACCATCTACCAGCGTTGAATCTTCAGTCAGATCGGTAAGCAACCAAATTTCTGGTGCGGTTAATAAATGAGGTTGAGCTGGGTTCAGTTTGTTCCGCAGAATCTGCACATTCATGCCTGCACGTTCTGCCAGTTGCACCAGATTGTGGCGCAGTGCGAATGCACGACAGGCTTCATCAAAATGTGGATGTTTGGAAACTTGGTAATCAAACATGGTCGACACCTCTGATGTATCCCAAAATGGAACTAGTTGAATACAACATTGCAATCAGTAAGTGCATCAACGGTAAGAGCAGCAAGGTTGATCATTACCTTTTCTCTTTTCTTGTCTTTCCGAAGGCGATGCCGAGGGATGCGACCGTCAGCCAGCATATCGTTAATTGTGTCGATTGAAAGACCAGTAAGTTCGCTATAACGCTCAATTGTGACATGTGGCGTATTCAGGGTTATTGAAATGTTAGGGGTCATGATGCAACATCTCCTATTGGCTTGTGGTGAGTCAGTTTTAATCGTGGCTTTAACTTCACATTTCGGAGAATAGGATCACAAATCGGTTATGTCAACACATGAAATCACATTTCGCCATGTGGACGAAAAAAAGAAATCCTTAATCATGCAGAATCGCGGAGGGCAATCGGTTATAGATCGGATACTGAAAGCCTATGGTTTTTCTTCCCGACAAGCATTCTGTAATCACCTAGGTATATCGCAAAGTACAATGGCGAACAGGTATGCCCGTGACACTTTCCCTGCTGATTGGGTTGTTATCTGTAGCATGGAGACTGGAGTGCCGGTCGAGTGGTTGGCATTTGGCACTGATACCGAGAAGGGAAGCATTACAAATAATGCAGAAAAAAGTCACAACAATTGTGACAGCAAGCATCAACATCTCAATAGAGAACAAGACATCCAAAATGAGAACTCTTTTACTATTAACCAAGGTGGAAAAGCAGCAATAGAGCGAATCGTTTTGGCTTATGGATTTAAGACAAGACAAGCTTTAGCTGATCATATTGGTGTATCAAAAAGTACATTAGCCAATCGTTACATGAGAGATACCTTTCCTGCTGACTGGATTATTCAATGCTCACTGGAAACCGGTGCTTCATTAACATGGCTAACCACTGGTAACGGGGCAATGTTTGAAAAGCCTCGAAACGATACTATCACTATCCCATATCATAAAATAATTGATGGATCTCTTGCTCAAGAAACCTTCTTGACTTTTGACTCTAAGTTGTTAGAAGGAACCTTTCTGCAACCTTTAGCAGTATTCATTGATGAGGAAATATATATTGTAGAATCAAAATTTAATGAAGTTACTGATGGCAAGTGGCTTGTGAATATTGAAGGGAAAATAAGTATCAAAGATTTGACTCGCATACCCGTTGGTATGGTTAAAGTTGTAGGCACTAACGCAAGTTTTGAATGCTTACTTACTGACATTATCGTTTTGGCAAAATGTAAAAGAGTTTTTACTAAAAATGTATAAAGAGAAACATCATGACTGAACCAACCAATAAAGATAGCGAAATAAAAAAACACCTATTAGAATTTCTTGATTCACAGTCTGAAAATATAGCAAAACACTTCTACTCTCATATAAAAGACTTAATAGAAGCAGGAGAGCTTTCTGAAGCTCATAATAACCTAGCGCTAATTGAAAAATACATAACTAGGCCACCGATGGATGAAGAACCCAATATAAATGAAAATAAAGCCAATAAAAGAAAAAATGTAAAATCACTTGAACCTAATAATTATGTAGAACATATAATACAATTAGAAGAACGAAACAGCATATTAACTCTACAGTTAGAGCATTATACTCAGGATCTTAATAGAAAAAACGCAATAATCGAAAACAACGTAAAACAAATTAATTCATTGATTAGTGAAAATAAGGAACTCCGTAGCCAAGTACAGCAACAAAGAATCGATGATAAAATCCCCACCTATGTTAACGATGTTAAATCAGATCTTGGTAGTGATGACAAACATTTTATATTGATGTCTATTATCTGGTCTATTGCAGGGGTATTTTTTGGCTTCCTTGCAGTAGTATCTGCTTTTTTTACATTATACATGAACTTAGATTTAAAAAATCTCACTAACCTTCAGTTAATATATATCTTCACGCGAGGATTAGTTGGAATCGCCATTCTTTCATGGCTATCATATATCTGCCTTAGTAACTCAAAAAAGTACACACATGAATCGATCAGGCGAAAAGATCGTCGACATGCTTTGATGTTTGGTCAAGTTTTTTTGCAGATATACGGTTCTACAGCAACTAAAGAGGATGCAATAGAAGTCTTTAAGGATTGGAATATTTCAGGTGACTCTGCATTTTCAGGTCAGACAGAGCAACCACCGAGTTTTGCGTCATTTTTGAATACAATCAAAGACAAAGTTAAAGTAACTGGAAGTGATAAAGAAACAGATTAATCATGAACATGTATGCTACTAAGTAAAAAATACATTGAATACTGTTGTTATATACAGTTAAATTTAGCCCTCTGATATGAGGGCATTTTTTATGGCAGTACGAAAACTCACCACAGGAAAATGGCTTTGCGAATGTTACCCCGCCGGACGTAGCGGACGCCGTGTGCGTAAACAATTCGCCACCAAAGGCGAAGCACTGGCCTTCGAGCGATACACCATGGGGGAAATAGAAGCAAAACCCTGGCTGGGCGAATCAGTGGATCGTCGGACACTGAAAGATATGGTTGAGCTATGGTTCAAATTACATGGCAAATCTCTTACTGCCGGACAGCATGTCTACAACAAGCTGCTGTTGATGGTTGACGCCTTGGGAAATCCCCTTGCAACTGATCTCACCTCAAAAATGTTTGCTCACTATCGAGATAAACGCCTGACAGGCGAGATCTACTTCAGCGAGAAATGGAAGAAAGGAGCAAGCCCGGTCACCATTAACCTGGAGCAAAGCTATCTAAGTAGTGTTTTTAGCGAACTATCCCGTCTGGGCGAATGGTCGTATCCGAACCCACTGGAGAACATGCGAAAATTCACCATCGCAGAAAAAGAGATGGCATGGCTTACCCATGAGCAGATTGTTGAATTGCTGGCTGATTGCAAACGTCAGGACCCAATTCTGGCACTGGTAGTTAAGATATGCTTAAGCACAGGCGCACGCTGGCGTGAAGCCGTAAATCTTACCCGCTCACAGGTGACCAAATACCGAATTACCTTTGTCAGAACGAAGGGGAAGAAAAACAGAAGCATCCCTATCAGTAAAGAGCTTTACGAAGAGATCATGGCGCTCGATGGGTTCAATTTCTTCACAGACTGCTATTTTCAATTTTTATCCGTGATGGAAAAAACGTCTATCGTGCTCCCTCGCGGTCAACTCACACACGTTCTGCGCCATACGTTTGCAGCGCACTTCATGATGTCGGGTGGAAACATTCTGGCCTTACAAAAAATTCTCGGACACCACGATATAAAAATGACTATGCGTTACGCACATCTGGCACCGGATCATCTGGAAACGGCGCTCCGTTTCAATCCTCTGGCAACGCTGCCAAGTGGCGACAAAGTGGCGGCAGCGGTTGGCATTACCCCGTAA
Protein sequences of DBSCAN-SWA_7 >CP029122|3298227:3306997|3301818_3302046_-|AWF27801.1|DBSCAN-SWA MADAMDLVQQRVEEERQRHIRAARAKSPGVSRVLCIECEAPIPPARRRAIPGVQLCITCQEIAELKGKHYNGGAV >CP029122|3298227:3306997|3302346_3302688_-|AWF26878.1|DBSCAN-SWA MAINGAAATVPLSPGERLNGLNHIAELRAKVFGLNIESELERFIKDMRDPRDINNEQNKRALAAIFFMAKIPAERHSISINELTTDEKRELIKAMNHFRAVVSLFPRRLTMPN >CP029122|3298227:3306997|3305944_3306997_+|AWF24998.1|integrase|DBSCAN-SWA MAVRKLTTGKWLCECYPAGRSGRRVRKQFATKGEALAFERYTMGEIEAKPWLGESVDRRTLKDMVELWFKLHGKSLTAGQHVYNKLLLMVDALGNPLATDLTSKMFAHYRDKRLTGEIYFSEKWKKGASPVTINLEQSYLSSVFSELSRLGEWSYPNPLENMRKFTIAEKEMAWLTHEQIVELLADCKRQDPILALVVKICLSTGARWREAVNLTRSQVTKYRITFVRTKGKKNRSIPISKELYEEIMALDGFNFFTDCYFQFLSVMEKTSIVLPRGQLTHVLRHTFAAHFMMSGGNILALQKILGHHDIKMTMRYAHLAPDHLETALRFNPLATLPSGDKVAAAVGITP >CP029122|3298227:3306997|3304908_3305853_+|AWF27710.1|DBSCAN-SWA MTEPTNKDSEIKKHLLEFLDSQSENIAKHFYSHIKDLIEAGELSEAHNNLALIEKYITRPPMDEEPNINENKANKRKNVKSLEPNNYVEHIIQLEERNSILTLQLEHYTQDLNRKNAIIENNVKQINSLISENKELRSQVQQQRIDDKIPTYVNDVKSDLGSDDKHFILMSIIWSIAGVFFGFLAVVSAFFTLYMNLDLKNLTNLQLIYIFTRGLVGIAILSWLSYICLSNSKKYTHESIRRKDRRHALMFGQVFLQIYGSTATKEDAIEVFKDWNISGDSAFSGQTEQPPSFASFLNTIKDKVKVTGSDKETD >CP029122|3298227:3306997|3302710_3302935_-|AWF26978.1|DBSCAN-SWA MYAAESEVVYQFRYRGESYSVPEDDLLCCYPSLSGDGSYFFTLKDGTFLRGEQVKETIRKNVSPLERYRKNKER >CP029122|3298227:3306997|3302045_3302279_-|AWF24089.1|DBSCAN-SWA MRNIETLTTKTGPDDAGLNILLTEARLEERRARAEAMAARLDSLACHISSRQLNHVEAAELLRVTAEAIQNEAQEIH >CP029122|3298227:3306997|3298227_3298416_-|AWF27878.1|DBSCAN-SWA MQDYLLESLKLQRIDFFIKLVAASECSDEEKRLAIQWVSELTDELMAKIRSHEYSRSTEVSS >CP029122|3298227:3306997|3300964_3301822_-|AWF28605.1|DBSCAN-SWA MSTILKWAGNKTAIMPELKKYLPAGPRLVEPFAGSCAVMMETDYPSYLVADINPDLINLYKKVAADCESFISRARVLFEIANREVAYYNIRQEFNCSTEITDFMKAVYFLYLNRHGYRGLCRYNKSGHFNIPYGNYKNPYFPEKELRTFAEKAQRATFICASFDETLAMLKAGDVVYCDPPYDGTFSGYHTDGFTEDDQYHLASVLEHRSSEGHPVIVSNSDTSLIRSLYRNFTHHYIKVKRSIGVAAGEGKSATEIIAVSGPRCWMGFDYSRGVDSSAVYGVRA >CP029122|3298227:3306997|3303109_3303619_-|AWF26636.1|DBSCAN-SWA MFDYQVSKHPHFDEACRAFALRHNLVQLAERAGMNVQILRNKLNPAQPHLLTAPEIWLLTDLTEDSTLVDGFLAQIHCLPCVPINEVAKEKLPHYVMSATAEIGRVAAGAVSGDVKTSAGRRDAISSINSVTRLMALAAVSLQARLQANPAMASAVDTMTGLGASFGLL >CP029122|3298227:3306997|3303961_3304897_+|AWF26867.1|DBSCAN-SWA MSTHEITFRHVDEKKKSLIMQNRGGQSVIDRILKAYGFSSRQAFCNHLGISQSTMANRYARDTFPADWVVICSMETGVPVEWLAFGTDTEKGSITNNAEKSHNNCDSKHQHLNREQDIQNENSFTINQGGKAAIERIVLAYGFKTRQALADHIGVSKSTLANRYMRDTFPADWIIQCSLETGASLTWLTTGNGAMFEKPRNDTITIPYHKIIDGSLAQETFLTFDSKLLEGTFLQPLAVFIDEEIYIVESKFNEVTDGKWLVNIEGKISIKDLTRIPVGMVKVVGTNASFECLLTDIIVLAKCKRVFTKNV >CP029122|3298227:3306997|3298574_3300968_-|AWF24887.1|DBSCAN-SWA MSHADMNNCCGFNEAAAAFSWNSPKKAINPYLDPAEVAPVSTLSNLITLYAADNEQEQLRREALSDQVWERYFFNESRDPVQREMEQDKLISRAKLAHEQQRFNSDMVILADVNAQPSHISKPLMQRIEYFSSLGRPKAYSRYLRETIKPCLERLEHVRDSQLSASFRFMASHEGLDGLLILPEMSQDQVKRLSTLVAAHMSMCLDAACGDLYATDDVKPEEIRKTWEKVAAETLRLDVIPPAFEQLRRKRNRRKPVPYELIPGSLARMLCADWWYRKLWKMRCEWREEQLRAVCLVSKKASPYVSYEAVMHKREQRRKSLEFFRSHELVNEDGDTLDMEDVVNASSSNPAHRRNEMMACVKGLELIAEMRGDCAVFYTITCPSRFHSTLNNGRPNPTWTNATVRQSSDYLVGMFAAFRKAMHKAGLRWYGVRVAEPHHDGTVHWHLLCFMRKKDRRAITALLRKFAIREDREELGNNTGPRFKSELINPRKGTPTSYIAKYISKNIDGRGLAGEISKETGKSLRDNAEYVNAWASLHRVQQFRFFGIPGRQAYRELRLLAGQAARQQGDKKAGAPVLDNPRLDAILAAADAGCFATYIMKQGGVLVPRKYHLIRTAYEINEEPTAYGDHGIRIYGIWSPIAEGKICTHAVKWKMVRKAVDVQEAAADQGACAPWTRGNNCPPVENLNKSGGDLPDIKTMDEKELQEYLHNMGQKERRELTARLRLVKPKRKKAYIQSILEQQRLQLEAELTTRGFEGSASEIDLLLRGGSIPSGAGLRIFYRNQRLQEDDKWRQWY |
11 | Salmonella_phage(88.89%) | integrase | attL 3297897:3297910|attR 3307039:3307052 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
3374112 : 3412836
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >CP029122|3374112:3412836|DBSCAN-SWA CTTACTGATAGTGTTTTATGTTCAGATAATGCCCGATGACCTTGTCATGCAACTCCACCGATTTTGAGAACGACAGTAACTTCCGTCCCAGCCTTGCCAGATGTTGTCTCAGATTCAGATTATGTCGCTCAATGCGCTGAGTGTAACGCTTGCTGATAACGTGCAGCTTTCCCTTCAGGCGTGATTCATACAGCGGCCAGCCATCCGTCATCTATACCACGACCTCAAAGGCCGACAGCAGGCTCAGAAGACGCTCCAGTGTGGCCAGAGTGCGTTCACCGAAGACGTGCGCCACAACCGTCCTCCGTATCCTGTCATACGCGTAAAACAGCCAGCGCTGACGTGATTTAGCACCGACGTAGCCCCACTGTTCGTCCATTTCAGCGCAGACAATCACATCACTGCCCGGTTGTATGCGCGAGGTTACCGACTGCGGCCTGATTTTTTTAAGTGACGTAAAACCGTGTTGAGGCCAACGCCCATAATGCGTGCACTGGCGCGACATCCGACGCCATTCATGGCCATATCAATGATTTTCTGGTGCTTACCGGGCTGAGAGGCGGTGTAAGTGAACTGTAGTTGCCATGTTTTACGGCAAGGAGAGCAGAGATAGCGCTGATGTCCGGCAGTGCTTTTGCCGTTACGCACCACGCCTTCAGTAGCGGAGCAGGAAGGACATCTGATGGAAATGGAAGCCACGCAAGCACCTTAAAATCACCATCATACACTAAATCAGTAAGTTGGCAGCATTACCTATGGCGACATACATTGTTGGTGAATCATATTTGGTTGCCGTAATTTCGAGATACTTAGCTCCCCAGGCCAGTCCTGCGGTTGCCACCAAACCTAAAGCGGTAATGAGCCATATACCGTTTAATATCCTGGTCTCGTGCTGAAAATCTAATTTATTGCTGTTTTGTGAGTATGATTCCATTCTTTTAGTTGTCCTTAGAAAAAATATTAATATTCACCATCAATAAATTTAAAACCACGCATACAAAATACTTTTGTATTTATGGATACGGAAAACGCTTCCGCTCGCCATAAGGCGTATTTTATACCTGGTACTTGTTAAGTGTCGTTTTTAGTAAGAAAAAAAATCCCGACACATGGCCGGGATATTGATGGTGAAAAAGAATTAACGGCGGTTGCCAAAAATTCGCAGCAACATCAGGAACAGGTTGATGAAGTCCAGATACAAGGTTAACGCGCCAAGAATGGAATATTTGCGCAGGTTCGACGTGTCGCGGGTATCAATCTGCTCACCCATATTTTTCAATTTCTGAGTGTCATACGCCGTCAATCCGACAAAGACAATCACGCCGATGTAGGTAACTGCCCACATCAATGCTTCGCTTTTCAGCCAGAAGTTGACCAGCGATGCCAGCACAATGCCGATTAACGCCATAAACAGCATATTGCCAAAGCCACTTAAATCGCGCTTCGTGGTGTAACCGTACAGGCTCATTGCGCCGAACATCCCGGCAGTAACGACGAAAGTACTGGCGATAGAAGCAGCGGTATAGACAATGAATATACTGGAAAGCGTAAGACCCGTCAGCGCCGAATAAAGCATAAAGAGCATCGTCGTTACACCTGCGCTCAGCTTTTGAATCATCGCTGATAACACAATAACCAATGCTAATTGCGCGATGATCAGACCGATTAAAAAGACACGGTTAGTAAACAACAGCTCCATCACGGCCGCGGAATTAGCCGCATACCAGGCAACAAATGCGGTCAGCAACAAGCCAACGGTCATCCAGCCATAGACTTGCGCCATATAAGTTTGCAAGCCAGCCCGGGGTTGTACGATTGAATCAGAACGTGGGAATCTGTCCATGACGATCTCCTGAAGATATAAGGAATATCTTAAGGATACTGCAAAATGATGAGGCTGTGCATCGACGCAGCGTAAACGCATGTACTGAGCGGTGAAATTGCCGGACGCAGCGGTGCCTTATCCGGCTAACAAAAAACTACCAGCGTTTTGCCGCCTGCTGATCGCTCTCCCGAGCTTCAACCCAGCGGTCGCCTTCCGGCGTGGCTTCGCGCTTCCAGAACGGTGCGCGGGTTTTGAGATAATCCATAATAAACTGCCCGGCTTCAAACGCACTGCTGCGATGCGCACTGGTGACACCGACAAAAACGATTTCATCGCCCGGCCATAATTCCCCGATGCGGTGAATCACAGTGACGCGCCCCAGCGGCCAGCGGTTACGCGCTTCATCAACAATTTCTGCCAGTGCTTTTTCAGTCATCCCCGGATAGTGTTCGAGGGTTAATGCTTTGACGCTGTCGCCCAGGTTATGGTTGCGCACCTTACCAGTAAAGGTGACTACCGCACCGTCTTCGTCACGCTCCGCCAGCCACGGGTACTCTTCTCCTACGCTGAACGGCTGCGGACCAACAACAATTTTGGTTTCTGCCATCTTAACCTCCGGTTACCGGCGGGAAGAAAGCTACTTCGTCGCCGTCAGTCAGCGGATGGTCAAAACTCACCAGCGTCTGGTTGACGGCAGCCAGTAATTTGCCATCTTCCAGCGCCAGCGCCCAGCGATCGCTCTGCGCAGCCATGTGCTGGCGTAACGCTTCAACAGTTGGGAAATCCGCAGCCACTTCGGTTGCATCTGTTCCCACCAACTCGCGCACCTGGGCGAAAAAAAGAACTTTAATCATCCGCTTCCACCTTAAAGTCACCCGACTTGCCGCCGCTCTTCGCCAGCAAACGTACCGGACCAATCACCATATCTTTTTGCACCGCTTTGCACATGTCATAAATGGTCAGCGCCGCCACGGAGGCCGCGGTTAATGCTTCCATTTCGACACCGGTTTTCCCGGTCAGGCGGCATAAGGTTTCTATACGCACCCGATTGTGCTCCGGCTCGGCCTGTAAATTGACTTCAACTTTGCTGAGCATCAGCGGATGACAGAGCGGGATCAGATCCCAGGTGCGTTTTGCCGCCTGAATACCGGCAATACGCGCAGTGGCAAATACGTCGCCTTTGTGGTGGCGACCATCAATAATCATCGCCAGCGTCTCGCTGCGCATGGTGACAAAGGCTTCCGCCCGCGCTTCACGCACGGTTTCCGCTTTGGCGGAGACATCCACCATGTGCGCTTCGCCAGCGGCGTTGATATGGGTCAGTTGCGACATACTTATTTCTTCAAATGTGGATGGAAATTACACGGACGCGTACGGGCATCCAGCTGCGGCGCGATGATATTTTCCCATGCGGTACGGCACGCTTTGGTCGAACCCGGCATGGCGAAAATCAGCGTTTTGTTGGCGACGCCCGCTACCGCACGAGATTGCAACGTGGAAGTGCCAATCTCTTCAAACGACAACATACGGAACACTTCACCAAAACCTTCAATTTCACGGTCGAACAACGGCAGCAATGCTTCGGGAGCCTGATCACCTTCCGTCAGGCCAGTACCACCCGTAATCAATACCACTTGTACATCGTCGCTGGCGATCCACGCAGATACCTGAGCGCGAATAGCGTAGCGGTTTTCTTTCACAATGGCTTTATCGACAACGTGATGGCCCGCTTCTTGCGCCGAATCGCGCAGATAGTGACCGGAGGTATCGTCTTCTTCACCGCGACGATTAGAAACCGTAAGAATAGCAATACGGGTCGGGATAAATTCAGTGCTTACCTGACTCATCTGATCTCTCCTTTTGACGTTTTAGCCGCCAATGTACGATAAGTTTTGCGTAATACCGGTGTTGTTTTGATGCAGGAAATGGGTCTGTTTCTTCTCCCGCAGCGCCGCTGAAATACGCGCTTCCAACGCCTGTTGCTGGGTATCGTCTTCCAGCAGATCGCGCAGGTTAACGCCGCCTTCACCAAACAGGCAGAGATGGAGTTTACCAATGGAGGAAACGCGCAGGCGGTTGCAAGTGGCGCAGAAGTCTTTTTCATACGGCATGATAAGGCCAATCTCTCCGGCGTAATCTGGATGGCAAAAGACTTGCGCGGGACCGTCGCTGCGTTGACGTAATTGGTGGATCCAGCCGCGACGCAGTAGCTCGTCACGCAGAACCTGACCAGAGATGTGATGCTTACGGAAGAGCTCGCTGCCCTCGCCCGTTTCCATCAGTTCGATGAAACGCAGCTGGATAGGGCGATGCTGGATCCAGTTCAGAAAGGTGTCGAGCTGGTGATGATTAACATCACGCATCAGCACGGTATTGACTTTGACCTTCTCAAAACCGGCCTCAAATGCAGCATCAATCCCCGCCATGACCTGGTTGAATTTATCCTGCCCGGTAATAGCGTGAAACTGGCGGGCGTCCAGACTGTCGACACTGACGTTAATGCCAGTAAGTCCCGCATCGCGCCAGTTCGCCACATCGCGTTCCAGACGGTAACCATTGGTGGTGACCGCAATCTGGCGGATAGCGTCGTTTTCCCGCACAGCGGCGATGATATCGGTAAAGTCGCGGCGTAAAGACGGCTCACCGCCTGTCAGGCGCACTTTTTCGGTGCCCAGACTGGCGAAGGCGCGCGTAACCCGGCGAATTTCATCGACGGTAAGAAAGCCTTTATTGGTGACGCCGCTCGGTTTGTAGCCATCCGGCAGGCAGTAGGTGCAACGAAAGTTACACACATCGGTAATCGACAGGCGCAAGTAGTAAAACTTACGCGCAAATGCATCAGTCAGTTGTGAAGCCATGTACACCTTTCCAGATACGGGAGGCGAAGTCATTTCTTCCTTCGCCCTGGTGGCAATCTTTACGAGCATTGTCACGGCCAAAGCACCGTATCAGTTGACCCAGGTGCAGAGGCTAGAGTGTTTAGTGGTTATGCCGATACTAGCGTGTAAATGAGATTTTTACCATCCACACTTTCGCTATGTAATCATGTATATAGCGTCATGATCGTGCTATTTCGCTACAGAATAAGGGAAAATGGCGGCTTTACTTTGATATACATCATCAGGGAGGCATAGCGTGCATCGTCCTTTAGAGGTAGAAGAAAATAGTTTGTATCCTCAATATTGGCAGGTTAATTGCTGTTTCCCCGCAATTTGCGCTACTGTAGCGCGGATAATCTGACTCCAGGAAACTATATGCGCAATCGTACGCTGGCTGATCTTGATCGTGTCGTTGCTCTCGGCGGAGGGCATGGACTGGGACGCGTTCTCTCATCACTTTCGTCTTTGGGTTCTCGTTTAACGGGTATCGTCACCACCACCGATAATGGTGGCTCGACGGGGCGTATTCGCCGTTCAGAAGGCGGCATTGCCTGGGGCGATATGCGCAACTGCCTCAACCAGCTGATAACGGAACCGAGCGTCGCCTCCGCGATGTTTGAATACCGTTTTGGTGGCAATGGCGAACTTTCCGGTCATAATCTCGGAAACTTGATGTTAAAGGCGCTGGATCACCTTAGCGTGCGGCCTCTGGAAGCCATCAATTTAATTCGTAATCTGCTGAAAGTGGATACGCATTTGATTCCAATGTCAGAGCATCCTGTTGATCTGATGGCGATTGACGATCAGGGGCATGAAGTTTACGGCGAGGTCAATATCGACCAGTTAACTACGCCGATTCAAGAGTTATTGTTAACGCCTAATGTACCCGCAACGCGTGAGGCGGTTCACGCTATCAATGAAGCGGATCTCATCATTATTGGGCCTGGCAGTTTTTATACCAGCCTGATGCCAATTCTGCTGCTGAAGGAAATCGCCCAGGCATTACGCCGCACGCCAGCGCCGATGGTTTATATCGGCAATCTGGGGCGTGAGTTGAGTTTACCTGCGGCTAATTTGAAGCTGGAAAGCAAGCTGGCAATTATGGAGCAGTATGTTGGTAAAAAGGTCATTGATGCGGTCATCGTCGGGCCAAAAGTGGATGTCTCGGCGGTGAAAGAGCGGATTGTGATCCAGGAGGTACTGGAGGCCAGCGATATTCCGTATCGTCATGACCGCCAGTTGTTACATAACGCGCTGGAAAAGGCGTTACAGGCTTTAGGTTAACGGAAATCCGAAGGAAAATTCCGGCTTCCTATTGAAGACAAAGTGCGCGTTGTTTATGCCGGATGCGGCGTGAACGCCTTATCCGGCCTACAAACCGCGCAAATTCAATATATTGCGGAGAAAATGTAGGCCTGATAAGCGTAGCGCATCAGGCTGTTTTCCGTTTGTCATCAGTCTTCTTCGCTATCCTGTTACGATGCCGCGATAAACAGCTCACGCAGCTGATGCAACTGGTCACGAATTTGCGCCGCTTCTTCGAACTCCAGATTCTGCGCGTGTTGCATCATCAACCCTTCCAGCTCATGGATTTTCTGCTGCAACGCTTTAGGCGACATATCCATCGGCACATTATCCGGCTCAACAATCGGTCGCGATTTTCCTCTGCCCTTCGCTTTGGTTTTAGCAATGTTCTGCCCCAACGCCAGGATATCGACCACTTTCTTGTTCAAGCCTTGCGGCGTAATGCCGTGTTCCTCGTTGTACTTCTGCTGTTTCTCGCGGCGACGTTCGGTTTCGCCAATCGCTTTCGCCATTGATGGGGTGATCTTATCGCCGTAGAGAATCGCTTTACCGTTAACGTTACGTGCCGCACGACCAATGGTCTGGATCAACGAACGTTCGGAACGCAGGAAGCCTTCTTTGTCAGCGTCGAGGATCGCCACCAGCGAAACTTCCGGCATATCCAGACCTTCACGCAGTAAGTTGATCCCTACCAATACGTCGAACTCACCCAGACGCAAGTCGCGGATAATCTCCATACGCTCGACGGTGTCGATATCTGAGTGAAGATAACGCACGCGCTCACCGTGTTCTTCGAGATATTCGGTAAGATCTTCCGCCATTCGCTTGGTCAGTGTGGTGACCAGTACGCGTTCGTTAATTGCCGCTCGCTGACGAATCTCCGAAAGAAGATCATCAACCTGTGTCGCCACCGGCCGCACTTCGATAATCGGGTCAAGCAATCCGGTTGGACGCACCACCTGATCCACCACATCGCCGCCGGATTTTTCCAGCTCGTAATTACCCGGCGTCGCCGAAACATAGATGGTTTGCGGCGCTAATGCTTCGAACTCTTCAAACTTAAGCGGACGGTTATCCAGCGCTGATGGCAGGCGGAAGCCGTACTCCACCAGTGTCTCTTTACGCGCCCGGTCACCGCGATACATGCCGCCAATTTGTGGAATGGTGACGTGAGATTCATCGACCACCAGCAGCCCATCAGCAGGGAGGTAATCAAACAGCGTCGGCGGTGGCTCACCCGGTCCACGACCGGAGAGGAAGCGCGAGTAGTTTTCAATCCCCGAACAGTAGCCCAGCTCGTTCATCATCTCCAGATCAAACTGGGTACGCTGGGTCAGCCGCTGCTCTTCCAGCAGTTTGTTGTTTTCCAACAGCACTTTGCGTCTGGCGGCCAGCTCTTCTTTGATCTCCTCCATCGCCTGTACGATGCGCTCGCGCGGTGTGACGTAGTGCGTTTTCGGGTAGATGGTAAAACGTGGAATAGTGGAAACAATCTGCCCGGTCAGCGGGTCAAATAACGACAATCGTTCCACTTCCTCGTCAAACAGTTCCACGCGAAGTGCAATGTCATCCGATTCTGCCGGGAAGATATCTATCACCTCGCCACGAACGCGGAAAGTACCACGCTGGAATGCTTGATCATTACGAGCGTATTGCAGCTCCGCCAGTCGGCGCAGAATCGCGCGCTGATCGATAATCATACCGACCGTGAGATGGAGCATCATCTTGAGATATAAATCAGGATCGCCCAGACCATAAATCGCGGAAACAGACGCCACCACAACCACATCACGCCGCTCCAGCATCGCTTTGGTGGCGGACAAACGCATCTGCTCAATATGTTCGTTAACCGAGGCATCTTTCTCAATGAAAGTGTCGGAACTCGGTACATAGGCTTCCGGCTGATAGTAGTCGTAGTAGGAAACGAAATATTCCACCGCGTTTTCCGGGAAGAACTCTTTCATTTCGCCATACAGCTGGGCCGCCAGCGTTTTGTTGGGCGCAAGTACCATGGTTGGGCGCTGAAGGTCAGCAATGACATTGGCAATGGTGAAGGTTTTCCCGGAGCCAGTCACGCCAAGTAACGTCTGGTGCGCCAGGCCATCTTCCAGCCCCTCTTCGAGACGTCGAATCGCCTCTGGCTGATCGCCAGAAGGTTTAAAAGCGGAATTCAGTTTGAACGGTTTACTCATGAGTCGCTACCTGAAGGAGTTGGGCGGGCAGGTATGTAATTTTACTCGTCGTACTTAATTATGCCAACAAATTATACTGGATAAAAAAACAGTTCATCACCATAATATTTCTGTTACAGCGTAAACTCCGCTGTCAATCATGAGCAAAATTTACTCTGTGGCGAGATAAAACTCCGGCCTTACCGGGTTATCCCCAAAGCAACGGCTTTTTTAACATTTGTCAAGATGAGTGATGACAGTTTTATGGCAAGAGATGCCTGTTCAGTGACTGCCATTGCATTTATCTAACCAGTTAAAAAATAAAAGATATTTCTTTGAGCCGTCTTTAACGCCAATTCGCGTGCAAGCCGCGTATTCTCTCGCTTGCCTCGTGTTTTCTAACTCTAATACACATGGTTATCCACAGGAATAGTGGATAACTGTCTCCAGCCCCTATCCCCCGCCGCCTGGGCAATTCCCACATTCGCCGCATGAGGCGGAAGTAAAATTTTTTTCGTGATTATTTTAGAATTTAATTGGTTAAATTGCAGTCAATCGAAGACGCGATCTCGCTCGCAATTTAACCAAATACAGGATAGCTACAACAAGGCAAGGTTTATGTACTTTCCGGTTGCCGCATTTTCTGGATTTTCTGCAAGCCAGGGGATCTCTCCCAGCAGCGGCGCGGGAATCATGCGGGTGAGCGTGGTCATATATTCAGCGTGACGTTTTCCCGGAGGCGTAACATCGTTCGCCACCCAACCCGCCAGAGTCAGTCCGGCGTGTTGTATTGCCTGCGCAGTCAACATCGCGTGATTAATACAGCCGAGTTTCACACCAACTACCAGTATCACCGGCAGTTGTTCCTGTGTTACCCAATCTGCAAAAGTGAAAGTGTCAGAAAGCGGCGTAAACCAGCCGCCAGCACCTTCCACTAACACCCAGTCAGCCTGTTGTTCAAGCGCGCGTAATCCGGCGCTCATTACCAATGATTCTATCGGCCTGCCCTCTTGCGCGCTGATGATGTGCGGCGAGGTGGGTTCTGCGAAGGTGTAAGGATTTACCGTTGCGTAATCCAGCTGCAGGCTGCTGTTGTGCTGTAACGCCAGCGCGTCACTATTACGTAAGCCTTCCGGGGTCTTTTCGCTGCCAGAGGCGACCGGTTTATAACCTGCCGTCCGATAACCTACTGCCTTCGCGGCTTGTAAAAGTGCACAACTGGCGACGGTTTTCCCCACTTCGGTATCCGTTCCGGTGACAAAATAACGTTTACTCACGAGCAATCACTCCCAAAAAAAGATGATACGTCAGAGGATATCGCCCCTGTTGTTGCGGCCAGGCCAGTTGCAATCGCTGCAATTGCGAACGCGTTAATATTCGCGGGTCGCGCCCTTCATGAAGATGCGTAGCACCGATGCCTTTCAGCGAACGCATGGCACTGAGCGCATCATCAAACCACAGCGTGATGGGCTGAATATGATGTTGATAATGCACGCCGTTCAGCGACTGTTCGATTTCATCTGGCGGTAAAAAGCGATTAGCATGCGGACGCTCGTCCACCGCCTGCCACGCCTGATGCAGTTCGGGTAACGATCCCTGCACCAGCGTGGTAAACGCGACCACGCCTTTGGGGCGCACCACCCGATACAGCTCGCGGAGTGCCGTGGATAAATTACCGCACCACTGCACTGCGAGATTGCTCCATGCAAGATCGAACGTCGCAGTCGCTAACGGCAGGGATTCGATATCTCCCGCCAGATAATGGTCTGCGGCATCCTTCTGGCGTGCCTGAACAAGCATTGGCGGCGAGAGATCTAAGGCCGTCACCTGCGCGTGACGTTCCCGCCAGTGGCGGCTCATCCAGCCAGGTCCACAACCCGCGTCCAGTACGTGGGTGTATTTACGCTGTGGAAGCATTGCCAGTAAGGCGTCAGCACTCTGGCGCTGTAGATCTGCATGTTGCTCATAGTGTGCGGCTGCCCGACCAAATGCCGCTGCAATGGCTTGTTTATTAACCGTTGCCATGCAGCACCTCCAGCAGACGGTCGATATCCTGCATTTCATGCGCAGCGGTTAGCGTTAAGCGCAGTCGCGCAGTACCAGCGGGTACGGTTGGCGGGCGAATCGCCGTGACCCAGCAGCCTTGCTGGCGCAGTTTTTCTGCCAGTTGTAACGCACGGCTGTTATCACCGACAATCAATGGCTGGATGGCGCTGCATGAATCAGCAAGCGTAAACGGCAAATCCTGTACTCCGGCACGAAAACGCGTAATGAGTGCCGCCAGTTTTTCGCGCCGTGCATCACCCTCATCACTGCGAATGACCGCCAGCGACGCACGTAATGCCTGCGCCTGAGCGGGCGGCATACTGGTGCTGTAGATAAGGTGGCGGGCGAATTGCAGCAGATAATCCGCCACCGTACTGGAGCAAAGCACCGCTGCCCCGCTGACGCCAAATCCTTTGCCAAAAGTCACTACCAGCAATTCTGGTTTTACCTTTTGCAGCCAGCAGCTGCCGCGCCCCTGCTCCCCGATAACGCCCGTGCCGTGGGCATCATCGACCATCAACCAGCCATTGTGCTGTTGCGTTACCTGCTGGATTTCCGCCAGTGGCGCACTATCGCCGTCCATGCTGAACACGCCTTCTGTCACCACCATTTGCTGCCCCGGACAGGGGGAAGCAAGCAATCGCGCCAAATGAGTGACATCGTTATGAGCAAAACGGCGAAGCTGCGACGGGCTTAAACTGGCAGCTTCCAGCAATGAGGCATGGCTAAGCCGGTCGGCAGCAATACGGTCCTCTTTCGCCATCATCGCGGCAATAACTGCCTGATTAGCGGCGAAACCAGAGATAAACAGCAGTGCCCGCGAATAGCCAAGCCACTCGGCCAGCTCTTCTTCCAGTGCCTGATGCACCACGCTATAACCGCTGACGTGACCGGAGCCGCCGCTACCGATGCCAAATTGCTCCGCCCCCTGCTGCCAGGCACGGATAATTTGCGGATGATGGCTTAAACCTAAATAATCGTTACTGGAAAAGTTCAGATACTGGCGATCATCCGCCACCAGCCAGCGTCCGGCTCCTTGCGCCACCGGATAACGGCGACGCAGGGCATCGGCAGCACGCCGCGCATCGAGCGCCGCGTTGATTTTCTCCTGCCAGCTCATAATGCTGCCGCGTTGTAATATTCGTCGGTGTCCGGGGTCATCAGCGCCTGTTCAAGACGTTGCTGTTGTTCCTTATCCCCTGCCAGCACGGCAGTTTGCTGCGGATTTAGCCCCAGTTTGCGGAACAGTTGCAGGTCTTTATCTTCTTCCGGATTCGGCGTGGTCAGCAGTTTGCAACCGTAGAAAATCGAGTTTGCGCCTGCCATAAAGCACATCGCCTGAGTCTGTTCGTTCATCTGCTCGCGTCCGGCAGAAAGGCGCACGTAAGAGGTTGGCATCATGATCCGCGCGACCGCAATGGTGCGAATAAAATCAAAGGCATCGACATCATCGTTATCGGCAAGCGGCGTGCCTTTCACCTTCACCAGCATGTTGATTGGCACGCTTTCCGGCGGCGTCGGCAGGTTTGCCAGTTGCAGCAATAATCCGGCGCGATCTTTTACCGTTTCGCCTAAGCCCACAATGCCGCCAGAACAGACCTTGATCCCGGCATCGCGCACTTTTTCCAGCGTATCGAGGCGTTCCTGATAAGTGCGTGTGGTGATGATATTGCCGTAAAACTCCGGCGAGGTGTCCAGGTTGTGGTTGTAGTAATCCAGCCCGGCGTTCGCGAGGCGCTGCGCCTGAGATTCACTCAACGTGCCCAGCGTCATACACGCCTCCAGCCCCATCGCTTTTACCCCCTGCACCATTTGTTCCAGGTACGGCATATCGCGTTCGTGGGGATTCTTCCACGCCGCGCCCATACAGAAGCGCGTCGATCCTGCCGCTTTCGCTTTGCGCGCCGACTCCAGCACCTGTTCAACTTCCATCAACCGCTCGGCTTCCAGCCCGGTTTTGTAGCGCGAGCTTTGCGGGCAGTATTTGCAATCTTCCGGACAAGCTCCGGTCTTAATCGACAGCAACGTGCTGACCTGCACCTGACGAGGATCGAAATGCTGGCGATGCACCTGCTGCGCTTCAAACAGCAGATCCAGCAACGGTTTTTCAAATAATTCTGTGACTTGCGACAATGTCCAGCGTGGGCGGTGAGCCATGGGGCTTCTCCAAAACGTGTTTTTTGTTGTTAATTCGGTGTAGACTTGTAAACCTAAATCTTTTCAATTTGGTTTACAAGTCGATTATGACAACGGACGATCTTGCCTTTGACCAACGCCATATCTGGCACCCATACACATCCATGACCTCCCCTCTGCCGGTTTATCCGGTGGTGAGCGCCGAAGGTTGCGAGCTGATTTTGTCTGACGGCAAACGCCTGGTTGACGGTATGTCGTCCTGGTGGGCGGCGATCCACGGCTACAATCACCCGCAGCTTAATGCGGCGATGAAGTCGCAAATTGATGCCATGTCGCATGTGATGTTTGGCGGTATCACCCATGCGCCAGCCATTGAGCTGTGCCGCAAACTGGTGGCGATGACGCCGCAACCGCTGGAGTGCGTTTTTCTCGCGGACTCCGGTTCCGTAGCGGTGGAAGTGGCGATGAAAATGGCGTTGCAGTACTGGCAAGCCAAAGGCGAAGCGCGCCAGCGTTTTCTGACCTTCCGCAATGGTTATCATGGCGATACCTTTGGCGCGATGTCGGTGTGCGATCCGGATAACTCAATGCACAGTCTGTGGAAAGGCTACCTGCCAGAAAACCTGTTTGCTCCCGCCCCGCAAAGCCGCATGGATGGCGAATGGGATGAGCGCGATATGGTGGGCTTTGCCCGCCTGATGGCGGCGCATCGTCATGAAATCGCGGCGGTGATCATTGAGCCGATTGTCCAGGGCGCAGGCGGGATGCGCATGTACCATCCGGAATGGTTAAAACGAATCCGCAAAATGTGCGATCGCGAAGGTATCTTGCTGATTGCCGACGAGATCGCCACCGGATTTGGTCGTACCGGCAAACTGTTTGCCTGTGAATATGCAGAAATCGCGCCGGACATTTTGTGCCTCGGTAAAGCCTTAACCGGCGGCACAATGACCCTTTCCGCCACACTTACCACGCGCGAGGTTGCAGAAACCATCAGTAACGGCGAAGCCGGCTGCTTTATGCATGGGCCAACTTTTATGGGCAATCCGCTGGCCTGCGCGGCAGCAAACGCCAGCCTGGCGATTATCGAATCCGGCGAATGGCAGCAGCAGGTGGCGGCTATTGAAGTGCAGCTGCGCGAGCAACTGGCACCAGCCCGTGATGCCGAAATGGTTGCCGATGTGCGCGTACTGGGGGCAATCGGTGTGGTCGAAACCACTCGTCCGGTGAATATGGCGGCGCTGCAAAAATTCTTTGTCGAACAGGGTGTCTGGATCCGGCCTTTTGGCAAACTGATTTACCTGATGCCGCCCTATATTATTCTCCCGCAACAGTTGCAGCGTCTGACCGCAGCGGTTAACCGCGCGGTACAGGATGAAACATTTTTTTGCCAATAACGGGAAGTCCGCGTGAGGGTTTCTGGCTACACTTTCTGCAAACAAGAAAGGAGGGTTCATGAAACTCATCAGTAACGATCTGCGCGATGGCGATAAATTGCCGCATCGTCATGTCTTTAACGGCATGGGTTACGATGGCGATAATATTTCACCGCATCTGGCGTGGGATGATGTTCCTGCGGGAACGAAAAGTTTTGTTGTCACCTGCTACGACCCGGATGCGCCAACCGGCTCCGGCTGGTGGCACTGGGTAGTTGTTAACTTACCCGCTGATACCCGCGTATTACCGCAAGGGTTTGGCTCTGGTCTGGTAGCAATGCCAGACGGCGTTTTGCAGACGCGTACCGACTTTGGTAAAACCGGGTACGATGGCGCAGCACCGCCGAAAGGCGAAACTCATCGCTACATTTTTACCGTTCACGCGCTGGATATAGAACGTATTGATGTCGATGAAGGTGCCAGCGGCGCGATGGTCGGGTTTAACGTTCATTTCCACTCTCTGGCAAGCGCCTCGATTACTGCGATGTTTAGTTAATCACTCTGCCAGATGGCGCAATGCCATCTGGTATCACTTAAAGGTATTAAAAACAACTTTTTGTCTTTTTACCTTCCCGTTTCGCTCAAGTTAGTATAAAAAAGCTGAATGCGAAACATTAAAAAACATTAATATCAATGTGTTACAATATCATTGGTCTAAAAAATAGACTACATGATGCTACAAAACACAACATATCCAGTCACTATGAATCAACTACTTAGATAGTATTAGTGACCTGAGACAGAGCATTAGCGCAAGGTGATTTTTGTCTTCTTGCGCTAATTTTTTGTTATCAAACATGTCGCACTCCAGAGAAGCACAAAGCCTTGCAATCCAGTGCAAAGATTTGTGTGCCTCAGTTTTGTCTAAGTGTTCTACTGAAAACATAGTAAAATCGGCAACAGCTGGAAATCATTCAATACTCGCACTATCGAAAGTTCACCAGCCAACCGCAGCACGTTCTTGCATACGACGTGCTGCGGTTTTCTTTATGATTTATGCACAATGGACAATTTGAAATTATTGATGATTGTATGGTGCATCGTTTTCTGAACCTACACTGATTTTTTGGTATAGCCTTGCCTAGTCAGTCTTACCGGATCAAACTCTTCGCTATTGCAATACTAACCAAAATCATCAATTTGACAGCGATTAACCAGAATAATAGTATACTATCACCAGTAAGAAATTATCGTTATTTGTAGCGATACATATTATATATATATCATTCTCAGGTGCGTACATGATTATCAACCAGGTACCTATAAAAATAAAAATCTTTATCTTTTTATTTTCATGCATCTCTATTATATTTTTGTTACTGCATGCAAATAATGGAATATACATAACACAAACAACACAAATAAGTTATAGTGTTTTCATTATTGGGCTTTTTTTCATAAACCTGATGATTTTTATTTTTCTATTGCTTTACTATGTTTCTAATCAGAGACAAAGTTATCTCTTAATTCTTTCATTCGCGTTTTTGAGCAACACGTATTATTTATTAGAAGTGGCTATTATTTCTTTATCTCCGTTAGGTAACGATTTATCTACAATCTATCAGAAATCAAATGATATCGCAATATATTATCTATTCCGTCAGTTCAGCTTTATATCTATAATCTTTCTGGCTGTTTATTCCACCAATGTTAAAAATAAAAGTGTTTTAGAAGATAAAAGAAACATAATAATTGTTGTTTTGTCAATATTAATTCTTTTTATTACTCCGTTTGTAGCAAAAAATCTAAGCAGTGACAATATAAAATATAGTCTTAATATTATACAATACTCGCTGAATCGTCATTTGCCGACGTGGAATATCGTGTACACCAAAATAATATCAGTATTTTGGCTTGTATTACTTATCAGCTCATGCATCAGCATACGTAATTACTCAAAAATATGGTTGTGTATAATACTTATTAGTATAGTGTCAGTATGCAATAATCTAATTTTATTGTATTTTATTGATAAATCCCATCCTGCATGGTATATGACAAAATTTCTTGAATTGATATCAATGATTTATATCATTTCAACACTCATGTATTATGTTTTCAGGAAATTAAATCATGCTAATCATATGGCAATTCATGATCCACTAACGAATACATACAATAGAAGATACTTTATTGACTCATTGAAGAATATATCAAAACACCATGATTTCTCAGTAATAATGTTAGATATTGACAGTTTCAAAAGCATCAATGACAAATGGGGGCATCATATGGGTGATCAAGTCATAGTAATGGTTACCAGAATAATAAAAAAATCCATCAGGAAAGAGGATGTATTAGGGCGCTTAGGCGGTGAGGAGTTCGGTATTATCATTAAAGGTAATACTCAAAAGCTCTTGCTATCAATTGCAGAGCGAATCAGAAAAAACATTGAAGAGCAATGCTCGGAAAAATTATTATCGCATGGACCTGAGAAAATAACTGTCAGTATTGGTTGCTTTACTTCAAAAGAGAATAATCTCAGCCCATCTGAAATGTTAGTCAATGCCGATAAAGCGTTATATCAAGCCAAAAGAACCGGAAAAAACAAGGTGATAATTCACTCAAAATAAACACCTTTTTAAAATACAGCCCCAATAAACTGCAGAATATTATCCCATATAATATCCTGCAGTTCGTAATGCACTATTCGATAATGGGTACTGTTGGCCATTCAATATCCGGTGCAGTTGTTGTATTAACACGGTTCAGCAACACCCGATACTTCTTCCAGGCTTCCAGCAACGAGTTTTCTTCCTCCGTTGCGATCTCCAGATCTACAGCATCCTGAAGTGGCGCAATATGCTCACTGAATTCCTGGATGTAGAACTATGTGGTGACGGTCTTCCAGCCATTCGGCTCCTGCTGTATCGAAGCATACCAGGCTATTTCAATATCGCTATGCTGCGGCAGCATTTAACCCCTTGTAATTCATCACCATAATTGATTTAATTCACAAACAAAACTATAACATGGTGAAATTAATGAAAAAAAACACAGATGATGGGGCTAAAATTTACACACCACTTACCCTAAAGCTTTATGACTGGTGGGTTTTGGGAGTATCAAATCGGCTTGCATGGGGATGTCCTACAAAGGAACACCTTCTTCCACACTTTCTGGAACATTTAGGTAACAACCATCTGGATATTGGTGTTGGAACTGGGTTTTACCTTACTCACGTACCTGAGAGTAGTCTGATATCTTTAATGGATTTGAACGAAGCTAGCCTGAACGCGGCATCTACAAGGGCTGGGGAATCAAAAATTAAACATAAAATTAGCCATGATGTTTTTGAACCTTATCCCGCGGCGTTACATGGTCAATTTGATTCCATTTCCATGTTTTACCTTCTTCACTGCCTGCCTGGAAATATATCTACAAAAAGCTGTGTAATACGCAATGCGGCGCAGGCCTTAACTGACGATGGAACTCTATACGGAGCCACAATTCTTGGCGATGGAGTTGTGCACAATAGCTTCGGTCAAAAACTGATGCGCATTTACAATCAGAAAGGCATCTTTTCAAACACAAAAGATTCCGAAGAAGGCTTAACACATATACTCTCAGAGCATTTCGAGAATGTTAAAACCAAGGTTCAAGGTACTGTAGTAATGTTTTCCGCTTCAGGGAAAAAATAGCATCCAACCGCAGCTCGCTCTTGCTTAAGACGTGCTGCGGCATAATCCCAATGATTACTCCCTGACAGGGTTCGTAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTCTTCCAGACTTCCAGCAACAAGGTTTCTTCCTCCGTTGCGATTTCCAGCTCAACAACAGTCTGAACGTACCAGGAACAGCCTCCTTCAGGGCTTGAAGGATATCAATGTTCGCTTCCTGTTAACTGCCCGACAAGTGCAACCAGTTCGCTTACCTGATTTTCCAGAGTGGTGATCCGGGTGCCTGCTGTTGCCAGATTTTCACGTAACGTTGTATTTTCCTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAACCCCGTCACGGCGGAGTAGTCAACATTAAGATAGCGAGTTTCTTCGCGTAGCTCGTTGCCGTCAACGGTCGGACCTTGCAACTCTTCACCATAATGAGTAAACGATCCCACAGCTTCAGGTATCGCCTCCATTACTTCCTGTGCAATAACGCCAGCATAAGGCATTCCGTTTTCCTTGAGCGTGTAGGTGTACCCGTTCATTTTACGGATTGCTTTCGTCGCGTCGCTGATAACGAGAATATCGTCTTTAAGGTCGCGGTCTGATGACTGATTCAGCGTTGTGCAATTAATAGCGCCATTTACATCAAACAACTGGCCTGCTGACGTTTTTTGCGCATAAAACAGATACGCAGCAGACGTTCCAACCTCAAAAACGTTTTGTCGAGTACTGGAACCCCACACCCTGACAGAAAACGGTAGTTCTGCATTACCTGAGTTCTGTAAAACAAAACGATTGCCAGTCCCTGATTGTTTTGTAAGGGTTAAATCAACTGTTGAGTTAACCTCATCCTTGTTGATAGTGAGCGCCTGCGCTTTATCAGTTCATCCAGCGCGGCTGCTTTGTTCATGGCTTTGATGATATCCCGTTTCAGGAAATCAACATGTCGGTTTTCCAGTTCCGGAAAACGCCGCTGCACCGACAGGGGGATCCCGTCGAGAATACTGGCAATTTCACCTGCGATCCGCGACAGCACGAAAGTACAGAATGCGGTTTCCACCACTTCAGCGGAGTCTCTGGCATTTTTCAGCTCCTGTGCGTCGGCCTGCGCACGCGTAAGTCGATGGCGTTCGTACTCAATAGTCCCTGGCTGGAGATCTGTCTCGCTGGCCTGCCGCAGTTCTTCAACTTCCCGGCGCAGCTTTTCGTTCTCAATTTCAGCATCCCTTTCGGCATACCATTTTATGACGGCGGCAGAATCATAAAGCACCTCATTACCCTTCCCACCACCCCGCAGAACGGGCATTCCCTGCTCCTGCCAGTTCTGAATGGTACGGATACTCGCGCCGAAAATGTCAGCCAGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAAAGCCCGTTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGGGGGTGTTTTAAATAAAAACATTAAGTTACGACGAAGAAGAACGGAAATGCCTTAAACCGGAAAATTTTCATAAATAGCGAAAACCCGAGAGGTCGCCGCCCCGTAACCTGTCGGATCGCCGGAAAGGACCCGCAAAATGATAATAATTATCATCTGCATGTCACAACGTGCATCTACGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATCATATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGACACTAAATACGGGACAACCTCATGTCAACGAAGAACAGAACCCGCAGAACAACAATCCGCAACATCCGCTTTCCTAACCAAATGATTGAACAAATTAACATCGCTCTTGATCAAAAAGGGTCCGGGAATTTCTCAGCCTGGGTCATTGAAGCCTGCCGCCGGAGACTGTGCTCAGAAAAAAGAGTTTCTCCTGAAGCAAACAAAGAAAAGAGTGACATTACTGAATTGCTCAGAAAACAGATCAGACCAGATTGAAGCAATTTAGATAATCGTGCAGACTACGCCCCTCATATCACATGGAAGGTACTACAATGGCTCAGGTTGCCATTTTTAAACAAATATTCGATAAAGTGCGAAATAATTTAAACTATCACTGGTTTTATTCTGAACTAAAACGTCACAATGTCTCACATTACATTTACTATTTAGCTACAGAGAATATTCATCTTGTTCTTGAAAACGATAATACGGTTTTAATAAAAGGACAGGGTAAGGTTGTAAATGTAAGATTTTCAAAAAATAAATGCCTTATAGAAGCCACCTTAAAAGGATTCAAATCAGGAGAGTTATCATTTTACGAATACAGGAAAAATCTTGCTACAGCAGGGGTTTTCAGATGGATTACAAATATCCACGAAAACAAAAGGTATTACTATACCTTTGATAATTCATTACTCTTTACTGAGAACATTCAGAACACTACACAAATATTTCCGCACTAAATCATAACGTCCGGTTTCTTCCGTGCCAGAACCGGACTCGCTGGCATGATGAAATATGTGTACCCGGTAACCCCGGTGTGCATCGTTTTTGATTATTCCCGCACACTCGCGCAGAAGGAGTTCCCCGTCGGGCTACGGTCTCTGTTAATACGGGAATACGGCGACGATACAGCGCATGATGTGTCAGGCTTGAATACCTTTATCCTTTAAAAGGGATATCAGTTAAGTTATCCCGTGTAGGGTATAAACCATTATCAAAGCCACTCTGTAGGAAGTGGCTTTTGTAATGGCAATAAAAAGCCCCGCGAATGCGAGGCTAAATCCTGGTATTTGTAATGACTGGTTCTTATCTCAACGCAGCCCCTTACCGCGCGCAAAATGCTCAATATCAAGCATCAGCAATGAGATGTTTAATCTGGATTCACTCCAGAAGTGAGCACCACCCTGTCTACAGAGCCAGATGTGAAGGATGATGAGTAAAATTATCGCTATCATCGAAGGCATTGCGTCCTGATGTATTCCTGAAGCGTTCTCAGTGCTGTTTGGTCGCGGATAATTCCGTCCCGGATACCGAGAACGTTTCGTCCAGCAACTGGAGAGAGTTCGACGGTGGCATCATTGCCCATGCCGGAGGCGCTGGAGGTTTCGGCTGAGGATGGCACAGGGCATTTTCCTTTGACGAGCACCCGACCACCATTATCAAGCTTGCGCCGAAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTATCATCGAGTGCATCAGCATCACGCTGGCGCTGCTGCATGTCAGTAATGGTGGCGTTCGCCTTCTCCAGTTCACTGGCCTTGTTATCGCGCTGTTCTTTGTAGGCGATTGCATTATCACGGTAATGATTAACAGCCCATGACAGGCAGACGATGATGCAGATAACCAGAGCGGAGATAATCGCGGTTACCCTGCTCATTGTTGCCCCCACAAACAGACCTCACGCTCAATCTCACGACGAGTCATCAGGCCTTTCCATTGCTTACCGCCAGCGTATGCCCAGCGACGTAGCTGGTCACATGCGCCCTTGATATCGCCCTGGTTTATTTTGCGAAGAAGAGCGGATGTTCTGAAATTGCCTGCGCCCACGTTATAGACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCTACCGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCATTCTGCTTCGGTATACGTTTTACCGGGAATGATGTCTTTTCCTGTATGCCCGTGACATACAGTCCATACGCCAACGATATCTTTGTATGGTATGTAGCTGACACCTTCCAGACCATCGTTACCACTTGGGCCAGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACGGCTTTTCGTAATGATGGAGGCATTATTCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAAGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGCGTGACTTTATCGAGCAGCTGTAAAAACCAGTAGCCAGCACTGCCTGCGGAGGTGCCGTAGGCAATGCCCGTTGTTAACTTATCCATGGATTTCATAGCCTCACCTCCGCAAATAACGGATGGTGTACACGGTTCGGAACGAAGAGGAAAGGTATAGAAGTTACATTAGCGTAAGGCTTGAACATCTATTCAAAAAGAAAAACGCCGCGATTATTCTGGCGTAGCTGAAAGCATCATACAATTATCAAATACGAAAATTACAAAATCATTAAAACGCATCACGTTACATCATGTCTTTTTCTAAAAAAAATCTTGATGAATATTGATGGGGAGGAACACCAAAATACCTTCTGAAAACACTTACAAAATATGACGTGTTTTCATAACCGCATATCTCAGCAACTTTTCCAACAGAATATAAATTGTAGCTTAATAACCTTTCCGCCATCACCATTCGCTCTTCAAGAATTAACTTACTAAATGATAAGCCTTCGTGCTTTAATTTTCTTTTTAACAGACTTTCACTCAGATACAGTCTTGAAGATATATCACAAAGTCTCCATGCTGCAGATATATCCGTGTGAATAATAGCCTTAACTTTACTTCCTAAACTATTAAGACATCCAAATAAAAAACTTTGCACTATTTTCTCTGAAGATAAGATAGCAAGACATGCAAGTGATATTTGATTTCTAACAAAATCCACAGTTCTGCCATCACAATTCAAGCATGCAATCAAGTTCTTTAACAATGAAAAATCTTCACATTCCACCATCAAGTATGCCGGATAAAACCTTCTTACAGAAAAAGGTGAGAGTGTGTTGCTTTTAAAGAAATCATTAACTGTTTTTTCTTCAACATCTACGATCATTACATGATCTATATTTGATGAAAAAAATCTTTTAAATTGTAATCAATGAGAACAGCACTTCCTTTTTTAAACAAAATATCTTCTTTACCAATTCGGACATCAAACGAATTCAACACCAAAATGATAGAACATATGTATGGCATATTATCCACCTGATATCATTGGGGTTACACCAGGTAAGTGTAGGTGGAAAATCAATATTCGCCAGTTCAACAATAAGGAAAATTTCATTACATCACAAGTATAAAATTATGTATTTAACTCACAAAGACAAATTATTAAACCAATCTGTTATATTATATATAGCTGCGTGGAATCATAATATCATATATTTTGACTGGCATGTTTACCAACTTTAAGTTGCATCTCAATTGTTTCTTCAGCGTAAACAGAGTTTTTATACAAACTGACACTCTGGGTATCATAGTGTAGTTTTTACGATTGTAAATATCCTGCATGCAGGAACTCATCCTTTTGGATGATATCGCATACAATTAATTTACCATCAGTCTTAGAGCCAGTTCGTCCGGATAGGGATCGAAGTAATTTTGTGTAAGCAAGTAATCATTAGGATACTCACCCAGATAATGCTTCAGCAGAGTCAACGGCGCAAGAAGAGGTAATGTGCCAGAACGATAGTTAAGTATAACCTCGCTCAACTCTTTACGCTGGCGTGTACTTAAGTAGTTACTAAAATACCCCTGTATATGCATCAGCACATTCGTGTGATTTTTACGTGATGCAGGTTTTCTGAGAATCGCCATCAGCTTATCACGATACACCTCAAAGTATGATTCAAGGTCCGCCCACTCGTGTATTGCAGCCACAAATGGTCCCATATCTTTATAGCCTGCCTGACTATGCGCCAACAACTGAAGCTTATAACGACTATGAAAAGCTAATAACTCTCTTCTTGATAATTTCTCCTTGTAAAGGTGATTGAGCTCATGCAAAGCAAAAACTCTTTCAACAAAATTCTCACGAAGCACTGGATCATGTAATCGCCCATCCTCTTCAACCGGTAGCCAGGAAAACTTTTCCATCAAAGTGCTCGTAAATAGTCCCACTCCATCTTTACGACCTCGATTACCATTTTCATCATAGACACGCACGCGCTCCATGCCACAGCTGGGAGATTTAGCACAAACCACAAACCCCGATACATCCTTTAATTTGTCCATATAAGAACGACTAAACTCTGTCATTCTCTCTGTCACATCCTCATTCTGGTCGTGGCTGAAACACATCCGTATATTTCCTTGCATCGAGCGCACAAGACGTAGAGCAGGACGCGGAACTGGCAGCCCTATAGCCATTTCCGGACATACTGGTCTGAATGTTACCCATTCCACTAATTTGTCCATTAAAAAGTCAGCTCTTTTGTGACCACCATCAAAACGAACAGCAGAACCGGCCAAACAACCGCTGATTCCAATCACAGGTTTTTTTATCATATTCTCCCCCTTGACTAATTCATTAACACATAAACTGTGTAGTGCACGGAATAAATTGCCTTTCTGGCGTCATCACTGACAATTTTTCTGTTATGGACTATTCCTAATATAGTATGAAAGTTCTTTAAGTGATCGGTCGTAATCATCTATCTTTCATACTTACTCTCAACTATCAAAAGTACAGGATTTATTATGAAGTTATGGCCTGTGTTGACTGGCATTGCACTCTCTTTCACTCTTATAGCATGTAAGGCCCCGACACCACCTAAAGGTGTGCAGCCGATTACAAATTTTGACGCCAACCGCTACCTCGGAAAATGGTATGAAATAGCTCGCCTCGAGAACCGGTTCGAACGTGGTCTGGAACAGGTCAGCGCTACTTATGGAAAACGGAACGACGGAGGGATTCGTGTACTTAACCGTGGATACGATCCAACGAAAAATAAATGGAGCGAGAGCGAAGGTAAAGCATACTTTACTGGAGATACTAAAACTGCAGCATTGAAGGTTTCGTTTTTTGGCCCCTTCTATGGTGGCTATAATGTAATCAAACTGGATGATGAGTATAAGTATGCTCTTGTCAGTGGTCCGAACAGAGAATACCTATGGATTCTGGCAAGGACCCCAACTATTCCAGATAAAGTAAAAGCAGACTATGTGCGAACCGCTCAAAAGTTGGGATTCAATGTCAATGAATTATTATGGGTTAAACAATAAAATCCCTACCCGAAATAATACTTATTAGAAAAAAACCAGCCTTTGGGGAGGCTGGCTAAATCAGGAAACAAGCTGTTATATGATAATAACTACGTTGTGATTCCAACATTTAAAATGTTAGACTAATGACAATCAGACAGCAACTTTTCCTTTAATTATTTCGAACAATCAGCATCCATCTCCAATCGGAGATCCAACACCATCAGCATACCCTCCACTACGCCCTCAGCTTTCTGAAGCATCCTGCCAACCCAACAATCAGATCGCCCATGCTTACGTGCAAGCGCCATAAAAGTCATGCCGCCGACATAATAGTCCACCAATAAATCATGCAAATCGCTGTTGTTCTTTTTCAGACGGGCCATGCACCCGCAAATGATCATCGCATCATCGTCACAACATTGCGGGCGAGATTTTACTTTTGAAGGAATTAATCCCTTAAAACCGGCGGCAATGGACGACCAAGTCACATCTTCATGATTATTAGCCGCCCACGCTCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTACTCCACAAAAATCAGACCAGAACGCCAATCACAAGCAAAAATCAACAAAACAGTATTAGTTGATTGTTATCTCTGACTTCATACTCCTGCTCCTGTCAGGGTTTTGGCGTAATTCTTCAGTATTCGGTAATCGGTCAAAACAGAACCGGGGAAACGATATAAGCGCAGATGCCCCCAGCGGTGGCGAAGAAGTTCTGCCATATAAAACTCAAACATCATTCATTCCCCATTTCGGTGATGGTCAGTTCCAGCCTCCCACCTTTGGTAACAGGCATCTTCACAACGCGGTAATCAACGACCTGAGCATCATCCAGCCAGAAACCTGCTTTAGTGAGTGCGTCAAAAGCGGCTTTTTGCAGATTATCCAGGTCACGGCGACGGCGATCCGGCATGTGGCACTCAATGCGGATTTTCACAGGCATAGCCAGGCCGATATCCAGCATTGCGTTTTTAATGATTCGGGCGACGTTATCGCGGTATGCCTGCCCCTCTGCGCTGACGTGCGTGCGCCCGCGATTATGGCGGTAATAGCGATTATTGCTCGGAGGCCAGGGTAATGTGATGTTGTAGGTATTCACGCCTTGATTACCCCCTCTTTCATCCAGATAACCTGCGTTCTCGCCATACCTTCCAGCGCGCATTCTTTTGCATATGCGGCATCGACAAAATGTGTGCGGCGGTCGATTTCGTCGTGGCAGGCAGAACATGCAATGGTGGCAATCAGGTCTGGCGGTTTGGTACCGGTGCCGCACAATCCAGTCAGCCGGATATGTGCCAGTACAGACGTTTCAGGGTTGCCATTACATACGCCAGGGGTTCTTACCTGGCATTCCCGACCACGCGCTGCTTTTCTCAAATCAGCCATGATTCCTCCTTGCTGCCAGTCGCAACCATTTTTTATCAACCAGGCTGGCGGTATATCCGAGCAGTGTTGGTATTTCGGATGGCTTCAGCTCAGGTTTACGCTTACGACGATTTGGTACTCTGTAGATGTGTCCGTTCATGACACGAATAAGCGGTGTAGCCATTACGCCTCCTGCTTGTCGCGCAGCAGCTGGAACTCGCAGCTCTGCGGAATAGTCAGGTGGCAGCCAATATTCATCGCCCAGGCTTCAACCTTACACAGGAAGACATACATCTCTCCGGCATCAAGATCGGAGGTATGGCGTAACGACTGGATAGTGGTGATTTCACCGGTTACGACATCAACCAGTTCTTTGGTTTCATAACCGAGGTATGTGTGTTTGAGAGCTTCTTTTACCCATGCTGCGGTAGCGAACGATTTCCCCCTGCTGATAAGGTATTCACTGATTTCGCTGTACCACATGTGGCTGAGTGCATTCTGGGAAAGACTGCGTTTCTCACGCCACGGTTTAAGCACCATGCGAAAGCATTTTCCGTCCTCCAGATAAGGCTGGATCTGCTGGCCGATAGCGGTGAAGTTGCCGCGATGTAATTTGATGCCATCTTGTGGTAGGTTCACGCTTCACCTCCGCAGAGGTCAAACGCAGGATGCAAAAAATCGCAGGTGCATTTCTGCATCTGTGAAGGGAGAAGAGAGTTTGGATTGTATGTGCGCATAAACGTCCCCGTTTAGCGCAGAAGTCACCGGAGTTGTTCAAGCTCCGATGACTTTATTATTACGAATTGATTTTACAAAATCAAAAGGTATGTTAGTGACGCGGGTCTGTTATTATGCGAGAAGGGTTTCCGTATAAAACAAGGACCTTACTTCCTTGAGTAAATAACGGATCTTTGCCTTGAACAATGGTCATTAAATTCCCATTCTCAGTTTCGACAACATATTCCATGCCTGTTTGTTTTGTTGCTGAAGATTCGATTGCTGCCCCGGCAATACCACCAATGACTGCACCACCAACGGCACCAACGATATTAGAACGAACTCCCCCACCAAGCGCAGAACCAGCGGTTGCCCCCACGGCAGCCCCAGCAGTCCCGCCTAACGCGGAAGTCCCACTGATATCAACCCCCCTGGCACTAATAACTGTACCAGCGATAGTTCGATTAACCATGCCCACAGAGCCAACAGAATAACTATTTGGCGATATATTTTGTGCGCATCCAACCAACACTAAGAGTGGAGCAATTACGAATAATCGCTTCATTTAGCTACCCTAACAGGAAACATTGGACGAGAAAGATCAACACTTTCTAATGCTTGCAAGAACTGCGTTATGTTGTTTTGCACCGCGCGATTAACAGATTCGCGTGCTCGAACAATACCGTAGAATGCGTAACTGGCTGGAACAGTACCGGTAGACTCAATATCCTGCGTATATATAATATCACCATTCGCACGGTTGATTATTTCATACCTTGCAATTGCTTTAGTTGTCATTGAAACACCAAAAGCAGGAACGTCAAGAGCCAACACTTTAACATTTAAGCTAACCGTATTTGGTGAACTATCACGAAAAATAGTCATTCGGTCGAGTGCTTCCTGCAAAGATTCACGCCAAATTGGAGTTATAGCCTCCATACCAGCAGTGATATCCCCTTTCTGCTCATCTGGACGAGCAAGTGATACCGTTAATGACTTAATTTCAGCATCTATTTTTTTCTGGCTAACTCCCACGTTAGGTGTTGAAAAATTCAATGGTGGCACACTAGCGCAACCTGTTAAAGAACCAATAATCATGGCTAATAATATTATCTTCTTCATAAATTTACCTTATTGTTATAACCAAAGGAATTATAAAGTAAAAAAGTTCACTATCACTAGCCATTAACGACATCAATTTCAGAGAAACATGGTACTCATTTCCACAAATTTGACACAAGTCATTTTCATCTACATATTCCATCATACTTGATGCATATGTTATTGAAGCCTCTATCCTATCCGTTCATAATAGCAATAGTTACCCGGGTGATAGTACCTCTATGATTACTCGTCTTTCTGATTGATTGGATTAAATATGCGCGCCAAAATTTATCAACTTTCGTTATGGATATTTATTTCGTTTCTAGCGATCTATGCCTTTATTATCTATAAAGGTTCTTATATTGGAGTAGCATTGCATCAAATTGCTTGGATCATCATTATTGCCTCTGGCTTGATTGCTAGGCTAACTAAACCAAAGCAAAAACCAATTTCGTCCAATAATTAGACATGTATTAAAAAATGATATTTTTATGTACATAGTCTATTGAAAATTGCCGCGATAAAATGCCAACACCCGCTTCATCGCGGCACTCTGGCGACACTCCTTGAAAATCAGATTCGTGCTCACCTTTCCTTCCCGTTCTTCCCTGGTAGCGAACCGGTAATACACCGTTCGCCAGACCTTACCATCAATAACTAAGATTCCTGTCCGCGCCATTTTAGCCGCAGCCTGGTTTATGCTGGTTACTGTTGCGCCTGTTACCGCAGCAACGTCCTGCGCACAGAAGCTCTTATGCGTCCCCAGGTAATGAATAATTGCTTCTTTTCCCGTCATACACTGGCTCCTTTCAGTCCGAACTTAGCTTTGATTTCTGCAATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCCCATGACAGGAAGTTGTTTTACTGGTTCAGGGATCGCCTCACCACGGTTAATTCTCGCAGTCATATGGACAAGCTCATCTGCGGCCTTACGGCGTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTAACCAGCCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACAGGCCTCGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACGTTTGGCAGTCCGGCGATAACGGATGCTTCTTCCCGGCACCATGCAACAAACTGCCCGGGTGATGGCAGGAATGGTCGATTCTGCCGACGGGCTACGCGCATTCCAGCGTTAACCTGTTCCATTGTGGTGATCCCGTTTTCCCGAAAAGCCAGCACCCACTGGCGGCGGATTTCGTTCAGTTCGTTCTGGTCCCGGTTAGCCAGGCTCGCTGGGAAAGTTGCCAGTAACTGGCTGAATACACCGTTGATTATCTGCGCTAACCATCATCGAGATCTGCCACGCGCGGCTCCTTTTGTGCCGCATCCGGCACTGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCAGCAGACCGGCTTTCGCCATTTCTGAACCTGTCATATCGCCCCCAGCATGGTAGTAACCATCGCCATCAATGGACCAGCCAGATCTGGGTCCACACGAAACATCGACACAATACCTTCACTAATTTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACAGCCTGTTTTGCCTCACTGAGTTCCTTTTCCATTTCAGCCAACCTAGCCATGAAGCTATCCTGCTCAACCAGGTAACCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATGGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAAGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTGGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATAGACGGCCTGCTGTGAAACTTCGCAAGCAGCGCCCAGTTTCTTTTGTGAACCAACGATATTGATCGCTGTTTTGATAGCTGGGTTCATAACAACCTCCGTGGTTAATTTGAATCAAGATTAAAACTATGGTTGTTTTTAGTCAACAACCATTTTCGTTTGATGGAATAAAACCTTGGTTGTACATTTGGACTATGAAAACAACACTCTCAGAAAGACTTAAAGAAGCCAGATTAGCGCGAGGCCTTACACAAAAGGCGCTTGGGGATTTGGTCGGGGTTAGCCAAGCTGCTATTCAGAAAATCGAAACAGGGAAAGCTAATCAAACAACTAAAATCGTGGAGATCGCGAACGCTTTGGGTGTGCGCGCAGAATGGTTATCTTCTGGCGTTGGAAATATGTCAGACAGTACAGTGCAACCAATACAATCAACTGTCAGCCATTCCAAATACTTCAAGATTGACGTTCTTGATATAGAAGTCAGTGCTGGGCCGGGAGTCATCAACCGTGAGTTTGTAGAAGTTCTACGCTCGGTTGAGTACTCGTTTGACGATGCTCGTCACATGTTCGATGGTAGGAAGGCGGAAAATATCCGCATCATTAACGTGCGTGGTGACAGCATGTCAGGAACGATCGAACCAGGTGATCTGCTGTTCGTTGATATCACAGTTAAATCTTTCGACGGTGATGGTATCTATGCGTTTCTGTACGACGACACAGCCCATGTAAAGCGCCTGCAAATGATGAAGGATAAGCTGCTGGTCATCTCTGATAACAAAAGCTACTCACCGTGGGACCCGATCGAGAAAGACGAGATGAACCGGGTGTTCATCTTCGGTAAGGTTATTGGGAGCATGCCGCAGACATATAGGAAGCATGGTTAAAGTGAGGCTAAAAAACAGTTACAGCAATAGGCCTGTTGTTTTTCTTTAAACACGCAGTGTTAAACCGCTCTTTGAGATGCGGAGTAATGAGATGGAAGACTTGAATCACATAAGGGTTAGTGATGGAGTGCGTAGCGAGCAGCAATAGTGCAATACCTAATGTCGTTGAAGTAATACGTCGCATCAATGAAGGTTCCACTCAGCCATTTCTTTGCAAATGTGATGATGGGCAGTTGTATGTTTTGAAGTCAAAACCATCAATGCCCCCGAAAAATCTCTTAGCTGAGTTCATTTCGGCGTGTTTGGCTAATGATATCGGCCTTCCTTTACCTGACTTTAAAATCGTATTTGTGCCAGAGGAACTTATAGAGTACTCACCTGATCTGCAGCAACAAATTTGTACAGGATATGCCTTTGCTTCATTGTTCATTGACGGTGCAATAGCGTTAACGTTTACGCAGTCAAGAAACGAAACGATCATCCCAGTCGAACAGCAAAAATTAATCTATGTTTTTGATAAATGGATATTAAATGCAGACAGAACGCTTACTGACAAAGGTGGAAACGTTAACATCCTTTATGACATCAGTAACGATAAGTATTATCTGATTGACCATAATCTCTCATTTGATCAGAATGCTGGACCTGAAGATTTTTCTGTGCACGTGTACGGCCCTGGTAACCGCAAATGGCAATATGATTTAGTGGATCGCGTAGAGTACCGCCAGAGGGTCGTTAACAGTTTACACAAGCTTCCTGCTATCCTTGACGAAATTCCAGAAGAGTGGATAGTAGATGAGGAGTTTTTACCTTTTGTCTGCACTACGCTAGACAAAGGTGATTGTGATGAATTTTGGAGCGCAATAGAATGACAACTCCATGCCTATATAGCATCGTTCGCTATGCGCCTTATGCGGAGACTGAAGAATTCGCAAACATAGGCGTACTTCTGTGCGCGCCAAAAGAAAATTACTTTGATTTCCAGCTCACAAAGCGAAATGACTCTCGTGTAAAGAATTTTTTCCATGATGATTGTATTTTCCCTGTAGCAAAAGACTCAATACAAAGAGAACTACAGTTCGCAAAAATGCATGCGACCCAGATTGTTGGACATCAACAACTTGCACAATTCTTCAGATATTTTACAAACAAAAAAGAATCAATTTTTCAGTTCAGTTCTACGAGAGTGATTCTCAGCGAAAACCCAAAAGAAGAGCTGGCCCGCATTTACAATAAATATGTAAACCACTCTGACTACACAAAAGAGCGCCGTGAAGATGTTCTAGCCAGAGAGCTAAAACGAAGTATCGATAGAATAGATGGATTGAAGAACGTCTTCAAACAAGCAACCATTGATGGGTATTTCGCAAAGTTCTCAATGCCATTGGTCGCCAAGAAGCATGACAGGATCCAATGTGCCATCAAACCTCTGGCATTCACTCAAGCTGAACCAGGAAAAATGATGGAGCATAGTGATACTTGGGTGATGAGAATAACTCGAGCAGCAGAAGAAAACCTGCTTTCACTTGATGACATTTTATTCACAATTGAAACTCCTGAATCACCAAACTCAGGCCAAAGCAAAGTTATTGACATCATAAAGAGAACTATGGATGCTAAGAAAATAAATCATATACCTGCATCCAACCACAAAGAAACTATTGATTTTGCAAAAAAAATACTTCCCCAAGTTTAAAATTTATTTTTGTATGTGATATTCCTTATTAATAACCCGGCCACCGTGCCGGGTTTTCTTTTGCCTCCCCTCATCACACAAACCGCTCAAAAAACCACCATAACCTCGCTTCAGTTATCGCTATGCGATTCAAGTCACAAAATAAATCCATCCTAAATACAACCAGTTATATCTAAAACAACCAATAAAACAACTTTTGTTGTTGACGGTAAAACAACTATAGTTTTAAATAGGTTCATCGCAACAACACAACGATACGGCAACCACCTGATTCACCGTTGCGATGACCGCTTAGATCCGCAGTTTGAATTTCAGCAGGCTTCGGGGAGTGCGAGGGGTGAAACGGACGCGTGAACGTCGGTGTGACCAGCTGAAATCAACTCAACATTTCATACCTTAGTCGCTTCAACGAGGCGGCTTAGTTATGACAACCGGCGGCCATCCACCGCCTGAATACGCGCAGAAGTCTCTATATGTTCAGCAGCCCAGCTTACGGGCAGGAGTTTTTATGGTTCATCAACATTACGGAACGCAGACCGTTAATCGCGGCGCGGTCATGCCAGGAATGCTGGTCAAACACAAAGATGGTACCTGGACTGCATCAGCTAATTTACGCGGACGGCTTTATCTGCATCGCGGCATCGAGCGCACTTATACCCGTGATTTGCTCGTGGAAGTTTTTCTCGACGGACGCGGTAACGGCCTGAATCACTAATCCCCTTTCCTGTTTTCCTAATCAGCCTGGCATTTCGCGGGCGATATTTTCACAGCCATTTTCAGGAGGTCAGCCATGAACGCTTATTACATTCAGGATCGTCTTGAGGCTCAGAGCTGGGCGCGTCACTACCAGCAGATCGCCCGTGAAGAGAAAGAGGCAGAACTGGCAGACGACATGGAAAAAGGCCTGCCCCAGCACCTGTTTGAATCGCTATGCATCGATCATTTGCAACGCCACGGGGCCAGCAAAAAAGCCATTACCCGTGCGTTTGATGACGATGTTGAGTTTCAGGAGCGCATGGCAGAACACATCCGGTACATGGTTGAAACCATTGCTCACCACCAGGTTGATATTGATTCAGAGGTATAAAACGGATGAGTACAGCACTCGCAACGCTGGCTGGGAAGCTGGCTGAACGTGTCGGCATGGATTCTGTCGACCCACAGGAACTGATCACCACTCTTCGCCAGACGGCATTTAAAGGTGATGCCAGCGATGCGCAGTTCATCGCATTGCTGATCGTCGCCAACCAGTACGGCCTTAATCCGTGGACGAAAGAAATTTACGCCTTCCCTGATAAGCAGAACGGCATTGTTCCGGTGGTGGGCGTTGATGGCTGGTCCCGCATCATCAATGAAAACCAGCAGTTTGATGGCATGGACTTTGAGCAGGACAATGAATCCTGTACATGCCGGATTTACCGCAAGGACCGTAATCATCCGATCTGCGTTACCGAATGGATGGATGAATGCCGCCGCGAACCATTCAAAACCCGCGAAGGCAGAGAAATCACGGGGCCGTGGCAGTCGCATCCCAAACGGATGTTACGGCATAAAGCCATGATTCAGTGTGCCCGTCTGGCCTTCGGATTTGCTGGTATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACTGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATCGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTTAAACAGAAAGCCACTGAGCAGAAGGTGGCAGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGGGATGATGCATGGCACAAATTACGGCTCGGCGTCATCACCGCTTCAGAAGTTCACAACGTGATAGCAAAGCCCCGCTCAGGAAAGAAGTGGCCTGACATGAAAATGTCCTACTTCCACACCCTGCTGGCTGAGGTTTGCACCGGTGTGGCTCCGGAAGTTAATGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAACGACGCCAGAACCCTGTTTGAATTCACTTCCAGCGTGAATATTACTGAATCCCCGATCATCTATCGCGACGAAAATATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAACTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAATTCCGGCTCGGTGGTTTCGAGGCAATAAAATCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACGCGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGCATGAAGCGTGAAGGCCTGCATTATGTCGTGGTTGAGCGGGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGTGCGATCACTTTCGTCTACTCCGTTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCCCTGAAAGCACAGCGGCTGGCTGAGGAGATAAATAATAAACGGGGAGCTGTATGCACAAAGCATCTCCCGTTGAGTTAAGAACGAGTATCGAGATGGCACATAGCCTCGCTCAAATTGGAGTCAGGTTTGTGCCAATACCAGTAGAAACAGACGAAGAATTTCATACGTTAGCCGCATTCCTTTCACAAAAGCTGGAAATGATGGTGGCGAAAGCAGAAGCAGATGAGAGAGACCAGGTATGACAACCACTGAATGCATTTTTCTGGCAGCGGGCTTCATATTCTGTGTGCTTATGCTTGCCGACATGGGGCTTGTTCAATGACACCTCAGCAAGAAAACGCCCTTCGCAGCATTGCCCGTCAGGCTAATTCTGAAATCAAAAAAGCCAGACAGCAGTTTCCGGATAAAAACGTCGATGACATTTGCCGTAGCGTACTAAAGAAGCACCGCGAAACGGTAACGCTGATGGGATTCACACCGACTCATTTAAGCCTGGCGATCGGCATGTTGAACGGCGTCTTTAAGGAACGGTGAACATGAAAAGCAAAATCATCAGGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATAATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGATAGATGAACGAAGACGACTGGCCGTTCAGGGTTGTCGGACTTGTGCCAGTTGCCAGCAAGATCTGGAGCTTATCAGTAAACAGAGAGGTTCGAAGTGAGCGAAATTAACTAGAAGCCAAAGATAAAATCATCGCTGAGCAGGAGAAAATCGCTAACGGAGAAAAGACAGTAAGTCAGTATATGAAAACCGCATGATATCATCAGATAAAAATCGGTCGTAAAGCGAAATATTAATACCAGAACAAACGAGTCGAGGTAAATTATATTACCTCGATAAATTAACTAAAACTTGCCCGCTATATACTATATCATTCAGTATCATCACGCGCGGTCTGTGCATATGTCACTACCGCACCTAATATATTAATTTTCTTTTCAACATAGATAATATTATCGTACTCATAATTGCCATACGGATAGCAAATGCGAATATTCTCATGTAGATCGGGGTCATCCACCTCAGCTCCAGAACAACTTTTTGAACTACCGGAAGTATACCGATACGGTGCAACATAAGACGATGTCTCTCCAGGCAAAAAATAAGTTAGTGTCGTAAGGGGTATAATCAGAAAAAATCCAGCAAATATGCACATCCCTGCATAAACCTTAAGGTATGCTGACAGACTCTTCCAGCCGCTTTGTTTTACTATCCCCTTCTTAACCCAAAACAGAGATAACAGAAAAGCTATTCCCATGCTAAACAGAATGTAATAGTGGGATATACTCTGATTAAGAAACGTGACCCTGTAGATATCTGCCCGCCACCAGAAGAAAAGGAAAATAAAGATCAGCCCTGAAACTGTCATGCAAATCAAATAAGGATACGAATCTTTTTTCATGTTTAGCGCCCATAAAATTTTTCCTGACCCGGACAAATTTACCATCCATTTTTTGCGCAGAAAATAGCTCATTACTTACTGCACAATAATACACAAAATTGCGTAAATTTTTTGCATGGATTTTAGCTCTTTCAGCCGACATTTAAGGGGTAAATAGCATTTCCTAAAAGCAACTGCACCAACCCAACAGAATGGGCTACCGCTTACGTTGAGAGCAAAAAAGTGTATAGCAGCAATGAACAGCATCCTCGCACTGACGAGGATTTCTTTTATCTGAACTCGCTACGGCGGGTTTTGTTTTATGGAGATGATAAATGCACTTCCGAGTCACAGGTGAATGGAATGGAGAACCATTCAACAGAGTTATCGAAGCCGAGAACATCAGCGACTGCTATGACCACTGGATGCTGTGGGCGCAGATAGCACATGCAGACGTAACCAATATTCGAATTGAAGAACTGAAAGAACACCAAGCCGCCTGATGGCGGTTTTTTCTTGCGTGTAATTGCGGAGACTTTGCGATGTACTTGACACTTCAGGAGTGGAACGCTCGCCAGCGACGCCCAAGAAGCCTTGAAACAGTTCGTCGATGGGTGCGCGAATGCAGGATATTCCCTCCTCCGGTTAAGGATGGAAGAGAATATCTGTTCCACGAATCAGCGGTAAAGGTTGACTTAAATCGACCAGTAACAGGTAGCCTTTTGAAGAGGATCAGAAATGGGAAGAAGGCGAAGTCATGAGCGCCGGGATTTACCCCCTAACCTTTATATAAGAAACAATGGATATTACTGCTACAGGGACCCAAGGACGGGTAAAGAGTTTGGATTAGGCCGAGACAGGCGAATCGCAATCACTGAAGCTATACAGGCCAACATTGAGTTATTTTCAGGACACAAACACAAGCCTCTGACAGCGAGAATCAACAGTGATAATTCCGTTACGTTACATTCATGGCTTGATCGCTACGAAAAAATCCTGGCCAGCAGAGGAATCAAGCAGAAGACACTCATAAATTACATGAGCAAAATTAAAGCAATAAGGAGGGGTCTGCCTGATGCTCCACTTGAAGACATCACCACAAAAGAAATTGCGGCAATGCTCAATGGATACATAGACGAGGGCAAGGCGGCGTCAGCCAAGTTAATCAGATCAACACTGAGCGATGCATTCCGAGAGGCAATAGCTGAAGGCCATATAACAACAAACCCTGTCGCTGCCACTCGCGCAGCAAAATCAGAGGTAAGGAGATCAAGACTTACGGCTGACGAATACCTGAAAATTTATCAAGCAGCAGAATCATCACCATGTTGGCTCAGACTTGCAATGGAACTGGCTGTTGTTACCGGGCAACGAGTTGGTGATTTATGCGAAATGAAGTGGTCTGATATCGTAGATGGATATCTTTATGTCGAGCAAAGCAAAACAGGCGTAAAAATTGCCATCCCAACAGTATTGCATGTTGATGCTCTCGGAATATCAATGAAGGAAACACTTGATAAATGCAAAGAGATTCTTGGCGGAGAAACCATAATTGCATCTACTCGTCGCGAACCGCTTTCATCCGGCACAGTATCAAGGTATTTTATGCGCGCACGAAAAGCATCAGGTCTTTCCTTCGAAGGGGATCCGCCTACCTTTCACGAGTTGCGCAGTTTGTCTGCAAGACTCTATGAGAAGCAGATAAGCGATAAGTTTGCTCAACATCTTCTCGGGCATAAGTCGGACACCATGGCATCACAGTATCGTGATGACAGAGGCAGGGAGTGGGACAAAATTGAAATCAAATAA
Protein sequences of DBSCAN-SWA_8 >CP029122|3374112:3412836|3399818_3400109_-|AWF24597.1|DBSCAN-SWA MADLRKAARGRECQVRTPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDRRTHFVDAAYAKECALEGMARTQVIWMKEGVIKA >CP029122|3374112:3412836|3392440_3393001_-|AWF27040.1|terminase|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIKRRRSLSTRMRLTQQLI >CP029122|3374112:3412836|3409771_3409987_+|AWF24944.1|DBSCAN-SWA MTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >CP029122|3374112:3412836|3410517_3410667_-|AWF26951.1|DBSCAN-SWA MDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >CP029122|3374112:3412836|3403294_3403471_-|AWF27733.1|DBSCAN-SWA MTGSEMAKAGLLEQNRLSGANRNTLIAGGIMANTAEIFNFPVPDAAQKEPRVADLDDG >CP029122|3374112:3412836|3377804_3378794_-|AWF27890.1|DBSCAN-SWA MASQLTDAFARKFYYLRLSITDVCNFRCTYCLPDGYKPSGVTNKGFLTVDEIRRVTRAFASLGTEKVRLTGGEPSLRRDFTDIIAAVRENDAIRQIAVTTNGYRLERDVANWRDAGLTGINVSVDSLDARQFHAITGQDKFNQVMAGIDAAFEAGFEKVKVNTVLMRDVNHHQLDTFLNWIQHRPIQLRFIELMETGEGSELFRKHHISGQVLRDELLRRGWIHQLRQRSDGPAQVFCHPDYAGEIGLIMPYEKDFCATCNRLRVSSIGKLHLCLFGEGGVNLRDLLEDDTQQQALEARISAALREKKQTHFLHQNNTGITQNLSYIGG >CP029122|3374112:3412836|3405223_3405973_+|AWF26251.1|DBSCAN-SWA MECVASSNSAIPNVVEVIRRINEGSTQPFLCKCDDGQLYVLKSKPSMPPKNLLAEFISACLANDIGLPLPDFKIVFVPEELIEYSPDLQQQICTGYAFASLFIDGAIALTFTQSRNETIIPVEQQKLIYVFDKWILNADRTLTDKGGNVNILYDISNDKYYLIDHNLSFDQNAGPEDFSVHVYGPGNRKWQYDLVDRVEYRQRVVNSLHKLPAILDEIPEEWIVDEEFLPFVCTTLDKGDCDEFWSAIE >CP029122|3374112:3412836|3407305_3407512_+|AWF25981.1|DBSCAN-SWA MVHQHYGTQTVNRGAVMPGMLVKHKDGTWTASANLRGRLYLHRGIERTYTRDLLVEVFLDGRGNGLNH >CP029122|3374112:3412836|3383560_3384316_-|AWF24211.1|DBSCAN-SWA MATVNKQAIAAAFGRAAAHYEQHADLQRQSADALLAMLPQRKYTHVLDAGCGPGWMSRHWRERHAQVTALDLSPPMLVQARQKDAADHYLAGDIESLPLATATFDLAWSNLAVQWCGNLSTALRELYRVVRPKGVVAFTTLVQGSLPELHQAWQAVDERPHANRFLPPDEIEQSLNGVHYQHHIQPITLWFDDALSAMRSLKGIGATHLHEGRDPRILTRSQLQRLQLAWPQQQGRYPLTYHLFLGVIARE >CP029122|3374112:3412836|3402701_3403193_-|AWF27948.1|DBSCAN-SWA MLAFRENGITTMEQVNAGMRVARRQNRPFLPSPGQFVAWCREEASVIAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKSNAHYWLVTNLYQNMRANALTDAELRRKAADELVHMTARINRGEAIPEPVKQLPVMGGRPLNRAQALAKIAEIKAKFGLKGASV >CP029122|3374112:3412836|3411765_3412836_+|AWF27425.1|integrase|DBSCAN-SWA MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP029122|3374112:3412836|3382890_3383568_-|AWF26509.1|DBSCAN-SWA MSKRYFVTGTDTEVGKTVASCALLQAAKAVGYRTAGYKPVASGSEKTPEGLRNSDALALQHNSSLQLDYATVNPYTFAEPTSPHIISAQEGRPIESLVMSAGLRALEQQADWVLVEGAGGWFTPLSDTFTFADWVTQEQLPVILVVGVKLGCINHAMLTAQAIQHAGLTLAGWVANDVTPPGKRHAEYMTTLTRMIPAPLLGEIPWLAENPENAATGKYINLALL >CP029122|3374112:3412836|3374838_3374952_-|AWF24179.1|DBSCAN-SWA MATAGLAWGAKYLEITATKYDSPTMYVAIGNAANLLI >CP029122|3374112:3412836|3407587_3407884_+|AWF27140.1|DBSCAN-SWA MNAYYIQDRLEAQSWARHYQQIAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >CP029122|3374112:3412836|3393140_3393284_-|AWF27217.1|DBSCAN-SWA MTWFDGVDARCDMQMIIIIILRVLSGDPTGYGAATSRVFAIYENFPV >CP029122|3374112:3412836|3401366_3401870_-|AWF27977.1|DBSCAN-SWA MPPLNFSTPNVGVSQKKIDAEIKSLTVSLARPDEQKGDITAGMEAITPIWRESLQEALDRMTIFRDSSPNTVSLNVKVLALDVPAFGVSMTTKAIARYEIINRANGDIIYTQDIESTGTVPASYAFYGIVRARESVNRAVQNNITQFLQALESVDLSRPMFPVRVAK >CP029122|3374112:3412836|3374322_3374616_-|AWF25142.1|transposase|DBSCAN-SWA MSRQCTHYGRWPQHGFTSLKKIRPQSVTSRIQPGSDVIVCAEMDEQWGYVGAKSRQRWLFYAYDRIRRTVVAHVFGERTLATLERLLSLLSAFEVVV >CP029122|3374112:3412836|3375249_3375894_-|AWF25631.1|DBSCAN-SWA MAQVYGWMTVGLLLTAFVAWYAANSAAVMELLFTNRVFLIGLIIAQLALVIVLSAMIQKLSAGVTTMLFMLYSALTGLTLSSIFIVYTAASIASTFVVTAGMFGAMSLYGYTTKRDLSGFGNMLFMALIGIVLASLVNFWLKSEALMWAVTYIGVIVFVGLTAYDTQKLKNMGEQIDTRDTSNLRKYSILGALTLYLDFINLFLMLLRIFGNRR >CP029122|3374112:3412836|3379190_3380099_+|AWF24094.1|DBSCAN-SWA MRNRTLADLDRVVALGGGHGLGRVLSSLSSLGSRLTGIVTTTDNGGSTGRIRRSEGGIAWGDMRNCLNQLITEPSVASAMFEYRFGGNGELSGHNLGNLMLKALDHLSVRPLEAINLIRNLLKVDTHLIPMSEHPVDLMAIDDQGHEVYGEVNIDQLTTPIQELLLTPNVPATREAVHAINEADLIIIGPGSFYTSLMPILLLKEIAQALRRTPAPMVYIGNLGRELSLPAANLKLESKLAIMEQYVGKKVIDAVIVGPKVDVSAVKERIVIQEVLEASDIPYRHDRQLLHNALEKALQALG >CP029122|3374112:3412836|3400271_3400727_-|AWF24219.1|DBSCAN-SWA MNLPQDGIKLHRGNFTAIGQQIQPYLEDGKCFRMVLKPWREKRSLSQNALSHMWYSEISEYLISRGKSFATAAWVKEALKHTYLGYETKELVDVVTGEITTIQSLRHTSDLDAGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQEA >CP029122|3374112:3412836|3407889_3408675_+|AWF24834.1|DBSCAN-SWA MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKATEQKVAA >CP029122|3374112:3412836|3404076_3404307_-|AWF28207.1|DBSCAN-SWA MNPAIKTAINIVGSQKKLGAACEVSQQAVYKWLHNKAKVSPEHVGSIVTATGGVVKAYQIRPDLPKLFPHTEKNAA >CP029122|3374112:3412836|3408671_3409352_+|AWF24920.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSSVNITESPIIYRDENMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP029122|3374112:3412836|3389150_3390482_+|AWF26614.1|DBSCAN-SWA MIINQVPIKIKIFIFLFSCISIIFLLLHANNGIYITQTTQISYSVFIIGLFFINLMIFIFLLLYYVSNQRQSYLLILSFAFLSNTYYLLEVAIISLSPLGNDLSTIYQKSNDIAIYYLFRQFSFISIIFLAVYSTNVKNKSVLEDKRNIIIVVLSILILFITPFVAKNLSSDNIKYSLNIIQYSLNRHLPTWNIVYTKIISVFWLVLLISSCISIRNYSKIWLCIILISIVSVCNNLILLYFIDKSHPAWYMTKFLELISMIYIISTLMYYVFRKLNHANHMAIHDPLTNTYNRRYFIDSLKNISKHHDFSVIMLDIDSFKSINDKWGHHMGDQVIVMVTRIIKKSIRKEDVLGRLGGEEFGIIIKGNTQKLLLSIAERIRKNIEEQCSEKLLSHGPEKITVSIGCFTSKENNLSPSEMLVNADKALYQAKRTGKNKVIIHSK >CP029122|3374112:3412836|3402414_3402570_+|AWF25380.1|DBSCAN-SWA MKIAAIKCQHPLHRGTLATLLENQIRAHLSFPFFPGSEPVIHRSPDLTINN >CP029122|3374112:3412836|3387928_3388405_+|AWF24855.1|DBSCAN-SWA MKLISNDLRDGDKLPHRHVFNGMGYDGDNISPHLAWDDVPAGTKSFVVTCYDPDAPTGSGWWHWVVVNLPADTRVLPQGFGSGLVAMPDGVLQTRTDFGKTGYDGAAPPKGETHRYIFTVHALDIERIDVDEGASGAMVGFNVHFHSLASASITAMFS >CP029122|3374112:3412836|3399322_3399463_-|AWF25630.1|DBSCAN-SWA MMFEFYMAELLRHRWGHLRLYRFPGSVLTDYRILKNYAKTLTGAGV >CP029122|3374112:3412836|3398859_3399237_-|AWF28177.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVGGMTFMALARKHGRSDCWVGRMLQKAEGVVEGMLMVLDLRLEMDADCSK >CP029122|3374112:3412836|3394581_3395049_-|AWF27498.1|lysis|DBSCAN-SWA MSRVTAIISALVICIIVCLSWAVNHYRDNAIAYKEQRDNKASELEKANATITDMQQRQRDADALDDKYTKELADAKAENDALRRKLDNGGRVLVKGKCPVPSSAETSSASGMGNDATVELSPVAGRNVLGIRDGIIRDQTALRTLQEYIRTQCLR >CP029122|3374112:3412836|3385453_3386494_-|AWF25704.1|DBSCAN-SWA MAHRPRWTLSQVTELFEKPLLDLLFEAQQVHRQHFDPRQVQVSTLLSIKTGACPEDCKYCPQSSRYKTGLEAERLMEVEQVLESARKAKAAGSTRFCMGAAWKNPHERDMPYLEQMVQGVKAMGLEACMTLGTLSESQAQRLANAGLDYYNHNLDTSPEFYGNIITTRTYQERLDTLEKVRDAGIKVCSGGIVGLGETVKDRAGLLLQLANLPTPPESVPINMLVKVKGTPLADNDDVDAFDFIRTIAVARIMMPTSYVRLSAGREQMNEQTQAMCFMAGANSIFYGCKLLTTPNPEEDKDLQLFRKLGLNPQQTAVLAGDKEQQQRLEQALMTPDTDEYYNAAAL >CP029122|3374112:3412836|3376544_3376790_-|AWF25189.1|DBSCAN-SWA MIKVLFFAQVRELVGTDATEVAADFPTVEALRQHMAAQSDRWALALEDGKLLAAVNQTLVSFDHPLTDGDEVAFFPPVTGG >CP029122|3374112:3412836|3398260_3398704_+|AWF27611.1|DBSCAN-SWA MQPITNFDANRYLGKWYEIARLENRFERGLEQVSATYGKRNDGGIRVLNRGYDPTKNKWSESEGKAYFTGDTKTAALKVSFFGPFYGGYNVIKLDDEYKYALVSGPNREYLWILARTPTIPDKVKADYVRTAQKLGFNVNELLWVKQ >CP029122|3374112:3412836|3395945_3396116_-|AWF26513.1|DBSCAN-SWA MVMAERLLSYNLYSVGKVAEICGYENTSYFVSVFRRYFGVPPHQYSSRFFLEKDMM >CP029122|3374112:3412836|3411362_3411530_+|AWF27947.1|DBSCAN-SWA MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >CP029122|3374112:3412836|3391376_3391550_+|AWF26545.1|DBSCAN-SWA MHNSFGQKLMRIYNQKGIFSNTKDSEEGLTHILSEHFENVKTKVQGTVVMFSASGKK >CP029122|3374112:3412836|3401018_3401132_-|AWF24145.1|DBSCAN-SWA MVPLVVQSLVVLPGQQSNLQQQNKQAWNMLSKLRMGI >CP029122|3374112:3412836|3376782_3377268_-|AWF27194.1|DBSCAN-SWA MSQLTHINAAGEAHMVDVSAKAETVREARAEAFVTMRSETLAMIIDGRHHKGDVFATARIAGIQAAKRTWDLIPLCHPLMLSKVEVNLQAEPEHNRVRIETLCRLTGKTGVEMEALTAASVAALTIYDMCKAVQKDMVIGPVRLLAKSGGKSGDFKVEADD >CP029122|3374112:3412836|3404345_3405101_+|AWF26396.1|DBSCAN-SWA MVVFSQQPFSFDGIKPWLYIWTMKTTLSERLKEARLARGLTQKALGDLVGVSQAAIQKIETGKANQTTKIVEIANALGVRAEWLSSGVGNMSDSTVQPIQSTVSHSKYFKIDVLDIEVSAGPGVINREFVEVLRSVEYSFDDARHMFDGRKAENIRIINVRGDSMSGTIEPGDLLFVDITVKSFDGDGIYAFLYDDTAHVKRLQMMKDKLLVISDNKSYSPWDPIEKDEMNRVFIFGKVIGSMPQTYRKHG >CP029122|3374112:3412836|3384302_3385457_-|AWF27772.1|DBSCAN-SWA MSWQEKINAALDARRAADALRRRYPVAQGAGRWLVADDRQYLNFSSNDYLGLSHHPQIIRAWQQGAEQFGIGSGGSGHVSGYSVVHQALEEELAEWLGYSRALLFISGFAANQAVIAAMMAKEDRIAADRLSHASLLEAASLSPSQLRRFAHNDVTHLARLLASPCPGQQMVVTEGVFSMDGDSAPLAEIQQVTQQHNGWLMVDDAHGTGVIGEQGRGSCWLQKVKPELLVVTFGKGFGVSGAAVLCSSTVADYLLQFARHLIYSTSMPPAQAQALRASLAVIRSDEGDARREKLAALITRFRAGVQDLPFTLADSCSAIQPLIVGDNSRALQLAEKLRQQGCWVTAIRPPTVPAGTARLRLTLTAAHEMQDIDRLLEVLHGNG >CP029122|3374112:3412836|3403467_3404007_-|AWF27239.1|DBSCAN-SWA MQPLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGYLVEQDSFMARLAEMEKELSEAKQAVILNAPRHQKLKEISEGIVSMFRVDPDLAGPLMAMVTTMLGAI >CP029122|3374112:3412836|3409581_3409695_+|AWF25179.1|DBSCAN-SWA MPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >CP029122|3374112:3412836|3396541_3396676_-|AWF26249.1|DBSCAN-SWA MPYICSIILVLNSFDVRIGKEDILFKKGSAVLIDYNLKDFFHQI >CP029122|3374112:3412836|3397027_3397987_-|AWF25927.1|DBSCAN-SWA MIKKPVIGISGCLAGSAVRFDGGHKRADFLMDKLVEWVTFRPVCPEMAIGLPVPRPALRLVRSMQGNIRMCFSHDQNEDVTERMTEFSRSYMDKLKDVSGFVVCAKSPSCGMERVRVYDENGNRGRKDGVGLFTSTLMEKFSWLPVEEDGRLHDPVLRENFVERVFALHELNHLYKEKLSRRELLAFHSRYKLQLLAHSQAGYKDMGPFVAAIHEWADLESYFEVYRDKLMAILRKPASRKNHTNVLMHIQGYFSNYLSTRQRKELSEVILNYRSGTLPLLAPLTLLKHYLGEYPNDYLLTQNYFDPYPDELALRLMVN >CP029122|3374112:3412836|3395542_3395758_-|AWF24901.1|lysis|DBSCAN-SWA MKSMDKLTTGIAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >CP029122|3374112:3412836|3391787_3392360_-|AWF25111.1|DBSCAN-SWA MWGSSTRQNVFEVGTSAAYLFYAQKTSAGQLFDVNGAINCTTLNQSSDRDLKDDILVISDATKAIRKMNGYTYTLKENGMPYAGVIAQEVMEAIPEAVGSFTHYGEELQGPTVDGNELREETRYLNVDYSAVTGLLVQVARETDDRVTALEEENTTLRENLATAGTRITTLENQVSELVALVGQLTGSEH >CP029122|3374112:3412836|3374112_3374322_-|AWF27986.1|transposase|DBSCAN-SWA MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKLLSFSKSVELHDKVIGHYLNIKHYQ >CP029122|3374112:3412836|3386637_3387870_+|AWF26460.1|DBSCAN-SWA MTSPLPVYPVVSAEGCELILSDGKRLVDGMSSWWAAIHGYNHPQLNAAMKSQIDAMSHVMFGGITHAPAIELCRKLVAMTPQPLECVFLADSGSVAVEVAMKMALQYWQAKGEARQRFLTFRNGYHGDTFGAMSVCDPDNSMHSLWKGYLPENLFAPAPQSRMDGEWDERDMVGFARLMAAHRHEIAAVIIEPIVQGAGGMRMYHPEWLKRIRKMCDREGILLIADEIATGFGRTGKLFACEYAEIAPDILCLGKALTGGTMTLSATLTTREVAETISNGEAGCFMHGPTFMGNPLACAAANASLAIIESGEWQQQVAAIEVQLREQLAPARDAEMVADVRVLGAIGVVETTRPVNMAALQKFFVEQGVWIRPFGKLIYLMPPYIILPQQLQRLTAAVNRAVQDETFFCQ >CP029122|3374112:3412836|3395045_3395543_-|AWF25803.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIIPGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSALLRKINQGDIKGACDQLRRWAYAGGKQWKGLMTRREIEREVCLWGQQ >CP029122|3374112:3412836|3393449_3393623_+|AWF24477.1|DBSCAN-SWA MIEQINIALDQKGSGNFSAWVIEACRRRLCSEKRVSPEANKEKSDITELLRKQIRPD >CP029122|3374112:3412836|3380290_3382312_-|AWF24419.1|DBSCAN-SWA MSKPFKLNSAFKPSGDQPEAIRRLEEGLEDGLAHQTLLGVTGSGKTFTIANVIADLQRPTMVLAPNKTLAAQLYGEMKEFFPENAVEYFVSYYDYYQPEAYVPSSDTFIEKDASVNEHIEQMRLSATKAMLERRDVVVVASVSAIYGLGDPDLYLKMMLHLTVGMIIDQRAILRRLAELQYARNDQAFQRGTFRVRGEVIDIFPAESDDIALRVELFDEEVERLSLFDPLTGQIVSTIPRFTIYPKTHYVTPRERIVQAMEEIKEELAARRKVLLENNKLLEEQRLTQRTQFDLEMMNELGYCSGIENYSRFLSGRGPGEPPPTLFDYLPADGLLVVDESHVTIPQIGGMYRGDRARKETLVEYGFRLPSALDNRPLKFEEFEALAPQTIYVSATPGNYELEKSGGDVVDQVVRPTGLLDPIIEVRPVATQVDDLLSEIRQRAAINERVLVTTLTKRMAEDLTEYLEEHGERVRYLHSDIDTVERMEIIRDLRLGEFDVLVGINLLREGLDMPEVSLVAILDADKEGFLRSERSLIQTIGRAARNVNGKAILYGDKITPSMAKAIGETERRREKQQKYNEEHGITPQGLNKKVVDILALGQNIAKTKAKGRGKSRPIVEPDNVPMDMSPKALQQKIHELEGLMMQHAQNLEFEEAAQIRDQLHQLRELFIAAS >CP029122|3374112:3412836|3399459_3399822_-|AWF25353.1|DBSCAN-SWA MNTYNITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE >CP029122|3374112:3412836|3377270_3377783_-|AWF28116.1|DBSCAN-SWA MSQVSTEFIPTRIAILTVSNRRGEEDDTSGHYLRDSAQEAGHHVVDKAIVKENRYAIRAQVSAWIASDDVQVVLITGGTGLTEGDQAPEALLPLFDREIEGFGEVFRMLSFEEIGTSTLQSRAVAGVANKTLIFAMPGSTKACRTAWENIIAPQLDARTRPCNFHPHLKK >CP029122|3374112:3412836|3405969_3406797_+|AWF25059.1|DBSCAN-SWA MTTPCLYSIVRYAPYAETEEFANIGVLLCAPKENYFDFQLTKRNDSRVKNFFHDDCIFPVAKDSIQRELQFAKMHATQIVGHQQLAQFFRYFTNKKESIFQFSSTRVILSENPKEELARIYNKYVNHSDYTKERREDVLARELKRSIDRIDGLKNVFKQATIDGYFAKFSMPLVAKKHDRIQCAIKPLAFTQAEPGKMMEHSDTWVMRITRAAEENLLSLDDILFTIETPESPNSGQSKVIDIIKRTMDAKKINHIPASNHKETIDFAKKILPQV >CP029122|3374112:3412836|3376090_3376543_-|AWF27254.1|DBSCAN-SWA MAETKIVVGPQPFSVGEEYPWLAERDEDGAVVTFTGKVRNHNLGDSVKALTLEHYPGMTEKALAEIVDEARNRWPLGRVTVIHRIGELWPGDEIVFVGVTSAHRSSAFEAGQFIMDYLKTRAPFWKREATPEGDRWVEARESDQQAAKRW |
54 | Enterobacteria_phage(48.28%) | lysis,transposase,integrase,terminase | attL 3388496:3388510|attR 3412910:3412924 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_9 |
4196055 : 4218730
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >CP029122|4196055:4218730|DBSCAN-SWA TCTAGGGTCGTTTTCGGCCCATGAACAAACTCACCAGGAACAGAATAATCCCGACGACAAAGACAATTTTAGCTGCGCCTGCAGCGGTACCGGCCAGACCACCAAACCCAAGTGCGGCGGCGATTAACGCGATAACCAGAAATATGATGCCCCAACGAAACATAAGCGTCTCCTTTACCATAGTTAATGTCACCGCTTGATATGAGCGTGAAAATCACTCAATGCGATATTTTTAGTGTGGTGCACATCGCGCCTCCCGACAAAGTTCGGGAGGACGAATTACGACGAATTACTTAGTTTTCAGATCATTTTTAACGCTTTTCACACCATCTACCGCTTTGGCGATACTTTCAGCACGGTCGCTCTGTGCCTGGGAATCGACGGTACCGGAGAGCTGAACCACGCCGTCGGTGGTTTCAACTTTCACATGACGGGAAGGGACGATATCGTCCGCCAGCAGTTTGGCTTTGATTTCACTGGTGGTGGCGGTGTCACCCGCGTAGCCCTTCACAGAGCCTTCTTTAGCGTCGCGAACGTGCAGTTTGTCGCTGACAGAGGTCACCCCTTCAACGCCTTTCGCCACTTTCACTGCCTCTTCGGCCTGGGCCTGGCTTTCAACGAAACCGCTCAGGGTCACGACTTTTTGATCGGTTTTTACAGAGATATCGGTGCTCTTGATGTTGTCATGATCCACCAGGGCCGCCTTCACTTTCGCGGTGATGGCGCTGTCATCCATGAAATTACCGACTTTATTCATAGAGCTATCGACTTTTTGCCCTGCGCTTTCATTGGTAGTCTGTGCGTTGTTTTCCGCGTAGGCAGAGCCGGTCGCGACGGCAGAGGTCAACATTACAGCCAGCAGAGTTTTCGAAATCTTCAGTCTTGTCATAGTCATCGATTTATTCCTGTATGTTTGCTCGTAATTTGAGCCTGGCAACACGAGGTTGCATTGCTGAATAGGGAGAGACTTCACCCTCTACAGAAGTCAATGGTCGCCATCACAAAAGCGATGGGTGATGAATAACGACCATTACAGCCTCTGAATCAGTTATTAATATCGGCAGAATGACAATCGACGGCTTTAGATACTGATATCTACGCATTGAACGGTATTTAACGCCGTCAGAAATGTCATTACTTTGTTAAATATAGATCACAATTTTGAAACCGCTCGGGATATCAGCGAAAACATAAGCAAAAGTGAATGTTTTAAGAACATTCCGTAAGCGGCTAATAAGGAAGGGAAATTGACAGGGCGAGGCGGTTGCCGCGCCCTGGAGGCAAGAATTAATGCTCGCGGGTCTGGTGGAACTGAACGTCCGGATAACGTTCCTGTGCCAGGCGCAGGTTAACCATGCTGGTAGCGATGTAAGCGAGATTATCGCCGCCATCAAGCGCCAGTTGGCTTTCGTTCTTACGCTTGAACTCTTCAAATTTCTTCGCGTCCGCACATTCTACCCACCGGGCGGTGGCAACGTTGACTGATTCATATACTGCTTCAACGTTGTATTCGCTCTTCAGGCGCGATACCACTACATCAAACTGCAGCACACCAACCGCACCAACGATCAGGTCGTTGTTGGAGATCGGACGGAACACCTGCACCGCGCCCTCTTCGGAAAGCTGTACCAGCCCTTTGAGCAGCTGTTTTTGCTTCAGCGGATCTTTCAGGCGGATACGACGGAACAGTTCTGGTGCGAAGTTCGGAATACCGGTGAACTTCATCATCTCACCCTGGGTAAAGGTGTCGCCGATCTGAATGGTGCCGTGGTTGTGCAGGCCGAGGATATCGCCAGGATACGCTTCTTCAACGTGCGAACGGTCACCCGCCATAAAGGTCAGCGCGTCAGAGATCACCACGTCTTTCGCGGTGCGCACCTGGCGCAGCTTCATGCCTTTTTCATATTTACCGGATACCACGCGCATAAACGCCACGCGGTCGCGGTGTTTCGGGTCCATGTTGGCCTGAATTTTAAATACGAAGCCGGTAAATTTGTCTTCGCTCGCTTGTACGGTACGGGTATCAGTCTGACGCGGCATCGGCGCAGGTGCCCACTCCACCAGGCCATCCAACATATGATCGACGCCGAAGTTACCCAGCGCAGTACCGAAGAATACCGGAGTGATTTCGCCCGCAAGGAACAGCTCTTTGTCGAACTCGTTAGACGCGCCTTTAACCAGTTCCAGTTCGTCACGCAGCTGCTGTGCCAGATCTTCACCAACCGCAGCATCGAGATCCGGGTTATTCAGCCCTTTAACAATGCGGACTTCCTGAATGGTGTGCCCTTTACCGCTCTGATAGAGATAGGTTTCATCTTTATAAAGGTGGTAAACGCCTTTAAACAGCTTGCCGCAGCCAATTGGCCAGGTGATCGGTGCGCAGCCAATTTTCAGCTCGTTCTCAACTTCATCGAGCAATTCCATCGGGTCGCGGATATCACGGTCAAGTTTGTTCATAAAGGTGAGGATCGGCGTGTCGCGCAGACGGGTAACTTCCATCAGCTTACGGGTACGATCTTCAACACCTTTTGCGGCGTCGATAACCATCAGGCAGCAGTCCACCGCCGTCAGGGTACGATAGGTATCTTCCGAGAAGTCTTCATGCCCCGGGGTGTCGAGCAGGTTAACCAGGCAATCGTGATACGGAAACTGCATCACAGACGTAGTAATGGAGATCCCACGCTGCTTTTCCATCTCCATCCAGTCCGACTTAGCGTGCTGGTTGGAACCACGGCCTTTTACTGTACCGGCGGTCTGAATGGCCTGTCCGAACAGCAGCACCTTCTCGGTGATGGTAGTCTTACCGGCGTCCGGGTGAGAAATAATGGCAAAGGTTCTTCTTTTCGCTACCTCTGCGGCCAAAAGGCTTGTCATAATTGCTGTCTCTTTGATTATGTACAATCATATGTACAAAGAAAAATTCATGCCGCTGATTGTACATTTTGGAGGTATCAGAGCATAGCAGAACATGATCAGTAGGTGTATTTGTAGCCAAAGAACATTTGTACATGCAGCAATCTTTGTTCTATTGACACTGTTGATTGGGCGGTGTACAACACAAACAAAAACAGGATGTTAGAGGTCTCAGCAGGACACCGACCAGACGGTGAAGTGACAAAAAGATACGCAAGGGAGCCGCGGCTCCCTACTGAAATATTATGACTTTAAGTGAATTTTTACTTCTTATTGCTATACTAGTTGACATAAAATTATACCGCCTTTCAATATAATCGCTCAAAGCAGTTGAAAATTTGTTCTTTAAGCCCCATATTCATTAAACACCAAATACTGTGGATAAAAATTTTCCAATAAGCTAGGATTTGTCCCAATTTATGGGTACCAGCTGGAAACGAACAAACAATGAACTCAAACGTTGCTAGTATTTATAGTGCTGCTAATGTTAACAGTAATGATTTAGCTTTAGAGTTGTACTGGAAAATCCAAGAAGTCTCTGCATGGTTTGTGAAACATGTAAACGCAAAATCGGTTGAGCAACTACGCGACTTTAACCCGTCATTCGCTGAAATTGCCGATCTTTCTGACGCCACTGCAGATATCATTACGAAGTTGCTTCAAGTCGGTGTTTGGGACGATGAAAGAGTTATGGCAAATGCCCGCCAAGCAGTGCTTTTAATGCGCCAAGTCGCAGAAGCGATCGAGCGTGGCGATAACGACAGTATTCAAGACGGAGCCAACCGCTTATCAGCAATGGCATTCGTTTAACTTAGTCAACTTAAAAAGTGAGTTTTACCTAGCTAGGTAAGTCAGGAGCCAATATGATGAATAAAATTGAAGCACGCCGCATTGCGCTGTTGCGAGAAGCCATCAAAAACGTTGATAAAATCAAAGAAATTCAAACGTTTATCGATCAAGAGCTAAAGGCTATGAATCGAAAAGCTGCATAAAGTAAAAAAACCCGGCAAAGCCGGGTTTTTTATTACCATCCTTTTACACTTTCGGCAGAACAGTTGACCACCATTCTGAAAGAACGTATTTCCCTTAGGGCGAGGATTAATGAGTTCCGCGCTTTTAGCATCTCCGGATACCTCACGTACGCCTTAACCCTCTGCCGCAGAGTCCGTATTCTGAGAGGCTCGCCAACGGTAAGCAAAGGCAAAAAGCGTCAGATCCAGCTCGAAGGTCTCGTGCATTCCATCCACGGACACGGAGGTACCCAGTAATCCCTTCCGAACCAGGGACTCCACACCAGTTTGCGGATCATCAAGCCGGTTCATTTGTCCCCATGTCATCACAGCACCGCCGAATGCTACAAAGTGCTGTACCACGTTTTTTTCACTGTCCGTAAGTGAATCATAAAGTGCGGCCAACTGCTGCCTTGAAGCAGCTTTTCCTTTCTGGGCAAGAAGACGCTGATGCCATCCTCCGGCAATGATCTTGACGATCGGGAATGGAATAATAAATGCGAACAGCGTCAGGACATAGTCGTGAAGGATCAGATAACAGCTCATTCCGGCAAGTCCCGCGACAAATATGGCGACATTAATGCCGAAATCCTTCTCCGTATAAACCCTGTCGAACAGTTTTCCAAGAAATTCAGTCATTGATTTACTCCCTACATGCTATGTCGCACCGCCCCAAATCAAAAGAGCGAATGGTTCAGTGTCGGCAGTATTCCCAATGATTAGCCAACATGCGCCCGGCATCATACCAGTTTTCAGTCAACACCAAGCCGGCAATAAGAGAGAACTCTTGATCCATCATGGCAATCAGATCTCCAAACAACACTGACAGAAATTCCTGGATGAAGTTGCTTTTCGATTTACCGGCTACGCTGACCTAAGTGGTGTTGCCGTATTTATCAGTTTATTAACAGCCCTCATAGCCCAGTACCTGTGAACATTGCTCAGTCCCTAGACAAACCGCTGTGTGGTGACGGTCTTCCGGCCATTCGGTTCCCACTGTATTGAAGCATGCCAGGCTATTTCAATATCGCTATGCCGTGGCATCATTTAACCCCTTGTAATTCATCGTCATAACTGTAGTAATGTTTTCCGCTTCAGGGAAAAAATAGCATCCAACCGCAGCACGTTCTTGCATACGACGTGCTGCGGCATAATCCCAATGATTACTCCCTGACAGGGTTCGTAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTTTTCCAGGCTTCCACCACCAGCACGACAAGATGCCGCATACAGTGACCCAGTCAGTCCAGTTTCCAGACAACCAGTGCGTCACCTTTTTGAAGGCGCTTTAAAGCACGTTTTAATCCAGGTCGGCCTGTCCTTATTCCGCTTAATTTATCTTCAAATATTTGTTCACATCCTGCACAAACAAGAGCGTTTCGTTGCAGGTCTGTATTCTGGTCATTTGTTGATACCCTTACATAGCCAATCAGCACGCTGAATCTCCCGTCCAAAAGCACAAATCATGCCATGCAGGCCAGAAACCGCCATTATCTAAAACCTCGGTTTACAGGAAACGGTAAACAGGGCCAGGAACGCCGTGCAAAAGAATGGCGATACCTTGTCCGGTGGGCTTACTTTTGAAAACGACTCAATCCTTGCCTGGATTAGAAATACTGACTGGGCAAAGATTGGTTTTAAAAATAATGCCGACAGCGACACTGATTCATACATGTGGTTTGAAACAGGCGACAACGGCAATGAATATTTCAAATGGAGAAGCCGCCAGAGCACCACAACAAAAGACCTGATGACTCTTAAATGGGATGCTTTGTCTGTCCTTGTTAAAGCCCTTTTCAGCAGTGAAGTAAAAATATCGACAGTCAATGCACTGAGGATATTTAATTCATCTTTTGGTGCTATTTTTCGTCGTTCTGAAGAATGCCTGCATATCATCCCTACACGAGAGAATGAGGGAGAAAATGGTGATATAGGGCCACTACGCCCCTTTACGCTTAATCTCAGAACTGGTCGGATAAGCATGGGGCATGGTCTTGATGTTACAGGGGATATATTTGCAAACCGTTTTGCAATTAACAGTAGTACCGGCATGTGGATTCATATGCGTGACCAGAATGTTATTTTGGGACGCAATGCGGTATCCACCGATGGTGCGCAGGCATTACTTCGTCAGGACCACGCTGATCGCAAATTTATGATTGGTGGACTGGGGAATAAGCAATTTGGCATCTACATGATTAATAACTCAAGGACAGCCAATGGCACCGATGGTCAGGCGTACATGGACAACAATGGCAACTGGCTTTGCGGTGCGCAAGTTATTCCCGGCAATTATGGTAATTTTGACTCACGTTATGTGAGAGATGTCCGACTTGGTACACGTGTTGTTCAGACTATGCAAAAAGGCGTGATGTATGAGAAATCAGGTCATGCAATTACGGGGCTTGGCATTATCGGTGCAGTTGATGGCGATGATCCGGCAGTATTCAGACCAATACAAAAATACATCAATGGCACATGGTATAACGTCGTACAGGTGTAATTTATGCAGCATTTAAAAAATATTAAGTCTGGAAATCCAAAAACAAAAGAACAATATCAGCTAACAAAGAATTTTGATGTTATCTGGTTATGGTCCGAAGACGGAAAAAACTGGTATGAGGAAGTGAAAAACTTTCAGCCAGACACAATAAAGATTGTTTACGATGCAAATAATATTATTGTCGCCATCACCAAAGATGCCTCCACGCTTAACCCTGAAGGTTATAGCGTCGTTGAGGTTCCAGATATTACAGCCAACCGCCGCGCTGATGATTCCGGTAAGTGGATGTTTAGGGACGGAGCTGTGGTTAAACGGATTTATACGGCAGACGAGCAACAACAACAGGCCGAATCACAAAAGGCCGCATTGCTTTCCGAAGCTGAATCAGTCATCCAACCGCTGGAACGCGCTGTCAGGCTGAATATGGCAACAGACGAGGAACGCACACGACTGGAAGCATGGGAACGCTACAGTGTTCTGGTCAGCCGTGTGGATACGGCAAATCCTGAATGGCCACAAAAACCAGAGTAAAAATTAAGGCCCGATAGCGGGCCTTCTCTCATTCTGGTTGTTCGGGAAACGTTACTGGCAGGCCGGAAGTGTCTGTAGATTCGACTTTCTGCGCATAGAGCATCCACTCGGTTAATTTTTGTTTATTCTCGTCGGAAATGATGCCCAGCCGTAGCTGTGAGTCCCATAGCTGGGTTTTATCCCTGACAAGTTGCAACAGGCTTTGCTTTTCATTTTCCGCTTGTTGCCTCTGCTCTTCCTCGGTATAAGTTCGCTTTATCACTACGCCATCTTTGAACATCCATTTCCCCGAAATATCAGCCCGGCGATTTGCTGTAATATCAGGTAATTCAACGACGCTTGCACCCTCTGGATTAATTGCTGAAACATCCTTTTCAATACAAATAATAACGCCGTTATGGTCATAGACCATTTTCAAAGTGTCTGGCTGGAAATTCTTTTGTTCCTCATACCAGTTTTTTCCATCATCTGAATAAAGCCATTTGATGTTAAATTGCTTTGTTAGCTGGTATTGCTCTTTTGTTTTAGGGTTGCCAGCAGTAATGTTTTTTAAGTGCATCATCGTTAAATACTCCCCGCGTTATACCACGTCCCATTAATGCAATACTGAATTGGCCTTGCCTGAGTTGTATCAATTAATTCATCACGGTTTCCGTTAACTGAACCCGTAACGACATAACCTGACCTGTCAGACCAGCCGGGGCCTTTCCATGTCTGAACAGATGACAGACCGCCAAGGCGAATACCTGTAATAAACCTTGAGTTACATTCTGCCTGCGTATATGCACCAACATCCCCCGCAGAGGGTTTGCGTGTTGTGGTGTAAAACTCTGACCAGTTAGCTTCAAAGCCATAACCATCACGCGCTGAACGATAAAAAATACCGCCGTTCCTGTAATTCACGCGGAACTGTACAGCAGGGCAACTCCCCGCATTCATATTGAAGTGGAGGATTAATGTCGATGCACCACTGATATCTGCATCATAAACACCGCTATTCCAGTTCCAGCCAACAGCTTTACCCGCCGTTATCCTTCGCCAGACCCGCCAGAACTAACTGAGTCAGTATTAACTGGCACCGGGCTTCGCTTACTCCGGTAGTTCTCGTCATCATGCGTGGCGTTACCCACTTGTCAGCAGGTAAGAAATGAAGGACTGCGGCGGCGGTTTCTGTCATATCTTGCTGTTTTAGCATGTCTTTTTCCCTTCTGGTTAACATGACATACCAATAACTCTTGTCTAAAAAGCCAGCAAGATAAAAAGTCAGTATTCACGACCACCAGCGTGTTTACTGTACTGCACCAAGTTTACAGGTACAAAAAACCCGCTCAGTGGCGGGTTGCTATCACAGCTATATATTTACTTATTATGCCGTTACTAACATTTATCTTCGACATATAATCGAAAACAAGGTTTACTTAAAACTCTGCTTTCATTTTATCCGGGAATTTTTTATTTGCAGCATAATAACTACCAAGTACATAAGCGTTCATTTGCTGCTCTACATCAACCCGACATGCCGCACTAGAACAAGCTCCACTGATAAGCCCAAAAGAACTCCCTTTAGCAGAGAGATCAGCTTTGATTTCCTCTACAGTGTTTTTCCCCATAGCAACTACACACCCTGTCACAATATATCTAGCCTTCACATCATCCATGCTAAGGATAGTAGTTTTCGCAATTTTGCTGTATCCATCATTTTTATAAACATCCATGGCAAACGCACGGCAATCTGTATAATACGGACTTGCTTTAACTTGCGAATACTCAGGTAATTTCATACCTGCACAACCAACTAAACAAAAACCTATCGCTGCTATTAATACCTTTTTCATTACAGTCATAACCTAGAAGCATCATTGAAACTAATTTATTAAATAATCATCGAGTTTCTGGAATACAGACGTTAACCATCTTTCCAAAATCTAAAAGATAATAAGAAAAAATGTTTAACGCACCAATCCATTTCATAGTTTCATGAGACATCTGGCACAAAAAAACCCGCTCAGTGGCGGGTTCTTAAATCTTATCAACGGTAGACATACAAAGCCCATCGTTGGGAAAATCTTATCCATATTTTTTGAAAAATGCAAGCATCATGTCGTCATCTTCGGCGAAAACCATTTATCTTGTCACATTTCTCAATTGTATCTCTGCATATGCTTCTTCCTGCCAGCACTTTGTAACCAGTTTATCAATGACATCTGCATATCCTTTGTACCACTGATAATCCGTCAGGTCTGGTACCAGCTTCTGGACATGATGCCGCGCCAGTGTGGTTGGTAAACGGCTAAACCGGTTTCCATTGCAACGCCCACAAATCTTATAAACAGGCGTGCCATGAAGCCGGGTCCTTTTTTCATCCAGGACAATACCTTTACCCTTACACCCTCTGCACGCTGTGCTGACTTCTCCCTTACCATGACAATGCTGACATAGTTCCTTCACCCACTCTTCCTTGATAACAGATTCCCCGCTTCTGGAGTGTTTCACCACTTCGCGCAATACATTATGAAATCCAGTACCAGCACAATGCTCACAGCGAGCCTTACTTGCCGCAGACCTGGAATAATCAGCAAAGGCAAAATTCACAAGGTAAGGGATGATCTGTAACCGGGTTTCTTCACTCAATTTGTTCAATGTCGGGTTATCCAGTGCCATCGCGTAATTGAGCAGACCTTCAATCGCAAATTGAGGATCCTGAACACCAACTTTTGCCAGGAATAAGGCAAACCCAAGCGGTGCTTTCGACTGCACCATCCCCTGCGCAGCCATCACATCCGTAATCGTTAAACCACCTGAGCCTGTCGCCGGTGCGTCATCGCTCAATTTTGGAGATTTTGGGGAGTAATATTTTGGTAAGGCTTCAAGGTTCATGCTCGTTCTCCACTTACGCCAATACGCCAATTGCCAGCGCACGATCGATAAAACGAAATATCAGCTCCAGCTGAGAGCCATACTTCTCTTCAAATGCCACGGTATCCGCATGCAGCTCGTCGTGATGCTTTCTGCACAAAGGCAACACAAAGAGGTCATGCGCTTTTGTTCCCATTCCACCCTGACCGTGACCTATCAGGTGGTGGGGATCATCAGCGGGCTTTCCACAACATGCACACGGCTGTGTCTTAACCCAGCGCGTGTACTTTTCATTAACCCAGCGGCGACGTTTTGGGCGTAACATAAAAGACTCCGGCGACTCCGGATCCACTTTCAGCGCCAGCACCTTTTTCGCTTTATCCTGGATGATGCTGGTGGCAGGAACCGAAGGCACAAGGTCACTTTCCCGGGTGACAGACGGCACAACAGGCTTTGGTAATCTCAGTGCCTTACGGGCTGCACTTTCCGGTAAGGCATCCGCCAGGTCATTACGAATCAGCCACCAGCACAGTTCCGGCATTGTCACAACGTGACTGTCATCAAAACCGAGATCCCGACGCACAACAGACAACACCCAGCGGGCACAGTTATCCGTTGCCATTGATTCCAGCCGTTCCGTGAACTGATCGCGCAGCTGGTTATCGCAGTGCCAGCACAGACGGATTGCGCCCGGCGCGTGTCGCATTGTGGTCATGTTCTCGCTGTGCCAGTCGGAATGAGGCCACTGGCAGCCTTTTTCACGAAGTAACCAGCTTTCAAGACATTCCACGCCACCAGCACGACGGATCACTGCCTCATTGCGGAACACGGCCCGAACGGCAGGATCATCCGCCAGCGGTTGTGATGCCGCCGGAACGGCACCACTGGCGAAAGATGAATAACGTTCCGGCTCAGGCTCCAGCAGGACACGCCCCTGCATAAACAGGGGCATCAGCTCTGAACCTGGCCTGAACAATACGATCCCCATACGCGGGGCAATTTCAGGGGTCAGTAGTGCTCTCACGGTCACCTCAATGAACGGTATCGAGCAGCTTTAACAGCTCAGGGAATCGGGATTCGAAGAAATGCGGCTGCGTCTCGCGCGGATTTGCGGGACTGGTGATGTTCTTGCCGAACATGCAGCCTTTCGCTGTCAGCGACCAGAATTTTTTGATGTTGTTATCCGGAGGAAACAACACGATCTCCACTGAAGCAGGTGCCGACGTTGGTTTCGGCAGACGACGTAACTGCTCAACTATTGCTGCGCACGCCGCGCTCTGGAATTTGCGCCCCGCCGTGCTTATCAGGCTTTTACCAGCAAACTCCCCTTTGTTAGGGTGTCGCCAGTACGTATTCACGCTGGGCGGGAAAGGCAGGATCAGTTTCATACTTTCAGGCCTCTCTCATGTAGCCAGTGGGTTGCACGCAGCCTTGCGTTTTCCTCACCGGCAAGCAGTGAGCGGATAATCCCGACAGCCTCGCTGTCGTCGTCCTTCACCGCGGTATGAAGCGTTATCCCCCGGGCCACGCCACGCTTTATCGTGATGACGCCTTTTTTCTCCAGTGCGCGAAGATGCTCCACCGCTGCATTCACCGAACGGTATCCCAGCATGGTTGCCACCTCCTGATTGGTTGGCGGGAAGCCACGTTCTTTCTGGTAAGAAATCAGCATATCCAGCACCTGCTGCTGGCATTGAGTTAACGTCGTCATTAAGCCCCCACGTAATTCCCTGACAGATACCACTCTTCACCTGATGCAGCCCGCTTACTGCTTTTCCGTAAACACCGTTCACGACGCGCCAGAAAATTGTTTCGTTCTGGCTGGGAGTAGCTTTCACGGAATGCCGCCATCCACACCGTTGCAGCACGACGGTATAAGCCCCTGGACTCCAGTTCTTCCGCCTGGCGGGTCAGGCACAAAATCACCCGGGGATCGTTAGTGCCGACATAGAAATTGCGCACAGGTCTGGTTTCACGAACTGGTTGTGGTTCCGGCTCCTGCGCTCTCTCAGTCAGGCGTGGGAAATGTCTGCGTGTATCTCCTTCACAACGGTGAGCCACACGCCCACTCTGACGTAACTTGCTTGCTGACTGCAGAACGCGCTGCCGTGAGTAACCTGCAAAAGCATCCGCAATGTCTCCGGAAGTACACCCCGGATGGGCTTCAATGAATTTCTGAACTTCATTCAAAAGACTCATGATCACCCCCTGAATCCTGCCGGGATCTGGCTGTAGTCCACGTTGTCGTAACTGGCTTTGAAGTACGGGTCCTCACGTCTAGCTGCAGATACCGCAGGAACTTCCCAGGATTCTTCGAAATGACGATCCGGACCAAAGAACGTGACAGCCTGTTTCACAAATTGTGTGCCGCTGTTACCCATCGCAGATACCCAGCCCGCGTAGCGTTTCACACCTTCCAGCATGGTTTCGGGGTTTACCCCCTCATTCAAACGGGCTTTCCAGGCTTTGAAGGCTGCAGATTTTGAATTGCCACCAGCACGTTTGGGATATGCCAGCCATGCCTGCTCAAACTCCGGAGAGTATTCCGGTCGGTTTGAACGAACTCGCACGGACTCATCAACTGATGCACCAACAGCTATTGGTTCATTGACTGGTTCTTTGACTGGTTCAAAAGAGTGACTGGTTCTGGGTGAATCTCCTGCACTACCCCCTGGTGCAACTCCTGCACTACCTAGTGAATTTGCTGCACCAGATAGTGAATTATTTGCACTACCCCCTAGTGAATCTCCTGCACCATCCAGATGAAGGAGATAGATATTACTTGAGTTACCTTTTTCACCTTTCCGGGTGACTTTTTTTACCAGCCCGGACTCACAAAGGGCCGCAATATGATTCATCACAGAACGTTTGCTAATCTCACACTGGTCAGCAATATGCTGGTAGCTGGGCCAGCACTCACCCTGATCGCTGGCATTATCAGCCAGCTTGATCAGAACCAGTTTTCGCAATGGATTACCCACTCGAATTTTCATCGCTTTAACCATCAGCTCCATACTCATGCTGCACCTCCGAGATGCTTCATGTTTTTTCCGGAGCGAAAGGCTATAAGCGGCATACTGACGCGGTAATTACGGCCCAGCGGTTCACAAATCACCTTCTGGCATTCACGGTCAACCAGGCTAACACGTAGAACATGCCCTGCAGGTGTGGTGTACCACTGCCCAACTGTAGGAATTGATGTTTTTTTACGCTGAAGCAAACGGCAAATATTGAGGATCAACGGATTAAGCATGACGATGCCCTCCGCTGATATTCAGGAGACGGTGAATATGAAAATTAGCCTTATCCGCCAGACGAATACGTTCAGCCTGCAAGTTAAGAAGGGTTTCTACCAAAACCTGATGCGCCTGCGGATCCGAAAGAGTTACCTTGCGCAGAGCACGTAGTGCAGTTGTTACATAACTGAGTTTATGTAAGTCTTCATCATTCAGACGAGAGAGGGCTGGGACAGTAGCCATGATGGCAGCCTCCGTATGCAATGGATAACTTCCACCACCGGAAACGCCAATTTCGCTGGTGGTGAACTGAGCAGGGTTGGCGTAACCGGCGCATACGGAAACCGGCGCACCTTTCGGTGCCCCCACCCAGCCCACCATAATTTGGGTATAGCTGAGTTGTAGCAACAAAAAAGACGCTAACGCGCCAATTGTCGCCGTATGCAATTCCAGGACGCCAATCCCGACACCCGCTTTATAAGGTGCCTGAACAGTGTAACGTCCCGGAATGGCAGAATCAATGTGCTGGTGGTCCTTCACACTCAACAAAATCACGCCTGAATTTCCACAAAGGACTAAAGCACTCATGCGGGTAGTCTTTGCGAAGATAGATAACGCGCTGTGTTTCTGGCTCCCAACGAATAACATGGACATAAAGCCCTCTTCCGTCACGAAACCAGCGGTTAAGTTCTTGCACAACTCGCCCCCCACAGTCAGGTAAAGTTCTCTGTGGTTACTTACAGCCAGGTGATTTGGTAATCTGCATTCATGCCGTAACAACAGGTGTTCAGCGACGCTGACCACCAGCTGTTGCGACAAACGGTTATTTGCCGTTAAACTGTTCATGCGTTAGTTTCTCCACAGACACAAAACGCCACGACGCCCGGAGCTGCACACTCGCGGGCGTCACTCTTTTCTGGAGCGCAAAAGATTTTGTAGACCAGTGCTGCATGCTCCTGGAGCTTCGAAATTGACAGATACAACTCATCATTAATTGCTGTCTGCTCGTGTGGCTCCACGACCCCATCTTCGATTGCCGAACGAATCTGCTTTGAGTAATTCCCGATCTGTTCGATGACTTCCAGCAGGCGCTGGTTTATATCGGCGTTCTCTACTTCCTCAATTTCAGGAAGCGATACGAACACCCCACCAGCAGACTGTGCGACAGCATCCGCAATGTAGTGAGTGCCAGCCGCGCGCTGTAAAACCATTGCCCATCCCAGCGGGAAAATCTGATCGCCATCGGCACGAAGGCGGTTAAATAATGCGTTCTCTGTTACATCCAGCCAGTCAGCTGCTTCAGCGTAACCACCCGGCAACGCTGCGATAGTTTTTCTGACAGCTTTCACGTACCACTCAGGCTGTTTTTCTACTTTCCAGTGATACTTACCCACGGTTAGCCTCATCGTTCTGTGGTTAAAAATTGAAGGTGTTCTGTTAATCTTTCGGATAGATATCCGGTCTTAAGTCAGATTTCGTAATTGCACCTGACGTGCATTGCTCAAGTTTTTTCGCCAGCACAAAACTGGCTTTTTTATAACCATTGAAAACCAGCCGTAAGTAGCCAGGTGTTGAGCCAACTTTTCCGGCCAACTCGCCCTGCTGTTCTTTGGTTAAAGAGTCCCAATACGCTTTCATACAATATGTACCTCCGATATACATATTACATGATTGAAATGAACCTTCAAGATACTTGTACCTTATCGGTACAAAGGTTTTAATTTCGTTATGAAAACAATCCATGACATCCGGCGGTCTAACGCCAGAAAACTGAGAGATGGTGTTGGCGGGAATTCTTCCTTTGCCACCATGATTGATCGCGAGCCAACCCAGACCAGCAGGTTTATGGGAGATGGTGCTACTAAAAATATCGGTGACAGCATGGCGCGGCACATCGAAAAATGTTTCGACCTGCCTGTCGGATGGCTTGATCAAGAACACCAGACCACGAACATCACAAAAAAACCTGATGTTTCAATCACTAACAAACAAATAACGTTAGTCCCTGTCATATCATGGGTACAGGCCGGAGCATGGAAAGAAGTTGGCTATTCTGAGGTTGATTTGAGCACAGCAGAAACTTACCCCTGCCCTGTACCCTGTGGCGAAATGACTTATATCTTGCGGGTGATTGGTGATTCAATGATTGATGAGTACCGCCCTGGAGACATGATTTTTGTTGATCCCGAAGTCCCTGCCTGCCACGGTGACGACGTTATTGCATTGATGCACGATACAGGCGAAACCACCTTCAAGCGATTGATAGAAGATGGAACACAGCGTTATCTCAAAGCATTAAACCCAAACTGGCCTGAGCCTTACATTAAGATTAACGGTAATTGCTCTATAATTGGTACAGTGATTTTCTCGGGAAAACCAAGAAGATACAAAATAAAGGCCTAATCAATATTTATAACCTGCTTCGGCAGGTTTTTTTATACTTGACAATGTACCCTTGAGATACATAATGTATCTAAAAGAAACATAACACAGGCAAGATTAAACTAAATTTGGTTGTAACACGGCGTATGGCACATGCGTCGTTAGCGGTCTGGGGACGTTAAAGGGGACAATCCACTCCTTGCTCGGGCAAACAAACCAGGTAGCCGGAATGTGCAAGTCAATGATGATGCTGATAAGACGCCTAACCAGCGTGGCGATCCGGTTTGACGCCTGGGAAGAGACCAGGGTGCAACGATGAGGGCATTTATGGAACCGCGACAAAGTGTGGTGCCGTAACTGGCTAAGTGCTCTCAGCGTTGTGGTAATCCGCGAAATGGCGCGGCGGTAAGTATGGCGGGGTTACTCTTTCCCCGTTGAGGACACCGGATTGTCAGGTTGACCATACGCCTGAGTGACAACCCCACCACAACAGCCACTGCTTTGGCGGTACCAGTTTGTACACTTGCTTCCGGCTGGTACCGCTCTTTTTACAAAACAGAGAAGGGCATCACCGGACGACGGGCTCATAACCCAATCCATCCGGGCGGCTGCCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATGAGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCTGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCGCTCAACCTGGAAATCGAACCGTTTGATGAGAACCGTGTAAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAAGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCGTTAATATCGTTTTTAATAAACTGATTATTTATCTCATCACTGAATATCTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACACAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTATTCTAAAGATCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCTGATAATATGCGAGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACTCTCAAACTGAAAAAGACAGCACCGTTCTCTGCCCTGTTGTCTGTTAACGGCGAGCGTAACTCCCAGAAGTCACTGGCAGAATGGATTGAAGACTGGGCCGACTACCTTGTGGGCTTTGATGCTAATGGTGACGCCATTCAGGCAACAAAAGCGGCTGCGGCGGTCCGTAAAATCACGATTGAAGCAAACCAGACCGCTGATTTTGAAGACAATGACTTCAGCGGCAAACGCTCTCTGATGGAGTCTGTCGAAGCGAAAACCAAAGACATTATGCCAGTAGCATTTGAGTTTAAATGCGTTCCGTTTGAAGGCCTGAAAGAACGTCCGTTTAAATTACGCCTCAGCATTATCACTGGTGATCGCCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCAGTGCAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAAAAATTCAAAGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCATTTATGGAAACGTAATTAACTCAATAATCGCCTGATGGCGAGGGTTTTCTTTAACCAAAATTCAGCGCGGTGCAGCGCATATAAAGTGGAGAACAAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAAGATAAATAAAGACGACATCGTTATTAACGATATCGCGGTTTCCCTTTCAAATATCTGCCGCTTTGCCGGTCATCTTTCTCACTTCTACAGCGTCGCCCAACATGCGGTGCTTTGCAGCCAGCTGGTGCCGCAGGAATTTGCTTTTGAAGCTTTAATGCATGATGCAACAGAAGCATATTGCCAGGACATCCCCGCACCACTGAAACGACTTCTTCCTGACTATAAACGGATGGAAGAAAAAATAGACGCCGTAATCCGTGAGAAATACGGGTTACCTCCTGTTATGAGCACGCCAGTGAAATATGCCGATCTCATTATGCTGGCAACCGAACGCCGCGATCTCGGGCTTGATGATGGCTCTTTCTGGCCTGTGCTGGAAGGTATCCCGGCAACAGAGATGTTCAACGTGATTCCACTGGCTCCAGGCCATGCCTACGGGATGTTTATGGAACGCTTTAACGAATTATCGGAGTTACGCAAATGCGCATGAATGTTTTCGAAATGGAAGGGTTTCTTCGTGGGAGATGTGTACCGCGAGATCTGAAAGTGAATGAAACGGATGCTGAATACCTAGTGCGTAAATTCGATGCGCTTGAAGCTAAATGTGCAGCACAGGAAAACAAAGTAATACCAGTGTCAACTGAACTGCCACCAGCAAATGAAAGTGTTTTGTTATTTGATGCTAATGGAGAAGGCTGGCTGATTGGCTGGCGTTCTCTCTGGTATACATGGGGGCAAAAAGAAACCGGAGAATGGCAGTGGACATTTCAGGTCGGGGACCTTGAAAACGTCAATATCACTCACTGGGCAGTAATGCCAAAAGCACCGGAGGCTGGAGCATAATGACCACATTTACCGATAAAGAACTGATTAAAGAAATCAAAGAACGAATCAGCAGCCTAGAGGTTCGAGACGATATTGAGCGCCGTGCTTATGAAATTGCTCTGGCATCGCTAGAAGAGGAGCCGGTGGCATGGCTGCATTCAGAAAATGGCTTAGGTATTCCGGCAATAACGAGGAGTAAAAACATTGCTGACAGTTGGTTATCAAAGGGCTGGTATGTTCAGCCGCTATATATAGCCAAGCCAGTGCCGGTGGTGCCAGATGCTCGTCCGTCTTTAAATAATGGCATAGTCGGTTTTGATGAAGGCTGGAACGCCTGCCGCGCCGCCATGCTTCATGGTGCCGAACCTGTAAGCCAGACTTACAAGTTGAACAAGCTGTCGGGCAACTCTCCAGTAACTCCGGATGGTTGGATAAGCTGTAGTGAGCGAATGCCGAACGATAAACAGTATGTTTGGTGTTGGGGGAAGCCTTACGGCTGGACTGAGTGCGATACCTTCGAAGGGTATTACGATTGGTCGAGAAACAAATGGTGGGCAGTTACTGACGATAGGGAAGAACCGGCATCGAAAGTAACCCACTGGATGCCGCTACCGGAGCCGCCGCAGGAGGTGAAGTAATGAACAACTTAATGACAACTAAACAAGTCGCCGACTTCTGTGGCGTTTCAGTATCGACTGTTCTTCGCTGGAACAGCGTAAACAGGAGAACTGGCCAGAAATACAGGCCTGACTTTCCAGATCCTGATATTAAATCCTGCCCAAATAAATGGGCATCACGCAAGATATACAGATTTGCTGGAGTTATTGAGTAACGTGTATTATCTCAGATGGGAGCTGACATATCTATGGCACAGACCAAACTAATCTGACAGTCAAGTCTGTGCCAAGAGCAAACGTTGCTAATTTAATGTTAAGTTGTTTCTCTTGAAGACGATCACGCTATGATACAACCTATAAAATTATCGATATCTGAGAGAACCAGTAAAAAGTTTAAGTCAAGGGTTCTATAAGTGTGTTAATTTATTGGGAAGCAATAAGCTTCCCTTCTCTTTATTTAGGTACGCTATCAACACTTTTGAGAATCGTTGATGTTAAAGGCTGCGACTTTAGTAAGTCTCGAGAAATATCCGGATCTGGTATAGCTATTTTTACTAAACTCTCAAGTTTTTCAAACCACTCCCCATCATTCATTGAGAATAAGTTATGAAGAGCTTCATTTTTATTTTTGAACCCCCAAGCCTTATAATTATTACCACAGTATTTTGACGCAGCAGTTAGAATATGCCAGCGTAATTTTGAATACTTCCCATCAAACCTCTTATTTGAGATAAGAGCCTTTAGCCTATACAAGCAATAGCAAGATATATAATAATCATCCTCGAGGGCATCGGAAGAGAAAACCTCATTAAGTAAGTCACCAGTTAACCTATTTGGATAACGACTGGAATAATCTGGTCTCATCATAACAATAGCAGCATATGCTCGCGCAACTTCTCTGATATCAAATATGCGTACTGGAGCTATACTTTCTGAACTATACTGCCCCTTTCTTCTCTCAAAATAAATTTTATTTCCTTCAATAGCACCTTTAGCATTAAAATAATGTTCTAATTCCCTTAATTTTTTTAGCGTTGAAATGAACTGTGCATCTTCAACTTTAGATTGTCTATTCGTAGCTCGTACAATGTCATCAAGGATCGCTGGCTCATCGGTTTCAATTAATTTAATCATTAAACTAACTGATTCATCTACTTGAATGTCTTTTGATATAAGAACATTAGACGTTTGACATCCATTAACAATTTGAAAATCTCGTATAAAAATTTCTTGCCCTGCAGGCCTCACGCTTGATGCAACTATTGTCACTCCATTATTCATTAAACCAAATCTTGCTTTTTTTCCATCAGTATCTAGTGTTCCTGCAATTTCTGAATTTACGTCACCATCAATTCCCAGAAAATCTCTAACATTTTCTTCGAATAATTTTTTTCGAGGGTTTCCATTTTTGTCTTTAAGTATAGAATCAATAAAACTACGAGCTTTGACTGTAGCTACATAGGCATTATTAATATTAGGGGCCGCTGGGAATGGAGCATAGCCTATTGTAGGAAGTTTAGCTTCTATTGGACCTTCCGCAGCAAGCCAAAGCTCATGAATCATGTCTTTGTGAGCCATAATGAAGGATGTTTCATGTGAAAACCCGAGTGACTTTAAGCTTTTTTCACCCGATGCAAATGCAGCTTTTATTTCTCTGGCCTCTGTATTTTGTGCAGCACTAAAAAAATATGCGTATAAGTCTGGAAGGCCATTTTTGACTCTGCCGATATTAGCAAATATCAAGTTGAACATTTTTTTAAAGTCCGCTAGATATTCACTGTGGGGTTGTTGTGGCGCTGGTGAAAGATAATCTCTTATTGACGCAATGTAAGAGTCTATTTCCTGTTTGCTCCACTTCTCCGATGACTTAGCTTGCGTGAATACTAATGAAACTTGAAACTCGCGACGTGAGTTTTGGAAAATCTCCTGAAGTTCTTCAGTTGAAAAAATGGCTCTGTCGTCTAAAAATAAAAATGCTCCATCAATACCAGGGTCCGGGCCTTCATACACAAGATCACTGACCTCAACTTTGTCGCCTGAGTATTTTGAAAAAGCACAATAGTTCACAAACGCCTCAAAATTTTTGGTCTCCTCATAGGGCGCAGCGAATGCTTTGCAAAAGGCATCGAAATAAGATTTCGTGACTAAATGCATAAAAACTCCTTTCTGAAATAAGATAAGTTTATTAGAAACAACAAGATAAAGATAAAGATAATAGTAATACAAGTTAAAGACATACTGCATCACATAAGTTGGTCAAGATCAATAAATCAATAAGGATCCCCTATGTTTTATGTATTGTTGATAATTCTATCACCTCTTACTTCTGGTCTAAGCGTAACCTACTCAAGAATGCCATTGAAAATGTCCACTGTTCGCTCAAAACAGACTGTCAGGGCTACGCAAGCTGTCAGGTAGATTCTGGGCCAGTACAAGTAACGATCGATTCAACTCTCTCCCACCATGCCTGATATGCTTTACGCTGTTCTTCTAGATAATCGCTCTTGTCATAAACCTGCCAAACCCCTGGCAGTTTATGGCCGAGCATTATTTCTGCAATATGAGGAGCAGTAAGATCAGAAAAGTTTGTTCGTGCTGTTCGCCTCAAATCATGAAGAGACCAATGAGGGAATTGATGCCCCAAACGCCTCCATGCGTACTGCATTAAATTGTAAGGCAGCGACTGCAATGATGTTCGACCAACGGGTTCCCTGCTTCCTTCCTTAGTAAAAAGCATATCGGAACCATTGTTCATAGAGATAGCGTTCTTTATAAGCTCCTCCACCGGTTCAATAATGGGCCTCTTTAGCGGTTCGCCTGTTATCTCTCCAGTCTTATGTCGTTCTGGTGGTACAGTCCATACCTTATTAATGAAATCAAAATCGCCCACCCTGGCAGTAATTAGCTCTGAACTACGGCAGCCAAAATGCAGCAATAGCTTAATGAAGGCCCGGTATTTAGGAACCATTCGAGAACCATCGATCGCAGCATAAAGGATTTTAATTTCATCATGTGTCAGAAACCGTTTCTTCTGACCTTTACGGATATCCATATCTTTACCCGTGATATCCGACAGCGGGCGAGTTTCAATGAGCTTTCTCTTATACGCCCAGACATGGGCCTGCTTTGCGTTAATTAGCAATCGGTCTGCTATTGCTGGAGTCTTAGTGCTAAGAGGCTCCAGGACTTCTAACCAATCATGCAATGTAGCTGCATCGTGAGGGATATTCCCGATTTTAGAGAACAGGTGCAGCTCAAACGAGCGGAGTATCTGTTCAGAACCTTTTTTATTTTTTACACAATATGCTTCATACCAGGCACGGATCACAGACTCTACCGTCATGGCTTCAGTAGCTTTTCGTTTTTCAGCCTGCTTGACCAATCGTGGATTACGGTTTGACTCAAGTTCACCACGGAGACGGATAACTTCTTCTCTGGCCTCTTTTAATCCAGTTGCCGGGTAAGTTCCGATATCAAGACGCTCACCTTTCCCTGCCCATTGATAACGATATTGGAACACTACGCGACCTTTCGGTGATACTCTGACAGACAGACCATCACGATCGGATTTAACCAAAACCTTATCACGTTCCTTTCCAACGACTGAACGCAACCACGCATCAGACAACGCCAT
Protein sequences of DBSCAN-SWA_9 >CP029122|4196055:4218730|4210594_4210795_-|AWF26792.1|DBSCAN-SWA MKAYWDSLTKEQQGELAGKVGSTPGYLRLVFNGYKKASFVLAKKLEQCTSGAITKSDLRPDIYPKD >CP029122|4196055:4218730|4203093_4203627_-|AWF26274.1|tail|DBSCAN-SWA MMHLKNITAGNPKTKEQYQLTKQFNIKWLYSDDGKNWYEEQKNFQPDTLKMVYDHNGVIICIEKDVSAINPEGASVVELPDITANRRADISGKWMFKDGVVIKRTYTEEEQRQQAENEKQSLLQLVRDKTQLWDSQLRLGIISDENKQKLTEWMLYAQKVESTDTSGLPVTFPEQPE >CP029122|4196055:4218730|4211266_4211560_+|AWF28233.1|DBSCAN-SWA MTYILRVIGDSMIDEYRPGDMIFVDPEVPACHGDDVIALMHDTGETTFKRLIEDGTQRYLKALNPNWPEPYIKINGNCSIIGTVIFSGKPRRYKIKA >CP029122|4196055:4218730|4200671_4200800_-|AWF27118.1|DBSCAN-SWA MLFGDLIAMMDQEFSLIAGLVLTENWYDAGRMLANHWEYCRH >CP029122|4196055:4218730|4202537_4203065_+|AWF25040.1|tail|DBSCAN-SWA MQHLKNIKSGNPKTKEQYQLTKNFDVIWLWSEDGKNWYEEVKNFQPDTIKIVYDANNIIVAITKDASTLNPEGYSVVEVPDITANRRADDSGKWMFRDGAVVKRIYTADEQQQQAESQKAALLSEAESVIQPLERAVRLNMATDEERTRLEAWERYSVLVSRVDTANPEWPQKPE >CP029122|4196055:4218730|4204946_4205060_-|AWF27789.1|DBSCAN-SWA MCQMSHETMKWIGALNIFSYYLLDFGKMVNVCIPETR >CP029122|4196055:4218730|4214496_4215117_+|AWF27125.1|DBSCAN-SWA MTTFTDKELIKEIKERISSLEVRDDIERRAYEIALASLEEEPVAWLHSENGLGIPAITRSKNIADSWLSKGWYVQPLYIAKPVPVVPDARPSLNNGIVGFDEGWNACRAAMLHGAEPVSQTYKLNKLSGNSPVTPDGWISCSERMPNDKQYVWCWGKPYGWTECDTFEGYYDWSRNKWWAVTDDREEPASKVTHWMPLPEPPQEVK >CP029122|4196055:4218730|4199480_4199777_+|AWF27381.1|DBSCAN-SWA MYWKIQEVSAWFVKHVNAKSVEQLRDFNPSFAEIADLSDATADIITKLLQVGVWDDERVMANARQAVLLMRQVAEAIERGDNDSIQDGANRLSAMAFV >CP029122|4196055:4218730|4217506_4218730_-|AWF27229.1|integrase|DBSCAN-SWA MALSDAWLRSVVGKERDKVLVKSDRDGLSVRVSPKGRVVFQYRYQWAGKGERLDIGTYPATGLKEAREEVIRLRGELESNRNPRLVKQAEKRKATEAMTVESVIRAWYEAYCVKNKKGSEQILRSFELHLFSKIGNIPHDAATLHDWLEVLEPLSTKTPAIADRLLINAKQAHVWAYKRKLIETRPLSDITGKDMDIRKGQKKRFLTHDEIKILYAAIDGSRMVPKYRAFIKLLLHFGCRSSELITARVGDFDFINKVWTVPPERHKTGEITGEPLKRPIIEPVEELIKNAISMNNGSDMLFTKEGSREPVGRTSLQSLPYNLMQYAWRRLGHQFPHWSLHDLRRTARTNFSDLTAPHIAEIMLGHKLPGVWQVYDKSDYLEEQRKAYQAWWERVESIVTCTGPEST >CP029122|4196055:4218730|4213607_4214144_+|AWF24364.1|DBSCAN-SWA MSFIKTFSGKHFYYDKINKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFNVIPLAPGHAYGMFMERFNELSELRKCA >CP029122|4196055:4218730|4203629_4203968_-|AWF25243.1|tail|DBSCAN-SWA MNYRNGGIFYRSARDGYGFEANWSEFYTTTRKPSAGDVGAYTQAECNSRFITGIRLGGLSSVQTWKGPGWSDRSGYVVTGSVNGNRDELIDTTQARPIQYCINGTWYNAGSI >CP029122|4196055:4218730|4204083_4204284_-|AWF25014.1|DBSCAN-SWA MLTRREKDMLKQQDMTETAAAVLHFLPADKWVTPRMMTRTTGVSEARCQLILTQLVLAGLAKDNGG >CP029122|4196055:4218730|4208937_4209174_-|AWF27490.1|DBSCAN-SWA MLNPLILNICRLLQRKKTSIPTVGQWYTTPAGHVLRVSLVDRECQKVICEPLGRNYRVSMPLIAFRSGKNMKHLGGAA >CP029122|4196055:4218730|4212654_4213479_+|AWF25765.1|DBSCAN-SWA MSQNLDTTAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAVRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >CP029122|4196055:4218730|4212226_4212589_+|AWF26178.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGR >CP029122|4196055:4218730|4201571_4202534_+|AWF28617.1|tail|DBSCAN-SWA MQKNGDTLSGGLTFENDSILAWIRNTDWAKIGFKNNADSDTDSYMWFETGDNGNEYFKWRSRQSTTTKDLMTLKWDALSVLVKALFSSEVKISTVNALRIFNSSFGAIFRRSEECLHIIPTRENEGENGDIGPLRPFTLNLRTGRISMGHGLDVTGDIFANRFAINSSTGMWIHMRDQNVILGRNAVSTDGAQALLRQDHADRKFMIGGLGNKQFGIYMINNSRTANGTDGQAYMDNNGNWLCGAQVIPGNYGNFDSRYVRDVRLGTRVVQTMQKGVMYEKSGHAITGLGIIGAVDGDDPAVFRPIQKYINGTWYNVVQV >CP029122|4196055:4218730|4207305_4207632_-|AWF28465.1|DBSCAN-SWA MTTLTQCQQQVLDMLISYQKERGFPPTNQEVATMLGYRSVNAAVEHLRALEKKGVITIKRGVARGITLHTAVKDDDSEAVGIIRSLLAGEENARLRATHWLHERGLKV >CP029122|4196055:4218730|4206952_4207309_-|AWF27212.1|DBSCAN-SWA MKLILPFPPSVNTYWRHPNKGEFAGKSLISTAGRKFQSAACAAIVEQLRRLPKPTSAPASVEIVLFPPDNNIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESRFPELLKLLDTVH >CP029122|4196055:4218730|4201140_4201260_-|AWF25778.1|tail|DBSCAN-SWA MRHLVVLVVEAWKKYRVLLNRVDTSTAPDIEWPTNPVRE >CP029122|4196055:4218730|4197341_4198916_-|AWF25017.1|DBSCAN-SWA MAAEVAKRRTFAIISHPDAGKTTITEKVLLFGQAIQTAGTVKGRGSNQHAKSDWMEMEKQRGISITTSVMQFPYHDCLVNLLDTPGHEDFSEDTYRTLTAVDCCLMVIDAAKGVEDRTRKLMEVTRLRDTPILTFMNKLDRDIRDPMELLDEVENELKIGCAPITWPIGCGKLFKGVYHLYKDETYLYQSGKGHTIQEVRIVKGLNNPDLDAAVGEDLAQQLRDELELVKGASNEFDKELFLAGEITPVFFGTALGNFGVDHMLDGLVEWAPAPMPRQTDTRTVQASEDKFTGFVFKIQANMDPKHRDRVAFMRVVSGKYEKGMKLRQVRTAKDVVISDALTFMAGDRSHVEEAYPGDILGLHNHGTIQIGDTFTQGEMMKFTGIPNFAPELFRRIRLKDPLKQKQLLKGLVQLSEEGAVQVFRPISNNDLIVGAVGVLQFDVVVSRLKSEYNVEAVYESVNVATARWVECADAKKFEEFKRKNESQLALDGGDNLAYIATSMVNLRLAQERYPDVQFHQTREH >CP029122|4196055:4218730|4196343_4196949_-|AWF27739.1|DBSCAN-SWA MTMTRLKISKTLLAVMLTSAVATGSAYAENNAQTTNESAGQKVDSSMNKVGNFMDDSAITAKVKAALVDHDNIKSTDISVKTDQKVVTLSGFVESQAQAEEAVKVAKGVEGVTSVSDKLHVRDAKEGSVKGYAGDTATTSEIKAKLLADDIVPSRHVKVETTDGVVQLSGTVDSQAQSDRAESIAKAVDGVKSVKNDLKTK >CP029122|4196055:4218730|4196055_4196217_-|AWF27278.1|DBSCAN-SWA MFRWGIIFLVIALIAAALGFGGLAGTAAGAAKIVFVVGIILFLVSLFMGRKRP >CP029122|4196055:4218730|4215549_4217250_-|AWF25129.1|DBSCAN-SWA MHLVTKSYFDAFCKAFAAPYEETKNFEAFVNYCAFSKYSGDKVEVSDLVYEGPDPGIDGAFLFLDDRAIFSTEELQEIFQNSRREFQVSLVFTQAKSSEKWSKQEIDSYIASIRDYLSPAPQQPHSEYLADFKKMFNLIFANIGRVKNGLPDLYAYFFSAAQNTEAREIKAAFASGEKSLKSLGFSHETSFIMAHKDMIHELWLAAEGPIEAKLPTIGYAPFPAAPNINNAYVATVKARSFIDSILKDKNGNPRKKLFEENVRDFLGIDGDVNSEIAGTLDTDGKKARFGLMNNGVTIVASSVRPAGQEIFIRDFQIVNGCQTSNVLISKDIQVDESVSLMIKLIETDEPAILDDIVRATNRQSKVEDAQFISTLKKLRELEHYFNAKGAIEGNKIYFERRKGQYSSESIAPVRIFDIREVARAYAAIVMMRPDYSSRYPNRLTGDLLNEVFSSDALEDDYYISCYCLYRLKALISNKRFDGKYSKLRWHILTAASKYCGNNYKAWGFKNKNEALHNLFSMNDGEWFEKLESLVKIAIPDPDISRDLLKSQPLTSTILKSVDSVPK >CP029122|4196055:4218730|4205955_4206945_-|AWF24876.1|DBSCAN-SWA MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPERYSSFASGAVPAASQPLADDPAVRAVFRNEAVIRRAGGVECLESWLLREKGCQWPHSDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDSHVVTMPELCWWLIRNDLADALPESAARKALRLPKPVVPSVTRESDLVPSVPATSIIQDKAKKVLALKVDPESPESFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >CP029122|4196055:4218730|4200112_4200616_-|AWF27850.1|DBSCAN-SWA MTEFLGKLFDRVYTEKDFGINVAIFVAGLAGMSCYLILHDYVLTLFAFIIPFPIVKIIAGGWHQRLLAQKGKAASRQQLAALYDSLTDSEKNVVQHFVAFGGAVMTWGQMNRLDDPQTGVESLVRKGLLGTSVSVDGMHETFELDLTLFAFAYRWRASQNTDSAAEG >CP029122|4196055:4218730|4208122_4208941_-|AWF28107.1|DBSCAN-SWA MSMELMVKAMKIRVGNPLRKLVLIKLADNASDQGECWPSYQHIADQCEISKRSVMNHIAALCESGLVKKVTRKGEKGNSSNIYLLHLDGAGDSLGGSANNSLSGAANSLGSAGVAPGGSAGDSPRTSHSFEPVKEPVNEPIAVGASVDESVRVRSNRPEYSPEFEQAWLAYPKRAGGNSKSAAFKAWKARLNEGVNPETMLEGVKRYAGWVSAMGNSGTQFVKQAVTFFGPDRHFEESWEVPAVSAARREDPYFKASYDNVDYSQIPAGFRG >CP029122|4196055:4218730|4214227_4214497_+|AWF25499.1|DBSCAN-SWA MRKFDALEAKCAAQENKVIPVSTELPPANESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA >CP029122|4196055:4218730|4205189_4205942_-|AWF27771.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATGSGGLTITDVMAAQGMVQSKAPLGFALFLAKVGVQDPQFAIEGLLNYAMALDNPTLNKLSEETRLQIIPYLVNFAFADYSRSAASKARCEHCAGTGFHNVLREVVKHSRSGESVIKEEWVKELCQHCHGKGEVSTACRGCKGKGIVLDEKRTRLHGTPVYKICGRCNGNRFSRLPTTLARHHVQKLVPDLTDYQWYKGYADVIDKLVTKCWQEEAYAEIQLRNVTR >CP029122|4196055:4218730|4199830_4199959_+|AWF27157.1|DBSCAN-SWA MMNKIEARRIALLREAIKNVDKIKEIQTFIDQELKAMNRKAA >CP029122|4196055:4218730|4198868_4199033_+|AWF25609.1|DBSCAN-SWA MRNNGKGSSFRYLCGQKACHNCCLFDYVQSYVQRKIHAADCTFWRYQSIAEHDQ >CP029122|4196055:4218730|4209999_4210551_-|AWF27499.1|DBSCAN-SWA MGKYHWKVEKQPEWYVKAVRKTIAALPGGYAEAADWLDVTENALFNRLRADGDQIFPLGWAMVLQRAAGTHYIADAVAQSAGGVFVSLPEIEEVENADINQRLLEVIEQIGNYSKQIRSAIEDGVVEPHEQTAINDELYLSISKLQEHAALVYKIFCAPEKSDARECAAPGVVAFCVCGETNA >CP029122|4196055:4218730|4207631_4208120_-|AWF28187.1|DBSCAN-SWA MSLLNEVQKFIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTRRHFPRLTERAQEPEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESYSQPERNNFLARRERCLRKSSKRAASGEEWYLSGNYVGA >CP029122|4196055:4218730|4209166_4210003_-|AWF27383.1|DBSCAN-SWA MNSLTANNRLSQQLVVSVAEHLLLRHECRLPNHLAVSNHRELYLTVGGELCKNLTAGFVTEEGFMSMLFVGSQKHSALSIFAKTTRMSALVLCGNSGVILLSVKDHQHIDSAIPGRYTVQAPYKAGVGIGVLELHTATIGALASFLLLQLSYTQIMVGWVGAPKGAPVSVCAGYANPAQFTTSEIGVSGGGSYPLHTEAAIMATVPALSRLNDEDLHKLSYVTTALRALRKVTLSDPQAHQVLVETLLNLQAERIRLADKANFHIHRLLNISGGHRHA >CP029122|4196055:4218730|4204483_4204675_-|AWF27852.1|DBSCAN-SWA MGKNTVEEIKADLSAKGSSFGLISGACSSAACRVDVEQQMNAYVLGSYYAANKKFPDKMKAEF |
34 | Escherichia_phage(56.25%) | tail,integrase | attL 4198867:4198886|attR 4218961:4218980 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_10 |
4283021 : 4300436
Sequences of DBSCAN-SWA_10
Nucleotide sequences of DBSCAN-SWA_10 >CP029122|4283021:4300436|DBSCAN-SWA ATCAAACCTCTTCTCTTTTTAATTTTTCGTTTATGAGATTATTTCTTTCCCATAATCCGGCAAAACGAGCAGCATTACTGGCGGTATAACGTACAGTATGGCGAATATTTCGATGCCCGAGATAATCCTGAATTAAACGAGTATCTGCACCACGCTCCGCCAGTTCATAACCGCAAGCATGCCTTAACATATGAGGATGAGTCTGCGTTACGGTTCCAGCTTCAATACCGGCATCGCGAATAATGCGATAGGCCTGCTGGCGAGAAAGCCGACTCCCACGGCGAGAAATAAATATAGCGTCGGTCCGGTCAGCGCCTTTCCAGTTAGCACGTTCCTGGGTCCAGCGTTCCACGGCTTCACGCTCATCAAAACGTAACGGGTGAACGGTAGAAAATCCGTTCTTCAGTCGGCGAATATTTATTCGACCTTCATTAAGGTCAAGATCCTGATAATGCAGATCAAGCAGTTCACTAATACGCATCCCATGCCGATATGCCAACAGAATAAGACAATAATCTCTGGCTCCCGTTGCCCCGTAACAAACCGCCTGCATCATGGCCTGAACTTCTTTACCGGTAAGATAACGACGTTTACTCACAATAGTAGTACTCCTGACTGAGATATATTTGAATGAATACCTATAGGAAACCTCAATCGGTCAAAATTAGCCCAATAGATAAAAAGATAACCACACACGAAGTGATGTGGCTATCAGTCAATTACATTAAATATCATACAAAAATAAAATATCACTGATGTGATAACATCTATTTTCATTGATTTGATTCAATATAACAATAGATTGCAGTGGATTAACTTCCAGTTGCTTAAAAAATAAAGAAAAAAGTGCGGAAACAGGAGAATTTAAGTGATGTGTAAATTTATGAATGATTTTGTTTATTTATTTTTAAGAACATAAACCATTGTATTTTGTATGGATATGGTTAACTTAATGTTTATTATGAAAATTAATAATCACACTTACAATAACATCAATTAATAAATGGTTAATCATTTTTGATATATCGTAAGAATAATGTAGTTTTTAACACCATCCCTGGTATCTCAACTATCTCTATAAAACAGCGTGACGCTGTCGTCCTCTGGCTCTATCCCAGATGCCGTAAAAACGCCCTGCATTGCTGGCGGTATACCAGACAGTATGACGAATATTGCGATGCCCAAGATAATCCTGGATAAGTCGCGTATCTATTCCCATATTCGCCAAAGCAAAACCACACGAATGGCGTAACATGTGCGGATGAATCTCCAGTGACAACCCGGCATTACCACCGGAAGTCGAGATAATATGGTAAAACTGTTGCCGAGAAAGCGGATTTCCCTTACGCGATAAAAATACCCATTCGCTCTCAGCATGCGGGTACGAAGTACGGATACTCAACCAGTTTTTTAAAGCCTGAACTTCTTTGTTCAATAGCGGGTGCGTTGTTGAAAAGCCTTTTTTTAATCGATGGATATATATACACTTTGCCTTAAGATCAATATCCGAAATCCTCAATCGACAAATTTCACTCGCCCGGAAACCATGAATAAAACAAAGCAAAGTCAGACAATAATTACGTGCTGCATGAGGCCCGGTATTTGCTGCTTTAAGGAGTGATTCGATTTCACTATGGGTCAGGAAGTTCCTTTTTTTGTTATCAGCCTTATTCTTCATCGTTTTTCCCTTATAATTACAGACGCGCACTAGCTGTGCTGGGTTAGCAATAATCCAACATTTTTATCCTGATTATGTTTATAAAAGCGAACGTTTGCTTAATAACTAACGTAAGTGGACCAGTTCTTCTGAGTGAACTTAAATGGAGTAGCAACAGTTACTTATAATAACGTTGCCATACAAACGCTATGCTCTTGCATGCTATGTACCTTTATATATTTATCAATCGGATACGAATCTAATAATAACCCTCATCATTAATGCAACTAATCTTATCTATATATCATGTGATATGTTCTGTAACAAGTAAAATCAACATAAAAAATTCAGCAAAATTTAATATTCGTTCAACTATCCGTCTCACATATGACATTATTAGACAACTCCATTTGCCACAAAATGGCTATTTCATGGAACGCAGTAATTTCTTAGTAGCTATTATCAGGCTAATTCTTAGGTCCCTAGCGATTATTCCTGGCTGGTTCGTCAGTTTAGTTCATGAGATTGTTACATTCCTTTTTATCCTGGAGATATGGGGGAAAGCAGGAGATGATTTTTTCATCTGCATCCGTTGGTAAATATGGCTTTGGTCACTGCGACTTAAACCTTCCTCAATCACAGCTGTTCGCCGGGCTATTCGGCAAACAAGCTAAGTGAAACATCATCCTGAAAACACCAACATCAACAAGCCTCTCCAGAGAGACTTCAGGAGTGACCAGCTACAAGCCACAAACCAGGAAACATATTTCATGATGAGAATTATGCTCAGCCTGATGGCGGGTAAACACTTATTTTCACCACGCCCACTGAGTTATCTTGCAAATCTTTCCTTGTTTATACTCTAAATAAATTACTTATAAAACAGATAATTAAATAACAACCCTTGGTTTGTGAAGGCCTCCGCAAGTGTTCGTTTGGCTTCACATGGCATCTTCTTCTTAGAAAAATATCGACATATTTTGTGACACGAATTGCAAATCTGGTTTTGTTGTATGGATTGCGTGATTTTTGATCTGGTATAACAGGTATAAAGGTGCACCAAGATAGTCAATGAGACAGGGCATCTCGCAATCTATGGCAAACATCACTTCAGTTCTTTCACATCGGGTGATGAAAATGCACTTCAGTCTGAAAGGAATATGAAAATGAGATCAACAGACACTCTATTTTATGACTCTGGGTAAAATGGATTGAGTAAGTGATATAGCTTACGAACATTCAAATCAATTAAACATCAGAAGAGATTTTATACTCAGGTATTTAATCTGGCTCTCTGTTTATTTAAATAATGTGAAAAGAGATTTTTCACAGGAGACCTTATACAAAAAATATAAAATACAGCTACCGGTTGCCAAAGACACTATAAGCCTGGCAAAAAAATATTACACAACATAAATGCTAATTGTTTATGCGGGCTTTGTATTGCTTTCTGTTTCCTACAAATGAGTGAAATTTATGAAAAAGGCTAAAATACTTTCTGGCGTATTATTACTGTGCTTTTCGTCACCATTAATTTCTCAAGCTGCGACACTGGACGTACGTGGTGGATATCGTAGTGGAAGCCACGCCTATGAGACTCGACTCAAAGTCAGTGAGGGATGGCAAAATGGATGGTGGGCAAGTATGGAAAGTAATACCTAGAATACCATTCATGATAATAAAAAGGAAAATGCCGCACTCAATGATGTTCAGGTTGATGTTAATTACGCGATTAAACTTGATGATCAATGGACGGTGCGCCCGGGAATGTTAACGCATTTTAGCAGCAACGGCACACGCTACGGACCCTACGTAAAACTGTCCTGGGACGCGACAAAAGATCTTAATTTTGGCATTCGCTATCGTTACGACTGGAAAGCTTACCGACAACAAGACTTATCCGGTGATATGTCTCGTGATAACGTTCATCGTTGGGATGGATATGTCACTTACCATATTAATAGTGATTTCACCTTCGCATGGCAAACGACGCTATACAGCAAACAGAACGATTATCGCTATGCAAACCATAAGAAATGGGCGACGGAAAATGCATTTGTTCTACAATACCATATGACGCCGGATATTACGCCATACATAGAATATGACTACCTTGACCGTCAGGGTGTTTACAACGGCAGAGATAATTTATCGGAAAACAGTTATCGCATTGGTGTGTCATTTAAACTGTAGTAGACAGGAGACAGTCACAATGAATAAAACAATAACGGCGCTTGCTATCATGATGGCTTCATTTGCCGCAAACGCGTCTGTATTACCGGAAACTCCTGTGCCATTTAAAAGTGGTACCGGAGCAATTGATAACGACACTGTCTACATTGGTTTAGGTAGCGCAGGTACGGCATGGTACAAGCTGGATACACAGGCCAAAGATAAAAAATGGAAAGCGTTAGCTGCATTCCCTGGTGGACCAAGAGATCAAGCAACCTCGGCATTTATTGATGGCAATCTGTATGTGTTTGGCGGCATTGGCAAAAACAGCGAGGGCTTGACTCAGGTATTTAATGACGTACACAAATACAACCCCAAAACCAATAGTTGGGTTAAATTGATGTCGCACGCACCGATGGGCATGGCGGGTCATGTAACTTTTGTACACAACGGCAAGGCTTATGTTACTGGCGGTGTTAACCAGAATATCTTCAATGGCTATTTTGAAGATCTCAACGAGGCTGGAAAAGATTCAACCGCTATAGATAAAATCAACGCTCACTATTTTGACAAAAAAGCAGAAGATTATTTTTTCAATAAGTTTCTGTTGTCTTTTGATCCCTCAACACAGCAATGGAGTTACGCTGGCGAATCGCCCTGGTACGGAACGGCTGGTGCGGCGGTTGTGAATAAAGGTGATAAAACCTGGCTTATTAATGGCGAAGCCAAACCAGGATTGCGAACGGATGCCGTATTTGAACTTGATTTCACCGGTAATAATTTAAAATGGAATAAGCTTGCTCCCGTCTCATCACCAGATGGCGTAGCTGGCGGTTTTGCGGGGATAAGCAATGATTCTCTTATATTTGCCGGAGGGGCCGGATTCAAAGGTTCACGAGAAAATTACCAGAACGGTAAGAACTATGCGCATGAAGGCCTGAAAAAATCATATAGCACTGATATTCATCTTTGGCATAACGGGAAATGGGATAAATCGGGTGAATTATCGCAAGGTCGGGCCTACGGAGTATCATTGCCCTGGAATAATAGTCTATTGATTATTGGCGGTGAAACTGCAGGCGGCAAAGCGGTGACGGATTCAGTTTTGATCTCTGTGAAGGATAATAAAGTTACAGTACAAAACTAACGCTTCAGGGCCCCGGTAAAGGGGCCCTGCTATCAATTTGTCATTTCAATTACGCGAAATTTATATGAACGCAATAATATCGCCCGATTATTACTATATTCTTACCATTGCTGGTCAGTCTAATGCCATGGCGTATGGCGAAGGACTGCCATTACCGGACAGGGAAGATGCGCCTCATCCCAGAATCAAACAATTAGCGAGATTTGCGCATACGCATCCCGGAGGCCCGCCATGTCACTTTAACGACATGATTCCACTGACTCACTGCCCACACGATGTTCAGGATATGCAGGGTTATCACCATCCTCTGGCAACGAATCATCAAACACAGTACGGCACCGTTGGCCAGGCACTGCATATAGCACGGAAATTACTGCCCTTTATTCCTGATAATGCAGGGGTTCTCATCGTTCCGTGTTGCCGTGGCGGATCGGCTTTTACCGCGGGCAGCGAAGGGACATATTCAGAACGGCACGGAGCCAGCCATGATGCTTGTCGTTGGGGAACGGATACTCCGCTATACCAGGATTTAGTCAGCAGAACGCGAGCCGCACTGGCAAAAAATCCGCATAACAAATTCCTCGGCGTATGCTGGATGCAAGGCGAATTTGACTTAATGACCAGTGACTACGCGTCACACCCTCAACACTTTAATCATATGGTTGAAGCCTTTCGTAGGGATCTAAAACAATACCATTCTCAGCTTAATAATATTACTGACGCACCGTGGTTTTGCGGCGATACCACCTGGTACTGGAAAGAAAATTTCCCTCATGCGTATGAAGTTATTTATGGCAATTATCAAAATAATGTTTTAGCCAATATTATTTTCGTCGACTTCCAGCAACAAGGTGAAAGAGGACTGACGAACGCGCCTGATGAAGACCCGGACGATTTAAGCACGGGATATTACGGTTCAGCGTACCGGTCACCGGAGAACTGGACGACGGCACTGCGAAGCAGTCATTTCAGCACGGCAGCCCGTCGGGGGATTATTTCTGACAAGTTTGTAGAAGCAATTTTGCAGTTTTGGCGCGAAAGGTGAGCGTTCATTAATTTGATTTATTCGCATCCATCAGCGCTTTTAACTGCTCCGGCGTAAACGGTGGTTTGCGTCGGTGCAACATCGCATGGCAATTAGGGTACACAGGCAGCAAATCATTAACTGGATCCAGCTGGTAATCCTCTTTAATGGCTGACAGCGGCACCAGGTGATGCACATGGATAAAGCCTTCAGCGATATCACCATAGATCTTCACCAGATCAACACCGCATACGTTGCACTGGCATCCATGATGTTCGACCGCTTTATCTCGGGCCTTTTTATCGCGTTCATAGCGGTTTACTGTCACCTTTATCGCCGCACCTTCAAGGTATGTCTCCTGCGACGGCAACTCATCCGGGAACAGAACGTCGGCTATGGGAGCCGCAGCGGCTTTATCCAGATGAATATGAAATCCCTGCTCTATGAGTAGCTTGACACATTTAGATTTTATTCCGCCGGTGAAATCCGCAGGGGTAAATTGCGTCCCCGTCATCAATGTCGCTGCGATGCCAGCAATCGCCTTCGAGGGGTAACGTCGGCCATGATATTCAATCTCATACAACCGCGCAGGCTGGAACTGATGAGTTTCTCCCGCATCCCATAAATGGATAGCTTCGATGATATACCGCCGCGGAAGATGATCAGGGAGTTTATTCATGGAGAATGTCCTTATACGTTCTCAGAACAGGCAGCAAAAAAACAAATGAGATCAAGGTAATATATCTTCACTTGCAGGCTTGAACAGCAGAAAATTTGCGAGCATGATCGCAATTATTTCACTGCCAGGGATATCAATAGCTTCCAGTAAATCGCTACAGCAGGCAATTGCCAATATAAAAATCTGGCACAAGGGTGAACAGCGCGCGCCGCATAAGCCATTGTTATTGCTATACGTATTAGCGGGATACCTGAATGGACATCCGCGCCTTTTCGATTATTGTAGTGGAACAGTCTACGAGGATGCAGTTCGAAGTGAGTAAGTAAAGGTTGCTGGAAACGATCAATGCCGCCATACAATGATTTTTTAGGATCAACAATGAGTAATAAAAACTACGAAAGTCACCGCAAAGCGATTGTTAGTAAAGGTATACCACCCGCTCTATTAAATAGGCTCACCAATTCGGATGTTCAGGTGATCAATACCTTCTTAACACGAGTGAGCAAACTGGAACTATCTCAACAAGAGAAAGACTGGATCATAAAAATCATCTCTATGGTTTAGACAATTTCAGCCTTGTAGCCTTTCAAGGGTTTTGTGTGGGTACAAACATTATCGAATCGGGGTATAGCCGCAGCTGTAGCTGTATCCACATCAGTAAGTTGATGTAGGTTGACTTAGGTTGAGCGAAAAATAGCGAAAAGCCTTGTGTGGCGCGGATTTCAGTCATAAAAAAACGTCCGTTGATGTCTATTGATGTTCCGATGGTGCGAAGGTGCGAAGGTGCGAAGGCCGGACTCAAACATGTGACTTAACCTCATGATTTAAAATTGTTAATAATAAACAATATCATCTTATACCAACACAGATACCAACACGAAAAATCACTGGTTTTCGGCATCGAACCAGGACAATCAGCTCCTGATTGAGATACCGAGCTCTGTTAGCTGGAGGTACCGTCATGAGAGTTGAGCAGAATAATTTCAGGGACATGTTCTCCGTTTATCTCTCCGGCCCGCCGCAGACGCAGCATGTGTTTTGTGTATCCCCGGCGGCCCGGGTCGCGCACACGGGTCTGGCAGGCGAAGAATGGCTGAAAGCCTTCCCGCTGCAGGCTTTCCAGTACGGTGATGGTGGGGATGTACGTATAGTTCTGGCTTCGGGACGTGTGTTTATCTTCCCCGAAAATCCCCGGCACATAGTGCATCAGTTCTTCGTGTGTCAGCGGACGGTCACGGCGTATCTGGTTCGCATACCCGACGATAACCATACGTGGGCAGCTCTCCGATAACATGGTGTATACGGAGAAGCACATCCGTATCATCAGTGTGACGACTGCGGCGGCCATCCATCCAGTCATCGGTTCGTCTGAGAATGACGTGCAACTGCGCACGCGACACCCGGAGACAACGGCTGACTAAGCTTACTCCCCATCCCCGGGCAATAAGGGCGCGTGCGCTATCCACTTTTTTGCCCGTCCATATTCAACGGCTTCTTTGAGGAGTTCATTTTCCATCGTTTTCTTGCCGAGCAGGCGCTGGAGTTCTTTAATCTGCTTCATGGCGGCAGCAAGTTCAGAGGCAGGAACAACCTGTTCTCCGGCGGCGACAGCAGTAAGACTTCCTTCCTGGTATTGCTTACGCCAGAGAAATAACTGGCTGGCTGCTACACCATGTTGCCGGGCAACGAGGGAGACCGTCATCCCCGGTTCAAAGCTCTGCTGAACAATTGCGATCTTTTCCTGTGTGGTACGCCGTCTGCGTTTCTCCGGCCCTAAGACATCAATCATCTGTTCTCCAATGACTAGTCTAAAAACTAGTATTAAGACTATCACTTATTTAAGTGATACTGGTTGTCTGGAGATTCAGGGGGCCAGTCTAATCAGAGAGTAACGTAAAGGATTTGCCAGAGTAAGCTAAGAGAGTAAATCTCTCCGGTAACAGAGCGCTGTGAATCGTTCATGTATCAGAGTGCAGATACATGAACGCTGATGCGTAAAATCAGCAGTGACACGCCAGCTCCCCCTGTGCCCATTCCTGTATAAGACATGAACTGAGCGTATTAATGCTCCTGCCCTGAGACTGAGCCGGTGAGTGAGGGGCTGAGAAATCCCTCTTGACACTGCTCTAAACCAGAAATACTTCCCCGGTTGTTGCGGTGCGTCCGGATCCCCGATAACGGAACACTAAGGCTGTATTGCGGGACACCTGGTAAGTGGCAGCCCGTAATACGTCAGGAGACGCGGGAAAAGTGAATTCCCACACGCGTTATAAACTTACCCTTGCGGAGCTGGAAGCCTTTAACTCTGCCGTTGACAACCGGCTGGCAGAACTGACAATGAACAAACTTTACGATCGCGCGCCGGCTTCCGTCTGGAAATATGTCACCTGAATATTGAAGCAGCGCTACGATGACTCCGGTGTTATCTGCACCGGAGGACCGGGAACATTAAGCGGTATAGCGACAACAAAGAATTCGAAATCAAAAGGGAATGGTGCCGGAGGCGTGGGCGGTAATAGGCTGATTTTCAGGTATTACCGGCCTGTATCTGCTGAAGGGGGCTCGTATTTGTCGGGGCTGCGTATTCCGCTCGATTCGATCACGGATTGGCGGAATACCTAACTGTGATTGATTCAGGTACCATTTTGCAGGGATCCGGTGCGTTACTGTCTGACACAACAAACGCAGCGGGTCGCTCTGCTCCGCCGATTCCTGATAATCCCTCCCCAGTAACCTACTGCAACAGTACAAATACACTGACGCCTGAAACGGTAACCAGAATCAGCACCAGCAGTAACTCAGTAAACAGAGCAGACGATCGGTCTTCCGGTAATAACCGGAAGGAACGACAGCGCCACAATACAGGCAACACACTGTATTATGAGTAATCGAATTACCGCAATCCCCGCATTTCTTTTTGCGTATCACGATAGCTCCCTCGCCACGCCTTATCCGTAACCGGTTTTTTACATTAAAAAAAACAACGCCGGGAATTCAATCGTCAGTCCGGAGACGACCGTTCGGGTTATCACAGAGAGCCTGAGCAGGGGGGCTGCGCATTTTTTACGATGTGGCTATTCCGTATGAACCATACGGAGATATAACCATGACCTACAAATACAACCCCTTCTGGCAGCAACGTATTCGTGAGACGGTGCGGCACGCACTGAATGTTCATCCCCGCCTGACGGCATTGCGGGTTGACCTGCGTTTCCCGGATGTACCGGCAGCAACGGACGCAGCTGTGATATCCCGCTTCATCAATGCCCTGAAAGCCCGAATCGACGCTTACCAGAAACGTAAGCATCGGGAAGGTAAACGCGTGCATCCCACAACCCTGCATTACGTCTGGGCCCGGGAGTTTGGGGAGTGCAAAGGTAAAAAACACTATCACCTGATGCTGCTGGTCAACCGGGATACCTGGTGTCGTGCCGGTGATTACCGCGCTCCGGGATCACTGGCCGGGATGATTAAACAGGCGTGGTGCAGTGCCCTGGGAGTGGATGTCGGGTGCCATGCCACGCTGGTGCATTTTCCGGCCTGGCCGGCGGTGTGGCTGGCGCGTAATGATGACACCGGCTTTCAGCAAGTGCTGGAACGTGCTGACTATCTGGCGAAGGAGCATACCAAAGTTCACTGCACCGGTGAGCGCAACTTTGGCTGCAGTCGTGGCTGAGCCAGACAGACGAGTTATCTGACTTCAGTGTGATGACGTGTTCACGGCCACCACACTGATTTTCCCTTATGAGACTGAACTGAAACCGCCATTCACACCGGACCACGCTGCCCGTACCGGGACGGCTGTTGCCTGCGTGTCGTCTGACGGTAAGGAACATATACTATGTACGCAAAATCCTTTCTCGCTCTTGATGGCAACGGACGTCTGACGGGCGCCCGTACAGCACAGGCCGCACCTTATGCTCACTACACCTGCCACTTGTGTGGCAGTGCACTCAGATACCATCCGCAATACGACACTGAACTTCCCTGGTTTGAACACACTGACGACAGGCTGACAGAGCACGGTCAACAGTGCCCTTATGTCAGGCCGGAGCGCAGAGAAATACAGTTGATTAAACGTCTGCAGCAATTCGTACCGGATGCCTTACCCGTGGTGCGTAAAGCCAGCTGGCACTGCAGACAATGTCACCACGATTATTATGGGGAGCAGTACTGCACACACTGCCAGACCGGAGGATTCAGTATTCCCCGGACAACTCAGGAGGAAATATGCGAATTCTGAACTGTTATATGGCGAATGACAGCAAAGGCCATTTTGTTACGGCGAAAGAAGCTGCGAAGCACAACCGACAGGACGTTTTGTGCTGTGTGTCCTGTGGATGCCCGTTAACACTTCAGCGGGGCAATGACGGACAACCACCGTGGTTTGAACATGACCAGATGACTGTCGCTGAAAAAATCCTGCTGCGATGTACCTGGCTTGACCCGGCGGAGAAAGAGGCCCGTCGTTTGCATCTGCAGGGCATGACGGTTCCGGATTATACGGTGAAGGTGAGAAAGTGGTTTTGTGTGATGTGTGACGAAGATTATGAGGGGGAAAAGTGCTGCCCACGCTGCGGTACCGGGGTATACAGCAGGGCGTGGGGGCGGCAGGAGGTGCCGTCGGAAGATGCCAGGGCTGATAATCCGTTACAGAGGCTGTAATGGTTGCCTCCGGAGCAGTTTGCGGTGATGCCTTTCCCGGGAATGGATTGCCAGTGCGGGGACTGTGGAGCTTAATTCCGGTGTTGGTGGCCTTTCCGTTGATTTTATGCCAACAGCCCCCTGTGACTGACAGGCTGCCTCGTCATTCCATTCGTATCTCCAGTAACAGGGGGTTGTATTGTATTTTATTGTGCCGGTTCAGGTGTGCGTTTCCCGGCGCGTCTGCACCGGCTTAACCAAATTCAACAGGGATAAAATAGTGGTAAGCGTACAGCCTGAACCGTCTGGTCAGAATCTGACGAATTAGACAAAGTGGTGTCCACCAAATAAGTAGTGGGAACCAAAGTGTCAGATATGCAGAAAAATGTGACTCCCGGCAGGCGAAAAGGCTGCCCTAATTATCCTCCCGAATTTAAACAGCTGCTCGTTGCTGCCTCCTGTGAACCCGGGATATCCATCTCAAAACTTGCTCTTGAAAATGGCATTAACGCCAATCTGTTGTTCAAATGGCGACAACAATGGCGCGAGGGAAAGCTGCTATTACCTTCTTCAGAGAGCCCCCAGCTACTTCCTGTGACTCTCGATGCAGCTGCCGAACAGCCAGAATCGCTCGCAGAGGACCCGGAAACCCTCAGTATCAGCTGTGAGGTAACGTTCCGGCACGGGACGCTCCGCTTCAATGGCAATGTCAGCGAAAAGCTCCTGACTCTGCTGATACAGGAACTGAAGCGATGATCCCGTTACCTTCCGGGACCAAAATTTGGCTGGTTGCCGGTATCACCGATATGAGAAATGGCTTCAACGGCCTGGCTGCGAAAGTACAAACGGCGCTGAAAGACGATCCCATGTCCGGCCATGTTTTCATTTTCCGGGGCCGCAGCGGCAGTCAGGTTAAACTGCTGTGGTCCACCGGTGACGGACTGTGCCTCCTGACCAAACGGCTGGAGCGTGGGCGCTTCGCCTGGCCGTCAGCCCGTGATGGCAAAGTGTTCCTTACGCAGGCGCAGCTGGCGATGCTGCTGGAAGGTATCGACTGGCGACAGCCCAAGCGGCTGCTGACCTCCCTGACCATGCTGTAAATCTCTTTATCCTGGTTGTCACAGAATAAGCCCGGTAAAATACGGGCTTATGAACGACATCTCTTCTGACGACATCTTCCTGCTAAAACAGCGCCTGGCCGAACAGGAAGCGCTGATCCACGCCCTGCAGGAAAAGCTGAGCAACCGGGAGCGCGAAATAGACCATCTGCAGGCGCAGCTGGATAAACTCCGCCGGATGAACTTCGGCAGTCGTTCCGAAAAAGTCTCCCGCCGTATCGCACAAATGGAAGCCGATCTGAACCGGCTTCAGAAAGAGAGCGATACGCTGACTGGTAGGGTGTATGACCCGGCAGTACAGCGTCCGTTGCGTCAGACCCGCACCCGTAAGCCGTTCCCTGAATCACTACCCCGTGACGAAAAGCGACTGTTGCCTGCGGCGCCGTGCTGCCCGAACTGCGGCGGTTCACTGAGCTATCTGGGCGAGGATACCGCCGAACAGCTGGAGTTGATGCGTAGCGCATTCCGGGTTATCCGGACGGTACGGGAAAAACATGCCTGTACTCAGTGCGATGCCATCGTGCAGGCACCTGCACCTTCGCGGCCCATCGAGCGGGGTATCGCCGGACCGGGGCTGCTGGCCCGCGTGCTGACCTCGAAGTATGCAGAGCACACCCCGCTGTATTGCCAGTCAGAAATATACGGCCGGCAAGGTGTGGAGCTGAGCCGTTCACTGCTGTCGGGCTGGGTGGATGCATGCTGCCGGCTGCTGTCTCCGCTGGAAGAGGCGCTTCATGGCTATGTCATGACTGACGGCAAACTCCATGCCGATGATACCCCGGTCCAGGTACTGCTGCCGGGTAATAAGAAGACGAAGACCGGGCGGTTGTGGGCGTATGTTCGTGATGACCGCAATGCCGGGTCAGCGTTGGCACCTGCAGTGTGGTTCGCTTACAGCCCGGACAGAAAAGGCATCCATCCGCAGACTCATCTTGCTTGCTTCAGCGGTGTGCTGCAAGCGGATGCGTACGCCGGGTTCAACGAGCTGTATCGCAATGGTGGGATAACGGAAGCTGCCTGCTGGGCTCATGCCCGCCGAAAGATCCACGATGTGCACGTCCGCATCCCGTCAGCACTGACGGAAGAAGCCCTGGAGCAGATCGGTCAGTTGTACGCCATAGAGGCGGATATAAGGGGAATGCCGGCAGAGCAGCGGCTTGCTGAACGTCAGCGAAAAACGAAACCGCTGTTGAAATCCCTGGAAAGCTGGTTGCGTGAAAAGATGAAGACCCTGTTCTTCGGCTCTGGCCATGGTGGTGAGCGGGGAGCGCTACTGTACAGCCTGATCGGGACGTGCAAACTGAATGACGTGGATCCAGAAAGCTACCTTCGCCATGTGCCTGGCGTCATAGCAGACTGGCCGGTCAACCGGGTCAGCGAACTGCTTCCGTGGCGCATAGCACTGCCAGCTGAATAACACATCCCCGTCAATACGGCCCTCGCTGTACGCTTACTTTTGTTCGGACTGCTTATGGCCTTTTTCCACTGGTGTAGCAGGAAAGTTAACCGGCTGCCACCATGAGATGACGTGTAAACCGTGCAGGCATGTTTTCACGTGGTGACGCTTTCATTGGGGGTCTACGCCGGAACCCCAATATTGTTCACATACATGAATTTGAAAAACTAAGCGATACTGCTGTCCGTACCTTATAAACTTATTTCAACGCTTCCGGCAGCGTCACAATCCACAGTTCACTGTCAGACTCCGGTTTCTCCAGTGGCTTCACTCTGAGGGTCGTTTCAACCTGCTTGTCGTCAAGCGTCAGCTTTCCTTCCGCGTTCACCCGATAATCCTTGAATGTCTTGCCCTCGACCGGATGTTGCATCGCCACCTTCACGCCAAAGTCGTTATCGTTCGCCACCGCCAGCGTTTTGCTGTCGATCAGCGCCAACCCTTCAGCCTTCTCCTGCTGCCAGCCCAAGGAGCGCAGATCAACAACCTGCGTTTTCTGCGCAAGGGTAATGCCGCGCTGTGACAGGGTTTTCTCATCATCAAACTCCGGATACTCACCCGGCTTGTCGAAGCCTGACAGGTCGCTCGCCTTATTGAGATCGACCTCGTAAATCAGGTTACGCATTCTGTTGTTTTTATCTCTTCCCTGTTCGATCAGCAGGATGTGCTGGTTATCGAGCGCCACGATGTCTCCGATTTTAGCGTCGCTGTTTTTGCTGTAGGCCGCGCTGTCGATAGGATAACCGTACATCGCGGTTTTTCCGCTTGCCGGATCGAAGCTCACCAGACGCGTAAACAGCGCCTTTTTCTTGCTCTTAGCGTCGATATCCAGCGTACTTTGCACGGCGACAATAATACGCCCGTCCGGCATACGGGTGAGGCCTTCAAAGCCGCGGTTCGCCTGACGCCATTTGAGGATATTTGGCAAACCGCCCGCAATCGCTTTTTCCCCTTCCGCCGCCTGCGGACCGTGGATTGCCAGGATTTTCCCTTTACTGTCGATGTTAATCAGGAACGGGCCATACTCATCGCACAGCCAGTAACCTCCTTTACCATCCGGAGTGATCCCTTCCGTATCCAGCCCGCGATTATCCCCCTTCAGTCTATGTAACGTGTCGCTGAAGGCCACTTCATTGGTGGAGCCAATCACGTCGCTTGCCAGCGGCAGACCGTTGATCGCGCCTTTATCGTCATGCAGAGGACGGGGATCGATGGCCTCCGCTTTGCCGTTTTGTACACGAATCGTCATCAGCAGCGGCGCGAAATCTGGCGTAACGAAAATTTTAGTTTCGTTTTTCCCCTCTTTCGGTGAATCCGCGTTCGGACCGCGATCGGTAATGGTTGCGAATGTCAGGGCATCGCCCTGTTTACCGGTGAACAGCAGGCCAGAACCTATCCCAACCGGAAGGCCGTTCGGGAATGCACTGGCAAATGCGCCGGCATAGTTCACGTGCGTGCCTTCTGGAAAACTGACGACATAGCGCTCAGCCGTAGGTTGTGCCGCCAGGGCAGAGAAAGAGAGCGTACAACCGATTAGCACGGGAATAATTTTTCTTTTCATATTGTTAGCATCCATAAAGTGAAGATCACACTACACCATAAAGAAGAAACGAGTCAGGAAAATGACATCGAATGGCAACATGCGCGGGCCTGAGGCCAAAGAAATATAAAACACTGTTGTTATTTGTAACAACGGAAACATTTACGGAAATTATTTGTCTTTCTTCGTGATCAAAAATGCTGTGTCTTGTTGGATTGGAGTGATGAGGGCTGCGTTTAGGAAACGCCCTATATTGTACGGTTGCATTATCGCGGTAGGGGTAATGCACGTGTCAGTATGATGAGCTGACCACACTCACGTGGGTGGACTGGTATAACAATCGACGATTGCTGGAAAGGCAGGGCCATATTCCACCGGCAGAAGCAGAAAAAGCTTATTATGCTTCCATCGGAAACGATGATCTGGCAGCCTGAGTTCACAGATAAAACACTCTCCAGGAAACCCGGGGCGGTTCAATTGCTGGATAAGATCATAACGATACACCGGGTTAAACAGGTGGTCACTGCTTCACTGAAGCTGCGTGATTATGATGGTCAGGTAGCAGAAGCTATGCCCATGATACCAAAATGACGAAAGCCGAAATGTCTGAAAGCGTGTGTATTACCTGAAAATATGACCGGCTCCGGGGTACGTAACCGATTCAACAAAGCCAATCGATATTACAAAACGAGAGGGACTATATATTTCGTTTTCAGATAAGACTCCTTGAGAGATTATCTTTTATGTAAAATCTCAAATATCACCTGATGTAATGTCAATGATAATATTAAGTGCGCTCTGAACAACATCTTTCTTGGTGTGATTAACATCTGACAAAAAGAACATCAGCTTCTTTATTATGTTCTTGTTGTTTATTATTTCACGCTCACAGAATAAAGCGTTGACTGCCTCTTCTATGACATTTCTCTCGTAAAAAAATTGAGAGTTAGCTTCATATAAAATATTGGCCAATTTTTCACCAAAAACCATATTATAATAATCACTGTGCATCTTTCTCCCCCAGGGTTCAGTCAATAATTTTACCATGCCTAATTATGACAGGTGTAACAAGCATATTTTGCTATATAAATAAAATACATAATACTGATTATTGTAAGAACAGGAAGCAGACTAATGTTTTTTTTTTGTGAGCACATTCCTTCTTATGTAGGAAGAGGCTGTATTATATTCAGTCAACTCTTTTTTACTAAAAACATTATCTATGCCATCAAAGTTGCCTGCTAGCTAATTTTTAGGATATCGCCTTGTCGTGGATTATGCACTCTAAACATGTAGAAGGTATTCATTACAAATACACAGACGGCCATCTTTTGTAAAATGGATGCCGTTTTAAGTGAGATTCTGATTTTCAAATTGTTCCGGACTGAGACCGCCACACCAACTGTACCATCGCCAGCGATTGTAATCGCACTCGATATAATTAAGTAAGTTTGCTGATTTTGCTGTTTACTGCGACAGATGTAAAGCTGTGGTGATTCATGCAGACTAAGTTCACGAACTCCGGCAGTAACACCGATGCATTCTGCAAGCTTCAGATCTTAACTACGAAATTCATGCGAATGGTGTTTACGTTTTTTTACTGGTTGAAACTGTTTTGTCTTGTGAATCACCTCTGACTGAGAGTTTACTCACTTAGCCGCGTGTCCACTATTGCTGGGTAAGATCACAACACTTTAAAATGCCCTATATAGGGTTAACCAGTAATTATCTGTATGATAGCTTTGAGTTCGGGAGGAGGTAATGACTCCAACTTACTGATAGTGTTTTATGTTCAGATAATGCCCGATGACCTTGTCATGCAACTCCACCGATTTTGAGAACGACAGTAACTTCCGTCCCAGCCTTGCCAGATGTTGTCTCAGATTCAGATTATGTCGCTCAATGCGCTGAGTGTAACGCTTGCTGATAACGTGCAGCTTTCCCTTCAGGCGTGATTCATACAGCGGCCAGCCATCCGTCAT
Protein sequences of DBSCAN-SWA_10 >CP029122|4283021:4300436|4297116_4298475_-|AWF27413.1|DBSCAN-SWA MKRKIIPVLIGCTLSFSALAAQPTAERYVVSFPEGTHVNYAGAFASAFPNGLPVGIGSGLLFTGKQGDALTFATITDRGPNADSPKEGKNETKIFVTPDFAPLLMTIRVQNGKAEAIDPRPLHDDKGAINGLPLASDVIGSTNEVAFSDTLHRLKGDNRGLDTEGITPDGKGGYWLCDEYGPFLINIDSKGKILAIHGPQAAEGEKAIAGGLPNILKWRQANRGFEGLTRMPDGRIIVAVQSTLDIDAKSKKKALFTRLVSFDPASGKTAMYGYPIDSAAYSKNSDAKIGDIVALDNQHILLIEQGRDKNNRMRNLIYEVDLNKASDLSGFDKPGEYPEFDDEKTLSQRGITLAQKTQVVDLRSLGWQQEKAEGLALIDSKTLAVANDNDFGVKVAMQHPVEGKTFKDYRVNAEGKLTLDDKQVETTLRVKPLEKPESDSELWIVTLPEALK >CP029122|4283021:4300436|4300226_4300436_-|AWF25833.1|transposase|DBSCAN-SWA MTDGWPLYESRLKGKLHVISKRYTQRIERHNLNLRQHLARLGRKLLSFSKSVELHDKVIGHYLNIKHYQ >CP029122|4283021:4300436|4295492_4296878_+|AWF25818.1|transposase|DBSCAN-SWA MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRMNFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRKPFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYCQSEIYGRQGVELSRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTPVQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTHLACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALTEEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKTLFFGSGHGGERGALLYSLIGTCKLNDVDPESYLRHVPGVIADWPVNRVSELLPWRIALPAE >CP029122|4283021:4300436|4290044_4290263_+|AWF24636.1|DBSCAN-SWA MPPYNDFLGSTMSNKNYESHRKAIVSKGIPPALLNRLTNSDVQVINTFLTRVSKLELSQQEKDWIIKIISMV >CP029122|4283021:4300436|4289750_4289873_-|AWF27980.1|DBSCAN-SWA MAIACCSDLLEAIDIPGSEIIAIMLANFLLFKPASEDILP >CP029122|4283021:4300436|4295095_4295443_+|AWF24229.1|transposase|DBSCAN-SWA MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGIDWRQPKRLLTSLTML >CP029122|4283021:4300436|4290931_4291099_-|AWF26287.1|DBSCAN-SWA MSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVCEPDTP >CP029122|4283021:4300436|4292805_4293375_+|AWF28452.1|DBSCAN-SWA MTYKYNPFWQQRIRETVRHALNVHPRLTALRVDLRFPDVPAATDAAVISRFINALKARIDAYQKRKHREGKRVHPTTLHYVWAREFGECKGKKHYHLMLLVNRDTWCRAGDYRAPGSLAGMIKQAWCSALGVDVGCHATLVHFPAWPAVWLARNDDTGFQQVLERADYLAKEHTKVHCTGERNFGCSRG >CP029122|4283021:4300436|4288243_4289041_+|AWF25304.1|DBSCAN-SWA MIPLTHCPHDVQDMQGYHHPLATNHQTQYGTVGQALHIARKLLPFIPDNAGVLIVPCCRGGSAFTAGSEGTYSERHGASHDACRWGTDTPLYQDLVSRTRAALAKNPHNKFLGVCWMQGEFDLMTSDYASHPQHFNHMVEAFRRDLKQYHSQLNNITDAPWFCGDTTWYWKENFPHAYEVIYGNYQNNVLANIIFVDFQQQGERGLTNAPDEDPDDLSTGYYGSAYRSPENWTTALRSSHFSTAARRGIISDKFVEAILQFWRER >CP029122|4283021:4300436|4293929_4294364_+|AWF25036.1|DBSCAN-SWA MRILNCYMANDSKGHFVTAKEAAKHNRQDVLCCVSCGCPLTLQRGNDGQPPWFEHDQMTVAEKILLRCTWLDPAEKEARRLHLQGMTVPDYTVKVRKWFCVMCDEDYEGEKCCPRCGTGVYSRAWGRQEVPSEDARADNPLQRL >CP029122|4283021:4300436|4294542_4294671_+|AWF26449.1|DBSCAN-SWA MYFIVPVQVCVSRRVCTGLTKFNRDKIVVSVQPEPSGQNLTN >CP029122|4283021:4300436|4292433_4292559_-|AWF28100.1|DBSCAN-SWA MWRCRSFRLLPEDRSSALFTELLLVLILVTVSGVSVFVLLQ >CP029122|4283021:4300436|4293768_4293942_+|AWF24168.1|DBSCAN-SWA MIKRLQQFVPDALPVVRKASWHCRQCHHDYYGEQYCTHCQTGGFSIPRTTQEEICEF >CP029122|4283021:4300436|4290642_4290969_-|AWF25125.1|DBSCAN-SWA MVIVGYANQIRRDRPLTHEELMHYVPGIFGEDKHTSRSQNYTYIPTITVLESLQREGFQPFFACQTRVRDPGRRGYTKHMLRLRRAGEINGEHVPEIILLNSHDGTSS >CP029122|4283021:4300436|4284095_4284698_-|AWF27475.1|integrase|DBSCAN-SWA MKNKADNKKRNFLTHSEIESLLKAANTGPHAARNYCLTLLCFIHGFRASEICRLRISDIDLKAKCIYIHRLKKGFSTTHPLLNKEVQALKNWLSIRTSYPHAESEWVFLSRKGNPLSRQQFYHIISTSGGNAGLSLEIHPHMLRHSCGFALANMGIDTRLIQDYLGHRNIRHTVWYTASNAGRFYGIWDRARGRQRHAVL >CP029122|4283021:4300436|4286889_4287996_+|AWF26660.1|DBSCAN-SWA MNKTITALAIMMASFAANASVLPETPVPFKSGTGAIDNDTVYIGLGSAGTAWYKLDTQAKDKKWKALAAFPGGPRDQATSAFIDGNLYVFGGIGKNSEGLTQVFNDVHKYNPKTNSWVKLMSHAPMGMAGHVTFVHNGKAYVTGGVNQNIFNGYFEDLNEAGKDSTAIDKINAHYFDKKAEDYFFNKFLLSFDPSTQQWSYAGESPWYGTAGAAVVNKGDKTWLINGEAKPGLRTDAVFELDFTGNNLKWNKLAPVSSPDGVAGGFAGISNDSLIFAGGAGFKGSRENYQNGKNYAHEGLKKSYSTDIHLWHNGKWDKSGELSQGRAYGVSLPWNNSLLIIGGETAGGKAVTDSVLISVKDNKVTVQN >CP029122|4283021:4300436|4283021_4283618_-|AWF24395.1|integrase|DBSCAN-SWA MSKRRYLTGKEVQAMMQAVCYGATGARDYCLILLAYRHGMRISELLDLHYQDLDLNEGRINIRRLKNGFSTVHPLRFDEREAVERWTQERANWKGADRTDAIFISRRGSRLSRQQAYRIIRDAGIEAGTVTQTHPHMLRHACGYELAERGADTRLIQDYLGHRNIRHTVRYTASNAARFAGLWERNNLINEKLKREEV >CP029122|4283021:4300436|4291122_4291488_-|AWF25754.1|transposase|DBSCAN-SWA MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEAVEYGRAKKWIAHAPLLPGDGE >CP029122|4283021:4300436|4291949_4292090_+|AWF26990.1|DBSCAN-SWA MNSHTRYKLTLAELEAFNSAVDNRLAELTMNKLYDRAPASVWKYVT >CP029122|4283021:4300436|4286444_4286870_+|AWF27464.1|DBSCAN-SWA MLTHFSSNGTRYGPYVKLSWDATKDLNFGIRYRYDWKAYRQQDLSGDMSRDNVHRWDGYVTYHINSDFTFAWQTTLYSKQNDYRYANHKKWATENAFVLQYHMTPDITPYIEYDYLDRQGVYNGRDNLSENSYRIGVSFKL >CP029122|4283021:4300436|4298871_4299045_+|AWF28166.1|DBSCAN-SWA MIWQPEFTDKTLSRKPGAVQLLDKIITIHRVKQVVTASLKLRDYDGQVAEAMPMIPK >CP029122|4283021:4300436|4299207_4299465_-|AWF27356.1|DBSCAN-SWA MHSDYYNMVFGEKLANILYEANSQFFYERNVIEEAVNALFCEREIINNKNIIKKLMFFLSDVNHTKKDVVQSALNIIIDITSGDI >CP029122|4283021:4300436|4289048_4289699_-|AWF27012.1|DBSCAN-SWA MNKLPDHLPRRYIIEAIHLWDAGETHQFQPARLYEIEYHGRRYPSKAIAGIAATLMTGTQFTPADFTGGIKSKCVKLLIEQGFHIHLDKAAAAPIADVLFPDELPSQETYLEGAAIKVTVNRYERDKKARDKAVEHHGCQCNVCGVDLVKIYGDIAEGFIHVHHLVPLSAIKEDYQLDPVNDLLPVYPNCHAMLHRRKPPFTPEQLKALMDANKSN |
23 | Stx2-converting_phage(37.5%) | transposase,integrase | attL 4272627:4272642|attR 4303441:4303456 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_11 |
4307036 : 4313595
Sequences of DBSCAN-SWA_11
Nucleotide sequences of DBSCAN-SWA_11 >CP029122|4307036:4313595|DBSCAN-SWA GATGAAAATTGCGCTGGTTATTTTCATCACCCTTGCCCTGGCGGGCTGTGCGCTGTTATCACTCCATATGGGAGTGATCCCCGTGCCGTGGCGCGCGCTGCTGACCGACTGGCAGGCCGGACGCGAGCATTATTATGTATTGATGGAGTACCGACTGCCGCGCTTGCTGCTGGCACTGTTTGTCGGTGCAGCCCTCGCCGTGGCGGGCGTGCTGATACAGGGGATTGTGCGCAACCCTCTGGCATCACCGGATATTCTCGGCGTTAACCATGCCGCCAGCCTGGCCTCTGTGGGGGCTCTACTTCTTATGCCGTCACTGCCCGTGATGGTGCTGCCGCTGCTGGCCTTTGCGGGCGGCATGGCGGGGTTGATATTACTGAAGATGCTGGCAAAGACCCACCAGCCGATGAAGCTGGCGCTCACCGGCGTGGCGCTTTCTGCATGCTGGGCCAGCCTGACGGATTATCTGATGCTCTCGCGCCCACAGGATGTGAACAACGCCCTGCTGTGGCTGACCGGCAGCTTATGGGGCCGTGACTGGAGCTTTGTGAAGATTGCCATCCCGCTGATGATTTTATTTCTGCCGCTGAGCCTGAGTTTTTGCCGCGATCTCGACCTCCTTGCACTCGGCGATGCGCGCGCCACCACGCTCGGTGTGTCGGTGCCCCATACCCGATTCTGGGCTTTGTTACTAGCTGTCGCCATGACATCTACCGGCGTGGCCGCCTGCGGCCCGATTAGCTTTATTGGTCTCGTGGTGCCGCATATGATGCGTAGCATCACCGGTGGACGTCACCGCAGACTGCTGCCTGTTTCAGCCCTGACAGGTGCGTTGCTGTTGGTGGTTGCCGATCTGCTGGCGAGAATTATTCATCCCCCACTGGAGCTCCCGGTTGGCGTGCTGACCGCCATTATCGGTGCGCCGTGGTTTGTCTGGTTGCTTGTGAGAATGCGATAAATGACTTTACGAACTGAAAATCTGACGGTCAGTTACGGGACAGACAAGGTACTTAACGACGTTTCACTCTCACTGCCAACGGGGAAGATCACCGCCCTGATCGGTCCTAACGGTTGCGGGAAATCGACGCTGTTAAACTGTTTTTCGCGGCTTTTAATGCCGCAGTCTGGCACCGTATTTCTCGGCGATAATCCCATAAATATGCTCTCATCGCGCCAGTTGGCCCGCAGGCTTTCGCTGCTGCCTCAGCACCATTTAACGCCAGAGGGGATCACAGTCCAGGAGCTGGTTTCGTATGGTCGTAATCCCTGGCTGTCACTCTGGGGGCGTCTCTCCGCTGAAGACAATGCACGAGTTAATGTCGCCATGAACCAGACCCGGATCAATCATCTTGCCGTTCGTCGGTTAACCGAGCTTTCCGGCGGTCAGCGCCAGCGCGCATTTCTGGCGATGGTCCTGGCCCAGAATACGCCCGTTGTATTACTTGATGAGCCAACCACCTATCTTGATATCAATCACCAGGTGGACCTGATGCGGTTGATGGGCGAACTCCGGACTCAGGGGAAAACGGTGGTCGCTGTGCTGCACGACCTTAATCAGGCTAGCCGGTACTGCGATCAACTGGTGGTAATGGCAAACGGACATGTTATGGCGCAAGGCACACCAGAAGAGGTGATGACCCCAGGATTGCTGAGAACAGTATTCAGCGTGGAAGCGGAAATACACCCCGAGCCGGTATCTGGCAGGCCGATGTGCCTAATGAGGTAGATTGCACAGGCCGTAAGAACCAAACCACGACTGAATGAAACTGGACTGGCGCCAGCAAGCCTGTTCAGACTGGGGCTGAACTTTTCCGGACTCTGAAAGATTACCAATACTCATCGTCCATCCGCTTGCTTTAGGCTGACAGGTTCATAATCAACGCAAACCAGAGCTGTACAGGCTTGGGCGCGGCTTTCAAACCAGTCGTGATCACGGCAATCAATTTTGAACTCTGCTTAACGGACATTTCTGTATAACCCTTACGGCAACGAAAAACGCGAAGTTAAAATTTTAGAAACCCAAAAACGTGACATGACTAAGTTTAGATTTCAGGGGGGGAGATCAAAAAATTTCGCTCTGTGCCAGAGCGGACATTCACGGAGCTGGTTCATTACCAATGAGGTTGGGCTTTTGAGGATAAATCAATGATCAGACGCCAACGTAAATCAAAAGCACCCCTGGAACGGAATTGCTAATCCAGTTTCTGACCATCGATTTTTCTAAAAAGTGTGCGTTTGCTACTACTTAGGTAGGTGCAGCTTTCTTAATCACCGGCAGCCACGTTATACAGGCCAGTTGATGGATCGATTGTTATCAATGATATCTTTATGAGTCGGTGTCTCACCCAGCTTACCGAAGCTGGCATAAAGTGAAGGCAGACGGGCCCGTCCTTCTCCCTTTTTCGCCAGAGGGAAAGCGCGAAGCATGGTGGCAGCCTCCCAGTCACAACAATAGGATGGTGTGCACGGCTGCTGACGCCATGATTCAGCGATAGAGCCGGAAAATACGGGGTCAAAGCCGGTATCATTAACCAGAGTCATCGTGACCTGTACTGCGGCTGGATGATCTCCTGCTACTGCAACTGCTAGGCGACCGCGGCTCCCTTCAGGTAGTCTGTTGTGGTCAACTAAAACTGGCCTCCGCGTTAGAGTTTTTCCAGTATCGGTTTTCTGATTCGTTTGGTGGTAACCCACCATTATATTCGTGCGGTCTTAGTGCGCTGTAATATCCAACGATATAGTCCGTTATTGCGTGAGCTGCATCGCTGAAGCTTACATAGCCCGTCGCTGGCACCCATTCGTTCTTCAGACTCCTGAAGAAGCGCTCCATTGGGCTGTTATCCCAGCAGTTTCCACGCCGACTCATACTCTGCCTGATCCGGTATCGCCACAGTAACTGCCGGAACTGCCTGCTCGTATAATGACTGCCTTGATCGCCTGGAACATCACCCCGACGGGCTTACCACGGGTTTCCCATGCCATTTCCAGTGCTTTCATGGTAAGCCTGCTGTCCGGCGAGAACGACATGGCCCAGCCCACTGGTTTTCTTGCGAACAGGTCGAGAACAACGGCGAGGTACGCCCAGCGCTTACCCGTCCAGATATAGGTCACATCACCGCACCACACCTGATTTGGTTCCGTTACGGCGAACTGTCGCTCAAGATGATTCGGGATAGCAACGTGCTCATGACCGCCACGCTTATACCGGTGAGTCGGCTGCTGGCAACTGACCAGCCCCAGCTCTTTCATGAGTCTGCCAGCAAGCCAGTGCCCCATCTGGTAGCCTCTCCGGGTTGCCATTGTGGCGATGCTTCTTGCTCCGGCAGAGCCGTGGCTGATGCCATGCAGTTCAAGTACCTGGCTGCGTAATACAGCCCGTCTGCCGTCTGGTTTTTCAGGACGGTTTTTCCAGTATCTGTAGCTGCTGCGATGAACCCCGAACACATGGCAGAGTGTGACCACAGGATAATGCGCTCTGAGTTTCCTGATTATCGAGAACTGTTCAGGGAGTCTGACATCAAGAGCGCGGTTGTAGATTCAATTGGTCAACGCAACAGTTATGTGAAAACATGGGGTTGCGGAGGTTTTTTGAATGAGACGAACATTTACAGCAGAGGAAAAAGCCTCTGTTTTTGAACTATGGAAGAACGGAACAGGCTTCAGTGAAATAGCGAATATCCTGGGTTCAAAACCCGGAACGATCTTCACTATGTTAAGGGATACTGGCGGCATAAAACCCCATGAGCATAAGCGGGCTGTAGCTCACCTGACACTGTCTGAGCGCGAGGAGATACGAGCTGGTTTGTCAGCCAAAATGAGCATTCGTGCGATAGCTACTGCGCTGAATCGCAGTCCTTCGACGATCTCACGTGAAGTTCAGCGTAATCGGGGCAGACGCTATTACAAAGCTGTTGATGCTAATAACCGAGCCAACAGAATGGCGAAAAGGCCAAAACCGTGCTTACTGGATCAAAATTTACCATTGCGAAAGCTTGTTCTGGAAAAGCTGGAGATGAAATGGTCTCCAGAGCAAATATCAGGATGGTTAAGGCGAACAAAACCACGTCAAAAAACGCTGCGAATATCACCTGAGACAATTTATAAAACGCTGTACTTTCGTAGCCGTGAAGCGCTACACCACCTGAATATACAGCATCTGCGACGGTCGCATAGCCTTCGCCATGGCAGGCGTCATACCCGCAAAGGCGAAAGAGGTACGATTAACATAGTGAACGGAACACCAATTCACGAACGTTCCCGAAATATCGATAACAGACGCTCTCTGGGGCATTGGGAGGGCGATTTAGTCTCAGGTACAAAAAACTCTCATATAGCCACACTTGTAGACCGAAAATCACGTTATACGATCATCCTTAGACTCAGGGGCAAAGATTCTGTCTCAGTAAATCAGGCTCTTACCGACAAATTCCTGAGTTTACCGTCAGAACTCAGAAAATCACTGACATGGGACAGAGGAATGGAACTGGCCAGACATCTAGAATTTACTGTCAGCACCGGCGTTAAAGTTTACTTCTGCGATCCTCAGAGTCCTTGGCAGCGGGGAACAAATGAGAACACAAATGGGCTAATTCGGCAGTACTTTCCTAAAAAGACATGTCTTGCCCAATATACTCAACATGAACTAGATCTGGTTGCTGCTCAGCTAAACAACAGACCGAGAAAGACACTGAAGTTCAAAACACCGAAAGAGATAATTGAAAGGGGTGTTGCATTGACAGATTGAATCTACAGTAGCCTTTTTTAATATTTCATTCTCCATTTCAATGCGTTGTAGCTTTTTCCTCAGCTTACGTATTTCGATTTGTTCTGGTGTTATCGGAGAGGCTTTTGGTGTTTTGCCCTGACGCTCATCACGCAGTTGTTTGACCCATCTTGTCATTGTGGAAAGGCCAACATCCATAGCTTTGGCGGCATCTGCCACCGTGTATGTCTGGTCAACAACCAGTTGAGCGGATTCGCGTTTAAACTCTGCGCTAAAATTTCTTTTTTTCATTGGAGCACCTGTGTTGTTCTGAGGTGAGCATATCACCTCTGTTCAGGTGGCCAAATTCAGTAAACCACTTCACCACTTCAGAGCAGACGAGACAAAAAGAATGGTTGACGCATCAGCTCAACCCCATATAGTTGTAACACTTGAGCCAAACCCTTGGGCCGCTTTTTACTTTGATATTAACATTGCTAATACAGGGAACGCACCTGCCTATAATGTTGAGGTTGTGTTTGATCCTCCACTAGTAAATGCGGAGCATAGAGAAAAAAGTGAGATTCCGTTTAGTAAGGTAAGCGTGTTAAAAAATGGGCAATCACTTACCAGCAATCTCTGTAAGTATGAACAAATCAAAGATCAAATTTATAATATTAATATAAGCTGGGCAAGCAAACCTAAATCAAACGATAGAGAAACAAATGAATATGTGTATGACATGGCGACATTTGAAGGAATAAGTTATCTAGGAGCGAGAAGCCCATTGACGCAAATTGCAGAACAAATTAAAGGTATAAGAGAGGATTGGAAACCTATTGCACAAGGAGCTAAAAAAGTAAAAGCAGACGTATATACTTCAAGCGATAGAAACGAAGAACGCACGTATCTGCAAGAGCAACACGATTTGGCAATAAAAAGGAGAGATGAGAAAAGAGAAAAAAGATTAGAGTCTGGTGAATAATTTTAAAGGGAGTGGGTAACTAACCCACTCGTAACTATAAACCTGTAATTAATCACTTATTTTTGACAACAGATAATTACTGAACGCACTGCAAGTGACTAACATAAATTTAGCTTCAGCTAAGGTAGGATTTACATCTTCTTCTGTTAGCGCATGACGTATTCCACCCTGATCACTTGTATAACCATAGAGCTGACTAAAAGCGCCTTTCATTGCAGAGTGTATATATCCTTTTTCCTCTATAGCTTTAAGACAAGCCCCCAAGGTTCCTTTATCATTGCCCGTGATTTTCCTGCATAAAGATTCAATTGCAGAGATAGACTCTTTAATCGAGTTTCTGTAGTCTGGCTGCTCTCTATCCGTCATTAGTTGTAACGCCCTTTCGAAATGGCTACGCGATGAATCAGTGCCATTATCAACTGCGTTCTGAACACTTTCAATTTCGTTATCATTTGAAATAGGAGTAATACAACCATTTATTATGGTATAACCAACGCCATGCTTTTTAAAGATGGAATTGAGATGCTTCGATAGATTAATATATGAATTAGTTCTCTCAATGATGAACTCAATTAAATCATATACCAAATACCATGCTTCCCCATATATATAATCTCGGATAGCAGTCAGCAACGTCTTATCACTTTTGTATCCACTTTCATAACGAGGAATATTATCCGCAGGTTGATTTAGATAATATATCCACACAGACTGCGCACATTTTGTTGCTGTAGCAGTTTGACGATTGTTAGTCCAAAGGAAAAGATATAAGCAATTCCACAATGCCATGCGTGTATCAGAATTAAGATCATTCAGCTGGACATGCTCTCTAACGTCAACATGACCATACCTCACAGAAAATGGCTTTATCAT
Protein sequences of DBSCAN-SWA_11 >CP029122|4307036:4313595|4308149_4308761_+|AWF24351.1|DBSCAN-SWA MPQSGTVFLGDNPINMLSSRQLARRLSLLPQHHLTPEGITVQELVSYGRNPWLSLWGRLSAEDNARVNVAMNQTRINHLAVRRLTELSGGQRQRAFLAMVLAQNTPVVLLDEPTTYLDINHQVDLMRLMGELRTQGKTVVAVLHDLNQASRYCDQLVVMANGHVMAQGTPEEVMTPGLLRTVFSVEAEIHPEPVSGRPMCLMR >CP029122|4307036:4313595|4310627_4311779_+|AWF28530.1|DBSCAN-SWA MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPHEHKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRGRRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISGWLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRRHTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATLVDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELARHLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHELDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD >CP029122|4307036:4313595|4308025_4308166_-|AWF28238.1|DBSCAN-SWA MPDCGIKSREKQFNSVDFPQPLGPIRAVIFPVGSESETSLSTLSVP >CP029122|4307036:4313595|4312524_4312722_+|AWF26196.1|DBSCAN-SWA MTQIAEQIKGIREDWKPIAQGAKKVKADVYTSSDRNEERTYLQEQHDLAIKRRDEKREKRLESGE >CP029122|4307036:4313595|4312770_4313595_-|AWF27347.1|DBSCAN-SWA MIKPFSVRYGHVDVREHVQLNDLNSDTRMALWNCLYLFLWTNNRQTATATKCAQSVWIYYLNQPADNIPRYESGYKSDKTLLTAIRDYIYGEAWYLVYDLIEFIIERTNSYINLSKHLNSIFKKHGVGYTIINGCITPISNDNEIESVQNAVDNGTDSSRSHFERALQLMTDREQPDYRNSIKESISAIESLCRKITGNDKGTLGACLKAIEEKGYIHSAMKGAFSQLYGYTSDQGGIRHALTEEDVNPTLAEAKFMLVTCSAFSNYLLSKISD >CP029122|4307036:4313595|4307036_4307993_+|AWF24449.1|DBSCAN-SWA MKIALVIFITLALAGCALLSLHMGVIPVPWRALLTDWQAGREHYYVLMEYRLPRLLLALFVGAALAVAGVLIQGIVRNPLASPDILGVNHAASLASVGALLLMPSLPVMVLPLLAFAGGMAGLILLKMLAKTHQPMKLALTGVALSACWASLTDYLMLSRPQDVNNALLWLTGSLWGRDWSFVKIAIPLMILFLPLSLSFCRDLDLLALGDARATTLGVSVPHTRFWALLLAVAMTSTGVAACGPISFIGLVVPHMMRSITGGRHRRLLPVSALTGALLLVVADLLARIIHPPLELPVGVLTAIIGAPWFVWLLVRMR >CP029122|4307036:4313595|4309897_4310479_-|AWF24283.1|DBSCAN-SWA MFGVHRSSYRYWKNRPEKPDGRRAVLRSQVLELHGISHGSAGARSIATMATRRGYQMGHWLAGRLMKELGLVSCQQPTHRYKRGGHEHVAIPNHLERQFAVTEPNQVWCGDVTYIWTGKRWAYLAVVLDLFARKPVGWAMSFSPDSRLTMKALEMAWETRGKPVGVMFQAIKAVIIRAGSSGSYCGDTGSGRV >CP029122|4307036:4313595|4309318_4309576_-|AWF24643.1|DBSCAN-SWA MTLVNDTGFDPVFSGSIAESWRQQPCTPSYCCDWEAATMLRAFPLAKKGEGRARLPSLYASFGKLGETPTHKDIIDNNRSINWPV |
8 | uncultured_Caudovirales_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_12 |
4557435 : 4580356
Sequences of DBSCAN-SWA_12
Nucleotide sequences of DBSCAN-SWA_12 >CP029122|4557435:4580356|DBSCAN-SWA AATGGCGTACTATAACATAGAGAAACGACTAAAATCCGATGGCACACCACGCTATCGCTGTAATGTGATTATCAAAGAAAAAGGTGTTATCACTTACAGGGAAAGCAAAACATTCCCTAAACATGCTCATGCCAAAACATGGGGCGCACAGAAAGTGATGGAATTAGATCTATATGGCATTCCATCATCAAATGCTGTTGACGGACTTACAGTCCGTGACTTACTACACAAATATTTAAATGACCCAAATGCCGGAGGTAAAGCAGGCCGTACTAAAAGATATGTGCTGGAACTGCTTATGGATAGTGACATCTCCGCGATCAAACTATCTGAACTGACAGAAAATGACGTAATTGAACATTGCAGGCTAAGAAACAACGCTGGTGCAGGCCCAGCAACAGTCAGCCACGATGTTAGTTATCTTGGCAGTGTTCTGGATGCGGCAAAACCTGTATACGGAATCAATTACACATCAAACCCGGCGAAAAGCGCTCGTCCATATCTACTTAAACTCGGTTTGATTGGTAAATCAAACCGTCGTAATCGTAGACCAGCATCTGATGAACTGAACATGCTCATTGAAGGCCTTCAACAACGATCTACTCATAAATGCTCAAAAATTCCGTTCGTTGATATCCTCAAATTTTCTGTGTGGTCCTGTATGCGAATCGGAGAAGTATGCCGGTTACGATGGGAAGATCTCGACCAGGAACAAAAATCTATACTAGTAAGAGACAGGAAAGATCCACGTAAAAAGGAAGGCAACCATATGAAAGTTGCCTTGCTTGGGGAAGCCTGGGATATCGTCCAGCGACAACCCAAAAAATCAGAATTCATTTTTCCATATAACAGCACTTCTGTTACCGCAGGATTTCAGAGGGTAAGAAGCAAATTAGGTATTAAAGATCTGAGATACCATGATTTGCGTAGAGAAGGGGCAAGTCGCTTATTTGAGGCTGGTTTTAGTATTGAGGAAGTCGCTCAGGTTACAGGGCATCGTTCATTAAACGTGCTATGGCAGGTATATACCGAACTGTATCCGAAATCTTTACATAATCGTTTTGAAGAACTCCAAAGGAGCAGAAATAAGACCTCTTGACACTGTTTATCCATACAGTTAAAAATAATACTGTATACAAATACAGTGTAGGGGACTTTTATGCGTATTGAAATCTGCATAGCCAAAGAAAAAATGACTAAAATGCCAACCGGTGCTGTGGATGCGTTAAAGGAAGAATTAACCCGACGCATCAGTAAACGTTATGACGATGTAGAAGTGATCGTAAAAGCCACCAGCAACGATGGCCTTTCTGTTACGCGCACCGCTGATAAAGATTCAGCTAAAACTTTTGTTCAGGAGACTCTGAAAGATACCTGGGAGTCTGCTGACGAGTGGTTTGTTCATTAAGTTCATCATCTCTGGACACCATTCACTTGTATTGACTTTAAGATCATTAAATTGTAATAATTCAAAGGGATGGGGCAGGTTTTTCCTCTTGCGCATGCAAGGGCGGCTCTGATAATGAACAGTAATATTTTACTGTTGAGCTTATGCTCACTGATCCTGCCGGGCAGGATGTTCAACGGTTCCAATATATCCTGCCCCTTCCGCTTTTAAAAAGGATCTTTTATGCATGACAATATTTGGTTTACATATAAAGCACGTATCCAAGCGCACCATCGACTAGAATGGCTTGAAAAACACTCTCAATTTATCCTCGTTTGGTATGCTATATTGAGTGCGGTACTTTCAATTGTAACGTTGCGATTTCCAAAGGTTCTAGGAGATAATACAGATGTCGTTGCGGCGATACTTTCAGTGGCTCTACTGGGTATTTCTCTGATCGTATCTAACCTAGATTTTCGTGGTCGAGCAATAGCCATGAGAAGGAATTATATTGCACTACAGCGACTCTATTTTGACATTACCACCAGTCAACAGTTATCTCTTGAACAGAAAGAAAAATATTTTAATTTGCTCAATGAGGTTGAGAATCACCGTGACATAGATGATAAAATTTCAAGGGTAACTCAAGTTGGACTTAAGACGAGGATCCCCACACAAAAAGAAAAAATAATTGTTATTTTATGGATATTACTTCGAATATTTATTACTGCCGCACTTTATATACTCCCATTAATATATCTTTGGATTGACTATGACTGCAAGCAGAATTTTTAAAAAGTCATTCTCGAAAAAAAATCTTCTAAAAGTATACTCTGAAAAAATCAAAGAATCAGGAGCGATTGGCATAGATCGGATTCGCCCATCAAAACTTGATTTGACAATAAAAAATGAGATCACTTTCATTTTTGAAAAGGTTAATTCTGGCAATTACAAATTTACAGCATATAAAGAAAAATTAATATCTAAAGGCGCTAACTCTACACCCAGACAGATTTCCATACCAACTGCTAGGGACAGAATTACTCTTAGAGCTCTCTGTGAATGCCTTACGGAAATATATCCTAAGTCCAGATTAAAACTACCACATACAGTAATTGACTCATTGAAAGAAGCATTAAACAACAGTCTATATGCTGAATATGCAAAAATAGATCTTAAAAGTTTCTATCCTTCAATTGAACATAAATTGATAATTAATGCAATAAAAAATAAAATTAGAAAAAAAGAAATTAGACAGTTAATAACATCATCATTAATCGTGCCTACTGTAAGTGGAACCACAGGAAGCAAAGGTATCCCTAATAATACCAGAGGAGTACCTCAGGGATTAGCGATATCAAACATTTTAGCTGAAATATCACTATCTAATTTCGATGATGAAATCAATAAAATGCATGACATATGGTACATGCGATACGTTGATGACATTCTTATTTTAACACCAAAATATCAAGCAACAAAAATAGCTTCTCATATCATTGATAAGCTTCAATCATTAAATTTAAACCCACATCCATTAAATGAAGAGAACTCAAAATCCAAAGTAGGCAGTTTGGATGAAAGTTTTAACTTTTTGGGATACCACATAGAAAATCGAGAATTATTGATAAAACATGAGAGCATTCTTAGATTTGAGTCATCCTTAGCAAAAATTTTTACTGCATATAGGCACGCTCTACTACAAGCTAAAAGTAAGCGTGATAAAGAACGAGCTGTTGCATATTGTCAGTGGAAACTAAATCTCAGAATTACGGGATGTGTGTTTGAAGGTAAACGATTGGGATGGGTATCGTACTTCTCACAAATAACCTCAACAGCTCAACTTCGCTCTGTTAATCATACTATCAATAATCTTATCCGCCGATTCGGCCTTTCATCAGAAATAAAACCAAAATCTTTGATTAAAACTTTCTATGAACTCCGCAGAGGTAGAGCGGAGACTTTTAAATACATACCTAACTTTGACAATCTACATATATCTCAGAAACGAGAACTTGTTTCTATGTGGATAGGTAAAGAGAAGGAAAAAAAACTTAGCAATAGTGAAATAGAGAGGAAGTTTAAATTTAAAATTGCGAAATCAGTAAAAGAGCTTGAAGAGGATATTTCAGGAATATCATAGATATGTAATCCATTAAGTCATTAAAATATATCGCAATAAACACACTATTTAAACTCACAAACCAGCCGCAGTATCCTGCCATGGCAAGTTGCTGCGGCTTTTTATGTTCAACGGATCAACAGCCAGATCAGAAGACACGCTACCATCGGCACAGCAAAATCCATCAGGCTTGCCACATCCCATGCGCGTGGATCAAAACCGCCCCACCACGGCATGTTAATCCGCTTGCCATGCCCGAACATTTCAATCCAGCGATATTATGCTTGGGTGTGTTCGCCCGCAATGAAGAACGTACAACTGGCTATCGCTCCGTAAACCCAGTTTCCGGTAAAAAGCCCCACCAGTATCTGCGCAGCCACAGCACAAAGTGCATGAAGGAAAGGTGTTATACCCATTTTCATCCTACCCAATAAAACGGGGCGCACAGCCCCTTAATATTATTTAGAGGCAAGCGCCGCCTCAATTGCAGATAATCTTTGTTTTAATTCTGCGTTTTCTTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAAGCCCGTCACGGCGGCGTAGTCAACATTCAGATAACGTGTTTCTTCACGTAATTCATTGCCGTCAACCGTTGGTCCCTGCAACTCTTTACCGTAATGAGTGAATGAGCCTACCGCTTCCGGTATTGCCTCCATTGCTTCCTGTGCAATAACACCAGCGTAAGGCAGTCCGTTTTCCTTAAGTGTGTAGGTGTACCCGTTCATTTTACGGATAGCTTCGGTTGCGTCACTGATAATCTGAATATTGTCTTTCAGCTCGCGGTCTGATAACTGATTCAGTGTTGTACAGTTAATAGCACCGTTTACATCAAACAGCTGACCTGACGATGTTTTTTGAGCATAAAACAGATACGCGGCAGACGTTCCAACCTCAAAAACGTTTTGTCGATCACCTGAACCCCACACCTTGACAGCAAATGGTAGTTCTGTATTACCTAAGTTCTGTAAAACAAAACGATTGCCAGTCCCTGTTTGTTTTGTAAGAGTTAAATCAACAGTTGAGTTAACCTCATCCTTGTTGATAGTGAGCGCCTGCGCTTTAGCACCGTTAACAGCACCTGTTTTGAGTTGAACCGCGCCGTCATTACCATTTAGCAGTATCTCAGCTCCGCTAAAGAAATTTTTTAGCGACAGCATCTTACTTACGCCGACTGATGAACCCAACGCCCACGCGAGAGAATTACCGGTGCTATCAAACCCACGTACAAAGCAATCCATTTTGCTATAGTCTGACGTGCTTCCAAGGACATCAATCCGCCCTCCGCCAGATTTTATCGGGTTAGATGTGGTTAATGACCTGACAGCAAGAGCGGTAGATGAATTGAGATCGTCTACTGTTAGTAATTTCTTCCATTCCTGCGTCGTTCCATTTTCAATTGTTCTTCCCCAAAAACCGGAATTGCGGCCTCCGAACTGCACAGCATAATTTTTACTAAATTGAACATGAATGCCGCCAAGAACCATAGAGCCTGCCGGGCCGTTTGTACTGCCTGCAATTGGTCTAAATTTGTCGACGTTATCAGTATGCTGAGCGTTCCAGTCTCGTCCTGTTTTTGTAAGCAGTCCTAATGTCGTGTCGAGCGCTTCGGTTAAAGTAAGGTTTTTAAATTTAACTGCGCTACTTTCGCCAAGACTTAAGTTGTTTCGCGCATCTTCTACTGTTGTCGCACCTGTACCTCCTTGCCCAACAGCTAATGGCAACCAGCCAGTGCCATTATGACAACCCCACAAACCAGATTTGGAAACCTGTAAGCGTGGTGCGCCTGAAGAATACGTAGAATATACGTAAGTTGTTTCTTTTCCCTCTTCAACCCTTTCTGTTGATACATCTCTCTTCCATTCAGACCAGGTACCGCTACCTCCTTCATATATTCGTCTATAAGTAATTCCCGGATTATTATAAGGGTGGTACACCTGAACGCAGGAAGAAGCAGTGTTGGCTCTCGTTTGGAGCACTATTAAAGCGCCCGCCAGACCGATTGGGTAATTTAATTCTTCTGTTGCGTAAGCGCTCGTTGGTTGTTGATAGAAACCAGAATATTCACCGGTAAGGGTGTTTAGGTCAGTATTAGCAAGGCCAGCTTTTTGCTCATACATTACCCGCAAGTTCGCACGTGCTTGCTCGATAGTTCTTCCACCAGTACCACCCTGGGCTAAACCTAACGGAGACCATTGTCCAGAAGAAGAATCATAAACCCCCCAACTCCCGTTAGTATGGATTTCAAAGAAACGATCACCAGTTGGACCATACATACGAGTTGTTGATTCATTTTGACCAAAGCGGCTTAATTCATCTCTGCGGGTATGACGTATCCATTGCGGGCCAGAACTTTGGCTGTAAATGTACGAATACAATGATTTTGTATCTGAACCAACAAATAGACCGGAATATCTTTTTGTTGACACTTCTGATCTAACTATATAACCAGAAATAAGATTATCCCCGGCACTAGACGCTACTGATGGATAACCTTTCGCTGAGATGTAAATCCTGATAAATCCAATATAACCTGATGGCTCAACAGAGATATCAGTGCAGGTCATAGCAACTTCACCAAGACCAATATTGTTACCAACAACGGATTGTGCCTGGTTGCGATAATTAAGTGCGTCCGCTGCAGATTTCGCCGCGTTTGTTTCACTGGATTTAGCATTAGTTTCGCTGCTCTTTGCTGCTGCCTCGCTATTTTTCGCGTTGGTTTCTGATTTTTTGGCTGCTGTCGCGGAGTTTGCCGATGCAGTTTGTGAGGCCGCTGCCGCCTGTGCGCTGTTATCCGCATTCGTCTCAGACGTTTTTGCGGCCTTCGCGGAATTTCCTGCCGCCGTTGCCGAAGAGGCTGCACTACTGGCGCTCGAGGCTGCGCTCGTTTCTGATGATTTCGCTGCCTCTTTTGAGGCCGCCGCATCCCGGGCTGAGGTGGCAGCTTCTGACGCTTTCGTGGTCGCGGTGGATGCAGAAGTGGCTGCAGATTTTTGTGACGCTGCAGCATTCGTTTCTGACGTTTTCGCCGCACCGGCACTGGTAGCCGCCGCGCTTTTTGAGGACTCTGCAGCGGCAGCACTTTTTGATGCTTCAGTAGCCTTTGTTGATGCCGTTCCTGCGCTGGAAGACGCTGACTGAGCCGACGTCGCGGCCTGCCCGGCTGATGTGCTGGCTGCACGTGCTGAGCCTGCAGCATCAGTCGCATGGGTTCCCAGAGCACCACGCTGGCCGACTGCTCCGCACGGGTGCATTCATTCAGTGTTTCCTGCCGGATACCCAGTCTGATTCTGTCGCCGCTCCACCACGAATCTGGATTTTCCAGGCCTCAGCCTCCTTAACAGGGTCAATCCACGGCATCACTGGTCCGGAATACACCGCGGTATACAGTGAAGAACGGTCAAGATCGCGGGGTAACCTGATAACACCGGATGCCACAGCCTGTTTCAGCCATGCACGATACATCGGGCGGGTGACGGCACCAATAAACCAGTCCTGCAGGATCAGGTAGCCATCAGTAGACTCAACCAGCTCCTGACGCTGGGCGCTGTAAGTGCCGTTATAGTTGCGCGCTGTACTGGAAAAACTCAGACGACTACCAGCCGCCACTGCACGCAACTGACCATTACGAAAAGTTTCAAGATTAGGATTGGGGCGATCCGACTTCACCATTCCGATTTCTTCGCCGGGTTTCAGATCGTCGTAAATAATGCCTGGCTGAATGGTAAGCTCGCGTTCATTCTCCTTGCTGCCATTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACTGCTTTTCGTAATGATGGAGGCATTATTCACCTCTTGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAGGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGCGTGACTTTATCGAGCAGCTGTAAAAACCAGTAACCGGCACTACCTGCTGAGGTGCCATAGGCGACACCCGTTGTTAACTTATCCATGGATTTCATAACCCCACCTCGCAGATGCGGGTGCTGTGTAATGGAAATAAAAAGGCCACCTGACGTGGCCACCAGATTATTTCCCCACCAGCTCGTTTATCTCTTTCACTGTCTGGTTAAACCGCTCTGACTCAAGCTCAACACCTAAGGCCCGACGCCCCAGCGCCATTGCTGCTTTTATTGTGGAACCGGATCCCATAAAAAAATCAGCAACCAGATCACCAGGTCGACTACTGGCATTGATTATTTGCCTGAGCATATCCGCCGGTTTCTCACACGGATGTTTACCCGGGTAGAACTGAACGGGTTTATGCATCCAGACATCGGTATAAGGCACGGAGACTGATACGGAGAAATAGCGCCGGAGAGATTTAAACTCATCCAGCAATTCAGAATATTTGCGATTCAGTGAATCATAAGATGCCACCAGCTGGTGGTGTGGTTGTTCCAGTTGTTGTTCCTGAAACTTCTCTGCCGCTATACGGGAAAACAGTGCCTGTAACTTCCGATAGTCAGCCTCATTCGGCAACTGCCACTGACTGGCACCAAACCAGTGGGAAACCATATTTTTCTTACCTGTGGCTTCGGCAATTTGTTTTGCCGTTATACCCAGTTCGGCACGAGCATCCCTGAAATACGATATCAGCGGTGCCATTATGTGCTGTTTGAGTTCCCTTTCTTTTGCCGCATAGCCGTCACTTTTGCCGCGATATGGCCCCTGGTAATGTTCAGCAAACAGAACGCGCTCTGTGGCAGGAAAATATGCGCGCAGACTTTCTTTATTACACCCATTCCAACGTCCGGACGGCTTCGCCCAGATGATATGGTTAAGCACGTTGAAACGTTCACGCATCATGATCTCAATATCAGATGCCAGGCGATGCCCACAGAACAGGTAAAGGCTTCCGGCAGGTTTTAACACCCGCCAGAACTGGGCCAGACAGTGGTCCAGCCACTTAAGGTAATCTTCGTCCCCTTTCCACTGATTGTCCCAGCCGTTGGGTTTCACCTTGAAGTACGGCGGATCGGTAACAATCAGGTCAATGGAATCATCAGGCAGGGACTGAATAAAATGCAGGCAATCAGCGTTGATTAAATCAACACTGTTTATTTTTACAGTATTTTTCATGGATCAGTAAGCGTAACTCTGGTAGGCTCACTCTGCTTTTGCGCTAAAGCAGTGGGCCGTGGTTCGCTTGTGACCAGTAAGCATGAGCGAATGGCTGGCAGGTGCTACCAACACCCACCAGCCGCCCATTTTCACAAATTAAAAGTCCTTCATTGCTGAAGGCGTCTGTAACAGCCGAACTGGTAATCTGCCAGCCCCGCCATAACCAACTGGGTCAGTATTAACTGACAGCGTTCGCGTGAAAGATATGTGTTTTGTGCAATCTCCCCGACTGTTGCCGGTTCGATGCTTAATTCATTAAAAACAACTTTCGCCGTTTCTGTCATATCTTGCTGTTTTAGCATGTCTTTTTTCCTTCTGGTTAACATGACATACCAATAACTCTTGTCTAAAAAGCCAGCAAGATAAAAAGTCAGTATTCACGACCACCAGCGTGTTTACCGTACTGCACCAGGTTTACAGGTACAAAAAAACCCGCTCTACGGCGGGTTTAAGCTGTGTGGCGAAGTAACCACTCTTAACAGATTATGATAGTTTTTGCGTACGCGTTAGTAATTTATTATGCTCTTTACAATTGTTAGCTTAACCGCATGGAATTGACCAAAAAAATGAGCAAAGCAAGCATACGCCACGTAATCGTTCATGAGCTCTTAAAAGAATCTAATAAAGACTTCGATCACTCCAAACCATACAATCTTCGTGATACAGAACTAGATAAAACAAATGATATAGTAAAAAAATTAGTAGACGGTGTTATTGATTTGTATGGTTCAAAAGGGAACTCAGCGCATTATGGTGTTTTTATTAAAAATAAAACAAAGCAAGGCCCTATACCAGAACTATTTCATAAATACTCTTTAGTTCAACAATCTGTTTCAAGTGATTTCATTGAATTATCGAAGGAAGTTATGAAACAAATGTATAAATCTGCTCAAGAGCAGATTTGGGCTTCTGGAGGATATGTTGTTTTTACTGATTATATTTTATCTGGTTTCCGTTATCTATTGGTTACAATGATCAAAAAAACTAATGGCGTAACTATTAGTGAAAATTTAGAGCCAGAGGAAATGATTCACTTAGAACTTGGTAATATTAACCAAGCAGCAAAAATAAATTTCAGATATTATGAAGAATACCAAAAAGCAGATGACTTAAAAAAAACAGACTTAAGTTATCTAAGCTTTATAAGCAAAACTACGGGACAGTCAGCGGCAGCATATTTTATAGCAGCATTAGGATGTGACAAGGGGATTGCTTCAGCAGGTGCAACCCGTAAGTTACCAGATGAAATAAGGCGTTTCTTTAAGAAAGAACCTCTTTTAAAAAATCAAGCAGAGTCATTTAGAAATGATGTTATCAAATACTTAGAAAAGCAATTTGACAACGAGCACTCTGCAAGGCTTTCTGATATCGAATCGCTTGCTTCAGGCCATATGTCCTATTTAAAAGAGGAAGAAAAAACAGAACTTGTTGATAAATTAATGAAACACCTCAATAGTGAGGAAGTCAGAATCCCATCAGAGTTCGTAATCAATAAAAACTCCTTAGATAAAATCAGCAATGTGATATATAAAACCCCATCATTGAGCTTTCACTTCGACAAGGATTTACTCGGTGTCACAACTGATGCTAAAATATATTATGATGACGAAAACCAAAGCCTAACATTTAATAATTTGCCTGTTGAAGCATTAACTAAGATAAGAAGAGCGTTGAAAGAAACTGATAACCCAAGTAATGAAGAAGATAAAGAATGAATGATTTTAGTATAATAGTTAATCTGTATAGATTATCAAGCTATCCTCATTTTGACGGGGCTAAGTTTTCTGCGCGTATAGCTTATAATGCAGACGTAAAATCATTGTTCAAAAGAATTTTGAACCCTACTTTTCAAGCTGGTACAGCTGACGAAATAGAGGTGGATGGTCATTTAATTTATGATTATGAAGACTTTCCTGAAAAGGGAAATTTTCTTACATACTCGTTTAAAATTTCACAAGGAAGTGCGAATCGTTTTTATAAAAATAAAAACGAGTTTGTAAAAATAAACACGCTCAAGAAAGGCATAATGCCAGAGTATTTCTATATTATAGAGGATGATTTCTATTCATTAGAAACACCAAAACCTTCTTATATCCAAAAAATTGAGGACATTTGTGAGCTAATCAATGCTCTTTCCATGCTTGCTCATTTCCATGATATAAAAAAAGATAGCAAAGGTACATTTTATCGTTTAGTCTTTATTTTAAACTCAGAGTCTAAATCTTCTTCTGCTGTAATTGAAACAAATATTACAGAAGAAATTTTTAATGATAAAACAGTAAATACTCAGTTAGTTAAAACATTAGTAAGTAGTGAAGCTACTACTGATGCCCATCACATTGAAAAGATTAACACTTTCAGAAACACAGTTATTGAGTATGTTAATAAAAATGGAAATTCCTTTGTCGAGTTAATTAACAAGTGGGATTTCATATGCGAACTTTATACGAACAACTTAGCTGCTTATATGTCGGCATTTTCTTTTCATAAAGCAAGGAAAGAAGTTGTTGATGCTGAACTCGATTACTCAGAAAAACTGTCAAAAATAATTTCAGAAATTTCTAACAAAGCTCTTGCAATACCTATTTCACTAGCTGGTTCAATTGCTATTTTCAAATTAACAACAAAAGCTGATTGGATTATTGCTTTAATTGGATTGATTATCACAGCAATAATAACATCTGCAATGATTGTGTCACAAAAAAAACAACTTGCTCGTATTTCACACTCTAAAGAAATACTTTTTGGACAATTAAGATATAGAATAAAGGATGACACCAGCGATCTTAAAGAGAGCTTAGAAGAGGCTATTAAAAAATTAAATGACAATGAGGATTTTTGTCATAAGGTGCTTGACAGTTTATTATCACTAGCATGGATGCCTACATTCATAGGCATCATCGGTATTTTATTTAAATTAATGCCAAATATTACTTGAGCATGTACATAACCCCATTAATGAAACCTAACGCAGTCTGCAATTGCTTCCGGATTGTTCCATCTGAGCATCTGCGTCTCTTCGCAATAGAACGCAGTGAAATACCTATAACGAAGTGAGCAATGATCAGCTCATATTCGTCTGGTTTATACTTACACAGGCGGGCAACACAGCCGTCTATCATGATGCCTTCATCATCATCACACTGAAGGCGTATTTTTTTACCATGAGGTAAAAGCCCCTTAAAGCCTGCTGCTATCGGCTGCCAGTCCACACCACTGTTATCTGCTGCAGCCCACGCTCCCCAGCGGTCTAAAACTTCATACATATCACGCATCAACTTTCTCCACAAAATCAGGCCAGCACGCCAATTGCCAGCGCACGATCGATAAAACGAAATATCAGCTCCAGCTGGGAGCCATACTTCTCTTCAAATGCCACGGTATCCGCATGCAGCTCGTCGTGATGCTTTCTGCACAAAGGCAACACAAAAAGGTCATGCGCTTTTGTTCCCATTCCACCCTGACCGTGACCTATCAGGTGGTGGGGATCATCAGCGGGCTTTCCACAACATGCACACGGCTGTGTCTTAACCCAGCGCGTGTACTTTTCATTAACCCAGCGGCGACGTTTGGGGCGTAACATAAAAGACTCCGGCGACTCCGGATCCACTTTCAGCGCCAGCACCTTTTTCGCCTTATCCTGGATGATGCTGGTGGCAGGAACCGAAGGCACAAGGTCACTTTCCCGGGTAACAGACGGCACAACAGGCTTCGGTAATCTCAGTGCCTTACGGGCTGCACTTTCCGGTAAGGCATCCGCCAGATCATTACGAATCAGCCACCAGCACAGTTCCGGCATTGTCACAACGTGACTGTCATCAAAACCGAGATCCCGACGCACAACAGACAACACCCAGCGGGCACAGTTATCCGTTGCCATTGATTCCAGCCGTTCCGTGAACTGATCGCGCAGCTGGTTATCGCAGTGCCAGCACAGACGGATTGCACCCGGAGCGTGTCGCATTGTGGTCATGTTCTCGCTGTGCCAGTCGGAATGAGGCCACTGGCAGCCTTTTTCACGAAGTAACCAGCTTTCAAGACATTCCACGCCACCAGCACGACGGATCACTGCCTCATTGCGGAACACGGCCCGAACGGCAGGATCATCCGCCAGCGGTTGTGATGCCGCCGGAACGGCACCACTGGCGAAAGATGAATAACGCTCCGGCTCAGGCTCCAGCAGGACACGCCCCTGCATAAACAGGGGCATCAACTCTGAACCTGGCCTGAACAATACGATCCCCATACGCGGGGCAATTTCAGGGGTCAGTAGTGCTCTCACGGTCACCTCAATGAACGGTATCGAGCAGCTTTAACAGCTCAGGGAATCGGGATTCGAAGAAATGCGGCTGCGTCTCGCGCGGATTTGCGGGACTGGTGATGTTCTTGCCGAACATGCAACCTTTCGCTGTCAGCGACCAGAATTTTTTGATGTTGTTAATCGCGGTACGACTGTATCGTTCGCGCTGCTCGACGATCCCCAGCTTCACCATCTGGTGATATGCCTGATTAGCTGTCAGGCGGATACCATACTGCTTCAGCAGTGCACTCAGTGACAGCGTGGGGCGGCTTGAGCCATCAGGCGCGTCAGCAGGAGCATCAATGGCATAGCGCGGTGCCAGATTCGGTAAGCCAACAGCCTCCTGGAGTTTCTGACAGGCACCAAGCACTGAAGAGTTAGACAGGTTTAATTCCCGGCGCATAAAGTCCAGCAGAATCACACCAGCCTGCATCTTGTCAGCAGCCTGTCCGGATAATTTTTCCGGTGCGCTGGTTACCATGTCGAAAGTACGGATCACCTTAAGATGGAATGACGGGCTTATCCACATTGCATAGGCATACACCAGTTCTTTGCAGACATACGTCCCCTGGTTATTTCCGCCACGAATAACGTTAACTGGCTCTATATTGACCGAGTTGCAAATCTGCAACTCGCTTATTAAACGTTCAGTTTGCTCATTGCGGAGCCAGAATGCAGGCTTATGCTTATCCAGAGAACCGGCAGCCCTGTGCAGATCGTTCAGGCTGTAACGCCCATAAGCATCACGACGAACTTCAATACCATCAATAACCATCAGATTATTCATACTTCGTTTCTCCTCTTAATCAGGCGGCTGCACCCGCCGGTTTCTCATACTTACTGATAGTGATCTCGACCTTCCCTTTCGGGATAACCGGTCCCCACTCCACCAGCATTCTTTTCACCTGTCTGTCGTCTTCCCACACACCCGCGTGGGTCAACGCGTCAAACAGCGCCTTGTTATAGTTGTCCAGATCGCGGATCCGGTTATCCGGAGGAAACAACACGATCTCCACTGAAGCAGGTGCCGACGTTGGTTTCGGCAGACGACGTAACTGCTCAACTATTGCTGCGCACGCCGCGCTCTGAAATTTTCGCCCCGCCTCGCTTATCAGGCTCTTACCAGCAAATGCCCCTTTGTTGGGGTGTCGCCAGTACGTGTTCACACTGGGCGGGAAAGGAAGGATCAACTTCATACTTTCAGGCCCCTCTCATGTAACCAGTGGGCTGCACGCAACCTGGCGTTCTCCTCACCGGCAAGCAGTGCGCGGATGATACCGACCGCCTCGCTGTCGTCGTCCTTCACTGCGGTATGAAGCGTGATCCCCCGGGCCACGCCACGCTTTATCGTGATGACGCCTTTTTTCTCCAGTGCGCGAAGATGCTCCACGGCTGCATTCACTGAACGGTATCCCAGCATGGTTGCCACCTCCTGATTGGTTGGCAGAAAGCCACGCTCTTGCTGGTAAGAAATCAGAATATCCAGCACCTGCTGCTGGCATTGAGTTAACGTCGTCATGCCGCCATCTCCCTAACCAGTTTTTCCGCCTGCTGGCGAACCTGCGCCAGAAACGCCTCACCACATGCCTCAAGTTCATCGCGCCCGATGTAGCTGATTGCCGGTCCCTTCCAGGTCTTATCGAAAACAGCAATAGCACCAGCGAAGAAAGCGCCTGTCGGCACCTGCTTCTCATCCTTCGGGATAAACCAGGCAGGCAGTTCAAAACCAATACGCCCGCGAATAAAAGCAATATGGTCCGCATCTTCCGGCCACCACACTTCGCTGGTGGCAGCTTTGATCAGGAAAACATAGCGCCCACCCTTATCACGCATGGCACTGGCATGTTTCATGATGTAACGCATGCCGGTGATGTATTGCCCCTCATGCTGACTGGCGCGGCTGTATGGGGGATTACCAAAGGCAGCCCCTTTAAGCTCCGCAAGACGTTCTGACCAGTCATGCGCCAGCGCGTTGTCTTCCGCCGTGTAATACGCAGCACATTTGGCGTTATCACCGTCAGTGAACAGATCCAGAACAAACGGGCCAAACAGGGTGTTAATTCCCCAGAAAATGTTGTCCGGCGTGCGCCACTGATCGCCCACTTCCTTCAGTTCATGGGCTGGTTTGTTCCGCAGTTCCACCAGCGCCTGGCAATATTTATTACTCATTAAGCCCCCACGTAATTCCCTGACAGATACCACTCATCACCCGGTACAGCGCGCTTGCTGCTTTTCCGTAAACACCGCTCACGACGCGCAAGAAAATTGTTTCGCTCTGGCTGGGAGTGGCTTTCACGGAATGCCGCCATCCACACCGTTGCAGCACGACGGTATAAGCCCCTCGACTCCAGTTCTTCCGCCTGGCGGGTCAGGCACAAAATCACCCGGGGATCGTTAGTGCCGACATAGAAATTGCGCACAGGTCTGGTTTCACGAACTGGTTGCGGTTCCGCCTCCTGCGCTCTCTCAGTCAGGCGCGGGAAATGTCTGCGTGTATCCCCTTCACAACGGTGAGCCACACGACCACTCTGACGTAACTTGCTTGCTGACTGCAGAACGCGCTGCCGTGAGTAACCTGCAAAAGCATCCGCAATGTCTCCGGAAGTACACCCCGGATGGGCTTCAATGTATTTCTGAACTTCATTCAAAAGACTCATGATCACCCCCTGAATCCTGCCGGGATCTGGCTGTAGTCCACGTTGTCGTAACTGGCTTTGAAGTACGGGTCCTCGCGTCTGGCTGCAGATACCGCAGGAACTTCCCAGGATTCTTCGAAATGACGATCCGGACCAAAGAACGTGACAGCCTGTTTCACAAATTGTGTGCCGCTGTTACCCATCGCAGATACCCAGCCCGCGTAGCGTTTCACACCTTCCAGCATGGTTTCGGGGTTTACCCCCTCATTCAAACGGGCTTTCCAGGCTTTGAAGGCTGCAGATTTTGAATTGCCACCAGCACGTTTGGGATATACCAGCCATGCCTGCTCAAACTCCGGAGAGTATTCCGGTCGGTTTGAACGAACTCGCACGGACTCATCAACTGATGCACCAACAGCTATTGGTTCATTGACTGGTTCTTTGACTGGTTCAAAAGAGTGACTGGTTCTGGGTGAATCTCCTGCACTACCCCCTGGTGCAACTCCTGCACTACCTGGTGAATTTGCTGCACCAGATAGTGAATTATTTGCACTACCCCCTAGTGAATCTCCTGCACCATCCAGATGAAGGAGATAGATATTACTTGAGTTACCTTTTTCACCTTTCCGGGTGACTTTTTTTACCAGCCCGGACTCACAAAGGGCCGCAATATGATTCATCACAGAACGTTTGCTAATCTCGCACTGGTCAGCAATATGCTGGTAGCTGGGCCAGCACTCACCCTGATCGCTGGCATTATCAGCCAGCTTGATCAGAACCAGTTTTCGCAATGGATTACCCACTCGAATTTTCATCGCTTTAACCATCAGCTCCATACTCATGCTGCACCTCCGAGATGCTTCATGTTTTTTCCGGAGCGAAAGGCTATAAGCGGCATACTGACGCGGTAATTACGGCCCAGCGGTTCACAAATCACCTTCTGGCATTCACGGTCAACCAGGCTAACACGTAGAACATGCCCTGCAGGTGTGGTGTACCACTGCCCAACTGTAGGAATTGATGTTTTTTTACGCTGAAGCAAACGGCAAATATTGAGGATCAACGGATTAAGCATGACGATGCCCTCCGCTGATATTCAGGAGACGGTGAATATGAAAATTAGCCTTATCCGCCAGACGAATACGTTCAGCCTGCAAGTTAAGAAGGGTTTCTACCAGAACTTGATGCGCCTGCGGATCCGAAAGAGTTACCTTGCGCAGAGCACGTAGTGCAGTTGTTACATAACTGAGTTTATGTAAGTCTTCATCATTCAGACGAGTGAGGGCTGGGACAGTAGCCATGATGGCAGCCTCCGATAACAGTGAATTACCTTCACCACCGGAAACGCCAATTTCGCTGGTGGTGAACTGAACGGGGTTGGCGTAACCGGCGTTATCGGAAACCGGCGCACCTTTCGGTGCCCCCGTCCAGCCCACCATAATTTGGGTGTGCACAGACGCAGACGATAAAAAAGACGCTGGCGCGTCATATATCGCCGATAACATTTCCAGGACGCCAATCCCGGCACCCGCTTTATAAGGTGCCTGAACAGTGTAACGTCCCGGAATGGCAGAATCAATGTGCTGGTGGTCCTTCACACTCAACAAAATCACGCCTGAATTTCCACAAAGGACTAAAGCACTCATGCGGGTAGTCTTTGCGAAGATAGATAACGCGCTGTGTTTCTGGCTCCCAACGAATAACATGGACATAAAGCCCTCTTCCGTCACGAAACCAGCGGTTAAGTTCCTGCACAACTCGCCCCCCACAGTCAGGTAAAGTTCTCTGTGGTTACTTACAGCCAGGTGATTTGGTAATCTGCATTCATGCCGTAACAACAGGTGTTCAGCGACACTGACCACCAGCTGTTGCGACAAACGGTTATTTGCCGTTAAACTGTTCATGCGTTAGTTTCTCCACAGACACAAAACGCCACGACGCCCGGAGCTGCACACTCGCGGGCGTCACTCTTTTCTGGAGCGCAAAAGATTTTGTAGACCAGTGCTGCATGCTCCTGGAGCTTCGAAATTGACAGATACAACTCATCATTAATTGCTGTCTGCTCGTGTGGCTCCACTACCCCATCTTCGATTGCCGAACGAATCTGCTTTGAGTAACTCCCGATCTGTTCGATGACTTCCAGCAGGCGCTGGTTTATATCGGCGTTCTCTACTTCCTCAATTTCAGGAAGCGATACAAACACCCCACCAGCAGACTGTGCGACAGCATCCGCAATGTAGTGAGTGCCAGCCGCGCGCTGTAAAATCATTGCCCATCCCAGCGGGAAAATCTGATCGCCATCTGCACGAAGGCGGTTGAATAAAGCGTTCTCTGTTACATCCAGCCACTCAGCAGCTTCAGCGTAACCCCCCGGCAACGCCGCGATAGTTTTTCTGACAGCTTTCACGTACCACTCAGGCTGTTTTTCTACTTTCCAGTGATGCTTACCCACGGTTAGCCTCATCGTTCTGTGGTTAAAAATTGAAGGTGTTCTGTTAATCTTTCGGATAGATATCCGGTCTTAAGTCAGATTTCGTAATTGCACCTGACGTGCATTGCTCAAGTTTTTTAGCCAGCACAAAACTGGCTTTTTTATAACCATTGAAAACCAGCCGTAAGTAGCCTGGTGTTGAGCCAACTTTTCCGGCCAACTCACCCTGCTGTTCTTTGGTTAAAGAGTCCCAATACGCTTTCATACAATATGTACCTCCGGTATACATATTACATGATTGAGATGAACCTTCAAGATACTTGTACCTTATCGGTACAAAGGTTTTAATTTCTTTATGAAAACAGTCCATGACATCCGGCGGTCTAACGCCAGAAAACTGAGAGATGGTGTTGGCGGGAATTCTTCCTTTGCCACCATGATTGATCGCGAGCCAACCCAGACCAGCAGGTTTATGGGAGATGGTGCAACTAAAAATATCGGTGACAGCATGGCACGGCACATCGAAAAATGTTTCGACCTGCCTGTCGGATGGCTTGATCAAGAACACCAGACAACGAACATCACAAAAAAACCTGACGTTTCAATCACTAACAAACAAATAACGTTAGTCCCTGTCATATCATGGGTACAGGCCGGAGCATGGAAAGAAGTTGGCTATTCTGAGGTTGATTTGAGCACAGCAGAAACTTATCCCTGCCCTGTACCCTGTGGCGAAATGACTTATATCTTGCGGGTGATTGGTGATTCAATGATTGATGAGTACCGCCCGGGAGACATGATTTTTGTTGATCCTGAAGTCCCTGCCTGCCACGGTGACGACGTTATTGCATTGATGCACGATACAGGCGAAACCACCTTCAAGCGGTTGATAGAAGATGGAACACAGCGTTACCTCAAAGCATTAAACCCAAACTGGCCTGAACCTTACATTAAGATCAACGGTAATTGCTCTATAATTGGTACAGTGATTTTCTCAGGAAAACCAAGAAGATACAAAATAAAGGCCTAATCAATATTTATAACCTGCTTCGGCAGGTTTTTTTATACTTGACAATGTACCCTTAAGATACATAATGTATCTATAGGATACATAACACAGGCAAGATTAAACTAAATTTGGTTGTAACACGGCGTATGGCACATGCGTCGTTAGCGGTCTGGGGACGTTAAAGGGGACAATCCACTCCTTGCTCGGGCAAACAAACCAGGTAGCCGGAATGTGCAAGTCAATGATGATGCTGATAAGACGCCTAACCAGCGTGGCGATCCGGTTTGACGCCTGGGAAGAGACCAGGGTGCAACGATGAGGGCATTTATGGAACCGCGACAAAGTGTGGTGCCGTAACTGGCTAAGTGCTCTCAGCGTTGTGGTAATCCGCGAAATGGCGCGGCGGTAAGTATGGCGGGGTTACTCTTTCCCCGTTGAGGACACCGGATTGTCAGGTTGACCATACGCCTGAGTGACAACCCCACCACAACAGCCACTGCTTTGGCGGTACCAGTTTGTACACTTGCTTCCGGCTGGTACCGCTCTTTTTACAAAACAGAGAAGAGCATCACCGGACGACGGGCTCATAACCCAATCCATCCGGGCGGCTGCCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATGAGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCAGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCACTCAACCTGGAAATCGAACCATTTGATGAGAACCGTGTGAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAGGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCGTTAATATCGTTCTTAATAAACTGATTATTTATCTCATCACTGAATATTTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACGCAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTATTCTAAAGTTCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCCGATAATATGCGTGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACTCTCAAACTGAAAAAGACAGCACCGTTCTCTGCTCTGTTGTCTGTTAACGGCGAGCGTAACTCCCAGAAGTCACTGGCAGAATGGATTGAAGACTGGGCCGACTACCTTGTGGGCTTTGATGCTAATGGTGACGCCATTCAGGCAACCAAAGCGGCTGCGGCGATCCGTAAAATCACAATTGAAGCGAACCAGACCGCTGATTTTGAAGATAATGACTTCAGCGGCAAACGCTCCCTGATGGAGTCTGTCGAAGCGAAGACCAAAGACATTATGCCAGTGGCATTTGAATTTAAATGCGTTCCGTTTGAAGGTCTGAAAGAACGTCCGTTTAAATTACGCCTCAGCATTATCACTGGCGATCGTCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCGGTACAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAGAAATTCAAAGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCGTTTACGGAAGCGATAATTTTAACTATTGCCGCCCCTATAAAGAACCATTAAACAATAACGTGACGAAGCTATTGATAGTAAATGAAGCACTTGCAAAATAATTGCATTGCGTATATATACACATGTGTTGTATTACAGCGATAATGGTAAAGCAAATGATTAACTCTGAAGCAATTGAGCAACTAATGTGGCTATGGTCCTTATTTGACATTAAATTCTTATCTATTCTTGCCGCTGCCTTCACTATATATTTTGGCGTGCAAAAAATATCAAAAAAGGTGACAGTGTCGTATTCAGCAAATGCAAGTAGAATATATGACATGCATATATCAACCATAATCCTGAATAATAAAAGAGATAATGCAATTGCTATATCTTCAATCAATATGGAGGTTGAAGGTAAAGGGATACTACAAGTTATTAAATTTGACTCCCCTCTTCTTTTAAAGAACTATGATTCTTTAAAAGTTGAACCACCAAAATTTAGCAGCCTTTATAATAATGATGGCGTAGTTAAGTTAGATATTTATGATAAGTTTCATTTTTATATAATCACGACATCTGGAGATGAAATTAAATGTATTTCTGAAAATAAATATGTGGCACCAAACATGGAAAACAAAATAGCTACAGACATAAGAAAATTTAATGGCATTGTCTTAACAAACAGAATGTCTTATATTTTTTTCTATGCAAATGACAACAGAGAGAAATACTGCATAATAGATGTTTCATTGTTCATAAATGGTGACAACCCATTTCATTTTAATTTTTTAAAAGAAGATGAATTAAGAGATTTTTCTAGCATCCTTATTAGTTACGGATATCACCAACAGTTTAAAAGTTATGCATTGTTTAAAATAGACAACCATCTTGCTCCTTCTTTGGTTTTAAATAAATCAATGATAGAAAATAATATTATTGAAATGAATAAGTAACTCACCGGGTGCAGCCGGTTATGATGGAGAAATGATATGAATACCTTGTTTTTACTGATGGCTGAATTCAATACCCCAAACATTGAACTCTCAGCAGTTAGTCAAAAATACTTTGGTATGAGTCCAGCCACAGCAGAAGCAAAAGCAAACGCTTGTAAGTTGCCTGTACCTACATATCGCATCGGTACATCACAAAAAGCAAAACGCTGCATCAACATTCAGGATCTTGCGGAATATATTGACAAAAGACGGGAAGAAGGGCGAGCTGAGTGGGAAAAAGTCAGAACGGAAAAAAAATATAACTAAACTAAAACTATGGATAACCCGTATATGTACGGGTTATTTTTCTTTATCACTATCTTTTCTTGATTTGAACAATCCACTAACAACGAAACCAACCAAACCAACAATACTAATTGTACTTGTACCTAGTAATGCAACGATCGCTTCAACTGGAGCCTTTCCTTCATGTGCAATAAGAAACGATGTAAACATTGCGACAACGAATAAGCACCAACACGACATAAACCAAACCGTGAATGATGCCATTTTTGTCCGGAGCTCATTGTCTATTTCTTTACCAGTTGCGTCAGCTATCTTATCCCGTACTTGTGATTTGAGCATATCAAGCTGAGCTTGAAGACTGTCCATTCTGTTCTGCTGCATAAACTCATGCAATGCACCAGTATTAGAACCAAACTCTTCTTCCTCCAGAATAGCCTTATTTTCTGAAGAAGAATCATCATCGCGTTCATGATTAGACGGCTCAAATGCGGATTCAAAAGCCTGCTCTTGACTGTCAGTAGAGTTTAAGGATGCTTCAGAGCGACCATTTTCAACACCTGCGGCCGCTCCGATCAGTTTATAGATATCTGAATTATGAGACAT
Protein sequences of DBSCAN-SWA_12 >CP029122|4557435:4580356|4570472_4571270_-|AWF27011.1|DBSCAN-SWA MNNLMVIDGIEVRRDAYGRYSLNDLHRAAGSLDKHKPAFWLRNEQTERLISELQICNSVNIEPVNVIRGGNNQGTYVCKELVYAYAMWISPSFHLKVIRTFDMVTSAPEKLSGQAADKMQAGVILLDFMRRELNLSNSSVLGACQKLQEAVGLPNLAPRYAIDAPADAPDGSSRPTLSLSALLKQYGIRLTANQAYHQMVKLGIVEQRERYSRTAINNIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESRFPELLKLLDTVH >CP029122|4557435:4580356|4571289_4571487_-|AWF28567.1|DBSCAN-SWA MFPPDNRIRDLDNYNKALFDALTHAGVWEDDRQVKRMLVEWGPVIPKGKVEITISKYEKPAGAAA >CP029122|4557435:4580356|4571998_4572652_-|AWF25650.1|DBSCAN-SWA MSNKYCQALVELRNKPAHELKEVGDQWRTPDNIFWGINTLFGPFVLDLFTDGDNAKCAAYYTAEDNALAHDWSERLAELKGAAFGNPPYSRASQHEGQYITGMRYIMKHASAMRDKGGRYVFLIKAATSEVWWPEDADHIAFIRGRIGFELPAWFIPKDEKQVPTGAFFAGAIAVFDKTWKGPAISYIGRDELEACGEAFLAQVRQQAEKLVREMAA >CP029122|4557435:4580356|4557435_4558533_+|AWF28496.1|integrase|DBSCAN-SWA MAYYNIEKRLKSDGTPRYRCNVIIKEKGVITYRESKTFPKHAHAKTWGAQKVMELDLYGIPSSNAVDGLTVRDLLHKYLNDPNAGGKAGRTKRYVLELLMDSDISAIKLSELTENDVIEHCRLRNNAGAGPATVSHDVSYLGSVLDAAKPVYGINYTSNPAKSARPYLLKLGLIGKSNRRNRRPASDELNMLIEGLQQRSTHKCSKIPFVDILKFSVWSCMRIGEVCRLRWEDLDQEQKSILVRDRKDPRKKEGNHMKVALLGEAWDIVQRQPKKSEFIFPYNSTSVTAGFQRVRSKLGIKDLRYHDLRREGASRLFEAGFSIEEVAQVTGHRSLNVLWQVYTELYPKSLHNRFEELQRSRNKTS >CP029122|4557435:4580356|4567894_4569121_+|AWF24103.1|DBSCAN-SWA MNDFSIIVNLYRLSSYPHFDGAKFSARIAYNADVKSLFKRILNPTFQAGTADEIEVDGHLIYDYEDFPEKGNFLTYSFKISQGSANRFYKNKNEFVKINTLKKGIMPEYFYIIEDDFYSLETPKPSYIQKIEDICELINALSMLAHFHDIKKDSKGTFYRLVFILNSESKSSSAVIETNITEEIFNDKTVNTQLVKTLVSSEATTDAHHIEKINTFRNTVIEYVNKNGNSFVELINKWDFICELYTNNLAAYMSAFSFHKARKEVVDAELDYSEKLSKIISEISNKALAIPISLAGSIAIFKLTTKADWIIALIGLIITAIITSAMIVSQKKQLARISHSKEILFGQLRYRIKDDTSDLKESLEEAIKKLNDNEDFCHKVLDSLLSLAWMPTFIGIIGILFKLMPNIT >CP029122|4557435:4580356|4579807_4580356_-|AWF24170.1|DBSCAN-SWA MSHNSDIYKLIGAAAGVENGRSEASLNSTDSQEQAFESAFEPSNHERDDDSSSENKAILEEEEFGSNTGALHEFMQQNRMDSLQAQLDMLKSQVRDKIADATGKEIDNELRTKMASFTVWFMSCWCLFVVAMFTSFLIAHEGKAPVEAIVALLGTSTISIVGLVGFVVSGLFKSRKDSDKEK >CP029122|4557435:4580356|4577246_4577609_+|AWF28555.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGR >CP029122|4557435:4580356|4571675_4572002_-|AWF26728.1|DBSCAN-SWA MTTLTQCQQQVLDILISYQQERGFLPTNQEVATMLGYRSVNAAVEHLRALEKKGVITIKRGVARGITLHTAVKDDDSEAVGIIRALLAGEENARLRAAHWLHERGLKV >CP029122|4557435:4580356|4578715_4579468_+|AWF26869.1|DBSCAN-SWA MWLWSLFDIKFLSILAAAFTIYFGVQKISKKVTVSYSANASRIYDMHISTIILNNKRDNAIAISSINMEVEGKGILQVIKFDSPLLLKNYDSLKVEPPKFSSLYNNDGVVKLDIYDKFHFYIITTSGDEIKCISENKYVAPNMENKIATDIRKFNGIVLTNRMSYIFFYANDNREKYCIIDVSLFINGDNPFHFNFLKEDELRDFSSILISYGYHQQFKSYALFKIDNHLAPSLVLNKSMIENNIIEMNK >CP029122|4557435:4580356|4569475_4570465_-|AWF28411.1|DBSCAN-SWA MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPERYSSFASGAVPAASQPLADDPAVRAVFRNEAVIRRAGGVECLESWLLREKGCQWPHSDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDSHVVTMPELCWWLIRNDLADALPESAARKALRLPKPVVPSVTRESDLVPSVPATSIIQDKAKKVLALKVDPESPESFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >CP029122|4557435:4580356|4566290_4566509_-|AWF27035.1|DBSCAN-SWA MLTRRKKDMLKQQDMTETAKVVFNELSIEPATVGEIAQNTYLSRERCQLILTQLVMAGLADYQFGCYRRLQQ >CP029122|4557435:4580356|4565088_4566141_-|AWF26858.1|DBSCAN-SWA MKNTVKINSVDLINADCLHFIQSLPDDSIDLIVTDPPYFKVKPNGWDNQWKGDEDYLKWLDHCLAQFWRVLKPAGSLYLFCGHRLASDIEIMMRERFNVLNHIIWAKPSGRWNGCNKESLRAYFPATERVLFAEHYQGPYRGKSDGYAAKERELKQHIMAPLISYFRDARAELGITAKQIAEATGKKNMVSHWFGASQWQLPNEADYRKLQALFSRIAAEKFQEQQLEQPHHQLVASYDSLNRKYSELLDEFKSLRRYFSVSVSVPYTDVWMHKPVQFYPGKHPCEKPADMLRQIINASSRPGDLVADFFMGSGSTIKAAMALGRRALGVELESERFNQTVKEINELVGK >CP029122|4557435:4580356|4576286_4576580_+|AWF25976.1|DBSCAN-SWA MTYILRVIGDSMIDEYRPGDMIFVDPEVPACHGDDVIALMHDTGETTFKRLIEDGTQRYLKALNPNWPEPYIKINGNCSIIGTVIFSGKPRRYKIKA >CP029122|4557435:4580356|4577674_4578499_+|AWF26265.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKVLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAIRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >CP029122|4557435:4580356|4573957_4574194_-|AWF28634.1|DBSCAN-SWA MLNPLILNICRLLQRKKTSIPTVGQWYTTPAGHVLRVSLVDRECQKVICEPLGRNYRVSMPLIAFRSGKNMKHLGGAA >CP029122|4557435:4580356|4569113_4569458_-|AWF26508.1|DBSCAN-SWA MRDMYEVLDRWGAWAAADNSGVDWQPIAAGFKGLLPHGKKIRLQCDDDEGIMIDGCVARLCKYKPDEYELIIAHFVIGISLRSIAKRRRCSDGTIRKQLQTALGFINGVMYMLK >CP029122|4557435:4580356|4579504_4579774_+|AWF28032.1|DBSCAN-SWA MNTLFLLMAEFNTPNIELSAVSQKYFGMSPATAEAKANACKLPVPTYRIGTSQKAKRCINIQDLAEYIDKRREEGRAEWEKVRTEKKYN >CP029122|4557435:4580356|4574186_4575023_-|AWF28199.1|DBSCAN-SWA MNSLTANNRLSQQLVVSVAEHLLLRHECRLPNHLAVSNHRELYLTVGGELCRNLTAGFVTEEGFMSMLFVGSQKHSALSIFAKTTRMSALVLCGNSGVILLSVKDHQHIDSAIPGRYTVQAPYKAGAGIGVLEMLSAIYDAPASFLSSASVHTQIMVGWTGAPKGAPVSDNAGYANPVQFTTSEIGVSGGEGNSLLSEAAIMATVPALTRLNDEDLHKLSYVTTALRALRKVTLSDPQAHQVLVETLLNLQAERIRLADKANFHIHRLLNISGGHRHA >CP029122|4557435:4580356|4566749_4567898_+|AWF27187.1|DBSCAN-SWA MSKASIRHVIVHELLKESNKDFDHSKPYNLRDTELDKTNDIVKKLVDGVIDLYGSKGNSAHYGVFIKNKTKQGPIPELFHKYSLVQQSVSSDFIELSKEVMKQMYKSAQEQIWASGGYVVFTDYILSGFRYLLVTMIKKTNGVTISENLEPEEMIHLELGNINQAAKINFRYYEEYQKADDLKKTDLSYLSFISKTTGQSAAAYFIAALGCDKGIASAGATRKLPDEIRRFFKKEPLLKNQAESFRNDVIKYLEKQFDNEHSARLSDIESLASGHMSYLKEEEKTELVDKLMKHLNSEEVRIPSEFVINKNSLDKISNVIYKTPSLSFHFDKDLLGVTTDAKIYYDDENQSLTFNNLPVEALTKIRRALKETDNPSNEEDKE >CP029122|4557435:4580356|4564249_4564663_-|AWF25159.1|portal|DBSCAN-SWA MVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVSGRKH >CP029122|4557435:4580356|4573142_4573961_-|AWF24417.1|DBSCAN-SWA MSMELMVKAMKIRVGNPLRKLVLIKLADNASDQGECWPSYQHIADQCEISKRSVMNHIAALCESGLVKKVTRKGEKGNSSNIYLLHLDGAGDSLGGSANNSLSGAANSPGSAGVAPGGSAGDSPRTSHSFEPVKEPVNEPIAVGASVDESVRVRSNRPEYSPEFEQAWLVYPKRAGGNSKSAAFKAWKARLNEGVNPETMLEGVKRYAGWVSAMGNSGTQFVKQAVTFFGPDRHFEESWEVPAVSAARREDPYFKASYDNVDYSQIPAGFRG >CP029122|4557435:4580356|4561221_4561365_-|AWF27251.1|DBSCAN-SWA MKMGITPFLHALCAVAAQILVGLFTGNWVYGAIASCTFFIAGEHTQA >CP029122|4557435:4580356|4561071_4561206_-|AWF25208.1|DBSCAN-SWA MFGHGKRINMPWWGGFDPRAWDVASLMDFAVPMVACLLIWLLIR >CP029122|4557435:4580356|4561401_4563555_-|AWF26737.1|DBSCAN-SWA MTCTDISVEPSGYIGFIRIYISAKGYPSVASSAGDNLISGYIVRSEVSTKRYSGLFVGSDTKSLYSYIYSQSSGPQWIRHTRRDELSRFGQNESTTRMYGPTGDRFFEIHTNGSWGVYDSSSGQWSPLGLAQGGTGGRTIEQARANLRVMYEQKAGLANTDLNTLTGEYSGFYQQPTSAYATEELNYPIGLAGALIVLQTRANTASSCVQVYHPYNNPGITYRRIYEGGSGTWSEWKRDVSTERVEEGKETTYVYSTYSSGAPRLQVSKSGLWGCHNGTGWLPLAVGQGGTGATTVEDARNNLSLGESSAVKFKNLTLTEALDTTLGLLTKTGRDWNAQHTDNVDKFRPIAGSTNGPAGSMVLGGIHVQFSKNYAVQFGGRNSGFWGRTIENGTTQEWKKLLTVDDLNSSTALAVRSLTTSNPIKSGGGRIDVLGSTSDYSKMDCFVRGFDSTGNSLAWALGSSVGVSKMLSLKNFFSGAEILLNGNDGAVQLKTGAVNGAKAQALTINKDEVNSTVDLTLTKQTGTGNRFVLQNLGNTELPFAVKVWGSGDRQNVFEVGTSAAYLFYAQKTSSGQLFDVNGAINCTTLNQLSDRELKDNIQIISDATEAIRKMNGYTYTLKENGLPYAGVIAQEAMEAIPEAVGSFTHYGKELQGPTVDGNELREETRYLNVDYAAVTGLLVQVARETDDRVTALEEENAELKQRLSAIEAALASK >CP029122|4557435:4580356|4559593_4560964_+|AWF24165.1|DBSCAN-SWA MTASRIFKKSFSKKNLLKVYSEKIKESGAIGIDRIRPSKLDLTIKNEITFIFEKVNSGNYKFTAYKEKLISKGANSTPRQISIPTARDRITLRALCECLTEIYPKSRLKLPHTVIDSLKEALNNSLYAEYAKIDLKSFYPSIEHKLIINAIKNKIRKKEIRQLITSSLIVPTVSGTTGSKGIPNNTRGVPQGLAISNILAEISLSNFDDEINKMHDIWYMRYVDDILILTPKYQATKIASHIIDKLQSLNLNPHPLNEENSKSKVGSLDESFNFLGYHIENRELLIKHESILRFESSLAKIFTAYRHALLQAKSKRDKERAVAYCQWKLNLRITGCVFEGKRLGWVSYFSQITSTAQLRSVNHTINNLIRRFGLSSEIKPKSLIKTFYELRRGRAETFKYIPNFDNLHISQKRELVSMWIGKEKEKKLSNSEIERKFKFKIAKSVKELEEDISGIS >CP029122|4557435:4580356|4575019_4575571_-|AWF26655.1|DBSCAN-SWA MGKHHWKVEKQPEWYVKAVRKTIAALPGGYAEAAEWLDVTENALFNRLRADGDQIFPLGWAMILQRAAGTHYIADAVAQSAGGVFVSLPEIEEVENADINQRLLEVIEQIGSYSKQIRSAIEDGVVEPHEQTAINDELYLSISKLQEHAALVYKIFCAPEKSDARECAAPGVVAFCVCGETNA >CP029122|4557435:4580356|4572651_4573140_-|AWF28352.1|DBSCAN-SWA MSLLNEVQKYIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTRRHFPRLTERAQEAEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAVPGDEWYLSGNYVGA >CP029122|4557435:4580356|4564805_4565012_-|AWF24686.1|lysis|DBSCAN-SWA MDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >CP029122|4557435:4580356|4563551_4563665_-|AWF25811.1|DBSCAN-SWA MLNPVKQTRRNLQRTHLIIATRHNPLLVTILVLVKLL >CP029122|4557435:4580356|4563706_4564132_+|AWF25390.1|DBSCAN-SWA MVSDFLAAVAEFADAVCEAAAACALLSAFVSDVFAAFAEFPAAVAEEAALLALEAALVSDDFAASFEAAASRAEVAASDAFVVAVDAEVAADFCDAAAFVSDVFAAPALVAAALFEDSAAAALFDASVAFVDAVPALEDAD >CP029122|4557435:4580356|4575614_4575815_-|AWF28104.1|DBSCAN-SWA MKAYWDSLTKEQQGELAGKVGSTPGYLRLVFNGYKKASFVLAKKLEQCTSGAITKSDLRPDIYPKD >CP029122|4557435:4580356|4558593_4558842_+|AWF25669.1|DBSCAN-SWA MRIEICIAKEKMTKMPTGAVDALKEELTRRISKRYDDVEVIVKATSNDGLSVTRTADKDSAKTFVQETLKDTWESADEWFVH |
32 | Shigella_phage(36.0%) | lysis,portal,integrase | attL 4548700:4548713|attR 4567418:4567431 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
2205 : 49895
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP029123|2205:49895|DBSCAN-SWA CTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCAATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCGTGTACATCGAAATACGGCTTATCAGGCGTTAAAAGATGCTTGCGATGACTTGTTTGCAAGACAATTCAGTTATCAGAGTCTTAGTGAAAAAGGTAACACTATTAATCACAAATCAAGATGGGTGAGCGAGGTGGCTTATATTGATAATGAAGCTGTCGTTAGACTTATTTTTGCCCCTGCTATTGTGCCTTTAATTACTAGGTTAGAAGAACAATTTACAAAGTATGAAATACAACAAATAAGTAATTTAACAAGTGCTTATGCTGTTCGTTTATATGAAATATTGATTGCATGGCGTAGTACTGGAAAAACGCCTCTCATAACTATGTATGATTTTAGACAAAAAATAGGTGTACTCGAGACTGAATACAAACGAATGTATGATTTTAAAAAATATGTTTTAGACATTGCATTAAAACAAGTAAATGAACATACCGATATTATTGTCAAAGTTGAACAGCATAAAACAGGTAGATCTATTACTGGTTTTTCATTTAGCTTTAAACAGAAAAAATCAGCCACGCATTCAGTCGAATCTAAAAGAGATCCGAATACATTAGACCTCTTTTCAAAAATAACAGATAAACAACGCCATCTATTCGCCAACAAGCTCTCAGAGCTTCCTGAGATGAGTAAATATTCACAAGGCACAGAAAGCTATCAGCAGTTTGCTGTACGTATAGCTGCCATGCTGCAAGATGCAGAGAAAGCAGGTCTGTGGTTGGATCTTTCCGCTTTAATCGGAACCGTCCATACATTTCAGGAATACGGCTAAAGGCAACCCCCATTTCTTTAATCAGTGATGTCTTTGCTTTTTCAAACTTAGGCAAATCGTGTAAACAAAAACGGTCGTACGCTGCTGCTTCAACTTCTTGTTCAGGCGTTAGTGGTGCGGGCTCAACAAAAGTAAGTTTTGGATCGCAGTGTACAGGATACGCATTTAAAATGTAGTAATGTGGGTTGATAAAGAAACGACTATACAACTCTGAGAACCCAGGGGATTGTAAACGGAGGTTACGCACCTCAACGGTTTCTAAATCTTTGATACCGTTGGGAACTTTAAACACCGTCTGATAGTTTTTATTGACAGACACGTCAATAATCTTTTGCTCCAACTCTCGTGGGTAACAGAAACCTAATACTTCTACGTCTTTTTCACCCAAGAGAGAAACGATTTCAGCACGGAAAATAAAGACGGTTAAAATGGTATTCTTAACATAAATGTCATATGAATACAATTCTGTTTCCAATGTTTCAACATCTTGTCTCCCGACAATAAGGTCGTTGAAAATAGTGTTAAACACTTCAGCGAGTTTGGAATTCTTTAAATGAAATTGAGGTTCACGCAAGTACATAATGTTTCCTTATAATAAACGACTAAGCGGACATTACATCCGCTTAGTCGTTTTTATTTACTATGAATAGATTTTAACAGAGTAACGACGTTTATGCTCTTCTGTTGATTGAGCGATAGACCATTCCGTCATGGACAGTTTTTTACGGTTCACACGAGACTTAAGGTCATCAGTAAACCCTTTACTGATAGTTTCTAATTCTCGACTGGGGTTATTAATAGTAACCAGCACGACTTGATCTAAGTGAGTTTTCACTCTGTCAAGTTGAGAAGCAGAAATAAAAAGCACAACATCGCTTGTTTTTGCTTTGTAGTAATGTTTAATTAACAACTCTGCTGACTTAGGTAAAATGTTTTCAACCCCAACAATGGGTTTATTTTGTTCACCTACTACGGCAAATAATTCACGGGCGCCGTCTGAAAGCGCACGACGTAAATCAAACTGACGTTCGTCAATAAATACAACGGCGTCTTTTGCATACTTAGTCAACATCTCTTCTACTTCTTTAACACGTAGAAGTGGGTTTTGCGTGGTTGTTAATTTGTTTTCGTGGTTCATTACCACAATCGCGGTTACTGACATAGTAAACTCCAATGAATAGTTAGAAGGTAGCCCTCTTAAAATTCATTGGAGAATAAATTAGTCGGATGATGAAGAACTTGAATCAGAGTCACTGGAGTCTTCTCTGGATTCGGTATCGATTTGTGATAGCGCAGCCGAAGTAGTCGCAGCTATAACAAAAGCTTCACCTTGTCTTAACCCGCCATTTTCTAACGGAGTGCTAAGATTATTCTGAGTCTGTTCTTTCTCCATCTCATGTTGCTGAGCCTGAAGTTCACGAAGTTTATGTTCTAAATCCTCGTGTCTCACTAACTGGGTGATTTCATAAGGCTTATCGTTGTCTTCATCCACATCGACAGACTTTAATGTTGCCAAATGAGTAAACCCGCCAAAGAGTTTAGGTTTTAGCGCTACATCACCTACTTCGTCATCATCCACAAATGTACAAATGATTTCGGATGCAAATGGAGTGGCTGCCTCATATACCTGCGCACCACCAATGACCCAGATTTCATCTTCGCTGTCACGATGAAAAGGAAGAAACTGATTGATAAAGCTATCAGGAGTGATGTAGACGACATGACCTTCTTTGGTAAAGGTTTCAGAACGTTTCATGTCATGGGTTGTGATGGTCTGAATGTCTTCAAACGTTTCCGCTTTTGATGTCACCACCACGTTTAGTCGATTAGGTAACCCGCGATTATTCAACGACTCAAATGTTTTACGCCCCATCACCACGATGTTGTTTTGTGTCATCTTTTTGAAGTTCAACATATCGTCTTTCAAACGATACATCAGTGTGTTGTTTTTACCGATAAAGCATTGGTTGTTAATTGCAAGAATCATACGAATCATGATGTTTCCTTAATGGAGTTTAGGGTTGGATGAATTAATGCCGGCTTCATTAAGCGCGTGTTCAGTTAAACGAATAAATAAATCCAACTGAAAACAAGGGTATAGGAAGTATAAACGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCGATTCACCGGTGAACCGGATGAGCTTCAGCTGGCACGATATTTTCACCTTGATGAAGCAGACAAGGAATTTATCGGAAAAAGCAGAGGTGATCACAACCGTCTGGGCATTGCCCTGCAAATTGGATGTGTCCGTTTTCTGGGCACCTTCCTCACCGATATGAATCATATTCCTTCCGGCGTCCGGCATTTTACCGCCAGACAGCTCGGGATTCGTGATATCACCGTTCTTGCAGAATACGGTCAGAGGGAAAATACCCGCCGTGAGCATGCAGCGCTGATACGTCAGCACTATCAGTATCGTGAATTTGCCTGGCCCTGGACATTTCGCCTTACCCGTCTTTTATATACCCGGAGCTGGATAAGCAACGAACGTCCTGGCCTGCTTTTCGATCTGGCGACAGGGTGGCTTATGCAACATCGTATTATTCTCCCCGGAGCCACTACGCTGACCCGGTTGATTTCAGAGGTAAGGGAAAAGGCGACGTTGCGCCTGTGGAACAAACTGGCACTGATACCGTCAGCCGAACAGCGTTCACAGCTGGAGATGCTGCTGGGGCCAACTGATTGCAGCCGCCTGTCTTTACTGGAATCACTGAAAAAGGGCCCTGTGACCATCAGTGGTCCGGCGTTTAATGAAGCAATTGAACGCTGGAAAACTCTGAACGATTTTGGCCTGCATGCTGAAAACCTGAGTACACTCCCGGCTGTGCGCCTGAAAAATCTCGCACGTTATGCTGGTATGACTTCGGTGTTCAATATTGCCAGGATGTCACCGCAGAAAAGGATGGCGGTTCTGGTTGCCTTTGTCCTTGCATGGGAAACGCTGGCGCTGGATGATGCATTGGACGTTCTGGACGCCATGCTGGCCGTTATCATCCGTGACGCCAGAAAGATTGGGCAGAAAAAACGGCTCCGCTCGCTGAAGGATCTGGATAAATCTGCATTGGCGCTCGCCAGCGCATGTTCGTACCTGCTGAAAGAAGAAACACCGGACGAATCGATTCGTGCTGAGGTGTTCAGCTACATCCCAAGGCAAAAGCTGGCTGAAATCATCACGCTTGTCCGTGAAATTGCCCGGCCCTCAGACGATAATTTTCATGAAGAAATGGTGGAGCAGTACGGGCGCGTTCGTCGTTTCCTGCCCCATCTGCTGAATACCGTTAAATTTTCATCCGCACCTGCCGGGGTTACCACTCTGAATGCCTGTGACTACCTCAGCCGGGAGTTCAGCTCACGGCGGCAGTTTTTTGACGACGCACCAACGGAAATTATCAGTCGGTCATGGAAACGGCTGGTGATTAACAAGGAAAAACATATCACCCGCAGGGGATACACGCTCTGCTTTCTCAGTAAACTGCAGGATAGTCTGAGGCGGAGGGATGTCTACGTTACCGGCAGTAACCGGTGGGGAGATCCTCGTGCAAGATTACTACAGGGTGCTGACTGGCAGGCAAACCGGATTAAGGTTTATCGTTCTTTGGGGCACCCGACAGACCCGCAGGAAGCAATAAAATCTCTGGGTCATCAGCTTGATAGTCGTTACAGACAGGTTGCTGCACGTCTTTGCGAAAATGAGGCTGTCGAACTCGATGTTTCTGGCCCGAAGCCCCGGTTGACAATTTCTCCCCTCGCCAGTCTTGATGAGCCGGACAGTCTGAAACGACTGAGCAAAATGATCAGTGATCTACTCCCTCCGGTGGATTTAACGGAGTTGCTGCTCGAAATTAACGCCCATACCGGATTTGCTGATGAGTTTTTCCATGCTAGTGAAGCCAGTGCCAGAGTTGATGATCTGCCCGTCAGCATCAGCGCCGTGCTGATGGCTGAAGCCTGCAATATCGGTCTGGAACCACTGATCAGATCAAATGTTCCTGCACTGACCCGACACCGGCTGAACTGGACAAAAGCGAACTATCTGCGGGCTGAAACTATCACCAGCGCTAATGCCAGACTGGTTGATTTTCAGGCAACGCTGCCACTGGCACAGATATGGGGTGGAGGAGAAGTGGCATCTGCAGATGGAATGCGCTTTGTTACGCCAGTCAGAACAATCAATGCCGGACCGAACCGCAAATACTTTGGTAATAACAGAGGGATCACCTGGTACAACTTTGTGTCCGATCAGTATTCCGGCTTTCATGGCATCGTTATACCGGGGACGCTGAGGGACTCTATCTTTGTGCTGGAAGGTCTTCTGGAACAGGAGACCGGGCTGAATCCAACCGAAATTATGACCGATACAGCAGGTGCCAGCGAACTTGTCTTTGGCCTTTTCTGGCTGCTGGGATACCAGTTTTCTCCACGCCTGGCTGATGCCGGTGCTTCGGTTTTCTGGCGAATGGACCATGATGCCGACTATGGCGTGCTGAATGATATTGCCAGAGGGCAATCAGATCCCCGAAAAATAGTCCTTCAGTGGGACGAAATGATCCGGACCGCTGGCTCCCTGAAGCTGGGCAAAGTACAGGTTTCAGTGCTGGTCCGTTCATTGCTGAAAAGTGAACGTCCTTCCGGACTGACTCAGGCAATCATTGAAGTGGGGCGCATCAACAAAACGCTGTATCTGCTTAATTATATTGATGATGAAGATTACCGCCGGCGCATTCTGACCCAGCTTAATCGGGGAGAAAGTCGCCATGCCGTTGCCAGAGCCATCTGTCACGGTCAAAAAGGTGAGATAAGAAAACGATATACCGACGGTCAGGAAGATCAACTGGGCACACTGGGGCTGGTCACTAACGCCGTCGTGTTATGGAACACTATTTATATGCAGGCAGCCCTGGATCATCTCCGGGCGCAGGGTGAAACACTGAATGATGAAGATATCGCACGCCTCTCCCCGCTTTGCCACGGACATATCAATATGCTCGGCCATTATTCCTTCACGCTGGCAGAACTGGTGACCAAAGGACATCTGAGACCATTAAAAGAGGCGTCAGAGGCAGAAAACGTTGCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCTACTAGCGGCCCCAGGCAGCAGGTCGATGCAAGAATGGCGGCCAGCCCGCCGGCGAAGAGCGCACCGCGCCCGTTTTGTGGTTCAGACATACGTTGGCCCTTTTGAATTTGGATTGGATAGCGTAACCTTACTTCCGTACTCATGTACGGAGTCAAGCGATATGGAAAATAATTTGGAAAACCTGACCATTGGCGTTTTTGCCAAGGCGGCCGGGGTCAACGTGGAGACAATCCGCTTCTATCAGCGCAAGGGCCTGTTGCGGGAACCGGACAAGCCTTACGGCAGCATCCGCCGCTATGGGGAGGCGGACGTGGTTCGGGTGAAATTCGTGAAATCGGCACAGCGGCTGGGGTTCAGTCTGGACGAGATTGCCGAGCTGTTGCGGCTCGACGATGGCACCCACTGCGAGGAGGCCAGCAGCCTGGCCGAACACAAGCTCAAGGACGTGCGCGAGAAGATGGCCGACTTGGCGCGCATGGAAACCGTGCTGTCTGAACTCGTGTGCGCCTGCCATGCACGAAAGGGGAATGTTTCCTGCCCGTTGATCGCGTCACTACAGGGCGAAGCAGGCCTGGCAAGGTCAGCTATGCCTTAGCGTGCTTTATTTAATGAGATGGTCACTCCCTCCTTCCCGGTACTATGCTGAGGACAGGCTTTCATTCGGAGAACTATCATGGAAAACATTGCGCTCATTGGTATCGATCTGGGTAAAAACTCTTTCCATATTCATTGCCAGGATCGTCGCGGGAAGGCTGTTTACCGTAAAAAATTTACCCGGCCAAAGTTGATCGAATTTTTGGCGACATGCCCCGCTACAACCATCGCAATGGAAGCCTGTGGCGGTTCTCACTTTATGGCACGCAAGTTGGAAGAGTTGGGGCATTCCCCAAAGCTGATATCACCACAATTTGTCCGCCCGTTCGTTAAAAGCAATAAAAACGACTTTGTCGACGCCGAAGCTATTTGTGAAGCTGCATCGCGTCCGTCTATGCGTTTTGTGCAGCCCAGAACGGAATCTCAGCAGGCAATGCGGGCTCTGCATCGTGTCCGTGAATCCCTGGTTCAGGATAAGGTGAAAACAACCAATCAAATGCATGCTTTTCTGCTGGAATTTGGCATTAGCGTTCCCCGAGGAGCTGCCGTTATTAGCCGACTGAGTACCATTCTTGAGGATAATAGTTTGCCTCTTTACCTCAGCCAGTTATTGCTGAAATTACAACAGCATTATCACTATCTTGTTGAGCAGATTAAAGATCTGGAATCCCAGTTGAAACGAAAGTTGGACGAAGATGAGGTTGGACAGCGCTTGCTGAGCATTCCCTGCGTCGGAACACTGACAGCGAGTACTATTTCAACTGAGATTGGCGACGGGAAGCAGTACGCCAGCAGCCGTGACTTTGCGGCGGCAACAGGGCTTGTACCTCGGCAGTACAGCACGGGAGGTAGGACGACATTGCTGGGAATTAGTAAGCGAGGTAATAAAAAGATCCGAACTTTGTTGGTTCAGTGTGCCAGGGTATTCATACAAAAACTGGAACACCAGTCTGGCAAATTGGCCGATTGGGTCAGGGATTTACTGTGCCGGAAAAGCAACTTTGTCGTCACTTGTGCTCTGGCAAACAAGCTGGCCAGAATAGCCTGGGCCCTAACGGCACGACAGCAAACTTATGTAGCATAACGGCAGAAATACACCGGTTTAAAGAATTACTGATCTGGTTTTGCGAATACTGATATTGATGATACTAACGGCCCACCGGCCTGTTGAGGAACCTGTAAAACGGAAAGGCTCATTGAAGCCGTATATTTTCTGGAGGTTCATCAGGCGCGGAACTCATCAAGGCGCGGGAATAAAATCCCATTCAGACGCCGGATAGATTCAAGCAAGCCAACTTGTCGTCAAAATCGGTGTTGCAAAAACGGGAGTGACCATAGATTCCGTTTTCTGAGACGACCCCTTGTAGGATTGGCTGTATCTGGGGACACTATAACCGTCAAAGAAGCCGGTTTGGTGTTGGTCATTGGGGTTATCCTGTGGATCTATGGTATAATCTTAACCAAGGTCAGCAATTCCTAAGGGGGTCAAATGGACGCTTTCACATTAGGCATGTTGGGGTTGCTCATTTTTTTCACTGTCGTCACTGGCGGCAGTCTGTATCTCTACCATGAGAAACAGAAGGAAAAAAAGCATCACAACGCCTAAAGAGTAACTGCGACAATGAACGTAAAGGCCAGCAATAGCTGGCCTTTTTTTATACTTGCAGTTTGAGGTGTTACATGTCACGATAAGGTAAGTGTGACATGTAACGGAAAAGGTGATTTTTTGAAAGTATCCAATGAAGATGCTCAGGCTACGGCGATCTATCTCCTCAGAGCTGCTTCGCGCCCAGCTTTCTGGCGTGACGTCCCATTCGATAAGAAACTTGAAGCCGTGGACAGCCTGAACAGCATAGGGCGATCACCATCAGAACTCACTGAATGGATTAATAAATACCTGACAGCAGAGCAAATCAATAAACTCGGGACATCAATTAGGCAACGTCGCAGAAGGGGATATGGTGTTGGTAAAAGCATAACTATAAGCGATAAAGCCCACAGGATTTTGAAGCGGTTGTCAGAAGTCGATGGTTGCAGTTTGTCAGAAGTGATCGAGAAACGCCTAGCCCGAGCTTATAAAAATACATGGGACCATAAATAGAGTGACACTAGGGGTTGCATTTTTAAAACCCGTGATAGCATCTAAATGAACCCAAACGAATAGGGGGGCTGGCGGCGATGCCGACCATTATCCCGACAATGGAGAAATAGACCATGATGACATTGACTACCGTCTCGAAGAAAACTTCAAATAATTCAGCCCTTGTATTCTGGCGCGTTGGTACAAAACGGAAAGGCATCCTTGATGTCCATATTGATTTTGACCACGAAGAAGCGGATCTTTTGGCTGAGCTCGTAGCCATTCGCTATCTGGCGCTGGACAAACAGGTTTTTTGCAGAGAGCCAGGTGCTGGTGCTGGTTACAAGCTGGTGGTATCCAAAGGTGCAATCAAGAAGCTGGCGCTGGGCAAATCCACCAAGGCTTTTGCTTTCAAGTTTGCGGCTTGCCTCACTGGGCGTTTAAAGGGCGCTACTATTGAAGTCTCGCAGAGCATGGAGTTTATGGATGAGCCGGGAGAGGGCAACATTGAGCTCCTCGATGTGGACAAGCAAGCCTATACCCAGACCCATGACGAAATATCTACACCTGCCATTGGCCCCGTCCTGGTTACTCAACATGCCATCGATCAGTATCAGGCCCGGATAACCTCTGGAGACCCTAAAAAACCGTGGGCCTCACTTGTTGGCCGCCTCCAGCATCCAGAGTTACAGGTTCAGCCCTTTGACGAGAAAGTGGCTCGCCATAAAGCCAGAAAGTATGGCCGCGTAGATAACGTGGAAGTCTGGGGCCATAGAGATTCCAAGTTCAAGTACCTGATGGTGATCAACGACGACAACCAAAAACGTGTTCTTGTCACAGTGTTTGAGCGAAATGAGTAACCTGCTTCTCATTTGTTCTGAATACACAACCTCTCCAAATCACGTATTCATACCCTAAAAGGGCAATAACGCAGTCGTTGATAACGACTCTCCCACCTGAAAATAGCTCCAGCATAAAAACTGGAGCTGTAAATGAGATCGAGACCTTCCCTTTTACTGATGCGAAACCTGCGGTCATTGGCCGTTGTGGTGCTCGCAACTTTGCCATGCGTTGCTTTTGCACAATGGCGGGTTGTAGCCGTGAGCACCGAAGTAGACAAAATGCGCTTCAACACCATCATAGACGCGCATTCCTTCATGAAGAATTATCGCTCTGGTGAGGAGCAAGGGAAGGGAACGCCAGTTGGGGATGCGCTTTATCCGGTTGCCAGTTATGGTGATGGCCGTTATGTCTCCAGGATCTGTTTTAAGTATCTAGGTGCTACTGGCGATTATGACCCCACTACCTGCACTGGCGACCCAGCTACGGTCTACTGGCGCTCAACCTATGTTCTGCCGGGTGAGATGGATAAAACACCGTTCCTGGACAGGGATCTCGGCTTACCAACAACGACCATGTGTGTAGGGAACCCTATCCATCTTGGCACCGGCAACAAGTTCCAGGCTGAACTGGATTATCAAAGCGGAGGCTCCGACCCTTTCACATTCACCAGATACTACAATAGCCACCTTCCTGATGAAGAGCTCGGCGGCTGGCGGCATACCTACTCGCGGAGCGTCGAGGTCAACGCCTCGAAGTACGGTGAGAACATGGTTGTCCTGCACCGGCCAGAAGGGCAACAACTCGCGTTCTACAACTCGTCCTCTGTCTGGGTGCCAACATGGAAAACCGATGATACCTTGACGAAGGATGCTACCGGCTGGCGCTACACTCAATCTGACGGGGTAGTCGAGGCCTATGATGAAACCGGACGACTGACCGGCATCGAGAAGCCCAACGGCAATCACATCACCCTGAGCTATCTGAACGGAGAGCTATCCTCGATCACTGACGGCTTTGGCCGCACCATCCAGTTCCAGTATCAAGATGGCCGCATGGTCAGTGTCACCGATCCTGCTGGTGGCAGTATTCAGTACCAGTACAACAGCGCTGGCAAGCTGGCGGAAGTGATCTACCAGGACAATACGAGCCGCAGCTATCTCTACGATGACCCGAATGCACCGGGTTTGCTCTCTGGGCTGGTGGACGAGAACGGCAACCGTTTCGCCACATGGGGATATGATACTCAGGGGAGAGCGGTTCTAAGCGAACACGCCGGGGGCGCAGAGAAAACTCAGGTAAGCTACAACGCTGACGGAAGCGTCTCTGTCACCAATGCCCTCGGCCACGTTCAGCGATACACCTACAGTCGCCACAATGGGATGCTCAAGCCTGATGTGGTTGAGGGTGCGCCGTGTACCGGCTTTGTGGGCGGCAAGGAAACCTACGTCTACGACAGCAAAGGCCTCGTCTCCAGCATTACTGACCGCGCTGGGCAGAAGCGCACGTTCACCCATAACGACCGGGGATTGGAAACCACCCAGATAGACCAGGACGGGGGTAAGGTTACGACCGACTGGCTTCCTTCCAAGTCGCTCCCGGCAAAAATCACAGAACCAACCAGGATCACTGAGCTCACCTACGACACTCATCTCCGGGTGATAAGCCGCAAGGTCACTGATCGCAGCTCGGGCGCTTCCCGGACATGGACCTACACCTATGCCCCTGTTGGTACAGGAAAGCCGAGCCTGTTGGCCTCGGTTGATGGTCCGCGCACCGACGTCAGCGATGTTACGACATTTGACTACGATGACCAGGGCAACCTGATCCGGACAACGAACGCGCTGGGGCAGGTGACGCAGTTTGGTGACTACGACGCGAACGGTCGCGCCGGGACCATTCAGGGTGTCAATGGTGTAACCCAAACCCTCACCTATGACGCCAGAGGAAGACTTGTCAGCTCCACTGGGCCAGAAGGAACCACGGTCTACAACTATGATGCTGTGGGCCTCCTGAGTTCGCTGACCAAGCCAAATGGTGCAACCGTCAGCTATGAGTACGACGCTGCACATCGGCTGGTGGCGGAAACAGATGCACAGGGCAACCGGCGCGAGCTTGAGCTCAATGACCTCGGGAACCCAGTAGAAGAGCGACTGCTCGATGCGCTGGGCCAGACCCGTTGGATAGAGCGCCGGATCTTCAACGAAATCGGCTGGCTCTCCAGTGTCTCCGACGCCTATAGCAATCAGTCATCGTTTTCCTACGATGTGGTGGCAAACCTGATACAGGAGACCAGTCCCTCTGGTAACACACACTCCTACAAGTACGACGGCTTCCATCACCGGACACAAACGACCGATCCCCTCGGGAAGGTCACGCAGGTGCTCTACAAGGATACCGGCGATGTTTACCGTGTCTCCGACCCTCGTTCGCGCCTGACCTACTACAGCTACAACGGCTTTGGCGAAGTGACCCAGGTCCGGAGCCCGGACACCGGCACCACCGACATTACCTATGACGAAGCCGGTAACGTGGCAACGCGCAAAACGGCCAAGGGGCAAACCACAAGCTACAGCTATGACGCGCTGAACCGGATCATCGAAACCTCCAGCGATGTCGCTGGCGAATCGCCAATTCTGTACGGGTACGACGAAGCAACCTCACCATACGGCATAGGCCGCTTGACCTCAGTCGATGATGGCAACGGTGTCCGGAGATTTGGCTACACCCCCGAAGGATGGCTGGCTTACGAAACCTGGGAAACCCACGGGCAGAGCCTGACTACCCAGTACCAATACGATGGTGCAGGCCTCGTCACGAAGATCACGTATCCCAGTGGCCGTGAGGTCTCCTACACCCGTGACTCAGCCGGTGACGTCATCGAGGTGACAACGACACAAGCAGGCACCACAACAAACCTGGCAAGCCAGATTGAGCGAGCGCCCTTTGGCCCCGTCACCAGTATGGTCAGAGGGAACGGCATTTCAGAAAGCCGCACTCTGGATCTCGATTACCGTGTCACCGGCATCGACGCTGCTAGGGTGCATTCGCTGGTCTATCGGTACACGCCAGACTCGTTGATTTCAGCCATAGACGACAATCTCAGCTCATCAGTCAATCAGTCACTCGGTTATGACGCGGTTGGCCGCATCACCTCTGCTGAGGGGATCTATGGCGTTTTGGGCTATGGCTATGACGCCACCGGCAACAGGACCTCGATCACGACCGATGGCCTGAGCCAAAGCTACACCATCAACTACATGAACAACTGGTTGGTGAAGGCCGGGCAAACCTCCAGAAGCTATGACGCCAATGGCAACCTGACGAAGCAGGGGGCGGATACCTTCACCTATGACAGCCAGAACCGGCTGGTGGCCGCAACGGTCGCGGGAGTGACTGTAAGCTACACATACAACCATCTGGATCAGCGTGTAACCAAGACCCTAAACGGGCATACCCGGCTGCTGGTTTACGACCTGGCAGGAAACCTCATCGAGGAGCTGGACGCGGCCACTGGAGACGTGCTGGCGGAGTACATCTGGCTCGATGGGACACCCTTGGGCTTTGTTCAGTCAGGACAGACCTACCAAGTCCACGTCGATCACCTGGGCACCCCGAAGGCACTGACCGACGTCAGCGGCCAAGTCGTTTGGAAGGCGAGCTACAGCCCGTTCGGTAAGGCCAGCATCATCATCCAGGGGCCAACCTTCAACCTGCGATTCCCAGGACAGTATTACGACGCGGAGACCGGGTTCCACTACAACTGGCGGCGTTACTACGACCCAGCGACCGGGCGGTACATTACCAGCGACCCTCTTGGCCTGATCGATGGAGTAAACACCTACGGGTATGTGCATGGAAACCCTATGTCCAATACCGACCCGACGGGTGAATTTGCGTTTGTTGGTGCAGGTATTGGAGCTGGGTTGGAGCTACTTAGCCAACTAATCGAAAATAATGGGAGTTGGAAATGTGTTAGTTGGTCAAAAGTTGGAATCGCCGGAGCGATTGGGGCTATAGGTGGCGGCTGGGCGTCAGGAGTTTTCAGACATGCCAGCTCCGGTAAATCGTGGTTCAAATTAAGCCAAAAATGGAGCAATGTCTCACCCAGAGTAAGGAAAGTTCAAGGGGTTCCACGAGGTAATGAGCTTCACCATTGGGCTATTCAGAGAAATGGCAAGTTTGGCAAATATGTTCCTGACTCAATAAAAAACCATCCTTGGAACTTGAAGTCCATTCCAAGAGATATTCACCAAAACATTCACGGCAATGGACCTACCCCATATAGCGCATTTGGTCGTTGGTGGCACGGGACGCCTGAATGGGCAAAAGTAGCTCAAGCCTCTCCTGTTAGTGGTGGTTTAGCTGATTCAATAAATGATGAGGGATGCGGTTGTGCAAATTGAATTTCCAGCGCTGCTTGTGAGCAGCAAGAAAAGGTCGCTCTTCGTAGTGGCATCAGAATCCGAGTTCGGGAAATGTACTATTCAGTCTTTGAGGAACGGTTATTTTGAGCTGATGGACATTTATGATTCGGAAGGTCGTCACTACAAAATAGACGAGGTTGCGAGCTACAAGCCGCTAAGTCCATTCTGGTACTGGCCTGTAGAAATTGTGATGTATGGTTCTCGACTGTTTAAGGCTAATTTCAATGCTGTTCTTATCTCTAATCTGGATTGCAAGGAGTTAAAATCTGAACTTTGTGATTTGGCTAAAAAGTACAGAAGTAATTTGGATTCCGGCGTCGGAATTGAAAAGATTATGGAGGAAATGGAGTCTGCCAGAACAATTAAAGAATTGATAAAGGTTTTTGGCTAGTTATTCATTGAATACGCTGGTGTTAGTCTAGCTAACCGACGCCAGCGGCCAAGTGTTTGGAAGGCGAGCCATCATAGAGTCAGTAGAGAGAGATACTTTTGGAAATCAATAGTATTATGATTGGTACAAATGAGTTGTAGCTCCATATTTTAAAGGGGGATTTCAGAGAAATGAAGAATAAGAAGGTAATAACTTTGATAATGGCCGCAGCCGCATCATGCTCGTCAGTCTACGCCGCGACATTACCGACCAGTGAGGTAGACGCATACATACTTGCGATGAACACCATGTCACCTATCACTGCAAAGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATTGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCCGCCGATGCGCTCAAGCCGTGGATTGCGCGGCGTGAACGCTGGCCGTCCTTTCTGATCCGGCGCGATCCGCGCGACATCAGCCGTATCTGGGTCCTGGAACCGGAGGGACAGCATTACCTGGAAATTCCCTACCGTACCTTGTCGCATCCGGCTGTCACCCTCTGGGAACAACGGCAGGCGCTGGCGAAACTGCGGCAGCAAGGGCGCGAACAGGTGGATGAGTCGGCGCTGTTCCGCATGATCGGCCAGATGCGTGAGATTGTGACCAGCGCGCAGAAGGCCACACGCAAGGCGCGGCGTGACGCGGATCGCCGCCAGCACCTCAAGACATCAGCTCGGCCGGACAAGCCCGTTCCGCCGGATACGGATATTGCCGACCCGCAGGCAGACAACTTGCCACCCGCCAAACCGTTCGACCAGATTGAGGAGTGGTAGCCGTGGACGAATATCCCATCATCGACCTGTCCCACCTGCTGCCGGCGGCCCAGGGCTTGGCCCGTCTTCCGGCGGACGAGCGCATCCAGCGCCTTCGCGCCGACCGCTGGATCGGCTATCCGCGCGCAGTCGAGGCGCTGAACCGGCTGGAAGCCCTTTATGCGTGGCCAAACAAGCAACGCATGCCCAACCTGCTGCTGGTTGGCCCGACCAACAATGGCAAGTCGATGATCGTCGAGAAGTTCCGCCGCACCCACCCGGCCAGCTCCGACGCCGACCAGGAGCACATCCCGGTGTTGGTCGTGCAGATGCCGTCCGAGCCGTCCGTGATCCGCTTCTACGTCGCGCTGCTCGCCGCGATGGGCGCGCCGCTGCGCCCACGCCCACGGTTGCTGTGGGAGGAAAATAAAGTGTCATCCTAACTACAGGACCCTGCAACGTAGCAATTCCTGCAAAAATCCATTTAAAACAATGACTTATTTCAATTCTGCCATCTGCAAGCGTTTTGTGGGGATGGAGGCCGGAATCATTAAAGTATCATCCTAATTTTCGCTCAACCTTGGAAAATGCTGAGAATCGTAAAAATAAAGTTTCATCCTTTCCGCTACTAGTTCAAGGCTTTCTTCCGGCCGCCGGCAACAACGCTGCTTGGCTGAGGCGATCCACTTGCGCTACCGCTCACTGATCAGATCGAACTTGAACTCGACGAGCAGCCAACCATCAGGCCGAGCGTGATCGCCAAAAAAAACGGATCATGCGAAATCCTTCGTTGAGTTTGGCGCGGGTTTTAGCAAATCAGCTGGCCCGCGATCCGGTTGAGAGTGATCCGAAGCGGTCCTTGACGACCGGCACAAATCGGCCGATTCTGTTGAAAAAGTAGCTTTAGCGGCAGCCTGCCGATCAGGTGTGCCTGCTGTCGAAGTGGCTGCAAGCCACTTCAAGTTGCCTTCCGGCGTTTCACTGAGCGTCCTTGCTCAGGTTTAAGGGTTAATTTGAGGGTTTCTGCTCGTAGCAGGCATACCTATCCCGTCAGCGGTGGCCCTTGAGGCAAAAGCTTGGCCATGCGGCGCAGGTTCTGCACCATCGCAGCCAAGGTGAATTCGTCAGTGGCACCCGTTAGGCCACGCAGTCGTAAACGGTCGAGTTTCATGATCCGTTTGAGGTGGGCGAAAAGCATCTCCACCTTCTTTCGTTCGCAGCGAGAGACGAGGTACTCCGGTGTCTTGGCGATGCGTCGAGCCACGTCGCGGGCAGCCTCATGGATGCTGCGGACGATCTTCCGATTCGGCGTGTTGGGGCAGCATTTCGCTTTCAACGGGCAGGTGGCGCAGTCGGTTTGGCTGGAGCGGTAAATGACGGTTTTGGCCTTAGTTACCCGCGACCTTTGCTGGGTGAAGGCGCGCCATTCACTGCGTAGCGGTTTGCCGGCTGGGCAGCGATATTCATTGGCGTCCTGACTCCAGTGAAAGTCGTTACTGGAGAGGCTGTCGTCCTTGCGCTCGGTCTTGTCCCACACCGGCACATGCGGTTCGATGTCCTTTTCTTCGACCATCCAGGCCAGCATCGGGGCGGTGCCATAAGCGGTATCGCCGATAAGGCGTTCCGGTGTGAGATCGAACTGCGCCTCGACACGCTCGACCATCGTCCTAGTCGAATCGACTTCGGCGGTACGGTGCGCCGGGGTAGCTTCCACGTCCATGATCACACCGTGCTCAGTGTCGATCAGGTAATTCGTGGAGTAGGCAAAAAAGGCCGGGCCACCTGGCGCTGCTGTCCAACGGGACTGAGGATCAGTGAGCGAAATTTTCTTGGGAAGAGCCTCAGCCAGCGCCTCTTCATCAAGGGCTTCGAGGTACTCGCGCACTGCGCGGCTGCTGAGCTTTGGATCGTTCCAATCGACCTCATCTCCCGCCACCCCACGTTGCCGGCTGGCATCCGCCTTAATGATGCTGGCGTCGACGGCGAAACCTTCACCCTTGACTAGGCCGGCTGCCATGCAGCGCCGCAGCACCTCATTGAATAACCAGCGGAATAGATCGCTGTCACGAAAACGCCCATGGCGATTCTTCGAGAAGGTCGAGTGATTGGGGACTTCGTCTTCCAGACCCAACCGGCAGAACCAGCGATAGGCCAGGTTCAGGTGCACCTCTTCGCACAATCGCCGCTCGGAACGAATGCCATAGCAAGTAGCCGACGACCAGCATGCGCACCATCAACTCCGGGTCAATCGAGGGACGCCCGATGGGGCTATAGAAATCTGCCAGGTAGGCACGTAGATCACTGAGATCCAAGCACTGGTCGATGCTGCGCAGGAGATGTTGGGCCGGGACGTGATCTTCCAGATTGAACGAGTAGAACAGGCGCTGCTGTCCTCCCGGTAACTGTCCCATCATGCTGTTCGCCCCCACGCTCGCTGACAAAGCAATTTTGCCAACGGCATGGGGAGGCCGCTACTTTTTCAACAGAATCGGCCGCACACAGCCATTCGGCTCAACAGAACCCTGCCTTTACCCCACCAAACTCCGGAAACTCGCCCGGTAGTCCGTCGGCTTGAGATGCACATGTTGGCTGAGGTCGCCGGGTAGCGTGAACAGGCGCGGACCGTCCCGAAACTGGAACACGCGATACAGATGGAATTGATCGCCCGCCTCCTTGGAGAATTCGAGTTCGTTGTGGCTGACCAAGAAAGACGAGCCTACCCCGCCATTGGTGGTTTTCACCTCGATGAAGCGCTCATGGGCGTCCTCTTCGAACGACAGGATGTCGAACCCCGCACCGTCTCCCTGGGTGTCGGACACCCAATCCAGCCGCTGAAAAAGCTCTGGGTGGCCGAGCTCGGTCAGGCGTTGCTGTTCGTAGCCAATCACCCACTGCTCCCCTGCCCGGCCCAGCTTGCGGTTGGCTTCATCGCGAGCGGCATAATCGAACTTTCGCGGTAGGCGTTGCCGTAGAGATGCCGGGGTACGCACAAGCACTTCACGGGCGGGTGGTTCTACCAAAGCCGCTCGGTAGGTTTTGTCACCCGGAAGTTTTACCTCCTCCAGGGCATCGACAAGAGCGCCGACCGTCTGCTGATGTTCCAGAACGTAGGCGTGTACGGATTTACGCAGCAGCAGTTGGCTGTTGCCGCGTGGCTTGTAGCCGTTGATATAGGGCAGGCCCAGGGCATCGAGTACGGCGCTAATGTTCTGGTGCTTGAGCTCGACTGAAGACTTGCTGCGACCGTTCAGCAGTTGGCGCAGTGCCTGGTTGTGCTCGGACTTGTTGTACGGCTCCCCAGCCGCCTCGGCACGCAGCATGTCGAAATAGTCTTCGACCGTGGCCAGGACCTCTTCTTCGGACCAGTCTTCGCCGATGCGAATGATGCGAAACCCGAGCCGCGTCAGCGCCGGAACGACGGTCGCCTCGCCACCGGAGAAGCTGTCAGCAGTGAGCGGGCCCTGCTCGGGAAATTGCTTGCCGAAGGCCACACCGGCGATGGCCTTGGAATCGCAATCGGTGCCGGTCTTCGGATCACGTACCAGGAAGTCGCGGGACTTGCCGTAGCCGTGGCGCGCCAGGAATTTCGTGCGGCCCAGTTGCACGAACTCATCGATGGCAGCCTGCACGGCGGCGGGGCTTCGAAGCTGGGAGAGTTGAGACACAGGGTCCTTCCTTACTGTCATGGTGTGCCGGGAACCGCCGAGCCACGAGATTATGAGTAGCCCCTGAACAGAAACGTCACGATAAAAGCCGTGAACGCCACCAGGCCCATTAAATCCCTTGCGTATTTGCAGCCCGTGCTGTCCAAACCTGTACCAGGTCCGATCAACACGCTCCAACCATTGAGGTACGAAAACACCGCCTGACCGAACAAAAAATGCGTGATAGCCGCCGCAGCCATGACGCCGGAATCGTCAGACAGGCTGTAGCTGTTGACCATTGCCCTGTACGCCTGCATCTGAAATGGCTATTGCCCTGATGGGAGCCGGCTTTTCAGCCACCGACACCAGCGATGCCGTCAATATTCTTTACCCGATAACCATGACCGTACAAGCTAACAAGGCCTGGCAAGCCAGCGGACTTAAAAAGTCTTTTTTTCTACCATCCCACAAAAAAGTTCGTGGTGGAGAAAAGATAAGCTATGCAAGGCTTTAGGAGACGTGGTTTTTCAGGATGACGAAGAACGATTCGGCGCTAGGTGCAACATAGGTGCATCGCACGAGCGCTAGGAACGGCGAAAAAAGGCGGACGTGGCGAAATCGGTAGACGCAGCAGACTTTAAAATTGGAGTGCCCGCGGGGAAATCCGCGGAGTAGAACCGCTCAAAGTCGGGGAACGCTAACGGGCAATACCCTAAGCCAATCCCGAGCCAAGCCCCTTCGGGGGAAGGTGTAGAGACTGGACGGGCGGCGCCTAAAGCCTTCGGGCAATGGCGAAGGGACAGTCCAGACCACGAACGTCATCAGACGGCGGCGAAAGTCGAGGTGGTACGAAAATCTGCTTCTCTGTGAGAGTACGGGTTCGAGTCCCGTCGTCCGCACCACAAAGCCAAACATCCCTGCGATGATCGACCTCTGGGCGTTTGGTCGTGAGCCCGCCACCCTCGCGCTACGCTTTGCGCAGGCGTCGAAGGCGATCAGGTGCGCCCATCGATTCCGTCGAGCACCATCGCTGCGAGTTCATCGCTGGTGTGTGCACCACTATCGAGAACACGGGTTGTGCATGGAAGCCGTTCCCTTGCGGCCAAGCATCGGGCGACATTCGCTAATCGCCACTCTCGAATCTCCGCATTTCGATTCGGGTCAGGATGCATGGTCTGGTTCGCGATCCGGTGACGCAATAGGTCCTCGTTGAGCGTCAGAAAGATGTGCAGCAGCTGATCGTCGATCCGCCTTACCCCGTCGAGTATCTCAGTCAGATAGTCCGGGTGCACGAGCGTCATTGGGATGATGATGTCCTGCGAGTAATTCCTTCGAATCTCCCTGACCGCCGCGATCGTAAGTCCCCTCCACAAGGGGAGATCCTGATAGTCTCCGCTCGCTGGCATGGGGACCGTTTCTTTCACCACGAACCCGATTTCCTCGGGGTCAAAGATCAGCGATTTGGAACGCCGATCGCGCAGCCGCTTAGCGAGCGTCGTCTTTCCGGCGCCGAAAGGTCCGTTGATCCAGATTATCATTGTCGACGGCCTCTAACCTGAAGGCTCGCAAGAGCGCTCGACGGCCTCGTGCGGAGGCACGATCGGAGTGGTTCCGAAATGCTTCTCAAGATAGGTGACGCCGAACGTCACGATGTCCTGCGCGTCGAACAGGTAGCACTGAGCAAAGCCCACGACACCTTCTCGATGGCGACCGAGCTTCACGTAAGCATTTGCTATAGTTTCAACCGCATCCGGCTTTCCTTCGATAGCAAAGCAATCGAGAATGCCGTTTGAATCGTAATCCGATGCCGTTTTCCAGGCGACTTCACCGTCTCTTCCAAGCATCGGCATCTCATACGTCACCCACCGTTTGTTGGGGATATCGGCAACCGCCTCGGCGTAGTGCAATGCGGTAACGGAGTTTAGCGGCGCACCCAACAGCAGGGCCTTCCCGCCAAGGCGAACGAACCGCTCGACGGGCGATCCTTCCCCCAAGGCGTGACCGAGTTCGTGAGGCTCCGTCAGCGTTTCAGCCAGCGGACCAACCGCGACCATCGATGCATCGGGGTGCGCGCTGCGCCGCGCGCCGGGGGCTTGAACCAGAAATTGATTCAGCAGGCCGAACCCACGGTAAGTCCCGGCTGTTGCGGGATCGAACGGCAGCCAGGTACGGCGGGCTTCGTCATCCAGCCGAGCGCCATTCAGAGTCTCCTCGTAGGGTGATCGGTCCCACGACGCGTATCCCATCACAGTGCCAGTCGGCCCAACCGCGGAGCGTAACGCGGCAACGACCGTCTCCGCTCCTCCTTCGACCGGACCAATCGCTTTAAGTGAGGCATGCACCATCAAGAGGTCACCGGTTTGGACTCCGAGTTTTTGAAGCGCCTCCGTTATTGCCTTCCGCGTATGCATCGCGATATCTCCTCTAAACTGCAAAACACTATACCTATCGAGATATCACTCTACTATACCTATCGAGATATAGAGGTGGTCCCACTTGTTTGAACAACTAAAAGCGTATTTATAAGTGATATTCCGCTCTAGTTAAGCCACCTTGTTTTGTTGGGGTAGCTGATCATAGTAAAACTCATTTGGTGTCATTTTGTCTAGACTCGAATGAGGTCGTTTCAAATTATAAAACTCAAAATATGCACTTAATTGCTTTTTCGCATCTGTGACACTGCTATAAGCTTTGAGATACACCTCTTCATATTTAACGCTCCGCCATAATCGTTCAACCATCACATTATCTACCCATCGACCTTTACCATCCATACTGATTTGAATGCCATTTGATTTCAATACATCAATAAATGCATCACTGGTAAACTGGCTGCCTTGGTCTGTATTAAATATTTCAGGTCGACCATATTTTTCAATCGCTTCATTTAAAGCCGAAATACAAAAATCCACCTCCATACTAATCGATACCCTATGCGCAAGTACCTTGCGGCTATGCCAATCAATCACAGCACATAAATAAACAAAGCCTTTTGCCATAGGGATATACGTTATATCCGTAGACCACACTTGATTACTGCGCTGAATAGCCAACCCTTTGAGCAGATATGGATATTTACGGTGAGCTTGATTAGCCTGGCTTAAATTTGGTTTGCAATATAACGCCTGAATACCCATTTTCTTCATTAAAGTACGTGTATGACGTCGTCCTATATGATGTCCTTGACGATTCAACAAATCACGCATCATACGACTGCCTGCAAAAGGATATTGCATATGTAATTCATCAATACATCGCATCAGCTTCAGATCTGATGCACTCACAGGTTTTGGGCGATAGTAATAACAACCACGGGAGACTTTCAGCAGCTTAGCTTGCTTAGATACTGAAATCTGAAGTGAGTCGTCGATTAACTTTTGTGGTTGAAGCGGCCCAGTTTCTTCAACACACCTTCTAAAAAATCAATTTCTAATGCCTGCTCACCGATTTTTGCATGTAGTTTTTTTAGATCGATGGGTGGTTCTGTTGGAGCTTTTGATTGATCGAAAGCTTGCGAGGAAGCTGAGATCAATTGATTTTTCCAGTCAATAATTTGGTTTTGATGAACATCAAACTCAGCACTCAATTCAGCAAGTGTTTTTTCTGCTTTAATCGCAGCAAGTGCTACCTTAGCTTTAAAATCATTTGAATGATTTCTTCTTGGTCTACGTGCCATAAAATACTCCATATATTGATGTTTATAACATCATTTGAGGAGCAGAATATCACTTATAGGAGTTGTTCAAATTTACGGATCCATCTCTGTGCCTCGTTGAGGTTCTGAACTTCATCGATGATCAGCACTCCGAGTGCGTGCAGGTTGGCCACCTGGCACATGGATGCCATCAAACGTTTGGTACCCAGCTTCTTGCGTCCGTGGCTACGGCTGTAGTGGGTGCCCAGGATCTTGTCTACCTCGTTGAAGAAGCTGAGGCACAGCTCATCCAGGTCACCGTCAATGGGGCAGTCGACCTTCAGATAGACCAATTGAGTGATGTTGTAATCGGGGTGATGGAGAGCCTGGGGGTACATACCCAGAATCCGCTCCAGCGTGCGCGTCTTGCCGCAACCGGAGCAGCCAAACAGCGACAAACTGTTGGCTGTGGAGGTAACACTCTGGTACACCGCCGCATCAAGATCCTCCTCCTCCACCCGGCGATAGCCATTCTGCAGGTGTGCATACCAGGCACCGCTGGCCGGGTTCCGACCGATGTAGCCCTGACGGATCATCAGGCTGATCTTGCTCTCCAGCTCCAAGTGGTGGCTGAGTGGCTGAAAGAAGCCGTGCAACAGACGGGCAATGGCATGAGCCCGGAGCCGACCATCCAGAAGCGCCTCCTGCGGCTCGAAACTGGGAAGTTGCTGCATCAGGCCGACCACTTCCTGCAGATCAGGAATCGGAGGTAATGCACTGATGAGTGGATTGTCCTGATACTCGGGAAGCTGCTGCTCATGATAACGCGCCAGGGGAATCACTCCGCGCTGTAATTCTTCAGTCATCTTCTTCCTCCTTGAATATCAGGTCACTGAGATCCGGAAAGGCATAATCTTCCTGCTTCTCGCCCCGCAGTGGGATCACCTCCGCGGGTTTCGCCCGCTGTGCTTTCTCAGGTTTAAATGCTGTTTTCAGGCGCTCCTGGCGCTTTTCCTGCTGCTTGTTCTCGCGGATCTGGGTGCCCAGATCTTTCTTGCTAATGCCAGTTTTGAGCGGACTGGCGTTCTCTGCCTGGGCAACAATCGACTCGATCTGCTCCAGAAGCTTGCCTCTTTCCGCCAAGGCTTTTGACGCCGCATTGACATCGCTCCGCCGTTCCTCTCTGGACAGTATCCAGACATCCCAGAAGGTCATCCCCCTGAAACGTCGACTACGGTCGGCAAGATCGCAAACCCAGTAATCCTTGAGACTATTGGACGGCCGCAAGTAGATATGGTCCGCACTACGCGGGTCGTATGCGACCGTTACCCCTGTCGGCCGACGCCCCTGGCCTCGGTGAAACCAACCCTCCCTGATTGCTTCCGGGCAACTGTAGAAGCAGCCGAATAGCCTGATTCCCAGCTCCGAAACCGTGGCCGACTCATGGGACAAGAGATTGATCCACACCAGTTCCTCCGGCGCAGTGCGCAACCGACCGGTCAAGCTGGCCAAGCCCCAGTTCCACAGCATGACTGGAATCGCCGGCAGATCGCCCGGCATTCCCGCAGCCCTGTCGTATTTGCTCAGGGTGTGGAAATTGTTGTGGTGGAGGATGCCGGCAATGATGATTTTCGTGAATTCGGGCAAGGTCAGACTGGCATCAAGCCTGTAGTCGTGGCCACCTCGCTTCCGGCTGGTAGTGTCCTCGACCACACCGCTGGCGTAAGGCTTGAAACGCTCCTGCACCGTTCGAAAATAGCGCTCGACGATGCCCTTGGCATCACCTCGCCTGGCCGGTGCGTTCTCGATGCGGACGCCGAAGGCTTGGGAAAATGCCTCAACCTTGGTGCCGTTCAGTTCGCCCTTATCGGCCAGTATCACATCCGGCAAACCCTTGACCGGCCAATCGCTGGCATCGATCTCCAAACCATACTGGCGACAATACTCGACCTTGTCGGCGACCGTGTTGGCCAATGCCACCATGGCACTGACCCAGGAGGGCCCCTCGAATCCGACGTACATACCAACGACCATGCGGCTGAACACGTCGAGCACCATGTAAACGACAGGCCTGCCGACGATAAGGCTGCGGTCATGCTCTGAAACCAGATAAATATCTGCGATGGTGGCATCTATTTGGTAACGGTATCCTGGACCCAGGGTCTCAGTCGTCGATGTGCTGTTGAGCGGCCGGAAGTCCTTGGCAAAGTCCACTGCGGACATCCTGCGCGGCAAGGTGTCGGTGAAATGATATTCGCGCCCATAGAAGTACCGGAACTGCCCCAATGTAGGCAGTTCGGAGGTCGGCAGCTCCGGCTGAACAGCACGGAGCAGGTTCAGCCCGGAGGCGTAGGCATCAGGAATGGACGGATGTTTTTCCTTGAGCAAGCGTTCCTCGATGACCCGGCGAAAAATGCGCTCGATATCCGGCGTGACATTACGCCCTTTCCCCGCCATCACCACTCTTGGCCTGCCCAGCTTGGCTCTGTTCGGCTTTCGGCGCTTGCCACGGGCACCCGAGTTGACATAGTCCGGCAAGAGGGCGTTCCTACACATGCCTCGTTGCCAGTAGCGCCTCAGTAATCTGTACACCGTCTGTTTGGTGACACCATGGCGTTGCATGATGCTTCCGACAATGAGCCCTCTGGGGCGCCGCACGAAAAGCTGGGGATCGTGCATATAATCCGCCAGCATCGCCCACGCCTCATCACGTTTCAGCTGGTCAGGGGAGCCTGCCTCCACCTCCCGAAGGACCGTCTCCTCGAAGGGATCGCCAATGCTCTCCAGCTCACCCTCAATAATCAGACGCTCCAGCTCTGCAACCGATATCGATTCGGGCAGCGCCGTGTCCAAATCGATATCAATCCAGACAGCCTGCTCGGTACCGGACCAGAGCAGGCGCTTTCGGAGCTCTCCCATACGGAACACCTGATTAACGGACAACACTGATCAGCTCCTGATGAATCTGATGAGGGTTGGCAGCTAGATCCTTCGGCTTCAGCACGCGGTAAGGGGTGTCAAGATCGAAGAGAAACCAGTGCCGAGCCAGCAGCTGGCGTAGCCAATAGAGTGCTTGACCAGCCTCCAGTTGGCCGGAAGTATCCATTTCCTGGGCAATGGCGGTCAGCTTCCGGTCTGGGTGACGCTGAAACTCGAGCAAGAACAGCTGCTGATAGTGTGCGAGATCATCCTGTGCAATGTCGTCTTCAGCATGGGCCGGATAGAGCCATTGGATATTGGCAAATGCCTCTCTGGATACCTCGCGCTCCGTGATGATGAACCAGGGAATCCCTTTTTCCTGCCAGTAGCGCCTTTCAAGTTCAAGTCTCTCGATGACTTCCGGCTTCTGCAGATCGGCACTGTACTTGGCCTGGATCGCGACGGATGGGCGCTGGGGGTCATCAAAATCCACCAGGAAATCGCTGGTCAGAACCTGAGGGATTCCTTTGTAGCGACCGTGAGCAAGGCCAAGTTCCTCGGCGATGCGCACCGTATCTTCGGCCCGCATCGGGAACTGCTCACGGATATCAGTGACCTGGGGAGACCGATCCAAGGTCAGGAAAATGGCCAGTTCAAGATCGGATAAAAGATGGTGCAGACGCCGGGTCTTGCTGCCAGGCAGGCGGTGGGAGCGCCCCAACGAGGACACATCCCTGGTGTAGATGAACGGCTTGTAGTCCCGTCCCTGGCCTTGGCCACGCCCCTCCTTGAGCCTTCTATCAATCTGTGCCTGGGTCAGGCCCTTGAATGTTCCAGACATGAAAAAGCCTGCATCCACCGTGTCGATGACCTCAACATAGCGGATACAGGCTTAGGATGAAACTTTATTTGTAAGGATGATACTTTATTTGTACGGATGAAACTTTATTTGCATCCCACAGTTGCCGGAAATGGAGCAACTGGCTCTGGCACTGCTGCGCAAGGTCGGCGTGCGCATGCTGGTGATCGACGAGCTGCACAACGTGCTGGCCGGCAACAGCGTCAACCGCCGGGAATTCCTCAACCTGCTGCGCTTCCTCGGCAACGAACTGCGCATCCCGTTGGTTGGGGTAGGCACGCGCGACGCCTACCTAGCCATCCGCTCCGATGACCAGTTGGAAAATCGCTTCGAGCCGATGATGCTGCCGGTATGGGAGGCCAACGACGATTGCTGCTCACTGCTGGCCAGCTTCGCCGCTTCGCTCCCGCTGCGCCGGCCTTCCCCAATTGCCACGCTGGACATGGCTCGCTACCTGCTCACACGCAGCGAGGGCACCATAGGGGAACTGGCGCACTTGCTGATGGCGGCGGCCATCGTCGCCGTGGAGAGCGGCGAGGAAGCGATCAACCATCGCACACTCAGCATGGCCTGTTGAGTTGCATCTAAAATTGACCCACTGGGGGTGCGGACGATTTCTTGGACGGTTTATACGGACATCAATCCGACCGCATGACGATACTCGATGGGACTACGCCCGCCAAGCGACACTTTGATGCGGCGCTCGTTGTACCAGTGGATATAGGCATCGATTCGCGTCATGAGGTCTTTCAGCGTCACGTGCTGCCAATTCCTCGGGTAGATTAGTTCGGTCTTCAATCGTCCGAAAAAGCCCTCGCATGCAGCATTGTCTGGCGAGCAGCCCTTTTTGGACATCGACCGCGTTAATTGGGCATTTTCAGTGCGGCGGATCCACGCAGGCCAGCGATAATGCGAGCCCCTGTCCGAATGGATAACCGGATGCTCACCGGGTCGCAGTGTCCGTACCGCGTGATCCAGCATGGTATTGACCAGGTTCGCATCCGGGCTGGTGCCGATATTCCAGGCCACCACCAGCCCATCGAAGCAATCGACGATCGGCGAGACGTAGACCTTCCCTGCCGGAATGTGTATTTCCGTCAGATCGGTCAACCATTTCGTATTCGGCGCCGACGCGTGAAAGTCGCGATTCAGCAGATTCGGGACCGCTGGTGTCGGGTCGCCAGCATACGCCGAGAAGCGCCGGCGGCGCGGTGTTCTCACGACCAGACGCTCTTGCGCCATCAAGCGACGCACGACCTTCTCGGACACACGCATGCCACCAAGGCGCAAGGCACTATCAATGCGTCGATAGCCATAGCAGCGGTAGTTGTCCTCGAAGATAGTCCGAATGACCTCACGCACCTGCGTGTACTTGTCGGGCCGCGTCTGCCGCAGGCGTTGATAGAAGTATGTGCTGCGCGCCAGCTTCAGGCCGCACAACAGATTGGCTAATGGAAACGTGACTCTGAGGGCATCAACCACCTTCGTTTTTTCTCGGCTTGTCAGTTCGAGGGGGTTGATGCCCATGTCTTTTTTTATCAATTCACTCGCCTTCTCCAGAATTGCATTCTCCATGCGAAGCCGCTGGTTCTGGCTCTCCAGTTCGGCCAGTTCCCTGAGTAGTGCCTCATGCCGCTGCTCGAGCGAGGTGTCACCTTTCTTCTTTGTCATGGGTTTTAGGGGCACTTTGCCAAGTAATCGATGCTGCCAGTTATACAACGTTGGTCGCGATACACCGACAGTGTCGGCCACATCCTTTGCCGAACCTACGCGCAGGTTCAGTGCAATGACGGCTTGCTGCTTCTCGAGGCGAGAGCGGGCGACTGTGGGAGCGCTGCTGCCGACGACCGTCCTAGCGAATTCAGGGCGTAAATCACGGATCCAGGCACGCAAGGCCTCGCGGCTTGGGTAGCCCAGGCTTCGGATTGTGTGACTCAGGCAGTAGCCTTGTTCGATATAGTGATCTACTGCCCGTTGCTTTTGCTCATCGGTGTACTGCCGTTTTATCCGTTGATAGCCTCGGCGAAGATCCTGATTCCGTTCGAATTCTGCCAACCAGGCCTTCAGCGAGTTCTTGGTGGGGTATCCCAGCTGCCGTAGTGTGGCGCTCATCCGGCGCCCAAGCTTCAGGTACAACCTCACGGCTCGAAGGCGATCTTCATACGAATACATGAACTACTCCTAAAGTAGTCCAAGATTTTGTCCGCACCCCAACTTAGGGTAAAGATTTGCGTCGAAATTTGACCCACGTATGACACTGTTTCCCGTCTGGATATGGCGGGAGAAATCAAGGAGTGATAAACGTGGCGATATTGAGCGCAATTCGACGCTGGCATTTTCGCGATGGTGCGTCGATTCGGGAAATAGCCCGACGAAGCGGCCTGTCCAGGAACACCGTTCGCAAGTATTTGCAAAGCAAGGTGGTTGAACCGCAGTACCCAGCGCGAGACAGCGTTGGCAAGTTAAGTCCTTTTGAGCCCAAGTTAAGGCAGTGGCTCTCCACCGAGCACAAAAAGACAAAGAAGCTGCGCAGAAACCTGCGCAGCATGTACCGGGATTTGGTCGCTTTGGGCTTTACCGGGTCTTATGACCGAGTGTGTGCCTTTGCCCGACAGTGGAAAGATTCCGAACAGTTCAAGGCGCAAACCTCGGGCAAGGGTTGTTTCATCCCCTTGCGCTTTGCTTGTGGCGAAGCCTTCCAATTCGATTGGAGTGAGGACTTTGCCCGCATAGCGGGCAAACAGGTCAAACTTCAGATTGCCCAGTTTAAGTTGGCCCACAGCCGGGCCTTTGTGCTTCGGGCTTACTACCAGCAAAAACATGAAATGCTGTTTGATGCCCACTGGCATGCCTTTCAAATCTTCGGTGGCATTCCCAAGCGCGGCATCTACGACAACATGAAGACCGCTGTGGATTCGGTGGGGCGTGGCAAAGAGCGCAGGGTCAATCAGCGGTTCACTGCCATGGTCAGCCACTACCTGTTTGATGCGCAGTTCTGTAATCCAGCATCGGGTTGGGAGAAAGGCCAGATTGAGAAGAACGTGCAGGATTCCCGCCAACGCCTGTGGCAAGGGGCACCAGACTTTCAAAGCCTTGCTGATTTGAATGTGTGGCTTGAGCATCGCTGCAAAGCGCTGTGGTCTGAGCTGCGCCACCCCGAATTGGACCAAACCGTGCAAGAGGCCTTTGCCGATGAACAAGGCGAGTTGATGGCGCTACCCAATGCCTTTGATGCATTCGTGGAGCAAACCAAGCGAGTCACTTCAACCTGCCTTGTTCACCACGAGGGCAATCGCTACAGCGTTCCTGCCAGTTACGCCAACAGGGCCATCAGCCTTCGGATTTATGCAGACAAGCTGGTGATGGCTGCCGAAGGCCAACACATTGCCGAGCATCCAAGATTGTTTGGCAGTGGCCACGCTCGGCGTGGCCACACACAATACGACTGGCACCATTACTTGTCTGTGCTTCAGAAGAAACCTGGGGCGTTGCGCAATGGTGCGCCATTTGCTGAATTGCCACCCGCGTTCAAGAAGCTTCAATCCATCTTGCTGCAACGCCCCGGCGGTGACCGTGACATGGTGGAAATTCTGGCCCTTGTATTGCACCACGATGAAGGTGCGGTACTCAGTGCTGTGGAATTGGCATTGGAGTGTGGCAAGCCATCGAAGGAGCATGTGCTTAATCTGTTGGGACGTTTGACCGAAGAACCTCCACCCAAACCGATTCCAATTCCCAAGGGGTTAAGGCTGACATTGGAACCACAGGCCAACGTGAACCGCTATGACAGTTTAAGGAGAGCCCATGATGCAGCATGAAGGCCATGTGAGAATCCTCAAATCCTTGAAACTCTTTGGCATGGCACACGCCATTGAGGAGTTGGGCAATCAGAATTCACCAGCATTTAATCAAGCCTTGCCCATGCTGGACAGCTTGATTAAAGCTGAAGTGGCAGAGCGTGAAGTACGTTCGGTGAACTATCAATTGCGGGTGGCCAAGTTCCCCGTGTATCGGGACTTGGTGGGCTTTGACTTCAGTCAAAGCCTGGTTAATGAGGCCACGGTCAAACAATTGCACCGGTGCGACTTCATGGAACAAGCCCAGAACGTGGTGCTGATTGGTGGGCCAGGCACAGGCAAGACTCACCTGGCCACAGCCATTGGTACACAAGCAGTGATGCACTTGAACCGACGGGTGCGTTTCTTCTCCACCGTGGATTTGGTCAATGCACTGGAGCAAGAGAAATCATCTGGGCGTCAGGGACAAATCGCAAACCGTCTGTTGTATGCCGATTTGGTGATTCTGGATGAGCTGGGATATTTGCCTTTTAGCCAAACCGGTGGGGCACTGCTGTTTCACCTGCTCTCAAAGCTGTACGAAAAAACCAGCGTGATACTGACCACCAACTTGAGCTTCTCGGAATGGAGCCGAGTGTTTGGCGATGAAAAGATGACAACAGCGTTGTTGGACCGACTAACCCACCACTGCCACATCCTGGAAACCGGCAATGAAAGTTACCGCTTCAAACACAGTTCAACTCAGAATAAGCAGGAGGAAAAACAGACCCGCAAACTGAAAATCGAGACATAATTCTGACAACAAGGGGTGGGTCAAAATTCAATGCAAATCCCGGGTCAAATTTGGGTGCAAATCAACAGCGTGGGTTTGATCACGTGTCGAAACCCTGGCGTACCCCACCAACATGCTGTTCTCCTCTCAAAACACTTAGAAACGTAGATCGGTGAGAGATCGGAAGCGAGAGGCAGTTTTGAGAGACCTTCAACCTTCGGCAGACTGTCGGCTTTGGTATCTCTCATAAACGGATGTTTTTGAGAGAACTATCTTCGGCCTTCACACGCACGAAAGGCGGCGAAGCTCCGCCGTTAATCCGTCCGCCGGAGATCTCGCCCAGGCAGGCTGAAGGCCGAGCAAGCCTGACAGGCCCGAAAAGCCCGGCACGGGCGTCGGCGGCGATGACGGCGGCGGCATTATCCAGGGTTGATGATGGAAGTGGAGGATATCGACAACCTCTCGCGCAACCAAGACATCGCGGTCGGACTGCAAGTGATCTTGAAGCCACGGGCCCGTCCCACCCCGACATGGACCTCGATGCCCGAACGGACGTTAGATTTCGAGTTCTAGGCGTTCTGCGATGAAGGTTGGATCCCAGCCGGGATTGAAAGTGTCGACGTGGGTGAATCCGAGCCGCTCGTATAGGCCACGCAGGTTCGGGTGGCAGTCGAGCCGCAGCTTGGCGCACCCCTGCGTTCGCGCGGCATGGCGGCAAGCCTCGATCAGCGCGGAGCTGACACCCCGGCCCGCATGTGTCCGTCGCACCGCGAGCTTGTGCAGATATGCGGCCTCCCCCTTGAGGGCGTCGGGCCAGAACTCGGGATCCTCGGCCGACAAGGTGCAACAGCCGACGATGCCGTCGCTGCAACTCGCGACTAGGAGCTCGGATCTCAGGACGAAGGTCTCCGCGAATGTCCGGTCGATCCGCGCGACGTCCCAGGCGGGCGTTCCCTTGGCGGACATCCACGCCGCAGCGTCGTGCATCAGCCGCACAACCTCGTCGATATCACCCGAGCAGGCGACCCGAACGTTCGGAGGCTCCTCGCTGTCCATTCGCTCCCCTGGCGCGGTATGAACCGCCGCCTCATAGTGCAGTTTGATCCTGACGAGCCCAGCATGTCTGCGCCCACCTTCGCGGAACCTGACCAGGGTCCGCTAGCGGGCGGCCGGAAGGTGAATGCTAGGCATGATCTAACCCTCGGTCTCTGGCGTCGCGACTGCGAAATTTCGCGAGGGTTTCCGAGAAGGTGATTGCGCTTCGCAGATCTCCAGGCGCGTGGGTGCGGACGTAGTCAGCGCCATTGCCGATCGCGTGAAGTTCCGCCGCAAGGCTCGCTGGACCCAGATCCTTTACAGGAAGGCCAACGGTGGCGCCCAAGAAGGATTTCCGCGACACCGAGACCAATAGCGGAAGCCCCAACGCCGACTTCAGCTTTTGAAGGTTCGACAGCACGTGCAGCGATGTTTCCGGTGCGGGGCTCAAGAAAAATCCCATCCCCGGATCGAGGATGAGCCGGTCGGCAGCGACCCCGCTCCGTCGCAAGGCGGAAACCCGCGCCTCGAAGAACCGCACAATCTCGTCGAGCGCGTCTTCGGGTCGAAGGTGACCGGTGCGGGTGGCGATGCCATCCCGCTGCGCTGAGTGCATAACCACCAGCCTGCAGTCCGCCTCAGCAATATCGGGATAGAGCGCAGGGTCAGGAAATCCTTGGATATCGTTCAGGTAGCCCACGCCGCGCTTGAGCGCATAGCGCTGGGTTTCCGGTTGGAAGCTGTCGATTGAAACACGGTGCATCTGATCGGACAGGGCGTCTAAGAGCGGCGCAATACGTCTGATCTCATCGGCCGGCGATACAGGCCTCGCGTCCGGATGGCTGGCGGCCGGTCCGACATCCACGACGTCTGATCCGACTCGCAGCATTTCGATCGCCGCGGTGACAGCGCCGGCGGGGTCTAGCCGCCGGCTCTCATCGAAGAAGGAGTCCTCGGTGAGATTCAGAATGCCGAACACCGTCACCATGGCGTCGGCCTCCGCAGCGACTTCCACGATGGGGATCGGGCGAGCAAAAAGGCAGCAATTATGAGCCCCATACCTACAAAGCCCCACGCATCAAGCTTTTGCCCATGAAGCAACCAGGCAATGGCTGTAATTATGACGACGCCGAGTCCCGACCAGACTGCATAAGCAACACCGACAGGGATGGATTTCAGAACCAGAGAAAGAAAATAAAATGCGATGCCATAACCGATTATGACAACGGCGGAAGGGGCAAGCTTAGTAAAGCCCTCGCTAGATTTTAATGCGGATGTTGCGATTACTTCGCCAACTATTGCGATAACAAGAAAAAGCCAGCCTTTCATGATATATCTCCCAATTTGTGTAGGGCTTATTATGCACGCTTAAAAATAATAAAAGCAGACTTGACCTGATAGTTTGGCTGTGAGCAAATTTAAGGGGAAAACGTGTCCCGGGTCAACTGCAGGGCGAGATCGCGAACGTTTTTTAAACAATAAACGATCAGTTTGACGATCTGATCTAGTTGTGATCATATACTCCTTTTTTGGGAGCATGCCATGTACCTTGAACTCTTTACTACACACTTTGACACCATCATCGACAACCGTCAGTCTGCAAAAGTTACCTATCCTTTATCTGATGTTTTATTCGTGACCTTATGTGGCGTGATTGCAGGCGCAGAGGGATGGTCGGAGATCCATGATTATGCTAAAGGTCATCACGAGTGGTTTCAGAAACAAGGTTTTTTAAGTGATGGCGTCCCGGTAGATGACACGATAGCGCGTATTATTTCCAAAATTGCTCCGGAACAATTCAGGCAATGTTTTATCAACTGGATGCAAGCGGTGCACAAGCTAACGCAGGGAGAAGTGATTGCCATAGATGGCAAAACGCTACGTAGCTCTTACCATCCAGAAGACAGAAAATCGACCATCCATATGGTGAACGCCTTTGCTTGTGCGAACAAAGTGGTGTTAGGGCAGCTGAAGACCGTGGAAAAATCGAATGAAATCACAGCCATCCCAGAGCTGATTCGATTGCTGGATATTGAAGGCGCCCTGGTTTCAATAGATGCGATGGGATGTCAAACGGCGATAGCAGAGCAAGTGATAGAAGGGAATGGGGATTATCTGTTGGCGCTTAAGGCAAACCAGGGAACGTTATACAACGCAGTAGAAGCGTTATTTGCCGGGCAACGAAGTCGCCCTCTTGATGGGATTGTCATAGAAAAGAACCGAGGCCGAATAGAAGCAAGAAGTTACCATGTAAAAGACGCCAGCGAACTGAAGGGAAACTTTAGTAAATGGGTTGGTCTGCAAACCGTTGGAATGAATCTAAGTTACCGGGAAGTAAAAGGAAAAAATCGGAACTCACTTATCGTTACTACATCAGTTCAGCCAAGCTGAACGAAGTACAGTTAGCCGAGGCGGTTAGAGCTCATTGGGCCGTTGAAAATAGCCTACATTGGGTGCTGGATGTCAGCATGAAAGAAGATGCTTGCCAGATTTATCAGAATCACGCGGCGGAAAACTGGTCAATACTACGGCAATGGTCTTTAAATATGCTAAGAGCAGAGCCATCGAAAGGCAGCATCCCCGCAAAACAAAAACGTGCCTGGATGAAAACGGATTATTTGGAAGATGTCTTAAAAGCTGGTTTCAGCAGCAGAGTGTTTGAAAATTAAACACTCATGCGGGAGCCCTGGGGTCAACTGCTTTTCAAGAGTAAGCAACTTCTGATAAGATACTTTATGTCAGCAGGGTCTGTTTTCGGGTTGCGGTTTTGCTGAATGCGGGGCGTAGTTTCCTAAATCGATAATTTAACCAGATAGGAGTACAGACATATGAAAATCGTAAAAAGGATATTATTAGTATTGTTAAGTTTATTTTTTACAGTTGAGTATTCAAATGCTCAAACTGACAACTTAACTTTGAAAATTGAGAATGTTTTAAAGGCAAAAAATGCCAGAATAGGAGTAGCAATATTCAACAGCAATGAGAAGGATACTTTGAAGATTAATAACGACTTCCATTTCCCGATGCAAAGCGTTATGAAATTTCCGATTGCTTTAGCCGTTTTGTCTGAGATAGATAAAGGGAATCTTTCTTTTGAACAAAAAATAGAGATTACCCCTCAAGACCTTTTGCCTAAAATGTGGAGTCCGATTAAAGAGGAATTCCCTAATGGAACAACTTTGACGATTGAACAAATACTAAATTATACAGTATCAGAGAGCGACAATATTGGTTGTGATATTTTGCTAAAATTAATCGGAGGAACTGATTCTGTTCAAAAATTCTTGAATGCTAATCATTTCACTGATATTTCAATCAAAGCAAACGAAGAACAAATGCACAAGGATTGGAATACCCAATATCAAAATTGGGCAACCCCAACAGCGATGAACAAACTGTTAATAGATACTTATAATAATAAGAACCAATTACTTTCTAAAAAAAGTTATGATTTTATTTGGAAAATTATGAGAGAAACAACAACAGGAAGTAACCGATTAAAAGGACAATTACCAAAGAATACAATTGTTGCTCATAAAACAGGGACTTCCGGAATAAATAATGGAATTGCAGCAGCCACTAATGATGTTGGGGTAATTACTTTACCGAATGGACAATTAATTTTTATAAGCGTATTTGTTGCAGAGTCCAAAGAAACTTCGGAAATTAATGAAAAGATTATTTCAGACATTGCAAAAATAACGTGGAATTACTATTTGAATAAATAAAAAACTACCGCTAACACTGGCTCATAGGCAATGGCGGGTTGAAGTGCAATTTGCAAAGTCGGTAGCCCGCCCGAGCGTTTTCTCGGTTTGACAGGAAAGGCTCACGCAAACCGCCACTGCCCATAGCCCAAACCGTTATGGTTCAGTGGTGAAAAAAAGCGAAATTGGTATTTTTAGGTTTTTGTGCTTTTGGAGACTGGAAAAGAAAATTGAGTGTTTGTTGCGGAACTGAAAAATGGAAAGTTTCTTGAAAGGCTACCGTTGAGAGGAACAAAAAAGAAAAAAAGAAAACTTGGACGCAGATTGTTCCGGTCTATGCTCCTTCGGAGCGAATGGTTCTTCAGCGGAATCTGCATGGATCTTAAGTCTGCAAGGGTTTGCGTTGTGGTTGCGACCAACGAGAAACGGCTCCGAGAGTTTGGCAGTTCAAGCTCGCAACGTAAAGAAAAATTGAAGTTTTTTGTTTTTTAAGTGAAAATGAATACTTTTGGGAAACGGTAGCAAATTCTGAAAAATAAAGAATTGACCGCAACAAAATAGAGAAAAGAGAAGCCCCGGTTGAAATTTCTTTTTAGGAAACTACGCTCCGTACATCAAAACAAACCCCGGAAAAGTCCCCAGCCTCCTGACATAAAACACCTTATCAGAAGCTGCATCCCCGCTTAAAACAGTTGACCCCGTGCGCGAAATCCCCTTAAATTTAATCCGTTAGCGAGGTGCCGCCGGCTTCCATTCAGGTCGAGGTGGCCCGGCTCCATGCACCGCGACGCAGCGCCGGCAGGCAGAGCAAGTAGAGGGCAGCGCCTGCAATCCATGCCCACCCGTTCCACGTTGTTATAGAAGCCGCATAGATCGCCGTGAAGAGGAGGGGTCCGACGATCGAGGTCAGGCTGGTGAGCGCCGCCAGTGAGCCTTGCAGCTGCCCCTGACGTTCCTCATCCACCTGCCTGGACAACATTGCTTGCAGCGCCGGCATTCCGATGCCACCCGAAGCAAGCAGGACCATGATCGGGAACGCCATCCATCCCCGTGTCGCGAAGGCAAGCAGGATGTAGCCTGTGCCGTCGGCAATCATTCCGAGCATGAGTGCCCGCCTTTCGCCGAGCCGGGCGGCTACAGGGCCGGTGATCATTGCCTGGGCGAGTGAATGCAGAATGCCAAATGCGGCAAGCGAAATGCCGATCGTGGTCGCGTCCCAGTGAAAGCGATCCTCGCCGAAAATGACCCAAAGCGCGGCCGGCACCTGTCCGACAAGTTGCATGATGAAGAAGACCCCCATCAGGGCGGCGACGACGGTCATGCCCCGGGCCCACCGGAACGAAGCGAGCGGGTTGAGAGCCTCCCGGCGTAACGGCCGGCGTTCGCCTTTGTGCGACTCCGGCAAAAGGAAACAGCCCGTCAGGAAATTGAGGCCGTTCAAGGCTGCCGCGGCGAAGAACGGAGCGTGGGGGGAGAAACCGCCCATCAGCCCACCGAGCACAGGTCCCGCGACCATCCCGAACCCGAAACAGGCGCTCATGAAGCCGAAGTGCCGCGCGCGCTCATCGCCATCAGTGATATCGGCAATATAAGCGCCGGCTACCGCCCCAGTCGCCCCGGTGATGCCGGCCACGATCCGCCCGATATAGAGAACCCAAAGGAAAGGCGCCGTCGCCATGATGGCGTAGTCGACAGCAGCGCCGGCCAGCGAGACGAGCAAGACCGGCCGCCGCCCGAAACGATCCGACAGCGCGCCCAGCACAGGTGCGCAGGCAAATTGCATCAACGCATACAGCGCCAGCAGAATGCCATAGTGGGCGGTGACGTCGTTAAATTTAAGGGGAAAACGTGTCCCGGGTCAACTGCTTTTCAAGAGTAAGCAACTTCTGATAAGATACTTTATGTCAGTCGGGTCTGTTTTCGGGTTGCGGTTTTGCTGAATGCGGGGCGTAGTTTCCTAAATCGAAATGCTAGGCATGATCTAACCCTCGGTCTCTGGCGTCGCGACTGCGAAATTTCGCGAGGGTTTCCGAGAAGGTGATTGCGCTTCGCAGATCTCCAGGCGCGTGGGTGCGGACGTAGTCAGCGCCATTGCCGATCGCGTGAAGTTCCGCCGCAAGGCTCGCTGGACCCAGATCCTTTACAGGAAGGCCAACGGTGGCGCCCAAGAAGGATTTCCGCGACACCGAGACCAATAGCGGAAGCCCCAACGCCGACTTCAGCTTTTGAAGGTTCGACAGCACGTGCAGCGATGTTTCCGGTGCGGGGCTCAAGAAAAATCCCATCCCCGGATCGAGGATGAGCCGGTCGGCAGCGACCCCGCTCCGTCGCAAGGCGGAAACCCGCGCCTCGAAGAACCGCACAATCTCGTCGAGCGCGTCTTCGGGTCGAAGGTGACCGGTGCGGGTGGCGATGCCATCCCGCTGCGCTGAGTGCATAACCACCAGCCTGCAGTCCGCCTCAGCAATATCGGGATAGAGCGCAGGGTCAGGAAATCCTTGGATATCGTTCAGGTAGCCCACGCCGCGCTTGAGCGCATAGCGCTGGGTTTCCGGTTGGAAGCTGTCGATTGAAACACGGTGCATCTGATCGGACAGGGCGTCTAAGAGCGGCGCAATACGTCTGATCTCATCGGCCGGCGATACAGGCCTCGCGTCCGGATGGCTGGCGGCCGGTCCGACATCCACGACGTCTGATCCGACTCGCAGCATTTCGATCGCCGCGGTGACAGCGCCGGCGGGGTCTAGCCGCCGGCTCTCATCGAAGAAGGAGTCCTCGGTGAGATTCAGAATGCCGAACACCGTCACCATGGCGTCGGCCTCCGCAGCGACTTCCACGATGGGGATCGGGCGAGCAAAAAGGCAGCAATTATGAGCCCCATACCTACAAAGCCCCACGCATCAAGCTTTTGCCCATGAAGCAACCAGGCAATGGCTGTAATTATGACGACGCCGAGTCCCGACCAGACTGCATAAGCAACACCGACAGGGATGGATTTCAGAACCAGAGAAAGAAAATAAAATGCGATGCCATAACCGATTATGACAACGGCGGAAGGGGCAAGCTTAGTAAAGCCCTCGCTAGATTTTAATGCGGATGTTGCGATTACTTCGCCAACTATTGCGATAACAAGAAAAAGCCAGCCTTTCATGATATATCTCCCAATTTGTGTAGGGCTTATTATGCACGCTTAAAAATAATAAAAGCAGACTTGACCTGATAGTTTGGCTGTGAGCAATTATGTGCTTAGTGCATCTAACGTTTGACATGAGGGGCGGCCAAGGGCGCCAGCCCTTGGACGTCCCCCTCGATGGAAGGGTTAGGCATCACTGCGTGTTCGCTCGAATGCCTGGCGTGTTTGAACCATGTACACGGCTGGACCATCTGGGGTGGTTACGGTACCTTGCCTCTCAAACCCCGCTTTCTCGTAGCATCGGATCGCTCGCAAGTTGCTCGGCGACGGGTCCGTTTGGATCTTGGTGACCTCGGGATCATTGAACAGCAACTCAACCAGAGCTCGAACCAGCTTGGTTCCCAAGCCTTTGCCCAGTTGTGATGCATTCGCCAGTAACTGGTCTATTCCGCGTACTCCTGGATCGGTTTCTTCTTCCCACCATCCGTCCCCGCTTCCAAGAGCAACGTACGACTGGGCATACCCAATCGGCTCTCCATTCAGCATTGCAATGTATGGAGTGACGGACTCTTGCGCTAAAACGCTTGGCAAGTACTGTTCCTGTACGTCAGCAAGTGTCGGGCGTGCTTCTTCTCCGCCCCACCACTCGACGATATGAGATCGATTTAGCCACTCATAGAGCATCGCAAGGTCATGCTCAGTCATGAGGCGCAGTGTGACGGAATCGTTGCTGTTGGTCACGATGTTGTTCAATGGAGGTTCCTTCAGTTTTCTGATGAAGCGCGGAGGTGGCTCAACCTGCGAAAAGAAACGAGTTGCTACGTAAGTCCGAGAACATGCTTTCCATGGTCTCTGAGCTCGCCTTGATGCCCGAGGCATAGACTGTACAAAAAAACAGTCATAACAAGCCATGAAAACCGCCACTGCGCCGTTACCACCGCTGCGTTCGGTCAAGGTTCTGGACCAGTTGCGTGAGCGCATACGCTACTTGCATTACAGCTTACCAACCGAACAGGCTTATGTCAACTGGGTTCGTGCCTTCATCCGTTTCCACGGTGTGCGTCACCCGGCAACCTTGGGCAGCAGCGAAGTCGAGGCATTTCTGTCCTGGCTGGCGAACGAGCGCAAGGTTTCGGTCTCCACGCATCGTCAGGCATTGGCGGCCTTGCTGTTCTTCTACGGCAAGGTGCTGTGCACGGATCTGCCCTGGCTTCAGGAGATCGGAAGACCTCGGCCGTCGCGGCGCTTGCCGGTGGTGCTGACCCCGGATGAAGTGGTTCGCATCCTCGGTTTTCTGGAAGGCGAGCATCGTTTGTTCGCCCAGCTTCTGTATGGAACGGGCATGCGGATCAGTGAGGGTTTGCAACTGCGGGTCAAGGATCTGGATTTCGATCACGGCACGATCATCGTGCGGGAGGGCAAGGGCTCCAAGGATCGGGCCTTGATGTTACCCGAGAGCTTGGCACCCAGCCTGCGCGAGCAGCTGTCGCGTGCACGGGCATGGTGGCTGAAGGACCAGGCCGAGGGCCGCAGCGGCGTTGCGCTTCCCGACGCCCTTGAGCGGAAGTATCCGCGCGCCGGGCATTCCTGGCCGTGGTTCTGGGTTTTTGCGCAGCACACGCATTCGACCGATCCACGGAGCGGTGTCGTGCGTCGCCATCACATGTATGACCAGACCTTTCAGCGCGCCTTCAAACGTGCCGTAGAACAAGCAGGCATCACGAAGCCCGCCACACCGCACACCCTCCGCCACTCGTTCGCGACGGCCTTGCTCCGCAGCGGTTACGACATTCGAACCGTGCAGGATCTGCTCGGCCATTCCGACGTCTCTACGACGATGATTTACACGCATGTGCTGAAAGTTGGCGGTGCCGGAGTGCGCTCACCGCTTGATGCGCTGCCGCCCCTCACTAGTGAGAGGTAGGGCAGCGCAAGTCAATCCTGGCGGATTCACTACCCCTGCGCGAAGGCCATCGGTGCCGCATCGAACGGCCGGTTGCGGAAAGTCCTCCCTGCGTCCGCTGATGGCCGGCAGCAGCCCGTCGTTGCCTGATGGATCCAACCCCTCCGCTGCTATAGTGCAGTCGGCTTCTGACGTTCAGTGCAGCCGTCTTCTGAAAACGACACCATGTGCAAACGATGTCAGAATAGAGTTAAATTTCCTATTGATTGACATATTCCGTCAAAGGTAATAGATTTCATCCTGACACTTTTGCCTTTGGAGGCATCTTGCAAGGTCAACGCATCGGCTATGTCCGCGTCAGCAGCTTCGACCAGAACCCGGAACGGCAATTGGAGGGTGTTCAGGTGGCGCGGGTGTTCACCGACAAGGCTTCTGGCAAGGACACCCAGCGTCCCGAGCTGGAAAGGCTGCTGGCCTTCGTCCGCGAGGGCGACACCGTGGTGGTGCATAGCATGGACAGGCTGGCACGCAACCTTGATGACCTGCGCCGCATCGTCCAAGGGCTGACACAACGGGGCGTGCGCATGGAGTTCGTCAAAGAAGGGCTGAAGTTCACCGGCGAGGACTCACCGATGGCCAATCTGATGCTGTCGGTCATGGGAGCCTTCGCTGAGTTCGAGCGCGCCCTGATCCGCGAACGTCAGCGCGAGGGAATCGTGCTGGCCAAGCAGCGCGGTGCCTACCGGGGACGAAAGAAATCGCTGAACAGCGAACAAATTGCCGAGTTGAAACGGCGAGTTGCGGCAGGCGACCAAAAAACCTTGGTGGCCCGTGACTTCGGCATCAGCCGCGAAACCTTGTACCAGTACCTGCGGGAAGACTGACCATGCCACGCCGCTCAATCCTGTCCGCCACCGAGCGCGAAAGCCTGCTGGCACTGCCAGATGCCAAAGACGAACTGATACGGCACTACACGTTCAACGAAACCGACCTGTCGGTGATCCGTCAGCGTCGCGGCGCCGCGAATCGATTGGGCTTCGCTGTGCAGCTTTGCTACTTGCGATTCCCTGGCACCTTTTTGGGCGTCGATGAGCCTCCGTTTCCGCCCCTGTTGCGCATGGTGGCCGCGCAACTCAAGATGCCAGTGGAAAGTTGGAGCGAGTACGGCCAGCGCGAACAGACACGGCGGGAGCACTTGGTCGAGCTGCAAACGGTTTTTGGGTTCAAGCCCTTCACCATGAGCCACTATCGGCAAGCCGTGCATACATTGACCGAGCTGGCCTTGCAGACCGACAAAGGCATCGTGCTGGCGAGCGCACTTGTCGAGAATCTGCGGCGGCAGAGCATTATCCTGCCCGCCATGAATGCCATCGAGCGCGCAAGCGCCGAGGCCATCACCCGTGCCAACCGACGCATTTACGCGGCGCTGACCGATTCTTTGTTATCACCCCACCGTCAGCGCCTGGACGAACTTCTCAAGCGCAAGGACGGCAGTAAAGTGACGTGGCTGGCATGGCTGCGCCAGTCGCCTGCCAAACCGAACTCTCGCCACATGCTCGAACATATTGAGCGCCTGAAATCCTGGCAAGCACTTGATCTGCCCGCAGGCATCGAGCGGCAGGTTCACCAGAACCGCCTGCTCAAAATCGCTCGTGAAGGTGGCCAGATGACGCCTGCTGATCTGGCAAAGTTCGAGGTGCAACGACGCTATGCCACGCTGGTAGCGCTGGCCATCGAAGGCATGGCCACCGTCACCGATGAAATCATCGACCTTCACGATCGCATCATCGGCAAGCTGTTCAACGCGGCCAAGAACAAGCATCAGCAGCAGTTCCAGGCTTCCGGCAAGGCGATCAACGACAAGGTGCGGATGTATGGGCGCATCGGTCAAGCGTTGATTGAGGCCAAGCAAAGCGGCAGCGATCCGTTCGCCGCCATCGAGGCCGTTATGCCCTGGGACACCTTCGCCGCCAGCGTCACCGAAGCGCAAACATTGGCGCGGCCTGCCGACTTTGATTTCCTGCACCACATCGGTGAAAGCTATGCCACGCTACGCCGCTACGCGCCGCAGTTCCTGGGCGTGCTCAAATTGCGGGCTGCGCCCGCCGCCAAGGGTGTGCTCGATGCCATCGACATGCTGCGCGGCATGAACAGCGACAGCGCGCGCAAGGTGCCCGCCGATGCGCCAACCGCATTCATCAAGCCGCGCTGGGCAAAGCTGGTTCTGACCGACGACGGCATCGACCGGCGTTACTACGAGTTATGCGCCCTGTCGGAGCTGAAGAACGCGCTGCGCTCCGGTGATGTCTGGGTGCAGGGTTCTCGCCAGTTCAAGGACTTCGACGAATACCTGGTGCCGGTCGAGAAGTTCGCCACTTTGAAGCTGGCCAGCGAATTGCCGCTGGCAGTGGCCACCGACTGCGACCAATACCTGCATGACCGGTTGGAATTGTTGGAGGCGCAACTCGCCACAGTCAACCGCATGGCTGCGGCCAACGACTTACCGGATGCCATCATCACCACCGCGTCAGGCCTGAAGATCACGCCGCTGGACGCGGCAGTACCAGACGCCGCGCAAGCCATGATCGACCAGACAGCTATGCTGCTGCCGCACCTCAAAATCACCGAGTTGCTGATGGAGGTCGATGAATGGACGGGCTTCACCCGCCACTTCACACACCTGAAGACCAGCGACACGGCCAAGGACAAAACCTTGCTGTTGACGACGATCCTGGCCGACGCGATCAACCTGGGTCTGACCAAAATGGCCGAGTCCTGCCCTGGCACCACCTACGCCAAGCTGTCTTGGCTGCAAGCCTGGCACATCCGCGATGAAACCTATTCGACGGCGCTGGCCGAGCTGGTGAATGCGCAGTTTCGGCAACCCTTCGCCGGCAACTGGGGTGACGGCACCACGTCATCGTCGGACGGCCAGAACTTCAGAACCGGCAGCAAAGCAGAAAGCACTGGTCATATCAACCCGAAGTATGGAAGCAGTCCAGGACGGACTTTCTACACCCATATCTCCGACCAGTACGCGCCCTTCAGTGCCAAGGTGGTCAACGTGGGCATTCGTGATTCAACTTACGTGCTTGATGGCCTGCTGTACCACGAGTCGGACTTGCGCATCGAGGAACACTACACCGACACGGCAGGCTTCACCGATCACGTGTTTGGCTTGATGCATTTGCTGGGATTTCGCTTCGCGCCGCGTATCCGTGACTTGGGCGAAACCAAGCTATTCATCCCCAAGGGCGATGCCGCCTATGACGCGCTCAAGCCGATGATTAGCAGCGACAGGCTGAACATCAAGCAAATACGCGCCCATTGGGATGAAATTCTGCGGCTGGCCACCTCCATCAAGCAAGGCACGGTAACGGCTTCGCTGATGCTGCGCAAACTCGGCAGCTACCCGCGCCAGAACGGCTTGGCCGTGGCGTTGCGCGAGCTGGGGCGCATCGAGCGCACGCTGTTCATTTTGGATTGGCTGCAAAGCGTGGAGCTGCGCCGCCGCGTCCATGCGGGGCTGAATAAGGGCGAGGCGCGCAACGCGCTGGCCAGGGCGGTCTTCTTCTACCGATTGGGTGAAATCCGCGACCGCAGTTTTGAGCAGCAGCGCTACCGGGCCAGCGGCCTCAATCTGGTGACGGCGGCCATCGTGTTGTGGAACACGGTATATCTGGAGCGTGCCACCAGTGCTTTGCGTGGCAACGGCACGGCGCTGGACGACACATTGTTGCAATATCTGTCGCCGCTGGGGTGGGAGCACATCAACCTGACCGGCGATTACCTATGGCGCAGCAGCGCCAAGGTCGGTGCGGGGAAGTTTAGGCCATTGCGACCGCTGCCACCGGCTTAGCGTGCTTTATTTAATGAGATGGTCACTCCCTCCTTCCCGGTACTATGCTGAGGATAGGCTTTCATTCGGAGAACTATCATGGAAAACATTGCGCTCATTGGTATCGATCTGGGTAAAAACTCTTTCCATATTCATTGCCAAGATCGTCGCGGCAAGGCTGTTTACCGTAAAAAATTTACACGGCCAAAGTTAATCGAATTTTTGGCGACATGCCCCGCTACAACCATCGCAATGGAAGCCTGTGGTGGCTCTCACTTTATGGCACGCAAGTTGGAAGAGTTGGGGCATTTTCCTAAGCTGATATCACCACAATTTGTCCGTCCATTCGTTAAAAGTAACAAAAACGACTTTGTCGACGCCGAAGCTATTTGTGAAGCTGCATCGCGTCCGTCTATGCGTTTTGTACAGCCCAGAACTGAATCTCAGCAGGCAATGCGTGCGCTGCATCGTGTCCGTGAATCCCTGGTTCAGGATAAGGTAAAAACAACCAATCAGATGCATGCTTTTCTGCTGGAATTTGGCATCAGCGTTCCACGAGGAGCTGCCGTTATTAGCCGACTGAGTACCCTTCTTGAGGACAATAGTTTGCCTCTATACCTCAGCCAGTTATTGCTGAAATTACAACAGCATTATCACTATCTTGTTGAGCAGATTAAAGATTTGGAATCCCAGTTGAAACGAAAGTTGGACGAAGATGAGATTGGACAGCGCTTGCTGAGCATTCCCTGCGTCGGAACACTGACAGCGAGTACTATTTCAACTGAGATTGGCGACGGGAAGCAGTACGCCAGCAGTCGTGACTTTGCGGCGGCAACAGGGCTAGTGCCTCGACAGTACAGCACGGGAGGTCGGACGACATTGCTGGGAATTAGTAAGCGAGGTAACAAAAAGATCCGAACTTTGTTGGTTCAATGTGCCAGGGTATTCATACAAAAACTGGAACACCAGTCTGGCAAATTGGCCGATTGGGTCAGGGATCTACTGTGTAGGAAAAGCAACTTTGTCGTCACTTGTGCTCTGGCAAACAAGCTGGCCAGAATAGCCTGGGCCCTAACGGCACGACAGCAAACTTATGTAGCATAA
Protein sequences of DBSCAN-SWA_1 >CP029123|2205:49895|23265_23493_-|AWF28704.1|DBSCAN-SWA MVNSYSLSDDSGVMAAAAITHFLFGQAVFSYLNGWSVLIGPGTGLDSTGCKYARDLMGLVAFTAFIVTFLFRGYS >CP029123|2205:49895|26564_26870_-|AWF28697.1|transposase|DBSCAN-SWA MARRPRRNHSNDFKAKVALAAIKAEKTLAELSAEFDVHQNQIIDWKNQLISASSQAFDQSKAPTEPPIDLKKLHAKIGEQALEIDFLEGVLKKLGRFNHKS >CP029123|2205:49895|31233_32778_-|AWF28680.1|DBSCAN-SWA MYSYEDRLRAVRLYLKLGRRMSATLRQLGYPTKNSLKAWLAEFERNQDLRRGYQRIKRQYTDEQKQRAVDHYIEQGYCLSHTIRSLGYPSREALRAWIRDLRPEFARTVVGSSAPTVARSRLEKQQAVIALNLRVGSAKDVADTVGVSRPTLYNWQHRLLGKVPLKPMTKKKGDTSLEQRHEALLRELAELESQNQRLRMENAILEKASELIKKDMGINPLELTSREKTKVVDALRVTFPLANLLCGLKLARSTYFYQRLRQTRPDKYTQVREVIRTIFEDNYRCYGYRRIDSALRLGGMRVSEKVVRRLMAQERLVVRTPRRRRFSAYAGDPTPAVPNLLNRDFHASAPNTKWLTDLTEIHIPAGKVYVSPIVDCFDGLVVAWNIGTSPDANLVNTMLDHAVRTLRPGEHPVIHSDRGSHYRWPAWIRRTENAQLTRSMSKKGCSPDNAACEGFFGRLKTELIYPRNWQHVTLKDLMTRIDAYIHWYNERRIKVSLGGRSPIEYRHAVGLMSV >CP029123|2205:49895|27686_29747_-|AWF28707.1|DBSCAN-SWA MGELRKRLLWSGTEQAVWIDIDLDTALPESISVAELERLIIEGELESIGDPFEETVLREVEAGSPDQLKRDEAWAMLADYMHDPQLFVRRPRGLIVGSIMQRHGVTKQTVYRLLRRYWQRGMCRNALLPDYVNSGARGKRRKPNRAKLGRPRVVMAGKGRNVTPDIERIFRRVIEERLLKEKHPSIPDAYASGLNLLRAVQPELPTSELPTLGQFRYFYGREYHFTDTLPRRMSAVDFAKDFRPLNSTSTTETLGPGYRYQIDATIADIYLVSEHDRSLIVGRPVVYMVLDVFSRMVVGMYVGFEGPSWVSAMVALANTVADKVEYCRQYGLEIDASDWPVKGLPDVILADKGELNGTKVEAFSQAFGVRIENAPARRGDAKGIVERYFRTVQERFKPYASGVVEDTTSRKRGGHDYRLDASLTLPEFTKIIIAGILHHNNFHTLSKYDRAAGMPGDLPAIPVMLWNWGLASLTGRLRTAPEELVWINLLSHESATVSELGIRLFGCFYSCPEAIREGWFHRGQGRRPTGVTVAYDPRSADHIYLRPSNSLKDYWVCDLADRSRRFRGMTFWDVWILSREERRSDVNAASKALAERGKLLEQIESIVAQAENASPLKTGISKKDLGTQIRENKQQEKRQERLKTAFKPEKAQRAKPAEVIPLRGEKQEDYAFPDLSDLIFKEEEDD >CP029123|2205:49895|26923_27694_-|AWF28683.1|DBSCAN-SWA MTEELQRGVIPLARYHEQQLPEYQDNPLISALPPIPDLQEVVGLMQQLPSFEPQEALLDGRLRAHAIARLLHGFFQPLSHHLELESKISLMIRQGYIGRNPASGAWYAHLQNGYRRVEEEDLDAAVYQSVTSTANSLSLFGCSGCGKTRTLERILGMYPQALHHPDYNITQLVYLKVDCPIDGDLDELCLSFFNEVDKILGTHYSRSHGRKKLGTKRLMASMCQVANLHALGVLIIDEVQNLNEAQRWIRKFEQLL >CP029123|2205:49895|36358_37198_-|AWF28722.1|DBSCAN-SWA MVTVFGILNLTEDSFFDESRRLDPAGAVTAAIEMLRVGSDVVDVGPAASHPDARPVSPADEIRRIAPLLDALSDQMHRVSIDSFQPETQRYALKRGVGYLNDIQGFPDPALYPDIAEADCRLVVMHSAQRDGIATRTGHLRPEDALDEIVRFFEARVSALRRSGVAADRLILDPGMGFFLSPAPETSLHVLSNLQKLKSALGLPLLVSVSRKSFLGATVGLPVKDLGPASLAAELHAIGNGADYVRTHAPGDLRSAITFSETLAKFRSRDARDRGLDHA >CP029123|2205:49895|29760_30588_-|AWF28647.1|DBSCAN-SWA MSGTFKGLTQAQIDRRLKEGRGQGQGRDYKPFIYTRDVSSLGRSHRLPGSKTRRLHHLLSDLELAIFLTLDRSPQVTDIREQFPMRAEDTVRIAEELGLAHGRYKGIPQVLTSDFLVDFDDPQRPSVAIQAKYSADLQKPEVIERLELERRYWQEKGIPWFIITEREVSREAFANIQWLYPAHAEDDIAQDDLAHYQQLFLLEFQRHPDRKLTAIAQEMDTSGQLEAGQALYWLRQLLARHWFLFDLDTPYRVLKPKDLAANPHQIHQELISVVR >CP029123|2205:49895|9835_10270_+|AWF28662.1|DBSCAN-SWA MENNLENLTIGVFAKAAGVNVETIRFYQRKGLLREPDKPYGSIRRYGEADVVRVKFVKSAQRLGFSLDEIAELLRLDDGTHCEEASSLAEHKLKDVREKMADLARMETVLSELVCACHARKGNVSCPLIASLQGEAGLARSAMP >CP029123|2205:49895|10348_11353_+|AWF28705.1|transposase|DBSCAN-SWA MENIALIGIDLGKNSFHIHCQDRRGKAVYRKKFTRPKLIEFLATCPATTIAMEACGGSHFMARKLEELGHSPKLISPQFVRPFVKSNKNDFVDAEAICEAASRPSMRFVQPRTESQQAMRALHRVRESLVQDKVKTTNQMHAFLLEFGISVPRGAAVISRLSTILEDNSLPLYLSQLLLKLQQHYHYLVEQIKDLESQLKRKLDEDEVGQRLLSIPCVGTLTASTISTEIGDGKQYASSRDFAAATGLVPRQYSTGGRTTLLGISKRGNKKIRTLLVQCARVFIQKLEHQSGKLADWVRDLLCRKSNFVVTCALANKLARIAWALTARQQTYVA >CP029123|2205:49895|17707_18007_+|AWF28726.1|DBSCAN-SWA MDIYDSEGRHYKIDEVASYKPLSPFWYWPVEIVMYGSRLFKANFNAVLISNLDCKELKSELCDLAKKYRSNLDSGVGIEKIMEEMESARTIKELIKVFG >CP029123|2205:49895|19753_19993_+|AWF28666.1|DBSCAN-SWA MPNLLLVGPTNNGKSMIVEKFRRTHPASSDADQEHIPVLVVQMPSEPSVIRFYVALLAAMGAPLRPRPRLLWEENKVSS >CP029123|2205:49895|41887_42727_-|AWF28689.1|DBSCAN-SWA MVTVFGILNLTEDSFFDESRRLDPAGAVTAAIEMLRVGSDVVDVGPAASHPDARPVSPADEIRRIAPLLDALSDQMHRVSIDSFQPETQRYALKRGVGYLNDIQGFPDPALYPDIAEADCRLVVMHSAQRDGIATRTGHLRPEDALDEIVRFFEARVSALRRSGVAADRLILDPGMGFFLSPAPETSLHVLSNLQKLKSALGLPLLVSVSRKSFLGATVGLPVKDLGPASLAAELHAIGNGADYVRTHAPGDLRSAITFSETLAKFRSRDARDRGLDHA >CP029123|2205:49895|22078_23215_-|AWF28670.1|DBSCAN-SWA MSQLSQLRSPAAVQAAIDEFVQLGRTKFLARHGYGKSRDFLVRDPKTGTDCDSKAIAGVAFGKQFPEQGPLTADSFSGGEATVVPALTRLGFRIIRIGEDWSEEEVLATVEDYFDMLRAEAAGEPYNKSEHNQALRQLLNGRSKSSVELKHQNISAVLDALGLPYINGYKPRGNSQLLLRKSVHAYVLEHQQTVGALVDALEEVKLPGDKTYRAALVEPPAREVLVRTPASLRQRLPRKFDYAARDEANRKLGRAGEQWVIGYEQQRLTELGHPELFQRLDWVSDTQGDGAGFDILSFEEDAHERFIEVKTTNGGVGSSFLVSHNELEFSKEAGDQFHLYRVFQFRDGPRLFTLPGDLSQHVHLKPTDYRASFRSLVG >CP029123|2205:49895|4968_5745_-|AWF28719.1|DBSCAN-SWA MIRMILAINNQCFIGKNNTLMYRLKDDMLNFKKMTQNNIVVMGRKTFESLNNRGLPNRLNVVVTSKAETFEDIQTITTHDMKRSETFTKEGHVVYITPDSFINQFLPFHRDSEDEIWVIGGAQVYEAATPFASEIICTFVDDDEVGDVALKPKLFGGFTHLATLKSVDVDEDNDKPYEITQLVRHEDLEHKLRELQAQQHEMEKEQTQNNLSTPLENGGLRQGEAFVIAATTSAALSQIDTESREDSSDSDSSSSSSD >CP029123|2205:49895|11758_11875_+|AWF28731.1|DBSCAN-SWA MDAFTLGMLGLLIFFTVVTGGSLYLYHEKQKEKKHHNA >CP029123|2205:49895|43236_43755_-|AWF28679.1|DBSCAN-SWA MTEHDLAMLYEWLNRSHIVEWWGGEEARPTLADVQEQYLPSVLAQESVTPYIAMLNGEPIGYAQSYVALGSGDGWWEEETDPGVRGIDQLLANASQLGKGLGTKLVRALVELLFNDPEVTKIQTDPSPSNLRAIRCYEKAGFERQGTVTTPDGPAVYMVQTRQAFERTRSDA >CP029123|2205:49895|39038_39938_+|AWF28673.1|DBSCAN-SWA MKIVKRILLVLLSLFFTVEYSNAQTDNLTLKIENVLKAKNARIGVAIFNSNEKDTLKINNDFHFPMQSVMKFPIALAVLSEIDKGNLSFEQKIEITPQDLLPKMWSPIKEEFPNGTTLTIEQILNYTVSESDNIGCDILLKLIGGTDSVQKFLNANHFTDISIKANEEQMHKDWNTQYQNWATPTAMNKLLIDTYNNKNQLLSKKSYDFIWKIMRETTTGSNRLKGQLPKNTIVAHKTGTSGINNGIAAATNDVGVITLPNGQLIFISVFVAESKETSEINEKIISDIAKITWNYYLNK >CP029123|2205:49895|45279_45837_+|AWF28715.1|DBSCAN-SWA MQGQRIGYVRVSSFDQNPERQLEGVQVARVFTDKASGKDTQRPELERLLAFVREGDTVVVHSMDRLARNLDDLRRIVQGLTQRGVRMEFVKEGLKFTGEDSPMANLMLSVMGAFAEFERALIRERQREGIVLAKQRGAYRGRKKSLNSEQIAELKRRVAAGDQKTLVARDFGISRETLYQYLRED >CP029123|2205:49895|46211_48812_+|AWF28678.1|DBSCAN-SWA MHTLTELALQTDKGIVLASALVENLRRQSIILPAMNAIERASAEAITRANRRIYAALTDSLLSPHRQRLDELLKRKDGSKVTWLAWLRQSPAKPNSRHMLEHIERLKSWQALDLPAGIERQVHQNRLLKIAREGGQMTPADLAKFEVQRRYATLVALAIEGMATVTDEIIDLHDRIIGKLFNAAKNKHQQQFQASGKAINDKVRMYGRIGQALIEAKQSGSDPFAAIEAVMPWDTFAASVTEAQTLARPADFDFLHHIGESYATLRRYAPQFLGVLKLRAAPAAKGVLDAIDMLRGMNSDSARKVPADAPTAFIKPRWAKLVLTDDGIDRRYYELCALSELKNALRSGDVWVQGSRQFKDFDEYLVPVEKFATLKLASELPLAVATDCDQYLHDRLELLEAQLATVNRMAAANDLPDAIITTASGLKITPLDAAVPDAAQAMIDQTAMLLPHLKITELLMEVDEWTGFTRHFTHLKTSDTAKDKTLLLTTILADAINLGLTKMAESCPGTTYAKLSWLQAWHIRDETYSTALAELVNAQFRQPFAGNWGDGTTSSSDGQNFRTGSKAESTGHINPKYGSSPGRTFYTHISDQYAPFSAKVVNVGIRDSTYVLDGLLYHESDLRIEEHYTDTAGFTDHVFGLMHLLGFRFAPRIRDLGETKLFIPKGDAAYDALKPMISSDRLNIKQIRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVALRELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNALARAVFFYRLGEIRDRSFEQQRYRASGLNLVTAAIVLWNTVYLERATSALRGNGTALDDTLLQYLSPLGWEHINLTGDYLWRSSAKVGAGKFRPLRPLPPA >CP029123|2205:49895|25737_26403_-|AWF28653.1|DBSCAN-SWA MMRDLLNRQGHHIGRRHTRTLMKKMGIQALYCKPNLSQANQAHRKYPYLLKGLAIQRSNQVWSTDITYIPMAKGFVYLCAVIDWHSRKVLAHRVSISMEVDFCISALNEAIEKYGRPEIFNTDQGSQFTSDAFIDVLKSNGIQISMDGKGRWVDNVMVERLWRSVKYEEVYLKAYSSVTDAKKQLSAYFEFYNLKRPHSSLDKMTPNEFYYDQLPQQNKVA >CP029123|2205:49895|43960_44974_+|AWF28684.1|integrase|DBSCAN-SWA MKTATAPLPPLRSVKVLDQLRERIRYLHYSLPTEQAYVNWVRAFIRFHGVRHPATLGSSEVEAFLSWLANERKVSVSTHRQALAALLFFYGKVLCTDLPWLQEIGRPRPSRRLPVVLTPDEVVRILGFLEGEHRLFAQLLYGTGMRISEGLQLRVKDLDFDHGTIIVREGKGSKDRALMLPESLAPSLREQLSRARAWWLKDQAEGRSGVALPDALERKYPRAGHSWPWFWVFAQHTHSTDPRSGVVRRHHMYDQTFQRAFKRAVEQAGITKPATPHTLRHSFATALLRSGYDIRTVQDLLGHSDVSTTMIYTHVLKVGGAGVRSPLDALPPLTSER >CP029123|2205:49895|34431_35196_+|AWF28701.1|DBSCAN-SWA MRILKSLKLFGMAHAIEELGNQNSPAFNQALPMLDSLIKAEVAEREVRSVNYQLRVAKFPVYRDLVGFDFSQSLVNEATVKQLHRCDFMEQAQNVVLIGGPGTGKTHLATAIGTQAVMHLNRRVRFFSTVDLVNALEQEKSSGRQGQIANRLLYADLVILDELGYLPFSQTGGALLFHLLSKLYEKTSVILTTNLSFSEWSRVFGDEKMTTALLDRLTHHCHILETGNESYRFKHSSTQNKQEEKQTRKLKIET >CP029123|2205:49895|30718_31183_+|AWF28714.1|DBSCAN-SWA MEQLALALLRKVGVRMLVIDELHNVLAGNSVNRREFLNLLRFLGNELRIPLVGVGTRDAYLAIRSDDQLENRFEPMMLPVWEANDDCCSLLASFAASLPLRRPSPIATLDMARYLLTRSEGTIGELAHLLMAAAIVAVESGEEAINHRTLSMAC >CP029123|2205:49895|2205_2910_-|AWF28675.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEIEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP029123|2205:49895|48890_49895_+|AWF28706.1|transposase|DBSCAN-SWA MENIALIGIDLGKNSFHIHCQDRRGKAVYRKKFTRPKLIEFLATCPATTIAMEACGGSHFMARKLEELGHFPKLISPQFVRPFVKSNKNDFVDAEAICEAASRPSMRFVQPRTESQQAMRALHRVRESLVQDKVKTTNQMHAFLLEFGISVPRGAAVISRLSTLLEDNSLPLYLSQLLLKLQQHYHYLVEQIKDLESQLKRKLDEDEIGQRLLSIPCVGTLTASTISTEIGDGKQYASSRDFAAATGLVPRQYSTGGRTTLLGISKRGNKKIRTLLVQCARVFIQKLEHQSGKLADWVRDLLCRKSNFVVTCALANKLARIAWALTARQQTYVA >CP029123|2205:49895|13341_17595_+|AWF28658.1|DBSCAN-SWA MRSRPSLLLMRNLRSLAVVVLATLPCVAFAQWRVVAVSTEVDKMRFNTIIDAHSFMKNYRSGEEQGKGTPVGDALYPVASYGDGRYVSRICFKYLGATGDYDPTTCTGDPATVYWRSTYVLPGEMDKTPFLDRDLGLPTTTMCVGNPIHLGTGNKFQAELDYQSGGSDPFTFTRYYNSHLPDEELGGWRHTYSRSVEVNASKYGENMVVLHRPEGQQLAFYNSSSVWVPTWKTDDTLTKDATGWRYTQSDGVVEAYDETGRLTGIEKPNGNHITLSYLNGELSSITDGFGRTIQFQYQDGRMVSVTDPAGGSIQYQYNSAGKLAEVIYQDNTSRSYLYDDPNAPGLLSGLVDENGNRFATWGYDTQGRAVLSEHAGGAEKTQVSYNADGSVSVTNALGHVQRYTYSRHNGMLKPDVVEGAPCTGFVGGKETYVYDSKGLVSSITDRAGQKRTFTHNDRGLETTQIDQDGGKVTTDWLPSKSLPAKITEPTRITELTYDTHLRVISRKVTDRSSGASRTWTYTYAPVGTGKPSLLASVDGPRTDVSDVTTFDYDDQGNLIRTTNALGQVTQFGDYDANGRAGTIQGVNGVTQTLTYDARGRLVSSTGPEGTTVYNYDAVGLLSSLTKPNGATVSYEYDAAHRLVAETDAQGNRRELELNDLGNPVEERLLDALGQTRWIERRIFNEIGWLSSVSDAYSNQSSFSYDVVANLIQETSPSGNTHSYKYDGFHHRTQTTDPLGKVTQVLYKDTGDVYRVSDPRSRLTYYSYNGFGEVTQVRSPDTGTTDITYDEAGNVATRKTAKGQTTSYSYDALNRIIETSSDVAGESPILYGYDEATSPYGIGRLTSVDDGNGVRRFGYTPEGWLAYETWETHGQSLTTQYQYDGAGLVTKITYPSGREVSYTRDSAGDVIEVTTTQAGTTTNLASQIERAPFGPVTSMVRGNGISESRTLDLDYRVTGIDAARVHSLVYRYTPDSLISAIDDNLSSSVNQSLGYDAVGRITSAEGIYGVLGYGYDATGNRTSITTDGLSQSYTINYMNNWLVKAGQTSRSYDANGNLTKQGADTFTYDSQNRLVAATVAGVTVSYTYNHLDQRVTKTLNGHTRLLVYDLAGNLIEELDAATGDVLAEYIWLDGTPLGFVQSGQTYQVHVDHLGTPKALTDVSGQVVWKASYSPFGKASIIIQGPTFNLRFPGQYYDAETGFHYNWRRYYDPATGRYITSDPLGLIDGVNTYGYVHGNPMSNTDPTGEFAFVGAGIGAGLELLSQLIENNGSWKCVSWSKVGIAGAIGAIGGGWASGVFRHASSGKSWFKLSQKWSNVSPRVRKVQGVPRGNELHHWAIQRNGKFGKYVPDSIKNHPWNLKSIPRDIHQNIHGNGPTPYSAFGRWWHGTPEWAKVAQASPVSGGLADSINDEGCGCAN >CP029123|2205:49895|38663_38879_+|AWF28733.1|transposase|DBSCAN-SWA MLDVSMKEDACQIYQNHAAENWSILRQWSLNMLRAEPSKGSIPAKQKRAWMKTDYLEDVLKAGFSSRVFEN >CP029123|2205:49895|2900_3749_+|AWF28655.1|DBSCAN-SWA MGSCAAPSAKGDDKFITTDYLQQCRVHRNTAYQALKDACDDLFARQFSYQSLSEKGNTINHKSRWVSEVAYIDNEAVVRLIFAPAIVPLITRLEEQFTKYEIQQISNLTSAYAVRLYEILIAWRSTGKTPLITMYDFRQKIGVLETEYKRMYDFKKYVLDIALKQVNEHTDIIVKVEQHKTGRSITGFSFSFKQKKSATHSVESKRDPNTLDLFSKITDKQRHLFANKLSELPEMSKYSQGTESYQQFAVRIAAMLQDAEKAGLWLDLSALIGTVHTFQEYG >CP029123|2205:49895|4389_4911_-|AWF28672.1|DBSCAN-SWA MSVTAIVVMNHENKLTTTQNPLLRVKEVEEMLTKYAKDAVVFIDERQFDLRRALSDGARELFAVVGEQNKPIVGVENILPKSAELLIKHYYKAKTSDVVLFISASQLDRVKTHLDQVVLVTINNPSRELETISKGFTDDLKSRVNRKKLSMTEWSIAQSTEEHKRRYSVKIYS >CP029123|2205:49895|37752_38601_+|AWF28676.1|DBSCAN-SWA MYLELFTTHFDTIIDNRQSAKVTYPLSDVLFVTLCGVIAGAEGWSEIHDYAKGHHEWFQKQGFLSDGVPVDDTIARIISKIAPEQFRQCFINWMQAVHKLTQGEVIAIDGKTLRSSYHPEDRKSTIHMVNAFACANKVVLGQLKTVEKSNEITAIPELIRLLDIEGALVSIDAMGCQTAIAEQVIEGNGDYLLALKANQGTLYNAVEALFAGQRSRPLDGIVIEKNRGRIEARSYHVKDASELKGNFSKWVGLQTVGMNLSYREVKGKNRNSLIVTTSVQPS >CP029123|2205:49895|20592_21717_-|AWF28700.1|DBSCAN-SWA MHLNLAYRWFCRLGLEDEVPNHSTFSKNRHGRFRDSDLFRWLFNEVLRRCMAAGLVKGEGFAVDASIIKADASRQRGVAGDEVDWNDPKLSSRAVREYLEALDEEALAEALPKKISLTDPQSRWTAAPGGPAFFAYSTNYLIDTEHGVIMDVEATPAHRTAEVDSTRTMVERVEAQFDLTPERLIGDTAYGTAPMLAWMVEEKDIEPHVPVWDKTERKDDSLSSNDFHWSQDANEYRCPAGKPLRSEWRAFTQQRSRVTKAKTVIYRSSQTDCATCPLKAKCCPNTPNRKIVRSIHEAARDVARRIAKTPEYLVSRCERKKVEMLFAHLKRIMKLDRLRLRGLTGATDEFTLAAMVQNLRRMAKLLPQGPPLTG >CP029123|2205:49895|40671_41700_-|AWF28695.1|DBSCAN-SWA MQFACAPVLGALSDRFGRRPVLLVSLAGAAVDYAIMATAPFLWVLYIGRIVAGITGATGAVAGAYIADITDGDERARHFGFMSACFGFGMVAGPVLGGLMGGFSPHAPFFAAAALNGLNFLTGCFLLPESHKGERRPLRREALNPLASFRWARGMTVVAALMGVFFIMQLVGQVPAALWVIFGEDRFHWDATTIGISLAAFGILHSLAQAMITGPVAARLGERRALMLGMIADGTGYILLAFATRGWMAFPIMVLLASGGIGMPALQAMLSRQVDEERQGQLQGSLAALTSLTSIVGPLLFTAIYAASITTWNGWAWIAGAALYLLCLPALRRGAWSRATST >CP029123|2205:49895|24744_25605_-|AWF28699.1|DBSCAN-SWA MHTRKAITEALQKLGVQTGDLLMVHASLKAIGPVEGGAETVVAALRSAVGPTGTVMGYASWDRSPYEETLNGARLDDEARRTWLPFDPATAGTYRGFGLLNQFLVQAPGARRSAHPDASMVAVGPLAETLTEPHELGHALGEGSPVERFVRLGGKALLLGAPLNSVTALHYAEAVADIPNKRWVTYEMPMLGRDGEVAWKTASDYDSNGILDCFAIEGKPDAVETIANAYVKLGRHREGVVGFAQCYLFDAQDIVTFGVTYLEKHFGTTPIVPPHEAVERSCEPSG >CP029123|2205:49895|18378_19083_+|AWF28651.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEIEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP029123|2205:49895|7625_9641_+|AWF28682.1|transposase|DBSCAN-SWA MALASACSYLLKEETPDESIRAEVFSYIPRQKLAEIITLVREIARPSDDNFHEEMVEQYGRVRRFLPHLLNTVKFSSAPAGVTTLNACDYLSREFSSRRQFFDDAPTEIISRSWKRLVINKEKHITRRGYTLCFLSKLQDSLRRRDVYVTGSNRWGDPRARLLQGADWQANRIKVYRSLGHPTDPQEAIKSLGHQLDSRYRQVAARLCENEAVELDVSGPKPRLTISPLASLDEPDSLKRLSKMISDLLPPVDLTELLLEINAHTGFADEFFHASEASARVDDLPVSISAVLMAEACNIGLEPLIRSNVPALTRHRLNWTKANYLRAETITSANARLVDFQATLPLAQIWGGGEVASADGMRFVTPVRTINAGPNRKYFGNNRGITWYNFVSDQYSGFHGIVIPGTLRDSIFVLEGLLEQETGLNPTEIMTDTAGASELVFGLFWLLGYQFSPRLADAGASVFWRMDHDADYGVLNDIARGQSDPRKIVLQWDEMIRTAGSLKLGKVQVSVLVRSLLKSERPSGLTQAIIEVGRINKTLYLLNYIDDEDYRRRILTQLNRGESRHAVARAICHGQKGEIRKRYTDGQEDQLGTLGLVTNAVVLWNTIYMQAALDHLRAQGETLNDEDIARLSPLCHGHINMLGHYSFTLAELVTKGHLRPLKEASEAENVA >CP029123|2205:49895|32909_34424_+|AWF28664.1|DBSCAN-SWA MAILSAIRRWHFRDGASIREIARRSGLSRNTVRKYLQSKVVEPQYPARDSVGKLSPFEPKLRQWLSTEHKKTKKLRRNLRSMYRDLVALGFTGSYDRVCAFARQWKDSEQFKAQTSGKGCFIPLRFACGEAFQFDWSEDFARIAGKQVKLQIAQFKLAHSRAFVLRAYYQQKHEMLFDAHWHAFQIFGGIPKRGIYDNMKTAVDSVGRGKERRVNQRFTAMVSHYLFDAQFCNPASGWEKGQIEKNVQDSRQRLWQGAPDFQSLADLNVWLEHRCKALWSELRHPELDQTVQEAFADEQGELMALPNAFDAFVEQTKRVTSTCLVHHEGNRYSVPASYANRAISLRIYADKLVMAAEGQHIAEHPRLFGSGHARRGHTQYDWHHYLSVLQKKPGALRNGAPFAELPPAFKKLQSILLQRPGGDRDMVEILALVLHHDEGAVLSAVELALECGKPSKEHVLNLLGRLTEEPPPKPIPIPKGLRLTLEPQANVNRYDSLRRAHDAA >CP029123|2205:49895|12483_13209_+|AWF28703.1|DBSCAN-SWA MMTLTTVSKKTSNNSALVFWRVGTKRKGILDVHIDFDHEEADLLAELVAIRYLALDKQVFCREPGAGAGYKLVVSKGAIKKLALGKSTKAFAFKFAACLTGRLKGATIEVSQSMEFMDEPGEGNIELLDVDKQAYTQTHDEISTPAIGPVLVTQHAIDQYQARITSGDPKKPWASLVGRLQHPELQVQPFDEKVARHKARKYGRVDNVEVWGHRDSKFKYLMVINDDNQKRVLVTVFERNE >CP029123|2205:49895|19130_19571_+|AWF28713.1|transposase|DBSCAN-SWA MPADALKPWIARRERWPSFLIRRDPRDISRIWVLEPEGQHYLEIPYRTLSHPAVTLWEQRQALAKLRQQGREQVDESALFRMIGQMREIVTSAQKATRKARRDADRRQHLKTSARPDKPVPPDTDIADPQADNLPPAKPFDQIEEW >CP029123|2205:49895|5913_6618_-|AWF28712.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM |
39 | Escherichia_phage(31.25%) | integrase,transposase | attL 19131:19146|attR 47124:47139 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|