Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
CP034955 | Escherichia coli strain SCEC020026 plasmid p2_020026, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP034958 | Escherichia coli strain SCEC020026 chromosome, complete genome | 10 crisprs | RT,csa3,PD-DExK,cas5,cas6e,cas1,cas2,cas3,DEDDh,c2c9_V-U4,DinG | 1 | 21 | 10 | 0 |
CP034957 | Escherichia coli strain SCEC020026 plasmid pNDM5_020026, complete sequence | 0 crisprs | NA | 0 | 0 | 4 | 0 |
CP034954 | Escherichia coli strain SCEC020026 plasmid p1_020026, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CP034956 | Escherichia coli strain SCEC020026 plasmid pCTXM15_020026, complete sequence | 0 crisprs | NA | 0 | 0 | 2 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034958_1 | 635311-635450 | Orphan |
NA
Consensus repeat of CP034958_1
|
1 spacer
spacers of CP034958_1
>1.1|635360|42|CP034958|CRISPRCasFinder ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT |
CRISPR arrays and Neighbor proteins around CP034958_1
The CRISPR arrays of CP034958_1 >merge|CP034958|1|635311-635450|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGTTTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA >CP034958|1|1|635311-635450|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT TTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA
>CP034958.1|QAS84026.1|634218_635259_+|permease MTGQSSSQAATPIQWWKPALFFLVVIAGLWYVKWEPYYGKAFTAAETHSIGKSILAQADANPWQAALDYAMIYFLAVWKAAVLGVILGSLIQVLIPRDWLLRTLGQSRFRGTLLGTLFSLPGMMCTCCAAPVAAGMRRQQVSMGGALAFWMGNPVLNPATLVFMGFVLSWGFAAIRLVAGLVMVLLIATLVQKWVRETPQTQAPVEIDIPEAQGGFFSRWGRALWTLFWSTIPVYILAVLVLGAARVWLFPHADGTVDNSLMWVVAMAVAGCLFVIPTAAEIPIVQTMMLAGMGTAPALALLMTLPAVSLPSLIMLRKAFPAKALWLTGAMVAVSGVIVGGLALLF >CP034958.1|QAS84025.1|633510_634146_+|NAD-dependent-epimerase/dehydratase-family-protein MSQVLITGATGLVGGHLLRMLINEPKVNAIAAPTRRPLGDMPGVFNPHDPQLTDALAQVTDPIDIVFCCLGTTRREAGSKEAFIHADYTLVVDTALTGRRLGAQHMLVVSAMGANAHSPFFYNRVKGEMEEALIAQNWPKLTIARPSMLLGDRSKQRMNETLFAPLFRLLPGNWKSIDARDVARVMLAESMRPEHEGVTILSSSELRKRAE >CP034958.1|QAS84024.1|632864_633383_-|type-1-glutamine-amidotransferase MSKKIAVLITDEFEDSEFTSPADEFRKAGHEVITIEKQAGKTVKGKKGEASVTIDKSIDEVTPAEFDALLLPGGHSPDYLRGDNRFVTFTRDFVNSGKPVFAICHGPQLLISADVIRGRKLTAVKPIIIDVKNAGAEFYDQEVVVDKDQLVTSRTPDDLPAFNREALRLLGA >CP034958.1|QAS84023.1|632441_632885_+|hypothetical-protein METLIAISRWLAKQHVVTWCVQQEGELWCANAFYLFDAQKVAFYILTEEKTRHAQMSGPQAAVAGTVNGQPKTVALIRGVQFKGEIRRLEGEESDLARKAYNRRFPVARMLSAPVWEIRLDEIKFTDNTLGFGKKMIWLRDSGTEQA >CP034958.1|QAS84022.1|632088_632391_-|DNA-damage-response-exodeoxyribonuclease-YhbQ MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAFSAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD >CP034958.1|QAS84021.1|631598_632102_+|N-acetyltransferase MLIRVEIPIDAPGIDALLRRSFESDAEAKLVHDLREDGFLTLGLVATDDEGQVIGYVAFSPVDVQGEDLQWVGMAPLAVDEKYRGQGLARQLVYEGLDSLNEFGYAAVVTLGDPALYSRFGFELAAHHDLRCRWPGTESAFQVHRLADDALNGVTGLVEYHEHFNRF >CP034958.1|QAS84020.1|631080_631605_+|SCP2-domain-containing-protein MLDKLRSRIVHLGPSLLSVPVKLTPFALKRQVLEQVLSWQFRQALDDGELEFLEGRWLSIHVRDIDLQWFTSVVNGKLVVSQNAQADVSFSADASDLLMIAARKQDPDTLFFQRRLVIEGDTELGLYVKNLMDAIELEQMPKALRMMLLQLADFVEAGMKNAPETKQTSVGEPC >CP034958.1|QAS84019.1|629876_630872_-|U32-family-peptidase 
MELLCPAGNLPALKAAIENGADAVYIGLKDDTNARHFAGLNFTEKKLQEAVSFVHQHRRKLHIAINTFAHPDGYARWQRAVDMAAQLGADALILADLAMLEYAAERYPHIERHVSVQASATNEEAINFYHRHFDVARVVLPRVLSIHQVKQLARVTPVPLEVFAFGSLCIMSEGRCYLSSYLTGESPNTVGACSPARFVRWQQTPQGLESRLNEVLIDRYQDGENAGYPTLCKGRYLVDGERYHALEEPTSLNTLELLPELMAANIASVKIEGRQRSPAYVSQVAKVWRQAIDRCKADPQNFVPQSAWMETLGSMSEGTQTTLGAYHRKWQ >CP034958.1|QAS84018.1|628989_629868_-|U32-family-peptidase MKYSLGPVLWYWPKETLEEFYQQAATSSADVIYLGEAVCSKRRATKVGDWLEMAKSLAGSGKQIVLSTLALVQASSELGELKRYVENGEFLIEASDLGVVNMCAERKLPFVAGHALNCYNAVTLKILLKQGMMRWCMPVELSRDWLVNLLNQCDELGIRNQFEVEVLSYGHLPLAYSARCFTARSEDRPKDECETCCIKYPNGRNVLSQENQQVFVLNGIQTMSGYVYNLGNELASMQGLVDVVRLSPQGTDTFAMLDAFRANENGAAPLPLTANSDCNGYWRRLAGLELQA >CP034958.1|QAS84017.1|627776_628784_-|LLM-class-flavin-dependent-oxidoreductase MTDKTIAFSLLDLAPIPEGSSAREAFSHSLDLARLAEKRGYHRYWLAEHHNMTGIASAATSVLIGYLAANTTTLHLGSGGVMLPNHSPLVIAEQFGTLNTLYPGRIDLGLGRAPGSDQRTMMALRRHMSGDIDNFPRDVAELVDWFDARDPNPNVRPVPGYGEKIPVWLLGSSLYSAQLAAQLGLPFAFASHFAPDMLFQALHLYRSNFKPSARLEKPYAMVCINIIAADSNRDAEFLFTSMQQAFVKLRRGETGQLPPPIQNMDQFWSPSEQYGVQQALSMSLVGDKAKVRHGLQSILRETDADEIMVNGQIFDHQARLHSFELAMDVKEELLG >CP034958.1|QAS84027.1|635463_636039_-|divisome-associated-lipoprotein-YraP MKALSPIAVLISALLLQGCVAAAVVGTAAVGTKAATDPRSVGTQVDDGTLEVRVNSALSKDEQIKKEARINVTAYQGKVLLVGQSPNAELSARAKQIAMGVDGANEVYNEIRQGQPIGLGEASNDTWITTKVRSQLLTSDLVKSSNVKVTTENGEVFLMGLVTEREAKAAADIASRVSGVKRVTTAFTFIK >CP034958.1|QAS84028.1|636048_636639_-|DnaA-initiator-associating-protein-DiaA MQERIKACFTESIQTQIAAAEALPDAISRAAMTLVQSLLNGNKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIANDRLHDEVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVALTGYDGGELAGLLGPQDVEIRIPSHRSARIQEMHMLTVNCLCDLIDNTLFPHQDD >CP034958.1|QAS84029.1|636658_637054_-|YraN-family-protein MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS >CP034958.1|QAS84030.1|637011_639048_-|penicillin-binding-protein-activator 
MVPSTFSRLKAARCLPVVLAALIFAGCGTHTPDQSTAYMQGTAQADSAFYLQQMQQSSDDTRINWQLLAIRALVKEGKTGQAVELFNQLPQELNDSQRREKTLLAVEIKLAQKDFAGAQNLLAKITPADLEQNQQARYWQAKIDASQGRPSIDLLRALIAQEPLLGAKEKQQNIDATWQALSSMTQEQANTLVINADENILQGWLDLQRVWFDNRNDPDMMKAGIADWQKRYPNNPGAKMLPTQLVNVKAFKPASTNKIALLLPLNGQAAVFGRTIQQGFEAAKNIGTQPVAAQVAAAPAADVAEQPQPQTVDGVASPAQASVSDLTGEQPAAQPVPVSAPATSTAAVSAPANPSAELKIYDTSSQPLSQILSQVQQDGASIVVGPLLKNNVEELLKSNTPLNVLALNQPENIENRVNICYFALSPEDEARDAARHIRDQGKQAPLVLIPRSSLGDRVANAFAQEWQKLGGGTVLQQKFGSTSELRAGVNGGSGIALTGSPITPRATTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGEIAFIKPMIAMRNGSQSGATLYASSRSAQGTAGPDFRLEMEGLQYSEIPMLAGGNLPLMQQALSAVNNDYSLARMYAMGVDAWSLANHFSQMRQVQGFEINGNTGSLTANPDCVINRKLSWLQYQQGQVVPAS >CP034958.1|QAS84031.1|639112_639973_+|16S-rRNA-(cytidine(1402)-2'-O)-methyltransferase MKQHQSADNSQGQLYIVPTPIGNLADITQRALEVLQAVDLIAAEDTRHTGLLLQHFGINARLFALHDHNEQQKAETLLAKLQEGQNIALVSDAGTPLINDPGYHLVRTCREAGIRVVPLPGPCAAITALSAAGLPSDRFCYEGFLPAKSKGRRDALKAIEAEPRTLIFYESTHRLLDSLEDIVAVLGESRYVVLARELTKTWETIHGAPVGELLAWVKEDENRRKGEMVLIVEGHKAQEEDLPADALRTLALLQAELPLKKAAALAAEIHGVKKNALYKYALEQQG >CP034958.1|QAS84032.1|640015_641107_-|fimbrial-protein MKRAPLITGLLLISTSCAYASSGGCGADSTSGATNYSSVVDDVTVNQTDNVTGREFTSATLSSTNWQYACSCSAGKAVKLVYMVSPVLTTTGHQAGYYKLNDSLDIKTTLKANDIPGLVTDQTVSVNTRFTQIKSNTVYSAATQTGVCQGDTSRYGPVNIGANTTFTLYVTKPFLGSMTIPKTDIAVIKGAWVDGMGSPSTGDFHDLVKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQKFTTRNAMPDGFTPVDFDITYDCGDTSKIKNSLQMRIDGTTGVVDQYNLVARRRSSDNAPDVGIRIENLGGGVANIPFQNGILPVDPSGHGTVNMRAWPVNLVGGELETGKFQGTATITVIVR >CP034958.1|QAS84033.1|641117_643490_-|fimbrial-biogenesis-outer-membrane-usher-protein 
MLETTKSGMQTTDLSRFSKKYAQLPGTYQVDIWLNKKKVSQKKITFTANAEQLLQPQFTVEQLRELGIKVDEIPALAEKDDDSVINSLEQIIPGTAAEFDFNHQRLNLSIPQIALYRDARGYVSPSRWDDGIPTLFTNYSFTGSDNRYRQGNRSQRQYLNMQNGANFGPWRLRNYSTWTRNDQTSSWNTISSYLQRDIKALKSQLLLGESATSGSIFSSYTFTGVQLASDDNMLPNSQRGFAPTVRGIANSSAIVTIRQNGYVIYQSNVPAGAFEINDLYPSSNSGDLEVTIEESDGTQRRFIQPYSSLPMMQRPGHLKYSATAGRYRADANSDSKEPEFAEATAIYGLNNTFTLYSGLLGSEDYYALGIGIGGTLGALGALSMDINRADTQFDNQHSFHGYQWRTQYIKDIPETNTNIAVSYYRYTNDGYFSFDEANTRNWDYNSRQKSEIQFNISQTIFDGVSLYASGSQQDYWGNNEKNRNISVGVSGQQWGIGYSLNYQYSRYTDQNNDRALSLNLSIPLERWLPRSRVSYQMTSQKDRPTQHEMRLDGSLLDDGRLSYSLEQSLDDDNNHNSSVNASYRSPYGTFSAGYSYGNDSSQYNYGVTGGVVIHPHGVTLSQYLGNAFALIDANGASGVRIQNYPGIATDPFGYAVVPYLTTYQENRLSVDTTQLPDNVDLEQTTQFVVPNRGAMVAARFNANIGYRVLVTVSDRNGKPLPFGALASNDETGQQSIVDEGGILYLSGISSKSQSWTVRWGNQADQQCQFAFSTPDSEPTTSVLQGTAQCH >CP034958.1|QAS84034.1|644639_645395_-|galactosamine-6-phosphate-isomerase MERGTASGGASLLKEFHPVQTLQQVENYTALSERASEYLLAVIRSKPDAVICLATGATPLLTYHYLVEKIHQQQVDVSQLTFVKLDEWVDLPLTMPGTCETFLQQHIVQPLGLREDQLISFRSEEINETECERVTNLIARKGGLDLCVLGLGKNGHLGLNEPGESLQPACHISQLDARTQQHEMLKTAGRPVTRGITLGLKDILNAREVLLLVTGEGKQDATERFLTAKVSTAIPASFLWLHSNFICLINT >CP034958.1|QAS84035.1|645395_646187_-|PTS-N-acetylgalactosamine-transporter-subunit-IID MGSEISKKDITRLGFRSSLLQASFNYERMQAGGFTWAMLPILKKIYKDDKPGLSAAMKDNLEFINTHPNLVGFLMGLLISMEEKGENRDTIKGLKVALFGPIAGIGDAIFWFTLLPIMAGICSSFASQGNLLGPILFFAVYLLIFFLRVGWTHVGYSVGVKAIDKVRENSQMIARSATILGITVIGGLIASYVHINVVTSFAIDSTHSVALQQDFFDKVFPNILPMAYTLLMYYFLRVKKAHPVLLIGVTFVLSIVCSAFGIL >CP034958.1|QAS84036.1|646176_646980_-|N-acetylgalactosamine-permease-IIC-component-1 MHEITLLQGLSLAALVFVLGIDFWLEALFLFRPIIVCTLTGAILGDIQTGLITGGLTELAFAGLTPAGGVQPPNPIMAGLMTTVIAWSTGVDAKTAIGLGLPFSLLMQYVILFFYSAFSLFMTKADKCAKEADTAAFSRLNWTTMLIVASAYAVIAFLCTYLAQGAMQALVKAMPAWLTHGFEVAGGILPAVGFGLLLRVMFKAQYIPYLIAGFLFVCYIQVSNLLPVAVLGAGFAVYEFFNAKSRQQAQPQPVASKNEEEDYSNGI |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034958_2 | 663705-663956 | Orphan |
NA
Consensus repeat of CP034958_2
|
2 spacers
spacers of CP034958_2
>2.1|663759|64|CP034958|PILER-CR AGAACCCGGCTTATCGGTCAGTTTCACCTGATTTACGTAAAAACCCGCTTCGGCGGGTTTTTGC >2.2|663877|59|CP034958|PILER-CR GCGGCTTATCGGTCAGTTTCACCTGGTTTACGTAAAAAACCGCTTCGGCGGGTTTTTGC |
CRISPR arrays and Neighbor proteins around CP034958_2
The CRISPR arrays of CP034958_2 >merge|CP034958|2|663705-663956|PILER-CR AGATGAATGACTGTCCACGACAGAACCCGGCTTATCGGTCAGTTTCACCTGATTTACGTAAAAACCCGCTTCGGCGGGTTTTTGCTTTTGGAGGAGCGGAAAGATGAATGACTGTCCACGACGCTATACCCAAAAGAAAGCGGCTTATCGGTCAGTTTCACCTGGTTTACGTAAAAAACCGCTTCGGCGGGTTTTTGCTTTTGGAGGGGCAGAAAGATGAATGACTGTCCACGACGCTATACCCAAAAGAAA >CP034958|2|1|663705-663956|PILER-CR AGATGAATGACTGTCCACGACAGAACCCGGCTTATCGGTCAGTTTCACCTGATT TACGTAAAAACCCGCTTCGGCGGGTTTTTGCTTTTGGAGGAGCGGAAAGATGAATGACTGTCCA CGACGCTATACCCAAAAGAAAGCGGCTTATCGGTCAGTTTCACCTGGTTTACGT AAAAAACCGCTTCGGCGGGTTTTTGCTTTTGGAGGGGCAGAAAGATGAATGACTGTCCA CGACGCTATACCCAAAAGAAA
>CP034958.1|QAS84053.1|662199_663345_+|glycerate-2-kinase MKIVIAPDSYKESLSASEVAQAIEKGFREIFPDAQYVSIPVADGGEGTVEAMIAATQGSERHAWVTGPLGEKVNASWGISGDGKTAFIEMAAASGLELVPAEKRDPLVTTSRGTGELILQALESGATNIIIGIGGSATNDGGAGMVQALGAKLCDANGNEIGFGGGSLNTLNDIDISGLDPRLKDCVIRVACDVTNPLVGDNGASRIFGPQKGASEAMIVELDNNLSHYADVIKKALHVDVKDVPGAGAAGGMGAALMAFLGAELKSGIEIVTTALNLEEHIHDCTLVITGEGRIDSQSIHGKVPIGVANVAKKYHKPVIGIAGSLTDDVGVVHQHGIDAVFSVLTSIGTLDEAFRGAYDNICRASRNIAATLAIGMRNAG >CP034958.1|QAS84052.1|661212_662103_+|2-hydroxy-3-oxopropionate-reductase MTMKVGFIGLGIMGKPMSKNLLKAGYSLVVADRNPEAIADVIAAGAETASTAKAIAEQCDVIITMLPNSPHVKEVALGENGIIEGAKPGTVLIDMSSIAPLASREISEALKAKGIDMLDAPVSGGEPKAIDGTLSVMVGGDKAIFDKYYDLMKAMAGSVVHTGEIGAGNVTKLANQVIVALNIAAMSEALTLATKAGVNPDLVYQAIRGGLAGSTVLDAKAPMVMDRNFKPGFRIDLHIKDLANALDTSHGVGAQLPLTAAVMEMMQALRADGLGTADHSALACYYEKLAKVEVTR >CP034958.1|QAS84051.1|660412_661183_+|5-keto-4-deoxy-D-glucarate-aldolase MNNDVFPNKFKAALAAKQVQIGCWSALSNPISTEVLGLAGFDWLVLDGEHAPNDISTFIPQLMALKGSASAPVVRVPTNEPVIIKRLLDIGFYNFLIPFVETKEEAEQAVASTRYPPEGIRGVSVSHRANMFGTVADYFAQSNKNITILVQIESQQGVDNIDAIAATEGVDGIFVGPSDLAAALGHLGNASHPDVQKAIQHIFNRASAHGKPSGILAPIEADARRYLEWGATFVAVGSDLGVFRSATQKLADTFKK >CP034958.1|QAS84050.1|659062_660397_+|MFS-transporter MILDTVDVKKKGVHTRYLILLIIFIVTAVNYADRATLSIAGTEVAKELQLSAVSMGYIFSAFGWAYLLMQIPGGWLLDKFGSKKVYTYSLFFWSLFTFLQGFVDMFPLAWAGISMFFMRFMLGFSEAPSFPANARIVAAWFPTKERGTASAIFNSAQYFSLALFSPLLGWLTFAWGWEHVFTVMGVIGFVLTALWIKLIHNPTDHPRMSAEELKFISENGAVVDMDHKKPGSAAASGPKLHYIKQLLSNRMMLGVFFGQYFINTITWFFLTWFPIYLVQEKGMSILKVGLVASIPALCGFAGGVLGGVFSDYLIKRGLSLTLARKLPIVLGMLLASTIILCNYTNNTTLVVMLMALAFFGKGFGALGWSVISDTAPKEIVGLCGGVFNVFGNVASIVTPLVIGYLVSELHSFNAALIFVGCSALMAMVCYLFVVGDIKRMELQK >CP034958.1|QAS84049.1|657116_658688_-|galactarate-dehydratase 
MANIEIRQETPTAFYIKVHDTDNVAIIVNDNGLKAGTRFPDGLELIEHIPQGHKVALLDIPANGEIIRYGEVIGYAVRAIPRGSWIDESMVVLPEAPPLHTLPLATKVPEPLPPLEGYTFEGYRNADGSVGTKNLLGITTSVHCVAGVVDYVVKIIERDLLPKYPNVDGVVGLNHLYGCGVAINAPAAVVPIRTIHNISLNPNFGGEVMVIGLGCEKLQPERLLTGTDDVQAIPVESASIVSLQDEKHVGFQSMVEDILQVAERHLQKLNQRQRETCPASELVVGMQCGGSDAFSGVTANPAVGYASDLLVRCGATVMFSEVTEVRDAIHLLTPRAVNEEVGKRLLEEMEWYDNYLNIGKTDRSANPSPGNKKGGLANVVEKALGSIAKSGKSAIVEVLSPGQRPTKRGLIYAATPASDFVCGTQQVASGITVQVFTTGRGTPYGLMAVPVIKMATRTELANRWFDLMDINAGTIATGEETIEEVGWKLFHFILDVASGKKKTFSDQWGLHNQLAVFNPAPVT >CP034958.1|QAS84048.1|656632_656968_-|type-II-toxin-antitoxin-system-PrlF-family-antitoxin MPANARSHAVLTTESKVTIRGQTTIPAPVREALKLKPGQDSIHYEILPGGQVFMCRLGDEQEDHTMNAFLRFLDADIQNNPQKTRPFNIQQGKKLVAGMDVNIDDEIGDDE >CP034958.1|QAS84047.1|656168_656633_-|type-II-toxin-antitoxin-system-YhaV-family-toxin MDFPQRVNGWALYAHPCFQETYDALVAEVEALKGKDPENYQRKAATKLLAVVHKVIEEHITVNPSSPAFRHGKSLGSGKNKDWSRVKFGAGRYRLFFRYSEKEKVIILGWMNDENTLRTYGKKTDAYTVFSKMLKRGHPPADWESLTQETEENH >CP034958.1|QAS84046.1|655304_656114_+|DeoR-family-transcriptional-regulator MSNTDASGEKRVTGTSERREQIIQRLRQQGSVQVNDLSALYGVSTVTIRNDLAFLEKQGIAVRAYGGALICDSTTPSVEPSVEDKSALNTAMKRSVAKAAVELIQPGHRVILDSGTTTFEIARLMRKHTDVIAMTNGMNVANALLEAEGVELLMTGGHLRRQSQSFYGDQAEQSLQNYHFDMLFLGVDAIDLERGVSTHNEDEARLNRRMCEVAERIIVVTDSSKFNRSSLHKIIDTQRIDMIIVDEGIPADSLEGLRKAGVEVILVGE >CP034958.1|QAS84045.1|653775_655056_-|tagatose-bisphosphate-aldolase-subunit-KbaZ MKHLTEMVRQHKAGKTNAIYAVCSAHPLVLEAAIRYASANQTPLLIEATSNQVDQFGGYTGMTPADFRGFVCQLADSLNFPQDALILGGDHLGPNRWQNLPAAQAMANADDLIKSYVAAGFKKIHLDCSMSCQDDPIPLTDDIVAERAARLAKVAEETCLEHFGEADLEYVIGTEVPVPGGAHETLSELAVTTPDAARATLEAHRHAFEKQGLNAIWPRIIALVVQPGVEFDHTNVIDYQPAKASALSQMVENYETLIFEAHSTDYQTPQSLRQLVIDHFAILKVGPALTFALREALFSLAAIEEELVPAKACSGLRQVLEDVMLDRPEYWQSHYHGDGNARRLARGYSYSDRVRYYWPDSQIDDAFAHLVRNLADSPIPLPLISQYLPLQYVKVRSGELQPTPRELIINHIQDILAQYHTACEGQ >CP034958.1|QAS84044.1|653279_653753_-|PTS-N-acetylgalactosamine-transporter-subunit-IIB 
MPNIVLSRIDERLIHGQVGVQWVGFAGANLVLVANDEVAEDPVQQNLMEMVLAEGIAVRFWTLQKVIDNIHRAADRQKILLVCKTPADFLTLVKGGVPVNRINVGNMHYANGKQQIAKTVSVDAGDIAAFNDLKAAGVECFVQGVPTEPAVDLFKLL >CP034958.1|QAS84054.1|664254_665442_-|YhaC-family-protein MFPVSSIGNDISSDLVRRKMNDLPESPIVNNLEALAPGIEKLKQTSIQMVTLLNALQPGGKCIITGDFQKELAYLQNVILYNDSSLRMDFFGYNALIIQRSDNTCELTINEPLKNQEISTGNINVNFPLKDIYNEIRRLNVVFSCGTGGIVDLSSLDLRNIDLELYDFTDKHMANAILNPFKLDDTDFTNANMFQVNFVSSKQNTTISWDYLLKITPVLTSISDMYSEEKIKLVESCLNELGDITEEQLKIMRFAIIESIPRATLTDQLENELTKEIYKNSSKINNYLNRIKLPEMKGFSSEKIDYYIDIIIKDYESVKENAYLIDPKINYNTDLNIEDSSSEEFLSDNTLEKDENSPDNCFEVVKYNTYEAYNSENLYFTREEYTYDYDLLNAI >CP034958.1|QAS84055.1|665463_666003_-|hypothetical-protein MKGFPIAHIFHPSIPPMHAVVNNHNRNIDYWTVKRKFAEIVSTNDVNKIYSISNELRRVLSAITALNFYQGDVPSVMIRIQPENMSPFIIDISTGEHDDYIIQTLDVGTFAPFGEQCTCSAVNKKELECIKETISKYCAKFTRKEAILTPPAHFNKTSITSDCWQILFFSPDHFNNDFY >CP034958.1|QAS84056.1|666258_666603_-|DNA-binding-transcriptional-activator-TdcR MTGITIFYGDNIIRYVVNIKKGLRPYFKQLPDNYQAKFELNLMSKFSNFIINKPFSAINTAARHIFSRYLLENKHLFYQYFKISNTGIDHLEQLINVNFFSSDRTSFCECNRFP >CP034958.1|QAS84057.1|666791_667730_+|transcriptional-regulator-TdcA MSTILLPKTQHLVVFQEVIRSGSIGSAAKELGLTQPAVSKIINDIEDYFGVELVVRKNTGVTLTPAGQLLLSRSESITREMKNMVNEISGMSSEAVVEVSFGFPSLIGFTFMSGMINKFKEVFPKAQVSMYEAQLSSFLPAIRDGRLDFAIGTLSAEMKLQDLHVEPLFESEFVLVASKSRTCTGTTTLESLKNEQWVLPQTNMGYYSELLTTLQRNGISIENIVKTDSVVTIYNLVLNADFLTVIPCDMTSPFGSNQFITIPVEETLPVAQYAAVWSKNYRIKKAASVLVELAKEYSSYNGCRRRQLIEVG >CP034958.1|QAS84058.1|667828_668818_+|bifunctional-threonine-ammonia-lyase/L-serine-ammonia-lyase-TdcB MHITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTDAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRTTGTLADGCDVSRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTVSIISGGNIDLSRVSQITGFVDA >CP034958.1|QAS84059.1|668839_670171_+|threonine/serine-transporter-TdcC 
MSTSDSIVSSQTKQSSWRKSDTTWTLGLFGTAIGAGVLFFPIRAGFGGLIPILLMLVLAYPIAFYCHRALARLCLSGSNPSGNITETVEEHFGKTGGVVITFLYFFAICPLLWIYGVTITNTFMTFWENQLGFAPLNRGFVALFLLLLMAFVIWFGKDLMVKVMSYLVWPFIASLVLISLSLIPYWNSAVIDQVDLGSLSLTGHDGILITVWLGISIMVFSFNFSPIVSSFVVSKREEYEKDFGRDFTERKCSQIISRASMLMVAVVMFFAFSCLFTLSPANMAEAKAQNIPVLSYLANHFASMTGTKTTFAITLEYAASIIALVAIFKSFFGHYLGTLEGLNGLILKFGYKGDKTKVSLGKLNTISMIFIMGSTWVVAYANPNILDLIEAMGAPIIASLLCLLPMYAIRKAPSLAKYRGRLDNVFVTVIGLLTILNIVYKLF >CP034958.1|QAS84060.1|670196_671405_+|propionate-kinase MNEFPVVLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVNGGEPAPLAHHSYEGALKAIAFELEKRNLNDSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLHNYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTSHRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERAQLAIKTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRSNSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGKVNAPAEFA >CP034958.1|QAS84061.1|671438_673733_+|2-ketobutyrate-formate-lyase/pyruvate-formate-lyase MKVDIDTSDKLYADAWLGFKGTDWKNEINVRDFIQHNYTPYEGDESFLAEATPATTELWEKVMEGIRIENATHAPVDFDTNIATTITAHDAGYINQPLEKIVGLQTDAPLKRALHPFGGINMIKSSFHAYGREMDSEFEYLFTDLRKTHNQGVFDVYSPDMLRCRKSGVLTGLPDGYGRGRIIGDYRRVALYGISYLVRERELQFADLQSRLEKGEDLEATIRLREELAEHRHALLQIQEMAAKYGFDISRPAQNAQEAVQWLYFAYLAAVKSQNGGAMSLGRTASFLDIYIERDFKAGVLNEQQAQELIDHFIMKIRMVRFLRTPEFDSLFSGDPIWATEVIGGMGLDGRTLVTKNSFRYLHTLHTMGPAPEPNLTILWSEELPIAFKKYAAQVSIVTSSLQYENDDLMRTDFNSDDYAIACCVSPMVIGKQMQFFGARANLAKTLLYAINGGVDEKLKIQVGPKTAPLMDDVLDYDKVMDSLDHFMDWLAVQYISALNIIHYMHDKYSYEASLMALHDRDVYRTMACGIAGLSVATDSLSAIKYARVKPIRDENGLAVDFEIDGEYPQYGNNDERVDSIACDLVERFMKKIKALPTYRNAVPTQSILTITSNVVYGQKTGNTPDGRRAGTPFAPGANPMHGRDRKGAVASLTSVAKLPFTYAKDGISYTFSIVPAALGKEDPVRKTNLVGLLDGYFHHEADVEGGQHLNVNVMNREMLLDAIEHPEKYPNLTIRVSGYAVRFNALTREQQQDVISRTFTQAL >CP034958.1|QAS84062.1|673746_674136_+|enamine/imine-deaminase MKKIIETQRAPGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINEVYKQFFDEHQATYPTRSYVQVARLPKDVKLEIEAIAVRSA 
>CP034958.1|QAS84063.1|674207_675572_+|L-serine-ammonia-lyase MISAFDIFKIGIGPSSSHTVGPMNAGKSFIDRLESSGLLTATSHIVVDLYGSLSLTGKGHATDVAIIMGLAGNSPQDVVIDEIPAFIELVTRSGRLPVASGAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEALLSKTYYSVGGGFIVEEEHFGLSHDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALRSKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRAVALRRQLVSSDNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYYDKFRRPVNERSIARYFLAAGAIGALYKMNASISGAEVGCQGEIGVACSMAAAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINAVKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIKVVCG |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034958_3 | 678537-678654 | Orphan |
NA
Consensus repeat of CP034958_3
|
1 spacer
spacers of CP034958_3
>3.1|678577|38|CP034958|CRISPRCasFinder GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA |
CRISPR arrays and Neighbor proteins around CP034958_3
The CRISPR arrays of CP034958_3 >merge|CP034958|3|678537-678654|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGGGTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGATGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG >CP034958|3|2|678537-678654|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGG GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA TGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG
>CP034958.1|QAS84065.1|677205_678516_+|serine-dehydratase-subunit-alpha-family-protein MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPGTGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFSRAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTQQACVAEGEQESPLTVLSRTTLAEILKFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVIRTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAIYIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR >CP034958.1|QAS84064.1|675846_677178_+|HAAAP-family-serine/threonine-permease MEIASNKGVIADASTPAGRAGMSESEWREAIKFDSTDTGWVIMSIGMAIGAGIVFLPVQVGLMGLWVFLLSSVIGYPAMYLFQRLFINTLAESPECKDYPSVISGYLGKNWGILLGALYFVMLVIWMFVYSTAITNDSASYLHTFGVTEGLLSDSPFYGLVLICILVAISSRGEKLLFKISTGMVLTKLLVVAALGVSMVGMWHLYNVGSLPPLGLLVKNAIITLPFTLTSILFIQTLSPMVISYRSREKSIEVARHKALRAMNIAFGILFVTVFFYAVSFTLAMGHDEAVKAYEQNISALAIAAQFISGDGAAWVKVVSVILNIFAVMTAFFGVYLGFREATQGIVMNILRRKMPAEKINENLVQRGIMIFAILLAWSAIVLNAPVLSFTSICSPIFGMVGCLIPAWLVYKVPALHKYKGMSLYLIIVTGLLLCVSPFLAFS >CP034958.1|QAS84063.1|674207_675572_+|L-serine-ammonia-lyase MISAFDIFKIGIGPSSSHTVGPMNAGKSFIDRLESSGLLTATSHIVVDLYGSLSLTGKGHATDVAIIMGLAGNSPQDVVIDEIPAFIELVTRSGRLPVASGAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEALLSKTYYSVGGGFIVEEEHFGLSHDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALRSKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRAVALRRQLVSSDNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYYDKFRRPVNERSIARYFLAAGAIGALYKMNASISGAEVGCQGEIGVACSMAAAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINAVKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIKVVCG >CP034958.1|QAS84062.1|673746_674136_+|enamine/imine-deaminase MKKIIETQRAPGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINEVYKQFFDEHQATYPTRSYVQVARLPKDVKLEIEAIAVRSA >CP034958.1|QAS84061.1|671438_673733_+|2-ketobutyrate-formate-lyase/pyruvate-formate-lyase 
MKVDIDTSDKLYADAWLGFKGTDWKNEINVRDFIQHNYTPYEGDESFLAEATPATTELWEKVMEGIRIENATHAPVDFDTNIATTITAHDAGYINQPLEKIVGLQTDAPLKRALHPFGGINMIKSSFHAYGREMDSEFEYLFTDLRKTHNQGVFDVYSPDMLRCRKSGVLTGLPDGYGRGRIIGDYRRVALYGISYLVRERELQFADLQSRLEKGEDLEATIRLREELAEHRHALLQIQEMAAKYGFDISRPAQNAQEAVQWLYFAYLAAVKSQNGGAMSLGRTASFLDIYIERDFKAGVLNEQQAQELIDHFIMKIRMVRFLRTPEFDSLFSGDPIWATEVIGGMGLDGRTLVTKNSFRYLHTLHTMGPAPEPNLTILWSEELPIAFKKYAAQVSIVTSSLQYENDDLMRTDFNSDDYAIACCVSPMVIGKQMQFFGARANLAKTLLYAINGGVDEKLKIQVGPKTAPLMDDVLDYDKVMDSLDHFMDWLAVQYISALNIIHYMHDKYSYEASLMALHDRDVYRTMACGIAGLSVATDSLSAIKYARVKPIRDENGLAVDFEIDGEYPQYGNNDERVDSIACDLVERFMKKIKALPTYRNAVPTQSILTITSNVVYGQKTGNTPDGRRAGTPFAPGANPMHGRDRKGAVASLTSVAKLPFTYAKDGISYTFSIVPAALGKEDPVRKTNLVGLLDGYFHHEADVEGGQHLNVNVMNREMLLDAIEHPEKYPNLTIRVSGYAVRFNALTREQQQDVISRTFTQAL >CP034958.1|QAS84060.1|670196_671405_+|propionate-kinase MNEFPVVLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVNGGEPAPLAHHSYEGALKAIAFELEKRNLNDSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLHNYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTSHRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERAQLAIKTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRSNSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGKVNAPAEFA >CP034958.1|QAS84059.1|668839_670171_+|threonine/serine-transporter-TdcC MSTSDSIVSSQTKQSSWRKSDTTWTLGLFGTAIGAGVLFFPIRAGFGGLIPILLMLVLAYPIAFYCHRALARLCLSGSNPSGNITETVEEHFGKTGGVVITFLYFFAICPLLWIYGVTITNTFMTFWENQLGFAPLNRGFVALFLLLLMAFVIWFGKDLMVKVMSYLVWPFIASLVLISLSLIPYWNSAVIDQVDLGSLSLTGHDGILITVWLGISIMVFSFNFSPIVSSFVVSKREEYEKDFGRDFTERKCSQIISRASMLMVAVVMFFAFSCLFTLSPANMAEAKAQNIPVLSYLANHFASMTGTKTTFAITLEYAASIIALVAIFKSFFGHYLGTLEGLNGLILKFGYKGDKTKVSLGKLNTISMIFIMGSTWVVAYANPNILDLIEAMGAPIIASLLCLLPMYAIRKAPSLAKYRGRLDNVFVTVIGLLTILNIVYKLF >CP034958.1|QAS84058.1|667828_668818_+|bifunctional-threonine-ammonia-lyase/L-serine-ammonia-lyase-TdcB 
MHITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTDAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRTTGTLADGCDVSRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTVSIISGGNIDLSRVSQITGFVDA >CP034958.1|QAS84057.1|666791_667730_+|transcriptional-regulator-TdcA MSTILLPKTQHLVVFQEVIRSGSIGSAAKELGLTQPAVSKIINDIEDYFGVELVVRKNTGVTLTPAGQLLLSRSESITREMKNMVNEISGMSSEAVVEVSFGFPSLIGFTFMSGMINKFKEVFPKAQVSMYEAQLSSFLPAIRDGRLDFAIGTLSAEMKLQDLHVEPLFESEFVLVASKSRTCTGTTTLESLKNEQWVLPQTNMGYYSELLTTLQRNGISIENIVKTDSVVTIYNLVLNADFLTVIPCDMTSPFGSNQFITIPVEETLPVAQYAAVWSKNYRIKKAASVLVELAKEYSSYNGCRRRQLIEVG >CP034958.1|QAS84056.1|666258_666603_-|DNA-binding-transcriptional-activator-TdcR MTGITIFYGDNIIRYVVNIKKGLRPYFKQLPDNYQAKFELNLMSKFSNFIINKPFSAINTAARHIFSRYLLENKHLFYQYFKISNTGIDHLEQLINVNFFSSDRTSFCECNRFP >CP034958.1|QAS84066.1|678727_678892_-|hypothetical-protein MSKKSAKKRQPVKPVVAKEPARTAKNFGYEEMLSELEAIVADAETRLAEDEATA >CP034958.1|QAS84067.1|678914_679616_-|pirin-family-protein MITTRTARQCGQADYGWLQARYTFSFGHYFDPKLLGYASLRVLNQEVLAPGAAFQPRTYPKVDILNVILDGEAEYRDSEGNHVQASAGEALLLSTQPGVSYSEHNLSKDKPLTRMQLWLDACPQRENPLIQKLALNMGKQQLIASPEGTMGSLQLRQQVWLHHIVLDKGESANFQLHGPRAYLQSIHGKFHALTHHEEKAALTCGDGAFIRDEANITLVADSPLRALLIDLPV >CP034958.1|QAS84068.1|679720_680617_+|LysR-family-transcriptional-regulator MAKERALTLEALRVMDAIDRRGSFAAAADELGRVPSALSYTMQKLEEELDVVLFDRSGHRTKFTNVGRMLLERGRVLLEAADKLTTDAEALARGWETHLTIVTEALVPTPAFFPLIDKLAAKANTQLAIITEVLAGAWERLEQGRADIVIAPDMHFRSSSEINSRKLYTLMNVYVAAPDHPIHQEPEPLSEVTRVKYRGIAVADTARERPVLTVQLLDKQPRLTVSTIEDKRQALLAGLGVATMPYPMVEKDIAEGRLRVVSPESTSEIDIIMAWRRDSMGEAKSWCLREIPKLFSGK >CP034958.1|QAS84069.1|680667_681024_-|DUF805-domain-containing-protein MQWYLAVLKNYVGFSGRARRKEYWMFTLINAIVGAIINVIQLILGLEFPFLSLIYLAATIIPVIALCVRRLHDTDRSGAWALLYLVPIIGWLVLFVFACLEGNSGSNRYGNDPKFGSN >CP034958.1|QAS84070.1|681265_681631_-|DUF805-domain-containing-protein 
MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAGGEGILTTIYGILVFLPWWAVQFRRLHDTDRSAWWALLFLIPFIGWLIIIVFNCQAGTPGENRFGPDPKLEP >CP034958.1|QAS84071.1|681923_682910_-|glutathione-S-transferase-family-protein MGQLIDGVWHDTWYDTKSTGGKFQRSASAFRNWLTADGAPGPTGTGGFIAEKDRYHLYVSLACPWAHRTLIMRKLKGLEPFISVSVVNPLMLENGWTFDDSFPGATGDTLYQHEFLYQLYLHADPHYSGRVTVPVLWDKKNHTIVSNESAEIIRMFNTAFDALGAKAGDYYPPALQTKIDELNGWIYDTVNNGVYKAGFATSQQAYDEAVAKVFESLARLEQILGQHRYLTGNQLTEADIRLWTTLVRFDPVYVTHFKCDKHRISNYLNLYGFLRDIYQMPGIAETVNFDHIRNHYFRSHKTINPTGIISIGPWQDLDEPHGRDVRFG >CP034958.1|QAS84072.1|682979_683462_-|DoxX-family-protein MILSIDSNDANTAPLHKKTISSLSGAVESMMKKLEDVGVLVARILMPILFITAGWGKITGYAGTQQYMEAMGVPGFMLPLVILLEFGGGLAILFGFLTRTTALFTAGFTLLTAFLFHSNFAEGVNSLMFMKNLTISGGFLLLAITGPGAYSIDRLLNKKW >CP034958.1|QAS84073.1|683557_683857_-|hypothetical-protein MSSKVERERRKAQLLSQIQQQRLDLSASRREWLEATGAYDRRWNMLLSLRSWALVGSSVMAIWTIRHPNMLVRWARRGFGVWSAWRLVKTTLKQQQLRG >CP034958.1|QAS84074.1|683846_684251_-|hypothetical-protein MADTHHAQGPGKSVLGIGQRIVSIMVEMVETRLRLAVVELEEEKANLFQLLLMLGLTMLFAAFGLMSLMVLIIWAVDPQYRLNAMIATTVVLLLLALIGGIWTLRKSRKSTLLRHTRHELANDRQLLEEESREQ >CP034958.1|QAS84075.1|684253_684559_-|DUF883-domain-containing-protein MSKEHTTEHLRAELKSLSDTLEEVLSSSGEKSKEELSKIRSKAEQALKQSRYRLGETGDAIAKQTRVAAARADEYVRENPWTGVGIGAAIGVVLGVLLSRR |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034958_4 | 1051987-1052442 | Orphan |
I-E
Consensus repeat of CP034958_4
|
7 spacers
spacers of CP034958_4
>4.1|1052016|32|CP034958|PILER-CR,CRISPRCasFinder,CRT TCCACGCTGTAACGGCCATCATTAAGTTTAGT >4.2|1052077|32|CP034958|PILER-CR,CRISPRCasFinder,CRT GAAGTAGGCCTGACAGTGATTGAACGCATACT >4.3|1052138|32|CP034958|PILER-CR,CRISPRCasFinder,CRT AGTTGGGGCGGCGCAATAACGAGACGATACGC >4.4|1052199|32|CP034958|PILER-CR,CRISPRCasFinder,CRT GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT >4.5|1052260|32|CP034958|PILER-CR,CRISPRCasFinder,CRT TCAACGCGCTCAGACGTTGCGTGAGTGAACCA >4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT AAATATCCAGGGCTGGGCTGGAGGCAGACGGC >4.7|1052382|32|CP034958|PILER-CR,CRISPRCasFinder,CRT CCCGGAATGCATTCTGAAGGTTTGCTGTATAT |
CRISPR arrays and Neighbor proteins around CP034958_4
The CRISPR arrays of CP034958_4 >merge|CP034958|4|1051987-1052442|PILER-CR,CRISPRCasFinder,CRT GAGTTCCCCGCGCCAGCGGGGATAAACCGTCCACGCTGTAACGGCCATCATTAAGTTTAGTGAGTTCCCCGCGCCAGCGGGGATAAACCGGAAGTAGGCCTGACAGTGATTGAACGCATACTGAGTTCCCCGCGCCAGCGGGGATAAACCGAGTTGGGGCGGCGCAATAACGAGACGATACGCGAGTTCCCCGCGCCAGCGGGGATAAACCGGGGAGTGGCACTTCTGGGGTAGCGGCGGCCCTGAGTTCCCCGCGCCAGCGGGGATAAACCGTCAACGCGCTCAGACGTTGCGTGAGTGAACCAGAGTTCCCCGCGCCAGCGGGGATAAACCGAAATATCCAGGGCTGGGCTGGAGGCAGACGGCGAGTTCCCCGCGCCAGCGGGGATAAACCGCCCGGAATGCATTCTGAAGGTTTGCTGTATATGAGTTCCCCGCGCCAGCGGGGATAAACCA >CP034958|4|2|1051987-1052442|PILER-CR GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >CP034958|4|3|1051987-1052442|CRISPRCasFinder GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >CP034958|4|1|1051987-1052442|CRT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCCACGCTGTAACGGCCATCATTAAGTTTAGT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC 
GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA
>CP034958.1|QAS84393.1|1050975_1051647_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVISRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >CP034958.1|QAS84392.1|1050696_1050837_-|hypothetical-protein MSEENKENGFNHVKTFTKIIFIFSVLVFNDNESKITDAAVNLFIQI >CP034958.1|QAS84391.1|1049810_1050683_-|YgcG-family-protein MRYFILMFTFVCSFVAAQPTIVPQLQQQVTDLTSSLNSQEKKELTHKLESIFNNTQVQIAVLIVPTTKDETIEQYATRVFDNWRLGDAKRNDGILIIVAWSDRTVRIKVGYGLEEKVTDALAGDIIRSNMIPAFKQQKLAQGLELAINALNNQLTSQHQYPTNPSESESASSSDHYYFAIFWVFAVMFFPFWFFHQCSNFCRACKSGVCISAIYLLDLFLFSDKIFSIAVFSFFFTFTIFMVFTCLCVLQKRASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASGRW >CP034958.1|QAS84390.1|1048452_1049751_+|phosphopyruvate-hydratase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >CP034958.1|QAS84389.1|1046727_1048365_+|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >CP034958.1|QAS84388.1|1045708_1046500_+|nucleoside-triphosphate-pyrophosphohydrolase 
MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >CP034958.1|QAS84387.1|1045302_1045638_+|endoribonuclease-MazF MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >CP034958.1|QAS84386.1|1045054_1045303_+|MazF-MazE-toxin-antitoxin-system-antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >CP034958.1|QAS84385.1|1042742_1044977_+|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >CP034958.1|QAS84384.1|1041393_1042695_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHDVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILAPQLEALLPKVRACLGSLQAMRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK >CP034958.1|QAS84394.1|1053079_1054558_-|sugar-kinase 
MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDAKAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNYFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDKMVRVKDIFMPVESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNADSIQSWSNA >CP034958.1|QAS84395.1|1054584_1055862_-|MFS-transporter MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSIIGLLALTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFIFLLFQKIRTADSAPAMASSK >CP034958.1|QAS87632.1|1056180_1056966_+|SDR-family-oxidoreductase MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVDITAEGAPQKIIAASCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASPASNYVNGHLLVVDGGYLVR >CP034958.1|QAS84396.1|1057035_1058490_+|FAD-binding-oxidoreductase MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREVMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKATGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >CP034958.1|QAS84397.1|1058583_1059921_+|MFS-transporter 
MNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >CP034958.1|QAS84398.1|1059898_1060678_+|electron-transfer-flavoprotein-subunit-beta/FixA-family-protein MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWRDYLRQRMQP >CP034958.1|QAS84399.1|1060674_1061535_+|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLIIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >CP034958.1|QAS84400.1|1061682_1062258_-|glycerol-3-phosphate-responsive-antiterminator MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >CP034958.1|QAS84401.1|1062274_1062535_-|ferredoxin-family-protein MSVARNLWRVADAPHIVPADSVERQTAERLISACPAGLFSLTPEGDLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >CP034958.1|QAS84402.1|1062525_1063797_-|FAD-dependent-oxidoreductase 
MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034958_5 | 1074827-1075405 | Unclear |
I-E
Consensus repeat of CP034958_5
|
9 spacers
spacers of CP034958_5
>5.1|1074857|31|CP034958|CRISPRCasFinder TTGCCCGCGCAATTCCGGGAGCATCCGCAAT >5.2|1074918|31|CP034958|CRISPRCasFinder ACGGACAAAATATATATTGATTTGCGAATTA >5.3|1074979|31|CP034958|CRISPRCasFinder GTAAAGAAACTGCCGACAAATCCCTGTTCGT >5.4|1075040|31|CP034958|CRISPRCasFinder CCCGTCACCGACGCGCAGTGGCGCTACCGTG >5.5|1075101|31|CP034958|CRISPRCasFinder GGATCTAACGCGCTGTAAAAATTCCGTGCTT >5.6|1075162|31|CP034958|CRISPRCasFinder TGCGGATTACCGGCAAAACATGGGAGCAAAC >5.7|1075223|31|CP034958|CRISPRCasFinder CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT >5.8|1075284|31|CP034958|CRISPRCasFinder GTTTACCGCCCCGCAGAGGCGCTGGCAGATC >5.9|1075345|31|CP034958|CRISPRCasFinder GGATGACCTGTCGCTAAAACTCGCCGCGTAC >5.10|1074857|32|CP034958|PILER-CR,CRT TTGCCCGCGCAATTCCGGGAGCATCCGCAATT >5.11|1074918|32|CP034958|PILER-CR,CRT ACGGACAAAATATATATTGATTTGCGAATTAT >5.12|1074979|32|CP034958|PILER-CR,CRT GTAAAGAAACTGCCGACAAATCCCTGTTCGTT >5.13|1075040|32|CP034958|PILER-CR,CRT CCCGTCACCGACGCGCAGTGGCGCTACCGTGA >5.14|1075101|32|CP034958|PILER-CR,CRT GGATCTAACGCGCTGTAAAAATTCCGTGCTTT >5.15|1075162|32|CP034958|PILER-CR,CRT TGCGGATTACCGGCAAAACATGGGAGCAAACC >5.16|1075223|32|CP034958|PILER-CR,CRT CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA >5.17|1075284|32|CP034958|PILER-CR,CRT GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC >5.18|1075345|32|CP034958|PILER-CR,CRT GGATGACCTGTCGCTAAAACTCGCCGCGTACA |
cas2,cas1,cas6e,cas5 |
CRISPR arrays and Neighbor proteins around CP034958_5
The CRISPR arrays of CP034958_5 >merge|CP034958|5|1074827-1075405|CRISPRCasFinder,PILER-CR,CRT TGTGTTCCCCGCGCCAGCGGGGATAAACCGTTGCCCGCGCAATTCCGGGAGCATCCGCAATTGTGTTCCCCGCGCCAGCGGGGATAAACCGACGGACAAAATATATATTGATTTGCGAATTATGTGTTCCCCGCGCCAGCGGGGATAAACCGGTAAAGAAACTGCCGACAAATCCCTGTTCGTTGTGTTCCCCGCGCCAGCGGGGATAAACCGCCCGTCACCGACGCGCAGTGGCGCTACCGTGAGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATCTAACGCGCTGTAAAAATTCCGTGCTTTGTGTTCCCCGCGCCAGCGGGGATAAACCATGCGGATTACCGGCAAAACATGGGAGCAAACCGTGTTCCCCGCGCCAGCGGGGATAAACCGCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTAGTGTTCCCCGCGCCAGCGGGGATAAACCGGTTTACCGCCCCGCAGAGGCGCTGGCAGATCCGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATGACCTGTCGCTAAAACTCGCCGCGTACAGTGTTCCCCGCGCCAGCGGGGATAAACCG >CP034958|5|4|1074827-1075405|CRISPRCasFinder TGTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAAT TGTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTA TGTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGT TGTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTG AGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTT TGTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAAC CGTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT AGTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATC CGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTAC AGTGTTCCCCGCGCCAGCGGGGATAAACCG >CP034958|5|3|1074828-1075405|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG >CP034958|5|2|1074828-1075405|CRT 
GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCCGTCACCGACGCGCAGTGGCGCTACCGTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATCTAACGCGCTGTAAAAATTCCGTGCTTT GTGTTCCCCGCGCCAGCGGGGATAAACCA TGCGGATTACCGGCAAAACATGGGAGCAAACC GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG
>CP034958.1|QAS84411.1|1074437_1074731_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQITQLAGCGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ >CP034958.1|QAS84410.1|1073517_1074441_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLAATVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALTEDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRQTYALLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGYAPAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLACRDIFRSTKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPETLGDSGHRGRGG >CP034958.1|QAS84409.1|1072870_1073521_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQERPAESDTFTIECRSFAPELRTGQQLCFNLRANPTICKSGKRHDLLMEAKRQVRGQAEGSDVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFSSVDYTGMLTVTDPGLFLQRLSQGYGKSRAFGCGLMLIKPGAEA >CP034958.1|QAS84408.1|1072142_1072889_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLIFQLHGPMASWGVDAPGEVRHTHELPSRSALLGLLAAGVGIRRDDTERLNAFNRHYSLVVCASRNPRWARDYHTIQMPKEVRKARYFSRREELSDPDLLSAIISRRDYYTDAWWMVAVATTADAPYSLEQLQDGLRHPVFPLYLGRKSHPLALPLAPLLLEGNACDALCNAYQQYQDHFHKLKVSLPKLQDECWWEGEHDGLVASKILRRRDVPLNRQQWLFGERTINQGPWLSKEEPCTSQE >CP034958.1|QAS84407.1|1069139_1069292_+|type-I-toxin-antitoxin-system-Hok-family-toxin MLTKYALVAIIVLCCTVLGFTLMVGDSLCELSIRERGMEFKAVLAYESKK >CP034958.1|QAS84406.1|1068140_1068875_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIHPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMLEEETRFFGLKRECGLHEG >CP034958.1|QAS84405.1|1066354_1068067_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit 
MSEKHPGPLVVEGKLTDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD >CP034958.1|QAS84404.1|1064555_1066355_+|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTQVPPSALLPLNPEQLVRLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAEMPGITIISASQTGNARRVAEALRDDLLAAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGAVNEIHTSPYSKDAPLVASLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFELTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQAEVENEVHVTVGVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAPGKNWLFFGNPHFTEDFLYQVEWQRYVKDGVLTRIDLAWSRDQKEKVYVQDKLREQGAELWRWINDGAHIYVCGDANRMAKDVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY >CP034958.1|QAS84403.1|1063874_1064240_-|6-carboxytetrahydropterin-synthase-QueD MMSTTLFKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIIDFAELKAAFKPTYERLDHHYLNDIPGLENPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCIYRGE >CP034958.1|QAS84402.1|1062525_1063797_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL 
>CP034958.1|QAS84412.1|1075486_1076524_-|aminopeptidase MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEIFDKAGIAVLSVEATNWNLGNKDGYQQRAKTAAFPAGNSWHDVRLDNQQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >CP034958.1|QAS84413.1|1076775_1077684_+|sulfate-adenylyltransferase-subunit-2 MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >CP034958.1|QAS84414.1|1077685_1079113_+|sulfate-adenylyltransferase-subunit-CysN MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEKTFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMAWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >CP034958.1|QAS84415.1|1079112_1079718_+|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >CP034958.1|QAS84416.1|1079767_1080091_+|DUF3561-family-protein MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >CP034958.1|QAS84417.1|1080284_1080596_+|cell-division-protein-FtsB MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR 
>CP034958.1|QAS84418.1|1080614_1081325_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >CP034958.1|QAS84419.1|1081324_1081804_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYERGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >CP034958.1|QAS84420.1|1081800_1082850_+|tRNA-pseudouridine(13)-synthase-TruD MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >CP034958.1|QAS84421.1|1082830_1083592_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
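The spacer records in the Spacer_info cells above appear to follow a pipe-delimited FASTA-style header (`>ID|start|length|contig|tool(s) SEQUENCE`). The following is a minimal Python sketch for pulling those records out of a cell, assuming that field order; the field meanings are inferred from the data shown here rather than from any format specification, and `Spacer` and `parse_spacers` are hypothetical names used only for illustration.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class Spacer:
    spacer_id: str    # e.g. "5.1" (assumed: array number, then spacer index)
    start: int        # assumed: start coordinate on the contig
    length: int       # declared spacer length in bp
    contig: str       # e.g. "CP034958"
    tools: List[str]  # prediction tools reporting this spacer
    sequence: str


# One record: ">ID|start|length|contig|tool[,tool...] SEQUENCE"
_RECORD = re.compile(
    r">([^|\s]+)\|(\d+)\|(\d+)\|([^|\s]+)\|(\S+)\s+([ACGTN]+)"
)


def parse_spacers(cell: str) -> List[Spacer]:
    """Extract every spacer record found in one table cell."""
    return [
        Spacer(m[1], int(m[2]), int(m[3]), m[4], m[5].split(","), m[6])
        for m in _RECORD.finditer(cell)
    ]


if __name__ == "__main__":
    cell = (
        ">5.1|1074857|31|CP034958|CRISPRCasFinder "
        "TTGCCCGCGCAATTCCGGGAGCATCCGCAAT "
        ">5.10|1074857|32|CP034958|PILER-CR,CRT "
        "TTGCCCGCGCAATTCCGGGAGCATCCGCAATT |"
    )
    for sp in parse_spacers(cell):
        # Sanity check: the declared length should match the sequence length.
        print(sp.spacer_id, sp.tools, sp.length == len(sp.sequence))
```

Comparing the declared length against the actual sequence length is a cheap way to detect records that were truncated or merged during extraction.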
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034958_6 | 1580575-1580692 | Orphan |
NA
Consensus repeat of CP034958_6
|
1 spacer
spacers of CP034958_6
>6.1|1580606|56|CP034958|CRISPRCasFinder TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA |
CRISPR arrays and Neighbor proteins around CP034958_6
The CRISPR arrays of CP034958_6 >merge|CP034958|6|1580575-1580692|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGCTGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAACCGAGCCGTAGGCCGGATAAGGCGTTTACGC >CP034958|6|5|1580575-1580692|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGC TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA CCGAGCCGTAGGCCGGATAAGGCGTTTACGC
>CP034958.1|QAS84852.1|1579347_1580478_-|ribonucleoside-diphosphate-reductase-1-subunit-beta MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDSEVDTDDLSNFQL >CP034958.1|QAS84851.1|1579093_1579348_-|ferredoxin-like-diferric-tyrosyl-radical-cofactor-maintenance-protein-YfaE MARVTLRITGTQLLCQDEHPSLLAALESHNVAVEYQCREGYCGSCRTRLVAGQVDWIAEPLAFIQPGEILPCCCRAKGDIEIEM >CP034958.1|QAS84850.1|1578389_1579040_+|lipopolysaccharide-kinase-InaA MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIYVKTEGKAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM >CP034958.1|QAS87646.1|1575655_1575970_-|hypothetical-protein MTNKLGGELIDIADKKLAPLINDSFSYTRDFFAYSKQENNIFTFDNSKFVDPKEKEGLMIQHSNGQLVITGKYCPEGVQTAFTQEQYDKLIRYINIFFTFPKCE >CP034958.1|QAS84849.1|1574360_1575437_+|glycerophosphodiester-phosphodiesterase MKLKLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDHLVVLHDHYLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMDLNLVQLIAYTDWNETQQKQPDGSWVNYSYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTTDVNQLYDVLYNKAGVNGLFTDFPDKAVKFLNKE >CP034958.1|QAS84848.1|1572997_1574356_+|glycerol-3-phosphate-transporter 
MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSKFIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELTAKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFLYEYAGIPGTLLCGWMSDKVFRGNRGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAASAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQKRNGG >CP034958.1|QAS84847.1|1571096_1572725_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-A MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQDPAEVTLRKVISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL >CP034958.1|QAS84846.1|1569847_1571107_-|glycerol-3-phosphate-dehydrogenase-subunit-GlpB MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSHLPDGQPVADIHSGLESLRQQAPAHPYSLLGPQRVLDLACQAQALIAESGAQLQGSVELAHQRITPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDFQAHLAAASLRELDLSVETAEIELPELDVLRNNATEFRAVNIARFLDNEENWPLLLDALIPVANTCEMILMPACFGLADDKLWRWLNEKLPCSLMLLPTLPPSVLGIRLQNQLQRQFVRQGGVWMPGDEVKKVTCKNGVMNEIWTRNHADIPLRPRFAVLASGSFFSGGLVAERNGIREPILGLDVLQTATRGEWYKGDFFAPQPWQQFGVTTDETLRPSQAGQTIENLFAIGSVLGGFDPIAQGCGGGVCAVSALHAAQQIAQRAGGQQ >CP034958.1|QAS84845.1|1568660_1569851_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-C 
MNDTSFENCIKCTVCTTACPVSRVNPGYPGPKQAGPDGERLRLKDGALYDEALKYCINCKRCEVACPSDVKIGDIIQRARAKYDTTRPSLRNFVLSHTDLMGSVSTPFAPIVNTATSLKPVRQLLDAALKIDHRRTLPKYSFGTFRRWYRSVAAQQAQYKDQVAFFHGCFVNYNHPQLGKDLIKVLNAMGTGVQLLSKEKCCGVPLIANGFTAKARKQAITNVESIREAVGVKGIPVIATSSTCTFALRDEYPEVLNVDNKGLRDHIELATRWLWRKLDEGKTLPLKPLPLKVVYHTPCHMEKMGWTLYTLELLRKIPGLELTVLDSQCCGIAGTYGFKKENYPTSQAIGAPLFRQIEESGADLVVTDCETCKWQIEMSTSLRCEHPITLLAQALA >CP034958.1|QAS84844.1|1567568_1568468_-|ISNCY-family-transposase MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYSDVLWSVETSDGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPLLFYHGETSPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQKHIRDRDLIGMVDRITTLLVKGFTNDSQLQTLFNYLLQCGDTSRFTRFIEEIAKRSPLQKERLMTIAERLRQEGHQIGWQEGMHEQAIKIALRMLEQGFEREIVLATTQLTDADIPNCH >CP034958.1|QAS84853.1|1580711_1582997_-|ribonucleoside-diphosphate-reductase-1-subunit-alpha MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEELAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI >CP034958.1|QAS84854.1|1583692_1587445_+|AIDA-I-family-autotransporter-YfaL 
MRIIFLRKEYLSLLPSMIASLFSANGVAAVTDSCQGYDVKASCQASRQSLSGITQDWSIADGQWLVFSDMTNNASGGAVFLQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNAMFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYGGAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVDSIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRSNSLMNVGDTHCQDDPQDCYSLTIGSIDQYQNQAELNVGSTQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQSMALTGDIVVDDGAVLSLEGDAADLTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQVSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAGVDTQWGALMADSSGQHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDIQSIDAISSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSGAVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKMVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAKVGGYLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSGDLFSGRWGTDGEWMLGIVGGYSDNQGDSRSNMTGTRADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHSSGIIASLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW >CP034958.1|QAS87647.1|1587572_1588295_-|bifunctional-2-polyprenyl-6-hydroxyphenol-methylase/3-demethylubiquinol-3-O-methyltransferase-UbiG MNAEKSPENHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGATVTGLDMGFEPLQVAKLHALESGIQVDYVQETVEKHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTLNRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNSFKLGPGVDVNYMLHTQNK >CP034958.1|QAS84855.1|1588441_1591069_+|DNA-topoisomerase-(ATP-hydrolyzing)-subunit-A 
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDLAVYNTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE >CP034958.1|QAS84856.1|1591217_1592906_+|DUF2138-domain-containing-protein MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNNLQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLGIEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETVPVYQLRYNGNNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIAGDLLSGKKRWQASFGLEERTAEKTPVRQRIVVSARWLGFGYQRLMPSFAGVRFEMGNDGWHSFVALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYINPQGIAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGAAWQWLPITWQPL >CP034958.1|QAS84857.1|1592902_1593526_+|DUF1175-domain-containing-protein MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFLAR >CP034958.1|QAS87648.1|1593669_1598064_+|alpha-2-macroglobulin-family-protein 
MRLEAPGRDYRRYQMEEYGGVDVRLYRIPDPMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSSQSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVEQFRYPLWQAKPFEPQQGVKLEGASSNFISPQPGNIYIPLGQQEPGLYLVEAMVGGYRATTVVFVSDTVALSKVSGNELLVWTAGKKQGEAKPGSEILWTDGLGVMTRGVTDDSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTDRPLYRAGDRVDVKVMGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVDVTLDARNGGQGSFRLPENAVAGGYELRLAYRNQVYSSSFRVANYIKPHFEIGLALAKKEFKTGEAVSGKLQLLYPDGEPVKNARVQLSLRAQQLSMVGNDLRYAGRFPVSLEGSETVSDASGHVALNLPAADKPSRYLLTVSASDGAAYRVTTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWLRLEDRTSHSGELPSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSGKGSTAHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQSLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQNAGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVDEMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGATNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWRITARGMNGDGLVGQGRAYLRSEKNLYMKWSMPTVYRVGDKPAAGLFIFSQQDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNGQVQDSISTKLSFVDNSWPVEQQKNVMLGGGDNALMLPEQASNIRLQSSETPQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQMIQVNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQAIGVTQQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAIARRGTKTEDFSEEDTRDINDSLILDTPESPLADAVANVLTMTLLKKAQLKSTVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQAAAILSGLTAEQSTIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEYWRWVGQGVPDILSFGDELSPQNVQVRWREPAKTAQQSNIPVTVERQLYRLITGEEEMSFTLQPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTWGISVNKPNAAKQQGQLLEIARNEMGELAYMVPVKELTGTVTFRHLLRFSQKGQFVLPPARYMRSYAPAQQSVAAGSEWTRMQVK >CP034958.1|QAS84858.1|1598064_1599714_+|DUF2300-domain-containing-protein 
MNWRRIVWLLALVTLPTLAEEPPLQLALRGAQHDQLYKLSSSGVTNVSTLPDTLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESITRDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSVTVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWFADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQVASGQCVEVELFARYPLKKITAEKSTTAVKPGVLNGRYRVTFTNGNHITFVSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMAAWTQDLIYAGDPVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRSTCQLLPKAKAWLAKKMPQWRRILQAETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >CP034958.1|QAS84859.1|1599718_1600495_+|DUF2135-domain-containing-protein MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPVEGEDASFSQSINYPASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDGSFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWDTDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPVHGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGELTLVKSFDW >CP034958.1|QAS84860.1|1600568_1601753_-|acetyl-CoA-C-acetyltransferase MKNCVIVSAVRTAIGSFNGSLASTSAIDLGATVIKAAIERAKIDSQHVDEVIMGNVLQAGLGQNPARQALLKSGLAETVCGFTVNKVCGSGLKSVALAAQAIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLMCATHGYHMGITAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEIVPVNVVTRKKTFVFSQDEFPKANSTAEALGALRPAFDKAGTVTAGNASGINDGAAALVIMEESAALAAGLTPLARIKSYASGGVPPALMGMGPVPATQKALQLAGLQLADIDLIEANEAFAAQFLAVGKTLGFDPEKVNVNGGAIALGHPIGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034958_7 | 2185231-2185354 | Orphan |
NA
Consensus repeat of CP034958_7
|
1 spacer
spacers of CP034958_7
>7.1|2185274|38|CP034958|CRISPRCasFinder CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around CP034958_7
The CRISPR arrays of CP034958_7 >merge|CP034958|7|2185231-2185354|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA >CP034958|7|6|2185231-2185354|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>CP034958.1|QAS85376.1|2184790_2185096_-|monooxygenase MATLLQLHFAFNGPFGDAMAEQLKPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKSALAYLEKHTARLKNLGVEEVVAKVFDVNEPLSQINQAKLA >CP034958.1|QAS85375.1|2183060_2184665_-|FAD-NAD(P)-binding-protein MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENSKMMLANIASIEIPPINCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRILLGEYFRDQFLRLVDQARQQKFAVAVYESCQVTDLQITNAGVMLATNQDLPSETFDLVVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTSLSGLDAAMAVAIQHGSFIEDDKQHVVFNRDNASEKLNITLMSRTGILPEADFYCPIPYEPLHIVTDQALNAEIQKGEEGLLDRVFRLIVEEIKFADPDWSQRIALESLNVDSFAQAWFAERKQRDPFDWAEKNLQEVERNKREKHTVPWRYVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLALREAGIIHILALGEDYEMEINESRTVLKTEDNSYSFDVFIDARGQRPLKVKDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEDIRGRVAFGALPWLMHDQPFVQGLTACAEIGEAMARAVVKPASRARRRLSFD >CP034958.1|QAS85374.1|2182236_2183049_+|hypothetical-protein MIITRADLREWRIGAVMYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPGEIPFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLAQIASFGANARIANSGDNVHIIASGENSTVVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGENNIRAGVRYRLNEQHQFVEC >CP034958.1|QAS85373.1|2181447_2182233_+|thiosulfate-reductase-cytochrome-B-subunit MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALLRARGVKKSVTDYGEKIYLYCKAVRLWHWSNALLFVLLLASGLINHFALVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGGNGHHYRIRRQGWLERAAKQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHETFKSMVDGYHRH >CP034958.1|QAS85372.1|2180782_2181451_+|4Fe-4S-dicluster-domain-containing-protein MSFTRRKFVLGMGTVIFFTGSASSLLANTRQEKEVRYAMIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDNDNETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLAKGFPPICVSACPEHALIFGREDSPEIQAWLQQNKYYQYQLPGAGKPHLYRRFGQHLIKKENV >CP034958.1|QAS85371.1|2180071_2180719_+|YdhW-family-putative-oxidoreductase-system-protein 
MGEMNHRDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYSVAEEELSVLLESIKQNGDYADIACMTGSQDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQIEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP >CP034958.1|QAS85370.1|2177965_2180068_+|aldehyde-ferredoxin-oxidoreductase MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITSLSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLKIKDDKVSLEKADFLWGKGTRATTEEICRLTSPETCVAAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTPQSWAEYSDPKSRWTARKELFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNIPRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVLPAEEYAEIHWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQVGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNWVWPMTVSPLKSRNYRGDLALEAKFFKAITGEEMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQIPVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRETLQRLGLEDIAADLAAHNLLPV >CP034958.1|QAS85369.1|2177318_2177945_+|ferredoxin-like-protein MNPVDRPLLDIGLTRLEFLRISGKGLAGLTIAPALLSLLGCKQEDIDSGTVGLINTPKGVLVTQRARCTGCHRCEISCTNFNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCSACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV >CP034958.1|QAS85368.1|2176653_2176863_+|fumarate-hydratase-FumD MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK >CP034958.1|QAS85367.1|2174685_2176098_-|pyruvate-kinase-I MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKTAAILLDTKGPEIRTMKLEGGNDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNLPGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGIMVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKYPLEAVSIMATICERTDRVMNSRLEFNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARAVRKYFPDATILALTTNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAHKGDVVVMVSGALVPSGTTNTASVHVL >CP034958.1|QAS85377.1|2185668_2186925_+|hypothetical-protein 
MGSDAKNLMSDGNVQIVKTGEVIGATQLTEGELIVEAGGRAENTVVTGAGWLKVATGGIAKCTQYGNNGTLSVSDGAIATDIVQSEGGAISLSTLATVNGRHPEGEFSVDQGYACGLLLENGGNLRVLEGHRAEKIILDQEGGLLVNGTTSAVVVDEGGELLVYPGGEASNCEINQGGVFMLAGKASDTLLAGGTMNNLGGEDSDTIVENGSIYRLGTDGLQLYSSGKTQNLSVNVGGRAEVHAGTLENAVIQGGTVILLSPTSADENFVVEEDRAPVELTGSVALLDGASMIIGYGADLQQSTITVQQGGVLILDGSTVKGDGVTFIVGNINLNGGKLWLITGAATHVQLKVKRLRGEGAICLQTSAKEISPDFINVKGEVTGDIHVEITDASRQTLCNALKLQPDEDGIGATLQPA >CP034958.1|QAS85378.1|2186965_2188339_-|multidrug-efflux-MATE-transporter-MdtK MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPGMVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLMVGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGLPSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRLPSVIILQRASR >CP034958.1|QAS85379.1|2188553_2189195_+|riboflavin-synthase MFTGIVQGTVKLVSIDEKPNFRTHVVELPDHMLDGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA >CP034958.1|QAS85380.1|2189234_2190383_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGESYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIASARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMICEKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEHVGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSEPHFVMEDWHNFGADYDTTLMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR >CP034958.1|QAS85381.1|2190673_2191885_-|Bcr/CflA-family-multidrug-efflux-MFS-transporter 
MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTIFALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFATIMPLVGLSPALAPLLGSWLLVHFSWQAIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVVAQALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVGCQNHGNAEVAHSESH >CP034958.1|QAS85382.1|2191997_2192930_+|LysR-family-transcriptional-regulator MWSEYSLEVVDAVARNGSFSAAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTAAGAWFLKEGRSVVKKMQITRQQCQQIANGWRGQLAIAVDNIVRPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATRAIPVGGRYAFRDMGMLSWSCVVASHHPLALMDGPFSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRVVVPDWESSATCISAGLCIGMVPTHFAKPWLNEGKWVVLELENPFPDSACCLTWQQNDMSPALTWLLEYLGDSETLNKEWLREPEETPATGD >CP034958.1|QAS85383.1|2192926_2193952_-|HTH-type-transcriptional-repressor-PurR MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR >CP034958.1|QAS85384.1|2194250_2194340_+|YnhF-family-membrane-protein MSTDLKFSLVTTIIVLGLIVAVGLTAALH >CP034958.1|QAS85385.1|2194505_2195675_+|MFS-transporter MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIFTLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRMSFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVTAMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPETVCVANS >CP034958.1|QAS85386.1|2195820_2196402_-|superoxide-dismutase-[Fe] 
MSFELPALPYAKDALAPHISAETIEYHYGKHHQTYVTNLNNLIKGTAFEGKSLEEIIRSSEGGVFNNAAQVWNHTFYWNCLAPNAGGEPTGKVAEAIAASFGSFADFKAQFTDAAIKNFGSGWTWLVKNSDGKLAIVSTSNAGTPLTTDATPLLTVDVWEHAYYIDYRNARPGYLEHFWALVNWEFVAKNLAA |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034958_10 | 3211186-3211330 | Orphan |
NA
Consensus repeat of CP034958_10
|
1 spacer
spacers of CP034958_10
>10.1|3211238|41|CP034958|CRISPRCasFinder TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC |
CRISPR arrays and Neighbor proteins around CP034958_10
The CRISPR arrays of CP034958_10 >merge|CP034958|10|3211186-3211330|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGCTGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTCGTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC >CP034958|10|8|3211186-3211330|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC
>CP034958.1|QAS86290.1|3209836_3211120_+|putative-acyl-CoA-thioester-hydrolase MNTFSVSRLALALAFGVTLTACSSTPPDQRPSDQTAPGTSSRPILSAKEAQNFDAQHYFASLTPGAAAWNPSPITLPAQPDFVVGPAGTQGVTHTTIQAAVDAAIIKRTNKRQYIAVMPGEYQGTVYVPAAPGGITLYGTGEKPIDVKIGLSLDGGMSPADWRHDVNPRGKYMPGKPAWYMYDSCQSKRSDSIGVLCSAVFWSQNNGLQLQNLTIENTLGDSVDAGNHPAVALRTDGDQVQINNVNILGRQNTFFVTNSGVQNRLETNRQPRTLVTNSYIEGDVDIVSGRGAVVFDNTEFRVVNSRTQQEAYVFAPATLSNIYYGFLAVNSRFNAFGDGVAQLGRSLDVDANTNGQVVIRDSAINEGFNTAKPWADAVISNRPFAGNTGSVDDNDEIQRNLNDTNYNRMWEYNNRGVGSKVVAEAKK >CP034958.1|QAS86289.1|3208631_3209702_+|integrase MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP034958.1|QAS86288.1|3208435_3208654_+|excisionase MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >CP034958.1|QAS86287.1|3208228_3208396_+|hypothetical-protein MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >CP034958.1|QAS86286.1|3208110_3208296_-|hypothetical-protein MFSASITLLNGSPFHSPVTRKCIYHLHKTKPAVASSDKRNPRQCEDAVHCCYTLFCSQRKR >CP034958.1|QAS86285.1|3207383_3207986_-|hypothetical-protein MSYFLRKKWMVNLSGSGKILWALNMKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >CP034958.1|QAS86284.1|3206951_3207173_+|TraR/DksA-family-transcriptional-regulator MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >CP034958.1|QAS86283.1|3206571_3206853_+|cell-division-protein-ZapA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >CP034958.1|QAS86282.1|3206369_3206561_+|DUF1382-family-protein MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV 
>CP034958.1|QAS86281.1|3206214_3206397_+|DUF1317-family-protein MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >CP034958.1|QAS86291.1|3211353_3213615_-|hydratase MIKLSEKGVFLASNNEIIAEEHFTGEIKKEEAQKGTIAWSILSSHNTSGNMDKLKIKFDSLASHDITFVGIVQTAKASGMERFPLPYVLTNCHNSLCAVGGTINGDDHVFGLSAAQRYGGIFVPPHIAVIHQYMREMMAGGGKMILGSDSHTRYGALGTMAVGEGGGELVKQLLNDTWDIDYPGVVAVHLTGKPAPYVGPQDVALAIIGAVFKNGYVKNKVMEFVGPGVSALSTDFRNSVDVMTTETTCLSSVWQTDEEVHNWLALHGRGQDYCQLNPQPMAYYDGCISVDLSAIKPMIALPFHPSNVYKIDTLNQNLTDILREIEIESERVAHGKAKLSLLDKVENGRLKVQQGIIAGCSGGNYENVIAAANALRGQSCGNDTFSLAVYPSSQPVFMDLAQKGVVADLIGAGAIIRTAFCGPCFGAGDTPINNGLSIRHTTRNFPNREGSKPANGQMSAVALMDARSIAATAANGGYLTSASELDCWDNVPEYAFDVTPYKNRVYQGFVKGATQQPLIYGPNIKDWPELGALTDNIVLKVCSKILDEVTTTDELIPSGETSSYRSNPIGLAEFTLSRRDPGYVGRSKATAELENQRLAGNVSELTEVFARIKQIAGQEHIDPLQTEIGSMVYAVKPGDGSAREQAASCQRVIGGLANIAEEYATKRYRSNVINWGMLPLQMAEVPTFEVGDYIYIPGIKAALDNPGTTFKGYVIHEDAPVTEITLYMGSLTAEEREIIKAGSLINFNKNRQM >CP034958.1|QAS86292.1|3213797_3215231_-|anion-permease MNKKSLWKLILILAIPCIIGFMPAPAGLSELAWVLFGIYLAAIVGLVIKPFPEPVVLLIAVAASMVVVGNLSDGAFKTTAVLSGYSSGTTWLVFSAFTLSAAFVTTGLGKRIAYLLIGKIGNTTLGLGYVTVFLDLVLAPATPSNTARAGGIVLPIINSVAVALGSEPEKSPRRVGHYLMMSIYMVTKTTSYMFFTAMAGNILALKMINDILHLQISWGGWALAAGLPGIIMLLVTPLVIYTMYPPEIKKVDNKTIAKAGLAELGPMKIREKMLLGVFVLALLGWIFSKSLGVDESTVAIVVMATMLLLGIVTWEDVVKNKGGWNTLIWYGGIIGLSSLLSKVKFFEWLAEVFKNNLAFDGHGNVAFFVIIFLSIIVRYFFASGSAYIVAMLPVFAMLANVSGAPLMLTALALLFSNSYGGMVTHYGGAAGPVIFGVGYNDIKSWWLVGAVLTILTFLVHITLGVWWWNMLIGWNML >CP034958.1|QAS86293.1|3215306_3216359_-|4-oxalomesaconate-tautomerase MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGIGGGNPLTSKVAIISRSSDLRADVDYLFAQVIVHEQRVDTTPNCGNMLSGVGAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARIDGVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVIIPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGLGDVSNMVIPKPVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIVPSVGYGNINIEHPSGALDVHLSNEGQDATTLRASVIRTTRKIFSGEVYLP >CP034958.1|QAS87728.1|3216542_3217496_+|LysR-family-transcriptional-regulator 
MKHELSSMKAFVILAESSSFNNAAKLLNITQPALTRRIKKMEEDLHIQLFERTTRKVTLTKAGKRLLPEARELIKKFDETLFNIRDMNAYHRGMVTLACIPTAVFYFLPLAIGKFNELYPNIKVRILEQGTNNCMESVLCNESDFGINMNNVTNSSIDFTPLVNEPFVLACRRDHPLAKKQLVEWQELVGYKMIGVRSSSGNRLLIEQQLADKPWKLDWFYEVRHLSTSLGLVEAGLGISALPGLAMPHAPYSSIIGIPLVEPVIRRTLGIIRRKDAVLSPAAERFFALLINLWTDDKDNLWTNIVERQRHALQEIG >CP034958.1|QAS86294.1|3217536_3218532_-|6-phosphogluconolactonase MKQTVYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSPDKRYLYVGVRPEFRVLAYSIAPDDGALTFAAESALPGSPTHISTDHQGQFVFVGSYNAGNVSVTRLEDGLPVGVVDVVEGLDGCHSANISPDNRTLWVPALKQDRICLFTVSDDGHLVAQDPAEVTTVEGAGPRHMVFHPNEQYAYCVNELNSSVDVWELKDPHGNIECVQTLDMMPENFSDTRWAADIHITPDGRHLYACDRTASLITVFSVSEDGSVLSKEGFQPTETQPRGFNVDHSGKYLIAAGQKSHHISVYEIVGEQGLLHEKGRYAVGQGPMWVVVNAH >CP034958.1|QAS86295.1|3218686_3219505_+|pyridoxal-phosphatase MTTRVIALDLDGTLLTPKKTLLPSSIEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYHAKTVLEADPMPVNKALQLIEMLNEHHIHGLMYVDDAMVYEHPTGHVIRTSNWAQTLPPEQRPTFTQVASLAETAQQVNAVWKFALTHDDLPQLQHFGKHVEHELGLECEWSWHDQVDIARGGNSKGKRLTKWVEAQGWSMENVVAFGDNFNDISMLEAAGTGVAMGNADDAVKARANIVIGDNTTDSIAQFIYSHLI >CP034958.1|QAS86296.1|3219505_3220564_-|molybdenum-ABC-transporter-ATP-binding-protein-ModC MLELNFSQTLGNHCLTINETLPANGITAIFGVSGAGKTSLINAISGLTRPQKGRIVLNGRVLNDAEKGICLTPEKRRVGYVFQDARLFPHYKVRGNLRYGMSKSMVDQFDKLVALLGIEPLLDRLPGSLSGGEKQRVAIGRALLTAPELLLLDEPLASLDIPRKRELLPYLQRLTREINIPMLYVSHSLDEILHLADRVMVLENGQVKAFGALEEVWGSSVMNPWLPKEQQSSILKVTVLEHHPHYAMTALALGDQHLWVNKLDEPLQAALRIRIQASDVSLVLQPPQQTSIRNVLRAKVVNSYDDNGQVEVELEVGGKTLWARISPWARDELAIKPGLWLYAQIKSVSITA >CP034958.1|QAS86297.1|3220566_3221256_-|molybdenum-ABC-transporter-permease MILTDPEWQAVLLSLKVSSLAVLFSLPFGIFFAWLLVRCTFPGKALLDSVLHLPLVLPPVVVGYLLLVSMGRRGFIGERLYDWFGITFAFSWRGAVLAAAVMSFPLMVRAIRLALEGVDVKLEQAARTLGAGRWRVFFTITLPLTLPGIIVGTVLAFARSLGEFGATITFVSNIPGETRTIPSAMYTLIQTPGGESGAARLCIISIALAMISLLISEWLARISRERAGR >CP034958.1|QAS86298.1|3221255_3222029_-|molybdate-ABC-transporter-substrate-binding-protein 
MARKWLNLFAGAALSFAVAGNALADEGKITVFAAASLTNAMQDIATQYKKEKGVDVVSSFASSSTLARQIEAGAPADLFISADQKWMDYAVDKKAIDTATRQTLLGNSLVVVAPKASEQKDFTIDSKTNWTSLLNGGRLAVGDPEHVPAGIYAKEALQKLGAWDTLSPKLAPAEDVRGALALVERNEAPLGIVYGSDAVASKGVKVVAIFPEDSHKKVEYPVAVVEGHNNATVKAFYDYLKGPQAAEIFKRYGFTTK >CP034958.1|QAS86299.1|3222195_3222345_-|multidrug-efflux-pump-associated-protein,-AcrZ-family MLELLKSLVFAVIMVPVVMAIILGLIYGLGEVFNIFSGVGKKDQPGQNH |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034958_11 | 3721999-3722152 | Orphan |
NA
Consensus repeat of CP034958_11
|
1 spacer
spacers of CP034958_11
>11.1|3722052|48|CP034958|CRISPRCasFinder TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA |
CRISPR arrays and Neighbor proteins around CP034958_11
The CRISPR arrays of CP034958_11 >merge|CP034958|11|3721999-3722152|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCGTCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAACGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG >CP034958|11|9|3721999-3722152|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCG TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA CGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG
>CP034958.1|QAS86719.1|3720124_3721864_+|flagellar-type-III-secretion-system-protein-FlhA MLSRSDLLTLLTINFIVVTKGAERISEVSARFTLDAMPGKQMAIDADLNAGLINQAQAQTRRKDVASEADFYGAMDGASKFVRGDAIAGMMILAINLIGGVCIGIFKYNLSADAAFQQYVLMTIGDGLVAQIPSLLLSTAAAIIVTRISDNGDITHDVRHQLLASPSVLYTATGIMFVLAVVPGMPHLPFLLFSALLGFTGWRMSKRPQAAEAEEKSLETLTRTITETSEQQVSWETIPLIEPISLSLGYKLVALVDKAQGNPLTQRIRGVRQVISDGNGVLLPEIRIRENFRLKPSQYAIFINGIKADEADIPADKLMALPSSETYGEIDGVLGNDPAYGMPVTWIQPAQKAKALNMGYQVIDSASVIATHVNKIVRSYIPDLFSYDDITQLHNRLSSMAPRLAEDLSAALNYSQLLKVYRALLTEGVSLRDIVTIATVLVASSAVTKDHILLAADVRLALRRSITHPFVRKQELTVYTLNNELENLLTNVVNQAQQGGKVMLDSVPVDPNMLNQFQSTMPQVKEQMKAAGKDPVLLVPPQLRPLLARYARLFAPGLHVLSYNEVPDELELKIMGALM >CP034958.1|QAS86718.1|3719394_3720165_-|putative-lateral-flagellar-export/assembly-protein-LafU MIVNSVSKSERESIIAALHGQSIFSGGGLSPLNKISPSHPPKPATVAVPEETEKKARDVNEKTALLKKKSATELGELATSINTIARDAHMEANLEMEIVPQGLRVLIKDDQNRNMFECGSAQIMPFFKTLLVELAPVFDSLDNKIIITGHTDAMAYKNNIYNNWNLSGDRALSARRVLEEAGMPEDKVMQVSAMADQMLLDAKNPQSAGNRRIEIMVLTKSASDTLYQYFGQHGDKVVQPLVQKLDKQQVLSQRMR >CP034958.1|QAS86717.1|3718268_3719324_-|DNA-polymerase-IV MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARKFGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPLSLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELHLTASAGVAPVKFLAKIASDMNKPNGQFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTCGDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMAEDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQEHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLGL >CP034958.1|QAS86716.1|3717819_3718272_-|GNAT-family-N-acetyltransferase MNNIQIRNYQPGDFQQLCAIFIRAVMMTASQHYSPQQIAAWAQIDESRWKEKLAKSQVRVAVINAQPVGFISRIERHIDMLFVDPEYTRRGVASALLKPLIKSESELTVDASITAKPFFERYGFQIVKQQHVECRGAWFTNFYMRYKPQH >CP034958.1|QAS86715.1|3717246_3717513_-|hypothetical-protein MEWYMGKYIRPLSDAVFTIASDDLWIESLAIQQLHTTANLPNMQRVVGMPDLHPGRGYPIGAAFFSVGRFYPARRRGNGAGNRNGPLL >CP034958.1|QAS86714.1|3715432_3716890_+|cytosol-nonspecific-dipeptidase 
MSELSQLSPQPLWDIFAKICSIPHPSYHEEQLAEYIVGWAKEKGFHVERDQVGNILIRKPATAGMENRKPVVLQAHLDMVPQKNNDTVHDFTKDPIQPYIDGEWVKARGTTLGADNGIGMASALAVLADENVVHGPLEVLLTMTEEAGMDGAFGLQSNWLQADILINTDSEEEGEIYMGCAGGIDFTSNLHLDREAVPAGFETFKLTLKGLKGGHSGGEIHVGLGNANKLLVRFLAGHAEELDLRLIDFNGGTLRNAIPREAFATIAVAADKVDALKSLVNTYQDILKNELAEKEKNLALLLDSVANDKAALIAKSRDTFIRLLNATPNGVIRNSDVAKGVVETSLNVGVVTMTDNNVEIHCLIRSLIDSGKDYVVSMLDSLGKLAGAKTEAKGAYPGWQPDANSPVMHLVRETYQRLFNKTPNIQIIHAGLECGLFKKPYPEMDMVSIGPTITGPHSPDEQVHIKSVGHYWTLLTELLKEIPAK >CP034958.1|QAS86713.1|3714713_3715172_-|xanthine-phosphoribosyltransferase MSEKYIVTWDMLQIHARKLASRLMPSEQWKGIIAVSRGGLVPGALLARELGIRHVDTVCISSYDHDNQRELKVLKRAEGDGEGFIVIDDLVDTGGTAVAIREMYPKAHFVTIFAKPAGRPLVDNYVVDIPQDTWIEQPWDMGVVFVPPISGR >CP034958.1|QAS86712.1|3713377_3714622_-|esterase-FrsA MTQANLSETLFKPRFKHPETSTLVRRFNHGAQPPVQSALDGKTIPHWYRMINRLMWIWRGIDPREILDVQARIVMSDAERTDDDLYDTVIGYRGGNWIYEWATQAMVWQQKACAEEDPQLSGRHWLHAATLYNIAAYPHLKGDDLAEQAQALSNRAYEEAAQRLPGTMRQMEFTVPGGAPITGFLHMPKGDGPFPTVLMCGGLDAMQTDYYSLYERYFAPRGIAMLTIDMPSVGFSSKWKLTQDSSLLHQHVLKALPNVPWVDHTRVAAFGFRFGANVAVRLAYLESPRLKAVACLGPVVHTLLSDFKCQQQVPEMYLDVLASRLGMHDASDDALRVELNRYSLKVQGLLGRRCPTPMLSGYWKNDPFSPEEDSRLITSSSADGKLLEIPFNPVYRNFDKGLQEITGWIEKRLC >CP034958.1|QAS86711.1|3712918_3713320_-|sigma-factor-binding-protein-Crl MTLPSGHPKSRLIKKFTALGPYIREGKCEDNRFFFDCLAVCVNVKPAPEVREFWGWWMELEAQESRFTYSYQFGLFDKAGDWKSVPVKDTEVVERLEHTLREFHEKLRELLTTLNLKLEPADDFRDEPVKLTA >CP034958.1|QAS86710.1|3711824_3712880_+|phosphoporin-PhoE MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNDSKDGDQSYIRFGFKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNENRDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSRGTGKRAEAWATGLKYDANNIYLATFYSETRKMTPITGGFANKTQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDSDNKLNINNDDIVAVGMTYQF >CP034958.1|QAS86720.1|3722181_3722679_-|transposase 
MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRHAIIKVKRDRPFEINAWVVLPEHMHCIWTLPEGDDDFSSRWREIKKQFTHACGLKNIWQPRFWEHAIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVARGLYPIDWAGDVTDINAGERIIL >CP034958.1|QAS86721.1|3722854_3723613_-|peptidoglycan-endopeptidase MSFMSSFLLGRFLHPGVFSLCVLLPLFASATTSHISFSYAARQRMQNRARLLKQYQTHLKKQASYIVEGNAESRRALRQHNREQIKQHPEWFPAPLKASDRRWQALAENNHFLSSDHLHNITEVAIHRLEQQLGKPYVWGGTRPDQGFDCSGLVFYAYNKILEAKLPRTANEMYHYHRATIVANNDLRRGDLLFFHIHSREIADHMGVYLGDGQFIESPRTGENIRVSRLAEPFWQDHFLGARRILTEETIL >CP034958.1|QAS86722.1|3723904_3724645_+|murein-L,D-transpeptidase MRKIALILAMLLIPCVSFAGLLGSSSSTTPVSKEYKQQLMGSPVYIQIFKEERTLDLYVKMGEQYQLLDSYKICKYSGGLGPKQRQGDFKSPEGFYSVQRNQLKPDSRYYKAINIGFPNAYDRAHGYEGKYLMIHGDCVSIGCYAMTNQGIDEIFQFVTGALVFGQPSVQVSIYPFRMTDANMKRHKYSNFKDFWEQLKPGYDYFEQTRKPPTVSVVNGRYVVSKPLSHEVVQPQLASNYTLPEAK >CP034958.1|QAS86723.1|3724615_3725383_-|class-II-glutamine-amidotransferase MCELLGMSANVPTDICFSFTGLVQRGGGTGPHKDGWGITFYEGKGCRTFKDPQPSFNSPIAKLVQDYPIKSCSVVAHIRQANRGEVALENTHPFTRELWGRNWTYAHNGQLTGYKSLETGNFRPVGETDSEKAFCWLLHKLTQRYPRTPGNMAAVFKYIASLADELRQKGVFNMLLSDGRYVMAYCSTNLHWITRRAPFGVATLLDQDVEIDFSSQTTPNDVVTVIATQPLTGNETWQKIMPGEWRLFCLGERVV >CP034958.1|QAS86724.1|3725588_3726167_-|D-sedoheptulose-7-phosphate-isomerase MYQDLIRNELNEAAETLANFLKDDANIHAIQRAAVLLADSFKAGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPAIAISDVSHISCVGNDFGFNDIFSRYVEAVGREGDVLLGISTSGNSANVIKAIAAAREKGMKVITLTGKDGGKMAGTADIEIRVPHFGYADRIQEIHIKVIHILIQLIEKEMVK >CP034958.1|QAS86725.1|3726406_3728851_+|acyl-CoA-dehydrogenase 
MMILSILATVVLLGALFYHRVSLFISSLILLAWTAALGVAGLWSAWVLVPLAIILVPFNFAPMRKSMISAPVFRGFRKVMPPMSRTEKEAIDAGTTWWEGDLFQGKPDWKKLHNYPQPRLTAEEQAFLDGPVEEACRMANDFQITHELADLPPELWAYLKEHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAITVGVPNSLGPGELLQHYGTDEQKNHYLPRLARGQEIPCFALTSPEAGSDAGAIPDTGIVCMGEWQGQQVLGMRLTWNKRYITLAPIATVLGLAFKLSDPEKLLGGAEDLGITCALIPTTTPGVEIGRRHFPLNVPFQNGPTRGKDVFVPIDYIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYAHIRRQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAIVKYHCTHRGQQSIIDAMDITGGKGIMLGQSNFLARAYQGAPIAITVEGANILTRSMMIFGQGAIRCHPYVLEEMEAAKNNDVNAFDKLLFKHIGHVGSNKVRSFWLGLTRGLTSSTPTGDATKRYYQHLNRLSANLALLSDVSMAVLGGSLKRRERISARLGDILSQLYLASAVLKRYDDEGRNEADLPLVHWGVQDALYQAEQAMDDLLQNFPNRVVAGLLNVVIFPTGRHYLAPSDKLDHKVAKILQVPNATRSRIGRGQYLTPSEHNPVGLLEEALVDVIAADPIHQRICKELGKNLPFTRLDELAHNALAKGLIDKDEAAILVKAEESRLCSINVDDFDPEELATKPVKLPEKVRKVEAA >CP034958.1|QAS86726.1|3728893_3729367_-|C-lysozyme-inhibitor MGRISSGGMMFKAITTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYTPAQTVTLGDETYQVMSACKPHDCGSQRIAVMWSEKSNQMTGLFSTIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENHPDGFNFK >CP034958.1|QAS86727.1|3729520_3730291_+|2-oxoglutaramate-amidase MPGLKITLLQQPLVWMDGPANLRHFDRQLEGITGRDVIVLPEMFTSGFAMEAAASSLAQNDVVNWMTAKAQQCNALIAGSVALQTESGSVNRFLLVEPGGTVHFYDKRHLFRMADEHLHYKAGNARVIVEWRGWRILPLVCYDLRFPVWSRNLNDYDLAIYVANWPAPRSLHWQALLTARAIENQAYVAGCNRVGSDGNGCHYRGDSRVINPQGEIIATADAHQATRIDAELSMVALREYREKFPAWQDADEFRLR >CP034958.1|QAS86728.1|3731729_3732179_-|hypothetical-protein MMKYLMVLLSLFSGSVLGMGRVNELCGIDSVKTIEIINLPSYVTTLVPLSKEGLNEIYRYKVVVNEISDLYAGKIIDLLQMKYFRKEKYNNIRWGVSIISKGNNKCEIYFDAFGECGSVNGINVCFEKNEMIGWIKKEIPLLSQKIGGL >CP034958.1|QAS86729.1|3732190_3735250_-|RHS-repeat-protein 
MTSPLNSEGRYTEGEGGLKRVVKKEHADGSITRSEYDEAGRLKAQTDAAGRRTEYSLHMASGAVTAVTGPDGRTVRYGYNSQRQVTSVTYPDGLRSSREYDEKGRLTAETSRSGETTRYSYDDPASELPTGIQDATGSTKQMAWSRYGQLLAFTDCSGYTTRYEYDRYGQQIAVHREEGISTYSSYNPRGQLVSQKDAQGREIRYEYSAAGDLTATISPDGKRSTIEYDKRGRPVSVTEGGLTRSMGYDAAGRITVLTNENGSQSTFRYDPVDRLTEQRGFDGRTQRYHYDLTGKLTQSEDEGLITLWHYDASDRITHRTVNGDPAEQWQYDEHGWLTTLSHTCEGHRVSVHYGYDDKGRLTGERQTVENPETGEMLWEHETGHAYSEQGLATRQEPDGLPPVEWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETARSFGGAGSTAGYEQATAYTLTGQLQSRHLNLPQLDCDYTWNDNGQLVRISGPQECREYRYSGTGRLTGVHTTAANLDIDIPYATDPAGNRLPDPELHPDSTLTAWPDNRIAEDAHYVYRYDEYGRLAEKTDRIPEGVIRMHDERTHHYHYDSQHRLVFYTRIQHGEPQVESRYLYDPLGRRTGKRVWRRERDLTGWMSLSRKPEETWYGWDGDRLTTVQTQQTRIQTVYQPGSFTPLLRIETENGEQAKARHRSLAEVLQEDTGVTLPAELAVMLGRLERELRQGSVSEESQQWLAQCGLTAEQMAAQLEAEYIPERKLHLYHCDHRGLPLALISPEGETAWQGEYDEWGNLLGEESAQHLQQSLRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLRGEWNLYKYPLNPVRFIDSLGLKFHVNGDPSDFNQAVEYLKQDSQMKETIDFLSSSEETINIEYIEGTNVRFNSNNMAIYWNSRASLFCSTELNSKSQSPALGLGHEFAHAQYYLLDKENFMALLSRTDKKYENKEEARVITIIESRAAKTLGECTRGAHSGLPFYRVDGPLQTMKITGTPE |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
CP034958_12 | 3935446-3935578 | Orphan |
NA
Consensus repeat of CP034958_12
|
2 spacers
spacers of CP034958_12
>12.1|3935463|42|CP034958|PILER-CR TGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTC >12.2|3935522|40|CP034958|PILER-CR CATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGG |
CRISPR arrays and Neighbor proteins around CP034958_12
The CRISPR arrays of CP034958_12 >merge|CP034958|12|3935446-3935578|PILER-CR ATCACCAATATTGAAAATGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTCCTCACCAATATTGAAAACATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGGATCACCAATATTGAAAG >CP034958|12|5|3935446-3935578|PILER-CR ATCACCAATATTGAAAA TGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTC CTCACCAATATTGAAAA CATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGG ATCACCAATATTGAAAG
>CP034958.1|QAS86891.1|3934584_3935355_-|electron-transfer-flavoprotein-FixA MKIITCYKCVPDEQDIAVNNADGSLDFSKADAKISQYDLNAIEAACQLKQQAAEAQVTALSVGGKALTNAKGRKDVLSRGPDELIVVIDDQFEQALPQQTASALAAAAQKAGFDLILCGDGSSDLYAQQVGLLVGEILNIPAVNGVSKIISLTADTLTVERELEDETETLSIPLPAVVAVSTDINSPQIPSMKAILGAAKKPVQVWSAADIGFNAEAAWSEQQVAAPKQRERQRIVIEGDGEEQIAAFAENLRKVI >CP034958.1|QAS86890.1|3933628_3934570_-|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNTFSQVWVFSDTPSRLPELMNGAQALANQINTFVLNDADGAQAIQLGANHVWKLNGKPDDRMIEDYAGVMADTIRQHGADGLVLLPNTRRGKLLAAKLGYRLKAAVSNDASTVSVQDGKATVKHMVYGGLAIGEERIATPYAVLTISSGTFDAAQPDASRTGETHTVEWQAPAVAITRTATQARQSNSVDLDKARLVVSVGRGIGSKENIALAEQLCKAIGAELACSRPVAENEKWMEHERYVGISNLMLKPELYLAVGISGQIQHMVGANASQTIFAINKDKNAPIFQYADYGIVGDAVKILPALTAALAR >CP034958.1|QAS86889.1|3932291_3933578_-|FAD-dependent-oxidoreductase MSEDIFDAIIVGAGLAGSVAALVLAREGAQVLVIERGNSAGAKNVTGGRLYAHSLEHIIPGFADSAPVERLITHEKLAFMTEKSAMTMDYCNGDETSPSQRSYSVLRSKFDAWLMEQAEEAGAQLITGIRVDNLVQRDGKVVGVEADGDVIEAKTVILADGVNSILAEKLGMAKRVKPTDVAVGVKELIELPKSVIEDRFQLQGNQGAACLFAGSPTDGLMGGGFLYTNENTLSLGLVCGLHHLHDAKKSVPQMLEDFKQHPAVAPLIAGGKLVEYSAHVVPEAGINMLPELVGDGVLIAGDAAGMCMNLGFTIRGMDLAIAAGEAAAKTVLSAMKSDDFSKQKLAEYRQHLESGPLRDMRMYQKLPAFLDNPRMFSGYPELAVGVARDLFTIDGSAPELMRKKILRHGKKVGFINLIKDGMKGVTVL >CP034958.1|QAS86888.1|3932007_3932295_-|ferredoxin-like-protein-FixX MTSPVNVDVKLGVNKFNVDEEHPHIVVKADADKQVLELLVKACPAGLYKKQDDGSVRFDYAGCLECGTCRILGLGSALEQWEYPRGTFGVEFRYG >CP034958.1|QAS86887.1|3930618_3931950_-|MFS-transporter MQPSRNFDDLKFSSIHRRILLWGSGGPFLDGYVLVMIGVALEQLTPALKLDADWIGLLGAGTLAGLFVGTSLFGYISDKVGRRKMFLIDIIAIGVISVATMFVSSPVELLVMRVLIGIVIGADYPIATSMITEFSSTRQRAFSISFIAAMWYVGATCADLVGYWLYDVEGGWRWMLGSAAIPCLLILIGRFELPESPRWLLRKGRVKECEEMMIKLFGEPVAFDEEQPQQTRFRDLFNRRHFPFVLFVAAIWTCQVIPMFAIYTFGPQIVGLLGLGVGKNAALGNVVISLFFMLGCIPPMLWLNTAGRRPLLIGSFAMMTLALAVLGLIPDMGIWLVVMAFAVYAFFSGGPGNLQWLYPNELFPTDIRASAVGVIMSLSRIGTIVSTWALPIFINNYGISNTMLMGAGISLFGLLISVAFAPETRGMSLAQTSNMTIRGQRMG 
>CP034958.1|QAS86886.1|3929980_3930511_-|glutathione-regulated-potassium-efflux-system-oxidoreductase-KefF MILIIYAHPYPHHSHANKRMLEQARTLEGVEIRSLYQLYPDFNIDIAAEQEALSRADLIVWQHPMQWYSIPPLLKLWIDKVFSHGWAYGHGGTALHGKHLLWAVTTGGGESHFEIGAHPGFDVLSQPLQATAIYCGLNWLPPFAMHCTFICDDETLEGQARHYKQRLLEWQEAHHG >CP034958.1|QAS86885.1|3928125_3929988_-|glutathione-regulated-potassium-efflux-system-protein-KefC MDSHTLIQALIYLGSAALIVPIAVRLGLGSVLGYLIAGCIIGPWGLRLVTDAESILHFAEIGVVLMLFIIGLELDPQRLWKLRAAVFGCGALQMVICGGLLGLFCMLLGLRWQVAELIGMTLALSSTAIAMQAMNERNLMVTQMGRSAFAVLLFQDIAAIPLVAMIPLLATSSASTTMGAFALSALKVAGALVLVVLLGRYVTRPALRFVARSGLREVFSAVALFLVFGFGLLLEEVGLSMAMGAFLAGVLLASSEYRHALESDIEPFKGLLLGLFFIGVGMSIDFGTLLENPLRIVILLLGFLIIKIAMLWLIARPLQVPNKQRRWFAVLLGQGSEFAFVVFGAAQMANVLEPEWAKSLTLAVALSMAATPILLVILNRLEQSSTEEAREADEIDEEQPRVIIAGFGRFGQITGRLLLSSGVKMVVLDHDPDHIETLRKFGMKVFYGDATRMDLLESAGAAKAEVLINAIDDPQTNLQLTEMVKEHFPHLQIIARARDVDHYIRLRQAGVEKPERETFEGALKTGRLALESLGLGPYEARERADVFRRFNIQMVEEMAMVENDTKARAAVYKRTSAMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKPSS >CP034958.1|QAS87761.1|3927454_3927934_-|type-3-dihydrofolate-reductase MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLNKPVIMGRHTWESIGRPLPGRKNIILSSQPGTDDRVTWVKSVDEAIAACGDVPEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVEGDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR >CP034958.1|QAS86884.1|3926534_3927377_+|bis(5'-nucleosyl)-tetraphosphatase-(symmetrical) MATYLIGDVHGCYDELIALLHKVEFTPGKDTLWLTGDLVARGPGSLDVLRYVKSLGDSVRLVLGNHDLHLLAVFAGISRNKPKDRLTPLLEAPDADELLNWLRRQPLLQIDEEKKLVMAHAGITPQWDLQTAKECARDVEAVLSSDSYPFFLDAMYGDMPNNWSPELRGLGRLRFITNAFTRMRFCFPNGQLDMYSKESPEEAPAPLKPWFAIPGPVAEEYSIAFGHWASLEGKGTPEGIYALDTGCCWGGTLTCLRWEDKQYFVQPSNRHKDLGEAAAS >CP034958.1|QAS86883.1|3926150_3926528_+|Co2+/Mg2+-efflux-protein-ApaG MINSPRVCIQVQSVYIEAQSSPDNERYVFAYTVTIRNLGRAPVQLLGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMIDENGVPFSIDIPVFRLAVPTLIH >CP034958.1|QAS86892.1|3935828_3937343_+|L-carnitine/gamma-butyrobetaine-antiporter 
MKNEKRKTGIEPKVFFPPLIIVGILCWLTVRDLDAANVVINAVFSYVTNVWGWAFEWYMVVMLFGWFWLVFGPYAKKRLGNEPPEFSTASWIFMMFASCTSAAVLFWGSIEIYYYISTPPFGLEPNSTGAKELGLAYSLFHWGPLPWATYSFLSVAFAYFFFVRKMEVIRPSSTLVPLVGEKHAKGLFGTIVDNFYLVALIFAMGTSLGLATPLVTECMQWLFGIPHTLQLDAIIITCWIILNAICVACGLQKGVRIASDVRSYLSFLMLGWVFIVSGASFIMNYFTDSVGMLLMYLPRMLFYTDPIAKGGFPQGWTVFYWAWWVIYAIQMSIFLARISRGRTVRELCFGMVLGLTASTWILWTVLGSNTLLLIDKNIINIPNLIEQYGVARAIIETWAALPLSTATMWGFFILCFIATVTLVNACSYTLAMSTCREVRDGEEPPLLVRIGWSILVGIIGIVLLALGGLKPIQTAIIAGGCPLFFVNIMVTLSFIKDAKQNWKD >CP034958.1|QAS86893.1|3937373_3938516_+|crotonobetainyl-CoA-dehydrogenase MDFNLNDEQELFVAGIRELMASENWEAYFAECDRDSVYPERFVKALADMGIDSLLIPEEHGGLDAGFVTLAAVWMELGRLGAPTYVLYQLPGGFNTFLREGTQEQIDKIMAFRGTGKQMWNSAITEPGAGSDVGSLKTTYTRRNGKIYLNGSKCFITSSAYTPYIVVMARDGASPDKPVYTEWFVDMSKPGIKVTKLEKLGLRMDSCCEITFDDVELDEKDMFGREGNGFNRVKEEFDHERFLVALTNYGTAMCAFEDAAHYANQRVQFGEAIGRFQLIQEKFAHMAIKLNSMKNMLYEAAWKADNGTITSGDAAMCKYFCANAAFEVVDSAMQVLGGVGIAGNHRISRFWRDLRVDRVSGGSDEMQILTLGRAVLKQYR >CP034958.1|QAS86894.1|3938644_3939862_+|L-carnitine-CoA-transferase MDHLPMPKFGPLAGLRVVFSGIEIAGPFAGQMFAEWGAEVIWIENVAWADTIRVQPNYPQLSRRNLHALSLNIFKDEGREAFLKLMETTDIFIEASKGPAFARRGITDEVLWQHNPKLVIAHLSGFGQYGTEEYTNLPAYNTIAQAFSGYLIQNGDVDQPMPAFPYTADYFSGLTATTAALAALHKARETGKGESIDIAMYEVMLRMGQYFMMDYFNGGEMCPRMSKGKDPYYAGCGLYKCADGYIVMELVGITQIEECFKDIGLAHLLSTPEIPEGTQLIHRIECPYGPLVEEKLDAWLAAHTIAEVKERFAELNIACAKVLTVPELESNPQYVARESITQWQTMDGRTCKGPNIMPKFKNNPGQIWRGMPSHGMDTAAILKNIGYSENDIQELVSKGLAKVED >CP034958.1|QAS87762.1|3939935_3941489_+|crotonobetaine/carnitine-CoA-ligase 
MDIIGGQHLRQMWDDLADVYGHKTALICESSGGVVNRYSYLELNQEINRTANLFYTLGIRKGDKVALHLDNCPEFIFCWFGLAKIGAIMVPINARLLREESAWILQNSQACLLVTSAQFYPMYQQIQQEDATQLRHICLTDVALPADDGVSSFTQLKNQQPATLCYAPPLLTDDTAEILFTSGTTSRPKGVVITHYNLRFAGYYSAWQCALRDDDVYLTVMPAFHIDCQCTAAMAAFSAGATFVLVEKYSARAFWGQVQKYRATITECIPMMIRTLMVQPPSANDRQHRLREVMFYLNLSEQEKDAFCERFGVRLLTSYGMTETIVGIIGDRPGDKRRWPSIGRAGFCYEAEIRDDHNRPLPAGEIGEICIKGVPGKTIFKEYFLNPKATAKVLEADGWLHTGDTGYCDEEGFFYFVDRRCNMIKRGGENVSCVELENIIATHPKIQDIVVVGIKDSIRDEAIKAFVVLNEGETLSEEEFFRFCEQNMAKFKVPSYLEIRKDLPRNCSGKIIRKNLK >CP034958.1|QAS86895.1|3941597_3942383_+|crotonobetainyl-CoA-hydratase MSESLHLTRNGSILEITLDRPKANAIDAKTSFEMGEVFLNFRDDPQLRVAIITGAGEKFFSAGWDLKAAAEGEAPDADFGPGGFAGLTEIFNLDKPVIAAVNGYAFGGGFELALAADFIVCADNASFALPEAKLGIVPDSGGVLRLPKILPPAIVNEMVMTGRRMGAEEALRWGIVNRVVNQAELMDNARELAQQLVNSAPLAIAALKEIYRTTSEMPVEEAYRYIRSGVLKHYPSVLHSEDAIEGPLAFAEKRDPVWKGR >CP034958.1|QAS86896.1|3942388_3942979_+|carnitine-operon-protein-CaiE MSYYAFEGLIPVVHPTAFVHPSAVLIGDVIVGAGVYIGPLASLRGDYGRLIVQAGANIQDGCIMHGYCNTDTIVGENGHIGHGAILHGCVIGRDALVGMNSVIMDGAVIGEESIVAAMSFIKAGFRGEKRQLLMGTPARAVRSVSDDELHWKRLNTKEYQDLVGRCHASLHETQPLRQMEENRPRLQGTTDVTPKR >CP034958.1|QAS86897.1|3943097_3943493_-|carnitine-metabolism-transcriptional-regulator-CaiF MCEGYVEKPLYLLIAEWMMAENRWVIAREISIHFDIEHSKAVNTLTYILSEVTEISCEVKMIPNKLEGRGCQCQRLVKVVDIDEQIYARLRNNSREKLVGVRKTPRIPAVPLTELNREQKWQMMLSKSMRR >CP034958.1|QAS86898.1|3943753_3946975_-|carbamoyl-phosphate-synthase-large-subunit 
MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEMADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCALELERQGVLEEFGVTMIGATADAIDKAEDRRRFDVAMKKIGLETARSGIAHTMEEALAVAADVGFPCIIRPSFTMGGSGGGIAYNREEFEEICARGLDLSPTKELLIDESLIGWKEYEMEVVRDKNDNCIIVCSIENFDAMGIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIGVETGGSNVQFAVNPKNGRLIVIEMNPRVSRSSALASKATGFPIAKVAAKLAVGYTLDELMNDITGGRTPASFEPSIDYVVTKIPRFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQESLQKALRGLEVGATGFDPKVSLDDPEALTKIRRELKDAGAERIWYIADAFRAGLSVDGVFNLTNIDRWFLVQIEELVRLEEKVAEVGITGLNAEFLRQLKRKGFADARLAKLAGVREAEIRKLRDQYDLHPVYKRVDTCAAEFATDTAYMYSTYEEECEANPSTDREKIMVLGGGPNRIGQGIEFDYCCVHASLALREDGYETIMVNCNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVPVIGTSPDAIDRAEDRERFQHAVERLKLKQPANATVTAIEMAVEKAKEIGYPLVVRPSYVLGGRAMEIVYDEADLRRYFQTAVSVSNDAPVLLDHFLDDAVEVDVDAICDGEMVLIGGIMEHIEQAGVHSGDSACSLPAYTLSQEIQDVMRQQVQKLAFELQVRGLMNVQFAVKNNEVYLIEVNPRAARTVPFVSKATGVPLAKVAARVMAGKSLAEQGVTKEVIPPYYSVKEVVLPFNKFPGVDPLLGPEMRSTGEVMGVGRTFAEAFAKAQLGSNSTMKKHGRALLSVREGDKERVVDLAAKLLKQGFELDATHGTAIVLGEAGINPRLVNKVHEGRPHIQDRIKNGEYTYIINTTSGRRAIEDSRVIRRSALQYKVHYDTTLNGGFATAMALNADATEKVISVQEMHAQIK >CP034958.1|QAS86899.1|3946992_3948141_-|carbamoyl-phosphate-synthase-small-subunit MIKSALLVLEDGTQFHGRAIGATGSAVGEVVFNTSMTGYQEILTDPSYSRQIVTLTYPHIGNVGTNDADEESSQVHAQGLVIRDLPLIASNFRNTEDLSSYLKRHNIVAIADIDTRKLTRLLREKGAQNGCIIAGDNPDAALALEKARAFPGLNGMDLAKEVTTAEAYSWTQGSWTLTGGLPEAKKEDELPFHVVAYDFGAKRNILRMLVDRGCRLTIVPAQTSAEDVLKMNPDGIFLSNGPGDPAPCDYAITAIQKFLETDIPVFGICLGHQLLALASGAKTVKMKFGHHGGNHPVKDVEKNVVMITAQNHGFAVDEATLPANLRVTHKSLFDGTLQGIHRTDKPAFSFQGHPEASPGPHDAAPLFDHFIELIEQYRKTAK >CP034958.1|QAS86900.1|3948596_3949418_-|4-hydroxy-tetrahydrodipicolinate-reductase MHDANIRVAIAGAGGRMGRQLIQAALALEGVQLGAALEREGSSLLGSDAGELAGAGKTGVTVQSSLDAVKDDFDVFIDFTRPEGTLNHLAFCRQHGKGMVIGTTGFDEAGKQAIRDAAADIAIVFAANFSVGVNVMLKLLEKAAKVMGDYTDIEIIEAHHRHKVDAPSGTALAMGEAIAHALDKDLKDCAVYSREGHTGERVPGTIGFATVRAGDIVGEHTAMFADIGERLEITHKASSRMTFANGAVRSALWLSGKEGGLFDMRDVLDLNSL |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CP034958_9 | 9.2|3199854|26|CP034958|PILER-CR | 3199854-3199879 | 26 | CP034958.1 | 2700064-2700089 | 0 | 1.0 |
1. spacer 9.2|3199854|26|CP034958|PILER-CR matches to position: 2700064-2700089, mismatch: 0, identity: 1.0
atttacctctttcaggtaaactttat CRISPR spacer atttacctctttcaggtaaactttat Protospacer **************************
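
The Mismatch and Identity columns in these tables appear to be related by identity = (length - mismatches) / length, rounded to three decimals. The page itself does not state this; it is an assumption inferred from the listed values (for example, 1 mismatch over 26 nt is reported as 0.962). A minimal Python sketch of that arithmetic for ungapped, equal-length alignments follows; the function names `count_mismatches` and `identity` are hypothetical and are not part of any tool named on this page.

```python
def count_mismatches(spacer: str, protospacer: str) -> int:
    """Count differing positions between two equal-length, ungapped sequences."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length (ungapped alignment only)")
    # Compare case-insensitively, position by position.
    return sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))


def identity(spacer: str, protospacer: str, ndigits: int = 3) -> float:
    """Assumed definition: fraction of matching positions, rounded to ndigits."""
    mismatches = count_mismatches(spacer, protospacer)
    return round((len(spacer) - mismatches) / len(spacer), ndigits)


# Spacer 9.2 against its own contig (self-targeting hit) and against the
# NZ_AP023208 protospacer, which differs at a single position.
spacer_9_2 = "atttacctctttcaggtaaactttat"
self_hit = "atttacctctttcaggtaaactttat"
plasmid_hit = "atttacctctttaaggtaaactttat"

print(count_mismatches(spacer_9_2, self_hit), identity(spacer_9_2, self_hit))
print(count_mismatches(spacer_9_2, plasmid_hit), identity(spacer_9_2, plasmid_hit))
```

Entries whose alignment contains gap characters (`-`) are not handled by this sketch; how the database counts gapped columns is not documented here.
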
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
CP034958_8 | 8.1|2906369|40|CP034958|CRISPRCasFinder | 2906369-2906408 | 40 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 47951-47990 | 0 | 1.0 |
CP034958_9 | 9.1|3199807|25|CP034958|PILER-CR | 3199807-3199831 | 25 | NZ_AP023208 | Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence | 44572-44596 | 0 | 1.0 |
CP034958_9 | 9.2|3199854|26|CP034958|PILER-CR | 3199854-3199879 | 26 | NC_049946 | Escherichia virus Lambda_4A7 genome assembly, chromosome: 1 | 37604-37629 | 0 | 1.0 |
CP034958_9 | 9.2|3199854|26|CP034958|PILER-CR | 3199854-3199879 | 26 | LR595861 | Escherichia virus Lambda_4C10 genome assembly, chromosome: 1 | 37887-37912 | 0 | 1.0 |
CP034958_12 | 12.1|3935463|42|CP034958|PILER-CR | 3935463-3935504 | 42 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 141085-141126 | 0 | 1.0 |
CP034958_9 | 9.2|3199854|26|CP034958|PILER-CR | 3199854-3199879 | 26 | NZ_AP023208 | Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence | 44524-44549 | 1 | 0.962 |
CP034958_12 | 12.2|3935522|40|CP034958|PILER-CR | 3935522-3935561 | 40 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 141028-141067 | 1 | 0.975 |
CP034958_7 | 7.1|2185274|38|CP034958|CRISPRCasFinder | 2185274-2185311 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 2 | 0.947 |
CP034958_9 | 9.1|3199807|25|CP034958|PILER-CR | 3199807-3199831 | 25 | NC_049946 | Escherichia virus Lambda_4A7 genome assembly, chromosome: 1 | 37652-37676 | 2 | 0.92 |
CP034958_9 | 9.1|3199807|25|CP034958|PILER-CR | 3199807-3199831 | 25 | LR595861 | Escherichia virus Lambda_4C10 genome assembly, chromosome: 1 | 37935-37959 | 2 | 0.92 |
CP034958_11 | 11.1|3722052|48|CP034958|CRISPRCasFinder | 3722052-3722099 | 48 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 4089-4136 | 3 | 0.938 |
CP034958_11 | 11.1|3722052|48|CP034958|CRISPRCasFinder | 3722052-3722099 | 48 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 4088-4135 | 3 | 0.938 |
CP034958_11 | 11.1|3722052|48|CP034958|CRISPRCasFinder | 3722052-3722099 | 48 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 4088-4135 | 3 | 0.938 |
CP034958_11 | 11.1|3722052|48|CP034958|CRISPRCasFinder | 3722052-3722099 | 48 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 4088-4135 | 3 | 0.938 |
CP034958_9 | 9.2|3199854|26|CP034958|PILER-CR | 3199854-3199879 | 26 | NZ_CP015341 | Lactobacillus brevis strain 100D8 plasmid unnamed3, complete sequence | 35661-35686 | 4 | 0.846 |
CP034958_1 | 1.1|635360|42|CP034958|CRISPRCasFinder | 635360-635401 | 42 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 30214-30255 | 7 | 0.833 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_MG299151 | Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_KY471628 | Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence | 45716-45747 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_MG299131 | Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_KY471629 | Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence | 45716-45747 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_MG299133 | Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_MG299128 | Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_MG299147 | Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence | 51276-51307 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NC_018995 | Escherichia coli plasmid pHUSEC41-1, complete sequence | 29015-29046 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_CP053235 | Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence | 78292-78323 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_CP005999 | Escherichia coli B7A plasmid pEB1, complete sequence | 39563-39594 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | KU932021 | Escherichia coli plasmid pEC3I, complete sequence | 51902-51933 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_CP024154 | Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence | 18560-18591 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NC_011754 | Escherichia coli ED1a plasmid pECOED, complete sequence | 49240-49271 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_CP015141 | Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence | 81434-81465 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_LR213460 | Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3 | 28916-28947 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_MH287044 | Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence | 36182-36213 | 7 | 0.781 |
CP034958_4 | 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052321-1052352 | 32 | NZ_MH618673 | Escherichia coli strain 838B plasmid p838B-R, complete sequence | 32230-32261 | 7 | 0.781 |
CP034958_5 | 5.1|1074857|31|CP034958|CRISPRCasFinder | 1074857-1074887 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62712 | 7 | 0.774 |
CP034958_5 | 5.1|1074857|31|CP034958|CRISPRCasFinder | 1074857-1074887 | 31 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222136 | 7 | 0.774 |
CP034958_5 | 5.1|1074857|31|CP034958|CRISPRCasFinder | 1074857-1074887 | 31 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467672-2467702 | 7 | 0.774 |
CP034958_5 | 5.4|1075040|31|CP034958|CRISPRCasFinder | 1075040-1075070 | 31 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18007 | 7 | 0.774 |
CP034958_5 | 5.7|1075223|31|CP034958|CRISPRCasFinder | 1075223-1075253 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530641-530671 | 7 | 0.774 |
CP034958_1 | 1.1|635360|42|CP034958|CRISPRCasFinder | 635360-635401 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24899-24940 | 8 | 0.81 |
CP034958_4 | 4.5|1052260|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052260-1052291 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1417960-1417991 | 8 | 0.75 |
CP034958_5 | 5.4|1075040|31|CP034958|CRISPRCasFinder | 1075040-1075070 | 31 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97498-97528 | 8 | 0.742 |
CP034958_5 | 5.7|1075223|31|CP034958|CRISPRCasFinder | 1075223-1075253 | 31 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14983 | 8 | 0.742 |
CP034958_5 | 5.7|1075223|31|CP034958|CRISPRCasFinder | 1075223-1075253 | 31 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15013 | 8 | 0.742 |
CP034958_5 | 5.7|1075223|31|CP034958|CRISPRCasFinder | 1075223-1075253 | 31 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3484 | 8 | 0.742 |
CP034958_5 | 5.7|1075223|31|CP034958|CRISPRCasFinder | 1075223-1075253 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148992-149022 | 8 | 0.742 |
CP034958_5 | 5.10|1074857|32|CP034958|PILER-CR,CRT | 1074857-1074888 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62713 | 8 | 0.75 |
CP034958_5 | 5.10|1074857|32|CP034958|PILER-CR,CRT | 1074857-1074888 | 32 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222137 | 8 | 0.75 |
CP034958_5 | 5.10|1074857|32|CP034958|PILER-CR,CRT | 1074857-1074888 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467671-2467702 | 8 | 0.75 |
CP034958_5 | 5.10|1074857|32|CP034958|PILER-CR,CRT | 1074857-1074888 | 32 | NC_008759 | Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence | 12670-12701 | 8 | 0.75 |
CP034958_5 | 5.13|1075040|32|CP034958|PILER-CR,CRT | 1075040-1075071 | 32 | NZ_CP034185 | Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence | 17977-18008 | 8 | 0.75 |
CP034958_5 | 5.13|1075040|32|CP034958|PILER-CR,CRT | 1075040-1075071 | 32 | NZ_CP017753 | Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence | 97497-97528 | 8 | 0.75 |
CP034958_5 | 5.16|1075223|32|CP034958|PILER-CR,CRT | 1075223-1075254 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148991-149022 | 8 | 0.75 |
CP034958_5 | 5.16|1075223|32|CP034958|PILER-CR,CRT | 1075223-1075254 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530640-530671 | 8 | 0.75 |
CP034958_5 | 5.17|1075284|32|CP034958|PILER-CR,CRT | 1075284-1075315 | 32 | NZ_CP006991 | Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence | 532343-532374 | 8 | 0.75 |
CP034958_1 | 1.1|635360|42|CP034958|CRISPRCasFinder | 635360-635401 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24786-24827 | 9 | 0.786 |
CP034958_5 | 5.1|1074857|31|CP034958|CRISPRCasFinder | 1074857-1074887 | 31 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86182-86212 | 9 | 0.71 |
CP034958_5 | 5.2|1074918|31|CP034958|CRISPRCasFinder | 1074918-1074948 | 31 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244716 | 9 | 0.71 |
CP034958_5 | 5.2|1074918|31|CP034958|CRISPRCasFinder | 1074918-1074948 | 31 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78566 | 9 | 0.71 |
CP034958_5 | 5.4|1075040|31|CP034958|CRISPRCasFinder | 1075040-1075070 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405905 | 9 | 0.71 |
CP034958_5 | 5.4|1075040|31|CP034958|CRISPRCasFinder | 1075040-1075070 | 31 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248363-2248393 | 9 | 0.71 |
CP034958_5 | 5.8|1075284|31|CP034958|CRISPRCasFinder | 1075284-1075314 | 31 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35770 | 9 | 0.71 |
CP034958_5 | 5.13|1075040|32|CP034958|PILER-CR,CRT | 1075040-1075071 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 405875-405906 | 9 | 0.719 |
CP034958_5 | 5.16|1075223|32|CP034958|PILER-CR,CRT | 1075223-1075254 | 32 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14984 | 9 | 0.719 |
CP034958_5 | 5.16|1075223|32|CP034958|PILER-CR,CRT | 1075223-1075254 | 32 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15014 | 9 | 0.719 |
CP034958_5 | 5.16|1075223|32|CP034958|PILER-CR,CRT | 1075223-1075254 | 32 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3485 | 9 | 0.719 |
CP034958_5 | 5.17|1075284|32|CP034958|PILER-CR,CRT | 1075284-1075315 | 32 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35771 | 9 | 0.719 |
CP034958_4 | 4.1|1052016|32|CP034958|PILER-CR,CRISPRCasFinder,CRT | 1052016-1052047 | 32 | NZ_CP030933 | Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence | 51062-51093 | 10 | 0.688 |
CP034958_5 | 5.10|1074857|32|CP034958|PILER-CR,CRT | 1074857-1074888 | 32 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86181-86212 | 10 | 0.688 |
CP034958_5 | 5.11|1074918|32|CP034958|PILER-CR,CRT | 1074918-1074949 | 32 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244717 | 10 | 0.688 |
CP034958_5 | 5.11|1074918|32|CP034958|PILER-CR,CRT | 1074918-1074949 | 32 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78567 | 10 | 0.688 |
CP034958_5 | 5.13|1075040|32|CP034958|PILER-CR,CRT | 1075040-1075071 | 32 | NZ_AP022593 | Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence | 2248362-2248393 | 10 | 0.688 |
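
The Spacer_Info identifiers in the table above look like pipe-delimited records of the form `array.spacer|start|length|contig|tools`, with Spacer_region equal to `start` through `start + length - 1`. That reading is an assumption inferred from the rows shown, not a documented format. A short Python sketch that splits such an identifier and recomputes the region is given below; `SpacerInfo` and `parse_spacer_info` are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class SpacerInfo:
    array_index: int         # assumed: CRISPR array number on the contig
    spacer_index: int        # assumed: spacer number within that array
    start: int               # 1-based start coordinate on the contig
    length: int              # spacer length in nucleotides
    contig: str              # contig accession
    tools: Tuple[str, ...]   # prediction tools that reported the spacer

    @property
    def region(self) -> Tuple[int, int]:
        """Inclusive (start, end) coordinates, assuming end = start + length - 1."""
        return (self.start, self.start + self.length - 1)


def parse_spacer_info(text: str) -> SpacerInfo:
    """Split an identifier such as '9.2|3199854|26|CP034958|PILER-CR'."""
    ident, start, length, contig, tools = text.strip().split("|")
    array_index, spacer_index = ident.split(".")
    return SpacerInfo(
        array_index=int(array_index),
        spacer_index=int(spacer_index),
        start=int(start),
        length=int(length),
        contig=contig,
        tools=tuple(tools.split(",")),
    )


info = parse_spacer_info("4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT")
print(info.region, info.tools)
```

Comparing `region` with the Spacer_region column is a quick consistency check when the table is loaded programmatically.
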
1. spacer 8.1|2906369|40|CP034958|CRISPRCasFinder matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: 47951-47990, mismatch: 0, identity: 1.0
gcgctgcgggtcattcttgaaattacccccgctgtgctgt CRISPR spacer gcgctgcgggtcattcttgaaattacccccgctgtgctgt Protospacer ****************************************
2. spacer 9.1|3199807|25|CP034958|PILER-CR matches to NZ_AP023208 (Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence) position: 44572-44596, mismatch: 0, identity: 1.0
gtgcctttacctgatttgggtaaac CRISPR spacer gtgcctttacctgatttgggtaaac Protospacer *************************
3. spacer 9.2|3199854|26|CP034958|PILER-CR matches to NC_049946 (Escherichia virus Lambda_4A7 genome assembly, chromosome: 1) position: 37604-37629, mismatch: 0, identity: 1.0
atttacctctttcaggtaaactttat CRISPR spacer atttacctctttcaggtaaactttat Protospacer **************************
4. spacer 9.2|3199854|26|CP034958|PILER-CR matches to LR595861 (Escherichia virus Lambda_4C10 genome assembly, chromosome: 1) position: 37887-37912, mismatch: 0, identity: 1.0
atttacctctttcaggtaaactttat CRISPR spacer atttacctctttcaggtaaactttat Protospacer **************************
5. spacer 12.1|3935463|42|CP034958|PILER-CR matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: 141085-141126, mismatch: 0, identity: 1.0
tgtcacacgcagataaatccaactttcaatattgttaagttc CRISPR spacer tgtcacacgcagataaatccaactttcaatattgttaagttc Protospacer ******************************************
6. spacer 9.2|3199854|26|CP034958|PILER-CR matches to NZ_AP023208 (Escherichia coli strain TUM18781 plasmid pMTY18781-3, complete sequence) position: 44524-44549, mismatch: 1, identity: 0.962
atttacctctttcaggtaaactttat CRISPR spacer atttacctctttaaggtaaactttat Protospacer ************ *************
7. spacer 12.2|3935522|40|CP034958|PILER-CR matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: 141028-141067, mismatch: 1, identity: 0.975
catggcgtagcaaaaagaaattttcaatattgctttatgg CRISPR spacer catggcgtagaaaaaagaaattttcaatattgctttatgg Protospacer ********** *****************************
8. spacer 7.1|2185274|38|CP034958|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: 113727-113764, mismatch: 2, identity: 0.947
cggacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *.*******.****************************
9. spacer 9.1|3199807|25|CP034958|PILER-CR matches to NC_049946 (Escherichia virus Lambda_4A7 genome assembly, chromosome: 1) position: 37652-37676, mismatch: 2, identity: 0.92
gtgcctttacctgatttgggtaaac CRISPR spacer acgcctttacctgatttgggtaaac Protospacer ..***********************
10. spacer 9.1|3199807|25|CP034958|PILER-CR matches to LR595861 (Escherichia virus Lambda_4C10 genome assembly, chromosome: 1) position: 37935-37959, mismatch: 2, identity: 0.92
gtgcctttacctgatttgggtaaac CRISPR spacer acgcctttacctgatttgggtaaac Protospacer ..***********************
11. spacer 11.1|3722052|48|CP034958|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: 4089-4136, mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
12. spacer 11.1|3722052|48|CP034958|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: 4088-4135, mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
13. spacer 11.1|3722052|48|CP034958|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: 4088-4135, mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
14. spacer 11.1|3722052|48|CP034958|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: 4088-4135, mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
15. spacer 9.2|3199854|26|CP034958|PILER-CR matches to NZ_CP015341 (Lactobacillus brevis strain 100D8 plasmid unnamed3, complete sequence) position: 35661-35686, mismatch: 4, identity: 0.846
atttacctctttcaggtaaactttat CRISPR spacer aggtatctcattcaggtaaactttat Protospacer * **.*** ****************
16. spacer 1.1|635360|42|CP034958|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: 30214-30255, mismatch: 7, identity: 0.833
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer acaaatgccggatgcggcgtaaacgccttatctggcctacgc Protospacer ***. *.****************.*********.******.
17. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299151 (Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence) position: 51276-51307, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
18. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY471628 (Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence) position: 45716-45747, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
19. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299131 (Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence) position: 51276-51307, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
20. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KY471629 (Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence) position: 45716-45747, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
21. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299133 (Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence) position: 51276-51307, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
22. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299128 (Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence) position: 51276-51307, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
23. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MG299147 (Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence) position: 51276-51307, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
24. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NC_018995 (Escherichia coli plasmid pHUSEC41-1, complete sequence) position: 29015-29046, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
25. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP053235 (Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence) position: 78292-78323, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
26. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP005999 (Escherichia coli B7A plasmid pEB1, complete sequence) position: 39563-39594, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
27. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to KU932021 (Escherichia coli plasmid pEC3I, complete sequence) position: 51902-51933, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
28. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP024154 (Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence) position: 18560-18591, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
29. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NC_011754 (Escherichia coli ED1a plasmid pECOED, complete sequence) position: 49240-49271, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
30. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP015141 (Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence) position: 81434-81465, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
31. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LR213460 (Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3) position: 28916-28947, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
32. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH287044 (Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence) position: 36182-36213, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
33. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_MH618673 (Escherichia coli strain 838B plasmid p838B-R, complete sequence) position: 32230-32261, mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
34. spacer 5.1|1074857|31|CP034958|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: 62682-62712, mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer tccctatcgcaatgccggcagcatccgcaat Protospacer *. *. ****** **** ************
35. spacer 5.1|1074857|31|CP034958|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: 1222106-1222136, mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
36. spacer 5.1|1074857|31|CP034958|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: 2467672-2467702, mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
37. spacer 5.4|1075040|31|CP034958|CRISPRCasFinder matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.774
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer agcgtcaccgacgcgcagggccgctaccaac Protospacer **************** * *******.
38. spacer 5.7|1075223|31|CP034958|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ccgaacaggtggcgaagcaggtgatgggcca Protospacer ******.* **************.. ***
39. spacer 1.1|635360|42|CP034958|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 8, identity: 0.81
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer attgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer *. * ******************.*******.*******.
40. spacer 4.5|1052260|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
tcaacgcgctcagacgttgcgtgagtgaacca CRISPR spacer acaacgcggtcggacgttgcgtgattaccccg Protospacer ******* **.************ *. **.
41. spacer 5.4|1075040|31|CP034958|CRISPRCasFinder matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttca Protospacer ***************** ***** *. ..
42. spacer 5.7|1075223|31|CP034958|CRISPRCasFinder matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
43. spacer 5.7|1075223|31|CP034958|CRISPRCasFinder matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
44. spacer 5.7|1075223|31|CP034958|CRISPRCasFinder matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ttgcgcagctggcgcagcaggtggctgccga Protospacer ..* .*.******* ************ **
45. spacer 5.7|1075223|31|CP034958|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer gggtacggctggcgaaggaggcggctgcgga Protospacer * ************* ***.***** *
46. spacer 5.10|1074857|32|CP034958|PILER-CR,CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer tccctatcgcaatgccggcagcatccgcaatc Protospacer *. *. ****** **** ************.
47. spacer 5.10|1074857|32|CP034958|PILER-CR,CRT matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
48. spacer 5.10|1074857|32|CP034958|PILER-CR,CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
49. spacer 5.10|1074857|32|CP034958|PILER-CR,CRT matches to NC_008759 (Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcg-----caattccgggagcatccgcaatt CRISPR spacer -----cgtgaaactcatttccgggagcatccgcattt Protospacer **.* ** ***************** **
50. spacer 5.13|1075040|32|CP034958|PILER-CR,CRT matches to NZ_CP034185 (Deinococcus sp. S14-83 strain S14-83T plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer agcgtcaccgacgcgcagggccgctaccaact Protospacer **************** * *******.
51. spacer 5.13|1075040|32|CP034958|PILER-CR,CRT matches to NZ_CP017753 (Cupriavidus sp. USMAHM13 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcaccgacgcgcagtcgcgcttcttcaa Protospacer ***************** ***** *. ..*
52. spacer 5.16|1075223|32|CP034958|PILER-CR,CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer gggtacggctggcgaaggaggcggctgcggaa Protospacer * ************* ***.***** * *
53. spacer 5.16|1075223|32|CP034958|PILER-CR,CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ccgaacaggtggcgaagcaggtgatgggccag Protospacer ******.* **************.. *** .
54. spacer 5.17|1075284|32|CP034958|PILER-CR,CRT matches to NZ_CP006991 (Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence) position: , mismatch: 8, identity: 0.75
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer catcatcctcccgcagatgcgctggccgatcc Protospacer *.*.* .******** ******** *****
55. spacer 1.1|635360|42|CP034958|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 9, identity: 0.786
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer gttgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer .. * ******************.*******.*******.
56. spacer 5.1|1074857|31|CP034958|CRISPRCasFinder matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 9, identity: 0.71
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer gctaccgcgcaattcgaggagcatccgctgg Protospacer . *********** .*********** .
57. spacer 5.2|1074918|31|CP034958|CRISPRCasFinder matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer tgaggcaaaatatagattgatttccgaaaat Protospacer .*.********* ******** ****
58. spacer 5.2|1074918|31|CP034958|CRISPRCasFinder matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer acggaaaaattatatattgattttacttctg Protospacer ***** *** ************* .*.
59. spacer 5.4|1075040|31|CP034958|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttca Protospacer ******.********** ***** *. ..
60. spacer 5.4|1075040|31|CP034958|CRISPRCasFinder matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 9, identity: 0.71
cccgtcaccgacgcgcagtggcgctaccgtg CRISPR spacer gacatcaccgacgcccagtggcgcgacgtcc Protospacer *.********** ********* ** .
61. spacer 5.8|1075284|31|CP034958|CRISPRCasFinder matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.71
gtttaccgccccgcagaggcgctggcagatc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcga Protospacer ******.*** *************
62. spacer 5.13|1075040|32|CP034958|PILER-CR,CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.719
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacgtcactgacgcgcagtcgcgcttcttcaa Protospacer ******.********** ***** *. ..*
63. spacer 5.16|1075223|32|CP034958|PILER-CR,CRT matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
64. spacer 5.16|1075223|32|CP034958|PILER-CR,CRT matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
65. spacer 5.16|1075223|32|CP034958|PILER-CR,CRT matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ttgcgcagctggcgcagcaggtggctgccgag Protospacer ..* .*.******* ************ ** .
66. spacer 5.17|1075284|32|CP034958|PILER-CR,CRT matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.719
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcgac Protospacer ******.*** ************* *
67. spacer 4.1|1052016|32|CP034958|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP030933 (Enterococcus gilvus strain CR1 plasmid pCR1A, complete sequence) position: , mismatch: 10, identity: 0.688
tccacgctgtaacggccatcattaagtttagt CRISPR spacer ccgctgctgtgacgcccatcattaagttactc Protospacer .* .*****.*** ************* .
68. spacer 5.10|1074857|32|CP034958|PILER-CR,CRT matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 10, identity: 0.688
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer gctaccgcgcaattcgaggagcatccgctggg Protospacer . *********** .*********** .
69. spacer 5.11|1074918|32|CP034958|PILER-CR,CRT matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer tgaggcaaaatatagattgatttccgaaaata Protospacer .*.********* ******** ****
70. spacer 5.11|1074918|32|CP034958|PILER-CR,CRT matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer acggaaaaattatatattgattttacttctgg Protospacer ***** *** ************* .*.
71. spacer 5.13|1075040|32|CP034958|PILER-CR,CRT matches to NZ_AP022593 (Mycolicibacterium arabiense strain JCM 18538 plasmid pJCM18538, complete sequence) position: , mismatch: 10, identity: 0.688
cccgtcaccgacgcgcagtggcgctaccgtga CRISPR spacer gacatcaccgacgcccagtggcgcgacgtccc Protospacer *.********** ********* ** .
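The hit lines above follow a regular layout, and the reported identity values are consistent with (spacer length - mismatches) / spacer length, rounded to three decimals (for example, 32 nt with 7 mismatches gives 0.781). The report itself does not state this formula, so the relation below is an assumption inferred from the listed numbers. Reading the second and third pipe-separated fields as the start coordinate and the spacer length is likewise an assumption. The following is a minimal Python sketch for parsing one hit line and checking that relation; the regular expression, function names, and field labels are hypothetical and not part of the source database.

```python
import re

# Hypothetical pattern for one hit line of the report above. The group names
# (spacer_id, start, length, ...) are illustrative labels, not names defined
# by the source database.
HIT_RE = re.compile(
    r"spacer (?P<spacer_id>[\d.]+)\|(?P<start>\d+)\|(?P<length>\d+)\|"
    r"(?P<contig>[^|]+)\|(?P<tools>\S+) matches to (?P<target>\S+) .*"
    r"mismatch: (?P<mismatch>\d+), identity: (?P<identity>[\d.]+)"
)


def parse_hit(line):
    """Extract the fields from one 'spacer ... matches to ...' line."""
    m = HIT_RE.search(line)
    if m is None:
        raise ValueError("not a spacer hit line")
    hit = m.groupdict()
    for key in ("start", "length", "mismatch"):
        hit[key] = int(hit[key])
    hit["identity"] = float(hit["identity"])
    hit["tools"] = hit["tools"].split(",")  # detection tools, comma-separated
    return hit


def expected_identity(length, mismatch):
    """Assumed relation: fraction of matching spacer positions, 3 decimals."""
    return round((length - mismatch) / length, 3)


line = ("30. spacer 4.6|1052321|32|CP034958|PILER-CR,CRISPRCasFinder,CRT "
        "matches to NZ_CP015141 (Escherichia coli strain Ecol_732 plasmid "
        "pEC732_3, complete sequence) position: , mismatch: 7, identity: 0.781")
hit = parse_hit(line)
print(hit["spacer_id"], hit["length"], hit["mismatch"], hit["identity"])
print(expected_identity(hit["length"], hit["mismatch"]) == hit["identity"])
```

A check like this can flag hit lines whose reported identity does not agree with the mismatch count, which is useful when filtering hits by a mismatch threshold.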
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1082830 : 1096013
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP034958|1082830:1096013|DBSCAN-SWA TATGCGCATATTGCTGAGTAATGATGACGGGGTACATGCACCCGGTATACAAACGCTGGCGAAAGCCTTGCGTGAGTTTGCTGACGTTCAGGTGGTCGCCCCCGATCGTAACCGCAGCGGCGCTTCAAATTCTCTGACACTGGAATCCTCCCTGCGCACGTTTACCTTTGAAAATGGTGATATTGCTGTGCAAATGGGAACCCCGACCGATTGCGTCTATCTTGGCGTGAATGCTCTGATGCGTCCGCGCCCGGACATTGTTGTGTCCGGAATTAACGCCGGGCCGAATCTGGGGGATGATGTTATTTATTCCGGTACGGTAGCCGCCGCGATGGAAGGCCGTCATTTAGGTTTTCCGGCGCTTGCCGTCTCGCTTGACGGGCATAAACATTACGACACTGCCGCGGCGGTAATCTGTTCAATTTTGCGCGCACTGTGTAAAGAGCCGCTGCGCACCGGGCGTATTCTTAATATTAACGTTCCGGATTTACCCTTGGATCAAATCAAAGGTATTCGCGTGACGCGCTGCGGTACACGACATCCGGCAGATCAGGTGATCCCGCAGCAAGATCCGCGCGGCAATACGCTGTACTGGATTGGCCCGCCGGGCGGTAAATGTGATGCTGGTCCGGGGACCGATTTTGCTGCGGTAGATGAGGGCTATGTCTCCATCACGCCGCTGCATGTGGATTTAACTGCGCATAGCGCGCAAGATGTGGTTTCAGACTGGTTAAACAGCGTGGGAGTTGGCACGCAATGGTAAGCAGACGCGTACAAGCACTTCTGGATCAATTACGTGCGCAAGGTATTCAGGATGAGCAGGTGCTGAATGCACTTGCCGCCGTGCCGCGTGAAAAATTCGTTGATGAAGCGTTTGAACAAAAAGCCTGGGACAATATCGCTTTGCCGATAGGTCAGGGGCAGACAATTTCGCAGCCATATATGGTGGCGCGAATGACCGAATTACTCGAGCTGACGCCGCAGTCGCGGGTGCTGGAAATTGGCACCGGTTCGGGATATCAAACGGCAATCCTGGCGCATCTTGTCCAGCATGTTTGCTCGGTTGAACGGATTAAAGGCTTGCAGTGGCAGGCACGTCGCCGCCTGAAAAATCTTGATTTACATAATGTTTCAACCCGTCATGGCGATGGATGGCAAGGTTGGCAGGCACGTGCGCCGTTTGACGCTATCATTGTTACGGCGGCACCGCCGGAAATTCCAACTGCGCTAATGACGCAGCTGGACGAAGGCGGGATTCTCGTCTTACCCGTAGGGGAGGAGCACCAGTATTTGAAACGGGTGCGTCGTCGGGGAGGCGAATTTATTATCGATACCGTGGAGGCCGTGCGCTTTGTCCCTTTAGTGAAGGGTGAGCTGGCTTAAAACGTGAGGAAATACCTGGATTTTTCCTGGTTATTTTGCCGCAGGTCAGCGTATCGTGAACATCTTTTCCAGTGTTCAGTAGGGTGCCTTGCACGGTAATTATGTCACTGGTTATTAACCAATTTTTCCTGGGGGATAAATGAGCGCGGGAAGCCCAAAATTCACCGTTCGCCGCATTGCGGCTTTGTCACTGGTTTCGCTATGGCTGGCAGGCTGTTCTGACACTTCAAATCCACCGGCACCGGTCAGCTCCGTTAATGGCAATGCGCCTGCAAATACTAATTCTGGTATGTTGATTACGCCGCCGCCGAAAATGGGGACGACGTCTACAGCGCAGCAACCGCAAATTCAGCCGGTGCAGCAGCCACAAATTCAGGCTACTCAACAACCGCAAATCCAGCCAGTGCAGCCAGTAGCTCAGCAGCCGGTACAGATGGAAAACGGACGCATCGTCTATAACCGTCAGTATGGGAACATTCCGAAAGGCAGTTATAGCGGCAGTACCTATACCGTGAAAAAAGGCGACACACTTTTCTATATCGC
CTGGATTACTGGCAACGATTTCCGTGACCTTGCTCAGCGCAACAATATTCAGGCACCATACGCGCTGAACGTTGGTCAGACCTTGCAGGTGGGTAATGCTTCCGGTACGCCAATCACTGGCGGAAATGCCATTACCCAGGCCGACGCAGCAGAGCAAGGAGTTGTGATCAAGCCTGCACAAAATTCCACCGTTGCTGTTGCGTCGCAACCGACAATTACGTATTCTGAGTCTTCGGGTGAACAGAGTGCTAACAAAATGTTGCCGAACAACAAGCCAACTGCGACCACGGTCACAGCGCCTGTAACGGTACCAACAGCAAGCACAACCGAGCCGACTGTCAGCAGTACATCAACCAGTACGCCTATCTCCACCTGGCGCTGGCCGACTGAGGGCAAAGTGATCGAAACCTTTGGCGCTTCTGAGGGGGGCAACAAGGGGATTGATATCGCAGGCAGCAAAGGACAGGCAATTATCGCGACCGCAGATGGCCGCGTTGTTTATGCTGGTAACGCGCTGCGCGGCTACGGTAATCTGATTATCATCAAACATAATGATGATTACCTGAGTGCCTACGCCCATAACGACACAATGCTGGTCCGGGAACAACAAGAAGTTAAGGCGGGGCAAAAAATAGCGACCATGGGTAGCACCGGAACCAGTTCAACACGCTTGCATTTTGAAATTCGTTACAAGGGGAAATCCGTAAACCCGCTGCGTTATTTGCCGCAGCGATAAATCGACGGAACCAGGCTTTTGCTTGAATGTTCCGTCAAGGGATCACGGGTAGGAGCCACCTTATGAGTCAGAATACGCTGAAAGTTCATGATTTAAATGAAGATGCGGAATTTGATGAGAACGGAGTTGAGGTTTTTGACGAAAAGGCCTTAGTAGAAGAGGAACCCAGTGATAACGATTTGGCCGAAGAGGAACTGTTATCGCAGGGAGCCACACAGCGTGTGTTGGACGCGACTCAGCTTTACCTTGGTGAGATTGGTTATTCACCACTGTTAACGGCCGAAGAAGAAGTTTATTTTGCGCGTCGCGCACTGCGTGGAGATGTCGCCTCTCGCCGCCGGATGATCGAGAGTAACTTGCGTCTGGTGGTAAAAATTGCCCGCCGTTATGGCAATCGTGGTCTGGCGTTGCTGGACCTTATCGAAGAGGGCAACCTGGGGCTGATCCGCGCGGTAGAGAAGTTTGACCCGGAACGTGGTTTCCGCTTCTCAACATACGCAACCTGGTGGATTCGCCAGACGATTGAACGGGCGATTATGAACCAAACCCGTACTATTCGTTTGCCGATTCACATCGTAAAGGAGCTGAACGTTTACCTGCGAACCGCACGTGAGTTGTCCCATAAGCTGGACCATGAACCAAGTGCGGAAGAGATCGCAGAGCAACTGGATAAGCCAGTTGATGACGTCAGCCGTATGCTTCGTCTTAACGAGCGCATTACCTCGGTAGACACCCCGCTGGGTGGTGATTCCGAAAAAGCGTTGCTGGACATCCTGGCCGATGAAAAAGAGAACGGTCCGGAAGATACCACGCAAGATGACGATATGAAGCAGAGCATCGTCAAATGGCTGTTCGAGCTGAACGCCAAACAGCGTGAAGTGCTGGCACGTCGATTCGGTTTGCTGGGGTACGAAGCGGCAACACTGGAAGATGTAGGTCGTGAAATTGGCCTCACCCGTGAACGTGTTCGCCAGATTCAGGTTGAAGGCCTGCGCCGTTTGCGTGAAATCCTGCAAACGCAGGGGCTGAATATCGAAGCGCTGTTCCGCGAGTAAGTAAGCATCTGTCAGAAAGGCCAGTCTCAAGCGAGGCTGGCCTTTTCTGTGCACAATAAAAGGTCCGATGCCCATCGGACCTTTTTATTAAGGTCAAATTACCGCCCATACGCACCAGGTAATTAAGAATCCGGTAAAACCGAGAATGGTCGTTAACACTGTCCAGGTTTTCAGACCGTCTGCTACCGACAACCCCAGATATTTGGTCA
CAATCCAGAACCCTGAGTCATTAATATGTGACGCGCCAAGCCCACCAAAGCAGGCTGCCAGCGTCACCAATACGCACTGAATCGGATTCAATCCCATCACCGCTTCTGAGAGTAACCCGCCGGTTGTCAGGATTGCTACGGTTGCTGACCCCTGCGATGCACGCAGAGCCAGTGAAATAATAAATGCGGCTGGTAACAGAGGCAGGTCAATCATTTGTAGCATATTGGCAAGGGCTTTGCCGACGCCCGATTCCACCAGCACTTTGCCAAATACCCCTCCAGCACCAGTAACCAAAATCACTACCGCCGCAGTGGGAAGGGCTGAGCCCATAATGTCGCTGGTGTGTTGTAAGCTCCAGCCGCGACGTAAAGCCAATAACCAGAATGCCAGCACCAGCGCAATCATTAGAGCTACCATTGGTGAGCCGATCAGCTGTAGCGTACCAAGCAGGGGATGCGAAGGCGGCATCAGTGTTGCGGAAACCGTACCCGCCATGATAATCGCGATAGGAATAACAATTAGCGAGGTGACCAGCGCGACGCCCGGTGGATTTATTTTATCGCTTAATTTTGTCGCGCCTTCCTCACTGGCCGGAGCCAGTTGCATCTGTTCCAGTACTTCCACCGACATCGCATATTGGTGCTTATTGATTATTTTCGCTGCAAAGTAGCCAACAATCCCTACGGGAATAGAAATCGCAATACCGATGATGGTTAGCCAGCCGATGTCTGCGTGGAGTAACCCCGCTGCGGCGACAGGGCCTGGATGCGGCGGTACCGCCACATGTACAGTTAGCATGATCCCAGCGACAGGCAGGCCAAATTTGAGTGGCGATATTTTGGCAACCTTGGCAAAACCGTAAATGATTGGCGCAAGAATAATAAAGCCGACATCAAAGAAGACGGGAATACCGAGGAAGAACGCTGCCAGAGTCAGCGCAGCGATAGTTCGTTTGTCACCTAACTTGCGACTGAAATAATTAGCCAGTGACTCTGCACCACCAGAGTGTTCGATCATACGCCCCAGCATAGCGCCCAGACCAATAATAATAGTGACGGAACCAAGCACACCGCCCATCCCGGCGATCATCACTTTACCCACTTCGCCCGCCGGTATACCCGCCGCAAGTGCGACTAACAGGCTGACGAGGAGCAGAGCAACGAATGGTTGTACCTTTGCCTTGATGACCAGCAGCAACAGCATGATTACGCCAGTTAACGCAATGCATAACAATGTAATTGTGGACATGGGAAACCCTGTCTGAAAGTTATAGTTAACCTACCCCATCCGTAGATGGGGGGATGTATGGGTACGTTGTAATTAGGGATTTAACGAATTAGCGCCAGGCGTCAAACCAGCCAAGCCCTTCTTCGGTGAGGCCACGAGGTTTATATTCACAACCGATCCAGCCCTGATATCCCACCTCATCGAACAGGCGGAACAGCCACGGATAGTTGATTTCTCCATCGTCCGGTTCATGTCGATCAGGTAGTCCGGCAATTTGTACGTGCGCATATTTCCCGGCGTAGTCGCGGATTAAATGCGTCAGGTTGCCATCTACTTTTTGCGCATGAAAAGTATCTAGTTGAATAAACACGTTATCTCGCGCAACCTCTTCAACAATAGCCAGTGCCTGATACTGGCTGGAGAAGAGATAATGAGGCTTAACGTCGGGGCTGAGTGCTTCAACTAATATTCGCTTGCCGTGTAGCGCAAAGCGGTCGGCAGCGTAGCGGAGATTATCGATAAATACTGCCCGGTACCGTTCAGCGTCTTCGCCAGCGGGCACGACGCCTGCCATCACATGGACTTGTTCACAATTGAGCGCCAATGCATATTCCAGTGCCAGGTCGATGTCTGCGTGTGCTTCGTGCTCACGTCCGGGAAGGGCGGATAATCCCCATTCCCCCGCATTAATATCTCCGGGAGCGGTATTGAACAGCGCCAGTGTCAGATGGTTTTGCTCCAGTTGCTTTTGGATTTGCAGGGTGGAGTAGTC
ATAGGGAAACAGAAATTCCACAGCATCGAACCCGGCTTTTCGCGCTGCGGCGAAGCGTTCAATAAAAGGCACTTCGGTGAACATCATGGATAAATTAGCTGCAAAACGAGGCATTGCATTAACTCCTTAATTCCGCAATTTCACCTGCGGTCAGATAACGGATCGGGCGGTCACCGAGAATAAAAATCAGCTTTGCCGTTTCCTCCAGCTCTTCCATATTGTTGGCGGCTTCTTGCAGGCTTTCACCGCAAACCACTGGGCCATGATTTGCCAGTAAAAAAGCCTGATTGTCTGCTGCCAGTTCCGCCAGATCCTGTGCGATGCGTTTATCGCCCGGTCGGTAATAAGGCACCAGCGGGACATTTCCCATCCGCATCACCACGTATGGTGTGAACGGACGAATAACGTTGCTGCTGTCCAGCCCTTGCAGGCAGGAAAGCGCCGTCGACCATGTGCTGTGCAAATGCACCACCGCTTTACAGCGCGGATTGTTGCGATACAGCGCCAGATGAAAGAGCACCTCTTTCGAGGGTTTGTCACCACTTAACCATTCGCCATCCGCGGCGACTTTGGAAAGCCGCTGCGGATCGAGATTGCCCAGGCATGAACCTGTCGGTGTCGCCAGTAAATTCCCGTCAGGTAAAAGCAGCGACAGATTGCCAGCCGAACCGGTTGCATAGCCGCGCTGAAAGAATGAACTGGCAATCCGCGTCATCTCCTCTCGCAAAGACTGCTCTACTTTTGCGAAATCGCTCATGATAAAAACTCTCTTTGGGCTCGTGAAAAAAAGGCTTCATCACCGAAGTTGCCAGATTTAAGGGCGAGTGAGACAGGCTTATCCAGTGCGTTTACCCACGGCACGCCGGGGGAAATGGTTGGGCCAATATGAAACCCTTTTATGCCCAGGCTCTGTGTGACTACGCCGGAGGTCTCACCGCCTGCGACAATAAAGCGTGTCACGCCTTCCGCTGCTAACCGCGCCGCTAGTTGAGAAAACAGAGTTTCTACTGCCTGACTGGCTTTTTGTGCACCGTATTGCTGTTGAATTGCTGCCAATGCGTCAGTGCTGGCGGTGGCAAAAACCAGTGGAGCAAGTACACTTTCCTGGCCCAGAACCCACTCTGCCAGTTCGTGTGCATAAGCGGCCAGAGTTTCGGTTGAGAGGCAGCGTGCCACATCAACTTCACGGGCTGGTGCAATTTGACGGTAATGTGCCACCTGGCGGTTGGTCATTTGAGAGCATGAACCGGAGAGCACTACGCCGCGCCCAGCGAGCGGATGCCCTGCTTCGCGAGCCTGGTTACCGTTTTCTTGCGCCCACTGCCGGGCCAGGCCAATCGCCAGACCAGAACCGCCCGTTACCAGTGGGGCATCGCGCAAGGCTTCTCCCTGAATTTCCAGATGGTGTTCGGTCAGCGCATCAAGCACCGCGTAGCGGTAGCCCTCTTGCTGTAAGCGAGCCAGCTCTTGACGAACGGCATCCACACCTTGTTCGAAAACATGTGCCGAAACGACGCCGCAGCGCCCTGTGGATTGCGCTTCAACCAGACGGGGAAGATAGCTGTCGGTCATGGGATTTACCGGGTGATGGCGCATCCCGGATTCGGCCAGCAGTTGATTCATTACGAACAAATACCCCTGATAAACCGTACGTCCGTTGACCGGCAGGGCCGGAGAGAAGACCGTAAACGGCGTGTCGAGAGCATCCATTAATGCATCGGTAACCGGGCCGATATTACCTTTCGCCGTACTGTCGAAAGTAGAGCAGTATTTGAAATAGATCTGTTTGCAACCTTGCTGTTGCAACCAGCTCAGAGCCGCCAGCGATTGCTGTGTGGCTTCAACCACCGGACAGGAGCGCGTTTTCAGGCTGATCACCAGTGCGTCGATTGCTTCCGGCATTTTACCTGTTGGAACACCGTTAATTTGTACCGTTGGTAGACCGTTTTCCACCAGAAAACTGGCGATATCCGTCGCGCCGGTAAAATCATCGGCGATAA
CGCCAATCTTGATCATGATTTCGCTCCCGGTAGAGTGATGCCAGAGAAAATCTTGATAACTGCGCTATCGTCTTCTTTCCCGTAACCTGCGTTACTGGCGCTGGTGAACATATTCAATGCTGTTGAGGCCAATGGCAGCGGGAAGTGCAGGGCTTTGGCTGTATCGGCAACCAGACCAAGATCCTTAACAAAAATATCGACGGCTGAATGCGGGGTGTAATCGCCATCCACCACATGACGCATCCGGTTTTCGAACATCCAGGAATTTCCGGCGGCATTGGTCACGACGTCATACATCACATCCAGCGGGATCCCCGCACGGGCTGCAAGTGCCATCGCTTCGGCTCCGGCAGCAATATGTACGCCCGCTAACAACTGGTGAATAATTTTTACGGTCGAACCTAGTCCCGGTTCTGCACCTATGCGATAAACTTTTCCGGCAACGGCTTCCAGCACGGGTGCCAGTCGTTCAAAGGCAATATCGCTACCGGAGGCCATGACAGTCATTTCACCGTTAGCGGCTTTTACTGCACCACCAGAAACTGGCGCATCCAGCATTTCCAGATCGAATCCAGCCAGAGCGGTAGCAATTTCTTGCGCATCAGCACTAGCGATAGTGGAAGAAACCATTACTGCCGTACCGGGTTTCAGATGTTGTGCAACGCCTGTTTCACCAAACAGCACCTGTTTAACCTGGGCCGCATTGACCACCAGCACCAGCAGAGCGTCCAGTTTTTCGGCAAACGTCGCGGCGTTATCAGAAACCCCGCAAGCACCTGCCTCTTTCAACGTAGCGCAGGCATTGCTGTTCAGGTCTGCGCCCCAGGTAGAAAGACCTGCGCGGACACATGACAGTGCTGCTCCCATTCCCATTGACCCTAAGCCAACGATACCGACATGAAACTCAGATCCCGTTTTCATCTGCTCTCCTTGTTAATTTAAGTGATATTTTGTTTGATATTGTGAATATAAGCGCTGGAAGATAACGATATGGTGAGCTGATTCACATAAATTAACATTGTGTGTTATTTTATGTGAACTAAGCGTTAGTTGCGCCGCGCACGTTTCGCAGGCAAATAGCGTAGAATGTCAGCAGGACAAAGGAAGGAGCAAAAGTTGATACCCGTAGAGCGTCGCCAAATCATCCTTGAGATGGTAGCTGAAAAAGGCATTGTCAGTATTGCTGAACTAACGGACAGAATGAATGTGTCACATATGACCATTCGTCGGGATTTACAAAAACTGGAGCAGCAGGGAGCCGTTGTGCTGGTGTCCGGAGGCGTCCAGTCTCCGGGACGCGTGGCGCATGAACCTTCTCATCAGGTAAAAACTGCGCTGGCAATGACGCAAAAAGCGGCTATTGGCAAGCTGGCGGCAAGTCTTGTTCAGCCGGGAAGCTGTATCTATCTGGATGCGGGAACGACCACGTTAGCGATAGCACAGCATCTGATTCACATGGAGTCACTGACTGTGGTCACAAACGATTTCGTTATTGCGAACTACTTGCTCGACAACAGTAATTGCACAATTATTCACACTGGCGGTGCAGTGTGTCGGGAAAACCGTTCCTGTGTCGGGGAAGCCGCTGCGACCATGCTGCGCAGCCTGATGATTGATCAGGCTTTTATTTCTGCATCGTCATGGAGTGTGCGGGGGATTTCTACGCCAGCGGAAGATAAAGTCACGGTGAAACGGGCGATTGCCAGTGCCAGCCGCCAGCGAGTTTTGGTCTGTGATGCGACGAAATATGGTCAGGTGGCGACATGGCTGGCGTTACCATTAAGCGAGTTTGATCAGATTATCACAGACGACGGTTTGCCGGAGAGTGCCAGTCGCGCGCTGGCGAAGCAGGATCTCTCTTTGCTGGTAGCGAAAAATGAATAATGGCCTGCAATAACATTTGGTTACTCATGCTTCACAGAAGAAGCATGAGACTACTTTATTTTATAAAATGACAGCCGCCCGCTTTTCGGCGTGCCGGTATCAATATAAATCTGGTT
AGCGAACGTCTGAATGTTATCAAACATCATATGTCCAAATATAAAATAATCAGCGCCGTTTATTTGTTGTAACTCGCCATTAAGCGATTTCTGCACACGATCAACAGGCCAGAGTAATTCGCTCTCCGCTATTTCTTTACCAAAGAGATATTCACTCCCCGGATAATCTGCATGTGCGATGACATATTTTATGTTGTCGTTAGTGATTTCAATAATATGTGGAAGGTGATGGAATTTCAGCAACAGATCTATTGCCTCTTGTTGCTCTGAATCATTTAAATCGAAAAACCAGTCACCACCGCTGGCAAGCCACATATTGCCATCGCCAGTTTCGAATGCATCCAACGCCATCGCTTCGTGGTTGCCTTTAACCGACGTAAACCAGGGTTGGTTTAGCAGGCGCAGCACGTTAAGACTCTTCGGCCCACGATCAATATTATCGCCTGTGGAAATAAGTAAGTCGGTTTCAGGGTAAAAAGAGAGTTGATGTAAACGGGATTGTAATAACTGATATTCACCATGAATATCACCAACGACCCATATATGGCGATAGTGATGGGCATTGATTTTTTGATAGCGTGTAGATGGCATGGTTTTACCCTGTAAAATAAGCTTTCCTATTATACAGGGTATTTTTATTTGATTCGTCAGTTGTCGTTAATATTCCCGATAGCAAAAGACTATCGGGAATTGTTATTACACCAGACTCTTCAAGCGATAAATCCACTCCAGCGCCTGACGCGGAGTCAGTGAATCCGGGTCGAGATTTTCCAGTGCTTCGACCGCAGGCGAAGTTTCTTCTGGCACTGACAGCAAAGACATCTGCGTACCATCCACTTGCGTCGCGGCGGCGTTCGGCGAAATGCTTTCCAGCTCACGCAGTTTTTGCCGTGCGCGCTTAATCACCTCTTTTGGCACGCCCGCCAGAGCTGCAACCGCCAGGCCGTAGCTTTTGCTCGCCGCGCCATCCTGCACGCTATGCATAAAGGCAATGGTGTCGCCGTGCTCCAGCGCATCGAGATGCACGTTGGCGACGCCTTCCATTTTCTCCGGTAACTGGGTCAGCTCGAAATAGTGGGTAGCAAATAACGTCAATGCCTTAATCTTATTCGCCAGATTTTCCGCGCACGCCCACGCCAGCGACAGACCATCGTAGGTGGACGTTCCACGCCCGATCTCATCCATTAACACCAGACTGTATTCGGTGGCGTTATGTAAAATATTGGCGGTTTCAGTCATCTCCACCATAAAGGTTGAGCGCCCGGAAGCCAGGTCATCTGCCGCGCCTACGCGGGTAAAGATGCGATCGATAGGTCCAATCTCGACTTTTTGTGCCGGTACATAGCTGCCGATGTAGGCCATCAGCGCAATCAGTGCGGTCTGGCGCATATAGGTACTTTTACCGCCCATGTTCGGGCCGGTAATAATCAACATACGGCGCTGCGGCGACAGATTCAGCGGGTTAGCGATAAATGGCTCATTCAGTACTTGTTCAACTACCGGATGGCGACCTTCGGTAATGCGAATGCCCGGTTTATCAATGAAGGTCGGGCAGGTGTAGTTCAGGGTATAGGCCCGTTCCGCCAGGTTAACCAGCACGTCGAGTTCCGCCAGCGCGCTCGCGCTCTGTTGCAACGCTTCCAGATGCGGCAACAGCAGGTCGAACAGCTCTTCATAAAGCTGTTTTTCCAGTGCCAGTGCTTTGCCTTTTGAGGTGAGAACTTTATCTTCGTACTCTTTTAGCTCTGGAATGATGTAGCGCTCGGCGTTTTTCAGCGTCTGGCGACGCATGTAGTTGATGGGTGCCAGATGGCTTTGCCCACGGCTGATTTGAATGTAGTAGCCGTGCACCGCATTAAAGCCAACTTTCAGCGTGTCCAGGCCGGTACGTTCACGCTCGCGGACTTCCAGACGCTCCAGATAATCGGTCGCGCCGTCAGCCAGCGCGCGCCACTCATCCAGCTCTTCGTTATAGCCCGATGCGATAACACCACCGTCGC
GTACCAGCACCGGCGGTGTGTCGATGATTGCTCGCTCCAGCAGATCGCGCAGCTCGGCAAACTCGCCCATCTTCTCACGTAGCGCCTGTACCGGTGCACTATCGACAGTTTCTAACTGCGCACGCAGCTCCGGCAGTTGCTGGAAAGCGTGGCGCATACGGGCCAGATCGCGTGGGCGAGCAGTTCGTAAAGCCAGACGTGCCAGAATACGTTCCAGGTCGCCGACCTGACGCAGTACCGGCTGCAACTCGGCGGTGAAATCCTGCAATGCGCCAATAGTTTGCTGGCGCTCAAGCAACACGCGGGTATCGCGCACTGGCATATGCAGCCAGCGTTTCAGCATACGACTACCCATCGGCGTGACGGTGCAGTCGAGCACAGAAGCCAGCGTATTTTCCGCACCGCCCGCCAGGTTCTGAGTAATTTCCAGGTTACGACGTGTCGCGGCATCCATAATGATGCTGTCCTGCTCACGTTCCATGGTGATAGAACGAATATGCGGCAGGGTCGTGCGTTGGGTATCTTTCGCATACTGCAACAGACAACCGGCAGCACAAAGTCCGCGCGGCGCGTTCTCGACGCCAAAACCGACCAGATCGCGGGTGCCAAATTGCAGATTCAACTGCTGGCGCGCGGTGTCGATTTCAAACTCCCACAGCGGGCGACGGCGCAGGCCGCGACGGCCTTCAATCAGCGACATCTCGGCGAAATCTTCTGCATACAGCAGTTCCGCCGGATTAGTGCGTTGCAGCTCTGCCGCCATCGTTTCGCGGTCGGCCGGTTCGCTCAGACGAAAACGCCCGGAGCTGATATCCAGCGTCGCGTAGCCGAAACCTTTGCTGTCCTGCCAGATAGCCGCCAGCAGGTTGTCCTGACGCTCCTGTAACAGGGCTTCATCGCTGATGGTGCCTGGCGTAACGATACGCACAACTTTGCGCTCAACCGGACCTTTGCTGGTCGCCGGATCGCCAATTTGTTCGCAGATGGCAACGGACTCTCCCTGATTCACCAGTTTGGCGAGATAGTTTTCCACCGCATGGTAGGGAATCCCCGCCATCGGGATCGGCTCTCCCGCCGAAGCACCGCGTTTGGTCAGTGAAATATCCAGCAGTTGCGACGCGCGTTTTGCGTCGTCATAAAACAGTTCATAAAAATCACCCATCCGGTAAAACAGCAGGATCTCGGGATGCTGGGCTTTCAGCCTGAGATACTGCTGCATCATGGGCGTATGGGCGTCGAAATTTTCTATTGCACTCAT
Protein sequences of DBSCAN-SWA_1 >CP034958|1082830:1096013|1084351_1085491_+|QAS84423.1|DBSCAN-SWA MSAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMGTTSTAQQPQIQPVQQPQIQATQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSGSTYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVGQTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPTASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRVVYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR >CP034958|1082830:1096013|1085553_1086546_+|QAS84424.1|DBSCAN-SWA MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILADEKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQGLNIEALFRE >CP034958|1082830:1096013|1088873_1089512_-|QAS84427.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS >CP034958|1082830:1096013|1083585_1084212_+|QAS84422.1|DBSCAN-SWA MVSRRVQALLDQLRAQGIQDEQVLNALAAVPREKFVDEAFEQKAWDNIALPIGQGQTISQPYMVARMTELLELTPQSRVLEIGTGSGYQTAILAHLVQHVCSVERIKGLQWQARRRLKNLDLHNVSTRHGDGWQGWQARAPFDAIIVTAAPPEIPTALMTQLDEGGILVLPVGEEHQYLKRVRRRGGEFIIDTVEAVRFVPLVKGELA >CP034958|1082830:1096013|1090767_1091676_-|QAS84429.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNAAQVKQVLFGETGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGAEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS >CP034958|1082830:1096013|1091871_1092639_+|QAS84430.1|DBSCAN-SWA 
MIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGSCIYLDAGTTTLAIAQHLIHMESLTVVTNDFVIANYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALAKQDLSLLVAKNE >CP034958|1082830:1096013|1088092_1088869_-|QAS84426.1|DBSCAN-SWA MPRFAANLSMMFTEVPFIERFAAARKAGFDAVEFLFPYDYSTLQIQKQLEQNHLTLALFNTAPGDINAGEWGLSALPGREHEAHADIDLALEYALALNCEQVHVMAGVVPAGEDAERYRAVFIDNLRYAADRFALHGKRILVEALSPDVKPHYLFSSQYQALAIVEEVARDNVFIQLDTFHAQKVDGNLTHLIRDYAGKYAHVQIAGLPDRHEPDDGEINYPWLFRLFDEVGYQGWIGCEYKPRGLTEEGLGWFDAWR >CP034958|1082830:1096013|1093451_1096013_-|QAS84432.1|DBSCAN-SWA MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >CP034958|1082830:1096013|1089508_1090771_-|QAS84428.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQAREAGHPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSTETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPGVPWVNALDKPVSLALKSGNFGDEAFFSRAQREFLS 
>CP034958|1082830:1096013|1086639_1088004_-|QAS84425.1|DBSCAN-SWA MSTITLLCIALTGVIMLLLLVIKAKVQPFVALLLVSLLVALAAGIPAGEVGKVMIAGMGGVLGSVTIIIGLGAMLGRMIEHSGGAESLANYFSRKLGDKRTIAALTLAAFFLGIPVFFDVGFIILAPIIYGFAKVAKISPLKFGLPVAGIMLTVHVAVPPHPGPVAAAGLLHADIGWLTIIGIAISIPVGIVGYFAAKIINKHQYAMSVEVLEQMQLAPASEEGATKLSDKINPPGVALVTSLIVIPIAIIMAGTVSATLMPPSHPLLGTLQLIGSPMVALMIALVLAFWLLALRRGWSLQHTSDIMGSALPTAAVVILVTGAGGVFGKVLVESGVGKALANMLQMIDLPLLPAAFIISLALRASQGSATVAILTTGGLLSEAVMGLNPIQCVLVTLAACFGGLGASHINDSGFWIVTKYLGLSVADGLKTWTVLTTILGFTGFLITWCVWAVI >CP034958|1082830:1096013|1092689_1093346_-|QAS84431.1|DBSCAN-SWA MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFYPETDLLISTGDNIDRGPKSLNVLRLLNQPWFTSVKGNHEAMALDAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDNIKYVIAHADYPGSEYLFGKEIAESELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGTPKSGRLSFYKIK >CP034958|1082830:1096013|1082830_1083592_+|QAS84421.1|DBSCAN-SWA MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
12 | Escherichia_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
1708661 : 1718103
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP034958|1708661:1718103|DBSCAN-SWA AATGATTGAATTTAGCCATGTCAGCAAACTGTTTGGCGCACAAAAAGCCGTTAACGATCTCAATCTCAATTTTCAGGAAGGGAGTTTTTCGGTGCTGATTGGCACATCTGGCTCCGGCAAATCCACCACCCTGAAAATGATTAACCGCCTAGTGGAACATGACAGCGGCGTGATCCGCTTTGCCGGAGAAGAAATTCGCTCGCTGCCAGTACTGGAGTTGCGCCGCCGGATGGGCTATGCCATTCAATCTATTGGCCTGTTCCCCCACTGGAGCGTGGCGCAAAACATCGCTACCGTGCCGCAATTGCAAAAATGGTCGCGGGCGCGGATTGACGATCGTATCGACGAATTAATGGCGCTACTGGGGCTGGAGTCAAATTTGCGTGAGCGTTATCCGCATCAGCTTTCCGGTGGTCAGCAGCAACGTGTGGGAGTGGCGCGCGCACTGGCTGCCGATCCGCAAGTCTTACTGATGGATGAACCTTTTGGCGCACTGGACCCGGTAACGCGCGGCGCGTTGCAACAAGAGATGACGCGCATTCACCGTTTGCTGGGGCGCACTATTGTGCTGGTCACTCATGATATTGATGAGGCGCTAAGGCTGGCAGAACATCTGGTATTGATGGATCACGGTGAAGTGGTGCAGCAGGGGAATCCGCTGACGATGCTGACTCGTCCGGCGAATGATTTTGTCCGCCAGTTTTTTGGACGTAGTGAACTGGGTGTGCGCCTGCTTTCGTTACGTAGTGTGGCGGATTACGTGCGTCGCGAAGAACGGGCAGAAGGTGAGGCACTGGCAGAAGAGATGACGCTACGCGATGCGCTCTCCCTGTTTGTCGCGCGGGGATGCGAGGTGCTGCCGGTGGTGAACACGCAGGGCCAGCCTTGCGGCACGCTGCATTTTCAGGATCTGCTGGAGGAGGCGTAAGCGTATGAAGATGTTGCGCGATCCGCTGTTCTGGCTCATTGCTTTGTTTGTGGCACTGATTTTCTGGCTGCCTTACAGCCAGCCGCTGTTTGCTGCCTTGTTCCCACAACTGCCACGACCCGTTTATCAGCAAGAAAGTTTTGCAGCTCTGGCACTGGCTCATTTCTGGCTGGTGGGAATTTCGAGTTTGTTTGCGGTGATCATTGGCACTGGTGCCGGAATTGCTGTCACTCGCCCGTGGGGCGCGGAATTTCGCCCACTGGTGGAAACTATTGCCGCCGTTGGACAGACTTTTCCGCCTGTCGCAGTGCTGGCGATTGCCGTTCCGGTGATCGGCTTTGGTCTGCAACCAGCGATTATCGCCTTGATCCTTTACGGTGTGCTGCCCGTCCTGCAGGCGACACTTGCCGGGCTGGGAGCGATTGATGCCAGCGTGACAGAAGTTGCGAAAGGTATGGGAATGAGTCGTGGTCAGCGACTGCGTAAGGTCGAGCTACCGCTGGCGGCTCCGGTGATTCTGGCGGGCGTGCGAACTTCGGTGATTATCAACATTGGTACGGCGACGATCGCCTCAACGGTAGGGGCCAGCACGCTGGGTACGCCCATCATCATCGGGCTTAGCGGATTTAATACCGCGTATGTGATCCAGGGGGCGTTACTGGTGGCACTGGCGGCGATCATCGCAGACCGCCTGTTTGAAAGGCTGGTGCAGGCGCTTAGCCAGCACGCAAAATAAAGGTATAACCTGCGAGCATGACGCCACCAATTCCGCCTAACGCCATAAACAGGAACAGGGCGATGACCCCAATTTTAGCTATGCGCATATTGCACTCCTTATGTTAACGAAAGGATTGTACAGTAAAGCGCATTTGTTAACGAATCATTAAATGCCGAGTGGGAAAATATCATGGCCTTGTTCTTGCCAACTGGTGAGTTGCTGCTGTTGGGCGGAGGTTCGATTTTCACCGCACCACACCAGCAATGTACGGCCTTCGAAT
AGTTCAGGGCGTAGTTGATTGAGCGAGTGGGCGAGGACATCAATGCGCCATCCTTGTTGACTGGCAATCCAGCCCTCCAGCCACAGACGGGTGGTATCCTGAATATTCCAGCCAACCACCAGCGCATCTTTACCCTGTTTTTTACGTGCCGAAGCCAGACAAATGGCGATGTAGTTGATCAGTACGCCGTCGAGGATCGCCAGCAGCGCCTGGAGAGTCGGTTGTTGGCATTGAAGTCGTCGGCGCAGAGGAATAAACAGATGTGTGGTGAGTGTCTGGGCGGGGTAATCCTGACCGCGCTCTTTGATCCACGTTCGCAGGCTATGCAGATTGCCGCTTTGCAGGTAGGTCAGTAATGTTTCTTGCTGATCGCGCCAGCCGTTCTGCACATCAACATTTTCATTACTGAGCAGCATTTTAACTTTGCTGACCTGCACGCCGTTGTCGATCCAGCGTTTGATCTCGCGGATACGGTCAATATCGGCATCGTTGAACAGTCGATGACCGCCGTCTGTCCGTTGCGGTTTCAGCAACCCGTAACGCCTCTGCCACGCGCGTAACGTGACAGGGTTAATATCACAAAGCAACGCCACTTCACCAATTGTGTAAAGCGCCATCGTCTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCAGTTTTGCGAACCAGGTAGTTTTGCCCGTTTTTTGTGCATCTATAGGGTGATTTTATTTTTGCCAGGCGATTTTGAGTGATCGTACTCACGAATTCTCATTTTTCTGCAAGAGTTCAAAGAAAGTTAAACGCAGGCAATGTATGTTACGCGTTTTAAAGGGAAGTGTGGTTTGCGGGTATGTACGATTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTTTTTTTAGTCATTGCGTGGTTAATGAGTAAAACGCCATTATTCATACCGTTAATGCAGGTCACGGTTCGTCTGCCGCATAAATTTCTCTGCTACATCGTCTTTTCCATCTTCTGCATCATGGGCACCTGGTTTGGGTTGCACATTGACGATTCTATTGCCAATACCCGTGCGATAGGCGCGGTAATGGGCGGCTTACTCGGCGGTCCGGTCGTCGGTGGGCTGGTTGGTCTGACCGGCGGCTTACATCGATATTCGATGGGGGGCATGACCGCGCTAAGTTGCATGATCTCAACCATCGTTGAAGGATTGCTCGGCGGCCTGGTACACAGCATCCTGATCCGTCGCGGGCGCACTGATAAAGTCTTTAACCCCATTACCGCCGGTGCCGTCACGTTCGTCGCTGAAATGGTGCAAATGCTGATCATCCTTGCGATCGCCCGACCTTATGAAGATGCGGTGCGTCTGGTGAGTAATATTGCTGCACCAATGATGGTCACCAATACCGTCGGCGCGGCGCTGTTTATGCGTATATTGCTCGATAAACGCGCGATGTTTGAAAAATACACTTCTGCTTTTTCTGCCACTGCGCTGAAAGTGGCAGCCTCGACGGAAGGCATTTTGCGACAGGGGTTTAACGAAGTGAACAGCATGAAAGTGGCTCAGGTGCTGTATCAGGAGCTGGATATTGGTGCAGTCGCGATTACCGATCGAGAGAAATTGCTGGCCTTTACCGGAATTGGTGACGACCACCATTTACCCGGCAAACCGATTTCTTCGACTTACACCTTAAAAGCGATTGAAACCGGTGAAGTGGTCTACGCTGATGGCAACGAAGTACCTTATCGTTGCTCTTTGCATCCGCAATGCAAACTGGGGTCGACGCTGGTAATTCCGTTGCGTGGCGAAAATCAGCGCGTGATGGGCACCATCAAATTGTATGAAGCCAAAAACCGTTTATTCAGTTCAATAAACCGCACGCTGGGCGAGGGGATTGCGCAACTGCTTTCGGCGCAGATTCTTGCCGGGCAATATGAGCGGCAAAAAGCGATGCTCACCCAGTCAGAAATCAAACTGCTTCACGCCCAGGTGAATCCTCATTTTTTGTTTAATGCGC
TTAACACCATTAAAGCGGTGATCCGCCGCGACAGCGAACAGGCCAGCCAGCTGGTGCAGTATCTTTCCACTTTTTTCCGCAAAAACTTAAAGCGGCCTTCGGAGTTTGTTACTCTCGCCGACGAAATTGAACATGTGAACGCTTATCTGCAAATTGAAAAGGCGCGCTTCCAGTCGCGGTTGCAGGTCAACATTGCTATTCCGCAAGAATTATCCCAGCAGCAATTGCCCGCGTTTACCCTGCAACCGATAGTGGAAAACGCCATTAAACATGGGACATCACAACTGCTGGATACAGGGCGAGTGGCAATCAGCGCCCGACGTGAGGGGCAACATTTGATGCTGGAGATCGAAGACAATGCCGGTTTGTATCAACCGGTAACCAATGCCAGTGGGCTGGGGATGAATCTGGTGGATAAGCGTTTACGTGAACGGTTTGGCGATGACTATGGAATAAGCGTCGCCTGTGAGCCTGATAGTTACACCCGAATAACGTTACGACTACCATGGAGGGACGAGGCATGATTAAAGTCTTAATTGTCGATGATGAACCGTTAGCACGGGAGAACCTGCGCGTATTTTTGCAGGAGCAGAGCGATATTGAAATCGTTGGAGAGTGTTCAAACGCCGTGGAAGGGATCGGCGCGGTGCATAAACTGCGCCCGGATGTGCTGTTTCTCGATATCCAGATGCCGCGCATCAGTGGTCTGGAAATGGTGGGGATGCTCGACCCGGAACATCGTCCGTATATTGTTTTTCTCACCGCGTTTGACGAATACGCTATTAAAGCCTTTGAAGAACATGCCTTTGATTATCTGCTGAAGCCAATTGATGAAGCGCGACTGGAGAAAACGCTGGCGCGTTTGCGTCAGGAACGCAGCAAGCAGGATGTTTCGCTGTTACCGGAAAATCAACAGGCGCTGAAATTTATCCCTTGTACGGGGCATAGTCGGATTTATTTGCTGCAAATGAAAGATGTGGCATTTGTCAGCAGTCGGATGAGCGGTGTCTACGTTACCAGCCACGAAGGGAAAGAGGGCTTTACCGAATTGACATTACGTACCCTGGAAAGTCGTACACCACTACTGCGCTGCCATCGTCAGTATCTGGTTAACCTCGCGCATTTACAGGAGATTCGTCTGGAAGATAACGGCCAGGCCGAGTTGATTTTGCGTAATGGCTTAACCGTGCCGGTCAGCCGCCGTTATCTGAAAAGCTTAAAAGAGGCGATTGGCCTGTAAAAGACTGCTAAAATGGCTTTTTGCCTCATCAACACCTGAAGGCCTCATGCTAAGTAACGATATTCTGCGCAGCGTGCGCTACATTTTGAAAGCCAATAATAATGACCTGGTGCGTATTCTGGCGCTGGGTAATGTCGAAGCCACCGCGGAACAGATCGCCGTCTGGCTACGTAAAGAAGACGAAGAGGGTTTTCAGCGTTGTCCGGACATTGTTTTGTCGTCATTCCTCAATGGCCTGATTTATGAAAAACGCGGCAAGGATGAGTCTGCTCCGGCACTGGAGCCGGAACGTCGCATTAATAACAACATCGTGCTGAAAAAATTACGCATCGCGTTTTCGCTGAAAACCGATGACATTCTGGCTATCCTCACCGAACAGCAGTTCCGCGTTTCGATGCCGGAAATTACAGCGATGATGCGCGCACCGGATCATAAAAACTTCCGCGAATGCGGCGATCAATTTTTACGTTATTTTCTGCGTGGACTGGCAGCGCGCCAGCATGTGAAGAAAAGCTAAGACGGGTATGGCGGCCATGCGAAACATGGCCGCCGACAGATTATTTCACTTCTTTAAAACCAGCGGCTTTCATCACCAGTTCCATTTGCGCCATAGTGATACCTTTTTTGGCATCTTCAGCAGAAACGTTGATTCCTGAAATACCCTGCAGGGCTTTAAAATCCACTTTTTCCATATCGATAGTCACGTTTTCCTGCGCGTAGGTATCTGTATAGGTTAATTTTTCTTCAACACCCGCGATGT
TTTTGTATTTGGCGCTTAACGGCTCAAGTGTCTTGGCAGCGTCTTCTTTGGTGGTTGCACCAATAGAAGCAAATTGAATTTTGGTTTCAGAAGATTGCTTAAGCACCTTGTCACCTTTGTAGACATAGGTAATGGCAATTTCAGTGCCGTTCAGATTGGCGCTGAATTTCTTCGATTCTTCTTTGTCACCGCAGCCAGCAAGAGAGAAAACCAGAACAGATGCAACAACGAGGGAAAACAGCTTATTGAAAGCCTTCATGTAAAACTCCATTTTATTTAATCAAGAAACTGGTGACTCTCACCAGGGGCTATATAGAATATGCCTAATACCGTGACGTGAGCAGTCCGGAACTGGAGTAGAACTCTTAGTAAAAAGCACTATTTCATCCTTGTTGCTGAAGCATGGGGAATAATTGTTCGCAAAGTAAAACACCGTTATTCATTGCTTCTACCCGTGCCTCGCTTTCTGTATTACGAAATTGTCCCAACACATGTGCCAGCCGATAAAAACCGACCGCGGAGAGGTCATTCGCCAGCAACTCTGCCTGACTAATAGCGCTCTGTTCCTGATAGCGCCAGCCGTTATGGAGCAGTTGAATAAGTAACGCCTGGCAGCGCATCAGCAACTGATGAGCGGTAGACGGCACAGGCAAAACGCTGGCAGAAGGTAGCGGTGCCACAGGCGCAGTTTCTGCGTCCAGCGCCCAGGCGCGGGTTTTTGTCATCATCACCCGTGGTTCCAGTGTCAATTGCCCATCAACAAAACTGACAAAGCCAGAAACCAGACTCACGGGGTCGTCTGTTTGTTGCAAAAGCGCCGCCATGCGTTCAACGGCAAAAGGAGAACAGGCAGATGCCGGAAGGGATAACGTCAGGAGATTATCTTCACCTTCGCCGCTGATTACCTGCGCATCCAGCGTCTGGCGGCTGCTATCCCAACCGAGCGAAATACACTCAGCGACCGGCAGAATAAATAAGTTATCGACCTGATTAAGAGGCCGTATGCAGGCGGGGGGACGCTGGCGTAAATATTCCCGCAAAGCCACAATGCCCGGCTGGCGTAACGGCGCGCTCAACATTTGCCAGGCATCAGGCGACAGCGGCACAACGCTGCTTAAGCGGTTGCGGGTAGCTAACAGCAGCTCGCCATCGGCACTGCGTTTTGCTGCTTGTGAAACAATTTGCCCACCCGCCAGTGCGCCAGCCTGAAAACTAAACAGCCGACGCGTAGCTGCCGGTGAGTTTTCCTGTTCACTTCGCGGCCAACTGCGCGAAAGGTGCAAAATACTGCCGGTGTCGGGATCGGTAAACCAGATGCGTAAACCATAATGCTCAATATCCTGCCAGCAACGCATACCTAAAGACACCAGCCGCAGATGATCAAGCTTTGCTTCTCCGGCAATGCCAGAGCCAACGACCGTGCGCCACGGCACAGGAGGAACTTCACCAACACTGTCGCGCCGGGCCATCTCTTGTGCGCAATTTAATCGACTGTTTAATGCCGCAAGCTGACGTAAGCATTCTCCGGCATGATAGTGGCTGGCGCGGACGTGGAAGGCATCAACGCTTGCCCGTAGCTGTCGTAGTGATTCACTCACCCATCGCCAGTTGCAGCGTTCCGCCGCCTGCTGCGCGCGACTGAAAGCGGCCTCGTAGTGAATAAGCGGCTGGCTGATGCCGCCCAGCCATAATGCCTGGCTTAATTGCTGAACATATTGACGACACGCGTTACCCTCGTCGTTGGCAAACGGATCGTCAGATGATGTGACGTGTTCGCTGCGCATCTGCCAGATTAAATGAGTAAATTCCGCTTGCTGAGTTTTGGCCTCGACGAAGGCCTGCACAGCCAGTACGACATGTTCGCAAAGTGTGCCTTCAATACAATCACAACGGGCGAAACGAATACTGCTGCGGGAATAAAAACGCACATCGCTCATCGGTAAGCGGGCAGAGGGAATTTCGCCCGGCGTACAGAACAACTCAATGGTGATGCCTTTACC
GACCAACGCCTGTGCGCGTTTGCGGGTGGCATCGGGCAGGGTAGCCAGTTCTTCCAGCCAGATTGCCGGATCCCACGCTTCTTCTTTTTCCGTAGGCTGGGCGGTAGTACAAAGTCGTTGATAACTTAACACAAGCATCACGCGATGACGGCACATACCGCTGGCCCCGCAGCTGCACTGAGCCTCTTTCAGTGCCTGGCCGTTCGCCAGCTGGGTACGGACACCGTCACTGAAGGTGGCGATTAAAGCGCTGTTCTCATGGCTGATTTCCGGGACGTTGCCATTTTCCAGTTCCTTAAGACTGCGCTTAACAAAACCGGCATTGCTTAACGCCGTCAGGGCCTGCGGTGTCAGTTCTAATAATTCCGGACGTAGTGAATTCATGACTGAAGATTCTCCGCAAGCCATGATGCCAGCTCGCCCGGTGTCATGGCGGCTATTTGTGCGCCGACATTTACCAGCGCCTGGGCCGTATCGTGGTCATAGCAAGGTGTTGCTGTGCTATCGAGCGCTGCCAGTCCCAGCACTTTGATGCCGCTCTGGACACACTTTTTCACCTGATGCGTCAGTAATGATGATGAACCCCCTTCGTAAAAATCGCTCACGAGGATAATGACGCTTTTCGCTGGTTGTTCAATAAGTTGCCGACCATACTCCACGGCACTGGCGATATTGGTCCCGCCGCCCAACTGTACTTTCATTAATAACTCTACCGGATCGGCAACGTCTGCCGTGAGATCAACGACGCTTGTGTCAAACGCCACCAGATGGGTACGAATGCCGGGTAACTGCCACAAACAGGCCGCCATCACCGCAGAGTGGATCACCGAGTCCACCATCGATCCGCTTTGATCAACCAGTAAGACCAGTTGCCATTGTTCGCTTTGGCGTTTAATGCGGCTGTTAAAGCGGGGGGATTCGATATACAACTTGCCGTGTTGCGGGTGCCAGTGTTGCAGGTTGGCGCGCAGAGTACTTTTGAAATCAAAGTTTCGCGCCAGTGGAATTAATGAGCGGCGACGGCGATCGCGGACACCAGAAAAAGCCTGACGAACTTCCTTTGCCAGTCGCGCCATAATTTCTTCAACAACCTGGCGCACTATCTGGCGGGCGGCAGCCAGTACTTCGGGATTCATCAGATGTTTGGTGTGCAAAACGGCGCGTAGCAGGCTTTCCGAAGGCTGCATACGTTCCAGCACGTCGAGATTTGTCACCACATCTTCAATGCCGTAGCGCAGTACGGCATCGCTTTCCAGCCGCTCAATCACCTGTTGCGGAAACAGCGTGTGGATACTGTTGATCCACTCAGGAGTGGTGAGATTTGAGCCACCTAATCCACCAGAGCGTTCACCACGCTGGAGCCGTTCAGGATCGCGCCCATACAGCCACTCCAGCGCGTGATCTATCTGCCGGGCGTTGTCATCCAGCCCACAAAGCGTCGTTTCTGCCGCTTCGCCAAGAATTAATCGCCAGCGTTGTAGCTCACGGGTGGTCAGAAGATCGTTCAGTTCAGACAT
Protein sequences of DBSCAN-SWA_2 >CP034958|1708661:1718103|1708661_1709588_+|QAS84951.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGVIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMALLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSELGVRLLSLRSVADYVRREERAEGEALAEEMTLRDALSLFVARGCEVLPVVNTQGQPCGTLHFQDLLEEA >CP034958|1708661:1718103|1716966_1718103_-|QAS84959.1|DBSCAN-SWA MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSLIPLARNFDFKSTLRANLQHWHPQHGKLYIESPRFNSRIKRQSEQWQLVLLVDQSGSMVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDHDTAQALVNVGAQIAAMTPGELASWLAENLQS >CP034958|1708661:1718103|1714969_1716970_-|QAS84958.1|DBSCAN-SWA MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENSALIATFSDGVRTQLANGQALKEAQCSCGASGMCRHRVMLVLSYQRLCTTAQPTEKEEAWDPAIWLEELATLPDATRKRAQALVGKGITIELFCTPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKTQQAEFTHLIWQMRSEHVTSSDDPFANDEGNACRQYVQQLSQALWLGGISQPLIHYEAAFSRAQQAAERCNWRWVSESLRQLRASVDAFHVRASHYHAGECLRQLAALNSRLNCAQEMARRDSVGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTDPDTGSILHLSRSWPRSEQENSPAATRRLFSFQAGALAGGQIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWQMLSAPLRQPGIVALREYLRQRPPACIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGEDNLLTLSLPASACSPFAVERMAALLQQTDDPVSLVSGFVSFVDGQLTLEPRVMMTKTRAWALDAETAPVAPLPSASVLPVPSTAHQLLMRCQALLIQLLHNGWRYQEQSAISQAELLANDLSAVGFYRLAHVLGQFRNTESEARVEAMNNGVLLCEQLFPMLQQQG >CP034958|1708661:1718103|1711424_1713110_+|QAS84955.1|DBSCAN-SWA 
MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >CP034958|1708661:1718103|1714383_1714845_-|QAS87651.1|DBSCAN-SWA MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLTYTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK >CP034958|1708661:1718103|1709592_1710324_+|QAS84952.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRLRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK >CP034958|1708661:1718103|1713106_1713826_+|QAS84956.1|DBSCAN-SWA MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >CP034958|1708661:1718103|1713872_1714343_+|QAS84957.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKS >CP034958|1708661:1718103|1710471_1711203_-|QAS84954.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI >CP034958|1708661:1718103|1710304_1710412_-|QAS84953.1|DBSCAN-SWA 
MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG |
10 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a specific phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
2270978 : 2293545
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP034958|2270978:2293545|DBSCAN-SWA GTTATATTTTTTCGATCTCGACCAGATTAGTGTGCTGCGGGTTTCCCTTCGCCAGTGGTGAAGGGCGCAGAGTGGTTAGCGTATTCACACAGCCGCCATGGTCGATTTTATCGCCAGACATATTGGCCTCGTGCCAGGCTCCCTGGCCCATAGCGCTAACCCCAGGGAGAATGCGTGGTGTCACTTTGGCTGGTAGCCGAACTTCGCCACGATGGTTAAACACCCGCACCATATCGCCGTTGGCAATCCCACGTTTCTGCGCATCTATAGGGTTGATCCACACCTCCTGACGGCAGGCAGCCTTCAGGAGATCAATATTGCCGTAGGTCGAGTGAGTACGGGATTTGTAATGGAAACCAAACAGTTGAAGTGGGAAGGTTCTACGTTCAGGGGAGTCCCAGCCTTCAAAGGTAGAGGCATAAACTGGCAGTGGGCTTATCACTTCATCTTTTTCCAGTTCCCAGGTACGGGCAATTTCCGCCAGCTTGCTGGAATAAATTTCAATCTTACCGGAAGGTGTTTTAAGTGGATTTGCCTCGGGGTCGTCACGAAATGCTTTGTAGGCGACAAAATGACCATTGGGATCTTTACGCTTATAGATACCCATTTTTTTCAGTTCGTCGTAAGACGGTAACGCCGGATCTTTGGCAAGCATTTTGGCGTACAGATGTTGTAACCATTGTTCCTGCGTGCGACCTTCAGTGAACTTTTGATAGACGTCAGGTCCAAGACGTTTCGCGACTTCACTCAGGATCCAGTAAATCGGTTTACGTTCGAATTTTTCGCTGGTGACTGGCTGGAGGAAAATGAGATATCCCATGTTACCGGCGTAGTCGTTAGGAATAATATCTTCCTGCTCAACGGTCATCAGGTCTGGCAGCAGAATGTCGGCATATTTTGCCGATGAGGTCATAAAGTTTTCGATGACCACAATCATTTCGCATTTCGATTCGTCTTGCAGAATTTCATGCGTTTTGTTGATGTCAGAATGCTGATTAACGAGGGTATTTCCCGCGTAGTTCCAGATGAACTTAATGGGCACATCCAGTTTATCTTTGCCGCGGACGCCGTCGCGGATTGCCGTCATTTGCGGACCATGATCGATAGCATCTGTCCAGCTGAAGCAGGAGATTGACGTTTTGACCGGATTATCCAGCACCGGCAGGCGTTCTATGGTAATGGTATAGGTCGATTCACGCGCGCCACTATTTCCGCCGCTGATGCCGACATTGCCCGTCAAAATAGGTAACATAGCAATAGCGCGTGCAGTCAGTTCGCCGTTTGCCTGGCGTTGCGGCCCCCAGCCCTGGCAGATATAAGCGGGTTTTGCTGTGCCAATTTCACGCGCCAGTTTGATGATACGGTCTACCGGGATACCGGTAATTTGCGAAGCCCACTGCGGCGTTTTCGCTGTGTTATCATCACCTTCACCAAGAATATAGGCTTTATAGTGACCATTTTTGGGTGCATCTGCGGGTAAGGTTTTTTCGTCATAGCCGACGCAGTATTTATCGAGAAAAGGTTGATCAACGAGATTTTCGTTAATCAATACCCAGGCAATACCCGCAACCAGCGCGGCATCGGTGCCCGGGCGAATAGGGAGCCATTCATCTTCACGACCAGCAGCCGTATCGGTATATCGCGGATCGATAACAATCATTTTGGCGTTCGATTTCTCGCGCGCTTTTTCAAGAAGATAAGTGATGCCACCGCCGCTCATGCGGGTTTCTGCCGGGTTGTTACCAAACATCACGACCAGCTTGCTGTTTTCAATATCCGTGGTGCTGTTGCCATCATTACTGCCGTAGGTGTAGGGCATGGCACAGGAAATTTGCGCGGTGCTGTAGGAGCCATACTGATTGAGTGAACCGCCGTAGCAGTTCATCAGGCGTTTGACCGCCGAGGCTGATGGCGAAGAGCGGGTCA
TATTGCCGCCAACAATCCCCGAAGAGTACTGAATATATACAGCCTCATTGCCATATTGTTCGACGGTTTTTTTCAGGCTACTGGCGATAGTATCCAGGGCTTCATCCCAGCTAATCCGTTCGAATTTGCCTTCACCGCGTGTACCCACGCGTTTCATTGGGTAATTCAAGCGATCGGGATGATTAATACGCCGGCGGATGGAGCGACCGCGCAAACAGGCGCGTACCTGATGGTTGCCGTACTCATCGCTGCCGGTATTGTCAGTTTCCACCCAGGTCACTTCATTATCTTTAACATGTAGACGAAGTGCACAGCGGCTACCACAGTTGACGGAACAGGCACCCCAGATCACTTTTTCGCTGGCCTGTTGTACCGTTGCCGCTGCACTGCGCAGGGTGAACGGTAAAGAAAAACCGCCTGCAGCCAGCGCCAGAGAACCTATCGCGGTTGATTTAACGAGTGTTCTGCGGCTGATGCCCACCATTCGGTCATTTTTGGACATAACTCACTCCCTGTTCTTTATCGTTATATAAATGTTTATATATTGAATATTTAGCGCGCTAACAATAGAGGGAGTCTACCCATTTTGGGTTAAGAATTATTAATCCATATCAATAGAAGGGTATGAGTAATAAGGTGGGATTATGTTGTATGTTCAAATCGCCGGATGTGTCGTATCCGGCGTTCAGTCGATAATGTATTACTGCGGTTCGGCAGGCGCGCCATCCTGGGTAGACTGCGCGGGAGCAGAGACGTTACCGCTGGTGGTGCGGGTATAGAGAATTTTATGCGTATCATTAGCGCAATGGCCGACGACCTGGGAATCAGGCTGATCAACCTGGTCATTGGGTACAATACTTAACGTGAAGCTGCTTTCGGGTACGCCATTATTGATAATGCGCTGTGATATATCGCTCTGTATGCGCTCACAGGATCCCGGCGCGGCGAGTACCACGGGTGAGGCGAGGGCGAGCAGAAGCGCGGCACAGCAGGTTGAGAGTTTCATCATAAGCTCCTTACGCGAAGATAACTTCTTTAAGCATAGCATTTAACGTGTAAAGTACTGTATTTGCTACTATGATTGAGAATCATCTCTACTCTCTGGTGACTGTTGTGAAATACAAATTACTACCATGCTTACTCGCGATATTCCTCACAGGATGTGACCGCACAGAGGTAACACTTTCATTTACCCCTGAGATGGCCAGTTTCTCTAATGAATTCGATTTTGATCCGCTGCGTGGTCCGGTAAAAGATTTCACTCAGACATTAATGGATGAGCAAGGTGAAGTGACGAAACGTGTTTCTGGGACTTTGTCGGAAGAAGGCTGTTTTGATTCACTCGAATTACTGGATCTGGAAAATAATACCGTGGTCGCTCTGGTACTGGACGCCAATTATTACCGTGATGCCGAGACGCTGGAGAAGAGAGTACGTTTACAGGGAAAATGCCAGCTAGCAGAATTACCTTCTGCCGGGGTGAGTTGGGAAACCGATGATAATGGCTTCGTGATTAAAGCCAGCAGCAAACAAATGCAGATGGAATATCGCTATGATGATCAGGGTTATCCGCTGGGTAAAACCACGAAAAGTAACGACAAAACATTATCTGTCAGCGCCACGCCATCAACGGATCCGATCAAAAAATTAGATTACACAGCGGTTACTTTACTGAATAATCAACGGGTTGGTAATGTAAAACAGAGCTGTGAATATGACAGTCACGCTAATCCGGTGGACTGTCAGCTAATCATTGTTGATGAAGGAGTAAAACCCGCCGTCGAACGGGTTTACACCATCAAAAATACGATCGATTATTATTAATGCTATTGTGCGGTCGGCTTCAGGAGAGTCTGACCCGGTGTTTTGTGCTCTGCCAGATACTGATGCTGGAATATACACATGCGAATGGCATTACGATATTGACCATTAATAAAGAACTCGTGCATCAATTCACCTTCAACCGAAAAGCCAAGCTTGCGGTAAATGTGAATCGCTTT
TTCATTCTCTTTATCAACGATCAGATACAGCTTATAGAGATTGAGAACGGTAAAGCCATAGTCCATTGCTAATTTGGCGGCACGGGTTGCCAGACCTTTCCCCTGATACTCCGGGGAGATAATTATCTGAAATTCTGCGCGGCGATGAACATGGTTAATTTCCACCAGCTCCACCAGACCGGCTTTTTCGCCGTCACATTCCACCACAAAGCGCCGTTCGCTCTGATCGTGAATATGCTTATCATACAGATCAGAGAGTTCAACAAAGGCTTCGTAGGGTTCCTCAAACCAGTAACGCATCACACTGGCGTTGTTGTCGAGTTGATGTACATAGCGTAAATCTTCACGCTCCAGCGGGCGTAGCTTAACACTGTGGGCGCTTGGCATAACGTGTCCTTACATTCCTTAAATCAATAACAGGTTAGGGGGTAATAACGCGGCCAGTTCGACGGTCCAGGCAGCGCAAAGTATTGGGCTCCCAGTAGGCATTGATGTTGGCGCTTTGCTCACATTTATCGCGGTTATCAAAAGCGGCGTCGGCTTTATCCCACTCTTTTTCAGTGCGTTTATTCACTTTCTGGCGCAGATTGCGCGTGTCATTCCATTGCTCTTTTTCCATAGCGGCGTGCTGGCGGCTTTGTGCACTGTCGCCAGACTCAATCACCAGTTTGTTAGTTTCGGCATGAACAGTTGTGCTCAATGCCAGTGCGCAAGGCAGCAGAATAGCGAGCAGGCCGATTCGTTTGCTGAGAGTGATTTTCATAATTCATTCCCTGTATGAATGATTAAAGGTGATTCTACACCATCCACTGCGGACGCAAAACGTACCAGGAGGGTGTTTATATTGATGATATTATGTCGCCCTATAACTATACATGATGTCAATAAGAGACAAAGATGATTAAAACAACGTTACTATTTTTTGCTACTGCGCTGTGTGAAATTATTGGATGCTTTCTGCCCTGGTTGTGGTTAAAACGAAACGCCAGTATCTGGCTGTTGCTTCCGGCGGGGATTTCACTGGCGCTGTTTGTCTGGTTGTTAACGTTGCATCCAGCGGCGAGTGGGCGTGTTTACGCGGCTTATGGTGGCGTTTATGTCTGCACGGCGTTGATGTGGCTGCGCGTTGTGGATGGCGTGAAACTGACTCTTTATGACTGGACGGGTGCGTTGATTGCGCTTTGCGGCATGTTGATCATTGTTGCGGGCTGGGGGCGCACGTAGGAACATAAATCCATTTTATCAATAAGATAAGAGGAAGTGTCAGCTGACAAAAGGTATTCTATTTCATCTTTTGTCAACCATTCACAGCGCAAATATACGCCTTTTTTTGTGATCACTCCGGCTTTTTTCGATCTTTATACTTGTATGGTAGTAGCTCAGTTGCGTAGATTTCATGCATCACGACAAGCGATGCAAGGAATCGAACATGAAGATCGTAAAGGCTGAAGTTTTTGTTACCTGTCCGGGGCGTAATTTCGTCACATTAAAAATCACCACTGAGGACGGTATTACGGGCCTTGGGGATGCCACCCTCAATGGACGTGAGCTTTCCGTGGCCTCTTATTTGCAGGATCACCTTTGTCCGCAGCTTATTGGTCGCGATGCGCACCGTATCGAAGATATCTGGCAGTTTTTCTATAAAGGTGCTTACTGGCGTCGCGGTCCGGTTACGATGTCGGCCATTTCAGCGGTTGATATGGCGCTGTGGGATATTAAAGCCAAAGCTGCCAACATGCCGCTTTACCAGTTACTCGGCGGCGCGTCTCGTGAAGGGGTGATGGTTTATTGCCATACCACCGGTCACAGTATTGATGAAGCTCTGGATGATTATGCCCGTCATCAGGAGCTGGGATTCAAAGCCATCCGCGTGCAGTGCGGAATCCCTGGTATGAAAACCACCTACGGCATGTCGAAAGGTAAAGGTCTGGCTTATGAACCCGCAACCAAAGGACAGTGGCCGGAAGAGCAGCTGTGGTCGACGGAGAAATACCTCGATT
TCATGCCGAAATTGTTTGACGCGGTACGTAACAAGTTTGGTTTTAATGAACATTTGCTGCATGACATGCACCATCGCTTAACGCCTATTGAAGCGGCGCGCTTTGGTAAAAGCATTGAAGATTATCGCATGTTCTGGATGGAAGACCCGACGCCTGCAGAAAACCAGGAATGCTTCCGTCTCATTCGCCAACATACCGTCACACCCATCGCAGTGGGTGAAGTCTTCAACAGCATCTGGGACTGCAAACAACTGATTGAAGAGCAACTCATCGATTATATCCGCACCACGCTGACCCATGCAGGCGGAATTACCGGTATGCGCCGGATTGCCGATTTTGCTTCGCTGTATCAGGTACGTACTGGCTCACACGGTCCTTCCGATTTGTCACCAGTCTGCATGGCTGCGGCGCTGCACTTTGATCTGTGGGTCCCCAATTTCGGTGTCCAGGAATACATGGGTTATTCCGAACAAATGCTCGAAGTCTTCCCGCACAACTGGACTTTCGATAACGGCTATATGCATCCGGGAGACAAACCGGGTCTTGGCATCGAATTCGATGAAAAGCTGGCGGCGAAATATCCCTATGAACCTGCTTATCTACCAGTCGCACGTCTGGAAGACGGCACGCTGTGGAACTGGTAAGGAGTAAGATAATGAAAAGCATATTAATTGAAAAACCGAATCAACTGGCGATTGTCGAACGTGAAATACCCACCCCGTCAGCGGGTGAAGTACGAGTAAAAGTGAAACTTGCCGGAATTTGTGGTTCAGATAGCCATATTTATCGTGGGCATAATCCTTTTGCGAAATATCCGCGCGTCATTGGACATGAATTCTTTGGCGTCATTGATGCGGTGGGTGACGGCGTGGAAAGCGCCAGAGTCGGTGAACGTGTTGCTGTCGATCCGGTGGTCAGCTGTGGGCATTGCTATCCGTGCTCTATAGGTAAACCGAACGTTTGTACGACACTGGCTGTTTTAGGTGTGCACGCTGACGGTGGTTTCAGTGAATATGCCGTGGTTCCGGCAAAAAATGCGTGGAAAATTCCTGAAGCAGTGGCCGATCAATATGCGGTAATGATCGAACCTTTTACCATTGCGGCTAACGTAACCGGACATGGTCAACCGACTGAAAATGATACCGTTCTGGTTTATGGTGCCGGTCCAATCGGCCTGACGATCGTTCAGGTATTAAAAGGCGTCTATAACGTTAAAAATGTGATTGTTGCCGATCGCATTGATGAACGACTGGAAAAAGCGAAAGAGAGCGGGGCAGACTGGGCGATCAATAACAGCCAGACACCGCTTGGCGAGATTTTCGCTGAAAAAGGCATCAAGCCGACATTAATTATCGATGCGGCTTGTCATCCTTCTATCCTGAAAGAGGCCGTAACGCTGGCTTCTCCAGCGGCACGTATTGTATTGATGGGCTTCTCCAGTGAACCGTCTGAAGTGATTCAGCAAGGAATTACCGGAAAAGAACTCTCTATTTTCTCTTCACGCTTAAATGCAAATAAATTTCCGGTTGTTATCGACTGGTTAAGTAAAGGGTTAATTAAACCAGAAAAATTAATTACCCATACGTTTGATTTCCAGCATGTTGCTGATGCCATTAGTTTATTTGAACAGGATCAAAAACATTGCTGCAAAGTCTTACTCACTTTTTCTGAATAATACCAATAACGGCGAGTAAGTAGTACGCATCTTACCTCTTTTTTAGAGATAACCATTATGACAATAGAAAAACATGAAAGAAGCACTAAGGATTTGGTGAAAGCAGCAGTATCGGGATGGCTGGGCACTGCGCTTGAATTTATGGATTTCAAGAGTCATGCGTGTTAACTATTTGATAAATATTAAATTAATTTTTCATTGCTTCGTTATGGGGCATGGTTGGGGCAAACTCGCTTAACTGTGTATTTAACAAAGCTACCTGTGCATTATTGTTTTCAGACATCCATTTTCCGTATACCTGAAATACCATTTGCGCATCTGCATG
GCCCATCTGGTTTGCTATAAATGCCGGGTTAGCACCAGCTGTCAGCGACCAGCAGGCATAAGTATGTCTCGACTGATATGATTTTCGATGGCGGAGTCCGGCACGTTTTATCGCTGCGTCCCACATCTGCCTTATTGAGTCAACGGTAAAATGGTCACCATAATTTTTTACTCTCGCTGACACTTCAGGTTGAAAAACAAAGGTGCATTTTTGTTTTTCTGTTCTGCCATACTCTCTGAGGTGAACATCAATGATATGCTCTTTGCTCAGTCTCGTTAATGTCATCTGACTCCGGAGAGCGTCGATTGCTGGCTTAATAAGATGAATGACCCGATTGGTTCCCGCCTGTGTTTTTGGTACCGTGAAACGGTCTTTTGCTAAATTTCTCCTGATCATCATTGTTCCATTTTTCAGATCTATGTCCTCCCATCCAAGTGCACACAGCTCACCAGGGCGAACGCCAGTATAAACAGAAACACACCATAAATTTTTTGCTTGCTGATTTCTGCACGCATCGATAAGACGGATAAATTCTTCCCGCGAAAGAGGATCCGGAATGGTTCTTGATTCCTTTAATGGCGAGATCCCCTTAAACGGATTATCTGCCAGGTAACCGTTATCAACACCAAACTGGAACACGGCGTTAAGATTTGTCATGTAATTATTTACAGTTACAGCCGATCTCCCTGGTTGTGTAACAATATAGTTACTTTTGGGGATCTGGTATCCAGTCAGTAACTCTTTACGAACCTCCAGTAATTTTTCTTTATTAATCGATGAGGCAAGATTTTTTTCACCGATTATGCTCAGGATATTTTTGATGACGGCACGGTATGTGTTGAGTGATGTTTTGGCGACTTCAGTTTCTTTCAGTGCCAGAAATTTTTCAGCCAGTTCTTTTATGGTTAAATCTTGTCGGGCCTCACCAAATTTTTCCAGATTGCGTGAGGAGGGAAACTGTTTTGCATAGTCGAAAACACCAGTTTTTATTGCGTAACAAACAGAGGAGCGTAGTTCACCTGCAACGCGCCTGTTTTTTGCTGTGTCAGGAACCCCCAGGTTTTCCCTGACTCTTACGCCTTTATAAACAAACCAGATACGTAATTTCCCTCCATGGTTTTCCACGCCTGTCGGATATTTCATTTCAACTTCTCTCATTAGTTAGTGTGGCTTTTAGTCAAGTAAGATGACGTCTTGGTCTCGCTGATGCCTGGCGCTCAATCCAGCGATCAATTTCTTCCAGGTTGTAAAAGCATGGACTGTTATCCCATGGCATACCGTCATGAGCGACATGCTTATATTCCCTTCCTTCCATAAACGATTTTTCCCGGGCCTTTTTTAACGTACCTTTTTTTATTCCTTTCAGCGCAATTAACTGCTCTTCGGATACCCATTTGCCGGGAGAGACAATCATGATTACTTCGCTCATCGATTTCTTTATCTCTTACATTAGACGAGCGCCGGTTGCAGAATACCAGTCACAACCGGCGACAGTTGAACATTAAGAATCAGCCTGACTCGGGATCAGTTTTTGCCAGATAACTGAAACGTATTTTGCCTGGTAACGGGCGTCATCAAGTGCATTATGGCGCTCACCTTCGAATTGAATAGCCGTTCTGGCATCGAAGTCTATGACTTTCCCCAGCTCAACGATTGTGCGTACATCGCGATCGTTGTAGTAACGCCACGGGCAGGGGATCCCCTGCCGTTCGTATGAACGGCGCAAAATCGTGTTGTCGAAGTTGGCTCCATTCCCCCAAACCTGAACAAAAAATTCACCGGAGTTTTCGTCGATAAATTCCCGCAATTGTAACAGTGCATCATCTAACGGGATTTCATCGGTCATAATGGCAGATTGCGCTTCGCGTGATTGCTTAAGCCACCATTTAATGGTGTCCCGATCAATGACTCCGCCAGCAGTTTCCAGATCGATAGTCTTACTAAATTCCGGTCCCATATCTCCGGTTTGCGGATCGAAAAATATTGCACCTATTGAGGTG
ATCGGGGCATCAGGATTTTTTCCCATGGTTTCAAGGTCGATCATTAGATGGTCACACGTCCTGCTGGTGGATGTGATTTCGTGATGACCGTTCACCTTAATTGGGTGATCTGCCGTCTCGCCAGTTTCATTATCGCTATTGTGATGCTGATTGCCGCCAGTGTTCTCCTTGTGTGGATGTTCAGCGCCTTCCATTTCCTCCGGATCATCTTCCTGAACTTCAACCTGATACTCTTCATCGAATGTTTCTTGGTATGTTGCGTCGCCCATCACCGCGCCACAATCAGGGCAGTTGCCGCCGCCGGTCTGACCGCAGGCGGTGCAGACTTTTTCCACTTCCTGTTGCGCTACTGGTTCAGGCTGTTTCGTTTCTGGCTCGTTTTGTAACGCATTTGGACTGTTTTGTTCCGCTTTTTGGTAGTTCCGTTCCGATTCATGCTGGTTCTGGTTCACAGAATCGCGGGTCTGGATCCCCTTAACCCATTTCGGATCATTCGGGTCACTAATCCCTTCAACAAATTCACCACGTGATGCAGCAAGCAACTTATCAGCGTCAGGCTGGCTGATATTGGCTGCCTGCATAATTTTGTTTACTTCGTCAGCGGTAACTTTTATCGGCTCTGGTTGTTCTGAATCTTCAGCGGTATCTACATTTTGCGGTAAGCCCGTGTATGTGCCATTTTTTCGGGCAAAATATTCTTCTTTTGTGATTTCAGTGGCGCCAGCAGCCAGTGCCTTATCCAGACCAGAAAGTTTGTTTGCGCGACCGTATTTTTCTCCGTCCTTATCTGCGAAGAGGAAATAGAACGGCCCCTCACGCTCTACAGATGGTTCAGCTTCCGGCGCGGTTTCATTTTTTGGGATATCAGATACCTCAGTTTCCACTGCATCAGTTTGTGTTTCTGATGACTGGAGAACATCAACAGTGCCCAGGTCTGTTTCTTCATTCTCAAACACGCCCTTTGTCGTCAGGTATTCGCAGATATATTTGTTCAGTGCTACGGGATCTTTGTGAATGTCGATCGGACGCTCACGGACAAGGCCAAAAATAGTTTGGCGGTCGTAGCGAAGGGCATCAGGCTGTTTGCGCATTGATGCCGAGATACGCTTCCAGTCTTCGCGGTCGTTGTCGATAACTTCTTTTTTTGCCCAGCGATGGATGCTGCCGTCAATGTTTCCGGCATCCACATCACCAGGCCAGAGAGCGTAGGCCAGTTCGTCATCCAGTGTTTTCCATGTCTGCTTGTATTCGCGATGAATGGCAGCAATGACCGGGCTGATTTTTCCTGTTGAATTTTCAGTGTACTGTTGATTGGCTCTGGCGCGTGCGAGATCAACAACAGACGTGTATTTTCCGGTTTCCTTGCGTTCACCTTCGCGACGTTTTTTCCAGATGCGCATCTCTGCCTGAATTTCGGGCCATTTAGCACCAGGAATACATTTATGCTTAACCCACCCAATGGCGTGCAACTTAAGCTCCGGATACATGGCGTTAACTTCTGGCATTTTCATCAACGCTTCAACGATATGTCCGTCGAATGTTGCCATGTCTTCCTGCAACAATTCCTGTGCGCTAATAACCATATCAACGGTGATGTTTTCACATGTGTCGAACTTAACCATGACAGCGTTCTGTACTTCAGGGGCCAGCTTGTCAAAAGTGACGTTCATCGGATCGGATTCAGTCTCAACCGGGACAAAAGAAGCAGACTCCTCATCCCAGCGGTTTTCCTGCATATATTCAGCATCCCATGAATCGAGGGCAGGGCGGGGTATGCCAGGTTTATCCTCGCAGACAATAAATTTATAAGCGCAGTCCTGAGCAGCCGGATAATGTTCCAGGAATTGCCAGTGAAATTTTGCTCGAGCACGGCGTTCGTCGCCAGCTTCAATGGCTGTGGCTACAGCCACAGCGCCTTCTTCCCTTGTTGCCAGTTCGTCAGGAATAGCGGCGCAAATAAAGACTTTACTCATTTTGTTTTAACCTCATGACAGATTTAA
GGATGAACAAATCCCTGCCATTGCTGGCATATAAGAATGAAACCGGATATTTATTACGGAACTGTTTTAAAGACCTGCCGGGATTTCGATATTATCCTGGTTAATAACTTTATCGACCGGGTAACAGTTACCGGGAATTTTCTGTTCGGTTGCTGCAGTCATACACTCCTGCATTGTCCTGTGAACACTGACTGCAATATCAACTGGCACTCCGGAAACAAGAAAAACTGTCAGAACAAGCGCAAATGCTGAATTCATTGTGCACATCCTTTTGGCATCAGACGTAAACGAGCCAGCATTGAAACAATGCATATTTTATTTAATAGCTCCCGTTCTTGTTTTCTCTTGTTAATGGCATCTTCAGTAAATACAGGGTTACTGATAGTGACACCAATTTCAAAACAACCTTCAGACGTATTAACGTTTGGTAATAACGTTTTCATTATCGCGTCCTCAACAATGAATTTTGTGATGCAGTGCCTGGTGCCTCCAGGTGACGTTAACCAGTTAACAATTAACGTCGGATATCCGGATTAGTGATTTCAGGTTGTATCGTGAGATCAGTGATGGAAAAAGTATTACGTACATGATCGCCGGGTTAAATAAAGAATATGGCGATGTGGTGGAATCCGGACTGCTTTTTGCAGATCCTGCCGTTGTAGATCGTGAAACTGACGAACTTATAGAAAAAGCAATTGCTTTCAAGCTTGCGTATCGACAGCAATACCAACAAAAAGCTGGATGGAATTATGAGTCTTCTTTTTGCTGAACGCCCACTGGTTATAAACACGCAGCTGGCAATGAAAATTGGCTTAAACGAAGCCATTGTTTTGCAACAACTGCACTACTGGTTGAGAGATACCAACTCCGGCATGGAATGTGATGGTGTTCGCTGGATTTATAACACAACGGAACAATGGCTGGAACAGTTCCCATTCTGGTCAGAGTCAACGTTAAAGCGCGCGTTTGCAAGTCTGAAAACGCTGGGGCTTTTGCGTTGTGAAAAGCTCAATAAATCAAAGCGCGATATGACCAATTTCTACACGATCAACTACGGGAGCGAGCTTTTAGATGGTGGCAAATTGAGCGAATCCATCGGTTTAAAATGCGCCGCTCCATCAGGTCAAAATGACACGATGGAAGAGGTCAAAATGAAACGCTCCATTGGTTCAAAACGACTCAATGTCATCGGGTCAAAATGGCCTGATGATCTTACAGAGAATACAACAGAGATTACTACAGAGAATAAAAAGACTTCTCGTCCGGAAGCTTCGCAACCGGACCCGCAGACGGTTGAACAGGATTTTTTAACCCGACACCCTGACGCGGTTGTGTTCAGTGCGAAAAAACGCCAGTGGGGCAGCCAGGAAGATTTGGCGTGTGCGCAGTGGATCTGGGGGCGAATCGTGAGTCTTTACGAGCAGGCCGCCAGCGATGATGGCGAGATTTCGCGACCGAAAGAACCCAACTGGACCGCATGGGCCAACGACGTGCGCACAATGCGGATGCTGGATGGCAGAACTCACAGACAAATTTGTGAAATGTTTGGTCGGGTGCAGCGGGATCCATTCTGGGTAAAAAATATCATGAGTCCGTCAAAGCTTCGCGAAAAATGGGATGAACTGGTTATCCGCCTGGGGCGTTCGTCTGTACAGCGTTGTGTGAATCATATTTCTGAGCCGGATACCGAAATTCCGCCGGGCTTCAGGGGGTAAGTGTTAATTTCTGGTCATGAGGTAATTTTCAGGAGGGCTTGTGGCAAAAGTTTTTACACAAGAAGAGCGGGAAAAAATTAAAGGGCAGGTTGTTGAACTCGTACGCCGGAGTGGGCGTGAGACGTTACGGCAACTGGAAGTCAAGACAGGTGCGACAAGATATCTGATGAGCGTTCTCGCAAGAGAGCTGGTTGCCAGCGGCGATGTATACAACTCTGGTTACGGGTTATTCCCGTCTGAACAGGCGCGTAAGGACTGGCAAAATGCCCGTAAAAAGCTCTCAAG
GGCAAAGCTGAAGGAACCATCTGCGGTTGATCCGGACCTTATCTGGTCATTACCTGACGGAGAAATACGTCGTTACGACAGGCGTCATAATATGATTTGTACTGAGTGTCGTAAAAGCGAAGTTATGCAGCGCATATTGTCGTTTTATCAGGGGGATGTCCGGTATTTATTGAAGTGACGAGATTAAAGTGCATTAGTTCAGATGCAAATTGACATTTTGTGGCACAGGGTAGAGCTAGCGTGGTTGTCCGCTTTGTGCCAAGAGCGGACTTTGCAAAATGGGGGTTATTTCAATCAAAACGTAACGTCACAACCAGCCGACGCTCTCTCGCCATTTATAATTAGTAACTTTATCATTTTCGCTTATTTTTTTAGATATAGAGCGCGGCTCTCTTCCTAGATACTCAGATATTTCTATCGGGGACAAATCAAAATCAACAAGCATGACTCTAAGTTTTTCCATTTCCTTTAAAGTCCAAGGCTTGCCATAATTTTCATAAAGAGATACCTTATGCTCCCTGATAGTTCGCTTTCTCTGAGTTTCACTTTCCCTCTGAAAAATCTCGCTTTTAAATTGTGAACGGAACTCTTTACAGAAGTTGTTATCAGTTTTTTTGTTATATAGATTGCTAAAAATAAGCTGGGATGCCGGGTCTAAATCAGGTGTGTTAATAATGAAAACTTTGACCTTTTCCATATAGGGATATTCAATTTTACCCAATATGCTTAATTGCTTTATATTAAAAAAACCTCTCAAATCAAAAGATTTAATGAGCTTTGATTGGATTACACTTTCAATCCTGCTTGCATTTAAATAACAGTACTTTGCCATTGGAAGGGCCCATACTACCAGTTGAGGAATATATTTTTCTAATGCAATGCTCTGCCAATCAGTATCAAAGGTTTGGCTTTTTGCTAGCATGTTTTTCGGCAAATCAGAATACATTGTGGTAGATGCCCATATCTTATAATCATTCGCCAAGGCTTGATAGTATTTTGTGTGGTTATGAATTTTATAAGCTGACATGAAGCGATAAACGTCTTCGTCGTGACCAGCGTCGTAAATTGTTCGATTTCCTCTAAGATAACCGTCGTAATGTTCGGTTATCCTTCTACCTACATTACAACTTACCCCAACGTAAACCACACGACTGAAAAGTCCTTTATGGACAATAAGATAAACTCCGCTACAGCCAGATTTCCTGGCCTCTGATAGAGAACCTAAAAATCTCCATTCCATAATTAAATCCATAATTATTGCTACTGTTTTTGTTTATCATTATTTTCGTGAAACTTCAACAATTTTATCCAAAAGCTAAGGGCAAAGACTTATATAATTATACTTGTCATCGTTAGCGATTATATAGAGTAGTGGCGCTGACCTGCTCCCTGGTGATTCACACAGAATGCTGTTAGTAATGTCCGTTCCTCGCTCTCAGCGGACCTTCAGCTCAGTGATATCGTCCGCTCTGTGCAAAGAGCGGACGTTGGTATGCAAGAGCCCTCCAAAAGTTGATGGTTGGTTTGCAGGGGGGCTTAAAGAAACTGCACTTATCAAGTTGAAGTTCTGTATTCAGCGAAATCGTAGCACTCTGACGATAAGTAACTCCGGTACTCCGCTCTTCGATGAAGAACCAGAGTAATCCCCCCGAAAAACCAGCGCATCAAAATTGGATCTTCAGCGGTAGCTTATCGGCTATCGGAAGTACAGGTGTGGATTCGTGGTGAATTGCTTTGATAATAAACGATTAATACGGAAAAACGCATTAATCATTTATTAGCTTTTAGTAAACCACAATTTATTCCGTTTTACATATCATAGTAGTCGATTGGAGAATATAGTTTCTGGGAATGTACTCTTCAAAGTGTTCGTCCTTTTTAAATACATGAACTACATTTGGGAATAATTGATAGTCAACAGGGTGTATAGCGTTTGGATTATGGTACATGTACATGGCTGTACACCATGGTTCTTGATAGTTAGGGTCACTTACATCGGC
TGAAAATGGATGTGGGGCTGCATCCTGATCAGTTTTAACACCACTGACGTACACTTTGAATCCACTCGCCTCTACACCTGCAAGAATTCCCATCCGGTTAAACTTAGGTATGGTTGCTTGAGTAGTGAGTAAAACGGCAGAAACATAATTATTTTGTTCTGAGCCAAAAAAGTTCGACTTGATACTTCTATTTTCATCTGTATGTCTTTCAATAGAAATGCCTGACTCAATATCAATCCCGTACAAATAGCTATGCAAGGCTTCGCTTGAGAAGGCCATGGACATTCTTTTTGAATAATCCTGCATTGCTATGACAAATGGTTTGTTCTTTGTATGGTTGAGTTCCCAGTAATGAACTTTCTCTGGCTCAGGGCAATGCCGGACTTTTTTTAATAAACTTCTTGCAAACTTAAAAGGCATGACATTTAGAACATGTTTTCTTAATTCATCCATCTGTTCATCGTTAATGACTTTTCTTTCAAGAGGGGCTTCTGCTTCAGCAATGCTTACAGCCTCTACAGCAATTTCCACTCCAAATTTAGATAGCAGAAAATCTGGTTGATTGTATTCTCTATTCATTTCAAAGTCGAGTTCATAAAATACAGCGTTCAAATATAATTCAAATAACCTTGAATTAAATGCATCACTTTGAAAATCCCTTATAAATATTCCATCAGGATCTTTGAACCAGTATGCAAGTTCCTCAAGAACAATATATGCAGGGAAATGAAGAGGGTCTTCGAGGAGCATTTTTATATAAACATTCCTTTTTTTCGCTGGGACCTTACTCAAGAATAATGAAAAAGGTTTGGTTGATTCATCGCCTTGCATGAATGTACCATTTTGGTGCTGCGCCAGCATCTTTGGTATGTCATCGTTCAAATTATTAAGCAAGACATCCATTGAATCAAATGAAGCCAAGACGTTTATTGCTCTGAATTTTTTATCTAAATCCCGACCTAAGACTATTGCGTTAAAATCTTTATCAATATTGCATATGATTATTGTGGATAACAATGTTATCCCATTCCCCTCATATTTAAACCAGCGTATCTCCTCAGAAAATGTCTTAAGGTAAGGTGAGCGACCGTAAAAATAAATATCAAATTGTTCTTTGCTGATCTCACTGAAGTGTAATCCTGCGTTCATACCAATTCCTTTTCAATGAATAATTGGCCTTTAGGAGTGATTCCCTTTGTCTTTAATTCAGTTCTAACTAGTTCTTTAATCCAATAGCCTAAGCTCATCATGCAGTTGGATCATAAGACAACGCCCTATAGTGCTCGTGATACTATAGGGCATCTGACCACACTGTTAACTGGAGTAACGACTATGGCAGGAATACAGCATAACCAAACTCACCCCAAACTTACATAGCGCTTTCTGGCCGTGAGCATAACAAGGTCCACTCCTCGCTCATAAGGGACAACCATACTCAAATCTCCCACATTGCAGGAGATTTGAGTATGAACACGTCACCGTGGAACAAAGACCGTATCATAGGCCAAAAAAGACCACTTCAGATATCTCATATCTGGGGTATCCGAATCCGACTTGAACTGGAAGGTAAAACTCGCGATTTAGCTCTGTTCAACATGGCCCTGGATAGTAAGCTTCGAGGCTGTGATCTGGTCAAACTCAAAGTATCTGATGTTGCATATGGTGGCTCTGTTTCAAGCAGAGCAACGGTGTTGCAACAGAAAACCGGTAGCCCTGTTCAATTTGAGATAACCAAAGGGACAAGAGAAGCTGTTGCTGCATTGATACAGCTTAGCAATTTGCACAGTAAAGACTTCTTGTTTCGGTCTAGGGTCGGAACTAACCAGCACATTTCAACCCGGCAATACAACCGAATCTTTCATGGGGGGGTAGAAAAGCTTGGTCTCGAAGATTCGCTTTACAGCACACATTCCATGAGAAGAACAAAACCTTACCTGATCTACAAGAAAACCAAGAATCTCCGGGTGATCCAACTTCTGTTGGGTCATAAGAAACTGGAA
AGCACAGTCCGTTATCTGGGCATTGAAGTCGATGATGCGTTAGAGATTTCTGAATCGATTGAAGTCTAAGGTTGTCAGGGCTGCAACAGCAGCCCTGTGCCATAAGCGGAAGTATTTAACAACTATCAGTGTTGTTCAACAGATAAAGGGGCACTTGATTTTTTCTGTTCTCAGGAAATGATAAAAGCGCGTCGGTTCAAGCCTGCTTAACGGGAGTTTGTTAATCCTGTTGCCGTGACGTTTTGACACCATTATGATGGGGAGACACTTAATGTATGAAGGTTCCGCCACTTATACCTGTCCAACAACTGCCTCGGATGTTTCTTTGTATGAATAAGTGGTAATGAGTAGTGAATCGCTAACAGTCACCCGAACAATCGGTGCCTGCAATTAATTCTATATTCTAAACGAGGGGGAGATTATTACACATGAAATTTAAGGACAAGAACCTTAAGGCTCTCGCGGAATGTATCATAGGAGATAATAAGGCATTTCTGTATCGTTCAAGCAGTCACATCACTGAATTTTTCCAGGACTGCGGCATGGATGTTACTCATGACGGATCCACTCGGTGGAAATGGACGGCCCAGAGGCTTGAAGAACTTCTTTATGAGCCACAGTCAAAGCCACATACTTTGCCGGAAAGGTTTGTTCATGTGCTCAGAACTTTAATGTTAAAAGAAGATGCAATGGATGACGATCCAGGAAGATTAAAGGCGCTTGAAGAACTGAACAAGCCTTTGATGCGGGAAGGCTATGAGGCATTCTATGGTGACGATCGCCTTTTGTATATACGCCATACCGATACCAAAACGGTTTCAGTCAGTAATAACCCTCATCGGCCCTTAACGCCTCACGAAGTAGAATGCAGAAGGTTACTGACCGCGTTTCTTGATACCTGCTCAGAAGATGAGTTAATAGAAGATATTCTCCTTCCTTTATTCCGGCAACTTGGTTTTCACCGGATAACAGCAGTGGGACATAAAGATAAAGCGCTGGAATACGGGAAAGACATCTGGATGAAGTTCACACTGCCAACTCAGCATGTTCTTTATTTCGGCATTCAGGCAAAAAAAGGTAAGTTGGATGCGTCCGGTGCCAGCAAATCTACGAATTCAAACGTGGCAGAAATCTTCAACCAGGTACTGATGATGCTTGGCCATGAAATATTTGACCCAGAAACAAATAGAAAGGTGCTGGTAGATCATGCCTTTATCGTTGCTGGCGGAGAAATTACTAAACAGGCGAGGAACTGGCTGGGCGGGAAACTTGATGCCAGCAAAAGAAGCCAGATAATATTTATGGACCGGGAAGACATTCTTAATTTATATACTGTAAGTAATGTACCTCTGCCAACAGGTGCTCTCATCTCTGATGATGCCGTTAAGAACGATGATATTCCTTTCTAATCAGAAGTACGTCTTTTTCTGAAAGAATACGTGATAGGTAGCCACACCACACCTTTAGTGACCCCTTAATCTGGTAATATAACAGCCCGTATGAATGTCCGCGGCATCGCGGGCTGAAATTTATTAAAAATACTTATTCATCAAGCTGGAGTAGTTTGCCGAGTAACTGTAAACGCCCAACTTAACCGGACCATTCACTTTTAGATTGCTACCAGCAAACCAACTTCCGTTTCTCGCTCAAAGCGGACTAGAAGGTTAGCTTGCGTCGGACTTGGCGTATTTAAAGAAGTGCTGGTGGTAACTGGTTGTTGTGTTCCATTTCTACAAAACAAAATCACAGAAACTATACCCAATAGTTATATTGAATCAATGATGAGACAGCCTCATATTTATCAGAACTGGTGTACGTCCAATACAGGAGGTTGTCGTGCTGGTTCTCAAATATGCGCTAGCTATTGCGGCTGTAATGGCAATTTATTGTCTTGCTATTGTTCTTACGGATCGCCTTTCTGATTGATTTTATATTGGCGAGGTGACGGGAGTTAAGTAGAATTGCTGCGGGTGCTTGAGGCTATCTGCCTCAGGCATGAA
CACCAAAGGCAGATAGAGAAAAGCCCCAGTTAACATTACGCGTCCTGCAAGACGTTTAACATTAATCTGAGGCTCAATCTATGAACGGCAAATCTAGGTTAGCCTCTTACGTGCCGAAAGGCAAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAATGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCAGGCGGGGGAGAATCCCTCGCCACCTCTGATGTGTCAGGCATCCTCAACGCACCCGCACTTAACCCGCTTCGGCGGGTTTTTGTTTTTATTTTCAACGCGTTTGAAGTTCTGGACGGTGCCGGAATAGAATCAAAAATACTTAAGTAGCGCGCAGGGATAAGAGGGATGGTCCCTTAAAGGGGAGAGCTAATTATCCGGAAGGATTCTGATGATGAACATCGAAGAACTGCGTAAAATTTTTTGTGAAGATGGCCTCTATGCTGTGTGCGTTGAAAATGGAAATCTTGTTAGTCATTACCGCATTATGTGTTTGCGAAAGAATGGGGCTGCGTTAATTAATTTTGTGGATGGTCGAGTGACAGACGGATTTATCTTGCGCGAAGGTGAGTTTGTCACTTCATTACAGGCACTGAAAGAGATTGGAATAAAAGCAGGCTTTTCAGCTTTTGCAGAAGAATAAACTCATCTACAATCTTGCGCGGGGCTGAACTCCCGCTGAGTAACACCGTGCCACCGGAGAAAACCGATGGCACGCAACGTAAAATATTACAATTCTGATAATTCGCCCGTTCTTGCCTGCACGCACGAGCGGTATTCTCACGCATTCAAGTCTGAATGGTTCCAGCACCCTCCATGCACTGAAGAGCAGGCTGAATGGATAATTCAGTGTTACCGCAGGCGCGGATACGAGGTTAAGAAAGCTCTTAGTCTCGACTACCGTCACTGGATAATCTCAGTCAGATTGCCTTACTCCGAACGCCCACCGCGTCCGTCCCGTACATTCCAGCAACGCATCTGGAGGTAACGTGCGGGTATTACTTCGACCTGTTCTGGTACCGGAACTCGGGCTGGTGGTCGTTAAGCCGGGCCGTGAATCCATGCCGGTATTCCACAATACCCGGGTAACCGAAGCGCTCGACACGACATTAGGACTGCTTACAAAAACAGGACGAGACTGGAACACGCAGCATACTGATAACATTAATAAATTTATACCAATTGCAGGCAGTACAAACGGCCCGGCAGGCTCTATGGTTCTTGGCGGCATTCATGTTCAATTTAGTAAAAATTATGCTGTGCAGTTCGGAGGCCGCAATTCCGGTTTTTGGGGAAGAACAATTGAAAATGGAACGACACAGGAATGGAAGAAATTACTAACAGTAGACGATCTCAATTCATCTACCGATCTTGCTGTCAGGTCATTAACCACATCTAACCCGGTAAAATCTGGCGGAGGGCGAATTGATGTCCTTGGAAGCACGTCAGACTATAGCAAAATGGATTGCTTTGTACGTGGGTTTGATAGCACCGGTAATTCTCTCGTGTGGGCGTTGGGTTCATCAGTCGGCGTAAGTAAGATGCTATCGCTAAAAAATTTCTTTAGCGGAGCTGAGATACTGTTAAATGGTAATGACGGCGCGGTTCAACTCAAAACAGGTGCTGTTAACGGGGCTACAGCGCAGACGCTCACTATCAACAAGAATGAGGTTAACTCAACCGTTGATTTAACCCTTACAAAGCAATCAGGGACTGGTAATCGTTTTGTTTTACAGAACTCAGGTAATGCAGAACTACCGTTTTCTGTCAGGGTGTGGGGTTCCAGTACTCGACAAAACGTTTTTGAGGTTGGAACGTCTGCTGCGTATCTGTTTTATGCGCAAAAAACGTCAGCAGGCCAGTTGTTTGATGTAAATGGCGCT
ATTAATTGCACAACGCTGAATCAGTCATCAGACCGCGACCTTAAAGACGATATTCTCGTTATCAGCGACGCGACGAAAGCAATCCGTAAAATGAACGGATACACCTACACGCTCAAGGAAAACGGGATGCCTTATGCTGGCGTTATTGCACAGGAAGTAATGGAGGCGATACCAGAAGCTGTGGGATCGTTTACTCATTATGGTGAAGAGTTGCAAGGTCCGACCGTTGACGGCAACGAGCTACGCGAAGAAACTCGCTATCTTAATGTTGACTACTCCGCCGTGACGGGTTTACTTGTTCAGGTCGCCCGTGAAACAGATGATCGCGTTACCGCGCTGGAAGAGGAAAACACAACGCTACGTCAAAATCTGGCAACAGCAGGCACCCGGATCAGCACTCTGGAAAATCAGGTAAGCGAACTGGTTGCACTTGTCGGGCAGTTAACAGGAAGCGAACATTGATATCCTTCAAGCCCTGAAGGAGGCTGTTCCTGGTACGTTCAGACTGTTGTTGAGCTGGAAATCGCAACGGAGGAAGAAACTTCGTTGCTGGAAGTCTGGAAGAAGTATCGGGTGTTGCTGAACCGTGTTAATACAACAACTGCACCGGATATTGAATGGCCAGTAGCACCTATAGGGTAA
Protein sequences of DBSCAN-SWA_3 >CP034958|2270978:2293545|2274729_2275290_-|QAS85455.1|DBSCAN-SWA MPSAHSVKLRPLEREDLRYVHQLDNNASVMRYWFEEPYEAFVELSDLYDKHIHDQSERRFVVECDGEKAGLVELVEINHVHRRAEFQIIISPEYQGKGLATRAAKLAMDYGFTVLNLYKLYLIVDKENEKAIHIYRKLGFSVEGELMHEFFINGQYRNAIRMCIFQHQYLAEHKTPGQTLLKPTAQ >CP034958|2270978:2293545|2270978_2273405_-|QAS85453.1|DBSCAN-SWA MSKNDRMVGISRRTLVKSTAIGSLALAAGGFSLPFTLRSAAATVQQASEKVIWGACSVNCGSRCALRLHVKDNEVTWVETDNTGSDEYGNHQVRACLRGRSIRRRINHPDRLNYPMKRVGTRGEGKFERISWDEALDTIASSLKKTVEQYGNEAVYIQYSSGIVGGNMTRSSPSASAVKRLMNCYGGSLNQYGSYSTAQISCAMPYTYGSNDGNSTTDIENSKLVVMFGNNPAETRMSGGGITYLLEKAREKSNAKMIVIDPRYTDTAAGREDEWLPIRPGTDAALVAGIAWVLINENLVDQPFLDKYCVGYDEKTLPADAPKNGHYKAYILGEGDDNTAKTPQWASQITGIPVDRIIKLAREIGTAKPAYICQGWGPQRQANGELTARAIAMLPILTGNVGISGGNSGARESTYTITIERLPVLDNPVKTSISCFSWTDAIDHGPQMTAIRDGVRGKDKLDVPIKFIWNYAGNTLVNQHSDINKTHEILQDESKCEMIVVIENFMTSSAKYADILLPDLMTVEQEDIIPNDYAGNMGYLIFLQPVTSEKFERKPIYWILSEVAKRLGPDVYQKFTEGRTQEQWLQHLYAKMLAKDPALPSYDELKKMGIYKRKDPNGHFVAYKAFRDDPEANPLKTPSGKIEIYSSKLAEIARTWELEKDEVISPLPVYASTFEGWDSPERRTFPLQLFGFHYKSRTHSTYGNIDLLKAACRQEVWINPIDAQKRGIANGDMVRVFNHRGEVRLPAKVTPRILPGVSAMGQGAWHEANMSGDKIDHGGCVNTLTTLRPSPLAKGNPQHTNLVEIEKI >CP034958|2270978:2293545|2283652_2284618_+|QAS85466.1|DBSCAN-SWA MSLLFAERPLVINTQLAMKIGLNEAIVLQQLHYWLRDTNSGMECDGVRWIYNTTEQWLEQFPFWSESTLKRAFASLKTLGLLRCEKLNKSKRDMTNFYTINYGSELLDGGKLSESIGLKCAAPSGQNDTMEEVKMKRSIGSKRLNVIGSKWPDDLTENTTEITTENKKTSRPEASQPDPQTVEQDFLTRHPDAVVFSAKKRQWGSQEDLACAQWIWGRIVSLYEQAASDDGEISRPKEPNWTAWANDVRTMRMLDGRTHRQICEMFGRVQRDPFWVKNIMSPSKLREKWDELVIRLGRSSVQRCVNHISEPDTEIPPGFRG >CP034958|2270978:2293545|2275800_2276127_+|QAS85457.1|DBSCAN-SWA MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLTLHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLTLYDWTGALIALCGMLIIVAGWGRT >CP034958|2270978:2293545|2283157_2283346_-|QAS85464.1|DBSCAN-SWA MKTLLPNVNTSEGCFEIGVTISNPVFTEDAINKRKQERELLNKICIVSMLARLRLMPKGCAQ >CP034958|2270978:2293545|2290831_2290939_-|QAS85472.1|DBSCAN-SWA MLTGAFLYLPLVFMPEADSLKHPQQFYLTPVTSPI 
>CP034958|2270978:2293545|2280080_2280317_-|QAS87682.1|DBSCAN-SWA MIVSPGKWVSEEQLIALKGIKKGTLKKAREKSFMEGREYKHVAHDGMPWDNSPCFYNLEEIDRWIERQASARPRRHLT >CP034958|2270978:2293545|2280404_2282876_-|QAS85462.1|DBSCAN-SWA MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHREYKQTWKTLDDELAYALWPGDVDAGNIDGSIHRWAKKEVIDNDREDWKRISASMRKQPDALRYDRQTIFGLVRERPIDIHKDPVALNKYICEYLTTKGVFENEETDLGTVDVLQSSETQTDAVETEVSDIPKNETAPEAEPSVEREGPFYFLFADKDGEKYGRANKLSGLDKALAAGATEITKEEYFARKNGTYTGLPQNVDTAEDSEQPEPIKVTADEVNKIMQAANISQPDADKLLAASRGEFVEGISDPNDPKWVKGIQTRDSVNQNQHESERNYQKAEQNSPNALQNEPETKQPEPVAQQEVEKVCTACGQTGGGNCPDCGAVMGDATYQETFDEEYQVEVQEDDPEEMEGAEHPHKENTGGNQHHNSDNETGETADHPIKVNGHHEITSTSRTCDHLMIDLETMGKNPDAPITSIGAIFFDPQTGDMGPEFSKTIDLETAGGVIDRDTIKWWLKQSREAQSAIMTDEIPLDDALLQLREFIDENSGEFFVQVWGNGANFDNTILRRSYERQGIPCPWRYYNDRDVRTIVELGKVIDFDARTAIQFEGERHNALDDARYQAKYVSVIWQKLIPSQADS >CP034958|2270978:2293545|2282969_2283161_-|QAS85463.1|DBSCAN-SWA MNSAFALVLTVFLVSGVPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVINQDNIEIPAGL >CP034958|2270978:2293545|2283429_2283672_+|QAS85465.1|DBSCAN-SWA MRISDFRLYREISDGKSITYMIAGLNKEYGDVVESGLLFADPAVVDRETDELIEKAIAFKLAYRQQYQQKAGWNYESSFC >CP034958|2270978:2293545|2285210_2286155_-|QAS85468.1|DBSCAN-SWA MDLIMEWRFLGSLSEARKSGCSGVYLIVHKGLFSRVVYVGVSCNVGRRITEHYDGYLRGNRTIYDAGHDEDVYRFMSAYKIHNHTKYYQALANDYKIWASTTMYSDLPKNMLAKSQTFDTDWQSIALEKYIPQLVVWALPMAKYCYLNASRIESVIQSKLIKSFDLRGFFNIKQLSILGKIEYPYMEKVKVFIINTPDLDPASQLIFSNLYNKKTDNNFCKEFRSQFKSEIFQRESETQRKRTIREHKVSLYENYGKPWTLKEMEKLRVMLVDFDLSPIEISEYLGREPRSISKKISENDKVTNYKWRESVGWL >CP034958|2270978:2293545|2275324_2275666_-|QAS85456.1|DBSCAN-SWA MKITLSKRIGLLAILLPCALALSTTVHAETNKLVIESGDSAQSRQHAAMEKEQWNDTRNLRQKVNKRTEKEWDKADAAFDNRDKCEQSANINAYWEPNTLRCLDRRTGRVITP >CP034958|2270978:2293545|2291411_2291663_+|QAS85474.1|DBSCAN-SWA 
MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDGRVTDGFILREGEFVTSLQALKEIGIKAGFSAFAEE >CP034958|2270978:2293545|2274016_2274727_+|QAS85454.1|DBSCAN-SWA MKYKLLPCLLAIFLTGCDRTEVTLSFTPEMASFSNEFDFDPLRGPVKDFTQTLMDEQGEVTKRVSGTLSEEGCFDSLELLDLENNTVVALVLDANYYRDAETLEKRVRLQGKCQLAELPSAGVSWETDDNGFVIKASSKQMQMEYRYDDQGYPLGKTTKSNDKTLSVSATPSTDPIKKLDYTAVTLLNNQRVGNVKQSCEYDSHANPVDCQLIIVDEGVKPAVERVYTIKNTIDYY >CP034958|2270978:2293545|2290983_2291196_+|QAS85473.1|DBSCAN-SWA MNGKSRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIMTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >CP034958|2270978:2293545|2293419_2293545_+|QAS87684.1|tail|DBSCAN-SWA MEIATEEETSLLEVWKKYRVLLNRVNTTTAPDIEWPVAPIG >CP034958|2270978:2293545|2277558_2278578_+|QAS85459.1|DBSCAN-SWA MKSILIEKPNQLAIVEREIPTPSAGEVRVKVKLAGICGSDSHIYRGHNPFAKYPRVIGHEFFGVIDAVGDGVESARVGERVAVDPVVSCGHCYPCSIGKPNVCTTLAVLGVHADGGFSEYAVVPAKNAWKIPEAVADQYAVMIEPFTIAANVTGHGQPTENDTVLVYGAGPIGLTIVQVLKGVYNVKNVIVADRIDERLEKAKESGADWAINNSQTPLGEIFAEKGIKPTLIIDAACHPSILKEAVTLASPAARIVLMGFSSEPSEVIQQGITGKELSIFSSRLNANKFPVVIDWLSKGLIKPEKLITHTFDFQHVADAISLFEQDQKHCCKVLLTFSE >CP034958|2270978:2293545|2289331_2290312_+|QAS85471.1|DBSCAN-SWA MKFKDKNLKALAECIIGDNKAFLYRSSSHITEFFQDCGMDVTHDGSTRWKWTAQRLEELLYEPQSKPHTLPERFVHVLRTLMLKEDAMDDDPGRLKALEELNKPLMREGYEAFYGDDRLLYIRHTDTKTVSVSNNPHRPLTPHEVECRRLLTAFLDTCSEDELIEDILLPLFRQLGFHRITAVGHKDKALEYGKDIWMKFTLPTQHVLYFGIQAKKGKLDASGASKSTNSNVAEIFNQVLMMLGHEIFDPETNRKVLVDHAFIVAGGEITKQARNWLGGKLDASKRSQIIFMDREDILNLYTVSNVPLPTGALISDDAVKNDDIPF >CP034958|2270978:2293545|2291729_2292008_+|QAS85475.1|DBSCAN-SWA MARNVKYYNSDNSPVLACTHERYSHAFKSEWFQHPPCTEEQAEWIIQCYRRRGYEVKKALSLDYRHWIISVRLPYSERPPRPSRTFQQRIWR >CP034958|2270978:2293545|2278635_2278746_+|QAS85460.1|DBSCAN-SWA MTIEKHERSTKDLVKAAVSGWLGTALEFMDFKSHAC >CP034958|2270978:2293545|2278765_2280046_-|QAS85461.1|DBSCAN-SWA 
MKYPTGVENHGGKLRIWFVYKGVRVRENLGVPDTAKNRRVAGELRSSVCYAIKTGVFDYAKQFPSSRNLEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEKNLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGYLADNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTGVRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAGTNRVIHLIKPAIDALRSQMTLTRLSKEHIIDVHLREYGRTEKQKCTFVFQPEVSARVKNYGDHFTVDSIRQMWDAAIKRAGLRHRKSYQSRHTYACWSLTAGANPAFIANQMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN >CP034958|2270978:2293545|2276332_2277547_+|QAS85458.1|DBSCAN-SWA MKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQCGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGTLWNW >CP034958|2270978:2293545|2290516_2290825_+|QAS87683.1|DBSCAN-SWA MATSKPTSVSRSKRTRRLACVGLGVFKEVLVVTGCCVPFLQNKITETIPNSYIESMMRQPHIYQNWCTSNTGGCRAGSQICASYCGCNGNLLSCYCSYGSPF >CP034958|2270978:2293545|2288369_2288972_+|QAS85470.1|integrase|DBSCAN-SWA MNTSPWNKDRIIGQKRPLQISHIWGIRIRLELEGKTRDLALFNMALDSKLRGCDLVKLKVSDVAYGGSVSSRATVLQQKTGSPVQFEITKGTREAVAALIQLSNLHSKDFLFRSRVGTNQHISTRQYNRIFHGGVEKLGLEDSLYSTHSMRRTKPYLIYKKTKNLRVIQLLLGHKKLESTVRYLGIEVDDALEISESIEV >CP034958|2270978:2293545|2273603_2273909_-|QAS87681.1|DBSCAN-SWA MKLSTCCAALLLALASPVVLAAPGSCERIQSDISQRIINNGVPESSFTLSIVPNDQVDQPDSQVVGHCANDTHKILYTRTTSGNVSAPAQSTQDGAPAEPQ >CP034958|2270978:2293545|2286702_2288052_-|QAS85469.1|DBSCAN-SWA 
MNAGLHFSEISKEQFDIYFYGRSPYLKTFSEEIRWFKYEGNGITLLSTIIICNIDKDFNAIVLGRDLDKKFRAINVLASFDSMDVLLNNLNDDIPKMLAQHQNGTFMQGDESTKPFSLFLSKVPAKKRNVYIKMLLEDPLHFPAYIVLEELAYWFKDPDGIFIRDFQSDAFNSRLFELYLNAVFYELDFEMNREYNQPDFLLSKFGVEIAVEAVSIAEAEAPLERKVINDEQMDELRKHVLNVMPFKFARSLLKKVRHCPEPEKVHYWELNHTKNKPFVIAMQDYSKRMSMAFSSEALHSYLYGIDIESGISIERHTDENRSIKSNFFGSEQNNYVSAVLLTTQATIPKFNRMGILAGVEASGFKVYVSGVKTDQDAAPHPFSADVSDPNYQEPWCTAMYMYHNPNAIHPVDYQLFPNVVHVFKKDEHFEEYIPRNYILQSTTMICKTE >CP034958|2270978:2293545|2284658_2285081_+|QAS85467.1|DBSCAN-SWA MAKVFTQEEREKIKGQVVELVRRSGRETLRQLEVKTGATRYLMSVLARELVASGDVYNSGYGLFPSEQARKDWQNARKKLSRAKLKEPSAVDPDLIWSLPDGEIRRYDRRHNMICTECRKSEVMQRILSFYQGDVRYLLK |
27 | Escherichia_phage(26.67%) | tail,integrase | attL 2268190:2268203|attR 2294805:2294818 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
2696228 : 2707006
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP034958|2696228:2707006|DBSCAN-SWA GTCATTTTGCTAATAACACTTTTCTGGCTTCATCGACTTTATTTGGAGCATCGAACCAATATTCGGCGAGCAAAGGGCAGATCTCTGTATCAACGACTTGTTCATACCAAGCCTGGGCATCGTTGATTTTTTGGCCGATAGCTGGGGTGACATAGCTGTGACCGATACAAAACTGTGGTCCTAAGGTAACATCTTTTGCCAGCATATCGTTAAGTACCGTCAGTCGAGATTTGATGAATGCTAACATGTCAAAATCAATTGCGTAATTATAATTTACCCAGTTTCTCCACGCGTCATTAAAAGCTGGCTTTAGATCGATAAACGCGAAACGACGGCGCAGCGCTAGGTCAAGTAGTGCGAGTGAGCGGTCTGCAATATTCATCGTACCGATGATATATAGATTCTCAGGAATGTATATTTTTTCATCATCATTTTTAGGATAGGAAAGAGATAATGCCTCAGTCGGCGTGCGTTTATCTGCTTCCATCAACGTGAGTGTTTCACCGAAGATTTGCGCCGGATTGCCACGATTAATCTCTTCAATTATCACCACATATTTCGAGGTAGGATTATTGACTGCAGTTTTGATTGCATTTACAAAAGGTCCATCAATTAGCGTCAATTGCCCTTCTTTACCTGGACGCCAGCCGCGAATAAAGTCTTCGTAAGAGAGGTTCGGGTGAAATTGCACCGCGCTAATACGCTCAGGTGCTTTTTCTCCCATCAAGCAGTACGCCAGACGTCGCGCTAACCAGGTTTTTCCAGTTCCGGGTGGTCCTTGTAATATCAGGTTTTTCTTGTCGATCAGGCGCTGAAGTGTGAGTTGGATCTTAGCCTCTTCCAAGAAACAGCCATCCTGCACCAGATGACTGATGTCATAAGGAACGTGAGTGAGTTTTGGCAACGGCGCACTCTCTTCGACAGTTTCCTCAGTTATCGTCTGGGATTCATTTTTCTCAAAATTCAGGTATTCATAATTTCCGGACTCAAGGGACTTTAACTCATCATCATTGCATAGTTTCTGTAGATAAAAGTAGATTGAGCTTGTAACTGTTTTGCTCTCAGGATGCTCTACCTTGAATACGTCCAAGTAGTTTTCTTCCAACTCCGAAGCAGTGAAATAAGGGCCATTTTTTTTCAGGCATAACGCCTTAATTTTATTCAGTAAAGAGGCTTTCCACGTTCTATTTGCCACCTCATCTTTAGACTGGCTAAGATCTGTATTCCACGCTGATAAAGAAAGTTCTGGGAAAGAATGAACCGGATAGTTTGGTTGGGTAAAGACCTCGTTCAGCGCCCGCATAAGACTCAGGTAACTTTGTCCACTACAGCGACCTTTTGCCCCGTTTTTAATGATCTTAATGTTTAACACTGTCTGAATGTAATACTGCGACTGGCTATCTAAAGTAGGGTAGAACCAAGGACGGGTCCAGTACAACCCCATGGTGAGATTCCAACCGACATTCATTACTGTAGAAGCAATGTCATATGCGGCAGTGAAGTCTGCAGAGTTAGTATTCTGGTTATCCGCAAAGGTCATTGCCTGCGAGAACATTTCCCACAAGCATTCAATGTCATTAGGGTCGCGTGACTTTTCATAACCAAAGAACCAAGATTTTTGGTTATTCAACAGCGGGATTCCGGCAAAGGAGTCAGGAATCGGTTCGTTCACGCCCAACAAATTCGCTAGCTTGGCAGCAATAATTTTGCGATTGCTGTCGGTCAAGTTACGATTGAACAAGCCCATAGTAGTAAACGGACAAATGTCTTTTAAGGGAAAGATCTCTCCCATAATAGATTTGTCCTGCAGATGGGACATTCCTTCCACACCTGAGGCAATTAGATGAATACCTTTGACTAATTCATCTCTACGATTTCGCCAAGTCAGTAACGCGTTGGCAAAAGCCTCATAAAAACTAGCCCAAG
CAAATTTGCCATCATGTTCTGCTGTATCCACGGGAACTCCATTATACTTGTTGAGCAATGATAATTTATCTGTGTGAGTTTTAACATATTACTATCAGTTATAGAAAAATTTAACTACCCGATACAGAGAGCGGCATGCTGAATTTGACCTGACTTGCTTCCAACTAATTAAAATCAACTTATTTATCAATTGGTTATTTTGGCGCATAGCGGTCATCAAGGGAATATCGCGTTGTCATAAGGTGTCGAGGCTCGGAGGTTCAAATCCTCTCATGCAAAAAATAAATAAAATTAATGACGGTTGGAAATTATTCAATACATACACTCTCGAAAGTGCATCAGCCAACCGCAGCACGTCTTGCATACGGCGTGTCTGCAGTTTTATATAATCCTGGCTGGAAACCTCTTATACAAAGTAGATACACCAATATCATAGATGATCGCCACCTTCTGACGCGGGACTTTTGATGCAATTAATCGCCCGGCCTGCGCCCATTGTTCTGATGTAAGTTTAGGACGACGTCCACCAATTCCTCCCTGTGCGCGAGCATCTTCCAGTCCAACTTTTGTCAGTCATCAATCAGTTCACGCTTCATTTCAACCAGGGCTCCATCACATGGAAAAATAAAAAAAACCATCGATGTTGACGTGTCAATGTTACCTGTCAGGCTACGGAAATCCCCCCCTTACCCGCAGCTACTCGGTAAGTATAATAAGATGTTTCCTGCTCCTTCCTAACCTGTCCAGTTTCCAGACAATTAAAGTATCACCTTATATAAGAGACTGCAGAGCGCACTTTAACCTAGGTCGTTCTGAAATAGTTGCAGATTCGATTAATCAACGATTATACTCCCCGGCACTCCAGAGGATCTGGTAAACATAAAGTCAGAGCTTGGGTATACTGGCACACAAATGGCAGATCTTGCAGGTGCAGCCAGTCATAGCCAGTGGCGAAAATACACGAGTGGTTCTGAGCCCCGCGCCATGTCATCACATATCTTGTTTTTTTATTGCTACCAATCTGACTTTGAGTACTAATGAGCTAGATAGAATTGTTGGTAATGACTCCAACTTACTGATAGTGTTTTATGTTCAGATAATGCCCGATGACCTTGTCATGCAGCTCCACCGATTTTGAGAACGACAGTGACTTCCGTCCCAGCCTTGCCAGATGTTGTCTCAGATTCAGGTTATGTCGCTCAATGCGCTGAGTGTAACGCTTGCTGATAACGTGCAGCTTTCCCTTCAGGCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATACCACGACCTCAAAGGCCGACAGCAGGCTCAGAAGACACTCCAGTGTGGCCAGAGTGCGTTCACCGAAGACGTGCGCCACAACCGTCCTCCGTATCCTGTCATACGCGTAAAACAGCCAGCGCTGACGTGATTTAGCACCGACGTAGCCCCACTGTTCGTCCATTTCAGCGCAGACAATCACATCACTGCCCGGTTGTATGCGCGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAACCGTGTTGAGGCCAACGCCCATAATGCGTGCACTGGCGCGACATCCGACGCCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGCTGAGAGGCGGTGTAAGTGAACTGTAGTTGCCATGTTTTACGGCAATGAGAGCAGAGATAGCGCTGATGCCCGGCAGTGCTTTTGCCGTTACGCACCACGCCTTCAGTAGCGGAGCAGGAAGGACATCTGATGGAAATGGAAGCCACGCAAGCACCTTAAAATCACCATCATACACTAAATCAGTAAGTTGGCAGCATTACCTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAACGCCTTTACCTGATTTAGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCCGATTCGGGTAATGTTGACCATTCACTGACCACATTATTGATGCCGATATTCCG
CCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTTAATTCGGAAAGTTGCTCGTTGCTAACCCAATCCAGTTTTTTATTAAAGCCATATGTTTTGCGCATGACAGCCAGAAAGACCAGAAGCTGGTGCTGTGTTAATCCGGCCAGCATCACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCCGCTCCTTTTGTGCCACATCCGGCACAGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCCGCAGACCGACTTTCGCCATTTTTGAACCTGTCATATTGCCCCCAGCATGGTGGTGACCATCGCCATCAATGGACCAGCCAGATCCGGGTCCACTCGAAACATCGACACAATGCCTTCACTCATCTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACCGCCTGCTTTGCCTCACAGAGTTCCTTTTCCATTTCAGCCAGCCGAGCCATGAAGCTATCCTGCTCAACCAGGTGGCCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATAGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAGGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTAGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACCGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATTGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTGAAACAGAAAGCCTCTGAGCAGAAGGTGGCTGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGAGATGATGCGTGGAACAAATTACGACTCGGCGTCATCACGGCTTCAGAAGTTCACAATGTGATAGCAAAACCCCGCTCCGGTAAAAAGTGGCCTGACATGAAAATGTCCTACTTTCACACCCTGCTGGCTGAGATTTGCACCGGTGTGGCTCCGGAAGTTAACGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAATGACGCCAGAGCCCTGTTTGAGTTTACTTCCGGCGTGAATGTTACTGAATCCCCGATCATCTATCGCGACGAAAGTATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAGCTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAGTTCCGGCTCGGTGGTTTCGAGGCCATAAAGTCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACACGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGTATGAAGCGTGAAGGACTGCATTATGTCGTGGTTGAGCGGGA
TGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGCGCGATCACTTTCGTCTACTCCATTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCACTGAAAGCACAGCGGCTGGCTGAGAAGATAAATAATAAACAGGAGGATATATGAGTCAGGTTGGTAATCATTCATTCGAATTTCCGGCATCGCAAGGTGTACAGGGTGGTACTGTTACACTCTTCCTTACCATACCAGGAAGATCGCTGGCTCGTTTCCTCGCTTCAGATAATTACGGCCATACACTGGAACGCTCTCAGCGAGAAATTAATCCAAATCGAGTACGAAAATTTTTAAATTATCTCACTAACGCAGACTCAAGAAATGAGTCTTTTATCATTCCCCCTCTCGTAGGTAACTGTGATTCGAATATAGAATTTGTACCGTTTGGCAACACAAATGTTGGTATAGCCAGAATTCCCCTCGACGCCGAAATAAAACTTTTTGATGGCCAACATCGTGCAGCTGGCATTGAGATATTTTGCCGAAGTTCCCCATCAACGCTCATGGTTCCCATGATGCTTACAATGAATCTGCCGCTAAAAACCCGGCAGCAGTTCTTTTCGGACATAAATAACAACGTTTCTAAGCCATCAGCGACCATCAATATGGCGTATAACGGCCGGGATGATATTGCTCAGGGAATGATATCCTTCCTGACCCAACATACTGTATTTGCCGATATAACCGATTTTGAACACAACGTAGTGCCATTAAAAAGTAATATGTGGGTGAGTTTCAAGGCACTCACTGATGCAACGTCAAAGTTTGCTAGGAACGGCAATCAACAACTTGAAATGGGATATATAGAATCTGTCTGGGAGGCATGGATTACACTAACTCAGATTGACTCAATCCGACATGGTGTACACCACGCTACGTACAAGCGCGATTATATTCAGTTCCATGGAGTAATGATTAACGCTTTCGGTTTTGCGGTTCAACAGATGATGGTTAATCATTCCATCGCAGAAATAACTTCTATGATCGAAAAACTCTGTGCAACTACCAGCTCTGCAGAAAGAGAGGATTTTTTTCTGATGGATAACTGGGCGGGGATCTGCACGAAAGCCAGCCAGGAAAAACTATCGGTTATTGCCAATGTGGCAGCGCAGAAAGCAGCAGCAAACAGACTGATACAAGCTTTTACCAAAGGAAGTCTGGAAACAACTTAATGAATCAACATTGTCTCATATCAGCATGCTGTACGGCGTCTTTAAGGAACGGTGAGCATGAAAAACAAAATCATCATGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATCATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGCTAGATGAACGAAGACGCCTGGCCGTTCAGGGTTGTCGGACTTGTGCAAGTTGCCAGGAGGAGATCGAACTTAAGAACAAACAATGGGGACTGTGATGGCCTCAAAGCAGCAAATTTCAACATCGTCCAACTGAGGTGTAAAAATGTTCAGAATCATTTTTCCTAACACCTGGTACGTCGACCACCACGGCACTCCCTGCAAAATCCTGCGTTCTACCCACAACAAAGTTCACTACATCCGAAAAGGCAGAACATGTATCGCCAGCATGTTCCGCTTTAATCATGACTTTGAACCTGTGAATAAAGCTGATGCAGATCGGATAGCAGAAGAGATCGAAACGGCAGAACACATTAAGAAGTTACGTGCCATACGCAGGAAATAGAAAAATTGATAAATTCAAT
ACTGCATTTCTCAGCATTAAATTTATCTCTATGACCAGTCAAGAGATGTACCTGCCATGAGCTTAATATCATGTCAGATATATCGGTCACAAACTCCCTCAGCAGCTAAGAGGAGGACAAATGTCTCGACTAATCACTTTACAGGACTGGGCTAAAGAAGAATTTGGGGACTTAGCACCAAGTGAGCGAGTTCTGAAAAAATACGCGCAAGGGAAAATGATGGCCCCACCCGCTATAAAAGTTGGTCGCTACTGGATGATTGACCGAAATTCCCGTTTTGTAGGAACGCTGGCAGAACCGCAACTCCCAATAAACGCAAACCCAAAACTCCAACGGATAATCGCTGATGGCTGCTAGACCCCGATCTCACAAAATCTCTATACCCAATTTATATTGCAAATTAGATAAGCGAACCGGAAAGGTATATTGGCAATACAAACATCCACTATCCGGTCGTTTTCATAGCTTAGGAACTGATGAGAATGAAGCAAAACAAGTTGCTACTGAAGCAAATACCATTATTGCTGAACAACGTACCAGACAAATATTAAGCGTCAATGAGCGTCTGGAAAGAATGAAAGGCAGGCGCTCAGACATTACGGTGACAGAATGGCTTGATAAATATATTTCTATCCAGGAGGACAGGCTGCAACATAATGAACTAAGACCCAACTCCTATCGGCAAAAAGGCAAACCCATTCGTCTTTTCCGTGAGCATTGTGGAATGCAACACCTCAAGGATATTACCGCACTTGATATTGCCGAAATAATTGATGCTGTAAAGGCTGAAGGTCATAACAGGATGGCGCAAGTCGTGAGAATGGTGTTGATCGACGTCTTCAAAGAAGCACAACACGCAGGACATGTTCCGCCAGGATTTAACCCAGCGCAGGCAACAAAACAACCGCGAAATCGAGTAAACCGCCAAAGATTGTCACTGCCCGAATGGCAGGCAATATTTGAAAGCGTAAGCAGACGGCAGCCCTATTTAAAATGCGGCATGCTACTTGCTCTTGTTACTGGACAACGTTTAGGCGATATCTGCAATTTGAAATTCTCTGATATATGGGACGACATGTTGCACATTACTCAGGAAAAAACCGGTTCAAAACTTGCTATTCCGCTTAACCTGAAATGCGATGCTCTGAATATTACCCTTCGTGAAGTTATATCTCAGTGCAGGGATGCTGTTGTTAGTAAATATCTGGTCCATTACCGTCACACTACCTCTCAAGCAAACAGAGGAGACCAGGTTTCTGCAAATACTCTGACAACGGCTTTTAAAAAGGCCAGGGAAAAATGTGGCATAAAATGGGAGCAAGGAACTGCGCCCACATTTCATGAGCAGCGATCTCTGTCAGAACGGTTATATCGGGAACAGGGTCTGGATACGCAAAAGTTGTTAGGCCATAAATCCAGAAAAATGACCGACCGATACAATGATGATCGTGGTAAAGACTGGGTTATCGTAGATATCAAAACAGCATAGAAAATAGCCAGTTTTGGGGAAGGGTTTTGGGGAAAGTTTTGGGGAAGATTTTACATCATCATAAAACAACGGGCGTATAACACGCCCGTTTCAATATTTAACACATGTAGAGATTACATGTTCTTGATGATCGCATCACCAAACTCTGAACATTTCAACAGTTTAGCGCCTTCCATCAGACGTTCGAAGTCATAGGTTACGGTCTTCGCATTGATTGCGCCTTCCATACCTTTAACAATCAGGTCTGCGGCTTCGGTCCAACCCATGTGGCGTAACATCATCTCAGCAGAGAGAATAATAGAGCCTGGGTTTACTTTGTCCTGACCGGCATATTTCGGCGCAGTACCGTGGGTGGCTTCAAACAGGGCGCATTCGTCACCGATGTTTGCACCTGGGGCGATACCGATACCGCCAACCTGCGCTGCCAGGGCGTCAGAAATGTAGTCACCGTTCAGGTTCATACAGGCGATAACATCATATTCAGCCGGACGCAGCAGGATCTGTTGCAGGA
ATGCATCAGCAATCACGTCTTTAATGACGATCTCTTTGCCGGTGTTCGGGTTTTTAACTTTCAGCCACGGGCCGCCGTCGATCAGTTCACCGCCAAACTCTTCACGCGCCAGCTGGTAGCCCCAGTCTTTAAACGCTCCTTCGGTGAACTTCATGATGTTGCCTTTGTGCACCAGAGTCACAGAGTCACGATCGTTAGCAATTGCGTATTCGATCGCTGCACGAACCAGACGTTTGGTGCCTTCTTCCGAACACGGCTTAATACCGATACCACAATGTTCCGGGAAGCGAATTTTCTTCACCCCCATCTCTTCACGCAGGAATTTAATCACTTTCTCGGCGTCGGCAGAGTCTGCTTTCCATTCGATACCCGCATAAATGTCTTCCGAGTTTTCACGGAAGATAACCATATCGGTCAGTTCAGGGTGTTTAACCGGGCTTGGAGTGCCCTGATAGTAACGTACCGGACGCAGGCAGATGTAGAGATCCAGTTCCTGGCGCAGGGCAACGTTCAGAGAGCGAATACCGCCACCAACAGGAGTGGTCAGCGGACCTTTAATGGCAACGCGATATTCACGAATCAGATCAAGGGTTTCAGCAGGCAGCCAGACATCCTGACCATAAACCTGTGTGGATTTTTCACCGGTGTAAATTTCCATCCAGGAGATTTTACGCTCGCCTTTATAGGCTTTCTCGACTGCAGCGTCGACCACTTTCAGCATGGCTGGGGTTACATCTACACCGATTCCATCACCTTCAATGTAAGGGATAATCGGATTTTCAGGAACGTTGAGTTTGCCGTTTTGCAGGGTGATCTTCTTGCCTTGTGCCGGAACAACTACTTTACTTTCCAT
Protein sequences of DBSCAN-SWA_4 >CP034958|2696228:2707006|2696228_2698184_-|QAS85823.1|DBSCAN-SWA MDTAEHDGKFAWASFYEAFANALLTWRNRRDELVKGIHLIASGVEGMSHLQDKSIMGEIFPLKDICPFTTMGLFNRNLTDSNRKIIAAKLANLLGVNEPIPDSFAGIPLLNNQKSWFFGYEKSRDPNDIECLWEMFSQAMTFADNQNTNSADFTAAYDIASTVMNVGWNLTMGLYWTRPWFYPTLDSQSQYYIQTVLNIKIIKNGAKGRCSGQSYLSLMRALNEVFTQPNYPVHSFPELSLSAWNTDLSQSKDEVANRTWKASLLNKIKALCLKKNGPYFTASELEENYLDVFKVEHPESKTVTSSIYFYLQKLCNDDELKSLESGNYEYLNFEKNESQTITEETVEESAPLPKLTHVPYDISHLVQDGCFLEEAKIQLTLQRLIDKKNLILQGPPGTGKTWLARRLAYCLMGEKAPERISAVQFHPNLSYEDFIRGWRPGKEGQLTLIDGPFVNAIKTAVNNPTSKYVVIIEEINRGNPAQIFGETLTLMEADKRTPTEALSLSYPKNDDEKIYIPENLYIIGTMNIADRSLALLDLALRRRFAFIDLKPAFNDAWRNWVNYNYAIDFDMLAFIKSRLTVLNDMLAKDVTLGPQFCIGHSYVTPAIGQKINDAQAWYEQVVDTEICPLLAEYWFDAPNKVDEARKVLLAK >CP034958|2696228:2707006|2703628_2703847_+|QAS85828.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPLDERRRLAVQGCRTCASCQEEIELKNKQWGL >CP034958|2696228:2707006|2704273_2704510_+|QAS85830.1|DBSCAN-SWA MSRLITLQDWAKEEFGDLAPSERVLKKYAQGKMMAPPAIKVGRYWMIDRNSRFVGTLAEPQLPINANPKLQRIIADGC >CP034958|2696228:2707006|2703894_2704134_+|QAS85829.1|DBSCAN-SWA MFRIIFPNTWYVDHHGTPCKILRSTHNKVHYIRKGRTCIASMFRFNHDFEPVNKADADRIAEEIETAEHIKKLRAIRRK >CP034958|2696228:2707006|2705755_2707006_-|QAS85832.1|DBSCAN-SWA MESKVVVPAQGKKITLQNGKLNVPENPIIPYIEGDGIGVDVTPAMLKVVDAAVEKAYKGERKISWMEIYTGEKSTQVYGQDVWLPAETLDLIREYRVAIKGPLTTPVGGGIRSLNVALRQELDLYICLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIYAGIEWKADSADAEKVIKFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAIANDRDSVTLVHKGNIMKFTEGAFKDWGYQLAREEFGGELIDGGPWLKVKNPNTGKEIVIKDVIADAFLQQILLRPAEYDVIACMNLNGDYISDALAAQVGGIGIAPGANIGDECALFEATHGTAPKYAGQDKVNPGSIILSAEMMLRHMGWTEAADLIVKGMEGAINAKTVTYDFERLMEGAKLLKCSEFGDAIIKNM >CP034958|2696228:2707006|2701270_2701582_+|QAS85825.1|DBSCAN-SWA MPLLCCEATYIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKASEQKVAA >CP034958|2696228:2707006|2702255_2702414_+|QAS87700.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSITKRGWVFPGLSVIRNPLKAQRLAEKINNKQEDI 
>CP034958|2696228:2707006|2700548_2701088_-|QAS85824.1|DBSCAN-SWA MQLLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGHLVEQDSFMARLAEMEKELCEAKQAVILNAPRHQKLKEMSEGIVSMFRVDPDLAGPLMAMVTTMLGAI >CP034958|2696228:2707006|2702410_2703475_+|QAS85827.1|DBSCAN-SWA MSQVGNHSFEFPASQGVQGGTVTLFLTIPGRSLARFLASDNYGHTLERSQREINPNRVRKFLNYLTNADSRNESFIIPPLVGNCDSNIEFVPFGNTNVGIARIPLDAEIKLFDGQHRAAGIEIFCRSSPSTLMVPMMLTMNLPLKTRQQFFSDINNNVSKPSATINMAYNGRDDIAQGMISFLTQHTVFADITDFEHNVVPLKSNMWVSFKALTDATSKFARNGNQQLEMGYIESVWEAWITLTQIDSIRHGVHHATYKRDYIQFHGVMINAFGFAVQQMMVNHSIAEITSMIEKLCATTSSAEREDFFLMDNWAGICTKASQEKLSVIANVAAQKAAANRLIQAFTKGSLETT >CP034958|2696228:2707006|2701578_2702259_+|QAS85826.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWNKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEICTGVAPEVNAKALAWGKQYENDARALFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP034958|2696228:2707006|2704499_2705642_+|QAS85831.1|integrase|DBSCAN-SWA MAARPRSHKISIPNLYCKLDKRTGKVYWQYKHPLSGRFHSLGTDENEAKQVATEANTIIAEQRTRQILSVNERLERMKGRRSDITVTEWLDKYISIQEDRLQHNELRPNSYRQKGKPIRLFREHCGMQHLKDITALDIAEIIDAVKAEGHNRMAQVVRMVLIDVFKEAQHAGHVPPGFNPAQATKQPRNRVNRQRLSLPEWQAIFESVSRRQPYLKCGMLLALVTGQRLGDICNLKFSDIWDDMLHITQEKTGSKLAIPLNLKCDALNITLREVISQCRDAVVSKYLVHYRHTTSQANRGDQVSANTLTTAFKKAREKCGIKWEQGTAPTFHEQRSLSERLYREQGLDTQKLLGHKSRKMTDRYNDDRGKDWVIVDIKTA |
11 | Enterobacteria_phage(40.0%) | integrase | attL 2694201:2694224|attR 2705709:2705732 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2835813 : 2880466
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >CP034958|2835813:2880466|DBSCAN-SWA GTTACTTACGGTCCGTAAACGGGCTGCCCGGACAGGGAATCGATAACTGCTCTCCCATTTTATCCTCTTCAAGCTGGTGCTTTATGTAATCCTGTATCTTCGCCGTGTTCTTACCCACTGTATCGACGTAGTCCCCCCTGCACCAGAACTCCCTGTTCCTGTATTTGAATTTCAAATCACCAAACTGCTCGTAAAGCATCAGACTGCTTTTCCCTTTCAGATATCCCATAAAGCCGGATACGCTCATTTTGGGCGGGATCTCCACAAGCATATGGATATGATCTGCACAGCATTCAGCTTCCAGAATCCGTACACTTTTCCACTCACACAGCTTTCTCAAAATACTGCCTGTTGCTCTACGCTTCTCTCTGTAGAACACCTGTCTTCGGTATTTTGGCGCAAAAACTATGTGATATTTACAGTTCCATCGGGTGTGCGCTAAGCTCTTTTCGTTCCCCATTTGAACCCCTTTTGATTTCTTGTTTGACTCTTGCAGTTGCCAGACCGCAAGGTGTTTTAACAAATCCGAGGATCTTAGTATGAATATGGAAGAAATTGTGGCCCTTAGTGTAAAGCATAACGTCTCGGATCTACACCTGTGCAGCGCCTGGCCCGCACGATGGCGTATTCGCGGGAGAATGGAAGCTGCGCCGTTTGACGCGCCGGACGTCGAAGAGCTACTGCGGGAGTGGCTGGATGACGATCAGCGGGCAATATTGCTGGAGAATGGTCAGCTGGATTTTGCTGTGTCGCTGGCGGAAAACCAGCGATTGCGCGGCAGTGCGTTCGCACAACGGCAAGGTATTTCTCTGGCGTTACGGCTGTTACCTTCGCACTGCCCGCAGCTCGAACAGCTTGGCGCACCACCGGTATTGCCGGAATTACTCAAGAGCGAGAATGGCCTGATTCTGGTGACGGGGGCGACGGGGAGCGGCAAATCTACCACGCTGGCGGCGATGGTTGGCTATCTCAATCAACATGCCGATGCGCATATTCTGACGCTGGAAGATCCTGTGGAATATCTCTATTCGGGGAGCTACACGCGACAACCAGGAATGCAGCCGTAACTGCAGCAACGACGGGCAAAATGCGCATGGGATTTTCCTTGCTGTATTTTTGTTAAGTGTAGATGACAACAGGAAAAAAAGAGAAAGAAAGGAGGCCCAATATCCTGGGCCTCATCGTCAGTTATTGCAGCTTTTCAAGAATGCGCCAGGCCGCCTCGACACGGACAGGGTTAGGATAGCTTTTGTTTGCCAGCATCACGATGCCAAGGTTTTTTTCTGGAACGAAGGCTACGTAGCTGCCAAATCCACCAGTGGAGCCCGTTTTATGCACCCATGAGGCTTTCACTGCGGGGGCGGGCGGGTTTACCTCAACGGCGGGAAGCGCTGCCAATGCCACTTTGCTGTCGCTGCCGTTGATGATCGAATCAGCTTTCAGCGGCCAGTTCAGCATCTCCCAGCCTAATCCCTGGTACATATCGCCAATACGCCAGTAGCGAGACTGCGCAAGCGCAATGCCCTGCTGGAGCGTTTTCTCCTGAACGTGGCTGGCATCCATGTTGGCCTGAACCCAGCGGGCCATATCAATAACGCTGGATTTCACGCCATAGGCTTCGGCGTCAAGTTGTCCCGGAGAAACGTGTACGGGCTTCCCTTCGCGATAGCCCCAGGCATAATCTTTTTGTTCGTTCTGCGGAACCGTAATCCAGGTATGCGCCAGTTTTAATGGTTGCAGGACGCGTCTGGTCATTGCCTCTTCGTAACTCATTCCTGAGGGTTTCACCGCCAGCGCGCCAAACAGACCAATGCTGGAGTTAGCGTAAAGTCGCTTAGCGCCCGGAGTCCATTGCGGCTGCCAGTTTTGATAAAAATGCAGTAATGCGGCTTTATCCCTAACGTCATCGGGGATCTGCAGCGGTAG
GCCGCCTGCCGTATAGGTGGCTAAGTGCAGCAGGCGGATACCCTGCCACTGTTTGCCTGTCAGTTCTGGCCAGTATTTCGTGACCGGATCGCTGAGCTTAATTTCGCCGCGGGCGATAGCATCGCCGCCCAACACGCCGTTAAACGTCTTACTAACCGATCCTAGCTCAAACAGCGTTTGCTGCGTGACTGGGTGGTTATTGGCGATATCGGCTTTACCCCAGGTGAAATAATAGGGTTTTCCCTGGTAGATAACGGCAACGGCCATACCCGGAATAGCCTGCTCCTGCATCAACGGGGTGATGGTGCGATTAACGATATCGGCAATCTGTTGTTCTGTTTTTGCGGCAGCAAATGTGGAGAAAGAGGCTGTCAGCAGCAGAGCGCAGCATAACGATTTTTTCATCATGAAATCAGTTCCGTAATTAAAAGCAAAAAGGTGTCCGGGCCCGTCAGACGCAATCAGTGTGTTTGATTTGCACCGTGTTGACAAACGGTTAAATTTAGCAGCAGATATAAGTTTTTCCTAAATTCCACGTGTGTTTTTTATTAGCTTCAAAAATCACTATTTCACGAAGAATTTAGACTGCTTCTCACACATCGTAACATTATTTACAACCACCTTTCAATCATTTTTGATAAATCATTGATTTCATCTTTGCTGCAATGATACTTAATAAACTCTGCAAGTTATCCACAGAGCAACACTCAATTTTATTGATGATATTCTTATTATACCAGACATTTTTCATACACTCCCTTGTACGGATAGTTTTCCGACAACTTCATGATTACATATCTTGCGGTTTTGATTATTTTTGCTGCAAGAAATACATACTTCAAACGAAAGGTCTTTATTTGCTGTCTGTATTCTGAAGAGTCCAAGGAATCAAACTTGAACAACAAAAATAGGTTATATGAAAGCATCATCATTTGAAACACGGCTTCATTCGCCCAAAATGACTTTAGCAAGAGATGACCCACCGCCATGTCGTATTTGGCTTCTTTGATATAGTTTTCAGCATTACCACGCTTTTCATAGTATATAACTACTTTTTCAGAAAGCAAGGTAGTATTTGTTACAAAGAAAAAGTAGTCGTATTCGGAACCTTCTAAAAGTGATAATTGTGCTCTTTCTTTTTCTGGTTTCAGTACGCGAGATACGACAAATCTTCTGTCTTTTTCCCATTTAACTAATTTTGTATACAGTTCTGTAGTTTCTCTACCTTCTTCTCCTTTAACGAATACAATTGATGAATTCGTTGCTTGTGAGGTGAGTGTAGAATAACTTTTGGCTTTAATTAAATATTTGCATCCAAGAGATTCTATCGTTTCGATAATTTTTTCATCAAAGTAGCCACTATCCATTCGAAATAAAATTTCTAAATCGTCTGATTTGATGTTAGCAACAATTTCTTTGATCATTTCCGCAGCACCGTTTGCAGTGTAAGTATTGCCACTTCTTACAAATCCGGTAACATATGCTTTTAATTCGTCGCAAAATGCAAATTGGATATTGTAGCATCGGTTTCCCAGTTTCTTAGGATTATATCCTTTTGACGCACCTTCTTGATGACCTTCTACGTTAATTACACTACTATCAATATCAATCGTAATGGATGTCAATTTACTTTTAGTGAGCAGTTTTTTAAAGACTTTAAAATTAATGTCTCTAAACATTTGGGTTGTCTTGAAGTTGAAGTTTCCTAGAAACCGTGACACTGTTTCAGGTTCTTTTACGGAAATATCAAACTCGTTGACGAGGGGATCATTTTGAAGTAGCTTTAGACGTTCTAACTTATCAATGCCAATGAAGTGACCGCAGAGCATGGTCTTTATATGATTCATCTTGATTTTATTTGTTGAGTCATTATCAAATACGAGGTCATTTTCAATAAAATCAAAAATCCCATTGCTTTTTGCATTCTCAAGGAGCAGAAAAAGACCTGCATTTGATGTTAGATTCTTAGCTTTGAAATCAATTTTATTAATCATAATTAG
AACCCCTTTTTACTACTTTTCTTACTATTATTTTACCATATATCGAGTCATAAAAGCTGATAATTTAACATATTTTTGAGCACTTTTCTTTCACCCAATGGGTGAAAGCTGAATTTCGAAGGAATGCATATTTATCAAGGCTTTGATTATGCTTTTTGAAGTACTGACGTAGAATCTAGGCAAATCAAAAGGGGTTTTAATAACTGGCTCAAAGCTGAAAGCTTTCCGGAACCCCCAGCCTAGCTGTAATGCCAGTCAGTTAAGCAACTGACTGGCTCTTTTTCGGGGCTGTGGGGTATTTCCAGGGCCTCTCCTTTACCACTCTCGGGAAGGCCCTTTCCCTTCTTGTCGGTAATTTCACAAGTTGTCCCATACTTGCAAGATCGCGCATCAGCTCCGGTATACGTCCCGGTGAAGCGCCCTGCAATGTCATCAGCATTCTCATCACCATTCCACATGATTCTGAGAAACTCAGTTGATTCGGCCAGTAACCTTTCAGATGTTCCGCCATTTTAATCATCTGATATCTCACCAGATTATAAGCCAGTAAGACACCCCACAGCTCTTGCTCCACAAGCTCCGGTTTTTTACTTCTCAGCGTCAGCCTGCTCAGTTGCATCGTCTGTTTTATCTCCCTGTATCCCAGTTCGATTTCCCAGCGATGACTGTACAGATCCGCCATTTCTCCTCCGGGGAAGCGCATGGCGTCCGTCATCGACGTCAGCAGATGGCAGACTTTTCCTTTGCGCGTCACGGTCAGCAGGCGGGCTGTCACTTCATTTCCCAGTCCCGGCCACTTTTTTCGTGCCTGCGGGCTGGTTTTCAGCTTCACCAGATGATCGCCTTTACCCAGTTTTCTGATCTCTTCATATTGCGCTCCCTTTCTGAGAGGTATCATCCAGTGGCGGTGTTCTCCCGCCAGGCTCCAGGCATTTAACAGTCCCAGTGAGTAATAACCTTTATCCATTAACGTCAGAGTGTTATCGCCGGTTTGTTCTATAAGTTGCTCAGCAAGCTCATTTTCGCTGTTCTTCATCGTGCCGAAGGCTGCAGCCGTCAGCAGATGGCTGGTCAGTTCCATCTGGCAGACCATTTTGACCTGCGGGTAGAGCGCCGGGTTCCCGGCATGTGTCTGGCGGGGGAAGGCTGCATCGTTCTCTGGTGTATCCGGTGTGCGCCAGAACACACCATCGATGGCCAGCAGGGTCAGGCCGCACCAGTGCGGATGCGGCGTGGCGTTATGCCAGAGCTGCGCTGTTTTCGTGAACACGCGGCGGACAGCCTCACTTCCCAGGCGCTGGCGGGCCTGAATAACGGCACTGGGGGCAACGAAGGGGCGATTGCCCGGCAGCATGATGTCCAGGCGATTCACAATCTGGTGAAGAGGTTCTTTACGCTCAAGCGCCATGCCAACAATACACCAGACCATCATTTCGAGGGGAAGACGGCGCTTGCGTAGCGTTACAGTACCTGATTCGGCAAGGCAACGAGAGATGAGTTCGGGGTCGAGGTAATCCCCCAGAGAAGTCAGTGGGTTACGCAGAGAATCGTAACGGGATACCAGATCAAGAGCCTGTCCAATGTGCATAAAAAAATCCGGAAACAAGTGAGCGTTTCCGGATTCTTACACAGCCACTGGATCGGTCAACTGATCCTTAACTGATCGGCATTACAGCCTAGCTGGGGGTTTTCTGTGCACAAAAAACCCCTGTAAAAAACTTACAGGGGTATAAGGCTTAGCCTAACTTGCGTCTGTTGCATGGTGCCGGGTGCCTCCCGGTGAATTCAGTCGGTGTCACTGAACCCGCGTAGGCTTCGCTCATAACATAATAAATGCTATGTACACCAGTCGCCCCACCGCACAGGGGGATTCACCACACAGCGCACTTTTTAACAAATATCCCTCCGGCCAGACAATAATAAACATAATGAATTGTGATCTTCTTAACGGTTTTAAAGTGTTACAGATAATATGCCAAGTAATTGCTTGTTTTTTCATC
AATAGGAAGACCACACCAACAACCCCAGCTATCAGCCAGAACCGACATTATCCGGCTGATAAAATCTACCATCACTCCAACACCAATCACCGCCTGTGCCAGATCGCGTTTCTCAAACTTTTTAATCTCTGTTGCCACTCTGCGGGTTTTCTTTTTGAATTTTGAAAATACCAAATATCGTGACGTTTCTTTGGGGGATGAGCTATCAAGCGGGAACGATCTGCCTACAGAGAAAGTCAGCCAGACCACTCTGCAAATTATCCCCGAAAGGCTCTGTGGCTGATATGCGCCGGGCATGGCGCAATGGGCCAGTGGTGTCAACGACGGATGAAAAGTGATCCACTTATATCTCCACCAACGGCCCAATATTGATCCACCGTTTTACTCAGGATTAGCTTCTGCTATAACCCCGGCCTTTCGTTTCTGTCTGAGTCGATAGCTTTCTCCTTTGATTTGAACGACATGTGAGTGGTGTAAGATACGGTCCAGCATCGCTGAGGTCAGTGCTGCATCACCGGCGAACGTTTGATCCCACTGCCCGAACGGCAGATTGGATGTCAGGATCATTGCGCTCTTTTCGTAACGTTTAGCGATGACCTGGAAGAACAGCTTTGCTTCTTCCTGACTGAACGGCAGATAGCCTATTTCATCAATGATGAGCAGACGGGGGGCCATTACTCCACGCTGAAGCGTCGTTTTATAACGGCCCTGACGTTGTGCCGTAGATAACTGAAGTAACAGATCTGCTGCTGTTGTGAAGCGAACTTTGATACCTGCACGGACTGCTTCATAGCCCATCGCTATTGCCAGATGGGTTTTCCCCACACCTGATGGCCCCAGTAATACGATATTTTCATTACGTTCTATGAAGCTGAGTGAGCGTAACGACTGGAGTTGCTTCTGCGGTGCTCCGGTGGCGAATGTGAAGTCATACTCTTCGAACGTTTTCACCGCCGGGAAGGCTGCCATTCGGGTATACATCACCTGTTTACGTTGATGACGTGCCAGTTTTTCTTCATGAAGCAGATGCTCCAGGAAGTCCATATAACTCCATTCCTGGTCTACTGCCTGTTGTGACAGCGCAGGCGCTGCGCTTATAAGGCTTTCCAGTTGCAACTGCCCGGCGAGCGCCATCAGTCGTTGATGTTGCAGTTCCATCATCACGCCACTCCTCTGCAGAATGAGTCGTAGATGGAGAGTGGATGATGCAGGGGGTGTTTGTCGAAGTTCACCAGATTTTCATCAAGATGCACGTCATACTCTTTTTTCTCCGGAGGCAGTGCCAGCATGGACTGCTGCTCTTCGAGCCAGCGATCGCAGGGACGTGCCTGGATTGTTTCATGCTTTCGTTGGTTAGCGACATCGTGCAGCCAGCGCAGACCGTGGCGGTTGGCTGTTTCAACATCGACAGTGATCCCCATCGGGCGCAGGCGAGTCATTAGTGGGATGTAAAAACTGTTACGGGTGTACTGCACCATCCGTTCCACCTTACCTTTAGTCTGTGCCCTGAAGGGGCGACACAGTCGGGGAGAGAAGCCCATCTCCTTGCCGAACTGCCACAGCGAAGGATGGAACCGGTGCTGACCGGTCTGATATGCGTCACGTTGCAGAACCACAGTTTTCATATTGTCATACAACACTTAGCGCGGCACACCACCAAAGAAGCGGAACGCATTACGATGGCAGGTCTCCAGCGTGTCATAACGCATATTGTCAGTGAATTCGATGTACAGTATTCCGCTGTATCCGAGAACAGCAACGAACACGTGAAGCGGTGAGCGACCATTACGCATAGTGCCCCAGTCAACCTGCATCTGTCGTCCGGGTTCAGTTTCGAACCGAACGGCAGGCTCCTGCTCCTGAGGAACCGAGAGAGAACGAATGAATGCCCTGAGAATGGTCATTCCGCCACGATATCCCTGGTCTCTGATCTCGCGAGCGATTACCGTTGCCGGGATTTTGTAAGGATGAGCATCGGCGATGTGTTGACGAATATAATC
CCGGTATTCATCCAGGAGTGAAGCAACAGCAGGTCGCGGCGTATATTTTGGCGGCTCAGATTTTGCCTGCAAATAACGTTTAACGGTATTGCGGGAGATCCCCAGTTCTCTGGCAATCGCCCGGCTACTCATTCCCTGCTTGTGCAGGATTTTAATTTCCATAACTGTCTCAAAAGTGACCATAAGCTCTCCTGAATCAGGAGAGCAGATTACCCCCTGGATCTGATTTCAGGCGTTGGGTGTGGATCACTATTGCACCGTTCGTGACAACAGAGAAAGTCAGCCAGACCACTCTGCAAATTATCCCCGAAAGGCTCTGTGACTGATATGTGCCGGGCATGGCGCAATGGGCCAGTGGCAGTGTGTGATGGTGGCCCTTACTGGATTTGAACCAGCGACCAAGCGATTATGAGTCCTCTTACCACTGAGCTAAAGGGTTGGAGAACGCAATATCACCTGCCTTATATGATTACACCCAGAATTTCCCGGACTGTCTTGTCAAAACATTCAGTCTCCAATTCCCACCAATAGCAAGACGGTCACTATGACAGCATCTCCGGAATGAGCTCAGGGCACGGGGGTTTACCGAGGTTACTTTTCCAACGAAGTTTCTCAAACGCAGGCGTGATAACATTCAAACTTAGGATCTCAGTTGCTATCTTTTCCCATGTCTTACGGATTAATGTCATCGGTTGCATACAAATCGCTACAAGCTGCATCAAGGCACATGCTTCATATGTACAGTACACCAAGATGGATAGGTGTTTCACCTAATCCTAGCTCATTTCAGGCAGGCTCAACAGGCTGCAGCCCGCATGTTTAAGGGCGCAGGTTATCAGCAGGGTATAAGCTGACCGTAAGCGTACAGCGAGGGCCGTATTGACGGGGATGTGTTATTCAGCTGGCAGTGCTATGCGCCACGGAAGCAGTTCGCTGACCCGGTTGACCGGCCAGTCTGCTATGACGCCAAGCACATGGCGAAGGTAGCTTTCTGGATCCACGTCATTCAGTTTGCACGTCCCGATCAGGCTGTACAGTAGCGCTCCCCGCTCACCACCATGGTCAGAGCCGAAGAACAGGAAGTTTTTACGACCCAGACTGACCGCCCGCAGGGCATTTTCAGCGATGTTGTTGTCGATTTCCACCCAGCCATCGTTCGCATAGTACGTCAGTGCCGACCACTGGTTAAGTGCGTACGCGAACGCCTTCGCCAACTCTGAGTGTCGCGACAGGGTCTTCATCTTTTCACGCAACCAGCTTTCCAGGGATTTCAACAGCGGTTTCGTTTTTCGCTGACGTTCAGCAAGCCGCTGCTCTGCCGGTATTCCCCTTATATCCGCCTCTATGGCGTACAACTGACCGATCTGTTCCAGGGCTTCTTCCGTCAGTGCTGACGGGATGCGGACGTGCACATCGTGGATCTTTCGGCGGGCATGAGCCCAGCAGGCAGCTTCCGTTATCCCACCATTGCGATACAGCTCGTTGAACCCGGCGTACGCATCCGCTTGCAGCACACCGCTGAAGCAAGCAAGATGAGTCTGCGGATGGATGCCTTTTCTGTCCGGGCTGTAAGCGAACCACACTGCAGGTGCCAACGCTGACCCGGCATTGCGGTCATCACGAACATACGCCCACAACCGCCCGGTCTTCGTCTTCTTATTACCCGGCAGCAGTACCTGGACCGGGGTATCATCGGCATGGAGTTTGCCGTCAGTCATGACATAGCCATGAAGCGCCTCTTCCAGCGGAGACAGCAGCCGGCAGCATGCATCCACCCAGCCCGACAGCAGTGAACGGCTCAGCTCCACACCTTGCCGGCCGTATATTTCTGACTGGCAATACAGCGGGGTGTGCTCTGCATACTTCGAGGTCAGCACGCGGGCCAGCAGCCCCGGTCCGGCGATACCCCGCTCGATGGGCCGCGAAGGTGCAGGTGCCTGCACGATGGCATCGCACTGAGTACAGGCATGTTTTTCCCGTACCGTCCGGATAACCCGGAA
TGCGCTACGCATCAACTCCAGCTGTTCGGCGGTATCCTCGCCCAGATAGCTCAGTGAACCGCCGCAGTTCGGGCAGCACGGCGCCGCAGGCAACAGTCGCTTTTCGTCACGGGGTAGTGATTCAGGGAACGGCTTACGGGTGCGGGTCTGACGCAACGGACGCTGTACTGCCGGGTCATACACCCTACCAGTCAGCGTATCGCTCTCTTTCTGAAGCCGGTTCAGATCGGCTTCCATTTGTGCGATACGGCGGGAGACTTTTTCGGAACGACTGCCGAAGTTCATCCGGCGGAGTTTATCCAGCTGCGCCTGCAGATGGTCTATTTCGCGCTCCCGGTTGCTCAGCTTTTCCTGCAGGGCGTGGATCAGCGCTTCCTGTTCGGCCAGGCGCTGTTTTAGCAGGAAGATGTCGTCAGAAGAGATGTCGTTCATAAGCCCGTATTTTACCGGGCTTATTCTGTGACAACCAGGATAAAGAGATTTACAGCATGGTCAGGGAGGTCAGCAGCCGCTTGGGCTGTCGCCAGTCGATACCTTCCAGCAGCATCGCCAGCTGCGCCTGCGTAAGGAACACTTTGCCATCACGGGCTGACGGCCAGGCGAAGCGCCCACGCTCCAGCCGTTTGGTCAGGAGGCACAGTCCGTCACCGGTGGACCACAGCAGTTTAACCTGACTGCCGCTGCGGCCCCGGAAAATGAAAACATGGCCGGACATGGGATCGTCTTTCAGCGCCGTTTGTACTTTCGCAGCCAGGCCGTTGAAGCCATTTCTCATATCGGTGATACCGGCAACCAGCCAAATTTTGGTCCCGGAAGGTAACGGGATCATCGCTTCAGTTCCTGTATCAGCAGAGTCAGGAGCTTTTCGCTGACATTGCCATTGAAGCGGAGCGTCCCGTGCCGGAACGTTACCTCACAGCTGATACTGAGGGTTTCCGGGTCCTCTGCGAGCGATTCTGGCTGTTCGGCAGCTGCATCGAGAGTCACAGGAAGTAGCTGGGGGCTCTCTGAAGAAGGTAATAGCAGCTTTCCCTCGCGCCATTGTTGTCGCCATTTGAACAACAGATTGGCGTTAATGCCATTTTCAAGAGCAAGTTTTGAGATGGATATCCCGGGTTCACAGGAGGCAGCAACGAGCTGCTGTTTAAATTCGGGAGAATAATTAGGGCAGCCTTTTCGCCTGCCGGGAGTCACATTTTTCTGCATATCTGACACTTTGGTTCCCACTACTTATTTGGTGGACACCACTTTGTCTAATTCGTCAGATTCTGACCAGACGGTTCAGGCTGTACGCTTACAGCTGACCGGATACATTCCGGCAAAGAAAAACCTTGCAGCGATGAAAGCCGTTTCGGTAATGCAATGACGATAAAGCTGTCCTGTATATATGTGCTTCGCCTCAAACGCTTGCCGTTTTGGTATTGTGCACATGCCGTCTGAACGATGTGGAGCCAGAAAAATGGATGCGTTATGTCATTGAGCATATCTAGGACTGGCCGGCAAACCGGGTACACGATCTGTTGCTCTAAATAGTTGAGCTGGCCTCTCAGTAAATATCAATACGGTTTTGGTGAGCCGCTTACCACTGAAGCATCACTTCGGCAGGTAGATTTCTGCAGGCAGAATGGCGCTTACCTTAGCGATAAGCGCGTAAATGTGGTTATTCAATACCTGTGGTGACTGTAAAAGTGCGCGTTTGCTGCGGTGCAACCTGAATCAGCGTGCCATTACGTTGCGCGGCAAGATACCCCTCAGGTCGACAGGTTGCTGGTAATGCAAAGGCGGCTACCTGTTGCTCGCCGTTATAAAGGATCCAGCGTGTCACATAATTTAGTTCAGCACTGGAGAAACGAGTAACAAACGTAGTGCCATCGGGAGCGATCATGCGAAACTCTGGCTGATCTGTGTAAGCGTCCAGTTTATCTGCAAAGAAGACAATTTCTGGATCATAAAATTCCGGTTGATTCAGCGTCGACAGAGAGGATTCCCCCTGCATAATC
CGTTGATTAAACGCCAGCCACTGAGGGGTGGGATTAACATGCGAAGGTACTGATTCACGCAACCTTAATATTTCGTCCGGTATATTCTGGCTGAACGTAGCATTTGGTATATATGCATAATTCATGTGGCACATATATTGTAGTGGCATATCTACAGAAGCCAGATTGGTTACGGCCATCTTAATATCGAACAGTGTAGAGGATTTGTGAAGGACCACTGTTGGCTGAGCCAGATAATGATGCCCGAACCCCATTACATACTCGTAACGCCCGGTAAGGCGTAACATATCTCCCTCTAATTCCATCCATGCTTCATCCATCGCGGCACAGGCCATTTCACCGTGTAGCAGATGAGTATCTTCCGCAGATGGGCAGCCATTAGCCAGCAAACCTGAATGAAAGGCAAAACAGCCATAGGTCTCTATCACCTCTGTCGCCGGTTTAGGCTGGCGAAACATATTGCACATGGTGAGACTGTGTCCATCAAATTGCGCATCCCAAATCATCTGCCCCATCCAGGGAAGAATAATTAAATGTCCACGACTGTTTGCAATTTTAAGCCCCTCGACACCACTGTCATAGCGAAAAGACGTGACAGTAAAATCACTATTTTCCAGCAAGATACGAGGTTTTTCGCCAAAAAGCGCCCGCCACAAATAAATACGCGTACTCATAACGATTCTCCTCAGGACTCTGTGACTTCAGCCAGTGCATTACGTACTTTGCTTTCACGCCAGAAGTAGACACCGACATAGACAAAACAGAGCATAGAAACCAGGAATGAAAGCTGTAGTGAGTGGAACATATCTGCAATATACCCCTGAATTGCCGGAACCACCGCTGCGCCAACAATAGCCATAACAATGACTGCTCCTGCCATTTCTGTATGTTCGTTATCAACTGTATCCAGTGTTCCTGCATAGATCGTCGCCCAGCAAGGGCCAAACAAAACACTTACCAGGACGGCGACATAGACCGCGCTGAAACTTGGAGCCAGTGCAACATATGCCAGAAACAGCGCCCCTATAACGGAATAGAGAATTAGGACTTTTTCCGGATTAAAACGTGTCATAAGGATGTTGGCTATAAACTTGCCAATAAAGAAGCAGGCAAAGCTGTAGACCATGAAGTTTGAAGCATCACGTTCGTTGATATCGCCCAACTCCAGTGCCAGACGGATGGTAAATGACCATACTGCGACCTGCATACCCACATAAAGGAATTGAGCCACAATACCACGACGAAAGCGCGAATTTTTAGCCAGATAGCGCAGCGTATCCATTGCTGACGGGCGTTTATGGTGACTTGTCTGTGCCACTTTACAGGTTGGGAAGCGGGTTAAAAGGAACAACACCATGACCACAACCAGAATCATAATCATATACTTATACGGTTCAAGGGTGTTCTCTAACATCAGCACCTTAAAGTTGTGAATTTGCTCGGCGTTCATTCCGGACATCTGCTTCTCAAGGCTTTCCCCCTCGGAGAAAACCAGATATTTGCCCAATAAAATACCAGACGCAGCACCAATCGGATAAAAGGTCTGGCTGATATTGAGCCGCAATGTGGCATAGGCTTTTGGACCGATCATTGAACTGTATGTGTTCGCTGCAGTTTCAAGGAAACTCAGGCCAATCGCAATCGCAAAAATAGCTGCAAGGAACATAGTGTAGGTTGCCATATGCGAGGCAGGGAAAAAAAGTGTACAACCACCAATATACAGCGTCAGGCCAATTAAAATTGCCACCTTATAACTGGTCTTTTTAATCACAAGGGATGCTGGTATTGCAATTAAAAAATAACCTCCATAAAATGCGCTCTGCACCAATGCTGAAGCAAAGTTACTTAGCGAAAATACACTTTTGAATTGAGTGATTAATATGTCATTTAATGCAGCTGCGCATCCCCATAGCGGGAATAAACACGATAACAAAATAAACTGGAACAAGGGAGTCTTATTCAGATACCCATCAGGCATCTGAATGATGTTTTTATCGTTC
ATAGTGCTACCTTTAACTGTGCAGGATGATTATTCGTTTAAGGTTAAAAATTCATTAAATTGTTCAATACTCGGATAAGATGATTGCGTACCTTTCCCTGTGACGCTGAAAGCGGCAAAGAGAGCGGCTTTTTTCAAAGCGGCTTCAACATCACCGCTTTGAACATAATAATGGGAAAAGCAACCAATAAATGCGTCACCAGCGCCACTAGTATCAACAGCATTTACTTTGAATGCAGGAACATGGACTTCCTGATCGCGGGTCATCCATAATGCGCCTTTTTCGCTCATGGTAACAATAATATTGTTCAGCCCTTTATCAACTAACGAACGTGCGGCCAAACGAATATGATCATAAGTATCAACCGACATACCGGTTAATATTTCCAGTTCTGTTTCATTCGGAATAAAGAAATCACATTTGCAGGCATAAGACATATCTAACTCACGCAATGCCGGAGCCGGATTTAATAACACTTCAATACCATTTTTCTTACCAAACTCAATCGCGTGGTAAACTGTTTCCAGTTGAACTTCCAGTTGTAAAACGATCAATTTGCATTTTTTCAGATCTTCTGCAGCTCGATCGATATCTTCCGGGGAAAGAAATTTATTCGCTCCCTTAATTATTAATATACTATTGCTCGAGTTGGCATTAACAAAGATCGGTGCAACACCACTGCTGGTACAGGGGACCTTCTCAACATAAGTGGTATTAATTCCCCATGATTCGAGATTACGAATAGTATTATCCGCAAAAATATCATCACCTACTTTAGTCAGCATCAGGACTTTTGAATTCAATTTAGCCGCCGCCACCGCTTGATTAGCACCTTTCCCACCACATCCGATTTTGAAGGCAGGTGCTTCCAGAGTTTCTCCTTCTTTAGGCATCTGATTAGTGTAAGTAATGAGATCCACCATATTGGAACCAATAACTGCAATGTCCATTTCACTACCTCTTATAAACTTTCGCATAACAATGGTATTTAAATAACATTAGCATGTTACTTTTGCATCATTTGTGACTGAGATCGCGATTAGCACATCAACCCGATGTTTATTTAATAGACTTCCAGTCTCATCACTCAGGCCAACACTATCTAATCATAAGCAACCTAACAAGATTAGTGCCCAAAACTCAGCAGCCTATACCCTTTTCATTTCAAAGGGGCGGTCGTATAGTATGGTAATGAAAACAATGTTTACTAACGCCAAAATGTTATTTTTATAACATTCTTACGGAGAGAGAGTTGATGGAAACGAAGCAAAAAGAGCGTATCCGACGTTTGATGGAACTGCTTAAGAAAACCGACAGAATCCATTTGAAAGACGCAGCGCGAATGCTGGAAGTTTCTGTAATGACTATTCGTCGCGATCTCCATCAGGAAGATGAACCTCTGCCACTGACCCTACTGGGTGGCTATATTGTAATGGTGAATAAACCCGCGCCATCCATGCCAGTAATCCATGACGTTCCAAAAAATCATCGTGATGACTTACCTATTGCAATTCTGGCTGCCGGAATGGTTAATGAAAATGATCTGATCTTCTTTGATAATGGCCAGGAGATACCACTCGTTATAAGCATGATCCCGGATGCAATCACCTTCACCGGTATCTGTTACTCACATCGCGTCTTTGTTGCGTTGAATGAAAAGCCTAATGTAACAGCAATACTTTGTGGTGGTACGTATCGTGCCAGAAGTGATGCTTTTTACGATGCCAGTAACTCTTCGCCATTAGACTCTCTCAATCCGCGAAAAATATTTATTTCCGCCAGCGGTGTGCATAATCACTTTGGCGTCAGCTGGTTTAACCCTGAAGATCTTGCCACTAAGCGTAAAGCGATGAACCGTGGACTACGGAAAATTTTGCTCGCCCGCCACGCGTTGTTCGATGAAGTGGCCTCTGCCAGCCTCGCACCGATCTCTGCATTTGACGTTCTGATTAGCGATCGTCCGTTACCGGCAGATTATGTTACGCACTGCCAGAATGGTT
CTGTAAAGATCATTACACCTGATTCAGAAGACGAATGACTTACTGAAAAAACACCACAATCTTGTTAAACATCGTCGGATTGGACTGATTACGTTGCACTTTCACCACATATTCCAGCTTATCTATTTGGCTTATCACCTACTCCAGACGCTGGTCATCCTTGACCAGTAGCCAGATATGGCTTTTGTCGCAGTCCTGAATCGGCAGGCAGAGAATGTCTTCAACGTTAAGAGCGCGACGGGCAAAAGCCCACAAAGGTGAGTCATTACACCCGGATGGTAAACAGGCACTTCCTTGTTGTAGAAGCCGTTATATTGTCCATAACGTCTGGGTTTTTAATGCATACTGGGTAACAACCGGACGAATCTTCGAATATCACTGGCGCTGCCAGCCGGTTTGAATTCATCTCACAACCCTGCATAGAGCGAATCTCCTGCCTGCACGTCACTCCACTCCATGGTATCAACTTCACACTCTTTATCTGCGGCTAGTTTCAGCCACCAGATAAGCATCGACTATGAAAGATGAGCCATGACAACATAACGTTGGTAACGCTCTGACGCCTTAATGGAAGATGCCTGCCACCATAGGGAATGTAAACGACTGAAGTGTGGCCTTTAATGCCGTGAACGGCTCATGGTCTCCTGGCACGGTTGCCGCCCCAACCTGTAACAACATTCCACAGTACAATGTCTGTCAGAGTCAGAGCCTCCCATGCTTGTTGTAGTAACTCTACCAGTGGATTTGCCCCTATATTTCCAGACGCCTGTTATCACTTAACCCATTACTGGCTTGCTGCCGTAGATATTCCCGTGGCGAGCGATAACCCAGTGCACTATGCGGATGCCATTCGTTATAATGCTCGAACGCCTCTGCAAGGTTCTTTGCTGCCGTTAACCCGTCTGGTTTGGGCATGACACTGATGTAGTCACGCTTTATCGTTTTCACAAAGCTCTCTGCTATTCCGTTACTCTCCGGACTCCGCACCGCCGTGCTCTTCGGTTCAAGCCCCAACATCCGGGCAAACTGCCGTGTTTCATTAGCCCGGTAGCATGAACCATTATCCGTCAGCCACTCTACTGGAGACGCCGGAAGCTCGTTGCCGAAGCGGCGTTCCACCGCTCCCAGCATGACGTCCTGTACTGTTTCACTGTTGAAGCCGCCCGTAGTGACCGCCCAGTGCAGTGCCTCACGGTCACAGCAGTCCAGCGCGAACGTGACTCGCAGTTTTTCTCCGTTATCACAGCGGAACTCGAACCCGTCAGAGCACCATCGCTGATTACTTTCTTTCACAGCCACTCTGCCGGTATGTGCCCGTTTCGATGGCGGTACAGCAGGTTTTCGCTCAAGCAACAGCGCATTCTGGCGCATGATCCGGTAAACACGTTTGGCATTGATCGCAGGCATACCATCAAGTTCTGCCTGTCTGCGAAGCAGCGTCCATACCCGACGATAACCATACGTGGGCAGCTCTCCGATAACATGGTGTATACGGAGAAGCACATCCGTATCATCAGTGTGACGACTGCGGCGGCCATCCATCCAGTCATCGGTTCGTCTGAGAATGACGTGCAACTGCGCACGCGACACCCGGAGACAACGGCTGACTAAGCTTACTCCCCATCCCCGGGCAATAAGGGCGCGTGCGCTATCCACTTTTTTGCCCGTCCATATTCAACGGCTTCTTTGAGGAGTTCATTTTCCATCGTTTTCTTGCCGAGCAGGCGCTGGAGTTCTTTAATCTGCTTCATGGCGGCAGCAAGTTCAGAGGCAGGAACAACCTGTTCTCCGGCGGCGACAGCAGTAAGACTTCCTTCCTGGTATTGCTTACGCCAGAGAAATAACTGGCTGGCTGCTACACCATGTTGCCGGGCAACGAGGGAGACCGTCATCCCCGGTTCAAAGCTCTGCTGAACAATTGCGATCTTTTCCTGTGTGGTACGCCGTCTGCGTTTCTCCGGTCCTAAGACATCAATCATCTGCTCTCCAATGACTAGTCT
AAAAACTAGTATTAAGACTATCACTTAAATAAGTGATACTGGTTGTCTGGAGATTCAGGGGGCCAGTCTAACCAGTTACGAACATCCTTCCTCAAAATTGTTGTCATATCTCGCATGGAAGAAAAGATCCTGGCTAAGGAGCAACAAACAACGTATTGCGGAACTTGCATATTTTTCCTGTAACTAGTGTATTACCACATATGGTAATAGCTACCTGTGTGGTTTCGCTGGATAGCAAGGGGATTTATTCGCAAGTAAAATGCCTGATAAAATACACGAATCTAGTAATCATCAATATTTACTCTGGTCGAATGACGCGTGAAGTGGACTGCCAGCAGACGCGGCCAGTGGTCCACCGCCTGCTGAACAAAACGCCAGATATCTCTCGGCTCTGAAAGTAACGCTTCGGTTATTTGCACGGAATACTACTCCTTCAGACTCTGTTAAGTTTTGTTTGTTAAACCGGTGCAGACCTGCAGGAAAGCATGCCAGCACCGGCACTGTACGATATAAACATCCGGTACCGGGGATACGAATGGAATGACGAATACGCCAGAAAAGGGATAACAACCTTCCTCATAATGGTGAAATCATTCGCTATCGGTTACACGGTACGCGATGTCGCCAAAGGCAGCTGGATCGACGAATCCACGGTCACGCTACCGAAAGCGCCGCCGCTTAACACCCTGCCTCGGGCGACCAAAGTGCCGGAGCCGCAGCAGCCGCAGGAAGATTACACCTTTGAAGGTTACCGCAACGCCGACGGCAGCGTGGGCACCAAAAACCTGTTGGGTATTACCACCAGCGTGCACTGCATGGCAGACGTTGAGGACTACGTGGTTAAAATTATCGAACGCGACCTGCTGCCGAAATACCTGAGCATCGACGGCGTGGTCGACTTGAACCACCTCTACGGCTGTGGCGTAGCGATTAATGTACCGGCCGCCGTGGTGCCAATTCGCACCATCCATAATATTGCGCTGAACCCAAACTTCGGTGGCGAAGTGATGGTGGTGGGCATGCAGTGCGGTGGCAGCGACGCGTTCTCCGGCGTTACCACTAACCCCGCTGTCGGCTACGACTCTGACCTGCTGGTGCGCTGCGGCGCAACGGTGATGTTCTCCGAAGTCACTGAAGTACGCGACGCCATTCATCTGTTAACGCCACGCGCCATCAATGAAGAAGTGGGCAGGCGTCTGCTCGAAGAGATGGCCTGATACGATAACTATCCCGATATGGGCAAAACCGACCGCAGCGCCAACCCTTCGCCGGGCAACTAAAAGGGCGGCCTCGCCAACGTGGTAAAGAAAGCACTCGGCTCCATTGCTAAATCGGGTAAAACCGCAATTGTTGAAGTGCTGTCGCCCGGTCAACACCCGACTAAACGCGAATTAATTTACGCCGCGACGCCAGCCAGAGATTTTGTCTGTGGCACGCAACAGGTGGCTTCGGGTATCACCGTGCAAGTGTTTACGACCGGCCGTGGTACGCCGTACGGCCTGATGGAGGTACCCGTCATTAAAATGGCGACCCGCACCGGGCTGGCGAACCACTGGTTTGATTTAATGGATATTAACGCAGGCACTATCGCTACCGGCGAAGAAACCATTGAAGAGGTGGGCTGGAAGTTGTTCCACTTTATTCTCGACGTCGCCAGCGGGAAGAAGAAAACCCTCTCGGATCAATGGGGATTGCATAACCAACTGGCAGTGTTTAACCCGGCACCGGTGGCCTGATATTCTCTTCATACATTAAGTTGTATTATGCCCGATAACGCTTGTTTATCGGGCATAGTGAATCACAGCGAAGACGCGAGCTCCCCGACCAGAATCACTTCAACCCCAGCCTTTCGCAAGCCTTCCAGACTATCCGCAGGAATGCCTTCATCAACAATGATCATGTCGATACGTTGAGTATCAATGATCTTATGTAAACTGGAACGATTGAACTTACTGGAATCGGTGACCACGATGATCCGTTCCGCAACTTCGCACATCCGG
CGGTTTAAACGGGCTTCATCTTCATTATGCGTGCTGACGCCGCGCTCCAGATCGATCGCATCTACACCAAGAAACAGCATATCGAAGTGGTAATTTTGCAGCGATTGCTCAGCCTGATCGCCGTAAAAAGATTGCGACTGACGGCGCAAATGCCCGCCGGTCATCAGCAGCTCAACGCCTTCCGCTTCCAGCAACGCATTAGCCACGTTCATACCGTTGGTCATCGCAATTACGTCAGTGTGCTTGCGCATCAGACGAGCAATCTCAAAAGTGGTGGTCCCGGAATCGAGGAGCACCCGATGACCTGGCTGAATCAACTCAACGGCAGCTTTCGCAACGCTGCGTTTCATCGCGGTGTTCAGTGCGCTTTTGTCTTCCACTGATGGCTCGACTGACGGCGTCGTGCTATCGCAGATCAACGCGCCACCATAGGCACGCACAGCGATTCCCTGCTTTTCCAGAAACGCCAGATCGTTGCGGATCGTCACAGTAGATACGCCATACAATGCCGACAGATCGTTAACCTGCACACTCCCTTGCTGTCGCAGACGCTGAATGATCTGTTCTCGTCGCTCGCTGGTGCCTGTCACTCGCTTCTCACCTGAAGCGTCGGTATTACTCATAGTAAGTCCTTTCGTAAAACTTTCGTTTCATTTCGTTTTGCCTATTAACGCCTTTCTATTAAGCAAATGCAAGCCCACCTTGCCCATTGGCGCAAGCTACTCTCGTTTCACTGACTTTCATTATGTTTCTTTTGTGAATCAGATCAGAAAACTATTATCTTTCGTTTTATTTTTATCTCACCATGACGCAGTATCAACTGAAACAAAACGAAAGATTAATATCGCAGCAATCTGAACTGGAGAGGAAAGTGAAACATCTGACAGAAATGGTGAGGCAGCACAAAGCGGGCAAAACAAATGGAATTTATGCCGTTTGTTTTGCCCGCTTTGTGCTGCATGGGGCGAGCGATGTGCCGGATGAGTATGTTCGTCGCACCATTGGGCCAGGCGTCTGCAAAGTCAACGTTGCAACCGAGTTGAAGATCGCCTTCTCTGACGCTATCAAAGCTTGGTTTGCTGAAAATCAGCAGAGCAACGATCCGCGCTTTTACATGCGGGTTGGCATGGACGCCATGAAAGAGGTGGTCAGAAGCAAAATCGCCGTCTGCGGCTCGGCAAATCGATTACGGCTACCGGCGGAGGCCTGATCCAACAGCGTATTACCTCAATATTTCAAAATAATTATAAGTCCCACAAATATGAAGGCGCGTCCTTAAACCGGGTAGTGCCTTCCATTATCCTAAAATTCGAGGAGCCCTATATGACACAAAAAAAATCTTTTAAATCAAAATTATGGGAGTTTTTACAAAGTCTGGGGAAAACCTTTATGTTCCCGGTTTCGCTTCTTGCCTTTATGGGATTGCTGCTGGGTATCGGTAGTTCAGTCACCAGCCCTTCCACCATTACTAGTTTTCCCTTTCTGGGCGGCGAATTTACCCAGTTGACCTTTGGCTTTATCGCTATGGTCGGTGGCTTTGCTTTTACCTATCTGCCGCTGATGTTTGCCATGGCGATCCCCATGGGGCTTGCCAAGCGCAACAAAGCGGTCGCTGCCTTTGCCGGGTTCGTTGGCTACATGCTGATGAACATGAGCATTAATTATTACCTGACGGCTACCCACCAGCTTGCCGACCCCGCCACCATGAAACAGGTAGGACAATCGATCGTGCTGGGCATTCAAACCCTGGAGATGGGGGTATTAGGTGGCATTGTGGTTGGGGTTATCACCTATTTTCTGCATGACCGTTTTCAGGACACGGTTCTGCATGACGCCTTCGCCTTCTTTAGCGGCATTCGTTTCGTGCCGATTATTACCGCGCTCACCCTGTCGCTGGTGGGTCTGTTCATTCCCATGCTGTGGGAATACGTCGCGCTGGGCATCGCGGGCATTGGGCATATCATCCAGAGCACCAGCGTTTTCGGCCCCTTCCTCTACGG
CGTAGGCGTGCTGCTGCTTAAACCTTTTGGTCTGCACCACATCCTGCTGGCGATGGTGCGTTTTACCCCAGCAGGCGGCATTGAAATGGTAAATGGCCATGAGGTCGCCGGGGCGCTGAATATCTTCTACGCCGAGCTCAAAGCCGGCCTGCCGTTTAGCCCGCACGTTACCGCGTTTCTGTCACAAGGGTTTATGCCGACCTTTATCTTCGGTTTACCCGCCGTGGCTTACGCCATCTACCGCACCGCGCGTCCGGAAAATCGGCCGGTCATTAAGGGGTTGCTGCTTTCCGGCGTGCTGGTTTCCGTCGTCACCGGTATTTCAGAGCCGATTGAGTTCCTGTTCCTGTTTATCGCCCCCGCGCTTTACGCCTTCCATATCGTCATGTCTGGCCTGGCGCTGATGGTAATGGCCCTGCTGGGAGTGACCATCGGCAATACCGACGGCGGCATTCTGGATCTGCTGATTTTCGGCGTGATGCAGGGAATGTCGACCAAATGGTATCTGCTGTTCCCGGTTGGTATTGCCTGGTTTGCCATCTACTTCTTTGTCTTCCGCTGGTACATCCTCAAACACAACATCAAAACGCCGGGCCGCGAGGTGGATGTTCAGGGGGCACAGCAAGCCGTCGAGGCGAACACCCGCGCGCGCGGAAAATCAAAATACGATCACGAGCTTATCCTACGTGCGCTCGGAGGTAAAGAGAACATTGAGTCGCTTGATAACTGTATTACCCGCTTGCGTCTGGTGGTGAAAGATATGGGCCTTATCGATCAGCAGGCGCTGAAAGCGGCAGGCGCGTTGTCAGTGGTGATGCTTGATGCGCATAGCGTGCAGGTGATCATCGGACCGCAGGTACAGAGCGTCAAAACCGGCATTGAAGCCTTAATTTAACAGGAGGAGTGATGTTTGATTTCGACAAAATCATTGAGCGTCAAAATGATAAGTGCCGTAAATGGGACCATACCTTTGTTTGCTCGCGTTTCGGTGACGTCCCGGAGTCCTTTATCCCCCTATGGATAGCCGATATGGATTTCACCTCACCACCTGCGGTGATTGACGGTTTCCGGCGCATCGTGGAGCACGGCACCTTTGGTTATACCTGGTGCTTTGACGAATTCTACGACGCGGTCATTGCCTTCCAGCGCAAACGTCATCAGGTTGAGGTGGAAAAGTCGTGGATCACGTTGACCTACGGCACCGTATCCACGCTGCACTACACGGTTCAGGCATTCTGCAAACCGGGTGACAGCGTGATGATGAACACGCCGGTCTACGATCCTTTTGCGATGGCGGCACAGCGCCAGGGCGTGCAGGTACTGGCTAACCCGCTGCGCGTGGAGGAAAACCGCTATCAGCTTGATTTTAATCTGATAGAAGAACAGCTCAAAACCCACCGTCCAACGCTGTGGTTCTTCTGCTCGCCACATAACCCGTCCGGCAGGATCTGGCGCGAGGAAGAAATACGCCAGGTGTCCGATCTCTGTCAACGCTACGGCACGATTCTGGTGGTCGATGAGGTTCACGCTGAACACATTCTGGATGGCAAATTCGCCAGTTGTCTCACCTCTGGCTGTGCCGCCCAGGACAACCTGATCGTGCTCACATCGCCCAACAAAGCGTTCAATTTGGGCGGGCTGAAAACCTCCTACTCCATGATTCCAGACGACTCGCTGCGCCAGCGCTTCCGCCAGCAGCTCGAGAAGAACTCCATTACCTCGCCCAATTTGTTCGGGGTATGGGGAATCATTCTGGCCTATCAACACGGTCTGCCCTGGCTCGACGCGCTGAACGGTTATCTGCAAGGCAACGCCCGGTATCTGGCGGATGCCCTCCAGACCCACTTCCCGGCGTGGAAGATGATGAACCCGGAATCGTCGTATCTGGCGTGGATAGACGTAAGCGCGGATGAGCGTAGCGCAACGCAGCTAACCCAACATTTCGCACGGCAGGCAGGCGTGGTCATAGAAGACGGCAGCCACTATGTACAAAACGGCGA
AAACTACCTGCGGATTAATTTTGGCACCCAGCGCTACTGGCTGGAGCAGTCCATTAACCGAATGCTGAAAAATGACAAATAAGGATCTTACCCCGATGAAGAAAGTGCTCACTCTCTCACTGCTGGCTCTCTGCGTTTCTCATGGTGCAGCGGCAGCAAACTACGCGCTCAATAACGACAATATTGCCCTCTTGTTTGATGATACAAACTCAACGGTCGTGGTGAAGGACAACAAGGCTAACCATCCGCTCACGCCGCAGGAGTTGTTCTTTCTGACGCTGCCGGATGAGAGTAAAATCCACACCGCGGATTTCAAAATCAAGCACGTCGAAAAGCAGGATAACGCGATTGTCATCGACTTTACGCACCCGGATTTTAACGTCACGGTGAAGCTGAACCTGGTGAAGGGAAAATACGCCAACATCGGCTACACCATTGCCGCCGTGGGGCAGCCGCGCGACGTCGCTAAAATCACCTTCTTCCCGACCCAAAAACAGTCTCAGGCCCCTTACGTAGACGGCGCAATCAATAGCTCTCCGATCGTTGCGGACTCGTTCTTTATCCTGCCGGATAAACCGATCGTGAATACCTACGCCTATGAAGCCACCACCAATCTCAACGTAGAGCTGAAAACGCCGATTCAGCCAGAGGCGCCGGTCAGCTTTACTACCTGGTTCGGCACTTTCCCGGAAACCAGCCAGCTGCGCCGCAGCGTGAACCAGTTTATTAATGACGTACGTCCACGCCCATACAAGCCTTATCTGCACTACAACAGCTGGATGGATATCGGCTTTTTCACTCCCTACACTGAACAGGATGTGCTGGGGCGTATGGACGAATGGAACAAGGAGTTCATTACGGGCCGCGGCGTGGCGCTGGACGCCTTCCTGCTGGATGATGGCTGGGACGATCTGACCGGACGCTGGCTATTTGGCACGGCATTCAGAAACGGTTTTAGCAAAGTACGGGAGAAAGCCGACAGCCTGCACAGCTCCGTTGGGCTATGGCTTTCACCGTGGGGTGGCTACAACAAACCGCGCGACGTTCGCGTTTCGCATGCAAAAGAGTATGGGTTCGAAACCGTGGACGGCAAACTGGCGCTGTCGGGAGCGAACTACTTTAAAAACTTCAATGAGCGGATCATCAAGCTTATCAAAAACGAGCACATCACCTCGTTTAAACTCGACGGGATGGGTAACGCCAGTTCGCATATCAAAGGCAGCTCGTTCGCCTCAGATTTCGATGCATCAATCGCCCTGCTGCACAATATGCGCAGCGCAACCCCGAATCTGTTTATCAACCTGACCACCGGCACCGACGCCAGCCCGTCCTGGCTGTTCTACGCTGATTCTATCTGGCGTCAGGGAGATGACATCAACCTGTATGGTTCCGGTACGCCGGTGCAGCAGTGGATGACCTACCGCGATGCCGAGACGTACCGCTCCATTGTCCGTAAAGGCCCTCTGTTCCCGCTGAACTCGCTGATGTACCACGGGATAGTCAGCGCCGAGAATGCCTATTACGGGTTAGAGAAGGTGCAAACGGACAGCGACTTTGCCGATCAGGTCTGGAGCTACTTCGCGACCGGCACCCAGCTGCAGGAGCTGTATATTACCCCGTCCATGCTGAACAAGGTGAAGTGGGATACGCTGGCGAAGGCTGCAAAATGGTCGAAGGAAAATGCCAGCGTGCTGGTTGATACCCACTGGATTGGCGGCGACCCAACGGCGCTTGCCGTGTACGGCTGGGCATCCTGGAGCAAAGACAAAGCCATTCTCGGTTTGCGCAACCCATCGGATAAGCCACAGGCCTACTATCTGGATTTGGCTAAGGATTTCGAAATACCGACAGGAGACGTGGCGCAGTTTAGTCTGAAAGCGGTATACGGCAGCAATAAAACCGTGCCCGTTGAGTATAAAAACGCGACGGTGATTACGTTGCAGCCGCTGGAAACGCTGGTGTTTGAGGCGGTGCCCGTTAACTAAACGCTTGTCCCAATG
AGCAGACCGGGTAAGGCGCAAGCGCCACCCGGCAAAACCGGCAGCAGGGGCTTATTCCCCCTGCTGTTCCAGCGCATACTTATACAACGCATTCTTCTTCACTCCGTGGATTTCTGCCGCCAACGCCGCCGCTTGCTTCAACGGCAGCTCAGCCTACAACAGCGCCAGCGTACGCAGCGCATCGGCGGGCAGTTCGTCATCCTGGGCTTTATGGCCTTCAATAATCAGCACCATCTCGCCTTTGCGAGGGTTTTCATCTTCTTTGATCCACGCCAGCAGTTCGCCGACCGACGTGCCGTGGATGGTTCCCAGGCGGGTGTTCATCACATAACGGTATAGGGCATCATTCAGCGGAGCCATTAACAGGCAGCACGCGTCAACCCAGTTGGAGAGTAACGCCGTCATTGGGTTATTTCTTTCCATAAAAATACGCAAACATGCCCCATACCTCCTGTTCCAACAGCTCAACCTTTCTTAATCATTGCCGAAATTGCGCCACAGTCATGAAACTTATCAAAACCAGTGGTGCCTTTGATATATTCAGTAACCAGAGTACCAATAATCTTAACCGTTTCTTTGTTCGTTGTGATAAAGCCCAGTGGCATCGTTACAGGCAATATTTCGGGTTGAGAAATATGATAGATAACGGCAGATAACTCATCCTCAAGATGAAGGATCTCAACATTACCATCTAACAGCCCCTCAACTAGTCCTTTAGCGTACTCATGGAATTGCCCAAGTTCATACTTCCACTGTCTGGCAACCCGAGCATCAGCTGCATCCCGGACAGGTACGTACAGCACCGGTTTATACAGCTGCTTATCCTTCGTAGCCGACTTGCTAAGAAAATTGCTCAGGCTTGAGATAAATCGAATGCTTTTTGACCAGTAGGGATCTGATGGTGCCACCCAGATGAACGGGCTACCTCCATAATCAGTTTTATCTCTGATACGCTGATTGAGCGCTGGTGCCAGAGCTTTCTGCAGCTTTTCCCAGTTGTTTTTAGAATAAACCGGACCTAAAAAAGTCGTTCTGCCTGAGGACGAGTATTCTTCCAGTGCTTGCATATCTTTGATAAACAACGGATTAAGTCGATTCGTTTTAGAGTCAAACAGCTTAAGCGTAAGGTCATCGTTAATGGATTGCATGGCCTTAAAAATACTCATGGTATTGTTATGCCGTGATAACCTGGCTATCTCAACAGCCATTAACTCATGAACACATTCCGAACTATTTAACTGTTCATAGTCTTCACGCTCGACAGCCTTACCATCGCTATCCATTTTAAATTCCGTTGTTAAACGGAGTGGTATTTGCCGCAGCTTATTAACAGAATTAACGGTATAGATAGAAGGTTGTTCTTTTCCGACCTGAACCTGCAGGTGAGACAGGTAACCTTTATCCAGGATTTGATTAAACTGGTATTGAAAGTGCTTCTCAACCAGTTTCTCACTTTGAGAAATGTTGTTGTCATCAACCTTTAATGTTACCCCTCCCCTCCAGTGGTCCTGAAAAGCCTGGTTGCTGAAGCAACTAAATGTGGCTAAATCAAAGGATAAGGTACAGCCACTGTCCTGAATAGTTGAAAGATGCGGTATCTCGCCAACCTCTGAAAAATAGCCATTTGATTTGTCTGTTAAGGCCAGTCTGCCGTCTGAGGAGAGCGTCAATTCGCCACGCTGCACAAGATCGGAAATCGCCTCCTCTGTCTCACGGTGGGTCAGGCCGAAATAGGTTGCTATTTGTGCTTTGCTCATCGAAGCTACATGAACAAGTCTCAGCACAAACTCCCGGATAAATGGCAGCCCCTTCTGGGAAACATAGGAAAACTGAATGTTAAACCTCTGTGCTGGCAGCAGGAAGTCAACCTCATGATAGGTAACTTTGTTATCAGATATCATTTCTTTTTGCCTCCCTGCTGAGCAGAGAGGAACCTGTATCCAGCTTCCTGACCTCGCTCAGCCATATAACTTACAACGTAGCCTAATGGCAATTCTTTA
TTGTTTCCCTTCCAGATATCGGCATTACCCACAATCAACAGTCGATCCATTGCGCGTGACATCGCGACATTTATACGGTTTGGAACGCGTAAGAAACCTGGGCTATGCTGCTTATCCGAACGCGTCAGTGACAGGATAATGATCCGATTTTCCTTTCCCTGATAACTGTCAACAGTGTCAATTTTAACAATGTCCTTAAATCCCTCGCTCCAGATTTCCTGATTGAATTTCTGACGGAGTAACCGCTTTTGTTCGGCATACATACATATCACGCCGATAGCGGCTTCATCTTTGCTAACAAGTTTTGAAAGCTTAGCGACAAATTCTTCATTCTCTGACACCTGTTTAAGAACAGAAATAATCTCGTCAGCTTCACATCGGTTGTAAATGCTTGTTCCGCGATCTTCAAGATGATGTGCTCGGTGGCCCTGATTAGCAGTATCAAGCCAGGTTACAACGCTACGTAACGCTTCCGGAGCTTGCTGATAGACATCCGGAATTGCCCGTACTCCATTCAGAAGCTTCCCGTCATAAAACGTCTTCGATACGAGATTACCAATCGGTGGAGCCATACGATACTGGGTCATCAAAGCTGCACTCGTCTGCGCACCATAAGCAGAGTTGAAGGCTCGGGCAAAGTCACTTCGTAATACCTCGTCAATTTCAGTGCGGGAGTTATTGATACCCAGCTTCCTCGCTAATGCCGCCTTGTGGGCATCTGAGTACAATGGAGGAAGCTGCATGTGGTCACCCACCAACAGGACACGCCGGGCTGACTGCATTGCAATGGCCAGCTCACTTGAAATCGAGCGCGCCGCCTCATCAATAATCACCCAGTCATAGATATTCTCCTGAATGCCAATGTGCCCTTGTCCAATACCAACGCATGTGCCAGCCACTAACTGCCTGGAGCGCGAATAAAACTCGTCCAGGTTCACTCGCTCTCCCGACATAGCATCCTGCATATCACGGGATATTTTAGCTAATGCTTTTACTCGTCTTGCCTCATCAGGTCTGACACCATACTCTGTACATAGCTTGGAAATCAGAATGTCTTTTGCTGCTGAGACCTTCACGCCATTGTCCAAGTTAATCCCATACTCCTGACTCAGCTTAGAGCGGATGGAAAAATCGAGTTCGACTGCAATATCTTTCAGTTCATTGCTCTCATTTGAATCCGTTAAATTGTTAACCTGATAGAGCAATTTCTCAAGGTGATCGATTTGTCTGAACAAATTGAGCTCTGCAAGAACAACACCCGAAATAAATCCCGGCTCCAGACCAATAGCTTCACTCAATGCTTCAACACGGTACTTAATTTCAGCATTGAAGAGTTCGCGCTTCTCTGTTGTGATTGCGTGTGAGTACACATCTTTTAAGCCAGGGGAAACGGCTCCCTCTCGGTTGCTGAACCTGACAACGTCCAACTCTGTACCAAGCCGGGAACAATGCTTTCTGATACGCTCGGCCGCTGTATTCACAGCCTCGTGTGACTGGCTGACCAGTAAAATGCGTTTGGTATTCTGTTTCTCGATCAGGTAGTGAACAAAGGCCGCGATGAACTCGGTTTTACCGGTCCCCGGTGGCCCCTGAAGCAGGGAGAGAGGGCCATTATTGACCAGTTTGTTAAACGCCTTTCTTTGCTGTTCATTCAGACTGATCTTGTTTCCGTGCTGATCTTCACGGTCGTATCTGGCAAAATCGGTATCGCTGAGAGTGATACCATAATTTTGAGCCGCCTGTTTGCAGGATGGATCGAACAAGTCAATTAAGTCAGGCAGCACACTTTCTCGATCTAGGAGACGTTCCAGTGCGCGTTTACGTTTCTGATAAGATGCACGAGTCGGCCTTGTACGGAAGAAGACAATATCAGAATCCTTCAGCTTGAAGGCTGCTGAACTAACTTTGACAAGACGAATCTCTTTGAGCTCTGACTTCTTAAGTGACACTTCACCAATAAATCGCTCAACACCTTCCTGATCGACCTGTAAGGCTTCGAC
TTCATCACTACTTCTGAAAGCACCGAGTGGATCAACATCAGCAGAGTAAGGGAGAAGAAGCTCTCCATGAGCATCTGCAACCGGGACCACTTCACCGCTGATTTCAATGTTTGGATAAGATTCTGTTTCAGTATCCAGAATGGCGCGCCACAGTTTCACTGTTGGGATTTCCAGCACTTCTCTTAAAGAAGGTTCAAGCGTCTGCTTATCAAGCCTTGCAAAGGTATCTTTTAGCTGAAGCGTAAGTGGCTCTTGTACCTGAACATCTTCAGTTGCGGCAATCAACTCGATTGCCCGCGCAAAAGATTCTTCTTCATTAAGCAATACTGTCAACGCCGACAGATCCTGCGGACTGCCGGGAATAATCTTGATGCCAGTATCAATTTCGAACTGGCTTTCATCAATATCTTGCTTACGAATTGTGACGCGAGCCCGTGGCCTGAAGCCATGAACCAGCGTCTTCTGATCTTTGTTGAACACCGCAGTAAAACTGCCACCAATCCCAGAGAAAGTCACATTTACTTCAGCTGGCGCTTTAGGATTTGACTTAACTTTGACATACAGATGCCCGTTATCAGGCAGAATAGAGATTATTTCATCGGCATTTCCTGCAGTAATCTCTATCAGGTCTTGTTCTGGCACCAGGTCATTACTATCTATCGCTTTTTTAAAGCGGCCTAAATCCTTAAACCCAAAAACCGGATCTTCCAGCTCTGCTCGGATAGCATTCGCGATTGTCGGATAAATGTCTGATTCCAGCCCCCATGACATGCCTAGCAACTCGCAGGACATCTTCATCACTGCATAGTTGTCACGTTCAAAAGAAGTACAATTATCAATGTATTCAGGGCTGTAACTGTGATTTTTAGGTTCGTCACCGGACGGTGAAAAATCGGGGATATCGATGAGAAAAAGCAAACGGCTCTGTGTCTCAAATATCACATTGCCAGGGTGGATGTCCCCGTGAGAAACACCCAGACCATGTAAGTGCTCAACGGCGGCGACAAACTTACCAATGAGGTCGATTTTTTCATCATCGGGGACCGCTATTTTATCCCAGGTTTCTCCCTGTACCTGATCCGTGACCATATATAGGCTTGATGATTTTGATGCGATACCGAATTCACGAATTTGAGGTAGATACGTTGTTTTAACTGAAGAAAGCCGCTCAACCTGCTTTAAAAACTTCAGAACCTGGAAATTAATTGACGGATCGTATCCCTGTCCCCCAACATTCAGCCAAGCCTTCACGAGCCTTCCTTTCGAGATATAGACTTCTTTGTCGACTGTTTCGACCTGAAACTGAAAGCCATCGTCTTCCGGGTATTGACGAGCGTGATTGATAGCATGTCGGTAAGGGTCAAGCTCAGTATCATCAAATGTTGGAATATCCTTGCCAGCAGGTTCGGCTTGTTTCAGGGCATCAAAGAATTCTGTTGCAGAAGTGAATTTTGCGGCAACGGCATCCCGTAATACAGATGAATACCAGTGCTGGCTGTTTAGCATGTTGTCCTGTACTTTCTCCAGACTCTTTGGCGACATGCGCATACCGCTAAATAAGTGCCAGGCGACAAGGCCCAGAGTATGAACATCTTGCTGAAAAGGCGTAAGCTCGCCCTTATCAAGCATGTCTTTTACGTGGACTGCACCTACGGATAAAAGCTTACGGTAATCGCCAACCGTTCCGGCTGGCTGGTGGTAAGCCGAAATAAAGTTCGAAAGAGCAACTTCTTTTGAAGGTGAAATCCACAGACTGTGATCGGCGACATCCCTGTGCGCAATTTTCATCTCATGGAGATCACTAAACTTTGCAATGAGCAGTTTCACCACATTCAAGCGGTCCATATCAGAAAAGTTCTTACCATATTTTCCGATAAACTCATTAAACCGGACATGACCCGGCGGAACTTCGTAGACTTCGCTATACTCGGCCGTCACCTCGTCTTTCTGAAAACTCGTCAAAGACCTGAGACAGTGGTTATACAGGTCACGATTCTGGT
GATTGATGTGCTGTAACACTTCGCGCTCACGTGAAACAATCTGAGCTCGTCCTTCCGGGGTATTCGCTTTCGTTCCCGTTATATTTCTAAAATTCCATACTCTGAGTAGCGCTTCGCTGTTCGTTGATATCTCAGATTTTGCCAGATATTCTCGATACACCTTTTTAGGGTGTTCGAAAATCATATCATTAGCTTCATAGCCATTAACTCTCAAGGCTTTTGGTGCCGTCTGAGGCCCTAGGAACAAATCATCGAAAAGATGGAAATCCTTGTTAAGAACCTTGGTAGCTGGGTGCGGTTTGAAATAGTTATTGAAGCTCCCGCGATCGGCAAACTTCAGGAAATCTTTTAACGAGATCGTATGGCGCCGTTGTTCCTCCGGCAGCGCGCTGAAATCTGCATTGCCGGTCATCACAACAAAAAAATGAACAATCGGGATATAGCCTTTGTTCGTAAAACGGTCTACCAGACGCTTGAGTTTTTTGTCCAGCATGAATTTTTTGCTACGAGTCACGCTTACTGGCGAGCGCCCCATGTTCTTATCGCCTTTAAACCAAGTATCTCCGCGTGCGGTTACAGGCTGATGGTTCCAGTCTTTCAGTTCAACAATGATCACGTTGCAGTGTGTTACAATAACTAAATCGAATTCCCCCTCTTTTTTTGCCTCAACAAATCGAAAACCTGCATAACCTTTCCAAGGGAACATTTCATTGCCGATAAAACCATAGCTTTTTAGCTGCTCACTAATTGAACCGCTACGGAAAGGCTTGTCAGGCTTAGATACGTTGACTGAAAAAGCAGCTTTTATTTTCTCGATAGCCAATACTTCCTGTTCTTGTAAGCCACCATCCCACATTTCTACTTCCAACGGTGTTCTCCTTAAATTTCAGTAAAAATAAGGTCATTTATTCTGGTCAATATGGCAGGTTTCGATGTTTATAGTTCTGAATACTATGGTGTTTTACACGTTCATTCTGGTTATTATACGTAGAAAAGAATTTATCGGGCAACGCCGAGATGACGCGGCCCCGTTAAACCGTGTAGTGGTAGATGATGAGATCAGATACAGGGATCAACTGACGGTGAGCGCAGTCCCCACATTTTATCCTGAGTTTACCGCTATTCCCGGCTGCCATTCGTTAGCACAGGCCCGCAAGTACCCTGATTTGCCGCTGGTTTTGCTTTCCCATCGAAGGGCCCATACATCATCACGCCCGCGAAACAGTCGACAAAATAACGCAACTTTCTCATCCGTGGATAATACGGAAACGCACTGCACAGGACTCTGCGGTTTACGTCACCATTCAATCCCATAAGCTTCAAGTTATGTTTGCCCTCAGTGCAGCTAATTCATCACTGTCAGATTTATGAACCATATTCGTTACCTGTATCCGATCCACTGAGGTACCACGGCAGAAATTCGGGTCCCCTACACTGATGACAAAGCGCCCCGCCCTCTTCTGCCTATTAACTCAAGCACCTCCCAACAACTTACAGAAACTTAATTCTAATACCTAATACAACTGTAATTCAGTTATGTCGTGGTCGGCCCCGGTTGCTTTTTTCGGGAAGCCTGCTACGGCCCAGAATAATACCGTGTTCTTTCTGATCCTGTTTAGCCAGGTAGCTAAGGGCGTAACGTAAACCGTCTACCGCAGACTTATCACTATAGTGAATCACGTGATCGATACGCACCGGGTATTTGTCTTTAGTACTGCACAGGTGAAAATAACCCTCCCCTTCCGTGATCCGACTCCAGATATCTCCCAGTTGCCTGGATATCCGATAAAATTTCTTGTGACGTTGCCCATCCAGGTAGCCGATAAAATGAATATGAAGCCCTTTGTTTGGTGTATATTCCATAACCCAGTAATATCCCGCCAACATCGTCTGGGTTTCGCTAAGTAAACGGTATATTTCCATACACATACTGTGCTTACAGGAATGCCCGAAACTGGGCGTGTCTTTCCTGTAGGCAAAATCAATTCTGAAAGGTAACA
GTTTAGAAAAACGTTGAAACATACCATCCATGTGTTCATTCACGTCTTTCAGAATCATGAAGTCCATTTCGTAATTGGGATTAGCATTATACATTTTATAGTCCTTACTTTATAAAAGTTACAGGAAGGTGGAATTACATAAATACTGAGAGTACAGAAACGCCGTACCGCGGCATTAATAAACGAGGTAACCAGCAGCACAGACAACCTGATAAAACTGTTTCGTATTACCTCATTACTCCAAAGAGGTAATATTTCAGAGTACAGTAAGACTCACTTAAACTGAGGACCAGTACGAATTACAATGGTAAAAACACACCTCATAGAATTGAGTTTAAGGAAGTACCATTCAATCACTTACAGGAAGTAATCTGACATTCACAAAAGCCAGTATCGGGTAGCCACAGGCAACACCGCTATTAATACTTCTGTACCATTAAGTAAAAACAGCGACCCAGCTTAAGGCCTCTACAGATATTTAACTTCACCATAAAAATGATGGTATACAGACCATACCCCACACACCATAAGAATATTATTTCATACAGGAAAGATTATATTTTTTCATACCTATAAACTGTAACAGAGCCATAACATACTCTAATAGTGTTCCGGTACGTAACCAGTGTTAAGTGCGTACAAGACAACATAACAGATCACATAACTTCCACTAATAATACTAATTAACCTAAATTATTTATTGAACACGAAAACAATGAAGGCCCCACCTCACCTCAGACTAAAGCCGACAATATCCTGAGCAACACTCTATCCAATAGTAAATAGTGGTTACACCAGTTATATTTACTGACTGACCTGTACCAGAGTTTTCTAATACAACGCTGACATGGTAAAGACATTAGTCTCCTCCGCAAGTAGCAGAGGAGACTATATGTACTCAACGATTAAGTCGTTCAGAGGTCAAAGTTAGGTGCGACACACAACGTTTTTCCCTCAATAACCGGTACAACTCTGTTTTGTTCATACAGTAAATCCAGTAACCAGTTGATTTTATCCTTTTTCCGGAAACGGTTCGGACCACGCTGTAAAATATCATTTTTTTTCATACAGAGGATCCCCTTCTCAATACAATAGCTTTTTATCCAGTTGAAAAGTTCAAGCTCCTCTGGAATGAGTCGCACAGGTACGGTCAGGGCAGAGTTGTCAAAAGTTAACGGATTAGACAACCGCACATACTCATTACCGTACCATATTGCTAATTCTCTCGCCATTTCTGCAGTGTAAGGGGAAATCTCCCCCTCTTCACCGCTCGAATGGTAAATAAGTCCTGCCAGTCTTGCCATATACTCTGCATTTTTAGCAGCATATTCCCGGCAATGTCTTAAAGGCCCCAATCCCCCCAGCTTCGATTCCACATCATTGTAATAATCCGTCCAGATTCTGGCAGCCTGAGGAGAAAAGTGAAGGCAACGTCGTTCACCACTCATCGCCAGACTTTCATCAATAAGCTCATTGATCCTCTCTTCAAACAAATCCTGATACTGTGATGAATAATTATCTCCGGTTATTATCCTTGTCCCCTGCGTTGATGTTGGTTGACACATCAAAAACCTTGCATGATGTCCTGACGTTTTCACAATTTCTTTTTTTCGCGTACAAAAACCTTTGTGGTAAACATCAGGCTGAATCATCACCGATATCGTCAGTCTTGGCTCCTTCAAATTAATTCCGGGAGATGATTTCCTGTCGATGAAAAGAGAACCTCCATCCCACAAAGTGTTAATAATTCCCAGTTTACTCATGGCCCGGCTGTCAAAAATTACCCCCCCTTCACTGGATACAAGAGCAAAAGAGCGATTGCTATCGGAGTAATATTTTAACATTCCCTCTATCGTTGTCTCATTAAAAATTGTTCGACGTATCTGCGGCGGAACAGGAGGTTTATTCAGATGCGTTTCAAGCTCTGATTCTGTTGCCTTGTAATCTTTACCGGCACGAATCTCTTTATGAAATTTTGATTCCAGCGCTTTTTGTTTTTG
CTCCCATATTTCCTTTTCTGTACTGTAATTCTCAACCAGTTTCGCGTATTCATCCGCCAGGGCTTCATCCCTGAGATAAAATGCTTTCATAAACACTTTATCCACGGTCGTTTTCCTTTCACCGGAATCAGCCAGAATCAGAGAGTAAAGATTAACAGGCCCATGTAAATTTCCAGGTCTGCACACGTCAATCTGATTCTGACAGGCAATTGAGATCGCTGTTAATGCGGATGTTGCCACCATAGCCAAAGGTGCCTGTGTATTTTTTTGAGTTTCAATTATTGCATTTCTCACCAGCGGTGGTAGTGCATATATCGGATAAGGATTTTCTGGTGCAAGTAAGCACATAAAAGCCTCTCTTTTCATTAATGGGTTAGTAAAATAGCAGCAAATCCCGCTACAATAGCGAAAATAGCTGCTATTCAGACCTGATGCTCTGAGAAATACAGAGCATTTGTAGATCTGATGTTTAATAAACCGATAAGGTTAATTCAGTGAAGACTGTATTATTCACCACGAAATATCATTCATATTTCTCATTCTTAATACCTCCGGAAATTATTTCACTAATACCTTTCCCCCGGTCTTTTCCCTTTCTCCAGAACCATAGTGTAAATATTAACGGCCCCACGTAAATTTCTGGGACTGCATACGTCAACCTGATTCTGACAGGCAATTGACATCGCTGTTAATACTGACATTGCAACCAGAGTCAGAGGTACTTGTTTATTGTATTGAGCTTTAATTATTGCCTTCCCTATCATTGATGGTCATGCATATACTGAATACGGCATATCCGGTTTGATGTAAGGCATAAATGCTTCCTTTTTCGTAATACGTTTCGGTTATAGCAGCAATTACCGCTACAATAGCGACAATAGCGACAATAGCGACAATAGCTGCTCTTTCAACAATGAATGCTGGAAATTCTGTTTATCAATTAAACTTCTGGCTAAATTGTTCTTATGCTTCTTTTTCCTCCTGAGGTGATCACGTGATGACAAAAACTTATCTGCTCATGGTCATATCTGAAGAAAGCGTATGTCTCACTTCGTCATTAACACCTTTGACTGCTGATCTGGCAGGTACAGTATGTAAAGTAAGCCATTCGTCAATAGCTGATGAACGCCAGCCCACAGAACCACTACCAAGACGTACCGGCCTGGGGAAGGTTGCATCGTAGTATTTCGATAACGGATTCATTTTTTCGTAAATTGTCGAACGAGATATACCTAACAATTGGCTCAGTTCAGGCATCCGTAAGATGCGTGAAGGAGTCCTGCAGTGTCCGGATTGATCTGGCATTTTATCCCCCATAGTGTGGTGAACTGTCGTTACAGTTTGTATTCCCCGGGTAAATTTGTCTGCGTAAAAAAATGGGTTGAGAAAAAAGACAATAAATAATTAAAAATCAATATGATAGAAAACGACAGCATTAGTGTTATTTTACTTTTCCCTTATCGCTGTCGATTCTCATTATCATTCATGGCAAAAAAACAGCAAAAGAGGCCCTGGTAATGACTCCTGTCAGTAGTACATGTGTTTTCGATAAATGAACCAATGGCTTTTCGTGAATTTCCAGGGAGCGTAAAAAGCAAATCGTTTTCCGGGCAGGTCGTTTGATGTGAGTACGAAGTGTCAGATTGTTGCGCTCAATGCGCCGGGTAAATATCTTGCCGACAAGATGCTTATCCTGTGGCATTTCTCTGGTATAACTGCTCCGGTTGTCTCTGGTTATCATGCCTGCGGAGAATGGCTTCAGGAATTCCGGCAACTCACGGCATGTTTCATCAGTGCGAGGACCAAAAGTGTAAGCCAGCACACCGTCAGCTTTGGTCTTATACGCGTACCAGTGCCACTGCTGACGAGCTTTGTTTTCGACAAAACTCCATTGTTCATCAAGTTCACAGATAAGTGCCACATCGACACTGGCGGCTGGAGATATGGTTACTTTACGTGGTGAGCGTTTTTTAAAGTGAGAATGACGGTGTTGATATCCA
CCTTCAGCGTTCTGGAGCTATCGCGAACCCCGGCTCCGTTATGAACCATTTCAACAATTTGCTCTTTAATGCCTGGTTTTCGGGTTTCGTAGATGTAATTCAGTTGAAATACACGTTTGCAGGATTAGCACTGGAAACGCTCATGATCTGAGGTGCTGTGATCATAACGGTACACTTTATAGGAATTACAGCGGGGAGAATATACTGTAACTGTTGCCATATGGTCTCCAGATACCAATAGAATACAACATTAATCTATCGTCAGAAGGCATCACCGGGGCCTTCTGACATAATCTGTTAATACGTGTACTATTACCCTGTTCAGAAAATATTGATTCAAAAATGAAAACCAGTTAACAGAAAAGCAAAATGATATAATGTTAAAATTTTATATAGTGCAATAAAAGGAGAATGTTATGTATAATTTTATCACTATAATGTATGATGTCTTTTCATGTTTTGGTGTTCTGGCTAAAAACCAGAATAGCCGTGACATCCGAAATATTAAAAATTTTTCCTCACATCAACATTCACTGGGCGACATGTTTGATGAATTAATAAACATTATTGATAAAGAACAAGTATTGAGTAAAGAACAACGAAAAGTTATATTTAGGAGATATGAAGATCTCTATGTTAAGCTAATGCACTATTCTGTTTTTACAGACAAAACACATCAAATAATAAAACAAAAATATTTTAATGACATTGTACCAATGATTCTCGCACTCGACATCAGGAACACATATCGCCCGGATAATGAGATGGCATTTTACTATCATATTCATTCTTTTCTCACTCAGATACCGGATAATGAGGATGATATATATCATGCTGCAAGGACATATCTGCGAAATTACGTTAAGTTATGTTTATCCGGATACACGCCAGCGAATGCGCATTTCAAAGATATCTTTGATGGCGTATATGAATTCATTCGTAATATTCGCAAAAACAGTACACCAGGAAAAACAAAACTTATCGCAACTATCAACACATGCAAAGAAACCTGTAAACATCTGCTTTATTTAAGTAATGAAGACAAGGAAAAAATAATTTCTGACTTAGATAAAGTTCAGGTTGCATGTTATTATCTCACTATATTACTGGCTTTCGAAAGACGAACTTCATTAACAAGCACCCTGACAACTTTATATAAAATGCTGATAAGCGAAAGAGAAGTTTCAGAATATGAATGCCAGTTATTATATTTAACCAACCCAATAGATGTAATGAATATACTGAACAAATACATATATTACTTTCCTAATGAGAACTCACCATTTTATACACTGAAAATTGACAGTGCATTATCGTGGGATGCCATTGACGCAATACGAGACTATAGTATTTCTGATATTTATCTTTATCCTGAACAAAAAACAATAAATTGTGTCGTTGAGATTGAAAACATTGTCTTTGGCGGTTACATTTATACATTGAACAACGGCGTCACATTACAAAACATAGAAAACTCTTTAAAAGATTCTTCATGCCATTATGTCTTAAATGGCTATACAGAATTTGTTAACTGTTTGAGACAACTTACTTCAGGAAAGACTGAAAGTGTTCATCGCACCATCAATAAACTGAACTATGAGAAATTACCTTTTGGATTTATCATTGCCGCGTTTGCTATACTAAAGATAGCATTTAAAATAAAATTCAGTAAAAATCATGTAAATATCCGAGCATTATTAAATGACATCAATTATTTTATGACTTATCAGGGCGAGTCCATTAACCTTATTTCACTGGATCACGAATACCCAGAGTCCTGTCTTCAAAATGACACAAACACATATTTATTAGGAAGAGTAATATTTCTGTATAACTCAATGATTTATAAGTTCATAAACTGTCAGGAACATGAAACCAATAACATTCACTCAGCTATGATAAATAACCTATTACAGGAAGTTGATATAGCCCTTGGTAAAATAAATGACATTATAGACAGCAGAAACATATCAGCCCCCCATGAACTGGCAAATATTCTT
ACCCGCGAAAAAATACTTACAACACGGGAAAAAAAAGGAAACCTGATAAGCCTGTTTGATGGATTCACTTTATTCCATTGTGTTGGAATGATAACCTTTCTTATCCATTATCTCAGAACACCTGAAGAAAAAGTTGAAAATATATTTATGTTATATGGTGCAGATAAAAACAATAAACTACGCAGAAGACTGATTTATGACGCACTAGGAATAATTCAGTCTCAGCAGGAGTGAAGAGTTAAACAGGCAAATTATTTTTTATAAAAAGGGGATTGTTAAGTAATCCCCTGTTAATCAAAATACGCTTTCCTGACGATCTGATAATTATCAGCTCAATACATTTGACATAAAAGCGGTTTCTTAATTACTACTGCTACGATCACATATCAAATCATGATCGTACAATATACATTTTCACTATTATTTATTCACAATTTGAGACATCACATAATCCGCCCATGCCTGCATCATCGGTACACGCTGCTCCAGTAGATCAGTACGATGATATGCCGCCTCAACCTTATTTTTCAGCGTATGAGCGAGCGCCCTTTCCGCCAGATCCCGCGAATACCCCTGTTCGCTACACCAGTCCCTGAATGTTGAGCGAAAACCATGTGCCGTGGCAACTCGCCCCGGAATGTCACTGACGGCTTTCTTTTTACGTAGAAAACTTGTCAACACCATATCGGAAAGGATCTGCTGCTTTCTGGGTGAAGGGAACACCAGTTCATCATGCAGGCCACGTATATTTTCCAGAATGTAAATAGCCTGCCGGGATAAAGGAACACGATGCTGTAGCCTAGCTTTCATTCTTTCTGCAGGTATAGTCCATACCCGCTTATGAAAATCAATTTCAGCCCAGCGCATTCCCCTGGCTTCGCCCGAGCGAGTTGCTGTAAGTATCACCATTAATAACAGTGCGCGGGTAACATTATAAGGTTCATCGGTATACACACTGGTCGCCACAAAAAGCGGTAACTGCCTCCAGGGCATTGCGGGTTGGTGTTCATCACGTCCTCTTGTCTGCTGAGGAAGCAAATGGTCAACCACATCAACAGGATTTGCTACACAAAAACCGTGCGCCCATCCCCACTGCATAACAACATGAATGCGCTGTTTAACCCGGCTTGCCGTTTCTGACAAGGTTAACCAGACTGGACGCAGTGTTTCTGCCACATCCGCAGCCGTAATCGAATCCAGCGTTTTTGCTCCCAGTTGAGGAAACGCGTAATTCTCAAGCGTCGATAACCACTGCCTTACATGCTTTGGATTTTCCCATCCAGGAGACAGTTCTGCATGTACACGCCTGGCTGCATCGGCAAATGTTGGGATAGCGACTTTCTCAGATTCAGCCTTTTTAATCTCCAGAGGATCATCACCTGCAGCAAGTTGCTCTCGCATTATCCGGGCAGTACGTGCAGCTTCAGCAATACTGACCTCTGGGTAAGTTCCCAATCCAGCATTACGTCTTTTTTGTGTCACCGGACTTACATAACGAAAAACCCATTTCCCCCGCCCCTTTACTGAAGAAGGATGAAGGGTCAGTCCGGTAATTCCCCCATGGGGCAATGGTTTGTCATCAGGTTTGATATGTCTTGCTTTCGTATCCGTCAATACTGCCATACGCTAATTCCCGTCACTCTGGTATGCCATCCAGTATGCCATTAGTCTCTCGCTTCACTAAGAATACATTAGATAATATCGGACAATAAAAAGTGTAATACATTGTTTATAATGAAATTATATAACACCCCTGGATTAAGTCAGATTTATTTCAGGCGGTCCCCCTCACCGCCATATTTAAAGAAGAGCCCGTACGAAAGTACGGGCTTTTTTTTCGTATATTGCACACACCGGGTGCCCCCGCCCACGTCTGTTAGGGCGAGGGAAAATTGTGCACATCAACTGTCTGGTTACTCAGTCAACAATCAAAATGCGCAAGATAGTAGAGAACATATATGCGTTATCTGGTCACTTCCTGTAACTTCCTCACTTATTAA
AATGCCCCCAACGGAAATACATGTCACATGCCATCATAAAATCGTTATGCCAGAGGCATCGACAATGGAAAAAATAGTAAATTAAGCACGGTACATAGCAGAGTAAAAAATAGGCTTCAAACCAAATCCGCCATTCCACAAGCGTTGCCAATACCGACACCAGCGTAATCCACATCATTTCAATAACCAGCAATATGAATAACCGAAGCCAAAGCGGCTTGGCAAAAAAGGTCGACAACGCATTTTCACTGGAGAAAAACAGATCGATTTCCTGCAAAAAAAGCCTGCCCAGAAAAGCGTACCGCTTGGAGAAGCGCAGAAAAAACAGATTAATCGCCAACACTACGCCACAAAGAATGAGCAATGCAATTATCGATTTAAACGTTAACCACTGCCCCTCTGTCACAAAAGAATCACTGTTACGTAACGGGCTGATAGTAAGCAGGTTAACAACCGTTGAGATGCACACCGAAAGCAGCACATAGCGAGTCAGACTTTTATATTTTACTTTTTTCGCTACTGAAATAGCGACCTGCTTGACATAACTGGTGCGGATCATCCGAAACAGCAGAAAGATGACCAACGGCCCAAGTAACGAAATCAGGAACAGCGCATATAACGAGGGTAGAAGCTGGTATAGCCCCCAACTCATTGCACTCAACATGATTAACGACAGAAATCCTGCCACCAGAGGTTGTGTAAGAAATAATTCTATTTTGTGGTTCTGCTTTTTCTTAAACCACGAAAAATCGATATTCTTCGTGACGCATTTATAAGCCCAGTACGCCTTGAGATCAAAAATAAAATTATACAGCACCAGGGCCAGAATCCCACTAACGACACCAATATCCTCTGATGGAAAAGTAAATCCTGCATTTTTCCAATAAAAGCCTATCAGAATAAGTCCAATAGCATAAATGTATATCAGATTTCGTTTATCCTGAAAGCGAATAAAAGTTAAATAATCAGGAATCATTTGAATACCTTTGCATCACTAAATGCTTTACGACAATGCATAACTCGTTGCTGATCCACGCCAACAAAATCATTTAACATATTCCCCAAATAGGGCTTATTACTTATTTCAAGGAAAGCTACCGGGGTTGGCATATTCAGAAATGTATCGAGAGATTCAGTCCATTTCACCGTATGGGTTAAGTGTAATGAGAGATTTTCTTTGATAATTGACGGAGCGACCTCGCTTTTTGCTGTTACGTTCATCAGAACCTGATGCTCTGGCGAAGCGATATCCAGTCCGGCAAGATAATCGCGCATTGCCTGAACGCCGTCTTCCATCAAACGCGTGTGCCAGGCACCGCTCACACCGAGTTTAACCGGTTCGTATCCAGCGGCCATCAGCAGCGTGGCAAATTCATTCAACGAGGCCTGCGTTCCCCCAATAACCTGCTGGCGCGGCGTGTTATCACAGCTAATATCCAGCGCAATGCCTGATTCCGTAATCATCGTCTGCAGTTGTTCGCGGTTGATGCCTTTAACCGCCTGATCTTACCCAGCAATAGTGGACACGCGGCTAAGTGAGTAAACTCTCAGTCAGAGGTGACTCACATGACAAAAACAGTATCAACCAGTAAAAAACCCCGTAAACAGCATTCGCCTGAATTTCGCAGTGAAGCCCTGAAGCTTGCTGAACGCATCGGTGTTACTGCCGCAGCCCGTGAACTCAGCCTGTATGAATCACAACTCTACAACTGGCGCAGTAAACAGCAAAATCAGCAGACGTCTTCTGAACGTGAACTGGAGATGTCTACCGAGATTGCACGTCTCAAACGCCAGCTGGCAGAACGGGATGAAGAGCTGGCTATCCTCCAAAAGGCCGCGACATACTTCGCGAAGCGCCTGAAATGAAGTATGTCTTTATTGAAAAACATCAGGCTGAGTTCAGCATCAAAGCAATGTGCCGCGTGCTCCGGGTGGCCCGCAGCGGCTGGTATACGTGGTGTCAGCGGCGGACAAGGATAAGCACGCGTCAGCAGTTCCGC
CAACACTGCGACAGCGTTGTCCTCGCGGCTTTTACCCGGTCAAAACAGCGTTACGGTGCCCCACGCCTGACGGATGAACTGCGTGCTCAGGGTTACCCCTTTAACGTAAAAACCGTGGCGGCAAGCCTGCGCCGTCAGGGACTGAGGGCAAAGGCCTCCCGGAAGTTCAGCCCGGTCAGCTACCGCGCACACGGCCTGCCTGTGTCAGAAAATCTGTTGGAGCAGGATTTTTACGCCAGTGGCCCGAACCAGAAGTGGGCAGGAGACATCACGTACTTACGTACAGATGAAGGCTGGCTGTATCTGGCAGTGGTCATTGACCTGTGGTCACGTGCCGTTATTGGCTGGTCAATGTCGCCACGCATGACGGCGCAACTGGCCTGCGATGCCCTGCAGATGGCGCTGTGGCGGCGTAAGAGGCCCCGGAACGTTATCGTTCACACGGACCGTGGAGGCCAGTACTGTTCAGCAGATTATCAGGCGCAACTGAAGCGGCATAATCTGCGTGGAAGTATGAGCGCAAAAGGTTGCTGCTACGATAATGCCTGCGTGGAAAGCTTCTTTCATTCGCTGAAAGTGGAATGTATCCATGGAGAACACTTTATCAGCCGGGAAATAATGCGGGCAACGGTGTTTAATTATATCGAATGTGATTACAATCGGTGGCGGCGGCACAGTTGGTGTGGCGGCCTCAGTCCGGAACAATTTGAAAACAAGAACCTCGCTTA
Protein sequences of DBSCAN-SWA_5 >CP034958|2835813:2880466|2872304_2872502_-|QAS85982.1|DBSCAN-SWA MIGKAIIKAQYNKQVPLTLVAMSVLTAMSIACQNQVDVCSPRNLRGAVNIYTMVLEKGKRPGERY >CP034958|2835813:2880466|2838469_2839732_-|QAS85958.1|transposase|DBSCAN-SWA MINKIDFKAKNLTSNAGLFLLLENAKSNGIFDFIENDLVFDNDSTNKIKMNHIKTMLCGHFIGIDKLERLKLLQNDPLVNEFDISVKEPETVSRFLGNFNFKTTQMFRDINFKVFKKLLTKSKLTSITIDIDSSVINVEGHQEGASKGYNPKKLGNRCYNIQFAFCDELKAYVTGFVRSGNTYTANGAAEMIKEIVANIKSDDLEILFRMDSGYFDEKIIETIESLGCKYLIKAKSYSTLTSQATNSSIVFVKGEEGRETTELYTKLVKWEKDRRFVVSRVLKPEKERAQLSLLEGSEYDYFFFVTNTTLLSEKVVIYYEKRGNAENYIKEAKYDMAVGHLLLKSFWANEAVFQMMMLSYNLFLLFKFDSLDSSEYRQQIKTFRLKYVFLAAKIIKTARYVIMKLSENYPYKGVYEKCLV >CP034958|2835813:2880466|2862202_2863639_-|QAS85979.1|DBSCAN-SWA MISDNKVTYHEVDFLLPAQRFNIQFSYVSQKGLPFIREFVLRLVHVASMSKAQIATYFGLTHRETEEAISDLVQRGELTLSSDGRLALTDKSNGYFSEVGEIPHLSTIQDSGCTLSFDLATFSCFSNQAFQDHWRGGVTLKVDDNNISQSEKLVEKHFQYQFNQILDKGYLSHLQVQVGKEQPSIYTVNSVNKLRQIPLRLTTEFKMDSDGKAVEREDYEQLNSSECVHELMAVEIARLSRHNNTMSIFKAMQSINDDLTLKLFDSKTNRLNPLFIKDMQALEEYSSSGRTTFLGPVYSKNNWEKLQKALAPALNQRIRDKTDYGGSPFIWVAPSDPYWSKSIRFISSLSNFLSKSATKDKQLYKPVLYVPVRDAADARVARQWKYELGQFHEYAKGLVEGLLDGNVEILHLEDELSAVIYHISQPEILPVTMPLGFITTNKETVKIIGTLVTEYIKGTTGFDKFHDCGAISAMIKKG >CP034958|2835813:2880466|2858644_2859820_+|QAS85978.1|DBSCAN-SWA MFDFDKIIERQNDKCRKWDHTFVCSRFGDVPESFIPLWIADMDFTSPPAVIDGFRRIVEHGTFGYTWCFDEFYDAVIAFQRKRHQVEVEKSWITLTYGTVSTLHYTVQAFCKPGDSVMMNTPVYDPFAMAAQRQGVQVLANPLRVEENRYQLDFNLIEEQLKTHRPTLWFFCSPHNPSGRIWREEEIRQVSDLCQRYGTILVVDEVHAEHILDGKFASCLTSGCAAQDNLIVLTSPNKAFNLGGLKTSYSMIPDDSLRQRFRQQLEKNSITSPNLFGVWGIILAYQHGLPWLDALNGYLQGNARYLADALQTHFPAWKMMNPESSYLAWIDVSADERSATQLTQHFARQAGVVIEDGSHYVQNGENYLRINFGTQRYWLEQSINRMLKNDK >CP034958|2835813:2880466|2851777_2851876_-|QAS85971.1|DBSCAN-SWA MISQIDKLEYVVKVQRNQSNPTMFNKIVVFFQ >CP034958|2835813:2880466|2848423_2849740_-|QAS85968.1|DBSCAN-SWA 
MNDKNIIQMPDGYLNKTPLFQFILLSCLFPLWGCAAALNDILITQFKSVFSLSNFASALVQSAFYGGYFLIAIPASLVIKKTSYKVAILIGLTLYIGGCTLFFPASHMATYTMFLAAIFAIAIGLSFLETAANTYSSMIGPKAYATLRLNISQTFYPIGAASGILLGKYLVFSEGESLEKQMSGMNAEQIHNFKVLMLENTLEPYKYMIMILVVVMVLFLLTRFPTCKVAQTSHHKRPSAMDTLRYLAKNSRFRRGIVAQFLYVGMQVAVWSFTIRLALELGDINERDASNFMVYSFACFFIGKFIANILMTRFNPEKVLILYSVIGALFLAYVALAPSFSAVYVAVLVSVLFGPCWATIYAGTLDTVDNEHTEMAGAVIVMAIVGAAVVPAIQGYIADMFHSLQLSFLVSMLCFVYVGVYFWRESKVRNALAEVTES >CP034958|2835813:2880466|2847398_2848412_-|QAS85967.1|DBSCAN-SWA MSTRIYLWRALFGEKPRILLENSDFTVTSFRYDSGVEGLKIANSRGHLIILPWMGQMIWDAQFDGHSLTMCNMFRQPKPATEVIETYGCFAFHSGLLANGCPSAEDTHLLHGEMACAAMDEAWMELEGDMLRLTGRYEYVMGFGHHYLAQPTVVLHKSSTLFDIKMAVTNLASVDMPLQYMCHMNYAYIPNATFSQNIPDEILRLRESVPSHVNPTPQWLAFNQRIMQGESSLSTLNQPEFYDPEIVFFADKLDAYTDQPEFRMIAPDGTTFVTRFSSAELNYVTRWILYNGEQQVAAFALPATCRPEGYLAAQRNGTLIQVAPQQTRTFTVTTGIE >CP034958|2835813:2880466|2869268_2869832_-|QAS85980.1|DBSCAN-SWA MYNANPNYEMDFMILKDVNEHMDGMFQRFSKLLPFRIDFAYRKDTPSFGHSCKHSMCMEIYRLLSETQTMLAGYYWVMEYTPNKGLHIHFIGYLDGQRHKKFYRISRQLGDIWSRITEGEGYFHLCSTKDKYPVRIDHVIHYSDKSAVDGLRYALSYLAKQDQKEHGIILGRSRLPEKSNRGRPRHN >CP034958|2835813:2880466|2839997_2841326_-|QAS85959.1|transposase|DBSCAN-SWA MHIGQALDLVSRYDSLRNPLTSLGDYLDPELISRCLAESGTVTLRKRRLPLEMMVWCIVGMALERKEPLHQIVNRLDIMLPGNRPFVAPSAVIQARQRLGSEAVRRVFTKTAQLWHNATPHPHWCGLTLLAIDGVFWRTPDTPENDAAFPRQTHAGNPALYPQVKMVCQMELTSHLLTAAAFGTMKNSENELAEQLIEQTGDNTLTLMDKGYYSLGLLNAWSLAGEHRHWMIPLRKGAQYEEIRKLGKGDHLVKLKTSPQARKKWPGLGNEVTARLLTVTRKGKVCHLLTSMTDAMRFPGGEMADLYSHRWEIELGYREIKQTMQLSRLTLRSKKPELVEQELWGVLLAYNLVRYQMIKMAEHLKGYWPNQLSFSESCGMVMRMLMTLQGASPGRIPELMRDLASMGQLVKLPTRRERAFPRVVKERPWKYPTAPKKSQSVA >CP034958|2835813:2880466|2846219_2846567_-|QAS85965.1|DBSCAN-SWA MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGIDWRQPKRLLTSLTML >CP034958|2835813:2880466|2872746_2873043_-|QAS85983.1|DBSCAN-SWA MPDQSGHCRTPSRILRMPELSQLLGISRSTIYEKMNPLSKYYDATFPRPVRLGSGSVGWRSSAIDEWLTLHTVPARSAVKGVNDEVRHTLSSDMTMSR 
>CP034958|2835813:2880466|2843934_2844078_-|QAS85963.1|DBSCAN-SWA MPGTYQSQSLSGIICRVVWLTFSVVTNGAIVIHTQRLKSDPGGNLLS >CP034958|2835813:2880466|2878710_2879202_-|QAS85987.1|DBSCAN-SWA MITESGIALDISCDNTPRQQVIGGTQASLNEFATLLMAAGYEPVKLGVSGAWHTRLMEDGVQAMRDYLAGLDIASPEHQVLMNVTAKSEVAPSIIKENLSLHLTHTVKWTESLDTFLNMPTPVAFLEISNKPYLGNMLNDFVGVDQQRVMHCRKAFSDAKVFK >CP034958|2835813:2880466|2870652_2872086_-|QAS85981.1|DBSCAN-SWA MCLLAPENPYPIYALPPLVRNAIIETQKNTQAPLAMVATSALTAISIACQNQIDVCRPGNLHGPVNLYSLILADSGERKTTVDKVFMKAFYLRDEALADEYAKLVENYSTEKEIWEQKQKALESKFHKEIRAGKDYKATESELETHLNKPPVPPQIRRTIFNETTIEGMLKYYSDSNRSFALVSSEGGVIFDSRAMSKLGIINTLWDGGSLFIDRKSSPGINLKEPRLTISVMIQPDVYHKGFCTRKKEIVKTSGHHARFLMCQPTSTQGTRIITGDNYSSQYQDLFEERINELIDESLAMSGERRCLHFSPQAARIWTDYYNDVESKLGGLGPLRHCREYAAKNAEYMARLAGLIYHSSGEEGEISPYTAEMARELAIWYGNEYVRLSNPLTFDNSALTVPVRLIPEELELFNWIKSYCIEKGILCMKKNDILQRGPNRFRKKDKINWLLDLLYEQNRVVPVIEGKTLCVAPNFDL >CP034958|2835813:2880466|2859833_2861723_+|QAS87710.1|DBSCAN-SWA MKKVLTLSLLALCVSHGAAAANYALNNDNIALLFDDTNSTVVVKDNKANHPLTPQELFFLTLPDESKIHTADFKIKHVEKQDNAIVIDFTHPDFNVTVKLNLVKGKYANIGYTIAAVGQPRDVAKITFFPTQKQSQAPYVDGAINSSPIVADSFFILPDKPIVNTYAYEATTNLNVELKTPIQPEAPVSFTTWFGTFPETSQLRRSVNQFINDVRPRPYKPYLHYNSWMDIGFFTPYTEQDVLGRMDEWNKEFITGRGVALDAFLLDDGWDDLTGRWLFGTAFRNGFSKVREKADSLHSSVGLWLSPWGGYNKPRDVRVSHAKEYGFETVDGKLALSGANYFKNFNERIIKLIKNEHITSFKLDGMGNASSHIKGSSFASDFDASIALLHNMRSATPNLFINLTTGTDASPSWLFYADSIWRQGDDINLYGSGTPVQQWMTYRDAETYRSIVRKGPLFPLNSLMYHGIVSAENAYYGLEKVQTDSDFADQVWSYFATGTQLQELYITPSMLNKVKWDTLAKAAKWSKENASVLVDTHWIGGDPTALAVYGWASWSKDKAILGLRNPSDKPQAYYLDLAKDFEIPTGDVAQFSLKAVYGSNKTVPVEYKNATVITLQPLETLVFEAVPVN >CP034958|2835813:2880466|2874154_2875972_+|QAS85984.1|DBSCAN-SWA 
MYNFITIMYDVFSCFGVLAKNQNSRDIRNIKNFSSHQHSLGDMFDELINIIDKEQVLSKEQRKVIFRRYEDLYVKLMHYSVFTDKTHQIIKQKYFNDIVPMILALDIRNTYRPDNEMAFYYHIHSFLTQIPDNEDDIYHAARTYLRNYVKLCLSGYTPANAHFKDIFDGVYEFIRNIRKNSTPGKTKLIATINTCKETCKHLLYLSNEDKEKIISDLDKVQVACYYLTILLAFERRTSLTSTLTTLYKMLISEREVSEYECQLLYLTNPIDVMNILNKYIYYFPNENSPFYTLKIDSALSWDAIDAIRDYSISDIYLYPEQKTINCVVEIENIVFGGYIYTLNNGVTLQNIENSLKDSSCHYVLNGYTEFVNCLRQLTSGKTESVHRTINKLNYEKLPFGFIIAAFAILKIAFKIKFSKNHVNIRALLNDINYFMTYQGESINLISLDHEYPESCLQNDTNTYLLGRVIFLYNSMIYKFINCQEHETNNIHSAMINNLLQEVDIALGKINDIIDSRNISAPHELANILTREKILTTREKKGNLISLFDGFTLFHCVGMITFLIHYLRTPEEKVENIFMLYGADKNNKLRRRLIYDALGIIQSQQE >CP034958|2835813:2880466|2877727_2878714_-|QAS85986.1|DBSCAN-SWA MIPDYLTFIRFQDKRNLIYIYAIGLILIGFYWKNAGFTFPSEDIGVVSGILALVLYNFIFDLKAYWAYKCVTKNIDFSWFKKKQNHKIELFLTQPLVAGFLSLIMLSAMSWGLYQLLPSLYALFLISLLGPLVIFLLFRMIRTSYVKQVAISVAKKVKYKSLTRYVLLSVCISTVVNLLTISPLRNSDSFVTEGQWLTFKSIIALLILCGVVLAINLFFLRFSKRYAFLGRLFLQEIDLFFSSENALSTFFAKPLWLRLFILLVIEMMWITLVSVLATLVEWRIWFEAYFLLCYVPCLIYYFFHCRCLWHNDFMMACDMYFRWGHFNK >CP034958|2835813:2880466|2856609_2856948_+|QAS85976.1|DBSCAN-SWA MKHLTEMVRQHKAGKTNGIYAVCFARFVLHGASDVPDEYVRRTIGPGVCKVNVATELKIAFSDAIKAWFAENQQSNDPRFYMRVGMDAMKEVVRSKIAVCGSANRLRLPAEA >CP034958|2835813:2880466|2835813_2836272_-|QAS85956.1|transposase|DBSCAN-SWA MGNEKSLAHTRWNCKYHIVFAPKYRRQVFYREKRRATGSILRKLCEWKSVRILEAECCADHIHMLVEIPPKMSVSGFMGYLKGKSSLMLYEQFGDLKFKYRNREFWCRGDYVDTVGKNTAKIQDYIKHQLEEDKMGEQLSIPCPGSPFTDRK >CP034958|2835813:2880466|2879304_2880466_+|QAS85988.1|transposase|DBSCAN-SWA MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWRSKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILPKGRDILREAPEMKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCDSVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAVVIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQYCSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISREIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENKNLA >CP034958|2835813:2880466|2842119_2842902_-|QAS85962.1|DBSCAN-SWA 
MMMELQHQRLMALAGQLQLESLISAAPALSQQAVDQEWSYMDFLEHLLHEEKLARHQRKQVMYTRMAAFPAVKTFEEYDFTFATGAPQKQLQSLRSLSFIERNENIVLLGPSGVGKTHLAIAMGYEAVRAGIKVRFTTAADLLLQLSTAQRQGRYKTTLQRGVMAPRLLIIDEIGYLPFSQEEAKLFFQVIAKRYEKSAMILTSNLPFGQWDQTFAGDAALTSAMLDRILHHSHVVQIKGESYRLRQKRKAGVIAEANPE >CP034958|2835813:2880466|2841699_2841876_-|QAS85961.1|DBSCAN-SWA MATEIKKFEKRDLAQAVIGVGVMVDFISRIMSVLADSWGCWCGLPIDEKTSNYLAYYL >CP034958|2835813:2880466|2852488_2853717_-|QAS85972.1|transposase|DBSCAN-SWA MIDVLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKELQRLLGKKTMENELLKEAVEYGRGKKVDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTDVLLRIHHVIGELPTYGYRRVWTLLRRQAELDGMPAINAKRVYRIMRQNALLLERKPAVPPSKRAHTGRVAVKESNQRWCSDGFEFRCDNGEKLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNELPASPVEWLTDNGSCYRANETRQFARMLGLEPKSTAVRSPESNGIAESFVKTIKRDYISVMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQQASNGLSDNRRLEI >CP034958|2835813:2880466|2854319_2854955_+|QAS87709.1|DBSCAN-SWA MVKSFAIGYTVRDVAKGSWIDESTVTLPKAPPLNTLPRATKVPEPQQPQEDYTFEGYRNADGSVGTKNLLGITTSVHCMADVEDYVVKIIERDLLPKYLSIDGVVDLNHLYGCGVAINVPAAVVPIRTIHNIALNPNFGGEVMVVGMQCGGSDAFSGVTTNPAVGYDSDLLVRCGATVMFSEVTEVRDAIHLLTPRAINEEVGRRLLEEMA >CP034958|2835813:2880466|2841557_2841737_-|QAS85960.1|DBSCAN-SWA MKKQAITWHIICNTLKPLRRSQFIMFIIVWPEGYLLKSALCGESPCAVGRLVYIAFIML >CP034958|2835813:2880466|2837000_2838146_-|QAS85957.1|DBSCAN-SWA MMKKSLCCALLLTASFSTFAAAKTEQQIADIVNRTITPLMQEQAIPGMAVAVIYQGKPYYFTWGKADIANNHPVTQQTLFELGSVSKTFNGVLGGDAIARGEIKLSDPVTKYWPELTGKQWQGIRLLHLATYTAGGLPLQIPDDVRDKAALLHFYQNWQPQWTPGAKRLYANSSIGLFGALAVKPSGMSYEEAMTRRVLQPLKLAHTWITVPQNEQKDYAWGYREGKPVHVSPGQLDAEAYGVKSSVIDMARWVQANMDASHVQEKTLQQGIALAQSRYWRIGDMYQGLGWEMLNWPLKADSIINGSDSKVALAALPAVEVNPPAPAVKASWVHKTGSTGGFGSYVAFVPEKNLGIVMLANKSYPNPVRVEAAWRILEKLQ >CP034958|2835813:2880466|2855536_2856361_-|QAS85975.1|DBSCAN-SWA 
MSNTDASGEKRVTGTSERREQIIQRLRQQGSVQVNDLSALYGVSTVTIRNDLAFLEKQGIAVRAYGGALICDSTTPSVEPSVEDKSALNTAMKRSVAKAAVELIQPGHRVLLDSGTTTFEIARLMRKHTDVIAMTNGMNVANALLEAEGVELLMTGGHLRRQSQSFYGDQAEQSLQNYHFDMLFLGVDAIDLERGVSTHNEDEARLNRRMCEVAERIIVVTDSSKFNRSSLHKIIDTQRIDMIIVDEGIPADSLEGLRKAGVEVILVGELASSL >CP034958|2835813:2880466|2857061_2858633_+|QAS85977.1|DBSCAN-SWA MTQKKSFKSKLWEFLQSLGKTFMFPVSLLAFMGLLLGIGSSVTSPSTITSFPFLGGEFTQLTFGFIAMVGGFAFTYLPLMFAMAIPMGLAKRNKAVAAFAGFVGYMLMNMSINYYLTATHQLADPATMKQVGQSIVLGIQTLEMGVLGGIVVGVITYFLHDRFQDTVLHDAFAFFSGIRFVPIITALTLSLVGLFIPMLWEYVALGIAGIGHIIQSTSVFGPFLYGVGVLLLKPFGLHHILLAMVRFTPAGGIEMVNGHEVAGALNIFYAELKAGLPFSPHVTAFLSQGFMPTFIFGLPAVAYAIYRTARPENRPVIKGLLLSGVLVSVVTGISEPIEFLFLFIAPALYAFHIVMSGLALMVMALLGVTIGNTDGGILDLLIFGVMQGMSTKWYLLFPVGIAWFAIYFFVFRWYILKHNIKTPGREVDVQGAQQAVEANTRARGKSKYDHELILRALGGKENIESLDNCITRLRLVVKDMGLIDQQALKAAGALSVVMLDAHSVQVIIGPQVQSVKTGIEALI >CP034958|2835813:2880466|2849767_2850688_-|QAS85969.1|DBSCAN-SWA MDIAVIGSNMVDLITYTNQMPKEGETLEAPAFKIGCGGKGANQAVAAAKLNSKVLMLTKVGDDIFADNTIRNLESWGINTTYVEKVPCTSSGVAPIFVNANSSNSILIIKGANKFLSPEDIDRAAEDLKKCKLIVLQLEVQLETVYHAIEFGKKNGIEVLLNPAPALRELDMSYACKCDFFIPNETELEILTGMSVDTYDHIRLAARSLVDKGLNNIIVTMSEKGALWMTRDQEVHVPAFKVNAVDTSGAGDAFIGCFSHYYVQSGDVEAALKKAALFAAFSVTGKGTQSSYPSIEQFNEFLTLNE >CP034958|2835813:2880466|2846563_2846944_-|QAS85966.1|DBSCAN-SWA MQKNVTPGRRKGCPNYSPEFKQQLVAASCEPGISISKLALENGINANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETLSISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR >CP034958|2835813:2880466|2852003_2852204_-|QAS87708.1|DBSCAN-SWA MIPWSGVTCRQEIRSMQGCEMNSNRLAAPVIFEDSSGCYPVCIKNPDVMDNITASTTRKCLFTIRV >CP034958|2835813:2880466|2861891_2862098_-|QAS87711.1|DBSCAN-SWA MAPLNDALYRYVMNTRLGTIHGTSVGELLAWIKEDENPRKGEMVLIIEGHKAQDDELPADALRTLALL >CP034958|2835813:2880466|2876158_2877361_-|QAS85985.1|DBSCAN-SWA 
MAVLTDTKARHIKPDDKPLPHGGITGLTLHPSSVKGRGKWVFRYVSPVTQKRRNAGLGTYPEVSIAEAARTARIMREQLAAGDDPLEIKKAESEKVAIPTFADAARRVHAELSPGWENPKHVRQWLSTLENYAFPQLGAKTLDSITAADVAETLRPVWLTLSETASRVKQRIHVVMQWGWAHGFCVANPVDVVDHLLPQQTRGRDEHQPAMPWRQLPLFVATSVYTDEPYNVTRALLLMVILTATRSGEARGMRWAEIDFHKRVWTIPAERMKARLQHRVPLSRQAIYILENIRGLHDELVFPSPRKQQILSDMVLTSFLRKKKAVSDIPGRVATAHGFRSTFRDWCSEQGYSRDLAERALAHTLKNKVEAAYHRTDLLEQRVPMMQAWADYVMSQIVNK >CP034958|2835813:2880466|2850993_2851776_+|QAS85970.1|DBSCAN-SWA METKQKERIRRLMELLKKTDRIHLKDAARMLEVSVMTIRRDLHQEDEPLPLTLLGGYIVMVNKPAPSMPVIHDVPKNHRDDLPIAILAAGMVNENDLIFFDNGQEIPLVISMIPDAITFTGICYSHRVFVALNEKPNVTAILCGGTYRARSDAFYDASNSSPLDSLNPRKIFISASGVHNHFGVSWFNPEDLATKRKAMNRGLRKILLARHALFDEVASASLAPISAFDVLISDRPLPADYVTHCQNGSVKIITPDSEDE >CP034958|2835813:2880466|2863635_2868594_-|QAS87712.1|DBSCAN-SWA MWDGGLQEQEVLAIEKIKAAFSVNVSKPDKPFRSGSISEQLKSYGFIGNEMFPWKGYAGFRFVEAKKEGEFDLVIVTHCNVIIVELKDWNHQPVTARGDTWFKGDKNMGRSPVSVTRSKKFMLDKKLKRLVDRFTNKGYIPIVHFFVVMTGNADFSALPEEQRRHTISLKDFLKFADRGSFNNYFKPHPATKVLNKDFHLFDDLFLGPQTAPKALRVNGYEANDMIFEHPKKVYREYLAKSEISTNSEALLRVWNFRNITGTKANTPEGRAQIVSREREVLQHINHQNRDLYNHCLRSLTSFQKDEVTAEYSEVYEVPPGHVRFNEFIGKYGKNFSDMDRLNVVKLLIAKFSDLHEMKIAHRDVADHSLWISPSKEVALSNFISAYHQPAGTVGDYRKLLSVGAVHVKDMLDKGELTPFQQDVHTLGLVAWHLFSGMRMSPKSLEKVQDNMLNSQHWYSSVLRDAVAAKFTSATEFFDALKQAEPAGKDIPTFDDTELDPYRHAINHARQYPEDDGFQFQVETVDKEVYISKGRLVKAWLNVGGQGYDPSINFQVLKFLKQVERLSSVKTTYLPQIREFGIASKSSSLYMVTDQVQGETWDKIAVPDDEKIDLIGKFVAAVEHLHGLGVSHGDIHPGNVIFETQSRLLFLIDIPDFSPSGDEPKNHSYSPEYIDNCTSFERDNYAVMKMSCELLGMSWGLESDIYPTIANAIRAELEDPVFGFKDLGRFKKAIDSNDLVPEQDLIEITAGNADEIISILPDNGHLYVKVKSNPKAPAEVNVTFSGIGGSFTAVFNKDQKTLVHGFRPRARVTIRKQDIDESQFEIDTGIKIIPGSPQDLSALTVLLNEEESFARAIELIAATEDVQVQEPLTLQLKDTFARLDKQTLEPSLREVLEIPTVKLWRAILDTETESYPNIEISGEVVPVADAHGELLLPYSADVDPLGAFRSSDEVEALQVDQEGVERFIGEVSLKKSELKEIRLVKVSSAAFKLKDSDIVFFRTRPTRASYQKRKRALERLLDRESVLPDLIDLFDPSCKQAAQNYGITLSDTDFARYDREDQHGNKISLNEQQRKAFNKLVNNGPLSLLQGPPGTGKTEFIAAFVHYLIEKQNTKRILLVSQSHEAVNTAAERIRKHCSRLGTELDVVRFSNREGAVSPGLKDVYSHAITTEKRELFNAEIKYRVEALSEAIGLEPGFISGVVLAEL
NLFRQIDHLEKLLYQVNNLTDSNESNELKDIAVELDFSIRSKLSQEYGINLDNGVKVSAAKDILISKLCTEYGVRPDEARRVKALAKISRDMQDAMSGERVNLDEFYSRSRQLVAGTCVGIGQGHIGIQENIYDWVIIDEAARSISSELAIAMQSARRVLLVGDHMQLPPLYSDAHKAALARKLGINNSRTEIDEVLRSDFARAFNSAYGAQTSAALMTQYRMAPPIGNLVSKTFYDGKLLNGVRAIPDVYQQAPEALRSVVTWLDTANQGHRAHHLEDRGTSIYNRCEADEIISVLKQVSENEEFVAKLSKLVSKDEAAIGVICMYAEQKRLLRQKFNQEIWSEGFKDIVKIDTVDSYQGKENRIIILSLTRSDKQHSPGFLRVPNRINVAMSRAMDRLLIVGNADIWKGNNKELPLGYVVSYMAERGQEAGYRFLSAQQGGKKK >CP034958|2835813:2880466|2855036_2855474_+|QAS85974.1|DBSCAN-SWA MVKKALGSIAKSGKTAIVEVLSPGQHPTKRELIYAATPARDFVCGTQQVASGITVQVFTTGRGTPYGLMEVPVIKMATRTGLANHWFDLMDINAGTIATGEETIEEVGWKLFHFILDVASGKKKTLSDQWGLHNQLAVFNPAPVA >CP034958|2835813:2880466|2844631_2846170_-|QAS85964.1|transposase|DBSCAN-SWA MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRMNFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRKPFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYCQSEIYGRQGVELSRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTPVQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTHLACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALTEEALEQIGQLYAIEADIRGIPAEQRLAERQRKTKPLLKSLESWLREKMKTLSRHSELAKAFAYALNQWSALTYYANDGWVEIDNNIAENALRAVSLGRKNFLFFGSDHGGERGALLYSLIGTCKLNDVDPESYLRHVLGVIADWPVNRVSELLPWRIALPAE >CP034958|2835813:2880466|2853893_2854133_+|QAS85973.1|DBSCAN-SWA MRNLHIFPVTSVLPHMVIATCVVSLDSKGIYSQVKCLIKYTNLVIINIYSGRMTREVDCQQTRPVVHRLLNKTPDISRL |
38 | Stx2-converting_phage(33.33%) | transposase | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
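The keyword rule in the note above can be illustrated with a minimal Python sketch. Only the keyword list and the pipe-delimited header layout (contig, region, gene location, protein ID, optional annotation, tool name) come from this record. The function names `parse_header` and `matched_keywords` are hypothetical, and the case-insensitive substring match is an assumption about how the flagging might work, not the tool's documented behavior.

```python
# Keyword list copied from the note above; the matching rule is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def parse_header(header):
    """Split a protein header such as
    '>CP034958|3094145:3102915|3101862_3102915_+|QAS86175.1|integrase|DBSCAN-SWA'
    into named fields (hypothetical helper)."""
    fields = header.strip().lstrip(">").split("|")
    contig, region, location, protein_id = fields[:4]
    # Anything between the protein ID and the trailing tool name is treated
    # as the annotation; headers without one yield an empty string.
    annotation = "|".join(fields[4:-1])
    return {
        "contig": contig,
        "region": region,
        "location": location,
        "protein_id": protein_id,
        "annotation": annotation,
        "tool": fields[-1],
    }


def matched_keywords(annotation):
    """Return the phage-related keywords found in an annotation string,
    using a case-insensitive substring match (assumed rule)."""
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


if __name__ == "__main__":
    example = ">CP034958|3094145:3102915|3101862_3102915_+|QAS86175.1|integrase|DBSCAN-SWA"
    record = parse_header(example)
    print(record["protein_id"], matched_keywords(record["annotation"]))
```

A plain substring match is the simplest reading of the note. It would also flag unrelated words that happen to contain a keyword, so a stricter whole-word match may be closer to what is wanted in practice.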
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
3094145 : 3102915
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >CP034958|3094145:3102915|DBSCAN-SWA TTTAACTGGAAACTTCCGTTGACCGGCTGTATTCATGGCTGCGAATTTTCGCCATCAGCTCGTCTGTCAGTTCGGACACCCACTGGATAGCCAGCCGCTTTTCTTCGTCGCTGCACTCACTAGCCGCTACAAGCTTGATAAAAAAATCAATACGCTGAAGCTTCAATGACTCCAAAAGATAGTCCTGCATCTTCCCTCCTATCATTACACGGATACACAAAAACTGTATATACACCCACTGTTTATATAAACAGTATAATAGGAACAGAAAAATGTAAAACTGTTTTTTGTCAGTTAATTGGATGTACTGATGTCGGTCAATAAAGCACAAAATGTTAAACAGCAGCCTTAGTACCATTGACGCCATTTGTCATCTTCCTGCAGCCTCTGGTTACGGTAAAAAATACGTAAACCGGCACCGGATGGAATGCTGCCGCCACGCAGAAGCAAATCAATCTCAGATGCACTACCTTCAAACCCCCTGGTTGTCAGTTCTGCCTCAAGCTGCAGGCGCTGCTGCTCCAAAATACTCTGTATGTATGCTTTTTTCCGCTTCGGTTTTACCAGTCTCAACCTGGCTGTCAGCTCCCGCCGTTCCTTCTGGCCCATGTTGTGGAGATATTCCTGCAGCTCCTTCTCATCCATGGTTTTAATATCGGGTAAATCACCCCCTGATTTGTTCAGATTTTCAACAGGGGGACAGTTATTGCCACGAGTCCAAGGGGCGCAAGCGCCCTGGTCGGCTGCCGCCTCCTGAACGTCAACGGCTTTACGAACCATTTTCCACTTCACTGCATGAGTGCAGATCTTGCCCTCTGCAATGGGTGACCAGATGCCATAAATACGAATGCCATGATCGCCATAGGCGGTCGGCTCTTCGTTGATTTCATAAGCGGTTCTGATGAGGTGATATTTACGGGGAACCAGTACGCCGCCCTGCTTCATGATGTAGGTGGCAAAACAACCAGCATCAGCAGCAGCCAGAATGGCATCAAGACGCGGGTTATCCAGTACCGGCGCACCTGCTTTTTTGTCACCCTGTTGCCTTGCCGCCTGACCAGCCAGCAAGCGAAGTTCACGGTAAGCCTGACGCCCCGGAATACCAAAGAAGCGGAATTGCTGAACACGATGCAGAGACGCCCAGGCATTCACGTATTCAGCGTTATCACGCAGAGATTTACCCGTTTCCTTGCTGATCTCGCCAGCCAGACCACGACCGTCAATGTTCTTACTGATATATTTCGCGATGTAGCTTGTCGGCGTTCCTTTGCGCGGGTTAATCAACTCAGACTTAAAGCGCGGCCCAGTGTTATTGCCCAGCTCCTCGCGGTCTTCACGGATGGCAAACTTACGCAGTAATGCAGTGATGGCACGGCGGTCTTTTTTGCGCATGAAACACAACAGGTGCCAGTGAACTGTGCCATCATGATGCGGCTCAGCCACCCGCACGCCATACCAGCGCAACCCGGCTTTGTGCATCGCCTTACGAAATGCAGCAAACATACCGACCAGATAATCGCTGCTTTGTCTTACCGTCGCGTTTGTCCAGGTCGGGTTTGGTCTGCCGTTATTTAGCGTGGAATGGAAACGCGACGGACAGGTGATAGTGTAGAAAACGGCGCAGTCACCGCGCATTTCCGCGATAAGCTCCAGACCTTTAACACAGGCCATCATCTCATTGCGGCGATGCGCAGGGTTGCTGCTGCTGGCGTTTACCACATCCTCCATGTCCAGCGTGTCGCCGTCTTCGTTCACCAGTTCATGAGAACGGAAAAACTCCAGCGACTTACGACGCTGCTCACGTTTATGCATCACGGCTTCATAGCTGACATAAGGAGATGCTTTTTTGCTGACCAGGCAAACAGCGCGCAACTGCTCTTCCCGCCATTCGCAACGCATCTTCCATAATTTCCGATACCACCAGTC
GGCGCACAACATACGCGCCAGCGAACCCGGAATGAGTTCATAGGGCACGGGTTTACGGCGGTTTCTTTTCCGACGGAGTTGCTCAAACGCAGGTGGGATGACATCCAGACGCAGGGTTTCCGCTGCCACCTTTTCCCATGTCTTGCGGATTTCTTCTGGCTTAACGTCATCGGTGGCATACAAATCACCACAAGCTGCATCAAGGCACATACTCATATGCGCAGCTACCAGGGTGGAAAGGCGTTTCACCTGATCCTGACTCATTTCAGGCAGGATCAGCAGGCCGTCCAGCCCTTCATGGCTTGCCATAAAGCGAAAAGATGCAGATAGCTGACTGTCGCGTACATGCTCCAGTCGTTCCAGACATGGCTTAATCGTCTCACGTAAATAGCGGGAATAAGCCTTTGGCCTGCCCAGGCTGCTGAAGTATTCAATACGTTGCATCAGCGGCTTGCTGATATGGGAAGGCTGGGCGTTGACGTCCGCCAGAATGACCATGTCTGAATTAAAACGCTGCTGCTCATGCGCCAGCTTTGCCCGGCTAATGAGCTTATCCTGCTCCATTTCGCGCTGGACAGGATCACGTGATTCATTAAAGAAATAACGCTCCCAGACCTGATCACTCAGTGCCTCGCGGCGCAACTGTTCCTGCTCGTTATCGGCAGCGTACAGAGTGATCAGGTTTGAAAGCGTAGAAACCGGCGCAACTTCCGCCGGGTCCAGATAAGGGTTAATGGCCTTTTTCGGGCTGTTCCATGAGAATGCTGCGGCAGCCTCGTTAAAGCCGCAGCAGTTGTTCATATCGGCATGACTCATGCACGTACTCCGTACACGGCAGAACTATCCACGCCACGCGAATAATCAAATCCCATCCAGCAGCGCGGCCCGGAAACAGCAATGATTTCTGTTGCTGATTTACCCTCGCCAGCTGCCACACCGATGCTGCGTTTTACCTTGATATAGTGGTGAGTAAAATTGCGATACAGCGAACGGATCAGGGATGTGTCACTGTTAGAAACAATGACCGGATGTCCTTCTGATGACCGATGTTCAAGAACGGATGCCAGGTGATACTGGTCATCTTCAGTGAAACCATCAGTGTGATAGCCGGAAAACGTACCGTCATATGGCGGATCGCAATACACCACATCTCCCGCCTTCAACATCGCCAGCGTTTCATCAAAGCTGGCGCAGATAAACGTCGCTCGCTGGGCTTTTTCTGCAAATGTGCGAAGTTCTTTTTCAGGGAAATACGGATTTTTATAATTACCGTAGGGAATGTTGAAATGCCCGCTCTTGTTATAGCGACATAAACCACGGTAACCGTGACGATTGAGATACAGGAAATATACCGCTTTCATGAAATCAGTAATTTCAGTTGAGCAGTTAAACTCCTGCCTTATGTTGTAATAAGCCACCTCCCTGTTTGCGATCTCAAATAAAACTCTGGCGCGAGATATAAACGATTCACAATCAGCGGCAACCTTTTTATAGAGGTTGATTAAATCAGGATTAATATCCGCAACCAGATAGCTGGGATAATCCGTTTCCATCATCACAGCACAGGAACCCGCGAAAGGTTCAACCAGTCGCGGGCCAGCAGGAAGGTATTTTTTCAGTTCTGGCATAATGGCGGTTTTATTTCCCGCCCATTTCAGGATGGTGCTCATACAGCACCTCCGTTGTAATGTTTGCCTTTCAGTTCTGCGATTTCCTGACAGGTAATGCAAAGCTGCACTCCCGGAATGGCGCGGCGTCGTGCTGGCGGAATTGGCGCTTCACATTCAATGCAAAGTACGCGTGACACGCCCGGTGATTTGGCACGGGCTGCACGGATATGGCGCTGGCGTTCTTCTTCAACGCGCTGCTGTACGAGATCCATTGCATCAGCCATTAGTGGATCTCCTGCGCTTCGTTCTGGATTGCTTCAGCAGTCACGCGCAGCAGTTCTGCCGCTTCCACGTGGTTTAGCTGGCGGGATGAGATATGACACGCCAGGC
TATCAAGGCGAGCAGCCATTGCTTCAGCCCTTGCCCGGCGTTCTTCCAGACGAGCCTCTGTCAGTAAAATATTAAGCCCTGCGTCATCCGGTCCGGTTTTAGTCGTGAGGGTTTCAATATTACGCATAATCAATTCTCCTGAATTTAGATAAAGGGATGCCCGGCGGGTTTACGCCATTAATTTCATTAGTTGGTTAATTCGGCATGGTTAGCCGTCTGGGAAATAAGCTCACCACTGCACGAAAATGATTCATTGCTTTAATCAGCTCCCGCTTTTCGTCAGTGGTCAGCTCATTAATGCTGATGCTATGACGTTCAGCTGGAATTTTTGCCATAAAGAATATGGCAGCCAGTGCCCGTTTATTTTGTTCATTATTGATATCCCGTGGATCACGCATATCTTTAATAAACCGCTCAAGCTCTGACTCAATATTCAGGCCAAAAACTTTCGCCCTTAACTCCGCAATGTGATTAAGTCCATTCAGGCGTTCACCGGGGCTTAATGGAACAGTTGCTGCAGCGCCATTAATTGCCATACTTCATATCCCCCAAACGCAGCTATCGTTCTTTGTTCTTACGGTAACGCTCAAGAGGAGATACATTTTTTCGTATCGTCTCTTTAACCTGCTCTCCCCGTAAAAACGTCCCATCCTTTAACGTGAAAAAGTAACTGCCATCGCCCGACAATGACGGATAGCAACAGAGCAAATCATCTTCAGGTACTGAATAACTCTCCCCTCTGTAACGAAACTGATAAACCACTTCACTTTCTGCCGCATACATTTGGACTTTCTCCGTTTCCTCGTGGTCAATTCAGACAGCAATTCATCTTGTGAATGACATGGATGCCAGCGTTTTCCATCCTCACCCGTGATCCAGCCGTGACCGTAGTGCATTGCCGGGCTTTGTTTTACCAGCAGCGATGCAAATGATGGTTCTTTCGTCAGCATAAACACCTCACAGCAAACCGAATGAAGCACCAAGGCCAGTCATGGTATCAACTGCACTCGCCATCGCAGGATTAGCCTGTAAACGGGCTTGCAATGAAACAGCAGCCAGCGCCATCAGTCGTGTTACAGAGTTAATGCTGCTGATAGCATCACGACGACCTGCACTGGTTTTTACATCGCCAGATACCGCACCTGCAGCAACACGCCCGATCTCTGCGGTTGCACTCATGACGTAATGCGGTAGTTTCTCTTTTGCCACCTCATTAATCGGAACACATGGCAGACAATGAATCTGTGCCAGAAAACCATCTACCAGCGTTGAATCTTCAGTCAGATCGGTAAGCAACCAAATTTCTGGTGCGGTTAATAAATGAGGTTGAGCTGGGTTCAGTTTGTTCCGCAGAATCTGCACATTCATGCCTGCACGTTCTGCCAGTTGCACCAGATTGTGGCGCAGTGCGAATGCACGACAGGCTTCATCAAAATGTGGATGTTTGGAAACTTGGTAATCAAACATGGTCGACACCTCTGATGTATCCCAAAATGGAACTAGTTGAATACAACATTGCAATCAGTAAGTGCATCAACGGTAAGAGCAGCAAGGTTGATCATTACCTTTTCTCTTTTCTTGTCTTTCCGAAGGCGATGCCGAGGGATGCGACCGTCAGCCAGCATATCGTTAATTGTGTCGATTGAAAGACCAGTAAGTTCGCTATAACGCTCAATTGTGACATGTGGCGTATTCAGGGTTATTGAAATGTTAGGGGTCATGATGCAACATCTCCTATTGGCTTGTGGTGAGTCAGTTTTAATCGTGGCTTTAACTTCACATTTCGGAGAATAGGATCACAAATCGGTTATGTCAACACATGAAATCACATTTCGCCATGTGGACGAAAAAAAGAAATCCTTAATCATGCAGAATCGCGGAGGGCAATCGGTTATAGATCGGATACTGAAAGCCTATGGTTTTTCTTCCCGACAAGCATTCTGTAATCACCTAGGTATATCGCAAAGTACAATGGCGAACAGGTATGCCCGTGACACTTT
CCCTGCTGATTGGGTTGTTATCTGTAGCATGGAGACTGGAGTGCCGGTCGAGTGGTTGGCATTTGGCACTGATACCGAGAAGGGAAGCATTACAAATAATGCAGAAAAAAGTCACAACAATTGTGACAGCAAGCATCAACATCTCAATAGAGAACAAGACATCCAAAATGAGAACTCTTTTACTATTAACCAAGGTGGAAAAGCAGCAATAGAGCGAATCGTTTTGGCTTATGGATTTAAGACAAGACAAGCTTTAGCTGATCATATTGGTGTATCAAAAAGTACATTAGCCAATCGTTACATGAGAGATACCTTTCCTGCTGACTGGATTATTCAATGCTCACTGGAAACCGGTGCTTCATTAACATGGCTAACCACTGGTAACGGGGCAATGTTTGAAAAGCCTCGAAACGATACTATCACTATCCCATATCATAAAATAATTGATGGATCTCTTGCTCAAGAAACCTTCTTGACTTTTGACTCTAAGTTGTTAGAAGGAACCTTTCTGCAACCTTTAGCAGTATTCATTGATGAGGAAATATATATTGTAGAATCAAAATTTAATGAAGTTACTGATGGCAAGTGGCTTGTGAATATTGAAGGGAAAATAAGTATCAAAGATTTGACTCGCATACCCGTTGGTATGGTTAAAGTTGTAGGCACTAACGCAAGTTTTGAATGCTTACTTACTGACATTATCGTTTTGGCAAAATGTAAAAGAGTTTTTACTAAAAATGTATAAAGAGAAACATCATGACTGAACCAACCAATAAAGATAGCGAAATAAAAAAACACCTATTAGAATTTCTTGATTCACAGTCTGAAAATATAGCAAAACACTTCTACTCTCATATAAAAGACTTAATAGAAGCAGGAGAGCTTTCTGAAGCTCATAATAACCTAGCGCTAATTGAAAAATACATAACTAGGCCACCGATGGATGAAGAACCCAATATAAATGAAAATAAAGCCAATAAAAGAAAAAATGTAAAATCACTTGAACCTAATAATTATGTAGAACATATAATACAATTAGAAGAACGAAACAGCATATTAACTCTACAGTTAGAGCATTATACTCAGGATCTTAATAGAAAAAACGCAATAATCGAAAACAACGTAAAACAAATTAATTCATTGATTAGTGAAAATAAGGAACTCCGTAGCCAAGTACAGCAACAAAGAATCGATGATAAAATCCCCACCTATGTTAACGATGTTAAATCAGATCTTGGTAGTGATGACAAACATTTTATATTGATGTCTATTATCTGGTCTATTGCAGGGGTATTTTTTGGCTTCCTTGCAGTAGTATCTGCTTTTTTTACATTATACATGAACTTAGATTTAAAAAATCTCACTAACCTTCAGTTAATATATATCTTCACGCGAGGATTAGTTGGAATCGCCATTCTTTCATGGCTATCATATATCTGCCTTAGTAACTCAAAAAAGTACACACATGAATCGATCAGGCGAAAAGATCGTCGACATGCTTTGATGTTTGGTCAAGTTTTTTTGCAGATATACGGTTCTACAGCAACTAAAGAGGATGCAATAGAAGTCTTTAAGGATTGGAATATTTCAGGTGACTCTGCATTTTCAGGTCAGACAGAGCAACCACCGAGTTTTGCGTCATTTTTGAATACAATCAAAGACAAAGTTAAAGTAACTGGAAGTGATAAAGAAACAGATTAATCATGAACATGTATGCTACTAAGTAAAAAATACATTGAATACTGTTGTTATATACAGTTAAATTTAGCCCTCTGATATGAGGGCATTTTTTATGGCAGTACGAAAACTCACCACAGGAAAATGGCTTTGCGAATGTTACCCCGCCGGACGTAGCGGACGCCGTGTGCGTAAACAATTCGCCACCAAAGGCGAAGCACTGGCCTTCGAGCGATACACCATGGGGGAAATAGAAGCAAAACCCTGGCTGGGCGAATCAGTGGATCGTCGGACACTGAAAGATATGGTTGAGCTATGGTTCA
AATTACATGGCAAATCTCTTACTGCCGGACAGCATGTCTACAACAAGCTGCTGTTGATGGTTGACGCCTTGGGAAATCCCCTTGCAACTGATCTCACCTCAAAAATGTTTGCTCACTATCGAGATAAACGCCTGACAGGCGAGATCTACTTCAGCGAGAAATGGAAGAAAGGAGCAAGCCCGGTCACCATTAACCTGGAGCAAAGCTATCTAAGTAGTGTTTTTAGCGAACTATCCCGTCTGGGCGAATGGTCGTATCCGAACCCACTGGAGAACATGCGAAAATTCACCATCGCAGAAAAAGAGATGGCATGGCTTACCCATGAGCAGATTGTTGAATTGCTGGCTGATTGCAAACGTCAGGACCCAATTCTGGCACTGGTAGTTAAGATATGCTTAAGCACAGGCGCACGCTGGCGTGAAGCCGTAAATCTTACCCGCTCACAGGTGACCAAATACCGAATTACCTTTGTCAGAACGAAGGGGAAGAAAAACAGAAGCATCCCTATCAGTAAAGAGCTTTACGAAGAGATCATGGCGCTCGATGGGTTCAATTTCTTCACAGACTGCTATTTTCAATTTTTATCCGTGATGGAAAAAACGTCTATCGTGCTCCCTCGCGGTCAACTCACACACGTTCTGCGCCATACGTTTGCAGCGCACTTCATGATGTCGGGTGGAAACATTCTGGCCTTACAAAAAATTCTCGGACACCACGATATAAAAATGACTATGCGTTACGCACATCTGGCACCGGATCATCTGGAAACGGCGCTCCGTTTCAATCCTCTGGCAACGCTGCCAAGTGGCGACAAAGTGGCGGCAGCGGTTGGCATTACCCCGTAA
Protein sequences of DBSCAN-SWA_6 >CP034958|3094145:3102915|3096882_3097740_-|QAS86167.1|DBSCAN-SWA MSTILKWAGNKTAIMPELKKYLPAGPRLVEPFAGSCAVMMETDYPSYLVADINPDLINLYKKVAADCESFISRARVLFEIANREVAYYNIRQEFNCSTEITDFMKAVYFLYLNRHGYRGLCRYNKSGHFNIPYGNYKNPYFPEKELRTFAEKAQRATFICASFDETLAMLKAGDVVYCDPPYDGTFSGYHTDGFTEDDQYHLASVLEHRSSEGHPVIVSNSDTSLIRSLYRNFTHHYIKVKRSIGVAAGEGKSATEIIAVSGPRCWMGFDYSRGVDSSAVYGVRA >CP034958|3094145:3102915|3097963_3098197_-|QAS86169.1|DBSCAN-SWA MRNIETLTTKTGPDDAGLNILLTEARLEERRARAEAMAARLDSLACHISSRQLNHVEAAELLRVTAEAIQNEAQEIH >CP034958|3094145:3102915|3097736_3097964_-|QAS86168.1|DBSCAN-SWA MADAMDLVQQRVEEERQRHIRAARAKSPGVSRVLCIECEAPIPPARRRAIPGVQLCITCQEIAELKGKHYNGGAV >CP034958|3094145:3102915|3094492_3096886_-|QAS86166.1|DBSCAN-SWA MSHADMNNCCGFNEAAAAFSWNSPKKAINPYLDPAEVAPVSTLSNLITLYAADNEQEQLRREALSDQVWERYFFNESRDPVQREMEQDKLISRAKLAHEQQRFNSDMVILADVNAQPSHISKPLMQRIEYFSSLGRPKAYSRYLRETIKPCLERLEHVRDSQLSASFRFMASHEGLDGLLILPEMSQDQVKRLSTLVAAHMSMCLDAACGDLYATDDVKPEEIRKTWEKVAAETLRLDVIPPAFEQLRRKRNRRKPVPYELIPGSLARMLCADWWYRKLWKMRCEWREEQLRAVCLVSKKASPYVSYEAVMHKREQRRKSLEFFRSHELVNEDGDTLDMEDVVNASSSNPAHRRNEMMACVKGLELIAEMRGDCAVFYTITCPSRFHSTLNNGRPNPTWTNATVRQSSDYLVGMFAAFRKAMHKAGLRWYGVRVAEPHHDGTVHWHLLCFMRKKDRRAITALLRKFAIREDREELGNNTGPRFKSELINPRKGTPTSYIAKYISKNIDGRGLAGEISKETGKSLRDNAEYVNAWASLHRVQQFRFFGIPGRQAYRELRLLAGQAARQQGDKKAGAPVLDNPRLDAILAAADAGCFATYIMKQGGVLVPRKYHLIRTAYEINEEPTAYGDHGIRIYGIWSPIAEGKICTHAVKWKMVRKAVDVQEAAADQGACAPWTRGNNCPPVENLNKSGGDLPDIKTMDEKELQEYLHNMGQKERRELTARLRLVKPKRKKAYIQSILEQQRLQLEAELTTRGFEGSASEIDLLLRGGSIPSGAGLRIFYRNQRLQEDDKWRQWY >CP034958|3094145:3102915|3099936_3100815_+|QAS87721.1|DBSCAN-SWA MQNRGGQSVIDRILKAYGFSSRQAFCNHLGISQSTMANRYARDTFPADWVVICSMETGVPVEWLAFGTDTEKGSITNNAEKSHNNCDSKHQHLNREQDIQNENSFTINQGGKAAIERIVLAYGFKTRQALADHIGVSKSTLANRYMRDTFPADWIIQCSLETGASLTWLTTGNGAMFEKPRNDTITIPYHKIIDGSLAQETFLTFDSKLLEGTFLQPLAVFIDEEIYIVESKFNEVTDGKWLVNIEGKISIKDLTRIPVGMVKVVGTNASFECLLTDIIVLAKCKRVFTKNV >CP034958|3094145:3102915|3101862_3102915_+|QAS86175.1|integrase|DBSCAN-SWA 
MAVRKLTTGKWLCECYPAGRSGRRVRKQFATKGEALAFERYTMGEIEAKPWLGESVDRRTLKDMVELWFKLHGKSLTAGQHVYNKLLLMVDALGNPLATDLTSKMFAHYRDKRLTGEIYFSEKWKKGASPVTINLEQSYLSSVFSELSRLGEWSYPNPLENMRKFTIAEKEMAWLTHEQIVELLADCKRQDPILALVVKICLSTGARWREAVNLTRSQVTKYRITFVRTKGKKNRSIPISKELYEEIMALDGFNFFTDCYFQFLSVMEKTSIVLPRGQLTHVLRHTFAAHFMMSGGNILALQKILGHHDIKMTMRYAHLAPDHLETALRFNPLATLPSGDKVAAAVGITP >CP034958|3094145:3102915|3100826_3101771_+|QAS86174.1|DBSCAN-SWA MTEPTNKDSEIKKHLLEFLDSQSENIAKHFYSHIKDLIEAGELSEAHNNLALIEKYITRPPMDEEPNINENKANKRKNVKSLEPNNYVEHIIQLEERNSILTLQLEHYTQDLNRKNAIIENNVKQINSLISENKELRSQVQQQRIDDKIPTYVNDVKSDLGSDDKHFILMSIIWSIAGVFFGFLAVVSAFFTLYMNLDLKNLTNLQLIYIFTRGLVGIAILSWLSYICLSNSKKYTHESIRRKDRRHALMFGQVFLQIYGSTATKEDAIEVFKDWNISGDSAFSGQTEQPPSFASFLNTIKDKVKVTGSDKETD >CP034958|3094145:3102915|3094145_3094334_-|QAS86165.1|DBSCAN-SWA MQDYLLESLKLQRIDFFIKLVAASECSDEEKRLAIQWVSELTDELMAKIRSHEYSRSTEVSS >CP034958|3094145:3102915|3098723_3099020_-|QAS86171.1|DBSCAN-SWA MLTKEPSFASLLVKQSPAMHYGHGWITGEDGKRWHPCHSQDELLSELTTRKRRKSKCMRQKVKWFISFVTEGRVIQYLKMICSVAIRHCRAMAVTFSR >CP034958|3094145:3102915|3099569_3099791_-|QAS86173.1|DBSCAN-SWA MTPNISITLNTPHVTIERYSELTGLSIDTINDMLADGRIPRHRLRKDKKREKVMINLAALTVDALTDCNVVFN >CP034958|3094145:3102915|3098264_3098606_-|QAS86170.1|DBSCAN-SWA MAINGAAATVPLSPGERLNGLNHIAELRAKVFGLNIESELERFIKDMRDPRDINNEQNKRALAAIFFMAKIPAERHSISINELTTDEKRELIKAMNHFRAVVSLFPRRLTMPN >CP034958|3094145:3102915|3099027_3099537_-|QAS86172.1|DBSCAN-SWA MFDYQVSKHPHFDEACRAFALRHNLVQLAERAGMNVQILRNKLNPAQPHLLTAPEIWLLTDLTEDSTLVDGFLAQIHCLPCVPINEVAKEKLPHYVMSATAEIGRVAAGAVSGDVKTSAGRRDAISSINSVTRLMALAAVSLQARLQANPAMASAVDTMTGLGASFGLL |
12 | Salmonella_phage(90.0%) | integrase | attL 3093815:3093828|attR 3102957:3102970 |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_7 |
3186473 : 3209702
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >CP034958|3186473:3209702|DBSCAN-SWA ACTATTCGATAATGGGTACTGTTGGCCATTCAATATCCGGTGCAGTTGTTGTATTAACACGGTTCAGCAACACCCGATACTTCTTCCAGGCTTCCAGCAACGAGTTTTCTTCCTCCGTTGCGATCTCCAGATCTACAGCATCCTGAAGTGGCGCAATATGCTCACTGAATTCCTGGATGTAGAACTATGTGGTGACGGTCTTCCAGCCATTCGGCTCCTGCTGTATCGAAGCATACCAGGCTATTTCAATATCGCTATGCTGCGGCAGCATTTAACCCCTTGTAATTCATCACCATAATTGATTTAATTCACAAACAAAACTATAACATGGTGAAATTAATGAAAAAAAACACAGATGATGGGGCTAAAATTTACACACCACTTACCCTAAAGCTTTATGACTGGTGGGTTTTGGGAGTATCAAATCGGCTTGCATGGGGATGTCCTACAAAGGAACACCTTCTTCCACACTTTCTGGAACATTTAGGTAACAACCATCTGGATATTGGTGTTGGAACTGGGTTTTACCTTACTCACGTACCTGAGAGTAGTCTGATATCTTTAATGGATTTGAACGAAGCTAGCCTGAACGCGGCATCTACAAGGGCTGGGGAATCAAAAATTAAACATAAAATTAGCCATGATGTTTTTGAACCTTATCCCGCGGCGTTACATGGTCAATTTGATTCCATTTCCATGTTTTACCTTCTTCACTGCCTGCCTGGAAATATATCTACAAAAAGCTGTGTAATACGCAATGCGGCGCAGGCCTTAACTGACGATGGAACTCTATACGGAGCCACAATTCTTGGCGATGGAGTTGTGCACAATAGCTTCGGTCAAAAACTGATGCGCATTTACAATCAGAAAGGCATCTTTTCAAACACAAAAGATTCCGAAGAAGGCTTAACACATATACTCTCAGAGCATTTCGAGAATGTTAAAACCAAGGTTCAAGGTACTGTAGTAATGTTTTCCGCTTCAGGGAAAAAATAGCATCCAACCGCAGCTCGCTCTTGCTTAAGACGTGCTGCGGCATAATCCCAATGATTACTCCCTGACAGGGTTCGTAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTCTTCCAGACTTCCAGCAACAAGGTTTCTTCCTCCGTTGCGATTTCCAGCTCAACAACAGTCTGAACGTACCAGGAACAGCCTCCTTCAGGGCTTGAAGGATATCAATGTTCGCTTCCTGTTAACTGCCCGACAAGTGCAACCAGTTCGCTTACCTGATTTTCCAGAGTGGTGATCCGGGTGCCTGCTGTTGCCAGATTTTCACGTAACGTTGTATTTTCCTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAACCCCGTCACGGCGGAGTAGTCAACATTAAGATAGCGAGTTTCTTCGCGTAGCTCGTTGCCGTCAACGGTCGGACCTTGCAACTCTTCACCATAATGAGTAAACGATCCCACAGCTTCAGGTATCGCCTCCATTACTTCCTGTGCAATAACGCCAGCATAAGGCATTCCGTTTTCCTTGAGCGTGTAGGTGTACCCGTTCATTTTACGGATTGCTTTCGTCGCGTCGCTGATAACGAGAATATCGTCTTTAAGGTCGCGGTCTGATGACTGATTCAGCGTTGTGCAATTAATAGCGCCATTTACATCAAACAACTGGCCTGCTGACGTTTTTTGCGCATAAAACAGATACGCAGCAGACGTTCCAACCTCAAAAACGTTTTGTCGAGTACTGGAACCCCACACCCTGACAGAAAACGGTAGTTCTGCATTACCTGAGTTCTGTAAAACAAAACGATTGCCAGTCCCTGATTGTTTTGTAAGGGTTAAATCAACTGTTGAGTTAACCTCATCCTTGTTGATAGT
GAGCGCCTGCGCTTTATCAGTTCATCCAGCGCGGCTGCTTTGTTCATGGCTTTGATGATATCCCGTTTCAGGAAATCAACATGTCGGTTTTCCAGTTCCGGAAAACGCCGCTGCACCGACAGGGGGATCCCGTCGAGAATACTGGCAATTTCACCTGCGATCCGCGACAGCACGAAAGTACAGAATGCGGTTTCCACCACTTCAGCGGAGTCTCTGGCATTTTTCAGCTCCTGTGCGTCGGCCTGCGCACGCGTAAGTCGATGGCGTTCGTACTCAATAGTCCCTGGCTGGAGATCTGTCTCGCTGGCCTGCCGCAGTTCTTCAACTTCCCGGCGCAGCTTTTCGTTCTCAATTTCAGCATCCCTTTCGGCATACCATTTTATGACGGCGGCAGAATCATAAAGCACCTCATTACCCTTCCCACCACCCCGCAGAACGGGCATTCCCTGCTCCTGCCAGTTCTGAATGGTACGGATACTCGCGCCGAAAATGTCAGCCAGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAAAGCCCGTTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGGGGGTGTTTTAAATAAAAACATTAAGTTACGACGAAGAAGAACGGAAATGCCTTAAACCGGAAAATTTTCATAAATAGCGAAAACCCGAGAGGTCGCCGCCCCGTAACCTGTCGGATCGCCGGAAAGGACCCGCAAAATGATAATAATTATCATCTGCATGTCACAACGTGCATCTACGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATCATATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGACACTAAATACGGGACAACCTCATGTCAACGAAGAACAGAACCCGCAGAACAACAATCCGCAACATCCGCTTTCCTAACCAAATGATTGAACAAATTAACATCGCTCTTGATCAAAAAGGGTCCGGGAATTTCTCAGCCTGGGTCATTGAAGCCTGCCGCCGGAGACTGTGCTCAGAAAAAAGAGTTTCTCCTGAAGCAAACAAAGAAAAGAGTGACATTACTGAATTGCTCAGAAAACAGATCAGACCAGATTGAAGCAATTTAGATAATCGTGCAGACTACGCCCCTCATATCACATGGAAGGTACTACAATGGCTCAGGTTGCCATTTTTAAACAAATATTCGATAAAGTGCGAAATAATTTAAACTATCACTGGTTTTATTCTGAACTAAAACGTCACAATGTCTCACATTACATTTACTATTTAGCTACAGAGAATATTCATCTTGTTCTTGAAAACGATAATACGGTTTTAATAAAAGGACAGGGTAAGGTTGTAAATGTAAGATTTTCAAAAAATAAATGCCTTATAGAAGCCACCTTAAAAGGATTCAAATCAGGAGAGTTATCATTTTACGAATACAGGAAAAATCTTGCTACAGCAGGGGTTTTCAGATGGATTACAAATATCCACGAAAACAAAAGGTATTACTATACCTTTGATAATTCATTACTCTTTACTGAGAACATTCAGAACACTACACAAATATTTCCGCACTAAATCATAACGTCCGGTTTCTTCCGTGCCAGAACCGGACTCGCTGGCATGATGAAATATGTGTACCCGGTAACCCCGGTGTGCATCGTTTTTGATTATTCCCGCACACTCGCGCAGAAGGAGTTCCCCGTCGGGCTACGGTCTCTGTTAATACGGGAATACGGCGACGATACAGCGCATGATGTGTCAGGCTTGAATACCTTTATCCTTTAAAAGGGATATCAGTTAAGTTATCCCGTGTAGGGTATAAACCATTATCAAAGCCACTCTGTAGGAAGTGGCTTTTGTAATGGCAATAAAAAGCCCCGCGAATGCGAGGCTAAATCCTGGTATTTGTAATGACTGGTTCTTATCTCAACGCAGCCCCTTACCGCGCGCAAAATGCTCAATATC
AAGCATCAGCAATGAGATGTTTAATCTGGATTCACTCCAGAAGTGAGCACCACCCTGTCTACAGAGCCAGATGTGAAGGATGATGAGTAAAATTATCGCTATCATCGAAGGCATTGCGTCCTGATGTATTCCTGAAGCGTTCTCAGTGCTGTTTGGTCGCGGATAATTCCGTCCCGGATACCGAGAACGTTTCGTCCAGCAACTGGAGAGAGTTCGACGGTGGCATCATTGCCCATGCCGGAGGCGCTGGAGGTTTCGGCTGAGGATGGCACAGGGCATTTTCCTTTGACGAGCACCCGACCACCATTATCAAGCTTGCGCCGAAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTATCATCGAGTGCATCAGCATCACGCTGGCGCTGCTGCATGTCAGTAATGGTGGCGTTCGCCTTCTCCAGTTCACTGGCCTTGTTATCGCGCTGTTCTTTGTAGGCGATTGCATTATCACGGTAATGATTAACAGCCCATGACAGGCAGACGATGATGCAGATAACCAGAGCGGAGATAATCGCGGTTACCCTGCTCATTGTTGCCCCCACAAACAGACCTCACGCTCAATCTCACGACGAGTCATCAGGCCTTTCCATTGCTTACCGCCAGCGTATGCCCAGCGACGTAGCTGGTCACATGCGCCCTTGATATCGCCCTGGTTTATTTTGCGAAGAAGAGCGGATGTTCTGAAATTGCCTGCGCCCACGTTATAGACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCTACCGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCATTCTGCTTCGGTATACGTTTTACCGGGAATGATGTCTTTTCCTGTATGCCCGTGACATACAGTCCATACGCCAACGATATCTTTGTATGGTATGTAGCTGACACCTTCCAGACCATCGTTACCACTTGGGCCAGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACGGCTTTTCGTAATGATGGAGGCATTATTCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAAGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGCGTGACTTTATCGAGCAGCTGTAAAAACCAGTAGCCAGCACTGCCTGCGGAGGTGCCGTAGGCAATGCCCGTTGTTAACTTATCCATGGATTTCATAGCCTCACCTCCGCAAATAACGGATGGTGTACACGGTTCGGAACGAAGAGGAAAGGTATAGAAGTTACATTAGCGTAAGGCTTGAACATCTATTCAAAAAGAAAAACGCCGCGATTATTCTGGCGTAGCTGAAAGCATCATACAATTATCAAATACGAAAATTACAAAATCATTAAAACGCATCACGTTACATCATGTCTTTTTCTAAAAAAAATCTTGATGAATATTGATGGGGAGGAACACCAAAATACCTTCTGAAAACACTTACAAAATATGACGTGTTTTCATAACCGCATATCTCAGCAACTTTTCCAACAGAATATAAATTGTAGCTTAATAACCTTTCCGCCATCACCATTCGCTCTTCAAGAATTAACTTACTAAATGATAAGCCTTCGTGCTTTAATTTTCTTTTTAACAGACTTTCACTCAGATACAGTCTTGAAGATATATCACAAAGTCTCCATGCTGCAGATATATCCGTGTGAATAATAGCCTTAACTTTACTTCCTAAACTATTAAGACATCCAAATAAAAAACTTTGCACTATTTTCTCTGAAGATAAGATAGCAAGACATGCAAGTGATATTTGATTTCTAACAAAATCCACAGTTCTGCCATCACAATTCAAGCATGCAATCAAGTTCTTTAACAATGAAAAATCTTCACATTCCACCATCAAGTATGCCGGATAAAACCTTCTTACAGAAAAAGGTGAGAGT
GTGTTGCTTTTAAAGAAATCATTAACTGTTTTTTCTTCAACATCTACGATCATTACATGATCTATATTTGATGAAAAAAATCTTTTAAATTGTAATCAATGAGAACAGCACTTCCTTTTTTAAACAAAATATCTTCTTTACCAATTCGGACATCAAACGAATTCAACACCAAAATGATAGAACATATGTATGGCATATTATCCACCTGATATCATTGGGGTTACACCAGGTAAGTGTAGGTGGAAAATCAATATTCGCCAGTTCAACAATAAGGAAAATTTCATTACATCACAAGTATAAAATTATGTATTTAACTCACAAAGACAAATTATTAAACCAATCTGTTATATTATATATAGCTGCGTGGAATCATAATATCATATATTTTGACTGGCATGTTTACCAACTTTAAGTTGCATCTCAATTGTTTCTTCAGCGTAAACAGAGTTTTTATACAAACTGACACTCTGGGTATCATAGTGTAGTTTTTACGATTGTAAATATCCTGCATGCAGGAACTCATCCTTTTGGATGATATCGCATACAATTAATTTACCATCAGTCTTAGAGCCAGTTCGTCCGGATAGGGATCGAAGTAATTTTGTGTAAGCAAGTAATCATTAGGATACTCACCCAGATAATGCTTCAGCAGAGTCAACGGCGCAAGAAGAGGTAATGTGCCAGAACGATAGTTAAGTATAACCTCGCTCAACTCTTTACGCTGGCGTGTACTTAAGTAGTTACTAAAATACCCCTGTATATGCATCAGCACATTCGTGTGATTTTTACGTGATGCAGGTTTTCTGAGAATCGCCATCAGCTTATCACGATACACCTCAAAGTATGATTCAAGGTCCGCCCACTCGTGTATTGCAGCCACAAATGGTCCCATATCTTTATAGCCTGCCTGACTATGCGCCAACAACTGAAGCTTATAACGACTATGAAAAGCTAATAACTCTCTTCTTGATAATTTCTCCTTGTAAAGGTGATTGAGCTCATGCAAAGCAAAAACTCTTTCAACAAAATTCTCACGAAGCACTGGATCATGTAATCGCCCATCCTCTTCAACCGGTAGCCAGGAAAACTTTTCCATCAAAGTGCTCGTAAATAGTCCCACTCCATCTTTACGACCTCGATTACCATTTTCATCATAGACACGCACGCGCTCCATGCCACAGCTGGGAGATTTAGCACAAACCACAAACCCCGATACATCCTTTAATTTGTCCATATAAGAACGACTAAACTCTGTCATTCTCTCTGTCACATCCTCATTCTGGTCGTGGCTGAAACACATCCGTATATTTCCTTGCATCGAGCGCACAAGACGTAGAGCAGGACGCGGAACTGGCAGCCCTATAGCCATTTCCGGACATACTGGTCTGAATGTTACCCATTCCACTAATTTGTCCATTAAAAAGTCAGCTCTTTTGTGACCACCATCAAAACGAACAGCAGAACCGGCCAAACAACCGCTGATTCCAATCACAGGTTTTTTTATCATATTCTCCCCCTTGACTAATTCATTAACACATAAACTGTGTAGTGCACGGAATAAATTGCCTTTCTGGCGTCATCACTGACAATTTTTCTGTTATGGACTATTCCTAATATAGTATGAAAGTTCTTTAAGTGATCGGTCGTAATCATCTATCTTTCATACTTACTCTCAACTATCAAAAGTACAGGATTTATTATGAAGTTATGGCCTGTGTTGACTGGCATTGCACTCTCTTTCACTCTTATAGCATGTAAGGCCCCGACACCACCTAAAGGTGTGCAGCCGATTACAAATTTTGACGCCAACCGCTACCTCGGAAAATGGTATGAAATAGCTCGCCTCGAGAACCGGTTCGAACGTGGTCTGGAACAGGTCAGCGCTACTTATGGAAAACGGAACGACGGAGGGATTCGTGTACTTAACCGTGGATACGATCCAACGAAAAATAAATGGAGCGAGAGCGAAGGTAAAGCATACTTTACTGGAGATACTAAAA
CTGCAGCATTGAAGGTTTCGTTTTTTGGCCCCTTCTATGGTGGCTATAATGTAATCAAACTGGATGATGAGTATAAGTATGCTCTTGTCAGTGGTCCGAACAGAGAATACCTATGGATTCTGGCAAGGACCCCAACTATTCCAGATAAAGTAAAAGCAGACTATGTGCGAACCGCTCAAAAGTTGGGATTCAATGTCAATGAATTATTATGGGTTAAACAATAAAATCCCTACCCGAAATAATACTTATTAGAAAAAAACCAGCCTTTGGGGAGGCTGGCTAAATCAGGAAACAAGCTGTTATATGATAATAACTACGTTGTGATTCCAACATTTAAAATGTTAGACTAATGACAATCAGACAGCAACTTTTCCTTTAATTATTTCGAACAATCAGCATCCATCTCCAATCGGAGATCCAACACCATCAGCATACCCTCCACTACGCCCTCAGCTTTCTGAAGCATCCTGCCAACCCAACAATCAGATCGCCCATGCTTACGTGCAAGCGCCATAAAAGTCATGCCGCCGACATAATAGTCCACCAATAAATCATGCAAATCGCTGTTGTTCTTTTTCAGACGGGCCATGCACCCGCAAATGATCATCGCATCATCGTCACAACATTGCGGGCGAGATTTTACTTTTGAAGGAATTAATCCCTTAAAACCGGCGGCAATGGACGACCAAGTCACATCTTCATGATTATTAGCCGCCCACGCTCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTACTCCACAAAAATCAGACCAGAACGCCAATCACAAGCAAAAATCAACAAAACAGTATTAGTTGATTGTTATCTCTGACTTCATACTCCTGCTCCTGTCAGGGTTTTGGCGTAATTCTTCAGTATTCGGTAATCGGTCAAAACAGAACCGGGGAAACGATATAAGCGCAGATGCCCCCAGCGGTGGCGAAGAAGTTCTGCCATATAAAACTCAAACATCATTCATTCCCCATTTCGGTGATGGTCAGTTCCAGCCTCCCACCTTTGGTAACAGGCATCTTCACAACGCGGTAATCAACGACCTGAGCATCATCCAGCCAGAAACCTGCTTTAGTGAGTGCGTCAAAAGCGGCTTTTTGCAGATTATCCAGGTCACGGCGACGGCGATCCGGCATGTGGCACTCAATGCGGATTTTCACAGGCATAGCCAGGCCGATATCCAGCATTGCGTTTTTAATGATTCGGGCGACGTTATCGCGGTATGCCTGCCCCTCTGCGCTGACGTGCGTGCGCCCGCGATTATGGCGGTAATAGCGATTATTGCTCGGAGGCCAGGGTAATGTGATGTTGTAGGTATTCACGCCTTGATTACCCCCTCTTTCATCCAGATAACCTGCGTTCTCGCCATACCTTCCAGCGCGCATTCTTTTGCATATGCGGCATCGACAAAATGTGTGCGGCGGTCGATTTCGTCGTGGCAGGCAGAACATGCAATGGTGGCAATCAGGTCTGGCGGTTTGGTACCGGTGCCGCACAATCCAGTCAGCCGGATATGTGCCAGTACAGACGTTTCAGGGTTGCCATTACATACGCCAGGGGTTCTTACCTGGCATTCCCGACCACGCGCTGCTTTTCTCAAATCAGCCATGATTCCTCCTTGCTGCCAGTCGCAACCATTTTTTATCAACCAGGCTGGCGGTATATCCGAGCAGTGTTGGTATTTCGGATGGCTTCAGCTCAGGTTTACGCTTACGACGATTTGGTACTCTGTAGATGTGTCCGTTCATGACACGAATAAGCGGTGTAGCCATTACGCCTCCTGCTTGTCGCGCAGCAGCTGGAACTCGCAGCTCTGCGGAATAGTCAGGTGGCAGCCAATATTCATCGCCCAGGCTTCAACCTTACACAGGAAGACATACATCTCTCCGGCATCAAGATCGGAGGTATGGCGTAACGACTGGATAGTGGTGATTTCACCGGTTACGACATCAACCAGTTCTTTGGTTTCATAACCGAGGT
ATGTGTGTTTGAGAGCTTCTTTTACCCATGCTGCGGTAGCGAACGATTTCCCCCTGCTGATAAGGTATTCACTGATTTCGCTGTACCACATGTGGCTGAGTGCATTCTGGGAAAGACTGCGTTTCTCACGCCACGGTTTAAGCACCATGCGAAAGCATTTTCCGTCCTCCAGATAAGGCTGGATCTGCTGGCCGATAGCGGTGAAGTTGCCGCGATGTAATTTGATGCCATCTTGTGGTAGGTTCACGCTTCACCTCCGCAGAGGTCAAACGCAGGATGCAAAAAATCGCAGGTGCATTTCTGCATCTGTGAAGGGAGAAGAGAGTTTGGATTGTATGTGCGCATAAACGTCCCCGTTTAGCGCAGAAGTCACCGGAGTTGTTCAAGCTCCGATGACTTTATTATTACGAATTGATTTTACAAAATCAAAAGGTATGTTAGTGACGCGGGTCTGTTATTATGCGAGAAGGGTTTCCGTATAAAACAAGGACCTTACTTCCTTGAGTAAATAACGGATCTTTGCCTTGAACAATGGTCATTAAATTCCCATTCTCAGTTTCGACAACATATTCCATGCCTGTTTGTTTTGTTGCTGAAGATTCGATTGCTGCCCCGGCAATACCACCAATGACTGCACCACCAACGGCACCAACGATATTAGAACGAACTCCCCCACCAAGCGCAGAACCAGCGGTTGCCCCCACGGCAGCCCCAGCAGTCCCGCCTAACGCGGAAGTCCCACTGATATCAACCCCCCTGGCACTAATAACTGTACCAGCGATAGTTCGATTAACCATGCCCACAGAGCCAACAGAATAACTATTTGGCGATATATTTTGTGCGCATCCAACCAACACTAAGAGTGGAGCAATTACGAATAATCGCTTCATTTAGCTACCCTAACAGGAAACATTGGACGAGAAAGATCAACACTTTCTAATGCTTGCAAGAACTGCGTTATGTTGTTTTGCACCGCGCGATTAACAGATTCGCGTGCTCGAACAATACCGTAGAATGCGTAACTGGCTGGAACAGTACCGGTAGACTCAATATCCTGCGTATATATAATATCACCATTCGCACGGTTGATTATTTCATACCTTGCAATTGCTTTAGTTGTCATTGAAACACCAAAAGCAGGAACGTCAAGAGCCAACACTTTAACATTTAAGCTAACCGTATTTGGTGAACTATCACGAAAAATAGTCATTCGGTCGAGTGCTTCCTGCAAAGATTCACGCCAAATTGGAGTTATAGCCTCCATACCAGCAGTGATATCCCCTTTCTGCTCATCTGGACGAGCAAGTGATACCGTTAATGACTTAATTTCAGCATCTATTTTTTTCTGGCTAACTCCCACGTTAGGTGTTGAAAAATTCAATGGTGGCACACTAGCGCAACCTGTTAAAGAACCAATAATCATGGCTAATAATATTATCTTCTTCATAAATTTACCTTATTGTTATAACCAAAGGAATTATAAAGTAAAAAAGTTCACTATCACTAGCCATTAACGACATCAATTTCAGAGAAACATGGTACTCATTTCCACAAATTTGACACAAGTCATTTTCATCTACATATTCCATCATACTTGATGCATATGTTATTGAAGCCTCTATCCTATCCGTTCATAATAGCAATAGTTACCCGGGTGATAGTACCTCTATGATTACTCGTCTTTCTGATTGATTGGATTAAATATGCGCGCCAAAATTTATCAACTTTCGTTATGGATATTTATTTCGTTTCTAGCGATCTATGCCTTTATTATCTATAAAGGTTCTTATATTGGAGTAGCATTGCATCAAATTGCTTGGATCATCATTATTGCCTCTGGCTTGATTGCTAGGCTAACTAAACCAAAGCAAAAACCAATTTCGTCCAATAATTAGACATGTATTAAAAAATGATATTTTTATGTACATAGTCTATTGAAAATTGCCGCGATAAAATGCCAACACCCGCTTCATCGCGGCACTCTGGCGACACTCCTTGAA
AATCAGATTCGTGCTCACCTTTCCTTCCCGTTCTTCCCTGGTAGCGAACCGGTAATACACCGTTCGCCAGACCTTACCATCAATAACTAAGATTCCTGTCCGCGCCATTTTAGCCGCAGCCTGGTTTATGCTGGTTACTGTTGCGCCTGTTACCGCAGCAACGTCCTGCGCACAGAAGCTCTTATGCGTCCCCAGGTAATGAATAATTGCTTCTTTTCCCGTCATACACTGGCTCCTTTCAGTCCGAACTTAGCTTTGATTTCTGCAATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCCCATGACAGGAAGTTGTTTTACTGGTTCAGGGATCGCCTCACCACGGTTAATTCTCGCAGTCATATGGACAAGCTCATCTGCGGCCTTACGGCGTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTAACCAGCCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACAGGCCTCGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACGTTTGGCAGTCCGGCGATAACGGATGCTTCTTCCCGGCACCATGCAACAAACTGCCCGGGTGATGGCAGGAATGGTCGATTCTGCCGACGGGCTACGCGCATTCCAGCGTTAACCTGTTCCATTGTGGTGATCCCGTTTTCCCGAAAAGCCAGCACCCACTGGCGGCGGATTTCGTTCAGTTCGTTCTGGTCCCGGTTAGCCAGGCTCGCTGGGAAAGTTGCCAGTAACTGGCTGAATACACCGTTGATTATCTGCGCTACCTGCTGTACCTGCGGCTTTTCGTCGTACTGTTCCGGCATGTTATTGGCGATCCGGCACATCTGCTCACGGTCAAAGTTAACCATCTGTGCGGCGATGTTTTTCATAAATCCACCCCGTAAATCCAGTCAGTGTTCGTCAGGTCGAGTTTTGGTTTGCCGGCTGTCACGCCAGCCTGTTGCTTGTTTCGGTTGATTTCGAGCTGGGTCCACTTGTCGCGGAGTTTGGCCGGACTCAGCACGTTACCGGACCAGAAGTTGTCCTGGCAGGCCCAGCGGAACAGTACACACATGTCGCGGTGGTTACGTCCATCACGTTCACGCATCAGACGGATATCGTTAGCCCACCCTGCAAAATTCGGTTTTCTGGCTGATGGCGCGATGGTCTTCACCATGTCAAACATCCACTCTGCGGCGGTCAGGTCTTCTGCTGTCCCCCACTTGCTGCCGTTCTGAATCGCAGCATCCGCTTTCACCACAGGAAGGTCGTTTTCTGGCAGGTCAGAGGATTCGCCAGAATTCTCGGACGAATAAGGTTTTATATTGTCTTTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAGTGCCTTTACCTGATTTGGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCTGATTCGGGTAATGTTGACCATTCACTGACCACATTATTAATGCCTATATTCCGCCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTCAACTCGGAAAGTTGCTCGTTGCTCACCCAATCCAGTTTTTTATTAAAGCCATATGTTTTGCGCATGACAGCCAGGAAGACCAGAAGCTGGTGCTGTGTTAATCCGGCCAGCATTACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCGGCTCCTTTTGTGCCGCATCCGGCACTGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCAGCAGACCGGCTTTCGCCATTTCTGAACCTGTCATATCGCCCCCAGCATGGTAGTAACCATCGCCATCAATGGACCAGCCAGATCTGGGTCCACAC
GAAACATCGACACAATACCTTCACTAATTTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACAGCCTGTTTTGCCTCACTGAGTTCCTTTTCCATTTCAGCCAACCTAGCCATGAAGCTATCCTGCTCAACCAGGTAACCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATGGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAAGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTGGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATAGACGGCCTGCTGTGAAACTTCGCAAGCAGCGCCCAGTTTCTTTTGTGAACCAACGATATTGATCGCTGTTTTGATAGCTGGGTTCATAACAACCTCCGTGGTTAATTTGAATCAAGATTAAAACTATGGTTGTTTTTAGTCAACAACCATTTTCGTTTGATGGAATAAAACCTTGGTTGTACATTTGGACTATGAAAACAACACTCTCAGAAAGACTTAAAGAAGCCAGATTAGCGCGAGGCCTTACACAAAAGGCGCTTGGGGATTTGGTCGGGGTTAGCCAAGCTGCTATTCAGAAAATCGAAACAGGGAAAGCTAATCAAACAACTAAAATCGTGGAGATCGCGAACGCTTTGGGTGTGCGCGCAGAATGGTTATCTTCTGGCGTTGGAAATATGTCAGACAGTACAGTGCAACCAATACAATCAACTGTCAGCCATTCCAAATACTTCAAGATTGACGTTCTTGATATAGAAGTCAGTGCTGGGCCGGGAGTCATCAACCGTGAGTTTGTAGAAGTTCTACGCTCGGTTGAGTACTCGTTTGACGATGCTCGTCACATGTTCGATGGTAGGAAGGCGGAAAATATCCGCATCATTAACGTGCGTGGTGACAGCATGTCAGGAACGATCGAACCAGGTGATCTGCTGTTCGTTGATATCACAGTTAAATCTTTCGACGGTGATGGTATCTATGCGTTTCTGTACGACGACACAGCCCATGTAAAGCGCCTGCAAATGATGAAGGATAAGCTGCTGGTCATCTCTGATAACAAAAGCTACTCACCGTGGGACCCGATCGAGAAAGACGAGATGAACCGGGTGTTCATCTTCGGTAAGGTTATTGGGAGCATGCCGCAGACATATAGGAAGCATGGTTAAAGTGAGGCTAAAAAACAGTTACAGCAATAGGCCTGTTGTTTTTCTTTAAACACGCAGTGTTAAACCGCTCTTTGAGATGCGGAGTAATGAGATGGAAGACTTGAATCACATAAGGGTTAGTGATGGAGTGCGTAGCGAGCAGCAATAGTGCAATACCTAATGTCGTTGAAGTAATACGTCGCATCAATGAAGGTTCCACTCAGCCATTTCTTTGCAAATGTGATGATGGGCAGTTGTATGTTTTGAAGTCAAAACCATCAATGCCCCCGAAAAATCTCTTAGCTGAGTTCATTTCGGCGTGTTTGGCTAATGATATCGGCCTTCCTTTACCTGACTTTAAAATCGTATTTGTGCCAGAGGAACTTATAGAGTACTCACCTGATCTGCAGCAACAAATTTGTACAGGATATGCCTTTGCTTCATTGTTCATT
GACGGTGCAATAGCGTTAACGTTTACGCAGTCAAGAAACGAAACGATCATCCCAGTCGAACAGCAAAAATTAATCTATGTTTTTGATAAATGGATATTAAATGCAGACAGAACGCTTACTGACAAAGGTGGAAACGTTAACATCCTTTATGACATCAGTAACGATAAGTATTATCTGATTGACCATAATCTCTCATTTGATCAGAATGCTGGACCTGAAGATTTTTCTGTGCACGTGTACGGCCCTGGTAACCGCAAATGGCAATATGATTTAGTGGATCGCGTAGAGTACCGCCAGAGGGTCGTTAACAGTTTACACAAGCTTCCTGCTATCCTTGACGAAATTCCAGAAGAGTGGATAGTAGATGAGGAGTTTTTACCTTTTGTCTGCACTACGCTAGACAAAGGTGATTGTGATGAATTTTGGAGCGCAATAGAATGACAACTCCATGCCTATATAGCATCGTTCGCTATGCGCCTTATGCGGAGACTGAAGAATTCGCAAACATAGGCGTACTTCTGTGCGCGCCAAAAGAAAATTACTTTGATTTCCAGCTCACAAAGCGAAATGACTCTCGTGTAAAGAATTTTTTCCATGATGATTGTATTTTCCCTGTAGCAAAAGACTCAATACAAAGAGAACTACAGTTCGCAAAAATGCATGCGACCCAGATTGTTGGACATCAACAACTTGCACAATTCTTCAGATATTTTACAAACAAAAAAGAATCAATTTTTCAGTTCAGTTCTACGAGAGTGATTCTCAGCGAAAACCCAAAAGAAGAGCTGGCCCGCATTTACAATAAATATGTAAACCACTCTGACTACACAAAAGAGCGCCGTGAAGATGTTCTAGCCAGAGAGCTAAAACGAAGTATCGATAGAATAGATGGATTGAAGAACGTCTTCAAACAAGCAACCATTGATGGGTATTTCGCAAAGTTCTCAATGCCATTGGTCGCCAAGAAGCATGACAGGATCCAATGTGCCATCAAACCTCTGGCATTCACTCAAGCTGAACCAGGAAAAATGATGGAGCATAGTGATACTTGGGTGATGAGAATAACTCGAGCAGCAGAAGAAAACCTGCTTTCACTTGATGACATTTTATTCACAATTGAAACTCCTGAATCACCAAACTCAGGCCAAAGCAAAGTTATTGACATCATAAAGAGAACTATGGATGCTAAGAAAATAAATCATATACCTGCATCCAACCACAAAGAAACTATTGATTTTGCAAAAAAAATACTTCCCCAAGTTTAAAATTTATTTTTGTATGTGATATTCCTTATTAATAACCCGGCCACCGTGCCGGGTTTTCTTTTGCCTCCCCTCATCACACAAACCGCTCAAAAAACCACCATAACCTCGCTTCAGTTATCGCTATGCGATTCAAGTCACAAAATAAATCCATCCTAAATACAACCAGTTATATCTAAAACAACCAATAAAACAACTTTTGTTGTTGACGGTAAAACAACTATAGTTTTAAATAGGTTCATCGCAACAACACAACGATACGGCAACCACCTGATTCACCGTTGCGATGACCGCTTAGATCCGCAGTTTGAATTTCAGCAGGCTTCGGGGAGTGCGAGGGGTGAAACGGACGCGTGAACGTCGGTGTGACCAGCTGAAATCAACTCAACATTTCATACCTTAGTCGCTTCAACGAGGCGGCTTAGTTATGACAACCGGCGACCATCCACCGCCTGAATACGCGCAGAAGTCTCTATATGTTCAGCAGCCCAGCTTACGGGCAGGAGTTTTTATGGTTCATCAACATTACGGAACGCAGACCGTTAATCGCGGCGCGGTCATGCCAGGAATGCTGGTCAAACACAAAGATGGTACCTGGACTGCATCAGCTAATTTACGCGGACGGCTTTATCTGCATCGCGGCATCGAGCGCACTTATACCCGTGATTTGCTCGTGGAAGTTTTTCTCGACGGACGCGGTAACGGCCTGAATCACTAATCCCCTTTCCTGTTTTCCTA
ATCAGCCTGGCATTTCGCGGGCGATATTTTCACAGCCATTTTCAGGAGGTCAGCCATGAACGCTTATTACATTCAGGATCGTCTTGAGGCTCAGAGCTGGGCGCGTCACTACCAGCAGATCGCCCGTGAAGAGAAAGAGGCAGAACTGGCAGACGACATGGAAAAAGGCCTGCCCCAGCACCTGTTTGAATCGCTATGCATCGATCATTTGCAACGCCACGGGGCCAGCAAAAAAGCCATTACCCGTGCGTTTGATGACGATGTTGAGTTTCAGGAGCGCATGGCAGAACACATCCGGTACATGGTTGAAACCATTGCTCACCACCAGGTTGATATTGATTCAGAGGTATAAAACGGATGAGTACAGCACTCGCAACGCTGGCTGGGAAGCTGGCTGAACGTGTCGGCATGGATTCTGTCGACCCACAGGAACTGATCACCACTCTTCGCCAGACGGCATTTAAAGGTGATGCCAGCGATGCGCAGTTCATCGCATTGCTGATCGTCGCCAACCAGTACGGCCTTAATCCGTGGACGAAAGAAATTTACGCCTTCCCTGATAAGCAGAACGGCATTGTTCCGGTGGTGGGCGTTGATGGCTGGTCCCGCATCATCAATGAAAACCAGCAGTTTGATGGCATGGACTTTGAGCAGGACAATGAATCCTGTACATGCCGGATTTACCGCAAGGACCGTAATCATCCGATCTGCGTTACCGAATGGATGGATGAATGCCGCCGCGAACCATTCAAAACCCGCGAAGGCAGAGAAATCACGGGGCCGTGGCAGTCGCATCCCAAACGGATGTTACGGCATAAAGCCATGATTCAGTGTGCCCGTCTGGCCTTCGGATTTGCTGGTATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACTGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATCGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTTAAACAGAAAGCCACTGAGCAGAAGGTGGCAGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGGGATGATGCATGGCACAAATTACGGCTCGGCGTCATCACCGCTTCAGAAGTTCACAACGTGATAGCAAAGCCCCGCTCAGGAAAGAAGTGGCCTGACATGAAAATGTCCTACTTCCACACCCTGCTGGCTGAGGTTTGCACCGGTGTGGCTCCGGAAGTTAATGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAACGACGCCAGAACCCTGTTTGAATTCACTTCCAGCGTGAATATTACTGAATCCCCGATCATCTATCGCGACGAAAATATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAACTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAATTCCGGCTCGGTGGTTTCGAGGCAATAAAATCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACGCGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGCATGAAGCGTGAAGGCCTGCATTATGTCGTGGTTGAGCGGGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGTGCGATCACTTTCGTCTACTCCGTTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCCCTGAAAGCACAGCGGCTGGCTGAGGAGATAAATAATAAACGGGGAGCTGTATGCACAAAGCATCTCCCGTTGAGTTAAG
AACGAGTATCGAGATGGCACATAGCCTCGCTCAAATTGGAGTCAGGTTTGTGCCAATACCAGTAGAAACAGACGAAGAATTTCATACGTTAGCCGCATTCCTTTCACAAAAGCTGGAAATGATGGTGGCGAAAGCAGAAGCAGATGAGAGAGACCAGGTATGACAACCACTGAATGCATTTTTCTGGCAGCGGGCTTCATATTCTGTGTGCTTATGCTTGCCGACATGGGGCTTGTTCAATGACACCTCAGCAAGAAAACGCCCTTCGCAGCATTGCCCGTCAGGCTAATTCTGAAATCAAAAAAGCCAGACAGCAGTTTCCGGATAAAAACGTCGATGACATTTGCCGTAGCGTACTAAAGAAGCACCGCGAAACGGTAACGCTGATGGGATTCACACCGACTCATTTAAGCCTGGCGATCGGCATGTTGAACGGCGTCTTTAAGGAACGGTGAACATGAAAAGCAAAATCATCAGGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATAATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGATAGATGAACGAAGACGACTGGCCGTTCAGGGTTGTCGGACTTGTGCCAGTTGCCAGCAAGATCTGGAGCTTATCAGTAAACAGAGAGGTTCGAAGTGAGCGAAATTAACTAGAAGCCAAAGATAAAATCATCGCTGAGCAGGAGAAAATCGCTAACGGAGAAAAGACAGTAAGTCAGTATATGAAAACCGCATGATATCATCAGATAAAAATCGGTCGTAAAGCGAAATATTAATACCAGAACAAACGAGTCGAGGTAAATTATATTACCTCTATAAATTAACTAAAACTTGCCCGCTATATACTATATCATTCAGTATCATCACGCGCGGTCTGTGCATATGTCACTACCGCACCTAATATATTAATTTTCTTTTCAACATAGATAATATTATCGTACTCATAATTGCCATACGGATAGCAAATGCGAATATTCTCATGTAGATCGGGGTCATCCACCTCAGCTCCAGAACAACTTTTTGAACTACCGGAAGTATACCGATACGGTGCAACATAAGACGATGTCTCTCCAGGCAAAAAATAAGTTAGTGTCGTAAGGGGTATAATCAGAAAAAATCCAGCAAATATGCACATCCCTGCATAAACCTTAAGGTATGCTGACAGACTCTTCCAGCCGCTTTGTTTTACTATCCCCTTCTTAACCCAAAACAGAGATAACAGAAAAGCTATTCCCATGCTAAACAGAATGTAATAGTGGGATATACTCTGATTAAGAAACGTGACCCTGTAGATATCTGCCCGCCACCAGAAGAAAAGGAAAATAAAGATCAGCCCTGAAACTGTCATGCAAATCAAATAAGGATACGAATCTTTTTTCATGTTTAGCGCCCATAAAATTTTTCCTGACCCGGACAAATTTACCATCCATTTTTTGCGCAGAAAATAGCTCATTACTTACTGCACAATAATACACAAAATTGCGTAAATTTTTTGCATGGATTTTAGCTCTTTCAGCCGACATTTAAGGGGTAAATAGCATTTCCTAAAAGCAACTGCACCAACCCAACAGAATGGGCTACCGCTTACGTTGAGAGCAAAAAAGTGTATAGCAGCAATGAACAGCATCCTCGCACTGACGAGGATTTCTTTTATCTGAACTCGCTACGGCGGGTTTTGTTTTATGGAGATGATAAATGCACTTCCGAGTCACAGGTGAATGGAATGGAGAACCATTCAACAGAGTTATCGAAGCCGAGAACATCAGCGACTGCTATGACCACTGGATGCTGTGGGCGCAGATAGCACATGCAGACGTAACCAATATTCGAATTGAAGAACTGAAAGAACACCAAGCCGCCTGATG
GCGGTTTTTTCTTGCGTGTAATTGCGGAGACTTTGCGATGTACTTGACACTTCAGGAGTGGAACGCTCGCCAGCGACGCCCAAGAAGCCTTGAAACAGTTCGTCGATGGGTGCGCGAATGCAGGATATTCCCTCCTCCGGTTAAGGATGGAAGAGAATATCTGTTCCACGAATCAGCGGTAAAGGTTGACTTAAATCGACCAGTAACAGGTAGCCTTTTGAAGAGGATCAGAAATGGGAAGAAGGCGAAGTCATGAGCGCCGGGATTTACCCCCTAACCTTTATATAAGAAACAATGGATATTACTGCTACAGGGACCCAAGGACGGGTAAAGAGTTTGGATTAGGCCGAGACAGGCGAATCGCAATCACTGAAGCTATACAGGCCAACATTGAGTTATTTTCAGGACACAAACACAAGCCTCTGACAGCGAGAATCAACAGTGATAATTCCGTTACGTTACATTCATGGCTTGATCGCTACGAAAAAATCCTGGCCAGCAGAGGAATCAAGCAGAAGACACTCATAAATTACATGAGCAAAATTAAAGCAATAAGGAGGGGTCTGCCTGATGCTCCACTTGAAGACATCACCACAAAAGAAATTGCGGCAATGCTCAATGGATACATAGACGAGGGCAAGGCGGCGTCAGCCAAGTTAATCAGATCAACACTGAGCGATGCATTCCGAGAGGCAATAGCTGAAGGCCATATAACAACAAACCCTGTCGCTGCCACTCGCGCAGCAAAATCAGAGGTAAGGAGATCAAGACTTACGGCTGACGAATACCTGAAAATTTATCAAGCAGCAGAATCATCACCATGTTGGCTCAGACTTGCAATGGAACTGGCTGTTGTTACCGGGCAACGAGTTGGTGATTTATGCGAAATGAAGTGGTCTGATATCGTAGATGGATATCTTTATGTCGAGCAAAGCAAAACAGGCGTAAAAATTGCCATCCCAACAGTATTGCATGTTGATGCTCTCGGAATATCAATGAAGGAAACACTTGATAAATGCAAAGAGATTCTTGGCGGAGAAACCATAATTGCATCTACTCGTCGCGAACCGCTTTCATCCGGCACAGTATCAAGGTATTTTATGCGCGCACGAAAAGCATCAGGTCTTTCCTTCGAAGGGGATCCGCCTACCTTTCACGAGTTGCGCAGTTTGTCTGCAAGACTCTATGAGAAGCAGATAAGCGATAAGTTTGCTCAACATCTTCTCGGGCATAAGTCGGACACCATGGCATCACAGTATCGTGATGACAGAGGCAGGGAGTGGGACAAAATTGAAATCAAATAA
Protein sequences of DBSCAN-SWA_7 >CP034958|3186473:3209702|3198619_3199321_-|QAS86272.1|DBSCAN-SWA MKNIAAQMVNFDREQMCRIANNMPEQYDEKPQVQQVAQIINGVFSQLLATFPASLANRDQNELNEIRRQWVLAFRENGITTMEQVNAGMRVARRQNRPFLPSPGQFVAWCREEASVIAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKSNAHYWLVTNLYQNMRANALTDAELRRKAADELVHMTARINRGEAIPEPVKQLPVMGGRPLNRAQALAKIAEIKAKFGLKGASV >CP034958|3186473:3209702|3202089_3202839_+|QAS86275.1|DBSCAN-SWA MECVASSNSAIPNVVEVIRRINEGSTQPFLCKCDDGQLYVLKSKPSMPPKNLLAEFISACLANDIGLPLPDFKIVFVPEELIEYSPDLQQQICTGYAFASLFIDGAIALTFTQSRNETIIPVEQQKLIYVFDKWILNADRTLTDKGGNVNILYDISNDKYYLIDHNLSFDQNAGPEDFSVHVYGPGNRKWQYDLVDRVEYRQRVVNSLHKLPAILDEIPEEWIVDEEFLPFVCTTLDKGDCDEFWSAIE >CP034958|3186473:3209702|3200333_3200873_-|QAS86273.1|DBSCAN-SWA MQPLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGYLVEQDSFMARLAEMEKELSEAKQAVILNAPRHQKLKEISEGIVSMFRVDPDLAGPLMAMVTTMLGAI >CP034958|3186473:3209702|3197284_3197845_-|QAS86269.1|DBSCAN-SWA MKKIILLAMIIGSLTGCASVPPLNFSTPNVGVSQKKIDAEIKSLTVSLARPDEQKGDITAGMEAITPIWRESLQEALDRMTIFRDSSPNTVSLNVKVLALDVPAFGVSMTTKAIARYEIINRANGDIIYTQDIESTGTVPASYAFYGIVRARESVNRAVQNNITQFLQALESVDLSRPMFPVRVAK >CP034958|3186473:3209702|3190359_3190512_-|QAS86253.1|DBSCAN-SWA MPSMIAIILLIILHIWLCRQGGAHFWSESRLNISLLMLDIEHFARGKGLR >CP034958|3186473:3209702|3195377_3195740_-|QAS86263.1|DBSCAN-SWA MNTYNITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE >CP034958|3186473:3209702|3196641_3196743_-|QAS86267.1|DBSCAN-SWA MRTYNPNSLLPSQMQKCTCDFLHPAFDLCGGEA >CP034958|3186473:3209702|3189597_3190008_+|QAS86252.1|DBSCAN-SWA MAQVAIFKQIFDKVRNNLNYHWFYSELKRHNVSHYIYYLATENIHLVLENDNTVLIKGQGKVVNVRFSKNKCLIEATLKGFKSGELSFYEYRKNLATAGVFRWITNIHENKRYYYTFDNSLLFTENIQNTTQIFPH >CP034958|3186473:3209702|3187412_3187550_-|QAS86249.1|capsid|DBSCAN-SWA MAYEPCQGVIIGIMPQHVLSKSELRLDAIFSLKRKTLLQYLEPWF >CP034958|3186473:3209702|3191460_3191676_-|QAS86256.1|lysis|DBSCAN-SWA 
MKSMDKLTTGIAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >CP034958|3186473:3209702|3208110_3208296_-|QAS86286.1|DBSCAN-SWA MFSASITLLNGSPFHSPVTRKCIYHLHKTKPAVASSDKRNPRQCEDAVHCCYTLFCSQRKR >CP034958|3186473:3209702|3208228_3208396_+|QAS86287.1|DBSCAN-SWA MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >CP034958|3186473:3209702|3189307_3189541_+|QAS86251.1|DBSCAN-SWA MSTKNRTRRTTIRNIRFPNQMIEQINIALDQKGSGNFSAWVIEACRRRLCSEKRVSPEANKEKSDITELLRKQIRPD >CP034958|3186473:3209702|3186473_3186650_-|QAS87725.1|tail|DBSCAN-SWA MQEFSEHIAPLQDAVDLEIATEEENSLLEAWKKYRVLLNRVNTTTAPDIEWPTVPIIE >CP034958|3186473:3209702|3194777_3195155_-|QAS86261.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVGGMTFMALARKHGRSDCWVGRMLQKAEGVVEGMLMVLDLRLEMDADCSK >CP034958|3186473:3209702|3204453_3204750_+|QAS86278.1|DBSCAN-SWA MNAYYIQDRLEAQSWARHYQQIAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >CP034958|3186473:3209702|3196189_3196645_-|QAS86266.1|DBSCAN-SWA MNLPQDGIKLHRGNFTAIGQQIQPYLEDGKCFRMVLKPWREKRSLSQNALSHMWYSEISEYLISRGKSFATAAWVKEALKHTYLGYETKELVDVVTGEITTIQSLRHTSDLDAGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQEA >CP034958|3186473:3209702|3195240_3195381_-|QAS86262.1|DBSCAN-SWA MMFEFYMAELLRHRWGHLRLYRFPGSVLTDYRILKNYAKTLTGAGV >CP034958|3186473:3209702|3190963_3191461_-|QAS86255.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIIPGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSALLRKINQGDIKGACDQLRRWAYAGGKQWKGLMTRREIEREVCLWGQQ >CP034958|3186473:3209702|3190540_3190747_-|QAS86254.1|DBSCAN-SWA MRKLKMMLFGASLIMVVGCSSKENALCHPQPKPPAPPAWAMMPPSNSLQLLDETFSVSGTELSATKQH >CP034958|3186473:3209702|3208435_3208654_+|QAS86288.1|DBSCAN-SWA MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >CP034958|3186473:3209702|3196019_3196190_-|QAS86265.1|DBSCAN-SWA MATPLIRVMNGHIYRVPNRRKRKPELKPSEIPTLLGYTASLVDKKWLRLAARRNHG >CP034958|3186473:3209702|3199317_3200247_-|QAS87726.1|DBSCAN-SWA 
MANTAEIFNFPVPDAAQKEPRVADLDDGYTRIANELLEAVMLAGLTQHQLLVFLAVMRKTYGFNKKLDWVSNEQLSELTGILPHKCSAAKSVLVKRGIFIQSGRNIGINNVVSEWSTLPESGKKNKVYLKEVNLPESGKKSLPKSGKGTYPNQVNTKDKLTKDNIKPYSSENSGESSDLPENDLPVVKADAAIQNGSKWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIRLMRERDGRNHRDMCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQQAGVTAGKPKLDLTNTDWIYGVDL >CP034958|3186473:3209702|3192945_3193905_-|QAS86259.1|DBSCAN-SWA MIKKPVIGISGCLAGSAVRFDGGHKRADFLMDKLVEWVTFRPVCPEMAIGLPVPRPALRLVRSMQGNIRMCFSHDQNEDVTERMTEFSRSYMDKLKDVSGFVVCAKSPSCGMERVRVYDENGNRGRKDGVGLFTSTLMEKFSWLPVEEDGRLHDPVLRENFVERVFALHELNHLYKEKLSRRELLAFHSRYKLQLLAHSQAGYKDMGPFVAAIHEWADLESYFEVYRDKLMAILRKPASRKNHTNVLMHIQGYFSNYLSTRQRKELSEVILNYRSGTLPLLAPLTLLKHYLGEYPNDYLLTQNYFDPYPDELALRLMVN >CP034958|3186473:3209702|3206369_3206561_+|QAS86282.1|DBSCAN-SWA MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >CP034958|3186473:3209702|3205537_3206218_+|QAS86280.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSSVNITESPIIYRDENMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >CP034958|3186473:3209702|3186799_3187468_+|QAS86248.1|DBSCAN-SWA MVKLMKKNTDDGAKIYTPLTLKLYDWWVLGVSNRLAWGCPTKEHLLPHFLEHLGNNHLDIGVGTGFYLTHVPESSLISLMDLNEASLNAASTRAGESKIKHKISHDVFEPYPAALHGQFDSISMFYLLHCLPGNISTKSCVIRNAAQALTDDGTLYGATILGDGVVHNSFGQKLMRIYNQKGIFSNTKDSEEGLTHILSEHFENVKTKVQGTVVMFSASGKK >CP034958|3186473:3209702|3208631_3209702_+|QAS86289.1|integrase|DBSCAN-SWA MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >CP034958|3186473:3209702|3195736_3196027_-|QAS86264.1|DBSCAN-SWA 
MADLRKAARGRECQVRTPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDRRTHFVDAAYAKECALEGMARTQVIWMKEGVIKA >CP034958|3186473:3209702|3194097_3194622_+|QAS86260.1|DBSCAN-SWA MKLWPVLTGIALSFTLIACKAPTPPKGVQPITNFDANRYLGKWYEIARLENRFERGLEQVSATYGKRNDGGIRVLNRGYDPTKNKWSESEGKAYFTGDTKTAALKVSFFGPFYGGYNVIKLDDEYKYALVSGPNREYLWILARTPTIPDKVKADYVRTAQKLGFNVNELLWVKQ >CP034958|3186473:3209702|3192459_3192594_-|QAS86258.1|DBSCAN-SWA MPYICSIILVLNSFDVRIGKEDILFKKGSAVLIDYNLKDFFHQI >CP034958|3186473:3209702|3207383_3207986_-|QAS86285.1|DBSCAN-SWA MSYFLRKKWMVNLSGSGKILWALNMKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >CP034958|3186473:3209702|3200942_3201173_-|QAS86274.1|DBSCAN-SWA MNPAIKTAINIVGSQKKLGAACEVSQQAVYKWLHNKAKVSPEHVGSIVTATGGVVKAYQIRPDLPKLFPHTEKNAA >CP034958|3186473:3209702|3206951_3207173_+|QAS86284.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >CP034958|3186473:3209702|3204755_3205541_+|QAS86279.1|DBSCAN-SWA MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKATEQKVAA >CP034958|3186473:3209702|3201277_3201967_+|QAS87727.1|DBSCAN-SWA MKTTLSERLKEARLARGLTQKALGDLVGVSQAAIQKIETGKANQTTKIVEIANALGVRAEWLSSGVGNMSDSTVQPIQSTVSHSKYFKIDVLDIEVSAGPGVINREFVEVLRSVEYSFDDARHMFDGRKAENIRIINVRGDSMSGTIEPGDLLFVDITVKSFDGDGIYAFLYDDTAHVKRLQMMKDKLLVISDNKSYSPWDPIEKDEMNRVFIFGKVIGSMPQTYRKHG >CP034958|3186473:3209702|3198101_3198293_+|QAS86270.1|DBSCAN-SWA MRAKIYQLSLWIFISFLAIYAFIIYKGSYIGVALHQIAWIIIIASGLIARLTKPKQKPISSNN >CP034958|3186473:3209702|3198329_3198623_-|QAS86271.1|DBSCAN-SWA MTGKEAIIHYLGTHKSFCAQDVAAVTGATVTSINQAAAKMARTGILVIDGKVWRTVYYRFATREEREGKVSTNLIFKECRQSAAMKRVLAFYRGNFQ >CP034958|3186473:3209702|3206214_3206397_+|QAS86281.1|DBSCAN-SWA 
MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >CP034958|3186473:3209702|3206571_3206853_+|QAS86283.1|DBSCAN-SWA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >CP034958|3186473:3209702|3196835_3197288_-|QAS86268.1|DBSCAN-SWA MKRLFVIAPLLVLVGCAQNISPNSYSVGSVGMVNRTIAGTVISARGVDISGTSALGGTAGAAVGATAGSALGGGVRSNIVGAVGGAVIGGIAGAAIESSATKQTGMEYVVETENGNLMTIVQGKDPLFTQGSKVLVLYGNPSRIITDPRH >CP034958|3186473:3209702|3191863_3192451_-|QAS86257.1|DBSCAN-SWA MIVDVEEKTVNDFFKSNTLSPFSVRRFYPAYLMVECEDFSLLKNLIACLNCDGRTVDFVRNQISLACLAILSSEKIVQSFLFGCLNSLGSKVKAIIHTDISAAWRLCDISSRLYLSESLLKRKLKHEGLSFSKLILEERMVMAERLLSYNLYSVGKVAEICGYENTSYFVSVFRRYFGVPPHQYSSRFFLEKDMM >CP034958|3186473:3209702|3204171_3204378_+|QAS86277.1|DBSCAN-SWA MVHQHYGTQTVNRGAVMPGMLVKHKDGTWTASANLRGRLYLHRGIERTYTRDLLVEVFLDGRGNGLNH >CP034958|3186473:3209702|3202835_3203663_+|QAS86276.1|DBSCAN-SWA MTTPCLYSIVRYAPYAETEEFANIGVLLCAPKENYFDFQLTKRNDSRVKNFFHDDCIFPVAKDSIQRELQFAKMHATQIVGHQQLAQFFRYFTNKKESIFQFSSTRVILSENPKEELARIYNKYVNHSDYTKERREDVLARELKRSIDRIDGLKNVFKQATIDGYFAKFSMPLVAKKHDRIQCAIKPLAFTQAEPGKMMEHSDTWVMRITRAAEENLLSLDDILFTIETPESPNSGQSKVIDIIKRTMDAKKINHIPASNHKETIDFAKKILPQV >CP034958|3186473:3209702|3188358_3188919_-|QAS86250.1|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIKRRRSLSTRMRLTQQLI |
45 | Enterobacteria_phage(50.0%) | tail,integrase,capsid,lysis | attL 3184414:3184428|attR 3209776:3209790 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those annotated with a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_8 |
3697192 : 3712880
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >CP034958|3697192:3712880|DBSCAN-SWA AATGGATATTACTGAGTTTCCTTCTGGAGTAATTGAACACCTTGGCTGGTATGTATACCGATTGATTGATCCGAGGGACGGAAGCACCTTCTATGTAGGGAAAGGCAAAGGTAACCGCGTATTTGCCCATATGCGCGGTGAAGTGGCAGCGACTGATGATGACGAGTTACTGAGCAACAAGCTAAAGCAAATTAGAGAAATAAGGTTAGCAGGACTTGAAGTTATCCATGTCATCCATCGACACGGAATGACTGATGAAAAGACGGCGTACGAAGTTGAAGCAGCACTTATTGATGCCTACCCTGGGTTAACGAATATCATGAATGGTGCTGGCAGCAATGAATTCGGCGCCGCGCATGTCAAAGAGTTGATAGCAACATATCAGCCCGAAACCATAACATTTCATCATAAAGCATTAATGATATCCGTTAACAGAAGTGCAAAGGATTCAGAGCTTTATGATGCGGTTCGATTTAGCTGGCGCATTAATGTCTCTCGCGCCAGCCAAGCAGAAATCATTCTTGCTACTGTAAGGGGGATCGTTCGAGGGGTTTTCATTGCTGATAAATGGCTCAAATCAACACGTGAAAATTTCCCTTCGTTGAAATACTGGGACGAGGATCCTGACTTTGAGGCAACACAAAGTTCGCGCTATGGTTTTGAAGGTCGAGAAGCCCCACCTGAAATAGCAAATCTTTATCTTGGAAAAAAAATACCAGATGAATTAAGAAAAAAAGGAGCTATGTCCCCGGTCCGTTACTCACCTAATTTTTGAGTCTTTAAGTGATAAGCATAAACCGCAGCACGTCATGCATACGTCGTGTCTGCGGTTTTTCTTTTTTGCTTACACGGTGTCTGGTTCTTCTGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTTTTCCATGCGGCCAGTAACGCTTTCTCTTCCTCCGTTGCGATTTCCAGCTCAACAACAGTCTGAACGTACCGGGAACAGCCTCCTTCAGAGCTTGAAGGATATCAATGTTCGCTTCCTGTTAACTGCCCGACAAGTGCAACCAGTTCGCTTACCTGATTTTCCAGAGTGGTGATCCGGGTGCCTGCTGTTGCCAGATTTTGACGTAGCGTTGTGTTTTCCTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAAGCCCGTCACGGCGGCGTAGTCAACATTCAGATAACGCGTTTCTTCACGTAATTCATTGCCGTCAACCGTTGGTCCCTGCAACTCTTTACCGTAATGAGTGAATGAGCCTACCGCTTCCGGTATTGCCTCCATTGCTTCCTGTGCAATAACACCAGCGTAAGGCAGTCCGTTTTCCTTAAGTGTGTAGGTGTACCCGTTCATTTTACGGATAGCTTCGGTTGCGTCACTGATAATCTGAATATTGTCTTTCAGCTCGCGGTCTGATAACTGATTCAGTGTTGTACAGTTAATAGCACCGTTTACATCAAACAGCTGACCTGACGATGTTTTTTGAGCATAAAACAGATACGCGGCAGTCGTTCCAACCTCAAAAACGTTTTGTCGATCACCTGAACCCCACACCTTGACAGCAAATGGTAGTTCTGTATTACCTAAGTTCTGTAAAACAAAACGATTGCCAGTCCCTGTTTGTTTTGTAAGGGTTAAATCAACAGTTGAGTTAACCTCATCCTTGTTGATAGTGAGCGCCTGCGCTGTAGCCCCGTTAACAGCACCTGTTTTGAGTTGAACCGCGCCGTCATTACCATTTAGCAGTATCTCAGCTCCGCTAAAGAAATTTTTTAGCGATAGCATCTTACTTACGCCGACTGATGAACCCAACGCCCACGCGAGAGAATTACCGGTGCTATCAAACCCACGTACAAAGCAATCCATTTTGCTATAGTCTG
ACGTGCTTCCAAGGACATCAATCCGCCCTCCGCCAGATTTTATCGGGTTAGATGTGGTTAATGACCTGACAGCAATATCGGTAGATGAATTGAGATCGTCTACTGTTAGTAATTTCTTCCATTCCTGCGTCGTTCCATTTTCAATTGTTCTTCCCCAAAAACCGGAATTGCGGCCTCCGAACTGCACAGCATAATTTTTACTAAATTGAACATGAATGCCGCCAAGAACCATAGAGCCTGCCGGGCCGTTTGTACTGCCTGCAATTGATCTAAATTTGTCGACGTTATCAGTATGCTGAGCGTTCCAGTCATGACCTGTTAATGTTGCAATTCCTGAGTTAGCTGTAATTAACCCAGACGCTTTAAGATTCTGCACATCAATTCTGTCGTTTGCGAAATTATATGCAATTGCGTTACTAACGCTTCCGGCATCATCATAATCGCGTTTCTGAATAAACCAGTCGCCAGCATTAGCAATGAGAGTATAAGTAGGCGTGTTAGCCGGGCGATCTGTTTCATTAAATCTTATTGCTGGGTTAGCGCTCCTTATTTGTAAAGGTTTTTCAACCGTTGATTTCAGTATCGCACTATCTACAATTAAATTACCTTCTGAATTAAGGTTTAGATATTTTGAAGCACCAGTTGAACCATCGTATGTAAACCTTATTGTTAATTGACCTGGCTCATTTGATAAAGTTTCAACATACATATCAGCACCAAGACGCACAGTGCTGTCAGTTGCTAACAGTCTTGTATGGAGAATTCCACTAGATGTGTATGTATTGGTCGAATCACTATATCTATCAAGAAATAGACTGTTAAGTTGAGGGCTGTTGTTCCTACCTAAGCCAAGATTCGTTCTGGCACCATCAACAGTTGCCGCATTAGTGCCTCCTTGTCCAATAGGTAATGGAACCCATGACGATCCGTTATGGCAACCCCACAATCCAGATGTTGACACCTGAAGTCGTGGCGCTGATGGTGAGTAATTCGAATAAACGTAAGTAGTCGATTCGCCTTCTTCAATTCTTTCTGTTAATACTATTTTTTTCCATACACTCCAACCTTGTGTGCTGGTATATATCCTTCTATAAAGAATTGATGAATTGTTATATACAAAATAGCTCTGGATACAACCATCACTACCATTAGCACCAGTTCTTTGCACTAACAATGCACCAGCAAGCTGTATCGGGTAATTTAGTTCTGGCTTTGCGTTGGCAGACATTGGTTGGTAATAAAAACCGGCTGTAGTACCTTTTATATCGTTTAAGTTCGTATTCGCATCAAGGCCAGTTTTTGCTTCAAACATGACTTCAAGTTTAGAGCGCGCTGTACTTGCATCATTCGCCCCTGTGCCACCTTGCGCAACTGCGAGCGGTTGCCATCTACCAGCTTTAGGGTTAAATGCGCCCCACTGACCGTCAGCATCAACCTGTAAGTAACAGCCGTCCGCCTGAACGTCAGTAGATAAAATGATGGTTCTTGTGTCGCTTGAACGCTGAAATCGGATAACCTCATCTTTTCTTGCGTGGCGTGTCCATTGCGGGCCTATTGTCGGATTCCAGCGATAAGTATAAAGCGAGCGCGTAGCCCATCCCTGAAATACACCTGTATACGCTGGTGTTCCATCTACCTGACTAATAAACCCCGTAAGACTGCTTTCACCGGATGCAATCGATGGAAATCCTTTTGCATTACTCATAATGCGCATAAATCCGATATACCCTGATGGGTTGCCGGAAATATCAGGACAATCACGCGGGGCTGAGCCAAGACCGATATTATCGCCAACAATTAATTGCGCCTGGTTGCGATAATTAAGTGCGTCCGCCGCAGATTTCGCCGCGTTTGTTTCACTGGATTTAGCATTAGTTTCGCTGGCTTTAGCGTTTGTTTCGCTGCTCTTTGCTGCTGCCTCGCTATTTTTCGCGTTGGTTTCTGATTTTTTGGCTGCTGTCGCGGAGTTTGCCGATGCAGTTTGTGAGTCTGCTGCCGCC
TGTGCGCTGTTACCCTGGCGACGCGCCTTACGCGTTAACCAGCCTTCTGCTTCCAGCCGTGCGATAGCCGTTCTGACGGTACTCATCCCCGCGCCAATCTGACGGGCAATGGTTTCAATTGATGGCCAGCACACACCTTCGTCATTACTGAAATCAGCCAGGCGGGCCATAATTGCCACGCTGGATAACTTCATGCCTGATGCAGCGCAACCATCCCATACATAGCCGGTTAATTTAGTGCTCATGACCGACCTCTATTTCCCTGAATTTACGACGAAACTGTTCGAGCGGGCTGAAGCACTCATGCTCATAGCCTTCGCGGAGGTAGATAACCCGTTGTGTTTCCGGTTCCCAACGAATGACTCTGACGGGCACTCCGTAGTGATCTTTGAACCAGCGGTTAACTTGTCGCAAAGGACTGTCTCCTTCTGCCGGTTGAAATCACCCACAGCCCACTCTGCAAAGCTGTGGGTTACAATTTCCCTGTCACCTGGTACATTTACTGCATAGCAATACTCCACCTTCGCTTTTCCACCCGGTACAGGAAGCGCAATCAGTTGCGAGCGACGGTAGTGTGTTGTTAAACTGTTCATGCGTTAGTTTCTCCACAACCAGAAGCAATCGACGCCACGACGCCCGGAGCTGCACACTCGCGGGCGTCATTACTTTCTGAAATGCAAAAAATTTTGTAGACAAGTGCTGCATGCTCCTGCAGCTTCGAAATTGAGAGATACAGCTCGTCGTTAATTGCTGTCTTCTCATGCGGTTCCACTACACCGTCTTCGATTGCTGAACGAATCTGTTTTGAATAACTGCCGATCTGTTCAATGACTTCCAGCAGACGCTGGTTAATATCGGCGTTGTCCACATCCTCGACGTCAGGAAGAGACACAAAGACGCCATTTGCAGACTGCGCCACAGCGTCAGCAATGAAGTGAGTTCCACCAGCACGTTGCAAAATCATTGCCCATCCCAGCGGGAAAATCTGATCGCCATCGGCACGAAGGCGGTTAAATAATGCGTTCTCTGTTACATCCAGCCAGTCAGCTGCTTCAGCGTAACCCCCCGGCAACGCTGCGATAGTTTTTCTGACAGCTTTCACGTACCACTCAGGCTGTTTTTCTACTTTCCAGTGATGCTTACCCACGGTTAGCCTCATCGTTCTGTGGTTTCTGTTAATCGATTTATCCATTAGATTTTTCATAAAGCTCAGGTTTAAATGGCAACCGTCCGCAAGTTCTATATGCAGCTTCTGCTGCACGTCCTTTTGGAATTAACTGGCCCGGACGGTTTCGCCACTGATAAACGGCTTCAGTTGTTATGCCGAAAAAAGCAGCAACTTTCTCAATACTGCCGAAGTAGCTTTCGATATCGTCAGTTGTCATACGCCCTCCAAACTAAGTTTTATTAGATGCTAATTACAAATCTATCTTTGGTCAATAAAAACTAAGATTACTTAGCAATTAAAGAAATGGTGCTCCTATGGAAACGGTTGGTCAGCGTATAAAAGCTCTGAGAAGAGTTACCAGAACGTCCCAGAAAGAATTGGGTAAATTTTGTGGAGTAAGCGACGTTGCTGTGGGGTACTGGGAGAAAGACATCAATACCCCTGGTGGGGAGGCACTTTCGAAATTAGCGAAGTTCTTCAATACGTCAATAGATTACATTCTTTATGGTGCTGAGTTTGAAGGCAAACTCGTCACAAACATGCGCAGAGTTCCTGTAATATCGTGGGTTCAGGCTGGGCAGTTTACTGAGTGCAGGGCAGCAGAAGTGTTTAGTGAAGTGGACAAGTGGGTAGATACATCATTAAAGGTTGGTGATAACTCATTTGCATTAGAGGTTAAAGGTGACTCCATGACTAACCCTAATGGCCTCCCAACAATACCAGAAGGCGCAACAGTGATTGTAGATCCAGATGCAGAACCTCGTCATGGAAAAATAGTCATCGCTCGACTTGATGGAACAAACGAAGCTACAGTAAAAAAATTAGTCATCGA
TGGCCCTCAAAAGTTTTTAGTACCATTAAATCCTCGGTATCCCAACATCCCTATCAATGGTAATTGCCTTATCATTGGTGTAGTCAAAGGAGTTCAATACGAACTCTAAGGCCTCTCTTCTCTAACTAAGGCACCGAACTAAGAAAAGTTTGGTGTTTTCTCTTGCCATGACAACTAAGTTAAGTTAGATTTTATATCAAAGATAACGAACAGGCAGGACGCCCACGAAGTAGCCGCCTGGGGCATATGAAGTCCAGGATGATTCGTTAGCAACAAAAAAGCGCCCTACAGGACGCTTAGCTCTTTAACAATCTGGGTATCATCCAACCAATGCAAGATTTAAGGAATCCAAGGCGAATTCAGATCTCGCCCCAACTCACGTAATGATCTTGGTCGTTCGTACATCGGATTTTTTTCCATAAGAAATTTATTTTCACAGTGAAGGCAACGGCTTGTAAGAAAATGAGAAGTTTTACCTACTGGAAGCGGATGAAGTATCGATTTTATATTTTTGGAATAACAAAGTGGGCACAGATGCACAGTTATTTCTTTTCCACTCACAATTTCATTTTTAGAGTAAACAAAAGCACCAGAGTCAAGCTGATCAAGGACATATCCTTCTACCTTGGCACAAAAATCTTCAAACTCTGCAATTTTTGCTTTGAGATGCATCACCTCTTCATCACGAAGGCGGATCGCATCGCCAAGAGAGAAGCATTCTGCCTGAAGCGTGATTAGTTTGTTCTGGAGTTCAATGGTTGCAGCTTTAACTTCTGCATCCGTTTTCGCGTCATTAATAACCTTAGCAAGACCGGCAGTCTCCTTTATAGCGGCCATAGCCGCAGACAGTTCAGCTATCACGTTGAATACTCAGCTAGTTGTTGGGGATATCCAGATTAACCAAATCCTTGTTGTTGGGGAATAACTAGGTCCACCTCGCCTGATGTGGCTAAAAGCAGGCACATAACAGCTAAGTATTTTCAACCAGAGAGAATCCTTAGCGTTGTGGTGAATGCGGCTCAGCGCACGCGGGTTAAGGTTGAGGCTGACAGTCGACCTTCTGTGGATACCCACCCGCCTGGTGTGCAACCTTCGCCAGGCACCGGGAGGCACCCGGCACCACAACTTTATGCTGTGTGTAGTCCTCGCGGTACCAGTTTGTACACTTGCTTCCGGCTGGTACCGCTCTTTTTACAAAACAGAGAAGAACATCACCGGACGACGGGCTCATAACCCAATCCATCCGGGCGGCTGCCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATTAGGAGCAGAAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATTGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCAGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCACTCAACCTGGAAATCGAACCATTTGATGAGAACCGTGTGAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAAGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCATTAATATCGTTCTTAATTAACCGATTATTTATCTCATCACTGAATATCTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACGCAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTACTCTAAAGATCTTGCAGATGAAGGCACCCGCTG
CTTTATCGATGCCGATAATATGCGAGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACACTCAAACTGAAAAAGACAGCACCGTTCTCTGCTCTGTTGTCTATTAACGGCGAGCGTAACTCCCAGAAGTCACTAGCAGAATGGATTGAAGACTGGGCAGACTATCTTGTGGGCTTTGATGCTAATGGTGACGCTATTCAGGCAACAAAAGCGGCTGCGGCTGTCCGTAAAATCACGATTGAAGCAAACCAGACCGCTGATTTTGAAGATAATGACTTCAGCGGCAAACGCTCCCTGATGGAGTCTGTCGAAGCGAAGACCAAAGATATTATGCCTGTGGCATTTGAATTTAAATGCGTTCCGTTTGAAGGCCTGAAAGAACGTCCGTTTAAATTACGCCTCAGCATTATCACTGGCGATCGTCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCAGTGCAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAGAAATTCAAGGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCATTTATGGAAACGTAATTGACTCAATAATCGCCGGATGGTGAGGGCTTCCTTTTACCAGAATTCAGCGCGGTGCAGCGCATATACGTGGAGAACAAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAGGATAAATAAAGACGACATCGTTATTAACGATATCGCGGTTTCTCTTTCAAATATCTGTCGCTTTGCAGGGCATCTTTCACATTTCTACAGCGTTGCCCAACATGCGGTGCTTTGCAGCCAACTGGTACCGCAGGAATTTGCTTTTGAAGCGTTAATGCATGATGCAACAGAAGCGTATTGCCAGGACATCCCGGCGCCACTGAAACGCCTTCTTCCTGACTATAAACGGATGGAAGAAAAAATAGATGCAGTAATCCGTGAGAAATACGAGTTGCCCCCGGTTATGAGCACGCCCGTGAAATATGCCGATCTCATCATGCTGGCAACCGAACGCCGTGATCTCGGGCTTGATGATGGCTCTTTATGGCCTGTACTGGAAGGTATCCCGGCAACAGAGATGTTCAAAGTTATTCCACTGGCACCGGGCCATGCCTACGGGATGTTTATGGAACGCTTCAACGAGTTATCGGAATTACGCAAATGTGCATAACTCATGTAGTTAGTTTTTCTGGCGGGAGAACATCCGCATATCTTGTTCACCTGATGGAAGAACAAAGAAAGGCTGGCAATAACGTCTGCTACATCTTTATGGATACCGGTTGCGAACATCCGCTGACATACCGCTTTATCCGGGAGGTTGTGAAGTTCTGGGACATACCACTAACTGTGTTACAGGTCGATATAAATCCTGAGCTTGGGCAGCCAAATGGTTATACAGAATGGGAGCCAAAGGATATTCAGACACGAATGCCGGTGCTTAAACCGTTTATGGACATGGTTAAAAAGTACGGCACGCCATACATCGGCGGCGCGCTCTGTACTGATAGGCTAAAACTCATCCCTTTCACAAAATACTGCGATAACCATTTCGGGCGAGGTAATTACATCACATGGCTGGGTATTCGTGCAGACGAACCCCGTAGGCTGAAACCGAAATCGGGCGTCCGGTATCTTGCCGAGCTGTCAGATTTTGATAAGTCGGATGTTATCCGGTGGTGGCGAAAACAACCTTTTGATTTGCAAATCCCGGAGCATCTCGGGAACTGTGTTTTCTGCATCAAAAAGTCAACGCAAAAGCTGGGGCTTGCATGTAAAGACGAACCAGGTCTGATGCGAGTTTTTAATGAGCTGGTTACAGGCAAACACGTCAGGGATGGTCATCGCAGAACAGGTAAAGACATTATGTACCGTGGTCACCTGACGCTTGACGGAATTGCCAGAATGTCTGCCAACAGCGACTACA
GAAATTTGTATCAGGCGATGGTACAGGCCAGGCGATTCGATACCGGCTCGTGTTCAGAGTCATGTGAAATCTGGGGTGATCAATTGGAATTGGAATTCAAAGAGGTAGGGGTATGACAACCGAAATTAACTACCATGCACTGCTTGAGCGCGCACGGAATAAAGTGCAGAGCATTGAGTTCGCCTTAACACAGAGTGCATTCGCTGAGATTCGCGCTGAGCTTGAAAATGATTTAGAACTGGCACGGATTGCACTGGCATCTCTGGAAGTTGAGCCAGATGAACGCGCAGCCTATGAATTATTTATGGAAAAGCGTTTCGGTAAAACAGTCGATCGTCGGAGAGCAAAAAACGGCGATAACGAATACATGGCATGGGATATGACTCTCGGTTGGATCGTCTGGCAGCAACGAGCTGGTATCCATTTTTCAACAATGTCACAGCAAGAGGTGAAATAATGGAGCCATACAGCCTCACACTCGATGAGGCCTGTCATTTTCTCAAGATATCCAGACCGACTGCCATTAACTGGATACGCACAGGGCGTCTTCAGGCAACACGCAAAGATCCCACTAAGAATAAATCTCCTTACCTCACAACACGACAAGCCTGCATTGCGGCTCTTCAGTCTCCGCTGCATACTGTCCAGGTGAGCGCGGGTGATGGCATAACAGAGGAAAGAAAATGTCACTCTTCCGCAGAGGTGAAATATGGTACGCCAGTTTCACATTGCCGAACGGTAAAAGATTTAAACAGTCTCTTGGAACAAAGGACAAAAGGCAGGCGACAGAACTCCATGACAAGCTAAAGGCTGAAGCATGGCGGGTCAGCAAACTTGGTGAAATACCTGATATAACGTTCGAGGAAGCGTGTGTCAGGTGGCTTGAAGAGAAAGCACATAAAAAATCACTGGACGATGACAAAAGCCGGATCGGATTCTGGCTTCAACATTTCGCAGGAATGCAACTAAGAGACATTACTGAATCAAAAATTTATTCAGCAATGCAGAAAATGACGAACCGGCGTCATGAGGAAAACTGGAAACTCAGGGCAGAAGCATGCAGAAAAAAAGGGAAACCTGTTCCAGAATACACGCCAAAACCAGCGTCCGTTGCAACGAAGGCTACGCATCTTTCATTTATAAAGGCCCTACTAAGAGCCGCAGAGCGTGAATGGAAAATGCTGGATAAGGCACCAATTATTAAAGTGCCTCAACCAAAGAATAAACGGATCCGCTGGCTGGAGCCCCATGAAGCACAAAGGCTGATTGATGAATGTCCGGAGCCATTAAAGTCTGTTGTTGAATTTGCACTGGCAACAGGCTTAAGACGCTCGAACATCATCAACCTTGAATGGCAACAAATAGATATGCAGCGCCGGGTGGCATGGATAAACCCAGAAGAGAGTAAATCAAACCGCGCAATCGGCGTTGCGCTGAATGATACTGCATGTCGCGTTTTGAAAAAACAAATCGGGAATCATCACCGTTGGGTATTTGTGTACAAGGAAAGCTGTACCAAACCAGACGGAACGAAAGCGCCAACAGTAAGGAAGATGCGGTATGACGCAAACACAGCCTGGAAAGCGGCGCTGAGACGGGCTGGTATTGATGATTTCAGATTTCACGACTTGAGACACACCTGGGCAAGTTGGCTGGTTCAAGCCGGAGTCCCGTTGTCAGTGTTACAGGAAATGGGAGGCTGGGAGTCTATCGAAATGGTTCGTCGATATGCTCACCTTGCACCTAATCACCTTACCGAACACGCACGGCAAATAGACTCGATCCTGAACCCATCGGTCCCAAATTTGTCCCAGTCAAAAAATAAGGAAGGTACTAATGATGTGTAACTTATTGATTTAAATGGTGCCGATAATAGGAGTCGAACCTACGACCTTCGCATTACGAATGCGCTGCTCTACCAACTGAGCTATATCGGCCCTGAAAGGACATGTTCACGAACGTGAATCACGGTGGACAAGGTTAAAACTAACCGGGCGATG
CGTCAATGGCCTTGTGAATCAAATGGCTACTTTTGCATCACCCGGTTTTATTTACGCACGAATGGTGTAATCACCAATACCGATCCACTTGTAAGTGGTCAGTGCTTCCAGCCCCATTGGGCCACGCGCGTGGAGTTTTTGTGTGCTTACCGCCACTTCCGCACCTAGTCCAAACTGGCCGCCGTCGGTAAAACGCGTAGAGGCGTTAACGTAAACAGCGGACGAATCCACTTCGTTAACAAAACGCTGGGCGTTGCGCATATCGCGGGTCAGGATCGCATCGGAGTGTTGTGTGCCGTGTTCACGAATATGGGCGATGGCATCGTCAAGATCACTGACGATTTTGACGTTCAAATCTAATGACAGAAACTCATCGTCATACTCTTCCGCTTTAACAGCCACCACCTTCGCGGGGCCTGTCTGCAACTGCGCCAGCGCAGCTGCATCTGCGTGTAATGCCACGCCGCTTTCCTCCATTTGTTTGCTTAATGCGGGCAGGAAGCTATCGGCGATGTTTTTATTCACCAGCAACGTTTCTACCGTATTACATGTGCTCGGACGCTGAGTTTTCGCGTTGACGATCACTTTTAATGCTTCAGCAATCTCTACACTTTCATCAACATAAATATGGCATACGCCTATACCACCTGTGATCACCGGGATCGTCGACTGTTCGCGGCACAGTTTATGCAAACCAGCGCCACCACGCGGGATCAGCATGTCGATGTATTTATCCATACGCAGCATTTCACTGACCAGCGCACGGTCAGGATTATCAATCGCCTGCACGGCACCCACCGGTAAGCCACAGGATTTCAGGGCGTCCTGAATCACCGCCACCGTTGCCGCGTTAGTGCGACAGGTTTCTTTACCGCCACGCAGAATCACTGCGTTACCGGTTTTCAGGCACAGCGAAGCGACATCAACCGTCACGTTCGGGCGCGCTTCATAAATCACGCCAATAACCCCCAGCGGTACGCGACGACGCTCAAGACGCAGGCCGCTGTCCAGTACGCTGCCATCGATTACCTGCCCCACCGGATCGGCGAGGTTACACACCTGGCGCACATCATCGGCAATGCCTTTCAGCCGTGCGGGCGTCAGTGCCAGACGGTCAAGCATCGCTTCGCCAAGGCCATTGGCACGCGCGTCAGCAACATCCTGGGCGTTAGCGTTGAGGATGATTTCGCTTTGTGCTTCCAGTTCATCGGCGATTTTTTCCAGCACGCGATTTTTTTCGCGGCTGGAGAGTTGCGCTAATTTATACGAGGCTTGCTTCGCGGCAATGCCCATTTGTTCCAGCATCAGCCTGCTCCTTAACGGGTAATCATGTCATCACGGTGAACGGCAACCGGGCCGTATTCATATCCCAGTATTGCATCAATTTCTTGCGAGTGGTGCCCGGCAATACGGCGTAATGCATCGCTGTTGTAACGACTGACGCCGTGGGCGATATCGCGACCTTCGAGGTTGCAAATGCGGATGACTTCACCACGCGAGAAATTGCCAGTCACGCTTTTAATGCCTTTCGGCAACAGGGAGCTGCCGCGTTCAAGAATGGCGGCAGTTGCCCCTTCATCTACCGTGATTTCACCCGCCGGCGGCGCACCGAAAATCCAGCGTTTACGGTTTTCAAGCGGAGTCGCCTGGGCATGGAACAGCGTACCGACGGAAATGCCTTCCATCACATCACCAATAACGCCCGGCTTGCTGCCCGCGGCAATAATGGTGTCGATACCCGCACGGCAAGCCACGTCAGCGGCCTGCAATTTGGTACTCATGCCGCCAGTTCCGAGGCCTGAAACGCTGTCACCGGCAATCGCGCGCAGTGCGTCATCAATGCCGTAAACATCTTTAATCAGTTCTGCCTGCGGATTGCTGCGCGGATCAGCGGTATACAAACCTTTTTGATCGGTCAGCAGCAACAGTTTATCGGCACCCGCCAGAATCGCCGCCAGCGCAGAAAGGTTATCGTTATCGCCGACCTTAATCTCTGCCGT
AGCGACAGCATCGTTCTCATTGATTACCGGAACGATATTGTTATCGAGCAACGCACGCAGGGTGTCGCGGGCGTTCAGGAAGCGTTCACGGTCTTCCATATCAGCACGGGTCAGCAGCATCTGCCCGACGTGAATGCCATAAATCGAAAACAGCTGTTCCCACAGTTGAATCAGTCGACTCTGCCCTACCGCCGCCAGCAGTTGTTTCGAGGCGATAGTCGCTGGCAGTTCCGGGTACCCCAGGTGCTCACGTCCGGCGGCGATCGCGCCCGACGTCACAATAACAATCCGATGCCCGGCGGCATGTAACTGCGCGCACTGGCGAACAAGTTCAACGATATGGGCACGGTTCAGACGGCGCGATCCGCCTGTTAGCACACTGGTGCCGAGTTTTACCACCAGCGTCTGGCTGTCACTCATGATTCTCTGCCATTCAATTTTAGGAAAAATGATATCAAACGAACGTTTTAGCAGGACTGTCGTCGGTTGCCAACCATCTGCAAGCAAAGCATGGCGTTTTGTTGCGCGGGATCAGCAAGCCTAGCGGCAGTTGTTTACGCTTTTATTACAGATTTAATAAATTACCACATTTTAAGAATATTATTAATCTGTAATATATCTTTAACAATCTCAGGTTAAAAACTTTCCTGTTTTCAACGGGGCTCTCCCGCTGAATATTCGCGCGTTAATTAAAATCAGGAATGAAAATGAAAAAGAGCACTCTGGCATTAGTGGTGATGGGCATTGTGGCATCTGCATCCGTACAGGCCGCAGAAATATATAACAAAGACGGTAATAAACTGGATGTCTATGGCAAAGTTAAAGCCATGCATTATATGAGTGATAACGACAGTAAAGATGGCGACCAGAGTTATATCCGTTTTGGTTTTAAAGGCGAAACACAAATTAACGATCAACTGACTGGCTATGGCCGTTGGGAAGCGGAGTTTGCCGGAAATAAAGCGGAGAGTGATACTGCACAGCAAAAAACGCGTCTCGCTTTTGCCGGATTGAAGTATAAAGATTTGGGTTCTTTCGACTATGGCCGTAACCTGGGCGCGTTGTATGACGTGGAAGCCTGGACCGATATGTTCCCGGAATTTGGTGGCGACTCCTCGGCGCAGACCGACAACTTTATGACCAAACGCGCCAGCGGTCTGGCGACGTATCGGAACACCGACTTCTTCGGCGTTATCGATGGCCTGAACTTAACCCTGCAATATCAAGGGAAAAACGAAAACCGCGACGTTAAAAAGCAAAACGGCGATGGCTTCGGCACGTCATTGACATATGACTTTGGCGGCAGCGATTTCGCCATTAGTGGTGCCTATACCAACTCAGATCGCACCAACGAGCAGAACCTGCAAAGCCGTGGCACTGGCAAGCGTGCAGAAGCATGGGCAACAGGTCTGAAATACGATGCCAATAATATTTATCTGGCAACTTTTTATTCTGAAACACGCAAAATGACGCCAATAACTGGCGGCTTTGCCAATAAGACACAGAACTTTGAAGCGGTCGCTCAATACCAGTTTGACTTTGGTCTGCGTCCATCGCTGGGTTATGTCTTATCGAAAGGGAAAGATATTGAAGGTATCGGTGATGAAGATCTGGTCAATTATATCGATGTCGGGGCTACATATTATTTCAACAAAAATATGTCAGCGTTTGTTGATTATAAAATCAACCAACTGGATAGCGATAACAAATTGAATATTAATAATGATGATATTGTCGCGGTTGGCATGACCTATCAGTTTTAA
Protein sequences of DBSCAN-SWA_8 >CP034958|3697192:3712880|3698035_3698164_-|QAS87748.1|tail|DBSCAN-SWA MEIATEEEKALLAAWKKYRVLLNRVDTSTAPDIEWPEEPDTV >CP034958|3697192:3712880|3707800_3708964_+|QAS86707.1|integrase|DBSCAN-SWA MSLFRRGEIWYASFTLPNGKRFKQSLGTKDKRQATELHDKLKAEAWRVSKLGEIPDITFEEACVRWLEEKAHKKSLDDDKSRIGFWLQHFAGMQLRDITESKIYSAMQKMTNRRHEENWKLRAEACRKKGKPVPEYTPKPASVATKATHLSFIKALLRAAEREWKMLDKAPIIKVPQPKNKRIRWLEPHEAQRLIDECPEPLKSVVEFALATGLRRSNIINLEWQQIDMQRRVAWINPEESKSNRAIGVALNDTACRVLKKQIGNHHRWVFVYKESCTKPDGTKAPTVRKMRYDANTAWKAALRRAGIDDFRFHDLRHTWASWLVQAGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHARQIDSILNPSVPNLSQSKNKEGTNDV >CP034958|3697192:3712880|3701351_3701531_-|QAS86699.1|DBSCAN-SWA MRQVNRWFKDHYGVPVRVIRWEPETQRVIYLREGYEHECFSPLEQFRRKFREIEVGHEH >CP034958|3697192:3712880|3707229_3707574_+|QAS86706.1|DBSCAN-SWA MTTEINYHALLERARNKVQSIEFALTQSAFAEIRAELENDLELARIALASLEVEPDERAAYELFMEKRFGKTVDRRRAKNGDNEYMAWDMTLGWIVWQQRAGIHFSTMSQQEVK >CP034958|3697192:3712880|3701706_3702264_-|QAS87749.1|DBSCAN-SWA MGKHHWKVEKQPEWYVKAVRKTIAALPGGYAEAADWLDVTENALFNRLRADGDQIFPLGWAMILQRAGGTHFIADAVAQSANGVFVSLPDVEDVDNADINQRLLEVIEQIGSYSKQIRSAIEDGVVEPHEKTAINDELYLSISKLQEHAALVYKIFCISESNDARECAAPGVVASIASGCGETNA >CP034958|3697192:3712880|3704875_3705700_+|QAS86703.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSINGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAVRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >CP034958|3697192:3712880|3711824_3712880_+|QAS86710.1|DBSCAN-SWA MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNDSKDGDQSYIRFGFKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNENRDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSRGTGKRAEAWATGLKYDANNIYLATFYSETRKMTPITGGFANKTQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDSDNKLNINNDDIVAVGMTYQF >CP034958|3697192:3712880|3703456_3703954_-|QAS87750.1|DBSCAN-SWA 
MAAIKETAGLAKVINDAKTDAEVKAATIELQNKLITLQAECFSLGDAIRLRDEEVMHLKAKIAEFEDFCAKVEGYVLDQLDSGAFVYSKNEIVSGKEITVHLCPLCYSKNIKSILHPLPVGKTSHFLTSRCLHCENKFLMEKNPMYERPRSLRELGRDLNSPWIP >CP034958|3697192:3712880|3697192_3697966_+|QAS86697.1|DBSCAN-SWA MDITEFPSGVIEHLGWYVYRLIDPRDGSTFYVGKGKGNRVFAHMRGEVAATDDDELLSNKLKQIREIRLAGLEVIHVIHRHGMTDEKTAYEVEAALIDAYPGLTNIMNGAGSNEFGAAHVKELIATYQPETITFHHKALMISVNRSAKDSELYDAVRFSWRINVSRASQAEIILATVRGIVRGVFIADKWLKSTRENFPSLKYWDEDPDFEATQSSRYGFEGREAPPEIANLYLGKKIPDELRKKGAMSPVRYSPNF >CP034958|3697192:3712880|3705827_3706364_+|QAS86704.1|DBSCAN-SWA MSFIKTFSGKHFYYDRINKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYELPPVMSTPVKYADLIMLATERRDLGLDDGSLWPVLEGIPATEMFKVIPLAPGHAYGMFMERFNELSELRKCA >CP034958|3697192:3712880|3710433_3711537_-|QAS86709.1|DBSCAN-SWA MSDSQTLVVKLGTSVLTGGSRRLNRAHIVELVRQCAQLHAAGHRIVIVTSGAIAAGREHLGYPELPATIASKQLLAAVGQSRLIQLWEQLFSIYGIHVGQMLLTRADMEDRERFLNARDTLRALLDNNIVPVINENDAVATAEIKVGDNDNLSALAAILAGADKLLLLTDQKGLYTADPRSNPQAELIKDVYGIDDALRAIAGDSVSGLGTGGMSTKLQAADVACRAGIDTIIAAGSKPGVIGDVMEGISVGTLFHAQATPLENRKRWIFGAPPAGEITVDEGATAAILERGSSLLPKGIKSVTGNFSRGEVIRICNLEGRDIAHGVSRYNSDALRRIAGHHSQEIDAILGYEYGPVAVHRDDMITR >CP034958|3697192:3712880|3704447_3704810_+|QAS86702.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGH >CP034958|3697192:3712880|3706354_3707233_+|QAS86705.1|DBSCAN-SWA MCITHVVSFSGGRTSAYLVHLMEEQRKAGNNVCYIFMDTGCEHPLTYRFIREVVKFWDIPLTVLQVDINPELGQPNGYTEWEPKDIQTRMPVLKPFMDMVKKYGTPYIGGALCTDRLKLIPFTKYCDNHFGRGNYITWLGIRADEPRRLKPKSGVRYLAELSDFDKSDVIRWWRKQPFDLQIPEHLGNCVFCIKKSTQKLGLACKDEPGLMRVFNELVTGKHVRDGHRRTGKDIMYRGHLTLDGIARMSANSDYRNLYQAMVQARRFDTGSCSESCEIWGDQLELEFKEVGV >CP034958|3697192:3712880|3702301_3702502_-|QAS86700.1|DBSCAN-SWA MTTDDIESYFGSIEKVAAFFGITTEAVYQWRNRPGQLIPKGRAAEAAYRTCGRLPFKPELYEKSNG >CP034958|3697192:3712880|3698218_3701362_-|QAS86698.1|DBSCAN-SWA 
MSTKLTGYVWDGCAASGMKLSSVAIMARLADFSNDEGVCWPSIETIARQIGAGMSTVRTAIARLEAEGWLTRKARRQGNSAQAAADSQTASANSATAAKKSETNAKNSEAAAKSSETNAKASETNAKSSETNAAKSAADALNYRNQAQLIVGDNIGLGSAPRDCPDISGNPSGYIGFMRIMSNAKGFPSIASGESSLTGFISQVDGTPAYTGVFQGWATRSLYTYRWNPTIGPQWTRHARKDEVIRFQRSSDTRTIILSTDVQADGCYLQVDADGQWGAFNPKAGRWQPLAVAQGGTGANDASTARSKLEVMFEAKTGLDANTNLNDIKGTTAGFYYQPMSANAKPELNYPIQLAGALLVQRTGANGSDGCIQSYFVYNNSSILYRRIYTSTQGWSVWKKIVLTERIEEGESTTYVYSNYSPSAPRLQVSTSGLWGCHNGSSWVPLPIGQGGTNAATVDGARTNLGLGRNNSPQLNSLFLDRYSDSTNTYTSSGILHTRLLATDSTVRLGADMYVETLSNEPGQLTIRFTYDGSTGASKYLNLNSEGNLIVDSAILKSTVEKPLQIRSANPAIRFNETDRPANTPTYTLIANAGDWFIQKRDYDDAGSVSNAIAYNFANDRIDVQNLKASGLITANSGIATLTGHDWNAQHTDNVDKFRSIAGSTNGPAGSMVLGGIHVQFSKNYAVQFGGRNSGFWGRTIENGTTQEWKKLLTVDDLNSSTDIAVRSLTTSNPIKSGGGRIDVLGSTSDYSKMDCFVRGFDSTGNSLAWALGSSVGVSKMLSLKNFFSGAEILLNGNDGAVQLKTGAVNGATAQALTINKDEVNSTVDLTLTKQTGTGNRFVLQNLGNTELPFAVKVWGSGDRQNVFEVGTTAAYLFYAQKTSSGQLFDVNGAINCTTLNQLSDRELKDNIQIISDATEAIRKMNGYTYTLKENGLPYAGVIAQEAMEAIPEAVGSFTHYGKELQGPTVDGNELREETRYLNVDYAAVTGLLVQVARETDDRVTALEEENTTLRQNLATAGTRITTLENQVSELVALVGQLTGSEH >CP034958|3697192:3712880|3709168_3710422_-|QAS86708.1|DBSCAN-SWA MLEQMGIAAKQASYKLAQLSSREKNRVLEKIADELEAQSEIILNANAQDVADARANGLGEAMLDRLALTPARLKGIADDVRQVCNLADPVGQVIDGSVLDSGLRLERRRVPLGVIGVIYEARPNVTVDVASLCLKTGNAVILRGGKETCRTNAATVAVIQDALKSCGLPVGAVQAIDNPDRALVSEMLRMDKYIDMLIPRGGAGLHKLCREQSTIPVITGGIGVCHIYVDESVEIAEALKVIVNAKTQRPSTCNTVETLLVNKNIADSFLPALSKQMEESGVALHADAAALAQLQTGPAKVVAVKAEEYDDEFLSLDLNVKIVSDLDDAIAHIREHGTQHSDAILTRDMRNAQRFVNEVDSSAVYVNASTRFTDGGQFGLGAEVAVSTQKLHARGPMGLEALTTYKWIGIGDYTIRA >CP034958|3697192:3712880|3702599_3703226_+|QAS86701.1|DBSCAN-SWA METVGQRIKALRRVTRTSQKELGKFCGVSDVAVGYWEKDINTPGGEALSKLAKFFNTSIDYILYGAEFEGKLVTNMRRVPVISWVQAGQFTECRAAEVFSEVDKWVDTSLKVGDNSFALEVKGDSMTNPNGLPTIPEGATVIVDPDAEPRHGKIVIARLDGTNEATVKKLVIDGPQKFLVPLNPRYPNIPINGNCLIIGVVKGVQYEL |
17 | Shigella_phage(33.33%) | tail,integrase | attL 3694296:3694355|attR 3708965:3709024 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those annotated with a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_9 |
4098759 : 4105318
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >CP034958|4098759:4105318|DBSCAN-SWA GATGAAAATTGCGCTGGTTATTTTCATCACCCTTGCCCTGGCGGGCTGTGCGCTGTTATCACTCCATATGGGAGTGATCCCCGTGCCGTGGCGCGCGCTGCTGACCGACTGGCAGGCCGGACGCGAGCATTATTATGTATTGATGGAGTACCGACTGCCGCGCTTGCTGCTGGCACTGTTTGTCGGTGCAGCCCTCGCCGTGGCGGGCGTGCTGATACAGGGGATTGTGCGCAACCCTCTGGCATCACCGGATATTCTCGGCGTTAACCATGCCGCCAGCCTGGCCTCTGTGGGGGCTCTACTTCTTATGCCGTCACTGCCCGTGATGGTGCTGCCGCTGCTGGCCTTTGCGGGCGGCATGGCGGGGTTGATATTACTGAAGATGCTGGCAAAGACCCACCAGCCGATGAAGCTGGCGCTCACCGGCGTGGCGCTTTCTGCATGCTGGGCCAGCCTGACGGATTATCTGATGCTCTCGCGCCCACAGGATGTGAACAACGCCCTGCTGTGGCTGACCGGCAGCTTATGGGGCCGTGACTGGAGCTTTGTGAAGATTGCCATCCCGCTGATGATTTTATTTCTGCCGCTGAGCCTGAGTTTTTGCCGCGATCTCGACCTCCTTGCACTCGGCGATGCGCGCGCCACCACGCTCGGTGTGTCGGTGCCCCATACCCGATTCTGGGCTTTGTTACTAGCTGTCGCCATGACATCTACCGGCGTGGCCGCCTGCGGCCCGATTAGCTTTATTGGTCTCGTGGTGCCGCATATGATGCGTAGCATCACCGGTGGACGTCACCGCAGACTGCTGCCTGTTTCAGCCCTGACAGGTGCGTTGCTGTTGGTGGTTGCCGATCTGCTGGCGAGAATTATTCATCCCCCACTGGAGCTCCCGGTTGGCGTGCTGACCGCCATTATCGGTGCGCCGTGGTTTGTCTGGTTGCTTGTGAGAATGCGATAAATGACTTTACGAACTGAAAATCTGACGGTCAGTTACGGGACAGACAAGGTACTTAACGACGTTTCACTCTCACTGCCAACGGGGAAGATCACCGCCCTGATCGGTCCTAACGGTTGCGGGAAATCGACGCTGTTAAACTGTTTTTCGCGGCTTTTAATGCCGCAGTCTGGCACCGTATTTCTCGGCGATAATCCCATAAATATGCTCTCATCGCGCCAGTTGGCCCGCAGGCTTTCGCTGCTGCCTCAGCACCATTTAACGCCAGAGGGGATCACAGTCCAGGAGCTGGTTTCGTATGGTCGTAATCCCTGGCTGTCACTCTGGGGGCGTCTCTCCGCTGAAGACAATGCACGAGTTAATGTCGCCATGAACCAGACCCGGATCAATCATCTTGCCGTTCGTCGGTTAACCGAGCTTTCCGGCGGTCAGCGCCAGCGCGCATTTCTGGCGATGGTCCTGGCCCAGAATACGCCCGTTGTATTACTTGATGAGCCAACCACCTATCTTGATATCAATCACCAGGTGGACCTGATGCGGTTGATGGGCGAACTCCGGACTCAGGGGAAAACGGTGGTCGCTGTGCTGCACGACCTTAATCAGGCTAGCCGGTACTGCGATCAACTGGTGGTAATGGCAAACGGACATGTTATGGCGCAAGGCACACCAGAAGAGGTGATGACCCCAGGATTGCTGAGAACAGTATTCAGCGTGGAAGCGGAAATACACCCCGAGCCGGTATCTGGCAGGCCGATGTGCCTAATGAGGTAGATTGCACAGGCCGTAAGAACCAAACCACGACTGAATGAAACTGGACTGGCGCCAGCAAGCCTGTTCAGACTGGGGCTGAACTTTTCCGGACTCTGAAAGATTACCAATACTCATCGTCCATCCGCTTGCTTTAGGCTGACAGGTTCATAATCAACGCAAACCAGAGCTGTACAGGCTTGGGCGCGGCTTTCAAACCAGTC
GTGATCACGGCAATCAATTTTGAACTCTGCTTAACGGACATTTCTGTATAACCCTTACGGCAACGAAAAACGCGAAGTTAAAATTTTAGAAACCCAAAAACGTGACATGACTAAGTTTAGATTTCAGGGGGGGAGATCAAAAAATTTCGCTCTGTGCCAGAGCGGACATTCACGGAGCTGGTTCATTACCAATGAGGTTGGGCTTTTGAGGATAAATCAATGATCAGACGCCAACGTAAATCAAAAGCACCCCTGGAACGGAATTGCTAATCCAGTTTCTGACCATCGATTTTTCTAAAAAGTGTGCGTTTGCTACTACTTAGGTAGGTGCAGCTTTCTTAATCACCGGCAGCCACGTTATACAGGCCAGTTGATGGATCGATTGTTATCAATGATATCTTTATGAGTCGGTGTCTCACCCAGCTTACCGAAGCTGGCATAAAGTGAAGGCAGACGGGCCCGTCCTTCTCCCTTTTTCGCCAGAGGGAAAGCGCGAAGCATGGTGGCAGCCTCCCAGTCACAACAATAGGATGGTGTGCACGGCTGCTGACGCCATGATTCAGCGATAGAGCCGGAAAATACGGGGTCAAAGCCGGTATCATTAACCAGAGTCATCGTGACCTGTACTGCGGCTGGATGATCTCCTGCTACTGCAACTGCTAGGCGACCGCGGCTCCCTTCAGGTAGTCTGTTGTGGTCAACTAAAACTGGCCTCCGCGTTAGAGTTTTTCCAGTATCGGTTTTCTGATTCGTTTGGTGGTAACCCACCATTATATTCGTGCGGTCTTAGTGCGCTGTAATATCCAACGATATAGTCCGTTATTGCGTGAGCTGCATCGCTGAAGCTTACATAGCCCGTCGCTGGCACCCATTCGTTCTTCAGACTCCTGAAGAAGCGCTCCATTGGGCTGTTATCCCAGCAGTTTCCACGCCGACTCATACTCTGCCTGATCCGGTATCGCCACAGTAACTGCCGGAACTGCCTGCTCGTATAATGACTGCCTTGATCGCCTGGAACATCACCCCGACGGGCTTACCACGGGTTTCCCATGCCATTTCCAGTGCTTTCATGGTAAGCCTGCTGTCCGGCGAGAACGACATGGCCCAGCCCACTGGTTTTCTTGCGAACAGGTCGAGAACAACGGCGAGGTACGCCCAGCGCTTACCCGTCCAGATATAGGTCACATCACCGCACCACACCTGATTTGGTTCCGTTACGGCGAACTGTCGCTCAAGATGATTCGGGATAGCAACGTGCTCATGACCGCCACGCTTATACCGGTGAGTCGGCTGCTGGCAACTGACCAGCCCCAGCTCTTTCATGAGTCTGCCAGCAAGCCAGTGCCCCATCTGGTAGCCTCTCCGGGTTGCCATTGTGGCGATGCTTCTTGCTCCGGCAGAGCCGTGGCTGATGCCATGCAGTTCAAGTACCTGGCTGCGTAATACAGCCCGTCTGCCGTCTGGTTTTTCAGGACGGTTTTTCCAGTATCTGTAGCTGCTGCGATGAACCCCGAACACATGGCAGAGTGTGACCACAGGATAATGCGCTCTGAGTTTCCTGATTATCGAGAACTGTTCAGGGAGTCTGACATCAAGAGCGCGGTTGTAGATTCAATTGGTCAACGCAACAGTTATGTGAAAACATGGGGTTGCGGAGGTTTTTTGAATGAGACGAACATTTACAGCAGAGGAAAAAGCCTCTGTTTTTGAACTATGGAAGAACGGAACAGGCTTCAGTGAAATAGCGAATATCCTGGGTTCAAAACCCGGAACGATCTTCACTATGTTAAGGGATACTGGCGGCATAAAACCCCATGAGCATAAGCGGGCTGTAGCTCACCTGACACTGTCTGAGCGCGAGGAGATACGAGCTGGTTTGTCAGCCAAAATGAGCATTCGTGCGATAGCTACTGCGCTGAATCGCAGTCCTTCGACGATCTCACGTGAAGTTCAGCGTAATCGGGGCAGACGCTATTACAAAGCTGTTGATGCTAATAACC
GAGCCAACAGAATGGCGAAAAGGCCAAAACCGTGCTTACTGGATCAAAATTTACCATTGCGAAAGCTTGTTCTGGAAAAGCTGGAGATGAAATGGTCTCCAGAGCAAATATCAGGATGGTTAAGGCGAACAAAACCACGTCAAAAAACGCTGCGAATATCACCTGAGACAATTTATAAAACGCTGTACTTTCGTAGCCGTGAAGCGCTACACCACCTGAATATACAGCATCTGCGACGGTCGCATAGCCTTCGCCATGGCAGGCGTCATACCCGCAAAGGCGAAAGAGGTACGATTAACATAGTGAACGGAACACCAATTCACGAACGTTCCCGAAATATCGATAACAGACGCTCTCTGGGGCATTGGGAGGGCGATTTAGTCTCAGGTACAAAAAACTCTCATATAGCCACACTTGTAGACCGAAAATCACGTTATACGATCATCCTTAGACTCAGGGGCAAAGATTCTGTCTCAGTAAATCAGGCTCTTACCGACAAATTCCTGAGTTTACCGTCAGAACTCAGAAAATCACTGACATGGGACAGAGGAATGGAACTGGCCAGACATCTAGAATTTACTGTCAGCACCGGCGTTAAAGTTTACTTCTGCGATCCTCAGAGTCCTTGGCAGCGGGGAACAAATGAGAACACAAATGGGCTAATTCGGCAGTACTTTCCTAAAAAGACATGTCTTGCCCAATATACTCAACATGAACTAGATCTGGTTGCTGCTCAGCTAAACAACAGACCGAGAAAGACACTGAAGTTCAAAACACCGAAAGAGATAATTGAAAGGGGTGTTGCATTGACAGATTGAATCTACAGTAGCCTTTTTTAATATTTCATTCTCCATTTCAATGCGTTGTAGCTTTTTCCTCAGCTTACGTATTTCGATTTGTTCTGGTGTTATCGGAGAGGCTTTTGGTGTTTTGCCCTGACGCTCATCACGCAGTTGTTTGACCCATCTTGTCATTGTGGAAAGGCCAACATCCATAGCTTTGGCGGCATCTGCCACCGTGTATGTCTGGTCAACAACCAGTTGAGCGGATTCGCGTTTAAACTCTGCGCTAAAATTTCTTTTTTTCATTGGAGCACCTGTGTTGTTCTGAGGTGAGCATATCACCTCTGTTCAGGTGGCCAAATTCAGTAAACCACTTCACCACTTCAGAGCAGACGAGACAAAAAGAATGGTTGACGCATCAGCTCAACCCCATATAGTTGTAACACTTGAGCCAAACCCTTGGGCCGCTTTTTACTTTGATATTAACATTGCTAATACAGGGAACGCACCTGCCTATAATGTTGAGGTTGTGTTTGATCCTCCACTAGTAAATGCGGAGCATAGAGAAAAAAGTGAGATTCCGTTTAGTAAGGTAAGCGTGTTAAAAAATGGGCAATCACTTACCAGCAATCTCTGTAAGTATGAACAAATCAAAGATCAAATTTATAATATTAATATAAGCTGGGCAAGCAAACCTAAATCAAACGATAGAGAAACAAATGAATATGTGTATGACATGGCGACATTTGAAGGAATAAGTTATCTAGGAGCGAGAAGCCCATTGACGCAAATTGCAGAACAAATTAAAGGTATAAGAGAGGATTGGAAACCTATTGCACAAGGAGCTAAAAAAGTAAAAGCAGACGTATATACTTCAAGCGATAGAAACGAAGAACGCACGTATCTGCAAGAGCAACACGATTTGGCAATAAAAAGGAGAGATGAGAAAAGAGAAAAAAGATTAGAGTCTGGTGAATAATTTTAAAGGGAGTGGGTAACTAACCCACTCGTAACTATAAACCTGTAATTAATCACTTATTTTTGACAACAGATAATTACTGAACGCACTGCAAGTGACTAACATAAATTTAGCTTCAGCTAAGGTAGGATTTACATCTTCTTCTGTTAGCGCATGACGTATTCCACCCTGATCACTTGTATAACCATAGAGCTGACTAAAAGCGCCTTTCATTGCAGAGTGTATATATCCTTTTTCCT
CTATAGCTTTAAGACAAGCCCCCAAGGTTCCTTTATCATTGCCCGTGATTTTCCTGCATAAAGATTCAATTGCAGAGATAGACTCTTTAATCGAGTTTCTGTAGTCTGGCTGCTCTCTATCCGTCATTAGTTGTAACGCCCTTTCGAAATGGCTACGCGATGAATCAGTGCCATTATCAACTGCGTTCTGAACACTTTCAATTTCGTTATCATTTGAAATAGGAGTAATACAACCATTTATTATGGTATAACCAACGCCATGCTTTTTAAAGATGGAATTGAGATGCTTCGATAGATTAATATATGAATTAGTTCTCTCAATGATGAACTCAATTAAATCATATACCAAATACCATGCTTCCCCATATATATAATCTCGGATAGCAGTCAGCAACGTCTTATCACTTTTGTATCCACTTTCATAACGAGGAATATTATCCGCAGGTTGATTTAGATAATATATCCACACAGACTGCGCACATTTTGTTGCTGTAGCAGTTTGACGATTGTTAGTCCAAAGGAAAAGATATAAGCAATTCCACAATGCCATGCGTGTATCAGAATTAAGATCATTCAGCTGGACATGCTCTCTAACGTCAACATGACCATACCTCACAGAAAATGGCTTTATCAT
Protein sequences of DBSCAN-SWA_9 >CP034958|4098759:4105318|4104493_4105318_-|QAS87040.1|DBSCAN-SWA MIKPFSVRYGHVDVREHVQLNDLNSDTRMALWNCLYLFLWTNNRQTATATKCAQSVWIYYLNQPADNIPRYESGYKSDKTLLTAIRDYIYGEAWYLVYDLIEFIIERTNSYINLSKHLNSIFKKHGVGYTIINGCITPISNDNEIESVQNAVDNGTDSSRSHFERALQLMTDREQPDYRNSIKESISAIESLCRKITGNDKGTLGACLKAIEEKGYIHSAMKGAFSQLYGYTSDQGGIRHALTEEDVNPTLAEAKFMLVTCSAFSNYLLSKISD >CP034958|4098759:4105318|4103872_4104445_+|QAS87039.1|DBSCAN-SWA MVDASAQPHIVVTLEPNPWAAFYFDINIANTGNAPAYNVEVVFDPPLVNAEHREKSEIPFSKVSVLKNGQSLTSNLCKYEQIKDQIYNINISWASKPKSNDRETNEYVYDMATFEGISYLGARSPLTQIAEQIKGIREDWKPIAQGAKKVKADVYTSSDRNEERTYLQEQHDLAIKRRDEKREKRLESGE >CP034958|4098759:4105318|4099716_4100484_+|QAS87035.1|DBSCAN-SWA MTLRTENLTVSYGTDKVLNDVSLSLPTGKITALIGPNGCGKSTLLNCFSRLLMPQSGTVFLGDNPINMLSSRQLARRLSLLPQHHLTPEGITVQELVSYGRNPWLSLWGRLSAEDNARVNVAMNQTRINHLAVRRLTELSGGQRQRAFLAMVLAQNTPVVLLDEPTTYLDINHQVDLMRLMGELRTQGKTVVAVLHDLNQASRYCDQLVVMANGHVMAQGTPEEVMTPGLLRTVFSVEAEIHPEPVSGRPMCLMR >CP034958|4098759:4105318|4101041_4101299_-|QAS87036.1|DBSCAN-SWA MTLVNDTGFDPVFSGSIAESWRQQPCTPSYCCDWEAATMLRAFPLAKKGEGRARLPSLYASFGKLGETPTHKDIIDNNRSINWPV >CP034958|4098759:4105318|4098759_4099716_+|QAS87034.1|DBSCAN-SWA MKIALVIFITLALAGCALLSLHMGVIPVPWRALLTDWQAGREHYYVLMEYRLPRLLLALFVGAALAVAGVLIQGIVRNPLASPDILGVNHAASLASVGALLLMPSLPVMVLPLLAFAGGMAGLILLKMLAKTHQPMKLALTGVALSACWASLTDYLMLSRPQDVNNALLWLTGSLWGRDWSFVKIAIPLMILFLPLSLSFCRDLDLLALGDARATTLGVSVPHTRFWALLLAVAMTSTGVAACGPISFIGLVVPHMMRSITGGRHRRLLPVSALTGALLLVVADLLARIIHPPLELPVGVLTAIIGAPWFVWLLVRMR >CP034958|4098759:4105318|4103421_4103772_-|QAS87038.1|transposase|DBSCAN-SWA MKKRNFSAEFKRESAQLVVDQTYTVADAAKAMDVGLSTMTRWVKQLRDERQGKTPKASPITPEQIEIRKLRKKLQRIEMENEILKKATVDSICQCNTPFNYLFRCFELQCLSRSVV >CP034958|4098759:4105318|4102350_4103502_+|QAS87037.1|transposase|DBSCAN-SWA 
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPHEHKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRGRRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISGWLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRRHTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATLVDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELARHLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHELDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD |
7 | uncultured_Caudovirales_phage(16.67%) | transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those annotated with a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
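The keyword rule described above can be illustrated with a minimal sketch. It assumes a case-insensitive substring match against a protein's annotation text. The actual matching logic used by DBSCAN-SWA is not shown on this page, and the names `PHAGE_KEYWORDS`, `matched_keywords` and `is_phage_related` are hypothetical.

```python
# Keywords copied from the list in the text above; 'tRNA' is lowercased
# because the comparison below is done on lowercased annotation text.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def matched_keywords(annotation):
    """Return the phage-related keywords found in an annotation string.

    Assumption: a plain case-insensitive substring test. This is only an
    illustration and may differ from the tool's real rule (for example,
    a substring test would also match 'plate' inside 'template').
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


def is_phage_related(annotation):
    """True if the annotation contains at least one phage-related keyword."""
    return bool(matched_keywords(annotation))


if __name__ == "__main__":
    # Annotation strings here are illustrative inputs, not tool output.
    for annotation in ("transposase", "hypothetical protein", "Tail fiber protein"):
        print(annotation, "->", matched_keywords(annotation))
```

Under this assumption, an entry annotated as `transposase` (as for QAS87037.1 and QAS87038.1 in region DBSCAN-SWA_9) would be flagged, while an entry with no keyword in its annotation would not.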
DBSCAN-SWA_10 |
4505967 : 4540540
Sequences of DBSCAN-SWA_10
Nucleotide sequences of DBSCAN-SWA_10 >CP034958|4505967:4540540|DBSCAN-SWA CGTGACAACTATAGTAAGCGTACGCCGTAACGGCCATGTGGTCATCGCTGGTGATGGTCAGGCCACGTTGGGCAATACCGTAATGAAAGGCAACGTGAAAAAGGTCCGCCGTCTGTACAACGACAAAGTCATCGCGGGCTTTGCGGGCGGTACTGCGGATGCTTTTACGCTGTTCGAACTGTTTGAACGTAAACTGGAAATGCATCAGGGCCATCTGGTTAAAGCCGCCGTTGAGCTGGCAAAAGACTGGCGTACCGATCGCATGCTGCGCAAACTTGAAGCACTGCTGGCAGTCGCGGATGAAACTGCATCGCTTATCATCACCGGTAACGGTGACGTGGTGCAGCCAGAAAACGATCTTATTGCTATCGGCTCCGGCGGCCCTTACGCCCAGGCTGCGGCGCGCGCGCTGTTAGAAAACACTGAACTTAGCGCCCGTGAAATTGCTGAAAAGGCGTTGGATATTGCAGGCGACATTTGCATCTATACCAACCATTTCCACACCATCGAAGAATTAAGCTACAAAGCGTAAGGATCTCCCATGTCTGAAATGACCCCACGCGAAATCGTCAGCGAACTGGATAAGCACATCATCGGCCAGGACAACGCCAAGCGTTCTGTGGCGATTGCTCTGCGTAACCGCTGGCGTCGCATGCAGCTCAACGAAGAGCTGCGCCATGAAGTGACCCCGAAAAATATCCTGATGATCGGCCCGACCGGTGTCGGTAAAACTGAAATCGCCCGTCGTCTGGCTAAGCTGGCGAATGCGCCGTTCATCAAAGTTGAAGCGACCAAATTCACCGAAGTGGGCTACGTCGGTAAGGAAGTGGATTCTATTATTCGCGATCTGACCGATGCCGCCGTGAAAATGGTACGCGTCCAGGCTATCGAGAAAAACCGTTATCGCGCTGAAGAACTGGCAGAAGAACGTATTCTCGACGTGCTGATCCCACCTGCTAAAAACAACTGGGGACAGACCGAACAGCAGCAGGAACCGTCCGCTGCTCGTCAGGCATTCCGCAAAAAACTGCGTGAAGGCCAGCTTGATGACAAAGAAATCGAGATCGATCTTGCCGCAGCACCGATGGGCGTTGAAATTATGGCTCCTCCAGGCATGGAAGAGATGACCAGCCAGCTGCAGTCCATGTTCCAGAACCTGGGCGGCCAGAAGCAAAAAGCGCGTAAGCTGAAAATCAAAGACGCCATGAAGCTGCTGATTGAAGAAGAAGCGGCGAAACTGGTGAACCCGGAAGAGCTGAAGCAAGACGCTATCGACGCTGTTGAGCAGCACGGGATCGTGTTTATCGACGAAATCGACAAAATCTGTAAGCGCGGCGAGTCTTCCGGTCCGGATGTTTCTCGTGAAGGCGTTCAGCGTGACCTGCTGCCGCTGGTAGAAGGTTGCACCGTTTCCACCAAACACGGGATGGTCAAAACTGACCACATTCTGTTTATCGCTTCTGGCGCGTTCCAGATTGCGAAACCGTCTGACCTAATCCCGGAACTGCAAGGTCGTCTGCCAATCCGCGTTGAACTGCAGGCACTGACCACCAGCGACTTCGAGCGTATTCTGACCGAGCCGAATGCCTCTATCACCGTGCAGTACAAAGCACTGATGGCGACTGAAGGCGTAAATATCGAGTTTACCGACTCCGGTATTAAACGCATCGCGGAAGCGGCATGGCAGGTGAACGAATCTACCGAAAACATCGGTGCTCGTCGTTTACACACTGTTCTGGAGCGTTTAATGGAAGAGATTTCCTACGACGCCAGCGATTTAAGCGGTCAAAATATCACTATTGACGCAGATTATGTGAGCAAACATCTGGATGCGTTGGTGGCAGATGAAGATCTGAGCCGTTTTATCCTATAATCGCGTTCAATCATTTTCATCATTGTTTGATGGGGCTGAAAGGCCCCATTTT
TATTGGCGCGTATTATGACTGAACAACAAATTAGCCGAACTCAGGCGTGGCTGGAAAGTTTACGACCTAAAACCCTCCCCCTCGCCTTTGCTGCAATTATCGTCGGGACGGCGCTGGCATGGTGGCAAGGTCACTTCGATCCGCTGGTCGCCCTGCTGGCACTAATTACCGCCGGGCTATTACAGATCCTTTCTAACCTCGCCAATGATTACGGCGATGCGGTAAAAGGCAGCGATAAACCTGACCGCATTGGGCCGCTACGCGGCATGCAAAAAGGGGTCATTACCCAGCAAGAGATGAAACGGGCGCTCATTATTACCGTCGTGCTCATCTGTCTCTCCGGGCTGGCACTGGTTGCAGTGGCATGCCATACGCTGGCCGATTTTGTCGGTTTCCTGATTCTTGGCGGGTTGTCGATCATTGCCGCTATCACCTACACCGTGGGCAATCGTCCTTATGGTTATATCGGTCTGGGTGATATTTCCGTACTGGTTTTCTTTGGCTGGTTGAGCGTCATGGGGAGCTGGTATTTACAGGCTCATACATTGATTCCGGCACTGATCCTTCCGGCGACCGCCTGTGGCCTGCTGGCAACGGCAGTACTGAACATTAATAACCTGCGTGATATCAATAGCGACCGCGAAAATGGCAAAAACACGCTGGTGGTGCGCTTAGGTGAAGTGAACGCGCGTCGTTATCATGCCTGCCTGCTGATGGGCTCTCTGGTATGTCTGGCGCTGTTTAATCTCTTTTCGCTGCATAGCCTGTGGGGCTGGCTGTTCCTGCTGGCGGCACCATTACTGGTGAAGCAAGCCCGTTATGTGATGCGGGAAATGGACCCGGTGGCGATGCGACCAATGCTGGAACGTACTGTCAAGGGAGCGTTACTGACTAACCTGCTGTTTGTTTTAGGGATATTCCTAAGCCAGTGGGCAGCATAACTAACAAATATCAATTAACAATTGATGATTTTGCCAACAGCCCACATAGCGCGATATACTGAAAATTCTCGCAGCAACTGAATGTTAAGCCTATGAAATACGATACTTCCGAGCTTTGTGACATCTATCAAGAAGATGTTAACGTCGTGGAACCGCTGTTCTCCAACTTTGGCGGACGGGCGTCGTTTGGCGGACAAATAATCACGGTAAAATGTTTCGAGGACAACGGGTTGCTGTACGATCTGCTCGAACAGAATGGCCGTGGTCGTGTTCTTGTCGTCGATGGCGGTGGTTCTGTTCGTCGCGCACTGGTCGATGCTGAACTGGCGCGTCTGGCAGTACAAAATGAATGGGAAGGTCTGGTTATTTACGGCGCGGTGCGTCAGGTAGATGACCTGGAAGAGTTGGATATCGGCATCCAGGCGATGGCGGCAATTCCGGTTGGTGCCGCTGGCGAAGGCATTGGCGAAAGCGATGTCCGCGTCAATTTTGGCGGTGTCACCTTCTTCTCCGGCGACCATCTTTATGCCGACAATACCGGGATTATTCTTTCAGAAGATCCGCTGGATATTGAATGATAAGAAAGGCACCGCAAGGTGCCTTTTTTCTGCGTTACCTGTTGGCCTACACAGTAAAGAAATTACGCGAAAGATGAAGCGTAATCAGACCTCTTCCATGCGACCCAGCAGGGCCTGCAGACGTTCCTGCCAGCCGTTCTGCTGTTCTTTCAGATGGTTGTTCTCACGCTCCAGCTCTTCGCGCTGATGCTGGGCATTTTGAACTTCCTGCGACAGTGAGTTGTTTTTTTCTTTCAGCTCTTCGATTTCCATCTGCAACAGAGTGATGGTATCAATCGCCTGCTGTACTTTTGCTTCCAGTTTCTCAAACACTTCTAATGACATTGTCATACCTCTCCTGAATTGCAAGGCGTTGATGGATAAAAAATCCTCGTCCCGATTACCGGTGACGCCTTAATAAATACGAGCGCACTTTAGTTAGCTCCGATTGTATGAAGCCGCGCCATCGCTGTCCAGCGGCACGCCTTGCAGATTACGGTTT
GCCACACTTTTCATCCTTCTCCTGGAGACATAATCCACACCAATCGAAAATGTTAATAAATTTGTTGCGCGAATGATCTAACAAACATGCATCATGTACAATCAGATGGAATAAATGGCGCGATAACGCTCATTTTATGACGAAGCACACACATTTTAAGTTCGATATTTCTCGTTTTTGCTCGTTAACGATAAGTTTACAGCATGCCTACAAGCATCGTGGAGGTCCGTGACTTTCACGCATACAACAAACATTAACTCTTCAGGATCCGATTATGAGTCAAACATCAACCTTGAAAGGCCAGTGCATTGCTGAATTCCTCGGTACCGGGTTGTTGATTTTTTTCGGTGTGGGTTGCGTTGCAGCACTAAAAGTCGCTGGTGCGTCTTTTGGTCAGTGGGAAATCAGTGTCATTTGGGGACTGGGGGTGGCAATGGCTATCTACCTGACCGCAGGGGTTTCCGGCGCGCATCTTAATCCCGCTGTTACCATTGCATTGTGGCTGTTTGCCTGTTTCGACAAGCGCAAAGTTATTCCTTTTATCGTTTCACAAGTTGCCGGCGCTTTCTGCGCTGCGGCTTTAGTTTACGGGCTTTACTACAATTTATTTTTCGACTTCGAGCAGACTCATCACATTGTTCGCGGCAGCGTTGAAAGTGTTGATCTGGCTGGCACTTTCTCTACTTACCCTAATCCTCATATCAATTTTGTGCAGGCTTTCGCAGTTGAGATGGTGATTACCGCTATTCTGATGGGGCTGATCCTGGCGTTAACGGACGATGGCAACGGTGTACCACGCGGCCCTTTGGCTCCCTTGCTGATTGGTCTACTGATTGCGGTCATTGGCGCATCTATGGGCCCATTGACGGGTTTTGCCATGAACCCAGCGCGTGACTTTGGTCCGAAAGTCTTTGCCTGGCTGGCGGGCTGGGGCAATGTCGCCTTTACCGGCGGCAGAGACATTCCTTACTTCCTGGTGCCGCTTTTCGGCCCTATCGTTGGCGCGATTGTAGGTGCATTTGCCTACCGCAAACTGATTGGTCGCCATTTGCCTTGCGATATCTGTGTTGTGGAAGAAAAGGAAACCACAACTCCTTCAGAACAAAAAGCTTCGCTGTAATGTGACTACGGGACAACTAAACATGACTGAAAAAAAATATATCGTCGCGCTCGACCAGGGCACCACCAGCTCCCGCGCGGTCGTGATGGATCACGATGCCAATATCATTAGCGTGTCGCAGCGCGAATTTGAGCAAATCTACCCAAAACCGGGCTGGGTAGAACACGACCCAATGGAAATCTGGGCCACCCAAAGCTCCACGCTGGTAGAAGTGCTGGCGAAAGCCGATATCAGTTCCGATCAAATTGCAGCTATCGGTATCACTAACCAGCGTGAAACCACTATTGTCTGGGAAAAAGAAACCGGCAAGCCCATCTATAACGCCATTGTCTGGCAGTGCCGTCGTACCGCAGAAATCTGCGAGCATTTAAAACGTGACGGTTTAGAAGATTATATCCGTAGCAATACCGGTCTGGTGATTGACCCGTACTTCTCTGGCACCAAGGTGAAGTGGATCCTCGATCATGTGGAAGGCTCTCGCGAGCGTGCGCGTCGTGGTGAATTGCTGTTTGGTACGGTTGATACGTGGCTTATCTGGAAAATGACTCAGGGCCGTGTCCATGTGACCGATTACACCAACGCCTCTCGTACTATGTTGTTCAACATCCATGCCCTGGACTGGGACGACAAAATGCTGGAAGTGCTGGATATTCCGCGCGAGATGCTGCCAGAAGTGCGTCGTTCTTCCGAAGTGTACGGTCAGACTAACATTGGCGGCAAAGGCGGCACGCGTATTCCAATCTCTGGGATCGCCGGTGACCAGCAGGCCGCGCTGTTTGGTCAGTTGTGCGTGAAAGAAGGGATGGCGAAGAACACCTATGGCACTGGCTGCTTTATGCTGATGAACACTGGCGAGAAAGCGGTGAAATCAGAAAACGGC
CTGCTGACCACCATCGCCTGCGGCCCGACTGGCGAAGTGAACTATGCGCTGGAAGGTGCGGTGTTTATGGCGGGCGCATCCATTCAGTGGCTGCGCGATGAGATGAAGTTGATTAACGACGCGTACGATTCCGAATACTTTGCCACCAAAGTGCAAAACACCAACGGCGTATATGTGGTCCCAGCATTTACCGGGCTGGGTGCGCCGTACTGGGACCCGTATGCGCGCGGGGCGATTTTCGGTCTGACTCGTGGGGTAAACGCGAACCACATTATCCGCGCGACGCTGGAGTCTATTGCTTATCAGACGCGTGACGTGCTGGAAGCAATGCAGGCCGACTCTGGTATTCGTCTGCACGCCCTGCGCGTGGATGGCGGCGCAGTAGCAAACAATTTCCTGATGCAGTTCCAGTCCGATATTCTCGGTACGCGCGTTGAGCGCCCGGAAGTGCGCGAAGTCACCGCATTGGGTGCGGCCTATCTTGCTGGTCTGGCGGTTGGCTTCTGGCAGAACCTCGACGAGCTGCAAGAGAAAGCGGTGATTGAGCGCGAGTTCCGTCCAGGCATCGAAACCACTGAGCGTAATTACCGTTACGCAGGCTGGAAAAAAGCGGTCAAACGCGCGATGGCGTGGGAAGAACACGACGAGTAATGTATGCCGGATGAAGCGTTTTTGCCGCATCCGGTAGTCCCGAAACGTGCGGGGGCAACCCCGCACACATCAATAATCCCTCCCTTCCCCTGTGCTACACTTCGCGCCATTCCTTACTGCTTAGAGTTTGCTATGAGACGAGAACTTGCCATCGAATTTTCCCGCGTCACCGAATCAGCGGCGCTGGCTGGCTACAAATGGTTAGGACGCGGCGATAAAAACACCGCGGACGGCGCGGCGGTAAACGCCATGCGTATTATGCTCAACCAGGTCAACATTGACGGCACCATCGTCATTGGTGAAGGTGAAATCGACGAAGCACCGATGCTCTACATTGGTGAAAAAGTCGGTACTGGTCGCGGCGACGCGGTAGATATTGCTGTTGATCCGATTGAAGGCACGCGCATGACGGCGATGGGCCAGGCTAACGCGCTGGCGGTGCTGGCAGTAGGCGATAAAGGCTGCTTCCTCAATGCGCCGGATATGTATATGGAGAAGCTGATTGTCGGGCCGGGAGCCAAAGGCACCATTGATCTGAACCTGCCGCTGGCGGATAACCTGCGCAATGTAGCGGCGGCGCTCGGCAAACCGTTGAGCGAACTGACGGTAACGATTCTGGCTAAACCACGCCACGATGCCGTTATCGCTGAAATGCAGCAACTCGGCGTACGCGTATTTGCTATTCCGGACGGCGACGTTGCGGCCTCAATTCTCACCTGTATGCCAGACAGCGAAGTTGACGTGCTGTACGGTATTGGTGGCGCGCCGGAAGGCGTAGTTTCTGCGGCGGTGATCCGCGCATTAGATGGCGACATGAACGGTCGTCTGCTGGCGCGTCATGACGTCAAAGGCGACAACGAAGAGAATCGTCGCATTGGCGAGCAGGAGCTGGCACGCTGCAAAGCGATGGGCATCGAAGCCGGTAAAGTATTGCGCCTGGGCGATATGGCGCGCAGCGATAACGTCATCTTCTCTGCCACCGGTATTACCAAAGGCGATCTGCTGGAAGGCATTAGCCGCAAAGGCAATATCGCGACTACCGAAACGCTGCTGATCCGCGGCAAGTCACGCACCATTCGCCGCATTCAGTCCATCCACTATCTGGATCGCAAAGACCCGGAAATGCAGGTGCACATCCTCTGATTGATTTGATCGATTGAGCCTTCCAGTCCTTCGGGACTGGAATTTTTTTGTTCGGAGAACGAAGATAAGGCAAGTTAATCAAAACAGGAGAAAAACATGGCTGATTGGGTAACAGGCAAAGTCACTAAAGTGCAGAACTGGACCGACGCCCTGTTTAGTCTCACCGTTCACGCCCCCGTGCTTCCGTTTACCGCCGGGCAATTCAC
CAAGCTTGGCCTTGAAATCGACGGCGAACGCGTCCAGCGCGCCTACTCCTATGTTAACTCGCCCGATAATCCCGATCTGGAGTTTTACCTGGTCACCGTCCCCGATGGCAAATTAAGCCCACGATTGGCGGCACTGAAACCAGGCGATGAAGTGCAGGTGGTTAGCGAAGCAGCTGGCTTCTTTGTTCTGGATGAAGTACCGGACTGCGAAACGCTATGGATGCTGGCAACCGGTACAGCGATTGGCCCTTATTTATCGATTCTGCAACTAGGCAAAGATTTAGATCGCTTCAAAAATCTGGTCCTGGTTCACGCCGCACGTTATGCCGCCGACTTAAGCTATTTGCCACTGATGCAAGAACTGGAAAAACGCTACGAAGGGAAACTGCGCATTCAGACGGTGGTCAGTCGGGAAACGGCAGCGGGGTCGCTCACCGGGCGGATACCGGCGTTAATTGAAAGTGGGGAACTGGAAAGCGCGATTGGCCTGCCGATGAATAAAGAAACCAGCCATGTGATGCTGTGCGGCAATCCACAGATGGTGCGCGATACACAACAGTTGCTGAAAGAGACCCGGCAGATGACGAAACATTTACGTCGCCGACCGGGCCATATGACAGCGGAGCATTACTGGTAAGCGGTTACTTATCGATAAACGGCACGATGAGCAAATCCGCACTCATCTTATTGATCATCCCGCGATATGCCGGCATCAAACGGTTGATAAATGAGTGATGATGACCACAGACAAGGAGGTCGCACTGCTCTTTTTGCATAATTTCCAGCAGTGTTTCCGGCATTTCTCCGCGTTCAATACGCAGTTTTGTCTTCGGCCATTGAATATTTTTCGTCAGTTTATACAGCTTGTTATCCGACTTATTCTTCAACAATTGAAGAATATCTTCTGTTGCAGGGAAGTAGATACCCGGATACAACTCGCTTAAGCCATCATCAATATGAATTAACGTCAGGTGAGCGTCATTATGTCTGGCGAGCTCCAGAGCTTTATTCACCAGTAAGGCATCTTCTTCATTCCCGGAAATTGCCACGCCAATGTGTTTATAAGCCATGTTTAACTCCTTCTAAAGCAACTCCATCAAGCTAACGAACGCAGTGATAACTCAAAAATAATCATCTCGGCCTGGCATGAGAAAGTGAAGGCCGCATCAAGCTCAACTTGCCCCTCCAGATCCTTAATCTTGTAGGCCACGCTGGCCCCTTCATCTGCAGAGGATGCAGCGGCGGCAAGCTCTTTCAGTTTGGTCATAAAAGACTCTGCTTCTTCGCGGGTGGCAAAAAAGCGCGAAAACTTACTGGTACAGTTGTCGTTATCAATCACCGTACCGATATCAATCGCACATCCTTTAGTACTGCATTTATCTACGACATCTTTCATAGGGCACCTCTGTATATTCGCCTCTCTGTCGATTACGACTTAAGAATAATGCGCCGTTAAGGGAAGAACAAACAAGTAATGATCTTATGGATTACCATTTTTTTTGATATTGCTTAAGGTGAAGAGTTCAGAAGGATTTTCCGGGTATTGTCATTTGCATCGTGGAAATAACAATCCAGGAAAAAAATAGCAGTAAGAGATATATTAAAGGTTAATTACTTGATTTATTGTCGGCTTTATACTTCACATCCTGAGTATCTTTACCATATTTATTTTCGCCTTGTGTGCCAACAAACGCACCAAGATCGATAAGCATCATCACCAGAATCAACGTCGGGACAAAACGCCCCACCGCCCATTGCCAGACACCCGGTAAAATCGCCCAGTTACCCGCCAGCAGCATCCACGCCACAATCATCAGAAATGCCCATGCGCCGGAACGCCCGCGATCATGCAAGCGCTTAACAGTTACTGCCGCTGTTGGCCAAAGCAAGCACACAAGGCAAAACGCCGCGGTCTGAATATCGAGTAAATTTTTACCCGCCAGTGAGAAAAGCACCAGCATGCCTGCGAACCACAGGCCTATCCAAATCCAGAAATCACG
GCGTCCAATACGCCCTTTAAATGAGAATAACCATTGCTGTATGGTCATGTAAGTTCCTTGATGGTTGTCTTTTCCAGGATTCTACCCTTTTGACAAGGGCGACAGATATCGTTTTAATCGGAGCCAGTCATAACAAAAGGTACTGTCAATGAAGCCAGGGTGTACGCTGTTTTTTCTCTTATGTTCTGCATTAACCGTTACAACAACGGCGCATGCACAAACACCAGATACGGCAACGACCGCGCCTTATCTGCTGGCTGGAGCCCCTACTTTCGATCTCTCCATCAGCCAGTTTCGAGAAGACTTTAACAGCCAGAATCCTAGCCTGCCACTGAACGAATTTCGTGCCATCGACAGCAGTCCCGACAAAGCCAATCTCACTCGTGCTGCAAGTAAAATTAACGAGAACTTGTATGCTTCTACAGCGCTGGAGCGCGGTACCTTAAAAATCAAAAGCATTCAAATGACCTGGCTACCCATCCAGGGGCCAGAGCAAAAAGCAGCGAAAGCGAAAGCTCAGGAATACATGGCAGCAGTGATCCGCACACTCACCCCATTAATGACCAAAACACAAAGCCAGAAAAAACTGCAGTCGCTGCTAACGGCGGGGAAAAACAAACGTTATTACACCGAGACAGAAGGTGCACTGCGTTATGTTGTCGCGGACAACGGCGAAAAGGGGCTGACCTTCGCTGTTGAACCGATTAAGCTGGCGCTATCTGAATCGCTTGAAGGTTTGAATAAATGACAAAAAGCAAAGCCTTTGTGCCGATGAATCTCTATACTGTTTCACAGACCTGCTGCCCTGCGGGGCGGCCATCTTCCTTTATTCGCTTATAAGCGTGGAGAATTAAAATGCGACATCCTTTAGTGATGGGTAACTGGAAACTGAACGGCAGCCGCCACATGGTTCACGAGCTGGTTTCTAACCTACGTAAAGAGCTGGCAGGTGTTGCTGGCTGTGCGGTTGCAATCGCACCACCGGAAATGTACATCGATATGGCGAAGCGCGAAGCTGAAGGCAGCCACATCATGCTGGGTGCGCAAAACGTGGACCTGAACCTGTCCGGCGCATTCACCGGTGAAACTTCTGCCGCTATGCTGAAAGACATCGGCGCTCAGTACATCATCATCGGCCACTCTGAACGTCGTACTTACCACAAAGAGTCTGACGAACTGATCGCGAAAAAATTCGCGGTGCTGAAAGAGCAGGGCCTGACTCCGGTTCTGTGCATCGGTGAAACCGAAGCTGAAAACGAAGCGGGCAAAACTGAAGAAGTTTGCGCACGTCAGATCGACGCGGTACTGAAAACTCAGGGTGCTGCGGCATTCGAAGGTGCGGTTATCGCTTACGAACCTGTATGGGCAATCGGTACTGGCAAATCTGCAACTCCGGCTCAGGCACAGGCTGTTCACAAATTCATCCGTGACCACATCGCTAAAGTTGACGCTAACATCGCTGAACAAGTGATCATTCAGTACGGCGGCTCTGTAAACGCGTCTAACGCTGCAGAACTGTTTGCTCAGCCAGATATCGACGGCGCGCTGGTTGGCGGTGCTTCTCTGAAAGCTGACGCTTTCGCAGTGATCGTTAAAGCTGCAGAAGCGGCTAAACAGGCTTAAGTCTGACAGGTGCCGGATTTCATATCCGGCACTTACTTTCCTTAACTCTTCGCCTTAACGCAAAATCTCACACTGATGATCCTGAATTTCCTCGGCTGAAGCACGGTTAAGCGTCAGTAGATTTCGTTGTGTCGCCAGCAATACAAATGAGTTATCACTCTGCCGTACCATCGCCAGCCCGTAGCTTCCCATATGTTCCCGCGCCTCAGGTACTTCTTCTGCCAGCATCATAAATGGGCTGCGTTGTACCAGTTCGCTTTCCGTTACCCGACGCGCCAGGTATTCATGCCCGCGCAAACCACCTGGCAGTGGCAACCAGCGGCTGCTGATGTTCGCCAGATTGTTATCCAGTTGTTTGCGCACATCAGGACGAATAC
AAGAGATATGAATATGAAAATGGTTTTGCGTACGCCCGGTGCGGGAGTTGATCGCCAAAGAAACCGCGCGATCGGGAACCGGCTGGCCGTATTTTTTGCTCATAAAATCACGCGCCTGCCAGGCCAACCAAAAGAAGTTCGGCGTTGAAGGATCGGTCAACAAAGGACTTTCAGTACCGTTAATACGATACGTTGGCATCAACAGATATTGCAGTGGGCCATTAAGATCTTTTAAAACCACGTATCCGGCATTGGGTTTGACTTCCGCACATGGCGAAGGATTTTGATTTTGCTGCTGATTGGGCAAACATTCCTCAAGGACAATCTTACGTAATGTATCCGACTCTTCACCGGTTAATTTCCAGTAACCAATACCGGCAGCCACAACGGCGATAACTATCATCACCAAAAAAAGAAGACCCGCTTTTTTCATCTTTTTTTCCCTGTACCTCAAAGAGATGCAAGGGTAACGCAAAATCGTGACAAATAAAAAACCCGGTCGCGAGAAACCACCGGGTTTGATAATTATCCTGGAGAAATCAGCGTTTGCTGATCTGATCGAACGTACCGCCGTTAGCAAAATGCTCTTTTTGCGCTTTCGTCCAGCCGCCGAACTCTTCATCAATGGTGAATAACTTCAGCTTTGGAAACGCATTTTCGTACTTTTTCGCTACTTCAGCGTCGCGCGGACGGTAGTAGTTTTTCGCGGCAATTTCCTGACCTTCTGGCGAGTAGAGATATTTCAGGTAGGCTTCCGCCACCTCTTTGGTGCCTTTTTTCTCGACCACTTTATCGACCACCGACACGGTTGGCTCTGCGAGGATAGACTCACTCGGCGTGACGATTTCGAATTTATCTTTCCCCAGTTCATTCGCTGCCAGCAGAACTTCGTTTTCCCAAGCGATCAGTACATCGCCAATCCCGCGCTCGACAAAGGTGTTAGTGGAGCCGCGCGCGCCAGAATCCAGAACTTCGACGTTTTTATACAGTGCCCGAACAAAATCCTGTGCTTTTGCCTGATCGTTGTTGTTGTGATGCAGCGCGTAGCCCCAGGCAGCCAGGTAGTTCCAGCGTGCGCCACCAGAGCTTTTCGGATTAGGCGTGATCACCGAAACACCCGGTTTAATCAGATCGTTCCAGTCATGGATCTGCTTCGGATTGCCCTTACGCACCAGGAAAACAATGGTGGAAGTGTACGGTGCGGAGTTATCCGGCAGACGTTTGATCCACTCTTTATCAATCCGCCCGCGTTCCGCAATCGCGTCCACGTCATAGGCCAGAGCCAGCGTGACAACATCAGCTTCAATACCGTTGATTACCGACGTCGCTTGTTTACCTGAACCACCGTGTGACTGACGAATCACCACGTTATCACCAGTTTGCTGTTTCCAGTGGGCGCTGAATGCCTTGTTGTACTGTTCGTACAATTCGCGCGTTGGATCATATGAAACGTTAAGAAGCTGAATATCCTTTGCCATAACGCTGGTTGCCGCCAGCAAAAATGTTAACCCTACGCCCCACTTGTTCATCGCCCGACTCTCTTATGTTGTGTTGTGATGAGCAAAGCGTGCCAGAAGGTTAACCAAACATTAAAGAATAAAAAAAGATTGGCTATAACTTGCGGGTATATGTTGAGGGATTAAAAAGGCGGAAGAAATCCGCCTCATATTGCTGACAAAGTGCGCTTTGTCCATGCCGGATGCGGCGCGAACGCCTTATCCGGCCTACAAAAGTTTGCAAATTCAATAAATTGCAGAATTCATGTAGGCCTGATAAGCGAAGCGCATCAGGCATTTTTGCTTCTGTCATCGGTTTCAGGCTAAAGGAATCTGCCTTTTTCCGAAATCATTAATACAGTTTTTTCGCGCAGTCCAGCCAGTCACCTTTGAACGGACGCTTCATGTTTTCGATAGCGTCGATGATGTCGTGGTGAACCAGCTGTTCGTTCTGGATACCTACACAACGACCGCCGTAACCTGCCAGCAGCAGATCGATAGCGT
AAGCGCCCATACGGGAAGCCAGAATACGGTCGTAAGGCACCGGAGAACCACCGCGCTGGATGTGGCCCAGCACAGTTGCGCGGGTTTCACGACCGGTTTCTTTCTCGATGAAATGCGCCAGTTCGTCAACATCACACATATGTTCGGTAATCGCCACGATCGCGTGTTTTTTACCTTTCGCGATACCCGCTTTGATTTCGTTTACCAGGTCTTCACGGCTGAATTCAACTTCCGGAACCACAACGAATTCACAGCCACCGGCAATGGCCGCAGCCAGGGTCAGATCGCCACAATAACGGCCCATCACTTCCACCACGGAAATACGCTGGTGAGAAGAAGAGGTGTCACGCAGACGGTCGATCGCTTCTACAACGGTGCTCAGCGCAGTGAAGAAACCGATAGTGTAGTCAGTGCCTTTGATGTCGTTGTCGATAGTGCCCGGCAGACCGATGCACGGGAAGCCCATTTCGGTCAGACGCATTGCACCCATGTAGGAACCGTCACCGCCGATAACCACCAGCGCGTCGATCCCACGTTTTTTCAGGTTTTCGATAGCCACGGCGCGGATGTTCTCGTCGCGGAATTCCGGGAAACGCGCAGAACCGAGGAACGTACCGCCACGGTTGATCATGTCAGACACGCTGTAACGGTCTAGCTGTACCATACGGTCTTCATACAGACCCAGATAGCCGTCATAAATACCCATTACTTCCAGACCTTCTGTCAGCGCAGAACGAACAACCCCGCGAATTGCGGCGTTCATGCCTGGCGCATCACCGCCGCTTGTCAACACACCGATTTTCTTAATCATGACTACCTCTGAACTTTGGAATGCAAAATAAAATCTGTTGCCGGAAGTCTTCTTGCACATCGAAGTGATCCAACGAATGTGCAAATAGTATAACAATCACTTCCTGCTGAATTGATTCAGGTCAGGCCAAATGGCGGTATTTTATACACAAAATGCGGGTCTGGCTCTCTTTTATACTGATTATGAAAGCATAGACCGTTTACCCTCCCTGGGTACGACGGAACAGGGGTCCTGATGGATAATCACATCCGATCCCGGAAAACGCCGTAAAATAGCCTGCTCTACCTGATCCGCCACCATATGTGCCTGAACCAAAGGCAGAGAGTCTTCCATTTCCAAATGAATCTGAATAAAGCGGGTCGGCCCTGACTGCCGCGTGCGAAGATCGTGAGCGCCGCTAACACCCGGCCAGGAAGTCACGATATCAATAATTTCTTGCCGTTCCTCATCAGGCAATGCGCGATCCAGTAATGACTGTACCGCCTCATATCCCATGCGTAACGCGCTATATAAAATATAGATGCCGATTCCCAATGCAAACAGAGCATCGGCGCGATGCCAGCCGTACCAGGACAACCCCAGCGCCAGCAGAATTGCGCCGTTCATCATAACATCAGACTGGTAATGTAGCATATCAGCCCTCACCGCCTGGCTTTGCGTCCGGCGCACCACCCAACGCTGAAACGAGACAAGGATAATCGTACAAATTAGCGCCACAATTGTCACGATAACCCCGACGCCTGGATCTGTCATCGGTGTTGGAGATACCAGATGTTGAATACCCGTCAAAAACAGGAATAGTGCCGAACCGGAGATAAACATACTTTGCGCCAGCGCCGCGAGGGACTCAGCTTTACCGTGACCAAACGAGTGATTATCGTCGGCAGGTTGCAGGGAATATCGCACCACCAGTAAATTCGTCAACGACGCGCCGATATCCACCAGCGAATCCACCAGCGCGGCGAGAATACTCACCGACCCGGTATACCACCATGCAAAAATTTTAATCAGCAATAGCAGCGAAGCCATCGCCGTCGCAGCAATCGCCGCCCGACTGACCAGCCGTCCATAAGATTGATTCATAAATACTCCCGCTATCAACTGACGCTAGTATAACGGAAGCAAATCATCTGCAATGCATTAAGCAGCAGGCAAATTGAGGATAAAAAAAACCCCCACATCATGTGGG
GGAAGACAGGGATGGTGTCACAAAAAATCACCTAACCTACTGATATAAATGGATTTATATAAACCACTGTCCACATAGCGTCCACATCGACCATAAATAAAGCCCCTCAACTGAGGGGCTATTTTTGTGATCACATCCACATAATTTGCTGCCCTGACGGCAACGGGTGCGGCCTCACGGCGTGGACTTCTCCCGGCTTCACGATGTATCTCTGTACCGACTCATAAGTGATGAACGTGGCGCTGCAATTCACGTTCTGGCACTGGTGATAACGCTCTTTTGTCGTGTCAGTGATATAGCGGCTTGTACGCGCATGTGCGGCATGCTGGCATAAAGGACAATGAAACATCGCGAGCACCTCTTCCGGTTTTGTTGATAGTGCCATTTTAGTTAAATTATCATTATAAAACAAAAAGATAAACAAAAGACATCACTCATAATCTTCTGTTTCGTACTCCACATCAGAAACCGTCGCGGCGAATCAGCGTAGTGACGCCTGACTCGTTAAGCAGGTCAGCATCGGTGCCGGACTCCTGCAAATCCCAGAATACAGATGCGCTGATGCCGGTAACACCGTTCACCCCGACATTGGACAGCGTTTTATGCCAGCCCTGCTCCTGGTCGATTTTAGCGCGCAGCCCCAGCGCACGGGCGGTGGCATACGCGGTGGCGGTGGTACTGGTGACCGTATCCCATGCGAGGAAATCCGGCCAGATGACCATCAGCTCACGCTGGCTGAAATTCTGGCGGTAGGCTTTCACCTCGGAAATGGTTTTACAGCCCCATGCGCTGATATACCCGAAAGCGCGCAGCTTCTGACAGACTGATGCCAGTGCAACAGCCACCTCTTTGGTATCCAGCCCCGGCACGCCGAGAATACGCGGTTTAACACCGGTTACCGACTCCGCCGCCAGCAGGGCTTTCAGTCCGGTGTACTGACCGTTTTCGTCGGTGGTGCCGATGATATTGGAAACGGTCTGCGCAAGTTTCGCTTCCTCGTCGTCGCCGGTGCCGTCTTCCACACGCACGACAACGGTGACCGGTTTTGACTGGTCGGCGATGGCCTGCAACGATGCCGCCAGCGTGCCTTTTTTACCGGCCTTTGCAATTGCGCTCTGCACATTGGTAATCAGCACTGGTTTATTGAGGGGGAAGATTTCCGCATCCGCATCGCTGGCCGTGCAGACCATGCCAACAATGGCGGTGGATACGGTGGAAATGACGCGGGTGCCGTCGTTAATCTCCAGCACCTGCACGCCGTGATGATAGTCACTCATCCGTTTAACTCCGTGGTTAATGGGTGCAACTATTTTCTGTTGGGCAGTGCATGAGACGCTATTTGACCTGGCTGGTCAGTGGATGAAACAACAGATAAAGAAAAGGCAGGCAATTCGCCCGCCTGTCCTGATTTGTACTCACTCATTTTCCGACTGACAATTTACATAGCCAAAACGCTATCAAATCTGACAGTCTGCTTTGAGCGAGGAGCAGAGGTTAGTTTTAGTTAACCAAAATGATAAAAAGCAGTAGAAAAATCCGCTCATTACGTTATGGTTATAAGCCGCACATAATCATCCGAGCCAAATCCTCTTGATTTCAAAGAATTAGCACTTGCTCTTCACTAGAAACTATGGTTCTGACTTCACGCTCAAATATTGGATCCATTATCATTTTTCTATTTGTTGGAATAGTAAATTTACCGAGTACCATGACAATGCTACTCAACGGAAATAATTTTGGAGTAACATTTCCTAATTGAATCGTAGGATATATATTAAAGCCCCCTTTTATCCTCATTGGCGAACCTTCAATGAAGTGTCGATAATTAAATTTCGCCATAGGGATTCTCGCCACATGCACTTTCATAATTACAGCAACCTGAGCTATTTTTACTTTCTGGAGGTAAGTTGACAATCGTTCTAACTGCGTTGTTTCATAGAAAAAATCAGACCACGTTACAGTACTTGTTCCAGTAACAATTTTTATGCTGTTTCTCTTGT
TGGGGTGAGCTTTTCCATATTCATAGATCTCCCGTAAGCTATCCATGTTCTTCACATAAGGGGCTTTCCGACCTCTCTTGACATAAACGCGGTCTGTAGTATCAGAAGGTGGATTTGCCTGCATAGCTGCAGCTTTTCTGCTTAATTTAGAGATATCATCTTCATCGAGTATATGTATACGGAATATATTATTACCTGTGGCAAGCGCACGGCTGATATCATGACTGGAATCACTTGCATAAATTGTATTCAATCCTGAAGTTCCATACTGGCATAATTTGTCATGCGTGAAATTTTTTTCCAGTCTAAAGAAGGCGGGGATATGCAAGGTTTTATTAAAACTCCGACGAATATGGCTTTTGACATACACCAAATGAGCAGAGCAATTTGGGTCAGGACATTTGAGTGGATAGGGATTAACACTATAATTAAAAGTATTTACATTTACTAAAATCTGATTGTTATCAATAGCCTGAGTTATACGAATACCACTCATTACTCCTCCATAAAGATAGGCGCAAGCATTTAGTAATAAACCATTAACATGCCGAGATCATACCCGATCATTGAAATGTAAACACTCCCCTAAGACGGCTGCTGGCATAATATTTTATGACATCAGCAATGCCCACCTCTGGCACAGAGTGGACTGTCAGATTAGGCTTTACTCTGTGCCATAGATATGTAAGCCCACACTAGAGCTCATACAACTTATTGCGGCATTTCCGGCCATTCAGGATTTGCAGGATCCACACGACTGACCAGAACACTATAGCGTTCCCATGCCTCCAGTCGACTACGCTCCTCATCTGTTGCCATATTCAGCCTGACCGCGCGCTCCAGCGGCAAAATCACGGATTCAGCTTCGGAAAGCAAAGCTGCCTTATGTAATTCCGCCAGTTTCTGCTGTTCGTCTGCCGTATAAATCCGCTTAATCACGGCACCATCCTTAAACATCCATTTACCTGAGTCGTCAGCACGTCGGTTGGAGGTAATATCAGGAACCTCAACAACGCTAAAACCTTCAGGATTAAGCGTTGAAGCATCTCTGGTGATAGCGACAATAATATTATTTTCATCGTAAACAATCTTTATTGTGTCTGGCTGAAAGTTCTTCACTTCCTCATACCAGTTTTTTCCGTCCTCAGAGTAAAGCCAGATAACTCCGTGTTTCTTTGTTAACTCATACTGTTCCAGTGTTTTAGCGTTACCCGCTTTTATGTTCTTTAAGTGCATCATATTAAACGCTCGCTACATTATACCAGGTGCCATTTATATACTTTTGAACGGGTCTGTAATAAACGCCCGCTATATTATCGGCAGAGTTGGACCCTGTATCCTGAACATTAATACCAGACAATACATGACCTGACGGGCACTGGAAATTCCATGTTTGCCAGTTGTTCACTCCATAATATTGCTGTGACCCAAGTCGAACATCTTTCACATATCTGGAATCAAAATTGCCATAGTTGCCAGGAATAACTTGCGAGCCACAAAGCCAGTTACCGTTATTATCCATGTACGCCTGACCATCGGTGCCATTGGCTGTCCTTGAGTTATTAATCATGTAGATGCCAAATTGCTTATTTCCCAGACCGCCAATCATAAATTTGCGGTCGGCATGGTCCTGACGGAGCAAAGCCTGAGCACCATCAGTGGATACCGCATTACGTCCAAAAATAACATTCTGGTCACGCATATGAATCCACATGCCTGTGCTGCTGTTAATTGCAAGGCGGTTTGCGTACACCCATGCGTTAGTTGTTATATCTCCTGTAACATCCAGAGCGTGCCCCATAGTTATGCGACCAGTTCTGAGATTCAACGAAAAGGGGCGTAGTGGCCCGATATCACCATTTTCCCCCTCATTCTCTCGTGTGGGGATAATATGCAGGCATTCTTCAGAACGGCGAAAAATAGCACCAAAGGATGAATTAAAGATTCTCAGTGCATTGACTGTCGATATTTTTACTTCACTGCTGAAAAGGGCTTTAACA
AGGACAGACAAAGCATCCCATTTAAGATTCATCAGGTCTTTTGTTGTGGTGCCCTGACGGCTTCTCCATTTGAAATATTCATTGCCGTTGTCGCCTGTTTCAAACCACATGTATGAATCAGTGTCACCATCGGCATCATTTTTAAATCCAATCTTCGCCCAGTCAGTATTTCGAATCCAGGCAAGGATTGAGTCGTTTTCAAAAGTAAGTCCACCGGACAAGGTATCGCCATTTTTTTGCACGGCGTTCCCGGCTCGGTTTACCGTTTCCTGTAAACCGAGATATTCGATAACGGCGGCAACGGTCGATTTAGCCAGAATACCGTGAGCGCGCTGGTTTCAGGTTCATACTCAATCACCGCCCCGTCAGGGAAACGGATATGCAGGGCATCCGCCGACGCAGACGGCGCGGGGTTATCGCCGGAATAAATCCCCGGCAGAACGAACGCCGTGTCGAGTTCACCACCCACGGCCAGAATCAGCACCTGTTCCCCCACGGAAGGTGCCCACCATGTGCGCGAACGCCCGGCGCGATGGGTCAGCCACTGAAGCCAGTCGGTGCACATGCCGCCGGTCTGCACACGGCAGCGACCGGCATTAAGGTCGGTTTCGACGATAATGCCGGTGCGGATCATGTTGCGCAGTGCGCGCGCGAGTTCCTGAATATTTGCGAGAGTGTTCATGCGTGTGAGATTGCACAATATATAAAAGTTATGCTATCTGGATTCATTTGTAGAACGACCATACAACATTCGAGGAGAGCGTAATGTTCAGTGATAATGTGACTAATGCGTGGTGGTTTATCTCTTTGTATCTATTTTTATTAATAGCATTAACATTTGTTACCTTTGGTAAAAGTAATCTTATGAGGTTTATTGCACATCATTTCAATCTTGAGTATTCAGACAGAAAGTTAAAAATGCTCGACAAAAAATGGCGCGACATTCAACTATTTAAAATAATTAACGGAATCAATGTATCAGGCATCGAAGATGTGAGAATGATACAGCAGGGGCTGATTGATGGAAAACTAAAAACATCGTATTTTTTCCTTACTCGCTTCTGGGGTGACATAACAAAACCACCACACATAATTAAAACAACAATTGTAATTCTGGCCAGTATTATTTATATTCTCTTCGCATGTTATATACACAATGAACAATCCGCTATAGTAAGGGATGCCATAGGTATACCATATAAAAATATGATGTACTATGTTTATAGTGACAAAGTTCTTTTATCCTTCAAAAATAAAACGGTTGAATTTAATAAAACTTATAGCCTTGCCGATTGCAAGAGTCTGCAAAACGTATTTATAAAAGACACACTTCCCGAGATCGCCTGCAATAAGCTCTTACAGCTAAACAAGGAGGACTCCGAATGGTTAAGTCAGGAGATTAAAGATAATAACAGCTACAGAAAAACATTATTAATAATGTCCCTAACCTATTTCATTTCAGGTCTGCTTATATTCCTGTCATATACAAAATTCCTTTACGCCAATAAGAAGGTTTTAGAATACAAAGCATCAAATAAAAACCACTCATAAACCTCTAAATATTGAGCGACCAGCACGGCCGCTCAATGCTTAATTGCGCATCAGCCTCTGCCTGGATAAAACTAACGCTCAAGGTGAGCCAGGATAATCTCTTCAATCATCTGCACATCCTCACCGGTAAAGCCGAGCAGAGGACGCGCCGGATAATCAATTTTCTTACCGTCTTTCCGGTTTTCTTCCGACAGACCGAACTGATGCACACTGGCGATTTTCGGTGACTTCCCGCCGTAAAACTCCATTGATGCCTGTTCCGGGCTGGCGCGGATATGCAAAAAACGACTGGTGATAAGTTTCGCAAACATTTTTCGCTTAACACGACCGGTCTTTTTTCTGGCGCTCTGCTGCTGGCGTGGCGCGTAGGGTGTGCCGTCCGGAGTTTTCTGTGCCATCACCCGACGCTGCTGACTCTGCCGCAGGCGTTTCGCCAGTTCGGCACTCAG
TCGCCGACGCCCTGACGGTGACAGCGATTCAATCAGTCCGGTCAGCCGGTCTTCAAAACGCTTAAACTCATTCATCCCACTTGCTCACCAGTTCGCCATTGATATACAGCTCCATCGGGCGGGTGACCGGCTCCGGCGGCGGAGGTTCCGGGATATTCTTCACATGCAGCGCGCCGTCCACCTCACTGACCAGCGTGCGCTCGGTCAGCATCAGGCTGATACTGATATCAAAGCTGCTGTCATTGTTGATGTCCGCATAAAACGTGAAGCCCTTTTTCTGGCCTGCGTCGGTGGTCATGATGTCGGGCTGATTTTCCCGCAGCCACGCTAGCACCGGCACGATGAGCAGGTCAAAATCACCGGTAAAGTCGGTCACAATGACATTGAGCGTGTAACGCTTTTCAAATGACAGCGACCTCGCCAGTGTGGAGGCAATACTCCCGTTATCCACGAATATCCGCAGCATCTCGGGACTGGTTTTCAGCACCGTGACAGCATCAGTCAGCGCCCTGCGCAGGCTGTCGGGTTTGAGCATCGTTTTCGTCCTGACAGTGTTTAATCATTTTTACCTGGCTGGCACAGCGTGCCAGCGCGTTCTCAAGCTGCCGGATATCAGCACTTAAATCGCCGTTCGTCTCCGGGTCACTGCCCGGCATCGGGCAAAGGCTCACTTTCGGGCAGGCGTTGTGGACAATCACTGGCGTCGGCGCAGGCCGGGCGCTGGTGCAACCGGCGCACAGCATCAGGCAGGTCAGCGCCGTACCAGCGGCGAAAATCTTCGTTTTCATTAAGTAACCTCGTGATGGTTTTCTCGCGCTGTGCTTCACGCTTCGCGGCGTTCTCCAGCTCCTGACGCAGTGCCACCTGCGCCAGCTCGTTTTTGTCTGCCCTGGTAAGGGCAACATGAAGCTGATTTTTCAGCATGGTGATGGTCGTCTGCTGTTCACTGGCGACGTTGTTCGCCCTGTCCAGCGAGGCGCGCAGGCTGGCATTTTTGTGTTTCACCAGAAACAGACCGGCCACCGCCAGTGATAACAACACGACCAGCACAATCATCAGCTTTGACATGGTTCCCGCCCCTCAAAACGCTGACAGCAGGCCGTACGTATCAGCCGGAAGAACACCGATGCCACGAGATAAATCAGCGCGGTAAAAATCCCCCCGGCAGCGACCAGCGAGATAAACGTCGCCACCATCACTACCAGAGCCACTGACCGCCTGCGCCACGGCACCGGCTGCAAAAACAGCGCCGTGACAATCTTCACGGCCAGCGATTCCGGCGGCAGCTCCCGCCCGTAACGTTCCAGTACATACTCAGTGGCATACACGCCGACACCACCGGCAACCACACAGATAACCGTCGCCAGAATCGCCCAGGCGGCGACAAAACTGACGGCCACGCTCTGCGGGTAAATCAGGGACAGTGCCAGCATCAGCGCCAGCGACACGTTCAGCATCAGTGAAAGGGATAATTTCTTCATGGTGTTTACTCCGTTTACCGGTCTGGCAAAAAGCCTGCGTGCTGCCGTGCATCACAGCTCACCGATTTACGTCAAACGTAACATTCTGGCCTCAACGTTTATCCCACACCCGTGGCTTTCTCAGCAGGATTTCAGCCGCTTTGTGCTGGATTTTCTGGTATTCGGTAATGCGTTTCTGGAAAAGCGTTACAGCACCACCGGTAAGGTCATCAGACTGGAAACCTCACCGGCAAAATATACCCGCCGTGGCGTGGAGGAGGATGTTTACTGGTGGGTGCCGTCCTTCAACGAGCCGACACCTTTCGCGCCCGGCTCCGTGTTTCACCTGCTGGAGCCGGATATTAATCAGGAGCTGTACGGTCTGCCGGAATATCTCAGCGCCCTTAACTCTGCCTGGCTGAATGAGTCGGCCACGCTGTTCCGCCGCAAGTATTACGAAAACGGCGCTCATGCCGGATATATCATGTACGTCACTGATGCCGTGCAGGATCGCAACGATATCGAAATGCTTCGCGAAAA
CATGGTGAAGTCGAAAGGCCGCAACAACTTTAAAAACCTGTTTCTCTATGCCCCGCAGGGGAAAGCTGACGGCATTAAAATTATCCCGCTCAGTGAAGTGGCAACGAAGGACGATTTTTTTAATATCAAAAAAGCCAGCGCCGCTGACCTGCTGGACGCGCACCGCATCCCCTTTCAGTTGATGGGCGGCAAGCCGGAGAACGTCGGGTCGCTGGGTGATATTGAGAAAGTAGCAAAGGTCTTTGTCCGCAATGAGCTTATCCCGTTACAGGACAGGATCCGCGAGATAAACGGCTGGCTCGGTCAGGAGGTCATCCGATTTAAAAACTACTCACTGGACACTGACAACGGCTGAACATCGCCGCCTGCGGGCGGCTTTTTTACACCCCGTCATCACGCCCTCACACGTTCGCCACTGTACAAAACACCCCGCAGACACACCAACGCCCCGGCAGGCCGACTAAACGCCATCACGACGCGCTCAGACGCTGAAAAAATAAAATCAGCACCACCGCCAGCGCGCAGTGCTTTCCCCGCCTCGCCCGCCCGCTTCATGGGTCGGTTTTGATGCAATTCCAAAAGCCGTCCAAACTCTCTTAGGCTAAATGTCCAACGAGAAAATAGTTCTTTGAATGTGAATGCATTTTAATGCAGAGTTATGCCCAGCATTTTTGTACACTTCGATGTATCAAATGCGCTGCAAACGATCAAATATGGATGTTTTATCAAGCATCCCCCAAAAGATATTTACATCATCCCATGAGGTTAAGATGGATAACAAAATCGTAGAAATTGAGACAAATAAGCTTGATTTTGACCCTAAAAACCCACGTTTCTTTCGTCTCAATGATGCCAGTAACGCTGCAACAGTCATTGAGGAAATGTTAGATGACGAAAGTGTCCACGATCTAATGCTATCAATCGGTCAGCAAGGTTACTTTCCTGGAGAACCTTTATTGGCAGTAAAAAGCAATGGAAACTACATCGTGGTTGAGGGAAACAGACGCTTAGCTGCTGTAAAGTTGCTCAATGGAGATCTGCTTCCTCCAAAAAGAAAACTTAAAGGTGTGCAAGAAATCATTGATGATACTACCAATAAACCTAAGAAGCTTCCCTGCATCATTTATGAAAACCGAGAGGATGTACTGAGATATATCGGTTATCGTCATATAACTGGGGTCAAAGAATGGGACTCATTATCTAAAGCCAAATACCTTAAAGAGTTATGTGATACTTTTTATTCACATGAGCCTAAAGAGATAGTATTAAAAAATCTGGCTCGTGAGATTGGGAGTAAACCACATTATGTTGCAACACTTCTCACTGCACTGAACTTATATGAAGTCGCGCATGACCATGAGTTTTTTAATTTACCCATGAAGGCTTCTGACGTGGAATTTTCATATATAACCACAGCTTTGGGATATTCAAAAATCACAAACTGGTTAGGTCTACAGGATAAAAAGGATTTTTTAGACCCAAATTTAAATGAAGAAAACCTTAAGCGTTTATTCTCTTGGTTTTTTGTGCCTGACCAACAAGGTAGAACCATCATCGGTGAGTCTCGAAGAATAAAAGATATTGCAGCAGTGGTTGAGAAACCCGAAGCAATTGAAATTCTCATGAAAAGTTCAAACTTGGATGAAGCATATCTATATACCAGCGGAGAAAGAGAAGCATTAGATAAAGCACTAAACGCAGCTAGTGTTAAATTAAGAGTAGTTTGGGATATGCTACTTAAAGCTAAAGAATTAACATTAGAGCATGAAGAGGCTGCATCTGAAATTTTTGAGATGTCAAAAAATATTAGAAATCAGATCAGAAGCAAAAGGGAGGATGATTGAGATTATGATTACAAATCTTGATTCAATGCCTTCTAATGAGCCTTATTTATGGGCTGATTATATTGAGATATTGGCCTTAACTAATATCGACAGGTCATTCAGTCGAGGAGACCTATATAGCACACTGCAAGCTCAACCCGAAGCAGTACTAGCTGA
AACAGATGAAGCAGAAGAAGAGGGCGTTTATGATGTTGATGATGAAAATGATACGCCTGTACGCAAGAGAACAAAACGAAGTGTTAGTCGAGCATATACTGACAGAAAGTGGAGCTATGCGATAGGCTTCATACGACAACGCATTGATTTATTTGGGGATAGTTACCCTTTTACTTTATCAGAAGACAACGATACTGTAGAGTTACGTGATATATCAGAAAAGCCACTGGAACATTTAGAAAGACTATATTTAGCTTTACTAATCTGTGCTAACATAAAATATGTCAACATAATGAGCAGAAGAGAGATAACGCGCAGTTTTGAACTAATTAGTTTACCTATTTTTGAAAGCCTAATGCCTAGCGGTAGCATAATAAAAGCATGCTGGGCTTCTGGTGGTCAAGCGGCCCCTTACACTGGAACTCTATATAATAAATTTAAGAGTATTGCTTCCGATATCCGTTGCACAGCGAACTTCAAAGAACGAGATTTCAGTCGAGGAAATAGTGGTGACGGAGGCCTTGACATAATTGCCTGGCATCCAATGGGAGATCAACGAGATGCCATCCCTATTTCTTTTGTTCAGTGTGGCTGTTCTCAAGAAGAGTGGGAAGCGAAGCAGCTTGAGGCCTCACCTGCGATGCTCTACAGTAAATTCCCCGTAGCTCACCGATGGGCAACTTATTATTTCTTACCTCAAGATCTACGATGGATAGATGGTGAGTGGGCGCATAAAAATAAGTTAGGCGATGCTATTTTTGTTGATCGCCTAAGATTAATCAATTTAACCAGAGCATCTGATAATATTGATCACAGTCAAAATATTAGCTATCTAGATATCATCCTTGATCCTTCCAGCGCGATCGCTGCTTAATCCCATAAATCTGGAAGGTTTCTAGCAACCGCCTCAAACAATGGAGGCGGTACTGCATTACCTACCACAGTATATTTCATATTAATAGAAGCCCGTTCAGTTTCTGGGAAAATTAAATCCCCAAAACCTTGTAAAATAGCAGCCTCTCGATAGCTAAACCTACGAGCTGGTGCATCCGAAGTAAATTGCCACTTATCAGGTCCCAATTTTTCTAATGTTGGACTTATTGGATGTAGAGGCATATGTCTAGGATTTGCAACAATTGTTTTAGATATCTGATCCCAATCTTGCCTACGGTTTCGCGATAGATAATACCAATGAAAATCGGCGTCATAAAACTCGCCAACAGGCCAAACAGGCATATGCCCAATAGCATCACGAATTGTGGAATATGGTGTCAAACCATCACCATGTGTTGGTTTTGGAAATTTGTATGTAATACCGTAGTCCTTTCGTATTCCTACGATAAAGATTCGCTTCCTATCTTGGGATACCCCATAATGGGACGCATTCAGAATTTGCGAGCTTACTGTATAACCTGCTTCTTCGAAAACTTTGAATTGATCCTTTAATAAATGCTCAAAGTTACGCCTTACCATACCAGAGACATTCTCTACAATGAATGCTTTTGGCTTAATTTTACTCAAAGCACGGGCAAACTCTAAATATAGTGTATTAATCTTTCTATCTGCCTTCCTTGCCCCACCTTGACTAAATCCTTGGCAAGGATAGCATCCGATGAGCAACTCAGCAGAAGGGAACGACTGGAGCCCTGAGATATCGCCCAAAATGTAGTCAGTTTCAGGATGGTTTTCTAAGTAAACGTCCCTTGCGTAAGGCAAAATATCATTTGCCATAAGCACATTGAAACCTGCGTTCAAAACTCCCGCATCAGAACCACCACATCCAGAAAAAAGTGAAACTACAGTTGGCATTGACCCCTCCTAAAAACCGACCGCGTATTATAGCGAAACACCCCGTTGGGAAAAGCTAGATTTTGCCAAGTCTTGATATTCTCACGTTTTAGTAGTTGTGGCCATCTTTAACGAGAAAAAGATAAAATTGACTTCTCATTAATTTTCAATAGGTTTAATTGTAAGCTCAAACTAACGCCTCGCGACACTCG
TTATTCAACCCCGCCAGCCCTGAAAACAAGTTTCACGACTGGCGGCGTTCTCTATCGTCTGCGTTGTGGTGGCGCAACTCTGGACTGACCGATATGGTTAAACCGCCCGTAATTATCCCGGACTATTTCGGCACACCCGACCAGCTCATCGGGCGTCAGATTTTCGTTGACCATAATCCGCTGTAAACGCTGAACAATAGCCATCAGCTTGATATTTTTAGTTTTATGGTGCGGTATCTCGCCTGGTATTCTGTGCATTATCCAAGCCACCCGTTTTGCTGTGCACGCTCCATCTGTTCATCTGAATAGTTCCATGCTCCATCCGTGGCAACCATTGCCCCGCCAGACATCCCCGTCTCTGGTTCATACATAACAGCAAGGCCGAGCTGATGCATAATTTCATGATTAATTCTGAATACCAGACCACGCTCACTAAGTTCTTTCCAGTTCACAATCTCACATGCGCCTGTATTAAGCCGCTCAATACTTAGCAAGACATAATCTTCCAGCCAGTCTGACAGGTCAGTAACATCTGTTATCCGGGCTTCAACCTTTCGCCCCGTATACACACCCTGCACCCATTCATGCAAAATCAACGTGTCCCCGCGCTCATAATTACGGTCATTTTTCCGAAACTCTGCGCGTTTCTTTCCTTCCAGCACAAGGTCAAAATATTTTGCGTGCAGCTTTACCTCGTGAATTTTTGCCATTATGTCCACTCCATTACTGTTGAGAATCCCGGCCACTCATCAGCGACCGGATACGTGAATTTTTTCCCGTCATAATTTACGGTCGCTCCACGCGCCAGCGCCTCAAGCTCCCATCGCTGAGGCCTGATACCGTTCTGAGCAAGGTCAACGCGGATACGGGTGATTTGCATTCGTTCCGACCGGGTCAGTCTGGCCGACGGTGCAATTTCATGTGTTTTTAACGGGCTTCCGTTTCTTTGTTGACGATTTGGTGTTTTCAGGCCGTGTTTTAATGCTCCCCTGAGCGCCCTCACGACCTCCGGGTCATTCCATTCGATAACACCGTCATCAACCAGATTTAGCACTGCTGCGGCGTGCTCAGAAGGTGTGGGAGCCGGTAACGAAGCATCACTACCGGTGAGCTTTGTGATGTGGCGAATCATGGCGTCAAGATGAGAAGAAAAGCGCGTTGCTGCGTCGGCCTGTGCTTCGGTTCTGACCTGTTGCAGCAGTAATGCGTATTTACCGCACTGATTTTCAGAAACTGTATGCATGACTTTCTCCAGGCAAAAAGAAGCCCCGCACAATTAAGTGCGTTAAAAACTCTGGTTAATTACTTAATGCAGATATTGCTCTGGTTTTACCGACGTCAGGATTGTCGGTGCATACTCAAACAGGCTGAATAATTCACGTAATGCACGGAATAAGGCATCACGCCAGTAACATAATTCTTCATTAATTCGCCAGTATGGCTGGTTGAATTCTTTTTCAGTCAATCCGGCATGCATAAATAAAGTACGGCGCTGACTGACAGTTAAAAAGCTAATATATGCATACTCACTTGCACCGACCTGACGGCGTTTTGAGAATGCCCCACGCAATTCATCAATTGCACATACCAGTCGTTCACGTTCGACGTCGTTCATTTCTTCAAAACGCATCGTTGCGTGACGCTGTTTTAACTGTGCATGAAAGCAAACCGTTAGCCGTTCGCGCTCCATCATCTGATTATAATAATCACATGTATCCTGCCAGCGAGGGACGGCAAGATGCTTGCCAATTATCCGGCGCATAGCTGCTGGCTGTTTTTCAACGAGATTGAGCGTCATCACTGTCATTTCCAGCCCCTCCGGCTTTTCAGAAAGGTCAGAGCCTTTTTTAACGGACTCTGTTTTTTGGTGCGGATAATGATTCCCTTACGCCCCTTACCGTGGGTGATGGTGAAGTCAATCGCCCTGGGGCTTTCGTTACGCAGTAACTGAGCAATACAACGCGGTTCACTCATAATCACAACCCCATCCACAAAAGCCA
TGCATCACGCTGTTCAACTGGTCGGTTATAAAACGCCTCTCGTACAGCGCGATTAAACTCTGGAATGAAAACCCACTTCTCACCGACACGAGCGTTCGGCTTACTTGGATCACGAAGCTCAATAACTGGCAACTTATTCTCTTTTACCATCTTGACTACAGCCGTTTCTGGCTTACCAAGTAACTCTGCAAACTTAACCGTATGTACCGCATCAATCGGGTACTGAATCACATAGTCATTGACTTCCATTGATTAGCCCTTTTTGCTTTCGTGTTACCCTTATTAGATCCAGTCCCTTCTAGGTCGCACCTGTCCTTTCTAGGGACTGGCTAACACACTCAAAAGGTCACCAATACACAACCTTTTGACGGGAATATAAGTCACCAATAGGTTACTGTCAAATGCAGACATTCGAAAAACTGAAAGCGATTAGGAAAGCAGAAGGCTTAACACAGGCGAAATTCAGCGAAATTAGCGGGATAGCTCTAGGAACAGTCAAAAATTACGAAAGTGGGCATAAAGACCCTGGTCTCAGCATCGTTATGCGAGTCACAAATACGCCTTTATTTAAAAAATATACGCTCTGGTTAATGACTGGTGATACGTCACCACAAGCTGGTCAGATCGCGCCGGCTCTCGCACACATTGGGCAAAAACCAACAGAATCAGACCACTCCGAAAAACAGACTGGTTAACACTCTATAAACATTACATTTTCACCATTTGTTACCAAGATGGTGAATACAGCGTCAGAGGGCTTTCTTATGTCAATTAAGAAGCTCGATGATGGACGCTATGAAGTGGACATTAGACCTCGCGGTCGCGACGGAAAACGCATCCGCAGGAAATTTGAAAGAAAAGCTGAGGCTGTAGCATTTGAGCGATACACAATCGCCTACGCCAGCCAGAAAGAATGGGCAGGTCAGCGAGCAGATCGCAGAACTTTGAGTGAGTTGCTGAACATCTGGTGGAAATATCACGGGCAAAACCACGAGCATGGAACAAAAGAGTTTAATCATCTGCTCAAAACCATCAGCGGCATAGGTGATATACCAGTGAGCCGGATGAGCAAAAGAGCTTTGATGGATTATCGTTCCATGCGACTACGTGATGGTATCAGTGCCGCAACGATAAACCGTGACATGTACCGATTATCCGGCATGTTCACAAAATTAATTCAATTGGATGAATTTTCCGGGCAACACCCAATTCACGGACTGCCGCCACTGGCGGAGGCCAACCCTGAAATGACGTTCCTGGAAAAAGCAGAAATCGAAAAACTGTTAAATGTTTTGGATGGTGATGACTTACTTGTCGCACTTTTATGTCTGAGCACTGGAGGAAGATGGACGGAAGTTGCCACGCTAAAACCAGCACAGATTACAAATTGCAGGGTTACCTTCCTGAAAACCAAAAACGGTAAAAAGCGAACCGTGCCGATTTCTGAGGAACTGGAGAAAAAAGTTAAAGAGGAGGCCAGCGCTAAATTATTCAAAGTTGATTATGAGAAGTTTTGCGGGATTTTACGCAGAGTGAAGCCAGATATACCACCCAATCAGGCAACCCACATCCTGCGGCATACATTCGCAAGCCATTTCATGATGAATGGGGGCAATATAATCGCACTGCAACAGATTCTGGGACATGCGAGCATTCAGCAGACGATGGCCTATGCGCACCTTGCGCCTGACTACCTGCAAAATGCCGTCGCGCTGAATCCTCTAAAAGGCGGAGTGACGTTATAAATTTCCCTTCTGAGTGTCCACATAGTGTCCACACTCTCAGAACTTTGTAGCCCTTCCAGTCCCTTATAGGTTTTCTTAAGTTACTGTTTTCTTACGGAAACCGATGTAAGTGATTGATAAAAAAAACCCCCACATCATGTGGGGGAAGACAGGGATGGTGTCTATGGCAAGGAAAACAGGGTTTACTACTGGGAACGTGAGTTGCTACTACTCAATAGCTTCAACGATGAACTTTTTTGCCATTGCGTCACGTCG
CGCAACTGCTCCATTCGTTGTTGATGTTTCTCGTTTAAAACCGCTTGCTGCTCCGGCGTTAACAGGCGATACATTTGGTTGCGGACTTTTGCCATCTCAACCTGACGAGCAATTTGCTCATTCGCCATTTTTTCTGCCTGTGCGCGCACAGCGTTTTCATCAAAATTTTCTGCGGTGACAAGGCGATGCATTGTCTCCAGTTCGCTAACATTAACAGGAGGCTGTTCGTGCCGGGCCTGTTGCATAAGATCTCGCATCTGCTGACGCTGATGTTCGGTTAAACTTATGCCGTCGAACATATGGCTCTGCGTACTGCGCTGCGTAAGTTCTTCACCCGGATGCCAGTTATCGCCTGAACCGACTTCAGCAGCGTGGCTTAATGAACTGACTGCCAGCGTTGAGGCCATGACGGCAGCGGTAACTATGCGCATCATTTGCTCCCAAAATCTTTCTGTCGCGATTCAACGATAGAGAGTTTACGATTCAGGCTGCAAACATGCGTCAGGGGGTGTAAAACAACGTAAAGTCATGGATTAGCGACGTCTGATGACGTAATTTCTGCCTCGGAGGTATTTAAACAATGAATAAAATCCTGTTAGTTGATGATGACCGAGAGCTGACTTCCCTATTAAAGGAGCTGCTCGAGATGGAAGGCTTCAACGTGATTGTTGCCCACGATGGGGAACAGGCGCTTGATCTTCTGGACGACAGCATTGATTTACTTTTGCTTGATGTAATGATGCCGAAGAAAAATGGTATCGACACATTAAAAGCACTTCGCCAGACACACCAGACGCCTGTCATTATGTTGACGGCGCGCGGCAGCGAACTTGATCGCGTTCTCGGCCTTGAGCTGGGCGCAGATGACTATCTCCCGAAACCGTTTAATGATCGTGAGCTGGTGGCACGTATTCGCGCGATCCTGCGCCGTTCGCACTGGAGCGAGCAACAGCAAAACAACGACAACGGTTCACCGACACTGGAAGTTGATGCCTTAGTGCTGAATCCAGGCCGTCAGGAAGCCAGCTTCGACGGGCAAACGCTGGAGTTAACCGGCACTGAGTTTACCCTGCTCTATTTGCTGGCACAGCATCTGGGTCAGGTGGTTTCCCGTGAACATTTAAGCCAGGAAGTGCTGGGCAAACGCCTGACGCCTTTTGACCGCGCTATCGATATGCACATTTCCAACCTGCGTCGTAAACTGCCGGATCGTAAAGATGGTCACCCGTGGTTTAAAACCTTGCGTGGTCGCGGCTATCTGATGGTTTCTGCTTCATGATAGGCAGCTTAACCGCGCGCATCTTCGCCATCTTCTGGCTGACGCTGGCGCTGGTGTTGATGTTGGTTTTGATGTTACCCAAGCTCGATTCACGCCAGATGACCGAGCTTCTGGATAGCGAACAGCGTCAGGGGCTGATGATTGAGCAGCATGTTGAAGCGGAACTGGCGAACGATCCGCCCAACGATTTAATGTGGTGGCGGCGTCTGTTTCGGGCGATTGATAAGTGGGCACCGCCAGGACAGCGTTTGTTATTGGTGACCACCGAAGGCCGCGTGATCGGCGCTGAACGCAGCGAAATGCAGATCATTCGTAACTTTATTGGTCAGGCCGATAACGCCGATCATCCGCAGAAGAAAAAGTATGGCCGCGTGGAACTGGTCGGTCCGTTCTCCGTGCGTGATGGCGAAGATAATTACCAACTTTATCTGATTCGTCCGGCCAGCAGTTCTCAATCCGATTTCATTAACTTACTGTTTGACCGCCCGTTATTACTGCTGATTGTCACCATGTTGGTCAGTACGCCGCTGCTGTTGTGGTTGGCCTGGAGTCTGGCAAAACCGGCGCGTAAGCTGAAAAACGCTGCCGATGAAGTTGCCCAGGGAAACTTACGCCAGCACCCGGAACTGGAAGCGGGGCCACAGGAATTCCTTGCCGCAGGTGCCAGTTTTAACCAGATGGTCACCGCGCTGGAGCGTATGATGACCTCCCAGCAGCGTCT
GCTTTCTGATATCTCTCACGAGCTGCGCACCCCACTGACGCGTCTGCAACTGGGTACGGCGTTACTGCGCCGTCGTAGCGGTGAAAGCAAGGAACTGGAGCGTATTGAAACCGAAGCGCAACGTCTGGACAGCATGATCAACGATCTGTTGGTGATGTCACGTAATCAGCAAAAAAACGCGCTGGTTAGCGAAACCATCAAAGCCAACCAGTTGTGGAGTGAAGTGCTGGATAACGCGGCGTTCGAAGCCGAGCAAATGGGCAAGTCGTTGACGGTTAACTTCCCGCCTGGGCCGTGGCCGCTGTACGGCAACCCAAACGCCCTGGAAAGTGCGCTGGAAAACATTGTTCGTAATGCTCTGCGTTATTCCCATACGAAGATTGAAGTGGGCTTTGCGGTAGATAAAGACGGTATCACCATTACGGTGGACGACGATGGTCCTGGCGTTAGCCCGGAAGATCGCGAACAGATTTTCCGTCCGTTCTATCGGACCGATGAAGCACGCGATCGTGAATCTGGCGGTACAGGTTTGGGGCTGGCGATTGTTGAAACCGCCATTCAGCAGCATCGTGGCTGGGTGAAGGCAGAAGACAGCCCGCTGGGCGGTTTACGGCTGGTGATTTGGTTGCCGCTGTATAAGCGGAGTTAA
Protein sequences of DBSCAN-SWA_10 >CP034958|4505967:4540540|4513781_4514528_+|QAS87380.1|DBSCAN-SWA MADWVTGKVTKVQNWTDALFSLTVHAPVLPFTAGQFTKLGLEIDGERVQRAYSYVNSPDNPDLEFYLVTVPDGKLSPRLAALKPGDEVQVVSEAAGFFVLDEVPDCETLWMLATGTAIGPYLSILQLGKDLDRFKNLVLVHAARYAADLSYLPLMQELEKRYEGKLRIQTVVSRETAAGSLTGRIPALIESGELESAIGLPMNKETSHVMLCGNPQMVRDTQQLLKETRQMTKHLRRRPGHMTAEHYW >CP034958|4505967:4540540|4516039_4516639_+|QAS87384.1|DBSCAN-SWA MKPGCTLFFLLCSALTVTTTAHAQTPDTATTAPYLLAGAPTFDLSISQFREDFNSQNPSLPLNEFRAIDSSPDKANLTRAASKINENLYASTALERGTLKIKSIQMTWLPIQGPEQKAAKAKAQEYMAAVIRTLTPLMTKTQSQKKLQSLLTAGKNKRYYTETEGALRYVVADNGEKGLTFAVEPIKLALSESLEGLNK >CP034958|4505967:4540540|4530284_4530470_-|QAS87399.1|DBSCAN-SWA MELHQNRPMKRAGEAGKALRAGGGADFIFSASERVVMAFSRPAGALVCLRGVLYSGERVRA >CP034958|4505967:4540540|4528388_4528562_-|QAS87778.1|lysis|DBSCAN-SWA MSLCPMPGSDPETNGDLSADIRQLENALARCASQVKMIKHCQDENDAQTRQPAQGAD >CP034958|4505967:4540540|4524602_4525130_-|QAS87392.1|tail|DBSCAN-SWA MMHLKNIKAGNAKTLEQYELTKKHGVIWLYSEDGKNWYEEVKNFQPDTIKIVYDENNIIVAITRDASTLNPEGFSVVEVPDITSNRRADDSGKWMFKDGAVIKRIYTADEQQKLAELHKAALLSEAESVILPLERAVRLNMATDEERSRLEAWERYSVLVSRVDPANPEWPEMPQ >CP034958|4505967:4540540|4530661_4531735_+|QAS87400.1|DBSCAN-SWA MDNKIVEIETNKLDFDPKNPRFFRLNDASNAATVIEEMLDDESVHDLMLSIGQQGYFPGEPLLAVKSNGNYIVVEGNRRLAAVKLLNGDLLPPKRKLKGVQEIIDDTTNKPKKLPCIIYENREDVLRYIGYRHITGVKEWDSLSKAKYLKELCDTFYSHEPKEIVLKNLAREIGSKPHYVATLLTALNLYEVAHDHEFFNLPMKASDVEFSYITTALGYSKITNWLGLQDKKDFLDPNLNEENLKRLFSWFFVPDQQGRTIIGESRRIKDIAAVVEKPEAIEILMKSSNLDEAYLYTSGEREALDKALNAASVKLRVVWDMLLKAKELTLEHEEAASEIFEMSKNIRNQIRSKREDD >CP034958|4505967:4540540|4520882_4521785_-|QAS87389.1|DBSCAN-SWA MNQSYGRLVSRAAIAATAMASLLLLIKIFAWWYTGSVSILAALVDSLVDIGASLTNLLVVRYSLQPADDNHSFGHGKAESLAALAQSMFISGSALFLFLTGIQHLVSPTPMTDPGVGVIVTIVALICTIILVSFQRWVVRRTQSQAVRADMLHYQSDVMMNGAILLALGLSWYGWHRADALFALGIGIYILYSALRMGYEAVQSLLDRALPDEERQEIIDIVTSWPGVSGAHDLRTRQSGPTRFIQIHLEMEDSLPLVQAHMVADQVEQAILRRFPGSDVIIHQDPCSVVPREGKRSMLS >CP034958|4505967:4540540|4536655_4537636_+|QAS87408.1|integrase|DBSCAN-SWA 
MSIKKLDDGRYEVDIRPRGRDGKRIRRKFERKAEAVAFERYTIAYASQKEWAGQRADRRTLSELLNIWWKYHGQNHEHGTKEFNHLLKTISGIGDIPVSRMSKRALMDYRSMRLRDGISAATINRDMYRLSGMFTKLIQLDEFSGQHPIHGLPPLAEANPEMTFLEKAEIEKLLNVLDGDDLLVALLCLSTGGRWTEVATLKPAQITNCRVTFLKTKNGKKRTVPISEELEKKVKEEASAKLFKVDYEKFCGILRRVKPDIPPNQATHILRHTFASHFMMNGGNIIALQQILGHASIQQTMAYAHLAPDYLQNAVALNPLKGGVTL >CP034958|4505967:4540540|4535867_4536140_-|QAS87406.1|DBSCAN-SWA MEVNDYVIQYPIDAVHTVKFAELLGKPETAVVKMVKENKLPVIELRDPSKPNARVGEKWVFIPEFNRAVREAFYNRPVEQRDAWLLWMGL >CP034958|4505967:4540540|4509494_4509740_-|QAS87376.1|DBSCAN-SWA MTMSLEVFEKLEAKVQQAIDTITLLQMEIEELKEKNNSLSQEVQNAQHQREELERENNHLKEQQNGWQERLQALLGRMEEV >CP034958|4505967:4540540|4518430_4519420_-|QAS87387.1|DBSCAN-SWA MNKWGVGLTFLLAATSVMAKDIQLLNVSYDPTRELYEQYNKAFSAHWKQQTGDNVVIRQSHGGSGKQATSVINGIEADVVTLALAYDVDAIAERGRIDKEWIKRLPDNSAPYTSTIVFLVRKGNPKQIHDWNDLIKPGVSVITPNPKSSGGARWNYLAAWGYALHHNNNDQAKAQDFVRALYKNVEVLDSGARGSTNTFVERGIGDVLIAWENEVLLAANELGKDKFEIVTPSESILAEPTVSVVDKVVEKKGTKEVAEAYLKYLYSPEGQEIAAKNYYRPRDAEVAKKYENAFPKLKLFTIDEEFGGWTKAQKEHFANGGTFDQISKR >CP034958|4505967:4540540|4535197_4535698_-|QAS87405.1|DBSCAN-SWA MTVMTLNLVEKQPAAMRRIIGKHLAVPRWQDTCDYYNQMMERERLTVCFHAQLKQRHATMRFEEMNDVERERLVCAIDELRGAFSKRRQVGASEYAYISFLTVSQRRTLFMHAGLTEKEFNQPYWRINEELCYWRDALFRALRELFSLFEYAPTILTSVKPEQYLH >CP034958|4505967:4540540|4528533_4528959_-|QAS87397.1|lysis|DBSCAN-SWA MSKLMIVLVVLLSLAVAGLFLVKHKNASLRASLDRANNVASEQQTTITMLKNQLHVALTRADKNELAQVALRQELENAAKREAQREKTITRLLNENEDFRRWYGADLPDAVRRLHQRPACADASDCPQRLPESEPLPDAGQ >CP034958|4505967:4540540|4526656_4527442_+|QAS87394.1|DBSCAN-SWA MFSDNVTNAWWFISLYLFLLIALTFVTFGKSNLMRFIAHHFNLEYSDRKLKMLDKKWRDIQLFKIINGINVSGIEDVRMIQQGLIDGKLKTSYFFLTRFWGDITKPPHIIKTTIVILASIIYILFACYIHNEQSAIVRDAIGIPYKNMMYYVYSDKVLLSFKNKTVEFNKTYSLADCKSLQNVFIKDTLPEIACNKLLQLNKEDSEWLSQEIKDNNSYRKTLLIMSLTYFISGLLIFLSYTKFLYANKKVLEYKASNKNHS >CP034958|4505967:4540540|4511033_4512542_+|QAS87378.1|DBSCAN-SWA 
MTEKKYIVALDQGTTSSRAVVMDHDANIISVSQREFEQIYPKPGWVEHDPMEIWATQSSTLVEVLAKADISSDQIAAIGITNQRETTIVWEKETGKPIYNAIVWQCRRTAEICEHLKRDGLEDYIRSNTGLVIDPYFSGTKVKWILDHVEGSRERARRGELLFGTVDTWLIWKMTQGRVHVTDYTNASRTMLFNIHALDWDDKMLEVLDIPREMLPEVRRSSEVYGQTNIGGKGGTRIPISGIAGDQQAALFGQLCVKEGMAKNTYGTGCFMLMNTGEKAVKSENGLLTTIACGPTGEVNYALEGAVFMAGASIQWLRDEMKLINDAYDSEYFATKVQNTNGVYVVPAFTGLGAPYWDPYARGAIFGLTRGVNANHIIRATLESIAYQTRDVLEAMQADSGIRLHALRVDGGAVANNFLMQFQSDILGTRVERPEVREVTALGAAYLAGLAVGFWQNLDELQEKAVIEREFRPGIETTERNYRYAGWKKAVKRAMAWEEHDE >CP034958|4505967:4540540|4516746_4517514_+|QAS87385.1|DBSCAN-SWA MRHPLVMGNWKLNGSRHMVHELVSNLRKELAGVAGCAVAIAPPEMYIDMAKREAEGSHIMLGAQNVDLNLSGAFTGETSAAMLKDIGAQYIIIGHSERRTYHKESDELIAKKFAVLKEQGLTPVLCIGETEAENEAGKTEEVCARQIDAVLKTQGAAAFEGAVIAYEPVWAIGTGKSATPAQAQAVHKFIRDHIAKVDANIAEQVIIQYGGSVNASNAAELFAQPDIDGALVGGASLKADAFAVIVKAAEAAKQA >CP034958|4505967:4540540|4539166_4540540_+|QAS87410.1|DBSCAN-SWA MIGSLTARIFAIFWLTLALVLMLVLMLPKLDSRQMTELLDSEQRQGLMIEQHVEAELANDPPNDLMWWRRLFRAIDKWAPPGQRLLLVTTEGRVIGAERSEMQIIRNFIGQADNADHPQKKKYGRVELVGPFSVRDGEDNYQLYLIRPASSSQSDFINLLFDRPLLLLIVTMLVSTPLLLWLAWSLAKPARKLKNAADEVAQGNLRQHPELEAGPQEFLAAGASFNQMVTALERMMTSQQRLLSDISHELRTPLTRLQLGTALLRRRSGESKELERIETEAQRLDSMINDLLVMSRNQQKNALVSETIKANQLWSEVLDNAAFEAEQMGKSLTVNFPPGPWPLYGNPNALESALENIVRNALRYSHTKIEVGFAVDKDGITITVDDDGPGVSPEDREQIFRPFYRTDEARDRESGGTGLGLAIVETAIQQHRGWVKAEDSPLGGLRLVIWLPLYKRS >CP034958|4505967:4540540|4523487_4524387_-|QAS87391.1|DBSCAN-SWA MSGIRITQAIDNNQILVNVNTFNYSVNPYPLKCPDPNCSAHLVYVKSHIRRSFNKTLHIPAFFRLEKNFTHDKLCQYGTSGLNTIYASDSSHDISRALATGNNIFRIHILDEDDISKLSRKAAAMQANPPSDTTDRVYVKRGRKAPYVKNMDSLREIYEYGKAHPNKRNSIKIVTGTSTVTWSDFFYETTQLERLSTYLQKVKIAQVAVIMKVHVARIPMAKFNYRHFIEGSPMRIKGGFNIYPTIQLGNVTPKLFPLSSIVMVLGKFTIPTNRKMIMDPIFEREVRTIVSSEEQVLIL >CP034958|4505967:4540540|4514532_4514961_-|QAS87381.1|DBSCAN-SWA MAYKHIGVAISGNEEDALLVNKALELARHNDAHLTLIHIDDGLSELYPGIYFPATEDILQLLKNKSDNKLYKLTKNIQWPKTKLRIERGEMPETLLEIMQKEQCDLLVCGHHHSFINRLMPAYRGMINKMSADLLIVPFIDK >CP034958|4505967:4540540|4538471_4539170_+|QAS87409.1|DBSCAN-SWA 
MNKILLVDDDRELTSLLKELLEMEGFNVIVAHDGEQALDLLDDSIDLLLLDVMMPKKNGIDTLKALRQTHQTPVIMLTARGSELDRVLGLELGADDYLPKPFNDRELVARIRAILRRSHWSEQQQNNDNGSPTLEVDALVLNPGRQEASFDGQTLELTGTEFTLLYLLAQHLGQVVSREHLSQEVLGKRLTPFDRAIDMHISNLRRKLPDRKDGHPWFKTLRGRGYLMVSAS >CP034958|4505967:4540540|4528946_4529372_-|QAS87398.1|DBSCAN-SWA MKKLSLSLMLNVSLALMLALSLIYPQSVAVSFVAAWAILATVICVVAGGVGVYATEYVLERYGRELPPESLAVKIVTALFLQPVPWRRRSVALVVMVATFISLVAAGGIFTALIYLVASVFFRLIRTACCQRFEGREPCQS >CP034958|4505967:4540540|4512674_4513685_+|QAS87379.1|DBSCAN-SWA MRRELAIEFSRVTESAALAGYKWLGRGDKNTADGAAVNAMRIMLNQVNIDGTIVIGEGEIDEAPMLYIGEKVGTGRGDAVDIAVDPIEGTRMTAMGQANALAVLAVGDKGCFLNAPDMYMEKLIVGPGAKGTIDLNLPLADNLRNVAAALGKPLSELTVTILAKPRHDAVIAEMQQLGVRVFAIPDGDVAASILTCMPDSEVDVLYGIGGAPEGVVSAAVIRALDGDMNGRLLARHDVKGDNEENRRIGEQELARCKAMGIEAGKVLRLGDMARSDNVIFSATGITKGDLLEGISRKGNIATTETLLIRGKSRTIRRIQSIHYLDRKDPEMQVHIL >CP034958|4505967:4540540|4510165_4511011_+|QAS87377.1|DBSCAN-SWA MSQTSTLKGQCIAEFLGTGLLIFFGVGCVAALKVAGASFGQWEISVIWGLGVAMAIYLTAGVSGAHLNPAVTIALWLFACFDKRKVIPFIVSQVAGAFCAAALVYGLYYNLFFDFEQTHHIVRGSVESVDLAGTFSTYPNPHINFVQAFAVEMVITAILMGLILALTDDGNGVPRGPLAPLLIGLLIAVIGASMGPLTGFAMNPARDFGPKVFAWLAGWGNVAFTGGRDIPYFLVPLFGPIVGAIVGAFAYRKLIGRHLPCDICVVEEKETTTPSEQKASL >CP034958|4505967:4540540|4514987_4515287_-|QAS87382.1|DBSCAN-SWA MKDVVDKCSTKGCAIDIGTVIDNDNCTSKFSRFFATREEAESFMTKLKELAAAASSADEGASVAYKIKDLEGQVELDAAFTFSCQAEMIIFELSLRSLA >CP034958|4505967:4540540|4515498_4515939_-|QAS87383.1|DBSCAN-SWA MTIQQWLFSFKGRIGRRDFWIWIGLWFAGMLVLFSLAGKNLLDIQTAAFCLVCLLWPTAAVTVKRLHDRGRSGAWAFLMIVAWMLLAGNWAILPGVWQWAVGRFVPTLILVMMLIDLGAFVGTQGENKYGKDTQDVKYKADNKSSN >CP034958|4505967:4540540|4522021_4522276_-|QAS87390.1|DBSCAN-SWA MALSTKPEEVLAMFHCPLCQHAAHARTSRYITDTTKERYHQCQNVNCSATFITYESVQRYIVKPGEVHAVRPHPLPSGQQIMWM >CP034958|4505967:4540540|4536292_4536586_+|QAS87407.1|DBSCAN-SWA MQTFEKLKAIRKAEGLTQAKFSEISGIALGTVKNYESGHKDPGLSIVMRVTNTPLFKKYTLWLMTGDTSPQAGQIAPALAHIGQKPTESDHSEKQTG >CP034958|4505967:4540540|4508924_4509410_+|QAS87375.1|DBSCAN-SWA 
MKYDTSELCDIYQEDVNVVEPLFSNFGGRASFGGQIITVKCFEDNGLLYDLLEQNGRGRVLVVDGGGSVRRALVDAELARLAVQNEWEGLVIYGAVRQVDDLEELDIGIQAMAAIPVGAAGEGIGESDVRVNFGGVTFFSGDHLYADNTGIILSEDPLDIE >CP034958|4505967:4540540|4537821_4538322_-|QAS87779.1|DBSCAN-SWA MRIVTAAVMASTLAVSSLSHAAEVGSGDNWHPGEELTQRSTQSHMFDGISLTEHQRQQMRDLMQQARHEQPPVNVSELETMHRLVTAENFDENAVRAQAEKMANEQIARQVEMAKVRNQMYRLLTPEQQAVLNEKHQQRMEQLRDVTQWQKSSSLKLLSSSNSRSQ >CP034958|4505967:4540540|4532761_4533700_-|QAS87402.1|DBSCAN-SWA MPTVVSLFSGCGGSDAGVLNAGFNVLMANDILPYARDVYLENHPETDYILGDISGLQSFPSAELLIGCYPCQGFSQGGARKADRKINTLYLEFARALSKIKPKAFIVENVSGMVRRNFEHLLKDQFKVFEEAGYTVSSQILNASHYGVSQDRKRIFIVGIRKDYGITYKFPKPTHGDGLTPYSTIRDAIGHMPVWPVGEFYDADFHWYYLSRNRRQDWDQISKTIVANPRHMPLHPISPTLEKLGPDKWQFTSDAPARRFSYREAAILQGFGDLIFPETERASINMKYTVVGNAVPPPLFEAVARNLPDLWD >CP034958|4505967:4540540|4519739_4520702_-|QAS87388.1|DBSCAN-SWA MIKKIGVLTSGGDAPGMNAAIRGVVRSALTEGLEVMGIYDGYLGLYEDRMVQLDRYSVSDMINRGGTFLGSARFPEFRDENIRAVAIENLKKRGIDALVVIGGDGSYMGAMRLTEMGFPCIGLPGTIDNDIKGTDYTIGFFTALSTVVEAIDRLRDTSSSHQRISVVEVMGRYCGDLTLAAAIAGGCEFVVVPEVEFSREDLVNEIKAGIAKGKKHAIVAITEHMCDVDELAHFIEKETGRETRATVLGHIQRGGSPVPYDRILASRMGAYAIDLLLAGYGGRCVGIQNEQLVHHDIIDAIENMKRPFKGDWLDCAKKLY >CP034958|4505967:4540540|4506507_4507839_+|QAS87373.1|DBSCAN-SWA MSEMTPREIVSELDKHIIGQDNAKRSVAIALRNRWRRMQLNEELRHEVTPKNILMIGPTGVGKTEIARRLAKLANAPFIKVEATKFTEVGYVGKEVDSIIRDLTDAAVKMVRVQAIEKNRYRAEELAEERILDVLIPPAKNNWGQTEQQQEPSAARQAFRKKLREGQLDDKEIEIDLAAAPMGVEIMAPPGMEEMTSQLQSMFQNLGGQKQKARKLKIKDAMKLLIEEEAAKLVNPEELKQDAIDAVEQHGIVFIDEIDKICKRGESSGPDVSREGVQRDLLPLVEGCTVSTKHGMVKTDHILFIASGAFQIAKPSDLIPELQGRLPIRVELQALTTSDFERILTEPNASITVQYKALMATEGVNIEFTDSGIKRIAEAAWQVNESTENIGARRLHTVLERLMEEISYDASDLSGQNITIDADYVSKHLDALVADEDLSRFIL >CP034958|4505967:4540540|4534148_4534601_-|QAS87404.1|DBSCAN-SWA MAKIHEVKLHAKYFDLVLEGKKRAEFRKNDRNYERGDTLILHEWVQGVYTGRKVEARITDVTDLSDWLEDYVLLSIERLNTGACEIVNWKELSERGLVFRINHEIMHQLGLAVMYEPETGMSGGAMVATDGAWNYSDEQMERAQQNGWLG >CP034958|4505967:4540540|4525131_4526133_-|QAS87393.1|tail|DBSCAN-SWA 
MQKNGDTLSGGLTFENDSILAWIRNTDWAKIGFKNDADGDTDSYMWFETGDNGNEYFKWRSRQGTTTKDLMNLKWDALSVLVKALFSSEVKISTVNALRIFNSSFGAIFRRSEECLHIIPTRENEGENGDIGPLRPFSLNLRTGRITMGHALDVTGDITTNAWVYANRLAINSSTGMWIHMRDQNVIFGRNAVSTDGAQALLRQDHADRKFMIGGLGNKQFGIYMINNSRTANGTDGQAYMDNNGNWLCGSQVIPGNYGNFDSRYVKDVRLGSQQYYGVNNWQTWNFQCPSGHVLSGINVQDTGSNSADNIAGVYYRPVQKYINGTWYNVASV >CP034958|4505967:4540540|4531727_4532765_+|QAS87401.1|DBSCAN-SWA MIEIMITNLDSMPSNEPYLWADYIEILALTNIDRSFSRGDLYSTLQAQPEAVLAETDEAEEEGVYDVDDENDTPVRKRTKRSVSRAYTDRKWSYAIGFIRQRIDLFGDSYPFTLSEDNDTVELRDISEKPLEHLERLYLALLICANIKYVNIMSRREITRSFELISLPIFESLMPSGSIIKACWASGGQAAPYTGTLYNKFKSIASDIRCTANFKERDFSRGNSGDGGLDIIAWHPMGDQRDAIPISFVQCGCSQEEWEAKQLEASPAMLYSKFPVAHRWATYYFLPQDLRWIDGEWAHKNKLGDAIFVDRLRLINLTRASDNIDHSQNISYLDIILDPSSAIAA >CP034958|4505967:4540540|4527513_4527966_-|QAS87395.1|DBSCAN-SWA MNEFKRFEDRLTGLIESLSPSGRRRLSAELAKRLRQSQQRRVMAQKTPDGTPYAPRQQQSARKKTGRVKRKMFAKLITSRFLHIRASPEQASMEFYGGKSPKIASVHQFGLSEENRKDGKKIDYPARPLLGFTGEDVQMIEEIILAHLER >CP034958|4505967:4540540|4517568_4518324_-|QAS87386.1|DBSCAN-SWA MKKAGLLFLVMIVIAVVAAGIGYWKLTGEESDTLRKIVLEECLPNQQQNQNPSPCAEVKPNAGYVVLKDLNGPLQYLLMPTYRINGTESPLLTDPSTPNFFWLAWQARDFMSKKYGQPVPDRAVSLAINSRTGRTQNHFHIHISCIRPDVRKQLDNNLANISSRWLPLPGGLRGHEYLARRVTESELVQRSPFMMLAEEVPEAREHMGSYGLAMVRQSDNSFVLLATQRNLLTLNRASAEEIQDHQCEILR >CP034958|4505967:4540540|4507905_4508832_+|QAS87374.1|DBSCAN-SWA MTEQQISRTQAWLESLRPKTLPLAFAAIIVGTALAWWQGHFDPLVALLALITAGLLQILSNLANDYGDAVKGSDKPDRIGPLRGMQKGVITQQEMKRALIITVVLICLSGLALVAVACHTLADFVGFLILGGLSIIAAITYTVGNRPYGYIGLGDISVLVFFGWLSVMGSWYLQAHTLIPALILPATACGLLATAVLNINNLRDINSDRENGKNTLVVRLGEVNARRYHACLLMGSLVCLALFNLFSLHSLWGWLFLLAAPLLVKQARYVMREMDPVAMRPMLERTVKGALLTNLLFVLGIFLSQWAA >CP034958|4505967:4540540|4533942_4534149_-|QAS87403.1|DBSCAN-SWA MHRIPGEIPHHKTKNIKLMAIVQRLQRIMVNENLTPDELVGCAEIVRDNYGRFNHIGQSRVAPPQRRR >CP034958|4505967:4540540|4527958_4528426_-|QAS87396.1|tail|DBSCAN-SWA 
MLKPDSLRRALTDAVTVLKTSPEMLRIFVDNGSIASTLARSLSFEKRYTLNVIVTDFTGDFDLLIVPVLAWLRENQPDIMTTDAGQKKGFTFYADINNDSSFDISISLMLTERTLVSEVDGALHVKNIPEPPPPEPVTRPMELYINGELVSKWDE >CP034958|4505967:4540540|4505967_4506498_+|QAS87372.1|protease|DBSCAN-SWA MTTIVSVRRNGHVVIAGDGQATLGNTVMKGNVKKVRRLYNDKVIAGFAGGTADAFTLFELFERKLEMHQGHLVKAAVELAKDWRTDRMLRKLEALLAVADETASLIITGNGDVVQPENDLIAIGSGGPYAQAAARALLENTELSAREIAEKALDIAGDICIYTNHFHTIEELSYKA |
41 | Escherichia_virus(26.09%) | tail,integrase,protease,lysis | attL 4521864:4521910|attR 4537752:4537798 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
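The protein records listed above appear to use headers of the form `>contig|region_start:region_end|start_end_strand|protein_id[|label]|DBSCAN-SWA`, where the optional `label` field (for example `tail` or `lysis`) seems to correspond to the entries in the Key_proteins column. The following is a minimal Python sketch for splitting such headers and collecting the labels, assuming that layout holds for every record. The function names `parse_header` and `key_proteins` are hypothetical and are not part of DBSCAN-SWA.

```python
def parse_header(header):
    """Split one header of the assumed form
    '>contig|region|start_end_strand|protein_id[|label]|tool'."""
    fields = header.lstrip(">").strip().split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected header: {header!r}")
    contig, region, locus, protein_id = fields[:4]
    # The locus looks like '4513781_4514528_+': start, end, strand.
    start, end, strand = locus.rsplit("_", 2)
    region_start, region_end = region.split(":")
    return {
        "contig": contig,
        "region": (int(region_start), int(region_end)),
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "protein_id": protein_id,
        # The label is present only when the header has six fields.
        "label": fields[4] if len(fields) == 6 else None,
        "tool": fields[-1],
    }


def key_proteins(headers):
    """Return the distinct labels in order of first appearance."""
    labels = []
    for header in headers:
        label = parse_header(header)["label"]
        if label and label not in labels:
            labels.append(label)
    return labels


if __name__ == "__main__":
    example = [
        ">CP034958|4505967:4540540|4513781_4514528_+|QAS87380.1|DBSCAN-SWA",
        ">CP034958|4505967:4540540|4528388_4528562_-|QAS87778.1|lysis|DBSCAN-SWA",
        ">CP034958|4505967:4540540|4524602_4525130_-|QAS87392.1|tail|DBSCAN-SWA",
    ]
    print(key_proteins(example))
```

Records without a label field are kept but contribute nothing to the label list, which matches how unlabeled proteins appear in the listing.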
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
8609 : 57113
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP034956|8609:57113|DBSCAN-SWA CATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCCTTTTAAGCCATGCAACGCGCCCTGCGAGACGTAATCGCCATAGCTGGAGATGCCGAGCGCAAAAAGGATCAAGGCTATGGCAGACGGCAGCGTGAAGCCAGCCCAAGCAGCCAGCGCCCCGCTGTATCCAGCCCGAGACAGTCCTACCGCTATGCCGACCTGGCTGCTTGCAGGCCCTGGCAAGAACTGACAAAGCGCGACCAAGTCAGCATAGCTCCGTTCGGAGAGCCAGCGCCGCCGTGTGACAAATTCGGCGCGGAAGTAGCCCAAGTGCGCAATGGGGCCGCCAAAAGATGTCAATCCAAGCCGCAGAAAAATAAGAAAGACCGACCATGGTCTGCTGTCATCGGTAGGGTTATTCGTCATACTTTCGCCTTCATGATCTGCAACGAGTTGATCAATAATAAGCGAAATTCGATAACGAAATTCGATATAAATCTAGAAAAAAATACCTCTATGTGTACTACGCAGTTTTAGCTGTGGCTTTCACAGGAGCACGCTTACTTACGGCTTAGCGTGCTTTATTTTCCGTTTTCTGAGGCGATCCCTAGGAGCTCGGATCTCAGGACGAAGGTCTCCGCGAATGTCCGGTCGATCCGCGCGACGTCCCAGGCGGGCGTTCCCTTGGCGGACATCCACGCCGCAGCGTCGTGCATCAGCCGCACAACCTCGTCGATATCACCCGAGCAGGCGACCCGAACGTTCGGAGGCTCCTCGCTGTCCATTCGCTCCCCTGGCGCGGTATGAACCGCCGCCTCATAGTGCAGTTTGATCCTGACGAGCCCAGCATGTCTGCGCCCACCTTCGCGGAACCTGACCAGGGTCCGCTAGCGGGCGGCCGGAAGGTGAATGCTAGGCATGATCTAACCCTCGGTCTCTGGCGTCGCGACTGCGAAATTTCGCGAGGGTTTCCGAGAAGGTGATTGCGCTTCGCAGATCTCCAGGCGCGTGGGTGCGGACGTAGTCAGCGCCATTGCCGATCGCGTGAAGTTCCGCCGCAAGGCTCGCTGGACCCAGATCCTTTACAGGAAGGCCAACGGTGGCGCCCAAGAAGGATTTCCGCGACACCGAGACCAATAGCGGAAGCCCCAACGCCGACTTCAGCTTTTGAAGGTTCGACAGCACGTGCAGCGATGTTTCCGGTGCGGGGCTCAAGAAAAATCCCATCCCCG
GATCGAGGATGAGCCGGTCGGCAGCGACCCCGCTCCGTCGCAAGGCGGAAACCCGCGCCTCGAAGAACCGCACAATCTCGTCGAGCGCGTCTTCGGGTCGAAGGTGACCGGTGCGGGTGGCGATGCCATCCCGCTGCGCTGAGTGCATAACCACCAGCCTGCAGTCCGCCTCAGCAATATCGGGATAGAGCGCAGGGTCAGGAAATCCTTGGATATCGTTCAGGTAGCCCACGCCGCGCTTGAGCGCATAGCGCTGGGTTTCCGGTTGGAAGCTGTCGATTGAAACACGGTGCATCTGATCGGACAGGGCGTCTAAGAGCGGCGCAATACGTCTGATCTCATCGGCCGGCGATACAGGCCTCGCGTCCGGATGGCTGGCGGCCGGTCCGACATCCACGACGTCTGATCCGACTCGCAGCATTTCGATCGCCGCGGTGACAGCGCCGGCGGGGTCTAGCCGCCGGCTCTCATCGAAGAAGGAGTCCTCGGTGAGATTCAGAATGCCGAACACCGTCACCATGGCGTCGGCCTCCGCAGCGACTTCCACGATGGGGATCGGGCGAGCAAAAAGGCAGCAATTATGAGCCCCATACCTACAAAGCCCCACGCATCAAGCTTTTGCCCATGAAGCAACCAGGCAATGGCTGTAATTATGACGACGCCGAGTCCCGACCAGACTGCATAAGCAACACCGACAGGGATGGATTTCAGAACCAGAGAAAGAAAATAAAATGCGATGCCATAACCGATTATGACAACGGCGGAAGGGGCAAGCTTAGTAAAGCCCTCGCTAGATTTTAATGCGGATGTTGCGATTACTTCGCCAACTATTGCGATAACAAGAAAAAGCCAGCCTTTCATGATATATCTCCCAATTTGTGTAGGGCTTATTATGCACGCTTAAAAATAATAAAAGCAGACTTGACCTGATAGTTTGGCTGTGAGCAATTATGTGCTTAGTGCATCTAACGCATAGTTGAGCGGCGGGCGCAGCCCGTCCGCTTGAACGCCGAGTTAGGCATCAGATGCCCTCGGCGCGGGTCGATGCACTTTTCGCACATGCCGCTCAACGCAAGATTCTCTCAATCGTTGCTTTGGCATATCGAACGAACGCGGCCGTCTCTTCGACGCGCATTGCTAGGTCGTCGTCCTCGCTACCCAGGTACGCCGCGCGTGCCTTGCAGATGAGGGGCCGATGCTCGGCAGGCAAACGCTCCGATACCCATGCGGCAGCAACGTCCTTAGGAGCAATGAGACCAGTTGAAGCGCTGTACCAAATGCGAGCAAGAGCAAGAACGACGTTCCGCTCGTCACCCTTCCAATCCGACTCTGCATTCCACTGGGCAATAGTGTCGAAAAGCGCCTTGGAGAAATGCTCCTTCGGCACCGGCTCGAAAAACGTGGCTGCGGATGGGCCTAGAAGCGCAAGGCTGTGTTGCCTCGCCTTGGTCAGCAAAATCGCAAGATCGTGATCCAGAACGGCAGGCTCGAACGTTCCGGAAAGGATGTCGTGGCGGAGCCACTCACCGAACTGAAGCTCACGCCGCGCCGGATAGCGCCAAGGCACTACTTCGCTTCGAGCGACAACAGTTAGCTCCAGCGGTCGCCATGTTCCGCCATCGCCTGGCGGTGATGAGACTTTCAGCAAATCGAGCATTAGCGCCTGCCGGAGCGAATCGTTAGGTGCGGCGCTGACGGTCACGAGCAAGTCTATGTCGCTGTCCGGCTTCAGCCCTCCATCGATCGCAGATCCGAACAGGTGGATTGTGTCCAGTGTCGCAGCCAGATGGCGCTCGATCACCGCGCGAGCGTGGGACAGCTGCTTGAAAACTTGTGCAGGGAAAAATTCACCCATGATGCCTAACGTTAAGTTCAGCGGCAGCTTTTAAGTTGCGGCTTTGTGGAATACTTTTGCGCAGCAAAACCACAAAGACGCGACTTAAAAGCTGTCCAAGGAGCGAAGCGACTGGTGCTGCAACGCATTGTTAGCCTTTTTTCCA
AATCTGGTATGTATAATTTATATTAGACATAAAAAACTGTTCAAAAACCAAATTGAAATTCTCAGGCATTATAGGGAATTTGATATCACCTTCGACTTCAACGTGAACAGTAGACAAATGAATTATATCTGCTTTTTCAATAAGGCTATTATAGATTTGACCCCCGCCAGAGACATATACATGATCTGTAACTTTTGATAGCTCTTTCAAAGCATTTTCTATTGAAGGAAAAACTAGGACGTTTTCATTTGAGCTTGAAATTCCGTTCTTTGACACTACTGCATATTTGCGATTTGGAAGAACACCCATAGAGTCAAATGTTTTTCTTCCGACAAGGAGCCATTGATTATATGTGAGCGCTTTAAAGAGTAGTTGCTCACCTTTTACTGACCACGGGATATCAGGACCACTACCGATTACGCCATTTTCTGACACTGCAGAAATCAATGATATTTTCAATTTAACTCCCTTAATGGCTAACTTTGTTTTAGGGCGACTGCCCTGCTGCGTAACATCGTTGCTGCTCCATAACATCAAACATCGACCCACGGCGTAACGCGCTTGCTGCTTGGATGCCCGAGGCATAGACTGTACAAAAAAACAGTCATAACAAGCCATGAAAACCGCCACTGCGCCGTTACCACCGCTGCGTTCGGTCAAGGTTCTGGACCAGTTGCGTGAGCGCATACGCTACTTGCATTACAGTTTACGAACCGAACAGGCTTATGTCCACTGGGTTCGTGCCTTCATCCGTTTCCACGGTGTGCGTCACCCGGCAACCTTGGGCAGCAGCGAAGTCGAGGCATTTCTGTCCTGGCTGGCGAACGAGCGCAAGGTTTCGGTCTCCACGCATCGTCAGGCATTGGCGGCCTTGCTGTTCTTCTACGGCAAGGTGCTGTGCACGGATCTGCCCTGGCTTCAGGAGATCGGAAGACCTCGGCCGTCGCGGCGCTTGCCGGTGGTGCTGACCCCGGATGAAGTGGTTCGCATCCTCGGTTTTCTGGAAGGCGAGCATCGTTTGTTCGCCCAGCTTCTGTATGGAACGGGCATGCGGATCAGTGAGGGTTTGCAACTGCGGGTCAAGGATCTGGATTTCGATCACGGCACGATCATCGTGCGGGAGGGCAAGGGCTCCAAGGATCGGGCCTTGATGTTACCCGAGAGCTTGGCACCCAGCCTGCGCGAGCAGCTGTCGCGTGCACGGGCATGGTGGCTGAAGGACCAGGCCGAGGGCCGCAGCGGCGTTGCGCTTCCCGACGCCCTTGAGCGGAAGTATCCGCGCGCCGGGCATTCCTGGCCGTGGTTCTGGGTTTTTGCGCAGCACACGCATTCGACCGATCCACGGAGCGGTGTCGTGCGTCGCCATCACATGTATGACCAGACCTTTCAGCGCGCCTTCAAACGTGCCGTAGAACAAGCAGGCATCACGAAGCCCGCCACACCGCACACCCTCCGCCACTCGTTCGCGACGGCCTTGCTCCGCAGCGGTTACGACATTCGAACCGTGCAGGATCTGCTCGGCCATTCCGACGTCTCTACGACGATGATTTACACGCATGTGCTGAAAGTTGGCGGTGCCGGAGTGCGCTCACCGCTTGATGCGCTGCCGCCCCTCACTAGTGAGAGGTAGGGCAGCGCAAGTCAATCCTGGCGGATTCACTACCCCTGCGCGAAGGCCATCGGTGCCGCATCGAACGGCCGGTTGCGGAAAGTCCTCCCTGCGTCCGCTGATGGCCGGCAGCAGCCCGTCGTTGCCTGATGGATCCAACCCCTCCGCTGCTATAGTGCAGTCGGCTTCTGACGTTCAGTGCAGCCGTCTTCTGAAAACGACAATGGAGGTGGTAGCCGAGGGTGTGGAAACACCCGACTGCCTTGCGTGGTTGCGGCAGGCGGGTTGCGACACGGTGCAGGGTTTCCTGTTCGCCAGGCCGATGCCGGCGGCGGCCTTCGTCGGCTTCGTCAACCAATGGAGGAACACCGGCACTGTTGCAAAGTTAGCG
ATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCCTCAATACTCGTGTGGGCTCTGTTGCAAAAATCGTGAAGCTTGAGCATGCTTGGCGGAGATTGGACGGACGGAACGATGACGGATTTCAAGTGGCGCCATTTCCAGGGTGATGTGATCCTGTGGGCGGTGCGCTGGTATTGTCGCTATCCGATCAGCTATCGCGACCTTGAGGAAATGCTGGCGGAACGCGGCATTTCGGTCGACCATACGACGATCTATCGCTGGGTCCAGTGCTACGCCCCGGAGATGGAGAAGCGGCTGCGCTGGTTCTGGCGGCGTGGCTTTGATCCGAGCTGGCGCCTGGATGAAACCTACGTCAAGGTGCGGGGCAAGTGGACCTACCTGTACCGGGCAGTCGACAAGCGGGGCGACACGATCGATTTCTACCTGTCGCCGACCCGCAGCGCCAAGGCAGCGAAGCGGTTCCTGGGCAAGGCCCTGCGAGGCCTGAAGCACTGGGAAAAGCCTGCCACGCTCAATACCGACAAAGCGCCGAGCTATGGTGCAGCGATCACCGAATTGAAGCGCGAAGGAAAGCTGGACCGGGAGACGGCCCACCGGCAGGTGAAGTATCTCAATAACGTGATCGAGGCCGATCACGGAAAGCTCAAGATACTGATCAAGCCGGTGCGCGGTTTCAAATCGATCCCCACGGCCTATGCCACGATCAAGGGATTCGAAGTCATGCGAGCCCTGCGCAAAGGACAGGCTCGCCCCTGGTGCCTGCAGCCCGGCATCAGGGGCGAGGTGCGCCTTGTGGAGAGAGCTTTTGGCATTGGGCCCTCGGCGCTGACGGAGGCCATGGGCATGCTCAACCACCATTTCGCAGCAGCCGCCTGATCGGCGCAGAGCGACAGCCTACCTCTGACTGCCGCCAATCTTTGCAACAGAGCCCGCCGTGCTAGTCTGCTCGGTGATGGTGGAGTGAAGCCAACCCGCAATCGGGTTATGAATCTGCATCGCGATTCGCAATCAGCTGTCTCTTGAGCATGTCGAACTCCTGCGATGTTATCTCGCCGTGCTCGTGCTGCTACACCAACTTGTCCAACTCATCCGCCACACCGCTCCCCACCACCTGCGATGGCCAGGTTGTTTGCTGGGCGCATTGGCGGAGTATCGTCTCGACCCGGGACACCCACACCGGCACAACCTTACGGGAGTGATTCACTGTCAAAGAATCGGCCCGGTGCTCTGACGCAA
GTATCGGGATGGTCACCATTTGTAAGCCGTAGACCTGAGTGGTGATCAAGACTTCGATACCACCGACCGTACCGGTACTAATCGACGACGGTCGTGTTCGTCGCCTGCCGCAGGGACTCTGCACACCTCCGTTTACGCATGTGCCTGGAGGAGTTGGAAATCGTCGTGTTCGGGAAACATTAAACACAGGATGGCAGCGATCTGAGCCAGCACATGATCAGCTAGCTCACCATCCGGATCGACGGCCCACTGCATCGTCGCGCCAGCGATGACCGAGTGCAGGAGCAACTCAGCTGCCGCAGGAGCACCTGGGGGCAGTCGCTTGCGGATCCCCTCCACCACCGCGCGGTTCCGCTGGATCGCAAGCGTGCGTAGCTCCGGCACCTGGAGCTCGTACCAGGAGATGAGATAGTTCACCGAGAAGTCGTTGCGAGTGTTCATGCTCCGAACGAGCACCTGCAAAAATTCCCAGAGCCCTTGCGGCCCTGCGCCTATCGGTATCGCATTCAGGTAATGCCGCACCTGCTCGACGCCGCGCTCCATCATCCTCACCAGCAGCGTATCGCGGTTGGTGAAGCGCTGGATTAACGCTGCGCGGGAGAGCCCCACCTCCTTTGCTACTCCGCTGAGCGTGAACTCTATGGGACCGCAACGCTTCAGCACTACGGTGGCGGCCTCGAGTACCTCGTCATCGGACTTGAGCTTGGGGCGGGGCATCAGTGTTCACCTTCTGTATGGGTTGGGGCGGAGGCTGTGGCTGCCGCCGCCATTGTAGCAAATTGAAGACGGAGCGAGAGTAGAGCCACGAGCCCCGCAAACACGGCCGATACAACGAGGCCAGGGAGCGGACCAGCAAGGTCGACAAACGCGCCGGCCGCAAGCATAACCATGGGCGAGGCTGACAGCATCACCGCCGAGACCGTGCCGAGTACCCGGCCGAGAAGTTCTGGCGGCGTGCGGTTGTAGATGGCAGCGTTGAGAATGGGAGAGACTGAGCCGGTCAGCAGTCCCACGAGCGCGCCCAACAACATCAGCACCGGCACGCCTGGCAACTGTGAAAGCAGAAGCGAGCCCACCGCAGAGCCACAAAATGCCACCGCCAGCCAGTTCTGCGCTGATATCCGGGCGCCGACCGACGCATGAATGGCAATGCCAAGGAGACCACCAGCCCCCATCATTGAGGAGAACAGCCCGAGCTCTGCTACTTGGCGTCCTGCATCTACAAACAGCGCAGGCATGATGACGCTGCCGTTGGCGCCAACGATGCCCACGAAGATCATCACTATACCAAAGAGAGGGCGCAGCAGGGGTTCGCTCCAGAGAAAAGCGACGCCGGCGCGCATGGAGAGAGTCGCCGTCGTGGTCATCGTCCGAGCGGCACGCGCGGGAAGCACCCACGCGCCGAGCAGACCTGCAAGGACGGAGCAGAACGCCGTCAGCCCGAGCGTTGGCGCAGCGCCAAGCAGGCCGATTGCGGCCCCCCCAAGGGCCGGGCCACCTAGAATCGCGACGTTCCCGATCACCGCTTTCAGTGACGAGACGCGCTCAACGGAGAGCCCGGCGACGTGGCCGAGTTTGGGCAGCTCACTGTCCTGCGCGGCCATACCGGGTGCGTCGAACGCGGCACCGAGCACCACGCAAGCGATCAGCCCAGTGTTCGAGAGGGCGCCAACGGCATCGAGCAGTGGGATGCTCGCCATGGCCACGCCGCCCACCACACCCGAGATCAATGCGACGGGCGCGCGCCCGAACCGATCGACGAGGCCACCACCAACCCACGCGCCGATGATGGTCGCGATGACGCTGCTAGCGGCCGTGGCGCCCGCCCAGGCCGCGCTCTTTGTATGAGACAGGACGAACCATGGAAGCGCGAGGGCCGCCACCGCGTTGCCGATCCGGAAGAGAAAGGTCGCCGCGAACAGCGTCGCGAGCGGGCTATATCGACGTTCGCTCATTCCGCTGCGGCGAGCTGCGCCTTCGCCGCAGCGAGGTACTCTTCG
TTACCCGAGTCGAGGGCGAAGAGTGCGTAGGTGACCGCCCCGAACGCAAGGCGCTCCGCGATGTGGTGGGCGAGCCGCGGCCACACCCGGCCACCGGCCGCTTCATACGTGAGGAGGAGCTTCGCGAGCCCCTCTTCACCAAAGACCATAAGGTGCGCGGCCATGTCGATGGCAGGGTCATCAACGCGGGCCTCGCTCCAGTCGATCATCCCGCTGACGCGCTCCGTGTTGTCGATGAGCACATGGCCCACGTAGAGATCGCCATGCACCACCACGGAGAAATCTGGCCACGACGAATCGTCGTCGAGCCAGCGCTGCCACCGGTGGAGGCGCTTGTCGTTCACCACGAACTCGCGTCGGACGCGGTCAACGTCGTCGGCCACCTTCTGACGGGCCTGCGTCGGTGTACGGATGAGCATCCCCGCATCCACGGCGGCGGAAATGGGGACGGCATGCAGGGCGGCGAGCGCGGTCGCGAAGCTCTCCGCGAAGACCTCCGAGTCCTGCGGCACGACCCAGTCGGGCGTGGACGAACCAGGCTGGATGACCATCGCAGTCGAGTCTTCGAGCATGGGATAGGCAACGAGCTCGGCGTTGGCCACGCGCCAGTCCGGCACCGCGAACGGCAGGCGATTCTTGAGCATTGCCAGCACCCGCGCCTCTGGTTCGACCTTCGCGCTTACCTCGGCTCGGCGCGGGATGCGCAGCACCCACCGACGTCCATCGTCGACGGTGGCGATCACGATCCTATAGTCGAGCCCAAGCTCATTGACAGTCAGCGGGCCATGGAGCTTGAGCCCATGTCGGGCTGCAAGTGCGTACAGTTGGGAGGTATCGGCGGTCGTGACTACGGTCATGATTCACTCCTGAGGGCTTGACGGGTTTAGCCACCTAAATGTAACAGTCACGTCGGTTATATTCAATCCGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCCGTCGGCTTGAGATGCACATGTTGGCTGAGGTCGCCGGGTAGCGTGAACAGGCGCGGACCGTCCCGAAACTGGAACACGCGATACAGATGGAATTGATCGCCCGCCTCCTTGGAGAATTCGAGTTCGTTGTGGCTGACCAAGAAAGACGAGCCTACCCCGCCATTGGTGGTTTTCACCTCGATGAAGCGCTCATGGGCGTCCTCTTCGAACGACAGGATGTCGAACCCCGCACCGTCTCCCTGGGTGTCGGACA
CCCAATCCAGCCGCTGAAAAAGCTCTGGGTGGCCGAGCTCGGTCAGGCGTTGCTGTTCGTAGCCAATCACCCACTGCTCCCCTGCCCGGCCCAGCTTGCGGTTGGCTTCATCGCGAGCGGCATAATCGAACTTTCGCGGTAGGCGTTGCCGTAGAGATGCCGGGGTACGCACAAGCACTTCACGGGCGGGTGGTTCTACCAAAGCCGCTCGGTAGGTTTTGTCACCCGGAAGTTTTACCTCCTCCAGGGCATCGACAAGAGCGCCGACCGTCTGCTGATGTTCCAGAACGTAGGCGTGTACGGATTTACGCAGCAGCAGTTGGCTGTTGCCGCGTGGCTTGTAGCCGTTGATATAGGGCAGGCCCAGGGCATCGAGTACGGCGCTAATGTTCTGGTGCTTGAGCTCGACTGAAGACTTGCTGCGACCGTTCAGCAGTTGGCGCAGTGCCTGGTTGTGCTCGGACTTGTTGTACGGCTCCCCAGCCGCCTCGGCACGCAGCATGTCGAAATAGTCTTCGACCGTGGCCAGGACCTCTTCTTCGGACCAGTCTTCGCCGATGCGAATGATGCGAAACCCGAGCCGCGTCAGCGCCGGAACGACGGTCGCCTCGCCACCGGAGAAGCTGTCAGCAGTGAGCGGGCCCTGCTCGGGAAATTGCTTGCCGAAGGCCACACCGGCGATGGCCTTGGAATCGCAATCGGTGCCGGTCTTCGGATCACGTACCAGGAAGTCGCGGGACTTGCCGTAGCCGTGGCGCGCCAGGAATTTCGTGCGGCCCAGTTGCACGAACTCATCGATGGCAGCCTGCACGGCGGCGGGGCTTCGAAGCTGGGAGAGTTGAGACACAGGGTCCTTCCTTACTGTCATGGTGTGCCGGGAACCGCCGAGCCACGAGATTATGAGTAGCCCCTGAACAGAAACGTCACGATAAAAGCCGTGAACGCCACCAGGCCCATTAAATCCCTTGCGTATTTGCAGCCCGTGCTGTCCAAACCTGTACCAGGTCCGATCAACACGCTCCAACCATTGAGGTACGAAAACACCGCCTGACCGAACAAAAAATGCGTGATAGCCGCCGCAGCCATGACGCCGGAATCGTCAGACAGGCTGTAGCTGTTGACCATTGCCCTGTACGCCTGCATCTGAAATGGCTATTGCCCTGATGGGAGCCGGCTTTTCAGCCACCGACACCAGCGATGCCGTCAATATTCTTTACCCGATAACCATGACCGTACAAGCTAACAAGGCCTGGCAAGCCAGCGGACTTAAAAAGTCTTTTTTTCTACCATCCCACAAAAAAGTTCGTGGTGGAGAAAAGATAAGCTATGCAAGGCTTTAGGAGACGTGGTTTTTCAGGATGACGAAGAACGATTCGGCGCTAGGTGCAACATAGGTGCATCGCACGAGCGCTAGGAACGGCGAAAAAAGGCGGACGTGGCGAAATCGGTAGACGCAGCAGACTTTAAAATTGGAGTGCCCGCGGGGAAATCCGCGGAGTAGAACCGCTCAAAGTCGGGGAACGCTAACGGGCAATACCCTAAGCCAATCCCGAGCCAAGCCCCTTCGGGGGAAGGTGTAGAGACTGGACGGGCGGCGCCTAAAGCCTTCGGGCAATGGCGAAGGGACAGTCCAGACCACGAACGTCATCAGACGGCGGCGAAAGTCGAGGTGGTACGAAAATCTGCTTCTCTGTGAGAGTACGGGTTCGAGTCCCGTCGTCCGCACCACAAAGCCAAACATCCCTGCGATGATCGACCTCTGGGCGTTTGGTCGTGAGCCCGCCACCCTCGCGCTACGCTTTGCGCAGGCGTCGAAGGCGATCAGGTGCGCCCATCGATTCCGTCGAGCACCATCGCTGCGAGTTCATCGCTGGTGTGTGCACCACTATCGAGAACACGGGTTGTGCATGGAAGCCGTTCCCTTGCGGCCAAGCATCGGGCGACATTCGCTAATCGCCACTCTCGAATCTCCGCATTTCGATTCGGGTCAGGATGCATGG
TCTGGTTCGCGATCCGGTGACGCAATAGGTCCTCGTTGAGCGTCAGAAAGATGTGCAGCAGCTGATCGTCGATCCGCCTTACCCCGTCGAGTATCTCAGTCAGATAGTCCGGGTGCACGAGCGTCATTGGGATGATGATGTCCTGCGAGTAATTCCTTCGAATCTCCCTGACCGCCGCGATCGTAAGTCCCCTCCACAAGGGGAGATCCTGATAGTCTCCGCTCGCTGGCATGGGGACCGTTTCTTTCACCACGAACCCGATTTCCTCGGGGTCAAAGATCAGCGATTTGGAACGCCGATCGCGCAGCCGCTTAGCGAGCGTCGTCTTTCCGGCGCCGAAAGGTCCGTTGATCCAGATTATCATTGTCGACGGCCTCTAACCTGAAGGCTCGCAAGAGCGCTCGACGGCCTCGTGCGGAGGCACGATCGGAGTGGTTCCGAAATGCTTCTCAAGATAGGTGACGCCGAACGTCACGATGTCCTGCGCGTCGAACAGGTAGCACTGAGCAAAGCCCACGACACCTTCTCGATGGCGACCGAGCTTCACGTAAGCATTTGCTATAGTTTCAACCGCATCCGGCTTTCCTTCGATAGCAAAGCAATCGAGAATGCCGTTTGAATCGTAATCCGATGCCGTTTTCCAGGCGACTTCACCGTCTCTTCCAAGCATCGGCATCTCATACGTCACCCACCGTTTGTTGGGGATATCGGCAACCGCCTCGGCGTAGTGCAATGCGGTAACGGAGTTTAGCGGCGCACCCAACAGCAGGGCCTTCCCGCCAAGGCGAACGAACCGCTCGACGGGCGATCCTTCCCCCAAGGCGTGACCGAGTTCGTGAGGCTCCGTCAGCGTTTCAGCCAGCGGACCAACCGCGACCATCGATGCATCGGGGTGCGCGCTGCGCCGCGCGCCGGGGGCTTGAACCAGAAATTGATTCAGCAGGCCGAACCCACGGTAAGTCCCGGCTGTTGCGGGATCGAACGGCAGCCAGGTACGGCGGGCTTCGTCATCCAGCCGAGCGCCATTCAGAGTCTCCTCGTAGGGTGATCGGTCCCACGACGCGTATCCCATCACAGTGCCAGTCGGCCCAACCGCGGAGCGTAACGCGGCAACGACCGTCTCCGCTCCTCCTTCGACCGGACCAATCGCTTTAAGTGAGGCATGCACCATCAAGAGGTCACCGGTTTGGACTCCGAGTTTTTGAAGCGCCTCCGTTATTGCCTTCCGCGTATGCATCGCGATATCTCCTCTAAACTGCAAAACACTATACGCTGATGAATCCCCTAATGATTTTGGTAAAAATCATTAAGTTAAGGTGGATACACATCTTGTCATATGATCAAATGGTTTCGCGAAAAATCAATAATCAGACAACAAGATGTGCGAACTCGATATTTTACACGACTCTCTTTACCAATTCTGCCCCGAATTACACTTAAAACGACTCAACAGCTTAACGTTGGCTTGCCACGCATTACTTGACTGTAAAACTCTCACTCTTACCGAACTTGGCCGTAACCTGCCAACCAAAGCGAGAACAAAACATAACATCAAACGAATCGACCGATTGTTAGGTAATCGTCACCTCCACAAAGAGCGACTCGCTGTATACCGTTGGCATGCTAGCTTTATCTGTTCGGGCAATACGATGCCCATTGTACTTGTTGACTGGTCTGATATTCGTGAGCAAAAACGACTTATGGTATTGCGAGCTTCAGTCGCACTACACGGTCGTTCTGTTACTCTTTATGAGAAAGCGTTCCCGCTTTCAGAGCAATGTTCAAAGAAAGCTCATGACCAATTTCTAGCCGACCTTGCGAGCATTCTACCGAGTAACACCACACCGCTCATTGTCAGTGATGCTGGCTTTAAAGTGCCATGGTATAAATCCGTTGAGAAGCTGGGTTGGTACTGGTTAAGTCGAGTAAGAGGAAAAGTACAATATGCAGACCTAGGAGCGGAAAACTGGAAACCTATCAGCAACTTACATGATATGTCA
TCTAGTCACTCAAAGACTTTAGGCTATAAGAGGCTGACTAAAAGCAATCCAATCTCATGCCAAATTCTATTGTATAAATCTCGCTCTAAAGGCCGAAAAAATCAGCGCTCGACACGGACTCATTGTCACCACCCGTCACCTAAAATCTACTCAGCGTCGGCAAAGGAGCCATGGGTTCTAGCAACTAACTTACCTGTTGAAATTCGAACACCCAAACAACTTGTTAATATCTATTCGAAGCGAATGCAGATTGAAGAAACCTTCCGAGACTTGAAAAGTCCTGCCTACGGACTAGGCCTACGCCATAGCCGAACGAGCAGCTCAGAGCGTTTTGATATCATGCTCTGATGAATCCCCTAATGATTTTGGTAAAAATCATTAAGTTAAGGTGGATACACATCTTGTCATATGATCAAATGGTTTCGCGAAAAATCAATAATCAGACAACAAGATGTGCGAACTCGATATTTTACACGACTCTCTTTACCAATTCTGCCCCGAATTACACTTAAAACGACTCAACAGCTTAACGTTGGCTTGCCACGCATTACTTGACTGTAAAACTCTCACTCTTACCGAACTTGGCCGTAACCTGCCAACCAAAGCGAGAACAAAACATAACATCAAACGAATCGACCGATTGTTAGGTAATCGTCACCTCCACAAAGAGCGACTCGCTGTATACCGTTGGCATGCTAGCTTTATCTGTTCGGGCAATACGATGCCCATTGTACTTGTTGACTGGTCTGATATTCGTGAGCAAAAACGACTTATGGTATTGCGAGCTTCAGTCGCACTACACGGTCGTTCTGTTACTCTTTATGAGAAAGCGTTCCCGCTTTCAGAGCAATGTTCAAAGAAAGCTCATGACCAATTTCTAGCCGACCTTGCGAGCATTCTACCGAGTAACACCACACCGCTCATTGTCAGTGATGCTGGCTTTAAAGTGCCATGGTATAAATCCGTTGAGAAGCTGGGTTGGTACTGGTTAAGTCGAGTAAGAGGAAAAGTACAATATGCAGACCTAGGAGCGGAAAACTGGAAACCTATCAGCAACTTACATGATATGTCATCTAGTCACTCAAAGACTTTAGGCTATAAGAGGCTGACTAAAAGCAATCCAATCTCATGCCAAATTCTATTGTATAAATCTCGCTCTAAAGGCCGAAAAAATCAGCGCTCGACACGGACTCATTGTCACCACCCGTCACCTAAAATCTACTCAGCGTCGGCAAAGGAGCCATGGGTTCTAGCAACTAACTTACCTGTTGAAATTCGAACACCCAAACAACTTGTTAAGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAG
CGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCCTAGAACGCACGAATGAGGGCCGACAGGAAGCAAAGCTGAAAGGAATCAAATTTGGCCGCAGGCGTACCGTGGACAGGAACGTCGTGCTGACGCTTCATCAGAAGGGCACTGGTGCAACGGAAATTGCTCATCAGCTCAGTATTGCCCGCTCCACGGTTTATAAAATTCTTGAAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGGTAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTTCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGTGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCTGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGCAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCTTATGGCAGAGCAGGGAAAGGAATTGCCGGGCTATGTGCAACGGGAATTTGAAGAATTTCTCCAATGCGGGCGGCTGGAGCATGGCTTTCTACGGGTTCGCTGCGAGTCTTGCCACGCCGAGCACCTGGTCGCTTTCAGCTGTAAGCGTCGCGGTTTCTGCCCGAGCTGTGGGGCGCGGCGGATGGCCGAAAGTGCCGCCTTGCTGGTTGATGAAGTACTGCCTGAACAACCCATGCGTCAGTGGGTGTTGAGCTTCCCGTTTCAGCTGCGTTTCCTGTTTGGGGTCGTTTGCGGGAAGGGGCGGAATCCTACGCTAAGGCTTTGGCCAGCGATATTCTCCGGTGAGATTGATGTGTTCCCAGGGGATAGGAGAAGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGG
TACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCCTTGATATCTAGTATGACGTCTGTCGCACCTGCTTGATCGCGGCCGCGATAGCTAGATCGCGTTGCTCCTCTTCTCCATCCGCGTTCCAAGCTGCGGAAAGGCACCCATAAGCGTACGCCTGGTCGAGCAGGCGACGCGGATCGACGTCCAGCGCACGAGAGAATGCGTCCGCCATCTGTGCAATGCGTCTAGGATCGAGACAAAGGTCGTCTCTGTCAGCCGGATCGTAGAACATATTGGCGGCGCCAAAGCCCACTTCACCGACCAGACCGACGGGATCTATCACCAGCCAGCCGCGACTGGAGAACATGATGTTTTCATGATGCAGATCGCCATGTAGCCCACGCAGTTCCGAGGCATTGCTCATCATTTGATCGGCTATAATCGCCGCGTGGACGTAGTCAGTTTGACAACCTGCGTTTTGATCATCGCGCGCCCGCTGAAACAAAGCTGCAAAGCGATCCCGGATCGGGAGAAGGGCAGAAGGCAGGGGTTCCTCAGATGCGGCATACAGCTTCGCCATTAGTTCCGCTGCAATTTCGGTCGCCTGGTAGTCGCCGTGCTCGGCAACGATGTGAGAGAGCATTCGCTCCCCGGCATATTCGAGCAACATCAGATTGTTCTCACGACCGAGCAACCGGACTGCTCCCCTCCCATTGCGCCATACCAGATAGTCGGCCCCGCGCAGTTCATCAGCAATGTCTTCTATAGGTTTCAATCCCTTGACGATTGCAGGAGTCCCGTCTGGCAATGAAACTTTCCAAACGAGGCTGGAAAAGGTGTCCGCAATGAGAACAGGTTGCGAAACGTGCCAATGAGCAGGAAAAACAGGCGGCATGAACATCAACCCCAAGTCAGAGGGTCCAATCGCAGATAGAAGGCAAGGCGTTCGCGGTCGGGGGCTTCGATCCCCAATACATTGAATAGGACAGCGAAGGCGCGCTCTGCTTCATCTGGCGCTGCCCAGTTCTCTTCGGCGTTAGCAATCATGAGTGCCAAATCGGCATAGCGATCTGCTGTTCCGAGCCGCCCAAGGTCGATCAGACCCGTGCATTGAAGAGTTTTAGGGTCCACCATGAAGTTCGGCATGCAGGGATCACCATGGCAAACAACCATATCGGTGCGCTCTTGGTCGAGCCGCACCGGTAGCTCTCGTTCGACACGAGCCAAAAGATCGAGCTGCGGCGTACTCTTGTCCTCGTCCGGTAAGAAGTCGGGATTGACGGCATTGCGGGACACCACATCAACGGCGCGTCCGAACATTCGCGACAGCCTGCGCTCAAACGGACATTGATCAACCGA
TAGGCTGTGAACAGCGCCAAGTTGCTGCCCCATTGACGGCCACGCTTTGAGCAAATCCGCTCCAGACAGATCAGCCGCCGGTACTCCCGGAATTGCCGTTATCACCAAGCATGCACCCTCCTGTTCCTCCTGCCAGTTGATGACCTCGGGGCAAGCCACACCTCGACCTTTGAGCCAAATGAGGCGGTCACGCTCTCCAGCGAGCTCACCGCGGCGGGAAGCAGGTGCGATTTTCGCGAAGGCATGCCCGTCACCACGTCGAAAAACAAAATCACCAGATTCTCCGCCTCTGACAGGCAACCAGTCAGAATGCGATTCACCAAAAAAAATATTAGTTCGATTCAATGGAGGTTCCTTCAGTTTTCTGATGAAGCGCGAATATAGAGAAATATCCCGAATGTGCAGTTAACGAATTCTTGCGGTTTCTTTTAGCGCCGCCAATACCGCCAGCCCGTCGCGCAAGGGGCGCGGCTCGTGTGTGCGGATGAAGTCAGCTCCACCTGCGGCGGCGGCAAGCTCTGCAGCGAGTGTCGCGGCCCCGACATCCCCCGGACCACGGCCTGTGAGCGCGCGCAGAAAGGATTTGCGCGAAACAGACAGAAGCACCGGCAAATCGAAGCGCAGCCGCAATTCATCGAACCGCGCCAGCACCGAGAGCGAGGTTTCGGGAGCAGCCCCCAGAAAAAACCCCATGCCGGGATCAAGGACAAGGCGGTTGCGTTTGATACCGGCACCCGTCAGCGCCGCGATGCGCGCGTCAAAGAACGCCGCAATGTGATCCATGATGTCGCCAGCGGGTGCCTCGCGCCGATCTGCCTGCCCGTCTTGCACCGAATGCATAACGACGAGTTTGGCAGATGATTTCGCCAATTGCGGATAGAACGCAGCGTCTGGAAAACCGCGAATATCATTGAGATAGGCCACACCACGCGACAAGGCATAGGCTTGCGTCGCGGGTTGATAACTGTCGAGCGAGACGGGAATGCCATCTGCCTTGAGCGCGTCCAGCACCGGCGCGATACGCTCGATTTCTGTGTCGGACGAAACAGGCGCGGCGTCGGGGTTGCTGGATGCCGGACCGAGGTCGATCACATCTGCCCCCTCGGCCATCAGCTTACGCGCCTGCGCAATGGCTGCGTCTGGCGCCAGATACCGGCCTCCATCGGAGAAACTGTCCGAGGTTATGTTGACGATGCCGAAAATGATGAGCGATTTATTCATGGGGGCTTCTATAATAATAATAATCGAGCATGAGTCTCATACGGATGCTCGGGTCGAAAGGGAATCCCCAGGCGAGTAACCTGTTTGCGGTGATCCATTAGCTGCAGGAGCAGAATAGCATACATCTGGAAGCAAAGCCAGGAAAGCGGCCTATGGAGCTGTGCGGCAGCGCTCAGTAGGCAATTTTTCAAAATATTGTTAAGCCTTTTCTGAGCATGGTATTTTTCATGGTATTACCAATTAGCAGGAAAATAAGCCATTGAATATAAAAGATAAAAATGTCTTGTTTACAATAGAGTGGGGGGGGTCAGCCTGCCGCCTTGGGCCGGGTGATGTCGTACTTGCCCGCCGCGAACTCGGTTACCGTCCAGCCCAGCGCGACCAGCTCCGGCAACGCCTCGCGCACCCGCTGGCGGCGCTTGCGCATGGTCGAACCACTGGCCTCTGACGGCCAGACATAGCCGCACAAGGTATCTATGGAAGCCTTGCCGGTTTTGCCGGGGTCGATCCAGCCACACAGCCGCTGGTGCAGCAGGCGGGCGGTTTCGCTGTCCAGCGCCCGCACCTCGTCCATGCTGATGCGCACATGCTGGCCGCCACCCATGACGGCCTGCGCGATCAAGGGGTTCAGGGCCACGTACAGGCGCCCGTCCGCCTCGTCGCTGGCGTACTCCGACAGCAGCCGAAACCCCTGCCGCTTGCGGCCATTCTGGGCGATGATGGATACCTTCCAAAGGCGCTCGATGCAGTCCTGTATGTGCTTGAGCGCCCCACCACTA
TCGACCTCTGCCCCGATTTCCTTTGCCAGCGCCCGATAGCTACCTTTGACCACCATGGCATCAGCGGTGACGGCCTCCCACTTGGGTTCCAGGAACAGCCGGAGCTGCCGTCCGCCTTCGGTCTTGGGTTCCGGGCCAAGCACTAGGCCATTAGGCCCAGCCATGGCCACCAGCCCTTGCAGGATGCGCAGATCATCAGCGCCCAGCGGCTCCGGGCCGCTGAACTCGATCCGCTTGCCGTCGCCGTAGTCATACGTCACGTCCAGCTTGCTGCGCTTGCGCTCGCCCCGCTTGAGGGCACGGAACAGGCCGGGGGCCAGACAGTGCGCCGGGTCGTGCCGGACGTGGCTGAGGCTGTGCTTGTTCTTAGGCTTCACCACGGGGCACCCCCTTGCTCTTGCGCTGCCTCTCCAGCACGGCGGGCTTGAGCACCCCGCCGTCATGCCGCCTGAACCACCGATCAGCGAACGGTGCGCCATAGTTGGCCTTGCTCACACCGAAGCGGACGAAGAACCGGCGCTGGTCGTCGTCCACACCCCATTCCTCGGCCTCGGCGCTGGTCATGCTCGACAGGTAGGACTGCCAGCGGATGTTATCGACCAGTACCGAGCTGCCCCGGCTGGCCTGCTGCTGGTCGCCTGCGCCCATCATGGCCGCGCCCTTGCTGGCATGGTGCAGGAACACGATAGAGCACCCGGTATCGGCGGCGATGGCCTCCATGCGACCGATGACCTGGGCCATGGGGCCGCTGGCGTTTTCTTCCTCGATGTGGAACCGGCGCAGCGTGTCCAGCACCATCAGGCGGCGGCCCTCGGCGGCGCGCTTGAGGCCGTCGAACCACTCCGGGGCCATGATGTTGGGCAGGCTGCCGATCAGCGGCTGGATCAGCAGGCCGTCAGCCACGGCTTGCCGTTCCTCGGCGCTGAGGTGCGCCCCAAGGGCGTGCAGGCGGTGATGAATGGCGGTGGGCGGGTCTTCGGCGGGCAGGTAGATCACCGGGCCGGTGGGCAGTTCGCCCACCTCCAGCAGATCCGGCCCGCCTGCAATCTGTGCGGCCAGTTGCAGGGCCAGCATGGATTTACCGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCCTTCAACGCATGAAAAAAGAGACTGCACAGTTATATCTGCTGTTTTTTGCTTTTCATCGGTTTCAGCAAATTAATGACAACTTAATTGAAGCATTGC
TCCATTGGGTCGATCAATACGAGAAACAGGCCAAGCGTGCCGCTGAAGAAGCAATGAATAATGCGGTTACCAATGCAGCGAAAAATTTACAGGCTGCGGGTCATGTATTGAGCCTGTTTACGGATGACACCATCACCGATGACACACCTTTTTCCATTATTAAAGAAAAAGCCTATGCATTGCTTGAACAAGAGAGATTCCCATTAGTTGCTGATTACTTACGCAATATTGCTTTCGACAAAACGGCATTTGAATGGTCACATTACACAAAATTATCCGCCACATTCAAACGTAACTTAAGGCAACTTTTTACTGATCTGGATTTTGCCGGACGTGTAGAAGACTCTCCTTTGCTTGAAGCTATCGCGTTTTTACAAAACTTATTGCGCACAGAAAAATCACCAAGGCAAACTGACCCTAATTCATTTCCGACTGAGATTATTCCTAAAGGTTTACGCCGATATTTGTTTAGTAAAGAGGGCAAAACATTTAAAACGCTTGATGTAGATCGCTATGAGTTTTTGGTCTATCGCCTACTACGCAACTCACTGGAAGCGGGTGATGTGTACGTTAAACCCATATAATGCGCCTACTTATCAGCCTTTGGGACGTTGGGACGGTATTTTTCACTATATACCCCCAAGTTTTTAGACATGAAAAAAGCTCGATATATTATAAATATATCGAGCCTTTAAGTTACTGTTTTTCAATATCGTTCAAAATCTTGATAATTACAGTATACGGTTTTATGAAGTGGTATATTTGCAAATGCTTATTTTGCAAGGTCTATTTGCTTGATTTATCCTACAATCGTAATGCAAAACCCATGACCGGAAGCAACGAACTTAGGTTCAATCTCACTCAAATCCGAGGATAGCTTTTGTCCGGATTCTTGATTGGGCAATGTATCTGCGCCGCTTGATTTTACCGCTGCGTCCCAACTGCTTTCGATAACTTGTCCGTGATTCTCACCCATGCCGAACATGAGGTCGCTGCTAAAGAAAATACCACGTTTCTTTTCAAAAAATAAAAGTCCTTCCCACATATGCATTTCAGATGGGTAACTGATAGTTTGAAATTCAAAATCATCTCCGGCAAATATTTCATTCGGCTTTTTAATAAGCACATTGTTAGTAATACCAAACCCCATCAGTTGTCTTGCTGTGGTTTCGGAACAAACCGCGACAGCTTCGGGATGTTCTTTGAGAACCAAAGCAAGTCCGCCACATTCGTCTGATTCAAAATGAGAAATTAGAATGTATTTTATTTTGCGTTCACCGAGCAACTCTTGCAACTTAGGAATGGTAGTTTGCGCTTGTGATACGGCTCCGGTCTGAATGAGAACAGGCTCATTTGTCATCAATAAGTATTGGTGCATTGAAAGCTTAATCGGCTCCATTACCTCTGTAAATTGGTATAAATCTTTGATAATCTCTGTCATTGTCAAAACACCTTTTTCTATTTATAGTCTAACCTAAATTCCACGTGTGTTTTTTATTAGCTTCAAAAATCACTATTTCACGAAGAATTTAGACTGCTTCTCACACATTGTAACATTATTTACAACCACCTTTCAATCATTTTTGATAAATCATTGATTTCATCTTTGCTGCAATGATACTTAATAAACTCTGCAAGTTATCCACAGAGCAACACTCAATTTTATTGATGATATTCTTATTATACCAGACATTTTTCATACACTCCCTTGTACGGATAGTTTTCCGACAACTTCATGATTACATATCTTGCGGTTTTGATTATTTTTGCTGCAAGAAATACATACTTCAAACGAAAGGTCTTTATTTGCTGTCTGTATTCTGAAGAGTCCAAGGAATCAAACTTGAACAACAAAAATAGGTTATATGAAAGCATCATCATTTGAAACACGGCTTCATTCGCCCAAAATGACTTTAGCAAGAGATGACCCACCGCCATGTCGTATTTGGCTTCTTTGATATAGTTTTCAGCATTACCACGCTTTTCATAGTATATAACTA
CTTAGGGAATTCCATGACTGGACAGCGCATTGGGTATATCAGGGTCAGCACCTTCGACCAGAACCCGGAACGGCAACTGGAAGGCGTCAAGGTTGATCGCGCTTTTAGCGACAAGGCATCCGGCAAGGATGTCAAGCGTCCGCAACTGGAAGCGCTGATAAGCTTCGCCCGCACCGGCGACACCGTGGTGGTGCATAGCATGGATCGCCTGGCGCGCAATCTCGATGATTTGCGCCGGATCGTGCAAACGCTGACACAACGCGGCGTGCATATCGAATTCGTCAAGGAACACCTCAGTTTTACTGGCGAAGACTCTCCGATGGCGAACCTGATGCTCTCGGTGATGGGCGCGTTCGCCGAGTTCGAGCGCGCCCTGATCCGCGAGCGTCAGCGCGAGGGTATTGCGCTCGCCAAGCAACGCGGGGCTTACCGTGGCAGGAAGAAATCCCTGTCGTCTGAGCGTATTGCCGAACTGCGCCAACGTGTCGAGGCTGGCGAGCAAAAGACCAAGCTTGCTCGTGAATTCGGAATCAGTCGCGAAACCCTGTATCAATACTTGAGAACGGATCAGTAAATATGCCACGTCGTTCCATCCTGTCCGCCGCCGAGCGGGAAAGCCTGCTGGCGTTGCCGGACTCCAAGGACGACCTGATCCGACATTACACATTCAACGATACCGACCTCTCGATCATCCGACAGCGGCGCGGGCCAGCCAATCGGCTGGGCTTCGCGGTGCAGCTCTGTTACCTGCGCTTTCCCGGCGTCATCCTGGGCGTCGATGAACTACCGTTCCCGCCCTTGTTGAAGCTGGTCGCCGACCAGCTCAAGGTCGGCGTCGAAAGCTGGAACGAGTACGGCCAGCGGGAGCAGACCCGGCGCGAGCACCTGAGCGAGCTGCAAACCGTGTTCGGTTTCCGGCCCTTCACCATGAGCCATTACCGGCAGGCCGTCCAGATGCTGACCGAGCTGGCGATGCAAACCGACAAAGGCATCGTGCTGGCCAGCGCCTTGATCGGGCACCTGCGGCGGCAGTCGGTCATTCTGCCCGCCCTCAACGCCGTCGAGCGGGCGAGTGCCGAGGCGATCACCCGTGCTAACCGGCGCATCTACGACGCCTTGGCCGAACCACTGGCGGACGCGCATCGCCGCCGCCTCGACGATCTGCTCAAGCGCCGGGACAACGGCAAGACGACCTGGTTGGCTTGGTTGCGCCAGTCTCCGGCCAAGCCAAATTCGCGGCATATGCTGGAACACATCGAACGCCTCAAGGCATGGCAGGCACTCGATCTGCCTACCGGCATCGAGCGGCTGGTTCACCAGAACCGCCTGCTCAAGATTGCCCGCGAGGGCGGCCAGATGACACCCGCCGACCTGGCCAAATTCGAGCCGCAACGGCGCTACGCCACTCTCGTGGCGCTGGCCACCGAGGGCATGGCCACCGTCACCGACGAAATCATCGACCTGCACGACCGCATCCTGGGTAAGCTGTTTAACGCTGCCAAGAATAAGCATCAGCAGCAGTTCCAGGCGTCAGGCAAGGCCATCAACGCCAAGGTACGTCTGTACGGGCGCATCGGTCAGGCGCTGATCGACGCCAAGCAATCAGGCCGCGATGCGTTTGCCGCCATCGAGGCCGTCATGTCCTGGGATTCCTTTGCCGAGAGCGTCACCGAGGCGCAGAAGCTCGCGCAACCCGATGACTTCGATTTCCTGCATCGCATCGGCGAGAGCTACGCCACCCTGCGCCGCTATGCACCGGAATTCCTTGCCGTGCTCAAGCTGCGGGCCGCGCCCGCCGCCAAAAACGTGCTTGATGCCATTGAGGTGCTGCGCGGCATGAACACCGACAACGCCCGCAAGCTGCCAGCCGATGCACCGACCGGCTTCATCAAGCCGCGCTGGCAGAAACTGGTGATGACCGACGCCGGCATCGACCGGCGCTACTACGAACTGTGCGCGCTGTCCGAGTTGAAGAACTCCCTGCGCTCGGGCGACATCTG
GGTGCAGGGTTCACGCCAGTTCAAGGACTTCGAGGACTACCTGGTACCGCCCGAGAAGTTCACCAGCCTCAAGCAGTCCAGCGAATTGCCGCTGGCCGTGGCCACCGACTGCGAACAATATCTGCATGAGCGGCTGACGCTGCTGGAAGCACAACTTGCCACCGTCAACCGCATGGCGGCAGCCAACGACCTGCCGGATGCCATCATCACCGAGTCGGGCTTGAAGATCACGCCGCTGGATGCGGCGGTGCCCGACACCGCGCAGGCGCTGATAGACCAGACAGCCATGGTCCTGCCGCACGTCAAGATCACCGAACTGCTGCTCGAAGTCGATGAGTGGACGGGCTTCACCCGGCACTTCACGCACTTGAAATCGGGCGATCTGGCCAAGGACAAGAACCTGTTGTTGACCACGATCCTGGCCGACGCGATCAACCTGGGCCTGACCAAGATGGCCGAGTCCTGCCCCGGCACGACCTACGCGAAGCTCGCTTGGCTGCAAGCCTGGCATACCCGCGACGAAACGTACTCGACAGCGTTGGCTGAACTGGTCAACGCTCAGTTTCGGCATCCCTTTGCCGGGCACTGGGGCGATGGCACCACATCATCATCGGACGGACAGAATTTCCGAACCGCTAGCAAGGCAAAGAGCACGGGGCACATCAACCCAAAATATGGCAGCAGCCCAGGACGGACTTTCTACACCCACATCTCCGACCAATACGCGCCATTCCACACCAAGGTGGTCAATGTCGGCCTGCGCGACTCAACCTACGTGCTCGACGGCCTGCTGTACCACGAATCCGACCTGCGGATCGAGGAGCACTACACCGACACGGCGGGCTTCACCGATCACGTCTTCGCCCTGATGCACCTCTTGGGCTTCCGCTTCGCGCCGCGCATCCGCGACCTGGGCGACACCAAGCTCTACATCCCGAAGGGCGATGCCGCCTATGACGCGCTCAAGCCGATGATCGGCGGCACGCTCAACATCAAGCACGTCCGCGCCCATTGGGACGAAATCCTGCGGCTGGCCACCTCGATCAAGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCAGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCACCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAA
GGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGAAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTACCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTTCAAGAATTTTATAAACCGTGGAGCGGGCAATACTGAGCTGATGAGCAATTTCCGTTGCACCAGTGCCCTTCTGATGAAGCGTCAGCACGACGTTCCTGTCCACGGTACGCCTGCGGCCAAATTTGATTCCTTTCAGCTTTGCTTCCTGTCGGCCCTCATTCGTGCGTTCTAGGATCCTCCGGCGTTCAGCCTGTGCCACAGCCGACAGGATGGTGACCACCATTTGCCCCATATCACCGTCGGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCCCGCAAGATATGTAATCATGAAGTTGTCGGAAAACTATCCGTACAAGGGAGTGTATGAAAAATGTCTGGTATAATAAGAATATCATCAATAAAATTGAGTGTTGCTCTGTGGATAACTTGCAGAGTTTATTAAGTATCATTGCAGCAAAGATGAAATCAATGATTTATCAAAAATGATTGAAAGGTGGTTGTAAATAATGTTACAATGTGTGAGAAGCAGTCTAAATTCTTCGTGAAATAGTGATTTTTGAAGCTAATAAAAAACACACGTGGAATTTAGGGACTATTCATGTTGTTGTTATTTCGTATCTTCCAGAATAAGGAATCCCATGGTTAAAAAATCACTGCGCCAGTTCACGCTGATGGCGACGGCAACCGTCACGCTGTTGTTAGGAAGTGTGCCGCTGTATGCGCAAACGGCGGACGTACAGCAAAAACTTGCCGAATTAGAGCGGCAGTCGGGAGGCAGACTGGGTGTGGCATTGATTAACACAGCAGATAATTCGCAAATACTTTATCGTGCTGATGAGCGCTTTGCGATGTGCAGCACCAGTAAAGTGATGGCCGCGGCCGCGGTGCTGAAGAAAAGTGAAAGCGAACCGAATCTGTTAAATCAGCGAGTTGAGATCAAAAAATCTGACCTTGTTAACTATAATCCGATTGCGGAAAAGCACGTCAAT
GGGACGATGTCACTGGCTGAGCTTAGCGCGGCCGCGCTACAGTACAGCGATAACGTGGCGATGAATAAGCTGATTGCTCACGTTGGCGGCCCGGCTAGCGTCACCGCGTTCGCCCGACAGCTGGGAGACGAAACGTTCCGTCTCGACCGTACCGAGCCGACGTTAAACACCGCCATTCCGGGCGATCCGCGTGATACCACTTCACCTCGGGCAATGGCGCAAACTCTGCGGAATCTGACGCTGGGTAAAGCATTGGGCGACAGCCAACGGGCGCAGCTGGTGACATGGATGAAAGGCAATACCACCGGTGCAGCGAGCATTCAGGCTGGACTGCCTGCTTCCTGGGTTGTGGGGGATAAAACCGGCAGCGGTGGCTATGGCACCACCAACGATATCGCGGTGATCTGGCCAAAAGATCGTGCGCCGCTGATTCTGGTCACTTACTTCACCCAGCCTCAACCTAAGGCAGAAAGCCGTCGCGATGTATTAGCGTCGGCGGCTAAAATCGTCACCGACGGTTTGTAATAGCGGAAACGGAATGGGGAAACTCATTCCGTTTTTGTTTATCGCCTTAGACGGCAAAAGTGCTGTCGCCCACCTGCGCTTGCGCATACCAGGCCATAAGCTCCGTGGTTCCTGGTTCTCCTTCCGCTGGAGCCCAGTGCGCATAGTCATCGGCAGCCACGGGTTGATAGCCACCGTGTTTTACTTCAAAAATTATGCCACCGGTATCCAGCGACAGCACGGCATGCCAGGTTCCTGCGGCCATCTCCAGCACCGTACAGGTTTCCCCCAATATCGCCCGATGGGTGACGGTACCCCGATCGTCAAAATTCAGCACCACGAAACGACCCCTTAATGGCAACAGTAGCTCGAAGGTGTGAGGGTGTCGGTGCGGGCGCACGTAGGTCCCAGGTCATATTCCTTCCGGCGTCCGGCATTTTACCGCCAGACAGCTCGGGATTCGTGATATCACCGTTCTTGCAGAATACGGTCAGAGGGAAAATACCCGCCGTGAGCATGCAGCGCTGATACGTCAGCACTATCAGTATCGTGAATTTGCCTGGCCCTGGACATTTCGCCTTACCCGTCTTTTATATACCCGGAGCTGGATAAGCAACGAACGTCCTGGCCTGCTTTTCGATCTGGCGACAGGGTGGCTTATGCAACATCGTATTATTCTCCCCGGAGCCACTACGCTGACCCGGTTGATTTCAGAGGTAAGGGAAAAGGCGACGTTGCGCCTGTGGAACAAACTGGCACTGATACCGTCAGCCGAACAGCGTTCACAGCTGGAGATGCTGCTGGGGCCAACTGATTGCAGCCGCCTGTCTTTACTGGAATCACTGAAAAAGGGCCCTGTGACCATCAGTGGTCCGGCGTTTAATGAAGCAATTGAACGCTGGAAAACTCTGAACGATTTTGGCCTGCATGCTGAAAACCTGAGTACACTCCCGGCTGTGCGCCTGAAAAATCTCGCACGTTATGCTGGTATGACTTCGGTGTTCAATATTGCCAGGATGTCACCGCAGAAAAGGATGGCGGTTCTGGTTGCCTTTGTCCTTGCATGGGAAACGCTGGCGCTGGATGATGCATTGGACGTTCTGGACGCCATGCTGGCCGTTATCATCCGTGACGCCAGAAAGATTGGGCAGAAAAAACGGCTCCGCTCGCTGAAGGATCTGGATAAATCTGCATTGGCGCTCGCCAGCGCATGTTCGTACCTGCTGAAAGAAGAAACACCGGACGAATCGATTCGTGCTGAGGTGTTCAGCTACATCCCAAGGCAAAAGCTGGCTGAAATCATCACGCTTGTCCGTGAAATTGCCCGGCCCTCAGACGATAATTTTCATGAAGAAATGGTGGAGCAGTACGGGCGCGTTCGTCGTTTCCTGCCCCATCTGCTGAATACCGTTAAATTTTCATCCGCACCTGCCGGGGTTACCACTCTGAATGCCTGTGACTACCTCAGCCGGGAGTTCAGCTCACGGCGGCAGTTT
TTTGACGACGCACCAACGGAAATTATCAGTCGGTCATGGAAACGGCTGGTGATTAACAAGGAAAAACATATCACCCGCAGGGGATACACGCTCTGCTTTCTCAGTAAACTGCAGGATAGTCTGAGGCGGAGGGATGTCTACGTTACCGGCAGTAACCGGTGGGGAGATCCTCGTGCAAGATTACTACAGGGTGCTGACTGGCAGGCAAACCGGATTAAGGTTTATCGTTCTTTGGGGCACCCGACAGACCCGCAGGAAGCAATAAAATCTCTGGGTCATCAGCTTGATAGTCGTTACAGACAGGTTGCTGCACGTCTTTGCGAAAATGAGGCTGTCGAACTCGATGTTTCTGGCCCGAAGCCCCGGTTGACAATTTCTCCCCTCGCCAGTCTTGATGAGCCGGACAGTCTGAAACGACTGAGCAAAATGATCAGTGATCTACTCCCTCCGGTGGATTTAACGGAGTTGCTGCTCGAAATTAACGCCCATACCGGATTTGCTGATGAGTTTTTCCATGCTAGTGAAGCCAGTGCCAGAGTTGATGATCTGCCCGTCAGCATCAGCGCCGTGCTGATGGCTGAAGCCTGCAATATCGGTCTGGAACCACTGATCAGATCAAATGTTCCTGCACTGACCCGACACCGGCTGAACTGGACAAAAGCGAACTATCTGCGGGCTGAAACTATCACCAGCGCTAATGCCAGACTGGTTGATTTTCAGGCAACGCTGCCACTGGCACAGATATGGGGTGGAGGAGAAGTGGCATCTGCAGATGGAATGCGCTTTGTTACGCCAGTCAGAACAATCAATGCCGGACCGAACCGCAAATACTTTGGTAATAACAGAGGGATCACCTGGTACAACTTTGTGTCCGATCAGTATTCCGGCTTTCATGGCATCGTTATACCGGGGACGCTGAGGGACTCTATCTTTGTGCTGGAAGGTCTTCTGGAACAGGAGACCGGGCTGAATCCAACCGAAATTATGACCGATACAGCAGGTGCCAGCGAACTTGTCTTTGGCCTTTTCTGGCTGCTGGGATACCAGTTTTCTCCACGCCTGGCTGATGCCGGTGCTTCGGTTTTCTGGCGAATGGACCATGATGCCGACTATGGCGTGCTGAATGATATTGCCAGAGGGCAATCAGATCCCCGAAAAATAGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCCCACATCTTTTGTCACCA
ACGAGCGGCTGCCTATCACCGCACCGTGCCCGATCTTGATTCCGGGCATGACCATTGCCTCAGAGCCGATCCAAACGTCATTGCCAATGACAGTATTACCTGCTTTTTGGAAGGCATCGAGTGCGCTTGAGAATGCAGGTTCTTCCTGCATATAAAAGAACGGGAAAGATGATGCCCAGTCGTACCGATGCCCCTGATTGCCAGCCATGATAAAGGAAGCCCCACTCCCGATAGAGCAGAAACTACCGATGATCAACTTATCAACGTCATCACGGTCCGGAAACAGATACCGTGCGCAGTCATCGAATGAGTGCCCATGATAGTAGCCAGAGTAATAGCTGTACCGCCCAACTTTGATATTGGGGTTCTTCACTTGCTCAGAAAGCAGCTTGCCTTTGAAGGGGCTATCAAAGTAGTTGGTCATAAGAGATCCCGCGGTCTGTGACTTTGCCGTCTAACGTTTGAAATAAGGGGCGCCGAGCGCCAGCGAGGGGAGCCAAAAGCTTGCTTTTGGCCGTCCCGACTTGATTGAAGGGTTGGGCGATTTTGCCATTAGATTTTTTATAAATTTAGTGTGTTTAGAATGGTGATCGCATTTTTCTTGGCTTTTATGCTTGATGTTAAATTCGACCCCAAGTTTCCTGTAAGTGCGGACACAAAAACATATTTATGTCCTGATTTGCTTATAATAAACCCTTCAAACCATCCGTTTTGTAAGGTTCTATTTGCTGTGAATCCTGCACCAGTTTTCCCATACAGTTTTGTACTATTATCCAGATCTTGTAGATACATGTTCTCTATGGTGTTTTCTATGGCTGAGTTTTTAACTGGGAGATTGTGATTAATAATTTTACGCAGGAATTGAATTTGTTCTTCTGGTGAAATTTTTAAGCTACTTTCGAGCCATGCTTCTGTTAATCCGTTGTTTCTTTCTTTATCTCCAGAGAAGTCTTGATTTCCATAATCAAAATCTTTGAGATAATTCTTGATTTTATTTAATCCAATTTTTTGGGTTATTTCTTGCGAAACCCAAACAACAGAAAATTGCATCCACGTCTTTGGTGTATGATTGCTGTTCCAGATCTCCATTCCTTTGGGGGTTTTATCCCATTTGAATATGGTTTTCTGATCTATTATTTCCGCATCAAATGCCATAAGTGATAATGCGATCTTGAAAGTTGAATCTGGTGCCATTTGCGTTGCACACTTTGCTTTATTGAATTGAGCAATTTCAGCGTTTGTGGATGCATCGTAAAGTAAAAAACAACCTTCAGTTCCTTCAAATAATGGAGATGCAACAGTAGAGATATCTGTTGATGCACTGGCGCTGCTGTAGATAATATTTGCAATTATTAAAAAAATAGCGAAGTTGATATGTATTGTGTTTTTCATAATAAGTATTGGTTTGGTAAAGGGCTTAATTTTAACGGCTAACAATTAATGAGGCTCCGGGTTCGCCCAACGTTTGACATGAGGGGCGGCCAAGGGCGCCAGCCCTTGGACGTCCCCCTCGATGGAAGGGTTAGGCATCACTGCGTGTTCGCTCGAATGCCTGGCGTGTTTGAACCATGTACACGGCTGGACCATATGGGGTGGTTACGGTACCTTGCCTCTCAAACCCCGCTTTCTCGTAGCATCGGATCGCTCGCAAGTTGCTCGGCGACGGGTCCGTTTGGATCTTGGTGACCTCGGGATCATTGAACAGCAACTCAACCAGAGCTCGAACCAGCTTGGTTCCCAAGCCTTTGCCCAGTTGTGATGCATTCGCCAGTAACTGGTCTATTCCGCGTACTCCTGGATCGGTTTCTTCTTCCCACCGTCCGTCCCCGCTTCCAAGAGCAACGTACGACTGGGCATACCCAATCGGCTCTCCATTCAGCATTGCAATGTATGGAGTGACGGACTCTTGCGCTAAAACGCTTGGCAAGTACTGTTCCTGTACGTCAGCAAGTGTCGGGCGTGCTTCTTCTCCGCCCCACCACTCGACGATATGAGATCGA
TTTAGCCACTCATAGAGCATCGCAAGGTCATGCTCAGTCATGAGGCGCAGTGTGACGGAATCGTTGCTGTTGGTCACGATGCTGTACTTTGTGATGCCTAACTTTGTTTTTGCGTTGCTCATGATGTCTAACTCCCAATTTGTGTAGGGCTTATTATGCACGCTTAAAGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCTAACAGCGGATGTTCGCGATCACCTGGACAAACTTCTGAGTGAAATGCTCGCCGGCAATATCAGTCGTTTCATCTGGCTTCGCAACTTCGAGGTTGGTAACAACTCGGCTGCTGCTAACCGTTTGCTCGACAGGCTCGAATTTCTGCGTACCCTGAATATCAATCATAGTGCTTTGGCCAGCATACCTGCCCATCGCATTGCCCGGCTGCGTCGGCAGGGTGAACGCTACTTCACCGACGGTTTGCGTGACATCACTTCGGACCGCCGCTGGGCGATCCTTGCCGTCTGTGTTGTGGAGTGGGAAGCGGCGATTGCTGATGCCATAGTCGAAACCCATGACAGGATCGTAGGAAAAACCTGGCGGGAAGCGAAGCGCCAGCATGACGAAACAATTTCCGGCTCTAAAGCCACACTCACGGATACGATCCGTACCTTCACCGCGCTGGGAGCTTCGTTGCTTGAGGCCCGCAGTGACGGAACCCCGCTGGAGATGGCTGTCGCCAGTTCGGTTGCATGGGACCGGCTCGCTCAACTGGTAGCGACAGGGACTCAACTCAGCAACACGCTAGCCGATGAGCCTCTTGCATATGTCGGGCAGGGATACCATCGCTTTCGTCGTTATGCGCCCCGCATGTTGCGCTGTCTGAAGCTCGAAGCCGCGCCGGTCGCCGGACCATTGGTAGCAGCAGCTTTGTCGATCGGAGAGATGAAAGGTGTTGCATCGCCAGAAAGGCGTTTCCTGCGGCCCAGCTCCAAATGGAACCGTCATTTACGAGCTCAGGAAAAAGGAGATACCCGTCTTTGGGAAGTGGCGGTACTCTTTCACCTCCGGGATGCTTTTCGTTCCGGAGATGTCTGGCTCGCTCATTCGCGCCGCTATGGTGACCTCAAGCAGGTACTGGTGCCGATGATCGCGGCGCAGGAAAATGCAAAACTGGCCGTGCCTTCCAACCCACAGGATTGGCTGGCAGACAGAAAGGCGCGACTCACGATCGCTCTTAAGCGGCTGGC
CCGGGCTGCCCGTAACGGCACTATTCCGCACGGTAGCATAGAAGATGGAACGTTGCGGATCGACAGGTTGACAGCAGACGTGCCGGATGGTGCCGAGGCACTCATACTGGATCTGTATCGCCGAATGCCGTCCGTTCGGATTACCGACATGCTGCTTGAAGTTGATGCAGCCCTTGGTTTCACAGATGCGTTTACCCATCTGAGAACCGGGGCTCCATGTCGCGACCGGATCGGTCTGCTCAACGTCCTGCTCGCTGAAGGGCTCAATCTGGGCCTGCGTAAGATGGCGGAAGCTACAAACACGCATGATTACTGGCAGCTCTCACGCCTTGCCCGCTGGCATGTTGAAAGCGAAGCCATGAACCAGGCATTGGCAATTGTGGTGGCCGCGCAGGGTAAACTGCCGATGTCACGCGTCTGGGGGATGGGCACGTCAGCATCGAGCGATGGTCAGTTTTTCCCGACAGCGCGGCATGGCGAAGCCATGAACATGGTCAATGCCAAATATGGTTCTGTTCCCGGCCTCAAAGCGTATACTCACGTAAGCGACCAGTTCGCGCCATTCGCTTGTCAGTCGATCCCGGCGACCGTGAGCGAGGCACCGTATATTCTCGATGGACTACTGATGAACGAGGTCGGTCGCCATGTTCGCGAACAGTATGCCGATACAGCAGGATTCACCGACCATTTGTTCGGAGCCAGTAGCCTGCTCGGCTACAATCTCGTTCTGCGAATCAGGGATCTGCCATCGAAGCGGTTGTACGTATTTAATCCCGATACGACCCCCAGGGAGTTACGCAAGTTGGTAGGTGGAAAAGCCCGGGAGGATCTTATCGTTGCGAACTGGCCTGATATTTTCCGTTGTGCCGCGACGATGACCGCTGGCAAAATCAGGCCCAGCCAACTCCTGCGCAAGCTCGCTTCTTACCCACGACAAAACAACCTTGCAGTTGCGCTTCGTGAAGTTGGTCGTATTGAACGGACCCTTTTCATTATTGAGTGGATCCTGGATACGGACATGCAGCGGCGTGCTCAGATCGGTCTTAACAAGGGAGAGGCCCACCATGCGCTCAAAAATGCGCTCCGTATCGGGAGGCAGGGGGAAATTCGCGATCGCACGACAGAGGGGCAGCACTACCGAATCGCTGGGCTCAATTTATTGACTGCGGTGATCATTTACTGGAATACCGTCCATCTTGGTCATGCCGTCACGGAGCGGCGGAACGAAGGGTTGGATGTTCCCCCTGAATTTCTTCCCCACATATCCCCATTGGGCTGGGCGCACATTCTACTGACTGGCGAATATCTTTGGCCCAAGGAACCGAAAGCTTAGGGTGTCATTTCGCCCTCAGCCGGAACCGACCCCTTTTAGCCAAATAACGTTTGGGAATCACCAGAATGGTGGGACAACAGCGGTTTTAGTGCCCTAAATCGTACGTTTTCATCCAGTTGCCCCTCAAACCCCATGTTCAAGTCAGAATAGTGGACAGGCGGCCAAGAACTTCGTTCATGATAGTCTCCGGAACCCGTTCGAGTCGTTTTCCGCCCCGTGCTTTCATATCAATTGTCCGGGGTTGATCGCAACGTACAACACCTGTGGTACGTATGCCAACACCATCCAACGACACCGCAAAGCCGGCAGTGCGGGCAAAATTGCCTCCGCTGGTTACGGGCACAACAACAGGCAGGCGGGTCACGCGATTAAAGGCCGCCGGTGTGACAATCAGCACCGGCCGCGTTCCCTGCTGCTCATGACCTGCGGTAGGATCAAGCGAGACAAGCCAGATTTCCCCTCTTTCCATGTCAGATTTCCTCCTGACCAGTCGCCGGTGCATCCAGCCATTCTCGTTCTTCAGCTGATATTTCAGCATTCGGATCACACTGTGCCAGTAGCTCAGCCAGTGAATATTGCGGGCGTCTGTACGGCTCAACAATCAGCCGGCCATTATCAATGACCATGCCAACTTCATTATCTGTGCCCAGAGACAGCGCATTCAGCA
GTGCCGGTGGGACGGTCAGCATAACTGAGCCGCCAACCCTCTTCAGTCGGGTGGTATGCATTCTTCACCTCCATAAAAGTTATATTTAAATATAACATCCACTAAAAAAACACACCAGGCTTTAACGCACAATGTTTAATAAAAATATAACTTTCAACCAAACAGTAAACCCAGCGTGGTTGCTTCCATATGCAGCAATACCGGCAGCAGCAGGCCACCACTTCTGATCCTGGCCACTGATGTAATCAACCCCACCAGGAACAGTTCTGCCAGTGTCAGCAGGTTCTGATACTGGCTGTGCGCGGCGACGAACAACAACGACGTTATCAGCGCCCCCAGCCACATCGTCCAGCAGTACCGTGAACGGAAGACGTTCAGCATAATCCCCCGGAACAGCGTTTCCTCATTCAACGGGGCAAGGATAAAGATGGTCAGCAACGTCAGGATCACGTCAGGTATGGACTTATCGGCAAAAAGTTTCGTCATAAATGGCTCAGCAGGCAGAGCCAGCGCCTTACCGAGCAGAAATACACCGACATACACCACGGCCATCGCACCGACCAGCCACGGTACGCCAACGTTGCGCAGCTGACCAACGACCGGTAGCGGAGCTATCCAACGGCGGTATACCAGGAAAACACACAGCAGGTACATCAGAACAGTACCATGACTGAAGAACAAATAGTTTTTTCCTGATCCATAAAGCAGAACGGCCTGCTCCATGACAAATCTGGCTCCCCAACTAATGCCCCATGCAGCCAGCATAACCAGCATAAACTGCAGATATTGATTACGTGTTTGAATCATTGCATCGCCTGTAAATTTTTAACTTGTCCTATTTTTGTCATTACCACGTATATACACATGTATAACAATTCAGATATCGTTACCAGGATATGCCGCATCAGCGGCATGGAAGGCGGCACTCTGTTGTTTCATATGATACAGGAGTAAAACCGCCGAAGCCCGGCGTAAGCCGGTACTGATTGATAGATTTCACCTTACCCATCCCCAGCCCTGCCAGACCATACCCGCTTTCAGCCATGAGAGAGCTTCTGTGCGCGGTCGGAGTGGTCCCGACGAGGGTTTACCCGAAGTCGGGGCGTATCTCCGCGTTAGCGGGCCGTGAGGGCCGCTTACGAGCGTGTACTGAGAACTTCCAGCGAGAAGACTGACAGCGATGAAGATGTAGTTACAACATTCATAATTAAAAGCGACTCTGTTCCGGCCCGAAGGGCCGGGGCGGGGCCGCTTTTCAGTTATGAGGGAGGGGCTTTGTGGTTTCAGTTCTGCGCTGGTTCGGGGTTTTTCTGGAGGTTGGTTTTGTGTGTTGTAACTAAAGTGGCTCCGGTTGGGGCCCGCCGTTTACGGTGGGAGGTGCATATCTGTCTGTCCACAGGACAAGCAGTGAATAGGTTTTCTTTTTAAATGAATGTAATTAAGTAGTTTAAAGGAGATATAAACAGGTGTTTAAAAGATACATTGCACCCTGTAGGGCTGACGGCTGGCGCTTTATGACATTAACGATTGTAACCTTATGGGGAAGTCCCTTGCAGTTTAATGTGGATAAGCAAAATTACCCGTCTGTGAGGCGTGTTTTGTATCAAAAACAAGGGGGACCGGATGCACCTGAAGGTGGATGATGAGGTTGTTTTTTTGTATGTGGTGCTGATTTTTTGTGCACTGGCGGGCTTCAGGCGTGCGAATGCCTCCGGCGCGTGCCGAATTATTCAGAGGAGGTCACTTTCAGGGGGAAGCTGTGGCCAGGCGGCTGTAATTGCGGTTACGTGACAGAATCATGCGCTCCTTCACACGACGCTCCACTTCGCGTTTTACCGCCTCACGATTGGCAGTGAAGCGCCCTTCCGAGATTTCACGCGTCAGCTGTCGTTTCACCAGGGTGACGATATCCTGACGTTTCCTGTTCGCATCACGACGCGCACGGGCACGCTTTATTCCCCGGGACTTAAGCTCTGTTTGGTAACTGCGGAAACGCTCAC
GCACAAAACGCCAGGCTTTCGCTATCAGTTCATCCATACCCAGGGTATCCAGCCCCTGCTTTTTGCGCTGTTTGTTTTCCCATACCACACGGCTGCGGCGCGCAGCTGCCACTGCATCCTCAGACACATCAAGGGCGGCAAACAGCGCCAGTGTGAACGTGATATCGGTCGGAATGTAGCACCCGATAAGCGGGTCATATTCCGTCTGGTAAGTAATCAGTCCCAGCTCTGACAGGAACGTCAGGGCCCGGGTGGCCCGGGTGATGGAGAGTTTTCCTGCACCGGACTCTGTCGCCAGTCCGCACTCAATGGCCAGCGTGGTGATGGAACACTGGACGCGGTTGGCCAGCGGGTCATAATGGAAACACAGCCCCTGCAGCAGCGCATCAATAGCCCGTCGACGCAGCACCGGTGGCATGCGTCGACGCAGACCACGCGAACGGGCATGCGCCACATGAATAGCGAAATCAAAACGGGAGCTGAAGCCCACCGCTTTTTCCATCAGTTTTTCGCAGAACTTCAACGTTCCGGCACCTTCACGGGGTGTGAACACCGGATTCGGGTTCTTTACCTGGCGGTAATACGTTTGGTGAAGATCAGTCACACCATCCTGCACTTATGTTGCACAGAAGGAGTGAGCACAGAAAGAAGTCTTGAACTTTTCCGGTCATATAACTATACTCCCCGCATAGAGCAACAGCTTCTATGCAGTTTCTTGTTAGCCCCGGTAATCTTCTCTTAGTCGCCAAACCTGGTGAAGATTATCGGGGTTTTTGCTTTTCTGGCTCCTGTAGATCCACATCAGAACCAGTTCCCTGCCACCTTACGGCGTGGCCAGCTGCGTATTTTCATGAAAGGAGATCACTCAATAACTTCCATCGAGATCGGGTAATAACATTTGAACAGATCGCTGAATAACATCGATGGAGATCACTTTTGACTCATTTTGTTATTCAGTGATCTCCATCAATGTTATTGGAACTTCACAGGTGTGTTGATCTGTATCTTTTGCCATTCCGGTAAAGGATACCTATGCCAACAGTTCCAATTTCTATGAGAAAACTTAAAGAAATTCTTAGGCTTAAATACGGTGTTGGACTCAGCCATCGACAAATTGGTCGTAGTCTTGCAATCTCCCCTTCCGTTGTATCCAGATATGCTAATCGGGCGGCTCAACTTGGCATAAAGCAGTGGCCCTTACCTACAGGATGGGATGATACAAAACTAAAACATGCGTTCCTTCAGACCCAGGTTAAGATGAAGAAGCACTCTCTGCCTGACTGGGCTACAGTACACCGGGAACTGCGTAATAAATGCGTGACGCTGCAGCTACTCTGGGAAGAATACTGTGAGCGTAATCCAGGCGGTTTTTACAGCTATAACCATTACTGCCGGATGTACCGTGAATGGCTCAAAACCACTTCACCATCAATGCGTCAGGTACATAAAGCTGGCGAAAAACTTTTCGTTGATTACTGTGGACCTACCGTTGGCGTTACCGACCCTGAGACCGGAGAAATAAGAACTGCTCAGGTCATCGTAGCTGTTCTCGGGGCATCAAGTTACACATGGGCAGAGGCCACCTGGTCTCAGCAGCTTGAAGACTGGGTGATGAGTCATGTTCGCTGCTTCCAGTGGTTGGGTGGCGTTCCTGAACTTGTTGTTCCGGACAATCTGAAAAGCGCCACATCCAGGGCATGTAAGTATGATCCTGACGTTAACCCTACCTACCAGCAGATGCTTGAGCATTATAATGTCGCAGTTTTGCCTGCGCGGCCACGTAAACCGAAAGATAAAGCCAAAGCTGAAGTTGGCGTTCAGGTTGTTGAACGCTGGATCATGGCCCGAATCAGGCATGAGATCTTCTACAGCCTTGCATCGCTTAATCAGCGCATTCGGGAGTTGCTGGAAAGACTGAATAACAAAATAATGCAGAAGTTGGGTTATTCACGTGCAGAACTCTTCATCCAGCTTGATAAACCCGCACTGAAGCCTCTTCC
TGAAGCCAGTTACAGTTACACCCTGGTGAAGAAAGTCAGAGTTCATGCCGATTACCACGTGGAAATCGACAAACATTACTACTCGGTTCCATGTTCGCTGTTAGGCCAGCAACTGGAAGCATGGATCTCCGGAGAACTGGTAAGACTCTTCAATCAGGGGCAGGAGGTTGCTGTGCACCCGCGCAAGCGTACTTATGGCTACAGTACCCGCAACGAGCACATGCCTGAAGCTCATCGACAGCATGCCACCTGGACGCCAGAGCGTCTTCTGGAATGGGCGGGGCACATAGGCAGTGAAACTCATAGTTATGTGCTTCATATACTGAACTCTCGTCCACATCCGGAACAAAGCTATCGCTTCTGCCTTGGACTCCTGAACCTTCATAAAAAATACAGTAAAGCCAGACTTAATGCAGCATGTGCAAGAGCTCTGAAAACAAAGGTATGGCGTCTGTCAGGTATTAAATCGATCCTGGAAAAAGGTCTGGATAAACAACCTGTTCAGGATCCAAAACCAGATCTGTTATCCACGATGGAACACGAAAACGTACGCGGCAGTGAGTATTACCACTGA
Protein sequences of DBSCAN-SWA_1 >CP034956|8609:57113|30080_30884_-|QAS83331.1|DBSCAN-SWA MNRTNIFFGESHSDWLPVRGGESGDFVFRRGDGHAFAKIAPASRRGELAGERDRLIWLKGRGVACPEVINWQEEQEGACLVITAIPGVPAADLSGADLLKAWPSMGQQLGAVHSLSVDQCPFERRLSRMFGRAVDVVSRNAVNPDFLPDEDKSTPQLDLLARVERELPVRLDQERTDMVVCHGDPCMPNFMVDPKTLQCTGLIDLGRLGTADRYADLALMIANAEENWAAPDEAERAFAVLFNVLGIEAPDRERLAFYLRLDPLTWG >CP034956|8609:57113|21687_21879_+|QAS83322.1|DBSCAN-SWA MAIALMGAGFSATDTSDAVNILYPITMTVQANKAWQASGLKKSFFLPSHKKVRGGEKISYARL >CP034956|8609:57113|14570_15275_-|QAS83313.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034956|8609:57113|16671_17256_-|QAS83316.1|DBSCAN-SWA MPRPKLKSDDEVLEAATVVLKRCGPIEFTLSGVAKEVGLSRAALIQRFTNRDTLLVRMMERGVEQVRHYLNAIPIGAGPQGLWEFLQVLVRSMNTRNDFSVNYLISWYELQVPELRTLAIQRNRAVVEGIRKRLPPGAPAAAELLLHSVIAGATMQWAVDPDGELADHVLAQIAAILCLMFPEHDDFQLLQAHA >CP034956|8609:57113|11605_12394_-|QAS83310.1|DBSCAN-SWA MGEFFPAQVFKQLSHARAVIERHLAATLDTIHLFGSAIDGGLKPDSDIDLLVTVSAAPNDSLRQALMLDLLKVSSPPGDGGTWRPLELTVVARSEVVPWRYPARRELQFGEWLRHDILSGTFEPAVLDHDLAILLTKARQHSLALLGPSAATFFEPVPKEHFSKALFDTIAQWNAESDWKGDERNVVLALARIWYSASTGLIAPKDVAAAWVSERLPAEHRPLICKARAAYLGSEDDDLAMRVEETAAFVRYAKATIERILR >CP034956|8609:57113|27032_27893_+|QAS83327.1|DBSCAN-SWA MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLSRVDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW >CP034956|8609:57113|24990_25857_+|QAS83325.1|transposase|DBSCAN-SWA 
MCELDILHDSLYQFCPELHLKRLNSLTLACHALLDCKTLTLTELGRNLPTKARTKHNIKRIDRLLGNRHLHKERLAVYRWHASFICSGNTMPIVLVDWSDIREQKRLMVLRASVALHGRSVTLYEKAFPLSEQCSKKAHDQFLADLASILPSNTTPLIVSDAGFKVPWYKSVEKLGWYWLSRVRGKVQYADLGAENWKPISNLHDMSSSHSKTLGYKRLTKSNPISCQILLYKSRSKGRKNQRSTRTHCHHPSPKIYSASAKEPWVLATNLPVEIRTPKQLVKALLQS >CP034956|8609:57113|55571_57113_+|QAS83348.1|transposase|DBSCAN-SWA MPTVPISMRKLKEILRLKYGVGLSHRQIGRSLAISPSVVSRYANRAAQLGIKQWPLPTGWDDTKLKHAFLQTQVKMKKHSLPDWATVHRELRNKCVTLQLLWEEYCERNPGGFYSYNHYCRMYREWLKTTSPSMRQVHKAGEKLFVDYCGPTVGVTDPETGEIRTAQVIVAVLGASSYTWAEATWSQQLEDWVMSHVRCFQWLGGVPELVVPDNLKSATSRACKYDPDVNPTYQQMLEHYNVAVLPARPRKPKDKAKAEVGVQVVERWIMARIRHEIFYSLASLNQRIRELLERLNNKIMQKLGYSRAELFIQLDKPALKPLPEASYSYTLVKKVRVHADYHVEIDKHYYSVPCSLLGQQLEAWISGELVRLFNQGQEVAVHPRKRTYGYSTRNEHMPEAHRQHATWTPERLLEWAGHIGSETHSYVLHILNSRPHPEQSYRFCLGLLNLHKKYSKARLNAACARALKTKVWRLSGIKSILEKGLDKQPVQDPKPDLLSTMEHENVRGSEYYH >CP034956|8609:57113|39733_40594_-|QAS83337.1|DBSCAN-SWA MSIQHFRVALIPFFAAFCLPVFAHPETLVKVKDAEDQLGARVGYIELDLNSGKILESFRPEERFPMMSTFKVLLCGAVLSRVDAGQEQLGRRIHYSQNDLVEYSPVTEKHLTDGMTVRELCSAAITMSDNTAANLLLTTIGGPKELTAFLHNMGDHVTRLDRWEPELNEAIPNDERDTTMPAAMATTLRKLLTGELLTLASRQQLIDWMEADKVAGPLLRSALPAGWFIADKSGAGERGSRGIIAALGPDGKPSRIVVIYTTGSQATMDERNRQIAEIGASLIKHW >CP034956|8609:57113|33674_34379_-|QAS83334.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034956|8609:57113|35338_35980_-|QAS83335.1|DBSCAN-SWA MTEIIKDLYQFTEVMEPIKLSMHQYLLMTNEPVLIQTGAVSQAQTTIPKLQELLGERKIKYILISHFESDECGGLALVLKEHPEAVAVCSETTARQLMGFGITNNVLIKKPNEIFAGDDFEFQTISYPSEMHMWEGLLFFEKKRGIFFSSDLMFGMGENHGQVIESSWDAAVKSSGADTLPNQESGQKLSSDLSEIEPKFVASGHGFCITIVG >CP034956|8609:57113|28479_29184_+|QAS83329.1|transposase|DBSCAN-SWA 
MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034956|8609:57113|36552_37113_+|QAS83336.1|DBSCAN-SWA MTGQRIGYIRVSTFDQNPERQLEGVKVDRAFSDKASGKDVKRPQLEALISFARTGDTVVVHSMDRLARNLDDLRRIVQTLTQRGVHIEFVKEHLSFTGEDSPMANLMLSVMGAFAEFERALIRERQREGIALAKQRGAYRGRKKSLSSERIAELRQRVEAGEQKTKLAREFGISRETLYQYLRTDQ >CP034956|8609:57113|47100_47931_-|QAS83388.1|DBSCAN-SWA MKNTIHINFAIFLIIANIIYSSASASTDISTVASPLFEGTEGCFLLYDASTNAEIAQFNKAKCATQMAPDSTFKIALSLMAFDAEIIDQKTIFKWDKTPKGMEIWNSNHTPKTWMQFSVVWVSQEITQKIGLNKIKNYLKDFDYGNQDFSGDKERNNGLTEAWLESSLKISPEEQIQFLRKIINHNLPVKNSAIENTIENMYLQDLDNSTKLYGKTGAGFTANRTLQNGWFEGFIISKSGHKYVFVSALTGNLGSNLTSSIKAKKNAITILNTLNL >CP034956|8609:57113|11052_11400_-|QAS83309.1|DBSCAN-SWA MKGWLFLVIAIVGEVIATSALKSSEGFTKLAPSAVVIIGYGIAFYFLSLVLKSIPVGVAYAVWSGLGVVIITAIAWLLHGQKLDAWGFVGMGLIIAAFLLARSPSWKSLRRPTPW >CP034956|8609:57113|12524_12998_-|QAS83387.1|DBSCAN-SWA MKISLISAVSENGVIGSGPDIPWSVKGEQLLFKALTYNQWLLVGRKTFDSMGVLPNRKYAVVSKNGISSSNENVLVFPSIENALKELSKVTDHVYVSGGGQIYNSLIEKADIIHLSTVHVEVEGDIKFPIMPENFNLVFEQFFMSNINYTYQIWKKG >CP034956|8609:57113|9888_10092_-|QAS83306.1|DBSCAN-SWA MDSEEPPNVRVACSGDIDEVVRLMHDAAAWMSAKGTPAWDVARIDRTFAETFVLRSELLGIASENGK >CP034956|8609:57113|32067_32919_-|QAS83333.1|DBSCAN-SWA MVKPKNKHSLSHVRHDPAHCLAPGLFRALKRGERKRSKLDVTYDYGDGKRIEFSGPEPLGADDLRILQGLVAMAGPNGLVLGPEPKTEGGRQLRLFLEPKWEAVTADAMVVKGSYRALAKEIGAEVDSGGALKHIQDCIERLWKVSIIAQNGRKRQGFRLLSEYASDEADGRLYVALNPLIAQAVMGGGQHVRISMDEVRALDSETARLLHQRLCGWIDPGKTGKASIDTLCGYVWPSEASGSTMRKRRQRVREALPELVALGWTVTEFAAGKYDITRPKAAG >CP034956|8609:57113|42188_43064_+|QAS83339.1|DBSCAN-SWA 
MVKKSLRQFTLMATATVTLLLGSVPLYAQTADVQQKLAELERQSGGRLGVALINTADNSQILYRADERFAMCSTSKVMAAAAVLKKSESEPNLLNQRVEIKKSDLVNYNPIAEKHVNGTMSLAELSAAALQYSDNVAMNKLIAHVGGPASVTAFARQLGDETFRLDRTEPTLNTAIPGDPRDTTSPRAMAQTLRNLTLGKALGDSQRAQLVTWMKGNTTGAASIQAGLPASWVVGDKTGSGGYGTTNDIAVIWPKDRAPLILVTYFTQPQPKAESRRDVLASAAKIVTDGL >CP034956|8609:57113|48061_48616_-|QAS83389.1|DBSCAN-SWA MTNSNDSVTLRLMTEHDLAMLYEWLNRSHIVEWWGGEEARPTLADVQEQYLPSVLAQESVTPYIAMLNGEPIGYAQSYVALGSGDGRWEEETDPGVRGIDQLLANASQLGKGLGTKLVRALVELLFNDPEVTKIQTDPSPSNLRAIRCYEKAGFERQGTVTTPYGPAVYMVQTRQAFERTRSDA >CP034956|8609:57113|16369_16726_-|QAS83315.1|DBSCAN-SWA MFNVSRTRRFPTPPGTCVNGGVQSPCGRRRTRPSSISTGTVGGIEVLITTQVYGLQMVTIPILASEHRADSLTVNHSRKVVPVWVSRVETILRQCAQQTTWPSQVVGSGVADELDKLV >CP034956|8609:57113|29244_30081_-|QAS83330.1|DBSCAN-SWA MFMPPVFPAHWHVSQPVLIADTFSSLVWKVSLPDGTPAIVKGLKPIEDIADELRGADYLVWRNGRGAVRLLGRENNLMLLEYAGERMLSHIVAEHGDYQATEIAAELMAKLYAASEEPLPSALLPIRDRFAALFQRARDDQNAGCQTDYVHAAIIADQMMSNASELRGLHGDLHHENIMFSSRGWLVIDPVGLVGEVGFGAANMFYDPADRDDLCLDPRRIAQMADAFSRALDVDPRRLLDQAYAYGCLSAAWNADGEEEQRDLAIAAAIKQVRQTSY >CP034956|8609:57113|25890_26595_-|QAS83326.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034956|8609:57113|52692_53346_-|QAS83345.1|protease|DBSCAN-SWA MIQTRNQYLQFMLVMLAAWGISWGARFVMEQAVLLYGSGKNYLFFSHGTVLMYLLCVFLVYRRWIAPLPVVGQLRNVGVPWLVGAMAVVYVGVFLLGKALALPAEPFMTKLFADKSIPDVILTLLTIFILAPLNEETLFRGIMLNVFRSRYCWTMWLGALITSLLFVAAHSQYQNLLTLAELFLVGLITSVARIRSGGLLLPVLLHMEATTLGLLFG >CP034956|8609:57113|10219_11059_-|QAS83308.1|DBSCAN-SWA MVTVFGILNLTEDSFFDESRRLDPAGAVTAAIEMLRVGSDVVDVGPAASHPDARPVSPADEIRRIAPLLDALSDQMHRVSIDSFQPETQRYALKRGVGYLNDIQGFPDPALYPDIAEADCRLVVMHSAQRDGIATRTGHLRPEDALDEIVRFFEARVSALRRSGVAADRLILDPGMGFFLSPAPETSLHVLSNLQKLKSALGLPLLVSVSRKSFLGATVGLPVKDLGPASLAAELHAIGNGADYVRTHAPGDLRSAITFSETLAKFRSRDARDRGLDHA 
>CP034956|8609:57113|19517_20222_-|QAS83319.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034956|8609:57113|43110_43443_-|QAS83340.1|DBSCAN-SWA MRPHRHPHTFELLLPLRGRFVVLNFDDRGTVTHRAILGETCTVLEMAAGTWHAVLSLDTGGIIFEVKHGGYQPVAADDYAHWAPAEGEPGTTELMAWYAQAQVGDSTFAV >CP034956|8609:57113|13155_14169_+|QAS83311.1|integrase|DBSCAN-SWA MKTATAPLPPLRSVKVLDQLRERIRYLHYSLRTEQAYVHWVRAFIRFHGVRHPATLGSSEVEAFLSWLANERKVSVSTHRQALAALLFFYGKVLCTDLPWLQEIGRPRPSRRLPVVLTPDEVVRILGFLEGEHRLFAQLLYGTGMRISEGLQLRVKDLDFDHGTIIVREGKGSKDRALMLPESLAPSLREQLSRARAWWLKDQAEGRSGVALPDALERKYPRAGHSWPWFWVFAQHTHSTDPRSGVVRRHHMYDQTFQRAFKRAVEQAGITKPATPHTLRHSFATALLRSGYDIRTVQDLLGHSDVSTTMIYTHVLKVGGAGVRSPLDALPPLTSER >CP034956|8609:57113|55293_55431_+|QAS83347.1|DBSCAN-SWA MKIIGVFAFLAPVDPHQNQFPATLRRGQLRIFMKGDHSITSIEIG >CP034956|8609:57113|21436_21682_-|QAS83321.1|DBSCAN-SWA MQAYRAMVNSYSLSDDSGVMAAAAITHFLFGQAVFSYLNGWSVLIGPGTGLDSTGCKYARDLMGLVAFTAFIVTFLFRGYS >CP034956|8609:57113|8609_9314_+|QAS83305.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034956|8609:57113|48759_49464_-|QAS83342.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034956|8609:57113|28042_28468_+|QAS83328.1|transposase|DBSCAN-SWA MAEQGKELPGYVQREFEEFLQCGRLEHGFLRVRCESCHAEHLVAFSCKRRGFCPSCGARRMAESAALLVDEVLPEQPMRQWVLSFPFQLRFLFGVVCGKGRNPTLRLWPAIFSGEIDVFPGDRRRALLQIVGGDKLIIPFC >CP034956|8609:57113|10110_10290_+|QAS83307.1|DBSCAN-SWA 
MNRRLIVQFDPDEPSMSAPTFAEPDQGPLAGGRKVNARHDLTLGLWRRDCEISRGFPRR >CP034956|8609:57113|20258_21386_-|QAS83320.1|DBSCAN-SWA MSQLSQLRSPAAVQAAIDEFVQLGRTKFLARHGYGKSRDFLVRDPKTGTDCDSKAIAGVAFGKQFPEQGPLTADSFSGGEATVVPALTRLGFRIIRIGEDWSEEEVLATVEDYFDMLRAEAAGEPYNKSEHNQALRQLLNGRSKSSVELKHQNISAVLDALGLPYINGYKPRGNSQLLLRKSVHAYVLEHQQTVGALVDALEEVKLPGDKTYRAALVEPPAREVLVRTPASLRQRLPRKFDYAARDEANRKLGRAGEQWVIGYEQQRLTELGHPELFQRLDWVSDTQGDGAGFDILSFEEDAHERFIEVKTTNGGVGSSFLVSHNELEFSKEAGDQFHLYRVFQFRDGPRLFTLPGDLSQHVHLKPTGTVANSRW >CP034956|8609:57113|54285_55143_-|QAS83346.1|DBSCAN-SWA MTDLHQTYYRQVKNPNPVFTPREGAGTLKFCEKLMEKAVGFSSRFDFAIHVAHARSRGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITTLAIECGLATESGAGKLSITRATRALTFLSELGLITYQTEYDPLIGCYIPTDITFTLALFAALDVSEDAVAAARRSRVVWENKQRKKQGLDTLGMDELIAKAWRFVRERFRSYQTELKSRGIKRARARRDANRKRQDIVTLVKRQLTREISEGRFTANREAVKREVERRVKERMILSRNRNYSRLATASP >CP034956|8609:57113|41102_41807_+|QAS83338.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034956|8609:57113|14433_14625_+|QAS83312.1|DBSCAN-SWA MRHGAGFPVRQADAGGGLRRLRQPMEEHRHCCKVSDEAAFCLIQRPYISKTLLTRRISPRGSP >CP034956|8609:57113|52008_52341_-|QAS83343.1|DBSCAN-SWA MERGEIWLVSLDPTAGHEQQGTRPVLIVTPAAFNRVTRLPVVVPVTSGGNFARTAGFAVSLDGVGIRTTGVVRCDQPRTIDMKARGGKRLERVPETIMNEVLGRLSTILT >CP034956|8609:57113|55135_55210_-|QAS83390.1|DBSCAN-SWA MTGKVQDFFLCSLLLCNISAGWCD >CP034956|8609:57113|18490_19396_-|QAS83318.1|DBSCAN-SWA MTVVTTADTSQLYALAARHGLKLHGPLTVNELGLDYRIVIATVDDGRRWVLRIPRRAEVSAKVEPEARVLAMLKNRLPFAVPDWRVANAELVAYPMLEDSTAMVIQPGSSTPDWVVPQDSEVFAESFATALAALHAVPISAAVDAGMLIRTPTQARQKVADDVDRVRREFVVNDKRLHRWQRWLDDDSSWPDFSVVVHGDLYVGHVLIDNTERVSGMIDWSEARVDDPAIDMAAHLMVFGEEGLAKLLLTYEAAGGRVWPRLAHHIAERLAFGAVTYALFALDSGNEEYLAAAKAQLAAAE >CP034956|8609:57113|17255_18494_-|QAS83317.1|DBSCAN-SWA 
MSERRYSPLATLFAATFLFRIGNAVAALALPWFVLSHTKSAAWAGATAASSVIATIIGAWVGGGLVDRFGRAPVALISGVVGGVAMASIPLLDAVGALSNTGLIACVVLGAAFDAPGMAAQDSELPKLGHVAGLSVERVSSLKAVIGNVAILGGPALGGAAIGLLGAAPTLGLTAFCSVLAGLLGAWVLPARAARTMTTTATLSMRAGVAFLWSEPLLRPLFGIVMIFVGIVGANGSVIMPALFVDAGRQVAELGLFSSMMGAGGLLGIAIHASVGARISAQNWLAVAFCGSAVGSLLLSQLPGVPVLMLLGALVGLLTGSVSPILNAAIYNRTPPELLGRVLGTVSAVMLSASPMVMLAAGAFVDLAGPLPGLVVSAVFAGLVALLSLRLQFATMAAAATASAPTHTEGEH >CP034956|8609:57113|30944_31760_-|QAS83332.1|DBSCAN-SWA MNKSLIIFGIVNITSDSFSDGGRYLAPDAAIAQARKLMAEGADVIDLGPASSNPDAAPVSSDTEIERIAPVLDALKADGIPVSLDSYQPATQAYALSRGVAYLNDIRGFPDAAFYPQLAKSSAKLVVMHSVQDGQADRREAPAGDIMDHIAAFFDARIAALTGAGIKRNRLVLDPGMGFFLGAAPETSLSVLARFDELRLRFDLPVLLSVSRKSFLRALTGRGPGDVGAATLAAELAAAAGGADFIRTHEPRPLRDGLAVLAALKETARIR >CP034956|8609:57113|22915_23776_-|QAS83324.1|DBSCAN-SWA MHTRKAITEALQKLGVQTGDLLMVHASLKAIGPVEGGAETVVAALRSAVGPTGTVMGYASWDRSPYEETLNGARLDDEARRTWLPFDPATAGTYRGFGLLNQFLVQAPGARRSAHPDASMVAVGPLAETLTEPHELGHALGEGSPVERFVRLGGKALLLGAPLNSVTALHYAEAVADIPNKRWVTYEMPMLGRDGEVAWKTASDYDSNGILDCFAIEGKPDAVETIANAYVKLGRHREGVVGFAQCYLFDAQDIVTFGVTYLEKHFGTTPIVPPHEAVERSCEPSG >CP034956|8609:57113|45764_46469_+|QAS83341.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034956|8609:57113|22360_22903_-|QAS83323.1|DBSCAN-SWA MIIWINGPFGAGKTTLAKRLRDRRSKSLIFDPEEIGFVVKETVPMPASGDYQDLPLWRGLTIAAVREIRRNYSQDIIIPMTLVHPDYLTEILDGVRRIDDQLLHIFLTLNEDLLRHRIANQTMHPDPNRNAEIREWRLANVARCLAARERLPCTTRVLDSGAHTSDELAAMVLDGIDGRT >CP034956|8609:57113|52342_52600_-|QAS83344.1|DBSCAN-SWA MHTTRLKRVGGSVMLTVPPALLNALSLGTDNEVGMVIDNGRLIVEPYRRPQYSLAELLAQCDPNAEISAEEREWLDAPATGQEEI >CP034956|8609:57113|15414_16179_+|QAS83314.1|transposase|DBSCAN-SWA 
MTDFKWRHFQGDVILWAVRWYCRYPISYRDLEEMLAERGISVDHTTIYRWVQCYAPEMEKRLRWFWRRGFDPSWRLDETYVKVRGKWTYLYRAVDKRGDTIDFYLSPTRSAKAAKRFLGKALRGLKHWEKPATLNTDKAPSYGAAITELKREGKLDRETAHRQVKYLNNVIEADHGKLKILIKPVRGFKSIPTAYATIKGFEVMRALRKGQARPWCLQPGIRGEVRLVERAFGIGPSALTEAMGMLNHHFAAAA |
48 | Escherichia_phage(50.0%) | protease,integrase,transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_2 |
89835 : 97265
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP034956|89835:97265|DBSCAN-SWA GCTATATTTTTCCGCCTACGCCTTTAAACTTCTCAATAAACGAGACGATTTTCTGGAAAACTGCCTGTTTTTTCGTTTTATATTGCGGGTTTAACGGACTAAGTTTTGGTAATGTCTCGTTTAATTCTGTGCCATTTTCGGTGGCGTATTCGCGTTTTAAAGACGTGCGAATATAGCGTTTCGCCGCCTCTTCATTGAGATTTTCTTCTTTAATCAACGCTTCTGCTTCACGTAGCTGTTCGTGTTGAGCAAAGGTAAAGAACGCGTCAATGATGCTGGCTTTGTCTGGTAAATCATCCAGGTTCGTTTGCTGAATAAAATCGACCACCAGGCCCTCTTTCGCCCGGTTCCCCAGGCTTGAACGAATTAAGCGTTTGACCTCTTCGATCATTTCGCCCTTGCCTTTATTTTGTCTGTTATGTTCGAAAATCAGTCCAAGGATATAATCCAGGTTTATTTCCTGAGACTTCAGCAAATCGACCTCAAAAACCACGTCATCCCAGTCAGTGGTTGATTTCTCTTTTTTCTCAGCTTCTTTCTCACGGCGCTGCCAGTCGCGAATATCGTTATAGGCAGAACGATAATCCTGAATCTTGCGATCAGCAGGGAGACGAATTGTTTGCAATTCAGCGAGCTTTTCATCATCCACATAATGTTCTGCTTTGAATTTTTCTACTGCAACAGGATCGCTAAGATCGATTTGTTGCAGGGCTTTCAGCGTGGCAAATTCATCATAGTTTTGCAGGATGTTCTCGGTACGCAGGTATTCACCAAACAGTTTTACGAAGTCTTTCTTCTCTTTTTCACTTTCAATACTGGCAGGGTCAGGGAACCGTTGTTCCAGTTCTGCAACTACTGCCATAAAGCCGCGCTTAGCTTCACCAGTAGCAGCATCAGTAAAGCCTTCCATATACTCTGCATAACTCTTTTCTAACACTACATTTTTGGTGTTTTTGTCACCAAACAGCGTTATGGCATCAATAGTTGAGCGTTCCAGATCCCGGAAAGTGACGATATTACCGAAGGTTTTAGTGGCGTCATAAATGCGGTTGGTGCGGGAAAATGCCTGCATCAGGCCGTGAAAACGTAAGTTTTTATCGACGAATAGCGTGTTCAATGTTGGAGCATCGAAGCCGGTTAAGAACATCCCCACGACAATTAGCAGATCGATATCCTGATTTTTAACCCGCTGGGCTAAATCACGATAGTAGTTCTGAAAACCGTTACTGTCGGTGCTAAAGTTAGTTTTAAAATGGCTGTTATATTCACGAATTGCAGCGTCCAGAAACTCTTTAGCACTGCTGTCCATTGCGCTGGTATCAAAAGTTTCATCGGAAATCTCACCAATGGCATTTTGTTCTTCATTGGCGGCAAAGGAGAAGATTGTCGCAATACGCAGCGGTTTATAGGTAGCCGATTTATTAGCGGCTTCTTCTTGTAACCGTTTAAACGTCGCATAATAGGCTTTCGCGGCATCCACGCTGCTCACTGCCAACATAGCATTAAAACCTTTTGAGCCAGGGAAGGTACGGTGAGTTTTCTGGCGGAAATTATTCAGAATATATTGCGTAATTTCCTGTATACGCATGGGATGAAGAAACGCCTGCTGATTTTCAGCCGCACTCAGTTTTTTCTCGTCGGTTTCTGTCTCTAAAGACTTAAACTGTGGCCGCACATCGTTGTAGTCCACCTTGAATTTAAGCACTTTTTCATCTCGAATCGCATCGGTAATAACATATGAATGCAATTCACGACCAAATACGCTGGCGGTTGTTTCTGAGCCTAAGGCGTTTTCCGGGAAAATAGGGGTACCGGTAAAACCAAACTGATAATAGCGTTTGAATTTCTTCTTCAGGTTTTTCTGTGCTTCTCCAAACTGGCTGCGGTGGCATTCATCAAATATAAACACCACTTGCTGATTGTATACAGG
CAGGTCGCTTTCTGCTTTCATCAGGTTATTGAGTTTCTGAATAGTGGTGACGATAATTTTGTTATCATCCTTATCCAGATTTCGTTTAAGACCTGCGGTATTTTCCGATCCATTGACACTGTCTGGCGAAAAACGCTGATATTCCTTCATGGTCTGGTAATCGAGGTCTTTCCTGTCGACCACAAAGAAGACTTTATCAATAAAGTCCAGTTCTGTTGCCAGACGCGCGGCTTTAAAGCTGGTCAGGGTTTTACCAGAACCGGTAGTGTGCCAGATAAAGCCACCACTTTCGGGGGTAGACCAGTTTTTCGCTTTATAGGAGCTGTTGATTTTCCATAAGATACGCTCAGTGGCGGCAATCTGGTACGGTCGCATCACCAGTAGCGTCTGGCTACTGTCAAAAACGCTGTAGTTCACCAGAACATTCAGCAGAGTATGTTTCTGGAAAAAGGTAGCGGTAAAGTCTTTGAGGTCTTTAATCAGCGTGTTGTCTGATTTTGCCCAATTCATGGTGAAGTCAAAACTGTTTTTATCGCGCTTTGTCGTGTTGGCAAAATAATGGGTATCGGTGCCGTTAGAAATGACAAACAGTTGCAGATACTTAAACAGGGAATTTTCGCTGTTAAAACTCTCTTTACTGTAACGATGTATCTGGTTGAAAGCCTCACGAATCGCCACCCCGCGTTTTTTTAGTTCGATTTGCACCAGCGGTAAACCATTAACCAGGATCGTGACGTCATAACGGTTAGCATGAGAACCCGTCTGTTCAAACTGCTGGATAATCTGCACCTTATTGCGCATGAGATTCTTTTTATCTATCAAATAGATGTTCTCAAGACGCTCGTCATCAAAAATAAAGTCGCAAATATAGTCGATATGGATTTTACGGGTCTTATCCAGAATGCCATCGCTCGGGTTATCCAGATACTGCTCCGTGAAACGCCGCCACTCGCTGTCATTAAACACCACACCATTGAGGTTCTGAAGCTGTTCCCGAACATTGGCCAGCATCGCCGACTGTGATTTTACGGATATAAATTCATAGCCCTGATTCCGCAGGTCCTGAATCAGTTCTCGTTCCAGGTCCGATTCGCTCTGGTAGCTGTCGCCTGTTGGCTCAGCTTTGATGTACTTATCAAGGACGATAAAGTTATTGGATTCAGCAATGGTGTGTGTCTGATGAGTCATAGCGCATCCTTTGTGCCGTCTGGCAAGGGCCGGAAGGGAGTTAAGGGTGACTTCCGGCACGTAAAAAATAGTCTATATACAGACCGGATGTTAAGGTGGCCCGGTCGGTAGCAACGGTCAATTAATTACTGACAGTTTCAGGTTTTGGGAAACTGAACAGTAAATCACGGTAGTATTCGTATTGTTTCTGGCGCAACTCGATTTCACGCGGAAGACCTTCGGTGATGGAGTTAGTCAGTGTGTCGAATTTGTCGAGTATTTCGACAATGCGAGCTTGTTCCTTAAGTGATTTTTCGTGATCTTTAGGATATGGAACCGGAATCATAATTTTTGAAAAACCATTAATGAGTAGTGTATTAACTTTTGTTCTGGCTACATACTTTGCTTTTTCAGAAATAAACGAATCGGTTTGCATGTAATAGGAAATAAATTTTGGATTCAAAGAATGTCGAAAAGCATAACAGTGATCATGAATAGCGATATCGTCATCCCCAAGCCATGCCACTGCTTTACCAACGTCTTCTACAGTCTCCCCCACGTCAGTTATCACGACATCTCCATGTTTGGCATAGCGTAATGACGCTGCCATATCAGCTCTAACCTGTGATAACGAATGAGTTGTGTAAACACCATATCGTGTATATATCTCACCATAATGGATTACACTGATACCACCATCTTCTACATAATCTGCTTTAGTAAAACGTTTTCCACGAATAAACTCACCAATTTCCCCCAAAGCTTTCCACTCAACCTCACCTTCTTTAAAAGTCAGCAATTGGTCGCGGTAGTAGTTGTACTGTTTTTTACGC
ATGTTAAGCTCAGCGGTAAGCTCAGCGGTAAGCTCAGCGGTAAGTGCAGTAAACTTATCCAGAATCCGAACGATTTCAGACTGGATGGCAAGGGACTTTTCCGGATTATCCGGGCAGGGGATGGGGATCTTAATATTTTTGACAATCTGCGCATTTATGTTTGTCTGGGACCCTGTTCCAAGGGATTTGATATATGTGTATTGGCTACACAAGAAGTGAAATACATATCTATAATGAGCAACTTCTTCATTAAGTTGAATATTTGCGCACGCTTGATTTGTTGTCATTGGAATTTTGTTTATGCCGATTTTCCCCACAGTTGCCCCATACATAGCAACAATGACACAATTCTTTGGTATCCATTTTGCACTAGAGTTTTTAACTCCAGACTCAGTTATTTTTACCTCGGTATCCCATATATCACAAAAGTTTACTTCTTGAGTTCTCAACCAAGGAATGTCGCCATCATAAAATTCTGATACGCCAGTTTTAGGGGTTCCTCCAGATGATATCTTTATAGAAATATCCTCAAGGGTTTTCCACTCAACCTCAACCCCATCCAGCAATTTTTCCAGATAACTCAACTCGCTCATTTCTGCACCTCGCAGCCTTCAATTTCAGCCACAATCGCATCAATATCTTTACGCAACTGGTCGATTTTGCTGACCGTAGTTTTAAGCTCTGCATTTAGCTCAGCAATATTGATAATTTCGCGGTTATCTTTCGCTTCTACATAGCAGCTCACCGACAGGTTATAGTCATTAGCGACAACGGTCTCAAACGCGACAGATTTCGCCAGATGAGCAACATCTTCCTTGCTGGCAAATACCTGCATAATCTGTTCGATATGGGCATCGGTCAGGATATTGTTGTTGGTCTCTTTTTTGAATAGTTCGCTGGCATCAATAAACTGAACTTTGGTATCCGTTTTATGTTTAGACAACACCAGAATATTGACGGCAATGGTGGTGCCAAAGAACAGGTTCGGTGCGAGTGAAATCACGGTTTCGACATAGTTATTGTCAACCAGATACTGACGGATTTTCTGCTCCGCGCCGCCACGGTAAAAAATGCCCGGGAAGCAGACAATCGCAGCACGACCTTTGGCAGAAAGATAGTTCAGCGCATGTAATACAAACGCAAAGTCAGCTTTGGATTTGGGGGCCAGAACGCCAGCCGGGGCAAAACGTTCATCGTTAATCAGCGTCGGGTCATCGCTGCCAATCCATTTCACCGAATACGGCGGGTTAGAAACGATGGCATCAAACGGTTTTTCATCTCTGAAGTGCGGTTCAGTCAGCGTATTGCCCAGCTTGATATCAAACTTGTCGTAGTTGATGTTGTGCAAAAACATGTTCATACGCGCCAGGTTATAGGTCGTATGGTTGATTTCCTGACCAAAAAAACCTTCTTCGATGATATGGTTATCAAACTGTTTTTTCGCCTGCAACAACAGTGAGCCGGAACCCGCTGCCGGGTCGTAGATTTTGTTAACGCTGGTCTGCCCGTGCATAGCCAGTTGTGCAATCAGCCTGGAGACGTGCTGCGGTGTAAAGAACTCACCGCCTGACTTACCGGCATTTGCCGCATAGTTAGAAATCAGGAACTCATAGGCATCACCGAACAGGTCAATCTGATGTTCGTTGAAGTCACCAAGTTTTAACCCTTCAACCCCTTTCAGAACCGCAGCCAGGCGGGCATTTTTATCTTTAACGGTGTTACCCAGGCGGTTACTGGTGGTATCGAAATCAGCAAACAAACCTTTGATGTCAGCTTCTGAAGGATAACCGTAAGCAGAACTTTCGATAGCAACGAAGATGCTGTTTAAATCTGCATTCAATCTGTCATTGGTATTTGCTTTCGCAGCTACGTTGCAGAAAAGCTGGCTTGGGTAGATGAAGTAGCCTTTGGTTTTGATGGCATCGTCTTTAATGTCATCAGTAATTACGCTGTCATCCAGTTTCGCATAACAGATACTGTCATCACCGGCTTCAATATAA
CTGGAAAAATTTTCGCTGATAAAACGGTAGAAAAGTGCGCCCAGAACGTATTGCTTAAAATCCCATCCATCGACCGAACCCCTGACATCGTTAGCAATTTGCCAGATTTGACGATGAAGCTCTGCACGTTGTTGAATACTTGTCATTTTCATCCACTTATTTCAGGCTTATGTAATTGGCGGTGATTCTACAGCAACTTGGATGCTTTAGCAGTTCGGACATTAGGCTACGAATGACCTGCCTAGAGGTTTGTTAAGCCGCAAAGTGCTGGTGCTTTATGCCTGTGAAGTTTATAATTGTGTACACATAACGAGTACACGAGGTGTTTATGCAATCCATTAACTTCCGTACCGCGCGCGGCAACCTTTCTGAAGTGCTCAACAATGTTGAGGCCGGGGAAGAGGTTGAAATCACCCGCAGAGGCCGTGAGCCAGCAGTAATTGTCAGCAAGGCTACTTTCGAAGCCTACAAAAAAGCGGCGCTGGATGCTGAATTTGCATCCCTGTTTGACACCCTGGACTCCACCAACAAGGAACTGGTTAACCGATAATGAGGCATATATCACCGGAAGAACTTATTGCGCTTCATGATGCGAATATAAACCGCTACGGCGGCCTGCCGGGAATGTCTGATCCGGGCAGGGCAGAGGCCATTATCGGGAGAGTTCAGGCCAGAGTTGCCTACGAAGAGATCACCGACCTTTTCGAAGTCTCCGCCACCTACCTAGTGGCTACAGCGAGAGGGCATATATTCAATGATGCCAATAAGCGTACCGCGCTAAACAGTGCGCTGTTATTTCTACGCCGTAACGGGGTGCAGGTATTTGATTCACCTGAACTGGCAGACCTTACCGTAGGGGCTGCGACCGGAGAGATATCTGTATCTTCTGTCGCCGACACGTTACGTAGATTGTATGGTTCTGCGGAGTAGATTAATGGCACGCAAATACAACAAATTGTCCCGTGAAGCGTTAAAGATGCTTCTTGATGGCGTGAGTCGCCGCGAGGTAAAGCAATACCTGGCTGGTAAGCAAATTGGTGCCAGGACCGCTATTGCTGTGTTATGCCGTCAGGAAATGGTTGTGCTTAAACAGAGAATGCCGGGCAGCAGATAAAGCCCAATCAGTGATGAAAGGTGTGATGTGAAAGCCGTAATTACTCCCTTTGTACAAAAAGAGCTTGGCGTCGCCACATTCAAAGTGGATCAGGAAGTCAGAAAGCTGGTGGAGGCTGGCCGTAAATTTATTATGGAGCCGGTGCCGCGTGAGTTAATCGAGCACATGGACGACGGCCTCGTTGTTTCCGAGCAAACTATGGCAACAAATGAGGCGTTGCAGCCGTTTTTTAACAGCGATGAACTGTTTCGCCGTATTGGTGGAATTGACTCGCTGGTAGCGTGGTTGCGCAGGAAAGAGGGGCAGGTCAGGCTGGGCCATGTCGTTCTCGCCGGACAGCAGGCTTACCATGAAAGCACTGGAAATGGCATGGGAAACCCGTGGTAA
Protein sequences of DBSCAN-SWA_2 >CP034956|89835:97265|89835_92952_-|QAS83379.1|DBSCAN-SWA MTHQTHTIAESNNFIVLDKYIKAEPTGDSYQSESDLERELIQDLRNQGYEFISVKSQSAMLANVREQLQNLNGVVFNDSEWRRFTEQYLDNPSDGILDKTRKIHIDYICDFIFDDERLENIYLIDKKNLMRNKVQIIQQFEQTGSHANRYDVTILVNGLPLVQIELKKRGVAIREAFNQIHRYSKESFNSENSLFKYLQLFVISNGTDTHYFANTTKRDKNSFDFTMNWAKSDNTLIKDLKDFTATFFQKHTLLNVLVNYSVFDSSQTLLVMRPYQIAATERILWKINSSYKAKNWSTPESGGFIWHTTGSGKTLTSFKAARLATELDFIDKVFFVVDRKDLDYQTMKEYQRFSPDSVNGSENTAGLKRNLDKDDNKIIVTTIQKLNNLMKAESDLPVYNQQVVFIFDECHRSQFGEAQKNLKKKFKRYYQFGFTGTPIFPENALGSETTASVFGRELHSYVITDAIRDEKVLKFKVDYNDVRPQFKSLETETDEKKLSAAENQQAFLHPMRIQEITQYILNNFRQKTHRTFPGSKGFNAMLAVSSVDAAKAYYATFKRLQEEAANKSATYKPLRIATIFSFAANEEQNAIGEISDETFDTSAMDSSAKEFLDAAIREYNSHFKTNFSTDSNGFQNYYRDLAQRVKNQDIDLLIVVGMFLTGFDAPTLNTLFVDKNLRFHGLMQAFSRTNRIYDATKTFGNIVTFRDLERSTIDAITLFGDKNTKNVVLEKSYAEYMEGFTDAATGEAKRGFMAVVAELEQRFPDPASIESEKEKKDFVKLFGEYLRTENILQNYDEFATLKALQQIDLSDPVAVEKFKAEHYVDDEKLAELQTIRLPADRKIQDYRSAYNDIRDWQRREKEAEKKEKSTTDWDDVVFEVDLLKSQEINLDYILGLIFEHNRQNKGKGEMIEEVKRLIRSSLGNRAKEGLVVDFIQQTNLDDLPDKASIIDAFFTFAQHEQLREAEALIKEENLNEEAAKRYIRTSLKREYATENGTELNETLPKLSPLNPQYKTKKQAVFQKIVSFIEKFKGVGGKI >CP034956|89835:97265|96698_96878_+|QAS83384.1|DBSCAN-SWA MARKYNKLSREALKMLLDGVSRREVKQYLAGKQIGARTAIAVLCRQEMVVLKQRMPGSR >CP034956|89835:97265|93073_94357_-|QAS83380.1|DBSCAN-SWA MSELSYLEKLLDGVEVEWKTLEDISIKISSGGTPKTGVSEFYDGDIPWLRTQEVNFCDIWDTEVKITESGVKNSSAKWIPKNCVIVAMYGATVGKIGINKIPMTTNQACANIQLNEEVAHYRYVFHFLCSQYTYIKSLGTGSQTNINAQIVKNIKIPIPCPDNPEKSLAIQSEIVRILDKFTALTAELTAELTAELNMRKKQYNYYRDQLLTFKEGEVEWKALGEIGEFIRGKRFTKADYVEDGGISVIHYGEIYTRYGVYTTHSLSQVRADMAASLRYAKHGDVVITDVGETVEDVGKAVAWLGDDDIAIHDHCYAFRHSLNPKFISYYMQTDSFISEKAKYVARTKVNTLLINGFSKIMIPVPYPKDHEKSLKEQARIVEILDKFDTLTNSITEGLPREIELRQKQYEYYRDLLFSFPKPETVSN >CP034956|89835:97265|96092_96314_+|QAS83382.1|DBSCAN-SWA MQSINFRTARGNLSEVLNNVEAGEEVEITRRGREPAVIVSKATFEAYKKAALDAEFASLFDTLDSTNKELVNR >CP034956|89835:97265|94353_95910_-|QAS83381.1|DBSCAN-SWA 
MTSIQQRAELHRQIWQIANDVRGSVDGWDFKQYVLGALFYRFISENFSSYIEAGDDSICYAKLDDSVITDDIKDDAIKTKGYFIYPSQLFCNVAAKANTNDRLNADLNSIFVAIESSAYGYPSEADIKGLFADFDTTSNRLGNTVKDKNARLAAVLKGVEGLKLGDFNEHQIDLFGDAYEFLISNYAANAGKSGGEFFTPQHVSRLIAQLAMHGQTSVNKIYDPAAGSGSLLLQAKKQFDNHIIEEGFFGQEINHTTYNLARMNMFLHNINYDKFDIKLGNTLTEPHFRDEKPFDAIVSNPPYSVKWIGSDDPTLINDERFAPAGVLAPKSKADFAFVLHALNYLSAKGRAAIVCFPGIFYRGGAEQKIRQYLVDNNYVETVISLAPNLFFGTTIAVNILVLSKHKTDTKVQFIDASELFKKETNNNILTDAHIEQIMQVFASKEDVAHLAKSVAFETVVANDYNLSVSCYVEAKDNREIINIAELNAELKTTVSKIDQLRKDIDAIVAEIEGCEVQK >CP034956|89835:97265|96905_97265_+|QAS83385.1|DBSCAN-SWA MKAVITPFVQKELGVATFKVDQEVRKLVEAGRKFIMEPVPRELIEHMDDGLVVSEQTMATNEALQPFFNSDELFRRIGGIDSLVAWLRRKEGQVRLGHVVLAGQQAYHESTGNGMGNPW >CP034956|89835:97265|96313_96694_+|QAS83383.1|DBSCAN-SWA MRHISPEELIALHDANINRYGGLPGMSDPGRAEAIIGRVQARVAYEEITDLFEVSATYLVATARGHIFNDANKRTALNSALLFLRRNGVQVFDSPELADLTVGAATGEISVSSVADTLRRLYGSAE |
7 | Escherichia_phage(57.14%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
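
The protein FASTA headers in the prophage tables above appear to follow a pipe-delimited layout: contig, region `start:end`, protein `start_end_strand`, protein accession, an optional key-protein keyword, and the tool name. The following is a minimal sketch of how such a header could be split and checked against the keyword list in the legend. The field interpretation is inferred from the records shown here, and the names `ProteinHeader`, `parse_header`, and `is_key_protein` are hypothetical helpers, not part of DBSCAN-SWA.

```python
from dataclasses import dataclass
from typing import Optional

# Keyword list copied from the legend text of the prophage tables.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


@dataclass
class ProteinHeader:
    contig: str
    region_start: int
    region_end: int
    start: int
    end: int
    strand: str
    accession: str
    annotation: Optional[str]  # keyword field, present only on some records


def parse_header(header: str) -> ProteinHeader:
    """Split one header line; assumes the 5- or 6-field layout seen above."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected header layout: {header!r}")
    contig, region, location, accession = fields[:4]
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = location.split("_")
    # Assumption: a sixth field means field 5 carries the keyword annotation.
    annotation = fields[4] if len(fields) == 6 else None
    return ProteinHeader(contig, region_start, region_end,
                         int(start), int(end), strand, accession, annotation)


def is_key_protein(record: ProteinHeader) -> bool:
    """True when the annotation field is one of the legend keywords."""
    if record.annotation is None:
        return False
    known = {keyword.lower() for keyword in PHAGE_KEYWORDS}
    return record.annotation.lower() in known


if __name__ == "__main__":
    example = ">CP034956|8609:57113|13155_14169_+|QAS83311.1|integrase|DBSCAN-SWA"
    record = parse_header(example)
    print(record.accession, record.annotation, is_key_protein(record))
```

Exact matching on the annotation field is a deliberate simplification; the tool itself may match keywords against longer product descriptions.
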
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 11827
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >CP034957|0:11827|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >CP034957|0:11827|4613_4799_+|QAS83402.1|DBSCAN-SWA MDRKSNSDSYQRWMIEKADKRERTQISMSFVKMKDVLFNVLAIGTVLAALVVIFYEVQSSL >CP034957|0:11827|2443_2710_+|QAS83397.1|DBSCAN-SWA MISIAFASYILLALVVHLTLKQASYNGKVSSPAVNHACQQTSFLQPVWPLIFASNITRSMPALGVKLPAKKIAAIKGVVYIICIVRYR >CP034957|0:11827|1310_1766_+|QAS83395.1|DBSCAN-SWA MSENETFDATKKLLLNIRSVRVFARETSFEQLLEMQEKLNAVIEERREEAEREAAERAERERKRQELLQLIAGEGFSPEELLGLSEDAQKTRKKTLPKAPPKYQFDENGETKYWSGRGRTPKPIDEALKAGRSLDEFLIKKDASSTAGDEQ >CP034957|0:11827|1803_2454_-|QAS83396.1|DBSCAN-SWA MDIKPNSVIHLEPGETLPGYTNLKPVPDKYFDELKALINKLNTQSGNIYLKLKQMYAFLDRFNKEFVSTFTSCQKGCSSCCKMDVHLTALEATHIAQASKLTARDNPLTTGHESKCPFLSEKGTCSIYNYRPLLCRTYHVLTPPEMCNDLDAQVMQYGSQSANMGNHIYKTIAEWIYFQTYHCTGKLETKDIRDYFPYPREDIQRFLHHNPPRPFC >CP034957|0:11827|2726_2906_+|QAS83398.1|DBSCAN-SWA MTEKITDEELVDLLEALKRAHGMGVCSKAVKLAQRCADVFPAIVAELQEYRNAAKRTSA >CP034957|0:11827|6463_9481_+|QAS83403.1|transposase|DBSCAN-SWA MAFEERVQILSEAEQDELYGPPAFTSADQRFFFSLNDKELAIAKSLRHRGQRYMLVVLLGYFKAKPVVLNPGFHQIKQDLKYVYQTVLPGPGCRPFNLTPKENERIYQRVFQLCNYQRWNVKDHGAALRDYLSQQARAWTAPRHLFDAANEYCSGQKIAIPAYSTLQKIISQVVGDEQEHMAAHLERAMSRGLKQALAELVNGTGPLPFRQLRQSARNFTGTELEKELIVYRHIQHWMPEVDLLLSTLSLSQKNLQHLAEKVDYYGAKLKRQTVGSQWLYLLCYLQTRWQQALERIADGFVHHVRQTKQKAKDYAQEAVFKDWQKAAKNVSKAAEVLHLFIDDSIDLQLPFATVRQQALSLLTKRDLESVCLFLNEQRRSVDEAMWQYCDEKESLRKGLLRELFLCLRFEGCDGTQHLAAALAKTQNELNGQDAQLQTADTRLLSKKSREFLLDGEGNILIDRYEWFLYQQIPDRLNGQLTLPDITKYRALDADLIDGEHWRKNKYTLLQQSHFTKLAEEPEKLIKQMAMELDTRLYEVGEYLEQEDNRNIILRNPQGKRFWRLPSASKHHLVNNPFFQQIPTTGIADVLRMVDRDTGFIDCFAHVLGSQSRSRSHEYDLLAILVGNATNQGIYGMAQISDRTYDQLSTIQANYLRLETLNAANDNINNATAKLPIFRYYNIQEDVIHASADGQKFEARRETFKTRYSSKYFGTQKGVSAMTLIANHAAINARVIGANEHESHYIFDLLMSNTSDIIPDVLSTDTHGVNHVNFALLDLFGYQFAPRYAQVGKVINDMFDVKEDKEHRIQLCLKKPINTHRIAQHWDTIQRIAVSLKQRKTTQATLVRKLSEYKRNHPLLEALTEYNRLVKANYLLCYIDDASLRNYVQRALNRGEAYHQLRRAVSSVNGDQFRGSSDEEIQLWNECARLVTNAIVYFNSRILSQLLTSFEYQGDTKRIDIVKQASPVAWHNINLKGTYHFELSEKLPDLEELMRSIEGYLPVSEK >CP034957|0:11827|4105_4381_+|QAS83401.1|DBSCAN-SWA 
MHTTNAIILCLVSYPCIVAWVIYKNRFVDYLFLTLLACLGSIFSVVLYKQPDYSLTIYQAVAVGYAAVIMPTVLVLSVVNIFMGKAERTCK >CP034957|0:11827|9700_10702_+|QAS83404.1|transposase|DBSCAN-SWA MEYIKLSYHHLNFEDRTALMLESRKEGFSARKFAELIKRHPSTIYRELKRNSINDVYQARYASDNTFARRRRGHRKLKIDSILWKFIVEAIRCLWSPQQIAKRLKTFPDLDQTMNVSHTTIYSTIRALPKGELKKDLLSCLRHENKKRKANGEPKKDSILQDIKTIHERPAEVQERKIPGHWEADLIKGKDNKSSIATLIERNTRLCILATLPDAKAESVRKALTEALKYLPAELRKTLTYDRGREMSEHKILEEDLGIDVYFCDPHSPWQKGTCENMNGLIRQYLPKGIDLNQADQHYLNQVAMSLNTRPRKALDWLTPLGKVRISGEILLG >CP034957|0:11827|4443_4656_+|QAS83448.1|DBSCAN-SWA MVTQSLADFGRYSYKPRPAFQDWIERLKTGERLPRGRYFVGKKPGGRLVLIDELGGTWTEKVTVIHTNAG >CP034957|0:11827|10810_11827_-|QAS83405.1|transposase|DBSCAN-SWA MFVIWSHGTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMVEVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMRLFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQGTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSGLTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVDWLIAERPGKVRTLKQHPRKNKTAINIEYMKASIRAKVEHPFRIIKRQFGFVKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH >CP034957|0:11827|3833_4088_+|QAS83400.1|DBSCAN-SWA MGHYTIRTNDDEDQAIKKAQEATGQASASKTFMTAILELQRNRDEMAQLRRELAQEKARSQELVSSVKQFRSSLNNLFDLADNP >CP034957|0:11827|2902_3724_+|QAS83399.1|DBSCAN-SWA MTRATERAYSELQQAFDFYNQRLFDGELPDCLITFQRGKNTMGYFSYRRFVAADGSGRMIDEIALNPEYFPVYPLIEVMQTLVHEQCHMWQYHYGNPSRKTYHNAQWAAKMESIGLMPSSTGRPGGAKVGQKINDYPIPGGRFQRVTLELFQGQFALSWFDRFPVQVEQQKDMTAVIEQWRETLALAHQNAEAGIDIEAVLSMALLPSVSPQNGSHSDDMAGSNDNDLSVAAFEDKPKRNKVKYQCRGCGAAVWGKAGLNIECGDCELAFIEN |
12 | Bacillus_phage(33.33%) | transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_2 |
15089 : 19542
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >CP034957|15089:19542|DBSCAN-SWA CATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATGTCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCACAATGTGCGTAATGACCCCCCTGACCTCCAACCCGTCCTCATCGTCATCAGGTATAGCCCTGAAGTTATCCAGCTTATCGAGTTCCTGGAGACACCGCTTGGGGTGTAGTCGCAATCGCATGATCCTGAATTCTTGTTCCAGGACACAGAGCACCAAAGACCCGTCAACGGGCGTGCCCCCAAAATCTACTATGAGCAGCGCACCAGGCTTTATCCCTTCCCGGAAAGAGTAGGTATCACTTCGAAAAAAGAATACTCCCGGCGCTCCTGTATTACAGTGCTGATCGAGTGACAGCCTGTTTTCCTGGTAGTCGGTTGCTGGCGATGGAAAACCCATTTTTAACGGCCTCCGTTGGGGTTAAAAAGCATGAATGTCCGGTTCTCTCCTTCCTGAGTGGAAATGTCTTTAAAAACGGTCACGTAGGTTTCAATCCACTGATTTGCTTGGCGACAACTCCAGTGAAAATTAATCAGCGCAAGCTGGCTGACAAAGGCTTCTGTCGTAACTGTCTGACGACCGTTCGGCTCACGCTTAATGCTCGCACGCCAGGCTGAGTCAATTTCATAGTGTCTAGGCATGAAAAACCTCCGATCAATATAGCTGTATGCATATGGGTCTTCCCTTGTTGTGGTGGCTGAAGGCATGATAATGGTGTATTTAATCGCCAGAGGTCACCGCCATGGACGAAAAGTCCCTCTACGCTCATATTCTCAACCTGTCCGATCCGTGGCAGGTAAAGTCCCTTTCTCTCGATGAAAATGCCGGTTCTGTTACTGTCACTATTGAGATCGCTGAAAACACCCGGCTAGCCTGTCCGACCTGCGGTAAATCCTGTTCTGTTCACGATCACCGTCATCGTAAATGGCGCCATCTTGATACCTGCCAGTTCACCACTATTGTTGAAGCCGATGTTCCACGAATTATGTGTCCGGAGCATGGCTGCCTGACGTTGCCTGTTCCGTGGGCTGGCCCCGGAAGCCGGTATACGTTGCTATTCGAATCGTTCGTTCTCTCATGGCTGAAAATCAGCACCGTTGATGCTGTCAGGAAGCAACTTAAGCTCAGTTGGAATGCGGTTGACGGCATTATGACCCGGGCAGTTAAGCGAGGTCTTGCCCGGATAAAAAAGCCATTATCCGCCCGTCATATGAATGTGGATGAGGTCGCCTT
TAAAAAAGGACATCGTTACATAACGGTGATCTCCGATCGCGATGGTCGGGCGCTGGCCTTAACGGATGATCGCGGCACAGAGAGTCTTGCCGGCTATCTTCGCACGCTCACTGATGGGCAGTTGCTGGCTATCAAAACGCTCTCAATGGACATGAACGCGGGCTATATAAGAGCAGCGCGTATCCACTTACCCAGTGCGGTTGAGAAAATCGCCTTTGACCGCTTCCATGTGGCGAAGCAACTGGGCGAGGTAGTTGATAAAACCCGTCAGAATGAACATCCGCACCTCCCTGTTGAAAGCCGACACCAGGCAAAAGGAACCCGCTTCCTGTGGCAGTACAGCGATAAGTGGATGACCGAATCCCGGCAGGAAAAGCTGATGTGGCTGCGTGCACAGATGAAGCTGACGAGCCAGTGCTGGGCGCTGAAAGAGCTGGCAAAGGATATCTGGAACAGGCCATGGAGCGAGGAAAGACGGAGTGACTGGCAGAGATGGTTGGCGCTGGCGGCTAACAGTGACGTTCCCATGATGAAAAATGCCGCGAAAACGATAGGAAAAAGGCTGTACGGGATCCTGAATGCGATGCGACACAGTGTCTCAAACGGAAATGCGGAGGCACTTAACAGCAAGATCAGGCTGCTGAGGATAAAAGCCAGGGGATACCGAAACCGGGAGCGCTTTAAACTGGGGGTGATGTTCCACTACGGAAAGCTGAATATGGCGTTCTGAGCCTTCCCACCATGATCGGGGAAGACCCATGCATATACAGTATAAAGAATCAGTTTTATTCTTCAACACACTGTTTATTATGTTGTAACTACTCCCTACCTTCAGGTTCATGGTTCATTTTGTTAACGGTCTGTTGCTTCACAGCTTTGCTGATAGTCGCCCGGCTACAACCCAGCACTTTTTGAATCTGGCTCCAGGAGCTGCCGCTCGCAATCAGCCTGTTTATGGCGTCATAGCGAGACTGATTAACCTGGCGGCCTTTATACTTCCCTTCCCTTTTCGCCCTGGCGATCCCCTGCTGCTGCCGTTCCCGGCGCTGTTCATAGTCTCGCCTGGCGACGGCGGCCAGCATATCCAGCAACATATCGTTTATGGCCGAAAACATTCGGCTGTCAAAATCATTGTGACCCGATGCGAGCCAGGTTGTCGGAACGTTGACGGCCACAACCCGGATATCTTTCTGCCGGATCATTTTCTTCAGTGTGTTCCAGTCCTCCCCTACCAGGCGTGAAAGCCTGTCCACATCCTCTACCAGCAAGATATCGTTTTGCTGACAATCTTTCAGGAGCCGGAACAACTCAGGGCGTTCAAGCTTAGAACCTGATTCATTTTCAATGTAGTAGTTACAGATGCTCAGGCCGCGCTCGATGGCGAAGGTATTTATCGTGTCCAGGGCGCGTGTTGCATCCTGTTCAGTCGTGGATGCGCGTAAGTAAGCTCTGACAAAGCCTTTCGGTACTGTTTGAGTCATTATAGGGAACCAGTTCATTTTGAGTTAACCAGACACACCCATACAGTACCCATTTCAGTTGGTTCAGGGAAGGCACACTCAAAAAGAACCAGCAAACAGACCCCAGCCCTATAGTGACAACGTCACACTAAAAATTTACAACATAAAGATGATGTAAAGACACACCAAGATTTACACCATAAAGCGCACATAAAGACACAGCAACATTTACCATGTAAAGACACCATAAAGATAAATATATCTTTATTTGTGCTTTACATGGTAAAGCCATTGCATTATAAAGACACTGTAAAGACAAAAGTATCTTTACGAAAAACACACCATGTAAAGACATTCATATCTTTACATGAGCTTTACAACATAAAGAGAGGGATTCTGAAATGGGTAAAATCTTGCTTGTCGTATCAGATAAAGGTGGGGTAGGCAAAAGCACCTACGTGGCTAACACAGGCTCAATGCTGGTCAACAAAGGTAAGTCGGTAATTATCCTGAAGACAGATAAAAACCACGATCTGTTGA
GCTGGAACGAAAAGCGGACAGATAACGGCTTACTCACTATCCCGGTTCACGCAGCTTACGGAAACGTCAGCAACGAAATTAAGCGCCTTAGTAAGCTGTGTGAAGTTCTCATTGTCGATTGCCCCGGCCACGACAGCCAGGAGTTCCGTAGCGCCCTGACAGTATCAGATATCGTGGTGACACTGGTTAAGCCGTCCTCTGATTTCGAAAGCGAAACACTGACCAGTGTTACAGAGAAAATCCGCACCGCTCAGAAAGCTAACGCAGCTCTGGAACCGTGGGTGCTGTTCACCAGAATCAACTCAGCGAAACCCCGCCACAGAAAAGCGGCCATTGATCTGGATAAATTACTGCGTTCGGACAACATCTGGATTCAACCTCTGAAAACCCGGATATCCGAGCTGGATGTGTATGAAAGCGCCTGTAATGAAGGGGCAGGGGTACATGACGTTAGCCGGGCATCGAGCCTTTCAACAGCGAAAGCACAGATTGAACTTGTGGCACAGGAAATTGGCATTTTATAG
Protein sequences of DBSCAN-SWA_2 >CP034957|15089:19542|15089_15794_+|QAS83410.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >CP034957|15089:19542|16527_17748_+|QAS83413.1|transposase|DBSCAN-SWA MDEKSLYAHILNLSDPWQVKSLSLDENAGSVTVTIEIAENTRLACPTCGKSCSVHDHRHRKWRHLDTCQFTTIVEADVPRIMCPEHGCLTLPVPWAGPGSRYTLLFESFVLSWLKISTVDAVRKQLKLSWNAVDGIMTRAVKRGLARIKKPLSARHMNVDEVAFKKGHRYITVISDRDGRALALTDDRGTESLAGYLRTLTDGQLLAIKTLSMDMNAGYIRAARIHLPSAVEKIAFDRFHVAKQLGEVVDKTRQNEHPHLPVESRHQAKGTRFLWQYSDKWMTESRQEKLMWLRAQMKLTSQCWALKELAKDIWNRPWSEERRSDWQRWLALAANSDVPMMKNAAKTIGKRLYGILNAMRHSVSNGNAEALNSKIRLLRIKARGYRNRERFKLGVMFHYGKLNMAF >CP034957|15089:19542|15827_16184_-|QAS83411.1|DBSCAN-SWA MGFPSPATDYQENRLSLDQHCNTGAPGVFFFRSDTYSFREGIKPGALLIVDFGGTPVDGSLVLCVLEQEFRIMRLRLHPKRCLQELDKLDNFRAIPDDDEDGLEVRGVITHIVALLQS >CP034957|15089:19542|17836_18499_-|QAS83414.1|DBSCAN-SWA MTQTVPKGFVRAYLRASTTEQDATRALDTINTFAIERGLSICNYYIENESGSKLERPELFRLLKDCQQNDILLVEDVDRLSRLVGEDWNTLKKMIRQKDIRVVAVNVPTTWLASGHNDFDSRMFSAINDMLLDMLAAVARRDYEQRRERQQQGIARAKREGKYKGRQVNQSRYDAINRLIASGSSWSQIQKVLGCSRATISKAVKQQTVNKMNHEPEGRE >CP034957|15089:19542|18879_19542_+|QAS83415.1|DBSCAN-SWA MGKILLVVSDKGGVGKSTYVANTGSMLVNKGKSVIILKTDKNHDLLSWNEKRTDNGLLTIPVHAAYGNVSNEIKRLSKLCEVLIVDCPGHDSQEFRSALTVSDIVVTLVKPSSDFESETLTSVTEKIRTAQKANAALEPWVLFTRINSAKPRHRKAAIDLDKLLRSDNIWIQPLKTRISELDVYESACNEGAGVHDVSRASSLSTAKAQIELVAQEIGIL >CP034957|15089:19542|16186_16426_-|QAS83412.1|DBSCAN-SWA MPRHYEIDSAWRASIKREPNGRQTVTTEAFVSQLALINFHWSCRQANQWIETYVTVFKDISTQEGENRTFMLFNPNGGR |
6 | Escherichia_phage(40.0%) | transposase | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those that match a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA')
|
DBSCAN-SWA_3 |
24142 : 24658
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >CP034957|24142:24658|DBSCAN-SWA AATGAATATTCAGGAAGCATTAAACGTTTTTGGTTTATCCGGTGATCTGACTGAAAAGGACATCAAGGCGGCATACAAAAAAGCGGCTTTAAAATATCATCCAGACCGCAATCCATTGGGTGCCGAGCTGATGAAAGCAGTAAATGCTGCGTTTGATTTTCTCATGGCTAACATTGATAAAATCAATCAGTTTCAAAGCACTGATGAAAACGCACGTTACAACTACGGTGAAGATCTGGAAAAAGTTCTGAATACGCTTTCCGGCCTTACAGGGATTGTCTATGAAGTTATTGGTAATTGGGTCTGGATTAGCGGAGAAACTAAAGAACACAAAGACATTTTAAAGGAAATGGGCTGTAAGTGGGCATCTAAGAAAAAACAATGGTTTTACCGTCCTGAAGAACACAAAAGCCGCTGGAACCGTAAAGAACACAGCATTGAAGAAATTCGTGAAATGTATGGTACAGCGGGTAAACGTAAGGCGTCAGGCTGGACACGTGTAGAAGCAAGCGCATAA
Protein sequences of DBSCAN-SWA_3 >CP034957|24142:24658|24142_24658_+|QAS83421.1|DBSCAN-SWA MNIQEALNVFGLSGDLTEKDIKAAYKKAALKYHPDRNPLGAELMKAVNAAFDFLMANIDKINQFQSTDENARYNYGEDLEKVLNTLSGLTGIVYEVIGNWVWISGETKEHKDILKEMGCKWASKKKQWFYRPEEHKSRWNRKEHSIEEIREMYGTAGKRKASGWTRVEASA |
1 | Tupanvirus(100.0%) | NA | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those that match a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA')
|
DBSCAN-SWA_4 |
43289 : 45129
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >CP034957|43289:45129|DBSCAN-SWA AATGAGACATTTTTTTAATGTTCTTTCCTTTGCTGTTGCCCTGTTGCCAGCTACATTGTGTGCTGCTCAGATTCAGGGGAAAGTTATTCGCGTTCTCGATGGAGATACCATCGAAGTTAAAACACTACCCGCGAAAATTGTTGTGTATGAAGTTCCGATTCGAGTCCGATTGATAAATATCGATGCACCGGAAAAAAAACAACCCTTTGGTCGCTGGTCAACCAATCAACTTAAAGCCCTGCTGGCAGGGCAATCAGTTACCGTCTCTTATACGCAAACAGATCGCTACGGGCGAGTCCTGGGACGAGTGGTGACAGCGAACGGCACTGAGGCTAACCGCCAGCAAGTGCTAAAAGGGGCTGCATGGGTTTATGACAGGTACAACACCGATAACTCATTACCGGCTCTGCAACGGGAAGCTCAGACACAGAAGCGCGGCCTGTGGGCTGACAGTAATCCCGTTCCTCCGTGGGAGTGGCGTCATAAACAAAACTGACGCGCTTCGCTTGTCCAACCCCTCCTCTGCCTGAAGAAAATCTTCAGGCAACGTCACGGTTTTTAATATACATGGTGCGCATATGGGTAAATTAACTCTGGTAGAGCAGGCTTTACGGCTCCATAAAGAATATTACGGTGAAGATAATTTTGGTACTGTCGTAAAGGATTGTGATTACTATCCTAAAGTGGCAGAGATTTTTATAAACAGAAAGAATCGTAAAATTTATTCGAATCTTGATATCGAACCCGTTGCTTCAACCGTCGTTCATACGGGAAGTAGCCCTGTTATGGCTAAAGCCGGGAGGATTAAAAAGCTTATGTTTAAAGATGGTTCGGGAGCATACCGGATTCATCTTGGTCAAAATGAAGTGGTTCATATTATTCGCTTCATTCTTAACTCTAAAGTCAGAATGGAGTACGCAGTTGGCACAACCAAAAGTATGTCGATGCTTCATAATCTTTGCGAAGCTGGACGCGCTAAATTAAACCGCTCTTTACCCCGTAAAGGTATCTGGAAAATCGGGTGTTACGACGGCGTTTATTATCACGGTAAGGCAAGGAAGGAAGAAATAGAAAACGCTTTACGTTTTTCTGTTCATCCAAAGTTTAATGAGCTGGAGAATGACTTTAATCAGTTCTTTTCGGATATTGATTTCTACACCCGTTATGGTCAGTCAGGAATGCGAAAAGTGCTTTTTACTGGTCCTCCGGGCACAGGGAAAACGACTATCGCAAAAGCTTTGGGGGCAAAATATCAGGATAAATATGTATTTGTTTATGCTGATGATTACTTTAAAGATGTTTGCTATGCCGCAGCTCAAAAGAAAATACCCGTTATCATAATAGCTGAAGAGGTTGATGAACTTTATCGGGCTGATGCCGGAACGTTAAGTTTTCTTGATGGGGCTGATACACCGCGAAATCTTGCCGGGACATATGTTATTTTTTCAACGAACTATCCTAACCGGATTGATCCGCGCATCAGAAAACGCCCTGGCAGAATTGACAGAATCATATCTGTAGGTGCTTTTCGGACTAAAGCCGCTGCCGCTTGTGCCAAAATGTACTTACCAGACGATATAAATATTGACCTTAAAGAACTTGGCGCGGTGCTGGACAGAACGACTCCTGCTGAAATAAAGGAGATAATAAATATTGCTATTGGCATGATTCGCGGAACAAAAAACGAATTAACAGTAGATGTAATCAAGAATGCGCGTGCCTTTTTGAAAGGAACCCTTGACCTTAGTGTGCAGGAGGCAGAAGAAGACATAGAGGAAAGGGAAGAAATCTTTAAAAAGAATGGTGCGCAGCCTGACTATTCAAGTTACCTTGAGGATTGA
Protein sequences of DBSCAN-SWA_4 >CP034957|43289:45129|43289_43784_+|QAS83446.1|DBSCAN-SWA MRHFFNVLSFAVALLPATLCAAQIQGKVIRVLDGDTIEVKTLPAKIVVYEVPIRVRLINIDAPEKKQPFGRWSTNQLKALLAGQSVTVSYTQTDRYGRVLGRVVTANGTEANRQQVLKGAAWVYDRYNTDNSLPALQREAQTQKRGLWADSNPVPPWEWRHKQN >CP034957|43289:45129|43866_45129_+|QAS83447.1|DBSCAN-SWA MGKLTLVEQALRLHKEYYGEDNFGTVVKDCDYYPKVAEIFINRKNRKIYSNLDIEPVASTVVHTGSSPVMAKAGRIKKLMFKDGSGAYRIHLGQNEVVHIIRFILNSKVRMEYAVGTTKSMSMLHNLCEAGRAKLNRSLPRKGIWKIGCYDGVYYHGKARKEEIENALRFSVHPKFNELENDFNQFFSDIDFYTRYGQSGMRKVLFTGPPGTGKTTIAKALGAKYQDKYVFVYADDYFKDVCYAAAQKKIPVIIIAEEVDELYRADAGTLSFLDGADTPRNLAGTYVIFSTNYPNRIDPRIRKRPGRIDRIISVGAFRTKAAAACAKMYLPDDINIDLKELGAVLDRTTPAEIKEIINIAIGMIRGTKNELTVDVIKNARAFLKGTLDLSVQEAEEDIEEREEIFKKNGAQPDYSSYLED |
2 | Moraxella_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those that match a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA')
|
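The protein records above carry their coordinates and annotation in a pipe-delimited FASTA header, and the captions describe flagging proteins by phage-related keywords. The following is a minimal sketch of how such headers could be parsed and checked against that keyword list. The field meanings (contig, region, `start_end_strand`, accession, optional annotation, tool name) are inferred from the records shown here, and the case-insensitive substring match is an assumption, not the database's documented method. The names `parse_header` and `matched_keywords` are hypothetical.

```python
from typing import List, NamedTuple, Optional

# Keyword list copied from the caption above; lower-cased for matching.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


class ProteinHeader(NamedTuple):
    contig: str
    region: str                # prophage region as "start:end"
    start: int
    end: int
    strand: str                # "+" or "-"
    accession: str
    annotation: Optional[str]  # None when the header has no annotation field


def parse_header(header: str) -> ProteinHeader:
    """Split a header such as
    '>CP034957|15089:19542|15089_15794_+|QAS83410.1|transposase|DBSCAN-SWA'.
    Assumed layout: the last field is the tool name, and an annotation
    is present only when the header has six fields."""
    fields = header.lstrip(">").strip().split("|")
    contig, region, coords, accession = fields[:4]
    annotation = fields[4] if len(fields) == 6 else None
    start, end, strand = coords.split("_")
    return ProteinHeader(contig, region, int(start), int(end),
                         strand, accession, annotation)


def matched_keywords(annotation: Optional[str]) -> List[str]:
    """Return the phage-related keywords found in an annotation.
    Uses a case-insensitive substring match (an assumption)."""
    if not annotation:
        return []
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw in text]


if __name__ == "__main__":
    rec = parse_header(
        ">CP034957|15089:19542|15089_15794_+|QAS83410.1|transposase|DBSCAN-SWA"
    )
    print(rec.accession, rec.start, rec.end, rec.strand,
          matched_keywords(rec.annotation))
```

Under these assumptions, the two transposase records in DBSCAN-SWA_2 would be flagged, while the unannotated proteins in DBSCAN-SWA_3 and DBSCAN-SWA_4 would not.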
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|