Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP020368 | Escherichia coli strain BLR(DE3) chromosome, complete genome | 7 crisprs | DinG,RT,cas3,c2c9_V-U4,DEDDh,csa3,cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e | 0 | 11 | 8 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP020368_1 | 344489-344635 | Orphan |
NA
Consensus repeat of NZ_CP020368_1
|
1 spacers
spacers of NZ_CP020368_1
>1.1|344535|55|NZ_CP020368|CRISPRCasFinder TTAGCGTCGCATCAGGCATCTGCGCACGACTGCCGGATGCGGCGTAAACGCCTTA |
CRISPR arrays and Neighbor proteins around NZ_CP020368_1
The CRISPR arrays of NZ_CP020368_1 >merge|NZ_CP020368|1|344489-344635|CRISPRCasFinder TCCGGCCTACGGATGGCGCGAGAATTTGTAGGCCTGATAAGACGCGTTAGCGTCGCATCAGGCATCTGCGCACGACTGCCGGATGCGGCGTAAACGCCTTATCCGGCCTACGGATGGCGCGGGAATTTGTAGGCCTGATAAGACGCG >NZ_CP020368|1|1|344489-344635|CRISPRCasFinder TCCGGCCTACGGATGGCGCGAGAATTTGTAGGCCTGATAAGACGCG TTAGCGTCGCATCAGGCATCTGCGCACGACTGCCGGATGCGGCGTAAACGCCTTA TCCGGCCTACGGATGGCGCGGGAATTTGTAGGCCTGATAAGACGCG
>NZ_CP020368.1|WP_001013494.1|343402_344416_+|4-hydroxy-2-oxovalerate-aldolase MNGKKLYISDVTLRDGMHAIRHQYSLENVRQIAKALDDAHVDSIEVAHGDGLQGSSFNYGFGAHSDLEWIEAAADVVKHAKIATLLLPGIGTIHDLKNAWQAGARVVRVATHCTEADVSAQHIQYARELGMDTVGFLMMSHMTTPENLAKQAKLMEGYGATCIYVVDSGGAMNMSDIRDRFRALKAVLKPETQTGMHAHHNLSLGVANSIEAVEEGCDRIDASLAGMGAGAGNAPLEVFIAAADKLGWQHGTDLYALMDAADDLVRPLQDRPVRVDRETLALGYAGVYSSFLRHCETAAARYGLSAVDILVELGKRRMVGGQEDMIVDVALDLRNNK >NZ_CP020368.1|WP_000044314.1|342455_343406_+|acetaldehyde-dehydrogenase-(acetylating) MSKRKVAIIGSGNIGTDLMIKILRHGQHLEMAVMVGIDPQSDGLARARRMGVATTHEGVIGLMNMPEFADIDIVFDATSAGAHVKNDAALREAKPDIRLIDLTPAAIGPYCVPVVNLEANVDQLNVNMVTCGGQATIPMVAAVSRVARVHYAEIIASIASKSAGPGTRANIDEFTETTSRAIEVVGGAAKGKAIIVLNPAEPPLMMRDTVYVLSDEASQDDIEASINEMAEAVQAYVPGYRLKQRVQFEVIPQDKPVNLPGVGQFSGLKTAVWLEVEGAAHYLPAYAGNLDIMTSSALATAEKMAQSLARKAGEAA >NZ_CP020368.1|WP_000160727.1|341649_342459_+|2-keto-4-pentenoate-hydratase MTKHTLEQLAADLRRAAEQGEAIAPLRDLIGIDNAEAAYAIQHINVQYDVAQGRRVVGRKVGLTHPKVQQQLGVDQPDFGTLFADMCYGDNEIIPFSRVLQPRIEAEIALVLNRDLPATDITFDELYNAIEWVLPALEVVGSRIRDWSIQFVDTVADNASCGVYVIGGPAQRPAGLDLKNCAMKMTRNNEEVSSGRGSECLGHPLNAAVWLARKMASLGEPLRTGDIILTGALGPMVAVNAGDRFEAHIEGIGSVAATFSSAAPKGSLS >NZ_CP020368.1|WP_000121898.1|340773_341640_+|2-hydroxy-6-oxononadienedioate/2-hydroxy-6--oxononatrienedioate-hydrolase MSYQPQTEAATSRFLNVEEAGKTLRIHFNDCGQGDETVVLLHGSGPGATGWANFSRNIDPLVEAGYRVILLDCPGWGKSDSIVNSGSRSDLNARILKSVVDQLDIAKIHLLGNSMGGHSSVAFTLNWPERVGKLVLMGGGTGGMSLFTPMPTEGIKRLNQLYRQPTIENLKLMMDIFVFDTSDLTDALFEARLNNMLSRRDHLENFVKSLEANPKQFPDFGPRLAEIKAQTLIVWGRNDRFVPMDAGLRLLSGIAGSELHIFRDCGHWAQWEHADAFNQLVLNFLARP >NZ_CP020368.1|WP_000543457.1|339811_340756_+|2,3-dihydroxyphenylpropionate/2,-3-dihydroxicinnamic-acid-1,2-dioxygenase MHAYLHCLSHSPLVGYVDPAQEVLDEVNGVIASARERIAAFSPELVVLFAPDHYNGFFYDVMPPFCLGVGATAIGDFGSAAGELPVPVELAEACAHAVMKSGIDLAVSYCMQVDHGFAQPLEFLLGGLDKVPVLPVFINGVATPLPGFQRTRMLGEAIGRFTSTLNKRVLFLGSGGLSHQPPVPELAKADAHMRDRLLGSGKDLPASERELRQQRVISAAEKFVEDQRTLHPLNPIWDNQFMTLLEQGRIQELDAVSNEELSAIAGKSTHEIKTWVAAFAAISAFGNWRSEGRYYRPIPEWIAGFGSLSARTEN >NZ_CP020368.1|WP_001007410.1|338145_339810_+|bifunctional-3-(3-hydroxy-phenyl)propionate/3-hydroxycinnamic-acid-hydroxylase MAIQHPDIQPAVNHSVQVAIAGAGPVGLMMANYLGQMGIDVLVVEKLDKLIDYPRAIGIDDEALRTMQSVGLVDDVLPHTTPWHAMRFLTPKGRCFADIQPMTDEFGWPRRNAFIQPQVDAVMLEGVSRFPNVRCLFSRELEAFSQQDDEVTLHLKTAEGQREIVKAQWLVACDGGASFVRRTLNVPFEGKTAPNQWIVVDIANDPLSTPHIYLCCDPVRPYVSAALPHAVRRFEFMVMPGETEEQLREPQNMRKLLSKVLPNPDNVELIRQRVYTHNARLAQRFRIDRVLLAGDAAHIMPVWQGQGYNSGMRDAFNLAWKLALVIQGKARDALLDTYQQERRDHAKAMIDLSVTAGNVLAPPKRWQGTLRDGVSWLLNYLPPVKRYFLEMRFKPMPQYYGGALVREGEAKHSPVGKMFIQPKVTLENGDVTLLDNAIGANFAVIGWGCNPLWGMSDEQIQQWRALGTRFIQVVPEVQIHTAQDNHDGVLRVGDTQGRLRSWFAQHNASLVVMRPDRFVAATAIPQTLGNTLNKLASVMTLTRPDADVSVEKVA >NZ_CP020368.1|WP_001310587.1|337121_338069_-|DNA-binding-transcriptional-activator-MhpR MIFYCALSIGRVFSATIKTCPNVHQVHHVVLTIEMSINMQNNEQTEYKTVRGLTRGLMLLNMLNKLDGGASVGLLAELSGLHRTTVRRLLETLQEEGYVRRSPSDDSFRLTIKVRQLSEGFRDEQWISALAAPLLGDLLREVVWPTDVSTLDVDAMVVRETTHRFSRLSFHRAMVGRRLPLLKTASGLTWLAFCPEQDRKELIEMLASRPGDDYQLAREPLKLEAILARARKEGYGQNYRGWDQEEKIASIAVPLRSEQRVIGCLNLVYMASAMTIEQAAEKHLPALQRVAKQIEEGVESQAILVAGRRSGMHLR >NZ_CP020368.1|WP_000805902.1|335962_337045_-|LacI-family-DNA-binding-transcriptional-regulator MKPVTLYDVAEYAGVSYQTVSRVVNQASHVSAKTREKVEAAMAELNYIPNRVAQQLAGKQSLLIGVATSSLALHAPSQIVAAIKSRADQLGASVVVSMVERSGVEACKAAVHNLLAQRVSGLIINYPLDDQDAIAVEAACTNVPALFLDVSDQTPINSIIFSHEDGTRLGVEHLVALGHQQIALLAGPLSSVSARLRLAGWHKYLTRNQIQPIAEREGDWSAMSGFQQTMQMLNEGIVPTAMLVANDQMALGAMRAITESGLRVGADISVVGYDDTEDSSCYIPPLTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNTQTASPRALADSLMQLARQVSRLESGQ >NZ_CP020368.1|WP_000291549.1|331460_332714_-|lactose-permease MYYLKNTNFWMFGLFFFFYFFIMGAYFPFFPIWLHDINHISKSDTGIIFAAISLFSLLFQPLFGLLSDKLGLRKYLLWIITGMLVMFAPFFIFIFGPLLQYNILVGSIVGGIYLGFCFNAGAPAVEAFIEKVSRRSNFEFGRARMFGCVGWALCASIVGIMFTINNQFVFWLGSGCALILAVLLFFAKTDAPSSATVANAVGANHSAFSLKLALELFRQPKLWFLSLYVIGVSCTYDVFDQQFANFFTSFFATGEQGTRVFGYVTTMGELLNASIMFFAPLIINRIGGKNALLLAGTIMSVRIIGSSFATSALEVVILKTLHMFEVPFLLVGCFKYITSQFEVRFSATIYLVCFCFFKQLAMIFMSVLAGNMYESIGFQGAYLVLGLVALGFTLISVFTLSGPGPLSLLRRQVNEVA >NZ_CP020368.1|WP_001320653.1|330783_331395_-|galactoside-O-acetyltransferase MNMPMTERIKAGKLFTDMCEGLPEKRLRGKTLMYEFNHSHPSEVEKRESLIKEMFATVGENAWVEPPVYFSYGSNIHIGRNFYANFNLTIVDDYTVTIGDNVLIAPNVTLSVTGHPVHHELRKNGEMYSFPITIGNNVWIGSHVVINPGVTIGDNSVIGAGSIVTKDIPPNVVAAGVPCRVIREINDRDKHYYFKDYKVESSV >NZ_CP020368.1|WP_000107627.1|344793_346005_+|3-(3-hydroxy-phenyl)propionate-transporter MSTRTPSSSSSRLMLTIGLCFLVALMEGLDLQAAGIAAGGIAQAFALDKMQMGWIFSAGILGLLPGALVGGMLADRYGRKRILIGSVALFGLFSLATAIAWDFPSLVFARLMTGVGLGAALPNLIALTSEAAGPRFRGTAVSLMYCGVPIGAALAATLGFAGANLAWQTVFWVGGVVPLILVPLLMRWLPESAVFAGEKQSAPPLRALFAPETATATLLLWLCYFFTLLVVYMLINWLPLLLVEQGFQPSQAAGVMFALQMGAASGTLMLGALMDKLRPVTMSLLIYSGMLASLLALGTVSSFNGMLLAGFVAGLFATGGQSVLYALAPLFYSSQIRATGVGTAVAVGRLGAMSGPLLAGKMLALGTGTVGVMAASAPGILVAGLAVFILMSRRSRIQPCADA >NZ_CP020368.1|WP_001096705.1|346106_346646_+|DUF2058-domain-containing-protein MAKLTLQEQLLKAGLVTSKKAAKVERTAKKSRVQAREARAAVEENKKAQLERDKQLSEQQKQAALAKEYKAQVKQLIEMNRITIANGDIGFNFTDGNLIKKIFVDKLTQAQLINGRLAIARLLVDNNSEGEYAIIPASVADKIAQRDASSIVLHSALSAEEQDEDDPYADFKVPDDLMW >NZ_CP020368.1|WP_000419042.1|346871_347705_-|S-formylglutathione-hydrolase-FrmB MELIEKHASFGGWQNVYRHYSQSLKCEMNVGVYLPPKAANEKLPVLYWLSGLTCNEQNFITKSGMQRYAAEHNIIVVAPDTSPRGSHVADADRYDLGQGAGFYLNATQAPWNEHYKMYDYIRNELPDLVMHHFPATAKKSISGHSMGGLGALVLALRNPDEYVSVSAFSPIVSPSQVPWGQQAFAAYLAENKDAWLDYDPVSLISQGQRVAEIMVDQGLSDDFYAEQLRTPNLEKICQEMNIKTLIRYHEGYDHSYYFVSSFIGEHIAYHANKLNMR >NZ_CP020368.1|WP_000842100.1|347797_348907_-|S-(hydroxymethyl)glutathione-dehydrogenase/class-III-alcohol-dehydrogenase MKSRAAVAFAPGKPLEIVEIDVAPPKKGEVLIKVTHTGVCHTDAFTLSGDDPEGVFPVVLGHEGAGVVVEVGEGVTSVKPGDHVIPLYTAECGECEFCRSGKTNLCVAVRETQGKGLMPDGTTRFSYNGQPLYHYMGCSTFSEYTVVAEVSLAKINPEANHEHVCLLGCGVTTGIGAVHNTAKVQPGDSVAVFGLGAIGLAVVQGARQAKAGRIIAIDTNPKKFDLARRFGATDCINPNDYDKPIKDVLLDINKWGIDHTFECIGNVNVMRAALESAHRGWGQSVIIGVAGAGQEISTRPFQLVTGRVWKGSAFGGVKGRSQLPGMVEDAMKGDIDLEPFVTHTMSLDEINDAFDLMHEGKSIRTVIRY >NZ_CP020368.1|WP_001141271.1|348941_349217_-|formaldehyde-responsive-transcriptional-repressor-FrmR MPSTPEEKKKVLTRVRRIRGQIDALERSLEGDAECRAILQQIAAVRGAANGLMAEVLESHIRETFDRNDCYSREVSQSVDDTIELVRAYLK >NZ_CP020368.1|WP_000596085.1|349402_350176_-|YaiO-family-outer-membrane-beta-barrel-protein MIKRTLLAAAIFSALPAYAGLTSITAGYDFTDYSGDHGNRNLAYAELVAKVENATLLFNLSQGRRDYETEHFNATRGQGAVWYKWNNWLTTRTGIAFADNTPVFARQDFRQDINLALLPKTLFTTGYRYTKYYDDVEVDAWQGGVSLYTGPVITSYRYTHYDSSDAGGSYSNMISVRLNDPRGTGYTQLWLSRGTGAYTYDWTPETRYGSMKSVSLQRIQPLTEQLNLGLTAGKVWYDTPTDDYNGLQLAAHLTWKF >NZ_CP020368.1|WP_001018416.1|351118_352081_+|taurine-ABC-transporter-substrate-binding-protein MAISSRNTLLAALAFIAFQAQAVNVTVAYQTSAEPAKVAQADNTFAKESGATVDWRKFDSGASIVRALASGDVQIGNLGSSPLAVAASQQVPIEVFLLASKLGNSEALVVKKTISKPEDLIGKRIAVPFISTTHYSLLAALKHWGIKPGQVEIVNLQPPAIIAAWQRGDIDGAYVWAPAVNALEKDGKVLTDSEQVGQWGAPTLDVWVVRKDFAEKHPEVVKAFAKSAIDAQQPYIANPDAWLKQPENISKLARLSGVPEGDVPGLVKGNTYLTPQQQTAELTGPVNKAIIDTAQFLKEQGKVPAVANDYSQYVTSRFVQ >NZ_CP020368.1|WP_000939399.1|352093_352861_+|taurine-ABC-transporter-ATP-binding-subunit MLQISHLYANYGGKPALEDINLTLESGELLVVLGPSGCGKTTLLNLIAGFVPYQHGSIQLAGKRIEGPGAERGVVFQNEGLLPWRNVQNNVAFGLQLAGIEKMQRLEIAHQMLKKVGLEGAEKRYIWQLSGGQRQRVGIARALAANPQLLLLDEPFGALDAFTRDQMQTLLLKLWQETGKQVLLITHDIEEAVFMATELVLLSPGPGRVLERLPLNFARRFVAGESSRSIKSDPQFIAMREYVLSRVFEQREAFS >NZ_CP020368.1|WP_000114585.1|352857_353685_+|taurine-ABC-transporter-permease-TauC MSVLINEKLHSHRLKWRWPLSRQVTLSIGTLAVLLTVWWAVAALQLISPLFLPPPQQVLAKLLTIAGPQGFMDATLWQHLAASLTRIVLALLAAVVIGIPVGIAMGLSPTVRGILDPIIELYRPVPPLAYLPLMVIWFGIGENSKILLIYLAIFAPVAMSALAGVKSVQQVRIRAARSLGASRAQVLWFVILPGALPEILTGLRIGLGVGWSTLVAAELIAATRGLGFMVQSAGEFLATDVVLAGIAVIAIIAFLLELGLRALQRRLTPWHGEVQ >NZ_CP020368.1|WP_000004024.1|353681_354533_+|taurine-dioxygenase MSERLSITPLGPYIGAQISGADLTRPLSDNQFEQLYHAVLRHQVVFLRDQAITPQQQRALAQRFGELHIHPVYPHAEGVDEIIVLDTHNDNPPDNDNWHTDVTFIETPPAGAILAAKELPSTGGDTLWTSGIAAYEALSVPFRQLLSGLRAEHDFRKSFPEYKYRKTEEEHQRWREAVAKNPPLLHPVVRTHPVSGKQALFVNEGFTTRIVDVSEKESEALLGFLFAHITKPEFQVRWRWQPNDIAIWDNRVTQHYANADYLPQRRIMHRATILGDKPFYRAG |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP020368_2 | 376478-376622 | Orphan |
NA
Consensus repeat of NZ_CP020368_2
|
1 spacers
spacers of NZ_CP020368_2
>2.1|376521|59|NZ_CP020368|CRISPRCasFinder GGTGCCAGAACCGTAGGCCGGATAAGGCGTTCACGCCGCATCCGGCAATAAGTGCTCCG |
CRISPR arrays and Neighbor proteins around NZ_CP020368_2
The CRISPR arrays of NZ_CP020368_2 >merge|NZ_CP020368|2|376478-376622|CRISPRCasFinder ATGCCTGATGCGACGCTTGCCGCGTCTTATCAGGCCTACAAAAGGTGCCAGAACCGTAGGCCGGATAAGGCGTTCACGCCGCATCCGGCAATAAGTGCTCCGATGCCTGATGCGACGCTTGCCGCGTCTTATCAGGCCTGCAAAA >NZ_CP020368|2|2|376478-376622|CRISPRCasFinder ATGCCTGATGCGACGCTTGCCGCGTCTTATCAGGCCTACAAAA GGTGCCAGAACCGTAGGCCGGATAAGGCGTTCACGCCGCATCCGGCAATAAGTGCTCCG ATGCCTGATGCGACGCTTGCCGCGTCTTATCAGGCCTGCAAAA
>NZ_CP020368.1|WP_001219309.1|375545_376454_+|fructokinase MRIGIDLGGTKTEVIALGDAGEQLYRHRLPTPRDDYRQTIETIATLVDMAEQATGQRGTVGMGIPGSISPYTGVVKNANSTWLNGQPFDKDLSARLQREVRLANDANCLAVSEAVDGAAAGAQTVFAVIIGTGCGAGVAFNGRAHIGGNGTAGEWGHNPLPWMDEDELRYREEVPCYCGKQGCIETFISGTGFAMDYRRLSGHALKGSEIIRLVEESDPVAELALRRYELRLAKSLAHVVNILDPDVIVLGGGMSNVDRLYQTVGQLIKQFVFGGECETPVRKAKHGDSSGVRGAAWLWPQE >NZ_CP020368.1|WP_001298537.1|374509_375421_-|recombination-associated-protein-RdgC MLWFKNLMVYRLSREISLRAEEMEKQLASMAFTPCGSQDMAKMGWVPPMGSHSDALTHVANGQIVICARKEEKILPSPVIKQALEAKIAKLEAEQARKLKKTEKDSLKDEVLHSLLPRAFSRFSQTMMWIDTVNGLIMVDCASAKKAEDTLALLRKSLGSLPVVPLSMENPIELTLTEWVRSGSAAQGFQLLDEAELKSLLEDGGVIRAKKQDLTSEEITNHIEAGKVVTKLALDWQQRIQFVMCDDGSLKRLKFCDELRDQNEDIDREDFAQRFDADFILMTGELAALIQNLIEGLGGEAQR >NZ_CP020368.1|WP_120795376.1|373571_373655_+|protein-YkiD MTQRPWSKLQRKTHNIAALKIIARRSE >NZ_CP020368.1|WP_000941942.1|372801_373086_+|pyrimidine/purine-nucleoside-phosphorylase MLQSNEYFSGKVKSIGFSSSSTGRASVGVMVEGEYTFSTAEPEEMTVISGALNVLLPDATDWQVYEAGSVFNVPGHSEFHLQVAEPTSYLCRYL >NZ_CP020368.1|WP_001276425.1|372052_372730_+|AroM-family-protein MSASLAILTIGIVPMQEVLPLLTEYIDEDNISHHSLLGKLSREEVMAEYAPEAGEDTILTLLNDNQLAHVSRRKVERDLQGVVEVLDNRGYDVIILMSTANISSMTARNTIFLEPSRILPPLVSSIVEDHQVGVIVPVEEMLPVQAQKWQILQKSPVFSLGNPIHDSEQKIIDAGKELLAKGADVIMLDCLGFHQRHRDLLQKQLDVPVLLSNVLIARLAAELLV >NZ_CP020368.1|WP_001142439.1|371603_371795_+|protein-YaiA MPTKPPYPREAYIVTIEKGKPGQTVTWYQLRADHPKPDSLISEHPTAQEAMDAKKRYEDPDKE >NZ_CP020368.1|WP_000193393.1|371029_371554_+|shikimate-kinase-AroL MTQPLFLIGPRGCGKTTVGMALADSLNRRFVDTDQWLQSQLNMTVAEIVEREEWAGFRARETAALEAVTAPSTVIATGGGIILTEFNRHFMQNNGIVVYLCAPVSVLVNRLQAAPEEDLRPTLTGKPLSEEVQEVLEERDALYREVAHIIIDATNEPSQVISEIRSALAQTINC >NZ_CP020368.1|WP_000158159.1|370388_370847_+|YaiI/YqxD-family-protein MTIWVDADACPNVIKEILYRAAERMQMPLVLVANQSLRVPPSRFIRTLRVAAGFDVADNEIVRQCEAGDLVITADIPLAAEAIEKGAAALNPRGERYTPATIRERLTMRDFMDTLRASGIQTGGPDSLSQRDRQAFAAELEKWWLEVQRSRG >NZ_CP020368.1|WP_001295331.1|369459_370269_-|pyrroline-5-carboxylate-reductase MEKKIGFIGCGNMGKAILGGLIASGQVLPGQIWVYTPSPDKVAALHDQFGINAAESAQEVAQIADIIFAAVKPGIMIKVLSEITSSLNKDSLVVSIAAGVTLDQLARALGHDRKIIRAMPNTPALVNAGMTSVTPNALVTPEDTADVLNIFRCFGEAEVIAEPMIHPVVGVSGSSPAYVFMFIEAMADAAVLGGMPRAQAYKFAAQAVMGSAKMVLETGEHPGALKDMVCSPGGTTIEAVRVLEEKGFRAAVIEAMTKCMEKSEKLSKS >NZ_CP020368.1|WP_000484048.1|368327_369443_+|diguanylate-cyclase-AdrA MFPKIMNDENFFKKAAAHGEEPPLTPQNEHQRSGLRFARRVRLPRAVGLAGMFLPIASTLVSHPPPGWWWLVLVGWAFVWPHLAWQIASRAVDPLSREIYNLKTDAVLAGMWVGVMGVNVLPSTAMLMIMCLNLMGAGGPRLFVAGLVLMVVSCLVTLELTGITVSFNSAPLEWWLSLPIIVIYPLLFGWVSYQTATKLAEHKRRLQVMSTRDGMTGVYNRRHWETMLRNEFDNCRRHNRDATLLIIDIDHFKSINDTWGHDVGDEAIVALTRQLQITLRGSDVIGRFGGDEFAVIMSGTPAESAITAMLRVHEGLNTLRLPNTPQVTLRISVGVAPLNPQMSHYREWLKSADLALYKAKKAGRNRTEVAA >NZ_CP020368.1|WP_012767698.1|376698_377883_-|MFS-transporter-AraJ MKKVILSLALGTFGLGMAEFGIMGVLTELAHNVGISIPAAGHMISYYALGVVVGAPIIALFSSRYSLKHILLFLVALCVIGNAMFTLSSSYLMLAIGRLVSGFPHGAFFGVGAIVLSKIIKPGKVTAAVAGMVSGMTVANLLGIPLGTYLSQEFSWRYTFLLIAVFNIAVMASVYFWVPDICDEAKGNLREQFHFLRSPAPWLIFAATMFGNAGVFAWFSYVKPYMMFISGFSETAMTFIMMLVGLGMVLGNMLSGRISGRYSPLRIAAVTDFIIVLALLMLFFCGGMKTTSLIFAFICCAGLFALSAPLQILLLQNAKGGELLGAAGGQIAFNLGSAVGAYCGGMMLTLGLAYNYVALPAALLSFAAMSSLLLYGRYKRQQAADTPVLAKPLG >NZ_CP020368.1|WP_000698909.1|378008_381155_-|exonuclease-subunit-SbcC MKILSLRLKNLNSLKGEWKIDFTREPFASNGLFAITGPTGAGKTTLLDAICLALYHETPRLSNVSQSQNDLMTRDTAECLAEVEFEVKGEAYRAFWSQNRARNQPDGNLQVPRVELARCADGKILADKVKDKLELTATLTGLDYGRFTRSMLLSQGQFAAFLNAKPKERAELLEELTGTEIYGQISAMVFEQHKSARTELEKLQAQASGVALLTPEQVQSLTASLQVLTDEEKQLLTAQQQEQQSLNWLTRLDELQQEASRRQQALQQALAEEEKAQPQLAALSLAQPARNLRPHWERIAEHSAALAHTRQQIEEVNTRLQSTMALRASIRHHAAKQSAELQQQQQSLNTWLQEHDRFRQWNNELAGWRAQFSQQTSDREHLRQWQQQLTHAEQKLNALAAITLTLTADEVATALAQHAEQRPLRQHLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKTICEQEARIKTLEAQRAQLQAGQPCPLCGSTSHPAVEAYQALEPGVNQSRLLALENEVKKLGEEGATLRGQLDAITKQLQRDENEAQSLRQDEQALTQQWQAVTASLNITLQPLDDIQPWLDAQDEHERQLRLLSQRHELQGQIAAHNQQIIQYQQQIEQRQQLLLTTLTGYALTLPQEDEEESWLATRQQEAQSWQQRQNELTALQNRIQQLTPILETLPQSDELPHCEETVVLENWRQVHEQCLALHSQQQTLQQQDVLAAQSLQKAQAQFDTALQASVFDDQQAFLAALMDEQTLTQLEQLKQNLENQRRQAQTLVTQTAETLAQHQQHRPDDGLALTVTVEQIQQELAQTHQKLRENTTSQGEIRQQLKQDADNRQQQQTLMQQIAQMTQQVEDWGYLNSLIGSKEGDKFRKFAQGLTLDNLVHLANQQLTRLHGRYLLQRKASEALEVEVVDTWQADAVRDTRTLSGGESFLVSLALALALSDLVSHKTRIDSLFLDEGFGTLDSETLDTALDALDALNASGKTIGVISHVEAMKERIPVQIKVKKINGLGYSKLESTFAVK >NZ_CP020368.1|WP_001221319.1|381151_382354_-|exonuclease-subunit-SbcD MRILHTSDWHLGQNFYSKSREAEHQAFLDWLLETAQTHQVDAIIVAGDVFDTGSPPSYARTLYNRFVVNLQQTGCHLVVLAGNHDSVATLNESRDIMAFLNTTVVASAGHAPQILPRRDGTPGAVLCPIPFLRPRDIITSQAGLNGIEKQQHLLAAITDYYQQHYADACKLRGDQPLPIIATGHLTTVGASKSDAVRDIYIGTLDAFPAQNFPPADYIALGHIHRAQIIGGMEHVRYCGSPIPLSFDECGKSKYVHLVTFSNGKLESVENLNVPVTQPMAVLKGDLASITAQLEQWRDVSQEPPVWLDIEITTDEYLHDIQRKIQALTESLPVEVLLVRRSREQRERVLASQQRETLSELSVEEVFNRRLALEELDESQQQRLQHLFTTTLHTLAGEHEA >NZ_CP020368.1|WP_000113933.1|382543_383233_+|phosphate-response-regulator-transcription-factor-PhoB MARRILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWMLPGGSGIQFIKHLKRESMTRDIPVVMLTARGEEEDRVRGLETGADDYITKPFSPKELVARIKAVMRRISPMAVEEVIEMQGLSLDPTSHRVMAGEEPLEMGPTEFKLLHFFMTHPERVYSREQLLNHVWGTNVYVEDRTVDVHIRRLRKALEPGGHDRMVQTVRGTGYRFSTRF >NZ_CP020368.1|WP_000893580.1|383290_384586_+|phosphate-regulon-sensor-histidine-kinase-PhoR MLERLSWKRLVLELLLCCFPAFILGAFFGYLPWFLLASVTGLLIWHFWNLLRLSWWLWVDRSMTPPPGRGSWEPLLYGLHQMQLRNKKRRRELGNLIKRFRSGAESLPDAVVLTTEEGGIFWCNGLAQQILGLRWPEDNGQNILNLLRYPEFTQYLKTRDFSRPLNLVLNTGRHLEIRVMPYTHKQLLMVARDVTQMHQLEGARRNFFANVSHELRTPLTVLQGYLEMMDEQPLEGAVREKALHTMREQTQRMEGLVKQLLTLSKIEAAPTQLLNEKVDVPMMLRVVEREAQTLSQKKQTFTFEIDNGLKVSGNEDQLRSAISNLVYNAVNHTPEGTHITVRWLRVPHGAEFSVEDNGPGIAPEHIPRLTERFYRVDKARSRQTGGSGLGLAIVKHAVNHHESRLNIESTVGKGTRFSFVIPERLIAKNSD >NZ_CP020368.1|WP_000149639.1|384992_386312_+|branched-chain-amino-acid-transporter-carrier-protein-BrnQ MTHQLRSRDIIALGFMTFALFVGAGNIIFPPMVGLQAGEHVWTAAFGFLITAVGLPVLTVVALAKVGGGVDSLSTPIGKVAGVLLATVCYLAVGPLFATPRTATVSFEVGIAPLTGDSALPLFIYSLVYFAIVILVSLYPGKLLDTVGNFLAPLKIIALVILSVAAIVWPAGSISTATEAYQNAAFSNGFVNGYLTMDTLGAMVFGIVIVNAARSRGVTEARLLTRYTVWAGLMAGVGLTLLYLALFRLGSDSASLVDQSANGAAILHAYVQHTFGGGGSFLLAALIFIACLVTAVGLTCACAEFFAQYVPLSYRTLVFILGGFSMVVSNLGLSQLIQISVPVLTAIYPPCIALVVLSFTRSWWHNSSRVIAPPMFISLLFGILDGIKASAFSDILPSWAQRLPLAEQGLAWLMPTVVMVVLAIIWDRAAGRQVTSSAH >NZ_CP020368.1|WP_001295329.1|386387_387761_+|proline-specific-permease-ProY MESKNKLKRGLSTRHIRFMALGSAIGTGLFYGSADAIKMAGPSVLLAYIIGGIAAYIIMRALGEMSVHNPAASSFSRYAQENLGPLAGYITGWTYCFEILIVAIADVTAFGIYMGVWFPTVPHWIWVLSVVLIICAVNLMSVKVFGELEFWFSFFKVATIIIMIVAGFGIIIWGIGNGGQPTGIHNLWSNGGFFSNGWLGMVMSLQMVMFAYGGIEIIGITAGEAKDPEKSIPRAINSVPMRILVFYVGTLFVIMSIYPWNQVGTAGSPFVLTFQHMGITFAASILNFVVLTASLSAINSDVFGVGRMLHGMAEQGSAPKIFSKTSRRGIPWVTVLVMTTALLFAVYLNYIMPENVFLVIASLATFATVWVWIMILLSQIAFRRRLPPEEVKALKFKVPGGVATTIGGLIFLLFIIGLIGYHPDTRISLYVGFAWIVVLLIGWMFKRRHDRQLAENQ >NZ_CP020368.1|WP_001300528.1|387916_389734_+|maltodextrin-glucosidase MMLNAWHLPVPPFVKQSKDQLLITLWLTGEDPPQRIMLRTEHDNEEMSVPMHKQRSQPQPGVTAWRAAIDLSSGQPRRRYSFKLLWHDRQRWFTPQGFSRMPPARLEQFAVDVPDIGPQWAADQIFYQIFPDRFARSLPREAEQDHVYYHHAAGQEIILRDWDEPVTAQAGGSTFYGGDLDGISEKLPYLKKLGVTALYLNPVFKAPSVHKYDTEDYRHVDPQFGGDGALLRLRHNTQQLGMRLVLDGVFNHSGDSHAWFDRHNRGTGGACHNPESPWRDWYSFSDDGTALDWLGYASLPKLDYQSESLVNEIYRGEDSIVRHWLKAPWSMDGWRLDVVHMLGEAGGARNNMQHVAGITEAAKETQPEAYIVGEHFGDARQWLQADVEDAAMNYRGFTFPLWGFLANTDISYDPQQIDAQTCMAWMDNYRAGLSHQQQLRMFNQLDSHDTARFKTLLGRDIARLPLAVVWLFTWPGVPCIYYGDEVGLDGKNDPFCRKPFPWQVEKQDTALFALYQRMIALRKKSQALRHGGCQVLYAEDNVVVFVRVLNQQRVLVAINRGEACEVVLPASPFLNAVQWQCKEGHGQLTDGILALPAISATVWMN >NZ_CP020368.1|WP_001009885.1|389738_390320_-|ACP-phosphodiesterase MNFLAHLHLAHLAESSLSGNLLADFVRGNPEESFPPDVVAGIHMHRRIDVLTDNLPEVREAREWFRSETRRVAPITLDVMWDHFLSRHWSQLSPDFPLQEFVCYAREQVMTILPDSPPRFINLNNYLWSEQWLVRYRDMDFIQNVLNGMASRRPRLDALRDSWYDLDAHYDALETRFWQFYPRMMAQASRKAL >NZ_CP020368.1|WP_001266503.1|390412_391483_+|tRNA-preQ1(34)-S-adenosylmethionine-ribosyltransferase-isomerase-QueA MRVTDFSFELPESLIAHYPMPERSSCRLLSLDGPTGALTHGTFTDLLDKLNPGDLLVFNNTRVIPARLFGRKASGGKIEVLVERMLDDKRILAHIRASKAPKPGAELLLGDDESINATMTARHGALFEVEFNDERSVLDILNSIGHMPLPPYIDRPDEDADRELYQTVYSEKPGAVAAPTAGLHFDEPLLEKLRAKGVEMAFVTLHVGAGTFQPVRVDTIEDHIMHSEYAEVPQDVVDAVLAAKARGNRVIAVGTTSVRSLESAAQAAKNDLIEPFFDDTQIFIYPGFQYKVVDALVTNFHLPESTLIMLVSAFAGYQHTMNAYKAAVEEKYRFFSYGDAMFITYNPQAINERVGE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP020368_3 | 2219714-2219829 | Orphan |
NA
Consensus repeat of NZ_CP020368_3
|
1 spacers
spacers of NZ_CP020368_3
>3.1|2219743|58|NZ_CP020368|CRISPRCasFinder GGTTTGTAGGCCTGATAAGACGCGCCAGCGTCGCATCAGGCTCCGGGTGCCGGATGCA |
CRISPR arrays and Neighbor proteins around NZ_CP020368_3
The CRISPR arrays of NZ_CP020368_3 >merge|NZ_CP020368|3|2219714-2219829|CRISPRCasFinder GCGTAAACGCCTTATCCGGCCTACGGCTCGGTTTGTAGGCCTGATAAGACGCGCCAGCGTCGCATCAGGCTCCGGGTGCCGGATGCAGCGTGAACGCCTTATCCGGCCTACGGCTC >NZ_CP020368|3|3|2219714-2219829|CRISPRCasFinder GCGTAAACGCCTTATCCGGCCTACGGCTC GGTTTGTAGGCCTGATAAGACGCGCCAGCGTCGCATCAGGCTCCGGGTGCCGGATGCA GCGTGAACGCCTTATCCGGCCTACGGCTC
>NZ_CP020368.1|WP_001075164.1|2217408_2219694_+|ribonucleoside-diphosphate-reductase-1-subunit-alpha MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEELAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI >NZ_CP020368.1|WP_001220069.1|2212960_2216713_-|AIDA-I-family-autotransporter-adhesin-YfaL/EhaC MRIIFLRKEYLSLLPSMIASLFSANGVAAVTDSCQGYDVKASCQASRQSLSGITQDWSIADGQWLVFSDMTNNASGGAVFLQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNAMFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYGGAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVDSIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRSNSLMNVGDTHCQDDPQDCYGLTIGSIDQYQNQAELNVGSTQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQPMALTGDIVVDDGAVLSLEGDAADLTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQVSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAGVDTQWGALMADSSGEHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDIQSIDTTSSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSGAVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSASDQLVLNGNTAGNTTVVINPITGIGEPISTGIKVVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAKVGGYLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSVDLFRGRWGDDGEWMLGIVGGYSDNQGDSRSSMTGTRADNQNHGYAVGLTSSWFQHGKQKQGAWLDNWLQYAWFSNDVSEHEDGTDHYHSSGIIALLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW >NZ_CP020368.1|WP_000990753.1|2212110_2212833_+|bifunctional-3-demethylubiquinone-3-O-methyltransferase/2-octaprenyl-6-hydroxy-phenol-methylase MNAEKSPENHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGATVTGLDMGFEPLQVAKLHALESGIQVDYVQETVEEHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTLNRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNTFKLGPGVDVNYMLHTQNK >NZ_CP020368.1|WP_001281254.1|2209336_2211964_-|DNA-topoisomerase-(ATP-hydrolyzing)-subunit-A MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDSAVYDTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVEGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGEGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE >NZ_CP020368.1|WP_000012273.1|2207499_2209188_-|DUF2138-domain-containing-protein MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNNLQIDLNEPDLFLDSDSLSQLPKDLITIPFLHDVLSEDFVFYYQNHADRLGIEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETVPVYQLRYNGNNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIAGDLLSGKKRWQASFGLEERTAEKTPVRQRIVVSARWLGFGYQRLMPSFAGVHFEMGNDGWHSFVALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYINPQGIAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGAAWQWLPITWQPL >NZ_CP020368.1|WP_001300976.1|2206879_2207503_-|DUF1175-domain-containing-protein MRHGLLALICWLCCVVVHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFLVR >NZ_CP020368.1|WP_122987104.1|2202341_2206736_-|alpha-2-macroglobulin-family-protein MRLEAPGRDYRRYQMEEYGGVDVRLYRIPDPMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSSQSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVEQFRYPLWQAKPFEPQQGVKLEGASSNFISPQPGNIYIPLGQQEPGLYLVEAMVGGYRATTVVFVSDTVALSKVSGKELLVWTAGKKQGEAKPGSEILWTDGLGVMTRGVTDDSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTDRPLYRAGDRVDVKVIGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVNVTLDARNGGQGSFRLPENAVAGGYELRLAYRNQVYSSSFRVANYIKPHFEIGLALDKKEFKTGEAVSGKLQLLYPDGEPVKNARVQLSLRAQQLSMVGNDLRYAGRFPVSLEGSETVSDASGHVTLNLPAADKPSRYLLTVSASDGAAYRVTTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWLRLEDRTSHSGELPSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSGKGSTAHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQSLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQNAGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVDEMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGATNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWRITARGMNGDGLVGQGRAYLRSEKNLYMKWSMPTVYRVGDKPAAGLFIFSQQDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNGQVQDSISTKLSFVDNSWPVEQQKNVMLGGGDNALMLPEQASNIRLQSSETPQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQMIQDNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQAIGVTQQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAIARRGTKTEDFSEEDTRDINDSLILDTPESPLADAVANVLTMTLLKKAQLKSTVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQTAAILSGLTAEQSTIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEDWRWVGQGVPDILSFGDELSPQNVQVRWREPAKTAQQSNIPVTVERQLYWLIPGEEEMSFTLQPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTWGISVNKPNAAKQQGQLLEKARNEMGELAYMLPVKELTGTVTFRHLLRFSQKGQFVLPPARYVRSYAPAQQSVAAGSEWTGMQVK >NZ_CP020368.1|WP_001104549.1|2200691_2202341_-|DUF2300-domain-containing-protein MNWRRIVWLLALVTLPTLAEETPLQLVLRGAQHDQLYQLSSSGVTKVSALPDSLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESITRDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSVTVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWFADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQVASGQCVEVELFARYPLKKITAEKSTTAVNPGVLNGRYRVTFTNGNHITFVSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMTAWTQDLIYAGDPVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRSTCQLLPKAKAWLAKKMPQWRRILQAETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >NZ_CP020368.1|WP_001225852.1|2199910_2200687_-|YfaP-family-protein MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPAEGEDASFSQSINYPASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDGSFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWDTDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPIHGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGELTLVKSFDW >NZ_CP020368.1|WP_000786547.1|2198652_2199837_+|acetyl-CoA-C-acetyltransferase MKNCVIVSAVRTAIGSFNGSLASTSAIDLGATVIKAAIERAKIDSQHVDEVIMGNVLQAGLGQNPARQALLKSGLAETVCGFTVNKVCGSGLKSVALAAQAIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLMCATHGYHMGITAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEIVPVNVVTRKKTFVFSQDEFPKANSTAEALGALRPAFDKAGTVTAGNASGINDGAAALVIMEESAALAAGLTPLARIKSYASGGVPPALMGMGPVPATQKALQLAGLQLADIDLIEANEAFAAQFLAVGKNLGFDSEKVNVNGGAIALGHPIGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN >NZ_CP020368.1|WP_000332036.1|2219935_2221066_+|ribonucleotide-diphosphate-reductase-subunit-beta MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDAEVDTDDLSNFQL >NZ_CP020368.1|WP_000135040.1|2221065_2221320_+|ferredoxin-like-diferric-tyrosyl-radical-cofactor-maintenance-protein-YfaE MARVTLRITGTQLLCQDEHPSLLAALESHNVAVEYQCREGYCGSCRTRLVAGQVDWIAEPLAFIQPGEILPCCCRAKGDIEIEM >NZ_CP020368.1|WP_000301050.1|2221373_2222024_-|lipopolysaccharide-kinase-InaA MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIYVKTEGNAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM >NZ_CP020368.1|WP_000779102.1|2222486_2223563_-|glycerophosphodiester-phosphodiesterase MKLTLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDNLVVLHDHYLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMDLNLVQLIAYTDWNETQQKQPDGSWVNYNYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTTDVNQLYDALYNKAGVNGLFTDFPDKAVKFLNKE >NZ_CP020368.1|WP_000948732.1|2223567_2224926_-|glycerol-3-phosphate-transporter MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSKFIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELTAKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFLYEYAGIPGTLLCGWMSDKVFRGNRGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAASAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQKRNGG >NZ_CP020368.1|WP_000857257.1|2225198_2226827_+|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-A MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQEPAEVTLRKVISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL >NZ_CP020368.1|WP_001209902.1|2226816_2228076_+|glycerol-3-phosphate-dehydrogenase-subunit-GlpB MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSHLPDGQPVADIHSGLESLRQQAPAHPYSLLGPQRVLDLACQAQALIAESGAQLQGSVELAHQRITPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDFQAHLAAASLRELDLSVETAEIELPELDVLRNNATEFRAVNIARFLDNEENWPLLLDALIPVANTCEMILMPACFGLADDKLWRWLNEKLPCSLMLLPTLPPSVLGIRLQNQLQRQFVHQGGVWMPGDEVKKVTCKNGVVNEIWTRNHADIPLRPRFAVLASGSFFSGGLVAERNGIREPILGLDVLQTATRGEWYKGDFFAPQPWQQFGVTTDETLRPSQAGQTIENLFAIGSVLGGFDPIAQGCGGGVCAVSALHAAQQIAQRAGGQQ >NZ_CP020368.1|WP_001000359.1|2228072_2229263_+|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-C MNDTSFENCIKCTVCTTACPVSRVNPGYPGPKQAGPDGERLRLKDGALYDEALKYCINCKRCEVACPSDVKIGDIIQRARAKYDTTRPSLRNFVLSHTDLMGSVSTPFAPIVNTATSLKPVRQLLDAALKIDHRRTLPKYSFGTFRRWYRSIAAQQAQYKDQVAFFHGCFVNYNHPQLGKDLIKVLNAMGTGVQLLSKEKCCGVPLIANGFTDKARKQAITNVESIREAVGVKGIPVIATSSTCTFALRDEYPEVLNVDNKGLRDHIELATRWLWRKLDEGKTLPLKPLPLKVVYHTPCHMEKMGWTLYTLELLRKIPGLELTVLDSQCCGIAGTYGFKKENYPTSQAIGAPLFRQIEESGADLVVTDCETCKWQIEMSTSLRCEHPITLLAQALA >NZ_CP020368.1|WP_000140557.1|2229455_2230358_+|ISNCY-family-transposase MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYSDVLWSVETSDGDGYINCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPLLFYHGETSPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQKHIRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYLLQCGDTSRFTRFIEEIAERSPLQKERLMTIAERLRQEGHQIGWQEGMHEQAIKIALRMLEQGFDRDQVLAATQLSEADLAANNH >NZ_CP020368.1|WP_000992954.1|2230398_2231202_-|2-keto-3-deoxy-L-rhamnonate-aldolase MNALLSNPFKERLRKGEVQIGLWLSSTTAYMAEIAATSGYDWLLIDGEHAPNTIQDLYHQLQAVAPYASQPVIRPVEGSKPLIKQVLDIGAQTLLIPMVDTAEQARQVVSATRYPPYGERGVGASVARAARWGRIENYMAQVNDSLCLLVQVESKTALDNLDEILDVEGIDGVFIGPADLSASLGYPDNAGHPEVQRIIETSIRRIRAAGKAAGFLAVAPDMAQQCLAWGANFVAVGVDTMLYSDALDQRLAMFKSGKNGPRIKGSY |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP020368_4 | 2630215-2630359 | Orphan |
NA
Consensus repeat of NZ_CP020368_4
|
1 spacers
spacers of NZ_CP020368_4
>4.1|2630260|55|NZ_CP020368|CRISPRCasFinder ATACACTCATTCCGTATGGCGGATAAGGCGTTTTCGCCGCATCCGCCGTTCTGTG |
CRISPR arrays and Neighbor proteins around NZ_CP020368_4
The CRISPR arrays of NZ_CP020368_4 >merge|NZ_CP020368|4|2630215-2630359|CRISPRCasFinder CACAATGCCTGATGCGACGCTGGAGCGTCTTATCATGCCTACAAAATACACTCATTCCGTATGGCGGATAAGGCGTTTTCGCCGCATCCGCCGTTCTGTGCACAATGCCTGATGCGACGCTGGCGCGTCTTATCATGCCTACAAA >NZ_CP020368|4|4|2630215-2630359|CRISPRCasFinder CACAATGCCTGATGCGACGCTGGAGCGTCTTATCATGCCTACAAA ATACACTCATTCCGTATGGCGGATAAGGCGTTTTCGCCGCATCCGCCGTTCTGTG CACAATGCCTGATGCGACGCTGGCGCGTCTTATCATGCCTACAAA
>NZ_CP020368.1|WP_001216521.1|2629173_2630166_+|glycine-betaine/L-proline-ABC-transporter-substrate-binding-protein-ProX MRHSVLFATAFATLISTQTFAADLPGKGITVNPVQSTITEETFQTLLVSRALEKLGYTVNKPSEVDYNVGYTSLASGDATFTAVNWTPLHDNMYEAAGGDKKFYREGVFVNGAAQGYLIDKKTADQYKITNIAQLKDPKIAKLFDTNGDGKADLTGCNPGWGCEGAINHQLAAYELTHTVTHNQGNYAAMMADTISRYKEGKPVFYYTWTPYWVSNELKPGKDVVWLQVPFSALPGDKNADTKLPNGANYGFPVSTMHIVANKAWAEKNPAAAKLFAIMQLPVADINAQNAIMHDGKASEGDIQGHVDGWIKAHQQQFDGWVNEALAAQK >NZ_CP020368.1|WP_000774988.1|2628051_2629116_+|glycine-betaine/L-proline-ABC-transporter-permease-ProW MADQNNPWDTTPAADSAAQSADAWGTPTTAPTDGGGADWLTSTPAPNVEHFNILDPFHKTLIPLDSWVTEGIDWVVTHFRPVFQGVRVPVDYILNGFQQLLLGMPAPVAIIVFALIAWQISGVGMGVATLVSLIAIGAIGAWSQAMVTLALVLTALLFCIVIGLPLGIWLARSPRAAKIIRPLLDAMQTTPAFVYLVPIVMLFGIGNVPGVVVTIIFALPPIIRLTILGINQVPADLIEASRSFGASPRQMLFKVQLPLAMPTIMAGVNQTLMLALSMVVIASMIAVGGLGQMVLRGIGRLDMGLATVGGVGIVILAIILDRLTQAVGRDSRSRGNRRWYTTGPVGLLTRPFIK >NZ_CP020368.1|WP_000985494.1|2626856_2628059_+|proline/glycine-betaine-ABC-transporter-ATP-binding-protein-ProV MAIKLEIKNLYKIFGEHPQRAFKYIEQGLSKEQILEKTGLSLGVKDASLAIEEGEIFVIMGLSGSGKSTMVRLLNRLIEPTRGQVLIDGVDIAKISDAELREVRRKKIAMVFQSFALMPHMTVLDNTAFGMELAGINAEERREKALDALRQVGLENYAHSYPDELSGGMRQRVGLARALAINPDILLMDEAFSALDPLIRTEMQDELVKLQAKHQRTIVFISHDLDEAMRIGDRIAIMQNGEVVQVGTPDEILNNPANDYVRTFFRGVDISQVFSAKDIARRTPNGLIRKTPGFGPRSALKLLQDEDREYGYVIERGNKFVGAVSIDSLKTALTQQQGLDAALIDAPLAVDAQTPLSELLSHVGQAPCAVPVVDEDQQYVGIISKGMLLRALDREGVNNG >NZ_CP020368.1|WP_000777972.1|2625543_2626503_+|class-1b-ribonucleoside-diphosphate-reductase-subunit-beta MKLSRISAINWNKISDDKDLEVWNRLTSNFWLPEKVPLSNDIPAWQTLTVVEQQLTMRVFTGLTLLDTLQNVIGAPSLMPDALTPHEEAVLSNISFMEAVHARSYSSIFSTLCQTKDVDAAYAWSEENAPLQRKAQIIQQHYRGDDPLKKKIASVFLESFLFYSGFWLPMYFSSRGKLTNTADLIRLIIRDEAVHGYYIGYKYQKNMEKISLGQREELKSFAFDLLLELYDNELQYTDELYAETPWADDVKAFLCYNANKALMNLGYKPLFPAEMAEVNPAILAALSPNADENHDFFSGSGSSYVMGKAVETEDEDWNF >NZ_CP020368.1|WP_000246527.1|2623389_2625534_+|class-1b-ribonucleoside-diphosphate-reductase-subunit-alpha MATTTAECLTQETMDYHALNAMLNLYDSAGRIQFDKDRQAVDAFIATHVRPNSVTFSSQQQRLNWLVNEGYYDESVLNRYSRDFVITLFAHAHTSGFRFQTFLGAWKFYTSYTLKTFDGKRYLEDFADRVTMVALTLAQGDETLALQLTDEMLSGRFQPATPTFLNCGKQQRGELVSCFLLRIEDNMESIGRAVNSALQLSKRGGGVAFLLSNLREAGAPIKRIENQSSGVIPVMKMLEDAFSYANQLGARQGAGAVYLHAHHPDILRFLDTKRENADEKIRIKTLSLGVVIPDITFHLAKENAQMALFSPYDVERVYGKPFADVAISQHYDELVADERIRKKYLNARDFFQRLAEIQFESGYPYIMYEDTVNRANPIAGRINMSNLCSEILQVNSASEYDENLDYTRTGHDISCNLGSLNIAHTMDSPDFARTVETAVRGLTAVSDMSHIRSVPSIEAGNAASHAIGLGQMNLHGYLAREGIAYGSPEALDFTNLYFYAITWHALRTSMLLARERGETFAGFKQSRYASGEYFSQYLQGNWQPKTAKVGELFTRSGITLPTREMWAQLRDDVMRYGIYNQNLQAVPPTGSISYINHATSSIHPIVAKVEIRKEGKTGRVYYPAPFMTNENLALYQDAYEIGAEKIIDTYAEATRHVDQGLSLTLFFPDTATTRDINKAQIYAWRKGIKTLYYIRLRQMALEGTEIEGCVSCAL >NZ_CP020368.1|WP_000080947.1|2623006_2623417_+|class-Ib-ribonucleoside-diphosphate-reductase-assembly-flavoprotein-NrdI MSQLVYFSSSSENTQRFIERLGLPAVRIPLNERERIQVDEPYILIVPSYGGGGTAGAVPRQVIRFLNDEHNRALLRGVIASGNRNFGEAYGRAGDVIARKCGVPWLYRFELMGTQSDIENVRKGVTEFWQRQPQNA >NZ_CP020368.1|WP_001223227.1|2622764_2623010_+|glutaredoxin-like-protein-NrdH MRITIYTRNDCVQCHATKRAMENRGFDFEMINVDRVPEAAEALRAQGFRQLPVVIAGDLSWSGFRPDMINRLHPAPHAASA >NZ_CP020368.1|WP_001295174.1|2622187_2622517_+|DUF883-domain-containing-protein MFNRPNRNDVDDGVQDIQNDVNQLADSLESVLKSWGSDAKGEAEAARSKAQALLKETRARMHGRTRVQQAARDAVGCADSFVRERPWCSVGTAAAVGIFIGALLSMRKS >NZ_CP020368.1|WP_001613650.1|2621691_2622036_-|YgaC-family-protein MYLRPDEVARVLEKVGFTVDVVTQKTYGYRRGENYVYVNREARMGRTALVIHPTLKERSSTLAEPASDIKTCDHYQQFPLYLAGERHEHYGIPHGFSSRVALERYLNGLFGEAS >NZ_CP020368.1|WP_000492656.1|2621205_2621655_+|L-alanine-exporter-AlaE MFSPQSRLRHAVADTFAMVVYCSVVNMCIEVFLSGMSFEQSFYSRLVAIPVNILIAWPYGMYRDLFMRAARKVSPSGWIKNLADILAYVTFQSPVYVAILLVVGADWHQIMAAVSSNIVVSMLMGAVYGYFLDYCRRLFKVSRYQQVKA >NZ_CP020368.1|WP_000165699.1|2630457_2631642_+|MFS-transporter MTKPNHELSPALIVLMSIATGLAVASNYYAQPLLDTIARNFSLSASSAGFIVTAAQLGYAAGLLFLVPLGDMFERRRLIVSMTLLAAGGMLITASSQSLAMMILGTALTGLFSVVAQILVPLAATLASPDKRGKVVGTIMSGLLLGILLARTVAGLLANLGGWRTVFWVASVLMALMALALWRGLPQMKSETHLNYPQLLGSVFSMFISDKILRTRALLGCLTFANFSILWTSMAFLLAAPPFNYSDGVIGLFGLAGAAGALGARPAGGFADKGKSHHTTTFGLLLLLLSWLAIWFGHTSVLALIIGILVLDLTVQGVHITNQTVIYRIHPDARNRLTAGYMTSYFIGGAAGSLISASAWQHGGWAGVCLAGATIALVNLLVWWRGFHRQEAAN >NZ_CP020368.1|WP_000445651.1|2631765_2632503_+|AzlC-family-ABC-transporter-permease MESPTPQPAPGSATFMEGCKDSLPIVISYIPVAFAFGLNATRLGFSPLESVFFSCIIYAGASQFVITAMLAAGSSLWIAALTVMAMDVRHVLYGPSLRSRIIQRLQKSKTALWAFGLTDEVFAAATAKLVRNNRRWSENWMIGIAFSSWSSWVFGTVIGAFSGSGLLQGYPAVEAALGFMLPALFMSFLLASFQRKQSLCVTAALVGALAGVTLFSIPVAILAGIVCGCLTALIQAFWQGAPDEL >NZ_CP020368.1|WP_000119763.1|2632492_2632828_+|L-valine-transporter-subunit-YgaH MSYEVLLLGLLVGVANYCFRYLPLRLRVGNARPTKRGAVGILLDTIGIASICALLVVSTAPEVMHDTRRFVPTLVGFAVLGASFYKTRSIIIPTLLSALAYGLAWKVMAII >NZ_CP020368.1|WP_000378442.1|2632918_2633449_+|multidrug-efflux-transporter-EmrAB-transcriptional-repressor-EmrR MDSSFTPIEQMLKFRASRHEDFPYQEILLTRLCMHMQSKLLENRNKMLKAQGINETLFMALITLESQENHSIQPSELSCALGSSRTNATRIADELEKRGWIERRESDNDRRCLHLQLTEKGHEFLREVLPPQHNCLHQLWSALSTTEKDQLEQITRKLLSRLDQMEQDGVVLEAMS >NZ_CP020368.1|WP_001295175.1|2633575_2634748_+|multidrug-efflux-MFS-transporter-periplasmic-adaptor-subunit-EmrA MSANAETQTPQQPVKKSGKRKRLLLLLTLLFIIIAVAIGIYWFLVLRHFEETDDAYVAGNQIQIMSQVSGSVTKVWADNTDFVKEGDVLVTLDPTDARQAFEKAKTALASSVRQTHQLMINSKQLQANIEVQKIALAKAQSDYNRRVPLGNANLIGREELQHARDAVTSAQAQLDVAIQQYNANQAMILGTKLEDQPAVQQAATEVRNAWLALERTRIVSPMTGYVSRRAVQPGAQISPTTPLMAVVPATNMWVDANFKETQIANMRIGQPVTITTDIYGDDVKYTGKVVGLDMGTGSAFSLLPAQNATGNWIKVVQRLPVRIELDQKQLEQYPLRIGLSTLVSVNTTNRDGQVLANKVRSTPVAVSTAREISLAPVNKLIDDIVKANAG >NZ_CP020368.1|WP_001295176.1|2634764_2636303_+|multidrug-efflux-MFS-transporter-permease-subunit-EmrB MQQQKPLEGAQLVIMTIALSLATFMQVLDSTIANVAIPTIAGNLGSSLSQGTWVITSFGVANAISIPLTGWLAKRVGEVKLFLWSTIAFAIASWACGVSSSLNMLIFFRVIQGIVAGPLIPLSQSLLLNNYPPAKRSIALALWSMTVIVAPICGPILGGYISDNYHWGWIFFINVPIGVAVVLMTLQTLRGRETRTERRRIDAVGLALLVIGIGSLQIMLDRGKELDWFSSQEIIILTVVAVVAICFLIVWELTDDNPIVDLSLFKSRNFTIGCLCISLAYMLYFGAIVLLPQLLQEVYGYTATWAGLASAPVGIIPVILSPIIGRFAHKLDMRRLVTFSFIMYAVCFYWRAYTFEPGMDFGASAWPQFIQGFAVACFFMPLTTITLSGLPPERLAAASSLSNFTRTLAGSIGTSITTTMWTNRESMHHAQLTESVNPFNPNAQAMYSQLEGLGMTQQQASGWIAQQITNQGLIISANEIFWMSAGIFLVLLGLVWFAKPPFGAGGGGGGAH >NZ_CP020368.1|WP_001130211.1|2636366_2636882_-|S-ribosylhomocysteine-lyase MPLLDSFTVDHTRMEAPAVRVAKTMNTPHGDAITVFDLRFCVPNKEVMPERGIHTLEHLFAGFMRNHLNGNGVEIIDISPMGCRTGFYMSLIGTPDEQRVADAWKAAMEDVLKVQDQNQIPELNVYQCGTYQMHSLQEAQDIARSILERDVRINSNEELALPKEKLQELHI >NZ_CP020368.1|WP_000611804.1|2637031_2638588_-|glutamate--cysteine-ligase MIPDVSQALAWLEKHPQALKGIQRGLERETLRVNADGTLATTGHPEALGSALTHKWITTDFAEALLEFITPVDGDIEHMLTFMRDLHRYTARNMGDERMWPLSMPCYIAEGQDIELAQYGTSNTGRFKTLYREGLKNRYGALMQTISGVHYNFSLPMAFWQAKCGDISGADAKEKISAGYFRVIRNYYRFGWVIPYLFGASPAICSSFLQGKPTSLPFEKTECGMYYLPYATSLRLSDLGYTNKSQSNLGITFNDLYEYVAGLKQAIKTPSEEYAKIGIEKDGKRLQINSNVLQIENELYAPIRPKRVTRSGESPSDALLRGGIEYIEVRSLDINPFSPIGVDEQQVRFLDLFMVWCALADAPEMSSSELACTRVNWNRVILEGRKPGLTLGIGCETAQFPLPQVGKDLFRDLKRVAQTLDSINGGEAYQKVCDELVACFDNPDLTFSARILRSMIDTGIGGTGKAFAEAYRNLLREEPLEILREEDFVAEREASERRQQEMEAADTEPFAVWLEKHA >NZ_CP020368.1|WP_001287454.1|2638660_2639089_-|DedA-family-protein MSEALSLFSLFASSFLSATLLPGNSEVVLVAMLLSGISHPWVLVLTATMGNSLGGLTNVILGRFFPLRKTSRWQEKATGWLKRYGAVTLLLSWMPVVGDLLCLLAGWMRISWGPVIFFLCLGKALRYVAVAAATVQGMMWWH >NZ_CP020368.1|WP_000273290.1|2639085_2639652_-|fructose-1-phosphate/6-phosphogluconate-phosphatase MYERYAGLIFDMDGTILDTEPTHRKAWREVLGHYGLQYDIQAMIALNGSPTWRIAQAIIELNQADLDPHALAREKTEAVRSMLLDSVEPLPLVDVVKSWHGRRPMAVGTGSESAIAEALLAHLGLRHYFDAVVAADHVKHHKPAPDTFLLCAQRMGVQPTQCVVFEDADFGIQAARAAGMDAVDVRLL |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP020368_5 | 2702148-2702910 | TypeI-E |
I-E
Consensus repeat of NZ_CP020368_5
|
12 spacers
spacers of NZ_CP020368_5
>5.1|2702177|32|NZ_CP020368|CRISPRCasFinder,CRT CAGCGTCAGGCGTGAAATCTCACCGTCGTTGC >5.2|2702238|32|NZ_CP020368|CRISPRCasFinder,CRT TCGGTTCAGGCGTTGCAAACCTGGCTACCGGG >5.3|2702299|32|NZ_CP020368|CRISPRCasFinder,CRT GTAGTCCATCATTCCACCTATGTCTGAACTCC >5.4|2702360|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR CGGGGGGATAATGTTTACGGTCATGCGCCCCC >5.5|2702421|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR TGGGCGGCTTGCCTTGCAGCCAGCTCCAGCAG >5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR AAGCTGGCTGGCAATCTCTTTCGGGGTGAGTC >5.7|2702543|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR TAGTTTCCGTATCTCCGGATTTATAAAGCTGA >5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR GCAGGCGGCGACGCGCAGGGTATGCGCGATTCG >5.9|2702666|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR GCGACCGCTCAGAAATTCCAGACCCGATCCAAA >5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR TCAACATTATCAATTACAACCGACAGGGAGCC >5.11|2702789|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR AGCGTGTTCGGCATCACCTTTGGCTTCGGCTG >5.12|2702850|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR TGCGTGAGCGTATCGCCGCGCGTCTGCGAAAG |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around NZ_CP020368_5
The CRISPR arrays of NZ_CP020368_5 >merge|NZ_CP020368|5|2702148-2702910|CRISPRCasFinder,CRT,PILER-CR CGGTTTATCCCCGCTGATGCGGGGAACACCAGCGTCAGGCGTGAAATCTCACCGTCGTTGCCGGTTTATCCCTGCTGGCGCGGGGAACTCTCGGTTCAGGCGTTGCAAACCTGGCTACCGGGCGGTTTATCCCCGCTAACGCGGGGAACTCGTAGTCCATCATTCCACCTATGTCTGAACTCCCGGTTTATCCCCGCTGGCGCGGGGAACTCCGGGGGGATAATGTTTACGGTCATGCGCCCCCCGGTTTATCCCCGCTGGCGCGGGGAACTCTGGGCGGCTTGCCTTGCAGCCAGCTCCAGCAGCGGTTTATCCCCGCTGGCGCGGGGAACTCAAGCTGGCTGGCAATCTCTTTCGGGGTGAGTCCGGTTTATCCCCGCTGGCGCGGGGAACTCTAGTTTCCGTATCTCCGGATTTATAAAGCTGACGGTTTATCCCCGCTGGCGCGGGGAACTCGCAGGCGGCGACGCGCAGGGTATGCGCGATTCGCGGTTTATCCCCGCTGGCGCGGGGAACTCGCGACCGCTCAGAAATTCCAGACCCGATCCAAACGGTTTATCCCCGCTGGCGCGGGGAACTCTCAACATTATCAATTACAACCGACAGGGAGCCCGGTTTATCCCCGCTGGCGCGGGGAACTCAGCGTGTTCGGCATCACCTTTGGCTTCGGCTGCGGTTTATCCCCGCTGGCGCGGGGAACTCTGCGTGAGCGTATCGCCGCGCGTCTGCGAAAGCGGTTTATCCCCGCTGGCGCGGGGAACTC >NZ_CP020368|5|5|2702148-2702910|CRISPRCasFinder CGGTTTATCCCCGCTGATGCGGGGAACAC CAGCGTCAGGCGTGAAATCTCACCGTCGTTGC CGGTTTATCCCTGCTGGCGCGGGGAACTC TCGGTTCAGGCGTTGCAAACCTGGCTACCGGG CGGTTTATCCCCGCTAACGCGGGGAACTC GTAGTCCATCATTCCACCTATGTCTGAACTCC CGGTTTATCCCCGCTGGCGCGGGGAACTC CGGGGGGATAATGTTTACGGTCATGCGCCCCC CGGTTTATCCCCGCTGGCGCGGGGAACTC TGGGCGGCTTGCCTTGCAGCCAGCTCCAGCAG CGGTTTATCCCCGCTGGCGCGGGGAACTC AAGCTGGCTGGCAATCTCTTTCGGGGTGAGTC CGGTTTATCCCCGCTGGCGCGGGGAACTC TAGTTTCCGTATCTCCGGATTTATAAAGCTGA CGGTTTATCCCCGCTGGCGCGGGGAACTC GCAGGCGGCGACGCGCAGGGTATGCGCGATTCG CGGTTTATCCCCGCTGGCGCGGGGAACTC GCGACCGCTCAGAAATTCCAGACCCGATCCAAA CGGTTTATCCCCGCTGGCGCGGGGAACTC TCAACATTATCAATTACAACCGACAGGGAGCC CGGTTTATCCCCGCTGGCGCGGGGAACTC AGCGTGTTCGGCATCACCTTTGGCTTCGGCTG CGGTTTATCCCCGCTGGCGCGGGGAACTC TGCGTGAGCGTATCGCCGCGCGTCTGCGAAAG CGGTTTATCCCCGCTGGCGCGGGGAACTC >NZ_CP020368|5|1|2702148-2702910|CRT CGGTTTATCCCCGCTGATGCGGGGAACAC CAGCGTCAGGCGTGAAATCTCACCGTCGTTGC CGGTTTATCCCTGCTGGCGCGGGGAACTC TCGGTTCAGGCGTTGCAAACCTGGCTACCGGG CGGTTTATCCCCGCTAACGCGGGGAACTC GTAGTCCATCATTCCACCTATGTCTGAACTCC CGGTTTATCCCCGCTGGCGCGGGGAACTC CGGGGGGATAATGTTTACGGTCATGCGCCCCC CGGTTTATCCCCGCTGGCGCGGGGAACTC TGGGCGGCTTGCCTTGCAGCCAGCTCCAGCAG CGGTTTATCCCCGCTGGCGCGGGGAACTC AAGCTGGCTGGCAATCTCTTTCGGGGTGAGTC CGGTTTATCCCCGCTGGCGCGGGGAACTC TAGTTTCCGTATCTCCGGATTTATAAAGCTGA CGGTTTATCCCCGCTGGCGCGGGGAACTC GCAGGCGGCGACGCGCAGGGTATGCGCGATTCG CGGTTTATCCCCGCTGGCGCGGGGAACTC GCGACCGCTCAGAAATTCCAGACCCGATCCAAA CGGTTTATCCCCGCTGGCGCGGGGAACTC TCAACATTATCAATTACAACCGACAGGGAGCC CGGTTTATCCCCGCTGGCGCGGGGAACTC AGCGTGTTCGGCATCACCTTTGGCTTCGGCTG CGGTTTATCCCCGCTGGCGCGGGGAACTC TGCGTGAGCGTATCGCCGCGCGTCTGCGAAAG CGGTTTATCCCCGCTGGCGCGGGGAACTC >NZ_CP020368|5|1|2702331-2702910|PILER-CR CGGTTTATCCCCGCTGGCGCGGGGAACTC CGGGGGGATAATGTTTACGGTCATGCGCCCCC CGGTTTATCCCCGCTGGCGCGGGGAACTC TGGGCGGCTTGCCTTGCAGCCAGCTCCAGCAG CGGTTTATCCCCGCTGGCGCGGGGAACTC AAGCTGGCTGGCAATCTCTTTCGGGGTGAGTC CGGTTTATCCCCGCTGGCGCGGGGAACTC TAGTTTCCGTATCTCCGGATTTATAAAGCTGA CGGTTTATCCCCGCTGGCGCGGGGAACTC GCAGGCGGCGACGCGCAGGGTATGCGCGATTCG CGGTTTATCCCCGCTGGCGCGGGGAACTC GCGACCGCTCAGAAATTCCAGACCCGATCCAAA CGGTTTATCCCCGCTGGCGCGGGGAACTC TCAACATTATCAATTACAACCGACAGGGAGCC CGGTTTATCCCCGCTGGCGCGGGGAACTC AGCGTGTTCGGCATCACCTTTGGCTTCGGCTG CGGTTTATCCCCGCTGGCGCGGGGAACTC TGCGTGAGCGTATCGCCGCGCGTCTGCGAAAG CGGTTTATCCCCGCTGGCGCGGGGAACTC
>NZ_CP020368.1|WP_000490428.1|2701027_2702065_+|aminopeptidase MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEIFDKAGIAVLSVEATNWNLGNKDGYQQRAKTPAFPAGNSWHDVRLDNHQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >NZ_CP020368.1|WP_000372108.1|2699867_2700776_-|sulfate-adenylyltransferase-subunit-CysD MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >NZ_CP020368.1|WP_001090361.1|2698438_2699866_-|sulfate-adenylyltransferase-subunit-CysN MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEETFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMPWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >NZ_CP020368.1|WP_001173673.1|2697833_2698439_-|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >NZ_CP020368.1|WP_001246104.1|2697460_2697784_-|DUF3561-family-protein MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >NZ_CP020368.1|WP_000517476.1|2696955_2697267_-|cell-division-protein-FtsB MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR >NZ_CP020368.1|WP_000246138.1|2696226_2696937_-|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >NZ_CP020368.1|WP_001219242.1|2695747_2696227_-|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >NZ_CP020368.1|WP_000568943.1|2694701_2695751_-|tRNA-pseudouridine(13)-synthase-TruD MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >NZ_CP020368.1|WP_001295182.1|2693959_2694721_-|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVTCSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW >NZ_CP020368.1|WP_001381369.1|2703015_2703300_-|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMLVVVTENVPPRLRGRLAIWLLEVRAGVYVGDVSAKIREMIWEQIAGLAEEGNVVMAWATNTETGFEFQTFGLNRRTPVDLDGLRLVSFLPV >NZ_CP020368.1|WP_000220066.1|2703301_2704219_-|type-I-E-CRISPR-associated-endonuclease-Cas1 MTWLPLNPIPLKDRVSMIFLQYGQIDVIDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVRLAAQVGTLLVWVGEAGVRVYASGQPGGARSDKLLYQAKLALDEDLRLKVVRKMFELRFGEPAPARRSVEQLRGIEGSRVRATYALLAKQYGVTWNGRRYDPKDWEKGDTINQCISAATSCLYGVTEAAILAAGYAPAIGFVHTGKPLSFVYDIADIIKFDTVVPKAFEIARRNPGEPDREVRLACRDIFRSSKTLAKLIPLIEDVLAAGEIQPPAPPEDAQPVAIPLPVSLGDAGHRSS >NZ_CP020368.1|WP_000281400.1|2704234_2704834_-|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSKVIIARAWSRDLYQLHQGLWHLFPNRPDAARDFLFHVEKRNTPEGCHVLLQSAQMPVSTAVATVIKTKQVEFQLQVGVPLYFRLRANPIKTILDNQKRLDSKGNIKRCRVPLIKEAEQIAWLQRKLGNAARVEDVHPISERPQYFSGDGKSGKIQTVCFEGVLTINDAPALIDLVQQGIGPAKSMGCGLLSLAPL >NZ_CP020368.1|WP_001334996.1|2704820_2705495_-|type-I-E-CRISPR-associated-protein-Cas5/CasD MRSYLILRLAGPMQAWGQPTFEGTRPTGRFPTRSGLLGLLGACLGIQRDDTSSLQALSESVQFAVRCDELILDDRRVSVTGLRDYHTVLGAREDYRGLKSHETIQTWREYLCDASFTVALWLTPHATMVISELEKAVLKPRYTPYLGRRSCPLTHPLFLGTCQASDPQKALLNYEPVGGDIYSEESVTGHHLKFTARDEPMITLPRQFASREWYVIKGGMDVSQ >NZ_CP020368.1|WP_000064450.1|2705497_2706589_-|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MSNFINIHVLISHSPSCLNRDDMNMQKDAIFGGKRRVRISSQSLKRAMRKSGYYAQNIGESSLRTIHLAQLRDVLRQKLGERFDQKIIDKTLALLSGKSVDEAEKISADAVTPWVVGEIAWFCEQVAKAEADNLDDKKLLKVLKEDIAAIRVNLQQGVDIALSGRMATSGMMTELGKVDGAMSIAHAITTHQVDSDIDWFTAVDDLQEQGSAHLGTQEFSSGVFYRYANINLAQLQENLGGASREQALEIATHVVHMLATEVPGAKQRTYAAFNPADMVMVNFSDMPLSMANAFEKAVKAKDGFLQPSIQAFNQYWDRVANGYGLNGAAAQFSLSDVDPITAQVKQMPTLEQLKSWVRNNGEA >NZ_CP020368.1|WP_000752800.1|2706601_2707084_-|type-I-E-CRISPR-associated-protein-Cse2/CasB MADEIDAMALYRAWQQLDNGSCAQIRRVSEPDELRDIPAFYRLVQPFGWENPRHQQALLRMVFCLSAGKNVIRHQDKKSEQTTGISLGRALANSGRINERRIFQLIRADRTADMVQLRRLLTHAEPVLDWPLMARMLTWWGKRERQQLLEDFVLTTNKNA >NZ_CP020368.1|WP_001050401.1|2707076_2708585_-|type-I-E-CRISPR-associated-protein-Cse1/CasA MNLLIDNWIPVRPRNGGKVQIINLQSLYCSRDQWRLSLPRDDMELAALALLVCIGQIIAPAKDDVEFRHRIMNPLTEDEFQQLIAPWIDMFYLNHAEHPFMQTKGVKANDVTPMEKLLAGVSGATNCAFVNQPGQGEALCGGCTAIALFNQANQAPGFGGGFKSGLRGGTPVTTFVRGIDLRSTVLLNVLTLPRLQKQFPNESHTENQPTWIKPIKSNESIPASSIGFVRGLFWQPAHIELCDPIGIGKCSCCGQESNLRYTGFLKEKFTFTVNGLWPHPHSPCLVTVKKGEVEEKFLAFTTSAPSWTQISRVVVDKIIQNENGNRVAAVVNQFRNIAPQSPLELIMGGYRNNQASILERRHDVLMFNQGWQQYGNVINEIVTVGLGYKTALRKALYTFAEGFKNKDFKGAGVSVHETAERHFYRQSELLIPDVLANVNFSQADEVIADLRDKLHQLCEMLFNQSVAPYAHHPKLISTLALARATLYKHLRELKPQGGPSNG >NZ_CP020368.1|WP_000433152.1|2708999_2711666_-|CRISPR-associated-helicase/endonuclease-Cas3 MEPFKYICHYWGKSSKSLTKGNDIHLLIYHCLDVAAVADCWWDQSVVLQNTFCRNEMLSKQRVKAWLLFFIALHDIGKFDIRFQYKSAESWLKLNPATPSLNGPSTQMCRKFNHGAAGLYWFNQDSLSEQSLGDFFSFFDAAPHPYESWFPWVEAVTGHHGFILHSQDQDKSRWEMPASLASYAAQDKQAREEWISVLEALFLTPAGLSINDIPPDCSSLLAGFCSLADWLGSWTTTNTFLFNEDAPSDINALRTYFQDRQQDASRVLELSGLVSNKRCYEGVHALLDNGYQPRQLQVLVDALPVAPGLTVIEAPTGSGKTETALAYAWKLIDQQIADSVIFALPTQATANAMLTRMEASASHLFSSPNLILAHGNSRFNHLFQSIKSRAITEQGQEEAWVQCCQWLSQSNKKVFLGQIGVCTIDQVLISVLPVKHRFIRGLGIGRSVLIVDEVHAYDTYMNGLLEAVLKAQADVGGSVILLSATLPMKQKQKLLDTYGLHTDPVENNSAYPLINWRGVNGAQRFDLLAHPEQLPPRFSIQPEPICLADMLPDLTMLERMIAAANAGAQVCLICNLVDVAQVCYQRLKELNNTQVDIDLFHARFTLNDRREKENRVISNFGKNGKRNVGRILVATQVVEQSLDVDFDWLITQHCPADLLFQRLGRLHRHHRKYRPAGFEIPVATILLPDGEGYGRHEHIYSNVRVMWRTQQHIEELNGASLFFPDAYRQWLDSIYDDAEMDEPEWVGNGMDKFESAECEKRFKARKVLQWAEEYSLQDNDETILAVTRDGEMSLPLLPYVQTSSGKQLLDGQVYEDLSHEQQYEALALNRVNVPFTWKRSFSEVVDEDGLLWLEGKQNLDGWVWQGNSIVITYTGDEGMTRVIPANPK >NZ_CP020368.1|WP_000039850.1|2712024_2712759_-|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >NZ_CP020368.1|WP_001290679.1|2712833_2714546_-|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLTDAERMKHESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPARPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP020368_6 | 2728461-2728854 | Unclear |
I-E
Consensus repeat of NZ_CP020368_6
|
6 spacers
spacers of NZ_CP020368_6
>6.1|2728489|33|NZ_CP020368|CRISPRCasFinder,CRT GACAGAACGGCCTCAGTAGTCTCGTCAGGCTCC >6.2|2728550|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR CTGTTTTCGCAAATCTATGGACTATTGCTATTC >6.3|2728611|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR GGGCGCACGGAATACAAAGCCGTGTATCTGCTC >6.4|2728672|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR TGGCTCTGCAACAGCAGCACCCATGACCACGTC >6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR GAAATGCTGGTGAGCGTTAATGCCGCAAACACA >6.6|2728794|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR ATTACGCCTTTTTGCGATTGCCCGGTTTTTGCC |
CRISPR arrays and Neighbor proteins around NZ_CP020368_6
The CRISPR arrays of NZ_CP020368_6 >merge|NZ_CP020368|6|2728461-2728854|CRISPRCasFinder,CRT,PILER-CR GGTTTATCCCCGCTGGCGCGGGGAACTCGACAGAACGGCCTCAGTAGTCTCGTCAGGCTCCGGTTTATCCCCGCTGGCGCGGGGAACACCTGTTTTCGCAAATCTATGGACTATTGCTATTCGGTTTATCCCCGCTGGCGCGGGGAACACGGGCGCACGGAATACAAAGCCGTGTATCTGCTCGGTTTATCCCCGCTGGCGCGGGGAACACTGGCTCTGCAACAGCAGCACCCATGACCACGTCGGTTTATCCCCGCTGGCGCGGGGAACACGAAATGCTGGTGAGCGTTAATGCCGCAAACACAGGTTTATCCCCGCTGGCGCGGGGAACACATTACGCCTTTTTGCGATTGCCCGGTTTTTGCCGGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP020368|6|6|2728461-2728854|CRISPRCasFinder GGTTTATCCCCGCTGGCGCGGGGAACTC GACAGAACGGCCTCAGTAGTCTCGTCAGGCTCC GGTTTATCCCCGCTGGCGCGGGGAACAC CTGTTTTCGCAAATCTATGGACTATTGCTATTC GGTTTATCCCCGCTGGCGCGGGGAACAC GGGCGCACGGAATACAAAGCCGTGTATCTGCTC GGTTTATCCCCGCTGGCGCGGGGAACAC TGGCTCTGCAACAGCAGCACCCATGACCACGTC GGTTTATCCCCGCTGGCGCGGGGAACAC GAAATGCTGGTGAGCGTTAATGCCGCAAACACA GGTTTATCCCCGCTGGCGCGGGGAACAC ATTACGCCTTTTTGCGATTGCCCGGTTTTTGCC GGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP020368|6|2|2728461-2728854|CRT GGTTTATCCCCGCTGGCGCGGGGAACTC GACAGAACGGCCTCAGTAGTCTCGTCAGGCTCC GGTTTATCCCCGCTGGCGCGGGGAACAC CTGTTTTCGCAAATCTATGGACTATTGCTATTC GGTTTATCCCCGCTGGCGCGGGGAACAC GGGCGCACGGAATACAAAGCCGTGTATCTGCTC GGTTTATCCCCGCTGGCGCGGGGAACAC TGGCTCTGCAACAGCAGCACCCATGACCACGTC GGTTTATCCCCGCTGGCGCGGGGAACAC GAAATGCTGGTGAGCGTTAATGCCGCAAACACA GGTTTATCCCCGCTGGCGCGGGGAACAC ATTACGCCTTTTTGCGATTGCCCGGTTTTTGCC GGTTTATCCCCGCTGGCGCGGGGAACAC >NZ_CP020368|6|2|2728522-2728854|PILER-CR GGTTTATCCCCGCTGGCGCGGGGAACAC CTGTTTTCGCAAATCTATGGACTATTGCTATTC GGTTTATCCCCGCTGGCGCGGGGAACAC GGGCGCACGGAATACAAAGCCGTGTATCTGCTC GGTTTATCCCCGCTGGCGCGGGGAACAC TGGCTCTGCAACAGCAGCACCCATGACCACGTC GGTTTATCCCCGCTGGCGCGGGGAACAC GAAATGCTGGTGAGCGTTAATGCCGCAAACACA GGTTTATCCCCGCTGGCGCGGGGAACAC ATTACGCCTTTTTGCGATTGCCCGGTTTTTGCC GGTTTATCCCCGCTGGCGCGGGGAACAC
>NZ_CP020368.1|WP_000039688.1|2726342_2727821_+|kinase MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDARAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNHFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDNMVRVKDIFIPIESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNVDSIQSWSNA >NZ_CP020368.1|WP_001164544.1|2725038_2726316_+|MFS-transporter MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQIAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSVLGLLTLTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFTFLLFQKIRTADSAPAMASSK >NZ_CP020368.1|WP_000021334.1|2723934_2724720_-|SDR-family-oxidoreductase MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVGITAEGAPQKIIAACCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASPASNYVNGHLLVVDGGYLVR >NZ_CP020368.1|WP_000059307.1|2722410_2723865_-|FAD-linked-oxidoreductase MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREIMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKVTGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >NZ_CP020368.1|WP_001098105.1|2720979_2722317_-|MFS-transporter MNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >NZ_CP020368.1|WP_001299652.1|2720222_2721002_-|electron-transfer-flavoprotein-subunit-beta/FixA-family-protein MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWQDYLRQRMQP >NZ_CP020368.1|WP_001299097.1|2719365_2720226_-|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLNIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >NZ_CP020368.1|WP_001130266.1|2718642_2719218_+|glycerol-3-phosphate-responsive-antiterminator MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >NZ_CP020368.1|WP_000109529.1|2718365_2718626_+|ferredoxin-family-protein MSVARNLWRVADAPHIVPADSVERQTAERLINACPAGLFSLTPEGNLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >NZ_CP020368.1|WP_001301334.1|2717103_2718375_+|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERRITHESLSLLTPDGVTTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGRICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL >NZ_CP020368.1|WP_001199973.1|2729193_2729865_-|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVIGRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >NZ_CP020368.1|WP_001288228.1|2730003_2730144_+|hypothetical-protein MSEENKENGFNHVKTFTKIIFIFSVLVFNDNEYKITDAAVNLFIQI >NZ_CP020368.1|WP_001268460.1|2730157_2731030_+|YgcG-family-protein MRYFILMFTFVCSFVAAQPTIVPQLQQQVTDLTSSLNSQEKKELTHKLESIFNNTQVQIAVLIVPTTKDETIEQYATRVFDNWRLGDAKRNDGILIVVAWSDRTVRIQVGYGLEEKVTDALAGDIIRSNMIPAFKQQKLAKGLELAINALNNQLTSQHQYPTNPSESESASSSDHYYFAIFWVFAVMFFPFWFFHQGSNFCRACKSGVCISAIYLLDLFLFSDKIFSIAVFSFFFTFTIFMVFTCLCVLQKRASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASGRW >NZ_CP020368.1|WP_000036723.1|2731089_2732388_-|phosphopyruvate-hydratase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >NZ_CP020368.1|WP_000210878.1|2732475_2734113_-|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK >NZ_CP020368.1|WP_001071638.1|2734340_2735132_-|nucleoside-triphosphate-pyrophosphohydrolase MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLACWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >NZ_CP020368.1|WP_000254738.1|2735202_2735538_-|endoribonuclease-MazF MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >NZ_CP020368.1|WP_000581937.1|2735537_2735786_-|type-II-toxin-antitoxin-system-antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >NZ_CP020368.1|WP_000226815.1|2735863_2738098_-|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >NZ_CP020368.1|WP_000046810.1|2738145_2739447_-|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHDVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILAPQLEALLPKVRACLGSLQAMRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALELLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP020368_7 | 4387704-4387853 | Orphan |
NA
Consensus repeat of NZ_CP020368_7
|
1 spacers
spacers of NZ_CP020368_7
>7.1|4387758|42|NZ_CP020368|CRISPRCasFinder TGCCGCATCCGACAATAACAGCATTGCCTGATGCGACGCTTG |
CRISPR arrays and Neighbor proteins around NZ_CP020368_7
The CRISPR arrays of NZ_CP020368_7 >merge|NZ_CP020368|7|4387704-4387853|CRISPRCasFinder CGCGTCTTATCAGGCCTACGAGTTCGGTGCTGTGTAGGTCGGATAAGGCGTTCATGCCGCATCCGACAATAACAGCATTGCCTGATGCGACGCTTGCGCGTCTTATCAGGCCTACGAGTTCAGTGCTGTGTAGGTCGGATAAGGCGTTCA >NZ_CP020368|7|7|4387704-4387853|CRISPRCasFinder CGCGTCTTATCAGGCCTACGAGTTCGGTGCTGTGTAGGTCGGATAAGGCGTTCA TGCCGCATCCGACAATAACAGCATTGCCTGATGCGACGCTTG CGCGTCTTATCAGGCCTACGAGTTCAGTGCTGTGTAGGTCGGATAAGGCGTTCA
>NZ_CP020368.1|WP_000786393.1|4387092_4387536_-|DNA-polymerase-III-subunit-chi MKNATFYLLDNDTTIDGLSAVEQLVCEIAAERWRSGKRVLIACEDEKQAYRLDEALWARPAESFVPHNLAGEGPRGGAPVEIAWPQKRSSSPRDILISLRTSFADFATAFTEVVDFVPYEDSLKQLARERYKAYRVAGFNLNTATWK >NZ_CP020368.1|WP_000416403.1|4384237_4387093_-|valine--tRNA-ligase MEKTYNPQDIEQPLYEHWEKQGYFKPNGDESQESFCIMIPPPNVTGSLHMGHAFQQTIMDTMIRYQRMQGKNTLWQVGTDHAGIATQMVVERKIAAEEGKTRHDYGREAFIDKIWEWKAESGGTITRQMRRLGNSVDWERERFTMDEGLSNAVKEVFVRLYKEDLIYRGKRLVNWDPKLRTAISDLEVENRESKGSMWHIRYPLADGAKTADGKDYLVVATTRPETLLGDTGVAVNPEDPRYKDLIGKYVILPLVNRRIPIVGDEHADMEKGTGCVKITPAHDFNDYEVGKRHALPMINILTFDGDIRESAQVFDTKGNESDVYSSEIPAEFQKLERFAARKAVVAAVDALGLLEEIKPHDLTVPYGDRGGVVIEPMLTDQWYVRADVLAKPAVEAVENGDIQFVPKQYENMYFSWMRDIQDWCISRQLWWGHRIPAWYDEAGNVYVGRNEEEVRKENNLGADVALRQDEDVLDTWFSSALWTFSTLGWPENTDALRQFHPTSVMVSGFDIIFFWIARMIMMTMHFIKDENGKPQVPFHTVYMTGLIRDDEGQKMSKSKGNVIDPLDMVDGISLPELLEKRTGNMMQPQLADKIRKRTEKQFPNGIEPHGTDALRFTLAALASTGRDINWDMKRLEGYRNFCNKLWNASRFVLMNTEGQDCGFNGGEMTLSLADRWILAEFNQTIKAYREALDSFRFDIAAGILYEFTWNQFCDWYLELTKPVMNGGTEAELRGTRHTLVTVLEGLLRLAHPIIPFITETIWQRVKVLCGITADTIMLQPFPQYDASQVDEAALADTEWLKQAIVAVRNIRAEMNIAPGKPLELLLRGCSADAERRVNENRGFLQTLARLESITVLPADDKGPVSVAKIIDGAELLIPMAGLINKEDELARLAKEVAKIEGEISRIENKLANEGFVARAPEAVIAKEREKLEGYAEAKAKLIEQQAVIAAL >NZ_CP020368.1|WP_000079628.1|4382985_4384182_+|DUF898-domain-containing-protein MAQVINEMDVPSHSFVFHGTGERYFLICVVNVLLTIITLGIYLPWALMKCKRYLYANMEVNGQRFSYGITGGNVFFSCLVFVFFYFAILMTVSADMPLVGCVLTLSLLVLLIFMAAKGLRYQALMTSLNGVRFSFNCSMKGFWWVTFFLPILMAIGMGTVFFISTKMLHANSSSSVIISVVLMAIVGIVSIGIFNGTLYSLVMSFLWSNTSFGIHRFKVKLDTTYCIKYAILAFLALLPFLAVAGYIIFDQILNAYDSSVYANDDIENLQQFMEMQRKMIIAQLIYYFGIAVSTSYLTVSLRNHFMSNLSLNDGRIRFRSTLTYHGMLYRMCALVVISGITGGLAYPLLKIWMIDWQAKNTYLLGDLDDLPLINKEEQPDKGFLASISRGIMPSLPFL >NZ_CP020368.1|WP_001059397.1|4382289_4382793_-|GNAT-family-N-acetyltransferase MNNIAPQSPVMRRLTLQDNPAIARVIRQVSAEYGLTADKGYTVADPNLDELYQVYSQPGHAYWVVEYEGEVVGGGGIAPLAGSESDICELQKMYFLPAIRGKGLAKKLALKAMEEAREMGFKRCYLETTAFLKEAIGLYEHLGFQHIDYALGCTGHVDCEVRMLREL >NZ_CP020368.1|WP_000002953.1|4381827_4382244_+|ribonuclease-E-inhibitor-RraB MANPEQLEEQREETRLIIEELLEDGSDPDALYTIEHHLSADDLETLEKAAVEAFKLGYEVTDPEELEVEDGDIVICCDILSECALNADLIDAQVEQLMTLAEKFDVEYDGWGTYFEDPNGEDGDDEDFVDEDDDGVRH >NZ_CP020368.1|WP_000012907.1|4380661_4381666_-|ornithine-carbamoyltransferase MSGFYHKHFLKLLDFTPAELNSLLQLAAKLKADKKSGKEEAKLTGKNIALIFEKDSTRTRCSFEVAAYDQGARVTYLGPSGSQIGHKESIKDTARVLGRMYDGIQYRGYGQEIVETLAEYAGVPVWNGLTNEFHPTQLLADLLTMQEHLPGKAFNEMTLVYAGDARNNMGNSMLEAAALTGLDLRLVAPQACWPEAALVTECRALAQQNGGNITLTEDVAKGVEGADFIYTDVWVSMGEAKEKWAERIALLRDYQVNSKMMQLTGNPEVKFLHCLPAFHDDQTTLGKKMAEEFGLHGGMEVTDEVFESAASIVFDQAENRMHTIKAVMVATLSK >NZ_CP020368.1|WP_000036440.1|4378936_4380589_+|hypothetical-protein MSKISDLNYSQHITLADNFKQKSEVLNTWRVGMNNFARNAEGQDNTRNILDPKTFLEFLVKIFTLGYVDFSKRSNEAGRNMMAHIESSSYIKNNDGSEIMKFVMNNPEGERADLSKVEIEITLSAFTTMGTRQGHTAIIFQQPDGSTNRYEGKSFERKDESSLHLITNKILACYQREANKEIARLLNIPQELNNSQDLNNSQVSCKDSVDSTITDLLEKPLNNALLAIRKEHLLLMPYVCNESISYLLGEKGILKEIDDLNAVNNYLLNNKKATDNEINDIKVNLSHILIDSLDDAKVNLTPVIDSILETFLKSPYINDVRILDWCFNKRMQYFGDSEKIKYACSVINHIDFSRDQSKDFSCDQSKIKIAETLFFNLDKEPYKNSRKLQELIWDKLVAYVNDFNLSNQEKSRLILRLFDDVKLLFDEVPVSILVNDIFLKGFFMKQPDFAKWYFYQLLKKYEGEQLYLNELGYVYGNEEKTNEIVKKHPGYVVEIFEEKMGNELKIRTRMMEILRDGKINICEYINKEQLEKLNPPEDLRIAIKKLGWNN >NZ_CP020368.1|WP_001319730.1|4378361_4378814_+|DUF386-domain-containing-protein MIIGNIHNLQPWLPQELRQAIEHIKAHVTAETPKGKHDIEGNRLFYLISEDMTEPYEARRAEYHARYLDIQIVLRGQEGMTFSTQPAGTPDTDWLADKDIAFLPEGVDEKTVILNEGDFVVFYPGEVHKPLCAVGAPARVRKAVVKMLMA >NZ_CP020368.1|WP_001319729.1|4377623_4378217_+|TetR/AcrR-family-transcriptional-regulator MVTKKQSRVPGRPRRFAPEQAVSAAKVLFHQKGFDAVSVAEVTDYLGINPPSLYAAFGNKAGLFSRVLNEYVGTEAIPLADILRDDRPVGECLAEVLKEAARRYSQNGGCAGCMVLEGIHSHDPQARDIAVQYYHAAETTIYDYIARRHPQSAQCVTDFMSTVMSGLSAKAREGHSIEQLCATAALAGEAIKTILKE >NZ_CP020368.1|WP_000500685.1|4376839_4377553_-|SDR-family-oxidoreductase MGAFTGKTVLILGGSRGIGAAIVRRFVTDGANVRFTYAGSKDAAEHLAQETGATAVFTDSADRDAVIDVVRKSGALDILVVNAGIGVFGDALELNADDIDRLFKINIHAPYHASVEAARQMPEGGRILIIGSVNGDRMPVAGMAAYAASKSALQGMARGLARDFGPRGITINVVQPGPIDTDANPANGPMRDMLHGFMAIKRHGQPEEVAGMVAWLAGPEASFVTGAMHTIDGAFGA >NZ_CP020368.1|WP_000397144.1|4387889_4389401_-|leucyl-aminopeptidase MEFSVKSGSPEKQRSACIVVGVFEPRRLSPIAEQLDKISDGYISALLRRGELEGKPGQTLLLHHVPNVLSERILLIGCGKERELDERQYKQVIQKTINTLNDTGSMEAVCFLTELHVKGRNNYWKVRQAVETAKETLYSFDQLKTNKSEPRRPLRKMVFNVPTRRELTSGERAIQHGLAIAAGIKAAKDLGNMPPNICNAAYLASQARQLADSYSKNVITRVIGEQQMKELGMHSYLAVGQGSQNESLMSVIEYKGNASEDARPIVLVGKGLTFDSGGISIKPSEGMDEMKYDMCGAAAVYGVMRMVAELQLPINVIGVLAGCENMPGGRAYRPGDVLTTMSGQTVEVLNTDAEGRLVLCDVLTYVERFEPEAVIDVATLTGACVIALGHHITGLMANHNPLAHELIAASEQSGDRAWRLPLGDEYQEQLESNFADMANIGGRPGGAITAGCFLSRFTRKYNWAHLDIAGTAWRSGKAKGATGRPVALLAQFLLNRAGFNGEE >NZ_CP020368.1|WP_000584114.1|4389667_4390768_+|LPS-export-ABC-transporter-permease-LptF MIIIRYLVRETLKSQLAILFILLLIFFCQKLVRILGAAVDGDIPANLVLSLLGLGVPEMAQLILPLSLFLGLLMTLGKLYTESEITVMHACGLSKAVLVKAAMILAVFTAIVAAVNVMWAGPWSSRHQDEVLAEAKANPGMAALAQGQFQQATNGSSVLFIESVDGSDFKDVFLAQIRPKGNARPSVVVADSGHLTQLRDGSQVVTLNQGTRFEGTALLRDFRITDFQDYQAIIGHQAVALDPNDTDQMDMRTLWNTDTDRARAELNWRITLVFTVFMMALMVVPLSVVNPRQGRVLSMLPAMLLYLLFFLIQTSLKSNGGKGKLDPTLWMWTVNLIYLALAIVLNLWDTVPVRRLRASFSRKGAV >NZ_CP020368.1|WP_001295681.1|4390767_4391850_+|LPS-export-ABC-transporter-permease-LptG MQPFGVLDRYIGKTIFTTIMMTLFMLVSLSGIIKFVDQLKKAGQGSYDALGAGMYTLLSVPKDVQIFFPMAALLGALLGLGMLAQRSELVVMQASGFTRMQVALSVMKTAIPLVLLTMAIGEWVAPQGEQMARNYRAQAMYGGSLLSTQQGLWAKDGNNFVYIERVKGDEELGGISIYAFNENRRLQSVRYAATAKFDPEHKVWRLSQVDESDLTNPKQITGSQTVSGTWKTNLTPDKLGVVALDPDALSISGLHNYVKYLKSSGQDAGRYQLNMWSKIFQPLSVAVMMLMALSFIFGPLRSVPMGVRVVTGISFGFVFYVLDQIFGPLTLVYGIPPIIGALLPSASFFLISLWLLMRKS >NZ_CP020368.1|WP_001294573.1|4392010_4393513_-|DUF853-domain-containing-protein MSEPLLIARTPDTELFLLPGMANRHGLITGATGTGKTVTLQKLAESLSEIGVPVFMADVKGDLTGVAQAGTVSEKLLARLKNIGVNDWQPHANPVVVWDIFGEKGHPVRATVSDLGPLLLARLLNLNDVQSGVLNIIFRIADDQGLLLLDFKDLRAITQYIGDNAKSFQNQYGNISSASVGAIQRGLLSLEQQGAAHFFGEPMLDIKDWMRTDANGKGVINILSAEKLYQMPKLYAASLLWMLSELYEQLPEAGDLEKPKLVFFFDEAHLLFNDAPQVLLDKIEQVIRLIRSKGVGVWFVSQNPSDIPDNVLGQLGNRVQHALRAFTPKDQKAVKAAAQTMRANPAFDTEKAIQELGTGEALISFLDAKGSPSVVERAMVIAPCSRMGPVTEDERNGLINHSPVYGKYEDEVDRESAYEMLQKGFQASTEQQNNPPAKGKEVAVDDGILGGLKDILFGTTGPRGGKKDGVVQTMAKSAARQVTNQIVRGMLGSLLGGRRR >NZ_CP020368.1|WP_001309159.1|4393590_4394589_-|DNA-binding-transcriptional-regulator-IdnR MRNHRISLQDIATLAGVTKMTVSRYIRSPKKVAKETGERIAKIMEEINYIPNRAPGMLLNAQSYTLGILIPSFQNQLFADILAGIESVTSEHNYQTLIANYNYDRDSEEESVINLLSYNIDGIILSEKYHTIRTVKFLRSATIPVVELMDVQGERLDMEVGFDNRQAAFDMVCTMLEKRVRHKILYLGSKDDTRDEQRYQGYCDAMMLHNLSPLRMNPRAISSIHLGMQLMRDALSANPDLDGVFCTNDDIAMGALLLCRERNLAVPEQISIAGFHGLEIGRQMIPSLASVITPRFDIGRMAAQMLLSKIKNNDHNHNTVDLGYQIYHGNTL >NZ_CP020368.1|WP_001128347.1|4394655_4395975_-|gnt-II-system-L-idonate-transporter MPLIIIAAGVALLLILMIVFKVNGFIALVLVAAVVGFAEGMDAQAVLHSIQNGIGSTLGGLAMILGFGAMLGKLISDTGAAQRIATTLIATFGKKRVQWALVITGLVVGLAMFFEVGFVLLLPLVFTIVASSGLPLLYVGVPMVAALSVTHCFLPPHPGPTAIATIFEANLGTTLLYGFIITIPTVIVAGPLFSKLLTRFEKAPPEGLFNPHLFSEEEMPSFWNSIFAAVIPVILMAIAAVCEITLPKTNTVRLFFEFVGNPAVALFIAIVIAIFTLGRRNGRTIEQIMDIIGDSIGAIAMIVFIIAGGGAFKQVLVDSGVGHYISHLMTGTTLSPLLMCWTVAALLRIALGSATVAAITTAGVVLPIINVTHADPALMVLATGAGSVIASHVNDPGFWLFKGYFNLTVGETLRTWTVMETLISIMGLLGVLAINAVLH >NZ_CP020368.1|WP_000998695.1|4396037_4396802_-|gluconate-5-dehydrogenase MNDLFSLAGKNILITGSAQGIGFLLATGLGKYGAQIIINDITAERAELAVEKLHQEGIQAVAAPFNVTHKHEIDAAVEHIEKDIGPIDVLVNNAGIQRRHPFTEFPEQEWNDVIAVNQTAVFLVSQAVTRHMVERKAGKVINICSMQSELGRDTITPYAASKGAVKMLTRGMCVELARHNIQVNGIAPGYFKTEMTKALVEDEAFTAWLCKRTPAARWGDPQELIGAAVFLSSKASDFVNGHLLFVDGGMLVAV >NZ_CP020368.1|WP_001197411.1|4396825_4397857_-|L-idonate-5-dehydrogenase MQVKTQSCVVAGKKTVAVTEQTIDWNNNGTLVQITRGGICGSDLHYYQEGKVGNFMIKAPMVLGHEVIGKVIHSDSSELHEGQTVAINPSKPCGHCKYCIEHNENQCTDMRFFGSAMYFPHVDGGFTRYKMVETSQCVPYPAKADEKVMAFAEPLAVAIHAAHQAGELQGKRVFISGVGPIGCLIVSAVKTLGAAEIVCADVSPRSLSLGKEMGADVLVNPQNDDMDHWKAEKGYFDVSFEVSGHPSSVNTCLEVTRARGVMVQVGMGGAMAEFPMMTLIGKEISLRGSFRFTSEFNTAVSWLANGVINPLPLLSAEYPFTDLEEALRFAGDKTQAAKVQLVF >NZ_CP020368.1|WP_000896738.1|4398073_4398637_+|gluconokinase MAGESFILMGVSGSGKTLIGSKVAALLSAKFIDGDDLHPAKNIDKMSQGIPLSDEDRLPWLERLNDASYSLYKKNETGFIVCSSLKKQYRDILRKGSPHVHFLWLDGDYETILARMQRRAGHFMPVALLKSQFEALERPQADEQDIVRIDINHDIANVTEQCRQAVLAIRQNRICAKEGSASDQRCE >NZ_CP020368.1|WP_001318460.1|4398640_4399660_-|NADPH-dependent-aldehyde-reductase-Ahr MSMIKSYAAKEAGGELEVYEYDPGELRPQDVEVQVDYCGICHSDLSMIDNEWGFSQYPLVAGHEVIGRVVALGSAAQDKGLQVGQRVGIGWTARSCGHCDACISGNQINCEQGAVPTIMNRGGFAEKLRADWQWVIPLPENIDIESAGPLLCGGITVFKPLLMHHITATSRVGVIGIGGLGHIAIKLLHAMGCEVTAFSSNPAKEQEVLAMGADKVVNSRDPQALKALAGQFDLIINTVNVSLDWQPYFEALTYGGNFHTVGAVLTPLSVPAFTLIAGDRSVSGSATGTPYELRKLMRFAARSKVAPTTELFPMSKINDAIQHVRDGKARYRVVLKADY |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 229362-229416 | 0 | 1.0 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 239856-239910 | 0 | 1.0 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 230768-230822 | 0 | 1.0 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 211738-211792 | 0 | 1.0 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 229160-229214 | 1 | 0.982 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 229261-229315 | 1 | 0.982 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 239654-239708 | 1 | 0.982 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 239755-239809 | 1 | 0.982 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 230566-230620 | 1 | 0.982 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 230667-230721 | 1 | 0.982 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 211536-211590 | 1 | 0.982 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 211637-211691 | 1 | 0.982 |
NZ_CP020368_6 | 6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2728733-2728765 | 33 | NZ_LR134258 | Klebsiella aerogenes strain NCTC9644 plasmid 5, complete sequence | 3574-3606 | 4 | 0.879 |
NZ_CP020368_6 | 6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2728733-2728765 | 33 | LR134281 | Klebsiella aerogenes strain NCTC9793 genome assembly, plasmid: 6 | 3567-3599 | 4 | 0.879 |
NZ_CP020368_6 | 6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2728733-2728765 | 33 | KY271401 | Klebsiella phage 1 LV-2017, complete genome | 21043-21075 | 4 | 0.879 |
NZ_CP020368_5 | 5.5|2702421|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702421-2702452 | 32 | NC_021229 | Arthrobacter nicotinovorans pAO1 megaplasmid sequence, strain ATCC 49919 | 65474-65505 | 5 | 0.844 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 229463-229517 | 6 | 0.891 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 239957-240011 | 6 | 0.891 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 230869-230923 | 6 | 0.891 |
NZ_CP020368_1 | 1.1|344535|55|NZ_CP020368|CRISPRCasFinder | 344535-344589 | 55 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 211839-211893 | 6 | 0.891 |
NZ_CP020368_5 | 5.5|2702421|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702421-2702452 | 32 | NZ_CP017422 | Arthrobacter sp. ZXY-2 plasmid pZXY21, complete sequence | 208287-208318 | 6 | 0.812 |
NZ_CP020368_5 | 5.11|2702789|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702789-2702820 | 32 | KY883647 | Vibrio phage JSF33, complete genome | 9760-9791 | 6 | 0.812 |
NZ_CP020368_5 | 5.12|2702850|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702850-2702881 | 32 | NZ_CP009293 | Novosphingobium pentaromativorans US6-1 plasmid pLA4, complete sequence | 152196-152227 | 6 | 0.812 |
NZ_CP020368_6 | 6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2728733-2728765 | 33 | KY653119 | Morganella phage IME1369_02, complete genome | 18216-18248 | 6 | 0.818 |
NZ_CP020368_5 | 5.1|2702177|32|NZ_CP020368|CRISPRCasFinder,CRT | 2702177-2702208 | 32 | NZ_AP018516 | Acetobacter orientalis strain FAN1 plasmid pAOF1, complete sequence | 48296-48327 | 8 | 0.75 |
NZ_CP020368_5 | 5.5|2702421|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702421-2702452 | 32 | MK113951 | Phage 5P_3, complete genome | 11967-11998 | 8 | 0.75 |
NZ_CP020368_5 | 5.5|2702421|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702421-2702452 | 32 | AP017924 | Ralstonia phage RP12 DNA, complete genome | 11643-11674 | 8 | 0.75 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NC_013856 | Azospirillum sp. B510 plasmid pAB510b, complete sequence | 375744-375776 | 8 | 0.758 |
NZ_CP020368_5 | 5.11|2702789|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702789-2702820 | 32 | MN855762 | Bacteriophage sp. isolate 505, complete genome | 4840-4871 | 8 | 0.75 |
NZ_CP020368_5 | 5.11|2702789|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702789-2702820 | 32 | NC_020548 | Azoarcus sp. KH32C plasmid pAZKH, complete sequence | 224460-224491 | 8 | 0.75 |
NZ_CP020368_5 | 5.12|2702850|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702850-2702881 | 32 | NZ_CP007130 | Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence | 750410-750441 | 8 | 0.75 |
NZ_CP020368_6 | 6.4|2728672|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2728672-2728704 | 33 | NZ_CP007129 | Gemmatirosa kalamazoonesis strain KBS708 plasmid 1, complete sequence | 755172-755204 | 8 | 0.758 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NZ_CP010957 | Sphingobium sp. YBL2 plasmid 3pYBL2-3, complete sequence | 26182-26214 | 9 | 0.727 |
NZ_CP020368_5 | 5.11|2702789|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702789-2702820 | 32 | NZ_CP015585 | Roseomonas gilardii strain U14-5 plasmid 1, complete sequence | 104261-104292 | 9 | 0.719 |
NZ_CP020368_5 | 5.11|2702789|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702789-2702820 | 32 | NZ_CP054618 | Azospirillum oryzae strain KACC 14407 plasmid unnamed4, complete sequence | 142898-142929 | 9 | 0.719 |
NZ_CP020368_5 | 5.12|2702850|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702850-2702881 | 32 | MN234174 | Mycobacterium phage Efra2, complete genome | 35614-35645 | 9 | 0.719 |
NZ_CP020368_5 | 5.12|2702850|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702850-2702881 | 32 | MN234165 | Mycobacterium phage Yunkel11, complete genome | 35570-35601 | 9 | 0.719 |
NZ_CP020368_5 | 5.12|2702850|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702850-2702881 | 32 | MN234201 | Mycobacterium phage Guanica15, complete genome | 35571-35602 | 9 | 0.719 |
NZ_CP020368_2 | 2.1|376521|59|NZ_CP020368|CRISPRCasFinder | 376521-376579 | 59 | MT230312 | Escherichia coli strain DH5alpha plasmid pESBL31, complete sequence | 97-155 | 10 | 0.831 |
NZ_CP020368_5 | 5.5|2702421|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702421-2702452 | 32 | NC_002580 | Propionibacterium freudenreichii plasmid p545, complete sequence | 2898-2929 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | NZ_CP028970 | Aminobacter sp. MSH1 plasmid pUSP2, complete sequence | 156123-156154 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | NZ_CP053984 | Achromobacter pestifer strain FDAARGOS_790 plasmid unnamed, complete sequence | 21888-21919 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | NC_010935 | Comamonas testosteroni CNB-1 plasmid pCNB, complete sequence | 28766-28797 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | JX469826 | Uncultured bacterium plasmid pB12, complete sequence | 11283-11314 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | JN106171 | Uncultured bacterium plasmid pAKD26, complete sequence | 11289-11320 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | NC_016968 | Comamonas testosteroni plasmid pTB30, complete sequence | 11287-11318 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | NC_016978 | Comamonas testosteroni plasmid pI2, complete sequence | 11272-11303 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | NZ_CP017760 | Cupriavidus necator strain NH9 plasmid pENH91, complete sequence | 67078-67109 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | NZ_CP053554 | Diaphorobacter sp. JS3050 plasmid pDCNB, complete sequence | 4235-4266 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | NC_019263 | Delftia acidovorans plasmid pLME1, complete sequence | 11288-11319 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | NC_019264 | Delftia acidovorans plasmid pNB8c, complete sequence | 11288-11319 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | NC_019283 | Delftia acidovorans plasmid pC1-1, complete sequence | 11288-11319 | 10 | 0.688 |
NZ_CP020368_5 | 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702482-2702513 | 32 | NC_006830 | Achromobacter xylosoxidans A8 plasmid pA81, complete sequence | 11350-11381 | 10 | 0.688 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | CP046443 | Pseudomonas coronafaciens pv. coronafaciens strain B19001 plasmid unnamed2, complete sequence | 31933-31965 | 10 | 0.697 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NZ_LT963392 | Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP1, complete sequence | 103013-103045 | 10 | 0.697 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NZ_LT963392 | Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP1, complete sequence | 110510-110542 | 10 | 0.697 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NZ_CP034079 | Pseudomonas syringae pv. pisi str. PP1 plasmid pPP1-1, complete sequence | 48454-48486 | 10 | 0.697 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NZ_CP034080 | Pseudomonas syringae pv. pisi str. PP1 plasmid pPP1-2, complete sequence | 39480-39512 | 10 | 0.697 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NC_005918 | Pseudomonas syringae pv. maculicola strain ES4326 plasmid pPMA4326A, complete sequence | 31117-31149 | 10 | 0.697 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NZ_CP047262 | Pseudomonas syringae pv. maculicola str. ES4326 plasmid pPma4326A, complete sequence | 30966-30998 | 10 | 0.697 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NZ_CP026560 | Pseudomonas amygdali pv. morsprunorum strain R15244 plasmid p3_tig5, complete sequence | 19118-19150 | 10 | 0.697 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NZ_LT963406 | Pseudomonas syringae pv. avii isolate CFBP3846 plasmid PP4, complete sequence | 54820-54852 | 10 | 0.697 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | LT985193 | Pseudomonas syringae strain CFBP 2116 genome assembly, plasmid: PP2 | 32077-32109 | 10 | 0.697 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NZ_LT963393 | Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP2, complete sequence | 50597-50629 | 10 | 0.697 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NZ_LT985210 | Pseudomonas syringae pv. cerasicola strain CFBP 6110 plasmid PP1, complete sequence | 105842-105874 | 10 | 0.697 |
NZ_CP020368_5 | 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702604-2702636 | 33 | NZ_LT985211 | Pseudomonas syringae pv. cerasicola strain CFBP 6110 plasmid PP2, complete sequence | 84272-84304 | 10 | 0.697 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052797 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N18S2039 plasmid pN18S2039, complete sequence | 45808-45839 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052795 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0125 plasmid pN19S0125, complete sequence | 282589-282620 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | NZ_CP047882 | Salmonella enterica subsp. enterica serovar Infantis strain 119944 plasmid pESI, complete sequence | 94965-94996 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052804 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S973 plasmid pN17S0973, complete sequence | 304288-304319 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP038508 | Salmonella enterica subsp. enterica serovar Infantis strain FARPER-219 plasmid p-F219, complete sequence | 112376-112407 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052802 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S976 plasmid pN17S0976, complete sequence | 315682-315713 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052788 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0611 plasmid pN19S0611, complete sequence | 203378-203409 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052840 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S024 plasmid pN16S024, complete sequence | 127648-127679 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052786 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0641 plasmid pN19S0641, complete sequence | 215302-215333 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052838 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S097 plasmid pN16S097, complete sequence | 214483-214514 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | NZ_CP028316 | Salmonella enterica subsp. enterica serovar Typhimurium var. 5- strain CFSAN067217 plasmid pSC-31-2, complete sequence | 108893-108924 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP051676 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1234 plasmid pN16S1234, complete sequence | 83669-83700 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052783 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0679 plasmid pN19S0679-1, complete sequence | 194119-194150 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052836 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S103 plasmid pN16S103, complete sequence | 18410-18441 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | NZ_CP022063 | Salmonella enterica strain FDAARGOS_312 plasmid unnamed3, complete sequence | 64615-64646 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052781 | Salmonella enterica strain CVM N19S0949 plasmid pN19S0949, complete sequence | 169480-169511 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052834 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S041 plasmid pN17S0041, complete sequence | 6457-6488 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052793 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0388 plasmid pN19S0388, complete sequence | 25758-25789 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052779 | Salmonella enterica strain 19TN07GT06K-S plasmid pN19S1233, complete sequence | 140403-140434 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052832 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1040 plasmid pN17S1040, complete sequence | 160727-160758 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | NZ_CP031362 | Salmonella enterica subsp. enterica serovar Heidelberg strain 5 plasmid p3, complete sequence | 140152-140183 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052830 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1105 plasmid pN17S1105, complete sequence | 193709-193740 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052828 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1126 plasmid pN17S1126, complete sequence | 126974-127005 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052826 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1245 plasmid pN17S0637, complete sequence | 110984-111015 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | NZ_CP016409 | Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502916 plasmid pFSIS1502916, complete sequence | 94916-94947 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052824 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1265 plasmid pN17S1265, complete sequence | 91497-91528 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052822 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1349 plasmid pN17S1349, complete sequence | 110984-111015 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | NZ_CP016407 | Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502169 plasmid pFSIS1502169, complete sequence | 94916-94947 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052820 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1442 plasmid pN17S1442, complete sequence | 94916-94947 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | NZ_CP016413 | Salmonella enterica subsp. enterica serovar Infantis strain CVM44454 plasmid pCVM44454, complete sequence | 94916-94947 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | NZ_CP016411 | Salmonella enterica subsp. enterica serovar Infantis strain N55391 plasmid pN55391, complete sequence | 94916-94947 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052816 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1598 plasmid pN17S1598 | 165317-165348 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052814 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S349 plasmid pN17S0349, complete sequence | 99109-99140 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | NZ_CP022662 | Salmonella enterica subsp. enterica strain RM11065 plasmid pRM11065-2, complete sequence | 54379-54410 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052812 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S376 plasmid pN17S0376, complete sequence | 1671-1702 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052810 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S535 plasmid pN17S0535, complete sequence | 212751-212782 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052808 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S637 plasmid pN17S0637, complete sequence | 306376-306407 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052806 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S816 plasmid pN17S0816, complete sequence | 164579-164610 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052791 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0552 plasmid pN17S0637, complete sequence | 168074-168105 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052818 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1509 plasmid pN17S1509, complete sequence | 190524-190555 | 10 | 0.688 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | CP052799 | Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S990 plasmid pN17S0990-1, complete sequence | 6457-6488 | 10 | 0.688 |
NZ_CP020368_2 | 2.1|376521|59|NZ_CP020368|CRISPRCasFinder | 376521-376579 | 59 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 40375-40433 | 11 | 0.814 |
NZ_CP020368_5 | 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2702728-2702759 | 32 | NZ_CP026128 | Acinetobacter baumannii strain ABNIH28 plasmid pABA-1fe1, complete sequence | 49165-49196 | 11 | 0.656 |
NZ_CP020368_6 | 6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2728733-2728765 | 33 | MF158039 | Shigella phage Sf12, complete genome | 4974-5006 | 11 | 0.667 |
NZ_CP020368_6 | 6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR | 2728733-2728765 | 33 | MF158042 | Shigella phage Sd1, complete genome | 937-969 | 11 | 0.667 |
1. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 0, identity: 1.0
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta CRISPR spacer ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta Protospacer *******************************************************
2. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 0, identity: 1.0
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta CRISPR spacer ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta Protospacer *******************************************************
3. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 0, identity: 1.0
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta CRISPR spacer ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta Protospacer *******************************************************
4. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 0, identity: 1.0
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta CRISPR spacer ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta Protospacer *******************************************************
5. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 1, identity: 0.982
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta CRISPR spacer ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtgaacgcctta Protospacer *********************************************.*********
6. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 1, identity: 0.982
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta CRISPR spacer ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtgaacgcctta Protospacer *********************************************.*********
7. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 1, identity: 0.982
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta CRISPR spacer ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtgaacgcctta Protospacer *********************************************.*********
8. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 1, identity: 0.982
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta CRISPR spacer ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtgaacgcctta Protospacer *********************************************.*********
9. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 1, identity: 0.982
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta CRISPR spacer ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtgaacgcctta Protospacer *********************************************.*********
10. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 1, identity: 0.982
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta CRISPR spacer ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtgaacgcctta Protospacer *********************************************.*********
11. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 1, identity: 0.982
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta CRISPR spacer ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtgaacgcctta Protospacer *********************************************.*********
12. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 1, identity: 0.982
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta CRISPR spacer ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtgaacgcctta Protospacer *********************************************.*********
13. spacer 6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LR134258 (Klebsiella aerogenes strain NCTC9644 plasmid 5, complete sequence) position: , mismatch: 4, identity: 0.879
gaaatgctggtgagcgttaatgccgcaaacaca CRISPR spacer gaaatgctggtgagcgttaacgccgcgaacccc Protospacer ********************.*****.*** *
14. spacer 6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to LR134281 (Klebsiella aerogenes strain NCTC9793 genome assembly, plasmid: 6) position: , mismatch: 4, identity: 0.879
gaaatgctggtgagcgttaatgccgcaaacaca CRISPR spacer gaaatgctggtgagcgttaacgccgcgaacccc Protospacer ********************.*****.*** *
15. spacer 6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to KY271401 (Klebsiella phage 1 LV-2017, complete genome) position: , mismatch: 4, identity: 0.879
gaaatgctggtgagcgttaatgccgcaaacaca CRISPR spacer gaaatgctggtgagcgttaacgccgcgaacccc Protospacer ********************.*****.*** *
16. spacer 5.5|2702421|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NC_021229 (Arthrobacter nicotinovorans pAO1 megaplasmid sequence, strain ATCC 49919) position: , mismatch: 5, identity: 0.844
tgggcggcttgccttgcagccagctccagcag- CRISPR spacer tgggcggcttgcgttgcagcctgc-cgagcgga Protospacer ************ ******** ** * ***.*
17. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 6, identity: 0.891
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta-- CRISPR spacer ttagcgtcgcatcaggcatctgcacacgactgccggatgcg--ataaacgtcttgtc Protospacer ***********************.***************** .******.***.
18. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 6, identity: 0.891
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta-- CRISPR spacer ttagcgtcgcatcaggcatctgcacacgactgccggatgcg--ataaacgtcttgtc Protospacer ***********************.***************** .******.***.
19. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 6, identity: 0.891
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta-- CRISPR spacer ttagcgtcgcatcaggcatctgcacacgactgccggatgcg--ataaacgtcttgtc Protospacer ***********************.***************** .******.***.
20. spacer 1.1|344535|55|NZ_CP020368|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 6, identity: 0.891
ttagcgtcgcatcaggcatctgcgcacgactgccggatgcggcgtaaacgcctta-- CRISPR spacer ttagcgtcgcatcaggcatctgcacacgactgccggatgcg--ataaacgtcttgtc Protospacer ***********************.***************** .******.***.
21. spacer 5.5|2702421|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP017422 (Arthrobacter sp. ZXY-2 plasmid pZXY21, complete sequence) position: , mismatch: 6, identity: 0.812
tgggcggcttgccttgcagccagctccagcag- CRISPR spacer ggggcggcttgcgttgcagcctgc-cgagcgga Protospacer *********** ******** ** * ***.*
22. spacer 5.11|2702789|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to KY883647 (Vibrio phage JSF33, complete genome) position: , mismatch: 6, identity: 0.812
agcgtgttcggcatcacctttggcttcggctg CRISPR spacer agcagtttcggcatcagctttggctttggctt Protospacer ***. ********** *********.****
23. spacer 5.12|2702850|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP009293 (Novosphingobium pentaromativorans US6-1 plasmid pLA4, complete sequence) position: , mismatch: 6, identity: 0.812
tgcgtgagcgtatcgccgcgcgtctgcgaaag CRISPR spacer agaatgagcgtgtcgccgcgcgtctgcgtgag Protospacer * .*******.**************** .**
24. spacer 6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to KY653119 (Morganella phage IME1369_02, complete genome) position: , mismatch: 6, identity: 0.818
gaaatgctggtgagcgttaatgccgcaaacaca CRISPR spacer gaaatgctggtcagcgttaacgccgcacaacct Protospacer *********** ********.****** * *
25. spacer 5.1|2702177|32|NZ_CP020368|CRISPRCasFinder,CRT matches to NZ_AP018516 (Acetobacter orientalis strain FAN1 plasmid pAOF1, complete sequence) position: , mismatch: 8, identity: 0.75
cagcgtcaggcgtgaaatctcaccgtcgttgc CRISPR spacer attctttaggcgtgacatcttaccgtcgttga Protospacer * *.******** ****.**********
26. spacer 5.5|2702421|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to MK113951 (Phage 5P_3, complete genome) position: , mismatch: 8, identity: 0.75
tgggcggcttgccttgcagccagctccagcag CRISPR spacer ggggcagcttgccttgcagccagccgatgctc Protospacer ****.******************. **
27. spacer 5.5|2702421|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to AP017924 (Ralstonia phage RP12 DNA, complete genome) position: , mismatch: 8, identity: 0.75
tgggcggcttgccttgcagccagctccagcag CRISPR spacer tgggccgcttgccgtgcagccagcgcttccgc Protospacer ***** ******* ********** *. *.
28. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NC_013856 (Azospirillum sp. B510 plasmid pAB510b, complete sequence) position: , mismatch: 8, identity: 0.758
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer cgcgtcggcgacgcgcaggtaatgcgcgatcag Protospacer * ************** *********. *
29. spacer 5.11|2702789|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to MN855762 (Bacteriophage sp. isolate 505, complete genome) position: , mismatch: 8, identity: 0.75
agcgtgttcggcatcacctttggcttcggctg CRISPR spacer gaccagctcgaaatcacctttggcttcggctt Protospacer ..* *.***. *******************
30. spacer 5.11|2702789|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NC_020548 (Azoarcus sp. KH32C plasmid pAZKH, complete sequence) position: , mismatch: 8, identity: 0.75
agcgtgtt---cggcatcacctttggcttcggctg CRISPR spacer ---ctgctcgccggcatcaccttcggcttctgcta Protospacer **.* ************.****** ***.
31. spacer 5.12|2702850|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP007130 (Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence) position: , mismatch: 8, identity: 0.75
tgcgtgagcgtatcgccgcgcgtctgcgaaag- CRISPR spacer agcgagagcgtatcgccgcgc-ttcgtgaagcc Protospacer *** **************** *..*.***.
32. spacer 6.4|2728672|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP007129 (Gemmatirosa kalamazoonesis strain KBS708 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.758
tggctctgcaacagcagcacccatgaccacgtc CRISPR spacer cgctccagcaacagcagcacccacgaccacgga Protospacer .* ..* ****************.*******
33. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP010957 (Sphingobium sp. YBL2 plasmid 3pYBL2-3, complete sequence) position: , mismatch: 9, identity: 0.727
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer cgaggcggcgacacgcaaggtatgcgggtcgag Protospacer **********.****.******** * . *
34. spacer 5.11|2702789|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP015585 (Roseomonas gilardii strain U14-5 plasmid 1, complete sequence) position: , mismatch: 9, identity: 0.719
agcgtgttcggcatcacctttggcttcggctg CRISPR spacer atccgcacgggcatcacctttggctccagctg Protospacer * * . ****************.*.****
35. spacer 5.11|2702789|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP054618 (Azospirillum oryzae strain KACC 14407 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.719
agcgtgttcggcatcacctttggcttcggctg CRISPR spacer ctcggcctcggcaacacctttgccttcggcgc Protospacer ** .****** ******** *******
36. spacer 5.12|2702850|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to MN234174 (Mycobacterium phage Efra2, complete genome) position: , mismatch: 9, identity: 0.719
tgcgtgagcgtatcgccgcgcgtctgcgaaag CRISPR spacer gccgtgagcgtgacgccgcgcgtctggtgatc Protospacer *********. ************* .*
37. spacer 5.12|2702850|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to MN234165 (Mycobacterium phage Yunkel11, complete genome) position: , mismatch: 9, identity: 0.719
tgcgtgagcgtatcgccgcgcgtctgcgaaag CRISPR spacer gccgtgagcgtgacgccgcgcgtctggtgatc Protospacer *********. ************* .*
38. spacer 5.12|2702850|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to MN234201 (Mycobacterium phage Guanica15, complete genome) position: , mismatch: 9, identity: 0.719
tgcgtgagcgtatcgccgcgcgtctgcgaaag CRISPR spacer gccgtgagcgtgacgccgcgcgtctggtgatc Protospacer *********. ************* .*
39. spacer 2.1|376521|59|NZ_CP020368|CRISPRCasFinder matches to MT230312 (Escherichia coli strain DH5alpha plasmid pESBL31, complete sequence) position: , mismatch: 10, identity: 0.831
ggtgccagaaccgtaggccggataaggcgttcacgccgcatccggcaataagtgctccg- CRISPR spacer gagcacagaaccgtaggacggataaggcgttcacgccgcatccggcgat-cgtgcactga Protospacer *. ************ ****************************.** **** *.*
40. spacer 5.5|2702421|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NC_002580 (Propionibacterium freudenreichii plasmid p545, complete sequence) position: , mismatch: 10, identity: 0.688
tgggcggcttgccttgcagccagctccagcag CRISPR spacer ccagcggcttgcgtggcagccagctctcaggg Protospacer . .********* * ***********. . .*
41. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP028970 (Aminobacter sp. MSH1 plasmid pUSP2, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer gcgtgtgctggcaatcgcttccggggtgacgt Protospacer . *. ********** ***.******** .
42. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP053984 (Achromobacter pestifer strain FDAARGOS_790 plasmid unnamed, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
43. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NC_010935 (Comamonas testosteroni CNB-1 plasmid pCNB, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
44. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to JX469826 (Uncultured bacterium plasmid pB12, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
45. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to JN106171 (Uncultured bacterium plasmid pAKD26, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
46. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NC_016968 (Comamonas testosteroni plasmid pTB30, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
47. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NC_016978 (Comamonas testosteroni plasmid pI2, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
48. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP017760 (Cupriavidus necator strain NH9 plasmid pENH91, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
49. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP053554 (Diaphorobacter sp. JS3050 plasmid pDCNB, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
50. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NC_019263 (Delftia acidovorans plasmid pLME1, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
51. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NC_019264 (Delftia acidovorans plasmid pNB8c, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
52. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NC_019283 (Delftia acidovorans plasmid pC1-1, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
53. spacer 5.6|2702482|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NC_006830 (Achromobacter xylosoxidans A8 plasmid pA81, complete sequence) position: , mismatch: 10, identity: 0.688
aagctggctggcaatctctttcggggtgagtc CRISPR spacer aagctggctggcattctcattcgtcagtacct Protospacer ************* **** **** . * ..
54. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP046443 (Pseudomonas coronafaciens pv. coronafaciens strain B19001 plasmid unnamed2, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
55. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LT963392 (Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP1, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
56. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LT963392 (Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP1, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
57. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP034079 (Pseudomonas syringae pv. pisi str. PP1 plasmid pPP1-1, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
58. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP034080 (Pseudomonas syringae pv. pisi str. PP1 plasmid pPP1-2, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
59. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NC_005918 (Pseudomonas syringae pv. maculicola strain ES4326 plasmid pPMA4326A, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
60. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP047262 (Pseudomonas syringae pv. maculicola str. ES4326 plasmid pPma4326A, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
61. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP026560 (Pseudomonas amygdali pv. morsprunorum strain R15244 plasmid p3_tig5, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
62. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LT963406 (Pseudomonas syringae pv. avii isolate CFBP3846 plasmid PP4, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
63. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to LT985193 (Pseudomonas syringae strain CFBP 2116 genome assembly, plasmid: PP2) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
64. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LT963393 (Pseudomonas syringae pv. cerasicola isolate CFBP6109 plasmid PP2, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
65. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LT985210 (Pseudomonas syringae pv. cerasicola strain CFBP 6110 plasmid PP1, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
66. spacer 5.8|2702604|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_LT985211 (Pseudomonas syringae pv. cerasicola strain CFBP 6110 plasmid PP2, complete sequence) position: , mismatch: 10, identity: 0.697
gcaggcggcgacgcgcagggtatgcgcgattcg CRISPR spacer accggcggcgacgcgcaggagatgcgcagcgaa Protospacer .* ****************. ******... .
67. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052797 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N18S2039 plasmid pN18S2039, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
68. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052795 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0125 plasmid pN19S0125, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
69. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP047882 (Salmonella enterica subsp. enterica serovar Infantis strain 119944 plasmid pESI, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
70. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052804 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S973 plasmid pN17S0973, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
71. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP038508 (Salmonella enterica subsp. enterica serovar Infantis strain FARPER-219 plasmid p-F219, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
72. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052802 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S976 plasmid pN17S0976, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
73. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052788 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0611 plasmid pN19S0611, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
74. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052840 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S024 plasmid pN16S024, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
75. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052786 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0641 plasmid pN19S0641, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
76. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052838 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S097 plasmid pN16S097, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
77. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP028316 (Salmonella enterica subsp. enterica serovar Typhimurium var. 5- strain CFSAN067217 plasmid pSC-31-2, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
78. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP051676 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1234 plasmid pN16S1234, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
79. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052783 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0679 plasmid pN19S0679-1, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
80. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052836 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N16S103 plasmid pN16S103, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
81. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022063 (Salmonella enterica strain FDAARGOS_312 plasmid unnamed3, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
82. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052781 (Salmonella enterica strain CVM N19S0949 plasmid pN19S0949, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
83. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052834 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S041 plasmid pN17S0041, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
84. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052793 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0388 plasmid pN19S0388, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
85. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052779 (Salmonella enterica strain 19TN07GT06K-S plasmid pN19S1233, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
86. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052832 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1040 plasmid pN17S1040, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
87. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP031362 (Salmonella enterica subsp. enterica serovar Heidelberg strain 5 plasmid p3, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
88. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052830 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1105 plasmid pN17S1105, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
89. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052828 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1126 plasmid pN17S1126, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
90. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052826 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1245 plasmid pN17S0637, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
91. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016409 (Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502916 plasmid pFSIS1502916, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
92. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052824 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1265 plasmid pN17S1265, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
93. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052822 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1349 plasmid pN17S1349, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
94. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016407 (Salmonella enterica subsp. enterica serovar Infantis strain FSIS1502169 plasmid pFSIS1502169, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
95. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052820 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1442 plasmid pN17S1442, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
96. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016413 (Salmonella enterica subsp. enterica serovar Infantis strain CVM44454 plasmid pCVM44454, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
97. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP016411 (Salmonella enterica subsp. enterica serovar Infantis strain N55391 plasmid pN55391, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
98. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052816 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1598 plasmid pN17S1598) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
99. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052814 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S349 plasmid pN17S0349, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
100. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP022662 (Salmonella enterica subsp. enterica strain RM11065 plasmid pRM11065-2, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
101. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052812 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S376 plasmid pN17S0376, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
102. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052810 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S535 plasmid pN17S0535, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
103. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052808 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S637 plasmid pN17S0637, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
104. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052806 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S816 plasmid pN17S0816, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
105. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052791 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N19S0552 plasmid pN17S0637, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
106. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052818 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S1509 plasmid pN17S1509, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
107. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to CP052799 (Salmonella enterica subsp. enterica serovar Infantis strain CVM N17S990 plasmid pN17S0990-1, complete sequence) position: , mismatch: 10, identity: 0.688
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gaggcgctatcaaacacaaccgacagggagta Protospacer ..*..****** .***************.
108. spacer 2.1|376521|59|NZ_CP020368|CRISPRCasFinder matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 11, identity: 0.814
-ggtgccagaaccgtaggccggataaggcgttcacgccgcatccggcaataagtgctccg CRISPR spacer tcgcacca-aaccgtaggccggataaggcgtttacgccgcatccggcaaaaagccgtacc Protospacer *..*** ***********************.**************** ***. * *
109. spacer 5.10|2702728|32|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to NZ_CP026128 (Acinetobacter baumannii strain ABNIH28 plasmid pABA-1fe1, complete sequence) position: , mismatch: 11, identity: 0.656
tcaacattatcaattacaaccgacagggagcc CRISPR spacer gatacattgccaattacaaccgacagttcaaa Protospacer *****..**************** .
110. spacer 6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to MF158039 (Shigella phage Sf12, complete genome) position: , mismatch: 11, identity: 0.667
gaaatgctggtgagcgttaatgccgcaaacaca CRISPR spacer cggcacttggggagcgttaatgctgcaaacaat Protospacer .. .*** ************.*******
111. spacer 6.5|2728733|33|NZ_CP020368|CRISPRCasFinder,CRT,PILER-CR matches to MF158042 (Shigella phage Sd1, complete genome) position: , mismatch: 11, identity: 0.667
gaaatgctggtgagcgttaatgccgcaaacaca CRISPR spacer cggcacttggagagcgttaatgctgcaaacaat Protospacer .. .*** ************.*******
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
533907 : 550491
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP020368|533907:550491|DBSCAN-SWA GTTACGCCTTCTTTATATCCTCCATAATTCCAGAGTGGGACATATTTGGGACATTATCACCAAAAATGTCGTCTATTTTCCTCGCATGCTCTGTCAAATGATTAGGCGCAAGGTGAGCATACCTACGAACCATTTCTATGGACTCCCATCCGCCCATTTCCTGAAGCACTGATAATGGGACGCCTGACTGAATCAGCCAGCTTGCCCAGGTGTGTCTGAGGTCATGGAAACGGAAATCTTCAATTCCTGCACGACGACAAGCTGATAGCCATGATGTCTTGCTGTCGATGCGCATCTTCCTGACCGCAGGCGTTGATGTTCCATCTGCTCGCTTAGCCGCCTTGGTATGTACAAACACCCATTTGTGATGCTTGCCTATTTGATCACGCAACACTTTACAGGCGGTATCGTTCAGCGCCACACCAATGGCGCGGTTTGATTTGCTCTCTTCTGGATTCACCCAGGCAACTCGTCGCTGCATGTCGATTTGTTGCCATTCCAGATTTATGATGTTCGACTTTCTCAGACCAGTTGCCAGCGCAAACTTGACGACAGATTTCAGTGGTTCGGGGCACTCATCAATAAGGCGTTTTGCTTCCTCCTTTTCCAGCCATCTGACTCGCTTGTTTCTGACCGCTGGTATCTTGATGACAGGCGCTTTTTCCAGCCACTTCCAGTCGCGTTCTGCAGCACGGAGAATGGCCTTTATCATGGCAAGATGCTTTGCCTTTGTCTGAGTTGGTACTGGCTTTGGTTCATAAACAGGCAGTTCTTTACCTTTCCTGATGGCGGCCTGAACTTTCTGTTTCCATATTTCTTTCGTCTTTCTGTTATGCATTCTGCTTACAGCAGAGTAAATCTTTGCCTCCGAGATATCTTTAAGCCTTATACCCTCAAAATGTTCAAGCCAGAACTCAATCCGGCTTTTATCTGAATCGAGAGATTTTTTATCAGCTTTTTCCTCAAGCCATCTTAGGCAGGCCTCTTCAAAAGTGACATCAGGTAAATCCCCTAGCTTTTCTACTCGCCAGAGTTCTGCTTTTCGCTTGTCGTGCAACTCCTGAGCTTGCCGCTTGTCCTTTGTGCCAAGAGATTCCTTAATTCGTTTCCCGCCCGGGAGCGAATACGAGGCATACCATATTTCATTTCTGCGGAAGAGTGACATTTTCTTTCCTCTGTTATGCCATCACCCGCGCTCACCTGGACAGTATGCAGCGGAGACTGAAGCGCCGCAATGCAGGCTTGCCGTGTTGTGAGGTAAGGAGATTTTGGCTTGGTTGGATCTTTACGTGTTGCCTGTAGGCGGCCTGTTCGTATCCAGTTGGTGGCGGTTGGTCTGGATATCTTAAGAAACTGACAGGCCTCATCGAGTGTGAGGCTGTATGATTCCATGGTTACCTCTGCTTTTTGAACGCATGTCACGTAACTTCTTAATGTGTTCTGCCGTTTCGATCTCTTCTGCTATCCGATCTGCATCAGCTTTATTCACAGGTTCAAAGTCATGATTAAAGCGGAACATGCTGGCGATACATGTTCTGCCTTTTCGGATGTAGTGAACTTTGTTGTGGGTAGAACGCAGGATTTTGCAGGGAGTGCCGTGGTGGTCGACGTACCAGGTGTTAGGCAAAATGATTCTGAACATTTTTACACCTCAGTTGGACGATGTTGAAATTTGCTGCTTTGAGGCCATCACAATCCCCATTGTTTGTTCTTAAGTTCGATCTCCTCCTGGCAGCTTGCACAAGTCCGACAACCCTGAACGGCCAGGCGTCTTCGTTCATCTATGGGATCGCCACACTCACAACAATGAGTGGCAGATATAACCTGGTGGTTCAGACGACGCATTTTTATTGCTGTATTGCGCTGTAATTCTTCGATTTCTGATGCTGAATCAATGATGTCTGCCATCTTCCATTAATCCCTGAATTGTTGGTTAATACGCTTGAGGGTGAATGCGAATAATAAAAAAGGAGCCTGTAGCTCCCTGATGATTTTGCTTTTCATGTTCATCGCTCCTTAAAGACGCCGTTTAACATGCCGATCGCCAGGCTTAAATGAGTCGGTGTGAATCCCATCAGCGTTACCGTTTCGCGGTGCTTCTTCAGTACGCTACGGCAAATGTCATCGACGTTTTTATCCGGAAACTGCTGTCTGGCTTTTTTGATTTCAGAATTAGCCTGACGGGCTATGCTGCGAAGGGCGTTTTCCTGCTGAGGTGTCATTGAACAAGTCCCATGTCGGCAAGCATAAGCACACAGAATATGAAGCCCGCTGCCAGAAAAATGCATTCCGTGGTTGTCATGCAGCCTCCCGACGGGCAAGAATCCTTGAGCCGAACGCCATCAACTCTCCACGATCAACGGTCGTAAAGTGGCAGTGTGTACGGGGGTATGGGTGCCAGATAATGAGCATCGAGCCTTTATTATTTCCACTGACGTGTTTCCCGGTGAGTGGGTTAATAAATGCCAGTCGTCCTGCCGTGATGAATCTGACCTCACTGGCGGTTTGTATCGCTTCATGAAACCATCCGACAGATGTGTCAGCAGGTAATAACATTACACATCCCACACTGCTGAATTTGTTTTCAGTGGCTGCCTTTTTCACAAAAGGGGAAATATTGCTGTATGGTGGATTCAACCAGACATAACCAGAGGCATATCCCATTGCTTACTGGTTGGTTACCAACTTGTACCAGAACATGCGGGCCAACAGGTGCAACCGTAACCAGCATAAATCAGGCTGCGGCTAAAATGGCGCGGGCAGGAATCCTGGTCGTTGATGGTAAGGTCTGGCGAACGGTGTATTACCGGTTCGCTACCAGAGAAGAACGGGAAGGAAAGGTGAGCACGAATCTGATTTTTAAGGAGTGTCGCCAGAGTGCCGCGATGAAACGGGTACTGAGGGTATATAAAAGAACATCAATGGGAACACAATGATGAAACAGGTGAGTTGAGTTCAAACTGTAGTACAATTCTCTCCAGTTTGAACAGGAAAGAATATGCTATGAACCCTTATATTTATCTTGGTGGTGCAATACTTGCAGAGGTCATTGGTACAACCTTAATGAAGTTTTCAGAAGGTTTTACACGGTTATGGCCATCTGTTGGTACAATTATTTGTTATTGTGCATCATTCTGGTTATTAGCTCAGACGCTGGCTTATATTCCTACAGGGATTGCTTATGGTATCTGGTCAGGAGTCGGTATTGTCCTGATTAGCTTACTGTTATGGGGATTTTTCGGCCAACGGCTGGACCTGCCAGCCATTATAGGCATGATGTTGATTTGTGCCGGTGTGTTGGTTATTAATTTATTGTCACGAAGCACACCACATTAAAATAATTTGTTTCTAAACGACTAAAATATGGAGGCTCTTATATTTATATGAGCCTCGTTTTATGCTTTTTGTTAATGTCTTTATTTTTTATGTATTCTTTTGTGCTTTCAAGATTATGGCGTAAGAAAATTGCAATACGATTATTGTTGTATATTCAAGATAATGTGACCTTAATTGTCTTTTTAAATAAAAATTAAACAAAAATTATATCCCACCACTAAGGTTTATAAAAGCATACGTTAGCAGGTGTCACCATGAAAAAAGCCATAGCATATATGCGATTTTCATCACCAGGTCAGATGTCTGGCGACTCATTAAACCGACAGAGAAGACTTATTGCTGAATGGTTAAAGGTAAATAGTGATTATTATCTTGATACCATAACATATGAAGATTTAGGATTAAGTGCATTCAAAGGAAAGCATGCACAATCAGGAGCTTTTTCGGAATTTTTAGATGCTATAGAGCATGGTTATATATTGCCAGGAACTACATTGTTAGTTGAAAGTCTGGACAGACTTTCAAGAGAAAAAGTCGGTGAAGCGATTGAACGTCTGAAATTGATTTTGAATCACGGTATTGATGTTATAACTCTTTGCGACAATACAGTCTATAATATTGACTCTTTGAATGAGCCATATTCATTAATAAAAGCCATACTTATAGCACAAAGGGCAAATGAAGAAAGCGAGATAAAGTCAAGTCGGGTTAAATTATCATGGAAGAAAAAACGGCAGGATGCACTGGAATCAGGTACGATTATGACGGCGTCTTGTCCGAGATGGCTCTCCTTAGATGACAAAAGAACGGCTTTTGTTCCAGACCCCGACAGGGTGAAAACTATTGAGCTAATTTTTAAACTCAGGATGGAAAGGCGCTCATTGAATGCAATAGCCAAGTATTTAAATGATCATGCTGTAAAGAATTTCTCAGGAAAAGAAAGTGCATGGGGACCTTCTGTAATTGAAAAATTATTAGCGAATAAAGCTCTGATAGGTATATGCGTACCTTCATATCGTGCAAGAGGGAAAGGGATAAGTGAAATCGCTGGCTATTATCCCAGAGTCATATCAGATGATTTGTTTTACGCTGTACAGGAAATTCGGTTGGCACCTTTTGGTATTAGCAATAGTAGCAAGAATCCTATGCTAATAAATCTACTTCGAACAGTTATGAAGTGTGAGGCTTGTGGTAATACCATGATTGTTCATGCGGTATCTGGAAGTTTGCATGGCTATTATGTTTGTCCGATGAGAAGATTACATCGATGTGACAGGCCATCAATAAAAAGAGATTTGGTTGATTATAATATCATTAATGAATTGCTTTTTAATTGTAGCAAAATTCAACCAGTTGAAAACAAGAAAGATGCTAATGAAACTTTAGAGTTAAAAATTATTGAGCTTCAGATGAAAATTAATAATTTAATCGTTGCATTGTCTGTCGCGCCTGAAGTTACCGCTATAGCAGAGAAAATAAGACTATTAGATAAGGAATTACGAAGGGCTTCGGTATCATTGAAAACTTTGAAGAGTAAAGGTGTAAATTCATTCAGTGATTTTTATGCTATTGACTTAACCAGTAAAAATGGACGAGAGTTATGCCGTACACTTGCCTATAAAACATTCGAAAAAATCATAATTAATACGGATAATAAAACCTGTGATATCTATTTTATGAATGGCATTGTTTTTAAACACTATCCTTTAATGAAAGTAATATCCGCCCAGCAGGCGATAAGTGCTCTCAAATATATGGTTGATGGTGAGATTTATTTCTAAATAATGATCTCGGATTTTAAGTTATGCTATGGTGATAAAGTGCAAGACAGAATTAATTATCTTTAACGAAACTTAATGGGTAATTACTTTGTTTGCTCCCACAAGCGAGTTTTGTACGGCTGTATTGGGGTAGTAAATGAGCTATACAATCTTAATCATTTGTTAGGTGAGAACTCTTGGTCGCAGATTCAAATACTGAAAATACGTGACAAATTATTATGAGCAAAATGGTGTATGTCACGTATTTTGAATGGTAGGTTAAAAAATAACACCGACTTTCGTAGGTATTACTAATAATAAAGCAGAGTTTTTAGATAGTATCAATGTGCTTTGTGTATATTGTGGCAAATAATTGGGTTGGGGGTACAATTGTGATTGCTTTTGCATGAACATTGCGCCTTTATGCATAATGAGATAAAGGAATATCAAATAAAATAACGATAGGTCATAACAAAGAGGTTTTTATGAAAACACTTATCGTTTCAACTGTATTGGCATTCATAACATTTTCTGCGCAGGCTGCAGCATTTCAGGTCACTAGTAATGAAATAAAAACAGGAGAGCAACTTACAACGTCTCATGTCTTTTCTGGATTTGGGTGTGAAGGTGGTAATACATCGCCCTCATTAACCTGGTCTGGTGTTCCTGAAGGTACCAAAAGCTTTGCCGTAACTGTATATGATCCAGATGCACCTACAGGCAGTGGTTGGTGGCATTGGACTGTTGTTAATATTCCAGCAACAGTAACATATTTGCCCGTTGATGCAGGGAGACGTGATGGAACAAAACTGCCGACTGGTGCTGTTCAAGGCCGAAATGATTTTGGCTATGCTGGGTTTGGTGGCGCATGTCCTCCTAAAGGAGATAAACCACATCATTACCAGTTTAAAGTATGGGCTCTAAAAACTGAAAAGATTCCTGTAGATTCTAACTCCAGCGGAGCGTTAGTTGGTTATATGCTTAATGCTAATAAAATCGCAACCGCTGAGATAACACCAGTTTATGAGATAAAGTAGGGTGAGAGTATGCTGGCAAGAGGTAAGACTAACTTAAAGATCGAAGAAATACGGATGCATAAACATCATGAGATTCATAGGGTTAAGCCTCTTATGCCAGCTTTGTGTCGTATCCGTCAGGGAAAGAAAGTTATCAATTGGGAGACGCATACTTTAACTGTTGATAATAATCAAATAATATTATTTCCTTGTGGTTATGAATTTTATATTGAGAATTATCCTGAAGCAGGGCTTTATCTTGCAGAAATGCTTTACTTACCCATTGATTTAATTGAGAGTTTCCAAAAACTTTATACGGTAACTGATCAAATACGTAACAAAACAAGTTTCTTTTTACCTCAGAATCCTGAGTTAATATATTGTTGGGAGCAACTAAAAACATCTGTTTCCCGAGGCTTCTCAACTAAAATTCAGGAGCACTTAGCAATGGGCGTTCTACTTTCGTTAGGAGTGAATCATGTTAATCATTTACTTTTATCATATAGTAAACAATCATTGATAAGTCGTTGTTATAACCTGCTGCTATCCGAACCCGGCACAAAATGGACAGCAAACAAGGTTGCTCGATATCTCTACATTTCTGTTTCTACATTACATCGCCGTCTAGCAAGCGAGGGGGTAAGTTTCCAAAGTATACTGGACGATGTGAGGTTAAATAATGCGTTGTCTGCTATACAAACGACGGTAAAACCTATAAGCGAGATTGCCAGAGAAAATGGTTATAAGTGTCCTTCTCGTTTTACTGAAAGATTTCATAATCGTTTTAATATAACACCAAGAGAGATAAGAAAAGCTTCCAGAGAGTAAAAGTGTTTTAAGAAGGAGCAATTCTATCGATTTTGATTTTGGGAAATCAACACGGCATAATTATGTCACCGGAGCCTGAACAACTCCGGTGACTTCTGCGCTAAACGGGGACGTTTATGCGCACATACAATCCAAACTCTCTTCTCCCTTCACAGATGCAGAAATGCACCTGCAATTCTTTGCATCTAGCGTTTGACCTCTGCGGAGGTGAAGCGTGAACCTCTCACAAGACGGCATCAAATTACATCGCGGCAACTTCACCGCTATCGGTCGGCAGATCCAGCCTTATCTGGAGGAGGGCAAATGCTTTCGCATGGTGCTTAAACCGTGGCGTGAGAAACGCAGTCTTTCCCAGAATGCACTCAGCCACATGTGGTACAGCGAAATCAGTGAATACCTCATCAGCAGGGGTAAAACGTTCGCCACTCCAGCTTGGGTAAAAGATGCTCTCAAACACACATATCTCGGTTATGAAACCAAAGACCTGGTTGATGTCGTAACCGGTGATATCACCACTATCCAGTCGTTACGCCATACCTCCGACCTTGATACCGGAGAGATGTATGTCTTCCTGTGTAAGGTTGAAGCCTGGGCGATGAATATTGGCTGCCACCTGACTATTCCGCAGAGCTGCGAGTTCCAGCTGCTGCGTGATAAGCAGGAGGCGTAATGGCTACACCGCTTATTCGTGTCATGAACGGACACATCTACAAAGTACCAAATCGTCGTAAGCGTAAGCCTGAGCTGAAACCATCCGAAATACCAACACTGCTCGGATATACCGCCAGCCTGGTTGATAAAAAATGGTTGCGACTGGCAGCAAGGAGGAATCATGGCTGATTTGAGAAAAGCAGCGCGTGGTCGGGAATGCCAGGTAAGAATCCCTGGCGTATGTAATGGCAATCCTGAAACGTCTGTACTGGCACATATCCGGCTGGCTGGATTGTGCGGTACCGGTATCAAACCGCCAGACCTGATTGCCACCATTGCATGTTCTGCCTGTCACGACGAGATCGACCGTCGCACGCATTTTGTTGACGCTGGATATGCAAAAGAATGCGCGCTGGAAGGTATGGCGAGAACGCAGGTTATCTGGCTGAAAGAGGGGGTAATTAAGGCGTGAATACCTACAATATCACATTACCCTGGCCGCCGAGCAATAATCGCTATTACCGCCATAATCGCGGGCGCACGCACATCAGCGCAGAGGGGCAGGCATACCGCGAAAACGTCGCCCGAATCATTAAAAACGCAATGCTGGATATCGGCCTGGCTATGCCTGTGAAAATCCGCATTGAGTGTCACATGCCGGATCGCCGTCGCCGTGACCTGGATAATCTACAAAAGGCCGCTTTTGACGCACTCACCAAAGCAGGTTTCTGGCTGGATGATGCTCAGGTCGTTGATTACCGCGTTGTGAAGATGCCGGTTGTCAAAGGTGGAAAGCTGGAACTGACCATCACTGAACAGGGAGATGAATGATGTTTGAGTTTTATATGGCAGAACTTCTTCGCCACCGCTGGATGCGCCTGCGCTTATATCGTTTCCCCGGTTCTGTTTTGACCGATTACCGAATACTGAAGAATTACGCCAAAACACTGACAGGAGCAGGAGTATGAAGTCAGAGATAACAATCAACTAATACTGTTTTGTTGATTTTTGCTTGTAATTGGCGTTCTGGTCTGAGTTTTGTGGAGTAAGTTGATGCGTGATATTCAGATGGTTCTTGAGCGTTGGGGAGCGTGGGCGGCTAATAATCATGAAGATGTGACCTGGTCGTCCATTGCCGCCGGTTTTAAGGGATTAATTCCTTCAAAAGTAAAATCTCGCCCACAATGTTGTGACGATGACGCGATGATCATTTGCGGGTGCATGGCCCGTCTGAAAAAGAACAACAGCGATTTGCATGATTTATTAGTAGATTATTATGTAGTCGGTATGACATTCATGTCACTGGCAGGTAAGCATTGCTGCTCTGATGGTTATATCGGGAAAAGGTTACAGAAGGCTGAGGGCATAATTGAAGGGATGTTAATGGCATTAGATATCCGGTTAGAGATGGATATCGTTGTTAATAACTCTAATTAATATGCCAATTGTTTACTAAAAATTATTAAAAATGGGGCGTTGAGACGCCCCCAAAAATAAAGGGTAATATATAACAGAAGGTTTATATAGTTAGAAGCAAGGTTGTGCTTCTAAAGGAAGTGGCTTGAGGGAGCCACTTATATGTTGGGGAGGCAAAGCCTCCCGCAACATATCTTTTTCGTAATCAGATTAGAACTGATACACCAGACCTACAGCGACGATGTCGTCGGTATCAATACCAGCTGTTTTGGTAAACTTACTATCGTCAATTAAGTTGATTTTGTAATCAACAAAAGTGGACATGTTTTTATTAAAGTAGTAAGTAGCACCGACATCGACATACTTGACTAAGTCTCGGTCACCATGAACACCAAGGTCTTTACCTTTTGACTGAAGGTAAGCAACAGATGGTCGCAGACCGAAGTCAAACTGATATTGTGCTACTGCTTCAAAGTTTTGTGCTTTGTTTGCAATATGGTTATTACCAAAAACGGTCATATTCTGAGTTTCAGAATATGTGGTAGCCAGATAGATATTGTTCGCATCATATTTCAGGCCTGCAGCCCATACTTCCGCATTTTTGCCGGAGGCATTGAATTTGCTCTTACCATAGGCGACCTGACCGTCAGTGCGATCTGATTTAGCATAGGTTGCACCCACGCCGAATCCTTCATACTCATAAGTAGTGGAGAAACCGAAACCGGTAATGACTCCAACTTATTGATAGTGTTTTATGTTCAGATAATGCCCGATGACTTTGTCATGCAGCTCCACCGATTTTGAGAACGACAGCGACTTCCGTCCCAGCCGTGCCAGGTGCTGCCTCAGATTCAGGTTATGCCGCTCAATTCGCTGCGTATATCGCTTGCTGATTACGTGCAGCTTTCCCTTCAGGCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATATCACCACGTCAAAGGGTGACAGCAGGCTCATAAGACGCCCCAGCGTCGCCATAGTGCGTTCACCGAATACGTGCGCAACAACCGTCTTCCGGAGCCTGTCATACGCGTAAAACAGCCAGCGCTGGCGCGATTTAGCCCCGACATAGCCCCACTGTTCGTCCATTTCCGCGCAGACGATGACGTCACTGCCCGGCTGTATGCGCGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAATCGTGTTGAGGCCAACGCCCATAATGCGGGCAGTTGCCCGGCATCCAACGCCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGTTGAGAAGCGGTGTAAGTGAACTGCAGTTGCCATGTTTTACGGCAGTGAGAGCAGAGATAGCGCTGATGTCCGGCGGTGCTTTTGCCGTTACGCACCACCCCGTCAGTAGCTGAACAGGAGGGACAGCTGATAGAAACAGAAGCCACTGGAGCACCTCAAAAACACCATCATACACTAAATCAGTAAGTTGGCAGCATCACCACCGAAACCATCACCATTGGCTTCAGTTACGTCAGTGCGGTCATTTTTACCCTGATACTGAGCAGCAAAGTTCAGGCCATCGACCAGACCAAAGAAGTCGTTGTTACGATAGGTTGCAACACCAGTGGTGCGACCAGTCATGAACACATCTGTTTGGGTCCAGGTATCGCCACCGAATTCTGGCAGAACGTCAGTCCACGCACCGATGTCGTATGCTACACCGTAGTTACGGCCGTAATCGATTGAGCCGTAGTCACCGAATTTCAGGCCTGCAAATGCAAGACGGGTTTTGTCTTTGGAGGAACCTTGAGATTCAGCGCGGTTGCCTTTGAATTCATATTCCCACTGACCGAAACCAGTCAGTTGATCGTTGATTTGGGTTTCACCTTTGAAGCCAAGACGGGCATAAGTAGTATCACCATCATCTGCATCGTTAGAGGAAAAGTAGTGCTTGGCATTAACTTTCCCGTACAGATCCAGCTTGTTACTGTCTTTATTATAAATTTCAGCTGCCTGAGCAGACATCGCCATCAGTACTGATGCAGCTACAGCAGAAATTGCCACTGTTAATTTTTTCATCGTGAGCCCTTTTTTTTGAACTATTATTAAAAAATGATGTCACTGCGCGATAAATATTCATCTAATCAATGTGATTATTTCAAGATGTAAGTTTTAGTTTCTCATTTAATTTGTGAAGTAGATCTCTATTTTTATCTGAACTTTTTCTATCGAAACCTATTTATGGCTCTTATTTGAACAAAAATAAACCTATTAGCTAATTTATATTAATGGCTGTTATTTATGGGGGTTCTATAATTCAGTGGTTTAATTTAAATCAACTAAAAATAACGCCGGAAATTATTTATTGGTTATTTGTTGAGGTTTTCTTATATATTTGTGGTGGTGTTTTGAACACTCGGTAGCATTCTCATAAATATCATTCAGTGGTTTACGTACGTAAAAAATTGGTTATGCTGTTAAGAGTGGTTACTTCGTCACACAGCTTAAACCCGCCGTCGAGCTGGTTTTTCCATTTTTTGAGTCTCGATATTAGCTGATAACTCAATACCTGAGTTATTCACTGACTCCGAGTCTGTTACGTTTCTGCTTTTTTGCGATACGTTGTATTCCCTCAATTTACACCCGCTTTGTCTGCGAGGTGGGGTTATGAAATCCATGGATAAGTTAACAACGGGTGTCGCCTATGGCACCTCAGCAGGTAGTGCCGGTTACTGGTTTTTACAGCTGCTCGATAAAGTCACGCCCTCACAGTGGGCAGCAATAGGTGTGCTGGGTAGCCTGGTATTTGGCCTGCTGACGTACCTGACAAACCTTTATTTCAAGATTAAAGAAGATAAGCGCAAGGCTGCGAGAGGTGAATAATGCCTCCATCATTACGAAAAGCCGTTGCTGCTGCTATTGGTGGCGGAGCAATTGCTATAGCATCAGTGTTAATCACTGGCCCAAGTGGTAACGATGGTCTGGAAGGTGTCAGCTACATACCATACAAAGATATTGTTGGTGTATGGACTGTATGTCACGGGCATACAGGAAAAGACATCATGCTCGGTAAAACGTATACCAAAGCAGAATGCAAAGCCCTCCTGAATAAAGACCTTGCCACGGTCGCCAGACAAATTAACCCGTACATAAAAGTTGATATACCGGAAACAACGCGCGGCGCTCTTTACTCGTTCGTTTACAACGTGGGCGCTGGCAATTTCAGAACATCGACGCTTCTTCGCAAAATAAACCAGGGCGATATCAAAGGCGCATGTGATCAGCTACGTCGCTGGACATATGCTGGCGGTAAGCAATGGAAAGGTCTCATGACTCGTCGTGAGATTGAGCGTGAAATCTGTTTGTGGGGTCAGCAATGAACAGAGTAACCGCGATTATCTCCGCTCTGGTTATCTGCATCATCGTCTGCCTGTCATGGGCTGTTAATCATTACCGTGATAACGCCATTACCTACAAAGCCCAGCGCGACAAAAATGCCAGAGAACTGAAGCTGGCGAACGCGGCAATTACTGACATGCAGATGCGCCAGCGTGATGTTGCTGCACTGGATGAAAAATACACGAAGGAGTTAGCTAATGCGAAAGCTGAAAATGATGCTCTGCGTGATGATGTTGCCGCTGGTCGTCGTCGGTTGCACATCAAAGCAGTCTGTCAGTCAGTGCGTGAAGCCACCACCGCCTCCGGCGTGGATAATGCAGCCTCCCCCCGACTGGCAGACACCGCTGAACGGGATTATTTCACCCTCAGAGAGAGGCTGATCACTATGCAAAAACAACTGGAAGGAACCCAGAAGTATATTAATGAGCAGTGCAGATAGAGCTGCCCATATCGATGGGCAACTCATGCAATTATTGTGAGCAATACACACGCGCTTCCAGCGGAGTATAAATGCCTAAAGTAATAAAACCGAGCAATCCATTTACGAATGTTTGCTGGGTTTCTGTTTTAACAACATTTTCTGCGCCGCCACAAATTTTGGCTGCATCAACAGTTTTCTCCTGTCCAATTCCCGAAACGAAGAAGTGATGGGTGATGGTTTCCTTTGGTGTTACTGCTGTCGGTTTGTTTCCAACAGTAAACGTCTGTTGAGCACATCCTGTAATAAGCATTGCCAGAGCGGCAGAAAACAACATTTTTTTCATCTTATTATCCTGCATTGTTAAAAACGGCAGAATCCTATGTGACAACAATTAAACGATAGTTAAATGGATTGATGAAAATTAAAACTATATAGGTGGATGCTCAGCCTATTGGAGGAGGGGGGGCACTCAGAATCCTGTGGAATGAAATAAACCGCTCTTTCTGTCCATTACCCTTTTAGCTGCGCTGTATCGTCGCCGTATTCCCGCATTAACCATGACCGTAGCCCGACGGGGAATTCCTTCTGCGTGAGTGTGCGGGAATAATCAAAAACGATGCACACCGGGTTTTACTGTGCTGACAGACGCAGGGTTACCCTCATAGTCGCTTTTCCGGTGCGATGGTGGAAGAAACCGGGATGTTTATTCATCATCACTCTGGATTGATGTATATGCTCTCTTTTCTGACGTTAGTCTCCGACGGCAGGCTTCAATGACCCAGGCTGAGAAATTCCCAGACCCTTTTTGCTCAAGAGCGATGTTAATTTGTTCAATCATTTGGTTAGGAAAGCGGATGTTGCGGGTTGTTGTTCTGCGGGTTCTGTTCTTAGTTGACATGAGGTTGCCCCGTATTCAGTGTCGCTGATTTGTATTGTCTGAAGTTGTTTTTACGTTAAGTTGATGCAGATCAATTAATACGATACCTGCGTCATAATTGATTATTTGACGTGGTTTGATGGCGTAGATGCACGTTGTGACATGTAGATGATAATTATTATCATTTTGCGGGTCCTTTCCGGCGATCCGACAGGTTACGGGGCGGCGACCTCGCGGGTTTTCGCTATTTATGAAAATTTTCCGGTTTAAGGCATTTCCGTTCTTCTTCGTCGTAACTTAATGTTTTTATTTAAAATACCCCCTGAAAAGAAAGGAAACGACAGGTGCTGAAAACGAGCTTTTGGGCCTCTGTCGTTTCCTTTCTCTGTTTTTGGCCGTGGAATGAACAATGGAAGTCAACAAAAAGCAGCTGGCTGACATTTTCGGTGCGAGTATCCGTACCATTCAGAACTGGCAGGAACAGGGAATGCCCGTTCTGCGAGGCGGTGGCAAGGGTAATGAGGTGCTTTATGACTCTGCCGCCGTTATAAGATGGTATGCCGAAAGGGATGCTGAAATTGAGAACGAAAAGCTGCGCCGGGAAGTTGAAGAACTGCGGCAGGCCAGCGAGACAGATCTCCAGCCAGGGACTATTGAGTACGAACGCCATCGACTTACGCGTGCGCAGGCCGACGCACAGGAGCTGAAAAATGCCAGAGACTCCGCTGAAGTGGTGGAAACCGCATTCTGTACTTTCGTGCTGTCGCGGATCGCAGGTGAAATTGCCAGTATTCTCGACGGGATCCCCCTGTCGGTGCAGCGGCGTTTTCCGGAACTGGAAAACCGACATGTTGATTTCCTGAAACGGGATATCATCAAAGCCATGAACAAAGCAGCCGCGCTGGATGAACTGATACCCGAACACGGCACTGGTCGGCGTGCAGGTGGACTCGGAGCAGTTCGGCAGCCAGCAGGTGAGCCGTAATTATCATCTTCGCGGGCGCATTCTGCAGGTGCCGTCGAACTATAACCCGCAGACGCGGCAATACAGCGGTATCTGGGACGGAACGTTTAAGCCGGCATATGGGGGCTGAAGCTGCCGACGTTGCGCCAGCGCCTGTTCCGCTGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTACCGGAAAAAGAGGCCATCGTGGATAACGGGGCGCACTTTGACGGCGACCAGAGCGGCACGGTAAATGGTGTCACGCCGCCAGCGGTGCAGCACCTGACCGCAGAAGTCACCGCAGACAGCGGGGAATACCAGGTGCTGGCCCGCTGGGATACGCCGAAGGTGGTGAAGGGCGTGAGCTTCCTGCTTCGCCTGACCGTGGCAGCGGATGACGGCCGTGAGCGACTGGTCAGCACGGCCCGGACGACGGAAACCACTTACCGCTTCACACAACTGGCTCTGGGGAACTACAGGCTGACAGTCCGGGCAGTAAATGCGTGGGGGCAGCAGGGCGAGCCGGCGTCGGTATCGTTCCGGATTGCCGCACCGGCAGCGCCGTCGCAGATTGAGCTGACACCGGGGTATTTTCAGATAACCGCCACGCCGCATCTTGCGGTTTATGATCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAAAGCGGATTGCGGATATCAGGCAGGTTGAAACCACAGCACGCTATCTTGGCACGGCGCTGTACTGGATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTATTTTTACATCCGCAGTGTGAACACCGTTGGCAAATCGGCATTCGTGGAGGCTGTTGGCCAGCCGAGTGATGATGCATCCGGCTATCTGAATTTTTTCAAAGGAGAGATAGGGAAAACCCATCTGGCTCAGGAGTTGTGGACGCAGATTGATAACGGTCAGCTTGCGCCTGACCTGGCGGAAATCAGAACGTCCATCACGGATGTCAGTAATGAAATCACGCAGACCGTCAATAAGAAACTGGAAGACCAGAGTGCGGCAATCCAGCAGATACAGAAGGTTCAGGTTGATACAAATAATAACCTGAACAGCATGTGGGCTGTGAAGCTGCAGCAGATGCAGGACGGACGCCTTTATATCGCGGGTATTGGTGCCGGTATTGAGAATACCCCTGACGGTATGCAGAGTCAGGTGCTGCTGGCGGCGGACAGGATTGCGATGATTAATCCTGCGAATGGCAACACAAAGCCGATGTTTGTTGGTCAGGGCGATCAGATATTCATGAACGAAGTGTTCCTGAAATATCTGACGGCTCCCACCATTACCAGCGGCGGTAATCCTCCGGCATTTTCCCTGACACCGGACGGGCGGCTGACGGCGAAAAATGCCGATATTAGCGGTAACGTGAATGCGAACTCCGGGACGCTCAACAACGTCACGATTAACGAGAACTGCCGGGTTCTGGGAAAACTGTCCGCGAACCAGATTGAAGGCGATCTCGTTAAAACAGTGGGCAAAGCTTTCCCCCGGGACTCCCGTGCACCGGAGCGGTGGCCATCAGGGACCATTACCGTCAGGGTTTATGACGATCAGCCGTTTGACCGGCAAATTGTTATTCCGGCGGTGGCATTCAGTGGCGCTAAACATGAGCAAGATCATACTGATATCTACTCCTCATGCCGTCTGATAGTACGAAAAAACGGTGCTGAAATTTATAACCGAACGGCTCTGGATAATACGCTGATATATACGGGTGTTATTGATATGCCTGCAGGCAGTGGTGTAATGACACTGGAATTTTCTGTATCGGCATGGCTGGTAAATGGCTGGTATCCCACAGCAAGTATCAGCGATTTGCTGGTTGTGGTGATGAAGAAAGCCACTGCAGGCATCATGATTAGCTGA
Protein sequences of DBSCAN-SWA_1 >NZ_CP020368|533907:550491|533907_535071_-|WP_001318654.1|integrase|DBSCAN-SWA MSLFRRNEIWYASYSLPGGKRIKESLGTKDKRQAQELHDKRKAELWRVEKLGDLPDVTFEEACLRWLEEKADKKSLDSDKSRIEFWLEHFEGIRLKDISEAKIYSAVSRMHNRKTKEIWKQKVQAAIRKGKELPVYEPKPVPTQTKAKHLAMIKAILRAAERDWKWLEKAPVIKIPAVRNKRVRWLEKEEAKRLIDECPEPLKSVVKFALATGLRKSNIINLEWQQIDMQRRVAWVNPEESKSNRAIGVALNDTACKVLRDQIGKHHKWVFVHTKAAKRADGTSTPAVRKMRIDSKTSWLSACRRAGIEDFRFHDLRHTWASWLIQSGVPLSVLQEMGGWESIEMVRRYAHLAPNHLTEHARKIDDIFGDNVPNMSHSGIMEDIKKA >NZ_CP020368|533907:550491|541678_541969_+|WP_000774479.1|DBSCAN-SWA MADLRKAARGRECQVRIPGVCNGNPETSVLAHIRLAGLCGTGIKPPDLIATIACSACHDEIDRRTHFVDAGYAKECALEGMARTQVIWLKEGVIKA >NZ_CP020368|533907:550491|540962_541064_+|WP_001303586.1|DBSCAN-SWA MRTYNPNSLLPSQMQKCTCNSLHLAFDLCGGEA >NZ_CP020368|533907:550491|537289_537439_+|WP_001299444.1|DBSCAN-SWA MSLVLCFLLMSLFFMYSFVLSRLWRKKIAIRLLLYIQDNVTLIVFLNKN >NZ_CP020368|533907:550491|545571_545787_+|WP_000839596.1|lysis|DBSCAN-SWA MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >NZ_CP020368|533907:550491|540048_540846_+|WP_000881075.1|DBSCAN-SWA MLARGKTNLKIEEIRMHKHHEIHRVKPLMPALCRIRQGKKVINWETHTLTVDNNQIILFPCGYEFYIENYPEAGLYLAEMLYLPIDLIESFQKLYTVTDQIRNKTSFFLPQNPELIYCWEQLKTSVSRGFSTKIQEHLAMGVLLSLGVNHVNHLLLSYSKQSLISRCYNLLLSEPGTKWTANKVARYLYISVSTLHRRLASEGVSFQSILDDVRLNNALSAIQTTVKPISEIARENGYKCPSRFTERFHNRFNITPREIRKASRE >NZ_CP020368|533907:550491|536909_537242_+|WP_001070454.1|DBSCAN-SWA MNPYIYLGGAILAEVIGTTLMKFSEGFTRLWPSVGTIICYCASFWLLAQTLAYIPTGIAYGIWSGVGIVLISLLLWGFFGQRLDLPAIIGMMLICAGVLVINLLSRSTPH >NZ_CP020368|533907:550491|541965_542328_+|WP_001099655.1|DBSCAN-SWA MNTYNITLPWPPSNNRYYRHNRGRTHISAEGQAYRENVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVVKGGKLELTITEQGDE >NZ_CP020368|533907:550491|539487_540039_+|WP_001306955.1|DBSCAN-SWA MKTLIVSTVLAFITFSAQAAAFQVTSNEIKTGEQLTTSHVFSGFGCEGGNTSPSLTWSGVPEGTKSFAVTVYDPDAPTGSGWWHWTVVNIPATVTYLPVDAGRRDGTKLPTGAVQGRNDFGYAGFGGACPPKGDKPHHYQFKVWALKTEKIPVDSNSSGALVGYMLNANKIATAEITPVYEIK >NZ_CP020368|533907:550491|547427_547622_-|WP_001415975.1|DBSCAN-SWA MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSEKRAYTSIQSDDE >NZ_CP020368|533907:550491|548682_550491_+|WP_072094231.1|DBSCAN-SWA MWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFLLRLTVAADDGRERLVSTARTTETTYRFTQLALGNYRLTVRAVNAWGQQGEPASVSFRIAAPAAPSQIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRIADIRQVETTARYLGTALYWIAASINIKPGHDYYFYIRSVNTVGKSAFVEAVGQPSDDASGYLNFFKGEIGKTHLAQELWTQIDNGQLAPDLAEIRTSITDVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDGRLYIAGIGAGIENTPDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQIFMNEVFLKYLTAPTITSGGNPPAFSLTPDGRLTAKNADISGNVNANSGTLNNVTINENCRVLGKLSANQIEGDLVKTVGKAFPRDSRAPERWPSGTITVRVYDDQPFDRQIVIPAVAFSGAKHEQDHTDIYSSCRLIVRKNGAEIYNRTALDNTLIYTGVIDMPAGSGVMTLEFSVSAWLVNGWYPTASISDLLVVVMKKATAGIMIS >NZ_CP020368|533907:550491|536568_536655_+|WP_129486119.1|DBSCAN-SWA MAYWLVTNLYQNMRANRCNRNQHKSGCG >NZ_CP020368|533907:550491|535269_535548_-|WP_000488419.1|DBSCAN-SWA MFRIILPNTWYVDHHGTPCKILRSTHNKVHYIRKGRTCIASMFRFNHDFEPVNKADADRIAEEIETAEHIKKLRDMRSKSRGNHGIIQPHTR >NZ_CP020368|533907:550491|535912_536194_-|WP_001386642.1|DBSCAN-SWA MHFSGSGLHILCAYACRHGTCSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >NZ_CP020368|533907:550491|546500_546683_+|WP_001228695.1|lysis|DBSCAN-SWA MRKLKMMLCVMMLPLVVVGCTSKQSVSQCVKPPPPPAWIMQPPPDWQTPLNGIISPSERG >NZ_CP020368|533907:550491|537496_539023_+|WP_000709082.1|DBSCAN-SWA MKKAIAYMRFSSPGQMSGDSLNRQRRLIAEWLKVNSDYYLDTITYEDLGLSAFKGKHAQSGAFSEFLDAIEHGYILPGTTLLVESLDRLSREKVGEAIERLKLILNHGIDVITLCDNTVYNIDSLNEPYSLIKAILIAQRANEESEIKSSRVKLSWKKKRQDALESGTIMTASCPRWLSLDDKRTAFVPDPDRVKTIELIFKLRMERRSLNAIAKYLNDHAVKNFSGKESAWGPSVIEKLLANKALIGICVPSYRARGKGISEIAGYYPRVISDDLFYAVQEIRLAPFGISNSSKNPMLINLLRTVMKCEACGNTMIVHAVSGSLHGYYVCPMRRLHRCDRPSIKRDLVDYNIINELLFNCSKIQPVENKKDANETLELKIIELQMKINNLIVALSVAPEVTAIAEKIRLLDKELRRASVSLKTLKSKGVNSFSDFYAIDLTSKNGRELCRTLAYKTFEKIIINTDNKTCDIYFMNGIVFKHYPLMKVISAQQAISALKYMVDGEIYF >NZ_CP020368|533907:550491|542324_542465_+|WP_000971071.1|DBSCAN-SWA MMFEFYMAELLRHRWMRLRLYRFPGSVLTDYRILKNYAKTLTGAGV >NZ_CP020368|533907:550491|545786_546284_+|WP_001135280.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIMLGKTYTKAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSTLLRKINQGDIKGACDQLRRWTYAGGKQWKGLMTRREIEREICLWGQQ >NZ_CP020368|533907:550491|541515_541686_+|WP_000224907.1|DBSCAN-SWA MATPLIRVMNGHIYKVPNRRKRKPELKPSEIPTLLGYTASLVDKKWLRLAARRNHG >NZ_CP020368|533907:550491|542550_542934_+|WP_001204780.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVVGMTFMSLAGKHCCSDGYIGKRLQKAEGIIEGMLMALDIRLEMDIVVNNSN >NZ_CP020368|533907:550491|541060_541516_+|WP_001054340.1|DBSCAN-SWA MNLSQDGIKLHRGNFTAIGRQIQPYLEEGKCFRMVLKPWREKRSLSQNALSHMWYSEISEYLISRGKTFATPAWVKDALKHTYLGYETKDLVDVVTGDITTIQSLRHTSDLDTGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQEA >NZ_CP020368|533907:550491|546773_547067_-|WP_000738423.1|DBSCAN-SWA MKKMLFSAALAMLITGCAQQTFTVGNKPTAVTPKETITHHFFVSGIGQEKTVDAAKICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSQ >NZ_CP020368|533907:550491|534926_535298_-|WP_000446905.1|DBSCAN-SWA MESYSLTLDEACQFLKISRPTATNWIRTGRLQATRKDPTKPKSPYLTTRQACIAALQSPLHTVQVSAGDGITEERKCHSSAEMKYGMPRIRSRAGNELRNLLAQRTSGKLRSCTTSEKQNSGE >NZ_CP020368|533907:550491|535595_535814_-|WP_000763373.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQVISATHCCECGDPIDERRRLAVQGCRTCASCQEEIELKNKQWGL |
24 | Enterobacteria_phage(66.67%) | integrase,lysis | attL 532675:532688|attR 539992:540005 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
748735 : 770243
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP020368|748735:770243|DBSCAN-SWA TGTGAAACCAGTAACGTTATACGATGTCGCAGAGTATGCCGGTGTCTCTTATCAGACCGTTTCCCGCGTGGTGAACCAGGCCAGCCACGTTTCTGCGAAAACGCGGGAAAAAGTGGAAGCGGCGATGGCGGAGCTGAATTACATTCCCAACCGCGTGGCACAACAACTGGCGGGCAAACAGTCGTTGCTGATTGGCGTTGCCACCTCCAGTCTGGCCCTGCACGCGCCGTCGCAAATTGTCGCGGCGATTAAATCTCGCGCCGATCAACTGGGTGCCAGCGTGGTGGTGTCGATGGTAGAACGAAGCGGCGTCGAAGCCTGTAAAGCGGCGGTGCACAATCTTCTCGCGCAACGCGTCAGTGGGCTGATCATTAACTATCCGCTGGATGACCAGGATGCCATTGCTGTGGAAGCTGCCTGCACTAATGTTCCGGCGTTATTTCTTGATGTCTCTGACCAGACACCCATCAACAGTATTATTTTCTCCCATGAAGACGGTACGCGACTGGGCGTGGAGCATCTGGTCGCATTGGGTCACCAGCAAATCGCGCTGTTAGCGGGCCCATTAAGTTCTGTCTCGGCGCGTCTGCGTCTGGCTGGCTGGCATAAATATCTCACTCGCAATCAAATTCAGCCGATAGCGGAACGGGAAGGCGACTGGAGTGCCATGTCCGGTTTTCAACAAACCATGCAAATGCTGAATGAGGGCATCGTTCCCACTGCGATGCTGGTTGCCAACGATCAGATGGCGCTGGGCGCAATGCGCGCCATTACCGAGTCCGGGCTGCGCGTTGGTGCGGATATCTCGGTAGTGGGATACGACGATACCGAAGACAGCTCATGTTATATCCCGCCGTTAACCACCATCAAACAGGATTTTCGCCTGCTGGGGCAAACCAGCGTGGACCGCTTGCTGCAACTCTCTCAGGGCCAGGCGGTGAAGGGCAATCAGCTGTTGCCCGTCTCACTGGTGAAAAGAAAAACCACCCTGGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTAAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATAATGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGGATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGCGCTTTGCCTGGTTTCCGGCACCAGAAGCGGTGCCGGAAAGCTGGCTGGAGTGCGATCTTCCTGAGGCCGATACTGTCGTCGTCCCCTCAAACTGGCAGATGCACGGTTACGATGCGCCCATCTACACCAACGTGACCTATCCCATTACGGTCAATCCGCCGTTTGTTCCCACGGAGAATCCGACGGGTTGTTACTCGCTCACATTTAATGTTGATGAAAGCTGGCTACAGGAAGGCCAGACGCGAATTATTTTTGATGGCGTCGGGATCTGATCCGGATTTACTAACTGGAAGAGGCACTAAATGAACACGATTAACATCGCTAAGAACGACTTCTCTGACATCGAACTGGCTGCTATCCCGTTCAACACTCTGGCTGACCATTACGGTGAGCGTTTAGCTCGCGAACAGTTGGCCCTTGAGCATGAGTCTTACGAGATGGGTGAAGCACGCTTCCGCAAGATGTTTGAGCGTCAACTTAAAGCTGGTGAGGTTGCGGATAACGCTGCCGCCAAGCCTCTCATCACTACCCTACTCCCTAAGATGATTGCACGCATCAACGACTGGTTTGAGGAAGTGAAAGCTAAGCGCGGCAAGCGCCCGACAGCCTTCCAGTTCCTGCAAGAAATCAAGCCGGAAGCCGTAGCGTACATCACCATTAAGACCACTCTGGCTTGCCTAACCAGTGCTGACAATACAACCGTTCAGGCTGTAGCAAGCGCAATCGGTCGGGCCATTGAGGACGAGGCTCGCTTCGGTCGTATCCGTGACCTTGAAGCTAAGCACTTCAAGAAAAACGTTGAGGAACAACTCAACAAGCGCGTAGGGCACGTCTACAAGAAAGCATTTATGCAAGTTGTCGAGGCTGACATGCTCTCTAAGGGTCTACTCGGTGGCGAGGCGTGGTCTTCGTGGCATAAGGAAGACTCTATTCATGTAGGAGTACGCTGCATCGAGATGCTCATTGAGTCAACCGGAATGGTTAGCTTACACCGCCAAAATGCTGGCGTAGTAGGTCAAGACTCTGAGACTATCGAACTCGCACCTGAATACGCTGAGGCTATCGCAACCCGTGCAGGTGCGCTGGCTGGCATCTCTCCGATGTTCCAACCTTGCGTAGTTCCTCCTAAGCCGTGGACTGGCATTACTGGTGGTGGCTATTGGGCTAACGGTCGTCGTCCTCTGGCGCTGGTGCGTACTCACAGTAAGAAAGCACTGATGCGCTACGAAGACGTTTACATGCCTGAGGTGTACAAAGCGATTAACATTGCGCAAAACACCGCATGGAAAATCAACAAGAAAGTCCTAGCGGTCGCCAACGTAATCACCAAGTGGAAGCATTGTCCGGTCGAGGACATCCCTGCGATTGAGCGTGAAGAACTCCCGATGAAACCGGAAGACATCGACATGAATCCTGAGGCTCTCACCGCGTGGAAACGTGCTGCCGCTGCTGTGTACCGCAAGGACAAGGCTCGCAAGTCTCGCCGTATCAGCCTTGAGTTCATGCTTGAGCAAGCCAATAAGTTTGCTAACCATAAGGCCATCTGGTTCCCTTACAACATGGACTGGCGCGGTCGTGTTTACGCTGTGTCAATGTTCAACCCGCAAGGTAACGATATGACCAAAGGACTGCTTACGCTGGCGAAAGGTAAACCAATCGGTAAGGAAGGTTACTACTGGCTGAAAATCCACGGTGCAAACTGTGCGGGTGTCGATAAGGTTCCGTTCCCTGAGCGCATCAAGTTCATTGAGGAAAACCACGAGAACATCATGGCTTGCGCTAAGTCTCCACTGGAGAACACTTGGTGGGCTGAGCAAGATTCTCCGTTCTGCTTCCTTGCGTTCTGCTTTGAGTACGCTGGGGTACAGCACCACGGCCTGAGCTATAACTGCTCCCTTCCGCTGGCGTTTGACGGGTCTTGCTCTGGCATCCAGCACTTCTCCGCGATGCTCCGAGATGAGGTAGGTGGTCGCGCGGTTAACTTGCTTCCTAGTGAAACCGTTCAGGACATCTACGGGATTGTTGCTAAGAAAGTCAACGAGATTCTACAAGCAGACGCAATCAATGGGACCGATAACGAAGTAGTTACCGTGACCGATGAGAACACTGGTGAAATCTCTGAGAAAGTCAAGCTGGGCACTAAGGCACTGGCTGGTCAATGGCTGGCTTACGGTGTTACTCGCAGTGTGACTAAGCGTTCAGTCATGACGCTGGCTTACGGGTCCAAAGAGTTCGGCTTCCGTCAACAAGTGCTGGAAGATACCATTCAGCCAGCTATTGATTCCGGCAAGGGTCTGATGTTCACTCAGCCGAATCAGGCTGCTGGATACATGGCTAAGCTGATTTGGGAATCTGTGAGCGTGACGGTGGTAGCTGCGGTTGAAGCAATGAACTGGCTTAAGTCTGCTGCTAAGCTGCTGGCTGCTGAGGTCAAAGATAAGAAGACTGGAGAGATTCTTCGCAAGCGTTGCGCTGTGCATTGGGTAACTCCTGATGGTTTCCCTGTGTGGCAGGAATACAAGAAGCCTATTCAGACGCGCTTGAACCTGATGTTCCTCGGTCAGTTCCGCTTACAGCCTACCATTAACACCAACAAAGATAGCGAGATTGATGCACACAAACAGGAGTCTGGTATCGCTCCTAACTTTGTACACAGCCAAGACGGTAGCCACCTTCGTAAGACTGTAGTGTGGGCACACGAGAAGTACGGAATCGAATCTTTTGCACTGATTCACGACTCCTTCGGTACCATTCCGGCTGACGCTGCGAACCTGTTCAAAGCAGTGCGCGAAACTATGGTTGACACATATGAGTCTTGTGATGTACTGGCTGATTTCTACGACCAGTTCGCTGACCAGTTGCACGAGTCTCAATTGGACAAAATGCCAGCACTTCCGGCTAAAGGTAACTTGAACCTCCGTGACATCTTAGAGTCGGACTTCGCGTTCGCGTAACGCCAAATCAATACGACTCCGGATCCCCTTCGAAGGAAAGACCTGATGCTTTTCGTGCGCGCATAAAATACCTTGATACTGTGCCGGATGAAAGCGGTTCGCGACGAGTAGATGCAATTATGGTTTCTCCGCCAAGAATCTCTTTGCATTTATCAAGTGTTTCCTTCATTGATATTCCGAGAGCATCAATATGCAATGCTGTTGGGATGGCAATTTTTACGCCTGTTTTGCTTTGCTCGACATAAAGATATCCATCTACGATATCAGACCACTTCATTTCGCATAAATCACCAACTCGTTGCCCGGTAACAACAGCCAGTTCCATTGCAAGTCTGAGCCAACATGGTGATGATTCTGCTGCTTGATAAATTTTCAGGTATTCGTCAGCCGTAAGTCTTGATCTCCTTACCTCTGATTTTGCTGCGCGAGTGGCAGCGACATGGTTTGTTGTTATATGGCCTTCAGCTATTGCCTCTCGGAATGCATCGCTCAGTGTTGATCTGATTAACTTGGCTGACGCCGCCTTGCCCTCGTCTATGTATCCATTGAGCATTGCCGCAATTTCTTTTGTGGTGATGTCTTCAAGTGGAGCATCAGGCAGACCCCTCCTTATTGCTTTAATTTTGCTCATGTAATTTATGAGTGTCTTCTGCTTGATTCCTCTGCTGGCCAGGATTTTTTCGTAGCGATCAAGCCATGAATGTAACGTAACGGAATTATCACTGTTGATTCTCGCTGTCAGAGGCTTGTGTTTGTGTCCTGAAAATAACTCAATGTTGGCCTGTATAGCTTCAGTGATTGCGATTCGCCTGTCTCTGCCTAATCCAAACTCTTTACCCGTCCTTGGGTCCCTGTAGCAGTAATATCCATTGTTTCTTATATAAAGGTTAGGGGGTAAATCCCGGCGCTCATGACTTCGCCTTCTTCCCATTTCTGATCCTCTTCAAAAGGCCACCTGTTACTGGTCGATTTAAGTCAACCTTTACCGCTGATTCGTGGAACAGATACTCTCTTCCATCCTTAACCGGAGGTGGGAATATCCTGCATTCCCGAACCCATCGACGAACTGTTTCAAGGCTTCTTGGACGTCGCTGGCGTGCGTTCCACTCCTGAAGTGTCAAGTACATCGCAAAGTCTCCGCAATTACACGCAAGAAAAAACCGCCATCAGGCGGCTTGGTGTTCTTTCAGTTCTTCAATTCGAATATTGGTTACGTCTGCATGTGCTATCTGCGCCCATATCATCCAGTGGTCGTAGCAGTCGTTGATGTTCTCCGCTTCGATAACTCTGTTGAATGGCTCTCCATTCCATTCTCCTGTGACTCGGAAGTGCATTTATCATCTCCATAAAACAAAACCCGCCGTAGCGAGTTCAGATAAAATAAATCCCCGCGAGTGCGAGGATTGTTATGTAATATTGGGTTTAATCATCTATATGTTTTGTACAGAGAGGGCAAGTATCGTTTCCACCGTACTCGTGATAATAATTTTGCACGGTATCAGTCATTTCTCGCACATTGCAGAATGGGGATTTGTCTTCATTAGACTTATAAACCTTCATGGAATATTTGTATGCCGACTCTATATCTATACCTTCATCTACATAAACACCTTCGTGATGTCTGCATGGAGACAAGACACCGGATCTGCACAACATTGATAACGCCCAATCTTTTTGCTCAGACTCTAACTCATTGATACTCATTTATAAACTCCTTGCAATGTATGTCGTTTCAGCTAAACGGTATCAGCAATGTTTATGTAAAGAAACAGTAAGATAATACTCAACCCGATGTTTGAGTACGGTCATCATCTGACACTACAGACTCTGGCATCGCTGTGAAGACGACGCGAAATTCAGCATTTTCACAAGCGTTATCTTTTACAAAACCGATCTCACTCTCCTTTGATGCGAATGCCAGCGTCAGACATCATATGCAGATACTCACCTGCATCCTGAACCCATTGACCTCCAACCCCGTAATAGCGATGCGTAATGATGTCGATAGTTACTAACGGGTCTTGTTCGATTAACTGCCGCAGAAACTCTTCCAGGTCACCAGTGCAGTGCTTGATAACAGGAGTCTTCCCAGGATGGCGAACAACAAGAAACTGGTTTCCGTCTTCACGGACTTCGTTGCTTTCCAGTTTAGCAATACGCTTACTCCCATCCGAGATAACACCTTCGTAATACTCACGCTGCTCGTTGAGTTTTGATTTTGCTGTTTCAAGCTCAACACGCAGTTTCCCTACTGTTAGCGCAATATCCTCGTTCTCCTGGTCGCGGCGTTTGATGTATTGCTGGTTTCTTTCCCGTTCATCCAGCAGTTCCAGCACAATCGATGGTGTTACCAATTCATGGAAAAGGTCTGCGTCAAATCCCCAGTCGTCATGCATTGCCTGCTCTGCCGCTTCACGCAGTGCCTGAGAGTTAATTTCGCTCACTTCGAACCTCTCTGTTTACTGATAAGTTCCAGATCCTCCTGGCAACTTGCACAAGTCCGACAACCCTGAACGACCAGGCGTCTTCGTTCATCTATCGGATCGCCACACTCACAACAATGAGTGGCAGATATAGCCTGGTGGTTCAGGCGGCGCATTTTTATTGCTGTGTTGCGCTGTAATTCTTCTATTTCTGATGCTGAATCAATGATGTCTGCCATCTTTCATTAATCCCTGAACTGTTGGTTAATACGCTTGAGGGTGAATGCGAATAATAAAAAAGGAGCCTGTAGCTCCCTGATGATTTTGCTTTTCATGTTCATCGTTCCTTAAAGACGCCGTTTAACATGCCGATTGCCAGGCTTAAATGAGTCGGTGTGAATCCCATCAGCGTTACCGTTTCGCGGTGCTTCTTCAGTACGCTACGGCAAATGTCATCGACGTTTTTATCCGGAAACTGCTGTCTGGCTTTTTTTGATTTCAGAATTAGCCTGACGGGCAATGCTGCGAAGGGCGTTTTCCTGCTGAGGTGTCATTGAACAAGTCCCATGTCGGCAAGCATAAGCACACAGAATATGAAGCCCGCTGCCAGAAAAATGCATTCCGTGGTTGTCATACCTGGTCTCTCTCATCTGCTTCTGCTTTCGCCACCATCATTTCCAGCTTTTGTGAAAGGGATGCGGCTAACGTATGAAATTCTTCGTCTGTTTCTACTGGTATTGGCACAAACCTGATTCCAATTTGAGCAAGGCTATGTGCCATCTCGATACTCGTTCTTAACTCAACAGAAGATGCTTTGTGCATACAGCCCCTCGTTTATTATTTATCTCCTCAGCCAGCCGCTGTGCTTTCAGTGGATTTCGGATAACAGAAAGGCCGGGAAATACCCAGCCTCGCTTTGTAACGGAGTAGACGAAAGTGATTGCGCCTACCCGGATATTATCGTGAGGATGCGTCATCGCCATTGCTCCCCAAATACAAAACCAATTTCAGCCAGTGCCTCGTCCATTTTTTCGATGAACTCCGGCACGATCTCGTCAAAACTCGCCATGTACTTTTCATCCCGCTCAATCACGACATAATGCAGGCCTTCACGCTTCATACGCGGGTCATAGTTGGCAAAGTACCAGGCATTTTTTCGCGTCACCCACATGCTGTACTGCACCTGGGCCATGTAAGCTGACTTTATGGCCTCGAAACCACCGAGCCGGAACTTCATGAAATCCCGGGAGGTAAACGGGCATTTCAGTTCAAGGCCGTTGCCGTCACTGCATAAACCATCGGGAGAGCAGGCGGTACGCATACTTTCGTCGCGATAGATGATCGGGGATTCAGTAACATTCACGCCGGAAGTGAATTCAAACAGGGTTCTGGCGTCGTTCTCGTACTGTTTTCCCCAGGCCAGTGCTTTAGCGTTAACTTCCGGAGCCACACCGGTGCAAACCTCAGCAAGCAGGGTGTGGAAGTAGGACATTTTCATGTCAGGCCACTTCTTTCCGGAGCGGGGTTTTGCTATCACGTTGTGAACTTCTGAAGCGGTGATGACGCCGAGCCGTAATTTGTGCCACGCATCTTCCCCCTGTTCGACAGCTCTCACATCGATCCCGGTACGCTGCAGGATAATGTCCGGTGTCATGCTGCCACCTTCTGCTCTGCGGCTTTCTGTTTCAGGAATCCAAGAGCTTTTACTGCTTCGGCCTGTGTCAGTTCTGACGATGCACGAATGTCGCGGCGAAATATCTGGGAACAGAGCGGCAATAAGTCGTCATCCCATGTTTTATCCAGGGCGATCAGCAGAGTGTTAATCTCCTGCATGGTTTCATCGTTAACCGGAGTGATGTCGCGTTCCGGCTGACGTTCTGCAGTGTATGCAGTATTTTCGACAATGCGCTCGGCTTCATCCTTGTCATAGATACCAGCAAATCCGAAGGCCAGACGGGCACACTGAATCATGGCTTTATGACGTAACATCCGTTTGGGATGCGACTGCCACGGCCCCGTGATTTCTCTGCCTTCGCGAGTTTTGAATGGTTCGCGGCGGCATTCATCCATCCATTCGGTAACGCAGATCGGATGATTACGGTCCTTGCGGTAAATCCGGCATGTACAGGATTCATTGTCCTGCTCAAAGTCCATGCCATCAAACTGCTGGTTTTCATTGATGATGCGGGACCAGCCATCAACGCCCACCACCGGAACGATGCCATTCTGCTTATCAGGAAAGGCGTAAATTTCTTTCGTCCACGGATTAAGGCCGTACTGGTTGGCAACGATCAGTAATGCGATGAACTGCGCATCGCTGGCATCACCTTTAAATGCCGTCTGGCGAAGAGTGGTGATCAGTTCCTGTGGGTCGACAGAATCCATGCCGACACGTTCAGCCAGCTTCCCAGCCAGCGTTGCGAGTGCAGTACTCATTCGTTTTATACCTCTGAATCAATATCAACCTGGTGGTGAGCAATGGTTTCAACCATGTACCGGATGTGTTCTGCCATGCGCTCCTGAAACTCAACATCGTCATCAAACGCACGGGTAATGGATTTTTTGCTGGCCCCGTGGCGTTGCAAATGATCGATGCATAGCGATTCAAACAGGTGCTGGGGCAGGCCTTTTTCCATGTCGTCTGCCAGTTCTGCCTCTTTCTCTTCACGGGCGAGCTGCTGGTAGTGACGCGCCCAGCTCTGAGCCTCAAGACGATCCTGAATGTAATAAGCGTTCATGGCTGAACTCCTGAAATAGCTGTGAAAATATCGCCCGCGAAATGCCGGGCTGATTAGGAAAACAGGAAAGGGGGTTAGTGAATGCTTTTGCTTGATCTCAGTTTCAGTATTAATATCCATTTTTTATAAGCGTCGACGGCTTCACGAAACATCTTTTCATCGCCAATAAAAGTGGCGATAGTGAATTTAGTCTGGATAGCCATAAGTGTTTGATCCATTCTTTGGGACTCCTGGCTGATTAAGTATGTCGATAAGGCGTTTCCATCCGTCACGTAATTTACGGGTGATTCGTTCAAGTAAAGATTCGGAAGGGCAGCCAGCAACAGGCCACCCTGCAATGGCATATTGCATGGTGTGCTCCTTATTTATACATAACGAAAAACGCCTCGAGTGAAGCGTTATTGGTATGCGGTAAAACCGCACTCAGGCGGCCTTGATAGTCATATCATCTGAATCAAATATTCCTGATGTATCGATATCGGTAATTCTTATTCCTTCGCTACCATCCATTGGAGGCCATCCTTCCTGACCATTTCCATCATTCCAGTCGAACTCACACACAACACCATATGCATTTAAGTCGCTTGAAATTGCTATAAGCAGAGCATGTTGCGCCAGCATGATTAATACAGCATTTAATACAGAGCCGTGTTTATTGAGTCGGTATTCAGAGTCTGACCAGAAATTATTAATCTGGTGAAGTTTTTCCTCTGTCATTACGTCATGGTCGATTTCAATTTCTATTGATGCTTTCCAGTCGTAATCAATGATGTATTTTTTGATGTTTGACATCTGTTCATATCCTCACAGATAAAAAATCGCCCTCACACTGGAGGGCAAAGAAGATTTCCAATAATCAGAACAAGTCGGCTCCTGTTTAGTTACGAGCGACATTGCTCCGTGTATTCACTCGTTGGAATGAATACACAGTGCAGTGTTTATTCTGTTATTTATGCCAAAAATAAAGGCCACTATCAGGCAGCTTTGTTGTTCTGTTTACCAAGTTCTCTGGCAATCATTGCCGTCGTTCGTATTGCCCATTTATCGACATATTTCCCATCTTCCATTACAGGAAACATTTCTTCAGGCTTAACCATGCATTCCGATTGCAGCTTGCATCCATTGCATCGCTTGAATTGTCCACACCATTGATTTTTATCAATAGTCGTAGTCATACGGATAGTCCTGGTATTGTTCCATCACATCCTGAGGATGCTCTTCGAACTCTTCAAATTCTTCTTCCATATATCACCTCAAATAAGTGGTTTGCTGCCTAATTTAATTTTCTGGCGACCAACACAAGTCACACCCATTTCACTGCGTGGCTTGCTGTAGTAAATACGGTTCTGTTTACGCTCGACTTCTTCTGCCTTCTTGCAGCGAAGGCTTCCGAGTGATGCTGCTTTATCTGCTCTGACGCAACCAGAGAGCTTTAGCGCAATTTTTCGCGCCAGTGCTTCATTACTGCGTCGCTCGGCAATAAGTTCTGCTCTGCGAGCTTTGTAGCGGCTTTTTGCCGTACCTTTGGATTCTTTCCAGACAATGGTTACCATGATGGTCTCCTTTAAGTGGCTTTGGCGCATGACGCGTCGAGGTGCTTATCTTCTCGATCGCTGTCTTGTAGCTGCAATTCGCGCCATCCCCAAAACCACTCAAGTTCTGGTCTCAACGGTTAGGTTGAGAGTCCGTCGATGTTAAAGAGCCTGCCAATCTGTTCCGTTTGGCTTCCAGCGTCCTGCTGATGGCTTAAATTTAAGACTTCTTAATTTATTGGTCAAGTGCATTTTTGAAGAAAACTTAATTTTATGGGCGTGAATTTAGTTTGTCTTTGATTTTTAACGGGAAATAAAAAAGGGGCGAAAGCCCCTTAAGGAAGGTTTGCTAGCTTGGCATCAACGACAACGCCAATGATTTTACAGTTCCCATTGATTTCAATCATTGGGTATTGTGGATTGAGTGGTTTCAGGAATTTTCTACCGGCATCAATAACTAACTTTTTGAATGTCGCCTCGTTTTCTCCTTCAAGTTTGGCGACTACCAGCTTTCCATTACGTGGTTCGACTTCTGGGTCGACGAGAATAATCATCCCCTCAGGAATACTCAGTCCTGCCGGGGCAGTCATTGAATCGCCTTTAACGTCGAGCCAAAAAGAGTCTTCAGAACAATCTACCGTTGTGTCGTACCAGTTATCTATTGCACGCCTATGATATGGCTCTACAGCTTCCATCCAACATCCTGCGCTTACCCAACTAATTAGAGGATACGAACCTCTTGGATCATGCCTGCTGTGATAGGCAATGTTTGAAAGACTATCCTCTCCTTTCAACAGGTAATCAGGGGAGCACTGCAAAGCCTTGGCTAAGGCCAATAGGTTTTCGCCATTGGGCTCAGTTTCAGATCGCTCCCATTGGGAAATAGCAACATTAGACACGCCAACCATCTTGCCAAGGGCAGCCTGCCTAATCTTGAGTTCTTTTCTGCGAGCGCGAATACGCTCACCCATCAGTTGTGTATTCATAGTTAAGACATCTTAAATAAACTTGACTTAAGATTCCTTTGGTGGATAATTTAAGTGTTCTTTAATTTCGGAGCGAGTCTATGTACAAAAAAGATGTTATTGACCACTTCGGAACCCAGCGTGCTGTTGCTAAAGCACTAGGCATTAGCGATGCAGCAGTCTCTCAGTGGAAAGAAGTTATCCCAGAGAAAGACGCCTATCGATTGGAAATCGTTACAGCTGGCGCCCTGAAGTATCAAGAAAGTGCTTACCGCCAAGCGGCATAAGCAAATTGCTCTTTAACAGTTCTGGCCTTTCACCTCTATCTCCGCTCTGGTTATCTGCATCATCGTCTGCCTGTCATGGGCTGTTAATCATTACCGTGATAACGCCATTACCTACAAAGCCCAGCGCGACAAAAATGCCAGAGAACTGAAGCTGGCGAACGCGGCAATTACTGACATGCAGATGCGTCAGCGTGATGTTGCTGCGCTCGATGCAAAATACACGAAGGAGTTAGCTGATGCTAAAGCTGAAAATGATGCTCTGCGTGATGATGTTGCCGCTGGTCGTCGTCGGTTGCACATCAAAGCAGTCTGTCAGTCAGTGCGTGAAGCCACCACCGCCTCCGGCGTGGATAATGCAGCCTCCCCCCGACTGGCAGACACCGCTGAACGGGATTATTTCACCCTCAGAGAGAGGCTGATCACTATGCAAAAACAACTGGAAGGAACCCAGAAGTATATTAATGAGCAGTGCAGATAGAGCTGACCATATCGATGGGCAACTCATGCAATTATTTTGAGCAATACACACGCGCTTCCAGCGGAGTATAAATGCCTAAAGTAATAAAACCGAGCAATCCATTTACGAATGTTTGCTGGGTTTCTGTTTTAACAACATTTTCTGCGCCGCCACAAATTTTAGCTGCATCGACAGTTTTCTTCTGCCCAATTCCAGAAACGAAGAAATGATGGGTGATGGTTTCCTTTGGTGCTACTGCTGCCGGTTTGTTTTGAACAGTAAACGTCTGTTGAGCACATCCTGTAATAAGCAGGGCCAGCGCAGTAGCGAGTAGCATTTTTTTCATGGTGTTATTCCCGATGCTTTTTGAAGTTCGCAGAATCGTATGTGTAGAAAATTAAACAAACCCTAAACAATGAGTTGAAATTTCATATTGTTAATATTTATTAATGTATGTCAGGTGCGATGAATCGTCATTGTATTCCCGGATTAACTATGTCCACAGCCCTGACGGGGAACTTCTCTGCGGGAGTGTCCGGGAATAATTAAAACGATGCACACAGGGTTTAGCGCGTACACGTATTGCATTATGCCAACGCCCCGGTGCTGACACGGAAGAAACCGGACGTTATGATTTAGCGTGGAAAGATTTGTGTAGTGTTCTGAATGCTCTCAGTAAATAGTAATGAATTATCAAAGGTATAGTAATATCTTTTATGTTCATGGATATTTGTAACCCATCGGAAAACTCCTGCTTTAGCAAGATTTTCCCTGTATTGCTGAAATGTGATTTCTCTTGATTTCAACCTATCATAGGACGTTTCTATAAGATGCGTGTTTCTTGAGAATTTAACATTTACAACCTTTTTAAGTCCTTTTATTAACACGGTGTTATCGTTTTCTAACACGATGTGAATATTATCTGTGGCTAGATAGTAAATATAATGTGAGACGTTGTGACGTTTTAGTTCAGAATAAAACAATTCACAGTCTAAATCTTTTCGCACTTGATCGAATATTTCTTTAAAAATGGCAACCTGAGCCATTGGTAAAACCTTCCATGTGATACGAGGGCGCGTAGTTTGCATTATCGTTTTTATCGTTTCAATCTGGTCTGACCTCCTTGTGTTTTGTTGATGATTTATGTCAAATATTAGGAATGTTTTCACTTAATAGTATTGGTTGCGTAACAAAGTGCGGTCCTGCTGGCATTCTGGAGGGAAATACAACCGACAGATGTATGTAAGGCCAACGTGCTCAAATCTTCATACAGAAAGATTTGAAGTAATATTTTAACCGCTAGATGAAGAGCAAGCGCATGGAGCGACAAAATGAATAAAGAACAATCTGCTGATGATCCCTCCGTGGATCTGATTCGTGTAAAAAATATGCTTAATAGCACCATTTCTATGAGTTACCCTGATGTTGTAATTGCATGTATAGAACATAAGGTGTCTCTGGAAGCATTCAGAGCAATTGAGGCAGCGTTGGTGAAGCACGATAATAATATGAAGGATTATTCCCTGGTGGTTGACTGATCACCATAACTGCTAATCATTCAAACTATTTAGTCTGTGACAGAGCCAACACGCAGTCTGTCACTGTCAGGAAAGTGGTAAAACTGCAACTCAATTACTGCAATGCCCTCGTAATTAAGTGAATTTACAATATCGTCCTGTTCGGAGGGAAGAACGCGGGATGTTCATTCTTCATCACTTTTAATTGATGTATATGCTCTCTTTTCTGACGTTAGTCTCCGACGGCAGGCTTCAATGACCCAGGCTGAGAAATTCCCGGACCCTTTTTGCTCAAGAGCGATGTTAATTTGTTCAATCATTTGGTTAGGAAAGCGGATGTTGCGGGTTGTTGTTCTGCGGGTTCTGTTCTTCGTTGACATGAGGTTGCCCCGTATTCAGTGTCGCTGATTTGTATTGTCTGAAGTTGTTTTTACGTTAAGTTGATGCAGATCAATTAATACGATACCTGCGTCATAATTGATTATTTGACGTGGTTTGATGGCCTCCACGCACGTTGTGATATGTAGATGATAATCATTATCACTTTACGGGTCCTTTCCGGTGATCCGACAGGTTACGGGGCGGCGACCTCGCGGGTTTTCGCTATTTATGAAAATTTTCCGGTTTAAGGCGTTTCCGTTCTTCTTCGTCATAACTTAATGTTTTTATTTAAAATACCCTCTGAAAAGAAAGGAAACGACAGGTGCTGAAAGCGAGCTTTTTGGCCTCTGTCGTTTCCTTTCTCTGTTTTTGTCCGTGGAATGAACAATGGAAGTCAACAAAAAGCAGCTGGCTGACATTTTCGGTGCGAGTATCCGTACCATTCAGAACTGGCAGGAACAGGGAATGCCCGTTCTGCGAGGCGGTGGCAAGGGTAATGAGGTGCTTTATGACTCTGCCGCCGTCATAAAATGGTATGCCGAAAGGGATGCTGAAATTGAGAACGAAAAGCTGCGCCGGGAGGTTGAAGAACTGCGGCAGGCCAGCGAGGCAGATCTCCAGCCAGGAACTATTGAGTACGAACGCCATCGACTTACGCGTGCGCAGGCCGACGCACAGGAACTGAAGAATGCCAGAGACTCCGCTGAAGTGGTGGAAACCGCATTCTGTACTTTCGTGCTGTCGCGGATCGCAGGTGAAATTGCCAGTATTCTCGACGGGCTCCCCCTGTCGGTGCAGCGGCGTTTTCCGGAACTGGAAAACCGACATGTTGATTTCCTGAAACGGGATATCATCAAAGCCATGAACAAAGCAGCCGCGCTGGATGAACTGATACCGGGGTTGCTGAGTGAATATATCGAACAGTCAGGTTAACAGGCTGCGGCATTTTGTCCGCGCCGGGCTTCGCTCACTGTTCAGGCCGGAGCCACAGACCGCCGTTGAATGGGCGGATGCTAATTACTATCTCCCGAAAGAATCCGCATACCAGGAAGGGCGCTGGGAAACACTGCCCTTTCAGCGGGCCATCATGAATGCGATGGGCAGCGACTACATCCGTGAGGTGAATGTGGTGAAGTCTGCCCGTGTCGGTTATTCCAAAATGCTGCTGGGTGTTTATGCCTACTTTATAGAGCATAAGCAGCGCAACACCCTTATCTGGTTGCCGACGGATGGTGATGCCGAGAACTTTATGAAAACCCACGTTGAGCCGACTATTCGTGATATTCCGTCGCTGCTGGCGCTGGCCCCGTGGTATGGCAAAAAGCACCGGGATAACACGCTCACCATGAAGCGTTTCACTAATGGGCGTGGCTTCTGGTGCCTGGGCGGTAAAGCGGCAAAAAACTACCGTGAAAAGTCGGTGGATGTGGCGGGTTATGATGAACTTGCTGCTTTTGATGATGATATTGAACAGGAAGGCTCTCCGACGTTCCTGGGTGACAAGCGTATTGAAGGCTCGGTCTGGCCAAAGTCCATCCGTGGCTCCACGCCAAAAGTGAGAGGCACCTGTCAGATTGAGCGTGCAGCCAGTGAATCCCCGCATTTTATGCGTTTTCATGTTGCCTGCCCGCATTGCGGGGAGGAGCAGTATCTTAAATTTGGCGACAAAGAGACGCCGTTTGGCCTCAAATGGACGCCGGATGACCCCTCCAGCGTGTTTTATCTCTGCGAGCATAATGCCTGCGTCATCCGCCAGCAGGAGCTGGACTTTACTGATGCCCGTTATATCTGCGAAAAGACCGGGATCTGGACCCGTGATGGCATTCTCTGGTTTTCGTCATCCGGTGAAGAGATTGAGCCACCTGACAGTGTGACCTTTCACATCTGGACAGCGTACAGCCCGTTCACCACCTGGGTGCAGATTGTCAAAGACTGGATGAAAACGAAAGGGGATACGGGAAAACGTAAAACCTTCGTAAACACCACGCTCGGTGAGACGTGGGAGGCGAAAATTGGCGAACGTCCGGATGCTGAAGTGATGGCAGAGCGGAAAGAGCATTATTCAGCGCCCGTTCCTGACCGTGTGGCTTACCTGACCGCCGGTATCGACTCCCAGCTGGACCGCTACGAAATGCGCGTATGGGGATGGGGGCCGGGTGAGGAAAGCTGGCTGATTGACCGGCAGATTATTATGGGCCGCCACGACGATGAACAGACGCTGCTGCGTGTGGATGAGGCCATCAATAAAACCTATACCCGCCGGAATGGTGCAGAAATGTCGATATCCCGTATCTGCTGGGATACTGGCGGGATTGACCCGACCATTGTGTATGAACGCTCGAAAAAACATGGGCTGTTCCGGGTGATCCCCATTAAAGGGGCATCCGTCTACGGAAAGCCGGTGGCCAGCATGCCACGTAAGCGAAACAAAAACGGGGTTTACCTTACCGAAATCGGTACGGATACCGCGAAAGAGCAGATTTATAACCGCTTCACACTGACGCCGGAAGGGGATGAACCGCTTCCCGGTGCCGTTCACTTCCCGAATAACCCGGATATTTTTGATCTGGGGGAACTGCGACTGGATAGGCTCCGGTCGTATGGCCATCGATGGTCTGAAAGAAGTTCAGGAAGCGGTGATGCTGATAGAAGCCGGACTGAGTACCTACGAGAAAGAGTGCGCAAAACGCGGTGACGACTATCAGGAAATTTTTGCCCAGCAGGTCCGTGAAACGATGGAGCGCCGTGCAGCCGGTCTTAAACCGCCCGCCTGGGCGGCTGCAGCATTTGAATCCGGGCTGCGACAATCAACAGAGGAGGAGAAGAGTGACAGCAGAGCTGCGTAATCTCCCGCATATTGCCAGCATGGCCTTTAATGAGCCGCTGATGCTTGAACCCGCCTATGCGCGGGTTTTCTTTTGTGCGCTTGCAGGCCAGCTTGGGATCAGCAGCCTGACGGATGCGGTGTCCGGCGACAGCCTGACTGCCCAGGAGGCACTCGCGACGCTGGCATTATCCGGTGATGATGACGGACCACGACAGGCCCGCAGTTATCAGGTCATGAACGGCATCGCCGTGCTGCCGGTGTCCGGCACGCTGGTCAGCCGGACGCGGGCGCTGCAGCCGTACTCGGGGATGACCGGTTACAACGGCATTATCGCCCGTCTGCAACAGGCTGCCAGCGATCCGATGGTGGACGGCATTCTGCTCGATATGGACACGCCCGGCGGGATGGTGGCGGGGGCATTTGACTGCGCTGACATCATCGCCCGTGTGCGTGACATAAAACCGGTATGGGCGCTTGCCAACGACATGAACTGCAGTGCAGGTCAGTTGCTTGCCAGTGCCGCCTCCCGGCGTCTGGTCACGCAGACCGCCCGGACAGGCTCCATCGGCGTCATGATGGCTCACAGTAATTACGGTGCTGCGCTGGAGAAACAGGGTGTGGAAATCACGCTGATTTACAGCGGCAGCCATAAGGTGGATGGCAACCCCTACAGCCATCTTCCGGATGACGTCCGGGAGACACTGCAGTCCCGGATGGACGCAACCCGCCAGATGTTTGCGCAGAAGGTGTCGGCATATACCGGCCTGTCCGTGCAGGTTGTGCTGGATACCGAGGCTGCAGTGTACAGCGGTCAGGAGGCCATTGATGCCGGACTGGCTGATGAACTTGTTAACAGCACCGATGCGATCACCGTCATGCGTGATGCACTGGATGCACGTAAATCCCGTCTCTCAGGAGGGCGAATGACCAAAGAGACTCAATCAACAACTGTTTCAGCCACTGCTTCGCAGGCTGACGTTACTGACGTGGTGCCAGCGACGGAGGGCGAGAACGCCAGCGCGGCGCAGCCGGACGTGAACGCGCAGATCACCGCAGCGGTTGCGGCAGAAAACAGCCGCATTATGGGGATACTCAACTGTGAGGAGGCTCACGGACGCGAAGAACAGGCACGCGTGCTGGCAGAAACCCCCGGTATGACCGTGAAAACGGCCCGCCGCATTCTGGCCGCAGCACCACAGAGTGCACAGGCGCGCAGTGACACTGCGCTGGATCGTCTGATGCAGGGGGCACCGGCACCGCTGGCTGCAGGTAACCCGGCATCTGATGCCGTTAACGATTTGCTGAACACACCAGTGTAAGGGATGTTTATGACGAGCAAAGAAACCTTTACCCATTACCAGCCGCAGGGCAACAGTGACCCGGCTCATACCGCAACCGCGCCCGGCGGATTGAGTGCGAAAGCGCCTGCAATGACCCCGCTGATGCTGGACACCTCCAGCCGTAAGCTGGTTGCGTGGGATGGCACCACCGACGGTGCTGCCGTTGGCATTCTTGCGGTTGCTGCTGACCAGACCAGCACCACGCTGACGTTCTACAAGTCCGGCACGTTCCGTTATGAGGATGTGCTCTGGCCGGAGGCTGCCAGCGACGAGACGAAAAAACGGACCGCGTTTGCCGGAACGGCAATCAGCATCGTTTAACTTTACCCTTCATCACTAAAGGCCGCCTGTGCGGCTTTTTTTACGGGATTTTTTTATGTCGATGTACACAACCGCCCAACTGCTGGCGGCAAATGAGCAGAAATTTAAGTTTGATCCGCTGTTTCTGCGTCTCTTTTTCCGTGAGAGCTATCCCTTCACCACGGAGAAAGTCTATCTCTCACAAATTCCGGGACTGGTAAACATGGCGCTGTACGTTTCGCCGATTGTTTCCGGTGAGGTTATCCGTTCCCGTGGCGGCTCCACCTCTGAATTTACGCCGGGATATGTCAAGCCGAAGCATGAAGTGAATCCGCAGATGACCCTGCGTCGCCTGCCGGATGAAGATCCGCAGAATCTGGCGGACCCGGCTTACCGCCGCCGTCGCATCATCATGCAGAACATGCGTGACGAAGAGCTGGCCATTGCTCAGGTCGAAGAGATGCAGGCAGTTTCTGCCGTGCTTAAGGGCAAATACACCATGACCGGTGAAGCCTTCGATCCGGTTGAGGTGGATATGGGCCGCAGTGAGGAGAATAACATCACGCAGTCCGGCGGCACGGAGTGGAGCAAGCGTGACAAGTCCACGTATGACCCGACCGACGATATCGAAGCCTACGCGCTGAACGCCAGCGGTGTGGTGAATATCATCGTGTTCGATCCGAAAGGCTGGGCGCTGTTCCGTTCCTTCAAAGCCGTCAAGGAGAAGCTGGATACCCGTCGTGGCTCTAATTCCGAGCTGGAGACAGCGGTGAAAGACCTGGGCAAAGCGGTGTCCTATAAGGGGATGTATGGCGATGTGGCCATCGTCGTGTATTCCGGACAGTACGTGGAAAACGGCGTCAAAAAGAACTTCCTGCCGGACAACACGATGGTGCTGGGGAACACTCAGGCACGCGGTCTGCGCACCTATGGCTGCATTCAGGATGCGGACGCACAGCGCGAAGGCATTAACGCCTCTGCCCGTTACCCGAAAAACTGGGTGACCACCGGCGATCCGGCGCGTGAGTTCACCATGATTCAGTCAGCACCGCTGATGCTGCTGGCTGACCCTGATGAGTTCGTGTCCGTACAACTGGCGTAATCATGGCCCTTCGGGGCCATTGTTTCTCTGTGGAGGAGTCCATGACGAAAGATGAACTGATTGCCCGTCTCCGCTCGCTGGGTGAACAACTGAACCGTGATGTCAGCCTGACGGGGACGAAAGAAGAACTGGCGCTCCGTGTGGCAGAGCTGAAAGAGGAGCTTGATGACACGGATGAAACTGCCGGTCAGGACACCCCTCTCAGCCGGGAAAATGTGCTGACCGGACATGAAAATGAGGTGGGATCAGCGCAGCCGGATACCGTGATTCTGGATACCCGTACTGTCACCGTGACCGATGACCATCCTTTTGATCGCCAGATAGTGGTGCTTCCGCTGACGTTTCGCGGAAGTAAGCGTACTGTCAGCGGCAGGACAACGTATTCGATGTGTTATCTGAAAGTACTGATGAACGGTGCGGTGATTTATGATGGCGCGGCGAACGAGGCGGTACAGGTGTTCTCCCGTATTGTTGACATGCCAGCGGGTCGGGGAAACGTGATCCTGACGTTCACGCTTACGTCCACACGGCATTCGGCAGATATTCCGCCGTATACGTTTGCCAGCGATGTGCAGGTTATGGTGATTAAGAAACAGGCGCTGGGCATCAGCGTGGTCTGAGTGTGTTACAGAGGTTCGTCCGGGAACGGGCGTTTTATTATAAAACAGTGAGAGGTGAACGATGCGTAATGTGTGTATTGCCGTTGCTGTCTTTGCCGCACTTGCGGTGACAGTCACTCCGGCCCGTGCGGAAGGTGGACATGGTACGTTTACGGTGGGCTATTTTCAAGTGAAACCGGGTACATTGCCGTCGTTGTCGGGCGGGGATACCGGTGTGAGTCATCTGAAAGGGATTAACGTGAAGTACCGTTATGAGCTGACGGACAGTGTGGGGGTGATGGCTTCCCTGGGGTTCGCCGCGTCGAAAAAGAGCAGCACAGTGATGACCGGGGAGGATACGTTTCACTATGAGAGCCTGCGTGGACGTTATGTGAGCGTGATGGCCGGACCGGTTTTACAAATCAGTAAGCAGGTCAGTGCGTACGCCATGGCCGGAGTGGCTCACAGTCGGTGGTCCGGCAGTACAATGGATTACCGTAAGACGGAAATCACTCCCGGGTATATGAAAGAGACGACCACTGCCAGGGACGAAAGTGCAATGCGGCATACCTCAGTGGCGTGGAGTGCAGGTATACAGATTAATCCGGCAGCGTCCGTCGTTGTTGATATTGCTTATGAAGGCTCCGGCAGTGGCGACTGGCGTACTGACGGATTCATCGTTGGGGTCGGTTATAAATTCTGA
Protein sequences of DBSCAN-SWA_2 >NZ_CP020368|748735:770243|759207_759408_-|WP_000213975.1|DBSCAN-SWA MTTTIDKNQWCGQFKRCNGCKLQSECMVKPEEMFPVMEDGKYVDKWAIRTTAMIARELGKQNNKAA >NZ_CP020368|748735:770243|750420_753072_+|WP_001092355.1|DBSCAN-SWA MNTINIAKNDFSDIELAAIPFNTLADHYGERLAREQLALEHESYEMGEARFRKMFERQLKAGEVADNAAAKPLITTLLPKMIARINDWFEEVKAKRGKRPTAFQFLQEIKPEAVAYITIKTTLACLTSADNTTVQAVASAIGRAIEDEARFGRIRDLEAKHFKKNVEEQLNKRVGHVYKKAFMQVVEADMLSKGLLGGEAWSSWHKEDSIHVGVRCIEMLIESTGMVSLHRQNAGVVGQDSETIELAPEYAEAIATRAGALAGISPMFQPCVVPPKPWTGITGGGYWANGRRPLALVRTHSKKALMRYEDVYMPEVYKAINIAQNTAWKINKKVLAVANVITKWKHCPVEDIPAIEREELPMKPEDIDMNPEALTAWKRAAAAVYRKDKARKSRRISLEFMLEQANKFANHKAIWFPYNMDWRGRVYAVSMFNPQGNDMTKGLLTLAKGKPIGKEGYYWLKIHGANCAGVDKVPFPERIKFIEENHENIMACAKSPLENTWWAEQDSPFCFLAFCFEYAGVQHHGLSYNCSLPLAFDGSCSGIQHFSAMLRDEVGGRAVNLLPSETVQDIYGIVAKKVNEILQADAINGTDNEVVTVTDENTGEISEKVKLGTKALAGQWLAYGVTRSVTKRSVMTLAYGSKEFGFRQQVLEDTIQPAIDSGKGLMFTQPNQAAGYMAKLIWESVSVTVVAAVEAMNWLKSAAKLLAAEVKDKKTGEILRKRCAVHWVTPDGFPVWQEYKKPIQTRLNLMFLGQFRLQPTINTNKDSEIDAHKQESGIAPNFVHSQDGSHLRKTVVWAHEKYGIESFALIHDSFGTIPADAANLFKAVRETMVDTYESCDVLADFYDQFADQLHESQLDKMPALPAKGNLNLRDILESDFAFA >NZ_CP020368|748735:770243|756125_756317_-|WP_000548551.1|DBSCAN-SWA MHKASSVELRTSIEMAHSLAQIGIRFVPIPVETDEEFHTLAASLSQKLEMMVAKAEADERDQV >NZ_CP020368|748735:770243|762805_763012_+|WP_001031427.1|DBSCAN-SWA MNKEQSADDPSVDLIRVKNMLNSTISMSYPDVVIACIEHKVSLEAFRAIEAALVKHDNNMKDYSLVVD >NZ_CP020368|748735:770243|758656_759025_-|WP_000065374.1|DBSCAN-SWA MSNIKKYIIDYDWKASIEIEIDHDVMTEEKLHQINNFWSDSEYRLNKHGSVLNAVLIMLAQHALLIAISSDLNAYGVVCEFDWNDGNGQEGWPPMDGSEGIRITDIDTSGIFDSDDMTIKAA >NZ_CP020368|748735:770243|761054_761495_+|WP_084454367.1|DBSCAN-SWA MSALVICIIVCLSWAVNHYRDNAITYKAQRDKNARELKLANAAITDMQMRQRDVAALDAKYTKELADAKAENDALRDDVAAGRRRLHIKAVCQSVREATTASGVDNAASPRLADTAERDYFTLRERLITMQKQLEGTQKYINEQCR >NZ_CP020368|748735:770243|760100_760751_-|WP_001095982.1|DBSCAN-SWA MNTQLMGERIRARRKELKIRQAALGKMVGVSNVAISQWERSETEPNGENLLALAKALQCSPDYLLKGEDSLSNIAYHSRHDPRGSYPLISWVSAGCWMEAVEPYHRRAIDNWYDTTVDCSEDSFWLDVKGDSMTAPAGLSIPEGMIILVDPEVEPRNGKLVVAKLEGENEATFKKLVIDAGRKFLKPLNPQYPMIEINGNCKIIGVVVDAKLANLP >NZ_CP020368|748735:770243|754494_754776_-|WP_000026224.1|DBSCAN-SWA MSINELESEQKDWALSMLCRSGVLSPCRHHEGVYVDEGIDIESAYKYSMKVYKSNEDKSPFCNVREMTDTVQNYYHEYGGNDTCPLCTKHIDD >NZ_CP020368|748735:770243|763176_763371_-|WP_001421937.1|DBSCAN-SWA MSTKNRTRRTTTRNIRFPNQMIEQINIALEQKGSGNFSAWVIEACRRRLTSEKRAYTSIKSDEE >NZ_CP020368|748735:770243|762109_762520_-|WP_012775990.1|DBSCAN-SWA MAQVAIFKEIFDQVRKDLDCELFYSELKRHNVSHYIYYLATDNIHIVLENDNTVLIKGLKKVVNVKFSRNTHLIETSYDRLKSREITFQQYRENLAKAGVFRWVTNIHEHKRYYYTFDNSLLFTESIQNTTQIFPR >NZ_CP020368|748735:770243|763759_764305_+|WP_000453580.1|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELRQASEADLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGLPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIPGLLSEYIEQSG >NZ_CP020368|748735:770243|754967_755516_-|WP_001289873.1|DBSCAN-SWA MSEINSQALREAAEQAMHDDWGFDADLFHELVTPSIVLELLDERERNQQYIKRRDQENEDIALTVGKLRVELETAKSKLNEQREYYEGVISDGSKRIAKLESNEVREDGNQFLVVRHPGKTPVIKHCTGDLEEFLRQLIEQDPLVTIDIITHRYYGVGGQWVQDAGEYLHMMSDAGIRIKGE >NZ_CP020368|748735:770243|758419_758584_-|WP_001198861.1|protease|DBSCAN-SWA MQYAIAGWPVAGCPSESLLERITRKLRDGWKRLIDILNQPGVPKNGSNTYGYPD >NZ_CP020368|748735:770243|761526_761820_-|WP_000738491.1|DBSCAN-SWA MKKMLLATALALLITGCAQQTFTVQNKPAAVAPKETITHHFFVSGIGQKKTVDAAKICGGAENVVKTETQQTFVNGLLGFITLGIYTPLEARVYCSK >NZ_CP020368|748735:770243|754238_754406_-|WP_000545733.1|DBSCAN-SWA MHFRVTGEWNGEPFNRVIEAENINDCYDHWMIWAQIAHADVTNIRIEELKEHQAA >NZ_CP020368|748735:770243|756468_757149_-|WP_000186891.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGEDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKNAWYFANYDPRMKREGLHYVVIERDEKYMASFDEIVPEFIEKMDEALAEIGFVFGEQWR >NZ_CP020368|748735:770243|757145_757931_-|WP_000100844.1|DBSCAN-SWA MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKAAEQKVAA >NZ_CP020368|748735:770243|758307_758451_-|WP_000372937.1|DBSCAN-SWA MDQTLMAIQTKFTIATFIGDEKMFREAVDAYKKWILILKLRSSKSIH >NZ_CP020368|748735:770243|767527_767860_+|WP_001297109.1|head|DBSCAN-SWA MTSKETFTHYQPQGNSDPAHTATAPGGLSAKAPAMTPLMLDTSSRKLVAWDGTTDGAAVGILAVAADQTSTTLTFYKSGTFRYEDVLWPEAASDETKKRTAFAGTAISIV >NZ_CP020368|748735:770243|769622_770243_+|WP_001246632.1|DBSCAN-SWA MRNVCIAVAVFAALAVTVTPARAEGGHGTFTVGYFQVKPGTLPSLSGGDTGVSHLKGINVKYRYELTDSVGVMASLGFAASKKSSTVMTGEDTFHYESLRGRYVSVMAGPVLQISKQVSAYAMAGVAHSRWSGSTMDYRKTEITPGYMKETTTARDESAMRHTSVAWSAGIQINPAASVVVDIAYEGSGSGDWRTDGFIVGVGYKF >NZ_CP020368|748735:770243|756289_756472_-|WP_000149542.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLLLS >NZ_CP020368|748735:770243|768982_769561_+|WP_084454370.1|DBSCAN-SWA MTKDELIARLRSLGEQLNRDVSLTGTKEELALRVAELKEELDDTDETAGQDTPLSRENVLTGHENEVGSAQPDTVILDTRTVTVTDDHPFDRQIVVLPLTFRGSKRTVSGRTTYSMCYLKVLMNGAVIYDGAANEAVQVFSRIVDMPAGRGNVILTFTLTSTRHSADIPPYTFASDVQVMVIKKQALGISVV >NZ_CP020368|748735:770243|759486_759786_-|WP_000256575.1|DBSCAN-SWA MVTIVWKESKGTAKSRYKARRAELIAERRSNEALARKIALKLSGCVRADKAASLGSLRCKKAEEVERKQNRIYYSKPRSEMGVTCVGRQKIKLGSKPLI >NZ_CP020368|748735:770243|757936_758233_-|WP_000995451.1|DBSCAN-SWA MNAYYIQDRLEAQSWARHYQQLAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKSITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >NZ_CP020368|748735:770243|755512_755734_-|WP_000763367.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLVVQGCRTCASCQEDLELISKQRGSK >NZ_CP020368|748735:770243|753980_754199_-|WP_002414258.1|DBSCAN-SWA MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGGLLKRIRNGKKAKS >NZ_CP020368|748735:770243|748735_749818_+|WP_000805902.1|DBSCAN-SWA MKPVTLYDVAEYAGVSYQTVSRVVNQASHVSAKTREKVEAAMAELNYIPNRVAQQLAGKQSLLIGVATSSLALHAPSQIVAAIKSRADQLGASVVVSMVERSGVEACKAAVHNLLAQRVSGLIINYPLDDQDAIAVEAACTNVPALFLDVSDQTPINSIIFSHEDGTRLGVEHLVALGHQQIALLAGPLSSVSARLRLAGWHKYLTRNQIQPIAEREGDWSAMSGFQQTMQMLNEGIVPTAMLVANDQMALGAMRAITESGLRVGADISVVGYDDTEDSSCYIPPLTTIKQDFRLLGQTSVDRLLQLSQGQAVKGNQLLPVSLVKRKTTLAPNTQTASPRALADSLMQLARQVSRLESGQ >NZ_CP020368|748735:770243|760831_761017_+|WP_000276885.1|DBSCAN-SWA MYKKDVIDHFGTQRAVAKALGISDAAVSQWKEVIPEKDAYRLEIVTAGALKYQESAYRQAA >NZ_CP020368|748735:770243|766198_767518_+|WP_000123343.1|DBSCAN-SWA MTAELRNLPHIASMAFNEPLMLEPAYARVFFCALAGQLGISSLTDAVSGDSLTAQEALATLALSGDDDGPRQARSYQVMNGIAVLPVSGTLVSRTRALQPYSGMTGYNGIIARLQQAASDPMVDGILLDMDTPGGMVAGAFDCADIIARVRDIKPVWALANDMNCSAGQLLASAASRRLVTQTARTGSIGVMMAHSNYGAALEKQGVEITLIYSGSHKVDGNPYSHLPDDVRETLQSRMDATRQMFAQKVSAYTGLSVQVVLDTEAAVYSGQEAIDAGLADELVNSTDAITVMRDALDARKSRLSGGRMTKETQSTTVSATASQADVTDVVPATEGENASAAQPDVNAQITAAVAAENSRIMGILNCEEAHGREEQARVLAETPGMTVKTARRILAAAPQSAQARSDTALDRLMQGAPAPLAAGNPASDAVNDLLNTPV >NZ_CP020368|748735:770243|767915_768941_+|WP_000063280.1|capsid|DBSCAN-SWA MSMYTTAQLLAANEQKFKFDPLFLRLFFRESYPFTTEKVYLSQIPGLVNMALYVSPIVSGEVIRSRGGSTSEFTPGYVKPKHEVNPQMTLRRLPDEDPQNLADPAYRRRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFDPVEVDMGRSEENNITQSGGTEWSKRDKSTYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETAVKDLGKAVSYKGMYGDVAIVVYSGQYVENGVKKNFLPDNTMVLGNTQARGLRTYGCIQDADAQREGINASARYPKNWVTTGDPAREFTMIQSAPLMLLADPDEFVSVQLA |
30 | Enterobacteria_phage(90.0%) | capsid,protease,head | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
854681 : 927994
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP020368|854681:927994|DBSCAN-SWA ATCACACTTCTAGCGTTGCGAGGGGGTTGAATCTCAGAGCCGTCTCAAGATGATCGGGTGCTAAATGGGCATAGCGCATAGTCATTTTTATATCGTGATGTCCGAGGATTTTTTGCAAAGCAAGGATATTTCCACCCGACATCATGAAGTGCGCCGCAAACGTATGGCGCAGAACGTGTGTCAGTTGACCGCGAGGGAGCACGATAGACGTTTTTTCCATCACGGATAAAAATTGAAAATAGCAGTCTGTAAAGAAATTGAACCCATCAAGCGCCATGATCTCTTCGTAAAGCTCTTTACTGATAGGGATGCTTCTGTTTTTCTTCCCCTTCGTTCTTACAAAGGTAATTCGGTATTTGGTCACTTGCGAGCGGGTAAGATTTATTGCTTCTCGCCAGCGTGCGCCTGTGCTTAGGCATATCTTGACTACCAGTGCCAGAATTGGGTCCTGACGTTTGCAATCAGCCAGCAGTTCAACAATCTGCTCATGGGTAAGCCATGCCATCTCTTTTTCTGCGATGGTGAATTTTCGCATGTTCTCCAGTGGGTTCGGATACGACCATTCGCCCAGGCGGGATAGTTCGCTAAAAACACTACTTAGATAGCTTTGCTCCAGGTTAATGGTGACCGGGCTTGCTCCTTTCTTCCATTTCTCGCTGAAGTAGATCTCACCTGTCAGGCGTTTATCTCGATAGTGGGCAAACATTTTAGAGGTTAGATCAGTTGCAAGGGGATTGCCCAGAGCGTCAACCATCAACAGCAGTTTGTCATAGACATGCTGCCCAGCTGTCAGAGATTTACCATGTAGTTTGAACCATAGCTCAATCACGTCTTTCAATGTCCGACGATCCACTGATTCGCCCAGCCAGGGCTTTGCTTCGGTTTCTTCCATCGTGTGACGCTCAAAAGCCAGAGCTTCGCCTTTGGTGGCGAATTGTTTACGCACACGACGTCCACTACGTCCGGCGGGGTAACATTCGCAAAGCCATTTTCCTGTGGTGAGTTTTCGTACTGCCATAAAAAATGCCCTCCAGTAGAGAGCATTTTTACTGTATGTATAACCAGTGTCAATGTATGAAATCCTGCGACCATACATCTCACTGAAGCCATAATGAAGTTAGCTATTTTTTGCTATGTGAGTATGTGACTTTTGCGGTTAGCCTGCGGCTCATTGTTATATTAGGCGCAGATATAAAAGCAAAATTTATCGCGAGTTTTTAGTACAGATTTTTTTTGATTTACTAATAGTTCCATCATTGCAAACGAACTTTCCATCGGAGGTACAGTGAGAGACACCTCCCTTTTTCCCAGAACAAGGATAATTTTTAGCATAGGTAGTTAGTGGGTTTAATAACAAAGAGCATGATAAAACCACAAAAAATACCTTACCAAGCATAGTTTCCTCCCGGTATTACCTAACGTACTTAATTGTTAAACTTATAATTTTCCCAATTATTTCAACATCTTCTATCTTGCACTCGAAGGCTCTATTTCCACCCTCGACGAAGATTCTTCCACCGGGTAAACGAGTAATGTCACGGATCGTTATTTCGCCATCAATACTTATTACCCATTTACCATCACGTATATCATCAAATTCCTTATCACAAATAAATTCAGAATTATTATCTGTGATTACAAAAAGATTCTTGAATGCCGACGGTAGAAATTCTCTATCGAAAATATAAAAACCGTCTTCACACAAGGCCCCATCAGATAATACATATTTAGCAACTTCCATAGTATTTGTATTACCTGAAGTTTGCTTTGAACCATGCCCGGTTGTGAGCCAATTAAGCGAGGTGCCTGTTTCAAGGGCGCACTGGATTACCCATTCTGCTGGGAATGAGTCACGCATGTAGCGTGTGGCGAGTGTACTTTTAGAGATTCCTAAATGATCGCACAACGCCTGTCGAGTCTTGAATCCATAAGCTTCTACCATGCGCTCTATAGCGCCTCGTCCGCCTTTCTCCAAATTCATGGTCACTCCAAGTGAACTTTTATCTTGACGATTTCACTGTGCGATCGTATGTTTATGGTGTTCACAAAATACAAACGATCCGTATTCGTCCTGATTAATCATCATTAAACGAGGAATGTTGCATCATGAGACCTAACATTTCAATCACTCTTACCACCCCTCATGTGACTATTGAACGCTATAGCGAGCTGACAGGGCTATCCATCGATACCATCAATGACATGTTGGCTGATGGACGCCTTATCCGTCACCGACTGCGCAAAGATAAAAAACGCGAAAAAGTGATGATCAACATAGCAGCAATGACCGTTGATGCGCTTTCAGAATGCAATCTAAACCTTAATTAGTTCGATTCTGAAATACATCAGAGGCATTGACCATGTTTGATTACCAAGTTTCCAAACATCCACATTTTGATGAAGCCTGTCGTGCATTCGCATTGCGCCACAACCTGGTGCAACTGGCAGAACGTGCTGGCATGAATGTGCAGATTCTGCGGAACAAGTTGAACCCAGCGCAACCTCATTTATTAACCGCACCAGAAATCTGGCTGCTTACCGATCTGACTGAAGATTCAACGCTGGTAGATGGTTTTCTGGCTCAGATTCACTGCCTGCCATGCGTACCGATTAATGAGGTGGCAAAAGAGAAACTGCCACATTACGTCATGAGTGCAACCGCAGAGATCGGGCGTGTTGCTGCAGGTGCGGTAGCTGAGCCGCATCATGACGGTACTGTGCACTGGCATCTCATGTGTTTCATGCGTAAAAAAGACCGCCGCGCCATCACTGCATTACTGCGTAAGTTTGCCATCCGTGAAGACCGCGAGGAGCTGGGCAATAACACGGGACCACGCTTTAAGTCTGAGCTGATAAACCCGCGCAAAGGTACGCCAACAAGCTATATCGCGAAATACATCAGTAAGAACATTGACGGGCGTGGTCTGGCTGGGGAGATCAGCAAGGAAACGGGTAAATCCCTGCGTGATAACGCTGAATACGTTAATGCCTGGGCGTCTCTGCATCGTGTTCAGCAATTCCGCTTCTTTGGCATTCCGGGGCGTCAGGCTTACCGTGAACTGCGATTGCTGGCTGGTCAGGCGGCAAGGCAACAGGGTGACAAAAAAGCAGGTGCGCCGGTACTGGATAACCCGCGTCTTGATGCAATCCTGGCTGCTGCTGATGCTGGTTGTTTTGCCACCTACATCATGAAGCAGGGCGGCGTACTGGTTCCCCGCAAATATCACCTCATCAGAACTGCTTATGAAATCAACGAAGAACCGACCGCATATGGCGATCACGGTATTCGTATTTATGGCATCTGGTCACCCATTGCAGAGGGCAAGATCTGCACTCATGCGGTGAAGTGGAAAATGGTTCGTAAGGCCGTTGACGTTCAGGAGGCGGCAGCCGACCAGGGCGCTTGCGCCCCTTGGACTCGTGGCAATAACTGTCCCCTTGCTGAAAATTTGAACCAACAAGGGAAAGACAAATCAGCTGATGGGGACTCCAGAACGGATATTACCCGTATGAATGACAAGGAGTTGCACAATTACCTGCACAGTATGAGCAAAAAAGAGCGCCGGGAACTGGCTGCAAGGTTACGCCAGGTGAAACCGAAACGGCGTAAAGACTACAAACAGCGAATTACAGACCATCAGCGACAGCAGCTCGTCTATGAACTGAAGTCCAGGGGATTTGATGGCAGCGAGAAAGAAGTCGATTTGCTCCTTCGCGGCGGCAGTATTCCGTCAGGAGCAGGCCTGCGTATCTTCTATCGGAACCAGCGTCTGAAGGAAGATGATAAGTGGCGGAACCTGTATTAATTACGCGGGTTGACAATTCGTGCTCTTAATAATACCAGGCATATCAGGCCGATGAACGTAAAAAAACGTTTTACATCAGTAAGATTATTATATACTGTAAATATAAACAGTGGTTATGTATACAGTATTGTTTTGGTGTCATAGGAGGAAAGATGCAGGACTATTTTTTGGAGTCTTTGAAGCTCCAGCGCATTGATTTTTTTCTTAAGCTTGTAGCGGCTAGTGAGTGTAGTGATGAAGAGAAGGGGCTGGCTCTGCAGTGGGTTTCTGAATTGACTGATGAACTCATGGCAAAAATCAGAAGCCACGAATACAACCGCTCAATGGATGTCATCAGCTGAGGTGACTTTTATGCGCATTGAAATAATGATCGATAAAGAGCAGAAGATTAGCCAGTCTACCCTGGACGCCCTTGAATCCGAGCTTTACCGCAATCTGCGCCCCCTGTATCCCAAAACGGTAATTCGCATTCGCAAAGGTAGCTCTAACGGTGTGGAACTAACCGGACTGCAACTGGATGAAGAAAGAAAACAAGTGATGAAAATTATGCAGAAGGTGTGGGAGGACGACAGCTGGCTGCATTAAGAAATGTTGCCCCCAGGAGGATTCATTCTGATGGGGGCTAGTTTGGGCAATGAGTGAAATAAGGCGTAAGGTGGGCGGTTATTTTGATAAGTGATCGTCCGCTTTGTGTCAGAAGCAGAAGTGGGAGTGTCTGAGACTCTCTAAAAGTTGATGATTCACTTACAAAACCATTTTCCTGATGGATGCTTACACTTACGAATGATCATTCTGTTTAGTCTTCATGAGAAAATCCCGAATCTTTGCCAAGTTTGAATTTTCGGGGATTGAATCCATGTACGGATCACGCTCAAACACTAATTGCTCAGATTCAACGAAAGCCTGCCACTCTACTGGGTCTAGCTCAAATCCCATTGCTGTCATATCACTTTCATCTTCCCCCTCGATCCAGTGGACTAAATAAAACAATTCATCATCAGGAGAATCATCCTCACCATCAACAATATCAACATCGGTTACAGTGATACGTAGGGTGGGATCGATTTTTGAGACATAAATTCCTAACTGTGGTATAGGGGGAATATTGTTCATGGAAACTCCTTTTATTGGTGTTTGCTATTTGAATAAAAATACGCCAAACAGAGTGTTAACTGTGAATATGGAAAGAATCGTTGTAAGTACCCCGCTTAGGTGTCCCATTCTATTTTAGAGGTTCTCGTGCATATGGATTATGTTATCTGAAAGTTAACTTCCGCTTCCCGTTCACAGCGAACATTCATCTTTGTAAACCCGTATGATCCGTTTTTCAAGTGGCCATTCAGATACGGATTTTCACTTCCTTGACAGTGCATGACTATGCTGCATGAAATCGCATGATCGATTGAGGATCGTCTTTGCTCAGATCCGCCAGAACTGGCGGGCTTTTGCTCATGTTATGCATGTGCATGAAAACCACTGCATAAAGCGGGCAGGCGTGGCGGGGATACGAGCGCGCGCCATGTGGTATGGAGATTGGATCTATTCATAACTTGATGTATAAAGTAGAAAAAAAAGCGGGGAGATTATGAATAAAAAATTTACCGATGAGCAGCAACAACAGCTTATAGGACATCTCACAAAGAAAGGCTTCTATCGAGGAGCTAATATTAAAATAACCATTTTTCTATGTGGTGGTGACGTTGCTAATCATCAATCTTGGCGTCATCAATTATCACAATTTTTAGCAAAGTTCAGTGATGTTGATATATTTTATCCAGAAGATCTATTTGATGATCTTTTGGCTGGTCAAGGGCAGCATAGCCTTTTAAGTTTAGAAAATATTCTGGCTGAAGCTGTCGATGTAATAATTTTATTTCCTGAAAGTCCGGGGTCTTTCACAGAGCTTGGTGCGTTCTCTAATAATGAAAACTTAAGGAGAAAGTTGATTTGCATTCAAGATGCAAAATTTAAATCAAAACGTAGCTTTATTAACTATGGTCCTGTTCGCCTGTTGCGTAAGTTTAATTCAAAATCTGTTTTGCGTTGTAGTTCAAATGAACTAAAAGAAATGTGTGATTCATCTATTGATGTTGCCAGAAAATTACGATTATATAAAAAATTAATGGCATCTATTAAGAAGGTTAGGAAAGAAAATAAAGTATCAAAAGATATTGGAAATATATTATACGCAGAGCGGTTTCTATTGCCTTGTATCTATTTACTGGATAGTGTCAACTACCGCACACTGTGTGAACTAGCTTTTAAAGCGATAAAGCAAGATGATGTTTTATCTAAAATTATTGTTAGATCCGTTGTTTCTCGTCTAATAAATGAACGAAAAATACTTCAAATGACTGATGGTTATCAGGTCACTGCTTTGGGGGCTAGCTATGTTAGGAGCGTCTTTGATAGAAAGACACTTGACCGATTGCGGCTTGAGATTATGAATTTTGAAAACCGTAGAAAATCAACATTTAACTATGATAAGATTCCGTATGCGCACCCTTAGCGAGAGGTTTATCATTAAGGTCAACCTCTGGATGTTGTTTCGGCATCCTGCATTGAATCTGAGTTACTGTCTGTTTTCCTTGTTGGAACGGAGAGCATCGCCTGATGCTCTCCGAGCCAACCAGGAAACCCGTTTTTTCTGACGTAAGGGTGCGCAACTTTCATGAAATCCGCTGAATATTTGAACACTTTTAGATTGAGAAATCTCGGCCTACCTGTCATGAACAATTTGCATGACATGTCTAAGGCGACTCGCATATCTGTTGAAACACTTCGGTTGTTAATCTATACAGCTGATTTTCGCTATAGGATCTACACTGTAGAAAAGAAAGGCCCAGAGAAGAGAATGAGAACCATTTACCAACCTTCTCGAGAACTTAAAGCCTTACAAGGATGGGTTCTACGTAACATTTTAGATAAACTGTCGTCATCTCCTTTTTCTATTGGATTTGAAAAGCACCAATCTATTTTGAATAATGCTACCCCGCATATTGGGGCAAACTTTATACTGAATATTGATTTGGAGGATTTTTTCCCAAGTTTAACTGCTAACAAAGTTTTTGGAGTGTTCCATTCTCTTGGTTATAATCGACTAATATCTTCAGTTTTGACAAAAATATGTTGTTATAAAAATCTGCTACCACAAGGTGCTCCATCATCACCTAAATTAGCTAATCTAATATGTTCTAAACTTGATTATCGTATTCAGGGTTATGCAGGTAGTCGGGGCTTGATATATACGAGATATGCCGATGATCTCACCTTATCTGCACAGTCTATGAAAAAGGTTGTTAAAGCACGTGATTTTTTATTTTCTATAATCCCAAGTGAAGGATTGGTTATTAACTCAAAAAAAACTTGTATTAGTGGGCCTCGTAGTCAGAGGAAAGTTACAGGTTTAGTTATTTCACAAGAGAAAGTTGGGATAGGTAGAGAAAAATATAAAGAAATTAGAGCAAAGATACATCATATATTTTGCGGTAAGTCTTCTGAGATAGAACACGTTAGGGGATGGTTGTCATTTATTTTAAGTGTGGATTCAAAAAGCCATAGGAGATTAATAACTTATATTAGCAAATTAGAAAAAAAATATGGAAAGAACCCTTTAAATAAAGCGAAGACCTAATGGTCTTCGTTTTAAAACTAAAGCTCATAGGTTGAAAAATTGAGCACTTCTTCGTCCAACCAGTTATTTAGTTCCTGCAATCGTTTCTGCAGGGGCATCAATTCGTTTCTTACGAATACCTTGCTAGCCTTCTCCACATCCCCAAACCCCCCGACATTATTAGGCATAATTCCCATCATTTGCGGCGGCACGCGGTGTGCCGCCATCATGTCATCACGGCTCACGTTCTTGATATTCAGAAACTCATCCTTCGCTGTGACTTCTGACAACGGGATAATCTGAAGTCCGTCTTTTTTACCATTAGGCGAGTACATAAACAGGTTGCGGAAGTTGCCAGGGCCTTTGGCACTTTTCATCGCATTGCGGAGGTTGTTCACATCTTCCTGGTTCTGCGCGGCATCGGTCATGTACATGATGAAGCCTGCATGACTACCGTTAATGTAATACTTTCGGCGGAACAGCGTGGCGGACTCGTTGAGCAGGGCGGATGGGATGGCAGAAAGATAGCCGGGCAGGCCGTAGATCTCCTGGTTGATGTCCGGTTCCATCAGATGAAAAATGCTGCCTTTCGTGAACTGATACGGCTGCGTGGTCATGCCGTATTGCACAAACCAGTAGGTATCTAGGTCTAATCCGCGTCGGGTGTATTTTGCCAGCGCAGGCTCAAGGGCGATAACTTCACCGAATCGGTTCGTGCGTTTCTCCAGGTAGGCGTTACCAAAAACCAGATAGTCCTGCACAAAACGTGAAAACGCCTGCTGGCTGAGCAGCGGGTGAGGGATGTAGGTGCTGGTCAGAATGTTGCACTTTACTGCAATCGGTGAACTGTGGTGTACGGCGGCGCGGAATGTTCGCGCCAGTCCGTCAAAACTTACGGGTGGCTCATACCAGCGGTCCATCTTTACGCATTCCACATAGTCCAGTAATTCGCGGCGGTCCAGAACAGGAACGGGATCACCGAAGCTGAATGCTTCGGCTGAAGTTTGGTTTTTATGCTGGATCTGGTTCGTCGCCGCAGCGCGGTTTTTCTTACTCTTTCCCATCAAAAAATCTCCACAATATTGCTGGTATTGGCGGACTCGCCCTGCAGCGGTTCGTTAAACAGTGCGTGCATTGTTGCCCAGGCCAGATCGGCATGGCTGGCTTCTTCGCTGCGGCTGGCTTCATAGGTTGGGCGGTTGCCGCTGGCGGTGGTGGCGCGACGGATTGCCATAAATGACTGCGCTATGTCGGTGTGTCCGGCGTCAAACTCCAGACGACGGTGACTGATAATGTCGTAGGCCTTGAGTACCAGGGCGTTTTTAACGTTGGGGTTGTAGACAAACTCCCGGACGGCTGGAAAAAACGCTTTCACGTTCTCGTAAACCCCGTGGCCGACACCTGTCGAGTCGATGCCGATGTAGGTCACGTTGTACTGCTCGGTCAGTTTTTTGATGGCGTCAGCCTGGGCGCGGAAGTCCATCCCGCGCCACTGGTGACGCTCAAGAATGCGGAACTTACCACCCGGCACGGCTGGCGGTGCCACCACCACGCATCCGGCACTGTCGCCGTTCTGCGTACCTTTTGCCGGGTCATAACCGATCCACACTTCGCGCCAGCCAAACGGGCGCAGGGCCAGTGCATGAAAGTCGGTCCAGATTTCCCAACTGTCCACCATGCACGCTTGCAGCTCGCTGAGCGGGAACACGGACGCGAGATCGTCCACGAACTCGCACATCAGCAGGTTCTGGTATTCGTCCGGGCTGTACTCCATGCGTAGCTGGTCGAGGTCGAACAGGTTACAGCCGCCGCGCACCGCATCTTCCACGGTGACTATCTGGCGGTATTGCCCGTCTGTGCACAGCAGGCCGGGGGCCAGATTGCTGTGGGACAGGTCGATGTCCACCTTGTCGGCTTTGTTGCGCCCACGGTTGAACAGCGCACCGGACCAGAACGGATAAGCACTGTGTGTCAGGCTGGATGGCGTGGAAAAATAGGTTTGTCGCCATTTTTTGTGAATAGCCATACCGGAAGCCACTTTGCGCAGCTCTTGGAATTTCGGTATCCAGAAATATTCATCCAGATACAGGTTGCCGTGGTAACTCTGGGCCGTGCGGGCATTGGTGCCGAGGAAGTACAGCGTGGCCCCGTTAGGAAGCACTATCGGATCGCCTTTCAGCTCCACCTCCACTTCTTTGGCGAAGTCGATGATGTACTGCTTAAAGACGTGGGCCTGTGCCTTGCTGGCGGAAAGGAAAATCTGGTTACGCCCGGTTAGCAGGGCGTCAATCAGGGCTTCACGGGCAAAATAGAAGGTCGCGCCGATCTGGCGTGACTTCAGCAGGTTGCGGATGCGGTTGGTTTTTCCGGCTTCCCACCAGTGGCGCTGGTAGTTGAACATGGAGGAATGGAAGATTTCTTCCAGCTTCTCAATCTGTTCATCGGTGAAAACGTTCTTTTCCGGCTGACGACGCGGGCCTTTGTTGCGGTTGGCGACGTTAGGGTTTAAGTCGGCTTCGTTGCCGCCATTGTTAAACTTGCCGATCCGCGCATGGCGCTCCGACTGGCGCGCCAGCAGGTCAATCTCTTTGAAATCTTTCCCTTCTTTGTGCTCCTTCATAATGAGCTGGCAGTAGCGTGCGGCGGTGGTGAGCTGCATCTGATCCAGCGGCCCATAGTCACCCCACTTGTCGCGTTTTTTCCAGCTGTGAACGGTTGCAACTTTCTCGCCCAGCATTTCAGCAATGCGGGCTACGCGGTATCCCTGAAAGTACAGCAGCATGGCCTGCCGACGGGGATCGAGATCTGCGGGTGTCAGTGTGGTGTTCATGGCACAAACCTACAGCCTTGAATGAAGGCTTTCCCCGCCTGCGGTTTGTGTGGTTGTCGGTACAAATACCGCGCATTGTTTCACTACCCCCATCACCGCAACCATAAGGCTCCAGTAAGTTTTTTTTAACGGAGCACGGCTCATGACAGTGAAAGCAAAGCGTTTTCGCATCGGGGTGGAAGGTGCCACCACCGACGGACGCGAAATCCAGCGTGAATGGCTGGAACAGATGGCAGCCAGCTACAACCCGGCGGTGTATACCGCGCTGATTAACCTTGAGCACATCAAGTCTTATCTGCCGGACAGCACCTTTAACCGCTACGGCAAGGTGACGGCGCTGTTTGCTGAGGAAATCACGGAAGGTCCGCTGGCAGGCAAGATGGCGCTGTATGCCGACGTTGAGCCAACGGAGTCCCTGGTGGAACTGGTGAAAAAAGGCCAGAAATTATTCACCTCTATGGAAGTCAGCCCGAAGTTCGCTGATACGGGCAAAGCCTACCTGGTCGGCCTGGCTGCCACTGATGACCCTGCCAGTCTGGGCACTGAAATGCTGACATTCAGCGCCAGTGCAGCCCATAACCCGCTGGCAAACCGCAAGCAGAATCCTGCCAATCTCTTTACCGCTGCAGAGGAAACGGTGATCGAACTGGAGGAAGTCCAGGACGACAAACCGTCCCTGTTTTCCCGCGTCACGGCGCTGTTCACCAAAAAAGAGCAGTCCGATGACGCCCGGTTCTCTGATGTGCATAAGGCCGTGGAGCTGGTCGCCACTGAGCAGCAAAACCTGAGCGCGCGAACCGAAAAATCTCTGTCTGAGCAGGAAGAACGCCTGTCTGAACTGGAGTCTGCTCTGCAGAAGCAGCAACGGGCAATGCCTGTAGTTCCACACTGGTGAAATTGCTCAATCAGCGGCGCTGGGCAGATGCGTGCCGACAGTTGCCGCGCTGGGTTTATGTAAAAGGTGTGTTTAATCAGGGGCTGGATAACCGCCGTGCGCGGGAGATGGCCTGGTGCCTTAAAGGAGCTGGACTATGACGCGTGCGCTGGCAGTAGTGGTGGCGCTGGCACTCGTTGCGCTGGGCTGGCAGTCGTGGCGGCTTAACAGCGCCAGCCACACCATCGAAACGCAGCGCGCGGCGCTGAAAAGTAAAGCGCAGGAACTGACGAAGAAAAACAGCCAGCTTATCAGCCTGTCCATTCTGACTGAAACCAATAACCGGGAGCAGGCGCGGCTCTATGCCGAAGCAGAACAGACCAGTGCACTGCTGAGACAACGACAACACCGGATCGAGGAACTGAAACGTGAGAACGAGGATTTACGCCGCTGGGCTGATACTCCTTTGCCTGCTGACATTATCCGGCTGCGGGAACGTCCGGCACTCACCGGAGGTACAGCTTACCGTCAGTGGTTGTCCGCGAGTGACGCCGTGTCGGCTGGATCAGGCAACGCCGCGCACTAACGGTGATCTGAACGCGTTGCTGGATGAAACGGAGGCCGCCTGGGCGGTCTGTGCAGACAAAGTGGACATGATTATTGCGTGTCAGGAGCGAAACAGTGAACAAACCACAATCCCTGCGCCACGCCCTCAATAAAGCGGTGCCTTATGTCCGCAATAACCCGGACAAACTGCATCTGTTTGTGGATAACGGTTCGCTGGTTGCCACGGGGGCCAGCTCCATGTCGTGGGAGTACCGTTACACCCTGAACGCGGTGATTGAGGATTTCAGCGGCGACCAGAATCTGCTGATGGCCCCGGTTTTGCTGTGGCTGCGGGATAACCAGCCCGATGCCATCAATAACCCGGCGTTACGGGAAAAACTATTTACCTTTGAGGTGGATATTCTGCGCAACGATGTCTGTGATATCAGCCTTAACCTGCATCTGACGGAACGTGTGCTGGTCAGCACTGACGGCAGTGTGTCGAGCGTTGAAGCTGTAGCAGAACCCGATGAACCTGAAGAAATGTGGACGGTGAAACGTGGCTGAACTGCAGAAGGTGGACGACTGGCTTAGTGCCTTACTGGCGAATCTGGAACCAGTCGCAAGAAGCCGCATGATGCGCCAGCTGGCGCAGGAACTGCGCCGGACACAGCAGCAGAATATCAGGATGCAGCGCAATCCAGATGGCAGCAGTTATGAACCGCGCAAGGTCACGGCGCGCAGTAAAAAAGGCCGCATCAAACGTCAGATGTTTGCAAAGCTGCGCACCACCAAATACCTGAAAACCGCCGCCAGCGCCGACTCTGCCAGCGTACAGTTTGAAAGTAAGGTGCAGCGCATTGCCCGCGTTCACCATTATGGCCTGCGCGATCGCGTCAGCCGCAAAGGACCGGAAGTGCGTTACGCAGAGCGTCGTTTGTTGGGTCTTAATGGTGAGTCTTACGTTCTAACTCGTGATATATTGAATAGATTCCTTCTGTCTTGAATTCACCATATGCCCAATGAATTGCATTTTTTTGTTCAGTCAGAATATATTTCCTAACTTTTTTAGAATAGTGAATGGTCAGTATAAAACCCAAAGAAATCCATAATAGAATAAATATGGTTGTTGGGGCAAAGGAATAGATAAAGAATTTGTCTTTAACGAAATTCTTGACTTCTTCATCAATGTAAGTATTCCCTTCATCAGTGGTTAGTAGTTTACAAAGAATTTCTTTATGTTCGATAGTTAATGATTTGTACTTCTCTGTACTGGCAGAGGATGACTCTTCGCAATCATCTTTTGATATTTCCCAATCAGGAGTGGTTAATATTGTAGCTGCAGTCGCCAGTTTTTTTGACATATAAAATGATTCACCGGACTTTATCCAATAAAGCAATGTGCTATTTGGTGGGGTTCTAATGGCTTGGATGAAAACTGGACCAAAAATATAAATTGCACACAAAAAAAGAAGTGAAAAGAGAACCGTGATTGATTTTTCAGCCCATGTGATAGTAGGCAGCCTTTTTAATGATATTCGTATAAGAAAAAAATTTAATATAAACGCAGGTATTCCTGATTTTTTCAAAATTACATCATGAGATTTTCTATATTTGAATAGTGCTCTTATAACTATCGTCACCAAAGTTATAACAGCTCCAATAATGGAAATCCATGCGGCAAGTGCCGCTGCCGGGGTGTTTAAAAATTCAGCATTAAATATATTAGAACTTATTGGATTCATAACACTCATTTATCCACCTATTGGTTGTTTTTTATCCTTTTATCAATTTGTACCATCGATACCACAATTTGCAAGAAGTTATAAACAAGTTTTCAGCTGTCATCATATATCTATGAACGCACAACTAACCGAAATCATGCGCCTTATCACCAATCTGATCCGCATAGGTGTAGTCACCGAAGTGGACCGGGAAAACTGGCTTTGTCGGGTGAAAACGGGCGACCTTGAAACCAACTGGATCAGCTGGCTGACGCTGCGCGCGGGTAATGCCCGCACATGGTGGCGACCATCGGAAGGTGAGCAGGTGGTGCTGCTGAGTCTGGGCGGCAATCTGGAGACTGCCTTTGCGCTGCCCGCTGTCTATTCGAATCAGTTCGCACCACCGTCGACGTCGGCGGACGCCTGCGTGACAGAACATCCTGACGGTGGCTGGTTTGAATACGAACCCGCCACCGGGCGCTGGTATGTCAGGGGCATCAAATCAATGGTCATTGAGGCCGCTGATAACATCACCATGAAAACCAGTGAGTTTGTACTGGAGGCTGACCGCACGCGCATTAACAGCGAAGTGGTGATCAATGGTGGCGTTACCCAGGGCGGCGGAGCGATGAGTTCTAACGGGATCGTGGTTGATGCGCATCAGCATACTGGCGTCCTGAAAGGCGGCGATACAACCGGAGGCCCGGTATGACGCTTTATAGCGGGATGAACAATACCAGCGGCAAAGTCATTACTGATATTGACCATCTGCGCCAGTCGGTGCGGGACATTCTGCTGACGCCGCAGGGTAGCCGCATTGCTCGCCGTGAATATGGTTCCCTGCTGTCGGCACTGATAGACCAGCCACAAAATCCGGCGTTACGCCTGCAGGTCATGTCGGCAGTGTATGTGGCGCTGAGTCGCTGGGAGCCACGGCTGACGCTGGATTCCATCACCATTAACAGCAATTTTGACGGTTCAATGGTGGTGGGACTGACCGGGCGGCGTAATAACGGTGTGCCTGTTTCCCTTTCCGTATCAACAGGAGCAGAGAATGGCAGTGATTGACCTTTCGCAGTTGCCTGCGCCGCAGATTGTCGATGTGCCGGACTTTGAGACGTTGCTTGCCGAACGCAAGGCCGAATTTGTTGCGCTTCATCCGAAAGATGAGCAGGAAGCCGTGATCCGCACGCTGGAACTGGAATCTGAACCCGTCACCAAATTGCTGCAGGAGAATGCTTACCGTGAGTTGCTTCTGCGCCAGCGCATTAACGAAGCCGCGCAGGCGGTGATGGTGGCTTACGCGATGGGCGGCGATCTTGACCAGCTCGCTGCCAACTACAACGTGAAACGCCTGACGGTGACGCCTGCTGATAATGACGCTGTGCCGCCCGTTGCGGCTGTGATGGAAAGCGATGAAGCGTTACGCCTGCGTGTGCCTGCAGCCTTTGAAGGGCTTTCAGTTGCGGGGCCAACTGCAGCCTATGAATTTCATGCCCGAAGCGCCGACGGTCGGGTGGCGGATGCCAGTGCAACCAGTCCGGCACCTGCAGAGGTGGTACTGACTGTCCTTAGCCGTGAAGGCGACGGAACAGCAGAAAAAGACCTGCTGGATGTGGTGGAAAAAGCCCTGAACAGTGAGAACGTCCGCCCGGTGGCTGACCGTCTGACGGTTCGCAGCGCAGAAATCATCCCGTATCGCGTGGAAGCCACCATTTTTCTCTATCCGGGACCGGAAGCAGAGCCGGTAATGGCAGCGGCAAAAGCCAGCCTGCAGAAGTACATCGCCAGTCAGACGCGTCTTGGTCGGGATATTCGCCGTAGCGCCATCTTTGCCGCCTTGCATGTTGAGGGTGTGCAGCGTGTGGAGCTGGCTTCTCCTCTGGCGGATGTGGTCCTGAACAAAACACAGGCGGCATCATGTACGCAGTGGAGCGTAACCAACGGAGGAACGGATGAATAGTCTGCTGCCACCGGGTTCAACACCACTGGAGCGCCGACTGGCGCAAACCTGCAGCGGGATTTCTGATCTGCAGGTGCCGCTTCGTGACTTGTGGAATCCGGCAACCTGTCCGGTCAGTTTCCTGCCTTATCTCGCCTGGGCGTTCTCTGTGGATCGCTGGGACGAGGACTGGACAGAAAGTGTCAAGCGCCAGGTGGTGAAGGATGCTTTTTATATTCATCAGCATAAAGGGACCACCAGTGCCGTGCGGCGGGTGGTGGAGCCGTTCGGCTTTCTGATCCGCATTATTGAGTGGTGGCAGACCGGAGAGACACCGGGCACGTTTCGTCTGGATATCGGCGTGCAGGACCAGGGCATCACTGAAGATACCTATCTGGAACTTGAGCGACTGATAAGCGATGCCAAACCATGCAGCCGTCACATGATCGGCATGTCCATCAACCTGCAGACCAGCGGCCCGCATTGGGTGGGAGCCGCCAGCTATCTTGGCGAAGAAATCACGATCTATCCGTATATCAACGAAACAATTATTTCCGGTGGCACCGCGCATGAAGGCGGGGCGGTCCATGTTATTGACACAATGAGAGTGAATCCATGAGCACAAAATTTTATACCCTGCTGACGGATATTGGCGCGGCGAAACTTGCCAGCGCCGCCGCGCTCGGTGTGCCTTTAAAAATTACCCATATGGCGGTCGGCGATGGCGGCGGAGTATTGCCAACGCCGGATGCAAAGCAGACTGCACTGGTGAATGAGAAACGCCGGGCTGCGCTGAATATGCTCTATATCGACCCGCAGAACAGTAGCCAGATTATTGCTGAACAGGTAATCCCTGAAAATGAGGGCGGTTGGTGGATACGTGAAGTGGGCCTGTTTGATGAGTCCGGGGCATTGATTGCCGTGGGAAACTGCCCGGAAAGCTATAAGCCGCAACTGGCTGAAGGCAGTGGGCGTACCCAGACCGTGCGCATGGTGCTGATTACCAGCAGCACGGACAATATCACCCTGAAAATCGACCCTGCCGTCGTGCTGGCAACCCGCAAGTATGTGGATGACAAGGTACTGGAGCTGAAGGTGTTCGTGGATGATAAGATGGCAAAACATCTTGCCGCACCGGACCCGCATTCACAGTATGCACCCAAAGAAAGCCCGACATTGACCGGAACACCCAAAGCGCCAACGCCAGCGGAGGGGAATAACACCACGCAGATTGCGACCACCGCGTTTGTTCAGGCGGCACTGATGGCCCTTATTAATGGTGCGCCAGCCACACTGGATACGATGAAAGAAATTGCCGCTGCCATTAATAATGACCCGAAATTCAGTACCACCATTAACAATGCGCTGGCACTGAAAGCGCCGCTGTTAAGTCCGGCATTCACCGGAACGCCAACAGCCCCCACTGCCGCACAGTCGGTTAACAATACACAGATTGCCACCACGGCTTTTGTGAAATCGGCAATTGCGGCAATGGTGGGGTCTGCACCTGCTGCACTGGATACACTGAACGAACTGGCTGCGGCGCTGGGGAATGACCCGAACTTTGCCACGACAATGCTTAATGCGCTGGCAGGTAAACAACCGTTGGACAATACGCTGACTAATTTGAGTGGAAAGGATGTAGCTGGTCTTCTCGCATACCTTGGTTTGGGAGATGCATTAATTGGTGATGAATGTAAAATTGCAGGGTTTGACAGTAGTAACGTCAATGCCCCGTATATGCGATTCGCCAGAACAAATACAGTAGTTCGTCTGGCAACAAAAGACTATGCGCAACCAAAAGACCAGACACTGACAGATTTAAGCGGTAAGGATAAGGCTGAACTAAGAACTTATCTTGATCTGAAAAGTGCGGCTCAAAGGGATGTTGGCTCAGGGGCAAATCAGATTCCGGATATGAATGACTTCACATCCAGCCTGACCAGCCCTGGCTGGCAAAAATTACCGTCAGGTCTGATTATTCAGTGGGGGGCAGCCAATCCATCATCAACTGGAGAGATCTTTATTACGTTTCCTGTCGCGTTCTCTGCATACCCGATGTATGTGGGATTTGGTCCTCAGCAGGCATCGCTTCCTAACGTAGTTCAGTCGCCAGTAATTTCAGCGCCAACGATAACTAATTTAGGATGCGGCGTCCGAAATCTGATGATTCCAACAGCGGGCGGAGCACCAGTAGCCAGCATGAGTTCATTTTTCTGGATTGCGGTAGGGAAATAATATGTACAAATACAGTGCTAAAAAAAATGCGTTTTATCTGGCTGGTAATGAGGCCGTATACCGCGATTCCGGCACATGGCCTGATGATGCAAAAGATATTGAAACCCGACGTGCCGAGTCGTTTATGGCGACACCTCCGCAGGGTAAGCGACGTATTGCAGGTGCAGACGGAATGCCTGCGTGGGCAGATATTCCTTCACCCACGCATGAAGAACTTATTGAAATTTCTGAGTCAAAAAGACAGCTATTAATTAATCAGGCCAACGAATACATGAACAGTAAACAATGGCCCGGTAAAGCCGCTATTGGTCGTCTGAAAGGCGAGGAACTGGCGCAATATAATTCGTGGCTGGATTATCTGGACGCACTGGAGCTGGTTGATACCTCCAGTGTGCCAGATATTGAATGGCCTACGCCTCCGGCAGTTCAGGCCAGATGACATCCGGCGCGGTGCTGGTATCTGTTGCCGTCACCGCGTCAATGTAATCCAGCACAATGTTAAGTCGGGTGGTTTCTGCCTGCATCAGCTTCCGCCCGGCCTGCAATTTCAGCTGAATCAGACTAATGGAAGCCATTGCTGCATCAATCAGTGACTGGCGTTGTGCTTCTGCCACTTCTACTGCGGCACCGTGTTGTGCCTCAGTATCTGTCACCCATTTCTCACCATCCCATTTATCGTATGGCGTTAACGGTGCGATAGTGGTTGTATTTTCAGGGTAATCACCCGGAGCTGTGATTTCTTTGGCATCTCCCGTTTCGGTGTTATAGACGATTTCACCGCGATGATCTGGCACATATTCCCATGAGTTAAAATCTGTAGAACGGCAGATTGCATAATCAGCCTTATGTGTGCCAGGGGCATCTAAACAGGAACATGCCGGAATGCCGACACCAACGGAAAGATATTCATTTGAAGTGGAAATATATTCCCGTGTTTCACCATCATAGTTATAGACAGGAATATTTCCCGCCTTCGTGGCAATAAGCTCGCTATTTAATACAGCGTTATCCATTATGCAGCCCTCACGATATAGTTAAATGCAATATTCCGTGGGCGGGTTTCATTTACCCCATATGCTGTTGTTCTCTGATTCCCGACAAAATATCCATTTGGATTAGAGGATACAGAATAATCATCAAAATCAAGTGGGCTATCAGCACCCGGCGTCAGGCTTGCAATATATCCTCTTCCCATAACTTCTGATGCTGGTATTTCTTTTGTACGATAATCACTGACTCTGAAGAGCTGTCCCTGGTGGGTGTGTGATTCCACTCCTCCATTCTGAAGACTTAGCAAGGCACGTCCAGCATCAATACCACGCCCATCATCCCAGCCACGAATAAACTCACCACGTAAATCAGGCAATTTATTTGTCGGATAAGCCTTTGCCAGTTCCGGGTATTCTTCAGCAGAAAAAGCTGCACCATTGCATTTCAGCCAGCCTGTAGGCGGAGTGGCTGAAGGCCACGGAACAGGCACCCCAACAGGTAATGCAGAGTCTTCTCCCAAACCAACGTTTAAGAAAATGCAGCGATTACGACTAACTGGCATCATCCCCGATTTTTATTCAAGGAGATGATCATGCTTATTGGCTATGTACGCGTGTCAACAAATGACCAGAACACCGATTTGCAACGTAGTGCGCTGAACTGCGCGGGATGTGAGCGGATTTTTGAGGATAAAATCAGCGGCACTAAGTCCGACAGACCGGGGCTGAAAAAACTGCTCAGGACACTATCGGCAGGAGACACGCTGGTTGTCTGGAAGCTGGACAGGTTGGGGCGCAGTATGCGGCATCTTGTTACGCTGATAGAAGAGTTACCCGCGAACCTGCGAAGAAAGAAAGCACCACGGTGAAGCGTAAGCGCAGAACTAAGAAGCAGAAGAAAGAGCCGGAAGCGAAGCAGGGCGATTACCTGGTGGGTACGGATGAAAACGTGCTGGTACTTAATCGCACTTATGCCAACCGGAGCAACGCCGAACGAGCGGCGAAAATGCAGTGGGAACGCCTGCAACGCGGCGTTGCGTCATTCTCGCTACAACTGGCGGAAGGGCGGGCAGATCTCTACACGGAAATGCCAGTGAAAGTCAGTGGCTTTAAACAGCCGATAGATGATGCGGAATGGACTATTACCACCCTGACGCATACTGTCAGCCCGGATAACGGTTTTACGACCAGTCTGGAGCTTGAAGTGAAGATTGATGATTTCGAAATGGAATGATTCTTCGCAATGGAGAACTTTTAAGTTCTCAAAATGGAATAATGCGGTATCATTATTGTGAATTTAGCAAAAATGGGGAGAACTCGAAAAATGATGATTTGCCCACTGTGTGGAAGTGCCGCCCATACTCGCAGCAGTTTTCAGGTATCTTCATTGACCAAAGAGCGTTACAACCAGTGCCAGAACATTAACTGCAGCCATACTTTTGTTACTCATGAAACTTTTGTTCGTTCGATTGCAACGCCAAAAGAGTCAAATCCGGTTCAGCCGCATCCAATGAAATCAGGACAGGTGGCGCTCTCTCTTTGACGCTGCCGCCATTTTGTCGCCATCGTTAAAAAACAGTGCTTCTAACATCATGATTTTAAAGAGCATAAATTTCAGGCAACAAAAAACCCATCAACCTTGAACCGAAATGGCGGGGTTGATGGGCTCCACAAAATGGGGACATCAAAGAAAAGCAGTGGCACTAATTAAGACTGATGCCCTGCGGAAAAGTTCTGCGGTTGTGCAAAAAAATTTCATTTTCAGGGCAACTTCAGTTTTATCCTAATCCTGGCCATACCATGACGATGATTGTCCCTGCCAGCGTCAGCAGGACGTTGGCGATTGCATAGGTGCCCGCATAGCCCAGCGCCGGGATGTTACTGCGAGCTGTATCACTGATGATCTCCATTGCCGGCGCGCAGGTACGTGCGCCCATCATTGCGCCGAACAACAGCGCGCGGTTCATTCGCAATACATAAGCACCGAACAAGAAACAGATAACCACGGGCACCAGACTGACAATCAATCCGGCAATCAACATCTGACCGCCAATCGCGCCCAGGCCGTTATTAATACCGCTACCGGCGCTCAGACCAACGCCTGCCATAAACACCATCAAGCCGAACTCTTTCACCATGCTTAATGCACCCTGCGGAATGTAACCGAAGGTCGGGTGGTTAGCACGCATAAAGCCCAGCATAATTCCGGCGAATAACAACCCGGCAGCGTTCCCCATGCCGAAACTGAATGTGCTGAACTGGAAGGTGATCATCCCGATCATCAGCCCAATAACAAAGAAGGCGCAGAATGCCAGCAGGTCAGTGACCTGGCTGTGAATCGAGATAAAGCCGATGCGATCGGCGATGGTTTTTACGCGGCGGGCATCGCCGCTGACTTGTAAAACGTCACCTTTGTTAAGCACGACGTTGTCATCTATCGGCATCTCAATCTGGCTACGAATGACGCGGTTAAGGAAGCAACCGTGATCGGTCAACTTCAGTTGTGCGAGACGTTTACCTACAGCGTTATGGTTTTTAACGACCACTTCTTCAGTGACGATACGCATGTCGAGAAGGTCACGATCGAAAACTTCTTTACCGTTACGGAAGCTGGGATCGAGTCGGGCATGGGCGTCGGGATAGCCTACCAACGCTATTTCATCGCCCATTTGTAGCACGGCATCACCGTCTGGATTTGCCAGAATCCCGTTACGTCGAATACGTTCAATGTAGCAGCCGGTTTGTCGATAAATACCCAGTTCACGCAGATTTTTGCCGTCGGTCCAGGCCACCAGTTCCGGGCCGACGCGATAGGCGCGGATCACCGGTAAATAAACCTTACGGTTGGCATCAGTGTCCAGGCCACGTTCGCGGGCGATTTGCTGGGCGCTGGTCTGTAAGTCCTGATGCTGCAATTTCGGCAAGTAACGCGCACCAACAATCAAACTCACCAGACCGATTAAATAGGTTAAGGCATACCCGAGGCTCAGATTATCCAGTGCCAGTGAGAGCTGCCTGCTTTCCATGCCGGAATGACGCAGTGTATCGCCAGCACCGACCAGAACCGGTGTCGACGTCATAGAGCCTGCTAACATACCGGCCGTCAGGCCAATATCCCAGCCAAACAGCTTACCTAACCCTAAGGCGATCACCAGCGCACTGCCAACCATCACCAGTGCTAACATTAGGTAATTTTTCCCATCGCGAAAAAAAATGGAAAAAAAGTTCGGTCCGGCTTCGACCCCGACGCAGAAAATAAACAGCATAAAGCCAAGATTAAGCGCATCGGTGTTAATGCTGAAATGTTGTTGGCCTAATAACAGCGATACGACTAAAACGCCAATGGAATTACCCAGTTGGATCGAACCAAGTCGTAACTTTCCGAGACATAGCCCAAGCGCGAGGACCACAAATAATAACAGAATGTAATTCCCATTTAACAATTCGGCGACGTTTATATTCACGGAGGCTAACTTCTTGTTTACTAGTAAGCTGTTGAAAGAAATGGTAATTTACGATAATGTTTTTTACCAGAATTCAGGGCGCAGATTCATTCAGCGCACCTAAACGATAGTAAAGTAACAATATATTTTACTAGTGTAATCACATTAGGTATCAACGGCTATATGAATTGCGTTGGCCTATATTAGCATGGAATGCGAAGCGGCTTTATCTTACTGAACGCCACACTGGCGAAAAATGTGTTCGATAGACGCAGTGTCAGGAGGAACGAGTGAAACATAAACAACGTTGGGCGGGGGCAATCTGCTGTTTTGTCCTCTTCATTGTGGTGTGCCTTTTTCTGGCGACGCACATGAAAGGCGCTTTTCGGGCTGCCGGGCATCCTGAAATCGGCTTGCTATTTTTCATTCTTCCTGGAGCAGTCGCCAGCTTCTTTTCACAGCGTAGAGAAGTCCTGAAACCTCTGTTTGGCGCAATGCTGGCGGCACCCTGTTCGATGCTCATTATGCGGCTGTTTTTTTCACCGACGCGCTCATTCTGGCAAGAGCTGGCATGGTTACTAAGCGCGGTGTTCTGGTGTGCGCTGGGGGCACTGTGTTTCTTATTTATCAGTAGTTTGTTTAAACCACAGCACAGAAAAAATCAGTAAAGCCCTCAACGCGAGGGCTTGTCAGACGATCAGGCGTCCAGATTTTCTTTCACCCATGCAGCAAAATCGGTATAGCCGCCGATATGTTGCTGATCGACAAAAATCTGCGGCACGGTTTCTACGGGTTTACCTGCCTTTTGTTGTAGATCTTCTTTAGTGATCCCTTCCGCACGAATATCTACATACTGATACTGAAAATCATCGCGTTCATTGCTCAATTTCTCAGCCAGATCTTTTGCACGCACACAGTAAGGGCAACCCGAACGACCAAAAATAACGGTTTGCATTATTTCTCTCCTCATAGATTTATGCCTGTAATGATCACGCTAAAATGTATTCGCTGAAAGTAGGTTTAACCTGTTGCATTAATTGCTAAAAGCTATAACTGTTAAACACAATACAGTGAAAAGTTTTAGACTGAAGGCTCACTTTGCAGAGGGAAGCGTATGCGCGCGATCGGTAAATTGCCTAAAGGCGTGTTGATACTGGAATTTATCGGAATGATGCTACTGGCGGTGGCGCTGCTGTCGGTAAGCGACTCCCTGTCGCTGCCTGAGCCATTTTCTCGGCCAGAAGTGCAGATTCTGATGATTTTTCTCGGTGTTTTGCTCATGCTTCCCGCTGCGGTGGTGGTTATTCTTCAGGTGGCAAAACGTCTTGCCCCACAGCTGATGAACCGTCCACCGCAATATTCACGTTCAGAAAGAGAAAAAGATAATGACGCCAACCATTGAACTTATTTGTGGCCATCGCTCCATTCGCCATTTCACTGATGAACCCATTTCCGAAGCGCAGCGTGAGGCGATTATTAACAGCGCCCGTGCGACGTCCAGTTCCAGTTTTTTGCAGTGCAGTAGCATTATTCGCATTACCGACAAAGCGTTACGTGAAGAACTGGTGACGCTGACCGGCGGGCAAAAACACGTAGCGCAAGCGGCGGAGTTCTGGGTGTTCTGTGCCGACTTTAACCGCCATTTACAGATCTGTCCGGATGCTCAGCTCGGCCTGGCGGAACAACTGTTGCTCGGTGTCGTTGATACGGCAATGATGGCGCAGAATGCATTAATCGCAGCGGAATCGCTGGGATTGGGCGGGGTATATATCGGCGGCCTGCGCAATAATATTGAAGCGGTGACGAAACTGCTTAAATTACCGCAGCATGTTCTGCCGCTGTTTGGGCTGTGCCTTGGCTGGCCTGCGGATAATCCGGATCTTAAGCCGCGTTTACCGGCCTCCATTTTGGTGCATGAAAACAGCTATCAACCGCTGGATAAAGGCGCACTGGCGCAGTATGACGAGCAACTGGCGGAATATTACCTCACCCGTGGCAGCAATAATCGCCGGGATACCTGGAGCGATCATATCCGCCGAACAATCATTAAAGAAAGCCGCCCATTTATTCTGGATTATTTGCACAAACAGGGTTGGGCGACGCGCTAAAACCGCCACGTCGATGTATGATACGCGGGCTTTTGACCAGGTCTGACAGAGAGGTGCAGGGTGAAAATTGCCATATTGTCCCGGGATGGAACGCTCTATTCGTGTAAGCGGCTGCGTGAAGCCGCTATACAGCGCGGTCACCTGGTTGAAATTCTTGATCCGCTTTCTTGCTACATGAACATAAATCCTGCGGCGTCTTCTATTCACTACAAAGGCCGCAAGTTACCCCATTTTGACGCAGTGATCCCGCGTATTGGCACCGCCATTACCTTTTATGGGACGGCGGCACTGCGCCAGTTCGAGATGCTGGGGAGCTATCCGCTCAATGAGTCGGTCGCCATTGCCCGGGCGCGTGACAAATTGCGTTCCATGCAACTGCTGGCGCGTCAGGGCATCGACCTGCCTGTCACGGGCATTGCGCATTCGCCGGATGATACCAGCGATTTAATCGACATGGTCGGTGGTGCGCCGCTGGTGGTCAAGTTGGTTGAAGGCACGCAGGGAATTGGCGTCGTGCTGGCGGAGACGCGTCAGGCGGCGGAAAGCGTGATTGACGCTTTCCGCGGTCTGAACGCGCATATTCTGGTGCAGGAATATATCAAAGAGGCGCAAGGGTGCGATATCCGCTGTCTGGTTGTTGGCGATGAAGTGGTCGCTGCGATTGAACGGCGGGCGAAAGAGGGCGATTTTCGTTCCAATTTGCATCGTGGCGGCGCGGCAAGCGTCGCCAGTATCACACCACAGGAGCGTGAAATCGCGATAAAAGCCGCGCGAACGATGGCGCTGGACGTTGCTGGTGTGGATATTCTGCGTGCTAATCGCGGGCCGTTGGTGATGGAGGTGAATGCGTCGCCGGGGCTGGAAGGAATAGAAAAAACCACCGGTATCGACATCGCGGGTAAAATGATCCGCTGGATCGAACGCCACGCTACGACAGAATATTGCCTGAAAACGGGTGGTTAGTCGCAATCACATTACTGATCATGGTTTTGCCTGCGCTTTTTGCGTAAGCTGTGCCGGTCTTTTTATCGAAAGAGGTTGTACAAAATTATGACATCGCTGGTCGTTCCTGGTCTGGATACGCTGCGTCAATGGCTCGATGACCTGGGGATGAGTTTTTTTGAATGTGATAACTGTCAGGCTCTGCATCTGCCCCATATGCAGAATTTCGACGGTGTCTTTGATGCCAAAATCGATCTGATCGATAACACGATCCTGTTTTCTGCCATGGCGGAAGTCCGACCTTCAGCCGTATTGCCGCTGGCGGCGGATTTATCTGCCATCAATGCCAGTTCGCTGACCGTGAAAGCATTTCTTGATATGCAGGATGATAATCTGCCAAAGCTGGTGGTTTGCCAGTCTTTATCCGTTATGCAGGGCGTAACCTATGAGCAGTTTGCATGGTTCGTGCGTCAGAGCGAAGAGCAGATTTCGATGGTCATTCTTGAAGCTAATGCCCATCAACTGCTGTTACCGACTGATGATGAAGGGCAAAACAACGTTACCGAAAACTATTTCCTCCACTGATAACTCCTTTCGAGCACGCAGTCGCTGGTGCAGTGGCTGCGCGCTGCAAAATTATCTGCTGTTTTTAACCTTTTCTTAAAGATTATTTCACTTCTCTTGTGTCGATTTGGCTTTATCACATAGAGCAAATATGCATAAAAATTTGTTAAATACCGTTTTTTAATCCGAGCTATAGTCTCAAACCCTGGCTAAAGTTATTCTTGCGATGCTTTTATATAGCGAGCAGTGCTGGCCGGGAGAAAGTTCTCTTTTCTTACACCGCGCCGATAAAAAATATGCACGTTTATTGCATATCTTTCAGTGTGACAACTTTTGTTCGTTTGTTAACGAACTTTCAGAAGGAAAGAGATATGACCGCCTTAAATAAAAAATGGCTATCGGGTCTGGTTGCGGGTGCTCTGATGGCCGTCTCTGTCGGCACGCTCGCGGCTGAACAAAAAACACTCCACATTTATAACTGGTCTGATTATATCGCCCCGGACACGGTGGCCAATTTTGAAAAAGAAACCGGTATTAAAGTCGTCTACGATGTTTTCGACTCTAACGAAGTACTGGAAGGCAAATTAATGGCCGGGAGTACCGGCTTTGATCTGGTGGTTCCATCTGCCAGCTTTCTGGAGCGCCAGTTGACTGCGGGAGTTTTCCAGCCGCTGGACAAAAGCAAATTGCCGGAGTGGAAGAATCTCGACCCGGAACTGCTGAAGCTGGTCGCCAAACACGATCCCGACAATAAATTTGCTATGCCCTATATGTGGGCGACGACCGGGATTGGCTATAACGTTGATAAAGTTAAAGCGGTGCTGGGCGAAAACGCGCCCGTCGATAGCTGGGACTTGATCCTCAAACCTGAAAATCTGGAAAAACTGAAAAGCTGCGGTGTCTCTTTCCTGGATGCGCCAGAAGAAGTTTTTGCTACCGTGTTGAATTATCTCGGCAAAGATCCCAACAGCACTAAAGCGGATGATTACACCGGACCGGCAACAGATCTGCTGTTAAAGCTGCGCCCGAACATTCGTTATTTCCATTCATCTCAATACATTAACGACCTGGCGAATGGTGACATCTGCGTGGCGATTGGTTGGGCAGGTGATGTCTGGCAGGCGTCAAACCGCGCGAAGGAAGCGAAGAATGGCGTGAATGTCTCGTTCTCGATTCCAAAAGAAGGGGCGATGGCGTTCTTTGATGTATTCGCCATGCCTGCCGATGCCAAAAATAAAGACGAAGCCTATCAGTTCCTGAATTACCTGCTGCGTCCGGATGTGGTGGCACATATCTCTGACCATGTGTTCTATGCTAACGCCAATAAAGCAGCCACGCCGCTGGTGAGTGCGGAAGTTCGTGATAACCCGGGTATTTATCCGCCTGCGGATGTCCGTGCGAAGCTGTTCACTCTGAAAGTGCAGGACCCGAAAATCGACCGTGTGCGCACCCGCGCGTGGACCAAAGTGAAGAGCGGAAAATAATCCGCAGTCGTAGATGCCGGACGGGTGCACCACACACGCCGGCAATTCGCACCATCATGGTGCGCTTGCACACATTCAATGCCGGAGAGCAGCCGTGAATGACGCTATCCCTCGCCCGCAGGCGAAAACCCGTAAGGCGCTGACGCCGCTATTAGAAATCCGCAACCTCACCAAATCCTACGATGGTCAACATGCGGTGGATGATGTCAGCCTGACTATCTACAAAGGTGAAATCTTCGCGCTGCTGGGCGCATCCGGCTGTGGCAAGTCCACGCTGCTGCGTATGCTGGCAGGTTTCGAACAACCTTCTGCCGGACAGATAATGCTTGATGGCGTCGATTTGTCACAGGTTCCGCCTTACCTGCGCCCCATCAATATGATGTTTCAGTCTTACGCGCTGTTTCCGCATATGACCGTGGAACAGAACATCGCTTTTGGCCTGAAACAGGACAAACTACCGAAAGCGGAAATTGCCAGTCGGGTCAATGAGATGCTCGGCCTGGTGCACATGCAGGAGTTCGCCAAACGCAAACCGCATCAGCTTTCCGGTGGTCAGCGACAGCGTGTGGCCCTGGCCCGAAGCCTTGCCAAACGCCCGAAACTATTACTGCTCGATGAGCCGATGGGCGCGCTGGATAAAAAGCTGCGTGACCGGATGCAGCTTGAAGTGGTGGATATTCTGGAGCGCGTCGGTGTGACTTGTGTGATGGTCACCCACGATCAGGAAGAGGCGATGACCATGGCGGGGCGTATCGCCATTATGAATCGCGGGAAATTTGTCCAGATTGGCGAGCCGGAAGAGATCTACGAGCATCCAACTACCCGCTACAGCGCTGAATTTATCGGTTCGGTTAATGTCTTTGAAGGTGTACTCAAAGAGCGTCAGGAAGATGGCTTGGTGCTTGATTCGCCGGGGCTGGTGCATCCACTGAAAGTCGATGCGGATGCTTCAGTGGTCGATAACGTGCCGGTACACGTAGCGCTGCGCCCGGAAAAAATCATGCTTTGCGAAGAGCCGCCCGCCAATGGTTGTAACTTCGCGGTGGGGGAGGTGATACACATTGCCTATCTCGGCGATCTTTCGGTGTATCACGTTCGACTGAAAAGTGGGCAGATGATCAGCGCCCAGCTACAAAATGCCCATCGTCATCGTAAAGGGTTACCGACCTGGGGCGACGAAGTGCGTTTGTGCTGGGAAGTGGACAGCTGTGTGGTGCTGACGGTTTAAGGAGCAAAGATGAGTACACTTGAACCTGCTGCCCAGTCGAAACCGCCGGGCGGATTTAAGCTGTGGTTGTCGCAGCTGCAAATGAAGCATGGGCGCAAACTGGTCATTGCGTTGCCATATATCTGGTTGATCTTGCTGTTTCTGCTGCCATTTCTGATTGTCTTTAAAATAAGCCTGGGAGAGATGGCGCGCGCTATTCCACCTTATACCGAGCTGATGGAGTGGGCTGACGGGCAACTTTCCATCACTCTTAATCTCGGTAATTTTCTGCAACTGACCGACGATCCGCTCTATTTTGATGCTTATCTCCAGTCGTTACAGGTGGCGGCGATTTCGACTATTTGCTGTTTACTGATCGGCTATCCGCTGGCGTGGGCGGTGGCGCACAGTAAGCCTTCGACCCGTAATATTTTATTACTACTGGTGATCCTGCCGTCGTGGACCTCGTTTCTGATCCGCGTTTATGCCTGGATGGGAATATTAAAAAACAACGGTGTGCTGAATAATTTTCTGCTGTGGCTGGGGGTTATCGATCAACCGCTGACCATTCTGCATACCAATCTGGCCGTTTATATCGGCATTGTTTACGCTTACGTGCCGTTTATGGTACTGCCGATTTATACCGCGTTGATTCGTATTGATTATTCGCTGGTGGAAGCAGCGCTGGATCTCGGTGCACGACCGCTGAAAACGTTCTTTACTGTGATCGTGCCGCTGACTAAAGGTGGGATTATTGCCGGATCGATGCTGGTGTTTATCCCGGCTGTGGGCGAGTTTGTGATCCCGGAACTGCTCGGTGGCCCGGACAGCATCATGATCGGGCGCGTGCTGTGGCAGGAGTTCTTTAACAACCGCGACTGGCCGGTGGCCTCGGCGGTAGCGATCATCATGTTGCTGCTGCTAATTGTGCCGATAATGTGGTTTCACAAACACCAGCAAAAAAGCGTGGGAGAACACGGATGAATAATTTACCGGTAGTTCGTTCGCCCTGGCGGATTGTGATTTTGCTGTTGGGCTTCACCTTTCTCTACGCGCCAATGCTGATGCTGGTGATCTATTCATTCAACAGCTCGAAACTGGTGACGGTGTGGGCCGGCTGGTCAACGCGCTGGTATGGTGAGTTATTGCGCGATGACGCGATGATGAGTGCGGTTGGTTTAAGCCTGACAATTGCTGCCTGTGCGGCAACGGCGGCGGCGATCCTCGGGACTATTGCGGCGGTGGTGCTGGTGCGCTTTGGCAGGTTTCGCGGATCAAATGGCTTTGCCTTTATGATCACCGCGCCGCTGGTGATGCCAGATGTCATCACGGGCTTGTCGCTGTTGTTGTTATTCGTCGCGCTTGCTCATGCCATTGGCTGGCCTGCGGACCGCGGGATGCTTACCATCTGGCTGGCGCATGTCACGTTCTGTACGGCTTATGTGACAGTCGTTATTTCGTCGCGTCTGCGGGAACTGGATAGCTCGATAGAAGAAGCAGCGATGGATCTCGGTGCGACGCCGCTGAAAGTATTTTTCGTCATTACGCTACCGATGATCATGCCCGCGATCATTTCTGGCTGGTTACTGGCTTTTACTTTGTCGCTTGATGATCTGGTGATCGCCAGCTTTGTTTCTGGGCCGGGAGCCACCACGTTACCGATGCTGGTCTTTTCCAGCGTGCGGATGGGGGTGAATCCGGAAATCAACGCCCTTGCAACGTTAATTCTCGGTGCGGTCGGAATTGTCGGATTTATCGCCTGGTATCTGATGGCTCGCGCAGAAAAACAGCGGATACGCGATATCCAGCGTGCAAGACGTGGCTGAAGACACTAAAATTTGCCAACCTGGCTACATAATGCCGCGCATGCCGCGGCATTGTTTTCATGGAAGACGAAACGTTGGGATTTTTTAAGAAAACATCTTCATCTCATGCTCGCCTGAATGTGCCTGCGCTGGTGCAGGTAGCGGCGCTCGCCATTATTATGATCCGTGGCCTCGACGTGCTGATGATTTTCAATACGCTGGGCGTGCGCGGTATTGGCGAGTTCATTCATCGCAGCGTACAAACCTGGAGCTTAACGCTGGTCTTTTTAAGCAGTCTGGTGCTGGTTTTTATCGAGATCTGGTGTGCGTTTTCACTGGTGAAAGGGCGTCGCTGGGCGCGCTGGCTATATCTGCTGACACAAATCACCGCCGCAAGTTACTTGTGGGCGGCTTCGCTGGGGTATGGTTATCCGGAGCTGTTCAGCATTCCCGGTGAATCAAAACGTGAAATCTTCCATAGCCTGATGCTGCAGAAGCTGCCGGATATGCTCATCCTGATGCTGCTGTTCGTTCCCTCGACCAGTCGGCGGTTCTTCCAGTTGCAATAATGTGTATAATCGTCGCCCCTGATGATGTGAAGGTCAATGTATGCAGTGCGCACTTTACGACGCGGGTCGCTGTCGTTCCTGTCAGTGGATAACGCAGCCGATTCCAGAGCAACTCTCCGCTAAAACCGCCGATCTTAAAAATCTGCTCGCCGACTTTCCGGTTGAGGAATGGTGCGCGCCGGTGTCAGGCCCGGAACAAGGGTTTCGTAATAAAGCCAAAATGGTGGTGAGTGGTAGCGTTGAAAAACCACTGCTCGGTATGCTGCATCGAGACGGCACACCGGAAGACCTTTGTGACTGCCCGCTTTATCCTGCCTCATTTGCGCCCGTTTTTTCGGCGCTAAAACCGTTTATCGCCCGAGCGGGGTTAACGCCCTACAACGTGGCGCGTAAGCGTGGCGAACTGAAATACATTTTGCTGACTGAAAGCCAGAGCGATGGCGGCATGATGCTGCGTTTTGTGTTGCGTTCCGAAACCAAACTGGCGCAACTGCGTAAGGCGCTACCGTGGTTACAGGAACAACTACCGCAGCTAAAAGTTATTACCGTCAATATTCAGCCGGTACATATGGCGATTATGGAAGGGGAGACGGAGATCTACCTGACCGAACAACAGGCACTGGCGGAGCGTTTTAACGATGTGCCGCTGTGGGTCCGTCCGCAAAGTTTCTTCCAGACTAATCCGGCGGTCGCCAGCCAGCTGTACGCTACCGCGCGCGACTGGGTACGACAGCTGCCGGTTAAACATATGTGGGATCTGTTCTGTGGCGTGGGGGGCTTTGGTTTACACTGCGCGACGCCTGACATGCAGTTAACCGGGATCGAAATTGCGCCAGAGGCCATTGCCTGTGCAAAGCAGTCAGCCGCTGAACTGGGCTTAACGCGTTTGCAATTTCAGGCGCTGGACTCCACTCAGTTTGCCACTGCTCAGGGGGAGGTGCCGGAGCTGGTGCTGGTTAACCCGCCGCGCCGCGGCATTGGTAAACCGCTGTGTGATTATCTCTCAACGATGGCACCGCGTTTTATCATCTACTCCAGCTGTAACGCCCAAACCATGGCGAAAGATGTCCGAGAACTGCCTGGTTACCGCATCGAACGGGTACAGCTTTTTGATATGTTCCCGCACACCGCGCACTATGAAGTGCTGACGCTGCTGGTGAAGCAATAAAAAAGCCGCAGGTGCGGCTTCAGATTGCTGACAAAGTGCGCGTTGTTTATGCCGGATGCGGCGTAAACGCCTTATCCGGCCTACAAAAGCGTGCAAATTCAATACATTGCATGGGCCATGTAGGCCTGATAAGCGTAGCGCATCAGGCAATTTTACCTTTGTCATCAGTCTCAAGCCGCGGTTGCGGCTTTCTGAATCTTACTGCGGGAACCACTGGTCATTGATTTTCTGATATGTACCGTCAGCTTTAATTGCTGCCAGCGCGTTATTCAGTTTTTCCAGCAGAGCTTTGTTATCCGGACGTACAGCGATGCCCAGACCGGTGCCGAAGTATTGCGGATCGGTCACTTTTTCAGTAGCAACGCCCAGTTGCGGATTGGTTTTCAGCCATTCGTTTACCACCGCTGTGTCACCAAATACCCCATCAATACGACCATTTTTCAGATCGATAAAGGCATTCTGATAACTGTCATAGGAGACAGTTTTCACTTCGGGGTGCTGATCCTGAATATATTTCTGGTGCGTGGTGCCGTTTTCCATCCCAATACGTTTGCCTTTCAGATCGGCAAACGTTTTGTAGGTATCTTTTTTGGCAATCACGACGGCTGAGTTTTCATAGTAGGGCGTGGTAAACGATACCTGCTTGCTACGCTCGGGCGTGATATCCATACCGGAGATTACGGCGTCATATTTTCTGAATTTCAGTGACGGGATCAGGCTGTCGAACGCGTGATTAGTAAAAGTACATTCTGCCTGCATTTGTTTGCACAAGGCTTTTGCCAGATCGATATCAAAGCCGACAATCTCATTATTAGCACCTATAGATTCAAAGGGGGGATAGGTGGCTGAAACGCCAAAATTGATTTTCTCTGCGGCAGAAGCACCGAAAGTAAAGGAAGCAAGTAAAGCGGCAAGAACTAACTTTTTCATGATGGAACTCCCGTCTGTCAATCTTATGATTTTTGGCCGTGTCTGCGGCATGGGATAACAATGCCATCAAGTGAATTTATATGCAATAAACATGATTAAATAATTTAAATGAAATAAAAAAGACGGACAACTTAGTGGGTTGTCCGTCTTCATTATAAGAATTTATGCACTATGTAGGCCGGATAAGGCGTCCCCGCCGCATCCGGCACAGGCACCGTGCTGATGTCTGATGCGACGCTGGCGCGTCTTATCAGACCTACAAAACCCCCCGGCGAATGTACGCAGCCACATTAATTTCGCCGTTCGAATGCCAGCGCTTTGCGCTCGATCAGACGCATCATCAGCGTCAGCAGGCCGTTAACGACCAGGTAAATAATCCCTGCCGCACCGAACACCATTACATCGTAGGTGCGTCCGTACAACAACTGGCTGTATCCCATCACTTCCATCAGCGTAATGGTGTATGCCAGAGAGGTACTTTTGAATACCAGCACCACTTCGTTGGAATAAGAAGAGAGCGAGCGTTTAAAGGCATACGGCAGCAGGATCGCCAGCGTATCTTTTTTGCTCATTCCCAGGGCGCTACAGGACTGCCACTGACCTTCCGGGATCGCACGAATTGCACCGTAAAACAGCTGCGTGGTATACGCCGCACTATTCAGCGACAACGCAATCAGCGCACATAACCACGGTTCTGACAACAAATGCCACAGTGCCGGATACTCCTGCAAAGTCGGAAACTGGCCCGGCCCGTAATAAATCAGGAAGATCTGCACCAGCAGCGGCGTACCGGTAAACAGCGTGATATAACCCCGCACCAGCCACACCAGCACCGGCGTTTTCAGCGTCAGGATGATGGTAAAAATCAATGCCAGAATCAGTGCCACAATCAGCGAGGCAACGGTTAGCGTCAGGCTGGTGTGTAGCCCTTTCATCAGTTCGGGTAAATACTCAAACATTAGCTGGGCCTCCGCTCAAAACGTGTCGCGCGCAGGTCAATGCGTTTGAGAATGTACTGACTGAGCAGGGTGATCACCAGGTAAATCGCCGCCGCCACAATGTACCAGGTAAATGGTTCCTGGGTACGAGTAGCGATGCTTTTTGTTTGCAGCATTAAATCATTCACACTAATCAAACTGACCAGCGCGGTATCTTTCAGCAGCACCAGCCACTGGTTACCGAGGCCAGGCAGCGCATGACGCCACATCTGCGGCATCACCAGACGGAAAAAGATAGCCGATTTCGACAGCCCCAGCGCCTGACCAGACTCCCACTGACCCACCGGCACCGCTTTCAACGCGCCCCGAAGCGTTTGCGAGGCATAGGCGGCATACAGCAGTGACAGAGCGATGACACCACAAAGGAACGGGCTAACGTCGAAGTTCTCAATGTCCATCTGCACTGGGATCTGCACGAACCCAAGATTGATAGTGAAGCCATCCGAAAGCGTCAGCAGCAGCTGCGAGGAGCCAAAATAGATAAACAGCACCACCAGAATTTCTGGCAGGCCACGCAGAATGGTTACCAGCGCTGAACCTGCCCACGCGACAGGACGCCATTTTGCCGACTCCCATACCGCAAAGAACATCGCCAGCGCCAGCCCGACAATCAATGCACAAACGGCAAGGCCGACGGTCATCCCGGCGGCGCTTGCTAAAGGAAAAAATTCATTCATCAGGAATTACTTCTGGAACCATTTGTTGTAGATGGTTTCGTAAGTGCCATCTTTCTTCACTTTTTCCAGCGCAGTGTTGAGTTTCTGCTGCAGCTCAGTGTTGCCCTGACGTACCGCGATGCCGAGGCCAGTGCCGAAGTAATCTTTATCGGTCACTTTGTCGCCCACCGCCGCCAGTTTCGGGTTATCTTTCAGCCACTCAGTGACCACTGCGGTGTCACCGAAGACGCCGTCGATACGCCCGTTTTGCAGATCCAGTTTTGCGTTCTGGTAGCTGTCATACGGAACGGTAGTGATTTCCGGGTGCTTATCCATAATGAATTTCTGGTGTGTCGTCCCGTTCTGTACGCCGACTTTTTTGCCTTTCAGCTGATCAACACTGGTGTATTTGCCTTGCTGACCCACAAACAGGGCAGAGTTGTCATAGTACGGGGTGGTAAACAGCACCTGCTTTTCACGCTCCGGAGTGATATCCATGCCCGCCATCACGGCTTCTACGCGACGGAATTTCAGGCTTGGGATCAGGCTGTCAAACGCCTGGTTAGAGAAAGTGCAGGTTGCATCAATCTCTTTACACAGCGCTTGTGCCAGGTCGACGTCAAAACCAACGATCTGGTTGTTTGCATCAATCGATTCAAACGGAGGATAGGAGGCTTCGGTAGCAAAACGAATGGTTTCGGCAGCTGTGGCGGAAAGACTAAAACCTGCAATTAACGCGGCAATCAGAACTTTTTTCATTGTTGTTATCCCGAATCTTAGTGAGAGAGATAGTTTTTAAATGCTTCGGTTTGCGGCTCGGTAAAGCAGCTCGCGTCGCCTTGTTCTACGATATGACCATTTTCCATATACACCACTCGGCTGGCGGTTTTACGCGCCACTTCAACTTCGTGGGTGACGATCACCTGGGTAATATTCGTTTCTGCCAGCTCACGAATGATGCTGACGATTTGTGCCGTAATTTCCGGGTCCAGTGCGGCGGTCGGTTCATCGAACAGCAGTACCTGCGGTTCCATCATCAACGCACGGGCAATAGCAACACGCTGCTGCTGACCACCAGAAAGATGCAGCGGGTAACGATCGCTATAAGGTTTGAGACGCAGACGTTCCAGCAGTTTTTCTGCACGGGCCAGCGCCTGATCTTTACTCAACCCCAGTACACGGCAGGGCGCTTCAATCAGGTTTTGCTGCACGGTCAGATGCGGCCACAGGTTGTATTGCTGAAACACCATGCCAACGTTACGACGCAAATCGCGAATCGCTTTGTCAGAGGGTGTTTTGGTGAAATCGAAATGGTTGCCTGCAATGTTGAGCGTACCGGAGCGCGGCATCTCAAGCAGATTGAGTACACGCAGCAGCGAGCTTTTACCCGCGCCGCTGGGGCCAAGTAACACCAGCGTTTCGCCCTGTGGGCAATCCAGCGTGATATCGAACAGCGCCTGATGCGCGCCGTAGAAGCAATTAATGCCGTTTAATTGAATACTCATTGACACTCGTATACTGGCAGTCTGATAGCTATTGAGGTCGAAGATAGTACCTTTGACAGAATAATTATGCAATATTTCTGCTTTAAAAGTTAAAAGCAAAGCGCATTATTCAATAAACATAGCACAAAATAACGGGGGCGGTGGTCGGCGAGCATAAATGTCGGCATTCCTCACGAAATGCCGGACAATTTACGGGGTTTATTGGTTGATCAAGGCGTTAGCGATTCTCGATGGACTGACGGAGCGTACCCGCCGTGGCATGAACGCTACCGCCTAAGTAACGCACATCGTCGATGACCCAACACTGGCCTTCCTGAATCATTAACACTTCATCCTGCCAACCCTGGTCACCCTGTTTGAGATCCACGCGCAATGGAATGTTACGGGCATCACGATTAGGGATAGTCGATGCACTGGCAACGTGGGCGCTATCTGGCAAGGTGGTTCGACTGGAGAATGGATCGTTGGTCAGTAGTTCCCGATGGTTATTATCCCGGGAGGCATCGCTAAGCAGTGTCGCCAGTTTGTCGCTCAGATAAGGGCGCAAGGCGGTGATGTCGTTGCTGCGGTGCAAAATGCGGTAGTCATAAAATTGCTGGGCCACGTTATCCGGGCCTCCTTCAACGCAAGGACCACTGCGTGTGCCGTTATCTTTATAAGCTGGAGTGACTGTGGTGCAGGCACTGAGGAGCAGTGCGCAGGGGATAAGCATTGTCAATTTGCTGTAGCGCATAATGATTTCCTTATAAGCGATCGCTCTGAAAGCGTTCTACGATAATAATGATATCCTTTCAATAATAGCGTATCAGTCTGATAATGCTTTTGAGATCGAAGGCTTAGCAAACAAGGAGATCGATCATGCAATTTTCTACAACCCCAACTCTGGAAGGCCAGACCATCGTTGAATATTGCGGTGTGGTGACCGGCGAAGCGATTTTAGGTGCCAATATTTTCCGTGATTTCTTTGCCGGTATCCGCGATATCGTTGGCGGACGTTCCGGTGCGTATGAAAAAGAACTGCGTAAAGCACGGGAGATCGCCTTTGAGGAATTAGGCTCCCAGGCGCGGGCGCTGGGGGCCGATGCCGTCGTCGGTATTGATATCGACTACGAAACGGTCGGGCAAAACGGCAGTATGCTGATGGTTAGCGTCAGCGGTACGGCGGTGAAAACGCGTCGATGAGAAGAGTCTTCTGGCTGGTCGCTGCCGCTCTGTTATTGGCAGGGTGTGCAGGCGAAAAAGGCATTGTCGAGAAAGAGGGATATCAGCTTGATACCCGACGCCAGGCGCAGGCGGCGTATCCGCGCATTAAAGTGCTGGTGATCCACTACACCGCAGATGATTTTGATAGCTCGCTGGCGACACTGACCGATAAGCAGGTCAGCTCGCATTATCTGGTCCCTGCGGTACCACCGCGATACAACGGTAAACCGCGCATCTGGCAACTGGTGCCGGAACAAGAACTGGCCTGGCATGCGGGGATTAGCGCCTGGCGCGGGGCAACGCGCCTTAACGACACCTCTATTGGCATTGAGCTGGAAAACCGTGGCTGGCAAAAATCGGCCGGAGTGAAATATTTTGCCCCGTTTGAACCGGCACAGATTCAGGCGCTTATTCCACTGGCGAAAGATATTATTGCCCGTTATCACATCAAGCCGGAAAACGTAGTGGCACATGCGGATATCGCACCGCAGCGCAAAGACGATCCGGGGCCATTATTTCCCTGGCAGCAACTGGCGCAGCAGGGGATTGGTGCCTGGCCGGATGCGCAGCGGGTTAACTTTTACCTTGCCGGGCGCGCGCCGCACACTCCTGTAGATACTACGTCATTGCTGGAGCTTTTGGCGCGCTACGGTTATGACGTTAAACCTGATATGATACCGCGCGAGCAGCGGCGCGTGATTATGGCATTCCAGATGCATTTCCGCCCGACGTTATATAACGGCGAAGCGGATGCAGAAACTCAGGCGATTGCCGAAGCATTGCTGGAGAAATACGGGCAGGATTAGCGCGGCAGTTTTCCGTGGTCGCGTAGCCAGGCGGCAGTTTTCTCGATACCTTCATCCAGGGTGATGACCGGCTGATAACCTAACTCTTCCTGCGCACGCGTAATATCCAGCGTAAAGTCAAAATTCAACTTGGAGACGCCGTAGTGGGTCAGCGGCGGCTCTTTTGCTGACTTGCGGCCTAAACGCTCCATGCTGCGGGCGATCATATCCAGCATCGGGTAGGGGACGGAACGAATACGACAGTCAATATTCAACTCGTCGATCAGCTTCTGCACGATGCTGCGCAGTGTGCGATGCTCGCCGTTGGTGATGTTGTACACACGCCCGGAAGGTAGCTTATCGCAGGCTTCCTGGCTTGCCAGCCACATTGCGTGCACGGCATTTTCATAGTAGGTCATATCGACCAGCGCACTGCCGCCATGCGGTAACAGAATACTGCCGTAGTGGTGCATCATATGCGCCAGACGGGGAATAAAGACTTTATCGTGCGGTCCGAACAGACTTTGTGGGCGCAGAATAGTAAAGCGCGTTTGTGGATTCGCCTGCGAAAGCATATTGATCACTTCTTCGCTGGCTGCTTTGCTGCGGGCAAACTCGTTGGCGAAGCGGTGAGGGCGAAAATCTTCTTTAATATCGCGATGGTGGTGATAATCGAAGTACAGGGAGGGGGAAGAGATATGAATAAAGTTACGCACACCCCAGGCGACAGCCCATTCACCCAGGCGGCGAGTGGCGCGAACGTTAGCCAGATCGAAAGCCTGTTGTGTCCCCCAGGGTGAGGTAAAGCTGGAGCAGTGCCACAGCGTATCAATGCCCGCGAGCATCACTTTAGCTTGTGATGAAACCAGCTCGGTCAGATCCGCCGGAACAAACTCTGCGCCCATTTTTTCCAGCAATTTGCCCATTGCCTCGTTGCGACCGGTCGCTCGCACGCTGATGCCTTTCTGGCATAAAAACTCTACCGCGTTTCGACCTAAGCCGCTGGTGGCGCCGGTAACCAGTACCTTCATATCAATCCACTGTTGTTGAGAAAATAACGTGCGCATTCTTCCGTGATTTCCCCCATGATGCAATGGGAAACATGAAAGAATAACGCAGGTTTTGTCGATTAATCTGTGCTTTGTTCTGCCAGTCTGGCGATTTGTTTTGCCATTCCGCGAAAAATAAACAGATGCGCGGGGATCATCAATAACCAGTAAAACAGCCCCGGCATACCGTGCGGATGCCAGAAAGCGCGGACATCGATAGTACGATAGTCGCCTTTATCTTCCAGGCTAAAACACAGTCGTCCCAGCCCCGGCGCTTTCATGCCAAATAACAACGTAAGTTGTTTTTCCGGTTCAACGACAATCACTTTCCAGCTATCCACCGCATCGCCAGTCTGTAAATATTCGCGCTCCGGGCGGCCTTTCGCCAGCTTATGACCGATCGCGCGGTCCATCAACGCCCGTGTCTGCCACAAAATATTGCCAAAGAAATAACGCTCTTTACCGCCGATTTGGTTCACTACCTGCCATAAAGCAGCAAGGCTGGCGGACGTTTTAACGGTAAACCCCGCCTGTTTGGCAAAATAACCGTACTCCGGTCGCCAGCGGGCAAAGGCCTGAGCGTCGTAGCCCCAGTCGCTGGAGTTGACCAGTTTTTCCTCCTCTTTCAACGTGCTACGTACCGCGTCATCGAAAGCGATCAGCCGTTGTGGGATGAGTGCACGTAGCGCGGTATCATCCGCCAGCAGATCGTGTTTCAGCCCCTGAATCAACGCCCTGGCGGTGGTGGGCGGTACGGAAGTAATCACATTGAGAAACCACACCGAAATCCAGCGGGTGGGGAGGGGGATGGGGATCAACCAGCGGCGCTTACCGCTCACCGCCATAAAATGTTCAAACTGTTGCTGATAACTGAGCACCTCTGGTCCGGCGGCTTCGAAGATGCGGTGTTCGCTGGCCGGATGATCTAACAACGCCACCAGATAGTGCAGCAAGTTTTCCAGCGCGATGGGCGTGGTGCGTGAACGTACCCAGCGTGGCGGCGTTAACACTGGCAGGTTGTAGACCATATCGCGCATGACTTCGAACGCCGCTGAACCTGCGCCAACGATAATTCCGGCCCGAAGTTCGGTCACAGGTACATTCGCTTCACGAAGAATGTCCGCCGTAGCCTGACGAGCACGCAGATGATCCGACTGCTCATGTGGCGGGGCCTGCAACGAACTGAGAAAGATTAATTGCTTAACTGGTACTTCACGTAGCGCATCGCGGACGTTGAGAGCCACCTGGCGCTCCTGAGCGATAAAATCGCCGCCTTCGCCCATGCTGTGCACCAGAAAATAGACGGTATCGATATCCTGCAACAGGGCCGGAAGGTTATCCGGCCAGCTGAGATCGACTTTATGGCAACTGACGTTTGCCAGTTGCAGCTTTGCAAGCCTGTCGACATGACGTGCCGCCGCCAGGATCTGATGCCCTTGCCGGCTGAGTGTGCGCACCAGATGCTGACCAATGTAGCCACTGGCACCGAGAACTAAAATGCGTTGCGGCACGTCTCTCTCCTTAACGCGCCAGGAATGCGCGCCAGTGGGCGGCGACTTCCGCCAGTTGTTCGCGCGAGACGTCAAGATGCGTCACCAGGCGGACAATCGGCGAGGCGTTAATCAGCACGTTTCTCGCTTTCATGTATTCGCCTAACGCGGCAGCATTTTCTTCCCCGACGCGAACAAACAGCATATTAGTGTCCTGACGCATCACATCCGCGCCTGCTTCACGCAGTTGCTCCGCCATCCAGGCGGCGTTGTCGTGATCTTCCTGCAACCGTGCGACGTTATTTTTCAGGGCATACATCCCGGCGGCAGCCAGAATCCCGGACTGACGCATCCCGCCACCGGCCATTTTCCGCCAGCGGATGGCACGTTTAATGTAATCACGATTACCGACGAGTAATGAACCGACTGGCGTCCCAAGACCTTTCGACAGGCAAATGGTGAACGAATCACAATATTGCGTGATCTCTTTCAGTTCGCAGCCGTAAGCCACCACGGCATTAAAGATGCGCGCACCGTCAACATGCAGCGCCAGATTGCGCTCGCGGGTAAATTCCCATGCTTCTTTCAGGTATTCACGCGGCAGCATTTTGCCGTTGTGGGTGTTTTCCAGACTGAGTAATTTGGTGCGGGCGAAATGGATATCGTCGGGTTTGATTTTCATCGCCACTTTATCCAGCGGTAGCGTGCCGTCGGCAGCCGCGTCGATGGGTTGCGGCTGAATACTGCCCAGCACCGCTGCGCCACCGGCTTCAAACAGATAGTTATGCGCGGCCTGACCGACAATATACTCTTCGCCACGTTCGCAGTGACTGAGCAGAGCGACCAGGTTGGCCTGAGTGCCGGTCGGCAGAAAAATGGCGGCTTCTTTACCGGAAAGCTCTGCGGCGTAGTCCTGCAGAGCATTAACGGTAGGGTCGTCTCCGTAAACGTCGTCCCCAACCGGGGCGGCCATCATTGCTTCGAGCATGGCGCGGCTCGGTCGGGTAACGGTATCACTGCGTAAATCAATCATGGCATGTCCTTATTATGACGGGAAATGCCACCCTTTTTACCTTAGCCAGTTCGTTTTCGCCAGTTCGATCACTTCATCGCCGCGTCCGCTGATGATTGCGCGCAACATATACAAGCTAAATCCTTTGGCCTGTTCGAGTTTGATCTGCGGTGGAATGGCTAACTCTTCTTTGGCGACCACCACATCCACCAGCACCGGACCGTCGATGGAGAAGGCGCGTTGCAGGGCTTCATCAACTTCAGACGCTTTTTCTACACGGATACCCGTAATGCCGCACGCTTCGGCAATGCGGGCAAAGTTTGTGTCGTGTAGTTCGGTGCCGTCAGTCAAATAGCCACCAGCTTTCATCTCCATCGCCACAAAGCCCAGCACGCTGTTGTTAAAGACGACAATTTTCACTGGCAGTTTCATCTGCACTACTGAGAGGAAATCGCCCATCAACATGCTAAAACCGCCATCGCCGCACATGGCGACCACCTGACGTTCTGGCTCTGTCGCCTGCGCACCCAGCGCCTGCGGCATGGCGTTAGCCATCGAACCGTGGTTAAACGAACCTAACAGGCGACGCTTGCCGTTCATTTTTAGATAACGTGCCGCCCACACCGTTGGCGTACCAACGTCACAGGTGAAAATAGCGTCATCGGCGGCAAAATGACTAATTTGCTGCGCCAGATATTGCGGGTGAATGGCTTTCTCGCTCGGTTTAGCTAAATCGTCCAGCCCTTTGCGGGCGTCGCGGTAATCTTCCAGCGCTTTATCCAGAAACTTGCGATCGGCTTTTTCTTCCACCAATGGAAGCAATGCACGCAGAGTCGACTTGATATCGCCGACCAGTGCCATATCCACCTTGCTGTGAGCGCCGATGCTGGCTGGGTTGATATCAATCTGAATGATTTTGGCATCGGTCGGGTAGAAGGCGCGGTAGGGAAATTGCGTGCCGAGTAGCACTAACGTGTCGGCGTTCATCATGGTATGGAAACCTGACGAGAAGCCGATTAACCCGGTCATTCCAACATCATACGGATTATCGTATTCGACATGTTCTTTACCGCGCAGGGCATGAACAATAGGCGCTTTAATTTTCCCGGCAAACTCAACTAACTCTTTATGCGCCCCCGCGCAGCCGCTGCCACACATCAGGGCGATATTGCTGGAATAACGCAGCAGTTGCGCCAGTTTGCGTAACTCTTCTTCTTCCGGCGTCACGACTGGTTGTGGCGCATGATACCAGTGCATGGTTGCCCCTTCTGGCGCAGGTTTTAACGCCACGTCGCCTGGTAACACGACAATCGAAACGCCACGGTTAAGCACCGCTTTGCGCATGGCAATCGCCAGTACTTGTGGGATCTGCTCCGGGCTGGAAACCAGCTCGCAATAGTGACTACATTCGCGGAATAGCTCTTGTGGGTGGGTTTCCTGGAAATAGCCGCTGCCAATTTCGCTGGAGGGAATATGAGCGGCAATCGCCAGTACCGGAACGTGATTGCGGTGGCAATCGAACAGGCCGTTGATTAAGTGCAGGTTGCCGGGGCCGCACGATCCGGCACAGACCGCCAGTTCTCCGCTAAGTTGTGCTTCAGCGCCAGCGGCAAAGGCCGCCACTTCTTCGTGGCGGGTGGACATCCACTCGATGGTGCCCATGCGATTAAGACTGTCACTAAGACCGTTCAGAGAGTCGCCTGTGACTCCCCAGATGCGTTTCACCCCTGCCGATTCGAGTGTTTTGGCGATATAAGCTGCAACCGTTTGTTTCATGGTTCTCCATCTCCTGAATGTGATAACGGTAACAAGTTTAGTTCATCTGACGGAGGGGGAAGGGATGGGAGAGAAAGGAGGCACTAACGGTTAAATAGCCCGATGAAAGGAATATCATCGGGCATAAGGCGATTATGCGAGAACCAAATCCCCCTGCGGATGGCAGGAGCAGGCCAGTACGTAACCTTCAGCGATTTCGGCGTCGGTCAGCGTCATTGTGCTGCTCACCGTATATTCACCGGAAACCACTTTTGTCTTACAGCAGCCGCAAACACCCGCACGGCAGGCAGCGACAACCGGAACGTTATTGCTTTCCAGCGCCTCCAGTAGCGTGGTGCCAACCGGGGCGTAAAATTCTCGTGCCGGTTGCAGTTTGGTGAATTTCAGACCGCTGGTCGCCGCTTCTGCTACTGGGGTGAAGAATTTCTCTTTAAAGAAACGCGTCACGCCGAGCGCTTTCACTTCCTGCTCTACCCAATCCATATACGGAGCCGGGCCGCAGGTCATCACGGTACGTGAAGCTAAGTCAGGTACACCTGCCAGCAGTTCGCGAGTGAGACGACCAGCGATAAAGCCTTCGGTAACGTTATTTTCTGCCACCAGCGTTACCGGATAGTTACGCCACTCATCGGCGAAAATAACATCCTGCGGCGTACGCACGTTGTAGATCACCCGCACATCGGCCTGTGGACGGTTCTTCGCAAGCCAGCGACGCATCGACATAATCGGCGTGACGCCGCAGCCTGCCGCCAGCAACAGGAATTTATCTTCTGCTTTATCGTCGCAGGTAAATTCCCCCATCGCGTCCGAAAGCCAGAGATAATCACCGCGTTTTACATCGCGCGTCAGCCACTGGGAGCCGACACCGTCATCAATCCGCCGCACGGTCAGGGTGATGTATTCACTCACTCCTGGCGTGGAGGAAATGGTGTAAGCACGCAGCGTTTCCGCTGAGTTACGCACGCTGACCAGTGCATATTGCCCGGCGCGATATGGGTAGTAATCGTGGCAAATCAGGGAAATCGTCCACACATCCGGCGTTTCTTGCGTAATGTGATGAACCTGCATCCGCCACGGGCATTGATTCGTTGGCATCGTCATCGACAAACTCCTTACGCGCTCAACAGTTGCTTCATGTCTTCTTCAACAGTGGTGATAGAACGCAGGCCGAATTTCTCGTTCAGCACCGCCAGCAGGTCTGGTGTCAGGAAACCAGGTGCAGTCGGGCCGGTGACGATATTTTTCACGCCCAGAGAAAGCAGCGTCAGCAGAATGACGATCGCTTTCTGTTCAAACCAGGAGAGCACCAGCGACAGCGGCAGATCGTTCACACCACAGCCCAGTTTCTCTGCCAGAGTGACAGCCAGAATAATCGCTGAGTAAGCATCGTTACATTGACCTGCATCTACCAGACGCGGCAGACCTTCGATATCGCCAAACTCAAGTTTGTTAAAGCGATATTTACCACAGGCGAGGGTCAGGATCAGGCAGTCATCCGGCACGCTGGTGGCGAAATCGGTGAAGTAGTGGCGCTCGCCGCGTGCGCCGTCACAGCCACCAAGCAGGAAGATATGACGCAGTTTTTCACGGCTCACCAGATCAATCAGCGTATCAGCAGCGCCAAGCAGCGTCTGGCGACCAAAACCCACGGTGATAAGGTGCGGAATTTCGCTGTACGGGAAGCCTGCCATCTGTTGCGCCTGGGTGATAACCGCAGAGAAATCATCACCGTCCAGATGACGCACGCCAGGCCAGCCAACAATGCTGCGGGTCCAGATACGATCGTCATAAGCGCCTACGGTTGGGTCGATGATGCAGTTCGAGGTCATCACGATGGGGCCAGGGAAACGAGCGAACTCCACTTGCTGATTCTGCCAGCCGCTGCCGTAGTTACCGACCAGATGCTTGAATTTACGCAGCTCCGGGTAGCCATGCGCAGGCAGCATTTCGCCGTGGGTGTAAACATTAACGCCCGTGCCTTCGGTCTGTTCCAGCAGGTTGTAGAGATCTTTGAGATCGTGACCGGAAATCAGGATGCATTTACCCGCCGTCGCTTTGACGTTGACCTGGGTTGGCGTCGGGTGACCGTATTTACCGGTTTCGCCTGCATCCAGAATGCTCATCACTTTGAAGTTCATCTGGCCGATTTCCATTGAACACTCAAGAAGCGCGTTCATATCGGCAGGCCAGGTCCCCAGCCACGCCATGATTTTATGGTACTGGGCATAAATATCGTTGTCGTATTGACCGAGAACATGCGCGTGTTCCATATAGGCCGCCGCACCTTTCAGGCCATACAGGCACAGCAGACGCAGGCCGAGAATGTTTTCGCCAATCGCCGCTTTATCTTTGTTAGGGGTAAATTCTGCTGCCTGACGTTGCAGCTCGCCGAGATCATCGCTCACCAGTTGCAGGTCAGCCATCGGGTTATCGACGCGCGCGTTGGCATCTACAGCCAGGCATTGTGCTTTCAGCGCCTCGCGCAAGGCAATCGCTTCACGAGCGTAGCCGACAATACGCGGAGAATCGAAGTTAACGTTGGTCAGGGTTGAGAAAAAGGCACGTGGCGCGAAGCTGTCAACATCGTGGTTGATGATGCCGTATTCACGCGCTTTTACCGCCCAGGCAGAAAGCCCTTGCAGCGCCGCGATGAGTAAATCCTGAAGGTCAGAAGTTTCCGCCGTTTTACCACACATCCCCTGCGCGTATGAGCAGCCGTTTCCTGCCGGAGTACGGATAGTTTGTTCACATTGCACACAAAACATGATCACACCTTTTAAAGTTATATTTAATATACATGTTTAAGGTTAAGACGCTTAACGCGGGGATAAAAGGGATTTTTCATGCAACTTTAAGGGAGATTGATTTAGCGCAATTTTGGCGGCAGGGCTCTACCGCCAGAGAGGTATTACGCAGAGAAAAAGGCGATGAGGATCGGCACTAACAGGCTAAGAATAAAACCGTGAACAATTGCCGCCGGGACCATATCCAGCCCGCCAGTACGTTGAAGAACGGGCAGGGTGAAATCCATTGATGTGGCACCGCATAAGCCCAGTGCAGTAGAGCGGCTGCGGCGAATCAGCCCAGGGATCAACATAATAGCAATCAGTTCACGGGCCAGATCATTAAAAAACGCCGCGCTCCCGATTACCGGACCAAAAGATTCGGTCAATAAAATACCGGAAAGAGAATACCAGCCGAAACCGGAGGCCATTGCCAGCGCGGTATTGATGGGGAGATCAAGAATAAAGGCGTTAATTAAACCACCAATTAATGAACTGACAACCACCACCACGGCGACAATCATTCCCCGGCGATTAAGGACAATCTGCTTTAAGGTCATGCCATTATTGCGCAACTGAATACCAACGAGGAAAAGTAGCAAAATTAACGTGTATTCACTGGCTTCGGTCGCGTGTTGTAAGAAAGCCAGTCCACTCAGACCAATGGCAAAACCAATCACTACTACGCCGCACAGTTTTAGCGACTCCAGCGCCATCGCAATACGCGACGGGAGTTTTTCTTGCTGATGGTGGTTGCGCCACGGCAGGCCTCGCTCCAGCCACATCAGGGCGGCAATATTACACAGTAAAATAACGGTAATACTGACGGCAGAATAATGCAGAATCGCCAACAGGTTACTGGCGAGGTTATCGAGAAACGCCAGACTGATACCCATAAAAAAGAGAATAAGGTAAACCATCCAGCTTAATAGCTGATTAATAACTTTTAACGCAGCTTGTTGGCGAAGCGGAATGAGGTAACCCACAATCAGGGGAACCAGAATGATTAACAGCCCAGAAAACATGAAAACCCAGTCCTTGCAAAGATGAAGTCGAAATGCGCGATGACACACTACTGAAAGCGGAAGGACGAGTAAAGTTGCAATTAAAAGGAAATGTTATGCATAAGGAGCAGTAGAGTATTCGTTTTCATTTAAAGATATTCTTGCGCTTTAATTACAAACTGCACCGATGTTGGTGGCGTCAAAATCGCCGAGGCGTTCCCTGAAGGCCGGGGCAGCCCACATGGATGTGGGCTGAGGGCGCGTTTTACAGGGATGTTACCTCGCGCCCGACCCGGTAGCCGTAAGGGATAAGTCGAGGGCACCGCGCAGCGGCGATTTTGTTCGCCAGAGCCCGGGGGTGCAGGGGGCGGCGGCGATTGGCCGCCCCCTGCGCGCTCCTTGCGCCAGTGGCAATATGTTGCTTAGCTCATGAAAGGAGCGCAACAAGATGATGAATCAACATATAACAACATCTTAAAAAAAGGCCTGACATTACGCCAGGCCTTCTGCGTTAATTAATCACGCTTTTCCAGCAGGGTCCGGTAAATCAGACCACCGATAATGCCGCCGACAATTGGCACCACCCAGAAGAACCACAGTTGTTCTAATGCCCAGCCGCCCTGGAAGATAGCAACCGCGGTGCTGCGCGCCGGGTTAACAGAAGTGTTAGTCACCGGAATACTAATTAAGTGAATCAGGGTTAAGGCCAGACCAATAGCGATCGGCGCAAAACCTGCCGGCGCGAATTTGTCGGTTGCGCCGTGGATCACCAACAGGAAACCTGCACTCAATACCAGTTCAACTACCAGCGCGGAAAGCATGGAATAACCGCCTGGTGAATGCTCGCCATAACCGTTAGAAGCAAAACCGCTGGCTGCCGCGTCAAAACCCGTTTTACCACTGGCAATTAAATACAGCAGCGCTGCCGCAACAATACCGCCGACAACCTGGGCAATTACGTAGCCAACGACTTCTTTTGCCGGAAAACGTCCGCCAGCCCATAAACCAATAGTGACCGCCGGGTTAAAATGACCACCAGAAATATGACCAACAGCAAAGGCCATCGTCAGAACGGTCAGACCGAACGCCAACGCCACGCCGGCAAAACCAATGCCTAATTCCGGGAAGCCTGCGGCCAGTACAGCACTACCACAGCCACCAAAAACAAGCCAGAAAGTACCAAAACATTCAGCTGCTAATTTTCTGAACATATCCACCTCAATTAAAAATTGACCCTGTGAAAAATATGGTCGTTTTATAGGGCCGTCGTAAAAAGTGACGACGGAAATAATGCGCGGCTATTTTAAAAACGAAGGCGAGTCATTCACCAGATAAATAAATCCAGTAAATTTGATTTAGGGCAACAGCGGGTTGCCCCATATAGTCGTTTGTCTGATTGACAGTGTAGTGCAAGCAAAAGATTTAATCCTTTAGGCGTAATAAAAAATAATTTATCATGCTAATTATTTGATTTTGTTGTTTTTGCAGACTTATCAGCAAGAGGGAGTATAACGCGATTATTCGCTCATTTTTCAGACATTTGCCATGCTTAAATGTGATGTCATCACGTATTAGCAAGGCCTTTCCCGTTATACTGCCAGCGTAAAGGATAAGTCACATATTTCTGGAGGGGATATGATTCTTGAGCGCGTTGAAATTGTGGGTTTTCGCGGTATCAACCGTTTGTCGTTGATGCTGGAACAAAACAACGTCCTGATTGGGGAGAACGCGTGGGGTAAATCCAGCTTGCTGGACGCCTTAACTCTGCTGCTATCGCCAGAATCAGATCTCTACCATTTTGAGCGCGACGATTTCTGGTTCCCGCCGGGAGATATCAACGGGCGAGAACATCATCTGCATATTATTTTGACCTTCCGCGAATCGCTGCCAGGCCGACATCGGGTTCGCCGTTATCGGCCGCTGGAAGCGTGCTGGACGCCATGCACCGATGGCTATCACCGTATTTTTTATCGTCTGGAAGGGGAGAGTGCGGAAGACGGCAGCGTGATGACACTGCGCAGTTTTCTCGATAAAGACGGACATCCGATTGATGTCGAGGATATTAACGATCAGGCACGCCATCTGGTGCGTTTAATGCCGGTGCTGCGCTTGCGTGATGCCCGTTTTATGCGCCGTATTCGTAACGGCACGGTGCCAAATGTCCCTAATGTGGAAGTCACCGCGCGCCAGCTCGATTTCCTCGCCCGTGAGTTATCCTCACATCCGCAAAATCTCTCTGATGGGCAGATTCGTCAGGGACTTTCCGCAATGGTACAGCTGCTTGAGCATTATTTCTCTGAGCAGGGGGCCGGACAGGCGCGATATCGTTTAATGCGGCGGCGAGCCAGCAATGAGCAACGAAGCTGGCGCTATCTGGATATCATCAACCGGATGATTGACCGACCTGGTGGGCGCTCGTATCGGGTTATTTTGCTCGGCCTGTTTGCTACTTTGTTGCAGGCAAAAGGCACATTGCGACTGGATAAAGACGCCCGTCCATTGTTGCTGATCGAAGATCCAGAAACCCGTTTACACCCCATTATGCTTTCAGTTGCCTGGCATCTGTTGAATCTTCTGCCATTGCAGCGCATTGCCACCACCAACTCGGGTGAGTTGCTTTCGTTAACGCCGGTAGAGCATGTTTGCCGACTGGTACGTGAGTCCTCGCGCGTTGCCGCCTGGCGTCTGGGGCCGAGTGGCTTGAGTACCGAAGATAGCCGACGCATATCCTTTCACATTCGTTTTAACCGTCCGTCATCGCTGTTTGCACGCTGCTGGTTGCTGGTGGAAGGGGAAACGGAAACCTGGGTTATCAATGAACTGGCGCGTCAGTGCGGACATCATTTTGATGCCGAAGGGATCAAGGTCATTGAGTTTGCCCAGTCCGGGCTAAAGCCACTGGTTAAATTTGCCCGCCGAATGGGGATTGAATGGCATGTACTGGTCGATGGCGATGAAGCAGGGAAGAAATATGCCGCTACGGTACGCAGCCTGTTGAATAACGATCGGGAAGCCGAACGAGAACATTTAACGGCGTTACCGGCGCTGGATATGGAACATTTTATGTATCGCCAGGGATTTTCCGATGTGTTCCACCGCATGGCGCAAATCCCGGAAAATGTACCGATGAATCTACGCAAAATTATCTCGAAAGCGATCCATCGCTCTTCCAAACCCGATCTTGCCATTGAAGTGGCAATGGAGGCAGGACGTCGTGGTGTGGACTCCGTACCGACGCTGCTGAAAAAAATGTTCTCACGCGTGCTGTGGCTGGCGCGCGGTCGCGCGGATTAACCGCGAAACATCGTGGCCATTTGTGGCTGAATAGCGTCGAGCATCTCATAGCGCCGACGGTATTCAGCCCGTTTTTTACTGGCGATTTCGGCAATCTCTTTTCGTGCTATCTGTGCTGGAAGGCGGTAATGGCGTTCAGCATCACATACGCCGCCAACCGATTCCCAGAAAGCGTTGTAATCAGCGTGGATCTTGCCTTCTTTATCGCGATAACGCAGGCTGCGGTAAATATGCGTTTCATTGCTGACGGCAATAATCTGCTCTACCTGCAAACGTTGGGCAAACAGACAGGCCGCTTCCATCACGAGGCGTTTGGGAAATAGCCCGTGGCAGGCTTTCGTCGCATTCTGGATTTCCTGATGTGGAATTTCCCATTTTGCGCCTTGCAGTCCGCCAATAAACATCGTTCTTTTCCCCTGATATTCACACAGGGTAAACGTGATCTCTGCCAGAGGAATACCTTCGCTGTTGCGGAACAGGATTGTGCTGTCACCTTCTTTATCCATTGAGATCATCATGGTCAGCTCAAGCGTGAACTGCTCGCCGTTTTTGCCTTCCAGCTTCGCCAGTTGCAGCCCGGGGGTATTCAAATATAAGCTGAATTCTTCCGCCGACATACATCCGCGGAGTAACGCATAATGGTAACGTAACGCCTCCAGCAATTGCTTACGGCTAAGATTCGCCGCAAGGTAAGGGCGATGCAGACGCACAGGCAGTCGCGGCTGGCGCGTTAACAATACATTGAGATTAGGCCAGTGGGAAAGTTCGTTCATCCACTCAACGCTTAAACGCGGCATAATCAACGAGCGCAGCAAAAATTTCTGGCGAAAACTACGGCGATGCCAGAATTTACCCGGCCGACACTGTCCACGTGCCAGACTAAGAAAAAGTGACAGGCTGCTGAGAGATTCAGATGGCGTAAAGGTCCGTTCAGTTAGCTGCGACATATTCATGAAATCAATGGTTATACATGACGTCGATTTCACCATTGCGTATCTTAACCAAACATCAATAGTGTGATTACTAACGTAAATTTTAGGGGTTTGTTGATATTTCGTTGAAGTTAATGACCCGGATTGGCATATGGAGTATTCAGAATATTTATGAAAAAGCGGAAAACCGTGAAGAAGCGTTACGTTATTGCGCTGGTGATAGTCATCGCCGGACTGATTACGTTATGGAGAATTCTTAACGCACCCGTGCCGACTTATCAGACACTGATTGTGCGCCCCGGTGATTTACAGCAAAGCGTGCTGGCGACCGGAAAGCTGGACGCGCTGCGTAAGGTTGACGTGGGCGCGCAGGTCAGCGGTCAGTTGAAAACGCTGTCGGTGGCGATTGGCGATAAAGTAAAAAAAGACCAGCTTTTAGGGGTCATTGATCCTGAACAGGCTGAAAACCAGATCAAGGAGGTCGAAGCAACGCTGATGGAGCTACGCGCGCAGCGGCAGCAGGCGGAAGCGGAGCTGAAACTGGCGCGGGTGACGTATTCCCGTCAGCAACGTCTGGCACAAACGAAGGCTGTTTCACAGCAGGATCTCGACACCGCCGCGACGGAGATGGCTGTGAAACAGGCGCAAATTGGCACCATTGACGCGCAAATCAAGCGCAATCAGGCTTCTCTCGATACGGCTAAAACCAATCTCGATTACACTCGCATCGTTGCCCCGATGGCCGGGGAAGTCACGCAAATCACCACTCTGCAAGGCCAGACGGTGATTGCCGCACAACAAGCACCGAACATTCTGACGCTGGCAGATATGAGCACCATGCTGGTAAAAGCGCAGGTTTCTGAAGCGGATGTAATCCACCTGAAGCCGGGGCAAAAAGCCTGGTTTACGGTGCTTGGCGATCCACTGACGCGCTACGAGGGGCAAATCAAGGATGTACTACCGACGCCGGAAAAGGTTAACGACGCTATTTTCTATTACGCCCGTTTTGAAGTCCCCAACCCCAATGGTTTGCTGCGGCTGGATATGACTGCGCAAGTGCATATTCAGCTCACCGATGTGAAAAATGTGCTGACGATCCCTCTGTCGGCGTTAGGCGATCCGGTTGGCGATAATCGTTATAAAGTCAAATTGTTGCGTAATGGTGAAACACGCGAGCGTGAAGTGACGATTGGCGCACGTAACGATACCGATGTTGAGATTGTCAAAGGGCTTGAAGCGGGCGATGAAGTGGTGATTGGTGAGGCCAAACCAGGAGCTGCACAATGACGCCTTTGCTCGAATTAAAGGATATTCGTCGCAGCTATCCTGCCGGTGATGAGCAGGTTGAGGTGCTGAAGGGCATCACCCTCGATATTTATGCTGGTGAGATGGTGGCGATTGTTGGCGCTTCGGGTTCCGGTAAATCGACCCTGATGAATATTCTCGGCTGTCTGGATAAGGCCACCAGCGGCACCTATCGCGTCGCCGGTCAGGATGTTGCCACGCTGGACGCCGATGCGCTGGCGCAACTGCGCCGCGAGCATTTCGGCTTTATTTTCCAGCGTTACCATTTGCTTTCGCATTTAACCGCCGAGCAGAACGTTGAAGTTCCCGCCGTCTATGCTGGTCTTGAGCGGAAACAGCGGCTGCTTCGCGCCCAGGAGTTGCTGCAACGGCTGGGGCTGGAAGACCGTACAGAGTATTATCCGGCACAGCTTTCGGGTGGTCAGCAACAGCGCGTCAGCATCGCGCGGGCATTGATGAACGGCGGTCAGGTAATTCTTGCCGATGAACCAACCGGCGCACTGGACAGCCATTCTGGCGAAGAGGTGATGGCGATCCTGCATCAGCTGCGCGATCGCGGGCATACGGTGATTATCGTCACCCACGATCCGCAGGTCGCTGCTCAGGCCGAGCGGGTGATCGAAATTCGCGACGGCGAAATTGTGCGCAATCCTCCCGCCATTGAAAAAGTGAATGTTGCTGGCGGGACGGAGCCTGTTGTCAACACGGTGTCTGGCTGGCGGCAGTTTGTCAGCGGTTTTAACGAGGCGCTGACGATGGCATGGCGGGCGCTGGCAGCGAATAAAATGCGTACTTTACTGACCATGCTGGGGATTATTATCGGTATTGCGTCGGTGGTTTCCATTGTCGTGGTGGGTGACGCCGCCAAACAAATGGTGCTGGCGGATATTCGTTCTATTGGTACGAATACTATTGATGTCTATCCCGGGAAAGATTTTGGCGATGACGATCCGCAATATCAACAGGCGCTGAAGTACGACGACTTAATCGCCATCCAAAAACAACCGTGGGTCGCCTCAGCCACACCTGCCGTCTCGCAAAACCTGCGCCTGCGTTATAACAATGTTGATGTTGCTGCCAGTGCCAATGGCGTGAGCGGCGATTATTTTAATGTCTATGGCATGACCTTCAGTGAAGGGAACACCTTTAATCAGGAGCAGCTGAACGGTCGTGCGCAGGTCGTGGTTCTCGACAGTAATACTCGCCGCCAGCTTTTCCCCCATAAAGCAGATGTGGTTGGCGAGGTGATTCTGGTCGGCAATATGCCCGCCAGAGTCATTGGTGTGGCGGAAGAAAAACAGTCGATGTTTGGTAGCAGTAAAGTGCTGCGTGTCTGGCTACCTTACAGCACGATGTCCGGGCGAGTTATGGGCCAGTCGTGGCTTAACTCCATTACTGTCAGGGTGAAAGAAGGATTTGACAGCGCCGAGGCGGAACAGCAACTCACGCGTTTACTTTCACTGCGCCACGGAAAGAAGGATTTCTTTACCTGGAACATGGACGGCGTCTTGAAAACTGTTGAAAAGACCACACGTACTTTACAACTGTTTCTGACGCTGGTGGCGGTGATTTCGCTGGTGGTGGGCGGTATTGGTGTAATGAATATTATGCTGGTGTCAGTGACCGAGCGGACGCGGGAAATTGGCATTCGCATGGCTGTAGGTGCGCGAGCAAGCGATGTTTTGCAACAGTTCCTGATCGAAGCCGTACTGGTTTGCCTGGTCGGTGGCGCGTTGGGAATAACACTGTCACTGTTAATTGCTTTCACCTTGCAGCTTTTCTTACCCGGCTGGGAGATTGGTTTTTCACCGTTGGCGCTGCTGCTGGCGTTTCTCTGCTCGACGGTCACCGGGATTTTATTTGGCTGGTTACCCGCACGAAATGCGGCACGACTGGATCCAGTAGATGCTCTGGCACGAGAGTAATTTTTGAGATAAAAATGCCAGCCGATCGGGCTGGCATTTTGCCTTTAGGATGTACACAATGAGACAGAAGAGCTATGCGACTGCCGCTTCTACTTCGACGGGCACAATAACACTGGCGTGATTGCCTTTTGGCCCCTGGTGGACATCAAACTGAACGGATTGTCCAGCTTTTAGCGTTCTGTAACCATCCATCTGAATGGTGGAATAATGAGCGAAAATATCTTCGCCGCCGCCTTCAGGGCAGATGAAACCAAACCCTTTGGCATTGTTGAACCACTTAACAGTACCCTTTTCCATGCTTCGACATCCTTCGCAAATCTTATACAAGTAAGATGGAATAAACCGGGGTCAGAGAGGGGGCTGTTCAAAACCTCGCCAACTCTAGAAATACAATTTAGAGAATTAGGGCGAGCCGTCAAGCATTTGACAGGGGACAGGGGGCAGGTATGAATCAAAAATTTGAAGCAGTTAACGCTATTGACAGGAATGTGACAGATGTCGCTGATGCCAACGATAGATGATAGTTATCTATCATGTGGAGTAGATTGGTCAGGCAAATAAGCTCTTGTCAGCGGCAGGGCGTTCTGCCGATAACCGTAACCGAAGATGATAACTGACAATGGGTAAAACGAACGACTGGCTGGACTTTGATCAACTGGCGGAAGAAAAAGTTCGCGACGCGCTAAAACCGCCATCTATGTATAAAGTGATATTAGTCAATGATGATTACACTCCGATGGAGTTTGTTATTGACGTGTTACAAAAATTCTTTTCTTATGATGTAGAACGTGCAACGCAATTGATGCTCGCTGTTCACTACCAGGGGAAGGCCATTTGCGGAGTCTTTACCGCCGAGGTTGCAGAAACCAAAGTGGCGATGGTGAACAAGTACGCGAGGGAGAATGAGCATCCATTGCTGTGTACGCTAGAAAAAGCCTGAATGCAGGCATAAAAATTGGGGGAGGTGCCTATGCTCAATCAAGAACTGGAACTCAGTTTAAATATGGCTTTCGCCAGAGCGCGCGAGCACCGTCATGAGTTTATGACCGTCGAGCACTTGTTACTGGCGCTGCTCAGTAACCCATCTGCCCGGGAGGCGCTGGAAGCGTGTTCTGTGGATTTGGTTGCGCTCCGTCAGGAACTGGAAGCCTTTATTGAACAAACCACACCCGTTCTGCCTGCCAGTGAAGAGGAGCGCGACACACAGCCGACGCTGAGTTTTCAGCGTGTACTGCAACGTGCGGTCTTCCATGTCCAGTCCTCCGGTCGCAATGAGGTTACCGGTGCAAACGTTCTGGTCGCTATCTTTAGCGAACAGGAGTCGCAGGCGGCATATCTGTTGCGTAAACATGAAGTCAGCCGTCTCGATGTGGTGAATTTTATCTCTCATGGCACGCGTAAAGACGAGCCGACACAGTCTTCTGATCCTGGCAGCCAGCCAAACAGCGAAGAACAAGCTGGTGGGGAGGAACGTATGGAGAATTTCACGACGAACCTGAATCAGCTTGCGCGCGTGGGCGGAATCGACCCACTGATTGGTCGTGAGAAGGAGCTTGAGCGTGCTATTCAGGTTCTCTGCCGTCGCCGTAAAAACAACCCGCTGCTGGTGGGGGAATCTGGTGTCGGTAAAACCGCGATTGCAGAAGGTCTTGCCTGGCGAATTGTTCAGGGCGATGTGCCGGAAGTGATGGCTGACTGTACAATTTACTCTCTCGATATCGGTTCTCTGTTAGCGGGCACTAAATATCGCGGCGACTTTGAAAAACGTTTTAAAGCGTTGCTCAAGCAGCTGGAGCAGGACACTAACAGCATCCTGTTTATTGATGAGATCCACACCATTATCGGTGCGGGTGCAGCGTCTGGTGGCCAGGTCGATGCGGCTAACCTGATCAAACCGTTGCTCTCCAGCGGTAAAATTCGCGTAATTGGTTCGACAACCTATCAGGAGTTCAGCAACATTTTCGAGAAAGACCGTGCTCTGGCGCGTCGCTTCCAGAAAATTGATATTACTGAACCGTCGATCGAAGAAACTGTTCAAATCATCAATGGCCTGAAACCGAAGTATGAAGCGCACCACGACGTGCGTTATACCGCAAAAGCGGTGCGTGCGGCGGTAGAGCTGGCGGTGAAATACATTAACGATCGTCATCTGCCGGATAAAGCCATTGATGTTATCGACGAAGCGGGCGCTCGCGCACGTCTGATGCCGGTAAGCAAACGCAAGAAAACCGTTAATGTGGCGGATATTGAGTCCGTGGTGGCCCGTATTGCACGCATTCCAGAGAAGAGTGTTTCTCAGAGTGACCGCGATACCCTGAAAAACCTCGGCGATCGCCTGAAAATGCTGGTCTTCGGCCAGGATAAAGCCATTGAGGCGCTGACTGAAGCCATTAAGATGGCGCGTGCAGGTTTAGGTCACGAACATAAACCGGTCGGTTCGTTCCTGTTTGCCGGTCCTACCGGGGTCGGGAAAACAGAGGTGACGGTACAGCTTTCGAAAGCGTTGGGCATTGAGCTTCTGCGCTTTGATATGTCCGAGTATATGGAACGCCATACCGTCAGCCGTCTGATTGGTGCGCCTCCGGGATACGTTGGTTTTGATCAGGGCGGTTTGCTGACTGATGCGGTCATCAAGCATCCACATGCGGTTCTGCTGCTGGACGAAATCGAGAAAGCGCACCCTGACGTGTTCAATATTCTGTTGCAGGTGATGGACAACGGTACGCTGACCGATAACAACGGACGCAAAGCGGACTTCCGTAACGTGGTGCTGGTGATGACCACCAACGCCGGGGTACGTGAAACTGAGCGTAAATCCATTGGTCTTATCCACCAGGATAACAGCACCGATGCGATGGAAGAGATTAAGAAGATCTTTACACCGGAATTCCGTAACCGTCTCGACAACATTATCTGGTTTGATCATCTGTCAACCGACGTGATCCATCAGGTGGTGGATAAATTCATCGTCGAGTTGCAGGTTCAGCTGGATCAGAAAGGTGTTTCTCTGGAAGTGAGCCAGGAAGCGCGTAACTGGCTGGCCGAGAAAGGTTACGACCGGGCAATGGGCGCACGTCCGATGGCGCGTGTCATCCAGGACAACCTGAAAAAAACGCTCGCCAACGAACTGCTGTTTGGTTCGCTGGTGGACGGCGGTCAGGTCACCGTCGCGCTGGATAAAGAGAAAAATGAGCTGACTTACGGATTCCAGAGTGCACAAAAGCACAAGGCGGAAGCAGCGCATTAATCTGATTGTCAGGTAGGTTGGTGAAGTCCGTAATCTCGAAAGAGGTTACGGACTTTTTGTTTATGGGGTGGAACGGTGAAAACCCTATTTTTGGAGGTGAAGGTAAGTTGTTGATAATTAGTGCTGCTGGAAGGTAAGGATAAAAAAGGGTGCTGCAGGAGAATGGGATGGTTTTGCTTTATTAACAACGGGCTAAACGTGTAGTATTTGAGTTCACTGCCGTACAGGCAGCTTAGAAATTCACAGGTAACATACTCCACCCGCCCACCATGTTCACTGCCGTACAGACAGATAAAATGCGAAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTAAATATGGCGGTGAGGGGGGGATTGACTCGCTTCGCTCGCCCTGCGGGCAGCCCACTCACTGCGTTCGTGGTCTGTCCAACTGGCTGCGCCAGTTGTCGAACCCCGGTCGGGGCTTCTCACCCCCCCTGGAGTGCATTATGCGAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTAAATATGGCGGTGAGGGGGGGATTGACTCGCTTCGCTCGCCCTGCGGGCAGCCCACTCACTGCGTTCGTGGTCTGTCCAACTGGCTGCGCCAGTTGTCGAACCCCGGTCGGGGCTTCTCACCCCCCTGGAGTGCATTATGCGAAAAAAAGCTCGCACTTTCGTACGAGCTCTTCTTTAAATATGGCGGTGAGGGGGGGATTCGAACCCCCGATACGTTGCCGTATACACACTTTCCAGGCGTGCTCCTTCAGCCACTCGGACACCTCACCAAATTGTTTTGTCGCCTGACCTCATGGGGGCAACGGGGCGCTACTATAGGGAGTTGGAGTAAAACGGTCAAGAAGAATTTTAATGATAATTATTGTTTGCTCATACTGTAAACAAGTTGTGCAGTATATCTACATCGAGACAAGTTACGGACTTATACTTCCAAAGTACTTCATACATATCACAAAATAAAAAGGCCGGTTAAACCGACCTTTTACTCGTTCTTTCTCTTCGCCCATCAGGCGGTAAAACAATCAGCGACTACGGAAGACAATGCGGCCTTTGCTCAGGTCGTACGGGGTCAGTTCAACAGTCACTTTGTCGCCCGTCAGGATGCGGATGTAGTTTTTGCGCATTTTACCGGAGATGTGTGCAGTAACCACGTGACCGTTTTCTAACTCTACGCGGAACATGGTATTAGGCAACGTTTCAAGAACGGTACCTTGCATTTCAATATTGTCTTCTTTGGCCATCTAATCCTCTGGGGTATCACTACCGTAATTTGAACCGGCAAGATAATGCCGAAGTTCTGTAAATAAGTAAAGATTTGCGCGCTAAATCGCAACAAACAGGTTCGGCACATTACTCCGAAAACACACGGCTAAGCCGCACCAAAAGCGCAACGTATAAGGGAGCGGTGAGATAAACGATGGGCGTTACCTGACGCGAAAAATTCCTTATCGGCAGCGGGGTAATGAGCGTAACCAACTCTGCGACCGCAATTATAACACTCTGGGGAGAAATGTGCCGAAAACATTCATTCTTGTGGTGAAAACAAGCACCGTGGTACCCAGAAATTATTCGGCAATCGTCCGAGGCGCATTTGATTGAGATAATTAAGGTAATCCCGGCGGGGAATTTCGCAGGCACCAAGCGATGCTGTGTGATCGTTAAGGACCTGGCAGTCGATAAGCTTACCGCCATGACCGATAAATTCCTCACAGAATACCAGAAGCGCCGTTTTAGACGCATTTTCCATCCGGCTGAACATGGACTCGCCACAAAATAGTGTTCCCTGGGCCACGCCGTACATACCGCCGACAAGCTCATCTTCACGCCAGACTTCAATGGAGTGGGCATGACCGAGTTCGTGAAGGCGATGGTAGGCTTCGACCACGCCACGCGTGATCCAGGTTCCTTCTTCGCGATCGCTGGCACAGCCTTCAATGACCTGACCAAAAGCGTAATTCATCGTGACACGATAGGGCGAGCGTTTATGAAATCGCTTCATACTACGGCTGATATGCAGTGATTCTGGCCATAGCACCGCACGGGGATCGGGCGACCACCAGAGGATGGGGTCGCCTGGAGAAAACCACGGAAAAATACCACGCTGGTAAGCCATTAACAGGCGCGCAGGGCTAAGATCGCCCCCAAGTGCCAGCAGGCCGTTAGGCTCACGTAATGCGCCTTCCGGGGAAGGGAAGGCTATTGAATGGCGAGAAAGCTGAACCAGGCGCATGACCGCAAAACTCCACGCAAGTCGGATCGTTCAATAATAGCTTACAAACCCTGCTTGAACTGGTAATAACGCCCCTGTCTGGCAAGCAGTTCTGCGTGAGTACCTTGCTCAATAATTTGCCCGTTGTCCATCACTATTATTTGTTGGAAACGAGAGAGTCCGCGAAGTCGATGGGTGACCATTAACACCGTTTTCTCACGCATCATTTCTGCAAGCAATTCAAGGATCTGGCTTTCGGTTGTGGCATCTAAGCCTTCGGTAGGTTCATCCAGCAACACCAGTGGCGCATCATGTAACAGCGCACGGGCGATAGCCAGACGGCGCAGTTCACCACCGGAGAGCTGGCGTCCGCCTTCACCTAACCAACTGTTGAGACCTGCATCCTCGAGCAGCTTCTCCAGACCAACGCGACGCAAGATCTCCGACAGAGCCTCATCACTACTGCCAGGCGAGGCGAGTAACAGATTATCACGCAGCGTGGCGCTAAACAGATGCACTCGCTGAGGAACAACGCTGATGGTCTGTCGTAGAGCCGCTTCATTCAGGCTGGCTATGGGGCTATCGTTAAGCAAAATCTCGCCCTGTTGCGGGTCCCAAGCGCGGGTCAGCAGTTGTAACAGTGTTGATTTGCCGCATCCGGTTCGCCCGAGAATCGCAATATGTTCCCCGGCGTTTACCTGAAGAGAAATCCCTTTAAGTGCCTGTTGGGATTGCTCCGGATAAGTGAACTGAACATCCCGTAATGTCAGCGAAACGCGATCGGCAACACGAGTTTGGGTATCAGGGAAGGTGACTTCCGGTTTTTGATCCGTTAAATCAGTGATCCGAACGGCAGAGGCAATGACTTGCCCCAGATGCTGAAATGCACCCGTTACTGGTGCCAGTGCTTCAAACGCGGCTAACGCGCAGAAGACAAACAGGGCAATTAACGCGCCGGGTTGAGCATTGCCGCCAACGCCGCCAGACGCCATCCACAGCATCAGGATCACCGCTAACGCGCCAATGAGCAGCATTATCGCTTGCGACAATGCGGTCAGTTCAGATTGACGGCGTTGCGCTTCCAGCCATTGAATTTCTGTATTCTCTAGTTGCGTGCGATAACGATCGCTGGCACCAAAAATGGTCAGCTCAGCTTGCCCTTGCAGCCAGGCCGTCAGTTGTTGGCGATACTGTCCGCGAAGATGAGTCAGATTTTGCCCGGTGCTTTTTCCCGCACGATAAAACAGCGGTGGCATCAGGAAAAGCGTCAGTAACATAATGCCGCCCAGCGTAAAGGCGAGGGTGAAATCAAGGAAACTTAACCCGATTGTCACCACCATAATCACCACAAAAGCGCCCACCAGCGGCGAGATAACGCGCAGGTAAAGATGATCGAGCGTATCAACATCCGCCACCACGCGATTGAGCAATTCGCCCTGACGATAGCGCGCCAGTCCGGCAGGGGAGAGGGGCAGCAATTTGCTGAAGGTGTAAATGCGCAGATGCTGCAACACGCGGAAAGTCGCGTCGTGACTTACCAGACGTTCAAAATAGCGCCCGGCAGTACGGGTGATTGCTGCGCCACGCACGCCCGCAGCGGGTAGCATATAGTTGAAGCTGTACAGTCCGGCAACCCCCGCAACCGCTGAGGCCGAGAGGAACCAGCCGGAAAGTGTCAACAGACCGATACTGGCGAGCAGCGTCACAATTGCCAGCACAATACCAAGACTTAACATCCATTTATGACGTTTATACAGTGCCAGATAGGGTAGCAAAGCGCGCATTTAAATCTCCTCCTGACGATGGGCCAGTAATGTGGCGAATGGGCCACCAGCCACACTTAATTCCGCGTAACGTCCTTGCTCAATAATCCGGCCATCCTGCATAACCCAAATGACATCCCAGTCAGCAAGATCTTCTAACTGGTGGGTGACCATTAACGTTGTCTGGCGCAGAGAGGCGGCATTCAGCGCCTCCATTACGCGCTGTTCACTGTGAGCATCAAGGCTGGCAGCGGGTTCATCCAACAGTAATAGCGAACAGGGATTTAGTAACGCACGGGCCACCGCCACGCGCTGCGCCTGCCCCACGGAAAGGCGGGCAGCCTGGTCGCCAACAGGCGTATCAACGCCTTGTGGCAGGAGCGGTAGAAACTCGCTGACCCAGGCGTTATCCAGCGCTGCTTGTAATTCTTGTTCGCTGGCATCAGGTCGCGCCAGTAGTACGTTATCCCGTAATGTTGCTGCCGGTAATTGTGGGTTTTGCCCAACCCAGGAGAGATGTTTACGCCATGATTCTGGTGATAAATCGCGTAATTCTATCCCGTTGATTCGTAATGATCCCTGATATGAGAGAAAACCAGAAAGCGCGTTCAGCAGTGAGCTTTTACCTGAACCGCTGCGACCAACCAACACTGCACGTTGGCCTGCTGGCAAAGTAAAGTTCAGCGGTCCGGCCAGCGTTTTACCTTCCGGCGACGTGATAAACAGATCCTCGGCCTCAATGGTCAGCGGATCGGTCAATGCTAATTCCGCCTCACCGCGCTGCGGATGGGCGAGCGGGGTTTCCATAAACGTTTTCAGACTGTCAGCTGCACCAACAGCCTGGGCTTTAGCATGATAAAACGTACCGAGATCGCGTAATGGCTGGAAAAACTCTGGCGCAAGGATCAGGGCCAGAAAACCCGCAGCCAGCGTCACGCCAGTATCGTAGTGACCAAAATCCAGCTCGCCGAGATAGGAAAAACCAAAGTAGACCGCCACCAGAGCAATTGACAGCGAGGTAAAAAATTCGAGAATGCCGGAGGATAAAAACGCCAGTCGTAGCACTTCCATTGTCCGTTGGCGGAAATCTTCCGAAGCAGAACGAATACTTTCAATTTCAGCTTCACCACGACCAAAAATACGCAATGTTTCCATGCCGCGCAGGCGATCGAGGAAATGCCCACTTAAGCGAGCAAGAGCGAGAAAGTTACGTCGGTTAGCATCGGCAGCCCCCATTCCAACCAGCGCCATAAACAACGGAATTAGCGGTGCTGTGCCCAGCAGAATGAGCGCCGCAGCCCAGTTAGAGGGGAAGATCGCCACCACAATCAGCAACGGCACCGACACTGCCAGCGCCATTTGCGGCAGATAGCGTGCATAGTAATCATGCATATCGTCAATTTGCTCGAGTACCAGCGTCGCCCAGCTCCCCGCAGGTTTACCCTGAATCCACGCTGGCCCTGCTTGTTGCAGACGGTCGAGAACCTGACGGCGGATGGCAAAGCGGATATGCTGCCCGGCGTGATAACCCACCCGTTCGCGTAACCAGACCACCCATGCGCGCAGTACAAAGGTCAGAACCAGTAACGTAAAGGGAAGCAGCAGGGCTTCACGGGGAATATTCTCCATAATCATATGTTGCAGAATTCGCGCCATGAACCAGGCCTGGGCAATGATCAATATGCCGCTCACAAAGCCCAGCAGACGAGAAATATTCAGCCAACGTTGGGAGATGACGCTTTGCTGTTTTAACCAGCGGGTTAACTCTTTTTGACGAGATTTATTCATTGCACGCTTAGCAGGTGAGTTATCAGAATTATTTGCAGAGCAATGTTACAACGGGGAAAAAATAAAGGCGACCCATAGTCGCATGGTGTCGCCTTCTTTACTTTTGTTACTGATTTGTAAAATTATTTTGCGTCAGCTAAACCATCGAGGTAGCGTTCCGCATCAAGTGCTGCCATGCAGCCTGTACCGGCCGAAGTAATGGCCTGGCGATAAATGTGATCCATCACGTCGCCTGCGGCAAAGACGCCAGGAATGCTGGTCTGGGTGGCATTACCATGAATACCCGACTGTACTTTGATGTAGCCGTTTTCCAGTTCCAGCTGCCCTTCGAAAATCGCAGTATTCGGGCTGTGACCGATAGCAACAAACAGACCGGCAACGTCGAGTGACTCGATGTTATCGCTGTTTTGCGTATCGCGCAGACGAACGCCAGTGACACCCATTTGATCGCCGGTCACTTCTTCCAGCGTACGGTTGGTGTGCAGAATGATGTTGCCGTTCTCCACTTTATCCATCAGGCGCTTAATGAGGATTTTTTCCGCGCGGAAACCGTCACGGCGGTGAATCAGATGCACTTCCGAAGCGATGTTAGACAGATACAGCGCCTCTTCAACCGCGGTATTGCCGCCGCCGATGACCGCAACTTTCTGGTTGCGATAGAAGAAACCGTCGCAGGTTGCACAAGCAGAAACCCCACGGCCTTTAAAGGCTTCTTCAGAGGGCAGGCCGAGATAGCGTGCAGAAGCTCCGGTGGCAATAATCAGCGCGTCGCAAGTGTATTCGCCGTTATCGCCATTCAGACGGAACGGACGGTTTTGCAGATCCACCTTGTTGATATGATCAAAAATGATCTCAGTTTCAAACTTGGTGGCATGTTCGTGCATGCGCTCCATTAATAACGGACCGGTCAGATCGTTTGGATCGCCAGGCCAGTTTTCCACTTCCGTGGTGGTGGTCAGTTGGCCGCCTTTTTCCATGCCGGTAATCAGCACAGGTTGCAGGTTGGCGCGCGCCGCGTAGACAGCAGCGGTGTATCCCGCCGGGCCTGAACCCAGGATAAGCAGTTTACTGTGTTTGGTCGTGCCCATGAGATCCCCATAGTTGTTGGCAGACAATGGGCAGGATTGTAGGGAATTTACAGACGTAAAAAAAGAGTATGACGATTTTGTTAACAATTTGTGCAATCGGCAGCATCGATAAGCAGGTCAAATTCTCCCGTCATTATCACCTCTGCTACTTAAATTTCCCGCTTTATAAGCCGATTAAATGATGAATAAACGCCCCTGTTAATGAATATCTGGCATGTTGTACTAAAAATCGATGTTTTGCTTTGACAATCCCCTGGTGTTTTGCGAAAACATTCGAGGAAGAAAAAAAACAGTATTCTTATATGCGCATAACCATGCATGTAAATACCATGTTTACCGTGCTAGTGAAATCTACGTATGGCGTGGACAGACGCCATTCGTGATGTCGATAGCTGCCACAAGGCAACGGTCTTCTCACCGTAGACCCAGGCATTGCGCGCCGTGAATCTTCATGATTTCGGTCTATCGTGACGGGTAGCGACTCTGAACAGTGATGTTTCAGGGTCAGACAGGAGTAGGGAAGGAATACAGAGAGACAATAATAATGGTAGATAGCAAGAAGCGCCCTGGCAAAGATCTCGACCGTATCGATCGTAACATTCTTAATGAGTTGCAAAAGGATGGGCGTATTTCTAACGTCGAGCTTTCTAAACGTGTGGGACTTTCCCCAACGCCGTGCCTTGAGCGTGTGCGTCGGCTGGAAAGACAAGGGTTTATTCAGGGCTATACGGCGCTGCTTAACCCCCATTATCTGGATGCATCACTTCTGGTATTCGTTGAGATTACTCTGAATCGTGGCGCACCGGATGTGTTTGAACAATTCAATACCGCTGTACAAAAACTTGAAGAAATTCAGGAGTGTCATTTAGTATCCGGTGATTTCGACTACCTGTTGAAAACACGCGTGCCGGATATGTCAGCCTACCGTAAGTTGCTGGGGGAAACCCTGCTGCGTCTGCCTGGCGTCAATGACACACGGACATACGTTGTTATGGAAGAAGTCAAGCAGAGTAATCGTCTGGTTATTAAGACGCGCTAACACGGAACAGGTGCAAAATCGGCGTATTTTGATTACACTCCTGTTAATCCATACAGCAACAGTACTGGGGTAACCTGGTACTGTTGTCCGTTTTAGCATCGGGCAGGAAAAGCCTGTAACCTGGAGAGCCTTTCTTGAGCCAGGAATACACTGAAGACAAAGAAGTCACATTGACAAAGTTAAGTAGCGGCCGTCGCCTTCTGGAAGCGTTGCTGATCCTTATTGTCCTGTTTGCCGTCTGGTTGATGGCTGCCTTACTAAGCTTTAACCCTTCGGACCCCAGCTGGTCGCAAACGGCCTGGCATGAACCTATCCATAATTTAGGTGGGATGCCCGGTGCGTGGCTGGCAGATACGCTGTTCTTTATTTTTGGCGTGATGGCTTACACCATTCCCGTCATTATTGTCGGCGGTTGTTGGTTTGCCTGGCGTCATCAGTCCAGTGACGAATACATTGATTATTTTGCCGTTTCGCTACGCATCATTGGCGTTTTGGCGCTCATCCTTACCTCCTGTGGTCTGGCGGCAATCAACGCTGACGATATCTGGTATTTTGCCTCCGGTGGCGTCATTGGCAGCTTACTAAGCACTACGCTACAACCACTGCTACACAGTAGCGGGGGAACTATTGCGCTGCTCTGCGTTTGGGCTGCGGGCCTGACGCTGTTCACCGGTTGGTCATGGGTGACCATTGCTGAAAAACTCGGCGGCTGGATTTTAAACATTCTCACCTTCGCTAGTAATCGTACCCGTCGCGATGATACCTGGGTCGATGAAGATGAATATGAAGATGACGAAGAGTATGAAGATGAAAACCACGGCAAACAGCATGAATCACGCCGTGCCCGTATTCTTCGCGGCGCGCTAGCGCGTCGTAAACGGTTGGCGGAAAAATTCATTAATCCGATGGGGCGGCAAACAGACGCTGCGTTGTTCTCCGGTAAGCGGATGGATGATGACGAAGAGATTACCTACACTGCACGCGGTGTGGCTGCCGACCCGGACGACGTCCTATTTTCGGGCAATCGTGCAACGCAGCCAGAATATGACGAATACGATCCATTATTAAACGGTGCGCCAATTACCGAACCTGTCGCTGTGGCAGCTGCTGCTACCACGGCGACACAAAGCTGGGCTGCGCCGGTTGAACCTGTGACTCAGACGCCGCCTGTTGCCTCTGTTGATGTTCCACCTTCGCAACCTACAGTAGCCTGGCAGCCTGTACCGGGTCCACAAACGGGAGAGCCGGTTATTGCTCCTGCACCGGAAGGTTACCCACAGCAGTCACAATATGCGCAGCCTGCAGTGCAATATAATGAGCCGCTGCAACAACCAGTACAGCCGCAGCAGCCGTATTATGCACCTGCAGCTGAACAACCTGCGCAACAACCTTACTACGCTCCTGCGGCGGAACAACCTGTTCAGCAACCGTATTACGCCCCTGCGCCAGAACAACCGGTGGCAGGTAACGCCTGGCAAGCCGAAGAGCAGCAATCCACTTTTGCTCCACAGTCTACATACCAGACTGAGCAAACTTATCAGCAACCAGCCGCTCAGGAGCCGTTGTACCAACAGCCGCAATCCGTTGAACAGCAGCCTGTTGTGGAGCCTGAACCCGTTGTAGAAGAGACAAAACCCGCGCGTCCGCCGCTTTACTATTTTGAAGAAGTGGAAGAGAAGCGAGCCCGTGAACGTGAACAACTTGCGGCCTGGTATCAACCGATTCCAGAACCGGTTAAAGAACCAGAACCGATCAAATCTTCGCTGAAAGCACCTTCTGTTGCAGCAGTACCTCCAGTAGAAGCCGCTGCCGCTGTTTCCCCGCTGGCATCTGGCGTGAAAAAAGCGACACTGGCGACGGGGGCTGCCGCAACCGTTGCCGCGCCAGTCTTCAGTCTGGCAAATAGCGGTGGACCGCGTCCTCAGGTCAAAGAGGGGATTGGTCCGCAGTTGCCACGACCGAAACGTATCCGCGTGCCAACTCGTCGTGAACTGGCGTCTTACGGTATTAAGCTGCCCTCACAGCGTGCGGCGGAAGAAAAAGCCCGTGAAGCCCAGCGCAATCAGTACGATTCTGGCGATCAGTACAACGATGATGAAATCGATGCGATGCAGCAGGATGAACTGGCACGTCAGTTCGCCCAGACACAGCAGCAACGCTATGGCGAACAGTATCAACATGATGTGCCCGTAAACGCAGAAGATGCAGATGCTGCGGCAGAGGCTGAACTGGCTCGTCAGTTTGCGCAAACTCAACAACAACGTTATTCCGGCGAACAACCGGCTGGGGCGAATCCGTTCTCGCTGGATGATTTTGAATTTTCGCCAATGAAAGCGTTGCTGGATGATGGTCCACACGAACCGTTGTTTACGCCAATTGTTGAACCTGTACAGCAGCCGCAACAACCGGTTGCACCGCAGCAGCAATATCAGCAGCCGCAACAACCAGTTCCGCCGCAGCCGCAGTATCAGCAGCCACAACAGCCGGTTGCGCCGCAGCCACAATATCAGCAGCCGCAACAACCGGTTGCGCCGCAGCAGCAATATCAGCAGCCGCAACAACCGGTTGCGCCGCAGCAGCAGTATCAGCAGCCACAACAGCCAGTTGCGCCACAACCGCAGGATACCCTGCTTCATCCGCTGTTGATGCGTAATGGCGACAGCCGTCCGTTGCATAAACCGACGACGCCGCTGCCTTCTCTGGATTTGCTGACACCGCCGCCGAGCGAAGTGGAGCCGGTAGATACCTTTGCGCTTGAACAAATGGCTCGTCTGGTGGAAGCGCGTCTGGCTGATTTCCGTATTAAAGCCGATGTCGTCAATTACTCTCCGGGGCCGGTTATCACTCGCTTTGAATTGAACCTGGCACCGGGCGTAAAAGCGGCGCGCATTTCTAACTTGTCACGGGACCTTGCCCGTTCACTTTCGACGGTGGCGGTGCGTGTCGTTGAAGTTATTCCTGGCAAACCCTATGTAGGTCTGGAGTTACCGAATAAAAAACGACAAACCGTTTATCTGCGCGAAGTTTTGGATAACGCCAAATTCCGCGATAATCCGTCGCCATTAACCGTGGTGCTGGGTAAAGATATCGCCGGTGAGCCGGTGGTTGCCGATCTGGCGAAAATGCCGCACTTGTTGGTTGCGGGGACTACCGGTTCCGGTAAATCTGTCGGTGTGAACGCGATGATCCTGAGCATGCTTTATAAAGCACAGCCAGAAGATGTGCGTTTCATCATGATCGACCCGAAAATGCTGGAGCTTTCGGTTTATGAAGGCATTCCGCATCTGTTAACGGAAGTCGTTACTGATATGAAAGATGCCGCCAACGCGCTGCGCTGGTGTGTTAACGAGATGGAGCGTCGGTATAAACTGATGTCTGCGCTGGGTGTGCGTAATCTGGCGGGTTATAACGAAAAAATTGCTGAAGCCGATCGCATGATGCGTCCGATTCCAGACCCGTACTGGAAGCCGGGTGACAGTATGGATGCCCAGCATCCGGTGCTGAAAAAAGAACCATACATTGTGGTGTTGGTTGACGAATTTGCCGACCTGATGATGACGGTAGGTAAAAAAGTGGAAGAGCTGATAGCACGTCTGGCGCAAAAAGCCCGTGCCGCGGGTATCCACCTCGTACTGGCAACTCAGCGTCCATCGGTTGATGTTATTACTGGTCTGATTAAAGCGAATATTCCGACCCGTATCGCCTTTACCGTATCCAGTAAGATTGACTCACGTACCATTCTTGATCAGGCTGGCGCGGAATCACTGCTGGGTATGGGGGATATGCTCTACTCTGGGCCGAACTCCACGTTGCCGGTACGTGTCCATGGTGCTTTTGTTCGCGATCAGGAAGTTCATGCCGTGGTGCAGGACTGGAAAGCGCGTGGTCGCCCACAGTATGTTGATGGCATCACCTCCGACAGCGAAAGCGAAGGTGGTGCGGGTGGTTTCGATGGCGCTGAAGAACTGGATCCGTTGTTCGATCAGGCGGTGCAGTTTGTCACTGAAAAACGCAAAGCGTCAATTTCTGGCGTACAGCGTCAGTTCCGCATTGGTTATAACCGTGCAGCGCGTATTATCGAACAGATGGAAGCGCAGGGGATTGTCAGCGAACAGGGGCACAACGGTAATCGTGAAGTGCTGGCCCCACCGCCGTTTGACTAATTAATGCATTGCCGGATAAGGCGCGGTAGCGTCGCATCCGGCACTCTATCAACTGAAAATTCAGTATTTTCTTCTTTCCTCAAGCTGATTATTAGCCTGGAATAGAGAGTAGAGGGAACTCCCGATCGGGAGTGACGTAATTTGAGGAATAATGATGAAAAAAATTGCCATCACCTGTGCATTACTCTCAAGCTTAGTAGCAAGCAGCGTTTGGGCTGATGCCGCAAGCGATCTGAAAAGCCGCCTGGATAAAGTCAGCAGCTTCCACGCCAGCTTCACACAAAAAGTGACTGACGGTAGCGGCGCGGCGGTGCAGGAAGGTCAGGGCGATCTGTGGGTGAAACGTCCAAACTTATTCAACTGGCATATGACACAACCTGATGAAAGCATTCTGGTTTCTGACGGTAAAACACTGTGGTTCTATAACCCGTTCGTTGAGCAAGCTACGGCAACCTGGCTGAAAGATGCCACCGGTAATACGCCGTTTATGCTGATTGCCCGCAACCAGTCCAGCGACTGGCAGCAGTACAATATCAAACAGAATGGCGATGACTTTGTCCTGACGCCGAAAGCCAGCAATGGCAATCTGAAGCAGTTCACCATTAACGTGGGACGTGATGGCACAATCCATCAGTTTAGCGCGGTGGAGCAGGACGATCAGCGCAGCAGTTATCAACTGAAATCCCAGCAAAATGGGGCTGTGGATGCAGCGAAATTTACCTTCACCCCGCCGCAAGGCGTCACGGTAGATGATCAACGTAAGTAGAGGCACCTGAGTGAGCAATCTGTCGCTCGATTTTTCGGATAATACTTTTCAACCTCTGGCCGCGCGTATGCGGCCAGAAAATTTAGCACAGTATATCGGCCAGCAACATTTGCTGGCTGCGGGGAAGCCGTTGCCGCGCGCTATCGAAGCCGGGCATTTACATTCTATGATCCTCTGGGGGCCGCCGGGTACCGGCAAAACAACTCTCGCTGAAGTGATTGCCCGCTATGCGAACGCTGATGTGGAACGTATTTCTGCCGTCACCTCTGGCGTGAAAGAGATTCGCGAGGCGATCGAGCGCGCCCGGCAAAACCGCAATGCAGGTCGCCGCACTATTCTTTTTGTTGACGAAGTTCACCGTTTCAACAAAAGCCAGCAGGATGCATTTCTGCCACATATTGAAGACGGCACCATCACTTTTATTGGCGCAACCACTGAAAACCCGTCGTTTGAGCTTAATTCGGCACTGCTTTCCCGTGCCCGTGTCTATCTGTTGAAATCCCTGAGTACAGAGGATATTGAGCAAGTACTAACTCAGGCGATGGAAGACAAAACCCGTGGCTATGGTGGTCAGGATATTGTTCTGCCAGATGAAACACGACGCGCCATTGCTGAACTGGTGAATGGCGACGCGCGCCGGGCGTTAAATACGCTGGAAATGATGGCGGATATGGCCGAAGTCGATGATAGCGGTAAGCGGGTCCTGAAGCCTGAATTACTGACCGAAATCGCCGGTGAACGTAGCGCCCGCTTTGATAACAAAGGCGATCGCTTTTACGATCTGATTTCCGCACTGCATAAGTCGGTACGTGGTAGCGCACCCGATGCGGCGCTGTACTGGTATGCGCGAATTATTACCGCTGGTGGCGATCCGTTATATGTCGCGCGTCGCTGTCTGGCGATTGCGTCTGAAGACGTCGGTAATGCCGATCCACGGGCGATGCAGGTGGCAATTGCGGCCTGGGATTGCTTTACTCGCGTTGGCCCGGCGGAAGGTGAACGCGCCATTGCTCAGGCGATTGTTTACCTGGCCTGCGCGCCAAAAAGCAACGCTGTCTACACTGCGTTTAAAGCCGCGCTGGCCGATGCTCGCGAACGCCCGGATTATGACGTGCCGGTTCATTTGCGTAATGCGCCGACGAAATTAATGAAGGAAATGGGCTACGGGCAGGAATATCGTTACGCTCATGATGAAGCAAACGCTTATGCTGCCGGTGAGGTTTACTTCCCGCCGGAAATAGCACAAACACGCTATTATTTCCCGACAAACAGGGGCCTTGAAGGCAAGATTGGCGAAAAGCTCGCCTGGCTGGCTGAACAGGATCAAAATAGCCCCATAAAACGCTACCGTTAATGTTATCGTTGCGGTAATGTTGTTACTGTATCCCTGTGGTCGCAGGCTGTGGCCACATCTCCCATTTAATTCGATAAGCACAGGATAAGCATGCTCGATCCCAATCTGCTGCGTAATGAGCCAGACGCAGTCGCTGAAAAACTGGCACGCCGGGGCTTTAAGCTGGATGTAGATAAGCTGGGCGCTCTTGAAGAGCGTCGTAAAGTATTGCAGGTCAAAACGGAAAACCTGCAAGCGGAGCGTAACTCCCGATCGAAATCCATTGGCCAGGCGAAAGCGCGCGGGGAAGATATCGAGCCTTTACGTCTGGAAGTGAACAAACTGGGCGAAGAGCTGGATGCAGCAAAAGCCGAGCTGGATGCTTTACAGGCTGAAATTCGCGATATCGCGCTGACCATCCCTAACCTGCCTGCAGATGAAGTGCCGGTAGGTAAAGACGAAAATGACAACGTTGAAGTCAGCCGCTGGGGTACCCCGCGTGAGTTTGACTTTGAAGTTCGTGACCACGTGACGCTGGGTGAAATGCACTCTGGCCTCGACTTTGCAGCTGCAGTTAAGCTGACTGGTTCCCGCTTTGTGGTAATGAAAGGGCAGATTGCTCGCATGCACCGCGCACTGTCGCAGTTTATGCTGGATCTGCATACCGAACAGCATGGCTACAGTGAGAACTATGTTCCGTACCTGGTTAACCAGGACACGCTGTACGGTACGGGTCAACTGCCGAAATTTGCTGGCGATCTGTTCCATACTCGTCCGCTGGAAGAAGAAGCAGACACCAGTAACTATGCGCTGATCCCAACGGCAGAAGTTCCGCTGACTAACCTGGTGCGCGGTGAAATCATCGATGAAGATGATCTGCCAATTAAGATGACCGCCCACACCCCATGCTTCCGTTCTGAAGCCGGTTCATATGGTCGTGACACCCGTGGTCTGATCCGTATGCACCAGTTCGACAAAGTTGAAATGGTGCAGATCGTGCGCCCAGAAGACTCAATGGCGGCGCTGGAAGAGATGACTGGTCATGCAGAAAAAGTCCTGCAGTTGCTGGGCCTGCCGTACCGTAAAATCATCCTTTGCACTGGCGACATGGGCTTTGGCGCTTGCAAAACTTACGACCTGGAAGTATGGATCCCGGCACAGAACACCTACCGTGAGATCTCTTCCTGCTCCAACGTTTGGGATTTCCAGGCACGTCGTATGCAGGCACGTTGCCGCAGCAAGTCGGACAAGAAAACCCGTCTGGTTCATACCCTGAACGGTTCTGGTCTGGCTGTTGGTCGTACGCTGGTTGCAGTAATGGAAAACTATCAGCAGGCTGATGGTCGTATTGAAGTACCAGAAGTTCTGCGTCCGTATATGAACGGACTGGAATATATTGGCTAA
Protein sequences of DBSCAN-SWA_3 >NZ_CP020368|854681:927994|857040_858486_+|WP_000460887.1|DBSCAN-SWA MFDYQVSKHPHFDEACRAFALRHNLVQLAERAGMNVQILRNKLNPAQPHLLTAPEIWLLTDLTEDSTLVDGFLAQIHCLPCVPINEVAKEKLPHYVMSATAEIGRVAAGAVAEPHHDGTVHWHLMCFMRKKDRRAITALLRKFAIREDREELGNNTGPRFKSELINPRKGTPTSYIAKYISKNIDGRGLAGEISKETGKSLRDNAEYVNAWASLHRVQQFRFFGIPGRQAYRELRLLAGQAARQQGDKKAGAPVLDNPRLDAILAAADAGCFATYIMKQGGVLVPRKYHLIRTAYEINEEPTAYGDHGIRIYGIWSPIAEGKICTHAVKWKMVRKAVDVQEAAADQGACAPWTRGNNCPLAENLNQQGKDKSADGDSRTDITRMNDKELHNYLHSMSKKERRELAARLRQVKPKRRKDYKQRITDHQRQQLVYELKSRGFDGSEKEVDLLLRGGSIPSGAGLRIFYRNQRLKEDDKWRNLY >NZ_CP020368|854681:927994|916434_918201_-|WP_001043577.1|DBSCAN-SWA MNKSRQKELTRWLKQQSVISQRWLNISRLLGFVSGILIIAQAWFMARILQHMIMENIPREALLLPFTLLVLTFVLRAWVVWLRERVGYHAGQHIRFAIRRQVLDRLQQAGPAWIQGKPAGSWATLVLEQIDDMHDYYARYLPQMALAVSVPLLIVVAIFPSNWAAALILLGTAPLIPLFMALVGMGAADANRRNFLALARLSGHFLDRLRGMETLRIFGRGEAEIESIRSASEDFRQRTMEVLRLAFLSSGILEFFTSLSIALVAVYFGFSYLGELDFGHYDTGVTLAAGFLALILAPEFFQPLRDLGTFYHAKAQAVGAADSLKTFMETPLAHPQRGEAELALTDPLTIEAEDLFITSPEGKTLAGPLNFTLPAGQRAVLVGRSGSGKSSLLNALSGFLSYQGSLRINGIELRDLSPESWRKHLSWVGQNPQLPAATLRDNVLLARPDASEQELQAALDNAWVSEFLPLLPQGVDTPVGDQAARLSVGQAQRVAVARALLNPCSLLLLDEPAASLDAHSEQRVMEALNAASLRQTTLMVTHQLEDLADWDVIWVMQDGRIIEQGRYAELSVAGGPFATLLAHRQEEI >NZ_CP020368|854681:927994|877382_877640_-|WP_001195240.1|DBSCAN-SWA MQTVIFGRSGCPYCVRAKDLAEKLSNERDDFQYQYVDIRAEGITKEDLQQKAGKPVETVPQIFVDQQHIGGYTDFAAWVKENLDA >NZ_CP020368|854681:927994|925267_926611_+|WP_000067755.1|DBSCAN-SWA MSNLSLDFSDNTFQPLAARMRPENLAQYIGQQHLLAAGKPLPRAIEAGHLHSMILWGPPGTGKTTLAEVIARYANADVERISAVTSGVKEIREAIERARQNRNAGRRTILFVDEVHRFNKSQQDAFLPHIEDGTITFIGATTENPSFELNSALLSRARVYLLKSLSTEDIEQVLTQAMEDKTRGYGGQDIVLPDETRRAIAELVNGDARRALNTLEMMADMAEVDDSGKRVLKPELLTEIAGERSARFDNKGDRFYDLISALHKSVRGSAPDAALYWYARIITAGGDPLYVARRCLAIASEDVGNADPRAMQVAIAAWDCFTRVGPAEGERAIAQAIVYLACAPKSNAVYTAFKAALADARERPDYDVPVHLRNAPTKLMKEMGYGQEYRYAHDEANAYAAGEVYFPPEIAQTRYYFPTNRGLEGKIGEKLAWLAEQDQNSPIKRYR >NZ_CP020368|854681:927994|865071_865824_+|WP_000216259.1|capsid|DBSCAN-SWA MTVKAKRFRIGVEGATTDGREIQREWLEQMAASYNPAVYTALINLEHIKSYLPDSTFNRYGKVTALFAEEITEGPLAGKMALYADVEPTESLVELVKKGQKLFTSMEVSPKFADTGKAYLVGLAATDDPASLGTEMLTFSASAAHNPLANRKQNPANLFTAAEETVIELEEVQDDKPSLFSRVTALFTKKEQSDDARFSDVHKAVELVATEQQNLSARTEKSLSEQEERLSELESALQKQQRAMPVVPHW >NZ_CP020368|854681:927994|895179_896181_-|WP_000566356.1|DBSCAN-SWA MIDLRSDTVTRPSRAMLEAMMAAPVGDDVYGDDPTVNALQDYAAELSGKEAAIFLPTGTQANLVALLSHCERGEEYIVGQAAHNYLFEAGGAAVLGSIQPQPIDAAADGTLPLDKVAMKIKPDDIHFARTKLLSLENTHNGKMLPREYLKEAWEFTRERNLALHVDGARIFNAVVAYGCELKEITQYCDSFTICLSKGLGTPVGSLLVGNRDYIKRAIRWRKMAGGGMRQSGILAAAGMYALKNNVARLQEDHDNAAWMAEQLREAGADVMRQDTNMLFVRVGEENAAALGEYMKARNVLINASPIVRLVTHLDVSREQLAEVAAHWRAFLAR >NZ_CP020368|854681:927994|896217_897936_-|WP_000815335.1|DBSCAN-SWA MKQTVAAYIAKTLESAGVKRIWGVTGDSLNGLSDSLNRMGTIEWMSTRHEEVAAFAAGAEAQLSGELAVCAGSCGPGNLHLINGLFDCHRNHVPVLAIAAHIPSSEIGSGYFQETHPQELFRECSHYCELVSSPEQIPQVLAIAMRKAVLNRGVSIVVLPGDVALKPAPEGATMHWYHAPQPVVTPEEEELRKLAQLLRYSSNIALMCGSGCAGAHKELVEFAGKIKAPIVHALRGKEHVEYDNPYDVGMTGLIGFSSGFHTMMNADTLVLLGTQFPYRAFYPTDAKIIQIDINPASIGAHSKVDMALVGDIKSTLRALLPLVEEKADRKFLDKALEDYRDARKGLDDLAKPSEKAIHPQYLAQQISHFAADDAIFTCDVGTPTVWAARYLKMNGKRRLLGSFNHGSMANAMPQALGAQATEPERQVVAMCGDGGFSMLMGDFLSVVQMKLPVKIVVFNNSVLGFVAMEMKAGGYLTDGTELHDTNFARIAEACGITGIRVEKASEVDEALQRAFSIDGPVLVDVVVAKEELAIPPQIKLEQAKGFSLYMLRAIISGRGDEVIELAKTNWLR >NZ_CP020368|854681:927994|892626_893640_-|WP_001338420.1|DBSCAN-SWA MKVLVTGATSGLGRNAVEFLCQKGISVRATGRNEAMGKLLEKMGAEFVPADLTELVSSQAKVMLAGIDTLWHCSSFTSPWGTQQAFDLANVRATRRLGEWAVAWGVRNFIHISSPSLYFDYHHHRDIKEDFRPHRFANEFARSKAASEEVINMLSQANPQTRFTILRPQSLFGPHDKVFIPRLAHMMHHYGSILLPHGGSALVDMTYYENAVHAMWLASQEACDKLPSGRVYNITNGEHRTLRSIVQKLIDELNIDCRIRSVPYPMLDMIARSMERLGRKSAKEPPLTHYGVSKLNFDFTLDITRAQEELGYQPVITLDEGIEKTAAWLRDHGKLPR >NZ_CP020368|854681:927994|900844_901744_-|WP_000491142.1|DBSCAN-SWA MFSGLLIILVPLIVGYLIPLRQQAALKVINQLLSWMVYLILFFMGISLAFLDNLASNLLAILHYSAVSITVILLCNIAALMWLERGLPWRNHHQQEKLPSRIAMALESLKLCGVVVIGFAIGLSGLAFLQHATEASEYTLILLLFLVGIQLRNNGMTLKQIVLNRRGMIVAVVVVVSSLIGGLINAFILDLPINTALAMASGFGWYSLSGILLTESFGPVIGSAAFFNDLARELIAIMLIPGLIRRSRSTALGLCGATSMDFTLPVLQRTGGLDMVPAAIVHGFILSLLVPILIAFFSA >NZ_CP020368|854681:927994|910149_912426_+|WP_000934045.1|protease|DBSCAN-SWA MLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACSVDLVALRQELEAFIEQTTPVLPASEEERDTQPTLSFQRVLQRAVFHVQSSGRNEVTGANVLVAIFSEQESQAAYLLRKHEVSRLDVVNFISHGTRKDEPTQSSDPGSQPNSEEQAGGEERMENFTTNLNQLARVGGIDPLIGREKELERAIQVLCRRRKNNPLLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKALLKQLEQDTNSILFIDEIHTIIGAGAASGGQVDAANLIKPLLSSGKIRVIGSTTYQEFSNIFEKDRALARRFQKIDITEPSIEETVQIINGLKPKYEAHHDVRYTAKAVRAAVELAVKYINDRHLPDKAIDVIDEAGARARLMPVSKRKKTVNVADIESVVARIARIPEKSVSQSDRDTLKNLGDRLKMLVFGQDKAIEALTEAIKMARAGLGHEHKPVGSFLFAGPTGVGKTEVTVQLSKALGIELLRFDMSEYMERHTVSRLIGAPPGYVGFDQGGLLTDAVIKHPHAVLLLDEIEKAHPDVFNILLQVMDNGTLTDNNGRKADFRNVVLVMTTNAGVRETERKSIGLIHQDNSTDAMEEIKKIFTPEFRNRLDNIIWFDHLSTDVIHQVVDKFIVELQVQLDQKGVSLEVSQEARNWLAEKGYDRAMGARPMARVIQDNLKKTLANELLFGSLVDGGQVTVALDKEKNELTYGFQSAQKHKAEAAH >NZ_CP020368|854681:927994|878070_878793_+|WP_000189159.1|DBSCAN-SWA MTPTIELICGHRSIRHFTDEPISEAQREAIINSARATSSSSFLQCSSIIRITDKALREELVTLTGGQKHVAQAAEFWVFCADFNRHLQICPDAQLGLAEQLLLGVVDTAMMAQNALIAAESLGLGGVYIGGLRNNIEAVTKLLKLPQHVLPLFGLCLGWPADNPDLKPRLPASILVHENSYQPLDKGALAQYDEQLAEYYLTRGSNNRRDTWSDHIRRTIIKESRPFILDYLHKQGWATR >NZ_CP020368|854681:927994|909798_910119_+|WP_000520781.1|protease|DBSCAN-SWA MGKTNDWLDFDQLAEEKVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKFFSYDVERATQLMLAVHYQGKAICGVFTAEVAETKVAMVNKYARENEHPLLCTLEKA >NZ_CP020368|854681:927994|861157_862120_+|WP_001320043.1|DBSCAN-SWA MKSAEYLNTFRLRNLGLPVMNNLHDMSKATRISVETLRLLIYTADFRYRIYTVEKKGPEKRMRTIYQPSRELKALQGWVLRNILDKLSSSPFSIGFEKHQSILNNATPHIGANFILNIDLEDFFPSLTANKVFGVFHSLGYNRLISSVLTKICCYKNLLPQGAPSSPKLANLICSKLDYRIQGYAGSRGLIYTRYADDLTLSAQSMKKVVKARDFLFSIIPSEGLVINSKKTCISGPRSQRKVTGLVISQEKVGIGREKYKEIRAKIHHIFCGKSSEIEHVRGWLSFILSVDSKSHRRLITYISKLEKKYGKNPLNKAKT >NZ_CP020368|854681:927994|870028_870634_+|WP_001086815.1|tail|DBSCAN-SWA MNSLLPPGSTPLERRLAQTCSGISDLQVPLRDLWNPATCPVSFLPYLAWAFSVDRWDEDWTESVKRQVVKDAFYIHQHKGTTSAVRRVVEPFGFLIRIIEWWQTGETPGTFRLDIGVQDQGITEDTYLELERLISDAKPCSRHMIGMSINLQTSGPHWVGAASYLGEEITIYPYINETIISGGTAHEGGAVHVIDTMRVNP >NZ_CP020368|854681:927994|855884_856076_-|WP_001321204.1|DBSCAN-SWA MLGKVFFVVLSCSLLLNPLTTYAKNYPCSGKKGGVSHCTSDGKFVCNDGTISKSKKICTKNSR >NZ_CP020368|854681:927994|914712_916434_-|WP_001202189.1|DBSCAN-SWA MRALLPYLALYKRHKWMLSLGIVLAIVTLLASIGLLTLSGWFLSASAVAGVAGLYSFNYMLPAAGVRGAAITRTAGRYFERLVSHDATFRVLQHLRIYTFSKLLPLSPAGLARYRQGELLNRVVADVDTLDHLYLRVISPLVGAFVVIMVVTIGLSFLDFTLAFTLGGIMLLTLFLMPPLFYRAGKSTGQNLTHLRGQYRQQLTAWLQGQAELTIFGASDRYRTQLENTEIQWLEAQRRQSELTALSQAIMLLIGALAVILMLWMASGGVGGNAQPGALIALFVFCALAAFEALAPVTGAFQHLGQVIASAVRITDLTDQKPEVTFPDTQTRVADRVSLTLRDVQFTYPEQSQQALKGISLQVNAGEHIAILGRTGCGKSTLLQLLTRAWDPQQGEILLNDSPIASLNEAALRQTISVVPQRVHLFSATLRDNLLLASPGSSDEALSEILRRVGLEKLLEDAGLNSWLGEGGRQLSGGELRRLAIARALLHDAPLVLLDEPTEGLDATTESQILELLAEMMREKTVLMVTHRLRGLSRFQQIIVMDNGQIIEQGTHAELLARQGRYYQFKQGL >NZ_CP020368|854681:927994|858837_859071_+|WP_001217575.1|DBSCAN-SWA MRIEIMIDKEQKISQSTLDALESELYRNLRPLYPKTVIRIRKGSSNGVELTGLQLDEERKQVMKIMQKVWEDDSWLH >NZ_CP020368|854681:927994|874566_874785_+|WP_000972391.1|DBSCAN-SWA MMICPLCGSAAHTRSSFQVSSLTKERYNQCQNINCSHTFVTHETFVRSIATPKESNPVQPHPMKSGQVALSL >NZ_CP020368|854681:927994|890838_891354_-|WP_001270734.1|DBSCAN-SWA MRYSKLTMLIPCALLLSACTTVTPAYKDNGTRSGPCVEGGPDNVAQQFYDYRILHRSNDITALRPYLSDKLATLLSDASRDNNHRELLTNDPFSSRTTLPDSAHVASASTIPNRDARNIPLRVDLKQGDQGWQDEVLMIQEGQCWVIDDVRYLGGSVHATAGTLRQSIENR >NZ_CP020368|854681:927994|856091_856661_-|WP_001047321.1|DBSCAN-SWA MNLEKGGRGAIERMVEAYGFKTRQALCDHLGISKSTLATRYMRDSFPAEWVIQCALETGTSLNWLTTGHGSKQTSGNTNTMEVAKYVLSDGALCEDGFYIFDREFLPSAFKNLFVITDNNSEFICDKEFDDIRDGKWVISIDGEITIRDITRLPGGRIFVEGGNRAFECKIEDVEIIGKIISLTIKYVR >NZ_CP020368|854681:927994|856786_857008_+|WP_001247707.1|DBSCAN-SWA MRPNISITLTTPHVTIERYSELTGLSIDTINDMLADGRLIRHRLRKDKKREKVMINIAAMTVDALSECNLNLN >NZ_CP020368|854681:927994|883970_884816_+|WP_001061667.1|DBSCAN-SWA MNNLPVVRSPWRIVILLLGFTFLYAPMLMLVIYSFNSSKLVTVWAGWSTRWYGELLRDDAMMSAVGLSLTIAACAATAAAILGTIAAVVLVRFGRFRGSNGFAFMITAPLVMPDVITGLSLLLLFVALAHAIGWPADRGMLTIWLAHVTFCTAYVTVVISSRLRELDSSIEEAAMDLGATPLKVFFVITLPMIMPAIISGWLLAFTLSLDDLVIASFVSGPGATTLPMLVFSSVRMGVNPEINALATLILGAVGIVGFIAWYLMARAEKQRIRDIQRARRG >NZ_CP020368|854681:927994|866908_867355_+|WP_000829156.1|DBSCAN-SWA MAELQKVDDWLSALLANLEPVARSRMMRQLAQELRRTQQQNIRMQRNPDGSSYEPRKVTARSKKGRIKRQMFAKLRTTKYLKTAASADSASVQFESKVQRIARVHHYGLRDRVSRKGPEVRYAERRLLGLNGESYVLTRDILNRFLLS >NZ_CP020368|854681:927994|878853_879756_+|WP_000684321.1|DBSCAN-SWA MKIAILSRDGTLYSCKRLREAAIQRGHLVEILDPLSCYMNINPAASSIHYKGRKLPHFDAVIPRIGTAITFYGTAALRQFEMLGSYPLNESVAIARARDKLRSMQLLARQGIDLPVTGIAHSPDDTSDLIDMVGGAPLVVKLVEGTQGIGVVLAETRQAAESVIDAFRGLNAHILVQEYIKEAQGCDIRCLVVGDEVVAAIERRAKEGDFRSNLHRGGAASVASITPQEREIAIKAARTMALDVAGVDILRANRGPLVMEVNASPGLEGIEKTTGIDIAGKMIRWIERHATTEYCLKTGG >NZ_CP020368|854681:927994|913463_913682_-|WP_001040187.1|DBSCAN-SWA MAKEDNIEMQGTVLETLPNTMFRVELENGHVVTAHISGKMRKNYIRILTGDKVTVELTPYDLSKGRIVFRSR >NZ_CP020368|854681:927994|873265_873805_-|WP_001318481.1|tail|DBSCAN-SWA MPVSRNRCIFLNVGLGEDSALPVGVPVPWPSATPPTGWLKCNGAAFSAEEYPELAKAYPTNKLPDLRGEFIRGWDDGRGIDAGRALLSLQNGGVESHTHQGQLFRVSDYRTKEIPASEVMGRGYIASLTPGADSPLDFDDYSVSSNPNGYFVGNQRTTAYGVNETRPRNIAFNYIVRAA >NZ_CP020368|854681:927994|866484_866916_+|WP_001039932.1|tail|DBSCAN-SWA MNKPQSLRHALNKAVPYVRNNPDKLHLFVDNGSLVATGASSMSWEYRYTLNAVIEDFSGDQNLLMAPVLLWLRDNQPDAINNPALREKLFTFEVDILRNDVCDISLNLHLTERVLVSTDGSVSSVEAVAEPDEPEEMWTVKRG >NZ_CP020368|854681:927994|899048_900701_-|WP_000458809.1|DBSCAN-SWA MFCVQCEQTIRTPAGNGCSYAQGMCGKTAETSDLQDLLIAALQGLSAWAVKAREYGIINHDVDSFAPRAFFSTLTNVNFDSPRIVGYAREAIALREALKAQCLAVDANARVDNPMADLQLVSDDLGELQRQAAEFTPNKDKAAIGENILGLRLLCLYGLKGAAAYMEHAHVLGQYDNDIYAQYHKIMAWLGTWPADMNALLECSMEIGQMNFKVMSILDAGETGKYGHPTPTQVNVKATAGKCILISGHDLKDLYNLLEQTEGTGVNVYTHGEMLPAHGYPELRKFKHLVGNYGSGWQNQQVEFARFPGPIVMTSNCIIDPTVGAYDDRIWTRSIVGWPGVRHLDGDDFSAVITQAQQMAGFPYSEIPHLITVGFGRQTLLGAADTLIDLVSREKLRHIFLLGGCDGARGERHYFTDFATSVPDDCLILTLACGKYRFNKLEFGDIEGLPRLVDAGQCNDAYSAIILAVTLAEKLGCGVNDLPLSLVLSWFEQKAIVILLTLLSLGVKNIVTGPTAPGFLTPDLLAVLNEKFGLRSITTVEEDMKQLLSA >NZ_CP020368|854681:927994|867296_868103_-|WP_000115390.1|DBSCAN-SWA MSVMNPISSNIFNAEFLNTPAAALAAWISIIGAVITLVTIVIRALFKYRKSHDVILKKSGIPAFILNFFLIRISLKRLPTITWAEKSITVLFSLLFLCAIYIFGPVFIQAIRTPPNSTLLYWIKSGESFYMSKKLATAATILTTPDWEISKDDCEESSSASTEKYKSLTIEHKEILCKLLTTDEGNTYIDEEVKNFVKDKFFIYSFAPTTIFILLWISLGFILTIHYSKKVRKYILTEQKNAIHWAYGEFKTEGIYSIYHELERKTHH >NZ_CP020368|854681:927994|926701_927994_+|WP_000886683.1|tRNA|DBSCAN-SWA MLDPNLLRNEPDAVAEKLARRGFKLDVDKLGALEERRKVLQVKTENLQAERNSRSKSIGQAKARGEDIEPLRLEVNKLGEELDAAKAELDALQAEIRDIALTIPNLPADEVPVGKDENDNVEVSRWGTPREFDFEVRDHVTLGEMHSGLDFAAAVKLTGSRFVVMKGQIARMHRALSQFMLDLHTEQHGYSENYVPYLVNQDTLYGTGQLPKFAGDLFHTRPLEEEADTSNYALIPTAEVPLTNLVRGEIIDEDDLPIKMTAHTPCFRSEAGSYGRDTRGLIRMHQFDKVEMVQIVRPEDSMAALEEMTGHAEKVLQLLGLPYRKIILCTGDMGFGACKTYDLEVWIPAQNTYREISSCSNVWDFQARRMQARCRSKSDKKTRLVHTLNGSGLAVGRTLVAVMENYQQADGRIEVPEVLRPYMNGLEYIG >NZ_CP020368|854681:927994|865960_866389_+|WP_000196203.1|lysis|DBSCAN-SWA MTRALAVVVALALVALGWQSWRLNSASHTIETQRAALKSKAQELTKKNSQLISLSILTETNNREQARLYAEAEQTSALLRQRQHRIEELKRENEDLRRWADTPLPADIIRLRERPALTGGTAYRQWLSASDAVSAGSGNAAH >NZ_CP020368|854681:927994|859263_859599_-|WP_001059831.1|DBSCAN-SWA MNNIPPIPQLGIYVSKIDPTLRITVTDVDIVDGEDDSPDDELFYLVHWIEGEDESDMTAMGFELDPVEWQAFVESEQLVFERDPYMDSIPENSNLAKIRDFLMKTKQNDHS >NZ_CP020368|854681:927994|858638_858827_+|WP_001154431.1|DBSCAN-SWA MQDYFLESLKLQRIDFFLKLVAASECSDEEKGLALQWVSELTDELMAKIRSHEYNRSMDVIS >NZ_CP020368|854681:927994|881877_883011_+|WP_000996005.1|DBSCAN-SWA MNDAIPRPQAKTRKALTPLLEIRNLTKSYDGQHAVDDVSLTIYKGEIFALLGASGCGKSTLLRMLAGFEQPSAGQIMLDGVDLSQVPPYLRPINMMFQSYALFPHMTVEQNIAFGLKQDKLPKAEIASRVNEMLGLVHMQEFAKRKPHQLSGGQRQRVALARSLAKRPKLLLLDEPMGALDKKLRDRMQLEVVDILERVGVTCVMVTHDQEEAMTMAGRIAIMNRGKFVQIGEPEEIYEHPTTRYSAEFIGSVNVFEGVLKERQEDGLVLDSPGLVHPLKVDADASVVDNVPVHVALRPEKIMLCEEPPANGCNFAVGEVIHIAYLGDLSVYHVRLKSGQMISAQLQNAHRHRKGLPTWGDEVRLCWEVDSCVVLTV >NZ_CP020368|854681:927994|889143_889875_-|WP_000756569.1|DBSCAN-SWA MKKVLIAALIAGFSLSATAAETIRFATEASYPPFESIDANNQIVGFDVDLAQALCKEIDATCTFSNQAFDSLIPSLKFRRVEAVMAGMDITPEREKQVLFTTPYYDNSALFVGQQGKYTSVDQLKGKKVGVQNGTTHQKFIMDKHPEITTVPYDSYQNAKLDLQNGRIDGVFGDTAVVTEWLKDNPKLAAVGDKVTDKDYFGTGLGIAVRQGNTELQQKLNTALEKVKKDGTYETIYNKWFQK >NZ_CP020368|854681:927994|888420_889137_-|WP_001001691.1|DBSCAN-SWA MNEFFPLASAAGMTVGLAVCALIVGLALAMFFAVWESAKWRPVAWAGSALVTILRGLPEILVVLFIYFGSSQLLLTLSDGFTINLGFVQIPVQMDIENFDVSPFLCGVIALSLLYAAYASQTLRGALKAVPVGQWESGQALGLSKSAIFFRLVMPQMWRHALPGLGNQWLVLLKDTALVSLISVNDLMLQTKSIATRTQEPFTWYIVAAAIYLVITLLSQYILKRIDLRATRFERRPS >NZ_CP020368|854681:927994|889892_890621_-|WP_000027205.1|DBSCAN-SWA MSIQLNGINCFYGAHQALFDITLDCPQGETLVLLGPSGAGKSSLLRVLNLLEMPRSGTLNIAGNHFDFTKTPSDKAIRDLRRNVGMVFQQYNLWPHLTVQQNLIEAPCRVLGLSKDQALARAEKLLERLRLKPYSDRYPLHLSGGQQQRVAIARALMMEPQVLLFDEPTAALDPEITAQIVSIIRELAETNITQVIVTHEVEVARKTASRVVYMENGHIVEQGDASCFTEPQTEAFKNYLSH >NZ_CP020368|854681:927994|891479_891803_+|WP_001160737.1|DBSCAN-SWA MQFSTTPTLEGQTIVEYCGVVTGEAILGANIFRDFFAGIRDIVGGRSGAYEKELRKAREIAFEELGSQARALGADAVVGIDIDYETVGQNGSMLMVSVSGTAVKTRR >NZ_CP020368|854681:927994|903358_905017_+|WP_000599802.1|DBSCAN-SWA MILERVEIVGFRGINRLSLMLEQNNVLIGENAWGKSSLLDALTLLLSPESDLYHFERDDFWFPPGDINGREHHLHIILTFRESLPGRHRVRRYRPLEACWTPCTDGYHRIFYRLEGESAEDGSVMTLRSFLDKDGHPIDVEDINDQARHLVRLMPVLRLRDARFMRRIRNGTVPNVPNVEVTARQLDFLARELSSHPQNLSDGQIRQGLSAMVQLLEHYFSEQGAGQARYRLMRRRASNEQRSWRYLDIINRMIDRPGGRSYRVILLGLFATLLQAKGTLRLDKDARPLLLIEDPETRLHPIMLSVAWHLLNLLPLQRIATTNSGELLSLTPVEHVCRLVRESSRVAAWRLGPSGLSTEDSRRISFHIRFNRPSSLFARCWLLVEGETETWVINELARQCGHHFDAEGIKVIEFAQSGLKPLVKFARRMGIEWHVLVDGDEAGKKYAATVRSLLNNDREAEREHLTALPALDMEHFMYRQGFSDVFHRMAQIPENVPMNLRKIISKAIHRSSKPDLAIEVAMEAGRRGVDSVPTLLKKMFSRVLWLARGRAD >NZ_CP020368|854681:927994|891799_892630_+|WP_001255168.1|DBSCAN-SWA MRRVFWLVAAALLLAGCAGEKGIVEKEGYQLDTRRQAQAAYPRIKVLVIHYTADDFDSSLATLTDKQVSSHYLVPAVPPRYNGKPRIWQLVPEQELAWHAGISAWRGATRLNDTSIGIELENRGWQKSAGVKYFAPFEPAQIQALIPLAKDIIARYHIKPENVVAHADIAPQRKDDPGPLFPWQQLAQQGIGAWPDAQRVNFYLAGRAPHTPVDTTSLLELLARYGYDVKPDMIPREQRRVIMAFQMHFRPTLYNGEADAETQAIAEALLEKYGQD >NZ_CP020368|854681:927994|883020_883974_+|WP_000105444.1|DBSCAN-SWA MSTLEPAAQSKPPGGFKLWLSQLQMKHGRKLVIALPYIWLILLFLLPFLIVFKISLGEMARAIPPYTELMEWADGQLSITLNLGNFLQLTDDPLYFDAYLQSLQVAAISTICCLLIGYPLAWAVAHSKPSTRNILLLLVILPSWTSFLIRVYAWMGILKNNGVLNNFLLWLGVIDQPLTILHTNLAVYIGIVYAYVPFMVLPIYTALIRIDYSLVEAALDLGARPLKTFFTVIVPLTKGGIIAGSMLVFIPAVGEFVIPELLGGPDSIMIGRVLWQEFFNNRDWPVASAVAIIMLLLLIVPIMWFHKHQQKSVGEHG >NZ_CP020368|854681:927994|924645_925257_+|WP_001295343.1|DBSCAN-SWA MKKIAITCALLSSLVASSVWADAASDLKSRLDKVSSFHASFTQKVTDGSGAAVQEGQGDLWVKRPNLFNWHMTQPDESILVSDGKTLWFYNPFVEQATATWLKDATGNTPFMLIARNQSSDWQQYNIKQNGDDFVLTPKASNGNLKQFTINVGRDGTIHQFSAVEQDDQRSSYQLKSQQNGAVDAAKFTFTPPQGVTVDDQRK >NZ_CP020368|854681:927994|893738_895169_-|WP_001136577.1|DBSCAN-SWA MPQRILVLGASGYIGQHLVRTLSRQGHQILAAARHVDRLAKLQLANVSCHKVDLSWPDNLPALLQDIDTVYFLVHSMGEGGDFIAQERQVALNVRDALREVPVKQLIFLSSLQAPPHEQSDHLRARQATADILREANVPVTELRAGIIVGAGSAAFEVMRDMVYNLPVLTPPRWVRSRTTPIALENLLHYLVALLDHPASEHRIFEAAGPEVLSYQQQFEHFMAVSGKRRWLIPIPLPTRWISVWFLNVITSVPPTTARALIQGLKHDLLADDTALRALIPQRLIAFDDAVRSTLKEEEKLVNSSDWGYDAQAFARWRPEYGYFAKQAGFTVKTSASLAALWQVVNQIGGKERYFFGNILWQTRALMDRAIGHKLAKGRPEREYLQTGDAVDSWKVIVVEPEKQLTLLFGMKAPGLGRLCFSLEDKGDYRTIDVRAFWHPHGMPGLFYWLLMIPAHLFIFRGMAKQIARLAEQSTD >NZ_CP020368|854681:927994|875020_876706_-|WP_001024876.1|DBSCAN-SWA MNINVAELLNGNYILLLFVVLALGLCLGKLRLGSIQLGNSIGVLVVSLLLGQQHFSINTDALNLGFMLFIFCVGVEAGPNFFSIFFRDGKNYLMLALVMVGSALVIALGLGKLFGWDIGLTAGMLAGSMTSTPVLVGAGDTLRHSGMESRQLSLALDNLSLGYALTYLIGLVSLIVGARYLPKLQHQDLQTSAQQIARERGLDTDANRKVYLPVIRAYRVGPELVAWTDGKNLRELGIYRQTGCYIERIRRNGILANPDGDAVLQMGDEIALVGYPDAHARLDPSFRNGKEVFDRDLLDMRIVTEEVVVKNHNAVGKRLAQLKLTDHGCFLNRVIRSQIEMPIDDNVVLNKGDVLQVSGDARRVKTIADRIGFISIHSQVTDLLAFCAFFVIGLMIGMITFQFSTFSFGMGNAAGLLFAGIMLGFMRANHPTFGYIPQGALSMVKEFGLMVFMAGVGLSAGSGINNGLGAIGGQMLIAGLIVSLVPVVICFLFGAYVLRMNRALLFGAMMGARTCAPAMEIISDTARSNIPALGYAGTYAIANVLLTLAGTIIVMVWPGLG >NZ_CP020368|854681:927994|860071_860995_+|WP_001034589.1|DBSCAN-SWA MNKKFTDEQQQQLIGHLTKKGFYRGANIKITIFLCGGDVANHQSWRHQLSQFLAKFSDVDIFYPEDLFDDLLAGQGQHSLLSLENILAEAVDVIILFPESPGSFTELGAFSNNENLRRKLICIQDAKFKSKRSFINYGPVRLLRKFNSKSVLRCSSNELKEMCDSSIDVARKLRLYKKLMASIKKVRKENKVSKDIGNILYAERFLLPCIYLLDSVNYRTLCELAFKAIKQDDVLSKIIVRSVVSRLINERKILQMTDGYQVTALGASYVRSVFDRKTLDRLRLEIMNFENRRKSTFNYDKIPYAHP >NZ_CP020368|854681:927994|863162_864929_-|WP_001098413.1|terminase|DBSCAN-SWA MNTTLTPADLDPRRQAMLLYFQGYRVARIAEMLGEKVATVHSWKKRDKWGDYGPLDQMQLTTAARYCQLIMKEHKEGKDFKEIDLLARQSERHARIGKFNNGGNEADLNPNVANRNKGPRRQPEKNVFTDEQIEKLEEIFHSSMFNYQRHWWEAGKTNRIRNLLKSRQIGATFYFAREALIDALLTGRNQIFLSASKAQAHVFKQYIIDFAKEVEVELKGDPIVLPNGATLYFLGTNARTAQSYHGNLYLDEYFWIPKFQELRKVASGMAIHKKWRQTYFSTPSSLTHSAYPFWSGALFNRGRNKADKVDIDLSHSNLAPGLLCTDGQYRQIVTVEDAVRGGCNLFDLDQLRMEYSPDEYQNLLMCEFVDDLASVFPLSELQACMVDSWEIWTDFHALALRPFGWREVWIGYDPAKGTQNGDSAGCVVVAPPAVPGGKFRILERHQWRGMDFRAQADAIKKLTEQYNVTYIGIDSTGVGHGVYENVKAFFPAVREFVYNPNVKNALVLKAYDIISHRRLEFDAGHTDIAQSFMAIRRATTASGNRPTYEASRSEEASHADLAWATMHALFNEPLQGESANTSNIVEIF >NZ_CP020368|854681:927994|854681_855698_-|WP_000290930.1|integrase|DBSCAN-SWA MAVRKLTTGKWLCECYPAGRSGRRVRKQFATKGEALAFERHTMEETEAKPWLGESVDRRTLKDVIELWFKLHGKSLTAGQHVYDKLLLMVDALGNPLATDLTSKMFAHYRDKRLTGEIYFSEKWKKGASPVTINLEQSYLSSVFSELSRLGEWSYPNPLENMRKFTIAEKEMAWLTHEQIVELLADCKRQDPILALVVKICLSTGARWREAINLTRSQVTKYRITFVRTKGKKNRSIPISKELYEEIMALDGFNFFTDCYFQFLSVMEKTSIVLPRGQLTHVLRHTFAAHFMMSGGNILALQKILGHHDIKMTMRYAHLAPDHLETALRFNPLATLEV >NZ_CP020368|854681:927994|918323_919289_-|WP_000537418.1|DBSCAN-SWA MGTTKHSKLLILGSGPAGYTAAVYAARANLQPVLITGMEKGGQLTTTTEVENWPGDPNDLTGPLLMERMHEHATKFETEIIFDHINKVDLQNRPFRLNGDNGEYTCDALIIATGASARYLGLPSEEAFKGRGVSACATCDGFFYRNQKVAVIGGGNTAVEEALYLSNIASEVHLIHRRDGFRAEKILIKRLMDKVENGNIILHTNRTLEEVTGDQMGVTGVRLRDTQNSDNIESLDVAGLFVAIGHSPNTAIFEGQLELENGYIKVQSGIHGNATQTSIPGVFAAGDVMDHIYRQAITSAGTGCMAALDAERYLDGLADAK >NZ_CP020368|854681:927994|907232_909179_+|WP_000188180.1|DBSCAN-SWA MTPLLELKDIRRSYPAGDEQVEVLKGITLDIYAGEMVAIVGASGSGKSTLMNILGCLDKATSGTYRVAGQDVATLDADALAQLRREHFGFIFQRYHLLSHLTAEQNVEVPAVYAGLERKQRLLRAQELLQRLGLEDRTEYYPAQLSGGQQQRVSIARALMNGGQVILADEPTGALDSHSGEEVMAILHQLRDRGHTVIIVTHDPQVAAQAERVIEIRDGEIVRNPPAIEKVNVAGGTEPVVNTVSGWRQFVSGFNEALTMAWRALAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMVLADIRSIGTNTIDVYPGKDFGDDDPQYQQALKYDDLIAIQKQPWVASATPAVSQNLRLRYNNVDVAASANGVSGDYFNVYGMTFSEGNTFNQEQLNGRAQVVVLDSNTRRQLFPHKADVVGEVILVGNMPARVIGVAEEKQSMFGSSKVLRVWLPYSTMSGRVMGQSWLNSITVRVKEGFDSAEAEQQLTRLLSLRHGKKDFFTWNMDGVLKTVEKTTRTLQLFLTLVAVISLVVGGIGVMNIMLVSVTERTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGALGITLSLLIAFTLQLFLPGWEIGFSPLALLLAFLCSTVTGILFGWLPARNAARLDPVDALARE >NZ_CP020368|854681:927994|872663_873266_-|WP_000368077.1|tail|DBSCAN-SWA MDNAVLNSELIATKAGNIPVYNYDGETREYISTSNEYLSVGVGIPACSCLDAPGTHKADYAICRSTDFNSWEYVPDHRGEIVYNTETGDAKEITAPGDYPENTTTIAPLTPYDKWDGEKWVTDTEAQHGAAVEVAEAQRQSLIDAAMASISLIQLKLQAGRKLMQAETTRLNIVLDYIDAVTATDTSTAPDVIWPELPEA >NZ_CP020368|854681:927994|868206_868785_+|WP_000993743.1|plate|DBSCAN-SWA MNAQLTEIMRLITNLIRIGVVTEVDRENWLCRVKTGDLETNWISWLTLRAGNARTWWRPSEGEQVVLLSLGGNLETAFALPAVYSNQFAPPSTSADACVTEHPDGGWFEYEPATGRWYVRGIKSMVIEAADNITMKTSEFVLEADRTRINSEVVINGGVTQGGGAMSSNGIVVDAHQHTGVLKGGDTTGGPV >NZ_CP020368|854681:927994|862137_863163_-|WP_000520360.1|portal|DBSCAN-SWA MGKSKKNRAAATNQIQHKNQTSAEAFSFGDPVPVLDRRELLDYVECVKMDRWYEPPVSFDGLARTFRAAVHHSSPIAVKCNILTSTYIPHPLLSQQAFSRFVQDYLVFGNAYLEKRTNRFGEVIALEPALAKYTRRGLDLDTYWFVQYGMTTQPYQFTKGSIFHLMEPDINQEIYGLPGYLSAIPSALLNESATLFRRKYYINGSHAGFIMYMTDAAQNQEDVNNLRNAMKSAKGPGNFRNLFMYSPNGKKDGLQIIPLSEVTAKDEFLNIKNVSRDDMMAAHRVPPQMMGIMPNNVGGFGDVEKASKVFVRNELMPLQKRLQELNNWLDEEVLNFSTYEL >NZ_CP020368|854681:927994|876975_877353_+|WP_000681108.1|DBSCAN-SWA MKHKQRWAGAICCFVLFIVVCLFLATHMKGAFRAAGHPEIGLLFFILPGAVASFFSQRREVLKPLFGAMLAAPCSMLIMRLFFSPTRSFWQELAWLLSAVFWCALGALCFLFISSLFKPQHRKNQ >NZ_CP020368|854681:927994|887752_888421_-|WP_000464491.1|DBSCAN-SWA MFEYLPELMKGLHTSLTLTVASLIVALILALIFTIILTLKTPVLVWLVRGYITLFTGTPLLVQIFLIYYGPGQFPTLQEYPALWHLLSEPWLCALIALSLNSAAYTTQLFYGAIRAIPEGQWQSCSALGMSKKDTLAILLPYAFKRSLSSYSNEVVLVFKSTSLAYTITLMEVMGYSQLLYGRTYDVMVFGAAGIIYLVVNGLLTLMMRLIERKALAFERRN >NZ_CP020368|854681:927994|879843_880320_+|WP_000203025.1|DBSCAN-SWA MTSLVVPGLDTLRQWLDDLGMSFFECDNCQALHLPHMQNFDGVFDAKIDLIDNTILFSAMAEVRPSAVLPLAADLSAINASSLTVKAFLDMQDDNLPKLVVCQSLSVMQGVTYEQFAWFVRQSEEQISMVILEANAHQLLLPTDDEGQNNVTENYFLH >NZ_CP020368|854681:927994|913966_914671_-|WP_001241678.1|tRNA|DBSCAN-SWA MRLVQLSRHSIAFPSPEGALREPNGLLALGGDLSPARLLMAYQRGIFPWFSPGDPILWWSPDPRAVLWPESLHISRSMKRFHKRSPYRVTMNYAFGQVIEGCASDREEGTWITRGVVEAYHRLHELGHAHSIEVWREDELVGGMYGVAQGTLFCGESMFSRMENASKTALLVFCEEFIGHGGKLIDCQVLNDHTASLGACEIPRRDYLNYLNQMRLGRLPNNFWVPRCLFSPQE >NZ_CP020368|854681:927994|872254_872692_+|WP_000280166.1|tail|DBSCAN-SWA MYKYSAKKNAFYLAGNEAVYRDSGTWPDDAKDIETRRAESFMATPPQGKRRIAGADGMPAWADIPSPTHEELIEISESKRQLLINQANEYMNSKQWPGKAAIGRLKGEELAQYNSWLDYLDALELVDTSSVPDIEWPTPPAVQAR >NZ_CP020368|854681:927994|868781_869141_+|WP_000177591.1|plate|DBSCAN-SWA MTLYSGMNNTSGKVITDIDHLRQSVRDILLTPQGSRIARREYGSLLSALIDQPQNPALRLQVMSAVYVALSRWEPRLTLDSITINSNFDGSMVVGLTGRRNNGVPVSLSVSTGAENGSD >NZ_CP020368|854681:927994|877799_878087_+|WP_001201560.1|DBSCAN-SWA MRAIGKLPKGVLILEFIGMMLLAVALLSVSDSLSLPEPFSRPEVQILMIFLGVLLMLPAAVVVILQVAKRLAPQLMNRPPQYSRSEREKDNDANH >NZ_CP020368|854681:927994|886730_887462_-|WP_001295905.1|DBSCAN-SWA MKKLVLAALLASFTFGASAAEKINFGVSATYPPFESIGANNEIVGFDIDLAKALCKQMQAECTFTNHAFDSLIPSLKFRKYDAVISGMDITPERSKQVSFTTPYYENSAVVIAKKDTYKTFADLKGKRIGMENGTTHQKYIQDQHPEVKTVSYDSYQNAFIDLKNGRIDGVFGDTAVVNEWLKTNPQLGVATEKVTDPQYFGTGLGIAVRPDNKALLEKLNNALAAIKADGTYQKINDQWFPQ >NZ_CP020368|854681:927994|869127_870036_+|WP_000268294.1|plate|DBSCAN-SWA MAVIDLSQLPAPQIVDVPDFETLLAERKAEFVALHPKDEQEAVIRTLELESEPVTKLLQENAYRELLLRQRINEAAQAVMVAYAMGGDLDQLAANYNVKRLTVTPADNDAVPPVAAVMESDEALRLRVPAAFEGLSVAGPTAAYEFHARSADGRVADASATSPAPAEVVLTVLSREGDGTAEKDLLDVVEKALNSENVRPVADRLTVRSAEIIPYRVEATIFLYPGPEAEPVMAAAKASLQKYIASQTRLGRDIRRSAIFAALHVEGVQRVELASPLADVVLNKTQAASCTQWSVTNGGTDE >NZ_CP020368|854681:927994|909251_909476_-|WP_000410785.1|DBSCAN-SWA MEKGTVKWFNNAKGFGFICPEGGGEDIFAHYSTIQMDGYRTLKAGQSVQFDVHQGPKGNHASVIVPVEVEAAVA >NZ_CP020368|854681:927994|905013_905970_-|WP_001355621.1|DBSCAN-SWA MNMSQLTERTFTPSESLSSLSLFLSLARGQCRPGKFWHRRSFRQKFLLRSLIMPRLSVEWMNELSHWPNLNVLLTRQPRLPVRLHRPYLAANLSRKQLLEALRYHYALLRGCMSAEEFSLYLNTPGLQLAKLEGKNGEQFTLELTMMISMDKEGDSTILFRNSEGIPLAEITFTLCEYQGKRTMFIGGLQGAKWEIPHQEIQNATKACHGLFPKRLVMEAACLFAQRLQVEQIIAVSNETHIYRSLRYRDKEGKIHADYNAFWESVGGVCDAERHYRLPAQIARKEIAEIASKKRAEYRRRYEMLDAIQPQMATMFRG >NZ_CP020368|854681:927994|884875_885364_+|WP_000389260.1|DBSCAN-SWA MEDETLGFFKKTSSSHARLNVPALVQVAALAIIMIRGLDVLMIFNTLGVRGIGEFIHRSVQTWSLTLVFLSSLVLVFIEIWCAFSLVKGRRWARWLYLLTQITAASYLWAASLGYGYPELFSIPGESKREIFHSLMLQKLPDMLILMLLFVPSTSRRFFQLQ >NZ_CP020368|854681:927994|902238_902934_-|WP_001298299.1|DBSCAN-SWA MFRKLAAECFGTFWLVFGGCGSAVLAAGFPELGIGFAGVALAFGLTVLTMAFAVGHISGGHFNPAVTIGLWAGGRFPAKEVVGYVIAQVVGGIVAAALLYLIASGKTGFDAAASGFASNGYGEHSPGGYSMLSALVVELVLSAGFLLVIHGATDKFAPAGFAPIAIGLALTLIHLISIPVTNTSVNPARSTAVAIFQGGWALEQLWFFWVVPIVGGIIGGLIYRTLLEKRD >NZ_CP020368|854681:927994|919833_920328_+|WP_000228473.1|DBSCAN-SWA MVDSKKRPGKDLDRIDRNILNELQKDGRISNVELSKRVGLSPTPCLERVRRLERQGFIQGYTALLNPHYLDASLLVFVEITLNRGAPDVFEQFNTAVQKLEEIQECHLVSGDFDYLLKTRVPDMSAYRKLLGETLLRLPGVNDTRTYVVMEEVKQSNRLVIKTR >NZ_CP020368|854681:927994|870630_872253_+|WP_000104800.1|DBSCAN-SWA MSTKFYTLLTDIGAAKLASAAALGVPLKITHMAVGDGGGVLPTPDAKQTALVNEKRRAALNMLYIDPQNSSQIIAEQVIPENEGGWWIREVGLFDESGALIAVGNCPESYKPQLAEGSGRTQTVRMVLITSSTDNITLKIDPAVVLATRKYVDDKVLELKVFVDDKMAKHLAAPDPHSQYAPKESPTLTGTPKAPTPAEGNNTTQIATTAFVQAALMALINGAPATLDTMKEIAAAINNDPKFSTTINNALALKAPLLSPAFTGTPTAPTAAQSVNNTQIATTAFVKSAIAAMVGSAPAALDTLNELAAALGNDPNFATTMLNALAGKQPLDNTLTNLSGKDVAGLLAYLGLGDALIGDECKIAGFDSSNVNAPYMRFARTNTVVRLATKDYAQPKDQTLTDLSGKDKAELRTYLDLKSAAQRDVGSGANQIPDMNDFTSSLTSPGWQKLPSGLIIQWGAANPSSTGEIFITFPVAFSAYPMYVGFGPQQASLPNVVQSPVISAPTITNLGCGVRNLMIPTAGGAPVASMSSFFWIAVGK >NZ_CP020368|854681:927994|885404_886532_+|WP_001149743.1|DBSCAN-SWA MQCALYDAGRCRSCQWITQPIPEQLSAKTADLKNLLADFPVEEWCAPVSGPEQGFRNKAKMVVSGSVEKPLLGMLHRDGTPEDLCDCPLYPASFAPVFSALKPFIARAGLTPYNVARKRGELKYILLTESQSDGGMMLRFVLRSETKLAQLRKALPWLQEQLPQLKVITVNIQPVHMAIMEGETEIYLTEQQALAERFNDVPLWVRPQSFFQTNPAVASQLYATARDWVRQLPVKHMWDLFCGVGGFGLHCATPDMQLTGIEIAPEAIACAKQSAAELGLTRLQFQALDSTQFATAQGEVPELVLVNPPRRGIGKPLCDYLSTMAPRFIIYSSCNAQTMAKDVRELPGYRIERVQLFDMFPHTAHYEVLTLLVKQ >NZ_CP020368|854681:927994|898068_899037_-|WP_000178677.1|DBSCAN-SWA MTMPTNQCPWRMQVHHITQETPDVWTISLICHDYYPYRAGQYALVSVRNSAETLRAYTISSTPGVSEYITLTVRRIDDGVGSQWLTRDVKRGDYLWLSDAMGEFTCDDKAEDKFLLLAAGCGVTPIMSMRRWLAKNRPQADVRVIYNVRTPQDVIFADEWRNYPVTLVAENNVTEGFIAGRLTRELLAGVPDLASRTVMTCGPAPYMDWVEQEVKALGVTRFFKEKFFTPVAEAATSGLKFTKLQPAREFYAPVGTTLLEALESNNVPVVAACRAGVCGCCKTKVVSGEYTVSSTMTLTDAEIAEGYVLACSCHPQGDLVLA >NZ_CP020368|854681:927994|906120_907236_+|WP_000746443.1|DBSCAN-SWA MKKRKTVKKRYVIALVIVIAGLITLWRILNAPVPTYQTLIVRPGDLQQSVLATGKLDALRKVDVGAQVSGQLKTLSVAIGDKVKKDQLLGVIDPEQAENQIKEVEATLMELRAQRQQAEAELKLARVTYSRQQRLAQTKAVSQQDLDTAATEMAVKQAQIGTIDAQIKRNQASLDTAKTNLDYTRIVAPMAGEVTQITTLQGQTVIAAQQAPNILTLADMSTMLVKAQVSEADVIHLKPGQKAWFTVLGDPLTRYEGQIKDVLPTPEKVNDAIFYYARFEVPNPNGLLRLDMTAQVHIQLTDVKNVLTIPLSALGDPVGDNRYKVKLLRNGETREREVTIGARNDTDVEIVKGLEAGDEVVIGEAKPGAAQ >NZ_CP020368|854681:927994|880670_881783_+|WP_000126069.1|DBSCAN-SWA MTALNKKWLSGLVAGALMAVSVGTLAAEQKTLHIYNWSDYIAPDTVANFEKETGIKVVYDVFDSNEVLEGKLMAGSTGFDLVVPSASFLERQLTAGVFQPLDKSKLPEWKNLDPELLKLVAKHDPDNKFAMPYMWATTGIGYNVDKVKAVLGENAPVDSWDLILKPENLEKLKSCGVSFLDAPEEVFATVLNYLGKDPNSTKADDYTGPATDLLLKLRPNIRYFHSSQYINDLANGDICVAIGWAGDVWQASNRAKEAKNGVNVSFSIPKEGAMAFFDVFAMPADAKNKDEAYQFLNYLLRPDVVAHISDHVFYANANKAATPLVSAEVRDNPGIYPPADVRAKLFTLKVQDPKIDRVRTRAWTKVKSGK >NZ_CP020368|854681:927994|920462_924491_+|WP_000077053.1|DBSCAN-SWA MSQEYTEDKEVTLTKLSSGRRLLEALLILIVLFAVWLMAALLSFNPSDPSWSQTAWHEPIHNLGGMPGAWLADTLFFIFGVMAYTIPVIIVGGCWFAWRHQSSDEYIDYFAVSLRIIGVLALILTSCGLAAINADDIWYFASGGVIGSLLSTTLQPLLHSSGGTIALLCVWAAGLTLFTGWSWVTIAEKLGGWILNILTFASNRTRRDDTWVDEDEYEDDEEYEDENHGKQHESRRARILRGALARRKRLAEKFINPMGRQTDAALFSGKRMDDDEEITYTARGVAADPDDVLFSGNRATQPEYDEYDPLLNGAPITEPVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPPSQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAAEQPVQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQSVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKATLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASYGIKLPSQRAAEEKAREAQRNQYDSGDQYNDDEIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAELARQFAQTQQQRYSGEQPAGANPFSLDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVPPQPQYQQPQQPVAPQPQYQQPQQPVAPQQQYQQPQQPVAPQQQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVEPVDTFALEQMARLVEARLADFRIKADVVNYSPGPVITRFELNLAPGVKAARISNLSRDLARSLSTVAVRVVEVIPGKPYVGLELPNKKRQTVYLREVLDNAKFRDNPSPLTVVLGKDIAGEPVVADLAKMPHLLVAGTTGSGKSVGVNAMILSMLYKAQPEDVRFIMIDPKMLELSVYEGIPHLLTEVVTDMKDAANALRWCVNEMERRYKLMSALGVRNLAGYNEKIAEADRMMRPIPDPYWKPGDSMDAQHPVLKKEPYIVVLVDEFADLMMTVGKKVEELIARLAQKARAAGIHLVLATQRPSVDVITGLIKANIPTRIAFTVSSKIDSRTILDQAGAESLLGMGDMLYSGPNSTLPVRVHGAFVRDQEVHAVVQDWKARGRPQYVDGITSDSESEGGAGGFDGAEELDPLFDQAVQFVTEKRKASISGVQRQFRIGYNRAARIIEQMEAQGIVSEQGHNGNREVLAPPPFD |
72 | Salmonella_phage(36.84%) | capsid,integrase,terminase,plate,portal,tail,tRNA,protease,lysis | attL 851580:851595|attR 935017:935032 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1373913 : 1394635
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP020368|1373913:1394635|DBSCAN-SWA CTTATGCCGCCAGCACGCGGTTGCGTCCATCATTTTTCGCCCGATACAAAGCATCATCAACGCGTTTAAACAGTTCATCGATGCTTTCATTTCCTTCGTGATGCGCCACACCAATGCTGACGGTAAAGCGTGGTAAGCCCGAAATACTCACTTTTGCCACGCTTACGCGGATAGTTTCAGCCAGCGAAAGCGCGGTATCCAGTGGGGTTCTTGGTAGCAATAAGACAAACTCTTCGCCTCCCCAACGAAACACCAAATCGCCTTTGCGAGCGCAACTTTCGAGGGTGCGGGCGAGGGCGCATAACACCTCATCACCTTTAGAATGCCCATAGAGATCGTTAATGTGTTTAAAACGATCGGTGTCGATGAGCAACAAGCTGTAATCCTGAGCGATGGCGAGATGCTGCATTTGGCCTGGTTCCGTAATGTGATAAAACTGTCGCCGATTCAGTAATCCGGTCATCGCGTCATGGTGAGCAGCATGTTCCAGCTGCTCCTCCAGCCGTTTTTGCTCAGTAATATCATACACAATACATAACATGAGCTTGTCGCCATAAATTTCAATCGGTCCGGCATAGGTCTGCACATGACGAGTCGAACCATCCGCCAGTTTATGAACAAAATTCAAAGGTTTATGACCACCGGGTAAATGCGAGATTTCATGCATGATAGGCATGACGCGACGCCCGAGCATATTTATTTCCCAGGTATGTTTCTGGCACATCGTTTCATGGTTATAACCATAGAAATTGAGCGCGGCGAGGTTAGCATCGACGATTTGTCCATCTCGTGACGGGTCAATCAACAACATTGGTGCAGAGTTAGTCAGAAAAAAGCGCGCATAAAAACCTTGTTTTTTGCGCTGATAATTTGCCGAGCGACTGGCTTTTAAACCCAGCGTTGCCGGCGCTTCGATACCTTCGAAAATAATCACCGGTTCTGTTTCTGTCAGCTTTCGCAAAACAAGCCGACAGCTCAATGCTGTTTCCTCTTCTTTACGCTGAACAGTGAGGATTTCGATAATATCGTGTTGGTTTTGCAGATCGGAGAGGTATTTCGGCAGTTCTTTTTGTGAGGAGACGGAATAGGGTCCGGTTCGTAGCTGACTAAACGTGAGGTCTTGCATCAACAGTTTCGCCGCGCTATTGGCATAAATTAACTGTTCCTCAAAGGGCGAAACGATCCAGACAGGACTGGTGAGTAAGTCCAGGGTATTGAAGTTGTGCGTAATCATTGAGATCCCGTTATTTTTATCAATTTTTGTTGCTATCCGATCGCAAAAAAGCCACGTCATATGATCAGATAATTCTGATAATGATAGACGCTATTTAACACTTCACACGGTTTGTATACGGAAAAGCATTTTGCTTTTTGTATTCAATTTAGACAGAATTTTATTAATCATTTCAGGGTAATGGGGTGATGAGATGTTGCGTAACAGGGCCAGAAGGCTAGACTACAAAATAATGCGTTGATGATGGAGGCACTGTGGAAGCGATTAAGGGATCGGACGTTAATGTCCCGGATGCAGTATTTGCCTGGATGCTGGATGGTAGAGGCGGCGTTAAACCGCTGGAAAATACAGATGTGATTGATGAAGCGCATCCCTGTTGGCTCCACCTTAATTATGTACACCATGATAGCGCCCAATGGCTGGCGACAACACCGCTGCTTCCCAATAACGTACGTGATGCGCTGGCGGGCGAGAGCACGCGTCCCCGAGTCAGCCGTCTCGGTGAAGGCACGCTGATTACATTGCGCTGTATAAACGGCAGCACCGATGAACGCCCCGATCAACTGGTCGCCATGCGTGTATATATGGACGGGCGGTTAATTGTTTCGACCCGACAACGCAAAGTGCTGGCGCTGGACGATGTGGTGAGCGATCTGGAAGAGGGCACGGGTCCGACCGATTGCGGGGGATGGCTGGTGGATGTGTGCGATGCGTTGACCGATCATTCCAGTGAATTTATCGAGCAGCTGCACGATAAAATTATCGACCTTGAAGATAATCTCCTTGATCAGCAAATTCCACCGCGTGGATTCCTGGCTCTGCTGCGCAAACAATTAATTGTGATGCGTCGCTATATGGCACCGCAACGTGATGTTTATGCTCGTCTTGCCAGTGAACGTTTGCCGTGGATGAGCGATGACCAACGCCGTCGGATGCAGGATATTGCCGATCGCCTTGGGCGCGGCCTTGACGAAATCGACGCCTGTATAGCACGGACTGGCGTGATGGCGGATGAAATCGCTCAGGTGATGCAGGAAAATTTAGCTCGTCGTACCTATACAATGTCGTTGATGGCAATGGTCTTTTTACCCAGTACCTTTCTGACCGGGTTATTTGGCGTCAACCTTGGTGGGATCCCTGGCGGCGGGTGGCAATTCGGATTTTCAATTTTTTGTATTCTGTTAGTTGTTCTTATTGGTGGTGTTGCTTTATGGTTGCATCGTAGTAAATGGTTGTAACAAAAGCAATTTTTCCGGCTGTCTGTATACAAAAACGCCGCAAAGTTTGAGCGAAGTCAATAAACTCTCTACCCATTCAGGGCAATATCTCTCTTGCAGGTGAATGCAACGTCAAGCGATGGGCGTTGCGCTCCATATTGTCTTACTTCCTTTTTTGAATTACTGCATAGCACAATTGATTCGTACGACGCCGACTTTGATGAGTCGGCTTTTTTTTGCCTGTTATTTATCAGCGTCTACCCTTTAAGAGTCCACCCAATGACCAGAGGGAAATATGACGACACTTATTTATTTGCAAATTCCTGTCCCTGAACCGATTCCTGGCGATCCTGTTCCAGTGCCCGATCCGATCCCTCGCCCGCAACCCATGCCTGACCCACCACCCGATGAAGAACCGATTAAATTGTCGCATCGTGAGCGTAGATCTGCGAGGATACGCGCCTGCTAACTTTGCGTCGATGACCACGAGAATAGATTGTGACCGCTTTTTCTACCCTGAATGTTTTGCCTCCCGCCCAACTCACGAACCTTAATGAGTTGGGTTATTTAACCATGACGCCGGTGCAGGCCGCCGCGCTTCCGGCGATCCTTGCCGGAAAAGATGTTCGCGTGCAGGCGAAAACCGGCAGCGGCAAAACGGCGGCTTTTGGCCTCGGCTTGTTACAGCAAATTGATGCGTCGCTATTTCAAACCCAGGCTTTAGTGCTGTGTCCTACGCGTGAACTGGCGGATCAGGTGGCAGGTGAATTGCGTCGGCTGGCGCGTTTTCTGCCAAATACCAAAATTTTGACGTTGTGCGGTGGTCAACCGTTCGGTATGCAGCGTGATTCGTTGCAACATGCGCCGCATATTATCGTGGCAACGCCGGGGCGTTTGCTGGATCACCTGCAAAAAGGCACGGTATCACTGGATGCGTTGAATACGCTGGTGATGGATGAGGCCGACCGCATGCTGGATATGGGATTTAGCGATGCCATTGATGATGTCATCCGTTTTGCGCCTGCATCTCGACAGACGCTTCTGTTTTCGGCAACCTGGCCGGAAGCCATCGCTGCAATCAGCGGACGAGTGCAACGCGATCCTTTGGCGATTGAAATTGACTCAACAGATGCTTTGCCACCCATTGAACAACAATTTTATGAGACATCCAGCAAAGGCAAAATTCCTCTGTTGCAACGGTTATTAAGCTTGCATCAGCCATCCTCTTGCGTGGTGTTTTGCAATACCAAAAAAGATTGCCAGGCTGTCTGCGACGCGCTGAATGAAGTAGGGCAAAGTGCATTGTCATTACACGGCGATTTGGAGCAACGCGATCGCGATCAGACCCTGGTACGTTTTGCTAACGGTAGCGCCCGTGTACTGGTCGCGACTGATGTTGCTGCGCGTGGTCTGGATATTAAATCGCTTGAGCTGGTGGTGAACTTTGAGCTGGCGTGGGACCCTGAAGTTCATGTACATCGCATCGGTCGTACAGCTCGTGCAGGAAATAGCGGTCTGGCGATTAGTTTCTGTGCTCCGGAAGAAGCACAGCGGGCCAATATCATTTCTGACATGTTGCAGATAAAACTTAACTGGCAAACGCCGCCAGCTAATAGTTCCATTGCGACGCTGGAAGCAGAAATGGCAACGTTGTGTATCGATGGCGGGAAAAAAGCCAAAATGCGCCCGGGTGATGTATTAGGTGCACTGACAGGAGATATCGGGCTTGATGGCGCAGATATTGGCAAAATCGCCGTGCATCCGGCGCATGTCTATGTCGCGGTCCGTCAGGCTGTTGCTCATAAAGCATGGAAACAGTTACAGGGCGGGAAGATTAAAGGAAAAACGTGCCGGGTGCGGTTATTAAAATAATGAAATGTTGAATTGCCGGGTGCAAGAGTAAACATCTTATTCGGGATTGCCGGATGCGACGCTGGCCGCGTCTTATCCGGCCTCCATAAGAGTAGCCCGATACGCTTGCGCATCGGGCGCTATCCTGATTATTTCACTTCAACCACATTCAGCCGTAACTCATCCAACTGATTTTCATCTTCTTCTGGCTGCCAGCCCGCCGGTTGTAGTGGGATCTCTTCGCGATCAAACGCCAGATCACCCCCGTTAACCACTTCAGAACCGTGGGTGATGCCTTTGAAATCGAACAGGTTGGTATCGCACAGATGCGACGGCACCACATTCTGCATCGCGCTGAACATCGTCTCGATACGCCCTGGATAACGTTTATCCCAGTCACGCAACATGTCAGCAATCACCTGACGTTGCAGGTTAGGCTGTGAACCGCACAGGTTGCACGGAATAATCGGGAACGCTTTTGCATCGGCAAATCGCTGAATATCTTTCTCGCGGCAGTAGGCCAGCGGACGAATAACGATATGTTTGCCATCATCGCTCATCAGTTTCGGAGGCATACCTTTCATCTTACCGCCGTAGAACATATTTAAGAACAACGTTTGCAGGATATCGTCACGATGGTGACCCAACGCGATCTTCGTCGCCCCCAGTTCCGTTGCGGTACGATAAAGGATACCGCGACGAAGGCGAGAACACAGTGAGCAAGTGGTTTTGCCCTCTGGAATCTTCTCTTTCACGATACCGTAAGTATTCTCTTCAACAATCTTGTACTCAACGCCCAGCTTTTCAAGATACTCGGGCAGAACGTGTTCCGGGAAGCCCGGTTGCTTTTGATCGAGGTTAACAGCCACCAGCGAAAAATTGATTGGCGCGCTTTGCTGCAAATTGCGCAGAATCTCCAGCATGGTATAGCTGTCTTTACCCCCGGAGAGGCAAACCATGATGCGATCGCCTTCTTCAATCATATTGAAGTCAGCAATGGCTTCGCCCACGTTACGACGCAGACGTTTTTGTAATTTGTTCAGGTTGTATTGTTCTTTCTTTGTAATTTGTTGATTTTCTTGCATTATTTCAGTTCTCTGGTACTAAATGGGGCAAATTGGGGGCAAACTTTGCAACTACGATAACCGCGCATTCAACATGGCTATCTGTTCGTCGTTCATGTCATCAATCCACATACCGTAAATTTCATACACCATCTGCGCAGTTTCATGCCCCATTTGGCTGGCTATAAATGCCGGGTTCGCTCCTGCCGTCAACAGCCAGCAGGCAAAAGTATGCCGCGTATGGTACGGATTACGGCGGCGAATACCAGCACGTTTTACTGCTGCATTCCACCTTGCCCCCAAACTGCTTACCGAGTAATAAGGTTTTTGTTTTCCGTTACACACCCTGGGCATGAAAACAAAATGCAGTTTTTGCTTTTCGGTTCTGCCGTACTCCCGATGATAAAAGGTGATTTCGCTTTTGCGATGATGCCCGGTCAGTTTGTATTGCTCCTTCAGTGCTTCAAGAGCAGGCTGCAGTAGTGTTACTGTTCGGATCCCGGCATTTGTTTTTGGGGGACCGAACATATCAAGTATCGTCAGGTTTCTTCTGACATTCACTATTCCCTTTTCGAGATCCACATCCTCCCACGCCAGAGCTGCCAGTTCCCCGTGACGAAGTCCTGAGTAAACGGCAAATTTCCACAAGTTCTGGCTCTGTCCTTTTTCACTTTCCATTAATGCATTGAATTCTGTTTTAGATAACGGATCAGGCTTTATTCTGTTTCGCTGTAATTTTTTTACTCCTTCAAATGGTTTAGTTGATATAAATCCCGACTGATACGCAAAACGCAACAGCGAACAGAGCAGGGCGATATAGTTATCAACTGTGCGCACGGTTCTTCCTTTTTTGTTGGATCTTGGATTATCCAGGTAAAGCGTTTCTCCATGCAGCAGTTCATTCCGGTAGTTTAAGATATCGCTATAACGAATATGTGATATCGGGGTACTTTCACAAATTATTATTCTGAGTGTTTTTAATTGTGATTTCGTTTTCTTCATTGTGTTTGTTGTTAACTCTGTCTCTTTAATTTTTGTCCAGATATCACAAAGCTCTCCGAACGTTTTTATGACTCTCGTTGTCACCATTTTTGCCCCAGTGCTGGACTGGGGAAAACGTCTTAAATACTCAAATTCACCGGAGTTTATTTCATGAACTATCAGCGATCTTAAATTTCCGGCCTTTTTAATATTACTGTTTGTAATCTCCCAGCCTTTTAATGTTTCCCGACATCGTTTTCCTCGAAACATGAACCAGATGCGAATGTATCTACCTCTAATCTCGACACCTGTTGGTAATTTAGACATATCATGAGTCTTTGATAAACTGATTTATCTTTGGATAGTTGTACCAGATAATCCCTCGTTTGCTGTCTGGCTTACCTAAAGGAGATACTCGTTTGAAGTGGAAGCCCTCCACCCAACAGTTCTGGCGGTATGCTTCAATTTGTCTGGCCCCCAGACCAGTGCGAAGCATCAGGCCGTATTCAACCATCCACTCTTCATTAAAGATTACTTGTGCCATCGCATCACCTCTGGCAGGCGCCAATGTTAGACTGAAATTGACGCCCGATGTTGATTATTAATAATCAGCTATGAAGTTTTAATTTGAATACAATGCAATTCTCGAGGACTGAAGTTTCTCGCAATTAAAATTTATCAGTTTTACTTTCTGCTCTCTGGAAACGCCTGCTTCTTTTTTACCTGAGAGCATTTTTTCGCATTCTGATTTCGTTAGTTTAGATTTTGAATATCTTGTCCAGTTAGTAGGAGTGCCACCTTCCTTTTCAATAGTGGCGGTAATTTTATACATGAACACCTCCATTATTATTTCCAGTGGTTCGTTTATTCCATCTTTCGAGTGCTTCTTTTTCACTTCCACCATAACCGGTTCGGGATTCGCATCCGTTACACTTCGCTCGGTAATATCCTGAAATGGCTTTCACCGTTACTGATGGACAACCACAAAATGGACATGGTTTAACATTGTCATATCTCATAATTTTTCTCATAAAAAATATTTCAAGTTGGCGGTGCATTACACCGCCAGGCTGAATTATTCCTCTGAATTATCGATTACACTGTATTCCCCGGTTAATACAGAGGAATCTGCAGGATCGATTGTCAGTGGTTCCTTTTCATCCATTGATACTGCACGCTGGATCTCAATTGATACGGGCAGATATTTGAACAGGCGACGAATAGCCGTTTTCTTTGCCATTTCTTCCCAGTGAGTTACCCACGGCCCGTTATTACCAGCTTTACTCAGGCTGCGCACCAGCTCAATCTGTTTGCGCGTCATAACTTCAAACTGAGTACCTCCGTCTTTCAGTCTTGCGACAGCATAGACGTGGGTAACCGGGGCATCTTCGTTTTCTCCCGGGCGGTGTATTAACTTTTCATCAAGGCCAAATTCGAAGCTAAACTCGTCACCTTCACGGACAACACGGGCTGACAGGCTGGCGATTTGACCAGAACGGCGAGCCAGATCAATCATGCCGCGATAGCCAATGATTAGCTGAACGTTCTTTTTACCGCTCTTTTCGTTTTTATTACCAAAAGGCAGTAAATATGCATGACCGAGGGCGCTACCTGGCTCAAGTCCGAGCTGTGAACACTGTACGATCGCACTGACAAAACTCATAGTGTCACAGTTTCCTAACGCCGGAACTTTACGAATTTCTGTGGTGGCGATACGGATCATACGTTCAGCCGTCATATGGCGTGGAAGAGCTGCTGCCAGTTGCTCTTTCATTGATGGCTGGTTAATAAAACTAATCACGTCGCTATTTTTAACTGCTGCTGGTGCACGGTTTCCCTGAGTTTTTTGCAGATCGGCTTTTGCGATTGGTGGTTGCTTAGTCATTTGCATATTCCTTAGCCCAGCGGGGTAGTGATAATGTCTTAATAGCTGGCCATTCATCGGTATTCAGGCAGTCAGACAGGGTTCGCAGATTGCGGTGATATTCCTGTTGACCTGCCAGTTTTGCTTCTTCGCCCATCATGAAAATTTCAACCGGATAACGTCCGCATTCAATAGTTGTGCTGGCAACCAGAAAAACGAAAGTTGGCTGCACTCCAAACTGTGCTTCATAACCGTCACTGTAGAATGCATCCTGAACGTGATAGCGGTAGTCGTAATAAGCGGTTTTGAATCGTTGAATATCCGCCGTAGTTTTCACGTCCATGATCCAGTGAAATTCAGGGATAATTTTGTCCGGACGGCACCGACACAAAATTCCTGTTTCAGGATCTTCCCAGTAAATTGATGATTCAGCGTGTCCGGCGCTTTCAACAAGCCATTGCCCCAGCGGCAAAGCCATAACGCTTTGATACATGAGTTCAATTTTCCGGCCTTCTTCCGCAGTGATAACCGTTTTTCCTGTGCTTGCGCATTCCATCAGAAACGCTTTCTCTTCTTCTTTTCCGGCGTTTGTACGGCGGTTAAATTCAGGTGCTACGATAAAGCGGTTACTGAATTCTTCCGGTTCAAGTACCCGGCAGTGGAAAGCAGTTCCTAAATCGAGCGTTTTTGTCTTTGTGGTGTCCACGGGGGCATTTTTACGCCACAAATATAGTGCCGGAGTATCAGCAATGTCATCGAGCTGAGACTTACTGATACCGGGACCCGCGTGGTAATTCTCATTCGAAATTCCGTAATAAATACCTGGCTCTATGTCTTCTACGATTTTAGTTCCCATGTCATGGAGTCTTTGCTGAGTTGATAGCGTTCACTCCAGGTAAAATCGATCTCACCTTCAGCGGGCAGGTCATTAACGACAGGAAAATTCGTGGCAACAGCTTTAAAATAGCTGCTCAGTTTTTTACCTGACTTAACGATCAGGTAGTCCAGAGTGGCACAGGTCGATTCAAAATCGTTGCTTGCCCACAGGACGACGTCAGGTTCACCGGATGATTTTTTCGCTTTCCGTAACAGGAAGAGTGGTTTTGTGCTCATTGTTTTTTAACCTCAACTCAGATTAAAATTCGTTTTGTTCAGTGAATGATCTTGCCGGATACACACTGTTCATAGCCTGCGCCATACGCAGGCTATTTCTTTCAGATTTCACCTTTTAATTTCATTGCAATTAGAGTTGCCAGAAATTCGGCTTTTTTTTCTGCGGGCAGATTCTTTCCGATATGCACCAGGCACATTTTTTTGACACCTTCATCAAGTGTTTTTACGTTGCCTGATGGACCATCGATATCAACCACAGTGAATGGGGTTTCTTTATTTTCTGTTTTAATTACGTAGCCAATGCGCTTTCCTTCCAGATTCACCTCGTGAACAATGTCATCGGTAGTTACAACAGTGGCTTCATAATTGGTAATCATGTTTTTCTCCTTAATTAAGGTTGAGCGAATACCTGCCATTTCTGGCATAAATTCAGTTTCGAATAGTCAATTAATTAAAGTTCATGTGCCATCTGGTCTTTTTCGGCACAAGCTTCACTGCAATATTTTCTCGGTTCGTCTTTTGATAAAATCCCGTGCATGAAGTGAAGCATTCTTTCAATAGCTTTGCTTTCTTCAACGTCTTTTTTGCAAAGGTGGTAAGCACATTTTATTTTCTTAGTCATCACCATGACTCCGCCTTTACAGGTAAACCATCACGACCGAGGAAGACTTTAATCATGCGGTCAGTAATGCATGTTTTTGTGGTCAGGTTACGAATATATAGTTTTCGCTTTTTAATATTGTTTGCCGAGGCAATATATGTCCGGCCTTCATGAAGAACATAATCGCCAGGAGTCACACACTGACGTGGTATTTCATCAGTTCCGAAGTGATGTGCAATCATAATTATCTCCATTTTTACAAATGAACTTTGTTGATGCGGTGCCTGGTGCCTCCAGGTGACTGCAACCAGTTAACAATTACAGTCGGCTTTCCCACCCAAACCAATAAGGACTAACATGACTTTTAACTGTGCCACGTGCGCTTAGCCGCATTCACCGCATCACAAAATTCACTTTAAAAAGGGCGGACATCAGCCGAACTTCAAGAAAAAAACTGATGCCGCCAGGACTACACACAGCAATGTCGTTATTTACAACCGGAGGCGCACTCCCACCATTTAAATTTAACAGACAAGACCGACTCTTTATGGATATCGGAAATGCGCCTTCGTGTTGTGCCCGGTTTTATTTCACCACCTCCGGGCTTCGGTGGTCTCGGCTATACCCCTACAGCGAGAGCTTGTGTTAACATTTCAATACCCTTACAGTTGAGAGTTATTGATATGTTGGATGTATTTACTCCATTGTTGAAACTTTTTGCTAACGAGCCACTCGAAAGACTTATGTATACGATTATCATTTTTGGTCTCACTCTCTGGCTGATACCGAAAGAGTTTACTGTCGCATTCAATGCTTATACTGAAATACCTTGGCTCTTTCAGATTATCGTTTTTGCCTTTTCTTTCGTGGTCGCCATTTCCTTCTCAAGATTGCGAGCGCATATTCAAAAGCATTATTCATTACTACCAGAGCAACGAGTATTGCTTCGTTTATCTGAGAAAGAAATCGCTGTATTTAAAGATTTCCTTAAAACAGGAAATCTTATTATCACTTCTCCTTGCCGTAACCCGGTTATGAAAAAATTAGAACGGAAGGGCATCATTCAACATCAGAGTGATAGCGCAAACTGTTCTTATTATCTCGTCACCGAAAAATACTCCCATTTTATGAAGTTATTCTGGAACAGCAGGAGTAGACGTTTTAATCGTTAGCTTACTGTGTGCTTCTCCAACCATCGGCGCGCACCAGTTTCGGTTTTAAATGTTTTGCTTTTGGTATACGTCATGGCAGTGAACGTTCCATCCTGGTTGGGGAACACGCCGCACACCAGGGATTCGTTGTTGCCGAGGTCGATTTTTTGCATTTTTCGCACCTCACATCTTGTTGTTGCGGATAGAGGCTTCTGCTTGCCAGAGATCCCAGTCGTTGCTACGTAAAGCCTGCACAGCCGGGCTGTAAGTGATACCGCAACAATCCATCAAATACTGAACTACTTCGTAATGCACCATCTTATCTCTCCCCTTAACGCCGGGTGTCGCCTATGGCACCTCCGCAGGCAGTGCTGGCTACTGGTTTTTACAGTTGCTCGATAGAGTCACGCCCTCACAGTGGGCAGCAATAGGTGTGCTGGGTAGTCTGGTGTTTGGCCTGTTGACGTATCTGACAAATCTTTATTTCAAGATTAAAGAAGATAAGCGTAAGGCTGCACGGGGAGAGTAATTCAATGACTCAAAACTATGAACTGATTGTGAAAGGGATCCGCAATTTTGAGAATAAAGTTACGGTAACTTTAGCGTTACGGGACAAAAAACGCTTTGACGGTGAAATTTTAGACCTGGACATCTCGCTGGACCGTGTTGAAGGTGCCGCGCTGGAGTTTTATGAGGCAGCAGCCAGAAGGAGCATCAGACAGGTCTTCCTGGATGTTGCTGCAGGGTTATGTGAAGGGGATGAGCAGTCACCGGAAAAGCGCCCCATAATTTTAGAGGCGCAGGGTGTGTGGATAACCTACAAAGGAAAACTGCCGGGAAGAATTACTGGTTCACTGAAGACTCCGCCGAAATGGTAATTTCACCAGCATATTTTTCTTCCAGTAATACCGCCAGCCACTTGAAAGAATTTTGTTGTTGCTGGGACCATTTGGGGTTGAGTGATTCAAGCTGGAGCGATGCCAGTGTTGGTTGCATTTGTTCCTTGGGAATTGAGAATGCCAGATATGAAAATGCGACAGTAAGGGCATTTACATCATCCCGAAGCTTGGAAATGCAGTCGAGCAACTCCTGTAGAGAAATGGTGCTATTGTCCATAAACAATCCTCTCTATTGTATTTAACTATTCCTTGCCTGATTCAACAGGCCGGGACAGATAAACATATCCAGGGTTCAGAAACCGATAAATCCTGATAAATATCCATGAACGCAAAAATCAGATACGGCCTGTCGGCTGCCGTTCTGGCACTGATTGCCGTCGGTGCGCCTGCGCCTGATATTCTCGACCAGTTTCTGGATGAAAAAGAAGGTAACCACACAACGGCATACCGCGATGGTTCCGGTATATGGACCATCTGTCGTGGTGCCACAATGGTGGATGGTAAGCCCGTCATACCGGGAATGAGGCTGTCGAAGGAAAAATGCGACCAGGTTAACGCTATTGAACGTGATAAGGCGCTGGCATGGGTGGAGCGCAATATTAAAGTACCACTGACCGAACCACAGAAAGCGGGTATAGCGTCATTTTGTCCCTATAACATTGGCCCCGGTAAGTGTTTCCCGTCGACGTTTTATAAGCGGCTGAATGCCGGTGATCGTAAGGGTGCATGCGAGGCGATTCGCTGGTGGATAAAAGATGGTGGGCGCGATTGCCGCATACGTTCAAATAACTGCTATGGACAGGTTATTCGTCGTGACCAGGAAAGCGCATTAGCCTGTTGGGGGATAGATCAGTGAGCAGAGTCGCCGCGATTATTTATGCTCTGGTTATCTGCATCATCGTCTGCCTGTCATGGGCTGTTAATCATTACCGTGATAACGCCACCGCCTACAAAGAGCAGCGCGACAAAAATGCCAGAGAACTGAAGCTGGCGAACGCGGCAATTACTGACATGCAGATGCGTCAGCGTGATGTTGCTGCGCTCGATGCAAAATACACGAAGGAGTTAGCTGATGCGAAAGCTGAAAATGATGCTCTGCGTGATGATGTTGCCGCTGGTCGTCGTCGGTTGCACATCAAAGCAGTCTGTCAGTCAGTGCGTGAAGCCACCACCGCCTCCGGCGTGGATAATGCAGCCTCCCCCCGACTGGCAGACACCGCTGAACGGGATTATTTCACCCTCCGGGAACGACTGGTAATGATGCAGGCCCAACTTGAAGGTGCTCAGCAATACATAACCGAACAGTGTTTAAAGTAAAATCTTAACTACAATATGATTCATTTTGATGATTGTTTCATAAGGAACAGTGAAGTAAGATCTAAGAGGAGTTAAATTTTATACAGTATAATCATAATATTGCAGCAAGGTGGTTATAATTGAAAGAATATTTAGATATGAATACATCTCATGTAAGAGTTGTTACTCATATGTGTGGGTTCCTGGTTTGGCTCTATAGTCTTTCAATGTTGCCACCAATGGTTGTAGCATTGTTTTATAAAGAAAAAAGCCTATTCGTTTTCTTTATAACTTTCGTTATATTTTTTTGCATTGGTGGCGGAGCGTGGTATACAACTAAGAAATCTGGCATTCAATTACGTACCCGTGATGGGTTTATTATAATTGTAATGTTTTGGATTTTGTTTTCTGTTATTAGTGCATTCCCTTTATGGATTGACTCAGAACTTAATTTAACGTTCATTGATGCTCTGTTTGAAGGGGTTTCTGGAATAACAACAACAGGAGCAACTGTAATTGATGATGTTAGTTCATTACCTCGGGCATATTTGTACTATCGGTCACAGTTAAATTTTATAGGTGGTTTAGGAGTTATTGTTCTGGCGGTTGCTGTATTGCCATTATTGGGTATTGGTGGTGCAAAGCTTTATCAGTCAGAAATGCCGGGGCCATTTAAGGATGACAAACTCACTCCCCGCCTGGCCGATACGTCACGGACACTGTGGATAACTTATTCTTTATTAGGTATTGCTTGTATTGTCTGTTATAGACTTGCAGGAATGCCTTTGTTTGATGCTATTTGTCACGGGATTTCCACAGTTTCGCTTGGTGGTTTCTCAACTCATAGCGAGAGTATCGGATATTTTAATAACTATTTGGTTGAGCTGGTGGCTGGTTCTTTTTCCCTGCTATCGGCTTTCAACTTCACTCTTTGGTATATTGTTATTAGCAGGAAAACGATAAAACCTTTAATCAGAGATATTGAACTTCGTTTCTTTCTGTTAATAGCCTTAGGGGTGATCATTGTTACCTCTTTCCAGGTCTGGCATATAGGTATGTATGACTTGCATGGAAGTTTTATTCATTCGTTTTTTCTTGCCAGCTCCATGCTCACTGATAATGGTTTAGCTACGCAGGATTATGCAAGCTGGCCCACGCACACGATAGTGTTTTTGCTGTCGTCAAGTTTCTTTGGGGGATGTATAGGTTCAACTTGTGGTGGAATTAAGTCACTTCGATTTCTTATACTTTTCAAACAAAGCAAACACGAGATAAATCAGCTTTCTCATCCCAGAGCGTTGTTGAGTGTAAATGTAGGAGGGAAGATAGTTACAGATCGTGTAATGAGGTCTGTATGGAGTTTCTTTTTTCTTTATACTCTCTTCACGGTGTTTTTTATACTGGTGTTAAATGGTATGGGATATGATTTTCTTACATCATTTGCAACAGTGGCTGCATGTATTAATAATATGGGATTAGGTTTTGGGGCTACTGCATCGTCATTCGGAGTGCTTAATGACATTGCAAAATATTTAATGTGCATAGCTATGATTCTTGGTCGCCTTGAAATTTATCCTGTTATTATATTGTTTTCAGGTTTTTTTTGGCGCTCCTAATATATGGCTGATTTATAATTGTGAGTTTAATATTATATTGACTCACTCATTGATCCAATACCTAACTTTACCAGCAACACCTCCGCCCCCAGTAGCACTGGCTGCTGGGGTGCGTTTTATTCATAAAGCAAGGCTGTATGAGCGAGAAATTAAAGATAGTCTATCGCCCATTACAAGAATTGTCACCGTATGCGCACAACGCCAGGACGCACAGTACTGAGCAGGTGGCACAACTGGTAGAAAGTATTAAGCAATTCGGCTGGACTAATCCGGTGCTGATTGACGAAAAGGGCGAAATTATTGCGGGTCACGGTCGTGTTATGGCGGCTGAAATGCTCAAAATGGATTCTGTTCCGGTCATTGTTCTGGCCGGAGATACAACCTACAGGGAATGGTTTTTGCGTCAGCCTTACACCAGACAAAAACAGATTGTGGGGGAAACCCGGGCAAAGCTGATTCGGGATGGCGGTATGTCGCCAGATGAATTTTACACCGATAAAGGCGAATACCCGGATCGCAACAGTAAGCGGTAATACATCCCTTATAACAGCCAAAGCTGAACAACAGGGCGAGTGGACCTACTATGAGGCCACTTATACAGCTAATACGGACATTGATACCGTTAACTGTGCTTTTTATATGACAAATAAAATAAGTAATGAGCCATTCTATGATGACTCAACATTAACCATGACGACGCCGCAAATTGAACTGGGCAATACGGCATCGTCATTTATTGTAACTACAATGCCAACAACACGCGCAAGTGATGTGGTTACTATCCCCTCGGCGAATAACCTGTCAACACGGCCTTTTACAGTATTGTGCGAAGTAAGGAGGAACTGGAGTACACCGCCCAATGTTGCGCCAAGGATATTTGATGTTGGAGGGCACAGTATTGATGATAATTATTTATCGCTGGGGTTTGTTTCAACAGGAAAGATAAGCGCCAACGTAGGAATGGTTCAGCCACAAATTTCCTCAGATGGAGAAAGGTTCATTGTGGGTGTGAGAGCTAAATCTGATTTATCAGTAAATGCAATATGCAATGGTAATTATACAACAAACCTTAATGGTAAAATATTTGGAGTTACAGCAACATCGTACCGGTTTGGTGGGCAGACCGCAGCAGGAACGCGTCATTTGTTTGGACACATCAGAAATTTCAGAGTCTGGTTTAAAGAATTAAATGACAGGCAAATCAAGGAGGCAGTATGAAAGATTTAACTTTGAAATTTCCTGGTAACAGAGAGTTTAAATCCTTCCTGTCATCTCTTGACTGGGAGGAAGATGAAGACCTCCAGAATAAACTGTTAGTCGATGAAATTGGTTTCACCTACACAGAAACAGGGGTAACTGAAGAGGGAGAACCTGTCTGTATCCGGAATAACGGTTATTTTGTCAACATTCGCATTCTTGATGACTTGTTTGATGTTTCTGTATTCTCTGATTATGTCGTGGAGCTGGAAACACCGCTTCGGGAATGGAGCTGAAAGGAGGAAATAATGGATATAAGCCCCTTACTTCATGCACTTTGTGCTGTGGCTGCGCAGATACTGGTTGGTCTTTTTACCGGAAACTGGGCTTACGGAGCGATAGCCGGTTGTACGTTCTTCATTGCGCGTGAACATACCCAGGCAGAATATCGCTGGATTGAAATGTTCGGGCATGGCAAGCGAATGAATATGCCGTGGTGGGGCGGTTTTGATCCGCGCGCGTGGGATGTGGCAAGCCTGATGGATTTTGCTGTGCCGGTGGTGGCGTGTCTGCTGGTCTGGCTGTTGGTTAATCGTGGGTGAAAAAAGGTGGGCTGTATATGCAACGGAGGAAGAAACCTCGTTGCTGGAAGCATGGAAAAAGTATCGGGTATTGCTGAACCGTGTTGATACGTCAACTGCACAGGATATTGAATGGCCAGCACTGCCGTAGGGTAAAACATATAAATTCTATAATTAGATGTATCTTTCCATTTACGGCAAGGAAGGGGGCTTGGAAGACGTAAAGCATCTCACACCGAGATTATTTTTTATATGTCAGGTGTCTGAAGTTTTGCTTTGGCTCTTAAAATGGTTTGCCGCGAGGTTTTGAATTCCCGGGCAATGGCACTTATACTTACACCTGACTTAATTCGTTCGAATACCACCTGTTTCTGTTCTTCATTTAACACAGGTGGTCGACCAAAACGTTTCCCTGCGCCGCGGGCTCTTACTATCCCGGAATGAGTGCGTTCAAGTAAAAGGTCTCGTTCAAATTCAGCGACTGCTGAAATTACTTGCATCATCATTTTTCCTGTTGGACTGGTCAGGTCAATGCCCCCCAATGCTAAGCAATGCACTCTGATACCTGTTTCGGTCAGTTGTTCCACTGTTTTCCTGATATCCATTGCATTACAACCAAGGCGATCCAATTTTGTCACAATCAATTGATCACCACATTTCAGGCGAGCAAGCAACCGGTTAAAACCAGGACGCTCACTGGTTGCTGCTGAGCCACTAATGTGTTCTTCGATTATTTGCTGAGGTTTGATTTTAAAACCTGCACTTTCGATTTCCCGGCGTTGATTTTCGGTGGTCTGATCCAGCGTTGATATCCGACAGTAAGCAAAAATTTGAGACATAGTGAGACTCTATACGAAATTGGTGTTCATATCATAATGCATCTCAGAAAATAATTATGATTATTTTTGTGCATATTTGTATGTACACGTTCGAAAATAAACGAATGCGTATGCAACCCCGTAATTTTGGTGAGACCCAAAATCGATTTTGTGAAAAATGGCTTTAACTCGGTTTGTTTTTCGAGTTCCGGGCGGACTCAAGGAAGAAGAATAGTGTTGCGTGTTATTTTAACCAGATTTCAAGTTGTTTGGTCGTGGAAAAGTGGAGCAAAATGTTGTTAAAGTGGAAAAATGATAAAAAAGTAAGTTTATTATATTACATTTTACCATTTAAATTTTGGTTGTCTTTAAGAACTGATATCGCTGTTTGTAATAATTCTTTGTTATCCAGCCATGATTTTTTCTTTATGTTTCCTTCAATGTAATCAAGCAATGTTCTGGTATTGATAGGTCTTCCCTGTTTTGCTACTTCCACTACAGCATCCCCTAGGATAATTCTTACTTCAGGAAGCTGCGCAGGGAACCACTTTAGGGTGTCTTTTGATTTCATGAAGATATTCCTTAAAATATTATTGATTTTCATTGCGATATTGTATGTCTGATTCAGGATATGTTGACTTATACATCGGTTTTGTCTGGGTTATTGGATATGCCAATCCCTAATTTTATTAGGGCATGACTAAAAATGCTGAATATGATAAGGAGAGAAGTGATTATCAGTATGCTGTTCATATAGCCTCGAATTAGTAATGTGTTATATATGATATAGTTGACAATTTTTATCTTGGGTGTTCTTAAAGTTCGTAGATAAACATTGTCGTTTCAGGTATACAGGAATGCTAACAGGTGGCAGCAAAAATCAGGCGGTTTATGGCGCAAGCTGAAGCGGCAACTGCAAACTATCTTATGCAGAGACTCTACACGGATTGGGTTTAAAAGTATACATAGATAACAGTTTTTATCTGAAGAAGAAAAATATCAAGGTGATATAGCCTATATGCCTATGATGCGGAGGAATGAATGTGATGGGAGTGATGTATCTGAATAGTTGAAAAACCGCAGTCACGTCGTATGCAAGAACGTGCTGCGGTTGGTTTGACTTTGATTGAGACGTTTTGGAATTTTTTTTTGTGGCAAAAATGGGGCAAAACGCTGCAAAAGGGGCAAAAAAGGGGCAAAAAAAGAGTGGATTATCGTAGTTTATTGTTCTCGCTGATGATGTTTAACACATTGAAAAATAAGTAAAACACTTATGAGTCAGATAGTTGTGATTTTTGCCCTTACTTGTTCAGGTTGTATTGTTCTTTCTTACTAATTTCTTGATTTTGCGACATTTAAAAGCGACTCAATTCGTTATATGGCATCAGAAGAGTATGCGTCATGCCGGAACGCCCAGCATAAGAAATCTGATATAAAAAACTGTGGCGTGTATGGTACGGATTAGAGGGGAAAATGTCAGCACATTTGCGAAATGAATCAAAAAGCCCGCAGCAATGTGCGGGCGTTAGTGTCAGCGCACAACCAGCACGGAGCACTCTGCGTGACGCACTACAGCTGCGGCGTTGGAACCGAGCAGATAAGTGGTGATATCCGGTCGATGGGAAGCAATGATGATCATATGAGCGGGGATCTTCTTCGCCAATTCCAGAATGCGGTCTTTGGGCGAGCCTTCCTCAACATGGACATGCACTCTGTCGGTTGGCAGTTTAAATTTTTTAATGATCTCTTCCAGTTGCGATTTGGCTTCCGCTTTCAGGTCATCCATTGCCGGTAATTCTGCGGAATACGCTAAACCCAGAGAGGCATAGTAGGGCAGTGAAGGTATTACCGTCAGGAAATGAACCTCTGCATCATCAATCTTTGCCTCTTCCTCAACGTGGCTAATCACGCGTTGAGTTAATTCTGAATCGGAAATATCGATAGGGACAAGAATCGTTCTGTTCATAAAACCTCCTGTTTTAGTATCCGCATAAAGTGTAACGCCAGATGACACTTTTTGTGTAATGACGGAGTTCACATTTTTAATTTAGATCAAAGGAGGAAGAATAAGCAGAAAAAGCCCGCCATAACAGCGGGCAGGAGGATTTAGAACTGATAAACCAGACCTAAAGCGACAATATCATCGGTAGAGATGCCATTGGCAGCGTAGAAGCTGTCATCTTCATCCAACAGGTTGATTTTATAGTCAACGTAGGTGGACATGTTTTTATTGAAATAGTAAGTCGCGCCAACATCGGCGTATTTAACCAGATCTTTATCATCAACACCTGCCGGGTTGTCTGCACCACCCGCAGCGTGCAGGTCACGGCCTTTAGACATCAGGAAAGAGACTGCCGGACGCAGACCAAAATCAAACTGGTACTGTGCAGTGACTTCAAAATTCTGGGTTTTGTTTGCCACAGCATAATCGCTGTCGCCAAACGGGGTCATATTACGCGTTTCTGAATACATGGTTGCCAGGTAAATATTGTTAGCATCGTATTTTAGCCCAGCAGTCCACGCGTCTGCTTTATCACCACCCGCCGCAGTATGGTTAACCTGGTCATTGGTGCGGTCAGAAGAGGTGTATGCCGCACCAGCGCTAAAGCCCATGCCTAAATCATATGTTGTGGAAAGACCCCAGCCGTCACCGTTTTCATGGCGAACATCACGTCCGTTGTTGGTGCCTTCCTGACCATTACTGGCTCCTTCGTTGTTACCTTGATACTGCACCGCGAAGTTCAGACCATTTACCAGACCGAAGAAATCAGTATTACGATAAGTCGCGACGCCATTGGCTCGACCAGTCATAAAGTTGTCTGCATTGGTATAAGAGTCACCGCCAAATTCAGGCAGCATATCGGTCCAGCCTTCGATGTCGTACATTACGCCATAATTACGTCCGTAATCGAAAGAACCGTAATCTGCAAATTTCAGCCCGGCAAATGCCAGACGGGTCCATGACTGGTTTTTTGAAGATTCAGTGTTGTTTGCCTGAATATTGTATTCCCATTGACCGTAGCCAGTGAGTTGATCGTTAATTTGGGTTTCGCCTTTAAAACCCAGACGCGCATAGCTCTGGTCGCCATCTTTCGCTGAATTATCAGAAAAATAATGCAGGCCATCAACTTTGCCATACAGATCTAATTTGTTGCCGTCTTTATTATAAACTTCGGCTGCATGTGCAGCACCTGCGGCGAGCAGGGCAGGAATTAAAAGTGCCAGTACTTTGCTTTTCAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP020368|1373913:1394635|1390825_1391416_-|WP_000078178.1|DBSCAN-SWA MSQIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARLKCGDQLIVTKLDRLGCNAMDIRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMQVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQVVFERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI >NZ_CP020368|1373913:1394635|1386209_1386743_+|WP_000992105.1|DBSCAN-SWA MNAKIRYGLSAAVLALIAVGAPAPDILDQFLDEKEGNHTTAYRDGSGIWTICRGATMVDGKPVIPGMRLSKEKCDQVNAIERDKALAWVERNIKVPLTEPQKAGIASFCPYNIGPGKCFPSTFYKRLNAGDRKGACEAIRWWIKDGGRDCRIRSNNCYGQVIRRDQESALACWGIDQ >NZ_CP020368|1373913:1394635|1392487_1392661_+|WP_001157925.1|DBSCAN-SWA MQERAAVGLTLIETFWNFFLWQKWGKTLQKGQKRGKKRVDYRSLLFSLMMFNTLKNK >NZ_CP020368|1373913:1394635|1392926_1393361_-|WP_001300461.1|DBSCAN-SWA MNRTILVPIDISDSELTQRVISHVEEEAKIDDAEVHFLTVIPSLPYYASLGLAYSAELPAMDDLKAEAKSQLEEIIKKFKLPTDRVHVHVEEGSPKDRILELAKKIPAHMIIIASHRPDITTYLLGSNAAAVVRHAECSVLVVR >NZ_CP020368|1373913:1394635|1381334_1382144_-|WP_000166319.1|DBSCAN-SWA MTKQPPIAKADLQKTQGNRAPAAVKNSDVISFINQPSMKEQLAAALPRHMTAERMIRIATTEIRKVPALGNCDTMSFVSAIVQCSQLGLEPGSALGHAYLLPFGNKNEKSGKKNVQLIIGYRGMIDLARRSGQIASLSARVVREGDEFSFEFGLDEKLIHRPGENEDAPVTHVYAVARLKDGGTQFEVMTRKQIELVRSLSKAGNNGPWVTHWEEMAKKTAIRRLFKYLPVSIEIQRAVSMDEKEPLTIDPADSSVLTGEYSVIDNSEE >NZ_CP020368|1373913:1394635|1387341_1388799_+|WP_001097897.1|DBSCAN-SWA MNTSHVRVVTHMCGFLVWLYSLSMLPPMVVALFYKEKSLFVFFITFVIFFCIGGGAWYTTKKSGIQLRTRDGFIIIVMFWILFSVISAFPLWIDSELNLTFIDALFEGVSGITTTGATVIDDVSSLPRAYLYYRSQLNFIGGLGVIVLAVAVLPLLGIGGAKLYQSEMPGPFKDDKLTPRLADTSRTLWITYSLLGIACIVCYRLAGMPLFDAICHGISTVSLGGFSTHSESIGYFNNYLVELVAGSFSLLSAFNFTLWYIVISRKTIKPLIRDIELRFFLLIALGVIIVTSFQVWHIGMYDLHGSFIHSFFLASSMLTDNGLATQDYASWPTHTIVFLLSSSFFGGCIGSTCGGIKSLRFLILFKQSKHEINQLSHPRALLSVNVGGKIVTDRVMRSVWSFFFLYTLFTVFFILVLNGMGYDFLTSFATVAACINNMGLGFGATASSFGVLNDIAKYLMCIAMILGRLEIYPVIILFSGFFWRS >NZ_CP020368|1373913:1394635|1378363_1379299_-|WP_001157407.1|tRNA|DBSCAN-SWA MQENQQITKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGFPEHVLPEYLEKLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIQRFADAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIETMFSAMQNVVPSHLCDTNLFDFKGITHGSEVVNGGDLAFDREEIPLQPAGWQPEEDENQLDELRLNVVEVK >NZ_CP020368|1373913:1394635|1380587_1380803_-|WP_000079604.1|DBSCAN-SWA MAQVIFNEEWMVEYGLMLRTGLGARQIEAYRQNCWVEGFHFKRVSPLGKPDSKRGIIWYNYPKINQFIKDS >NZ_CP020368|1373913:1394635|1388936_1389332_+|WP_012775984.1|DBSCAN-SWA MSEKLKIVYRPLQELSPYAHNARTHSTEQVAQLVESIKQFGWTNPVLIDEKGEIIAGHGRVMAAEMLKMDSVPVIVLAGDTTYREWFLRQPYTRQKQIVGETRAKLIRDGGMSPDEFYTDKGEYPDRNSKR >NZ_CP020368|1373913:1394635|1383687_1383858_-|WP_001352098.1|DBSCAN-SWA MTKKIKCAYHLCKKDVEESKAIERMLHFMHGILSKDEPRKYCSEACAEKDQMAHEL >NZ_CP020368|1373913:1394635|1385304_1385517_+|WP_012775985.1|lysis|DBSCAN-SWA MLSLPLTPGVAYGTSAGSAGYWFLQLLDRVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >NZ_CP020368|1373913:1394635|1393501_1394635_-|WP_000837924.1|DBSCAN-SWA MKSKVLALLIPALLAAGAAHAAEVYNKDGNKLDLYGKVDGLHYFSDNSAKDGDQSYARLGFKGETQINDQLTGYGQWEYNIQANNTESSKNQSWTRLAFAGLKFADYGSFDYGRNYGVMYDIEGWTDMLPEFGGDSYTNADNFMTGRANGVATYRNTDFFGLVNGLNFAVQYQGNNEGASNGQEGTNNGRDVRHENGDGWGLSTTYDLGMGFSAGAAYTSSDRTNDQVNHTAAGGDKADAWTAGLKYDANNIYLATMYSETRNMTPFGDSDYAVANKTQNFEVTAQYQFDFGLRPAVSFLMSKGRDLHAAGGADNPAGVDDKDLVKYADVGATYYFNKNMSTYVDYKINLLDEDDSFYAANGISTDDIVALGLVYQF >NZ_CP020368|1373913:1394635|1385005_1385161_-|WP_001169151.1|DBSCAN-SWA MQKIDLGNNESLVCGVFPNQDGTFTAMTYTKSKTFKTETGARRWLEKHTVS >NZ_CP020368|1373913:1394635|1376861_1378235_+|WP_000123737.1|DBSCAN-SWA MTAFSTLNVLPPAQLTNLNELGYLTMTPVQAAALPAILAGKDVRVQAKTGSGKTAAFGLGLLQQIDASLFQTQALVLCPTRELADQVAGELRRLARFLPNTKILTLCGGQPFGMQRDSLQHAPHIIVATPGRLLDHLQKGTVSLDALNTLVMDEADRMLDMGFSDAIDDVIRFAPASRQTLLFSATWPEAIAAISGRVQRDPLAIEIDSTDALPPIEQQFYETSSKGKIPLLQRLLSLHQPSSCVVFCNTKKDCQAVCDALNEVGQSALSLHGDLEQRDRDQTLVRFANGSARVLVATDVAARGLDIKSLELVVNFELAWDPEVHVHRIGRTARAGNSGLAISFCAPEEAQRANIISDMLQIKLNWQTPPANSSIATLEAEMATLCIDGGKKAKMRPGDVLGALTGDIGLDGADIGKIAVHPAHVYVAVRQAVAHKAWKQLQGGKIKGKTCRVRLLK >NZ_CP020368|1373913:1394635|1385521_1385866_+|WP_000193293.1|DBSCAN-SWA MTQNYELIVKGIRNFENKVTVTLALRDKKRFDGEILDLDISLDRVEGAALEFYEAAARRSIRQVFLDVAAGLCEGDEQSPEKRPIILEAQGVWITYKGKLPGRITGSLKTPPKW >NZ_CP020368|1373913:1394635|1373913_1375146_-|WP_001301114.1|DBSCAN-SWA MITHNFNTLDLLTSPVWIVSPFEEQLIYANSAAKLLMQDLTFSQLRTGPYSVSSQKELPKYLSDLQNQHDIIEILTVQRKEEETALSCRLVLRKLTETEPVIIFEGIEAPATLGLKASRSANYQRKKQGFYARFFLTNSAPMLLIDPSRDGQIVDANLAALNFYGYNHETMCQKHTWEINMLGRRVMPIMHEISHLPGGHKPLNFVHKLADGSTRHVQTYAGPIEIYGDKLMLCIVYDITEQKRLEEQLEHAAHHDAMTGLLNRRQFYHITEPGQMQHLAIAQDYSLLLIDTDRFKHINDLYGHSKGDEVLCALARTLESCARKGDLVFRWGGEEFVLLLPRTPLDTALSLAETIRVSVAKVSISGLPRFTVSIGVAHHEGNESIDELFKRVDDALYRAKNDGRNRVLAA >NZ_CP020368|1373913:1394635|1383857_1384079_-|WP_000560223.1|DBSCAN-SWA MIAHHFGTDEIPRQCVTPGDYVLHEGRTYIASANNIKKRKLYIRNLTTKTCITDRMIKVFLGRDGLPVKAESW >NZ_CP020368|1373913:1394635|1383337_1383613_-|WP_000632297.1|DBSCAN-SWA MITNYEATVVTTDDIVHEVNLEGKRIGYVIKTENKETPFTVVDIDGPSGNVKTLDEGVKKMCLVHIGKNLPAEKKAEFLATLIAMKLKGEI >NZ_CP020368|1373913:1394635|1385831_1386104_-|WP_000370546.1|DBSCAN-SWA MDNSTISLQELLDCISKLRDDVNALTVAFSYLAFSIPKEQMQPTLASLQLESLNPKWSQQQQNSFKWLAVLLEEKYAGEITISAESSVNQ >NZ_CP020368|1373913:1394635|1390013_1390292_+|WP_000654171.1|DBSCAN-SWA MKDLTLKFPGNREFKSFLSSLDWEEDEDLQNKLLVDEIGFTYTETGVTEEGEPVCIRNNGYFVNIRILDDLFDVSVFSDYVVELETPLREWS >NZ_CP020368|1373913:1394635|1391732_1391966_-|WP_000836768.1|DBSCAN-SWA MKSKDTLKWFPAQLPEVRIILGDAVVEVAKQGRPINTRTLLDYIEGNIKKKSWLDNKELLQTAISVLKDNQNLNGKM >NZ_CP020368|1373913:1394635|1379350_1380586_-|WP_000040858.1|integrase|DBSCAN-SWA MSKLPTGVEIRGRYIRIWFMFRGKRCRETLKGWEITNSNIKKAGNLRSLIVHEINSGEFEYLRRFPQSSTGAKMVTTRVIKTFGELCDIWTKIKETELTTNTMKKTKSQLKTLRIIICESTPISHIRYSDILNYRNELLHGETLYLDNPRSNKKGRTVRTVDNYIALLCSLLRFAYQSGFISTKPFEGVKKLQRNRIKPDPLSKTEFNALMESEKGQSQNLWKFAVYSGLRHGELAALAWEDVDLEKGIVNVRRNLTILDMFGPPKTNAGIRTVTLLQPALEALKEQYKLTGHHRKSEITFYHREYGRTEKQKLHFVFMPRVCNGKQKPYYSVSSLGARWNAAVKRAGIRRRNPYHTRHTFACWLLTAGANPAFIASQMGHETAQMVYEIYGMWIDDMNDEQIAMLNARLS >NZ_CP020368|1373913:1394635|1390304_1390598_+|WP_000355360.1|DBSCAN-SWA MDISPLLHALCAVAAQILVGLFTGNWAYGAIAGCTFFIAREHTQAEYRWIEMFGHGKRMNMPWWGGFDPRAWDVASLMDFAVPVVACLLVWLLVNRG >NZ_CP020368|1373913:1394635|1380881_1381091_-|WP_000276809.1|DBSCAN-SWA MYKITATIEKEGGTPTNWTRYSKSKLTKSECEKMLSGKKEAGVSREQKVKLINFNCEKLQSSRIALYSN >NZ_CP020368|1373913:1394635|1386959_1387145_+|WP_001228696.1|lysis|DBSCAN-SWA MRKLKMMLCVMMLPLVVVGCTSKQSVSQCVKPPPPPAWIMQPPPDWQTPLNGIISPSGNDW >NZ_CP020368|1373913:1394635|1381083_1381278_-|WP_001317028.1|DBSCAN-SWA MRYDNVKPCPFCGCPSVTVKAISGYYRAKCNGCESRTGYGGSEKEALERWNKRTTGNNNGGVHV >NZ_CP020368|1373913:1394635|1384520_1385009_+|WP_001312793.1|DBSCAN-SWA MLDVFTPLLKLFANEPLERLMYTIIIFGLTLWLIPKEFTVAFNAYTEIPWLFQIIVFAFSFVVAISFSRLRAHIQKHYSLLPEQRVLLRLSEKEIAVFKDFLKTGNLIITSPCRNPVMKKLERKGIIQHQSDSANCSYYLVTEKYSHFMKLFWNSRSRRFNR >NZ_CP020368|1373913:1394635|1375400_1376384_+|WP_000387388.1|DBSCAN-SWA MEAIKGSDVNVPDAVFAWMLDGRGGVKPLENTDVIDEAHPCWLHLNYVHHDSAQWLATTPLLPNNVRDALAGESTRPRVSRLGEGTLITLRCINGSTDERPDQLVAMRVYMDGRLIVSTRQRKVLALDDVVSDLEEGTGPTDCGGWLVDVCDALTDHSSEFIEQLHDKIIDLEDNLLDQQIPPRGFLALLRKQLIVMRRYMAPQRDVYARLASERLPWMSDDQRRRMQDIADRLGRGLDEIDACIARTGVMADEIAQVMQENLARRTYTMSLMAMVFLPSTFLTGLFGVNLGGIPGGGWQFGFSIFCILLVVLIGGVALWLHRSKWL >NZ_CP020368|1373913:1394635|1392034_1392148_-|WP_120795384.1|DBSCAN-SWA MNSILIITSLLIIFSIFSHALIKLGIGISNNPDKTDV |
29 | Escherichia_phage(34.78%) | tRNA,integrase,lysis | attL 1374739:1374753|attR 1396910:1396924 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1560932 : 1576555
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP020368|1560932:1576555|DBSCAN-SWA TTTACTTCACGGAATATTTTGCCACGGTCGCTTTCGCGCCATGCGCTAATAAAGACAAGTACGCTTCCGTCACTTTTGCAGTAAACAAGCTATTGTCTGGCAAATCATCACCAAAGATTGCCTTAATCGCCAGCAATGACTGGACGCGCGCTTTCCCTTCGGCACTACTTTGTACAGCCTTCTGAATAACAGGTAACAGTGGGTCACTGATTTCTATCGGATTTTCCTGTTCATCAACACCACCGACATAACGCATCCAACCCGCGACGCCCAGCGCCAGCAGATCGAACTTGCTGTCATGCGCCAGATGCCAGCGAACAGAATCCAACATCCGCTGTGGCAATTTCTGGCTGCCATCCATCGCAATCTGCCAGGTTCGATGACGTAGCGCCGGGTTGCTATAGCGTGCAATTAATCGGTTAGCGTAATCTTGCAAATCAACGCCCTGCACTTTCAGCGTCGGCGCTTGTTCCTGCAACATCAAGCCATACGCCGCATGACGATAATGTTCATCTTCCATACAGTCATTAATGTGCTGATATCCAGCAAGATAACCCAGATACGCCAGGAATGAATGACTGCCGTTGAGCATGCGCAACTTCATCTCTTCATAAGGCAGTACATCGCTAACCAGTTCTGCTCCCGCTTTTTCCCATTCCGGACGTCCGGCAACAAAATTATCTTCTATTACCCACTGGCGGAAAGGTTCACAGGCAACGCCAGCAGGATCGCGCACACCGGTAAGTTGTTCGATTTTCGCAAGAGTATCCTCTGTCACTGCGGGCACAATACGGTCCACCATTGTTGATGGGAAAGTCACGTTATATTCGATCCATTGTGCCAGTTTTACATCAACGGCTTGCGCGTAGGAAGTGACAACGTCACGCATAACATGACCGTTTTCTGGCATGTTGTCACATGACATAACGGTAAATGCGGGAAGTCCTGCCGCTTTACGGCGAGCCAGCGCCTCAACAATCACCCCTGTTGCTGTTTTCGGCTGGTGGGGATTTTGCACGTCGGCAGCGACCATCGGGTGATCGAGCATTAACTGTCCGGTCGCCGGAGAGTGGAAATACCCTTTTTCGGTGATTGTCAGAGAGACAATCGCAATTTGCGGTTCACACATCGCAGCCAGCACGGTTTCTAAGCCATCTATTTGTACGTGCAAGGCTTTTTTAACGACGCCAACGACGCGAGCCGTCCACGCGTCGGCCGACATTTCCGCAACGGTATAAAGATTATCTTGCTGTTGTAAATCGGCAATTTGCTGTTCGCCGCCGATTAAGTTGACCTCATAATATCCCCAGTCACTGAAATGTTCCGTAGCAAGAATATCGGCATACACGCCCTGATGCGCACTGTGAAATGCACCAAAGCCTAAATGAACAATTCTTGGGGCCAGGTTATTACGATCATAAACAGGGAGTGTCGCTTTTGCTGATAACAAATTATTTCCCATAACAATTCCTTAAATATAAATATGGCAAGCTATATGTTTTGTTATATGAATAAAAATCCCCTCTCCGGTAAGAGAAGGGATTAAGGGTTTACAGACTTCTGGAAGGTTGCGCAGCTCTTACAACACGCGGTTGATCTTCCGCAGCGTCTTCCAGCGCACTTAAATCACGGTCTTTCACCTCTGGCATTTTCAGCGCAGAGATTAAACCAATCACTGAATATGCCATGATCATAATGGCGATCGGATACCAGGATTCCGTCATGGTGCAGAAAATACCCGCCAGGATAGGACCAAAACCGGAAGCGATAAGACCACCAATTTCTTTAGAAATAGCCATCCGGGTAAAGCGGTTTTTACAGCCGAACATTTCTGCCATGGTAATGTTTTCCAGAGCAAATAATCCCAGCACCGCACAGTTATGAATCACAATCAGTGCAACCATAATGGTGCTCGGGGCATAGCTTTTATCTACAATGATAGAAAGCATTGGCCATGCCAGCACAATCGCGGAGGTATTCATAATAATATACGGGATCCGGCGACCAATTTTATCGGATAACCAACCAAGGAACGGAATGGTCATAAAGCCGAGAATCGAACTAATCATCAATGCATCTGTTGGAATTGCTTTGTTAAACAATAACGTCTGCACTAAATAGCCTGCAAGGAAAGTCTGAATTAACCCGGAGTTACCCGCCTGACCAAAACGCAGCCCTGTTGCCAGCCAGAAGGATTTGCTCTGGAACATGCTACCAGCAGGTGCAGGTTTTGCTGTGGGTTGGTTGCTGTCATTAACCTTCTCAAAGACCGGGCTTTCTTTCAGATTCATACGTAACCAGATAGCAAAGACCATCACGACAACGCTCGCCAGGAACGGTATACGCCATCCCCACGCCAGCAGTTCCTCTTTACTGAGAATGAAGAACATAAAGGCCCAGATTGCCGTTGCGCTCAAGGTTCCGCAGTTAGTTCCCATAGCTACAAATGAGGAGATAATTCCGCGCTTACCTTTTGGTGCATATTCCGCCAGCATCGTACCGGCACCGGAAATTTCCGCACCTGCACCCAACCCCTGAATAATACGCAGCGTCACCAGCAAGATGGGGGCAAAAACACCAATCTGTGCATAGGTCGGTAACACACCAATTAAGGTGGTACAGATCCCCATCATGGTGATGGTAATAAAGAGCACTTTTTTACGCCCTATTCTGTCGCCCATTTTGCCGAAAATAAATGCTCCGACAATACGCGCCACATAACCTGCACCGTAGGTTCCCATTGCCAAAATTAACGCCATTGCCGTTGATGATTCAGGAAAAAATATTTCATGAAACACTAACGCTGCGCCGAGCGAATATAACTGGAAATCCATAATTCACAGGTGTTTTTTCCCATCCTGTGGTTCCCTTGGCGTTTTCTAGGTTTTTTCAGATAGTTGCATTTTTTAAAAAGCATCCTGAGTTCGATCTCAGTGTCTATCTGGGGCCTATTTCTGTCCCATATATGCCCCAAAAAAACTCCCCAACAGATAAGTGGTTTTTTCATGGATTTATGCGTAAAATCAAGAACGGCTGGAAATCATTCAATACACACACTATCGAAAAATTTACCAGCCAACCGCAGCACGTTCTTGCATAAGGCGTGTCTGCGGTTTTTCAACTATTCAGATACATCACTCCCATCACATTCATTCCTCCGCATCAAAGGCATATAGGCTATATCACCTTGATATTTTTCTTCTTCAGATAAAAACTGTTATCTATGTATACTTTTAAACCCAATCCGTGTAGAGTCTCTGCATAAGATAGTTTGCAGTTGCCGCTTCAGCTTGCGCCATAAACCGCCTGATTTTTGCTGCCACCTGTTAGCATTCCTGTATACCTGAAACGACAATGTTTATCTACGAACTTTAAGAACACCCAAGATAAAAATCGTCAACTATATCATATATAACACATTAATAATTCGAGGCTATATGAACAGCATACTGATAATCACTTCTCTCCTTATCATATTCAGCATTTTTAGTCATGCCCTAATAAAATTAGGGATTGGCATATCCAATAACCCAGACAAAACCGATGTATAAGTCAACATATTCTGAATCAGACATACAATATCGCAATGAAAATCAATAATATTTTAAGGAATATCTTCATGAAATCAAAAGACACCCTAAAGTGGTTCCCTGCGCAGCTTCCTGAAGTAAGAATTATCCTAGGGGATGCTGTAGTGGAAGTAGCAAAACAGGGAAGACCTATCAATACCAGAACATTGCTTGATTACATTGAAGGAAACATAAAGAAAAAATCATGGCTGGATAACAAAGAATTATTACAAACAGCGATATCAGTTCTTAAAGACAACCAAAATTTAAATGGTAAAATGTAATATAATTAACTTACTTTTTTATCATTTTCCACTTTAACAACATTTTGCTCCACTTTTCCACGACCAAACAACTTGAAATCTGGTTAAAATAACACGCAACACTATTCTTCTTCCTTGAGTCCGCCTGGAACTCGAAAAACAAACCGAGTTAAAGCCATTTTTCACAAAATCGATTTTGGGTCTCACCAAAATTACGGGGTTGCATACGCATTCGTTTATTTTCGAACGTGTACATACAAATATGCACAAAAATAATCAAAATTATTTTCTGAGATGCATTATGATATGAACACCAATTTCGTATAGAGTCTCACTATGTCTCGAATTTTTGCTTACTGTCGGATATCAACGCTGGATCAGACCACCGAAAATCAACGCCGGGAAATCGAAAGTGCAGGTTTTAAAATCAAACCTCAGCAAATAATCGAAGAACACATTAGCGGCTCAGCAGCAACCAGTGAGCGTCCTGGTTTTAACCGGTTGCTTGCTCGCCTGAAATGTGGTGATCAATTGATTGTGACAAAATTGGATCGCCTTGGTTGTAATGCAATGGATATCAGGAAAACAGTGGAACAACTGACCGAAACAGGTATCAGAGTGCATTGCTTAGCATTGGGGGGCATTGACCTGACCAGTCCAACAGGAAAAATGATGATGCAAGTAATTTCAGCAGTCGCTGAATTTGAACGAGACCTTTTACTTGAACGCACTCATTCCGGGATAGTAAGAGCCCGGGGCGCAGGGAAACGTTTTGGTCGACCACCTGTGTTAAATGAAGAACAGAAACAGGTGGTATTCGAACGGATTAAGTCAGGTGTAAGTATAAGTGCCATTGCCCGGGAATTCAAAACCTCGCGGCAAACCATTTTAAGAGCCAAAGCAAAACTTCAGACACCTGACATATAAAAAATAATCTCGGTGTGAGATGCTTTACGTCTTCCAAGCCCCCTTCCTTGCCGTAAATGGAAAGATACATCTAATTATAGAATTTATATGTTTTACCCTACGGCAGTGCTGGCCATTCAATATCCTGTGCAGTTGACGTATCAACACGGTTCAGCAACACCCGATACTTTTTCCTGGCTTCCAGCAACGGGGTTTCTTCCTCCGTTGCATATACAGCTCACCTTTTTTCACCCACGATTAACCAACAGCCAGATCAGCAGACACGCCACCACCGGCACAGCAAAATCCATCAGGCTTGCCACATCCCAAACACGCGGATCAAAACCGCCCCACCACGGCATGTTAATCCGCTTGCCATGCCCGAACATTTCAATCCAGCGATATTCTGCCTGGGTGTGTTCACGCGCAATGAAGAACGTACAACCAGCTATCGCCCCATAAGCCCAGTTTCCGGTAAAAAGACCAACCAGTGTCTGCGCAGCCACAGCACAAAGCGCATGAAGAAAAGGTGTTATATCCATTACTCCTCCTTTATCTAATATCGCTTCGGGAAGTTAATAACAACTTTAATTCTGACTCAAGTTCATCAACTCTTTCAGTCAGTTTCTGGATATGGTGAATCAGTGGAACAACCAGACGTTCGTACATTACACCTTCGGCAACAAGGCCATTGCTGGAAATAGCTTCAGGAGCATCATCTTCGTTAGCTGGTCGCCAGTGTACAAACTGAGGAGCAATTTCTCCTACTTCCTCGGCAATCAATCCGTAGAATCCCCAGTCACGCCTGTCATTTTCGCATTGCGACCTGTACCACACAGGGCGCATCCTGAAAATGAGATCGGCGTGCTCTGAATCTATTGTCTCTACTGAATGTTTATAGCGGATAGACGATGTTGACCGCAGCACAGACGAAATTGCAGGGTCAGGATTAAGATAAAGGTTTGCCGCCGCTGTAGTCGTGGCCAAATTCCATAAATAAAACGCTTCACGACCTGTCAGTGGATAAAAATCTCCGCCATAACGACCACTTTCCAGATCGTTCACTTCCACTTTGTTTTTCAGCTTATTATCAACTTCAGTTTTTGTGTATCTGGAGCTGATATCCTGCTTTGCACTGGTCATCTCAGTCTGAAGCGTTGATACTTTTCTGCTAATTGATGAAATATCTTCCTTAGCTTTACTGACATCCCCCTTTAGCTTGGTGATATCTCCTGGAATTACTGTCGATGTAGCCATTTTCCTGCCTCACATCCAGCCACGAAGTTGATGCTCAACAACAACCGCGTATTTATCGAATATTGACGATTTTTTTGCATCATTAATGATGCGCACGTTTACAAAATATCCGTCTTCCTTAACACATACCGGCTCGCCATCTTCAGTAAGTTCTCCGGTTTCTTTGTACACGTTACCTATCACGTCAATAAGAATATCATCCTGCATCGACTCGTCATCATAATAGCCAGCACTCTTCATAAAGGCCGAAAAGTCGGCCTTGTCTGTGAATTTGATAATTAAATCTCGCATTATATTATCTCTCCCATTTGCGCATCGGTTAATTCTTTATGCCATATTCTAAAATTCCTGATATAACCAAATAAATGACGTAAGCCGGCTGTAGTCTGGCCTCCAATCCTTATACCTGTAGCACCACTAACACTCTTCCACGTTGTTGATGCTGGTGTTGATAAAACCCCATTGCAGCCAGCCTGAAACTTGCCATCGCTTTTTATTCTATATACAGCGATAATCTTTCTTGGGTAATTTCCAGGTGGAAATGATATTGGATTACCAAATTGAACATATGGAGTTCCTACACTACCTGCTGTATTTCTGAATGCAAAAACAAAAGCGTCGTCTTTTGTTTTAAATCCTGTTATATCATAAATTCTAGGCGCGGAATTTGGTGCTATAGACCAATTTTTATTCACTTCAACCATGCAGGTTAAAGGTTTGCTGGCCATGTTATTTTTACAAGGCAATGTAACAATATCGCTTGCTCGCGTTGATGCAACTGTATCGGAAATGATAAAAGATGATGCGCATGAGCCGCCTTCAAATTGTGGTGTCGTAACATCCAAATATTCCCCGCCCAAATAAGTTCCAGGTGAAACAGGTGGAGCAATTTGGATCTGAGAGCCAACCATATTATCAATAGCTTCAGATTGATATGTAACCTCGAAATAAATCCATCCATTAGATTCTTTCTCCGCTCGGGCGGTTATTCTTGCAGCCCCTCCTCCCGTTTTCCTAATGGACATATCTGTTATATTAAGATATGCATCTGCCAGATAAAAATATGCGGACCCGTCATATCTTTCAAATCTTATACGACATTGTATATTAGCACTATCGCTTTTTACACGACATGATGTTGTAACATACTTTTCAGTTCCAGTAACATCGACACCTCTTCCTCCTGAAACCGCAGCGATATTCATTGTCGTATTTGTCCCAATTTTTTCGTCCTTTATTTGAAACCGCCCATAAGTAAAGCCAAACTCATCAACACCACTTTCTGTTATAGTTACACTTGTTGTAGCTCCCCACTTAGATGGCGTAAGTGAATTAAGATGATAGTTGGTTCTTTGTCCTTCAATCAGCAAACCTTCTCGTTCAAATCGCGGTTCGTCAACTTCAGCAACGGTTAATACTCCTGATTTATTAATGTATGTTGCACCTGATGCGCGTTTGAAGCTAACAACCTTATCGGATGGCATTTTAATAACATCATCACCTACAGTTATTTGCTTATAACCAGGCGAAAAGCCAGCAAGCATATCCAGTGAATCGTTAAACGGTATCCACACATCGGGAAGCGGCTGCAAAACTTGTTTATACGGCTCCGCAGCCTGGCTTGCATACTCTCTGGCTGCGTCTTCACTTGCTTTTGCAGCCGTCTGGCTTGCAGCGGATGCTTTCGCCGAGTTAGCCGCCGCAGTCTCGCTCGCCTTTGCGTTGGTTTCACTGGTTTTTGCAGCTTTTTGACTGTTGGCTGATGCAGTGGCAGAAGAAGCCGCCGCACTTGCAGAACCAGCTGCGGCACTCTCGCTTTGGGCCGCTGCATCCTGACTGTTTTTCGCCGCAGTTTCGCTGGCTTTGGCATTCGTTTCGCTGGTCTTCGCTGCCGTCTGGCTGGACTTTGCGTTAGTTTCGCTCGTCTTCGCTGCTTTCTGGCTGTTAGCTGCAGCAGTTGCTGATCCAGCTGCTGAAGTCGCAGAACCGGCTGCCGCACTCTCGCTTTCGGCTGCTGCAGCCTGACTGCTTTTCGCCGCAGTTTCACTGGTTTTGGCATTCGTTTCGCTTGTTTTCGCTGCCGCCTGGCTGGACTTTGCGTTGGTTTCGCTCGTCTTTGCTGCTGTCTCGCTATTTTTCGCGTTGTTTTCTGATTTTTTAGCTGCTGTCGCGGAGTTTGCCGATGCAGTCTGCGAGGCCGCTGCCGCCTGTGCGCTGTTAAGTGAGTAAACTCTCAGTCAGAGGTGACTCACATGACAAAAACAGTATCAACCAGTAAAAAACCCCGTAAACAGCATTCGCCTGAATTTCGCAGTGAAGCCCTGAAGCTTGCTGAACGCATCGGTGTTACTGCCGCAGCCCGTGAACTCAGCCTGTATGAATCACAACTCTACAACTGGCGCAGTAAACAGCAAAATCAGCAGACGTCTTCTGAACGTGAACTGGAGATGTCTACCGAGATTGCACGTCTCAAACGCCAGCTGGCAGAACGGGATGAAGAGCTGGCTATCCTCCAAAAGGCCGCGACATACTTCGCGAAGCGCCTGAAATGAAGTATGTCTTTATTGAAAAACATCAGGCTGAGTTCAGCATCAAAGCAATGTGCCGCGTGCTCCGGGTGGCCCGCAGCGGCTGGTATACGTGGTGTCAGCGGCGGACAAGGATAAGCACGCGTCAGCAGTTCCGCCAACACTGCGACAGCGTTGTCCTCGCGGCTTTTACCCGGTCAAAACAGCGTTACGGTGCCCCACGCCTGACGGATGAACTGCGTGCTCAGGGTTACCCCTTTAACGTAAAAACCGTGGCGGCAAGCCTGCGCCGTCAGGGACTGAGGGCAAAGGCCTCCCGGAAGTTCAGCCCGGTCAGCTACCGCGCACACGGCCTGCCTGTGTCAGAAAATCTGTTGGAGCAGGATTTTTACGCCAGTGGCCCGAACCAGAAGTGGGCAGGAGACATCACGTACTTACGTACAGATGAAGGCTGGCTGTATCTGGCAGTGGTCATTGACCTGTGGTCACGTGCCGTTATTGGCTGGTCAATGTCGCCACGCATGACGGCGCAACTGGCCTGCGATGCCCTGCAGATGGCGCTGTGGCGGCGTAAGAGGCCCCGGAACGTTATCGTTCACACGGACCGTGGAGGCCAGTACTGTTCAGCAGATTATCAGGCGCAACTGAAGCGGCATAATCTGCGTGGAAGTATGAGCGCAAAAGGTTGCTGCTACGATAATGCCTGCGTGGAAAGCTTCTTTCATTCGCTGAAAGTGGAATGTATCCATGGAGAACACTTTATCAGCCGGGAAATAATGCGGGCAACGGTGTTTAATTATATCGAATGTGATTACAATCGGTGGCGGCGGCACAGTTGGTGTGGCGGCCTCAGTCCGGAACAATTTGAAAACCAGAACCTCGCTTAGGCCTGTGTCCATATTACGTGGGTAGGATCATATAGCCATTTCCGGACATACTGGTCTGAATGTTACCCATTCCACTAATTTGTCCATTAAAAAGTCATCTCTTTTGTGACCACCATCAAAACGAACAGCAGAACCGCCCAAACAACCGCTGATCCCAATCACAGGTTTTTTTATCATTTCCTCCCCCTTGACTAATTCATTAACACATAAACTTTGTAGTGCACGGACTAAATTGCCTTTCTGGCTTCATCACTGACAATTTTTCTGTTATTGACTATTCCTAATATAGTAGGAAAGTTCTTTAAGTGATCGGTCGTACTCATCTATCTTTCATACTTACTCTCAACTATCAAAAGTACAGGATTTATTATGAAGTTATGGCCTGTGTTGACTGGCATTGCACTCTCTTTCACTCTTATAGCATGTAAGGCTCCGACACCACCTAAAGGTGTGCAGCCGATTACAAATTTTGACGCCAACCGCTACCTCGGAAAATGGTATGAAATAGCTCGCCTCGAGAACTGGTTCGAACGTGGTCTGGAACAGGTCAGCGCTACTTATGAAAAACGGAACGACGGAGGGATTCGCGTACTTAACCGTGGATACGATCCAACGAAAAACAAATGGAGCGAGAGCGAAGGTAAAGCATACTTTACTGGAGATACTAAAACTGCAGCGTTGAAAGTTTCGTTTTTTGGCCCCTTCTATGGTGGCTATAATGTAATCAAACTGGATGATGAGTATAAGTATGCTCTTGTCAGTGGTCCGAACAGAGAATACCTATGGATTCTGGCAAGGACCCAAACTATTCCAGATAATGTAAAAGCAGACTATGTGCGTACCGCTCAAAAGTTGGGATTCAATGTCAATGAACTATTATGGGTTAAACAATAAAATCCCCACCCGAAATGATACTTATTAGAAAAAAACCAGCCTTTGGGGAGGCTGGCTAAATCAGGAAATAAGCTGTTATATGATAATAACTACGTTGCGATTCCAACATTTAAAATGTTAGACTAATGAAAATCAGACAGCAACTTTTCCTTTAATTATTTCGAACAATCAGCATCCATCTCCAATCGGAGATCCAACACCATCAGCATGCCCTCCACTACGCCCTCAGCTTTCTGGAGCATCCTGCCAACCCAACAATCAGATCGCCCATGCTTACGTGCAAGCGCCATAAAAGTCATGCCGCCGACATAATAGTCCACCAATAAATCATGTAAATCGCTGTTGTTTTTTTTCAGGCGAGCCATACATTCACAAATGATCATCGCGTCATCGTCACAGCATTGCGGACGAGATCTTACTTTTGAAGGAATTAATCCCTTAAAACCGGCGGCAATGGACGACCAGGTCACATCTTCATGATTATTAGCCGCCCATGCCCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTACTCCACAAAATTAGGCCAGCACACCAATTGCCAGCGCGCGATCGATAAATCGAAAAATCAGCTCCAGTTGGGAGCCATACTTCTCTTCAAATGCCACGGTATCCGTATGCAACTCGTCGTGATGCTTTCTGCACAAAGGCAAAACAAAGAGATCATGTGCTTTGGTTCCCATTCCGCCCTGCCCGTGACCAATCAGATGATGCGGATCGTCGGCTGGCATACCGCAGCAAGCACACGGCTGTGTCTTAACCCAGCGTGTGTATTTCTCCTTAACCCACCGGTGACGTTTAGGCAGCTTCATGAAAGATTCCGGAGACTCTGGATCAACGGTGATGCTTACCACCGTCTTTTCCTGTGATGGGTTTTGTTGCTGGTGGGCGTGAGGCAACGGTGCAAGATTTTTTGTGCGCTGTTTCAATATGCTGGTGGCGGTCTGCTCTCCCGGTACGATGTCGCTTTCGCGGTACACCGAGCAGATTTTTTCCGCTGGTAATCCCAGCGAGCGACGTAATACCGCTTCCGGTAGCGCGTCCGCCAGCTGATTGCGGACCGCCCACCAGGATACCCGGGTATTGTGGAATACCGGCATGGATTCACGGCCCGGCTTAACGACCACCAGCCCAAGTTCCGGTACCAGAACAGGTCGAAGTAATACCCGCACGTTACCTCCAGATGCGTTGCTGGAATGTACGGGACGGACGCGGTGGGCGTTCGGAGTAAGGCAATCTGACTGAGATTATCCAGTGACGGTAGTCGAGACTAAGGGCTTTCTTAACCTCGTATCCGCGCCTGCGGTAACACTGAATTATCCATTCAGCCTGCTCTTCAGTGCATGGAGGGTGTTGGAACCATTCAGACTTGAATGCGTGAGAATACCGCTCGTGCGTGCAGACAAGAACGGGCGAATTATCAGAATTGTAATATTTTACGTTGCGTGCCATCGGTTTTCTCCGGTGGCACGGTGTTACTCAGCGGGAGTTCAGCCCCGCGCAAGATTGTAGATGAGTTTATTCTTCTGCAAAAGCTGAAAAGCCTGCTTTTATTCCGATCTCTTTCAGTGCCTGTAATGAAGTGACAAACTCACCTTCGCGCAAGATAAATCCGTCTGTCACTCGACCATCCACAAAATTAATTAACGCAGCCCCATTCTTTCGCAAACACATAATGCGGTAATGACTAACAAGATTTCCATTTTCAACGCACACAGCATAGAGGCCATCTTCACAAAAATTTTTACGCAGTTCTTCGATGTTCATCATCAGAATCCTTCCGGATAATTAGCTCTCCCCTTTAAGGGACCATCCCTCTTATCCCTGCGCGCTACTTAAGTATTTTTGATTCTATTCCGGCACCGTCCAGAACTTCAAACGCGTTGAAAATAAAAACAAAAACCCGCCGAAGCGGGTTAAGTGCGAGTGCGTTGAGGATGCCTGCCACATCAGAGGTGGCGAGGGATTTCTCCCTCGCCGGGTCTCTTACTCCTCAGGTTCGTAAGCTGTGAAGACAGCGACCTCCGTCTGGCCGGTTCGGATTCGTACCTCGCAGAGGTCTTTCCTCGTTACCAGTGCCGTCACTATGACGGTTAAACAGATGACGATCAGGGCGATTAACATCGCCTTTTGCTGCTTCATAGCCTGCTTCTCCTTGCCTTTCGGCACGTAAGAGGCTAACCTACATGTGTCTAGCATGAAATTGGCCTCAGATTAATGTTAAGCGTCTTGCAGGACGCGTAATGTTAACTGGGGCTTTTCTCTATCTGCCGTTGGTGTTCATGCCCGAGGCAGATAGCCTCAAGCACCCGCAGCCATTCTACTTAACTACCGTTACCTCTCCAATATGAAATCAATCAGAAAGGTGATCCATAAGAACAATAACAAGACAATAAATTGCCATTACAGCCACAATAGCCAGCGTGCATTTGAGAACCAGCACATAAATCTCCTGCTCTTGACGTATAATGACCAGATAAGGTCCGGATCAACCACAGCAGGTTTCTTCACCTTTGCCCTCGAGAGTTTTTTGCGGGCGTTTTGCCAGTCCTTACGCGCCTGTTCAGACGGGAATAACCCGTAGCCGGAGTTGTATACATCGCCACTGGCTACCAGCTCTCTGGCAAGAACGCTCATCAGATATCTAGTCGCACCTGTTTTAGCTTCCAGTTGCCGTAACGTCTCACGACCGCTCTGGCGCACGAGTTCCACCACCTGCCCTTTAATTTTTTCCCGCTCTTCTTGTGTAAAAACTTTTGCCACAACTCCCCCTTAAAATTACCTCATGACCTGAAATCAACACTTATCCTCTGAAACCAGGCGGAATTTCTGTATCCGGTTCAGAAATATGATTAACACAACGCTGTACAGGTGAACGTCCCAGGCGGATGACCAGTTCGTCCCATTTTTCGCGGAGCTTTGACGGGCTCATGATGTTTTTTACCCAGAATGGATCTCGCTGAACCCGACCAAACATTTCGCAAATTTGTCTGTGGCTTCTGCCATCCAGCATCCGCATTGTGCGCACGTCATTGGCCCAGGCAGTCCAGTTAGGCTCTTTTGGTCGCATGATCTCGCCATCATCACTGGCTGCCTGTTCGTAGAGACCCACGATCCGCCCCCAGATCCACTGCGCACACGCCAAATCCTCCCTGCTACCCCACTGGCGTTTTTTCGCACTAAACACAACCGCGTCGGAGTTCCGGGTTAAAAAATCCTGTTCAGTCGTCTGCGGGTCCGGTTGCGAAGCTTCCGGACGAAAAGTGTTTTTATTCTCTGTAGTAATCTCTGTTGTATTCTCTGTAAGATCATCAGGCCATTTTGACCCGATGACATTGGATCGTTTTGAACCAATGGAGCGTTTCATTTTGACCTCTTCCATCGTGTCATTTTGACCTGATGGAGCGGCGCATTTTGAACCGATGGATTCGTTCAATTTGCCACCATCTAAAAGCTCGCTCCCATAGTTGATCGTGTAGAAATTGGTCATATCGCGCTTTGATTTATTGAGCTTTTCACAACGCAAAAGCCCCAGCGTTTTCAGACTTGCAAACGCGCGCTTTAACGTTGACTCTGACCAGAATGGGAACTGTTCCAGCCATTGTTCCGTTGTGTTATAAATCCAGCGAACACCATCACATTCCATACCGGAGTTGGTATCTCTCAACCAGTAGTGCAGTTGTTGCAAAACAATGGCTTCGTTTAAGCCGATTTTCATTGCCAGCTGCGTGTTTATAACCAGTGGGCGTTCAGCAAAAAGAAGACTCATAATTCCATCCAGCTTTTTGTTGGTATTGCTGTCGATACGCAAGCTTGAAAGCAATTGCTTTTTCTATAAGTTCGTCAGTTTCACGATCTACAACGGCAGGATCTGCAAAAAGCAGTCCGGATTCCACCACATCGCCATATTCTTTATTTAACCCGGCGATCATGTACGTAATACTTTTTCCATCACTGATCTCACGATACAACCTGAAATCACTAATCCGGATAGCCTCCATAATTGCAGGCACTAGCGCTGTGAACTTTTCACGCTTATCCCTGGTGTCGATAGCCTTCCAGCGTTCGAATATCTTCACTCGATTAACGCTAAGCGCTCGCTGATCAACCGCGCCACCTTCATATGTGACACGCTGAACATCGATGTTCGGGCGCTCTTTCAAAGCCCAGAATGCTTCAGTGATTAATATCGTCGCCTGCTCCTGTGTCATTCCTGGTCGACATATCCAGGCATCCAGAGCCTCACGAGCCTGTTCAGGAGTGATTTTCATTGTTCAACCGCCCCGCCCGCTTCGTCTTACGATATTCGTCATAAACTTTGGGATCATACTGAAGCTCCCCGCCAGATGCCTCCTGTAGACGCATCGCGCGACCTTCAGGAACCAGTATCCCCCAAGCAGCAACACTTGCCAGTCTCACTCCTGCGGCATTGGCAAGCTTTGTTTTGCTGCCAAAAAAAGTAATTGCGTCAACTTTAAGCATCAAAGCCCCCTCTTGTTAGACTTTTCTAACATTATTGTGCGCGGGATACCTAAGTCAAGAAAAATTAGAATTACCTAACTATGGATACAAGAACCCTAGGCCAGCGAGTTCTGGCGCGACGAAAAGAATTACGCTTAACACAACGAGAAGCTGCGCGCCTCGCTGGAGTTGCTCACGTCACAATTTCACAATGGGAAAGAGACGAAACCCAGCCAGTCGGAAAACGATTGTTTGCTTTAGCGGATGCTCTGAAGTGCTCACCTACATGGCTAATGTTTGGTGACGAAGACAAGGCACCAGTGCCTGCACAAGAACTTCATGTGGAAACAGAGCTAACTCCCAACCACAAAGAATTGATCGAATTATTCGATGCTCTTCCATCTTCCGAGCAGGAAGCCTTGCTGTCTGAAATGCGCGCAAGAGTAGAAAACTTCAACAAACTCTTCGAAGAAATGCTTAAAGCGCGTAAAAATAAATCAATAAAATAACATTCTTTTCAAGTGATTAGTTGCGCCCACCCTTTTTGTTAGATCAATCTAACAAAAAACACTTGCCTCTCATGTTAGGTTATTCTAAATTACTTTCCATCAAGACACCGCACGGTGTTCTCAGCAAACAGTTCCGCTACCCCGGCGTTAAGGGGAAATGAGGTCAGCATGGATACTATCGATCTTGGCAACAGCGAATCTCTGGTATGTGGCGTGTTCCCCAACCAGGACGGTACGTTCACCGCGATGACATATACCAAAAGCAAAACGTTTAAAACCGAATCTGGCGCGCGTTGCTGGTTAGCCAGAAACACTGACTGA
Protein sequences of DBSCAN-SWA_5 >NZ_CP020368|1560932:1576555|1562481_1563765_-|WP_000347482.1|DBSCAN-SWA MDFQLYSLGAALVFHEIFFPESSTAMALILAMGTYGAGYVARIVGAFIFGKMGDRIGRKKVLFITITMMGICTTLIGVLPTYAQIGVFAPILLVTLRIIQGLGAGAEISGAGTMLAEYAPKGKRGIISSFVAMGTNCGTLSATAIWAFMFFILSKEELLAWGWRIPFLASVVVMVFAIWLRMNLKESPVFEKVNDSNQPTAKPAPAGSMFQSKSFWLATGLRFGQAGNSGLIQTFLAGYLVQTLLFNKAIPTDALMISSILGFMTIPFLGWLSDKIGRRIPYIIMNTSAIVLAWPMLSIIVDKSYAPSTIMVALIVIHNCAVLGLFALENITMAEMFGCKNRFTRMAISKEIGGLIASGFGPILAGIFCTMTESWYPIAIMIMAYSVIGLISALKMPEVKDRDLSALEDAAEDQPRVVRAAQPSRSL >NZ_CP020368|1560932:1576555|1572437_1572716_-|WP_012775982.1|DBSCAN-SWA MARNVKYYNSDNSPVLVCTHERYSHAFKSEWFQHPPCTEEQAEWIIQCYRRRGYEVKKALSLDYRHWIISVRLPYSERPPRPSRTFQQRIWR >NZ_CP020368|1560932:1576555|1575013_1575535_-|WP_000705360.1|DBSCAN-SWA MKITPEQAREALDAWICRPGMTQEQATILITEAFWALKERPNIDVQRVTYEGGAVDQRALSVNRVKIFERWKAIDTRDKREKFTALVPAIMEAIRISDFRLYREISDGKSITYMIAGLNKEYGDVVESGLLFADPAVVDRETDELIEKAIAFKLAYRQQYQQKAGWNYESSFC >NZ_CP020368|1560932:1576555|1571785_1572436_-|WP_001265279.1|DBSCAN-SWA MRVLLRPVLVPELGLVVVKPGRESMPVFHNTRVSWWAVRNQLADALPEAVLRRSLGLPAEKICSVYRESDIVPGEQTATSILKQRTKNLAPLPHAHQQQNPSQEKTVVSITVDPESPESFMKLPKRHRWVKEKYTRWVKTQPCACCGMPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHTDTVAFEEKYGSQLELIFRFIDRALAIGVLA >NZ_CP020368|1560932:1576555|1564550_1564784_+|WP_000836768.1|DBSCAN-SWA MKSKDTLKWFPAQLPEVRIILGDAVVEVAKQGRPINTRTLLDYIEGNIKKKSWLDNKELLQTAISVLKDNQNLNGKM >NZ_CP020368|1560932:1576555|1565099_1565690_+|WP_000086527.1|DBSCAN-SWA MSRIFAYCRISTLDQTTENQRREIESAGFKIKPQQIIEEHISGSAATSERPGFNRLLARLKCGDQLIVTKLDRLGCNAMDIRKTVEQLTETGIRVHCLALGGIDLTSPTGKMMMQVISAVAEFERDLLLERTHSGIVRARGAGKRFGRPPVLNEEQKQVVFERIKSGVSISAIAREFKTSRQTILRAKAKLQTPDI >NZ_CP020368|1560932:1576555|1575826_1576234_+|WP_000381212.1|DBSCAN-SWA MDTRTLGQRVLARRKELRLTQREAARLAGVAHVTISQWERDETQPVGKRLFALADALKCSPTWLMFGDEDKAPVPAQELHVETELTPNHKELIELFDALPSSEQEALLSEMRARVENFNKLFEEMLKARKNKSIK >NZ_CP020368|1560932:1576555|1571390_1571768_-|WP_001204787.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVRSRPQCCDDDAMIICECMARLKKNNSDLHDLLVDYYVGGMTFMALARKHGRSDCWVGRMLQKAEGVVEGMLMVLDLRLEMDADCSK >NZ_CP020368|1560932:1576555|1572782_1573034_-|WP_000981003.1|DBSCAN-SWA MMNIEELRKNFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDGRVTDGFILREGEFVTSLQALKEIGIKAGFSAFAEE >NZ_CP020368|1560932:1576555|1566221_1566926_-|WP_000235975.1|DBSCAN-SWA MATSTVIPGDITKLKGDVSKAKEDISSISRKVSTLQTEMTSAKQDISSRYTKTEVDNKLKNKVEVNDLESGRYGGDFYPLTGREAFYLWNLATTTAAANLYLNPDPAISSVLRSTSSIRYKHSVETIDSEHADLIFRMRPVWYRSQCENDRRDWGFYGLIAEEVGEIAPQFVHWRPANEDDAPEAISSNGLVAEGVMYERLVVPLIHHIQKLTERVDELESELKLLLTSRSDIR >NZ_CP020368|1560932:1576555|1575518_1575746_-|WP_000921596.1|DBSCAN-SWA MLKVDAITFFGSKTKLANAAGVRLASVAAWGILVPEGRAMRLQEASGGELQYDPKVYDEYRKTKRAGRLNNENHS >NZ_CP020368|1560932:1576555|1560932_1562393_-|WP_000527826.1|DBSCAN-SWA MGNNLLSAKATLPVYDRNNLAPRIVHLGFGAFHSAHQGVYADILATEHFSDWGYYEVNLIGGEQQIADLQQQDNLYTVAEMSADAWTARVVGVVKKALHVQIDGLETVLAAMCEPQIAIVSLTITEKGYFHSPATGQLMLDHPMVAADVQNPHQPKTATGVIVEALARRKAAGLPAFTVMSCDNMPENGHVMRDVVTSYAQAVDVKLAQWIEYNVTFPSTMVDRIVPAVTEDTLAKIEQLTGVRDPAGVACEPFRQWVIEDNFVAGRPEWEKAGAELVSDVLPYEEMKLRMLNGSHSFLAYLGYLAGYQHINDCMEDEHYRHAAYGLMLQEQAPTLKVQGVDLQDYANRLIARYSNPALRHRTWQIAMDGSQKLPQRMLDSVRWHLAHDSKFDLLALGVAGWMRYVGGVDEQENPIEISDPLLPVIQKAVQSSAEGKARVQSLLAIKAIFGDDLPDNSLFTAKVTEAYLSLLAHGAKATVAKYSVK >NZ_CP020368|1560932:1576555|1567216_1568470_-|WP_000879385.1|DBSCAN-SWA MLAGFSPGYKQITVGDDVIKMPSDKVVSFKRASGATYINKSGVLTVAEVDEPRFEREGLLIEGQRTNYHLNSLTPSKWGATTSVTITESGVDEFGFTYGRFQIKDEKIGTNTTMNIAAVSGGRGVDVTGTEKYVTTSCRVKSDSANIQCRIRFERYDGSAYFYLADAYLNITDMSIRKTGGGAARITARAEKESNGWIYFEVTYQSEAIDNMVGSQIQIAPPVSPGTYLGGEYLDVTTPQFEGGSCASSFIISDTVASTRASDIVTLPCKNNMASKPLTCMVEVNKNWSIAPNSAPRIYDITGFKTKDDAFVFAFRNTAGSVGTPYVQFGNPISFPPGNYPRKIIAVYRIKSDGKFQAGCNGVLSTPASTTWKSVSGATGIRIGGQTTAGLRHLFGYIRNFRIWHKELTDAQMGEII >NZ_CP020368|1560932:1576555|1573250_1573463_-|WP_000887491.1|DBSCAN-SWA MLDTCRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >NZ_CP020368|1560932:1576555|1570710_1571235_+|WP_000780584.1|DBSCAN-SWA MKLWPVLTGIALSFTLIACKAPTPPKGVQPITNFDANRYLGKWYEIARLENWFERGLEQVSATYEKRNDGGIRVLNRGYDPTKNKWSESEGKAYFTGDTKTAALKVSFFGPFYGGYNVIKLDDEYKYALVSGPNREYLWILARTQTIPDNVKADYVRTAQKLGFNVNELLWVKQ >NZ_CP020368|1560932:1576555|1569178_1570340_+|WP_085947598.1|transposase|DBSCAN-SWA MTKTVSTSKKPRKQHSPEFRSEALKLAERIGVTAAARELSLYESQLYNWRSKQQNQQTSSERELEMSTEIARLKRQLAERDEELAILPKGRDILREAPEMKYVFIEKHQAEFSIKAMCRVLRVARSGWYTWCQRRTRISTRQQFRQHCDSVVLAAFTRSKQRYGAPRLTDELRAQGYPFNVKTVAASLRRQGLRAKASRKFSPVSYRAHGLPVSENLLEQDFYASGPNQKWAGDITYLRTDEGWLYLAVVIDLWSRAVIGWSMSPRMTAQLACDALQMALWRRKRPRNVIVHTDRGGQYCSADYQAQLKRHNLRGSMSAKGCCYDNACVESFFHSLKVECIHGEHFISREIMRATVFNYIECDYNRWRRHSWCGGLSPEQFENQNLA >NZ_CP020368|1560932:1576555|1565917_1566211_-|WP_000355603.1|DBSCAN-SWA MDITPFLHALCAVAAQTLVGLFTGNWAYGAIAGCTFFIAREHTQAEYRWIEMFGHGKRINMPWWGGFDPRVWDVASLMDFAVPVVACLLIWLLVNRG >NZ_CP020368|1560932:1576555|1568789_1568951_+|WP_000896277.1|DBSCAN-SWA MAFVSLVFAAVWLDFALVSLVFAAFWLLAAAVADPAAEVAEPAAALSLSAAAA >NZ_CP020368|1560932:1576555|1564368_1564482_+|WP_120795384.1|DBSCAN-SWA MNSILIITSLLIIFSIFSHALIKLGIGISNNPDKTDV >NZ_CP020368|1560932:1576555|1566935_1567217_-|WP_001205170.1|DBSCAN-SWA MRDLIIKFTDKADFSAFMKSAGYYDDESMQDDILIDVIGNVYKETGELTEDGEPVCVKEDGYFVNVRIINDAKKSSIFDKYAVVVEHQLRGWM >NZ_CP020368|1560932:1576555|1574067_1575033_-|WP_000054497.1|DBSCAN-SWA MSLLFAERPLVINTQLAMKIGLNEAIVLQQLHYWLRDTNSGMECDGVRWIYNTTEQWLEQFPFWSESTLKRAFASLKTLGLLRCEKLNKSKRDMTNFYTINYGSELLDGGKLNESIGSKCAAPSGQNDTMEEVKMKRSIGSKRSNVIGSKWPDDLTENTTEITTENKNTFRPEASQPDPQTTEQDFLTRNSDAVVFSAKKRQWGSREDLACAQWIWGRIVGLYEQAASDDGEIMRPKEPNWTAWANDVRTMRMLDGRSHRQICEMFGRVQRDPFWVKNIMSPSKLREKWDELVIRLGRSPVQRCVNHISEPDTEIPPGFRG >NZ_CP020368|1560932:1576555|1576402_1576555_+|WP_000379591.1|DBSCAN-SWA MDTIDLGNSESLVCGVFPNQDGTFTAMTYTKSKTFKTESGARCWLARNTD |
22 | Enterobacteria_phage(18.75%) | transposase | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
1988254 : 1997111
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP020368|1988254:1997111|DBSCAN-SWA ATCACATCTTACCCAATAATAATTTGGTTATTTTCACAACAATCTCTTCATTTAAATCTGTATATATTGGCAAGCATAGGACTTTATCAGCAACACTATTGGCTATTGGTAGATTATCAACGCTAGCTGAAATCAATCCTCGATACATAGGCATATTACTGATTAATGGATAAAAATACTTTCGAGATAGGATATTATTTTTCTTAAGTAATTCATATGCCTGATCACGACTCATATGAAAACCATCATCTATGAGTATAGGAAAATATGAATAGTTGCTATTTGCATTTATATTTCCTGGAAATATAGTAATACCCGGAGTCCCTTTCAGTAAATTCCGGTAAAGACTATCAATTATTTTTCGCTTAGATATTGAGCCCTCAATATGCTTTAACTGAACAAGTCCAAAGGCAGCATTAATTTCACTCATTTTGCCATTTATTCCTGGAGCTGTCACAGTCAGCTCGTCGGCGATACCAAAGTTTTTTAATCGATCTATACGTAACTTGGTTTTAGCATCAGGGCTAATTATTGCCCCACCTTCAAATGTATTGAAGACTTTCGTCGCATGGAAACTAAGAATTGATAAATCACCATAGTTAAGTAAACTTTGCCCTTTAAAATTTACACCGAATGCATGTGCTGCGTCGTAGATTACTTTCAATCCATAATTATCAGCTATTTTTTGAATTTCTTCGACTTCACAGGGTGTTGAGTAGCAATGGACAGGTAAAATTGCTGAAGTTTTTGGTGTAATAGCCTGTTCTATTTTTCTGTAATCAATATTATATCCATCATTTTCAATATCAACAAAGACAGGCGTAAGTCCATTCCATAAAATAGCATGTGAAGTTGCAACAAATGAATATGGTGTTGTTATTACCTCACCTGTTATTCTTAGGGCTTGTAATGCTGTGATCAATGCAATTGTTGCATTGTTAAAAAGCGAGATATGCTGCACACCCAAAAATTCACATAATTTTTCCTCGAGTTCTTGATGAAATGGTCCATTATTTGTCAACCATTTATTTTTCCATATTTTTTCCAGGTAAGGCATGAATTCTGCAAGTTCCGGCAAAGAAGGTTGCGTTACTGGAATAGTTTTATCGTTCATTTTCAGTCCTTATTTTTTTATAAGAGAAATAATAGTCATAAATATTTCTTTGTACATAATTCGAATTAGTGTGATGTATATAATGGCATTTAATAAAATAGTAAATATTATATTTAATAAAGGTTCCATACTTATTTCATTTATAATATAGTTAGAACCTAACCCACTAATTAAACATAGACACCATATAGGTAGTAACTCTTTTAACTGCTTCCTTATTGTGATCGTTGATACTTTCTGGCTAAATAATCCATTTAAAAACCACGAAAGATATGAGTGTAAAACTAAACCAATCACCATTGCATTTATATTTATTTGCATCGTAATGACAAGAATAATTGTCATCAATGTCTTTTTTAGTATTTCCAAACGCAAAAAAAGATCACTTCGCCCATGAACCTGTAGGATATTAATATTTATTGCATGTATAGGATAAATCATAAATGCAATACAAAGTAGAAACAGAATATCAGATGTAAATTTCCACTCACTTCCTAATAATACAGAGACTAATGGCTTCGAGATTAAACCTATTCCTAATAGAATTGGAAAAACTACAACGCCTGCAATTTTAAGTGTGTTTAAATAAGCATCATCTATTTTCCCTTTGCCATTATAAATATTACAAAAAAGTGGATAAGTTACTCTCTGAATAATATTAGTAAGTGTCATTGCAGGTACGCTAGAGATCTGAATAGCCTGTGTAAATTGCCCTACAAGCTGTGGTGTAAAAAATTTCCCTATAATTAATTGATATATATTTGAATAAAAAGCTTCAAGTACTCCAGCAATTAGTAAGTTACTGCTAAATGAAAATAGCTTTTTGAATGTTTCATAGCAAAATTTCTCTTTTGGATACCATGGATTATAAATATTCAGTACAATACAATTTATGACACTATAGGATAACGATTGGAAAACCAATGTCCATACTCCATAACCCAAAGCAGCTAACGTTATCGCTATTACACTACTACTGGTAATAGCGATAAGTGAACTTTTAGCTTGGATCTTAAAATTCACGTCTACAGTGAGTTGAATTTTGGGGATTAAAGTTAGACCATTTGCTATAACTGTTAATGCTAAAATATCAAGTAAAGATTCCAGCAGAGGTATTTGATAGAATAAAGCAATATAAGAAGATAAACAATAAAGTACAATATATATTAGTATTGATAGACAAATATTAAATATAAATACTGTCGAATAATCTTTTTCAGTACGCTCCGATTTTCTAATTAATGCTGCACTAAATCCACTATCAATTAGAACCTGAGCTAATGATAGAAATAAAGTTAGCATACCAACATAACCAAAAGAGCTTGGTCCAAGAACCCTACCCAAATACAACATTAATAATAATTGTATGGCTTGCGTTCCTAGCCTTTCAATTGTGCTCCATAATAATCCTTGCCTGGCTAGTTTGTTATTCATCAATTAAACTTTCTCAAAAAGTTCTCTGAATGATGGTTGTTTCATGTCTTTCTCAGAGATTATCGCTTTTTTTTCTTTCCAATTAATAGCTAAATATTCATCATCCCATCTTATACCTTTTTCCATAGTTGCATTATAATAATTATTTGTTTTGTATAAAAAATGAGCAACTTCACTAATCACTTCAAAACCATGGGCGAATCCTTCTGGAATCCACAACTGTCGTTTGTTTTCAGCAGAAAGATTTACACCGACCCATTTGCCAAAAGTTTTAGAATCTTTCCTTATATCAACAACAACGTCATATACCTCACCTACTACACAGCGTACTAATTTTGCCTGTGAATAAGGTGGTAATTGAAAATGTAATCCTCTAATTACTCCTTTAGTAGACTTAGAATGGTTATCCTGAACAAATTCAACCTTCCGCCCCACAGCTTCTTCGAAAACTTTCTGATTGAAGCTTTCCATAAAAAAACCACGCTCATCCCCAAATACTTTTGGTTCAAATATTAATACATCAGGAATTTCAGTTTTAATTACGTTCATCTTATTAATAACCTTTAATCATTTTGAGCAGATACTGACCATAAGCATTTTTCTTCAACGGTTCGGCTAATACTTTTACCTGCTCAGCATCAATAAACCCTTTACGATAAGCAATTTCTTCCGGACAGGAAACCTTTAGTCCCTGGCGCTCTTCAATGGTGGCAATGAAGTTGCTTGCTTCAATAAGACTTTGATGCGTCCCTGTATCCAGCCATGCATAGCCACGCCCCATCATAGCGACAGACAAACGTCCTTGTTCCATATAAATACGGTTAATATCGGTAATTTCCAGTTCACCTCGGGCAGAAGGCTTAAGGTTTTTCGCCATTTCCACAACGTCATTGTCATAGAAATAAAGCCCAGTAACCGCATAGTTACTTTTTGGTTCCAGCGGTTTTTCTTCCAGGCTAATTGCAGTACCGTTATTATCAAACTCCACGACACCATAACGTTCAGGATCATTAACGTGATAAGCAAATACCGTTGCACCGCTTTCTTTGTTAACAGCAGCTTCCATTAATTTCGGCAAGTCGTGTCCATAGAAGATATTATCACCAAGCACAAGTGCACAATCATCGCCACCAATAAAGTCTTCACCAATGATAAACGCTTGCGCCAGGCCATCCGGACTCGGTTGTACTTTATACTGTAGATTAAGCCCCCACTGGCTCCCGTCCCCCAACAATTGTTGGAAACGCGGCGTATCCTGTGGCGTACTGATAATAAGAATATCGCGAATACCCGCTAACATAAGCGTTGAAAGCGGATAATAAATCATCGGCTTATCATAAATCGGCAGCAGTTGTTTACTTACTGCCATCGTCACAGGATAAAGACGAGTGCCGGAACCACCAGCCAGAATAATACCTTTACGCGTTTTCATTTCACCATTCCTTTTAATTCATCCCGCTCTGGCATCATGAGCGAGATGCAAAAATTTGTTAAATTGCCGTAGTCGTAAATAATTCGTTGAGCATTCGTTTCACACCCACCTGCCAGTCAGGCAAGACAAGCGCAAAGTTCTGCTGAAACTTTTCTGTATTGAGGCGAGAGTTATGGGGTCGACGGGCTGGTGTGGGATAGGCCGTTGTTGGCACGGCGTTAAGTTTGTTAAGAGCAAGGTTAATCCCTGCTTTGCGCGCTTCTTCAAATACCAGAGCGGCATAATCGTGCCAGGTTGTTGTACCGCCAGCAACCAGATGGTACAAACCAGCAACTTCTGGTTTGTCTACTGCTACGCGAATGGCATGAGCGGTGCAATCAGCCAGCAATTCGGCACCTGTTGGTGCGCCAAACTGATCGTTTATCACAGCCAGTTCTTCGCGCTCTTTTGCCAGACGCAACATTGTTTTGGCGAAGTTATTTCCTTTACCTGCATATACCCAGCTGGTACGGAAAATAATATGCTTCGCGCAATGTTTCTGTAACGCTTTTTCTCCGGCTAGTTTAGTTTCACCATAAACATTCAAAGGTGCTGTAGCATCCGTCTCCTGCCACGGTATTTCACCGGTTCCCGGAAATACGTAGTCAGTAGAGTAGTGAATAACCCAAGCACCGACTTCATTGGCTGCTTTTGCAATCGCTTCGACACTTGTCGCGTTAAGTAATTGTGCAAATTCCGGTTCTGATTCCGCTTTGTCTACTGCAGTGTGAGCAGCCGCATTAACAATAACATCAGGGCGAATTTTTTTGACAGTTTCAGCTACACCTTCGGGATTACTAAAATCACCACAATAATCAGTGGAGTGAACATCAAGCGCAATCAGATTACCCAGCGGTGCCAGAGCACGCTGCAGTTCCCAACCTACCTGCCCTGTTTTGCCGAATAGGAGGATATTCATTACTGGCGGCCCTCATAGTTCTGTTCAATCCACGACTGATAAGCACCGCTTTTCACATTCTCAACCCAATTTGTATTAGCCAGATACCATTCCACCGTTTTACGAATACCGCTCTCAAACGTTTCTTGCGGTTTCCAGCCCAATTCGCGGCTAATTTTATCGGCATCAATTGCATAACGGCGATCATGCCCTGGGCGATCAGCAACATAGGTAATTTGCTCACGATAAGATTTCTCTTTCGGGACTATCTCATCCAACAAATCACAAATAGTGAACACTACGTCGATGTTTTTCTTTTCGTTGTGTCCACCAATGTTATAAGTTTCGCCCGCTTTACCTTCGGTTACGACGGTATATAACGCTCGAGCATGATCCTCTACATACAACCAGTCGCGGATCTGATCTCCTTTGCCATAAATAGGTAATGCCTTACCTTCCAGTGCATTAAGAATAACCAGTGGAATAAGCTTTTCCGGGAAATGATAAGGACCATAGTTGTTCGAGCAATTAGTCACAATGGTCGGTAAACCATAAGTACGTTTCCATGCGCGAACCAAATGATCGCTGGAAGCTTTAGAAGCAGAATATGGACTACTTGGCGCGTATGCTGTCGTTTCCGTAAATAGCTGCAACGTTTCATTGCTATTTACTTCATCCGGATGGGGTAAGTCACCATACACCTCATCAGTAGAAATATGATGAAAACGGAAGTTTTTTTTCTTTTCATCATCCAGACCAGACCAATAATTGCGCGCCGCTTCTAAAAGAACATAAGTACCCACAATATTGGTTTCAATAAATGCCGCAGGGCCAGTTATTGAGCGGTCAACGTGGCTCTCTGCTGCCAGGTGCATCACCGCGTCTGGCTGGTGCTGTGCGAAAATACGAGCCATCGCTTCGGCATCGCAGATATCTGCATGCTCAAATGAATAACGTTCAGAATCAGAAATTTCAGCGAGTGATTCCAGGTTTCCGGCGTATGTTAATTTATCGACATTAACAACACTATCTTGCGTATTATTTATTATGTGACGAACAACAGCAGAACCAATAAATCCTGCGCCACCAGTAACAAGTATCTTCACTTTTTTATTCCATATAGCCAGAGAGCATGCTGTGAAATAGACTGCTCCAGATTTGATTAATAGATGCATTAATGCACGCTACCGCCCCTGGCTTAACAGCTACCAGAGCACTGCGTACATTTCCACGATGTGACGAGCGTAACCCACTCGTGCCAAATCCGAAAAATTCAAACGCTAATTGTCTTACCAAACAGCTGTAGAAACAAGGAAAATCCTGGAAAAATTTGAACCATGATTGCAAGCTAACATGCTGTTTTTATTGTACTTATAAAAAAGACAACGGCAGTGAGATTCAATGAGCAAAACTTATGCTATGAATCTTCACTGCCGTTTTTAATTACTTACTGACCGACACTTCCAGTCAGGAATTTGTTACTCGCTTAACAGCTTCTCAATCCCTTTACGGAACTTCGCCCCTTCTTTCAGGTTGCGTAGTCCATACTTCACAAATGCCTGCATATACCCCATTTTTTTACCGCAGTCGTAGCTGTCACCTGTCATCAGCATGGCATCAACGGACTGTTTTTTCGCCAGTTCAGCGATGGCATCAGTCAGCTGAATACGCCCCCATGCACCAGGCTGAGTGCGTTCAAGTTCCGGCCAAATATCTGCAGAAAGCACATAGCGACCAACGGCCATGATGTCTGAGTCCAGCGTCTGCGGCTGATCCGGTTTTTCGATGAATTCAACAATGCGGCTGACTTTACCTTCGCGGTCCAGCGGCTCTTTGGTCTGGATGACAGAGTATTCAGAGAGGTCACCCGGCATACGTTTTGCCAGCACCTGGCTGCGGCCCGTTTCGTTGAAGCGCGCAATCATGGCAGCAAGGTTGTAGCGCAGCGGGTCGGCGCTGGCGTCGTCGATCACAACGTCTGGCAGCACCACGACAAATGGATTGTCACCAATGGCAGGTCGTGCACATAAAATGGAGTGGCCCAAACCTAAAGGTTCGCCCTGACGCACGTTCATAATTGTCACGCCCGGCGGGCAAATGGACTGTACTTCCGCCAGCAGTTGACGCTTCACGCGCTGTTCAAGGAGAGATTCTAATTCATAAGAGGTGTCGAAGTGGTTTTCGACCGCGTTCTTGGAAGCGTGAGTTACCAGGAGGATTTCTTTGATCCCTGCAGCCACAATCTCGTCAACGATGTACTGAATCATTGGCTTGTCGACGATCGGTAGCATCTCTTTGGGAATCGCCTTAGTGGCAGGCAACATATGCATCCCAAGACCCGCTACAGGAATAACTGCTTTTAAATTCGTCATTATTTCATCCACCTGTAAAATGGTTGCTGAATTATAGCTTGTTCGATTTTTTTCGCCAGCATCAATTACCCTGAATTGATTACTGAATTACTTGTGATGTTACGCCGCTTCGTTGTGGATTGCAGTAGCATTGTTCCTAAGTATGGCTCCATTTTTCCAGGAATGGTCGCAAATCTACTCCCTCCGTTCTGGCAATCTAAAGTTAATCTTCTCCACATTAACAATATGGTGATTAATCCTGTCGATATCGACGGAGATTTGTCCTTTTTCATTCACCGCATGAACATTTGCAAGAGACAGCAGTGTTTCTTTTTTCGCCATAAACACGCCGCGAACGTCTTTGCGCATGTCAAAGTTCATGCTCAATGCGGGTCCAACTGAGGATTCCTGCATCACATTGATATTTCGCATAAAAAGATGTTGCGGTTTGTTGTGTAACTCCAGCGACGCACGCTTCATCTCAATGTTAGTCAGCGCCACAAAGGAGACAGCATTCCCGGCGGAGATTTGGATGCCGCGCAATTTATAAGCAAGATGAGTATTATCCAGTTGAATATCATTCACTCGGAAATTCTGCGGTATCGAGAGATATTTGCCTTTAATTACCCCATAGCCGATTAACATCCCGGCACTATTAATCATTTCAATATTATCAATCACAAAGTTGTCACAACCATAAATAGCAACTGTGGCGTTATCAATACCTGCTTTCTTACTGAAATCCGGCGTGATATTACGAGCTTTGATATTACGGATAACAAAATGTTTACCATTTTCAACATGGATCAGTTGCCGACAATCTGATCCCGTGATATTCGCAACGACAAAGTTTTTCACTGCCTGGTTTTCCGGATAATTATTATCGTAAGTACTTCCTGCCAGTCCTATACCAATACCCCAGTTGATTTTGCCGTTAGTACAGTTGATGCGCTCGATGACATGGTCGGATATCAAAATGTCGCGATCATTGATTGCTACATTCCATTCAATGGCGTCGCCCTGTAAGTCGCTGAACTTACAATTGGTGATGTTGGCACCGATAATCTGGTTATGAAATCCCTGGCGTAAAATGGCGTAATTAGCGTGGCTGACGGTCAGGTTATCGATGGTCAAGTTGCGCATGACGCGTTTATTTTTGCCGCCGATATAAATCTGCGTTACCGGGCCAAAACCGCTCATCGCCAACCCTTTGATGATGCAGTCAGAACCGCGCACATCCAGGGTGATGTTATGCATACGGCCGCCCTTCTCCCCTGTCACCTGGCTGCCGTCTTGTAAGACAAAACGCCCTCTGCCGTTGCCGCGCAGGCTTCCAAGGATGTGCAACGTTTTGCCAGGAGGAATGAAGATGCCGGTGTTGATATTTTCACAAACGAATCCGGCAGGCACGACGACCGTTTGCCCTTCGCTGAAGGCTTGTTTAAATGAGGCGATCCAGTCGTGTGGGTTGTAGTCGTTAATGTTAACGATTTGTCGGGCGGGAAGCGCGCGAGCGAAAGGGGTATGGAGGAAGGCAAGCGCCGAGCTTGCCGTCAGGAAGGTGCGTCGGGAGAGTTTTTTAAATGGCAT
Protein sequences of DBSCAN-SWA_6 >NZ_CP020368|1988254:1997111|1995716_1997111_-|WP_001115981.1|DBSCAN-SWA MPFKKLSRRTFLTASSALAFLHTPFARALPARQIVNINDYNPHDWIASFKQAFSEGQTVVVPAGFVCENINTGIFIPPGKTLHILGSLRGNGRGRFVLQDGSQVTGEKGGRMHNITLDVRGSDCIIKGLAMSGFGPVTQIYIGGKNKRVMRNLTIDNLTVSHANYAILRQGFHNQIIGANITNCKFSDLQGDAIEWNVAINDRDILISDHVIERINCTNGKINWGIGIGLAGSTYDNNYPENQAVKNFVVANITGSDCRQLIHVENGKHFVIRNIKARNITPDFSKKAGIDNATVAIYGCDNFVIDNIEMINSAGMLIGYGVIKGKYLSIPQNFRVNDIQLDNTHLAYKLRGIQISAGNAVSFVALTNIEMKRASLELHNKPQHLFMRNINVMQESSVGPALSMNFDMRKDVRGVFMAKKETLLSLANVHAVNEKGQISVDIDRINHHIVNVEKINFRLPERRE >NZ_CP020368|1988254:1997111|1990804_1991350_-|WP_001100801.1|DBSCAN-SWA MNVIKTEIPDVLIFEPKVFGDERGFFMESFNQKVFEEAVGRKVEFVQDNHSKSTKGVIRGLHFQLPPYSQAKLVRCVVGEVYDVVVDIRKDSKTFGKWVGVNLSAENKRQLWIPEGFAHGFEVISEVAHFLYKTNNYYNATMEKGIRWDDEYLAINWKEKKAIISEKDMKQPSFRELFEKV >NZ_CP020368|1988254:1997111|1993190_1994276_-|WP_000699450.1|DBSCAN-SWA MKILVTGGAGFIGSAVVRHIINNTQDSVVNVDKLTYAGNLESLAEISDSERYSFEHADICDAEAMARIFAQHQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEAARNYWSGLDDEKKKNFRFHHISTDEVYGDLPHPDEVNSNETLQLFTETTAYAPSSPYSASKASSDHLVRAWKRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKALPIYGKGDQIRDWLYVEDHARALYTVVTEGKAGETYNIGGHNEKKNIDVVFTICDLLDEIVPKEKSYREQITYVADRPGHDRRYAIDADKISRELGWKPQETFESGIRKTVEWYLANTNWVENVKSGAYQSWIEQNYEGRQ >NZ_CP020368|1988254:1997111|1989376_1990801_-|WP_001060532.1|DBSCAN-SWA MNNKLARQGLLWSTIERLGTQAIQLLLMLYLGRVLGPSSFGYVGMLTLFLSLAQVLIDSGFSAALIRKSERTEKDYSTVFIFNICLSILIYIVLYCLSSYIALFYQIPLLESLLDILALTVIANGLTLIPKIQLTVDVNFKIQAKSSLIAITSSSVIAITLAALGYGVWTLVFQSLSYSVINCIVLNIYNPWYPKEKFCYETFKKLFSFSSNLLIAGVLEAFYSNIYQLIIGKFFTPQLVGQFTQAIQISSVPAMTLTNIIQRVTYPLFCNIYNGKGKIDDAYLNTLKIAGVVVFPILLGIGLISKPLVSVLLGSEWKFTSDILFLLCIAFMIYPIHAININILQVHGRSDLFLRLEILKKTLMTIILVITMQININAMVIGLVLHSYLSWFLNGLFSQKVSTITIRKQLKELLPIWCLCLISGLGSNYIINEISMEPLLNIIFTILLNAIIYITLIRIMYKEIFMTIISLIKK >NZ_CP020368|1988254:1997111|1994648_1995542_-|WP_000183060.1|DBSCAN-SWA MTNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEILLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQGEPLGLGHSILCARPAIGDNPFVVVLPDVVIDDASADPLRYNLAAMIARFNETGRSQVLAKRMPGDLSEYSVIQTKEPLDREGKVSRIVEFIEKPDQPQTLDSDIMAVGRYVLSADIWPELERTQPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKGIEKLLSE >NZ_CP020368|1988254:1997111|1988254_1989367_-|WP_000998544.1|DBSCAN-SWA MNDKTIPVTQPSLPELAEFMPYLEKIWKNKWLTNNGPFHQELEEKLCEFLGVQHISLFNNATIALITALQALRITGEVITTPYSFVATSHAILWNGLTPVFVDIENDGYNIDYRKIEQAITPKTSAILPVHCYSTPCEVEEIQKIADNYGLKVIYDAAHAFGVNFKGQSLLNYGDLSILSFHATKVFNTFEGGAIISPDAKTKLRIDRLKNFGIADELTVTAPGINGKMSEINAAFGLVQLKHIEGSISKRKIIDSLYRNLLKGTPGITIFPGNINANSNYSYFPILIDDGFHMSRDQAYELLKKNNILSRKYFYPLISNMPMYRGLISASVDNLPIANSVADKVLCLPIYTDLNEEIVVKITKLLLGKM >NZ_CP020368|1988254:1997111|1991354_1992233_-|WP_000857508.1|DBSCAN-SWA MKTRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEDFIGGDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDNNGTAISLEEKPLEPKSNYAVTGLYFYDNDVVEMAKNLKPSARGELEITDINRIYMEQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAYRKGFIDAEQVKVLAEPLKKNAYGQYLLKMIKGY >NZ_CP020368|1988254:1997111|1992291_1993191_-|WP_001023616.1|DBSCAN-SWA MNILLFGKTGQVGWELQRALAPLGNLIALDVHSTDYCGDFSNPEGVAETVKKIRPDVIVNAAAHTAVDKAESEPEFAQLLNATSVEAIAKAANEVGAWVIHYSTDYVFPGTGEIPWQETDATAPLNVYGETKLAGEKALQKHCAKHIIFRTSWVYAGKGNNFAKTMLRLAKEREELAVINDQFGAPTGAELLADCTAHAIRVAVDKPEVAGLYHLVAGGTTTWHDYAALVFEEARKAGINLALNKLNAVPTTAYPTPARRPHNSRLNTEKFQQNFALVLPDWQVGVKRMLNELFTTTAI |
8 | Enterobacteria_phage(42.86%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2085011 : 2094453
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP020368|2085011:2094453|DBSCAN-SWA CATGTCTGAACTGAACGATCTTCTGACCACCCGTGAGCTACAACGCTGGCGATTAATTCTTGGCGAAGCGGCAGAAACGACGCTTTGTGGGCTGGATGACAACGCCCGGCAGATAGACCACGCGCTGGAGTGGCTGTATGGGCGCGATCCTGAACGGCTCCAGCGTGGTGAACGCTCCGGTGGATTAGGTGGCTCAAATCTCACCACCCCTGAGTGGATCAACAGTATTCACACGCTGTTTCCGCAACAGGTGATTGAGCGGCTGGAAAGCGATGCCGTACTGCGCTACGGCATTGAAGATGTGGTGACAAATCTCGACGTGCTGGAACGTATGCAGCCTTCCGAAAGCCTGCTACGCGCCGTTTTGCACACCAAACATCTGATGAATCCCGAAGTACTGGCTGCCGCCCGCCAGATAGTGCGCCAGGTTGTTGAAGAAATTATGGCGCGACTGGCAAAGGAAGTTCGCCAGGCTTTTTCTGGTGTCCGCGATCGCCGTCGCCGTTCATTTATTCCACTGGCGCGAAACTTTGATTTCAAAAGTACTCTGCGCGCCAACCTGCAACACTGGCACCCGCAACACGGCAAGTTGTATATCGAATCCCCCCGCTTTAACAGCCGCATTAAACGCCAAAGCGAACAATGGCAACTGGTCTTACTGGTTGATCAAAGCGGATCGATGGTCGATTCGGTGATCCACTCTGCGGTGATAGCGGCCTGTTTGTGGCAGTTACCCGGCATTCGTACCCATCTGGTGGCGTTTGACACAAGCGTCGTTGATCTCACGGCAGACGTTGCCGATCCGGTAGAGTTATTAATGAAAGTACAGTTGGGCGGCGGGACCAATATCGCCAGTGCCGTGGAGTATGGTCGGCAACTTATTGAACAACCAGCGAAAAGCGTCATTATCCTCGTGAGCGATTTTTACGAAGGGGGTTCATCATCATTACTGACGCATCAGGTGAAAAAGTGTGTCCAGAGCGGCATCAAAGTGCTGGGACTGGCAGCGCTCGATAGCACCGCAACACCTTGCTATGACCGCGATACGGCCCAGGCGCTGGTTAATGTCGGCGCACAAATAGCCGCCATGACGCCGGGCGAGCTGGCATCATGGCTTGCGGAGAATCTTCAGTCATGAATTCACTACGTCCGGAATTATTAGAACTGACACCGCAGGCCCTGACGGCGTTAAGCAATGCCGGTTTTGTTAAGCGCAGTCTTAAGGAACTGGAAAATGGCAACGTCCCGGAGATCAGCCATGAGAACGACGCTTTAATCGCCACCTTCAGTGACGGTGTCCGTACCCAGCTGGCGAACGGCCAGGCACTGAAAGAGGCTCAGTGCAGTTGCGGGGCCAACGGTATGTGCCGTCATCGCGTGATGCTGGTGTTAAGTTATCAACGACTTTGTGCCACCACTCAGTCTACGGAAAAAGAAGAAGAGTGGGATCCGGCAATCTGGCTGGAAGAACTGGCTACCCTGCCCGATGCCACCCGCAAACGCGCACAGGCGTTGGTCGCTAAAGGCATCACCATTGAGTTGTTCTGTACGCCGGGCGAAATTCCCTCTGCCCGCTTACCGATGAGCGATGTGCGTTTTTATTCCCGCAGCAGTATTCGTTTCGCCCGTTGTGATTGTATTGAAGGCACACTTTGCGAACATGTCGTACTGGCTGTGCAGGCCTTCGTCGAGGCCAAAACTCAGCAAGCGGAATTTACTCATTTAATCTGGCAGATGCGCAGCGAACACGTCACATCATCTGACGATCCGTTTGCCAACGACGAGGGTAACGCGTGTCGTCAATATGTTCAGCAATTAAGCCAGGCATTATGGCTGGGCGGCATCAGCCAGCCGCTTATTCACTACGAGGCCGCTTTCAGTCGCGCGCAGCAGGCGGCGGAACGCTGCAACTGGCGATGGGTGAGTGAATCACTACGACAGCTACGGGCAAGCGTTGATGCCTTCCACGCCCGCGCCAGCCACTATCATGCCGGAGAATGCTTACGTCAGCTTGCGGCATTAAACAGTCGATTAAATTGCGCACAAGAGATGGCCCGGCGCGACAGTGTTGGTGAAGTTCCTCCTGTGCCGTGGCGCACGGTCGTTGGCTCTGGCATTGCCGGAGAAGCAAAGCTTGATCATCTGCGGCTGGTGTCTTTAGGTATGCGTTGCTGGCAGGATATTGAGCATTATGGTTTACGCATCTGGTTTACCGATCCCGACACCGGCAGTATTTTGCACCTTTCGCGCAGTTGGCCGCGAAGTGAACAGGAAAACTCACCGGCAGCTACGCGTCGGCTGTTTAGTTTTCAGGCTGGCGCACTGGCGGGTGGGCAAATTGTTTCACAAGCAGCAAAACGCAGTGCCGATGGCGAGCTGCTGTTAGCTACCCGCAACCGCTTAAGCAGCGTTGTGCCGCTGTCGCCTGATGCCTGGCAAATGTTGAGCGCGCCGTTACGCCAGCCGGGCATTGTGGCTTTGCGGGAATATTTACGCCAGCGTCCCCCCGCCTGCATACGGCCTCTTAATCAGGTCGATAACTTATTTATTCTGCCGGTCGCTGAGTGTATTTCGCTCGGTTGGGATAGCAGCCGCCAGACGCTGGATGCGCAGGTAATCAGCGGCGAAGGTGAAGATAATCTCCTGACGTTATCCCTTCCGGCATCTGCCTGTTCTCCTTTTGCCGTTGAACGCATGGCGGCGCTTTTGCAACAAACAGACGACTCCGTGAGTCTGGTTTCTGGCTTTGTCAGTTTTGTTGATGGGCAATTGACACTGGAACCACGGGTGATGATGACAAAAACCCGCGCCTGGGCGCTGGACGCAGAAACTGCGCCTGTGGCACCGCTACCTTCTGCCAGCGTTTTGCCTGTGCCGTCTACCGCTCATCAGTTGCTGATGCGCTGCCAGGCGTTACTTATTCAACTGCTCCATAACGGCTGGCGCTATCAGGAACAGAGCGCTATTAGTCAGGCAGAGTTGCTGGCGAATGACCTCTCCGCGGTCGGTTTTTATCGGCTGGCACATGTGTTGGGACAATTTCGTAATACAGAAAGCGAGGCACGGGTAGAAGCAATGAATAACGGTGTTTTACTTTGCGAACAATTATTCCCCATGCTTCAGCAACAAGGATGAAATAGTGCTTTTTACTAAGAGTTCTACTCCAGTTCCGGACTGCTCACGTCACGGTATTAGGCATATTCTATATAGCCCCTGGTGAGAGTCACCAGTTTCTTGATTAAATAAAATGGAGTTTTACATGAAGGCTTTCAATAAGCTGTTTTCCCTCGTTGTTGCATCTGTTCTGGTTTTCTCTCTTGCTGGCTGCGGTGACAAAGAAGAATCGAAGAAATTCAGCGCCAATCTGAACGGCACTGAAATTGCCATTACCTATGTCTACAAAGGTGACAAGGTGCTTAAGCAATCTTCTGAAACCAAAATTCAATTTGCTTCTATTGGTGCAACCACCAAAGAAGACGCTGCCAGGACACTTGAGCCGTTAAGCGCCAAATACAAAAACATCGCGGGTGTTGAAGAAAAATTAACCTATACAGATACCTACGCGCAGGAAAACGTGACTATCGATATGGAAAAAGTGGATTTTAAAGCCCTGCAGGGTATTTCAGGAATCAACGTTTCTGCTGAAGATGCCAAAAAAGGTATCACTATGGCGCAAATGGAACTGGTGATGAAAGCCGCTGGTTTTAAAGAAGTGAAATAATCTGTCGGCGGCCATGTTTCGCATGGCCGCCATACCCGTCTTAGCTTTTCTTCACATGCTGGCGCGCTGCCAGTCCACGCAGAAAATAACGTAAAAATTGATCGCCGCATTCGCGGAAGTTTTTATGATCCGGTGCGCGCATCATCGCTGTAATTTCCGGCATCGAAACGCGGAACTGCTGTTCGGTGAGGATAGCCAGAATGTCATCGGTTTTCAGCGAAAACGCGATGCGTAATTTTTTCAGCACGATGTTGTTATTAATGCGACGTTCCGGCTCCAGTGCCGGAGCAGACTCATCCTTGCCGCGTTTTTCATAAATCAGGCCATTGAGGAATGACGACAAAACAATGTCCGGACAACGCTGAAAACCCTCTTCGTCTTCTTTACGTAGCCAGACGGCGATCTGTTCCGCGGTGGCTTCGACATTACCCAGCGCCAGAATACGCACCAGGTCATTATTATTGGCTTTCAAAATGTAGCGCACGCTGCGCAGAATATCGTTACTTAGCATGAGGCCTTCAGGTGTTGATGAGGCAAAAAGCCATTTTAGCAGTCTTTTACAGGCCAATCGCCTCTTTTAAGCTTTTCAGATAACGGCGGCTGACCGGCACGGTTAAGCCATTACGCAAAATCAACTCGGCCTGGCCGTTATCTTCCAGACGAATCTCCTGTAAATGCGCGAGGTTAACCAGATACTGACGATGGCAGCGCAGTAGTGGTGTACGACTTTCCAGGGTACGTAATGTCAATTCGGTAAAGCCCTCTTTCCCTTCGTGGCTGGTAACGTAGACACCGCTCATCCGACTGCTGACAAATGCCACATCTTTCATTTGCAGCAAATAAATCCGACTATGCCCCGTACAAGGGATAAATTTCAGCGCCTGTTGATTTTCCGGTAACAGCGAAACATCCTGCTTGCTGCGCTCCTGACGCAATCGCGCCAGCGTTTTCTCCAGTCGCGCTTCATCAATTGGCTTCAGCAGATAATCAAAGGCATGTTCTTCAAAGGCTTTAATTGCGTATTCGTCAAACGCAGTGAGAAAAACAATATACGGGCGATGTTCCGGGTCAAGCATCCCCACCATTTCCAGACCACTGATGCGCGGCATCTGGATATCGAGAAACAGCACATCCGGGCGCAGTTTATGCACCGCGCCGATCCCTTCCACGGCGTTTGAACACTCTCCAACGATTTCAATATCGCTCTGCTCCTGCAAAAATACACGCAGGTTCTCCCGTGCTAACGGTTCATCATCGACAATTAAGACTTTAATCATGCCTCGTCCCTCCATGGTAGTCGTAACGTTATTCGGGTGTAACTATCAGGCTCACAGGCGACGCTTATTCCATAGTCATCGCCAAACCGTTCACGTAAACGCTTATCCACCAGATTCATCCCCAGCCCACTGGCATTGGTTACCGGTTGATACAAACCGGCATTGTCTTCGATCTCCAGCATCAAATGTTGCCCCTCACGTCGGGCGCTGATTGCCACTCGCCCTGTATCCAGCAGTTGTGATGTCCCATGTTTAATGGCGTTTTCCACTATCGGTTGCAGGGTAAACGCGGGCAATTGCTGCTGGGATAATTCTTGCGGAATAGCAATGTTGACCTGCAACCGCGACTGGAAGCGCGCCTTTTCAATTTGCAGATAAGCGTTCACATGTTCAATTTCGTCGGCGAGAGTAACAAACTCCGAAGGCCGCTTTAAGTTTTTGCGGAAAAAAGTGGAAAGATACTGCACCAGCTGGCTGGCCTGTTCGCTGTCGCGGCGGATCACCGCTTTAATGGTGTTAAGCGCATTAAACAAAAAATGAGGATTCACCTGGGCGTGAAGCAGTTTGATTTCTGACTGGGTGAGCATCGCTTTTTGCCGCTCATATTGCCCGGCAAGAATCTGCGCCGAAAGCAGTTGCGCAATCCCCTCGCCCAGCGTGCGGTTTATTGAACTGAATAAACGGTTTTTGGCTTCATACAATTTGATGGTGCCCATCACGCGCTGATTTTCGCCACGCAACGGAATTACCAGCGTCGACCCCAGTTTGCATTGCGGATGCAAAGAGCAACGATAAGGTACTTCGTTGCCATCAGCGTAGACCACTTCACCGGTTTCAATCGCTTTTAAGGTGTAAGTCGAAGAAATCGGTTTGCCGGGTAAATGGTGGTCGTCACCAATTCCGGTAAAGGCCAGCAATTTCTCTCGATCGGTAATCGCGACTGCACCAATATCCAGCTCCTGATACAGCACCTGAGCCACTTTCATGCTGTTCACTTCGTTAAACCCCTGTCGCAAAATGCCTTCCGTCGAGGCTGCCACTTTCAGCGCAGTGGCAGAAAAAGCCGAAGTGTATTTTTCAAACATCGCGCGTTTATCGAGCAATATACGCATAAACAGCGCCGCGCCGACGGTATTGGTGACCATCATTGGCGCAGCAATATTACTCACCAGACGCACCGCATCTTCATAAGGTCGGGCGATCGCAAGGATGATCAGCATTTGCACCATTTCAGCGACGAACGTGACGGCACCGGCGGTAATGGGGTTAAAGACTTTATCAGTGCGCCCGCGACGGATCAGGATGCTGTGTACCAGGCCGCCGAGCAATCCTTCAACGATGGTTGAGATCATGCAACTTAGCGCGGTCATGCCCCCCATCGAATATCGATGTAAGCCGCCGGTCAGACCAACCAGCCCACCGACGACCGGACCGCCGAGTAAGCCGCCCATTACCGCGCCTATCGCACGGGTATTGGCAATAGAATCGTCAATGTGCAACCCAAACCAGGTGCCCATGATGCAGAAGATGGAAAAGACGATGTAGCAGAGAAATTTATGCGGCAGACGAACCGTGACCTGCATTAACGGTATGAATAATGGCGTTTTACTCATTAACCACGCAATGACTAAAAAAACGCACATCTGCTGAAGCAGCAGCAACACCAGATTAAAATCGTACATACCCGCAAACCACACTTCCCTTTAAAACGCGTAACATACATTGCCTGCGTTTAACTTTCTTTGAACTCTTGCAGAAAAATGAGAATTCGTGAGTACGATCACTCAAAATCGCCTGGCAAAAATAAAATCACCCTATAGATGCACAAAAAACGGGCAAAACTACCTGGTTCGCAAAACTGCGTCTAAAGTTAAACTGGGACCTCGCGAGCAAGGGTGAGACGATGGCGCTTTACACAATTGGTGAAGTGGCGTTGCTTTGTGATATTAACCCTGTCACGTTACGCGCGTGGCAGAGGCGTTACGGATTGCTGAAACCGCAACGGACAGACGGCGGTCATCGGCTGTTCAACGATGCCGATATTGACCGGATCCGCGAGATCAAACGCTGGATCGACAACGGCGTGCAGGTCAGCAAAGTTAAAATGCTGCTCAGTAATGAAAATGTTGATGTGCAGAACGGCTGGCGCGATCAGCAAGAAACATTACTGACTTACTTGCAAAGCGGCAATCTGCATAGCCTGCGAACGTGGATCAAAGAGCGCGGTCAGGATTACCCCGCCCAGACGCTCACCACGCATCTGTTTATTCCTCTGCGCCGACGGCTTCAGTGCCAACAACCGACTCTCCAGGCGCTGCTGGCGATCCTCGACGGCGTACTGATCAACTACATCGCCATTTGTCTGGCTTCGGCACGTAAAAAACAGGGTAAAGATGCGCTGGTGGTTGGCTGGAATATTCAGGATACCACCCGTCTGTGGCTGGAGGGCTGGATTGCCAGTCAACAAGGATGGCGCATTGATGTCCTCGCCCACTCGCTCAATCAACTACGCCCTGAACTGTTCGGAGGCCGTACATTGCTGGTGTGGTGCGGTGAAAATCGAACCTCCGCCCAACAGCAGCAACTCACCAGTTGGCAGGAACAAGGCCATGATATTTTCCCACTCGGCATTTAATGATTCGTTAACAAATGCGCTTTACTGTACAATCCTTTCGTTAACATAAGGAGTGCATTATGCGCATAGCTAAAATTTGGGTCATCGCCCTGTTCCTGTTTATGGCGTTAGGCGGAATTGGTGGCGTCATGCTCGCAGGTTATACCTTTATTTTGCGTGCTGGCTAAGCGCCTGCACCAGCCTTTCAAACAGGCGGTCTGCGATGATCGCCGCCAGTGCCACCAGTAACGCCCCCTGGATCACATACGCGGTATTAAATCCGCTAAGCCCGATGATGATGGGCGTACCCAGCGTGCTGGCCCCTACCGTTGAGGCGATCGTCGCCGTACCAATGTTGATAATCACCGAAGTTCGCACGCCCGCCAGAATCACCGGAGCCGCCAGCGGTAGCTCGACCTTACGCAGTCGCTGACCACGACTCATTCCCATACCTTTCGCAACTTCTGTCACGCTGGCATCAATCGCTCCCAGCCCGGCAAGTGTCGCCTGCAGGACGGGCAGCACACCGTAAAGGATCAAGGCGATAATCGCTGGTTGCAGACCAAAGCCGATCACCGGAACGGCAATCGCCAGCACTGCGACAGGCGGAAAAGTCTGCCCAACGGCGGCAATAGTTTCCACCAGTGGGCGAAATTCCGCGCCCCACGGGCGAGTAACAGCAATTCCGGCACCCGTGCCAATGATCACCGCAAACAAACTCGAAATTCCCACCAGCCAGAAATGAGCCAGTGCCAGAGCTGCAAAACTTTCTTGCTGATAAACGGGTCGTGGCAGTTGTGGGAACAAGGCAGCAAACAGCGGCTGGCTGTAAGGCAGCCAGAAAATCAGTGCCACAAACAAAGCAATGAGCCAGAACAGCGGATCGCGCAACATCTTCATACGCTTACGCCTCCACCAGCAGATCCTGAAAATGCAGCGTGCCGCAAGGCTGGCCCTGCATGTTCACCACCGGCAGCACCTCGCATCCCCGCGCAACAAACAGAGAGAGCGCATCGCGTAGCGTCATCTCTTCTGCCAGTGCCTCACCATCTGCTCGTTCTTCGCGACGCACGTAATCCGCCACACTACGTAACGAAAGCAGGCGCACACCCAGTTCACTACGTCCAAAAAACTGGCGGACAAAATCATTCGCCGGACGAGTCAGCATCGTCAGCGGATTGCCCTGCTGCACTACTTCACCGTGATCCATCAATACCAGATGTTCTGCCAGCCGTAGCGCCTCATCAATATCATGAGTGACCAGCACAATGGTACGCCCCAGCAAACGGTGAATGCGCGTCATCTCTTGTTGCAACGCGCCGCGCGTTACCGGGTCCAGTGCGCCAAAAGGTTCATCCATCAGTAAGACTTGCGGATCGGCAGCCAGTGCACGCGCCACTCCCACACGTTGCTGCTGACCACCGGAAAGCTGATGCGGATAACGCTCACGCAAATTTGGCTCCAGCCCCAGTAGCGCCATTAATTCGTCGATACGATCATCGATCCGCGCCCGTGACCATTTTTGTAATTGCGGCACGGTGGCGATGTTTTGCGCCACGCTCCAGTGGGGGAACAGGCCAATAGATTGAATGGCATAGCCCATCCGGCGGCGCAACTCCAGCACTGGCAGCGAGCGAATTTCTTCTCCGGCAAAGCGGATCACGCCGCTGTCATGCTCCACCAGGCGGTTAATCATTTTCAGGGTGGTGGATTTGCCGGAGCCAGATGTGCCAATCAGCACCGAAAAACTCCCTTCCTGAAAATTGAGATTGAGATCGTTAACGGCTTTTTGTGCGCCGAACAGTTTGCTGACATGGCTAAATTCAATCAT
Protein sequences of DBSCAN-SWA_7 >NZ_CP020368|2085011:2094453|2088771_2089242_-|WP_001295430.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKS >NZ_CP020368|2085011:2094453|2092790_2093522_-|WP_000783120.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRLRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK >NZ_CP020368|2085011:2094453|2086144_2088145_+|WP_001300967.1|DBSCAN-SWA MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENDALIATFSDGVRTQLANGQALKEAQCSCGANGMCRHRVMLVLSYQRLCATTQSTEKEEEWDPAIWLEELATLPDATRKRAQALVAKGITIELFCTPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKTQQAEFTHLIWQMRSEHVTSSDDPFANDEGNACRQYVQQLSQALWLGGISQPLIHYEAAFSRAQQAAERCNWRWVSESLRQLRASVDAFHARASHYHAGECLRQLAALNSRLNCAQEMARRDSVGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTDPDTGSILHLSRSWPRSEQENSPAATRRLFSFQAGALAGGQIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWQMLSAPLRQPGIVALREYLRQRPPACIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGEDNLLTLSLPASACSPFAVERMAALLQQTDDSVSLVSGFVSFVDGQLTLEPRVMMTKTRAWALDAETAPVAPLPSASVLPVPSTAHQLLMRCQALLIQLLHNGWRYQEQSAISQAELLANDLSAVGFYRLAHVLGQFRNTESEARVEAMNNGVLLCEQLFPMLQQQG >NZ_CP020368|2085011:2094453|2088269_2088731_+|WP_001296231.1|DBSCAN-SWA MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAARTLEPLSAKYKNIAGVEEKLTYTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK >NZ_CP020368|2085011:2094453|2091911_2092643_+|WP_001240403.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFGGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI >NZ_CP020368|2085011:2094453|2085011_2086148_+|WP_001300968.1|DBSCAN-SWA MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSFIPLARNFDFKSTLRANLQHWHPQHGKLYIESPRFNSRIKRQSEQWQLVLLVDQSGSMVDSVIHSAVIAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDRDTAQALVNVGAQIAAMTPGELASWLAENLQS >NZ_CP020368|2085011:2094453|2092702_2092810_+|WP_001216966.1|DBSCAN-SWA MRIAKIWVIALFLFMALGGIGGVMLAGYTFILRAG >NZ_CP020368|2085011:2094453|2093526_2094453_-|WP_000569344.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGVIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMALLGLEPNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSELGVRLLSLRSVADYVRREERADGEALAEEMTLRDALSLFVARGCEVLPVVNMQGQPCGTLHFQDLLVEA >NZ_CP020368|2085011:2094453|2090004_2091690_-|WP_001295431.1|DBSCAN-SWA MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >NZ_CP020368|2085011:2094453|2089288_2090008_-|WP_000598641.1|DBSCAN-SWA MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL |
10 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
2681539 : 2688678
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP020368|2681539:2688678|DBSCAN-SWA CATGAGTGCAATAGAAAATTTCGACGCCCATACGCCCATGATGCAGCAGTATCTCAGGCTGAAAGCCCAGCATCCCGAGATCCTGCTGTTTTACCGGATGGGTGATTTTTATGAACTGTTTTATGACGACGCAAAACGCGCGTCGCAACTGCTGGATATTTCACTGACCAAACGCGGTGCTTCGGCGGGAGAGCCGATCCCGATGGCGGGGATTCCCTACCATGCGGTGGAAAACTATCTCGCCAAACTGGTGAATCAGGGAGAGTCCGTTGCCATCTGCGAACAAATTGGCGATCCGGCGACCAGCAAAGGTCCGGTTGAGCGCAAAGTTGTGCGTATCGTTACGCCAGGCACCATCAGCGATGAAGCCCTGTTGCAGGAGCGTCAGGACAACCTGCTGGCGGCTATCTGGCAGGACAGCAAAGGTTTCGGCTACGCGACGCTGGATATCAGTTCCGGGCGTTTTCGCCTGAGCGAACCGGCTGACCGCGAAACGATGGCGGCAGAACTGCAACGCACTAATCCTGCGGAACTGCTGTATGCAGAAGATTTTGCTGAAATGTCGTTAATTGAAGGCCGTCGCGGCCTGCGCCGTCGCCCGCTGTGGGAGTTTGAAATCGACACCGCGCGCCAGCAGTTGAATCTGCAATTTGGGACCCGCGATCTGGTCGGTTTTGGCGTCGAGAACGCGCCGCGCGGACTTTGTGCTGCCGGTTGTCTGTTGCAGTATGCGAAAGATACCCAACGTACGACTCTGCCGCATATTCGTTCCATCACCATGGAACGTGAGCAGGACAGCATCATTATGGATGCCGCGACGCGTCGTAATCTGGAAATCACCCAGAACCTGGCGGGTGGTGCGGAAAATACGCTGGCTTCTGTGCTCGACTGCACCGTCACGCCGATGGGCAGCCGTATGCTGAAACGCTGGCTGCATATGCCAGTGCGCGATACCCGCGTGTTGCTTGAGCGCCAGCAAACTATTGGCGCATTGCAGGATTTCACCGCCGGGCTACAGCCGGTACTGCGTCAGGTCGGCGACCTGGAACGTATTCTGGCACGTCTGGCTTTACGAACTGCTCGCCCACGCGATCTGGCCCGTATGCGCCACGCTTTCCAGCAACTGCCGGAGCTGCGTGCGCAGTTAGAAACTGTCGATAGTGCACCGGTACAGGCGCTACGTGAGAAGATGGGCGAGTTTGCCGAGCTGCGCGATCTGCTGGAGCGAGCAATCATCGACACACCGCCGGTGCTGGTACGCGACGGTGGTGTTATCGCATCGGGCTATAACGAAGAGCTGGATGAGTGGCGCGCGCTGGCTGACGGCGCGACCGATTATCTGGAGCGTCTGGAAGTCCGCGAGCGTGAACGTACCGGCCTGGACACGCTGAAAGTTGGCTTTAATGCGGTGCACGGCTACTACATTCAAATCAGCCGTGGGCAAAGCCATCTGGCACCCATCAACTACATGCGTCGCCAGACGCTGAAAAACGCCGAGCGCTACATCATTCCAGAGCTAAAAGAGTACGAAGATAAAGTTCTCACCTCAAAAGGCAAAGCACTGGCACTGGAAAAACAGCTTTATGAAGAGCTGTTCGACCTGCTGTTGCCGCATCTGGAAGCGTTGCAACAGAGCGCGAGCGCGCTGGCGGAACTCGACGTGCTGGTTAACCTGGCGGAACGGGCCTATACCCTGAACTACACCTGCCCGACCTTCATTGATAAACCGGGCATTCGCATTACCGAAGGTCGCCATCCGGTAGTTGAACAAGTACTGAATGAGCCATTTATCGCCAACCCGCTGAATCTGTCGCCGCAGCGCCGCATGTTGATCATCACCGGTCCGAACATGGGCGGTAAAAGTACCTATATGCGCCAGACCGCACTGATTGCGCTGATGGCCTACATCGGCAGCTATGTACCGGCACAAAAAGTCGAGATTGGACCTATCGATCGCATCTTTACCCGCGTAGGCGCGGCAGATGACCTGGCGTCCGGGCGCTCAACCTTTATGGTGGAGATGACTGAAACCGCCAATATTTTACATAACGCCACCGAATACAGTCTGGTGTTAATGGATGAGATCGGGCGTGGAACGTCCACCTACGATGGTCTGTCGCTGGCGTGGGCGTGCGCGGAAAATCTGGCGAATAAGATTAAGGCATTGACGTTATTTGCTACCCACTATTTCGAGCTGACCCAGTTACCGGAGAAAATGGAAGGCGTCGCTAACGTGCATCTCGATGCACTGGAGCACGGCGACACCATTGCCTTTATGCACAGCGTGCAGGATGGCGCGGCGAGCAAAAGCTACGGCCTGGCGGTTGCAGCTCTGGCAGGCGTGCCAAAAGAGGTTATTAAGCGCGCACGGCAAAAGCTGCGTGAGCTGGAAAGCATTTCGCCGAACGCCGCCGCTACGCAAGTGGATGGTACGCAAATGTCTTTGCTGTCAGTACCAGAAGAAACTTCGCCTGCGGTCGAAGCTCTGGAAAATCTTGATCCGGATTCACTCACCCCGCGTCAGGCGCTGGAGTGGATTTATCGCTTGAAGAGCCTGGTGTAATAACAATTCCCGATAGTCTTTTGCTATCGGGAATATTAACGACAACTGACGAATAAAATAAAAACACCCTGTATAATAGGAAAGCTTATTTTACAGGGTAAAACCATGCCATCTACACGCTATCAAAAAATCAATGCCCATCACTATCGCCATATATGGGTCGTTGGTGATATTCATGGTGAATATCAGTTATTACAATCCCGCTTACATCAACTCTCTTTTTTCCCCAAAATCGACTTACTTATTTCTGTCGGCGATAATATTGATCGTGGACCGGAGAGTCTTGACGTCCTGCGCCTGCTAAACCAACCCTGGTTTACGTCGGTTAAAGGCAACCACGAAGCGATGGCGCTTGAGGCATTCGAAACTGGCGATGGCAATATGTGGCTTGCCAGCGGTGGTGACTGGTTTTTCGATTTAAATGATTCAGAGCAACAAGAGGCAATAGATCTGTTGCTGAAATTCCATCACCTTCCACATATTATTGAAATCACTAACGACAACATAAAATATGCCATCGCACATGCAGATTATCCGGGGAGTGAATATCTCTTTGGTAAAGAAATAGCGGAGAGCGAATTACTCTGGCCTGTTGATCGTGTGCAGAAATCGCTTAATGGCGAGTTACAACAAATAAACGGCGCTGATTATTTTATATTTGGACATATGATGTTTGATAACATTCAGACGTTCGCTAACCAGATTTATATTGATACCGGATCGCCGAACAGCGGGCGGCTGTCATTTTATAAAATAAAGTAGTCTCATGCTTCTTCTGTGAAGCATGAGTAACCCGGTGTTATTGCAGGCCATTATTCATTTTTCGCTACCAGCAAAGAGAGATCCTGCTTCACCAGCGCGCGACTGGCACTCTCCGGCAAACCGTCGTCTGTAATAATCTGATCAAACTCGCTTAATGGTAACGCCAGCCATGTCGCCACCTGACCATATTTCGTCGCATCACAGACCAAAACTCGCTGGCGGCTGGCACTGGCAATCGCCCGTTTCACCGTGACTTTATCTTCCGCTGGCGTAGAAATCCCCCGCACACTCCATGACGATGCAGAAATAAAAGCCTGATCAATCATCAGGCTGCGCAGCATGGTCGCAGCGGCTTCCCCGACACAGGAACGGTTTTCCCGACACACTGCACCGCCAGTGTGAATAATTGTGCAATTACTGTTGTCGAGCAAGTAGTCCGCAATAACGAAATCGTTTGTGACCACAGTCAGTGACTCCATGTGAATCAGATGCTGTGCTATCGCTAACGTGGTCGTTCCCGCATCCAGATAGATACAACTTCCCGGCTGAACAAGACTTGCCGCCAGCTTGCCAATAGCCGCTTTTTGCGTCATTGCCAGCGCAGTTTTTACCTGATGAGAAGGTTCATGCGCCACGCGTCCCGGAGACTGGACGCCTCCGGACACCAGCACAACGGCTCCCTGCTGCTCCAGTTTTTGTAAATCCCGACGAATGGTCATATGTGACACATTCATTCTGTCCGTTAGTTCAGCAATACTGACAATGCCTTTTTCAGCTACCATCTCAAGGATGATTTGGCGACGCTCTACGGGTATCAACTTTTGCTCCTTCCTTTGTCCTGCTGACATTCTACGCTATTTGCCTGCGAAACGTGCGCGGCGCAACTAACGCTTAGTTCACATAAAATAACACACAATGTTAATTTATGTGAATCAGATCACCATACCGTTATCTTCCAGCGCTTATATTCACAATATCAAACAAAATATCACTTAAATTAACAAGGAGAGCAGATGAAAACGGGATCTGAGTTTCATGTCGGTATCGTTGGCTTAGGGTCAATGGGAATGGGAGCAGCACTGTCATATGTCCGCGCAGGTCTTTCTACCTGGGGCGCAGACCTGAACAGCAATGCCTGCGCTACGTTGAAAGAGGCAGGTGCTTGCGGGGTTTCTGATAACGCCGCGACGTTTGCCGAAAAACTGGACGCACTGCTGGTGCTGGTGGTCAATGCGGCCCAGGTTAAACAGGTGCTGTTTGGTGAAACAGGCGTTGCACAACATCTGAAACCCGGTACGGCAGTAATGGTTTCTTCCACTATCGCTAGTGCTGATGCGCAAGAAATTGCTACCGCTCTGGCTGGATTCGATCTGGAAATGCTGGATGCGCCAGTTTCTGGTGGTGCAGTAAAAGCCGCTAACGGTGAAATGACTGTCATGGCCTCCGGTAGCGATATTGCCTTTGAACGACTGGCACCCGTGCTGGAAGCCGTTGCCGGAAAAGTTTATCGCATAGGTGCAGAACCGGGACTAGGTTCGACCGTAAAAATTATTCACCAGTTGTTAGCGGGCGTACATATTGCTGCCGGAGCCGAAGCGATGGCACTTGCAGCCCGTGCGGGGATCCCGCTGGATGTGATGTATGACGTCGTGACCAATGCCGCCGGAAATTCCTGGATGTTCGAAAACCGGATGCGTCATGTGGTGGATGGCGATTACACCCCGCATTCAGCCGTCGATATTTTTGTTAAGGATCTTGGTCTGGTTGCCGATACAGCCAAAGCCCTGCACTTCCCGCTGCCATTGGCCTCAACAGCATTGAATATGTTCACCAGCGCCAGTAACGCGGGTTACGGGAAAGAAGACGATAGCGCAGTTATCAAGATTTTCTCTGGCATCACTCTACCGGGAGCGAAATCATGATCAAGATTGGCGTTATCGCCGATGATTTTACCGGCGCGACGGATATCGCCAGTTTTCTGGTGGAAAACGGTCTACCAACGGTACAAATTAACGGTGTTCCAACAGGTAAAATGCCGGAAGCAATCGACGCACTGGTGATCAGCCTGAAAACGCGCTCCTGTCCAGTGGTTGAAGCCACACAGCAATCGCTGGCGGCTCTGAGCTGGTTGCAACAGCAAGGTTGCAAACAGATCTATTTCAAATACTGCTCTACTTTCGACAGTACGGCGAAAGGTAATATTGGCCCGGTTACCGATGCCTTAATGGATGCTCTCGACACGCCGTTTACGGTCTTCTCTCCGGCCCTGCCGGTCAACGGACGTACGGTTTATCAGGGGTATTTGTTCGTAATGAATCAACTGCTGGCCGAATCCGGGATGCGCCATCACCCGGTAAATCCCATGACCGACAGCTATCTTCCCCGTCTGGTTGAAGCGCAATCCACAGGGCGCTGCGGCGTCGTTTCGGCACATGTTTTCGAACAAGGTGTGGATGCCGTTCGTCAAGAGCTGGCTCGCTTACAGCAAGAGGGCTACCGCTACGCGGTGCTTGATGCGCTGACCGAACACCATCTGGAAATTCAGGGAGAAGCCTTGCGCGATGCCCCACTGGTAACGGGCGGTTCTGGTCTGGCGATTGGCCTGGCCCGGCAGTGGGCGCAAGAAAACGGTAACCAGGCTCGCAAAGCAGGGCGTCCGCTCGCTGGGCGCGGCGTAGTGCTCTCCGGTTCATGCTCTCAAATGACCAACCGCCAGGTAGCACATTACCGTCAAATTGCACCAGCCCGTGAAGTTGATGTGGCACGCTGCCTCTCAATTGAAACTCTGGCCGCTTATGCACACGAACTGGCAGAGTGGGTTCTGGGCCAGGAAAGTGTACTTGCTCCACTGGTTTTTGCCACCGCCAGCACTGACGCATTGGCAGCAATTCAACAGCAATACGGTGCACAAAAAGCCAGTCAGGCAGTAGAAACACTGTTTTCTCAACTAGCGGCGCGGTTAGCAGCGGAAGGCGTGACACGCTTTATTGTCGCAGGCGGTGAGACCTCCGGCGTAGTCACACAGAGCCTGGGAATAAAAGGGTTTCATATTGGCCCAACCATTTCCCCGGCGTGCCGTGGGTAAACGCACTGGATAAGCCTGTCTCACTCGCCCTTAAATCTGGCAACTTCGGTGATGACGCCTTTTTTTCACGAGCCCAAAGAGAGTTTTTATCATGAGCGATTTCGCAAAAGTAGAGCAGTCTTTGCGAGAGGAGATGACGCGGATTGCCAGTTCATTCTTTCAGCGCGGCTATGCAACCGGTTCGGCTGGCAATCTGTCGCTGCTTTTACCTGACGGGAATTTACTGGCGACACCGACAGGTTCATGCCTGGGCAATCTCGATCCGCAGCGGCTTTCCAAAGTCGCCGCGGATGGCGAATGGTTAAGTGGTGACAAACCCTCGAAAGAGGTGCTCTTTCATCTGGCGCTGTATCGCAACAATCCGCGCTGTAAAGCGGTGGTGCATTTGCACAGCACATGGTCGACGGCGCTTTCCTGCCTGCAAGGGCTGGACAGCAGCAACGTTATTCGTCCGTTCACACCATACGTGGTGATGCGGATGGGAAATGTCCCGCTGGTGCCTTATTACCGACCGGGCGATAAACGCATCGCACAGGATCTGGCGGAACTGGCAGCAGACAATCAGGCTTTTTTACTGGCAAATCATGGCCCAGTGGTTTGCGGTGAAAGCCTGCAAGAAGCCGCCAACAATATGGAAGAGCTGGAGGAAACGGCAAAGCTGATTTTTATTCTCGGTGACCGCCCGATCCGTTATCTGACCGCAGGTGAAATTGCGGAATTAAGGAGTTAA
Protein sequences of DBSCAN-SWA_8 >NZ_CP020368|2681539:2688678|2684206_2684863_+|WP_001141337.1|DBSCAN-SWA MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFFPKIDLLISVGDNIDRGPESLDVLRLLNQPWFTSVKGNHEAMALEAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDNIKYAIAHADYPGSEYLFGKEIAESELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGSPNSGRLSFYKIK >NZ_CP020368|2681539:2688678|2684913_2685681_-|WP_001300386.1|DBSCAN-SWA MIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGSCIYLDAGTTTLAIAQHLIHMESLTVVTNDFVIADYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALVKQDLSLLVAKNE >NZ_CP020368|2681539:2688678|2686781_2687948_+|WP_001393459.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQARKAGRPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSIETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPACRG >NZ_CP020368|2681539:2688678|2688039_2688678_+|WP_001278994.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS >NZ_CP020368|2681539:2688678|2681539_2684101_+|WP_001272928.1|DBSCAN-SWA MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAGLQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >NZ_CP020368|2681539:2688678|2685876_2686785_+|WP_000848004.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSYVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNAAQVKQVLFGETGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGAEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS |
6 | Escherichia_phage(83.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|