Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP040886 | Escherichia coli strain K71-77 chromosome, complete genome | 12 crisprs | DinG,DEDDh,WYL,RT,cas3,csa3,PD-DExK,cas5,cas6e,cas1,cas2,c2c9_V-U4 | 0 | 18 | 11 | 0 |
NZ_CP040885 | Escherichia coli strain K71-77 plasmid pK71-77-2, complete sequence | 0 crisprs | NA | 0 | 0 | 1 | 0 |
NZ_CP040884 | Escherichia coli strain K71-77 plasmid pK71-77-1-NDM, complete sequence | 0 crisprs | DEDDh | 0 | 0 | 12 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP040886_1 | 318272-318363 | Orphan |
NA
Consensus repeat of NZ_CP040886_1
|
1 spacer
spacers of NZ_CP040886_1
>1.1|318298|40|NZ_CP040886|CRISPRCasFinder GCGCTGCGGGTCATTCTTGAAATTACCCCCGCTGTGCTGT |
CRISPR arrays and Neighbor proteins around NZ_CP040886_1
The CRISPR arrays of NZ_CP040886_1 >merge|NZ_CP040886|1|318272-318363|CRISPRCasFinder CCACCTTTTTTACCTGCTTCAGATGCGCGCTGCGGGTCATTCTTGAAATTACCCCCGCTGTGCTGTCCACCTTTTTTACCTGCTTCTGATGC >NZ_CP040886|1|1|318272-318363|CRISPRCasFinder CCACCTTTTTTACCTGCTTCAGATGC GCGCTGCGGGTCATTCTTGAAATTACCCCCGCTGTGCTGT CCACCTTTTTTACCTGCTTCTGATGC
>NZ_CP040886.1|WP_001347171.1|316829_318158_+|pyrimidine-utilization-transport-protein-G MAMFGFPHWQLKSTSTESGVVAPDERLPFAQTAIMGVQHAVAMFGATVLMPILMGLDPNLSILMSGVGTLLFFFITGGRVPSYLGSSAAFVGVVIAATGFNGQGINPNISIALGGIIACGLVYTVIGLVVMKIGTRWIERLMPPVVTGAVVMAIGLNLAPIAVKSVSASAFDSWMAVMTVLCIGLVAVFTRGMIQRLLILVGLIVACLLYGVMTNLLGLGKAVDFTLVSHAAWFGLPHFSTPAFNSQAMMLIAPVAVILVAENLGHLKAVAGMTGRNMDPYMGRAFVGDGLATMLSGSVGGSGVTTYAENIGVMAVTKVYSTLVFVAAAVIAMLLGFSPKFGALIHTIPAAVIGGASIVVFGLIAVAGARIWVQNRVDLSQNGNLIMVAVTLVLGAGDFALTLGGFTLGGIGTATFGAILLNALLSRKLVDVPPPEVVHQEP >NZ_CP040886.1|WP_001028095.1|316314_316809_+|pyrimidine-utilization-flavin-reductase-protein-F MNIVDQQTFRDAMSCMGAAVNIITTDGPAGRAGFTASAVCSVTDTPPTLLVCLNRGASVWPVFNENRTLCVNTLSAGQEPLSNLFGGKTPMEHRFAAARWQTGVTGCPQLEEALVSFDCRISQVVSVGTHDILFCAIEAIHRHATPYGLVWFDRSYHALMRPAC >NZ_CP040886.1|WP_001001184.1|315713_316304_+|malonic-semialdehyde-reductase MNEAVSPGALSTLFTDARTHNGWRETPVSDETLRELYALMKWGPTSANCSPARIVFIRTAEGKERLRPALSSGNLQKTLTAPVTAIVAWDSEFYERLPLLFPHGDARSWFTSSPQLAEETAFRNSSMQAAYLIVACRALGLDTGPMSGFDRQYVDDAFFAGSTLKSNLLINIGYGDNSKLYARLPRLSFEEACGLL >NZ_CP040886.1|WP_001323674.1|314903_315704_+|pyrimidine-utilization-protein-D MKLSLSPPPYADAPVVVLISGLGGSGSYWLPQLAVLEQEYQVVCYDQRGTGNNPDTLAEDYSIAQMAAELHQALVAAGIEHYAVVGHALGALVGMQLALDYPASVTVLVCVNGWLRINAHTRRCFQVRERLLYSGGAQAWVEAQPLFLYPADWMAARAPRLEAEDALALAHFQGKNNLLRRLNALKRADFSHHAVRIRCPVQIICASDDLLVPSACSSELHAALPDSQKMVMRYGGHACNVTDPETFNALLLNGLASLLHHREAAL >NZ_CP040886.1|WP_001126787.1|314509_314896_+|pyrimidine-utilization-protein-C MPKSVIIPAGSSAPLAPFVPGTLADGVVYVSGTLAFDQHNNVLFADDPKAQTRHVLETIRTVIETAGGTMADVTFNSIFITDWKNYAAINEIYAEFFPGDKPARFCIQCGLVKPDALVEIATIAHIAK >NZ_CP040886.1|WP_001345643.1|313805_314498_+|peroxyureidoacrylate/ureidoacrylate-amidohydrolase-RutB MTTLTARPEAITFDPQQSALIVVDMQNAYATPGGYLDLAGFDVSTTRPVIANIQTAVTAARAAGMLIIWFQNGWDEQYVEAGGPGSPNFHKSNALKTMRKQPQLQGKLLAKGSWDYQLVDELVPQPGDIVLPKPRYSGFFNTPLDSILRSRGIRHLVFTGIATNVCVESTLRDGFFLEYFGVVLEDATHQAGPEFVQKAALFNIETFFGWVSDVETFCDALSPTSFARIA 
>NZ_CP040886.1|WP_001345642.1|312714_313806_+|pyrimidine-utilization-protein-A MKIGVFVPIGNNGWLISTHAPQYMPTFELNKAIVQKAEHYHFDFALSMIKLRGFGGKTEFWDHNLESFTLMAGLAAVTSRIQIYATAATLTLPPAIVARMAATIDSISGGRFGVNLVTGWQKPEYEQMGIWPGDDYFSRRYDYLTEYVQVLRDLWGSGKSDFKGDFFTMNDCRVSPQPSVPMKVICAGQSDAGMAFSAQYADFNFCFGKGVNTPTAFAPTAARMKQAAEQTGRDVGSYVLFMVIADETDDAARAKWEHYKAGADEEALSWLTEQSQKDTRSGTDTNVRQMADPTSAVNINMGTLVGSYASVARMLDEVASVPGAEGVLLTFDDFLSGIETFGERIQPLMQCRAHLPALTQEVA >NZ_CP040886.1|WP_001295606.1|311788_312427_-|HTH-type-transcriptional-regulator-RutR MTQGAVKTTGKRSRTVSAKKKAILSAALDTFSQFGFHGTRLEQIAELAGVSKTNLLYYFPSKEALYIAVLRQILDIWLAPLKAFREDFAPLAAIKEYIRLKLEVSRDYPQASRLFCMEMLAGAPLLMDELTGDLKALIDEKSALIAGWVKSGKLAPIDPQHLIFMIWASTQHYADFAPQVEAVTGATLRDEVFFNQTVENVQRIIIEGIRPR >NZ_CP040886.1|WP_001299828.1|307786_311749_+|trifunctional-transcriptional-regulator/proline-dehydrogenase/L-glutamate-gamma-semialdehyde-dehydrogenase MGTTTMGVKLDDATRERIKSAATRIDRTPHWLIKQAIFSYLEQLENSDTLPELPALLSGAANESDEAPTPAEEPHQPFLDFAEQILPQSVSRAAITAAYRRPETEAVSMLLEQARLPQPVAEQAHKLAYQLADKLRNQKNASGRAGMVQGLLQEFSLSSQEGVALMCLAEALLRIPDKATRDALIRDKISNGNWQSHIGRSPSLFVNAATWGLLFTGKLVSTHNEASLSRSLNRIIGKSGEPLIRKGVDMAMRLMGEQFVTGETIAEALANARKLEEKGFRYSYDMLGEAALTAADAQAYMVSYQQAIHAIGKASNGRGIYEGPGISIKLSALHPRYSRAQYDRVMEELYPRLKSLTLLARQYDIGINIDAEEADRLEISLDLLEKLCFEPELAGWNGIGFVIQAYQKRCPLVIDYLIDLATRSRRRLMIRLVKGAYWDSEIKRAQMDGLEGYPVYTRKVYTDVSYLACAKKLLAVPNLIYPQFATHNAHTLAAIYQLAGQNYYPGQYEFQCLHGMGEPLYEQVTGKVADGKLNRPCRIYAPVGTHETLLAYLVRRLLENGANTSFVNRIADTSLPLDELVADPVTAVEKLAQQEGQTGLPHPKIPLPRDLYGHGRDNSAGLDLANEHRLASLSSALLNSALQKWQALPMLEQPVAAGEMSPVINPAEPKDIVGFVREATPREVEQALESAVNNAPIWFATPPVERAAILHRAAVLMESQMQQLIGILVREAGKTFSNAIAEVREAVDFLHYYAGQVRDDFANETHRPLGPVVCISPWNFPLAIFTGQIAAALAAGNSVLAKPAEQTPLIAAQGIAILLEAGVPPGVVQLLPGQGETVGAQLTGDDRVRGVMFTGSTEVATLLQRNIASRLDAQGRPIPLIAETGGMNAMIVDSSALTEQVVVDVLASAFDSAGQRCSALRVLCLQDEIADHTLKMLRGAMAECRMGNPGRLTTDIGPVIDSEAKANIERHIQTMRSKGRPVFQAVRENSEDAREWQSGTFVAPTLIELDDFAELQKEVFGPVLHVVRYNRNQLPELIEQINASGYGLTLGVHTRIDETIAQVTGSAHVGNLYVNRNMVGAVVGVQPFGGEGLSGTGPKAGGPLY
LYRLLANRPESALAVTLARQDAEYPVDAQLKAALTQPLNALREWAANRPELQALCTQYGELAQAGTQRLLPGPTGERNTWTLLPRERVLCIADDEQDALTQLAAVLAVGSQVLWPDDALHRQLVKALPSAVSERIQLAKAENITAQPFDAVIFHGDSDQLRALCEAVAARDGAIVSVQGFARGESNILLERLYIERSLSVNTAAAGGNASLMTIG >NZ_CP040886.1|WP_001678465.1|305856_307365_-|sodium/proline-symporter-PutP MAISTPMLVTFCVYIFGMILIGFIAWRSTKNFDDYILGGRSLGPFVTALSAGASDMSGWLLMGLPGAVFLSGISESWIAIGLTLGAWINWKLVAGRLRVHTEYNNNALTLPDYFTGRFEDKSRILRIISALVILLFFTIYCASGIVAGARLFESTFGMSYETALWAGAAATILYTFIGGFLAVSWTDTVQASLMIFALILTPVIVIISVGGFGDSLEVIKQKSIENVDMLKGLNFVAIISLMGWGLGYFGQPHILARFMAADSHHSIVHARRISMTWMILCLAGAVAVGFFGIAYFNEHPAVAGAVNQNAERVFIELAQILFNPWIAGILLSAILAAVMSTLSCQLLVCSSAITEDLYKAFLRKHASQKELVWVGRVMVLVVALVAIALAANPENRVLGLVSYAWAGFGAAFGPVVLFSVMWSRMTRNGALAGMIIGALTVIVWKQFGWLGVYEIIPGFIFGSIGIVVFSLLGKAPSAAMQKRFAEADAHYHSAPPSRLQES >NZ_CP040886.1|WP_001151437.1|318786_319383_+|NAD(P)H:quinone-oxidoreductase MAKVLVLYYSMYGHIETMARAVAEGASKVDGAEVVVKRVPETMPPQLFEKAGGKTQTAPVATPQELADYDAIIFGTPTRFGNMSGQMRTFLDQTGGLWASGALYGKLASVFSSTGTGGGQEQTITSTWTTLAHHGMVIVPIGYAAQELFDVSQVRGGTPYGATTIAGGDGSRQPSQEELSIARYQGEYVAGLAVKLNG >NZ_CP040886.1|WP_001143120.1|319403_319631_+|hypothetical-protein MPTQEAKAHHVGEWASLRNTSPEIAEAIFEVAGYDEKMAEKIWEEGSDEVLVKAFAKTDKDSLFWGEQTIERKNV >NZ_CP040886.1|WP_001044313.1|319668_320910_-|bifunctional-glucose-1-phosphatase/inositol-phosphatase MNKTLIAATVAGIVLLASNAQAQTVPEGYQLQQVLMMSRHNLRAPLANNGSVLEQSTPNKWPEWDVPGGQLTTKGGVLEVYMGHYMREWLAQQGMVKSGECPPPDTVYAYANSLQRTVATAQFFITGAFPGCDIPVHHQEKMGTMDPTFNPVITDDSAAFSEQAVAAMEKELSKLQLTDSYQLLEKIVNYKDSPACKEKQQCSLVDGKNTFSAKYQQEPGVSGPLKVGNSLVDAFTLQYYEGFPMDQVAWGEIKSDQQWKVLSKLKNGYQDSLFTSPEVARNVAKPLVSYIDKALVTDRTSAPKITVLVGHDSNIASLLTALDFKPYQLHDQNERTPIGGKIVFQRWHDSKANRDLMKIEYVYQSAEQLRNADALTLQAPAQRVTLELSGCPIDANGFCPMDKFDSVLNEAVK >NZ_CP040886.1|WP_000097602.1|321201_322461_-|YccE-family-protein 
MSSNIHGISCTANNYLKQAWNNIKNEHEKNQKYSITLFENTLVCFMRLYKEIRRQKAEDYIPCLECDSLEKEFEEMQNDNDLSLFLRTLRTNDTETYSGVSEGITYTIQYVRDIDIVRVSLPGRGSESITDFKGYYWYGFMEYIENINACDDVFSEYCLDDENMSIQPEWINTPGISDLDTGIDLSGISFIQSEINKTYGLKYAPVDGDGYCLLRAILVLKEHEYSWALGSHKTQKQVYEEFIKIVDKQTIEALVDTAFNDLREDVKTLFGVNLQSDNKIQGQGGFLSWSFLSFKKEFIDSCLNDKKCILHLPEFIFNDNKARLVLDTDPEQKVNEVKNFLTALSDSICSLFIVNSNVASISLGNESFSTDDDLEYGYLINTGNHYDVYLPPELFAQAYELNNKERNAQIDFLTRYAIY >NZ_CP040886.1|WP_000420629.1|322720_323641_+|curved-DNA-binding-protein MELKDYYAIMGVKPTDDLKTIKTAYRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAEYDQMWQHRNDPQFNRQFHHSDGQSFNAEDFDDIFSSIFGQHARQSRQRPATRGHDIEIEVAVFLEETLTEHKRTISYNLPVYNAFGMIEQEIPKTLNVKIPAGVGNGQRIRLKGQGTPGENGGPNGDLWLVIHIAPHPLFDIVGHDLEIVVPVSPWEAALGAKVTVPTLKESILLTIPPGSQAGQRLRVKGKGLVSKKQTGDLYAVLKIVMPPKPDENTAALWQQLADAQSSFDPRKDWGKA >NZ_CP040886.1|WP_000024560.1|323640_323946_+|chaperone-modulator-CbpM MANVTVTFTITEFCLHTGISEEELNEIVGLGVVEPREIQETTWVFDDHAAIVVQRAVRLRHELALDWPGIAVALTLMDDIAHLKQENRLLRQRLSRFVAHP >NZ_CP040886.1|WP_000209869.1|324038_324638_-|molecular-chaperone-TorD MTTLTAQQIACVYAWLAQLFSRELDDEQLTQIASAQMAEWFSLLKSEPPLAAAVNELENCIATLTVRDDARLELAADFCGLFLMTDKQAALPYASAYKQDEQEIKRLLVEAGMETSGNFNEPADHLAIYLELLSHLHFSLGEGTVPARRIDSLRQKTLTALWQWLPEFVVRCRQYDSFGFYAALSQLLLVLVESDHQNR >NZ_CP040886.1|WP_001062101.1|324634_327181_-|trimethylamine-N-oxide-reductase-TorA 
MNNNDLFQASRRRFLAQLGGLTVAGMLGPSLLTPRRATAAQAATDAVISKEGILTGSHWGAIRATVKDGRFVAAKPFELDKYPSKMIAGLPDHVHNAARIRYPMVRVDWLRKRHLSDTSQRGDNRFVRVSWDEALDMFYEELERVQKTHGPSALLTASGWQSTGMFHNASGMLAKAIALHGNSVGTGGDYSTGAAQVILPRVVGSMEVYEQQTSWPLVLQNSKTIVLWGSDLLKNQQANWWCPDHDVYEYYAQLKAKVAAGEIEVISIDPVVTSTHEYLGREHVKHIAVNPQTDVPLQLALAYTLYSENLYDKNFLANYCVGFEQFLPYLLGEKDGQPKDAAWAEKLTGIDAETIRGLARQMAANRTQIIAGWCVQRMQHGEQWAWMIVVLAAMLGQIGLPGGGFGFGWHYNGAGTPGRKGVILSGFSGSTSIPPVHDNSDYKGYSSTIPIARFIDAILEPGKVINWNGKSVKLPPLKMCIFAGTNPFHRHQQINRIIEGWRKLETVIAIDNQWTSTCRFADIVLPATTQFERNDLDQYGNHSNRGIIAMKQVVPPQFEARNDFDIFRELCRRFNREEAFTEGLDEMGWLKRIWQEGVQQGKGRGVHLPAFDDFWNNKEYVEFDHPQMFVRHQAFREDPDLEPLGTPSGLIEIYSKTIADMNYDDCQGHPMWFEKIERSHGGPGSQKYPLHLQSVHPDFRLHSQLCESETLRQQYTVAGKEPVFINPQDASARGIRNGDVVRVFNARGQVLAGAVVSDRYAPGVARIHEGAWYDPDKGGEPGALCKYGNPNVLTIDIGTSQLAQATSAHTTLVEIEKYNGAVEQVTAFNGPVEMVAQCEYVPASQVKS >NZ_CP040886.1|WP_001323677.1|327180_328353_-|pentaheme-c-type-cytochrome-TorC MRKLWNALRRPSARWSVLALVAIGIVIGIALIVLPHVGIKVTSTTEFCVSCHSMQPVYEEYKQSVHFQNASGVRAECHDCHIPPDMPGMVKRKLEASNDIYQTFIAHSIDTPEKFEAKRAELAEREWARMKENNSATCRSCHNYDAMDHAKQHPEAARQMKVAAKDNQSCIDCHKGIAHQLPDMSSGFRKQFDELRASANDSGDTLYSIDIKPIYAAKGDKEASGSLLPASEVKVLKRDGDWLQIEITGWTESAGRQRVLTQFPGKRIFVASIRGDVQQQVKTLEKTTVADTNTEWSKLQATAWMKKGDMVNDIKPIWAYADSLYNGTCNQCHGAPEIAHFDANGWIGTLNGMIGFTSLDKREERTLLKYLQMNASDTAGKAHGDKKEEK >NZ_CP040886.1|WP_001120112.1|328482_329175_+|two-component-system-response-regulator-TorR MPHHIVIVEDEPVTQARLQSYFTQEGYTVSVTASGAGLREIMQNQPVDLILLDINLPDENGLMLTRALRERSTVGIILVTGRSDRIDRIVGLEMGADDYVTKPLELRELVVRVKNLLWRIDLARQAQPHTQDNCYRFAGYCLNVSRHTLERDGEPIKLTRAEYEMLVAFVTNPGEILSRERLLRMLSARRVENPDLRTVDVLIRRLRHKLSADLLVTQHGEGYFLAADVC |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP040886_2 | 623114-623258 | Orphan |
NA
Consensus repeat of NZ_CP040886_2
|
1 spacer
spacers of NZ_CP040886_2
>2.1|623166|41|NZ_CP040886|CRISPRCasFinder TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC |
CRISPR arrays and Neighbor proteins around NZ_CP040886_2
The CRISPR arrays of NZ_CP040886_2 >merge|NZ_CP040886|2|623114-623258|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGCTGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTCGTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC >NZ_CP040886|2|2|623114-623258|CRISPRCasFinder GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGGATGC TGCGAAAATGCCTTATCTGGCCTACAGATTCGATGCGATTC GTAGGTCGGATAAGATGCGCAAGCATCGCATCCGACAATAAGTGCCGAATGC
>NZ_CP040886.1|WP_001091569.1|621764_623048_+|putative-acyl-CoA-thioester-hydrolase MNTFSVSRLALALAFGVTLTACSSTPPDQRPSDQTAPGTSSRPILSAKEAQNFDAQHYFASLTPGAAAWNPSPITLPAQPDFVVGPAGTQGVTHTTIQAAVDAAIIKRTNKRQYIAVMPGEYQGTVYVPAAPGGITLYGTGEKPIDVKIGLSLDGGMSPADWRHDVNPRGKYMPGKPAWYMYDSCQSKRSDSIGVLCSAVFWSQNNGLQLQNLTIENTLGDSVDAGNHPAVALRTDGDQVQINNVNILGRQNTFFVTNSGVQNRLETNRQPRTLVTNSYIEGDVDIVSGRGAVVFDNTEFRVVNSRTQQEAYVFAPATLSNIYYGFLAVNSRFNAFGDGVAQLGRSLDVDANTNGQVVIRDSAINEGFNTAKPWADAVISNRPFAGNTGSVDDNDEIQRNLNDTNYNRMWEYNNRGVGSKVVAEAKK >NZ_CP040886.1|WP_000533646.1|620559_621630_+|tyrosine-type-recombinase/integrase MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >NZ_CP040886.1|WP_001303849.1|620363_620582_+|excisionase MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS >NZ_CP040886.1|WP_000545745.1|620156_620324_+|hypothetical-protein MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >NZ_CP040886.1|WP_071525073.1|620038_620224_-|hypothetical-protein MFSASITLLNGSPFHSPVTRKCIYHLHKTKPAVASSDKRNPRQCEDAVHCCYTLFCSQRKR >NZ_CP040886.1|WP_000120065.1|619311_619914_-|hypothetical-protein MSYFLRKKWMVNLSGSGKILWALNMKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >NZ_CP040886.1|WP_000763365.1|618879_619101_+|TraR/DksA-family-transcriptional-regulator MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >NZ_CP040886.1|WP_001395510.1|618499_618781_+|cell-division-protein-ZapA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER 
>NZ_CP040886.1|WP_023148020.1|618297_618489_+|DUF1382-family-protein MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >NZ_CP040886.1|WP_072126246.1|618142_618325_+|DUF1317-domain-containing-protein MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >NZ_CP040886.1|WP_001372426.1|623281_625543_-|hydratase MIKLSEKGVFLASNNEIIAEEHFTGEIKKEEAQKGTIAWSILSSHNTSGNMDKLKIKFDSLASHDITFVGIVQTAKASGMERFPLPYVLTNCHNSLCAVGGTINGDDHVFGLSAAQRYGGIFVPPHIAVIHQYMREMMAGGGKMILGSDSHTRYGALGTMAVGEGGGELVKQLLNDTWDIDYPGVVAVHLTGKPAPYVGPQDVALAIIGAVFKNGYVKNKVMEFVGPGVSALSTDFRNSVDVMTTETTCLSSVWQTDEEVHNWLALHGRGQDYCQLNPQPMAYYDGCISVDLSAIKPMIALPFHPSNVYKIDTLNQNLTDILREIEIESERVAHGKAKLSLLDKVENGRLKVQQGIIAGCSGGNYENVIAAANALRGQSCGNDTFSLAVYPSSQPVFMDLAQKGVVADLIGAGAIIRTAFCGPCFGAGDTPINNGLSIRHTTRNFPNREGSKPANGQMSAVALMDARSIAATAANGGYLTSASELDCWDNVPEYAFDVTPYKNRVYQGFVKGATQQPLIYGPNIKDWPELGALTDNIVLKVCSKILDEVTTTDELIPSGETSSYRSNPIGLAEFTLSRRDPGYVGRSKATAELENQRLAGNVSELTEVFARIKQIAGQEHIDPLQTEIGSMVYAVKPGDGSAREQAASCQRVIGGLANIAEEYATKRYRSNVINWGMLPLQMAEVPTFEVGDYIYIPGIKAALDNPGTTFKGYVIHEDAPVTEITLYMGSLTAEEREIIKAGSLINFNKNRQM >NZ_CP040886.1|WP_001036475.1|625725_627159_-|anion-permease MNKKSLWKLILILAIPCIIGFMPAPAGLSELAWVLFGIYLAAIVGLVIKPFPEPVVLLIAVAASMVVVGNLSDGAFKTTAVLSGYSSGTTWLVFSAFTLSAAFVTTGLGKRIAYLLIGKIGNTTLGLGYVTVFLDLVLAPATPSNTARAGGIVLPIINSVAVALGSEPEKSPRRVGHYLMMSIYMVTKTTSYMFFTAMAGNILALKMINDILHLQISWGGWALAAGLPGIIMLLVTPLVIYTMYPPEIKKVDNKTIAKAGLAELGPMKIREKMLLGVFVLALLGWIFSKSLGVDESTVAIVVMATMLLLGIVTWEDVVKNKGGWNTLIWYGGIIGLSSLLSKVKFFEWLAEVFKNNLAFDGHGNVAFFVIIFLSIIVRYFFASGSAYIVAMLPVFAMLANVSGAPLMLTALALLFSNSYGGMVTHYGGAAGPVIFGVGYNDIKSWWLVGAVLTILTFLVHITLGVWWWNMLIGWNML >NZ_CP040886.1|WP_001372427.1|627234_628287_-|4-oxalomesaconate-tautomerase 
MKKIPCVMMRGGTSRGAFLLAEHLPEDQTQRDKILMAIMGSGNDLEIDGIGGGNPLTSKVAIISRSSDLRADVDYLFAQVIVHEQRVDTTPNCGNMLSGVGAFAIENGLIAATSPVTRVRIRNVNTGTFIEADVQTPNGVVEYEGSARIDGVPGTAAPVALTFLNAAGTKTGKVFPTDNQIDYFDDVPVTCIDMAMPVVIIPAEYLGKTGYELPAELDADKALLARIESIRLQAGKAMGLGDVSNMVIPKPVLISPAQKGGAINVRYFMPHSCHRALAITGAIAISSSCALEGTVTRQIVPSVGYGNINIEHPSGALDVHLSNEGQDATTLRASVIRTTRKIFSGEVYLP >NZ_CP040886.1|WP_000679972.1|628470_629424_+|LysR-family-transcriptional-regulator MKHELSSMKAFVILAESSSFNNAAKLLNITQPALTRRIKKMEEDLHIQLFERTTRKVTLTKAGKRLLPEARELIKKFDETLFNIRDMNAYHRGMVTLACIPTAVFYFLPLAIGKFNELYPNIKVRILEQGTNNCMESVLCNESDFGINMNNVTNSSIDFTPLVNEPFVLACRRDHPLAKKQLVEWQELVGYKMIGVRSSSGNRLLIEQQLADKPWKLDWFYEVRHLSTSLGLVEAGLGISALPGLAMPHAPYSSIIGIPLVEPVIRRTLGIIRRKDAVLSPAAERFFALLINLWTDDKDNLWTNIVERQRHALQEIG >NZ_CP040886.1|WP_000815449.1|629464_630460_-|6-phosphogluconolactonase MKQTVYIASPESQQIHVWNLNHEGALTLTQVVDVPGQVQPMVVSPDKRYLYVGVRPEFRVLAYSIAPDDGALTFAAESALPGSPTHISTDHQGQFVFVGSYNAGNVSVTRLEDGLPVGVVDVVEGLDGCHSANISPDNRTLWVPALKQDRICLFTVSDDGHLVAQDPAEVTTVEGAGPRHMVFHPNEQYAYCVNELNSSVDVWELKDPHGNIECVQTLDMMPENFSDTRWAADIHITPDGRHLYACDRTASLITVFSVSEDGSVLSKEGFQPTETQPRGFNVDHSGKYLIAAGQKSHHISVYEIVGEQGLLHEKGRYAVGQGPMWVVVNAH >NZ_CP040886.1|WP_000213425.1|630614_631433_+|pyridoxal-phosphatase MTTRVIALDLDGTLLTPKKTLLPSSIEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYHAKTVLEADPMPVNKALQLIEMLNEHHIHGLMYVDDAMVYEHPTGHVIRTSNWAQTLPPEQRPTFTQVASLAETAQQVNAVWKFALTHDDLPQLQHFGKHVEHELGLECEWSWHDQVDIARGGNSKGKRLTKWVEAQGWSMENVVAFGDNFNDISMLEAAGTGVAMGNADDAVKARANIVIGDNTTDSIAQFIYSHLI >NZ_CP040886.1|WP_000891692.1|631433_632492_-|molybdenum-ABC-transporter-ATP-binding-protein-ModC MLELNFSQTLGNHCLTINETLPANGITAIFGVSGAGKTSLINAISGLTRPQKGRIVLNGRVLNDAEKGICLTPEKRRVGYVFQDARLFPHYKVRGNLRYGMSKSMVDQFDKLVALLGIEPLLDRLPGSLSGGEKQRVAIGRALLTAPELLLLDEPLASLDIPRKRELLPYLQRLTREINIPMLYVSHSLDEILHLADRVMVLENGQVKAFGALEEVWGSSVMNPWLPKEQQSSILKVTVLEHHPHYAMTALALGDQHLWVNKLDEPLQAALRIRIQASDVSLVLQPPQQTSIRNVLRAKVVNSYDDNGQVEVELEVGGKTLWARISPWARDELAIKPGLWLYAQIKSVSITA 
>NZ_CP040886.1|WP_000604034.1|632494_633184_-|molybdate-ABC-transporter-permease-subunit MILTDPEWQAVLLSLKVSSLAVLFSLPFGIFFAWLLVRCTFPGKALLDSVLHLPLVLPPVVVGYLLLVSMGRRGFIGERLYDWFGITFAFSWRGAVLAAAVMSFPLMVRAIRLALEGVDVKLEQAARTLGAGRWRVFFTITLPLTLPGIIVGTVLAFARSLGEFGATITFVSNIPGETRTIPSAMYTLIQTPGGESGAARLCIISIALAMISLLISEWLARISRERAGR >NZ_CP040886.1|WP_000101993.1|633183_633957_-|molybdate-ABC-transporter-substrate-binding-protein MARKWLNLFAGAALSFAVAGNALADEGKITVFAAASLTNAMQDIATQYKKEKGVDVVSSFASSSTLARQIEAGAPADLFISADQKWMDYAVDKKAIDTATRQTLLGNSLVVVAPKASEQKDFTIDSKTNWTSLLNGGRLAVGDPEHVPAGIYAKEALQKLGAWDTLSPKLAPAEDVRGALALVERNEAPLGIVYGSDAVASKGVKVVAIFPEDSHKKVEYPVAVVEGHNNATVKAFYDYLKGPQAAEIFKRYGFTTK >NZ_CP040886.1|WP_000891515.1|634123_634273_-|multidrug-efflux-pump-accessory-protein-AcrZ MLELLKSLVFAVIMVPVVMAIILGLIYGLGEVFNIFSGVGKKDQPGQNH |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP040886_3 | 1120034-1120187 | Orphan |
NA
Consensus repeat of NZ_CP040886_3
|
1 spacer
spacers of NZ_CP040886_3
>3.1|1120087|48|NZ_CP040886|CRISPRCasFinder TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA |
CRISPR arrays and Neighbor proteins around NZ_CP040886_3
The CRISPR arrays of NZ_CP040886_3 >merge|NZ_CP040886|3|1120034-1120187|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCGTCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAACGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG >NZ_CP040886|3|3|1120034-1120187|CRISPRCasFinder CGCCTTATCCGGCCTACCGATCCAGCACAGGTTTGTAGGCATGATAAGACGCG TCAGCGTCGCATCAGGCATCTGCGCATAACCGCCGGATGCGGCGTAAA CGCCTTATCCGGCCTACCGATCCGGCACAGGTTTGTAGGCATGATAAGACGCG
>NZ_CP040886.1|WP_000952760.1|1118159_1119899_+|flagellar-type-III-secretion-system-protein-FlhA MLSRSDLLTLLTINFIVVTKGAERISEVSARFTLDAMPGKQMAIDADLNAGLINQAQAQTRRKDVASEADFYGAMDGASKFVRGDAIAGMMILAINLIGGVCIGIFKYNLSADAAFQQYVLMTIGDGLVAQIPSLLLSTAAAIIVTRISDNGDITHDVRHQLLASPSVLYTATGIMFVLAVVPGMPHLPFLLFSALLGFTGWRMSKRPQAAEAEEKSLETLTRTITETSEQQVSWETIPLIEPISLSLGYKLVALVDKAQGNPLTQRIRGVRQVISDGNGVLLPEIRIRENFRLKPSQYAIFINGIKADEADIPADKLMALPSSETYGEIDGVLGNDPAYGMPVTWIQPAQKAKALNMGYQVIDSASVIATHVNKIVRSYIPDLFSYDDITQLHNRLSSMAPRLAEDLSAALNYSQLLKVYRALLTEGVSLRDIVTIATVLVASSAVTKDHILLAADVRLALRRSITHPFVRKQELTVYTLNNELENLLTNVVNQAQQGGKVMLDSVPVDPNMLNQFQSTMPQVKEQMKAAGKDPVLLVPPQLRPLLARYARLFAPGLHVLSYNEVPDELELKIMGALM >NZ_CP040886.1|WP_001371717.1|1117429_1118200_-|putative-lateral-flagellar-export/assembly-protein-LafU MIVNSVSKSERESIIAALHGQSIFSGGGLSPLNKISPSHPPKPATVAVPEETEKKARDVNEKTALLKKKSATELGELATSINTIARDAHMEANLEMEIVPQGLRVLIKDDQNRNMFECGSAQIMPFFKTLLVELAPVFDSLDNKIIITGHTDAMAYKNNIYNNWNLSGDRALSARRVLEEAGMPEDKVMQVSAMADQMLLDAKNPQSAGNRRIEIMVLTKSASDTLYQYFGQHGDKVVQPLVQKLDKQQVLSQRMR >NZ_CP040886.1|WP_001226155.1|1116303_1117359_-|DNA-polymerase-IV MRKIIHVDMDCFFAAVEMRDNPALRDIPIAIGGSRERRGVISTANYPARKFGVRSAMPTGMALKLCPHLTLLPGRFDAYKEASNHIREIFSRYTSRIEPLSLDEAYLDVTDSVHCHGSATLIAQEIRQTIFNELHLTASAGVAPVKFLAKIASDMNKPNGQFVITPAEVPAFLQTLPLAKIPGVGKVSAAKLEAMGLRTCGDVQKCDLVILLKRFGKFGRILWERSQGIDERDVNSERLRKSVGVERTMAEDIHHWSECEAIIERLYPELERRLAKVKPDLLIARQGVKLKFDDFQQTTQEHVWPRLNKADLIATARKTWDERRGGRGVRLVGLHVTLLDPQMERQLVLGL >NZ_CP040886.1|WP_001059874.1|1115854_1116307_-|GNAT-family-N-acetyltransferase MNNIQIRNYQPGDFQQLCAIFIRAVMMTASQHYSPQQIAAWAQIDESRWKEKLAKSQVRVAVINAQPVGFISRIERHIDMLFVDPEYTRRGVASALLKPLIKSESELTVDASITAKPFFERYGFQIVKQQHVECRGAWFTNFYMRYKPQH >NZ_CP040886.1|WP_001295202.1|1115281_1115548_-|hypothetical-protein MEWYMGKYIRPLSDAVFTIASDDLWIESLAIQQLHTTANLPNMQRVVGMPDLHPGRGYPIGAAFFSVGRFYPARRRGNGAGNRNGPLL >NZ_CP040886.1|WP_001293003.1|1113467_1114925_+|cytosol-nonspecific-dipeptidase 
MSELSQLSPQPLWDIFAKICSIPHPSYHEEQLAEYIVGWAKEKGFHVERDQVGNILIRKPATAGMENRKPVVLQAHLDMVPQKNNDTVHDFTKDPIQPYIDGEWVKARGTTLGADNGIGMASALAVLADENVVHGPLEVLLTMTEEAGMDGAFGLQSNWLQADILINTDSEEEGEIYMGCAGGIDFTSNLHLDREAVPAGFETFKLTLKGLKGGHSGGEIHVGLGNANKLLVRFLAGHAEELDLRLIDFNGGTLRNAIPREAFATIAVAADKVDALKSLVNTYQDILKNELAEKEKNLALLLDSVANDKAALIAKSRDTFIRLLNATPNGVIRNSDVAKGVVETSLNVGVVTMTDNNVEIHCLIRSLIDSGKDYVVSMLDSLGKLAGAKTEAKGAYPGWQPDANSPVMHLVRETYQRLFNKTPNIQIIHAGLECGLFKKPYPEMDMVSIGPTITGPHSPDEQVHIKSVGHYWTLLTELLKEIPAK >NZ_CP040886.1|WP_001291992.1|1112748_1113207_-|xanthine-phosphoribosyltransferase MSEKYIVTWDMLQIHARKLASRLMPSEQWKGIIAVSRGGLVPGALLARELGIRHVDTVCISSYDHDNQRELKVLKRAEGDGEGFIVIDDLVDTGGTAVAIREMYPKAHFVTIFAKPAGRPLVDNYVVDIPQDTWIEQPWDMGVVFVPPISGR >NZ_CP040886.1|WP_000189539.1|1111412_1112657_-|esterase-FrsA MTQANLSETLFKPRFKHPETSTLVRRFNHGAQPPVQSALDGKTIPHWYRMINRLMWIWRGIDPREILDVQARIVMSDAERTDDDLYDTVIGYRGGNWIYEWATQAMVWQQKACAEEDPQLSGRHWLHAATLYNIAAYPHLKGDDLAEQAQALSNRAYEEAAQRLPGTMRQMEFTVPGGAPITGFLHMPKGDGPFPTVLMCGGLDAMQTDYYSLYERYFAPRGIAMLTIDMPSVGFSSKWKLTQDSSLLHQHVLKALPNVPWVDHTRVAAFGFRFGANVAVRLAYLESPRLKAVACLGPVVHTLLSDFKCQQQVPEMYLDVLASRLGMHDASDDALRVELNRYSLKVQGLLGRRCPTPMLSGYWKNDPFSPEEDSRLITSSSADGKLLEIPFNPVYRNFDKGLQEITGWIEKRLC >NZ_CP040886.1|WP_000174677.1|1110953_1111355_-|sigma-factor-binding-protein-Crl MTLPSGHPKSRLIKKFTALGPYIREGKCEDNRFFFDCLAVCVNVKPAPEVREFWGWWMELEAQESRFTYSYQFGLFDKAGDWKSVPVKDTEVVERLEHTLREFHEKLRELLTTLNLKLEPADDFRDEPVKLTA >NZ_CP040886.1|WP_000749881.1|1109859_1110915_+|phosphoporin-PhoE MKKSTLALVVMGIVASASVQAAEIYNKDGNKLDVYGKVKAMHYMSDNDSKDGDQSYIRFGFKGETQINDQLTGYGRWEAEFAGNKAESDTAQQKTRLAFAGLKYKDLGSFDYGRNLGALYDVEAWTDMFPEFGGDSSAQTDNFMTKRASGLATYRNTDFFGVIDGLNLTLQYQGKNENRDVKKQNGDGFGTSLTYDFGGSDFAISGAYTNSDRTNEQNLQSRGTGKRAEAWATGLKYDANNIYLATFYSETRKMTPITGGFANKTQNFEAVAQYQFDFGLRPSLGYVLSKGKDIEGIGDEDLVNYIDVGATYYFNKNMSAFVDYKINQLDSDNKLNINNDDIVAVGMTYQF >NZ_CP040886.1|WP_000006256.1|1120216_1120714_-|REP-associated-tyrosine-transposase-RayT 
MSEYRRYYIKGGTWFFTVNLRNRRSQLLTTQYQMLRHAIIKVKRDRPFEINAWVVLPEHMHCIWTLPEGDDDFSSRWREIKKQFTHACGLKNIWQPRFWEHAIRNTKDYRHHVDYIYINPVKHGWVKQVSDWPFSTFHRDVARGLYPIDWAGDVTDINAGERIIL >NZ_CP040886.1|WP_000009291.1|1120889_1121648_-|C40-family-peptidase MSFMSSFLLGRFLHPGVFSLCVLLPLFASATTSHISFSYAARQRMQNRARLLKQYQTHLKKQASYIVEGNAESRRALRQHNREQIKQHPEWFPAPLKASDRRWQALAENNHFLSSDHLHNITEVAIHRLEQQLGKPYVWGGTRPDQGFDCSGLVFYAYNKILEAKLPRTANEMYHYHRATIVANNDLRRGDLLFFHIHSREIADHMGVYLGDGQFIESPRTGENIRVSRLAEPFWQDHFLGARRILTEETIL >NZ_CP040886.1|WP_001225679.1|1121939_1122680_+|murein-L,D-transpeptidase MRKIALILAMLLIPCVSFAGLLGSSSSTTPVSKEYKQQLMGSPVYIQIFKEERTLDLYVKMGEQYQLLDSYKICKYSGGLGPKQRQGDFKSPEGFYSVQRNQLKPDSRYYKAINIGFPNAYDRAHGYEGKYLMIHGDCVSIGCYAMTNQGIDEIFQFVTGALVFGQPSVQVSIYPFRMTDANMKRHKYSNFKDFWEQLKPGYDYFEQTRKPPTVSVVNGRYVVSKPLSHEVVQPQLASNYTLPEAK >NZ_CP040886.1|WP_000333380.1|1122650_1123418_-|class-II-glutamine-amidotransferase MCELLGMSANVPTDICFSFTGLVQRGGGTGPHKDGWGITFYEGKGCRTFKDPQPSFNSPIAKLVQDYPIKSCSVVAHIRQANRGEVALENTHPFTRELWGRNWTYAHNGQLTGYKSLETGNFRPVGETDSEKAFCWLLHKLTQRYPRTPGNMAAVFKYIASLADELRQKGVFNMLLSDGRYVMAYCSTNLHWITRRAPFGVATLLDQDVEIDFSSQTTPNDVVTVIATQPLTGNETWQKIMPGEWRLFCLGERVV >NZ_CP040886.1|WP_000284050.1|1123623_1124202_-|D-sedoheptulose-7-phosphate-isomerase MYQDLIRNELNEAAETLANFLKDDANIHAIQRAAVLLADSFKAGGKVLSCGNGGSHCDAMHFAEELTGRYRENRPGYPAIAISDVSHISCVGNDFGFNDIFSRYVEAVGREGDVLLGISTSGNSANVIKAIAAAREKGMKVITLTGKDGGKMAGTADIEIRVPHFGYADRIQEIHIKVIHILIQLIEKEMVK >NZ_CP040886.1|WP_000973093.1|1124441_1126886_+|acyl-CoA-dehydrogenase 
MMILSILATVVLLGALFYHRVSLFISSLILLAWTAALGVAGLWSAWVLVPLAIILVPFNFAPMRKSMISAPVFRGFRKVMPPMSRTEKEAIDAGTTWWEGDLFQGKPDWKKLHNYPQPRLTAEEQAFLDGPVEEACRMANDFQITHELADLPPELWAYLKEHRFFAMIIKKEYGGLEFSAYAQSRVLQKLSGVSGILAITVGVPNSLGPGELLQHYGTDEQKNHYLPRLARGQEIPCFALTSPEAGSDAGAIPDTGIVCMGEWQGQQVLGMRLTWNKRYITLAPIATVLGLAFKLSDPEKLLGGAEDLGITCALIPTTTPGVEIGRRHFPLNVPFQNGPTRGKDVFVPIDYIIGGPKMAGQGWRMLVECLSVGRGITLPSNSTGGVKSVALATGAYAHIRRQFKISIGKMEGIEEPLARIAGNAYVMDAAASLITYGIMLGEKPAVLSAIVKYHCTHRGQQSIIDAMDITGGKGIMLGQSNFLARAYQGAPIAITVEGANILTRSMMIFGQGAIRCHPYVLEEMEAAKNNDVNAFDKLLFKHIGHVGSNKVRSFWLGLTRGLTSSTPTGDATKRYYQHLNRLSANLALLSDVSMAVLGGSLKRRERISARLGDILSQLYLASAVLKRYDDEGRNEADLPLVHWGVQDALYQAEQAMDDLLQNFPNRVVAGLLNVVIFPTGRHYLAPSDKLDHKVAKILQVPNATRSRIGRGQYLTPSEHNPVGLLEEALVDVIAADPIHQRICKELGKNLPFTRLDELAHNALAKGLIDKDEAAILVKAEESRLCSINVDDFDPEELATKPVKLPEKVRKVEAA >NZ_CP040886.1|WP_000532698.1|1126928_1127402_-|C-lysozyme-inhibitor MGRISSGGMMFKAITTVAALVIATSAMAQDDLTISSLAKGETTKAAFNQMVQGHKLPAWVMKGGTYTPAQTVTLGDETYQVMSACKPHDCGSQRIAVMWSEKSNQMTGLFSTIDEKTSQEKLTWLNVNDALSIDGKTVLFAALTGSLENHPDGFNFK >NZ_CP040886.1|WP_001118055.1|1127555_1128326_+|2-oxoglutaramate-amidase MPGLKITLLQQPLVWMDGPANLRHFDRQLEGITGRDVIVLPEMFTSGFAMEAAASSLAQNDVVNWMTAKAQQCNALIAGSVALQTESGSVNRFLLVEPGGTVHFYDKRHLFRMADEHLHYKAGNARVIVEWRGWRILPLVCYDLRFPVWSRNLNDYDLAIYVANWPAPRSLHWQALLTARAIENQAYVAGCNRVGSDGNGCHYRGDSRVINPQGEIIATADAHQATRIDAELSMVALREYREKFPAWQDADEFRLR >NZ_CP040886.1|WP_000978828.1|1129764_1130214_-|hypothetical-protein MMKYLMVLLSLFSGSVLGMGRVNELCGIDSVKTIEIINLPSYVTTLVPLSKEGLNEIYRYKVVVNEISDLYAGKIIDLLQMKYFRKEKYNNIRWGVSIISKGNNKCEIYFDAFGECGSVNGINVCFEKNEMIGWIKKEIPLLSQKIGGL >NZ_CP040886.1|WP_023147999.1|1130225_1133285_-|RHS-repeat-protein 
MTSPLNSEGRYTEGEGGLKRVVKKEHADGSITRSEYDEAGRLKAQTDAAGRRTEYSLHMASGAVTAVTGPDGRTVRYGYNSQRQVTSVTYPDGLRSSREYDEKGRLTAETSRSGETTRYSYDDPASELPTGIQDATGSTKQMAWSRYGQLLAFTDCSGYTTRYEYDRYGQQIAVHREEGISTYSSYNPRGQLVSQKDAQGREIRYEYSAAGDLTATISPDGKRSTIEYDKRGRPVSVTEGGLTRSMGYDAAGRITVLTNENGSQSTFRYDPVDRLTEQRGFDGRTQRYHYDLTGKLTQSEDEGLITLWHYDASDRITHRTVNGDPAEQWQYDEHGWLTTLSHTCEGHRVSVHYGYDDKGRLTGERQTVENPETGEMLWEHETGHAYSEQGLATRQEPDGLPPVEWLTYGSGYLAGMKLGGTPLVEYTRDRLHRETARSFGGAGSTAGYEQATAYTLTGQLQSRHLNLPQLDCDYTWNDNGQLVRISGPQECREYRYSGTGRLTGVHTTAANLDIDIPYATDPAGNRLPDPELHPDSTLTAWPDNRIAEDAHYVYRYDEYGRLAEKTDRIPEGVIRMHDERTHHYHYDSQHRLVFYTRIQHGEPQVESRYLYDPLGRRTGKRVWRRERDLTGWMSLSRKPEETWYGWDGDRLTTVQTQQTRIQTVYQPGSFTPLLRIETENGEQAKARHRSLAEVLQEDTGVTLPAELAVMLGRLERELRQGSVSEESQQWLAQCGLTAEQMAAQLEAEYIPERKLHLYHCDHRGLPLALISPEGETAWQGEYDEWGNLLGEESAQHLQQSLRLPGQQYDEESGLYYNRNRYYDPLQGRYITQDPIGLRGEWNLYKYPLNPVRFIDSLGLKFHVNGDPSDFNQAVEYLKQDSQMKETIDFLSSSEETINIEYIEGTNVRFNSNNMAIYWNSRASLFCSTELNSKSQSPALGLGHEFAHAQYYLLDKENFMALLSRTDKKYENKEEARVITIIESRAAKTLGECTRGAHSGLPFYRVDGPLQTMKITGTPE |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP040886_4 | 1309366-1309481 | Orphan |
NA
Consensus repeat of NZ_CP040886_4
|
1 spacer
spacers of NZ_CP040886_4
>4.1|1309397|54|NZ_CP040886|CRISPRCasFinder TGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTC |
CRISPR arrays and Neighbor proteins around NZ_CP040886_4
The CRISPR arrays of NZ_CP040886_4 >merge|NZ_CP040886|4|1309366-1309481|CRISPRCasFinder AACGCCTGATGCGACGCTGACGCGTCTTATCTGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTCAACGCCTGATGCGACGCTGGCGCGTCTTATC >NZ_CP040886|4|4|1309366-1309481|CRISPRCasFinder AACGCCTGATGCGACGCTGACGCGTCTTATC TGGCCTACGCGCTGTGTTTTTGTAGGCCGGATAAGCAAAGCGCATCCGGCATTC AACGCCTGATGCGACGCTGGCGCGTCTTATC
>NZ_CP040886.1|WP_000151734.1|1307843_1309346_+|L-arabinose-isomerase MTIFDNYEVWFVIGSQHLYGPETLRQVTQHAEHVVNALNTEAKLPCKLVLKPLGTTPDEITAICRDANYDDRCAGLVVWLHTFSPAKMWINGLTMLNKPLLQFHTQFNAALPWDSIDMDFMNLNQTAHGGREFGFIGARMRQQHAVVTGHWQDKQAHERIGSWMRQAVSKQDTRHLKVCRFGDNMREVAVTDGDKVAAQIKFGFSVNTWAVGDLVQVVNSISDGDVNALVDEYESCYTMTPATQIHGEKRQNVLEAARIELGMKRFLEQGGFHAFTTTFEDLHGLKQLPGLAVQRLMQQGYGFAGEGDWKTAALLRIMKVMSTGLQGGTSFMEDYTYHFEKGNDLVLGSHMLEVCPSIAVEEKPILDVQHLGIGGKDDPARLIFNTQTGPAIVASLIDLGDRYRLLVNCIDTVKTPHSLPKLPVANALWKAQPDLPTASEAWILAGGAHHTVFSHALNLNDMRQFAEMHDIEITVIDNDTRLPAFKDALRWNEVYYGFRR >NZ_CP040886.1|WP_001371424.1|1306132_1307833_+|ribulokinase MAIAIGLDFGSDSVRALAVDCASGEEIATSVEWYPRWQKGQFCDAPNNQFRHHPRDYIESMEAALKTVLAELSVEQRAAVVGIGVDTTGSTPAPIDADGNVLALRPEFAENPNAMFVLWKDHTAVEEAEEITRLCHAPGNVDYSRYIGGIYSSEWFWAKILHVTRQDSAVAQSAASWIELCDWVPALLSGTTGPQDIRRGRCSAGHKSLWHESWGGLPPASFFDELDPILNRHLPSPLFTDTWTADIPVGTLCPEWAQRLGLPESVVISGGAFDCHMGAVGAGAQPNALVKVIGTSTCDILIADKQSVGERAVKGICGQVDGSVVPGFIGLEAGQSAFGDIYAWFGRVLGWPLEQLAAQHPELKAQINASQKQLLPALTEAWAKNPSLDHLPVVLDWFNGRRTPNANQRLKGVITDLNLATDAPLLFGGLIAATAFGARAIMECFTDQGIAVNNVMALGGIARKNQVIMQACCDVLNRPLQIVASDQCCALGAAIFAAVAAKVHADIPSAQQKMASAVEKTLQPCSEQAQRFEQLYRRYQQWAMSAEQHYLPTSAPAQAAQAVPTL >NZ_CP040886.1|WP_001300811.1|1304915_1305794_-|arabinose-operon-transcriptional-regulator-AraC MAEAQNDPLLPGYSFNAHLVAGLTPIEANGYLDFFIDRPLGMKGYILNLTIRGQGVVKNQGREFVCRPGDILLFPPGEIHHYGRHPEAREWYHQWVYFRPRAYWHEWLNWPSIFANTGFFRPDEAHQPHFSDLFGQIINAGQGEGRYSELLAINLLEQLLLRRMEAINESLHPPMDNRVREACQYISDHLADSNFDIASVAQHVCLSPSRLSHLFRQQLGISVLSWREDQRISQAKLLLSTTRMPIATVGRNVGFDDQLYFSRVFKKCTGASPSEFRAGCEEKVNDVAVKLS >NZ_CP040886.1|WP_001148402.1|1304065_1304830_-|DedA-family-protein MQALLEHFITQSTVYSLMAVVLVAFLESLALVGLILPGTVLMAGLGALIGSGELSFWHAWLAGIVGCLLGDWISFWLGWRFKKPLHRWSFLKKNKALLDKTEHALHQHSMFTILVGRFVGPTRPLVPMVAGMLDLPVAKFITPNIIGCLLWPPFYFLPGILAGAAIDIPAGMQSGEFKWLLLATAVFLWVGGWLCWRLWRSGKATDRLSHYLSRGRLLWLTPLISAIGVVALVVLIRHPLMPVYIDILRKVVGG 
>NZ_CP040886.1|WP_000916291.1|1303253_1303952_+|thiamine-ABC-transporter-ATP-binding-protein-ThiQ MLKLTDITWLYHHLPMRFSLTVERGEQVAILGPSGAGKSTLLNLIAGFLTPASGSLTIDGVDHTTTPPSRRPVSMLFQENNLFSHLTVAQNIGLGLNPGLKLNAAQQEKMHAIARQMGIDNLMARLPGELSGGQRQRVALARCLVREQPILLLDEPFSALDPALRQEMLTLVSTSCQQQKMTLLMVSHSVEDAARIATRSVVVADGRIAWQGKTNELLSGKASASALLGITG >NZ_CP040886.1|WP_000235700.1|1301659_1303270_+|thiamine/thiamine-pyrophosphate-ABC-transporter-permease-ThiP MATRRQPLIPGWLIPGVSAATLVVAVALAAFLALWWNAPQGNWVAVWQDSYLWHVVRFSFWQAFLSALLSVVPAIFLARALYRRRFPGRLALLRLCAMTLILPVLVAVFGILSVYGRQGWLASLCQSLGLEWTFSPYGLQGILLAHVFFNLPMASRLLLQALENIPGEQRQLAAQLGMRGWHFFRFVEWPWLRRQIPPVAALIFMLCFASFATVLSLGGGPQATTIELAIYQALSYDYDPARAAMLALIQMVCCLGLVLLSQRLSKAIAPGTTLLQGWRDPDDRLHSRICDTVLIVLALLLLLPPLLAVIVDGVNRQLPEVLAQPVLWQALWTSLRIALAAGVLCVVLTMMLLWSSRELRARQKMLAGQALEMSGMLILAMPGIVLATGFFLLLNNTIGLPQSADGIVIFTNALMAIPYALKVLENPMRDITARYSMLCQSLGIEGWSRLKVVELRALKRPLAQALAFACVLSIGDFGVVALFGNDDFRTLPFYLYQQIGSYRSQDGAVTALILLLLCFLLFTVIEKLPGRNVKTD >NZ_CP040886.1|WP_001371422.1|1300700_1301684_+|thiamine-ABC-transporter-substrate-binding-subunit MLKKCLPLLLLCTAPVFAKPVLIVYTYDSFAADWGPGPKIKKAFEADCNCELKLVALEDGVSLLNRLRMEGKNSKADVVLGLDNNLLDAASKTGLFAKSGVAADAVNVPGGWNNDTFVPFDYGYFAFVYDKNKLKNPPQSLKELVESDQNWRVIYQDPRTSTPGLGLLLWMQKVYGDDAPQAWQKLAKKTVTVTKGWSEAYGLFLKGESDLVLSYTTSPAYHILEEKKDNYAAANFSEGHYLQVEVAARTAASKQPELAQKFLQFMVSPAFQNAIPTGNWMYPVANVTLPAGFEQLTKPATTLEFTPAEVAAQRQAWISEWQRAVSR >NZ_CP040886.1|WP_001297366.1|1298881_1300537_+|HTH-type-transcriptional-regulator-SgrR 
MPSARLQQQFIRLWQCCEGKSQDTTLNELAALLSCSRRHMRTLLNTMQDRGWLTWEAEVGRGKRSRLTFLYTGLALQQQRAEDLLEQDRIDQLVQLVGDKATVRQMLVSHLGRSFRQGRHILRVLYYRPLRNLLPGSALRRSETHIARQIFSSLTRINEENGELEADIAHHWQQISPLHWRFFLRPGVHFHHGRELEMDDVIASLKRINTLPLYSHIADIVSPTPWTLDIHLTQPDRWLPLLLGQVPAMILPREWETLSNFASHPIGTGPYAVIRNTTNQLKIQAFDDFFGYRALIDEVNVWVLPEIADEPAGGLMLKGPQGEEKEIESRLEEGCYYLLFDSRTHRGANQQVRDWVSYVLSPTNLVYFAEEQYQQLWFPAYGLLPRWHHARTIKSEKPAGLESLTLTFYQDHSEHRVIAGIMQQILASHQVTLEIKEISYDQWHEGEIESDIWLNSANFTLPLDFSLFAHLCEVPLLQHCIPIDWQADAARWRNGEMNLANWCQQLVASKAMVPLIHHWLIIQGQRSMRGLRMNTLGWFDFKSAWFAPPDP >NZ_CP040886.1|WP_001248770.1|1298661_1298793_-|glucose-uptake-inhibitor-SgrT MRQFYQHYFTATAKLCWLRWLSVPQRLTMLEGLMQWDDRNSES >NZ_CP040886.1|WP_000637846.1|1297381_1298560_-|sugar-efflux-transporter-SetA MIWIMTMARRMNGVYAAFMLVAFMMGVAGALQAPTLSLFLSREVGAQPFWIGLFYTVNAIAGIGVSLWLAKRSDSQGDRRKLIIFCCLMAIGNALLFAFNRHYLTLITCGVLLASLANTAMPQLFALAREYADNSAREVVMFSSVMRAQLSLAWVIGPPLAFMLALNYGFTVMFSIAAGIFTLSLVLIAFMLPSVARVELPSENALSMQGGWQDSNVRMLFVASTLMWTCNTMYIIDMPLWISSELGLPDKLAGFLMGTAAGLEIPAMILAGYYVKRYGKRRMMVIAVAAGVLFYTGLIFFHSRMALMTLQLFNAVFIGIVAGIGMLWFQDLMPGRAGAATTLFTNSISTGVILAGVIQGAIAQSWGHFAVYWVIAVISVVALFLTAKVKDV >NZ_CP040886.1|WP_000888642.1|1309545_1310241_+|L-ribulose-5-phosphate-4-epimerase MLEDLKRLVLEANLALPKHNLVTLTWGNVSAVDRERGVFVIKPSGVDYSVMTADDMVVVSIATGEVVEGTKKPSSDTPTHRLLYQAFPSIGGIVHTHSRHATIWAQAGQSIPATGTTHADYFYGTIPCTRKMTDAEINGEYEWETGNVIVETFEKQGIDAAQMPGVLVHSHGPFAWGKNAEDAVHNAIVLEEVAYMGIFCRQLAPQLPDMQQTLLDKHYLRKHGAKAYYGQ >NZ_CP040886.1|WP_000035637.1|1310315_1312667_+|DNA-polymerase-II 
MAQAGFILTRHWRDTPQGTEVSFWLATDNGPLQVTLAPQESVAFIPADQVPRAQHILQGEQGFRLTPLALKDFHRQPVYGLYCRAHRQLMNYEKRLREGGVTVYEADVRPPERYLMERFITSPVWVEGDIRNGAIVNARLKPHPDYRPPLKWVSIDIETTRHGELYCIGLEGCGQRIVYMLGPENGDASALDFELEYVASRPQLLEKLNAWFANYDPDVIIGWNVVQFDLRMLQKHAERYRIPLRLGRDNSELEWREHGFKNGVFFAQAKGRLIIDGIEALKSAFWNFSSFSLETVAQELLGEGKSIDNPWDRMDEIDRRFAEDKPALATYNLKDCELVTQIFHKTEIMPFLLERATVNGLPVDRHGGSVAAFGHLYFPRMHRAGYVAPNLGEVPPHASPGGYVMDSRPGLYDSVLVLDYKSLYPSIIRTFLIDPVGLVEGMAQPDPEHSTEGFLDAWFSREKHCLPEIVTNIWHGRDEAKRQGNKPLSQALKIIMNAFYGVLGTTACRFFDPRLASSITMRGHQIMRQTKALIEAQGYDVIYGDTDSTFVWLKGAHSEEEAAKIGRALVQHVNAWWAETLQKQRLTSALELEYETHFCRFLMPTIRGADTGSKKRYAGLIQEGDKQRMVFKGLETVRTDWTPLAQQFQQELYLRIFRNEPYQEYIRETIDKLMAGELDARLVYRKRLRRPLSEYQRNVPPHVRAARLADEENQKRGRPLQYQNRGTIKYVWTTNGPEPLDYQRSPLDYEHYLTRQLQPVAEGILPFIEDNFATLMTGQLGLF >NZ_CP040886.1|WP_001117011.1|1312831_1315738_+|RNA-polymerase-associated-protein-RapA MPFTLGQRWISDTESELGLGTVVAVDARTVTLLFPSTGENRLYARSDSPVTRVMFNPGDTITSHDGWQMQVEEVKEENGLLTYIGTRLDTEESGVALREVFLDSKLVFSKPQDRLFAGQIDRMDRFALRYRARKYSSEQFRMPYSGLRGQRTSLIPHQLNIAHDVGRRHAPRVLLADEVGLGKTIEAGMILHQQLLSGAAERVLIIVPETLQHQWLVEMLRRFNLRFALFDDERYAEAQHDAYNPFDTEQLVICSLDFARRSKQRLEHLCEAEWDLLVVDEAHHLVWSEDAPSREYQAIEQLAEHVPGVLLLTATPEQLGMESHFARLRLLDPNRFHDFAQFVEEQKNYRPVADAVAMLLAGNKLSNDELNMLGEMIGEQDIEPLLQAANSDSEDAQSARQELVSMLMDRHGTSRVLFRNTRNGVKGFPKRELHTIKLPLPTQYQTAIKVSGIMGARKSAEDRARDMLYPERIYQEFEGDNATWWNFDPRVEWLMGYLTSHRSQKVLVICAKAATALQLEQVLREREGIRAAVFHEGMSIIERDRAAAWFAEEDTGAQVLLCSEIGSEGRNFQFASHMVMFDLPFNPDLLEQRIGRLDRIGQAHDIQIHVPYLEKTAQSVLVRWYHEGLDAFEHTCPTGRTIYDSVYNDLINYLASPDQTEGFDDLIKNCREQHEALKAQLEQGRDRLLEIHSNGGEKAQALAESIEEQDDDTNLIAFAMNLFDIIGINQDDRGDNMIVLTPSDHMLVPDFPGLSEDGITITFDREVALAREDAQFITWEHPLIRNGLDLILSGDTGSSTISLLKNKALPVGTLLVELIYVVEAQAPKQLQLNRFLPPTPVRMLLDKNGNNLAAQVEFETFNRQLNAVNRHTGSKLVNAVQQDVHAILQLGEAQIEKSARALIDAARNEADEKLSAELSRLEALRAVNPNIRDDELTAIESNRQQVMESLDQAGWRLDALRLIVVTHQ >NZ_CP040886.1|WP_000525176.1|1315749_1316409_+|bifunctional-tRNA-pseudouridine(32)-synthase/23S-rRNA-pseudouridine(746)-synthase-RluA 
MGMENYNPPQEPWLVILYQDDHIMVVNKPSGLLSVPGRLEEHKDSVMTRIQRDYPQAESVHRLDMATSGVIVVALTKAAERELKRQFREREPKKQYVARVWGHPSPAEGLVDLPLICDWPNRPKQKVCYETGKPAQTEYEVVEYAADNTARVVLKPITGRSHQLRVHMLALGHPILGDRFYASPEARAMAPRLLLHAEMLTITHPAYGNSMTFKAPADF >NZ_CP040886.1|WP_001200579.1|1316525_1317341_-|co-chaperone-DjlA MQYWGKIIGVAVALLMGGGFWGVVLGLLIGHMFDKARSRKMAWFANQRERQALFFATTFEVMGHLTKSKGRVTEADIHIASQLMDRMNLHGASRTAAQNAFRVGKSDNYPLREKMRQFRSVCFGRFDLIRMFLEIQIQAAFADGSLHPNERAVLYVIAEELGISRAQFDQFLRMMQGGAQFGGGYQQQTGGGNWQQAQRGPTLEDACNVLGVKPTDDATTIKRAYRKLMSEHHPDKLVAKGLPPEMMEMAKQKAQEIQQAYELIKQQKGFK >NZ_CP040886.1|WP_000746150.1|1317595_1319950_+|LPS-assembly-protein-LptD MKKRIPTLLATMIATALYSQQGLAADLASQCMLGVPSYDRPLVQGDTNDLPVTINADHAKGDYPDDAVFTGSVDIMQGNSRLQADEVQLHQKEAPGQPEPVRTVDALGNVHYDDNQVILKGPKGWANLNTKDTNVWEGDYQMVGRQGRGKADLMKQRGENRYTILDNGSFTSCLPGSDTWSVVGSEIIHDREEQVAEIWNARFKVGPVPIFYSPYLQLPVGDKRRSGFLIPNAKYTTTNYFEFYLPYYWNIAPNMDATITPHYMHRRGNIMWENEFRYLSQAGAGLMELDYLPSDKVYEDEHPNDDSSRRWLFYWNHSGVMDQVWRFNVDYTKVSDPSYFNDFDNKYGSSTDGYATQKFSVGYAVQNFNATVSTKQFQVFSEQNTSSYSAEPQLDVNYYQNDVGPFDTRIYGQAVHFVNTRDDMPEATRVHLEPTINLPLSNNWGSINTEAKLLATHYQQTNLDWYNSRNTTKLDESVNRVMPQFKVDGKMVFERDMEMLAPGYTQTLEPRAQYLYVPYRDQSDIYNYDSSLLQSDYSGLFRDRTYGGLDRIASANQVTTGVTSRIYDDAAVERFNISVGQIYYFTESRTGDDNITWENDDKTGSLVWAGDTYWRISERWGLRGGIQYDTRLDNVATSNSSIEYRRDEDRLVQLNYRYASPEYIQATLPKYYSTAEQYKNGISQVGAVASWPIADRWSIVGAYYYDTNANKQADSMLGVQYSSCCYAIRVGYERKLNGWDNDKQHAVYDNAIGFNIELRGLSSNYGLGTQEMLRSNILPYQNSL >NZ_CP040886.1|WP_000800453.1|1320002_1321289_+|peptidylprolyl-isomerase-SurA MKNWKTLLLGIAMIANTSFAAPQVVDKVAAVVNNGVVLESDVDGLMQSVKLNAAQARQQLPDDATLRHQIMERLIMDQIILQMGQKMGVKISDEQLDQAIANIAKQNNMTLDQMRSRLAYDGLNYNTYRNQIRKEMIISEVRNNEVRRRITILPQEVESLAQQVGNQNDASTELNLSHILIPLPENPTSDQVNEAESQARAIVDQARNGADFGKLAIAHSADQQALNGGQMGWGRIQELPGIFAQALSTAKKGDIVGPIRSGVGFHILKVNDLRGESKNISVTEVHARHILLKPSPIMTDEQARVKLEQIAADIKSGKTTFAAAAKEFSQDPGSANQGGDLGWATADIFDPAFRDALTRLNKGQMSAPVHSSFGWHLIELLDTRNVDKTDAAQKDRAYRMLMNRKFSEEAASWMQEQRASAYVKILSN 
>NZ_CP040886.1|WP_000241271.1|1321288_1322278_+|4-hydroxythreonine-4-phosphate-dehydrogenase-PdxA MVKTQRVVITPGEPAGIGPDLVVQLAQREWPVELVVCADATLLTDRAAMLGLPLTLRTYSPNSPAQPQTAGTLTLLPVALRESVTAGQLAVENGHYVVETLARACDGCLNGEFAALITGPVHKGVINDAGIPFTGHTEFFEERSQAKKVVMMLATEELRVALATTHLPLRDIADAITPALLHEVIAILHHDLRTKFGIAEPRILVCGLNPHAGEGGHMGTEEIDTIIPLLDELRAQGMKLNGPLPADTLFQPKYLDNADAVLAMYHDQGLPVLKYQGFGRGVNITLGLPFIRTSVDHGTALELAGRGEADVGSFITALNLAIKMIVNTQ >NZ_CP040886.1|WP_001065381.1|1322274_1323096_+|16S-rRNA-(adenine(1518)-N(6)/adenine(1519)-N(6))--dimethyltransferase-RsmA MNNRVHQGHLARKRFGQNFLNDQFVIDSIVSAINPQKGQAMVEIGPGLAALTEPVGERLDQLTVIELDRDLAARLQTHPFLGPKLTIYQQDAMTFNFGELAEKMGQPLRVFGNLPYNISTPLMFHLFSYTDAIADMHFMLQKEVVNRLVAGPNSKAYGRLSVMAQYYCNVIPVLEVPPSAFTPPPKVDSAVVRLVPHATMPHPVKDVRVLSRITTEAFNQRRKTIRNSLGNLFSVEVLTGMGIDPAMRAENISVAQYCQMANYLAENAPLQES >NZ_CP040886.1|WP_000610901.1|1323098_1323476_+|Co2+/Mg2+-efflux-protein-ApaG MINSPRVCIQVQSVYIEAQSSPDNERYVFAYTVTIRNLGRAPVQLLGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMIDENGVPFSIDIPVFRLAVPTLIH |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP040886_5 | 1332394-1332526 | Orphan |
NA
Consensus repeat of NZ_CP040886_5
|
2 spacers
spacers of NZ_CP040886_5
>5.1|1332411|42|NZ_CP040886|PILER-CR TGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTC >5.2|1332470|40|NZ_CP040886|PILER-CR CATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGG |
CRISPR arrays and Neighbor proteins around NZ_CP040886_5
The CRISPR arrays of NZ_CP040886_5 >merge|NZ_CP040886|5|1332394-1332526|PILER-CR ATCACCAATATTGAAAATGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTCCTCACCAATATTGAAAACATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGGATCACCAATATTGAAAG >NZ_CP040886|5|1|1332394-1332526|PILER-CR ATCACCAATATTGAAAA TGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTC CTCACCAATATTGAAAA CATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGG ATCACCAATATTGAAAG
>NZ_CP040886.1|WP_000692204.1|1331532_1332303_-|electron-transfer-flavoprotein-FixA MKIITCYKCVPDEQDIAVNNADGSLDFSKADAKISQYDLNAIEAACQLKQQAAEAQVTALSVGGKALTNAKGRKDVLSRGPDELIVVIDDQFEQALPQQTASALAAAAQKAGFDLILCGDGSSDLYAQQVGLLVGEILNIPAVNGVSKIISLTADTLTVERELEDETETLSIPLPAVVAVSTDINSPQIPSMKAILGAAKKPVQVWSAADIGFNAEAAWSEQQVAAPKQRERQRIVIEGDGEEQIAAFAENLRKVI >NZ_CP040886.1|WP_001091499.1|1330576_1331518_-|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNTFSQVWVFSDTPSRLPELMNGAQALANQINTFVLNDADGAQAIQLGANHVWKLNGKPDDRMIEDYAGVMADTIRQHGADGLVLLPNTRRGKLLAAKLGYRLKAAVSNDASTVSVQDGKATVKHMVYGGLAIGEERIATPYAVLTISSGTFDAAQPDASRTGETHTVEWQAPAVAITRTATQARQSNSVDLDKARLVVSVGRGIGSKENIALAEQLCKAIGAELACSRPVAENEKWMEHERYVGISNLMLKPELYLAVGISGQIQHMVGANASQTIFAINKDKNAPIFQYADYGIVGDAVKILPALTAALAR >NZ_CP040886.1|WP_001287715.1|1329239_1330526_-|FAD-dependent-oxidoreductase MSEDIFDAIIVGAGLAGSVAALVLAREGAQVLVIERGNSAGAKNVTGGRLYAHSLEHIIPGFADSAPVERLITHEKLAFMTEKSAMTMDYCNGDETSPSQRSYSVLRSKFDAWLMEQAEEAGAQLITGIRVDNLVQRDGKVVGVEADGDVIEAKTVILADGVNSILAEKLGMAKRVKPTDVAVGVKELIELPKSVIEDRFQLQGNQGAACLFAGSPTDGLMGGGFLYTNENTLSLGLVCGLHHLHDAKKSVPQMLEDFKQHPAVAPLIAGGKLVEYSAHVVPEAGINMLPELVGDGVLIAGDAAGMCMNLGFTIRGMDLAIAAGEAAAKTVLSAMKSDDFSKQKLAEYRQHLESGPLRDMRMYQKLPAFLDNPRMFSGYPELAVGVARDLFTIDGSAPELMRKKILRHGKKVGFINLIKDGMKGVTVL >NZ_CP040886.1|WP_000203747.1|1328955_1329243_-|ferredoxin-like-protein-FixX MTSPVNVDVKLGVNKFNVDEEHPHIVVKADADKQVLELLVKACPAGLYKKQDDGSVRFDYAGCLECGTCRILGLGSALEQWEYPRGTFGVEFRYG >NZ_CP040886.1|WP_001183198.1|1327566_1328898_-|MFS-transporter MQPSRNFDDLKFSSIHRRILLWGSGGPFLDGYVLVMIGVALEQLTPALKLDADWIGLLGAGTLAGLFVGTSLFGYISDKVGRRKMFLIDIIAIGVISVATMFVSSPVELLVMRVLIGIVIGADYPIATSMITEFSSTRQRAFSISFIAAMWYVGATCADLVGYWLYDVEGGWRWMLGSAAIPCLLILIGRFELPESPRWLLRKGRVKECEEMMIKLFGEPVAFDEEQPQQTRFRDLFNRRHFPFVLFVAAIWTCQVIPMFAIYTFGPQIVGLLGLGVGKNAALGNVVISLFFMLGCIPPMLWLNTAGRRPLLIGSFAMMTLALAVLGLIPDMGIWLVVMAFAVYAFFSGGPGNLQWLYPNELFPTDIRASAVGVIMSLSRIGTIVSTWALPIFINNYGISNTMLMGAGISLFGLLISVAFAPETRGMSLAQTSNMTIRGQRMG 
>NZ_CP040886.1|WP_000600725.1|1326928_1327459_-|glutathione-regulated-potassium-efflux-system-oxidoreductase-KefF MILIIYAHPYPHHSHANKRMLEQARTLEGVEIRSLYQLYPDFNIDIAAEQEALSRADLIVWQHPMQWYSIPPLLKLWIDKVFSHGWAYGHGGTALHGKHLLWAVTTGGGESHFEIGAHPGFDVLSQPLQATAIYCGLNWLPPFAMHCTFICDDETLEGQARHYKQRLLEWQEAHHG >NZ_CP040886.1|WP_000377129.1|1325073_1326936_-|glutathione-regulated-potassium-efflux-system-protein-KefC MDSHTLIQALIYLGSAALIVPIAVRLGLGSVLGYLIAGCIIGPWGLRLVTDAESILHFAEIGVVLMLFIIGLELDPQRLWKLRAAVFGGGALQMVICGGLLGLFCMLLGLRWQVAELIGMTLALSSTAIAMQAMNERNLMVTQMGRSAFAVLLFQDIAAIPLVAMIPLLATSSASTTMGAFALSALKVAGALVLVVLLGRYVTRPALRFVARSGLREVFSAVALFLVFGFGLLLEEVGLSMAMGAFLAGVLLASSEYRHALESDIEPFKGLLLGLFFIGVGMSIDFGTLLENPLRIVILLLGFLIIKIAMLWLIARPLQVPNKQRRWFAVLLGQGSEFAFVVFGAAQMANVLEPEWAKSLTLAVALSMAATPILLVILNRLEQSSTEEAREADEIDEEQPRVIIAGFGRFGQITGRLLLSSGVKMVVLDHDPDHIETLRKFGMKVFYGDATRMDLLESAGAAKAEVLINAIDDPQTNLQLTEMVKEHFPHLQIIARARDVDHYIRLRQAGVEKPERETFEGALKTGRLALESLGLGPYEARERADVFRRFNIQMVEEMAMVENDTKARAAVYKRTSAMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKPSS >NZ_CP040886.1|WP_000624375.1|1324402_1324882_-|type-3-dihydrofolate-reductase MISLIAALAVDRVIGMENAMPWNLPADLAWFKRNTLNKPVIMGRHTWESIGRPLPGRKNIILSSQPGTDDRVTWVKSVDEAIAACGDVPEIMVIGGGRVYEQFLPKAQKLYLTHIDAEVEGDTHFPDYEPDDWESVFSEFHDADAQNSHSYCFEILERR >NZ_CP040886.1|WP_000257192.1|1323482_1324325_+|bis(5'-nucleosyl)-tetraphosphatase-(symmetrical) MATYLIGDVHGCYDELIALLHKVEFTPGKDTLWLTGDLVARGPGSLDVLRYVKSLGDSVRLVLGNHDLHLLAVFAGISRNKPKDRLTPLLEAPDADELLNWLRRQPLLQIDEEKKLVMAHAGITPQWDLQTAKECARDVEAVLSSDSYPFFLDAMYGDMPNNWSPELRGLGRLRFITNAFTRMRFCFPNGQLDMYSKESPEEAPAPLKPWFAIPGPVAEEYSIAFGHWASLEGKGTPEGIYALDTGCCWGGTLTCLRWEDKQYFVQPSNRHKDLGEAAAS >NZ_CP040886.1|WP_000610901.1|1323098_1323476_+|Co2+/Mg2+-efflux-protein-ApaG MINSPRVCIQVQSVYIEAQSSPDNERYVFAYTVTIRNLGRAPVQLLGRYWLITNGNGRETEVQGEGVVGVQPLIAPGEEYQYTSGAIIETPLGTMQGHYEMIDENGVPFSIDIPVFRLAVPTLIH >NZ_CP040886.1|WP_000787103.1|1332776_1334291_+|L-carnitine/gamma-butyrobetaine-antiport-BCCT-transporter 
MKNEKRKTGIEPKVFFPPLIIVGILCWLTVRDLDAANVVINAVFSYVTNVWGWAFEWYMVVMLFGWFWLVFGPYAKKRLGNEPPEFSTASWIFMMFASCTSAAVLFWGSIEIYYYISTPPFGLEPNSTGAKELGLAYSLFHWGPLPWATYSFLSVAFAYFFFVRKMEVIRPSSTLVPLVGEKHAKGLFGTIVDNFYLVALIFAMGTSLGLATPLVTECMQWLFGIPHTLQLDAIIITCWIILNAICVACGLQKGVRIASDVRSYLSFLMLGWVFIVSGASFIMNYFTDSVGMLLMYLPRMLFYTDPIAKGGFPQGWTVFYWAWWVIYAIQMSIFLARISRGRTVRELCFGMVLGLTASTWILWTVLGSNTLLLIDKNIINIPNLIEQYGVARAIIETWAALPLSTATMWGFFILCFIATVTLVNACSYTLAMSTCREVRDGEEPPLLVRIGWSILVGIIGIVLLALGGLKPIQTAIIAGGCPLFFVNIMVTLSFIKDAKQNWKD >NZ_CP040886.1|WP_000347117.1|1334321_1335464_+|crotonobetainyl-CoA-dehydrogenase MDFNLNDEQELFVAGIRELMASENWEAYFAECDRDSVYPERFVKALADMGIDSLLIPEEHGGLDAGFVTLAAVWMELGRLGAPTYVLYQLPGGFNTFLREGTQEQIDKIMAFRGTGKQMWNSAITEPGAGSDVGSLKTTYTRRNGKIYLNGSKCFITSSAYTPYIVVMARDGASPDKPVYTEWFVDMSKPGIKVTKLEKLGLRMDSCCEITFDDVELDEKDMFGREGNGFNRVKEEFDHERFLVALTNYGTAMCAFEDAARYANQRVQFGEAIGRFQLIQEKFAHMAIKLNSMKNMLYEAAWKADNGTITSGDAAMCKYFCANAAFEVVDSAMQVLGGVGIAGNHRISRFWRDLRVDRVSGGSDEMQILTLGRAVLKQYR >NZ_CP040886.1|WP_000349926.1|1335592_1336810_+|L-carnitine-CoA-transferase MDHLPMPKFGPLAGLRVVFSGIEIAGPFAGQMFAEWGAEVIWIENVAWADTIRVQPNYPQLSRRNLHALSLNIFKDEGREAFLKLMETTDIFIEASKGPAFARRGITDEVLWQHNPKLVIAHLSGFGQYGTEEYTNLPAYNTIAQAFSGYLIQNGDVDQPMPAFPYTADYFSGLTATTAALAALHKARETGKGESIDIAMYEVMLRMGQYFMMDYFNGGEMCPRMSKGKDPYYAGCGLYKCADGYIVMELVGITQIEECFKDIGLAHLLSTPEIPEGTQLIHRIECPYGPLVEEKLDAWLAAHTIAEVKERFAELNIACAKVLTVPELESNPQYVARESITQWQTMDGRTCKGPNIMPKFKNNPGQIWRGMPSHGMDTAAILKNIGYSENDIQELVSKGLAKVED >NZ_CP040886.1|WP_000351348.1|1336883_1338437_+|crotonobetaine/carnitine-CoA-ligase 
MDIIGGQHLRQMWDDLADVYGHKTALICESSGGVVNRYSYLELNQEINRTANLFYTLGIRKGDKVALHLDNCPEFIFCWFGLAKIGAIMVPINARLLREESAWILQNSQACLLVTSAQFYPMYQQIQQEDATQLRHICLTDVALPADDGVSSFTQLKNQQPATLCYAPPLLTDDTAEILFTSGTTSRPKGVVITHYNLRFAGYYSAWQCALRDDDVYLTVMPAFHIDCQCTAAMAAFSAGATFVLVEKYSARAFWGQVQKYRATITECIPMMIRTLMVQPPSANDRQHRLREVMFYLNLSEQEKDAFCERFGVRLLTSYGMTETIVGIIGDRPGDKRRWPSIGRAGFCYEAEIRDDHNRPLPAGEIGEICIKGVPGKTIFKEYFLNPKATAKVLEADGWLHTGDTGYCDEEGFFYFVDRRCNMIKRGGENVSCVELENIIATHPKIQDIVVVGIKDSIRDEAIKAFVVLNEGETLSEEEFFRFCEQNMAKFKVPSYLEIRKDLPRNCSGKIIRKNLK >NZ_CP040886.1|WP_000004404.1|1338545_1339331_+|crotonobetainyl-CoA-hydratase MSESLHLTRNGSILEITLDRPKANAIDAKTSFEMGEVFLNFRDDPQLRVAIITGAGEKFFSAGWDLKAAAEGEAPDADFGPGGFAGLTEIFNLDKPVIAAVNGYAFGGGFELALAADFIVCADNASFALPEAKLGIVPDSGGVLRLPKILPPAIVNEMVMTGRRMGTEEALRWGIVNRVVSQAELMDNARELAQQLVNSAPLAIAALKEIYRTTSEMPVEEAYRYIRSGVLKHYPSVLHSEDAVEGPLAFAEKRDPVWKGR >NZ_CP040886.1|WP_000122876.1|1339336_1339927_+|carnitine-operon-protein-CaiE MSYYAFEGLIPVVHPTAFVHPSAVLIGDVIVGAGVYIGPLASLRGDYGRLIVQAGANIQDGCIMHGYCDTDTIVGENGHIGHGAILHGCVIGRDALVGMNSVIMDGAVIGEESIVAAMSFVKAGFHGEKRQLLMGTPARAVRSVSDDELHWKRLNTKEYQDLVGRCHASLHETQPLRQMEENRPRLQGTTDVTPKR >NZ_CP040886.1|WP_000333120.1|1340012_1340408_-|carnitine-metabolism-transcriptional-regulator-CaiF MCEGYVEKPLYLLIAEWMMAENRWVIAREISIHFDIEHSKAVNTLTYILSEVAEISCEVKMIPNKLEGRGCQCQRLVKVVDIDEQIYARLRNNSRDKLVGVRKTPRIPAVPLTELNREQKWQMMLSKSMRR >NZ_CP040886.1|WP_001126376.1|1340668_1343890_-|carbamoyl-phosphate-synthase-large-subunit 
MPKRTDIKSILILGAGPIVIGQACEFDYSGAQACKALREEGYRVILVNSNPATIMTDPEMADATYIEPIHWEVVRKIIEKERPDAVLPTMGGQTALNCALELERQGVLEEFGVTMIGATADAIDKAEDRRRFDVAMKKIGLETARSGIAHTMEEALAVAADVGFPCIIRPSFTMGGSGGGIAYNREEFEEICARGLDLSPTKELLIDESLIGWKEYEMEVVRDKNDNCIIVCSIENFDAMGIHTGDSITVAPAQTLTDKEYQIMRNASMAVLREIGVETGGSNVQFAVNPKNGRLIVIEMNPRVSRSSALASKATGFPIAKVAAKLAVGYTLDELMNDITGGRTPASFEPSIDYVVTKIPRFNFEKFAGANDRLTTQMKSVGEVMAIGRTQQESLQKALRGLEVGATGFDPKVSLDDPEALTKIRRELKDAGAERIWYIADAFRAGLSVDGVFNLTNIDRWFLVQIEELVRLEEKVAEVGITGLNAEFLRQLKRKGFADARLAKLAGVREAEIRKLRDQYDLHPVYKRVDTCAAEFATDTAYMYSTYEEECEANPSTDREKIMVLGGGPNRIGQGIEFDYCCVHASLALREDGYETIMVNCNPETVSTDYDTSDRLYFEPVTLEDVLEIVRIEKPKGVIVQYGGQTPLKLARALEAAGVPVIGTSPDAIDRAEDRERFQHAVERLKLKQPANATVTAIEMAVEKAKEIGYPLVVRPSYVLGGRAMEIVYDEADLRRYFQTAVSVSNDAPVLLDHFLDDAVEVDVDAICDGEMVLIGGIMEHIEQAGVHSGDSACSLPAYTLSQEIQDVMRQQVQKLAFELQVRGLMNVQFAVKNNEVYLIEVNPRAARTVPFVSKATGVPLAKVAARVMAGKSLAEQGVTKEVIPPYYSVKEVVLPFNKFPGVDPLLGPEMRSTGEVMGVGRTFAEAFAKAQLGSNSTMKKHGRALLSVREGDKERVVDLAAKLLKQGFELDATHGTAIVLGEAGINPRLVNKVHEGRPHIQDRIKNGEYTYIINTTSGRRAIEDSRVIRRSALQYKVHYDTTLNGGFATAMALNADATEKVISVQEMHAQIK >NZ_CP040886.1|WP_000597260.1|1343907_1345056_-|glutamine-hydrolyzing-carbamoyl-phosphate-synthase-small-subunit MIKSALLVLEDGTQFHGRAIGATGSAVGEVVFNTSMTGYQEILTDPSYSRQIVTLTYPHIGNVGTNDADEESSQVHAQGLVIRDLPLIASNFRNTEDLSSYLKRHNIVAIADIDTRKLTRLLREKGAQNGCIIAGDNPDAALALEKARAFPGLNGMDLAKEVTTAEAYSWTQGSWTLTGGLPEAKKEDELPFHVVAYDFGAKRNILRMLVDRGCRLTIVPAQTSAEDVLKMNPDGIFLSNGPGDPAPCDYAITAIQKFLETDIPVFGICLGHQLLALASGAKTVKMKFGHHGGNHPVKDVEKNVVMITAQNHGFAVDEATLPANLRVTHKSLFDGTLQGIHRTDKPAFSFQGHPEASPGPHDAAPLFDHFIELIEQYRKTAK >NZ_CP040886.1|WP_000543597.1|1345511_1346333_-|4-hydroxy-tetrahydrodipicolinate-reductase MHDANIRVAIAGAGGRMGRQLIQAALALEGVQLGAALEREGSSLLGSDAGELAGAGKTGVTVQSSLDAVKDDFDVFIDFTRPEGTLNHLAFCRQHGKGMVIGTTGFDEAGKQAIRDAAADIAIVFAANFSVGVNVMLKLLEKAAKVMGDYTDIEIIEAHHRHKVDAPSGTALAMGEAIAHALDKDLKDCAVYSREGHTGERVPGTIGFATVRAGDIVGEHTAMFADIGERLEITHKASSRMTFANGAVRSALWLSGKEGGLFDMRDVLDLNSL |
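The coordinates reported for NZ_CP040886_5 can be cross-checked against its repeat/spacer layout. The sketch below is illustrative only: it assumes the spacer headers follow the pattern `id|start|length|contig|tool` and that positions are 1-based and inclusive (both inferred from the records above, not documented here). The helper name `spacer_coordinates` is hypothetical.

```python
def spacer_coordinates(array_start, segments):
    """Return ([(start, length), ...] for each spacer, array_end).

    `segments` alternates repeat, spacer, repeat, ... so spacers sit at
    odd indices. Positions are treated as 1-based and inclusive, which is
    an assumption inferred from the records, not a documented convention.
    """
    spacers = []
    pos = array_start
    for i, seg in enumerate(segments):
        if i % 2 == 1:  # odd index: this segment is a spacer
            spacers.append((pos, len(seg)))
        pos += len(seg)
    return spacers, pos - 1  # last occupied position of the array


# Segments of NZ_CP040886_5 as listed in the PILER-CR record above.
segments = [
    "ATCACCAATATTGAAAA",
    "TGTCACACGCAGATAAATCCAACTTTCAATATTGTTAAGTTC",
    "CTCACCAATATTGAAAA",
    "CATGGCGTAGCAAAAAGAAATTTTCAATATTGCTTTATGG",
    "ATCACCAATATTGAAAG",
]

spacers, array_end = spacer_coordinates(1332394, segments)
print(spacers, array_end)
```

Under these assumptions the computed spacer starts and lengths agree with the headers `5.1|1332411|42` and `5.2|1332470|40`, and the computed end agrees with the reported location 1332394-1332526.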
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP040886_6 | 2926033-2926172 | Orphan |
NA
Consensus repeat of NZ_CP040886_6
|
1 spacer
spacers of NZ_CP040886_6
>6.1|2926082|42|NZ_CP040886|CRISPRCasFinder ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT |
CRISPR arrays and Neighbor proteins around NZ_CP040886_6
The CRISPR arrays of NZ_CP040886_6 >merge|NZ_CP040886|6|2926033-2926172|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCAACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGTTTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA >NZ_CP040886|6|5|2926033-2926172|CRISPRCasFinder TTTGTATCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA ACAGCAGTCGGATGCGGCGTAAACACCTTATCTGACCTACGT TTTGTGTCGTTGTAGGCCTGATAAGACGCGGCAAGCGTCGCATCAGGCA
>NZ_CP040886.1|WP_001375265.1|2924940_2925981_+|permease MTGQSSSQAATPIQWWKPALFFLVVIAGLWYVKWEPYYGKAFTAAETHSIGKSILAQADANPWQAALDYAMIYFLAVWKAAVLGVILGSLIQVLIPRDWLLRTLGQSRFRGTLLGTLFSLPGMMCTCCAAPVAAGMRRQQVSMGGALAFWMGNPVLNPATLVFMGFVLSWGFAAIRLVAGLVMVLLIATLVQKWVRETPQTQAPVEIDIPEAQGGFFSRWGRALWTLFWSTIPVYILAVLVLGAARVWLFPHADGTVDNSLMWVVAMAVAGCLFVIPTAAEIPIVQTMMLAGMGTAPALALLMTLPAVSLPSLIMLRKAFPAKALWLTGAMVAVSGVIVGGLALLF >NZ_CP040886.1|WP_001295551.1|2924232_2924868_+|NAD(P)H-binding-protein MSQVLITGATGLVGGHLLRMLINEPKVNAIAAPTRRPLGDMPGVFNPHDPQLTDALAQVTDPIDIVFCCLGTTRREAGSKEAFIHADYTLVVDTALTGRRLGAQHMLVVSAMGANAHSPFFYNRVKGEMEEALIAQNWPKLTIARPSMLLGDRSKQRMNETLFAPLFRLLPGNWKSIDARDVARVMLAESMRPEHEGVTILSSSELRKRAE >NZ_CP040886.1|WP_000037608.1|2923586_2924105_-|protein/nucleic-acid-deglycase MSKKIAVLITDEFEDSEFTSPADEFRKAGHEVITIEKQAGKTVKGKKGEASVTIDKSIDEVTPAEFDALLLPGGHSPDYLRGDNRFVTFTRDFVNSGKPVFAICHGPQLLISADVIRGRKLTAVKPIIIDVKNAGAEFYDQEVVVDKDQLVTSRTPDDLPAFNREALRLLGA >NZ_CP040886.1|WP_000449030.1|2923163_2923607_+|YhbP-family-protein METLIAISRWLAKQHVVTWCVQQEGELWCANAFYLFDAQKVAFYILTEEKTRHAQMSGPQAAVAGTVNGQPKTVALIRGVQFKGEIRRLEGEESDLARKAYNRRFPVARMLSAPVWEIRLDEIKFTDNTLGFGKKMIWLRDSGTEQA >NZ_CP040886.1|WP_000189314.1|2922810_2923113_-|DNA-damage-response-exodeoxyribonuclease-YhbQ MTPWFLYLIRTADNKLYTGITTDVERRYQQHQSGKGAKALRGKGELTLAFSAPVGDRSLALRAEYRVKQLTKRQKERLVAEGAGFAELLSSLQTPEIKSD >NZ_CP040886.1|WP_000908554.1|2922320_2922824_+|N-acetyltransferase MLIRVEIPIDAPGIDALLRRSFESDAEAKLVHDLREDGFLTLGLVATDDEGQVIGYVAFSPVDVQGEDLQWVGMAPLAVDEKYRGQGLARQLVYEGLDSLNEFGYAAVVTLGDPALYSRFGFELAAHHDLRCRWPGTESAFQVHRLADDALNGVTGLVEYHEHFNRF >NZ_CP040886.1|WP_001375267.1|2921802_2922327_+|SCP2-domain-containing-protein MLDKLRSRIVHLGPSLLSVPVKLTPFALKRQVLEQVLSWQFRQALDDGELEFLEGRWLSIHVRDIDLQWFTSVVNGKLVVSQNAQADVSFSADASDLLMIAARKQDPDTLFFQRRLVIEGDTELGLYVKNLMDAIELEQMPKALRMMLLQLADFVEAGMKNAPETKQTSVGEPC >NZ_CP040886.1|WP_000421305.1|2920598_2921594_-|U32-family-peptidase 
MELLCPAGNLPALKAAIENGADAVYIGLKDDTNARHFAGLNFTEKKLQEAVSFVHQHRRKLHIAINTFAHPDGYARWQRAVDMAAQLGADALILADLAMLEYAAERYPHIERHVSVQASATNEEAINFYHRHFDVARVVLPRVLSIHQVKQLARVTPVPLEVFAFGSLCIMSEGRCYLSSYLTGESPNTVGACSPARFVRWQQTPQGLESRLNEVLIDRYQDGENAGYPTLCKGRYLVDGERYHALEEPTSLNTLELLPELMAANIASVKIEGRQRSPAYVSQVAKVWRQAIDRCKADPQNFVPQSAWMETLGSMSEGTQTTLGAYHRKWQ >NZ_CP040886.1|WP_001301318.1|2919711_2920590_-|U32-family-peptidase MKYSLGPVLWYWPKETLEEFYQQAATSSADVIYLGEAVCSKRRATKVGDWLEMAKSLAGSGKQIVLSTLALVQASSELGELKRYVENGEFLIEASDLGVVNMCAERKLPFVAGHALNCYNAVTLKILLKQGMMRWCMPVELSRDWLVNLLNQCDELGIRNQFEVEVLSYGHLPLAYSARCFTARSEDRPKDECETCCIKYPNGRNVLSQENQQVFVLNGIQTMSGYVYNLGNELASMQGLVDVVRLSPQGTDTFAMLDAFRANENGAAPLPLTANSDCNGYWRRLAGLELQA >NZ_CP040886.1|WP_000130392.1|2918498_2919506_-|LLM-class-flavin-dependent-oxidoreductase MTDKTIAFSLLDLAPIPEGSSAREAFSHSLDLARLAEKRGYHRYWLAEHHNMTGIASAATSVLIGYLAANTTTLHLGSGGVMLPNHSPLVIAEQFGTLNTLYPGRIDLGLGRAPGSDQRTMMALRRHMSGDIDNFPRDVAELVDWFDARDPNPNVRPVPGYGEKIPVWLLGSSLYSAQLAAQLGLPFAFASHFAPDMLFQALHLYRSNFKPSARLEKPYAMVCINIIAADSNRDAEFLFTSMQQAFVKLRRGETGQLPPPIQNMDQFWSPSEQYGVQQALSMSLVGDKAKVRHGLQSILRETDADEIMVNGQIFDHQARLHSFELAMDVKEELLG >NZ_CP040886.1|WP_000646033.1|2926185_2926761_-|divisome-associated-lipoprotein-YraP MKALSPIAVLISALLLQGCVAAAVVGTAAVGTKAATDPRSVGTQVDDGTLEVRVNSALSKDEQIKKEARINVTAYQGKVLLVGQSPNAELSARAKQIAMGVDGANEVYNEIRQGQPIGLGEASNDTWITTKVRSQLLTSDLVKSSNVKVTTENGEVFLMGLVTEREAKAAADIASRVSGVKRVTTAFTFIK >NZ_CP040886.1|WP_001158034.1|2926770_2927361_-|DnaA-initiator-associating-protein-DiaA MQERIKACFTESIQTQIAAAEALPDAISRAAMTLVQSLLNGNKILCCGNGTSAANAQHFAASMINRFETERPSLPAIALNTDNVVLTAIANDRLHDEVYAKQVRALGHAGDVLLAISTRGNSRDIVKAVEAAVTRDMTIVALTGYDGGELAGLLGPQDVEIRIPSHRSARIQEMHMLTVNCLCDLIDNTLFPHQDD >NZ_CP040886.1|WP_000246837.1|2927380_2927776_-|YraN-family-protein MATVPTRSGSPRQLTTKQTGDAWEAQARRWLEGKGLRFIAANVNERGGEIDLIMREGRTTVFVEVRYRRSALYGGAAASVTRSKQHKLLQTARLWLARHNGSFDTVDCRFDVVAFTGNEVEWIKDAFNDHS >NZ_CP040886.1|WP_000249160.1|2927733_2929770_-|penicillin-binding-protein-activator 
MVPSTFSRLKAARCLPVVLAALIFAGCGTHTPDQSTAYMQGTAQADSAFYLQQMQQSSDDTRINWQLLAIRALVKEGKTGQAVELFNQLPQELNDSQRREKTLLAVEIKLAQKDFAGAQNLLAKITPADLEQNQQARYWQAKIDASQGRPSIDLLRALIAQEPLLGAKEKQQNIDATWQALSSMTQEQANTLVINADENILQGWLDLQRVWFDNRNDPDMMKAGIADWQKRYPNNPGAKMLPTQLVNVKAFKPASTNKIALLLPLNGQAAVFGRTIQQGFEAAKNIGTQPVAAQVAAAPAADVAEQPQPQTVDGVASPAQASVSDLTGEQPAAQPVPVSAPATSTAAVSAPANPSAELKIYDTSSQPLSQILSQVQQDGASIVVGPLLKNNVEELLKSNTPLNVLALNQPENIENRVNICYFALSPEDEARDAARHIRDQGKQAPLVLIPRSSLGDRVANAFAQEWQKLGGGTVLQQKFGSTSELRAGVNGGSGIALTGSPITPRATTDSGMTTNNPTLQTTPTDDQFTNNGGRVDAVYIVATPGEIAFIKPMIAMRNGSQSGATLYASSRSAQGTAGPDFRLEMEGLQYSEIPMLAGGNLPLMQQALSAVNNDYSLARMYAMGVDAWSLANHFSQMRQVQGFEINGNTGSLTANPDCVINRKLSWLQYQQGQVVPAS >NZ_CP040886.1|WP_000809262.1|2929834_2930695_+|16S-rRNA-(cytidine(1402)-2'-O)-methyltransferase MKQHQSADNSQGQLYIVPTPIGNLADITQRALEVLQAVDLIAAEDTRHTGLLLQHFGINARLFALHDHNEQQKAETLLAKLQEGQNIALVSDAGTPLINDPGYHLVRTCREAGIRVVPLPGPCAAITALSAAGLPSDRFCYEGFLPAKSKGRRDALKAIEAEPRTLIFYESTHRLLDSLEDIVAVLGESRYVVLARELTKTWETIHGAPVGELLAWVKEDENRRKGEMVLIVEGHKAQEEDLPADALRTLALLQAELPLKKAAALAAEIHGVKKNALYKYALEQQG >NZ_CP040886.1|WP_000816988.1|2930737_2931829_-|fimbrial-protein MKRAPLITGLLLISTSCAYASSGGCGADSTSGATNYSSVVDDVTVNQTDNVTGREFTSATLSSTNWQYACSCSAGKAVKLVYMVSPVLTTTGHQAGYYKLNDSLDIKTTLKANDIPGLVTDQTVSVNTRFTQIKSNTVYSAATQTGVCQGDTSRYGPVNIGANTTFTLYVTKPFLGSMTIPKTDIAVIKGAWVDGMGSPSTGDFHDLVKLSIQGNLTAPQSCKINQGDVIKVNFGFINGQKFTTRNAMPDGFTPVDFDITYDCGDTSKIKNSLQMRIDGTTGVVDQYNLVARRRSSDNAPDVGIRIENLGGGVANIPFQNGILPVDPSGHGTVNMRAWPVNLVGGELETGKFQGTATITVIVR >NZ_CP040886.1|WP_024167269.1|2931839_2934212_-|fimbrial-biogenesis-outer-membrane-usher-protein 
MLETTKSGMQTTDLSRFSKKYAQLPGTYQVDIWLNKKKVSQKKITFTANAEQLLQPQFTVEQLRELGIKVDEIPALAEKDDDSVINSLEQIIPGTAAEFDFNHQRLNLSIPQIALYRDARGYVSPSRWDDGIPTLFTNYSFTGSDNRYRQGNRSQRQYLNMQNGANFGPWRLRNYSTWTRNDQTSSWNTISSYLQRDIKALKSQLLLGESATSGSIFSSYTFTGVQLASDDNMLPNSQRGFAPTVRGIANSSAIVTIRQNGYVIYQSNVPAGAFEINDLYPSSNSGDLEVTIEESDGTQRRFIQPYSSLPMMQRPGHLKYSATAGRYRADANSDSKEPEFAEATAIYGLNNTFTLYSGLLGSEDYYALGIGIGGTLGALGALSMDINRADTQFDNQHSFHGYQWRTQYIKDIPETNTNIAVSYYRYTNDGYFSFDEANTRNWDYNSRQKSEIQFNISQTIFDGVSLYASGSQQDYWGNNEKNRNISVGVSGQQWGIGYSLNYQYSRYTDQNNDRALSLNLSIPLERWLPRSRVSYQMTSQKDRPTQHEMRLDGSLLDDGRLSYSLEQSLDDDNNHNSSVNASYRSPYGTFSAGYSYGNDSSQYNYGVTGGVVIHPHGVTLSQYLGNAFALIDANGASGVRIQNYPGIATDPFGYAVVPYLTTYQENRLSVDTTQLPDNVDLEQTTQFVVPNRGAMVAARFNANIGYRVLVTVSDRNGKPLPFGALASNDETGQQSIVDEGGILYLSGISSKSQSWTVRWGNQADQQCQFAFSTPDSEPTTSVLQGTAQCH >NZ_CP040886.1|WP_001323952.1|2935361_2936117_-|galactosamine-6-phosphate-isomerase MERGTASGGASLLKEFHPVQTLQQVENYTALSERASEYLLAVIRSKPDAVICLATGATPLLTYHYLVEKIHQQQVDVSQLTFVKLDEWVDLPLTMPGTCETFLQQHIVQPLGLREDQLISFRSEEINETECERVTNLIARKGGLDLCVLGLGKNGHLGLNEPGESLQPACHISQLDARTQQHEMLKTAGRPVTRGITLGLKDILNAREVLLLVTGEGKQDATERFLTAKVSTAIPASFLWLHSNFICLINT >NZ_CP040886.1|WP_000534351.1|2936117_2936909_-|PTS-N-acetylgalactosamine-transporter-subunit-IID MGSEISKKDITRLGFRSSLLQASFNYERMQAGGFTWAMLPILKKIYKDDKPGLSAAMKDNLEFINTHPNLVGFLMGLLISMEEKGENRDTIKGLKVALFGPIAGIGDAIFWFTLLPIMAGICSSFASQGNLLGPILFFAVYLLIFFLRVGWTHVGYSVGVKAIDKVRENSQMIARSATILGITVIGGLIASYVHINVVTSFAIDSTHSVALQQDFFDKVFPNILPMAYTLLMYYFLRVKKAHPVLLIGVTFVLSIVCSAFGIL >NZ_CP040886.1|WP_000544489.1|2936898_2937702_-|PTS-N-acetylgalactosamine-transporter-subunit-IIC MHEITLLQGLSLAALVFVLGIDFWLEALFLFRPIIVCTLTGAILGDIQTGLITGGLTELAFAGLTPAGGVQPPNPIMAGLMTTVIAWSTGVDAKTAIGLGLPFSLLMQYVILFFYSAFSLFMTKADKCAKEADTAAFSRLNWTTMLIVASAYAVIAFLCTYLAQGAMQALVKAMPAWLTHGFEVAGGILPAVGFGLLLRVMFKAQYIPYLIAGFLFVCYIQVSNLLPVAVLGAGFAVYEFFNAKSRQQAQPQPVASKNEEEDYSNGI |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP040886_7 | 2969372-2969489 | Orphan |
NA
Consensus repeat of NZ_CP040886_7
|
1 spacer
spacers of NZ_CP040886_7
>7.1|2969412|38|NZ_CP040886|CRISPRCasFinder GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA |
CRISPR arrays and Neighbor proteins around NZ_CP040886_7
The CRISPR arrays of NZ_CP040886_7 >merge|NZ_CP040886|7|2969372-2969489|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGGGTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGATGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG >NZ_CP040886|7|6|2969372-2969489|CRISPRCasFinder TGCCGGATGCGATGCTGGCGCACCTTATCCGGCCTACGGG GTGCTCAACTTGTTGATGTTGTTGTGTTTTGTACCTGA TGCCGGATGCGATGCTGGCGCATCTTATCCGGCCTACGGG
>NZ_CP040886.1|WP_000460519.1|2968040_2969351_+|serine-dehydratase-subunit-alpha-family-protein MFDSTLNPLWQRYILAVQEEVKPALGCTEPISLALAAAVAAAELEGPVERVEAWVSPNLMKNGLGVTVPGTGMVGLPIAAALGALGGNANAGLEVLKDATAQAIADAKALLAAGKVSVKIQEPCNEILFSRAKVWNGEKWACVTIVGGHTNIVHIETHNGVVFTQQACVAEGEQESPLTVLSRTTLAEILKFVNEVPFAAIRFILDSAKLNCALSQEGLSGKWGLHIGATLEKQCERGLLAKDLSSSIVIRTSAASDARMGGATLPAMSNSGSGNQGITATMPVVVVAEHFGADDERLARALMLSHLSAIYIHNQLPRLSALCAATTAAMGAAAGMAWLVDGRYETISMAISSMIGDVSGMICDGASNSCAMKVSTSASAAWKAVLMALDDTAVTGNEGIVAHDVEQSIANLCALASHSMQQTDRQIIEIMASKAR >NZ_CP040886.1|WP_000401598.1|2966681_2968013_+|HAAAP-family-serine/threonine-permease MEIASNKGVIADASTPAGRAGMSESEWREAIKFDSTDTGWVIMSIGMAIGAGIVFLPVQVGLMGLWVFLLSSVIGYPAMYLFQRLFINTLAESPECKDYPSVISGYLGKNWGILLGALYFVMLVIWMFVYSTAITNDSASYLHTFGVTEGLLSDSPFYGLVLICILVAISSRGEKLLFKISTGMVLTKLLVVAALGVSMVGMWHLYNVGSLPPLGLLVKNAIITLPFTLTSILFIQTLSPMVISYRSREKSIEVARHKALRAMNIAFGILFVTVFFYAVSFTLAMGHDEAVKAYEQNISALAIAAQFISGDGAAWVKVVSVILNIFAVMTAFFGVYLGFREATQGIVMNILRRKMPAEKINENLVQRGIMIFAILLAWSAIVLNAPVLSFTSICSPIFGMVGCLIPAWLVYKVPALHKYKGMSLYLIIVTGLLLCVSPFLAFS >NZ_CP040886.1|WP_000622115.1|2965042_2966407_+|L-serine-ammonia-lyase MISAFDIFKIGIGPSSSHTVGPMNAGKSFIDRLESSGLLTATSHIVVDLYGSLSLTGKGHATDVAIIMGLAGNSPQDVVIDEIPAFIELVTRSGRLPVASGAHIVDFPVAKNIIFHPEMLPRHENGMRITAWKGQEALLSKTYYSVGGGFIVEEEHFGLSHDVETSVPYDFHSAGELLKMCDYNGLSISGLMMHNELALRSKAEIDAGFARIWQVMHDGIERGMNTEGVLPGPLNVPRRAVALRRQLVSSDNISNDPMNVIDWINMYALAVSEENAAGGRVVTAPTNGACGIIPAVLAYYDKFRRPVNERSIARYFLAAGAIGALYKMNASISGAEVGCQGEIGVACSMAAAGLTELLGGSPAQVCNAAEIAMEHNLGLTCDPVAGQVQIPCIERNAINAVKAVNAARMAMRRTSAPRVSLDKVIETMYETGKDMNDKYRETSRGGLAIKVVCG >NZ_CP040886.1|WP_001375219.1|2964581_2964971_+|enamine/imine-deaminase MKKIIETQRAPGAIGPYVQGVDLGSMVFTSGQIPVCPQTGEIPADVQDQARLSLENVKAIVVAAGLSVGDIIKMTVFITDLNDFATINEVYKQFFDEHQATYPTRSYVQVARLPKDVKLEIEAIAVRSA >NZ_CP040886.1|WP_000861734.1|2962273_2964568_+|2-ketobutyrate-formate-lyase/pyruvate-formate-lyase 
MKVDIDTSDKLYADAWLGFKGTDWKNEINVRDFIQHNYTPYEGDESFLAEATPATTELWEKVMEGIRIENATHAPVDFDTNIATTITAHDAGYINQPLEKIVGLQTDAPLKRALHPFGGINMIKSSFHAYGREMDSEFEYLFTDLRKTHNQGVFDVYSPDMLRCRKSGVLTGLPDGYGRGRIIGDYRRVALYGISYLVRERELQFADLQSRLEKGEDLEATIRLREELAEHRHALLQIQEMAAKYGFDISRPAQNAQEAVQWLYFAYLAAVKSQNGGAMSLGRTASFLDIYIERDFKAGVLNEQQAQELIDHFIMKIRMVRFLRTPEFDSLFSGDPIWATEVIGGMGLDGRTLVTKNSFRYLHTLHTMGPAPEPNLTILWSEELPIAFKKYAAQVSIVTSSLQYENDDLMRTDFNSDDYAIACCVSPMVIGKQMQFFGARANLAKTLLYAINGGVDEKLKIQVGPKTAPLMDDVLDYDKVMDSLDHFMDWLAVQYISALNIIHYMHDKYSYEASLMALHDRDVYRTMACGIAGLSVATDSLSAIKYARVKPIRDENGLAVDFEIDGEYPQYGNNDERVDSIACDLVERFMKKIKALPTYRNAVPTQSILTITSNVVYGQKTGNTPDGRRAGTPFAPGANPMHGRDRKGAVASLTSVAKLPFTYAKDGISYTFSIVPAALGKEDPVRKTNLVGLLDGYFHHEADVEGGQHLNVNVMNREMLLDAIEHPEKYPNLTIRVSGYAVRFNALTREQQQDVISRTFTQAL >NZ_CP040886.1|WP_001297162.1|2961031_2962240_+|propionate-kinase MNEFPVVLVINCGSSSIKFSVLDASDCEVLMSGIADGINSENAFLSVNGGEPAPLAHHSYEGALKAIAFELEKRNLNDSVALIGHRIAHGGSIFTESAIITDEVIDNIRRVSPLAPLHNYANLSGIESAQQLFPGVTQVAVFDTSFHQTMAPEAYLYGLPWKYYEELGVRRYGFHGTSHRYVSQRAHSLLNLAEDDSGLVVAHLGNGASICAVRNGQSVDTSMGMTPLEGLMMGTRSGDVDFGAMSWVASQTNQSLGDLERVVNKESGLLGISGLSSDLRVLEKAWHEGHERAQLAIKTFVHRIARHIAGHAASLRRLDGIIFTGGIGENSSLIRRLVMEHLAVLGVEIDTEMNNRSNSCGERIVSSENARVICAVIPTNEEKMIALDAIHLGKVNAPAEFA >NZ_CP040886.1|WP_000107720.1|2959674_2961006_+|threonine/serine-transporter-TdcC MSTSDSIVSSQTKQSSWRKSDTTWTLGLFGTAIGAGVLFFPIRAGFGGLIPILLMLVLAYPIAFYCHRALARLCLSGSNPSGNITETVEEHFGKTGGVVITFLYFFAICPLLWIYGVTITNTFMTFWENQLGFAPLNRGFVALFLLLLMAFVIWFGKDLMVKVMSYLVWPFIASLVLISLSLIPYWNSAVIDQVDLGSLSLTGHDGILITVWLGISIMVFSFNFSPIVSSFVVSKREEYEKDFGRDFTERKCSQIISRASMLMVAVVMFFAFSCLFTLSPANMAEAKAQNIPVLSYLANHFASMTGTKTTFAITLEYAASIIALVAIFKSFFGHYLGTLEGLNGLILKFGYKGDKTKVSLGKLNTISMIFIMGSTWVVAYANPNILDLIEAMGAPIIASLLCLLPMYAIRKAPSLAKYRGRLDNVFVTVIGLLTILNIVYKLF >NZ_CP040886.1|WP_000548347.1|2958663_2959653_+|bifunctional-threonine-ammonia-lyase/L-serine-ammonia-lyase-TdcB 
MHITYDLPVAIDDIIEAKQRLAGRIYKTGMPRSNYFSERCKGEIFLKFENMQRTGSFKIRGAFNKLSSLTDAEKRKGVVACSAGNHAQGVSLSCAMLGIDGKVVMPKGAPKSKVAATCDYSAEVVLHGDNFNDTIAKVSEIVEMEGRIFIPPYDDPKVIAGQGTIGLEIMEDLYDVDNVIVPIGGGGLIAGIAVAIKSINPTIRVIGVQSENVHGMAASFHSGEITTHRTTGTLADGCDVSRPGNLTYEIVRELVDDIVLVSEDEIRNSMIALIQRNKVVTEGAGALACAALLSGKLDQYIQNRKTVSIISGGNIDLSRVSQITGFVDA >NZ_CP040886.1|WP_000104211.1|2957626_2958565_+|transcriptional-regulator-TdcA MSTILLPKTQHLVVFQEVIRSGSIGSAAKELGLTQPAVSKIINDIEDYFGVELVVRKNTGVTLTPAGQLLLSRSESITREMKNMVNEISGMSSEAVVEVSFGFPSLIGFTFMSGMINKFKEVFPKAQVSMYEAQLSSFLPAIRDGRLDFAIGTLSAEMKLQDLHVEPLFESEFVLVASKSRTCTGTTTLESLKNEQWVLPQTNMGYYSELLTTLQRNGISIENIVKTDSVVTIYNLVLNADFLTVIPCDMTSPFGSNQFITIPVEETLPVAQYAAVWSKNYRIKKAASVLVELAKEYSSYNGCRRRQLIEVG >NZ_CP040886.1|WP_000145820.1|2957093_2957438_-|DNA-binding-transcriptional-activator-TdcR MTGITIFYGDNIIRYVVNIKKGLRPYFKQLPDNYQAKFELNLMSKFSNFIINKPFSAINTAARHIFSRYLLENKHLFYQYFKISNTGIDHLEQLINVNFFSSDRTSFCECNRFP >NZ_CP040886.1|WP_001295544.1|2969562_2969727_-|hypothetical-protein MSKKSAKKRQPVKPVVAKEPARTAKNFGYEEMLSELEAIVADAETRLAEDEATA >NZ_CP040886.1|WP_000633577.1|2969749_2970451_-|pirin-family-protein MITTRTARQCGQADYGWLQARYTFSFGHYFDPKLLGYASLRVLNQEVLAPGAAFQPRTYPKVDILNVILDGEAEYRDSEGNHVQASAGEALLLSTQPGVSYSEHNLSKDKPLTRMQLWLDACPQRENPLIQKLALNMGKQQLIASPEGTMGSLQLRQQVWLHHIVLDKGESANFQLHGPRAYLQSIHGKFHALTHHEEKAALTCGDGAFIRDEANITLVADSPLRALLIDLPV >NZ_CP040886.1|WP_001041010.1|2970555_2971452_+|LysR-family-transcriptional-regulator MAKERALTLEALRVMDAIDRRGSFAAAADELGRVPSALSYTMQKLEEELDVVLFDRSGHRTKFTNVGRMLLERGRVLLEAADKLTTDAEALARGWETHLTIVTEALVPTPAFFPLIDKLAAKANTQLAIITEVLAGAWERLEQGRADIVIAPDMHFRSSSEINSRKLYTLMNVYVAAPDHPIHQEPEPLSEVTRVKYRGIAVADTARERPVLTVQLLDKQPRLTVSTIEDKRQALLAGLGVATMPYPMVEKDIAEGRLRVVSPESTSEIDIIMAWRRDSMGEAKSWCLREIPKLFSGK >NZ_CP040886.1|WP_001198780.1|2971502_2971859_-|DUF805-domain-containing-protein MQWYLAVLKNYVGFSGRARRKEYWMFTLINAIVGAIINVIQLILGLEFPFLSLIYLAATIIPVIALCVRRLHDTDRSGAWALLYLVPIIGWLVLFVFACLEGNSGSNRYGNDPKFGSN 
>NZ_CP040886.1|WP_000384145.1|2972100_2972466_-|DUF805-domain-containing-protein MDWYLKVLKNYVGFRGRARRKEYWMFILVNIIFTFVLGLLDKMLGWQRAGGEGILTTIYGILVFLPWWAVQFRRLHDTDRSAWWALLFLIPFIGWLIIIVFNCQAGTPGENRFGPDPKLEP >NZ_CP040886.1|WP_000531204.1|2972758_2973745_-|glutathione-S-transferase-family-protein MGQLIDGVWHDTWYDTKSTGGKFQRSASAFRNWLTADGAPGPTGTGGFIAEKDRYHLYVSLACPWAHRTLIMRKLKGLEPFISVSVVNPLMLENGWTFDDSFPGATGDTLYQHEFLYQLYLHADPHYSGRVTVPVLWDKKNHTIVSNESAEIIRMFNTAFDALGAKAGDYYPPALQTKIDELNGWIYDTVNNGVYKAGFATSQQAYDEAVAKVFESLARLEQILGQHRYLTGNQLTEADIRLWTTLVRFDPVYVTHFKCDKHRISNYLNLYGFLRDIYQMPGIAETVNFDHIRNHYFRSHKTINPTGIISIGPWQDLDEPHGRDVRFG >NZ_CP040886.1|WP_000603618.1|2973814_2974297_-|DoxX-family-protein MILSIDSNDANTAPLHKKTISSLSGAVESMMKKLEDVGVLVARILMPILFITAGWGKITGYAGTQQYMEAMGVPGFMLPLVILLEFGGGLAILFGFLTRTTALFTAGFTLLTAFLFHSNFAEGVNSLMFMKNLTISGGFLLLAITGPGAYSIDRLLNKKW >NZ_CP040886.1|WP_000096086.1|2974392_2974692_-|hypothetical-protein MSSKVERERRKAQLLSQIQQQRLDLSASRREWLEATGAYDRRWNMLLSLRSWALVGSSVMAIWTIRHPNMLVRWARRGFGVWSAWRLVKTTLKQQQLRG >NZ_CP040886.1|WP_000785722.1|2974681_2975086_-|hypothetical-protein MADTHHAQGPGKSVLGIGQRIVSIMVEMVETRLRLAVVELEEEKANLFQLLLMLGLTMLFAAFGLMSLMVLIIWAVDPQYRLNAMIATTVVLLLLALIGGIWTLRKSRKSTLLRHTRHELANDRQLLEEESREQ >NZ_CP040886.1|WP_000031415.1|2975088_2975394_-|DUF883-domain-containing-protein MSKEHTTEHLRAELKSLSDTLEEVLSSSGEKSKEELSKIRSKAEQALKQSRYRLGETGDAIAKQTRVAAARADEYVRENPWTGVGIGAAIGVVLGVLLSRR |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP040886_8 | 3344376-3344831 | Orphan |
I-E
Consensus repeat of NZ_CP040886_8
|
7 spacers
spacers of NZ_CP040886_8
>8.1|3344404|33|NZ_CP040886|PILER-CR GGCTGATGGTCTGGGAGTGTCCATCGGGCAACT >8.2|3344465|33|NZ_CP040886|PILER-CR GGAAGTAGGCCTGACAGTGATTGAACGCATACT >8.3|3344526|33|NZ_CP040886|PILER-CR GAGTTGGGGCGGCGCAATAACGAGACGATACGC >8.4|3344587|33|NZ_CP040886|PILER-CR GGGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT >8.5|3344648|33|NZ_CP040886|PILER-CR GTCAACGCGCTCAGACGTTGCGTGAGTGAACCA >8.6|3344709|33|NZ_CP040886|PILER-CR GAAATATCCAGGGCTGGGCTGGAGGCAGACGGC >8.7|3344770|33|NZ_CP040886|PILER-CR GCCCGGAATGCATTCTGAAGGTTTGCTGTATAT >8.8|3344405|32|NZ_CP040886|CRISPRCasFinder,CRT GCTGATGGTCTGGGAGTGTCCATCGGGCAACT >8.9|3344466|32|NZ_CP040886|CRISPRCasFinder,CRT GAAGTAGGCCTGACAGTGATTGAACGCATACT >8.10|3344527|32|NZ_CP040886|CRISPRCasFinder,CRT AGTTGGGGCGGCGCAATAACGAGACGATACGC >8.11|3344588|32|NZ_CP040886|CRISPRCasFinder,CRT GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT >8.12|3344649|32|NZ_CP040886|CRISPRCasFinder,CRT TCAACGCGCTCAGACGTTGCGTGAGTGAACCA >8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT AAATATCCAGGGCTGGGCTGGAGGCAGACGGC >8.14|3344771|32|NZ_CP040886|CRISPRCasFinder,CRT CCCGGAATGCATTCTGAAGGTTTGCTGTATAT |
CRISPR arrays and Neighbor proteins around NZ_CP040886_8
The CRISPR arrays of NZ_CP040886_8 >merge|NZ_CP040886|8|3344376-3344831|PILER-CR,CRISPRCasFinder,CRT GAGTTCCCCGCGCCAGCGGGGATAAACCGGCTGATGGTCTGGGAGTGTCCATCGGGCAACTGAGTTCCCCGCGCCAGCGGGGATAAACCGGAAGTAGGCCTGACAGTGATTGAACGCATACTGAGTTCCCCGCGCCAGCGGGGATAAACCGAGTTGGGGCGGCGCAATAACGAGACGATACGCGAGTTCCCCGCGCCAGCGGGGATAAACCGGGGAGTGGCACTTCTGGGGTAGCGGCGGCCCTGAGTTCCCCGCGCCAGCGGGGATAAACCGTCAACGCGCTCAGACGTTGCGTGAGTGAACCAGAGTTCCCCGCGCCAGCGGGGATAAACCGAAATATCCAGGGCTGGGCTGGAGGCAGACGGCGAGTTCCCCGCGCCAGCGGGGATAAACCGCCCGGAATGCATTCTGAAGGTTTGCTGTATATGAGTTCCCCGCGCCAGCGGGGATAAACCA >NZ_CP040886|8|2|3344376-3344830|PILER-CR GAGTTCCCCGCGCCAGCGGGGATAAACC GGCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACC GGAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACC GAGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACC GGGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACC GTCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACC GAAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACC GCCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACC >NZ_CP040886|8|7|3344376-3344831|CRISPRCasFinder GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA >NZ_CP040886|8|1|3344376-3344831|CRT GAGTTCCCCGCGCCAGCGGGGATAAACCG GCTGATGGTCTGGGAGTGTCCATCGGGCAACT GAGTTCCCCGCGCCAGCGGGGATAAACCG GAAGTAGGCCTGACAGTGATTGAACGCATACT GAGTTCCCCGCGCCAGCGGGGATAAACCG AGTTGGGGCGGCGCAATAACGAGACGATACGC GAGTTCCCCGCGCCAGCGGGGATAAACCG GGGAGTGGCACTTCTGGGGTAGCGGCGGCCCT GAGTTCCCCGCGCCAGCGGGGATAAACCG TCAACGCGCTCAGACGTTGCGTGAGTGAACCA GAGTTCCCCGCGCCAGCGGGGATAAACCG 
AAATATCCAGGGCTGGGCTGGAGGCAGACGGC GAGTTCCCCGCGCCAGCGGGGATAAACCG CCCGGAATGCATTCTGAAGGTTTGCTGTATAT GAGTTCCCCGCGCCAGCGGGGATAAACCA
>NZ_CP040886.1|WP_001199979.1|3343364_3344036_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWEKLEDREVSLFSILAKTKESDKWGAASSEDLLAVISRQGYTARHVVITGGEPCIHDLLPLTDLLEKNGFSCQIETSGTHEVRCTPNTWVTVSPKLNMRGGYEVLSQALERANEIKHPVGRVRDIEALDELLATLTDDKPRVIALQPISQKDDATRLCIETCIARNWRLSMQTHKYLNIA >NZ_CP040886.1|WP_001288227.1|3343085_3343226_-|hypothetical-protein MSEENKENGFNHVKTFTKIIFIFSVLVFNDNESKITDAAVNLFIQI >NZ_CP040886.1|WP_001679366.1|3342199_3343072_-|YgcG-family-protein MRYFILMFTFVCSFVAAQPTIVPQLQQQVTDLTSSLNSQEKKELTHKLESIFNNTQVQIAVLIVPTTKDETIEQYATRVFDNWRLGDAKRNDGILIIVAWSDRTVRIKVGYGLEEKVTDALAGDIIRSNMIPAFKQQKLAQGLELAINALNNQLTSQHQYPTNPSESESASSSDHYYFAIFWVFAVMFFPFWFFHQCSNFCRACKSGVCISAIYLLDLFLFSDKIFSIAVFSFFFTFTIFMVFTCLCVLQKRASGRSYHSDNSGSAGGSDSGGFSGGGGSSGGGGASGRW >NZ_CP040886.1|WP_000036723.1|3340841_3342140_+|phosphopyruvate-hydratase MSKIVKIIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVAAVNGPIAQALIGKDAKDQAGIDKIMIDLDGTENKSKFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKAKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >NZ_CP040886.1|WP_000210878.1|3339116_3340754_+|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQMAVEIGREHTLFMHLTLVPYMAASGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISLKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIFEEANPVSEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVSVNIKLIDSQDVETRGVEILKGLDAILVPGGFGYRGVEGMITTARFARENNIPYLGICLGMQVALIDYARHVANMENANSTEFVPDCKYPVVALITEWRDENGNVEVRSEKSDLGGTMRLGAQQCQLVDDSLVRQLYNAPTIVERHRHRYEVNNMLLKQIEDAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAASEFQKRQAK 
>NZ_CP040886.1|WP_001071648.1|3338097_3338889_+|nucleoside-triphosphate-pyrophosphohydrolase MNQIDRLLTIMQRLRDPENGCPWDKEQTFATIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFADSSAENSSEVLARWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCANVGFDWTTLGPVVDKVYEEIDEVMYEARQAVVDQAKLEEEMGDLLFATVNLARHLGTKAEIALQKANEKFERRFREVERIVAARGLEMTGVDLETMEEVWQQVKRQEIDL >NZ_CP040886.1|WP_000254738.1|3337691_3338027_+|endoribonuclease-MazF MVSRYVPDMGDLIWVDFDPTKGSEQAGHRPAVVLSPFMYNNKTGMCLCVPCTTQSKGYPFEVVLSGQERDGVALADQVKSIAWRARGATKKGTVAPEELQLIKAKINVLIG >NZ_CP040886.1|WP_000581937.1|3337443_3337692_+|type-II-toxin-antitoxin-system-antitoxin-MazE MIHSSVKRWGNSPAVRIPATLMQALNLNIDDEVKIDLVDGKLIIEPVRKEPVFTLAELVNDITPENLHENIDWGEPKDKEVW >NZ_CP040886.1|WP_000226815.1|3335131_3337366_+|GTP-pyrophosphokinase MVAVRSAHINKAGEFDPEKWIASLGITSQKSCECLAETWAYCLQQTQGHPDASLLLWRGVEMVEILSTLSMDIDTLRAALLFPLADANVVSEDVLRESVGKSVVNLIHGVRDMAAIRQLKATHTDSVSSEQVDNVRRMLLAMVDDFRCVVIKLAERIAHLREVKDAPEDERVLAAKECTNIYAPLANRLGIGQLKWELEDYCFRYLHPTEYKRIAKLLHERRLDREHYIEEFVGHLRAEMKAEGVKAEVYGRPKHIYSIWRKMQKKNLAFDELFDVRAVRIVAERLQDCYAALGIVHTHYRHLPDEFDDYVANPKPNGYQSIHTVVLGPGGKTVEIQIRTKQMHEDAELGVAAHWKYKEGAAAGGARSGHEDRIAWLRKLIAWQEEMADSGEMLDEVRSQVFDDRVYVFTPKGDVVDLPAGSTPLDFAYHIHSDVGHRCIGAKIGGRIVPFTYQLQMGDQIEIITQKQPNPSRDWLNPNLGYVTTSRGRSKIHAWFRKQDRDKNILAGRQILDDELEHLGISLKEAEKHLLPRYNFNDVDELLAAIGGGDIRLNQMVNFLQSQFNKPSAEEQDAAALKQLQQKSYTPQNRSKDNGRVVVEGVGNLMHHIARCCQPIPGDEIVGFITQGRGISVHRADCEQLAELRSHAPERIVDAVWGESYSAGYSLVVRVVANDRSGLLRDITTILANEKVNVLGVASRSDTKQQLATIDMTIEIYNLQVLGRVLGKLNQVPDVIDARRLHGS >NZ_CP040886.1|WP_000046812.1|3333782_3335084_+|23S-rRNA-(uracil(1939)-C(5))-methyltransferase-RlmD 
MAQFYSAKRRTTTRQIITVSVNDLDSFGQGVARHNGKTLFIPGLLPQENAEVTVTEDKKQYARAKVVRRLSDSPERETPRCPHFGVCGGCQQQHASVDLQQRSKSAALARLMKHDVSEVIADVPWGYRRRARLSLNYLPKTQQLQMGFRKAGSSDIVDVKQCPILAPQLEALLPKVRACLGSLQAMRHLGHVELVQATSGTLMILRHTAPLSSADREKLERFSHSEGLDLYLAPDSEILETVSGEMPWYDSNGLRLTFSPRDFIQVNAGVNQKMVARALEWLDVQPEDRVLDLFCGMGNFTLPLATQAASVVGVEGVPALVEKGQQNARLNGLQNVTFYHENLEEDVTKQPWAKNGFDKVLLDPARAGAAGVMQQIIKLEPIRIVYVSCNPATLARDSEALLKAGYTIARLAMLDMFPHTGHLESMVLFSRVK >NZ_CP040886.1|WP_000039683.1|3345468_3346947_-|sugar-kinase MSKKYIIGIDGGSQSTKVVMYDLEGNVVCEGKGLLQPMHTPDADTAEHPDDDLWASLCFAGHDLMSQFAGNKEDIVGIGLGSIRCCRALLKADGTPAAPLISWQDARVTRPYEHTNPDVAYVTSFSGYLTHRLTGEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIPRHMLFDVQMPGTVLGHITPQAALATHFPAGLPVVCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAYWPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDAKAQDLSPEDLLNKKASCVPPGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNYDNMCNEMNYFAKHVIITGGGSNSDLFMQIFADVFNLPARRNAINGCASLGAAINTAVGLGLYPDYATAVDKMVRVKDIFMPVESNAKRYDAMNKGIFKDLTKHTDVILKKSYEVMHGELGNADSIQSWSNA >NZ_CP040886.1|WP_001164578.1|3346973_3348251_-|MFS-transporter MQHNSYRRWITLAIISFSGGVSFDLAYLRYIYQIPMAKFMGFSNTEIGLIMSTFGIAAIILYAPSGVIADKFSHRKMITSAMIITGLLGLLMATYPPLWVMLCIQVAFAITTILMLWSVSIKAASLLGDHSEQGKIMGWMEGLRGVGVMSLAVFTMWVFSRFAPDDSTSLKTVIIIYSVVYILLGILCWFFVSDNNNLRSANNEEKQSFQLSDILAVLRISTTWYCSMVIFGVFTIYAILSYSTNYLTEMYGMSLVAASYMGIVINKIFRALCGPLGGIITTYSKVKSPTRVIQILSIIGLLALTALLVTNSNPQSVAMGIGLILLLGFTCYASRGLYWACPGEARTPSYIMGTTVGICSVIGFLPDVFVYPIIGHWQDTLPAAEAYRNMWLMGMAALGMVIVFTFLLFQKIRTADSAPAMASSK >NZ_CP040886.1|WP_000021330.1|3348569_3349355_+|SDR-family-oxidoreductase MSIESLNAFSMDFFSLKGKTAIVTGGNSGLGQAFAMALAKAGANIFIPSFVKDNGETKEMIEKQGVEVDFMQVDITAEGAPQKIIAASCERFGTVDILVNNAGICKLNKVLDFGRADWDPMIDVNLTAAFELSYEAAKIMIPQKSGKIINICSLFSYLGGQWSPAYSATKHALAGFTKAYCDELGQYNIQVNGIAPGYYATDITLATRSNPETNQRVLDHIPANRWGDTQDLMGAAVFLASPASNYVNGHLLVVDGGYLVR >NZ_CP040886.1|WP_000059312.1|3349424_3350879_+|FAD-binding-oxidoreductase 
MSLSRAAIVDQLKEIVGADRVITDETVLKKNSIDRFRKFPDIHGIYTLPIPAAVVKLGSTEQVSRVLNFMNAHKINGVPRTGASATEGGLETVVENSVVLDGSAMNQIINIDIENMQATAQCGVPLEVLENALREKGYTTGHSPQSKPLAQMGGLVATRSIGQFSTLYGAIEDMVVGLEAVLADGTVTRIKNVPRRAAGPDIRHIIIGNEGALCYITEVTVKIFKFTPENNLFYGYILEDMKTGFNILREVMVEGYRPSIARLYDAEDGTQHFTHFADGKCVLIFMAEGNPRIAKATGEGIAEIVARYPQCQRVDSKLIETWFNNLNWGPDKVAAERVQILKTGNMGFTTEVSGCWSCIHEIYESVINRIRTEFPHADDITMLGGHSSHSYQNGTNMYFVYDYNVVDCKPEEEIDKYHNPLNKIICEETIRLGGSMVHHHGIGKHRVHWSKLEHGSAWALLEGLKKQFDPNGIMNTGTIYPIEK >NZ_CP040886.1|WP_001098105.1|3350972_3352310_+|MFS-transporter MNTSPVRMDDLPLNRFHCRIAALTFGAHLTDGYVLGVIGYAIIQLTPAMQLTPFMAGMIGGSALLGLFLGSLVLGWISDHIGRQKIFTFSFLLITLASFLQFFATTPEHLIGLRILIGIGLGGDYSVGHTLLAEFSPRRHRGILLGAFSVVWTVGYVLASIAGHHFISENPEAWRWLLASAALPALLITLLRWGTPESPRWLLRQGRFAEAHAIVHRYFGPHVLLGDEVVTATHKHIKTLFSSRYWRRTAFNSVFFVCLVIPWFVIYTWLPTIAQTIGLEDALTASLMLNALLIVGALLGLVLTHLLAHRKFLLGSFLLLAATLVVMACLPSGSSLTLLLFVLFSTTISAVSNLVGILPAESFPTDIRSLGVGFATAMSRLGAAVSTGLLPWVLAQWGMQVTLLLLATVLLVGFVVTWLWAPETKALPLVAAGNVGGANEHSVSV >NZ_CP040886.1|WP_001324445.1|3352287_3353067_+|electron-transfer-flavoprotein-subunit-beta/FixA-family-protein MNILLAFKAEPDAGMLAEKEWQAAAQGKSGPDISLLRSLLGADEQAAAALLLAQRKNGTPMSLTALSMGDERALHWLRYLMALGFEEAVLLETAADLRFAPEFVARHIAEWQHQNPLDLIITGCQSSEGQNGQTPFLLAEMLGWPCFTQVERFTLDALFITLEQRTEHGLRCCRVRLPAVIAVRQCGEVALPVPGMRQRMAAGKAEIIRKTVAAEMPAMQCLQLARAEQRRGATLIDGQTVAEKAQKLWRDYLRQRMQP >NZ_CP040886.1|WP_001324446.1|3353063_3353924_+|electron-transfer-flavoprotein-subunit-alpha/FixB-family-protein MNIAIVTINQENAAIASWLAAQDFSGCTLAHWQIEPQPVVAEQVLDALVEQWQRTPADVVLFPPGTFGDELSTRLAWRLHGASICQVTSLDIPTVSVRKSHWGNALTATLQTEKRPLCLSLARQAGAAKNATLPSGMQQLIIVPGALPDWLVSTEDLKNVTRDPLAEARRVLVVGQGGEADNQEIAMLAEKLGAEVGYSRARVMNGGVDAEKVIGISGHLLAPEVCIVVGASGAAALMAGVRNSKFVVAINHDASAAVFSQADVGVVDDWKVVLEALVTNIHADCQ >NZ_CP040886.1|WP_001130266.1|3354071_3354647_-|glycerol-3-phosphate-responsive-antiterminator 
MPLLHLLRQNPVIAAVKDNASLQLAIDSECQFISVLYGNICTISNIVKKIKNAGKYAFIHVDLLEGASNKEVVIQFLKLVTEADGIISTKASMLKAARAEGFFCIHRLFIVDSISFHNIDKQVAQSNPDCIEILPGCMPKVLGWVTEKIRQPLIAGGLVCDEEDARNAINAGVVALSTTNTGVWTLAKKLL >NZ_CP040886.1|WP_000109532.1|3354663_3354924_-|ferredoxin-family-protein MSVARNLWRVADAPHIVPADSVERQTAERLISACPAGLFSLTPEGDLRIDYRSCLECGTCRLLCDESTLQQWRYPPSGFGITYRFG >NZ_CP040886.1|WP_001295150.1|3354914_3356186_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP040886_9 | 3367216-3367611 | Unclear |
I-E
Consensus repeat of NZ_CP040886_9
|
6 spacers
spacers of NZ_CP040886_9
>9.1|3367246|31|NZ_CP040886|CRISPRCasFinder TTGCCCGCGCAATTCCGGGAGCATCCGCAAT >9.2|3367307|31|NZ_CP040886|CRISPRCasFinder ACGGACAAAATATATATTGATTTGCGAATTA >9.3|3367368|31|NZ_CP040886|CRISPRCasFinder GTAAAGAAACTGCCGACAAATCCCTGTTCGT >9.4|3367429|31|NZ_CP040886|CRISPRCasFinder CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT >9.5|3367490|31|NZ_CP040886|CRISPRCasFinder GTTTACCGCCCCGCAGAGGCGCTGGCAGATC >9.6|3367551|31|NZ_CP040886|CRISPRCasFinder GGATGACCTGTCGCTAAAACTCGCCGCGTAC >9.7|3367246|32|NZ_CP040886|PILER-CR,CRT TTGCCCGCGCAATTCCGGGAGCATCCGCAATT >9.8|3367307|32|NZ_CP040886|PILER-CR,CRT ACGGACAAAATATATATTGATTTGCGAATTAT >9.9|3367368|32|NZ_CP040886|PILER-CR,CRT GTAAAGAAACTGCCGACAAATCCCTGTTCGTT >9.10|3367429|32|NZ_CP040886|PILER-CR,CRT CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA >9.11|3367490|32|NZ_CP040886|PILER-CR,CRT GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC >9.12|3367551|32|NZ_CP040886|PILER-CR,CRT GGATGACCTGTCGCTAAAACTCGCCGCGTACA |
cas2,cas1,cas6e,cas5 |
CRISPR arrays and Neighbor proteins around NZ_CP040886_9
The CRISPR arrays of NZ_CP040886_9 >merge|NZ_CP040886|9|3367216-3367611|CRISPRCasFinder,PILER-CR,CRT TGTGTTCCCCGCGCCAGCGGGGATAAACCGTTGCCCGCGCAATTCCGGGAGCATCCGCAATTGTGTTCCCCGCGCCAGCGGGGATAAACCGACGGACAAAATATATATTGATTTGCGAATTATGTGTTCCCCGCGCCAGCGGGGATAAACCGGTAAAGAAACTGCCGACAAATCCCTGTTCGTTGTGTTCCCCGCGCCAGCGGGGATAAACCGCCGAACGGCTGGCGAAGCAGGTGGCTGGCGTAGTGTTCCCCGCGCCAGCGGGGATAAACCGGTTTACCGCCCCGCAGAGGCGCTGGCAGATCCGTGTTCCCCGCGCCAGCGGGGATAAACCGGGATGACCTGTCGCTAAAACTCGCCGCGTACAGTGTTCCCCGCGCCAGCGGGGATAAACCG >NZ_CP040886|9|8|3367216-3367611|CRISPRCasFinder TGTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAAT TGTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTA TGTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGT TGTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGT AGTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATC CGTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTAC AGTGTTCCCCGCGCCAGCGGGGATAAACCG >NZ_CP040886|9|3|3367217-3367611|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG >NZ_CP040886|9|2|3367217-3367611|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCCCGCGCAATTCCGGGAGCATCCGCAATT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGGACAAAATATATATTGATTTGCGAATTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTAAAGAAACTGCCGACAAATCCCTGTTCGTT GTGTTCCCCGCGCCAGCGGGGATAAACCG CCGAACGGCTGGCGAAGCAGGTGGCTGGCGTA GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTTACCGCCCCGCAGAGGCGCTGGCAGATCC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGATGACCTGTCGCTAAAACTCGCCGCGTACA GTGTTCCCCGCGCCAGCGGGGATAAACCG
>NZ_CP040886.1|WP_000063176.1|3366826_3367120_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAIWLLEVRAGVYVGDTSKRIREMIWQQITQLAGCGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVDNQ >NZ_CP040886.1|WP_000144861.1|3365906_3366830_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLSPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLAATVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALTEDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRQTYALLAKQYGVKWNGRKYDPKDWEKGDVVNRCISAATSCLYGISEAAVLAAGYAPAIGFIHSGKPLSFVYDIADIIKFDSVVPKAFEIAARQPAEPDKEVRLACRDIFRSTKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPETLGDSGHRGRGG >NZ_CP040886.1|WP_000281446.1|3365259_3365910_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTGQLSPAQLLHLVDRGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQERPAESDTFTIECRSFAPELRTGQQLCFNLRANPTICKSGKRHDLLMEAKRQVRGQAEGSDVWLHQQQAALDWLAAQGERSGFTLLDTSVDAYRQQQLRRENSRQLIQFSSVDYTGMLTVTDPGLFLQRLSQGYGKSRAFGCGLMLIKPGAEA >NZ_CP040886.1|WP_000085051.1|3364531_3365278_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLIFQLHGPMASWGVDAPGEVRHTHELPSRSALLGLLAAGVGIRRDDTERLNAFNRHYSLVVCASRNPRWARDYHTIQMPKEVRKARYFSRREELSDPDLLSAIISRRDYYTDAWWMVAVATTADAPYSLEQLQDGLRHPVFPLYLGRKSHPLALPLAPLLLEGNACDALCNAYQQYQDHFHKLKVSLPKLQDECWWEGEHDGLVASKILRRRDVPLNRQQWLFGERTINQGPWLSKEEPCTSQE >NZ_CP040886.1|WP_000956458.1|3361528_3361681_+|type-I-toxin-antitoxin-system-Hok-family-toxin MLTKYALVAIIVLCCTVLGFTLMVGDSLCELSIRERGMEFKAVLAYESKK >NZ_CP040886.1|WP_000039842.1|3360529_3361264_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRILALAETNAELEKLDAEGRVAWALDNLPGEYVLSSSFGIQAAVSLHLVNQIHPDIPVILTDTGYLFPETYRFIDELTDKLKLNLKVYRATESAAWQEARYGKLWEQGVEGIEKYNDINKVEPMNRALKELNAQTWFAGLRREQSGSRANLPVLAIQRGVFKVLPIIDWDNRTIYQYLQKHGLKYHPLWDEGYLSVGDTHTTRKWEPGMLEEETRFFGLKRECGLHEG >NZ_CP040886.1|WP_001290706.1|3358743_3360456_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit 
MSEKHPGPLVVEGKLTDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTKQWQAIDKFAGENTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVATTDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGVETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDDNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHKGDFRITANQNLIIAGVPESEKAKIEKIAKESGLMNAVTPQRENSMACVSFPTCPLAMAEAERFLPSFIDNIDNLMAKHGVSDEHIVMRVTGCPNGCGRAMLAEVGLVGKAPGRYNLHLGGNRIGTRIPRMYKENITEPEILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDLWD >NZ_CP040886.1|WP_000211954.1|3356944_3358744_+|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTQVPPSALLPLNPEQLVRLQAATTDLTPTQLAWVSGYFWGVLNQQPAALAATPAPAAEMPGITIISASQTGNARRVAEALRDDLLAAKLNVKLVNAGDYKFKQIASEKLLIVVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDSSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDALKSRAPVAAPSQSVATGAVNEIHTSPYSKDAPLVASLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVEGKTLPLNEALQWHFELTVNTANIVENYATLTRSETLLPLVGDKAKLQHYAATTPIVDMVRFSPAQLDAEALINLLRPLTPRLYSIASSQAEVENEVHVTVGVVRYDVEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPETPVIMIGPGTGIAPFRAFMQQRAADEAPGKNWLFFGNPHFTEDFLYQVEWQRYVKDGVLTRIDLAWSRDQKEKVYVQDKLREQGAELWRWINDGAHIYVCGDANRMAKDVEQALLEVIAEFGGMDTEAADEFLSELRVERRYQRDVY >NZ_CP040886.1|WP_000987944.1|3356263_3356629_-|6-carboxytetrahydropterin-synthase-QueD MMSTTLFKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIIDFAELKAAFKPTYERLDHHYLNDIPGLENPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCIYRGE >NZ_CP040886.1|WP_001295150.1|3354914_3356186_-|FAD-dependent-oxidoreductase MEDDCDIIIIGAGIAGTACALRCARAGLSVLLLERAEIPGSKNLSGGRLYTHALAELLPQFHLTAPLERCITHESLSLLTPDGATTFSSLQPGGESWSVLRARFDPWLVAEAEKEGVECIPGATVDALYEENGRVCGVICGDDILRARYVVLAEGANSVLAERHGLVTRPAGEAMALGIKEVLSLETSAIEERFHLENNEGAALLFSGGICDDLPGGAFLYTNQQTLSLGIVCPLSSLTQSRVPASELLTRFKAHPAVRPLIKNTESLEYGAHLVPEGGLHSMPVQYAGNGWLLVGDALRSCVNTGISVRGMDMALTGAQAAAQTLISACQHREPQNLFPLYHHNVERSLLWDVLQRYQHVPALLQRPGWYRTWPALMQDISRDLWDQGDKPVPPLRQLFWHHLRRHGLWHLAGDVIRSLRCL 
>NZ_CP040886.1|WP_000490426.1|3367692_3368730_-|alkaline-phosphatase-isozyme-conversion-aminopeptidase MFSALRHRTAALALGVCFILPVHASSPKPGDFANTQARHIATFFPGRMTGTPAEMLSADYIRQQFQQMGYRSDIRTFNSRYIYTARDNRKSWHNVTGSTVIAAHEGKAPQQIIIMAHLDTYAPLSDADADANLGGLTLQGMDDNAAGLGVMLELAERLKNTPTEYGIRFVATSGEEEGKLGAENLLKRMSDTEKKNTLLVINLDNLIVGDKLYFNSGVKTPEAVRKLTRDRALAIARSHGIAATTNPGLNKNYPKGTGCCNDAEIFDKAGIAVLSVEATNWNLGNKDGYQQRAKTAAFPAGNSWHDVRLDNQQHIDKALPGRIERRCRDVMRIMLPLVKELAKAS >NZ_CP040886.1|WP_000372108.1|3368981_3369890_+|sulfate-adenylyltransferase-subunit-CysD MDQIRLTHLRQLEAESIHIIREVAAEFSNPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYEFRDRTAKAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWHNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMIDDNRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESNAQTLPEIIEEMLVSTTSERQGRVIDRDQAGSMELKKRQGYF >NZ_CP040886.1|WP_001090386.1|3369891_3371319_+|sulfate-adenylyltransferase-subunit-CysN MNTALAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTRQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCELAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYSEKTFTRIREDYLTFAGQLPGNLDIRFVPLSALEGDNVASQSESMAWYSGPTLLEVLETVEIQRVVDAQPMRFPVQYVNRPNLDFRGYAGTLASGRVEVGQRVKVLPSGVESNVARIVTFDGDREEAFAGEAITLVLTDEIDISRGDLLLAADEALPAVQSASVDVVWMAEQPLSPGQSYDIKIAGKKTRARVDGIRYQVDINNLTQREVENLPLNGIGLVDLTFDEPLVLDRYQQNPVTGGLIFIDRLSNVTVGAGMVHEPVSQATAAPSEFSAFELELNALVRRHFPHWGARDLLGDK >NZ_CP040886.1|WP_001173673.1|3371318_3371924_+|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVQQRELHHGHRGVVLWFTGLSGSGKSTVAGALEEALHKLGVSTYLLDGDNVRHGLCSDLGFSDADRKENIRRVGEVANLMVEAGLVVLTAFISPHRAERQMVRERVGEGRFIEVFVDTPLAICEARDPKGLYKKARAGELRNFTGIDSVYEAPESAEIHLNGEQLVTNLVQQLLDLLRQNDIIRS >NZ_CP040886.1|WP_001246104.1|3371973_3372297_+|DUF3561-family-protein MRNSHNITLTNNDSLTEDEETTWSLPGAVVGFISWLFALAMPMLIYGSNTLFFFIYTWPFFLALMPVAVVVGIALHSLMDGKLRYSIVFTLVTVGIMFGALFMWLLG >NZ_CP040886.1|WP_000517476.1|3372490_3372802_+|cell-division-protein-FtsB 
MGKLTLLLLAILVWLQYSLWFGKNGIHDYTRVNDDVAAQQATNAKLKARNDQLFAEIDDLNGGQEALEERARNELSMTRPGETFYRLVPDASKRAQSAGQNNR >NZ_CP040886.1|WP_000246138.1|3372820_3373531_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MATTHLDVCAVVPAAGFGRRMQTECPKQYLSIGNQTILEHSVHALLAHPRVKRVVIAISPGDSRFAQLPLANHPQITVVDGGDERADSVLAGLKAAGDAQWVLVHDAARPCLHQDDLARLLALSETSRTGGILAAPVRDTMKRAEPGKNAIAHTVDRNGLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPQLVEGRADNIKVTRPEDLALAEFYLTRTIHQENT >NZ_CP040886.1|WP_001374730.1|3373530_3374010_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRIPYERGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLIKATK >NZ_CP040886.1|WP_000568943.1|3374006_3375056_+|tRNA-pseudouridine(13)-synthase-TruD MIEFDNLTYLHGKPQGTGLLKANPEDFVVVEDLGFEPDGEGEHILVRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDLSAFQLEGCQVLEYARHKRKLRLGALKGNAFTLVLREVSNRDDVEQRLIDICVKGVPNYFGAQRFGIGGSNLQGAQRWAQTNTPVRDRNKRSFWLSAARSALFNQIVAERLKKADVNQVVDGDALQLAGRGSWFVATTEELAELQRRVNDKELMITAALPGSGEWGTQREALAFEQAAVAAETELQALLVREKVEAARRAMLLYPQQLSWNWWDDVTVEIRFWLPAGSFATSVVRELINTTGDYAHIAE >NZ_CP040886.1|WP_001374723.1|3375036_3375798_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP040886_10 | 3759492-3759598 | Orphan |
NA
Consensus repeat of NZ_CP040886_10
|
1 spacer
spacers of NZ_CP040886_10
>10.1|3759522|47|NZ_CP040886|CRISPRCasFinder ACGGTTGTCCAACGCAAACACCAGTAATGGCGCGGCTCTCAGTGGAG |
CRISPR arrays and Neighbor proteins around NZ_CP040886_10
The CRISPR arrays of NZ_CP040886_10 >merge|NZ_CP040886|10|3759492-3759598|CRISPRCasFinder TCAGGCAAAAAATTGTCCAACGGTTGTCCAACGGTTGTCCAACGCAAACACCAGTAATGGCGCGGCTCTCAGTGGAGATTGTCCAACGGTTGTCCAACGGTTGTCCA >NZ_CP040886|10|9|3759492-3759598|CRISPRCasFinder TCAGGCAAAAAATTGTCCAACGGTTGTCCA ACGGTTGTCCAACGCAAACACCAGTAATGGCGCGGCTCTCAGTGGAG ATTGTCCAACGGTTGTCCAACGGTTGTCCA
>NZ_CP040886.1|WP_001177653.1|3758656_3758935_+|transcriptional-regulator MQLTSTRKKANAITSNILNRIAVRGQRKVADALGINESQISRWKDSFIPKMGMLLAVLEWGVEDEELAELAKKVARMLTKEKAPKNGEFFEA >NZ_CP040886.1|WP_000276886.1|3758362_3758548_+|hypothetical-protein MYKKDVIDHFGTQRAVAKALGISDAAVSQWKEVIPEKDAYRLEVVTAGALKYQESAYRKAA >NZ_CP040886.1|WP_000856967.1|3757631_3758282_-|LexA-family-transcriptional-regulator MKTQLMGERIRARRKELKIRQAALGKMVGVSNVAISQWERSETEPNGENLLALANALKCSPDYLMKGEESLSNIAYHSRHDPRGSYPLISWVSAGCWMEAVEPYHKRAIDNWYDTTVDCSEDSFWLDVKGDSMTAPAGLSIPEGMIILVDPEVEPRNGKLVVAKLEGENEATFKKLVIDAGRKFLKPLNPQYPMIEINGNCKIIGVVVDAKLANLP >NZ_CP040886.1|WP_060504010.1|3756929_3757229_-|hypothetical-protein MTVVITYLADDNARNRRRARRQAQREQAMQEQRLARKIALKLSGCVRADKAASLVSLRCKKADEVERKQNRIYYRKPRSEMGVTCVGRQKMKLGSKPLI >NZ_CP040886.1|WP_000167595.1|3756450_3756921_-|hypothetical-protein MTKSWSVPFPESETEHDGMPVFWRFQATVEEDGIKIFALQYIAFHQTEHYAWLVPAHWIVNFKPAPNQWLQEWKQRRNRYAIKKVAKNAERSFAFPTKKLAIESLLRRKKYHLMRIKQDLAVVSTLVDGMKNIDTSTPDIEYNFGHNQETENWVFY >NZ_CP040886.1|WP_001609782.1|3755995_3756307_-|superinfection-exclusion-protein MKLRVWHIPQVPMKPFIAEVASVEEGVRLMDALADYDAFQYDNNIKPDYCNANGLEMWDESLTDEDLSEMGLTDRWVDWYSECQCYDDPRKYLESLKEETSAA >NZ_CP040886.1|WP_000972063.1|3755785_3755920_-|hypothetical-protein MMHFQLAGSGVMSAFYPHESELSRRVKQLIRAAKKQLEALCAMK >NZ_CP040886.1|WP_001243355.1|3755648_3755801_-|host-cell-division-inhibitory-peptide-Kil MRNEIAINHQMLRAAQNKAVIARFIGDSKMWLEANKAMKSAINLPWYRRK >NZ_CP040886.1|WP_060504008.1|3754686_3755394_-|recombinase MDLNKFDEPFSPEDIEWRIQQSGKTRDGKVWAMVLAYVTNRAIMKRLDDVCGKAGWRNEYRDIPNNGGVECGISIKIDSEWVTKWDAAENTQVEAVKGGRSGAMKRAAVQWGIGRYLYNLEEGFAQTSLDKKQGWHRAKLKDGTGFYWLPPSLPGWAIPASDNKPSPENTNQKSPSVDYEQILKDFSDFASKETDKKKLIERYQHDWQLMAGNEDAQAKCVQVMNIRVNELKQAA >NZ_CP040886.1|WP_024167014.1|3754212_3754686_-|single-stranded-DNA-binding-protein MASRGVNKVIIIGRLGHDPEIRYSPSGTAFANLTVATSEQWRDKQTGEQKEQTEWHRVVMSGKLAEIASEYLRKGSEVYLEGKLRTRKWQDQSGQDRFTTEVIVGVGGTMQMLGGKQGGNEQSSHQRNNGQQQRQQSQQQGNHSEPPMNFDDSDIPF 
>NZ_CP040886.1|WP_001549089.1|3759997_3761434_+|AAA-family-ATPase MTDNFYAPPHSIEAEQAVIGGLLLDDDSSERVQKVLAMLKPDSFYSRPHKILFEEITRMHREQKPVDGLTLFDELERKSLTASVGGFAYIAEIAKNTPSAANIVAYAMQVRETAMERYAINRMTEATELLYSRNGMTATQKYEAIQAIFTQLTDHAKTGSRRGLRSFGEVMEDWVSDLEKRFDPSGEQRGMSTGIPSLDRMLSPKGLVKGSLFVIGARPKMGKTTLYSQMAINCAVHEKKPALMFSLEMPGDQILEKLVGQKSGVNPNIFYLPATNDADDGYQGDYDGDFNRAIETANRLSEIDLLYIDDTPGLSLAQIVSESRRIKREKGCVGMILVDYLTLMTAEKADRNDLAYGMITKGLKNLAKELDCVVVLLTQLNRALESRTNKRPLPSDSRDTGQIEQDCDYWVGIHREGAFDDSVPPGETELILRLNRHGNTGTVYCIQANGAIYDTDQQSAEMRRREREEPQSKKKGGF >NZ_CP040886.1|WP_000796282.1|3761509_3761836_+|hypothetical-protein MADWQIPIIILAGASLVAGFILLKKHKDRDQKVEVLYGYPANSTTWLTIYHYRKSGRWVFEWDDLFAEKRPKSWGDISECMMFEERKSGATREEFNEAWARLSERGYL >NZ_CP040886.1|WP_000049638.1|3761832_3762033_+|hypothetical-protein MSKYEKLDQNILSMLSERPTPVFDIWLKWRSNGMYIETIDRRMQYLRKKGLVANVRGKGWVKINLS >NZ_CP040886.1|WP_060504012.1|3762044_3762311_+|hypothetical-protein MDESRKQFEEWFKNKYHVSSDVMKIMHIKVEIAWEAWQASRAAIEIELQKPKKGPLPGDYHIGYDSGAESQYESDVEAIRAAGVKVKE >NZ_CP040886.1|WP_001515066.1|3762313_3762520_+|hypothetical-protein MTNQQQIEFILEQIRKMREKNQPDMMEIWRRQQEEYRKHIFGERKQDDWSLYGYGTRTNKNGYSLYTY >NZ_CP040886.1|WP_060504015.1|3762528_3762939_+|recombination-protein-NinB MKQTIFLRSKQQQQAAINAILATPLDKDKPVTIRITDYKRNLDQNAKFHAMLADIASQVQWCGKWLKPEQWKVLLISGHAVATKQEADVLRGLEGEFVNIRESSAQMSVKRMASLIEYTTAWAIGQGVRFTDRRYE >NZ_CP040886.1|WP_001254255.1|3762935_3763112_+|NinE-family-protein MRRQRRSITDIICENCKYLPTKRSRNKRKPIPKESDVKTFNYTAHLWDIRWLRHRARK >NZ_CP040886.1|WP_060504017.1|3763108_3764059_+|DNA-cytosine-methyltransferase MTMTAYYNEIDPYAAQWLRNLIDAGEIAPGYVDERSIEDVTPGDLRGFTQHHFFAGIGVWSYALRKAGWPDNKSIWTGSCPCQPFSSAGKGKGVDDERHLWPAFFWLIEKCNPGIVIGEQVASADGLAWLDLVQTDLEGANYTSAGTDICAAGFGSPHIRQRLYWVAYSNDKYQLSARDTQGNSEPIWMRETSGMANSFSERCNRFNALLQRKRQERNPKNLLETSRDGEAMYPLPVNGFWRDADWLYCRDEKYRPVRPGSFPMVNGIAKSLGRGKSTLGRMAKRNQDQRIIGYGNAINAEVATAFVKVCMEVVNA >NZ_CP040886.1|WP_000950963.1|3764051_3764228_+|protein-ninF 
MLSPSQSLQYQKESVERALTCANCGQKLHVLEVHVCEHCCAELMSDPNSSMYEEEDDG >NZ_CP040886.1|WP_060504019.1|3764220_3764832_+|recombination-protein-NinG MAKPARRRCKNGECREWFHPAFANQWWCSPECGTKIALERRSKEREKAEKAAEKKLRREEQKQKDKLKIRKLALKPRSYWIKQAQQAVNAFIRERDRDLPCISCGTLTSAQWDAGHYRTTAAAPQLRFDERNIHKQCVVCNQHKSGNLVPYRVELISRIGQEAVDEIESNHNRHRWTIEECKAIKAEYQQKLKDLRNSRSEAA |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP040886_11 | 3909722-3909839 | Orphan |
NA
Consensus repeat of NZ_CP040886_11
|
1 spacer
spacers of NZ_CP040886_11
>11.1|3909753|56|NZ_CP040886|CRISPRCasFinder TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA |
CRISPR arrays and Neighbor proteins around NZ_CP040886_11
The CRISPR arrays of NZ_CP040886_11 >merge|NZ_CP040886|11|3909722-3909839|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGCTGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAACCGAGCCGTAGGCCGGATAAGGCGTTTACGC >NZ_CP040886|11|10|3909722-3909839|CRISPRCasFinder CCGAGCCGTAGGCCGGATAAGGCGTTCACGC TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA CCGAGCCGTAGGCCGGATAAGGCGTTTACGC
>NZ_CP040886.1|WP_000332037.1|3908494_3909625_-|ribonucleotide-diphosphate-reductase-subunit-beta MAYTTFSQTKNDQLKEPMFFGQPVNVARYDQQKYDIFEKLIEKQLSFFWRPEEVDVSRDRIDYQALPEHEKHIFISNLKYQTLLDSIQGRSPNVALLPLISIPELETWVETWAFSETIHSRSYTHIIRNIVNDPSVVFDDIVTNEQIQKRAEGISSYYDELIEMTSYWHLLGEGTHTVNGKTVTVSLRELKKKLYLCLMSVNALEAIRFYVSFACSFAFAERELMEGNAKIIRLIARDEALHLTGTQHMLNLLRSGADDPEMAEIAEECKQECYDLFVQAAQQEKDWADYLFRDGSMIGLNKDILCQYVEYITNIRMQAVGLDLPFQTRSNPIPWINTWLVSDNVQVAPQEVEVSSYLVGQIDSEVDTDDLSNFQL >NZ_CP040886.1|WP_000135040.1|3908240_3908495_-|ferredoxin-like-diferric-tyrosyl-radical-cofactor-maintenance-protein-YfaE MARVTLRITGTQLLCQDEHPSLLAALESHNVAVEYQCREGYCGSCRTRLVAGQVDWIAEPLAFIQPGEILPCCCRAKGDIEIEM >NZ_CP040886.1|WP_000301049.1|3907536_3908187_+|lipopolysaccharide-kinase-InaA MAVSAKYDEFNHWWATEGDWVEEPNYRRNGMSGVQCVERNGKKLYVKRMTHHLFHSVRYPFGRPTIVREVAVIKELERAGVIVPKIVFGEAVKIEGEWRALLVTEDMAGFISIADWYAQHAVSPYSDEVRQAMLKAVALAFKKMHSINRQHGCCYVRHIYVKTEGKAEAGFLDLEKSRRRLRRDKAINHDFRQLEKYLEPIPKADWEQVKAYYYAM >NZ_CP040886.1|WP_072163405.1|3904802_3905117_-|hypothetical-protein MTNKLGGELIDIADKKLAPLINDSFSYTRDFFAYSKQENNIFTFDNSKFVDPKEKEGLMIQHSNGQLVITGKYCPEGVQTAFTQEQYDKLIRYINIFFTFPKCE >NZ_CP040886.1|WP_000768974.1|3903507_3904584_+|glycerophosphodiester-phosphodiesterase MKLKLKNLSMAIMMSTIVMGSSAMAADSNEKIVIAHRGASGYLPEHTLPAKAMAYAQGADYLEQDLVMTKDDHLVVLHDHYLDRVTDVADRFPDRARKDGRYYAIDFTLDEIKSLKFTEGFDIENGKKVQTYPGRFPMGKSDFRVHTFEEEIEFVQGLNHSTGKNIGIYPEIKAPWFHHQEGKDIAAKTLEVLKKYGYTGKDDKVYLQCFDADELKRIKNELEPKMGMDLNLVQLIAYTDWNETQQKQPDGSWVNYSYDWMFKPGAMKQVAEYADGIGPDYHMLIEETSQPGNIKLTGMVQDAQQNKLVVHPYTVRSDKLPEYTTDVNQLYDVLYNKAGVNGLFTDFPDKAVKFLNKE >NZ_CP040886.1|WP_000948732.1|3902144_3903503_+|glycerol-3-phosphate-transporter 
MLSIFKPAPHKARLPAAEIDPTYRRLRWQIFLGIFFGYAAYYLVRKNFALAMPYLVEQGFSRGDLGFALSGISIAYGFSKFIMGSVSDRSNPRVFLPAGLILAAAVMLFMGFVPWATSSIAVMFVLLFLCGWFQGMGWPPCGRTMVHWWSQKERGGIVSVWNCAHNVGGGIPPLLFLLGMAWFNDWHAALYMPAFCAILVALFAFAMMRDTPQSCGLPPIEEYKNDYPDDYNEKAEQELTAKQIFMQYVLPNKLLWYIAIANVFVYLLRYGILDWSPTYLKEVKHFALDKSSWAYFLYEYAGIPGTLLCGWMSDKVFRGNRGATGVFFMTLVTIATIVYWMNPAGNPTVDMICMIVIGFLIYGPVMLIGLHALELAPKKAAGTAAGFTGLFGYLGGSVAASAIVGYTVDFFGWDGGFMVMIGGSILAVILLIVVMIGEKRRHEQLLQKRNGG >NZ_CP040886.1|WP_000857251.1|3900243_3901872_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-A MKTRDSQSSDVIIIGGGATGAGIARDCALRGLRVILVERHDIATGATGRNHGLLHSGARYAVTDAESARECISENQILKRIARHCVEPTNGLFITLPEDDLSFQATFIRACEEAGISAEAIDPQQARIIEPAVNPALIGAVKVPDGTVDPFRLTAANMLDAKEHGAVILTAHEVTGLIREGATVCGVRVRNHLTGETQALHAPVVVNAAGIWGQHIAEYADLRIRMFPAKGSLLIMDHRINQHVINRCRKPSDADILVPGDTISLIGTTSLRIDYNEIDDNRVTAEEVDILLREGEKLAPVMAKTRILRAYSGVRPLVASDDDPSGRNVSRGIVLLDHAERDGLDGFITITGGKLMTYRLMAEWATDAVCRKLGNTRPCTTADLALPGSQDPAEVTLRKVISLPAPLRGSAVYRHGDRTPAWLSEGRLHRSLVCECEAVTAGEVQYAVENLNVNSLLDLRRRTRVGMGTCQGELCACRAAGLLQRFNVTTSAQSIEQLSTFLNERWKGVQPIAWGDALRESEFTRWVYQGLCGLEKEQKDAL >NZ_CP040886.1|WP_001209908.1|3898994_3900254_-|glycerol-3-phosphate-dehydrogenase-subunit-GlpB MRFDTVIMGGGLAGLLCGLQLQKHGLRCAIVTRGQSALHFSSGSLDLLSHLPDGQPVADIHSGLESLRQQAPAHPYSLLGPQRVLDLACQAQALIAESGAQLQGSVELAHQRITPLGTLRSTWLSSPEVPVWPLPAKKICVVGISGLMDFQAHLAAASLRELDLSVETAEIELPELDVLRNNATEFRAVNIARFLDNEENWPLLLDALIPVANTCEMILMPACFGLADDKLWRWLNEKLPCSLMLLPTLPPSVLGIRLQNQLQRQFVRQGGVWMPGDEVKKVTCKNGVVNEIWTRNHADIPLRPRFAVLASGSFFSGGLVAERNGIREPILGLDVLQTATRGEWYKGDFFAPQPWQQFGVTTDETLRPSQAGQTIENLFAIGSVLGGFDPIAQGCGGGVCAVSALHAAQQIAQRAGGQQ >NZ_CP040886.1|WP_001000370.1|3897807_3898998_-|anaerobic-glycerol-3-phosphate-dehydrogenase-subunit-C 
MNDTSFENCIKCTVCTTACPVSRVNPGYPGPKQAGPDGERLRLKDGALYDEALKYCINCKRCEVACPSDVKIGDIIQRARAKYDTTRPSLRNFVLSHTDLMGSVSTPFAPIVNTATSLKPVRQLLDAALKIDHRRTLPKYSFGTFRRWYRSVAAQQAQYKDQVAFFHGCFVNYNHPQLGKDLIKVLNAMGTGVQLLSKEKCCGVPLIANGFTAKARKQAITNVESIREAVGVKGIPVIATSSTCTFALRDEYPEVLNVDNKGLRDHIELATRWLWRKLDEGKTLPLKPLPLKVVYHTPCHMEKMGWTLYTLELLRKIPGLELTVLDSQCCGIAGTYGFKKENYPTSQAIGAPLFRQIEESGADLVVTDCETCKWQIEMSTSLRCEHPITLLAQALA >NZ_CP040886.1|WP_001374259.1|3896715_3897615_-|ISNCY-family-transposase MTESTTSSPHDAVFKTFMFTPETARDFLEIHLPEPLRKLCNLQTLRLEPTSFIEKSLRAYYSDVLWSVETSDGDGYIYCVIEHQSSAEKNMAFRLMRYATAAMQRHLDKGYDRVPLVVPLLFYHGETSPYPYSLNWLDEFDDPQLARQLYTEAFPLVDITIVPDDEIMQHRRIALLELIQKHIRDRDLIGMVDRITTLLVKGFTNDSQLQTLFNYLLQCGDTSRFTRFIEEIAKRSPLQKERLMTIAERLRQEGHQIGWQEGMHEQAIKIALRMLEQGFEREIVLATTQLTDADIPNCH >NZ_CP040886.1|WP_001075164.1|3909858_3912144_-|ribonucleoside-diphosphate-reductase-subunit-alpha MNQNLLVTKRDGSTERINLDKIHRVLDWAAEGLHNVSISQVELRSHIQFYDGIKTSDIHETIIKAAADLISRDAPDYQYLAARLAIFHLRKKAYGQFEPPALYDHVVKMVEMGKYDNHLLEDYTEEEFKQMDTFIDHDRDMTFSYAAVKQLEGKYLVQNRVTGEIYESAQFLYILVAACLFSNYPRETRLQYVKRFYDAVSTFKISLPTPIMSGVRTPTRQFSSCVLIECGDSLDSINATSSAIVKYVSQRAGIGINAGRIRALGSPIRGGEAFHTGCIPFYKHFQTAVKSCSQGGVRGGAATLFYPMWHLEVESLLVLKNNRGVEGNRVRHMDYGVQINKLMYTRLLKGEDITLFSPSDVPGLYDAFFADQEEFERLYTKYEKDDSIRKQRVKAVELFSLMMQERASTGRIYIQNVDHCNTHSPFDPAIAPVRQSNLCLEIALPTKPLNDVNDENGEIALCTLSAFNLGAINNLDELEELAILAVRALDALLDYQDYPIPAAKRGAMGRRTLGIGVINFAYYLAKHGKRYSDGSANNLTHKTFEAIQYYLLKASNELAKEQGACPWFNETTYAKGILPIDTYKKDLDTIANEPLHYDWEALRESIKTHGLRNSTLSALMPSETSSQISNATNGIEPPRGYVSIKASKDGILRQVVPDYEHLHDAYELLWEMPGNDGYLQLVGIMQKFIDQSISANTNYDPSRFPSGKVPMQQLLKDLLTAYKFGVKTLYYQNTRDGAEDAQDDLVPSIQDDGCESGACKI >NZ_CP040886.1|WP_001220074.1|3912839_3916592_+|AIDA-I-family-autotransporter-adhesin-YfaL/EhaC 
MRIIFLRKEYLSLLPSMIASLFSANGVAAVTDSCQGYDVKASCQASRQSLSGITQDWSIADGQWLVFSDMTNNASGGAVFLQQGAEFSLLPENETGMTLFANNTVTGEYNNGGAIFAKENSTLNLTDVIFSGNVAGGYGGAIYSSGTNDTGAVDLRVTNAMFRNNIANDGKGGAIYTINNDVYLSDVIFDNNQAYTSTSYSDGDGGAIDVTDNNSDSKHPSGYTIVNNTAFTNNTAEGYGGAIYTNSVTAPYLIDISVDDSYSQNGGVLVDENNSAAGYGDGPSSAAGGFMYLGLSEVTFDIADGKTLVIGNTENDGAVDSIAGTGLITKTGSGDLVLNADNNDFTGEMQIENGEVTLGRSNSLMNVGDTHCQDDPQDCYGLTIGSIDQYQNQAELNVGSTQQTFVHALTGFQNGTLNIDAGGNVTVNQGSFAGIIEGAGQLTIAQNGSYVLAGAQSMALTGDIVVDDGAVLSLEGDAADLTALQDDPQSIVLNGGVLDLSDFSTWQSGTSYNDGLEVSGSSGTVIGSQDVVDLAGGDNLHIGGDGKDGVYVVVDASDGQVSLANNNSYLGTTQIASGTLMVSDNSQLGDTHYNRQVIFTDKQQESVMEITSDVDTRSDAAGHGRDIEMRADGEVAVDAGVDTQWGALMADSSGQHQDEGSTLTKTGAGTLELTASGTTQSAVRVEEGTLKGDVADILPYASSLWVGDGATFVTGADQDIQSIDAISSGTIDISDGTVLRLTGQDTSVALNASLFNGDGTLVNATDGVTLTGELNTNLETDSLTYLSNVTVNGNLTNTSGAVSLQNGVAGDTLTVNGDYTGGGTLLLDSELNGDDSVSDQLVMNGNTAGNTTVVVNSITGIGEPTSTGIKMVDFAADPTQFQNNAQFSLAGSGYVNMGAYDYTLVEDNNDWYLRSQEVTPPSPPDPDPTPDPDPTPDPDPTPDPEPTPAYQPVLNAKVGGYLNNLRAANQAFMMERRDHAGGDGQTLNLRVIGGDYHYTAAGQLAQHEDTSTVQLSGDLFSGRWGTDGEWMLGIVGGYSDNQGDSRSNMTGTRADNQNHGYAVGLTSSWFQHGNQKQGAWLDSWLQYAWFSNDVSEQEDGTDHYHSSGIIASLEAGYQWLPGRGVVIEPQAQVIYQGVQQDDFTAANRARVSQSQGDDIQTRLGLHSEWRTAVHVIPTLDLNYYHDPHSTEIEEDGSTISDDAVKQRGEIKVGVTGNISQRVSLRGSVAWQKGSDDFAQTAGFLSMTVKW >NZ_CP040886.1|WP_000990756.1|3916719_3917442_-|bifunctional-2-polyprenyl-6-hydroxyphenol-methylase/3-demethylubiquinol-3-O-methyltransferase-UbiG MNAEKSPENHNVDHEEIAKFEAVASRWWDLEGEFKPLHRINPLRLGYIAERAGGLFGKKVLDVGCGGGILAESMAREGATVTGLDMGFEPLQVAKLHALESGIQVDYVQETVEKHAAKHAGQYDVVTCMEMLEHVPDPQSVVRACAQLVKPGGDVFFSTLNRNGKSWLMAVVGAEYILRMVPKGTHDVKKFIKPAELLGWVDQTSLKERHITGLHYNPITNSFKLGPGVDVNYMLHTQNK >NZ_CP040886.1|WP_001281225.1|3917588_3920216_+|DNA-topoisomerase-(ATP-hydrolyzing)-subunit-A 
MSDLAREITPVNIEEELKSSYLDYAMSVIVGRALPDVRDGLKPVHRRVLYAMNVLGNDWNKAYKKSARVVGDVIGKYHPHGDLAVYNTIVRMAQPFSLRYMLVDGQGNFGSIDGDSAAAMRYTEIRLAKIAHELMADLEKETVDFVDNYDGTEKIPDVMPTKIPNLLVNGSSGIAVGMATNIPPHNLTEVINGCLAYIDDEDISIEGLMEHIPGPDFPTAAIINGRRGIEEAYRTGRGKVYIRARAEVEVDAKTGRETIIVHEIPYQVNKARLIEKIAELVKEKRVEGISALRDESDKDGMRIVIEVKRDAVGEVVLNNLYSQTQLQVSFGINMVALHHGQPKIMNLKDIIAAFVRHRREVVTRRTIFELRKARDRAHILEALAVALANIDPIIELIRHAPTPAEAKTALVANPWQLGNVAAMLERAGDDAARPEWLEPEFGVRDGLYYLTEQQAQAILDLRLQKLTGLEHEKLLDEYKELLDQIAELLRILGSADRLMEVIREELELVREQFGDKRRTEITANSADINLEDLITQEDVVVTLSHQGYVKYQPLSEYEAQRRGGKGKSAARIKEEDFIDRLLVANTHDHILCFSSRGRVYSMKVYQLPEATRGARGRPIVNLLPLEQDERITAILPVTEFEEGVKVFMATANGTVKKTVLTEFNRLRTAGKVAIKLVDGDELIGVDLTSGEDEVMLFSAEGKVVRFKESSVRAMGCNTTGVRGIRLGEGDKVVSLIVPRGDGAILTATQNGYGKRTAVAEYPTKSRATKGVISIKVTERNGLVVGAVQVDDCDQIMMITDAGTLVRTRVSEISIVGRNTQGVILIRTAEDENVVGLQRVAEPVDEEDLDTIDGSAAEGDDEIAPEVDVDDEPEEE >NZ_CP040886.1|WP_000012305.1|3920364_3922053_+|DUF2138-domain-containing-protein MSGEKKAKGWRFYGLVGFGAIALLSAGVWALQYAGSGPEKTLSPLVVHNNLQIDLNEPDLFLDSDSLSQLPKDLLTIPFLHDVLSEDFVFYYQNHADRLGIEGSIRRIVYEHDLTLKDKLFSSLLDQPAQAALWHDKQGHLSHYMVLIQRSGLSKLLEPLLFAATSDSQLSKTEISSIKINSETVPVYQLRYNGNNALMFATYQDKMLVFSSTDMLFKDDQQDTEATAIAGDLLSGKKRWQASFGLEERTAEKTPVRQRIVVSARWLGFGYQRLMPSFAGVRFEMGNDGWHSFVALNDESASVDASFDFTPVWNSMPAGASFCVAVPYSHGIAEEMLSHISQENDKLNGALDGAAGLCWYEDSKLQTPLFVGQFDGTAEQAQLPGKLFTQNIGAHESKAPEGVLPVSQTQQGEAQIWRREVSSRYGQYPKAQAAQPDQLMSDYFFRVSLAMQNKTLLFSLDDTLVNNALQTLNKTRPAMVDVIPTDGIVPLYINPQGIAKLLRNETLTSLPKNLEPVFYNAAQTLLMPKLDALSQQPRYVMKLAQMEPGAAWQWLPITWQPL >NZ_CP040886.1|WP_001295211.1|3922049_3922673_+|DUF1175-domain-containing-protein MRHGLLALICWLCCVVAHSEMLNVEQSGLFRAWFVRIAQEQLRQGPSPRWYQQDCAGLVRFAANETLKVHDSKWLKSNGLSSQYLPPEMTLTPEQRQLAQNWNQGNGKTGPYVTAINLIQYNSQFIGQDINQALPGDMIFFDQGDAQHLMVWMGRYVIYHTGSATKTDNGMRAVSLQQLMTWKDTRWIPNDSNPNFIGIYRLNFLAR >NZ_CP040886.1|WP_122633159.1|3922816_3927211_+|alpha-2-macroglobulin-family-protein 
MRLEAPGRDYRRYQMEEYGGVDVRLYRIPDPMAFLRQQKNLHRIVVQPQYLGDGLNNTLTWLWDNWYGKSRRVMQRTFSSQSRQNVTQALPELQLGNAIIKPSRYVQNNQFSPLKKYPLVEQFRYPLWQAKPFEPQQGVKLEGASSNFISPQPGNIYIPLGQQEPGLYLVEAMVGGYRATTVVFVSDTVALSKVSGNELLVWTAGKKQGEAKPGSEILWTDGLGVMTRGVTDDSGTLQLQHISPERSYILGKDAEGGVFVSENFFYESEIYNTRLYIFTDRPLYRAGDRVDVKVMGREFHDPLHSSPIVSAPAKLSVLDANGSLLQTVDVTLDARNGGQGSFRLPENAVAGGYELRLAYRNQVYSSSFRVANYIKPHFEIGLALAKKEFKTGEAVSGKLQLLYPDGEPVKNARVQLSLRAQQLSMVGNDLRYAGRFPVSLEGSETVSDASGHVALNLPAADKPSRYLLTVSASDGAAYRVTTTKEILIERGLAHYSLSTAAQYSNSGESVVFRYAALESSKQVPVTYEWLRLEDRTSHSGELPSGGKSFTVNFAKPGNYNLTLRDKDGLILAGLSHAVSGKGSTAHTGTVDIVADKTLYQPGETAKMLITFPEPIDEALLTLERDRVEQQSLLSHPANWLTLQRLNDTQYEARVPVSNSFAPNITFSVLYTRNGQYSFQNAGIKVAVPQLDIRVKTDKTHYQPGELVNVELTSSLKGKPVSAQLTVGVVDEMIYALQPEIAPNIGKFFYPLGRNNVRTSSSLSFISYDQALSSEPVAPGATNRSERRVKMLERPRREEVDTAAWMPSLTTDKQGKAYFTFLMPDSLTRWRITARGMNGDGLVGQGRAYLRSEKNLYMKWSMPTVYRVGDKPAAGLFIFSQQDNEPVALVTKFAGAEMRQTLTLHKGANYISLTQNIQQSGLLSAELQQNGQVQDSISTKLSFVDNSWPVEQQKNVMLGGGDNALMLPEQASNIRLQSSETPQEIFRNNLDALVDEPWGGVINTGSRLIPLSLAWRSLADHQSAAANDIRQMIQVNRLRLMQLAGPGARFTWWGEDGNGDAFLTAWAWYADWQASQAIGVTQQPEYWQHMLDSYAEQADNMPLLHRALVLAWAQEMNLPCKTLLKGLDEAIARRGTKTEDFSEEDTRDINDSLILDTPESPLADAVANVLTMTLLKKAQLKSTVMPQVQQYAWDKAANSNQPLAHTVVLLNSGGDATQAAAILSGLTAEQSTIERALAMNWLAKYMATMPPVVLPAPAGAWAKHKLTGGGEYWRWVGQGVPDILSFGDELSPQNVQVRWREPAKTAQQSNIPVTVERQLYRLITGEEEMSFTLQPVTSNEIDSDALYLDEITLTSEQDAVLRYGQVEVPLPPGADVERTTWGISVNKPNAAKQQGQLLEIARNEMGELAYMVPVKELTGTVTFRHLLRFSQKGQFVLPPARYMRSYAPAQQSVAAGSEWTRMQVK >NZ_CP040886.1|WP_001104488.1|3927211_3928861_+|DUF2300-domain-containing-protein 
MNWRRIVWLLALVTLPTLAEEPPLQLALRGAQHDQLYKLSSSGVTNVSTLPDTLTTPLGSLWKLYVYAWLEDTHQPEQPYQCRGNSPEEVYCCQAGESITRDTALVRSCGLYFAPQRLHIGADVWGQYWQQRQAPAWLASLTTLKPETSVTVKSLLDSLATLPAQNKAQEVLLDVVLDEAKIGVASMLGSRVRVKTWSWFADDKQEIRQGGFAGWLTDGTPLWVTGSGTSKTVLTRYATVLNRVLPVPTQVASGQCVEVELFARYPLKKITAEKSTTAVKPGVLNGRYRVTFTNGNHITFVSHGETTLLSEKGKLKLQSHLDREEYVARVLDREAKSTPPEAAKAMTVAIRTFLQQNANREGDCLTIPDSSATQRVSASPATTGARTMAAWTQDLIYAGDPVHYHGSRATEGTLSWRQATAQAGQGERYDQILAFAYPDNSLSRWGAPRSTCQLLPKAKAWLAKKMPQWRRILQAETGYNEPDVFAVCRLVSGFPYTDRQQKRLFIRNFFTLQDRLDLTHEYLHLAFDGYPTGLDENYIETLTRQLLMD >NZ_CP040886.1|WP_001567753.1|3928865_3929642_+|YfaP-family-protein MRKIFLPLLLVALSPVAHSEGVQEVEIDAPLSGWHPVEGEDASFSQSINYPASSVNMADDQNISAQIRGKIKNYAAAGKVQQGRLVVNGASMPQRIESDGSFARPYIFTEGSNSVQVISPDGQSRQKMQFYSTPGTGTIRARLRLVLSWDTDNTDLDLHVVTPDGEHAWYGNTVLKNSGALDMDVTTGYGPEIFAMPAPVHGRYQVYINYYGGRSETELTTAQLTLITDEGSVNEKQETFIVPMRNAGELTLVKSFDW >NZ_CP040886.1|WP_000786548.1|3929715_3930900_-|acetyl-CoA-C-acetyltransferase MKNCVIVSAVRTAIGSFNGSLASTSAIDLGATVIKAAIERAKIDSQHVDEVIMGNVLQAGLGQNPARQALLKSGLAETVCGFTVNKVCGSGLKSVALAAQAIQAGQAQSIVAGGMENMSLAPYLLDAKARSGYRLGDGQVYDVILRDGLMCATHGYHMGITAENVAKEYGITREMQDELALHSQRKAAAAIESGAFTAEIVPVNVVTRKKTFVFSQDEFPKANSTAEALGALRPAFDKAGTVTAGNASGINDGAAALVIMEESAALAAGLTPLARIKSYASGGVPPALMGMGPVPATQKALQLAGLQLADIDLIEANEAFAAQFLAVGKTLGFDPEKVNVNGGAIALGHPIGASGARILVTLLHAMQARDKTLGLATLCIGGGQGIAMVIERLN |
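The decomposed array record for NZ_CP040886_11 above lists its segments in the order repeat, spacer, repeat, and the spacer start reported in the spacer header (3909753) matches the array start plus the length of the first repeat. The following is a minimal sketch of that bookkeeping, assuming 1-based inclusive coordinates; the function name `locate_segments` and the variable `segments_11` are hypothetical and not part of the database output.

```python
def locate_segments(array_start, segments):
    """Assign 1-based inclusive coordinates to alternating repeat/spacer segments.

    Assumes the first segment is a repeat and that segments alternate
    repeat, spacer, repeat, ... with no gaps between them.
    """
    located = []
    pos = array_start
    for index, seq in enumerate(segments):
        kind = "repeat" if index % 2 == 0 else "spacer"
        located.append((kind, pos, pos + len(seq) - 1, seq))
        pos += len(seq)
    return located


# Segments copied from the NZ_CP040886_11 array record (3909722-3909839).
segments_11 = [
    "CCGAGCCGTAGGCCGGATAAGGCGTTCACGC",
    "TGCATCCGGCACCCGGAGCCTGATGCGACGCTGGCGCGTCTTATCAGGCCTACAAA",
    "CCGAGCCGTAGGCCGGATAAGGCGTTTACGC",
]

for kind, start, end, seq in locate_segments(3909722, segments_11):
    print(kind, start, end, len(seq))
```

Under these assumptions the last segment should end at the array end coordinate, which gives a quick consistency check on a parsed record.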
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP040886_12 | 4600781-4600904 | Orphan |
NA
Consensus repeat of NZ_CP040886_12
|
1 spacer
spacers of NZ_CP040886_12
>12.1|4600824|38|NZ_CP040886|CRISPRCasFinder CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA |
CRISPR arrays and Neighbor proteins around NZ_CP040886_12
The CRISPR arrays of NZ_CP040886_12 >merge|NZ_CP040886|12|4600781-4600904|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTACGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAACGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA >NZ_CP040886|12|11|4600781-4600904|CRISPRCasFinder CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA CGGACGCAGGATGGTGCGTTCAATTGGACTCGAACCAA CGACCCCCACCATGTCAAGGTGGTGCTCTAACCAACTGAGCTA
>NZ_CP040886.1|WP_000212657.1|4600340_4600646_-|monooxygenase MATLLQLHFAFNGPFGDAMAEQLKPLAESINQEPGFLWKVWTESEKNHEAGGIYLFTDEKSALAYLEKHTARLKNLGVEEVVAKVFDVNEPLSQINQAKLA >NZ_CP040886.1|WP_000716929.1|4598610_4600215_-|FAD-NAD(P)-binding-protein MKKIAIVGAGPTGIYTLFSLLQQQTPLSISIFEQADEAGVGMPYSDEENSKMMLANIASIEIPPINCTYLEWLQKQEASHLQRYGVKKETLHDRQFLPRILLGEYFRDQFLRLVDQARQQKFAVAVYESCQVTDLQITNAGVMLATNQDLPSETFDLVVIATGHVWPDEEEATRTYFPSPWSGLMEAKVDACNVGIMGTSLSGLDAAMAVAIQHGSFIEDDKQHVVFNRDNASEKLNITLMSRTGILPEADFYCPIPYEPLHIVTDQALNAEIQKGEEGLLDRVFRLIVEEIKFADPDWSQRIALESLNVDSFAQAWFAERKQRDPFDWAEKNLQEVERNKREKHTVPWRYVILRLHEAVQEIVPHLNEHDHKRFSKGLARVFIDNYAAIPSESIRRLLALREAGIIHILALGEDYEMEINESRTVLKTEDNSYSFDVFIDARGQRPLKVKDIPFPGLREQLQKTGDEIPDVGEDYTLQQPEDIRGRVAFGALPWLMHDQPFVQGLTACAEIGEAMARAVVKPASRARRRLSFD >NZ_CP040886.1|WP_000587555.1|4597786_4598599_+|hypothetical-protein MIITRADLREWRIGAVMYRWFLRHFPRGGSYADIHHALIEEGYTDWAESLVEYAWKKWLADENFAHQEVSSMQKLATDPGEIPFCSQFARSDDHARIGCCEDNARIATAGYAAQIASMGYSVRIGSVGFNSHIGSSGERARVAVTGNSSRISSAGDSSRIANTGMRVRVCTLGERCHVASNGDLAQIASFGANARIANSGDNVHIIASGENSTVVSTGVVDSIILGPGGSAALAYHDGERVRFAVAIEGENNIRAGVRYRLNEQHQFVEC >NZ_CP040886.1|WP_001069997.1|4596997_4597783_+|thiosulfate-reductase-cytochrome-B-subunit MNPSQHAEQFQSQLANYVPQFTPEFWPVWLIIAGVLLVGMWLVLGLHALLRARGVKKSVTDYGEKIYLYCKAVRLWHWSNALLFVLLLASGLINHFALVGATAVKSLVAVHEVCGFLLLACWLGFVLINAVGGNGHHYRIRRQGWLERAAKQTRFYLFGIMQGEEHPFPATTQSKFNPLQQVAYVGVMYGLLPLLLLTGLLCLYPQAVGDVFPGVRYWLLQAHFALAFISLFFIFGHLYLCTTGRTPHETFKSMVDGYHRH >NZ_CP040886.1|WP_001310861.1|4596332_4597001_+|4Fe-4S-dicluster-domain-containing-protein MSFTRRKFVLGMGTVIFFTGSASSLLANTRQEKEVRYAMIHDESRCNGCNICARACRKTNHVPAQGSRLSIAHIPVTDNDNETQYHFFRQSCQHCEDAPCIDVCPTGASWRDEQGIVRVEKSQCIGCSYCIGACPYQVRYLNPVTKVADKCDFCAESRLAKGFPPICVSACPEHALIFGREDSPEIQAWLQQNKYYQYQLPGAGKPHLYRRFGQHLIKKENV >NZ_CP040886.1|WP_000504352.1|4595621_4596269_+|YdhW-family-putative-oxidoreductase-system-protein 
MGEMNHRDELPLAKVSEVDEAKRQWLQGMRHPVDTVTEPEPAEILAEFIRQHSAAGQLVARAVFLSPPYSVAEEELSVLLESIKQNGDYADIACMTGSQDDYYYSTQAMSENYAAMSLQVVEQDICRAIAHAVRFECQTYPRPYKVAMLMQAPYYFQEAQIEAAIAAMDVAPEYADIRQVESSTAVLYLFSERFMTYGKAYGLCEWFEVEQFQNP >NZ_CP040886.1|WP_001678907.1|4593515_4595618_+|aldehyde-ferredoxin-oxidoreductase MANGWTGNILRVNLTTGNITLEDSSKFKSFVGGMGFGYKIMYDEVPPGTKPFDEANKLVFATGPLTGSGAPCSSRVNITSLSTFTKGNLVVDAHMGGFFAAQMKFAGYDVIIIEGKAKSPVWLKIKDDKVSLEKADFLWGKGTRATTEEICRLTSPETCVAAIGQAGENLVPLSGMLNSRNHSGGAGTGAIMGSKNLKAIAVEGTKGVNIADRQEMKRLNDYMMTELIGANNNHVVPSTPQSWAEYSDPKSRWTARKELFWGAAEGGPIETGEIPPGNQNTVGFRTYKSVFDLGPAAEKYTVKMSGCHSCPIRCMTQMNIPRVKEFGVPSTGGNTCVANFVHTTIFPNGPKDFEDKDDGRVIGNLVGLNLFDDYGLWCNYGQLHRDFTYCYSKGVFKRVLPAEEYAEIHWDQLEAGDVNFIKDFYYRLAHRVGELSHLADGSYAIAERWNLGEEYWGYAKNKLWSPFGYPVHHANEASAQVGSIVNCMFNRDCMTHTHINFIGSGLPLKLQREVAKELFGSEDAYDETKNYTPINDAKIKYAKWSLLRVCLHNAVTLCNWVWPMTVSPLKSRNYRGDLALEAKFFKAITGEEMTQEKLDLAAERIFTLHRAYTVKLMQTKDMRNEHDLICSWVFDKDPQIPVFTEGTDKMDRDDMHASLTMFYKEMGWDPQLGCPTRETLQRLGLEDIAADLAAHNLLPV >NZ_CP040886.1|WP_001070230.1|4592868_4593495_+|ferredoxin-like-protein MNPVDRPLLDIGLTRLEFLRISGKGLAGLTIAPALLSLLGCKQEDIDSGTVGLINTPKGVLVTQRARCTGCHRCEISCTNFNDGSVGTFFSRIKIHRNYFFGDNGVGSGGGLYGDLNYTADTCRQCKEPQCMNVCPIGAITWQQKEGCITVDHKRCIGCSACTTACPWMMATVNTESKKSSKCVLCGECANACPTGALKIIEWKDITV >NZ_CP040886.1|WP_000528342.1|4592203_4592413_+|fumarate-hydratase-FumD MGNRTKEDELYREMCRVVGKVVLEMRDLGQEPKHIVIAGVLRTALANKRIQRSELEKQAMETVINALVK >NZ_CP040886.1|WP_001295403.1|4590235_4591648_-|pyruvate-kinase-PykF MKKTKIVCTIGPKTESEEMLAKMLDAGMNVMRLNFSHGDYAEHGQRIQNLRNVMSKTGKTAAILLDTKGPEIRTMKLEGGNDVSLKAGQTFTFTTDKSVIGNSEMVAVTYEGFTTDLSVGNTVLVDDGLIGMEVTAIEGNKVICKVLNNGDLGENKGVNLPGVSIALPALAEKDKQDLIFGCEQGVDFVAASFIRKRSDVIEIREHLKAHGGENIHIISKIENQEGLNNFDEILEASDGIMVARGDLGVEIPVEEVIFAQKMMIEKCIRARKVVITATQMLDSMIKNPRPTRAEAGDVANAILDGTDAVMLSGESAKGKYPLEAVSIMATICERTDRVMNSRLEFNNDNRKLRITEAVCRGAVETAEKLDAPLIVVATQGGKSARAVRKYFPDATILALTTNEKTAHQLVLSKGVVPQLVKEITSTDDFYRLGKELALQSGLAHKGDVVVMVSGALVPSGTTNTASVHVL 
>NZ_CP040886.1|WP_000534291.1|4601218_4602475_+|hypothetical-protein MGSDAKNLMSDGNVQIVKTGEVIGATQLTEGELIVEAGGRAENTVVTGAGWLKVATGGIAKCTQYGNNGTLSVSDGAIATDIVQSEGGAISLSTLATVNGRHPEGEFSVDQGYACGLLLENGGNLRVLEGHRAEKIILDQEGGLLVNGTTSAVVVDEGGELLVYPGGEASNCEINQGGVFMLAGKASDTLLAGGTMNNLGGEDSDTIVENGSIYRLGTDGLQLYSSGKTQNLSVNVGGRAEVHAGTLENAVIQGGTVILLSPTSADENFVVEEDRAPVELTGSVALLDGASMIIGYGADLQQSTITVQQGGVLILDGSTVKGDGVTFIVGNINLNGGKLWLITGAATHVQLKVKRLRGEGAICLQTSAKEISPDFINVKGEVTGDIHVEITDASRQTLCNALKLQPDEDGIGATLQPA >NZ_CP040886.1|WP_001174942.1|4602515_4603889_-|multidrug-efflux-MATE-transporter-MdtK MQKYISEARLLLALAIPVILAQIAQTAMGFVDTVMAGGYSATDMAAVAIGTSIWLPAILFGHGLLLALTPVIAQLNGSGRRERIAHQVRQGFWLAGFVSVLIMLVLWNAGYIIRSMENIDPALADKAVGYLRALLWGAPGYLFFQVARNQCEGLAKTKPGMVMGFIGLLVNIPVNYIFIYGHFGMPELGGVGCGVATAAVYWVMFLAMVSYIKRARSMRDIRNEKGTAKPDPAVMKRLIQLGLPIALALFFEVTLFAVVALLVSPLGIVDVAGHQIALNFSSLMFVLPMSLAAAVTIRVGYRLGQGSTLDAQTAARTGLMVGVCMATLTAIFTVSLREQIALLYNDNPEVVTLAAHLMLLAAVYQISDSIQVIGSGILRGYKDTRSIFYITFTAYWVLGLPSGYILALTDLVVEPMGPAGFWIGFIIGLTSAAIMMMLRMRFLQRLPSVIILQRASR >NZ_CP040886.1|WP_001373655.1|4604103_4604745_+|riboflavin-synthase MFTGIVQGTVKLVSIDEKPNFRTHVVELPDHMLDGLETGASVAHNGCCLTVTEINGNHVSFDLMKETLRITNLGDLKVGDWVNVERAAKFSDEIGGHLMSGHIMTTAEVAKILTSENNRQIWFKVQDSQLMKYILYKGFIGIDGISLTVGEVTPTRFCVHLIPETLERTTLGKKKLGARVNIEIDPQTQAVVDTVERVLAARENAMNQPGTEA >NZ_CP040886.1|WP_060503957.1|4604784_4605933_-|cyclopropane-fatty-acyl-phospholipid-synthase MSSSCIEEVSVPDDNWYRIANELLSRAGIAINGSAPADIRVKNPDFFKRVLQEGSLGLGDSYMDGWWECDRLDMFFSKVLRAGLENQLPHHFKDTLRIASARLFNLQSKKRAWIVGKEHYDLGNDLFSRMLDPFMQYSCAYWKDADNLESAQQAKLKMICEKLQLKPGMRVLDIGCGWGGLAHYMASNYDVSVVGVTISAEQQKMAQERCEGLDVTILLQDYRDLNDQFDRIVSVGMFEHVGPKNYDTYFAVVDRNLKPEGIFLLHTIGSKKTDLNVDPWINKYIFPNGCLPSVRQIAQSSEPHFVMEDWHNFGADYDTTLMAWYERFLAAWPEIADNYSERFKRMFTYYLNACAGAFRARDIQLWQVVFSRGVENGLRVAR >NZ_CP040886.1|WP_001182363.1|4606223_4607435_-|Bcr/CflA-family-multidrug-efflux-MFS-transporter 
MQPGKRFLVWLAGLSVLGFLATDMYLPAFAAIQADLQTPASAVSASLSLFLAGFAAAQLLWGPLSDRYGRKPVLLIGLTIFALGSLGMLWVENAATLLVLRFVQAVGVCAAAVIWQALVTDYYPSQKVNRIFATIMPLVGLSPALAPLLGSWLLVHFSWQAIFATLFAITVVLILPIFWLKPTTKARNNSQDGLTFTDLLRSKTYRGNVLIYAACSASFFAWLTGSPFILSEMGYSPAVIGLSYVPQTIAFLIGGYGCRAALQKWQGKQLLPWLLVLFAVSVIATWAAGFISHVSLVEILIPFCVMAIANGAIYPIVVAQALRPFPHATGRAAALQNTLQLGLCFLASLVVSWLISISTPLLTTTSVMLSTVVLVALGYMMQRCEEVGCQNHGNAEVAHSESH >NZ_CP040886.1|WP_000269501.1|4607547_4608480_+|LysR-family-transcriptional-regulator MWSEYSLEVVDAVARNGSFSAAAQELHRVPSAVSYTVRQLEEWLAVPLFERRHRDVELTAAGAWFLKEGRSVVKKMQITRQQCQQIANGWRGQLAIAVDNIVRPERTRQMIVDFYRHFDDVELLVFQEVFNGVWDALSDGRVELAIGATRAIPVGGRYAFRDMGMLSWSCVVASHHPLALMDGPFSDDTLRNWPSLVREDTSRTLPKRITWLLDNQKRVVVPDWESSATCISAGLCIGMVPTHFAKPWLNEGKWVALELENPFPDSACCLTWQQNDMSPALTWLLEYLGDSETLNKEWLREPEETPATGD >NZ_CP040886.1|WP_000190982.1|4608476_4609502_-|HTH-type-transcriptional-repressor-PurR MATIKDVAKRANVSTTTVSHVINKTRFVAEETRNAVWAAIKELHYSPSAVARSLKVNHTKSIGLLATSSEAAYFAEIIEAVEKNCFQKGYTLILGNAWNNLEKQRAYLSMMAQKRVDGLLVMCSEYPEPLLAMLEEYRHIPMVVMDWGEAKADFTDAVIDNAFEGGYMAGRYLIERGHREIGVIPGPLERNTGAGRLAGFMKAMEEAMIKVPESWIVQGDFEPESGYRAMQQILSQPHRPTAVFCGGDIMAMGALCAADEMGLRVPQDVSLIGYDNVRNARYFTPALTTIHQPKDSLGETAFNMLLDRIVNKREEPQSIEVHPRLIERRSVADGPFRDYRR >NZ_CP040886.1|WP_000102278.1|4609800_4609890_+|stress-response-protein-YnhF MSTDLKFSLVTTIIVLGLIVAVGLTAALH >NZ_CP040886.1|WP_000701040.1|4610055_4611225_+|MFS-transporter MKINYPLLALAIGAFGIGTTEFSPMGLLPVIARGVDVSIPAAGMLISAYAVGVMVGAPLMTLLLSHRARRSALIFLMAIFTLGNVLSAIAPDYMTLMLSRILTSLNHGAFFGLGSVVAASVVPKHKQASAVATMFMGLTLANIGGVPAATWLGETIGWRMSFLATAGLGVISMVSLFFSLPKGGAGARPEVKKELAVLMRPQVLSALLTTVLGAGAMFTLYTYISPVLQSITHATPVFVTAMLVLIGVGFSIGNYLGGKLADRSVNGTLKGFLLLLMVIMLAIPFLARNEFGAAISMVVWGAATFAVVPPLQMRVMRVASEAPGLSSSVNIGAFNLGNALGAAAGGAVISAGLGYSFVPVMGAIVAGLALLLVFMSARKQPETVCVANS >NZ_CP040886.1|WP_000007283.1|4611370_4611952_-|superoxide-dismutase-[Fe] 
MSFELPALPYAKDALAPHISAETIEYHYGKHHQTYVTNLNNLIKGTAFEGKSLEEIIRSSEGGVFNNAAQVWNHTFYWNCLAPNAGGEPTGKVAEAIAASFGSFADFKAQFTDAAIKNFGSGWTWLVKNSDGKLAIVSTSNAGTPLTTDATPLLTVDVWEHAYYIDYRNARPGYLEHFWALVNWEFVAKNLAA |
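The neighbor protein records above share one header layout: contig accession, protein accession, a `start_end_strand` location, and a product name with spaces replaced by hyphens, all separated by `|`. The following is a minimal sketch of a parser for that layout, inferred from the records shown here rather than from any documented specification; `NeighborProtein` and `parse_header` are hypothetical names.

```python
from typing import NamedTuple


class NeighborProtein(NamedTuple):
    contig: str
    protein: str
    start: int
    end: int
    strand: str
    product: str


def parse_header(header):
    """Split a '>contig|protein|start_end_strand|product' header into fields."""
    # Limit the split to 3 so a '|' inside the product name would be preserved.
    contig, protein, location, product = header.lstrip(">").split("|", 3)
    # The strand is the last '_'-separated token, so split from the right.
    start, end, strand = location.rsplit("_", 2)
    return NeighborProtein(contig, protein, int(start), int(end), strand, product)


record = parse_header(
    ">NZ_CP040886.1|WP_000007283.1|4611370_4611952_-|superoxide-dismutase-[Fe]"
)
print(record.protein, record.start, record.end, record.strand, record.product)
```

The product name is kept as it appears, because some names contain genuine hyphens (for example `glycerol-3-phosphate-transporter`) and converting every hyphen back to a space would be lossy.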
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
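In the spacer hit tables, the Identity values are consistent with (Spacer_length - Mismatch) / Spacer_length rounded to three decimals; for example, a 40 nt spacer with 1 mismatch gives 0.975. The following is a minimal sketch of that calculation. The formula is inferred from the tabulated values, not taken from the database documentation, and the function name `identity` is hypothetical.

```python
def identity(spacer_length, mismatches):
    """Fraction of spacer positions that match the protospacer, to 3 decimals."""
    if spacer_length <= 0:
        raise ValueError("spacer_length must be positive")
    if not 0 <= mismatches <= spacer_length:
        raise ValueError("mismatches must be between 0 and spacer_length")
    return round((spacer_length - mismatches) / spacer_length, 3)


# (Spacer_length, Mismatch) pairs as they appear in the hit table rows.
for length, mismatch in [(40, 0), (40, 1), (47, 1), (48, 3), (32, 7), (33, 8)]:
    print(length, mismatch, identity(length, mismatch))
```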
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP040886_1 | 1.1|318298|40|NZ_CP040886|CRISPRCasFinder | 318298-318337 | 40 | NZ_CP041417 | Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence | 47951-47990 | 0 | 1.0 |
NZ_CP040886_5 | 5.1|1332411|42|NZ_CP040886|PILER-CR | 1332411-1332452 | 42 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 141085-141126 | 0 | 1.0 |
NZ_CP040886_5 | 5.2|1332470|40|NZ_CP040886|PILER-CR | 1332470-1332509 | 40 | NZ_AP023206 | Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence | 141028-141067 | 1 | 0.975 |
NZ_CP040886_10 | 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder | 3759522-3759568 | 47 | NC_016160 | Escherichia phage HK75, complete genome | 28586-28632 | 1 | 0.979 |
NZ_CP040886_10 | 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder | 3759522-3759568 | 47 | NC_019705 | Enterobacteria phage mEpX2, complete genome | 29040-29086 | 1 | 0.979 |
NZ_CP040886_10 | 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder | 3759522-3759568 | 47 | NC_019719 | Enterobacteria phage HK633, complete genome | 31734-31780 | 1 | 0.979 |
NZ_CP040886_10 | 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder | 3759522-3759568 | 47 | JF974339 | Enterobacteria phage IME10, complete genome | 9717-9763 | 1 | 0.979 |
NZ_CP040886_10 | 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder | 3759522-3759568 | 47 | NC_019715 | Enterobacterial phage mEp234, complete genome | 30402-30448 | 2 | 0.957 |
NZ_CP040886_10 | 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder | 3759522-3759568 | 47 | NC_019711 | Enterobacteria phage HK629, complete genome | 37163-37209 | 2 | 0.957 |
NZ_CP040886_10 | 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder | 3759522-3759568 | 47 | NC_019768 | Enterobacteria phage HK106, complete genome | 32698-32744 | 2 | 0.957 |
NZ_CP040886_10 | 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder | 3759522-3759568 | 47 | KY979108 | Escherichia phage ECP1, complete genome | 421-467 | 2 | 0.957 |
NZ_CP040886_10 | 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder | 3759522-3759568 | 47 | NC_005344 | Enterobacteria phage Sf6, complete genome | 28404-28450 | 2 | 0.957 |
NZ_CP040886_12 | 12.1|4600824|38|NZ_CP040886|CRISPRCasFinder | 4600824-4600861 | 38 | NZ_CP043437 | Enterobacter sp. LU1 plasmid unnamed | 113727-113764 | 2 | 0.947 |
NZ_CP040886_3 | 3.1|1120087|48|NZ_CP040886|CRISPRCasFinder | 1120087-1120134 | 48 | NZ_CP053606 | Escherichia coli strain NEB_Turbo plasmid F', complete sequence | 4089-4136 | 3 | 0.938 |
NZ_CP040886_3 | 3.1|1120087|48|NZ_CP040886|CRISPRCasFinder | 1120087-1120134 | 48 | NZ_CP053608 | Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence | 4088-4135 | 3 | 0.938 |
NZ_CP040886_3 | 3.1|1120087|48|NZ_CP040886|CRISPRCasFinder | 1120087-1120134 | 48 | NZ_CP014271 | Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence | 4088-4135 | 3 | 0.938 |
NZ_CP040886_3 | 3.1|1120087|48|NZ_CP040886|CRISPRCasFinder | 1120087-1120134 | 48 | NZ_CP014273 | Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence | 4088-4135 | 3 | 0.938 |
NZ_CP040886_6 | 6.1|2926082|42|NZ_CP040886|CRISPRCasFinder | 2926082-2926123 | 42 | NZ_CP010208 | Escherichia coli strain M11 plasmid B, complete sequence | 30214-30255 | 7 | 0.833 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_MG299151 | Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_KY471628 | Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence | 45716-45747 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_MG299131 | Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_KY471629 | Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence | 45716-45747 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_MG299133 | Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_MG299128 | Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_MG299147 | Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence | 51276-51307 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NC_018995 | Escherichia coli plasmid pHUSEC41-1, complete sequence | 29015-29046 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_CP053235 | Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence | 78292-78323 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_CP005999 | Escherichia coli B7A plasmid pEB1, complete sequence | 39563-39594 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | KU932021 | Escherichia coli plasmid pEC3I, complete sequence | 51902-51933 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_CP024154 | Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence | 18560-18591 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NC_011754 | Escherichia coli ED1a plasmid pECOED, complete sequence | 49240-49271 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_CP015141 | Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence | 81434-81465 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_LR213460 | Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3 | 28916-28947 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_MH287044 | Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence | 36182-36213 | 7 | 0.781 |
NZ_CP040886_8 | 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344710-3344741 | 32 | NZ_MH618673 | Escherichia coli strain 838B plasmid p838B-R, complete sequence | 32230-32261 | 7 | 0.781 |
NZ_CP040886_9 | 9.1|3367246|31|NZ_CP040886|CRISPRCasFinder | 3367246-3367276 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62712 | 7 | 0.774 |
NZ_CP040886_9 | 9.1|3367246|31|NZ_CP040886|CRISPRCasFinder | 3367246-3367276 | 31 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222136 | 7 | 0.774 |
NZ_CP040886_9 | 9.1|3367246|31|NZ_CP040886|CRISPRCasFinder | 3367246-3367276 | 31 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467672-2467702 | 7 | 0.774 |
NZ_CP040886_9 | 9.4|3367429|31|NZ_CP040886|CRISPRCasFinder | 3367429-3367459 | 31 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530641-530671 | 7 | 0.774 |
NZ_CP040886_6 | 6.1|2926082|42|NZ_CP040886|CRISPRCasFinder | 2926082-2926123 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24899-24940 | 8 | 0.81 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_MG299151 | Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence | 51275-51307 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_KY471628 | Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence | 45715-45747 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_MG299131 | Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence | 51275-51307 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_KY471629 | Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence | 45715-45747 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_MG299133 | Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence | 51275-51307 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_MG299128 | Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence | 51275-51307 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_MG299147 | Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence | 51275-51307 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NC_018995 | Escherichia coli plasmid pHUSEC41-1, complete sequence | 29014-29046 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_CP053235 | Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence | 78291-78323 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_CP024154 | Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence | 18559-18591 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NC_011754 | Escherichia coli ED1a plasmid pECOED, complete sequence | 49239-49271 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_CP015141 | Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence | 81433-81465 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_LR213460 | Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3 | 28915-28947 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_MH287044 | Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence | 36181-36213 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_MH618673 | Escherichia coli strain 838B plasmid p838B-R, complete sequence | 32229-32261 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | NZ_CP005999 | Escherichia coli B7A plasmid pEB1, complete sequence | 39563-39595 | 8 | 0.758 |
NZ_CP040886_8 | 8.6|3344709|33|NZ_CP040886|PILER-CR | 3344709-3344741 | 33 | KU932021 | Escherichia coli plasmid pEC3I, complete sequence | 51902-51934 | 8 | 0.758 |
NZ_CP040886_8 | 8.12|3344649|32|NZ_CP040886|CRISPRCasFinder,CRT | 3344649-3344680 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 1417960-1417991 | 8 | 0.75 |
NZ_CP040886_9 | 9.4|3367429|31|NZ_CP040886|CRISPRCasFinder | 3367429-3367459 | 31 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14983 | 8 | 0.742 |
NZ_CP040886_9 | 9.4|3367429|31|NZ_CP040886|CRISPRCasFinder | 3367429-3367459 | 31 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15013 | 8 | 0.742 |
NZ_CP040886_9 | 9.4|3367429|31|NZ_CP040886|CRISPRCasFinder | 3367429-3367459 | 31 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3484 | 8 | 0.742 |
NZ_CP040886_9 | 9.4|3367429|31|NZ_CP040886|CRISPRCasFinder | 3367429-3367459 | 31 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148992-149022 | 8 | 0.742 |
NZ_CP040886_9 | 9.7|3367246|32|NZ_CP040886|PILER-CR,CRT | 3367246-3367277 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 62682-62713 | 8 | 0.75 |
NZ_CP040886_9 | 9.7|3367246|32|NZ_CP040886|PILER-CR,CRT | 3367246-3367277 | 32 | NZ_CP013104 | Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence | 1222106-1222137 | 8 | 0.75 |
NZ_CP040886_9 | 9.7|3367246|32|NZ_CP040886|PILER-CR,CRT | 3367246-3367277 | 32 | NZ_CP012748 | Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence | 2467671-2467702 | 8 | 0.75 |
NZ_CP040886_9 | 9.7|3367246|32|NZ_CP040886|PILER-CR,CRT | 3367246-3367277 | 32 | NC_008759 | Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence | 12670-12701 | 8 | 0.75 |
NZ_CP040886_9 | 9.10|3367429|32|NZ_CP040886|PILER-CR,CRT | 3367429-3367460 | 32 | NZ_CP017750 | Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence | 148991-149022 | 8 | 0.75 |
NZ_CP040886_9 | 9.10|3367429|32|NZ_CP040886|PILER-CR,CRT | 3367429-3367460 | 32 | NC_007336 | Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence | 530640-530671 | 8 | 0.75 |
NZ_CP040886_9 | 9.11|3367490|32|NZ_CP040886|PILER-CR,CRT | 3367490-3367521 | 32 | NZ_CP006991 | Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence | 532343-532374 | 8 | 0.75 |
NZ_CP040886_6 | 6.1|2926082|42|NZ_CP040886|CRISPRCasFinder | 2926082-2926123 | 42 | NZ_CP048307 | Escherichia coli strain 9 plasmid p009_C, complete sequence | 24786-24827 | 9 | 0.786 |
NZ_CP040886_9 | 9.1|3367246|31|NZ_CP040886|CRISPRCasFinder | 3367246-3367276 | 31 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86182-86212 | 9 | 0.71 |
NZ_CP040886_9 | 9.2|3367307|31|NZ_CP040886|CRISPRCasFinder | 3367307-3367337 | 31 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244716 | 9 | 0.71 |
NZ_CP040886_9 | 9.2|3367307|31|NZ_CP040886|CRISPRCasFinder | 3367307-3367337 | 31 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78566 | 9 | 0.71 |
NZ_CP040886_9 | 9.5|3367490|31|NZ_CP040886|CRISPRCasFinder | 3367490-3367520 | 31 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35770 | 9 | 0.71 |
NZ_CP040886_9 | 9.10|3367429|32|NZ_CP040886|PILER-CR,CRT | 3367429-3367460 | 32 | NZ_CP036297 | Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence | 14953-14984 | 9 | 0.719 |
NZ_CP040886_9 | 9.10|3367429|32|NZ_CP040886|PILER-CR,CRT | 3367429-3367460 | 32 | NZ_CP036288 | Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence | 14983-15014 | 9 | 0.719 |
NZ_CP040886_9 | 9.10|3367429|32|NZ_CP040886|PILER-CR,CRT | 3367429-3367460 | 32 | NZ_CP015882 | Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence | 3454-3485 | 9 | 0.719 |
NZ_CP040886_9 | 9.11|3367490|32|NZ_CP040886|PILER-CR,CRT | 3367490-3367521 | 32 | NZ_CP040723 | Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence | 35740-35771 | 9 | 0.719 |
NZ_CP040886_9 | 9.7|3367246|32|NZ_CP040886|PILER-CR,CRT | 3367246-3367277 | 32 | NC_011987 | Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence | 86181-86212 | 10 | 0.688 |
NZ_CP040886_9 | 9.8|3367307|32|NZ_CP040886|PILER-CR,CRT | 3367307-3367338 | 32 | CP011075 | Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence | 244686-244717 | 10 | 0.688 |
NZ_CP040886_9 | 9.8|3367307|32|NZ_CP040886|PILER-CR,CRT | 3367307-3367338 | 32 | GU075905 | Prochlorococcus phage P-HM2, complete genome | 78536-78567 | 10 | 0.688 |
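The spacer identifiers and identity values in the rows above appear to follow a regular pattern: the identifier reads as `array.spacer|start|length|contig|tools`, the end coordinate equals `start + length - 1`, and the identity equals `(length - mismatches) / length` rounded to three decimals. The Python sketch below illustrates that reading. It is inferred from the rows shown here, not taken from the tool's documentation, and the names `SpacerId`, `parse_spacer_id`, and `identity` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpacerId:
    """Fields of an identifier such as '8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT'."""
    array_index: int   # CRISPR array number on the contig (assumed meaning)
    spacer_index: int  # spacer number within the array (assumed meaning)
    start: int         # start coordinate on the contig
    length: int        # spacer length in bases
    contig: str        # contig accession
    tools: tuple       # prediction tools that reported the spacer

    @property
    def end(self) -> int:
        # Inclusive end coordinate, matching the 'start-end' column of the table.
        return self.start + self.length - 1


def parse_spacer_id(text: str) -> SpacerId:
    """Split a pipe-delimited spacer identifier into typed fields."""
    name, start, length, contig, tools = [part.strip() for part in text.split("|")]
    array_index, spacer_index = name.split(".")
    return SpacerId(int(array_index), int(spacer_index), int(start),
                    int(length), contig, tuple(tools.split(",")))


def identity(length: int, mismatches: int) -> float:
    """Fraction of matching positions, rounded to three decimals (assumed formula)."""
    return round((length - mismatches) / length, 3)


spacer = parse_spacer_id("8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT")
print(spacer.end, identity(spacer.length, 7))
```

For the first row of this block (spacer 8.13, length 32, 7 mismatches) this gives an end coordinate of 3344741 and an identity of 0.781, which agrees with the values listed in the table.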
1. spacer 1.1|318298|40|NZ_CP040886|CRISPRCasFinder matches to NZ_CP041417 (Escherichia coli strain STEC711 plasmid pSTEC711_1, complete sequence) position: , mismatch: 0, identity: 1.0
gcgctgcgggtcattcttgaaattacccccgctgtgctgt CRISPR spacer gcgctgcgggtcattcttgaaattacccccgctgtgctgt Protospacer ****************************************
2. spacer 5.1|1332411|42|NZ_CP040886|PILER-CR matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 0, identity: 1.0
tgtcacacgcagataaatccaactttcaatattgttaagttc CRISPR spacer tgtcacacgcagataaatccaactttcaatattgttaagttc Protospacer ******************************************
3. spacer 5.2|1332470|40|NZ_CP040886|PILER-CR matches to NZ_AP023206 (Escherichia coli strain TUM18781 plasmid pMTY18781-1_lncX3, complete sequence) position: , mismatch: 1, identity: 0.975
catggcgtagcaaaaagaaattttcaatattgctttatgg CRISPR spacer catggcgtagaaaaaagaaattttcaatattgctttatgg Protospacer ********** *****************************
4. spacer 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder matches to NC_016160 (Escherichia phage HK75, complete genome) position: , mismatch: 1, identity: 0.979
acggttgtccaacgcaaacaccagtaatggcgcggctctcagtggag CRISPR spacer acggttgtccaacgcaaacaccagtaatggcgcggctctcagcggag Protospacer ******************************************.****
5. spacer 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder matches to NC_019705 (Enterobacteria phage mEpX2, complete genome) position: , mismatch: 1, identity: 0.979
acggttgtccaacgcaaacaccagtaatggcgcggctctcagtggag CRISPR spacer acggttgtccaacgcaaacaccagtaatggcgcggctctcagcggag Protospacer ******************************************.****
6. spacer 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder matches to NC_019719 (Enterobacteria phage HK633, complete genome) position: , mismatch: 1, identity: 0.979
acggttgtccaacgcaaacaccagtaatggcgcggctctcagtggag CRISPR spacer acggttgtccaacgcaaacaccagtaatggcgcggctctcagcggag Protospacer ******************************************.****
7. spacer 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder matches to JF974339 (Enterobacteria phage IME10, complete genome) position: , mismatch: 1, identity: 0.979
acggttgtccaacgcaaacaccagtaatggcgcggctctcagtggag CRISPR spacer acggttgtccaacgcaaacaccagtaatggcgcggctctcagcggag Protospacer ******************************************.****
8. spacer 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder matches to NC_019715 (Enterobacterial phage mEp234, complete genome) position: , mismatch: 2, identity: 0.957
acggttgtccaacgcaaacaccagtaatggcgcggctctcagtggag CRISPR spacer acggttgtccaacgcaaacaccagtaatggcgcggatctcagcggag Protospacer *********************************** ******.****
9. spacer 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder matches to NC_019711 (Enterobacteria phage HK629, complete genome) position: , mismatch: 2, identity: 0.957
acggttgtccaacgcaaacaccagtaatggcgcggctctcagtggag CRISPR spacer acggttgtccaacgcaaacaccagtaatggcgcgtctctcagcggag Protospacer ********************************** *******.****
10. spacer 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder matches to NC_019768 (Enterobacteria phage HK106, complete genome) position: , mismatch: 2, identity: 0.957
acggttgtccaacgcaaacaccagtaatggcgcggctctcagtggag CRISPR spacer acggttgtccaacgcaaacaccagtaatggcgcggatctcagcggag Protospacer *********************************** ******.****
11. spacer 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder matches to KY979108 (Escherichia phage ECP1, complete genome) position: , mismatch: 2, identity: 0.957
acggttgtccaacgcaaacaccagtaatggcgcggctctcagtggag CRISPR spacer acagttgtccaacgcaaacaccagtaatggcgcggctctcagcggag Protospacer **.***************************************.****
12. spacer 10.1|3759522|47|NZ_CP040886|CRISPRCasFinder matches to NC_005344 (Enterobacteria phage Sf6, complete genome) position: , mismatch: 2, identity: 0.957
acggttgtccaacgcaaacaccagtaatggcgcggctctcagtggag CRISPR spacer acgtttgtccaacgcaaacaccagtaatggcgcggctctcagcggag Protospacer *** **************************************.****
13. spacer 12.1|4600824|38|NZ_CP040886|CRISPRCasFinder matches to NZ_CP043437 (Enterobacter sp. LU1 plasmid unnamed) position: , mismatch: 2, identity: 0.947
cggacgcaggatggtgcgttcaattggactcgaaccaa CRISPR spacer cagacgcagaatggtgcgttcaattggactcgaaccaa Protospacer *.*******.****************************
14. spacer 3.1|1120087|48|NZ_CP040886|CRISPRCasFinder matches to NZ_CP053606 (Escherichia coli strain NEB_Turbo plasmid F', complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
15. spacer 3.1|1120087|48|NZ_CP040886|CRISPRCasFinder matches to NZ_CP053608 (Escherichia coli strain NEB5-alpha_F'Iq plasmid F'Iq, complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
16. spacer 3.1|1120087|48|NZ_CP040886|CRISPRCasFinder matches to NZ_CP014271 (Escherichia coli K-12 strain K-12 DHB4 plasmid F128-(DHB4), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
17. spacer 3.1|1120087|48|NZ_CP040886|CRISPRCasFinder matches to NZ_CP014273 (Escherichia coli K-12 strain K-12 C3026 plasmid F128-(C3026), complete sequence) position: , mismatch: 3, identity: 0.938
tcagcgtcgcatcaggcatctgcgcataaccgccggatgcggcgtaaa CRISPR spacer ccagcgtcgcatcaggcatctgcgcataactgccggatgcggcataaa Protospacer .*****************************.************.****
18. spacer 6.1|2926082|42|NZ_CP040886|CRISPRCasFinder matches to NZ_CP010208 (Escherichia coli strain M11 plasmid B, complete sequence) position: , mismatch: 7, identity: 0.833
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer acaaatgccggatgcggcgtaaacgccttatctggcctacgc Protospacer ***. *.****************.*********.******.
19. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_MG299151 (Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
20. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_KY471628 (Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
21. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_MG299131 (Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
22. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_KY471629 (Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
23. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_MG299133 (Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
24. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_MG299128 (Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
25. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_MG299147 (Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
26. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NC_018995 (Escherichia coli plasmid pHUSEC41-1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
27. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_CP053235 (Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
28. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_CP005999 (Escherichia coli B7A plasmid pEB1, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
29. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to KU932021 (Escherichia coli plasmid pEC3I, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
30. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_CP024154 (Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
31. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NC_011754 (Escherichia coli ED1a plasmid pECOED, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
32. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_CP015141 (Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
33. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_LR213460 (Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
34. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_MH287044 (Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
35. spacer 8.13|3344710|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_MH618673 (Escherichia coli strain 838B plasmid p838B-R, complete sequence) position: , mismatch: 7, identity: 0.781
aaatatccagggctgggctggaggcagacggc-- CRISPR spacer cgttatccagggctgagctgcaggcag--ggcca Protospacer . ************.**** ****** ***
36. spacer 9.1|3367246|31|NZ_CP040886|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer tccctatcgcaatgccggcagcatccgcaat Protospacer *. *. ****** **** ************
37. spacer 9.1|3367246|31|NZ_CP040886|CRISPRCasFinder matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
38. spacer 9.1|3367246|31|NZ_CP040886|CRISPRCasFinder matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 7, identity: 0.774
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatc Protospacer **** ************ ***** * ** .
39. spacer 9.4|3367429|31|NZ_CP040886|CRISPRCasFinder matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 7, identity: 0.774
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ccgaacaggtggcgaagcaggtgatgggcca Protospacer ******.* **************.. ***
40. spacer 6.1|2926082|42|NZ_CP040886|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 8, identity: 0.81
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer attgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer *. * ******************.*******.*******.
41. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_MG299151 (Shigella sonnei strain SH287-2 plasmid pSH287-2, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
42. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_KY471628 (Shigella sonnei strain SH15sh99 plasmid pSH15sh99, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
43. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_MG299131 (Shigella sonnei strain SH271-2 plasmid pSH271-2, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
44. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_KY471629 (Shigella sonnei strain SH15sh105 plasmid pSH15sh104, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
45. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_MG299133 (Shigella sonnei strain SH272-2 plasmid pSH272-2, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
46. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_MG299128 (Shigella sonnei strain SH262-2 plasmid pSH262-2, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
47. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_MG299147 (Shigella sonnei strain SH284-2 plasmid pSH284-2, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
48. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NC_018995 (Escherichia coli plasmid pHUSEC41-1, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
49. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_CP053235 (Escherichia coli strain SCU-106 plasmid pSCU-106-1, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
50. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_CP024154 (Escherichia coli strain 14EC033 plasmid p14EC033g, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
51. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NC_011754 (Escherichia coli ED1a plasmid pECOED, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
52. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_CP015141 (Escherichia coli strain Ecol_732 plasmid pEC732_3, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
53. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_LR213460 (Shigella sonnei strain AUSMDU00008333 isolate AUSMDU00008333 plasmid 3) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
54. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_MH287044 (Escherichia coli strain 5.1-R1 plasmid pCERC6, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
55. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_MH618673 (Escherichia coli strain 838B plasmid p838B-R, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
56. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to NZ_CP005999 (Escherichia coli B7A plasmid pEB1, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
57. spacer 8.6|3344709|33|NZ_CP040886|PILER-CR matches to KU932021 (Escherichia coli plasmid pEC3I, complete sequence) position: , mismatch: 8, identity: 0.758
gaaatatccagggctgggctggaggcagacggc-- CRISPR spacer acgttatccagggctgagctgcaggcag--ggcca Protospacer . . ************.**** ****** ***
58. spacer 8.12|3344649|32|NZ_CP040886|CRISPRCasFinder,CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
tcaacgcgctcagacgttgcgtgagtgaacca CRISPR spacer acaacgcggtcggacgttgcgtgattaccccg Protospacer ******* **.************ *. **.
59. spacer 9.4|3367429|31|NZ_CP040886|CRISPRCasFinder matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
60. spacer 9.4|3367429|31|NZ_CP040886|CRISPRCasFinder matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer agcggcagctggcgatgcaggtggcttgcgt Protospacer ..*.******** ********** ****
61. spacer 9.4|3367429|31|NZ_CP040886|CRISPRCasFinder matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer ttgcgcagctggcgcagcaggtggctgccga Protospacer ..* .*.******* ************ **
62. spacer 9.4|3367429|31|NZ_CP040886|CRISPRCasFinder matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.742
ccgaacggctggcgaagcaggtggctggcgt CRISPR spacer gggtacggctggcgaaggaggcggctgcgga Protospacer * ************* ***.***** *
63. spacer 9.7|3367246|32|NZ_CP040886|PILER-CR,CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer tccctatcgcaatgccggcagcatccgcaatc Protospacer *. *. ****** **** ************.
64. spacer 9.7|3367246|32|NZ_CP040886|PILER-CR,CRT matches to NZ_CP013104 (Paraburkholderia caribensis strain MWAP64 plasmid 1, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
65. spacer 9.7|3367246|32|NZ_CP040886|PILER-CR,CRT matches to NZ_CP012748 (Paraburkholderia caribensis MBA4 plasmid unnamed, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer ttgcgcgcgcaattccgtgagcagcgccatca Protospacer **** ************ ***** * ** .
66. spacer 9.7|3367246|32|NZ_CP040886|PILER-CR,CRT matches to NC_008759 (Polaromonas naphthalenivorans CJ2 plasmid pPNAP03, complete sequence) position: , mismatch: 8, identity: 0.75
ttgcccgcg-----caattccgggagcatccgcaatt CRISPR spacer -----cgtgaaactcatttccgggagcatccgcattt Protospacer **.* ** ***************** **
67. spacer 9.10|3367429|32|NZ_CP040886|PILER-CR,CRT matches to NZ_CP017750 (Cupriavidus sp. USMAA2-4 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer gggtacggctggcgaaggaggcggctgcggaa Protospacer * ************* ***.***** * *
68. spacer 9.10|3367429|32|NZ_CP040886|PILER-CR,CRT matches to NC_007336 (Cupriavidus pinatubonensis JMP134 megaplasmid, complete sequence) position: , mismatch: 8, identity: 0.75
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ccgaacaggtggcgaagcaggtgatgggccag Protospacer ******.* **************.. *** .
69. spacer 9.11|3367490|32|NZ_CP040886|PILER-CR,CRT matches to NZ_CP006991 (Rhizobium sp. IE4771 plasmid pRetIE4771e, complete sequence) position: , mismatch: 8, identity: 0.75
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer catcatcctcccgcagatgcgctggccgatcc Protospacer *.*.* .******** ******** *****
70. spacer 6.1|2926082|42|NZ_CP040886|CRISPRCasFinder matches to NZ_CP048307 (Escherichia coli strain 9 plasmid p009_C, complete sequence) position: , mismatch: 9, identity: 0.786
acagcagtcggatgcggcgtaaacaccttatctgacctacgt CRISPR spacer gttgatgtcggatgcggcgtaaacgccttatccgacctacaa Protospacer .. * ******************.*******.*******.
71. spacer 9.1|3367246|31|NZ_CP040886|CRISPRCasFinder matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 9, identity: 0.71
ttgcccgcgcaattccgggagcatccgcaat CRISPR spacer gctaccgcgcaattcgaggagcatccgctgg Protospacer . *********** .*********** .
72. spacer 9.2|3367307|31|NZ_CP040886|CRISPRCasFinder matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer tgaggcaaaatatagattgatttccgaaaat Protospacer .*.********* ******** ****
73. spacer 9.2|3367307|31|NZ_CP040886|CRISPRCasFinder matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 9, identity: 0.71
acggacaaaatatatattgatttgcgaatta CRISPR spacer acggaaaaattatatattgattttacttctg Protospacer ***** *** ************* .*.
74. spacer 9.5|3367490|31|NZ_CP040886|CRISPRCasFinder matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.71
gtttaccgccccgcagaggcgctggcagatc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcga Protospacer ******.*** *************
75. spacer 9.10|3367429|32|NZ_CP040886|PILER-CR,CRT matches to NZ_CP036297 (Planctomycetes bacterium Pla86 plasmid pPla86_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
76. spacer 9.10|3367429|32|NZ_CP040886|PILER-CR,CRT matches to NZ_CP036288 (Planctomycetes bacterium Pla133 plasmid pPla133_1, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer agcggcagctggcgatgcaggtggcttgcgtg Protospacer ..*.******** ********** ****.
77. spacer 9.10|3367429|32|NZ_CP040886|PILER-CR,CRT matches to NZ_CP015882 (Ensifer adhaerens strain Casida A plasmid pCasidaAB, complete sequence) position: , mismatch: 9, identity: 0.719
ccgaacggctggcgaagcaggtggctggcgta CRISPR spacer ttgcgcagctggcgcagcaggtggctgccgag Protospacer ..* .*.******* ************ ** .
78. spacer 9.11|3367490|32|NZ_CP040886|PILER-CR,CRT matches to NZ_CP040723 (Rhodococcus pyridinivorans strain YF3 plasmid unnamed4, complete sequence) position: , mismatch: 9, identity: 0.719
gtttaccgccccgcagaggcgctggcagatcc CRISPR spacer cgagaccgcctcgccgaggcgctggcagcgac Protospacer ******.*** ************* *
79. spacer 9.7|3367246|32|NZ_CP040886|PILER-CR,CRT matches to NC_011987 (Agrobacterium radiobacter K84 plasmid pAtK84c, complete sequence) position: , mismatch: 10, identity: 0.688
ttgcccgcgcaattccgggagcatccgcaatt CRISPR spacer gctaccgcgcaattcgaggagcatccgctggg Protospacer . *********** .*********** .
80. spacer 9.8|3367307|32|NZ_CP040886|PILER-CR,CRT matches to CP011075 (Brevibacillus laterosporus strain B9 plasmid unnamed1, complete sequence) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer tgaggcaaaatatagattgatttccgaaaata Protospacer .*.********* ******** ****
81. spacer 9.8|3367307|32|NZ_CP040886|PILER-CR,CRT matches to GU075905 (Prochlorococcus phage P-HM2, complete genome) position: , mismatch: 10, identity: 0.688
acggacaaaatatatattgatttgcgaattat CRISPR spacer acggaaaaattatatattgattttacttctgg Protospacer ***** *** ************* .*.
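
The `mismatch` and `identity` values in the records above can be checked directly from each spacer/protospacer pair. The following is a minimal sketch, assuming that identity is the fraction of matching positions in an ungapped, equal-length alignment; the database's exact definition is not stated here, and the function name `hamming_identity` is hypothetical. Applied to record 79 (spacer 9.7 against NC_011987), it counts 10 mismatches over 32 positions, i.e. 22/32 = 0.6875, which agrees with the reported 0.688.

```python
def hamming_identity(spacer: str, protospacer: str) -> tuple[int, float]:
    """Return (mismatch count, identity) for an ungapped pairwise alignment.

    Assumption: identity = matching positions / alignment length.
    """
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have equal length for an ungapped comparison")
    # Compare position by position, ignoring case.
    mismatches = sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))
    identity = (len(spacer) - mismatches) / len(spacer)
    return mismatches, identity


# Sequences copied from record 79 above.
spacer = "ttgcccgcgcaattccgggagcatccgcaatt"
protospacer = "gctaccgcgcaattcgaggagcatccgctggg"

mismatches, identity = hamming_identity(spacer, protospacer)
print(f"mismatch: {mismatches}, identity: {identity:.4f}")
```

The same check applies to the other records, since each lists a spacer and protospacer of equal length.
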
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
180920 : 191698
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP040886|180920:191698|DBSCAN-SWA GTCATTTTGCTAATAACACTTTTCTGGCTTCATCGACTTTATTTGGAGCATCGAACCAATATTCGGCGAGCAAAGGGCAGATCTCTGTATCAACGACTTGTTCATACCAAGCCTGGGCATCGTTGATTTTTTGGCCGATAGCTGGGGTGACATAGCTGTGACCGATACAAAACTGTGGTCCTAAGGTAACATCTTTTGCCAGCATATCGTTAAGTACCGTCAGTCGAGATTTGATGAATGCTAACATGTCAAAATCAATTGCGTAATTATAATTTACCCAGTTTCTCCACGCGTCATTAAAAGCTGGCTTTAGATCGATAAACGCGAAACGACGGCGCAGCGCTAGGTCAAGTAGTGCGAGTGAGCGGTCTGCAATATTCATCGTACCGATGATATATAGATTCTCAGGAATGTATATTTTTTCATCATCATTTTTAGGATAGGAAAGAGATAATGCCTCAGTCGGCGTGCGTTTATCTGCTTCCATCAACGTGAGTGTTTCACCGAAGATTTGCGCCGGATTGCCACGATTAATCTCTTCAATTATCACCACATATTTCGAGGTAGGATTATTGACTGCAGTTTTGATTGCATTTACAAAAGGTCCATCAATTAGCGTCAATTGCCCTTCTTTACCTGGACGCCAGCCGCGAATAAAGTCTTCGTAAGAGAGGTTCGGGTGAAATTGCACCGCGCTAATACGCTCAGGTGCTTTTTCTCCCATCAAGCAGTACGCCAGACGTCGCGCTAACCAGGTTTTTCCAGTTCCGGGTGGTCCTTGTAATATCAGGTTTTTCTTGTCGATCAGGCGCTGAAGTGTGAGTTGGATCTTAGCCTCTTCCAAGAAACAGCCATCCTGCACCAGATGACTGATGTCATAAGGAACGTGAGTGAGTTTTGGCAACGGCGCACTCTCTTCGACAGTTTCCTCAGTTATCGTCTGGGATTCATTTTTCTCAAAATTCAGGTATTCATAATTTCCGGACTCAAGGGACTTTAACTCATCATCATTGCATAGTTTCTGTAGATAAAAGTAGATTGAGCTTGTAACTGTTTTGCTCTCAGGATGCTCTACCTTGAATACGTCCAAGTAGTTTTCTTCCAACTCCGAAGCAGTGAAATAAGGGCCATTTTTTTTCAGGCATAACGCCTTAATTTTATTCAGTAAAGAGGCTTTCCACGTTCTATTTGCCACCTCATCTTTAGACTGGCTAAGATCTGTATTCCACGCTGATAAAGAAAGTTCTGGGAAAGAATGAACCGGATAGTTTGGTTGGGTAAAGACCTCGTTCAGCGCCCGCATAAGACTCAGGTAACTTTGTCCACTACAGCGACCTTTTGCCCCGTTTTTAATGATCTTAATGTTTAACACTGTCTGAATGTAATACTGCGACTGGCTATCTAAAGTAGGGTAGAACCAAGGACGGGTCCAGTACAACCCCATGGTGAGATTCCAACCGACATTCATTACTGTAGAAGCAATGTCATATGCGGCAGTGAAGTCTGCAGAGTTAGTATTCTGGTTATCCGCAAAGGTCATTGCCTGCGAGAACATTTCCCACAAGCATTCAATGTCATTAGGGTCGCGTGACTTTTCATAACCAAAGAACCAAGATTTTTGGTTATTCAACAGCGGGATTCCGGCAAAGGAGTCAGGAATCGGTTCGTTCACGCCCAACAAATTCGCTAGCTTGGCAGCAATAATTTTGCGATTGCTGTCGGTCAAGTTACGATTGAACAAGCCCATAGTAGTAAACGGACAAATGTCTTTTAAGGGAAAGATCTCTCCCATAATAGATTTGTCCTGCAGATGGGACATTCCTTCCACACCTGAGGCAATTAGATGAATACCTTTGACTAATTCATCTCTACGATTTCGCCAAGTCAGTAACGCGTTGGCAAAAGCCTCATAAAAACTAGCCCAA
GCAAATTTGCCATCATGTTCTGCTGTATCCACGGGAACTCCATTATACTTGTTGAGCAATGATAATTTATCTGTGTGAGTTTTAACATATTACTATCAGTTATAGAAAAATTTAACTACCCGATACAGAGAGCGGCATGCTGAATTTGACCTGACTTGCTTCCAACTAATTAAAATCAACTTATTTATCAATTGGTTATTTTGGCGCATAGCGGTCATCAAGGGAATATCGCGTTGTCATAAGGTGTCGAGGCTCGGAGGTTCAAATCCTCTCATGCAAAAAATAAATAAAATTAATGACGGTTGGAAATTATTCAATACATACACTCTCGAAAGTGCATCAGCCAACCGCAGCACGTCTTGCATACGGCGTGTCTGCAGTTTTATATAATCCTGGCTGGAAACCTCTTATACAAAGTAGATACACCAATATCATAGATGATCGCCACCTTCTGACGCGGGACTTTTGATGCAATTAATCGCCCGGCCTGCGCCCATTGTTCTGATGTAAGTTTAGGACGACGTCCACCAATTCCTCCCTGTGCGCGAGCATCTTCCAGTCCAACTTTTGTCAGTCATCAATCAGTTCACGCTTCATTTCAACCAGGGCTCCATCACATGGAAAAATAAAAAAAACCATCGATGTTGACGTGTCAATGTTACCTGTCAGGCTACGGAAATCCCCCCCTTACCCGCAGCTACTCGGTAAGTATAATAAGATGTTTCCTGCTCCTTCCTAACCTGTCCAGTTTCCAGACAATTAAAGTATCACCTTATATAAGAGACTGCAGAGCGCACTTTAACCTAGGTCGTTCTGAAATAGTTGCAGATTCGATTAATCAACGATTATACTCCCCGGCACTCCAGAGGATCTGGTAAACATAAAGTCAGAGCTTGGGTATACTGGCACACAAATGGCAGATCTTGCAGGTGCAGCCAGTCATAGCCAGTGGCGAAAATACACGAGTGGTTCTGAGCCCCGCGCCATGTCATCACATATCTTGTTTTTTTATTGCTACCAATCTGACTTTGAGTACTAATGAGCTAGATAGAATTGTTGGTAATGACTCCAACTTACTGATAGTGTTTTATGTTCAGATAATGCCCGATGACCTTGTCATGCAGCTCCACCGATTTTGAGAACGACAGTGACTTCCGTCCCAGCCTTGCCAGATGTTGTCTCAGATTCAGGTTATGTCGCTCAATGCGCTGAGTGTAACGCTTGCTGATAACGTGCAGCTTTCCCTTCAGGCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATACCACGACCTCAAAGGCCGACAGCAGGCTCAGAAGACACTCCAGTGTGGCCAGAGTGCGTTCACCGAAGACGTGCGCCACAACCGTCCTCCGTATCCTGTCATACGCGTAAAACAGCCAGCGCTGACGTGATTTAGCACCGACGTAGCCCCACTGTTCGTCCATTTCAGCGCAGACAATCACATCACTGCCCGGTTGTATGCGCGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAACCGTGTTGAGGCCAACGCCCATAATGCGTGCACTGGCGCGACATCCGACGCCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGCTGAGAGGCGGTGTAAGTGAACTGTAGTTGCCATGTTTTACGGCAATGAGAGCAGAGATAGCGCTGATGCCCGGCAGTGCTTTTGCCGTTACGCACCACGCCTTCAGTAGCGGAGCAGGAAGGACATCTGATGGAAATGGAAGCCACGCAAGCACCTTAAAATCACCATCATACACTAAATCAGTAAGTTGGCAGCATTACCTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAACGCCTTTACCTGATTTAGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCCGATTCGGGTAATGTTGACCATTCACTGACCACATTATTGATGCCGATATTCC
GCCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTTAATTCGGAAAGTTGCTCGTTGCTAACCCAATCCAGTTTTTTATTAAAGCCATATGTTTTGCGCATGACAGCCAGAAAGACCAGAAGCTGGTGCTGTGTTAATCCGGCCAGCATCACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCCGCTCCTTTTGTGCCACATCCGGCACAGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCCGCAGACCGACTTTCGCCATTTTTGAACCTGTCATATTGCCCCCAGCATGGTGGTGACCATCGCCATCAATGGACCAGCCAGATCCGGGTCCACTCGAAACATCGACACAATGCCTTCACTCATCTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACCGCCTGCTTTGCCTCACAGAGTTCCTTTTCCATTTCAGCCAGCCGAGCCATGAAGCTATCCTGCTCAACCAGGTGGCCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATAGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAGGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTAGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACCGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATTGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTGAAACAGAAAGCCTCTGAGCAGAAGGTGGCTGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGAGATGATGCGTGGAACAAATTACGACTCGGCGTCATCACGGCTTCAGAAGTTCACAATGTGATAGCAAAACCCCGCTCCGGTAAAAAGTGGCCTGACATGAAAATGTCCTACTTTCACACCCTGCTGGCTGAGATTTGCACCGGTGTGGCTCCGGAAGTTAACGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAATGACGCCAGAGCCCTGTTTGAGTTTACTTCCGGCGTGAATGTTACTGAATCCCCGATCATCTATCGCGACGAAAGTATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAGCTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAGTTCCGGCTCGGTGGTTTCGAGGCCATAAAGTCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACACGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGTATGAAGCGTGAAGGACTGCATTATGTCGTGGTTGAGCGGG
ATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGCGCGATCACTTTCGTCTACTCCATTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCACTGAAAGCACAGCGGCTGGCTGAGAAGATAAATAATAAACAGGAGGATATATGAGTCAGGTTGGTAATCATTCATTCGAATTTCCGGCATCGCAAGGTGTACAGGGTGGTACTGTTACACTCTTCCTTACCATACCAGGAAGATCGCTGGCTCGTTTCCTCGCTTCAGATAATTACGGCCATACACTGGAACGCTCTCAGCGAGAAATTAATCCAAATCGAGTACGAAAATTTTTAAATTATCTCACTAACGCAGACTCAAGAAATGAGTCTTTTATCATTCCCCCTCTCGTAGGTAACTGTGATTCGAATATAGAATTTGTACCGTTTGGCAACACAAATGTTGGTATAGCCAGAATTCCCCTCGACGCCGAAATAAAACTTTTTGATGGCCAACATCGTGCAGCTGGCATTGAGATATTTTGCCGAAGTTCCCCATCAACGCTCATGGTTCCCATGATGCTTACAATGAATCTGCCGCTAAAAACCCGGCAGCAGTTCTTTTCGGACATAAATAACAACGTTTCTAAGCCATCAGCGACCATCAATATGGCGTATAACGGCCGGGATGATATTGCTCAGGGAATGATATCCTTCCTGACCCAACATACTGTATTTGCCGATATAACCGATTTTGAACACAACGTAGTGCCATTAAAAAGTAATATGTGGGTGAGTTTCAAGGCACTCACTGATGCAACGTCAAAGTTTGCTAGGAACGGCAATCAACAACTTGAAATGGGATATATAGAATCTGTCTGGGAGGCATGGATTACACTAACTCAGATTGACTCAATCCGACATGGTGTACACCACGCTACGTACAAGCGCGATTATATTCAGTTCCATGGAGTAATGATTAACGCTTTCGGTTTTGCGGTTCAACAGATGATGGTTAATCATTCCATCGCAGAAATAACTTCTATGATCGAAAAACTCTGTGCAACTACCAGCTCTGCAGAAAGAGAGGATTTTTTTCTGATGGATAACTGGGCGGGGATCTGCACGAAAGCCAGCCAGGAAAAACTATCGGTTATTGCCAATGTGGCAGCGCAGAAAGCAGCAGCAAACAGACTGATACAAGCTTTTACCAAAGGAAGTCTGGAAACAACTTAATGAATCAACATTGTCTCATATCAGCATGCTGTACGGCGTCTTTAAGGAACGGTGAGCATGAAAAACAAAATCATCATGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATCATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGCTAGATGAACGAAGACGCCTGGCCGTTCAGGGTTGTCGGACTTGTGCAAGTTGCCAGGAGGAGATCGAACTTAAGAACAAACAATGGGGACTGTGATGGCCTCAAAGCAGCAAATTTCAACATCGTCCAACTGAGGTGTAAAAATGTTCAGAATCATTTTTCCTAACACCTGGTACGTCGACCACCACGGCACTCCCTGCAAAATCCTGCGTTCTACCCACAACAAAGTTCACTACATCCGAAAAGGCAGAACATGTATCGCCAGCATGTTCCGCTTTAATCATGACTTTGAACCTGTGAATAAAGCTGATGCAGATCGGATAGCAGAAGAGATCGAAACGGCAGAACACATTAAGAAGTTACGTGCCATACGCAGGAAATAGAAAAATTGATAAATTCAA
TACTGCATTTCTCAGCATTAAATTTATCTCTATGACCAGTCAAGAGATGTACCTGCCATGAGCTTAATATCATGTCAGATATATCGGTCACAAACTCCCTCAGCAGCTAAGAGGAGGACAAATGTCTCGACTAATCACTTTACAGGACTGGGCTAAAGAAGAATTTGGGGACTTAGCACCAAGTGAGCGAGTTCTGAAAAAATACGCGCAAGGGAAAATGATGGCCCCACCCGCTATAAAAGTTGGTCGCTACTGGATGATTGACCGAAATTCCCGTTTTGTAGGAACGCTGGCAGAACCGCAACTCCCAATAAACGCAAACCCAAAACTCCAACGGATAATCGCTGATGGCTGCTAGACCCCGATCTCACAAAATCTCTATACCCAATTTATATTGCAAATTAGATAAGCGAACCGGAAAGGTATATTGGCAATACAAACATCCACTATCCGGTCGTTTTCATAGCTTAGGAACTGATGAGAATGAAGCAAAACAAGTTGCTACTGAAGCAAATACCATTATTGCTGAACAACGTACCAGACAAATATTAAGCGTCAATGAGCGTCTGGAAAGAATGAAAGGCAGGCGCTCAGACATTACGGTGACAGAATGGCTTGATAAATATATTTCTATCCAGGAGGACAGGCTGCAACATAATGAACTAAGACCCAACTCCTATCGGCAAAAAGGCAAACCCATTCGTCTTTTCCGTGAGCATTGTGGAATGCAACACCTCAAGGATATTACCGCACTTGATATTGCCGAAATAATTGATGCTGTAAAGGCTGAAGGTCATAACAGGATGGCGCAAGTCGTGAGAATGGTGTTGATCGACGTCTTCAAAGAAGCACAACACGCAGGACATGTTCCGCCAGGATTTAACCCAGCGCAGGCAACAAAACAACCGCGAAATCGAGTAAACCGCCAAAGATTGTCACTGCCCGAATGGCAGGCAATATTTGAAAGCGTAAGCAGACGGCAGCCCTATTTAAAATGCGGCATGCTACTTGCTCTTGTTACTGGACAACGTTTAGGCGATATCTGCAATTTGAAATTCTCTGATATATGGGACGACATGTTGCACATTACTCAGGAAAAAACCGGTTCAAAACTTGCTATTCCGCTTAACCTGAAATGCGATGCTCTGAATATTACCCTTCGTGAAGTTATATCTCAGTGCAGGGATGCTGTTGTTAGTAAATATCTGGTCCATTACCGTCACACTACCTCTCAAGCAAACAGAGGAGACCAGGTTTCTGCAAATACTCTGACAACGGCTTTTAAAAAGGCCAGGGAAAAATGTGGCATAAAATGGGAGCAAGGAACTGCGCCCACATTTCATGAGCAGCGATCTCTGTCAGAACGGTTATATCGGGAACAGGGTCTGGATACGCAAAAGTTGTTAGGCCATAAATCCAGAAAAATGACCGACCGATACAATGATGATCGTGGTAAAGACTGGGTTATCGTAGATATCAAAACAGCATAGAAAATAGCCAGTTTTGGGGAAGGGTTTTGGGGAAAGTTTTGGGGAAGATTTTACATCATCATAAAACAACGGGCGTATAACACGCCCGTTTCAATATTTAACACATGTAGAGATTACATGTTCTTGATGATCGCATCACCAAACTCTGAACATTTCAACAGTTTAGCGCCTTCCATCAGACGTTCGAAGTCATAGGTTACGGTCTTCGCATTGATTGCGCCTTCCATACCTTTAACAATCAGGTCTGCGGCTTCGGTCCAACCCATGTGGCGTAACATCATCTCAGCAGAGAGAATAATAGAGCCTGGGTTTACTTTGTCCTGACCGGCATATTTCGGCGCAGTACCGTGGGTGGCTTCAAACAGGGCGCATTCGTCACCGATGTTTGCACCTGGGGCGATACCGATACCGCCAACCTGCGCTGCCAGGGCGTCAGAAATGTAGTCACCGTTCAGGTTCATACAGGCGATAACATCATATTCAGCCGGACGCAGCAGGATCTGTTGCAGG
AATGCATCAGCAATCACGTCTTTAATGACGATCTCTTTGCCGGTGTTCGGGTTTTTAACTTTCAGCCACGGGCCGCCGTCGATCAGTTCACCGCCAAACTCTTCACGCGCCAGCTGGTAGCCCCAGTCTTTAAACGCTCCTTCGGTGAACTTCATGATGTTGCCTTTGTGCACCAGAGTCACAGAGTCACGATCGTTAGCAATTGCGTATTCGATCGCTGCACGAACCAGACGTTTGGTGCCTTCTTCCGAACACGGCTTAATACCGATACCACAATGTTCCGGGAAGCGAATTTTCTTCACCCCCATCTCTTCACGCAGGAATTTAATCACTTTCTCGGCGTCGGCAGAGTCTGCTTTCCATTCGATACCCGCATAAATGTCTTCCGAGTTTTCACGGAAGATAACCATATCGGTCAGTTCAGGGTGTTTAACCGGGCTTGGAGTGCCCTGATAGTAACGTACCGGACGCAGGCAGATGTAGAGATCCAGTTCCTGGCGCAGGGCAACGTTCAGAGAGCGAATACCGCCACCAACAGGAGTGGTCAGCGGACCTTTAATGGCAACGCGATATTCACGAATCAGATCAAGGGTTTCAGCAGGCAGCCAGACATCCTGACCATAAACCTGTGTGGATTTTTCACCGGTGTAAATTTCCATCCAGGAGATTTTACGCTCGCCTTTATAGGCTTTCTCGACTGCAGCGTCGACCACTTTCAGCATGGCTGGGGTTACATCTACACCGATTCCATCACCTTCAATGTAAGGGATAATCGGATTTTCAGGAACGTTGAGTTTGCCGTTTTGCAGGGTGATCTTCTTGCCTTGTGCCGGAACAACTACTTTACTTTCCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP040886|180920:191698|190447_191698_-|WP_000444487.1|DBSCAN-SWA MESKVVVPAQGKKITLQNGKLNVPENPIIPYIEGDGIGVDVTPAMLKVVDAAVEKAYKGERKISWMEIYTGEKSTQVYGQDVWLPAETLDLIREYRVAIKGPLTTPVGGGIRSLNVALRQELDLYICLRPVRYYQGTPSPVKHPELTDMVIFRENSEDIYAGIEWKADSADAEKVIKFLREEMGVKKIRFPEHCGIGIKPCSEEGTKRLVRAAIEYAIANDRDSVTLVHKGNIMKFTEGAFKDWGYQLAREEFGGELIDGGPWLKVKNPNTGKEIVIKDVIADAFLQQILLRPAEYDVIACMNLNGDYISDALAAQVGGIGIAPGANIGDECALFEATHGTAPKYAGQDKVNPGSIILSAEMMLRHMGWTEAADLIVKGMEGAINAKTVTYDFERLMEGAKLLKCSEFGDAIIKNM >NZ_CP040886|180920:191698|180920_182876_-|WP_000379042.1|DBSCAN-SWA MDTAEHDGKFAWASFYEAFANALLTWRNRRDELVKGIHLIASGVEGMSHLQDKSIMGEIFPLKDICPFTTMGLFNRNLTDSNRKIIAAKLANLLGVNEPIPDSFAGIPLLNNQKSWFFGYEKSRDPNDIECLWEMFSQAMTFADNQNTNSADFTAAYDIASTVMNVGWNLTMGLYWTRPWFYPTLDSQSQYYIQTVLNIKIIKNGAKGRCSGQSYLSLMRALNEVFTQPNYPVHSFPELSLSAWNTDLSQSKDEVANRTWKASLLNKIKALCLKKNGPYFTASELEENYLDVFKVEHPESKTVTSSIYFYLQKLCNDDELKSLESGNYEYLNFEKNESQTITEETVEESAPLPKLTHVPYDISHLVQDGCFLEEAKIQLTLQRLIDKKNLILQGPPGTGKTWLARRLAYCLMGEKAPERISAVQFHPNLSYEDFIRGWRPGKEGQLTLIDGPFVNAIKTAVNNPTSKYVVIIEEINRGNPAQIFGETLTLMEADKRTPTEALSLSYPKNDDEKIYIPENLYIIGTMNIADRSLALLDLALRRRFAFIDLKPAFNDAWRNWVNYNYAIDFDMLAFIKSRLTVLNDMLAKDVTLGPQFCIGHSYVTPAIGQKINDAQAWYEQVVDTEICPLLAEYWFDAPNKVDEARKVLLAK >NZ_CP040886|180920:191698|188320_188539_+|WP_001678640.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPLDERRRLAVQGCRTCASCQEEIELKNKQWGL >NZ_CP040886|180920:191698|186947_187106_+|WP_000149533.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSITKRGWVFPGLSVIRNPLKAQRLAEKINNKQEDI >NZ_CP040886|180920:191698|188586_188826_+|WP_000488406.1|DBSCAN-SWA MFRIIFPNTWYVDHHGTPCKILRSTHNKVHYIRKGRTCIASMFRFNHDFEPVNKADADRIAEEIETAEHIKKLRAIRRK >NZ_CP040886|180920:191698|185962_186274_+|WP_072163463.1|DBSCAN-SWA MPLLCCEATYIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKASEQKVAA >NZ_CP040886|180920:191698|186270_186951_+|WP_001372461.1|DBSCAN-SWA 
MTPDIILQRTGIDVRAVEQGDDAWNKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEICTGVAPEVNAKALAWGKQYENDARALFEFTSGVNVTESPIIYRDESMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >NZ_CP040886|180920:191698|187102_188167_+|WP_001678641.1|DBSCAN-SWA MSQVGNHSFEFPASQGVQGGTVTLFLTIPGRSLARFLASDNYGHTLERSQREINPNRVRKFLNYLTNADSRNESFIIPPLVGNCDSNIEFVPFGNTNVGIARIPLDAEIKLFDGQHRAAGIEIFCRSSPSTLMVPMMLTMNLPLKTRQQFFSDINNNVSKPSATINMAYNGRDDIAQGMISFLTQHTVFADITDFEHNVVPLKSNMWVSFKALTDATSKFARNGNQQLEMGYIESVWEAWITLTQIDSIRHGVHHATYKRDYIQFHGVMINAFGFAVQQMMVNHSIAEITSMIEKLCATTSSAEREDFFLMDNWAGICTKASQEKLSVIANVAAQKAAANRLIQAFTKGSLETT >NZ_CP040886|180920:191698|188965_189202_+|WP_000088653.1|DBSCAN-SWA MSRLITLQDWAKEEFGDLAPSERVLKKYAQGKMMAPPAIKVGRYWMIDRNSRFVGTLAEPQLPINANPKLQRIIADGC >NZ_CP040886|180920:191698|185240_185780_-|WP_001753331.1|DBSCAN-SWA MQLLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGHLVEQDSFMARLAEMEKELCEAKQAVILNAPRHQKLKEMSEGIVSMFRVDPDLAGPLMAMVTTMLGAI >NZ_CP040886|180920:191698|189191_190334_+|WP_000741339.1|integrase|DBSCAN-SWA MAARPRSHKISIPNLYCKLDKRTGKVYWQYKHPLSGRFHSLGTDENEAKQVATEANTIIAEQRTRQILSVNERLERMKGRRSDITVTEWLDKYISIQEDRLQHNELRPNSYRQKGKPIRLFREHCGMQHLKDITALDIAEIIDAVKAEGHNRMAQVVRMVLIDVFKEAQHAGHVPPGFNPAQATKQPRNRVNRQRLSLPEWQAIFESVSRRQPYLKCGMLLALVTGQRLGDICNLKFSDIWDDMLHITQEKTGSKLAIPLNLKCDALNITLREVISQCRDAVVSKYLVHYRHTTSQANRGDQVSANTLTTAFKKAREKCGIKWEQGTAPTFHEQRSLSERLYREQGLDTQKLLGHKSRKMTDRYNDDRGKDWVIVDIKTA |
11 | Enterobacteria_phage(40.0%) | integrase | attL 178893:178916|attR 190401:190424 |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_2 |
506073 : 514843
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP040886|506073:514843|DBSCAN-SWA TTTAACTGGAAACTTCCGTTGACCGGCTGTATTCATGGCTGCGAATTTTCGCCATCAGCTCGTCTGTCAGTTCGGACACCCACTGGATAGCCAGCCGCTTTTCTTCGTCGCTGCACTCACTAGCCGCTACAAGCTTGATAAAAAAATCAATACGCTGAAGCTTCAATGACTCCAAAAGATAGTCCTGCATCTTCCCTCCTATCATTACACGGATACACAAAAACTGTATATACACCCACTGTTTATATAAACAGTATAATAGGAACAGAAAAATGTAAAACTGTTTTTTGTCAGTTAATTGGATGTACTGATGTCGGTCAATAAAGCACAAAATGTTAAACAGCAGCCTTAGTACCATTGACGCCATTTGTCATCTTCCTGCAGCCTCTGGTTACGGTAAAAAATACGTAAACCGGCACCGGATGGAATGCTGCCGCCACGCAGAAGCAAATCAATCTCAGATGCACTACCTTCAAACCCCCTGGTTGTCAGTTCTGCCTCAAGCTGCAGGCGCTGCTGCTCCAAAATACTCTGTATGTATGCTTTTTTCCGCTTCGGTTTTACCAGTCTCAACCTGGCTGTCAGCTCCCGCCGTTCCTTCTGGCCCATGTTGTGGAGATATTCCTGCAGCTCCTTCTCATCCATGGTTTTAATATCGGGTAAATCACCCCCTGATTTGTTCAGATTTTCAACAGGGGGACAGTTATTGCCACGAGTCCAAGGGGCGCAAGCGCCCTGGTCGGCTGCCGCCTCCTGAACGTCAACGGCTTTACGAACCATTTTCCACTTCACTGCATGAGTGCAGATCTTGCCCTCTGCAATGGGTGACCAGATGCCATAAATACGAATGCCATGATCGCCATAGGCGGTCGGCTCTTCGTTGATTTCATAAGCGGTTCTGATGAGGTGATATTTACGGGGAACCAGTACGCCGCCCTGCTTCATGATGTAGGTGGCAAAACAACCAGCATCAGCAGCAGCCAGAATGGCATCAAGACGCGGGTTATCCAGTACCGGCGCACCTGCTTTTTTGTCACCCTGTTGCCTTGCCGCCTGACCAGCCAGCAAGCGAAGTTCACGGTAAGCCTGACGCCCCGGAATACCAAAGAAGCGGAATTGCTGAACACGATGCAGAGACGCCCAGGCATTCACGTATTCAGCGTTATCACGCAGAGATTTACCCGTTTCCTTGCTGATCTCGCCAGCCAGACCACGACCGTCAATGTTCTTACTGATATATTTCGCGATGTAGCTTGTCGGCGTTCCTTTGCGCGGGTTAATCAACTCAGACTTAAAGCGCGGCCCAGTGTTATTGCCCAGCTCCTCGCGGTCTTCACGGATGGCAAACTTACGCAGTAATGCAGTGATGGCACGGCGGTCTTTTTTGCGCATGAAACACAACAGGTGCCAGTGAACTGTGCCATCATGATGCGGCTCAGCCACCCGCACGCCATACCAGCGCAACCCGGCTTTGTGCATCGCCTTACGAAATGCAGCAAACATACCGACCAGATAATCGCTGCTTTGTCTTACCGTCGCGTTTGTCCAGGTCGGGTTTGGTCTGCCGTTATTTAGCGTGGAATGGAAACGCGACGGACAGGTGATAGTGTAGAAAACGGCGCAGTCACCGCGCATTTCCGCGATAAGCTCCAGACCTTTAACACAGGCCATCATCTCATTGCGGCGATGCGCAGGGTTGCTGCTGCTGGCGTTTACCACATCCTCCATGTCCAGCGTGTCGCCGTCTTCGTTCACCAGTTCATGAGAACGGAAAAACTCCAGCGACTTACGACGCTGCTCACGTTTATGCATCACGGCTTCATAGCTGACATAAGGAGATGCTTTTTTGCTGACCAGGCAAACAGCGCGCAACTGCTCTTCCCGCCATTCGCAACGCATCTTCCATAATTTCCGATACCACCAGT
CGGCGCACAACATACGCGCCAGCGAACCCGGAATGAGTTCATAGGGCACGGGTTTACGGCGGTTTCTTTTCCGACGGAGTTGCTCAAACGCAGGTGGGATGACATCCAGACGCAGGGTTTCCGCTGCCACCTTTTCCCATGTCTTGCGGATTTCTTCTGGCTTAACGTCATCGGTGGCATACAAATCACCACAAGCTGCATCAAGGCACATACTCATATGCGCAGCTACCAGGGTGGAAAGGCGTTTCACCTGATCCTGACTCATTTCAGGCAGGATCAGCAGGCCGTCCAGCCCTTCATGGCTTGCCATAAAGCGAAAAGATGCAGATAGCTGACTGTCGCGTACATGCTCCAGTCGTTCCAGACATGGCTTAATCGTCTCACGTAAATAGCGGGAATAAGCCTTTGGCCTGCCCAGGCTGCTGAAGTATTCAATACGTTGCATCAGCGGCTTGCTGATATGGGAAGGCTGGGCGTTGACGTCCGCCAGAATGACCATGTCTGAATTAAAACGCTGCTGCTCATGCGCCAGCTTTGCCCGGCTAATGAGCTTATCCTGCTCCATTTCGCGCTGGACAGGATCACGTGATTCATTAAAGAAATAACGCTCCCAGACCTGATCACTCAGTGCCTCGCGGCGCAACTGTTCCTGCTCGTTATCGGCAGCGTACAGAGTGATCAGGTTTGAAAGCGTAGAAACCGGCGCAACTTCCGCCGGGTCCAGATAAGGGTTAATGGCCTTTTTCGGGCTGTTCCATGAGAATGCTGCGGCAGCCTCGTTAAAGCCGCAGCAGTTGTTCATATCGGCATGACTCATGCACGTACTCCGTACACGGCAGAACTATCCACGCCACGCGAATAATCAAATCCCATCCAGCAGCGCGGCCCGGAAACAGCAATGATTTCTGTTGCTGATTTACCCTCGCCAGCTGCCACACCGATGCTGCGTTTTACCTTGATATAGTGGTGAGTAAAATTGCGATACAGCGAACGGATCAGGGATGTGTCACTGTTAGAAACAATGACCGGATGTCCTTCTGATGACCGATGTTCAAGAACGGATGCCAGGTGATACTGGTCATCTTCAGTGAAACCATCAGTGTGATAGCCGGAAAACGTACCGTCATATGGCGGATCGCAATACACCACATCTCCCGCCTTCAACATCGCCAGCGTTTCATCAAAGCTGGCGCAGATAAACGTCGCTCGCTGGGCTTTTTCTGCAAATGTGCGAAGTTCTTTTTCAGGGAAATACGGATTTTTATAATTACCGTAGGGAATGTTGAAATGCCCGCTCTTGTTATAGCGACATAAACCACGGTAACCGTGACGATTGAGATACAGGAAATATACCGCTTTCATGAAATCAGTAATTTCAGTTGAGCAGTTAAACTCCTGCCTTATGTTGTAATAAGCCACCTCCCTGTTTGCGATCTCAAATAAAACTCTGGCGCGAGATATAAACGATTCACAATCAGCGGCAACCTTTTTATAGAGGTTGATTAAATCAGGATTAATATCCGCAACCAGATAGCTGGGATAATCCGTTTCCATCATCACAGCACAGGAACCCGCGAAAGGTTCAACCAGTCGCGGGCCAGCAGGAAGGTATTTTTTCAGTTCTGGCATAATGGCGGTTTTATTTCCCGCCCATTTCAGGATGGTGCTCATACAGCACCTCCGTTGTAATGTTTGCCTTTCAGTTCTGCGATTTCCTGACAGGTAATGCAAAGCTGCACTCCCGGAATGGCGCGGCGTCGTGCTGGCGGAATTGGCGCTTCACATTCAATGCAAAGTACGCGTGACACGCCCGGTGATTTGGCACGGGCTGCACGGATATGGCGCTGGCGTTCTTCTTCAACGCGCTGCTGTACGAGATCCATTGCATCAGCCATTAGTGGATCTCCTGCGCTTCGTTCTGGATTGCTTCAGCAGTCACGCGCAGCAGTTCTGCCGCTTCCACGTGGTTTAGCTGGCGGGATGAGATATGACACGCCAGG
CTATCAAGGCGAGCAGCCATTGCTTCAGCCCTTGCCCGGCGTTCTTCCAGACGAGCCTCTGTCAGTAAAATATTAAGCCCTGCGTCATCCGGTCCGGTTTTAGTCGTGAGGGTTTCAATATTACGCATAATCAATTCTCCTGAATTTAGATAAAGGGATGCCCGGCGGGTTTACGCCATTAATTTCATTAGTTGGTTAATTCGGCATGGTTAGCCGTCTGGGAAATAAGCTCACCACTGCACGAAAATGATTCATTGCTTTAATCAGCTCCCGCTTTTCGTCAGTGGTCAGCTCATTAATGCTGATGCTATGACGTTCAGCTGGAATTTTTGCCATAAAGAATATGGCAGCCAGTGCCCGTTTATTTTGTTCATTATTGATATCCCGTGGATCACGCATATCTTTAATAAACCGCTCAAGCTCTGACTCAATATTCAGGCCAAAAACTTTCGCCCTTAACTCCGCAATGTGATTAAGTCCATTCAGGCGTTCACCGGGGCTTAATGGAACAGTTGCTGCAGCGCCATTAATTGCCATACTTCATATCCCCCAAACGCAGCTATCGTTCTTTGTTCTTACGGTAACGCTCAAGAGGAGATACATTTTTTCGTATCGTCTCTTTAACCTGCTCTCCCCGTAAAAACGTCCCATCCTTTAACGTGAAAAAGTAACTGCCATCGCCCGACAATGACGGATAGCAACAGAGCAAATCATCTTCAGGTACTGAATAACTCTCCCCTCTGTAACGAAACTGATAAACCACTTCACTTTCTGCCGCATACATTTGGACTTTCTCCGTTTCCTCGTGGTCAATTCAGACAGCAATTCATCTTGTGAATGACATGGATGCCAGCGTTTTCCATCCTCACCCGTGATCCAGCCGTGACCGTAGTGCATTGCCGGGCTTTGTTTTACCAGCAGCGATGCAAATGATGGTTCTTTCGTCAGCATAAACACCTCACAGCAAACCGAATGAAGCACCAAGGCCAGTCATGGTATCAACTGCACTCGCCATCGCAGGATTAGCCTGTAAACGGGCTTGCAATGAAACAGCAGCCAGCGCCATCAGTCGTGTTACAGAGTTAATGCTGCTGATAGCATCACGACGACCTGCACTGGTTTTTACATCGCCAGATACCGCACCTGCAGCAACACGCCCGATCTCTGCGGTTGCACTCATGACGTAATGCGGTAGTTTCTCTTTTGCCACCTCATTAATCGGAACACATGGCAGACAATGAATCTGTGCCAGAAAACCATCTACCAGCGTTGAATCTTCAGTCAGATCGGTAAGCAACCAAATTTCTGGTGCGGTTAATAAATGAGGTTGAGCTGGGTTCAGTTTGTTCCGCAGAATCTGCACATTCATGCCTGCACGTTCTGCCAGTTGCACCAGATTGTGGCGCAGTGCGAATGCACGACAGGCTTCATCAAAATGTGGATGTTTGGAAACTTGGTAATCAAACATGGTCGACACCTCTGATGTATCCCAAAATGGAACTAGTTGAATACAACATTGCAATCAGTAAGTGCATCAACGGTAAGAGCAGCAAGGTTGATCATTACCTTTTCTCTTTTCTTGTCTTTCCGAAGGCGATGCCGAGGGATGCGACCGTCAGCCAGCATATCGTTAATTGTGTCGATTGAAAGACCAGTAAGTTCGCTATAACGCTCAATTGTGACATGTGGCGTATTCAGGGTTATTGAAATGTTAGGGGTCATGATGCAACATCTCCTATTGGCTTGTGGTGAGTCAGTTTTAATCGTGGCTTTAACTTCACATTTCGGAGAATAGGATCACAAATCGGTTATGTCAACACATGAAATCACATTTCGCCATGTGGACGAAAAAAAGAAATCCTTAATCATGCAGAATCGCGGAGGGCAATCGGTTATAGATCGGATACTGAAAGCCTATGGTTTTTCTTCCCGACAAGCATTCTGTAATCACCTAGGTATATCGCAAAGTACAATGGCGAACAGGTATGCCCGTGACACTT
TCCCTGCTGATTGGGTTGTTATCTGTAGCATGGAGACTGGAGTGCCGGTCGAGTGGTTGGCATTTGGCACTGATACCGAGAAGGGAAGCATTACAAATAATGCAGAAAAAAGTCACAACAATTGTGACAGCAAGCATCAACATCTCAATAGAGAACAAGACATCCAAAATGAGAACTCTTTTACTATTAACCAAGGTGGAAAAGCAGCAATAGAGCGAATCGTTTTGGCTTATGGATTTAAGACAAGACAAGCTTTAGCTGATCATATTGGTGTATCAAAAAGTACATTAGCCAATCGTTACATGAGAGATACCTTTCCTGCTGACTGGATTATTCAATGCTCACTGGAAACCGGTGCTTCATTAACATGGCTAACCACTGGTAACGGGGCAATGTTTGAAAAGCCTCGAAACGATACTATCACTATCCCATATCATAAAATAATTGATGGATCTCTTGCTCAAGAAACCTTCTTGACTTTTGACTCTAAGTTGTTAGAAGGAACCTTTCTGCAACCTTTAGCAGTATTCATTGATGAGGAAATATATATTGTAGAATCAAAATTTAATGAAGTTACTGATGGCAAGTGGCTTGTGAATATTGAAGGGAAAATAAGTATCAAAGATTTGACTCGCATACCCGTTGGTATGGTTAAAGTTGTAGGCACTAACGCAAGTTTTGAATGCTTACTTACTGACATTATCGTTTTGGCAAAATGTAAAAGAGTTTTTACTAAAAATGTATAAAGAGAAACATCATGACTGAACCAACCAATAAAGATAGCGAAATAAAAAAACACCTATTAGAATTTCTTGATTCACAGTCTGAAAATATAGCAAAACACTTCTACTCTCATATAAAAGACTTAATAGAAGCAGGAGAGCTTTCTGAAGCTCATAATAACCTAGCGCTAATTGAAAAATACATAACTAGGCCACCGATGGATGAAGAACCCAATATAAATGAAAATAAAGCCAATAAAAGAAAAAATGTAAAATCACTTGAACCTAATAATTATGTAGAACATATAATACAATTAGAAGAACGAAACAGCATATTAACTCTACAGTTAGAGCATTATACTCAGGATCTTAATAGAAAAAACGCAATAATCGAAAACAACGTAAAACAAATTAATTCATTGATTAGTGAAAATAAGGAACTCCGTAGCCAAGTACAGCAACAAAGAATCGATGATAAAATCCCCACCTATGTTAACGATGTTAAATCAGATCTTGGTAGTGATGACAAACATTTTATATTGATGTCTATTATCTGGTCTATTGCAGGGGTATTTTTTGGCTTCCTTGCAGTAGTATCTGCTTTTTTTACATTATACATGAACTTAGATTTAAAAAATCTCACTAACCTTCAGTTAATATATATCTTCACGCGAGGATTAGTTGGAATCGCCATTCTTTCATGGCTATCATATATCTGCCTTAGTAACTCAAAAAAGTACACACATGAATCGATCAGGCGAAAAGATCGTCGACATGCTTTGATGTTTGGTCAAGTTTTTTTGCAGATATACGGTTCTACAGCAACTAAAGAGGATGCAATAGAAGTCTTTAAGGATTGGAATATTTCAGGTGACTCTGCATTTTCAGGTCAGACAGAGCAACCACCGAGTTTTGCGTCATTTTTGAATACAATCAAAGACAAAGTTAAAGTAACTGGAAGTGATAAAGAAACAGATTAATCATGAACATGTATGCTACTAAGTAAAAAATACATTGAATACTGTTGTTATATACAGTTAAATTTAGCCCTCTGATATGAGGGCATTTTTTATGGCAGTACGAAAACTCACCACAGGAAAATGGCTTTGCGAATGTTACCCCGCCGGACGTAGCGGACGCCGTGTGCGTAAACAATTCGCCACCAAAGGCGAAGCACTGGCCTTCGAGCGATACACCATGGGGGAAATAGAAGCAAAACCCTGGCTGGGCGAATCAGTGGATCGTCGGACACTGAAAGATATGGTTGAGCTATGGTTC
AAATTACATGGCAAATCTCTTACTGCCGGACAGCATGTCTACAACAAGCTGCTGTTGATGGTTGACGCCTTGGGAAATCCCCTTGCAACTGATCTCACCTCAAAAATGTTTGCTCACTATCGAGATAAACGCCTGACAGGCGAGATCTACTTCAGCGAGAAATGGAAGAAAGGAGCAAGCCCGGTCACCATTAACCTGGAGCAAAGCTATCTAAGTAGTGTTTTTAGCGAACTATCCCGTCTGGGCGAATGGTCGTATCCGAACCCACTGGAGAACATGCGAAAATTCACCATCGCAGAAAAAGAGATGGCATGGCTTACCCATGAGCAGATTGTTGAATTGCTGGCTGATTGCAAACGTCAGGACCCAATTCTGGCACTGGTAGTTAAGATATGCTTAAGCACAGGCGCACGCTGGCGTGAAGCCGTAAATCTTACCCGCTCACAGGTGACCAAATACCGAATTACCTTTGTCAGAACGAAGGGGAAGAAAAACAGAAGCATCCCTATCAGTAAAGAGCTTTACGAAGAGATCATGGCGCTCGATGGGTTCAATTTCTTCACAGACTGCTATTTTCAATTTTTATCCGTGATGGAAAAAACGTCTATCGTGCTCCCTCGCGGTCAACTCACACACGTTCTGCGCCATACGTTTGCAGCGCACTTCATGATGTCGGGTGGAAACATTCTGGCCTTACAAAAAATTCTCGGACACCACGATATAAAAATGACTATGCGTTACGCACATCTGGCACCGGATCATCTGGAAACGGCGCTCCGTTTCAATCCTCTGGCAACGCTGCCAAGTGGCGACAAAGTGGCGGCAGCGGTTGGCATTACCCCGTAA
Protein sequences of DBSCAN-SWA_2 >NZ_CP040886|506073:514843|511864_512743_+|WP_001680871.1|DBSCAN-SWA MQNRGGQSVIDRILKAYGFSSRQAFCNHLGISQSTMANRYARDTFPADWVVICSMETGVPVEWLAFGTDTEKGSITNNAEKSHNNCDSKHQHLNREQDIQNENSFTINQGGKAAIERIVLAYGFKTRQALADHIGVSKSTLANRYMRDTFPADWIIQCSLETGASLTWLTTGNGAMFEKPRNDTITIPYHKIIDGSLAQETFLTFDSKLLEGTFLQPLAVFIDEEIYIVESKFNEVTDGKWLVNIEGKISIKDLTRIPVGMVKVVGTNASFECLLTDIIVLAKCKRVFTKNV >NZ_CP040886|506073:514843|509664_509892_-|WP_000752610.1|DBSCAN-SWA MADAMDLVQQRVEEERQRHIRAARAKSPGVSRVLCIECEAPIPPARRRAIPGVQLCITCQEIAELKGKHYNGGAV >NZ_CP040886|506073:514843|510192_510534_-|WP_000996717.1|DBSCAN-SWA MAINGAAATVPLSPGERLNGLNHIAELRAKVFGLNIESELERFIKDMRDPRDINNEQNKRALAAIFFMAKIPAERHSISINELTTDEKRELIKAMNHFRAVVSLFPRRLTMPN >NZ_CP040886|506073:514843|512754_513699_+|WP_001678408.1|DBSCAN-SWA MTEPTNKDSEIKKHLLEFLDSQSENIAKHFYSHIKDLIEAGELSEAHNNLALIEKYITRPPMDEEPNINENKANKRKNVKSLEPNNYVEHIIQLEERNSILTLQLEHYTQDLNRKNAIIENNVKQINSLISENKELRSQVQQQRIDDKIPTYVNDVKSDLGSDDKHFILMSIIWSIAGVFFGFLAVVSAFFTLYMNLDLKNLTNLQLIYIFTRGLVGIAILSWLSYICLSNSKKYTHESIRRKDRRHALMFGQVFLQIYGSTATKEDAIEVFKDWNISGDSAFSGQTEQPPSFASFLNTIKDKVKVTGSDKETD >NZ_CP040886|506073:514843|510955_511465_-|WP_000460892.1|DBSCAN-SWA MFDYQVSKHPHFDEACRAFALRHNLVQLAERAGMNVQILRNKLNPAQPHLLTAPEIWLLTDLTEDSTLVDGFLAQIHCLPCVPINEVAKEKLPHYVMSATAEIGRVAAGAVSGDVKTSAGRRDAISSINSVTRLMALAAVSLQARLQANPAMASAVDTMTGLGASFGLL >NZ_CP040886|506073:514843|510651_510948_-|WP_000956192.1|DBSCAN-SWA MLTKEPSFASLLVKQSPAMHYGHGWITGEDGKRWHPCHSQDELLSELTTRKRRKSKCMRQKVKWFISFVTEGRVIQYLKMICSVAIRHCRAMAVTFSR >NZ_CP040886|506073:514843|509891_510125_-|WP_001244224.1|DBSCAN-SWA MRNIETLTTKTGPDDAGLNILLTEARLEERRARAEAMAARLDSLACHISSRQLNHVEAAELLRVTAEAIQNEAQEIH >NZ_CP040886|506073:514843|506420_508814_-|WP_001376443.1|DBSCAN-SWA 
MSHADMNNCCGFNEAAAAFSWNSPKKAINPYLDPAEVAPVSTLSNLITLYAADNEQEQLRREALSDQVWERYFFNESRDPVQREMEQDKLISRAKLAHEQQRFNSDMVILADVNAQPSHISKPLMQRIEYFSSLGRPKAYSRYLRETIKPCLERLEHVRDSQLSASFRFMASHEGLDGLLILPEMSQDQVKRLSTLVAAHMSMCLDAACGDLYATDDVKPEEIRKTWEKVAAETLRLDVIPPAFEQLRRKRNRRKPVPYELIPGSLARMLCADWWYRKLWKMRCEWREEQLRAVCLVSKKASPYVSYEAVMHKREQRRKSLEFFRSHELVNEDGDTLDMEDVVNASSSNPAHRRNEMMACVKGLELIAEMRGDCAVFYTITCPSRFHSTLNNGRPNPTWTNATVRQSSDYLVGMFAAFRKAMHKAGLRWYGVRVAEPHHDGTVHWHLLCFMRKKDRRAITALLRKFAIREDREELGNNTGPRFKSELINPRKGTPTSYIAKYISKNIDGRGLAGEISKETGKSLRDNAEYVNAWASLHRVQQFRFFGIPGRQAYRELRLLAGQAARQQGDKKAGAPVLDNPRLDAILAAADAGCFATYIMKQGGVLVPRKYHLIRTAYEINEEPTAYGDHGIRIYGIWSPIAEGKICTHAVKWKMVRKAVDVQEAAADQGACAPWTRGNNCPPVENLNKSGGDLPDIKTMDEKELQEYLHNMGQKERRELTARLRLVKPKRKKAYIQSILEQQRLQLEAELTTRGFEGSASEIDLLLRGGSIPSGAGLRIFYRNQRLQEDDKWRQWY >NZ_CP040886|506073:514843|506073_506262_-|WP_001376441.1|DBSCAN-SWA MQDYLLESLKLQRIDFFIKLVAASECSDEEKRLAIQWVSELTDELMAKIRSHEYSRSTEVSS >NZ_CP040886|506073:514843|508810_509668_-|WP_001544405.1|DBSCAN-SWA MSTILKWAGNKTAIMPELKKYLPAGPRLVEPFAGSCAVMMETDYPSYLVADINPDLINLYKKVAADCESFISRARVLFEIANREVAYYNIRQEFNCSTEITDFMKAVYFLYLNRHGYRGLCRYNKSGHFNIPYGNYKNPYFPEKELRTFAEKAQRATFICASFDETLAMLKAGDVVYCDPPYDGTFSGYHTDGFTEDDQYHLASVLEHRSSEGHPVIVSNSDTSLIRSLYRNFTHHYIKVKRSIGVAAGEGKSATEIIAVSGPRCWMGFDYSRGVDSSAVYGVRA >NZ_CP040886|506073:514843|511497_511719_-|WP_000188448.1|DBSCAN-SWA MTPNISITLNTPHVTIERYSELTGLSIDTINDMLADGRIPRHRLRKDKKREKVMINLAALTVDALTDCNVVFN >NZ_CP040886|506073:514843|513790_514843_+|WP_001372563.1|integrase|DBSCAN-SWA MAVRKLTTGKWLCECYPAGRSGRRVRKQFATKGEALAFERYTMGEIEAKPWLGESVDRRTLKDMVELWFKLHGKSLTAGQHVYNKLLLMVDALGNPLATDLTSKMFAHYRDKRLTGEIYFSEKWKKGASPVTINLEQSYLSSVFSELSRLGEWSYPNPLENMRKFTIAEKEMAWLTHEQIVELLADCKRQDPILALVVKICLSTGARWREAVNLTRSQVTKYRITFVRTKGKKNRSIPISKELYEEIMALDGFNFFTDCYFQFLSVMEKTSIVLPRGQLTHVLRHTFAAHFMMSGGNILALQKILGHHDIKMTMRYAHLAPDHLETALRFNPLATLPSGDKVAAAVGITP |
12 | Salmonella_phage(90.0%) | integrase | attL 505743:505756|attR 514885:514898 |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_3 |
594426 : 621630
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP040886|594426:621630|DBSCAN-SWA TATGACAACGGACGATCTTGCCTTTGACCAACGCCATATCTGGCACCCATACACATCCATGACCTCCCCTCTGCCGGTTTATCCGGTGGTGAGCGCCGAAGGTTGCGAGCTGATTTTGTCTGACGGCAAACGCCTGGTTGACGGTATGTCGTCCTGGTGGGCGGCGATCCACGGCTACAATCACCCGCAGCTTAATGCGGCGATGAAGTCGCAAATTGATGCCATGTCGCATGTGATGTTTGGCGGTATCACCCATGCGCCAGCCATTGAGCTGTGCCGCAAACTGGTGGCGATGACGCCGCAACCGCTGGAGTGCGTTTTTCTCGCGGACTCCGGTTCCGTAGCGGTGGAAGTGGCGATGAAAATGGCGTTGCAGTACTGGCAAGCCAAAGGCGAAGCGCGCCAGCGTTTTCTGACCTTCCGCAATGGTTATCATGGCGATACCTTTGGCGCGATGTCGGTGTGCGATCCGGATAACTCAATGCACAGTCTGTGGAAAGGCTACCTGCCAGAAAACCTGTTTGCTCCCGCCCCGCAAAGCCGCATGGATGGCGAATGGGATGAGCGCGATATGGTGGGCTTTGCCCGCCTGATGGCGGCGCATCGTCATGAAATCGCGGCGGTGATCATTGAGCCGATTGTCCAGGGCGCAGGCGGGATGCGCATGTACCATCCGGAATGGTTAAAACGAATCCGCAAAATGTGCGATCGCGAAGGTATCTTGCTGATTGCCGACGAGATCGCCACCGGATTTGGTCGTACCGGCAAACTGTTTGCCTGTGAATATGCAGAAATCGCGCCGGACATTTTGTGCCTCGGTAAAGCCTTAACCGGCGGCACAATGACCCTTTCCGCCACACTTACCACGCGCGAGGTTGCAGAAACCATCAGTAACGGCGAAGCCGGCTGCTTTATGCATGGGCCAACTTTTATGGGCAATCCGCTGGCCTGCGCGGCAGCAAACGCCAGCCTGGCGATTATCGAATCCGGCGAATGGCAGCAGCAGGTGGCGGCTATTGAAGTGCAGCTGCGCGAGCAACTGGCACCAGCCCGTGATGCCGAAATGGTTGCCGATGTGCGCGTACTGGGGGCAATCGGTGTGGTCGAAACCACTCGTCCGGTGAATATGGCGGCGCTGCAAAAATTCTTTGTCGAACAGGGTGTCTGGATCCGGCCTTTTGGCAAACTGATTTACCTGATGCCGCCCTATATTATTCTCCCGCAACAGTTGCAGCGTCTGACCGCAGCGGTTAACCGCGCGGTACAGGATGAAACATTTTTTTGCCAATAACGGGAAGTCCGCGTGAGGGTTTCTGGCTACACTTTCTGCAAACAAGAAAGGAGGGTTCATGAAACTCATCAGTAACGATCTGCGCGATGGCGATAAATTGCCGCATCGTCATGTCTTTAACGGCATGGGTTACGATGGCGATAATATTTCACCGCATCTGGCGTGGGATGATGTTCCTGCGGGAACGAAAAGTTTTGTTGTCACCTGCTACGACCCGGATGCGCCAACCGGCTCCGGCTGGTGGCACTGGGTAGTTGTTAACTTACCCGCTGATACCCGCGTATTACCGCAAGGGTTTGGCTCTGGTCTGGTAGCAATGCCAGACGGCGTTTTGCAGACGCGTACCGACTTTGGTAAAACCGGGTACGATGGCGCAGCACCGCCGAAAGGCGAAACTCATCGCTACATTTTTACCGTTCACGCGCTGGATATAGAACGTATTGATGTCGATGAAGGTGCCAGCGGCGCGATGGTCGGGTTTAACGTTCATTTCCACTCTCTGGCAAGCGCCTCGATTACTGCGATGTTTAGTTAATCACTCTGCCAGATGGCGCAATGCCATCTGGTATCACTTAAAGGTATTAAAAACAACTTTTTGTCTTTTTACCTTCCCGTTTCGCTCAAGTTAGTATAA
AAAAGCTGAATGCGAAACATTAAAAAACATTAATATCAATGTGTTACAATATCATTGGTCTAAAAAATAGACTACATGATGCTACAAAACACAACATATCCAGTCACTATGAATCAACTACTTAGATAGTATTAGTGACCTGAGACAGAGCATTAGCGCAAGGTGATTTTTGTCTTCTTGCGCTAATTTTTTGTTATCAAACATGTCGCACTCCAGAGAAGCACAAAGCCTTGCAATCCAGTGCAAAGATTTGTGTGCCTCAGTTTTGTCTAAGTGTTCTACTGAAAACATAGTAAAATCGGCAACAGCTGGAAATCATTCAATACTCGCACTATCGAAAGTTCACCAGCCAACCGCAGCACGTTCTTGCATACGACGTGCTGCGGTTTTCTTTATGATTTATGCACAATGGACAATTTGAAATTATTGATGATTGTATGGTGCATCGTTTTCTGAACCTACACTGATTTTTTGGTATAGCCTTGCCTAGTCAGTCTTACCGGATCAAACTCTTCGCTATTGCAATACTAACCAAAATCATCAATTTGACAGCGATTAACCAGAATAATAGTATACTATCACCAGTAAGAAATTATCGTTATTTGTAGCGATACATATTATATATATATCATTCTCAGGTGCGTACATGATTATCAACCAGGTACCTATAAAAATAAAAATCTTTATCTTTTTATTTTCATGCATCTCTATTATATTTTTGTTACTGCATGCAAATAATGGAATATACATAACACAAACAACACAAATAAGTTATAGTGTTTTCATTATTGGGCTTTTTTTCATAAACCTGATGATTTTTATTTTTCTATTGCTTTACTATGTTTCTAATCAGAGACAAAGTTATCTCTTAATTCTTTCATTCGCGTTTTTGAGCAACACGTATTATTTATTAGAAGTGGCTATTATTTCTTTATCTCCGTTAGGTAACGATTTATCTACAATCTATCAGAAATCAAATGATATCGCAATATATTATCTATTCCGTCAGTTCAGCTTTATATCTATAATCTTTCTGGCTGTTTATTCCACCAATGTTAAAAATAAAAGTGTTTTAGAAGATAAAAGAAACATAATAATTGTTGTTTTGTCAATATTAATTCTTTTTATTACTCCGTTTGTAGCAAAAAATCTAAGCAGTGACAATATAAAATATAGTCTTAATATTATACAATACTCGCTGAATCGTCATTTGCCGACGTGGAATATCGTGTACACCAAAATAATATCAGTATTTTGGCTTGTATTACTTATCAGCTCATGCATCAGCATACGTAATTACTCAAAAATATGGTTGTGTATAATACTTATTAGTATAGTGTCAGTATGCAATAATCTAATTTTATTGTATTTTATTGATAAATCCCATCCTGCATGGTATATGACAAAATTTCTTGAATTGATATCAATGATTTATATCATTTCAACACTCATGTATTATGTTTTCAGGAAATTAAATCATGCTAATCATATGGCAATTCATGATCCACTAACGAATACATACAATAGAAGATACTTTATTGACTCATTGAAGAATATATCAAAACACCATGATTTCTCAGTAATAATGTTAGATATTGACAGTTTCAAAAGCATCAATGACAAATGGGGGCATCATATGGGTGATCAAGTCATAGTAATGGTTACCAGAATAATAAAAAAATCCATCAGGAAAGAGGATGTATTAGGGCGCTTAGGCGGTGAGGAGTTCGGTATTATCATTAAAGGTAATACTCAAAAGCTCTTGCTATCAATTGCAGAGCGAATCAGAAAAAACATTGAAGAGCAATGCTCGGAAAAATTATTATCGCATGGACCTGAGAAAATAACTGTCAGTATTGGTTGCTTTACTTCAAAAGAGAATAATCTCAGCCCATCTGAAATGTTAGTCAATGCCGATAAAGCGTTATATCAAGCCAAAAGAACCGGAAAAAACAAGGTGATAATTCACTCAAAATAAACACCTTTTTAAAATACAGCCC
CAATAAACTGCAGAATATTATCCCATATAATATCCTGCAGTTCGTAATGCACTATTCGATAATGGGTACTGTTGGCCATTCAATATCCGGTGCAGTTGTTGTATTAACACGGTTCAGCAACACCCGATACTTCTTCCAGGCTTCCAGCAACGAGTTTTCTTCCTCCGTTGCGATCTCCAGATCTACAGCATCCTGAAGTGGCGCAATATGCTCACTGAATTCCTGGATGTAGAACTATGTGGTGACGGTCTTCCAGCCATTCGGCTCCTGCTGTATCGAAGCATACCAGGCTATTTCAATATCGCTATGCTGCGGCAGCATTTAACCCCTTGTAATTCATCACCATAATTGATTTAATTCACAAACAAAACTATAACATGGTGAAATTAATGAAAAAAAACACAGATGATGGGGCTAAAATTTACACACCACTTACCCTAAAGCTTTATGACTGGTGGGTTTTGGGAGTATCAAATCGGCTTGCATGGGGATGTCCTACAAAGGAACACCTTCTTCCACACTTTCTGGAACATTTAGGTAACAACCATCTGGATATTGGTGTTGGAACTGGGTTTTACCTTACTCACGTACCTGAGAGTAGTCTGATATCTTTAATGGATTTGAACGAAGCTAGCCTGAACGCGGCATCTACAAGGGCTGGGGAATCAAAAATTAAACATAAAATTAGCCATGATGTTTTTGAACCTTATCCCGCGGCGTTACATGGTCAATTTGATTCCATTTCCATGTTTTACCTTCTTCACTGCCTGCCTGGAAATATATCTACAAAAAGCTGTGTAATACGCAATGCGGCGCAGGCCTTAACTGACGATGGAACTCTATACGGAGCCACAATTCTTGGCGATGGAGTTGTGCACAATAGCTTCGGTCAAAAACTGATGCGCATTTACAATCAGAAAGGCATCTTTTCAAACACAAAAGATTCCGAAGAAGGCTTAACACATATACTCTCAGAGCATTTCGAGAATGTTAAAACCAAGGTTCAAGGTACTGTAGTAATGTTTTCCGCTTCAGGGAAAAAATAGCATCCAACCGCAGCTCGCTCTTGCTTAAGACGTGCTGCGGCATAATCCCAATGATTACTCCCTGACAGGGTTCGTAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTCTTCCAGACTTCCAGCAACAAGGTTTCTTCCTCCGTTGCGATTTCCAGCTCAACAACAGTCTGAACGTACCAGGAACAGCCTCCTTCAGGGCTTGAAGGATATCAATGTTCGCTTCCTGTTAACTGCCCGACAAGTGCAACCAGTTCGCTTACCTGATTTTCCAGAGTGGTGATCCGGGTGCCTGCTGTTGCCAGATTTTCACGTAACGTTGTATTTTCCTCTTCCAGCGCGGTAACGCGATCATCTGTTTCACGGGCGACCTGAACAAGTAACCCCGTCACGGCGGAGTAGTCAACATTAAGATAGCGAGTTTCTTCGCGTAGCTCGTTGCCGTCAACGGTCGGACCTTGCAACTCTTCACCATAATGAGTAAACGATCCCACAGCTTCAGGTATCGCCTCCATTACTTCCTGTGCAATAACGCCAGCATAAGGCATTCCGTTTTCCTTGAGCGTGTAGGTGTACCCGTTCATTTTACGGATTGCTTTCGTCGCGTCGCTGATAACGAGAATATCGTCTTTAAGGTCGCGGTCTGATGACTGATTCAGCGTTGTGCAATTAATAGCGCCATTTACATCAAACAACTGGCCTGCTGACGTTTTTTGCGCATAAAACAGATACGCAGCAGACGTTCCAACCTCAAAAACGTTTTGTCGAGTACTGGAACCCCACACCCTGACAGAAAACGGTAGTTCTGCATTACCTGAGTTCTGTAAAACAAAACGATTGCCAGTCCCTGATTGTTTTGTAAGGGTTAAATCAACTGTTGAGTTAACCTCATCCTTGTTGATAGTGAGCGCCTGCGCTTTATCAGTTCA
TCCAGCGCGGCTGCTTTGTTCATGGCTTTGATGATATCCCGTTTCAGGAAATCAACATGTCGGTTTTCCAGTTCCGGAAAACGCCGCTGCACCGACAGGGGGATCCCGTCGAGAATACTGGCAATTTCACCTGCGATCCGCGACAGCACGAAAGTACAGAATGCGGTTTCCACCACTTCAGCGGAGTCTCTGGCATTTTTCAGCTCCTGTGCGTCGGCCTGCGCACGCGTAAGTCGATGGCGTTCGTACTCAATAGTCCCTGGCTGGAGATCTGTCTCGCTGGCCTGCCGCAGTTCTTCAACTTCCCGGCGCAGCTTTTCGTTCTCAATTTCAGCATCCCTTTCGGCATACCATTTTATGACGGCGGCAGAATCATAAAGCACCTCATTACCCTTCCCACCACCCCGCAGAACGGGCATTCCCTGCTCCTGCCAGTTCTGAATGGTACGGATACTCGCGCCGAAAATGTCAGCCAGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAAAGCCCGTTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGGGGGTGTTTTAAATAAAAACATTAAGTTACGACGAAGAAGAACGGAAATGCCTTAAACCGGAAAATTTTCATAAATAGCGAAAACCCGAGAGGTCGCCGCCCCGTAACCTGTCGGATCGCCGGAAAGGACCCGCAAAATGATAATAATTATCATCTGCATGTCACAACGTGCATCTACGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATCATATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGACACTAAATACGGGACAACCTCATGTCAACGAAGAACAGAACCCGCAGAACAACAATCCGCAACATCCGCTTTCCTAACCAAATGATTGAACAAATTAACATCGCTCTTGATCAAAAAGGGTCCGGGAATTTCTCAGCCTGGGTCATTGAAGCCTGCCGCCGGAGACTGTGCTCAGAAAAAAGAGTTTCTCCTGAAGCAAACAAAGAAAAGAGTGACATTACTGAATTGCTCAGAAAACAGATCAGACCAGATTGAAGCAATTTAGATAATCGTGCAGACTACGCCCCTCATATCACATGGAAGGTACTACAATGGCTCAGGTTGCCATTTTTAAACAAATATTCGATAAAGTGCGAAATAATTTAAACTATCACTGGTTTTATTCTGAACTAAAACGTCACAATGTCTCACATTACATTTACTATTTAGCTACAGAGAATATTCATCTTGTTCTTGAAAACGATAATACGGTTTTAATAAAAGGACAGGGTAAGGTTGTAAATGTAAGATTTTCAAAAAATAAATGCCTTATAGAAGCCACCTTAAAAGGATTCAAATCAGGAGAGTTATCATTTTACGAATACAGGAAAAATCTTGCTACAGCAGGGGTTTTCAGATGGATTACAAATATCCACGAAAACAAAAGGTATTACTATACCTTTGATAATTCATTACTCTTTACTGAGAACATTCAGAACACTACACAAATATTTCCGCACTAAATCATAACGTCCGGTTTCTTCCGTGCCAGAACCGGACTCGCTGGCATGATGAAATATGTGTACCCGGTAACCCCGGTGTGCATCGTTTTTGATTATTCCCGCACACTCGCGCAGAAGGAGTTCCCCGTCGGGCTACGGTCTCTGTTAATACGGGAATACGGCGACGATACAGCGCATGATGTGTCAGGCTTGAATACCTTTATCCTTTAAAAGGGATATCAGTTAAGTTATCCCGTGTAGGGTATAAACCATTATCAAAGCCACTCTGTAGGAAGTGGCTTTTGTAATGGCAATAAAAAGCCCCGCGAATGCGAGGCTAAATCCTGGTATTTGTAATGACTGGTTCTTATCTCAACGCAGCCCCTTACCGCGCGCAAAATGCTCAATATCAAGCATCAGCAATGAGATGTTTAA
TCTGGATTCACTCCAGAAGTGAGCACCACCCTGTCTACAGAGCCAGATGTGAAGGATGATGAGTAAAATTATCGCTATCATCGAAGGCATTGCGTCCTGATGTATTCCTGAAGCGTTCTCAGTGCTGTTTGGTCGCGGATAATTCCGTCCCGGATACCGAGAACGTTTCGTCCAGCAACTGGAGAGAGTTCGACGGTGGCATCATTGCCCATGCCGGAGGCGCTGGAGGTTTCGGCTGAGGATGGCACAGGGCATTTTCCTTTGACGAGCACCCGACCACCATTATCAAGCTTGCGCCGAAGAGCATCATTTTCAGCTTTCGCATCAGCTAACTCCTTCGTGTATTTATCATCGAGTGCATCAGCATCACGCTGGCGCTGCTGCATGTCAGTAATGGTGGCGTTCGCCTTCTCCAGTTCACTGGCCTTGTTATCGCGCTGTTCTTTGTAGGCGATTGCATTATCACGGTAATGATTAACAGCCCATGACAGGCAGACGATGATGCAGATAACCAGAGCGGAGATAATCGCGGTTACCCTGCTCATTGTTGCCCCCACAAACAGACCTCACGCTCAATCTCACGACGAGTCATCAGGCCTTTCCATTGCTTACCGCCAGCGTATGCCCAGCGACGTAGCTGGTCACATGCGCCCTTGATATCGCCCTGGTTTATTTTGCGAAGAAGAGCGGATGTTCTGAAATTGCCTGCGCCCACGTTATAGACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCTACCGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCATTCTGCTTCGGTATACGTTTTACCGGGAATGATGTCTTTTCCTGTATGCCCGTGACATACAGTCCATACGCCAACGATATCTTTGTATGGTATGTAGCTGACACCTTCCAGACCATCGTTACCACTTGGGCCAGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACGGCTTTTCGTAATGATGGAGGCATTATTCACCTCTCGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAAGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGCGTGACTTTATCGAGCAGCTGTAAAAACCAGTAGCCAGCACTGCCTGCGGAGGTGCCGTAGGCAATGCCCGTTGTTAACTTATCCATGGATTTCATAGCCTCACCTCCGCAAATAACGGATGGTGTACACGGTTCGGAACGAAGAGGAAAGGTATAGAAGTTACATTAGCGTAAGGCTTGAACATCTATTCAAAAAGAAAAACGCCGCGATTATTCTGGCGTAGCTGAAAGCATCATACAATTATCAAATACGAAAATTACAAAATCATTAAAACGCATCACGTTACATCATGTCTTTTTCTAAAAAAAATCTTGATGAATATTGATGGGGAGGAACACCAAAATACCTTCTGAAAACACTTACAAAATATGACGTGTTTTCATAACCGCATATCTCAGCAACTTTTCCAACAGAATATAAATTGTAGCTTAATAACCTTTCCGCCATCACCATTCGCTCTTCAAGAATTAACTTACTAAATGATAAGCCTTCGTGCTTTAATTTTCTTTTTAACAGACTTTCACTCAGATACAGTCTTGAAGATATATCACAAAGTCTCCATGCTGCAGATATATCCGTGTGAATAATAGCCTTAACTTTACTTCCTAAACTATTAAGACATCCAAATAAAAAACTTTGCACTATTTTCTCTGAAGATAAGATAGCAAGACATGCAAGTGATATTTGATTTCTAACAAAATCCACAGTTCTGCCATCACAATTCAAGCATGCAATCAAGTTCTTTAACAATGAAAAATCTTCACATTCCACCATCAAGTATGCCGGATAAAACCTTCTTACAGAAAAAGGTGAGAGTGTGTTGCTTTTAAAGAAATCATTA
ACTGTTTTTTCTTCAACATCTACGATCATTACATGATCTATATTTGATGAAAAAAATCTTTTAAATTGTAATCAATGAGAACAGCACTTCCTTTTTTAAACAAAATATCTTCTTTACCAATTCGGACATCAAACGAATTCAACACCAAAATGATAGAACATATGTATGGCATATTATCCACCTGATATCATTGGGGTTACACCAGGTAAGTGTAGGTGGAAAATCAATATTCGCCAGTTCAACAATAAGGAAAATTTCATTACATCACAAGTATAAAATTATGTATTTAACTCACAAAGACAAATTATTAAACCAATCTGTTATATTATATATAGCTGCGTGGAATCATAATATCATATATTTTGACTGGCATGTTTACCAACTTTAAGTTGCATCTCAATTGTTTCTTCAGCGTAAACAGAGTTTTTATACAAACTGACACTCTGGGTATCATAGTGTAGTTTTTACGATTGTAAATATCCTGCATGCAGGAACTCATCCTTTTGGATGATATCGCATACAATTAATTTACCATCAGTCTTAGAGCCAGTTCGTCCGGATAGGGATCGAAGTAATTTTGTGTAAGCAAGTAATCATTAGGATACTCACCCAGATAATGCTTCAGCAGAGTCAACGGCGCAAGAAGAGGTAATGTGCCAGAACGATAGTTAAGTATAACCTCGCTCAACTCTTTACGCTGGCGTGTACTTAAGTAGTTACTAAAATACCCCTGTATATGCATCAGCACATTCGTGTGATTTTTACGTGATGCAGGTTTTCTGAGAATCGCCATCAGCTTATCACGATACACCTCAAAGTATGATTCAAGGTCCGCCCACTCGTGTATTGCAGCCACAAATGGTCCCATATCTTTATAGCCTGCCTGACTATGCGCCAACAACTGAAGCTTATAACGACTATGAAAAGCTAATAACTCTCTTCTTGATAATTTCTCCTTGTAAAGGTGATTGAGCTCATGCAAAGCAAAAACTCTTTCAACAAAATTCTCACGAAGCACTGGATCATGTAATCGCCCATCCTCTTCAACCGGTAGCCAGGAAAACTTTTCCATCAAAGTGCTCGTAAATAGTCCCACTCCATCTTTACGACCTCGATTACCATTTTCATCATAGACACGCACGCGCTCCATGCCACAGCTGGGAGATTTAGCACAAACCACAAACCCCGATACATCCTTTAATTTGTCCATATAAGAACGACTAAACTCTGTCATTCTCTCTGTCACATCCTCATTCTGGTCGTGGCTGAAACACATCCGTATATTTCCTTGCATCGAGCGCACAAGACGTAGAGCAGGACGCGGAACTGGCAGCCCTATAGCCATTTCCGGACATACTGGTCTGAATGTTACCCATTCCACTAATTTGTCCATTAAAAAGTCAGCTCTTTTGTGACCACCATCAAAACGAACAGCAGAACCGGCCAAACAACCGCTGATTCCAATCACAGGTTTTTTTATCATATTCTCCCCCTTGACTAATTCATTAACACATAAACTGTGTAGTGCACGGAATAAATTGCCTTTCTGGCGTCATCACTGACAATTTTTCTGTTATGGACTATTCCTAATATAGTATGAAAGTTCTTTAAGTGATCGGTCGTAATCATCTATCTTTCATACTTACTCTCAACTATCAAAAGTACAGGATTTATTATGAAGTTATGGCCTGTGTTGACTGGCATTGCACTCTCTTTCACTCTTATAGCATGTAAGGCCCCGACACCACCTAAAGGTGTGCAGCCGATTACAAATTTTGACGCCAACCGCTACCTCGGAAAATGGTATGAAATAGCTCGCCTCGAGAACCGGTTCGAACGTGGTCTGGAACAGGTCAGCGCTACTTATGGAAAACGGAACGACGGAGGGATTCGTGTACTTAACCGTGGATACGATCCAACGAAAAATAAATGGAGCGAGAGCGAAGGTAAAGCATACTTTACTGGAGATACTAAAACTGCAGCATTGAAGGTTTCGTTTT
TTGGCCCCTTCTATGGTGGCTATAATGTAATCAAACTGGATGATGAGTATAAGTATGCTCTTGTCAGTGGTCCGAACAGAGAATACCTATGGATTCTGGCAAGGACCCCAACTATTCCAGATAAAGTAAAAGCAGACTATGTGCGAACCGCTCAAAAGTTGGGATTCAATGTCAATGAATTATTATGGGTTAAACAATAAAATCCCTACCCGAAATAATACTTATTAGAAAAAAACCAGCCTTTGGGGAGGCTGGCTAAATCAGGAAACAAGCTGTTATATGATAATAACTACGTTGTGATTCCAACATTTAAAATGTTAGACTAATGACAATCAGACAGCAACTTTTCCTTTAATTATTTCGAACAATCAGCATCCATCTCCAATCGGAGATCCAACACCATCAGCATACCCTCCACTACGCCCTCAGCTTTCTGAAGCATCCTGCCAACCCAACAATCAGATCGCCCATGCTTACGTGCAAGCGCCATAAAAGTCATGCCGCCGACATAATAGTCCACCAATAAATCATGCAAATCGCTGTTGTTCTTTTTCAGACGGGCCATGCACCCGCAAATGATCATCGCATCATCGTCACAACATTGCGGGCGAGATTTTACTTTTGAAGGAATTAATCCCTTAAAACCGGCGGCAATGGACGACCAAGTCACATCTTCATGATTATTAGCCGCCCACGCTCCCCAACGCTCAAGAACCATCTGAATATCACGCATCAACTTACTCCACAAAAATCAGACCAGAACGCCAATCACAAGCAAAAATCAACAAAACAGTATTAGTTGATTGTTATCTCTGACTTCATACTCCTGCTCCTGTCAGGGTTTTGGCGTAATTCTTCAGTATTCGGTAATCGGTCAAAACAGAACCGGGGAAACGATATAAGCGCAGATGCCCCCAGCGGTGGCGAAGAAGTTCTGCCATATAAAACTCAAACATCATTCATTCCCCATTTCGGTGATGGTCAGTTCCAGCCTCCCACCTTTGGTAACAGGCATCTTCACAACGCGGTAATCAACGACCTGAGCATCATCCAGCCAGAAACCTGCTTTAGTGAGTGCGTCAAAAGCGGCTTTTTGCAGATTATCCAGGTCACGGCGACGGCGATCCGGCATGTGGCACTCAATGCGGATTTTCACAGGCATAGCCAGGCCGATATCCAGCATTGCGTTTTTAATGATTCGGGCGACGTTATCGCGGTATGCCTGCCCCTCTGCGCTGACGTGCGTGCGCCCGCGATTATGGCGGTAATAGCGATTATTGCTCGGAGGCCAGGGTAATGTGATGTTGTAGGTATTCACGCCTTGATTACCCCCTCTTTCATCCAGATAACCTGCGTTCTCGCCATACCTTCCAGCGCGCATTCTTTTGCATATGCGGCATCGACAAAATGTGTGCGGCGGTCGATTTCGTCGTGGCAGGCAGAACATGCAATGGTGGCAATCAGGTCTGGCGGTTTGGTACCGGTGCCGCACAATCCAGTCAGCCGGATATGTGCCAGTACAGACGTTTCAGGGTTGCCATTACATACGCCAGGGGTTCTTACCTGGCATTCCCGACCACGCGCTGCTTTTCTCAAATCAGCCATGATTCCTCCTTGCTGCCAGTCGCAACCATTTTTTATCAACCAGGCTGGCGGTATATCCGAGCAGTGTTGGTATTTCGGATGGCTTCAGCTCAGGTTTACGCTTACGACGATTTGGTACTCTGTAGATGTGTCCGTTCATGACACGAATAAGCGGTGTAGCCATTACGCCTCCTGCTTGTCGCGCAGCAGCTGGAACTCGCAGCTCTGCGGAATAGTCAGGTGGCAGCCAATATTCATCGCCCAGGCTTCAACCTTACACAGGAAGACATACATCTCTCCGGCATCAAGATCGGAGGTATGGCGTAACGACTGGATAGTGGTGATTTCACCGGTTACGACATCAACCAGTTCTTTGGTTTCATAACCGAGGTATGTGTGTTTGAGAGCTTCTTTTA
CCCATGCTGCGGTAGCGAACGATTTCCCCCTGCTGATAAGGTATTCACTGATTTCGCTGTACCACATGTGGCTGAGTGCATTCTGGGAAAGACTGCGTTTCTCACGCCACGGTTTAAGCACCATGCGAAAGCATTTTCCGTCCTCCAGATAAGGCTGGATCTGCTGGCCGATAGCGGTGAAGTTGCCGCGATGTAATTTGATGCCATCTTGTGGTAGGTTCACGCTTCACCTCCGCAGAGGTCAAACGCAGGATGCAAAAAATCGCAGGTGCATTTCTGCATCTGTGAAGGGAGAAGAGAGTTTGGATTGTATGTGCGCATAAACGTCCCCGTTTAGCGCAGAAGTCACCGGAGTTGTTCAAGCTCCGATGACTTTATTATTACGAATTGATTTTACAAAATCAAAAGGTATGTTAGTGACGCGGGTCTGTTATTATGCGAGAAGGGTTTCCGTATAAAACAAGGACCTTACTTCCTTGAGTAAATAACGGATCTTTGCCTTGAACAATGGTCATTAAATTCCCATTCTCAGTTTCGACAACATATTCCATGCCTGTTTGTTTTGTTGCTGAAGATTCGATTGCTGCCCCGGCAATACCACCAATGACTGCACCACCAACGGCACCAACGATATTAGAACGAACTCCCCCACCAAGCGCAGAACCAGCGGTTGCCCCCACGGCAGCCCCAGCAGTCCCGCCTAACGCGGAAGTCCCACTGATATCAACCCCCCTGGCACTAATAACTGTACCAGCGATAGTTCGATTAACCATGCCCACAGAGCCAACAGAATAACTATTTGGCGATATATTTTGTGCGCATCCAACCAACACTAAGAGTGGAGCAATTACGAATAATCGCTTCATTTAGCTACCCTAACAGGAAACATTGGACGAGAAAGATCAACACTTTCTAATGCTTGCAAGAACTGCGTTATGTTGTTTTGCACCGCGCGATTAACAGATTCGCGTGCTCGAACAATACCGTAGAATGCGTAACTGGCTGGAACAGTACCGGTAGACTCAATATCCTGCGTATATATAATATCACCATTCGCACGGTTGATTATTTCATACCTTGCAATTGCTTTAGTTGTCATTGAAACACCAAAAGCAGGAACGTCAAGAGCCAACACTTTAACATTTAAGCTAACCGTATTTGGTGAACTATCACGAAAAATAGTCATTCGGTCGAGTGCTTCCTGCAAAGATTCACGCCAAATTGGAGTTATAGCCTCCATACCAGCAGTGATATCCCCTTTCTGCTCATCTGGACGAGCAAGTGATACCGTTAATGACTTAATTTCAGCATCTATTTTTTTCTGGCTAACTCCCACGTTAGGTGTTGAAAAATTCAATGGTGGCACACTAGCGCAACCTGTTAAAGAACCAATAATCATGGCTAATAATATTATCTTCTTCATAAATTTACCTTATTGTTATAACCAAAGGAATTATAAAGTAAAAAAGTTCACTATCACTAGCCATTAACGACATCAATTTCAGAGAAACATGGTACTCATTTCCACAAATTTGACACAAGTCATTTTCATCTACATATTCCATCATACTTGATGCATATGTTATTGAAGCCTCTATCCTATCCGTTCATAATAGCAATAGTTACCCGGGTGATAGTACCTCTATGATTACTCGTCTTTCTGATTGATTGGATTAAATATGCGCGCCAAAATTTATCAACTTTCGTTATGGATATTTATTTCGTTTCTAGCGATCTATGCCTTTATTATCTATAAAGGTTCTTATATTGGAGTAGCATTGCATCAAATTGCTTGGATCATCATTATTGCCTCTGGCTTGATTGCTAGGCTAACTAAACCAAAGCAAAAACCAATTTCGTCCAATAATTAGACATGTATTAAAAAATGATATTTTTATGTACATAGTCTATTGAAAATTGCCGCGATAAAATGCCAACACCCGCTTCATCGCGGCACTCTGGCGACACTCCTTGAAAATCAGATTCGTGCTCACCTTTCC
TTCCCGTTCTTCCCTGGTAGCGAACCGGTAATACACCGTTCGCCAGACCTTACCATCAATAACTAAGATTCCTGTCCGCGCCATTTTAGCCGCAGCCTGGTTTATGCTGGTTACTGTTGCGCCTGTTACCGCAGCAACGTCCTGCGCACAGAAGCTCTTATGCGTCCCCAGGTAATGAATAATTGCTTCTTTTCCCGTCATACACTGGCTCCTTTCAGTCCGAACTTAGCTTTGATTTCTGCAATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCCCATGACAGGAAGTTGTTTTACTGGTTCAGGGATCGCCTCACCACGGTTAATTCTCGCAGTCATATGGACAAGCTCATCTGCGGCCTTACGGCGTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTAACCAGCCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACAGGCCTCGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACGTTTGGCAGTCCGGCGATAACGGATGCTTCTTCCCGGCACCATGCAACAAACTGCCCGGGTGATGGCAGGAATGGTCGATTCTGCCGACGGGCTACGCGCATTCCAGCGTTAACCTGTTCCATTGTGGTGATCCCGTTTTCCCGAAAAGCCAGCACCCACTGGCGGCGGATTTCGTTCAGTTCGTTCTGGTCCCGGTTAGCCAGGCTCGCTGGGAAAGTTGCCAGTAACTGGCTGAATACACCGTTGATTATCTGCGCTACCTGCTGTACCTGCGGCTTTTCGTCGTACTGTTCCGGCATGTTATTGGCGATCCGGCACATCTGCTCACGGTCAAAGTTAACCATCTGTGCGGCGATGTTTTTCATAAATCCACCCCGTAAATCCAGTCAGTGTTCGTCAGGTCGAGTTTTGGTTTGCCGGCTGTCACGCCAGCCTGTTGCTTGTTTCGGTTGATTTCGAGCTGGGTCCACTTGTCGCGGAGTTTGGCCGGACTCAGCACGTTACCGGACCAGAAGTTGTCCTGGCAGGCCCAGCGGAACAGTACACACATGTCGCGGTGGTTACGTCCATCACGTTCACGCATCAGACGGATATCGTTAGCCCACCCTGCAAAATTCGGTTTTCTGGCTGATGGCGCGATGGTCTTCACCATGTCAAACATCCACTCTGCGGCGGTCAGGTCTTCTGCTGTCCCCCACTTGCTGCCGTTCTGAATCGCAGCATCCGCTTTCACCACAGGAAGGTCGTTTTCTGGCAGGTCAGAGGATTCGCCAGAATTCTCGGACGAATAAGGTTTTATATTGTCTTTTGTTAGTTTGTCTTTTGTGTTTACCTGATTCGGGTAAGTGCCTTTACCTGATTTGGGTAAACTTTTCTTACCTGATTCAGGTAAATTTACCTCTTTCAGGTAAACTTTATTTTTCTTACCTGATTCGGGTAATGTTGACCATTCACTGACCACATTATTAATGCCTATATTCCGCCCGCTCTGAATAAAAATCCCACGCTTTACCAGAACACTTTTTGCAGCAGAACACTTGTGCGGCAATATCCCGGTCAACTCGGAAAGTTGCTCGTTGCTCACCCAATCCAGTTTTTTATTAAAGCCATATGTTTTGCGCATGACAGCCAGGAAGACCAGAAGCTGGTGCTGTGTTAATCCGGCCAGCATTACAGCTTCCAGCAACTCATTTGCAATGCGCGTATAACCATCATCGAGATCTGCCACGCGCGGCTCCTTTTGTGCCGCATCCGGCACTGGAAAATTGAATATCTCAGCAGTGTTTGCCATAATTCCTCCCGCAATGAGTGTGTTACGATTTGCACCTGAAAGTCGGTTCTGTTCCAGCAGACCGGCTTTCGCCATTTCTGAACCTGTCATATCGCCCCCAGCATGGTAGTAACCATCGCCATCAATGGACCAGCCAGATCTGGGTCCACACGAAACATCGACACAATACCTTCAC
TAATTTCCTTCAGTTTCTGGTGGCGTGGTGCGTTGAGAATGACAGCCTGTTTTGCCTCACTGAGTTCCTTTTCCATTTCAGCCAACCTAGCCATGAAGCTATCCTGCTCAACCAGGTAACCGCGATATTCCAGCGGTAGTACCGCCAGAATTGCCGGGGTCAGTTCACGCACGTTATTTCGGTATTTTTCAGAATCGAATTTGTTATCGAGGAAGCGGAACAGCTTCTGGCGTGCACGGCTGACATCATCAGGGAAATCGATGGTGCCGCCGCCCTGCTCCCGATACTCATTCACAATGAGTGTGGCAACGACATCCTGATTATCTTCAGCCGACCAAGCGCGGACGGCATCACGGATTTTTTCGTGGCCTGGCACCTGTTTTGTTTGAGAACGATTTATCACCGCAGTCGGGCTAAATCCGCTAGTCTGTTGGTATGTAAGTGGTTGCATAATTGACTCCTTTAGTTTGAATTGACTGTTAAGTTGATTGCTTATTGTTAAAGAGCGTGAAATGGAAATTTAAGCTGCGTTCTTTTCGGTGTGTGGAAACAACTTCGGAAGATCCGGGCGAATCTGGTATGCCTTCACTACTCCACCAGTAGCCGTAACAATGCTGCCGACATGTTCAGGGGATACCTTTGCTTTGTTGTGAAGCCACTTATAGACGGCCTGCTGTGAAACTTCGCAAGCAGCGCCCAGTTTCTTTTGTGAACCAACGATATTGATCGCTGTTTTGATAGCTGGGTTCATAACAACCTCCGTGGTTAATTTGAATCAAGATTAAAACTATGGTTGTTTTTAGTCAACAACCATTTTCGTTTGATGGAATAAAACCTTGGTTGTACATTTGGACTATGAAAACAACACTCTCAGAAAGACTTAAAGAAGCCAGATTAGCGCGAGGCCTTACACAAAAGGCGCTTGGGGATTTGGTCGGGGTTAGCCAAGCTGCTATTCAGAAAATCGAAACAGGGAAAGCTAATCAAACAACTAAAATCGTGGAGATCGCGAACGCTTTGGGTGTGCGCGCAGAATGGTTATCTTCTGGCGTTGGAAATATGTCAGACAGTACAGTGCAACCAATACAATCAACTGTCAGCCATTCCAAATACTTCAAGATTGACGTTCTTGATATAGAAGTCAGTGCTGGGCCGGGAGTCATCAACCGTGAGTTTGTAGAAGTTCTACGCTCGGTTGAGTACTCGTTTGACGATGCTCGTCACATGTTCGATGGTAGGAAGGCGGAAAATATCCGCATCATTAACGTGCGTGGTGACAGCATGTCAGGAACGATCGAACCAGGTGATCTGCTGTTCGTTGATATCACAGTTAAATCTTTCGACGGTGATGGTATCTATGCGTTTCTGTACGACGACACAGCCCATGTAAAGCGCCTGCAAATGATGAAGGATAAGCTGCTGGTCATCTCTGATAACAAAAGCTACTCACCGTGGGACCCGATCGAGAAAGACGAGATGAACCGGGTGTTCATCTTCGGTAAGGTTATTGGGAGCATGCCGCAGACATATAGGAAGCATGGTTAAAGTGAGGCTAAAAAACAGTTACAGCAATAGGCCTGTTGTTTTTCTTTAAACACGCAGTGTTAAACCGCTCTTTGAGATGCGGAGTAATGAGATGGAAGACTTGAATCACATAAGGGTTAGTGATGGAGTGCGTAGCGAGCAGCAATAGTGCAATACCTAATGTCGTTGAAGTAATACGTCGCATCAATGAAGGTTCCACTCAGCCATTTCTTTGCAAATGTGATGATGGGCAGTTGTATGTTTTGAAGTCAAAACCATCAATGCCCCCGAAAAATCTCTTAGCTGAGTTCATTTCGGCGTGTTTGGCTAATGATATCGGCCTTCCTTTACCTGACTTTAAAATCGTATTTGTGCCAGAGGAACTTATAGAGTACTCACCTGATCTGCAGCAACAAATTTGTACAGGATATGCCTTTGCTTCATTGTTCATTGACGGTGCAATAGCGTTAACGTTT
ACGCAGTCAAGAAACGAAACGATCATCCCAGTCGAACAGCAAAAATTAATCTATGTTTTTGATAAATGGATATTAAATGCAGACAGAACGCTTACTGACAAAGGTGGAAACGTTAACATCCTTTATGACATCAGTAACGATAAGTATTATCTGATTGACCATAATCTCTCATTTGATCAGAATGCTGGACCTGAAGATTTTTCTGTGCACGTGTACGGCCCTGGTAACCGCAAATGGCAATATGATTTAGTGGATCGCGTAGAGTACCGCCAGAGGGTCGTTAACAGTTTACACAAGCTTCCTGCTATCCTTGACGAAATTCCAGAAGAGTGGATAGTAGATGAGGAGTTTTTACCTTTTGTCTGCACTACGCTAGACAAAGGTGATTGTGATGAATTTTGGAGCGCAATAGAATGACAACTCCATGCCTATATAGCATCGTTCGCTATGCGCCTTATGCGGAGACTGAAGAATTCGCAAACATAGGCGTACTTCTGTGCGCGCCAAAAGAAAATTACTTTGATTTCCAGCTCACAAAGCGAAATGACTCTCGTGTAAAGAATTTTTTCCATGATGATTGTATTTTCCCTGTAGCAAAAGACTCAATACAAAGAGAACTACAGTTCGCAAAAATGCATGCGACCCAGATTGTTGGACATCAACAACTTGCACAATTCTTCAGATATTTTACAAACAAAAAAGAATCAATTTTTCAGTTCAGTTCTACGAGAGTGATTCTCAGCGAAAACCCAAAAGAAGAGCTGGCCCGCATTTACAATAAATATGTAAACCACTCTGACTACACAAAAGAGCGCCGTGAAGATGTTCTAGCCAGAGAGCTAAAACGAAGTATCGATAGAATAGATGGATTGAAGAACGTCTTCAAACAAGCAACCATTGATGGGTATTTCGCAAAGTTCTCAATGCCATTGGTCGCCAAGAAGCATGACAGGATCCAATGTGCCATCAAACCTCTGGCATTCACTCAAGCTGAACCAGGAAAAATGATGGAGCATAGTGATACTTGGGTGATGAGAATAACTCGAGCAGCAGAAGAAAACCTGCTTTCACTTGATGACATTTTATTCACAATTGAAACTCCTGAATCACCAAACTCAGGCCAAAGCAAAGTTATTGACATCATAAAGAGAACTATGGATGCTAAGAAAATAAATCATATACCTGCATCCAACCACAAAGAAACTATTGATTTTGCAAAAAAAATACTTCCCCAAGTTTAAAATTTATTTTTGTATGTGATATTCCTTATTAATAACCCGGCCACCGTGCCGGGTTTTCTTTTGCCTCCCCTCATCACACAAACCGCTCAAAAAACCACCATAACCTCGCTTCAGTTATCGCTATGCGATTCAAGTCACAAAATAAATCCATCCTAAATACAACCAGTTATATCTAAAACAACCAATAAAACAACTTTTGTTGTTGACGGTAAAACAACTATAGTTTTAAATAGGTTCATCGCAACAACACAACGATACGGCAACCACCTGATTCACCGTTGCGATGACCGCTTAGATCCGCAGTTTGAATTTCAGCAGGCTTCGGGGAGTGCGAGGGGTGAAACGGACGCGTGAACGTCGGTGTGACCAGCTGAAATCAACTCAACATTTCATACCTTAGTCGCTTCAACGAGGCGGCTTAGTTATGACAACCGGCGGCCATCCACCGCCTGAATACGCGCAGAAGTCTCTATATGTTCAGCAGCCCAGCTTACGGGCAGGAGTTTTTATGGTTCATCAACATTACGGAACGCAGACCGTTAATCGCGGCGCGGTCATGCCAGGAATGCTGGTCAAACACAAAGATGGTACCTGGACTGCATCAGCTAATTTACGCGGACGGCTTTATCTGCATCGCGGCATCGAGCGCACTTATACCCGTGATTTGCTCGTGGAAGTTTTTCTCGACGGACGCGGTAACGGCCTGAATCACTAATCCCCTTTCCTGTTTTCCTAATCAGCCTGGCATTTCGCGGGCGA
TATTTTCACAGCCATTTTCAGGAGGTCAGCCATGAACGCTTATTACATTCAGGATCGTCTTGAGGCTCAGAGCTGGGCGCGTCACTACCAGCAGATCGCCCGTGAAGAGAAAGAGGCAGAACTGGCAGACGACATGGAAAAAGGCCTGCCCCAGCACCTGTTTGAATCGCTATGCATCGATCATTTGCAACGCCACGGGGCCAGCAAAAAAGCCATTACCCGTGCGTTTGATGACGATGTTGAGTTTCAGGAGCGCATGGCAGAACACATCCGGTACATGGTTGAAACCATTGCTCACCACCAGGTTGATATTGATTCAGAGGTATAAAACGGATGAGTACAGCACTCGCAACGCTGGCTGGGAAGCTGGCTGAACGTGTCGGCATGGATTCTGTCGACCCACAGGAACTGATCACCACTCTTCGCCAGACGGCATTTAAAGGTGATGCCAGCGATGCGCAGTTCATCGCATTGCTGATCGTCGCCAACCAGTACGGCCTTAATCCGTGGACGAAAGAAATTTACGCCTTCCCTGATAAGCAGAACGGCATTGTTCCGGTGGTGGGCGTTGATGGCTGGTCCCGCATCATCAATGAAAACCAGCAGTTTGATGGCATGGACTTTGAGCAGGACAATGAATCCTGTACATGCCGGATTTACCGCAAGGACCGTAATCATCCGATCTGCGTTACCGAATGGATGGATGAATGCCGCCGCGAACCATTCAAAACCCGCGAAGGCAGAGAAATCACGGGGCCGTGGCAGTCGCATCCCAAACGGATGTTACGGCATAAAGCCATGATTCAGTGTGCCCGTCTGGCCTTCGGATTTGCTGGTATCTATGACAAGGATGAAGCCGAGCGCATTGTCGAAAATACTGCATACACTGCAGAACGTCAGCCGGAACGCGACATCACTCCGGTTAACGATGAAACCATGCAGGAGATTAACACTCTGCTGATCGCCCTGGATAAAACATGGGATGACGACTTATTGCCGCTCTGTTCCCAGATATTTCGCCGCGACATTCGCGCATCGTCAGAACTGACACAGGCCGAAGCAGTGAAAGCTCTTGGATTCCTTAAACAGAAAGCCACTGAGCAGAAGGTGGCAGCATGACACCGGACATTATCCTGCAGCGTACCGGGATCGACGTGAGAGCTGTCGAACAGGGGGATGATGCATGGCACAAATTACGGCTCGGCGTCATCACCGCTTCAGAAGTTCACAACGTGATAGCAAAGCCCCGCTCAGGAAAGAAGTGGCCTGACATGAAAATGTCCTACTTCCACACCCTGCTGGCTGAGGTTTGCACCGGTGTGGCTCCGGAAGTTAATGCTAAGGCGCTGGCCTGGGGAAAACAGTACGAGAACGACGCCAGAACCCTGTTTGAATTCACTTCCAGCGTGAATATTACTGAATCCCCGATCATCTATCGCGACGAAAATATGCGCACCGCCTGCTCTCCCGATGGTTTATGCAGTGACGGCAACGGCCTTGAACTGAAATGCCCGTTTACCTCCCGGGATTTCATGAAATTCCGGCTCGGTGGTTTCGAGGCAATAAAATCGGCTTACATGGCCCAGGTGCAGTACAGCATGTGGGTGACGCGAAAAGATGCCTGGTACTTTGCCAACTATGACCCGCGCATGAAGCGTGAAGGCCTGCATTATGTCGTGGTTGAGCGGGATGAAAAGTACATGGCGAGTTTTGACGAGATGGTGCCGGAGTTCATCGAAAAAATGGACGAGGCACTGGCTGAAATTGGTTTTGTATTTGGGGAGCAATGGCGATGACGCATCCTCACGATAATATCCGGGTAGGTGCGATCACTTTCGTCTACTCCGTTACAAAGCGAGGCTGGGTATTTCCCGGCCTTTCTGTTATCAGAAATCCCCTGAAAGCACAGCGGCTGGCTGAGGAGATAAATAATAAACGGGGAGCTGTATGCACAAAGCATCTCCCGTTGAGTTAAGAACGAGTATCGAGATGGCACATAG
CCTCGCTCAAATTGGAGTCAGGTTTGTGCCAATACCAGTAGAAACAGACGAAGAATTTCATACGTTAGCCGCATTCCTTTCACAAAAGCTGGAAATGATGGTGGCGAAAGCAGAAGCAGATGAGAGAGACCAGGTATGACAACCACTGAATGCATTTTTCTGGCAGCGGGCTTCATATTCTGTGTGCTTATGCTTGCCGACATGGGGCTTGTTCAATGACACCTCAGCAAGAAAACGCCCTTCGCAGCATTGCCCGTCAGGCTAATTCTGAAATCAAAAAAGCCAGACAGCAGTTTCCGGATAAAAACGTCGATGACATTTGCCGTAGCGTACTAAAGAAGCACCGCGAAACGGTAACGCTGATGGGATTCACACCGACTCATTTAAGCCTGGCGATCGGCATGTTGAACGGCGTCTTTAAGGAACGGTGAACATGAAAAGCAAAATCATCAGGGAGCTACAGGCTCCTTTTTTATTATTCGCATTCACCCTCAAGCGTATTAACCAACAATTCAGGGATTAATGAAAGATGGCAGACATAATTGATTCAGCATCAGAAATTGAAGAATTACAGCGCAACACAGCAATAAAAATGCGCCGCCTGAACCACCAGGCTATATCTGCCACTCATTGTTGTGAGTGTGGCGATCCGATAGATGAACGAAGACGACTGGCCGTTCAGGGTTGTCGGACTTGTGCCAGTTGCCAGCAAGATCTGGAGCTTATCAGTAAACAGAGAGGTTCGAAGTGAGCGAAATTAACTAGAAGCCAAAGATAAAATCATCGCTGAGCAGGAGAAAATCGCTAACGGAGAAAAGACAGTAAGTCAGTATATGAAAACCGCATGATATCATCAGATAAAAATCGGTCGTAAAGCGAAATATTAATACCAGAACAAACGAGTCGAGGTAAATTATATTACCTCGATAAATTAACTAAAACTTGCCCGCTATATACTATATCATTCAGTATCATCACGCGCGGTCTGTGCATATGTCACTACCGCACCTAATATATTAATTTTCTTTTCAACATAGATAATATTATCGTACTCATAATTGCCATACGGATAGCAAATGCGAATATTCTCATGTAGATCGGGGTCATCCACCTCAGCTCCAGAACAACTTTTTGAACTACCGGAAGTATACCGATACGGTGCAACATAAGACGATGTCTCTCCAGGCAAAAAATAAGTTAGTGTCGTAAGGGGTATAATCAGAAAAAATCCAGCAAATATGCACATCCCTGCATAAACCTTAAGGTATGCTGACAGACTCTTCCAGCCGCTTTGTTTTACTATCCCCTTCTTAACCCAAAACAGAGATAACAGAAAAGCTATTCCCATGCTAAACAGAATGTAATAGTGGGATATACTCTGATTAAGAAACGTGACCCTGTAGATATCTGCCCGCCACCAGAAGAAAAGGAAAATAAAGATCAGCCCTGAAACTGTCATGCAAATCAAATAAGGATACGAATCTTTTTTCATGTTTAGCGCCCATAAAATTTTTCCTGACCCGGACAAATTTACCATCCATTTTTTGCGCAGAAAATAGCTCATTACTTACTGCACAATAATACACAAAATTGCGTAAATTTTTTGCATGGATTTTAGCTCTTTCAGCCGACATTTAAGGGGTAAATAGCATTTCCTAAAAGCAACTGCACCAACCCAACAGAATGGGCTACCGCTTACGTTGAGAGCAAAAAAGTGTATAGCAGCAATGAACAGCATCCTCGCACTGACGAGGATTTCTTTTATCTGAACTCGCTACGGCGGGTTTTGTTTTATGGAGATGATAAATGCACTTCCGAGTCACAGGTGAATGGAATGGAGAACCATTCAACAGAGTTATCGAAGCCGAGAACATCAGCGACTGCTATGACCACTGGATGCTGTGGGCGCAGATAGCACATGCAGACGTAACCAATATTCGAATTGAAGAACTGAAAGAACACCAAGCCGCCTGATGGCGGTTTTTTCTTGCGTGTAATTG
CGGAGACTTTGCGATGTACTTGACACTTCAGGAGTGGAACGCTCGCCAGCGACGCCCAAGAAGCCTTGAAACAGTTCGTCGATGGGTGCGCGAATGCAGGATATTCCCTCCTCCGGTTAAGGATGGAAGAGAATATCTGTTCCACGAATCAGCGGTAAAGGTTGACTTAAATCGACCAGTAACAGGTAGCCTTTTGAAGAGGATCAGAAATGGGAAGAAGGCGAAGTCATGAGCGCCGGGATTTACCCCCTAACCTTTATATAAGAAACAATGGATATTACTGCTACAGGGACCCAAGGACGGGTAAAGAGTTTGGATTAGGCCGAGACAGGCGAATCGCAATCACTGAAGCTATACAGGCCAACATTGAGTTATTTTCAGGACACAAACACAAGCCTCTGACAGCGAGAATCAACAGTGATAATTCCGTTACGTTACATTCATGGCTTGATCGCTACGAAAAAATCCTGGCCAGCAGAGGAATCAAGCAGAAGACACTCATAAATTACATGAGCAAAATTAAAGCAATAAGGAGGGGTCTGCCTGATGCTCCACTTGAAGACATCACCACAAAAGAAATTGCGGCAATGCTCAATGGATACATAGACGAGGGCAAGGCGGCGTCAGCCAAGTTAATCAGATCAACACTGAGCGATGCATTCCGAGAGGCAATAGCTGAAGGCCATATAACAACAAACCCTGTCGCTGCCACTCGCGCAGCAAAATCAGAGGTAAGGAGATCAAGACTTACGGCTGACGAATACCTGAAAATTTATCAAGCAGCAGAATCATCACCATGTTGGCTCAGACTTGCAATGGAACTGGCTGTTGTTACCGGGCAACGAGTTGGTGATTTATGCGAAATGAAGTGGTCTGATATCGTAGATGGATATCTTTATGTCGAGCAAAGCAAAACAGGCGTAAAAATTGCCATCCCAACAGTATTGCATGTTGATGCTCTCGGAATATCAATGAAGGAAACACTTGATAAATGCAAAGAGATTCTTGGCGGAGAAACCATAATTGCATCTACTCGTCGCGAACCGCTTTCATCCGGCACAGTATCAAGGTATTTTATGCGCGCACGAAAAGCATCAGGTCTTTCCTTCGAAGGGGATCCGCCTACCTTTCACGAGTTGCGCAGTTTGTCTGCAAGACTCTATGAGAAGCAGATAAGCGATAAGTTTGCTCAACATCTTCTCGGGCATAAGTCGGACACCATGGCATCACAGTATCGTGATGACAGAGGCAGGGAGTGGGACAAAATTGAAATCAAATAA
Protein sequences of DBSCAN-SWA_3 >NZ_CP040886|594426:621630|618879_619101_+|WP_000763365.1|DBSCAN-SWA MADIIDSASEIEELQRNTAIKMRRLNHQAISATHCCECGDPIDERRRLAVQGCRTCASCQQDLELISKQRGSK >NZ_CP040886|594426:621630|613205_613895_+|WP_000858975.1|DBSCAN-SWA MKTTLSERLKEARLARGLTQKALGDLVGVSQAAIQKIETGKANQTTKIVEIANALGVRAEWLSSGVGNMSDSTVQPIQSTVSHSKYFKIDVLDIEVSAGPGVINREFVEVLRSVEYSFDDARHMFDGRKAENIRIINVRGDSMSGTIEPGDLLFVDITVKSFDGDGIYAFLYDDTAHVKRLQMMKDKLLVISDNKSYSPWDPIEKDEMNRVFIFGKVIGSMPQTYRKHG >NZ_CP040886|594426:621630|617465_618146_+|WP_001372450.1|DBSCAN-SWA MTPDIILQRTGIDVRAVEQGDDAWHKLRLGVITASEVHNVIAKPRSGKKWPDMKMSYFHTLLAEVCTGVAPEVNAKALAWGKQYENDARTLFEFTSSVNITESPIIYRDENMRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREGLHYVVVERDEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR >NZ_CP040886|594426:621630|610547_611249_-|WP_001372464.1|DBSCAN-SWA MKNIAAQMVNFDREQMCRIANNMPEQYDEKPQVQQVAQIINGVFSQLLATFPASLANRDQNELNEIRRQWVLAFRENGITTMEQVNAGMRVARRQNRPFLPSPGQFVAWCREEASVIAGLPNVSELVDMVYEYCRKRGLYPDAESYPWKSNAHYWLVTNLYQNMRANALTDAELRRKAADELVHMTARINRGEAIPEPVKQLPVMGGRPLNRAQALAKIAEIKAKFGLKGASV >NZ_CP040886|594426:621630|601525_601936_+|WP_000079508.1|DBSCAN-SWA MAQVAIFKQIFDKVRNNLNYHWFYSELKRHNVSHYIYYLATENIHLVLENDNTVLIKGQGKVVNVRFSKNKCLIEATLKGFKSGELSFYEYRKNLATAGVFRWITNIHENKRYYYTFDNSLLFTENIQNTTQIFPH >NZ_CP040886|594426:621630|608569_608671_-|WP_072157016.1|DBSCAN-SWA MRTYNPNSLLPSQMQKCTCDFLHPAFDLCGGEA >NZ_CP040886|594426:621630|612261_612801_-|WP_001182899.1|DBSCAN-SWA MQPLTYQQTSGFSPTAVINRSQTKQVPGHEKIRDAVRAWSAEDNQDVVATLIVNEYREQGGGTIDFPDDVSRARQKLFRFLDNKFDSEKYRNNVRELTPAILAVLPLEYRGYLVEQDSFMARLAEMEKELSEAKQAVILNAPRHQKLKEISEGIVSMFRVDPDLAGPLMAMVTTMLGAI >NZ_CP040886|594426:621630|610257_610551_-|WP_000145917.1|DBSCAN-SWA MTGKEAIIHYLGTHKSFCAQDVAAVTGATVTSINQAAAKMARTGILVIDGKVWRTVYYRFATREEREGKVSTNLIFKECRQSAAMKRVLAFYRGNFQ >NZ_CP040886|594426:621630|620363_620582_+|WP_001303849.1|DBSCAN-SWA MYLTLQEWNARQRRPRSLETVRRWVRECRIFPPPVKDGREYLFHESAVKVDLNRPVTGSLLKRIRNGKKAKS 
>NZ_CP040886|594426:621630|606705_607083_-|WP_001204777.1|DBSCAN-SWA MRDIQMVLERWGAWAANNHEDVTWSSIAAGFKGLIPSKVKSRPQCCDDDAMIICGCMARLKKNNSDLHDLLVDYYVGGMTFMALARKHGRSDCWVGRMLQKAEGVVEGMLMVLDLRLEMDADCSK >NZ_CP040886|594426:621630|607168_607309_-|WP_000971068.1|DBSCAN-SWA MMFEFYMAELLRHRWGHLRLYRFPGSVLTDYRILKNYAKTLTGAGV >NZ_CP040886|594426:621630|601235_601469_+|WP_000105084.1|DBSCAN-SWA MSTKNRTRRTTIRNIRFPNQMIEQINIALDQKGSGNFSAWVIEACRRRLCSEKRVSPEANKEKSDITELLRKQIRPD >NZ_CP040886|594426:621630|620559_621630_+|WP_000533646.1|integrase|DBSCAN-SWA MGRRRSHERRDLPPNLYIRNNGYYCYRDPRTGKEFGLGRDRRIAITEAIQANIELFSGHKHKPLTARINSDNSVTLHSWLDRYEKILASRGIKQKTLINYMSKIKAIRRGLPDAPLEDITTKEIAAMLNGYIDEGKAASAKLIRSTLSDAFREAIAEGHITTNPVAATRAAKSEVRRSRLTADEYLKIYQAAESSPCWLRLAMELAVVTGQRVGDLCEMKWSDIVDGYLYVEQSKTGVKIAIPTVLHVDALGISMKETLDKCKEILGGETIIASTRREPLSSGTVSRYFMRARKASGLSFEGDPPTFHELRSLSARLYEKQISDKFAQHLLGHKSDTMASQYRDDRGREWDKIEIK >NZ_CP040886|594426:621630|609212_609773_-|WP_000720581.1|DBSCAN-SWA MKKIILLAMIIGSLTGCASVPPLNFSTPNVGVSQKKIDAEIKSLTVSLARPDEQKGDITAGMEAITPIWRESLQEALDRMTIFRDSSPNTVSLNVKVLALDVPAFGVSMTTKAIARYEIINRANGDIIYTQDIESTGTVPASYAFYGIVRARESVNRAVQNNITQFLQALESVDLSRPMFPVRVAK >NZ_CP040886|594426:621630|616381_616678_+|WP_000995439.1|DBSCAN-SWA MNAYYIQDRLEAQSWARHYQQIAREEKEAELADDMEKGLPQHLFESLCIDHLQRHGASKKAITRAFDDDVEFQERMAEHIRYMVETIAHHQVDIDSEV >NZ_CP040886|594426:621630|602891_603389_-|WP_001372488.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCHGHTGKDIIPGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSALLRKINQGDIKGACDQLRRWAYAGGKQWKGLMTRREIEREVCLWGQQ >NZ_CP040886|594426:621630|594426_595716_+|WP_001356070.1|DBSCAN-SWA 
MTTDDLAFDQRHIWHPYTSMTSPLPVYPVVSAEGCELILSDGKRLVDGMSSWWAAIHGYNHPQLNAAMKSQIDAMSHVMFGGITHAPAIELCRKLVAMTPQPLECVFLADSGSVAVEVAMKMALQYWQAKGEARQRFLTFRNGYHGDTFGAMSVCDPDNSMHSLWKGYLPENLFAPAPQSRMDGEWDERDMVGFARLMAAHRHEIAAVIIEPIVQGAGGMRMYHPEWLKRIRKMCDREGILLIADEIATGFGRTGKLFACEYAEIAPDILCLGKALTGGTMTLSATLTTREVAETISNGEAGCFMHGPTFMGNPLACAAANASLAIIESGEWQQQVAAIEVQLREQLAPARDAEMVADVRVLGAIGVVETTRPVNMAALQKFFVEQGVWIRPFGKLIYLMPPYIILPQQLQRLTAAVNRAVQDETFFCQ >NZ_CP040886|594426:621630|610029_610221_+|WP_001403556.1|DBSCAN-SWA MRAKIYQLSLWIFISFLAIYAFIIYKGSYIGVALHQIAWIIIIASGLIARLTKPKQKPISSNN >NZ_CP040886|594426:621630|618499_618781_+|WP_001395510.1|DBSCAN-SWA MHFSGSGLHILCAYACRHGACSMTPQQENALRSIARQANSEIKKARQQFPDKNVDDICRSVLKKHRETVTLMGFTPTHLSLAIGMLNGVFKER >NZ_CP040886|594426:621630|618142_618325_+|WP_072126246.1|DBSCAN-SWA MTHPHDNIRVGAITFVYSVTKRGWVFPGLSVIRNPLKAQRLAEEINNKRGAVCTKHLPLS >NZ_CP040886|594426:621630|598727_599396_+|WP_000239881.1|DBSCAN-SWA MVKLMKKNTDDGAKIYTPLTLKLYDWWVLGVSNRLAWGCPTKEHLLPHFLEHLGNNHLDIGVGTGFYLTHVPESSLISLMDLNEASLNAASTRAGESKIKHKISHDVFEPYPAALHGQFDSISMFYLLHCLPGNISTKSCVIRNAAQALTDDGTLYGATILGDGVVHNSFGQKLMRIYNQKGIFSNTKDSEEGLTHILSEHFENVKTKVQGTVVMFSASGKK >NZ_CP040886|594426:621630|602468_602675_-|WP_001228702.1|DBSCAN-SWA MRKLKMMLFGASLIMVVGCSSKENALCHPQPKPPAPPAWAMMPPSNSLQLLDETFSVSGTELSATKQH >NZ_CP040886|594426:621630|616099_616306_+|WP_000233576.1|DBSCAN-SWA MVHQHYGTQTVNRGAVMPGMLVKHKDGTWTASANLRGRLYLHRGIERTYTRDLLVEVFLDGRGNGLNH >NZ_CP040886|594426:621630|620156_620324_+|WP_000545745.1|DBSCAN-SWA MHFRVTGEWNGEPFNRVIEAENISDCYDHWMLWAQIAHADVTNIRIEELKEHQAA >NZ_CP040886|594426:621630|607947_608118_-|WP_000224914.1|DBSCAN-SWA MATPLIRVMNGHIYRVPNRRKRKPELKPSEIPTLLGYTASLVDKKWLRLAARRNHG >NZ_CP040886|594426:621630|599340_599478_-|WP_072035100.1|capsid|DBSCAN-SWA MAYEPCQGVIIGIMPQHVLSKSELRLDAIFSLKRKTLLQYLEPWF >NZ_CP040886|594426:621630|598401_598578_-|WP_072163407.1|tail|DBSCAN-SWA MQEFSEHIAPLQDAVDLEIATEEENSLLEAWKKYRVLLNRVNTTTAPDIEWPTVPIIE >NZ_CP040886|594426:621630|616683_617469_+|WP_000100847.1|DBSCAN-SWA 
MSTALATLAGKLAERVGMDSVDPQELITTLRQTAFKGDASDAQFIALLIVANQYGLNPWTKEIYAFPDKQNGIVPVVGVDGWSRIINENQQFDGMDFEQDNESCTCRIYRKDRNHPICVTEWMDECRREPFKTREGREITGPWQSHPKRMLRHKAMIQCARLAFGFAGIYDKDEAERIVENTAYTAERQPERDITPVNDETMQEINTLLIALDKTWDDDLLPLCSQIFRRDIRASSELTQAEAVKALGFLKQKATEQKVAA >NZ_CP040886|594426:621630|612870_613101_-|WP_001067458.1|DBSCAN-SWA MNPAIKTAINIVGSQKKLGAACEVSQQAVYKWLHNKAKVSPEHVGSIVTATGGVVKAYQIRPDLPKLFPHTEKNAA >NZ_CP040886|594426:621630|620038_620224_-|WP_071525073.1|DBSCAN-SWA MFSASITLLNGSPFHSPVTRKCIYHLHKTKPAVASSDKRNPRQCEDAVHCCYTLFCSQRKR >NZ_CP040886|594426:621630|614017_614767_+|WP_000389051.1|DBSCAN-SWA MECVASSNSAIPNVVEVIRRINEGSTQPFLCKCDDGQLYVLKSKPSMPPKNLLAEFISACLANDIGLPLPDFKIVFVPEELIEYSPDLQQQICTGYAFASLFIDGAIALTFTQSRNETIIPVEQQKLIYVFDKWILNADRTLTDKGGNVNILYDISNDKYYLIDHNLSFDQNAGPEDFSVHVYGPGNRKWQYDLVDRVEYRQRVVNSLHKLPAILDEIPEEWIVDEEFLPFVCTTLDKGDCDEFWSAIE >NZ_CP040886|594426:621630|606025_606550_+|WP_000780581.1|DBSCAN-SWA MKLWPVLTGIALSFTLIACKAPTPPKGVQPITNFDANRYLGKWYEIARLENRFERGLEQVSATYGKRNDGGIRVLNRGYDPTKNKWSESEGKAYFTGDTKTAALKVSFFGPFYGGYNVIKLDDEYKYALVSGPNREYLWILARTPTIPDKVKADYVRTAQKLGFNVNELLWVKQ >NZ_CP040886|594426:621630|619311_619914_-|WP_000120065.1|DBSCAN-SWA MSYFLRKKWMVNLSGSGKILWALNMKKDSYPYLICMTVSGLIFIFLFFWWRADIYRVTFLNQSISHYYILFSMGIAFLLSLFWVKKGIVKQSGWKSLSAYLKVYAGMCIFAGFFLIIPLTTLTYFLPGETSSYVAPYRYTSGSSKSCSGAEVDDPDLHENIRICYPYGNYEYDNIIYVEKKINILGAVVTYAQTARDDTE >NZ_CP040886|594426:621630|611245_612175_-|WP_001415152.1|DBSCAN-SWA MANTAEIFNFPVPDAAQKEPRVADLDDGYTRIANELLEAVMLAGLTQHQLLVFLAVMRKTYGFNKKLDWVSNEQLSELTGILPHKCSAAKSVLVKRGIFIQSGRNIGINNVVSEWSTLPESGKKNKVYLKEVNLPESGKKSLPKSGKGTYPNQVNTKDKLTKDNIKPYSSENSGESSDLPENDLPVVKADAAIQNGSKWGTAEDLTAAEWMFDMVKTIAPSARKPNFAGWANDIRLMRERDGRNHRDMCVLFRWACQDNFWSGNVLSPAKLRDKWTQLEINRNKQQAGVTAGKPKLDLTNTDWIYGVDL >NZ_CP040886|594426:621630|608763_609216_-|WP_000825400.1|DBSCAN-SWA MKRLFVIAPLLVLVGCAQNISPNSYSVGSVGMVNRTIAGTVISARGVDISGTSALGGTAGAAVGATAGSALGGGVRSNIVGAVGGAVIGGIAGAAIESSATKQTGMEYVVETENGNLMTIVQGKDPLFTQGSKVLVLYGNPSRIITDPRH 
>NZ_CP040886|594426:621630|604873_605833_-|WP_000592543.1|DBSCAN-SWA MIKKPVIGISGCLAGSAVRFDGGHKRADFLMDKLVEWVTFRPVCPEMAIGLPVPRPALRLVRSMQGNIRMCFSHDQNEDVTERMTEFSRSYMDKLKDVSGFVVCAKSPSCGMERVRVYDENGNRGRKDGVGLFTSTLMEKFSWLPVEEDGRLHDPVLRENFVERVFALHELNHLYKEKLSRRELLAFHSRYKLQLLAHSQAGYKDMGPFVAAIHEWADLESYFEVYRDKLMAILRKPASRKNHTNVLMHIQGYFSNYLSTRQRKELSEVILNYRSGTLPLLAPLTLLKHYLGEYPNDYLLTQNYFDPYPDELALRLMVN >NZ_CP040886|594426:621630|618297_618489_+|WP_023148020.1|DBSCAN-SWA MHKASPVELRTSIEMAHSLAQIGVRFVPIPVETDEEFHTLAAFLSQKLEMMVAKAEADERDQV >NZ_CP040886|594426:621630|600286_600847_-|WP_001372490.1|DBSCAN-SWA MEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIENEKLRREVEELRQASETDLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLSRIAGEIASILDGIPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIKRRRSLSTRMRLTQQLI >NZ_CP040886|594426:621630|602287_602440_-|WP_001139678.1|DBSCAN-SWA MPSMIAIILLIILHIWLCRQGGAHFWSESRLNISLLMLDIEHFARGKGLR >NZ_CP040886|594426:621630|596996_598328_+|WP_001753290.1|DBSCAN-SWA MIINQVPIKIKIFIFLFSCISIIFLLLHANNGIYITQTTQISYSVFIIGLFFINLMIFIFLLLYYVSNQRQSYLLILSFAFLSNTYYLLEVAIISLSPLGNDLSTIYQKSNDIAIYYLFRQFSFISIIFLAVYSTNVKNKSVLEDKRNIIIVVLSILILFITPFVAKNLSSDNIKYSLNIIQYSLNRHLPTWNIVYTKIISVFWLVLLISSCISIRNYSKIWLCIILISIVSVCNNLILLYFIDKSHPAWYMTKFLELISMIYIISTLMYYVFRKLNHANHMAIHDPLTNTYNRRYFIDSLKNISKHHDFSVIMLDIDSFKSINDKWGHHMGDQVIVMVTRIIKKSIRKEDVLGRLGGEEFGIIIKGNTQKLLLSIAERIRKNIEEQCSEKLLSHGPEKITVSIGCFTSKENNLSPSEMLVNADKALYQAKRTGKNKVIIHSK >NZ_CP040886|594426:621630|607305_607668_-|WP_001372483.1|DBSCAN-SWA MNTYNITLPWPPSNNRYYRHNRGRTHVSAEGQAYRDNVARIIKNAMLDIGLAMPVKIRIECHMPDRRRRDLDNLQKAAFDALTKAGFWLDDAQVVDYRVVKMPVTKGGRLELTITEMGNE >NZ_CP040886|594426:621630|603388_603604_-|WP_000839582.1|lysis|DBSCAN-SWA MKSMDKLTTGIAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >NZ_CP040886|594426:621630|614763_615591_+|WP_000210934.1|DBSCAN-SWA 
MTTPCLYSIVRYAPYAETEEFANIGVLLCAPKENYFDFQLTKRNDSRVKNFFHDDCIFPVAKDSIQRELQFAKMHATQIVGHQQLAQFFRYFTNKKESIFQFSSTRVILSENPKEELARIYNKYVNHSDYTKERREDVLARELKRSIDRIDGLKNVFKQATIDGYFAKFSMPLVAKKHDRIQCAIKPLAFTQAEPGKMMEHSDTWVMRITRAAEENLLSLDDILFTIETPESPNSGQSKVIDIIKRTMDAKKINHIPASNHKETIDFAKKILPQV >NZ_CP040886|594426:621630|607664_607955_-|WP_001372487.1|DBSCAN-SWA MADLRKAARGRECQVRTPGVCNGNPETSVLAHIRLTGLCGTGTKPPDLIATIACSACHDEIDRRTHFVDAAYAKECALEGMARTQVIWMKEGVIKA >NZ_CP040886|594426:621630|595774_596251_+|WP_000767389.1|DBSCAN-SWA MKLISNDLRDGDKLPHRHVFNGMGYDGDNISPHLAWDDVPAGTKSFVVTCYDPDAPTGSGWWHWVVVNLPADTRVLPQGFGSGLVAMPDGVLQTRTDFGKTGYDGAAPPKGETHRYIFTVHALDIERIDVDEGASGAMVGFNVHFHSLASASITAMFS >NZ_CP040886|594426:621630|608117_608573_-|WP_001372486.1|DBSCAN-SWA MNLPQDGIKLHRGNFTAIGQQIQPYLEDGKCFRMVLKPWREKRSLSQNALSHMWYSEISEYLISRGKSFATAAWVKEALKHTYLGYETKELVDVVTGEITTIQSLRHTSDLDAGEMYVFLCKVEAWAMNIGCHLTIPQSCEFQLLRDKQEA |
46 | Enterobacteria_phage(47.06%) | capsid,lysis,integrase,tail | attL 596342:596356|attR 621704:621718 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1406072 : 1421944
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP040886|1406072:1421944|DBSCAN-SWA ATTAATGCTCGCGGGTCTGGTGGAACTGAACGTCCGGATAACGTTCCTGTGCCAGGCGCAGGTTAACCATGCTGGTAGCGATGTAAGCGAGATTATCGCCGCCATCAAGCGCCAGTTGGCTTTCGTTCTTACGCTTGAACTCTTCAAATTTCTTCGCGTCCGCACATTCTACCCACCGGGCGGTGGCAACGTTGACTGATTCATATACTGCTTCAACGTTGTATTCGCTCTTCAGGCGCGATACCACTACATCAAACTGCAGCACACCAACCGCACCAACGATCAGGTCGTTGTTGGAGATCGGACGGAACACCTGCACCGCGCCCTCTTCGGAAAGCTGTACCAGCCCTTTGAGCAGCTGTTTTTGCTTCAGCGGATCTTTCAGGCGGATACGACGGAACAGTTCTGGTGCGAAGTTCGGAATACCGGTGAACTTCATCATCTCACCCTGGGTAAAGGTGTCGCCGATCTGAATGGTGCCGTGGTTGTGCAGGCCGAGGATATCGCCAGGATACGCTTCTTCAACGTGCGAACGGTCACCCGCCATAAAGGTCAGCGCGTCAGAGATCACCACGTCTTTCGCGGTGCGCACCTGGCGCAGCTTCATGCCTTTTTCATATTTACCGGATACCACGCGCATAAACGCCACGCGGTCGCGGTGTTTCGGGTCCATGTTGGCCTGAATTTTAAATACGAAGCCGGTAAATTTGTCTTCGCTCGCTTGTACGGTACGGGTATCAGTCTGACGCGGCATCGGCGCAGGTGCCCACTCCACCAGGCCATCCAACATATGATCGACGCCGAAGTTACCCAGCGCAGTACCGAAGAATACCGGAGTGATTTCGCCCGCAAGGAACAGCTCTTTGTCGAACTCGTTAGACGCGCCTTTAACCAGTTCCAGTTCGTCACGCAGCTGCTGTGCCAGATCTTCACCAACCGCAGCATCGAGATCCGGGTTATTCAGCCCTTTAACAATGCGGACTTCCTGAATGGTGTGCCCTTTACCGCTCTGATAGAGATAGGTTTCATCTTTATAAAGGTGGTAAACGCCTTTAAACAGCTTGCCGCAGCCAATTGGCCAGGTGATCGGTGCGCAGCCAATTTTCAGCTCGTTCTCAACTTCATCGAGCAATTCCATCGGGTCGCGGATATCACGGTCAAGTTTGTTCATAAAGGTGAGGATCGGCGTGTCGCGCAGACGGGTAACTTCCATCAGCTTACGGGTACGATCTTCAACACCTTTTGCGGCGTCGATAACCATCAGGCAGCAGTCCACCGCCGTCAGGGTACGATAGGTATCTTCCGAGAAGTCTTCATGCCCCGGGGTGTCGAGCAGGTTAACCAGGCAATCGTGATACGGAAACTGCATCACAGACGTAGTAATGGAGATCCCACGCTGCTTTTCCATCTCCATCCAGTCCGACTTAGCGTGCTGGTTGGAACCACGGCCTTTTACTGTACCGGCGGTCTGAATGGCCTGTCCGAACAGCAGCACCTTCTCGGTGATGGTAGTCTTACCGGCGTCCGGGTGAGAAATAATGGCAAAGGTTCTTCTTTTCGCTACCTCTGCGGCCAAAAGGCTTGTCATAATTGCTGTCTCTTTGATTATGTACAATCATATGTACAAAGAAAAATTCATGCCGCTGATTGTACATTTTGGAGGTATCAGAGCATAGCAGAACATGATCAGTAGGTGTATTTGTAGCCAAAGAACATTTGTACATGCAGCAATCTTTGTTCTATTGACACTGTTGATTGGGCGGTGTACAACACAAACAAAAACAGGATGTTAGAGGTCTCAGCAGGACACCGACCAGACGGTGAAGTGACAAAAAGATACGCAAGGGAGCCGCGGCTCCCTACTGAAATATTATGACTTTAAGTGAATTTTTACTTCTTATTGCTATACTAGTTGACATAAAA
TTATACCGCCTTTCAATATAATCGCTCAAAGCAGTTGAAAATTTGTTCTTTAAGCCCCATATTCATTAAACACCAAATACTGTGGATAAAAATTTTCCAATAAGCTAGGATTTGTCCCAATTTATGGGTACCAGCTGGAAACGAACAAACAATGAACTCAAACGTTGCTAGTATTTATAGTGCTGCTAATGTTAACAGTAATGATTTAGCTTTAGAGTTGTACTGGAAAATCCAAGAAGTCTCTGCATGGTTTGTGAAACATGTAAACGCAAAATCGGTTGAGCAACTACGCGACTTTAACCCGTCATTCGCTGAAATTGCCGATCTTTCTGACGCCACTGCAGATATCATTACGAAGTTGCTTCAAGTCGGTGTTTGGGACGATGAAAGAGTTATGGCAAATGCCCGCCAAGCAGTGCTTTTAATGCGCCAAGTCGCAGAAGCGATCGAGCGTGGCGATAACGACAGTATTCAAGACGGAGCCAACCGCTTATCAGCAATGGCATTCGTTTAACTTAGTCAACTTAAAAAGTGAGTTTTACCTAGCTAGGTAAGTCAGGAGCCAATATGATGAATAAAATTGAAGCACGCCGCATTGCGCTGTTGCGAGAAGCCATCAAAAACGTTGATAAAATCAAAGAAATTCAAACGTTTATCGATCAAGAGCTAAAGGCTATGAATCGAAAAGCTGCATAAAGTAAAAAAACCCGGCAAAGCCGGGTTTTTTATTACCATCCTTTTACACTTTCGGCAGAACAGTTGACCACCATTCTGAAAGAACGTATTTCCCTTAGGGCGAGGATTAATGAGTTCCGCGCTTTTAGCATCTCCGGATACCTCACGTACGCCTTAACCCTCTGCCGCAGAGTCCGTATTCTGAGAGGCTCGCCAACGGTAAGCAAAGGCAAAAAGCGTCAGATCCAGCTCGAAGGTCTCGTGCATTCCATCCACGGACACGGAGGTACCCAGTAATCCCTTCCGAACCAGGGACTCCACACCAGTTTGCGGATCATCAAGCCGGTTCATTTGTCCCCATGTCATCACAGCACCGCCGAATGCTACAAAGTGCTGTACCACGTTTTTTTCACTGTCCGTAAGTGAATCATAAAGTGCGGCCAACTGCTGCCTTGAAGCAGCTTTTCCTTTCTGGGCAAGAAGACGCTGATGCCATCCTCCGGCAATGATCTTGACGATCGGGAATGGAATAATAAATGCGAACAGCGTCAGGACATAGTCGTGAAGGATCAGATAACAGCTCATTCCGGCAAGTCCCGCGACAAATATGGCGACATTAATGCCGAAATCCTTCTCCGTATAAACCCTGTCGAACAGTTTTCCAAGAAATTCAGTCATTGATTTACTCCCTACATGCTATGTCGCACCGCCCCAAATCAAAAGAGCGAATGGTTCAGTGTCGGCAGTATTCCCAATGATTAGCCAACATGCGCCCGGCATCATACCAGTTTTCAGTCAACACCAAGCCGGCAATAAGAGAGAACTCTTGATCCATCATGGCAATCAGATCTCCAAACAACACTGACAGAAATTCCTGGATGAAGTTGCTTTTCGATTTACCGGCTACGCTGACCTAAGTGGTGTTGCCGTATTTATCAGTTTATTAACAGCCCTCATAGCCCAGTACCTGTGAACATTGCTCAGTCCCTAGACAAACCGCTGTGTGGTGACGGTCTTCCGGCCATTCGGTTCCCACTGTATTGAAGCATGCCAGGCTATTTCAATATCGCTATGCCGTGGCATCATTTAACCCCTTGTAATTCATCGTCATAACTGTAGTAATGTTTTCCGCTTCAGGGAAAAAATAGCATCCAACCGCAGCACGTTCTTGCATACGACGTGCTGCGGCATAATCCCAATGATTACTCCCTGACAGGGTTCGTAGGCCACTCAATATCAGGTGCAGTTGATGTATCAACACGGTTCAGCAACACCCGATACTTTTTCCAGGCTTCCACCACCAGCACGACAAGATGCCGCATACA
GTGACCCAGTCAGTCCAGTTTCCAGACAACCAGTGCGTCACCTTTTTGAAGGCGCTTTAAAGCACGTTTTAATCCAGGTCGGCCTGTCCTTATTCCGCTTAATTTATCTTCAAATATTTGTTCACATCCTGCACAAACAAGAGCGTTTCGTTGCAGGTCTGTATTCTGGTCATTTGTTGATACCCTTACATAGCCAATCAGCACGCTGAATCTCCCGTCCAAAAGCACAAATCATGCCATGCAGGCCAGAAACCGCCATTATCTAAAACCTCGGTTTACAGGAAACGGTAAACAGGGCCAGGAACGCCGTGCAAAAGAATGGCGATACCTTGTCCGGTGGGCTTACTTTTGAAAACGACTCAATCCTTGCCTGGATTAGAAATACTGACTGGGCAAAGATTGGTTTTAAAAATAATGCCGACAGCGACACTGATTCATACATGTGGTTTGAAACAGGCGACAACGGCAATGAATATTTCAAATGGAGAAGCCGCCAGAGCACCACAACAAAAGACCTGATGACTCTTAAATGGGATGCTTTGTCTGTCCTTGTTAAAGCCCTTTTCAGCAGTGAAGTAAAAATATCGACAGTCAATGCACTGAGGATATTTAATTCATCTTTTGGTGCTATTTTTCGTCGTTCTGAAGAATGCCTGCATATCATCCCTACACGAGAGAATGAGGGAGAAAATGGTGATATAGGGCCACTACGCCCCTTTACGCTTAATCTCAGAACTGGTCGGATAAGCATGGGGCATGGTCTTGATGTTACAGGGGATATATTTGCAAACCGTTTTGCAATTAACAGTAGTACCGGCATGTGGATTCATATGCGTGACCAGAATGTTATTTTGGGACGCAATGCGGTATCCACCGATGGTGCGCAGGCATTACTTCGTCAGGACCACGCTGATCGCAAATTTATGATTGGTGGACTGGGGAATAAGCAATTTGGCATCTACATGATTAATAACTCAAGGACAGCCAATGGCACCGATGGTCAGGCGTACATGGACAACAATGGCAACTGGCTTTGCGGTGCGCAAGTTATTCCCGGCAATTATGGTAATTTTGACTCACGTTATGTGAGAGATGTCCGACTTGGTACACGTGTTGTTCAGACTATGCAAAAAGGCGTGATGTATGAGAAATCAGGTCATGCAATTACGGGGCTTGGCATTATCGGTGCAGTTGATGGCGATGATCCGGCAGTATTCAGACCAATACAAAAATACATCAATGGCACATGGTATAACGTCGTACAGGTGTAATTTATGCAGCATTTAAAAAATATTAAGTCTGGAAATCCAAAAACAAAAGAACAATATCAGCTAACAAAGAATTTTGATGTTATCTGGTTATGGTCCGAAGACGGAAAAAACTGGTATGAGGAAGTGAAAAACTTTCAGCCAGACACAATAAAGATTGTTTACGATGCAAATAATATTATTGTCGCCATCACCAAAGATGCCTCCACGCTTAACCCTGAAGGTTATAGCGTCGTTGAGGTTCCAGATATTACAGCCAACCGCCGCGCTGATGATTCCGGTAAGTGGATGTTTAGGGACGGAGCTGTGGTTAAACGGATTTATACGGCAGACGAGCAACAACAACAGGCCGAATCACAAAAGGCCGCATTGCTTTCCGAAGCTGAATCAGTCATCCAACCGCTGGAACGCGCTGTCAGGCTGAATATGGCAACAGACGAGGAACGCACACGACTGGAAGCATGGGAACGCTACAGTGTTCTGGTCAGCCGTGTGGATACGGCAAATCCTGAATGGCCACAAAAACCAGAGTAAAAATTAAGGCCCGATAGCGGGCCTTCTCTCATTCTGGTTGTTCGGGAAACGTTACTGGCAGGCCGGAAGTGTCTGTAGATTCGACTTTCTGCGCATAGAGCATCCACTCGGTTAATTTTTGTTTATTCTCGTCGGAAATGATGCCCAGCCGTAGCTGTGAGTCCCATAGCTGGGTTTTATCCCTGACAAGTTGCAACA
GGCTTTGCTTTTCATTTTCCGCTTGTTGCCTCTGCTCTTCCTCGGTATAAGTTCGCTTTATCACTACGCCATCTTTGAACATCCATTTCCCCGAAATATCAGCCCGGCGATTTGCTGTAATATCAGGTAATTCAACGACGCTTGCACCCTCTGGATTAATTGCTGAAACATCCTTTTCAATACAAATAATAACGCCGTTATGGTCATAGACCATTTTCAAAGTGTCTGGCTGGAAATTCTTTTGTTCCTCATACCAGTTTTTTCCATCATCTGAATAAAGCCATTTGATGTTAAATTGCTTTGTTAGCTGGTATTGCTCTTTTGTTTTAGGGTTGCCAGCAGTAATGTTTTTTAAGTGCATCATCGTTAAATACTCCCCGCGTTATACCACGTCCCATTAATGCAATACTGAATTGGCCTTGCCTGAGTTGTATCAATTAATTCATCACGGTTTCCGTTAACTGAACCCGTAACGACATAACCTGACCTGTCAGACCAGCCGGGGCCTTTCCATGTCTGAACAGATGACAGACCGCCAAGGCGAATAGTGCAGTTGTTACATAACTGAGTTTATGTAAGTCTTCATCATTCAGACGAGAGAGGGCTGGGACAGTAGCCATGATGGCAGCCTCCGTATGCAATGGATAACTTCCACCACCGGAAACGCCAATTTCGCTGGTGGTGAACTGAGCAGGGTTGGCGTAACCGGCGCATACGGAAACCGGCGCACCTTTCGGTGCCCCCACCCAGCCCACCATAATTTGGGTATAGCTGAGTTGTAGCAACAAAAAAGACGCTAACGCGCCAATTGTCGCCGTATGCAATTCCAGGACGCCAATCCCGACACCCGCTTTATAAGGTGCCTGAACAGTGTAACGTCCCGGAATGGCAGAATCAATGTGCTGGTGGTCCTTCACACTCAACAAAATCACGCCTGAATTTCCACAAAGGACTAAAGCACTCATGCGGGTAGTCTTTGCGAAGATAGATAACGCGCTGTGTTTCTGGCTCCCAACGAATAACATGGACATAAAGCCCTCTTCCGTCACGAAACCAGCGGTTAAGTTCTTGCACAACTCGCCCCCCACAGTCAGGTAAAGTTCTCTGTGGTTACTTACAGCCAGGTGATTTGGTAATCTGCATTCATGCCGTAACAACAGGTGTTCAGCGACGCTGACCACCAGCTGTTGCGACAAACGGTTATTTGCCGTTAAACTGTTCATGCGTTAGTTTCTCCACAGACACAAAACGCCACGACGCCCGGAGCTGCACACTCGCGGGCGTCACTCTTTTCTGGAGCGCAAAAGATTTTGTAGACCAGTGCTGCATGCTCCTGGAGCTTCGAAATTGACAGATACAACTCATCATTAATTGCTGTCTGCTCGTGTGGCTCCACGACCCCATCTTCGATTGCCGAACGAATCTGCTTTGAGTAATTCCCGATCTGTTCGATGACTTCCAGCAGGCGCTGGTTTATATCGGCGTTCTCTACTTCCTCAATTTCAGGAAGCGATACGAACACCCCACCAGCAGACTGTGCGACAGCATCCGCAATGTAGTGAGTGCCAGCCGCGCGCTGTAAAACCATTGCCCATCCCAGCGGGAAAATCTGATCGCCATCGGCACGAAGGCGGTTAAATAATGCGTTCTCTGTTACATCCAGCCAGTCAGCTGCTTCAGCGTAACCACCCGGCAACGCTGCGATAGTTTTTCTGACAGCTTTCACGTACCACTCAGGCTGTTTTTCTACTTTCCAGTGATACTTACCCACGGTTAGCCTCATCGTTCTGTGGTTAAAAATTGAAGGTGTTCTGTTAATCTTTCGGATAGATATCCGGTCTTAAGTCAGATTTCGTAATTGCACCTGACGTGCATTGCTCAAGTTTTTTCGCCAGCACAAAACTGGCTTTTTTATAACCATTGAAAACCAGCCGTAAGTAGCCAGGTGTTGAGCCAACTTTTCCGGCCAACTCGCCCTGCTGTTCTTTGGTTAAAGAGTC
CCAATACGCTTTCATACAATATGTACCTCCGATATACATATTACATGATTGAAATGAACCTTCAAGATACTTGTACCTTATCGGTACAAAGGTTTTAATTTCGTTATGAAAACAATCCATGACATCCGGCGGTCTAACGCCAGAAAACTGAGAGATGGTGTTGGCGGGAATTCTTCCTTTGCCACCATGATTGATCGCGAGCCAACCCAGACCAGCAGGTTTATGGGAGATGGTGCTACTAAAAATATCGGTGACAGCATGGCGCGGCACATCGAAAAATGTTTCGACCTGCCTGTCGGATGGCTTGATCAAGAACACCAGACCACGAACATCACAAAAAAACCTGATGTTTCAATCACTAACAAACAAATAACGTTAGTCCCTGTCATATCATGGGTACAGGCCGGAGCATGGAAAGAAGTTGGCTATTCTGAGGTTGATTTGAGCACAGCAGAAACTTACCCCTGCCCTGTACCCTGTGGCGAAATGACTTATATCTTGCGGGTGATTGGTGATTCAATGATTGATGAGTACCGCCCTGGAGACATGATTTTTGTTGATCCCGAAGTCCCTGCCTGCCACGGTGACGACGTTATTGCATTGATGCACGATACAGGCGAAACCACCTTCAAGCGATTGATAGAAGATGGAACACAGCGTTATCTCAAAGCATTAAACCCAAACTGGCCTGAGCCTTACATTAAGATTAACGGTAATTGCTCTATAATTGGTACAGTGATTTTCTCGGGAAAACCAAGAAGATACAAAATAAAGGCCTAATCAATATTTATAACCTGCTTCGGCAGGTTTTTTTATACTTGACAATGTACCCTTGAGATACATAATGTATCTAAAAGAAACATAACACAGGCAAGATTAAACTAAATTTGGTTGTAACACGGCGTATGGCACATGCGTCGTTAGCGGTCTGGGGACGTTAAAGGGGACAATCCACTCCTTGCTCGGGCAAACAAACCAGGTAGCCGGAATGTGCAAGTCAATGATGATGCTGATAAGACGCCTAACCAGCGTGGCGATCCGGTTTGACGCCTGGGAAGAGACCAGGGTGCAACGATGAGGGCATTTATGGAACCGCGACAAAGTGTGGTGCCGTAACTGGCTAAGTGCTCTCAGCGTTGTGGTAATCCGCGAAATGGCGCGGCGGTAAGTATGGCGGGGTTACTCTTTCCCCGTTGAGGACACCGGATTGTCAGGTTGACCATACGCCTGAGTGACAACCCCACCACAACAGCCACTGCTTTGGCGGTACCAGTTTGTACACTTGCTTCCGGCTGGTACCGCTCTTTTTACAAAACAGAGAAGGGCATCACCGGACGACGGGCTCATAACCCAATCCATCCGGGCGGCTGCCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATGAGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCTGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCGCTCAACCTGGAAATCGAACCGTTTGATGAGAACCGTGTAAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAAGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCGTTAATATCGTTTTTAATAAACTGATTATTTATCTCATCACTGAATATCTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACACAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATT
CATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTATTCTAAAGATCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCTGATAATATGCGAGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACTCTCAAACTGAAAAAGACAGCACCGTTCTCTGCCCTGTTGTCTGTTAACGGCGAGCGTAACTCCCAGAAGTCACTGGCAGAATGGATTGAAGACTGGGCCGACTACCTTGTGGGCTTTGATGCTAATGGTGACGCCATTCAGGCAACAAAAGCGGCTGCGGCGGTCCGTAAAATCACGATTGAAGCAAACCAGACCGCTGATTTTGAAGACAATGACTTCAGCGGCAAACGCTCTCTGATGGAGTCTGTCGAAGCGAAAACCAAAGACATTATGCCAGTAGCATTTGAGTTTAAATGCGTTCCGTTTGAAGGCCTGAAAGAACGTCCGTTTAAATTACGCCTCAGCATTATCACTGGTGATCGCCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCAGTGCAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAAAAATTCAAAGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCATTTATGGAAACGTAATTAACTCAATAATCGCCTGATGGCGAGGGTTTTCTTTAACCAAAATTCAGCGCGGTGCAGCGCATATAAAGTGGAGAACAAAATGTCATTTATTAAAACTTTTTCCGGGAAGCATTTTTATTATGACAAGATAAATAAAGACGACATCGTTATTAACGATATCGCGGTTTCCCTTTCAAATATCTGCCGCTTTGCCGGTCATCTTTCTCACTTCTACAGCGTCGCCCAACATGCGGTGCTTTGCAGCCAGCTGGTGCCGCAGGAATTTGCTTTTGAAGCTTTAATGCATGATGCAACAGAAGCATATTGCCAGGACATCCCCGCACCACTGAAACGACTTCTTCCTGACTATAAACGGATGGAAGAAAAAATAGACGCCGTAATCCGTGAGAAATACGGGTTACCTCCTGTTATGAGCACGCCAGTGAAATATGCCGATCTCATTATGCTGGCAACCGAACGCCGCGATCTCGGGCTTGATGATGGCTCTTTCTGGCCTGTGCTGGAAGGTATCCCGGCAACAGAGATGTTCAACGTGATTCCACTGGCTCCAGGCCATGCCTACGGGATGTTTATGGAACGCTTTAACGAATTATCGGAGTTACGCAAATGCGCATGAATGTTTTCGAAATGGAAGGGTTTCTTCGTGGGAGATGTGTACCGCGAGATCTGAAAGTGAATGAAACGGATGCTGAATACCTAGTGCGTAAATTCGATGCGCTTGAAGCTAAATGTGCAGCACAGGAAAACAAAGTAATACCAGTGTCAACTGAACTGCCACCAGCAAATGAAAGTGTTTTGTTATTTGATGCTAATGGAGAAGGCTGGCTGATTGGCTGGCGTTCTCTCTGGTATACATGGGGGCAAAAAGAAACCGGAGAATGGCAGTGGACATTTCAGGTCGGGGACCTTGAAAACGTCAATATCACTCACTGGGCAGTAATGCCAAAAGCACCGGAGGCTGGAGCATAATGACCACATTTACCGATAAAGAACTGATTAAAGAAATCAAAGAACGAATCAGCAGCCTAGAGGTTCGAGACGATATTGAGCGCCGTGCTTATGAAATTGCTCTGGCATCGCTAGAAGAGGAGCCGGTGGCATGGCTGCATTCAGAAAATGGCTTAGGTATTCCGGCAATAACGAGGAGTAAAAACATTGCTGACAGTTGGTTATCAAAGGGCTGGTATGTTCAGCCGCTATATATAGCCAAGCCAGTGCCGGTGGTGCCAGATGCTCGTCCGTCTTTAAATAA
TGGCATAGTCGGTTTTGATGAAGGCTGGAACGCCTGCCGCGCCGCCATGCTTCATGGTGCCGAACCTGTAAGCCAGACTTACAAGTTGAACAAGCTGTCGGGCAACTCTCCAGTAACTCCGGATGGTTGGATAAGCTGTAGTGAGCGAATGCCGAACGATAAACAGTATGTTTGGTGTTGGGGGAAGCCTTACGGCTGGACTGAGTGCGATACCTTCGAAGGGTATTACGATTGGTCGAGAAACAAATGGTGGGCAGTTACTGACGATAGGGAAGAACCGGCATCGAAAGTAACCCACTGGATGCCGCTACCGGAGCCGCCGCAGGAGGTGAAGTAATGAACAACTTAATGACAACTAAACAAGTCGCCGACTTCTGTGGCGTTTCAGTATCGACTGTTCTTCGCTGGAACAGCGTAAACAGGAGAACTGGCCAGAAATACAGGCCTGACTTTCCAGATCCTGATATTAAATCCTGCCCAAATAAATGGGCATCACGCAAGATATACAGATTTGCTGGAGTTATTGAGTAACGTGTATTATCTCAGATGGGAGCTGACATATCTATGGCACAGACCAAACTAATCTGACAGTCAAGTCTGTGCCAAGAGCAAACGTTGCTAATTTAATGTTAAGTTGTTTCTCTTGAAGACGATCACGCTATGATACAACCTATAAAATTATCGATATCTGAGAGAACCAGTAAAAAGTTTAAGTCAAGGGTTCTATAAGTGTGTTAATTTATTGGGAAGCAATAAGCTTCCCTTCTCTTTATTTAGGTACGCTATCAACACTTTTGAGAATCGTTGATGTTAAAGGCTGCGACTTTAGTAAGTCTCGAGAAATATCCGGATCTGGTATAGCTATTTTTACTAAACTCTCAAGTTTTTCAAACCACTCCCCATCATTCATTGAGAATAAGTTATGAAGAGCTTCATTTTTATTTTTGAACCCCCAAGCCTTATAATTATTACCACAGTATTTTGACGCAGCAGTTAGAATATGCCAGCGTAATTTTGAATACTTCCCATCAAACCTCTTATTTGAGATAAGAGCCTTTAGCCTATACAAGCAATAGCAAGATATATAATAATCATCCTCGAGGGCATCGGAAGAGAAAACCTCATTAAGTAAGTCACCAGTTAACCTATTTGGATAACGACTGGAATAATCTGGTCTCATCATAACAATAGCAGCATATGCTCGCGCAACTTCTCTGATATCAAATATGCGTACTGGAGCTATACTTTCTGAACTATACTGCCCCTTTCTTCTCTCAAAATAAATTTTATTTCCTTCAATAGCACCTTTAGCATTAAAATAATGTTCTAATTCCCTTAATTTTTTTAGCGTTGAAATGAACTGTGCATCTTCAACTTTAGATTGTCTATTCGTAGCTCGTACAATGTCATCAAGGATCGCTGGCTCATCGGTTTCAATTAATTTAATCATTAAACTAACTGATTCATCTACTTGAATGTCTTTTGATATAAGAACATTAGACGTTTGACATCCATTAACAATTTGAAAATCTCGTATAAAAATTTCTTGCCCTGCAGGCCTCACGCTTGATGCAACTATTGTCACTCCATTATTCATTAAACCAAATCTTGCTTTTTTTCCATCAGTATCTAGTGTTCCTGCAATTTCTGAATTTACGTCACCATCAATTCCCAGAAAATCTCTAACATTTTCTTCGAATAATTTTTTTCGAGGGTTTCCATTTTTGTCTTTAAGTATAGAATCAATAAAACTACGAGCTTTGACTGTAGCTACATAGGCATTATTAATATTAGGGGCCGCTGGGAATGGAGCATAGCCTATTGTAGGAAGTTTAGCTTCTATTGGACCTTCCGCAGCAAGCCAAAGCTCATGAATCATGTCTTTGTGAGCCATAATGAAGGATGTTTCATGTGAAAACCCGAGTGACTTTAAGCTTTTTTCACCCGATGCAAATGCAGCTTTTATTTCTCTGGCCTCTGTATTTTGTGCAGCACTAAAA
AAATATGCGTATAAGTCTGGAAGGCCATTTTTGACTCTGCCGATATTAGCAAATATCAAGTTGAACATTTTTTTAAAGTCCGCTAGATATTCACTGTGGGGTTGTTGTGGCGCTGGTGAAAGATAATCTCTTATTGACGCAATGTAAGAGTCTATTTCCTGTTTGCTCCACTTCTCCGATGACTTAGCTTGCGTGAATACTAATGAAACTTGAAACTCGCGACGTGAGTTTTGGAAAATCTCCTGAAGTTCTTCAGTTGAAAAAATGGCTCTGTCGTCTAAAAATAAAAATGCTCCATCAATACCAGGGTCCGGGCCTTCATACACAAGATCACTGACCTCAACTTTGTCGCCTGAGTATTTTGAAAAAGCACAATAGTTCACAAACGCCTCAAAATTTTTGGTCTCCTCATAGGGCGCAGCGAATGCTTTGCAAAAGGCATCGAAATAAGATTTCGTGACTAAATGCATAAAAACTCCTTTCTGAAATAAGATAAGTTTATTAGAAACAACAAGATAAAGATAAAGATAATAGTAATACAAGTTAAAGACATACTGCATCACATAAGTTGGTCAAGATCAATAAATCAATAAGGATCCCCTATGTTTTATGTATTGTTGATAATTCTATCACCTCTTACTTCTGGTCTAAGCGTAACCTACTCAAGAATGCCATTGAAAATGTCCACTGTTCGCTCAAAACAGACTGTCAGGGCTACGCAAGCTGTCAGGTAGATTCTGGGCCAGTACAAGTAACGATCGATTCAACTCTCTCCCACCATGCCTGATATGCTTTACGCTGTTCTTCTAGATAATCGCTCTTGTCATAAACCTGCCAAACCCCTGGCAGTTTATGGCCGAGCATTATTTCTGCAATATGAGGAGCAGTAAGATCAGAAAAGTTTGTTCGTGCTGTTCGCCTCAAATCATGAAGAGACCAATGAGGGAATTGATGCCCCAAACGCCTCCATGCGTACTGCATTAAATTGTAAGGCAGCGACTGCAATGATGTTCGACCAACGGGTTCCCTGCTTCCTTCCTTAGTAAAAAGCATATCGGAACCATTGTTCATAGAGATAGCGTTCTTTATAAGCTCCTCCACCGGTTCAATAATGGGCCTCTTTAGCGGTTCGCCTGTTATCTCTCCAGTCTTATGTCGTTCTGGTGGTACAGTCCATACCTTATTAATGAAATCAAAATCGCCCACCCTGGCAGTAATTAGCTCTGAACTACGGCAGCCAAAATGCAGCAATAGCTTAATGAAGGCCCGGTATTTAGGAACCATTCGAGAACCATCGATCGCAGCATAAAGGATTTTAATTTCATCATGTGTCAGAAACCGTTTCTTCTGACCTTTACGGATATCCATATCTTTACCCGTGATATCCGACAGCGGGCGAGTTTCAATGAGCTTTCTCTTATACGCCCAGACATGGGCCTGCTTTGCGTTAATTAGCAATCGGTCTGCTATTGCTGGAGTCTTAGTGCTAAGAGGCTCCAGGACTTCTAACCAATCATGCAATGTAGCTGCATCGTGAGGGATATTCCCGATTTTAGAGAACAGGTGCAGCTCAAACGAGCGGAGTATCTGTTCAGAACCTTTTTTATTTTTTACACAATATGCTTCATACCAGGCACGGATCACAGACTCTACCGTCATGGCTTCAGTAGCTTTTCGTTTTTCAGCCTGCTTGACCAATCGTGGATTACGGTTTGACTCAAGTTCACCACGGAGACGGATAACTTCTTCTCTGGCCTCTTTTAATCCAGTTGCCGGGTAAGTTCCGATATCAAGACGCTCACCTTTCCCTGCCCATTGATAACGATATTGGAACACTACGCGACCTTTCGGTGATACTCTGACAGACAGACCATCACGATCGGATTTAACCAAAACCTTATCACGTTCCTTTCCAACGACTGAACGCAACCACGCATCAGACAACGCCAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP040886|1406072:1421944|1408211_1408508_+|WP_001378647.1|DBSCAN-SWA MYWKIQEVSAWFVKHVNAKSVEQLRDFNPSFAEIADLSDATADIITKLLQVGVWDDERVMANARQAVLLMRQVAEAIERGDNDSIQDGANRLSAMAFV >NZ_CP040886|1406072:1421944|1416821_1417358_+|WP_001401560.1|DBSCAN-SWA MSFIKTFSGKHFYYDKINKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQDIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDGSFWPVLEGIPATEMFNVIPLAPGHAYGMFMERFNELSELRKCA >NZ_CP040886|1406072:1421944|1410302_1411265_+|WP_001171282.1|DBSCAN-SWA MQKNGDTLSGGLTFENDSILAWIRNTDWAKIGFKNNADSDTDSYMWFETGDNGNEYFKWRSRQSTTTKDLMTLKWDALSVLVKALFSSEVKISTVNALRIFNSSFGAIFRRSEECLHIIPTRENEGENGDIGPLRPFTLNLRTGRISMGHGLDVTGDIFANRFAINSSTGMWIHMRDQNVILGRNAVSTDGAQALLRQDHADRKFMIGGLGNKQFGIYMINNSRTANGTDGQAYMDNNGNWLCGAQVIPGNYGNFDSRYVRDVRLGTRVVQTMQKGVMYEKSGHAITGLGIIGAVDGDDPAVFRPIQKYINGTWYNVVQV >NZ_CP040886|1406072:1421944|1406072_1407659_-|WP_000202566.1|DBSCAN-SWA MTSLLAAEVAKRRTFAIISHPDAGKTTITEKVLLFGQAIQTAGTVKGRGSNQHAKSDWMEMEKQRGISITTSVMQFPYHDCLVNLLDTPGHEDFSEDTYRTLTAVDCCLMVIDAAKGVEDRTRKLMEVTRLRDTPILTFMNKLDRDIRDPMELLDEVENELKIGCAPITWPIGCGKLFKGVYHLYKDETYLYQSGKGHTIQEVRIVKGLNNPDLDAAVGEDLAQQLRDELELVKGASNEFDKELFLAGEITPVFFGTALGNFGVDHMLDGLVEWAPAPMPRQTDTRTVQASEDKFTGFVFKIQANMDPKHRDRVAFMRVVSGKYEKGMKLRQVRTAKDVVISDALTFMAGDRSHVEEAYPGDILGLHNHGTIQIGDTFTQGEMMKFTGIPNFAPELFRRIRLKDPLKQKQLLKGLVQLSEEGAVQVFRPISNNDLIVGAVGVLQFDVVVSRLKSEYNVEAVYESVNVATARWVECADAKKFEEFKRKNESQLALDGGDNLAYIATSMVNLRLAQERYPDVQFHQTREH >NZ_CP040886|1406072:1421944|1411268_1411796_+|WP_001681074.1|tail|DBSCAN-SWA MQHLKNIKSGNPKTKEQYQLTKNFDVIWLWSEDGKNWYEEVKNFQPDTIKIVYDANNIIVAITKDASTLNPEGYSVVEVPDITANRRADDSGKWMFRDGAVVKRIYTADEQQQQAESQKAALLSEAESVIQPLERAVRLNMATDEERTRLEAWERYSVLVSRVDTANPEWPQKPE >NZ_CP040886|1406072:1421944|1409776_1409899_-|WP_071594465.1|capsid|DBSCAN-SWA MAYEPCQGVIIGIMPQHVVCKNVLRLDAIFSLKRKTLLQL >NZ_CP040886|1406072:1421944|1415225_1415426_+|WP_071587686.1|DBSCAN-SWA MTTPPQQPLLWRYQFVHLLPAGTALFTKQRRASPDDGLITQSIRAAATAGVLLCFVEKPTDLAGSI 
>NZ_CP040886|1406072:1421944|1418330_1418525_+|WP_001061361.1|DBSCAN-SWA MNNLMTTKQVADFCGVSVSTVLRWNSVNRRTGQKYRPDFPDPDIKSCPNKWASRKIYRFAGVIE >NZ_CP040886|1406072:1421944|1413808_1414009_-|WP_000649477.1|DBSCAN-SWA MKAYWDSLTKEQQGELAGKVGSTPGYLRLVFNGYKKASFVLAKKLEQCTSGAITKSDLRPDIYPKD >NZ_CP040886|1406072:1421944|1420720_1421944_-|WP_001680166.1|integrase|DBSCAN-SWA MALSDAWLRSVVGKERDKVLVKSDRDGLSVRVSPKGRVVFQYRYQWAGKGERLDIGTYPATGLKEAREEVIRLRGELESNRNPRLVKQAEKRKATEAMTVESVIRAWYEAYCVKNKKGSEQILRSFELHLFSKIGNIPHDAATLHDWLEVLEPLSTKTPAIADRLLINAKQAHVWAYKRKLIETRPLSDITGKDMDIRKGQKKRFLTHDEIKILYAAIDGSRMVPKYRAFIKLLLHFGCRSSELITARVGDFDFINKVWTVPPERHKTGEITGEPLKRPIIEPVEELIKNAISMNNGSDMLFTKEGSREPVGRTSLQSLPYNLMQYAWRRLGHQFPHWSLHDLRRTARTNFSDLTAPHIAEIMLGHKLPGVWQVYDKSDYLEEQRKAYQAWWERVESIVTCTGPEST >NZ_CP040886|1406072:1421944|1411824_1412358_-|WP_000972143.1|tail|DBSCAN-SWA MMHLKNITAGNPKTKEQYQLTKQFNIKWLYSDDGKNWYEEQKNFQPDTLKMVYDHNGVIICIEKDVSAINPEGASVVELPDITANRRADISGKWMFKDGVVIKRTYTEEEQRQQAENEKQSLLQLVRDKTQLWDSQLRLGIISDENKQKLTEWMLYAQKVESTDTSGLPVTFPEQPE >NZ_CP040886|1406072:1421944|1418763_1420464_-|WP_001419254.1|DBSCAN-SWA MHLVTKSYFDAFCKAFAAPYEETKNFEAFVNYCAFSKYSGDKVEVSDLVYEGPDPGIDGAFLFLDDRAIFSTEELQEIFQNSRREFQVSLVFTQAKSSEKWSKQEIDSYIASIRDYLSPAPQQPHSEYLADFKKMFNLIFANIGRVKNGLPDLYAYFFSAAQNTEAREIKAAFASGEKSLKSLGFSHETSFIMAHKDMIHELWLAAEGPIEAKLPTIGYAPFPAAPNINNAYVATVKARSFIDSILKDKNGNPRKKLFEENVRDFLGIDGDVNSEIAGTLDTDGKKARFGLMNNGVTIVASSVRPAGQEIFIRDFQIVNGCQTSNVLISKDIQVDESVSLMIKLIETDEPAILDDIVRATNRQSKVEDAQFISTLKKLRELEHYFNAKGAIEGNKIYFERRKGQYSSESIAPVRIFDIREVARAYAAIVMMRPDYSSRYPNRLTGDLLNEVFSSDALEDDYYISCYCLYRLKALISNKRFDGKYSKLRWHILTAASKYCGNNYKAWGFKNKNEALHNLFSMNDGEWFEKLESLVKIAIPDPDISRDLLKSQPLTSTILKSVDSVPK >NZ_CP040886|1406072:1421944|1413213_1413765_-|WP_000521508.1|DBSCAN-SWA MGKYHWKVEKQPEWYVKAVRKTIAALPGGYAEAADWLDVTENALFNRLRADGDQIFPLGWAMVLQRAAGTHYIADAVAQSAGGVFVSLPEIEEVENADINQRLLEVIEQIGNYSKQIRSAIEDGVVEPHEQTAINDELYLSISKLQEHAALVYKIFCAPEKSDARECAAPGVVAFCVCGETNA >NZ_CP040886|1406072:1421944|1415868_1416693_+|WP_001763729.1|DBSCAN-SWA 
MSQNLDTTAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKDLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAVRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >NZ_CP040886|1406072:1421944|1417348_1417711_+|WP_001242749.1|DBSCAN-SWA MRMNVFEMEGFLRGRCVPRDLKVNETDAEYLVRKFDALEAKCAAQENKVIPVSTELPPANESVLLFDANGEGWLIGWRSLWYTWGQKETGEWQWTFQVGDLENVNITHWAVMPKAPEAGA >NZ_CP040886|1406072:1421944|1408843_1409347_-|WP_001378643.1|DBSCAN-SWA MTEFLGKLFDRVYTEKDFGINVAIFVAGLAGMSCYLILHDYVLTLFAFIIPFPIVKIIAGGWHQRLLAQKGKAASRQQLAALYDSLTDSEKNVVQHFVAFGGAVMTWGQMNRLDDPQTGVESLVRKGLLGTSVSVDGMHETFELDLTLFAFAYRWRASQNTDSAAEG >NZ_CP040886|1406072:1421944|1415440_1415803_+|WP_000135682.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGR >NZ_CP040886|1406072:1421944|1414099_1414774_+|WP_000848748.1|DBSCAN-SWA MKTIHDIRRSNARKLRDGVGGNSSFATMIDREPTQTSRFMGDGATKNIGDSMARHIEKCFDLPVGWLDQEHQTTNITKKPDVSITNKQITLVPVISWVQAGAWKEVGYSEVDLSTAETYPCPVPCGEMTYILRVIGDSMIDEYRPGDMIFVDPEVPACHGDDVIALMHDTGETTFKRLIEDGTQRYLKALNPNWPEPYIKINGNCSIIGTVIFSGKPRRYKIKA >NZ_CP040886|1406072:1421944|1417710_1418331_+|WP_001377405.1|DBSCAN-SWA MTTFTDKELIKEIKERISSLEVRDDIERRAYEIALASLEEEPVAWLHSENGLGIPAITRSKNIADSWLSKGWYVQPLYIAKPVPVVPDARPSLNNGIVGFDEGWNACRAAMLHGAEPVSQTYKLNKLSGNSPVTPDGWISCSERMPNDKQYVWCWGKPYGWTECDTFEGYYDWSRNKWWAVTDDREEPASKVTHWMPLPEPPQEVK |
19 | Escherichia_phage(35.29%) | capsid,integrase,tail | attL 1407598:1407617|attR 1422175:1422194 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
1510249 : 1516808
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP040886|1510249:1516808|DBSCAN-SWA GATGAAAATTGCGCTGGTTATTTTCATCACCCTTGCCCTGGCGGGCTGTGCGCTGTTATCACTCCATATGGGAGTGATCCCCGTGCCGTGGCGCGCGCTGCTGACCGACTGGCAGGCCGGACGCGAGCATTATTATGTATTGATGGAGTACCGACTGCCGCGCTTGCTGCTGGCACTGTTTGTCGGTGCAGCCCTCGCCGTGGCGGGCGTGCTGATACAGGGGATTGTGCGCAACCCTCTGGCATCACCGGATATTCTCGGCGTTAACCATGCCGCCAGCCTGGCCTCTGTGGGGGCTCTACTTCTTATGCCGTCACTGCCCGTGATGGTGCTGCCGCTGCTGGCCTTTGCGGGCGGCATGGCGGGGTTGATATTACTGAAGATGCTGGCAAAGACCCACCAGCCGATGAAGCTGGCGCTCACCGGCGTGGCGCTTTCTGCATGCTGGGCCAGCCTGACGGATTATCTGATGCTCTCGCGCCCACAGGATGTGAACAACGCCCTGCTGTGGCTGACCGGCAGCTTATGGGGCCGTGACTGGAGCTTTGTGAAGATTGCCATCCCGCTGATGATTTTATTTCTGCCGCTGAGCCTGAGTTTTTGCCGCGATCTCGACCTCCTTGCACTCGGCGATGCGCGCGCCACCACGCTCGGTGTGTCGGTGCCCCATACCCGATTCTGGGCTTTGTTACTAGCTGTCGCCATGACATCTACCGGCGTGGCCGCCTGCGGCCCGATTAGCTTTATTGGTCTCGTGGTGCCGCATATGATGCGTAGCATCACCGGTGGACGTCACCGCAGACTGCTGCCTGTTTCAGCCCTGACAGGTGCGTTGCTGTTGGTGGTTGCCGATCTGCTGGCGAGAATTATTCATCCCCCACTGGAGCTCCCGGTTGGCGTGCTGACCGCCATTATCGGTGCGCCGTGGTTTGTCTGGTTGCTTGTGAGAATGCGATAAATGACTTTACGAACTGAAAATCTGACGGTCAGTTACGGGACAGACAAGGTACTTAACGACGTTTCACTCTCACTGCCAACGGGGAAGATCACCGCCCTGATCGGTCCTAACGGTTGCGGGAAATCGACGCTGTTAAACTGTTTTTCGCGGCTTTTAATGCCGCAGTCTGGCACCGTATTTCTCGGCGATAATCCCATAAATATGCTCTCATCGCGCCAGTTGGCCCGCAGGCTTTCGCTGCTGCCTCAGCACCATTTAACGCCAGAGGGGATCACAGTCCAGGAGCTGGTTTCGTATGGTCGTAATCCCTGGCTGTCACTCTGGGGGCGTCTCTCCGCTGAAGACAATGCACGAGTTAATGTCGCCATGAACCAGACCCGGATCAATCATCTTGCCGTTCGTCGGTTAACCGAGCTTTCCGGCGGTCAGCGCCAGCGCGCATTTCTGGCGATGGTCCTGGCCCAGAATACGCCCGTTGTATTACTTGATGAGCCAACCACCTATCTTGATATCAATCACCAGGTGGACCTGATGCGGTTGATGGGCGAACTCCGGACTCAGGGGAAAACGGTGGTCGCTGTGCTGCACGACCTTAATCAGGCTAGCCGGTACTGCGATCAACTGGTGGTAATGGCAAACGGACATGTTATGGCGCAAGGCACACCAGAAGAGGTGATGACCCCAGGATTGCTGAGAACAGTATTCAGCGTGGAAGCGGAAATACACCCCGAGCCGGTATCTGGCAGGCCGATGTGCCTAATGAGGTAGATTGCACAGGCCGTAAGAACCAAACCACGACTGAATGAAACTGGACTGGCGCCAGCAAGCCTGTTCAGACTGGGGCTGAACTTTTCCGGACTCTGAAAGATTACCAATACTCATCGTCCATCCGCTTGCTTTAGGCTGACAGGTTCATAATCAACGCAAACCAGAGCTGTACAGGCTTGGGCGCGGCTTTCAAACCA
GTCGTGATCACGGCAATCAATTTTGAACTCTGCTTAACGGACATTTCTGTATAACCCTTACGGCAACGAAAAACGCGAAGTTAAAATTTTAGAAACCCAAAAACGTGACATGACTAAGTTTAGATTTCAGGGGGGGAGATCAAAAAATTTCGCTCTGTGCCAGAGCGGACATTCACGGAGCTGGTTCATTACCAATGAGGTTGGGCTTTTGAGGATAAATCAATGATCAGACGCCAACGTAAATCAAAAGCACCCCTGGAACGGAATTGCTAATCCAGTTTCTGACCATCGATTTTTCTAAAAAGTGTGCGTTTGCTACTACTTAGGTAGGTGCAGCTTTCTTAATCACCGGCAGCCACGTTATACAGGCCAGTTGATGGATCGATTGTTATCAATGATATCTTTATGAGTCGGTGTCTCACCCAGCTTACCGAAGCTGGCATAAAGTGAAGGCAGACGGGCCCGTCCTTCTCCCTTTTTCGCCAGAGGGAAAGCGCGAAGCATGGTGGCAGCCTCCCAGTCACAACAATAGGATGGTGTGCACGGCTGCTGACGCCATGATTCAGCGATAGAGCCGGAAAATACGGGGTCAAAGCCGGTATCATTAACCAGAGTCATCGTGACCTGTACTGCGGCTGGATGATCTCCTGCTACTGCAACTGCTAGGCGACCGCGGCTCCCTTCAGGTAGTCTGTTGTGGTCAACTAAAACTGGCCTCCGCGTTAGAGTTTTTCCAGTATCGGTTTTCTGATTCGTTTGGTGGTAACCCACCATTATATTCGTGCGGTCTTAGTGCGCTGTAATATCCAACGATATAGTCCGTTATTGCGTGAGCTGCATCGCTGAAGCTTACATAGCCCGTCGCTGGCACCCATTCGTTCTTCAGACTCCTGAAGAAGCGCTCCATTGGGCTGTTATCCCAGCAGTTTCCACGCCGACTCATACTCTGCCTGATCCGGTATCGCCACAGTAACTGCCGGAACTGCCTGCTCGTATAATGACTGCCTTGATCGCCTGGAACATCACCCCGACGGGCTTACCACGGGTTTCCCATGCCATTTCCAGTGCTTTCATGGTAAGCCTGCTGTCCGGCGAGAACGACATGGCCCAGCCCACTGGTTTTCTTGCGAACAGGTCGAGAACAACGGCGAGGTACGCCCAGCGCTTACCCGTCCAGATATAGGTCACATCACCGCACCACACCTGATTTGGTTCCGTTACGGCGAACTGTCGCTCAAGATGATTCGGGATAGCAACGTGCTCATGACCGCCACGCTTATACCGGTGAGTCGGCTGCTGGCAACTGACCAGCCCCAGCTCTTTCATGAGTCTGCCAGCAAGCCAGTGCCCCATCTGGTAGCCTCTCCGGGTTGCCATTGTGGCGATGCTTCTTGCTCCGGCAGAGCCGTGGCTGATGCCATGCAGTTCAAGTACCTGGCTGCGTAATACAGCCCGTCTGCCGTCTGGTTTTTCAGGACGGTTTTTCCAGTATCTGTAGCTGCTGCGATGAACCCCGAACACATGGCAGAGTGTGACCACAGGATAATGCGCTCTGAGTTTCCTGATTATCGAGAACTGTTCAGGGAGTCTGACATCAAGAGCGCGGTTGTAGATTCAATTGGTCAACGCAACAGTTATGTGAAAACATGGGGTTGCGGAGGTTTTTTGAATGAGACGAACATTTACAGCAGAGGAAAAAGCCTCTGTTTTTGAACTATGGAAGAACGGAACAGGCTTCAGTGAAATAGCGAATATCCTGGGTTCAAAACCCGGAACGATCTTCACTATGTTAAGGGATACTGGCGGCATAAAACCCCATGAGCATAAGCGGGCTGTAGCTCACCTGACACTGTCTGAGCGCGAGGAGATACGAGCTGGTTTGTCAGCCAAAATGAGCATTCGTGCGATAGCTACTGCGCTGAATCGCAGTCCTTCGACGATCTCACGTGAAGTTCAGCGTAATCGGGGCAGACGCTATTACAAAGCTGTTGATGCTAATA
ACCGAGCCAACAGAATGGCGAAAAGGCCAAAACCGTGCTTACTGGATCAAAATTTACCATTGCGAAAGCTTGTTCTGGAAAAGCTGGAGATGAAATGGTCTCCAGAGCAAATATCAGGATGGTTAAGGCGAACAAAACCACGTCAAAAAACGCTGCGAATATCACCTGAGACAATTTATAAAACGCTGTACTTTCGTAGCCGTGAAGCGCTACACCACCTGAATATACAGCATCTGCGACGGTCGCATAGCCTTCGCCATGGCAGGCGTCATACCCGCAAAGGCGAAAGAGGTACGATTAACATAGTGAACGGAACACCAATTCACGAACGTTCCCGAAATATCGATAACAGACGCTCTCTGGGGCATTGGGAGGGCGATTTAGTCTCAGGTACAAAAAACTCTCATATAGCCACACTTGTAGACCGAAAATCACGTTATACGATCATCCTTAGACTCAGGGGCAAAGATTCTGTCTCAGTAAATCAGGCTCTTACCGACAAATTCCTGAGTTTACCGTCAGAACTCAGAAAATCACTGACATGGGACAGAGGAATGGAACTGGCCAGACATCTAGAATTTACTGTCAGCACCGGCGTTAAAGTTTACTTCTGCGATCCTCAGAGTCCTTGGCAGCGGGGAACAAATGAGAACACAAATGGGCTAATTCGGCAGTACTTTCCTAAAAAGACATGTCTTGCCCAATATACTCAACATGAACTAGATCTGGTTGCTGCTCAGCTAAACAACAGACCGAGAAAGACACTGAAGTTCAAAACACCGAAAGAGATAATTGAAAGGGGTGTTGCATTGACAGATTGAATCTACAGTAGCCTTTTTTAATATTTCATTCTCCATTTCAATGCGTTGTAGCTTTTTCCTCAGCTTACGTATTTCGATTTGTTCTGGTGTTATCGGAGAGGCTTTTGGTGTTTTGCCCTGACGCTCATCACGCAGTTGTTTGACCCATCTTGTCATTGTGGAAAGGCCAACATCCATAGCTTTGGCGGCATCTGCCACCGTGTATGTCTGGTCAACAACCAGTTGAGCGGATTCGCGTTTAAACTCTGCGCTAAAATTTCTTTTTTTCATTGGAGCACCTGTGTTGTTCTGAGGTGAGCATATCACCTCTGTTCAGGTGGCCAAATTCAGTAAACCACTTCACCACTTCAGAGCAGACGAGACAAAAAGAATGGTTGACGCATCAGCTCAACCCCATATAGTTGTAACACTTGAGCCAAACCCTTGGGCCGCTTTTTACTTTGATATTAACATTGCTAATACAGGGAACGCACCTGCCTATAATGTTGAGGTTGTGTTTGATCCTCCACTAGTAAATGCGGAGCATAGAGAAAAAAGTGAGATTCCGTTTAGTAAGGTAAGCGTGTTAAAAAATGGGCAATCACTTACCAGCAATCTCTGTAAGTATGAACAAATCAAAGATCAAATTTATAATATTAATATAAGCTGGGCAAGCAAACCTAAATCAAACGATAGAGAAACAAATGAATATGTGTATGACATGGCGACATTTGAAGGAATAAGTTATCTAGGAGCGAGAAGCCCATTGACGCAAATTGCAGAACAAATTAAAGGTATAAGAGAGGATTGGAAACCTATTGCACAAGGAGCTAAAAAAGTAAAAGCAGACGTATATACTTCAAGCGATAGAAACGAAGAACGCACGTATCTGCAAGAGCAACACGATTTGGCAATAAAAAGGAGAGATGAGAAAAGAGAAAAAAGATTAGAGTCTGGTGAATAATTTTAAAGGGAGTGGGTAACTAACCCACTCGTAACTATAAACCTGTAATTAATCACTTATTTTTGACAACAGATAATTACTGAACGCACTGCAAGTGACTAACATAAATTTAGCTTCAGCTAAGGTAGGATTTACATCTTCTTCTGTTAGCGCATGACGTATTCCACCCTGATCACTTGTATAACCATAGAGCTGACTAAAAGCGCCTTTCATTGCAGAGTGTATATATCCTTTTT
CCTCTATAGCTTTAAGACAAGCCCCCAAGGTTCCTTTATCATTGCCCGTGATTTTCCTGCATAAAGATTCAATTGCAGAGATAGACTCTTTAATCGAGTTTCTGTAGTCTGGCTGCTCTCTATCCGTCATTAGTTGTAACGCCCTTTCGAAATGGCTACGCGATGAATCAGTGCCATTATCAACTGCGTTCTGAACACTTTCAATTTCGTTATCATTTGAAATAGGAGTAATACAACCATTTATTATGGTATAACCAACGCCATGCTTTTTAAAGATGGAATTGAGATGCTTCGATAGATTAATATATGAATTAGTTCTCTCAATGATGAACTCAATTAAATCATATACCAAATACCATGCTTCCCCATATATATAATCTCGGATAGCAGTCAGCAACGTCTTATCACTTTTGTATCCACTTTCATAACGAGGAATATTATCCGCAGGTTGATTTAGATAATATATCCACACAGACTGCGCACATTTTGTTGCTGTAGCAGTTTGACGATTGTTAGTCCAAAGGAAAAGATATAAGCAATTCCACAATGCCATGCGTGTATCAGAATTAAGATCATTCAGCTGGACATGCTCTCTAACGTCAACATGACCATACCTCACAGAAAATGGCTTTATCAT
Protein sequences of DBSCAN-SWA_5 >NZ_CP040886|1510249:1516808|1511206_1511974_+|WP_000175457.1|DBSCAN-SWA MTLRTENLTVSYGTDKVLNDVSLSLPTGKITALIGPNGCGKSTLLNCFSRLLMPQSGTVFLGDNPINMLSSRQLARRLSLLPQHHLTPEGITVQELVSYGRNPWLSLWGRLSAEDNARVNVAMNQTRINHLAVRRLTELSGGQRQRAFLAMVLAQNTPVVLLDEPTTYLDINHQVDLMRLMGELRTQGKTVVAVLHDLNQASRYCDQLVVMANGHVMAQGTPEEVMTPGLLRTVFSVEAEIHPEPVSGRPMCLMR >NZ_CP040886|1510249:1516808|1510249_1511206_+|WP_000684856.1|DBSCAN-SWA MKIALVIFITLALAGCALLSLHMGVIPVPWRALLTDWQAGREHYYVLMEYRLPRLLLALFVGAALAVAGVLIQGIVRNPLASPDILGVNHAASLASVGALLLMPSLPVMVLPLLAFAGGMAGLILLKMLAKTHQPMKLALTGVALSACWASLTDYLMLSRPQDVNNALLWLTGSLWGRDWSFVKIAIPLMILFLPLSLSFCRDLDLLALGDARATTLGVSVPHTRFWALLLAVAMTSTGVAACGPISFIGLVVPHMMRSITGGRHRRLLPVSALTGALLLVVADLLARIIHPPLELPVGVLTAIIGAPWFVWLLVRMR >NZ_CP040886|1510249:1516808|1514911_1515262_-|WP_000747102.1|transposase|DBSCAN-SWA MKKRNFSAEFKRESAQLVVDQTYTVADAAKAMDVGLSTMTRWVKQLRDERQGKTPKASPITPEQIEIRKLRKKLQRIEMENEILKKATVDSICQCNTPFNYLFRCFELQCLSRSVV >NZ_CP040886|1510249:1516808|1515983_1516808_-|WP_000594911.1|DBSCAN-SWA MIKPFSVRYGHVDVREHVQLNDLNSDTRMALWNCLYLFLWTNNRQTATATKCAQSVWIYYLNQPADNIPRYESGYKSDKTLLTAIRDYIYGEAWYLVYDLIEFIIERTNSYINLSKHLNSIFKKHGVGYTIINGCITPISNDNEIESVQNAVDNGTDSSRSHFERALQLMTDREQPDYRNSIKESISAIESLCRKITGNDKGTLGACLKAIEEKGYIHSAMKGAFSQLYGYTSDQGGIRHALTEEDVNPTLAEAKFMLVTCSAFSNYLLSKISD >NZ_CP040886|1510249:1516808|1512531_1512789_-|WP_000177060.1|DBSCAN-SWA MTLVNDTGFDPVFSGSIAESWRQQPCTPSYCCDWEAATMLRAFPLAKKGEGRARLPSLYASFGKLGETPTHKDIIDNNRSINWPV >NZ_CP040886|1510249:1516808|1515362_1515935_+|WP_000227281.1|DBSCAN-SWA MVDASAQPHIVVTLEPNPWAAFYFDINIANTGNAPAYNVEVVFDPPLVNAEHREKSEIPFSKVSVLKNGQSLTSNLCKYEQIKDQIYNINISWASKPKSNDRETNEYVYDMATFEGISYLGARSPLTQIAEQIKGIREDWKPIAQGAKKVKADVYTSSDRNEERTYLQEQHDLAIKRRDEKREKRLESGE >NZ_CP040886|1510249:1516808|1513840_1514992_+|WP_001254876.1|transposase|DBSCAN-SWA 
MRRTFTAEEKASVFELWKNGTGFSEIANILGSKPGTIFTMLRDTGGIKPHEHKRAVAHLTLSEREEIRAGLSAKMSIRAIATALNRSPSTISREVQRNRGRRYYKAVDANNRANRMAKRPKPCLLDQNLPLRKLVLEKLEMKWSPEQISGWLRRTKPRQKTLRISPETIYKTLYFRSREALHHLNIQHLRRSHSLRHGRRHTRKGERGTINIVNGTPIHERSRNIDNRRSLGHWEGDLVSGTKNSHIATLVDRKSRYTIILRLRGKDSVSVNQALTDKFLSLPSELRKSLTWDRGMELARHLEFTVSTGVKVYFCDPQSPWQRGTNENTNGLIRQYFPKKTCLAQYTQHELDLVAAQLNNRPRKTLKFKTPKEIIERGVALTD |
7 | uncultured_Caudovirales_phage(16.67%) | transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_6 |
1858480 : 1878526
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP040886|1858480:1878526|DBSCAN-SWA AATGGCGTACTATAACATAGAGAAACGACTAAAATCCGATGGCACACCACGCTATCGCTGTAATGTGATTATCAAAGAAAAAGGTGTTATCACTTACAGGGAAAGCAAAACATTCCCTAAACATGCTCATGCCAAAACATGGGGCGCACAGAAAGTGATGGAATTAGATCTATATGGCATTCCATCATCAAATGCTGTTGACGGACTTACAGTCCGTGACTTACTACACAAATATTTAAATGACCCAAATGCCGGAGGTAAAGCAGGCCGTACTAAAAGATATGTGCTGGAACTGCTTATGGATAGTGACATCTCCGCGATCAAACTATCTGAACTGACAGAAAATGACGTAATTGAACATTGCAGGCTAAGAAACAACGCTGGTGCAGGCCCAGCAACAGTCAGCCACGATGTTAGTTATCTTGGCAGTGTTCTGGATGCGGCAAAACCTGTATACGGAATCAATTACACATCAAACCCGGCGAAAAGCGCTCGTCCATATCTACTTAAACTCGGTTTGATTGGTAAATCAAACCGTCGTAATCGTAGACCAGCATCTGATGAACTGAACATGCTCATTGAAGGCCTTCAACAACGATCTACTCATAAATGCTCAAAAATTCCGTTCGTTGATATCCTCAAATTTTCTGTGTGGTCCTGTATGCGAATCGGAGAAGTATGCCGGTTACGATGGGAAGATCTCGACCAGGAACAAAAATCTATACTAGTAAGAGACAGGAAAGATCCACGTAAAAAGGAAGGCAACCATATGAAAGTTGCCTTGCTTGGGGAAGCCTGGGATATCGTCCAGCGACAACCCAAAAAATCAGAATTCATTTTTCCATATAACAGCACTTCTGTTACCGCAGGATTTCAGAGGGTAAGAAGCAAATTAGGTATTAAAGATCTGAGATACCATGATTTGCGTAGAGAAGGGGCAAGTCGCTTATTTGAGGCTGGTTTTAGTATTGAGGAAGTCGCTCAGGTTACAGGGCATCGTTCATTAAACGTGCTATGGCAGGTATATACCGAACTGTATCCGAAATCTTTACATAATCGTTTTGAAGAACTCCAAAGGAGCAGAAATAAGACCTCTTGACACTGTTTATCCATACAGTTAAAAATAATACTGTATACAAATACAGTGTAGGGGACTTTTATGCGTATTGAAATCTGCATAGCCAAAGAAAAAATGACTAAAATGCCAACCGGTGCTGTGGATGCGTTAAAGGAAGAATTAACCCGACGCATCAGTAAACGTTATGACGATGTAGAAGTGATCGTAAAAGCCACCAGCAACGATGGCCTTTCTGTTACGCGCACCGCTGATAAAGATTCAGCTAAAACTTTTGTTCAGGAGACTCTGAAAGATACCTGGGAGTCTGCTGACGAGTGGTTTGTTCATTAAGTTCATCATCTCTGGACACCATTCACTTGTATTGACTTTAAGATCATTAAATTGTAATAATTCAAAGGGATGGGGCAGGTTTTTCCTCTTGCGCATGCAAGGGCGGCTCTGATAATGAACAGTAATATTTTACTGTTGAGCTTATGCTCACTGATCCTGCCGGGCAGGATGTTCAACGGTTCCAATATATCCTGCCCCTTCCGCTTTTAAAAAGGATCTTTTATGCATGACAATATTTGGTTTACATATAAAGCACGTATCCAAGCGCACCATCGACTAGAATGGCTTGAAAAACACTCTCAATTTATCCTCGTTTGGTATGCTATATTGAGTGCGGTACTTTCAATTGTAACGTTGCGATTTCCAAAGGTTCTAGGAGATAATACAGATGTCGTTGCGGCGATACTTTCAGTGGCTCTACTGGGTATTTCTCTGATCGTATCTAACCTAGATTTTCGTGGTCGAGCAATAGCCATGAGAAGGAATTATATTGCACTACAGCGACTCTATTTTGA
CATTACCACCAGTCAACAGTTATCTCTTGAACAGAAAGAAAAATATTTTAATTTGCTCAATGAGGTTGAGAATCACCGTGACATAGATGATAAAATTTCAAGGGTAACTCAAGTTGGACTTAAGACGAGGATCCCCACACAAAAAGAAAAAATAATTGTTATTTTATGGATATTACTTCGAATATTTATTACTGCCGCACTTTATATACTCCCATTAATATATCTTTGGATTGACTATGACTGCAAGCAGAATTTTTAAAAAGTCATTCTCGAAAAAAAATCTTCTAAAAGTATACTCTGAAAAAATCAAAGAATCAGGAGCGATTGGCATAGATCGGATTCGCCCATCAAAACTTGATTTGACAATAAAAAATGAGATCACTTTCATTTTTGAAAAGGTTAATTCTGGCAATTACAAATTTACAGCATATAAAGAAAAATTAATATCTAAAGGCGCTAACTCTACACCCAGACAGATTTCCATACCAACTGCTAGGGACAGAATTACTCTTAGAGCTCTCTGTGAATGCCTTACGGAAATATATCCTAAGTCCAGATTAAAACTACCACATACAGTAATTGACTCATTGAAAGAAGCATTAAACAACAGTCTATATGCTGAATATGCAAAAATAGATCTTAAAAGTTTCTATCCTTCAATTGAACATAAATTGATAATTAATGCAATAAAAAATAAAATTAGAAAAAAAGAAATTAGACAGTTAATAACATCATCATTAATCGTGCCTACTGTAAGTGGAACCACAGGAAGCAAAGGTATCCCTAATAATACCAGAGGAGTACCTCAGGGATTAGCGATATCAAACATTTTAGCTGAAATATCACTATCTAATTTCGATGATGAAATCAATAAAATGCATGACATATGGTACATGCGATACGTTGATGACATTCTTATTTTAACACCAAAATATCAAGCAACAAAAATAGCTTCTCATATCATTGATAAGCTTCAATCATTAAATTTAAACCCACATCCATTAAATGAAGAGAACTCAAAATCCAAAGTAGGCAGTTTGGATGAAAGTTTTAACTTTTTGGGATACCACATAGAAAATCGAGAATTATTGATAAAACATGAGAGCATTCTTAGATTTGAGTCATCCTTAGCAAAAATTTTTACTGCATATAGGCACGCTCTACTACAAGCTAAAAGTAAGCGTGATAAAGAACGAGCTGTTGCATATTGTCAGTGGAAACTAAATCTCAGAATTACGGGATGTGTGTTTGAAGGTAAACGATTGGGATGGGTATCGTACTTCTCACAAATAACCTCAACAGCTCAACTTCGCTCTGTTAATCATACTATCAATAATCTTATCCGCCGATTCGGCCTTTCATCAGAAATAAAACCAAAATCTTTGATTAAAACTTTCTATGAACTCCGCAGAGGTAGAGCGGAGACTTTTAAATACATACCTAACTTTGACAATCTACATATATCTCAGAAACGAGAACTTGTTTCTATGTGGATAGGTAAAGAGAAGGAAAAAAAACTTAGCAATAGTGAAATAGAGAGGAAGTTTAAATTTAAAATTGCGAAATCAGTAAAAGAGCTTGAAGAGGATATTTCAGGAATATCATAGATATGTAATCCATTAAGTCATTAAAATATATCGCAATAAACACACTATTTAAACTCACAAACCAGCCGCAGTATCCTGCCATGGCAAGTTGCTGCGGCTTTTTATGTTCAACGGATCAACAGCCAGATCAGAAGACACGCTACCCACGTTGTAAACGAACGAGTAAAGAGCGCCGCGCGTTGTTTCCGGTATATCGACTTTGATGTACGGGTTAATTTGTCTGGCGACCGTGGCAAGGTCTTTATTCAGGAGGGCTTTGCATTCTGCTTCGGTATACGTTTTACCTAGCATGATGTCTTTTCCGGTGTGTCCGTAACATACAGTCCATACGCCAACGATATCTTTGTATGGTATGTAGCTGACACCTTCCAGGCCATCGTCACCACTTGGGCC
AGTGATTAACACTGATGCTATAGCAATTGCTCCGCCACCAATAGCAGCAGCAACTGCTTTTCGTAATGATGGAGGCATTATTCACCTCTTGCAGCCTTGCGCTTATCTTCTTTAATCTTGAAATAAAGGTTTGTCAGGTACGTCAGCAGGCCAAATACCAGGCTACCCAGCACACCTATTGCTGCCCACTGTGAGGGCGTGACTTTATCGAGCAGCTGTAAAAACCAGTAACCGGCACTACCTGCTGAGGTGCCATAGGCGACACCCGTTGTTAACTTATCCATGGATTTCATAACCCCACCTCGCAGATGCGGGTGCTGTGTAATGGAAATAAAAAGGCCACCTGACGTGGCCACCAGATTATTTCCCCACCAGCTCGTTTATCTCTTTCACTGTCTGGTTAAACCGCTCTGACTCAAGCTCAACACCTAAGGCCCGACGCCCCAGCGCCATTGCTGCTTTTATTGTGGAACCGGATCCCATAAAAAAATCAGCAACCAGATCACCAGGTCGACTACTGGCATTGATTATTTGCCTGAGCATATCCGCCGGTTTCTCACACGGATGTTTACCCGGGTAGAACTGAACGGGTTTATGCATCCAGACATCGGTATAAGGCACGGAGACTGATACGGAGAAATAGCGCCGGAGAGATTTAAACTCATCCAGCAATTCAGAATATTTGCGATTCAGTGAATCATAAGATGCCACCAGCTGGTGGTGTGGTTGTTCCAGTTGTTGTTCCTGAAACTTCTCTGCCGCTATACGGGAAAACAGTGCCTGTAACTTCCGATAGTCAGCCTCATTCGGCAACTGCCACTGACTGGCACCAAACCAGTGGGAAACCATATTTTTCTTACCTGTGGCTTCGGCAATTTGTTTTGCCGTTATACCCAGTTCGGCACGAGCATCCCTGAAATACGATATCAGCGGTGCCATTATGTGCTGTTTGAGTTCCCTTTCTTTTGCCGCATAGCCGTCACTTTTGCCGCGATATGGCCCCTGGTAATGTTCAGCAAACAGAACGCGCTCTGTGGCAGGAAAATATGCGCGCAGACTTTCTTTATTACACCCATTCCAACGTCCGGACGGCTTCGCCCAGATGATATGGTTAAGCACGTTGAAACGTTCACGCATCATGATCTCAATATCAGATGCCAGGCGATGCCCACAGAACAGGTAAAGGCTTCCGGCAGGTTTTAACACCCGCCAGAACTGGGCCAGACAGTGGTCCAGCCACTTAAGGTAATCTTCGTCCCCTTTCCACTGATTGTCCCAGCCGTTGGGTTTCACCTTGAAGTACGGCGGATCGGTAACAATCAGGTCAATGGAATCATCAGGCAGGGACTGAATAAAATGCAGGCAATCAGCGTTGATTAAATCAACACTGTTTATTTTTACAGTATTTTTCATGGATCAGTAAGCGTAACTCTGGTAGGCTCACTCTGCTTTTGCGCTAAAGCAGTGGGCCGTGGTTCGCTTGTGACCAGTAAGCATGAGCGAATGGCTGGCAGGTGCTACCAACACCCACCAGCCGCCCATTTTCACAAATTAAAAGTCCTTCATTGCTGAAGGCGTCTGTAACAGCCGAACTGGTAATCTGCCAGCCCCGCCATAACCAACTGGGTCAGTATTAACTGACAGCGTTCGCGTGAAAGATATGTGTTTTGTGCAATCTCCCCGACTGTTGCCGGTTCGATGCTTAATTCATTAAAAACAACTTTCGCCGTTTCTGTCATATCTTGCTGTTTTAGCATGTCTTTTTTCCTTCTGGTTAACATGACATACCAATAACTCTTGTCTAAAAAGCCAGCAAGATAAAAAGTCAGTATTCACGACCACCAGCGTGTTTACCGTACTGCACCAGGTTTACAGGTACAAAAAAACCCGCTCTACGGCGGGTTTAAGCTGTGTGGCGAAGTAACCACTCTTAACAGATTATGATAGTTTTTGCGTACGCGTTAGTAATTTATTATGCTCTTTACAATTGTTAGCTTAAC
CGCATGGAATTGACCAAAAAAATGAGCAAAGCAAGCATACGCCACGTAATCGTTCATGAGCTCTTAAAAGAATCTAATAAAGACTTCGATCACTCCAAACCATACAATCTTCGTGATACAGAACTAGATAAAACAAATGATATAGTAAAAAAATTAGTAGACGGTGTTATTGATTTGTATGGTTCAAAAGGGAACTCAGCGCATTATGGTGTTTTTATTAAAAATAAAACAAAGCAAGGCCCTATACCAGAACTATTTCATAAATACTCTTTAGTTCAACAATCTGTTTCAAGTGATTTCATTGAATTATCGAAGGAAGTTATGAAACAAATGTATAAATCTGCTCAAGAGCAGATTTGGGCTTCTGGAGGATATGTTGTTTTTACTGATTATATTTTATCTGGTTTCCGTTATCTATTGGTTACAATGATCAAAAAAACTAATGGCGTAACTATTAGTGAAAATTTAGAGCCAGAGGAAATGATTCACTTAGAACTTGGTAATATTAACCAAGCAGCAAAAATAAATTTCAGATATTATGAAGAATACCAAAAAGCAGATGACTTAAAAAAAACAGACTTAAGTTATCTAAGCTTTATAAGCAAAACTACGGGACAGTCAGCGGCAGCATATTTTATAGCAGCATTAGGATGTGACAAGGGGATTGCTTCAGCAGGTGCAACCCGTAAGTTACCAGATGAAATAAGGCGTTTCTTTAAGAAAGAACCTCTTTTAAAAAATCAAGCAGAGTCATTTAGAAATGATGTTATCAAATACTTAGAAAAGCAATTTGACAACGAGCACTCTGCAAGGCTTTCTGATATCGAATCGCTTGCTTCAGGCCATATGTCCTATTTAAAAGAGGAAGAAAAAACAGAACTTGTTGATAAATTAATGAAACACCTCAATAGTGAGGAAGTCAGAATCCCATCAGAGTTCGTAATCAATAAAAACTCCTTAGATAAAATCAGCAATGTGATATATAAAACCCCATCATTGAGCTTTCACTTCGACAAGGATTTACTCGGTGTCACAACTGATGCTAAAATATATTATGATGACGAAAACCAAAGCCTAACATTTAATAATTTGCCTGTTGAAGCATTAACTAAGATAAGAAGAGCGTTGAAAGAAACTGATAACCCAAGTAATGAAGAAGATAAAGAATGAATGATTTTAGTATAATAGTTAATCTGTATAGATTATCAAGCTATCCTCATTTTGACGGGGCTAAGTTTTCTGCGCGTATAGCTTATAATGCAGACGTAAAATCATTGTTCAAAAGAATTTTGAACCCTACTTTTCAAGCTGGTACAGCTGACGAAATAGAGGTGGATGGTCATTTAATTTATGATTATGAAGACTTTCCTGAAAAGGGAAATTTTCTTACATACTCGTTTAAAATTTCACAAGGAAGTGCGAATCGTTTTTATAAAAATAAAAACGAGTTTGTAAAAATAAACACGCTCAAGAAAGGCATAATGCCAGAGTATTTCTATATTATAGAGGATGATTTCTATTCATTAGAAACACCAAAACCTTCTTATATCCAAAAAATTGAGGACATTTGTGAGCTAATCAATGCTCTTTCCATGCTTGCTCATTTCCATGATATAAAAAAAGATAGCAAAGGTACATTTTATCGTTTAGTCTTTATTTTAAACTCAGAGTCTAAATCTTCTTCTGCTGTAATTGAAACAAATATTACAGAAGAAATTTTTAATGATAAAACAGTAAATACTCAGTTAGTTAAAACATTAGTAAGTAGTGAAGCTACTACTGATGCCCATCACATTGAAAAGATTAACACTTTCAGAAACACAGTTATTGAGTATGTTAATAAAAATGGAAATTCCTTTGTCGAGTTAATTAACAAGTGGGATTTCATATGCGAACTTTATACGAACAACTTAGCTGCTTATATGTCGGCATTTTCTTTTCATAAAGCAAGGAAAGAAGTTGTTGATGCTGAACTCGATTACTCAGAAAAACTGTCAAAA
ATAATTTCAGAAATTTCTAACAAAGCTCTTGCAATACCTATTTCACTAGCTGGTTCAATTGCTATTTTCAAATTAACAACAAAAGCTGATTGGATTATTGCTTTAATTGGATTGATTATCACAGCAATAATAACATCTGCAATGATTGTGTCACAAAAAAAACAACTTGCTCGTATTTCACACTCTAAAGAAATACTTTTTGGACAATTAAGATATAGAATAAAGGATGACACCAGCGATCTTAAAGAGAGCTTAGAAGAGGCTATTAAAAAATTAAATGACAATGAGGATTTTTGTCATAAGGTGCTTGACAGTTTATTATCACTAGCATGGATGCCTACATTCATAGGCATCATCGGTATTTTATTTAAATTAATGCCAAATATTACTTGAGCATGTACATAACCCCATTAATGAAACCTAACGCAGTCTGCAATTGCTTCCGGATTGTTCCATCTGAGCATCTGCGTCTCTTCGCAATAGAACGCAGTGAAATACCTATAACGAAGTGAGCAATGATCAGCTCATATTCGTCTGGTTTATACTTACACAGGCGGGCAACACAGCCGTCTATCATGATGCCTTCATCATCATCACACTGAAGGCGTATTTTTTTACCATGAGGTAAAAGCCCCTTAAAGCCTGCTGCTATCGGCTGCCAGTCCACACCACTGTTATCTGCTGCAGCCCACGCTCCCCAGCGGTCTAAAACTTCATACATATCACGCATCAACTTTCTCCACAAAATCAGGCCAGCACGCCAATTGCCAGCGCACGATCGATAAAACGAAATATCAGCTCCAGCTGGGAGCCATACTTCTCTTCAAATGCCACGGTATCCGCATGCAGCTCGTCGTGATGCTTTCTGCACAAAGGCAACACAAAAAGGTCATGCGCTTTTGTTCCCATTCCACCCTGACCGTGACCTATCAGGTGGTGGGGATCATCAGCGGGCTTTCCACAACATGCACACGGCTGTGTCTTAACCCAGCGCGTGTACTTTTCATTAACCCAGCGGCGACGTTTGGGGCGTAACATAAAAGACTCCGGCGACTCCGGATCCACTTTCAGCGCCAGCACCTTTTTCGCCTTATCCTGGATGATGCTGGTGGCAGGAACCGAAGGCACAAGGTCACTTTCCCGGGTAACAGACGGCACAACAGGCTTCGGTAATCTCAGTGCCTTACGGGCTGCACTTTCCGGTAAGGCATCCGCCAGATCATTACGAATCAGCCACCAGCACAGTTCCGGCATTGTCACAACGTGACTGTCATCAAAACCGAGATCCCGACGCACAACAGACAACACCCAGCGGGCACAGTTATCCGTTGCCATTGATTCCAGCCGTTCCGTGAACTGATCGCGCAGCTGGTTATCGCAGTGCCAGCACAGACGGATTGCACCCGGAGCGTGTCGCATTGTGGTCATGTTCTCGCTGTGCCAGTCGGAATGAGGCCACTGGCAGCCTTTTTCACGAAGTAACCAGCTTTCAAGACATTCCACGCCACCAGCACGACGGATCACTGCCTCATTGCGGAACACGGCCCGAACGGCAGGATCATCCGCCAGCGGTTGTGATGCCGCCGGAACGGCACCACTGGCGAAAGATGAATAACGCTCCGGCTCAGGCTCCAGCAGGACACGCCCCTGCATAAACAGGGGCATCAACTCTGAACCTGGCCTGAACAATACGATCCCCATACGCGGGGCAATTTCAGGGGTCAGTAGTGCTCTCACGGTCACCTCAATGAACGGTATCGAGCAGCTTTAACAGCTCAGGGAATCGGGATTCGAAGAAATGCGGCTGCGTCTCGCGCGGATTTGCGGGACTGGTGATGTTCTTGCCGAACATGCAACCTTTCGCTGTCAGCGACCAGAATTTTTTGATGTTGTTAATCGCGGTACGACTGTATCGTTCGCGCTGCTCGACGATCCCCAGCTTCACCATCTGGTGATATGCCTGATTAGCTGTCAGGCGGATACCATACTGCTTCAGCAGT
GCACTCAGTGACAGCGTGGGGCGGCTTGAGCCATCAGGCGCGTCAGCAGGAGCATCAATGGCATAGCGCGGTGCCAGATTCGGTAAGCCAACAGCCTCCTGGAGTTTCTGACAGGCACCAAGCACTGAAGAGTTAGACAGGTTTAATTCCCGGCGCATAAAGTCCAGCAGAATCACACCAGCCTGCATCTTGTCAGCAGCCTGTCCGGATAATTTTTCCGGTGCGCTGGTTACCATGTCGAAAGTACGGATCACCTTAAGATGGAATGACGGGCTTATCCACATTGCATAGGCATACACCAGTTCTTTGCAGACATACGTCCCCTGGTTATTTCCGCCACGAATAACGTTAACTGGCTCTATATTGACCGAGTTGCAAATCTGCAACTCGCTTATTAAACGTTCAGTTTGCTCATTGCGGAGCCAGAATGCAGGCTTATGCTTATCCAGAGAACCGGCAGCCCTGTGCAGATCGTTCAGGCTGTAACGCCCATAAGCATCACGACGAACTTCAATACCATCAATAACCATCAGATTATTCATACTTCGTTTCTCCTCTTAATCAGGCGGCTGCACCCGCCGGTTTCTCATACTTACTGATAGTGATCTCGACCTTCCCTTTCGGGATAACCGGTCCCCACTCCACCAGCATTCTTTTCACCTGTCTGTCGTCTTCCCACACACCCGCGTGGGTCAACGCGTCAAACAGCGCCTTGTTATAGTTGTCCAGATCGCGGATCCGGTTATCCGGAGGAAACAACACGATCTCCACTGAAGCAGGTGCCGACGTTGGTTTCGGCAGACGACGTAACTGCTCAACTATTGCTGCGCACGCCGCGCTCTGAAATTTTCGCCCCGCCTCGCTTATCAGGCTCTTACCAGCAAATGCCCCTTTGTTGGGGTGTCGCCAGTACGTGTTCACACTGGGCGGGAAAGGAAGGATCAACTTCATACTTTCAGGCCCCTCTCATGTAACCAGTGGGCTGCACGCAACCTGGCGTTCTCCTCACCGGCAAGCAGTGCGCGGATGATACCGACCGCCTCGCTGTCGTCGTCCTTCACTGCGGTATGAAGCGTGATCCCCCGGGCCACGCCACGCTTTATCGTGATGACGCCTTTTTTCTCCAGTGCGCGAAGATGCTCCACGGCTGCATTCACTGAACGGTATCCCAGCATGGTTGCCACCTCCTGATTGGTTGGCAGAAAGCCACGCTCTTGCTGGTAAGAAATCAGAATATCCAGCACCTGCTGCTGGCATTGAGTTAACGTCGTCATGCCGCCATCTCCCTAACCAGTTTTTCCGCCTGCTGGCGAACCTGCGCCAGAAACGCCTCACCACATGCCTCAAGTTCATCGCGCCCGATGTAGCTGATTGCCGGTCCCTTCCAGGTCTTATCGAAAACAGCAATAGCACCAGCGAAGAAAGCGCCTGTCGGCACCTGCTTCTCATCCTTCGGGATAAACCAGGCAGGCAGTTCAAAACCAATACGCCCGCGAATAAAAGCAATATGGTCCGCATCTTCCGGCCACCACACTTCGCTGGTGGCAGCTTTGATCAGGAAAACATAGCGCCCACCCTTATCACGCATGGCACTGGCATGTTTCATGATGTAACGCATGCCGGTGATGTATTGCCCCTCATGCTGACTGGCGCGGCTGTATGGGGGATTACCAAAGGCAGCCCCTTTAAGCTCCGCAAGACGTTCTGACCAGTCATGCGCCAGCGCGTTGTCTTCCGCCGTGTAATACGCAGCACATTTGGCGTTATCACCGTCAGTGAACAGATCCAGAACAAACGGGCCAAACAGGGTGTTAATTCCCCAGAAAATGTTGTCCGGCGTGCGCCACTGATCGCCCACTTCCTTCAGTTCATGGGCTGGTTTGTTCCGCAGTTCCACCAGCGCCTGGCAATATTTATTACTCATTAAGCCCCCACGTAATTCCCTGACAGATACCACTCATCACCCGGTACAGCGCGCTTGCTGCTTTTCCGTAAACACC
GCTCACGACGCGCAAGAAAATTGTTTCGCTCTGGCTGGGAGTGGCTTTCACGGAATGCCGCCATCCACACCGTTGCAGCACGACGGTATAAGCCCCTCGACTCCAGTTCTTCCGCCTGGCGGGTCAGGCACAAAATCACCCGGGGATCGTTAGTGCCGACATAGAAATTGCGCACAGGTCTGGTTTCACGAACTGGTTGCGGTTCCGCCTCCTGCGCTCTCTCAGTCAGGCGCGGGAAATGTCTGCGTGTATCCCCTTCACAACGGTGAGCCACACGACCACTCTGACGTAACTTGCTTGCTGACTGCAGAACGCGCTGCCGTGAGTAACCTGCAAAAGCATCCGCAATGTCTCCGGAAGTACACCCCGGATGGGCTTCAATGTATTTCTGAACTTCATTCAAAAGACTCATGATCACCCCCTGAATCCTGCCGGGATCTGGCTGTAGTCCACGTTGTCGTAACTGGCTTTGAAGTACGGGTCCTCGCGTCTGGCTGCAGATACCGCAGGAACTTCCCAGGATTCTTCGAAATGACGATCCGGACCAAAGAACGTGACAGCCTGTTTCACAAATTGTGTGCCGCTGTTACCCATCGCAGATACCCAGCCCGCGTAGCGTTTCACACCTTCCAGCATGGTTTCGGGGTTTACCCCCTCATTCAAACGGGCTTTCCAGGCTTTGAAGGCTGCAGATTTTGAATTGCCACCAGCACGTTTGGGATATACCAGCCATGCCTGCTCAAACTCCGGAGAGTATTCCGGTCGGTTTGAACGAACTCGCACGGACTCATCAACTGATGCACCAACAGCTATTGGTTCATTGACTGGTTCTTTGACTGGTTCAAAAGAGTGACTGGTTCTGGGTGAATCTCCTGCACTACCCCCTGGTGCAACTCCTGCACTACCTGGTGAATTTGCTGCACCAGATAGTGAATTATTTGCACTACCCCCTAGTGAATCTCCTGCACCATCCAGATGAAGGAGATAGATATTACTTGAGTTACCTTTTTCACCTTTCCGGGTGACTTTTTTTACCAGCCCGGACTCACAAAGGGCCGCAATATGATTCATCACAGAACGTTTGCTAATCTCGCACTGGTCAGCAATATGCTGGTAGCTGGGCCAGCACTCACCCTGATCGCTGGCATTATCAGCCAGCTTGATCAGAACCAGTTTTCGCAATGGATTACCCACTCGAATTTTCATCGCTTTAACCATCAGCTCCATACTCATGCTGCACCTCCGAGATGCTTCATGTTTTTTCCGGAGCGAAAGGCTATAAGCGGCATACTGACGCGGTAATTACGGCCCAGCGGTTCACAAATCACCTTCTGGCATTCACGGTCAACCAGGCTAACACGTAGAACATGCCCTGCAGGTGTGGTGTACCACTGCCCAACTGTAGGAATTGATGTTTTTTTACGCTGAAGCAAACGGCAAATATTGAGGATCAACGGATTAAGCATGACGATGCCCTCCGCTGATATTCAGGAGACGGTGAATATGAAAATTAGCCTTATCCGCCAGACGAATACGTTCAGCCTGCAAGTTAAGAAGGGTTTCTACCAGAACTTGATGCGCCTGCGGATCCGAAAGAGTTACCTTGCGCAGAGCACGTAGTGCAGTTGTTACATAACTGAGTTTATGTAAGTCTTCATCATTCAGACGAGTGAGGGCTGGGACAGTAGCCATGATGGCAGCCTCCGATAACAGTGAATTACCTTCACCACCGGAAACGCCAATTTCGCTGGTGGTGAACTGAACGGGGTTGGCGTAACCGGCGTTATCGGAAACCGGCGCACCTTTCGGTGCCCCCGTCCAGCCCACCATAATTTGGGTGTGCACAGACGCAGACGATAAAAAAGACGCTGGCGCGTCATATATCGCCGATAACATTTCCAGGACGCCAATCCCGGCACCCGCTTTATAAGGTGCCTGAACAGTGTAACGTCCCGGAATGGCAGAATCAATGTGCTGGTGGTCCTTCACACTCAACAAAA
TCACGCCTGAATTTCCACAAAGGACTAAAGCACTCATGCGGGTAGTCTTTGCGAAGATAGATAACGCGCTGTGTTTCTGGCTCCCAACGAATAACATGGACATAAAGCCCTCTTCCGTCACGAAACCAGCGGTTAAGTTCCTGCACAACTCGCCCCCCACAGTCAGGTAAAGTTCTCTGTGGTTACTTACAGCCAGGTGATTTGGTAATCTGCATTCATGCCGTAACAACAGGTGTTCAGCGACACTGACCACCAGCTGTTGCGACAAACGGTTATTTGCCGTTAAACTGTTCATGCGTTAGTTTCTCCACAGACACAAAACGCCACGACGCCCGGAGCTGCACACTCGCGGGCGTCACTCTTTTCTGGAGCGCAAAAGATTTTGTAGACCAGTGCTGCATGCTCCTGGAGCTTCGAAATTGACAGATACAACTCATCATTAATTGCTGTCTGCTCGTGTGGCTCCACTACCCCATCTTCGATTGCCGAACGAATCTGCTTTGAGTAACTCCCGATCTGTTCGATGACTTCCAGCAGGCGCTGGTTTATATCGGCGTTCTCTACTTCCTCAATTTCAGGAAGCGATACAAACACCCCACCAGCAGACTGTGCGACAGCATCCGCAATGTAGTGAGTGCCAGCCGCGCGCTGTAAAATCATTGCCCATCCCAGCGGGAAAATCTGATCGCCATCTGCACGAAGGCGGTTGAATAAAGCGTTCTCTGTTACATCCAGCCACTCAGCAGCTTCAGCGTAACCCCCCGGCAACGCCGCGATAGTTTTTCTGACAGCTTTCACGTACCACTCAGGCTGTTTTTCTACTTTCCAGTGATGCTTACCCACGGTTAGCCTCATCGTTCTGTGGTTAAAAATTGAAGGTGTTCTGTTAATCTTTCGGATAGATATCCGGTCTTAAGTCAGATTTCGTAATTGCACCTGACGTGCATTGCTCAAGTTTTTTAGCCAGCACAAAACTGGCTTTTTTATAACCATTGAAAACCAGCCGTAAGTAGCCTGGTGTTGAGCCAACTTTTCCGGCCAACTCACCCTGCTGTTCTTTGGTTAAAGAGTCCCAATACGCTTTCATACAATATGTACCTCCGGTATACATATTACATGATTGAGATGAACCTTCAAGATACTTGTACCTTATCGGTACAAAGGTTTTAATTTCTTTATGAAAACAGTCCATGACATCCGGCGGTCTAACGCCAGAAAACTGAGAGATGGTGTTGGCGGGAATTCTTCCTTTGCCACCATGATTGATCGCGAGCCAACCCAGACCAGCAGGTTTATGGGAGATGGTGCAACTAAAAATATCGGTGACAGCATGGCACGGCACATCGAAAAATGTTTCGACCTGCCTGTCGGATGGCTTGATCAAGAACACCAGACAACGAACATCACAAAAAAACCTGACGTTTCAATCACTAACAAACAAATAACGTTAGTCCCTGTCATATCATGGGTACAGGCCGGAGCATGGAAAGAAGTTGGCTATTCTGAGGTTGATTTGAGCACAGCAGAAACTTATCCCTGCCCTGTACCCTGTGGCGAAATGACTTATATCTTGCGGGTGATTGGTGATTCAATGATTGATGAGTACCGCCCGGGAGACATGATTTTTGTTGATCCTGAAGTCCCTGCCTGCCACGGTGACGACGTTATTGCATTGATGCACGATACAGGCGAAACCACCTTCAAGCGGTTGATAGAAGATGGAACACAGCGTTACCTCAAAGCATTAAACCCAAACTGGCCTGAACCTTACATTAAGATCAACGGTAATTGCTCTATAATTGGTACAGTGATTTTCTCAGGAAAACCAAGAAGATACAAAATAAAGGCCTAATCAATATTTATAACCTGCTTCGGCAGGTTTTTTTATACTTGACAATGTACCCTTAAGATACATAATGTATCTATAGGATACATAACACAGGCAAGATTAAACTAAATTTGGTTGTAACACGGCGTATGGCACATGCGTCGTTAGCGGT
CTGGGGACGTTAAAGGGGACAATCCACTCCTTGCTCGGGCAAACAAACCAGGTAGCCGGAATGTGCAAGTCAATGATGATGCTGATAAGACGCCTAACCAGCGTGGCGATCCGGTTTGACGCCTGGGAAGAGACCAGGGTGCAACGATGAGGGCATTTATGGAACCGCGACAAAGTGTGGTGCCGTAACTGGCTAAGTGCTCTCAGCGTTGTGGTAATCCGCGAAATGGCGCGGCGGTAAGTATGGCGGGGTTACTCTTTCCCCGTTGAGGACACCGGATTGTCAGGTTGACCATACGCCTGAGTGACAACCCCACCACAACAGCCACTGCTTTGGCGGTACCAGTTTGTACACTTGCTTCCGGCTGGTACCGCTCTTTTTACAAAACAGAGAAGAGCATCACCGGACGACGGGCTCATAACCCAATCCATCCGGGCGGCTGCCACCGCAGGTGTTCTTCTCTGTTTTGTGGAGAAACCAACCGACCTTGCAGGGTCGATATGATGAGGAGCAGCAAAATGGCTAGCGAACGCAGTACTGATGTGCAGGCATTTATCGGGGAGCTGGACGGCGGCGTATTTGAAACCAAAATCGGCGCAGTTCTCAGTGAAGTCGCTTCCGGTGTGATGAACACGAAAACCAAAGGTAAGGTCTCACTCAACCTGGAAATCGAACCATTTGATGAGAACCGTGTGAAAATCAAACACAAACTCTCATATGTTCGCCCGACTAACCGCGGGAAAATTTCCGAAGAAGACACCACCGAAACGCCGATGTATGTCAATCGCGGTGGTCGCCTGACTATTCTGCAGGAGGACCAGGGACAATTACTGACTCTTGCCGGTGAACCTGACGGAAAACTCCGCGCAGCAGGTCGTTAATATCGTTCTTAATAAACTGATTATTTATCTCATCACTGAATATTTTTATATAGTGAGGACTTATTATGTCTCAGAACTTAGACGCAACCGCAATTAATCAAATCCATGCCCTTATTTCTGCTCAGGGTGTTAATGAAATTATCAGTAAGATTGGTGCCGATGCTGTGGCATTGCCTGAGAATTTCCGCATTCATGATCTGGAAAAATTTAATTTAAATCGCTTCCGTTTCCGTGGTGCGCTTTCCACTGCCAGCATCGATGACTTTACCCGTTATTCTAAAGTTCTTGCAGATGAAGGCACCCGCTGCTTTATCGATGCCGATAATATGCGTGCCGTCAGTGTGCTTAACCTGGGTACTATTGATGAACCAGGTCACGCAGATAACACCGCCACTCTCAAACTGAAAAAGACAGCACCGTTCTCTGCTCTGTTGTCTGTTAACGGCGAGCGTAACTCCCAGAAGTCACTGGCAGAATGGATTGAAGACTGGGCCGACTACCTTGTGGGCTTTGATGCTAATGGTGACGCCATTCAGGCAACCAAAGCGGCTGCGGCGATCCGTAAAATCACAATTGAAGCGAACCAGACCGCTGATTTTGAAGATAATGACTTCAGCGGCAAACGCTCCCTGATGGAGTCTGTCGAAGCGAAGACCAAAGACATTATGCCAGTGGCATTTGAATTTAAATGCGTTCCGTTTGAAGGTCTGAAAGAACGTCCGTTTAAATTACGCCTCAGCATTATCACTGGCGATCGTCCTGTACTGGTTCTGCGCATTATTCAGCTGGAAGCGGTACAGGAAGAAATGGCTAACGAATTTCGTGATCTGCTTGTTGAGAAATTCAAAGACAGCAAAGTAGAAACCTTTATTGGTACTTTCACCGCCTGATTTCATTACTGCAAATGCCCCTGCGGGGGCGTTTACGGAAGCGATAATTTTAACTATTGCCGCCCCTATAAAGAACCATTAAACAATAACGTGACGAAGCTATTGATAGTAAATGAAGCACTTGCAAAATAATTGCATTGCGTATATATACACATGTGTTGTATTACAGCGATAATGGTAAAGCAAATGATTAACTCTGAAGCAATTGAGCAACTAATGTGGCTATGGT
CCTTATTTGACATTAAATTCTTATCTATTCTTGCCGCTGCCTTCACTATATATTTTGGCGTGCAAAAAATATCAAAAAAGGTGACAGTGTCGTATTCAGCAAATGCAAGTAGAATATATGACATGCATATATCAACCATAATCCTGAATAATAAAAGAGATAATGCAATTGCTATATCTTCAATCAATATGGAGGTTGAAGGTAAAGGGATACTACAAGTTATTAAATTTGACTCCCCTCTTCTTTTAAAGAACTATGATTCTTTAAAAGTTGAACCACCAAAATTTAGCAGCCTTTATAATAATGATGGCGTAGTTAAGTTAGATATTTATGATAAGTTTCATTTTTATATAATCACGACATCTGGAGATGAAATTAAATGTATTTCTGAAAATAAATATGTGGCACCAAACATGGAAAACAAAATAGCTACAGACATAAGAAAATTTAATGGCATTGTCTTAACAAACAGAATGTCTTATATTTTTTTCTATGCAAATGACAACAGAGAGAAATACTGCATAATAGATGTTTCATTGTTCATAAATGGTGACAACCCATTTCATTTTAATTTTTTAAAAGAAGATGAATTAAGAGATTTTTCTAGCATCCTTATTAGTTACGGATATCACCAACAGTTTAAAAGTTATGCATTGTTTAAAATAGACAACCATCTTGCTCCTTCTTTGGTTTTAAATAAATCAATGATAGAAAATAATATTATTGAAATGAATAAGTAACTCACCGGGTGCAGCCGGTTATGATGGAGAAATGATATGAATACCTTGTTTTTACTGATGGCTGAATTCAATACCCCAAACATTGAACTCTCAGCAGTTAGTCAAAAATACTTTGGTATGAGTCCAGCCACAGCAGAAGCAAAAGCAAACGCTTGTAAGTTGCCTGTACCTACATATCGCATCGGTACATCACAAAAAGCAAAACGCTGCATCAACATTCAGGATCTTGCGGAATATATTGACAAAAGACGGGAAGAAGGGCGAGCTGAGTGGGAAAAAGTCAGAACGGAAAAAAAATATAACTAAACTAAAACTATGGATAACCCGTATATGTACGGGTTATTTTTCTTTATCACTATCTTTTCTTGATTTGAACAATCCACTAACAACGAAACCAACCAAACCAACAATACTAATTGTACTTGTACCTAGTAATGCAACGATCGCTTCAACTGGAGCCTTTCCTTCATGTGCAATAAGAAACGATGTAAACATTGCGACAACGAATAAGCACCAACACGACATAAACCAAACCGTGAATGATGCCATTTTTGTCCGGAGCTCATTGTCTATTTCTTTACCAGTTGCGTCAGCTATCTTATCCCGTACTTGTGATTTGAGCATATCAAGCTGAGCTTGAAGACTGTCCATTCTGTTCTGCTGCATAAACTCATGCAATGCACCAGTATTAGAACCAAACTCTTCTTCCTCCAGAATAGCCTTATTTTCTGAAGAAGAATCATCATCGCGTTCATGATTAGACGGCTCAAATGCGGATTCAAAAGCCTGCTCTTGACTGTCAGTAGAGTTTAAGGATGCTTCAGAGCGACCATTTTCAACACCTGCGGCCGCTCCGATCAGTTTATAGATATCTGAATTATGAGACATGTCATCCCTGAATATTACTTTTTCAGAGGCCCTGACATTGCTGTCGGTTATTCAATAAATCATGATAATAAGCCTTGATCGCATCATTTGAGATGATCGACGAGCCAATACCATTATAAGCTTGTGACCAAGGCGTACCTGGCATATGAGTTAGAGTTGATAACTCAATTCCATTTTTCGAGCCGTAAAACTTATAAACAGCCCCGATAATGCTCTCTGCTTGCGGATCCATAGTAACGATGCCACCAAAAGGAGCTACTGCTACATTCGTAACAGGTTTATTCCCATAGTCTTTGAAAGCATCGTACATTCCAGGAATAACTGGACCGTACTTCCACGCGGAGACACATTCATTGAGCAAAGGCTTACCTG
TTAATGCTAAATAGTAACCATGGGCAATATAAGTAAGCTTCTGCAGTTGCATGTGGGTCAGAGGATTATGATGTTGGTTTCCCAACGTTATGAATTTATTGGCTATTTGTACCGGACTGTACAT
Protein sequences of DBSCAN-SWA_6 >NZ_CP040886|1858480:1878526|1869672_1870326_-|WP_000066917.1|DBSCAN-SWA MSNKYCQALVELRNKPAHELKEVGDQWRTPDNIFWGINTLFGPFVLDLFTDGDNAKCAAYYTAEDNALAHDWSERLAELKGAAFGNPPYSRASQHEGQYITGMRYIMKHASAMRDKGGRYVFLIKAATSEVWWPEDADHIAFIRGRIGFELPAWFIPKDEKQVPTGAFFAGAIAVFDKTWKGPAISYIGRDELEACGEAFLAQVRQQAEKLVREMAA >NZ_CP040886|1858480:1878526|1862479_1862695_-|WP_000839596.1|lysis|DBSCAN-SWA MKSMDKLTTGVAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGSLVFGLLTYLTNLYFKIKEDKRKAARGE >NZ_CP040886|1858480:1878526|1868146_1868944_-|WP_001061404.1|DBSCAN-SWA MNNLMVIDGIEVRRDAYGRYSLNDLHRAAGSLDKHKPAFWLRNEQTERLISELQICNSVNIEPVNVIRGGNNQGTYVCKELVYAYAMWISPSFHLKVIRTFDMVTSAPEKLSGQAADKMQAGVILLDFMRRELNLSNSSVLGACQKLQEAVGLPNLAPRYAIDAPADAPDGSSRPTLSLSALLKQYGIRLTANQAYHQMVKLGIVEQRERYSRTAINNIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESRFPELLKLLDTVH >NZ_CP040886|1858480:1878526|1859638_1859887_+|WP_001217553.1|DBSCAN-SWA MRIEICIAKEKMTKMPTGAVDALKEELTRRISKRYDDVEVIVKATSNDGLSVTRTADKDSAKTFVQETLKDTWESADEWFVH >NZ_CP040886|1858480:1878526|1867149_1868139_-|WP_001360050.1|DBSCAN-SWA MRALLTPEIAPRMGIVLFRPGSELMPLFMQGRVLLEPEPERYSSFASGAVPAASQPLADDPAVRAVFRNEAVIRRAGGVECLESWLLREKGCQWPHSDWHSENMTTMRHAPGAIRLCWHCDNQLRDQFTERLESMATDNCARWVLSVVRRDLGFDDSHVVTMPELCWWLIRNDLADALPESAARKALRLPKPVVPSVTRESDLVPSVPATSIIQDKAKKVLALKVDPESPESFMLRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKAHDLFVLPLCRKHHDELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >NZ_CP040886|1858480:1878526|1869349_1869676_-|WP_032235543.1|DBSCAN-SWA MTTLTQCQQQVLDILISYQQERGFLPTNQEVATMLGYRSVNAAVEHLRALEKKGVITIKRGVARGITLHTAVKDDDSEAVGIIRALLAGEENARLRAAHWLHERGLKV >NZ_CP040886|1858480:1878526|1874920_1875283_+|WP_000135682.1|DBSCAN-SWA MASERSTDVQAFIGELDGGVFETKIGAVLSEVASGVMNTKTKGKVSLNLEIEPFDENRVKIKHKLSYVRPTNRGKISEEDTTETPMYVNRGGRLTILQEDQGQLLTLAGEPDGKLRAAGR >NZ_CP040886|1858480:1878526|1874705_1874906_+|WP_071587686.1|DBSCAN-SWA MTTPPQQPLLWRYQFVHLLPAGTALFTKQRRASPDDGLITQSIRAAATAGVLLCFVEKPTDLAGSI >NZ_CP040886|1858480:1878526|1862762_1863815_-|WP_000799656.1|DBSCAN-SWA 
MKNTVKINSVDLINADCLHFIQSLPDDSIDLIVTDPPYFKVKPNGWDNQWKGDEDYLKWLDHCLAQFWRVLKPAGSLYLFCGHRLASDIEIMMRERFNVLNHIIWAKPSGRWNGCNKESLRAYFPATERVLFAEHYQGPYRGKSDGYAAKERELKQHIMAPLISYFRDARAELGITAKQIAEATGKKNMVSHWFGASQWQLPNEADYRKLQALFSRIAAEKFQEQQLEQPHHQLVASYDSLNRKYSELLDEFKSLRRYFSVSVSVPYTDVWMHKPVQFYPGKHPCEKPADMLRQIINASSRPGDLVADFFMGSGSTIKAAMALGRRALGVELESERFNQTVKEINELVGK >NZ_CP040886|1858480:1878526|1876359_1877142_+|WP_000610754.1|DBSCAN-SWA MINSEAIEQLMWLWSLFDIKFLSILAAAFTIYFGVQKISKKVTVSYSANASRIYDMHISTIILNNKRDNAIAISSINMEVEGKGILQVIKFDSPLLLKNYDSLKVEPPKFSSLYNNDGVVKLDIYDKFHFYIITTSGDEIKCISENKYVAPNMENKIATDIRKFNGIVLTNRMSYIFFYANDNREKYCIIDVSLFINGDNPFHFNFLKEDELRDFSSILISYGYHQQFKSYALFKIDNHLAPSLVLNKSMIENNIIEMNK >NZ_CP040886|1858480:1878526|1878052_1878526_-|WP_000287252.1|DBSCAN-SWA MYSPVQIANKFITLGNQHHNPLTHMQLQKLTYIAHGYYLALTGKPLLNECVSAWKYGPVIPGMYDAFKDYGNKPVTNVAVAPFGGIVTMDPQAESIIGAVYKFYGSKNGIELSTLTHMPGTPWSQAYNGIGSSIISNDAIKAYYHDLLNNRQQCQGL >NZ_CP040886|1858480:1878526|1875348_1876173_+|WP_001753751.1|DBSCAN-SWA MSQNLDATAINQIHALISAQGVNEIISKIGADAVALPENFRIHDLEKFNLNRFRFRGALSTASIDDFTRYSKVLADEGTRCFIDADNMRAVSVLNLGTIDEPGHADNTATLKLKKTAPFSALLSVNGERNSQKSLAEWIEDWADYLVGFDANGDAIQATKAAAAIRKITIEANQTADFEDNDFSGKRSLMESVEAKTKDIMPVAFEFKCVPFEGLKERPFKLRLSIITGDRPVLVLRIIQLEAVQEEMANEFRDLLVEKFKDSKVETFIGTFTA >NZ_CP040886|1858480:1878526|1868963_1869353_-|WP_000767133.1|DBSCAN-SWA MKLILPFPPSVNTYWRHPNKGAFAGKSLISEAGRKFQSAACAAIVEQLRRLPKPTSAPASVEIVLFPPDNRIRDLDNYNKALFDALTHAGVWEDDRQVKRMLVEWGPVIPKGKVEITISKYEKPAGAAA >NZ_CP040886|1858480:1878526|1872693_1873245_-|WP_000515860.1|DBSCAN-SWA MGKHHWKVEKQPEWYVKAVRKTIAALPGGYAEAAEWLDVTENALFNRLRADGDQIFPLGWAMILQRAAGTHYIADAVAQSAGGVFVSLPEIEEVENADINQRLLEVIEQIGSYSKQIRSAIEDGVVEPHEQTAINDELYLSISKLQEHAALVYKIFCAPEKSDARECAAPGVVAFCVCGETNA >NZ_CP040886|1858480:1878526|1864405_1865572_+|WP_046657263.1|DBSCAN-SWA 
MELTKKMSKASIRHVIVHELLKESNKDFDHSKPYNLRDTELDKTNDIVKKLVDGVIDLYGSKGNSAHYGVFIKNKTKQGPIPELFHKYSLVQQSVSSDFIELSKEVMKQMYKSAQEQIWASGGYVVFTDYILSGFRYLLVTMIKKTNGVTISENLEPEEMIHLELGNINQAAKINFRYYEEYQKADDLKKTDLSYLSFISKTTGQSAAAYFIAALGCDKGIASAGATRKLPDEIRRFFKKEPLLKNQAESFRNDVIKYLEKQFDNEHSARLSDIESLASGHMSYLKEEEKTELVDKLMKHLNSEEVRIPSEFVINKNSLDKISNVIYKTPSLSFHFDKDLLGVTTDAKIYYDDENQSLTFNNLPVEALTKIRRALKETDNPSNEEDKE >NZ_CP040886|1858480:1878526|1865568_1866795_+|WP_046657265.1|DBSCAN-SWA MNDFSIIVNLYRLSSYPHFDGAKFSARIAYNADVKSLFKRILNPTFQAGTADEIEVDGHLIYDYEDFPEKGNFLTYSFKISQGSANRFYKNKNEFVKINTLKKGIMPEYFYIIEDDFYSLETPKPSYIQKIEDICELINALSMLAHFHDIKKDSKGTFYRLVFILNSESKSSSAVIETNITEEIFNDKTVNTQLVKTLVSSEATTDAHHIEKINTFRNTVIEYVNKNGNSFVELINKWDFICELYTNNLAAYMSAFSFHKARKEVVDAELDYSEKLSKIISEISNKALAIPISLAGSIAIFKLTTKADWIIALIGLIITAIITSAMIVSQKKQLARISHSKEILFGQLRYRIKDDTSDLKESLEEAIKKLNDNEDFCHKVLDSLLSLAWMPTFIGIIGILFKLMPNIT >NZ_CP040886|1858480:1878526|1873579_1874254_+|WP_000859462.1|DBSCAN-SWA MKTVHDIRRSNARKLRDGVGGNSSFATMIDREPTQTSRFMGDGATKNIGDSMARHIEKCFDLPVGWLDQEHQTTNITKKPDVSITNKQITLVPVISWVQAGAWKEVGYSEVDLSTAETYPCPVPCGEMTYILRVIGDSMIDEYRPGDMIFVDPEVPACHGDDVIALMHDTGETTFKRLIEDGTQRYLKALNPNWPEPYIKINGNCSIIGTVIFSGKPRRYKIKA >NZ_CP040886|1858480:1878526|1858480_1859578_+|WP_000332259.1|integrase|DBSCAN-SWA MAYYNIEKRLKSDGTPRYRCNVIIKEKGVITYRESKTFPKHAHAKTWGAQKVMELDLYGIPSSNAVDGLTVRDLLHKYLNDPNAGGKAGRTKRYVLELLMDSDISAIKLSELTENDVIEHCRLRNNAGAGPATVSHDVSYLGSVLDAAKPVYGINYTSNPAKSARPYLLKLGLIGKSNRRNRRPASDELNMLIEGLQQRSTHKCSKIPFVDILKFSVWSCMRIGEVCRLRWEDLDQEQKSILVRDRKDPRKKEGNHMKVALLGEAWDIVQRQPKKSEFIFPYNSTSVTAGFQRVRSKLGIKDLRYHDLRREGASRLFEAGFSIEEVAQVTGHRSLNVLWQVYTELYPKSLHNRFEELQRSRNKTS >NZ_CP040886|1858480:1878526|1877178_1877448_+|WP_001093912.1|DBSCAN-SWA MNTLFLLMAEFNTPNIELSAVSQKYFGMSPATAEAKANACKLPVPTYRIGTSQKAKRCINIQDLAEYIDKRREEGRAEWEKVRTEKKYN >NZ_CP040886|1858480:1878526|1873288_1873489_-|WP_000649477.1|DBSCAN-SWA MKAYWDSLTKEQQGELAGKVGSTPGYLRLVFNGYKKASFVLAKKLEQCTSGAITKSDLRPDIYPKD >NZ_CP040886|1858480:1878526|1866787_1867132_-|WP_016159280.1|DBSCAN-SWA 
MRDMYEVLDRWGAWAAADNSGVDWQPIAAGFKGLLPHGKKIRLQCDDDEGIMIDGCVARLCKYKPDEYELIIAHFVIGISLRSIAKRRRCSDGTIRKQLQTALGFINGVMYMLK >NZ_CP040886|1858480:1878526|1871631_1871856_-|WP_001446924.1|DBSCAN-SWA MILNICRLLQRKKTSIPTVGQWYTTPAGHVLRVSLVDRECQKVICEPLGRNYRVSMPLIAFRSGKNMKHLGGAA >NZ_CP040886|1858480:1878526|1870325_1870820_-|WP_072165319.1|DBSCAN-SWA MIMSLLNEVQKYIEAHPGCTSGDIADAFAGYSRQRVLQSASKLRQSGRVAHRCEGDTRRHFPRLTERAQEAEPQPVRETRPVRNFYVGTNDPRVILCLTRQAEELESRGLYRRAATVWMAAFRESHSQPERNNFLARRERCLRKSSKRAVPGDEWYLSGNYVGA >NZ_CP040886|1858480:1878526|1860638_1862009_+|WP_001678535.1|DBSCAN-SWA MTASRIFKKSFSKKNLLKVYSEKIKESGAIGIDRIRPSKLDLTIKNEITFIFEKVNSGNYKFTAYKEKLISKGANSTPRQISIPTARDRITLRALCECLTEIYPKSRLKLPHTVIDSLKEALNNSLYAEYAKIDLKSFYPSIEHKLIINAIKNKIRKKEIRQLITSSLIVPTVSGTTGSKGIPNNTRGVPQGLAISNILAEISLSNFDDEINKMHDIWYMRYVDDILILTPKYQATKIASHIIDKLQSLNLNPHPLNEENSKSKVGSLDESFNFLGYHIENRELLIKHESILRFESSLAKIFTAYRHALLQAKSKRDKERAVAYCQWKLNLRITGCVFEGKRLGWVSYFSQITSTAQLRSVNHTINNLIRRFGLSSEIKPKSLIKTFYELRRGRAETFKYIPNFDNLHISQKRELVSMWIGKEKEKKLSNSEIERKFKFKIAKSVKELEEDISGIS >NZ_CP040886|1858480:1878526|1871860_1872697_-|WP_032181493.1|DBSCAN-SWA MNSLTANNRLSQQLVVSVAEHLLLRHECRLPNHLAVSNHRELYLTVGGELCRNLTAGFVTEEGFMSMLFVGSQKHSALSIFAKTTRMSALVLCGNSGVILLSVKDHQHIDSAIPGRYTVQAPYKAGAGIGVLEMLSAIYDAPASFLSSASVHTQIMVGWTGAPKGAPVSDNAGYANPVQFTTSEIGVSGGEGNSLLSEAAIMATVPALTRLNDEDLHKLSYVTTALRALRKVTLSDPQAHQVLVETLLNLQAERIRLADKANFHIHRLLNISGGHRHA >NZ_CP040886|1858480:1878526|1870816_1871635_-|WP_021527492.1|DBSCAN-SWA MSMELMVKAMKIRVGNPLRKLVLIKLADNASDQGECWPSYQHIADQCEISKRSVMNHIAALCESGLVKKVTRKGEKGNSSNIYLLHLDGAGDSLGGSANNSLSGAANSPGSAGVAPGGSAGDSPRTSHSFEPVKEPVNEPIAVGASVDESVRVRSNRPEYSPEFEQAWLVYPKRAGGNSKSAAFKAWKARLNEGVNPETMLEGVKRYAGWVSAMGNSGTQFVKQAVTFFGPDRHFEESWEVPAVSAARREDPYFKASYDNVDYSQIPAGFRG >NZ_CP040886|1858480:1878526|1863964_1864159_-|WP_001355891.1|DBSCAN-SWA MLKQQDMTETAKVVFNELSIEPATVGEIAQNTYLSRERCQLILTQLVMAGLADYQFGCYRRLQQ >NZ_CP040886|1858480:1878526|1877481_1878030_-|WP_000019186.1|DBSCAN-SWA 
MSHNSDIYKLIGAAAGVENGRSEASLNSTDSQEQAFESAFEPSNHERDDDSSSENKAILEEEEFGSNTGALHEFMQQNRMDSLQAQLDMLKSQVRDKIADATGKEIDNELRTKMASFTVWFMSCWCLFVVAMFTSFLIAHEGKAPVEAIVALLGTSTISIVGLVGFVVSGLFKSRKDSDKEK >NZ_CP040886|1858480:1878526|1860109_1860661_+|WP_000543834.1|DBSCAN-SWA MHDNIWFTYKARIQAHHRLEWLEKHSQFILVWYAILSAVLSIVTLRFPKVLGDNTDVVAAILSVALLGISLIVSNLDFRGRAIAMRRNYIALQRLYFDITTSQQLSLEQKEKYFNLLNEVENHRDIDDKISRVTQVGLKTRIPTQKEKIIVILWILLRIFITAALYILPLIYLWIDYDCKQNF |
29 | Shigella_phage(37.5%) | lysis,integrase | attL 1849745:1849758|attR 1865092:1865105 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_7 |
2328224 : 2364590
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP040886|2328224:2364590|DBSCAN-SWA TATGCCAACAGTTCCAATTTCTATGAGAAAACTTAAAGAAATTCTTAGGCTTAAATACGGTGTTGGACTCAGCCATCGACAAATTGGTCGTAGTCTTGCAATCTCCCCTTCCGTTGTATCCAGATATGCTAATCGGGCGGCTCAACTTGGCATAAAGCAGTGGCCCTTACCTACAGGATGGGATGATACAAAACTAAAACATGCGTTCCTTCAGACCCAGGTTAAGATGAAGAAGCACTCTCTGCCTGACTGGGCTACAGTACACCGGGAACTGCGTAATAAATGCGTGACGCTGCAGCTACTCTGGGAAGAATACTGTGAGCGTAATCCAGGCGGTTTTTACAGCTATAACCATTACTGCCGGATGTACCGTGAATGGCTCAAAACCACTTCACCATCAATGCGTCAGGTACATAAAGCTGGCGAAAAACTTTTCGTTGATTACTGTGGACCTACCGTTGGCGTTACCGACCCTGAGACCGGAGAAATAAGAACTGCTCAGGTCATCGTAGCTGTTCTCGGGGCATCAAGTTACACATGGGCAGAGGCCACCTGGTCTCAGCAGCTTGAAGACTGGGTGATGAGTCATGTTCGCTGCTTCCAGTGGTTGGGTGGCGTTCCTGAACTTGTTGTTCCGGACAATCTGAAAAGCGCCACATCCAGGGCATGTAAGTATGATCCTGACGTTAACCCTACCTACCAGCAGATGCTTGAGCATTATAATGTCGCAGTTTTGCCTGCGCGGCCACGTAAACCGAAAGATAAAGCCAAAGCTGAAGTTGGCGTTCAGGTTGTTGAACGCTGGATCATGGCCCGAATCAGGCATGAGATCTTCTACAGCCTTGCATCGCTTAATCAGCGCATTCGGGAGTTGCTGGAAAGACTGAATAACAAAATAATGCAGAAGTTGGGTTATTCACGTGCAGAACTCTTCATCCAGCTTGATAAACCCGCACTGAAGCCTCTTCCTGAAGCCAGTTACAGTTACACCCTGGTGAAGAAAGTCAGAGTTCATGCCGATTACCACGTGGAAATCGACAAACATTACTACTCGGTTCCATGTTCGCTGTTAGGCCAGCAACTGGAAGCATGGATCTCCGGAGAACTGGTAAGACTCTTCAATCAGGGGCAGGAGGTTGCTGTGCACCCGCGCAAGCGTACTTATGGCTACAGTACCCGCAACGAGCACATGCCTGAAGCTCATCGACAGCATGCCACCTGGACGCCAGAGCGTCTTCTGGAATGGGCGGGGCACATAGGCAGTGAAACTCATAGTTATGTGCTTCATATACTGAACTCTCGTCCACATCCGGAACAAAGCTATCGCTTCTGCCTTGGACTCCTGAACCTTCATAAAAAATACAGTAAAGCCAGACTTAATGCAGCATGTGCAAGAGCTCTGAAAACAAAGGTATGGCGTCTGTCAGGTATTAAATCGATCCTGGAAAAAGGTCTGGATAAACAACCTGTTCAGGATCCAAAACCAGATCTGTTATCCACGATGGAACACGAAAACGTACGCGGCAGTGAGTATTACCACTGATACGGGATCCAATGATGAATCATCTTTACGAACAACTGACCGCACTTAAACTCACCGGCTTCCGTGATGCGCTTAAAAAGCAACTTGCTCAGCCGGGCACATACCAGGAGCTGGGCTTCGAAGAACGCCTGTCATTACTGACAGCAGAAGAACTAACCTGCCGTGAAAACAGGAAGGCAGAGCGTCTGATCAAACATGCACGGTTCAGACTTAATGCTGAGTTATCAAAGCTGGATTATCGTAACAATAGAGGGCTGGACAGGGCCCTCATCCGTTCACTCAGTCAGGGAAACTGGTTAACCCTGAAACAAAATATTTTACTGACCGGGGCCACCGGCAGCGGTAAAACGTTCCTGGCATGTGCACTTGGTCATAATGCC
TGCCGACAGGGATACAAGGTCTACTATTATCGCCTTAAAGCGCTGATGGAACAGTGCTATCAGGGGCATGCTGATGGAAGATACAGCAAACTTTTGACCAGTCTGAATAATAGCGATCTGCTGCTTCTGGATGACTGGGGGCTGGAACCTCTCTCATCAGAACAGCGTAGCGACCTGCTGGAAATAGTGGATCTGATGTACCAACGAGGCTCAATCATCGTAGTGAGCCAGTTGCCGGTGGAAAACTGGTACAAAATGATCGGAGACTCCACACATGCGGATGCCATCCTAGATCGACTGGTTCATGGCAGTATCAAGATCGAACTTAAAGGAGAATCAATGCGGAAAATACAATCTCCGTTGACCGAAGGAGATCAGTGAAGGTAATTTAAAAACGGTTCTGTGAAAGTGACACGAACCGATCTCCATCGATGTTACTCACCGATCTCCTTCACGGTAATACGCAACCCACAATGCGCGATTGAGCTTAATGTCGGTGTCGATGCTGTGAATGGCACGGGTGTGGATACGTTTTCCTCTGGCACTGCGACCGGAAATTCCGCCTTTCAGCATATTCTCCTGGATGGTCTGATAAGTACTCCACAGGTCCTTACCGTAATCCTCCCGGCGTCGTGGCGTCAGAATGTCGGCGGTGGTGACGGGCCGATGTTCGTCACCATAACGGTAAGTCAGTGCCGCCTGTGCCAGCGCCTGGCGTGCCGGTGGCGGCAGGACCAGCGACTGCATGGCATCACGCTTTTCCTCAATCCGGTCAAAAACCCCCACCACTTCGTAAGCCCCTTCGATAACTTTCTCCACCACATTTCCCCGGTGTGGAACACGCACTTCCCCCAGAGACTGACCGCAGACGCAGCCATTCTGGCAGACGAATCTGAAGTAACCCGGCAGCATCTGGTAGCTGGAGGTACCGTCATGAGAGTTGAGCAGAATAATTTCAGGGACATGTTCTCCGTTTATCTCTCCGGCCCGCCGCAGACGCAGCATGTGTTTTGTGTATCCCCGGCGGCCCGGGTCGCGCACGCGGGTCTGGCAGGCGAAGAATGGCTGAAAACCTTCCTGCTGCAGGCTTTCCAGTACGGTGATGGTGGGAATGTACGCATATCGTTTACTGCGGGAAGTATGCCGGTCTTCACCGAAAATACTGGGAACATAGTGCATCAGTTCTTCGTGTGTCAGCGGACGGTCACGGCGTATCTGGTTCGCATAACCAAAACGACTGGCTAGTCGCATAATTTGCTCCTTATCGGTGGTTACGATTTACTGGTGTAATAAATGAAAAAGCCACGTCTCCCGGAGAAGACGCGGCCTGACAGATGAAATGAATGACGTTTATTGTCTGAGAAGTCCTTAACTGGCGAGCTGAGTATTAAGCTGTGTTCCGGCATCACCAGCGCAACTGACCTTCAGCATTACGGATAACCAGCCGGGAATATGTTCCCTGGTCATCTTCAGTAAACACATTGCGGTAAGCTGTTTTGACGGCAACAGCCTGTTCGCGTGAGAAAGGGCCTTCGGGCAATAAACGTGATGTACAACCCGGCATATCTGTTGCTCCCTGAAAGTAAAAAGCCCCGGTCATGATGACCGGGGCCTGAAGGAGAGTGACCTGATTATCAGAAAGTCACATTCAGCGTGGCCTGACCGTTATAACCTTCAGCGCTGCTGCCGCTGACGCTGTGGGCATAACCGCCCTGAACGCCCAGGGTGATATTTTCCCGGACACGGGCTTCCAGTCCGGCCTGCAGGTCCAGTGACGTGCCATTCCGGGACGGTGAGAACGTCATGTTACTGCCGGCGGCGGCTGTACCCATGCTCATGTCACCCCGGGAGCTGAAGGTGCGGATAACAGAAGGCTGTACCCACCAGTTCACCGGCAGTTCACGCACACGGTGTTTTGCACTGTCGCGCAGGGTGTCACGGGATGAGGTGCCTTCACCAAAGCTCATATCGTTGTGGCTGCCCAGACGGAAACCGGCACG
CACATGTTGTGCACTGCCATGCCCAAACTTCACATAACCGGCGTTATCCTGGCCGTCATCCAGGGAAAGTCCCTGCCAGGTATACTGCAGTTGTGGCTCCAGCATCAGGTTGTCAGTGATACTGAAGGGCAGACCGGTTTCCAGTGAGCCCAGCCAGCCCCAGCCCCGGGCGCGGAAGTCGTTATTGTCCGATGACGCTTTCATGCTGTGGCGGGTTCCCTGTGCCACAATGTCAGCCCACAGGCCGGAGGACGTGTGTGTCAGATTCAGGTATCCGCCCAGGCTGCCGGCATCATCCCGGACCGTGCCGGCGCGGGAACCGTCATCATCCTTAACATCAACGGAAGAATGGCCTGCGGCACCATACACCCCTGTCGTCAGAGACATACCGGCAACCTCTGTTCTCAGCAGGTCACCCTCCAGACGGACGAAGCCATAGCTGCCGCTGCTTTCCGGCGTGGCTCCACGGGCAATACCGCCGTTGTTATCGTGACCGAGATGACCGCCCTGAATGCTGAGACGGACGCTGTTATTTTCACCGTTTACACCGGTCTGATGGCTGCGGGAGCCTGCCAGAATCCGGTCATAGTCCATTGCCTGTGTCAGCATGGATGTATACAGGGGGACTTCAGCACGATAAGCATTTTCACTGCGCAGGTACCAGTCTTCATCGCTGTCACGGTTCAGGGTGTAGTTAAAGGCGCCGGCCTGAAGCGGGCGACTCAGGGCAAACGCACCTTCTTCTGTGGTAGCGCCATTCTGTGCATCCACAACCCGGATACCCTGTCCGGTGGTTGCCACCCCGAGGTTACTGTTTCCGACATTTGTAAACGCAAGCCAGGTTTTGCCGGTTGCCTGACCACCATTAATCACCAGCTGGTCAGAGGCATTGCTGCCATCAAGGCGAACACGCATATTGATAGTGCCACCCTGACCGGTGAGGTTACTGGTTGTCAGTTTATGGAACGTTACCGGGGCGTTGCCTTTTGCAACAGCACGGCTCAGTACGGCATCCGGTGTTGTCTGATTGTCAAAGTAAATCTCCCCTGCGTTAACAATATCGCCGGTACTGACCACATCACCATTCAGTACCATGCTGGCGCCTTTACTGAGCTGTGTTCTGCCACTCAGACTTCCCCCGGCAGCCAGCATCAGTGTTCCACGATGTCCGACAGTGGTGTTGCTGGCCTGTCCCCCGGCATTAACCGTAAAGCTGCCGCCATTTTCCAGCAGCAGGCCGCTGGCCTGACCGGATATGCCATCAATACTGAACTTACCGCTGGCATTGGTCCCCTCAACAGTGGCACCACTGTCTGCAATCAGGGCACCACCGGATGTCATGGTGACACCTGTTGCCTTACCACCGGCAGACACTGCCAGGGTTCCGCCGTCATCCACCCGTGTTTTCTGCGCTGAATGGCCTTCCAGTACATCCAGGCGACCGCCGGACTCCAGAACAACGCCGTCAGCCTTACCGTTTTCCACCGTGAAATTGCCCAGACGGTTGCTGCCAAGAACCGTTGCCGCAGTACTGGTGACCAGTGCTCCACCCTGTTTCAGGGTGACATTCGTGGCTGTACCACCGGCGTTCACCTGCAGTTTGCCTTTCTGGTTAACGGTGGTGAAATCCGCCAGACCACCTTCCTTGACAATCTGCCAGCCGTCACTGTTTACAACCGTGTCAGAGGCTGTGCCTCCGTTGTGCACATACTGGTAACCGCCATTCAGTGTGGTATCCAGCGCATGCCCGTGTACCGTCTGGTCGCCGCCGGCATAAACCACAGTGGTATTTGCTGTTCCTTCAACAGCCACAATCTGACGACCATTTTTATTGATGGTGGTACGTGCGGCATTTCCGCGAACAAACTGCCCGGTATCACCATTTTCTGCATCCGGCCCCCCTTCCGCGCCGGTATTCACAACCGTGTCGGTTGCCACTGCGCCGGATTTGACGGCCTGCCAGCCCTTCTCATTAATGACGGTACCCGTTGCAATCCCGC
CTTCATGTACCCACTGCTCACCGCCGTTCAGAGTGGTATTCACTGCCTGCCCCTGAAGGCTCTGTCCGCCTCCGGCACTGATAACCGTGTCTGAAACGCTTCCTCCGGCATTCACTCTCTGAAGACCACCACCGGTGACAGTAGTGTTGTTGGCGATACCGCCATTTTGTATCCATTGTCCGCCGGTATTGGCCTCGTTATCCGGCCCATACTCCAGACCGGTACTGATGGTCATTCCGTTGGCCGTACCGAGAACAATCTGGTTGTCATGATTTGTCAGTGTTCCGCCGCTCACGGTTTCTACCGCCTGTACAACCGTGTCAGCAGCCAGTGCCGGGACTGATGTGACAGCAGCAAGAGACAGCGCAATCGCCACACCGGCGCGTTTTCCCCGTGAGCGGGCCAGTTCGGAGGCCACCACCAGGGTACCCGTAATGTGATTCCATACCAGCCTGTAGCTGGTGTTCAGATGTCGTTTCATCAGCTTTTCCTTACAGAGGGTGAATAAAAAAGCCGGTACCCACAAATGTGGATACCGGCAGGTACAAAACAGCATGGTCAGATAAAGTGGTTTTTCCCGGGCATAAATGCTTCCCCTCTCTCATCCGCCAGAGAAGTATCAGTCCGTTGTGTGCAATTCCGTTCTGCATGATGAAGAGATTCAGCTGCCTTTTCGCAGACCATTAACCACTATAAACGATCGATAGCAACGATCGATACCTGTTTTATCGATCGTTTTATTCTATTAACGACAAGGCGGTCACGCAAGAGAAAGAGGTGGATATGAGGGGGGTGTATCATGTTTTCAGTTGGTTAACGCAATGTAATCAGGCATTTCCCCTGGCAGACTGTTCAGAAGAAAATCCAGTTCCATACAGCGCGGGCAACAGAGACCACCGAGTCACGGACGGCACGCAGAACCGTGCGTGCAACAGAGGCAACACAGACGCTCTCCGCCGTGTCAAATATCCGGTCCACCGCACCGGTAAACTGTTCACGGGCCTGAGCGCGGACAGATTCCGTGCGCAGCTCGTCCTGCAGGCGGGTCATCAGGGGACTGGCGGCATGGTCGGGAAGCGCTGTCATGAGCGCACTGACCAGCGTATCCAGTTCCCAGCCGGTGCGGGCCGATACGGCAACAACCGGATGTACGGGCCGGAACAGACGGAATACCGCCTCCGTTTTCTCCCGAATGTTCTGTGCCTGTGCGGGAGAAGGCTGAATGCCGGCCATATCCCATTCATGGCAGGGCTCCGTTTTGTCGGCCTGCGTCACCACAAACAGCACCTGCTGATGTCCCCGGTGCAGGATGTGTCGCCAGAAATACTCATCCACAGACAGGGCACGGTCATCGGCTTTAATCAGCCACAGTACCAGGTCAAGTTCAGGCAGAATGTCACGGTACAGGGCTTCATACTCTGCATCCCTGTCCCGGCTCTCGCCCACCCCGGGCAGGTCAGTGATAACCATGCTGTGACCATGGCCACTCAGACGGAAGCGCCGTACTTCCCGGGTGCCGGCGTGAACATCACTGACCGGGGTGACCTCCCCCTGAAACAGTGCATTACAGAGTGAGGATTTACCGGCCCCGCTTTTACCCATAATGCCAATCACGGGTTCGTGACTGGTGAGTTTGCGCAGATGTTCCAGGATGTGACGGGAAAGTGAGTAAGGCAGAGAGGAGAGGGGTTTTTCAATTGCCTCAATGGCATCAGACGGATTCATACATGTATTCCCGGTGAAAAAAGACAAAACCCCGCACACCGGAAAGGTGGCGGGGGACTCACGGAGATAATATCTGTCTGAAATTATTTTGAATGGTTTTATTTTTTAATTTTGCTACGGTAAATCATGTCCAGAGCACTTTCAATAACCCGGTCATTTGAGATACCCTCGTTTATGGATAATTTTCTCAGTTTGTCATATGCCTCCTGTGAAATATTTACGTTCAGGGGGCGCAGCTTTTTATCAACCCTATTTTTTATTCTGTTTTTTTGGGT
AGCCCAGGCGCTTTTAAACTTCCGGGTAAACAGTTCTACTGAGTCTGGTGAATTCTGGTTGTTTTTTTTCCATGTGAAGTAACTGGCATAACACCAGGCCATAATCTCTTCAGATGATTCTCCACAGACATAGTTCAGGGAAATGTTCGTCTTTTCCTGAATATATTTTTTGAGCCATGAGCAGATATCAAAATTTCTGGCCTTAACCTCTCTGAAAATATCATCTTTTTCGAGTAGTGTGGTCCATTCAATACTTTTCTGGTTCAGGTAATTTGATGGTGTGTATAAAATGAAAGGCCAGTTGTCAATACACCTGCGTATGGCATCCACCCTCACGCTGTGGTGAGAAGGCAGTGACTCCGGAGATAAACAGGTAAGGGTATTTTCATAACCGTTTATCTTAATCATTTCATATAATTCACAAGTCAACCATAAACTGGCGCGGGCATCATTACGATACCAGTTAAGATCATCCTCAGATGGTAAGGAATCGAAGTAACGCTGGTCTGTTTCAGCCAGAAAATTATTTATTTTCAGTGTGTCAGCACCCTGTGAAATAAGAGTGTTTAAAATCCGTTTCTGATACGAGAAATAATCTTTATCGGCCAGGCTTGATGGTAAATAAAACCCATTTTCATTTAAGCGCCTTTCAAGATATCGTATTCTGCTTCTTTCATCTTCCACAACCTTAACCATAGTATCTCTCCTTCAGTCAGGATAATCACACGTGATTTTAAGTAAAATCAAAGCATCTTTAGCATGATTTTCTCATAAATTTCAGTTCATTTCACGTGTATAAATTAAGGGGCGGGAGATACCTCATGTTATCTTTTTGTATCTGCTGACTGGGAGGGGGTTTTATAAAAGTACATTCTTTGCCAGGACGGTTGAAGTGCATTCACAGGAGTGAACCGCTCTTTTTATTTTCCGGGAAGAGCGCTTTCTCCTCTCATCCTGTTATTATTAATTTACTTTCGTTTTCGCCTTCGGGCAACTCTCCCCTTTGCCTTAAACAACACAGTTAATCCCGCGTCAGGTTATTAAGGTAAATGTTGATATGAATTTCACCAGAGACGATGAGTTTCATTGTGAGAATCATTGTATCCTGTATTTTGATGCGTCTCTCCATGACTTTAACAGAGGGCTTCATATTTTTATTTACGATAAGTCTGGTTTTACGTTACTGGAGATAAAATGACTAACGGTCATCACTTTCCTCATTAATCATATCTTACAGATATGGTATTGTTCACCAGTCTGTCACGGGTACAGGGAAATCATATGGGCTTCCTTGTCAGAGAAAATGACTGGTCGGAAATTTATTCGGGAATTCATTGTAGTGCCTGCAGACACAGATATATCGTTAACGTAAACTATAATTGACGGACGATACGGGAGTTACAGTGACTCATGAACCATGAATTTCATAAGCCATGATGGCAAGTAAATATTCCTGATTACCGTAAAGTATTTCTTAGATATTACCATACCCTCTGGCGTTACTGATTTACTGATAGATATGGCATCCAGAATTATCATAACATTTCATCCGTTGACATTATTTACCCGGTCTAATAGCTTTTTTAATAAAGCTGACTTTCAGGTGATGCTGCCAACTTACTGATTTAGTGTATGATGGTGTTTTTGAGGTGCTCCAGTGGCTTCTGTTTCTATCAGCTGTCCCTCCTGTTCAGCTACTGACGGGGTGGTGCGTAACGGCAAAAGCACCGCCGGACATCAGCGCTATCTCTGCTCTCACTGCCGTAAAACATGGCAACTGCAGTTCACTTACACCGCTTCTCAACCCGGTACGCACCAGAAAATCATTGATATGGCCATGAATGGCGTTGGATGCCGGGCAACCGCCCGCATTATGGGCGTTGGCCTCAACACGATTTTCCGCCATTTAAAAAACTCAGGCCGCAGTCGGTAACCTCGCGCATACAGCCGGGCAGTGACGTCATCGTCTGCGCGGAAATGGACGAACAGTGGGGATACGT
CGGGGCTAAATCGCGCCAGCGCTGGCTGTTTTACGCGTATGACAGGCTCCGGAAGACGGTTGTTGCGCACGTATTCGGTGAACGCACTCTGGCCACACTGGAGCGTCTTCTAAGCCTGCTGTCGGCCTTTGAGGTCGTGGTATGGATGACGGATGGCTGGCCGCTGTATGAATCCCGCCTGAAGGGAAAGCTGCACGTAATCAGCAAGCGTTACACTCAGCGAATTGAGCGGCATAACCTGAATCTGAGACAACATCTGGCAAGGCTGGGACGGAAGTCACTGTCGTTCTCAAAATCGGTGGAGCTGCATGACAAGGTCATCGGGCATTATCTGAACATAAAACACTATCAGTAAGTTGGAGTCATTACCGACTTTCAGATCACTCTTCCTGAGACTACTGTTTTTCCGGTATCAGTTCACGGTGGTGGCAAAGAAAAAATTTAAAACCCATCAGTCGTTTAATTTACAACAATGTACTCAGGCTGCGTGGTCCGCTGACAGCCAGACCTGATGTGGCTGCTATTTCTGCATGAATTGACGTGTGTACCGTCTGAAGGTAAGCGTACAGCCTGAACCGTCTGGTCAGAATCTGACAAATTAGACAAAGTGGTGTCCACCAAATAAGTAGTGGGAACCAAAGTGTCAGATATGCAGAAAAATGTGACTCCCGGCAGGCGAAAAGGCTGCCCTAATTATCCTCCCGAATTTAAACAGCAGCTCGTTGCTGCCTCCTGTGAACCCGGGATATCCATCTCAAAACTTGCTCTTGAAAATGGCATTAACGCCAATCTGTTGTTCAAATGGCGACAACAATGGCGCGAGGGAAAGCTGCTATTACCTTCTTCAGAGAGCCCCCAGCTACTTCCTGTGACTCTCGATGCAGCTGCCGAACAGCCAGAATCGCTCGCAGAGGACCCGGAAACCCTCAGTATCAGCTGTGAGGTAACGTTCCGGCACGGGACGCTCCGCTTCAATGGCAATGTCAGCGAAAAGCTCCTGACTCTGCTGATACAGGAACTGAAGCGATGATCCCGTTACCTTCCGGGACCAAAATTTGGCTGGTTGCCGGTATCACCGATATGAGAAATGGCTTCAACGGCCTGGCTGCGAAAGTACAAACGGCGCTGAAAGACGATCCCATGTCCGGCCATGTTTTCATTTTCCGGGGCCGCAGCGGCAGTCAGGTTAAACTGCTGTGGTCCACCGGTGACGGACTGTGCCTCCTGACCAAACGGCTGGAGCGTGGGCGCTTCGCCTGGCCGTCAGCCCGTGATGGCAAAGTGTTCCTTACGCAGGCGCAGCTGGCGATGCTGCTGGAAGGTATCGACTGGCGACAGCCCAAGCGGTTGCTGACCTCCCTGACCATGCTGTAAATCTCTTTATCCTGGTTGTCACAGAATAAGCCCGGTAAAATACGGGCTTATGAACGACATCTCTTCTGACGACATCTTCCTGCTGAAACAGCGCCTGGCCGAACAGGAAGCGCTGATCCACGCCCTGCAGGAAAAGCTGAGCAACCGGGAGCGCGAAATAGACCATCTGCAGGCGCAGCTGGATAAACTCCGCCGGATGAACTTCGGCAGTCGTTCCGAAAAAGTCTCCCGCCGTATCGCACAAATGGAAGCCGATCTGAACCGGCTTCAGAAAGAGAGCGATACGCTGACTGGTAGGGTGTATGACCCGGCTGTACAGCGTCCGTTGCGTCAGACCCGCACCCGTAAGCCGTTCCCTGAATCACTACCCCGTGACGAAAAGCGACTGTTGCCTGCGGCGCCGTGCTGCCCGAACTGCGGCGGTTCACTGAGCTATCTGGGCGAGGATACCGCCGAACAGCTGGAGTTGATGCGTAGTGCCTTCCGGGTTATCCGGACGGTACGGGAAAAACATGCCTGTACTCAGTGCGATGCCATCGTGCAGGCACCTGCACCTTCGCGGCCCATCGAGCGGGGTATCGCCGGACCGGGGCTGCTGGCCCGCGTGCTGACCTCGAAGTATGCAG
AGCACACCCCGCTGTATCGCCAGTCAGAAATATACGGCCGGCAAGGTGTGGAGCTGAGCCGTTCACTGCTGTCGGGCTGGGTGGATGCATGCTGCCGGCTGCTGTCTCCGCTGGAAGAGGCGCTTCATGGCTATGTCATGACTGACGGCAAACTCCATGCCGATGATACCCCGGTCCAGGTACTGCTGCCGGGTAATAAGAAGACGAAGACCGGGCGGTTGTGGGCGTATGTTCGTGATGACCGCAATGCCGGGTCAGCGTTGGCACCTGCAGTGTGGTTCGCTTACAGCCCGGACAGAAAAGGCATCCATCCGCAGACTCATCTTGCCTGCTTCAGCGGTGTGCTGCAAGCGGATGCGTACGCCGGGTTCAACGAGCTGTATCGCAATGGTGGGATAACGGAAGCTGCCTGCTGGGCTCATGCCCGCCGAAAGATCCACGATGTGCACGTCCGCATCCCGTCAGCACTGACGGAAGAAGCCCTGGAGCAGATCGGTCAGTTGTACGCCATAGAGGCGGATATAAGGGGAATGCCGGCAGAGCAGCGGCTTGCTGAACGTCAGCGAAAAACGAAACCGCTGTTGAAATCCCTGGAAAGCTGGTTGCGTGAAAAGATGAAAACCCTGTCGCGACACTCAGAACTGGCGAAAGCGTTCGCATACGCCCTGAACCAGTGGCCGGCGCTGACGTACTATGCAGATGATGGCTGGGCTGAGGCGGACAATAACATCGCTGAAAATGCGTTGCGGATGGTCAGTCTGGGCCGCAAAAACTACCTGTTCTTCGGTTCGGATCATGGAGGAGAGCGGGGAGCGCTGCTGTACAGCCTGATCGGGACGTGCAAACTGAACGGAGTGGAGCCAGAAAGCTACCTCCGCTATGTCCTTGACGTCATAGCCGACTGGCCGATAAACCGGGTCGGCGAACTGCTCCCCTGGCGCGTAGCACTGCCGACTGAATAACACATCCCCGTCAATACGGTTCTTGCTGCACGCTTACGTCTGAAGCTCGTTGTCGACTGCTTCACAACACTGATATGTTAGCATGATACGAGTTCCCGGACTGTTACTTCAGGTTTTACAGGGCTTTATTGATTTGATTTTAAGCTGATGGGGTTACTGATATGCTGAGCAGATTACTTTCTGGTTAGCGGGTGGGAAAAGCACGTTAGCGTCAGCATCAGAAGCCAGCTCTCTGCGAAATTCCACATCTGGTTATCGACATTATCACTCTGGGGGTATTCTGGTGAAGGGAAACATAAAGTCTGGCTGCATGGAGTGACAGCCCCGGATAGGGGCGCAAGTTTCATCTGGTAGCAGACAATGTGAAGGACTGGATTATCCGGGACAATTTATTGCTCAACTGAATGAGCAATGGTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTGGTGAGTATTACCTCCTGTATTGTCAACGCCGCTGTATATAACCTTTTTTCACCGTTTTATTTGATTGCCGCTATTTATCAATGGCGATGCCCGGTTTCCACCGATTCATCATCACTTATTGGCGCTGGCCCTTGCCAGTACACCTGCTTTACGTTTTTCCTTCAGGTGGTAGCTTTCCCCCCTTAATATTCAGCGTGGTTGAGTGATGCAGCAGATGCTCAGGGATCACTGTTGCCAGTACATTATCGCCGAGCATTTCCCCCCAGTTGGCGAATCCTTTATTTGACGTCAGAACGATTCTCGCTTTTTCATATTGCCGCTTCAGCAACCGGAAGAACAGCCTGGCTTCTTTCCGGGGCATTGGCAGATAGCCTGTTTCATCAAGGATCAGTACGCGTGCGTAACAGAGTTATTGGAGCTGTCGCTCAAGCCGGTTTTCCTGTTTCGCTTTCATTAGCACCATTTTGTGACGGGTCCTTATGGCTGTCGGTTCAGGGTATGTTTCCTGGTTATTGTGTACCACGAAGCAGGGGTATTGTGTCGATGGTCACATTTCAGCACCATAATGCCCAGAATGCTCCCTG
ATTCCACAGATCACTCAACTCACCGTCCGTTCCTACAACCGGCAATACCCGTAACCTGTGCCACACAGACGGAGCCGGGCCATGTGCAGAGACCGGTAAAATTCGTTACTACAACTGTTCCTGAATATCTGACGAAGAAACAATGATCTCCGGGGACTTCAGCCATAACATCCTGCCGTGTGAGTTTCTTCCGCAGTGCAAGGAGCGCTGTCGCCATGTACTTATTGTCACCAGGATGAGGCATGTACCGCAGCATGCCATTTCTGGTAAATAACCTCTGCCTGCATATTCAGTCCGTAGAGAACGTGGCGTGTATCTGGCTGTAAAGAACATCCCTGTCCATGGCTTTCATTCACTGGTGGTGTTCATGCCAGTTCAGCAATGACAGGATGCCCTTAAGCCATCCGTCATGGCTCAGTATGATGACGACGAACACGCTCTGCAGGTGAGTATGTGGGGCCGGAAGCATCATTCCCGTATTTTGCCGCACCTTCCGGAAGCAACCGGCATCAGTTACGGTTATCGTCAACCGTTGTGCGGTATGCCAGGAGCACGTACTGATCCATCAGATCCGTCAGTGCTGCGCTCTGCTTCTGACGGTCGATGATCCATTCCTCCAGCTCTTTTTCTGATTTCCGTTTCAGGTTCTCTGTCCAGTCTTCAAGGCTGTTTCCGGTCATTAAATCAAGCCAGCAAAGCCAGACCAGTTTAAGGCGTAACTCCGTGCCACCAAACTGACGGAAGAGACTGATCAGTTTATTTGCCAGAGAATGGTGTTTCAGACGTTCACATTCAGACAGCAGAATGCCTTCCAGCATCGGATGATGAACCAGCAGATGTTCACTGGTGCATAAATTCACTATCAGATCATCATCCAGGGGCGGCGTGGCCCGTTTGTCAGGGCCATAAATCAGCCGGTGCAGGCTCTGGGGGACGATGGCAATCTCCGGTTTTTCACCATTGAATGACACCCAACCGGGCGTCAGATTATGGAAGTAGTAGATATTCGGATGACACACGCCCCAGGATACGGCGTGCCGGGTGAAGCCGGTCAGCCATTTCTGGTAGCTTTCGTGGAGCCATTCGTAGCGGGTCTGATACTGTTGTTTCATTTTCATATTCCTCACGGGGATATAACAGGCAGCAGAATGCCGTCGGCAGGCCGGGGATTACGTCTGCTGCGGGTGAACGGATAAGGGGCAGAAATTACGGACGGGACTGTGCAATACGCGCCTGTAGCCAGGCTTCCACTTCACTTTTCAGCCAGCGGGAGCTGCGGCCGAGTTTGATGGGAGCCGGAAAGGCCCCATCCTTGATGAGCTTGTAAAACCACTTATCGGTCAGGCCGGTCAGCTGAGTGATAAACGCCATATCGACCATCTGGTCATCCATCAGCGAAACTGGGGTGGTCATGAGAATGTTCCTCCGGATGTTGTTAAACCAGCCGGTGTGGTCATGAAAACAACGCACCACCTTCACCGGCATAACAACCGGGGAGAAACGTAAAAAATAATCGGGAATGTGACCGCCAGGAGATAAACGGGGAGGTTGCGGTGGCTGAGCCGTACCGGTATGCCGGGTGGTTGTTACGAAACGATGCTTCCGCACGCGCGCGCGATAAAGAAGGGATGATTTTCGTGAGGAAGAGGGGAAAACTGGCGTTCTGATAATTGCAGAGTTATTTACGGGGCGGTTACGGTGGGGCAGCCCGTGCCTTGTGGGCAGGGACAGGTATCACTGTTGGGATAAGACTGTCTGATTGGTATTATAAATATATAAGAGAGATCAAACACATGAGTTCATATATGTGAAATAAACACTGTGAGTGCGATCAGTGGGCATGAGAACGCAGTAGCAGCAGATTGCGTTGTACCCACAGGGCTATCTCTGATAGCTGGCGTATAACCTGCGTATATAAAACGTACCTGCAACCCACGTGAACAGAGTGCATTAAGCATCCGTCTGAGCGGTAAGCGTATCCGACCTGGTGAGTGACAGTC
TGAGAGACCGGTTAAGTGGTGGCCCGGGAGTAATGAGTTTGGGGAGGGTACCAGAATGCCAGTCTGGTGCGGTATGCTGACAGTATCACGTTAATTGTAGCATAAAGTGATGAGAGGCTAATGGAAGTTAAAACCAAAGAGGACTGGCTGTATCAGTTCCGTCGTTGTTCATCCCGGGAGACACTGGAAAAGTGATTTCCCACACGCGTTATAAACTTACCCCTGCGGAGCTGGAAACCTTCAACTCTGCGGTCGATCACCGACTGGCAGAACTGACAATGAACAAGCTTTACGATCGTGTTCCCGCTTCAGTCTGGAAATATGTCATCTGAGCGTTCGGTAATCACAACGGCATCGCGCCAGGTGTTATCCGCTTTGTGATTCTTATTGGCTCTTCTCAGGTTATTAAAAGCTGAATATGTTCCTGTTGTCGGGAACCAGGATAATGTCGGGAACGTTAAGCTGTATAACGACAACAAAGAATTCGAAATCAAAAGGGAATGGTGTCCGGAGGTGTGGGTGATCATGGGCTGATTTTCGGGTATTAACGGCCTGTATCTGCTGAGGTTTTCTCGTATTTACCGGGGATCCGGTGCGTTACTGTCTGGCACAGCAAACGCAGAGGCGGCGGGGCGTTCTGCCCCGTTGATTCCGGATAATCCCTCCCCTGTAACCTACTGCAACAGTACAAATACACTGACGCCTGAAACGACAGCCAGAATCAGCACAGGCAGTAACGTCAGTAGAAAGAGCAGACGATCGGTATTTCGGTAATAGCCTGAGGAAGCTGTTGACGCGCAGCTCGGGCAGCATTCCGTATTATGAGTAATTGCGTTACTACAATCCCTGCACACCGTTTTGCGTATCACCATAACGCCCTCCTCACAGTTTTATCCGTAACCGGTTTTTTGCATTAAAAAAACAACGCTGAAAATTCAACCGTCAGTCCGGAGACGACAGTCGGGGTATCACTGAGTGCCTGAGACATTGCCCGGTTGGAGATCACTTCAGGGACTCTGCACATTTTTTACAGCGTGGTTATTCCGTATGAACGAAACCATACGGAGATTTAACCATGACACACGAGTACAACCCCTTCTGGCAACAACGTATCCGTGAGACGGTGCTGCGCGCACTGGATGTTCATCCCCGCCTGACGGCATTGCGGGTTGACCTGCGTCTCCCGGATGTACCGGCAGCAACGGACGCAGCTGTGATATCCCGCTTCATCAATGCCCTGAAAGCCCGAATCAACGCTTACCAGAAGCGTAAGCGCCGTGAAGGTAAACGCGTACATCCCACGACCCTGCATTACGCCTGGGCCCGGGAATTCGGGGAGCTTAAAGGCAAAAAACACTATCACCTCCTGCTGCTGGTCAACCGGGATACCTGGTGTCGTGCCGGCGATTACCGCGCTCCGAGATCACTGGCCGGGATGATTAAACAGGCCTGGTGCAGTGCCCTGGGGGTGGATGCCGGGCCTTATGCCACGTTGGCGCATTTTCCTGACACGTCGGCGGTGTGGCTGGAACGCGATGATGACACTGGCTTTCAGCAGGTGCTGGAACGTGCTGACTATCTGGCGAAGGAACATACCAAAGCTCACGCCACTGGTGAGCGCAACTTTGGCTGCAGTCGTAGCTGAGCCCGACAGACGAGTTATCTGACCTCAGTGTGATGACGCGCTCACGGCCACCACACTGATTTTCCCTTCTGAGACCGGACTGAAACCGCCATTCACACCAGACCAGGCTGCCCGTACCGGGACGGCTGTTGCCTGCGTGTCGTCTGGCGCTAAGGAACATATACTATGTACGCAAAATCTTTTCTCGCACTTGATGGCAACGGACGTCTGACCGGTGCCCGTACCGCACAGACTGCACCTTATGATCGCTACACCTGCCACCTTTGCGGCAGTGCACTCAGATACCACCCGCAATACGACACTGAACGTCCCTGGTTTGAACACACTGACGACGGGCTGACAGCGCACGGTCAGC
AGTGTCCTTATGTCAGACCGGAACGCAGGGAGGTACGCTTGATTCAGCGTCTGCAGCAATTCGTACCGGATGCCTTACCCGTGGTGCGTAAAGCCAGCTGGCACTGCAGACAATGTCACCACGATTATTATGGGGAGCGGTACTGCACACACTGCCAGACCGGGCGCTTCAGTGAGGAGGTGGTAGCAGGATGAACGGCATAAAAATGGGGCTGGGGATCACGCCGGGTGAGCACATTATCAGTGCCAACTCTGCGCTCAGCAGAAATATCCGGCATTGTTTCTGCCTTTCCTGTCGTGGCCGGTTGATTTTGCAGACAGATGCTCAAGGGGCGTGGTTTGAGCATGATTTACATGCCCTGAGTGCGCAACAGAAGGCGGCCTGTGTGGTGCTGAATCCGGAGAAGTCCCACCCTTACATTGAGGATATGGCGATGTTTTTATCACCATTACCGGTCGTGCTGGAGTGGCATTGTGTGATGTGTGAGCAGTTTTTTCATGGGAAGAAGTTCTGTGAGGCCTGTGCTACGGGGATTTATTGCAGAGCGGTATGTACGAGGTCTGTTTACAGCTACCAGCCAGACTTGTTTGAGGACTGCGGAGGCGCGAGTACTTAATCCAAAATCGCCGCTCTGCGCAGGGCGTTCAGAGGTATCGCAACCTGCATTTTATCCGGTGTTTCGCACCTGCCCCGTTCGTGACACAACGGCGCGACACCGGATATCCGCAGTCCTGCTTACCTGAATAAAACCCGGGGTATTGATTGGTAGTGTGGTGCGGAATCGTACTGTGATCTCGCATTCGCCGAATTCGCACTGTGGTGTCAATCAACCATACGTGAGAGTCCTCTCCGTATTTATCAGTGCCTCAGGACGATTTCTGCATAGTTAATCCAGGGGCTGCATTGTTCATTTCCCTGCTAATGGACGTGACATCCAGGATATTAACGATCCTGATTTCCCTGAGCAGTATCAGCGCTTATTGGCACGTCTGGATTATCTGACGAAGCCCGGTACTAAAGCATCAGGCCAGCGTAACTCTGGCTACAGCGGTTTTTGACCTGTTGGTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTGGTGAGTACTACGTTCCATGGCAGGAGTTCATCAACTCGGTTGGAAGGCCATTCCGGCAGTACGCTCAGGATATGGCGCAGATACGCTTCCGGATCGATACCGTTCAGACGGCAGGTGCCGATCAGCCCGTACAACAGTGCTCCACGCTCGCCGCCGTGATCGCTGCCAAAGAACATAAAGTTTTTCTTTCCGAGACAGACTGCACGAAGCGCTCTTTCCGCAGCATTATTATCCGCCTCCGCCAGACCGTCATCACTGTAATAACAGAGGGCATCCCACTGATTCAGTACATAGCTGAACGCTTCGCCCAGTCTGGATTTTTTCGACAGCGTACCATTCTTCTCCACCATCCATTCATGCAGCGACGTCAGTAACACTTTGCTTCGCTGCTGCCTGACGGCAAGACGCTCTGACTCCGGTAATCCCCGTATTTCATCCTCGATGGCGTACAGTTCACTGATTCGCTTCAGTGCTTCTTCTGCCGTCGCACTTTTGCTGCTGATGTATACATCGTGGATTTTTCGCCGGGCATGGGCCCAGCACGCAACTTCTGTCAGTGCACCACCTTCACGTTCTGCACTGAACAACCTGTCGTAACCTGTGAACGCATCCGCCTGCAGGATACCCCGGAAGGGGCGGAGGTGTTGCTCCGGGTGTTTCCCCTGCCGGTTCGGCGAGTACGCGAACCAGACCGCTGGAGGAGATGACGAACCCACATTGCGATCATCCCGGACATACGTCCAGATACGCCCTGTTTTCGCCTTTTTCTGACCCGGTGCCAGTACCTTTACCGGTGTGTCATCAGTGTGAACCTTGCGGGTGTTCATTACATAACGGTACAGGGCATCATTCACCGGTGTCATTAACTGGCAGCACGCGTCAACCCAGTTGGAGA
GTAAGGCCCGGCTCAGTTCGACACCCTGGCGGGCAAAGATTTCACTCTGACGATACAGTGGCAGATGTTCGCAGTATTTTCCCGTTAACACGCGGGCAAGTAATCCGGGGCCCGCGATACCACGCTCTATCGGGCGGGACGGCGCCGGTGCTTCAACAATACAGTCACATTTTGTACAGGCTTTTTTTACCCGTTCTGTGCGGATCACTTTCAGGGCACTGCTCACCAGTTCCAGCTGTTCAGCGCTGACTTCCCCCAGATAATCCAGCTCACCGCCACACTCCGGGCAACAGCTTTCTTCTGGCTCCAGGCGGTGTATTTCACGGGGAAGGTGTGCCGGTAACGGACGACGATGGCGCGACTGTCGCAACTGGCGGGGAACCTGAGGATCGTCTTCCCGCCCACTGTAACGATCGCTGTCCTGTTCACGTTGTTTCAGCAGAGCCTCAGCCAGTTCAACTTCACGACGCAGTTTTTCAGAACGGGTACCGAACAGCATCCGGCGCAGTTTTTCTATCTGAGCCCGCAGATGTTCTATTTCCCGTTCATCTTCTTCGATCTTTTCTTCGGCACGTGTCAGTGCAGAGCGCAGGAAGGCTTCCGTCTCTTCAACCAGACTCAGTTGCTGGTCTTTCTGACGGAGGGCTTCAGCCTGCTCAGAGAGCAATCTTTCCAGCTCTGCGATGCGAATGAGGTATTTCTGACTCATGACCGTTTTTATAATGCGGTCAGGAGTTTTTTACAACATTGTCAGTGAGTTACGGCTGGATGTTTTTGGCTGACGCCAGTCCAGCTTATCGAGGAGCATTGCCAGTTGCGAGCGGGTAATGGATACCTTGCCGTCACGTACCGCAGGCCAGATAAACTGGTCTTCCTCCAGGCGTTTGGTGAACAGGCACAGACCATCAGCATCAGCCCAAAGAATTTTGACGGTGTCACCCCGTCGGCCACGGAAGATAAACAGGTGACCGGAGAAGGGATTATCATTCAGCACATGTTGTACCTGTTCTCCCAGTCCGTTGAAGGATTTACGCATATCGGTAACGCCGGCAACGAGCCAGATACGGGTACCTGATGGGAGTGAGATCATCTTCCCCTCCCGGTCAGTTCACGGATCAACACTGTGAGCAGCTCTGGCGATGGATTTTCCAGCGTCATGTTACCGTGACGGAATTCCACCTTGCAGGAACTGGCACTGACTCTGGTCTGAGTGGAAGTGGATAAAGACGGCGCAATGGCCGCCACAGGTTCTTTCTGCTCATCCGGCGTTATTTCTACAGGTAATAATTCAACGCCAGTGTCAGAAGAGGTCGTTACCGGAAGACGCCGCGAAACACGCCCTTCGTTCTGCCAGAGCCTGAGCCATTTGAAAATAACATTATCATTGACGCCATTTTCACGTGCAATCTGTGCAACACAAGGGAAGGTGCGAACAAGTCCCTGATATGAGATCATGTTTGTCATCTGGAGCCATGGAACAGGGTTCATCATGAGTCATCAACTTACCTTCGCCGACAGTGAATTCAGCAGTAAGCGCCGTCAGACCAGAAAAGAGATTTTCTTGTCCCGCATGGAGCAGATTCTGCCATGGCAAAACATGGTGGAAGTCATCGAGCCGTTTTACCCCAAGGCTGGTAATGGCCGGCGACCTTATCCGCTGGAAACCATGCTACGCATTCACTGCATGCAGCATTGGTACAACCTGAGCGATGGCGCGATGGAAGATGCTCTGTACGAAATCGCCTCCATGCGTCTGTTTGCCCGGTTATCCCTGGATAGCGCCTTGCCTGACCGCACCACCATCATGAATTTCCGCCACCTGCTGGAGCAGCATCAACTGGCCCGCCAATTGTTCAAGACCATCAATCGCTGGCTGGCCGAAGCAGGCGTCATGATGACTCAAGGCACCTTGGTCGATGCCACCATCATTGAGGCACCCAGCTCGACCAAGAACAAAGAGCAGCAACGCGATCCGGAGATGCATCAGACCAAGAAAGGCA
ATCAGTGGCACTTTGGCATGAAGGCCCACATTGGTGTCGATGCCAAGAGTGGCCTGACCCACAGCCTGGTCACCACCGCGGCCAACGAGCATGACCTCAATCAGCTGGGTAATCTGCTGCATGGAGAGGAGCAATTTGTCTCAGCCGATGCCGGCTACCAAGGGGCGCCACAGCGCGAGGAGCTGGCCGAGGTGGATGTGGACTGGCTGATCGCCGAGCGCCCCGGCAAGGTAAGAACCTTGAAACAGCATCCACGCAAGAACAAAACGGCCATCAACATCGAATCCATGAAAGCCAGCATCCGGGCCAAGGTGGAGCACCCATTTCGCATCATCAAGCGACAGTTCGGCTTCGTGAAAGCCAGATACAAGGGGTTGCTGAAAAACGATAACCAACTGGCGATGTTATTCACGCTGGCCAACCTGTTTCGGGCGGACCAAATGATACGTCAGTGGGAGAGATCTCACTAAAAACTGGGGATAACGCCTTAAATGGCGAAGAAACGGTCTAAATAGGCTGATTCAAGGCATTTACGGGAGAAAAAATCGGCTCAAACATGAAGAAATGAAATGACTGAGTCAGCCGAGAAGAATTTCCCCGCTTATTCGCACCTTCCTTGATAGTGTTTTATGTTCAGATAATGCCCGATGACTTTGTCATGCAGCTCCACCGATTTTGAGAACGACAGCGACTTCCGTCCCAGTCGTGCCAGGTGCTGCCTCAGATTCAGGTTATGCCGCTCAATTCGCTGCGTATATCGCTTGCTGATTACGTGCAGCTTTCCCTTCAGGCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATATCACCACGTCAAAGGGTGACAGCAGGCTCATAAGACGCCCCAGCGTCGCCATCGTGCGTTCACCGAATATGTGCGCAACAACCGTCTTCCGGAGCCTGTCATACGCGTAAAACAGCCAGCGCTGGCGCGATTTAGCCCCGACATAGCCCCACTGTTCGTCCATTTCCGCGCAGACGATGACGTCACTGCCCGGCTGTATGCGCGAGGTTACCGACTGTGGCCTGATTTTTTTAAGTGACGTAAAATCGTGTTGAGGCCAACGCCCATAATGCGGGCAGTTGCCCGGCATCCAACGCCATTCATGGCCATATCAATGATTTCCTGGTGCGTACCGGGTTGAGAAGCGGTGTAAGTGAACTGCAGTTGCCATGTTTTACGGCAGTGAGAGCAGAGATAGCGCTGATGTCCGGTGGTGCTTTTGCCGTTACGCACCACCCCGTCAGTAGCTGAACAGGAGGGACAGCTGATAGAAACAGAAGCCACTGGAGCACCTCAAAAACACCATCATACACTAAATCAGTAAGTTGGCAGCATCACCGGCTGATTCGATGTATTTGCGGGAAAAAAATCGGCCCAGATCCGCGAAATTTTAATCAGCGAGTCAGCTGGGGAAGAAATGACCTGCTTATTCGCACCTTCCCTAACTTTAAAAAAAAGCATCTGTCGATGCCCTGAATAAAAAGTAATTAATACTTGCGATAGAATTTTCTTTTATAAAAAACCGCAAGGTCAATCGTCAGTGCTAAAATGTATTTATTTAGCGAGGGCAGATTGTCTCTTTTACGTATGCCATCGCTCCATTGATTTGCTTTAATAAATTCAATCAATGCTGCACGGTTGTTTTTATTCCATAAAGAGCTTAATATCTCGAAGGTCGTGGGTACATGGTTTTTCTTAACGTATCCACCTAAGCCGAATGTTTCGTTATGATTTAAATTATTAAAAGCGTCAGTTAGAGATTTATTACTACCACTAATTAGCACATCCCAAATATTTAGGGCGTAGCCACGTACAACGTCGTGAGTAATGCCATAACTCAACACTGCAGCGGAGAAGGATTCCATACCGGCCAGGATCCCTTCTTGAAATGTTTTTGTGAAATTATGCCACGCGCTGTTTTCGTCGTCGTTAACACTGCGAATAGCGACTATCTTACCATTCTCCAGCCCATA
GCCTGTTACGCTACCTGAAGGGGAGTTGGTAATCATTTCAATAGGAGCAACGGAATCAAGAAAATGAGGATACTCCAGTTGGTGATTCAAATCTGGGCCAAAAGCAACTTTTGATGTGTTTGATGGTTGCTCGTTGAGAAATTTCTGCAGGCCCAGATATATGCCGGTAAAATGAATATCAGGATAAAGAAGAGCAATATTATCGTGAATGGTGCCACGCCAGCCGACATCGACTACGCAAATACGCGCATTGATATCATCAGTTAGACCTTTTGTCGCAAAATAGTTCTTCAAAAGAGCTCTTTGCTGCATCACATGCTGCCAGAGCGTCTCTTTGAAACCAGAGTCGTCAAAAAGTTGCTGAATACGGCTGTCCTGCCATGGATATTGGATCTGTTCATCTGCAGGAATCCCATATTTGTCGATGAAACTCTGGAATGTTTCAGGGGCAACGTTAAGGGTCTTAAATAAAGAGCTAATGGACTGTGTGCTATATAAATTCCAAACACGCATCATCTCTTTTATGCTAATTTCTTGTAGAGACGGCGCGAAGGTAGCCAGACGGCTGACTTCTATAATATCGATTTCTGGTAACTTTATTTCTTTAATATTAGTCTTGAGATGACTAATCAAAATATTCATTGCTTTAATGAAAAACTCGCCTTCACGAGTGAAAAAGAGGATTTTTTCACTTTTTGATATGACGGCTTGTTCCAAAATTTTAAGGCAAAAACCAGCGATTAACGGAGTTGTTTTCAAGCCAAGCTGAAAATCAACTGATAAATCTTTAGATGCTGCATATTTATTAAGAATATTATTGGTGATCGTTTCCGTGAGATCTTCATTTTTATTCCAGAGAAACTCTTTTTGCTCACGCAGCTGATGCTGTTGGGCTGGGAGATACCGAATACTTTTTATACCTTTTGATGTCGGCATTTGCACATCGGACCATTCATTATCGCCAATATGTATCCAGTCAACGCCGGCTAACTGATATTTTTGTTGAATAAAATCGAAGAGACGTCCACTTCTCTTGTTCAGTCGCTCGTCAATTGAAGAAACGCCATCTGATATGACAGACTGATCCAATCCAGCGCTAACGAGTAACTCTGTTAAATCAGTGGATGAAGTATAAAAATCGGACAGAAAGATTGTCTTATCAGCGGGATATTGCGCAAGAAAGTCCTCAATACCTGGATCCAGATAGATGTGCTTCTTCTCTTGCTCAATTTCCCAGGAATATAGCTCATGTGCTATCGCTGGCGTATCATATTGCTCTGCAGAGATATAGTTTTCTACCCATTGAAGCATTACATCCTGAATGAGATACTCATCATCATAACCAAGATCTTTGTTTTTCTTACCAATTAAGCGCTCTGTTTCAACTCGCTTAGTAAATAGCTGGTCGACGGAATCAACATTGTGTAGCGAATGAAATTTAAGGAAAAAATAATATGCAGTTGACCGCTTTATGAAGTCTGGATGGCAATCACGACGCAGAATTGTATCCCATACATCAACAGTTTTGAGTTTAAATTTATTCATCTGGATTAATTCTTCAACTTAAAATAAAGCGCGCTTAAAGGTGTGCGCACAAATAACGGTAAACGTAGAGCAATTCTCTTTAACAGACTCTGATGCGCGAATGTTAACGTGTTGGCAATATCAGGTCTGGCAAGAGGGGCTTCAAACGGGGGCTTATTATACGATCGTGAAATCACTTCTGCTGTGTATGCCTGCTCGTTCATCACGCAGTTGAACGCTTTGTAAAAGCCTTCTGTTTCCGTTTCCAGGGTACGCCCATTCATAAACTTTTGACCAGCCGAGCTCATACTCGCGGCTAAATCAGCATCGTTTAGAATTTTCATAATACCGGCGGCAATGCTTTCAGGTGTCTGTTTACACAACAGCATAGCATCTTCGCTAAAATCATAAAGATTGTTTTCTCGCCAGACTTCAACAACGGGAAGGCCTGCATTCATCATTTCAAAGGGGATGCG
AGAGGGATTTGAAGAACTAATACAGAGTCCTACGGAGCATTTATTATATAGCTCATTACATTTCGGCAAATCTAATAAACCTAGATTTTCATGCTCGAACCAGAGATTATCATTAATTGATGAACCATAAAGATAAATTTTAACATCTGGCATGCAATGTTTAACTATACCTAAAGCCTCAATACCAATACGCGAACAACGTCGCGGTTTATCTGGCTGATAGATAAAGCAAACCGCTTTTTCTTTTTGAATGGTGGGAACGCGCTTATAGATATTATGATTCGCCCCAAATTCGAAGTGATAGCTTTCAACAGCAAAATCATTTTGCAGTTTATATCGTAACCAGCGCCCAATCGTAATTGGCTTTAAGCCGTAACAGTATGAATTTTCAGCCATCAGGTAAGCATCGCCCACCGGGTTGAAGTAAGCTTCAAAATCCTGAACGAAATATAAACGTTTACAATCAAATGGTAGAGCGGCCACGAATGGTGCAGAATACCATATTGTTGCAACTGCAGCATCACATGGTTGCGCATTATTCCAGTCATAAATGACGTCTTTAAATTTTAGACCAAATAATTTATGAATTACCTCTGCACCGCTTGTTTTGCCGCTTTTATCTTCAAGAAAAATTGTACATGTATGACCCATTTTTTCCAGATAAGCGGCATGTTGCAGCATTGTGCGGTGGCCGCCAGACCCTTCAATGAGTTGAGGTATAAACCAAGCAATATTAGCCATTATAACCTTTCACAATAGAATTATTTTATATTTAGCATGCGTAGTAGATGCCTTATTACAGGCGTGCTATTAATCTTACGCAGTATGGTATTTGCACGTTCTAGATCTGTAGAGAGCTGGGCAATATGGAGATCTTTTTCATCGATTAATGAAATTAATTTTATGTTTTGATTCTGTAATTCATCAACTTTCTGTTCCTTAGAGGAAACTTGACTATTTAAGTCAGTTACATTTTGAATTAGCTGCTGAATAGTTTTATTATTCTGCTCAATTAATTGATTTTGCTGAGCAGAGACCAGGTCACGCTCATCAATTAAATTTTTTTGCGTAAGAACAGTGCTGTCTCGCTCATCAATAAGGCGCTTCTGGTCGTGAATAGTTCTATCGCGTTCATCAATTAAGCATTTTTGATCATGAATTGTTCTGTCACGCTCATCTATCATTTGCGACATATTATTTATTGCATTCATGCGTTCATGAAGCAAAGCCGACCAGGTGTTACCGTTGTCAAAATTTGCAATGTTGAATTTTCTACCTGGTATGGCGTTTTTAATGTTGACCATCTTTATAAGCCAATCAAGAATATCATGGTAAATATAAAGCACAAAATTAAATGGTTGAGATATTTTACCCACCAGTTCGTCAGGATGTTGATAAAGTAGCTGTATACCTCTTTTAATCCCTCTATCGACGATTATCTCGGCACAGGAGATTCTATTTAAAGAAATTAATATAACGGACTGAAGATAAAAGCTTTGTAAAATCTTCGTGCCGATAGAGGGGCTAAATTTATCAATGCTGGAGTTTGCTGCACGAGATAAATGGATTAACGCATTAGCTTTATCATTATTTTTGTTACAAATAAGACCTAAAAGTGTCGATAAAGATATGAGCCAACGATATTGATGTGGCGTAGGATTATCCATCTGCGACACATTTGAGCAGTAATCTAGCATTTTTGCGTAAATGTCAGCTGTATCATCACTGCCGCTTAGCATTTGATAACCTAATACTGCCAATGCAGCGGCATAATCGGGGCTGTCAGGAGCACTGTGCTCCAAAATCTGATGGCTATACTGCTGTAGATGATAAGTTGACCTATTCCTGAATGGGAATTCTACGATCCCGCGAATAAGCCACGGGTTAGTATAATCTCTGGCAAAAGCTAATAGGTTTTTTGGTGGATGCGAGTAGCCATAAATCGTTTCCTGATACGGAAGCTCTGACCTGGCTAAAGGGCTTTTCATAAAAACACATAA
TTCAGGAGAGAGTGTAATTTCCTCTCCCTGATGTATTTGAGTCTTTAAAAGGCCATGTTCATAAAATAAATAAACTTCAGGTTGTAAATTGTAATTACTTAATGTCTCAACTACAGTTTTATCTGTATGAAAACTAACGTACCTGCCAGCGGGCACCAGGTAAGATATAATGACTTTGAGTGCATTATTTAGTCCATCATCGGCATATTTATCAGTAATGAGAAAATGGTAGCCATTGCCATTTAGCTGAACACTTTTAATTTCATCCAATGAGTAAATTTCAACTTCAGATGCCTTTGTTTGTTCTGCAATAATATTTTTAATGTTTGTTATCGAATTTTTATTCTTAATGATTACAAGTACTCGATCCCCAGGACGTATGTAGGGTAAAGTATTTACAATTTTTTTTTCAAAAATAGTCGTTTCTAAGATATTTTTGTCTGTCGAAACTACTGACTGACGTCTATAACTCAAAACTATCTTTAGGCTTTGATCATCCCGAAATTGTTCATCATAGCCAGGATTGAGCAAATAAAATACAGTGTCTATCGAATAACCAAGATTGTCAGTTATATTAATAAAATCCTGGAAACTCTGACCGTTTCCCTTTTCTACAAAGATATTAATAACGATTTTATTTACAAGCTGTCTTGCAGCGGTTCTGACAATCTTGGCATATTCATTAAAATTATTATCTTTATCATTGGTTAAAAGGAAATTAACTTTATTGCGGTTAAGAATATCTTTACACTGCGCCAGATCAGCGCAGTTGTGCACCTTATAACCCTTCTCTGCAATCAGGCCCTCAGTTTCTTTTGGGTAATTTTGTAATAAATACACCCCAGGCCCGAGATAATCAGTTATTGAAATATCATTCATCAACTTACCACTCTGCAGTCATCTTGAGATCCAGAATGCCACCAAAGAAATATTCATCACGCAGGACATTAACGTGAAAAAACGCGGCTTCATCACGCCAGTGTAAAATGCGCTGCGAGAAATAATTAGGTGTCCCCTCATAGGATATCGCCGCCTGTATTTCATATAGGTTTGTTCCTAAAGTACATTCAAATTCCATATCAACATAAAAACGCTCGCCTGCTTTAAACTCACGTTCCCAAAATTGGGGATCTTTAAGTTGATGGAAACGAACATACATGTCCTGGTTTAACGTTCCCCATGAATAAATTTTGACGCCTTCTTTATTGCGAATACGAATGCCAACGTTAATGTGGTTTATATCTTTATTTACTTTACAATTAACACGAACTCTGATTTTATCACCAGGGTTAAAGGCATTGCTACGTTCACCAGAAGCCGTAAACAATTCAACATTTTCAACCTCTGCTTCACCTTCACCGAACGATAGTTTATCTGTACGAGAGGGCGAGGTTTCCTCTACAGCAGGGGTTTCAGCATTAGCACTAATTTCTATCTTAGCCGGCGCGGTGGGGGTTTGATTAATATCGCGAGTGGCGTCCGCGTGGGCAGCAACTTCCTTTGCTAACTGAGAAAAATACTTACTTTCTTCCCGGTGTAGTAAACGACGATATTCAAGTAAAACTTCTGATGATAAGCCAGTACTTAATTGCGTCCCACGATCGAGAAGAATAGCTCTGTTTGTTAAGGTACGCACGGACTCTTGATCGTGTGAAACAAACAGTAAAGTTACGCCATTGCTGGTTAATTTCTCAATTTGTTGGAAACAGCGCTTTTGAAAAAGTGCGTCGCCTACAGCAAGTGCTTCATCTACGATGAGAATATCTGGCTCTAATTGTACCTGAACGGCAAATGCAAGGCGCACCATCATTCCGCTGGAGTATGTTTTTACCGGCTGGTCGAGATGGGTTCCTATATCAGCAAAACCGGCAATAGCATCAAATTTTGCATCTATTTCTGCCCGCGATAATCCCAGAACGGCGCCGTTAAGATAGATGTTTTCCCTACCGGTAAACTCTGGGTTGAAACCGGAGCCTAATTCCAGGAGGGCAGCTATTCTT
CCCTTGGTTTGTATCGTGCCCGTTGTTGGAGTTAGCGTGCCGGTGATTAGCTGTAGTAATGTACTTTTGCCAGAACCATTACGTCCAATAATTCCCGCTGACTCACCTTTATTGATGGTAAAGTTGATGTTATTTAACGCCCAAAATTCGTTATAGTAGCGTTGTACAGGTTTCCCAACCATACTCTGGACTTTAGGTAAGAAAAACTGTTTTAACCTATCTCTGGGGTTTTGGTAAATGTTATAGCATTTGCTAATGTTACTTACGCTAATTGCAATGTTATCAGATGACATCGGCAAATCCTTTTTTTGTTTTTTGGAAAAATAAAAAACCTGCAGCACAGATAACTAAACCAATAAGCATATTAATTGCGAGCATGGTCCAGTTAGGCTCATTTCCCCAGTAAACAACTTTTCTTGCTTCTTCAATCATAAATGCCAGAGGATTCAACATTACAATTTTTTGGAAAACAACCGGTAATGCTGATAACGGATAAAATACGGGAGATAAAAACATTAATATTGTTACGATGATACTAATAGTTTGTGGTATATCACGAACAAAAACGCCCAGTGAGGCCAGGATCCAGCTTAATCCAAGTATAGAGAGCATAAGTGGAATAGTAATGACTGGCAGTAACAGGACAGTAAAATGAAGCTGATGCTTAAACACCAGTATTGCAATAAATAAGACTATGAGGCTAATGCTGGCATGAAATAAAGCAGCCAGCAGATTTACTATAGGTAATATCTCAATAGGGAAAACAACTTTTTTGACGAAGTTTACATTTGAGGTTATAAGGCTGGTTGACCTGTTAATACATTCTGCAAATAAGTTAAAAATAATCAGTCCGGTAAACAAAATGATTGCAAAATCTGCATGCCCGGCATCTGGACTGCTTCCCCAACGAGATTTGAATATAACCGAGAATACAAATGTGTAGACTGTTAACATCAATATAGGGTTGAAAAGTGACCATGCCAGACCAAGAAAGGATCCTTTATAGCGTCCCATAACGTCTCTTTTAGACATTTGCCAGATTAACTCGCGCTTCTGGATAATTATCTTCGAAATATTTACTAAAGTTCCAATCATTATTTTTACCGCTTTGTTGTACTTAGTATTTTATTGTAAACACATAACGTTTTATCTATAACAATTTCTTTGCTGAATCTTTCTTTTACAATTTCGGCTGAGCGTAGACCGAATTTTTTTCTTATTTCCATGTTATCTCTAAGATATTCAATTTTCTCCGCAATCTCATTAGCGCTATGTGTTTTTAGTACACATCCATTGAAATTATCTTTAATTATGGCTTTACAGCCGGGTACATTTCCGACAATGCAACTTCGGCCCATAGCACACGCTTCAATGATTATTCTTGGCACGCCTTCTGAATATTTAGTTGGCAAAATCACGACATGGCTATTTCTGATTAAATTTTGGATGTCATCTCTTCGACCTAACCAAATGATTTTTCCTTCTTTTTCCCATTGCAAAATTTGATCTAATTCAATTCTATCCGGATCTTTGTCATCGAGAATTCCCGCAACGTATAAGGTTGTGTTATTTTCATTTTGTTTAACCTTACCGATGGCCTCGACTACTTCACCGAGCCCCTTTGACCATAGCAAACGGCTGGCAAATAATACCGAGAAATTATTCTGAGTTGGTTCTGGCTGATAACTAAAGCGAAACCTGTCTACCCCGGCTCCTTCAATGATGTGGATGTTATTTTCATCAAATTTAATATATTTACTCAGCTCAGCAAAGTCAGCATTATGCTCAAATATTACTTGCGCTGGGCTTTTTGCTGATAGCAAAAGACTATAAATTGACAGTACTGAGTTGAAAATGAATTTATTCAACCAACCTCGTTTATTGCCAAACAATCTCCCTAATCCCACAAAGCTCACGACAAAAGGGATGCCTGCAACCCGGGCATACAATCCACCAAAAAGGATTGGTTTTATAGTAATTAGATGAATAAGATCGGGTTT
AATTTGCTTACAGATCTTTCTAAATGCAGTGAATATTGAAATGTTTTTAAACACATTTTTAGAAAATCTGTTAAGCTCAATATCCCAACATTTTATACCTTTTTTTTCCAGGTTATTTTTGATGGCGTCATCAGCAAAATTACTTACAAGATGAACTTCATACCCTTTGCTAATTGCCGCTTCTGACCGGTCCAACCAATGCAGTTCGAAGTACCACGCTGCATTGACAAAATACACTATCCTCATACCATTCCTCAACAACTCGCGAGATGGTCAATCACGGCTTACTACATGTAATGTAGGCACGCTACCGCCCTTGGCTCTGGCTGCCAAAGGGCGAAACTCTTGTTTTTTGTCAGGAGTATTGACGGCCAGGCTACTGACCGCTGCAACTACCCTAAGCCTAACACTCCCATTGCAGCGTAGTATTACTCAAGACAACGGGTTGTATTTTGAGTGTAATAATCTTAGAGCCTATCCCAATAGGATTTTATTCCAGACAATGACGCCGCAGGTAAGATACAGCATTGCCACATAATTTTCGACTTTCTTCTCCCAGAGAGTCAACACCCGACGGAAGCGATTCATCCAACTGTGCGTTCTCTCCACGACCCAGCGGTGAGCTTTAAAATCCGTGCTTTTGATGGCTTCCGACTCCTCTTTCCTTGACTGGATATGAGGTTCGTAACGGCGGTTTTTCAGACAGGCCTCCAGCCATTCGGCTTCATAGCCTTTGTCCATACAGAGGTGGAGCCTTCTGCCAGGCCTGCCCGTCTGAAGGGCATCGAGAGTATCAGCAACCAGCTTTATGTCGTGGGTATTTGCACCTGCGACAACCAATGAGAGGGGAAGCCCGTTCGCATCGGTCATCAGGCTACGTTTTACGCCCTGTTTCCCTCGTTCTGTGGGGTTCCGTCCTGTTTTTTGAGCCAGCCAGCGGTGACTTCGTCATACAGCCATCCATCGACAGCCATGACCAGTCGATGGCATCCAACTGCTCACAGGCCAGCAACCCGTTTTGCCAGAACCGTTCAAAAACTCCTGCATCCCGCCACTCCTGGAAGCGGCGGTGAGCAGAGCTTGACGAGCATATCCCGGTGGCATTCAGGGCATTCCACTGGCAACCCGTTCTGAGCACAAAGAGAATGGCGTTCATTGCGGCACGATTATTAACCCGCCTGCGATGCGTACCCAGCGGATGGTGAGTTTTGTGTTCCGGGAGCAACGGAGCCATTTTTTCCCAGAGTTCATCGCTGATCTGCCACTTGTTACCTGCCATACTTCCCTCAGAAATTGTCATTACTTATTAACTGACAATTCCTTTTGGGATAGGTTCTTAATAAGATGAATAAAAAAACAAGCATCATGGCAGTCAGCTCCTGACAGGCTTTGTTGAATAAATCAGAGTCAGCCGAAGAGGGGGCCAGCAGTACACCCGTTATGCGATCCGGACACTATGCGGCATACCCAACAACGTCATCCGGTTTAGCGCTTTAACCATCGCCATCGCTTCGCCCACCTGCGCGTCCCCACCTGCGCATCGTAGTCTCGCAGGCTCAAGTGTCCGCCCAGCAAGGTTTTGATACGGAACATCGCCGTTTCTGCCACGGAACGTCGGTGGTAGCCTACTTTCTTTTTTCATACATCATTGCCACCGCTCAGATGCTGATTAGCCACCGCATGGTTACGTTCATGGTATCTGCCCGGCCAGTATTGCGCCCCGTTTCGTGGAGGGATAAGTGGCCTGATTTTTTTCCTCAGCAACGCATCATGACAGTTGCGCGTGTCATAAGCGCCGTCTGCTGAAGCTTCTCTTATTTTTCGGTGGGTCTGGTTAATCAGCCCCGGCAGTGCCTGCGCATCTGTCGTACCGCTGAGCGATAAATCGGCACAGATTATCTCATGCGTCACACCGTCTGCAGCCAGATGAAGTTTGCGCCATACTCTGCGCCTGTCAGCCCCATGTTGCCTGACTTTCCATTCGCCTTCCCCGAAGACCT
TCAGGCCAGTAGCATCGATAACCAGATGCGAGATTTCACCCCGGGTTGGCGTTTTTATGCTGATGTTAACGCTCTTTGCATGTTTGCTGACCAGAGAGTAGTCCGGGCAACGCAGCGGCAGAGCCATCAGCGTAAAAATCGAGTCAACGAAGCCCTGTAATGCCCGGAGTGGCAGATTAAACACGCGCTTCATCATCAGAACCGTGGTGATCGCCATATCGGTGTAGTGGAGAGGCCGACCACGACCTTCAGGCTTTGCGCAGTCAGTCCATGCAGCAATAGCAGATTCATCAAGCCAGATCGTCAGCGAACCACGCTGTCTGAGTGCCTTGTTGTAAGCTGACCAGTTGGTGATTTTAAACTTTTGCTTTGCCATGGTGACACGATGGGTAATGCTGCCAACTTACTGATTTAGTGTATGATGGTGATTTTAAGGTGCTTGCGTGGCTTCCATTTCCATCAGATGTCCTTCCTGTTCAGCTACTGACGGGGTGGTGCGTAACGGCAAAAGCACCACCGGACATCAGCGCTATCTCTGCTCTCACTGCCGTAAAACATGGCAACTGCAGTTCACTTACACCGCTTCTCAACCCGGTACGCACCAGGAAATCATTGATATGGCCATGAATGGCGTTGGATGCCGGGCAACTGCCCGCATTATGGGCGTTGGCCTCAACACGATTTTACGTCACTTAAAAAAATCAGGCCACAGTCGGTAACCTCGCGCATACAGCCGGGCAGTGACGTCATCGTCTGCGCGGAAATGGACGAACAGTGGGGATACGTCGGGGCTAAATCGCGCCAGCGCTGGCTGTTTTACGCGTATGACAGGCTCCGGAAGACGGTTGTTGCGCACGTATTCGGTGAACGCACTATGGCGACGCTGGGGCGTCTTATGAGCCTGCTGTCACCCTTTGACGTGGTGATATGGATGACGGATGGCTGGCCGCTGTATGAATCCCGCCTGAAGGGAAAGCTGCACGTAATCAGCAAGCGATATACGCAGCGAATTGAGCGGCATAACCTGAATCTGAGGCAGCACCTGGCACGACTGGGACGGAAGTCGCTGTCGTTCTCAAAATCGGTGGAGCTGCATGACAAAGTCATCGGGCATTATCTGAACATAAAACACTATCAATAAGTTGGAGTCATTACCGAAGACGCAAAAAAAATAATATAATAACAAAAATCCTCCAGCCTTGTAGGCCAGAGGATAAAATACTTAATTGAGGAAAATATCAGGGAAAAGTATTTCTGATTTAACTTTCCCGGCTGGAGAGTGAATCAATGTAATCAGCGTACCATTGCATCATTTCACGGCGGCCATCGAGATACAGGGCATGATTATAGGTACCCCTGATGGAATTTTTATCCACATGAGCTAACTGCATTTCAATCCAGGCGGAATTAAAGCCTTGTTCATGCAGAATTGTACTCATGGTGTGTCTGAAACCGTGCCCTGTGAGTCTTCCCTTATATCCAAGTAATTCAATTACTTGGTTAATGCTTTCCTTACTAATTGGTTTGCGGGGATCATTTCTACCGATAAAGACTAGGGGATGATGTTTTGAGATTGGTTCAAGCTGCTTAAATAGCATAATGACCTGCTCAGAGAGCGGCACAATATGAGGTCGCTTCATCTTCATTACTTCTGCCGGGATCTCCCACAATTTTGTCTCAAAATCGATATCTTCCCAGCGTGCAAAACGCAGTTCTTGTGTTCGTACGCCGGTTAACATAATGATCTGAGTAGCTGTCTTAGTGATTATACTTCCGGTATAACCCGCCAGATCGTTGAGGAAGTGGGGAAGTTCATTAGCAGTAAGAAAGGGAAAATGTACTTTTTTCGGTGTAGCTAAAGCACTGGCAAGATCGGGGGCTGGGTTATAGTCTGCCCGGCCAGTAACAATTGCATAGCGGAACACCTCACCACAGCGCTGGCGGACCTTGCGCATTTTCTCCAGAGCTCCACGTTTTTCCATTTTACGCAGTGCTTCCAG
TAACTCCATTGGTTTGATTTCGGCAATAGGGCGCTTGCCAATGTAGGGAAAAATGTCTTTTTCAAAGGTATCAATGATTTCATCACGATAGCGTAATGACCACCGATCAGCTTTTGACTGGTGCCATTCGCGAGCAATAGCCTCGAACGTATTTTTCAGACTCATTTGCTGCGCGATTTTTTCTTCCTTTTTTGCCTCCCCTGGATCGCCTCCAGCAGCGAGGAGTTTCCTGGCTTCACTGCGTTTCTCGCGAGCATCGGCTAACGTTACATCGGGGTATACGCCAATAGAAAGCATCTTTTCTTTACCAGCAAAGCGGTACTTCATGCGCCAGTATTTAGAACCATTAGTGTTAATCAGTAGATACATACCGCCGCCATCAGCAAGCTTATAGGGTTTGTCTTTGGGTTTTGCGGTTTTCACTTTGATATCGGTTAATGCCAT
Protein sequences of DBSCAN-SWA_7 >NZ_CP040886|2328224:2364590|2339569_2341108_+|WP_000998068.1|transposase|DBSCAN-SWA MNDISSDDIFLLKQRLAEQEALIHALQEKLSNREREIDHLQAQLDKLRRMNFGSRSEKVSRRIAQMEADLNRLQKESDTLTGRVYDPAVQRPLRQTRTRKPFPESLPRDEKRLLPAAPCCPNCGGSLSYLGEDTAEQLELMRSAFRVIRTVREKHACTQCDAIVQAPAPSRPIERGIAGPGLLARVLTSKYAEHTPLYRQSEIYGRQGVELSRSLLSGWVDACCRLLSPLEEALHGYVMTDGKLHADDTPVQVLLPGNKKTKTGRLWAYVRDDRNAGSALAPAVWFAYSPDRKGIHPQTHLACFSGVLQADAYAGFNELYRNGGITEAACWAHARRKIHDVHVRIPSALTEEALEQIGQLYAIEADIRGMPAEQRLAERQRKTKPLLKSLESWLREKMKTLSRHSELAKAFAYALNQWPALTYYADDGWAEADNNIAENALRMVSLGRKNYLFFGSDHGGERGALLYSLIGTCKLNGVEPESYLRYVLDVIADWPINRVGELLPWRVALPTE >NZ_CP040886|2328224:2364590|2331551_2331710_-|WP_001323397.1|DBSCAN-SWA MPGCTSRLLPEGPFSREQAVAVKTAYRNVFTEDDQGTYSRLVIRNAEGQLRW >NZ_CP040886|2328224:2364590|2328224_2329766_+|WP_001298859.1|transposase|DBSCAN-SWA MPTVPISMRKLKEILRLKYGVGLSHRQIGRSLAISPSVVSRYANRAAQLGIKQWPLPTGWDDTKLKHAFLQTQVKMKKHSLPDWATVHRELRNKCVTLQLLWEEYCERNPGGFYSYNHYCRMYREWLKTTSPSMRQVHKAGEKLFVDYCGPTVGVTDPETGEIRTAQVIVAVLGASSYTWAEATWSQQLEDWVMSHVRCFQWLGGVPELVVPDNLKSATSRACKYDPDVNPTYQQMLEHYNVAVLPARPRKPKDKAKAEVGVQVVERWIMARIRHEIFYSLASLNQRIRELLERLNNKIMQKLGYSRAELFIQLDKPALKPLPEASYSYTLVKKVRVHADYHVEIDKHYYSVPCSLLGQQLEAWISGELVRLFNQGQEVAVHPRKRTYGYSTRNEHMPEAHRQHATWTPERLLEWAGHIGSETHSYVLHILNSRPHPEQSYRFCLGLLNLHKKYSKARLNAACARALKTKVWRLSGIKSILEKGLDKQPVQDPKPDLLSTMEHENVRGSEYYH >NZ_CP040886|2328224:2364590|2344820_2345018_-|WP_001545803.1|DBSCAN-SWA MVIRKTVCRDCSNAITHNTECCPSCASTASSGYYRNTDRLLFLLTLLPVLILAVVSGVSVFVLLQ >NZ_CP040886|2328224:2364590|2330584_2331397_-|WP_053909994.1|DBSCAN-SWA MRLASRFGYANQIRRDRPLTHEELMHYVPSIFGEDRHTSRSKRYAYIPTITVLESLQQEGFQPFFACQTRVRDPGRRGYTKHMLRLRRAGEINGEHVPEIILLNSHDGTSSYQMLPGYFRFVCQNGCVCGQSLGEVRVPHRGNVVEKVIEGAYEVVGVFDRIEEKRDAMQSLVLPPPARQALAQAALTYRYGDEHRPVTTADILTPRRREDYGKDLWSTYQTIQENMLKGGISGRSARGKRIHTRAIHSIDTDIKLNRALWVAYYREGDR >NZ_CP040886|2328224:2364590|2346336_2346762_+|WP_032181455.1|DBSCAN-SWA 
MNGIKMGLGITPGEHIISANSALSRNIRHCFCLSCRGRLILQTDAQGAWFEHDLHALSAQQKAACVVLNPEKSHPYIEDMAMFLSPLPVVLEWHCVMCEQFFHGKKFCEACATGIYCRAVCTRSVYSYQPDLFEDCGGAST >NZ_CP040886|2328224:2364590|2338795_2339176_+|WP_001171554.1|DBSCAN-SWA MQKNVTPGRRKGCPNYPPEFKQQLVAASCEPGISISKLALENGINANLLFKWRQQWREGKLLLPSSESPQLLPVTLDAAAEQPESLAEDPETLSISCEVTFRHGTLRFNGNVSEKLLTLLIQELKR >NZ_CP040886|2328224:2364590|2345956_2346340_+|WP_000271020.1|DBSCAN-SWA MYAKSFLALDGNGRLTGARTAQTAPYDRYTCHLCGSALRYHPQYDTERPWFEHTDDGLTAHGQQCPYVRPERREVRLIQRLQQFVPDALPVVRKASWHCRQCHHDYYGERYCTHCQTGRFSEEVVAG >NZ_CP040886|2328224:2364590|2363405_2364590_-|WP_001218908.1|integrase|DBSCAN-SWA MALTDIKVKTAKPKDKPYKLADGGGMYLLINTNGSKYWRMKYRFAGKEKMLSIGVYPDVTLADAREKRSEARKLLAAGGDPGEAKKEEKIAQQMSLKNTFEAIAREWHQSKADRWSLRYRDEIIDTFEKDIFPYIGKRPIAEIKPMELLEALRKMEKRGALEKMRKVRQRCGEVFRYAIVTGRADYNPAPDLASALATPKKVHFPFLTANELPHFLNDLAGYTGSIITKTATQIIMLTGVRTQELRFARWEDIDFETKLWEIPAEVMKMKRPHIVPLSEQVIMLFKQLEPISKHHPLVFIGRNDPRKPISKESINQVIELLGYKGRLTGHGFRHTMSTILHEQGFNSAWIEMQLAHVDKNSIRGTYNHALYLDGRREMMQWYADYIDSLSSRES >NZ_CP040886|2328224:2364590|2348885_2349236_-|WP_000624711.1|DBSCAN-SWA MISLPSGTRIWLVAGVTDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRRGDTVKILWADADGLCLFTKRLEEDQFIWPAVRDGKVSITRSQLAMLLDKLDWRQPKTSSRNSLTML >NZ_CP040886|2328224:2364590|2344454_2344640_-|WP_072153746.1|DBSCAN-SWA MISNSLLSLYSLTFPTLSWFPTTGTYSAFNNLRRANKNHKADNTWRDAVVITERSDDIFPD >NZ_CP040886|2328224:2364590|2331417_2331552_-|WP_071596305.1|DBSCAN-SWA MPEHSLILSSPVKDFSDNKRHSFHLSGRVFSGRRGFFIYYTSKS >NZ_CP040886|2328224:2364590|2329780_2330527_+|WP_053906593.1|DBSCAN-SWA MNHLYEQLTALKLTGFRDALKKQLAQPGTYQELGFEERLSLLTAEELTCRENRKAERLIKHARFRLNAELSKLDYRNNRGLDRALIRSLSQGNWLTLKQNILLTGATGSGKTFLACALGHNACRQGYKVYYYRLKALMEQCYQGHADGRYSKLLTSLNNSDLLLLDDWGLEPLSSEQRSDLLEIVDLMYQRGSIIVVSQLPVENWYKMIGDSTHADAILDRLVHGSIKIELKGESMRKIQSPLTEGDQ >NZ_CP040886|2328224:2364590|2354907_2357025_-|WP_001618956.1|DBSCAN-SWA 
MNDISITDYLGPGVYLLQNYPKETEGLIAEKGYKVHNCADLAQCKDILNRNKVNFLLTNDKDNNFNEYAKIVRTAARQLVNKIVINIFVEKGNGQSFQDFINITDNLGYSIDTVFYLLNPGYDEQFRDDQSLKIVLSYRRQSVVSTDKNILETTIFEKKIVNTLPYIRPGDRVLVIIKNKNSITNIKNIIAEQTKASEVEIYSLDEIKSVQLNGNGYHFLITDKYADDGLNNALKVIISYLVPAGRYVSFHTDKTVVETLSNYNLQPEVYLFYEHGLLKTQIHQGEEITLSPELCVFMKSPLARSELPYQETIYGYSHPPKNLLAFARDYTNPWLIRGIVEFPFRNRSTYHLQQYSHQILEHSAPDSPDYAAALAVLGYQMLSGSDDTADIYAKMLDYCSNVSQMDNPTPHQYRWLISLSTLLGLICNKNNDKANALIHLSRAANSSIDKFSPSIGTKILQSFYLQSVILISLNRISCAEIIVDRGIKRGIQLLYQHPDELVGKISQPFNFVLYIYHDILDWLIKMVNIKNAIPGRKFNIANFDNGNTWSALLHERMNAINNMSQMIDERDRTIHDQKCLIDERDRTIHDQKRLIDERDSTVLTQKNLIDERDLVSAQQNQLIEQNNKTIQQLIQNVTDLNSQVSSKEQKVDELQNQNIKLISLIDEKDLHIAQLSTDLERANTILRKINSTPVIRHLLRMLNIK >NZ_CP040886|2328224:2364590|2331780_2334627_-|WP_060503901.1|DBSCAN-SWA MKRHLNTSYRLVWNHITGTLVVASELARSRGKRAGVAIALSLAAVTSVPALAADTVVQAVETVSGGTLTNHDNQIVLGTANGMTISTGLEYGPDNEANTGGQWIQNGGIANNTTVTGGGLQRVNAGGSVSDTVISAGGGQSLQGQAVNTTLNGGEQWVHEGGIATGTVINEKGWQAVKSGAVATDTVVNTGAEGGPDAENGDTGQFVRGNAARTTINKNGRQIVAVEGTANTTVVYAGGDQTVHGHALDTTLNGGYQYVHNGGTASDTVVNSDGWQIVKEGGLADFTTVNQKGKLQVNAGGTATNVTLKQGGALVTSTAATVLGSNRLGNFTVENGKADGVVLESGGRLDVLEGHSAQKTRVDDGGTLAVSAGGKATGVTMTSGGALIADSGATVEGTNASGKFSIDGISGQASGLLLENGGSFTVNAGGQASNTTVGHRGTLMLAAGGSLSGRTQLSKGASMVLNGDVVSTGDIVNAGEIYFDNQTTPDAVLSRAVAKGNAPVTFHKLTTSNLTGQGGTINMRVRLDGSNASDQLVINGGQATGKTWLAFTNVGNSNLGVATTGQGIRVVDAQNGATTEEGAFALSRPLQAGAFNYTLNRDSDEDWYLRSENAYRAEVPLYTSMLTQAMDYDRILAGSRSHQTGVNGENNSVRLSIQGGHLGHDNNGGIARGATPESSGSYGFVRLEGDLLRTEVAGMSLTTGVYGAAGHSSVDVKDDDGSRAGTVRDDAGSLGGYLNLTHTSSGLWADIVAQGTRHSMKASSDNNDFRARGWGWLGSLETGLPFSITDNLMLEPQLQYTWQGLSLDDGQDNAGYVKFGHGSAQHVRAGFRLGSHNDMSFGEGTSSRDTLRDSAKHRVRELPVNWWVQPSVIRTFSSRGDMSMGTAAAGSNMTFSPSRNGTSLDLQAGLEARVRENITLGVQGGYAHSVSGSSAEGYNGQATLNVTF >NZ_CP040886|2328224:2364590|2343352_2343631_-|WP_072153745.1|DBSCAN-SWA MPVKVVRCFHDHTGWFNNIRRNILMTTPVSLMDDQMVDMAFITQLTGLTDKWFYKLIKDGAFPAPIKLGRSSRWLKSEVEAWLQARIAQSRP >NZ_CP040886|2328224:2364590|2347241_2348855_-|WP_000080195.1|transposase|DBSCAN-SWA 
MSQKYLIRIAELERLLSEQAEALRQKDQQLSLVEETEAFLRSALTRAEEKIEEDEREIEHLRAQIEKLRRMLFGTRSEKLRREVELAEALLKQREQDSDRYSGREDDPQVPRQLRQSRHRRPLPAHLPREIHRLEPEESCCPECGGELDYLGEVSAEQLELVSSALKVIRTERVKKACTKCDCIVEAPAPSRPIERGIAGPGLLARVLTGKYCEHLPLYRQSEIFARQGVELSRALLSNWVDACCQLMTPVNDALYRYVMNTRKVHTDDTPVKVLAPGQKKAKTGRIWTYVRDDRNVGSSSPPAVWFAYSPNRQGKHPEQHLRPFRGILQADAFTGYDRLFSAEREGGALTEVACWAHARRKIHDVYISSKSATAEEALKRISELYAIEDEIRGLPESERLAVRQQRSKVLLTSLHEWMVEKNGTLSKKSRLGEAFSYVLNQWDALCYYSDDGLAEADNNAAERALRAVCLGKKNFMFFGSDHGGERGALLYGLIGTCRLNGIDPEAYLRHILSVLPEWPSNRVDELLPWNVVLTNK >NZ_CP040886|2328224:2364590|2351660_2353691_-|WP_001618954.1|DBSCAN-SWA MNKFKLKTVDVWDTILRRDCHPDFIKRSTAYYFFLKFHSLHNVDSVDQLFTKRVETERLIGKKNKDLGYDDEYLIQDVMLQWVENYISAEQYDTPAIAHELYSWEIEQEKKHIYLDPGIEDFLAQYPADKTIFLSDFYTSSTDLTELLVSAGLDQSVISDGVSSIDERLNKRSGRLFDFIQQKYQLAGVDWIHIGDNEWSDVQMPTSKGIKSIRYLPAQQHQLREQKEFLWNKNEDLTETITNNILNKYAASKDLSVDFQLGLKTTPLIAGFCLKILEQAVISKSEKILFFTREGEFFIKAMNILISHLKTNIKEIKLPEIDIIEVSRLATFAPSLQEISIKEMMRVWNLYSTQSISSLFKTLNVAPETFQSFIDKYGIPADEQIQYPWQDSRIQQLFDDSGFKETLWQHVMQQRALLKNYFATKGLTDDINARICVVDVGWRGTIHDNIALLYPDIHFTGIYLGLQKFLNEQPSNTSKVAFGPDLNHQLEYPHFLDSVAPIEMITNSPSGSVTGYGLENGKIVAIRSVNDDENSAWHNFTKTFQEGILAGMESFSAAVLSYGITHDVVRGYALNIWDVLISGSNKSLTDAFNNLNHNETFGLGGYVKKNHVPTTFEILSSLWNKNNRAALIEFIKANQWSDGIRKRDNLPSLNKYILALTIDLAVFYKRKFYRKY >NZ_CP040886|2328224:2364590|2335969_2336842_-|WP_000241617.1|DBSCAN-SWA MVKVVEDERSRIRYLERRLNENGFYLPSSLADKDYFSYQKRILNTLISQGADTLKINNFLAETDQRYFDSLPSEDDLNWYRNDARASLWLTCELYEMIKINGYENTLTCLSPESLPSHHSVRVDAIRRCIDNWPFILYTPSNYLNQKSIEWTTLLEKDDIFREVKARNFDICSWLKKYIQEKTNISLNYVCGESSEEIMAWCYASYFTWKKNNQNSPDSVELFTRKFKSAWATQKNRIKNRVDKKLRPLNVNISQEAYDKLRKLSINEGISNDRVIESALDMIYRSKIKK >NZ_CP040886|2328224:2364590|2334998_2335871_-|WP_032179701.1|DBSCAN-SWA 
MNPSDAIEAIEKPLSSLPYSLSRHILEHLRKLTSHEPVIGIMGKSGAGKSSLCNALFQGEVTPVSDVHAGTREVRRFRLSGHGHSMVITDLPGVGESRDRDAEYEALYRDILPELDLVLWLIKADDRALSVDEYFWRHILHRGHQQVLFVVTQADKTEPCHEWDMAGIQPSPAQAQNIREKTEAVFRLFRPVHPVVAVSARTGWELDTLVSALMTALPDHAASPLMTRLQDELRTESVRAQAREQFTGAVDRIFDTAESVCVASVARTVLRAVRDSVVSVARAVWNWIFF >NZ_CP040886|2328224:2364590|2353696_2354887_-|WP_001618955.1|DBSCAN-SWA MANIAWFIPQLIEGSGGHRTMLQHAAYLEKMGHTCTIFLEDKSGKTSGAEVIHKLFGLKFKDVIYDWNNAQPCDAAVATIWYSAPFVAALPFDCKRLYFVQDFEAYFNPVGDAYLMAENSYCYGLKPITIGRWLRYKLQNDFAVESYHFEFGANHNIYKRVPTIQKEKAVCFIYQPDKPRRCSRIGIEALGIVKHCMPDVKIYLYGSSINDNLWFEHENLGLLDLPKCNELYNKCSVGLCISSSNPSRIPFEMMNAGLPVVEVWRENNLYDFSEDAMLLCKQTPESIAAGIMKILNDADLAASMSSAGQKFMNGRTLETETEGFYKAFNCVMNEQAYTAEVISRSYNKPPFEAPLARPDIANTLTFAHQSLLKRIALRLPLFVRTPLSALYFKLKN >NZ_CP040886|2328224:2364590|2339172_2339520_+|WP_000612591.1|DBSCAN-SWA MIPLPSGTKIWLVAGITDMRNGFNGLAAKVQTALKDDPMSGHVFIFRGRSGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTQAQLAMLLEGIDWRQPKRLLTSLTML >NZ_CP040886|2328224:2364590|2357029_2358439_-|WP_001618957.1|DBSCAN-SWA MSSDNIAISVSNISKCYNIYQNPRDRLKQFFLPKVQSMVGKPVQRYYNEFWALNNINFTINKGESAGIIGRNGSGKSTLLQLITGTLTPTTGTIQTKGRIAALLELGSGFNPEFTGRENIYLNGAVLGLSRAEIDAKFDAIAGFADIGTHLDQPVKTYSSGMMVRLAFAVQVQLEPDILIVDEALAVGDALFQKRCFQQIEKLTSNGVTLLFVSHDQESVRTLTNRAILLDRGTQLSTGLSSEVLLEYRRLLHREESKYFSQLAKEVAAHADATRDINQTPTAPAKIEISANAETPAVEETSPSRTDKLSFGEGEAEVENVELFTASGERSNAFNPGDKIRVRVNCKVNKDINHINVGIRIRNKEGVKIYSWGTLNQDMYVRFHQLKDPQFWEREFKAGERFYVDMEFECTLGTNLYEIQAAISYEGTPNYFSQRILHWRDEAAFFHVNVLRDEYFFGGILDLKMTAEW >NZ_CP040886|2328224:2364590|2342555_2342753_+|WP_014966159.1|DBSCAN-SWA MAQYDDDEHALQVSMWGRKHHSRILPHLPEATGISYGYRQPLCGMPGARTDPSDPSVLRSASDGR >NZ_CP040886|2328224:2364590|2360620_2361425_-|WP_086937185.1|transposase|DBSCAN-SWA MAGNKWQISDELWEKMAPLLPEHKTHHPLGTHRRRVNNRAAMNAILFVLRTGCQWNALNATGICSSSSAHRRFQEWRDAGVFERFWQNGLLACEQLDAIDWSWLSMDGCMTKSPLAGSKKTGRNPTERGKQGVKRSLMTDANGLPLSLVVAGANTHDIKLVADTLDALQTGRPGRRLHLCMDKGYEAEWLEACLKNRRYEPHIQSRKEESEAIKSTDFKAHRWVVERTHSWMNRFRRVLTLWEKKVENYVAMLYLTCGVIVWNKILLG 
>NZ_CP040886|2328224:2364590|2342655_2343258_-|WP_001387788.1|DBSCAN-SWA MKQQYQTRYEWLHESYQKWLTGFTRHAVSWGVCHPNIYYFHNLTPGWVSFNGEKPEIAIVPQSLHRLIYGPDKRATPPLDDDLIVNLCTSEHLLVHHPMLEGILLSECERLKHHSLANKLISLFRQFGGTELRLKLVWLCWLDLMTGNSLEDWTENLKRKSEKELEEWIIDRQKQSAALTDLMDQYVLLAYRTTVDDNRN >NZ_CP040886|2328224:2364590|2349599_2350616_+|WP_001322394.1|transposase|DBSCAN-SWA MFVIWSHGTGFIMSHQLTFADSEFSSKRRQTRKEIFLSRMEQILPWQNMVEVIEPFYPKAGNGRRPYPLETMLRIHCMQHWYNLSDGAMEDALYEIASMRLFARLSLDSALPDRTTIMNFRHLLEQHQLARQLFKTINRWLAEAGVMMTQGTLVDATIIEAPSSTKNKEQQRDPEMHQTKKGNQWHFGMKAHIGVDAKSGLTHSLVTTAANEHDLNQLGNLLHGEEQFVSADAGYQGAPQREELAEVDVDWLIAERPGKVRTLKQHPRKNKTAINIESMKASIRAKVEHPFRIIKRQFGFVKARYKGLLKNDNQLAMLFTLANLFRADQMIRQWERSH >NZ_CP040886|2328224:2364590|2345221_2345791_+|WP_000148641.1|DBSCAN-SWA MTHEYNPFWQQRIRETVLRALDVHPRLTALRVDLRLPDVPAATDAAVISRFINALKARINAYQKRKRREGKRVHPTTLHYAWAREFGELKGKKHYHLLLLVNRDTWCRAGDYRAPRSLAGMIKQAWCSALGVDAGPYATLAHFPDTSAVWLERDDDTGFQQVLERADYLAKEHTKAHATGERNFGCSRS >NZ_CP040886|2328224:2364590|2359246_2360392_-|WP_001618959.1|DBSCAN-SWA MRIVYFVNAAWYFELHWLDRSEAAISKGYEVHLVSNFADDAIKNNLEKKGIKCWDIELNRFSKNVFKNISIFTAFRKICKQIKPDLIHLITIKPILFGGLYARVAGIPFVVSFVGLGRLFGNKRGWLNKFIFNSVLSIYSLLLSAKSPAQVIFEHNADFAELSKYIKFDENNIHIIEGAGVDRFRFSYQPEPTQNNFSVLFASRLLWSKGLGEVVEAIGKVKQNENNTTLYVAGILDDKDPDRIELDQILQWEKEGKIIWLGRRDDIQNLIRNSHVVILPTKYSEGVPRIIIEACAMGRSCIVGNVPGCKAIIKDNFNGCVLKTHSANEIAEKIEYLRDNMEIRKKFGLRSAEIVKERFSKEIVIDKTLCVYNKILSTTKR >NZ_CP040886|2328224:2364590|2358428_2359241_-|WP_024190230.1|DBSCAN-SWA MIGTLVNISKIIIQKRELIWQMSKRDVMGRYKGSFLGLAWSLFNPILMLTVYTFVFSVIFKSRWGSSPDAGHADFAIILFTGLIIFNLFAECINRSTSLITSNVNFVKKVVFPIEILPIVNLLAALFHASISLIVLFIAILVFKHQLHFTVLLLPVITIPLMLSILGLSWILASLGVFVRDIPQTISIIVTILMFLSPVFYPLSALPVVFQKIVMLNPLAFMIEEARKVVYWGNEPNWTMLAINMLIGLVICAAGFLFFQKTKKGFADVI |
30 | Stx2-converting_phage(45.45%) | integrase,transposase | attL 2323647:2323670|attR 2364890:2364913 |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
3375036 : 3388219
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP040886|3375036:3388219|DBSCAN-SWA TATGCGCATATTGCTGAGTAATGATGACGGGGTACATGCACCCGGTATACAAACGCTGGCGAAAGCCTTGCGTGAGTTTGCTGACGTTCAGGTGGTCGCCCCCGATCGTAACCGCAGCGGCGCTTCAAATTCTCTGACACTGGAATCCTCCCTGCGCACGTTTACCTTTGAAAATGGTGATATTGCTGTGCAAATGGGAACCCCGACCGATTGCGTCTATCTTGGCGTGAATGCTCTGATGCGTCCGCGCCCGGACATTGTTGTGTCCGGAATTAACGCCGGGCCGAATCTGGGGGATGATGTTATTTATTCCGGTACGGTAGCCGCCGCGATGGAAGGCCGTCATTTAGGTTTTCCGGCGCTTGCCGTCTCGCTTGACGGGCATAAACATTACGACACTGCCGCGGCGGTAATCTGTTCAATTTTGCGCGCACTGTGTAAAGAGCCGCTGCGCACCGGGCGTATTCTTAATATTAACGTTCCGGATTTACCCTTGGATCAAATCAAAGGTATTCGCGTGACGCGCTGCGGTACACGACATCCGGCAGATCAGGTGATCCCGCAGCAAGATCCGCGCGGCAATACGCTGTACTGGATTGGCCCGCCGGGCGGTAAATGTGATGCTGGTCCGGGGACCGATTTTGCTGCGGTAGATGAGGGCTATGTCTCCATCACGCCGCTGCATGTGGATTTAACTGCGCATAGCGCGCAAGATGTGGTTTCAGACTGGTTAAACAGCGTGGGAGTTGGCACGCAATGGTAAGCAGACGCGTACAAGCACTTCTGGATCAATTACGTGCGCAAGGTATTCAGGATGAGCAGGTGCTGAATGCACTTGCCGCCGTGCCGCGTGAAAAATTCGTTGATGAAGCGTTTGAACAAAAAGCCTGGGACAATATCGCTTTGCCGATAGGTCAGGGGCAGACAATTTCGCAGCCATATATGGTGGCGCGAATGACCGAATTACTCGAGCTGACGCCGCAGTCGCGGGTGCTGGAAATTGGCACCGGTTCGGGATATCAAACGGCAATCCTGGCGCATCTTGTCCAGCATGTTTGCTCGGTTGAACGGATTAAAGGCTTGCAGTGGCAGGCACGTCGCCGCCTGAAAAATCTTGATTTACATAATGTTTCAACCCGTCATGGCGATGGATGGCAAGGTTGGCAGGCACGTGCGCCGTTTGACGCTATCATTGTTACGGCGGCACCGCCGGAAATTCCAACTGCGCTAATGACGCAGCTGGACGAAGGCGGGATTCTCGTCTTACCCGTAGGGGAGGAGCACCAGTATTTGAAACGGGTGCGTCGTCGGGGAGGCGAATTTATTATCGATACCGTGGAGGCCGTGCGCTTTGTCCCTTTAGTGAAGGGTGAGCTGGCTTAAAACGTGAGGAAATACCTGGATTTTTCCTGGTTATTTTGCCGCAGGTCAGCGTATCGTGAACATCTTTTCCAGTGTTCAGTAGGGTGCCTTGCACGGTAATTATGTCACTGGTTATTAACCAATTTTTCCTGGGGGATAAATGAGCGCGGGAAGCCCAAAATTCACCGTTCGCCGCATTGCGGCTTTGTCACTGGTTTCGCTATGGCTGGCAGGCTGTTCTGACACTTCAAATCCACCGGCACCGGTCAGCTCCGTTAATGGCAATGCGCCTGCAAATACTAATTCTGGTATGTTGATTACGCCGCCGCCGAAAATGGGGACGACGTCTACAGCGCAGCAACCGCAAATTCAGCCGGTGCAGCAGCCACAAATTCAGGCTACTCAACAACCGCAAATCCAGCCAGTGCAGCCAGTAGCTCAGCAGCCGGTACAGATGGAAAACGGACGCATCGTCTATAACCGTCAGTATGGGAACATTCCGAAAGGCAGTTATAGCGGCAGTACCTATACCGTGAAAAAAGGCGACACACTTTTCTATAT
CGCCTGGATTACTGGCAACGATTTCCGTGACCTTGCTCAGCGCAACAATATTCAGGCACCATACGCGCTGAACGTTGGTCAGACCTTGCAGGTGGGTAATGCTTCCGGTACGCCAATCACTGGCGGAAATGCCATTACCCAGGCCGACGCAGCAGAGCAAGGAGTTGTGATCAAGCCTGCACAAAATTCCACCGTTGCTGTTGCGTCGCAACCGACAATTACGTATTCTGAGTCTTCGGGTGAACAGAGTGCTAACAAAATGTTGCCGAACAACAAGCCAACTGCGACCACGGTCACAGCGCCTGTAACGGTACCAACAGCAAGCACAACCGAGCCGACTGTCAGCAGTACATCAACCAGTACGCCTATCTCCACCTGGCGCTGGCCGACTGAGGGCAAAGTGATCGAAACCTTTGGCGCTTCTGAGGGGGGCAACAAGGGGATTGATATCGCAGGCAGCAAAGGACAGGCAATTATCGCGACCGCAGATGGCCGCGTTGTTTATGCTGGTAACGCGCTGCGCGGCTACGGTAATCTGATTATCATCAAACATAATGATGATTACCTGAGTGCCTACGCCCATAACGACACAATGCTGGTCCGGGAACAACAAGAAGTTAAGGCGGGGCAAAAAATAGCGACCATGGGTAGCACCGGAACCAGTTCAACACGCTTGCATTTTGAAATTCGTTACAAGGGGAAATCCGTAAACCCGCTGCGTTATTTGCCGCAGCGATAAATCGACGGAACCAGGCTTTTGCTTGAATGTTCCGTCAAGGGATCACGGGTAGGAGCCACCTTATGAGTCAGAATACGCTGAAAGTTCATGATTTAAATGAAGATGCGGAATTTGATGAGAACGGAGTTGAGGTTTTTGACGAAAAGGCCTTAGTAGAAGAGGAACCCAGTGATAACGATTTGGCCGAAGAGGAACTGTTATCGCAGGGAGCCACACAGCGTGTGTTGGACGCGACTCAGCTTTACCTTGGTGAGATTGGTTATTCACCACTGTTAACGGCCGAAGAAGAAGTTTATTTTGCGCGTCGCGCACTGCGTGGAGATGTCGCCTCTCGCCGCCGGATGATCGAGAGTAACTTGCGTCTGGTGGTAAAAATTGCCCGCCGTTATGGCAATCGTGGTCTGGCGTTGCTGGACCTTATCGAAGAGGGCAACCTGGGGCTGATCCGCGCGGTAGAGAAGTTTGACCCGGAACGTGGTTTCCGCTTCTCAACATACGCAACCTGGTGGATTCGCCAGACGATTGAACGGGCGATTATGAACCAAACCCGTACTATTCGTTTGCCGATTCACATCGTAAAGGAGCTGAACGTTTACCTGCGAACCGCACGTGAGTTGTCCCATAAGCTGGACCATGAACCAAGTGCGGAAGAGATCGCAGAGCAACTGGATAAGCCAGTTGATGACGTCAGCCGTATGCTTCGTCTTAACGAGCGCATTACCTCGGTAGACACCCCGCTGGGTGGTGATTCCGAAAAAGCGTTGCTGGACATCCTGGCCGATGAAAAAGAGAACGGTCCGGAAGATACCACGCAAGATGACGATATGAAGCAGAGCATCGTCAAATGGCTGTTCGAGCTGAACGCCAAACAGCGTGAAGTGCTGGCACGTCGATTCGGTTTGCTGGGGTACGAAGCGGCAACACTGGAAGATGTAGGTCGTGAAATTGGCCTCACCCGTGAACGTGTTCGCCAGATTCAGGTTGAAGGCCTGCGCCGTTTGCGTGAAATCCTGCAAACGCAGGGGCTGAATATCGAAGCGCTGTTCCGCGAGTAAGTAAGCATCTGTCAGAAAGGCCAGTCTCAAGCGAGGCTGGCCTTTTCTGTGCACAATAAAAGGTCCGATGCCCATCGGACCTTTTTATTAAGGTCAAATTACCGCCCATACGCACCAGGTAATTAAGAATCCGGTAAAACCGAGAATGGTCGTTAACACTGTCCAGGTTTTCAGACCGTCTGCTACCGACAACCCCAGATATTTGG
TCACAATCCAGAACCCTGAGTCATTAATATGTGACGCGCCAAGCCCACCAAAGCAGGCTGCCAGCGTCACCAATACGCACTGAATCGGATTCAATCCCATCACCGCTTCTGAGAGTAACCCGCCGGTTGTCAGGATTGCTACGGTTGCTGACCCCTGCGATGCACGCAGAGCCAGTGAAATAATAAATGCGGCTGGTAACAGAGGCAGGTCAATCATTTGTAGCATATTGGCAAGGGCTTTGCCGACGCCCGATTCCACCAGCACTTTGCCAAATACCCCTCCAGCACCAGTAACCAAAATCACTACCGCCGCAGTGGGAAGGGCTGAGCCCATAATGTCGCTGGTGTGTTGTAAGCTCCAGCCGCGACGTAAAGCCAATAACCAGAATGCCAGCACCAGCGCAATCATTAGAGCTACCATTGGTGAGCCGATCAGCTGTAGCGTACCAAGCAGGGGATGCGAAGGCGGCATCAGTGTTGCGGAAACCGTACCCGCCATGATAATCGCGATAGGAATAACAATTAGCGAGGTGACCAGCGCGACGCCCGGTGGATTTATTTTATCGCTTAATTTTGTCGCGCCTTCCTCACTGGCCGGAGCCAGTTGCATCTGTTCCAGTACTTCCACCGACATCGCATATTGGTGCTTATTGATTATTTTCGCTGCAAAGTAGCCAACAATCCCTACGGGAATAGAAATCGCAATACCGATGATGGTTAGCCAGCCGATGTCTGCGTGGAGTAACCCCGCTGCGGCGACAGGGCCTGGATGCGGCGGTACCGCCACATGTACAGTTAGCATGATCCCAGCGACAGGCAGGCCAAATTTGAGTGGCGATATTTTGGCAACCTTGGCAAAACCGTAAATGATTGGCGCAAGAATAATAAAGCCGACATCAAAGAAGACGGGAATACCGAGGAAGAACGCTGCCAGAGTCAGCGCAGCGATAGTTCGTTTGTCACCTAACTTGCGACTGAAATAATTAGCCAGTGACTCTGCACCACCAGAGTGTTCGATCATACGCCCCAGCATAGCGCCCAGACCAATAATAATAGTGACGGAACCAAGCACACCGCCCATCCCGGCGATCATCACTTTACCCACTTCGCCCGCCGGTATACCCGCCGCAAGTGCGACTAACAGGCTGACGAGGAGCAGAGCAACGAATGGTTGTACCTTTGCCTTGATGACCAGCAGCAACAGCATGATTACGCCAGTTAACGCAATGCATAACAATGTAATTGTGGACATGGGAAACCCTGTCTGAAAGTTATAGTTAACCTACCCCATCCGTAGATGGGGGGATGTATGGGTACGTTGTAATTAGGGATTTAACGAATTAGCGCCAGGCGTCAAACCAGCCAAGCCCTTCTTCGGTGAGGCCACGAGGTTTATATTCACAACCGATCCAGCCCTGATATCCCACCTCATCGAACAGGCGGAACAGCCACGGATAGTTGATTTCTCCATCGTCCGGTTCATGTCGATCAGGTAGTCCGGCAATTTGTACGTGCGCATATTTCCCGGCGTAGTCGCGGATTAAATGCGTCAGGTTGCCATCTACTTTTTGCGCATGAAAAGTATCTAGTTGAATAAACACGTTATCTCGCGCAACCTCTTCAACAATAGCCAGTGCCTGATACTGGCTGGAGAAGAGATAATGAGGCTTAACGTCGGGGCTGAGTGCTTCAACTAATATTCGCTTGCCGTGTAGCGCAAAGCGGTCGGCAGCGTAGCGGAGATTATCGATAAATACTGCCCGGTACCGTTCAGCGTCTTCGCCAGCGGGCACGACGCCTGCCATCACATGGACTTGTTCACAATTGAGCGCCAATGCATATTCCAGTGCCAGGTCGATGTCTGCGTGTGCTTCGTGCTCACGTCCGGGAAGGGCGGATAATCCCCATTCCCCCGCATTAATATCTCCGGGAGCGGTATTGAACAGCGCCAGTGTCAGATGGTTTTGCTCCAGTTGCTTTTGGATTTGCAGGGTGGAGTA
GTCATAGGGAAACAGAAATTCCACAGCATCGAACCCGGCTTTTCGCGCTGCGGCGAAGCGTTCAATAAAAGGCACTTCGGTGAACATCATGGATAAATTAGCTGCAAAACGAGGCATTGCATTAACTCCTTAATTCCGCAATTTCACCTGCGGTCAGATAACGGATCGGGCGGTCACCGAGAATAAAAATCAGCTTTGCCGTTTCCTCCAGCTCTTCCATATTGTTGGCGGCTTCTTGCAGGCTTTCACCGCAAACCACTGGGCCATGATTTGCCAGTAAAAAAGCCTGATTGTCTGCTGCCAGTTCCGCCAGATCCTGTGCGATGCGTTTATCGCCCGGTCGGTAATAAGGCACCAGCGGGACATTTCCCATCCGCATCACCACGTATGGTGTGAACGGACGAATAACGTTGCTGCTGTCCAGCCCTTGCAGGCAGGAAAGCGCCGTCGACCATGTGCTGTGCAAATGCACCACCGCTTTACAGCGCGGATTGTTGCGATACAGCGCCAGATGAAAGAGCACCTCTTTCGAGGGTTTGTCACCACTTAACCATTCGCCATCCGCGGCGACTTTGGAAAGCCGCTGCGGATCGAGATTGCCCAGGCATGAACCTGTCGGTGTCGCCAGTAAATTCCCGTCAGGTAAAAGCAGCGACAGATTGCCAGCCGAACCGGTTGCATAGCCGCGCTGAAAGAATGAACTGGCAATCCGCGTCATCTCCTCTCGCAAAGACTGCTCTACTTTTGCGAAATCGCTCATGATAAAAACTCTCTTTGGGCTCGTGAAAAAAAGGCTTCATCACCGAAGTTGCCAGATTTAAGGGCGAGTGAGACAGGCTTATCCAGTGCGTTTACCCACGGCACGCCGGGGGAAATGGTTGGGCCAATATGAAACCCTTTTATGCCCAGGCTCTGTGTGACTACGCCGGAGGTCTCACCGCCTGCGACAATAAAGCGTGTCACGCCTTCCGCTGCTAACCGCGCCGCTAGTTGAGAAAACAGAGTTTCTACTGCCTGACTGGCTTTTTGTGCACCGTATTGCTGTTGAATTGCTGCCAATGCGTCAGTGCTGGCGGTGGCAAAAACCAGTGGAGCAAGTACACTTTCCTGGCCCAGAACCCACTCTGCCAGTTCGTGTGCATAAGCGGCCAGAGTTTCGGTTGAGAGGCAGCGTGCCACATCAACTTCACGGGCTGGTGCAATTTGACGGTAATGTGCCACCTGGCGGTTGGTCATTTGAGAGCATGAACCGGAGAGCACTACGCCGCGCCCAGCGAGCGGATGCCCTGCTTCGCGAGCCTGGTTACCGTTTTCTTGCGCCCACTGCCGGGCCAGGCCAATCGCCAGACCAGAACCGCCCGTTACCAGTGGGGCATCGCGCAAGGCTTCTCCCTGAATTTCCAGATGGTGTTCGGTCAGCGCATCAAGCACCGCGTAGCGGTAGCCCTCTTGCTGTAAGCGAGCCAGCTCTTGACGAACGGCATCCACACCTTGTTCGAAAACATGTGCCGAAACGACGCCGCAGCGCCCTGTGGATTGCGCTTCAACCAGACGGGGAAGATAGCTGTCGGTCATGGGATTTACCGGGTGATGGCGCATCCCGGATTCGGCCAGCAGTTGATTCATTACGAACAAATACCCCTGATAAACCGTACGTCCGTTGACCGGCAGGGCCGGAGAGAAGACCGTAAACGGCGTGTCGAGAGCATCCATTAATGCATCGGTAACCGGGCCGATATTACCTTTCGCCGTACTGTCGAAAGTAGAGCAGTATTTGAAATAGATCTGTTTGCAACCTTGCTGTTGCAACCAGCTCAGAGCCGCCAGCGATTGCTGTGTGGCTTCAACCACCGGACAGGAGCGCGTTTTCAGGCTGATCACCAGTGCGTCGATTGCTTCCGGCATTTTACCTGTTGGAACACCGTTAATTTGTACCGTTGGTAGACCGTTTTCCACCAGAAAACTGGCGATATCCGTCGCGCCGGTAAAATCATCGGCGA
TAACGCCAATCTTGATCATGATTTCGCTCCCGGTAGAGTGATGCCAGAGAAAATCTTGATAACTGCGCTATCGTCTTCTTTCCCGTAACCTGCGTTACTGGCGCTGGTGAACATATTCAATGCTGTTGAGGCCAATGGCAGCGGGAAGTGCAGGGCTTTGGCTGTATCGGCAACCAGACCAAGATCCTTAACAAAAATATCGACGGCTGAATGCGGGGTGTAATCGCCATCCACCACATGACGCATCCGGTTTTCGAACATCCAGGAATTTCCGGCGGCATTGGTCACGACGTCATACATCACATCCAGCGGGATCCCCGCACGGGCTGCAAGTGCCATCGCTTCGGCTCCGGCAGCAATATGTACGCCCGCTAACAACTGGTGAATAATTTTTACGGTCGAACCTAGTCCCGGTTCTGCACCTATGCGATAAACTTTTCCGGCAACGGCTTCCAGCACGGGTGCCAGTCGTTCAAAGGCAATATCGCTACCGGAGGCCATGACAGTCATTTCACCGTTAGCGGCTTTTACTGCACCACCAGAAACTGGCGCATCCAGCATTTCCAGATCGAATCCAGCCAGAGCGGTAGCAATTTCTTGCGCATCAGCACTAGCGATAGTGGAAGAAACCATTACTGCCGTACCGGGTTTCAGATGTTGTGCAACGCCTGTTTCACCAAACAGCACCTGTTTAACCTGGGCCGCATTGACCACCAGCACCAGCAGAGCGTCCAGTTTTTCGGCAAACGTCGCGGCGTTATCAGAAACCCCGCAAGCACCTGCCTCTTTCAACGTAGCGCAGGCATTGCTGTTCAGGTCTGCGCCCCAGGTAGAAAGACCTGCGCGGACACATGACAGTGCTGCTCCCATTCCCATTGACCCTAAGCCAACGATACCGACATGAAACTCAGATCCCGTTTTCATCTGCTCTCCTTGTTAATTTAAGTGATATTTTGTTTGATATTGTGAATATAAGCGCTGGAAGATAACGATATGGTGAGCTGATTCACATAAATTAACATTGTGTGTTATTTTATGTGAACTAAGCGTTAGTTGCGCCGCGCACGTTTCGCAGGCAAATAGCGTAGAATGTCAGCAGGACAAAGGAAGGAGCAAAAGTTGATACCCGTAGAGCGTCGCCAAATCATCCTTGAGATGGTAGCTGAAAAAGGCATTGTCAGTATTGCTGAACTAACGGACAGAATGAATGTGTCACATATGACCATTCGTCGGGATTTACAAAAACTGGAGCAGCAGGGAGCCGTTGTGCTGGTGTCCGGAGGCGTCCAGTCTCCGGGACGCGTGGCGCATGAACCTTCTCATCAGGTAAAAACTGCGCTGGCAATGACGCAAAAAGCGGCTATTGGCAAGCTGGCGGCAAGTCTTGTTCAGCCGGGAAGCTGTATCTATCTGGATGCGGGAACGACCACGTTAGCGATAGCACAGCATCTGATTCACATGGAGTCACTGACTGTGGTCACAAACGATTTCGTTATTGCGAACTACTTGCTCGACAACAGTAATTGCACAATTATTCACACTGGCGGTGCAGTGTGTCGGGAAAACCGTTCCTGTGTCGGGGAAGCCGCTGCGACCATGCTGCGCAGCCTGATGATTGATCAGGCTTTTATTTCTGCATCGTCATGGAGTGTGCGGGGGATTTCTACGCCAGCGGAAGATAAAGTCACGGTGAAACGGGCGATTGCCAGTGCCAGCCGCCAGCGAGTTTTGGTCTGTGATGCGACGAAATATGGTCAGGTGGCGACATGGCTGGCGTTACCATTAAGCGAGTTTGATCAGATTATCACAGACGACGGTTTGCCGGAGAGTGCCAGTCGCGCGCTGGCGAAGCAGGATCTCTCTTTGCTGGTAGCGAAAAATGAATAATGGCCTGCAATAACATTTGGTTACTCATGCTTCACAGAAGAAGCATGAGACTACTTTATTTTATAAAATGACAGCCGCCCGCTTTTCGGCGTGCCGGTATCAATATAAATCTG
GTTAGCGAACGTCTGAATGTTATCAAACATCATATGTCCAAATATAAAATAATCAGCGCCGTTTATTTGTTGTAACTCGCCATTAAGCGATTTCTGCACACGATCAACAGGCCAGAGTAATTCGCTCTCCGCTATTTCTTTACCAAAGAGATATTCACTCCCCGGATAATCTGCATGTGCGATGACATATTTTATGTTGTCGTTAGTGATTTCAATAATATGTGGAAGGTGATGGAATTTCAGCAACAGATCTATTGCCTCTTGTTGCTCTGAATCATTTAAATCGAAAAACCAGTCACCACCGCTGGCAAGCCACATATTGCCATCGCCAGTTTCGAATGCATCCAACGCCATCGCTTCGTGGTTGCCTTTAACCGACGTAAACCAGGGTTGGTTTAGCAGGCGCAGCACGTTAAGACTCTTCGGCCCACGATCAATATTATCGCCTGTGGAAATAAGTAAGTCGGTTTCAGGGTAAAAAGAGAGTTGATGTAAACGGGATTGTAATAACTGATATTCACCATGAATATCACCAACGACCCATATATGGCGATAGTGATGGGCATTGATTTTTTGATAGCGTGTAGATGGCATGGTTTTACCCTGTAAAATAAGCTTTCCTATTATACAGGGTATTTTTATTTGATTCGTCAGTTGTCGTTAATATTCCCGATAGCAAAAGACTATCGGGAATTGTTATTACACCAGACTCTTCAAGCGATAAATCCACTCCAGCGCCTGACGCGGAGTCAGTGAATCCGGGTCGAGATTTTCCAGTGCTTCGACCGCAGGCGAAGTTTCTTCTGGCACTGACAGCAAAGACATCTGCGTACCATCCACTTGCGTCGCGGCGGCGTTCGGCGAAATGCTTTCCAGCTCACGCAGTTTTTGCCGTGCGCGCTTAATCACCTCTTTTGGCACGCCCGCCAGAGCTGCAACCGCCAGGCCGTAGCTTTTGCTCGCCGCGCCATCCTGCACGCTATGCATAAAGGCAATGGTGTCGCCGTGCTCCAGCGCATCGAGATGCACGTTGGCGACGCCTTCCATTTTCTCCGGTAACTGGGTCAGCTCGAAATAGTGGGTAGCAAATAACGTCAATGCCTTAATCTTATTCGCCAGATTTTCCGCGCACGCCCACGCCAGCGACAGACCATCGTAGGTGGACGTTCCACGCCCGATCTCATCCATTAACACCAGACTGTATTCGGTGGCGTTATGTAAAATATTGGCGGTTTCAGTCATCTCCACCATAAAGGTTGAGCGCCCGGAAGCCAGGTCATCTGCCGCGCCTACGCGGGTAAAGATGCGATCGATAGGTCCAATCTCGACTTTTTGTGCCGGTACATAGCTGCCGATGTAGGCCATCAGCGCAATCAGTGCGGTCTGGCGCATATAGGTACTTTTACCGCCCATGTTCGGGCCGGTAATAATCAACATACGGCGCTGCGGCGACAGATTCAGCGGGTTAGCGATAAATGGCTCATTCAGTACTTGTTCAACTACCGGATGGCGACCTTCGGTAATGCGAATGCCCGGTTTATCAATGAAGGTCGGGCAGGTGTAGTTCAGGGTATAGGCCCGTTCCGCCAGGTTAACCAGCACGTCGAGTTCCGCCAGCGCGCTCGCGCTCTGTTGCAACGCTTCCAGATGCGGCAACAGCAGGTCGAACAGCTCTTCATAAAGCTGTTTTTCCAGTGCCAGTGCTTTGCCTTTTGAGGTGAGAACTTTATCTTCGTACTCTTTTAGCTCTGGAATGATGTAGCGCTCGGCGTTTTTCAGCGTCTGGCGACGCATGTAGTTGATGGGTGCCAGATGGCTTTGCCCACGGCTGATTTGAATGTAGTAGCCGTGCACCGCATTAAAGCCAACTTTCAGCGTGTCCAGGCCGGTACGTTCACGCTCGCGGACTTCCAGACGCTCCAGATAATCGGTCGCGCCGTCAGCCAGCGCGCGCCACTCATCCAGCTCTTCGTTATAGCCCGATGCGATAACACCACCGT
CGCGTACCAGCACCGGCGGTGTGTCGATGATTGCTCGCTCCAGCAGATCGCGCAGCTCGGCAAACTCGCCCATCTTCTCACGTAGCGCCTGTACCGGTGCACTATCGACAGTTTCTAACTGCGCACGCAGCTCCGGCAGTTGCTGGAAAGCGTGGCGCATACGGGCCAGATCGCGTGGGCGAGCAGTTCGTAAAGCCAGACGTGCCAGAATACGTTCCAGGTCGCCGACCTGACGCAGTACCGGCTGCAACTCGGCGGTGAAATCCTGCAATGCGCCAATAGTTTGCTGGCGCTCAAGCAACACGCGGGTATCGCGCACTGGCATATGCAGCCAGCGTTTCAGCATACGACTACCCATCGGCGTGACGGTGCAGTCGAGCACAGAAGCCAGCGTATTTTCCGCACCGCCCGCCAGGTTCTGAGTAATTTCCAGGTTACGACGTGTCGCGGCATCCATAATGATGCTGTCCTGCTCACGTTCCATGGTGATAGAACGAATATGCGGCAGGGTCGTGCGTTGGGTATCTTTCGCATACTGCAACAGACAACCGGCAGCACAAAGTCCGCGCGGCGCGTTCTCGACGCCAAAACCGACCAGATCGCGGGTGCCAAATTGCAGATTCAACTGCTGGCGCGCGGTGTCGATTTCAAACTCCCACAGCGGGCGACGGCGCAGGCCGCGACGGCCTTCAATCAGCGACATCTCGGCGAAATCTTCTGCATACAGCAGTTCCGCCGGATTAGTGCGTTGCAGCTCTGCCGCCATCGTTTCGCGGTCGGCCGGTTCGCTCAGACGAAAACGCCCGGAGCTGATATCCAGCGTCGCGTAGCCGAAACCTTTGCTGTCCTGCCAGATAGCCGCCAGCAGGTTGTCCTGACGCTCCTGTAACAGGGCTTCATCGCTGATGGTGCCTGGCGTAACGATACGCACAACTTTGCGCTCAACCGGACCTTTGCTGGTCGCCGGATCGCCAATTTGTTCGCAGATGGCAACGGACTCTCCCTGATTCACCAGTTTGGCGAGATAGTTTTCCACCGCATGGTAGGGAATCCCCGCCATCGGGATCGGCTCTCCCGCCGAAGCACCGCGTTTGGTCAGTGAAATATCCAGCAGTTGCGACGCGCGTTTTGCGTCGTCATAAAACAGTTCATAAAAATCACCCATCCGGTAAAACAGCAGGATCTCGGGATGCTGGGCTTTCAGCCTGAGATACTGCTGCATCATGGGCGTATGGGCGTCGAAATTTTCTATTGCACTCAT
Protein sequences of DBSCAN-SWA_8 >NZ_CP040886|3375036:3388219|3381714_3382977_-|WP_000590392.1|DBSCAN-SWA MIKIGVIADDFTGATDIASFLVENGLPTVQINGVPTGKMPEAIDALVISLKTRSCPVVEATQQSLAALSWLQQQGCKQIYFKYCSTFDSTAKGNIGPVTDALMDALDTPFTVFSPALPVNGRTVYQGYLFVMNQLLAESGMRHHPVNPMTDSYLPRLVEAQSTGRCGVVSAHVFEQGVDAVRQELARLQQEGYRYAVLDALTEHHLEIQGEALRDAPLVTGGSGLAIGLARQWAQENGNQAREAGHPLAGRGVVLSGSCSQMTNRQVAHYRQIAPAREVDVARCLSTETLAAYAHELAEWVLGQESVLAPLVFATASTDALAAIQQQYGAQKASQAVETLFSQLAARLAAEGVTRFIVAGGETSGVVTQSLGIKGFHIGPTISPGVPWVNALDKPVSLALKSGNFGDEAFFSRAQREFLS >NZ_CP040886|3375036:3388219|3384077_3384845_+|WP_001297141.1|DBSCAN-SWA MIPVERRQIILEMVAEKGIVSIAELTDRMNVSHMTIRRDLQKLEQQGAVVLVSGGVQSPGRVAHEPSHQVKTALAMTQKAAIGKLAASLVQPGSCIYLDAGTTTLAIAQHLIHMESLTVVTNDFVIANYLLDNSNCTIIHTGGAVCRENRSCVGEAAATMLRSLMIDQAFISASSWSVRGISTPAEDKVTVKRAIASASRQRVLVCDATKYGQVATWLALPLSEFDQIITDDGLPESASRALAKQDLSLLVAKNE >NZ_CP040886|3375036:3388219|3381079_3381718_-|WP_001278994.1|DBSCAN-SWA MSDFAKVEQSLREEMTRIASSFFQRGYATGSAGNLSLLLPDGNLLATPTGSCLGNLDPQRLSKVAADGEWLSGDKPSKEVLFHLALYRNNPRCKAVVHLHSTWSTALSCLQGLDSSNVIRPFTPYVVMRMGNVPLVPYYRPGDKRIAQDLAELAADNQAFLLANHGPVVCGESLQEAANNMEELEETAKLIFILGDRPIRYLTAGEIAELRS >NZ_CP040886|3375036:3388219|3380298_3381075_-|WP_001136918.1|DBSCAN-SWA MPRFAANLSMMFTEVPFIERFAAARKAGFDAVEFLFPYDYSTLQIQKQLEQNHLTLALFNTAPGDINAGEWGLSALPGREHEAHADIDLALEYALALNCEQVHVMAGVVPAGEDAERYRAVFIDNLRYAADRFALHGKRILVEALSPDVKPHYLFSSQYQALAIVEEVARDNVFIQLDTFHAQKVDGNLTHLIRDYAGKYAHVQIAGLPDRHEPDDGEINYPWLFRLFDEVGYQGWIGCEYKPRGLTEEGLGWFDAWR >NZ_CP040886|3375036:3388219|3376557_3377697_+|WP_001272592.1|DBSCAN-SWA MSAGSPKFTVRRIAALSLVSLWLAGCSDTSNPPAPVSSVNGNAPANTNSGMLITPPPKMGTTSTAQQPQIQPVQQPQIQATQQPQIQPVQPVAQQPVQMENGRIVYNRQYGNIPKGSYSGSTYTVKKGDTLFYIAWITGNDFRDLAQRNNIQAPYALNVGQTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPTATTVTAPVTVPTASTTEPTVSSTSTSTPISTWRWPTEGKVIETFGASEGGNKGIDIAGSKGQAIIATADGRVVYAGNALRGYGNLIIIKHNDDYLSAYAHNDTMLVREQQEVKAGQKIATMGSTGTSSTRLHFEIRYKGKSVNPLRYLPQR 
>NZ_CP040886|3375036:3388219|3377759_3378752_+|WP_000081550.1|DBSCAN-SWA MSQNTLKVHDLNEDAEFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGATQRVLDATQLYLGEIGYSPLLTAEEEVYFARRALRGDVASRRRMIESNLRLVVKIARRYGNRGLALLDLIEEGNLGLIRAVEKFDPERGFRFSTYATWWIRQTIERAIMNQTRTIRLPIHIVKELNVYLRTARELSHKLDHEPSAEEIAEQLDKPVDDVSRMLRLNERITSVDTPLGGDSEKALLDILADEKENGPEDTTQDDDMKQSIVKWLFELNAKQREVLARRFGLLGYEAATLEDVGREIGLTRERVRQIQVEGLRRLREILQTQGLNIEALFRE >NZ_CP040886|3375036:3388219|3375036_3375798_+|WP_001374723.1|DBSCAN-SWA MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFENGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLDGHKHYDTAAAVICSILRALCKEPLRTGRILNINVPDLPLDQIKGIRVTRCGTRHPADQVIPQQDPRGNTLYWIGPPGGKCDAGPGTDFAAVDEGYVSITPLHVDLTAHSAQDVVSDWLNSVGVGTQW >NZ_CP040886|3375036:3388219|3385657_3388219_-|WP_001272924.1|DBSCAN-SWA MSAIENFDAHTPMMQQYLRLKAQHPEILLFYRMGDFYELFYDDAKRASQLLDISLTKRGASAGEPIPMAGIPYHAVENYLAKLVNQGESVAICEQIGDPATSKGPVERKVVRIVTPGTISDEALLQERQDNLLAAIWQDSKGFGYATLDISSGRFRLSEPADRETMAAELQRTNPAELLYAEDFAEMSLIEGRRGLRRRPLWEFEIDTARQQLNLQFGTRDLVGFGVENAPRGLCAAGCLLQYAKDTQRTTLPHIRSITMEREQDSIIMDAATRRNLEITQNLAGGAENTLASVLDCTVTPMGSRMLKRWLHMPVRDTRVLLERQQTIGALQDFTAELQPVLRQVGDLERILARLALRTARPRDLARMRHAFQQLPELRAQLETVDSAPVQALREKMGEFAELRDLLERAIIDTPPVLVRDGGVIASGYNEELDEWRALADGATDYLERLEVRERERTGLDTLKVGFNAVHGYYIQISRGQSHLAPINYMRRQTLKNAERYIIPELKEYEDKVLTSKGKALALEKQLYEELFDLLLPHLEALQQSASALAELDVLVNLAERAYTLNYTCPTFIDKPGIRITEGRHPVVEQVLNEPFIANPLNLSPQRRMLIITGPNMGGKSTYMRQTALIALMAYIGSYVPAQKVEIGPIDRIFTRVGAADDLASGRSTFMVEMTETANILHNATEYSLVLMDEIGRGTSTYDGLSLAWACAENLANKIKALTLFATHYFELTQLPEKMEGVANVHLDALEHGDTIAFMHSVQDGAASKSYGLAVAALAGVPKEVIKRARQKLRELESISPNAAATQVDGTQMSLLSVPEETSPAVEALENLDPDSLTPRQALEWIYRLKSLV >NZ_CP040886|3375036:3388219|3378845_3380210_-|WP_000104456.1|DBSCAN-SWA 
MSTITLLCIALTGVIMLLLLVIKAKVQPFVALLLVSLLVALAAGIPAGEVGKVMIAGMGGVLGSVTIIIGLGAMLGRMIEHSGGAESLANYFSRKLGDKRTIAALTLAAFFLGIPVFFDVGFIILAPIIYGFAKVAKISPLKFGLPVAGIMLTVHVAVPPHPGPVAAAGLLHADIGWLTIIGIAISIPVGIVGYFAAKIINKHQYAMSVEVLEQMQLAPASEEGATKLSDKINPPGVALVTSLIVIPIAIIMAGTVSATLMPPSHPLLGTLQLIGSPMVALMIALVLAFWLLALRRGWSLQHTSDIMGSALPTAAVVILVTGAGGVFGKVLVESGVGKALANMLQMIDLPLLPAAFIISLALRASQGSATVAILTTGGLLSEAVMGLNPIQCVLVTLAACFGGLGASHINDSGFWIVTKYLGLSVADGLKTWTVLTTILGFTGFLITWCVWAVI >NZ_CP040886|3375036:3388219|3375791_3376418_+|WP_000254708.1|DBSCAN-SWA MVSRRVQALLDQLRAQGIQDEQVLNALAAVPREKFVDEAFEQKAWDNIALPIGQGQTISQPYMVARMTELLELTPQSRVLEIGTGSGYQTAILAHLVQHVCSVERIKGLQWQARRRLKNLDLHNVSTRHGDGWQGWQARAPFDAIIVTAAPPEIPTALMTQLDEGGILVLPVGEEHQYLKRVRRRGGEFIIDTVEAVRFVPLVKGELA >NZ_CP040886|3375036:3388219|3384895_3385552_-|WP_001141340.1|DBSCAN-SWA MPSTRYQKINAHHYRHIWVVGDIHGEYQLLQSRLHQLSFYPETDLLISTGDNIDRGPKSLNVLRLLNQPWFTSVKGNHEAMALDAFETGDGNMWLASGGDWFFDLNDSEQQEAIDLLLKFHHLPHIIEITNDNIKYVIAHADYPGSEYLFGKEIAESELLWPVDRVQKSLNGELQQINGADYFIFGHMMFDNIQTFANQIYIDTGTPKSGRLSFYKIK >NZ_CP040886|3375036:3388219|3382973_3383882_-|WP_000847985.1|DBSCAN-SWA MKTGSEFHVGIVGLGSMGMGAALSCVRAGLSTWGADLNSNACATLKEAGACGVSDNAATFAEKLDALLVLVVNAAQVKQVLFGETGVAQHLKPGTAVMVSSTIASADAQEIATALAGFDLEMLDAPVSGGAVKAANGEMTVMASGSDIAFERLAPVLEAVAGKVYRIGAEPGLGSTVKIIHQLLAGVHIAAGAEAMALAARAGIPLDVMYDVVTNAAGNSWMFENRMRHVVDGDYTPHSAVDIFVKDLGLVADTAKALHFPLPLASTALNMFTSASNAGYGKEDDSAVIKIFSGITLPGAKS |
12 | Escherichia_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotations match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_9 |
3746219 : 3790394
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >NZ_CP040886|3746219:3790394|DBSCAN-SWA GTTAACCCAGTAGCCAGAGTGCTCCATGTTGCAGCACAGCCACTCCGTGGGAGGCATAAAGCGACAGTTCCCGTTCTTCTGGCTGCGGATAGATTCGACTACTCATCACCGCTTCCCCGTCGTTAATAAATACTTCCACGGATGATGTATCGATAAATATCCTTAGGGCGAGCGTGTCACGCTGCGGGAGGGGAATACTACGGTAGCCGTCTAAATTCTCGTGTGGGTAATACCGCCACAAAACAAGTCGCTCAGATTGGTTATCAATATACAGCCGCATTCCAGTGCCGAGCTGTAATCCGTAATGTTCGGCATCACTGTTCTTCAGCGCCCACTGCAACTGAATCTCAACTGCTTGCGCGTTTTCCTGCAAAACATATTTATTGCTGATTGTGCGGGGAGAGACAGATTGATGCTGCTGGCGTAACGACTCAGCTTCGTGTACCGGGCGTTGTAGAAGTTTGCCATTGCTCTCTGATAGCTCGCGCGCCAGCGTCATGCAGCCTGCCCATCCTTCACGTTTTGAGGGCATTGGCGATTCCCACATATCCATCCAGCCGATAACAATACGCCGACCATCCTTCGCTAAAAAGCTTTGTGGTGCATAAAAGTCATGCCCGTTATCAAGTTCAGTAAAATGCCCGGATTGTGCAAAAAGTCGTCCTGGCGACCACATTCCGGGTATTACGCCACTTTGAAAGCGATTTCGGTAACTGTATCCCTCGGCATTCATTCCCTGCGGGGAAAACATCAGATAATGCTGATCGCCAAGGCTGAAAAAGTCCGGACATTCCCACATATAGCTTTCACCCGCATCAGCGTGGGCCAGTACGCGATCGAAGGTCCATTCACGCAACGAACTGCCGCGATAAAGCAGGATCTGCCCCGTGTTGCCTGGATCTTTCGCCCCGACTACCATCCACCATGTGTCGGCTTCACGCCACACTTTAGGATCGCGGAAGTGCATGATTCCTTCTGGTGGAGTGAGGATCACACCCTGTTTCTCGAAATGAATACCATCCCGACTGGTAGCCAGACATTGTACTTCGCGAATTGCATCGTCATTACCTGCACCATCGAGCCAGACGTGTCCGGTGTAGATAAGTGAGAGGACACCATTGTCATCGACAGCACTACCTGAAAAACACCCGTCTTTGTCATTATCGTCTCCTGGCGCTAGCGCAATAGGCTCATGCTGCCAGTGGATCATATCGTCGCTGGTGGCATGTCCCCAGTGCATTGGTCCCCAGTGTTCGCTCATCGGGTGATGTTGATAAAACGCGTGATAACGATCGTTAAACCAGATCAGGCCGTTTGGATCGTTCATCCATCCGGCAGGTGGCGCGAGGTGAAAATGGGGATAGAAAGTGTTACCCCGGTGCTCATGAAGTTTTGCTAGTGCGTTTTGCGCCGCATGCAATCGAGATTGCGTCATTTTAATCATCCTGGTTAAGCAAATTTGGTGAATTGTTAACGTTAACTTTTATAAAAATATAGTCCCTTACTTTCATAAATGCGATGAATATCACAAATGTTAACGTTAACTATGACGTTTTGTGATCGAATATGCATGTTTTAGTAAATCCATGACGATTTTGCGAGAAAGAGGTTTATCACTATGCGTAACTCAGATGAATTTAAGGGAAAAAAATGTCAGCCAAAGTATGGGTTTTAGGGGATGCGGTCGTAGATCTCTTGCCAGAATCAGACGGGCGCCTACTGCCTTGTCCTGGCGGCGCGCCAGCTAACGTTGCGGTGGGAATCGCCAGATTAGGCGGAATAAGTGGATTTATAGGTCGGGTCGGTGATGATCCTTTTGGGGCGTTAATGCAAAGAATGCTGCTAACTGAGGGAGTCGATATCACGTATCTGAAGCAAGATGAATGGCACCGGACATCCACGGTGCTTGTCGATCTGAACGAT
CAAGGGGAACGTTCATTTACGTTTATGGTCCGCCCCAGTGCCGATCTTTTTTTAGAGACGACAGACTTGCCCTGCTGGCGACATGGCGAATGGTTACATCTCTGTTCAATTGCGTTGTCTGCCGAGCCTTCGCGTACCAGCGCATTTACTGCGATGACGGCGATCCGGCATGCCGGAGGTTTTGTCAGCTTCGATCCTAATATTCGTGAAGATCTATGGCAAGACGAGCATTTGCTCCGCTTGTGTTTGCGGCAGGCGCTACAACTGGCGGATGTCGTCAAGCTCTCGGAAGAAGAATGGCGACTTATCAGTGGAAAAACACAGAACGATCAGGATATATGCGCCCTGGCAAAAGAGTATGAGATCGCCATGCTGTTGGTGACTAAAGGTGCAGAAGGGGTGGTGGTCTGTTATCGAGGACAAGTTCACCATTTTGCTGGAATGTCTGTGAATTGTGTCGATAGCACGGGGGCGGGAGATGCGTTCGTTGCCGGGTTACTCACAGGTCTGTCCTCTACGGGATTATCTACAGATGAGAGAGAAATGCGACGAATTATCGATCTCGCTCAACGTTGCGGAGCGCTTGCAGTAACGGCGAAAGGGGCAATGACAGCGCTGCCATGTCGACAAGAACTGGAATAGTGAGAAGTAAACGGCGAAGTCGCTCTTATCTCTAAATAGGACGTGAATTTTTTAACGACAGGTAGGTAATTATGGCACTGAATATTCCATTCAGAAATGCGTACTATCGTTTTGCATCCAGTTACTCATTTCTCTTTTTTATTTCCTGGTCGCTGTGGTGGTCGTTATACGCTATTTGGCTGAAAGGACATCTAGGGTTGACAGGGACGGAATTAGGTACACTTTATTCGGTCAACCAGTTTACCAGCATTCTATTTATGATGTTCTACGGCATCGTTCAGGATAAACTCGGTCTGAAGAAACCGCTCATCTGGTGTATGAGTTTCATCCTGGTCTTGACCGGACCGTTTATGATTTACGTTTATTAACCGTTACTGCAAAGCAATTTTTCTGTAGGTCTAATTCTGGGGGCGCTCTTTTTTGGCCTGGGGTATCTGGCGGGATGCGGTTTGCTTGACAGCTTCACCGAAAAAATGGCGCGAAATTTTCATTTCGAATATGGAACAGCGCGCGCCTGGGGATCTTTTGGCTATGCTATTGGCGCGTTCTTTGCCGGCATATTTTTTAGTATCAGTCCCCATATCAACTTCTGGTTGGTCTCGCTATTTGGCGCTGTATTTATGATGATCAACATGTGTTTTAAAGATAAGGATCACCAGTGCGTAGCGGCGGATGCGGGAGGGGTAAAAAAAGAGGATTTTATCGCAGTTTTCAAGGATCGAAACTTCTGGGTTTTCGTCATATTTATTGTGGGGACGTGGTCTTTCTATAACATTTTTGATCAACAACTTTTTCCTGTCTTTTATGCAGGTTTATTCGAATCACACGATGTAGGAACGCGCCTGTATGGTTATCTCAACTCATTCCAGGTGGTACTCGAAGCGCTATGCATGGCGATTATTCCTTTCTTTGTGAATCGGGTAGGGCCAAAAAATGCATTACTTATCGGTGTTGTGATTATGGCGTTGCGTATCCTTTCCTGCGCGCTGTTCGTTAACCCCTGGATTATTTCATTAGTGAAGCTGTTACATGCTATTGAGGTTCCACTTTGTGTCATATCCGTCTTCAAATACAGCGTGGCAAATTTTGATAAGCGCCTGTCGTCGACGATCTTTCTGATTGGTTTTCAAATTGCCAGTTCGCTTGGGATTGTGCTGCTTTCAACGCCGACTGGGATACTCTTTGACCACGCAGGCTACCAGACAGTTTTCTTCGCAATTTCGGGTATTGTCTGCCTGATGTTGCTATTTGGCATTTTCTTCTTGAGTAAAAAACGCGAGCAAATAGTTATGGAAACGCCTGTACCTTCAGCAATATAGACGTAAACTTTTTCCGGTTGTTGTCGATAGCTCTATATC
CCTCAACCGGAAAATAATAATAGTAAAATGCTTAGCCCTGCTAATAATCGCCTAATCCAAACGCCTCATTCATGTTCTGGTACAGTCGCTCAAATGTACTTCAGATGCGCGGTTCGCTGATTTCCAGGACATTGTCGTCATTCAGTGACCTGTCCCGTGTATCACGGTCCTGCGAATTCATCAAGGAATGCATTGCGGAGTGAAGTATCGAGTCACGCCATATTTCGCTATCAGGATTCTGTGTGATGGTTACATCGCCCGGCCCAGGGCTGTTTAGTCATCAGCGCTTTCTGACAGTGCTGAGATTTCAACCTGTTGCAGTAAAAATGAGTAGATATAAGGCAAGTGTGCTGCCAAACCCATCTTTTACGGGGTGAAGGTAGATTTCGTTTGAAGGGTATCTGGTGTCCCTGCAGACATCTACTTGACGCGGCAGGGGATTGATTAGAATGGTGTTTTTTAGATGTGAGAAATATTTTACCCGCTATTTTACCCATTGGCGCGGCTTAAGAGCTTATTTTTGAATTCACAATGGTCACGATATAACCATCTTGCTCGCCCGTGGATAACTTTGGCTTTAGGCAGGTCTCCGGACTTAATCCGGTCATATATGAAGGTTTTACCGAAGCCAGTATCAGCCATGATGAATTTCAAATCAACCAGGGAATCAGGCTGTAGTTCGTGTTGCATGAGTGCTATCTCCGAATAGGGAATCGAACCTGCAAATCAGGTAATAAAAAAACCGCATTGATGCGCCGATGGTAGGTCTGGATATCTTGATAAATGAAAATGCCTCATCGAGTGTGAGGCGGGTTAATCCTTGCGTAGCTCGCTGATTCTTCTGTAAGTCTCTGGTGCTTTGTTCCCGTACGTCTTCATTTCAGACTTCAACAGAGCAACGAGTGAATCCCATTCGTTGAGGATTCCTTTGAATGCCGGAACGCGCTTTGCAACCTTGTCGAATGAATCTCTGATTTCTGGAATCTGCTCAACAAGTGCAACGCATCGCCGAAAGTCTGCTGCGTCATGTGGAGCACCGAAGCTATGACCATAGATATTCTTTTTCAGTCCACATGCGATTGAGGCAAGAGTTGCGCTACTGATGCCAACATCGCCAGTCGATTGCCATTTCAAAACCTTCATAGCCAAATCTGACATTTCTTGTCTCCAATAAAAAACCGCCATCAGGCGGCTTGGTGTTCTTTCAGTTCTTCAATTCGAATATTGGTTACATTGTTTTCATATATGAATAAATAAATTAGCTTTTTCCGTTGCCTTCGCGATCTTTATTAATTTTGACAAACTCGTTTTTACCACGCTCTCCAAATGCGTCTTTAGAGTCGTTGTATCCGCAATCGCAGCACACATAATCACCAGACCATCCACGCATTGTTTTTTCTTTTGCAATATTTCCAGAACCGCATTTTGGACAAGACATATCACTACCTCCAAAGCATGAGTGAGATGACAACGTAACATTGATTGGAGATTAACAATAGATTGCTGATGTAAAAGATATGTATAAGCTTCGCTTTCAAAGTGGAGGCTCTGGTAGCGGCATCCAGTGAGTTACGTCATCCAAGATATTTCCTGATAAATACGTGAAAGCTCTATATTTTTTGTAATCAATTGGATTTACAACCCAGTTCCAATATGCGGCCACGATTTCACCTTGACTAAATGCCAGTAACATTTTGGTGTCTTCCGGCATTCGATCACTACAGCTTATCCAACCATCCTTAATGCAATTACCTTGACTCTGAAGCATAGCGGCACGACAGGCGTTCCATCCGACAGCTTTTCCGTGTTCAAACGCGCTGTCAAAGTCATCATCCATTTCCATCGCAGCAGGCACAGATACCGGCGCTGGCGGTGCGGTGTAAACGGCCACGTGAGTTGAACTGTATCGATAGGCTTCACGGTCATTCGACAACCCGCAACTGCCTGTTTTCTCATATGCCGCTTTATGCACATATCCATATGGCTCCGCTTCGAGCGAT
GCCAGCGCGATACGCGCCAGCTCAGCCGCTTCGTCATCTTGAACGCAATACCATGCGTTTCCAGATGCTAATTGTTCCAGACGTTCTTTGGGAATAGTGCTCATGATGCCTCTCCTTTACCTGCTTCGGTGACGTTTATTCCAGCGTTGTGCAGTGCCTCAAGAACCTGATGCTGCCTGTAAACCATTTCAGTGTGGTAAGGCTCGTCAAAATCGACGCGATGCAACATGCTATAGCATTGCGGAAGCACAACCTCCCGCAACTGCAGCTCAGCAATGCGCTTCTCTGCGGCTTCCAGCTCAACACGCAGTTTCCCTACCGTTAGCGCAATTTCCTCGTTCTCCTGATCGCGGAGTTTGATGTATTGCTGGTTCCTTTCCCGTTCATCCAGCAGTGCCAGCGCGATATCTGGCGAAAAGTGCTTCATAAAATCGTTAAGCGCATTAATTCGCTGATCGAAAGGCATTACAGGTGCTTCACCAGCAATTTTTGTTTTTTCAGCGATTTCACGAAGCTTTTGATAATCAATCTTGCTCACTGCTTGACTCCTTTACGCCACATCGCATTCAGATATTTGTTGTCATTAACAGAACCGAAACTCTTCCTTTTAAGCAATTCCTCTCTCGATGGCATTGGCTTTACGCGTTGGCGAATAATCATTTCTGCCGGAAGAATGCCGGGATTGTATGCAAGTCCTCTCATGGTAAATTCCTCAGTCATTACTGATAGCGCCATAGCGTGAGCGGTAATTACGCAGGCGCGGGTCGATATATTCAGGGAAGTGGGTATATATGGATTTGCGGAATGGTCGGATTGATGTTTCGTTTATTCGGTCTTTTTCCTGTTTTTCTGCGAGTTGTATATCGCGTCGGTACTTCCGTTCTTCTTTTGTTTCCGGTGGCAGAGCAAGAAACGCGTCGAGATTATTCTTGATATTTTCCAGCACCTCCGATATGGAATTGCCGGAACAGCGGCGCGGGTCGTCCGCACCATATAGAGGCGCTGGCATGGTTTTCTCCTGTTGATTATTTAGCTAACTTTTTCCAGATCGCTGAAACGTATTTGGCTTGGTGAATGGCATCATCAAGCGCGTTGTGGCGAGTTCCTTTGAATGGCATATCTCGCTTAGGGTCGAATCCTATTACCTTCCCAAGTTCGACGATTGTTCTTACGTCGCGGTCATTCCACCACTGCCACGGAACTGGCTGCCCTGTCAGTGAATAACTGTTTCGGAGAATAACGCAGTCAAATGATGCTCCATTCCCCCAAACCTGAACGAATTTGTGGTTAGCGTTCTTTATGATGAATTCAGATAACCATGAAAGAGCCGTTGAAAGCTCTTGAGTGTTGCTGGTTAGCGATTTTCTGGCTTCTTCACTCTGTTCCATCCACCATAAAATCGTTGAAGCGTCAGGACGCGCCCGATATCGCATTGATGACTCAAGCGAGATATTTACCGAGAACTCTTCTCCTGTTTCTCCGGTATTCGGGTCAAAGAATACCGCACCAATAGAAATAACTGGCGCATATGGCCCGTTGCCCATTGTTTCAAGGTCAACCATTAAGTGATTCATGTAAGTCCTTAAATTGCGTGAATAGCGTGACGAGGGAAGGGGAGAGTTACTGGTTCCTCGTCTGGGTAGATAGGTTTGTTATGTTTGTGCCACTCGACATGACATGACTTGCAGAGCCACATCACATCGGTTGGTTTGCTGTAGTCGCAGTGGTGCGCCTGTGGTTTACATTCTGATCCGCAGCACTCACATTGTGGTGGTCGGATTAGCTTACCGTCGCGCAAAAAATTACCCACAATGATGTGGGCTTTTCTTTTCCATGGGTTGCTCTGAATGAACCGCTTTTTGGCTGCGTTACACCGTTCTCTTCCGCGTTCCGATGATTGATATTCTCTCCTTGCTGATACTCGATGTGGCAATCCAGCGCGTTCTTTGTCGTATTCAGCCAGGCAAGCCCGGCAAGCGGCAGTTAATCCATCTCTGGAT
GCTCTTCTGATTTGAAAGTCCCTTTCTTCCTTCTGTTGATGGCATCTTGAGCAGACTTTCATATTCAGCTCCTAGAACGGAATATCCGAATCGTCGAAGTTCATAGGTGGTTCGCTGTGATTCCCCTGCTGCTGAGATTGCTGTCTTTGTTGCTGACCGTTATTTCGCTGATGTGAAGACTGTTCATTGCCTCCTTGTTTGCCACCAAGCATTTGCATGGTTCCACCAACGCCCACGATGACTTCGGTAGTGAACCGATCCTGTCCACTTTGATCCTGCCATTTTCTTGTCCGCAATTTGCCTTCAAGATAAACCTCAGAGCCTTTTCGCAGATATTCGCTGGCAATTTCTGCCAGTTTCCCGCTCATTACCACGCGGTGCCACTCCGTCTGCTCCTTTTGCTCTCCAGTTTGCTTATCACGCCATTGTTCTGACGTAGCAACTGTAAGGTTTGCAAATGCCGTTCCTGATGGTGAATATCTGATTTCTGGATCATGCCCAAGGCGACCAATAATGATCACCTTATTTACGCCTCTGCTTGCCATTTATGCCGCCTGTTTTAGTTCGTTAACTCTGATGTTCATTACCTGAACGCATTTAGCCTGCGCATCCTCATTGCCAGCCATTAATTGCCAGTCATGCTGATAACGCTCGATGAGTTTTTTCTTGTCAGTTTCTTTCGATGCAAAATCGCTGAAGTCTTTCAGGATTTGTTCGTAGTCAACCGATGGAGATTTCTGGTTGGTATTTTCTGGTGATGGTTTGTTATCTGATGCTGGGATTGCCCATCCCGGCAGCGATGGAGGGAGCCAGTAAAATCCTGTTCCATCCTTGAGTTTTGCCCTGTGCCATCCCTGCTTTTTATCGAGAGATGTTTGTGCAAAACCTTCCTCAAGGTTATACAGATACCGACCGATTCCCCACTGAACGGCAGCGCGCTTCATTGCACCGGAACGACCGCCTTTGACGGCTTCTACCTGCGTGTTTTCAGCAGCATCCCATTTGGTTACCCATTCGGAATCAATCTTGATTGATATGCCGCATTCAACTCCGCCGTTGTTGGGAATATCGCGGTATTCATTGCGCCATCCTGCTTTGCCGCAAACATCGTCCAGGCGCTTCATGATTGCCCGGTTCGTGACATAAGCCAGCACCATAGCCCACACCTTGCCATCGCGTGTTTTACCGCTTTGCTGTATTCGCCATTCGATATCTTCAGGGCTGAATGGCTCATCGAATTTGTTCAAATCCATAATTCACCTCAGAATGGACACGGCCCAAGGAAATAACGCTGATTTAATACTTCGACTCGGGACAAATTAAGGCATACCCGCATTCCTTCGCGGTCACCATTATGGCGATACCAGAGAGCTTTCTGCGTGTACATGCGTCTCTGTAACTTGCTCTCCTTCACTGTGGTTGCAAGTGACATGAATATCTCCTTCGTTACCGATTAATTCTTTCATCTGACGAATGAATTCTTCGTCTGACCAGTTATCTGTAAAACTCATTTCCTGCGATACCACGGAAGGTTGATAGCTGATTTCATAGCTTTATTTGCTTCAAGCCACATTTTTGAATCACCAATAAATCTGGCTATTACTGCTTTGTTCTGTGCTGCACGAAGCATCTGGTGATTAATGGCTATTTCATTGCGCATAACGCCTCCAGTTGTTTCTTTGCTGCTCTGATTAATTGTTTAACTCGGCGTGATAATTCAGATTCGTGCGGGTAGAAAGCGGACATGACGCCGCTACCCGCGAGCTGAAAGTGCATCATGGGTAACTCCTTATATTTGATTGCATAACGAAAACGCCTAGAGTGAAGCGTTATTGGTATGCGGTAAAGCCGCGCTTAGGCGGCTGATGTTTCTTCTTTCAGGCTTTCGAGATATTTACGTGGGTCGTCGTAACATTGGCATTCGCTGTACCAATCCACCCAGCGATCAGTAAGCCCCATCTCTGATAAATCTTCATCGGTAAGGCTCTCATCCCACATCT
CAAGGCCGTTAGCATTGCAGTAATCAGGCTTGATGTTGTTGTCATACTGAAAGGCGTCATAATCAGCCAGTGCGTCCATCAGGCGAACACCCTCTTCAACACTTGCCACTTCTGCAATGAACGGTTTCATAGGTACTTGCGGGATATGCCAGACACGTAATTTCATATTTCCTCCGTCAAAAAAATTGCCCTCACACTGGAGGGCAAAGAAGATTTCCAATAATCAGAACAAGTCGGCTCCTGTTTAGTTACGAGCGACATTGCTCCGTGTATTCACTCGTTGGAATGAATACACAGTGCTTACTCGTACTAATAAAATACCCAATTTTCTGTTTCTTGGTTGTGCCCAAAGTTATATTCAATATCTGGTGTTGATGTATCAATATTCTTCATCCCATCAACAAGAGTTGATACAACAGCCAAATCTTGTTTGATTCTCATTAAATGGTATTTCTTCCGGCGCAATAAACTTTCAATGGCAAGTTTCTTCGTTGGGAATGCAAAAGATCTTTCTGCATTTTTTGCTACTTTCTTAATTGCATATCTATTTCTCCTTTGTTTCCATTCCTGTAACCACTGATTTGGTGCTGGTTTAAAATTAACAATCCAATGCGCAGGAACCAACCATGCATAATGCTCTGTCTGATGAAAAGCTATATATTGAAGTGCGAATATTTTTATCCCATCTTCTTCAACTGTCGCCTGGAATCTCCAGAAAACAGGCATTCCATCATGTTCAGTTTCTGATTCAGGAAAAGGTACGCTCCATGATTTTGTCATATCTCACCTCAAATAAGTGGTTTGCTGCCTAATTTCATTTTCTGGCGACCAACACAAGTCACACCCATTTCACTGCGTGGCTTGCGGTAGTAAATACGGTTCTGTTTACGCTCGACTTCATCTGCCTTCTTGCAGCGAAGGCTTACGAGTGATGCTGCTTTGTCTGCTCTGACGCAACCAGAGAGCTTTAGCGCAATCTTTCGCGCCAGTCGCTGTTCTTGCATTGCCTGTTCACGTTGAGCCTGTCTGCGTGCTCTGCGGCGATTTCTGGCGTTATCGTCAGCCAGATATGTAATGACTACTGTCATGTTGACCTCCGATGATTGACTTTGGTGATTGGATGGCCGGTGCTGAATTCCGGCTTACTGGTTAGAGCGCCCGCACTACCAGTGACGCTGTCTTGAGGCGCAGATTGGTTACTGCTTGCCATGAGCGCTGTTTATACATTGGTCGAGCATCAGCCTGCTCATTCATCCAATCCCAAAGCCAACTACTCTTTGGTTCCCGCATTTCGGCGGGACAATCCCATCAATGTTAAAGAGCCTGCCAATCTATTCCGTTTGGCTACCAGCGTCCTGCTGATGGCTTAAATTTAAGATCTCTTTAATTAATGGTCAAGAATATTTTTGAAGAAAACTTAAATTTTCTTTCGTGACTTAAGTTTGGCTTTGATTTTTAAAGGAAATAAAAAAAAAGGGGCGAATGCCCCCTTATGGAAGGTTTGCTAGTTTTGCATCGACAACTACGCCGATGATTTTGCAGTTTCCGTTGATCTCGATCATCGGATATTGTGGGTTAAGTGGTTTTAGAAACTTCCTGCCTGCATCAATAACTAACTTCTTGAAAGTTGCCTCGTTTTCTCCTTCGAGCTTTGCAACTACCAGTTTCCCGTTACGCGGCTCTACTTCAGGATCGACGAGTATTATCATTCCTTCAGGGATACTGAGACCGGCCGGAGCCGTCATTGAGTCTCCCTTCACGTCCAACCAAAACGAATCTTCTGAACAGTCTACGGTTGTATCGTACCAGTTATCTATTGCACGCTTATGATATGGTTCTACAGCTTCCATCCAGCATCCTGCGCTCACCCAGCTAATCAGAGGGTATGACCCTCTTGGATCATGCCTACTGTGATAGGCAATGTTTGAAAGACTTTCCTCTCCTTTCATCAGATAGTCAGGGGAACACTTCAACGCATTAGCCAGGGCGAGAAGATTCTC
TCCATTTGGCTCTGTCTCAGAGCGTTCCCACTGAGATATGGCAACATTAGACACGCCGACCATCTTTCCAAGTGCGGCCTGCCTGATCTTGAGTTCTTTTCTCCGAGCGCGAATGCGCTCTCCCATCAATTGAGTTTTCATAGTTAAGACATCTTAAATAAACTTGACTTAAGATTCCTTTGGTGGATAATTTAAGTGTTCTTTAATTTCGGAGCGAGTCTATGTACAAGAAAGATGTTATCGACCACTTCGGAACCCAGCGTGCTGTCGCTAAAGCGTTAGGCATTAGCGACGCAGCAGTCTCTCAGTGGAAGGAAGTCATCCCAGAGAAAGACGCCTATCGACTGGAAGTCGTTACAGCTGGCGCCCTGAAGTATCAAGAAAGCGCTTACCGCAAAGCGGCATAAGCAAATTGCTCTTTAACAGTCATGGTCCTCATTCCCGCCGAAATGCGGGAATACAACGCACATAAGTTGATGCGTATAACTTCTTATTTGTTAAGGAAATACTTACATATGCAACTTACAAGTACTCGCAAGAAAGCGAATGCAATTACAAGCAACATCCTGAATCGAATTGCTGTACGTGGTCAGCGAAAGGTTGCTGATGCATTAGGGATTAATGAATCGCAAATTTCGCGATGGAAAGACAGCTTCATCCCAAAAATGGGAATGCTTCTGGCTGTTCTTGAATGGGGTGTTGAAGACGAGGAGTTGGCGGAACTGGCTAAGAAAGTAGCCAGAATGCTGACAAAAGAAAAAGCCCCGAAGAACGGCGAATTCTTCGAGGCCTGATGTAGAAAGACTGGATCAATCCACAGGAGTAATTATGACAAAACGTCGTAAGAAATACCAGGAAAAAGAAGAGATTCGACACCCTGATTCACCTGAGGGATTAGTGGTAGCCGCAGCAAATAACAGGGCGTTCGCAGAGCGCCTTGTTGGTGTTTACAGACTAGCCAAAGCAGGAGTGAAACATGGGCGTCGTTAAGTTAGCTGATTACAGGCATAACCCTGTACAACATCAGGAGGCATCCAGTATGGGGTATGTCTCTATACACCGCCAGTTTATGGACAGCAGGCTCTATAAGGACTCTCAGGCAGTACATCTTTGGCTTCACTTAATCCTCAAGGCTAATCACGAATCTACTGTCGTCAATACGGATATCGGTCCGATAACTGTTGATCGCGGTCAGATGATAACTGGACGCCCGTCGCTGGTCAGAGAAACATTCATTCCCGACAACAAAGTTCGGAGCTTATTACGGACTTTTGAGTCGAAAGGGATGCTTAATATTTGCTCGATGGGGAAGAAATTTAGCCTGTTTACAATCGTTAAATATGACGATTTTCAGGCAAAAAATTGTCCAACGGTTGTCCAACGGTTGTCCAACGCAAACACCAGTAATGGCGCGGCTCTCAGTGGAGATTGTCCAACGGTTGTCCAACGGTTGTCCATAAACAATAATATAAATAATATCTCTAATACTGACGTATTAGAGAGTGCCACAGCAGACAAAAAGTCTGACAAGAAAAAACCTTCCGTTAGCTGTCAGGATGTTGTCGATGCTTACCACGAAATCCTTCCTGAAGCGCCAAGAATCCGCGCACTGAATGACAAGCGTAAAAACCAGATCCGAACGTTCTGGCGCAAAGCCGGAGTGATAACCCGCCAGCTTGACGGACATGGGTTCACGATGCAGGACTGGAGAAATTATTTGAGCTACGTAGGCGAAAATTGTCGATGGATGTTCGAAGAGCGTCCAAACCATCAACGTGGAACTGTCTGGCACAAAAAGGGATTTGATTTCCTGCTTAACGATAATACCTACCTGAAAGTTCGTGAGGGTGAACACGATGACCGATAATTTTTATGCGCCGCCCCATAGCATCGAGGCAGAGCAGGCGGTGATTGGTGGATTGCTTCTGGATGATGACAGCAGTGAGCGCGTCCAGAAAGTTCTGGCGATGCTGAAGCCTGATTCATTTTACAGCCGACCA
CACAAAATCCTTTTCGAAGAAATAACCAGAATGCACCGGGAGCAAAAGCCAGTAGATGGCCTGACGCTTTTCGATGAACTTGAGCGTAAATCGTTAACGGCGTCTGTTGGAGGTTTTGCTTATATCGCTGAGATCGCAAAGAACACGCCAAGTGCAGCAAACATCGTTGCCTATGCAATGCAGGTTCGTGAAACCGCAATGGAACGCTACGCCATCAACCGCATGACTGAAGCGACGGAATTGCTCTATTCCCGCAACGGAATGACTGCAACGCAGAAGTACGAAGCTATTCAGGCGATTTTCACGCAACTGACAGACCATGCAAAAACCGGATCGCGTCGTGGCCTTCGCTCATTTGGTGAGGTCATGGAAGACTGGGTTAGCGACCTTGAGAAGCGATTTGACCCGTCAGGCGAACAGCGAGGAATGAGCACAGGGATCCCATCGCTGGACAGGATGCTGTCACCGAAAGGTCTGGTGAAAGGCTCTCTGTTTGTCATTGGCGCTCGCCCTAAGATGGGGAAAACGACGCTATACAGCCAGATGGCAATCAACTGCGCAGTGCATGAGAAAAAGCCCGCTCTGATGTTCAGCCTTGAAATGCCAGGCGACCAGATACTGGAAAAACTGGTAGGACAGAAGTCAGGTGTTAACCCGAATATTTTTTACCTTCCGGCGACAAATGACGCTGATGACGGCTATCAGGGTGATTACGATGGTGACTTCAACAGGGCGATCGAAACAGCCAATCGCTTGAGTGAAATCGACCTGCTTTACATCGACGACACGCCGGGATTATCTCTGGCTCAAATCGTCAGCGAAAGCCGTCGAATCAAGCGAGAAAAAGGATGTGTTGGCATGATTCTGGTCGATTACCTGACACTAATGACCGCTGAGAAGGCCGATCGCAACGACCTTGCTTACGGCATGATCACCAAAGGACTGAAGAACCTTGCCAAAGAGCTTGATTGCGTTGTTGTGCTTCTGACACAGCTTAACCGCGCACTGGAAAGCCGAACCAATAAACGCCCATTACCAAGTGACTCACGAGATACAGGGCAGATTGAACAGGATTGCGATTATTGGGTGGGGATCCATCGTGAAGGCGCTTTTGATGACAGCGTTCCTCCTGGTGAAACCGAACTAATCCTTCGCCTCAATCGTCATGGCAATACCGGCACGGTGTATTGCATTCAGGCAAATGGCGCTATTTATGACACAGACCAACAGTCTGCTGAAATGCGCCGCCGTGAACGCGAGGAACCGCAGTCCAAGAAGAAAGGAGGATTCTGATGACCATCTACATCACTGAGCTAATAACAGGCCTGCTGGTAATCGCAGGCCTTTTTATTTGGGGGAGAGGGAAGTGTGGCTGACTGGCAAATTCCAATCATCATTCTTGCCGGAGCTTCGCTGGTTGCTGGCTTTATCCTGCTGAAAAAGCATAAAGACCGTGATCAAAAAGTCGAAGTTCTCTATGGGTATCCAGCGAACAGCACAACATGGCTGACCATTTACCACTACCGAAAATCAGGCCGCTGGGTATTCGAATGGGATGATCTGTTTGCTGAAAAGCGACCAAAGTCATGGGGAGACATCAGCGAATGCATGATGTTTGAAGAAAGAAAATCCGGCGCAACCCGAGAAGAGTTTAACGAAGCGTGGGCGCGATTAAGTGAGAGAGGGTATTTGTGAGCAAGTACGAAAAATTAGATCAAAACATTCTTTCAATGCTGAGTGAAAGACCAACACCTGTTTTTGATATCTGGCTTAAATGGCGGAGCAATGGAATGTATATCGAAACCATCGATCGCCGTATGCAATACCTGAGAAAGAAAGGGCTTGTTGCAAATGTGCGTGGGAAGGGTTGGGTGAAAATTAACCTGTCATAACGGGGATTGATATGGACGAATCAAGAAAGCAGTTTGAAGAATGGTTTAAAAACAAATATCACGTTTCAAGTGACGTGATGAAGATTATGCACATCAAGGTCGAGATTG
CATGGGAGGCATGGCAGGCATCGCGAGCAGCTATTGAAATTGAGCTGCAAAAGCCAAAGAAAGGCCCACTTCCCGGTGATTATCACATTGGCTATGACTCAGGTGCAGAATCACAATACGAAAGCGATGTAGAGGCCATCCGCGCGGCTGGAGTTAAAGTGAAGGAGTGAGTATGACAAATCAGCAGCAAATAGAGTTCATCCTTGAGCAGATCAGAAAAATGCGAGAAAAGAACCAGCCAGACATGATGGAAATATGGAGACGCCAGCAGGAAGAATACCGCAAGCATATTTTTGGTGAGAGAAAACAGGATGACTGGAGCCTATATGGCTATGGCACCAGGACAAATAAAAACGGATATAGCCTTTACACATATTGAGGAATTCCATGAAACAGACAATTTTCCTCAGGAGTAAGCAACAACAGCAAGCCGCAATCAACGCCATCCTCGCAACACCACTCGATAAAGACAAGCCAGTTACCATCCGCATTACTGACTACAAGCGCAACCTTGACCAGAACGCAAAATTTCACGCGATGCTGGCGGATATCGCAAGTCAGGTTCAATGGTGCGGCAAATGGTTAAAACCAGAACAGTGGAAGGTTTTGTTGATTAGCGGTCATGCAGTGGCAACAAAACAGGAAGCTGATGTTTTGCGCGGCCTTGAAGGCGAATTCGTCAACATTCGCGAAAGCAGCGCGCAGATGAGTGTGAAGCGTATGGCAAGTCTGATCGAGTACACAACAGCCTGGGCTATTGGTCAGGGTGTCAGATTTACCGACAGGAGGTACGAATGAGACGACAGCGACGAAGTATCACCGACATAATCTGCGAAAACTGCAAATACCTTCCAACGAAGCGCTCCAGAAACAAACGCAAGCCAATCCCAAAAGAATCTGACGTAAAAACCTTCAATTATACGGCTCACCTGTGGGATATCCGGTGGCTAAGACATCGTGCGAGGAAATGACGATGACTGCGTATTACAACGAAATAGATCCGTATGCAGCGCAATGGCTGCGTAACTTAATTGACGCTGGAGAAATTGCCCCCGGTTATGTAGATGAAAGGAGTATTGAAGATGTCACACCAGGTGATTTGCGAGGATTTACCCAGCACCACTTTTTTGCAGGAATCGGAGTTTGGAGCTATGCACTTAGAAAAGCAGGATGGCCAGACAACAAGAGTATCTGGACAGGAAGTTGCCCATGCCAACCTTTCAGCTCGGCAGGCAAAGGAAAAGGGGTTGATGACGAGCGGCACTTATGGCCGGCATTCTTCTGGCTTATTGAAAAATGCAATCCTGGCATCGTTATTGGCGAACAGGTTGCAAGCGCAGACGGCCTCGCTTGGCTCGACCTTGTACAAACTGACTTGGAAGGTGCGAACTACACCTCTGCAGGTACCGATATTTGCGCTGCGGGCTTCGGTTCTCCGCACATCAGGCAGCGATTGTATTGGGTGGCCTACTCCAACGACAAATATCAACTTTCAGCCAGAGACACGCAGGGGAATTCAGAACCTATCTGGATGCGTGAGACTAGCGGGATGGCAAACTCCTTTAGCGAACGATGCAACAGGTTCAACGCATTGCTACAGCGGAAAAGACAAGAGCGGAACCCCAAGAATCTGCTTGAAACTTCCCGGGACGGTGAAGCTATGTACCCATTACCGGTTAACGGCTTCTGGAGAGATGCAGACTGGCTTTACTGTAGAGATGAAAAATATCGTCCAGTTAGACCCGGCTCATTCCCGATGGTTAATGGCATTGCCAAAAGCTTGGGACGAGGCAAGTCCACACTGGGAAGAATGGCAAAGCGCAATCAAGATCAGCGAATTATTGGATATGGAAACGCAATCAATGCAGAAGTAGCAACGGCATTCGTGAAAGTTTGTATGGAGGTTGTTAATGCTTAGCCCATCCCAATCCCTTCAATACCAGAAAGAAAGCGTCGAGCGAGCTTTAACGTGCGCTAACTGCGGTCAGAAGCTGCATGTG
CTGGAAGTTCACGTGTGTGAGCACTGCTGCGCAGAACTGATGAGCGATCCGAATAGCTCAATGTACGAGGAAGAAGACGATGGCTAAACCAGCACGAAGACGATGTAAAAACGGTGAATGTCGGGAATGGTTTCACCCTGCATTCGCTAATCAGTGGTGGTGCTCTCCAGAGTGTGGAACAAAGATAGCACTCGAACGACGAAGCAAAGAACGCGAAAAAGCGGAAAAAGCAGCAGAGAAGAAACTACGACGAGAGGAGCAGAAACAGAAAGATAAACTGAAGATTCGAAAACTCGCCTTAAAGCCCCGCAGTTACTGGATTAAACAAGCCCAACAAGCCGTAAACGCCTTCATCAGAGAAAGAGACCGCGACTTACCATGTATCTCGTGCGGAACGCTCACGTCTGCTCAGTGGGATGCCGGACATTACCGGACAACTGCTGCGGCACCTCAACTCCGATTTGATGAACGCAATATTCACAAGCAATGCGTGGTGTGCAACCAGCACAAAAGCGGAAATCTCGTTCCGTATCGCGTCGAACTGATTAGCCGTATCGGGCAGGAAGCAGTAGACGAAATCGAATCAAACCATAACCGCCATCGCTGGACTATCGAAGAGTGCAAGGCGATCAAGGCAGAGTACCAACAGAAACTCAAAGACCTGCGAAATAGCAGAAGTGAGGCCGCATGACGTTCTCAGTAAAAACCATTCCAGACATGCTCGTTGAAGCATACGGAAACCAGACAGAAGTAGCACGCAGACTGAAATGTAGTCGCGGTACGGTCAGAAAATACGTTGATGATAAAGACGGGAAAATGCACGCCATCGTCAACGACGTTCTTATGGTTCATCGCGGATGGAGTGAAAGAGATGCGCTATTACGAAAAAATTGATGGCAGCAAATACCGAAATATTTGGGTAGTTGGCGACCTGCACGGATGCTACACGAACCTGATGAACAAACTGGATACGATTGGATTCGACACCCAAAAAGACCTGCTTATCTCGGTTGGCGATTTGGTCGATCGCGGTGCAGAGAACGTCGAATGCCTGGAATTAATCACATTCCCCTGGTTCAGAGCTGTACGTGGAAACCATGAGCAAATGATGATTGATGGCTTATCAGAGCGTGGAAACGTCAATCACTGGCTGTTTAATGGCGGTGGCTGGTTCTTTAATCTCGATTACGACAAAGAAATTCTGGCTAAAGCTCTTGCCCATAAAGCAGATGAACTTCCGTTAATCATCGAACTGGTGAGCAAAGGTAAAAAATATGTCATCTGCCACGCCGATTATCCTTGTGATAAATACGAGTTTGGAAAGCCAGTTGATCATCAGCAGGTAATCTGGAACCGCGAACGAATCAGCAACTCACAAGACGGGTTCGTGAAAGAAATCAAAGGCGCGGACACGTTCATCTTTGGTCATACGCCAGCAGTGAAACCACTCAAGTTTGCCAACCAGATGTATATCGATACCGGCGCAGTGTTCTGCGGAAACCTCACATTGATTCAGGTACAGGGAGAAGGCGCATGAGACTCGAAAGCGTAGCTAAATTTCATTCGCCAAAAAGCCCGATGATGAGCGACTCACCACGGGCTACGGCTTCTGACTCTCTTTCCGGTGCTGATGTGATGGCTGCTATGGGGATGGCGCAATCACAAGCCGGATTCGGTATGGCTGCATTCTGTGGTAAGCATGAACTCAGCCAGAACGACAAACAAAAGGCTATCAACTATCTGATGCAATTTGCATACAAGGTATCGGGGAAATACCGTGGTGTGGCAAAGCTTGAAGGAAATACTAAGGCAAAGGTACTGCAAGTGCTCGCAACATTCGCTTATGCGGATTATTGCCGTAGTGCCGCGACGCCGGGCGCAAGATGCAGAGATTGCCACGGTACAGGCCGTGCGGTTGATATAGCAAAAACAGAGCAGTGGGGGAGAGTTGTTGAGAAAGAATGCGGAAGATGCAAAGGTGTCGGCTATTCAAGGATGCC
AGCAAGCGCAGCATATCGCGCTGTGACGATGCTAATCCCAAACCTTACCCAACCCACCTGGTCACGCACTGTTAAGCCGCTGTATGACGCTCTGGTGGTGCAATGCCACAAGGAAGAGTCAATCGCAGACAACATTTTGAACGCGGTCACACGTTAGCAGCATGATTGCCACGGATGGCAACATATTAACGGCATGATATTGACTTTTTGAATAAAGTTGGGTAAATTTGACCCAACGATGGGTTAATTCGCTCGTTGTGGTAGTGAGATGAAAAGAGGCGGCGCTTACTACCGATTCCGCCTAGTTGGTCACTTCGACGTATCGTCTGGAACTCCAACCATCGCAGGCTGAGAGGTCTGCAAATGCAATCCCGAAACAGTTCGCAGGTAATAGTTAGAGCCTGCATAACGGTTTCGGGATTTTTTATTTGGGTCAGTCGTATAAAGGTCATTACGGAAGGCTGTTAACCTTCTTATCGTGGTTCGAGTCCACGCTGCCCCTCCAAATATGCTGGTTTAGCTCCAATGGTAGAGCAGTCGCCTTGTAAGCGAATGGGTAGCGGTTCAAGTCCGTTAACCAGCACCATTACTGAGCCGTAGCCACTGGCTATTCTGAATTCATCAGTGATAGTTATGCTGCGGCCTTCTTTTTTCCCCTTCCCAATATAAGAACTACGCAATCCGTTACTGGCGGAGGCGTTGCTATGAAATCAATGGACAAAATCTCAACTGGCATTGCCTACGGAACATCCGCTGGTAGTGCGGGATACTGGTTTTTACAGTGGTTGGATCAGGTCAGTCCATCACAGTGGGCTGCGATTGGGGTGCTTGGAAGCCTTGTGTTGGGTTTTCTCACCTATCTGACAAATCTGTATTTCAAAATCAGAGAAGACAGAAGAAAGGCTGCGAGAGGTGAATAATGCCTCCATCATTACGAAAAGCCGTTGCTGCGGCTATTGGTGGCGGGGCTATTGCTATAGCATCTGTGTTAATCACTGGCCCAAGTGGTAACGATGGTCTGGAAGGCGTCAGCTACATACCATATAAAGATATCGTTGGTGTATGGACTGTATGTTACGGGCACACCGGAAAAGACATTATGCTCGGTAAAACGTATACCGAAGCAGAATGCAAAGCCCTCCTGAATAAAGACCTTGCCACTGTCGCCAGACAAATTAACCCGTACATCAAAGTCGATATACCGGAAACAACGCGTGGCGCTCTTTACTCGTTCGTCTACAACGTGGGGGCTGGCAATTTCAGAACATCGACGCTTCTTCGCAAAATAAACCAGGGTGATATCAAAGGCGCATGTGACCAGCTACGTCGCTGGACATACGCTGGCGGTAAGCAATGGAAAGGGCTGATGACCCGTCGTGATATTGAGCGTGAAGTCTGTTTGTGGGGGCAGCAATGAGCAGGAGCAGGGTAACCGCGATTATTTCCGCTCTGGTTATCTGCATCATCGTTTGCCTGTCATGGGCTGTTAATCATTACCGTGATAACGCCATGATCTACAAAGAGCAGCGCGATAAGGCCGCATCCACAATCGCTGACATGCAGAAGCGTCAACGTGATGTAGCGGAACTCGACGCCAGATACACAAAGGAGCTTGCTGATGCTAACGAGACTATCGAAAGCCTCCGTGCTGATGTTTCTGCTGGTCGTAAGCGCCTGCAAGTCGCCGCCACCTGTGCAAAGTCAACGGCCGGAGCCAGCAGCATGGGCGATGGAGAAAGCCCAAGACTTACAGCAGATGCTGAACTCAATTATTACCGTCTCCGAAGTGGAATCGACAAGATAACCGCGCAGGTTAACTACCTGCAGGAGTACATCAGGACGCAATGCCTTCGATGATAGCGATAATTTTACTCATCATCCTTCACATCTGTCTCTGTAGACAGGGTGGTGATCACTTCTGGAGTGAATCCAGATTAAACATCTCATTGCTGATGCTTGAAGTTGAGCATCTGGCGCGCGGTAAGGGGCTGCGTTGAGAT
AAGAGCCAGTCATTACAAATACCAGGATTTAGCCTCGCATTCGCGGGGCTTTTTTATATCTGCAACAAACGCGCTTCACACGCGCGACTTATGAACACAGAGCCTTTCAGGATGACCCTTGAGGATGCCGGTTTGGTAATCGGTGCCTTTCTGTGGGCCGGAATCCTGTGTGACAAGGTTCATCACTAAAAGGTAATCACTGATGAAGTACCCAACAGTTATTGTCAATGGTGTGTCCGTTCGTGTTGATGAGGATGGACGCTACAACTTAAACGATCTCCATGCAGCAGCAGTTGCAAATGGAGAGGCTACAGAGCAACAGCGCCCAAGCCAGTTTTTGCGTAGCGCGCAGATAAAACGCTTCATAAAAGCACTGGAGGCCAAAGTGCAAAAAAGCACTTTGGAACAAATTCAACCACTTAAAATAATCAAAGGTGGTGCAGAACCAGGTGTGTGGGGTGTTGAACTTCTGGCAATCAGATATGCAGCATGGATTAAGCCGGAATTTGAAATCGAAGTTTATGAAGTTTTCAAAACGGTTGTCCGTCTCGGCGTTGGCGCAATGTCCCGTCTGAATAGAATCGATCACATCATCAATACTGAAACCAAAGCGATAAGCCAGTGCGCAAGCCAAATGGCTAAGTGGGGCGTTGGTGGGCGAAAAAGATTGCTTCATGTTGCACGTGAGAGAGCGGCAAATGAAGTGCAAATGTATTTGCCCGGAATGGTGTGATTTCGCAGGTTAATCCAGTTTTTGCATTACGGCAGTACCGCGAAACAACCCAAGCCAGTAAGTGGGGAAATAACACTGGCAGCCACTGAAAGATGAACCTCCAGCCTTATGGCAAAAAAGATTCTTTGTGGTGGCGGACTGATGGAAAGACATCGGTTATTGCAGAGGCCATTCAATGAGTGGTCTCGACAATGGCTTATACCCTACACGGGATAACTTAACTGATATCCCTTTTAACGGATAAACGGAGCCAACAATGGCAGAGATTATTCCCATGACTGAAGAACAGAAATTCCAGTTAGAGATTTACAAACTGGTCATGAACCAGAACGCAGCCGCAGAAGAAGCATTTCAATTCATTGGCACTGACGAACTGAAGCTTGAGCTATTCAAAATTCACTTCCAGTCAGGTGGCGCTAATTCGGATATCACGATCCGCACATTTGAAGCGGTGCGTAAATCGAAGGAAGCGTTAGACCTGTTCACTACCGGAGCATAAACATGGCAACTCAAGGTTTCGACAACCCATCCAAATTCCGCGATGAATGGGATAAGCAAGCAGAAGGGAAATAATCAATATGGCGACTGAGAAAAAGAATGTCGGTCGCCCTTCGGATTACCTACCGGAGGTGGCTGATGATATCTGTGCGCTGCTTGCCTCCGGGGAAAGTCTGGTTAAGGTTTGCAAGCGCCCCGGCATGCCAGCAAAGGCTACTGTATTTCGCTGGCTGTCAGAGCATGAAGAATTTAGAGACAAGTACGCGAAGGCAACTGAGGCACGAGCTGATTCTATTTTCGAAGAGATATTCGAAATTGCTGACACTGCGATTCCAGATGCTGCTGAGGTGGCAAAGGCAAGACTTCGCGTTGATACCCGCAAATGGGCGCTGGCCCGAATGAATCCCCGTAAGTATGGCGACAAGGTAACTAACGAGCTTGTCGGCAAAGACGGCGGCGCAATTCAGATTGAAACATCACCGATGAGCACTCTATTCGGAAAATGACCTCGATTAATCCTATCTTTGAACCGTTCATTGAGGCGCATCGCTACAAAGTCGCCAAAGGCGGTCGAGGTAGCGGTAAGTCATGGGCAATTGCTAGGCTGCTTGTTGAAGCGGCGCGTCGTCAGCCAGTGCGTATTCTCTGCGCTCGTGAACTGCAAAACAGTATCAGCGATTCGGTAATCCGGTTGCTTGAAGACACCATAGAGCGGGAAGGGTATTCGTCTGAGTTTGAAATTCAGCGTTCAATGATTCGTCATCTCGGA
ACGAATGCTGAATTCATGTTCTACGGCATCAAAAACAACCCGACGAAGATTAAATCGCTAGAAGGCATTGATATTTGCTGGGTGGAAGAAGCGGAAGCGGTAACAAAGGAATCGTGGGATATCCTGATTCCAACCATCCGCAAGCCGTTTTCCGAAATATGGGTGAGCTTTAACCCGAAGAACATACTCGACGATACCTATCAGCGATTCGTTGTAAATCCTCCCGATGATATTTGCCTGCTGACGGTGAACTACACCGACAACCCGCATTTTCCTGAAGTTCTCCGTCTGGAGATGGAAGAGTGCAAACGCAGAAACCCGACACTGTATCGTCACATCTGGCTTGGTGAGCCAGTAAGCGCAAGTGATATGGCAATCATCAAACGTGAATGGCTTGAAGCCGCAACCGATGCGCACAAGAAACTCGGATGGAAAGCGAAAGGCGCTGTTGTCTCTGCGCATGACCCATCAGATACAGGGCCAGATGCTAAAGGTTATGCATCGCGTCACGGTTCGGTAGTTAAGCGCATTGCCGAAGGTCTGCTGATGGACATCAACGAGGGTGCTGACTGGGCTACTTCGCTGGCGATTGAAGACGGCGCTGACCATTACCTGTGGGATGGTGATGGTGTTGGTGCCGGGCTACGCAGACAGACAACGGAAGCGTTCTCCGGCAAGAAAATCACCGCCACGATGTTCAAGGGCAGCGAATCGCCATTCGATGAAGATGCTCCGTATCAGGCCGGAGCATGGGCTGATGAAGTCGTACAGGGCGACAACGTTCGCACTATTGGCGATGTATTCCGCAATAAGCGAGCGCAATTCTATTACGCGCTGGCTGACAGGCTGTATCTGACATATCGGGCGGTTGTCCACGGTGAGTATGCAGACCCCGACGACATGCTGAGTTTCGACAAAGAAGTGATAGGCGAGAAGATGCTGGAGAAGCTGTTTGCAGAACTGACGCAGATTCAGCGCAAATTCAATAATAACGGGAAGCTGGAGCTTATGACTAAGGTCGAAATGAAGCAGAAGCTCGGTATTTCATCTCCTAACCTGGCTGATGCGCTGATGATGTGTATGCATTGCCCGGCATTGGTCCGCGAAGAAACAGAAATATACGTTCCCTCATCCTCCGGTTGGTAAACATGGCAGAGACATTAGAGAAAAAACATGAGCGGATCATGCTCAGGTTTGACCGCGCCTATTCTCCACAGAAGGAAGTGCGCGAAAAGTGCATTGAAGCTACGAGGTTTGCTCGTGTCCCCGGAGGTCAATGGGAAGGAGCAACGGCGGCTGGAACTAAGCTTGATGAGCAGTTCGAGAAGTATCCTAAGTTTGAAATCAATAAGGTAGCAACTGAACTTAACCGCATCATTGCAGAATACCGCAATAACAGAATAACCGTTAATTTTCGTCCTGGTGACAGAGAGGCAAGCGAAGAGTTAGCCAATAAATTAAATGGTCTGTTCCGTGCTGACTACGAAGAAACTGATGGCGGTGAGGCTTGCGATAATGCATTTGACGACGCTGCTACTGGTGGTTTCGGTTGCTTCCGTTTGACGTCGATGCTGGTCAATGAATACGACCCCATGGACGATCGTCAGCGTATTGCTATTGAACCAATATACGACCCGTCGCGCTCTGTGTGGTTTGACCCTGACGCTAAGAAGTACGACAAATCTGACGCGTTGTGGGCGTTCTGTATGTATTCGTTGTCACCTGAAAAATATGAGGCTGAATACGGGAAGAAACCTCCTACTTCTCTGGATGTAACGTCTATGACCAGTTGGGAATATAACTGGTTTGGTGCAGATGTTATTTACATAGCGAAGTATTACGAAGTTCGTAAAGAGTCTGTTGACGTCATCAGTTATCGACATCCAATCACTGGAGAGATTGCAACATACGACAGTGATCAGGTTGAAGATATTGAAGATGAACTGGCAATAGCTGGATTTCATGAAGTGGCAAGGCGCTCAGTGAAGCGCCGTCGTGTG
TATGTATCCGTAGTGGATGGTGATGGTTTCCTTGAGAAACCTCGACGTATTCCTGGTGAACATATCCCCCTCATCCCGGTTTATGGAAAACGCTGGTTCATTGATGACATTGAGCGTGTCGAAGGGCACATTGCAAAAGCAATGGATCCACAACGTTTGTACAACCTTCAGGTATCAATGCTGGCTGATACTGCAGCGCAAGACCCCGGTCAGATCCCTATAGTTGGCATGGAGCAAATTCGTGGACTTGAGAAGCACTGGGAGGCTCGCAACAAGAAACGACCAGCGTTCTTGCCGTTGCGCGAAGTGAGAGATAAATCTGGCAACATTATCGCTGGAGCTACCCCGGCAGGATATACACAGCCTGCGGTTATGAATCAGGCATTGGCTGCATTACTACAGCAAACCAGTGCAGATATTCAGGAGGTTACAGGCGGCAGTCAGGCCATGCAGCAGATGCCAAGTAATATTGCTCAGGAAACGGTTAACAACTTGATGAACAGAGCAGATATGGCTTCGTTTATCTATCTGGACAATATGGCGAAAAGTCTTAAACGCGCTGGTGAAGTATGGCTGTCAATGGCGCGTGAAGTGTACGGTTCAGAGCGTGAAGTGCGCATCGTTAACGAAGATGGAAGTGATGATATCGCTGTCCTGAGCGCACAGGTTGTTGACAGGCAAACAGGGGCTGTTGTTGCTTTAAATGACCTTTCTGTCGGTCGATACGATGTGACGGTTGATGTTGGACCAAGCTACACAGCACGACGTGATGCAACGGTTTCTGTACTGACAAATGTCCTTAGCTCTATGCTTCCAACAGACCCAATGCGCCCGGCAATTCAGGGTATTATTCTGGACAATATTGATGGCGAAGGCCTTGATGACTTCAAAGAGTACAACCGAAACCAACTGCTGATATCTGGCATTGCAAAACCACGCAATGAGAAAGAGCAGCAGATTGTTCAACAGGCGCAAATGGCAGCACAAAGCCAGCCAAATCCTGAAATGGTTCTCGCTCAGGCGCAAATGGTAGCAGCGCAGGCAGAAGCGCAAAAAGCAACTAACGAAACTGCTCAAACTCAAATCAAAGCATTTACTGCCCAGCAGGATGCGATGGAGAGTCAGGCAAACACTGTCTATAAACTGGCTCAAGCCAGAAACATCGATGACAAAGCAGTGATGGAGGCAATACGCCTTCTGAAAGATGTCGCCGAGTCACAACAACAGCAATTCCAGTCACCACCACAGTCACCGGCAGACTTAATGCCGAGTTAACCAGGAGTAATCAATGGAAAACGAACTGATCATCGACGGTCAGGTTATTGACCTGTCTGAAACACAGGAAAATGCAGAAGAAACCATCATCCAAACAGAGTCACAGCCTGAGAATGAAAGCCAGGATGACAACGGTAAAGAGGTGGCAACTGAGCCTGAAAAAACCGAAGAGACACCAGAAGATTACGCCTTGCGTATTGGTGATGAAGAAATTCAGCTTAACGCTGACGATGATGATCACATTGACGGGCAACCTGCGCCGCAATGGGTGAAAGATCTTCGCAAAGGCTTCAAAGAAACACAGAAAGAAAACCGTGAGTTGCGCCGCCAGCTTGAGGAAGCATTAGCCAAGCCAGCGGAACATCAGCAACCACAACCAGACGCTATTCCACCAAAACCGACTCTTGAGTCGTGTGATTATGACGAACAGGCGTTTGAACAGGCATTGACTGATTGGCATGAGAAAAAAGGCCGTGTCGAACAGCAGCAGCAACAAAAACTACGTCAGCAACAGGAATACCAGCAGCGTTTCCAGCAAAGGGTAGAAGCGCATAAACAACGGGCAGCCAAACTTCCTGTGAAAGATTATCAGGAAATGGAAGCCATTGTTCTTAGTGAGCTACCACCAATTCAGCAGGAAATCATCATTCACTGTGCAGACGAAGGCTCTGAACTACTCGCCTATGGCTTAGGCAAGAGCCAGCAATTACGCCAGCGTGTAGCCG
CTGAGACAGATCCAATTCGCGCAGCATTCCTCTTGGGGCAGATTAGCAAACAGGTAAGCCTTGCTCCAAAACCAAAGAAAGCCATCAAGCCAGAGCCGGAAGTACGTGGTGGCGGTGCTGATGCGAAACAAGACGAATTCAACAAATTATGCCCCGGCGCAAAAATCGAATAAGGAAAAGATAAATGCCTAACAATCTCGACAGTAACGTCAGTCAAATCGTTCTGAAAAAATTCCTTCCGGGTTTTATGTCAGATTTAGTTCTGGCGAAAACCGTAGACCGTCAGTTGCTGGCAGGTGAAATCAACTCCAGCACTGGCGATAGCGTTAGCTTTAAACGTCCGCATCAATTCTCATCCCTCCGTACTCCCACTGGTGATATTTCAGGGAAAAATAAAAACAACCTGATCTCAGGTAAAGCTACGGGGCGTGTAGGTAACTACATCACTGTTGCTGTTGAATATCAGCAACTGGAGGAAGCGATCAAGCTTAACCAGCTGGAAGAAATTCTCGCGCCGGTTCGCCAGCGAATCGTTACCGACCTTGAAACAGAGCTTGCTCACTTCATGATGAATAACGGTGCGTTGTCACTTGGTAGCCCCAATACTCCAATCACCAAATGGTCTGATGTTGCGCAGACGGCATCTTTCCTGAAAGACCTCGGCGTTAATGAAGGTGAAAACTATGCTGTAATGGATCCATGGTCTGCACAGCGACTTGCTGATGCGCAGACTGGTTTGCATGCTTCAGATCAATTGGTTCGTACTGCATGGGAGAACGCACAGATCCCAACCAATTTTGGCGGCATTCGCGCACTGATGTCTAATGGGCTTGCCTCTCGTACGCAGGGGGCATTTGGCGGAACACTGACAGTCAAAACACAGCCAACTGTTACCTATAACGCAGTTAAAGACTCATACCAGTTCACTGTAACATTGACCGGAGCGACAGCCAGCGTTACAGGTTTTCTGAAAGCTGGTGATCAGGTCAAATTCACCAATACCTACTGGCTGCAACAGCAGACCAAACAGGCGTTGTATAACGGAGCCACACCAATTAGCTTCACTGCAACGGTTACTGCTGATGCTAATTCAGACAGCGGTGGCGATGTGACGGTTACGCTTTCTGGTGTTCCGATTTATGACACTACAAACCCGCAGTACAACTCTGTAAGTCGTCAGGTAGAGGCAGGCGATGCCGTATCTGTAGTAGGCACTGCTAGCCAGACAATGAAGCCAAACCTGTTCTATAACAAGTTCTTCTGTGGACTTGGCTCTATCCCACTGCCGAAACTGCACAGTATTGATTCTGCTGTTGCAACATATGAAGGTTTCTCCATCCGCGTACATAAATACGCAGATGGCGATGCCAACGTGCAAAAAATGCGCTTCGACTTACTGCCTGCATATGTGTGCTTTAACCCTCACATGGGCGGTCAGTTCTTCGGTAATCCGTAATAACAAGGGGCTTACGCCCCTTTTATGTTTTAAGGAAACAATATGGATCGGATGAGTGTATTCCTTGCCGCAGATAACGAATCCGGGCATGTACAGGCCGTTATCGCAGAAAAAGACTTCCAGTTTTTCGAAAGGTTGGGCTTTGTTGCCTCAGTTGATGAATTGAAACCGACCAGTAAGCGAGGTCGTAAGGCGGCAGACAATGGCAACAGTACTGACAAAGGGTGAGATCGTCCTTTTTGCGCTTCGTAAGTTTGCTATTGCTTCTAATGCATCGCTGACTGATGTTGAGCCGCAATCAATTGAAGATGGTGTAAATGATCTGGAAGATATGATGTCCGAGTGGATGATTAACCCCGGCGACATTGGTTACGCTTTCGCAACTGGAGATGATCAGCCATTACCAGATGATGAGTCAGGTCTTCCAAGAAAATACAAACACGCAGTAGGCTATCAGTTATTGCTGAGAATGCTATCTGATTACAGCCTTGAACCAACTCCGCAAGTTCTCAGTAACGCCCAACGCTCATATGATGCCTTGATG
ACCGACACTCTGGTTGTTCCTTCAATGCGACGACGTGGAGATTTTCCTGTAGGACAGGGTAATAAATATGACGTGTTTACATCTGACTGATATTATCCAGGCGATCTCCCTCTGATTGATGGCGATATCCCAAACGCATAGGTGAATAAATGCCGATTCAGCAACTTCCGCTTATGAAAGGTGTCGGCAAAGACTTCCGAAACGCCGACTATATCGACTATCTGCCAGTGAATATGTTGGCTACACCAAAAGAAATCCTTAACAGCAGCGGATATCTTCGCTCATTCCCGGGCATTGCCAAACGTTCTGATGTGAACGGCGTATCGCGTGGCGTCGAGTACAACATGGCGCAGAATGCTGTTTATCGCGTGTGTGGTGGCAAGCTGTACAAAGGCGAAAGTGAAGTCGGTGACGTCGCCGGAAGTGGTCGTGTATCAATGGCGCATGGTCGGACATCACAGGCGGTAGGCGTTAACGGGAAACTGGTCGAGTATCGCTATGATGGCACGGTTAAAACCGTCTCAAACTGGCCTACAGACAGCGGATTCACGCAGTATGAGTTAGGCTCAGTACGCGACATTACGCGCTTACGTGGGCGTTATGCGTGGTCAAAAGACGGCACTGATTCATGGTTTATCACTGACCTTGAAGACGAATCACATCCTGACCGATACAGCGCACAATATCGCGCAGAATCGCAGCCTGACGGCATCATCGGCATCGGAACATGGCGAGACTTCATCGTCTGCTTTGGTTCATCGACGATTGAATATTTTTCCCTGACTGGTGCAACCACTGTTGGTGCCGCTTTGTATGTCGCCCAGCCATCGTTGATGGTGCAAAAAGGAATCGCCGGAACCTACTGCAAAACGCCGTTTGCTGATTCGTATGCGTTCATCAGCAATCCGGCAACAGGTGCGCCGTCTGTTTACATCATCGGCTCCGGTCAGGTATCACCAATCGCCAGCGCGAGCATTGAGAAAATCCTCCGCTCCTACACTGCTGATGAACTGGCTGATGGCGTGATGGAGTCTCTACGATTTGATGCTCATGAGTTGCTGATTATCCACCTTCCGCGCCACGTCCTCGTGTACGACGCATCTTCAAGCGCCAATGGTCCGCAATGGTGTGTGTTGAAAACAGGCTTGTATGACGATGTGTACCGCGCTATCGACTTCATTTACGAAGGAAATCAGATAACGTGCGGCGATAAGCTGGAATCGGTGACCGGGAAATTGCAGTTCGATATCAGCAGTCAGTACGACAAGCAGCAGGAACACCTGCTGTTTACTCCGTTGTTCAAAGCAGATAACGCCAGAGTTTTCGACCTTGAAGTTGAATCTTCAACTGGAGTTGCGCAGTATGCTGACCGCCTTTTTCTCTCTGCAACCACTGACGGCATCAATTACGGGCGTGAGCAGATGATTGAGCAGAATGAACCGTTCGTTTACGACAAACGCGTTTTGTGGAAGCGAGTAGGGCGCATCAGGAAAAATGTCGGTTTCAAATTGCGCGTTATCACGAAGTCACCTGTCACTCTGTCAGGCTGCCAGATAAGGATTGAGTAATGGCTGATTCGAATCTCAATGTGCCGGTAATCATTCAGGCTACACGGCTCGACACATCAGTCCTTCCACGCAATATCTTCTCGCAGTCGTATCTGCTTTACGTTATCGCACAGGGCACTGATGTTGGTAACGTGGCTAACAAAGCCAACGAGGCCGGACAGGGCGCTTATGATGCACAGGTCAGGAACGATGAGCAGGATGTCACCCTTGCAGACCATGAATCCAGAATTGAAGCTGCTGAAGCAACTCTCATCAATCATGAACATAGAATTGCAGCAGCGGAAAGCACTCTTGCAGATCATGAAACAAGGATTACGGCTGCTGAAACAGAGCTGGCTGATCACGAGACGCGAATTGCTGCCAATGAATCTGAGTTAGCAAACCATGATGCGCGCATAACTCAGAATACAACCGATATCGACGCACTTGATA
CCAGGCTCACAGCGGCAGAGGGAAGTATTTCGACGCTACAAAGCACAGTTGGTGATCACTCAACAAGAATATCTGCGCTTGAGTATGCCACCACGCGCAAGAAATCAGAGGTTGTTTACTCAGGGGTATCGGTAACAATTCCGACATCGCCTACCAACCTTGTTAGCCTGCTGAAAACGCTCACGCCGTCATCCGGTGAGTTGGCACCATTCTTCGACACCGTTAACAACAAGATGGTTGTGTTCAACGAGAACAAAACCTTGTTCTTCAAGCTGTCGATCGTCGGGACGTGGCCCAGCGGAACCGCCAACAGGTCAATGCAGCTAACATTTTCCGGTTCTGTTCCTGACACACTGGTAAGCAGTCGTAATGCGGCGACAACAACCGACAACATCCTGTTAGCTACGTTCTTCAGTGTGGATAAAGACGGCTTTCTTGCCACAAATGGCAGTACGTTAACCATTCAGTCGAATGGTGCGGCGTTTACTGCCACAACCATCAAGATAATCGCGGAGCAGTAATGATTCAGTTCAAACCAACGCGAAACATCGACTTGATCGAAGCAGTCGGAAATCACCCTGACATTATTGCCGGGAGCAACAACGGTGATGGATACGACTACAAGCCTGAATGCCGTTACTTTGAGGTTAACGTGCACGGTCAGTTTGGCGGCATTGTTTATTATCAGGAGATTCAGCCGCTGACATTCGATTGCCACGCCATGTACCTGCCAGAGATTCGCGGCTTCAGCAAAGAAATAGGGCTGGCGTTCTGGCGATACATTCTGACTAACACCACCGTTCAGTGCGTCACATCGTTCGCCGCACGCAAATTCCGCCACGGGCAGATTTACTGCGCAATGATTGGCCTTAAGCGTGTCGGAACCATCAAGAAATACTTTAAAGGCGTGGATGACGTGACTTTTTACAGCGCCACACGCGAAGAACTAATCGACTTCCTGAATCACGGGAGATAGCCATGTTATATGCATTTAAGCTGGGCAGAAAACTGCGCGGCGAGGAACCTTGGTGCCCTGAAAAAGGCGGGAAAGGTGGTAGCTCTGATAAAAGCGCAAAGTATGCAGCAGAAGCCCAGAAGTATGCCGCAGACCTGCAAAATCAGCAGTTCAACACCATCATGAACAACCTGAAGCCGTTTACTCCTCTGGCTGAGAAGTATGTCGGCAGCCTCGAGAACTTATCGTCTCTGGAGGGGCAAGGTCAGGCACTTAACCAGTATTACAACTCTCAGCAGTACAAAGACCTTGCAGGTCAGGCTCGCTATCAGAGTCTGGCGGCAGCGGAAGCAACAGGTGGATTGGGTTCCACTGCAACCAGTAATCAGTTAGCAACAATCGCACCAACGCTTGGTCAGCAATGGCTATCTGGACAAATGAACAATTACAACAACCTGGCAAATATCGGTCTTGGCGCTCTTCAGGGGCAGGCAAACGCCGGGCAAACATATGCCAACAACATGAGTCAGATTTCACAGCAAAGCGCGGCGCTGGCTGCGGCAAACGCCAACCGACCGTCAGCATTGCAGCAGGGGGTTAGTGGTGCTGCATCCGGTGCGCTTTTGGGTGGTGGCATAGCCAGTGCTCTCGAGCTATCAACTCCGTGGGGTGCTGGTATCGGTGCTGGTCTTGGTCTGCTTGGCTCGTTGTTTTAAGGGGTAATCAATGGCTACGTGGCAACAGGGTATTAATTCTGGTGGTTTTCTGGCTGGCATCGGTACGCAAAATGAGAATGCGCCAAAGGCAAGCGACATTAACGCAACGCTTGGTCTGATCCGCGAAAACAATGAACTGGCTCGCTCAGGTGCAAATAACGTTGGTCTGACCGCGTTACGTGGTCTGGCTGGAGTTGCTGATATTTATAAGCAGGAACAGCAACAGAAAGCGATTAATGCGTTCAATAAGGTTCATGCTGATGCATGGGCTTCTGGTGATCCATCGGGACTATTTAAGTTTGCCAAGGAAAATCCAGCGTTTGTTGCGC
AGGCACAACAGGCGTTTTCCGGTCTTAATGAGCAGCAACGCAACGATATGGGCGATTTAGCCATGAGGGCTAACGTCGCTCTTTCTCAGGGACCGGAAGCCTACAGTAAATTCATTACTGACAACAAGGATAGGTTAAATCGCGTTGGTGCTAATGCTGACTGGATGATTCAGACAGGTAACCAGAATCCAGAGCAGCTATCACACATGCTGACTACTATGTCTCTCGGTGCGCTTGGACCAGAAAAGGCGTTTGCTGTTCAGGACAAGATGGCTGGTCGTGAGATTGATCGCGGAAAACTTGCAGAGACAATCCGCAGCAATCAGGCTGGTGAAGCACTTCAGGCGAGAGGGCAAAACCTTTCCTATCAGTCAGCAATGACTGGGCACAATATCGCAGCACAACGCTTGGCTCTGGATCAGCAAGAGTTCGGGTTTAAGATGCAGCAAGCGCAGGAAAAGGCTCAGCAGTTGATTAGCGAAGCACCTAAGCTGTCAGTAAACATGGAAAAAGGCATCGAGACGGCTGTAAACAATGCTACAGCATCATCAAACTCAGCCAATTCTATGAGTGCGCTTGCTCAACAGTTCAGAGCAGAAAAACCAACGACAGGTTTGTTCGGTAACGCACAGAACATGTTCGCAAAACTTACCGGAAGCGATACAACATTGCGTGATTTGCGCATTCGCCAAAATGCCCTTGTTAACAGTCAGGTTCTTAAATTCCTACCTCCCGGCCCAGCAACGGATAAAGACGTTGAGATCGTTCGACAGGGTGCGCCAACTGACATGGATAACCCTGAGACGGTCGCAAGATGGCTTGATGCAATGGCAAACCTTGAGCGACGAAACGCGCAGTTTAATGAGTTTAAAGCCGAGTGGATGAGCGCGAATGGCAACCCTGGACAATCGCGTAATGGCGGTCAGATATTGGGGTTGGATGTTAAAAAAGGTGAATCATTGGGGAGTGCCGTTAAGCGGTATATGTCAATGAATACTGACGCAGCGCCAGCACAAGATTCGACACCTTCAGGAGAACCACGGAATCAGGTTGGATCATATACCTCAAAATCAGGCATTCAATTTACGGTGGAATGATGAAAGTAACTGCAAACGGTAAGACATTTACCTTTCCTGATGGTACGAGCACCGAAGATATTGGCACCGCCATTGATGAGTATTTTGCTGGTCAGTCAGCACCAACACAACAAGGTGTTCAGCAATCTCCAGCAGACAACTCACTTGCATCAGGATATGCACAGCTTGCCACTCAGCAGAAGGAAGGACTAGATCGCTCTGCTGAGCAAGGGGCTGTTTTAGGTGCTGCAATGCGCGATGCCGTTACCGGTGAAAGCCGAATGACACCAGAAATGGAGAGGCTGCAAAATGTTGGGTCTGCTCCAGAGCTTAATAGCTTAAGCACTGATGCACTGCGTGCTGGATTGGGACAGCTATTTGGTTCCGACGCTTCACAGGAGAAAATACTGCAAAGTATTGGCGGGAAAATCCGGAAGGATGAGAAGGGAAATTCCATAGTCACCCTTCCTTCAGGGGAATATGCACTTAACAAGCCTGGTTTGTCACCGCAGGATATAACGTCATTCTTGGCAAATGCTCTTGCATTCACTCCAGCAGGTAGAGCTGCGTCTGTTGTAGGTGCAACACTAAAATCAGGCGCTACTGATTTAGCTTTACAGGGTGCCACTAAGATCGCTGGCGGTGAGAATGTTAATCCAGTTCAAACTGCAATTTCTGCTGGTCTTGGTGGGGTACTGAAGGGTGTAGAAAACACCGCAAGCGCAGTGTCTCGTTCTGCTATGGGTAAGATTGCTCCTGAAAAACAAGCTCAGATTGACTTTGCCAAGCAGAACAACTTGCCACTGATGACAACAGATCTTGTGGAACCGGGACCAAATATTGGTAAGCAAGCACGAGCTATGGCTGAGCGAATCCCAATAGCCGGAACAGGTGGGATAAGAAATGCACAGCAAAAGGCCAGGG
AAGATTTAGTTAGAACATTTAGCGATAATGTTGGTGGAATATCTGACGCACAACTTTACCAATCAGCTACTCGTGGTCAGCAGCAATTTATTCAGGCTGCTGGCAAGCGGTACGACAGGATCATCAGTTTGATGGGGGATACTCCTGTTGACATCACTGGAACAGTGAAAGCAATTGATGCGCAGATTTCCAAGTTAACTCGCCCAGGAGTATCGCAAGACCGCTCAGCTGTTTCTGTCCTTCAACAGTTTAGAAATGACATCACCAGCGGTCCAAATAACCTGCAATTAGCTAGAGAAAACCGTACAAACTTACGTAAGCGCTTTATGGCAGCACCTGACGAGGTCGATAGAGATACGCTGGAGAAAGCTGCGCAGTCTGTTTATAACGCATACACAACAGACATGAAAAAAGCGGTTGGCGCAAAACTAGGTGCGAAGGAAGCGCAAAACATGTCGCGTGTTGATCGTTCTTGGGCAAAGTTCAACGACATGATGAGCAATACACGTGTCCAAAAAGCTATTCAGAGTGGTAAAACAACGCCAGAAGATGTCACTAAACTAGTATTCAGCCAAAGCCCAGCGGAAAGGGCGCAACTTTATCGATTGCTTGATGATAGTGGGCGTCAAAATGCTAGAGCAGCACTTGTTCAGCGTGCAATGGATAAGGCGACAAGCGATTCAGGAAAGCTTAGCGTTGAGAAGTTTATTAATGAAATGAAAAGGAATCGGAAGCAGGCTGAGACGTTCTTCAGAGGAGAGCATGGGAAACAGCTTGATGGGATAATGAAATATCTTGATTCCACTAGACAGGCAGCTACAGCTGCCGCAAGCCCACTAACAGGGCAAATGGTAGCTGGTCCAGCAGCGCTGATAACAGCTCTTGCGCCTGTTACAAATCCAATGTTTGCAAAAGTTGCGGCAGTTGGAGCTGGTATCGGTATGGCTGGCAGGGTCTATGAGTCACGCGCGATGAGGAACGCATTACTAAAGTTAGCAAACACGCCAAAAGGAAGTACTGCTTATGATAGAGCGATCAGACTGGTATCTGAAACTCTTACACCTCTAATTCAGGCTTCAAGTGAGAAAGCCCAGCAGTAAAAAGTTGGTTAGCGGTTGATGGTTGCTTTTTTCGGGTCATACCATCTCGGCCATTCTTTCAGGAATGGGAATGAGTCGGGTGCGTGGTTCTTTTTGTACGATTTAAGCAGCCTTAACCGCTCAATCGCACACTCATAAACCTCTTGTTGCCCTGTAGTCATTTCGGTCCATGAAAGGTGATCCATTGATAAGACAACGTTTTCAGCTTCTTTTATGAGGGCGTTTTTATTTCTCACCGCAGCAGCATGGCTGACAGAGCAATCCTGCCATATTCTTAAAAGCCAAATAGCTAAGCAGATGAAAAAAATGGTAGATAGCGATATATACACACCAACCTCCTTAGTTTTACGCAGGATACCATGAAAAAAGTTAACATTGGAAACGTACCAAAAATGCTCGTACCGCTCTTTGAGAGCGGTACAATTGTGTTTTGCAGAGACTTTCCAGAATGGCAACGCCTGCATCAAAAACTTGGCGTGGACGTGCAGGACTCGGATGCCAACGGAGCGTCTCATACAATGAGCAGCGAGAATGGTGTTTTGCATGTGATAGGCGTGTTCAATGGCAAACTATCTACCATTGCCCATGAGTGCGCTCACATGGCATTCGATATCTGCTCAAGGGTCGGTGTTGATGTTGAACCAGGAAGAGCCAACGAGACTTACTGCTACTTAATGAGCAGGCTTGTTGAGTTCTGCGAGCGACATATCAAAAAGCCGGAGTGACCCGGCTTGATTATTACTTTTTTTGGTATGTTAAGAATGGCAAGTCGCCAGTGTACCCCAAGCGGAAAAGCTCCAATTGCTTATCTCTTATGGATAAAAAATAATCGTTTCTTGCCTTTTTAAGCCACAACCAGCCGATCATCAAGATGAATAATATTGATATTATTGGCATCGG
GGAGATCAACACCACTCCGATGAATAATGACGCTAGCAGCACCGATAAAATAGCAGAAACCATACTAATCTCCCACTAAGGTAACAATATGACCATAGAAGAACGCCTGAACAACATCGAGTTGAATCAAACCCTGCTTGACCAGCGACTTTCAGATCTTGAACTTAAAGATCTGGATGCGCAAATATCGGAAGCAGAAGCCAAGCTCTCCAGCCTAAACCACCGCAAGAAGCAAATCCGCAACAGAATTACTCAGGGACGCGGAAGCTGTTGAGGTGGGATGCTAGGTCTCTATCGTTAAAATCAAGGCTGCTAATCATTTCATTGTAAATAGCGTTTTTATCTTCCATTGGCAGTCTTGAGTAAACCAGACACAGAGCATATTTCAGGGAGTTTAGCTCTTTCTCTAGCTCTTCCTTGCTTGATGTTTTTGACTTAATAAACTGTTTTTTATTCATTTTGCATCCTTACCATACATGGTTTTCAGTGTTTCAATCAGCGCATCCCTGAATTTGTCAGCCTCTTTCTGAGCAAATTCATTGCTATTAAGCGATCTACCACAAACAGCATCTTCGATAATCTGTATTATTTCAGCATTCATGGAACGCTTGTTATGTTGCGCCCTGGCTTTAACCTTTGCCTTTAACTCTTTGGAAATCCTGATATTTATTTGCGGCTCTTCGCGTGACATACCACCTCCATAGCATTTTGGTGATATTACTATTGCATCACTGCGATCACAATGGCATAACGGTTATACCAAATTGATTGGAGGTAATATGATAGTCAAGTCAGACGCACCAAAGTACCCTTTGCGCATCCCATTAGAGGTTAAGTTAGCAATCGAGAAGTCAGCGAAAGAAAATGGTCGCTCAATAAATACCGAGATGGTAATGCGGTTAGTGGATAGTTTAAGGCGGGATAGTTCTAAAGGTAATCTAGCAAAAAGTTGAAGCCCCAACTGCGGGAACAGTCAGGGCTTCGGTATCAACAAATCGGATTAGGAAATATTGACATGAAAAGTATAGCAAAGGCACAAAACGATTTCACCATCTTCAAATTCGGCGACAGTGAAATCCGCGTCATCAACAAATGCGGTGAGCCGTGGTTTGTAGCAAAAGATGTTTGTGATGCTTTAACCCTGACTAACTCACGCAAGGCGCTTACTGCACTTGATGACGATGAAAAGGGAGTAACTTTAAGTTACACCCTTGGTGGTGAGCAGAATCTAAGCATTGTTAGCGAATCAGGTATGTATACATTGGTTCTGCGCTGCCGCGATGCTGTCAATAAAGGTTCGGTCCCGCACAAATTCCGCAAGTGGGTAACAGCAGAAGTTCTGCCTTCAATTCGCAAACATGGCGAGTATGTGAAAGGCAAGAAAACCACTGTTGAGGAAAGAACACCGCTACGCGATGCAGTAAACATGCTGGTAGGAAAGAAAGGACTTCGCTATGACGATGCATACAATATGGTTCATCAGCGTTTTGGTATTGACAGCATTGATGAACTTTCAATTGAACAAATCCCGCTGGCCGTAGAGTACATCCACAGGGTAGTGCTTGAAGGTGAATTCATCGGAAAACAAGAGAAGAAAACTGATGAGCTTTCTGCAAAAGAAGCAAACAGCCTTGTATGGTTATGGGATTATGCCAACCGCTCACAGGCATTATTCCGCGAACTGTATCCGGCGCTAAAACAAATTCAATCGAACTATTCCGGCAGATGCTACGACTACGGTCATGAGTTCTCGTATGTTATCGGAATGGCGAGAGACGTTTTAATCAATCACACACGAGATGTTGATATTAATGAGCCAGACGGACCAACGAATCTTTCCGCATGGATGAGACTTAAGAATAAAGAATTACCTCCTTCAGTACATAACTACTGACAGATAACCAACGCAACGACCCAGCTTCGGCTGGGTTTTTTTATGCCCAAAATTCACCGTAGCCATGCTGCGGGGATTCATTGTATCTGGAGCAAATTAAATGACA
GACATTATCTAAAACAAGCAAACTCGCCATGATATTTTTCCCTGAATGCAATAGACATTAATTCGGCCAGCTCAATATCATCGGTATACCCAAGGCAAATAGACTTACCTTTTAGTTTGAACATAACTTTATATGTGTTTTTCCCTTTAAGAAGATGAACTCCTTTTATACCTGTATTGGATGATGAAACGCTGTTGGTTTTATTCTGACCTCTGGTTACTTCTCTTAGGTTTGAAATTCTATTGTCATTTTTAACCCCGTTAATATGGTCAATTAACCCTTTAGGCCAGCATCCTTTCATATAAAACCAAGCAAGTCGGTGGCAAAGATATGATTTCCCATTAATCCGTATTGATAAGTAGTCGGCCTGTGAGTTCGTCTTAAATCCAGCAACCTTTCCGATACTGGAGCTACCAGAAGCTATCTTCCAAGTGAAGACGCCTGTTTCTGGGTTGTAGTTGAGTGAAGAGGTTAATTCTTTATGAGTAATCATATTCATTCCTTAAATAGAGATTCACTATGTCAAATATTGTGCCAAATGTTATAATTTCAATGCCAAGTCAACTATTTACGTTAGCCAGAAAATTCCAGGCAGCAAGCAATGGCAAGATATTCATTGGTAAGATAGATTCCGATCCCACGCTCCCACAGAATCAAGTGCAGGTTTATGTGGAGAACGAAGACGGCTCTCACATTCCCGTTTCGCAACCAATAATTATTAACGCTGCTGGCTACCCTGTATATAACGGACAGATTGCCAAATTCGTAACCGTGCAAGGGCATTCTATGGCTGTGTACGATGCATATGGATCACAGCAGTTCTATTTCCCTAATGTGCTGAAGTACGATCCAGATCAGTTTTCATATTTGCTGAAAAGTGATGATGGCTATAAGTATATAGGAGAATGCAAAGATATATCTACATTGAGAACAATTGAACCAAGTTATGATGGACAGGTTATAAGACTGAGGGGATATTACTCTGACAGCTATTCAGGAGGTGGAAGGTTTTTCCGTAGTGTTAATCCTGATGGATTGACAGATAACGGAGGAACGATAATTTTCACATCAGAAGGAAATGTATGGAAATCAGATGATATATATAATGTTATGCTTGAGGATTTCGGAGCTAAAGATGGTGCTGACAACACCGAAGCATTTCATAAAGCAATTAACTCACCAGCAAAAAAAATCACAACAAATATAAAAGAGTTGATACTAACAAACAATATTGTAATTACACGCGGCGACCTGGCAATTGATATGCCTGATACGGAGATAATATGGAATGCTCCGGAAGGACATATTCCTGATAGAAGTGGATCTACAAATCCAAGTAACTATCGTTTTCCTGGCATTCTAGCATTTCGCGGCACTGCTGGAGATATAATAGACTCTTTTACTCTAACAAATAGAATATCAAAAGGAGATACAACATATTGGTGTACGAACAACTCTCTTTTTGAGAATAGACAGTGGGTTATTATGTCATCGGATGTTGGCGGCGGAACCGTAGGTAGAGAAATAAATGTAATGACACAAGTGCAGGGCAGTGGTGGAGCAATAACTCAATTGCGAGTTGATTATAAAACAGGCTGGCATCTTGATGTCGGAAGGGTATTGACATATAGAAATGTAACCCCAATAGAAAATGTAAGAATTGCTTTTAAAGGCGTAAAATGGAATCAAAACATAACAGATGCTTCAGGTGGTGGCGATGGATTTGCCAGTCAACAATCATGCGCACTTGTGTCCCTTGAATACGTAAATAACGCTGACATTTGTCTCGGGTATGGATTTAATCACCCTTACCCAATGGTTGTAACTGCCATGGTCAGGAATGTTTTTGTTCATGATACGGAAACAAATTTCCCAAGAGTTCCAGGGTCCGATCATGTTGTTCAATTCAATAACGCATATGAATGCCACGCAAGAAGATTACGCAATATTTCAGGCAGGCACGTAGTTGATTTTAGTGGTTCAAGTTATTGCTCTGTAAG
AGATTGCGGAGAAACAGGAACAAGAAATGGTGCATTTACTACGCACGGTATGTTTGAGCACAATTTGCGATTTGATAATAATTATGGTCTTTTAAGCATTGCAAACAGTGGTGAATATTATGGTGAGTCAGCAGATAATATAACTGTAACCCATCATTTCGGAGATTATTTAATTGCAACTTCAAAAGTTACCAACCTTAACATATTGCATGCAATATTTACAAAAGGTGCAAGATTAAACAATGATGCCGTCACATTGATTGATGTTACTATTGGAGAAAACTCAACTGATGCCGATAGAGGGCTTAGATTTACCCAAAGCTCTAATGTTTATGGGCGTGGGGCAAAAATATTCGGCTGTGACATTGTGTTAAACAAGCAAGCAGGGAATGCTCTGCCTGATTCTCTTAACCAAAGGGTGAAAATCGAATCGACATATATCAGGAATTCAAACGCTTCATATTATGGTGGTAGTTGGATTGAACTAATTAATTGTGATGTTTCAGGATCAGGTTCAAGTCTTGAGAATGTTGTTGCTGCAACGCGGTTTAGCATGCTGTCCGGATCACTATCTAATACTGGTTTTATTTTTCAAGGCCCATCAGAGCAAATCGTAAAAATACATGACGTTTCACAGTCTGGAACAGGAGCGTCAGGGCTTAATAGTCATTTTACTTTAAATAAAGATGAGGCAGGAAGTGGACCAATAACAATGTCATATAAAAATAATCAATGTATATTGAGTGGAGAAACATGCGCATTCAAAATAAATAATGCTTCAGGAGCATTCAGAGTAAACTCTACAGGGAATACGTTTCAAGGTGGGAAAATAGAGTTTCAGAAATCTATTACATCAGGTGGCAGTTACTTAAATCACACTGGAAACGTAGAGCGTGGCGTTAGTAGAGTCAACGAGCCAGAAAATACTCCATATATAGTAACAACAGGGAATATGATAATTCCCTGATATAATATAAATCGCCAAGGATGGCTCATTCAAATAAGAATCTTTAGTAATCAGATTGTTAATTAAAGATGGTTTGTTGGTTTTTATGTTTATTAAAAAAGGTTATATCAACGCGCTGTTATTGTTCTTTAATCAACCTTCCCATCAAGCCAGTCTGCCCACCACTGCATCATTTCTCTGCGTTTGTCGAGATACTGAGCATGGTTGTAAATTCCACGCACAGAACCGCCGTTGGCATGTGCCAGTTGCACTTCAATAGCATCAGCGGGCCATTCGTGCTCGTTCATAATCGTGCTGAATTCATGCCTGAATCCGTGACCGCTTTCCAGACCCTCATAGCCGATTTGTTTGATCACAAGCAATACCGCGTTCTCGCAGATTGGCTTCTTCTTATCGTTGCGCCCGGCAAAAACAAACTCTGATACTGGTTTGGTGATTGAGCTTAGCGTAGTGAGAAGTTCAACCACCTGGTCCGACATCGGGACCACATGAATTTTGCGGCCCTTCATCACACTGGCGTCGATGGTGATAATCCTGTTTTCAAAATCGACGTTCTTCCATAGCATGGAACGAAGCTCTTTCGTTCTTAGGGCAGTGTAGCGTAAAACTTTGGTCGCAATGAGCGATACGATACTTCCTGAAAATGTTGCAAGTGCTTTGTTGAATGCCGGGATCTGGTCGGCAGGTAAAAACGGGAAGTTCTTCTTGCGGTATCCCTTCATGGCGTCAGCAAGGTCAGGTGCCGGGTTATATTTAGCCCTACCAGTGACAATAGCGTAACGGAAAACCTCGCCGCATCTTCTGCGGGCTTTGTTGGCTCGCTCCATTGCACCGCGATCTTCAAATCTGCGGATTACTTCCAGCAGTTGCATCGGCTCAATATCCTGAATTTCAAGGCCGCCGATGATAGGTAAAATGTCGTCATCAAACATTTTTGCAAGTTCATTTGCATAGCCTACTGACCAGACTTGCTTCTTGTGCTCGTACCATTCCTTGTAAATGGCGCTAAAGGAATTGTTGTTAGACGAAGCCT
TTTTCGCTTTTACCGGATCGATGCCAACCGAGATGTCTTTCCTCGCGGTCCATGCTTTATCCCTTGCCTCCTGCAAAGTCATAAGCGGATATTTTCCGACGGTCAGGATTTTCTCCTTACCGTCAATCTTGTAGCGAAGCTGCCATACCTTTTTCCCTGATACAGGGACATAAAGGTACAGGCCATTACCATCGAGTAGGCGGTATGGTTTTTCTTTCGGCTTTGCTGCTTCAATCTGCTTAACGGTGAGCAT
Protein sequences of DBSCAN-SWA_9 >NZ_CP040886|3746219:3790394|3785156_3786035_+|WP_060504045.1|DBSCAN-SWA MKSIAKAQNDFTIFKFGDSEIRVINKCGEPWFVAKDVCDALTLTNSRKALTALDDDEKGVTLSYTLGGEQNLSIVSESGMYTLVLRCRDAVNKGSVPHKFRKWVTAEVLPSIRKHGEYVKGKKTTVEERTPLRDAVNMLVGKKGLRYDDAYNMVHQRFGIDSIDELSIEQIPLAVEYIHRVVLEGEFIGKQEKKTDELSAKEANSLVWLWDYANRSQALFRELYPALKQIQSNYSGRCYDYGHEFSYVIGMARDVLINHTRDVDINEPDGPTNLSAWMRLKNKELPPSVHNY >NZ_CP040886|3746219:3790394|3758656_3758935_+|WP_001177653.1|DBSCAN-SWA MQLTSTRKKANAITSNILNRIAVRGQRKVADALGINESQISRWKDSFIPKMGMLLAVLEWGVEDEELAELAKKVARMLTKEKAPKNGEFFEA >NZ_CP040886|3746219:3790394|3762935_3763112_+|WP_001254255.1|DBSCAN-SWA MRRQRRSITDIICENCKYLPTKRSRNKRKPIPKESDVKTFNYTAHLWDIRWLRHRARK >NZ_CP040886|3746219:3790394|3769874_3771287_+|WP_060504031.1|terminase|DBSCAN-SWA MTSINPIFEPFIEAHRYKVAKGGRGSGKSWAIARLLVEAARRQPVRILCARELQNSISDSVIRLLEDTIEREGYSSEFEIQRSMIRHLGTNAEFMFYGIKNNPTKIKSLEGIDICWVEEAEAVTKESWDILIPTIRKPFSEIWVSFNPKNILDDTYQRFVVNPPDDICLLTVNYTDNPHFPEVLRLEMEECKRRNPTLYRHIWLGEPVSASDMAIIKREWLEAATDAHKKLGWKAKGAVVSAHDPSDTGPDAKGYASRHGSVVKRIAEGLLMDINEGADWATSLAIEDGADHYLWDGDGVGAGLRRQTTEAFSGKKITATMFKGSESPFDEDAPYQAGAWADEVVQGDNVRTIGDVFRNKRAQFYYALADRLYLTYRAVVHGEYADPDDMLSFDKEVIGEKMLEKLFAELTQIQRKFNNNGKLELMTKVEMKQKLGISSPNLADALMMCMHCPALVREETEIYVPSSSGW >NZ_CP040886|3746219:3790394|3754212_3754686_-|WP_024167014.1|DBSCAN-SWA MASRGVNKVIIIGRLGHDPEIRYSPSGTAFANLTVATSEQWRDKQTGEQKEQTEWHRVVMSGKLAEIASEYLRKGSEVYLEGKLRTRKWQDQSGQDRFTTEVIVGVGGTMQMLGGKQGGNEQSSHQRNNGQQQRQQSQQQGNHSEPPMNFDDSDIPF >NZ_CP040886|3746219:3790394|3777708_3778662_+|WP_060504040.1|DBSCAN-SWA MADSNLNVPVIIQATRLDTSVLPRNIFSQSYLLYVIAQGTDVGNVANKANEAGQGAYDAQVRNDEQDVTLADHESRIEAAEATLINHEHRIAAAESTLADHETRITAAETELADHETRIAANESELANHDARITQNTTDIDALDTRLTAAEGSISTLQSTVGDHSTRISALEYATTRKKSEVVYSGVSVTIPTSPTNLVSLLKTLTPSSGELAPFFDTVNNKMVVFNENKTLFFKLSIVGTWPSGTANRSMQLTFSGSVPDTLVSSRNAATTTDNILLATFFSVDKDGFLATNGSTLTIQSNGAAFTATTIKIIAEQ >NZ_CP040886|3746219:3790394|3764220_3764832_+|WP_060504019.1|DBSCAN-SWA 
MAKPARRRCKNGECREWFHPAFANQWWCSPECGTKIALERRSKEREKAEKAAEKKLRREEQKQKDKLKIRKLALKPRSYWIKQAQQAVNAFIRERDRDLPCISCGTLTSAQWDAGHYRTTAAAPQLRFDERNIHKQCVVCNQHKSGNLVPYRVELISRIGQEAVDEIESNHNRHRWTIEECKAIKAEYQQKLKDLRNSRSEAA >NZ_CP040886|3746219:3790394|3766845_3767061_+|WP_000839574.1|holin|DBSCAN-SWA MKSMDKISTGIAYGTSAGSAGYWFLQWLDQVSPSQWAAIGVLGSLVLGFLTYLTNLYFKIREDRRKAARGE >NZ_CP040886|3746219:3790394|3750954_3751299_-|WP_001281201.1|DBSCAN-SWA MSDLAMKVLKWQSTGDVGISSATLASIACGLKKNIYGHSFGAPHDAADFRRCVALVEQIPEIRDSFDKVAKRVPAFKGILNEWDSLVALLKSEMKTYGNKAPETYRRISELRKD >NZ_CP040886|3746219:3790394|3767985_3768138_+|WP_032181221.1|DBSCAN-SWA MPSMIAIILLIILHICLCRQGGDHFWSESRLNISLLMLEVEHLARGKGLR >NZ_CP040886|3746219:3790394|3778661_3779117_+|WP_000614037.1|DBSCAN-SWA MIQFKPTRNIDLIEAVGNHPDIIAGSNNGDGYDYKPECRYFEVNVHGQFGGIVYYQEIQPLTFDCHAMYLPEIRGFSKEIGLAFWRYILTNTTVQCVTSFAARKFRHGQIYCAMIGLKRVGTIKKYFKGVDDVTFYSATREELIDFLNHGR >NZ_CP040886|3746219:3790394|3747868_3748783_+|WP_001274871.1|DBSCAN-SWA MSAKVWVLGDAVVDLLPESDGRLLPCPGGAPANVAVGIARLGGISGFIGRVGDDPFGALMQRMLLTEGVDITYLKQDEWHRTSTVLVDLNDQGERSFTFMVRPSADLFLETTDLPCWRHGEWLHLCSIALSAEPSRTSAFTAMTAIRHAGGFVSFDPNIREDLWQDEHLLRLCLRQALQLADVVKLSEEEWRLISGKTQNDQDICALAKEYEIAMLLVTKGAEGVVVCYRGQVHHFAGMSVNCVDSTGAGDAFVAGLLTGLSSTGLSTDEREMRRIIDLAQRCGALAVTAKGAMTALPCRQELE >NZ_CP040886|3746219:3790394|3774325_3775597_+|WP_060504035.1|head|DBSCAN-SWA MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGKNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQLEEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHSIDSAVATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGGQFFGNP >NZ_CP040886|3746219:3790394|3771289_3773416_+|WP_060504033.1|portal|DBSCAN-SWA 
MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEYRNNRITVNFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPIYDPSRSVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQAEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVMEAIRLLKDVAESQQQQFQSPPQSPADLMPS >NZ_CP040886|3746219:3790394|3763108_3764059_+|WP_060504017.1|DBSCAN-SWA MTMTAYYNEIDPYAAQWLRNLIDAGEIAPGYVDERSIEDVTPGDLRGFTQHHFFAGIGVWSYALRKAGWPDNKSIWTGSCPCQPFSSAGKGKGVDDERHLWPAFFWLIEKCNPGIVIGEQVASADGLAWLDLVQTDLEGANYTSAGTDICAAGFGSPHIRQRLYWVAYSNDKYQLSARDTQGNSEPIWMRETSGMANSFSERCNRFNALLQRKRQERNPKNLLETSRDGEAMYPLPVNGFWRDADWLYCRDEKYRPVRPGSFPMVNGIAKSLGRGKSTLGRMAKRNQDQRIIGYGNAINAEVATAFVKVCMEVVNA >NZ_CP040886|3746219:3790394|3767554_3767998_+|WP_060504028.1|lysis|DBSCAN-SWA MSRSRVTAIISALVICIIVCLSWAVNHYRDNAMIYKEQRDKAASTIADMQKRQRDVAELDARYTKELADANETIESLRADVSAGRKRLQVAATCAKSTAGASSMGDGESPRLTADAELNYYRLRSGIDKITAQVNYLQEYIRTQCLR >NZ_CP040886|3746219:3790394|3783249_3783570_-|WP_000275950.1|DBSCAN-SWA MYISLSTIFFICLAIWLLRIWQDCSVSHAAAVRNKNALIKEAENVVLSMDHLSWTEMTTGQQEVYECAIERLRLLKSYKKNHAPDSFPFLKEWPRWYDPKKATINR >NZ_CP040886|3746219:3790394|3759117_3760008_+|WP_000539336.1|DBSCAN-SWA MGVVKLADYRHNPVQHQEASSMGYVSIHRQFMDSRLYKDSQAVHLWLHLILKANHESTVVNTDIGPITVDRGQMITGRPSLVRETFIPDNKVRSLLRTFESKGMLNICSMGKKFSLFTIVKYDDFQAKNCPTVVQRLSNANTSNGAALSGDCPTVVQRLSINNNINNISNTDVLESATADKKSDKKKPSVSCQDVVDAYHEILPEAPRIRALNDKRKNQIRTFWRKAGVITRQLDGHGFTMQDWRNYLSYVGENCRWMFEERPNHQRGTVWHKKGFDFLLNDNTYLKVREGEHDDR >NZ_CP040886|3746219:3790394|3784389_3784599_-|WP_001036007.1|DBSCAN-SWA 
MNKKQFIKSKTSSKEELEKELNSLKYALCLVYSRLPMEDKNAIYNEMISSLDFNDRDLASHLNSFRVPE >NZ_CP040886|3746219:3790394|3750630_3750831_-|WP_001163428.1|DBSCAN-SWA MQHELQPDSLVDLKFIMADTGFGKTFIYDRIKSGDLPKAKVIHGRARWLYRDHCEFKNKLLSRANG >NZ_CP040886|3746219:3790394|3776290_3777709_+|WP_060504038.1|DBSCAN-SWA MPIQQLPLMKGVGKDFRNADYIDYLPVNMLATPKEILNSSGYLRSFPGIAKRSDVNGVSRGVEYNMAQNAVYRVCGGKLYKGESEVGDVAGSGRVSMAHGRTSQAVGVNGKLVEYRYDGTVKTVSNWPTDSGFTQYELGSVRDITRLRGRYAWSKDGTDSWFITDLEDESHPDRYSAQYRAESQPDGIIGIGTWRDFIVCFGSSTIEYFSLTGATTVGAALYVAQPSLMVQKGIAGTYCKTPFADSYAFISNPATGAPSVYIIGSGQVSPIASASIEKILRSYTADELADGVMESLRFDAHELLIIHLPRHVLVYDASSSANGPQWCVLKTGLYDDVYRAIDFIYEGNQITCGDKLESVTGKLQFDISSQYDKQQEHLLFTPLFKADNARVFDLEVESSTGVAQYADRLFLSATTDGINYGREQMIEQNEPFVYDKRVLWKRVGRIRKNVGFKLRVITKSPVTLSGCQIRIE >NZ_CP040886|3746219:3790394|3789236_3790394_-|WP_060504054.1|integrase|DBSCAN-SWA MLTVKQIEAAKPKEKPYRLLDGNGLYLYVPVSGKKVWQLRYKIDGKEKILTVGKYPLMTLQEARDKAWTARKDISVGIDPVKAKKASSNNNSFSAIYKEWYEHKKQVWSVGYANELAKMFDDDILPIIGGLEIQDIEPMQLLEVIRRFEDRGAMERANKARRRCGEVFRYAIVTGRAKYNPAPDLADAMKGYRKKNFPFLPADQIPAFNKALATFSGSIVSLIATKVLRYTALRTKELRSMLWKNVDFENRIITIDASVMKGRKIHVVPMSDQVVELLTTLSSITKPVSEFVFAGRNDKKKPICENAVLLVIKQIGYEGLESGHGFRHEFSTIMNEHEWPADAIEVQLAHANGGSVRGIYNHAQYLDKRREMMQWWADWLDGKVD >NZ_CP040886|3746219:3790394|3755785_3755920_-|WP_000972063.1|DBSCAN-SWA MMHFQLAGSGVMSAFYPHESELSRRVKQLIRAAKKQLEALCAMK >NZ_CP040886|3746219:3790394|3762044_3762311_+|WP_060504012.1|DBSCAN-SWA MDESRKQFEEWFKNKYHVSSDVMKIMHIKVEIAWEAWQASRAAIEIELQKPKKGPLPGDYHIGYDSGAESQYESDVEAIRAAGVKVKE >NZ_CP040886|3746219:3790394|3762313_3762520_+|WP_001515066.1|DBSCAN-SWA MTNQQQIEFILEQIRKMREKNQPDMMEIWRRQQEEYRKHIFGERKQDDWSLYGYGTRTNKNGYSLYTY >NZ_CP040886|3746219:3790394|3768343_3768874_+|WP_000877024.1|DBSCAN-SWA MKYPTVIVNGVSVRVDEDGRYNLNDLHAAAVANGEATEQQRPSQFLRSAQIKRFIKALEAKVQKSTLEQIQPLKIIKGGAEPGVWGVELLAIRYAAWIKPEFEIEVYEVFKTVVRLGVGAMSRLNRIDHIINTETKAISQCASQMAKWGVGGRKRLLHVARERAANEVQMYLPGMV >NZ_CP040886|3746219:3790394|3753166_3753715_-|WP_001016186.1|DBSCAN-SWA 
MNHLMVDLETMGNGPYAPVISIGAVFFDPNTGETGEEFSVNISLESSMRYRARPDASTILWWMEQSEEARKSLTSNTQELSTALSWLSEFIIKNANHKFVQVWGNGASFDCVILRNSYSLTGQPVPWQWWNDRDVRTIVELGKVIGFDPKRDMPFKGTRHNALDDAIHQAKYVSAIWKKLAK >NZ_CP040886|3746219:3790394|3752853_3753150_-|WP_060504006.1|DBSCAN-SWA MPAPLYGADDPRRCSGNSISEVLENIKNNLDAFLALPPETKEERKYRRDIQLAEKQEKDRINETSIRPFRKSIYTHFPEYIDPRLRNYRSRYGAISND >NZ_CP040886|3746219:3790394|3746219_3747653_-|WP_000194515.1|DBSCAN-SWA MTQSRLHAAQNALAKLHEHRGNTFYPHFHLAPPAGWMNDPNGLIWFNDRYHAFYQHHPMSEHWGPMHWGHATSDDMIHWQHEPIALAPGDDNDKDGCFSGSAVDDNGVLSLIYTGHVWLDGAGNDDAIREVQCLATSRDGIHFEKQGVILTPPEGIMHFRDPKVWREADTWWMVVGAKDPGNTGQILLYRGSSLREWTFDRVLAHADAGESYMWECPDFFSLGDQHYLMFSPQGMNAEGYSYRNRFQSGVIPGMWSPGRLFAQSGHFTELDNGHDFYAPQSFLAKDGRRIVIGWMDMWESPMPSKREGWAGCMTLARELSESNGKLLQRPVHEAESLRQQHQSVSPRTISNKYVLQENAQAVEIQLQWALKNSDAEHYGLQLGTGMRLYIDNQSERLVLWRYYPHENLDGYRSIPLPQRDTLALRIFIDTSSVEVFINDGEAVMSSRIYPQPEERELSLYASHGVAVLQHGALWLLG >NZ_CP040886|3746219:3790394|3756929_3757229_-|WP_060504010.1|DBSCAN-SWA MTVVITYLADDNARNRRRARRQAQREQAMQEQRLARKIALKLSGCVRADKAASLVSLRCKKADEVERKQNRIYYRKPRSEMGVTCVGRQKMKLGSKPLI >NZ_CP040886|3746219:3790394|3752241_3752679_-|WP_060504002.1|DBSCAN-SWA MSKIDYQKLREIAEKTKIAGEAPVMPFDQRINALNDFMKHFSPDIALALLDERERNQQYIKLRDQENEEIALTVGKLRVELEAAEKRIAELQLREVVLPQCYSMLHRVDFDEPYHTEMVYRQHQVLEALHNAGINVTEAGKGEAS >NZ_CP040886|3746219:3790394|3779822_3781238_+|WP_060504042.1|DBSCAN-SWA MATWQQGINSGGFLAGIGTQNENAPKASDINATLGLIRENNELARSGANNVGLTALRGLAGVADIYKQEQQQKAINAFNKVHADAWASGDPSGLFKFAKENPAFVAQAQQAFSGLNEQQRNDMGDLAMRANVALSQGPEAYSKFITDNKDRLNRVGANADWMIQTGNQNPEQLSHMLTTMSLGALGPEKAFAVQDKMAGREIDRGKLAETIRSNQAGEALQARGQNLSYQSAMTGHNIAAQRLALDQQEFGFKMQQAQEKAQQLISEAPKLSVNMEKGIETAVNNATASSNSANSMSALAQQFRAEKPTTGLFGNAQNMFAKLTGSDTTLRDLRIRQNALVNSQVLKFLPPGPATDKDVEIVRQGAPTDMDNPETVARWLDAMANLERRNAQFNEFKAEWMSANGNPGQSRNGGQILGLDVKKGESLGSAVKRYMSMNTDAAPAQDSTPSGEPRNQVGSYTSKSGIQFTVE >NZ_CP040886|3746219:3790394|3765012_3765678_+|WP_060504021.1|DBSCAN-SWA 
MRYYEKIDGSKYRNIWVVGDLHGCYTNLMNKLDTIGFDTQKDLLISVGDLVDRGAENVECLELITFPWFRAVRGNHEQMMIDGLSERGNVNHWLFNGGGWFFNLDYDKEILAKALAHKADELPLIIELVSKGKKYVICHADYPCDKYEFGKPVDHQQVIWNRERISNSQDGFVKEIKGADTFIFGHTPAVKPLKFANQMYIDTGAVFCGNLTLIQVQGEGA >NZ_CP040886|3746219:3790394|3755648_3755801_-|WP_001243355.1|DBSCAN-SWA MRNEIAINHQMLRAAQNKAVIARFIGDSKMWLEANKAMKSAINLPWYRRK >NZ_CP040886|3746219:3790394|3784229_3784415_+|WP_000151196.1|DBSCAN-SWA MTIEERLNNIELNQTLLDQRLSDLELKDLDAQISEAEAKLSSLNHRKKQIRNRITQGRGSC >NZ_CP040886|3746219:3790394|3756450_3756921_-|WP_000167595.1|DBSCAN-SWA MTKSWSVPFPESETEHDGMPVFWRFQATVEEDGIKIFALQYIAFHQTEHYAWLVPAHWIVNFKPAPNQWLQEWKQRRNRYAIKKVAKNAERSFAFPTKKLAIESLLRRKKYHLMRIKQDLAVVSTLVDGMKNIDTSTPDIEYNFGHNQETENWVFY >NZ_CP040886|3746219:3790394|3769452_3769878_+|WP_000179915.1|DBSCAN-SWA MATEKKNVGRPSDYLPEVADDICALLASGESLVKVCKRPGMPAKATVFRWLSEHEEFRDKYAKATEARADSIFEEIFEIADTAIPDAAEVAKARLRVDTRKWALARMNPRKYGDKVTNELVGKDGGAIQIETSPMSTLFGK >NZ_CP040886|3746219:3790394|3759997_3761434_+|WP_001549089.1|DBSCAN-SWA MTDNFYAPPHSIEAEQAVIGGLLLDDDSSERVQKVLAMLKPDSFYSRPHKILFEEITRMHREQKPVDGLTLFDELERKSLTASVGGFAYIAEIAKNTPSAANIVAYAMQVRETAMERYAINRMTEATELLYSRNGMTATQKYEAIQAIFTQLTDHAKTGSRRGLRSFGEVMEDWVSDLEKRFDPSGEQRGMSTGIPSLDRMLSPKGLVKGSLFVIGARPKMGKTTLYSQMAINCAVHEKKPALMFSLEMPGDQILEKLVGQKSGVNPNIFYLPATNDADDGYQGDYDGDFNRAIETANRLSEIDLLYIDDTPGLSLAQIVSESRRIKREKGCVGMILVDYLTLMTAEKADRNDLAYGMITKGLKNLAKELDCVVVLLTQLNRALESRTNKRPLPSDSRDTGQIEQDCDYWVGIHREGAFDDSVPPGETELILRLNRHGNTGTVYCIQANGAIYDTDQQSAEMRRREREEPQSKKKGGF >NZ_CP040886|3746219:3790394|3784595_3784832_-|WP_001549440.1|DBSCAN-SWA MSREEPQINIRISKELKAKVKARAQHNKRSMNAEIIQIIEDAVCGRSLNSNEFAQKEADKFRDALIETLKTMYGKDAK >NZ_CP040886|3746219:3790394|3781237_3783241_+|WP_060504043.1|DBSCAN-SWA 
MKVTANGKTFTFPDGTSTEDIGTAIDEYFAGQSAPTQQGVQQSPADNSLASGYAQLATQQKEGLDRSAEQGAVLGAAMRDAVTGESRMTPEMERLQNVGSAPELNSLSTDALRAGLGQLFGSDASQEKILQSIGGKIRKDEKGNSIVTLPSGEYALNKPGLSPQDITSFLANALAFTPAGRAASVVGATLKSGATDLALQGATKIAGGENVNPVQTAISAGLGGVLKGVENTASAVSRSAMGKIAPEKQAQIDFAKQNNLPLMTTDLVEPGPNIGKQARAMAERIPIAGTGGIRNAQQKAREDLVRTFSDNVGGISDAQLYQSATRGQQQFIQAAGKRYDRIISLMGDTPVDITGTVKAIDAQISKLTRPGVSQDRSAVSVLQQFRNDITSGPNNLQLARENRTNLRKRFMAAPDEVDRDTLEKAAQSVYNAYTTDMKKAVGAKLGAKEAQNMSRVDRSWAKFNDMMSNTRVQKAIQSGKTTPEDVTKLVFSQSPAERAQLYRLLDDSGRQNARAALVQRAMDKATSDSGKLSVEKFINEMKRNRKQAETFFRGEHGKQLDGIMKYLDSTRQAATAAASPLTGQMVAGPAALITALAPVTNPMFAKVAAVGAGIGMAGRVYESRAMRNALLKLANTPKGSTAYDRAIRLVSETLTPLIQASSEKAQQ >NZ_CP040886|3746219:3790394|3765674_3766298_+|WP_060504023.1|DBSCAN-SWA MRLESVAKFHSPKSPMMSDSPRATASDSLSGADVMAAMGMAQSQAGFGMAAFCGKHELSQNDKQKAINYLMQFAYKVSGKYRGVAKLEGNTKAKVLQVLATFAYADYCRSAATPGARCRDCHGTGRAVDIAKTEQWGRVVEKECGRCKGVGYSRMPASAAYRAVTMLIPNLTQPTWSRTVKPLYDALVVQCHKEESIADNILNAVTR >NZ_CP040886|3746219:3790394|3758362_3758548_+|WP_000276886.1|DBSCAN-SWA MYKKDVIDHFGTQRAVAKALGISDAAVSQWKEVIPEKDAYRLEVVTAGALKYQESAYRKAA >NZ_CP040886|3746219:3790394|3755995_3756307_-|WP_001609782.1|DBSCAN-SWA MKLRVWHIPQVPMKPFIAEVASVEEGVRLMDALADYDAFQYDNNIKPDYCNANGLEMWDESLTDEDLSEMGLTDRWVDWYSECQCYDDPRKYLESLKEETSAA >NZ_CP040886|3746219:3790394|3764051_3764228_+|WP_000950963.1|DBSCAN-SWA MLSPSQSLQYQKESVERALTCANCGQKLHVLEVHVCEHCCAELMSDPNSSMYEEEDDG >NZ_CP040886|3746219:3790394|3764828_3765035_+|WP_000144614.1|DBSCAN-SWA MTFSVKTIPDMLVEAYGNQTEVARRLKCSRGTVRKYVDDKDGKMHAIVNDVLMVHRGWSERDALLRKN >NZ_CP040886|3746219:3790394|3767060_3767558_+|WP_060504025.1|DBSCAN-SWA MPPSLRKAVAAAIGGGAIAIASVLITGPSGNDGLEGVSYIPYKDIVGVWTVCYGHTGKDIMLGKTYTEAECKALLNKDLATVARQINPYIKVDIPETTRGALYSFVYNVGAGNFRTSTLLRKINQGDIKGACDQLRRWTYAGGKQWKGLMTRRDIEREVCLWGQQ >NZ_CP040886|3746219:3790394|3775799_3776231_+|WP_050484735.1|DBSCAN-SWA MATVLTKGEIVLFALRKFAIASNASLTDVEPQSIEDGVNDLEDMMSEWMINPGDIGYAFATGDDQPLPDDESGLPRKYKHAVGYQLLLRMLSDYSLEPTPQVLSNAQRSYDALMTDTLVVPSMRRRGDFPVGQGNKYDVFTSD 
>NZ_CP040886|3746219:3790394|3786145_3786640_-|WP_072156916.1|DBSCAN-SWA MNMITHKELTSSLNYNPETGVFTWKIASGSSSIGKVAGFKTNSQADYLSIRINGKSYLCHRLAWFYMKGCWPKGLIDHINGVKNDNRISNLREVTRGQNKTNSVSSSNTGIKGVHLLKGKNTYKVMFKLKGKSICLGYTDDIELAELMSIAFREKYHGEFACFR >NZ_CP040886|3746219:3790394|3754686_3755394_-|WP_060504008.1|DBSCAN-SWA MDLNKFDEPFSPEDIEWRIQQSGKTRDGKVWAMVLAYVTNRAIMKRLDDVCGKAGWRNEYRDIPNNGGVECGISIKIDSEWVTKWDAAENTQVEAVKGGRSGAMKRAAVQWGIGRYLYNLEEGFAQTSLDKKQGWHRAKLKDGTGFYWLPPSLPGWAIPASDNKPSPENTNQKSPSVDYEQILKDFSDFASKETDKKKLIERYQHDWQLMAGNEDAQAKCVQVMNIRVNELKQAA >NZ_CP040886|3746219:3790394|3783600_3783966_+|WP_000757526.1|DBSCAN-SWA MKKVNIGNVPKMLVPLFESGTIVFCRDFPEWQRLHQKLGVDVQDSDANGASHTMSSENGVLHVIGVFNGKLSTIAHECAHMAFDICSRVGVDVEPGRANETYCYLMSRLVEFCERHIKKPE >NZ_CP040886|3746219:3790394|3752675_3752843_-|WP_060504005.1|DBSCAN-SWA MRGLAYNPGILPAEMIIRQRVKPMPSREELLKRKSFGSVNDNKYLNAMWRKGVKQ >NZ_CP040886|3746219:3790394|3773429_3774314_+|WP_000426736.1|DBSCAN-SWA MENELIIDGQVIDLSETQENAEETIIQTESQPENESQDDNGKEVATEPEKTEETPEDYALRIGDEEIQLNADDDDHIDGQPAPQWVKDLRKGFKETQKENRELRRQLEEALAKPAEHQQPQPDAIPPKPTLESCDYDEQAFEQALTDWHEKKGRVEQQQQQKLRQQQEYQQRFQQRVEAHKQRAAKLPVKDYQEMEAIVLSELPPIQQEIIIHCADEGSELLAYGLGKSQQLRQRVAAETDPIRAAFLLGQISKQVSLAPKPKKAIKPEPEVRGGGADAKQDEFNKLCPGAKIE >NZ_CP040886|3746219:3790394|3761832_3762033_+|WP_000049638.1|DBSCAN-SWA MSKYEKLDQNILSMLSERPTPVFDIWLKWRSNGMYIETIDRRMQYLRKKGLVANVRGKGWVKINLS >NZ_CP040886|3746219:3790394|3751675_3752245_-|WP_060503998.1|DBSCAN-SWA MSTIPKERLEQLASGNAWYCVQDDEAAELARIALASLEAEPYGYVHKAAYEKTGSCGLSNDREAYRYSSTHVAVYTAPPAPVSVPAAMEMDDDFDSAFEHGKAVGWNACRAAMLQSQGNCIKDGWISCSDRMPEDTKMLLAFSQGEIVAAYWNWVVNPIDYKKYRAFTYLSGNILDDVTHWMPLPEPPL >NZ_CP040886|3746219:3790394|3769130_3769373_+|WP_000807785.1|DBSCAN-SWA MAEIIPMTEEQKFQLEIYKLVMNQNAAAEEAFQFIGTDELKLELFKIHFQSGGANSDITIRTFEAVRKSKEALDLFTTGA >NZ_CP040886|3746219:3790394|3775639_3775825_+|WP_000375639.1|DBSCAN-SWA MDRMSVFLAADNESGHVQAVIAEKDFQFFERLGFVASVDELKPTSKRGRKAADNGNSTDKG 
>NZ_CP040886|3746219:3790394|3779119_3779812_+|WP_032191772.1|DBSCAN-SWA MLYAFKLGRKLRGEEPWCPEKGGKGGSSDKSAKYAAEAQKYAADLQNQQFNTIMNNLKPFTPLAEKYVGSLENLSSLEGQGQALNQYYNSQQYKDLAGQARYQSLAAAEATGGLGSTATSNQLATIAPTLGQQWLSGQMNNYNNLANIGLGALQGQANAGQTYANNMSQISQQSAALAAANANRPSALQQGVSGAASGALLGGGIASALELSTPWGAGIGAGLGLLGSLF >NZ_CP040886|3746219:3790394|3753723_3754203_-|WP_023277046.1|DBSCAN-SWA MKVCSRCHQQKEERDFQIRRASRDGLTAACRACLAEYDKERAGLPHRVSARREYQSSERGRERCNAAKKRFIQSNPWKRKAHIIVGNFLRDGKLIRPPQCECCGSECKPQAHHCDYSKPTDVMWLCKSCHVEWHKHNKPIYPDEEPVTLPFPRHAIHAI >NZ_CP040886|3746219:3790394|3761509_3761836_+|WP_000796282.1|DBSCAN-SWA MADWQIPIIILAGASLVAGFILLKKHKDRDQKVEVLYGYPANSTTWLTIYHYRKSGRWVFEWDDLFAEKRPKSWGDISECMMFEERKSGATREEFNEAWARLSERGYL >NZ_CP040886|3746219:3790394|3757631_3758282_-|WP_000856967.1|DBSCAN-SWA MKTQLMGERIRARRKELKIRQAALGKMVGVSNVAISQWERSETEPNGENLLALANALKCSPDYLMKGEESLSNIAYHSRHDPRGSYPLISWVSAGCWMEAVEPYHKRAIDNWYDTTVDCSEDSFWLDVKGDSMTAPAGLSIPEGMIILVDPEVEPRNGKLVVAKLEGENEATFKKLVIDAGRKFLKPLNPQYPMIEINGNCKIIGVVVDAKLANLP >NZ_CP040886|3746219:3790394|3751399_3751579_-|WP_001277766.1|DBSCAN-SWA MSCPKCGSGNIAKEKTMRGWSGDYVCCDCGYNDSKDAFGERGKNEFVKINKDREGNGKS >NZ_CP040886|3746219:3790394|3784920_3785094_+|WP_001549438.1|DBSCAN-SWA MIVKSDAPKYPLRIPLEVKLAIEKSAKENGRSINTEMVMRLVDSLRRDSSKGNLAKS >NZ_CP040886|3746219:3790394|3762528_3762939_+|WP_060504015.1|DBSCAN-SWA MKQTIFLRSKQQQQAAINAILATPLDKDKPVTIRITDYKRNLDQNAKFHAMLADIASQVQWCGKWLKPEQWKVLLISGHAVATKQEADVLRGLEGEFVNIRESSAQMSVKRMASLIEYTTAWAIGQGVRFTDRRYE |
63 | Enterobacteria_phage(47.46%) | holin,integrase,lysis,terminase,portal,head | attL 3743739:3743755|attR 3793649:3793665 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_10 |
4038034 : 4047476
Sequences of DBSCAN-SWA_10
Nucleotide sequences of DBSCAN-SWA_10 >NZ_CP040886|4038034:4047476|DBSCAN-SWA AATGATTGAATTTAGCCATGTCAGCAAACTGTTTGGCGCACAAAAAGCCGTTAACGATCTCAATCTCAATTTTCAGGAAGGGAGTTTTTCGGTGCTGATTGGCACATCTGGCTCCGGCAAATCCACCACCCTGAAAATGATTAACCGCCTAGTGGAACATGACAGCGGCGTGATCCGCTTTGCCGGAGAAGAAATTCGCTCGCTGCCAGTACTGGAGTTGCGCCGCCGGATGGGCTATGCCATTCAATCTATTGGCCTGTTCCCCCACTGGAGCGTGGCGCAAAACATCGCTACCGTGCCGCAATTGCAAAAATGGTCGCGGGCGCGGATTGACGATCGTATCGACGAATTAATGGCGCTACTGGGGCTGGAGTCAAATTTGCGTGAGCGTTATCCGCATCAGCTTTCCGGTGGTCAGCAGCAACGTGTGGGAGTGGCGCGCGCACTGGCTGCCGATCCGCAAGTCTTACTGATGGATGAACCTTTTGGCGCACTGGACCCGGTAACGCGCGGCGCGTTGCAACAAGAGATGACGCGCATTCACCGTTTGCTGGGGCGCACTATTGTGCTGGTCACTCATGATATTGATGAGGCGCTAAGGCTGGCAGAACATCTGGTATTGATGGATCACGGTGAAGTGGTGCAGCAGGGGAATCCGCTGACGATGCTGACTCGTCCGGCGAATGATTTTGTCCGCCAGTTTTTTGGACGTAGTGAACTGGGTGTGCGCCTGCTTTCGTTACGTAGTGTGGCGGATTACGTGCGTCGCGAAGAACGGGCAGAAGGTGAGGCACTGGCAGAAGAGATGACGCTACGCGATGCGCTCTCCCTGTTTGTCGCGCGGGGATGCGAGGTGCTGCCGGTGGTGAACACGCAGGGCCAGCCTTGCGGCACGCTGCATTTTCAGGATCTGCTGGAGGAGGCGTAAGCGTATGAAGATGTTGCGCGATCCGCTGTTCTGGCTCATTGCTTTGTTTGTGGCACTGATTTTCTGGCTGCCTTACAGCCAGCCGCTGTTTGCTGCCTTGTTCCCACAACTGCCACGACCCGTTTATCAGCAAGAAAGTTTTGCAGCTCTGGCACTGGCTCATTTCTGGCTGGTGGGAATTTCGAGTTTGTTTGCGGTGATCATTGGCACTGGTGCCGGAATTGCTGTCACTCGCCCGTGGGGCGCGGAATTTCGCCCACTGGTGGAAACTATTGCCGCCGTTGGACAGACTTTTCCGCCTGTCGCAGTGCTGGCGATTGCCGTTCCGGTGATCGGCTTTGGTCTGCAACCAGCGATTATCGCCTTGATCCTTTACGGTGTGCTGCCCGTCCTGCAGGCGACACTTGCCGGGCTGGGAGCGATTGATGCCAGCGTGACAGAAGTTGCGAAAGGTATGGGAATGAGTCGTGGTCAGCGACTGCGTAAGGTCGAGCTACCGCTGGCGGCTCCGGTGATTCTGGCGGGCGTGCGAACTTCGGTGATTATCAACATTGGTACGGCGACGATCGCCTCAACGGTAGGGGCCAGCACGCTGGGTACGCCCATCATCATCGGGCTTAGCGGATTTAATACCGCGTATGTGATCCAGGGGGCGTTACTGGTGGCACTGGCGGCGATCATCGCAGACCGCCTGTTTGAAAGGCTGGTGCAGGCGCTTAGCCAGCACGCAAAATAAAGGTATAACCTGCGAGCATGACGCCACCAATTCCGCCTAACGCCATAAACAGGAACAGGGCGATGACCCCAATTTTAGCTATGCGCATATTGCACTCCTTATGTTAACGAAAGGATTGTACAGTAAAGCGCATTTGTTAACGAATCATTAAATGCCGAGTGGGAAAATATCATGGCCTTGTTCTTGCCAACTGGTGAGTTGCTGCTGTTGGGCGGAGGTTCGATTTTCACCGCACCACACCAGCAATGTACGGCCTTC
GAATAGTTCAGGGCGTAGTTGATTGAGCGAGTGGGCGAGGACATCAATGCGCCATCCTTGTTGACTGGCAATCCAGCCCTCCAGCCACAGACGGGTGGTATCCTGAATATTCCAGCCAACCACCAGCGCATCTTTACCCTGTTTTTTACGTGCCGAAGCCAGACAAATGGCGATGTAGTTGATCAGTACGCCGTCGAGGATCGCCAGCAGCGCCTGGAGAGTCGGTTGTTGGCATTGAAGTCGTCGGCGCAGAGGAATAAACAGATGTGTGGTGAGTGTCTGGGCGGGGTAATCCTGACCGCGCTCTTTGATCCACGTTCGCAGGCTATGCAGATTGCCGCTTTGCAGGTAGGTCAGTAATGTTTCTTGCTGATCGCGCCAGCCGTTCTGCACATCAACATTTTCATTACTGAGCAGCATTTTAACTTTGCTGACCTGCACGCCGTTGTCGATCCAGCGTTTGATCTCGCGGATACGGTCAATATCGGCATCGTTGAACAGTCGATGACCGCCGTCTGTCCGTTGCGGTTTCAGCAACCCGTAACGCCTCTGCCACGCGCGTAACGTGACAGGGTTAATATCACAAAGCAACGCCACTTCACCAATTGTGTAAAGCGCCATCGTCTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCAGTTTTGCGAACCAGGTAGTTTTGCCCGTTTTTTGTGCATCTATAGGGTGATTTTATTTTTGCCAGGCGATTTTGAGTGATCGTACTCACGAATTCTCATTTTTCTGCAAGAGTTCAAAGAAAGTTAAACGCAGGCAATGTATGTTACGCGTTTTAAAGGGAAGTGTGGTTTGCGGGTATGTACGATTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTTTTTTTAGTCATTGCGTGGTTAATGAGTAAAACGCCATTATTCATACCGTTAATGCAGGTCACGGTTCGTCTGCCGCATAAATTTCTCTGCTACATCGTCTTTTCCATCTTCTGCATCATGGGCACCTGGTTTGGGTTGCACATTGACGATTCTATTGCCAATACCCGTGCGATAGGCGCGGTAATGGGCGGCTTACTCGGCGGTCCGGTCGTCGGTGGGCTGGTTGGTCTGACCGGCGGCTTACATCGATATTCGATGGGGGGCATGACCGCGCTAAGTTGCATGATCTCAACCATCGTTGAAGGATTGCTCGGCGGCCTGGTACACAGCATCCTGATCCGTCGCGGGCGCACTGATAAAGTCTTTAACCCCATTACCGCCGGTGCCGTCACGTTCGTCGCTGAAATGGTGCAAATGCTGATCATCCTTGCGATCGCCCGACCTTATGAAGATGCGGTGCGTCTGGTGAGTAATATTGCTGCACCAATGATGGTCACCAATACCGTCGGCGCGGCGCTGTTTATGCGTATATTGCTCGATAAACGCGCGATGTTTGAAAAATACACTTCTGCTTTTTCTGCCACTGCGCTGAAAGTGGCAGCCTCGACGGAAGGCATTTTGCGACAGGGGTTTAACGAAGTGAACAGCATGAAAGTGGCTCAGGTGCTGTATCAGGAGCTGGATATTGGTGCAGTCGCGATTACCGATCGAGAGAAATTGCTGGCCTTTACCGGAATTGGTGACGACCACCATTTACCCGGCAAACCGATTTCTTCGACTTACACCTTAAAAGCGATTGAAACCGGTGAAGTGGTCTACGCTGATGGCAACGAAGTACCTTATCGTTGCTCTTTGCATCCGCAATGCAAACTGGGGTCGACGCTGGTAATTCCGTTGCGTGGCGAAAATCAGCGCGTGATGGGCACCATCAAATTGTATGAAGCCAAAAACCGTTTATTCAGTTCAATAAACCGCACGCTGGGCGAGGGGATTGCGCAACTGCTTTCGGCGCAGATTCTTGCCGGGCAATATGAGCGGCAAAAAGCGATGCTCACCCAGTCAGAAATCAAACTGCTTCACGCCCAGGTGAATCCTCATTTTTTGTTTAAT
GCGCTTAACACCATTAAAGCGGTGATCCGCCGCGACAGCGAACAGGCCAGCCAGCTGGTGCAGTATCTTTCCACTTTTTTCCGCAAAAACTTAAAGCGGCCTTCGGAGTTTGTTACTCTCGCCGACGAAATTGAACATGTGAACGCTTATCTGCAAATTGAAAAGGCGCGCTTCCAGTCGCGGTTGCAGGTCAACATTGCTATTCCGCAAGAATTATCCCAGCAGCAATTGCCCGCGTTTACCCTGCAACCGATAGTGGAAAACGCCATTAAACATGGGACATCACAACTGCTGGATACAGGGCGAGTGGCAATCAGCGCCCGACGTGAGGGGCAACATTTGATGCTGGAGATCGAAGACAATGCCGGTTTGTATCAACCGGTAACCAATGCCAGTGGGCTGGGGATGAATCTGGTGGATAAGCGTTTACGTGAACGGTTTGGCGATGACTATGGAATAAGCGTCGCCTGTGAGCCTGATAGTTACACCCGAATAACGTTACGACTACCATGGAGGGACGAGGCATGATTAAAGTCTTAATTGTCGATGATGAACCGTTAGCACGGGAGAACCTGCGCGTATTTTTGCAGGAGCAGAGCGATATTGAAATCGTTGGAGAGTGTTCAAACGCCGTGGAAGGGATCGGCGCGGTGCATAAACTGCGCCCGGATGTGCTGTTTCTCGATATCCAGATGCCGCGCATCAGTGGTCTGGAAATGGTGGGGATGCTCGACCCGGAACATCGTCCGTATATTGTTTTTCTCACCGCGTTTGACGAATACGCTATTAAAGCCTTTGAAGAACATGCCTTTGATTATCTGCTGAAGCCAATTGATGAAGCGCGACTGGAGAAAACGCTGGCGCGTTTGCGTCAGGAACGCAGCAAGCAGGATGTTTCGCTGTTACCGGAAAATCAACAGGCGCTGAAATTTATCCCTTGTACGGGGCATAGTCGGATTTATTTGCTGCAAATGAAAGATGTGGCATTTGTCAGCAGTCGGATGAGCGGTGTCTACGTTACCAGCCACGAAGGGAAAGAGGGCTTTACCGAATTGACATTACGTACCCTGGAAAGTCGTACACCACTACTGCGCTGCCATCGTCAGTATCTGGTTAACCTCGCGCATTTACAGGAGATTCGTCTGGAAGATAACGGCCAGGCCGAGTTGATTTTGCGTAATGGCTTAACCGTGCCGGTCAGCCGCCGTTATCTGAAAAGCTTAAAAGAGGCGATTGGCCTGTAAAAGACTGCTAAAATGGCTTTTTGCCTCATCAACACCTGAAGGCCTCATGCTAAGTAACGATATTCTGCGCAGCGTGCGCTACATTTTGAAAGCCAATAATAATGACCTGGTGCGTATTCTGGCGCTGGGTAATGTCGAAGCCACCGCGGAACAGATCGCCGTCTGGCTACGTAAAGAAGACGAAGAGGGTTTTCAGCGTTGTCCGGACATTGTTTTGTCGTCATTCCTCAATGGCCTGATTTATGAAAAACGCGGCAAGGATGAGTCTGCTCCGGCACTGGAGCCGGAACGTCGCATTAATAACAACATCGTGCTGAAAAAATTACGCATCGCGTTTTCGCTGAAAACCGATGACATTCTGGCTATCCTCACCGAACAGCAGTTCCGCGTTTCGATGCCGGAAATTACAGCGATGATGCGCGCACCGGATCATAAAAACTTCCGCGAATGCGGCGATCAATTTTTACGTTATTTTCTGCGTGGACTGGCAGCGCGCCAGCATGTGAAGAAAAGCTAAGACGGGTATGGCGGCCATGCGAAACATGGCCGCCGACAGATTATTTCACTTCTTTAAAACCAGCGGCTTTCATCACCAGTTCCATTTGCGCCATAGTGATACCTTTTTTGGCATCTTCAGCAGAAACGTTGATTCCTGAAATACCCTGCAGGGCTTTAAAATCCACTTTTTCCATATCGATAGTCACGTTTTCCTGCGCGTAGGTATCTGTATAGGTTAATTTTTCTTCAACACCCGCG
ATGTTTTTGTATTTGGCGCTTAACGGCTCAAGTGTCTTGGCAGCGTCTTCTTTGGTGGTTGCACCAATAGAAGCAAATTGAATTTTGGTTTCAGAAGATTGCTTAAGCACCTTGTCACCTTTGTAGACATAGGTAATGGCAATTTCAGTGCCGTTCAGATTGGCGCTGAATTTCTTCGATTCTTCTTTGTCACCGCAGCCAGCAAGAGAGAAAACCAGAACAGATGCAACAACGAGGGAAAACAGCTTATTGAAAGCCTTCATGTAAAACTCCATTTTATTTAATCAAGAAACTGGTGACTCTCACCAGGGGCTATATAGAATATGCCTAATACCGTGACGTGAGCAGTCCGGAACTGGAGTAGAACTCTTAGTAAAAAGCACTATTTCATCCTTGTTGCTGAAGCATGGGGAATAATTGTTCGCAAAGTAAAACACCGTTATTCATTGCTTCTACCCGTGCCTCGCTTTCTGTATTACGAAATTGTCCCAACACATGTGCCAGCCGATAAAAACCGACCGCGGAGAGGTCATTCGCCAGCAACTCTGCCTGACTAATAGCGCTCTGTTCCTGATAGCGCCAGCCGTTATGGAGCAGTTGAATAAGTAACGCCTGGCAGCGCATCAGCAACTGATGAGCGGTAGACGGCACAGGCAAAACGCTGGCAGAAGGTAGCGGTGCCACAGGCGCAGTTTCTGCGTCCAGCGCCCAGGCGCGGGTTTTTGTCATCATCACCCGTGGTTCCAGTGTCAATTGCCCATCAACAAAACTGACAAAGCCAGAAACCAGACTCACGGGGTCGTCTGTTTGTTGCAAAAGCGCCGCCATGCGTTCAACGGCAAAAGGAGAACAGGCAGATGCCGGAAGGGATAACGTCAGGAGATTATCTTCACCTTCGCCGCTGATTACCTGCGCATCCAGCGTCTGGCGGCTGCTATCCCAACCGAGCGAAATACACTCAGCGACCGGCAGAATAAATAAGTTATCGACCTGATTAAGAGGCCGTATGCAGGCGGGGGGACGCTGGCGTAAATATTCCCGCAAAGCCACAATGCCCGGCTGGCGTAACGGCGCGCTCAACATTTGCCAGGCATCAGGCGACAGCGGCACAACGCTGCTTAAGCGGTTGCGGGTAGCTAACAGCAGCTCGCCATCGGCACTGCGTTTTGCTGCTTGTGAAACAATTTGCCCACCCGCCAGTGCGCCAGCCTGAAAACTAAACAGCCGACGCGTAGCTGCCGGTGAGTTTTCCTGTTCACTTCGCGGCCAACTGCGCGAAAGGTGCAAAATACTGCCGGTGTCGGGATCGGTAAACCAGATGCGTAAACCATAATGCTCAATATCCTGCCAGCAACGCATACCTAAAGACACCAGCCGCAGATGATCAAGCTTTGCTTCTCCGGCAATGCCAGAGCCAACGACCGTGCGCCACGGCACAGGAGGAACTTCACCAACACTGTCGCGCCGGGCCATCTCTTGTGCGCAATTTAATCGACTGTTTAATGCCGCAAGCTGACGTAAGCATTCTCCGGCATGATAGTGGCTGGCGCGGACGTGGAAGGCATCAACGCTTGCCCGTAGCTGTCGTAGTGATTCACTCACCCATCGCCAGTTGCAGCGTTCCGCCGCCTGCTGCGCGCGACTGAAAGCGGCCTCGTAGTGAATAAGCGGCTGGCTGATGCCGCCCAGCCATAATGCCTGGCTTAATTGCTGAACATATTGACGACACGCGTTACCCTCGTCGTTGGCAAACGGATCGTCAGATGATGTGACGTGTTCGCTGCGCATCTGCCAGATTAAATGAGTAAATTCCGCTTGCTGAGTTTTGGCCTCGACGAAGGCCTGCACAGCCAGTACGACATGTTCGCAAAGTGTGCCTTCAATACAATCACAACGGGCGAAACGAATACTGCTGCGGGAATAAAAACGCACATCGCTCATCGGTAAGCGGGCAGAGGGAATTTCGCCCGGCGTACAGAACAACTCAATGGTGATGCCTT
TACCGACCAACGCCTGTGCGCGTTTGCGGGTGGCATCGGGCAGGGTAGCCAGTTCTTCCAGCCAGATTGCCGGATCCCACGCTTCTTCTTTTTCCGTAGGCTGGGCGGTAGTACAAAGTCGTTGATAACTTAACACAAGCATCACGCGATGACGGCACATACCGCTGGCCCCGCAGCTGCACTGAGCCTCTTTCAGTGCCTGGCCGTTCGCCAGCTGGGTACGGACACCGTCACTGAAGGTGGCGATTAAAGCGCTGTTCTCATGGCTGATTTCCGGGACGTTGCCATTTTCCAGTTCCTTAAGACTGCGCTTAACAAAACCGGCATTGCTTAACGCCGTCAGGGCCTGCGGTGTCAGTTCTAATAATTCCGGACGTAGTGAATTCATGACTGAAGATTCTCCGCAAGCCATGATGCCAGCTCGCCCGGTGTCATGGCGGCTATTTGTGCGCCGACATTTACCAGCGCCTGGGCCGTATCGTGGTCATAGCAAGGTGTTGCTGTGCTATCGAGCGCTGCCAGTCCCAGCACTTTGATGCCGCTCTGGACACACTTTTTCACCTGATGCGTCAGTAATGATGATGAACCCCCTTCGTAAAAATCGCTCACGAGGATAATGACGCTTTTCGCTGGTTGTTCAATAAGTTGCCGACCATACTCCACGGCACTGGCGATATTGGTCCCGCCGCCCAACTGTACTTTCATTAATAACTCTACCGGATCGGCAACGTCTGCCGTGAGATCAACGACGCTTGTGTCAAACGCCACCAGATGGGTACGAATGCCGGGTAACTGCCACAAACAGGCCGCCATCACCGCAGAGTGGATCACCGAGTCCACCATCGATCCGCTTTGATCAACCAGTAAGACCAGTTGCCATTGTTCGCTTTGGCGTTTAATGCGGCTGTTAAAGCGGGGGGATTCGATATACAACTTGCCGTGTTGCGGGTGCCAGTGTTGCAGGTTGGCGCGCAGAGTACTTTTGAAATCAAAGTTTCGCGCCAGTGGAATTAATGAGCGGCGACGGCGATCGCGGACACCAGAAAAAGCCTGACGAACTTCCTTTGCCAGTCGCGCCATAATTTCTTCAACAACCTGGCGCACTATCTGGCGGGCGGCAGCCAGTACTTCGGGATTCATCAGATGTTTGGTGTGCAAAACGGCGCGTAGCAGGCTTTCCGAAGGCTGCATACGTTCCAGCACGTCGAGATTTGTCACCACATCTTCAATGCCGTAGCGCAGTACGGCATCGCTTTCCAGCCGCTCAATCACCTGTTGCGGAAACAGCGTGTGGATACTGTTGATCCACTCAGGAGTGGTGAGATTTGAGCCACCTAATCCACCAGAGCGTTCACCACGCTGGAGCCGTTCAGGATCGCGCCCATACAGCCACTCCAGCGCGTGATCTATCTGCCGGGCGTTGTCATCCAGCCCACAAAGCGTCGTTTCTGCCGCTTCGCCAAGAATTAATCGCCAGCGTTGTAGCTCACGGGTGGTCAGAAGATCGTTCAGTTCAGACAT
Protein sequences of DBSCAN-SWA_10 >NZ_CP040886|4038034:4047476|4042479_4043199_+|WP_000598641.1|DBSCAN-SWA MIKVLIVDDEPLARENLRVFLQEQSDIEIVGECSNAVEGIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIDEARLEKTLARLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMKDVAFVSSRMSGVYVTSHEGKEGFTELTLRTLESRTPLLRCHRQYLVNLAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >NZ_CP040886|4038034:4047476|4038034_4038961_+|WP_000569361.1|DBSCAN-SWA MIEFSHVSKLFGAQKAVNDLNLNFQEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGVIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWSVAQNIATVPQLQKWSRARIDDRIDELMALLGLESNLRERYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHRLLGRTIVLVTHDIDEALRLAEHLVLMDHGEVVQQGNPLTMLTRPANDFVRQFFGRSELGVRLLSLRSVADYVRREERAEGEALAEEMTLRDALSLFVARGCEVLPVVNTQGQPCGTLHFQDLLEEA >NZ_CP040886|4038034:4047476|4039844_4040576_-|WP_001240401.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWIDNGVQVSKVKMLLSNENVDVQNGWRDQQETLLTYLQSGNLHSLRTWIKERGQDYPAQTLTTHLFIPLRRRLQCQQPTLQALLAILDGVLINYIAICLASARKKQGKDALVVGWNIQDTTRLWLEGWIASQQGWRIDVLAHSLNQLRPELFEGRTLLVWCGENRTSAQQQQLTSWQEQGHDIFPLGI >NZ_CP040886|4038034:4047476|4044342_4046343_-|WP_001374182.1|DBSCAN-SWA MNSLRPELLELTPQALTALSNAGFVKRSLKELENGNVPEISHENSALIATFSDGVRTQLANGQALKEAQCSCGASGMCRHRVMLVLSYQRLCTTAQPTEKEEAWDPAIWLEELATLPDATRKRAQALVGKGITIELFCTPGEIPSARLPMSDVRFYSRSSIRFARCDCIEGTLCEHVVLAVQAFVEAKTQQAEFTHLIWQMRSEHVTSSDDPFANDEGNACRQYVQQLSQALWLGGISQPLIHYEAAFSRAQQAAERCNWRWVSESLRQLRASVDAFHVRASHYHAGECLRQLAALNSRLNCAQEMARRDSVGEVPPVPWRTVVGSGIAGEAKLDHLRLVSLGMRCWQDIEHYGLRIWFTDPDTGSILHLSRSWPRSEQENSPAATRRLFSFQAGALAGGQIVSQAAKRSADGELLLATRNRLSSVVPLSPDAWQMLSAPLRQPGIVALREYLRQRPPACIRPLNQVDNLFILPVAECISLGWDSSRQTLDAQVISGEGEDNLLTLSLPASACSPFAVERMAALLQQTDDPVSLVSGFVSFVDGQLTLEPRVMMTKTRAWALDAETAPVAPLPSASVLPVPSTAHQLLMRCQALLIQLLHNGWRYQEQSAISQAELLANDLSAVGFYRLAHVLGQFRNTESEARVEAMNNGVLLCEQLFPMLQQQG >NZ_CP040886|4038034:4047476|4039677_4039785_-|WP_001216963.1|DBSCAN-SWA MRIAKIGVIALFLFMALGGIGGVMLAGYTFILRAG >NZ_CP040886|4038034:4047476|4046339_4047476_-|WP_001292773.1|DBSCAN-SWA 
MSELNDLLTTRELQRWRLILGEAAETTLCGLDDNARQIDHALEWLYGRDPERLQRGERSGGLGGSNLTTPEWINSIHTLFPQQVIERLESDAVLRYGIEDVVTNLDVLERMQPSESLLRAVLHTKHLMNPEVLAAARQIVRQVVEEIMARLAKEVRQAFSGVRDRRRRSLIPLARNFDFKSTLRANLQHWHPQHGKLYIESPRFNSRIKRQSEQWQLVLLVDQSGSMVDSVIHSAVMAACLWQLPGIRTHLVAFDTSVVDLTADVADPVELLMKVQLGGGTNIASAVEYGRQLIEQPAKSVIILVSDFYEGGSSSLLTHQVKKCVQSGIKVLGLAALDSTATPCYDHDTAQALVNVGAQIAAMTPGELASWLAENLQS >NZ_CP040886|4038034:4047476|4043756_4044218_-|WP_001295429.1|DBSCAN-SWA MKAFNKLFSLVVASVLVFSLAGCGDKEESKKFSANLNGTEIAITYVYKGDKVLKQSSETKIQFASIGATTKEDAAKTLEPLSAKYKNIAGVEEKLTYTDTYAQENVTIDMEKVDFKALQGISGINVSAEDAKKGITMAQMELVMKAAGFKEVK >NZ_CP040886|4038034:4047476|4043245_4043716_+|WP_001295430.1|DBSCAN-SWA MLSNDILRSVRYILKANNNDLVRILALGNVEATAEQIAVWLRKEDEEGFQRCPDIVLSSFLNGLIYEKRGKDESAPALEPERRINNNIVLKKLRIAFSLKTDDILAILTEQQFRVSMPEITAMMRAPDHKNFRECGDQFLRYFLRGLAARQHVKKS >NZ_CP040886|4038034:4047476|4040797_4042483_+|WP_001295431.1|DBSCAN-SWA MYDFNLVLLLLQQMCVFLVIAWLMSKTPLFIPLMQVTVRLPHKFLCYIVFSIFCIMGTWFGLHIDDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSILIRRGRTDKVFNPITAGAVTFVAEMVQMLIILAIARPYEDAVRLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSTYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKAMLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEFVTLADEIEHVNAYLQIEKARFQSRLQVNIAIPQELSQQQLPAFTLQPIVENAIKHGTSQLLDTGRVAISARREGQHLMLEIEDNAGLYQPVTNASGLGMNLVDKRLRERFGDDYGISVACEPDSYTRITLRLPWRDEA >NZ_CP040886|4038034:4047476|4038965_4039697_+|WP_000783120.1|DBSCAN-SWA MKMLRDPLFWLIALFVALIFWLPYSQPLFAALFPQLPRPVYQQESFAALALAHFWLVGISSLFAVIIGTGAGIAVTRPWGAEFRPLVETIAAVGQTFPPVAVLAIAVPVIGFGLQPAIIALILYGVLPVLQATLAGLGAIDASVTEVAKGMGMSRGQRLRKVELPLAAPVILAGVRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIADRLFERLVQALSQHAK |
10 | Enterobacteria_phage(85.71%) | NA | NA |
Homologous phage analysis in the prophage region
Bacterial proteins shown in color are those whose annotations contain specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
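The keyword rule in the legend can be illustrated with a minimal sketch. The keyword list is copied from the legend; the function name `phage_keywords_in` and the matching rule (case-insensitive, whole word, optional plural "s") are assumptions made for illustration, since the report does not state how the tool actually matches annotations.

```python
import re

# Keywords copied from the report's legend.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)

# Assumed matching rule (not confirmed by the report): whole-word,
# case-insensitive, with an optional plural "s".
_PATTERN = re.compile(
    r"\b(" + "|".join(map(re.escape, PHAGE_KEYWORDS)) + r")s?\b",
    re.IGNORECASE,
)


def phage_keywords_in(annotation: str) -> list[str]:
    """Return legend keywords found in an annotation, in order of first appearance."""
    found: list[str] = []
    for match in _PATTERN.finditer(annotation):
        keyword = match.group(1).lower()
        if keyword not in found:
            found.append(keyword)
    return found


if __name__ == "__main__":
    # Hypothetical annotation strings, not taken from the report.
    for text in ("phage tail fiber protein", "Lysine decarboxylase", "tRNA-Arg"):
        print(text, "->", phage_keywords_in(text))
```

With whole-word matching, "Lysine decarboxylase" is not flagged by the 'lysin' keyword, but compound names such as "endolysin" are also missed; a plain substring search would make the opposite trade-off.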
DBSCAN-SWA_11 |
4692223 : 4716004
Sequences of DBSCAN-SWA_11
Nucleotide sequences of DBSCAN-SWA_11 >NZ_CP040886|4692223:4716004|DBSCAN-SWA GATGATTAAAACAACGTTACTATTTTTTGCTACTGCGCTGTGTGAAATTATTGGATGCTTTCTGCCCTGGTTGTGGTTAAAACGAAACGCCAGTATCTGGCTGTTGCTTCCGGCGGGGATTTCACTGGCGCTGTTTGTCTGGTTGTTAACGTTGCATCCAGCGGCGAGTGGGCGTGTTTACGCGGCTTATGGTGGCGTTTATGTCTGCACGGCGTTGATGTGGCTGCGCGTTGTGGATGGCGTGAAACTGACTCTTTATGACTGGACGGGTGCGTTGATTGCGCTTTGCGGCATGTTGATCATTGTTGCGGGCTGGGGGCGCACGTAGGAACATAAATCCATTTTATCAATAAGATAAGAGGAAGTGTCAGCTGACAAAAGGTATTCTATTTCATCTTTTGTCAACCATTCACAGCGCAAATATACGCCTTTTTTTGTGATCACTCCGGCTTTTTTCGATCTTTATACTTGTATGGTAGTAGCTCAGTTGCGTAGATTTCATGCATCACGACAAGCGATGCAAGGAATCGAACATGAAGATCGTAAAGGCTGAAGTTTTTGTTACCTGTCCGGGGCGTAATTTCGTCACATTAAAAATCACCACTGAGGACGGTATTACGGGCCTTGGGGATGCCACCCTCAATGGACGTGAGCTTTCCGTGGCCTCTTATTTGCAGGATCACCTTTGTCCGCAGCTTATTGGTCGCGATGCGCACCGTATCGAAGATATCTGGCAGTTTTTCTATAAAGGTGCTTACTGGCGTCGCGGTCCGGTTACGATGTCGGCCATTTCAGCGGTTGATATGGCGCTGTGGGATATTAAAGCCAAAGCTGCCAACATGCCGCTTTACCAGTTACTCGGCGGCGCGTCTCGTGAAGGGGTGATGGTTTATTGCCATACCACCGGTCACAGTATTGATGAAGCTCTGGATGATTATGCCCGTCATCAGGAGCTGGGATTCAAAGCCATCCGCGTGCAGTGCGGAATCCCTGGTATGAAAACCACCTACGGCATGTCGAAAGGTAAAGGTCTGGCTTATGAACCCGCAACCAAAGGACAGTGGCCGGAAGAGCAGCTGTGGTCGACGGAGAAATACCTCGATTTCATGCCGAAATTGTTTGACGCGGTACGTAACAAGTTTGGTTTTAATGAACATTTGCTGCATGACATGCACCATCGCTTAACGCCTATTGAAGCGGCGCGCTTTGGTAAAAGCATTGAAGATTATCGCATGTTCTGGATGGAAGACCCGACGCCTGCAGAAAACCAGGAATGCTTCCGTCTCATTCGCCAACATACCGTCACACCCATCGCAGTGGGTGAAGTCTTCAACAGCATCTGGGACTGCAAACAACTGATTGAAGAGCAACTCATCGATTATATCCGCACCACGCTGACCCATGCAGGCGGAATTACCGGTATGCGCCGGATTGCCGATTTTGCTTCGCTGTATCAGGTACGTACTGGCTCACACGGTCCTTCCGATTTGTCACCAGTCTGCATGGCTGCGGCGCTGCACTTTGATCTGTGGGTCCCCAATTTCGGTGTCCAGGAATACATGGGTTATTCCGAACAAATGCTCGAAGTCTTCCCGCACAACTGGACTTTCGATAACGGCTATATGCATCCGGGAGACAAACCGGGTCTTGGCATCGAATTCGATGAAAAGCTGGCGGCGAAATATCCCTATGAACCTGCTTATCTACCAGTCGCACGTCTGGAAGATGGCACGCTGTGGAACTGGTAAGGAGTAAGATAATGAAAAGCATATTAATTGAAAAACCGAATCAACTGGCGATTGTCGAACGTGAAATACCCACCCCGTCAGCGGGTGAAGTACGAGTAAAAGTGAAACTTGCCGGAATTTGTGGTTCAGATAGCCATATTTATCGTGGGCATAATCCTTTTGCGAAATATCCGC
GCGTCATTGGACATGAATTCTTTGGCGTCATTGATGCGGTGGGTGACGGCGTGGAAAGCGCCAGAGTCGGTGAACGTGTTGCTGTCGATCCGGTGGTCAGCTGTGGGCATTGCTATCCGTGCTCTATAGGTAAACCGAACGTTTGTACGACACTGGCTGTTTTAGGTGTGCACGCTGACGGTGGTTTCAGTGAATATGCCGTGGTTCCGGCAAAAAATGCGTGGAAAATTCCTGAAGCAGTGGCCGATCAATATGCGGTAATGATCGAACCTTTTACCATTGCGGCTAACGTAACCGGACATGGTCAACCGACTGAAAATGATACCGTTCTGGTTTATGGTGCCGGTCCAATCGGCCTGACGATCGTTCAGGTATTAAAAGGCGTCTATAACGTTAAAAATGTGATTGTTGCCGATCGCATTGATGAACGACTGGAAAAAGCGAAAGAGAGCGGGGCAGACTGGGCGATCAATAACAGCCAGACACCGCTTGGCGAGATTTTCGCTGAAAAAGGCATCAAGCCGACATTAATTATCGATGCGGCTTGTCATCCTTCTATCCTGAAAGAGGCCGTAACGCTGGCTTCTCCAGCGGCACGTATTGTATTGATGGGCTTCTCCAGTGAACCGTCTGAAGTGATTCAGCAAGGAATTACCGGAAAAGAACTCTCTATTTTCTCTTCACGCTTAAATGCAAATAAATTTCCGGTTGTTATCGACTGGTTAAGTAAAGGGTTAATTAAACCAGAAAAATTAATTACCCATACGTTTGATTTCCAGCATGTTGCTGATGCCATTAGTTTATTTGAACAGGATCAAAAACATTGCTGCAAAGTCTTACTCACTTTTTCTGAATAATACCAATAACGGCGAGTAAGTAGTACGCATCTTACCTCTTTTTTAGAGATAACCATTATGACAATAGAAAAACATGAAAGAAGCACTAAGGATTTGGTGAAAGCAGCAGTATCGGGATGGCTGGGCACTGCGCTTGAATTTATGGATTTCAAGAGTCATGCGTGTTAACTATTTGATAAATATTAAATTAATTTTTCATTGCTTCGTTATGGGGCATGGTTGGGGCAAACTCGCTTAACTGTGTATTTAACAAAGCTACCTGTGCATTATTGTTTTCAGACATCCATTTTCCGTATACCTGAAATACCATTTGCGCATCTGCATGGCCCATCTGGTTTGCTATAAATGCCGGGTTAGCACCAGCTGTCAGCGACCAGCAGGCATAAGTATGTCTCGACTGATATGATTTTCGATGGCGGAGTCCGGCACGTTTTATCGCTGCGTCCCACATCTGCCTTATTGAGTCAACGGTAAAATGGTCACCATAATTTTTTACTCTCGCTGACACTTCAGGTTGAAAAACAAAGGTGCATTTTTGTTTTTCTGTTCTGCCATACTCTCTGAGGTGAACATCAATGATATGCTCTTTGCTCAGTCTCGTTAATGTCATCTGACTCCGGAGAGCGTCGATTGCTGGCTTAATAAGATGAATGACCCGATTGGTTCCCGCCTGTGTTTTTGGTACCGTGAAACGGTCTTTTGCTAAATTTCTCCTGATCATCATTGTTCCATTTTTCAGATCTATGTCCTCCCATCCAAGTGCACACAGCTCACCAGGGCGAACGCCAGTATAAACAGAAACACACCATAAATTTTTTGCTTGCTGATTTCTGCACGCATCGATAAGACGGATAAATTCTTCCCGCGAAAGAGGATCCGGAATGGTTCTTGATTCCTTTAATGGCGAGATCCCCTTAAACGGATTATCTGCCAGGTAACCGTTATCAACACCAAACTGGAACACGGCGTTAAGATTTGTCATGTAATTATTTACAGTTACAGCCGATCTCCCTGGTTGTGTAACAATATAGTTACTTTTGGGGATCTGGTATCCAGTCAGTAACTCTTTACGAACCTCCAGTAATTTTTCTTTATTAATCGATGAGGCAAGATTTTTTTCACCGATTATGCTCAGGATATTTT
TGATGACGGCACGGTATGTGTTGAGTGATGTTTTGGCGACTTCAGTTTCTTTCAGTGCCAGAAATTTTTCAGCCAGTTCTTTTATGGTTAAATCTTGTCGGGCCTCACCAAATTTTTCCAGATTGCGTGAGGAGGGAAACTGTTTTGCATAGTCGAAAACACCAGTTTTTATTGCGTAACAAACAGAGGAGCGTAGTTCACCTGCAACGCGCCTGTTTTTTGCTGTGTCAGGAACCCCCAGGTTTTCCCTGACTCTTACGCCTTTATAAACAAACCAGATACGTAATTTCCCTCCATGGTTTTCCACGCCTGTCGGATATTTCATTTCAACTTCTCTCATTAGTTAGTGTGGCTTTTAGTCAAGTAAGATGACGTCTTGGTCTCGCTGATGCCTGGCGCTCAATCCAGCGATCAATTTCTTCCAGGTTGTAAAAGCATGGACTGTTATCCCATGGCATACCGTCATGAGCGACATGCTTATATTCCCTTCCTTCCATAAACGATTTTTCCCGGGCCTTTTTTAACGTACCTTTTTTTATTCCTTTCAGCGCAATTAACTGCTCTTCGGATACCCATTTGCCGGGAGAGACAATCATGATTACTTCGCTCATCGATTTCTTTATCTCTTACATTAGACGAGCGCCGGTTGCAGAATACCAGTCACAACCGGCGACAGTTGAACATTAAGAATCAGCCTGACTCGGGATCAGTTTTTGCCAGATAACTGAAACGTATTTTGCCTGGTAACGGGCGTCATCAAGTGCATTATGGCGCTCACCTTCGAATTGAATAGCCGTTCTGGCATCGAAGTCTATGACTTTCCCCAGCTCAACGATTGTGCGTACATCGCGATCGTTGTAGTAACGCCACGGGCAGGGGATCCCCTGCCGTTCGTATGAACGGCGCAAAATCGTGTTGTCGAAGTTGGCTCCATTCCCCCAAACCTGAACAAAAAATTCACCGGAGTTTTCGTCGATAAATTCCCGCAATTGTAACAGTGCATCATCTAACGGGATTTCATCGGTCATAATGGCAGATTGCGCTTCGCGTGATTGCTTAAGCCACCATTTAATGGTGTCCCGATCAATGACTCCGCCAGCAGTTTCCAGATCGATAGTCTTACTAAATTCCGGTCCCATATCTCCGGTTTGCGGATCGAAAAATATTGCACCTATTGAGGTGATCGGGGCATCAGGATTTTTTCCCATGGTTTCAAGGTCGATCATTAGATGGTCACACGTCCTGCTGGTGGATGTGATTTCGTGATGACCGTTCACCTTAATTGGGTGATCTGCCGTCTCGCCAGTTTCATTATCGCTATTGTGATGCTGATTGCCGCCAGTGTTCTCCTTGTGTGGATGTTCAGCGCCTTCCATTTCCTCCGGATCATCTTCCTGAACTTCAACCTGATACTCTTCATCGAATGTTTCTTGGTATGTTGCGTCGCCCATCACCGCGCCACAATCAGGGCAGTTGCCGCCGCCGGTCTGACCGCAGGCGGTGCAGACTTTTTCCACTTCCTGTTGCGCTACTGGTTCAGGCTGTTTCGTTTCTGGCTCGTTTTGTAACGCATTTGGACTGTTTTGTTCCGCTTTTTGGTAGTTCCGTTCCGATTCATGCTGGTTCTGGTTCACAGAATCGCGGGTCTGGATCCCCTTAACCCATTTCGGATCATTCGGGTCACTAATCCCTTCAACAAATTCACCACGTGATGCAGCAAGCAACTTATCAGCGTCAGGCTGGCTGATATTGGCTGCCTGCATAATTTTGTTTACTTCGTCAGCGGTAACTTTTATCGGCTCTGGTTGTTCTGAATCTTCAGCGGTATCTACATTTTGCGGTAAGCCCGTGTATGTGCCATTTTTTCGGGCAAAATATTCTTCTTTTGTGATTTCAGTGGCGCCAGCAGCCAGTGCCTTATCCAGACCAGAAAGTTTGTTTGCGCGACCGTATTTTTCTCCGTCCTTATCTGCGAAGAGGAAATAGAACGGCCCCTCACG
CTCTACAGATGGTTCAGCTTCCGGCGCGGTTTCATTTTTTGGGATATCAGATACCTCAGTTTCCACTGCATCAGTTTGTGTTTCTGATGACTGGAGAACATCAACAGTGCCCAGGTCTGTTTCTTCATTCTCAAACACGCCCTTTGTCGTCAGGTATTCGCAGATATATTTGTTCAGTGCTACGGGATCTTTGTGAATGTCGATCGGACGCTCACGGACAAGGCCAAAAATAGTTTGGCGGTCGTAGCGAAGGGCATCAGGCTGTTTGCGCATTGATGCCGAGATACGCTTCCAGTCTTCGCGGTCGTTGTCGATAACTTCTTTTTTTGCCCAGCGATGGATGCTGCCGTCAATGTTTCCGGCATCCACATCACCAGGCCAGAGAGCGTAGGCCAGTTCGTCATCCAGTGTTTTCCATGTCTGCTTGTATTCGCGATGAATGGCAGCAATGACCGGGCTGATTTTTCCTGTTGAATTTTCAGTGTACTGTTGATTGGCTCTGGCGCGTGCGAGATCAACAACAGACGTGTATTTTCCGGTTTCCTTGCGTTCACCTTCGCGACGTTTTTTCCAGATGCGCATCTCTGCCTGAATTTCGGGCCATTTAGCACCAGGAATACATTTATGCTTAACCCACCCAATGGCGTGCAACTTAAGCTCCGGATACATGGCGTTAACTTCTGGCATTTTCATCAACGCTTCAACGATATGTCCGTCGAATGTTGCCATGTCTTCCTGCAACAATTCCTGTGCGCTAATAACCATATCAACGGTGATGTTTTCACATGTGTCGAACTTAACCATGACAGCGTTCTGTACTTCAGGGGCCAGCTTGTCAAAAGTGACGTTCATCGGATCGGATTCAGTCTCAACCGGGACAAAAGAAGCAGACTCCTCATCCCAGCGGTTTTCCTGCATATATTCAGCATCCCATGAATCGAGGGCAGGGCGGGGTATGCCAGGTTTATCCTCGCAGACAATAAATTTATAAGCGCAGTCCTGAGCAGCCGGATAATGTTCCAGGAATTGCCAGTGAAATTTTGCTCGAGCACGGCGTTCGTCGCCAGCTTCAATGGCTGTGGCTACAGCCACAGCGCCTTCTTCCCTTGTTGCCAGTTCGTCAGGAATAGCGGCGCAAATAAAGACTTTACTCATTTTGTTTTAACCTCATGACAGATTTAAGGATGAACAAATCCCTGCCATTGCTGGCATATAAGAATGAAACCGGATATTTATTACGGAACTGTTTTAAAGACCTGCCGGGATTTCGATATTATCCTGGTTAATAACTTTATCGACCGGGTAACAGTTACCGGGAATTTTCTGTTCGGTTGCTGCAGTCATACACTCCTGCATTGTCCTGTGAACACTGACTGCAATATCAACTGGCACTCCGGAAACAAGAAAAACTGTCAGAACAAGCGCAAATGCTGAATTCATTGTGCACATCCTTTTGGCATCAGACGTAAACGAGCCAGCATTGAAACAATGCATATTTTATTTAATAGCTCCCGTTCTTGTTTTCTCTTGTTAATGGCATCTTCAGTAAATACAGGGTTACTGATAGTGACACCAATTTCAAAACAACCTTCAGACGTATTAACGTTTGGTAATAACGTTTTCATTATCGCGTCCTCAACAATGAATTTTGTGATGCAGTGCCTGGTGCCTCCAGGTGACGTTAACCAGTTAACAATTAACGTCGGATATCCGGATTAGTGATTTCAGGTTGTATCGTGAGATCAGTGATGGAAAAAGTATTACGTACATGATCGCCGGGTTAAATAAAGAATATGGCGATGTGGTGGAATCCGGACTGCTTTTTGCAGATCCTGCCGTTGTAGATCGTGAAACTGACGAACTTATAGAAAAAGCAATTGCTTTCAAGCTTGCGTATCGACAGCAATACCAACAAAAAGCTGGATGGAATTATGAGTCTTCTTTTTGCTGAACGCCCACTGGTTATAAACACGCAGCTGGCAATGAAAATTGGCTTAAAC
GAAGCCATTGTTTTGCAACAACTGCACTACTGGTTGAGAGATACCAACTCCGGCATGGAATGTGATGGTGTTCGCTGGATTTATAACACAACGGAACAATGGCTGGAACAGTTCCCATTCTGGTCAGAGTCAACGTTAAAGCGCGCGTTTGCAAGTCTGAAAACGCTGGGGCTTTTGCGTTGTGAAAAGCTCAATAAATCAAAGCGCGATATGACCAATTTCTACACGATCAACTACGGGAGCGAGCTTTTAGATGGTGGCAAATTGAGCGAATCCATCGGTTTAAAATGCGCCGCTCCATCAGGTCAAAATGACACGATGGAAGAGGTCAAAATGAAACGCTCCATTGGTTCAAAACGACTCAATGTCATCGGGTCAAAATGGCCTGATGATCTTACAGAGAATACAACAGAGATTACTACAGAGAATAAAAAGACTTCTCGTCCGGAAGCTTCGCAACCGGACCCGCAGACGGTTGAACAGGATTTTTTAACCCGACACCCTGACGCGGTTGTGTTCAGTGCGAAAAAACGCCAGTGGGGCAGCCAGGAAGATTTGGCGTGTGCGCAGTGGATCTGGGGGCGAATCGTGAGTCTTTACGAGCAGGCCGCCAGCGATGATGGCGAGATTTCGCGACCGAAAGAACCCAACTGGACCGCATGGGCCAACGACGTGCGCACAATGCGGATGCTGGATGGCAGAACTCACAGACAAATTTGTGAAATGTTTGGTCGGGTGCAGCGGGATCCATTCTGGGTAAAAAATATCATGAGTCCGTCAAAGCTTCGCGAAAAATGGGATGAACTGGTTATCCGCCTGGGGCGTTCGTCTGTACAGCGTTGTGTGAATCATATTTCTGAGCCGGATACCGAAATTCCGCCGGGCTTCAGGGGGTAAGTGTTAATTTCTGGTCATGAGGTAATTTTCAGGAGGGCTTGTGGCAAAAGTTTTTACACAAGAAGAGCGGGAAAAAATTAAAGGGCAGGTTGTTGAACTCGTACGCCGGAGTGGGCGTGAGACGTTACGGCAACTGGAAGTCAAGACAGGTGCGACAAGATATCTGATGAGCGTTCTCGCAAGAGAGCTGGTTGCCAGCGGCGATGTATACAACTCTGGTTACGGGTTATTCCCGTCTGAACAGGCGCGTAAGGACTGGCAAAATGCCCGTAAAAAGCTCTCAAGGGCAAAGCTGAAGGAACCATCTGCGGTTGATCCGGACCTTATCTGGTCATTACCTGACGGAGAAATACGTCGTTACGACAGGCGTCATAATATGATTTGTACTGAGTGTCGTAAAAGCGAAGTTATGCAGCGCATATTGTCGTTTTATCAGGGGGATGTCCGGTATTTATTGAAGTGACGAGATTAAAGTGCATTAGTTCAGATGCAAATTGACATTTTGTGGCACAGGGTAGAGCTAGCGTGGTTGTCCGCTTTGTGCCAAGAGCGGACTTTGCAAAATGGGGGTTATTTCAATCAAAACGTAACGTCACAACCAGCCGACGCTCTCTCGCCATTTATAATTAGTAACTTTATCATTTTCGCTTATTTTTTTAGATATAGAGCGCGGCTCTCTTCCTAGATACTCAGATATTTCTATCGGGGACAAATCAAAATCAACAAGCATGACTCTAAGTTTTTCCATTTCCTTTAAAGTCCAAGGCTTGCCATAATTTTCATAAAGAGATACCTTATGCTCCCTGATAGTTCGCTTTCTCTGAGTTTCACTTTCCCTCTGAAAAATCTCGCTTTTAAATTGTGAACGGAACTCTTTACAGAAGTTGTTATCAGTTTTTTTGTTATATAGATTGCTAAAAATAAGCTGGGATGCCGGGTCTAAATCAGGTGTGTTAATAATGAAAACTTTGACCTTTTCCATATAGGGATATTCAATTTTACCCAATATGCTTAATTGCTTTATATTAAAAAAACCTCTCAAATCAAAAGATTTAATGAGCTTTGATTGGATTACACTTTCAATCCTGCTTGCATTTAAATAA
CAGTACTTTGCCATTGGAAGGGCCCATACTACCAGTTGAGGAATATATTTTTCTAATGCAATGCTCTGCCAATCAGTATCAAAGGTTTGGCTTTTTGCTAGCATGTTTTTCGGCAAATCAGAATACATTGTGGTAGATGCCCATATCTTATAATCATTCGCCAAGGCTTGATAGTATTTTGTGTGGTTATGAATTTTATAAGCTGACATGAAGCGATAAACGTCTTCGTCGTGACCAGCGTCGTAAATTGTTCGATTTCCTCTAAGATAACCGTCGTAATGTTCGGTTATCCTTCTACCTACATTACAACTTACCCCAACGTAAACCACACGACTGAAAAGTCCTTTATGGACAATAAGATAAACTCCGCTACAGCCAGATTTCCTGGCCTCTGATAGAGAACCTAAAAATCTCCATTCCATAATTAAATCCATAATTATTGCTACTGTTTTTGTTTATCATTATTTTCGTGAAACTTCAACAATTTTATCCAAAAGCTAAGGGCAAAGACTTATATAATTATACTTGTCATCGTTAGCGATTATATAGAGTAGTGGCGCTGACCTGCTCCCTGGTGATTCACACAGAATGCTGTTAGTAATGTCCGTTCCTCGCTCTCAGCGGACCTTCAGCTCAGTGATATCGTCCGCTCTGTGCAAAGAGCGGACGTTGGTATGCAAGAGCCCTCCAAAAGTTGATGGTTGGTTTGCAGGGGGGCTTAAAGAAACTGCACTTATCAAGTTGAAGTTCTGTATTCAGCGAAATCGTAGCACTCTGACGATAAGTAACTCCGGTACTCCGCTCTTCGATGAAGAACCAGAGTAATCCCCCCGAAAAACCAGCGCATCAAAATTGGATCTTCAGCGGTAGCTTATCGGCTATCGGAAGTACAGGTGTGGATTCGTGGTGAATTGCTTTGATAATAAACGATTAATACGGAAAAACGCATTAATCATTTATTAGCTTTTAGTAAACCACAATTTATTCCGTTTTACATATCATAGTAGTCGATTGGAGAATATAGTTTCTGGGAATGTACTCTTCAAAGTGTTCGTCCTTTTTAAATACATGAACTACATTTGGGAATAATTGATAGTCAACAGGGTGTATAGCGTTTGGATTATGGTACATGTACATGGCTGTACACCATGGTTCTTGATAGTTAGGGTCACTTACATCGGCTGAAAATGGATGTGGGGCTGCATCCTGATCAGTTTTAACACCACTGACGTACACTTTGAATCCACTCGCCTCTACACCTGCAAGAATTCCCATCCGGTTAAACTTAGGTATGGTTGCTTGAGTAGTGAGTAAAACGGCAGAAACATAATTATTTTGTTCTGAGCCAAAAAAGTTCGACTTGATACTTCTATTTTCATCTGTATGTCTTTCAATAGAAATGCCTGACTCAATATCAATCCCGTACAAATAGCTATGCAAGGCTTCGCTTGAGAAGGCCATGGACATTCTTTTTGAATAATCCTGCATTGCTATGACAAATGGTTTGTTCTTTGTATGGTTGAGTTCCCAGTAATGAACTTTCTCTGGCTCAGGGCAATGCCGGACTTTTTTTAATAAACTTCTTGCAAACTTAAAAGGCATGACATTTAGAACATGTTTTCTTAATTCATCCATCTGTTCATCGTTAATGACTTTTCTTTCAAGAGGGGCTTCTGCTTCAGCAATGCTTACAGCCTCTACAGCAATTTCCACTCCAAATTTAGATAGCAGAAAATCTGGTTGATTGTATTCTCTATTCATTTCAAAGTCGAGTTCATAAAATACAGCGTTCAAATATAATTCAAATAACCTTGAATTAAATGCATCACTTTGAAAATCCCTTATAAATATTCCATCAGGATCTTTGAACCAGTATGCAAGTTCCTCAAGAACAATATATGCAGGGAAATGAAGAGGGTCTTCGAGGAGCATTTTTATATAAACATTCCTTTTTTTCGCTGGGACCTTACTCAAGAATAATGAAAAAGGTTTGGTTGATT
CATCGCCTTGCATGAATGTACCATTTTGGTGCTGCGCCAGCATCTTTGGTATGTCATCGTTCAAATTATTAAGCAAGACATCCATTGAATCAAATGAAGCCAAGACGTTTATTGCTCTGAATTTTTTATCTAAATCCCGACCTAAGACTATTGCGTTAAAATCTTTATCAATATTGCATATGATTATTGTGGATAACAATGTTATCCCATTCCCCTCATATTTAAACCAGCGTATCTCCTCAGAAAATGTCTTAAGGTAAGGTGAGCGACCGTAAAAATAAATATCAAATTGTTCTTTGCTGATCTCACTGAAGTGTAATCCTGCGTTCATACCAATTCCTTTTCAATGAATAATTGGCCTTTAGGAGTGATTCCCTTTGTCTTTAATTCAGTTCTAACTAGTTCTTTAATCCAATAGCCTAAGCTCATCATGCAGTTGGATCATAAGACAACGCCCTATAGTGCTCGTGATACTATAGGGCATCTGACCACACTGTTAACTGGAGTAACGACTATGGCAGGAATACAGCATAACCAAACTCACCCCAAACTTACATAGCGCTTTCTGGCCGTGAGCATAACAAGGTCCACTCCTCGCTCATAAGGGACAACCATACTCAAATCTCCCACATTGCAGGAGATTTGAGTATGAACACGTCACCGTGGAACAAAGACCGTATCATAGGCCAAAAAAGACCACTTCAGATATCTCATATCTGGGGTATCCGAATCCGACTTGAACTGGAAGGTAAAACTCGCGATTTAGCTCTGTTCAACATGGCCCTGGATAGTAAGCTTCGAGGCTGTGATCTGGTCAAACTCAAAGTATCTGATGTTGCATATGGTGGCTCTGTTTCAAGCAGAGCAACGGTGTTGCAACAGAAAACCGGTAGCCCTGTTCAATTTGAGATAACCAAAGGGACAAGAGAAGCTGTTGCTGCATTGATACAGCTTAGCAATTTGCACAGTAAAGACTTCTTGTTTCGGTCTAGGGTCGGAACTAACCAGCACATTTCAACCCGGCAATACAACCGAATCTTTCATGGGGGGGTAGAAAAGCTTGGTCTCGAAGATTCGCTTTACAGCACACATTCCATGAGAAGAACAAAACCTTACCTGATCTACAAGAAAACCAAGAATCTCCGGGTGATCCAACTTCTGTTGGGTCATAAGAAACTGGAAAGCACAGTCCGTTATCTGGGCATTGAAGTCGATGATGCGTTAGAGATTTCTGAATCGATTGAAGTCTAAGGTTGTCAGGGCTGCAACAGCAGCCCTGTGCCATAAGCGGAAGTATTTAACAACTATCAGTGTTGTTCAACAGATAAAGGGGCACTTGATTTTTTCTGTTCTCAGGAAATGATAAAAGCGCGTCGGTTCAAGCCTGCTTAACGGGAGTTTGTTAATCCTGTTGCCGTGACGTTTTGACACCATTATGATGGGGAGACACTTAATGTATGAAGGTTCCGCCACTTATACCTGTCCAACAACTGCCTCGGATGTTTCTTTGTATGAATAAGTGGTAATGAGTAGTGAATCGCTAACAGTCACCCGAACAATCGGTGCCTGCAATTAATTCTATATTCTAAACGAGGGGGAGATTATTACACATGAAATTTAAGGACAAGAACCTTAAGGCTCTCGCGGAATGTATCATAGGAGATAATAAGGCATTTCTGTATCGTTCAAGCAGTCACATCACTGAATTTTTCCAGGACTGCGGCATGGATGTTACTCATGACGGATCCACTCGGTGGAAATGGACGGCCCAGAGGCTTGAAGAACTTCTTTATGAGCCACAGTCAAAGCCACATACTTTGCCGGAAAGGTTTGTTCATGTGCTCAGAACTTTAATGTTAAAAGAAGATGCAATGGATGACGATCCAGGAAGATTAAAGGCGCTTGAAGAACTGAACAAGCCTTTGATGCGGGAAGGCTATGAGGCATTCTATGGTGACGATCGCCTTTTGTATATACGCCATACCGATACCAAAACGGTT
TCAGTCAGTAATAACCCTCATCGGCCCTTAACGCCTCACGAAGTAGAATGCAGAAGGTTACTGACCGCGTTTCTTGATACCTGCTCAGAAGATGAGTTAATAGAAGATATTCTCCTTCCTTTATTCCGGCAACTTGGTTTTCACCGGATAACAGCAGTGGGACATAAAGATAAAGCGCTGGAATACGGGAAAGACATCTGGATGAAGTTCACACTGCCAACTCAGCATGTTCTTTATTTCGGCATTCAGGCAAAAAAAGGTAAGTTGGATGCGTCCGGTGCCAGCAAATCTACGAATTCAAACGTGGCAGAAATCTTCAACCAGGTACTGATGATGCTTGGCCATGAAATATTTGACCCAGAAACAAATAGAAAGGTGCTGGTAGATCATGCCTTTATCGTTGCTGGCGGAGAAATTACTAAACAGGCGAGGAACTGGCTGGGCGGGAAACTTGATGCCAGCAAAAGAAGCCAGATAATATTTATGGACCGGGAAGACATTCTTAATTTATATACTGTAAGTAATGTACCTCTGCCAACAGGTGCTCTCATCTCTGATGATGCCGTTAAGAACGATGATATTCCTTTCTAATCAGAAGTACGTCTTTTTCTGAAAGAATACGTGATAGGTAGCCACACCACACCTTTAGTGACCCCTTAATCTGGTAATATAACAGCCCGTATGAATGTCCGCGGCATCGCGGGCTGAAATTTATTAAAAATACTTATTCATCAAGCTGGAGTAGTTTGCCGAGTAACTGTAAACGCCCAACTTAACCGGACCATTCACTTTTAGATTGCTACCAGCAAACCAACTTCCGTTTCTCGCTCAAAGCGGACTAGAAGGTTAGCTTGCGTCGGACTTGGCGTATTTAAAGAAGTGCTGGTGGTAACTGGTTGTTGTGTTCCATTTCTACAAAACAAAATCACAGAAACTATACCCAATAGTTATATTGAATCAATGATGAGACAGCCTCATATTTATCAGAACTGGTGTACGTCCAATACAGGAGGTTGTCGTGCTGGTTCTCAAATATGCGCTAGCTATTGCGGCTGTAATGGCAATTTATTGTCTTGCTATTGTTCTTACGGATCGCCTTTCTGATTGATTTTATATTGGCGAGGTGACGGGAGTTAAGTAGAATTGCTGCGGGTGCTTGAGGCTATCTGCCTCAGGCATGAACACCAAAGGCAGATAGAGAAAAGCCCCAGTTAACATTACGCGTCCTGCAAGACGTTTAACATTAATCTGAGGCTCAATCTATGAACGGCAAATCTAGGTTAGCCTCTTACGTGCCGAAAGGCAAGGAGAAGCAGGCTATGAAGCAGCAAAAGGCGATGTTAATCGCCCTGATCGTCATCTGTTTAACCGTCATAATGACGGCACTGGTAACGAGGAAAGACCTCTGCGAGGTACGAATCCGAACCGGCCAGACGGAGGTCGCTGTCTTCACAGCTTACGAACCTGAGGAGTAAGAGACCAGGCGGGGGAGAATCCCTCGCCACCTCTGATGTGTCAGGCATCCTCAACGCACCCGCACTTAACCCGCTTCGGCGGGTTTTTGTTTTTATTTTCAACGCGTTTGAAGTTCTGGACGGTGCCGGAATAGAATCAAAAATACTTAAGTAGCGCGCAGGGATAAGAGGGATGGTCCCTTAAAGGGGAGAGCTAATTATCCGGAAGGATTCTGATGATGAACATCGAAGAACTGCGTAAAATTTTTTGTGAAGATGGCCTCTATGCTGTGTGCGTTGAAAATGGAAATCTTGTTAGTCATTACCGCATTATGTGTTTGCGAAAGAATGGGGCTGCGTTAATTAATTTTGTGGATGGTCGAGTGACAGACGGATTTATCTTGCGCGAAGGTGAGTTTGTCACTTCATTACAGGCACTGAAAGAGATTGGAATAAAAGCAGGCTTTTCAGCTTTTGCAGAAGAATAAACTCATCTACAATCTTGCGCGGGGCTGAACTCCCGCTGAGTAACACCGTGCCACCGGA
GAAAACCGATGGCACGCAACGTAAAATATTACAATTCTGATAATTCGCCCGTTCTTGCCTGCACGCACGAGCGGTATTCTCACGCATTCAAGTCTGAATGGTTCCAGCACCCTCCATGCACTGAAGAGCAGGCTGAATGGATAATTCAGTGTTACCGCAGGCGCGGATACGAGGTTAAGAAAGCTCTTAGTCTCGACTACCGTCACTGGATAATCTCAGTCAGATTGCCTTACTCCGAACGCCCACCGCGTCCGTCCCGTACATTCCAGCAACGCATCTGGAGGTAACGTGCGGGTATTACTTCGACCTGTTCTGGTACCGGAACTCGGGCTGGTGGTCGTTAAGCCGGGCCGTGAATCCATGCCGGTATTCCACAATACCCGGGTACTGGTGGAGCCGGAACCGAAAAGCATGCGTAATCTGCCGTCCGGGGTCGTTCCTGCCGTTCGCCAGCCGCTGGTGGAAGACAAAACATTGCTGCCGTTTTTCAGTAACGCACGGGTAATTCGTGCTGCTGGTGGTGCTGGTGCATTGTCTGACTGGCTGTTGCGCCATATTAAATCCTGCCAGTGGCCACACGGCGATTATCATCACAGCGAAACCGTCATTCACCGTTATGGTACCGGCGCAATGGTGTTGTGCTGGCACTGCGACAACCAGCTGCGTGACCAGACATCCGAATCACTCGAGCAACTTGCTCATCAAAACCTGTCAGCATGGATGATTGACGTCATCGGTCACGCAATAAGCGGTACGCAGGAGCGTGAATTATCTTTGGCTGAATTATCCTGGTGGGCGGTCCGCAATCAGGTGGCGGACGCGCTACCGGAAGCGGTATTACGTGGTTCGCTGGGGTTGCGTGCGGAAAAAATCCGCTCAATGTACCGTGAAAGCGACATCGTACCGGGAGAGCAGACCGCCAACAGCATACTGAAACAGCGCACAAAAAATCTTGCGCCGCTGCCTCACGCCCACCAGCAACAGAACCCACCACAGGAAAAGACGGTGGTCAGCATTGCCGTTGATCCTGAGTCTCCGGAATCTTTCATGAAACGACCTAAACGTCGCCGCTGGGTTAACGAGAAATACACACGCTGGGTGAAGACACAGCCGTGTGCGTGTTGTGGTAAGCCAGCCGACGATCCCCATCACCTGATTGGTCATGGTCAGGGCGGAATGGGGACAAAATCTCACGATATTTTCACGCTACCGCTGTGTCGGGAGCATCACAACGAGCTTCATGCGGATCCTCTGGCGTTCGAAGAAAAGCATGGTTCTCAGGTTGATTTAATTTTTCGTTTTCTTGATCACGCCTTTGCAACTGGCGTGCTTGGGTAAAAGAGGTGACTGATGCTCATAGATTTGGTTTTACCTTACCCGCCGACGGTGAACACTTACTGGCGACGCCGTGGCAGCACATATTTTATCTCGGAGGAGGGAAAGCGTTATCGCCGGGCTGTGGCGCTTATTGTTCGCCAGCAGCGGCTGAAATTAAGCCTGTCCGGAAGGCTGGCGATAAAGGTGATTGCAGAGCCACCGGATAAGCGTCGTCGCGACCTGGACAATATCCTGAAAGCACCGCTGGATGCGCTGACGCATGCGGGAGTGTTAATGGACGATGAGCAGTTTGATGAAATCAATATCGTTCGTGGTCAGCCAGTATCTGGTGGACGTCTGGGGGTGAAGATTTACCCCATAATGCATGAAGAGCAGGTCAAAAAATGAAACTGGAAGATTTACCGAAATACTACTCCCCAAAATCCCCTGGCCTGACCGATGCATCGGCCTCAACGTCAAAAGATGCGCTGAGTATCACTGATGTGATGGCCGCGCAGGGCATGACACAGAATCGGGCTGAGATGGGTTTTTCTGCGTTCCTGGGGAAAATGGGCATCAGTATGAATGACAGGGCGCGGGCAACAGAATTACTGGCAGATTATGCACTCAGTCGGTGCGATCGTGTGGCGGCGTTGAGAAAGCTTCCGGCAGAAATAAAACCG
GTAGTGATGCGCATTATGGCTTCGTACGCTTTTGAGGATTATGCCCGCAGCGCAGCGAGTAAAAAGCAGTGCCCTTGTTGCTATGGGGAAAAATTTATTGAAAGCGTAGTTTTTACAAACAAGGTCCAGTATCCGGATGGTAAGCCGCCGGTATGGGCAAAGTGTACGAAAGGTGTGTATCCGTCTTACTGGGAAGAATGGAAAAAAGTCAGGGAGGTGGTAAAAGTTGCCTGTCCGGAGTGTGGCGGAAAGGGTGAGGTTTCCACCGCCTGTAAGGATTGCCGTGGGCGTGGTGTCGCCATTCATCGTGAAGAGTCGGTAAAACGTGGTATGCCTGTTATCAGAGACTGCCAGCGTTGTGGTGGTCGTGGCTATGAAAGACTACCATCAACGGAGGCATTTAATGCTATATGCGAGGTGACAAACCAGATAACACGCGCGTCATGGGAAAAAACAGTTAAGAAATTCTATGATGCGCTGGTGACCCGGTTTGATATTGAAGAAGCATGGGCTGAGCGGCAGTTAAAAAAGGTAACTAGGTAACAAGGTTGATTTTTCCGGAATCTGTGGTAAATTCGTCATAACGATGGGCGTTTTATGCCTGACGTTAGAAGAGTTTCTACAACCCGCCGCCGAGCGGGTTTTTTATTGCGGAATTAATTATGGACCGTTATTATTCTGCTCCCGGCCCTTTAGCTCAGTGGTGAGAGCGAGCGACTCATAATCGACAGGTCGCTGGGTCAAATCCAGCAAGGGCCACCAACCGTCACCAGTTCATCAGGAAAGAGCGTCAACCCTTTAAGTTGAGTGTGCGAGGTTCGAGTCCCCGGTGGCGGTCCAGTGCCGACTTCGCTCAGTAGGTAGAGCAACTGACTTGTAATCAGTAGGTCACCAGTTCGATTCCGGTAGTCGGCACCATATGCGGGCATCGCATAATGGCTATTACCTCAGCCTTCCAAGCTGATGATGCGGGTTCGATTCCCGCTGCCCGCTCCAGTTAGAGTCTTTCAGTCTGCGATGATGGGAAATCCCGGAGTGACTGAAAGACGTTTAAGTTATGAATGATCGCTTTTTTTTGCAAAATTGCTGTGCAGAAATACTAACCTTCGGGCAGGCGATCATTCATAAGCACTCTGCTTTTATTCCGATTAACTGTGGGTGGTTTGTTGGATAGAGTGCTTTCCTTACTGTATATATTGTTTCGCCCGCTTTTGCGGGCTTTTCTTTTCAAATCCCTTTCATTTCTCAGTGTAAAACTACGCCATCCGTTATTTGCGGAGGTGAGGCTATGAAATCCATGGACAAAATTTCAACGGGCATTGCCTATGGCACCTCCGCAGGCAGTGCTGGCTACTGGTTTTTACAGCTGCTCGATAAAGTCACGCCCTCACAGTGGGCAGCAATAGGTGTGCTGGGTATCCGCACCGGCGGGCGCGTGCTGGCGGTAAACAGCCAGACCCGGACGCTGACGCTCGACCGTGAAATCACGCTGCCATCTTCCGGCACCACGCTGATAAGCCTGGTTGACGGGCAGGGGAGTCCGGTCAGCGTGGAGGTTCAGTCCGTCACCGACGGCGTGAAGGTGAAAGTGAGCCGTGTTCCTGACGGCGTTGCTGAATACAGCGTATGGGGGCTGAAGCTGCCGACGCTGCGCCAGCGCCTGTTCCGCTGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTACCGGAAAAAGAGGCCATCGTGGATAACGGGGCGCACTTTGACGGCGACCAGAGCGGCACGGTAAATGGTGTCACGCCGCCAGCGGTGCAGCACCTGACCGCAGAAGTCACCGCAGACAGCGGGGAATACCAGGTGCTGGCCCGCTGGGACACGCCGAAGGTGGTGAAGGGCGTGAGCTTCCTGCTTCGCCTGACCGTGGCAGCGGATGACGGCCGTGAGCGGCTGGTCAGCACGGCCCGGACGACGGAAACCACTTACCGCTTCACACAAC
TGGCTCTGGGGAACTACAGGCTGACAGTCCGGGCAGTAAATGCGCGGGGGCAGCAGGGCGATCCGGCGTCGGTATCGTTCCGGATTGCCGCACCGGCAGCGCCGTCGCGGATTGAGCTGACGCCGGGCTATTTTCAGATAACCGCCACGCCGCATCTTGCGGTTTATGACCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAAAACGGATTGCGGATATCAGGCAGGTTGAAACCAGCGCGCGTTATCTTGGTACGGCACTGTACTGGATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTATTTTTACGTTCGCAGTGTGAACACCGTTGGCAAATCGGCATTCGTGGAGGCTGTCGGTCAGCCGAGTGATGATGCATCAGGCTATCTGGATTTTTTCAAAGGCGAGATAGGGAAAACCCATCTGGCTCAGGAGCTGTGGACGCAGATTGATAACGGTCAGCTTGCGCCTGACCTGACTGAAATCAGGACGTCCATAACGGATGTCAGCAATGAAATAACACAGACCGTCAATAAGAAACTGGAAGACCAGAGTGCAGCGATCCAGCAGATACAGAAGGTTCAGGTTGATACAAATAATAACCTGAACAGCATGTGGGCAGTGAAGCTGCAGCAGATGCAGGACGGACGCCTTTATATTGCGGGTATCGGTGCCGGTATTGAGAACACCCCTGACGGCATGCAGAGTCAGGTGCTGCTGGCAGCAGACAGGATTGCGATGATTAATCCTGCGAATGGCAACACAAAGCCGATGTTTGTTGGTCAGGGCGATCAGATATTCATGAATGAAGTGTTCCTGAAACGCCTGACGGCCCCCACCATTACCAGCGGCGGTAATCCTCCGGCATTTTCCCTGACATCAGACGGGAGACTGACGGCGAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACTCAGGAACGCTCAACAATGTCACGATTAACCAGAACTGTACGATTAAGGGCATGCTGGAGGCGACCCAGGTCAGAGGAGATTTCGTTAAAGCTGTATCAAAAGCCTTCCCGAAAAAAGTCGGTACGTGGGGTAACACGGAAACACCAAACGGTACGGTTACAGTAACCATCAGCGATGATCATAACTTTGACCGCCAGATTATTATTCCGCCCATTATTTTTAACGGTATAGCGTATGACGATCCGGGGAGCGGAAATAACCCAGGAGGCACGCGATACACGGGTTATGGTTTTGAAGTTCGCAAAAACGGCGTATTAATCGCATCCAGAGAAACTAAAGGGGCCATTCCCGGTAGTTACAGTGCAGTTATTGATATGCCGAGTGGCAGGGGAAGCGTCACTCTGGAGTTTAAGATTTTCCAGAAAGGCAATCAGGGGGCAGGCAATATCACCGACTGTACGGTGATTGTGACCAAAAAAGCGGCTTCCGGCATCAGTATTCGTTGAAATATTTATAACCCCAATAAAGGGCGTCAGGAATGACGCCTTTTTTATTGCAGAAAAGCGAGAGGTAATTATGCGTAAAGTTTGTGCAGCAATTTTGTCCGCAGCCATTTGTCTGGCCGTATCCGGTGCGCCTGCATGGGCGTCTGAACATCAGTCCACGCTGAGCGCGGGGTATCTTCATGCCCGGACCAACGTTCCCGGCAGTGATGATCTGAACGGGATTAACGTGAAATATCGTTATGAGTTTACGGATACGCTGGGGCTGGTGACGTCATTCAGCTATGCAGGAGACAAGAATCGCCAGCTGACCCGTTACAGCGATACCCGCTGGCATGAAGATTCCGTGCGTAACCGCTGGTTCAGCGTGATGGCGGGGCCGTCTGTGCGCGTGAATGAATGGTTCAGCGCGTATGCGATGGTACTGGTGGAAGAACTATCGAGCAAGCACGTGCGAACTTGCGGGTAATGTATGAGCAAAAAGCTGGCCTTGCTAATACTGACCTAAACACCCTTACCGGTGAATATTCTGGTTTCTATCAACAACCAACG
AGCGCTTACGCAACAGAAGAGTTAAATTACCCAATCGGTCTGGCGGGCGCTTTAATAGTGCTCCAAACGAGAGCCAACACTGCTTCTTCCTGCGTTCAGGTGTACCACCCTTATAATAATCCGGGAATTACTTATAGACGAATATATGAAGGAGGTAGCGGTACCTGGTCTGAATGGAAGAGAGATGTATCAACAGAAAGGGTTGAAGAGGGAAAAGAAACAACTTACGTATATTCTACGTATTCTTCAGGCGCACCACGCTTACAGGTTTCCAAATCTGGTTTGTGGGGTTGTCATAATGGCACTGGCTGGTTGCCATTAGCTGTTGGGCAAGGAGGTACAGGTGCGACAACAGTAGAAGATGCGCGAAACAACTTAAGTCTTGGCGAAAGTAGCGCAGTTAAATTTAAAAACCTTACTTTAACCGAAGCGCTCGACACGACATTAGGACTGCTTACAAAAACAGGACGAGACTGGAACACGCAGCATACTGATAACATTAATAAATTTATACCAATTGCAGGCAGTACAAACGGCCCGGCAGGCTCTATGGTTCTTGGCGGCATTCATGTTCAATTTAGTAAAAATTATGCTGTGCAGTTCGGAGGCCGCAATTCCGGTTTTTGGGGAAGAACAATTGAAAATGGAACGACACAGGAATGGAAGAAATTACTAACAGTAGACGATCTCAATTCATCTACCGATCTTGCTGTCAGGTCATTAACCACATCTAACCCGGTAAAATCTGGCGGAGGGCGAATTGATGTCCTTGGAAGCACGTCAGACTATAGCAAAATGGATTGCTTTGTACGTGGGTTTGATAGCACCGGTAATTCTCTCGTGTGGGCGTTGGGTTCATCAGTCGGCGTAAGTAAGATGCTATCGCTAAAAAATTTCTTTAGCGGAGCTGAGATACTGTTAAATGGTAATGACGGCGCGGTTCAACTCAAAACAGGTGCTGTTAACGGGGCTACAGCGCAGACGCTCACTATCAACAAGAATGAGGTTAACTCAACCGTTGATTTAACCCTTACAAAGCAATCAGGGACTGGTAATCGTTTTGTTTTACAGAACTCAGGTAATGCAGAACTACCGTTTTCTGTCAGGGTGTGGGGTTCCAGTACTCGACAAAACGTTTTTGAGGTTGGAACGTCTGCTGCGTATCTGTTTTATGCGCAAAAAACGTCAGCAGGCCAGTTGTTTGATGTAAATGGCGCTATTAATTGCACAACGCTGAATCAGTCATCAGACCGCGACCTTAAAGACGATATTCTCGTTATCAGCGACGCGACGAAAGCAATCCGTAAAATGAACGGATACACCTACACGCTCAAGGAAAACGGGATGCCTTATGCTGGCGTTATTGCACAGGAAGTAATGGAGGCGATACCAGAAGCTGTGGGATCGTTTACTCATTATGGTGAAGAGTTGCAAGGTCCGACCGTTGACGGCAACGAGCTACGCGAAGAAACTCGCTATCTTAATGTTGACTACTCCGCCGTGACGGGTTTACTTGTTCAGGTCGCCCGTGAAACAGATGATCGCGTTACCGCGCTGGAAGAGGAAAACACAACGCTACGTCAAAATCTGGCAACAGCAGGCACCCGGATCAGCACTCTGGAAAATCAGGTAAGCGAACTGGTTGCACTTGTCCGGCAGTTAACAGGAAGCGAACATTGATATCCTTCAAGCCCTGAAGGAGGCTGTTCCTGGTACGTTCAGACTGTTGTTGAGCTGGAAATCGCAACGGAGGAAGAAACTTCGTTGCTGGAAGTCTGGAAGAAGTATCGGGTGTTGCTGAACCGTGTTAATACAACAACTGCACCGGATATTGAATGGCCAGTAGCACCTATAGGGTAA
Protein sequences of DBSCAN-SWA_11 >NZ_CP040886|4692223:4716004|4707254_4707362_-|WP_122083109.1|DBSCAN-SWA MLTGAFLYLPLVFMPEADSLKHPQQFYLTPVTSPI >NZ_CP040886|4692223:4716004|4699392_4699584_-|WP_001083281.1|lysis|DBSCAN-SWA MNSAFALVLTVFLVSGVPVDIAVSVHRTMQECMTAATEQKIPGNCYPVDKVINQDNIEIPAGL >NZ_CP040886|4692223:4716004|4701633_4702578_-|WP_001678528.1|DBSCAN-SWA MDLIMEWRFLGSLSEARKSGCSGVYLIVHKGLFSRVVYVGVSCNVGRRITEHYDGYLRGNRTIYDAGHDEDVYRFMSAYKIHNHTKYYQALANDYKIWASTTMYSDLPKNMLAKSQTFDTDWQSIALEKYIPQLVVWALPMAKYCYLNASRIESVIQSKLIKSFDLRGFFNIKQLSILGKIEYPYMEKVKVFIINTPDLDPASQLIFSNLYNKKTDNNFCKEFRSQFKSEIFQRESETQRKRTIREHKVSLYENYGKPWTLKEMEKLRVMLVDFDLSPIEISEYLGREPRSISKKISENDKVTNYKWRESVGWL >NZ_CP040886|4692223:4716004|4711432_4713595_+|WP_001373320.1|DBSCAN-SWA MKSMDKISTGIAYGTSAGSAGYWFLQLLDKVTPSQWAAIGVLGIRTGGRVLAVNSQTRTLTLDREITLPSSGTTLISLVDGQGSPVSVEVQSVTDGVKVKVSRVPDGVAEYSVWGLKLPTLRQRLFRCVSIRENDDGTYAITAVQHVPEKEAIVDNGAHFDGDQSGTVNGVTPPAVQHLTAEVTADSGEYQVLARWDTPKVVKGVSFLLRLTVAADDGRERLVSTARTTETTYRFTQLALGNYRLTVRAVNARGQQGDPASVSFRIAAPAAPSRIELTPGYFQITATPHLAVYDPTVQFEFWFSEKRIADIRQVETSARYLGTALYWIAASINIKPGHDYYFYVRSVNTVGKSAFVEAVGQPSDDASGYLDFFKGEIGKTHLAQELWTQIDNGQLAPDLTEIRTSITDVSNEITQTVNKKLEDQSAAIQQIQKVQVDTNNNLNSMWAVKLQQMQDGRLYIAGIGAGIENTPDGMQSQVLLAADRIAMINPANGNTKPMFVGQGDQIFMNEVFLKRLTAPTITSGGNPPAFSLTSDGRLTAKNADISGSVNANSGTLNNVTINQNCTIKGMLEATQVRGDFVKAVSKAFPKKVGTWGNTETPNGTVTVTISDDHNFDRQIIIPPIIFNGIAYDDPGSGNNPGGTRYTGYGFEVRKNGVLIASRETKGAIPGSYSAVIDMPSGRGSVTLEFKIFQKGNQGAGNITDCTVIVTKKAASGISIR >NZ_CP040886|4692223:4716004|4692755_4693970_+|WP_001295394.1|DBSCAN-SWA MKIVKAEVFVTCPGRNFVTLKITTEDGITGLGDATLNGRELSVASYLQDHLCPQLIGRDAHRIEDIWQFFYKGAYWRRGPVTMSAISAVDMALWDIKAKAANMPLYQLLGGASREGVMVYCHTTGHSIDEALDDYARHQELGFKAIRVQCGIPGMKTTYGMSKGKGLAYEPATKGQWPEEQLWSTEKYLDFMPKLFDAVRNKFGFNEHLLHDMHHRLTPIEAARFGKSIEDYRMFWMEDPTPAENQECFRLIRQHTVTPIAVGEVFNSIWDCKQLIEEQLIDYIRTTLTHAGGITGMRRIADFASLYQVRTGSHGPSDLSPVCMAAALHFDLWVPNFGVQEYMGYSEQMLEVFPHNWTFDNGYMHPGDKPGLGIEFDEKLAAKYPYEPAYLPVARLEDGTLWNW 
>NZ_CP040886|4692223:4716004|4701081_4701504_+|WP_001373616.1|DBSCAN-SWA MAKVFTQEEREKIKGQVVELVRRSGRETLRQLEVKTGATRYLMSVLARELVASGDVYNSGYGLFPSEQARKDWQNARKKLSRAKLKEPSAVDPDLIWSLPDGEIRRYDRRHNMICTECRKSEVMQRILSFYQGDVRYLLK >NZ_CP040886|4692223:4716004|4715878_4716004_+|WP_072163404.1|tail|DBSCAN-SWA MEIATEEETSLLEVWKKYRVLLNRVNTTTAPDIEWPVAPIG >NZ_CP040886|4692223:4716004|4692223_4692550_+|WP_000598292.1|DBSCAN-SWA MIKTTLLFFATALCEIIGCFLPWLWLKRNASIWLLLPAGISLALFVWLLTLHPAASGRVYAAYGGVYVCTALMWLRVVDGVKLTLYDWTGALIALCGMLIIVAGWGRT >NZ_CP040886|4692223:4716004|4699852_4700095_+|WP_072163420.1|DBSCAN-SWA MRISDFRLYREISDGKSITYMIAGLNKEYGDVVESGLLFADPAVVDRETDELIEKAIAFKLAYRQQYQQKAGWNYESSFC >NZ_CP040886|4692223:4716004|4706939_4707248_+|WP_032181055.1|DBSCAN-SWA MATSKPTSVSRSKRTRRLACVGLGVFKEVLVVTGCCVPFLQNKITETIPNSYIESMMRQPHIYQNWCTSNTGGCRAGSQICASYCGCNGNLLSCYCSYGSPF >NZ_CP040886|4692223:4716004|4714426_4715824_+|WP_032181053.1|DBSCAN-SWA MWGCHNGTGWLPLAVGQGGTGATTVEDARNNLSLGESSAVKFKNLTLTEALDTTLGLLTKTGRDWNTQHTDNINKFIPIAGSTNGPAGSMVLGGIHVQFSKNYAVQFGGRNSGFWGRTIENGTTQEWKKLLTVDDLNSSTDLAVRSLTTSNPVKSGGGRIDVLGSTSDYSKMDCFVRGFDSTGNSLVWALGSSVGVSKMLSLKNFFSGAEILLNGNDGAVQLKTGAVNGATAQTLTINKNEVNSTVDLTLTKQSGTGNRFVLQNSGNAELPFSVRVWGSSTRQNVFEVGTSAAYLFYAQKTSAGQLFDVNGAINCTTLNQSSDRDLKDDILVISDATKAIRKMNGYTYTLKENGMPYAGVIAQEVMEAIPEAVGSFTHYGEELQGPTVDGNELREETRYLNVDYSAVTGLLVQVARETDDRVTALEEENTTLRQNLATAGTRISTLENQVSELVALVRQLTGSEH >NZ_CP040886|4692223:4716004|4696827_4699299_-|WP_001372999.1|DBSCAN-SWA 
MSKVFICAAIPDELATREEGAVAVATAIEAGDERRARAKFHWQFLEHYPAAQDCAYKFIVCEDKPGIPRPALDSWDAEYMQENRWDEESASFVPVETESDPMNVTFDKLAPEVQNAVMVKFDTCENITVDMVISAQELLQEDMATFDGHIVEALMKMPEVNAMYPELKLHAIGWVKHKCIPGAKWPEIQAEMRIWKKRREGERKETGKYTSVVDLARARANQQYTENSTGKISPVIAAIHREYKQTWKTLDDELAYALWPGDVDAGNIDGSIHRWAKKEVIDNDREDWKRISASMRKQPDALRYDRQTIFGLVRERPIDIHKDPVALNKYICEYLTTKGVFENEETDLGTVDVLQSSETQTDAVETEVSDIPKNETAPEAEPSVEREGPFYFLFADKDGEKYGRANKLSGLDKALAAGATEITKEEYFARKNGTYTGLPQNVDTAEDSEQPEPIKVTADEVNKIMQAANISQPDADKLLAASRGEFVEGISDPNDPKWVKGIQTRDSVNQNQHESERNYQKAEQNSPNALQNEPETKQPEPVAQQEVEKVCTACGQTGGGNCPDCGAVMGDATYQETFDEEYQVEVQEDDPEEMEGAEHPHKENTGGNQHHNSDNETGETADHPIKVNGHHEITSTSRTCDHLMIDLETMGKNPDAPITSIGAIFFDPQTGDMGPEFSKTIDLETAGGVIDRDTIKWWLKQSREAQSAIMTDEIPLDDALLQLREFIDENSGEFFVQVWGNGANFDNTILRRSYERQGIPCPWRYYNDRDVRTIVELGKVIDFDARTAIQFEGERHNALDDARYQAKYVSVIWQKLIPSQADS >NZ_CP040886|4692223:4716004|4707406_4707619_+|WP_001013632.1|DBSCAN-SWA MNGKSRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIMTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >NZ_CP040886|4692223:4716004|4705754_4706735_+|WP_023147794.1|DBSCAN-SWA MKFKDKNLKALAECIIGDNKAFLYRSSSHITEFFQDCGMDVTHDGSTRWKWTAQRLEELLYEPQSKPHTLPERFVHVLRTLMLKEDAMDDDPGRLKALEELNKPLMREGYEAFYGDDRLLYIRHTDTKTVSVSNNPHRPLTPHEVECRRLLTAFLDTCSEDELIEDILLPLFRQLGFHRITAVGHKDKALEYGKDIWMKFTLPTQHVLYFGIQAKKGKLDASGASKSTNSNVAEIFNQVLMMLGHEIFDPETNRKVLVDHAFIVAGGEITKQARNWLGGKLDASKRSQIIFMDREDILNLYTVSNVPLPTGALISDDAVKNDDIPF >NZ_CP040886|4692223:4716004|4708432_4709482_+|WP_001373319.1|DBSCAN-SWA MRVLLRPVLVPELGLVVVKPGRESMPVFHNTRVLVEPEPKSMRNLPSGVVPAVRQPLVEDKTLLPFFSNARVIRAAGGAGALSDWLLRHIKSCQWPHGDYHHSETVIHRYGTGAMVLCWHCDNQLRDQTSESLEQLAHQNLSAWMIDVIGHAISGTQERELSLAELSWWAVRNQVADALPEAVLRGSLGLRAEKIRSMYRESDIVPGEQTANSILKQRTKNLAPLPHAHQQQNPPQEKTVVSIAVDPESPESFMKRPKRRRWVNEKYTRWVKTQPCACCGKPADDPHHLIGHGQGGMGTKSHDIFTLPLCREHHNELHADPLAFEEKHGSQVDLIFRFLDHAFATGVLG >NZ_CP040886|4692223:4716004|4699580_4699769_-|WP_000854559.1|DBSCAN-SWA MKTLLPNVNTSEGCFEIGVTISNPVFTEDAINKRKQERELLNKICIVSMLARLRLMPKGCAQ >NZ_CP040886|4692223:4716004|4693981_4695001_+|WP_000836058.1|DBSCAN-SWA 
MKSILIEKPNQLAIVEREIPTPSAGEVRVKVKLAGICGSDSHIYRGHNPFAKYPRVIGHEFFGVIDAVGDGVESARVGERVAVDPVVSCGHCYPCSIGKPNVCTTLAVLGVHADGGFSEYAVVPAKNAWKIPEAVADQYAVMIEPFTIAANVTGHGQPTENDTVLVYGAGPIGLTIVQVLKGVYNVKNVIVADRIDERLEKAKESGADWAINNSQTPLGEIFAEKGIKPTLIIDAACHPSILKEAVTLASPAARIVLMGFSSEPSEVIQQGITGKELSIFSSRLNANKFPVVIDWLSKGLIKPEKLITHTFDFQHVADAISLFEQDQKHCCKVLLTFSE >NZ_CP040886|4692223:4716004|4709494_4709869_+|WP_000904112.1|DBSCAN-SWA MLIDLVLPYPPTVNTYWRRRGSTYFISEEGKRYRRAVALIVRQQRLKLSLSGRLAIKVIAEPPDKRRRDLDNILKAPLDALTHAGVLMDDEQFDEINIVRGQPVSGGRLGVKIYPIMHEEQVKK >NZ_CP040886|4692223:4716004|4703125_4704475_-|WP_001678529.1|DBSCAN-SWA MNAGLHFSEISKEQFDIYFYGRSPYLKTFSEEIRWFKYEGNGITLLSTIIICNIDKDFNAIVLGRDLDKKFRAINVLASFDSMDVLLNNLNDDIPKMLAQHQNGTFMQGDESTKPFSLFLSKVPAKKRNVYIKMLLEDPLHFPAYIVLEELAYWFKDPDGIFIRDFQSDAFNSRLFELYLNAVFYELDFEMNREYNQPDFLLSKFGVEIAVEAVSIAEAEAPLERKVINDEQMDELRKHVLNVMPFKFARSLLKKVRHCPEPEKVHYWELNHTKNKPFVIAMQDYSKRMSMAFSSEALHSYLYGIDIESGISIERHTDENRSIKSNFFGSEQNNYVSAVLLTTQATIPKFNRMGILAGVEASGFKVYVSGVKTDQDAAPHPFSADVSDPNYQEPWCTAMYMYHNPNAIHPVDYQLFPNVVHVFKKDEHFEEYIPRNYILQSTTMICKTE >NZ_CP040886|4692223:4716004|4704792_4705395_+|WP_023147793.1|integrase|DBSCAN-SWA MNTSPWNKDRIIGQKRPLQISHIWGIRIRLELEGKTRDLALFNMALDSKLRGCDLVKLKVSDVAYGGSVSSRATVLQQKTGSPVQFEITKGTREAVAALIQLSNLHSKDFLFRSRVGTNQHISTRQYNRIFHGGVEKLGLEDSLYSTHSMRRTKPYLIYKKTKNLRVIQLLLGHKKLESTVRYLGIEVDDALEISESIEV >NZ_CP040886|4692223:4716004|4707834_4708086_+|WP_000980999.1|DBSCAN-SWA MMNIEELRKIFCEDGLYAVCVENGNLVSHYRIMCLRKNGAALINFVDGRVTDGFILREGEFVTSLQALKEIGIKAGFSAFAEE >NZ_CP040886|4692223:4716004|4695058_4695169_+|WP_001360138.1|DBSCAN-SWA MTIEKHERSTKDLVKAAVSGWLGTALEFMDFKSHAC >NZ_CP040886|4692223:4716004|4700075_4701041_+|WP_000054501.1|DBSCAN-SWA 
MSLLFAERPLVINTQLAMKIGLNEAIVLQQLHYWLRDTNSGMECDGVRWIYNTTEQWLEQFPFWSESTLKRAFASLKTLGLLRCEKLNKSKRDMTNFYTINYGSELLDGGKLSESIGLKCAAPSGQNDTMEEVKMKRSIGSKRLNVIGSKWPDDLTENTTEITTENKKTSRPEASQPDPQTVEQDFLTRHPDAVVFSAKKRQWGSQEDLACAQWIWGRIVSLYEQAASDDGEISRPKEPNWTAWANDVRTMRMLDGRTHRQICEMFGRVQRDPFWVKNIMSPSKLREKWDELVIRLGRSSVQRCVNHISEPDTEIPPGFRG >NZ_CP040886|4692223:4716004|4709865_4710687_+|WP_000762889.1|DBSCAN-SWA MKLEDLPKYYSPKSPGLTDASASTSKDALSITDVMAAQGMTQNRAEMGFSAFLGKMGISMNDRARATELLADYALSRCDRVAALRKLPAEIKPVVMRIMASYAFEDYARSAASKKQCPCCYGEKFIESVVFTNKVQYPDGKPPVWAKCTKGVYPSYWEEWKKVREVVKVACPECGGKGEVSTACKDCRGRGVAIHREESVKRGMPVIRDCQRCGGRGYERLPSTEAFNAICEVTNQITRASWEKTVKKFYDALVTRFDIEEAWAERQLKKVTR >NZ_CP040886|4692223:4716004|4695188_4696469_-|WP_000877001.1|integrase|DBSCAN-SWA MKYPTGVENHGGKLRIWFVYKGVRVRENLGVPDTAKNRRVAGELRSSVCYAIKTGVFDYAKQFPSSRNLEKFGEARQDLTIKELAEKFLALKETEVAKTSLNTYRAVIKNILSIIGEKNLASSINKEKLLEVRKELLTGYQIPKSNYIVTQPGRSAVTVNNYMTNLNAVFQFGVDNGYLADNPFKGISPLKESRTIPDPLSREEFIRLIDACRNQQAKNLWCVSVYTGVRPGELCALGWEDIDLKNGTMMIRRNLAKDRFTVPKTQAGTNRVIHLIKPAIDALRSQMTLTRLSKEHIIDVHLREYGRTEKQKCTFVFQPEVSARVKNYGDHFTVDSIRQMWDAAIKRAGLRHRKSYQSRHTYACWSLTAGANPAFIANQMGHADAQMVFQVYGKWMSENNNAQVALLNTQLSEFAPTMPHNEAMKN >NZ_CP040886|4692223:4716004|4696503_4696740_-|WP_001296941.1|DBSCAN-SWA MIVSPGKWVSEEQLIALKGIKKGTLKKAREKSFMEGREYKHVAHDGMPWDNSPCFYNLEEIDRWIERQASARPRRHLT >NZ_CP040886|4692223:4716004|4708152_4708431_+|WP_023147795.1|DBSCAN-SWA MARNVKYYNSDNSPVLACTHERYSHAFKSEWFQHPPCTEEQAEWIIQCYRRRGYEVKKALSLDYRHWIISVRLPYSERPPRPSRTFQQRIWR |
27 | Enterobacteria_phage(26.32%) | lysis,integrase,tail | attL 4687689:4687703|attR 4712244:4712258 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
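The protein headers listed above appear to follow a fixed pipe-delimited layout: contig, prophage region (`start:end`), protein coordinates (`start_end_strand`), protein accession, an optional key label (such as `lysis`, `integrase` or `tail`), and the source tool. The following is a minimal Python sketch for splitting such a header into fields. The field order is inferred only from the headers shown in this record, and the names `ProteinHeader` and `parse_header` are hypothetical, not part of any tool's API.

```python
from typing import NamedTuple, Optional


class ProteinHeader(NamedTuple):
    """Fields of one protein header line (layout assumed from this record)."""
    contig: str
    region_start: int
    region_end: int
    start: int
    end: int
    strand: str
    accession: str
    key_label: Optional[str]  # e.g. "lysis", "integrase", "tail"; None if absent
    source: str


def parse_header(header: str) -> ProteinHeader:
    """Split a header such as
    '>NZ_CP040886|4692223:4716004|4699392_4699584_-|WP_001083281.1|lysis|DBSCAN-SWA'
    into its fields. Headers without a key label have five fields instead of six.
    """
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")

    contig, region, coords, accession = fields[:4]
    key_label = fields[4] if len(fields) == 6 else None
    source = fields[-1]

    # Prophage region is written as 'start:end'.
    region_start, region_end = (int(v) for v in region.split(":"))
    # Protein coordinates are written as 'start_end_strand'.
    start, end, strand = coords.split("_")

    return ProteinHeader(contig, region_start, region_end,
                         int(start), int(end), strand,
                         accession, key_label, source)


if __name__ == "__main__":
    example = ">NZ_CP040886|4692223:4716004|4699392_4699584_-|WP_001083281.1|lysis|DBSCAN-SWA"
    print(parse_header(example))
```

Proteins whose `key_label` is not `None` correspond to the entries carrying a phage-related label in this listing.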
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
86975 : 114977
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP040885|86975:114977|DBSCAN-SWA TATGGACACCTCACTTGCTCATGAGAACGCCCGCCTGCGGGCACTGTTGCAGACGCAACAGGACACCATCCGCCAGATGGCTGAATACAACCGCCTGCTCTCACAGCGGGTGGCGGCTTATGCTTCCGAAATCAACCGGCTGAAGGCGCTGGTTGCGAAACTGCAACGTATGCAGTTCGGTAAAAGCTCAGAAAAACTTCGTGCAAAAACCGAACGGCAGATACAGGAAGCACAGGAGCGAATCAGCGCACTTCAGGAAGAAATGGCGGAAACGCTGGGTGAGCAATATGACCCGGTACTGCCATCCGCCCTGCGCCAGTCTTCAGCCCGTAAACCGTTACCGGCCTCACTTCCCCGTGAAACCCGGGTTATCCGGCCGGAAGAGGAATGCTGTCCTGCCTGTGGTGGTGAACTCAGTTCTCTGGGATGTGATGTGTCAGAGCAACTGGAGCTTATCAGCAGCGCCTTTAAGGTTATCGAAACACAACGTCCGAAACAGGCCTGTTGCCGGTGCGACCATATCGTGCAGGCACCAGTACCTTCAAAACCCATTGCACGCAGTTATGCCGGAGCGGGGCTTCTGGCCCATGTTGTCACCGGGAAATATGCAGACCATCTGCCGTTATACCGCCAGTCAGAAATATACCGTCGTCAGGGAGTGGAGCTGAGCCGTGCCACACTGGGGCGCTGGACAGGTGCTGTTGCTGAACTGCTGGAGCCGCTGTATGACGTCCTGCGCCAGTATGTGCTGATGCCCGGTAAAGTCCATGCTGATGATATCCCCGTCCCGGTCCAGGAGCCGGGCAGCGGTAAAACCCGGACAGCCCGGCTGTGGGTCTACGTCCGTGATGACCGTAACGCCGGTTCACAGATGCCCCCGGCGGTCTGGTTCGCGTACAGTCCGGACCGGAAAGGTATCCATCCACAAAATTACCTGGCCGGTTACAGCGGTGTGCTTCAGGCCGATGCTTACGGTGGTTACCGGGCGTTATACGAATCCGGCAGAATAACGGAAGCCGCGTGTATGGCTCATGCCCGGAGAAAAATCCACGATGTGCATGCAAGAGCGCCCACCTACATCACCACGGAAGCCCTGCAGCGTATCGGTGAACTGTATGCCATCGAGGCAGAGGTCCGGGGCTGTTCAGCAGAACAGCGTCTGGCGGCAAGAAAAGCCAGAGCCGCGCCACTGATGCAGTCACTGTATGACTGGATACAGCAACAGATGAAAACACTGTCGCGTCACTCAGATACGGCAAAAGCGTTCGCATACCTGCTGAAACAGTGGGATGCACTGAACGTGTACTGCAGTAATGGCTGGGTGGAAATCGACAACAACATCGCAGAGAACGCCTTACGGGGAGTGGCCGTAGGCCGGAAAAACTGGATGTTCGCGGGTTCCGACAGCGGTGGTGAACATGCGGCGGTGTTGTACTCGCTGATCGGCACATGCCGTCTGAACAATGTGGAGCCAGAAAAGTGGCTGCGTTACGTCATTGAACATATCCAGGACTGGCCGGCAAACCGGGTACGCGATCTGTTGCCCTGGAAAGTTGATCTGAGCTCTCAGTAAATATCAATACGGTTCTGATGAGTCGCTTACGTCCCAGTCGTCCCAGCCGTGCCAGGTGCTGCCACAGATTCAGGTTATGCCGCTCAATTCGCTGCGTATATCGCTTGCTGATTACGTGCAGCTTTCCCTTCAGGCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATATCACCACGTCAAAGGGTGACAGCAGGCTCATAAGACGCCCCAGCGTCGCCATAGTGCGTTCACCGAATACGTGCGCAACAACCGTCTTCCGGAGACTGTCATACGCGTAAAACAGCCAGCGCTGGCGCGATTTAGCCCCGACATAGCCCCACTGTTCGTCCATTTCCGCGCAGACGATGACG
TCACTGCCCGGCTGTATGCGCGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAATCGTGTTGAGGCCAACGCCCATAATGCGGGCAGTTGCCCGGCATCCAACGCCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGTTGAGAAGCGGTGTAAGTGAACTGCAGTTGCCATGTTTTACGGCAGTGAGAGCAGAGATAGCGCTGATGTCCGGCGGTGCTTTTGCCGTTACGCACCACCCCGTCAGTAGCTGAACAGGAGGGACAGCTGATAGAAACAGAAGCCACTGGAGCACCTCAAAAACACCATCATACACTAAATCAGTAAGTTGGCAGCATCACCGGGTGGCCTGATTTAGGTGGTGGATCAGCAGTATCACAGTTTGCAATGATGGCCAGAAAATTTGGTAAACAACTTATTGTCGCTGTAAGTGAAAAGGATGCCATCAAATTAAAAGACAATTTTGATATTATTGGGATGTTGTCATATGGTAAAGAAAATTTTATTGCCATGAGTCATACGAAAAGCGAGTTAACTGGAGGGAAACGTCGGTACCGGATTAAAAATTGATATAAATAAAAAGCATACTTATTGGCTGGAGTTTCGGTAGTAAAAAGCGGGAAATGAAGGTAATAGTCAGCAACAGGGAATGTGGTATTATCGCGGCGGGTGTCTGAGCCTTTCTGGTTCAGGCAAGACGCAGGTACCAGAAATGCGAAGACCCCACTCGTTAATCCATTAACTCGTGAGGTCTGCATGAAGTACCTTAACACTACTGATTGTAGCCTCTTCCTTGCAGGGAGGTCAAAGTTTATGACGAAATATGCCCTTATCGGGTTGCTCGCCGTGTGCGCCACGGTGTTGTGTTTTTCACTGATATTCAGGGAACGGTTATGTGAACTGAATATTCACAGGGGAAATACAGTGGTGCAGGTAACTCTGGCCTACGAAGCACGGAAGTAAGCTGCCGGGCGGGGACGGAAGTCCCCGCTTTCCGGAAGTGTGAGGTATTTCAGGGGCAGACACCCGACATGCCAGAAACAGCCGGTCCCGCCCGGGGCCGGCATCCTGGTTAAGGCATTTCCTGCTTTTCAGTCATTTCATTATCAAAATCACATTAAACGGTTGTAATCAGACATGATTTGTGCGCCAACACCGATCATCGTCACAACTTTCAAGTCGCTGATTTCAAAAAACTGTAGTATCCTCTGCGAAACGATCTCTGTTTGATTATTGAGGAGGCGAGATGTCGCAGACAGAAAATGCAGTGACTTCCTCATCGGGAAAAAAACGACCTTACAGAAGGGGTAATCCTGTTCCTGCGAGAGAACGACAAAAAGCGTCTCTTGCAAGAAGGAGTGCCACTCATAAAGCGTTTCATGCCGTCATTCAGCTTCGACTGAAAGAAAAGTTAAGTGAGCTTGCTGATGAAGACGGGATCACTCAGGCACAGATGCTTGAGTGGCTGATAGAGTCAGAGGTTAAGCGCAGAAAATCTTTGTGAGTATTTGCGTTTCTTGCTGTTTCAGTGATGAGATGTTAGATTGCTGATCGTTTTAAGGAATTTTGTGGCTGGCCACGCCGTAAGGTGGCAGGGAACTGGTTCTGATGTGGATTTACAGGAGCCAGAAAAGCAAAAACCCCGATAATCTTCACCAGGTTTGGCGACTAAGAGAAGATTACCGGGGCCCACTTAAACCGTATAGCCAACAATTCAGCTATGCGGGGAGTATAGTTATATGCCCGGAAAAGTTCAAGACTTCTTTCTGTGCTCACTCCTTCTGCGCATTGTAAGCGTAGGATGGTGTGACTGATCTTCAACAAACGTATTACCGCCAGGTAAAGAACCCGAATCCGGTGTTTACACCCCGTGAAGGTGCAGGAACGCTGAAGTTCTGCGAAAAACTGATGGAAAAGGCGGTGGGCTTCACCTCCCGTTTTGATTGCGCTATTCATGTGGCGCATGCCCGTTCGAAGGGACTGCGT
CGGCGCATGCCACCGGTGCTGCGTCGACGGGCTATTGATGCGCTGCTGCAGGGGCTGTGTTTCCACTATGACCCGCTGGCCAACCGTGTCCAGTGCTCCATCACCACGCTGGCCATTGAGTGCGGGCTGGCGACGGAGTCTGCTGCCGGAACACTCTCCATCACCCGTGCCACCCGTGCCCTGACGTTCCTGTCAGAGCTGGGGCTGATTAGCTACCAGACGGAATATGACCCGCTTATTGGCTGCAACATTCCGACCGATATCTCGTTATAAGGGCCATCTTCGTACCCTTCTTTCGACGCAATAGCTTTCAGTCTTGCGCTGGCACACGCTGAACTACCCGCATTACCATTACTACAGGCAGCAATAACCTCCTTATCACTGAATATCTTTTTCAAGCAATGCATCATATTTCTGCTGTGCCTTTTCACGCTCTGCCGGATCTTTACTGTTTTTAATCGTCTGTTTTGCTATCTCAAGTTCTGTCTTTTCAGACACGCTCAGGTAGTTATTCTCAACAGCATTCTTCCCGCTCTGTGTGCTCCCACTGCAGCTGATGCGGTGCTGTTTCCGGTTAGACCGCCGGTCAGCCCTGCTGACACGGTTGCCAGCGTACTTATCGTCTGCTTCTGCTCTTCACTGAGGTCAGACAGCTTCACTCCCGGATACAGCATTCCGATCGCTCTGGCTGCCAGCTCTCCTGTTGCTGCGCCTGCTGCACCAGCAGCAACATTATTACTCTGCATTGCTGCTACTGCACCGCCCAGAATGGCGTGAGCAATGGCCTTTACTGCCGGGTCTTTTTCTGTTGATTTCAGCAGGTATGCCAGTTCCGGAGCCGAAGCTCCCGCCAGTACTGCACCAACGTCACCACCTGCCAGCTATGGCTGGAGATGGTGGATGTTACCGATAATCAAGAAGCCTTAGATATAAGGCAATGTTTTATCTTTCTTTCATACGAAAACCAGTGCTGCCATACTAGCCCTTTTCCTGTGACATCTAAGCGAGCTCATGTTTATCGGGAAGGGTGTTAAATCATTGTTAATCATACACAGGTAATTATGCAGCGGTTATGCATAACTTCATTTCGAGCGTGTGATAACGCCAACACAACCACCAGAATTATAAAATAAAATCACAAAACCTGAGAACATATCACATTTTATCACATTAATAATTTTTCTTTATGATTTCATAACTATTTACAGCCTTTTATTAAGCCAAAAAAAGTTAAATTGCAATTATTTCATAATTATTCACGCTACAGTTTTTTTACTTTTAAAATCATATAAATATATAATCACCTTAAAACAGCAAACTCCACAAAATAATAACAAATAGAGATATTTCCACCCAAATAATGACTATCTATTCATGAGCTGACATTATGAAATTAATCATTATTTAATGCAAGTAACACAAATTAAAACAAATACCTTAATATCCATTATTGTGTGAATCTCATCACAATAATGTTTATTTTACCGCCTAGTATGAACCACGAAATCAATTGAAGTTAGCCGCCATGAAATTAAATACACAACATGGCGCATACCTTTTTATTCTAAAAAAATTAAAGGAATAATTATGGAAAAACATTACGTCGGTTCTGAAATTGGTCAATTGCGTAGTGTTATGCTGCACCGCCCAAATTTAAGTCTGAAACGTTTGACACCATCGAATTGTCAGGAACTGCTTTTCGATGATGTACTCTCGGTTGAACGGGCAGGTGAAGAGCATGACATCTTCGCAAATACGCTGCGCGATCAGGGGGTGGAAGTCCTGCTGTTAACAGACCTCCTGACACAAACCCTTGATATTAAAGAAGCGAAAACTTGGTTACTGGAGACGCAAATTTCTGACTACCGCCTCGGACCTACCTTTGCGGGCGATGTGCGCAGCTGGCTGGCGGACATGCCGCACCGTGAACTGGCGCGAAGATTAAGCGGCGGATTAACTTACGGTGAAATTCCGGCTGCCATTAACAATATGGTGGTGGA
TACCCACACGTCTAATGACTTTATTATGAAGCCGCTACCGAATCATTTATTTACCCGCGATACCTCCTGCTGGATTTATAACGGTGTTTCTATTAACCCGATGGCCAAACCAGCCCGTCAACGTGAAACCAATAACCTCCGGGCAATATATCGCTGGCACCCGGCATTTGCCGACGGCGATTTTATTAAGTATTTCGGCGACGAAAATATTTATTACGACCACGCCACTTTGGAAGGTGGCGACGTATTAGTGATTGGTCGTGGGGCGGTATTGATCGGCATGTCTGAACGCACAACACCGCAGGGCGTGGAGTTCCTCGCCAACAGCCTGTTCAAACATCGTCAGGCCGAGCGAGTGATCGCCGTTGAGCTGCCAAAACACCGCTCCTGTATGCACCTTGACACCGTCATGACCCACATCGACGTTGACACTTTCTCCGTTTACCCGGAAGTGGTGCGCAAAGACGCCCAGTGCTGGACGCTCACTTCGAACGGACGCGATGGCCTACAACGGACCCAGGAAACCGACCTGTTGCACGCCATCGAGAAAGCACTCGGTATTGACCAGGTACGCTTGATCACCACCGGCGGCGACGCCTTTGAAGCCGAACGTGAGCAGTGGAACGACGCCAATAACGTTCTGACCATCCGCCCCGGTGTGGTGATTGGTTACGAGCGCAACGTCTGGACTAACGAGAAATACGACAAAGCCGGCATCACCGTGCTGCCCATCCCGGGGGACGAATTGGGACGAGGCCGCGGCGGCGCACGCTGCATGAGCTGTCCGCTTGAACGCGACGGAATTTAAAGGAGCCATCATGGAACGAAAACCCACTTTGGTTGTGGCGTTGGGCGGCAACGCATTATTGAAGCGCGGCGAACCACTGGAAGCAGAAATCCAGCGCCAGAACATTGAGTTGGCCGCCCGTACCATCGCCGGGCTCACGGTGAATTGGCGCGTGGTGTTGGTTCACGGCAACGGTCCACAGATCGGGCTGCTGGCGCTGCAGAACAGCGCCTACGACAAAGTGACCCCTTATCCACTGGACGTTCTTGGCGCCGAAAGCCAGGGGATGATCGGCTACATGCTCCAGCAGGCGCTGAAAAACAGCCTGCCACAGCGTGAGGTGAGCGTCCTGCTTACTCAGGTGGAAGTGGACGCTACTGACCCGGCGTTCAGCAACCCGACCAAATATATCGGACCGGTGTACAACGAAGACCAGGCAAAAACACTGGCAGCAGAAAAAGGTTGGGGGTTTAAGGCCGACGGCAGCTACTTCCGTCGCGTGGTGCCATCTCCACAGCCGAAACGCATTGTCGAGAGCGATGCTATTACGGCACTGATCCAGCGCGACCATCTTGTTATCTGCAACGGCGGCGGTGGTGTACCAGTTGTGGAAAAGGCTAACGGCTATCGCGGAATTGAGGCGGTGATCGACAAAGACCTCTCTGCTGCCCTGCTGGCATACCAGATAGGGGCCGACGCACTACTGATTCTCACTGATGCCGACGCGGTTTACCTCGATTGGGGCAAACCGACCCAGCGTCCGCTAGCGCAGGTGACGCCAGAACTGCTCAGAGGCATGCAGTTTGACACCGGATCGATGGGGCCGAAAGTGGCCGCCTGCTGCAAGTTTGTTGAAGCTTGCAACGGTATTGCCGGGATCGGCGCTCTGGTCGACGGGGCTGAGATTTTGGCGGGCAATAAAGGCACATTGATTCGTAACTGAATCCCCCTTCACCTAACCCTCTCCTCAAAGGGGAGATGGCAGAGTGAGGGCATCAGACAGTTAAAATTTAAAAAGGATTTCCTAATGACCATCAATTTGAAAAAACGCAACTTCCTTAAACTGCTGGACTACACCCCGGCAGAGATCCAGTACCTGATCGATCTCGCGATCAAACTGAAAGCGGCCAAAAAAGCCGGACGAGAAAAACAGACCTTGGTTGGCAAAAACATTGCCCTGATTTTTGAAAAAACCTCCACCCGTACCCGCTGT
GCTTTCGAAGTGGCTGCGTTCGACCAAGGGGCGCAGGTGACCTACCTCGGCCCAGGCGGATCGCAAATCGGCCATAAAGAGTCAATGAAAGACACCGCCCGTGTGCTGGGCCGTATGTATGACGGCATCGAATACCGTGGTTACGGTCAGGCCATCGTTGAGGAGTTGGGCAAATACGCGGGCGTACCGGTGTGGAACGGTCTGACCGACGAATTTCACCCAACCCAAATCCTCGCAGATTTGATGACCATGCTGGAACATTCCCCGGGCAAAAAACTGTCGGAACTGAGCTTTGCCTACCTTGGCGACGCACGCAACAACATGGGTAACTCCCTGATGGTGGGGGCTGCCAAAATGGGGATGGATATCCGTCTTGTAGCCCCAAAATCCTTCTGGCCGGATGTGGTGCTTGTTGAACAGTGCCGTTCCATCGCGGAAGAGACGGGCGCACGTATCACCCTGACCGATGACGTGGAAGAAGGCGTGTGGGGAACGGATTTCCTCTACACCGATGTTTGGGTCTCAATGGGTGAACCGAAAGAGGCGTGGACCGAACGCGTCAGCCTGATGAAGCCTTATCAAATCAACGCTGACGTGATGAACGCCACCGGCAACCCGAACGTCAAGTTCATGCACTGCCTGCCAGCCTTCCACAATGAGCACACCAAAGTGGGCCGAGAAATTGAGATGGCATACGGCCTGAAGGGACTGGAGGTGACGGAAGAGGTCTTCGAATCCCCTAACTCTATCGTCTTTGATGAAGCAGAAAACCGCATGCATACCATTAAAGCGGTCATGGTGGCGACACTCGGCGACTAATCACCACCCGGTGCGTCGTAGGGGGCACCGGGTCTCAGGAGAACATCATGGGTAAGTTCAAATTTCCCTCCGCATATACCATTCTCTTTTTTCTGATTGCCATCGTTGCCGTCCTGACGTGGATCATTCCAGCCGGGCAGTATCACATGGCAATGAACGAAGCTCTCGGCAAGGAGGTTCCTGTTGCCGGCACCTATGCACACGTAGAGGCCAATCCGCAGGGACTGATTTCAGTGCTGATGGCCCCAATTGCCGGGTTGTATGATCCAGACTCCGGTCAGGCTAGGGCGATAGACGTTGCGCTGTTTATTCTGATCATCGGAGGATTCCTCGGGATCGTCACCAAAACCGGGGCCATTGACGCCGGAATCGAGCGCGTCACCACCCGACTACGTGGTCGCGAAGAGTGGATGATCCCGATCCTTATGGCGCTGTTTGCTGCTGGCGGTACAATTTACGGCATGGCCGAAGAATCCCTGCCGTTTTATACCCTGCTGGTGCCGGTGATGTTGGCAGCACGCTTCGACCCTGTGGTAGCCGCCTCCACCGTGCTGCTCGGCGCCGGGATCGGCACGCTCGGCTCCACCATTAACCCTTTCGCGACGGTGATCGCCGCCAATGCAGCCGGGATCCCCTTCACCAACGGTATCACCTTGCGTGTGGTGGTGCTTGTCATCGGCTGGATAATCTGCGTGACATGGGTGATGCGCTATGCCCGGAAAGTTCGCAAGGAGCCGTCTCTCTCCATTATTGCGGATAAACAAGAGGAGAACCTCGCCCACTTCCTCGGCAATAAAAGCGAACAGGCTCTGGAGTTCACCCCGGTACGCAAAATTATTCTGGTGATTTTCGCCCTTACCTTCGCGGTCATGATCTACGGCGTGGCGGTGCTGGGTTGGTGGATGGCGGAGATCTCAACGGTATTTCTGGCCAGCGCAATTATCATCGGTCTGATCGCCCGCATGAGCGAAGAGGAACTGACCTCTACCTTTATCAACGGCGCGCGAGATTTGCTGGGCGTTGCACTGATTATCGGTATCGCGCGCGGTATCGTAGTGATCATGGATAAAGGTATGATTACCCATACTATTTTGCACTACGCCGAGGGAATGGTTACTGGATTATCGACAGTAGCATTCATCAACGTGATGTATTGGCTGGAAGTGGTGCTGTCGTTTCTT
GTGCCTTCTTCGTCCGGTCTGGCCGTTCTGACGATGCCGATCATGGCACCTCTTGCCGACTTCGCTAACGTCAACCGCGACCTGGTAGTTACGGCTTACCAGTCTGCGTCCGGTATCGTTAACCTGATCACTCCCACCTCTGCCGTTGTGATGGGCGGACTGGCTATCGCCCACGTGCCTTACGTGCGATATCTGAAATGGGTTGCGCCGCTGCTTGGGATATTAACAGTGGTAATTATGGTGGCATTAAGTCTGGGGGCATTGTTGTAATTTGCCGGATGGCGCTGCCCCTATCCGGTCTACGGAATGGTGTGGTGTCACCGGTTATTTCGAGAACTGAATATAGGATTATGATGGATTACGAAGATTTCTCTCCCAAAGAGCAACTACAGCTAACGGTCTGCCAACGTCTGATTGCAGAAAAGAGCTATTTTTCCCAAGAGGAGCTTCGCCGCGACTTACAGGAGCGTGGTTTTGAGACAATCAGCCAGTCCACCGTTTCTCGTCTGCTCAATTTGTTAGGTGTCATAAAAATTCGAAATGCCAAAGGGCTAAAAGTCTATTCGCTGAATCCACAGTTGCGTCCGGCTCCTGATGCCGCGCGCACTGTGTCCGAAATGGTGGTGAGCGTTGAGCACAATAGAGAATTTATCCTTATCCATACCGTTGCTGGATATGGCCGCGCAGTGGCACGTGTCCTTGATTATCACCAGTTACCAGAAATTTTAGGCGTGGTGGCCGGAAGCAGTATCGTCTGGGTCGCTCCTCGGATGGTGAAGCGTGTCGCTCTGGTGCATAAGCAAATTAATTATTTACTAAGAACGTATTAATATTCACAAATGCCCGCTTGTATTGACTAATACGTAAGCGTCAACGGAGCACCGTATTGACGCTTATTTATTGGTGAGAACTACGTTCCATGGCAGGAGTTCGTCAACACGGTTGGAGGGCCATTCCGGCAGTACGCTCAGAATATGGCGCAGATACGCTTCCGGATCGATACCGTTCAGACGGCAGGTGCCGATCAGCCCGTACAGCAGTGCACCACACTCGCCGCCGTGATCGCTACCGAAGAACACGTAATTTTTCTTTCCGAGACAGAATGCACGAAGCGCTCTTTCCGCTGTGTTATTGTCCGCCTCCGCCAGACCGTCATCACTGTAATAACAGAGGGCGTCCCACTGATTCAGTACATAGCTGAACGCTTCGCCCAGCCTGGATTTTTTCGACAGCGTGCCTGAGCCCGCAGATATTCTATTTCCCGTTCATCTTCTTCGATCTTTTCTTCGGCACGTGCCAGTGCAGAGCGCAGGAAGGCCTCCGTCTCTTCAACCAGACTCAGTTGCTGGTCTTTCTGACGGAGCTGGCTTTCCAGTTCTGCAATGCGAATGAGGTATTTCTGACTCATGGCCGTTTTTATAATGCGGCCAGGCGTTTTTTACAACATTGTCAGGGCGTTAAGGCGGGATGTTTTTGGCTGACGCCAGTCCAGCTTATCGAGGAGCATTGCCAGTTGCGAGCGGGTAATGGATACCTTGCCGTCACGTACCACAGGCCAGATAAACTGGCCTTCCTCCAGGCGTTTGGTGAACAGGCACAGACCATCAGCATCAGCCCAAAGAATTTTAACGGTGTCACCCCGTCGGCCACGGAAGATAAACAGGTGACCGGAGAAGGGATTATCATTCAGCACATGTTGTACCTGTTCTCCCAGTCCGTTGAAGGATTTACGCATATCGGTAACGCCGGCAACGAGCCAGATACGGGTACCTGATGGGAGTGAGATCATCTTCCCCTCCCGGTCAGTTCACGAATCAATACAGTGAGCAGCTCTGGTGAAGGATTTTCCAGCGTCATGTTACCGTGACGGAACTCCACCTTGCAGGAGCTGGCACTGACAGTAGTCTGAGTGGATAAGGACGGAGTAAGAGCAGCCATCGGTTCTTTCGGCTCATCAGGCGTTATCTCTACAGGTAATAATTCAACGCCTGCGTCAGAAG
TGGTTGTCACCGGAATACGCCGTGATATACGCCCTTCGTTTTGCCAGAGTCTGAGCCATTTGAAAATAACATTATCATTGACGCCATTTTCTCGTGCAATCTGGGCCACACAAGCACCGGGTTGTGATGCCAGTTCAACCATACGAATTTTGAATTCATTCGAATACTTTTTACGAGGTTCTTTTCGTCAGTCCTGTAATTCCATACTTAGATGTCCGTCTGTGTCAGATGGGCGTCTAAGGGTAATGACTCCAACTTATTGATAGTGTTTTATGTTCAGATAATGCCCGATGACTTTGTCATGCAGCTCCACCGATTTTGAAAACGACAGCGACTTCCGTCCCAGCCGTGCCAGGTGCTGCCTCAGATTCAGGTTATGCCGCTCAATTCGCTGCGTATATCGCTTGCTGATTACGTGCAGCTTTCCCTTCAGGCGGGATTCATACAGCGGCCAGCCATCCGTCATCCATATCACCACGTCAAAGGGTGACAGCAGGCTCATAAGACGCCCCAGCGTCGCCATAGTGCGTTCACCGAATACGTGCGCAACAACCGTCTTCCGGAGACTGTCATACGCGTAAAACAGCCAGCGCTGGCGCGACATAGCCCCACTGTTCGTCCATTTCCGCGCAGACGATGACGTCACTGCCCGGCTGTATGCGCGAGGTTACCGACTGCGGCCTGAGTTTTTTAAGTGACGTAAAATCGTGTTGAGGCCAACGCCCATAATGCGGGCAGTTGCCCGGCATCCAACGCCATTCATGGCCATATCAATGATTTTCTGGTGCGTACCGGGTTGAGAAGCGGTGTAAGTGAACTGCAGTTGCCATGTTTTACGGCAGTGAGAGCAGAGATAGCGCTGATGTCCGGCGGTGCTTTTGTCGTTACGCACCACCCCGTCAGTAGCTGAACAGGAGGGACAGCTGATAGAAACAGAAGCCACTGGAGCACCTCAAAAACACCATCATACACTAAATCAGTAAGTTGGCAGCATCACCTATTGCAGCGTTACTTATCCGGATTATCCTCAGAAGTGTAAATCAGCATTCTCCTTTGCTGATGCAATTGCATGCGGCAGGTATACGGACCGGTGATGCAGAGCGAATACTGTCCAGCGGTGAATGCTGGCAACGTCAGAAGACGCTGCTGACAGGAAGGGAAGTCAGTTTTATGAAAGGACTGTTCAGAATTGTGGATATGAAGCGGTGGTATCTGTGTCCGCAGGTACGGGTCGCGGATATCGTCCAGCTGAACGGGAATATCCGGCCACGATCACGCCAGTGGTGGCAGTTATTCAGGATGGTGTCTCAGTGGCATGTTGATGTGGTTATCGTTGAGCGGCGTTCGTTCAGTATTGTTGCAGCAGTAGAGCTGGATGATGCCAGTCATTTACGACCGGAACGCAGACGCCGGGATATTCTTCTGGAAGAGGTTCTGAGGCAGGCTGGTATTCCGTTGCTCAGAAGCCACGATGCCAGAAAACTGCTGCAGATGACCGGAGAATGGCTGAATACAACAGGGGCTGATCAGCAGTCCCCGGAACATCGTAGCTGACGCCTTCGCGTTGCTCAGTTGTCCAACCCCGGAAACGGGAAAAAGCAAGTTTTCCCCGCTCCCGGCGTTTCCATAACTGAAAACCATACTATTTCACAGTTTAAATCACATTAAAGGGCTGCGTTTGGGAAACGTACAATATTGTACGGTTGCGTTATCGCGCTCGGGTGTACTCCCGTGCTATACGGCGTTTTCAGAGGATTTGACCTCAATCGGATAACCTAAATGCAATACTGCAATGGCGATAACTTAAAAGGGATAGACGTTGACGCAAAATCAATCTGTTATGCATATCTATTGGTTAAATAATGCGCTTAACGTTCAAAATTTTTCGATCTCCAGACTGCCCCCGTTCAGCTCGCATAGGAATTATGGGAGAGGATTTTGGCTGAACCGGACAGATATAAACCCGTTACTTTTTATACCTTACAGATATGGTTCTTTTG
ACGTAATTCTTACCAATTTCATAGTCCGGGTTCCATGTCGCCAGATTATGGACACCGATTGCGGCAGCGCTCACAGCTCCCAGAAGTGCTGTTGGTGACCATAGGGAGCCTGCCACCGCAGTGCTTCTGTATTCGGTGTATACGCGGCAGGTTAATTTCTTCTGCTGATGTTCAATCACATACTCCACTATTTTGACGGTTTCACTGATGTCGTTAACATTCGTGATATCAAAACTTTTGCGGTAAAAAATCAGGTCGTGTAGTTTCTGTTTGGTCAAAAATTCTGCGTTGAACGACTCAATCTGTACCGTTTCTGACATCCGTTGTTCCTCTCTTTCTTGGAGTCTGTGAATTTCCCTAGGCACGCGCCTTACGGTCAAACGTTGCACTTTTTCAGGGGCAGTGTATGTCAGAAAAGAATATATTGACTTGTATAAACGATCAATAATAAGAAATTGATCGTCTTAACTGTAAGCGTAAACCGACTGCCGTATGTGGGGGCGGTTTGGAGATCGGACATTTTTGTACTATAAGGGCTTCCCCGAACAGTCAACGTCGGTATTTATGGTTTGTGTCTGCACTACAGTTAAAAATAAAGGGGAGAGACTGCGGTACTATAAACGGCCATAATGAATCGCTATCGATTCGGACCCTGATGAAAGACGCGCGTGAGGCTCTTGTTCGTTAAGCGGATACGTATATCCTGTACAATAATCAGATATACCTCACGGGGTCTAACCCTTTATCATCAACGAGTTGCAGATCAGATGGTCATACGGTTTCTTCCTGCTTTAGCTTGCTGAATACAGCCTGAAATCATCACCGTAATGCAATGATGCCCAGCACAGCCTTGCATTTTTGGCGGCGATGGCCACAACAGCGCGCCAGTATCCTCTGCGTTCGACCAGTGAACAGACCCAACGACTGAATAAGTCAGTCCTTTTCTCAGCGCCAATCAGTACTGAACGAGCCCCCTGTACCAGAAGCGTTCGCAGATACGAATCACCGGCTTTTGTTATCCTGCCTAACTTCGACTTTCCGCCGCTACTGTATTGCGATGGAGTCAACCCCAGCCAGGCCGCCAGTTGCCGTCCATTTTTAAAATCATGTGCATTACCGATACAGGCAACCAGAGCACAGGCAGTTGTGGGACCAACTCCTTTTAGCTTCATCAGCCGCTGACTGCGGTGATCTGTTTTGGCTATGCGGGACAAAATTCGGTCATATTCAGTGATGTTAGCTTCAATGCGATCAACGTGTTCCAGTAAATCATCTACACATTGCTGAACCTGAAGAGGTAAAGAGCTCTTCTGCGCAGAAACCATGTGCCGCAAAGCATCTGTACTTTGTGGGGCAATGACGCCGAATTCTGATATCAACCCTCGAAGACGATTATATGTTGCTGTTTTCTCTTCGATAAAACCCTGTCGGTTACGATGCAGACACTGCATCGCCTGCTGGCTCTCGTCCTTAACTGGAACAAACCGCATATGTGGGCGACGAACCGCTTCGCAGATAGCCTGAGCATCAGCTGCATCATTTTTCCCTGATTTACCGGCCATGCGGTATGGAGATACAAATTTAGCGGCCATCAGACGCGGCTCATGACCATATTGCAGAAACAATCTTGCCCAGTAATAAGCCCCGGAGCACGCCTCCATCCCGATAACACAGGGCGGTAAACTTGCAATCAGCCCGGGAAGAGCTGCGCGGGTTACTCTGGATTTCACCAGAACGGTTTTGCCATTTTGGTCAACCCCATGAACAGCGAACACATTTTTAGCAAGATCGATACCGATAGTAGTGATGGTCATAACGAATCTCTCCAGGCTATGTTTACCCCATGATTGCACAAGAGTTAATCAGGTGCATATCCAGGGGAAGTCCCTTCCATTCGTTAGGGGCATTTTTTAACCGATACTGATAAGTAACTGATTGATTCATCGTGATCAGCTATCGCCTTTAAGTTATTGCTTTTCTCTTATCGCATTTAGGTTATC
TCATGTCCCTAAAGGTCTGCCAAACACCGTATAGCGTCCGAGCGCGATAACGCAACCGTACAATATTGTACGTTTCCCAAACGCAGCCCTTAAACGACAGTAATCCCCGTTGATTTGTGCGCCAACACAGATCTTCGTCACAATTCTCAAGTCGCTGATTTCAAAAAACTGTAGTATTCTCTGCGAAACGATCCCTGTTTGAGTATTGAGGAGGCGAGATGTCGCAGACAGAAAATGCAGTGACTTCCTCATCTGGTGCAAAACGAGCATACAGAAAGGGTAAACCTTTGAGTGAAGCAGAACGACAGAGAGCAGCTTCTGCGCGCAAACGATCTGTATGTAAAGAGATCAAGGTTTTCGTCAGACCAGAACTGAAAAACTGCCTTACCAGTCTGTGTGCAGAGGAAGGGGTAACTCAGGCAGAGTTGATAGAAAGGCTTATCGAGAAAGAGGCCTGTCACCGGAACATGATGTGATGGGGTCACATTCTTGCTAGTTTTTTCTGTCAGTGCTAGATTACTGATCGTTTAAGGAATTTTGTGGCTGGCCACGCCGTAAGGTGGCAGGGAACTGGTTCTGATGTGGATTTACAGGAGCCAGAAAAGCAAAAACCCCGATAACCTTCACCAGGTTTGGCGACTAAGAGAAGATTACCGGGGCCCACTTAAACCGTATAGCCAACAATTCAGCTATGCGGGGAGTATAGTTATATGCCCGGAAAAGTTCAAGACTTCTTTCTGTGCTCACTCCTTCTGCGCATTGTAAGCGCAGGATGGTGTGACTGATCTTCAACAAACGTATTACCGCCAGGTAAAGAACCCGAATCCGGTGTTTACACCCCGTGAAGGTGCAGGAACGCTGAAGTTCTGCGAAAAACTGATGGAAAAGGCGGTGGGCTTCACCTCCCGTTTTGATTTCGCTATTCATGTGGCGCATGCCCGTTCGAAGGGACTGCGTCGGCGCATGCCACCGGTGCTGCGTCGACGGGCTATTGATGCGCTGCTGCAGGGGCTGTGTTTCCACTATGACCCGCTGGCCAACCGTGTCCAGTGCTCCATCACCACGCTGGCCATTGAGTGCGGGCTGGCGACAGAGTCCGGTGCAGGAAAACTCTCCATCACCCGTGCCACCCGGGCTCTGACGTTCCTGTCAGAGCTGGGACTGATTACCTACCAGACGGAATATGACCCGCTTATCGGGTGCTACATTCCGACCGATATCACGTTCACATCTGCACTGTTTGCTGCCCTCGATGTATCAGAGGAGGCAGTGGCCGCCGCGCGCCGCAGCCGTGTGGAATGGGAAAACAGACAGCGCAAAAAGCAGGGGCTGGATACCCTGGGTATGGATGAACTGATAGCGAAAGCCTGGCGTTTTGTCCGTGAGCGTTTCCGCAGTTACCAGACAGAGCTTAAGTCCCGGGGAATAAAGCGTGCCCGTGCGCGTCGTGATGCGAACAGGGAACGTCAGGATATCGTCACCCTGGTGAAACGGCAGCTGACGCGTGAAATCTCGGAAGGGCGCTTCTCTGCCAGTCGTGAGGCGGTAAAACGTGAAGTGGAGCGTCGTGTGAAGGAGCGCATGATTCTGTCACGTAACCGCAATTACAGCCGGCTGGCCACCGCTTCCCCCTGAAAGTGACCTCCTCAGAATAATCCGGCCTGCGCCGGAGGAATCCGCACGTCTGAAGCCCGCCAGTACAGAAAAAAACAGCACCACATACAAAAAACAACCTCATCATCCACTTTCAGATGCATCCGGTTCTCTCTGTTTTTGATACAAAACACGCCTCACAGACGGGGAATTTTGCTTATCCACATTAAACTGCAAGGGACTTCCCCATAAGGTTACAACCGTTCATGTCATAAAGCGCCAGCCGCCAGCGTTACAGGGTGCAATGTATCTTTTAAACACCTGTTTATATCTCCTTTAAACTACTTAATTACATTCATTTAAAAAGAAAACCTGTTCATTGCCTGTCCTGTGGACAGACAGATATG
CACCTCCCACCGCAAGCGGCGGGCCCCTACCGGAGCCACTTTAGTTACAACACTCAGACACAACCACCAGAAAAACCCCGGTCCAGCGCAGAACTGAAACCACAAAGCCCCTCCCTCATAACTGAAAAGCGGCCCCGCCCCGGCCCGAAGGGCCGGAACAGAGTCGCTTTTAATTATGAATGTTGTAACTACATCTTCATCGCTGTCAGTCTTCTCGCTGGAAGTTCTCAGTACACGCTCGTAAGCGGCCCTCACGGCCCGCTAACGCGGAGATACGCCCCGACTTCGGGTAAACCCTCGTCGGGACCACTCCGACCGCGCACAGAAGCTCTCTCATGGCTGAAAGCGGGTATGGTCTGGCAGGGCTGGGGATGGGTAAGGTGAAATCTATCAATCAGTACCGGCTTACGCCGGGCTTCGGCGGTTTTACTCCTGTATCATATGAAACAACAGAGTGCCGCCTTCCATGCCGCTGATGCGGCATATCCTGGTAACGATATCTGAATTGTTATACATGTGTATATACGTGGTAATGACAAAAATAGGACAAGTTAAAAATTTACAGGCGATGCAATGATTCAAACACGTAATCAATATCTGCAGTTTATGCTGGTTATGCTGGCTGCATGGGGCATTAGTTGGGGAGCCAGATTTGTCATGGAGCAGGCCGTTCTGCTTTATGGATCAGGAAAAAACTATTTGTTCTTCAGTCATGGTACTGTTCTGATGTACCTGCTGTGTGTTTTCCTGGTATACCGCCGTTGGATAGCTCCGCTACCGGTCGTTGGTCAGCTGCGCAACGTTGGCGTACCGTGGCTGGTCGGTGCGATGGCCGTGGTGTATGTCGGTGTATTTCTGCTCGGTAAGGCGCTGGCTCTGCCTGCTGAGCCATTTATGACGAAACTTTTTGCCGATAAGTCCATACCTGACGTGATCCTGACGTTGCTGACCATCTTTATCCTTGCCCCGTTGAATGAGGAAACGCTGTTCCGGGGGATTATGCTGAACGTCTTCCGTTCACGGTACTGCTGGACGATGTGGCTGGGGGCGCTGATAACGTCGTTGTTGTTCGTCGCCGCGCACAGCCAGTATCAGAACCTGCTGACACTGGCAGAACTGTTCCTGGTGGGGTTGATTACATCAGTGGCCAGGATCAGAAGTGGTGGCCTGCTGCTGCCGGTATTGCTGCATATGGAAGCAACCACGCTGGGTTTACTGTTTGGTTGAAAGTTATATTTTTATTAAACATTGTGCGTTAAAGCCTGGTGTGTTTTTTTAGTGGATGTTATATTTAAATATAACTTTTATGGAGGTGAAGAATGCATACCACCCGACTGAAGAGGGTTGGCGGCTCAGTTATGCTGACCGTCCCACCGGCACTGCTGAATGCGCTGTCTCTGGGCACAGATAATGAAGTTGGCATGGTCATTGATAATGGCCGGCTGATTGTTGAGCCGTACAGACGCCCGCAATATTCACTGGCTGAGCTACTGGCACAGTGTGATCCGAATGCTGAAATATCAGCTGAAGAACGAGAATGGCTGGATGCACCGGCGACTGGTCAGGAGGAAATCTGACATGGAAAGAGGGGAAATCTGGCTTGTCTCGCTTGATCCTACCGCAGGTCATGAGCAGCAGGGAACGCGGCCGGTGCTGATTGTCACACCGGCGGCCTTTAATCGCGTGACCCGCCTGCCTGTTGTTGTGCCCGTAACCAGCGGAGGCAATTTTGCCCGCACTGCCGGCTTTGCGGTGTCGTTGGATGGTGTTGGCATACGTACCACAGGTGTTGTACGTTGCGATCAACCCCGGACAATTGATATGAAAGCACGGGGCGGAAAACGACTCGAACGGGTTCCGGAGACTATCATGAACGAAGTTCTTGGCCGCCTGTCCACTATTCTGACTTGAACATGGGGTTTGAGGGGCAACTGGATGAAAACGTACGATTTAGGGCACTAAAACCGCTGTTGTCCCACCATTCTGGTGATTCCCAAACG
TTATTTGGCTAAAAGGGGTCGGTTCCGGCTGAGGGCGAAATGACACCCTAAGCTTTCGGTTCCTTGGGCCAAAGATATTCGCCAGTCAGTAGAATGTGCGCCCAGCCCAATGGGGATATGTGGGGAAGAAATTCAGGGGGAACATCCAACCCTTCGTTCCGCCGCTCCGTGACGGCATGACCAAGATGGACGGTATTCCAGTAAATGATCACCGCAGTCAATAAATTGAGCCCAGCGATTCGGTAGTGCTGCCCCTCTGTCGTGCGATCGCGAATTTCCCCCTGCCTCCCGATACGGAGCGCATTTTTGAGCGCATGGTGGGCCTCTCCCTTGTTAAGACCGATCTGAGCACGCCGCTGCATGTCCGTATCCAGGATCCACTCAATAATGAAAAGGGTCCGTTCAATACGACCAACTTCACGAAGCGCAACTGCAAGGTTGTTTTGTCGTGGGTAAGAAGCGAGCTTGCGCAGGAGTTGGCTGGGCCTGATTTTGCCAGCGGTCATCGTCGCGGCACAACGGAAAATATCAGGCCAGTTCGCAACGATAAGATCCTCCCGGGCTTTTCCACCTACCAACTTGCGTAACTCCCTGGGGGTCGTATCGGGATTAAATACGTACAACCGCTTCGATGGCAGATCCCTGATTCGCAGAACGAGATTGTAGCCGAGCAGGCTACTGGCTCCGAACAAATGGTCGGTGAATCCTGCTGTATCGGCATACTGTTCGCGAACATGGCGACCGACCTCGTTCATCAGTAGTCCATCGAGAATATACGGTGCCTCGCTCACGGTCGCCGGGATCGACTGACAAGCGAATGGCGCGAACTGGTCGCTTACGTGAGTATACGCTTTGAGGCCGGGAACAGAACCATATTTGGCATTGACCATGTTCATGGCTTCGCCATGCCGCGCTGTCGGGAAAAACTGACCATCGCTCGATGCTGACGTGCCCATCCCCCAGACGCGTGACATCGGCAGTTTACCCTGCGCGGCCACCACAATTGCCAATGCCTGGTTCATGGCTTCGCTTTCAACATGCCAGCGGGCAAGGCGTGAGAGCTGCCAGTAATCATGCGTGTTTGTAGCTTCCGCCATCTTACGCAGGCCCAGATTGAGCCCTTCAGCGAGCAGGACGTTGAGCAGACCGATCCGGTCGCGACATGGAGCCCCGGTTCTCAGATGGGTAAACGCATCTGTGAAACCAAGGGCTGCATCAACTTCAAGCAGCATGTCGGTAATCCGAACGGACGGCATTCGGCGATACAGATCCAGTATGAGTGCCTCGGCACCATCCGGCACGTCTGCTGTCAACCTGTCGATCCGCAACGTTCCATCTTCTATGCTACCGTGCGGAATAGTGCCGTTACGGGCAGCCCGGGCCAGCCGCTTAAGAGCGATCGTGAGTCGCGCCTTTCTGTCTGCCAGCCAATCCTGTGGGTTGGAAGGCACGGCCAGTTTTGCATTTTCCTGCGCCGCGATCATCGGCACCAGTACCTGCTTGAGGTCACCATAGCGGCGCGAATGAGCGAGCCAGACATCTCCGGAACGAAAAGCATCCCGGAGGTGAAAGAGTACCGCCACTTCCCAAAGACGGGTATCTCCTTTTTCCTGAGCTCGTAAATGACGGTTCCATTTGGAGCTGGGCCGCAGGAAACGCCTTTCTGGCGATGCAACACCTTTCATCTCTCCGATCGACAAAGCTGCTGCTACCAATGGTCCGGCGACCGGCGCGGCTTCGAGCTTCAGACAGCGCAACATGCGGGGCGCATAACGACGAAAGCGATGGTATCCCTGCCCGACATATGCAAGAGGGCACTGTTGCAAATAGTCGGTGGTGATAAACTTATCATCCCCTTTTGCTGATGGAGCTGCACATGAACCCATTCAAAGGCCGGCATTTTCAGCGTGACATCATTCTGTGGGCCGTACGCTGGTACTGCAAATACGGCATCAGTTACCGTGAGCTGCAGGAGATGCTGGCTGAACGCGGAGTGAATG
TCGATCACTCCACGATTTACCGCTGGGTTCAGCGTTATGCGCCTGAAATGGAAAAACGGCTGCGCTGGTACTGGCGTAACCCTTCCGATCTTTGCCCGTGGCACATGGATGAAACCTACGTGAAGGTCAATGGCCGCTGGGCGTATCTGTACCGGGCCGTCGACAGCCGGGGCCGCACTGTCGATTTTTATCTCTCCTCCCGTCGTAACAGCAAAGCTGCATACCGGTTTCTGGGTAAAATCCTCAACAACGTGAAGAAGTGGCAGATCCCGCGATTCATCAACACGGATAAAGCGCCCGCCTATGGTCGCGCGCTTGCTCTGCTCAAACGCGAAGGCCGGTGCCCGTCTGACGTTGAACACCGACAGATTAAGTACCGGAACAACGTGATTGAATGCGATCATGGCAAACTGAAACGGATAATCGGCGCCACGCTGGGATTTAAATCCATGAAGACGGCTTACGCCACCATCAAAGGTATTGAGGTGATGCGTGCACTACGCAAAGGCCAGGCCTCAGCATTTTATTATGGTGATCCCCTGGGCGAAATGCGCCTGGTAAGCAGAGTTTTTGAAATGTAAGGCCTTTGAATAAGACAAAAGGCTGCCTCATCGCTAACTTTGCAACAGTGCCTTTAAGCGTGCATAATAAGCCCTACACAAATTGGGAGTTAGACATCATGAGCAACGCAAAAACAAAGTTAGGCATCACAAAGTACAGCATCGTGACCAACAGCAACGATTCCGTCACACTGCGCCTCATGACTGAGCATGACCTTGCGATGCTCTATGAGTGGCTAAATCGATCTCATATCGTCGAGTGGTGGGGCGGAGAAGAAGCACGCCCGACACTTGCTGACGTACAGGAACAGTACTTGCCAAGCGTTTTAGCGCAAGAGTCCGTCACTCCATACATTGCAATGCTGAATGGAGAGCCGATTGGGTATGCCCAGTCGTACGTTGCTCTTGGAAGCGGGGACGGACGGTGGGAAGAAGAAACCGATCCAGGAGTACGCGGAATAGACCAGTTACTGGCGAATGCATCACAACTGGGCAAAGGCTTGGGAACCAAGCTGGTTCGAGCTCTGGTTGAGTTGCTGTTCAATGATCCCGAGGTCACCAAGATCCAAACGGACCCGTCGCCGAGCAACTTGCGAGCGATCCGATGCTACGAGAAAGCGGGGTTTGAGAGGCAAGGTACCGTAACCACCCCATATGGTCCAGCCGTGTACATGGTTCAAACACGCCAGGCATTCGAGCGAACACGCAGTGATGCCTAACCCTTCCATCGAGGGGGACGTCCAAGGGCTGGCGCCCTTGGCCGCCCCTCATGTCAAACGTTGGGCGAACCCGGAGCCTCATTAATTGTTAGCCGTTAAAATTAAGCCCTTTACCAAACCAATACTTATTATGAAAAACACAATACATATCAACTTCGCTATTTTTTTAATAATTGCAAATATTATCTACAGCAGCGCCAGTGCATCAACAGATATCTCTACTGTTGCATCTCCATTATTTGAAGGAACTGAAGGTTGTTTTTTACTTTACGATGCATCCACAAACGCTGAAATTGCTCAATTCAATAAAGCAAAGTGTGCAACGCAAATGGCACCAGATTCAACTTTCAAGATCGCATTATCACTTATGGCATTTGATGCGGAAATAATAGATCAGAAAACCATATTCAAATGGGATAAAACCCCCAAAGGAATGGAGATCTGGAACAGCAATCATACACCAAAGACGTGGATGCAATTTTCTGTTGTTTGGGTTTCGCAAGAAATAACCCAAAAAATTGGATTAAATAAAATCAAGAATTATCTCAAAGATTTTGATTATGGAAATCAAGACTTCTCTGGAGATAAAGAAAGAAACAACGGATTAACAGAAGCATGGCTCGAAAGTAGCTTAAAAATTTCACCAGAAGAACAAATTCAATTCCTGCGTAAAATTATTAATCACAATCTCCCAGTTAAAAACTCAGCCATAGAAAACACC
ATAGAGAACATGTATCTACAAGATCTGGATAATAGTACAAAACTGTATGGGAAAACTGGTGCAGGATTCACAGCAAATAGAACCTTACAAAACGGATGGTTTGAAGGGTTTATTATAAGCAAATCAGGACATAAATATGTTTTTGTGTCCGCACTTACAGGAAACTTGGGGTCGAATTTAACATCAAGCATAAAAGCCAAGAAAAATGCGATCACCATTCTAAACACACTAAATTTATAAAAAATCTAATGGCAAAATCGCCCAACCCTTCAATCAAGTCGGGACGGCCAAAAGCAAGCTTTTGGCTCCCCTCGCTGGCGCTCGGCGCCCCTTATTTCAAACGTTAGACGGCAAAGTCACAGACCGCGGGATCTCTTATGACCAACTACTTTGATAGCCCCTTCAAAGGCAAGCTGCTTTCTGAGCAAGTGAAGAACCCCAATATCAAAGTTGGGCGGTACAGCTATTACTCTGGCTACTATCATGGGCACTCATTCGATGACTGCGCACGGTATCTGTTTCCGGACCGTGATGACGTTGATAAGTTGATCATCGGTAGTTTCTGCTCTATCGGGAGTGGGGCTTCCTTTATCATGGCTGGCAATCAGGGGCATCGGTACGACTGGGCATCATCTTTCCCGTTCTTTTATATGCAGGAAGAACCTGCATTCTCAAGCGCACTCGATGCCTTCCAAAAAGCAGGTAATACTGTCATTGGCAATGACGTTTGGATCGGCTCTGAGGCAATGGTCATGCCCGGAATCAAGATCGGGCACGGTGCGGTGATAGGCAGCCGCTCGTTGGTGACAAAAGATGTGGGGCACTGTTGCAAAGTTAGCGATGAGGCAGCCTTTTGTCTTATTCAAAGGCCTTACATTTCAAAAACTCTGCTTACCAGGCGCATTTCGCCCAGGGGATCACCATAATAAAATGCTGAGGCCTGGCCTTTGCGTAGTGCACGCATCACCTCAATACCTTTGATGGTGGCGTAAGCCGTCTTCATGGATTTAAATCCCAGCGTGGCGCCGATTATCCGTTTCAGTTTGCCATGATCGCATTCAATCACGTTGTTCCGGTACTTAATCTGTCGGTGTTCAACGTCAGACGGGCACCGGCCTTCGCGTTTGAGCAGAGCAAGCGCGCGACCATAGGCGGGCGCTTTATCCGTGTTGATGAATCGCGGGATCTGCCACTTCTTCACGTTGTTGAGGATTTTACCCAGAAACCGGTATGCAGCTTTGCTGTTACGACGGGAGGAGAGATAAAAATCGACAGTGCGGCCCCGGCTGTCGACGGCCCGGTACAGATACGCCCAGCGGCCATTGACCTTCACGTAGGTTTCATCCATGTGCCACGGGCAAAGATCGGAAGGGTTACGCCAGTACCAGCGCAGCCGTTTTTCCATTTCAGGCGCATAACGCTGAACCCAGCGGTAAATCGTGGAGTGATCGACATTCACTCCGCGTTCAGCCAGCATCTCCTGCAGCTCACGGTAACTGATGCCGTATTTGCAGTACCAGCGTACGGCCCACAGAATGATGTCACGCTGAAAATGCCGGCCTTTGAATGGGTTCATGTGCAGCTCCATCAGCAAAAGGGGATGATAAGTTTATCACCACCGACTATTTGCAACAGTGCCCCATCAGGGACAAAGATCTGGCTGGTCGCTGGCATCACCGATATGAGAAACGGCTTCAACGGCCTGGCGGCAAAGGTGCAGACGACGCTGAAAGACGATCCGATGTCAGGTCACGTTTTTATCTTCCGTGGGCGTAATGGCAGTCAGGTAAAGCTCCTCTGGTCTACCGGCGATGGACTGTGTCTGCTGACCAAACGGCTGGAGCGCGGCCGCTTCGCCTGGCCGTCAGCCCGGGATGGCAAAGTGTTCCTCACACCGGCACAGCTGGCGATGCTCCTTGAAGGTATCGACTGGCGGCAGCCTAAAAGACTGCTTACGTCCCTGACTATGTTGTAAGCCTCTTTATCCTGGTCGACGCTGA
ATGAGCCTGGTAATATACCCGGTATGAGCAGCTCACTTCCTGACGATATCAATGCACTGAAACGTCTCCTTGCCGAACAGGAGGCGCTGAACCGTGCCCTGCTGGAAAAGCTGAACGAGCGTGAACGCGAAATAGACCATCTGCAGGCGCAGCTGGATAAACTCCGCCGGATGAACTTCGGCAGTCGTTCCGAAAAAGTCTCCCGCCGTATCGCACAAATGGAAGCCGATCTGAACCGGCTTCAGAAAGAGAGCGATAGAGAGCGATACGCTGACTGGTAGGGTGTATGACCCGGCAGTACAGCGTCCGTTGCGTCAGACCCGCACCCGTAAGCCGTTCCCTGAATCACTACCCCGTGACGAAAAGCGGCTGTTGCCTGCGGCGCCGTGCTGCCCGAACTGCGGCGGTTCACTGAGCTATCTGGGCGAGGATACCGCCGAACAGCTGGAGTTGATGCGTAGCGCCTTCCGGGTTATCCGGACGGTACGGGAAAAACATGCCTGTACTCAGTGCGATGCCATCGTGCAGGCACCTGCACCTTCGCGGCCCATCGAGCGGGGTATCGCCGGACCGGGGCTGCTGGCCCGCGTGCTGACCTCGAAGTATGCAGAGCACACCCCGCTGTATCGCCAGTCAGAAATATACGGCCGGCAAGGTGTGGAGCTGAGCCGTTCACTGCTGTCGGGCTGGGTGGATGCATGCTGCCGGCTGCTGTCTCCGCTGGAAGAGGCGCTTCATGGCTATGTGCTGACTGACGGTAAGCTCCATGCTGATGACACGCCTGTCCCGGTGCTGTTGCCAGGCAATAAGAAAACGAAGACCGGGCGGTTATGGACCTACGTTCGTGACGACCGTAACGCCGGGTCAACGCTGGCGCCGGCGGTGTGGTTCGCTTACAGCCCGGACAGAAAAGGCATCCATCCGCAGACCCATCTTGCGGGGTTCAGTGGTGTACTGCAGGCGGATGCTGTCACGAACGGTGCAATAGTGATCCACACCCAACGCCTGAAATCAGATCCAGGGGGTAATCTGCTCTCCTGATTCAGGAGAGTTTATGGTCACTTTTGAGACAGTTATGGAAATTAAAATCCTGCACAAGCAGGGAATGAGTAGCCGGGCGATTGCCAGAGAACTGGGGATCTCCCGCAATACCGTTAAACGTTATTTGCAGGCAAAATCTGAGCCGCCAAAATATACGCCGCGACCTGCTGTTGCTTCACTCCTGGATGAATACCGGGATTATATTCGTCAACGCATCGCCGATGCTCATCCTTACAAAATCCCGGCAACGGTAATCGCTCGCGAGATCAGAGACCAGGGATATCGTGGCGGAATGACCATTCTCAGGGGATTCATTCGTTCTCTCTCGGTTCCTCAGGAGCAGGAGCCTGCCGTTCGGTTCGAAACTGAACCCGGACGACAGATGCAGGTTGACTGGGGCACTATGCGTAATGGCCGCTCACCGCTTCACGTGTTCGTTGCTGTTCTCGGATACAGCCGAATGTTGTACATCGAATTCACTGACAATATGCGTTATGACACGCTGGAGACCTGCCATCGTAATGCGTTCCGCTTCTTTGGTGGTGTGCCGCGCGAAGTGTTGTATGACAATATGAAAACTGTGGTTCTGCAACGTGACGCATATCAGACCGGTCAGCACCGGTTCCATCCTTCGCTGTGGCAGTTCGGCAAGGAGATGGGCTTCTCTCCCCGACTGTGTCGCCCCTTCAGGGCACAGACTAAAGGTAAGGTGGAACGGATGGTGCAGTACACCCGTAACAGTTTTTACATTCCACTAATGACTCGCCTGCGCCCGATGGGGATCACTGTCGATGTTGAAACAGCCAACCGCCACGGTCTGCGCTGGCTGCACGATGTCGCTAACCAACGAAAGCATGAAACAATCCAGGCCCGTCCCTGCGATCGCTGGCTCGAAGAGCAGCAGTCCATGCTGGCACTGCCTCCGGAGAAAAAAGAGTATGACGTGCATCCTGGTGAAA
ATCTGGTGAACTTCGACAAACACCCCCTGCATCATCCACTCTCCATCTACGACTCATTCTGCAGAGGAGTGGCGTGA
Protein sequences of DBSCAN-SWA_1 >NZ_CP040885|86975:114977|90453_90570_-|WP_071586949.1|DBSCAN-SWA MKIIGVFAFLAPVNPHQNQFPATLRRGQPQNSLKRSAI >NZ_CP040885|86975:114977|105473_106127_+|WP_000616807.1|protease|DBSCAN-SWA MIQTRNQYLQFMLVMLAAWGISWGARFVMEQAVLLYGSGKNYLFFSHGTVLMYLLCVFLVYRRWIAPLPVVGQLRNVGVPWLVGAMAVVYVGVFLLGKALALPAEPFMTKLFADKSIPDVILTLLTIFILAPLNEETLFRGIMLNVFRSRYCWTMWLGALITSLLFVAAHSQYQNLLTLAELFLVGLITSVARIRSGGLLLPVLLHMEATTLGLLFG >NZ_CP040885|86975:114977|103416_103527_-|WP_072163418.1|DBSCAN-SWA MKVIGVFAFLAPVNPHQNQFPATLRRGQPQNSLNDQ >NZ_CP040885|86975:114977|108776_109481_+|WP_001067855.1|transposase|DBSCAN-SWA MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >NZ_CP040885|86975:114977|89250_89484_+|WP_072142979.1|DBSCAN-SWA MTGWPDLGGGSAVSQFAMMARKFGKQLIVAVSEKDAIKLKDNFDIIGMLSYGKENFIAMSHTKSELTGGKRRYRIKN >NZ_CP040885|86975:114977|109624_110179_+|WP_063840321.1|DBSCAN-SWA MTNSNDSVTLRLMTEHDLAMLYEWLNRSHIVEWWGGEEARPTLADVQEQYLPSVLAQESVTPYIAMLNGEPIGYAQSYVALGSGDGRWEEETDPGVRGIDQLLANASQLGKGLGTKLVRALVELLFNDPEVTKIQTDPSPSNLRAIRCYEKAGFERQGTVTTPYGPAVYMVQTRQAFERTRSDA >NZ_CP040885|86975:114977|106219_106477_+|WP_000557619.1|DBSCAN-SWA MHTTRLKRVGGSVMLTVPPALLNALSLGTDNEVGMVIDNGRLIVEPYRRPQYSLAELLAQCDPNAEISAEEREWLDAPATGQEEI >NZ_CP040885|86975:114977|90160_90418_+|WP_000083833.1|DBSCAN-SWA MSQTENAVTSSSGKKRPYRRGNPVPARERQKASLARRSATHKAFHAVIQLRLKEKLSELADEDGITQAQMLEWLIESEVKRRKSL >NZ_CP040885|86975:114977|102900_103152_-|WP_071529016.1|DBSCAN-SWA MRKSLHFLSATSRLLNTQTGIVSQRILQFFEISDLRIVTKICVGAQINGDYCRLRAAFGKRTILYGCVIALGRYTVFGRPLGT >NZ_CP040885|86975:114977|100866_101220_-|WP_000005489.1|DBSCAN-SWA MSETVQIESFNAEFLTKQKLHDLIFYRKSFDITNVNDISETVKIVEYVIEHQQKKLTCRVYTEYRSTAVAGSLWSPTALLGAVSAAAIGVHNLATWNPDYEIGKNYVKRTISVRYKK >NZ_CP040885|86975:114977|101691_102714_-|WP_000156883.1|transposase|DBSCAN-SWA 
MTITTIGIDLAKNVFAVHGVDQNGKTVLVKSRVTRAALPGLIASLPPCVIGMEACSGAYYWARLFLQYGHEPRLMAAKFVSPYRMAGKSGKNDAADAQAICEAVRRPHMRFVPVKDESQQAMQCLHRNRQGFIEEKTATYNRLRGLISEFGVIAPQSTDALRHMVSAQKSSLPLQVQQCVDDLLEHVDRIEANITEYDRILSRIAKTDHRSQRLMKLKGVGPTTACALVACIGNAHDFKNGRQLAAWLGLTPSQYSSGGKSKLGRITKAGDSYLRTLLVQGARSVLIGAEKRTDLFSRWVCSLVERRGYWRAVVAIAAKNARLCWASLHYGDDFRLYSAS >NZ_CP040885|86975:114977|106478_106811_+|WP_000439434.1|DBSCAN-SWA MERGEIWLVSLDPTAGHEQQGTRPVLIVTPAAFNRVTRLPVVVPVTSGGNFARTAGFAVSLDGVGIRTTGVVRCDQPRTIDMKARGGKRLERVPETIMNEVLGRLSTILT >NZ_CP040885|86975:114977|112500_112875_+|WP_024193849.1|DBSCAN-SWA MISLSPPTICNSAPSGTKIWLVAGITDMRNGFNGLAAKVQTTLKDDPMSGHVFIFRGRNGSQVKLLWSTGDGLCLLTKRLERGRFAWPSARDGKVFLTPAQLAMLLEGIDWRQPKRLLTSLTML >NZ_CP040885|86975:114977|98685_99048_-|WP_059330006.1|DBSCAN-SWA MRMVELASQPGACVAQIARENGVNDNVIFKWLRLWQNEGRISRRIPVTTTSDAGVELLPVEITPDEPKEPMAALTPSLSTQTTVSASSCKVEFRHGNMTLENPSPELLTVLIRELTGRGR >NZ_CP040885|86975:114977|95766_97170_+|WP_000514417.1|DBSCAN-SWA MGKFKFPSAYTILFFLIAIVAVLTWIIPAGQYHMAMNEALGKEVPVAGTYAHVEANPQGLISVLMAPIAGLYDPDSGQARAIDVALFILIIGGFLGIVTKTGAIDAGIERVTTRLRGREEWMIPILMALFAAGGTIYGMAEESLPFYTLLVPVMLAARFDPVVAASTVLLGAGIGTLGSTINPFATVIAANAAGIPFTNGITLRVVVLVIGWIICVTWVMRYARKVRKEPSLSIIADKQEENLAHFLGNKSEQALEFTPVRKIILVIFALTFAVMIYGVAVLGWWMAEISTVFLASAIIIGLIARMSEEELTSTFINGARDLLGVALIIGIARGIVVIMDKGMITHTILHYAEGMVTGLSTVAFINVMYWLEVVLSFLVPSSSGLAVLTMPIMAPLADFANVNRDLVVTAYQSASGIVNLITPTSAVVMGGLAIAHVPYVRYLKWVAPLLGILTVVIMVALSLGALL >NZ_CP040885|86975:114977|103118_103376_+|WP_000083821.1|DBSCAN-SWA MSQTENAVTSSSGAKRAYRKGKPLSEAERQRAASARKRSVCKEIKVFVRPELKNCLTSLCAEEGVTQAELIERLIEKEACHRNMM >NZ_CP040885|86975:114977|89727_89877_+|WP_001312851.1|DBSCAN-SWA MTKYALIGLLAVCATVLCFSLIFRERLCELNIHRGNTVVQVTLAYEARK >NZ_CP040885|86975:114977|90653_90728_+|WP_032336874.1|DBSCAN-SWA MPGKVQDFFLCSLLLRIVSVGWCD >NZ_CP040885|86975:114977|111771_112476_-|WP_001067855.1|transposase|DBSCAN-SWA 
MNPFKGRHFQRDIILWAVRWYCKYGISYRELQEMLAERGVNVDHSTIYRWVQRYAPEMEKRLRWYWRNPSDLCPWHMDETYVKVNGRWAYLYRAVDSRGRTVDFYLSSRRNSKAAYRFLGKILNNVKKWQIPRFINTDKAPAYGRALALLKREGRCPSDVEHRQIKYRNNVIECDHGKLKRIIGATLGFKSMKTAYATIKGIEVMRALRKGQASAFYYGDPLGEMRLVSRVFEM >NZ_CP040885|86975:114977|110309_111140_+|WP_001334766.1|DBSCAN-SWA MKNTIHINFAIFLIIANIIYSSASASTDISTVASPLFEGTEGCFLLYDASTNAEIAQFNKAKCATQMAPDSTFKIALSLMAFDAEIIDQKTIFKWDKTPKGMEIWNSNHTPKTWMQFSVVWVSQEITQKIGLNKIKNYLKDFDYGNQDFSGDKERNNGLTEAWLESSLKISPEEQIQFLRKIINHNLPVKNSAIENTIENMYLQDLDNSTKLYGKTGAGFTANRTLQNGWFEGFIISKSGHKYVFVSALTGNLGSNLTSSIKAKKNAITILNTLNL >NZ_CP040885|86975:114977|99875_100454_+|WP_032152936.1|DBSCAN-SWA MSKLAASPIAALLIRIILRSVNQHSPLLMQLHAAGIRTGDAERILSSGECWQRQKTLLTGREVSFMKGLFRIVDMKRWYLCPQVRVADIVQLNGNIRPRSRQWWQLFRMVSQWHVDVVIVERRSFSIVAAVELDDASHLRPERRRRDILLEEVLRQAGIPLLRSHDARKLLQMTGEWLNTTGADQQSPEHRS >NZ_CP040885|86975:114977|103677_104535_+|WP_032152935.1|DBSCAN-SWA MTDLQQTYYRQVKNPNPVFTPREGAGTLKFCEKLMEKAVGFTSRFDFAIHVAHARSKGLRRRMPPVLRRRAIDALLQGLCFHYDPLANRVQCSITTLAIECGLATESGAGKLSITRATRALTFLSELGLITYQTEYDPLIGCYIPTDITFTSALFAALDVSEEAVAAARRSRVEWENRQRKKQGLDTLGMDELIAKAWRFVRERFRSYQTELKSRGIKRARARRDANRERQDIVTLVKRQLTREISEGRFSASREAVKREVERRVKERMILSRNRNYSRLATASP >NZ_CP040885|86975:114977|86975_88547_+|WP_023149734.1|transposase|DBSCAN-SWA MDTSLAHENARLRALLQTQQDTIRQMAEYNRLLSQRVAAYASEINRLKALVAKLQRMQFGKSSEKLRAKTERQIQEAQERISALQEEMAETLGEQYDPVLPSALRQSSARKPLPASLPRETRVIRPEEECCPACGGELSSLGCDVSEQLELISSAFKVIETQRPKQACCRCDHIVQAPVPSKPIARSYAGAGLLAHVVTGKYADHLPLYRQSEIYRRQGVELSRATLGRWTGAVAELLEPLYDVLRQYVLMPGKVHADDIPVPVQEPGSGKTRTARLWVYVRDDRNAGSQMPPAVWFAYSPDRKGIHPQNYLAGYSGVLQADAYGGYRALYESGRITEAACMAHARRKIHDVHARAPTYITTEALQRIGELYAIEAEVRGCSAEQRLAARKARAAPLMQSLYDWIQQQMKTLSRHSDTAKAFAYLLKQWDALNVYCSNGWVEIDNNIAENALRGVAVGRKNWMFAGSDSGGEHAAVLYSLIGTCRLNNVEPEKWLRYVIEHIQDWPANRVRDLLPWKVDLSSQ >NZ_CP040885|86975:114977|98086_98308_-|WP_000080227.1|DBSCAN-SWA MSQKYLIRIAELESQLRQKDQQLSLVEETEAFLRSALARAEEKIEEDEREIEYLRAQARCRKNPGWAKRSAMY 
>NZ_CP040885|86975:114977|113954_114977_+|WP_000255956.1|transposase|DBSCAN-SWA MVTFETVMEIKILHKQGMSSRAIARELGISRNTVKRYLQAKSEPPKYTPRPAVASLLDEYRDYIRQRIADAHPYKIPATVIAREIRDQGYRGGMTILRGFIRSLSVPQEQEPAVRFETEPGRQMQVDWGTMRNGRSPLHVFVAVLGYSRMLYIEFTDNMRYDTLETCHRNAFRFFGGVPREVLYDNMKTVVLQRDAYQTGQHRFHPSLWQFGKEMGFSPRLCRPFRAQTKGKVERMVQYTRNSFYIPLMTRLRPMGITVDVETANRHGLRWLHDVANQRKHETIQARPCDRWLEEQQSMLALPPEKKEYDVHPGENLVNFDKHPLHHPLSIYDSFCRGVA >NZ_CP040885|86975:114977|98338_98689_-|WP_000624725.1|DBSCAN-SWA MISLPSGTRIWLVAGVTDMRKSFNGLGEQVQHVLNDNPFSGHLFIFRGRRGDTVKILWADADGLCLFTKRLEEGQFIWPVVRDGKVSITRSQLAMLLDKLDWRQPKTSRLNALTML >NZ_CP040885|86975:114977|92487_93708_+|WP_000410951.1|DBSCAN-SWA MEKHYVGSEIGQLRSVMLHRPNLSLKRLTPSNCQELLFDDVLSVERAGEEHDIFANTLRDQGVEVLLLTDLLTQTLDIKEAKTWLLETQISDYRLGPTFAGDVRSWLADMPHRELARRLSGGLTYGEIPAAINNMVVDTHTSNDFIMKPLPNHLFTRDTSCWIYNGVSINPMAKPARQRETNNLRAIYRWHPAFADGDFIKYFGDENIYYDHATLEGGDVLVIGRGAVLIGMSERTTPQGVEFLANSLFKHRQAERVIAVELPKHRSCMHLDTVMTHIDVDTFSVYPEVVRKDAQCWTLTSNGRDGLQRTQETDLLHAIEKALGIDQVRLITTGGDAFEAEREQWNDANNVLTIRPGVVIGYERNVWTNEKYDKAGITVLPIPGDELGRGRGGARCMSCPLERDGI >NZ_CP040885|86975:114977|89429_89672_-|WP_023149666.1|DBSCAN-SWA MQTSRVNGLTSGVFAFLVPASCLNQKGSDTRRDNTTFPVADYYLHFPLFTTETPANKYAFYLYQFLIRYRRFPPVNSLFV >NZ_CP040885|86975:114977|104897_105284_+|WP_071940974.1|DBSCAN-SWA MHLPPQAAGPYRSHFSYNTQTQPPEKPRSSAELKPQSPSLITEKRPRPGPKGRNRVAFNYECCNYIFIAVSLLAGSSQYTLVSGPHGPLTRRYAPTSGKPSSGPLRPRTEALSWLKAGMVWQGWGWVR >NZ_CP040885|86975:114977|97250_97730_+|WP_001496175.1|DBSCAN-SWA MMDYEDFSPKEQLQLTVCQRLIAEKSYFSQEELRRDLQERGFETISQSTVSRLLNLLGVIKIRNAKGLKVYSLNPQLRPAPDAARTVSEMVVSVEHNREFILIHTVAGYGRAVARVLDYHQLPEILGVVAGSSIVWVAPRMVKRVALVHKQINYLLRTY >NZ_CP040885|86975:114977|93718_94630_+|WP_000440183.1|DBSCAN-SWA 
MERKPTLVVALGGNALLKRGEPLEAEIQRQNIELAARTIAGLTVNWRVVLVHGNGPQIGLLALQNSAYDKVTPYPLDVLGAESQGMIGYMLQQALKNSLPQREVSVLLTQVEVDATDPAFSNPTKYIGPVYNEDQAKTLAAEKGWGFKADGSYFRRVVPSPQPKRIVESDAITALIQRDHLVICNGGGGVPVVEKANGYRGIEAVIDKDLSAALLAYQIGADALLILTDADAVYLDWGKPTQRPLAQVTPELLRGMQFDTGSMGPKVAACCKFVEACNGIAGIGALVDGAEILAGNKGTLIRN >NZ_CP040885|86975:114977|103610_103685_+|WP_001365705.1|DBSCAN-SWA MPGKVQDFFLCSLLLRIVSAGWCD >NZ_CP040885|86975:114977|94714_95719_+|WP_000154545.1|DBSCAN-SWA MTINLKKRNFLKLLDYTPAEIQYLIDLAIKLKAAKKAGREKQTLVGKNIALIFEKTSTRTRCAFEVAAFDQGAQVTYLGPGGSQIGHKESMKDTARVLGRMYDGIEYRGYGQAIVEELGKYAGVPVWNGLTDEFHPTQILADLMTMLEHSPGKKLSELSFAYLGDARNNMGNSLMVGAAKMGMDIRLVAPKSFWPDVVLVEQCRSIAEETGARITLTDDVEEGVWGTDFLYTDVWVSMGEPKEAWTERVSLMKPYQINADVMNATGNPNVKFMHCLPAFHNEHTKVGREIEMAYGLKGLEVTEEVFESPNSIVFDEAENRMHTIKAVMVATLGD |
33 | Stx2-converting_phage(42.86%) | transposase,protease | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a specific phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
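
The keyword legend above can be illustrated with a minimal sketch. It assumes the protein headers in this record follow the layout `contig|region|start_end_strand|accession|[annotation]|tool`, with the annotation field optional, and that a protein is flagged by a simple case-insensitive substring match. The actual matching rule used by the tool is not stated in this record, and the names `PHAGE_KEYWORDS`, `annotation_of`, and `phage_keywords` are hypothetical.

```python
# Keywords copied from the legend in this record (lowercased for matching).
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "trna",
)


def annotation_of(header):
    """Return the annotation field of a protein header, or '' if absent.

    Assumed layout (inferred from this record, not documented):
    contig|region|start_end_strand|accession|[annotation]|tool
    """
    fields = header.lstrip(">").strip().split("|")
    # Six fields means the optional annotation is present at index 4.
    return fields[4] if len(fields) == 6 else ""


def phage_keywords(header):
    """List the legend keywords found in the header's annotation field."""
    annotation = annotation_of(header).lower()
    # Assumption: plain substring matching, case-insensitive.
    return [kw for kw in PHAGE_KEYWORDS if kw in annotation]


if __name__ == "__main__":
    examples = [
        ">NZ_CP040885|86975:114977|105473_106127_+|WP_000616807.1|protease|DBSCAN-SWA",
        ">NZ_CP040885|86975:114977|90453_90570_-|WP_071586949.1|DBSCAN-SWA",
    ]
    for h in examples:
        print(h.split("|")[3], phage_keywords(h))
```

Restricting the match to the annotation field avoids accidental hits inside accession numbers or coordinates; a header without an annotation field simply yields an empty list.
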
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
0 : 13836
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP040884|0:13836|DBSCAN-SWA
Protein sequences of DBSCAN-SWA_1 >NZ_CP040884|0:13836|6521_6953_+|WP_001326170.1|DBSCAN-SWA MCQGHGKFVDAKAMLVAAIELAKKQREPVIEVVMGLVLQGVDVRGKEPGERFKAAQAFNAPGHNLLRYGWDAWELYALHEGVSPELASLGRSVMREWHSHSWGRFSGEVGFNAAEIMIKQAKDNPEKAEERWAFLLSEEWMVE >NZ_CP040884|0:13836|6937_7270_+|WP_000348669.1|DBSCAN-SWA MDGGVIVVTEQTNKKLMLDESPDMPLQFKWRKFGGAIFTATQTMEHVKEGRKLGPTPAGMLPQEVWLNRLIAMEETLPRGKFFDRFRRRDKNEQYDLAADHLRLVGQKKR >NZ_CP040884|0:13836|1612_2398_+|WP_001151305.1|DBSCAN-SWA MAKVISFANQKGGVGKSTLCIQQAFYLALQKKKKVLVLDMDGQGNTSSRLAPRRELEDGDYEPILTGTKTAELFAYELDGIEVMHCPCGADLIHTPKNDPDLFEMEAVPLDQAMNPARHLAELFENYDYVLIDCPPSLGRKLVAALVMSTHVACPVKLSGFAVDGVEGLLNTIIGVREAYNQNLEILGIVINDMDRSVNHDKALKSLENTVPDLLFENKIMHRPPLDTATTDGIPVWELRYGHVAAKEVEAVLEELLEKVG >NZ_CP040884|0:13836|11643_13836_-|WP_000366823.1|DBSCAN-SWA MDLYICEKPSQAKDLAGVMKASQRGDGFLHDGGNRVITWAFGHLLELYMPDDYDERYKSWSLETLPIAPESWRYNVRKSAFKQYKIVEGLVKKASTIYISTDYDREGEAIARSLLDRFRYSGPIRRVCLTALDESSIKKALNNVKDGKDTVSLYYAALARQRADWLVGMNVSRLYTVLARDVGFNHTLHVGRVITPTVALVCQRDREIAGFTPSPYWTLGVNVSVQNGQFAAQWIPPEECSDEQGRCVNKAYAEQVASQVNGANAVISKAETKPGKESAPLPFDLTSLQQYASKRWGYTAQQVLDAAQALYETHKATTYPRTDSRYLPESQKEDIPDILQALILSDQNVSGLVAGADPHRKARVFNDAKVTAHHAIIPTPARTDISAMSEIEFNLYDAIRRFYIAQFYSEFEFTKTSIEVQCGRHLFASAGKTPTKQGWKVLFASDSESSPKDEGEDTDAPVEQEKLPRVSQGEPALLNGAELANKMTRPAPHFTEATLLAAMENIARFVTEEKFKQILKDTAGLGTPATRASIIQGAVDKGYFKRQKKVLLATDKAHALIAVLPPAIKSPGMTAAWEQELEKVASGSGNMSVFMKQISTWICQMVEQLKVAAPVLTKEGGAMAKAFEGAKPPSHECFNCGGEMHRIKGKNGFFWGCQNEACKKTFPDNRGKPEKRIAAEDCPDCPDCGSPMRLRKGKAPGKKRASKFWGCTAYPDCKGTMPFKKSDFMD >NZ_CP040884|0:13836|5779_6454_+|WP_000344149.1|DBSCAN-SWA MDENKLNIETVDGHNELVVSFLSRMVSLSDEEKQTVLSCLPDTGKQTITQLYEALRSQGHQDLAEKVEPYLQQGVFGPIFDNAKSKVFVRDEAPFFLMDENPLNWDDAKAFNRLRMSTTCVLGRGGWTIGERFDDRFDTEVGGTQLIVTQSLNEKGEIEGGLPTSMSLNDFAEFPKQPRPPQIVDYQEDKRYTLEEAEAIPELAPVVQRLKERIEEYEERRAHD >NZ_CP040884|0:13836|9209_9866_+|WP_000268552.1|DBSCAN-SWA 
MAVIYYGEGTHDAGFVGFRVARTVGVADDYRQEYFSLREYSYATAHRLAYSLDRKWEAEAEEVKRQNKTCKRRRNSGPNIIAEGLRAYISIENRSRMGVKRTYFAPCFLVTKPGYGNGDIVFRISTHGYAEAYEKAVEKYCEIHDLTDEQYVELLDRMPSTEVFTGYLLNALLIRGHRATKAEILSKLGAAKNEDDITNSKGKSGHNRVRCPEYRWAQ >NZ_CP040884|0:13836|7278_7779_+|WP_000647188.1|DBSCAN-SWA MKAQAYPPSVIRKGAVLYAALYYISDDDKAKVEVTEWIVRSIQKRRNSTSDQRYVNLAQKLDGITWGKRSRKNGDFGWLPSIPSWCLKQFREGGELPFGVYTTRLAALKFAKVSLQEEVQYCEAELKKAQTEEDTQELQEELAENQRLLKAAGAMVKREQNKKKRG >NZ_CP040884|0:13836|9921_10539_+|WP_000464630.1|DBSCAN-SWA MFFDNKVESHSLVMGASGKGKSVLSEQVRKNARLRGDLLVDTEMYREGRGLKPYEHEYARRLVLGLSGPLPRELRGKPVTVISDVSRPKKVKRQPKQFVKTVNGVTLERQLVADARDQLEMQTGVWLKQPQLIELMEESGIDETLADFGEAETQIREMLADALAMKLVGRSWPKCGALYNAAEKSDVNFSSELDAAAKEAGYMVR >NZ_CP040884|0:13836|2401_3583_+|WP_001207227.1|DBSCAN-SWA MALNNLKGLSELAKAAKGKKGKEVLTVPVDDVVSKVQVRKRFRNIEELAATLLTEGQQSPIIVFPKNEEGKFVIQKGERRWRACKHAGIETIDLVVNDKVQNNLDETAGELIENIQRDDLTPVEIAEALNLFIEEGWKQKDIADRLGKNITFVSTHLSLLKLPDCVRELYDNEVCSDTETLNNLRLLFDLNEERCRAVCAVAMSDGITRKQSRELLNDAKRIKDEMEKGPLTGSHQNDELGAGNTDEQSLNSGGDGTSEQTGNDDLNLAQEELEGGKNSNGQDDDDEDPLRDEEGEHKDPVKQPDNSGKDKDEEGGDALPPLPKDKEWKNVRADSLIFAVNVNLDGETKRGVIMTDRVALVPSTVWVKTLDGEGKEKHVHVPVSDIELLSVEG >NZ_CP040884|0:13836|1127_1439_+|WP_000380893.1|DBSCAN-SWA MDTQELNHMIAEAYSRDLQKPELVSFKEVSRWGRKYGFPVVCTLADESEEKQIHWAASLLIQVAGTWPREDMPELLTPERGSALFNDAMQLLANGLGAANQLR >NZ_CP040884|0:13836|11140_11629_-|WP_004201072.1|DBSCAN-SWA MFGKIILTAMLTTSSATAEDAGKNIASGLATASTRQIGQAIMPTLAIGSAIAKSAGKGMVLGFAETSASYISHDDLSATRSYQNPEAVDMAKGLGTLKDVPDFLYVITDVNANMADKCKRVWEPQSLALSQLIVELVALRKTNHKGSYEQALSHLDCSIFTN >NZ_CP040884|0:13836|3956_4592_-|WP_000074431.1|DBSCAN-SWA MSPETQQLTSKALSLIEQSRYRMGTSRFVEAFIDQWAYLQTGLYPAKEEIPEELQPVAFELSHVLSAAIKRDPTSDVLGYVLSMSGFHKKGTNYFPTPPEIGRLMSLIVGSQSSADFYEPCCGSGINAIHWMENLIENHGPEALREASIYLEDIDPLMVKCCMIQLFHYFESRNTTPKTLSIVGIDTLSRRTKNIAYYAEKPPATAATVAA >NZ_CP040884|0:13836|3631_3904_+|WP_000703827.1|DBSCAN-SWA MKISQDMKRKFALVNALSKTEKPSLQDLHKATNIPESTIKRQLSALRDEFGMNILFVRESTGERGATGYYMLTDWGILDRSSFLNRYGKL 
>NZ_CP040884|0:13836|10750_11050_+|WP_001326171.1|DBSCAN-SWA MSYQVVIMKKRILHLPVKKIYFDQIKSGEKPDEYRLVTDYWIKRLEGREYDEVHVKCGYPKAGDMSRIEIRPWRGFSRNVITHPHFGDYPVEVFAIHVN >NZ_CP040884|0:13836|5523_5805_+|WP_000044823.1|DBSCAN-SWA MSKRYAVVPHPKLKREYKGRLVRTTRVLKNGWGLIPLGAVATVTHQSPKGSELTFEPCDCCGLKAIISHVSMDSIEFIEPITEEEDGREQAQH >NZ_CP040884|0:13836|696_1146_+|WP_001053910.1|DBSCAN-SWA MNLSIIARVLRQLAIIFVLSVLLVAGYIYYAGKQHQQAAINFWGEQYQPDAISTQIDWGFIGNWVIPRGGPIISPGIAGVCPNTPLPVVPLKTGPDGRGYVLCGIGSEAVATSFDVNDIQDEEIRNTLKTMFEEEFEKTVKGDKWTLKN >NZ_CP040884|0:13836|7782_9210_+|WP_000936897.1|DBSCAN-SWA MLPIVSPSVVTKQLAFNRVGDKRKVRVSSNFLDVMGFKPGMGIAVEPGEGMGGFSVIPATDELQTHQVYQRRYQPKSRSNNPLETVIEFSGQGLIDKCFPRYTERFHVEMRKGRVVFTPVANRAFAIADRFRKTSPFRAFVALTGGVDIHVMESLGWKAEIVLEHRPVEARDRASGRNLSEVHALNTLVNSSPRILLNEDIHHLELDRLGALLAECPPIGLAHYSLGCDDHSNAKSPRDKERSLEDLSTMLDMVYPALKQIEVVNPAVVLVENVPNFKASGAGAMMGTTLRRMGYFLTEMVLNGLDFGAYQGRERYYMVASVFPGFVPPKPEQRAGGRLWPVIEKHLGDCADVTALKSIQARESTSRRMPAFLTRESTSCPTILKSQDRGVKDAVYIQDGGRIYKPSVDLVQELMSIPDSFDISWMAKEQATETLGQSVDYRLHSAVMAAVRDHLNVNCGRHTVVQHGIRSKEGK >NZ_CP040884|0:13836|10539_10746_+|WP_000505706.1|DBSCAN-SWA MGFGVDKIDRQSWLVKFRRAKCQDTLDTMRDAAIRNYEGNIRVIADIVLAHEARETEIEKGMFCLIVR >NZ_CP040884|0:13836|5153_5531_+|WP_001125904.1|DBSCAN-SWA MPKQANHLRLKKPCANCPFRKEGAIELAPGRLEGIINDIVENDMTTFHCHKTVHSKSGGEWDEEGNYAPSGQESMCAGAAAYLMKIGRPTVAMRIAFAFGDAKVSDWDEAQELVVEPLVQGDRNE |
19 | Lactococcus_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a specific phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_2 |
27575 : 30934
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP040884|27575:30934|DBSCAN-SWA TATGATTAATAAAATTGATTTCAAAGCTAAGAATCTAACATCAAATGCAGGTCTTTTTCTGCTCCTTGAGAATGCAAAAAGCAATGGGATTTTTGATTTTATTGAAAATGACCTCGTATTTGATAATGACTCAACAAATAAAATCAAGATGAATCATATAAAGACCATGCTCTGCGGTCACTTCATTGGCATTGATAAGTTAGAACGTCTAAAGCTACTTCAAAATGATCCCCTCGTCAACGAGTTTGATATTTCCGTAAAAGAACCTGAAACAGTGTCACGGTTTCTAGGAAACTTCAACTTCAAGACAACCCAAATGTTTAGAGACATTAATTTTAAAGTCTTTAAAAAACTGCTCACTAAAAGTAAATTGACATCCATTACGATTGATATTGATAGTAGTGTAATTAACGTAGAAGGTCATCAAGAAGGTGCGTCAAAAGGATATAATCCTAAGAAACTGGGAAACCGATGCTACAATATCCAATTTGCATTTTGCGACGAATTAAAAGCATATGTTACCGGATTTGTAAGAAGTGGCAATACTTACACTGCAAACGGTGCTGCGGAAATGATCAAAGAAATTGTTGCTAACATCAAATCAGACGATTTAGAAATTTTATTTCGAATGGATAGTGGCTACTTTGATGAAAAAATTATCGAAACGATAGAATCTCTTGGATGCAAATATTTAATTAAAGCCAAAAGTTATTCTACACTCACCTCACAAGCAACGAATTCATCAATTGTATTCGTTAAAGGAGAAGAAGGTAGAGAAACTACAGAACTGTATACAAAATTAGTTAAATGGGAAAAAGACAGAAGATTTGTCGTATCTCGCGTACTGAAACCAGAAAAAGAAAGAGCACAATTATCACTTTTAGAAGGTTCCGAATACGACTACTTTTTCTTTGTAACAAATACTACCTTGCTTTCTGAAAAAGTAGTTATATACTATGAAAAGCGTGGTAATGCTGAAAACTATATCAAAGAAGCCAAATACGACATGGCGGTGGGTCATCTCTTGCTAAAGTCATTTTGGGCGAATGAAGCCGTGTTTCAAATGATGATGCTTTCATATAACCTATTTTTGTTGTTCAAGTTTGATTCCTTGGACTCTTCAGAATACAGACAGCAAATAAAGACCTTTCGTTTGAAGTATGTATTTCTTGCAGCAAAAATAATCAAAACCGCAAGATATGTAATCATGAAGTTGTCGGAAAACTATCCGTACAAGGGAGTGTATGAAAAATGTCTGGTATAATAAGAATATCATCAATAAAATTGAGTGTTGCTCTGTGGATAACTTGCAGAGTTTATTAAGTATCATTGCAGCAAAGATGAAATCAATGATTTATCAAAAATGATTGAAAGGTGGTTGTAAATAATGTTACAATGTGTGAGAAGCAGTCTAAATTCTTCGTGAAATAGTGATTTTTGAAGCTAATAAAAAACACACGTGGAATTTAGGAAAAACTTATATCTGCTGCTAAATTTAACCGTTTGTCAACACGGTGCAAATCAAACACACTGATTGCGTCTGACGGGCCCGGACACCTTTTTGCTTTTAATTACGGAACTGATTTCATGATGAAAAAATCGTTATGCTGCGCTCTGCTGCTGACAGCCTCTTTCTCCACATTTGCTGCCGCAAAAACAGAACAACAGATTGCCGATATCGTTAATCGCACCATCACCCCGTTGATGCAGGAGCAGGCTATTCCGGGTATGGCCGTTGCCGTTATCTACCAGGGAAAACCCTATTATTTCACCTGGGGTAAAGCCGATATCGCCAATAACCACCCAGTCACGCAGCAAACGCTGTTTGAGCTAGGATCGGTTAGTAAGACGTTTAACGGCGTGTTGGGCGGCGATGCTATCGCCCGCGGCGAAATTAAGCTCAGCGATCCGGTCACGAAATACTGGC
CAGAACTGACAGGCAAACAGTGGCAGGGTATCCGCCTGCTGCACTTAGCCACCTATACGGCAGGCGGCCTACCGCTGCAGATCCCCGATGACGTTAGGGATAAAGCCGCATTACTGCATTTTTATCAAAACTGGCAGCCGCAATGGACTCCGGGCGCTAAGCGACTTTACGCTAACTCCAGCATTGGTCTGTTTGGCGCGCTGGCGGTGAAACCCTCAGGAATGAGTTACGAAGAGGCAATGACCAGACGCGTCCTGCAACCATTAAAACTGGCGCATACCTGGATTACGGTTCCGCAGAACGAACAAAAAGATTATGCCTTGGGCTATCGCGAAGGGAAGCCCGTACACGTTTCTCCGGGACAACTTGACGCCGAAGCCTATGGCGTGAAATCCAGCGTTATTGATATGGCCCGCTGGGTTCAGGCCAACATGGATGCCAGCCACGTTCAGGAGAAAACGCTCCAGCAGGGCATTGCGCTTGCGCAGTCTCGCTACTGGCGTATTGGCGATATGTACCAGGGATTAGGCTGGGAGATGCTGAACTGGCCGCTGAAAGCTGATTCGATCATCAACGGCAGCGACAGCAAAGTGGCATTGGCAGCGCTTCCCGCCGTTGAGGTAAACCCGCCCGCCCCCGCAGTGAAAGCCTCATGGGTGCATAAAACGGGCTCCACTGGTGGATTTGGCAGCTACGTAGCCTTCGTTCCAGAAAAAAACCTTGGCATCGTGATGCTGGCAAACAAAAGCTATCCTAACCCTGTCCGTGTCGAGGCGGCCTGGCGCATTCTTGAAAAGCTGCAATAACTGACGATGAGGCCCAGGATATTGGGCCTCCTTTCTTTCTCTTTTTTTCCTGTTGTCATTTACACTTAACAAAAATACAGCAAGGAAAATCCCATGCGCATTTTGCCCGTCGTTGCTGCAGTTACGGCTGCATTCCTGGTTGTCGCGTGTAGCTCCCCGACACCGCCGAAAGGCGTTACCGTGGTAAATAACTTTGATGCCAAACGCTATCTGGGAACCTGGTATGAAATTGCGCGCTTCGACCATCGTTTCGAGCGCGGATTGGATAAAGTGACCGCAACATACAGCTTGCGCGACGACGGCGGCATCAACGTTATTAACAAGGGCTATAACCCTGACAGGGAGATGTGGCAGAAAACGGAAGGGAAAGCCTATTTCACCGGCGACCCAAGCAGAGCCGCGCTTAAGGTTTCTTTTTTCGGCCCCTTCTATGGCGGGTATAACGTAATTGCACTCGACCGGGAATATCGTCACGCGCTGGTTTGTGGTCCGGATCGCGACTACCTGTGGATCCTTTCACGGACCCCTACTATTTCAGATGAAATGAAACAGCAAATGTTAGCCATCGCGACCCGGGAAGGGTTTGAAGTGAATAAACTGATTTGGGTGAAACAGCCTGGCGCTTAG
Protein sequences of DBSCAN-SWA_2 >NZ_CP040884|27575:30934|27575_28838_+|WP_000608644.1|transposase|DBSCAN-SWA MINKIDFKAKNLTSNAGLFLLLENAKSNGIFDFIENDLVFDNDSTNKIKMNHIKTMLCGHFIGIDKLERLKLLQNDPLVNEFDISVKEPETVSRFLGNFNFKTTQMFRDINFKVFKKLLTKSKLTSITIDIDSSVINVEGHQEGASKGYNPKKLGNRCYNIQFAFCDELKAYVTGFVRSGNTYTANGAAEMIKEIVANIKSDDLEILFRMDSGYFDEKIIETIESLGCKYLIKAKSYSTLTSQATNSSIVFVKGEEGRETTELYTKLVKWEKDRRFVVSRVLKPEKERAQLSLLEGSEYDYFFFVTNTTLLSEKVVIYYEKRGNAENYIKEAKYDMAVGHLLLKSFWANEAVFQMMMLSYNLFLLFKFDSLDSSEYRQQIKTFRLKYVFLAAKIIKTARYVIMKLSENYPYKGVYEKCLV >NZ_CP040884|27575:30934|30400_30934_+|WP_001221666.1|DBSCAN-SWA MRILPVVAAVTAAFLVVACSSPTPPKGVTVVNNFDAKRYLGTWYEIARFDHRFERGLDKVTATYSLRDDGGINVINKGYNPDREMWQKTEGKAYFTGDPSRAALKVSFFGPFYGGYNVIALDREYRHALVCGPDRDYLWILSRTPTISDEMKQQMLAIATREGFEVNKLIWVKQPGA >NZ_CP040884|27575:30934|29161_30307_+|WP_015058212.1|DBSCAN-SWA MMKKSLCCALLLTASFSTFAAAKTEQQIADIVNRTITPLMQEQAIPGMAVAVIYQGKPYYFTWGKADIANNHPVTQQTLFELGSVSKTFNGVLGGDAIARGEIKLSDPVTKYWPELTGKQWQGIRLLHLATYTAGGLPLQIPDDVRDKAALLHFYQNWQPQWTPGAKRLYANSSIGLFGALAVKPSGMSYEEAMTRRVLQPLKLAHTWITVPQNEQKDYALGYREGKPVHVSPGQLDAEAYGVKSSVIDMARWVQANMDASHVQEKTLQQGIALAQSRYWRIGDMYQGLGWEMLNWPLKADSIINGSDSKVALAALPAVEVNPPAPAVKASWVHKTGSTGGFGSYVAFVPEKNLGIVMLANKSYPNPVRVEAAWRILEKLQ |
3 | Salmonella_phage(50.0%) | transposase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a specific phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_3 |
50418 : 55054
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP040884|50418:55054|DBSCAN-SWA TATGTCTCAATACTCTCAATTCTCAGTAAGCAAGGTTTTCGGTATGCCCAGCATTCCGGAGAAGGTAACGGCCATCGGCTACGCTGACGGTTCAAACCCTTTCATCCCTGCCACTGATACCAACTACGTTTTCCGCAAAGAGTTTCTGCGAGAGGTCTTGGCCTATCTCAAAGAGCCTGGCGGTGACGCATTGTTCGTAACCGGCCCTACCGGGTCTGGCAAAACCTCAGGTATCACCGAGATCGCTGGTCGTCTCAACTGGCCTGTTCAGCAAATCACTGCCCACGGGCGGATGGAGCTGACAGATCTGATTGGACATCACGCTCTGGTCGCGGAAAAGCCTGGTCAACCACCTGTCATGAAGTTCATGTATGGACCTCTGGCAGTCGCTATGCGTGAAGGTCATCTGCTCCTCATCAACGAGGTGGATTTGGCCGACCCTGCCGAGCTGGCTGGTCTCAACGATGTCCTTGAAGGGCGTCCCCTTGTGATCGCTCAGAATGGCGGGGAAATCATCAAGCCGCACCCGATGTTTCGTGTGGTCGTTACTGGTAACTCTACGGGGTCCGGTGATGCTTCTGGTTTGTACCAGGGGGTGATGATGCAGAACCTCGCAGCTATGGATCGCTATCGTTTCACCAAAGTAGGGTATGCGGATGAGGAAGCTGAACTCAGCATTCTTGGCCGTGCTACTCCGAAACTTCCGGAAAATGTGCGGAAGGGAATGGTTCGGATTGCTAATCAGGTCCGCAAACTGTTTCTTGGCGAAAACGGTGAAGATGGTCAGATCAGCGTCACCATGTCCACTCGGACGTTGGTGCGTTGGGCGAAACTGTCTCTAGCGTTCCGTGGTGCCCCAAATGCTCTTGAATACGCCTTGGATCAAGCTCTGCTTATCCGTGCGGCCAAAGAGGAGCGTGAAGCCATCCTGCGTGTTGCGAAGGATGTGTTCGGGGACCAATGGCGCTAAGAGGTGACGCATGAAGAAAAATGACTGTTTGTGCCGTCGTTACACTGCTAAAGAGTGGGGCAATGACGAAACCACAATAGAAGTCTTCATTGGCTACAAGTTGCTCCGGGAGCCCAGCTCCTCGGAGCCAGGCCAATTCACGATGGTCGAGCTACGCCGAACCGTCACTGATGGGAAAGCTGAAAATTGGTCTGAGACAAAGCTCGAAGGACCTTTTGAAGCTAACGGCCCAGACACGATCCCAATGTCCTACAAGGACAAAGAAAGCCAGTATGTGTCACAGTTTCTCAGCCAGGGGTACACCTTTCTGGATGAGGTGCTGGTAAACGCAGAGACACAGACGGTGCTGGAAGGGGGGAATGTTTCAGCCGGACAAACAGCCAGCTTGGGATCTCTCAACTGGCTGTTATCTCCGCCCTCTGAGTTACCTCCAGGTGACATAAACCTCTTCAAAGGTTTTGTTGCTGGGGTTTTTGCCAAAGGAGCCGGTTTAATCGGCTTTGAGGTTGCTCGGAGCGAAGGCTCAAATGACTTGCTACCCAGTGTGCTGATGCGTACTGACAGCGGTTATGAGCTTGGGGTTAGCACTGGACTAGGCGAAAACACTATCCATCCAGCTACGCTGGAAGGCGCTGGTGAACTCCGCCCGGAACATGGTCACAAACCCCTGCTGATGTTGGTTTACCTGCAACAGCGTTTCGCGGATGACTTTTCAAACGTAGAGAAGCCGCTGGTGGCATTCTGTGATGAACAGGGTGATACCTTCGACTACGAGCGTTTTGATTCACTAAAACCCCTCATTGAACGGTTTGGTTTCAGCTACGACGAAGTAAGAGCAGATGCAGAAAGGCTTGGCCTTGTATCTGAGCTGATTCGCCTGGCAGAGATCGACGCCGAACAAGAGGATCACTTTTTTTAACCCTTACGGGGGCTTGCTCCCCGAAAGGGGCGTTGGCC
CTCTCCAAATAACTTGGAGGTCATGATGTCTAAAGGCGTCAACAAAGTAATTCTGGTCGGTAATCTCGGTTCTGACCCGGAAATTCGCTACATGCCAAGCGGCACTGCCGTTGCCAACTTCAACGTTGCAACAACGGATACGTGGCGCGATAAGCAGTCTGGCGAGCAAAGAGAGCATACTGAGTGGCACCGTATTGTGCTTAAAGGTCGTTTGGCGGAAGTCGCTGGTGAGTACCTGAAAAAGGGCTCCCAAGTCTATCTCGAAGGGAGCAACCGCACCCGGAAGTGGACTGACAGCCAACAAATCGAGCGCTACACCACCGAAGTACACTGCGTTGAAATGCAGATGCTTGGTGGTCGTGGAAATGCACCTCAGGACAACTCTCAACGTGCAGCGCCCCAAAAAGGGCAACGTACAGGAGCCGGTACGCAATCTGCTCCTGTGCAGCAATCAGCACCGCAAGGTGGTATGGGCGGAGGCTATGGTCCCGCTCCTGATGGCTGGGATGATGACATCCCGTTCATGCGGCTGCACCACTTGGCTGGCGGGTAACACCGCCAACTCTTACTAAACCCCACGGGGGCACTCATGCCCCGAGGGGGTCATGGTGTCCTCAAACCGTTCATTCGTGAATTGGAGAAACACCATGTCTGATAACAAATCTCTTGTAACCCGTATCGCAAGCCGGTTTGGCGTGGACACCCGAAAGTTCTATGAAACTTTGAAGGCGACCGCATTCAAGCAGCGAGATGGAAGTGCCCCGACCGATGAGCAGATGATGACGCTCCTGATCGTGGCTGAACAGTACGGTTTGAACCCTTTCACTCGGGAAATCTATGCGTTTCCTGACAAGCAAAATGGGATCATTCCGGTAGTAGGTGTTGATGGTTGGAGCCGCATCATCAACGAGCATCCCCAGTATGATGGCGTCGAGTTCGTGTATTCGGACAAGATGGTCAGAATGCAGGGGGCGAAAGTTGACTGCCCTGAGTGGATTGAATGCGTGATTTACCGTAAGGACAGATCTCGCCCTATCCGCATCAAGGAGTTCATTGATGAGGTGTACCGTGAACCGTTTCAGGGTCAAGGTCGCAATGGTGCTTACACTGTTGATGGCCCCTGGCAAACGCACACCAAGCGTCAACTCCGGCACAAGTCGCTGATCCAGTGTTCTCGTGTCGCATTTGGTTTCTCTGGTATTTATGACCAGGATGAAGCTGAACGCATCCGTGAAATGGAGCAGGCATCGGCCATTAACCCGGCTATTGCCAATCTCCCTTCACCATCTCAAGTTCAAAGCCAAGAGCCTTTGGCTATTGAGCACAAAGAGCTTGACCCGATCCTAACCAAACTCGCAAATCGCGCCATTGCTGAAAACGCATGGTCTGCGGCGCATGAGTATGTGAAGGGACGGTATGAAGGTTCGGAACTGCAATATGCGACTCAATTCCTTCGTGACAAGGAGATGGATCAAATGGAGCCTCCGAAACCTGACTACCAGGAAGCGCACGAGCAAGAGTCCGCCGCTGGTGGTTCTGCAAATGCTGAACCTGGTGCCGAAGAAATGCCGCCTTTGAGTGACGAGGACATGATCCCTGTTACAGAAGAGGAGGGCGCGGAAGGCAGTTACTACTAACCCCAACGGGGGGGACTCCCCCGCTGGGGGAGATCTCCCTCCTAACCATTGGAGAGAGGTCCATGAAAATAGTCAACCTATCGCAACGCGAGGAAGATTGGCTTGATTGGCGGCGTCAAGGTGTAACAGCCACTGACGCCGCTATCCTGCTCAATCGGTCTCCGTACAAAACACGATGGAGACTGTGGGCCGAGAAGACTGGGTATGCGCGTGAAGTCGATCTGAGTCTTAATCCGCTGGTTCGCCGGGGGATAGAAAACGAAGATGCTGCAAGACGCGCTTTCGAGGAGAAGTATGATGACATGCTGCTCCCCGCCTGTGTCGAATCGGTTCAATACCCGCTCATGAGGGCCTCCCTGGATG
GCCTGAGAGATAACGGGGAGCCCGTCGAGCTGAAAAGCCCGAGTGCGACTGTCTGGGAAGATGTTTGTGCTGAGAAAGCAAACAGCAAGGCATACCAGCTTTATTACCCGCAGGTGCAACACCAGCTCCTGGTAACGGGGGCCAAGCAAGGCTGGTTAGTCTTCTACTTTGAAGGTCAGATTCAGGAGTTTCCAATACTCCGAGACGAAGCCATGATTCAAGAAATCTTGGCCGAGGCTAAAAAGTTCTGGCAACAGGTAGTAGACAAGAAGGAGCCCGACAAAGATCCAGAGAGAGACCTGTACATACCGCAAGGTGAAGAGGTCAACCGTTGGATTGCTGCTGCTGAGGAATACCGCCTCTATGATGCAGAGATTCAGGAGCTGAAACAGCGACTGTCTGAGCTTCAAGAAAGGCAAAAGCCTCATCTCGACACCATGAAGTCCCTCATGGGGGAATACTTCCATGCCGACTACTGCGGTGTGATGGTAACGAGATACAAAGCGGCTGGCCGGGTAGACTACAAAAAGCTGTTGGCTGATAAGGCGTCAGGCGTGAAGCCTGAGGATGTTGACCAGTACAGAGAGAAGTCATCAGAGCGGTGCCGTGTAACGGTTACTGGCTCTGTGAAGCCACGGTACATTGTTGATGAGGACGTGCTTGCTCCTCTTGATGATTTGCCGGAAGAAGTAGAGACGTTCTACTGGTGA
Protein sequences of DBSCAN-SWA_3 >NZ_CP040884|50418:55054|50418_51387_+|WP_000085160.1|DBSCAN-SWA MSQYSQFSVSKVFGMPSIPEKVTAIGYADGSNPFIPATDTNYVFRKEFLREVLAYLKEPGGDALFVTGPTGSGKTSGITEIAGRLNWPVQQITAHGRMELTDLIGHHALVAEKPGQPPVMKFMYGPLAVAMREGHLLLINEVDLADPAELAGLNDVLEGRPLVIAQNGGEIIKPHPMFRVVVTGNSTGSGDASGLYQGVMMQNLAAMDRYRFTKVGYADEEAELSILGRATPKLPENVRKGMVRIANQVRKLFLGENGEDGQISVTMSTRTLVRWAKLSLAFRGAPNALEYALDQALLIRAAKEEREAILRVAKDVFGDQWR >NZ_CP040884|50418:55054|54043_55054_+|WP_000706865.1|DBSCAN-SWA MKIVNLSQREEDWLDWRRQGVTATDAAILLNRSPYKTRWRLWAEKTGYAREVDLSLNPLVRRGIENEDAARRAFEEKYDDMLLPACVESVQYPLMRASLDGLRDNGEPVELKSPSATVWEDVCAEKANSKAYQLYYPQVQHQLLVTGAKQGWLVFYFEGQIQEFPILRDEAMIQEILAEAKKFWQQVVDKKEPDKDPERDLYIPQGEEVNRWIAAAEEYRLYDAEIQELKQRLSELQERQKPHLDTMKSLMGEYFHADYCGVMVTRYKAAGRVDYKKLLADKASGVKPEDVDQYREKSSERCRVTVTGSVKPRYIVDEDVLAPLDDLPEEVETFYW >NZ_CP040884|50418:55054|52991_53981_+|WP_001282585.1|DBSCAN-SWA MSDNKSLVTRIASRFGVDTRKFYETLKATAFKQRDGSAPTDEQMMTLLIVAEQYGLNPFTREIYAFPDKQNGIIPVVGVDGWSRIINEHPQYDGVEFVYSDKMVRMQGAKVDCPEWIECVIYRKDRSRPIRIKEFIDEVYREPFQGQGRNGAYTVDGPWQTHTKRQLRHKSLIQCSRVAFGFSGIYDQDEAERIREMEQASAINPAIANLPSPSQVQSQEPLAIEHKELDPILTKLANRAIAENAWSAAHEYVKGRYEGSELQYATQFLRDKEMDQMEPPKPDYQEAHEQESAAGGSANAEPGAEEMPPLSDEDMIPVTEEEGAEGSYY >NZ_CP040884|50418:55054|51397_52306_+|WP_000739139.1|DBSCAN-SWA MKKNDCLCRRYTAKEWGNDETTIEVFIGYKLLREPSSSEPGQFTMVELRRTVTDGKAENWSETKLEGPFEANGPDTIPMSYKDKESQYVSQFLSQGYTFLDEVLVNAETQTVLEGGNVSAGQTASLGSLNWLLSPPSELPPGDINLFKGFVAGVFAKGAGLIGFEVARSEGSNDLLPSVLMRTDSGYELGVSTGLGENTIHPATLEGAGELRPEHGHKPLLMLVYLQQRFADDFSNVEKPLVAFCDEQGDTFDYERFDSLKPLIERFGFSYDEVRADAERLGLVSELIRLAEIDAEQEDHFF >NZ_CP040884|50418:55054|52366_52897_+|WP_000987165.1|DBSCAN-SWA MMSKGVNKVILVGNLGSDPEIRYMPSGTAVANFNVATTDTWRDKQSGEQREHTEWHRIVLKGRLAEVAGEYLKKGSQVYLEGSNRTRKWTDSQQIERYTTEVHCVEMQMLGGRGNAPQDNSQRAAPQKGQRTGAGTQSAPVQQSAPQGGMGGGYGPAPDGWDDDIPFMRLHHLAGG |
5 | Rhizobium_phage(25.0%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
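The keyword rule in the legend can be illustrated with a minimal sketch. It assumes a plain case-insensitive substring match against a protein's annotation text; the actual matching rule used to color the figure is not stated on this page, and the names `PHAGE_KEYWORDS` and `phage_keywords_in` are hypothetical.

```python
# Keyword list copied from the figure legend.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keywords_in(annotation):
    """Return the legend keywords found in an annotation string.

    Assumption: matching is a case-insensitive substring test. The page
    does not say how the real pipeline matches keywords.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


if __name__ == "__main__":
    for annotation in ("integrase", "transposase", "hypothetical protein"):
        hits = phage_keywords_in(annotation)
        print(annotation, "->", hits if hits else "no keyword (uncolored)")
```

Under this assumption, the proteins annotated `integrase` or `transposase` in the prophage tables would be flagged, while proteins without a keyword annotation would stay uncolored. A substring test also matches longer words that merely contain a keyword, so a stricter whole-word match may be closer to the intended behavior.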
DBSCAN-SWA_4 |
61642 : 74053
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP040884|61642:74053|DBSCAN-SWA AATGATTAAACAGCATTTTCAAAACGAACTGGTTAAGTGTGGATACCCGGATGATCTGACGATTGAGTACAGTCTCGGATACTGCCAGGGGGATGGCGTAGCCTTCTACGGGGATTTGAGCGTTGATGACGTCAAAGCTCTCATGAATCGCCTATTCAGCACTGAGCCCGGCCAAGTGGATGCTGTCAGCCGCGTGAAGAACCTGATGGCACAGAAAGACATTGAGAATATGCTTTCTGTCCTCCGCGAATATGGTTCCTGTGACCTGTCCATTACTCGGAATAGTCACGGGCATCACTACAGCCATTGGAACTGCATGAACATCGACGACAACGTGGACTTCACAGGGATCTTCCCTGATGATGATTCCATGATTGGCACCGGCATTGAAGGGATTAACCAGGATATGGTCGAGCGCTGGCAAGACCTCTGGGAACGCTTTGTGCTGGAGCTGGCGGATGATGTAAAAAGTCTTTCCAAGAAGCTCGAAGCGGACGGGTACTCGCTGATCGAGGCCTCTCCATGCGAAGATGAAGTGGTTTGGGAACGGGCCACTGAAAACTACCTGGTGCGTGTTACTGAACTCCCTGAGCGGGATTTCGATATGGGCCACTGGGATGACGAAGTAAGAGACCAAACAATTTGTTCTATCCTGGAAGGGAAAGAGCGAGTGCTTGGCCTACGTGTTGAGGTACTCTCTCGTGAAAACGAGATTGTCCTGGGTGAAGAGAGTCTGCACGGCTTGACCGTTGCCAGTGATGACAAGAGCTACGCTGGCTACAGACGAGAACTTCTCCGGGGAGCTATCCAGCAAACCAGGGACTTTTTCTCTCGCCACCTCAAAGCGGCATAACAAGGCAGGGGGAAACCCCTGCTTTACTTCCCAAGTCTTCTTACTCATGGTGAATACCATCAGTAAGCAGATTTTGTTCTCTTCGGAGAGCGAAAAGTGCGATAGCTGGTCGCCAAAAACAAACAGCAAATTAACGTTAATTTATTAATCCCTGCGGGAGTCTATCTCCTGCTGGAGATACCTCCCGTTATTTCTGGAGGTATCCATGAACAAATTCCAAGACTTTAAGTTGGTTTATCGTCAAGGTGGTCGTCGCTTAGAGCGTGTCTTTACGGACACTCTTTACGCGAATGTGAAGCGTGTCGCTGACTCGTTTCCTCCGACCGTTCTTTGGCGCATTCAGCCAGCGTAAAACTCGGAACCTTAAAGCTCGGCTTTGGCCGTGACCTTTCAATAACCCATGCGGGTACACTTCCCCGCACGGGGTAGGTGTACCCATTAACTTTGGAGTACACCTTATGAACCAAGTAGCTAAATTGGACCTCGCCCAAATCCGTCAACAAGCCATCAATGATGGTCTGCTGGTGGACCAATCTTCCATCGGTAAACAAGCTGGTTTCCTGACCAATGTGGCAGTGACCCCGGCAATTGTCGATGGAGTATTTGGTGCAGATGGTAAGCACTCGGTAGAAGACTTCCTTTTCATGTTTTTGCAGCTTTGTGTTGCTCAAACAAAGGTCGCTTTTACCGACAACAAGAATTGGGGAAAGATTCGTCTTTATTACCCCATGCCCACTGTAGATGGCTTTTTCAAACCCACTGAGGTTGTGATTAAGTCAGATCCTGTAACCGCAGATGTCACAATCATGATGGCCTCGGAAGAGGGCGCTCATTTGTGCTTGTGATATTCACAAAATTCGCCTCCGTTTCGGGGGCGACCTTTTTAAACCCCTCGCCGGGGAGAACTCCCCGGCTGGGGAGCCTCCTTGGCACATTCTCCCGGTGTCGGGGTTTCCTTAAAACCCTGGCGACGGGGGTTTAAGGAAGCCTCGATGCTGGGGTTCACGCTAAGGAGAAAACCTATGTCTTACTACAAAGCTGACACCGTTCGTGAAGCTGCAAACGGAAACTGGC
TATTCATCCTGGCGGCCCTAGCGCCCCATCTTGAACCAGCTCTCCGTAAACCTGGTCGTCATGTTTCTTGTCCTATCCACGGCGGGAAAGATGGCTTCCGACTGTTCAAGGATGCCCACCTGACTGGTGGCGGCGTATGCAATACATGCGGTGCCAATCATGATGGTTTTGAGCTGCTGATGTGGCTCAATAACTGGGACTTTAAACAGTGCCTGAGCGAGGTTGGTGATTATCTGGGCGTTGAGAAAGAGCAACCCCAGTATCAACAAGCCGCTGCACCGACACGAGCTCCTGTCCAGGCCAAAGCGCCTGTTCAGCAAGAGCCTATGAAGGTGAATAACAAGGTTCTCGATTCCAAGAATCGTAAAAAATCCATTGCCGGTACTCTGATTGCCCACGGTAAAGCTCCTTATGAGCATAACGAAGACAACGAGCTCAGCTACTTTGCCTTCATCCGTGACAAGAGCGGTTTGGAACGCACCATCTGGGGCGTGGATCTTGAAAGAGCCATTGGTGAAAGTGAAGCCAAGTATGGGGATGAGATCGTCATGACAAACCTCGGGCGCGAGCCTGTAACTGTCGTCGTTGAAGTTAAGGACGAGCAGGGGAATGTTGTGAGAGAACAACCTATGCAAACGCATCGCAACACCTGGCTGGTGGAACGTCGCGGCGCTACGGTAACGCAGTTCCGCGCTCGCTCGAACGGTGGTGTTGAGCCGGTGAGTCACCATGTTGAGTCGGCCCCTGTGGTCAACCGCAAGGTAGAAACTCCGGCACCGCAAGTACAACCTGCGGCCACACAACCGGAAGAGCAAAGCAGCGAAAATAAGCCGAAAGTTGTTCCGATGTTTCGTGAACAACCTAAGCCTTGGTTGCTTGAGCTCCAAGAAGAAATGGAGAAGAGAATGGAGCGCGAACGCGCTTACAGTGCTCGTCTCCGTGAGAAAATCGAGAAGGTATGGAACGAGTGTTTGCCGTTCTCCAGTCATGTGACTGAGCCAATGCGTCTGTACTTCAAAAACCGCGAGCTGCTGTTCAAAGTTGATGAAGTAGAAAAAACAGACTGTCTGCGGTTCAATCCGGCTATGGCCTACTACGACGAAGATGGCAATGAAGTTGGGAAATTCCCGGCTATCGTCTGCGCTATCCGAGATGTGGAAGGCAACCTGGTAACGCTCCACCGCACCTATCTCACCCAAAACGGTAAAAAAGCCAAGGTCGGCAACGCCAAAAAGATGATGCCCATTCCTGACGGTTTGGATGTCAATGGCGCGGCCATCCGCCTCGGTGAACCGACTGAGGGTATCCTGGGTGTTGCAGAAGGGCTGGAAACAGCTCTGTCAGCTTATCGAGTCACTCAAATCCCGGTTTGGTCAACGGTCAATGCCACCCTAATGGAGTCCTTCGAGGTTCCAGAAGGTGTTCACACCGTACTGATCTGGGCTGACAAAGATAAGTCTGTGACTGGGGAGAAGTCAGCGAACGTGCTGAAAGCCAAGCTGGAGAAGCGCGGCATTCGTGTGTACGTCCTGCTGCCTAAGCTCCCGATCCCGCCCAGAGCGAAAGGGATTGACTGGAACGATGTCCTGATGAGTCAGGGAAGCCTCGGTTTCCCGAATGCTCGCTACCTGCGCGATTTCATTGCGAGAAGGAGAGCTGAGTATGGCCGTCATTGATGTCTCGAAGGTTGATACAACGCCTGGTAACGACGCGGTGTGCCCCTTCTCTCCCCCTGAGGGGTGGGAGGGGGACTCTGCGGCCTACGTTGAGCTTATGCGGTCTCGGTATCGTCATCTGATGCACGGCCAGAGAATGATGGTGACAGCCTCCTTCGCAAGAAGGGAGCCTATCCAAGTTACTGGCCCGTTTGCTGATGAAGCGACGAAGATCATTAACTCAATGAAGATGAACAAGGCGAAGCCAACAGCTTTGTCTGCCTAAAACTTTAACCCTGAGGGGGTGTTCATGCCCTCTCAGGCATGATTCGCCTCCTCTTCAAC
TATGGAGGTAATCATGAAAAAGTTTTTGCGTATTAAGACGTGGTTTGTGCGTCTTTTCTCTCCTGACAAGAAGACTCTGGGAGCTATCGGTGAAGACCTGCGTAAGGTCGCCGTAACAGCCATCGGTGTCGGTATTGTAGGGGTCGTCTCAGAAAACGGAATCTATGGTCACTCCCGTTTTTGCAACACCGATTTTGACGATAAATTGGCTTGCTTGAATCTATCCGGCGTCTGAATGGGATTTTATTCCCGCGCCTCGATGAGTTCCGCGCCTGATGAACCTCCAGAAAATATACGGCTTCAATGAGCCTTTCCGTTTTACAGGTTCCTCAACAGGCCGGTGGGCCGTTAGTATCATCAATATCAGTATTCGCAAAACCAGATCAGTGATTCTTTAAACCGGTGTATTTCTGCCGTTATGCTACATAAGTTTGCTGTCGTGCCGTTAGGGCCCAGGCTATTCTGGCCAGCTTGTTTGCCAGAGCACAAGTGACGACAAAGTTGCTTTTCCTACACAGTAGATCCCTGACCCAATCGGCCAATTTGCCAGACTGGTGTTCCAGTTTTTGTATGAATACCCTGGCACATTGAACCAACAAAGTTCGGATCTTTTTGTTACCTCGCTTACTAATTCCCAGCAATGTCGTCCGACCTCCCGTGCTGTACTGTCGAGGCACTAGCCCTGTTGCCGCCGCAAAGTCACGACTGCTGGCGTACTGCTTCCCGTCGCCAATCTCAGTTGAAATAGTACTCGCTGTCAGTGTTCCGACGCAGGGAATGCTCAGCAAGCGCTGTCCAATCTCATCTTCGTCCAACTTTCGTTTCAACTGGGATTCCAAATCTTTAATCTGCTCAACAAGATAGTGATAATGCTGTTGTAATTTCAGCAATAACTGGCTGAGGTATAGAGGCAAACTATTGTCCTCAAGAAGGGTACTCAGTCGGCTAATAACGGCAGCTCCTCGTGGAACGCTGATGCCAAATTCCAGCAGAAAAGCATGCATCTGATTGGTTGTTTTTACCTTATCCTGAACCAGGGATTCACGGACACGATGCAGCGCACGCATTGCCTGCTGAGATTCAGTTCTGGGCTGTACAAAACGCATAGACGGACGCGATGCAGCTTCACAAATAGCTTCGGCGTCGACAAAGTCGTTTTTGTTACTTTTAACGAATGGACGGACAAATTGTGGTGATATCAGCTTAGGAAAATGCCCCAACTCTTCCAACTTGCGTGCCATAAAGTGAGAGCCACCACAGGCTTCCATTGCGATGGTTGTAGCGGGGCATGTCGCCAAAAATTCGATTAACTTTGGCCGTGTAAATTTTTTACGGTAAACAGCCTTGCCGCGACGATCTTGGCAATGAATATGGAAAGAGTTTTTACCCAGATCGATACCAATGAGCGCAATGTTTTCCATGATAGTTCTCCGAATGAAAGCCTATCCTCAGCATAGTACCGGGAAGGAGGGAGTGACCATCTCATTAAATAAAGCACGCTAAGCCGGTGGCAGCGGTCGCAATGGCCTAAACTTCCCCGCACCGACCTTGGCGCTGCTGCGCCATAGGTAATCGCCGGTCAGGTTGATGTGCTCCCACCCCAGCGGCGACAGATATTGCAACAATGTGTCGTCCAGCGCCGTGCCGTTGCCACGCAAAGCACTGGTGGCACGCTCCAGATATACCGTGTTCCACAACACGATGGCCGCCGTCACCAGATTGAGGCCGCTGGCCCGGTAGCGCTGCTGCTCAAAACTGCGGTCGCGGATTTCACCCAATCGGTAGAAGAAGACCGCCCTGGCCAGCGCGTTGCGCGCCTCGCCCTTATTCAGCCCCGCATGGACGCGGCGGCGCAGCTCCACGCTTTGCAGCCAATCCAAAATGAACAGCGTGCGCTCGATGCGCCCCAGCTCGCGCAACGCCACGGCCAAGCCGTTCTGGCGCGGGTAGCTGCCGAGTTTGCGCAGCATCAGCGAAGCCGTTACCGTGCCTTGCTTGAT
GGAGGTGGCCAGCCGCAGAATTTCATCCCAATGGGCGCGTATTTGCTTGATGTTCAGCCTGTCGCTGCTAATCATCGGCTTGAGCGCGTCATAGGCGGCATCGCCCTTGGGGATGAATAGCTTGGTTTCGCCCAAGTCACGGATACGCGGCGCGAAGCGAAATCCCAGCAAATGCATCAAGCCAAACACGTGATCGGTGAAGCCTGCCGTGTCGGTGTAGTGTTCCTCGATGCGCAAGTCCGACTCGTGGTACAGCAGGCCATCAAGCACGTAAGTTGAATCACGAATGCCCACGTTGACCACCTTGGCACTGAAGGGCGCGTACTGGTCGGAGATATGGGTGTAGAAAGTCCGTCCTGGACTGCTTCCATACTTCGGGTTGATATGACCAGTGCTTTCTGCTTTGCTGCCGGTTCTGAAGTTCTGGCCGTCCGACGATGACGTGGTGCCGTCACCCCAGTTGCCGGCGAAGGGTTGCCGAAACTGCGCATTCACCAGCTCGGCCAGCGCCGTCGAATAGGTTTCATCGCGGATGTGCCAGGCTTGCAGCCAAGACAGCTTGGCGTAGGTGGTGCCAGGGCAGGACTCGGCCATTTTGGTCAGACCCAGGTTGATCGCGTCGGCCAGGATCGTCGTCAACAGCAAGGTTTTGTCCTTGGCCGTGTCGCTGGTCTTCAGGTGTGTGAAGTGGCGGGTGAAGCCCGTCCATTCATCGACCTCCATCAGCAACTCGGTGATTTTGAGGTGCGGCAGCAGCATAGCTGTCTGGTCGATCATGGCTTGCGCGGCGTCTGGTACTGCCGCGTCCAGCGGCGTGATCTTCAGGCCTGACGCGGTGGTGATGATGGCATCCGGTAAGTCGTTGGCCGCAGCCATGCGGTTGACTGTGGCGAGTTGCGCCTCCAACAATTCCAACCGGTCATGCAGGTATTGGTCGCAGTCGGTGGCCACTGCCAGCGGCAATTCGCTGGCCAGCTTCAAAGTGGCGAACTTCTCGACCGGCACCAGGTATTCGTCGAAGTCCTTGAACTGGCGAGAACCCTGCACCCAGACATCACCGGAGCGCAGCGCGTTCTTCAGCTCCGACAGGGCGCATAACTCGTAGTAACGCCGGTCGATGCCGTCGTCGGTCAGAACCAGCTTTGCCCAGCGCGGCTTGATGAATGCGGTTGGCGCATCGGCGGGCACCTTGCGCGCGCTGTCGCTGTTCATGCCGCGCAGCATGTCGATGGCATCGAGCACACCCTTGGCGGCGGGCGCAGCCCGCAATTTGAGCACGCCCAGGAACTGCGGCGCGTAGCGGCGTAGCGTGGCATAGCTTTCACCGATGTGGTGCAGGAAATCAAAGTCGGCAGGCCGCGCCAATGTTTGCGCTTCGGTGACGCTGGCGGCGAAGGTGTCCCAGGGCATAACGGCCTCGATGGCGGCGAACGGATCGCTGCCGCTTTGCTTGGCCTCAATCAACGCTTGACCGATGCGCCCATACATCCGCACCTTGTCGTTGATCGCCTTGCCGGAAGCCTGGAACTGCTGCTGATGCTTGTTCTTGGCCGCGTTGAACAGCTTGCCGATGATGCGATCGTGAAGGTCGATGATTTCATCGGTGACGGTGGCCATGCCTTCGATGGCCAGCGCTACCAGCGTGGCATAGCGTCGTTGCACCTCGAACTTTGCCAGATCAGCAGGCGTCATCTGGCCACCTTCACGAGCGATTTTGAGCAGGCGGTTCTGGTGAACCTGCCGCTCGATGCCTGCGGGCAGATCAAGTGCTTGCCAGGATTTCAGGCGCTCAATATGTTCGAGCATGTGGCGAGAGTTCGGTTTGGCAGGCGACTGGCGCAGCCATGCCAGCCACGTCACTTTACTGCCGTCCTTGCGCTTGAGAAGTTCGTCCAGGCGCTGACGGTGGGGTGATAACAAAGAATCGGTCAGCGCCGCGTAAATGCGTCGGTTGGCACGGGTGATGGCCTCGGCGCTTGCGCGCTCGATGGCATTCATGG
CGGGCAGGATAATGCTCTGCCGCCGCAGATTCTCGACAAGTGCGCTCGCCAGCACGATGCCTTTGTCGGTCTGCAAGGCCAGCTCGGTCAATGTATGCACGGCTTGCCGATAGTGGCTCATGGTGAAGGGCTTGAACCCAAAAACCGTTTGCAGCTCGACCAAGTGCTCCCGCCGTGTCTGTTCGCGCTGGCCGTACTCGCTCCAACTTTCCACTGGCATCTTGAGTTGCGCGGCCACCATGCGCAACAGGGGCGGAAACGGAGGCTCATCGACGCCCAAAAAGGTGCCAGGGAATCGCAAGTAGCAAAGCTGCACAGCGAAGCCCAATCGATTCGCGGCGCCGCGACGCTGACGGATCACCGACAGGTCGGTTTCGTTGAACGTGTAGTGCCGTATCAGTTCGTCTTTGGCATCTGGCAGTGCCAGCAGGCTTTCGCGCTCGGTGGCGGACAGGATTGAGCGGCGTGGCATGGTCAGTCTTCCCGCAGGTACTGGTACAAGGTTTCGCGGCTGATGCCGAAGTCACGGGCCACCAAGGTTTTTTGGTCGCCTGCCGCAACTCGCCGTTTCAACTCGGCAATTTGTTCGCTGTTCAGCGATTTCTTTCGTCCCCGGTAGGCACCGCGCTGCTTGGCCAGCACGATTCCCTCGCGCTGACGTTCGCGGATCAGGGCGCGCTCGAACTCAGCGAAGGCTCCCATGACCGACAGCATCAGATTGGCCATCGGTGAGTCCTCGCCGGTGAACTTCAGCCCTTCTTTGACGAACTCCATGCGCACGCCCCGTTGTGTCAGCCCTTGGACGATGCGGCGCAGGTCATCAAGGTTGCGTGCCAGCCTGTCCATGCTATGCACCACCACGGTGTCGCCCTCGCGGACGAAGGCCAGCAGCCTTTCCAGCTCGGGACGCTGGGTGTCCTTGCCAGAAGCCTTGTCGGTGAACACCCGCGCCACCTGAACACCCTCCAATTGCCGTTCCGGGTTCTGGTCGAAGCTGCTGACGCGGACATAGCCGATGCGTTGACCTTGCAAGATGCCTCCAAAGGCAAAAGTGTCAGGATGAAATCTATTACCTTTGACGGAATATGTCAATCAATAGGAAATTTAACTCTATTCTGACATCGTTTGCACATGGTGTCGTTTTCAGAAGACGGCTGCACTGAACGTCAGAAGCCGACTGCACTATAGCAGCGGAGGGGTTGGATCCATCAGGCAACGACGGGCTGCTGCCGGCCATCAGCGGACGCAGGGAGGACTTTCCGCAACCGGCCGTTCGATGCGGCACCGATGGCCTTCGCGCAGGGGTAGTGAATCCGCCAGGATTGACTTGCGCTGCCCTACCTCTCACTAGTGAGGGGCGGCAGCGCATCAAGCGGTGAGCGCACTCCGGCACCGCCAACTTTCAGCACATGCGTGTAAATCATCGTCGTAGAGACGTCGGAATGGCCGAGCAGATCCTGCACGGTTCGAATGTCGTAACCGCTGCGGAGCAAGGCCGTCGCGAACGAGTGGCGGAGGGTGTGCGGTGTGGCGGGCTTCGTGATGCCTGCTTGTTCTACGGCACGTTTGAAGGCGCGCTGAAAGGTCTGGTCATACATGTGATGGCGACGCACGACACCGCTCCGTGGATCGGTCGAATGCGTGTGCTGCGCAAAAACCCAGAACCACGGCCAGGAATGCCCGGCGCGCGGATACTTCCGCTCAAGGGCGTCGGGAAGCGCAACGCCGCTGCGGCCCTCGGCCTGGTCCTTCAGCCACCATGCCCGTGCACGCGACAGCTGCTCGCGCAGGCTGGGTGCCAAGCTCTCGGGTAACATCAAGGCCCGATCCTTGGAGCCCTTGCCCTCCCGCACGATGATCGTGCCGTGATCGAAATCCAGATCCTTGACCCGCAGTTGCAAACCCTCACTGATCCGCATGCCCGTTCCATACAGAAGCTGGGCGAACAAACGATGCTCGCCTTCCAGAAAACCGAGGATGCGAACCACTTCATCCGGGGT
CAGCACCACCGGCAAGCGCCGCGACGGCCGAGGTCTTCCGATCTCCTGAAGCCAGGGCAGATCCGTGCACAGCACCTTGCCGTAGAAGAACAGCAAGGCCGCCAATGCCTGACGATGCGTGGAGACCGAAACCTTGCGCTCGTTCGCCAGCCAGGACAGAAATGCCTCGACTTCGCTGCTGCCCAAGGTTGCCGGGTGACGCACACCGTGGAAACGGATGAAGGCACGAACCCAGTGGACATAAGCCTGTTCGGTTGGTAAGCTGTAATGCAAGTAGCGTATGCGCTCACGCAACTGGTCCAGAACCTTGACCGAACGCAGCGGTGGTAACGGCGCAGTGGCGGTTTTCATGGCTTGTTATGACTGTTTTTTTGTACAGTCTATGCCTCGGGCATCCAAGCAGCAAGCGCGTTACGCCGTGGGTCGATGTTTGATGTTATGGAGCAGCAACGATGTTACGCAGCAGGGCAGTCGCCCTAAAACAAAGTTGGGCGAACCCGGAGCCTCATTAATTGTTAGCCGTTAAAATTAAGCCCTTTACCAAACCAATACTTATTATGAAAAACACAATACACAGCATCGTGACCAACAGCAACGATTCCGTCACACTGCGCCTCATGACTGAGCATGACCTTGCGATGCTCTATGAGTGGCTAAATCGATCTCATATCGTCGAGTGGTGGGGCGGAGAAGAAGCACGCCCGACACTTGCTGACGTACAGGAACAGTACTTGCCAAGCGTTTTAGCGCAAGAGTCCGTCACTCCATACATTGCAATGCTGAATGGAGAGCCGATTGGGTATGCCCAGTCGTACGTTGCTCTTGGAAGCGGGGACGGATGGTGGGAAGAAGAAACCGATCCAGGAGTACGCGGAATAGACCAGTTACTGGCGAATGCATCACAACTGGGCAAAGGCTTGGGAACCAAGCTGGTTCGAGCTCTGGTTGAGTTGCTGTTCAATGATCCCGAGGTCACCAAGATCCAAACGGACCCGTCGCCGAGCAACTTGCGAGCGATCCGATGCTACGAGAAAGCGGGGTTTGAGAGGCAAGGTACCGTAACCACCCCAGATGGTCCAGCCGTGTACATGGTTCAAACACGCCAGGCATTCGAGCGAACACGCAGTGATGCCTAACCCTTCCATCGAGGGGGACGTCCAAGGGCTGGCGCCCTTGGCCGCCCCTCATGTCAAACGTTAGATGCACTAAGCACATAATTGCTCACAGCCAAACTATCAGGTCAAGTCTGCTTTTATTATTTTTAAGCGTGCATAATAAGCCCTACACAAATTGGGAGATATATCATGAAAGGCTGGCTTTTTCTTGTTATCGCAATAGTTGGCGAAGTAATCGCAACATCCGCATTAAAATCTAGCGAGGGCTTTACTAAGCTTGCCCCTTCCGCCGTTGTCATAATCGGTTATGGCATCGCATTTTATTTTCTTTCTCTGGTTCTGAAATCCATCCCTGTCGGTGTTGCTTATGCAGTCTGGTCGGGACTCGGCGTCGTCATAATTACAGCCATTGCCTGGTTGCTTCATGGGCAAAAGCTTGATGCGTGGGGCTTTGTAGGTATGGGGCTCATAATTGCTGCCTTTTTGCTCGCCCGATCCCCATCGTGGAAGTCGCTGCGGAGGCCGACGCCATGGTGACGGTGTTCGGCATTCTGAATCTCACCGAGGACTCCTTCTTCGATGAGAGCCGGCGGCTAGACCCCGCCGGCGCTGTCACCGCGGCGATCGAAATGCTGCGAGTCGGATCAGACGTCGTGGATGTCGGACCGGCCGCCAGCCATCCGGACGCGAGGCCTGTATCGCCGGCCGATGAGATCAGACGTATTGCGCCGCTCTTAGACGCCCTGTCCGATCAGATGCACCGTGTTTCAATCGACAGCTTCCAACCGGAAACCCAGCGCTATGCGCTCAAGCGCGGCGTGGGCTACCTGAACGATATCCAAGGATTTCCTGACCCTGCGCTCTATCCCGATATTGCTGAGGCGG
ACTGCAGGCTGGTGGTTATGCACTCAGCGCAGCGGGATGGCATCGCCACCCGCACCGGTCACCTTCGACCCGAAGACGCGCTCGACGAGATTGTGCGGTTCTTCGAGGCGCGGGTTTCCGCCTTGCGACGGAGCGGGGTCGCTGCCGACCGGCTCATCCTCGATCCGGGGATGGGATTTTTCTTGAGCCCCGCACCGGAAACATCGCTGCACGTGCTGTCGAACCTTCAAAAGCTGAAGTCGGCGTTGGGGCTTCCGCTATTGGTCTCGGTGTCGCGGAAATCCTTCTTGGGCGCCACCGTTGGCCTTCCTGTAAAGGATCTGGGTCCAGCGAGCCTTGCGGCGGAACTTCACGCGATCGGCAATGGCGCTGACTACGTCCGCACCCACGCGCCTGGAGATCTGCGAAGCGCAATCACCTTCTCGGAAACCCTCGCGAAATTTCGCAGTCGCGACGCCAGAGACCGAGGGTTAGATCATGCCTAG
Protein sequences of DBSCAN-SWA_4 >NZ_CP040884|61642:74053|73213_74053_+|WP_000259031.1|DBSCAN-SWA MVTVFGILNLTEDSFFDESRRLDPAGAVTAAIEMLRVGSDVVDVGPAASHPDARPVSPADEIRRIAPLLDALSDQMHRVSIDSFQPETQRYALKRGVGYLNDIQGFPDPALYPDIAEADCRLVVMHSAQRDGIATRTGHLRPEDALDEIVRFFEARVSALRRSGVAADRLILDPGMGFFLSPAPETSLHVLSNLQKLKSALGLPLLVSVSRKSFLGATVGLPVKDLGPASLAAELHAIGNGADYVRTHAPGDLRSAITFSETLAKFRSRDARDRGLDHA >NZ_CP040884|61642:74053|65230_65509_+|WP_000268337.1|DBSCAN-SWA MAVIDVSKVDTTPGNDAVCPFSPPEGWEGDSAAYVELMRSRYRHLMHGQRMMVTASFARREPIQVTGPFADEATKIINSMKMNKAKPTALSA >NZ_CP040884|61642:74053|72125_72704_+|WP_015058213.1|DBSCAN-SWA MKNTIHSIVTNSNDSVTLRLMTEHDLAMLYEWLNRSHIVEWWGGEEARPTLADVQEQYLPSVLAQESVTPYIAMLNGEPIGYAQSYVALGSGDGWWEEETDPGVRGIDQLLANASQLGKGLGTKLVRALVELLFNDPEVTKIQTDPSPSNLRAIRCYEKAGFERQGTVTTPDGPAVYMVQTRQAFERTRSDA >NZ_CP040884|61642:74053|67067_70040_-|WP_001138073.1|transposase|DBSCAN-SWA MPRRSILSATERESLLALPDAKDELIRHYTFNETDLSVIRQRRGAANRLGFAVQLCYLRFPGTFLGVDEPPFPPLLRMVAAQLKMPVESWSEYGQREQTRREHLVELQTVFGFKPFTMSHYRQAVHTLTELALQTDKGIVLASALVENLRRQSIILPAMNAIERASAEAITRANRRIYAALTDSLLSPHRQRLDELLKRKDGSKVTWLAWLRQSPAKPNSRHMLEHIERLKSWQALDLPAGIERQVHQNRLLKIAREGGQMTPADLAKFEVQRRYATLVALAIEGMATVTDEIIDLHDRIIGKLFNAAKNKHQQQFQASGKAINDKVRMYGRIGQALIEAKQSGSDPFAAIEAVMPWDTFAASVTEAQTLARPADFDFLHHIGESYATLRRYAPQFLGVLKLRAAPAAKGVLDAIDMLRGMNSDSARKVPADAPTAFIKPRWAKLVLTDDGIDRRYYELCALSELKNALRSGDVWVQGSRQFKDFDEYLVPVEKFATLKLASELPLAVATDCDQYLHDRLELLEAQLATVNRMAAANDLPDAIITTASGLKITPLDAAVPDAAQAMIDQTAMLLPHLKITELLMEVDEWTGFTRHFTHLKTSDTAKDKTLLLTTILADAINLGLTKMAESCPGTTYAKLSWLQAWHIRDETYSTALAELVNAQFRQPFAGNWGDGTTSSSDGQNFRTGSKAESTGHINPKYGSSPGRTFYTHISDQYAPFSAKVVNVGIRDSTYVLDGLLYHESDLRIEEHYTDTAGFTDHVFGLMHLLGFRFAPRIRDLGETKLFIPKGDAAYDALKPMISSDRLNIKQIRAHWDEILRLATSIKQGTVTASLMLRKLGSYPRQNGLAVALRELGRIERTLFILDWLQSVELRRRVHAGLNKGEARNALARAVFFYRLGEIRDRSFEQQRYRASGLNLVTAAIVLWNTVYLERATSALRGNGTALDDTLLQYLSPLGWEHINLTGDYLWRSSAKVGAGKFRPLRPLPPA >NZ_CP040884|61642:74053|70905_71919_-|WP_000845039.1|integrase|DBSCAN-SWA 
MKTATAPLPPLRSVKVLDQLRERIRYLHYSLPTEQAYVHWVRAFIRFHGVRHPATLGSSEVEAFLSWLANERKVSVSTHRQALAALLFFYGKVLCTDLPWLQEIGRPRPSRRLPVVLTPDEVVRILGFLEGEHRLFAQLLYGTGMRISEGLQLRVKDLDFDHGTIIVREGKGSKDRALMLPESLAPSLREQLSRARAWWLKDQAEGRSGVALPDALERKYPRAGHSWPWFWVFAQHTHSTDPRSGVVRRHHMYDQTFQRAFKRAVEQAGITKPATPHTLRHSFATALLRSGYDIRTVQDLLGHSDVSTTMIYTHVLKVGGAGVRSPLDALPPLTSER >NZ_CP040884|61642:74053|70637_70961_-|WP_001447826.1|DBSCAN-SWA MPECAHRLMRCRPSLVRGRAAQVNPGGFTTPARRPSVPHRTAGCGKSSLRPLMAGSSPSLPDGSNPSAAIVQSASDVQCSRLLKTTPCANDVRIELNFLLIDIFRQR >NZ_CP040884|61642:74053|63516_65244_+|WP_000122922.1|DBSCAN-SWA MSYYKADTVREAANGNWLFILAALAPHLEPALRKPGRHVSCPIHGGKDGFRLFKDAHLTGGGVCNTCGANHDGFELLMWLNNWDFKQCLSEVGDYLGVEKEQPQYQQAAAPTRAPVQAKAPVQQEPMKVNNKVLDSKNRKKSIAGTLIAHGKAPYEHNEDNELSYFAFIRDKSGLERTIWGVDLERAIGESEAKYGDEIVMTNLGREPVTVVVEVKDEQGNVVREQPMQTHRNTWLVERRGATVTQFRARSNGGVEPVSHHVESAPVVNRKVETPAPQVQPAATQPEEQSSENKPKVVPMFREQPKPWLLELQEEMEKRMERERAYSARLREKIEKVWNECLPFSSHVTEPMRLYFKNRELLFKVDEVEKTDCLRFNPAMAYYDEDGNEVGKFPAIVCAIRDVEGNLVTLHRTYLTQNGKKAKVGNAKKMMPIPDGLDVNGAAIRLGEPTEGILGVAEGLETALSAYRVTQIPVWSTVNATLMESFEVPEGVHTVLIWADKDKSVTGEKSANVLKAKLEKRGIRVYVLLPKLPIPPRAKGIDWNDVLMSQGSLGFPNARYLRDFIARRRAEYGRH >NZ_CP040884|61642:74053|72872_73220_+|WP_000679427.1|DBSCAN-SWA MKGWLFLVIAIVGEVIATSALKSSEGFTKLAPSAVVIIGYGIAFYFLSLVLKSIPVGVAYAVWSGLGVVIITAIAWLLHGQKLDAWGFVGMGLIIAAFLLARSPSWKSLRRPTPW >NZ_CP040884|61642:74053|65581_65803_+|WP_000714163.1|DBSCAN-SWA MKKFLRIKTWFVRLFSPDKKTLGAIGEDLRKVAVTAIGVGIVGVVSENGIYGHSRFCNTDFDDKLACLNLSGV >NZ_CP040884|61642:74053|61642_62494_+|WP_000595210.1|DBSCAN-SWA MIKQHFQNELVKCGYPDDLTIEYSLGYCQGDGVAFYGDLSVDDVKALMNRLFSTEPGQVDAVSRVKNLMAQKDIENMLSVLREYGSCDLSITRNSHGHHYSHWNCMNIDDNVDFTGIFPDDDSMIGTGIEGINQDMVERWQDLWERFVLELADDVKSLSKKLEADGYSLIEASPCEDEVVWERATENYLVRVTELPERDFDMGHWDDEVRDQTICSILEGKERVLGLRVEVLSRENEIVLGEESLHGLTVASDDKSYAGYRRELLRGAIQQTRDFFSRHLKAA >NZ_CP040884|61642:74053|62952_63339_+|WP_001077336.1|DBSCAN-SWA 
MNQVAKLDLAQIRQQAINDGLLVDQSSIGKQAGFLTNVAVTPAIVDGVFGADGKHSVEDFLFMFLQLCVAQTKVAFTDNKNWGKIRLYYPMPTVDGFFKPTEVVIKSDPVTADVTIMMASEEGAHLCL >NZ_CP040884|61642:74053|65984_66989_-|WP_000427620.1|transposase|DBSCAN-SWA MENIALIGIDLGKNSFHIHCQDRRGKAVYRKKFTRPKLIEFLATCPATTIAMEACGGSHFMARKLEELGHFPKLISPQFVRPFVKSNKNDFVDAEAICEAASRPSMRFVQPRTESQQAMRALHRVRESLVQDKVKTTNQMHAFLLEFGISVPRGAAVISRLSTLLEDNSLPLYLSQLLLKLQQHYHYLVEQIKDLESQLKRKLDEDEIGQRLLSIPCVGTLTASTISTEIGDGKQYASSRDFAAATGLVPRQYSTGGRTTLLGISKRGNKKIRTLLVQCARVFIQKLEHQSGKLADWVRDLLCRKSNFVVTCALANKLARIAWALTARQQTYVA >NZ_CP040884|61642:74053|70042_70600_-|WP_001162012.1|DBSCAN-SWA MQGQRIGYVRVSSFDQNPERQLEGVQVARVFTDKASGKDTQRPELERLLAFVREGDTVVVHSMDRLARNLDDLRRIVQGLTQRGVRMEFVKEGLKFTGEDSPMANLMLSVMGAFAEFERALIRERQREGIVLAKQRGAYRGRKKSLNSEQIAELKRRVAAGDQKTLVARDFGISRETLYQYLRED |
13 | Salmonella_phage(33.33%) | integrase,transposase | attL 70076:70089|attR 75775:75788 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_5 |
90230 : 92217
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP040884|90230:92217|DBSCAN-SWA CATGTCCAATATCAAGCCGCTGCACGACCGCGTGGTCATCAAGCGCATGGAAGAAGAGAAGCTGTCCGCCGGCGGGATCGTGATCCCGGATTCGGCCACCGAGAAGCCGATCAAGGGCGAAGTCGTCGCCGTCGGCACCGGCAAGGTGCTGGACAACGGCCAGGTCCGCGCGCCGCAGGTCAAGGTCGGCGACAAGGTGCTGTTCGGCAAGTACAGCGGCACCGAAGTGAAGCTGGACGGCGTCGAGCTGCTGGTGGTGAAGGAAGACGACCTGTTCGCGATCCTCGGCTGATCGCGCGTCGCTCCCACACATTTCTCATCCGAATAATTTTTCGAGGTAATTCGCAATGGCTGCCAAGGACATTCGTTTCGGCGAAGACGCGCGCTCCAAGATGGTGCGCGGCGTCAACGTGCTCGCCAACGCCGTGAAGGCGACCCTCGGCCCGAAGGGCCGCAACGTCGTGCTGCAGAAGAGCTACGGCGCGCCGACCATCACCAAGGACGGCGTCTCCGTCGCCAAGGAAATCGAACTGGCTGACGCGTTCGAGAACATGGGCGCGCAGATGGTGAAGGAAGTCGCTTCCAAGACCTCCGACAACGCCGGCGACGGCACCACCACCGCCACCGTGCTGGCGCAGGCGTTCATCCGCGAGGGCATGAAGGCGGTCGCCGCCGGCATGAACCCGATGGACCTGAAGCGCGGCATCGACCAGGCGGTGAAGGCCGCGGTCGGCGAACTGAAGTCGCTGTCCAAGCCGTCGTCGACCAGCAAGGAAATCGCCCAGGTCGGCGCGATCTCCGCGAACTCGGATGCCAACATCGGCGACCTGATCGCGCAGGCGATGGACAAGGTCGGCAAGGAAGGCGTGATCACGGTCGAGGAAGGCAGCGGCCTGGACAACGAACTCGACGTGGTCGAGGGCATGCAGTTCGACCGCGGCTACCTGAGCCCGTACTTCGTCAACAACCAGCAGTCGATGTCGGCCGACCTGGATGATCCCTTCATCCTGCTGTACGACAAGAAGATCTCCAACGTGCGCGACCTGCTGCCCGTCCTCGAGGGCGTGGCCAAGGCCGGCAAGCCGCTGCTGATCGTGGCGGAGGAAGTCGAAGGCGAAGCGCTGGCGACCCTGGTGGTCAACACCATCCGCGGCATCGTCAAGGTCTGCGCGGTGAAGGCCCCGGGCTTCGGCGACCGTCGCAAGGCGATGCTGGAAGACATGGCGATCCTGACCGGCGGCGTGGTGATTTCCGAGGAAGTCGGCCTGTCGCTGGAGAAGGCCACCATCAAGGACCTCGGCCGCGCCAAGAAGATCCAGGTGTCGAAGGAAAACACCACCATCATCGATGGCGCCGGCGAAGGCGCGGGCATCGAGGCGCGCATCAAGCAGATCAAGGCGCAGATCGAGGAGACCTCCTCCGACTACGACCGCGAGAAGCTGCAGGAGCGCGTGGCCAAGCTGGCCGGCGGCGTTGCGGTGATCAAGGTCGGTGCCGCCACCGAAGTCGAGATGAAGGAAAAGAAGGCGCGCGTCGAAGACGCCCTGCACGCGACCCGTGCGGCCGTCGAGGAAGGCATCGTCCCGGGCGGCGGCGTCGCCCTGATCCGTGCCAAGGCGGCGATCGCCGGCATCAAGGGCGTGAACGAAGACCAGAACCACGGCATCCAGATCGCCCTGCGCGCGATGGAAGCCCCGCTGCGCGAGATCGTGACCAATGCCGGCGATGAGCCGTCGGTCATCCTCAACCGCGTGGTCGAAGGTTCGGGTGCGTTCGGCTACAACGCCGCCAACGGCGAGTTCGGCGACATGATCGAGTTCGGCATCCTGGACCCGACCAAGGTCACCCGCACCGCGCTGCAGAACGCCGCGTCGATCGCGGGCCTGATGATCACCACCGAAGCGATGGTGGCCGAGGCCCCGAAGAA
GGACGAGCCGGCGATGCCGGCCGGCGGCGGCATGGGCGGCATGGGCGGCATGGATTTCTAA
Protein sequences of DBSCAN-SWA_5 >NZ_CP040884|90230:92217|90230_90521_+|WP_004201172.1|DBSCAN-SWA MSNIKPLHDRVVIKRMEEEKLSAGGIVIPDSATEKPIKGEVVAVGTGKVLDNGQVRAPQVKVGDKVLFGKYSGTEVKLDGVELLVVKEDDLFAILG >NZ_CP040884|90230:92217|90576_92217_+|WP_004201176.1|DBSCAN-SWA MAAKDIRFGEDARSKMVRGVNVLANAVKATLGPKGRNVVLQKSYGAPTITKDGVSVAKEIELADAFENMGAQMVKEVASKTSDNAGDGTTTATVLAQAFIREGMKAVAAGMNPMDLKRGIDQAVKAAVGELKSLSKPSSTSKEIAQVGAISANSDANIGDLIAQAMDKVGKEGVITVEEGSGLDNELDVVEGMQFDRGYLSPYFVNNQQSMSADLDDPFILLYDKKISNVRDLLPVLEGVAKAGKPLLIVAEEVEGEALATLVVNTIRGIVKVCAVKAPGFGDRRKAMLEDMAILTGGVVISEEVGLSLEKATIKDLGRAKKIQVSKENTTIIDGAGEGAGIEARIKQIKAQIEETSSDYDREKLQERVAKLAGGVAVIKVGAATEVEMKEKKARVEDALHATRAAVEEGIVPGGGVALIRAKAAIAGIKGVNEDQNHGIQIALRAMEAPLREIVTNAGDEPSVILNRVVEGSGAFGYNAANGEFGDMIEFGILDPTKVTRTALQNAASIAGLMITTEAMVAEAPKKDEPAMPAGGGMGGMGGMDF |
2 | uncultured_virus(100.0%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_6 |
98519 : 100404
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP040884|98519:100404|DBSCAN-SWA TTTGCACGACACAAACAATGACAAGGAAGAGCTAGTTTCACATGCGAAAGTAAATGTTCCGGCAGAACAAAGCATTCGCGGTGAACTGTTGCCGTCATCGAGCTCGCTCCGGAACATCCAGGACAACCCCGCCGTCGCCTATCTGGTGAGCCTCGGGTCAAAGCGAAGCAGGCAGACCATGAGCTCATTCCTCAACATTGTCGCCAAGATGATCGGGTTTCAGAACCTTCGTGACTGTGCATGGAGCTCAATGAGGCGGCACCACATATTGGCGGTGCTGGAAATGCTGGGGGATGCAGGGAAGGCTCCGGCAACGATCAACACCTACCTGTCAGCGCTCAAAGGGGTGGCTCTTGAAGCCTGGACGATGAAGCAAATTGATACGGATAGCTTCCAGCACATTAAGCAAGTCCGTTCAGTACGTGGATCTCGACTTCCTAAAGGGCGGGCACTTGAACGCCATGAGATTCGCAGCCTCTTTTTCACATGTGAAAGTGATTCAAGCGCCAAAGGGCTTCGGGATGCGGCCATTCTCGGGGTGCTCCTCGGGTGTGGCTTGCGCCGCTCGGAAATCGTTGCGCTGGACATGGGAAGCATGATCTACAAGGACCGCGCTCTCAAGGTTCTTGGCAAGGGCAACAAAGAGAGAATGGCGTATGTGCCTGGTGGCGCATGGAAAAGACTGGATAAGTGGGTTGAAGAGGTTCGAGGAACGCATGAAGGGCCTTTATTCCCGAGGATCAGGCGGTTTGATGATGTGACTGGAGAGCGGATGTCGGATCAGGCCATCTACCACATCTTGGAGACCAGAAGAGTTGAAGCCGGTCTGGAGATGTTTGCGCCCCATGACCTGCGACGCACCTTTGCCTCCTCGATGCTGGATAACGGTGAGGATATTGTGACCGTGAAGGACGCAATGGGCCACTCAAGCATTGCAACTACCCAGAAATATGACAGACGCGGCGATGAGCGTTTGAAGAGAGCGAGCCAACGCCTCGATATAGCGGATTAACAGCATGAATGATGAAGAACTCGAACTTGCCAGAGCCGAAGCCATGAAAGCCGATAGGTGCTTTTCAAAAGGACGGCTCAGGGACGAATTTCGGATGAAGCCTAAGCCAGGAGTAGAGCCAGTTTCGTTCTACAAAAACGGGTATGGTGGCCAATTTGGAGTGTACCGGATAGCAGATTGCCAACCAATGCGGAGAAGGGGTTGTTCACCGGCATCACAAAAGCAAATAAGGGCGCAATCCATCCTTTCGGTCAAGGCAAGAATGCGTAGCAACCTGGCAAAGGCCTCTGTCATGGCACAAAGGTGGGTCGCTCTGGAACCTCTGGTTCTCGATACCGAGACGACCGGACTTGGTGAAAGAGACCAGGTGATCGAGCTGGCTGTTACTGACATCAGAGGCGCGGTTCTCCTCTGCACCAGATTACGGCCAACCGTGGAAATAGACCCTCAGGCAATGGGTGTTCATGGCATCACCGAGACTGAGCTGTCGAATGAGCCTACGTGGACTCAAGTTGCGCCAGCTCTCGCCCGGCTTTTGTCGGGCCGTCATCTGGTGATATTTAACTCCAGTTTCGACAGTAGGATGTTGAGGCAAACAGCCAGTGCATTTGGAGACCAACTCTCTTGGTGGCAAGAGCAGAACTGTCTGTGCGCGATGAAGCTGGCGGCTGATGCCTTTGGCTCAACCAACCGGCACGGCACAATCTCACTTGCGGATGCCACCTGCGAGGCAGGTGTGAGCTGGAAAGGTCGAGCCCATTCAGCAGCGACCGACGCTATTGCCACCGCCGACTTGGTGACAGAGATAGCCAAAGTCCAACGTGACCTCATGGTCCAGCTCCAGGAGCTTCAAAGCAAAGGTAATTTGGAATGA
Protein sequences of DBSCAN-SWA_6 >NZ_CP040884|98519:100404|99534_100404_+|WP_004201184.1|DBSCAN-SWA MNDEELELARAEAMKADRCFSKGRLRDEFRMKPKPGVEPVSFYKNGYGGQFGVYRIADCQPMRRRGCSPASQKQIRAQSILSVKARMRSNLAKASVMAQRWVALEPLVLDTETTGLGERDQVIELAVTDIRGAVLLCTRLRPTVEIDPQAMGVHGITETELSNEPTWTQVAPALARLLSGRHLVIFNSSFDSRMLRQTASAFGDQLSWWQEQNCLCAMKLAADAFGSTNRHGTISLADATCEAGVSWKGRAHSAATDAIATADLVTEIAKVQRDLMVQLQELQSKGNLE >NZ_CP040884|98519:100404|98519_99530_+|WP_000543934.1|integrase|DBSCAN-SWA MHDTNNDKEELVSHAKVNVPAEQSIRGELLPSSSSLRNIQDNPAVAYLVSLGSKRSRQTMSSFLNIVAKMIGFQNLRDCAWSSMRRHHILAVLEMLGDAGKAPATINTYLSALKGVALEAWTMKQIDTDSFQHIKQVRSVRGSRLPKGRALERHEIRSLFFTCESDSSAKGLRDAAILGVLLGCGLRRSEIVALDMGSMIYKDRALKVLGKGNKERMAYVPGGAWKRLDKWVEEVRGTHEGPLFPRIRRFDDVTGERMSDQAIYHILETRRVEAGLEMFAPHDLRRTFASSMLDNGEDIVTVKDAMGHSSIATTQKYDRRGDERLKRASQRLDIAD |
2 | Gordonia_phage(50.0%) | integrase | attL 94737:94749|attR 102594:102606 |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_7 |
103563 : 105075
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP040884|103563:105075|DBSCAN-SWA CATGAAACAACTTCCTCCTGACACACCAGAACAATCACTGATCACTCAGTACAAAGGGCCTCGCCTTGTCGTTAAGGCCTACGCTGGAACGGGTAAAACCACGACACTGGTGAAGTACGCCCACAACAACCTCGATTCACGTATCCTCTACTTGGCATACAACAGGGCTATCCGCGACGAGGCAAGAGAAAAGTTTCCTGCAAACGTAGACTGCAAAACGTCCCACCAGCTTGCCTACGCCACTATAGGAAGGGGCTACCAGCACAAACTCTCCGGCAACCTAAGGCTCACCGATATTGCCCAAGCGGTGAATACCAAGAACTGGACGTTTGCCAAAGATATTCTCGATACGCTCAACGCCTTTATGTGTAGTGCAGACATGCGGATTCTTTATACGCATTTTGCTCGCGCCGATACGGGTAAAGTGCTTACGTCCAAACAGGAGAGATACCAAATCCAGGTGGTCGAAGGTGCTGAGCTCATATGGAAACGGATGACAAACGTTCAAGATCCGTTCCCGACCGTACACGATTGCTACCTCAAACAGTATCAGCTCGGGATGCCGAATCTGTCTCGCCGGTACACCACCATTCTTTTTGATGAGGCACAAGACGCTAACCCCGTAACAAGTAGCATCGTCCTACAGCAGAACTGCAAGGTAATCCTGGTTGGAGATCGCCACCAGCAGATCTATAGGTTCAGAGGCGCAAACAACGCCCTTGATAGCAAAGAGCTCATGAACGCCGACCAACTCTATCTCACTCATAGCTTCCGCTTTGGCCCCAACGTTTCGCTGGTGGCAAACGCCCTTCTTGAACTCAAAGGTGAAACACGACCTGTTGTTGGCCGGGGACCAGCAGATCAGGTACTCATGTTTTTACCAGGTGACGTGGGCCACCGCGCAATACTTCACCGAACCGTCATGGGGGTTATAGAGACGGCGCTCTCTGCGACCGAATCCGGAGCGCAGGTATTCTGGGTCGGTGGAATCGACGCTTACCAGATCAATGAGCTCCAGGATTTGTACTGGTTTTCGATGGCAGAGCCAGACCGGGTAAAGAATAAGAAACTGCTTGATGAGTATGAAGACTACTTCGAGTATCAAGAAGTAGCGAAGGCGACCAAAGACCCTGAGATGATGAGGGCTGTCAAGATCATCAACAGCTACGATGAAATCCCTGAACGACTCACCACTCTACGACGCAATACAGTCAAAGAAGAGTTTGGGGCTGACATTACGGTCTCAACAGCTCATCGGTGCAAAGGGTTGGAGTGGGACTTTGTTCAGCTCTATGACGACTTTCCGGATGTCCTGGACCCAGAGCTCGACCCAATGGCCCGTGACGATGAAATAAACCTGCTCTACGTTGCATCCACCAGAGCGATGCGAATCCTTGCGTTGAACAGCGCTGTCGAGATGGTTATCCGCTACATCACCCAAAAACGCATGGTCGAGAAGCAGATGAAGATGGCCGCAGAAGCGACAGAAGTTGAAGAGGACACGACCAAATAG
Protein sequences of DBSCAN-SWA_7 >NZ_CP040884|103563:105075|103563_105075_+|WP_000811656.1|DBSCAN-SWA MKQLPPDTPEQSLITQYKGPRLVVKAYAGTGKTTTLVKYAHNNLDSRILYLAYNRAIRDEAREKFPANVDCKTSHQLAYATIGRGYQHKLSGNLRLTDIAQAVNTKNWTFAKDILDTLNAFMCSADMRILYTHFARADTGKVLTSKQERYQIQVVEGAELIWKRMTNVQDPFPTVHDCYLKQYQLGMPNLSRRYTTILFDEAQDANPVTSSIVLQQNCKVILVGDRHQQIYRFRGANNALDSKELMNADQLYLTHSFRFGPNVSLVANALLELKGETRPVVGRGPADQVLMFLPGDVGHRAILHRTVMGVIETALSATESGAQVFWVGGIDAYQINELQDLYWFSMAEPDRVKNKKLLDEYEDYFEYQEVAKATKDPEMMRAVKIINSYDEIPERLTTLRRNTVKEEFGADITVSTAHRCKGLEWDFVQLYDDFPDVLDPELDPMARDDEINLLYVASTRAMRILALNSAVEMVIRYITQKRMVEKQMKMAAEATEVEEDTTK |
1 | Pseudoalteromonas_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_8 |
112403 : 116150
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP040884|112403:116150|DBSCAN-SWA AATGGATAAATGCCAATTGATAGACATCCCAAGCGACCCAGAGAAGAAACGTGAGTGGATCAAGTACAAACTCAAGATCCAGGGGCTTTCTCTGGCCGCATTGGGCAGAAAACACAAAACATCTCGGCAGGTGGTGTCTACGGCACTCTATAAGCCCAGTCCACGCTGGGAACATGAGATAGCTACAGCTTTGGGTGTGAAGCCGTCTGAGATTTGGCCGGAGCGGTACGACGAAGAACACGAAATACCCCTCAGACATAAGGAGGCAAGCTGATGAAGAACAAAGCCAAGGCGCTAGTTCTGTCTGCGGCTCTCCTTTCATCAACAGCGAATGCTATTGACCTGAGCGGAACCATCTTCGACAAAGCAGCGAAAGCATATAACCTCGACCCTCTTCTAGTGTATTCGGTCGCATTGGCCGAATCTGCATCAGGGAGAGGTAATGGCTCTATAAGTCCTTGGCCTTGGACGCTTCGCGTTCCTGGGCTTCCTTTCTATGCTAAGTCGGAAGATCAGGCAAAGGCTAAGCTCGCTGAGTTTCAGCAGCAGTACGGTCGTGCCATTGATGTCGGGTTTATGCAAGTGAGCATCCGGTGGAATGGTCATAGAGTTTCTTCTCCAGCAGATCTTCTCGACCCAGAGACCAACGTCATGGTTGGGGCAGAGGTGCTATCAGAAGCCATTCAGTCATCTCCAAATGACTTGGAGCTTGGCGTTGGCCGCTATCACGCCTGGGAAGACGAAATCCGAGCCAGAAACTATGGTAGCCGAGTCTTGGCTATCTATCGCAACCTTCGTGATTTGTGAGGGGGGCGGAATGTTGGAACTGGATATTATTGGTGCGTGGGATGCAAGAGCCGTCAACCTCGATCAAGAAGAAGCTGATAGAAACGTCTACGAGTTCGATCTGACATTGTGGAACCTGCTATCCACTCTGGCAAAAGAACGTCCAGATGATGCGGCCTCACAATTTTCTTTGGGCATGGACACCGTTCAAAAGCTGTCACTGGCAACACCTTCCCAATTGGAAGCTCTGGCCTCTGGCGTGTTGATCTCTTTCAAACTCGAAACAGCAGAGCAGAACATCATCACGCGACTCTCTGGCGACTACGACCCTGTAGTTTTTATCAACCATAGTGTTGATGAATTTGATGCTGCCTACTGGTTGCTATTTAACCGCGTCGCATCGAGAGACCCGGAGATGGCAAAGGAAGTTTTCGGGGTTTCGAGAGAGCTTGCGGAGCTGGTGGCTAAGGCAACAGACAGCCAGTTGCGCCACATGTCTGGAACAACGGTTACGCATTTTACGCTTCGTTTTGCTCCGAGCATCATTGAAGAAATTCTCGATGACAGCCGGGAAGAGTTAACACACCCGGTATTGAAAAAACTGCAACAGTCTCTACAGGGACGTGGGAGGTGGAGATGAACATTGGCAACTCTGGTACATTGGGTCGCTGGGTTACAGCTCGACACATGGCCCTTGCTGGGTACATCACAAAAATCATCATGATCGAGACTGGCCTGACCTACAAACAGGTCAGACGGCTTTACCAGGATCTGGAGAGGGACGGATATACTCTGGAACGAAAATCCAGAACTTTCCGGGGTGGTGCGACACTGATTCATAGTCACACATCCAAGATACAGGCCTCTCTTCTAATGCAGCTCTACTTCAACATTGGTGGAGAAGCCGTGTTGCGGTCTGTGAACATCAAAGCCTTGAACAAGGCATTTAGAATGTATCACGCAATCCGCAAAGAAGTGCCCGGAATGAAAGGTGCTCGGTGGGCTCCGTTTGATATTACTGATGCCTGGTGTCTTGCTTCGGAGCTGAGAAGTGGGGACGCAATGCTGGAGGTGTGCGACAACTGCAAGTGTACGTACTTCACCTCTGTTAATCAAAGAACCTGCGTTGAATGTCCGTT
CTGCAAAGAACAAGGAAGGCATGGTGGTGGGGAGAAAGAGTGTGCTTGAGTAGACTATGACATTTCGGACCAGAAGATGAGCGCCCAAACTTTGCGGCGCTCATTTTTTTACTTCTTCTGTGGGATACGGTAGCGCTCCAGCTCGACAGCTCCCAACCCCTTGAAAGCATCCGGGCGACGTCCTTGCCCACTCCACGAAACTCCGTCTTTTGAATATTTAGCATTATCAGGTTCTGATCTGCTCGTGAACATTTCGTTGAGCAAGCCGATGTCCACACCGCAGGATTCCATGTCGCTCATTATTCGCTCAGCCTGAGCTCGCTTTTCTTTTTCTTCTTCTTCTCGCTTTTTGTACTCTTCCTCCAGTTCATTGAGAACGCCCTTCATTCTGTCAATGATCTCCCGAACTTCATCTAACGGGAGTCCTCGCAACAGGGTGCGAATGCGGCTTTTACGCTTTAACTCTGCGATTATTAACTCTCTACGTTCAGCCGCTGATAGCGTAGAGAACTCCTCGTGGTCTTTCATGTCATACGGAGCCCATCAGTTTTCCTTTAAGTGAATCACGTTGAGAGTTCAGTTTTGCGTCAGTCTGCACTGGCGAATTTTGATGCTGTGCAGGTTGACTCTCTGAACTCTGTTGCTCAAAAGACCCAATGTTGCCAGAAAATCGAAGAGAATAAAGACCCAATAGAGCAAGGGCGCGGAGCCGATCACTACGCGCCTTATGCTGCATCTGGGACAGTTCACGATAGAGCTCCGGAAAGGCTTGCTCCGATATGTTCAGATTAGATATTTTTCCATCCCATTTGCTACCAGCCACAACGACCTCCTATCTATCAGCCGCAGAACCAGAAGCCCCTGGCATTGGATGCCACGGATTCGTTGGGCAGTACGATCCTGCTCTTAGGGAAAAGCTCCTTGGCCGCATCTTGGTAAGCCTCGGCACCGCCGCCAGCCAGCAGAACTACGTCAGCATCCATCCCGTCCTCACGCATTGACTTCCGCATAGGGATCAAGGCGTTTTGAGCGACTTTGGTTGAGGCTTTCTTGAAGTAGTCTTTGATCGATACCTTTTCACCGTAGAGGAAGATTTCGGCCTTACCGGCACGAATAGCTTTCTCGATCTTTTCGATGCCAGGGGCACCGCCGTGGTCTTCCTGAATTAGCCGGTCCGTTTCCTGTAGCAACACCGACATCGCCTTGAGGCTGGTGCCAGATGAGTGATAGCGGACCTCTCCCTCTTCAAGAGCTACCCAGTCTACAGAAAAGAACCCAGGGTCAATTACAACGGTTTTTCCTCCCTGGATAATCTCCAGGAGGTCTTCATCTTTGGTTGAACTTACAACATCCATGTAAGCACCGGCAGGTTGAGGTACAACCACGACAGACTTAACCGCTACCGATCGTTTTGGCGTGATCTGGTGTTCGCCCTCAAGCCGAGCTTTCAACGCCTCTCTGCGCTCTACGTCCATGTACTGACTAACAGGCAGGCCAGTCACCAGCACATCGATCTCCTTCTGCTCGGACATCAGGAGTGCAGCGTAGAAAAGAGCCTTGTATGGATTGGTCGAAGGATAGTCGCCGTGAAGCTCACGCTCCCATCCTTGCAATCGGTCAGGCTCGACGCCTGCAACCCATTTCTCTCCATCAATCACAACCTGAATGCAGGTCCCTGCACCGCCAGTTAACTGTTGTGGCATCAGTTCCAATGGACCTGCCCCCACCGGCATGACGACTGTGCGAGCTTCCTCACCTTTATACCCCATTGCCATTTTCAGGTTGGAGTAACCAATATCCAAACCCAGAACAAATTGACTCAT
Protein sequences of DBSCAN-SWA_8 >NZ_CP040884|112403:116150|113219_113828_+|WP_000891157.1|DBSCAN-SWA MLELDIIGAWDARAVNLDQEEADRNVYEFDLTLWNLLSTLAKERPDDAASQFSLGMDTVQKLSLATPSQLEALASGVLISFKLETAEQNIITRLSGDYDPVVFINHSVDEFDAAYWLLFNRVASRDPEMAKEVFGVSRELAELVAKATDSQLRHMSGTTVTHFTLRFAPSIIEEILDDSREELTHPVLKKLQQSLQGRGRWR >NZ_CP040884|112403:116150|112675_113209_+|WP_000790610.1|DBSCAN-SWA MKNKAKALVLSAALLSSTANAIDLSGTIFDKAAKAYNLDPLLVYSVALAESASGRGNGSISPWPWTLRVPGLPFYAKSEDQAKAKLAEFQQQYGRAIDVGFMQVSIRWNGHRVSSPADLLDPETNVMVGAEVLSEAIQSSPNDLELGVGRYHAWEDEIRARNYGSRVLAIYRNLRDL >NZ_CP040884|112403:116150|114435_114855_-|WP_000651490.1|DBSCAN-SWA MKDHEEFSTLSAAERRELIIAELKRKSRIRTLLRGLPLDEVREIIDRMKGVLNELEEEYKKREEEEKEKRAQAERIMSDMESCGVDIGLLNEMFTSRSEPDNAKYSKDGVSWSGQGRRPDAFKGLGAVELERYRIPQKK >NZ_CP040884|112403:116150|114856_115150_-|WP_000919078.1|DBSCAN-SWA MAGSKWDGKISNLNISEQAFPELYRELSQMQHKARSDRLRALALLGLYSLRFSGNIGSFEQQSSESQPAQHQNSPVQTDAKLNSQRDSLKGKLMGSV >NZ_CP040884|112403:116150|113824_114376_+|WP_001020646.1|DBSCAN-SWA MNIGNSGTLGRWVTARHMALAGYITKIIMIETGLTYKQVRRLYQDLERDGYTLERKSRTFRGGATLIHSHTSKIQASLLMQLYFNIGGEAVLRSVNIKALNKAFRMYHAIRKEVPGMKGARWAPFDITDAWCLASELRSGDAMLEVCDNCKCTYFTSVNQRTCVECPFCKEQGRHGGGEKECA >NZ_CP040884|112403:116150|112403_112676_+|WP_000356489.1|DBSCAN-SWA MDKCQLIDIPSDPEKKREWIKYKLKIQGLSLAALGRKHKTSRQVVSTALYKPSPRWEHEIATALGVKPSEIWPERYDEEHEIPLRHKEAS >NZ_CP040884|112403:116150|115166_116150_-|WP_000077457.1|DBSCAN-SWA MSQFVLGLDIGYSNLKMAMGYKGEEARTVVMPVGAGPLELMPQQLTGGAGTCIQVVIDGEKWVAGVEPDRLQGWERELHGDYPSTNPYKALFYAALLMSEQKEIDVLVTGLPVSQYMDVERREALKARLEGEHQITPKRSVAVKSVVVVPQPAGAYMDVVSSTKDEDLLEIIQGGKTVVIDPGFFSVDWVALEEGEVRYHSSGTSLKAMSVLLQETDRLIQEDHGGAPGIEKIEKAIRAGKAEIFLYGEKVSIKDYFKKASTKVAQNALIPMRKSMREDGMDADVVLLAGGGAEAYQDAAKELFPKSRIVLPNESVASNARGFWFCG |
7 | uncultured_Caudovirales_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those matched by specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_9 |
120053 : 121169
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >NZ_CP040884|120053:121169|DBSCAN-SWA ATCAATCAGAACCACAGGCCTCTCGGCCAAATGCGCCACCAGGGAGAACCCACTTCTGCCCCTCCGGCACAAAAATCGGGAGCGTGTAATCCTCTCTGGATACCTGCTCCAATGCCTCAACATCACTAATCTGTATCGGCGCAACAGCAGAACCTAAATCTATTACGTCGATTTTTTTTCTTGATACCGTCACACCAAAGGTTTGTTCATACTGAGCAATCTTCCCAGCTCGTTCAGGCCAGTAGTGACGGATAGTGGACCAGATACGCTGTGAGTTGTAGATGCAGGTCATACACGAAGACCGGCTCCAACCAAGGCGATAAGGGACTGGGGCCAGAATGCGGTGACGCTCGATTACTTCCCATACCTCTTCCTCAGTCCAATGAAGAACGGGCCTCCAGGCATCAACTAACCGTGCAGTCTTCCCATATCTTCTGTCACAGGCATGAGCCTCAAGCTGGTTGTACTTGGATCTGTTTGCACTTTCCTCCCGGCGCTCACCAGTGATGAAAAGGATCTTCTTCCCCTTGAAGCGCTCCTGGTTATTAAGAGCTCTGCGGCCAACATCGATCTTCAAGGCTGATGAACACCACCTTGTTTGGAGTGATGGCGATTGCTGAGGGAAACGAAGGCGTGTACCAGGCTTAGAGCGCTTATGGTCTCTAGGCAGCACCAGAAGCCCCTCAGGCGTCTCTACACGATGGGGGTGGCTATAGGCATTGTCTTTGAGCATTTCGCCCTCAAAGCCGCCCTCAAGCCACGAGAAGTACATTGGGACACCCAGTTCTTCCCCGAGCTGACGACAGTAATCACGCATGAAGGCCCAATCCATCAACGAACTACCTTCCTGACCATCAACATCATGGTGCCAGAACTCCACCTTTGACTTATCAACACCCATGTCCACCAGCCGCAAGTACGCAGCAATCGAGTCCTTGCCTCCAGACAGGCAAACAATGATGTGGTCGTACAAACTCAAGTCCACATCCGGCGCTGAGAAGTACGTTGTTCTATCGTCACAACGCTGACTGGAAGATATACCGGTAAGAACCGACTCCAGTGATGGCAACACCACCTGGTCATCGAACAAATCTCCTTGGTTTTCTCGTCTCAACAT
Protein sequences of DBSCAN-SWA_9 >NZ_CP040884|120053:121169|120053_121169_-|WP_000946104.1|DBSCAN-SWA MLRRENQGDLFDDQVVLPSLESVLTGISSSQRCDDRTTYFSAPDVDLSLYDHIIVCLSGGKDSIAAYLRLVDMGVDKSKVEFWHHDVDGQEGSSLMDWAFMRDYCRQLGEELGVPMYFSWLEGGFEGEMLKDNAYSHPHRVETPEGLLVLPRDHKRSKPGTRLRFPQQSPSLQTRWCSSALKIDVGRRALNNQERFKGKKILFITGERREESANRSKYNQLEAHACDRRYGKTARLVDAWRPVLHWTEEEVWEVIERHRILAPVPYRLGWSRSSCMTCIYNSQRIWSTIRHYWPERAGKIAQYEQTFGVTVSRKKIDVIDLGSAVAPIQISDVEALEQVSREDYTLPIFVPEGQKWVLPGGAFGREACGSD |
1 | unidentified_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those matched by specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_10 |
124809 : 130654
Sequences of DBSCAN-SWA_10
Nucleotide sequences of DBSCAN-SWA_10 >NZ_CP040884|124809:130654|DBSCAN-SWA CATGCAAAAGGAAAGACTAATGGCAGAAAACAAAGAGATCCCCTGGGAGAAAGAGCTCATTGAGAAGTACATGTTCACCCTGCATAAAGAGCAGGTGAAAGACCGGCGCTGGCGGACCATGTTGCGTGTGCTTCGCGCATCAGGTTTCGTGCTTCTCATGATCGGCTTCATCATTTTGGCATCTAATCCGGGTGGGATGCCGTGGCAAAGCGCCAAGGCTGGAGCCCCTCACACCGCGTATATCAACATCCGTGGTGAAATTGCTGCTGGCACACTGGCCGATGCTGATCACCTTATCCCGTCCATCCAAGCAGCATTCGACAACCCGAATTCACAAGCTGTCGTGCTTCGCATAAACAGCCCTGGCGGTAGCCCGGTTCAAGCAGGACGGATTTATGAAGAAGTGAAGGCGCAGCGAGCCCTTCATCCGGAGAAAAAGGTCTACGCCATCATTGATGACATCGGTGCCTCTGGCGGTTATTACATCGCCTCTGCTGCGGATGAAATCTATGCTGACCGCGCCAGTCTTGTCGGTTCTATCGGCGTCATTAGCTCAGGGTTTGGATTCACCGGCTTGATGGACAGGCTCGGCATCGAGCGCCGGGCAATCACTTCCGGAGAGCACAAAGCGCTTCTCGACCCATTCTCCCCTCTTACCTCTGATATGAAGAAATTCTGGGAGGGCGTTCTATCGAAAACCCACCAGCAGTTCATCGAACGAGTGAAGGCTGGGCGGGGTGATCGACTGAAAGACGCCCCAGAGGTGTTTTCTGGATTGCTCTGGAACGGGGAGCAGGCCAAAGAAATTGGGCTGATTGATGGCCTGGGGAGTTTGAACTCCGTGGCGCGAGACGTCATCCACCAGAGCAACTTGGTGGACTACACACCAACCGAAGACATCATCCGGCGACTGACCCAACGAGCGAAGCTCGAAGCCAGTTCCTTCGTGCAAGAACTCAGCGCTGTGAAAGTTTACTGATAGGAGCAGTTTGACATGACAGAACAAAGTACCAAATCACTACGTCTCGGGGTTTTTGCCGCCATTATTTTGGGGCTCGTAGGAACCGGGTTCGGAATCTACCAGCTCGTGAAAGAGAAGGATCTGGCGCAAGAGATCGCCAACGTGAAGTTCACGGTCAACCAGGTGAAAGATGCCGAAGGCGTCACCTTCAAAAGCAAGGCAGAGTTTGAGGCTGCTGTGGCCGAAAGCATCAATAAGTTTGTCGCGCAGAAACAACAAGCCGACATCGATCAGAAGTATGCCCAGTTCGAGGCGGCACCTGAAAAGGTCGAAGACGGCAAACACATCTATGGGGACCTTGGTGCCCGATTCACGCTGGTGGAGTTCTCTGATATGGAGTGCCCCTTCTGTAAGCAATTCCACGACACGCCCAAGCAAATTGTGGATGCCAGTAAAGGCAATGTGAATTGGCAGTGGAAGCACATGCCTCTCGACTTCCATAACCCGGCAGCTCACAAGGAAGCGTTGGCCGCTGAATGTATTGCTGAACAGAAGGGCAACCGTGGCTTCTGGGTCTTCGTGAATGAGATTTTCCATCACAGCAAAGGCAACGGTGCCGGTGTTTCTGACTTGGCCTCTGTTGTCACGGGTGTTGGTGCTGATCTGGATGCTTTCCGTGAGTGTCTCAGCTCTGGCAAGCACGAAGATAAAGTTCAAGCTGACATCCAGAAAGCCAAGAGTTACGGCGTTAATGGTACTCCAGCAACCTTTGTTGTAGACAACCAGACAGGCAAGAGTCAGCTACTCGGTGGTGCTCAACCGGCACAAGCCATCATGGCGGTGATGCGAAAAATGATGATTGAGTCGCAACAAGACGACTCAGCGAACCAATAAAAACTTTCACATGTGAAAGTGGAGTGAGAACTGATGATCAAGATTACAACTAAATTGGGGTGCCTTCTG
GCGGGGCTGCTTGTCCTGTCGGCTTGCTCAAGCGTCCCTCAGACAAGCAACGAATACACGAAAGCCCTGGATGATACCAAACAGGTATGTGCTGCCTGTGCTCTGGTTGGTAATGACCTGCTGGTTGCCCTCAACAAGTCATGCGACAAACCCATAACCCCGGAGACTCTAACCAGCGTCATGAACAGTAACTCGATGTTTGCCGCCATGATGGCAATTAACTCCATTGGTGGAACCGACCTTTATCAGGTTTACCGTGATGCGGCTATCGACACCCTGCGATGCAATGAGATGGACACTTGGCCTGAACGGACCAAGGTGCGTTTCCAGCAGCCCGACATGCAAAAGGCGCTGGCCTTGAGGGTTTCTGTCAGACAGCAGAATGCAAATTAACTTTCACATGTGAAAGTCAGGTCCCTCAGGGGCCTGATTTCACTATGAGGTGTACCAATGGAGTTACAAGAAGCAAAAAATGCTCTCGATAGCCTTCACCCCCACAAGGCCTCAGCCCCTTTGAGGCTTGTCATCCACCAGCCTGGAGGGATTGGTGGAACCCCTACCGTAGGGGTGAAAGCAATTCATGCTGGGTTTGATTGGGACAGCAACACCATCCTGATCTACCCGGAAGAGCAGCTCACCCGGTTGACGCCGGATGAGGTCGCCGCCATCACGAAGTCAGTATCGAAGGGACAGTCCTGGCATTCATATCAGCAGTTCAAGAAGTATCGGGAGCAGTTGGCCGAAGCCACGGAGGAAATTAATAGGCTCAGGGCTGAGCTGGGCAGGTATCAGAATAACGGGAGGGGGTAATGCTAAAACGCGGAATTATCAATCTCGCTGCTAGTTACATCATTGTTGATGCCCTACTCCGGAATGCAGCAATTTGGATCTTTGGCTTGTCCTTCTCCATTGGTGGCACTTACGTCGCTGGCGAGGCCAGTACATGGGGCGTTTACCTAGCCACTTCTGGTGCAATGACACTCTGTTCTGTCGTAACAGCCTACCTGCTCGTGACCTATCACCGCTGGGGGCTGATGACGGCCAGGGTCTGGCTGCTATTGAGTGCCTGCCTAAACGGCTATGCCGTCTATTTGAGCAGCCACAACATCCAGTTGGTGGTGGCACTGCTTTCCAGCCTGTTTATTGCTTTGTGGATGCTCAAGACCCTTGAGCAACCAGCAGTGAAGGGGACGTACAAGGTAATCGCTGATCTTCACCGTCAGCTATGGGGAATGTTGAAGGGACAAACACAATGACGACGAATACTCAAAACGCGAACGCCAACCAGGTTCGGTCTTTACGCGACATGCTTGTCCCCGCCCTGTTGTTCTATGTGGTGATGACGGCCATGTTTGTGGGTCTTGACGCATTCATGGATAAACCCACCAGCATGAACCTGCCATTTATGCCGTTCCTTGTCTCGATGGTGAGTTTTACCAGTGACGCACGTCGAGCGTGGGACTGGCGGAACGGGACCAAGGTTGTGGCCGTATTGACTGCGGTAGCCATGCTGTTGGCATTCATCTATCAACTCGCGGTCGGAGAGGTAAATCTCTTAGGGGTTGGTATATACCCGGCCACAGCCATCCTTCTGTTGGCAATCACCTGGGTGATTCGCGCTATCGGAAAGACGGCTCCTTTCCAGTTCCTGGGGAGACACCTGGCGCGGTTTGGTGCATCGAAGTGGGTCCAGCGTACCGCCGCAGTCATTGTGCTCGCGGGTGGCCTTGCCATCACTGTCTATGCCTACTGGCTTAACCACGGGAGCTGATGATGATCATTGCAACGAAGAGCGGCTTGCTGGTGGCCGCAGAACTAATCAAGGAAGAGGCCGGGTACTGGCTACTACAGCCTCGTGACCAAAAGACGCCGGTCAGAGTGAATAAGCAAGATGACAATAAACGCGCTTTCACGCATATGGGAGACGCCCTTCGCTGGGCAGGTGATCCTGAGCTTGCAAAGCAGTTCGATGCCGAGGGGGAAGAACATGCAAATTCGTG
ACTACATGACAAAGTTGTTTGAAGCATTTGGTGATGTAGAAGAAGTCACCCGAGAAATGCTTCTGGAGCAGGCGGAGCTCATTCATACGATCAGCGATAAGTGTCAGAGCACAGGCCTGTTTCTGGATAGTCAGGTTCGTTTCAACCAGTTCGTTCAAGAGATTGAGGCTGACGACAATGTAGAGGATCGGTTGCTTCATGCTTGGTGCTGGGTAATGGACCGAATAGTGAAGGCACCAACATCCTTTCACATGGATGGGGCTGTGATTTTGACAATGCCTCTGGTCGCCAGATACCTGCCACCAGTTGAACGGGAGCCGGAAACCATCGTGGTGAATCTCGATGAGGACTACAAGGCTCCTGTAGGCAACCAAACACTCTGCGAGCTCATTATGGAACGGAGGCATTGGCCGCAAGGTGCAACGTGCGCGACCCAAGAAGCGGATGGTGAAATCCTCTACTGGGACGCCCCGGTTCAGGTAGTAGAGGAAGGCAGAAAGGCCGCTGGCAAGCATGGCATGATGGCTGAAATAGGATTAAAGCATCAAGTAGACTTTTGGTTTTCTGACATGGCCGAAACTCGGCTCGCAACCGATTGGAACACCGCCGTCATCACACCTCACTGCTTGCTACTTTCCTATCTTGATGTGCTCCAAAAGAACAAAGTGCCGTTTGATGAGGGGGTGCGGCTCGCTGCCGAATGGGTAACGCAACTTGGTGGGGAGTCTCGTAAAGATACCGAGGAAGAGCCGGAAGCTGATGCTACGGTGCTTTCCCTTGGGCGAGCCACAGCTCATTGCTTTAAACCTTATCCGGACACACAAAATTTCTATTACGAGGCCTAAGCCTCACCCAAACAGGAGGATCAACATGCGATACCAGGTATTTAAAACGAAGGAAGGGGGCCTGCCGGTGTTTACTGCACCGTGGTACTGGTTGGCCTCTGCCATTGCTCACTGGTCATCTCTTAACTGGGATGCCTGTCGAATCGTAGACAGCAAGGCGGATAAAACAATGCTCTGCTGGGCCAAGGCTCTACCGGCTGCAAAAAAGATGGAGTGATTATGAGGTTCGGTTTTGCAAAAAAGACGGGGATTTCAGCGTTCCTATCAGCAATGCTGGTGCTGGCTCCTCATGTGTGGGCAGAGACCTTCACTGCAAAGGTTGTGGGGGTGTCTGATGGTGATACGGTCAAAGTGCTAACGGAACAAAGCTGCGATACCGGAAAAGACTGCCGGAGTGGCAAGATCCAGTACCGAGTAAGGCTCGCGGAGATCGATACTCCAGAGAAAAAGCAGCCATACGGCTCAAAGGCGAAGCAGGCCTTATCAGATCTGGTGTTTGGTCGAATGATCAAAGTGGAGCAAATCGACAAAGACCGTTATAGCCGCCTGGTTGCCAATCTCTATGTCGATGGCAAATGGGTCAATGCCGAAATGGTCCGTTCTGGGAGTGCGTGGGTGTACCGGCAGTACGCCAAAACACCGGAGCTGTTCAAGCTGGAGACCGAGGCCAAAGCCGATAAGCGAGGTCTCTGGGCATTACCGGAATCGGAGAGAACTCCCCCTTGGGAGTGGCGAAGAAAGCACTAACCACACCCCAGAGTATGCAGGTGTAAACAGTAACCAAGAAAACAAACAGGAATAACGATGAACAAAAGCGAACTGATTATGAAAGTGGCCGAAGACGCTGATATTAGCAAAGCAAAGGCCGAAGCTGCGGTAAATGCGCTGATCAACTCAGTGAAAGAGGTGCTTAAAGCGGGTGGGACAGTAGCGCTTACTGGGTTTGGTACTTTCCACGTTAAGGAACGCGCAGCGCGAACCGGACGGAATCCCCAGACTGGAGAGAACATCCAGATCGCGGCGGCCAACATTCCTGGGTTCAAAGCTGGTAAGGGGTTGAAAGACTCCGTGAACTAG
Protein sequences of DBSCAN-SWA_10 >NZ_CP040884|124809:130654|127541_127988_+|WP_000919343.1|DBSCAN-SWA MLKRGIINLAASYIIVDALLRNAAIWIFGLSFSIGGTYVAGEASTWGVYLATSGAMTLCSVVTAYLLVTYHRWGLMTARVWLLLSACLNGYAVYLSSHNIQLVVALLSSLFIALWMLKTLEQPAVKGTYKVIADLHRQLWGMLKGQTQ >NZ_CP040884|124809:130654|128502_128733_+|WP_000972665.1|DBSCAN-SWA MMIIATKSGLLVAAELIKEEAGYWLLQPRDQKTPVRVNKQDDNKRAFTHMGDALRWAGDPELAKQFDAEGEEHANS >NZ_CP040884|124809:130654|129796_130324_+|WP_004201083.1|DBSCAN-SWA MRFGFAKKTGISAFLSAMLVLAPHVWAETFTAKVVGVSDGDTVKVLTEQSCDTGKDCRSGKIQYRVRLAEIDTPEKKQPYGSKAKQALSDLVFGRMIKVEQIDKDRYSRLVANLYVDGKWVNAEMVRSGSAWVYRQYAKTPELFKLETEAKADKRGLWALPESERTPPWEWRRKH >NZ_CP040884|124809:130654|125802_126663_+|WP_004201087.1|DBSCAN-SWA MTEQSTKSLRLGVFAAIILGLVGTGFGIYQLVKEKDLAQEIANVKFTVNQVKDAEGVTFKSKAEFEAAVAESINKFVAQKQQADIDQKYAQFEAAPEKVEDGKHIYGDLGARFTLVEFSDMECPFCKQFHDTPKQIVDASKGNVNWQWKHMPLDFHNPAAHKEALAAECIAEQKGNRGFWVFVNEIFHHSKGNGAGVSDLASVVTGVGADLDAFRECLSSGKHEDKVQADIQKAKSYGVNGTPATFVVDNQTGKSQLLGGAQPAQAIMAVMRKMMIESQQDDSANQ >NZ_CP040884|124809:130654|126696_127125_+|WP_000591076.1|DBSCAN-SWA MIKITTKLGCLLAGLLVLSACSSVPQTSNEYTKALDDTKQVCAACALVGNDLLVALNKSCDKPITPETLTSVMNSNSMFAAMMAINSIGGTDLYQVYRDAAIDTLRCNEMDTWPERTKVRFQQPDMQKALALRVSVRQQNAN >NZ_CP040884|124809:130654|124809_125787_+|WP_001348528.1|DBSCAN-SWA MQKERLMAENKEIPWEKELIEKYMFTLHKEQVKDRRWRTMLRVLRASGFVLLMIGFIILASNPGGMPWQSAKAGAPHTAYINIRGEIAAGTLADADHLIPSIQAAFDNPNSQAVVLRINSPGGSPVQAGRIYEEVKAQRALHPEKKVYAIIDDIGASGGYYIASAADEIYADRASLVGSIGVISSGFGFTGLMDRLGIERRAITSGEHKALLDPFSPLTSDMKKFWEGVLSKTHQQFIERVKAGRGDRLKDAPEVFSGLLWNGEQAKEIGLIDGLGSLNSVARDVIHQSNLVDYTPTEDIIRRLTQRAKLEASSFVQELSAVKVY >NZ_CP040884|124809:130654|128719_129577_+|WP_001167036.1|DBSCAN-SWA MQIRDYMTKLFEAFGDVEEVTREMLLEQAELIHTISDKCQSTGLFLDSQVRFNQFVQEIEADDNVEDRLLHAWCWVMDRIVKAPTSFHMDGAVILTMPLVARYLPPVEREPETIVVNLDEDYKAPVGNQTLCELIMERRHWPQGATCATQEADGEILYWDAPVQVVEEGRKAAGKHGMMAEIGLKHQVDFWFSDMAETRLATDWNTAVITPHCLLLSYLDVLQKNKVPFDEGVRLAAEWVTQLGGESRKDTEEEPEADATVLSLGRATAHCFKPYPDTQNFYYEA 
>NZ_CP040884|124809:130654|129602_129794_+|WP_001270409.1|DBSCAN-SWA MRYQVFKTKEGGLPVFTAPWYWLASAIAHWSSLNWDACRIVDSKADKTMLCWAKALPAAKKME >NZ_CP040884|124809:130654|127984_128503_+|WP_000210757.1|DBSCAN-SWA MTTNTQNANANQVRSLRDMLVPALLFYVVMTAMFVGLDAFMDKPTSMNLPFMPFLVSMVSFTSDARRAWDWRNGTKVVAVLTAVAMLLAFIYQLAVGEVNLLGVGIYPATAILLLAITWVIRAIGKTAPFQFLGRHLARFGASKWVQRTAAVIVLAGGLAITVYAYWLNHGS >NZ_CP040884|124809:130654|130381_130654_+|WP_001043046.1|DBSCAN-SWA MNKSELIMKVAEDADISKAKAEAAVNALINSVKEVLKAGGTVALTGFGTFHVKERAARTGRNPQTGENIQIAAANIPGFKAGKGLKDSVN >NZ_CP040884|124809:130654|127182_127542_+|WP_000422769.1|DBSCAN-SWA MELQEAKNALDSLHPHKASAPLRLVIHQPGGIGGTPTVGVKAIHAGFDWDSNTILIYPEEQLTRLTPDEVAAITKSVSKGQSWHSYQQFKKYREQLAEATEEINRLRAELGRYQNNGRG |
11 | Wolbachia_phage(25.0%) | NA | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those matched by specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_11 |
134284 : 138277
Sequences of DBSCAN-SWA_11
Nucleotide sequences of DBSCAN-SWA_11 >NZ_CP040884|134284:138277|DBSCAN-SWA CATGTACAGATACCGTCTGGAAAGGGATGTGCAACCCGAAGGCCTTGTGTTCGGTTATTTTGGTGTAAACGGCTCTACGGCCACGGCCACCGAGGACGATCACACGGTCAGAAAGTGGATAGGGTTTACCAAGGTCAACGGAGGGAGGCGATTCATAGTCGGTAATGCTTTTGCTTTTCGGGCGACCGACGTTCGAGAGCTGGCTACGGCGGTTGATCCTGTCGGGCCTGAAAATGAAATCCACTTGGAAAGGATTATTCGGGATGCTGATGTCTTGGTGCCCTGCTGGGGGAGCCGAACTAAGCTGCCTAAGTCTTTGCATGTTCATTTGGACAGGCTTTTAGAGCAGCTCGTTGCATCTGGGAAGCCGGTATTGGCATTTGGCGTGACTGGTTCGGGGGACCCGAAGCACCCACTAATGCTTGGATATTCGACAAAACTTGTGCCCTGGGGAGGAAAATAATTATGTCGATGCTTGAAGCACGTTACTTCGTGGCAAAGATAAGCGATGCACAGGCCGTGCTCTGCGATGAGGAGCTGGCGACACTGGAAAGGCTGATCCGGAAGGTTGATGATGGCCGCAGAGCAAATGGCAAGAGCTCGCTTACTTGCGTTGTTGTCGAAGAAGATTGGCCGAACTGGCAGCAAACGGTTGACTCTGTACTTTCACTTGCTGACGGGAAAGACAACGATTGGACCAATGCCACACCAGAACAAATCAAGGCCTTTTGGGTCGATGACTCAGTTTGGAAAACTCTGGATGGCCGGGATAAATGGATTGAAGATCTCGACCTGCTGGTGGACGGTTCCCCGGTGGCTTCTGAGTGGGAACCTTTTGGCTTGGAATCAGGGCAGTCTGTCGTCATACGAGGCGGATGGATAGAAGGGAATGCCTTGAGTGATGATGGCGTACCGCTAGTGCAGGCATTCGTCGCTTGGAAACAGGGCCAAGACAACCACTAAAGGAAAATTACTTCGGAAGGGCGGCTATGGCCGCTTTCTTTTTAATATTTACAGTATCACATAATGATACTATGATGTTGTTAATTTACTTGGGGAGGCGCTTATGCTCACTGCTAAATGTATAGGGTGCGGATGCACCGATGATCACGCTTGTGTTGAAAACGGTCAACCGTGTCACTGGCTTCGCGTGAATAGAGTGGAAGGTATCGGGGTCTGCTCTTCGTGCCCGGATGCTCTCAATCAATTCGCATCGACCAGCCAGGAACAAGCAACTGGACAGGATGACTACCGGAACAGTCCAGAATGGAAGGATTTTTCATCACGCATTGGTAATGCACTATGCGGGGGCCGGAGCAAAGCAAAGTAAGGTAAAAACGGCTATGCCGTTTTTTATTTGTCTATTACGGCATCACATTATAAGCCTGTAAATGCTGAATCTGTAATTGCCCAAAATGGGCGTGAGCAGGGGCGGGGTGTGAGGTCGTTCTAAGGCAAAATACTTTGTACTGGCAGAACAAAGAAAGAATGAACTTTCACATGTGAAAGTTTTTGGGGTACGGGATGGAACAAGCAATTCAAAGTTACTTGGCTGACGACCGGCAATATCAAGACAGGATTACCGCAGCTCTCAGCCAGGTTGAAGAGAAAGGTGCAGAGTACGAAGCTCTTTGCCAACAGCGTGCTCAGTTGGGGATCTGGCAAAAAATTATAACGTTCTGGCAGTTTCGTCGAGACATCGCCGTTATACGTTCAGCACTAAAAGGCCATAACAGCGATTTGCGGTATTTGCGCCGTGGGAGAGACCAGCTCAAAGAGGGATTGGTTTCTCGGGCCGTGAAGCAAGCGATTGACGGTAGCCAGATTCTGGAACGAATCACCCAAGCTCAAGATAGGCTTGACGCCGCATCTCGACTCCACGAGAGCAACAAGCGACTGGTGGACATGGGGCAAAAAGCCTTAC
GTGAAATCAGTGAGGCCTCTTCCAGTATTTCATCAGCACAAACAATGGAGGTTCTCGACCTTGTGACTGACAACAAGGGCATTTCCGTCATGTCGTCGATGTCAAACTCCTCGGCCAGCAGTGAAATAGACGATGCGAAGAGAGCTGTGAAAGCCTTCGCAAATGCGCTGGGGGATCATCGTGACATTGTGGGCTCACTGCACCACTCAATGGCAACTGAGTTTATCGATCTGGGTATGGACTTTGCCGGTTTGAATGATGGTTTCGACTTTGGGAGCGTCTTCTCGCTGTTCAGCTTGTCGTCTGCCAGTTCCTCTCTGGACAAAGTAGAGTCTCGCGTTGAATCTCTGATGCCAGATCTAAGACGAGCCGCCTCCAATTCGGCAGCAGAGTATGCCCGTGTTAACGAAGAGTTCTTTGGCCTAAAACAACAGGCCTGTTGCCAAGTGCATGAGTTGCTGGTGACAAATGGTATAGACGTGTCAGTCAAGCGCGTTGAGTCCGCCGTGAACAGCTACAGAGTTGGAAGGTGATATTTACAGCATCATAAAATGAACTTATGATGTGAGGTGATCAACAAGTCGAAATTTATTTAAAGGATAAGCGATGAGCGATATTACACAGCAAATGGACAAACTTGAGATTCCAATCAAGTTGTCCTTTCCAGTCATCAACGTTAGCACTTTTGAGCTGGGCCGTGCAGAGAGCGTTTTCTCTGACATTGCAAAGAAAGTCGGCAAGCACTTTATTGTGATGCCATTCAAGAAGCTGCCCGATCCAGGCACGATGAAGGCAATGGTGGACGAGTCCAAGAAGTCCTCCAAAAACGGTGTTGTCGTGTTCGACACTTTCTTTTTCGACCGACAACGTGCAAACCCGGAAACACTCCCAGCCCTCAAGTCATCACTGACCTATCTGGAGAATGAGGGGATCAACTACATCATCGCTGGCAAAGACGTCTTCAATGAAGAGTTCGTTTATCACATCGATCTCCCGGCTATGAGCAATCAGGAAATCCTGAAACTGCTCCAGACCTGTGAAGATAACGTGAAAGATGGTGGAGTCTTCGAGAGCAACGAACGTGCTGTCATCGCAAACCACGCCCTGGGCTTGTCACACACCCAGATGAAGAACGTCTTCACCTATTCCGCTTACTTGAAATTCAAGGGTGAAGAATACCTGGGCGAGATCCGAAAAGAAAAAGCTCACATCTTGCGTGATGTCGGCCTCGATGTGCTTGAGGCCATTGATATTGGGAATGTCGGTGGGCTCGAAAACCTCAAGGAGTTTCTCCAGATACGTAAAGCCGGTTGGGACAAAGACCTTCCGGTAAAAGGTGTCCTTCTGGCCGGTGTACCTGGTGGTGGTAAATCGCTGACGGCAAAAGCCGCTGCCGGTGTACTTGGCACTACCTTGGTTCGCCTGGATATGGGCCGTTTCTATAGTAAGTATCTCGGTGAAACCGAGCGCCAGTTCAATCGTGCATTGCAGACCATTGAGCAGATCGCACCCGTTGTTGTGTTGATTGACGAGATGGAGAAGTTTTTTGGTAATGCCGATGGCGAACACGAAGTATCCAAGCGCCTGCTGGGCTCTTTCCTCTACTGGCTTCAAGAGCGCAAGGAGAAGATCTTCATTGTGGCGACGGCCAACCGGGTTCAGTCGCTGCCTCCTGAATTGATGCGAGCTGGCCGCTGGGACCGAGCATTCTTCATTGATCTGCCAAGTGTGGCTGAGCGCCAGAAGATTTTCGAGATCCACCTCGCCAAGCAGAAGGCCAACATCGCCGCGTTCGATATGCCTACGCTACTGCGTACCACCGAGGGATACACCGGGGCAGAGATTGAACAGGCCGTCATTGACGCGATGTATCTGGCGAACGCTCAGGACAAAGAGCTCAACAATGAAGCGCTGGTGGATGCGGTCACTCGCATTACCCCGACCAGTGAAACTCGCCGAGAAGACATCAACCAGATTCGCAGTTTGCGGGATCAAGGCTT
CTATCCGGCCAATAACTTCGATGTTCAAGAGCAGAATGGCTCTGGACGAAAACTCGCCATCGAGGACTAA
Protein sequences of DBSCAN-SWA_11 >NZ_CP040884|134284:138277|135807_136740_+|WP_000434070.1|DBSCAN-SWA MEQAIQSYLADDRQYQDRITAALSQVEEKGAEYEALCQQRAQLGIWQKIITFWQFRRDIAVIRSALKGHNSDLRYLRRGRDQLKEGLVSRAVKQAIDGSQILERITQAQDRLDAASRLHESNKRLVDMGQKALREISEASSSISSAQTMEVLDLVTDNKGISVMSSMSNSSASSEIDDAKRAVKAFANALGDHRDIVGSLHHSMATEFIDLGMDFAGLNDGFDFGSVFSLFSLSSASSSLDKVESRVESLMPDLRRAASNSAAEYARVNEEFFGLKQQACCQVHELLVTNGIDVSVKRVESAVNSYRVGR >NZ_CP040884|134284:138277|136813_138277_+|WP_004201081.1|DBSCAN-SWA MSDITQQMDKLEIPIKLSFPVINVSTFELGRAESVFSDIAKKVGKHFIVMPFKKLPDPGTMKAMVDESKKSSKNGVVVFDTFFFDRQRANPETLPALKSSLTYLENEGINYIIAGKDVFNEEFVYHIDLPAMSNQEILKLLQTCEDNVKDGGVFESNERAVIANHALGLSHTQMKNVFTYSAYLKFKGEEYLGEIRKEKAHILRDVGLDVLEAIDIGNVGGLENLKEFLQIRKAGWDKDLPVKGVLLAGVPGGGKSLTAKAAAGVLGTTLVRLDMGRFYSKYLGETERQFNRALQTIEQIAPVVVLIDEMEKFFGNADGEHEVSKRLLGSFLYWLQERKEKIFIVATANRVQSLPPELMRAGRWDRAFFIDLPSVAERQKIFEIHLAKQKANIAAFDMPTLLRTTEGYTGAEIEQAVIDAMYLANAQDKELNNEALVDAVTRITPTSETRREDINQIRSLRDQGFYPANNFDVQEQNGSGRKLAIED >NZ_CP040884|134284:138277|134284_134746_+|WP_000286591.1|DBSCAN-SWA MYRYRLERDVQPEGLVFGYFGVNGSTATATEDDHTVRKWIGFTKVNGGRRFIVGNAFAFRATDVRELATAVDPVGPENEIHLERIIRDADVLVPCWGSRTKLPKSLHVHLDRLLEQLVASGKPVLAFGVTGSGDPKHPLMLGYSTKLVPWGGK >NZ_CP040884|134284:138277|134748_135246_+|WP_000062185.1|DBSCAN-SWA MSMLEARYFVAKISDAQAVLCDEELATLERLIRKVDDGRRANGKSSLTCVVVEEDWPNWQQTVDSVLSLADGKDNDWTNATPEQIKAFWVDDSVWKTLDGRDKWIEDLDLLVDGSPVASEWEPFGLESGQSVVIRGGWIEGNALSDDGVPLVQAFVAWKQGQDNH >NZ_CP040884|134284:138277|135349_135613_+|WP_000954380.1|DBSCAN-SWA MLTAKCIGCGCTDDHACVENGQPCHWLRVNRVEGIGVCSSCPDALNQFASTSQEQATGQDDYRNSPEWKDFSSRIGNALCGGRSKAK |
5 | Pseudomonas_phage(50.0%) | NA | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those matched by specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_12 |
142737 : 143676
Sequences of DBSCAN-SWA_12
Nucleotide sequences of DBSCAN-SWA_12 >NZ_CP040884|142737:143676|DBSCAN-SWA AATGGCTGTTATCAACTCACTCCGCGCTCTCAAAGGCGTAACCGCCCCCGAAGAACTGAAAAAAGGGGATGGATTCACTATCTCTCCTCAGCTTCTGCTGGAGGAAGAAGGGTTTAACACCCGTGGCGCTTTCTGCGAGGACTACTACGAGCGCCCGGACATCAAAGCTGGTATTCGAGTGCTGGCCGATGCTTACAAGCGTGGCGACTATGTTCCGCCGATCATCGTCAAAGTTATCGACGGAAAGGTGTATGTCCGTGAAGGTCATCGCCGTCGCCGCGCCATCCTGCTTGCTATAGAGGAAGGTGCCGACATCCAGTTCGTGCAGGTCGTAGAGCACAAGGGCGATGAAGCCGAACAGAGCCTTCTGATCGCCACCAGCAACGATGGGCTCCCTCTTTCTCCACTTGAGCGAGCCGTGATCTACGCTCGGCTTGCAAACTGGGGGTGGAGCGACCAGATGATTGCCCAACGTGTTGGCCGCTCGGCTGAGCACGTTCGTATCGCCCGTGCCCTTTTGGAGATGCCTCTGGAACTGAAACGGATGATTCAGGAAGGCTCTGTGGCTGCCACCTACGCTCAGGAGCTCTACAACGAGCACGGCACCAACGCTGTTGAGATCCTGAAAAAGGCGCAGGAAGAACAGGCCAGTGGCAATGACGGCAAGAAGGCCCCGAAAAAACTGACCAAAAAGTCTGTCGAGAAAGGTCCCCGCCTGGGCAAAAAGGTGGTTGAAGCCATGCACCGAGGCGTGAGCTCTATTACCAGTCGTCTGGACAACATCAAGCCGAACGATGACGGTGAAACGTTCACCCTTACCCTGAGCCGGGAAGATGTTGATGCGTTTCAGGAGCTGAAAGCCAAGCTGGCGGAGCTGGAGCCCAAAACTGACGAGTCCAATGAAGACCAGCAAGAGCTGGATTTGGCGGGTAATCAGTAA
Protein sequences of DBSCAN-SWA_12 >NZ_CP040884|142737:143676|142737_143676_+|WP_000268394.1|DBSCAN-SWA MAVINSLRALKGVTAPEELKKGDGFTISPQLLLEEEGFNTRGAFCEDYYERPDIKAGIRVLADAYKRGDYVPPIIVKVIDGKVYVREGHRRRRAILLAIEEGADIQFVQVVEHKGDEAEQSLLIATSNDGLPLSPLERAVIYARLANWGWSDQMIAQRVGRSAEHVRIARALLEMPLELKRMIQEGSVAATYAQELYNEHGTNAVEILKKAQEEQASGNDGKKAPKKLTKKSVEKGPRLGKKVVEAMHRGVSSITSRLDNIKPNDDGETFTLTLSREDVDAFQELKAKLAELEPKTDESNEDQQELDLAGNQ |
1 | Yersinia_phage(100.0%) | NA | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those matched by specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
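The sequence records in the prophage table above carry pipe-delimited headers, for example `>NZ_CP040884|112403:116150|113219_113828_+|WP_000891157.1|DBSCAN-SWA`. The following is a minimal sketch of how such a header could be split into fields. The field meanings (contig, region `start:end`, protein `start_end_strand`, protein accession) are inferred from the records shown here, not from any DBSCAN-SWA documentation, and the names `SwaHeader`, `parse_swa_header` and `protein_in_region` are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SwaHeader:
    """Fields of one header line (layout inferred from the records above)."""
    contig: str
    region_start: int
    region_end: int
    protein_start: Optional[int] = None
    protein_end: Optional[int] = None
    strand: Optional[str] = None
    accession: Optional[str] = None


def parse_swa_header(header: str) -> SwaHeader:
    """Split a '>contig|start:end|...|DBSCAN-SWA' header into typed fields."""
    fields = header.strip().lstrip(">").split("|")
    contig = fields[0]
    region_start, region_end = (int(x) for x in fields[1].split(":"))
    if len(fields) == 3:
        # Nucleotide record: contig|start:end|DBSCAN-SWA
        return SwaHeader(contig, region_start, region_end)
    # Protein record: contig|start:end|pstart_pend_strand|accession|DBSCAN-SWA
    p_start, p_end, strand = fields[2].split("_")
    return SwaHeader(contig, region_start, region_end,
                     int(p_start), int(p_end), strand, fields[3])


def protein_in_region(h: SwaHeader) -> bool:
    """True if the protein coordinates lie inside the prophage region."""
    if h.protein_start is None or h.protein_end is None:
        return False
    return h.region_start <= h.protein_start and h.protein_end <= h.region_end


if __name__ == "__main__":
    h = parse_swa_header(
        ">NZ_CP040884|112403:116150|113219_113828_+|WP_000891157.1|DBSCAN-SWA"
    )
    print(h.accession, h.strand, protein_in_region(h))
```

A check like `protein_in_region` is only a sanity test on the coordinates as printed; it makes no assumption about whether the start positions are 0-based or 1-based.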
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|