Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_014532 | Halomonas elongata DSM 2581, complete genome | 4 CRISPRs | cas3f,cas8f,cas5f,cas7f,cas6f,csa3,DEDDh,DinG,WYL,cas3 | 0 | 3 | 6 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_014532_1 | 126711-127158 | TypeI-F |
I-F
Consensus repeat of NC_014532_1
|
7 spacers
spacers of NC_014532_1
>1.1|126739|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT TCCCGGCGGACGGAAAGCTTGGCAGACCAGCG >1.2|126799|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT AAGGCATAAAGATGAATACATTGAGCTCCCAT >1.3|126859|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT CAGTGCAGTTGCGAGATCTGTTTGCCATCGTT >1.4|126919|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT GTAAGCGCCGCATGCTGTGGGCGTCACGCCCT >1.5|126979|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT TTGCGTCTCGAAGTCCTCAAAGCGCGTAGCAT >1.6|127039|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT ACATTGCAGCCGTGACCATTCTTGCTGTGATC >1.7|127099|32|NC_014532|CRISPRCasFinder,CRT CAGCCGTCCTGGCTGTAGTCGCTCAGTTGGCA |
cas6f,cas7f,cas5f,cas8f,cas3f,cas1 |
CRISPR arrays and Neighbor proteins around NC_014532_1
The CRISPR arrays of NC_014532_1 >merge|NC_014532|1|126711-127158|PILER-CR,CRISPRCasFinder,CRT GTTCGCTGCCGCCCAGGCAGCTCAGAAATCCCGGCGGACGGAAAGCTTGGCAGACCAGCGGTTCGCTGCCGCCCAGGCAGCTCAGAAAAAGGCATAAAGATGAATACATTGAGCTCCCATGTTCGCTGCCGCCCAGGCAGCTCAGAAACAGTGCAGTTGCGAGATCTGTTTGCCATCGTTGTTCGCTGCCGCCCAGGCAGCTCAGAAAGTAAGCGCCGCATGCTGTGGGCGTCACGCCCTGTTCGCTGCCGCCCAGGCAGCTCAGAAATTGCGTCTCGAAGTCCTCAAAGCGCGTAGCATGTTCGCTGCCGCCCAGGCAGCTCAGAAAACATTGCAGCCGTGACCATTCTTGCTGTGATCGTTCGCTGCCGCCCAGGCAGCTCAGAAACAGCCGTCCTGGCTGTAGTCGCTCAGTTGGCAGTTCGCTGCCGCCCAGGCAGCTCACTCC >NC_014532|1|1|126711-127098|PILER-CR GTTCGCTGCCGCCCAGGCAGCTCAGAAA TCCCGGCGGACGGAAAGCTTGGCAGACCAGCG GTTCGCTGCCGCCCAGGCAGCTCAGAAA AAGGCATAAAGATGAATACATTGAGCTCCCAT GTTCGCTGCCGCCCAGGCAGCTCAGAAA CAGTGCAGTTGCGAGATCTGTTTGCCATCGTT GTTCGCTGCCGCCCAGGCAGCTCAGAAA GTAAGCGCCGCATGCTGTGGGCGTCACGCCCT GTTCGCTGCCGCCCAGGCAGCTCAGAAA TTGCGTCTCGAAGTCCTCAAAGCGCGTAGCAT GTTCGCTGCCGCCCAGGCAGCTCAGAAA ACATTGCAGCCGTGACCATTCTTGCTGTGATC GTTCGCTGCCGCCCAGGCAGCTCAGAAA >NC_014532|1|1|126711-127158|CRISPRCasFinder GTTCGCTGCCGCCCAGGCAGCTCAGAAA TCCCGGCGGACGGAAAGCTTGGCAGACCAGCG GTTCGCTGCCGCCCAGGCAGCTCAGAAA AAGGCATAAAGATGAATACATTGAGCTCCCAT GTTCGCTGCCGCCCAGGCAGCTCAGAAA CAGTGCAGTTGCGAGATCTGTTTGCCATCGTT GTTCGCTGCCGCCCAGGCAGCTCAGAAA GTAAGCGCCGCATGCTGTGGGCGTCACGCCCT GTTCGCTGCCGCCCAGGCAGCTCAGAAA TTGCGTCTCGAAGTCCTCAAAGCGCGTAGCAT GTTCGCTGCCGCCCAGGCAGCTCAGAAA ACATTGCAGCCGTGACCATTCTTGCTGTGATC GTTCGCTGCCGCCCAGGCAGCTCAGAAA CAGCCGTCCTGGCTGTAGTCGCTCAGTTGGCA GTTCGCTGCCGCCCAGGCAGCTCACTCC >NC_014532|1|1|126711-127158|CRT GTTCGCTGCCGCCCAGGCAGCTCAGAAA TCCCGGCGGACGGAAAGCTTGGCAGACCAGCG GTTCGCTGCCGCCCAGGCAGCTCAGAAA AAGGCATAAAGATGAATACATTGAGCTCCCAT GTTCGCTGCCGCCCAGGCAGCTCAGAAA CAGTGCAGTTGCGAGATCTGTTTGCCATCGTT GTTCGCTGCCGCCCAGGCAGCTCAGAAA GTAAGCGCCGCATGCTGTGGGCGTCACGCCCT GTTCGCTGCCGCCCAGGCAGCTCAGAAA TTGCGTCTCGAAGTCCTCAAAGCGCGTAGCAT GTTCGCTGCCGCCCAGGCAGCTCAGAAA ACATTGCAGCCGTGACCATTCTTGCTGTGATC GTTCGCTGCCGCCCAGGCAGCTCAGAAA CAGCCGTCCTGGCTGTAGTCGCTCAGTTGGCA GTTCGCTGCCGCCCAGGCAGCTCACTCC
>NC_014532.2|WP_109637282.1|126011_126581_+|type-I-F-CRISPR-associated-endoribonuclease-Cas6/Csy4 MDHYLDIRLRPDPEFPASMLMNALYSKLHRALYDLGADDIGISLPDHKTGVRTRTPGDRLRLHARKERLEQLMALSWLAGMRDHVETTDIAPVPAAARHCRVTRRQFNTGGPSRVKRYARRHDISEDEARQCMSVPAKRKISLPFVQVNSRSSGQRFALFIEHGELQDAPMTGHFNHYGLSREATVPWF >NC_014532.2|WP_013330865.1|124976_126008_+|type-I-F-CRISPR-associated-protein-Csy3 MAKKDDALKTASVLAFERKLDPSDALLYAGQWSQRDALEDWQAVSVREKSVRGTISNRLKAKEQDPAKLDAAIENPNLQTVDVATLSHDADTLMARFTLRVLGGAGTPSACNNADYQAKLQQAVADYVKTEGFSELARRYAHNLANGRFLWRNRVGAEQVEVRIRRMEKGHVDKEWTFDALSLSLRGFESDTHTAELAQDIADALAGERHLILEVIAFAHVGNGQEVFPSQELILERGRGDKSKTLYSVDGVAAIHSQKLGNAIRTIDTWYPAPEDGSDLGPIAVEPYGSVTTQGTAYRQPKQKVDFYSLLDNWMLKDQVPETEQQHFVMATLIRGGVFGEAG >NC_014532.2|WP_013330864.1|123999_124959_+|type-I-F-CRISPR-associated-protein-Csy2 MSDVKNLLVLPRLRVQNANAISSPMTWGFPAMSAFVGMMHALERKLVEADIQVSLDRVGVVCHDTEAQATEGGYTRAFHLTRNPVDKAGNTAAIVEEGRIHLDITLIFAIAGKVVEGERQDIAHQISEMVAGMRVAGGSVMPNRSVAANYQKSAWVALDDEPSEREKQFKKLKRRWLPGFSLVLRDDRLAEHTRTLQAQDENATALDAWLDLSRLNHECHVDPESEEVRWQVRRPYRGWLVPMPVGYGAISKQFEPGSVENARDTQIPFRFVESIYSIGEWISPHRLTCPEDMLWYVDNDLDAGLYRLNNDYVQRAHRA >NC_014532.2|WP_013330863.1|122630_124007_+|type-I-F-CRISPR-associated-protein-Csy1 MPTMSGGWQELRELIEAFLKDRFDTKAEKLATDDPKYQALVEQFQRDSWLQDAARRVSQLQVVTHSLKPIHPDAKGSNLYTPPEFLSKHQGVGSHLLPADFDGDVVGNAAALDVYKFLKIEYDGKSLLERVLEGDTELARALSDNSEQSQAWMKAFAGITEPRGEHASHTRAKQVYWLTGDDAVDDGDFHLLAPLYATSLAHQVFQTINRDRFSDEAKEARKAKREGKLGEQEVHDYPNIASQKMGGTKPQNISQLNSERGGNNYLLASLPPAWSSRDIRPPLKADSVLSRGGMFGRRKEVRALVGDLKRFLETNPNIDMHTRDLRDDYTAMIMDELVLFTMQMHSLEPGWSADENCRLAEEEVFWLDPGRAQEDAEFRKARHAADWPDEIRQRFANWLNEALGGKLPLGDVEFRHWKKELGQDASHQRLLDKDRRWMAALVEELDELEELKGREDDE >NC_014532.2|WP_013330862.1|118896_122238_+|type-I-F-CRISPR-associated-helicase-Cas3 
MNVLLVSQCNKNALKETRRILDQFAERRGERTWQTPITLEGLDTLRKLLRKTARKNTAVACHWIRGHDHSELVWIVGDAKRFNPQGAVPTNTTSRDILRREDENDWHSGEDILLLASMSALLHDLGKASAAFQQRLQGKLEGRNLYRHEWVSLRLFQAFVGDDDDATWLARLAEGRYQEADWLENMERDGLDAEPAAIFRRLPPLAAALGWLIVSHHRLPLKPPDDLSNPATRWGSQNVTLQMEQLGDVLSLIDAQWNEISTTQDPQRITPYWTFHHGLPLSTWRWRERAAKQAKRLLARLSDDTNWLDSPYVMHLSRLCLMLADHHYSSLEDPARRVQGEPGYPLVANTVRKTGQPNQLLDEHILGVEKHASTIARALPTVESHLSRLARHKGFRKRSEHPRFRWQDQAFDLAQSLRERSQCQGFFGINMASTGCGKTLANGRIMYALADPEQGARFSIALGLRTLTLQTGQALRERLHLGEDELAVRVGGSANKALFEHYEKDAEASGSASTQGLIDEDAHVVFEGHIDTHPVLRRLGDEPGTRSLIAAPILVCTVDHLVPATESTRGGRQIAPMLRLMSSDLVLDEIDDFDINDLPALTRLVHWAGLLGSRVLLSSATLPPALVHGLYEAYRSGREAYQHHRGEPGQPVNICCAWFDENDRQHQDCADGDAFSAAHAAFATRRLKRLAENPVRRRGELLSLPALGKRPEEIRPGLAELLRTQAAQLHERHHSRDPVTGKRVSFGLIRMANIDPLVDVARSLFQQGACSGQRIHLCVYHSRHPLVMRSEIERRLDRTLQRAEPERVFEQADIRRLLDGSDEADHLFIVLGSPVTEVGRDHDYDWAIVEPSSMRSIIQLAGRVWRHRDKPCDTPNIQLLDTNLKHLEDPGRLAFQRPGFETDEAWRLNHHSLNALLVPDEYQIIDARPRVLARETLFPRDSLVDLEHQRLVTQMLEPPTQPLTKKERRLGMEPSPPPLGAYSWYAVPRMHLTGVLSQRQRFRQPTQTDVALALLPDEGGGTWTLHRVEDGAKRGETLYVAVEESLMARIDLEHEQGERIQPWGADDYLTALADLAEDLDIPLDKAARTFGIVSAPESTHGWRYHPVLGFVKK >NC_014532.2|WP_041601826.1|117922_118900_+|type-I-F-CRISPR-associated-endonuclease-Cas1 MDDLSPSDLKTILHSKRANLYYLQHCRVLVNGGRVEYVTDEGKQSRYWNIPIANTTSLLLGTGTSITQAAMRELAKAGVLVGFCGGGGTPLFAANEVDVDVAWLTPQSEYRPTEYLQYWVRFWFDDEKRLDAARCFQLARLDRIEHLWGESRFQRDTGFTPSKTELKALLTSSREAIGQAVDTTALLTEEARLTKKLFRLASHATDYGDFTRVKRGQGVDPANRFLDHGNYLAYGLAATATWVLGIPHGLAVLHGKTRRGGLVFDVADLVKDAIILPQAFLSAMKGDEEQEFRQACIERLTRTESLDFMIDTLKAVALDLGGPEA >NC_014532.2|WP_013330860.1|116154_117660_+|YifB-family-Mg-chelatase-like-AAA-ATPase 
MTLAIIRTRAGLGLEAPEVLVEVHLTNGLPGITLVGLPETAVKESRERVRSALVNAGFEFPLRRITLNLAPADLPKDGGRFDLPIALGLLVASGQIPPEALAEVECVGELALDGGLRPASGVLPLAMATRQAGRRLIVPRANADEAALAGDLEVLPAEHLLEVVAHLLGQETIAAHRLQAPPRRDTSEPDLREVRGQHQARRALEVAAAGGHNLLFAGPPGTGKTMLASRLPGILPPLGEDEALEVAAVRSVSGLPLAEQWGRRPFRAPHHTASAVALVGGGSRPKPGEISLAHHGVLFLDELPEFSRQVLEVMREPMESGQIHIARANHERRYPARFQLVAAMNPCPCGHLGDPRQACHCTAAQIQRYQARLSGPLLDRIDLQVEVPALPAEQLTSRESGEDSATVRERVLAARERQWSRGALNAYLAGPDLEAACALGADDRAWLAEVLERLQLSARAFHRVLRVALTLADLAGAPRPTREHLIEAIGYRQLDRLLKGG >NC_014532.2|WP_013330859.1|115720_116062_+|accessory-factor-UbiK-family-protein MVSQDRISRLAQQIGERLQGASQAPEDVQKGVQQVVKGAFDRLELVSREDFDILMDVLQRTRGRVEALEKQVAALEEALDASAAADEDAEEVREAEVGSDSPEEDAGAGETGR >NC_014532.2|WP_013330858.1|115089_115428_-|P-II-family-nitrogen-regulator MKLITAVIKPFKLDDVREALADNGVQGITVTEVKGFGRQKGHTELYRGAEYVVDFLPKVKVEVAVDDDRLDTVLDAICNAANSGKIGDGKVFVTPLEDVIRIRTGERGADAV >NC_014532.2|WP_013330857.1|113802_115044_-|ammonium-transporter MTELAYALDTFYFLVCGALVMWMAAGFSMLEAGLVRSKNTAEILTKNIALFAIACTMYLLVGYYLMYSSSAGGILPSLGFLLGGENSVDAVMAGGDDAPYYSARADFFFQVVFVATAMSIVSGAVAERMKLWAFLIFSVILTGFIYPVSGYWTWGGGWLAEIGFSDYAGSGIVHMAGASAALAGVLVLGPRKGKYGKDGAIYAIPGANMPLATLGTFILWLGWFGFNGGSELKVSDVTSANNMAQVLVNTNAAAAGGVIAALILAKAWFRKADLTMALNGAIAGLVSITADPLSPSALGATLIGGFGGLLVVVSIVCLDKLKLDDPVGAISAHGVVGIWGVLAVPLSNGEASFGAQIIGIFGIFVWVFVASLIVWLILKAVMGIRVSEEEEYEGVDLAECGLEAYPEFNVAKK >NC_014532.2|WP_013330867.1|128216_128837_+|DUF4202-domain-containing-protein MSASSAYQRALDALDALHAEDPRRVEVEGQSSSKELVPKELMPKELWHAGRMSAWLERLEASPDELVRLAVRGQHLQRWQVPRDEYPEGRVGYLTWRRDQGQRAGETTAKLMREAGFDEEDAEQVARMIRKQGLGRDAGTQAVEDCACLVFLENYFADFSRQVEHDHLIRIVRKTWGKMSPQARELALELPMSDEAREIVEAALRT >NC_014532.2|WP_013330868.1|128947_129226_+|hypothetical-protein MAYHIKTAFRGTNPILQICDVSSGSVRMAWEYPKDDLERGEDPELLAMRREEAIHDLFRRLFLLTTEQYLKGELEPMPGLGAWRRAPRPGAK >NC_014532.2|WP_013330869.1|129298_130555_-|dicarboxylate/amino-acid:cation-symporter 
MSEQDPTARPNLWQRIPLWQKILAGLVLGVLAGALMGERASLFKPLGDIFINAIKMLIVPLVFSTLVVGITAMRDPQKMGRIGLRTIALYLLTTAFAIAIGLLASWIFQPGVGLDMTFDSSVEPKEAPTLVEILVGLVPQNPIDALANGNILQIIVFAIGLGISLTLIGEKGEPVVKVFESFAEAMVKLTNIVMSFAPFGVFGLIAHVAGSYGLEVLLPLAKVIGVAYLASVLHVLLVYSGLLALLGRLNPLRYLQGILDALVVAYSSASSSGTLPVSLRCARNNLGVSEGVAGFVLPVGATINMDGTAIYQGVVAVFIAQLLGVDLSMTDYGMIILTGTLASIGTAGVPGAGLVMLSIVMAQIGLPLEAIAVIAGIDRILDMARTCVNVAGDLMVTTLVGKSEGELDEDVYNAKSWR >NC_014532.2|WP_013330870.1|130702_131464_-|dienelactone-hydrolase-family-protein MRPIATLTLGSLLLAGFADTALAEETDGQRIDYQVNDEAFTGYLASAPDEARGTVLIVHDWDGLTDYERQRADMLAAEGYDAFAIDLYGKGNRPVETDAKKAETARLYDDRERMRRLTLAGLEEARRQGVAQPTVVMGYCFGGAVVLELARSGQAEDVRGYATFHGGLNTPEGQAYSADTPPILIAHGGADTSISMSDVAALAEELEAAGAPYEIEVYSGAPHAFTVIGSDAYQQRADEKSWAAFHDLLGEVL >NC_014532.2|WP_049786177.1|131646_132885_+|ABC-transporter-substrate-binding-protein MMNKRILAMAVAASSVAFTGLAQAEVKIGFLGGFTGGIESLTPPIYDGAELAVKQINEQGGLLDGEEIVMPTGDTTCSDASAASNAADRMVNTEEVTAIVGALCTGATIAAANNAAVPGGVTMVSPASTAPAVTNIDDNDLVFRTVPSDGFQGKMLAKLLLDKGIEEVVVTYVNNDYGSGLDKAFTTAFKEGGGTVAENLPHEDNRSDYRAELGRLSSTGVPNLVVLAYADTSGQTVVRQAYESGMFTQFIGADGMVGDSLVKAIGADVLDGMIATRPGSPELPGTEIFNEDAKAAGIDPSAVFAAQAYDAAFLLGLAIEQNGNAERAGLSEALRSVASAPGEVILPGEWKKAKELIAAGTEINYEGASGTHEFDENGDVPGVVLEMVVQDGAFTSQGYVSEEGEPSEDSGS >NC_014532.2|WP_013330872.1|132993_134313_-|branched-chain-amino-acid-ABC-transporter-permease MTQSHPPRQQRADSVAAPRRFPLRESVIFLALLAAVLVVYAAMGSAYGTRMLVEAACYAILALGLTIQWGYAGQFNAGVMGFVALGGFCAMFFSIPVNEAFWSSELPGELGLALLYMVAAIVLVVAVSRLDRIGVPKPLRTFITVVLGVVLYMAVISNFREVAGQIESRVDFIGGLALPAWFGWIIGGALAGGVGYFIGHVCLGLRSDYLAIATIGIAEIIKAFLKNADWLTRGTATVSPLPWPVPGPGDVGFTLARALYLSVTAVIIAAIFFLLHRAYNAPWGRMIRAIRDNEVSAAAMGKDINKRRLEIFVLGCILMGIGGAVLASFNSLFDPQGYLPLNHTFLVLVMVILGGPGNNLGTIFGAVVVYIIWLMSEPLALFLMQLAVDIGSATFGWDAPTNLDSRALQARVFVIGLLISLVLRFAPKGMLPEKVRHHG >NC_014532.2|WP_013330873.1|134309_135317_-|branched-chain-amino-acid-ABC-transporter-permease 
MNELVFFINNVVIAGSVSGSIYAMGAVGVTLIFSIMRFAHFAHGDMMTFGAFMVLLLTTLFPQAGAAIGVPTPILMLPLAMVLTAGLAVGIDRTFYRPLRAHGVKPIVMVIASLGVTLMLQGLIRLFAGTGGSSLYVDDRKEIFRLPIPIEGVRMPVVITEPQLYLFVLTIICVVALHFFLSRSRLGKAMRAMSDNPDLAQASGINTNTIVAVTWMLAGGLAAIAGTLLSLDVTFKPDLSFFLLLPIFAAAIVGGVGHPFGAVAGGFVVGFAESLAVFNWSVLLRPFRDSLPEWLALPSNLSFVGTEYKIVVPFFILVAILVWRPTGIFKGKVIT >NC_014532.2|WP_013330874.1|135357_136062_-|ABC-transporter-ATP-binding-protein MPLLDARNVHGGYGGMNILNGVDMAIEANEVGVIVGPNGAGKSTMLKAIFGLLNVSQGEILLNGEPIQNQPPNQLVKRGMGFVPQEHNIFPSLTVKENLQMGAYLKPDNVKRMLARIYEFFPPLYDKRHQPAGELSGGQRQMVAMGRALMAEPDLLLLDEPTAGLSPRYMNEIFARVKEINAAGVGVLMVEQNAKQALGIADRGFVLAAGQNRFTDTGAALLADPDVAKSFLGG >NC_014532.2|WP_041601828.1|136148_136919_-|ABC-transporter-ATP-binding-protein MSAIIDVQHVRKAFGGLQVIDDCSIQVAQGSVTGLIGPNGAGKSTLFNIIAGALPLDSGQVWLDGEDITNRPANELFHKGLLRTFQIAHEFANMTALENLMMVPPRQSGEHLFSTWLKPRAVGREEAEVCRRALEVIDFIGLHHVRNELAGNLSGGQKKLLELGRTMMTDARIVLLDEIAAGVNRTLLGDLMRNIERLNREMGYTFLVIEHDMDMIARLCDPVIVLAQGSVMMEGSIEEIRNDKRVIEAYFGADVA >NC_014532.2|WP_041602309.1|136989_138435_-|sodium-dependent-transporter MSTNNIWTHKGTFLLAAVGSAVGLGNLWRFPYLAGENGGGAFLLIYAVTLFAVGVPILIAEILLGRSSRRSPIMGMRFLSRTHGTSRAWESIGWLGAASAFIILSFYSVIAGWALHYTWRMITGSLAGADAATIASGFDALLASPALLTLYHTLFIAASGLIVGLGIHRGIENGLRVLMPALLAILLVILAYSAMQGDMNAAARFLFTFQLSDLSVAGWLAAMGQSFFTLSLGMGAIMAYGAYMPGEASLSRTALAIVVIDTAVALIAGLAIFALVFGADLAPDEGPGLMFVTLPLAFAEMPGGSLVGGAFFILVLGAAISSAISMIEPVAAFLVERFDLNRAQAVAAMVITSWALGLLSVFSFNVWAEHSPFHELLGLSAFGLLELLTHIFMPLGGLMISLFAGWALTHGEVMKELRTSEGWFQTWRFLVRFVSPAAVAFVFLQAIPQLDGYLLPLIGAVVIVGVFAASRIFLAESHQNP |
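
The spacer records above use pipe-delimited FASTA headers such as `>1.1|126739|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT`. The page does not document the fields, so the sketch below assumes they are `array.spacer` index, start coordinate, spacer length, contig accession, and the tools that reported the spacer. The `SpacerHeader` type and `parse_spacer_header` function are hypothetical names introduced only for illustration. The assumption is at least consistent with the listed data: consecutive spacer starts in NC_014532_1 differ by 60, the 32 nt spacer plus the 28 nt repeat.

```python
from typing import NamedTuple, Tuple


class SpacerHeader(NamedTuple):
    """Fields of one spacer header; the field meanings are assumed, not documented."""
    array_index: int
    spacer_index: int
    start: int
    length: int
    contig: str
    tools: Tuple[str, ...]


def parse_spacer_header(header: str) -> SpacerHeader:
    """Split a header like '>1.1|126739|32|NC_014532|PILER-CR,CRT' into typed fields."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) != 5:
        raise ValueError(f"expected 5 pipe-delimited fields, got {len(fields)}")
    array_index, spacer_index = fields[0].split(".")
    return SpacerHeader(
        array_index=int(array_index),
        spacer_index=int(spacer_index),
        start=int(fields[1]),
        length=int(fields[2]),
        contig=fields[3],
        tools=tuple(fields[4].split(",")),
    )


# Headers copied from the NC_014532_1 spacer list.
first = parse_spacer_header(">1.1|126739|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT")
second = parse_spacer_header(">1.2|126799|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT")

# Length of the repeat GTTCGCTGCCGCCCAGGCAGCTCAGAAA shown in the array records.
REPEAT_LENGTH = 28

# Distance between consecutive spacer starts: one spacer plus one repeat.
spacing = second.start - first.start
print(first)
print(spacing == first.length + REPEAT_LENGTH)
```

The same parser applies to the single-spacer orphan arrays, whose headers list only `CRISPRCasFinder` in the tools field.
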
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_014532_2 | 1759176-1759295 | Orphan |
NA
Consensus repeat of NC_014532_2
|
1 spacer
spacers of NC_014532_2
>2.1|1759206|60|NC_014532|CRISPRCasFinder TCGCCGTTGACGCCGTCGGCTGCCCGCTGGATTCCGATGGTGACGGCGTGCCGGACTATC |
CRISPR arrays and Neighbor proteins around NC_014532_2
The CRISPR arrays of NC_014532_2 >merge|NC_014532|2|1759176-1759295|CRISPRCasFinder AGGACCAGTGCCCGGGTACCCCGGCCGGTGTCGCCGTTGACGCCGTCGGCTGCCCGCTGGATTCCGATGGTGACGGCGTGCCGGACTATCAGGACCAGTGCCCGGGTACCCCGGCCGGTG >NC_014532|2|2|1759176-1759295|CRISPRCasFinder AGGACCAGTGCCCGGGTACCCCGGCCGGTG TCGCCGTTGACGCCGTCGGCTGCCCGCTGGATTCCGATGGTGACGGCGTGCCGGACTATC AGGACCAGTGCCCGGGTACCCCGGCCGGTG
>NC_014532.2|WP_013332268.1|1758098_1758566_+|DUF2489-domain-containing-protein MSTTIALILLGLGLAIVAGLGVYAYVLWREVRRRQAFREEELRRAHDNCLENLELVANALQQGQVDITEGAWRCKTLLDILDPSLVSRPEFLAFAEVHERTRHLHTHSARQALTPRARFQEDRERLKVEDEWRDEVIKAASHALVFRRGWPDSLH >NC_014532.2|WP_013332267.1|1756451_1757897_-|DASS-family-sodium-coupled-anion-symporter MSSSPASPPPALAARIGLWLGPLWLVLTWLSPAPAGMPESAWACVGLALLMATWWSTEAIPIPATSLLPLVLMPALGIEGMGDTAVSYANPIIYLFLGGFLLGIAMQRWNLHRRIALHVLKVVGQRPRRQIGGFMIATGFLSMWVSNTATAIMMLPIGMSVVSLLDDSDPEELRRYATALLLAIAYSASIGGVATLIGTPPNALLAGYLADSRGIDLGFAQWMLVGLPISLAMMVCAWWWLTRRGFALDTGEDGAAMVDRELARLGTMSSAERRVGVIFLLAALAWVVRPLLNQHGLDWLSDTGIAIAAGILLFLLPSGNERGQRLMRWEDAQNLPWGILLLFGGGLALAGGISRSGLAEWIAQHLGIFGAFPVLALIGVVVLVIIFLTEVTSNTATAAAFLPLLGALALSLDISPLLVTVPAAIAASCAFMMPVATPPNAIVFATGHMKIQSMIRAGFVLNLISTVLVTLLAYPLLMLFW >NC_014532.2|WP_013332266.1|1755506_1756412_+|protoheme-IX-farnesyltransferase MHDARSMAQQAVMPLWRDLVTLGKPRVVAVMLVCSLVGMLLARPVPPFDKLVLGLVGIGLAASGAAAFNHVVDRRLDAMMLRTASRPLATRRLSIPLALGWASLLSVMGIGLLYVGVNALTAWLTFGSLIGYALIYTAFLKRATPQNIVIGGVAGAAPPLLGWTSVSDQLGPEPLLLVLIVFAWTPPHFWALAIHKREEYERAEVPMLPVTHGEAFTRLQVWLYGWLTVAVTLLPFVIGMSGWLYLAGVTALNVRFMWWNGKVWRGRDPKAPLAAFWFSIRYILGVFVVLLLDSYATLWWS >NC_014532.2|WP_041602485.1|1754458_1755514_+|COX15/CtaA-family-protein MRDRQYRARLNGLRWLSLLGGLLAALVVLAGAWTRLVDAGLGCPDWPGCYGQWVVPDSTRALMHSPDVPLDASKAWMEMLHRYLASSLGLLAIAVVVLGRRLRHHEGYPWRFSLGLLTLILVQGAFGAFTVTLRLWPQVVTLHLLGGMAVMGSFLWLYLRFRRLAVPGVARRRPRRLTPLWGLALVLLVLQLGLGGWTSSNYAGLACQGFPTCNAQWWPNMDWGEGFHLTQTVGPNYLHGQLHGEARSAIQMGHRLGGVALFLCLLGLGLRHRRDRGVSPWLGAMGGACLLQAALGIANVLFWLPLWLALLHTAGAAVLVTATLLAVWHWRWGDTVARSSPSVAARELMHA >NC_014532.2|WP_013332264.1|1753900_1754437_+|hypothetical-protein MTDARIARSRFKLLALFAVFALPMVMAWGMVEWRLGIPDERTAHGTLEPELPQLADWPLGEVSKEGADDWLMAFDCTDDCAESADRWWRVHRALGRDAHRVSRLRIGGTQSEALPGEAVVTWQGAPEWREPGTLWIIDPEGRAVLSYGEGVEASNVLEDIERLLELNPEPPLARLHDE >NC_014532.2|WP_109637631.1|1753178_1753904_+|SURF1-family-protein 
MTRFATRMRSSRRLMLWFGFWACLVVLGLGLGLWQWERAADKRELLARYDSAPRLVAPESAPPDGARISVSGEFLAKETLFLDNRIHGERLGVAALTPLRGDDGRLWLVERGFLPTGPSRDTPRVSTPEGRVSVAGRWQVAGDSAPLFGPNREGKRLQHIALDAWEGLGGFAHAGWLHQEEGGGHLASWWQPNVLPPSRHLGYAAQWWGLALTALVVMIVGARRLSRDRSRHTPNDKETRP >NC_014532.2|WP_013332262.1|1752973_1753168_-|DUF2909-domain-containing-protein MLLKVLIALVFIAMVASLAAGAGFLLKDGGRSRRVLISLKLRVCLAALLLILLLYGFYAGGLGG >NC_014532.2|WP_013332261.1|1752087_1752951_+|cytochrome-c-oxidase-subunit-3 MSGGSYYVPASSKWPALGSLALGIMMVGTGMVLVHGNSGAPIMVIGLVGILAVMALWFRDVIHESRKGLYDDQMDRSFRWGMGWFIFSEVMFFAAFFGALFYIRTFALPWLDGEGAKGVAALLWPDFTASWPLLEPPDAAIQGPHQTFSPWHLPLVNTLILVGSSITLTVAHEGLKEGRRTTARHWLTLTVLLGLCFIAIQGIEYREAYVHYGITLQAGIYGATFFLLTGFHGAHVIVGTLILIAILARVWKGHFSADDHFGFEAAAWYWHFVDVVWIGLFTFVYVF >NC_014532.2|WP_013332260.1|1751503_1752091_+|cytochrome-c-oxidase-assembly-protein MTERHTDDTRRGVRRTVARTLVALAGMFVFAFALVPLYDVFCQVTGLNGKTSNQAQALVHEDADEGRVVTMQFITRGSPGLPWSLEAHTRQVRVHPGQSAEVEFTFENMGDEVSVARAVPSVTPSQASLHLRKLACFCFQNQRLAPGERFEAPLVFQLTRDLPEDIQTVTLVYTLYRQDAAPSPGSGDQVRGGDA >NC_014532.2|WP_013332259.1|1749844_1751482_+|cytochrome-c-oxidase-subunit-I MASHLPPRPTAQQSQADAGGMAADEHHHYGPRGLKRWLLTTNHKEIGTLYLIFSLTMFFIGGIFAMVVRAELFQPGLQLVQPEFFNQMTTMHGLIMVFGAVMPAFVGLANWMVPLQIGAPDMALPRLNNFSFWLLPVAFALLLSTLVMPGGAPNFGWTFYAPLSTTYAPPSTTFFIFSLHLAGISSILGAINIIATILNLRTPGMRLMDMSLFVWTWLITAFLLIAVMPVLAGVITMMLLDINFGTSFFNAAGGGDPVLFQHLFWFFGHPEVYIMILPAFGIVSVIIPTFARKRLFGYASMVYATASIAILSFLVWAHHMFVVGLPLVAELFFMYSTMLIAVPTGVKVFNWITTLFRGSLTFEPPMLFALAFVVLFTIGGFSGLMLAISPADFQYHDTYFVVAHFHYVLVPGAVFAIMAAVYYWLPKWTGHYPHTRLSQWHFWLSVIGVNLTFFPMHFAGLAGMPRRIPDYALQFADFNMVTSIGAFMFGASQLLFVAVVVLCVRGGEKAPAKAWDGAEDLEWTVPSPAPLHTFETPPHFEPHRH >NC_014532.2|WP_013332270.1|1759879_1761778_+|threonine--tRNA-ligase 
MPIVTLPDGSQRSFDEPLSIMQLAESIGTGLAKACVAGRIDGELVDAADIIDHDAEVAIITARDPEGLDIIRHSCAHLIGHAVKQLYPDAKMAIGPVIEDGFYYDIDFGRSITPEDLEAIEARMKSLIETGYDVVREYVDRDRAMLTFLHRDEPYKQEIVREIPEGETIRLYHHQEYTDMCRGPHVPNTRHLKAFKLTKLAGAYWRGDAERPMLTRIYGTAWGDKKQLKAYLKRLEEAEKRDHRKLARKLDLFHMQEEAPGMVFWHPRGWTLWQVVEQYMRQVYKDGGYQEIRCPQVMDVSLWKKSGHWDNYADGMFFTESEKREYALKPMNCPGHVQVFNSGLRSYRELPVRYGEFGGCHRNEPSGALHGIMRVRAFTQDDGHVFCTEEQIEPEVTSFHRQALQVYRDFGFEDIAVKIALRPEKRLGDDAVWDRAEEALRGALRTCDVDWDELPGEGAFYGPKIEYHMKDCLGREWQVGTMQVDFMMPVRLGAQYVAEDGERRSPVMLHRAIVGSMERFIGILIEHYAGAMPLWLAPQQAVVLTITDAQRDYATYLEQRLQKKGLRVKADLRNEKIGFKIREHTLQKVPYLLVVGDKEVEADSVAVRSRSGEDLGTMTVDAFIDRIQAERR >NC_014532.2|WP_013332271.1|1761860_1762352_+|translation-initiation-factor-IF-3 MNERITDEQVRLIDSDGEQLGIMPTRDALERAEAAGMDLVQISNADPIVCKIMDYGKFVFEQKKQKAAQKKKQKQIQVKEVKFRPGTDEGDYQVKMKNLTRFLESGDKGKVTLRFRGREMAHQDIGRKLMERIAADLEEIGTVESFPKMEGRQMIMIIAPKKK >NC_014532.2|WP_013332272.1|1762437_1762632_+|50S-ribosomal-protein-L35 MPKIKSNSGAAKRFKKTANGFKHKQSFRSHILTKKSTKRKRHLRGMKQIHDADKPLVQRMLPNL >NC_014532.2|WP_013332273.1|1762668_1763022_+|50S-ribosomal-protein-L20 MTRVKRGVVARRRHKKILKQAKGYYGARSRVFRVAKQAVIKAGQYAYRDRRQRKRQFRALWIQRINAGARQHGLSYSRFVGGLKKAGIEIDRKVLADLAVNEKAAFAAIVEKAKAAQ >NC_014532.2|WP_013332274.1|1763203_1764223_+|phenylalanine--tRNA-ligase-subunit-alpha MDHLPTLVAEARDAIQAAESMAALDELRVRYLGKKGEITALLKGLGQLPAEERPAAGERINQAKQALSADLEERKQALEKADLEARLAAETLDVTLPGRGQPSGGLHPVTRTLERIEGLFTHVGFDVAVGPEIEDDYHNFEALNIPAHHPARGMADTFYFDATRLLRTHTSPVQVRTMKSTEPPIRIVCPGRVYRSDSDLTHTPMFHQVEGLLVDEDVRFSDLKGTIQDFLHAFFERDDLAVRFRPSYFPFTEPSAEVDIQCVMCDGAGCRVCSHSGWLEVMGCGMVHPEVFRHSGIDSERYTGFAFGMGAERLAMLRYGVNDLRLFFDNDLRFLQQFA >NC_014532.2|WP_013332275.1|1764390_1766769_+|phenylalanine--tRNA-ligase-subunit-beta 
MKFSEQWLREWVSPALATQALADQITMAGLEVDGIEPVAAAFDGVVVAEVIERAPHPDADKLNVCQVDDGVERLQVVCGAPNVAEGQKVAFARVGAVLPGDFKIKKAKLRGVESRGMICSASELGLEEETSAGILELPSAAPVGEDFRTYMTLDDSTIEVDLTPNRGDCLSIKGMAREVGVLNRLPVEGPSVAPVASVHEETFPVRVEDTEGCPRYLGRVIKGVDVTAETPLWMVERLRRSGIRSIDPVVDITNYVMLELGQPLHAFDRANLDGAVVVRRARQGEQLVLLDGQTITLNGDTLIIADERGPLAIGGVMGGEHSGVSVDTRDIFLEAAHFSPLAVAGQARAYGLHTDASHRFERGVDPRLAREAAERATALLLEITGGEAGPLIEAADESKLPDDREVVLRRTRLDQALDKVLPDDEVGEILERLGMSVERVDEGWRARVPSWRFDIAIEEDLIEEVARIHGYNQLPARHPRALLGPRPDNEARTPLSALRQRMVSRGYFEAVTYSFVAPDLQETLLPEAVSPVLANPISSDLSVMRASLFPGLVRALEHNLNRQQNRVRLFETGLVFRGELDDLDQVPMLGALICGSREPEGWSGGKEQVDFFDLKGDLESLIEMGGEAEAWRFEPGAHPALHPGQCARVMYRGQEAGWIGTLHPAVRARLGLKTDALLFEVRLDALTHGRVPAFKPLSRYPEVRRDLAFLVDAEQPVQALLDTLRAQAGEWLVEAHLFDVYQGKGVPEGRKSVALGLTWQHPSRTLNDDEINQLVDAIVEESRLHLGAELRA >NC_014532.2|WP_041602015.1|1766831_1767134_+|integration-host-factor-subunit-alpha MGALTKAELAEHLHAELGLSKREAKSMVESFFEEIRGCLRENEQVKLSGFGNFDLRDKRERPGRNPKTGEEIPISARRVVTFRPGQKLKSQVEAYTGDQS >NC_014532.2|WP_013332277.1|1767213_1767420_-|DUF2788-domain-containing-protein MQDAIDTWITPFMIGGLMLFMGFIIWDLARKSGAGRFGTVMLFVVLGAGMLAYLIKVVIGWSLEHGVL >NC_014532.2|WP_041602016.1|1767537_1768002_+|flavodoxin-domain-containing-protein MPMLKIFVGTVYGGALDVAEQVAPLFEQAGYEVSIFDQPTLDDLIGSPTDLALFCTSTTGSGDYPGNLVAFVRELEAKSPGLVGLKYGLIAMGDSSYVDSFCGAGRSLDEVLQGQGAERLGERLEVDAMETFMADDAALPWVDDWIESQQLKVA >NC_014532.2|WP_049786291.1|1768055_1768379_+|hypothetical-protein MAASVPRYLAGGHDTHLALLAIAGVAVAALAVFQWWLLPVASRAALPALMRRLVACLVIGLLATGIWHALFGAWSGWPLLVSHGAALGLLLHALGLWWKPAAKKGKE |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_014532_3 | 3332655-3332754 | Orphan |
NA
Consensus repeat of NC_014532_3
|
1 spacer
spacers of NC_014532_3
>3.1|3332678|54|NC_014532|CRISPRCasFinder ACTGGATAAATCACCTAACCGTGCGAAAAGTCAGCGAAAACAGACTCTTCTCGC |
CRISPR arrays and Neighbor proteins around NC_014532_3
The CRISPR arrays of NC_014532_3 >merge|NC_014532|3|3332655-3332754|CRISPRCasFinder CAAGAACTCAGCACCTTGGAAACACTGGATAAATCACCTAACCGTGCGAAAAGTCAGCGAAAACAGACTCTTCTCGCCAAGAATTCAGCACCTTGGAAAC >NC_014532|3|3|3332655-3332754|CRISPRCasFinder CAAGAACTCAGCACCTTGGAAAC ACTGGATAAATCACCTAACCGTGCGAAAAGTCAGCGAAAACAGACTCTTCTCGC CAAGAATTCAGCACCTTGGAAAC
>NC_014532.2|WP_013333597.1|3330981_3332394_-|MFS-transporter MSRLFATREGDDGLPGPERRLAVLALIFGTTMAVVDATMINLALPSIAADLEVASASAVWVTNIFQVTCAAFLLVFSGLSEVVGRRRLYVAGLALFAVSAAGSALSRDLNTLLAFRALQGLGAAATLSIGPSLYRTIFPSRLLGSALGLSSLVVATGYTAGPAIGGLVLSVADWPWLFALPVPIGVVAVILAWRALPREPGRRGGFDAAGAGCSILALGALFLAMDGVGHQTPVWQSLGWLALSLVVAGFFVWRQRRAPHPLLPLTLFRQRRFSLAVSASGLAFIGQGLAFVALSFLYQQGMGFSPLKTAWLFTPWPLAIMVAGPLAGRLADRVNPSLLSCLGLVVLIAGMIALADLEAEAGVVDCLWRTALCGLGFGIFQPPNNRELMASVPAERSANASGVMSTTRTVGQALGVALVGACLSVGAPVQTALWGGAVAGGLALLASFGRVSLAGEAARTRRRAASERVQ >NC_014532.2|WP_013333596.1|3330641_3330953_-|BolA-family-transcriptional-regulator MSTQSIIEEKLQALEPTLLTVENESHMHNVPPNAETHFKVTLVSSRFEGMMPVKRHQQIYALLADELAGPVHALALHPYTPEEWQSRGEARPDSPNCRGGGAS >NC_014532.2|WP_013333595.1|3329811_3330645_-|hypothetical-protein MTTEPHRLQYLEAMGLTAWVARYRLPNARPTEACEWEPEPAGEAGSRAPGERLHALLDDAAEASSTSAPSNESTTRPSAGQGRARALLGDLVPGEASASTATPPPAPPVSTPTEAPAEALRFTWQVTCLDGRWLVVLPRDVGPSDVEYRLLGNLLRAAGVVPSRPPSFETFRWPQLEGLPVEAPLEEAQEGLRAFLNGRRQRGWVPERLLVFGDDAVLNDLLALAEGQSGLLSMPVWQGPDLKELASGAEAKRALWPRMQGWKRDWRVGDEESRADA >NC_014532.2|WP_013333594.1|3329354_3329819_-|ribosomal-protein-S18-alanine-N-acetyltransferase MPDPSPAPLGRSALAALVELERVYQTYPLSAARLKAALTDGADVVFGLEEDGELLGYAILSRLPFDAELQSILVASHARRRGLAVALMEAVIAQAKAWKSERLLLEVREANAPAITLYRRMGFAEDGRRRDYYPSLDGAGREDALLMSRHLGGA >NC_014532.2|WP_013333593.1|3329115_3329346_+|cell-division-protein-ZapB MSIELFNQLEQKVSSAVEALELMKMEAEELREENARLKQEREEWERRLSALLGKFDDVETEQSSQQEQPAPQQQPG >NC_014532.2|WP_013333592.1|3328375_3329029_+|16S-rRNA-(guanine(966)-N(2))-methyltransferase-RsmD MTRRRSPSRPSRHSRAPRRRDGNRGRGQLRIIGGEYRRRLLPVIDLPGLRPTPDRVRETLFNWLGPGLAGARVLDLFAGTGALGLEALSRGAHDAILVELDARASRALEDNLATLGITHARVVNADVMRFLDAEPTPHSLVFLDPPFRQDLAAACCAALEGGWLSDDASIYLETESTLAPEVPANWILHREVRAGDSTGRLYRRRPTGEDSPTEDAC >NC_014532.2|WP_013333591.1|3326916_3328284_-|signal-recognition-particle-docking-protein-FtsY 
MFGFFKRKKKQEEQASQTPEQEVERDEAALADEEAASAPEVEPEPAPEVESEPAPEVEPEPTPEVEPDPVPEVEPEPTPEVEPEPAPEVEPEPTPEVEPEPAPEVEPEPTPEVEPEPAPEIEPEPEPTPEPAAAAPREKPRGEKKGWFARIKDGLGKTRANLTDGLAGLFLGRKQIDDDLMEELETQLLMADVGIEATTEIIDRLTERVSRKELKDPEALFKALQEELASLLDGVTQPLELPPKGEGPFVILVVGVNGVGKTTTIGKLTQRFQREGRSVMLAAGDTFRAAAVEQLKVWGERNSVPVVAQHTGADSASVVYDALAAARARGVDVLIADTAGRLHNKSHLMEELKKVRRVMGKLDADAPHEVMLVLDAGTGQNALSQASTFNEAVPVTGITLTKLDGTAKGGIIFALAQQLGTPIRFIGVGETLDDLRPFAAREFVDALFDRDDAAA >NC_014532.2|WP_013333590.1|3326251_3326920_-|cell-division-ATP-binding-protein-FtsE MIAFEHVGKRYGGRFEALAHLNFRVGRGEMVFLTGHSGAGKSSLLRLIIRLERPSRGRILVAGHDIDRLHHTQVPFYRRQIGVVFQDHQLLFDRSIYHNVALPLEIQGMEPRETSRRVRAALDKVGLLHRERALPIELSGGEQQRVGIARAVVNKPALLLADEPTGNLDPQLSADIMSLFEDFNRIGTTVMVASHDLALIARLRHRTLRLHEGRLVADEEAL >NC_014532.2|WP_013333589.1|3325277_3326255_-|ABC-transporter-permease MSRQAQKPAQRGARAYRAGASGRWRSWGRHHRAMARDSAMRLLRHPLSSLLTMLAIAIALVLPAGLWLALDGARLLDAELDESATLTAYLAERVDDGEAGRIEEALAAQQGVADTRLITAAEGMAEFQQSLGLEDALARLPDNPLPASVVISPVDPSPEAVRRLADELEGLNGVEEVRLDLAWLERLRHLAELGQRVTLALAVLFGMGVLLVVGNTIRLAVENRRQEIEVVTLIGATHPFVRRPFLYSGAWYGLGGGVLAWGLLTLGGDWLSGPVSALAASYGASFALPTLGIGGSATLLACSTLLGWLGAWIAVSRHLAQIRPR >NC_014532.2|WP_013333588.1|3324282_3325149_-|RNA-polymerase-sigma-factor-RpoH MSTSLLPVGHLSPGHDLNGYIQAVNGIPMLTVDEERELAFRLHDEGDLEAARRLVMSHLRFVVHIARSYSGYGLAQADLIQEGNVGLMKAVKRFDPNQGVRLVSFAVHWIKAEIHEFVLRNWRIVKVATTKAQRKLFFNLRGAKKRLAWLNSNEVEAIAKDLDVKPEVVREMEGRLSAHDAGFDAAPGEDEESAYQAPVHYLDDASQDPATQLEDSDWEEDSTQRLQAALSELDERSRDILQRRWLSDDKSTLHDLADVYGVSAERIRQLEKNAMKKIRQSIGDTLAA >NC_014532.2|WP_041602190.1|3332891_3333278_-|hypothetical-protein MKLLNRSALSVRPTQHFVDWINALEPTVGDDDLALEDVERESTVYLIPEMDTPENLEAFVRERYLEILETELRAWEEDERQWPETLDWALFQRFLCIEHSYLAIDLDDEAALEVAEVDDSMLLETDQD >NC_014532.2|WP_013333599.1|3333434_3334988_-|lysine--tRNA-ligase 
MAHQDNSQAPAGTQDENHLIAERRAKLAARREQAAASGGSAFPNDFRRDSLAAELAAELGDKDKAELESLGRPAAVAGRILRKRGPFIVIQDASGQIQLYVDKKGLPAETLEDIKGWDIGDIVAGRGPVHKSGKGDLYVMMEEARLLTKSLRPLPDKFHGLTDMEARYRQRYVDLIMNPDSRRVFETRAGVISAMRRFFEDHGFMEVETPMLQPIPGGATARPFITHHNALDIDMYLRIAPELYLKRLVVGGFEKVFEINRNFRNEGLSTRHNPEFTMVEYYQAYADYQDLMDFTEAMLRTVTREVLGDTTVVSTVRDSEGEVLETFEYDFGKPFERLSVFDAILAYNPDITAEALADEAAARQIAERLDIDVKDGWGLGKVQIEIFEKTVEHRLQQPTFIIDYPTEVSPLARRKDTDPFVTERFEFFVGGRELANGFSELNDAEDQAERFAAQAAEKDAGDQEAMYYDADYVRALEYGLPPTAGEGIGIDRLVMLLTDSASIRDVLLFPAMRPSAD >NC_014532.2|WP_095522731.1|3335087_3336186_-|peptide-chain-release-factor-2 MQEVNPINHLIKDLSERTDVLRGYLDYAEKKERLEEVTRELEDPEVWNDPDYAQKLGKERATLEMVVDTIDTLERGLNDNRDLLELAEMEEDADTVDEVSRELESLRADLEKLEFRRMFAGEMDPNNAYLDIQAGSGGTEAQDWANMLLRMYLRWAEHHGFKADLIELSAGEVAGIKSATVHIQGEYAFGWLRTETGVHRLVRKSPFDSGGRRHTSFASVFLSPEVDDSFEVEINPSDLRVDTYRSSGAGGQHVNTTDSAVRITHEPTGIVVACQSQRSQHANRDFAMKQLRAKLWEHEMDKRNAAKQAAEDSKADIGWGSQIRSYVLDDQRIKDLRTGVQSSSCEKVLDGDLDQFIEASLKQGL >NC_014532.2|WP_013333601.1|3336450_3338034_+|peptide-ABC-transporter-substrate-binding-protein MTHNRTPLAGAIALAAALVTTPAWAQTLNLGVTGELASFDTSQVSGGIWESQILMDVYEGLVKKAPDGEVLPGMATSWEVSDDGRTYTFHIREDAAWSDGEPVTAEDFVFGWQHLLDPKNASKYAYMLYPVVNAEAVNTGEKPLDALGVASLDDGRTFQVELTAPTPYFIQLLTHYTAYPAPKHAVEEYGRKWVKLDNIVTNGAFTPEEWVSQSRISVSPNPEYYDADEIALDGVNYYTVEDRNAGVSRFRSGELDIMREYPSSLYGMLQEELPDATHMAPYLGSYYYVFNHREGHPTADPKVREALSLVVRRKVLSEQIMGGTFLPSRSFVPEGIHHYDVQQMPQEGSMDERMERARQLLAEAGYGPDSPLHLRLRYNTNDEHKKIAVALAAMWRPLGVEIEMINSEATVHYQTIAEGDFDIARAGWIADYNDAENFLSLLHSGVGNNYGAYSNAEFDDLTDQASHTLDADKRESLLEQAEQTALNDYAILPLLYYVSRNLVNPAISGWEDNVEDDHPSRWISFDK >NC_014532.2|WP_013333602.1|3338256_3339840_+|peptide-ABC-transporter-substrate-binding-protein 
MLSHRFTRAALLATLVAGAAQSAPAAVLQVGNGAEPGTLDPQKTNGVWETRITRELFERLVTYAADGSLVPGLAESWTISDDGTTYTFHLRQAEWSDGTPITADDAVFALRRLLKPAIASHNANLYYPIKNARAVNTGQAEPSELGVSAADEHTLVIQLDEPTAYFLQALAMTEAAPLPRHLVEKAGDEWTRPGTMVSSGAFTLREWRPQARIDLDRNPHFHDADTVSLDGVTFYPIDDTGSALNRFRAGDLDISYSGVPASRFDWVKDNLGESLRVGPLVAEYFYMFNLRDGQPLADERVREALSLAVRREVITDRILGMGQRASYWYVPRAAEGGTRGSLDVAEQPMEQRLARAKRLMQEAGYGPDNPLHVTLRYNTLEDHKKIAVAVAAMWKPLGVEVELINAEAAVHYATVNEGDFEIARYGMVATINDPYDFLNAYAKGGSAQRSTGYRNDAYDALVERSTRELNTERRAELMTRAEQMLLDDHALLPLYDYVSAHLVSPEVKGWQTTAIDVHPLRYIQLED >NC_014532.2|WP_013333603.1|3339894_3340821_+|oligopeptide-ABC-transporter-permease-OppB MLSYTLKRLLQAIPTMLIVITISFFLMRIAPGGPFDGERALPPEIEANLMAAYHLDEPLPMQYLRYMGNLLQGDFGPSFKYKDFSVTELIMQGFPVSLEIGGLAILLALLLGLPLGVIAALKRNSTIDYLVMGTALAGIAIPNFVIAPILALVFGVLLAWLPAGGWNGGALPNLVLPVIALSIQQIAYIARMMRASMIEVLGSHYIRTARAKGLAESQVIWRHALRPALLPVTSYLGPAVAGIITGSVVIEQIFGIPGIGRYFVQGALNRDYTLVMGTVVFYGALIVLMNLLVDLIYSALDPQIRHDD >NC_014532.2|WP_013333604.1|3340841_3341768_+|ABC-transporter-permease-subunit MTTEPLTHDGDRHAPGDDLAPGAAPAAGESLTRDAWRRLKQNRAAMVSLVMLSVITVICVFGPYVLPWGLADVDWNAFNAPPSIENGHLLGTDANGRDLLTRTLYGGRVSLSVALVASLVSLVIGVLYGAISGYLGGRVDNIMMRFVDIMYSLPFMFLVILLMVVFGRNILLIYAAIGAVEWLDMARIVRGQTLALKQREFVEAAHALGVRDSRIVTRHLIPNAIGPVIVYVTLTVPKVILLESFLSFLGLGVQEPLTSWGVLISEGTDMMQSSPWMLLVPSVFLAMTLFCLNFLGDGLRDALDPKTR >NC_014532.2|WP_013333605.1|3341777_3343376_+|ABC-transporter-ATP-binding-protein MSDTLLEIDNLSVDFQLPDGTVPAVKDVSFDIRAGETVALVGESGSGKSVSSTAAMRLLPELAQARGAIRFRGEDLLAATPRRMRRIRGNAISMIFQEPMTSLNPLHRIGAQIIEVLTRHNKAKGRAARTRAIELLEQVGIPEPERRIGSYPHELSGGQRQRVMIAMALACEPELLIADEPTTALDVTVQAQILQLLKSLQARYGMAILFITHDLGIVRHFADRVCVMRRGEMVERGDTAEVFTNPRHDYTRMLIDAEPRGGKSPVEASAPVLLEARNLRVRFALKKRLFRPSSYFEAVRGIDLTIQRGQTVGVVGESGSGKSTLGRALLRLLKSSGDIRFDDSDLTALDGAGMRPLRSRLQVVFQDPFGSLSPRLTVGEVISEGLRVHHPELDRRQRERRVIEALEEVALDPAMRNRYPHEFSGGQRQRIAIARALVLKPEFLLLDEPTSALDRSVQVTVIELLRNLQAKYGLTYLFISHDLAVVRALADTVMVMKSGQVVEQGPTEAIFANPREAYTRELMRAAFVDDAA >NC_014532.2|WP_013333606.1|3343635_3344760_+|porin 
MFHHYKLTGLAVAIGAALSTQQALAVTAYETDQDKLTISGRIAAGSSFIDNVDDDHDPTNAGSRIRLIHEHEFEHGWSSVARAEWGFDPFFEHGNDGHYKRMLYAGVRHDDYGTLLIGKQYSLWYDMVAYWTDWFWYNGATAQGSFNGAFGDGGFEGNGRPDNAVSYKNTWGDWSLGLLYQTSRDDVPTGAGYTGNLTGFERDYTAQGAAVWQPTEDLSLGATYTHSAIDGKTAGGGKRSKNVDAGLLAARWTPGNWYFALTGGRYDNLVRDGNFSGVNTTDGIVDEARGVEGVALYNLKGQVPGKVQLYTGFNRLEDRASEARSAFYLAGAAWLTFDENLIIALERKFDDSVDADGASDIGNDETNLLVRYNF >NC_014532.2|WP_013333607.1|3344904_3346242_-|outer-membrane-protein-transport-protein MQNQLNKLTVAVTLASAVLASSQAAASGFQVREQSAKALGNAMAGAAAGAEDVSYMTFNPAAIGNVDGTQVAGGISYIDANFELTDASAGPAGLPLSYDRGGSREGGEEAWVPSFAFKTQLDDRFDFGLSVSAPYGLSTEYDKNWIGRYHAIETDLQTIDIQPTLNYRATDRLNLAVGLRAQYADATLSNAIDLGGMSGNPALVGNADGKAEVTGDDWGYGYTLGALFQATDRTRLGISYRSEVDLTLEGDVNYSASNAAGRQILAGAQAMGQLRDAGGKADLTTPANMNLGVYHQLTDRFAVMANAEWTEWSSFDKLVVKSGGQDLSTTTENWDDTWAFSVGANYQLNREWLLRAGLGVDESPVPDSEHRTPRVPDADRRWATLGATWMPTPDLGVTAGYMRVFGDDGDIDQSGAKPENATRGDLSGTYEVDANVFALSVDYRF |
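
The per-tool array record for NC_014532_3 lists the repeat, spacer, and repeat as whitespace-separated units. As a rough consistency check, the sketch below splits that record, compares the summed unit lengths with the reported location, and counts mismatches between the two repeat copies. It assumes the coordinates are 1-based and inclusive, which the page does not state. The helper name `hamming` is hypothetical.

```python
def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


# Location and record copied from the NC_014532_3 entry (CRISPRCasFinder).
ARRAY_START, ARRAY_END = 3332655, 3332754
record = (
    "CAAGAACTCAGCACCTTGGAAAC "
    "ACTGGATAAATCACCTAACCGTGCGAAAAGTCAGCGAAAACAGACTCTTCTCGC "
    "CAAGAATTCAGCACCTTGGAAAC"
)

units = record.split()
repeats = units[0::2]   # even positions: repeat copies
spacers = units[1::2]   # odd positions: spacers

# Assumption: coordinates are 1-based and inclusive.
reported_length = ARRAY_END - ARRAY_START + 1
summed_length = sum(len(u) for u in units)

# The first spacer should begin right after the first repeat.
first_spacer_start = ARRAY_START + len(repeats[0])

print(reported_length == summed_length)
print(first_spacer_start)
print(hamming(repeats[0], repeats[1]))
```

Under that coordinate assumption, the computed spacer start can be compared with the second field of the spacer header (`>3.1|3332678|54|...`), and the mismatch count shows how far the two repeat copies of this orphan array diverge.
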
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_014532_4 | 3712097-3712185 | Orphan |
NA
Consensus repeat of NC_014532_4
|
1 spacer
spacers of NC_014532_4
>4.1|3712120|43|NC_014532|CRISPRCasFinder GCTGCACGGCGAAGAAGCCTTGCTGCATGCGTACCTGACCGTG |
CRISPR arrays and Neighbor proteins around NC_014532_4
The CRISPR arrays of NC_014532_4
>merge|NC_014532|4|3712097-3712185|CRISPRCasFinder
GCCGTGACAGGTCGGGCAGGTCTGCTGCACGGCGAAGAAGCCTTGCTGCATGCGTACCTGACCGTGGCCGTGACAGGTCGGGCAGGTCT
>NC_014532|4|4|3712097-3712185|CRISPRCasFinder
GCCGTGACAGGTCGGGCAGGTCT GCTGCACGGCGAAGAAGCCTTGCTGCATGCGTACCTGACCGTG GCCGTGACAGGTCGGGCAGGTCT
>NC_014532.2|WP_041602243.1|3710742_3711375_-|LysE-family-translocator MEWLAFVIAALGFAYLPGPAMLYTAAQTLGRGRRAGFCAVLGVHLGCYVHVIAAALGLAALFAAVPPAYVAMKIIGGCYLLWVGMRLWRRGIGSPETTDGTGKGARRVMVDSMLVEILNPKTALFFVAFLPQFAQPEAGMAMGWQLLWLGIAANFLFSSADVVVVMLAAPLRRWLQGPRGSTRLAQRLGGGLLASLGAHGLTQVYRQGIS >NC_014532.2|WP_041602242.1|3709846_3710746_-|DMT-family-transporter MTDAMDRPLAGILLRLCSGVLFTGMMVCVKVVSTEVPLGQTVFFRSAFALLPIVIFMIWRHEFPRALATRRPAGHLLRSGLGAAAMFASFAAVARLPVAEATLLAQLAPVAMAVGGVFLLGERFTRHRAAALALVLGGVAALVVPDLGNGEQAGRLPGYGLGILAALLTAGALVTVRRISRTETPASIAFYFVLVTALAGLATWPLGWVGVSGTTLSLLILAGLFGGAAHICMTLALRFAEVSRLAPFEYVALIWPVLADLLIFGLPLSPGFLVALPLVLGGAGLAALEGRRFRWRLRR >NC_014532.2|WP_013333910.1|3709119_3709869_+|ABC-transporter-ATP-binding-protein MVTLTLDRLTAHYGRRQILSEITTPPLEGGRVVALLGPNAAGKSTLFRRILGLIGGGGSARIDGTTRERPLAYMPQDTGANAVLSVYESVLLARMQGRSLKVQDEDLAEVDRALRELGISELGERDIGDLSGGQRQLVGAAQALVQDPEILLLDEPTSALDLHRQIQLLSILQRLARERHMLILAALHDLGQALRFTDEAIMLENGRLIACGPTGEVVTPELLRRVYRVETRIEACSRGQPQLIVEAAT >NC_014532.2|WP_013333909.1|3708040_3709126_+|iron-ABC-transporter-permease MHASPPDQAAAASPTTLQGRGFYRRLVIRRQLTLAALTLALCLSLCIDLALGPARFGLGEVIAALLDPASASQQVRVILWDIRMPVALMALVVGASLSVAGAQMQTILSNPLASPFTLGISAGASFGAALALAFGVVIVPAAVEYVIPINAFVMAMLTAFAIHALSLKRGVTIETIVLLGIAMVFIFNSLMALIQFFASQQAVAAVVFWTMGSLTKATWPKLGIAAGVLAVVLPLLARHGWALTAMRLGDAKAESLGVKPRALRLEVLVLVSLLAAIAVAFVGTIGFIGLVGPHIARLLMGEDQRFFLPGAALCGALILSVGSVLSKIILPGTIIPIGIITSLVGIPFFLFLVLNHKKSAW >NC_014532.2|WP_013333908.1|3706911_3708039_+|ABC-transporter-substrate-binding-protein MHTSLLKTLGGIALTLGSAVAQADEITVTDVAGREVTVDAPVDRVILGEGRQIYLLGALQPEAPFAHVVGWREDFSQADPDNYARYAAKFPELKDIPTFGGFKDGTFDVEQAAALEPDVVLMNLEAKAATEDAAYDDKLAELGIPILYVDFREAPLEHTIPSMRLMGKLLGKQEAAEDFIDFAEAQMARVTDTIESAAPERPRVFVDRAGGYSEDCCMSFGPGNFGEYVELAGGTNIAKDIIPNTFGSLNPEQIIAANPQQVVVTGGNWDAYVPGGDWVGVGPGADMTTARAKLEALTERTAMTGIEAVENDDVHAIWHQFYNSPYYFVAVQRLAKWFHPALFADLDPGATMKELHDRFLPVDYEPGYWVSLKEH >NC_014532.2|WP_013333907.1|3706334_3706616_-|DUF2218-domain-containing-protein 
MPISRAEIVTDSGEKLINRLCKHWSHKLEVEQEGDEGRITFDNGSCLLRAEEGKLKVAVESLDEEGLDRLEGVVASHLERMSGKESLDIIWEN >NC_014532.2|WP_041602712.1|3705401_3706304_-|DMT-family-transporter MLLWAALVGLSFPAVGLMGELPPMLLTALRFAIACLGLWPLAHRAEGFALARRAVPVYALMGLCLAGFFGAMFWAAHHATALSMATLYVTVPLLAYGLGLGLRVERLAWRLPAILALGAVGALALAYAEALVRGGQMRFGIGEAVFFVGCVCSALYPVLSKWGLNAGRLPASAAVRTFWSLGLGGVLIGVLGVLVEPVSRFAAMSWSDALLLVYLGLFSSALTFWLMQRATLVLTPGAITAYGYLVPFVSMLVLFLRAPQSLDWVWLPGSLMVLAAIALLLRHDADTERKTGDSARTQAD >NC_014532.2|WP_157953433.1|3704441_3705425_+|esterase-like-activity-of-phytase-family-protein MPRMPTSGRALGALPLAILFMTLLVPLPGCANHARVVSLAGVGADITPPPRVELCGTLSLPSHWPDGTPVNGLSDLVWERDAGLLHMVSDRGWLHRARPRFEDGQLVGLSPIDSHRLRDGDGLPLEGSAADAESLSLLHGTNGKLGDSEFWVSFERDHRLQRFDRDGGPLAAPIRPAQAADAAPNKGMEAMTELPKHGLILGLESPPPGAAPGETRLFTLDGKQWRYPLAAPTGSALTELTADGDDLLALERAFAPPAPLVISLRRVRLGEPPELDVETLASFSSADGWWLDNMEGLTRLDDGRLLLLSDDNASPLQRSLLVCLRPR >NC_014532.2|WP_013333904.1|3703488_3704295_-|4-hydroxy-tetrahydrodipicolinate-reductase MTRIAIAGVAGRMGRTLVNAVQQDAEATLAGGTVSPGSSLVGADIGELAGSGKLGVMATDSLAAIAADFDVLIDFTAPRVTLDNLAVCAEHGKRMVIGTTGLSDEELAELDAYRDRLPMVFAPNMSVGVNLTFKLLETAARALGDEGYDIEVIESHHRHKVDAPSGTALKLGEVVADALGRDLKTDGVFERVGQCGPRSDKEIGFATVRAGDIVGEHTVMFATEGERIEITHKASSRMTFAKGAVRAARWVAGQPVGRYDMQDVIGLD >NC_014532.2|WP_013333903.1|3702047_3703193_-|glutamine-hydrolyzing-carbamoyl-phosphate-synthase-small-subunit MSKPAILALEDGSVFHGTAIGADGQTSGEVVFNTAMTGYQEILTDPSYSRQIVTLTYPHIGNTGVNSEDVESSSIAAAGLVIRDLPLLASNFRCEQTLSDYLAQQNVLGIADIDTRRLTRLLRDKGSQNGAILAGPDAEGEGAEARALEAAGAFPGLKGMDLAKVVSCTEPYEWSEGEWTLGSGYADASEGERPYHVVAYDYGMKRNILRMLASRGCRLTVVPAQTPAEEVLAMNPDGIFLSNGPGDPEPCDYAISAIQAFLETEIPVFGICLGHQLLALASGARTVKMNHGHHGANHPVQDLDSGRVMITSQNHGFAVEEASLPDNLRAIHRSLFDGTLQGIERTDRPAFSFQGHPEASPGPRDVAPLFDRFVEMMRRRR >NC_014532.2|WP_013333913.1|3712763_3714698_-|molecular-chaperone-DnaK 
MGRIIGIDLGTTNSCVAVLDGDDAKVIENAEGARTTPSIIAYTDDGETLVGQAAKRQAVTNPQNTLYAIKRLIGRRFKDDVVQKDIKMVPYTITEADNGDAWVEVKGNKLAPPQVSAEVLKKMKKTAEDYLGETVTEAVITVPAYFNDSQRQATKDAGRIAGLEVKRIINEPTAAALAYGMDKSRGDKTIAVYDLGGGTFDISIIEVADVDGETQFEVLATNGDTFLGGEDFDLALINYLVDQFKSDSGIDLSGDNLAMQRLKEAAEKAKIELSSAQQTEVNLPYITADNTGPKHLNVKVTRAKLESLVEELVARSLKPCKTALADADLSASDIDDVILVGGQTRMPLVQAKVAEFFGKDARKDVNPDEAVAVGAAIQGGVLGGDVKDVLLLDVTPLTLGIETLGGVMTPLIEKNTTIPTKKTQTFSTADDNQTAVTIHVLQGERKQSSGNKSLGRFDLADIPPAPRGVPQIEVAFDLDANGILNVSAKDKATGKEQSIVIKASSGLSEEEVEQMVRDAEAHADEDKKFEELVALRNQADGMVHAARKTLEEAGDKATDEEKQAIENAASELEEAAKGDDQEDIQAKLDALTEASGNLAQKMYAEQAEDAAGADAGEQAEGQKKTEDDVVDAEYEEVNDDQKKQ >NC_014532.2|WP_041602244.1|3714825_3715455_-|nucleotide-exchange-factor-GrpE MAKDPQTPLDDELSRQQEEADAQVEPESVEGELEDAIENAEQTQEERESTDNPEAEVLAAKVEELEQSLADAKDQSLRAAAEAQNVRRRAEQEAEKARKFALEKFVKELLPVVDSLEKALDAMQEGASETHREGVSMTLKLQLDVLGKFGVEVVDPTGEPFDPQYHEAVTMVPNAELEPNSVMEVIQKGYLLNGRLVRPAMVVVSQSSE >NC_014532.2|WP_013333915.1|3715677_3717351_+|DNA-repair-protein-RecN MLTELAIRDFAIVDHLALELEGGMTAITGETGAGKSILLGALGLCLGERADAGSVRHGCERADLSARFDIAELPAARTWLDERELPSDDCLLRRVVTRSGRSKAWINGQPATVADLKALGDHLIEIHGQHAHQGLLREETHLHLLDDFADHDEAVRDMAATFHAWRESHQRLKRLSEDNDEIRARLQLLRYQVEELDQLALAEGELEGLESEQETLAHAEERLREAQFAAQCCDGDEGGALPLLHQAVNRLSALPGSERSALADALSMLGDACIQVEEAGRELNHFAAGVELDPERLAWVEERLGEVHRIARKHQVAPHELVSLHQHLTEELAELEGGDGDLDALAAEVENLKQAWRQRAEAVSATRRKAAQRFGKAVQEQLAFLAMGKASFDVELTPRDTPSPEGLERARFTISANPGQPARPLTKVASGGELSRISLAIQVVAAQHSTIPSLVFDEVDVGISGATAEVVGQLLRRLGKGGQVMTVTHLPQVAAQAHQHLHIAKQAEDETTLTHMALLDEAGRVGELARMLGGMKLTDQTLAHAREMLDASQRAHH >NC_014532.2|WP_013333916.1|3717429_3717876_-|ferric-iron-uptake-transcriptional-regulator MADQNHELRKAGLKVTLPRVKILHILENATGQHHLSAEDVYKTLLEAGEDVGLATVYRVLTQFETAGLVTRHNFDGGHAVFELTQEEHHDHMVCLDSGEIIEFFDDTIERRQQEIAEEHGYELVDHALVLYVRPKGSRVTRQEPSGKK >NC_014532.2|WP_013333917.1|3717909_3718368_+|outer-membrane-protein-assembly-factor-BamE 
MIDQNHDSEEQAQMQKLTRTVTLTVALTLVSGCSYFGVYKRDLAQGNLVTSAMAEQLQPGMTRQQVVNLMGSPMLEAPFDAQQWDYVYRLDKAYGGVEQRRLTLTFQGNRLADIDRHGDFSRPPSVADERGIGPTDSTNARGNLLNARPDDE >NC_014532.2|WP_041602245.1|3718375_3718690_-|RnfH-family-protein MDAEEQGMVHVEVAFALPNRQRIVSLTLPAGTHAREAVRQADLAHYFPDVPPETFENAALGIFGKALRDPERHILQEGERVEVYRPLRIDPKAARASRAADKRG >NC_014532.2|WP_013333919.1|3718679_3719114_-|type-II-toxin-antitoxin-system-RatA-family-toxin MPTVNRSALVRHTPQQMFDLVNDFERYPEFLPGCRRARLLERDAEHLVGEMTLGRAGIEQSFTTRNDLQEPERIDLSLVNGPFKRLRGRWLFMPMGEDTCKVSLEMEFEFANRLLGMAFGKLFQQVAGQLVEAFTRRADELYGR >NC_014532.2|WP_013333920.1|3719243_3719735_+|SsrA-binding-protein-SmpB MANKKGKGKGPGSNAIALNKKARFEYHIDETFEAGLALAGWEVKSLRAGKAQLTDTYILVKNGEAWLLGSHITPLNTVSTHEVADPTRTRKLLLHRKEIARIFSRTQDKGHTCVPLKLYWKGSKVKCELALVTGKKLHDKRATEKDRDWQRQKGRILREHNKT >NC_014532.2|WP_041602246.1|3720322_3720553_+|DUF1654-domain-containing-protein MAKKHRPTSYELLGQRVKQIIAAATWREQRQVHLQPAEGDSPDDWDRLIDEISENENVDVTRTDEGWLVSWVPVGA >NC_014532.2|WP_041602248.1|3721072_3721255_+|type-II-toxin-antitoxin-system-HicA-family-toxin MKSSELIKELEADGWQLDRIKGSHHHFRHPSKPGTITVPHPKKDLKKGLVQGIRKQAGLK |
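The NC_014532_4 record above lists a 23 nt repeat flanking a single 43 nt spacer, with the array spanning 3712097-3712185 and the spacer header giving 3712120 as its start. As a rough consistency check, the sketch below splits a repeat-spacer-repeat string back into its spacers and recomputes the 1-based spacer start. The helper name `split_array` is hypothetical, and the sketch assumes every repeat copy is an exact match, which holds for this array but not for CRISPR arrays in general.

```python
# Illustrative sketch only: split_array is a hypothetical helper, not part of
# CRISPRCasFinder or any other tool named in this record.

def split_array(array_seq, repeat, array_start):
    """Return (1-based genomic start, sequence) for each spacer in an array
    laid out as repeat-spacer-repeat-...-repeat with exact repeat copies."""
    spacers = []
    pos = 0
    while True:
        if not array_seq.startswith(repeat, pos):
            raise ValueError(f"expected a repeat at offset {pos}")
        pos += len(repeat)               # step over the repeat
        if pos == len(array_seq):        # the array ends with a repeat
            return spacers
        next_repeat = array_seq.find(repeat, pos)
        if next_repeat == -1:
            raise ValueError("array does not end with a repeat")
        spacers.append((array_start + pos, array_seq[pos:next_repeat]))
        pos = next_repeat


# Sequences and coordinates copied from the NC_014532_4 record.
REPEAT = "GCCGTGACAGGTCGGGCAGGTCT"
SPACER = "GCTGCACGGCGAAGAAGCCTTGCTGCATGCGTACCTGACCGTG"
ARRAY = REPEAT + SPACER + REPEAT
ARRAY_START, ARRAY_END = 3712097, 3712185

for start, seq in split_array(ARRAY, REPEAT, ARRAY_START):
    print(start, len(seq), seq)
```

The array length (23 + 43 + 23 = 89) agrees with the reported span, and the recomputed spacer start agrees with the `>4.1|3712120|43|...` header.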
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_014532_1 | 1.2|126799|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT | 126799-126830 | 32 | NC_013190 | Candidatus Accumulibacter phosphatis clade IIA str. UW-1 plasmid pAph02, complete sequence | 27706-27737 | 8 | 0.75 |
NC_014532_1 | 1.1|126739|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT | 126739-126770 | 32 | NZ_CP032325 | Azospirillum brasilense strain MTCC4035 plasmid p4, complete sequence | 598119-598150 | 9 | 0.719 |
NC_014532_1 | 1.1|126739|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT | 126739-126770 | 32 | NZ_CP007796 | Azospirillum brasilense strain Az39 plasmid AbAZ39_p3, complete sequence | 48757-48788 | 9 | 0.719 |
NC_014532_1 | 1.4|126919|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT | 126919-126950 | 32 | NZ_CP028971 | Aminobacter sp. MSH1 plasmid pUSP3, complete sequence | 9439-9470 | 10 | 0.688 |
NC_014532_1 | 1.4|126919|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT | 126919-126950 | 32 | NZ_CP015008 | Aminobacter aminovorans strain KCTC 2477 plasmid pAA03, complete sequence | 106268-106299 | 10 | 0.688 |
1. spacer 1.2|126799|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT matches NC_013190 (Candidatus Accumulibacter phosphatis clade IIA str. UW-1 plasmid pAph02, complete sequence) position: 27706-27737, mismatch: 8, identity: 0.75
aaggcataaagatgaatacattgagctcccat CRISPR spacer
aaggcacaaagatgaatacagtgatcgctatc Protospacer
******.************* *** * *.  .
2. spacer 1.1|126739|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT matches NZ_CP032325 (Azospirillum brasilense strain MTCC4035 plasmid p4, complete sequence) position: 598119-598150, mismatch: 9, identity: 0.719
tcccggcggacggaaagcttggcagaccagcg CRISPR spacer
ctccggcgaacggatagcttggcagcggcacg Protospacer
..******.***** **********    .**
3. spacer 1.1|126739|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT matches NZ_CP007796 (Azospirillum brasilense strain Az39 plasmid AbAZ39_p3, complete sequence) position: 48757-48788, mismatch: 9, identity: 0.719
tcccggcggacggaaagcttggcagaccagcg CRISPR spacer
ctccggcgaacggatagcttggcagcggcacg Protospacer
..******.***** **********    .**
4. spacer 1.4|126919|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT matches NZ_CP028971 (Aminobacter sp. MSH1 plasmid pUSP3, complete sequence) position: 9439-9470, mismatch: 10, identity: 0.688
gtaagcgccgcatgctgtgggcgtcacgccct CRISPR spacer
agaagcgccgcatgttgcgggcgtcatcgata Protospacer
. ************.**.********.   .
5. spacer 1.4|126919|32|NC_014532|PILER-CR,CRISPRCasFinder,CRT matches NZ_CP015008 (Aminobacter aminovorans strain KCTC 2477 plasmid pAA03, complete sequence) position: 106268-106299, mismatch: 10, identity: 0.688
gtaagcgccgcatgctgtgggcgtcacgccct CRISPR spacer
agaagcgccgcatgttgcgggcgtcatcgata Protospacer
. ************.**.********.   .
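The mismatch and identity figures in the hits above appear to follow from a position-by-position comparison of the 32 nt spacer with its protospacer, with identity equal to 1 minus mismatches divided by spacer length (8/32 gives 0.75, 9/32 gives 0.719, 10/32 gives 0.688). The sketch below recomputes them. The function names are hypothetical, and the match-line symbols (`*` for a match, `.` for a transition, a space for a transversion) are a convention inferred from the alignments shown here, not a documented one.

```python
# Illustrative sketch only: function names are hypothetical, and the match-line
# convention is inferred from the alignments in this record.

PURINES = {"a", "g"}
PYRIMIDINES = {"c", "t"}


def mismatch_identity(spacer, protospacer):
    """Count mismatches between equal-length sequences and return
    (mismatches, identity), where identity = 1 - mismatches / length."""
    if len(spacer) != len(protospacer):
        raise ValueError("sequences must have the same length")
    mismatches = sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))
    return mismatches, 1 - mismatches / len(spacer)


def match_line(spacer, protospacer):
    """Build the per-position symbol line under the inferred convention."""
    symbols = []
    for a, b in zip(spacer.lower(), protospacer.lower()):
        if a == b:
            symbols.append("*")          # identical bases
        elif {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            symbols.append(".")          # transition (a<->g or c<->t)
        else:
            symbols.append(" ")          # transversion
    return "".join(symbols)


# Spacer 1.2 against its NC_013190 protospacer, copied from hit 1.
spacer_1_2 = "aaggcataaagatgaatacattgagctcccat"
proto_1_2 = "aaggcacaaagatgaatacagtgatcgctatc"

mm, ident = mismatch_identity(spacer_1_2, proto_1_2)
print(spacer_1_2)
print(proto_1_2)
print(match_line(spacer_1_2, proto_1_2))
print(f"mismatch: {mm}, identity: {ident:.3g}")
```

The same two functions can be applied to the other hits by substituting the spacer and protospacer strings listed for each.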
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1244941 : 1254916
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_014532|1244941:1254916|DBSCAN-SWA GTCATGAGCCGCTGTACTCCTTGATCGAGTTGAATGTCACGCCGTAGATGCCGGTGGGCGCTTGGCCTGACGTACCTGCTTCCAGATACTTTCCATAGGGCGTCGTATTGACGATATGGAAGGAATCGCCCGGTTTCAATGCGGCCATGGCCCGCCGAGCATCGGCAAGCGTGATGCTCCGCGATGGATCGCGCTTGTTCTCGTCGAACGTCTCGTCTGGAGTGTTGACGGAGAAATTGTTGTTCGCGAGGAAGCGGCTCGTATCGATTGGCGCCCGGTAGATAACGCCCCGTGACGCCTGCTCGCAGAGCTCGCGGAACAACACCATGGTGTCGTCCTCGACTTCTAGCGCGAAATCAGTCGGTTTGCCGCTCCATCCGCCCTTGGCCATGTCAGTTCACCTTGAGATGCACGGAATAGGTGGCACCTAGCGGATCGCCGGATACCTCGACCACTCGAGCTGTCCGGCCGTCGGCCAGCGTGATGACATCATCGACCTGGGGAACCTGCTCGGTGGCGTTGGCCAGCATGCCGATCTTGATGTCGGCACTGGTCAGGTCCAGCGACTGGATCTCGGCGAATGTGAACTTGGACCGCCAGTAGCTCCCGGAGTAGGGCAGCTTGGCCGGCTCGTAGGTGCCAGTGTCCGGGTTATAGACCCCGTCGCCCGGCAGCGTCCGCTTGGCTGTGAACGCGAAGTAGGCCTGGCCCAGCTCCTCGTCATCGGCGAACGCTTCGGCCAGCTCGGTCTGGATCTCATCACGGGCAGTCACGGCGATACCCCACTGTGATGCTGCCGCGAGCGGCCTTGTAGGGGCCGATCAGCGCCAGGGCGAATTGGACGCGGGAGGGCAGGGCAGCATTGCCCACGTTCGACGTGGAGATCTCGGCATAGCGCTTCTCGACCTCGGCACCGTCTGCCTTGACCCGCTTTTGCGTCATCGTGCCCTGCGATTGCTGGGCATATAGGCTGCCATCTGCCGCGGCGGCCGCCAGCTCGACGCCGGCCATGATCACGTCGGCCGGCACGGGATCGGGCACCTGAATGCCCTGGGCGGACAGGTAGGCGTTGGCCATGGCCACAAAGCGCTCCTTGTCAGCCTCAGGTGCCCAATCCGCACCGAGCTTCTCATCGACATTGGCGACGGTCACATATTGCGTCATGACAGCCTCGGAATCAGGGAGGCCCCGCAGGGCCTCGGGTTACTCGGCCGCTTTCTGTTGCGGCTGCTCAGCCTTCTGGGCATCGCCGCCTTTCTCGGCGGGCTTTTCGGTCTTGGCGCCGGCGGTGCGGCCGTCCACCTTGGAGAGCACGCCACGCTTGACCATGCGGGCAAGCCCCGGCGCCTTGTCGTCGACATCAAGGGTCCGCTTCGGGGCGGCAAACTTGCCGTCCACATTCACGACACCCAGGCTTTCGTTGCGGTACTTACCCATGATCAAACCCCCACGGCTTTAGTGAGTGACAGCGGACGGTAGACCGCCACGCCGGAGCACTTGGCGATGCAGTCGATCACCATCTCGAGATTGCGCTTCTCCGCCGGCAGTTGCTCGAACGGGCGCGGGATCTCGATGGACAGGTTGTCGACGTCACGCTCGTACATGACGGCAGAAGACTGCTGCCCGTTGGCATCGCTCGGCTCGAGCTCGGTCGAAGCGATCAGGCTGACGTTCGGATAGAGCGCCTTAAACCGAGACCACACGCTTTCCCAGTTGCCGCCCTCGTTGACGTTCTTGCGCATCAGCGCAGAGCGACGAGTCGGGGCCAGCGCCAGGGTGTTGGGTGTGTGGTAGTCATAGGACTGGATCTGCACGGCGTCATACATGGCGAGCAGATCGTCGAGGATCTGCTGGCCGGTAGTGGCCGAATTGGTCCAGTCGCCGTTGACGGCGGTTTCGCCCAGGTTCGGGTGGTTGGTGAAGCC
GTAGAAGCCGTGTTTTGCATCACCGACCATGGCGACCTTGTTGATCAACTGGTCGATGGCACGGCGGGCGGTGCGGCCCTTGCGTTCCGGCAGATTGGTGCCCATGGCGTTGGCCAGGCGCATTTCATCCAGGCTATAGCCGTAGGCATCGCCTAGGCTCTTGAGCTTGAGCGTCTCTTCCTTGCCGGTCACATCCACGCGCGGCAGGTCATCGGCGTAGTTCGAGATGAACTTCGCCATGCCGACTTCATCGTAGGTGCGATAGGTGAACGACTCGGCCCACTCGGGCACCTCGGTGGAGACCGGCACCAGGTCCAAACCAGTCAGACTCGGGGTGATGGTCTCGTAGATGCGAGACTTCACATAGTCGAGCTGACGCGCCAGGAACAGGCCCTCGTCCTCTCGCACGATGCCCAGTCGGTCGCCGGCATCCATCACCGCCGCCAGATCGGCGGCGTCATAGTTCATGTGCTGTTTCATCGTCGTCGATCCTTATGCGGTCGGAGAGTGGAGCTCGACGCAGGCAATCACCCGGCCATCGTTGAGGGTCACGGCGTCGGAGCGGAACACGGCGTTCGGGTAAGCAGTACCCACGTCGGCATCCTGTACTTCGCCGGCGGCGTTGAACTCGATGGCACCGCCATCGGTAACGGTCCCGCCATCGATCACGGAGCACCAGGCCAGGCCTCGGGTCAGAACCGACACATCGTCGTACTGGACGTAGTTGTCGATCTTGGTGTGGGTGTGCAGGGCGATGCCGGCGATGCGAGTACCGCCGCCTTCCACCACTGCGCCGCTGGCGTTCACGCCACAGACACGGCCGAAGCGCACGTTACCGCCGGCCGGGAACGACTCGACACGGTCAAAGCCGGCGTCGGCCTTCATGCCCGGGATGGCTTTTTGCATCTCGAAGATCATTGGTCATCCTCCACTTTCTCGCCGCGCATGCGGGCGGCCATTTTCTCGCGGGCTTCGGCAGCACTGCCGCCTTTGGCAGGCACGCCATCCTGGTTGATTCCGGTGGACACCTTGCGCTGATTGGCGGCATCGTCCCGACGCTTGTCGGCGTCGGCCACCGCGTAGTCGTAGGCAGCGTCGATATAGGCATCTTCCTTGTCGGTCAGGTCGACGCTATCGCCGCGAATGGCCTTGATCACGCTCACCTTGAGGTCGCGGTCGGCGGCATCCTCCTTCACCTCGACCTTGTGCTCTTTGGCCTTGGCCTCGAGCTCCAGTCGGGCACGGGCCTGCTTCACAGCGTCCTGCTTGATCTGCTCAGCCTGGTTTTCGGCTTCTTCTTTGGCTTGCTCGGCAGCGTCGGCCCGGGCCTTCTCCTGCTCGAGCATGGTTTTGGCATCGTCAAGATCAGCGCGAGCCTGCTTGTAGGCATGGCCTACCTCGGGCGCGGCGTCGTACTCGATGCCGTTGTCGAGACGCACTTTGTCCATAATCGGATCACCTTCATCGTCGTCGGTTGGTGTTGCGGCATCGGCCGCGTCGCTCATTTGCTCGGCGTCTGCCGAATCGAGATTGAGCCGGGCATTGCCTGCGCGGCCCTCGGGCACGACAGCCAGGTGATTCACCCGGATGTTCGTCTGAATGGCGTCGTAGCGTTCGCCGTTGTACTCGCCGGGTTCTTCTATCGTGTCCACCAGGTAGCCCACAGAGAGCTCTTTCCAGCCGTCCTGCTTCACCACGCGTGGGTCGTGGATCACGATGTCGCCAGTCACATCGTTGCCATCCTGCCGACCTTCGCTTGTCACCGTGCCGATCTGGTACTTCTTCACGGTGGACGCCGCGACCTTGCCGTGGTGGCCCTTGGTGATCGGCTTATTGCGATAGGTGGCCAGCGAGTCCTGGGAGAACACGTCCTCGGGCCGGCGCAGTTCGCGGCGAATGCCGCCTTTTCCATCCCTGTACGTAAAAACGCCCGTGCGGGTCAGCACAGGCGAGTCTTCGAGGTATCCGTCCTCGGTGAACCGGGCTTTGATCGGCGCCCG
GTCGTAGCGATAAACACTCATTGCAATGTCACCTCGTCTCGGGGCGGCCATTTCGGGGAGGCCCAACAGCGACAGTGGATCGGCTGACCGGGATGGCCATCCCAGGGCGGCTTCGACCACTTGAAGGACTTGCGCTCCCGGCGCCGGTGGTGAGCACGCTCACGCCAGTCCAGAACACCGCGCCAGTAGTATTCGTCGATGCCAATGGCTTGCTGCCGATACTGGGTCAGGCGGCCGTTGAGCTTGCCGATCTGGTCGTTGGCGATCAGTTGTGCCCGGTTGACCGGCTTGTCGTAGGTCGCCCGGATGGTGTCGCGCAGGTCGGTGGCCGATTGGCCCTCGTTGATCGCGCGGATGATCACGCCCTGCAGATCATCGACGTATCGCTCGGGGATGGACCGGATCAGCTTGAGGTTTTCCAGCTCCCAAGCCCGCATCAGCTTGTCGAGGTCAGGCTCGGCCTTGGCGTAGGGCTCGCCGTAGACACGGCGGACCAGCTTCACGAACTGGCGCAGGTTGAAGCGGCTGACGCGCTCGCCGAACTCGGCCACGCCGACATCAGAGGCTGACGCCGGTCCTGCCATGCCCTCGAAGACCTGTTGGGCGCGGCGACCGAGGAGGGCGGCCAGAACAGTGCCGAGGATCTCGTTCAGGCGGTCGGCCCAGGTGCCTTCGTCGGACTCCTCGCTCTCGCCCAGGGCGTCGCTGCGCATGCCTGCCATCTCGACGAGACGCGGCACCTCCGGCATCAACAGCTCGTTGCATGCTCGCTGGCATCGGCGGGCGTATTGCTTGAGAGTCTTCGTGTACTCGCGCTCGATAGCCTCGGGATGCGCCCATTTCGGCTCAGGCATCGTCTAACGCCTCCAGGCCGTAGAGCCCTTCACGCTCCATGTGGCTGCGCAGGTCGCGCTCGCTCACGGCGCCAGTCTCAAAGTCCAAGGTGCGGGCTTGCGCCTCGTCTCGGGCCGCCTGTGCGTTCTTCTGGCGGATCTCGGCCTGCTCCTTCTCAGTCGGCGTCCACAGGCTGGGCCAATGAATCGACCAATCCGATGGCGCACTGACGCCTGACTGCCGGGTGATGGCGTCGACAAGCCGCTCAAGCGCCGGCTGCGCTCGGGTTTTCTGGATGCCTTCGACCAGGTTGTAGAGGTTGTCGGTGTCGTTCTCGCCGGTGTTATTGAGGCCGCCGGCAGACTGCATGAACAGCGACGTGACCGGAATGCCTGAGTCGGCGGACACGGCAACCTGAAACTCGGCGATCAGGTCTTTGATGCCGGAGACGTTGAGGTCATAGACCTGGTAGTCGTCATCCGCGTCCACGGTCACACCGTTCATGAGGCTGCGCACCTCATCCACCATGTCGACCCGCTTTCTGACTTCCGCTTCATCTCCGTCTTGTATCGCCTGCGCCAAGCCGCTCATCTTGTGCACGGCCTGCTGCTTGCGCTTGAGAACCTCGAGAGCAAGGCGCAGCGCACGTTCGTAACGCTCGATGGTGTGGAACGCGGCATCGGCGGATGGACGCCCCGCCCAGGGCACCGACTTTCCGACCTTGAGTCGGGCCGGCAGGGGTTCGCCCGGGACCGGCAGCAGGCGTGACTCATGGACCTCGAAATAGCCGGTGCCGGCGTAGTTGCCGGTGATCAGTGGGCGGACGTGGTAGACCTCGGGCTGGCCGTATGTCGGATCTCGAGGATCGTCGTAGTAGCCGCCCGGGGCCACCGAGAGCTGGCTGTATTCGATGACGCGGATCTCGGTGATGCGCCCCACATTCTCCGGTAGCGGGTCGGCCAGCTCGCCGGTATCGGTCAGCAGCAACAGCGCCGCTGCGCCGTCCAGGCTCGACCAGCGAATGGCATCGGCCAGGGCGCCCATGACATCGAGCCGATCCAGCTCGCCCATAATGGCCCGCTTGTCATCACCATTGATCTCCACGCCTCGAGACACGGCCATGTCGGCTGGTATATCGATCACGCGAGCCGCGATACCGCCCCGGG
CATACATAGTCGCCGGATCGGTAGGCGCAACCTGGCGGCGGCGGTCACCCAGCACTGCCGAGCGATAGCCGTCCTCGTAATAACTCGGCATGGTGTCTCCTAACTTGCGAGCGCTTTGAATCGATCGGTTACCGACTTGTTCGGCGCGTAGAGGATCATCACGGCGTCAGCCAGGTTGGGCGACTTGGTGCCTTCCGGCGCCTTGTCCACCACCACCTTACCGGTGCCATTGACGTGATAAGTCGGCTGCGAGAGCTCGAGCATCAGCTTCGACAGGATCGGAAGCCGGCTGTCGATGGAGATGATGTCGTCGGGGTCGAATTCCATGCCCTCGACAACGGCTCGGTAGGTACGCTGGAAGCGAAGTCGAAGCGCCCACCAAGCCTGGGCCTTGAGGTTGGCGAAGAAATCCTTGTTCTTGCGCTTGGGCACCATCTCCTTGTCGGGCTCGATAACGCCGCCCGATCCACGGAATGGGTTTACCTTCAGCTTCGGGCGCTTCTGCTCGGCACGCTGCTCGTTGATCACCCGGGCATCGCCACGCACCCCTGAGCCTAGGCCGTCAGCGTCGTAATCGAACCGACTGCCGCCGTGCTCATCGGTGTGATCGAACGCCTTTTGGACAGTGCCGAAGATATCGGAGCCTTTCCCGGTCCACTCATCAACCAGGTCCAGCAGGATGCCGTGTCGGCCGGCGTAAGCGTTCTGGTCCTTGCCCTCGTCGGCCACGTCCAGGGCGCCAAGCCGCTCGCCTGTGATCTCGATGCCGAGCTTCTTGTGCGCGTCTACTGCCGCCTGCACCCAGGCCGATGGGATCAGCACACCCTCGACCGAGGCGCTGTAGTTGATGTCGATCTCCTGGGCCACGGTGACCGGGTCCAGCTCATCGACCTGTTTGGCGTACCAGGCATCATCCTTCCGGGGATCGTCCCGCCAGTGGAAGGTGAATACCGAAATCTTGCCGCTATGGCGACGCTGGGCGAACGGATTACCCATGCCGTTGGGCGTCGACACGTCCTGCCGGCAGTTGGTGGTTGCCGACAGTGAGGCGTCCACCAGGTGCGGCCGCTCGAGGAACGCCGACTCGTCGACGATGTAGAAGCCAGTCCGGTCACCGCGCCCGATGCCATCACCCGACTCGCCGGTCATCACCGACTCTGTCTCGGGGAACATGATGCGCATATGCGGCGCGTGTCGCTTGCGCTCCCATCCGCCCCGGAACTCACGGGGCAGCATGCTCATGAAGCTCCGCGCCTTCTCGAACAGCGACTTCGGCGAGCCGATCTTATCGACGTATTCCTCTTTCCGAGAGCCGAAACCGACCGCCATGCCTGGGTGATGCAGGCAGACAGTGCACCCTAGGCCGATGGTCAGCCAGGACATGCCCATGTCGCGGGTCTTCTCGGTGATGCCCGGCTCCTGCCGGCGCCACCGCTCCATGAACCACTCGATCCATTCCTCCTGGCGTGGGAACAGCAGGAAGGGAATGCTGGCCGGCAGCCCTCGCTCGACGTTCCTGGGGTCATAGGTCATGCCCCAATCGATGATGAACTGGGCCGGGTGGTCGCGGTAATACTGCCGCATGGCCGGCAGGCAGCTCGGCTCCTCGCGAATCCGTCGCAGCCGCTCGGCCCGCCACTCGAACACCTCGACATAATCCGGGTTTCGGAAATCGAAGGGGAACGGTATTGGCATCAGTCCCCCATCATGTCGCGGTAGATTCTGGCCGCCTCCTCGGGATCGTCGGTGCTGATACCCACCGTTTCGATGGGGCCGCCATTCTTGCCGGTGTGCTCATGCCGCTCTCGCCAGGCCTGGACATCGACGTGCTTGCCGAGCAGCTCCAGGTTCTTCACCTTGTCCGGCCACTTGATCTTCTTCAGCAGGCCGATCTGTTCGCGTTCGTCGCCCTGGCCATCCCATAGCTCGGCCACATCCAGACCTGATAGCGTGCGGCGCCATGACTGTGGCCACTCCCTGACAGGCAAGATGCTGCCGTC
GTCGGCCAGGATGTCGGCCACATCCATCTGATCGATCTCGACCAGGCGGTGGAGCACATAGTCAGCGTCGATCTTGGTGCGCTCTGAGCGCAGAGCTTTGCGCTGCGCGATGGCCTCCGCGACCTTCGCATTGCTTAGCAGGCGAGAGGCGGTGCCTTCAGCGGATCGCTCGGCATAACCGGCACGGATGGCAGCCTGCTTGCCGTTCAAATCCTTCAAGTACTCATCAACGAATCGAGACTGCCGAGCCGTCAGCGCGACAGCCTCCTGGGCAGCCTTGGTCATACTGATTCCTTGTCCGTGATAACCTCGGAAGGGCGCCCCAACCGAGAGGTGAAGTTCTATGAGGGTTCAATATCAACTGGAATCAGACCAGTCCGTTCTGGAGGTCCGCGAAGAGGCTGAATGGCCTCGTCACAGCAAACCCATTCGACTAACAAGCGGCACCTTTAAGCTGGTGGATACTGACGTATCAAATGAAGAAGGCGTCGACCTCATTGCGACTGTCCGGCCAGGCAAGCCTGGTAAAGTCACTTCAGTGGTGGTTTAGCTGTTATCGGACTGATGCCGCCGACGGTTTGCCAGTCAGGCGCACTATCGGCCTCACCTGCTTTCGCATTGATGAGCGCCCCGACCAGGGCTGTGTTGAGACTGCGTGGGCAGAATGGGGCGCTCAGCGATGCGTTATCGGTATTTCGATATCGCCTTACAGGCCGCGGAAGAAGGCCCATCCAACAGCGGCACAGCACACCACGAAGATGGTGGCCGGGATGCCGATCGTGATGCCGAGTGTCCACAGCAGCCAGGTAGGGATTTCGATAGTCATCACTCCCTCCGATTCAGCAGCAGGCCCGCAACGAACACCGCCACCAGCACCCACGCATAGGGCCGGGCGATGGCGTTGCGGGCGAGCCTTACAGCGCGGCTTGGATCTTGGCGCCCATTAGCCCCAGCAGCGCAGAGGCTGCGCTCGCCACTTCGAGGGGAATCTCCACCCCGACGAAGGTGCTCAGCGCCCAGGACAGGACGACCGGCAGGCCTACACCCAGGCCGTCGGCGTTGCTGCGAGCGCCTTGCGCGAAGCGGTTCGGCTTTTTTTCCAGCGTAACGCCCTGAACTGGCGCGCCTTGATCGATGTTTGCTTGGTCGGACATGGTGGGTACCTCAAGGAAAGATGATGCGGGACAGGAAAGCGCCAGCGATGGAGGCGGCAGCGATGGCGGTGGTGCCGATAATCTTCCAGCGCCCTGATAGGTGCTGGTATTGCTGCGTGCCAGTCGCTTGGGCTACAGCGTGATTCAGCTCGACATGTCTGATTCGCTGAGAGTGATCGGCCAGCATCCCGTCGTGCGCCTCGATCTTCGCGCCTTGGTTGTGCTGCCGCTCTTCCGTGCGGGCCGTGCGCTCGTTGAGCTCTTTGACGCCAGCTTCAATGCGGTCGAGTTGGGATTGTGTGCTTGGCACCTGCTCCATACTCACGAGCCCCCCTTGCCGCAGGGCGACAATTCGATAACACGACTCAGCCAGCCGTAGGTGAACGCCTCTTGGCTCTCTCGAGCCTCGGCAATCGACACCATGAACTCGATGCGCAGTGCGTTGACGGCCTCGGCCAGTACATCCAGCCCGGGGCGGCCACGGGTGTCGTGGTAGTCCTCCAGGGCGCCGATGGTGGCCGGGCCGACCGCACCGTCTACGGCGATGTCGTCGTAGTCACGCTCGACGCGATTGAGCACGTTGAGCAGCCGCTGCAGTGTCTCAGCAGGGCGACCTGGGCCTGAGTTGACCCCGTAGTCGAACAGATACGCCGCCAGCCCCTCGTGGATCATCGCCACCTTGTCGAGCCGCAGGCTGTGCCAGTACCGGTCGGCGTAGATGTCGTGGGCCAGCACAAGGGGCAGGGCGCGCATGTCGCCGGTATAGCCGTAGTCCCGCGCCACAGCCTCGGTGATGCCGTAGCGGGTCGGGCCCCCACGATCCGCCGAATGATCGACATAGCCACC
CTCACGGTCGATCACCTCGCCGATCAGGCGATTTTTCAGCGACTGATGCAT
Protein sequences of DBSCAN-SWA_1 >NC_014532|1244941:1254916|1248882_1249719_-|WP_013331847.1|capsid|DBSCAN-SWA MPEPKWAHPEAIEREYTKTLKQYARRCQRACNELLMPEVPRLVEMAGMRSDALGESEESDEGTWADRLNEILGTVLAALLGRRAQQVFEGMAGPASASDVGVAEFGERVSRFNLRQFVKLVRRVYGEPYAKAEPDLDKLMRAWELENLKLIRSIPERYVDDLQGVIIRAINEGQSATDLRDTIRATYDKPVNRAQLIANDQIGKLNGRLTQYRQQAIGIDEYYWRGVLDWRERAHHRRRERKSFKWSKPPWDGHPGQPIHCRCWASPKWPPRDEVTLQ >NC_014532|1244941:1254916|1247779_1248886_-|WP_013331846.1|DBSCAN-SWA MSVYRYDRAPIKARFTEDGYLEDSPVLTRTGVFTYRDGKGGIRRELRRPEDVFSQDSLATYRNKPITKGHHGKVAASTVKKYQIGTVTSEGRQDGNDVTGDIVIHDPRVVKQDGWKELSVGYLVDTIEEPGEYNGERYDAIQTNIRVNHLAVVPEGRAGNARLNLDSADAEQMSDAADAATPTDDDEGDPIMDKVRLDNGIEYDAAPEVGHAYKQARADLDDAKTMLEQEKARADAAEQAKEEAENQAEQIKQDAVKQARARLELEAKAKEHKVEVKEDAADRDLKVSVIKAIRGDSVDLTDKEDAYIDAAYDYAVADADKRRDDAANQRKVSTGINQDGVPAKGGSAAEAREKMAARMRGEKVEDDQ >NC_014532|1244941:1254916|1245699_1246104_-|WP_013331842.1|DBSCAN-SWA MTQYVTVANVDEKLGADWAPEADKERFVAMANAYLSAQGIQVPDPVPADVIMAGVELAAAAADGSLYAQQSQGTMTQKRVKADGAEVEKRYAEISTSNVGNAALPSRVQFALALIGPYKAARGSITVGYRRDCP >NC_014532|1244941:1254916|1252559_1253150_-|WP_013331850.1|terminase|DBSCAN-SWA MTKAAQEAVALTARQSRFVDEYLKDLNGKQAAIRAGYAERSAEGTASRLLSNAKVAEAIAQRKALRSERTKIDADYVLHRLVEIDQMDVADILADDGSILPVREWPQSWRRTLSGLDVAELWDGQGDEREQIGLLKKIKWPDKVKNLELLGKHVDVQAWRERHEHTGKNGGPIETVGISTDDPEEAARIYRDMMGD >NC_014532|1244941:1254916|1249711_1250956_-|WP_013331848.1|DBSCAN-SWA MPSYYEDGYRSAVLGDRRRQVAPTDPATMYARGGIAARVIDIPADMAVSRGVEINGDDKRAIMGELDRLDVMGALADAIRWSSLDGAAALLLLTDTGELADPLPENVGRITEIRVIEYSQLSVAPGGYYDDPRDPTYGQPEVYHVRPLITGNYAGTGYFEVHESRLLPVPGEPLPARLKVGKSVPWAGRPSADAAFHTIERYERALRLALEVLKRKQQAVHKMSGLAQAIQDGDEAEVRKRVDMVDEVRSLMNGVTVDADDDYQVYDLNVSGIKDLIAEFQVAVSADSGIPVTSLFMQSAGGLNNTGENDTDNLYNLVEGIQKTRAQPALERLVDAITRQSGVSAPSDWSIHWPSLWTPTEKEQAEIRQKNAQAARDEAQARTLDFETGAVSERDLRSHMEREGLYGLEALDDA >NC_014532|1244941:1254916|1254029_1254344_-|WP_157953394.1|DBSCAN-SWA MSMEQVPSTQSQLDRIEAGVKELNERTARTEERQHNQGAKIEAHDGMLADHSQRIRHVELNHAVAQATGTQQYQHLSGRWKIIGTTAIAAASIAGAFLSRIIFP 
>NC_014532|1244941:1254916|1246379_1247345_-|WP_013331844.1|DBSCAN-SWA MKQHMNYDAADLAAVMDAGDRLGIVREDEGLFLARQLDYVKSRIYETITPSLTGLDLVPVSTEVPEWAESFTYRTYDEVGMAKFISNYADDLPRVDVTGKEETLKLKSLGDAYGYSLDEMRLANAMGTNLPERKGRTARRAIDQLINKVAMVGDAKHGFYGFTNHPNLGETAVNGDWTNSATTGQQILDDLLAMYDAVQIQSYDYHTPNTLALAPTRRSALMRKNVNEGGNWESVWSRFKALYPNVSLIASTELEPSDANGQQSSAVMYERDVDNLSIEIPRPFEQLPAEKRNLEMVIDCIAKCSGVAVYRPLSLTKAVGV >NC_014532|1244941:1254916|1245332_1245713_-|WP_013331841.1|DBSCAN-SWA MTARDEIQTELAEAFADDEELGQAYFAFTAKRTLPGDGVYNPDTGTYEPAKLPYSGSYWRSKFTFAEIQSLDLTSADIKIGMLANATEQVPQVDDVITLADGRTARVVEVSGDPLGATYSVHLKVN >NC_014532|1244941:1254916|1244941_1245331_-|WP_013331840.1|DBSCAN-SWA MAKGGWSGKPTDFALEVEDDTMVLFRELCEQASRGVIYRAPIDTSRFLANNNFSVNTPDETFDENKRDPSRSITLADARRAMAALKPGDSFHIVNTTPYGKYLEAGTSGQAPTGIYGVTFNSIKEYSGS >NC_014532|1244941:1254916|1246143_1246377_-|WP_013331843.1|DBSCAN-SWA MGKYRNESLGVVNVDGKFAAPKRTLDVDDKAPGLARMVKRGVLSKVDGRTAGAKTEKPAEKGGDAQKAEQPQQKAAE >NC_014532|1244941:1254916|1247357_1247783_-|WP_013331845.1|DBSCAN-SWA MIFEMQKAIPGMKADAGFDRVESFPAGGNVRFGRVCGVNASGAVVEGGGTRIAGIALHTHTKIDNYVQYDDVSVLTRGLAWCSVIDGGTVTDGGAIEFNAAGEVQDADVGTAYPNAVFRSDAVTLNDGRVIACVELHSPTA >NC_014532|1244941:1254916|1253779_1254019_-|WP_041601944.1|DBSCAN-SWA MSDQANIDQGAPVQGVTLEKKPNRFAQGARSNADGLGVGLPVVLSWALSTFVGVEIPLEVASAASALLGLMGAKIQAAL >NC_014532|1244941:1254916|1254340_1254916_-|WP_041601946.1|DBSCAN-SWA MHQSLKNRLIGEVIDREGGYVDHSADRGGPTRYGITEAVARDYGYTGDMRALPLVLAHDIYADRYWHSLRLDKVAMIHEGLAAYLFDYGVNSGPGRPAETLQRLLNVLNRVERDYDDIAVDGAVGPATIGALEDYHDTRGRPGLDVLAEAVNALRIEFMVSIAEARESQEAFTYGWLSRVIELSPCGKGGS >NC_014532|1244941:1254916|1250964_1252560_-|WP_013331849.1|DBSCAN-SWA 
MPIPFPFDFRNPDYVEVFEWRAERLRRIREEPSCLPAMRQYYRDHPAQFIIDWGMTYDPRNVERGLPASIPFLLFPRQEEWIEWFMERWRRQEPGITEKTRDMGMSWLTIGLGCTVCLHHPGMAVGFGSRKEEYVDKIGSPKSLFEKARSFMSMLPREFRGGWERKRHAPHMRIMFPETESVMTGESGDGIGRGDRTGFYIVDESAFLERPHLVDASLSATTNCRQDVSTPNGMGNPFAQRRHSGKISVFTFHWRDDPRKDDAWYAKQVDELDPVTVAQEIDINYSASVEGVLIPSAWVQAAVDAHKKLGIEITGERLGALDVADEGKDQNAYAGRHGILLDLVDEWTGKGSDIFGTVQKAFDHTDEHGGSRFDYDADGLGSGVRGDARVINEQRAEQKRPKLKVNPFRGSGGVIEPDKEMVPKRKNKDFFANLKAQAWWALRLRFQRTYRAVVEGMEFDPDDIISIDSRLPILSKLMLELSQPTYHVNGTGKVVVDKAPEGTKSPNLADAVMILYAPNKSVTDRFKALAS |
14 | Pseudomonas_phage(33.33%) | capsid,terminase | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
1268701 : 1283915
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_014532|1268701:1283915|DBSCAN-SWA CATGTGGTTCCGCAACCTACACTTGTACCGCCTACACGACGCGCCCGGGTTGGACGACGCCTACCTCGAGGGGTTGCTGGCCGCCCAGGCCTATCGTCCCCTCGGCGGCAACGAGGCTCGACGCATTGGCTGGTGCCCACCCGCTGGCCGGGCTGGCACCCAGCTCTGCCACGAGGCCAACGCCCAGCGGCTGCTGACGGCTGTGCGACAGGAACGCCTGCTGCCCTCCGGCGTGGTGCGCGAAGAAGTCGAGGAACGCGCCGAAGCCCTTGAGGCCGACGAAGGCCGCAAACTGCGCCGCCAGGAAAGGCTCACCCTCAAGGAGCAAGTCTACGAGGAGCTGCTACCCCAGGCCTTCGTGCGTAGCACGAGGATCGATCTGTGGTGGGACACCCGACGCGGACTGATCGGCATCAACACCAGCAGCCGCAAGCGTGCCGAGGAAGTGCTCGATCTGCTGCGCGAGACGCTGGGCAGTCTGCGTGTCACCCCGCTGGCCACCAACATCCTGCCCATGCGGGCCATGACCAGTTGGCTCAGCGACCCGGGCACACGGCCCGCCGAGATGGAGATCGGCGACACAGTGGAGCTCAAGGCCAAGGGCGATGATGGCGTGATCCGAGGCCGGCAACTCGACCTGGATAGCGACGAGATCCATAGCCACCTGGAATGCGGTCGTCAGGCCAGCAAGCTCGCCTTGGGCATCGAGAGCATGATCCGGTTCGTTCTCCACGACGACCTGACCATCAAATCAATCAGGTTCGACGACGCGGTGATTGACGAGGCTGCCCAGCAGGACGATGGCGACGACCCCGTTGCTCGCCTCGAGGCTGACTTCACGATCATGACCCATGTTCTCGGCGTCACCGTCGACACCTTGCTCCAGTGGTTGGGTGGCGAAGCCCAGGCTGGTGCCGACTTCCCCTCAGCAGCATAGGAGGCCCCATGGCCCGAGGCATCAACAAGGTCATCCTGATCGGCAACGTCGGCCAGGATCCCGAGATCCGCTTTACCCAGTCCGGTACGCCAGTCGGCAACATCAACCTCGCCACCAGTGACACCTGGACCGACAAGCAGAGCGGGCAGCGCCAGGAGCGCACGGAATGGCATCGCCTGATCGTCTTCCGCCGCCTTGCCGAGGTCGCGCAGCAGTACGTCCGCAAGGGATCGAAACTCTACGTCGAGGGGCGACTGCAGACACGGAAGTGGCAAGACCAGAGCGGTCAGGATCGTTACGTCACGGAGATCATCGTCAACGATCTGCAGATGCTCGACTCCCGGCAAGGGCAGCCACAGGGCAATGCCCAGGCTCAGCAGCAGCCCTCACAGAACGGCTACTTCGACCAGCAACGGCAGTACCAGCAACAACAGACCGCGCCGCCGAATCCTCCTGGCGGGGGCGACTTCGACGACGAGATCCCATTCGCTCCCATGCACCCGCTGATGGGCGGCTGACACCACCGACACCCACCGCCCGGCCTCGTCGGGCTTTCTTTTGAATGGCCGCGCTGCCCGACGCGGTATCAGTATCGGGCTTACCGCAGGCGGTCTTCGGGCAGCAATAAAGGCCGTCAGCCCGCTGGCCTCCGCCTCCTTTGTCTCTATCCGCGCACTGCAATGGGCGGACGAGTCTACTGCACAGCAGCAGGAGGCAGGGCGGGCCGTGCGCCGCGACGAGATAGCGGCATTAAACTTTGTAGCCGCGCCTTGGTCACGGTGATTTTGGAAGATAAGCTTTGGGGAACTAACATCAACCAAAGCGCATGAGGGCAACAATGCCAGTAAGGATCCCCGGAGTCAGAGGTAAAGGCGGGGCTAGTCCTGCCGACTTAATACTGGAGCACATAGAGCTTTGCAGGGAAAATGTAAAGACAATAGAAAGGATAGCAACACACCAAAAGAAAAGGGAGATGAGAAATGAGATC
AACAAAAGAATACGAGCATGCAACAATTTGCTTGGCATGACATCCGGGTCTCGCCGGTTTGGGCACATCTACCGAGAAACTGATCTACAAAAAGGGGAAAAGCTGGTGTCAGAGCATGTTATACCAGTTTCTGAACTAACATCCTTGTATGAAAATGGTGTTCCTTTGGAAGAGCTTATATTCTACCCAATAGCATTGATCTCCAATGCATCAAATACTTTGCTAAATAAAAGAGGGTTGAATAGGTCGCGAAAGGATCGCTCAAAGCCATTCAGCAGATATTTAGAGGCTGGAATAAAAGTAGAGTCGCATCTTGGTCGTGAAGTGGAAACAAAAACATGGGGGATGGCCGATCATTGGGATTTGATAAATAAAACCCCAGAGTTAGCAAATATCATGGATGCAGTATATTCTCGTTCCTTATCTCACAAAAGTCACTAAAAAGTTTTCACCATAATGGGCCTCCTATCCGGTGTTCTTTTCATGCCAGGAAGAAAGCCATGATCCGCCGCCAAGCGCAGAGCCCGGCGGCGGGCGAAGTGGCTGGGTCAGGGCTGATCCAGCTTGTCAGACACTTTCTGCTTCAGGTGCTTGATCGCTTCCTCATCGGATATGTCAGGAACAGCGATATGCAGAATGCTTAGATTCTCACCTTCAGTAGAATCGGCGCCCATCCGCATCATGTACATCTCGTCCCGATCAGCCCATAGCCGAATGGAGCGATCGACATTATCGCCTTCAGCAACAGGCCCGTCCTCTGGGGTCGCATAGATGATATTGAAGCCCTCCAAATTCATCTCATCGAACTCTTCCGGTGAGACTGTTTTAGCGCCCGTGAACACGTTCATGGCATAGCCCTCGCGCTGGCAGGAACCCTAACACTAGCCGGGCGGGCAACTCCCGTCCAATAGCCACCCAAGACCACCACCCGGTGGCCTTTTTCATGCCAAGGAGAGACGCATGACCAGCCTGAACCTGTTCGGCCACGAATTGGTGGTCGATAACTTCGCCGGCGGTGGCGGTGCCAGCGAGGGCATCGAGCAGGCCCTGGGGCGGCCGGTGGATCTCGCCATCAACCATGACCCGACAGCCATCGCCGTGCATACCGCGAATCACCCGGACGCCGAGCACAGCGTCGCCGATGTGTGGGATGTCGACCCAGCGGAGGCCGTCCATGGCATGCCGGTGGGGCTTGCCTGGTTCTCTCCTGACTGCCGGCATCACTCCAAGGCCAAGGGCGGGCGCCCGGTCAGCAAGAGTGTGCGGGGCCTGGCCTGGGTAGCTGCTCGATGGGCGGCAAAGGTGAAGCCGCGGGTGATTGCACTGGAGAACGTCGAGGAATTCCTCGACTGGGGCCCGCTGATGAAGGACGCCAAGGGCCGCATCGTCCCGGACCCCGCACGCAAAGGGCAGACATTCCGGGCGTTCGTCCGCGCCCTGGGCCGCCACGGCTACCAGGTGGACTGGAAAATCCTGCGCGCCTGTGACTACGGCGCTCCGACCATCCGGCGTCGGCTTTTCCTGGTTGCTCGGCGCGACGGCCTGCCCATCGTCTGGCCCAAGCCGACGCATGCCGATCCGGCCACCCCGGCCGTTCGACGGGGCAAGCTCAAGCCCTGGACCACCGCTGCCGAGTGCATCGACTGGAGCATCCCTTGCCCCTCGATATTCGACCGGCCGCGCCCGTTGGCCGACGCCACCTTGCGGCGGATCGCCAAGGGCGTGATGCGTTTCGTAGTGGAGGCCGGAGAGCCGTTCATCGTACCCATCGCCAACTACGGCAACGGATCGGAACTGGTGAACGCAACCAGCGAGCCATTGCGCACGGTCACCGCCTGGCCGAAGGGCGGCAGCTTTGCTCTGGTGGCGCCAAGCCTGGTGCAGACCGGCTACGGGGAGCGCGCCGGACAGGCGCCGCGCACCCTCGACATACAGCGTCCGCTCGGCACCGTGGTCGCGGGTGGTCAGAAGCACGCTCTGGTCTCAGCCTTCCTAGCCAA
GCACTACGGCGGCGTGGTTGGCGCCGACCTCCGCAAGCCACTGCCGACGATCACCGCGACCGACCACAATGCCCCGGTAGCCGTGAGCCTGCTGAACCTCAAGGGCAGCGAGCGCGGTGGTCGAGATCCGCGGCATCCCATACCCACTGTCTGCGCCGGCGGCACTCACGCCGCGGCGGTGGCCGCCTTCCTGGTGAAGTACTACGGGAGAGGCATCGGTCAGGAGTGCAGTGATCCGCTGCACACCATGCCCACCCGCGACCGCTTTGGCTTGGTCACCGTCACGATCGAGGGCGAGCAATACGCCATCGTCGACATCGGCATGCGCATGCTCCAGCCCCATGAACTCGCCGCCGCCCAGGGCTTCCCCGATGGCTACCAATTCGCCGAGGCAGGCGGTCGCGCTGTGCCCAAGTACCAGCAGGTGCGCCTGATCGGCAACAGCGTCTGCCCACCGCTGGCCCGCGCCATCGTCGAGGCCAACTTCACGCACGAGCGTCAATACATGCCGGCACCCAAGGAGGCTGCATGACCACATCATTCAGACCATCCCTAGCCCCGGTCCACCCACCGGGCTGCCGCGCCCGGCTCAATACGCGCTATCAGCTCGAGATGCTGCGCTGGCGACTGCCGATGCCGTTCACGATGCCGGACCTCAGCCGCTGCATTGCCAGCCGAGGCGGTCAAGCCGAGCGTCACTCGGCTGTGTTGGCCGAGGCGACGATCCGAGACTGGATCAGGAAAGGCTATATCGCGACCGCCGGCCAGATCGGCGGGCTTCCGAGCTACAGGAGGACATCATGAGCATCGATCCCGAACAACTGCTGAAGCTGGCCAAAGATCGCCACGGCCCCGCCGGCGTCACCCTGGGCTGGCCGATGGTCATCAATATCGTCGACGAGCTGCGGAGGTTGCAGTGGGAGCGCGAGGCGCTGGCGTCTCTGGCAACAGAGCGAGCCCAATTCATTCTGAACGCCGTTGAGCTTGGCTATTGCCAACTGCCGGACAGCGACACGCCGGACAGCGCGCACGACACATACGAAAGGTGCAAGCTGGCGGATAGCGAGTCCGTCGCCTGTCAGGATGCTGCTATCAGAGCAGAGGGGGCGCGATTGGCAGCAAATGCCGTCCGCGGCCTAACAGAAGTCAACCGGCATCGCGTCGCTGATTACCTGGAGAAATTCGCTGATGAGTTGCGCTGCCAATCCGAGGAGGCCCCATGCTCATCTACATCGCCATCGGCATCCCCATCGGTGTCGTGCTGACGCTGGGAGCGGAGTGGGCGCTGTTCGCAGCAGGCGCCCGGAAGATGAATAGCAGAGGGGAGGAGGAGTGATGGACGCCAAGCAGGACCATGAGCGAATGTGCGCCGCCTTCGATGAGTGGGTTAAGGACGAGAACACGCACTGGCTCAGCGAACACGATGCCGCCTACCAGGGATTCATGGCGGCATGGGAGATGCGTTCGGATGAGATGGCGGGCCTACGCGAAACCGCCCGCCTGGCAGAGCGCTTCGCCCGGGCCAAGGGTCGATACCATAGCGAGCATTCCTCGTGCGATCTCATGGAGCGCTTTGGTGTTCCGTGCGTGAGGCCGGGTGATGAGCCGGAGATGGTCAAGTGCGCTTGCGGCGATACGTACCCGCCGGACAGCTACGGCGCCGGCTTCATGGCCGCCAATGGCGGGGTCTGCGAGAACTGCGACATGATGAATGGAGGTGGCAATGGCTAGCGTGAAGCTGATCGCGAACTACGTGGCCGTGGCGCTATCTTGTGTCGCGGCCTTTTTGTTCATGGGCGGTTACGGGGTATGGGATTGGACGCTACTGATCGGTACATTTGCCGTGATGAAGTTGGCCGGATGGAGTGACTGGCCCTGGGGGCAAGGATGAGGGTGATTCAACAAGCAGCACTGAGCATAGCCGCCGCGGTCTTGGCGTGGGCGATCTTCGGGTTCATTGCTTGGCAATGGGATGCCTCTGAGTGGGCAGGATTTACAAG
AGCGCTGGCGGCTGGATTCTGGATAGTCAGTACGATTCTGATTGTGAATGGGTCGAGGTGGCCATGACTGATCTGATAACTCGCGCAGCGATCTACAGCCGAGCCGCGCACCAAGCTGTGGGGCAGCGCCGCAAGTACACCGACGAGCCTTATCACCTACATCCGGCCGCGGTAGCCACGACGGTGGAAAGTGTTGGTGGAACCACGGCCATGATCGCAGCAGCTTATCTCCACGATGTCGTGGAAGACACGCATGTCACGTTCAGAGCTCTTGCGCACGAATTTGGCCCGGCGGTGGCCGACTACGTCTACGAACTGACGGATCAGTTCACCGATCCTGCCCAAGGCAACCGGGCACACCGCAAAGCAATGGAGCGCGATCGGTTGGTCCGGATCAGCCCTGAAGCTCAGACCATCAAGTTGGCCGACTTGATCGACAACACGATGAGCATTGTGGAGCGCGATCCTGACTTTGCCAGGGTCTACATGGCGGAGAAACGGGAGTTGCTGAGAGTGATGCGGGCTGGTAACGAACACTTGCTCCGGATAGCTGACGCATCCGTCGCTACCTACTACGGGAGGCGTGATGCAGAAGCCTCGGATGCGTGAGCAGCGCAGCGATACCCGATAACAGCCGCCACCAGGGCGGCATTTTTGTATCTGGAGGTAGGGATGGAGATTCCAGACGAAGCCGTGGAGCGCTTCCATACCAAGTACCGGCTGAATCCCGCGACGGGCTGTTGGGAATGGACCGATGCGCTCAGTTCGCGCGGCGGCTACGGGCGGCTCAAGGTCGGCAGGGTGGCAGTCCGGGCTCATCGGGCATCGTATCTGATCCATAAAGGCCCGATCCCCGAGGGCCTGGTGGTCTGTCACACCTGCGACAACCCGGCCTGCGTCAATCCTGACCACCTCTGGCTGGGCACCCATATGGACAACACGCAGGACATGATGACCAAGGGTCGCGGCAAGTTCCCGGGGCACAAGGGCGAGATCAACCCGCGTGCCATCCTGACGCGCCGCAAGGTCGAGAAGATCATCCAGCGAATCACGGAAGGGCAGACCAACAAGCGGATTGCCATCGAGTTTGGCGTCTCGCACGCCACGGTCAGCCTGATCCGACGCGGGAGGATATGGACCGACGTGCCACGACCCGACGATCCAGCGTTTGCCCACTACGCAGCACTCAAGGCCGCCAACAAGGCGGCCTCTTAGTTTCCGGGAGGTCGATATGGAGCTGGTACTATCACGCAAGGAGGTGAGGGAGCTGACAGGCTGCGCCCAGCGTGCCAGGCAGCGACAGCACCTCGACGCCATGGGAATCCCTTACGTGGTGAGGGCGGACGGCTGGCCGGTCATTGACCGGCAGGCCTACCATCAAGCAATGGGATGCGAGGCGGCGAACGACGCCGGACACCCGAAAGCAGTGTTGAACCTGGAGGCCTTGGACGGATGAGCAAGCTCCCCTCGCGCTGGGCGTGGAAGCATGGCGCGTACTATTACCGCCCGCGTCAGCACGAGCGGGCCGCCTTCGACAACAAAAGCTGGTTCCGCCTGGGCGAGAGTTATACTGAGGCCCTGCGCGCCTTTGCGGATCGCATGGACATTCAGACGGGCGACAAGCTGGAGGAGCTGATCGATCGCTACACGGTCGAGGTACTGCCGACGCTGAGTTATAGCGCCCAGAACAACTACCGGCAGAGCCTGAGAAGGCTGCGCCCGACCATGGGTTCGAACCCCGTGGCAGTGATCGTGCCGCGACTGGTCTACCAGTACATGGACGAGGTGCAGCGCCGTAAGTCGATGCATCTGGCGAACCAGGACTTGAAGGTGCTGAATCGCGTGCTCGATTTCGCCGTCCGGTGGGGCGTCATCGATCGCCATCCCATCAAGGGGCTGGTGAAAGCCTATGGGAAGCGAGACGGACTTCGGAAAGGGCGGGACAGATACGTTGAGGACTGGGAGCTGGCAGCGTGGCAGACTGTAGCGACGCGGCAACA
GCGGGCGTTCGCGGCCATCATCCTGCTGACCGGCATCCGCAAGTCGGATTGCCTGCGGCTGATGGAGAACCATGTGGGCGAGAATACGCTGACGGTGCAGGTGGCCAAGACAGGCAATAGTATCGACTTCGTTCTGACCGACGCACTGCGTGCCGCCATCGCCGAGGCGAGATCCTGCAAACCCAAGCTGTCGCTTTACCTGCTTCCGAACCGTCGAGGCCGCTGCTATGTCAATGAAGACGGTCTGACCCAGACCTGGGACGGCAGGTGGCGGGCCACCATGGACAAGGCGCTGGAAGCGACGGATCTTGAGCATGCCTTTACGCAGCATGACCTGCGGGCGAAAGTGGGATCGGATGCCGAGAATGACGCCAGGGCGCAAGAATTGCTCGACCATACGAGCGTGGCTGTTACCCGTCAGCATTACCGGCGGAAGCGCCAAGCGATCAAGCCAGTGAAGTAGAGATTGTGAGACACCTTGCATATCGTGAGACACAGGATGACGCAAATGTCGGAATTCAAAGGATTATTCGAAGGCGAGGTGTTCCATGTGTGAACTGCTCGGCATGAGCGCCAATGTGCCCACCGACATCTGTTTCAGTTTCTCCGGATTCCTGCATCGCGGCGGCGGCACCGGGCCGCATCGCGATGGCTGGGGCATCGCCTTCTACGAAGCCGGTGGTTATCGCGATTTCCGCGATCCCCATCCGTCGGTCGACTCGCCGATCGCGCGCTTGATCTGTGACTATCCGATCAAGTCCCATGTCGTCATCAGCCACATCCGTCAGGCCAACGTCGGGCAGGTGCGGCTGGCCAACACCCATCCCTTCACCCGCGAGATGTGGGGGCGCCCATGGTGCTATGCCCACAATGGCCAGCTCAGTCACGAGTGGAAGCAGTTGCCGCTGTCGTTCTATCAGCCCGTGGGCGACACCGACAGCGAACATGCCTTCTGCTGGCTGATGGGCGAGCTGCGTCGCGCCTTCCCCGAGCCGCCGACGGATCGGGACGCCCTGTGGCGTTACCTGCATGAGCTGTGCGAGCACCTGCGGACGTTCGGTGTCTTCAATCTGCTGCTGGCCGATGGCGAGTCTCTCTACACCTACTGCTCGACCAAGCTGGCGCACATCACCCGGCGGGCGCCCTTCGGCCGGGCGAGCCTTTCCGATGCCGAACTGGCAGTGAACTTCGCCGAGCACACCACGCCCAACGATGTAGTGTCGATCATCGCCACCGAACCGCTCACCGACAATGAGGACTGGGTACGCATGGAACCCGGCGAGTTGCTGGTCTGGCGCGATGGCGAGATTCAGGGTCGCTATCAAACGTAAGTTTCAATCGTCTGTTTTACTGCCGGTTCATGCTGGTATAGGGTAGTCCCAGAATAACAAAGGCCGACCCCAGGGTCGGCGCGAATCGTGGAGAACGCCATGAACGAGCACGCCAACCCGGCCATTCTCAATGGCCCCGAGATGGAAGGCACCGGTGACTACACTTCGGTGCTGGATGTCTTTCACGCCAGCGTCGAGCGCTTCGCCGACAACATCGCCTTTAGCTGCATGGGACAGACGCTCAGCTATGCCGAGCTGGATCGCCTTTCCGGCGACTTCGCCGCCTGGCTGCAGCACGAGACGAGCCTCGAGCCCGGCGACCGTATCGCCATCCAGCTTCCCAATACGCTGCAGTTCCCCGTGGCGGTCTTCGGCGCCCTGCGGGCCGGCCTGGTGGTGGTCAATACCAATCCCCTGTACACCGAGCGGGAAATGGCCCACCAGTTCAAGGACTCGGGGGCACGGGGCATCGTCATCCTGGCCAATATGGCCGACAAGCTGGAGCGGGTGCTGGAGCGTACCGACATCGAGCACCTGGTGGTGACCGAGCTCGGCGATCTGCACAGTTTCCCCAAGCGTCAGTTGATCAATGCCGTGGTCAAGTACGTCAAGAAGATGGTGCCGGCCTATGGCCTGCCCCAGGCCGTGCCCTTGCGGCGTGCCCTGCGC
CTCGGTGCCGGCCTGGAGCATCGCGAGGCGAGTCGGGGCGCCGACGATATCGTGGCCCTGCAGTATACCGGCGGTACCACGGGGCTGGCCAAGGGCACCATGCTGACCCATGGCAACCTGGTGGCCAACATGCTTCAGGCGCGTCAGGCCATCGGCAAGGGCCTGCGCGAGGGCGGCGAGACCATCATCGCGCCCTTGCCCGTCTACCATATCTACACCTTCACGGTGAATTGCCTGTTCATGCTGGAGACGGGCAACCACTCGGTGCTGATCACCAACCCCCGCGACCTGGATGCCTTCGTCAAGGAGCTGAAGTCGCTCGATTTCTCCGGATTCGTCGGTCTCAACACCCTGTTCAATGCCCTGTGCAATCGTGAGGATTTTCGTGCCCTGGACTTCTCGTCGCTGAAGCTGACCATTTCCGGCGGCATGGCCTTGACCAAGGCGGCGGCCCAGCGCTGGGAGGAGGTGACCGGCTGTGCAGTGGCGGAGGGCTATGGCCTGACCGAAACCTCGCCCATCGTCAGCTTCAATCCGATCGACGCCATTCAACTCGGCACCATCGGCAAGCCGGTAGCCGGCACTGAGGTGAAGGTGGTGGATCTCGAGGGCAACGACCTGCCCTTCGGTGAGGCCGGCGAACTCTGCGTGCGTGGCCCCCAGGTGATGAAGGGCTACTGGAATCGCGACGATGAAACCGCCAAGGCGATCGATGCCGACGGCTGGTTCCATACCGGCGATATCGCCAAACTGCAGGACGACGGCTATATCCGCATCGTCGATCGCAAGAAGGACATGATCCTGGTGTCCGGCTTCAACGTGTATCCCAACGAGATCGAGGATGTGGTGGCCATGCATCCCGGTGTGGTGGAATCCGCCGCGGTGGGCGTGCCGGACGAGGAGAGCGGCGAGGCGATCAAGCTGTTCGTGGTGGCCAAGGACCCCGAACTCGATGCCGAGACCCTGCGCCGCTGGTGCAAGAAGGAACTCGCCGCCTACAAGGTGCCGCGCCAGGTCGTGTTCCGCGACGAACTGCCGAAGACCAACGTCGGCAAGGTGCTGCGCCGCCAGTTGCGTGACGAGGATGGCGAAAGTTCGTGAGCGTTGTGGTAGTGGCTCCGGGGCTTGTCCCCGGGGCCGCTCGGCAGAGCCGACGGCGATAGCTCCGGGGTCGGTCCGGCTCGATTAGGCGACGCACCCGGCGGGGCGTTACAATTCTCCCGCCAACGATCACCGGAGTCATCGAGCCATCGTGAGTAGTAGCCCCCAGACCGACCAAGCCGCCGACGCTTCCCGAACCGATCTCGATGCCCTGAAGGCTCTGCAGGGCGAGCTCGATGGCGTTCAGTCGCGGGATGCCCCGCGTTTGCACAAGCGCCTCGCCGGCTTGACCCGACGCATCCGCCAAGGCAAACCCGTGGATCGCGGACTTGCCGAGATACGCCGCGAGATCGAGCGGTCGCGCGGCAAGGTCGAGGCTCGCGCCGCGCGTCCGGTGCACCTGGACTATCCCGCCGAGCTGCCGGTTTCCGAACGGCGCGAGGACATCCTCGCGGCCCTGCGTGACCACCAGGTCGTGGTGGTGGCCGGCGAGACCGGTTCGGGCAAGACCACCCAGTTGCCCAAACTGTGCCTGGAGCTCGGGCTCGGTCAGCGTGGCCTCGTCGGTCATACCCAGCCGCGGCGGCTGGCGGCGCGCTCCGTGGCGACCCGCCTGGCCGAGGAGATGCAGGTGCCGCTCGGCGAGCAGGTCGGCTATCAGGTGCGCTTCACCGACCAGACGGCGGAGGCCACCCGGGTCAAGCTGATGACCGATGGCATCCTGCTGGCCGAGACGCGCAACGACCCGGACCTGTCCCGCTACGAGGCGATCATCATCGATGAGGCGCACGAGCGCAGCCTCAACATCGATTTCCTGCTCGGCTATCTCAAGCGCTTGCTGTCGCGCCGCCCCGATCTCAAGGTGATCATCACCTCGGCGACCATCGATGTGGAGCGTT
TCGCCGCGCATTTCGCCACCGAGGATGCCGAGGGCAAGACGCGACCGGCGCCGGTGGTGGAAGTCTCCGGGCGAGCTCATCCGGTGGAGGTGCGTTATCGCCCGCTGATCCGCGACGAGGAAGACGAGGAGGACCGCACCCTTCAGGAGGGCATCCTGCACGCGGTGGAAGAGATCGAGACCATCGAGCGCGAGAAGCGCTGGTTCCATGGGCCTCGCGACATTCTGGTGTTCCTGCCCGGCGAGCGCGAGATTCGCGAGACCGCCGATACCCTGCGCCGCGCCGATCTGCGCGACACCGAGATCCTGCCGCTCTATGCGCGGCTTTCCAATGCCGAGCAGAACCGCGTCTTCGCGCCCCACAAGGGGCGACGCATCGTGCTGTCGACCAATGTCGCCGAGACCTCGCTCACCGTGCCGGGCATCCGCTACGTGATCGATCCGGGGCTGGTGCGCATCAGCCGTTACAGCTACCGGGCCAAGATCCAGCGCCTGCCCGTGGAGCCGATCAGCCAGGCCAGTGCCAATCAGCGCAAGGGCCGCTGCGGGCGTATCGCGGAGGGCGTGTGCATCCGTCTCTACGATGAGGAGGATTTCCTGGGGCGTCCGGAATTCACCGATCCGGAGATCCGGCGTACCAACCTGGCCTCGGTGATCCTCTCGATGCTGTCGCTCAAGCTCGGTGACATCTCGGCCTTTCCCTTCGTCGATCCACCGGACTCGCGGTTCGTCACCGACGGCTTCCGCCTGCTGTTCGAACTCGGCGCGGTGGATGGCGGGCAGCGCCTCACGCCCATGGGCAAGCGTCTGGCGCGCCTGCCCATCGATCCCCGCCTGGCGCGCATGCTGCTGGCCGGTGCCGAGCAGGGCGGCTTGCGCGAGGTGCTGATCGTGGTTTCGGCACTGGCCACCCAGGATCCTCGGGAACGGCCGGCCGACAAGCGCGAGGCCGCCGATCAGGCGCATCGGCGCTGGCAGGATGCGGATTCCGACTTCGTGGCACTGCTCAACCTCTGGAACGGCTTCGAGGCCGCCAGGGACGAGCTGTCCGGCAATCGCCTGCGCCGCTGGTGCCGCGATCACTACCTGAACTATCTGCGACTGCGGGAGTGGCACGACACCTTCCGTCAGCTTCGCCAGTTGCTGCGCGACATGGATATCGAGGTGCCGGCCAAGCCCTCGGAACCGGCCGCCATTGCCGATGACAAGGCGGATGACGAAGCCACCCGGCAGGCACGCCGCGAGAATGCCGCACGCTTACACAAGGCGCTGCTGCCGGGGCTGCTGTCGCATCTGGGCCAGTTTCTCGAGAATCGCGAATATCTGGGCGCGCGCAACCGCAAGTTCATGATTCACCCCGGCTCGGGGCTGGCGAAGAAGACGCCCAAGTGGCTGATGGCCTTCGAGCTGGTCGAGACCTCGCGACTGTTTGCCCGCACCGTGGCACGCATCGATCCGCAATGGATCGAGCCCGCGGCCGAGCATCTGGTCAAGCGCAGCTACAGCGAGCCGCACTGGGAGATGAAGCGCGCCCAGGTGGTGGCTTTCGAGCAGGTGACCCTGTTCGGCCTGCCGATCGTTGCGCGTCGGCGAGTCCATTACGGCCCCATTGCGCCGCAGGAATCCCGGGAGCTGTTCATTCGCCGGGCGCTGGTCGAGGGCGAGTTCAAGACGCGCGGCGAGTTCTTTGAGCATAACCGGGCGCTGATCGAAGAGGTCGAGGACCTCGAGGACCGGGCCCGCAAGCGTGACATCCTGGTCGACGAAGAGGCGCTGTTCGACTTCTACGATGCGCGGATTCCCGAGGATGTCGTCAACGGCAAGGGGTTCGAGGCCTGGCGCAAGAAGGCCGAGCGCGACGATCCCGATATCCTCAAGTTCGATCGCCAGGCGCTGTTCGCCCGTCAGGCCGAGGAAGTCACCGAGGCGGACTACCCCGACGAACTCTCGCTGGGTGGGGTGCGTTATCCGCTGAGCTATCATTTCGCGCCCGGCGCCGAG
GACGACGGCGTGACGCTGACCGTGCCGGCGGCCATGCTGCCGCAGTTGCCCGCGGGGCGTCTGGAGTGGCTGGTACCGGGACTGCTGCGCGAGAAGTGCATCAGCCTGCTCAAGTCTCTGCCCAAGGCCATGCGCCGCCAGGTGGTGCCGATTCCGGACTGGGTGGACGCCGCCCTGCAGACCATGACCCCGGACGATATGCCGCTCACCGAGGCGCTGGGGGAATTCCTGCGCCGCAAGACCGGCGCGCGTCCGCATCCCGACGACTGGCGGCTCGATCAGCTCGAGCCCCATCTGGTCATGAACCTGCGGGTGGTCGATCATGAAGGGCGGGAGCTCGGACAGGGGCGCGATGTCCGCGAGCTCGAGCGACGCTTCGAAGAAGCGGCGGGGCAGGGCGCCCAGGCACTGGCAGAACAGACCAGCGAAGGACGCGCGCTCGATAGCCTGCCGGAGACGGCGCTTCCCGAATCCCGGGTGACCACCCAGGCGGGCATCCGCGTCGAGGCCTTCCCGGCGGTGGTGCCGCGCGGCGATGCTCTGGACGTCGAACTCTTCGATCATCCCGACAAGGCACGGGAAGCGCATCGTCGGGGCGTGGTGTGGGCCGCCATGGCACGCCTGCCCGATCAGGCCAGGGCCATCGAGCGTCTTGGCGGCATGAAAACCTGCGCCTTGCTGTTCACCAAGGTCGGCAGCAAGCGCCAGCTCAGCGACGATGTCGTGGCGGCGGCCTTCCGCCAGGTCGTGGCGGTGGACCCGCTGCCACGCTCGGCGGATGAGCTCGAGCAGCGGCTGTCGGCTTGCCGTGCAGATCTGGTGCCGAAGGCGGAGGCGTTGCTGGCCACCCTGGAACGAGCCCTCGAGGGGCACCTGGCGGTGACCAAGGCGTTGAAGGGTAATCTCAACCTGTCTCTGGCGCTGGTTTACAGTGATTTAAAAGCACAAATGCAGCGTCTGGTGCATCCTGGCTTTGTCAGCGAGGCGGGCGAGTGGCTAGAGGAGTATCCTCGTTACATGGAAGCCGCGCGGATTCGCCTGGAGAAGGCACCGCGGGAACGCATGCGCGATCAGATGCACATGCAGGAGATACAGGATTTCGAGGCGCGCCTGGCCGCGCGACGCGAGAGCGAGCGACGCGGCGAGGTGGAAGACCCGGCACTGGTCGAGTTCGGCTGGTGGATCGAGGAATTGCGGGTCTCGCTGTTTGCCCAGCAACTGGGCACCAGAATGCCGGTGTCGGCCAAGCGCCTTGAAAAGCGCTGGGCGGAGATCACCGGACGCGGCTGA
Protein sequences of DBSCAN-SWA_2 >NC_014532|1268701:1283915|1276077_1277088_+|WP_013331872.1|integrase|DBSCAN-SWA MSKLPSRWAWKHGAYYYRPRQHERAAFDNKSWFRLGESYTEALRAFADRMDIQTGDKLEELIDRYTVEVLPTLSYSAQNNYRQSLRRLRPTMGSNPVAVIVPRLVYQYMDEVQRRKSMHLANQDLKVLNRVLDFAVRWGVIDRHPIKGLVKAYGKRDGLRKGRDRYVEDWELAAWQTVATRQQRAFAAIILLTGIRKSDCLRLMENHVGENTLTVQVAKTGNSIDFVLTDALRAAIAEARSCKPKLSLYLLPNRRGRCYVNEDGLTQTWDGRWRATMDKALEATDLEHAFTQHDLRAKVGSDAENDARAQELLDHTSVAVTRQHYRRKRQAIKPVK >NC_014532|1268701:1283915|1271173_1271473_-|WP_041601957.1|DBSCAN-SWA MNVFTGAKTVSPEEFDEMNLEGFNIIYATPEDGPVAEGDNVDRSIRLWADRDEMYMMRMGADSTEGENLSILHIAVPDISDEEAIKHLKQKVSDKLDQP >NC_014532|1268701:1283915|1273425_1273890_+|WP_013331868.1|DBSCAN-SWA MSIDPEQLLKLAKDRHGPAGVTLGWPMVINIVDELRRLQWEREALASLATERAQFILNAVELGYCQLPDSDTPDSAHDTYERCKLADSESVACQDAAIRAEGARLAANAVRGLTEVNRHRVADYLEKFADELRCQSEEAPCSSTSPSASPSVSC >NC_014532|1268701:1283915|1279877_1283915_+|WP_013331875.1|DBSCAN-SWA MSSSPQTDQAADASRTDLDALKALQGELDGVQSRDAPRLHKRLAGLTRRIRQGKPVDRGLAEIRREIERSRGKVEARAARPVHLDYPAELPVSERREDILAALRDHQVVVVAGETGSGKTTQLPKLCLELGLGQRGLVGHTQPRRLAARSVATRLAEEMQVPLGEQVGYQVRFTDQTAEATRVKLMTDGILLAETRNDPDLSRYEAIIIDEAHERSLNIDFLLGYLKRLLSRRPDLKVIITSATIDVERFAAHFATEDAEGKTRPAPVVEVSGRAHPVEVRYRPLIRDEEDEEDRTLQEGILHAVEEIETIEREKRWFHGPRDILVFLPGEREIRETADTLRRADLRDTEILPLYARLSNAEQNRVFAPHKGRRIVLSTNVAETSLTVPGIRYVIDPGLVRISRYSYRAKIQRLPVEPISQASANQRKGRCGRIAEGVCIRLYDEEDFLGRPEFTDPEIRRTNLASVILSMLSLKLGDISAFPFVDPPDSRFVTDGFRLLFELGAVDGGQRLTPMGKRLARLPIDPRLARMLLAGAEQGGLREVLIVVSALATQDPRERPADKREAADQAHRRWQDADSDFVALLNLWNGFEAARDELSGNRLRRWCRDHYLNYLRLREWHDTFRQLRQLLRDMDIEVPAKPSEPAAIADDKADDEATRQARRENAARLHKALLPGLLSHLGQFLENREYLGARNRKFMIHPGSGLAKKTPKWLMAFELVETSRLFARTVARIDPQWIEPAAEHLVKRSYSEPHWEMKRAQVVAFEQVTLFGLPIVARRRVHYGPIAPQESRELFIRRALVEGEFKTRGEFFEHNRALIEEVEDLEDRARKRDILVDEEALFDFYDARIPEDVVNGKGFEAWRKKAERDDPDILKFDRQALFARQAEEVTEADYPDELSLGGVRYPLSYHFAPGAEDDGVTLTVPAAMLPQLPAGRLEWLVPGLLREKCISLLKSLPKAMRRQVVPIPDWVDAALQTMTPDDMPLTEALGEFLRRKTGARPHPDDWRLDQLEPHLVMNLRVVDHEGRELGQGRDVRELERRFEEAAGQGAQALAEQTSEGRALDSLPETALPESRVTTQA
GIRVEAFPAVVPRGDALDVELFDHPDKAREAHRRGVVWAAMARLPDQARAIERLGGMKTCALLFTKVGSKRQLSDDVVAAAFRQVVAVDPLPRSADELEQRLSACRADLVPKAEALLATLERALEGHLAVTKALKGNLNLSLALVYSDLKAQMQRLVHPGFVSEAGEWLEEYPRYMEAARIRLEKAPRERMRDQMHMQEIQDFEARLAARRESERRGEVEDPALVEFGWWIEELRVSLFAQQLGTRMPVSAKRLEKRWAEITGRG >NC_014532|1268701:1283915|1273960_1274356_+|WP_013331869.1|DBSCAN-SWA MDAKQDHERMCAAFDEWVKDENTHWLSEHDAAYQGFMAAWEMRSDEMAGLRETARLAERFARAKGRYHSEHSSCDLMERFGVPCVRPGDEPEMVKCACGDTYPPDSYGAGFMAANGGVCENCDMMNGGGNG >NC_014532|1268701:1283915|1269645_1270155_+|WP_013331866.1|DBSCAN-SWA MARGINKVILIGNVGQDPEIRFTQSGTPVGNINLATSDTWTDKQSGQRQERTEWHRLIVFRRLAEVAQQYVRKGSKLYVEGRLQTRKWQDQSGQDRYVTEIIVNDLQMLDSRQGQPQGNAQAQQQPSQNGYFDQQRQYQQQQTAPPNPPGGGDFDDEIPFAPMHPLMGG >NC_014532|1268701:1283915|1278055_1279726_+|WP_013331874.1|DBSCAN-SWA MNEHANPAILNGPEMEGTGDYTSVLDVFHASVERFADNIAFSCMGQTLSYAELDRLSGDFAAWLQHETSLEPGDRIAIQLPNTLQFPVAVFGALRAGLVVVNTNPLYTEREMAHQFKDSGARGIVILANMADKLERVLERTDIEHLVVTELGDLHSFPKRQLINAVVKYVKKMVPAYGLPQAVPLRRALRLGAGLEHREASRGADDIVALQYTGGTTGLAKGTMLTHGNLVANMLQARQAIGKGLREGGETIIAPLPVYHIYTFTVNCLFMLETGNHSVLITNPRDLDAFVKELKSLDFSGFVGLNTLFNALCNREDFRALDFSSLKLTISGGMALTKAAAQRWEEVTGCAVAEGYGLTETSPIVSFNPIDAIQLGTIGKPVAGTEVKVVDLEGNDLPFGEAGELCVRGPQVMKGYWNRDDETAKAIDADGWFHTGDIAKLQDDGYIRIVDRKKDMILVSGFNVYPNEIEDVVAMHPGVVESAAVGVPDEESGEAIKLFVVAKDPELDAETLRRWCKKELAAYKVPRQVVFRDELPKTNVGKVLRRQLRDEDGESS >NC_014532|1268701:1283915|1271585_1273157_+|WP_013331867.1|DBSCAN-SWA MTSLNLFGHELVVDNFAGGGGASEGIEQALGRPVDLAINHDPTAIAVHTANHPDAEHSVADVWDVDPAEAVHGMPVGLAWFSPDCRHHSKAKGGRPVSKSVRGLAWVAARWAAKVKPRVIALENVEEFLDWGPLMKDAKGRIVPDPARKGQTFRAFVRALGRHGYQVDWKILRACDYGAPTIRRRLFLVARRDGLPIVWPKPTHADPATPAVRRGKLKPWTTAAECIDWSIPCPSIFDRPRPLADATLRRIAKGVMRFVVEAGEPFIVPIANYGNGSELVNATSEPLRTVTAWPKGGSFALVAPSLVQTGYGERAGQAPRTLDIQRPLGTVVAGGQKHALVSAFLAKHYGGVVGADLRKPLPTITATDHNAPVAVSLLNLKGSERGGRDPRHPIPTVCAGGTHAAAVAAFLVKYYGRGIGQECSDPLHTMPTRDRFGLVTVTIEGEQYAIVDIGMRMLQPHELAAAQGFPDGYQFAEAGGRAVPKYQQVRLIGNSVCPPLARAIVEANFTHERQYMPAPKEAA 
>NC_014532|1268701:1283915|1277173_1277956_+|WP_013331873.1|DBSCAN-SWA MCELLGMSANVPTDICFSFSGFLHRGGGTGPHRDGWGIAFYEAGGYRDFRDPHPSVDSPIARLICDYPIKSHVVISHIRQANVGQVRLANTHPFTREMWGRPWCYAHNGQLSHEWKQLPLSFYQPVGDTDSEHAFCWLMGELRRAFPEPPTDRDALWRYLHELCEHLRTFGVFNLLLADGESLYTYCSTKLAHITRRAPFGRASLSDAELAVNFAEHTTPNDVVSIIATEPLTDNEDWVRMEPGELLVWRDGEIQGRYQT >NC_014532|1268701:1283915|1270475_1271066_+|WP_157953398.1|DBSCAN-SWA MPVRIPGVRGKGGASPADLILEHIELCRENVKTIERIATHQKKREMRNEINKRIRACNNLLGMTSGSRRFGHIYRETDLQKGEKLVSEHVIPVSELTSLYENGVPLEELIFYPIALISNASNTLLNKRGLNRSRKDRSKPFSRYLEAGIKVESHLGREVETKTWGMADHWDLINKTPELANIMDAVYSRSLSHKSH >NC_014532|1268701:1283915|1274688_1275234_+|WP_013331870.1|DBSCAN-SWA MTDLITRAAIYSRAAHQAVGQRRKYTDEPYHLHPAAVATTVESVGGTTAMIAAAYLHDVVEDTHVTFRALAHEFGPAVADYVYELTDQFTDPAQGNRAHRKAMERDRLVRISPEAQTIKLADLIDNTMSIVERDPDFARVYMAEKRELLRVMRAGNEHLLRIADASVATYYGRRDAEASDA >NC_014532|1268701:1283915|1274348_1274516_+|WP_157953399.1|DBSCAN-SWA MASVKLIANYVAVALSCVAAFLFMGGYGVWDWTLLIGTFAVMKLAGWSDWPWGQG >NC_014532|1268701:1283915|1275856_1276081_+|WP_041601960.1|DBSCAN-SWA MELVLSRKEVRELTGCAQRARQRQHLDAMGIPYVVRADGWPVIDRQAYHQAMGCEAANDAGHPKAVLNLEALDG >NC_014532|1268701:1283915|1275297_1275840_+|WP_013331871.1|DBSCAN-SWA MEIPDEAVERFHTKYRLNPATGCWEWTDALSSRGGYGRLKVGRVAVRAHRASYLIHKGPIPEGLVVCHTCDNPACVNPDHLWLGTHMDNTQDMMTKGRGKFPGHKGEINPRAILTRRKVEKIIQRITEGQTNKRIAIEFGVSHATVSLIRRGRIWTDVPRPDDPAFAHYAALKAANKAAS >NC_014532|1268701:1283915|1268701_1269637_+|WP_013331865.1|DBSCAN-SWA MWFRNLHLYRLHDAPGLDDAYLEGLLAAQAYRPLGGNEARRIGWCPPAGRAGTQLCHEANAQRLLTAVRQERLLPSGVVREEVEERAEALEADEGRKLRRQERLTLKEQVYEELLPQAFVRSTRIDLWWDTRRGLIGINTSSRKRAEEVLDLLRETLGSLRVTPLATNILPMRAMTSWLSDPGTRPAEMEIGDTVELKAKGDDGVIRGRQLDLDSDEIHSHLECGRQASKLALGIESMIRFVLHDDLTIKSIRFDDAVIDEAAQQDDGDDPVARLEADFTIMTHVLGVTVDTLLQWLGGEAQAGADFPSAA >NC_014532|1268701:1283915|1273153_1273429_+|WP_041601958.1|DBSCAN-SWA MTTSFRPSLAPVHPPGCRARLNTRYQLEMLRWRLPMPFTMPDLSRCIASRGGQAERHSAVLAEATIRDWIRKGYIATAGQIGGLPSYRRTS |
16 | Halomonas_phage(33.33%) | integrase | attL 1265016:1265031|attR 1278614:1278629 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
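The keyword rule described above can be sketched as follows. This is a minimal illustration, not the tool's actual implementation. It assumes case-insensitive substring matching, and it assumes the annotation is the fifth `|`-separated field of a six-field protein header (as in the `integrase` record of DBSCAN-SWA_2). The names `PHAGE_KEYWORDS`, `matched_keywords` and `annotation_from_header` are hypothetical.

```python
# Keywords quoted in the report text; the matching rule itself is an assumption.
PHAGE_KEYWORDS = [
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
]


def matched_keywords(annotation):
    """Return the phage-related keywords found in an annotation string.

    Matching is a case-insensitive substring test (assumed), so a keyword
    such as 'lysin' also matches 'endolysin'.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def annotation_from_header(header):
    """Extract the annotation field from a protein header line.

    Assumed layout, inferred from the headers in this report:
    contig|region|start_end_strand|protein_id|[annotation|]DBSCAN-SWA
    Headers with no annotation have only five fields.
    """
    fields = header.lstrip(">").split("|")
    return fields[4] if len(fields) == 6 else ""


header = ">NC_014532|1268701:1283915|1276077_1277088_+|WP_013331872.1|integrase|DBSCAN-SWA"
print(matched_keywords(annotation_from_header(header)))
```

Plain substring matching is the simplest reading of the rule, but it can over-match (for example, 'plate' inside 'template'); a word-boundary test would be stricter.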
DBSCAN-SWA_3 |
1900649 : 1931078
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_014532|1900649:1931078|DBSCAN-SWA ATCATTGGGCCTTGATCCTCCATTCCAGCGACGGGTGGGGGGAACTCAACTCAGGGAGGTAGAACCAATCCGGCGATCTCGCCGTGACCGTCAGGTTAGGGGCGTTTAAATAGCCTGCGGCCAGGTAGTCGCCATCCGGGTGCCAGGCGACCGTCTCCCCGTAGTTGGGTAACGATGGCGTACCACCGATCTCTGTCCAGGTGGCAATGTCGATGACTATCAGGTGGGAGCCGCCATAATACCCAATGGCCAGGAAAGCGCCATCCGAGCTGAACTTCACGTCCGCAATGCTCGATTGGATTACAGGCGCGTCACTGACCACAGACCAGTCATCGTTCGGGTTGATAACAGTGAAGCGGGACTCCTCACTATTAAAATCCACTGGGTTGTGCGCGACCGCGATGATTCCGCTGATCGGGCTGATATCAATGGCCCTTCCCATACCGGGCAATGTCGGTGCGACATTGGCCACTGACCAGTCGGCAGCCGCGACAATGGTCAGGTAGGGAGACTCCCGATGCGCCACCGCCAAGTAGGTGCCACCGGGTTCGAACACAACATCCTCACCTCCCCCGGGCAGAGACGGCGTGCCACTGACCACAGACCAATCCGACGTATCAATGATGGTCAGATAAGGAGCGCCGTCATGCACGACGGCCAGGAAGGTGCCATTGGCATTGGAACCAAACGCGACCGCGTTCCCGGTGCCGGGCAATGACGGCGTGCCGCTGACAACCGACCAATCGGCGGTGTCGATAACCGTCAGGTAGGGGCTTCCCTCATGGACGATGGCCAGATAGGCCCCATCGGGTCTAAAAGCCGCCCCCCACCCAGACCTGGGCAGTGACGGCGTTCCACTGACCACGGACCAGTCTGACGTATCGATAATGGTCAAGTACGGAGAACCATTGTGCGCCATCGCCAGATAGGCTCCATCGGGGCTGAAAGCCGCCCCGCGCCCCATGTCGGGCAGCGACGGCGTGCCGCTGACGACTTCCTGGGCAGGCAATGTCTCCGGGAACATGGCGGCCAGCTCTGGGTAGTCCACTTTTCGATAGGCCAGCCCATCCATTCGCAAACGGTCATCGAGAGAGGCATTGGCCTTCACCAGGGACGTCTCCCCGATCCGCGACCGAATACGGTCCATCGTTCGCGCGGTCGCCCGAGCCTCGATACTCTCACCCGCACCATGTGAGACAGCGGTGCCCTCGAACCCACGCTCTACCGTCAACGTATTGTTTGAGATGCCGGTCACGCGCACGATCTCGATGTTCTCGCCGTACCTTGTGGTGTCGGCGAAAATCGTGAGCAGGAACCATTCATTCGCCCCCAATGAGGTCGGCAGGTTCACCGGGGCCGATACCTCGATGGTGGTAGTCACATCATCGATGGCCGAAGCCAGTGTCGCCCGGGCGTTGTTGAAGAAAAGCTGTGGCATGGGTTACATCTCCACCACGATGATCATGAAATCGACCTCTTTGGTACGGCTCTCGGTATAAATCAGCGGTGACAGCTTGTAGCTCTCTCCATCCGTCCCGCCCTTGATCCACAACTTCACGCGAGTGGGCTCGATTCCCACCTGTGTCACCTCCAGACCATCCGGGGCCGTAACCTCGACGTGCTCGATCTCATCGCCCTCGGCCATCCACTCGGAGAGATCAATGTCATAATCCAAATGGTCTCGGGGCTGCTTCTTGAATGTCTTCATGCGACTTGGTACTCCCTCACGGACGCAGGCACCGACATGACACGGGCACCGGGCATCAGGGTGATGGTCCGGCGGTCAGGTGCCGGCGAGTCGGGATTGATCAGGTACGAGACACGGCCTGTTTCGGCCCAGGATTCAGCGACGCGGGCAGGCTCGATGAAGGTGATGCGCAGCATTTCACCGCCGCTATCAGCGTCACCTCGGAAACGTATCTCGA
AGAACCGCCAGACGTAGATCGTCACCGGAACCTGAGCCGAGGCACGGGCCTGGCCACTCGCGGGGCGGTAGCGGGCGAAATAGGCAGTGGGTTCTGCCCGCGAGATGCTCGTGCCCCTCATACGGCGGGTGAGATGGGGCATCCCCGTCGTCACGGAACGATTGATCGCAGGCTGCGGCGTCAGGTAGGCGAAGCGAGTGTAGCCAGCATGGGGTGACGCCGAAGCCAAGGCCGCCTCGTCGATATACCTGGCGGTCTTGCGGGCATCACCAGCTCCCTCAGCTCGAGCCACAAACGAGGTCGGTTCGACCTTGCGCACAAAGGCATTGAGGTCGTAGCCACCACCTGCCGATGAAATGGCCCAAGCCGGGTCATCGACAAAACGTGCCGTCTTGCGGGCCACACCGGCACTCTGGACTCTGGCACGGGCGCTTCGCGTGGTAGTGAATCGATAGGGCTGGTTCTCCCCACTCGATTCAGCCACGGCCACCGCCGAAGCCCGCATATACCGAGTGATATCAGGATCGCTGGCTAGGAACACAGCCTCATTGGGCTCGGGGGCGAGCACCGGGACAGCGAGCCGAACGGCGGCGCCCAGCCCTGTCGCGGTTGCCGCCTCTTGCCCACCATCCATCCAGGCAGTCTTATGGGCGGAAGCCTGCCCAGATGCGGTGCCACCCAGTGGCCGGATGTCCGGCCGTCGCAATCCATAGACGTTGGCCTGAGCGATCACCGTGGATATGGCGATCTCTTTCGCACGCGCTTTGGCCAGGCGGTGTACATCCGAGACCACCGATGACATACCAGCCATCGAGTCCCAGGTAGGTACCTGACGGTAGCACCATGCGGTTTGCTGCGACTCGCACGCCAGAGCATCCAGGCCTATGTAGCGCCCACGGAGCGTGTAGGCCGATTGCTCCGCCACCGCTTCACTCTTGCCCGTAGCGCGGACCCACGGATTGATACTGGTCAGGTGAACGGGTGGGACATCGCCGCCCGCCGACGCGATACCCGTAAACGCATGGATCTGGAAATACCGCCCTGCGGACTCGCTGACCGATGCGGCTTTCCCGCTTGCCTCGGTGATGCACAGCCCCGCCGCCACTTGGCGGGATAGCGCCAACGCACGCCCACTGACATAGCAATAGGTCGTGTTGCCAAGCGTGAGATCTGGTGCGATATGGGTCGTGGCCAGCCCGGTCGAGATGCTGCTGGCATGCCTGATTCGTCCCGGTACCGCCGAACTGGCGGCACTACTGGACGCCACACCATAGTGGTTCCGATACAGGCGCAGGGAGCCCGGGACATGGCGGGATTCCGACTCCAGCCATAGCGAACCACGACCAATCTTAGAGTTCGGTAGAGCCCGAGACCTGGCTCTCACTACCGTGCGGGCCTCAGCGAACCGCTGGCGAACGCGCAGTCTGTCGAGCCTGGATTCCGCCAGCCAGGTCAACGATGCCGTGCGCAGTGGGCGCAATGACGCCTGCCAGAACGTCTTGTAGTCGATGCTTGACGCCGCCTCGGCTACAGACTGGCCACCCATGATGTGGGTGATATGAGCATCGCAGTACAACGCCGCCGGAGCTTCGACCGTGCTCGGCGTCATCCAGGCAGTACGGTCAGCACTGACATCGAGGACGGCCAGAGAATCATCCAGGTTGATGTCGGGTGCCGGGGTAATCCGGTGAGCGTCCCCAGTCGCTACGGCTTCTGAGTGGCTCTCAGCATGCGACATTCGGGCATGGCGCACGCCGGCACCATCCATGGCGGCTTCGACATCCGTGATGCCACTCATTGGGCGATGCGCCCAGGACGCGAACGTTGTGTCGGCGATCGTGGCGGACTTGCCCGCCACGAATACCAACTCATGGCCCTCAGCGTCGGCGAACGCCGCAGCCGCCGCCGTCCCCCAGCCAGTCGCCAGTCGCTTGGATGCCCCTACTGAATCGGCTGACGCGGATAGAGAGCCGGTAGCATGCTTGATGCGCAGCGGTGTC
GCCGTCGAATCCGCCACCGGCTCGAGAGACAGCACGCCGCCGTACAGCCACAGGCCATTTGCCCCGCCGCCATTCAGCGGGGCCCCGTTGAGGGGCCCAAACATACCCTACCCCTCTTAGCGGAGGATGACCTGAATGCCGCCGATGTCTACAGACAGCACGTCATTGATTTCCAGGGTCTTCGCAACGGTGAACTCACCGTGATAGAGCATGTTGCCGCCGGATTGGGCGTCATAGAGTGCGTAGTGCGTGACGGTCACGCTGCCATCGGCGATGGGCGGGAACTGGATCACCTTGGCATTGGAGCTGACTTTGCCGTCACCGGAGTCGGTCGGCGAGGTCCAGCCGCTAGAGATGGAGTCGCCTTTCGCGGCGTCTTGCCGGACATAAGCCGAGTCGGTGACTTCATCGCCGGTGTCGGCGTCGGTGGGATCGCTGGTGAACAGCGCCACATAGATGGAGGACGGCGTCGGGAGCGTGGTCCCGCGAAGGGTGTGCTCCAGGATGCCGGATTCGAGATAGTCGGAGAATGCTGCCATTGGGAGTTACCTCACGGTAAGTCGCAGAGTTGCGGTCAGATGAAGCGGCGGGCTTTCACACGCGCACCGCCCCGGGCGTGGCCAAGTCGAGAGTTCTGCCTGGCCGTGGCCACATCGGCCAGGTATTGCTGATGGTAGTAATTGGCGAGCTCGGGATTTCGCCAAGGCTGGGGAAGCATGAGCAAACGCCAGCGCGCACCGTTGGCAATCGCCATATCCCAGCGATCCACCACCTCAGAAGGCGGCATGTCGCCCATAGCGGGCCGGCACGCCAGGCGTCCGTAGAGCATCGAGGCGTTGGGACGCTGGTATAGCTCGATCGAGGTGGGCGTGGGCTGGTCGAACCCCTGGCCCTGGACGCACTCGCGGCCATCGATCTTGAGCGAGACGATGCGCAGTGGCTCGCCGCTGGATGCCACGATCTGAGGGTTAGCGGTATTGGCGGCCACCACTACCGGCTCTCCCTCCTGCACCCACACGTCAGCTTCGGTGCACAGCTCACGCGCCATCCGGGTGATGGCATCACGAATCGTCATCAGCGGCGCCTCGGGAACGTCCTGCACCACCTGGTGAACAAGGTCATCCAGCGTCATGGATCAGCCCCTTGTCCGTCCCACCGAGCCAGTCGGGTTGGACGAATTGTCGGGAGCGTTCGGCGAGATCGAACGATCCACCTCGACCTTCTGCCCCAGCGCGTTCATGTAGGCTTCCATGTGGAGCTGGGCGCGATTCAGGTTCTTGGCGTGCTCGGCATCCTTGGAGTAGGCGCGATACAGGATGTAATCCGTGATGACTGGCCCATAGCTGTCGTTCAGCTTGATCGCCTCGTTCGCCAGCCCCTCGAGCCCCTGGGAGACATCGTGCGGCGCTGGTACGGCCGAGTAGATGATCTCCACCTCGGCTCCCGTCGCGGCCGGCGGATAGGTATAGAAGCGCGTGGGGTCTTGGTCATCGAAGATGTACTGTTCGATGTCAATGCTGGGTGGATCAGCGTGCCAACCACGCCGCGTGGTATCCAGCGACTGACGGGACGCCACCATGATGCCCATCTTCTTGCTGCCGGAGGCCGTGTTGCGAACAACGTCGATCAGTCGGAGTCCATCCGCCGGGATCTGTTGTCGCGTGCCTTCGGCCAGCTCATGTGACGCATTTTCGGTCGAGGCATCCGGCTTGATCTGGACAATCGCCTGGTAGCTCTCATTCAGCCAGCCGATGAGCTCCTCGTTGGTCCAGCGGGTACCGGCCGACGTGACTTCCTGCAGCACCAGCTTGGCGTTGTTGATGACGGTTCCCACCGTGGTCAAAGCCATGGCTTACACCTCCTCCATATCGCGCTTTGCAGCCAGGGCAGCGCTATAGGTGAACACACGGCCGTTGCCGCGATGGCGGAGCAACCGAGGTTTCTCCTGGGCCGGTGCTGACTCACCCTGATCCGGCGTTTTCTCGGCGTCAGCCATCTTC
GATGCCGGGGTGCCCTCTTGAGACGTGTCGGCGCTCTCCGGCGCGTCTTTATCGGGCGACTCCGCGTCGTTCTGGGCTGGCGTCTTGCCCTGTCCCTCAGACACTTGGTCTTCGGCAAGACTTTGGAGGTCAACACGGATGGTCTCGAGCCCCTTCCGCTTATCGACATCCACGCCCAGCTCATCCTTGCCGATCTTCTCCAACTCGTCCTTGGTCTCGGCATTCTCGATGCGTTCGCTCAGTGACATGGGATTGCCCTCTTGTCTGGAAACAGAAACGCCACCCCGGAGGGTGGCGCTTGCCCGGTTCACCCGGGATCAGGCGTTTCAGCCGCGCTGGGCGTAGAGATGGCCAGCAGCGTTCGGGTCGATCATCTCGTACCCGAAAACGTTGAGGCCGCGGACCAGCTTGCCGAAGTCCTGCGGGTTGGGCAGGTTTTCCATGTTGGTCATCTGAGAGGCGAAGGTGAGCGCCTTTTTGTGACCGAACAGGACGTTGGTGGCCTTCTTGCTGGTGGTCGCGTCGGTGACCGTGGACAGGTTGTTGCTGACGTACACGTCGAACCGATCCAGCATGCCCACCTTGCCGTTGCGGTAGACCGACGTGCTGTCGCCCATGGCGCTGGCATCGCGCAGATCGGACTTCTTCAGCATGCCGTTCATCCAGGCCGGTAGGATGATGTAGCGGCCCGTATCCGGGGCATTCTGCTCATCCAGGACAGAACCACAGTCCACGATGGTGTCGAGGATGTTGCTCTTGGTGATGGATACCGGGGCACCAGACTCGCCCATGTTGTAGGAACCGGACTTCACACCGGCATCTGGGCCGGCGTTCTCCGGTGCGGCGTCAGCGTACACGTCACCGAGGATGACCGTGTCGATGGCGATCTTCATCTGCTGACCGGCATCGTCCGACCAGTCGTCCATCAGCTTGATGTCGGACTGATAGGCATCGATGTCATTGACCTCGAAGGCAAAATACTTGGCCTTATCGATGTGCAGCTCGACCTTGTCACTGGTGGGTTTCTCATAGTTGAGACCACCACCAACGACATAGTCCTTGATCGTGATGGACGGGGTGGTGCGGATCTCCACCGTGTCGCCCTTGTTCTTGATTTCCGTGCTGTTTCCACCCTTTCGGGCTTCGTCCCGGCTCTTTATCCGGGCTCCTCACGGTTTCCCATGAGACCAGACTATATCTTCACTGCTTGCGCTTACGCGGCCGCTTGTCGGGGCCACGACCAACCTTGCCCGTATGGTGGAAACTGGTGTGCTCTCCCAGTTCCATTAAGCCAAGGTTCTCTGGGCGATTGTCGAGCTTGTCTTCATTGATATGGTGAACAACTTCACCCGGCTCTAAGTACCTACCCAGTTTGCGTTCCATCACCAGCCGATGTTCTCGGACGTACCCCTTGCTGTCAGCGTGGGGGTGCCCGTCGGGAGCCCTGACCATCCGGTACCCGTTATGGGTGATGATCTCGCCCCGATGCAGCGTGTCATTAAGCGGAAGGCCGAAGCGTCGAGCCGCACGATGCACAATAGGCTGGCTGAAGCCGGACCGTCTCGATATGTCGCTAGTCGTCATACCAGCCTCAAGCATGGGACGGATTATCTCGTCAGTTTCTGCCGCCGTCCTGCGCTTGATCTTCGGAATCCCGAACCGCTTCATGTATGTCATGACCAGCTTCTTCGAGACGCCGTGCTTTTCAGCAACCTTGAGGGTTGACCGAAGCTCGAGATAGTCCGCTTCCAGCTGTTCCTTGGTGATGAAAAATTTCTTGTCTGCCATGGGCTACTCCTGTTTATTTAAACAGTATCACCCATGTTTTTAATTTCGCAAGCAGTGCCCCGCACTCGTGGGCCTTCATCGTCCCGTCAGGACTGTGTGACCTAGTCGTTGAACCTTCCACCCGTTACCGGGAGGCTTGGCTGCTGATTGCCCAATCCCTGCACTTTTTGGGCCGTCACGTTTGCCGTTGCCGGCTGCGTTGTGGCGCTCAG
GGCTCTAAGGGCTTCCCAGCAATTCACGGGGTTTAGTGAGAGCTGACTTCAACCCTCGTAGTTGGTATTGGCGATCTCACCGAAGCAGGTGGAGGCATAGAGCTTCTCCACCAGCTTCCCGCTCCACACCTGGGGAATAAACCCAGAGGAGGACGTGCTGGAATAATCGGGATGGCTCCCGTCGCGAGTCGGACCTGCCATGATGTGTTACCTCGTGTTGTCGCCTCTCGGCGAGAAATCGGTAACCGCCGACCGGGTGGTTACTGCTTCACCCGGCCATCTCGTTGGGCGGCGAAGATGTCGGCTTCCAGGCGTTGAGCCTCCTCGGCGCCGTAGCGTCCCGCTGTCTTGTCCCGGTAGAACTGGCTGATCTCCTGCCCGGTCCAAATCCGCTGCCCCTGCGGGGCCTCGGTGGCTCGGGTGGTGCGGGGCTCAACCTGGTCATCGGGCACGGAGCGCTTGGCGCCCTGCTCGGCCTGGTTCATGTAGAGCTGGAAAATGTCAGCTACTGCCTTGGCGTCCAGGGCCTGCTGGGCCTGATTCAGCGCTTCCTGGCGCTGCACGCCGGTGCTGGGATCGAACTGGGCCAGGAATTGGTGGAACTTCGGATCGGCGTTGACCTCCCGGAAGGTCGGCACAGCCTGCTCCAGGTTCATCCAGAAGCGGGCTTCCGCGTCCTGGTGCTTCTCGCTCTCGAGTCGCTCCAAGCGTTCGGTCAGCTCCTGGGTGTTTCCGGTATCGGCTGGCTTCCCGGACGGGGCGCGTTGGTCAACCATCCGCGTCACGAAGTCCACGAGATCCTCGCCATACGTGGCTTTGAACTGCTCGAGCTGCTCATCGCTGACACCGCTATCGGCGCCACCGGTGGGCTGCTGTTGTTCAAGCTCGGCGATTCGGCGATCTCGCTCGGCGATGGTGTTCTTGAGCTCGCTGTTTTCCTGACGCAGCGCCGGAATCTCGCTGTTGTACTTGCCCTGGAGAACGTTGAACCGGTGCTGCCAATAAAGAGCACCCTGGTCTTGTTCATCCTTGGGCTGTGCTTCAGCGCTGTCGGCGGGTTGCGTCGGGCTGGGCTCGTCGGCCTTGGCGGGCGCCGAGAACTCGTCCTGCGGGTTTCGCTGCCCTTCCTCGGGAGCCGAGCCAGCGGGACGCTGATCATCGGGATTCTCGGGTTGGGGTTCGAAGTGCTGACGGGCGGCTTCGGCTTGGGCCTGAACGGACCTGGGTAGTGACATTGTTGCTCCTGTGACGCCTCACGGCGTGACGGGTGAGCCGGCATTTGCCGGGGTTCACGACTCGGGTAACGGGTTCTGGCTGCAGTCCAGAGCGCTCCCACGACAATGCCGCCCAGGGCGGACCATGAGCGGCATTGCGGTGAGGCCGGGCCGGAGCCCGGCGGCTACTGAGTAAAGCGCTGGTTTACCACGTCGCGGGCTTGCTCGAGCTTCTCTATCAACTCGGCCACCGTTGACGCACGTCCTTGCAGTCGGGCGACTTCAGCGGGTTCGCGGCACTTTTCCAGGCTGTCGCGGCATTCCTCTCGCTCCGCCTTGAGCAGTGCCACCAGACGCCTCCCCTCCGGGGACTCGGCCAGCCTGGCCAGCGCCTTCCAGTCCTGTCCCTCCATCGTTGCGTCCTCGCGTCTCGGTAATCAGCTTGGCGATCTCGGCCAACAGGTGTTGCGCCTCGAGGGGCGCCATCTGCTGCGATTGTTGGGTTTCGGCCTCGGTCTGTGCGGCGTCGGCATTGGCCTCGCGCGCCCGTGCCTGTTTCTCGGCGGCCTCGGCTTCGGCCTTGGCTTGTTCCATCTGTAGCTGGGCCTGTTGGTCCTGCCGTTGCTGTGCCAGGTTCTTCTGCATCTCCTCCTCGGAGGGGATGAGACCCGGCAAATCGAGCTTCTCTGCGATGCTCTCGAGCAGCTTGCGGCGTCCATCGTGGCCGAGGATGCCCATGTCGTAGTCGTTGGCGGTGAGCTGGAGGAACTGCTGGCGTAGCTGGTT
GGTCTGCTCGCGAATGAGCATGGCGCTCGATCCTCTGGCCACCACGCTGGCATCCCCCTTGATGCTGTTGTCGTCGCTGAACTGCATGTTGTGGAGCCAGAGCGCCTCGATGACCCGGCGAAGCACACCGCGATCAATGTGCCGTATCGCGTCCTTGATGCCCTTGTTCGCGGATTCCATGAGCATCGACAGGCCTGACGCGGTTTGCCCGGCGCCGCCCGCCTCATCACTGCCGTACATGTAGCGCGGGATGTTGGTGGACTCATCGGCCCGGTATTCGAACTGCTCGTAGACAGCCAGCAATTCACTGGCATTGCTCTGCGGCTGGAAGAAGCGCAAGGCAGGATTGTTGCCGGTCTCTATGCTTGCCTTGGTTCGCCAGATCTTCCACGGGTAGATATCGGTTGGGTCTTCCTGCGGTTGCAGGCGATCTTCATAGACCTCGACCTGGGGCCCGGAGCTGATCGCCAGGTTGTTGACCAGGCCCCGCGCCGTGGCATTGCACACGTCCTGGACATCCGCCATGAGCTCGGGGATGCCCTGCCCCCAGAACGACCCCGGCACCGGCTGGAAGCTGGACTTGTGATAGGGGCGACGTTCAAGCGGATCACGATTGATGCGCACCCGGATGACATGCTGGCCGATCAGGATGGCCTCAACTTCATACTCGGCCAGCGGGTCTTCGATCTCGTCGGGGCTGATACCCCACTGCAATAGCGTCACGCCCTGGGCGCCGCCGGAGTAGATCAACCCATCGATGGTCTCACCCGGCGTCAGCCACTCGTGGCCACGCCCCTCGAGCTCGGCACGCTCACCGTCACTCCAGAGCCAGTCGCGGAGCCCGCCCTGGCCATACTGGTGCAGCACCTGGCGAATTGCGTCCTCGCTGTAACTCGGCACGCCGATCAGTTGGTTGAGCTGCGATCGAGTGAAGCGAGCCCGCTCAATGATGAAGGCACCGTCATCAATGCTGGTGGCATCCGGGCTGGGGTACATGTCGAATGGCGAAACACGCTCGAACTCGGGCCGGATGGTCTCGCCCTTCACGGGACGCCAGCCCTCCAGCCATTGCAGCTCTGGCACGCGCCGGAAAATCGGGGCGCGGATGAAGGCCGCCGGGTATGTCACGAAGTCGTCGATGAACGCCTCGAATGCCTCATCCCAGTACCCTTCCGCGAGCTGGTCGCTGATGAGCTCCTCATGCCGCTCCGTCGCCTCTCGCGCGGCATCCTCGGCAGCCTGTCGCAGTTGCTCGCGGGCCTGCTCGGCCAGGGCGGCCATGTCCACCTGCTGGCCCTGCTGCTGCATCTCTACGGCCTGCTGGCGGATTTGGGATGCCAGAGGAGCCAGATAGGCATCGGGCACTTCCGCCACTGGTGTCGGCTCGAGTCCCCAAGGGCGCTCATTCGCCGGCATCATGATGTCGCGAATCCATGCCGCTGCCGCCCGACACTTGGTCGCTGTCAGCATCATGTAGATCTCGGCCCCGCCTTCCTTGCGGATCGCCTGGAGCTTCTCGGGTGAGTATTCGCCCTTGCGGCGCCGCAGGCAGTCGAGCATCCGATTATCCACCGACTGCCGGGCCATCTTGGCGCTTTCCCAGCACCGGCGGATATGCGCCCCCAGGGACGACTCGACCAGTTCCCGGCGGCGCTGTTCTTCCTCGGCGATGCGCTGGGCCTCGGCTTCCTCCTCAGCCCGCATCTCCGCCGCCGACCGATGTTGTAGCAGCCCCATACTGCTCATTGGCTGCCCCCGGTTGCGCGATAGATGGCCCGGTTCGCATCTTGCTTGTTCCGGCGTAGCCGCATGACGTTACGTCGCATTGGTGCCAACTGGGCGAACAGGTCGCGGATATATCCCGCCGGGTCGGCGGCGAACTCGAGCAGCTTGATGTTCAGCGTCACGCCCATGTCGCCCGCGACCTCGAACTGGAGCCGCATCCCGGGCTCGGGCCGCGTGACACGTTCCTCGATGACGATCATGTCGATCTGAACCGG
CCCGACATCCGGGCGAAATCGCTCAGAGGCCGTCGGGATGCCCGCCTCGATGAGCTGTTCCGCAATCGTGCGCGCCGCGTCGCGCATCAACTCCGTCACCTCGGCGCGGGTCATTCCGTTGAGACGATAGGCGGTATTGCCAGCCCGGCGACTGATGTCGCCGTCCGGTGTGGTGATGATCTCCGGCATGGTGTCCTCTGCCTCGCGGCGGTGGTCAGGTATGCGCGGCCCAGCCCGAGCGTTCGCGTTTCTGAGGGCGCGGTGTGGGTGTGTTGGTTTGCTGAAAGATGGACGACCGGGCCAGCGTCTCGAGCGCCTTGGCCCCGTGAGATGCCCAGTCATGGCGCGGTGTGTCCTTGTAGACGCCCCGCTTGTCGTCCCACTCTTTCCGGTAGTTGTCGAGACACTTCACGCCTTCATCGCAGGCGTCCTCGTCTATCCAGCACATCGGCAGGAAGTTGCGGACCGCCTGGACGCCCTCGGCGTGGTTGCTGATACGTGGCACGACCTCGAACTTGATCCCGTATTGCTGGGCCACGTCGGCGCGACTCTTGCCGGTGCCCAGCTCACGGACGGCCAGGTCATGCGGCCCAAAGTGGCCACCGTATCGATAGCCCTTCTTGTTGAGCTCGTCGGCGTAATATTCGATGCCCTCGCCTTCGCCCTCGAGGTAGTCCACCAGGTGGATCTCGCGACCGACGATCTGCGCGAACCAGATCGCCATGGTGTCGTTCATGCCCAGGTCCCAGGCGGTGAAGACCGGCAGCGACGGGTTTACCTGCACCTCGGCGGTGATGCGCTTGTTCTTGCGCAGGAACCGCATCTGGCTGGCGTAGTAGGCGCCCTCGATGCTCTGCTCGAATGCCTCGGCCGGCGTGCTCGGATACTCCCGCTTCATGTCGTCCTGGAGCACTTCGGCTTTCTTGGCGTACCACGCCTGCTGTGCCGCATCGGTCGCGATGCCGTGCTTGTGCTCCAGGTCTTCGAAGTATTCCCGCAGCCATTGCGGTACCAACACTCCCTCGGGATCGAGTCGGTAGGCCGGTTCCTGCCACCACGGGAAGAAGTGGAACTGGAAGTCCATCAACGTCGGTGCTCGCCCAAGCTCCTGCAGGTGCTGGGCTGACTGGCAGTAGTCGAAGAAGTACCCTTCGCGACCCTCGGCGGTAGATTCGAGCGTTATCTGGTTGCCCAGGCCGACCGCCTCGAACGCGCCGGTGACGATTTCCTGTGCCTTGTGCGGATACTGCCGGCAGATCTTACCGAACTCCGAGATGTGGAGCCGCTGGAGCGTGCCGCCACGATACGAGGTGCTGACCTTGATGCTGGAGCCATTGTCGAAGACGTAGGCGCCGGAGCCGCTCTTATCGCTCACTGGTTGCGGGATGCGGATGCCCAGCATCTTGAAGATGCGATGCCAGGTGTCATCGATACTCTGGTACGCAAACGACACCTTGTTACGAAAGATGTCCTGTGCGTCATCGAGCTTGTGGCATATGCAGCCGGCGGCGAAGTTGTCGCGGAACAGGCAGTCGTCCAGGGCGTCGATCATCTCGAACGTGGTGAAACCAAGCTGCCGCGCCTTGAGGATGATGTCGCGGACGTGGCCCTTGATATAACGACGCCGCTGGGCCCGGTTGGGCCGGAAACGTCGCACCTTGCCTTGCTTGTCCTTGATCTTGTACAGCGCGTTGAGCCGAAACCACTTGAGCGACAATGCCTCGAGCAGATCGGCCTTCTCGGTCAGTCGCCCCTTGGCATGCGCACGTAGAAAGGCCTCGGCACGCTTGACCTCACGTTCCCGGTCACTCCGGTTCATCCGGCGCGACCTCGGCCATCAGTTCTTCGAAGGTCTTTTCCTGGTCGTTGGCATCATCCTGGTCCAGCCCGTAGCTCTGGCGCTCGAGCCGGATGACACGATCAAAGGCCTGCGTGCCGGCCCCCATCGCCTTGGCAATATAGTCCAGCGGGAGGTCGATCGAGGCGAGTTCTCCGCTCTT
GAGCTGAACAGTGATTTCACCGGCTTCGAGCTGCTGGTCGAGCCTGTCGGCGAACCGGTCGGCGATACGACGCCAGCGTGATAGAGCGGCACGGTGGCCACGCACGATGGCGGCATTCTCGTCCGATGCTTGTTCGATGATGTCCGACTCGGGGACATCCGGCGCAGAAGCCTCTGGGCGACTCAGCTTCTCGCGGGTCCGCTGCTGCACCGCGCCGGTGAGATCCTTTTCCCAGCCTTCGGCGGCGGCCTTCTTGCTGATCTGCGATTTACTCGGGCCGTGGCGGTCGGACAGTTGCTGCAGGCTGAACCGTCCCGTCCGGTAATCCCGTTCAACGGCCTCCCAGTCGTGTCGCTTTGCCATATGCTGCTATGCGGCACCCTCACGGGTGGCGCCTCCATCATGTAGGGCCCGGGCGTCTCACGACGAGCCAGGCGTAGGTGGGCGACGCCTCACGGCGTGGCCCTATCCCATGGTGGGGAATTCGCATTGATGAGCGCCCCGGTACTCAGGGCTGTATTGAGACCGTAGGTGGTACATGCGGGGCGCTCACCGGTGCGTCACTTGCCCGCCACCCGGGTTTTGAGTGCATCCATAAATGTGAACGAACGTGAATCGATACCTTTGGCCGCCAAGCGTTCTTTGCCCACACGGTTGATGTCGATGCCTAGGACGGCGCCGAACACACCGAACATCATCACGGCCTGGCGCAGCAGTTCGATTCGGTCGGTAGTGCCGGCGAACATTTCGAAGATGATCGATGCCATGACGCCGCCCAGGGACAGGCCAAACATCCAGCCAGTGAATCCACGCCAGCCCCCTTTGAGCGTGCTGTCGCTATCGACCTCAGCCCGCATCGTTTGGTTGACCTGGCCCAGCCGTTTGGTGCGCGCCTCGATGCGAGCCTGTTCCAGCTCAGCCTCAACCTTCTTGATCTTGGCCGCTGCCTCGGGATCATCATTCATCACCTGGTTCACGGCCTCCGGCGTCGTTTCTACGCCCAGGACCTCAGCAACCATGCTGCCGACAGCACCACCAGCACGACCGTACAGTGCAGAGCCGACCATCGGCGCGGCTTCGCCCACGGTTTCAGCTACATCGCTCCAGTTCATCGCGATTGCTCCTCGAGCGTCGAGAGACGCTCACCCAGCGCATCGAGTCGGCTATTGACCTCCTCGGCGCGTTGCTCAGCTTCGGACTTACGGACGTACAGGTTGCTCCACTCGCGTATGTCCCGGCGAAGCGAGACAATCTGTTCGCCCTGATACGTGATACGTTCCTGCAGCGCCGCGTTCTCCTCGCCCAGAGAGACCAGCTTCAACCCTGCCCACGCCAACAGCCCTACAAGAACAAGCTGAATCCCGGTCTGAAAGTGCCGCTCGAACTCGAATGGCCGGCGATGGCGGTATTCTTGCTGACTATTCCCCTGCGTATTGCCCTGTGCCTGTGTCGTCATTGGCCTCTCCGCACCTTCACGACCCGGTCAAACCAGTCACTATCTTCACCGGAGAGCAGGCCGCCAGCTTCGACATACACGCGGGCGAGAGTTTCGTGCTCATGCTCAGGCTGGCCGTAGCCTGCACCGGGCAGGCTGGCCCAGATGTTCGACGCGGCGGCAATAGCGCGCTTGAAGTAGCCATCCAGGATGAGGGTCAACGCCCCCTGCTCTCGCAATTGCTGAATCGCGTAGCGATCCTGGCTCTCCGGGCTGAAGTCAGGGAGCCCGAGCTGACGCTTATAGTGCGGCCAGTACGTATTGAGGATCTGGTAGCGCCCAGCGGCGGTGCTGTAGACGCCGTCGGCGTACTCGATCGCCAGGTCATCTGTGGGCAGCGGATGATCGGCATAGCTCGCGAACAGGTGGACGTCGCCTGGTACTGAGCCGACAAGCACGTCATAGCCGTCGTCGCTTGCAGCGAGCAAGTCCGGCCCGATCTCGGCATACGCGAGCGTATCGAGTAGCGCGCAGACGTTGCCGCCTCCGGTCTCGTGAGTAATGC
GTGGCATAAGGGCTCCGGCGCCTCACGGCGTGGGAGAAAAGGAGAAAACAGAAGCGGCTTAGGCCGCCAGGATCGAAAAGACCCCACCAGATTGGCAGGGGCCTAGGGGATAGTGGTCACTTGACCTTGAGCAGCTTTCGAGCCTCTTTCAAAGCTACTTTCTGGATTGCCTGCTGGGATGACGTAACCTGAGACCTCATTTCCAATAGCTTCTTTTTCGTACTCTGAACATCATCTTCAGACATGTTCGGAGCTACGCCCCTCAAAGACATCTCGGTGAACAGGGCAACGAAGCTCTTTCCAAAACGCTGGCGCAATTCATCATGCTCAGGGAGGTGTGCTTGCATTAAAAAATCCACCCTATCCATGAGCATATTGAGATCATGCACAGCAGGTCCGCTTGATTCTTGCCGATGCACCTGCCCTTCGTCGTCTACATAGAGAGAGCCAGCGAAGGCATCAACATACTGCTTGCATTCCTTCTCAAGCTCTCCGCTGCTAACGACAATCTCCTCAAGCTTTTCCTGCTTCAGGCGCGCTCGTTCATTCTTTCTGAGCATGCAGTAGGACAAGCACGCACCGAGCACGACCCCTAAAAGCGACGCTACAGGACTGAGCCAATCGCCCCACGAGGACTGAGCAGCCTCAACAACGATCGAGAACTGACCAAACGCCGGATACAGCTTGTCAGACACAGTGTGCTCCATGCGAAAGCGCCCCGGCTCGGAAGCCAGGGCGCAGTGATCGATAGTGGTGTCAGGATAGCGCTTCGGGGCACATACGACAAGGTCTACCCGTATAAAGGCTTCAACTCCACTGCTGCATCAAAGTCCGACTCATCATGGTGGCGGTGTATCACCCTGATCACTTCAAACAAGGCGCTCTTGATTGCCACATGCTCCCCAACACATGGGACCCGTGACAATTCAGTCGTCCCTTTATCCAAGCTGTCAATCGCCGAACTGTGCTCTTGATCAAGCTCTGCATCGCAGTAAGGAAAGATCGTCGCCAATTTAACCCCCCTTTTCCCTAGCGGATATGAATGATCCTAGAGCCTACCATGCACGCTTCCTAAGCCGCATATAACCTTAGAGCCGCCACCGGACCCAACGCCCTACTCTCCCAGTCATCGAGCGTCGTCAGCATCTGGCGCCATATGTCCGACCAGCTAAGTCGAGCAGCAGCACTCCAGCGCCTGACATCGATGGTGGTCCCTCGTTCGTCCTCGAGCCAGCGCTTGATCGCTCTCGGCCCGACCAGGCCCTGCCGGTTGTATGGATACGTCACCTCTCCATGATGATAGATCGCCGCCCAGGCCAGGGCCGTCAGCGTTCGCCATGCTTTCGCCGGCGGATGCGGTTTGTCGCCCTGGCCGGGCAATGGCCGGCCCATCATCGCGCGTAACAGCGACAGGTGCAGCCACTCCATGTCCTCGGTGAGCTCATCCCGGGTGAACGGCCCCCAGCAATAGCGGCACATGGCCTGTAGGTGTTGCGGCTGGCTCTCGACAGCGCTGATCACCGCCCCGGATTCCAGGCCATCCACGATGCGCCAGTCGTTGTTCTGCTTCTGCGTCTTCTGGATCTTGGCCTTGAGCGCGGCAACCTCGGAGGCCCCGGCCATCACGCTCCCTTGTTGCGCCCGATAGGCGTCGAACACCATCTGCCTCGCACTCAGGTACCGCATCACTGCCCCCGTTTCCTGCCGTTTCCCTTTTCGTTTCCGTTTCGTTTCCCCGTTTCGTGGCTCTCGGCCAAAGCCATCGCCTCCTTCGCTGTGTGCCGGATACCGATCCGTTCATTGCCATACCAGACAGTGAAAAGCTTGGTGCCACTGTGGTGCGTCCGGCAGATCGTGTACTCGCCACACTGGATGTGATACTTGCCGACACGATCCCAGTTCATACCTCGATGATCTCCACCCAGGTACCTGTCTCGGCTCTGCGATCTATCTCAGGTGGTTCCAGGATGACACGACGCACATACTGGCCACGGTCATCA
GGAAGCACGCCGGCGGCAACCAGACCATCCTCCAGGTGTTTCATGTTCAGCGAATAGTTGCTGGTATCGCGGCGCCGGACCCTCCGCCCTAGGCGCGGTGTGAACACCAGATCTACCCGATTGGCGACACGCCCCAAGCCCTCGCACCGGACAGCATTGCGGACCAGCAGGTGGCAGGCGTCCACTTCACGCTTCCGCGCCCCCCAGTGCTGCCGGGTGCCCACGTTGGTGCTGGTAGCAAGATACGGGATGAATAGCCTCATCGGTCGTCTCGCTCCTCGCCCTGGTCGTCCATGTCTTTCGTCACCCACCACAGGTTGCCGAGCACCACGCCGAATACCGATACAGCAAACAGGACGGCCCACCCGATCCATGCCAATACGCTCATACCGCCCTCACCAGACCGCGGCGGGCCAGTTGATCCAGGGTGCGCACGATAGCCTCATTCATGCGGGCGCGGCGCTCGTCTCGATTCATATCTCTACCGTTGTCGATCTCTCCATGGCATTCCGGGCATAAGGCAGCCGTGAGGCAGTCGCTGGCCTTTTGGCTCATCCCCCGCTCCTGGTTGCTGTGCGCCGCCTGGATGCCCGACTTGCCACACAACACGCAGTTCTCGATCTCATGGACGTTCGCCAACCAGGTGCGGGAACGGTACGGCTTCAGTCGCGGGAACATCAGGCAGCCTCCGTTGGCATCAGCTCGATCATTTCCTCGACCTGCTCAGGCTCGAGGTGCGGCCAATACTCACTGGCGATGAAGCGGCACAGCGCTCGCATCAGCTTGTAGAACTCGGCCTGGTCCATTTCATCGAAGGCGATCGAGTACGGTGCCCGGTGCTCGAGGACACCAAATCCTGGTACTTCGGTACGGGTGATCTCGCAGCCCTCGCCGCTTTTCTCCTGCAGGCGCTTGATGGCCTGGTGATGGTCCAGTCCTTCAAACCCTTCGATGTTCTCGGCCACCAGTCCGCCCAGGGCGTGGGCGAGGCGATGGAATTCCGGGTTCCGGGGAAGAAACAGATCGGCGCGTAACAGTTCGCCATCCTTATACCCACGCCCCCGGACGATGCTCGCGTCAGCGGCGCCGGCCGGCATAAAGGCTCCCACCATCTGCCCGGTGGCTGGGTCGATCAGCTTGCGGGCCCGCATGTAGACCCGTGGCCGTTTGCTCTTGCCCATCAGGCCGCCCCCTTACCGCCGTCATCGTGTGGCGGGAACCACTTGTAGACATGGGCCTTGCTGCGTCCGCCGGGCACCTCATAGCTGGCGATGTTCACCCCATCCTCGCGGCGTATCTCGCGAATGCGGGCACTGATAGCCGGTTCGCTGTCCATCGTGCCGAAGCGCTCCAGGATGGCTTGGCGGATCTCGTGCAGGTGCAGCCAGGCATCGGAGTCCCGCAGGACCATGTAGGTCCTACCATGCTGGCAATTCGGGTTGGTCAGGCGGTTGCCGCGACGGTCGGCGATGCTGGTCACGTTGCTCATGGTCATGGCCTCCTCTATGTCCTCACGCGGCGTCCCGGTAGCTGGGCCAATCCAGCACCAGCGTCATGCCGCCGCCTTCCCGCATCCGGTCCACGATTCGCTCGCCCAGGTAGTCGGCCAGCCCACTGGCATCGATGTTGCTGACGATGATGGTGGGGAGCTGGGCCTTGTACCGCTCGTTGACGATCTTGAAGAGCATCAGGCGCTCCCATTCGGTCCCGAGCTGGGCGCCGACCTCATCCAGAATCAGCAGATCCAAGGGAGCGACGAACGCCTTGATGGCTTCTCGCTCGCTGGTGCCATCCTTGCGGTCGAAGGCCCGTTCTTTGATGAGATCGACCAGCTCGTAGACGTCGATGCCCATGACCCGGTAGCCCTGGTCCAGCAGTGCGTTGCCGATCCCATACGCCAGGTGGCTCTTGCCAGTGCCCACCGACCCGGTCAGCACCATGCCACCGCCCTGCTCAAGCCGATCACCGAAGCGCTCGACGTACTTCTGGCATGCGGCCAA
AGCGAACGTCTTACGGTCATGGCCATCAGTCAGGAAGCCGGCCAGCGTGCGTCCCTGGAAACGCTTGGGGATCATCGAGCCCTCGCGCAGCTTCTCGAGCTGGGCATTGCGAGCATGAGCAGCCCGATTCTGGTGCTCGGTCTGACGCTGACGTTCGATGTCGTTATTGACGCATCCAGGGCAGTCCGACCACTTCCCGTTGGGCATCAGCGTTGCGGTAAAGTCGCCATGCACCAGGCAGCGCATCTCCCGGGTTTGGGCCTGGTTGGCCAGTGTCTGGCGAAGGTGCTGGCTCATGCGCTCATAGGCGCCGATGTCAGTCTGCTGTGCGCTGTCCATGATCAATCCCTCATCCAGTCCGGCAGGTTATCCATGTCGGTCGGCGTGTAGCTGCCCTGTGGCTGCGGTTGGGCCATGCCCAGGCGTCCGTCCTGGCGTGTCGCCGGCTTGCGCTGCCCGCGAACTTGCTTGGCCAGTTGGTCCCATTTGTCGCGGAGTTTCTTGGGCGAGAGCACCACGCTGGACCAGAAGTGATGCTCTGCGGTCCAGGTGATCAGGTATCGAATCTGCTCGACGGTGCGACCGTCGCGTTGCCGCATCAGACGAAACTCATTGGCCCAGCTGTCAGGGTTGTGTTTCACCGGCGCTTCCAGGTCAGCGCTCACAGCAGCAACCATCTCGTTCGTCAGCTCGTGGTCCACGGGCTCACCCCACTGTCGTCGCTGTCGCTTCGGCCTAGCCTGCTGTGCATCGACGCTCTCTCCGTCAGGAGAGAGAGGGGTTACCGGACTACCGGAAGTGGACCGCTCCTCCCGGCTCGCTCCTTGCTCGGTGTTTGCCTCGCCAGAGGCTCTAGAATGCGGGTTCTTTTGGCTCGCTCCTGTCGGCTCGCTCTCGGCTCGGTGTTCATCGAGGTCTGAACAGGCCAGTGGAAGCCGGAATTCCATGGCGGCTTTGACGCCCTTTCCCCGATGCAGACGCTCAATCAATCCGGCAGCCTCGAGCTTGTCGAGCATCCGGGTGATCTGTTGCCGCGAGTGAATTCGTGCCTTCTCGCGCGATCCGGCCGGGGGGTGGTAGTCCACGACCTCTCGGAACATGTCCATCGACACGCGACGCTTCCGCCCTGTAATGCCCGTGGCATAGTTCATGTGCTTGCGGATGCCATGCTGATAGAGGCGCTCCACTTCAGGCGGTAACTCGATCAGGGCTTCTTCTTCGATCTCGTTGATACGCCAGGACCTCACACGGGGCTGCCTCCTGGTCGATGCTGGAAATTGAAAAATCTCTGCTGTATCAGCCATAATCACTCCCGTGGCCTGTTCCTAGTGCTGCGTCCCTAGTCCCTGAAGCCCGGTATCGCCCTACACGATCCGGGCTTCATTCGTTTCGAGGCCTACCAGGGCCTTGCATCGAAAAGCCGGACTCACGCCCGGCCCCCTTCCACTACCCGCAGCTCTGGCCGACGACGACGCAGCGCATAAAGCTGGCCCTCGACATCTGCCAGGCACTTCTCGAGCACTACGTCCTCGACGCTCATGCCTCGAGCTGCCGACTCGCGGCGAGCAGCGCTCTGTGACTCAGGCGACATGCGTTCCAGCACCCGCCCCAGCGATTCGCGACTCAACATCGCGTCACCCCTTGGCCCTTTCAGGCCGTTTCAGATTGATCGATGCTGTTCCCAGCAAACTGGCGCTCAGCCATCTCAGCGAGCAGGTTTTCCAGCTCGTTCTGTTGGTCTGCTGCTGGCTCCTCACCATTGGCGAGCTTGGCGAGCAGCTCCCGTCGGCGCTTCTGTATGACCTCGATGCCGATCAGCGTCGCTTCATGGACCAACGGAGCCAGCCTGCCGCCATGGCGACGTGCCGCCTCGGCCTGGATGAGTTCGAATTCGGCGTCCGTGAATCGAGGCTTCACCAAATGCTCCAGGTGCTCGCCATCCGGCTTGCAGCGCCTGTCGGTCTCAAGGTGTGAAAGCATGGGCAAGATCCTTCTGGTAA
CCGCGGCCTTCACAGGCTGGCGATGGGTTTATGCAACCGCTGGTTGCTCGAAGAAATCAGGACGCAACTCGTACTCGGGACACTGGCCTTCAGTGGCCGTATGAATGGCCCGAGCCATGGGAATCGAAACGCTCGACCGGCCATGCAGCAGCTTCCAGACCATTGTCTGGGAGCAGCCGATAGCCGCAGCGAGAGCCGACTGGCTATTTCCATGCAGCTCTACGGCTTTTCGCAGAAGACGTTTCTGCGTCCGCATACTGGGATTAACAACACAAGTGGTCATGAAAAACCTCATAACAACGCAAGTTGTAATGTAAATACTCGCTAAGTGGTTTGTCAATCTACAACTACGGTTGTAACTTCATGCTTATGGATACGTTACGGAATCGACTCATAGAGACACGCCGCCGCATTGGCCTAACTCAGGCCCAGGTGGCGCGCCGATCAGGGATGTCTCAGGCGGCATATCAGAAGCTGGAAAGCGGCCGATCGCTCAGCTCTCGGAAATTGCCAAGTATCGCACGCACACTGAACGTGACAGCCGAGTGGCTCGAGGCAGGTGGCGATAACCCATCACAATCACAGGTGGATCGTACTGACGCCGGCAACGTTTCCCCCATCACCTCGGGGGGCCGATCCGTGCCAGAGATCAGTTGGGTCCAGGCTGGCGAATGGACGGAGGTAGAAAATGTCGAGGACATAAACCTCAGCGAGGTGCGCCATTGGCCCTGCCCAGTGAGCTGTAGTGCGCGAACGTTCGCCTTACGAGTGGAAGGCGACTCAATGGCCCCAACCTTCCCTCCCGGCAGCATCATCTTCGTTGATCCAGAGGTTCCTCCCATCAGCGGCAAGAAGGTGGTCGCCAAGCTGGTAGATGAGAGCAAGGCAACGTTCAAGCAATACATCGAGGACGGCGATAGCAAGCTCCTCAAGGCGATGAACCCTAACTGGCCGGAGCCCTACGTGCCGATCAACGGGAATTGCGTGATCATCGGTACCGTCGTTTTTGCAGGAACTGAAGTATAGTCATGTCATTGTTTTTAAAGTGGTTTGTTTTCCGGTGCTGTATCAGCCTAGTTAGTGCGCATGCCCAGGACTTCCGCGGAGCCTGGGAGTGGGTCCGTACTGTTCTTGGGCTAGCCGTCTATGCGGGGTTTGCCTTCACCGTCGCCTTCCTGGGGCGCATGGCCTGGGATGCTCACTGGGCCCGGCTCTTATCTGCCCTGGCTGTCATGTTCATGTCTTTCCTCGTCGCAGATCTCATTGTGAAAACCCTTCGAGACAACGCCGCCCTGCTCTCCCTACCTGCCGTGGCCATCGCCACCTGGATGGCGTTCCTCCTCTGGTAGCCATCTACCGAATTCCCTCCGCCCGCACCATGCCGCCCACCTGGGCGGCATTTTTTTGATCCCAACAAATCGCTTACAACCTATGTATTGACACAAAACAACTTGAGTTATACCTTTAATTACAACTTAGGTTGTTTTCAGCCTAACAGGACCCTTCCAGGGCCGTGCTCTTTAACAAGGTGATGAGCCCAGCCGCCTGTAGTGGTAACGCACGAGAGGTCGCTGGGAGTACGCCGCCCGGGGCTGTCGAAGTACCGGGACGCCATCCCACCAGGGCGTTATCTGGTGGGCATCGAGAAGCGTCTTGGCGCTGCAGGTCTCTGACCCTCCCGCCGTTGTGGCGTCAGGACGTTTCTCGATGTGGTGACCGCCAGGTCAGGTCGCTACATCACTCCCTCAGTTCCCCTGCGTATTGCCCGCCTCCCCGGCGGGCCTTTTTTCGAGTCTCTAGCAAATGCCTGGCCGTCGCTCAGGTATCCGCTAGGTACCTCGACCAGAGGAGATAACGACATGGAGCCGAAGACCAAACAGGACCGGCGCGACGAGATCGACCGCGCCGCATCGGAGCACAGCCGGATCATCGAGATTGACCGTTGGCACGAACGACAGCGGCTCAAGCGAGAGCTGCAGGAGGTATGGGATGAACCCGCT
TAGAGACGCCGCTATCTGGGCAGTCGGGTTGCTGCCCACCTCCGCGCCGGAGTGGCTGGTCGGAACAACGGTATTCGTTGCCCTGATTGCCGCGTTGGCGGCCATCACCGTGCCGCCCTGTCTTGTCGTCGTCTATTGGATCAGCAGGAGGTCACGGTGACGTTTCATACCCACATTGCCGGCATCCCCTGCCTGTGTGAGGTCACGCACTACAGTGCCGCACGCCCGATGCGCATCACTGGTACAGGATTCGGAGACGCCGAGCCACCAGAGCCGGTGGAATTCGAGTTCCGCATCCTCGACCGCCGAGGGCGTTTGGCCGAATGGCTAGAAAGAAAAGTGACGCAGTCCGACGAGGCTCGCTTACTCGCCGAATACCGCGCCGAGGAAAGCGGAGCTGCATAAAGAGGCCCCGCCAGTTCGCACCTGACGGGGCCTGTTGCTGTGACCGAAAGCCGAAAATCAAATCACGAAAGGAGAATAGCACCATGACGATGACATTCGAAACCGCACGCCGGGAGCTGGAAGCCAATGGCCGAGTCGCCACGATCGATATGTCGGAGGACGTTTGGCTCGAGCTGCGCACACTAGGCATCGGCGCCAGTGAAGCCGCCGCGGCCGTCGGGGCCAGCGAACACCGCACCCCGTATGAGATCGCCGAACGGAAGCTTGGCATGGTGGACGACGAGCCGGAAGCTCGGTACCGGCGCCGGCAAAAAATGGAGATGGGCCACGTGATGGAGCCTGTCACCGCTGCGCGGTTTGCCGAGTTCACTGGCCTCAATGTCCAGAATCACAACTGGATGCAGGCTCATCCTGACCACCCATACATGCTGGCCAACATTGATCGCCGCATTGTCGGCGTTACCGATGCCCAAGCCGATTGGCTCGAGCTGCTCATGGGGCGCGCCGTTTCAGGGCCGGGCGTAGCGGAGCTCAAGAACGTGGAGTTCGCGAAAGGCTGGGGCAAGCCGGACAACGTGAACACCACGGGCGGACTCTGCACGTCTGGCGAGGTGCCGGAGGACTACTACATCCAGATCCAGCACCAACTGGCAGTCACCGGTTATGAATGGGCCTTTCTGGTCGTGACCATCGCCGGATGGGAAACCCGCTGGTACCCGATCCCGCGAGACGAGGAGCTAATCGACGACCTGATCGCTTTGGAGGGCGACCTGTGGGCCACCATCCAGCGCGGCGAGTTGCCCGAGATCGACGTCGGGCATCCCAAGGCCATCGACATTCTCAAACGTCGGTACCCAGGCACTGACGGCAGCATGGTCAATCTTGCCGAGCTCGATCACTGGCGCAAGGTGGAGCAAGAAGCCAAGGCCGAGATCAAGAAGCTGGAAACCACTGCCCGAGTAGCCCGTGCCCACCTCCTAGCAGGCATGGGCAACGCCGCCGTGGCCACCTTCGGCGACAACGAGGTCATGCGCCGCAAGGTCATCAAGCGAGATGCCTACACCGCCGAGGTGGCTGCCAGCGAATACGTGGATATTCGCTACGGGAAGATGAACAAGGCCGAAAAGGACCAGTTAGCCGCCCAGGACACGGAATCCATTTACCAGCAAGGAGAAGCCGCATGACTGCCGACACTCAAACCGCCGAAGCTGTGCAGCAGAACCTTCAAGAGCCTGACTTCCATGCTCCTGCCGAAGCTAGCTCCGATAATGGCGTCGAGTTGGCCACCAACGAAGTGCGCAACCAGTTCATGAAAGCCATGACACCGCGCAACTTCGATGATGTCTGGCGAATGTCCGACATGATCGCCCAGAGCGACCTGGCGCCGAAGGATTACAAGGGCAAGCCCGGTAACGTGATGATTGCCTGGCAGACCGGCGTTGAGCTGGGGATTACCAGCCCATTGCAGGCCATCCAGAACATCGCGGTCATCAATGGTCGTCCCACTATTTGGGGCGACATGATGCTCGCCATCTGCCGGGCGGCACCGGCCTGGTCCGAGGCCGACTTCAAGGAGTGGATTGAAGGCGAAGGCG
CTCATGCTGTCGCGCATTGCACCGTACGGCGCCGCCCGAACGGCAATGTCGCCCACTACACCTTCAGCATGAAGGACGCTCAGGACGCCAGCCTCGTCGGTAAGCAAGGGCCCTGGCAGCAGTACCCGAAACGGATGATGCAGATGCGAGCCCGGGCCTTCGCGCTGCGCGACACCTTCACCCCGGAGCTGAAGGGCATTCGTATGGCTGAAGAGGAGCGCGACATCACCCCCGAGACCAACGCCCCCTCCGAGTCAGCTTCAAGCGGTGGCCAGCAGCAGCGTGCCCGTACCGCCAGCCGCGTTGGGAGCAAGCTGGCTCAGCGCCGCCAGAAGACTCAGCAACGCCAGCAAGCTCAAGAGGTCTACGAGGCACAACAGGAATCCGAGGCCCCAGCTCAGCCCCTCAACTGCGCCCAGGTCTGCGAGCAGATCAAGCAAGCGACCACCATGGCTGAGCTCAACGAAGCGGCAGACTTGGCGCAACAGCTCCCCGAAGACGACAAGGGCGTCGCACGCGATGCCTATGAGGTTCGCCGGTCCGAGATCCGTCAAGCCGCCCAGTAACCCAACCCGGGCCCGTTCGGGCCCTTCGCACCACCTGAAAGAGGTACCCGCATGTCACAACATCCAACTGACGTGACCAGGTTCTTCGAGGACCTGGACGGGGGCGTTTTGGCCGAACGATTGGGGGTCATCCTCAGCCATGCGGCATCCGCTGCTACGGACAACCCCAAGAAGAAAGCAAAGGTCACCGTTGACCTCGAAATATCCAACATCGGCAGCGGTAGTCAGGTGGGAATTGCGCACAAGCTCGCGTTCAAGATCCCGCACGCCCACGGCACCCAGGCCGAAGACCACACCACCGAGACCGTCATGTACGTCAACACCGGTGGCGAAATGACGCTCGAGCCCAAGAATCAGCTCGACATGATCGGCCATAGCCATCGCCAATCCACTCCCCAAAAGGAAGATGATCAATGAGCCTGACCAAAGAAGCCCTGCAACATATCGAAGAATCCCACAAGACCGGTACTGAAGCTGCCAACGGGGACGCGCTTCTTGTCGGTAGCGGGTTCGAGCTGATCGATCTGGAAAAATACCGAGACCTCCGCCGCCGGTTCCGGGGCACATTCAAAACCCAGGGACTGGAGGCGTTCACCGGGTACCTTGCCGAACGCGCCGCCGCCAACACTCCGGTGTTCATTGATCGCGAGCGCATGGCCGCAAAGTGCTACCTCGACATCGGCATTCAGGACGCTCCTGGGCACTGTGAGCACAGCGCAATCTTGAACTTGCCCGAGACACCTGAGTTCGAGGCCTTCCACCGGGCAAACGGCAGTACATTCGACCAGGACGGCATCGTCGAGCTGCTCGAGGACTGGGGCCACCTGATGGAATTCGCGAACAGCAAGGGGGAAAGTCTGGAGTACCGCACCGTGCTGCACGCCTTCCGTAATGTGTCGATCGACGACCTGACCAGCATCGACAGCGAGAAACAGGAACACAGCTCACAAGTCGGCGTCATGAACAAAGTTACGGTGAAGCACTCCGAGCGACTTCCGGCCACCATCACGTGGCGCTTTTCCCCCTACGAAGGGCTGATGAAGCGCGAGCTTTCGATGCGCGTGGCGACCATCACCAAGGGCGGACCGAGCTTTCGGCTGCGTGCCATGGCGCTGGAGGCTGTCGAAAAGGACATCGCCGAGGAATTCTCTAACGAGATCGTCGCCGACCTCGACAACTGCGAATGCCTGCTGGGCATCTTCAATCCCTGAACACTGCTAGGCCCTACAAGGCCTCACGAAGAATCCCTCCCCCTGACTGAACCGAGGCTGCTCGCCAGGACGGGCCAGTCAGAACAGTAGTATCCAATATCCACGAGCAGCGACGACGCGCCGAACTCCCCTGCACTACCCCGCTACCCGTCTGGCAATCCGGCCAGAGTGGTGGGATGGCGGTACCGCATGCGCGGTAGGCGCGGGCAATGCCGGCCGCCCC
TCCGAGATGGGGATTAAATTTTCCTAGTTATATCGCTAATATCCCCCACATTTACAACAACAGGCTTCAAGTTGGTAAGTAACTCAGGAACCCACAACATCCTAAACTCTACTTGGTCTGCATAGCCATCTGGTTCCTTTACAAAAGCAATATGGCATGGGTCTTCCTCCAGAGAATATTGCACTCTTCTACGGTAAGAAACCGCCCCCATCATGGCATTGCGCAATTTAAATCGTTGGTTCAGCTCATGCCCAATAAGGTTCATAAGTTTTCTCGGACTCGAAAGCTTAACCCCGTAACCTCCAAGCTCGCCATCAGCAAAACCATCGCCACTATATAGGGTCCAGCACAACACGAAGGCGTCCGGAATTCTTGTATAAGACTGACACTCTGAAATGGTAATATCTCGACAACTTCTATCCACCGAAATCGATGATCTTTGGGCGACATACTGCAAAGCAGGGTCATCTCCATCACCTTGAACATGACCACTAAAGTATGTCCACGACCCCTCACCTTTATCTCCTCTCTTCGGGTTCTCATAGCCTCGACATGCTTCTAAAGTACTAATTCTTACTCGCCCAGATAAAAGCTCGGCCCGATAATAATCGGTATCTAAAAAGCGGTATACGCTTAGCATTAATAAGACTCCTGCGTGCCTGTAGGTTTTTATTCTGTATTTTTTATTTTGAGGTTTTTATGCCGTGCACTCAAGAGGTACGACACTTCCACCTATTTTGCGGTCTAGGCGGCGGTGCAGCTGGCTTCAACTGTGGTCACGCCCGAGTGGGCGCCATGGACGCGCAATTCCGTTGCATCGGCGGCGTTGATTCTGACCCGGCGGCTATCGCCGACTTCGGCCGGCTGGCCGGCACCCCTGGCACGGTGCTCGATATGTTCGACCGGGACCAGTACATCGCCTTTCATGATGCCGAACCGCCCGGTGACTGGCGGGAAGCCATGCCGGCCGACATCCAGGCTGCCGCCGGCGGCGAGCGGCCGCACATCGTGTTCCTGTCGGCACCGTGCAAGGGATTCAGTGGCCTGCTGTCTCAGAAGCGCAGCACGACCGACCGATACCAGGCATTGAACCGCCTCACCGTGCGCGGCATGTGGCTGGCCCTGGAAGCCTGGGCCGATGATCCACCCGAACTGTTTATCTTCGAGAACGTCCCACGGATCGCGAACCGTGGGCGACCACTACTCGACAGCATCACCGCGATGCTAGAGCGGTACGGCTACCACGTCGCGGAGACGACGCACGACTGCGGTGAAATCGGCGGTCTGGCTCAGTCCCGCAAGCGCTTCCTCCTGGTCGCCCGGCACGCCGAGAAAGTACCGCCTTTCTTGTATGAACCGGTGAAACGTCCGCTGCGCGCCGTCGGCCAGGTACTGGGCAACATGCCGCTGCCCGGCGACGAGTTGGCCGGACCGATGCACCGTGTACCGCGCCTGCAGTGGAAAACGTGGGTGAGGCTGGCATTCGTGGAGGCTGGCAGTGACTGGCGCAGCCTCAACCGGCTGGCGGTCGAGGATGGCCATCTGCGCGACTACCTCATCATGCCCGAGGGCCGAAATGGCTTCTTGGGGGTGAACGACTGGGAGGAGACGGTCGGCACTGTAGCTGGAGCTAGTCGCCCGGGGAACGGCAAGTTCTCCGTGGCGGACCCTCGTTTCAACCAGTCGGCCAAGTGGAAGGATGGTCAGGCCTACGGTGTCCGGCACTGGCACGGCACTGCCGGCGCTATCACCAGCCAGAAATCGCCAGGCCAGGGATCATTCGCTGTCGCGGACCCCCGCATAGATGGCGTTCGCCACAACAATGTCTTCCGCATCATGCCCTGGGCGGCGACCAGCCAGGCGGTCACCGCCGGAGGCGGACCTACTGCGGGTGGACTGGCAGTGGCCGACCCACGGGGCGCCACTGCCTTCGCCGGCAAGTACCAAGTGACAGGTTTCGACGAATCCGCCGGCACGGTCATCTCGGGCAGCACCACTGG
CCAGGGCGCATTCGCTGTTGCCGATCCGCGGCCCGGACTCGTCCGCGGCAAGGGCAGCCACTACCTCACCGCCGGCCACTATGGCGTCGTGCCCTGGTCCTCAAGCTGCGGCGCCGTCTCGGCCTCGGCCCGGCAGGACAACGGTTCTTGGTCGGTGGCCGACCCGCGTCTTCCGGCAGCGACCGACAAGATGGTGGCCGTGATCCGGGCCCTGGACGGCACCTGGCATCGGCCGTTCACCACTCTGGAGCTGGCCGCCCTGCAGGGCCTGGTCGATCCCGCCGAACAGCTCGAACTCGACGGACTGAACGACAGCGCCTGGCGGGAACGCATCGGCAATGCCGTGCCCGCCCCGGCCGCCGAGGCCATCGCCGGCGTGATGGGCACCACACTGCTGCTGGCTTGGGCCGGCGAGACATTCGCCTTGGGTAGCACGCCGATATGGGTTCGCCAGGTCGCCGCCGGCCTGTCGCTCAATCAACCCGACACCCGGGCCGCCCACTAA
Protein sequences of DBSCAN-SWA_3 >NC_014532|1900649:1931078|1919753_1920062_-|WP_013332410.1|DBSCAN-SWA MSNVTSIADRRGNRLTNPNCQHGRTYMVLRDSDAWLHLHEIRQAILERFGTMDSEPAISARIREIRREDGVNIASYEVPGGRSKAHVYKWFPPHDDGGKGAA >NC_014532|1900649:1931078|1917644_1918235_-|WP_109637645.1|DBSCAN-SWA MVFDAYRAQQGSVMAGASEVAALKAKIQKTQKQNNDWRIVDGLESGAVISAVESQPQHLQAMCRYCWGPFTRDELTEDMEWLHLSLLRAMMGRPLPGQGDKPHPPAKAWRTLTALAWAAIYHHGEVTYPYNRQGLVGPRAIKRWLEDERGTTIDVRRWSAAARLSWSDIWRQMLTTLDDWESRALGPVAALRLYAA >NC_014532|1900649:1931078|1912787_1914392_-|WP_013332402.1|DBSCAN-SWA MNRSDREREVKRAEAFLRAHAKGRLTEKADLLEALSLKWFRLNALYKIKDKQGKVRRFRPNRAQRRRYIKGHVRDIILKARQLGFTTFEMIDALDDCLFRDNFAAGCICHKLDDAQDIFRNKVSFAYQSIDDTWHRIFKMLGIRIPQPVSDKSGSGAYVFDNGSSIKVSTSYRGGTLQRLHISEFGKICRQYPHKAQEIVTGAFEAVGLGNQITLESTAEGREGYFFDYCQSAQHLQELGRAPTLMDFQFHFFPWWQEPAYRLDPEGVLVPQWLREYFEDLEHKHGIATDAAQQAWYAKKAEVLQDDMKREYPSTPAEAFEQSIEGAYYASQMRFLRKNKRITAEVQVNPSLPVFTAWDLGMNDTMAIWFAQIVGREIHLVDYLEGEGEGIEYYADELNKKGYRYGGHFGPHDLAVRELGTGKSRADVAQQYGIKFEVVPRISNHAEGVQAVRNFLPMCWIDEDACDEGVKCLDNYRKEWDDKRGVYKDTPRHDWASHGAKALETLARSSIFQQTNTPTPRPQKRERSGWAAHT >NC_014532|1900649:1931078|1920914_1921820_-|WP_049786211.1|DBSCAN-SWA MRSWRINEIEEEALIELPPEVERLYQHGIRKHMNYATGITGRKRRVSMDMFREVVDYHPPAGSREKARIHSRQQITRMLDKLEAAGLIERLHRGKGVKAAMEFRLPLACSDLDEHRAESEPTGASQKNPHSRASGEANTEQGASREERSTSGSPVTPLSPDGESVDAQQARPKRQRRQWGEPVDHELTNEMVAAVSADLEAPVKHNPDSWANEFRLMRQRDGRTVEQIRYLITWTAEHHFWSSVVLSPKKLRDKWDQLAKQVRGQRKPATRQDGRLGMAQPQPQGSYTPTDMDNLPDWMRD >NC_014532|1900649:1931078|1924432_1924576_+|WP_157953414.1|DBSCAN-SWA MEPKTKQDRRDEIDRAASEHSRIIEIDRWHERQRLKRELQEVWDEPA >NC_014532|1900649:1931078|1928586_1929213_-|WP_157953415.1|DBSCAN-SWA MLSVYRFLDTDYYRAELLSGRVRISTLEACRGYENPKRGDKGEGSWTYFSGHVQGDGDDPALQYVAQRSSISVDRSCRDITISECQSYTRIPDAFVLCWTLYSGDGFADGELGGYGVKLSSPRKLMNLIGHELNQRFKLRNAMMGAVSYRRRVQYSLEEDPCHIAFVKEPDGYADQVEFRMLWVPELLTNLKPVVVNVGDISDITRKI >NC_014532|1900649:1931078|1922936_1923599_+|WP_109637425.1|DBSCAN-SWA 
MLMDTLRNRLIETRRRIGLTQAQVARRSGMSQAAYQKLESGRSLSSRKLPSIARTLNVTAEWLEAGGDNPSQSQVDRTDAGNVSPITSGGRSVPEISWVQAGEWTEVENVEDINLSEVRHWPCPVSCSARTFALRVEGDSMAPTFPPGSIIFVDPEVPPISGKKVVAKLVDESKATFKQYIEDGDSKLLKAMNPNWPEPYVPINGNCVIIGTVVFAGTEV >NC_014532|1900649:1931078|1918957_1919251_-|WP_013332408.1|DBSCAN-SWA MFPRLKPYRSRTWLANVHEIENCVLCGKSGIQAAHSNQERGMSQKASDCLTAALCPECHGEIDNGRDMNRDERRARMNEAIVRTLDQLARRGLVRAV >NC_014532|1900649:1931078|1916689_1917268_-|WP_041602040.1|DBSCAN-SWA MSDKLYPAFGQFSIVVEAAQSSWGDWLSPVASLLGVVLGACLSYCMLRKNERARLKQEKLEEIVVSSGELEKECKQYVDAFAGSLYVDDEGQVHRQESSGPAVHDLNMLMDRVDFLMQAHLPEHDELRQRFGKSFVALFTEMSLRGVAPNMSEDDVQSTKKKLLEMRSQVTSSQQAIQKVALKEARKLLKVK >NC_014532|1900649:1931078|1921999_1922203_-|WP_041602042.1|DBSCAN-SWA MLSRESLGRVLERMSPESQSAARRESAARGMSVEDVVLEKCLADVEGQLYALRRRRPELRVVEGGRA >NC_014532|1900649:1931078|1905708_1906422_-|WP_013332397.1|DBSCAN-SWA MALTTVGTVINNAKLVLQEVTSAGTRWTNEELIGWLNESYQAIVQIKPDASTENASHELAEGTRQQIPADGLRLIDVVRNTASGSKKMGIMVASRQSLDTTRRGWHADPPSIDIEQYIFDDQDPTRFYTYPPAATGAEVEIIYSAVPAPHDVSQGLEGLANEAIKLNDSYGPVITDYILYRAYSKDAEHAKNLNRAQLHMEAYMNALGQKVEVDRSISPNAPDNSSNPTGSVGRTRG >NC_014532|1900649:1931078|1915133_1915685_-|WP_013332404.1|DBSCAN-SWA MNWSDVAETVGEAAPMVGSALYGRAGGAVGSMVAEVLGVETTPEAVNQVMNDDPEAAAKIKKVEAELEQARIEARTKRLGQVNQTMRAEVDSDSTLKGGWRGFTGWMFGLSLGGVMASIIFEMFAGTTDRIELLRQAVMMFGVFGAVLGIDINRVGKERLAAKGIDSRSFTFMDALKTRVAGK >NC_014532|1900649:1931078|1902352_1904677_-|WP_013332394.1|DBSCAN-SWA 
MFGPLNGAPLNGGGANGLWLYGGVLSLEPVADSTATPLRIKHATGSLSASADSVGASKRLATGWGTAAAAAFADAEGHELVFVAGKSATIADTTFASWAHRPMSGITDVEAAMDGAGVRHARMSHAESHSEAVATGDAHRITPAPDINLDDSLAVLDVSADRTAWMTPSTVEAPAALYCDAHITHIMGGQSVAEAASSIDYKTFWQASLRPLRTASLTWLAESRLDRLRVRQRFAEARTVVRARSRALPNSKIGRGSLWLESESRHVPGSLRLYRNHYGVASSSAASSAVPGRIRHASSISTGLATTHIAPDLTLGNTTYCYVSGRALALSRQVAAGLCITEASGKAASVSESAGRYFQIHAFTGIASAGGDVPPVHLTSINPWVRATGKSEAVAEQSAYTLRGRYIGLDALACESQQTAWCYRQVPTWDSMAGMSSVVSDVHRLAKARAKEIAISTVIAQANVYGLRRPDIRPLGGTASGQASAHKTAWMDGGQEAATATGLGAAVRLAVPVLAPEPNEAVFLASDPDITRYMRASAVAVAESSGENQPYRFTTTRSARARVQSAGVARKTARFVDDPAWAISSAGGGYDLNAFVRKVEPTSFVARAEGAGDARKTARYIDEAALASASPHAGYTRFAYLTPQPAINRSVTTGMPHLTRRMRGTSISRAEPTAYFARYRPASGQARASAQVPVTIYVWRFFEIRFRGDADSGGEMLRITFIEPARVAESWAETGRVSYLINPDSPAPDRRTITLMPGARVMSVPASVREYQVA >NC_014532|1900649:1931078|1916025_1916580_-|WP_013332406.1|DBSCAN-SWA MPRITHETGGGNVCALLDTLAYAEIGPDLLAASDDGYDVLVGSVPGDVHLFASYADHPLPTDDLAIEYADGVYSTAAGRYQILNTYWPHYKRQLGLPDFSPESQDRYAIQQLREQGALTLILDGYFKRAIAAASNIWASLPGAGYGQPEHEHETLARVYVEAGGLLSGEDSDWFDRVVKVRRGQ >NC_014532|1900649:1931078|1908621_1908774_-|WP_157953411.1|DBSCAN-SWA MAGPTRDGSHPDYSSTSSSGFIPQVWSGKLVEKLYASTCFGEIANTNYEG >NC_014532|1900649:1931078|1905147_1905705_-|WP_013332396.1|DBSCAN-SWA MTLDDLVHQVVQDVPEAPLMTIRDAITRMARELCTEADVWVQEGEPVVVAANTANPQIVASSGEPLRIVSLKIDGRECVQGQGFDQPTPTSIELYQRPNASMLYGRLACRPAMGDMPPSEVVDRWDMAIANGARWRLLMLPQPWRNPELANYYHQQYLADVATARQNSRLGHARGGARVKARRFI >NC_014532|1900649:1931078|1914378_1914936_-|WP_013332403.1|DBSCAN-SWA MAKRHDWEAVERDYRTGRFSLQQLSDRHGPSKSQISKKAAAEGWEKDLTGAVQQRTREKLSRPEASAPDVPESDIIEQASDENAAIVRGHRAALSRWRRIADRFADRLDQQLEAGEITVQLKSGELASIDLPLDYIAKAMGAGTQAFDRVIRLERQSYGLDQDDANDQEKTFEELMAEVAPDEPE >NC_014532|1900649:1931078|1900649_1902086_-|WP_013332392.1|DBSCAN-SWA 
MPQLFFNNARATLASAIDDVTTTIEVSAPVNLPTSLGANEWFLLTIFADTTRYGENIEIVRVTGISNNTLTVERGFEGTAVSHGAGESIEARATARTMDRIRSRIGETSLVKANASLDDRLRMDGLAYRKVDYPELAAMFPETLPAQEVVSGTPSLPDMGRGAAFSPDGAYLAMAHNGSPYLTIIDTSDWSVVSGTPSLPRSGWGAAFRPDGAYLAIVHEGSPYLTVIDTADWSVVSGTPSLPGTGNAVAFGSNANGTFLAVVHDGAPYLTIIDTSDWSVVSGTPSLPGGGEDVVFEPGGTYLAVAHRESPYLTIVAAADWSVANVAPTLPGMGRAIDISPISGIIAVAHNPVDFNSEESRFTVINPNDDWSVVSDAPVIQSSIADVKFSSDGAFLAIGYYGGSHLIVIDIATWTEIGGTPSLPNYGETVAWHPDGDYLAAGYLNAPNLTVTARSPDWFYLPELSSPHPSLEWRIKAQ >NC_014532|1900649:1931078|1906899_1907733_-|WP_109637419.1|DBSCAN-SWA MKSRDEARKGGNSTEIKNKGDTVEIRTTPSITIKDYVVGGGLNYEKPTSDKVELHIDKAKYFAFEVNDIDAYQSDIKLMDDWSDDAGQQMKIAIDTVILGDVYADAAPENAGPDAGVKSGSYNMGESGAPVSITKSNILDTIVDCGSVLDEQNAPDTGRYIILPAWMNGMLKKSDLRDASAMGDSTSVYRNGKVGMLDRFDVYVSNNLSTVTDATTSKKATNVLFGHKKALTFASQMTNMENLPNPQDFGKLVRGLNVFGYEMIDPNAAGHLYAQRG >NC_014532|1900649:1931078|1922223_1922553_-|WP_041602043.1|DBSCAN-SWA MLSHLETDRRCKPDGEHLEHLVKPRFTDAEFELIQAEAARRHGGRLAPLVHEATLIGIEVIQKRRRELLAKLANGEEPAADQQNELENLLAEMAERQFAGNSIDQSETA >NC_014532|1900649:1931078|1910069_1912277_-|WP_157953412.1|DBSCAN-SWA MRAEEEAEAQRIAEEEQRRRELVESSLGAHIRRCWESAKMARQSVDNRMLDCLRRRKGEYSPEKLQAIRKEGGAEIYMMLTATKCRAAAAWIRDIMMPANERPWGLEPTPVAEVPDAYLAPLASQIRQQAVEMQQQGQQVDMAALAEQAREQLRQAAEDAAREATERHEELISDQLAEGYWDEAFEAFIDDFVTYPAAFIRAPIFRRVPELQWLEGWRPVKGETIRPEFERVSPFDMYPSPDATSIDDGAFIIERARFTRSQLNQLIGVPSYSEDAIRQVLHQYGQGGLRDWLWSDGERAELEGRGHEWLTPGETIDGLIYSGGAQGVTLLQWGISPDEIEDPLAEYEVEAILIGQHVIRVRINRDPLERRPYHKSSFQPVPGSFWGQGIPELMADVQDVCNATARGLVNNLAISSGPQVEVYEDRLQPQEDPTDIYPWKIWRTKASIETGNNPALRFFQPQSNASELLAVYEQFEYRADESTNIPRYMYGSDEAGGAGQTASGLSMLMESANKGIKDAIRHIDRGVLRRVIEALWLHNMQFSDDNSIKGDASVVARGSSAMLIREQTNQLRQQFLQLTANDYDMGILGHDGRRKLLESIAEKLDLPGLIPSEEEMQKNLAQQRQDQQAQLQMEQAKAEAEAAEKQARAREANADAAQTEAETQQSQQMAPLEAQHLLAEIAKLITETRGRNDGGTGLEGAGQAGRVPGGEASGGTAQGGARGMPRQPGKVPRTR >NC_014532|1900649:1931078|1918473_1918836_-|WP_049786210.1|DBSCAN-SWA 
MRLFIPYLATSTNVGTRQHWGARKREVDACHLLVRNAVRCEGLGRVANRVDLVFTPRLGRRVRRRDTSNYSLNMKHLEDGLVAAGVLPDDRGQYVRRVILEPPEIDRRAETGTWVEIIEV >NC_014532|1900649:1931078|1908833_1909808_-|WP_013332400.1|DBSCAN-SWA MSLPRSVQAQAEAARQHFEPQPENPDDQRPAGSAPEEGQRNPQDEFSAPAKADEPSPTQPADSAEAQPKDEQDQGALYWQHRFNVLQGKYNSEIPALRQENSELKNTIAERDRRIAELEQQQPTGGADSGVSDEQLEQFKATYGEDLVDFVTRMVDQRAPSGKPADTGNTQELTERLERLESEKHQDAEARFWMNLEQAVPTFREVNADPKFHQFLAQFDPSTGVQRQEALNQAQQALDAKAVADIFQLYMNQAEQGAKRSVPDDQVEPRTTRATEAPQGQRIWTGQEISQFYRDKTAGRYGAEEAQRLEADIFAAQRDGRVKQ >NC_014532|1900649:1931078|1902089_1902356_-|WP_013332393.1|DBSCAN-SWA MKTFKKQPRDHLDYDIDLSEWMAEGDEIEHVEVTAPDGLEVTQVGIEPTRVKLWIKGGTDGESYKLSPLIYTESRTKEVDFMIIVVEM >NC_014532|1900649:1931078|1918258_1918477_-|WP_041602041.1|DBSCAN-SWA MNWDRVGKYHIQCGEYTICRTHHSGTKLFTVWYGNERIGIRHTAKEAMALAESHETGKRNGNEKGNGRKRGQ >NC_014532|1900649:1931078|1919250_1919754_-|WP_013332409.1|DBSCAN-SWA MGKSKRPRVYMRARKLIDPATGQMVGAFMPAGAADASIVRGRGYKDGELLRADLFLPRNPEFHRLAHALGGLVAENIEGFEGLDHHQAIKRLQEKSGEGCEITRTEVPGFGVLEHRAPYSIAFDEMDQAEFYKLMRALCRFIASEYWPHLEPEQVEEMIELMPTEAA >NC_014532|1900649:1931078|1904689_1905112_-|WP_013332395.1|DBSCAN-SWA MAAFSDYLESGILEHTLRGTTLPTPSSIYVALFTSDPTDADTGDEVTDSAYVRQDAAKGDSISSGWTSPTDSGDGKVSSNAKVIQFPPIADGSVTVTHYALYDAQSGGNMLYHGEFTVAKTLEINDVLSVDIGGIQVILR >NC_014532|1900649:1931078|1907773_1908172_-|WP_109637644.1|DBSCAN-SWA MLEAGMTTSDISRRSGFSQPIVHRAARRFGLPLNDTLHRGEIITHNGYRMVRAPDGHPHADSKGYVREHRLVMERKLGRYLEPGEVVHHINEDKLDNRPENLGLMELGEHTSFHHTGKVGRGPDKRPRKRKQ >NC_014532|1900649:1931078|1925061_1926162_+|WP_013332413.1|DBSCAN-SWA MTMTFETARRELEANGRVATIDMSEDVWLELRTLGIGASEAAAAVGASEHRTPYEIAERKLGMVDDEPEARYRRRQKMEMGHVMEPVTAARFAEFTGLNVQNHNWMQAHPDHPYMLANIDRRIVGVTDAQADWLELLMGRAVSGPGVAELKNVEFAKGWGKPDNVNTTGGLCTSGEVPEDYYIQIQHQLAVTGYEWAFLVVTIAGWETRWYPIPRDEELIDDLIALEGDLWATIQRGELPEIDVGHPKAIDILKRRYPGTDGSMVNLAELDHWRKVEQEAKAEIKKLETTARVARAHLLAGMGNAAVATFGDNEVMRRKVIKRDAYTAEVAASEYVDIRYGKMNKAEKDQLAAQDTESIYQQGEAA >NC_014532|1900649:1931078|1927190_1927556_+|WP_013332415.1|DBSCAN-SWA 
MSQHPTDVTRFFEDLDGGVLAERLGVILSHAASAATDNPKKKAKVTVDLEISNIGSGSQVGIAHKLAFKIPHAHGTQAEDHTTETVMYVNTGGEMTLEPKNQLDMIGHSHRQSTPQKEDDQ >NC_014532|1900649:1931078|1923601_1923922_+|WP_041602045.1|DBSCAN-SWA MSLFLKWFVFRCCISLVSAHAQDFRGAWEWVRTVLGLAVYAGFAFTVAFLGRMAWDAHWARLLSALAVMFMSFLVADLIVKTLRDNAALLSLPAVAIATWMAFLLW >NC_014532|1900649:1931078|1920084_1920912_-|WP_013332411.1|DBSCAN-SWA MDSAQQTDIGAYERMSQHLRQTLANQAQTREMRCLVHGDFTATLMPNGKWSDCPGCVNNDIERQRQTEHQNRAAHARNAQLEKLREGSMIPKRFQGRTLAGFLTDGHDRKTFALAACQKYVERFGDRLEQGGGMVLTGSVGTGKSHLAYGIGNALLDQGYRVMGIDVYELVDLIKERAFDRKDGTSEREAIKAFVAPLDLLILDEVGAQLGTEWERLMLFKIVNERYKAQLPTIIVSNIDASGLADYLGERIVDRMREGGGMTLVLDWPSYRDAA >NC_014532|1900649:1931078|1922601_1922856_-|WP_157953413.1|DBSCAN-SWA MTTCVVNPSMRTQKRLLRKAVELHGNSQSALAAAIGCSQTMVWKLLHGRSSVSIPMARAIHTATEGQCPEYELRPDFFEQPAVA >NC_014532|1900649:1931078|1926158_1927139_+|WP_013332414.1|DBSCAN-SWA MTADTQTAEAVQQNLQEPDFHAPAEASSDNGVELATNEVRNQFMKAMTPRNFDDVWRMSDMIAQSDLAPKDYKGKPGNVMIAWQTGVELGITSPLQAIQNIAVINGRPTIWGDMMLAICRAAPAWSEADFKEWIEGEGAHAVAHCTVRRRPNGNVAHYTFSMKDAQDASLVGKQGPWQQYPKRMMQMRARAFALRDTFTPELKGIRMAEEERDITPETNAPSESASSGGQQQRARTASRVGSKLAQRRQKTQQRQQAQEVYEAQQESEAPAQPLNCAQVCEQIKQATTMAELNEAADLAQQLPEDDKGVARDAYEVRRSEIRQAAQ >NC_014532|1900649:1931078|1929272_1931078_+|WP_013332417.1|DBSCAN-SWA MPCTQEVRHFHLFCGLGGGAAGFNCGHARVGAMDAQFRCIGGVDSDPAAIADFGRLAGTPGTVLDMFDRDQYIAFHDAEPPGDWREAMPADIQAAAGGERPHIVFLSAPCKGFSGLLSQKRSTTDRYQALNRLTVRGMWLALEAWADDPPELFIFENVPRIANRGRPLLDSITAMLERYGYHVAETTHDCGEIGGLAQSRKRFLLVARHAEKVPPFLYEPVKRPLRAVGQVLGNMPLPGDELAGPMHRVPRLQWKTWVRLAFVEAGSDWRSLNRLAVEDGHLRDYLIMPEGRNGFLGVNDWEETVGTVAGASRPGNGKFSVADPRFNQSAKWKDGQAYGVRHWHGTAGAITSQKSPGQGSFAVADPRIDGVRHNNVFRIMPWAATSQAVTAGGGPTAGGLAVADPRGATAFAGKYQVTGFDESAGTVISGSTTGQGAFAVADPRPGLVRGKGSHYLTAGHYGVVPWSSSCGAVSASARQDNGSWSVADPRLPAATDKMVAVIRALDGTWHRPFTTLELAALQGLVDPAEQLELDGLNDSAWRERIGNAVPAPAAEAIAGVMGTTLLLAWAGETFALGSTPIWVRQVAAGLSLNQPDTRAAH >NC_014532|1900649:1931078|1927552_1928350_+|WP_013332416.1|DBSCAN-SWA 
MSLTKEALQHIEESHKTGTEAANGDALLVGSGFELIDLEKYRDLRRRFRGTFKTQGLEAFTGYLAERAAANTPVFIDRERMAAKCYLDIGIQDAPGHCEHSAILNLPETPEFEAFHRANGSTFDQDGIVELLEDWGHLMEFANSKGESLEYRTVLHAFRNVSIDDLTSIDSEKQEHSSQVGVMNKVTVKHSERLPATITWRFSPYEGLMKRELSMRVATITKGGPSFRLRAMALEAVEKDIAEEFSNEIVADLDNCECLLGIFNP >NC_014532|1900649:1931078|1912315_1912762_-|WP_041602039.1|DBSCAN-SWA MPEIITTPDGDISRRAGNTAYRLNGMTRAEVTELMRDAARTIAEQLIEAGIPTASERFRPDVGPVQIDMIVIEERVTRPEPGMRLQFEVAGDMGVTLNIKLLEFAADPAGYIRDLFAQLAPMRRNVMRLRRNKQDANRAIYRATGGSQ >NC_014532|1900649:1931078|1915681_1916029_-|WP_013332405.1|DBSCAN-SWA MTTQAQGNTQGNSQQEYRHRRPFEFERHFQTGIQLVLVGLLAWAGLKLVSLGEENAALQERITYQGEQIVSLRRDIREWSNLYVRKSEAEQRAEEVNSRLDALGERLSTLEEQSR >NC_014532|1900649:1931078|1924729_1924978_+|WP_041602046.1|DBSCAN-SWA MTFHTHIAGIPCLCEVTHYSAARPMRITGTGFGDAEPPEPVEFEFRILDRRGRLAEWLERKVTQSDEARLLAEYRAEESGAA >NC_014532|1900649:1931078|1906425_1906821_-|WP_013332398.1|DBSCAN-SWA MSLSERIENAETKDELEKIGKDELGVDVDKRKGLETIRVDLQSLAEDQVSEGQGKTPAQNDAESPDKDAPESADTSQEGTPASKMADAEKTPDQGESAPAQEKPRLLRHRGNGRVFTYSAALAAKRDMEEV |
40 | Escherichia_phage(13.64%) | NA | NA |
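The protein records above are flattened onto a few long lines, so it can help to split them back into individual entries. The following is a minimal sketch, assuming the header layout seen in those records (`>contig|region_start:region_end|gene_start_gene_end_strand|protein_id|DBSCAN-SWA`, followed by whitespace and the amino-acid sequence). The names `ProteinRecord`, `RECORD_RE` and `parse_records` are hypothetical and are not part of DBSCAN-SWA.

```python
import re
from typing import List, NamedTuple


class ProteinRecord(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    protein_id: str
    sequence: str


# Assumed header layout, inferred from the records on this page:
# >contig|region_start:region_end|gene_start_gene_end_strand|protein_id|DBSCAN-SWA SEQUENCE
RECORD_RE = re.compile(
    r">(?P<contig>[^|\s]+)\|(?P<rstart>\d+):(?P<rend>\d+)\|"
    r"(?P<gstart>\d+)_(?P<gend>\d+)_(?P<strand>[+-])\|"
    r"(?P<pid>[^|\s]+)\|DBSCAN-SWA\s+(?P<seq>[A-Z]+)"
)


def parse_records(text: str) -> List[ProteinRecord]:
    """Return one ProteinRecord per '>header SEQUENCE' pair found in text."""
    return [
        ProteinRecord(
            contig=m["contig"],
            region_start=int(m["rstart"]),
            region_end=int(m["rend"]),
            gene_start=int(m["gstart"]),
            gene_end=int(m["gend"]),
            strand=m["strand"],
            protein_id=m["pid"],
            sequence=m["seq"],
        )
        for m in RECORD_RE.finditer(text)
    ]


if __name__ == "__main__":
    sample = (
        ">NC_014532|1900649:1931078|1924432_1924576_+|WP_157953414.1|DBSCAN-SWA "
        "MEPKTKQDRRDEIDRAASEHSRIIEIDRWHERQRLKRELQEVWDEPA"
    )
    for record in parse_records(sample):
        print(record.protein_id, record.strand, record.gene_end - record.gene_start)
```

Because the pattern accepts any whitespace between header and sequence, it also handles records whose sequence starts on the following line of the dump.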
Homologous phage analysis in the prophage region
Colored bacterial proteins are those that match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
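The keyword-based highlighting described above can be illustrated with a short sketch. It assumes a plain case-insensitive substring match against a protein's annotation text; the page does not state how the matching is actually done, and `PHAGE_KEYWORDS` and `matched_keywords` are hypothetical names used only for illustration.

```python
from typing import List, Sequence

# Keyword list copied from the description on this page.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation: str,
                     keywords: Sequence[str] = PHAGE_KEYWORDS) -> List[str]:
    """Return the keywords found in an annotation string.

    Assumption: case-insensitive substring matching, so 'plate' also
    matches 'baseplate'. The real tool may match differently.
    """
    text = annotation.lower()
    return [kw for kw in keywords if kw.lower() in text]


if __name__ == "__main__":
    for description in ("phage tail fiber protein", "hypothetical protein"):
        hits = matched_keywords(description)
        status = "colored" if hits else "not colored"
        print(description, "->", status, hits)
```

Substring matching is the simplest choice but can over-match (for example, 'head' inside an unrelated word); a word-boundary regular expression would be a stricter alternative.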
DBSCAN-SWA_4 |
3219370 : 3230398
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_014532|3219370:3230398|DBSCAN-SWA ATCAGAGCTTCGACTCGACCCACACGGGCACGCTATCCAGCGCACCGGGCAGGGCCGGGACATCGCTGCCACCGGCCTGGGCCATGTCGGGGCGTCCCCCGCCCTTGCCGCCGACCTGCGAGGCCACGTGATTGACCAGCTCACCGGCCTTGATCCGGCCGGTCAGGTCATCGGTGACGCCGGCGATCAGGCTGACCTTGCCGGCCTGTTCATCGCCCACGCCGAGCACCACCACGCCCGAGCCGAGCTTGTTCTTGAGCTGGTCGAGCAGGCCGCGCAGCTCCTTGCCGGAGACGCCCTCGAGGCGCTTGGCCAGCACCTTGATGCCCTTGACCTCGTGGGCTTCGCCGAGCATGTCGCTGCCGGCGGCGCTGGCCAGCTTGGCCTTGAGGCGCTCGAGTTCCTTTTCCAGCTCGCGGTTGCGCTCGACCAGCGAGGTCACGCGCTCCTCGACCTGCTCCGGCTTGGCCTTCAGACGCTCGCCGATGCGCGCCACTCGCGCTTCCTGCTCGCGGAAGTAGGCCAGGGCGCCTTCACCGGTGATCGCCTCGATACGGCGCACGCCACTGGCGATGCCGGCTTCGGCGACGATGTGGCAACAGCCGATGTCGCCGCTGCGCGCCACGTGAGTCCCGCCGCACAGTTCGATGGAGAAATCGTCGGCACCGATGGTCAGCACCCGCACGCTGTCGGCGTACTTGGCTTCGAACAGCGCCGCCGCACCCTTTTCCTTCGCCTCGTCGAGCGTCATGTGCTCGATGCGGGTCGGCGCGTTGGCCAGCACCTGCTCATTGACCAGTCGCTCGACCTCGGCCAGCTGCTCGGCGGTCATCGGCTCGAAGTGGCTGAAGTCGAAACGCAGCCGCTCCGGAGTCACCAGCGAGCCCTTCTGCTGAACGTGATCGCCCAGCACCATACGCAATGCCTGGTGCAGCAGGTGGGTCGCCGAGTGGTTGCGTACCGTGGCGGCCCGCAGGCTCGGGTCGACGCGCGGCGTCACGCTGGCGCCAACCTCGAGCTCGCCTTCGAGCAGGCGCCCCTGGTGCAGATGATGCCCTGCCTGCTTCTGGGTGTCCTCGACGAGGAAACGCCCGCCCTCGAGCTCGAGATAGCCGGTGTCGCCGACCTGGCCCCCGGACTCGCCATAGAAGGGCGTGCGGTCGAGCACCACCACGCCCCGCTGCTCGGCCTCGAGCCGTGCCAGGGCATTGCCCTGCTCGTCGACCAGGGCCACCACGGTGGCGCGATCCTCGAGCCGGTCGTAGCCGGTAAAATCGGTCTGGCCCTCGAGCTCCAGCGCCGCACCGTAGTCGACGCCGAACTGGCTGGCTGCCCGGGCGCGCTCACGCTGGGCCTCCAGTTCGCGCTCGAAACCGGCCTCGTCCAGCGAGACCCCGCGCTCGCGGCATACATCCGCGGTCAGGTCGAAGGGGAAGCCATAGGTGTCATAGAGCTTGAACACCGTCTCCCCGGGCAGGGTATCGCCCTCGAGGGCATCCAGCGCTTCCCCGAGCAATCCCATGCCATGCTCCAGGGTCCGCGCAAACTGCTCCTCTTCCTTGAGCAGCACGCGCTCGATCTGATCGCGGGCCTCGCGCAGCTCCGGATAGGCCTCGCCCATCTCGGCGTCCAGCGCCGCCACCAGCTTGTGGAAGAAACAGCCCTGGGCTCCGAGCTTGTGACCGTGACGCACGGCACGCCGGATGATGCGCCGCAGTACATAGCCGCGCCCCTCGTTGGACGGCAGCACGCCGTCGGCCACCAGGAAGGCGCAGGAGCGAATGTGATCGGCGATCACGCGCAGCGAAGGCGCGGAGGTGTCATCATGCCCGGTGGCTTCGGCAGCGGCGGCGAGCAGGTTCTGGAACAGGTCGATCTCGTAGTTCGAGTGCACGCCCTGCATGACCGCGGCGATGCGCTCCAGC
CCCATGCCGGTATCGATGGACGGCTTGGGCAAGGGGTTCATGTTGCCGGCGGCATCGCGGTCGTACTGCATGAACACCAGGTTCCAGACCTCGATGTAGCGGTCGCCGTCCTCCTCGGGGCTGCCGGGAGGTCCACCCCACACCTCGGGGCCATGGTCGAAGAAGACCTCGGAACTCGGCCCGCAGGGGCCGGTATCGCCCATCTGCCAGAAGTTGTCCTCATCGAGACGGGAGAAGCGCTCGGGGTCGATGCCCACCTCGTCCTTCCAGATGCGTTCGGCCTCATCGTCGCTGACATGGACGGTGACCCAGAGCTTTTCCCTGGGCAGGCCGAGCACCTCGGTGAGGAAGTTCCAGGCAAAATGAATGGCATCGCGCTTGAAGTAGTCACCGAAGCTGAAGTTGCCCAGCATCTCGAAGAAGGTGTGGTGACGCGCGGTATAGCCGACGTTGTCCAGGTCATTGTGCTTGCCGCCGGCGCGCACGCAGCGCTGGGCCGAAGTCGCTCGCACATAGGGGCGCGGATCGCGCCCCAGGAAGACGTCCTTGAACGGCACCATGCCGGCATTGGTGAACAGCAGGGTCGGGTCGTTGTCCGGCACCAGGGAGCTCGACGGGACGATGGAGTGCCCTCGCTCCTCGAAGTAGCTCAGAAAGGCCTGTCGGATGTCTGCACTTTTCATGGAATATCCGTGACGATAAGCATGGTGGCCCTCCGAGCGGAGAACGCCGCTCGGGGCACTGGGAAGCACTCAATAAATCGGCAAGCGTGCTCGAGCACGCGGCGCTCGGGATATTGCGACCGACGCTGGCGGGATTGGCCGAGATCTCGGCCGGGCCGGCGGCCATCGCGCAGCGATCGAGGCGCCGAGCCACGCAGGCATTTAGTCAGTGTTTCCTCCGCAAAGGGAGCCATTATAGCGCAGTCGTCAAGCAACGACAGGGCGGGAGGAGGCAGGATTGTCTCAGATATCGTCCCAGGCATGGGTCAGGGCGTGCCGGAGCTGCTCGAAATCGAACCCCCGGCCGGCCAGGAAGCGCTCCCTTCGGGCCCGCTCCCGCGGCGTCTCGCCTGGTCCCGTGAAACGGCGCGCCAGCACCTCGGCGGCCAGCTCGAACCAGTCGACCTCGGAGGCCAGCTCGGCGAAGGCCTGGCGCGCGATGTCATCGTCGATGCCACGCCGACCGAGCTCGGCACGTAACTTGATCGGCCCCTGCCCCCGGGCAACCCGGGAGCGCAGGAAGCTTTCGGCGAAACGCGCATCGGACTGCAGGCCCTGCTCTTCCAGCTCGAGCAGGCAGGCCTCTACGTCTTCGGCGGCATGGCCCTTCGCGGCCAGGCGCTCGCGCAACTCCCCCCGGGCATATTCCCGCCGCGCCAGCAGGCGGATGGCATCGTCCCGGGGTGTGCTCTCGGCCGTTTCGCGCATCTTTTACAGCAGGCCATCCTCGCGTTCGCTGCCCTCGCCGATCGGCGCCTCGGCCTCGCTCTCTTCCTCGCGAGGTTCCGGGGTAGCCAGCAACTGGGCGCGAATCTGGCTTTCGATCTCTTCCATCACCGCCGGGTTGTCCTCGAGGAACTGGGCGGCGTTGGCCTTGCCCTGGCCGATCTTGTTGCCCTGATAGCTGTACCAGGCGCCCGCCTTGTCGACCAGGCCACACTGGACGCCCAGATCTACCACTTCCCCGGCGTGGTAGATGCCCTTGCCATAGAGGATCTGAAACTCGGCCTGACGGAACGGCGGCGCTACCTTGTTCTTGACCACCTTGACCCGGGTCTCGTTGCCGACGACCTCGTCGCCCTGCTTCACCGAACCGGTGCGACGGATATCCAGGCGCACGCTGGAATAGAACTTCAGGGCGTTGCCGCCGGTGGTGGTCTCGGGGCTGCCGAACATCACGCCGATCTTCATGCGGATCTGGTTGATGAACACCACCATGCAGTTGGCGTTCTTCATGTGACCGGTCACCTTGCGCAGTGCCTGGGACATCAGGCGCGC
CTGCAGGCCGACGTGGGAGTCGCCCATCTCGCCCTCGATCTCGGCGCGCGGCGTCAAGGCGGCCACCGAGTCGATGATGATCACGTCGACACCGCCCGAGCGCACCAGCATGTCGCAGATTTCCAGCGCCTGCTCACCGGTGTCGGGCTGGGAGACCAGCAGGTCGTCGAGATTGACGCCGAGCTTCTCGGCATAGCTGGGATCGAGGGCGTGCTCGGCGTCGATGAAGGCGCACGTCTTGCCCTGCTTCTGCGCCTCGGCGATGACCGCGAGCGTCAGGGTGGTCTTGCCCGAGGACTCGGGGCCGAAGATCTCGACGACGCGGCCGTAGGGAAGACCGCCGATGCCCAGGGCGATATCCAGGCCCAGCGAGCCGGTGGAGACCGATGGCATCGCCACACGCGGCGTGTCGCCCAGCCGCATTACGGTTCCCTTGCCGAACTGTCGATCGATCTGGGATAGCGCGGCGTTCAGCGCCTTGGAACGATTTTCGTCCTGAGCCATGACAGGCCTCCTAAGCGATAGCGAATAATCGAGAGCCGGCCGGCAGCCATGCCCTCTTCACTGTATGGTTGCACAGTAGTATGACGAAAAAAGCGCCGCTCGACCAGCCCCGCACGTGCTAGCTATTCGTGCGGCGTCAGCCAGCGGATCAAACCCACCAGCGCCTCGCGAACCGCCGCTTCGCGCACGGCATCGCGATCGCCGGGGAAGTGCCGACATTCGGCCGCGCGCTGCCAGCCGTCGCCCCAGGCCAACCATACGGTGCCCACCGGCTTGTCGTCGGTGCCGCCGCCCGGGCCGGCGACGCCACTGATGGCCACGGCCAGGTCGGCCCCGCTCTCGGCGCAGGCCCCGGCCACCATGGCCTCGACCACCTGTCGGCTGACCGCGCCATGTTCGGCCAGCATGGTCTCGGAGACACCCAGCAGGCGCGTCTTGGCGGTATTGGCATAGGTCACGTAGCCGGTCTCGAAGTAATCGGAGCTGCCGGCCACCGAGGTGATGGCGCTGGCCACGCCACCGCCGGTGCAGGATTCCGCGGTACTGACCACGATGCCCCGCTCGCGGCACAAACGTCCCAGGTACTCGGCGAGTCTCGCCGGATCGAGGGGATCCGGACTTTCGGCATGTACCGCCATGGGATGTCCTCCAGACCGTTGGTGACTGGACGACGACGCTCCTGACGCCGAACGTCCTTGAGCGTAGAATACTGCCTTTGCCCGCGACATGCATGACATGCGCGTCGCCATGATTCCTCTGTCGTTCTTCGCACCATCACCACCGGCAAGCCAAGGAAGCCCATGTCACAGGCCAGCGCCCAGCACACCCCGATGATGGCCCAATATCTGAAGATCAAGCGCGAACATCCGGACGTGCTGCTCTTCTATCGAATGGGTGATTTCTACGAACTGTTCTTCGATGATGCCAAGCGCGCCGCCGCCCTGCTCGACATCACCCTGACCCAGCGCGGCCAATCGGCGGGCAAGCCGATCCCCATGGCCGGCGTGCCCTATCACAGCGCCGAGGGCTATCTGGCGCGCCTGGTCGCCGCCGGCGAATCGGTGGCCATCTGCGAGCAGATGGGCGATCCGAATACCACCAAGGGCCCGGTGGAACGCAAGGTCGTGCGCATCGTCACGCCGGGCACCCTGTACGACGAGGCGCTGCTCGACGCCCGACGCGACAACCTGGTGGTCGCGCTGCATGCCGCCGGCGAACACTGGGGTCTGGCCTGGCTGGAGCTGTCCAGCGGGCGCTTCAGCGTGCTCGAGGTCGAAGGTGAAGCCGAGGCGATGGCCGAGCTGACTCGCCTGGACCCGGCCGAGTTGCTGGTCGCCGAAAGCCTGACCCTGCCGCCCGGCCTCGAGGCGCATCGGGGCCTGCGTCGGCAGAACGACTGGCTGTTCGACCTGGAAAGCGCCACGCGCACCCTGTGCGACCAGTTCGCTGTCCAGGATCTGAGAGGCTTCGGCTGCGCGCATCTGGAAGCCGCCATC
ACCGCTGCCGGCGTGCTGATCGACTACGCCCGCGACACCCAGCGCTCGCAGTTGCCCCATGTCACCGCCATCGGCGTGGAGAGCCGCGACGACGCCGTGGTGATCGACGCCGCCAGCCGCCGCAACCTGGAGATCGGCGTCAACCTGGGAGGCGGCACCGACAACACCCTGGCCAGCGTACTCGACACCACCTCCACCGCCATGGGGTCGCGCCTGCTCAAGCGCTGGCTCAATCGGCCACTGCGCGATCGCGCCCAGGTATCGGGTCGCCAGGCCGCCGTCGCCGCCCTGCTCGACGGCGACGGCTTCGTCGCACCCCGCGAGGCCCTCAAGGCGATCGGCGACGTGGAACGCATCCTGGCCCGGGTCGCCCTCTACAGCGCCCGCCCCCGGGATCTCGCCAGACTGCGCGACGCCTTGAACGCCCTGCCCGAGCTCGAGGCCGAACTGGCCCGCTTCGACGAGGGCACCGCCCTGGACGATCTCAAGCGGCGCATCCATCCCTACCCGACGCTGGCCGATACGCTGAGCCGGGCCCTGATGGAGAACCCGCCGGTGGTGATCCGCGACGGCGGCGTGATCGCCGAAGGATTCGACGCCGAGCTCGACGAACATCGCGGCCTGGCCGAGCACGCCGGCGATTACCTGATCGAACTGGAGACTCGGGAGCGCGAACGCACCGGCCTGCCCGGCCTCAAGGTCGGCTACAACCGCGTTCACGGCTACTATATCGAGATTCCCCGCGCCCAGGCCCGGGATGCGCCGGTAGACTACATCCGTCGCCAGACCCTGAAGAACGCCGAGCGCTTCATCATCCCCGAGCTCAAGGAATTCGAGGACAAGGCGCTATCGGCCAAGTCCCGTGCGCTGGCCCGCGAGAAACTGCTCTACGATGGCCTGCTCGACGACCTCAACGCCGAGCTTTCCACCCTCCAGGCTACCGGCCAGGCGCTGGCGGCACTGGACGTGCTCGCGACCCTGGCCGAGCGCGCCCGGGCGCTGGACTTCGTGCGTCCAGAGCTGCCCGAAACGGCAGGCTTCAGCATTCGCGGCGGCCGCCATCCGGTGGTCGAGCAGGTCAGCGAGACACCTTTCGTGCCCAACGACCTGAACATGGATGACGAGCGGCGCATGCTGGTGATCACCGGCCCCAACATGGGCGGTAAATCGACCTACATGCGACAGGCCGCCCTGATCACCCTGCTCGCCCACACCGGCAGCTTCGTGCCCGCCGACGCCGCCTCTATCGGCCCCGTGGATCGCATCTTCACCCGCATCGGCTCCTCGGACGACCTGGCCGGCGGGCGTTCCACCTTCATGGTCGAGATGACCGAGACCGCCAACATCCTGCACAACGCCACCGATCACAGTCTGGTGCTGATGGACGAGATCGGGCGCGGCACCAGCACCTTCGATGGGTTGTCGCTGGCCTGGGCCAGTGCCGAGCAGTTGACCCGCAGTCGCGCCTTCACGCTGTTCGCCACCCACTACTTCGAGATGACCGCACTGGCCGAACAAGCCAGCGGCGTGGCCAATGTCCACCTGACGGCCACGGAGCACAAGGAAGGCATCGTCTTCATGCACCGGGTGGAAGACGGCCCGGCCAGCCAGAGCTACGGCCTGCAGGTCGCCCAGCTCGCCGGCGTACCCCAGGCAGTGATCGCCCGGGCGCGGGAAAAGCTTGCCAGCCTCGAGCAGCAGGAAATCGACCAGGGCCAGCGCAGCCTGTCCACGGACGACAGCGCCGCCTCAGGCAGCGCCCCGCAACAGGCCGACCTGTTCGCCAGCGCGCCGCATCCCATGCTGGAAAGCCTGGGAAGCCTGGGACTGGACGACATGACGCCGCGTCAGGCGCTGGAGTGGCTCTACCACTGGAAGGAAAGGCTGTAATCGCCTTGCGGTTGGCGTGGCGGCCCGACTAAAATGGGGCCAACAAGCTCACTATATTATAAAAGCAACAGTTTTTATAACCCAAATTCATCATACGAGACCGACTGGCC
GCGGCGCCACCCGGCCCGAGGAAGGGAGACAGACATGACATTCGTCGTCACCGAAAACTGCATCAAGTGCAAGTACACCGACTGTGTCGAGGTCTGCCCGGTGGACTGCTTCTATGAAGGCCCCAACTTCCTGGTGATCCATCCCGACGAGTGCATCGACTGCGCCCTTTGCGAACCGGAGTGCCCGGCCGAGGCAATCTATTCGGAGGACGAGCTCCCGGAGGGACAGGAACAGTTCATCGAGATCAACGCTGAACTCTCCGAGACCTGGCCGAACATCACCGAGAAGAAGGATCCACCGGAAGACGCCGAGGAGTGGGACGGCAAGACTGGCAAGCTGGAGCACCTGGAGCGCTAGCGAAGCACGCTCGGGGAAACGCTGAATAAAACGCCGAACGCCGCTCAGGCATTCGATCAGCGTTTCCTCGGAACCGTGCCATTGGCTGAATACAACAGCGCCGAATGTCTCTCGACATTCGGCGCTGTTGCGTTTGCCTTGAATGAACGCAAGGAGAACATCAGACAAAAAAAGGCGGCCCGCAGGCCGCCCGCTCCTAGCCAATCCTTGAGCGAATCCCTTCGCTCATCGTCCTGCGAGTGCTGTCCCTGCTCGGAGGCGACTTCCTGTCACACCGAGCGCACTCGAGACTGCATCCCGTGGCATCCTGCCGGAAGAGCTCCGCTCTCCCTTGTCCCGAGTGCATTGTGGCACGCCTGAGGAGCTTCTCAAGTGGCCCCGATGGCATCATGACGAGCGCAGCGTCCGGCTCCGTTTTTAAAAACCCTCACAATATCAAAAAGATAAAAAAAGACCGCTGCAAAAAGCAGCGGTCTTCGTAGGAATTTTGTCGTTAATGTCTTCGGCTATCTGTAGGAAATCTCCTACAAAGCGAAGAGCTAATCACAACAATGCCCGCTTATTGACCTTTAATGGCGCTGCGCCCGGGATAGGCCACGCGCTCGCCAAGGTCGCGCTCGATCACCAGCAGGCGATTGTACTTGGCAACCCGGTCGCTGCGGCACAGCGAGCCCGTCTTGATCTGGCCGGCGCTGGTGCCCACGGCCAGGTCGGCGATGGTGGTATCCTCGGTCTCGCCGCTGCGATGCGAGATGACGGCGGTGAAGCCGGCATCCTGGGCCATGCGAATGGCATCCAGGGTCTCGGAGAGCGAGCCGATCTGGTTGAACTTGATCAGGATGGAATTGCCGATGCCCTCGTCGACGCCGCGCTTGAGGATGCGCGTATTGGTGACGAAGAGATCGTCGCCGACCAGTTGCACGCGGTCGCCGAGCTTGTCGGTCAGCGCCTGCCAGCCGTCCCAGTCGGACTCGTCCATGCCGTCCTCGATGGAAACGATGGGATACTGGTCGCACAGCTCGGCCAGATAGTCGACGAAACCGGCGGCGTCATAGGACTTGCCCTCGCCGGACAGCGCGTACTGGCCATCCCGGTAGAATTCGCTGGAGGCGCAATCCAGCGCCAGCGTGACATCGCGACCGAGTTCATAGCCGGCATCCGCCACGGCCTGCTTGATCACCGCCAGTGCCTCGGCATTGGAGTCGAGGTTCGGCGCGAAGCCCCCCTCGTCGCCCACGGCGGTGGCCAGGCCACGCGCGGCCAGCACCTTCTTCAGGGCGTGGAACACCTCGGCGCCCATGCGCAGGGCCTCGCGGAAGCTCTCAGCGCCCACCGGCTGGATCATGAATTCCTGAATGTCGACATTGTTGTCGGCATGCTCGCCGCCGTTGAGGATGTTCATCATCGGCACCGGCATCAGGTACTGGCCCGGCTGGCCGTAGAGCTCGGCGATATGGGCATAGAGCGGCATGCCCTTGGCGTTGGCCGCTGCCTTGGCGGCCGCCAGGGAGACCGCGAGGATGGCGTTGGCGCCCAGCTTCGCCTTGTTGTCGGTGCCATCGAGCTCGAGCATGGCTTCGTCCAGGCCCCGCTGATCGCGGGCGTCCATGCCCAGCAGGCGCTCGCGAATCTCGCCATTGAC
GGCCGCCACGGCCTTCGACACGCCCTTGCCAAGGTACCGGGACTTGTCACCGTCGCGCAGTTCGAGGGCCTCGCGGGAGCCGGTGGAGGCCCCGCTGGGCGCGCAGGCTTCGCCCACGGCGCCGCTTTCCAGGCGCACTTCGGCCTGCACGGTCGGGTTACCACGTGAATCGAGCACCTCGAGGGCGCGGAGTTCGACGATCTTGGTCATAGGTCGTGTCGTCCTTAAGCGAGTCAGTCAGCGTTGAAAGAATCAGGCAATCTCGAGTGGTTCAAACCCCTTCACCAGGTCGTCCAGTGCCTTGAGTTGCGCCAGGAAGGGTTCGAGCTTGTCCAGTGGCAAGGCACAGGGGCCGTCGCACTTGGCATTGTCGGGGTCGGGGTGCGATTCCAGGAAGAGGCCGGCCAGGCCCACGGCCACGCCGGCACGTGCCAGCTCGGCGACCTGGGCGCGACGCCCGTCGGCACTGTCGGCACGCCCGCCGGGGCGCTGCAGGGAGTGGGTGACATCGAACAACAGCGGATAGCCGGTCTGCTTCATGTCACCGAAGCCGAGCATGTCGACCACCAGGTTGTTGTAGCCGAAGCTCGAGCCGCGTTCACAGAGGATCAGGCGGTCGTTGCCGGCTTCCTCGCACTTGCGCAGGATGTGGCGCATCTCATGGGGCGCAAGGAACTGCGGCTTCTTGATGTTGATCACCGCGCCGGTTTCGGCCATGGCCACCACCAGATCGGTCTGGCGGGCCAGGAAGGCCGGCAACTGGATGACGTCGGCGACTTCCGCCACCGGCGCGGCCTGCCAGGGCTCGTGCACATCGGTGATCACCGGAACCCCGAAGCGCTCCTTGATCTCGGCGAGCATCGCCACGCCATCCTCCAGGCCGGGGCCACGGAAGGAATGGATGGAGCTACGGTTGGCCTTGTCGAAGCTGGCCTTGAACACATAGGGCATGCCGAGCCGGGAGGTGACCTCGACATAGGCCTCGGCAACCTCCAGGGCCAGCTCTCGGGACTCCAGCACGTTCATGCCACCGAACAGGGTCAGCGGCTGGGAATTGCCGACCCGAAGGCCGGCGAATTCGATGAGGCGTTCGGGGTCGGTCAT
Protein sequences of DBSCAN-SWA_4 >NC_014532|3219370:3230398|3229546_3230398_-|WP_013333506.1|DBSCAN-SWA MTDPERLIEFAGLRVGNSQPLTLFGGMNVLESRELALEVAEAYVEVTSRLGMPYVFKASFDKANRSSIHSFRGPGLEDGVAMLAEIKERFGVPVITDVHEPWQAAPVAEVADVIQLPAFLARQTDLVVAMAETGAVINIKKPQFLAPHEMRHILRKCEEAGNDRLILCERGSSFGYNNLVVDMLGFGDMKQTGYPLLFDVTHSLQRPGGRADSADGRRAQVAELARAGVAVGLAGLFLESHPDPDNAKCDGPCALPLDKLEPFLAQLKALDDLVKGFEPLEIA >NC_014532|3219370:3230398|3227328_3227652_+|WP_013333504.1|DBSCAN-SWA MTFVVTENCIKCKYTDCVEVCPVDCFYEGPNFLVIHPDECIDCALCEPECPAEAIYSEDELPEGQEQFIEINAELSETWPNITEKKDPPEDAEEWDGKTGKLEHLER >NC_014532|3219370:3230398|3223920_3224436_-|WP_013333502.1|DBSCAN-SWA MAVHAESPDPLDPARLAEYLGRLCRERGIVVSTAESCTGGGVASAITSVAGSSDYFETGYVTYANTAKTRLLGVSETMLAEHGAVSRQVVEAMVAGACAESGADLAVAISGVAGPGGGTDDKPVGTVWLAWGDGWQRAAECRHFPGDRDAVREAAVREALVGLIRWLTPHE >NC_014532|3219370:3230398|3228211_3229504_-|WP_013333505.1|DBSCAN-SWA MTKIVELRALEVLDSRGNPTVQAEVRLESGAVGEACAPSGASTGSREALELRDGDKSRYLGKGVSKAVAAVNGEIRERLLGMDARDQRGLDEAMLELDGTDNKAKLGANAILAVSLAAAKAAANAKGMPLYAHIAELYGQPGQYLMPVPMMNILNGGEHADNNVDIQEFMIQPVGAESFREALRMGAEVFHALKKVLAARGLATAVGDEGGFAPNLDSNAEALAVIKQAVADAGYELGRDVTLALDCASSEFYRDGQYALSGEGKSYDAAGFVDYLAELCDQYPIVSIEDGMDESDWDGWQALTDKLGDRVQLVGDDLFVTNTRILKRGVDEGIGNSILIKFNQIGSLSETLDAIRMAQDAGFTAVISHRSGETEDTTIADLAVGTSAGQIKTGSLCRSDRVAKYNRLLVIERDLGERVAYPGRSAIKGQ >NC_014532|3219370:3230398|3222259_3222724_-|WP_013333500.1|DBSCAN-SWA MRETAESTPRDDAIRLLARREYARGELRERLAAKGHAAEDVEACLLELEEQGLQSDARFAESFLRSRVARGQGPIKLRAELGRRGIDDDIARQAFAELASEVDWFELAAEVLARRFTGPGETPRERARRERFLAGRGFDFEQLRHALTHAWDDI >NC_014532|3219370:3230398|3219370_3221977_-|WP_013333499.1|tRNA|DBSCAN-SWA 
MKSADIRQAFLSYFEERGHSIVPSSSLVPDNDPTLLFTNAGMVPFKDVFLGRDPRPYVRATSAQRCVRAGGKHNDLDNVGYTARHHTFFEMLGNFSFGDYFKRDAIHFAWNFLTEVLGLPREKLWVTVHVSDDEAERIWKDEVGIDPERFSRLDEDNFWQMGDTGPCGPSSEVFFDHGPEVWGGPPGSPEEDGDRYIEVWNLVFMQYDRDAAGNMNPLPKPSIDTGMGLERIAAVMQGVHSNYEIDLFQNLLAAAAEATGHDDTSAPSLRVIADHIRSCAFLVADGVLPSNEGRGYVLRRIIRRAVRHGHKLGAQGCFFHKLVAALDAEMGEAYPELREARDQIERVLLKEEEQFARTLEHGMGLLGEALDALEGDTLPGETVFKLYDTYGFPFDLTADVCRERGVSLDEAGFERELEAQRERARAASQFGVDYGAALELEGQTDFTGYDRLEDRATVVALVDEQGNALARLEAEQRGVVVLDRTPFYGESGGQVGDTGYLELEGGRFLVEDTQKQAGHHLHQGRLLEGELEVGASVTPRVDPSLRAATVRNHSATHLLHQALRMVLGDHVQQKGSLVTPERLRFDFSHFEPMTAEQLAEVERLVNEQVLANAPTRIEHMTLDEAKEKGAAALFEAKYADSVRVLTIGADDFSIELCGGTHVARSGDIGCCHIVAEAGIASGVRRIEAITGEGALAYFREQEARVARIGERLKAKPEQVEERVTSLVERNRELEKELERLKAKLASAAGSDMLGEAHEVKGIKVLAKRLEGVSGKELRGLLDQLKNKLGSGVVVLGVGDEQAGKVSLIAGVTDDLTGRIKAGELVNHVASQVGGKGGGRPDMAQAGGSDVPALPGALDSVPVWVESKL >NC_014532|3219370:3230398|3222727_3223798_-|WP_013333501.1|DBSCAN-SWA MAQDENRSKALNAALSQIDRQFGKGTVMRLGDTPRVAMPSVSTGSLGLDIALGIGGLPYGRVVEIFGPESSGKTTLTLAVIAEAQKQGKTCAFIDAEHALDPSYAEKLGVNLDDLLVSQPDTGEQALEICDMLVRSGGVDVIIIDSVAALTPRAEIEGEMGDSHVGLQARLMSQALRKVTGHMKNANCMVVFINQIRMKIGVMFGSPETTTGGNALKFYSSVRLDIRRTGSVKQGDEVVGNETRVKVVKNKVAPPFRQAEFQILYGKGIYHAGEVVDLGVQCGLVDKAGAWYSYQGNKIGQGKANAAQFLEDNPAVMEEIESQIRAQLLATPEPREEESEAEAPIGEGSEREDGLL >NC_014532|3219370:3230398|3224598_3227184_+|WP_013333503.1|DBSCAN-SWA 
MSQASAQHTPMMAQYLKIKREHPDVLLFYRMGDFYELFFDDAKRAAALLDITLTQRGQSAGKPIPMAGVPYHSAEGYLARLVAAGESVAICEQMGDPNTTKGPVERKVVRIVTPGTLYDEALLDARRDNLVVALHAAGEHWGLAWLELSSGRFSVLEVEGEAEAMAELTRLDPAELLVAESLTLPPGLEAHRGLRRQNDWLFDLESATRTLCDQFAVQDLRGFGCAHLEAAITAAGVLIDYARDTQRSQLPHVTAIGVESRDDAVVIDAASRRNLEIGVNLGGGTDNTLASVLDTTSTAMGSRLLKRWLNRPLRDRAQVSGRQAAVAALLDGDGFVAPREALKAIGDVERILARVALYSARPRDLARLRDALNALPELEAELARFDEGTALDDLKRRIHPYPTLADTLSRALMENPPVVIRDGGVIAEGFDAELDEHRGLAEHAGDYLIELETRERERTGLPGLKVGYNRVHGYYIEIPRAQARDAPVDYIRRQTLKNAERFIIPELKEFEDKALSAKSRALAREKLLYDGLLDDLNAELSTLQATGQALAALDVLATLAERARALDFVRPELPETAGFSIRGGRHPVVEQVSETPFVPNDLNMDDERRMLVITGPNMGGKSTYMRQAALITLLAHTGSFVPADAASIGPVDRIFTRIGSSDDLAGGRSTFMVEMTETANILHNATDHSLVLMDEIGRGTSTFDGLSLAWASAEQLTRSRAFTLFATHYFEMTALAEQASGVANVHLTATEHKEGIVFMHRVEDGPASQSYGLQVAQLAGVPQAVIARAREKLASLEQQEIDQGQRSLSTDDSAASGSAPQQADLFASAPHPMLESLGSLGLDDMTPRQALEWLYHWKERL |
8 | Klosneuvirus(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_5 |
3739706 : 3748742
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_014532|3739706:3748742|DBSCAN-SWA ATCAGGCCACAGTGGCGAAGACGAAAGCGTCCGGCTCGTGGTAGCCCGGCAACGGCGCAGACTGCATCGACAGGAAGCGCACGCCCTCATCGTCCTCGATCCAGCTCTTGGGATAACGATCAACATCGAACAGCGCGCCTTCGATCGCCCGCACGTCCTGGATCGCGCCATAGAGCATGGAACAACGACTGGTGGTCGGGCCGACAACAAGCCCACCCGCCGGAATCATTGGCTGCTCGACTCCATTGTCATCGAGGTACCATTCCTCGTAGGTGTAGAGGTCGATGCCGGGATCGCGCAGATAGCCCAGATAAGTCACACCGTCAGGCAGTTCCTCGGGGCGAATCATGCCCAGGTCGATACGACGCGTATCGAGCTTCTTCCGGAAGTCCTCGTTGTCGAGCATGGCGTCAGCCGCCTCCACACTGAGCACCGCCGCGGCACCGGTGCGGCCGCTGTCCTTGGCAATGAGGCGCTTGTACTTGCGCAGGTTGCCAATCGGATCGCCGCCGGCGTCCGTCCAGGGAGTTGCCTCGGTGATGATGTGGCTGGCGTCCATCTGGAAATCGACGACATCGTCGACACCATCCCCCACCACGTTCACCTGGCCGCTGGTCAGGGCTTGGGCCACCATCCATTCTTCGCGGCGACTGATCTGGTCGTCCAGTTCGTCGAAGTCACGCGCCATTTGCTCACCGGCACGCTGCAGAGGCGTGCGCGCAGAATAGAGGGCCTCCCCCGGTGACCGCTGGTTGAGCAGCAGGCCTGCCGTGGTTTCGAGCTTGGGTTTCACATAGGACGGCGTGTAGGAACGCATCACGCTCCCGGCGCGGTCCACGATGTGGCCCGGTCGATTAGGACGAACGAAGGGCGCCATCTTGCGCTTGCCCTTGATAATGTCGATGTCGACATGCTGGGTCGTGAAGTTGCGCGAGGCGGCACCGAAGAACAGCGTGGTGAGGAAGCGTCGGGAGCGCTTCATCTTCTCCACGGCCTCCAGCATCGTGCGGGTATCAAACAGGTCTACAGACATGAGAGGTTTCCTCGTCGGAAATGTCAGGGAGACAAAGGCTGAGACAGGCCTTGGCCGAGCCGATTAGTCGACGAACAGAGAAAGGGGGCGCAGAGCCTCGCGCACGCTGGCGATGGTGTGGCCGGTACCGATCGTCAGCTTCTCGCCACGGAGGTCCCCGCTGATCTGTACCTCGGCCTCGACGGCACCGCCAGAAGCATCGACGTCGTCGAATAGCACGACGGAGGGCGATTCACTGCCGTCACCGGCAGCCGACGCCGAGAGCTTGTACTCGTTGCTCGCCGTTACTTTTCCGAGTACGGCACCGGCCGGCTGCACCTGGCCGGCTTCGATGGTTACGGTCATGTAGCGACGCGGGAAATTGCCACCGCTGATGCCGTTACGGTCGGGATGATTGGTCTGGGTCATGCCGGGCATGGTCAGGATCCTCTATCGTGATGGGACTGGGCGCCTTAGTCCTTCCAGGCGCCGGTGATGGCAGAGACCGCGGACTTCCGCTCTTCGCCCTCTTTGTCATCCGCGGGAGGCTGGGCGGTGCTGGTCTGCGTACCGTCGGCAGCGATGTCGGACAGAGAAAGGCCCCGGTCCTGAGCGGCTCGGAACAACTTGAGGCCGGTCGCTTCCACGCTCGCGCCTTCATCGATGGCCGCCGCCACCTCCTTCTCGAAGCCCGGCGAAGCCAGTTCGAGAATGCCCTTACAGCGCGCACGCTCGGCCTCGGTGGCCTCGGTCTTGAGCTTGTCGGTATCGACCGCTTCGGGCTCGGCGATCTTGATCGACTTCGGGTCGGTGCCGGCCTCGATCGCAGCCTGCAGCTCGGCCGTGGTCTTAACAGTGGTCATGATGGTGCTCCTCGGTTGGCTGCTGGCGGATGTGCCGGCCAGCTCGGCGA
TGACGGATTCCAGGGAGCCAAGGCGGTCGGCCATGCCGGCCTCCACGGCGAGGGCGCCAGTGGCGACGCCTCCCTGACGGAACCGATCGTTGACCTCATCGCGTGGGAGGTCCCGGTTGCGGGCGACCTTGTCCAGGAACACGGCGGCCAACTCGTCGGTGCGCGCCTGCAACTGGGCACGGCCGGCCTCAGTCTCGGGGTCGGGCCGTTTGTTGGGTGCGTTGCTGGAAACGATCTCGTAGCTCTTCTCGCCAGGACGGTCCTCGCGCTTGCGCAGGCTGAGCACCACGCCAACGCTGCCCAGCTGGGCGGTGTCGTCGACGATCACCTCGTCGGCAGCACTGCCAACCCAGTAGAGCGCACTGGCCATCTGGCCGCCGGCATAGGCTTTGATCGGCTTCTGCCCTCGGGCGTTGTAGATCAGGTCGCTCAGTTCGTTGATGCCGGTCGCCTCACCGCCAGGACTGTCGATGTTGAGCACGATGCCCTGGATGCTCGGATCATCGAGTGCGGTCTGGATGTCCTTGACCAGCATCTCGGTGCTGACAGCGCCGCTGATCTCAGTGAACAGGTTGGCGTATCGGAACACCGGGCCTGTGATCGGGATCACGGCCACGCCATCGCGCACCGTCACATTGCGGGTGTTGTCCAGAGGTCGACCGAGGCGTGCCTCCAGGGCCTCGACATCACCCTGGCGATCGGCGACGGCAAGTAGGGTGTCCAGCGCCTCGGCGGTCATCAGCCAGGTACGACTGGCCGCCAGCTCGAAGGCGGTGCGGGGCATGGGCGTCTCCTACGAAAAGACCCGCCGAAGTGGCGGGTCAGTCGGTGTCATCATTCTCTCGATTGGGATCCGGCGGCTCAGGATCATGCACCTTGCCGCCGACATAGATCGGCACGCCGTTCTCCCGCTTCCAGCGGATCTCCCGGGCGCGGTCCTGGGCGACGTCGTGCCAGTCTTCACCGTGAAACGCCATGGTTTCCAGGTGCTCATTACTGGTGCCATTGGCGATACGTTCGGTTGCTGCCCTGGCGTCGACCTGCTCATTGAGCGAGCCCAGTGGCTCACCAATCCACAGCGCACGCACGTAGGCACGGCGCTTGGCTGGGTCGCGATAACCAGGCGCCTTGATGCGTCCGCTGGCCACCAGTTCATCCACCACCAGTTCATAGGCCGGCTGGCAGAACTGCACGGTCAGGTGGTGCCGGCGCTGCTTGATGAACTTCCAGAGCTGGTTGAAAGCAGCCCGAGCCGCGGTGTAACTAGTACTGAAGTGCATCATCAGCACTTCGGACGGCTGCTCCAGGGCGGCGCCAATCTCCTTGACGATGGCCATGAAGAACGGGTCGAACTGTGCGTTCGGCCGGTTTGGGCTGATCGATACCGGCTCTGCGCCTTCCTCGAGATCCCAGACGGCCCCCTCGCCCAGGGTCAGGTTGTCACCGTCGGATGTGTCGTCTTGATTCGAGGTGACGACCGGACGCTCGGGTTTGCTCGGGTCGTCACTCTGCTCATCCCACATCGTGGCACCGCCGAGACTCTGGTCACCCTCGGAAGCGTCGTGCTTGATAGCTACGGTGAACATCGCACTGATCACCGCCGCCGTCAGTTCCGCCTGGCTGAAGCGCTCGAGTTTCTGCAGTGCCTCGAGGATCGGGGCCAGATACGGAACACCACGTACCTGGCCCGGACGCCCCTTTTCGTTCATCAGGTGCAAGATGCGCCGACGCCCGGTGCGTTCACCGAATACCGGCACCCATGTCCAGTCCTGATCGGTGGTGTGGTCGCTGGGGTAACCGTCACAGATCCTGACGGCTGTCGGCCGACCGACCGAGTCAACGCGTATGCCGTCCACTTCATTGCGACGCTCCGGACCAGTGAGCGGCGACCCCACTCTTTCCGCCTCGACCAGCTGCAGCTTGGTACCGAAGAGCCCGCCGATTTGGCGCACATTGGGCGTCAGGCCAAAGACGTCGCCGCTGACCAGGGCGCTGATGAA
TGCCAGGCGTTGGAGCATGTAGAAATCCTGCCCAGCCTCGGCATCGCACTCAGCAGGATCCTCGGCCCACAGGCGAAAGCCCCGGGCCAACTCGTCGTTAAGCTGTGCGGTCTCTTCCTCGGTGAGCCCCAGGGCCTGGCCGTCGACATTCGGCCGGACGGTCAGGCCCATACCGACCACATTGGTGGCTGCGCGGTTGATGGCCGCACGCCCCAGCATGTGATTGCGGTAGGCATCACGGGAGCGACTGATCAGCGTTTCACGCTCGCCCGTCGGGGTATCCTGGCGCGGACTTCCGAGCCCCGGCAACCAGCTCATCATCGAGCGGAGCATGCGACTCGCGCCACGGTGTCGGGTTTCGCTGCCAGTGTTGGCACGAGGTGTCGGTGCCGTTTCAGCTAGTCGCCGCACCTCGTCACGAGCAGCCTGTTCACGCACATCGTTCAGGCTGCGCGACGGAAACAGTGACTTCAGCATGATCAAAACCCGATATAACGAACACGGCTACGACCACCCTTGCCACGGGCAGCAGCCTGTTCCTGGTTAGCGAGCTTGGCGTAGCGCTTTTCCATGTCGTACAGGGTGCTGAGGGTGGCGCGGGTATACTGCTTGTTCCCCAGCATCCAGCTCTGCGAACCGGCCAGGATCTGGTCGATGGCCTCGCGTACCTTGCGCAGGCGCTCGGTATAGGTCTCGGTGCTCATGCGCTGTGGTTCCTGTTCATTCGCCGCCGCCGCCGGCGACCAGACACACCGACCGGGGCCGCAGTCCCAGCCGGGGCATCAAAGAGTGTTGGTTGCGTCAGAGTCTGTTCGAGCGCATCCCAGTCACGCTCGCGCATCACGTGCGTCCGGCTGGAGCGTGCCGCGTGGAGCGCATAAACCTCGCAGTCCAGCGCCTCGTTGTGGACGCCGGATTTTTTCTGCCATACGCGCTTCCGCGGATTGCGTGGATGTGGCGCCTTGACCTCGGCGGTCAGCTGCTGCCAGTAATCGGCGCGAACGCTTCGGTACCAGTGCATCCGTCCAGGGCCTGAGCCATGCAAATGGATGCGGGCCTCGAGGATGAGGTCCTTGGCCTTGTGCGTACCGACGATGAACGGGCGAAGCCCGAACTTGTCGGCTTTGGTGTTGGCCCGGTTGGTGTCGTCGGAGATCTTCGGTCGACTGAAAATCTCGCGATTCTCACTGTTCACCGAGGCTCCCTTGACGGCCATGACCCCATGCCGCTGACGCGCGCGGACGTAGTGGTAGACGGAGTCACTGGTCTGCCCATCCGAGCTGTCGACGCCCACGGCCGCGACACGCAACGTGGCGCCGCTGTCGTGCTCGAACCCGGTAGTCAGCAGGCTGTCCAGCTCATTCCAGACCGGGTCGGCCTTGTCGATGGGATTGCCATAGAGCTCACCCCAGTAGACCAGCCAGCTCTCCTCGCCGCGTCCCCAAGCGCGAATGATCACCGCCAGACGATCGTGTTGGACGTCGACTCCGGCCGTGAGCAGCAACCCTCCACGCGGGACCGTCTTCTCGGCGTAGTCTTCGGCCCGATCCTGCAGCTCGTCGACCTCGGGCGTGTCCCCCTTGAACTCGTAGGGCAAGCCCATCGAGCTGTTGGTGAACACGATCAAGTCGCTGAAGTCGCCCTCGTCCGCCGCGTGCTTGGCCGACAGCCACTTCTCCATGAGTCGCGCGAAACGCGAGTCCGGGAAGGTACTGAGCAACTCATTCATGTAATAGCCGGCAATGCCCCGAAACTGGGCCATGGCCACCCATTCCCCGCGGCGCAGATTGGCGTTTTTCTGCTGATCGTTCCATTCGCCGCCGCAATGCGGGCACGCGTAGATGGTGCGCTCCGGTTGATAACTGCCGTACACCGGGTGCCGTTGGTCCGGGTCCGTCGGGCAGACCAGGTTGTCGAACGACAGTTCGTGGTGTTCGCCACAGTGATGGCACGGCACCATGCAGCGGCGTTGATCGCTATGCTGCATCTCGGCTTCGATC
GACGAGATGCCGGCGATGGTCGGCGTGCCTCCGATGATGTACTTCTTCCGGCCTCGACCGTAGGTTTTCCCCCGTTCCTTGAGTAGCAGAATCGAGTCGCCTTGCCCTTTCAGGTTGAGGTTGCAGTCATCTGGCTCTTCGACGAAGCCTCGAGGACTCGGCGTCGACTTCACCGATGCCGGCGAGTTCGAGCCGACCAGCTTCAGAAAGCCGCCAGGGAACCGCTTGAACTGCTGGCGTTGCTGTAACTTGCGCGAACGCAGGTCGATCTTGTTGCGTAGCCGAGGTGTGGCCTCGACCATGGGTTCGAATTTCTCGGCGACGTATTCCTTGGCCGCCCCATCCTTGGGGAACAACCCGATAACCGGTGAAGGGTCGACGTCGATCCAGCGCCCCAGCGCATTACCCAGCACGCCGGATGTCCATGCCACCTGGGCTGATTTCTGGCAGCACACCTCTTCGACATTCGGGTCGTCCAGCGCTTCGAGCGGACCACCTGGGAGTGCCAACGCCGGCGTCACGTGGATGCTGTACTTGCCAGGCCTGGCCGTCTCGACCTCGCTCATCCAGCGATGCTTGTTCGCCCACTCCAGGGTGGTGATGTTCTCGGGCGGGGCAAACTGCCGAGCCAGATTACGAGTCCAGCGCTTCGCGTTCCTCTTCAAGCGTGCTCGCATCTTCGTCTCCAGACGGTTCGCCTGGCTCGATGTCATCGCTTGGGTCATAGTGGCTCAGTGTCGTCAGAGCGGCTTCGACGTGACGACGGATGATCGACACGTCGACCTCGTCGCCCATGATGGCGGTCAGTTCCGCCGCCAGTGCGTCGGGCATGTTGAACAGCATCTCGGCGCGCGCGGCCTCGACCATGGCGGCGTACTCAACATCCAGGTCTTCCGGCATGACCAGAACGGCATCTTCCTTGAGCATGTCGCGTTCGAGCTGGTCGCCACGCAAACGGTCCAGTCGATCCTTGGCCGACTCTCGGCTCGCCTCGGCCGCCCGAGCCAGCATCCACTCAATGGCCTCCCCGACGCTGTACCGGTTGCTACCGCCACGACCGGCAGAGTGCAGCACCGGCATCCCGTCCTTCTGCCAGTCGGTCAGCGACCGCTCGGAGATTCCGAGTATCTCGGCCAGGTTTCGCTTGTTGGCTTCATCGGGATAGCCAGCTTCACGCCAGTTGGTCATCGCCTGGCAGCAAGCCTGGAGCAGTTCCGATGCCGTCATGGGAGATCAACTCGCTGAAAAGTAAGGAAGCCTTGGCGAAAATTCAGCTGCACGAATTCTGCGCTGCTGCGGCCCCGTATAGGCCTGTCCCCTTCCCGGGAGGACCCGTGCCACGATCCGCGCCCGATGAGAGACCGCGCTCGACCCTGCAGCCAGCTGTAGATACGATTAGTTATCATCCATACGCTACGGAAGGCGCAACCTATGCAAGACGGCAACAAATTCGATCCCAGATCCTGGGTACTCTCCAATCTCAGCCTGATCGAACGAGCCTGGGAAGCCCAGTGGGCACTACGGCTGACTCAGGCCGTACTGTTCTTCGATATCGCGATGCGCCTAACCGGTTCACCCGGACTAGTGGACTGGAACTCATCTACACAGACCATCACAGGTCATCTCGGCTTCATCCTAATTACCTTCGCAGCCTTCACTCTCAGCATGTCCATCGTTGTTCCGATCACGATGCCAATCCTCGAGCTACTGCTTTTGATCCCAGGTATCAAATGGCTCTTCACCACCGATGATGAGACATACCATCGACCGGGACGGCCTTACCGATGTGTATCAATCAGCGATCTAGAGAACGAAGCCGGTCGCCGAAATGATCCAGCACTGTATGAGTTGGCCGAAGCGGAAAGCGAAAAACGCACATCACAATCCAGAGCTCAGTTCGGGTTGATAGTAACCCTGACCGGGACGCTGGTACTGACAACCCTTAACGCGTTTCCCTTCCTTATCACTATTCCAGGAGGCACGATCCTCAATGAGATGAT
GAATTACCTCGGTTATTACAAGGCCGGTTTCGTGATAGGAATTCTCATCGCTAGCATGCTTGGCATAATACTCAACACAATGATTAATCAACATCGCCAGCACTGGGTTCGACATCCGCAATTACATAAAGAACTTGAGGCTAAACGACGAGCCGAACTCGGGGAAGGACTTGCCTTCCGCAACCGATTCACCGAGCAATGAACCAGGGAACAACACGTTATTCTCTCTCAAGCTCGACACCCACCAACTCCACCACGGCAGCGCGGTCCGCGTTGGCCCGCCGCCGCAATGACTCATAGTCAGCGAGTAGCTGCAGCAGATCCCGATTGTGGGTCATCTCCGACCGGCTCGGGGCCGGCAGTGGCTCCACCAGATGCGGCGGCACCTCCGGCGTCACTATCACCGGAACCTTCACCGTCTTCGGCTGGGAGCCGGCGCAGCCACTCACGAGCAGCAGAAGGCACGAACTGATCAGCCCAATCAGCAGTCTCGGCATCGGTACGCTCCAGGTCTCGGGCTGCCTGCCGCATGAGGTCGACGATCTCGCCCTCACGCTGCAGCCGTTCATCGCGATCTCGCAGGGCAGCGGTCATCTGTTCCATCTGCCGCTTCTGCCACTGCTGGTGAGCCAGCAGGATCTCCGCCCGCTCCTGGGCGCGGGTCAGCTGCGCCTCGGTCTCGCTGAGTTGCGCGGCATACTGCCGGGCCTGCATGCCGGCATAGACCGCCACGCCGATCAGCACACCCAGCAGCCAAGGCGTGGCGCTACGGAGCAACGCAATCATTTGCGCCCCCGGAGCAAGAGCGTCTCGAGGATTGTCGCCGCATGGCCCCGAATCCACTCGGTACCGAGGAAGCCGACTGCTGCCCCAACGGTAACGGCCAGCTTCTGGTTCAACCCGAAATACTCCAGCACCGGCACCAGGGCCAGCGTCAGGCAGCCGCACAGCACGGCTTCGAGGATCGACTTGATGGGCTTGCCGCCGCCCTGCAGGCCACGCACCAAAGCCACAATGCACGCCATGCCGGCGGCATAGATCTGCGGCCAAACACTCGCCAACAGCCCGATCAGGGCCTGCCAGTTGTTGGGGTCCTTATCGGGGCCAGGCAT
Protein sequences of DBSCAN-SWA_5 >NC_014532|3739706:3748742|3742451_3744107_-|WP_013333940.1|portal|DBSCAN-SWA MLKSLFPSRSLNDVREQAARDEVRRLAETAPTPRANTGSETRHRGASRMLRSMMSWLPGLGSPRQDTPTGERETLISRSRDAYRNHMLGRAAINRAATNVVGMGLTVRPNVDGQALGLTEEETAQLNDELARGFRLWAEDPAECDAEAGQDFYMLQRLAFISALVSGDVFGLTPNVRQIGGLFGTKLQLVEAERVGSPLTGPERRNEVDGIRVDSVGRPTAVRICDGYPSDHTTDQDWTWVPVFGERTGRRRILHLMNEKGRPGQVRGVPYLAPILEALQKLERFSQAELTAAVISAMFTVAIKHDASEGDQSLGGATMWDEQSDDPSKPERPVVTSNQDDTSDGDNLTLGEGAVWDLEEGAEPVSISPNRPNAQFDPFFMAIVKEIGAALEQPSEVLMMHFSTSYTAARAAFNQLWKFIKQRRHHLTVQFCQPAYELVVDELVASGRIKAPGYRDPAKRRAYVRALWIGEPLGSLNEQVDARAATERIANGTSNEHLETMAFHGEDWHDVAQDRAREIRWKRENGVPIYVGGKVHDPEPPDPNRENDDTD >NC_014532|3739706:3748742|3747061_3747832_+|WP_041602252.1|DBSCAN-SWA MQDGNKFDPRSWVLSNLSLIERAWEAQWALRLTQAVLFFDIAMRLTGSPGLVDWNSSTQTITGHLGFILITFAAFTLSMSIVVPITMPILELLLLIPGIKWLFTTDDETYHRPGRPYRCVSISDLENEAGRRNDPALYELAEAESEKRTSQSRAQFGLIVTLTGTLVLTTLNAFPFLITIPGGTILNEMMNYLGYYKAGFVIGILIASMLGIILNTMINQHRQHWVRHPQLHKELEAKRRAELGEGLAFRNRFTEQ >NC_014532|3739706:3748742|3748412_3748742_-|WP_041602253.1|holin|DBSCAN-SWA MPGPDKDPNNWQALIGLLASVWPQIYAAGMACIVALVRGLQGGGKPIKSILEAVLCGCLTLALVPVLEYFGLNQKLAVTVGAAVGFLGTEWIRGHAATILETLLLRGRK >NC_014532|3739706:3748742|3747930_3748416_-|WP_013333944.1|DBSCAN-SWA MIALLRSATPWLLGVLIGVAVYAGMQARQYAAQLSETEAQLTRAQERAEILLAHQQWQKRQMEQMTAALRDRDERLQREGEIVDLMRQAARDLERTDAETADWADQFVPSAAREWLRRLPAEDGEGSGDSDAGGAAASGGATAGPEPVGDDPQSGSAAATR >NC_014532|3739706:3748742|3746263_3746857_-|WP_013333943.1|terminase|DBSCAN-SWA MTASELLQACCQAMTNWREAGYPDEANKRNLAEILGISERSLTDWQKDGMPVLHSAGRGGSNRYSVGEAIEWMLARAAEASRESAKDRLDRLRGDQLERDMLKEDAVLVMPEDLDVEYAAMVEAARAEMLFNMPDALAAELTAIMGDEVDVSIIRRHVEAALTTLSHYDPSDDIEPGEPSGDEDASTLEEEREALDS >NC_014532|3739706:3748742|3744330_3746232_-|WP_174208864.1|terminase|DBSCAN-SWA 
MTTLEWANKHRWMSEVETARPGKYSIHVTPALALPGGPLEALDDPNVEEVCCQKSAQVAWTSGVLGNALGRWIDVDPSPVIGLFPKDGAAKEYVAEKFEPMVEATPRLRNKIDLRSRKLQQRQQFKRFPGGFLKLVGSNSPASVKSTPSPRGFVEEPDDCNLNLKGQGDSILLLKERGKTYGRGRKKYIIGGTPTIAGISSIEAEMQHSDQRRCMVPCHHCGEHHELSFDNLVCPTDPDQRHPVYGSYQPERTIYACPHCGGEWNDQQKNANLRRGEWVAMAQFRGIAGYYMNELLSTFPDSRFARLMEKWLSAKHAADEGDFSDLIVFTNSSMGLPYEFKGDTPEVDELQDRAEDYAEKTVPRGGLLLTAGVDVQHDRLAVIIRAWGRGEESWLVYWGELYGNPIDKADPVWNELDSLLTTGFEHDSGATLRVAAVGVDSSDGQTSDSVYHYVRARQRHGVMAVKGASVNSENREIFSRPKISDDTNRANTKADKFGLRPFIVGTHKAKDLILEARIHLHGSGPGRMHWYRSVRADYWQQLTAEVKAPHPRNPRKRVWQKKSGVHNEALDCEVYALHAARSSRTHVMRERDWDALEQTLTQPTLFDAPAGTAAPVGVSGRRRRRRMNRNHSA >NC_014532|3739706:3748742|3740801_3741155_-|WP_013333938.1|head|DBSCAN-SWA MPGMTQTNHPDRNGISGGNFPRRYMTVTIEAGQVQPAGAVLGKVTASNEYKLSASAAGDGSESPSVVLFDDVDASGGAVEAEVQISGDLRGEKLTIGTGHTIASVREALRPLSLFVD >NC_014532|3739706:3748742|3744109_3744334_-|WP_013333941.1|DBSCAN-SWA MSTETYTERLRKVREAIDQILAGSQSWMLGNKQYTRATLSTLYDMEKRYAKLANQEQAAARGKGGRSRVRYIGF >NC_014532|3739706:3748742|3741190_3742414_-|WP_013333939.1|DBSCAN-SWA MPRTAFELAASRTWLMTAEALDTLLAVADRQGDVEALEARLGRPLDNTRNVTVRDGVAVIPITGPVFRYANLFTEISGAVSTEMLVKDIQTALDDPSIQGIVLNIDSPGGEATGINELSDLIYNARGQKPIKAYAGGQMASALYWVGSAADEVIVDDTAQLGSVGVVLSLRKREDRPGEKSYEIVSSNAPNKRPDPETEAGRAQLQARTDELAAVFLDKVARNRDLPRDEVNDRFRQGGVATGALAVEAGMADRLGSLESVIAELAGTSASSQPRSTIMTTVKTTAELQAAIEAGTDPKSIKIAEPEAVDTDKLKTEATEAERARCKGILELASPGFEKEVAAAIDEGASVEATGLKLFRAAQDRGLSLSDIAADGTQTSTAQPPADDKEGEERKSAVSAITGAWKD >NC_014532|3739706:3748742|3739706_3740738_-|WP_013333937.1|capsid|DBSCAN-SWA MSVDLFDTRTMLEAVEKMKRSRRFLTTLFFGAASRNFTTQHVDIDIIKGKRKMAPFVRPNRPGHIVDRAGSVMRSYTPSYVKPKLETTAGLLLNQRSPGEALYSARTPLQRAGEQMARDFDELDDQISRREEWMVAQALTSGQVNVVGDGVDDVVDFQMDASHIITEATPWTDAGGDPIGNLRKYKRLIAKDSGRTGAAAVLSVEAADAMLDNEDFRKKLDTRRIDLGMIRPEELPDGVTYLGYLRDPGIDLYTYEEWYLDDNGVEQPMIPAGGLVVGPTTSRCSMLYGAIQDVRAIEGALFDVDRYPKSWIEDDEGVRFLSMQSAPLPGYHEPDAFVFATVA |
10 | Vibrio_phage(33.33%) | portal,terminase,holin,head,capsid | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
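The protein records above share a pipe-delimited header layout. A minimal Python sketch for splitting such a header into its parts follows. The field meanings (contig accession, prophage region `start:end`, gene `start_end_strand`, protein accession, optional keyword tag, tool name) are inferred from the records shown here and are an assumption, not a documented format; `ProteinHeader` and `parse_header` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ProteinHeader:
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    protein_id: str
    keyword: Optional[str]  # e.g. 'portal', 'tRNA'; None when no tag is present
    tool: str


def parse_header(header: str) -> ProteinHeader:
    """Split one header line (assumed layout, see lead-in) into typed fields."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected number of fields: {len(fields)}")
    contig, region, gene, protein_id = fields[:4]
    tool = fields[-1]
    # The keyword tag only appears in six-field headers.
    keyword = fields[4] if len(fields) == 6 else None
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(contig, region_start, region_end,
                         int(gene_start), int(gene_end), strand,
                         protein_id, keyword, tool)


if __name__ == "__main__":
    h = parse_header(
        ">NC_014532|3739706:3748742|3742451_3744107_-|WP_013333940.1|portal|DBSCAN-SWA"
    )
    print(h.protein_id, h.keyword, h.strand, h.gene_end - h.gene_start)
```

Whether the coordinates are 0-based or 1-based is not stated on this page, so the sketch only reports the raw difference between end and start.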
DBSCAN-SWA_6 |
4017529 : 4024835
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NC_014532|4017529:4024835|DBSCAN-SWA CATGTTCATTAAGATTTATAGCTCGCCCGACTGCATGCAATGCCACGCCACCTACCGCGCTCTCGACAAGCAGGGGCTCGACTACACCGTGGTGGATATCAGCGAGGAGCCCGAGGCGGCCGACACCGTTCGCCAGCTCGGTTACCGCCAGCTTCCCGTGGTGGTCGCCGCGGACGAACACTGGTCCGGCTTCCGCCCGGAGCGCATTCGTCAGCTGGCCAGCTGACAACGGAGACGAGCGTGAGCGAACTGGTCTATTTCTCGACGCGATCGGGCAACACGCGACGCTTCGTGGAGAAGCTCGGCTTGCCCGCCCAGCGAATACCGCTGAACCGCAATGAGAAGGCCCTGCGTGTACACCGCCCCTTCGTGCTGGTGGTCCCCACCTATGGTGATGGCGATCCCCGCACGGCGGTGCCGGGTCCGGTGATCCGCTTTCTCAACGATCCGCACAACCGCGCGCTGATCCGAGGCGTGGTGGCGGGCGGCAATACCAACTTCGGCAGCGCTTATGGCCTGGCGGGCCGCGTCATAGCCCACAAGTGCCAGGTGCCGTTGCTGCATCGCTTCGAACTGATGGGAACCCCCGAGGATGTCGCCAAGGTACGCCAGTGCCTGGCCACGGAGATCGACAATGCTGGATGACACCCTGACCCACCCCGACGAGACAGAGGAAACGGCGATGGCCAACGCCGATCATGCGGCGCCGAAAGTACTGGACTTTCATGCGCTTAACGCCATGCTCAACCTCTATGACGCTGACGGCCGCCTGCAACTGGATGCCGACAAGGAAGCGGCACGTCAGTATTTCCTCCAGCACGTCAATCAGAACACGGTGTTCTTCCACACTCTGGAAGAAAAGCTGGATTACCTGGTCGAGGAAGGCTACTACGAGGGCGAGATCCTCGAGCAGTACGACGCGAGCTTCGTCAAGGCGCTGTTCGATCAGGCCTATGCGCTGAAATTCCGCTTCCAGAGCTTCCTCGGCGCCTTCAAGTACTACACCAGCTACACCCTCAAGACCTTCGATGGGCGCCGCTATCTGGAGCGCTTCGAGGACCGCGTCTGCATGGTCGCCCTGACCCTGGCCCGTGGCGACGAAGCACTGGCCCGGACCCTGGTCGACGAGATTCTGCATGGCCGTTTCCAGCCCGCCACGCCGACCTTCCTCAACTGCGGCAAGCAACAGCGCGGCGAGCTGGTCTCGTGCTTCCTGCTGCGGATCGAGGACAACATGGAATCGATCGGCCGCGCCATCAACTCGGCACTGCAGCTCTCCAAGCGCGGCGGCGGAGTGGCCTTTTCGCTCAGCAACATCCGTGAATCCGGCGCACCGATCAAGCGCATCGCCAACCAGTCGTCGGGCGTCATCCCGATCATGAAACTGCTGGAAGACGCCTTCTCGTACGCCAACCAGTTGGGCGCACGCCAGGGGGCCGGGGCGGTCTACCTCAACGTGCACCATCCCGATATCCTGCGCTTTCTCGACACCAAGCGCGAGAATGCCGACGAGAAGATCCGCATCAAGACTCTGTCGCTGGGTGTGGTGATTCCCGACATCACCTTCGAACTGGCCAAGCGCAACGAGCCGATGTACCTGTTCTCGCCCTATGACGTCGAGCGTGTCTATGGCGTTCCCTTCGGCGACATCAGCGTGACCGAGAAGTATCAGGAAATGGTCGACGATGCGCGCATCCGCAAGACCAAGACCGACGCCCGGGAACTCTTCCAGACGCTCGCCGAACTGCAGTTCGAATCCGGCTATCCCTACGTGCTGTTCGAGGATACCGCCAACCGCGACAACCCCATCGCCGGGCGTATCAACATGAGCAACCTCTGCTCGGAAATCCTCCAGGTCAATTCGCCCAGCGAGTACGACGACGACCTGGATTATCGCCATACCGGCGAGGATATTTCCTGCAACC
TGGGTTCGCTCAACATCGCCCGGGTCATGGACTCCCCGGATTTCGGAACCAGCGTCGAGACGGCCGTGCGCGCCTTGAGCGCGGTATCCGAGATGAGCGACATTCGCTCGGTTCCCTCGATCGCCGAGGCCAATGCCCGCTCCCACGCCATCGGCCTGGGACAGATGAACCTGCATGGCTTCCTGGCTCGCGAGCACATTCATTACGGTTCCGAAGAAGGGTTGGACTTCACCAACATCTACTTCTACACCATCACCTATCATGCCGTGCGTGCCTCGAACCGGCTGGCCATCGAGCGTGGCGAGCGCTTCGCCGGCTTCGAGAACTCCGCCTACGCCAGCGGCAGCTATTTCGACAAGTACACCGAAAACGAATGGCAGCCCCGGACACAGCGGGTCCGCGAGCTGTTCGACAATGCCGGTATCCAGATCCCCACGCGCGAGGACTGGCGCGAGCTCAAGGCCTCGGTGATGCAATATGGCCTGTTCAACCGCAACCTGCAGGCGGTACCGCCGACCGGCTCGATTTCCTACATCAATCACGCCACCTCGAGCATTCACCCGGTGGTGTCGAAGATCGAGATCCGCAAGGAAGGCAAGCTGGGCCGCGTCTACTATCCGGCGCCCTACTTGAACGATGACAACCAGGACTACTATCGCGACGCTTACGAGATCGGTCCCGAGAAGATCATCGACACCTATGCCGAAGCCACCCAGCACGTCGACCAGGGCCTGTCGTTGACCCTGTTCTTCCGCGATACCGCCACCACGCGGGACATCAATCGCGCCCAGATCTATGCCTGGCGCAAGGGCATCAAGACGATCTATTACGTGCGGCTGCGCCAGGCGGCACTGGAGGGAACCGAAGTCGAAGGGTGTGTGTCCTGCACCCTCTAAGAGAGTATTTCCAGGCTCTCGATAACGGTTGGCGCCGGCGACAAGATGGAGCGAGGGTCATCCGCCAGGAATGGCGGATGTAGCGCCCAGGGATGGGGTTACAGCTCCCTCGCGAAGACTTGTCGCCGGTAAGCCGCCTCCTGCAGCAACGGTTTTTTACTTATTTCGGAGCGCCACCACGCCCCGCATGACAACCGGATGAGATGAACGCCATGACCGCTCGACTGAAACGCGTCGATGCCATCAACTGGAACCGCCTCCAGGACGACAAGGATCTCGAGGTGTGGAACCGGCTCACCAGCAACTTCTGGCTGCCGGAAAAGATTCCACTGTCCAATGATGTCCCATCGTGGAACACCCTCAACGACCGCGAGCAGCAACTGACCATTCGGGTCTTCACCGGCCTGACGCTGCTCGACACCATCCAGGGCACGATCGGTGCGCCGGCACTGATCGAGGATGCGGTCACGCCCCACGAAGAAGCCGTGTTCACCAACATCAGCTTCATGGAATCGGTGCACGCACGCTCCTACAGCTCGATCTTCTCGACGCTGTGCGCCACCCGCGATGTCGATGACGCCTACCGCTGGAGCGAGGAGAACACCCACCTGCAGAACAAGGCAGAGCTGATCCTCGAGCGTTACCGCACAGACGACCCGCTGATGCGCAAGGTCGCCAGCGTGTTCCTCGAGTCCTTCCTGTTCTATTCGGGGTTCTACCTGCCGATGTACTGGTCGAGCCGCGGCAAGCTGACCAATACCGCCGACCTGATTCGCCTGATCATCCGAGACGAGGCAGTGCACGGCTACTACATCGGCTACAAGTTCCAGCGTGCCCTGGAGAAGGAATCCGCCGAGCGCCAGCGCCAGATCAAGGACACTACCTACGACCTGCTGCTCGACCTCTACGACAACGAGGTCCAGTACACCGAGTCGCTGTACGACGAAGTCGGCCTGACCGAGGACGTCAAGGCATTCCTGCACTACAACGCCAACAAGGCGTTGATGAACCTGGGTTACGAGGCGCTGTTCCCGAACGCGACCGAAGCCGTCGACCCGGCCATCCTGGCCGCCCTCTCGCCGGGCGCCGAGGAGAACCACGACT
TCTTCTCGGGGTCGGGCTCTTCCTATGTGATCGGCCGTACCGAGCGGACGGAAGACGAGGACTGGGCCTTCTGAGCCTAGCCCATTGGCTCCATACCAATGCGACAACGCCGGGGCCCAGGCCCCGGCGTTATCTTGTGGATATCCCCAAAGGGCTGCGGCGGATCGCTACTCCACCGTCACCGACTTGGCCAGGTTGCGGGGCTGATCCACATCGGTGCCCTTGAGTACCGCGACATGGTAGGACAGCAGCTGCAGCGGCAAGGTGTAGAGGATCGGCGCCAGCGCCTCCTCGATGGGCGGCAGATGCAGCACCCGGATACCATCGTCGCTGGACAGGCCGACGCCTTCGTCGGCGAAGACGAAAAGCTCGCCACCGCGCGCCCTGACCTCCTGGAGGTTCGATTTCAGCTTGTCGAGCAACTCATCGTTGGGCGCCACCGAGACCACCGGCATGTCGCTGTCGACCAGCGCCAGGGGACCATGCTTGAGCTCTCCCGCCGGATAGGCCTCGGCGTGGATGTAGGAGATTTCCTTGAGCTTGAGCGCCCCCTCCAGCGCCACCGGAAAATGGGCACCACGACCGAGAAAGAGGGCATGATGCTTCTCGGCGAAGGCCGTGGACAGCTCTTCGATCTGCGCATCCAGCGCCAGGACCCGTTCGACCTGGGCGGGAAGCTTTCTCAGGGCATCGACCAGGCTGGCCTGCTCCGCCTCGTCCTGGCCATGAACGCGGCCCATCGCCAGGGTCAGCAACATCAGGGCGGTCAGCTGGGTGGTGAAGGCCTTGGTCGAGGCCACACCGATCTCCGGACCGGCCCGGGTCATCAGCGTCAGGTCGGACTCGCGCGCCAGGGAACTGCCCGGCACGTTGCAGATGGCCAGGCTGCCCAGGTAGCCACCGTCCTTGGAAAAACGCAGTGCCGCCAGGGTATCGGCGGTCTCGCCGGACTGGGATAACGTCACGAAAAGCGTGTTCTCGGGGACCACCACCCGGCGATATCGATACTCGGAGGCGACCTCCACCTGCACCGGTATACCGGCATAACGCTCCAGCCAGTAGCGCGCCACGAGACCGGCATGATAGCTGGTTCCGCAGGCCACCAGATGAATCTGGGCGACCCTGGAGAACAACGCCTCGGCGTCAGGACCGAAACTCTCCACCAGCACCGAACGATCGCCGAGCCGGCCTTCCAGGGTCTCGGCGATGACAGCCGGTTGCTCATGGATCTCCTTGAGCATGAAGTGACGATAGTCACCCTTGCTGGCACTGCCATCGCCATGCTCGAAGGTCTGGACCTCACGCTCGACGACCTGGCCCGAGGCGTCGAAGACCTCAATGCCGCCGCCGGCGGAGAGCCTCACCACATCGCCCTCCTCGAGATAGATGAAGCGATCGGTGACCTGCAGCAGGGCCAGCGGATCCGAGGCCAGGAAGGCCTCCTCGATACCGACCCCGACCACCAGTGGGCTGCCCTTGCGCGCGCCGATCACCACATCCGGCTCGTCGGCATGGATCACGCCCAGGGCATAGGCGCCATCGAGCTTCGCGATCACCGCCTGTACCGCCGTCAGCAGGTCCGCAGTGCGTGCTTCGCGCTCGATCAGGTGCGCGATGACCTCGGTATCGGTCTGGGAAGAGAAATCGTAGCCGGCACTTTCCAGCTCGGCCCGAAGAGGTTCGTGATTCTCGATGATGCCATTGTGCACCACCGCCAGGCGTTCGCCGGACTGGTGGGGATGAGCATTTTCCTCGCTGGGACGTCCATGGGTCGCCCAACGGGTATGGGCGATGCCGCTGTGGCCGGTGAGCGGTGCCGCGTCCAGCTTCTCCTCGAGGGCCGCCACCTTGCCCAGGGCCCGGCGGCGCTGCAGGCGTCCGTCGGTGTCTCGCACGACCATGCCTGCCGAATCATAACCGCGATACTCCAGGCGCAGGAGCCCTTCGCGCAGAATTCCTTGTACGTTGCGCTGGGCAACGGCACCAACGATGCCACACAT
GCCGAATCTCCTCAGTGGTCCTTGCGCTTGACGGGACGCTGCCAGTCAGCCTTGCTGATCGTCTTCGACCGCTCGACCGCCAGGGCATTGTCGGCGACATCGCGATCGATGGTGGAACCGGCCCCCACGGTGGCTCCCTTGCCCACGCTGACCGGCGCCACCAGGGCCGTGTTGGAACCAATGAAGGCCTCGTCGCCGATCTCGGTGCGATGCTTGTTGGCACCATCATAGTTGCAGGTGATGGTGCCCGCGCCCACATTGACGTCCCGCCCGAGACGCGCGTCACCGACATAGCTCAGGTGATTGATCTTGGACCCTTCGCCGACCTCGGCGTTCTTGGTCTCGACGAAATTGCCGACCTTGGCGCCGACCGCCAGGCGGGTCCCCGGACGCAGGCGAGCAAAGGGACCGATCTGATTGTGACCGGCCACCACGGCTCCCTCGATGATGCTATGGGGCTCGATGACCGTCTCGGCGCCGATATGGCTGTCGCGGATCACGCAGTGAGGACCGACGCGCACGCCCTCGCCCAGTTCGACATCGCCCTCGAAGACACAACCGACATCGATCTCGACATCGTGGCCGCACGTGAGGCTGCCTCTTACATCCAGGCGCGACGGATCGCGCAAGGCCACGCCTTGCGTCATCAACGACTCGGCGATATCGGCCTGCAGGGTGCGTTCCAGCCGGGCCATCTGGCGACGGTCGTTGACGCCTTCCACCTCGACCGGACGCGACGGCTGGGCGGTGGCGACCTTCACGCCTTCGGCGGCGGCCATGGCGATGACATCGGTGAGATAATATTCACCCTGGGCATTCTCGGCCGATAGCTGAGGCAGCCAGCGACGCAACTGAGCGGTGGTCATGGCCATGATGCCGGTATTGCACTCGCGTACCGCCCGCTGCTGCTCATCGGCGTCCTTCTGCTCGACGATGGCCACCGCCTCGCCGGCCTCGTCACGCAGGATCCGGCCATAGCCGGTGGGATCTTCCAGGGTCACCGTCAACAGCCCCATGTGATCTTCGTCGACGGGGGCCAGCAGGGCTCTCAGGGTATCCCGCTGGATCAGTGGAACATCGCCGTAGAGTACCAGCACCTTGCCCTCGCCGAGCCCATCCAGGGACTGGGCCACGGCATGGCCGGTGCCCTTCTGCTCGGCCTGCAGGACGAAATTCACCGGATACTCGGCCAGCGCTTCGCGCAACCGTTCGGCACCATGCCCGACCACCACATGGGTCCGCTCGGCCTCGAGGCCCGAGGTGGTATCGAGTATATGGCTCACCATCGGCTTGCCGGCCAGCGGATGCAATACCTTGGGCAACGAAGAGCGCATGCGCGTCCCCTTGCCTGCCGCGAGAATCACCACATCGAGGGTCATCAT
Protein sequences of DBSCAN-SWA_6 >NC_014532|4017529:4024835|4017768_4018173_+|WP_013334186.1|DBSCAN-SWA MSELVYFSTRSGNTRRFVEKLGLPAQRIPLNRNEKALRVHRPFVLVVPTYGDGDPRTAVPGPVIRFLNDPHNRALIRGVVAGGNTNFGSAYGLAGRVIAHKCQVPLLHRFELMGTPEDVAKVRQCLATEIDNAG >NC_014532|4017529:4024835|4018162_4020349_+|WP_013334187.1|DBSCAN-SWA MLDDTLTHPDETEETAMANADHAAPKVLDFHALNAMLNLYDADGRLQLDADKEAARQYFLQHVNQNTVFFHTLEEKLDYLVEEGYYEGEILEQYDASFVKALFDQAYALKFRFQSFLGAFKYYTSYTLKTFDGRRYLERFEDRVCMVALTLARGDEALARTLVDEILHGRFQPATPTFLNCGKQQRGELVSCFLLRIEDNMESIGRAINSALQLSKRGGGVAFSLSNIRESGAPIKRIANQSSGVIPIMKLLEDAFSYANQLGARQGAGAVYLNVHHPDILRFLDTKRENADEKIRIKTLSLGVVIPDITFELAKRNEPMYLFSPYDVERVYGVPFGDISVTEKYQEMVDDARIRKTKTDARELFQTLAELQFESGYPYVLFEDTANRDNPIAGRINMSNLCSEILQVNSPSEYDDDLDYRHTGEDISCNLGSLNIARVMDSPDFGTSVETAVRALSAVSEMSDIRSVPSIAEANARSHAIGLGQMNLHGFLAREHIHYGSEEGLDFTNIYFYTITYHAVRASNRLAIERGERFAGFENSAYASGSYFDKYTENEWQPRTQRVRELFDNAGIQIPTREDWRELKASVMQYGLFNRNLQAVPPTGSISYINHATSSIHPVVSKIEIRKEGKLGRVYYPAPYLNDDNQDYYRDAYEIGPEKIIDTYAEATQHVDQGLSLTLFFRDTATTRDINRAQIYAWRKGIKTIYYVRLRQAALEGTEVEGCVSCTL >NC_014532|4017529:4024835|4017529_4017754_+|WP_013334185.1|DBSCAN-SWA MFIKIYSSPDCMQCHATYRALDKQGLDYTVVDISEEPEAADTVRQLGYRQLPVVVAADEHWSGFRPERIRQLAS >NC_014532|4017529:4024835|4020552_4021527_+|WP_013334188.1|DBSCAN-SWA MNAMTARLKRVDAINWNRLQDDKDLEVWNRLTSNFWLPEKIPLSNDVPSWNTLNDREQQLTIRVFTGLTLLDTIQGTIGAPALIEDAVTPHEEAVFTNISFMESVHARSYSSIFSTLCATRDVDDAYRWSEENTHLQNKAELILERYRTDDPLMRKVASVFLESFLFYSGFYLPMYWSSRGKLTNTADLIRLIIRDEAVHGYYIGYKFQRALEKESAERQRQIKDTTYDLLLDLYDNEVQYTESLYDEVGLTEDVKAFLHYNANKALMNLGYEALFPNATEAVDPAILAALSPGAEENHDFFSGSGSSYVIGRTERTEDEDWAF >NC_014532|4017529:4024835|4023464_4024835_-|WP_041602288.1|DBSCAN-SWA 
MMTLDVVILAAGKGTRMRSSLPKVLHPLAGKPMVSHILDTTSGLEAERTHVVVGHGAERLREALAEYPVNFVLQAEQKGTGHAVAQSLDGLGEGKVLVLYGDVPLIQRDTLRALLAPVDEDHMGLLTVTLEDPTGYGRILRDEAGEAVAIVEQKDADEQQRAVRECNTGIMAMTTAQLRRWLPQLSAENAQGEYYLTDVIAMAAAEGVKVATAQPSRPVEVEGVNDRRQMARLERTLQADIAESLMTQGVALRDPSRLDVRGSLTCGHDVEIDVGCVFEGDVELGEGVRVGPHCVIRDSHIGAETVIEPHSIIEGAVVAGHNQIGPFARLRPGTRLAVGAKVGNFVETKNAEVGEGSKINHLSYVGDARLGRDVNVGAGTITCNYDGANKHRTEIGDEAFIGSNTALVAPVSVGKGATVGAGSTIDRDVADNALAVERSKTISKADWQRPVKRKDH >NC_014532|4017529:4024835|4021620_4023453_-|WP_013334189.1|DBSCAN-SWA MCGIVGAVAQRNVQGILREGLLRLEYRGYDSAGMVVRDTDGRLQRRRALGKVAALEEKLDAAPLTGHSGIAHTRWATHGRPSEENAHPHQSGERLAVVHNGIIENHEPLRAELESAGYDFSSQTDTEVIAHLIEREARTADLLTAVQAVIAKLDGAYALGVIHADEPDVVIGARKGSPLVVGVGIEEAFLASDPLALLQVTDRFIYLEEGDVVRLSAGGGIEVFDASGQVVEREVQTFEHGDGSASKGDYRHFMLKEIHEQPAVIAETLEGRLGDRSVLVESFGPDAEALFSRVAQIHLVACGTSYHAGLVARYWLERYAGIPVQVEVASEYRYRRVVVPENTLFVTLSQSGETADTLAALRFSKDGGYLGSLAICNVPGSSLARESDLTLMTRAGPEIGVASTKAFTTQLTALMLLTLAMGRVHGQDEAEQASLVDALRKLPAQVERVLALDAQIEELSTAFAEKHHALFLGRGAHFPVALEGALKLKEISYIHAEAYPAGELKHGPLALVDSDMPVVSVAPNDELLDKLKSNLQEVRARGGELFVFADEGVGLSSDDGIRVLHLPPIEEALAPILYTLPLQLLSYHVAVLKGTDVDQPRNLAKSVTVE |
6 | Mycobacterium_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
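The keyword legend above can be illustrated with a short Python sketch that checks a protein annotation string against the listed phage-related keywords. The case-insensitive substring match used here is an assumption for illustration only; the page does not say how the tool itself performs the match, and `PHAGE_KEYWORDS` and `matched_keywords` are hypothetical names.

```python
from typing import List

# Keywords copied from the legend; the legend says "such as", so the
# list used by the tool itself may be longer.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation: str) -> List[str]:
    """Return the legend keywords found in an annotation string.

    Assumption: a plain case-insensitive substring test, so 'baseplate'
    matches 'plate' and 'endolysin' matches 'lysin'.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


if __name__ == "__main__":
    for name in ("phage portal protein", "Terminase large subunit",
                 "hypothetical protein"):
        print(name, "->", matched_keywords(name))
```

A substring test is deliberately permissive; a stricter variant could match whole words only, at the cost of missing compound terms.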
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|