Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NC_009446 | Dichelobacter nodosus VCS1703A, complete sequence | 3 crisprs | cas3f,cas8f,cas5f,cas7f,cas6f,DEDDh,cas14j,DinG,cas3,c2c9_V-U4 | 5 | 10 | 9 | 1 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_009446_1 | 179009-179456 | TypeI-F |
I-F
Consensus repeat of NC_009446_1
|
7 spacers
spacers of NC_009446_1
>1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT TATCAAAGAACCAGTCAAGGAACCATGAGTCG >1.2|179097|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT ATTCGCAAACAAAACAGCGAAATTTGGGCGAG >1.3|179157|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT TGTCGAACTAAACGATGACCAGATTTGGTTAA >1.4|179217|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT TATCGCAGCCACAGCGTCGCGCAAGTATTAGC >1.5|179277|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT GCCGCAACATTTCTGGCTCATTTAAATATAAG >1.6|179337|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT GTAAACCATCAAAATAACGTCAAATTGGGTTA >1.7|179397|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT GCTATAGTTATCGAGTCCAGAAAAAATAAAGT |
cas6f,cas7f,cas5f,cas8f |
CRISPR arrays and Neighbor proteins around NC_009446_1
The CRISPR arrays of NC_009446_1 >merge|NC_009446|1|179009-179456|PILER-CR,CRISPRCasFinder,CRT GTTCACCGCCGCACAGGCGGCTTAGAAATATCAAAGAACCAGTCAAGGAACCATGAGTCGGTTCACCGCCGCACAGGCGGCTTAGAAAATTCGCAAACAAAACAGCGAAATTTGGGCGAGGTTCACCGCCGCACAGGCGGCTTAGAAATGTCGAACTAAACGATGACCAGATTTGGTTAAGTTCACCGCCGCACAGGCGGCTTAGAAATATCGCAGCCACAGCGTCGCGCAAGTATTAGCGTTCACCGCCGCACAGGCGGCTTAGAAAGCCGCAACATTTCTGGCTCATTTAAATATAAGGTTCACCGCCGCACAGGCGGCTTAGAAAGTAAACCATCAAAATAACGTCAAATTGGGTTAGTTCACCGCCGCACAGGCGGCTTAGAAAGCTATAGTTATCGAGTCCAGAAAAAATAAAGTGTTCACCGCCGCACAGGCGGCTTAGAAA >NC_009446|1|1|179009-179456|PILER-CR GTTCACCGCCGCACAGGCGGCTTAGAAA TATCAAAGAACCAGTCAAGGAACCATGAGTCG GTTCACCGCCGCACAGGCGGCTTAGAAA ATTCGCAAACAAAACAGCGAAATTTGGGCGAG GTTCACCGCCGCACAGGCGGCTTAGAAA TGTCGAACTAAACGATGACCAGATTTGGTTAA GTTCACCGCCGCACAGGCGGCTTAGAAA TATCGCAGCCACAGCGTCGCGCAAGTATTAGC GTTCACCGCCGCACAGGCGGCTTAGAAA GCCGCAACATTTCTGGCTCATTTAAATATAAG GTTCACCGCCGCACAGGCGGCTTAGAAA GTAAACCATCAAAATAACGTCAAATTGGGTTA GTTCACCGCCGCACAGGCGGCTTAGAAA GCTATAGTTATCGAGTCCAGAAAAAATAAAGT GTTCACCGCCGCACAGGCGGCTTAGAAA >NC_009446|1|1|179009-179456|CRISPRCasFinder GTTCACCGCCGCACAGGCGGCTTAGAAA TATCAAAGAACCAGTCAAGGAACCATGAGTCG GTTCACCGCCGCACAGGCGGCTTAGAAA ATTCGCAAACAAAACAGCGAAATTTGGGCGAG GTTCACCGCCGCACAGGCGGCTTAGAAA TGTCGAACTAAACGATGACCAGATTTGGTTAA GTTCACCGCCGCACAGGCGGCTTAGAAA TATCGCAGCCACAGCGTCGCGCAAGTATTAGC GTTCACCGCCGCACAGGCGGCTTAGAAA GCCGCAACATTTCTGGCTCATTTAAATATAAG GTTCACCGCCGCACAGGCGGCTTAGAAA GTAAACCATCAAAATAACGTCAAATTGGGTTA GTTCACCGCCGCACAGGCGGCTTAGAAA GCTATAGTTATCGAGTCCAGAAAAAATAAAGT GTTCACCGCCGCACAGGCGGCTTAGAAA >NC_009446|1|1|179009-179456|CRT GTTCACCGCCGCACAGGCGGCTTAGAAA TATCAAAGAACCAGTCAAGGAACCATGAGTCG GTTCACCGCCGCACAGGCGGCTTAGAAA ATTCGCAAACAAAACAGCGAAATTTGGGCGAG GTTCACCGCCGCACAGGCGGCTTAGAAA TGTCGAACTAAACGATGACCAGATTTGGTTAA GTTCACCGCCGCACAGGCGGCTTAGAAA TATCGCAGCCACAGCGTCGCGCAAGTATTAGC GTTCACCGCCGCACAGGCGGCTTAGAAA GCCGCAACATTTCTGGCTCATTTAAATATAAG GTTCACCGCCGCACAGGCGGCTTAGAAA GTAAACCATCAAAATAACGTCAAATTGGGTTA GTTCACCGCCGCACAGGCGGCTTAGAAA 
GCTATAGTTATCGAGTCCAGAAAAAATAAAGT GTTCACCGCCGCACAGGCGGCTTAGAAA
>NC_009446.1|WP_011927920.1|178246_178855_+|type-I-F-CRISPR-associated-endoribonuclease-Cas6/Csy4 MNFYQEITLLPDAEVSLYFLWSKVYGQLHIALADVRNRYGIDTIGVNFPHYVYEEQNHKVVAARLGDQLRIFALAENDLEKLQINQWLERLSDYVHIKRISKIEPNKVTGYVVVKRYRYPSLDKVALRFAQFRKINFEEARKHCTKYKHQAKNYPFIMLKSQSNQEYYKLSIRQENAQESVSGRFNVYGINSATGIVTVPNW >NC_009446.1|WP_011927919.1|177208_178219_+|type-I-F-CRISPR-associated-protein-Csy3 MSNKRESSEQKLKTPSLLSFARSIDISDAFFWQTNSKDAKHNRSIVTVQEKSVRGTISNRLKNTSTSDIAKIDAKIEEANLQRVDASSLDEDKDILLVHFTCKILPFTAEPCVCNDQAYQEKLQTVMSKYIEEQGFKELARRYAINIANARWLWRNRVGAESILVTVTLNEKESLTFDACSFELRHFNCHNKQLDQLAQWIESGFKGEFILLSVEARVKIGYGQEVYPSQELILDGKSKKSKVLYLVGDGSDNHAGMHSQKISNAIRTIDDWYPEAEFPIAVEPYGAVTTLGKAFRHPKDKKDFYNLFDNWILKDQVPEVNDQHYVAAVLIRGGVF >NC_009446.1|WP_011927918.1|176242_177193_+|type-I-F-CRISPR-associated-protein-Csy2 MSSIESFIYPDLKRVGFLLIKRLEVINANALSSPLTYGFPAITAFTGAVHALSRKINRSEALADIFLDGVLIAAHSCQPQTYREYFNKPFTFIQSRHPVEKTGDTAPIIQEGYCHLTVSLLIGVYAKDGYLSEEQIEALKKQLFIAIQQQPLAGGNVIGLDAQEPIQFYKDDVDQCVSELLPAFVLIDAHKELTAITQELQKDNPAATALDALIETASLHHIPSDEQENNWEIYSVKKGRGWLVPIPIGYQGISPQYDAGVMKNARNPHYPSQYVEALYSLGKWVFPYSIDLIDNAMWYQKYDAEKDLYLVTHLME >NC_009446.1|WP_011927917.1|174950_176246_+|type-I-F-CRISPR-associated-protein-Csy1 MKSISEITTEELKNAIRAFLSAECEKKTKDSSDIEKMKKYRPDIWLQDAQKKAERFKVGTHISKGIHSQSQGDNVYFSQKVDHDYVNTKTVTNNYLDGGGAASDFPLASFFEWEVITGSGIKMRDVIWENGAAVQRCFADDPELSQTYQQTFLTCLQAQPQNPQTDALNKQLLWALPQTDDNRDNYLVLVPLHPSVLTHEFYHKIEAINNNRFDVKSKSVPQKRYADLLDLAQIKLGGSKPQNISVLTSKQRGINYLLPSLPPVFRARQDIHFSPKLESIFFSKSLYYRVEDDLKILFGVIYCKENNYEIRNMRKAAVHRIAHQILSIGETICALRPAGWSKDYDLSSAQKYWLDPKRADLAGEEKFKAERDAADWDKAIEKDFANWLQKVLEERFKKHRHEFTDIEHYEWQREMKAVIKESFRLNKRGLL >NC_009446.1|WP_011927916.1|173632_174712_-|class-II-fructose-bisphosphate-aldolase 
MTKILDSIKAGVVTGDDVQKIFSIAKQNNFALPAMNCVGTNSVNAALETAARVRAPLIIQFSNGGAAFFAGKGLKPVDGQRPDVLGAIAAAQHIHTLAAAYGVPVILHTDHCAKKLLPWLDGLLDAGEAFYQQHGKPLFSSHMLDLSEEPLKENIEICQRYLERMAKIDMTLEIELGCTGGEEDGVDNTGVDNAMLYTQPEDVAFAYQELRKISPRFTIAAAFGNVHGVYKPGNVKLTPKILDNSQKYVSQKFGLPEKSLDFVFHGGSGSSLSDIREAISYGVVKMNIDTDTQWAAWEGVLEFYRKNEAYLQGQLGNPEGEDKPNKKFYDPRAWLRQSENFISQRLEQAFDDLNCRDVL >NC_009446.1|WP_011927915.1|171192_173331_-|UvrD-helicase-domain-containing-protein MNIDTILSGLNAAQRDAVTTKERIVRVIAGAGSGKTRVLVQRMQWLMTVAGCMPYQLLALTFTNKAAQEMRQRLEQSAACSLNQLWMGTFHSICLRILRQYAELVGWEKSFIVIDSDDQLRLIKRLLQKNNWNEEILSAKAVQAQINAYKENGLRAADLPTSAPPLEIAVHHFYQEYEHITRQQGTMDFAELLLLTTELLAQHETVQQRFHQRFQAILIDEFQDTNTLQFKLVTQLCAPETQLFVVGDDDQSIYGWRGAQIDHIVHLERYYPTVHTIRLEQNYRSTKTILAAANAVIAHNQTRLGKTLWSDGKHGEAIALYAAVNEYDEARYLVENIAQFHQHGGAYDQCAILYRSNALSRIYEEALIQKNIPYRIYGGLRFFERAEIKDALAYLRLLHYPDDDAALERIINQPPRGIGAKTMEDVRLLAQRVQCSLWRVITDDALLEQKCSARAQNALRQFRALIIKMTAFAERSDSLRDILKMVVDESGLYAALTNNNQEETENRRENLHELIAAGDYQSDQNDADHDKIADFLAMASLDAGDKETNAHGVQLMTLHSAKGLEFNRVYMVALEEGLFPNARSLENSAQLEEERRLAYVGITRAREQLTMSFAERRRYYGQDNYARPSRFLNEIPPELLNMVRPVLFNRTQPNDIQEDNPWKTGVCVQHAQFGTGVIQAVEGSGEHQRALVKFTTVGEKWLVLAYAKLKIL >NC_009446.1|WP_135325948.1|169321_170902_+|glucose-6-phosphate-isomerase MMNNIFSQLSHHAEQLKRQTLNQLFVEDPKRVEKWQWQVAGIRVDLSKNHIDDAGRILWFSWLKQQQTSAHIKAMLSGEKVNYSEHRPALHHALRARAEGSFIVDCTDIYAEIRKTRAQIRDLTAAIRQGTLRGFSGKAIEDVVHIGIGGSELGPRLLCESFVHRSDRVRIHFLASPDPIHIQSLQQRLNPETTLLIIASKTFTTEETLANAHLMRHWLHAAGGQKADEQMIALTAAIDKAHEFGISSAHILPFWDWVGGRFSLWSAIALPFALQNGYDAYEQLLSGAREMDQHFQSTPEEHNLPMHLALIDAWYNHYFAIDNRAIVTYAQPLNSFVPYLQQLEMESLGKRANQQGAALIKPSGMIIWGGSGTESQHAFFQLIHQGQRRIPLDFITVKSVPNGYEAAGTIVHGNCLAQAEALMCGRTLEDLKDLPLEERYQRTCAGNHPSNMVILDELTPFHLGALIALYEHKTTVLGTLYDVNAFDQWGVELGKVLAKKTEASLRGECTVDNPSTRALIDYLRQK >NC_009446.1|WP_011927913.1|168675_169308_+|cold-shock-and-DUF1294-domain-containing-protein 
MSVRPKHNEICTGTVVYWNDDKGFGFIDTNEKQANVFFHISHFAYENRRPQRGDKVSFLRSPEQTSGKPSAKRVVIQGHEKTLLSRNVHEQQIQHPHFVEGCIYVLNDILFFLVLATISPIIAITSAIISVMTVSLYSYDKYAAIHDHQRVPEASLHIAALLGGWPGALIARAFLRHKTKKIRFVLFFWMSIFVNIAMIYGLVWVLYFSN >NC_009446.1|WP_011927912.1|167232_168591_+|sigma-54-dependent-Fis-family-transcriptional-regulator MNKNTASTILIVDDETMICETLVDILTDEGYQTYTAGSAAQARTAKQMYHPDLILLDIWMPDSDGITLLREWTSQQLNASVIMMSGHGTIETAVEATKLGAYDFLEKPLSTAKLLITIKRALQTQALIAQNAALKAQLDPNIEIIGRSQAMNEVRELASNLAKQNVPVLISGNAGSGKQHVAHFIHQNSAFCDATFITANIAAMETHDITAALIGSKHHTGLLAAADGGTLFIDEISQLPKDGQRLLLGLIEEQAYLPANQHIRCTTHIRVIAATRLPPLLLKEHLDPALFDLLMVATIILPDLQDHSSDVPELLEYFSKYFADFEQMPYRHFSLAAQNTLRQHCWTGNVRELKNLVQRLLIQNDAAEISAEEAEQALTPTEISPQDGLWSQIIPKDLSLREARELFEHQYLLEQFRHCDGNIARLANRIGMERSNLYRKLRNLGIDPTDKP >NC_009446.1|WP_041729373.1|165033_167172_+|HAMP-domain-containing-protein MKLNIRHMIRTMALGVLTIIASLIAIYQLTQAATRPEDANPYYIHFLIITLIGLALIFSLALWRIYALIRHLRRQHSGARLSLSFALRMLLAALFPLGIIGAFSWSFLSSDLGMIFNRRVTIALEDALQLTRSAISWRANQAIMQTRQLAHFMTTMRYIDLVSEIELLRRANHAIELAQFDHQGNLVAFAHQDLTVMTVAPPDAATLSRVNEEQEFFEFSAESDDTYSIRVLSKMIKPDSEVFYLRAIYAMPTEFNTLANSVRENYQQHLSYSYLQPHITTSLLLVFGLIIALTVLSALWLSTLFGETMARPVRQLIEATRKVAGGDFSTPVTVIHNNDLGVLSNHFNMMMSALRAAEETNSLIQSQLSEQNTFLSTLLDNITAGVMTLDHLGQLQVYNHAAPQLLDCDLLPYLGKVPPAEECAVDSYGEFMAAIARCSDKEEWHQEVVLAKFSQRKIVISHGRRLPAPQQNGHGYIIVFEDVTEFQQNQRNAAWEEVARRLAHEIKNPLTPIRLQTERLQRKLTDKLTDEYDRHILQRATETIINQVDAMLQLVSDFSQYAKPIELRRQRLDINALLQDIANLYHHYDLELQLAPDVPPLLADPIQLRQVMINLTNNALEASKNGEKTMICWTTSYENGLIKVSVEDNGSGFADLSKDPFEPYVTTKPKGTGLGLAIVKKIITEHQGSIQAGPSKQLNGAKITFILPLSSE >NC_009446.1|WP_011927921.1|179552_179840_+|hypothetical-protein MRAAGKWQGSIEDGVSFLRGLDDIVIAPRCAHTLEEAQLWRYKTDRLTGDPLPELDDAHDHCWDAIRYALSDVIRGGYQGNSIIAGAARAFRRGR >NC_009446.1|WP_011927922.1|179836_181435_+|DUF935-family-protein 
MNDKVDKQASAATSAQALYTDPVFTLTNEDADKVLKNAGLSRSDLGKLLYDDEIFACCDRREKAVVGTRWRIEGDNTDWLHAEISRWHETLVRRTMDAQWIGSSISELIWRRPEEDHNGIRLAAVEPRKIERFINQDGVLRYQTQSGSYIDVEPLKVLEVRMNASAANPYGDALLSRVYWAWFNKNYGEQFWSKYAERHASPLTVGKFNPRTNNQAEAQRHLNDLAITLAQAISDGVIVITQDDEISFVNATSDGSAHQLFTRHHIQRIQKTIIGRVLTSELAGGSRAAQETDDNFSQILFDYDLTLCERVINEFIAKVLRLNGTARGDILFAYDRTESIDKERWERDTALMDRGMRFTEQYFIDQYHLEPIYFSLEQIERAARSERAANAAQKAGLSLSKKQELTPAAQALEDRVQAGMAEAPEPITREMIEDVVKNAPNDYQLLEDLVKLYGDRDPEGFNDWFGEALEIACAHGYHDADLPQNSLKARSPRNFLIINEEVISHNHFPQTRPEKRFSPLISRHRTRVTGLQ >NC_009446.1|WP_011927924.1|184933_185782_-|DUF1837-domain-containing-protein MPWTSEHTKWLIDTGERLKTADGKEVEVWEFRHEKDEAVLSAWAKHFRNHYCLDAEIDFLRGKRPRPDYLDNIKFPCKTSKLGPGIRAGDFGEILVSDYLQWLLGYWVPRVRWSSKVVRDESPKGSDVIGFRFHKKDGDASTKDVLFVFESKTKFSASKINRLQDAINDSAKDHIRIDESLNFIKQKLFEKKEIEQAQRIERFQSPVDMPYKETYGAAAIISDECFDADELASADCQKIPKSAKSKEVFPHPNGDSLVLLVIKGPGMMDLVHELYRRAADEA >NC_009446.1|WP_011927925.1|185800_186556_-|hypothetical-protein MGKRHEAIGIKQAIRFEWMQKAANLLLAGLDAKTIRQELHEFLADRKGNGSEGERSDQTRTFVVNNLMKIWVSPDPELIPFRDASLAFLRENPSMALAVHWGMISAVYPFWFNVARQTGRLLALQDQVTQTQIINRLKEQYGDRQTVSRYARFVIRSFVAWGALKDSEAKGCYEKAAPVSIAEPNLAILMFESALLATPEAKGALGLLLNNPAFFPFQLPVMTGDFVSQRSDRIDVVRYGLDDELLKLKGN >NC_009446.1|WP_011927926.1|186555_188556_-|BREX-3-system-phosphatase-PglZ MSSWRDAILNDFVPNVSKLTLVADPDCLLTEEKLALELRGRGFDLIEFSDPVEFRYAYESKYLSIWDRGEHTDLVVVLRLQDAELESLPYDLLQAGRKLSFNLGDLFPNLSYPVIEKLDRSLLDSLFEAQRKSPSDRMGDNATKDFILRHVFGIAAELIGGEVELLRALLRLHYGKLQIPQMLAEQLIQVLKGHDGLKAWPLSEIVPDDEAFFAFLQERWPLFLSRLGSAHQVREDSPEYGLKYPGPDRLPFDHQDIKVYIDNLFLEGKLTPVEAKGIEVDAGSWVRSGITTSGVDDDELRISRLFGLIEKELPTAEARYSNSNWTAFALKWAELSSLVHCGNSTEYQTRLREIGDALNTIFAAWLADHYSSLINLPPTNPAMLHHVPRRLARDIEDSGSSRAALIVVDGLALDQWVTIRQLLQKQDANLVMRESATFAWIPTLTSVSRQSIFSGKPPLYFPSSINSTNSEEKLWKQFWEGHGLSRLDVAYQRGLGDGDAAGVLDSAIHPGKTKVVGLVVDKVDKIMHGMQLGSAGMHNQIKQWCHAGFLSAMVGQLLDYGYEVWLTADHGNIQCEGKGRPSEGVIAETRGERVRVYPTPELRAQVAGAFPFAHEWQPVGLPADYFPLVAGGRDAFVNPGDSIVGHGGVAIEEVIVPLVKFERRTR 
>NC_009446.1|WP_011927927.1|188552_191450_-|DEAD/DEAH-box-helicase MESLWQYSTVHNSACKVIEEQTLWGQTVCRVWLPNQDAVVRVPRSALRPLNADLQPEIEAGRIAYVAAAAKVAEVLEGSTSATEGYVLLAPMESNVIPLPHQIHALSRAISGDRVRYLLADEVGLGKTIEAGLVMRELKLRGLVRRTLVVSPKGIATQWVAEMQTHFNEQFQLVLGDDIGTLQRLAPGADHRSSAWSMFDQVIVSLDSVKPMDKRRGWTAERVAEYNRSRFEDLITAGWDLVIVDEAHRLGGSTDQVARYKLGKGLAEAAPYVLLLSATPHQGKTDAFHRLMNLLDDDAFPDMDSVSRERVASYVIRTEKRKAIDADGKPLFKPRRTQMAPVAWESRHQLQQLLYEAVTDYVREGYNQALREKKRHIGFLMILMQRLVVSSTRAIRTTLERRLAALKDGEQQASLRLAELENGADGLESPDDEIAELYDMDGQELLDELLKSHVSALQSEGSHVETLLDAAVRCEQAGPDAKAEALIEWIYKLQAEENEPDLKVLIFTEFVPTQQMLKEFLEARGISVVTLNGSMAMEERGAAQDAFRKSHRVLVSTDAGGEGLNLQFAHVIINYDIPWNPMRLEQRIGRVDRIGQPKTVQAINFVFEDSVEFRVREVLEQKLSVIFDEFGIDKTGDVLDSAQAGELFEDVFASAILNPDGIETSVDHTVARIRDEIQQVRESSAIYGISEEPDVQTAERLRSHPLPHWVERMTVGYLNSHGGAASRKRSWWDLNWPDGQEHRKAVFSAREADRLTDATLLNLENSRIRGLALNLPQVAAGQPLPCVTVSGLPASISGLWGLFEIRLQAGMHQKTQLLRIPMVRRGYVSVFLSEEGKLFLPTARHIWDALQTAEAEVQATLGQDDSITAHERLQIAAEQAGQELFDALQQAHLASVNREEERGMVAFTSRRKAIERVGLPEVRQYRLARCAAEENEWRHELQSARQIVPEIRSLLMLRIIKGGAQ >NC_009446.1|WP_011927929.1|191548_193036_-|hypothetical-protein MTAPVDLSHYADNILAAEDRPLFDDAVEAGKAGALRAAYVMIWLACAESLKRRFREAQKRDGAAGKIVGEIETKEKEHKAVDKFVLMKAHEYGFVSDSGHTVLNHIYEMRCLYGHPYEEAPSQEQVSHAAAVVVEHVLSKPVKLRHGFGKQLLKSLLEEPNFLDDQQTAVVAFTKDILPRLDESIHGWLLDNYWEELEKFSDDSSMAIFFRRGTWFSRTMLTEVGIDVFSHDDWHDRSSRFPKILMRVCSIADIFKEIGKRAQDSLVGLIIAESATRASVLTHLERLSINGALTMRQQERFVEHVSEMPSSAIRSAGLSTKTCYGKLIDAMKFHDWYVQNPAIDLIVSNGPDQAAELDENQQVNLGRNLLQAGEGTAGSANEFLEKLSQDGTSWPFHVVRGIAMESFTNEDNLIRFKDRHLGRVLSAIDHLQQELQDQLIAEISASVDAGIPKDRVDRDDFENTVDSLKVYPWAAPLVTSLEAKVASLSAEEEDA >NC_009446.1|WP_081423575.1|193038_195834_-|DNA-methylase 
MKMKETSLFDSLLEEPQKPSGPVTCLGMTFENDEARRAHFIEELRKKLQDPEFRKIEGFPIGSDEDILNLSDPPYYTACPNPWIADFIAEWDEQKPKQPEGHHYHREPFAADVSEGKNDPIYNAHSYHTKVPHKAIMRYILHYTQPGDIVFDGFCGTGMTGVAAQMCGDREVVMSLGYQVKSDGTILQEEIDENGKKVWKQFSKLGSRRAVLNDLSPAATFIAYNYNTPVDVAAFEKEAKHILKEVEKECGWMYETLHTDGKTKGKINYTVWSDVFLCPECTKEVVFWDVAVEKGKGIVHDKFPCPHCGSLLLKRSLKRAWETVFDEAFGDTIRQAKQTPVLINYTAGGKRAEKIPDPSDIALIEKINNSHIPYWFPVAELQDGFNTRQPKGSHGITHTHHFYTRRNLWILASLWSKASPKMRFGLTNFLSRNLTKMNRFVVNRHNPNGRINGPMTGTLYIPSEQVEQTATLLFKDKWIKHGWNTCGNLITTQSFSSIEASVTNSLDYIFIDPPFGANINYSELNSLWESWLSVKTDQKPEAVENDVQNKSLNDYRDLMLGCFRKAYELLKPGRWMTVEFSNTRAAVWNNIQTSIADAGFIVANVSVLDKKHGGIKAMAYSTAVKQDLVISAYKPNGGFEERFQKEAQTEEGIWDFVRTHLKYLPVTKQQGALLQFVPERDPRILFDQMVAYFVRKGYPVPISSQEFQIGLAQRFIERDGMFFLPDQVAEYDRKKMTSGELKQMSMFVSDEASAIQWLRQLIKEKPQTFSDINPQFMQQLGGWSKNEAQLDLRELLNQNFLSYDGKGPVPEQIHAYLSTNWKELRNLPKDDPTLVAKARDRWYVPDPNKAGDLEKLREKALLKEFEEYKAAKKKLKVFRQEAVRAGFKKAWQERDYTVIVAVADKIPNNVLEEDPKLLMWYDQAVTRMGGE >NC_009446.1|WP_011927931.1|195919_199651_-|hypothetical-protein MKYGDLIQFDPIESVVQLRDADKSSAAHTLVNTYVISEEMAERLIQLVIPQMQFDQPVDNKGLLVVGNYGTGKSHLMSVVSSLAADASLLEGLKGEGVRDAASQIAGRFKVIRTEIGATTMSLRDILVAELEEHLEKLGVEYVFPEAGTISSHKRAFEDMMAKFGEVFPEHGLLLVVDELLDYLRTRKDQELILDLNFLREVGEVCKDLRFRFMAGVQEAIFDSPRFAFVADSIRRVKDRFEQILIARSDVKFVVAERLLKKTTEQQAKIHDYLMPFAKYYGGLNERMDEFVRLFPVHPDYIDTFERVTVVEKREVLKTLSMGMKSILGKDVPQDEPGLIAFDSYWGTLKQNASFRAIPEIRAVIDCSQVLESRIENAITRKQYKPMALRLIHALSVHRLTTGDIYAPMGASAEELRDRLCLFDPLIAELGSDEPDKDLQTHVETVLREIHKTVSGQFISFNADNRQFYLDLKKTDDFDALIDKRAESLGQAQLDRFYYEALKRVMECQDATYVTGYKIWQHELVWQEHKAARTGYLFFGAPNERSTAVPQRDFYLYFVQPNDPPRFKDDRVNDEVFFRLKGTDEEFQTALKSYAAALDLAATSSGHAKATYESKANGFLKKLVQWLQKHMSGAFEVTYQGRTKTMTEWAKGKSIRDLSGISPHETINFRDLVNTIAGVCLAPNFENQAPDYPFFSILITSNNRAQAAQDALRAIAGQNRTKQATAVLDALELLDGEKIDPYKSKYTKFILDTVKAKGHGQVVNRSEIIQDDHGLEYMNPCGSRLEPEWVAVILASLVYSGDIVLAIPGKKFDATGLQQLAATGMDELVRFKHLEQPKEWNLPALKALFELLGMTPGMAQLVTQGKDEPVQNLQQAVGKIVKRIVMTQQTLREGLSFWGLDLLAGTDLASQASGLDEAKGFFESLQAYSSPGKLKNFRYSAPEVLAHEKAVKALDELDALREFIMDHSPTASWLSTAEAVLPAEHDWVDRMKTTRQDVLDALK
QADLTELASQSQSIGAKLQKLKKDYTVAYIGLHTKARLGVNDDKRKAGLLNDQRLQTLLKLAGIDLMPRQQLTDYQNRLAGLKSCFALTEQNLDASPICPHCGFRPSVETGTAAGSQMIDQMDAQLDAMVTAWTSTILSNLEDPITQANMDLLKIDDREPLEAFIKSKELPVPLDSNFVHALKEVLSGLVKVTVKAQELQQALQVTDGPATPAEMKKRFEEYIDQLTKGKDPAKVRIVME >NC_009446.1|WP_011927932.1|199647_200163_-|BREX-3-system-P-loop-containing-protein-BrxF MAEPIHDKIKRSLQAAEGLYHRLVLLVGETGSGKTGVLRDIAEEFGSSVVNVNLALSGELLELTAKQRSLRLPGILDQIADQAQAPVVMDNLEILFDKDLQQDPLRLLQSISRNRAVVASWNGIMNSGRLLYAETGHPEYRSYDSVDALIVGMDGTATVDSAKNNREAGQA |
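The spacer records above follow a compact header layout that appears to be `>index|start|length|contig|tools` followed by the sequence. The sketch below is one way to read such records and to check the reported start positions against the array layout. It assumes 1-based coordinates and a fixed repeat length of 28 bp, both inferred from the NC_009446_1 record rather than from any documentation; the function names are hypothetical.

```python
import re

# Assumed record layout (inferred from the page, not from a published spec):
#   >index|start|length|contig|tool1,tool2,... SEQUENCE
SPACER_RE = re.compile(
    r">(\d+\.\d+)\|(\d+)\|(\d+)\|([^|\s]+)\|(\S+)\s+([ACGT]+)"
)


def parse_spacer_records(text):
    """Return one dict per spacer record found in `text`."""
    records = []
    for m in SPACER_RE.finditer(text):
        records.append({
            "spacer_id": m.group(1),
            "start": int(m.group(2)),
            "length": int(m.group(3)),
            "contig": m.group(4),
            "tools": m.group(5).split(","),
            "sequence": m.group(6),
        })
    return records


def expected_spacer_starts(array_start, repeat_len, spacer_lens):
    """Spacer starts for an array laid out as repeat, spacer, repeat, ..."""
    starts = []
    pos = array_start + repeat_len  # first spacer follows the first repeat
    for n in spacer_lens:
        starts.append(pos)
        pos += n + repeat_len
    return starts


# First two spacers of NC_009446_1, copied from the record above.
example = (
    ">1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT "
    "TATCAAAGAACCAGTCAAGGAACCATGAGTCG "
    ">1.2|179097|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT "
    "ATTCGCAAACAAAACAGCGAAATTTGGGCGAG"
)
records = parse_spacer_records(example)
predicted = expected_spacer_starts(179009, 28, [r["length"] for r in records])
reported = [r["start"] for r in records]
```

For these two spacers the predicted and reported start positions agree, which is consistent with the 179009-179456 span being eight 28 bp repeats interleaved with seven 32 bp spacers (448 bp in total).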
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_009446_2 | 315538-315799 | Orphan |
NA
Consensus repeat of NC_009446_2
|
5 spacers
spacers of NC_009446_2
>2.1|315560|26|NC_009446|PILER-CR AATCTCCACAACTACAATCTTTATCT >2.2|315608|26|NC_009446|PILER-CR CATGTTCGCCGATGCAACCGCAGTTA >2.3|315656|26|NC_009446|PILER-CR AATCTCCACAACTACAATCTTTATCT >2.4|315704|26|NC_009446|PILER-CR CATGTTCGCCGACGCAACCACAGTTG >2.5|315752|26|NC_009446|PILER-CR AATCTCCACAACTACAATCTTTATCT |
DEDDh |
CRISPR arrays and Neighbor proteins around NC_009446_2
The CRISPR arrays of NC_009446_2 >merge|NC_009446|2|315538-315799|PILER-CR TTTTCTTCATCGCAATCGCAGGAATCTCCACAACTACAATCTTTATCTTTTTCTTCATTGCAATCACAAGCATGTTCGCCGATGCAACCGCAGTTATTTTCTTCATCGCAATCGCAGGAATCTCCACAACTACAATCTTTATCTTTTTCTTCATTGCAATCACAAGCATGTTCGCCGACGCAACCACAGTTGTTTTCTTCATTGCAATCGCAGGAATCTCCACAACTACAATCTTTATCTTTTTCTTCATTGCAATCACAGG >NC_009446|2|2|315538-315799|PILER-CR TTTTCTTCATCGCAATCGCAGG AATCTCCACAACTACAATCTTTATCT TTTTCTTCATTGCAATCACAAG CATGTTCGCCGATGCAACCGCAGTTA TTTTCTTCATCGCAATCGCAGG AATCTCCACAACTACAATCTTTATCT TTTTCTTCATTGCAATCACAAG CATGTTCGCCGACGCAACCACAGTTG TTTTCTTCATTGCAATCGCAGG AATCTCCACAACTACAATCTTTATCT TTTTCTTCATTGCAATCACAGG
>NC_009446.1|WP_012030640.1|312987_314325_-|HemY-protein MLRFIIVLILLCLGLLTGYAFNIESPVMINIFGRYQIETHFINLVLASILFGFLFITLFRILFFIWNTPTIFSRNLKVRKKNKADRLLRGGLNDLGVGNYKCAEKKLANGGDLAEQLGISPVIYFENAAIAADRQQAFDRRDQYFIRARETVQAHDAVSRKVMRLTEAHSYILNHQFTQAESILNQLYQEDAKNSKVIAMLDEVYVGKKDWERAWLHLSTLRNQLSAEVFNERKLKYAQEMVQAALHDEEALSRVWQHLPAELHAEKSLLLPYASALHEKGHAEEIEKLLAQQIKYNGDLDLIQVYSQLRGINFNRALKNMNDWASMHAENSIFLYCHAQIAYRAKDYETAARCIEASIKLHPTPQAFALWGQILEATDKPGAAFVAYRQSIVDPKADSLNGELLLAQAGEKLALEKLAAEQTDGDAVAEVSENEAEKTESSTDE >NC_009446.1|WP_012030639.1|311825_312887_-|AI-2E-family-transporter MNPLIKWCSRVFNNPSLMALLLFGCTLSLAFFSIGQWLIPVIISAVIAYLLEGLIKKCEKNGVRRIFAVSVVFLLFSFLIIYIFIGVLPILINQAKGLITNLPVYLSYAQEKMHILPKRFPEIISQQDIDLMLGSMNAAVAEYTKILLSKKIFESLFAVFTVLVYIILIPILIFFFLKDKVKILSWLGQFLPDNHQIIQDIWTEVDIQIGNYIRGKFVEVMIIWIMCFIPFNILGLQYSLLLSLMVGLSVLIPYIGATIVTFPVLIVAYMQFGLNSGFWWSTGFYFVVQILDGNVIVPVIFSEAVSIHPIAIIMAVLVFGGLWGFWGIFFAIPLATLVKAIVEAWRRYQNRGQ >NC_009446.1|WP_081423580.1|311197_311815_+|bifunctional-tRNA-pseudouridine(32)-synthase/ribosomal-large-subunit-pseudouridine-synthase-RluA MDLPVVYQDEDMIAVDKPAGLLSVPGRGAEKRDSVEWRIKQEYCGAAAVHRLDMSTSGIMLIAKHKDAERYYKTAFEQRRVKKGYVAICHGLIAEDEGEMNAPLIGDWVNRPKQKVCYETGKAALTRFCVLSRQRDQTRVALFPHTGRSHQLRVHLADKGHPIVGDNLYGDAADCLLPRLLLHAEWLLFTRRDGAPIKLSTKIPF >NC_009446.1|WP_012030637.1|310193_311189_+|NAD(P)-dependent-glycerol-3-phosphate-dehydrogenase MHTIAVLGAGSWGTALALQLARNQHRVFLWGHRAAHIEQLIADGANHKYLPDVFFPKNLIPTADLAAAVASAEMVLAVVPSVGFAGLLSDLKPLLGKKPFMWAIKGFEQGSGRLLSDVFTEHFGKHHAHAILAGPSFAREVAAGKPTAVTIAAAHKNDAPAFAEPFHSSNFLCYTSDDLIGAQIGGAVKNVIAIAVGIADGLRCGANTRAALITRGLQEMTRLATALGAQAQTLSGLTGLGDLVLTATDDQSRNRRFGLALGQGKTALEAKALIGQVIEGEGAAHDTWALACRYQVRMPITQYMHQFLNGEIDIQTAVMHLSNRDLKAESA >NC_009446.1|WP_012030636.1|309720_310194_+|protein-export-chaperone-SecB MAEEQQPRILLEVRKLYVGDLSVEVPNAPEVFQQSLNPEISLGINHENKKLKEENYYSVHLRLTVTAKDSTSSSVIYLVEATQTGIFEIVGLDESQLQHALNVYCTTVLYPYAREVISSAITHAGFPSLYLQPINFDALYQQQLQQEQNTTAQGGEA 
>NC_009446.1|WP_012030635.1|308838_309642_+|undecaprenyl-diphosphate-phosphatase MTLWQAFILSLIQGITEFLPISSSGHLVITRELLHWQDAGVAFDAFTGLGTLTAVLFYYRKDVCSILYHWFRQFRHCDAPPAPEAKLGNQLIVATLPALLIGFMVKDHIDALTHRPLLIASTTMIFAIFLAAADFWGRKKLSLPETNYRQAFYYGLAQTLALVPGVSRSGITLTAGLAMHFSRESAARFSFLQSIPISAAAGGYGLWKLATNPSDFSWQLIALSYVTATLAAYVCIALFIRFLNTVGMMPHVIYRLLLGAYLFFVFM >NC_009446.1|WP_012030634.1|308125_308689_-|D-sedoheptulose-7-phosphate-isomerase MNWQDTITAHQKVFDALREHEDVVVRIGRGLLAAIERGNTIFVAGNGGSAADAQHFAAELTGRFVRERKPLPGIALTTDTSALTAIANDYGYAQVFARQLDGLAQPGDVFVGISTSGNSPNVLTAVELARESGLVTYGLSGNDGGKLSTACDDCVVVPSSITAQIQEAHIFILHAWCILIDEHADLF >NC_009446.1|WP_148188629.1|307391_307976_-|tetratricopeptide-repeat-protein MNARFFLMILLLWNHAWAEPAPPVALLSPTGKTNTLTESSTATPSNEAPATFNYEHVELEAINGNPESALEQLNKHLSAHPDDARAAYSKGLILMQLKRVDEAERWFKMMQSNFPNVTHSYNALAVIYSGRGDLLSAQSVLEALLRLQPQQQTARLNLAKIYLRLAQENYSKALKADPKNDKIARTLTALKALQ >NC_009446.1|WP_012030632.1|306279_307332_-|recombinase-RecA MNEEQKKALTAVLTQLDKQFGKGTVMRLGEQVAAHDIQAISTGSLTLDIALGIGGLPKGRIVEIYGPESSGKTTMMLHVIAEAQKNGGTAAFIDAEHALDPIYARKLGVNTDDLYVTQPDTGEQALEICDALVRSGAFDVIVVDSVAALTPKAEIEGEMGDSHVGLQARLMSQALRKLTGNIKRANTLVVFINQIRMKIGVMFGSPETTTGGNALKFYASVRMDIRRIGSIKEGDEVLGNETRVKVVKNKVAPPFKQAEFDILYGQGVSREGEIIQLAVNADIMQKSGAWYSYRDEKIGQGKEKVRLYLKEHPDVAQEIETKIREKFIGGELHLPDAAGDEIDTSINDEE >NC_009446.1|WP_041729409.1|305816_306293_-|regulatory-protein-RecX MMKNDELARDFERRCLALLAQREYSRAELAAKAADIAPEIVSAVLDKLAADGWQSDQRFCAVWVRSKAERGDGAQKIRQALKQRGIADALIAEQCAQFDWFALAERLYRKKYTKPAHDLKEQAKRQRFLAQRGFSFAEIRHAQSVFESEHHDAHAEHR >NC_009446.1|WP_012030642.1|316078_316834_-|uroporphyrinogen-III-synthase MNTENKYPLQGCRILYTRSKQHWLQAEPLLRQLGAQPYHLPLLDTKMQPLSAKALEQCRKADDLVFVSAQAVQHFLAQYQPVFQQNLIAIGMKTADALTAHAQTRFLVAPPPYNSEALLRIWQPQRHKIALIAAEGGRDLLYTTLSEDNEVYRIDTYQRFNPTHAWNFEMPLPHCILLASVQTLAHFLAITPQNMLKLLQCRAVIVALSPRIMQAAVHAGFLHCISAQYADERHLISCLEQWWLSTQGDSS >NC_009446.1|WP_012030643.1|316817_317735_-|hydroxymethylbilane-synthase 
MSTLRIATRKSPLALWQAEHVAQQLKQHYPELTVELVPIVTQGDILAHTPLSKIGGKNLFIKELEIAMQQNAADIAVHSMKDVGVTLPEGFVLAAILPRENPFDALVSNHYAHLNELPNGARVGTCSLRRKMQLAHYRPDLKLIDIRGNVHTRLQKLDSGAFDALILACAGLIRLQQNARIRQILPAEISLPAIGQGAIGVECRADSPFLAHIQTLNHFETAVCVQTERVVNQRLQGDCQVPIAVFATLSGKTMTLQSRIGTIDGQRMLAHQEICALEDAEKAGARCAEALIQQGAQDILHEYRK >NC_009446.1|WP_012030644.1|317879_318254_+|lipoprotein MKYAQLSLLSAALLMSACMDASQQQMVQQGAIGAAVGAGAGALLGKDDAAGKRNKKIATGAVVGGILGSQINRANQAPQYNQYPQNGYQQNYPQQNYNQYPQQNGYNQYPQNGYQQNYGGGYGY >NC_009446.1|WP_012030645.1|318305_319091_-|cell-division-protein-ZapD MMHNTAFQGESSSLHIYEQPLAERMRLFMRLESMFEQLHLFHQANEYYSIRLFLDALFDILDFLHRYEIRAEVFKELQRISLALEREYLGADKTFLEEKVSAALAKIHQLDFNPINRLRENELLNSLRQRNVNKSGNCLFEVPAYQFWLANNIGRENEFLNYCYQLFIPLSEAIAVSLSIIRSSATLTEEYTDNGIFLKTLDKDRKNQILRIHLPTSHCVFPRISGDNHRFAIRFMEQNNPQTRSVQTKEPVVFSLQICAM >NC_009446.1|WP_012030646.1|319458_323160_+|transaldolase MKRMLINATQQEELRVALVDGQQLYDLDIETLYSAQKKANIYTGTITRIEPSLEAVFVDYGSTRHGFLPFKEIAKEYLAEPHDGADKSNIKDLLSVGQKVLVQIEKEERGNKGAALTTYVSLAGRFLVLMPNNPHAGGVSRRIQGDERKELKDYLEQLGVPEEMGVIIRTAGVGRSIEELQWDLDFLRQVWDAITAAYHNTASQKLIYQESNIIVRALRDYLRPDVGQILIDDEQVYQQAMDFMNLVMPSSINKLKLYQDPTPLFTRYQIEGQIETAYQRNVKLPSGGELVIDYTEALVSIDINSSKSTKGCDIEETAYQTNLEAADEIARQMRLRDFGGLIVIDFIDMDVSRNRKDVEQRLIDATKIDRARIQIGRISRFGLLEMSRQRLRASIDEASHQVCPRCKGQGSIRGIQSQALSLLRLIEEEAMKDRTRRITGELPVDIATFLLNEKRSVIQSIEKRNHVDIVLTINPHLHSPDYFIERFRDDEMNEEMSAVPSYRLVNHQHNSEEMPILRPKDDRVEAPVVSSIMPQTPVPQAKGGAAVVKSGLSALFSKVVALFKEGHNGHTVDAVLHKKEEKQTINESVTPTSKNVHHEETAQPVREKPIVTPAPEASTVQTHESTPANGANVKRKEKSDDHDKHHAKPTPKAAKMENDDEVNPSLEELLHPVESKNGREVRKGRPRDVHAVRGQGKAPETMPDFEQSSDELKRTETVAHHDVKNERISDNKSPEVNAETVLTENNATLVPPKVHQAPGLVAFLETENAPISADEDDIDEPQSMQEHDDASSEQAEILETALSEKEQETIGEPLPVPVKKEQESVAEIAPAEKEQESVAETAPAEKEQKTVAETASEKVSHIHAVTKLGQSIWYDNIDRALLQSGTLQRLIEEDDLRGITSNPAIFQKAFSRTRDYDAALSAWLEHHEGDAQAAFYALAIEDIQQACDLMQPVFEKTNGTDGMVSLEVSPHLAHDAPATVAEALSLQQRVARQNLMIKIPATDAGCAALTELTAQGINVNMTLLFSLAQYQRVLEAYIDGLKRRVENGQTIDSIRSVASFFVSRVDTAIDALLDDAHAHLRGRTAVANAQAAYLYYLERISHDDWIELQQKGAAVQRLLWAS
TSTKNPNYADTRYIDMLIGADTVNTVPPETYAAFKDHGRVSATLLKNIEQAQQTLRNIEDAGIDLDAVTRQLTLDGIAQFERAFTELLQTLTDKIQTLKPHANDITGENHV >NC_009446.1|WP_012030647.1|323152_323347_+|hypothetical-protein MFDEATVSLLRCPVTGQALRFERAENCLYTLDHSRRYPIVDGIALLLPEHSEAIALLTAEKNDA >NC_009446.1|WP_012030648.1|323336_324071_+|3-deoxy-manno-octulosonate-cytidylyltransferase MTPDIRVVIPARYASTRLPAKPLALIGGVPMIVRTAQQVAQAGFPYCVAYDDERIGDVLAAHHIPAIKTRFTHENGTQRLSEVVIARAWTDETIVVNVQGDEPLLPPDLITTVARTLIEHTQASVATLATVCDAPESPNTVKVVCDCAGYALYFSRSVMPYVRDAAAPPVSYLRHIGIYAYRVQLLKRYPQLAPTPLEQAEKLEQLRFLEHGFKIAVAQIDEAPPAGVDSPEDLARVQALFVHE >NC_009446.1|WP_012030649.1|324063_325887_+|excinuclease-ABC-subunit-UvrC MNNSGFAFDPDVFLSHVSTLSGVYQMRDQNGTVLYVGKAKNLRQRLSHYFQKTGLSVKTRALMRAVYDIQTTSTPTEAEALLLENNLIKQYQPKFNILLRDDKSYPYICLSQHDFPRLFLYRGARKNGDFFGPYPNVQSAHHALAILQKVFRLRPCLDSFFKNRSRPCLQYQIKRCYAPCVGKISAEMYAQTVQHARDFLTGNSEHLLQTLTEHMLQASAAQQYERAAIVRDQISELRTIQQKQSMVVYAANVDVLAVATAYGKACVQVLFFRDGHSVTSQAFFPKLPELLPAGAILQAFIGQFYHQRPVPSQIVLSEALPDMDAVSEFLSQMSAHTVTLTTQPRAIRKKWLRMTQENARLNLRLHLAQKLSMHERFKALAQAFDWQKMPQRLECVDISHMQGEYTVASCVVFDRRGAVKSDYRRYKINGITGGDDYAAMKQVIKRRFARLKKGEGVMPDVFFVDGGRGQLQQAIAVFEEMQIEGVQLIGVAKGEGRKAGLEQFWFPHENRPRTLPADSQAMQLIIHIRDEAHRFAISAHRRGRDKKVRVSLLEEIPNIGRKRRQALLQHFGNLAGLMQASPEDITRVPGISVKLAAQIYAALHQGE >NC_009446.1|WP_012030650.1|325888_326446_+|CDP-diacylglycerol--glycerol-3-phosphate-3-phosphatidyltransferase MRSVATFLTVLRIILVPFFIILYYYQFDFWGRWPALIVYAVAGISDYLDGYLARKLKETSAFGAFLDPVADKLMVAAVLIVVLQQNPHIWLMVCTLIIIGREIWISALREWMASMQMRDVVAVAKIGKWKTTLQMLALGFLIYREPFIGLPIWSIGQILMIGAALLTLYSMWSYNLSAWKAIKEK >NC_009446.1|WP_012030651.1|326455_327076_+|ribonuclease-T MNHLISKRFRGFLPVVVDVETGGFDHEKDALLEVAAVLVNFNEAGNLAPVETFHYHVKPFEGAHLNPDSLKINGIDPFHPLRPALDEVTVAKQLFGAIREYQKAQSCTRSILVGHNAHFDLGFINALAARCNYQHNPFHPFSSLDTVSLGALAYGQTVLARIAKAAGFEYDSERAHGAKYDTELTAQIFCHIINTWSEKIGIPEQS |
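Several of the spacers listed for NC_009446_2 are identical or nearly identical to each other. A minimal sketch for quantifying that, using only the five spacer sequences given in the record above, is shown here. The helper name `hamming` is hypothetical, and the comparison assumes equal-length sequences with no alignment gaps.

```python
from collections import Counter

# Spacer sequences of NC_009446_2, copied from the record above.
spacers = {
    "2.1": "AATCTCCACAACTACAATCTTTATCT",
    "2.2": "CATGTTCGCCGATGCAACCGCAGTTA",
    "2.3": "AATCTCCACAACTACAATCTTTATCT",
    "2.4": "CATGTTCGCCGACGCAACCACAGTTG",
    "2.5": "AATCTCCACAACTACAATCTTTATCT",
}


def hamming(a, b):
    """Count mismatched positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


# How many times each distinct spacer sequence occurs in the array.
occurrences = Counter(spacers.values())

# Mismatches between the two spacers that are similar but not identical.
mismatches_2_2_vs_2_4 = hamming(spacers["2.2"], spacers["2.4"])
```

Here `occurrences` groups spacers 2.1, 2.3 and 2.5 under a single sequence, and spacers 2.2 and 2.4 differ at three positions. What that repetition means for this array is not stated in the record, so the sketch only reports the counts.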
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NC_009446_3 | 318123-318232 | Orphan |
NA
Consensus repeat of NC_009446_3
|
1 spacer
spacers of NC_009446_3
>3.1|318155|46|NC_009446|CRISPRCasFinder TCCGCAACAAAATTACAATCAATATCCCCAACAAAATGGTTATAAC |
DEDDh |
CRISPR arrays and Neighbor proteins around NC_009446_3
The CRISPR arrays of NC_009446_3 >merge|NC_009446|3|318123-318232|CRISPRCasFinder CAATATCCGCAAAATGGTTATCAACAAAATTATCCGCAACAAAATTACAATCAATATCCCCAACAAAATGGTTATAACCAATATCCGCAAAACGGTTATCAACAAAATTA >NC_009446|3|2|318123-318232|CRISPRCasFinder CAATATCCGCAAAATGGTTATCAACAAAATTA TCCGCAACAAAATTACAATCAATATCCCCAACAAAATGGTTATAAC CAATATCCGCAAAACGGTTATCAACAAAATTA
>NC_009446.1|WP_012030643.1|316817_317735_-|hydroxymethylbilane-synthase MSTLRIATRKSPLALWQAEHVAQQLKQHYPELTVELVPIVTQGDILAHTPLSKIGGKNLFIKELEIAMQQNAADIAVHSMKDVGVTLPEGFVLAAILPRENPFDALVSNHYAHLNELPNGARVGTCSLRRKMQLAHYRPDLKLIDIRGNVHTRLQKLDSGAFDALILACAGLIRLQQNARIRQILPAEISLPAIGQGAIGVECRADSPFLAHIQTLNHFETAVCVQTERVVNQRLQGDCQVPIAVFATLSGKTMTLQSRIGTIDGQRMLAHQEICALEDAEKAGARCAEALIQQGAQDILHEYRK >NC_009446.1|WP_012030642.1|316078_316834_-|uroporphyrinogen-III-synthase MNTENKYPLQGCRILYTRSKQHWLQAEPLLRQLGAQPYHLPLLDTKMQPLSAKALEQCRKADDLVFVSAQAVQHFLAQYQPVFQQNLIAIGMKTADALTAHAQTRFLVAPPPYNSEALLRIWQPQRHKIALIAAEGGRDLLYTTLSEDNEVYRIDTYQRFNPTHAWNFEMPLPHCILLASVQTLAHFLAITPQNMLKLLQCRAVIVALSPRIMQAAVHAGFLHCISAQYADERHLISCLEQWWLSTQGDSS >NC_009446.1|WP_012030641.1|314327_316082_-|hypothetical-protein MNDPNKPSAEHLNENILSTDNALQSTENNPQPHDKKEKDCDCHGEHACDCDEEKDCGCHSEHACDCNEEKDKDCNCGDSCDCDEENNCGCIDKHACDCNEEKDKDCSCGDSCDCNEENNCGCVGEHACDCNEEKDKDCSCGDSCDCDEENNCGCIGEHACDCNEEKDKDCSCGDSCDCDEENNCGCIGESSKKKGPCAFLTFLLAFLALAAAGYHEYQWQQMRANQQTFQSDSEKNIDALKNTVAQFDQGLDKAQVSHLIAEAIKALPLPPSEQEIGVFVEQKMKEQAEHTIKQAHSVAQESVAEFARTHDLNDIRATQASTEAKVQEAVDAFQHTATTAKESFTALADQATKQFTNLTQQAHPQPLIDALALADAAYQHNDYFAAAQFLNQALYRFEALNLMQTPFAAFKEPITAAQTQLASLIKADQERAQQLIALTESVDSWSFKSFEPVQVTMEDEASDETNLMSQAEQWGKQLLSKAVVIHKNDLSAAERVPANKAQRAIIRETIRLDVAYLRNAAMLHDRVGAKMAADDLTALITRYFAANDEAVQSALSVLSQFGADEPQPLEITTIIKAVKEAAGE >NC_009446.1|WP_012030640.1|312987_314325_-|HemY-protein MLRFIIVLILLCLGLLTGYAFNIESPVMINIFGRYQIETHFINLVLASILFGFLFITLFRILFFIWNTPTIFSRNLKVRKKNKADRLLRGGLNDLGVGNYKCAEKKLANGGDLAEQLGISPVIYFENAAIAADRQQAFDRRDQYFIRARETVQAHDAVSRKVMRLTEAHSYILNHQFTQAESILNQLYQEDAKNSKVIAMLDEVYVGKKDWERAWLHLSTLRNQLSAEVFNERKLKYAQEMVQAALHDEEALSRVWQHLPAELHAEKSLLLPYASALHEKGHAEEIEKLLAQQIKYNGDLDLIQVYSQLRGINFNRALKNMNDWASMHAENSIFLYCHAQIAYRAKDYETAARCIEASIKLHPTPQAFALWGQILEATDKPGAAFVAYRQSIVDPKADSLNGELLLAQAGEKLALEKLAAEQTDGDAVAEVSENEAEKTESSTDE >NC_009446.1|WP_012030639.1|311825_312887_-|AI-2E-family-transporter 
MNPLIKWCSRVFNNPSLMALLLFGCTLSLAFFSIGQWLIPVIISAVIAYLLEGLIKKCEKNGVRRIFAVSVVFLLFSFLIIYIFIGVLPILINQAKGLITNLPVYLSYAQEKMHILPKRFPEIISQQDIDLMLGSMNAAVAEYTKILLSKKIFESLFAVFTVLVYIILIPILIFFFLKDKVKILSWLGQFLPDNHQIIQDIWTEVDIQIGNYIRGKFVEVMIIWIMCFIPFNILGLQYSLLLSLMVGLSVLIPYIGATIVTFPVLIVAYMQFGLNSGFWWSTGFYFVVQILDGNVIVPVIFSEAVSIHPIAIIMAVLVFGGLWGFWGIFFAIPLATLVKAIVEAWRRYQNRGQ >NC_009446.1|WP_081423580.1|311197_311815_+|bifunctional-tRNA-pseudouridine(32)-synthase/ribosomal-large-subunit-pseudouridine-synthase-RluA MDLPVVYQDEDMIAVDKPAGLLSVPGRGAEKRDSVEWRIKQEYCGAAAVHRLDMSTSGIMLIAKHKDAERYYKTAFEQRRVKKGYVAICHGLIAEDEGEMNAPLIGDWVNRPKQKVCYETGKAALTRFCVLSRQRDQTRVALFPHTGRSHQLRVHLADKGHPIVGDNLYGDAADCLLPRLLLHAEWLLFTRRDGAPIKLSTKIPF >NC_009446.1|WP_012030637.1|310193_311189_+|NAD(P)-dependent-glycerol-3-phosphate-dehydrogenase MHTIAVLGAGSWGTALALQLARNQHRVFLWGHRAAHIEQLIADGANHKYLPDVFFPKNLIPTADLAAAVASAEMVLAVVPSVGFAGLLSDLKPLLGKKPFMWAIKGFEQGSGRLLSDVFTEHFGKHHAHAILAGPSFAREVAAGKPTAVTIAAAHKNDAPAFAEPFHSSNFLCYTSDDLIGAQIGGAVKNVIAIAVGIADGLRCGANTRAALITRGLQEMTRLATALGAQAQTLSGLTGLGDLVLTATDDQSRNRRFGLALGQGKTALEAKALIGQVIEGEGAAHDTWALACRYQVRMPITQYMHQFLNGEIDIQTAVMHLSNRDLKAESA >NC_009446.1|WP_012030636.1|309720_310194_+|protein-export-chaperone-SecB MAEEQQPRILLEVRKLYVGDLSVEVPNAPEVFQQSLNPEISLGINHENKKLKEENYYSVHLRLTVTAKDSTSSSVIYLVEATQTGIFEIVGLDESQLQHALNVYCTTVLYPYAREVISSAITHAGFPSLYLQPINFDALYQQQLQQEQNTTAQGGEA >NC_009446.1|WP_012030635.1|308838_309642_+|undecaprenyl-diphosphate-phosphatase MTLWQAFILSLIQGITEFLPISSSGHLVITRELLHWQDAGVAFDAFTGLGTLTAVLFYYRKDVCSILYHWFRQFRHCDAPPAPEAKLGNQLIVATLPALLIGFMVKDHIDALTHRPLLIASTTMIFAIFLAAADFWGRKKLSLPETNYRQAFYYGLAQTLALVPGVSRSGITLTAGLAMHFSRESAARFSFLQSIPISAAAGGYGLWKLATNPSDFSWQLIALSYVTATLAAYVCIALFIRFLNTVGMMPHVIYRLLLGAYLFFVFM >NC_009446.1|WP_012030634.1|308125_308689_-|D-sedoheptulose-7-phosphate-isomerase MNWQDTITAHQKVFDALREHEDVVVRIGRGLLAAIERGNTIFVAGNGGSAADAQHFAAELTGRFVRERKPLPGIALTTDTSALTAIANDYGYAQVFARQLDGLAQPGDVFVGISTSGNSPNVLTAVELARESGLVTYGLSGNDGGKLSTACDDCVVVPSSITAQIQEAHIFILHAWCILIDEHADLF 
>NC_009446.1|WP_012030645.1|318305_319091_-|cell-division-protein-ZapD MMHNTAFQGESSSLHIYEQPLAERMRLFMRLESMFEQLHLFHQANEYYSIRLFLDALFDILDFLHRYEIRAEVFKELQRISLALEREYLGADKTFLEEKVSAALAKIHQLDFNPINRLRENELLNSLRQRNVNKSGNCLFEVPAYQFWLANNIGRENEFLNYCYQLFIPLSEAIAVSLSIIRSSATLTEEYTDNGIFLKTLDKDRKNQILRIHLPTSHCVFPRISGDNHRFAIRFMEQNNPQTRSVQTKEPVVFSLQICAM >NC_009446.1|WP_012030646.1|319458_323160_+|transaldolase MKRMLINATQQEELRVALVDGQQLYDLDIETLYSAQKKANIYTGTITRIEPSLEAVFVDYGSTRHGFLPFKEIAKEYLAEPHDGADKSNIKDLLSVGQKVLVQIEKEERGNKGAALTTYVSLAGRFLVLMPNNPHAGGVSRRIQGDERKELKDYLEQLGVPEEMGVIIRTAGVGRSIEELQWDLDFLRQVWDAITAAYHNTASQKLIYQESNIIVRALRDYLRPDVGQILIDDEQVYQQAMDFMNLVMPSSINKLKLYQDPTPLFTRYQIEGQIETAYQRNVKLPSGGELVIDYTEALVSIDINSSKSTKGCDIEETAYQTNLEAADEIARQMRLRDFGGLIVIDFIDMDVSRNRKDVEQRLIDATKIDRARIQIGRISRFGLLEMSRQRLRASIDEASHQVCPRCKGQGSIRGIQSQALSLLRLIEEEAMKDRTRRITGELPVDIATFLLNEKRSVIQSIEKRNHVDIVLTINPHLHSPDYFIERFRDDEMNEEMSAVPSYRLVNHQHNSEEMPILRPKDDRVEAPVVSSIMPQTPVPQAKGGAAVVKSGLSALFSKVVALFKEGHNGHTVDAVLHKKEEKQTINESVTPTSKNVHHEETAQPVREKPIVTPAPEASTVQTHESTPANGANVKRKEKSDDHDKHHAKPTPKAAKMENDDEVNPSLEELLHPVESKNGREVRKGRPRDVHAVRGQGKAPETMPDFEQSSDELKRTETVAHHDVKNERISDNKSPEVNAETVLTENNATLVPPKVHQAPGLVAFLETENAPISADEDDIDEPQSMQEHDDASSEQAEILETALSEKEQETIGEPLPVPVKKEQESVAEIAPAEKEQESVAETAPAEKEQKTVAETASEKVSHIHAVTKLGQSIWYDNIDRALLQSGTLQRLIEEDDLRGITSNPAIFQKAFSRTRDYDAALSAWLEHHEGDAQAAFYALAIEDIQQACDLMQPVFEKTNGTDGMVSLEVSPHLAHDAPATVAEALSLQQRVARQNLMIKIPATDAGCAALTELTAQGINVNMTLLFSLAQYQRVLEAYIDGLKRRVENGQTIDSIRSVASFFVSRVDTAIDALLDDAHAHLRGRTAVANAQAAYLYYLERISHDDWIELQQKGAAVQRLLWASTSTKNPNYADTRYIDMLIGADTVNTVPPETYAAFKDHGRVSATLLKNIEQAQQTLRNIEDAGIDLDAVTRQLTLDGIAQFERAFTELLQTLTDKIQTLKPHANDITGENHV >NC_009446.1|WP_012030647.1|323152_323347_+|hypothetical-protein MFDEATVSLLRCPVTGQALRFERAENCLYTLDHSRRYPIVDGIALLLPEHSEAIALLTAEKNDA >NC_009446.1|WP_012030648.1|323336_324071_+|3-deoxy-manno-octulosonate-cytidylyltransferase 
MTPDIRVVIPARYASTRLPAKPLALIGGVPMIVRTAQQVAQAGFPYCVAYDDERIGDVLAAHHIPAIKTRFTHENGTQRLSEVVIARAWTDETIVVNVQGDEPLLPPDLITTVARTLIEHTQASVATLATVCDAPESPNTVKVVCDCAGYALYFSRSVMPYVRDAAAPPVSYLRHIGIYAYRVQLLKRYPQLAPTPLEQAEKLEQLRFLEHGFKIAVAQIDEAPPAGVDSPEDLARVQALFVHE >NC_009446.1|WP_012030649.1|324063_325887_+|excinuclease-ABC-subunit-UvrC MNNSGFAFDPDVFLSHVSTLSGVYQMRDQNGTVLYVGKAKNLRQRLSHYFQKTGLSVKTRALMRAVYDIQTTSTPTEAEALLLENNLIKQYQPKFNILLRDDKSYPYICLSQHDFPRLFLYRGARKNGDFFGPYPNVQSAHHALAILQKVFRLRPCLDSFFKNRSRPCLQYQIKRCYAPCVGKISAEMYAQTVQHARDFLTGNSEHLLQTLTEHMLQASAAQQYERAAIVRDQISELRTIQQKQSMVVYAANVDVLAVATAYGKACVQVLFFRDGHSVTSQAFFPKLPELLPAGAILQAFIGQFYHQRPVPSQIVLSEALPDMDAVSEFLSQMSAHTVTLTTQPRAIRKKWLRMTQENARLNLRLHLAQKLSMHERFKALAQAFDWQKMPQRLECVDISHMQGEYTVASCVVFDRRGAVKSDYRRYKINGITGGDDYAAMKQVIKRRFARLKKGEGVMPDVFFVDGGRGQLQQAIAVFEEMQIEGVQLIGVAKGEGRKAGLEQFWFPHENRPRTLPADSQAMQLIIHIRDEAHRFAISAHRRGRDKKVRVSLLEEIPNIGRKRRQALLQHFGNLAGLMQASPEDITRVPGISVKLAAQIYAALHQGE >NC_009446.1|WP_012030650.1|325888_326446_+|CDP-diacylglycerol--glycerol-3-phosphate-3-phosphatidyltransferase MRSVATFLTVLRIILVPFFIILYYYQFDFWGRWPALIVYAVAGISDYLDGYLARKLKETSAFGAFLDPVADKLMVAAVLIVVLQQNPHIWLMVCTLIIIGREIWISALREWMASMQMRDVVAVAKIGKWKTTLQMLALGFLIYREPFIGLPIWSIGQILMIGAALLTLYSMWSYNLSAWKAIKEK >NC_009446.1|WP_012030651.1|326455_327076_+|ribonuclease-T MNHLISKRFRGFLPVVVDVETGGFDHEKDALLEVAAVLVNFNEAGNLAPVETFHYHVKPFEGAHLNPDSLKINGIDPFHPLRPALDEVTVAKQLFGAIREYQKAQSCTRSILVGHNAHFDLGFINALAARCNYQHNPFHPFSSLDTVSLGALAYGQTVLARIAKAAGFEYDSERAHGAKYDTELTAQIFCHIINTWSEKIGIPEQS >NC_009446.1|WP_012030652.1|327072_327651_+|DUF2167-domain-containing-protein MTMATLRMPESWVLLKNEQRARFLREIEIEDEPALLAVAQSKEHDHAFALLRHQKSGYVVRPEETPIHPQLIRKQTEADLAILNSESALSEAERVRWQKFYLEPVYQAQTRTVEYGITLLFGNEAAVNLYRMLLVRDGALVLTLVGKPSDHLSLADWAIEPKDEMRYERFDPAHDKKSEGTLDNLILMNRFI >NC_009446.1|WP_041729417.1|327904_329047_-|N-acetylglucosamine-6-phosphate-deacetylase 
MSTYYVGARIFDRGQLVRNLALSVDKNHTQRILPETEIPENAPVVHLNGGILSGGFIDTQANGGGEVLVNDDFSADGLETVIQAHYQFGTVAMLPTFITDNQQKYHRAIAAIADGVKNGLNGLLGGHFEGPFIHPAKKGTHQARFIRQPDARDFACYQKHADYLQHSILSLAPEQVRAGTIAQIKPAIPQIQLAHSMATHQEILAAWCEGLTGITHLYNAMRAFSGRDVGAIGSAAELGLHCGIIADGIHSHPYALAMAYRNLGAEKLMLVTDAMSPLGAKNMQSFDLMGIKVFVQADRLINEDGALAGAQVTMLQCVQNAMKYMPIDCQSVLQMAVSTPAYYLGRPDLARIYPRPISEIIYLDEQLQTVTALPQLCGSM >NC_009446.1|WP_041729782.1|329463_329757_+|YfcZ/YiiS-family-protein MSDSLKCKADEVQACCCVEIGTIIDGKDCTVDVDYHYDNKGLAQKALDYFTEKARAAESEPCRIKSEIIESAHGAQLKAQFTFSCQAEAMIFQLSTR |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | NC_009446.1 | 822923-822954 | 0 | 1.0 |
NC_009446_1 | 1.2|179097|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179097-179128 | 32 | NC_009446.1 | 791357-791388 | 0 | 1.0 |
NC_009446_1 | 1.3|179157|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179157-179188 | 32 | NC_009446.1 | 824416-824447 | 0 | 1.0 |
NC_009446_1 | 1.4|179217|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179217-179248 | 32 | NC_009446.1 | 791606-791637 | 1 | 0.969 |
NC_009446_1 | 1.6|179337|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179337-179368 | 32 | NC_009446.1 | 788658-788689 | 1 | 0.969 |
1. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to position: 822923-822954, mismatch: 0, identity: 1.0
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer tatcaaagaaccagtcaaggaaccatgagtcg Protospacer ********************************
2. spacer 1.2|179097|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to position: 791357-791388, mismatch: 0, identity: 1.0
attcgcaaacaaaacagcgaaatttgggcgag CRISPR spacer attcgcaaacaaaacagcgaaatttgggcgag Protospacer ********************************
3. spacer 1.3|179157|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to position: 824416-824447, mismatch: 0, identity: 1.0
tgtcgaactaaacgatgaccagatttggttaa CRISPR spacer tgtcgaactaaacgatgaccagatttggttaa Protospacer ********************************
4. spacer 1.4|179217|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to position: 791606-791637, mismatch: 1, identity: 0.969
tatcgcagccacagcgtcgcgcaagtattagc CRISPR spacer tatcgcagccacagcgccgcgcaagtattagc Protospacer ****************.***************
5. spacer 1.6|179337|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to position: 788658-788689, mismatch: 1, identity: 0.969
gtaaaccatcaaaataacgtcaaattgggtta CRISPR spacer gtaaaccatcagaataacgtcaaattgggtta Protospacer ***********.********************
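The Mismatch and Identity values in the self-targeting table above appear consistent with a position-by-position comparison of two equal-length sequences, where identity is the fraction of matching positions reported to three decimals. The following is a minimal sketch under that assumption. The function name `compare_spacer` is hypothetical and is not part of the database's own tooling, and half-up rounding is assumed only because it reproduces the reported 0.969 for 31 matches out of 32.

```python
from decimal import Decimal, ROUND_HALF_UP


def compare_spacer(spacer: str, protospacer: str) -> tuple[int, float]:
    """Return (mismatch count, identity) for an ungapped comparison.

    Hypothetical helper: assumes both sequences have the same length
    and that identity = matches / length, rounded half-up to 3 decimals.
    """
    if len(spacer) != len(protospacer):
        raise ValueError("ungapped comparison needs equal-length sequences")
    # Count positions where the two sequences differ (case-insensitive).
    mismatches = sum(a != b for a, b in zip(spacer.lower(), protospacer.lower()))
    matches = len(spacer) - mismatches
    # Decimal avoids the round-half-even behaviour of float rounding.
    identity = (Decimal(matches) / Decimal(len(spacer))).quantize(
        Decimal("0.001"), rounding=ROUND_HALF_UP
    )
    return mismatches, float(identity)


# Spacer 1.4 and its protospacer at 791606-791637, copied from the alignment above.
spacer_1_4 = "tatcgcagccacagcgtcgcgcaagtattagc"
protospacer_1_4 = "tatcgcagccacagcgccgcgcaagtattagc"
print(compare_spacer(spacer_1_4, protospacer_1_4))
```

The sketch does not handle gapped alignments, so it applies only to hits where the spacer and protospacer are shown without `-` characters.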
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NC_009446_2 | 2.1|315560|26|NC_009446|PILER-CR | 315560-315585 | 26 | NZ_MF547664 | Clostridioides difficile strain LIBA-6289 plasmid LIBA6289, complete sequence | 7434-7459 | 4 | 0.846 |
NC_009446_2 | 2.3|315656|26|NC_009446|PILER-CR | 315656-315681 | 26 | NZ_MF547664 | Clostridioides difficile strain LIBA-6289 plasmid LIBA6289, complete sequence | 7434-7459 | 4 | 0.846 |
NC_009446_2 | 2.5|315752|26|NC_009446|PILER-CR | 315752-315777 | 26 | NZ_MF547664 | Clostridioides difficile strain LIBA-6289 plasmid LIBA6289, complete sequence | 7434-7459 | 4 | 0.846 |
NC_009446_2 | 2.2|315608|26|NC_009446|PILER-CR | 315608-315633 | 26 | NZ_CP015734 | Arthrobacter sp. U41 plasmid unnamed2, complete sequence | 162075-162100 | 5 | 0.808 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | NZ_CP019281 | Escherichia coli strain 13P484A plasmid p13P484A-1, complete sequence | 30528-30559 | 7 | 0.781 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | AP012536 | Stx2-converting phage Stx2a_1447 proviral DNA, complete genome | 9299-9330 | 7 | 0.781 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | JQ182728 | Enterobacteria phage mEp460, complete genome | 34861-34892 | 7 | 0.781 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | KF030445 | Escherichia phage 1720a-02, complete genome | 36616-36647 | 7 | 0.781 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | NC_003444 | Enterobacteria phage SfV, complete genome | 28700-28731 | 7 | 0.781 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | NC_049941 | Stx2-converting phage Stx2a_WGPS2 proviral DNA, complete genome | 9299-9330 | 7 | 0.781 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | U82619 | Shigella flexneri bacteriophage V, complete genome | 28700-28731 | 7 | 0.781 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | NC_009514 | Phage cdtI DNA, complete genome | 37944-37975 | 7 | 0.781 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | LR595862 | Escherichia virus Lambda_2H10 genome assembly, chromosome: 1 | 36240-36271 | 7 | 0.781 |
NC_009446_1 | 1.3|179157|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179157-179188 | 32 | NC_049832 | Escherichia phage vB_EcoS-DELF2 DNA, complete genome | 16338-16369 | 7 | 0.781 |
NC_009446_1 | 1.7|179397|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179397-179428 | 32 | MT774401 | CrAssphage cr6_1, complete genome | 16098-16129 | 7 | 0.781 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | MF807953 | Escherichia phage Ayreon, complete genome | 35979-36010 | 8 | 0.75 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | MT225100 | Escherichia phage Lys8385Vzw, complete genome | 36000-36031 | 8 | 0.75 |
NC_009446_1 | 1.3|179157|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179157-179188 | 32 | MT682715 | Escherichia phage vB_EcoS_Chapo, complete genome | 698-729 | 8 | 0.75 |
NC_009446_1 | 1.4|179217|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179217-179248 | 32 | NZ_CP040760 | Paracoccus sp. 2251 plasmid unnamed6, complete sequence | 58645-58676 | 8 | 0.75 |
NC_009446_1 | 1.5|179277|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179277-179308 | 32 | NC_011246 | Borrelia recurrentis A1 plasmid pl124, complete sequence | 62158-62189 | 8 | 0.75 |
NC_009446_1 | 1.5|179277|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179277-179308 | 32 | NC_011247 | Borrelia duttonii Ly plasmid pl165, complete sequence | 100482-100513 | 8 | 0.75 |
NC_009446_1 | 1.6|179337|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179337-179368 | 32 | NC_009517 | Psychrobacter sp. PRwf-1 plasmid pRWF102, complete sequence | 979-1010 | 8 | 0.75 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | NC_004313 | Salmonella phage ST64B, complete genome | 31420-31451 | 9 | 0.719 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | KU927493 | Salmonella phage 118970_sal3, complete genome | 69032-69063 | 9 | 0.719 |
NC_009446_1 | 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT | 179037-179068 | 32 | AY055382 | Salmonella typhimurium phage ST64B complete sequence | 31420-31451 | 9 | 0.719 |
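The Spacer_Info strings in the table above look like pipe-delimited records, and the Spacer_region column matches the start coordinate plus the length minus one (for example, 179037 + 32 - 1 = 179068). The following sketch parses one such string under that reading. The field names and the `parse_spacer_info` helper are assumptions made for illustration; the source does not document the format.

```python
from dataclasses import dataclass


@dataclass
class SpacerInfo:
    """Hypothetical record for one Spacer_Info string; field names are assumed."""
    array_index: int    # leading number, e.g. 1 in "1.4"
    spacer_index: int   # trailing number, e.g. 4 in "1.4"
    start: int          # first coordinate of the spacer on the contig
    length: int         # spacer length in bases
    contig: str         # contig identifier
    tools: list[str]    # array-detection tools listed for the spacer

    @property
    def region(self) -> str:
        # Inclusive coordinates, matching the Spacer_region column.
        return f"{self.start}-{self.start + self.length - 1}"


def parse_spacer_info(text: str) -> SpacerInfo:
    """Split 'array.spacer|start|length|contig|tool1,tool2' into fields."""
    ident, start, length, contig, tools = text.strip().split("|")
    array_index, spacer_index = ident.split(".")
    return SpacerInfo(
        int(array_index), int(spacer_index),
        int(start), int(length), contig, tools.split(","),
    )


info = parse_spacer_info("1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT")
print(info.region, info.tools)
```

Deriving the region from the start and length, rather than storing it separately, keeps the parsed record consistent with the table by construction.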
1. spacer 2.1|315560|26|NC_009446|PILER-CR matches to NZ_MF547664 (Clostridioides difficile strain LIBA-6289 plasmid LIBA6289, complete sequence) position: , mismatch: 4, identity: 0.846
aatctccacaactacaatctttatct CRISPR spacer tatctctacaactacaatctgtatca Protospacer *****.************* ****
2. spacer 2.3|315656|26|NC_009446|PILER-CR matches to NZ_MF547664 (Clostridioides difficile strain LIBA-6289 plasmid LIBA6289, complete sequence) position: , mismatch: 4, identity: 0.846
aatctccacaactacaatctttatct CRISPR spacer tatctctacaactacaatctgtatca Protospacer *****.************* ****
3. spacer 2.5|315752|26|NC_009446|PILER-CR matches to NZ_MF547664 (Clostridioides difficile strain LIBA-6289 plasmid LIBA6289, complete sequence) position: , mismatch: 4, identity: 0.846
aatctccacaactacaatctttatct CRISPR spacer tatctctacaactacaatctgtatca Protospacer *****.************* ****
4. spacer 2.2|315608|26|NC_009446|PILER-CR matches to NZ_CP015734 (Arthrobacter sp. U41 plasmid unnamed2, complete sequence) position: , mismatch: 5, identity: 0.808
catgttcgccgatgcaaccgcagtta CRISPR spacer cgagttcgcggatgcaaccgcagtgc Protospacer *. ****** **************
5. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019281 (Escherichia coli strain 13P484A plasmid p13P484A-1, complete sequence) position: , mismatch: 7, identity: 0.781
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaaccagtcaatgaaccaatagctg Protospacer .**************** ****** **..*
6. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to AP012536 (Stx2-converting phage Stx2a_1447 proviral DNA, complete genome) position: , mismatch: 7, identity: 0.781
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaaccagtcaatgaaccaatagctg Protospacer .**************** ****** **..*
7. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to JQ182728 (Enterobacteria phage mEp460, complete genome) position: , mismatch: 7, identity: 0.781
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaaccagtcaatgaaccaatagctg Protospacer .**************** ****** **..*
8. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to KF030445 (Escherichia phage 1720a-02, complete genome) position: , mismatch: 7, identity: 0.781
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaaccagtcaatgaaccaatagctg Protospacer .**************** ****** **..*
9. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NC_003444 (Enterobacteria phage SfV, complete genome) position: , mismatch: 7, identity: 0.781
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaaccagtcaatgaaccaatagctg Protospacer .**************** ****** **..*
10. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NC_049941 (Stx2-converting phage Stx2a_WGPS2 proviral DNA, complete genome) position: , mismatch: 7, identity: 0.781
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaaccagtcaatgaaccaatagctg Protospacer .**************** ****** **..*
11. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to U82619 (Shigella flexneri bacteriophage V, complete genome) position: , mismatch: 7, identity: 0.781
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaaccagtcaatgaaccaatagctg Protospacer .**************** ****** **..*
12. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NC_009514 (Phage cdtI DNA, complete genome) position: , mismatch: 7, identity: 0.781
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaaccagtcaatgaaccaatagctg Protospacer .**************** ****** **..*
13. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to LR595862 (Escherichia virus Lambda_2H10 genome assembly, chromosome: 1) position: , mismatch: 7, identity: 0.781
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaaccagtcaatgaaccaatagctg Protospacer .**************** ****** **..*
14. spacer 1.3|179157|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NC_049832 (Escherichia phage vB_EcoS-DELF2 DNA, complete genome) position: , mismatch: 7, identity: 0.781
tgtcgaactaaacgatgaccagatttggttaa- CRISPR spacer cgtggaacttaacgatgaccaga-ttgatgagg Protospacer .** ***** ************* ***.* *.
15. spacer 1.7|179397|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to MT774401 (CrAssphage cr6_1, complete genome) position: , mismatch: 7, identity: 0.781
---gctatagttatcgagtccagaaaaaataaagt CRISPR spacer caggcta---acatagagtctagaaaaaataaagt Protospacer **** .** *****.**************
16. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to MF807953 (Escherichia phage Ayreon, complete genome) position: , mismatch: 8, identity: 0.75
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaatcagtcaatgaaccaatagctg Protospacer .********.******* ****** **..*
17. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to MT225100 (Escherichia phage Lys8385Vzw, complete genome) position: , mismatch: 8, identity: 0.75
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaatcagtcaatgaaccaatagctg Protospacer .********.******* ****** **..*
18. spacer 1.3|179157|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to MT682715 (Escherichia phage vB_EcoS_Chapo, complete genome) position: , mismatch: 8, identity: 0.75
tgtcgaactaaacgatgaccagatttggttaa- CRISPR spacer cgtggaacttaacgatgaccaga-tcgatgagg Protospacer .** ***** ************* *.*.* *.
19. spacer 1.4|179217|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP040760 (Paracoccus sp. 2251 plasmid unnamed6, complete sequence) position: , mismatch: 8, identity: 0.75
tatcgcagccacagcgtcgcgcaagtattagc CRISPR spacer gtcggcagccacagcgtcgcgcaggcattggg Protospacer . *******************.*.***.*
20. spacer 1.5|179277|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NC_011246 (Borrelia recurrentis A1 plasmid pl124, complete sequence) position: , mismatch: 8, identity: 0.75
-gccgcaacatttctggctcatttaaatataag CRISPR spacer aaccatta-atttctagatcatttaaatataaa Protospacer .**.. * ******.* **************.
21. spacer 1.5|179277|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NC_011247 (Borrelia duttonii Ly plasmid pl165, complete sequence) position: , mismatch: 8, identity: 0.75
-gccgcaacatttctggctcatttaaatataag CRISPR spacer aaccatta-atttctagatcatttaaatataaa Protospacer .**.. * ******.* **************.
22. spacer 1.6|179337|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NC_009517 (Psychrobacter sp. PRwf-1 plasmid pRWF102, complete sequence) position: , mismatch: 8, identity: 0.75
gtaaaccatcaaaataacgtcaaattgggtta CRISPR spacer tttaaccatcaaaataacgtgaaaatgtgggt Protospacer * ***************** *** ** *
23. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NC_004313 (Salmonella phage ST64B, complete genome) position: , mismatch: 9, identity: 0.719
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaaccagtcaatgaatcaaaaacca Protospacer .**************** ***.** .*..*.
24. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to KU927493 (Salmonella phage 118970_sal3, complete genome) position: , mismatch: 9, identity: 0.719
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaaccagtcaatgaatcaaaaacca Protospacer .**************** ***.** .*..*.
25. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to AY055382 (Salmonella typhimurium phage ST64B complete sequence) position: , mismatch: 9, identity: 0.719
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer agtcaaagaaccagtcaatgaatcaaaaacca Protospacer .**************** ***.** .*..*.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
396937 : 407337
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NC_009446|396937:407337|DBSCAN-SWA GATGACGCAATATATTTTTATCACCGGCGGCGTGGTTTCTTCATTGGGAAAAGGGATTTCTGCTGCTTCTTTAGGCGCAATTTTGGAAGCGCGCGGTTTACGGATCACCATGATCAAATTAGATCCTTATATTAACGTTGACCCCGGCACGATGAGTCCTTTTCAACATGGAGAAGTGTTTGTAACGCATGACGGCGCGGAAACGGATTTGGATTTGGGGCATTATGAACGGTTCGTTAATATGCGCGCCACGCAATTTCATAATTGCACCGCCGGACGCGTTTATTTAGAAGTGATTCAACGCGAACGTAAAGGCGATTATTTAGGAAAAACGGTACAAGTTATTCCGCATATTACGGATACCATTCAACAATATATTCTCCGCGCGGCACAAGGCGCTGATATTGCTATGGTGGAAATCGGCGGCACGGTGGGCGATATTGAATCGTTGCCGTTTATGGAAGCCATTCGTCAATTGGGCACGAAATTAGGGCGCCGCGATACCATGTTTATTCATCTCACTTTAGTGCCTTTTATTTCGGCTGCCGGTGAACTCAAAACCAAACCAACGCAACACAGCGTTAAAGAATTGCGTTCCATCGGCATTCAACCGGATATGTTAATTTGCCGCGCCGATCGGGAAATTCCAGAAGATGAAAAAGCCAAAATTTCTTTGTTTACCAATGTGCCGGCAAAAGCAGTAATCAGCGGTTTAAGCGTGAAAAATATTTATGAAATTCCCATGCTTTACCAAAAACAAGGCGTTGATGATTTGGTGGTGAAACATTTTGGTTTACACGTTAAAGAAGCCAATTTAGGCGATTGGGAAAAAGTTGTTGACGCCATCAATCACCCCGAACATGAAGTAACCGTGGCAATGGTCGGTAAATACGTCAATTTAACTGATGCTTATAAATCATTGAATGAAGCGCTTTATCACGCCGGCATTAAAAATAAAACCAAGGTTAATATTCGCTACGTTGATTCTGAAGATTTATATTCTCAAGGAACAGAATTGCTCGCGGGCGTTGATGCCATCGTTGTGCCCGGCGGTTTTGGTAATCGCGGTGTGAACGGCAAATTGCGGGCAGTGCAATATGCGCGCGAGGAAAATATTCCTTATTTAGGCATTTGTTTGGGATTACAAATTGCTGTGGTGGAATTTGCACGCAACGTTTTAGGTATTAAAAATGCATGCAGCACGGAATGGGACGAAAAAACCGATGAGCCGATTATCGCATTAGTGGCGCAATGGGTAAATGAACGCGGCGAAGTAGAACAACGCGATAAAAAAATGGATCTCGGCGGCACGCTGCGTTTGGGCGCATTGCCAGCAAAATTAAAAGCCGGCAGCAAAATTGCCACGATTTACGGCAGCGAAATCATGAATGAACGCCATCGTCATCGTTATGAAGTCAACGCCAATTATGAAAAACGCTTGGAAGCCGCCGGCATGCGCATTTCCGGACGTTCCGCGGATAATGATTTGGTAGAAGTCATTGAAATTCCAAGCCATCGTTGGTTTATCGGTTTGCAATCGCACCCTGAATTTACGTCCACACCGCTCGGCGGTCACCCGCTATTTTCCGCATTCATTGCCGCGGCATTGGATTATCAAACGGAACGTTTATCATGAAATTAATTGATTTTGACGTTGACAACGATCGTCCTTTTTTCTTAATCGCAGGTCCCTGCGTTATTGAATCGGAATATTTAGCGTTGCATACGGCGGAAGTGTTAAAAACCATCACCGATGAATTAAAAATTCCTTTTATTTACAAATCTTCTTTTGATAAAGCCAATCGCTCATCGGGGGCGAGTTTTCGCGGTTTAGGATTGGCAGAAGGCTTGCGGATTTTAGCCAAAGTGAAAGAAACGTTTGCCGTGCCAATTCTTACGGACGTTCACGAATATACGCCGCTTGATG
AAGTGGCATCAGTGGTTGACGTTTTGCAAACACCGGCGTTTTTGTGTCGGCAAACCGATTTTATAACCCGCGTTTGCGCGCAGGGAAAACCCGTGAATATCAAAAAGGGACAATTTCTCGCGCCGTGGGATATGAAACATGTTGTTGCCAAAGCCAAAGCTACTGGCAATGAACAGATTATGTTGTGCGAACGCGGCGCTTCTTTTGGTTATAACAATTTAGTTGCCGATATGCGTTCGCTTGCCGTGATGAAAACATTTTCTGCGCCGGTGGTTTTTGATGCCACGCATTCCGTACAACTCCCCGGTGGCAATGGTGCCAGCAGCGGCGGACAACGCGAATTTGTGCCCGTTTTGGCGCGCGCAGCAATTGCCGTTGGTGTTGCTGGTATTTTTATGGAAACGCACCCCGAGCCAGAAAAAGCGTTATCCGATGGCGCCAATTCCATTCCTTTAGCCCAAATGCGCCATTTATTACAACAATTGCGCGCATTAGATACGCTGGTTAAAACTAATGAACTGATGAAGGAGCTTTCATGACCATAACCATTTCACGCATTCACGCTTTAGAAATTTTAGATTCACGCGGCAATCCGACGCTACAAGTTGGCGTTACCTTATCGGACGGCAGTTTTGCGCAAGCTGGCGTTCCTTCTGGCGCTTCTACTGGTTCGCGCGAAGCTTTGGAATTGCGCGACGGCGACGCTCAACGTTATTTAGGAAAAGGCGTGCAAAAAGCGCTGGCAAACGTGAAAAACCATTTTGCGCCGGCGTTAATCGGTAAACCGATTTACGATTTAGCCGCACTCGATCGCATCATGTTGGCGCAAGATGGCACGGATTTTAAAACCCATTTGGGCGCAAATGCGATTTTGGGCGTTTCTTTAGCGTTGGCGCGGGCAAAAGGGGCGAGTTTGAAAAAACCGTTGTATGCAGTGCTTTGTCCGCAAGAAGAATATACCTTACCCGTGCCGATGATGAATATTATTAACGGCGGTGAACACGCAGATAATTCCGTTGATATTCAAGAATTTATGATTGTACCCGCGGGATTTGATCGTTTTTCTGAAGCGTTGCGCGCTGGTTCGGAAATTTTTCATACGCTGAAAAAAGTATTAAAAGAACAAGGTCTTAATACGGCAGTTGGCGATGAAGGCGGTTTTGCGCCGGATTTACCTTCTAATGAAGCGGCTTTTGCCGTCATCATGCAAGCGATTGAACGCGCGGGTTATCGCGCCGGCGAGCAGATTTTCTTGGCGATGGACGCCGCGGCTTCCGAATTTTATCGCGAGGGACGTTATCATTTGGCTTCCGAGCAAAAAGCGTATACATCGGCAGAATTTGTTGATTATTTGGCGGATTTGTGTCGGCGTTATCCGATTGTTTCCATTGAAGATGGTTTGCATGAAAGCGATTGGGACGGCTGGCAATTGCTAACCCAATCATTGGGCGAACGCGTGCAATTAGTCGGTGATGATTTATTTGTTACAAACAGCGCGATTTTGCAAGAAGGCATTGATAAAGGCGTTGCCAACGCGATTCTCATTAAACCGAACCAAATTGGTTCATTAAGCGAAACGTTGCAAACAATCGCGCTTGCCGATGCTGCTCATTACGCCGCTATTATTTCGCATCGTTCCGGCGAAACGGAAGACACGACGATTGCCGATATTGCCGTGGCAACTACGGCGACCCAAATTAAAACTGGTTCTTTATGTCGCTCCGATCGGGTGGCGAAATATAATCGTTTGCTGACGATTGAAGATGAATTGGGAACGCGCGCGCGTTATGCCGCAAAAGCCGCGTTTTTGGGGAAAATTAAAGCGTAAATTCACCCTCATCAATCACCGCCTCGTGCGGTGATTGTGTTTTTTGTGGAGTATGTGATGCGTCAACGCAGTTTTTATCTTTTTTTAATCATTATCGCCGCATTGTTGGCGTTGCTTAATGTGTATTTATGGCAGTTGCAAGATGATAAAAAAAGCAAAATTCGCGAGTTAT
CCGAGCAAGTTTCTTATTTTAATACGCAAAACGAACGCTTAAAGGCGCGCAATAAAACGTTAGATCTCGATTTGCAAACGTTGCAATCGCCGGATTCTTTTTATACGTATGAAGAAAAAGCGCGCGAGGAGTACGGCATGATTGGGCAAGATGAAACGTTTTTTGTGTTGCCGCAAGAAGAATTAAATGCCTTGCCCGATCTTGCCGCATTGCAAGAATACGACCGCGAAGGTTTGGCGCCGATTTATGCTGTTTCGCCGCAAACGCCGCAACCGTCGCCGTCGGAACCGGAACCCGTTTCTGCGCCGATTGAAATGCCGAAAATTGACGCGCTGCCATTGGAATTAGAATCGTTGGAAGGCAATTAAAAACCTTACTTCAACGCATGGTAAGGTTTGTTTCTTTGTTTTATATTTTTTATTGATTTATCCTCGCAACAATAAGTTTTTAATAAAAGGACAACGATTGTGTTACGATTTTTTGAGCGCCTCACCGAGGCTTATCCCGAGCAATTAAACGCCGCGCCTGCTAAAAACTTGCTGCGTTTTTGTTTGCAATATGCGCAGGGTTTTAAAAAATATATTCTTTTATTGGGCGTAATCAGTGCGGCGCAAGCGGTATTTGAAGTCAGTTTATTTTCTTTTTTGGGGACGATTGTTGACTGGTTATCGCGGCATCAGCGCGCGGATTTTTTAAATCAAGAAGCGTTAACTTTGTTTGCGATGGCGGCAGTTGTTTTGTTGATATTGCCTTTATTGCATTTGATGAATACGCTTTTTCGTTATCAAGCATTGATGGGCAATTTCCCAATGTCGATTCGCTGGCGCATGCATCGGCATTTGCTGGGGCAAAGTTTATGTTTTTATCAAGATGAATTTGCCGGACGCGTTGCCACCAAAGTGATGCAAACGGCGCTTTCCACGCGCGAATTAATCATTAAATTGATGGACGTCGTGGTGTACGTCATCGTTTATTTTGCGTCAATGTTTTATCTTTTGGGGCGGTTAAATACCCATTTTCTCATTCCGATTTTTTGCTGGTTTGCGTTATACATTGCATTGCAGATTTATTTTATTCCGCGTTTAAAAGCCATCGCGCAAGCACAAGCGCACGCACGCTCAACGATGACGGGGCGCGTGGTTGATGCTTATACGCATATTTCCACCATCAAATTATTTGCTCATACGCGCCGCGAAGAAAATTATGCCCGCGCAGCAATGGACGATTTTATGCAAACGGTTTACACGCAATTTCGTTTGGTAACGCGTTTGCAAGTCAGCATGAACAGCATCAATTTACTGCTCATTTTTGCCGTCACCGCGCTGGGGATTAGTTTGTGGATTAAAGGCGCGGCAACCGCCGGCGCGGTGGCAATCGCCAGCGCGTTGGCATTACGGTTAAACGTGATGTCGCATTGGATTATGTGGGAAATCAGCAGTTTATTTGAACATTTAGGCACCGTAGTTGACGGCATGACGATGATGGCAGTGCCACCGGCAATAACCGATCAACGCGGCGCCCGCGATTTAGTCGTTAGCCGCGGCGAAATTACGTTTAAAGATGTGTGTTTTTCTTATCAAAATGCAGTCACTCAAAAAGCGCCGGTCGTCATCGATCATTTAAATTTAACGATTCGTTCGGGCGAAAAAATTGGTTTGGTGGGACGAAGCGGCGCCGGCAAATCCACTTTAGTTAATTTATTGCTGCGGTTTTACGACGTCAGCGGCGGCGAAATTTTAATTGACGGGCAAAATATCGCGGCGGTAACGCAAGAAAGTTTGCGCAGCCACATCGCGATGGTAACGCAGGATACTTCTTTACTACACCGATCCGTGCGCGAAAATGTGTTATACGGTAAACCTGATGCCGATGAAGCGGCGTTACAGCGCGCTTTGGAACAATCCGAAGCGTTGGAATTTATCGCACAACTTTCTGATCCACAAGGTCATTGTGGTTTAGATGCTCAAGTTGGCGAACGCGGCGTGAAACTTTCCGGCGGA
CAGCGGCAGCGCATCGCAATTGCGCGCGTTTTGCTCAAAAATGCGCCGATTTTAATTTTAGATGAAGCCACTTCCGCGTTAGACAGCGAAGTTGAAGCCGCGATTCAAAGCAGTTTATTAACGCTGATGACTGGAAAAACCGTCATCGCTATTGCGCATCGTTTGTCTACCATCGCGCAAATGGATCGTTTAATTGTTTTGGATAAAGGACGCATTGTTGAAGAAGGCACGCATCAAGAATTATTGGCGCAAAAAGGATTATATGCGCGCTTGTGGTCGCGGCAAACGGGCGGTTATTTGGGCGATATCGAAGAGTTTGACGATTAAATGAAGATTTTGCACAGCGCCGATTGGCATTTAGGCGCGAAATTACACGGTCAATCGCGCGAATCCGAACAACAGGCATTTCTCGATTGGTTTTTAGAAACGTTGGCGCGCGTGCAGCCGGATATTTTATTGCTCGCCGGTGATATTTTTGATACGGCAACGCCGCCCGTTTCTGCGCAGCGGCAGTATTATCATTTCCTTTATCAAGCCGCGCAGCAATGCGATGCCATCGTTATCATTGCCGGCAATCACGATAGCGCCGCGTTTTTAGATGCACCGCAGGCGTTGCTCGCTAATATGCAAATTTATGTGGTTGGGCAGGCGCCGCAAAATGCCGCCGAAGCAGTATTTGCTTTGGAGACGAAAAACGGGCGCGCGATTGTTGCCGCAGTGCCTTTTTTGCGTGAACGCGATATTCGTTGCACGCAGGTGGGCGAATCGTTGGCAGATAAAGCCTCAGCAATTGCTGACGGCGTGCGTCAGTATTATCAACAAGCTGCGGAACAGGCGCAAAAATTGCGGCGTGGTTCGGAACCGTTAATTGCGTTAGGACATTTATTTGCTGCCGGCGGTAAAACGACCGATCACGACGGCGTGCGCGATTTATTCGTGGGCAAATTGGGGCATTTGAGCGCAACGATTTTTCCTCAATGTTTTGATTATGTTGCGCTGGGACATTTACATTTGCCGCAAATGGTGGCGCAAAATCCGCGCATTCGTTATAGCGGCGCGCCGTTAATGATGGGCTTTGGTGAAAGCGTTTCAGAAAAACACATTGTGCAGCTGCATTTTCACGAGCGGCAGCCGATCATTGATGCCATTACGGTTCCGGTGTGGCAAAAATTGCGGCAATTACGAGGACGGCAAGAAACTGTTATCGCTGAAATTGAAGCGCTTTTAAATGATGAGGCGGCGGTGTGGTTGGAAATTATCATTGAAGAGGGCGTTTTTGATAGTGCTTTAAATGCGCGGTTGCAGGATTTAGTTGCCAATACGTCGGTAAAAATTTTGCGCATTCAACAGCCTTCCACGCAAGATTTTGTAGCCGCGTTAAGCGATACGCGGCAATTGTATGAACCGCAGGAAGTTTTTGCGCAACGTTTGGAGAAAACGCCGTTTAATGATGTGGAAAAAGCGCAGTTAACGGCAGATTTTTTAACCATTTTGCAGCAGGTGCACGATGCGCATTGAATCGGTACGTTTTCATAATTTAAACGCGTTAAAAGGCGCGTGGCAGATTGATTTTACCGCAATGGCAGATGATATTTTTGCCATCACCGGCGCCACCGGCGCGGGAAAATCCACGATTTTAGATGCAATTTGTTTGGCTTTATACGGGCAAACGCCGCGATTGGGCAAAATCACGACAAAAGATAATCAAATTATGAACCGCAGCCGCGCGGAATGCGGCGCAGAAGTAGTATTTTTTATTGCGGGGAAACGTTATCGTGTTTTTTGGCGGCAAAATCGCGCATACGGGCGGCGTGATGGCAAATTGCAGGAGCCGCAGCATTACATCAGTGATGCCGAAACTGGCGTCATTATCGAAGAAAAATTAAGCAGAACCGCGGCGAAAGTCGCCGAAATCACGCATTTAGATTTTGCCCGTTTTACGCGCACGGTATTATTAGCGCAGGGGCAATTTGCCACATTTTTGCAAGCAAAAGGCGAGG
AACGCGCGCCGATTTTAGAACAAATTACCGGCACGGAAATTTATAGCGAGATTTCAATCGCCGTACAAAAACGTTTTGCGCGAGAACATGATGATTTTTCCGTATTAGCAGACAAAATGAGCCGGATTTCGTTGGTCAGTGCGGAAGAAATTGCTCGTTGGCAAGCGGAAGAGCAGCAGTTGATGGATAGCATTTGCGCGCAGCAAAAAGCCGCTGATGATTGGCGTAATGCCGTGCACATTCATCAACAATTGCATACTTTGCAGCAAGAAGCGCACGCCACGCAAACGCGTTTAGCAGAAATTGATAAAAAAATGAATGCGTTATTGCCAGAACAGGCGCGTTTTGCGCAGTATCAAAAAGTTTCCGCCGGCGCGGCTTTATGGGACGCTTTTATGCGCACGCAGACGCGTTGTGCCGAGCAGCAGCAATTGGTGAATAACAGCGCTGCGTTGGCAAAAAAGGCGGCGGAACAGGTGAAAAACGCACAAATCGCGCGCGCGCAAGCGGAAGCCGTTTGGCAAAACGCGCAGCAAAAAGAAGAAGCCGCGCAGCCGTTGTTAAATGAAGCAAGACAATTGCAAGCGCAAATCGCGGCGCTACGCATTTATGAACGGGATTTTCCGAAAACGTTGCCGGAAATGGTGTTGCGCGTTGAAGTAGCAGAAGCGGCGTTAAATGCGGCGCTGGCGCAAAAAAGTAAACAAGAATATTGGTCGCGGCAAAATTGTTTATTGCAACGGCAGCACGTGTTGCAGGATTTTCAGCAAAAATATCAGCAAATTACGGCGTTGAATAAAACGCAGCAAGACGTTCGGCAGCAATGGCAAGAAACGGCGCAGCGTTTGCAGGCGGTGCAGGCGGATTTAGCTCAAAAAACAATGCATTGGCAACAAAATAAAGCGCGCGCGGAAGATAAACGGCAAATTTATGATTTATCGTTGCAAATCAAAAGTTTGAATGAACAGCGGCAACTTTTGCGTACCGGTGAACCGTGTCCTTTATGTGGTGCGCGGGAACACCCGTTTGCCGTGTCAATGCCAGATATGACGACGGCGGAGCAATTGTGGCGCGCGGCGCAGCAAGAAGCAGAAGCGGCAGATGAGGCGCGTCAGCAATTGGCGCAACTGGCATTACAATTAATGGAGCGGTGTGCACAAATTAAGGAGCAATCTCTCATAATTGAGCAGCAAATTTCGGCGTTACGGCAGGAACTCGTTATTTTTGAACGCGATTTAAGTTTCCGTGCGGCAGATATTGGGCGAGAGATGGAGGCGGTAAAAACGGATTTGGCGCAAGTTGTGCAGCGGTTAACGTTGATTGATGAGGCAGAAAAAAATTTGGCAGCGGCGCGGGCGCAGGCGTTTTTTGAGAAAAAGGCAAAATTTGCGCAGTTGTTAGGCGGTCAAACGATTGAAGATTATCAGGCGGCGCAAAAACGTTTGCTCGCGGAATGTGAACGCGGTTATCAGGCGGCGCAGCAGGCGGAATTGGCGCAAATTAAAGATGAGTTGGCGCAGCGGGCGGCGTATCAACGCGATGCGGCGTTATTGATGCAAGCGCAGCAAGAACAAACCGCGGCGGAGCAGGAATTTATGCAGTTTTTAGCGGCAAATGGTTTTGAACATCGCGCGGCATTTGAGGCGTTGCAGGCAGCCGCGCGGGATTTTGAACGCGTTGAGCAACAAATGGCGCAGTTAACTGAAGAAAAAAAGCAACAACAATGGCAGTTGGAAACGCTGTGCGATCGGGTAAACAAGGCACAGCAAGGTTTACCGGATACGATGAAAGACGCTGCCGTATGTCAAACGCATTTGCAAACGATAACGTCAGCTTTAGCTAAAGCGCAGGAAGATTTGGGCGCGGTGCGCGCGCGCCGCCGTCAGCAGGAAAACGAGCAGGCGGAAAATGCGGCTTTGAAAGAACGTTATCAACGGCAGGACGCCGTGGTGCAGCGCTGGCAGCGTTTATATGATTTGATCGGTTCATCAGATGGGAAA
AAGTTTCGCAATTTTGCGCAAAGTCTCACTTTTGAAGCGATGTTGCTCGAAGCGAATAAGGTTTTAGCGAAAATGTCGGACAGATATCAATTGCAAGCCGAAATGGAGCCGCCGCTTAGTTTGGTGGTGGTGGATTTATGGCAGGGCGGCGAGGTGCGCAGCAGTAGAAATTTATCCGGCGGTGAGAGTTTTATTGTGTCATTGGCGTTGGCGTTGGGACTTGCGGCGCTTTCGGGAAGAGAGGCGCGCGTGGATAGTTTATTTTTAGATGAGGGATTTGCGACTTTGGACGCGCAGGCGTTAGACGTAGCGCTCGATACTTTGGCGAGCATTCAAATGAGCGGCAAAATGATCGGGATTATTTCTCATTTGCCGTTATTAAAAGAACGCGTGAGTACGCATATTCAAGTGATTGCGGCGGGAAATGGAATCAGCCAATTGCGCGGCGCCGGCGTGGGAAAAGTTGAGGAGTAA
Protein sequences of DBSCAN-SWA_1 >NC_009446|396937:407337|398568_399402_+|WP_012030704.1|DBSCAN-SWA MKLIDFDVDNDRPFFLIAGPCVIESEYLALHTAEVLKTITDELKIPFIYKSSFDKANRSSGASFRGLGLAEGLRILAKVKETFAVPILTDVHEYTPLDEVASVVDVLQTPAFLCRQTDFITRVCAQGKPVNIKKGQFLAPWDMKHVVAKAKATGNEQIMLCERGASFGYNNLVADMRSLAVMKTFSAPVVFDATHSVQLPGGNGASSGGQREFVPVLARAAIAVGVAGIFMETHPEPEKALSDGANSIPLAQMRHLLQQLRALDTLVKTNELMKELS >NC_009446|396937:407337|403190_404381_+|WP_012030708.1|DBSCAN-SWA MKILHSADWHLGAKLHGQSRESEQQAFLDWFLETLARVQPDILLLAGDIFDTATPPVSAQRQYYHFLYQAAQQCDAIVIIAGNHDSAAFLDAPQALLANMQIYVVGQAPQNAAEAVFALETKNGRAIVAAVPFLRERDIRCTQVGESLADKASAIADGVRQYYQQAAEQAQKLRRGSEPLIALGHLFAAGGKTTDHDGVRDLFVGKLGHLSATIFPQCFDYVALGHLHLPQMVAQNPRIRYSGAPLMMGFGESVSEKHIVQLHFHERQPIIDAITVPVWQKLRQLRGRQETVIAEIEALLNDEAAVWLEIIIEEGVFDSALNARLQDLVANTSVKILRIQQPSTQDFVAALSDTRQLYEPQEVFAQRLEKTPFNDVEKAQLTADFLTILQQVHDAH >NC_009446|396937:407337|400748_401231_+|WP_012030706.1|DBSCAN-SWA MRQRSFYLFLIIIAALLALLNVYLWQLQDDKKSKIRELSEQVSYFNTQNERLKARNKTLDLDLQTLQSPDSFYTYEEKAREEYGMIGQDETFFVLPQEELNALPDLAALQEYDREGLAPIYAVSPQTPQPSPSEPEPVSAPIEMPKIDALPLELESLEGN >NC_009446|396937:407337|404370_407337_+|WP_012030709.1|DBSCAN-SWA 
MRIESVRFHNLNALKGAWQIDFTAMADDIFAITGATGAGKSTILDAICLALYGQTPRLGKITTKDNQIMNRSRAECGAEVVFFIAGKRYRVFWRQNRAYGRRDGKLQEPQHYISDAETGVIIEEKLSRTAAKVAEITHLDFARFTRTVLLAQGQFATFLQAKGEERAPILEQITGTEIYSEISIAVQKRFAREHDDFSVLADKMSRISLVSAEEIARWQAEEQQLMDSICAQQKAADDWRNAVHIHQQLHTLQQEAHATQTRLAEIDKKMNALLPEQARFAQYQKVSAGAALWDAFMRTQTRCAEQQQLVNNSAALAKKAAEQVKNAQIARAQAEAVWQNAQQKEEAAQPLLNEARQLQAQIAALRIYERDFPKTLPEMVLRVEVAEAALNAALAQKSKQEYWSRQNCLLQRQHVLQDFQQKYQQITALNKTQQDVRQQWQETAQRLQAVQADLAQKTMHWQQNKARAEDKRQIYDLSLQIKSLNEQRQLLRTGEPCPLCGAREHPFAVSMPDMTTAEQLWRAAQQEAEAADEARQQLAQLALQLMERCAQIKEQSLIIEQQISALRQELVIFERDLSFRAADIGREMEAVKTDLAQVVQRLTLIDEAEKNLAAARAQAFFEKKAKFAQLLGGQTIEDYQAAQKRLLAECERGYQAAQQAELAQIKDELAQRAAYQRDAALLMQAQQEQTAAEQEFMQFLAANGFEHRAAFEALQAAARDFERVEQQMAQLTEEKKQQQWQLETLCDRVNKAQQGLPDTMKDAAVCQTHLQTITSALAKAQEDLGAVRARRRQQENEQAENAALKERYQRQDAVVQRWQRLYDLIGSSDGKKFRNFAQSLTFEAMLLEANKVLAKMSDRYQLQAEMEPPLSLVVVDLWQGGEVRSSRNLSGGESFIVSLALALGLAALSGREARVDSLFLDEGFATLDAQALDVALDTLASIQMSGKMIGIISHLPLLKERVSTHIQVIAAGNGISQLRGAGVGKVEE >NC_009446|396937:407337|399398_400691_+|WP_012030705.1|DBSCAN-SWA MTITISRIHALEILDSRGNPTLQVGVTLSDGSFAQAGVPSGASTGSREALELRDGDAQRYLGKGVQKALANVKNHFAPALIGKPIYDLAALDRIMLAQDGTDFKTHLGANAILGVSLALARAKGASLKKPLYAVLCPQEEYTLPVPMMNIINGGEHADNSVDIQEFMIVPAGFDRFSEALRAGSEIFHTLKKVLKEQGLNTAVGDEGGFAPDLPSNEAAFAVIMQAIERAGYRAGEQIFLAMDAAASEFYREGRYHLASEQKAYTSAEFVDYLADLCRRYPIVSIEDGLHESDWDGWQLLTQSLGERVQLVGDDLFVTNSAILQEGIDKGVANAILIKPNQIGSLSETLQTIALADAAHYAAIISHRSGETEDTTIADIAVATTATQIKTGSLCRSDRVAKYNRLLTIEDELGTRARYAAKAAFLGKIKA >NC_009446|396937:407337|401330_403190_+|WP_012030707.1|DBSCAN-SWA 
MLRFFERLTEAYPEQLNAAPAKNLLRFCLQYAQGFKKYILLLGVISAAQAVFEVSLFSFLGTIVDWLSRHQRADFLNQEALTLFAMAAVVLLILPLLHLMNTLFRYQALMGNFPMSIRWRMHRHLLGQSLCFYQDEFAGRVATKVMQTALSTRELIIKLMDVVVYVIVYFASMFYLLGRLNTHFLIPIFCWFALYIALQIYFIPRLKAIAQAQAHARSTMTGRVVDAYTHISTIKLFAHTRREENYARAAMDDFMQTVYTQFRLVTRLQVSMNSINLLLIFAVTALGISLWIKGAATAGAVAIASALALRLNVMSHWIMWEISSLFEHLGTVVDGMTMMAVPPAITDQRGARDLVVSRGEITFKDVCFSYQNAVTQKAPVVIDHLNLTIRSGEKIGLVGRSGAGKSTLVNLLLRFYDVSGGEILIDGQNIAAVTQESLRSHIAMVTQDTSLLHRSVRENVLYGKPDADEAALQRALEQSEALEFIAQLSDPQGHCGLDAQVGERGVKLSGGQRQRIAIARVLLKNAPILILDEATSALDSEVEAAIQSSLLTLMTGKTVIAIAHRLSTIAQMDRLIVLDKGRIVEEGTHQELLAQKGLYARLWSRQTGGYLGDIEEFDD >NC_009446|396937:407337|396937_398572_+|WP_012030703.1|DBSCAN-SWA MTQYIFITGGVVSSLGKGISAASLGAILEARGLRITMIKLDPYINVDPGTMSPFQHGEVFVTHDGAETDLDLGHYERFVNMRATQFHNCTAGRVYLEVIQRERKGDYLGKTVQVIPHITDTIQQYILRAAQGADIAMVEIGGTVGDIESLPFMEAIRQLGTKLGRRDTMFIHLTLVPFISAAGELKTKPTQHSVKELRSIGIQPDMLICRADREIPEDEKAKISLFTNVPAKAVISGLSVKNIYEIPMLYQKQGVDDLVVKHFGLHVKEANLGDWEKVVDAINHPEHEVTVAMVGKYVNLTDAYKSLNEALYHAGIKNKTKVNIRYVDSEDLYSQGTELLAGVDAIVVPGGFGNRGVNGKLRAVQYAREENIPYLGICLGLQIAVVEFARNVLGIKNACSTEWDEKTDEPIIALVAQWVNERGEVEQRDKKMDLGGTLRLGALPAKLKAGSKIATIYGSEIMNERHRHRYEVNANYEKRLEAAGMRISGRSADNDLVEVIEIPSHRWFIGLQSHPEFTSTPLGGHPLFSAFIAAALDYQTERLS |
7 | Bacillus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
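The keyword legend above can be illustrated with a minimal sketch. It assumes the match is a case-insensitive substring search over a protein's annotation text. The page does not state how the matching is actually done, and the names `PHAGE_KEYWORDS`, `matched_keywords`, and `is_phage_related` are hypothetical.

```python
# Keywords copied from the legend; the matching rule below is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def matched_keywords(annotation: str) -> list[str]:
    """Return the legend keywords found in an annotation, in legend order.

    Assumed rule: case-insensitive substring match.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


def is_phage_related(annotation: str) -> bool:
    """True if the annotation contains at least one legend keyword."""
    return bool(matched_keywords(annotation))
```

Plain substring matching is deliberately naive: 'lysin' would also match 'lysine', so a stricter word-boundary rule may be preferable in practice.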
DBSCAN-SWA_2 |
507668 : 513752
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NC_009446|507668:513752|DBSCAN-SWA CTTACTTTAATTCCGTTAAAAATTCGCTGACTTCTAATTGATAACCGGATAATCCTGTAAATTTATAAGGCGTGGATAATAAACGCATGCGGCGCACGCCTTGATCGATGAGAATTTGCGCGCCAATGCCATAAGTTCGCAAATCATCTTCCTGCTGCGGTTTGGGCGGTGGATTTTGGTGCGCAAATGCATGCATACGATTGATAATGTGGTTTTGTTTTTCTCCCTCATCTAAAACCACCACAATGCCTTCCTCTTCTTGCGCCAAACGCGATAATGCATCATCCAAAGTAAAATTACGTTCGGAAAGCGGCGCACCAAATAAATCGCTAAACGGATTTTGCACGTGCACGCGCACAAAAGTAGCCTTTTGTGGCGAAATTTTTCCTTTCACCAAAGCATAATGCAAAGTTTTATCCAAAATAGATTGGTACGCGGTGAGCGTAAACGTGCCGTAACGCGTAGGCAATTCACAGGTTGAAATTTGCACCACGATTTGCTCGCGTGCCAAACGATAAGCGATTAAATCCGCGATTGTGCCGATTTTTAAACCATGTTCTTCAGCAATCGCTTCTAAATCCGGACGACGCGCCATCGTGCCATCATCTTTTAAAATTTCAACAATCACCGCCGCCGGTTCAAAACCCGCCAAACGCGCAAAATCACAGCCTGCTTCCGTGTGTCCCGCGCGCACCAAAACGCCGCCTTCGCGCGCCACAATAGGAAAAACGTGCCCCGGTTGTACCAAATCTTCGGGAAGCGCTTCTTTAGCAACGGCTTTTTTAATTGTAGTGGCGCGATCGTACGCCGAAATGCCAGTGGTGACGCCTTCAGCCGCTTCAATGGAAACGGTGAAATTTGTTGCAAATTGAGAACCGTTATTCATCGCCTGCGGCCATAATTTTAAGCGGTCATTGCGCGCTTGATTAATCGTTAAACAAATCAAACCGCGTCCATATTTCGTCATGAAATTAATTGCTTCCGGCGTAATATGTTCTGCCAAAATCATCAAATCGCCTTCATTTTCGCGATCTTCGTCGTCCATAATAATGACCATTTTTCCGGCGCGTAAATCGGCAATAATTTCTTCAGTACTATGAATCGTCATCGATTATTCCTTTTAAAATCCCAATGATTGCAGGCGTTCGAGCGTCAGCGTTCCGGTTTCAGAATCCGTACTTTTGAGCAAACGCTCCAAATAACGCGCGAGCAAATCCACTTCAATATGAACCGCTTGCCCGATGTGATAATCATCGATAATCGTGTTTTTTTGCGTGTGCGGCACAATCATCACGCTAAACTGATCGCCTTGGATATCGTTGATGGTCAAACTCACGCCGTCAACCGTAATGGAACCTTTACGGGCAATATATTTGGTCAGCGCCGTGGGAACGGAAAACTGTAAACGTTGGCTGCGTCCTTCGGTAACGCGTTCTTTTAATACGGCAATGCCATCTACGTGACCGCTGACTAAATGTCCGCCCAATTGAGTGTTGAGCGTTGCTGCGGGCTCTAAATTAACGCGCGTATCGACGCGCCAACGCGCAATCGTGGTTAAACGCAACGTTTCTGCGGAAACGTCCACCCGAAATCCTTTTGAAATAAAATCGATCACGGTTAAACAAACGCCGTTAACTGCAATGCTATCGCCGAGATGACTATGACTTAAATCAAATTGTTCAGCGCAAATCTCTAAAGAAAGATCATTGCCGTGCGCCGCGATGGCAGTAATCGTGCCGACGGATTGGATAATTCCCGTAAACATATGTTTTTCCTGATTTAAGGGTTAACGAGCAAACTCAATCGCCAATCGTCGCCGATGAGATGCGCGTCAGCAATTGAGAAGCGCATTCTATCAGATAAGTGCGCAAGCGGCGGCAGCGCAACCAGTGGACGCGCCGAATCACCCAAAAAAGTTGGCGCTAAA
TACAATAAAATTTCATCAACTAATTGCTCGGTTAAAAAAGCGCCGGCAAGTTGCATGCCTGCTTCCACAAAAAGATGATTGATTTGCCGACGCCCCAATTCATCAAGCAATGCAGAAAGCGGCACTTTTCCCTGTTCGGTTGCCGGCAGCGATAAATATTCCACCATTTGGGGGAAAGATTGCGGCGGTGGTTGAGTTGTCACCATTAAAATAGGCGACGGATCGCAAAACAACGCAGCATTTGCCGGTGTTTTTAATTGACTATCGATCACCACGCGCAGCGGCGGATTTGCCGGCAGATCCGTTGGAAAACGCGCCGTTAAACGCGCATTATCGCCAATAACGCTCCCCGTTCCCGCGAGCACCGCATCAGCGGCAAGACGATGAAAATGTACATCTTTTCTTGCTAATTCGCCCGTAATCCACTGACTTTCGCCGTTTGCCAGCGCCGTGCGTCCGTCCATACTGGCAGCAATTTTTAAGGTAACCCAAGGTCGTCCCCAACGCATGCGGTGAAAAAAAGGGCGATTTAATTGCAATGCTTCTTGCGCACAAATACCTTGTTCAACGCTTATTCCTGCCGCTCGTAATGCCGCAATTCCTTTGCCAGCAACCAAAGGATTGGGATCGGAACAGGCAATAACTACGCGTTTGACGCCGGCGGCAATTAAAGCGTGGGTGCACGGCGGCGTGCGTCCAAAATGAGCACAAGGTTCTAAAGTAACGTAAACGGTGGCATCGCGCGCTTGCGTTCCCGCAGCATTTAACGCATTAATTTCCGCGTGCGCTTCGCCGGCGCGTTGATGCCAGCCTTCGCCGATAATCGTATGATTTTTAACGATAACACAGCCAACGCGCGGATTGGGTGCCGCAGAAAAAATACTGTTACGCGCCAGTTCCAAAGCGCGGTGCATATAAAAGAGATCGGAATCAGGAGGATTGTTCGTCGGCGAGTTCATCGATGGTTTTTTGAAAAGCCGCTACATCTTCAAAGCTCAAATAAACCGAGGCAAAGCGAATGTAAGCGACGTGATCTAAATCTTTTAATCCGTCTAAAACCCAGCGCCCAATGCGCTTTGAGGGAATTTCGCGCGCGCCGCTGTCGCGCAGCCGCGCTTCAATTTGATTGATCAGTTGCGTGATTTCTTGATCGGAAACCGGACGTTTTTCAACGGCGCGCAGCAATCCGTGGCGCAATTTATCGGCATTGAAAGTTTCGCGGGTGCCGTCATTTTTAATCACGGTTGGCGTGGAATCTTCAGCGATTTCATACGTTGTAAAACGAGCGCGGCAGTGTTCATTCAAACATTCGCGCCGACGGCGGATTTTACTACCGTCGGCGATCAAGCGAGAATCAATAACTTTCGTCTCTTTTGCATGGCAAAATGGACAAAACATCGTCGTTTCTCTTAATTATTGACCATAAACTGGAAATTCAGCGCACAATTCCGCTACTTTATTGCGCACGTTAAGAATGACTTCTTCATTGTCAATGTCATCTAAAACGTCGCAAATCCAGTTGGCAACTTTTTTCACTTCATTAACGCCAAAACCGCGTGTGGTAATTGCCGGTGTACCAATGCGGATTCCGCTGGTGACGAAAGGCGATTGCGGATCGTTGGGCACGGAGTTTTTGTTTACCGTAATATGAGCGCGAGATAATGCATCGTTTGCCGCTTTACCGGTTAATCCTTTATTGATCAAACTCACCAGCATTAAGTGGTTTTGCGTGCCGCCGGAGACGATATCGTAACCGCGCGCGATAAATACTTTTGCCATGGTTTTCGCGTTTTCCACAACTTGTTCTTGATATTTTTGGAAAGACGGTTCCAGCGCTTCTTTAAACGCAACGGCTTTTGCCGCAATCACATGCATTAACGGGCCACCTTGAGATCCTGGGAAAATGGCAGAATTTAATTTCTTTTCAATGGTGGGATTGGCTTTGGCTAAAATGATGCCGCCGCGCGGGCCGCGCAATGTTTTATGCGTGGTTGAAGTGA
CCACATCAGCGAAAGGAACAGGATTGGGATATAAACCTGCTGCTACTAATCCAGCAACGTGCGCCATATCGACCATTAAATAAGCGCCAACGCTGTCGGCAATTTCGCGGAACCGTTGGAAATCAAGAACTTGTGAATAAGCAGAAAAGCCGGCAATAATCATTTTAGGACGATGTTTTTCGGCTAATTGCGCCACAGCGTCATAATCAATTAAGCCTTTATCATCAATGCCGTAGCCGATGGAATTATAAGTTTTTCCAGAAAAGCTGACTTTTGAACCGTGGGTTAAATGTCCGCCGTGATCCAAATCCATACCTAAAACGGTATCACCTGGTTCTAATAAAGCTAAAAATACGGCAGCATTGGCTTGAGAACCGCTGTGGGGTTGTACGTTGGCATAATCAGCAGCAAATAACATTTTAACGCGCGCAATTGCTAATTCTTCAACGATATCAACGTATTCACAGCCACCATAATAACGTTTCCGCGGATAACCTTCGGCATATTTATTGGTTAAACAACTGCCTTGTGCTTCCATGACGCGCGGACTGCAATAGTTTTCAGAAGCGATCAATTCAATATGATCTTCCTGACGAGTGGCTTCATCAGCGATTGCTTGGGCTAATTCATTATCAAAATCAGCGATGTTCATTGCTTTTGTAAACATTATGCTCCTCCTATATTAGCGAAAGTCGCCATTTTAAAACCTGTAGATATAATAAACAAGGCTGCACAAACGCGATCAATATGTCGTTATTTTTCCTCATCTTGGCAGCTTTCAATAAAATCTATCACGCACTTAATCAGTGCGCCGTATAATACCGCGCCGATGACTGGGAGAGTTAATGTCTGAACAAATTTTGCTTTTTGGGGCGCCGCTGGAAACCTTGCCCCAAGATTTATACATTCCGCCCGATGCGTTGCGCGTTTTATTAGAAATTTTTGAAGGTCCGTTAGATTTATTGTTATACCTTGTTCGCAAAGCCAAAATGGATATTTTAGCAATTCCGATCAGCGAAATTACCGAGCAATATTTAGTCTACATTCGCATGATGCAAACACTTGATATTACTTTAGCCAGCGAATATTTACTTATGGCGGCAACGCTTTCGGAGATGAAATCGCGTCTTTTGTTGCCCGTTTTAGAGCGAGAAGATGAGCCGGAAGAAGAGGATATGGCTTTTGATTTGAGCGAACGGTTACTGGCTTATGCGCAACTCGTTGATGCGGCAGAAAAATTATCAGAATTACCGCGCATTGATTCGGGAATTGCCGTATCTGCTTACGAATATATTCCCGAGCAGCCGCGCCAACGTCCACGCGGCGATTTAACTTTATTATTGCAAGCCGCGGCGCAATTACGTTATCGGCAAAAAGTGCAACAAGCTCATGAAATTTCGCGCGAGCAGTTACCGCTGGCGCCGCGTATCGCGCTGATGCAATCGCGTTTAGAACAGGAAAACGGTTGGCACGATTTGCGCCAGTTTTACCAAAATGAAGAAGGAACGGGCGGTTTAGTGGTGAGTTTATTGGCGCTTTTGGAATTGGATCGCCGGCAATTTTTGGAATGGCAACAAGAATGCGCTTTTGCAGCCGTACAATTACGGAAACGCGCATGAATACATTAACTAATCGCATTGAAGCTTTATTATTTACCCGTAATTCCGGTTTATCGGTTAACGATATCGCTCAAGCATTAAATGCGGATAAAACGGCAGTCGTGGCGGCGTTAAATGAACTGATGAAACGCTATGAATCTTCTGCTATTTCCGTGGTAGAAATTGCCAGCGGTTGGCGTTTACAAGTTCGACCGGATTATTTTTCCGATATTTGCGCGCTCAATCACATGCAACCGGTAACGTATTCGCGCGCTTTTTGGGAAACTTTGGCTTACATTGTTTATCATCAGCCGGTGACGCGCGCCGAAATTGACCGCGTGCGCGGCGTAACCACGAATACGCGGATTTATCAGCAACTTTTTGATTTGGGGTGGATTGTGG
TGGCAGGACAAAAAGAAGTGGTGGGGCGTCCGGATTTATTAGCTACCACCCGCGCATTTTTAGATGATTTTGGTGTGCAAACGTTGGCAGAACTGCCACCGTTGAGCGCGATTGAACAATTTATCGTCGCAGGAGACGACAATGAACAAAAAAAATGA
Protein sequences of DBSCAN-SWA_2 >NC_009446|507668:513752|512439_513213_+|WP_012030813.1|DBSCAN-SWA MSEQILLFGAPLETLPQDLYIPPDALRVLLEIFEGPLDLLLYLVRKAKMDILAIPISEITEQYLVYIRMMQTLDITLASEYLLMAATLSEMKSRLLLPVLEREDEPEEEDMAFDLSERLLAYAQLVDAAEKLSELPRIDSGIAVSAYEYIPEQPRQRPRGDLTLLLQAAAQLRYRQKVQQAHEISREQLPLAPRIALMQSRLEQENGWHDLRQFYQNEEGTGGLVVSLLALLELDRRQFLEWQQECAFAAVQLRKRA >NC_009446|507668:513752|513209_513752_+|WP_012030814.1|DBSCAN-SWA MNTLTNRIEALLFTRNSGLSVNDIAQALNADKTAVVAALNELMKRYESSAISVVEIASGWRLQVRPDYFSDICALNHMQPVTYSRAFWETLAYIVYHQPVTRAEIDRVRGVTTNTRIYQQLFDLGWIVVAGQKEVVGRPDLLATTRAFLDDFGVQTLAELPPLSAIEQFIVAGDDNEQKK >NC_009446|507668:513752|507668_508775_-|WP_012030808.1|DBSCAN-SWA MTIHSTEEIIADLRAGKMVIIMDDEDRENEGDLMILAEHITPEAINFMTKYGRGLICLTINQARNDRLKLWPQAMNNGSQFATNFTVSIEAAEGVTTGISAYDRATTIKKAVAKEALPEDLVQPGHVFPIVAREGGVLVRAGHTEAGCDFARLAGFEPAAVIVEILKDDGTMARRPDLEAIAEEHGLKIGTIADLIAYRLAREQIVVQISTCELPTRYGTFTLTAYQSILDKTLHYALVKGKISPQKATFVRVHVQNPFSDLFGAPLSERNFTLDDALSRLAQEEEGIVVVLDEGEKQNHIINRMHAFAHQNPPPKPQQEDDLRTYGIGAQILIDQGVRRMRLLSTPYKFTGLSGYQLEVSEFLTELK >NC_009446|507668:513752|510524_510992_-|WP_012030811.1|DBSCAN-SWA MFCPFCHAKETKVIDSRLIADGSKIRRRRECLNEHCRARFTTYEIAEDSTPTVIKNDGTRETFNADKLRHGLLRAVEKRPVSDQEITQLINQIEARLRDSGAREIPSKRIGRWVLDGLKDLDHVAYIRFASVYLSFEDVAAFQKTIDELADEQSS >NC_009446|507668:513752|511007_512261_-|WP_012030812.1|DBSCAN-SWA MFTKAMNIADFDNELAQAIADEATRQEDHIELIASENYCSPRVMEAQGSCLTNKYAEGYPRKRYYGGCEYVDIVEELAIARVKMLFAADYANVQPHSGSQANAAVFLALLEPGDTVLGMDLDHGGHLTHGSKVSFSGKTYNSIGYGIDDKGLIDYDAVAQLAEKHRPKMIIAGFSAYSQVLDFQRFREIADSVGAYLMVDMAHVAGLVAAGLYPNPVPFADVVTSTTHKTLRGPRGGIILAKANPTIEKKLNSAIFPGSQGGPLMHVIAAKAVAFKEALEPSFQKYQEQVVENAKTMAKVFIARGYDIVSGGTQNHLMLVSLINKGLTGKAANDALSRAHITVNKNSVPNDPQSPFVTSGIRIGTPAITTRGFGVNEVKKVANWICDVLDDIDNEEVILNVRNKVAELCAEFPVYGQ >NC_009446|507668:513752|508787_509432_-|WP_012030809.1|DBSCAN-SWA 
MFTGIIQSVGTITAIAAHGNDLSLEICAEQFDLSHSHLGDSIAVNGVCLTVIDFISKGFRVDVSAETLRLTTIARWRVDTRVNLEPAATLNTQLGGHLVSGHVDGIAVLKERVTEGRSQRLQFSVPTALTKYIARKGSITVDGVSLTINDIQGDQFSVMIVPHTQKNTIIDDYHIGQAVHIEVDLLARYLERLLKSTDSETGTLTLERLQSLGF >NC_009446|507668:513752|509446_510508_-|WP_041729443.1|DBSCAN-SWA MHRALELARNSIFSAAPNPRVGCVIVKNHTIIGEGWHQRAGEAHAEINALNAAGTQARDATVYVTLEPCAHFGRTPPCTHALIAAGVKRVVIACSDPNPLVAGKGIAALRAAGISVEQGICAQEALQLNRPFFHRMRWGRPWVTLKIAASMDGRTALANGESQWITGELARKDVHFHRLAADAVLAGTGSVIGDNARLTARFPTDLPANPPLRVVIDSQLKTPANAALFCDPSPILMVTTQPPPQSFPQMVEYLSLPATEQGKVPLSALLDELGRRQINHLFVEAGMQLAGAFLTEQLVDEILLYLAPTFLGDSARPLVALPPLAHLSDRMRFSIADAHLIGDDWRLSLLVNP |
7 | Staphylococcus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
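The protein records listed for each region carry headers such as `>NC_009446|507668:513752|512439_513213_+|WP_012030813.1|DBSCAN-SWA`. A minimal parsing sketch follows, assuming the five pipe-separated fields are contig ID, region coordinates, gene `start_end_strand`, protein accession, and prediction tool. That layout is inferred from the records shown here, not from a documented format, and the names `ProteinHeader` and `parse_protein_header` are hypothetical.

```python
from typing import NamedTuple


class ProteinHeader(NamedTuple):
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    tool: str


def parse_protein_header(header: str) -> ProteinHeader:
    """Split a header line into typed fields (field meanings are assumed)."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) != 5:
        raise ValueError(f"expected 5 '|'-separated fields, got {len(fields)}")
    contig, region, gene, accession, tool = fields
    # Region coordinates look like "507668:513752".
    region_start, region_end = (int(x) for x in region.split(":"))
    # Gene coordinates look like "512439_513213_+".
    gene_start, gene_end, strand = gene.rsplit("_", 2)
    return ProteinHeader(
        contig, region_start, region_end,
        int(gene_start), int(gene_end), strand, accession, tool,
    )
```

Parsing the coordinates into integers makes it straightforward to check that each gene lies within its reported region.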
DBSCAN-SWA_3 |
674078 : 683705
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NC_009446|674078:683705|DBSCAN-SWA CATGGTTGATGAGTACGTCATTTGGTTTGAAAATTTGCGCATGAGCGATGTGGAACGGGTCGGCGGCAAAAATGCATCTTTGGGAGAGATGATCAGTCAGTTAACGGATAGAGGCGTGCGCGTTCCCGGCGGCTTTGCCACGACTGCCGCCGCTTATCGCGCTTTTTTAGCGCATGACGGTTTGGATAATCGCATCGCCGCGGCGCTTAAAGATTTAAATGTTGATGATGTGGTTGCTTTGGCAGAAGTTGGGCAAAAAATTCGTCAATGGATTTTGGAAACGCCATTTCCGCAAGCGTTTGATGAGGCTTTAGCAAAGGCGTGGAAAAAAATGGTTGCCGATGCCGGATCAGATGAGATTTCGGTTGCCGTGCGTTCTTCTGCTACCGCTGAAGATTTACCGGACGCATCATTTGCCGGTCAACAAGAAACATATTTAAATATCAAAGGATTAGATAACGTTAAAGAAGCCATTCATCACGTTTTTGCTTCTTTATATAACGACCGCGCAATTTCTTACCGCGTTCACAAAGGTTTTGCCCACGATATGGTGGCGCTGTCTGCCGGCGTGCAACGTATGGTGCGCTCTGATACGGGCGCTTCTGGCGTAATGTTTTCCATCGATACCGAAAGCGGATTTAATCAAGTGGTTTTCATCACCGCGGCATACGGTTTAGGCGAAACCGTGGTGCAGGGCGCCGTTAACCCCGATGAATTTTATGTACACAAACCCACGTTAGCCGCCGGTCGTCCCGCGATTTTGCGCAAAACTTTGGGCTCTAAACGCATCAAAATGATTTTTAGCGATACCGCACAAGCGGGAAAATCCGTGCAAACAATTGATGTTTCTTTGGCAGAACGGCAACGTTTCGCGATTTCTGATAAAGAAATTACGGAACTCGCGCAATTTGCCGTGTTAATTGAACAACATTATGGTTGCCCGATGGATATTGAATGGGGACGCGATGGTTTTGACGGCAAACTTTATATTTTGCAAGCGCGCCCAGAAACGGTGAAATCGCAAGAACAGCATCAAAATACGCTGCGCCGTTATCAAATTACTGGCGAAAAACAAGCGTTATGCCGCGGACGTGCGATTGGGCAAAAAGTCGGTCAGGGGCGCGTGCGCAAAGTTAGCGATGCTTCTGAAATGGATAAAGTGCAAGCCGGCGATATTTTGGTTACCGATATGACCGATCCCGATTGGGAGCCCGTTATGAAGCGTGCGGCAGCCATCGTTACTAATCGCGGCGGCAGAACGTGCCACGCGGCAATTATTGCTCGTGAATTGGGCATTCCTGCAGTGGTGGGTTGTCATAATGCTTCTGAAGTGTTGCAAGAAGGGCAGGAAGTAACGGTTTCTTGCGCGGAAGGCGATACGGGTTTTGTATACCGCGGCAAATTAAACGTTGATATTCATGATTTTGCGCTCAACGATATGCCAGAACCGCCCGTTAAAATTATGATGAATGTGGGCAATCCGGAATTGGCGTTTTCTTTTGCTCATTTGCCAAATGAAGGCATTGGTTTGGCGCGTATGGAATTTATTATTAACCGTCAAATCGGTATGCACCCCAAATTATTATTGGCTTTTGATGAACAAGATGAAGAAGTCAAAGAAGAAATTGAAGACCGCATTGCGGGATACGATTCGCCGGTGGATTTTTATGTGCGTAAGATTGCAGAAGGCGTGGCAACAATTGCGGCATCGGTTTATCCGCGCAAAATCATCGTGCGACTTTCTGATTTTAAATCAAATGAATACGCAGGTTTGATCGGTGGTAAACAATACGAACCGCACGAAGATAATCCGATGTTGGGTTATCGCGGCGCAGCGCGTTATATTGCGCCGGATTTTGCGGATTGCTTTGCTTTGGAATGCCGCGCGCTGAAATACGTACGCGATGAAATGGGGCTCATTAATGTG
GAATTGATGATTCCTTTTGTGCGTACTTTAAAAGAAGCGGCTGCCGTTGTAGAAACCTTAAAAAACAATGGATTGGAACGCGGTAAAAATGGTTTGCGCTTGATTATGATGTGCGAAGTACCGAGTAACGCGATTTTGGCAGAACAATTTTTGGAATATTTTGATGGCTTTTCTATCGGCTCTAACGATATGACGCAATTAACTTTAGGCGTTGATAGAGATAGCGGCGGCGTTATTGCAGCAACTTTTGATGAACGCGATCCTGCGGTAAAAGCCATGTTGCATTTGGCAATTACGGCGTGTCGTAACCAAGGAAAATATATTGGTATTTGTGGACAAGGTCCTTCTGATCACCCTGATTTTGCGCGGTGGTTAGTTGAAGAAGGCGTTTCTTCAATTTCTTTAAATCCTGATACGGTGATTTCAACTTGGTTGTATTTGGCAAACGCCAAACAGTAAAAATAAAACCGGCGCCGGATTTTAAAACTGGCGCCGGTAATCATTTTTCTTAAATATTACACTTGCCAATTAATCGGCGTCTTTCCGTGCGCGATTAAATAATCATTGGCGCGCCGAAAATGATCGCAACCGATAAATGCACCGCCAGCGAGCGGCGACGGGTGCACCGCCGTTAAAATGCAATGTTTTTCCGCATCAATAAATTGCGCTTTCGCTTTGGCAAAATTACCCCATAACATAAAAACAATATGTTCCCGTTCATTTGATAACGCTTCAATAGCCGCGTGGGTAAATTCCTGCCAACCGATAGCGCGATGACTGCCGGCTTTTCCGGCTTCTACCGTCAGCGATGCGTTTAATAATAAAACGCCTTGTTTTGCCCACGCGCTTAAATCGCCGTGATCGGGTACCTGAAAATCCGGATAACTGCGTTTGAGTTCTTTATAAATATTGAGCAAAGAGGGCGGTGGGCGCACGCCTTTAGGCACCGAAAACGATAATCCCATCGCCTGACCGCGCCCGTGATATGGATCTTGCCCAATAATAACGGCTTTTACCGCATCAAATGGCGTTTGATCAAAAGCGTTAAAAATCAATTTATTAGGCGGATAAGTGACGATTCCTTGCGCTTTTGCCGCAAGTAACGTTTCTTTAATGCGCACAAAATAATCTTGGGAAAATTGTTCATACAACACTTTCTTCCAACTTTCCTCGATGCGCACATTTTCCGGACAAACTTCGATCATAATTACTCCTGAAGACGGCAGCCCGTCACCAGCACATTATCAGAATAGGGCGCGCCGATAACGGCATGACGTCCATCGCTAAAAAAAGCCAGCGTTGCATTTTGATAAATCACCGGACGCGTACTTTCTTGTTTTTGCAAATGATATTTTTTCCCTTGATAAGTGAGCGTCAGCTTTTTTTCTTGATGATCGCGCAGTTGCGTTTCAATTTTTTCACCACGATCACATTGCCAGCGATGCCGTCCTAAAGGAAGATGCACTTGCTCGGGCTCTTGTTCATCGATAATGGTTTGTTTTTTTTCTAATGAAGATTGGCAAGCGCATAAAATCACAGCCAATATCGTTAATAAACTGATTTTATTCAACATAATTACCACATTCGTATGTTTTTCTGTTGCCACGGCTGCACCGGCAAACCCAAACGGTACTCATAGACTTTTTCATATTCCAAAACTTGCTTAACGTAATCGCGGGTTTCATAAAAAGGAATTTGAGCAATCCATTCATCGAGCGGCAAATTGGGGCGTTCCTGCAACCATTTATTAACGCGCGCCGGCCCCGCATTATAAGCCGCGGCGGCATAAGCTAAATGCCCAAATTGGTTTAAACAATCGTACAAATACCAACTTCCCAAACGAATATTAACGGCAGGATCAATTAAACTGGCGCTGCCGCTATAAGGAATATTATGTTTTTTTGCCGTGTGATGAGCGGTTGCCGGCATGATTTGCATTAAACCGATGGCACCCGCCGCGGATTTGATTTCCGGTTGAAAAATACTTTCTTTGCGCATAA
TGGCAAAAATTTTTGCCGGCGAAATTGCCAATTGCTTTGCCATCTGCCGCACCAAATCCTGATGATGCAAAGCAAAACGCTGCTGCAAATAATCCCATTTTTTAGTTTTCGCCAACGTGCTCACGGCTTGCACCGACCAACCTAATTCATCAGCAAAAAGCGCCAATTGTTCGAGTTGTTCGGGCGTTAACTGCTTTTGCAAACTGTAAAATTCTTGCAACGCGCGCCGTTTTTCGCCTAATTGCCAAAACGTTTTTAAACGATAAGTTTCGGGGCGCCGCATAATTCTATGATAATCGATTGTTTTTACTAATGATTTATTATTAAAACGATAAGATTGCTTCATTTTTTCGGCAGCAAGAAAACCAAAAAAATCACGTTCCAACGCCGCGCGCCGATAATAATTATCAGCTTTTTCTTGATGTCCAGAACGTTCAAAACTTTTGGCAATCCAATACAAAATTTCCGGTTTTTTCAAAGTTTCCGCGTCCAAAGTATCGAGCAATAACGGCGCCAAACTCGATAATTTATGTGTGCGCAAATGATGCGCAATCACATCAAAAACGGTATTCATTTGTTGCCGATCGCGCGGAATAGCGTGATAAATAAACAATAATTCCGGTGCATCTTGTTGCGCGAGTTTTGCCGCTAAACGATTAAAACTCTCGGCTAACGCTTCATCTTCTTCCAGCGAAACATTTTGCCGCAGCGCTTCCAAAGCTAATTGCGTGGCGGCTGTTAAATCTTTTGACGCCAAACGCGTGATGCCGTCGGCTAAAATCGCGCTTTTCCACGTTTGCGGTAATAAAAATGCTTTTTCTATCGGCTCATTGCGGCGCCGAATAGACAACCATAATTGCGCGGCTTGTTGATCGCTGCCCGTCATCAAACGCGTTAAATATTGTGCCAGCGCAAGATTATTGGCGCGCAAAGTGCGTTTAAAGCGTTCTGCAATCAGTGCCGGCGGAATTTTGATGTTGGGGTGCGCAAAAACGGGATCGCAGGCGCTGTCGATATTGCCATCTTTTAACCATAATTCTTCAATATGTTTTTGTGCGGCGGCGGTTTTTCCCGTTGCCAGCAGCGCTTGTCGCCAAATGCATTCGCAGCGTTCGCTGGCAAAATGCGGGGAATATGCTGCCAAAATCGCGTCCGTTTGATTGTTTTTTAACCAAATCGGAAAAATTTTAACGGCTAATTGTGCGGCAAATGGGCTGCGGGCGTGTTGTTGCAAAAAAGTGACGATTTCCGCGCTGGCGGTGGTTTCGCCGTGTTGCACGTAAGCTTCATATTGCAAATAGGGATATAAAGGGTGCCCAAAAAATTGTTGGTAATCGCTCAACGGCGCGCCGGCACGCAACGCTGCCTCCGCTTCCAATAAACGATTGGCACGCGCGCTGGTGATGATAAGAAAAAAAGCCAATAAAATATTTTTAAACATGAACTTCTCGGCTGATTATTGAATATGCATTTTCCACTCGTTTTGATAATGCCCGCGATACGCTAACACGCGGTTAACGTAATTTTGCGTTTCAGGGTAGGGCGGAATCGTGTAATGATATTTTTTCACGGCTTCGGGGCCAGCATTGTATGCGGCGAGCGCTAACGCTTCATTACCATTAAAATAAGTCAGCATTTGCCGCAAATAACGAACGCCACCATCAATATTTAAAGCAGGATCAAATGGATCGTTAACGTTTAACGCACTGGCAGTTGTCGGCATTAATTGCATCAAACCAATAGCGCCTTTATTAGAAACCACTTCAGAACGATAAGCCGATTCCACCGTAATCACCGCGTGAACCAAACTGGCATCAACTTGATGTTTTTTGGCTAATTCATTGATTAAATTCTTCATTTCATTTTGACGTTTTTCAAATTCGCCGCTGGGTTTGCTCACCGTTTGCGCGCCGGTTTTGCCGCTGGGTTTTACGATACATTGATGACGAATGCGAATTAACACGCCGTTAATCTTTAATTCTGCTTTGCAGGCGGAAGAA
ACCGAAGTTTTCGTGGTCAACGGTCTGCCCAAACTGCAGGGAATATAATGCGTTTTACCATTTTTAACGTAACTGCCGCAGGAACGCGCGGCAAAACAATGAGTAGCAAACGCGTTTAACAGTAAAAAAATGATAAATTTTTTCAGCACAATCACAAACCTATCAAAGACGGATTACTTTTATAACGGCGGTAATATTGCATAACTTTTGGCACATAAGCGCGAGTTTCAATAAACGGCGGAATTTTATAACCATATTTGCGCACATTGCCTTCCCCCGCGTTATAACCGGCAACTGCGAGCTCTAAATTGCCATTAAATTCGTTGAGCAACCATTTCAAATACGTTGTTCCTCCGCGGACGTTTTCGCCAGTATGAAAAGCATCGGTTACACCAAAACGCCGCGCCGTCGCCGGCATCAATTGCATCAAACCTTGCGCGCCCGCGCGTGAAACAGCTTTGGGTTTATAAGCCGATTCAGCAGAAATCACCGCATGCACCAAAAAGGGATCGACGCCAATCGTGCGCGCGATGCGGTTAATTTCGGCAGATAATCCGCGTTGACGTTCCAAAAATTGTCCACTGACGGGAATGTCTAACGTTTTAATATAGGCGGCTTTAATTTTGCGCGCTTCAACAATCAAAGCGGCAGGGCAATCTTTTGTGGGCTCGAGGTGAAAAATTTTGCCATCAGCGCTGGTTACCGTTCCGCCGCAAGAACCTTCAACGTTTAAATAATTTTGATACGGCGCAACCGTTGCCTGCTGCTCACCATAAACAACCACGCCGCCGCTGCTGCCGAGCGTAAAATCATAAGAACGCGTGGCGCATCCATATTCACGTTTTACGCCATTAAAATGCACTTTTCCCGCGCCGGTGCAACGAATAGCAGCGGTTGCCGTTGGTTCTGGTTCATTAAGATTTTGCACGCCTTGAACCGCTAAATCGCCTAAAGAACGCGGCTGTGTAGATTGTTGTTTAAAACGCGCTTTGGCGCTGGCAAACGAACGAGAAAAAAAACTGCAACGTCTAAACTTAGTTTGTTGATTTTCCGCAACATAACTGGCGCGCCCATCTTGATCAAAACATTTATAAATATCCATCGGCGGTGAATCCGTAACGGCAATTTCGCCTTTTCCTGATGATTTTTCGGTGGTGGGTTTGGACGTCGTTTTTTTCGGTTTGGGTTGCCACGCATCGGCTTTTTTAGCAAAAAAACGGCATTGTTCGTGAGGCGGCTCCGTGGCTTCTAAAATTTCATCGCCATTAGCATGAATACAAATATAAAGATTAACGGGTTTGGGTTGATTTTTTTCTTCTTTTTCAACGGTTTTCGTTTCTGGAATTTGATTTTTATTGAGCGCGGCAGGTTTTTCTGATTGCCCCGCTTGAACCTTGGGTTGCTCGATAAAACGAAATAACTGCGATGCCGGAATGCCTTGTTCGGGAACGTTTTTTTCAGGAAGCGCGATCTTTGTTGCCGTCGGTGGCACGGGAGATACCGCTTTCGGTGGCGCGGAAGTTGTTTTTTGTGGCGCAAACATTCCTTTTGTATAAGGCACGCAATTTGCGGATTTTTTTTCAGCAGGAACAATCGTTTGCATACCGTCCGGCGCAGTGCAAACAAACAGCGCATTTTGCGCAAAAGCGCTGCCTACAAAAATAAACGTCGCGCCCACAAAAAAACGGTAAAAAACGTTCATCATGATTTTCTACTTTTCATCACGTCAATACAGACTTTTACCGGCAACCATTGCCGCATTCCTTTTTTCCAAATTAAATCGCGCTCGTCGATGCTGCCGTTATTCAAACCTGCTTCCAATTCCGCCATGCTAAACGGTCCAATTTGTTCCCCATTTTGCACGCGGAAAAATTCGCGCCGTGTGGGAAGCGTTGAGTTTGACGCATGATGAACAGGCGCCGGTTCATTTTTTTTCACCGCGTTTTTTTCGGGAGAGTGTGCAGACGGTTCCGCTGATAACGGCAACGGCGCGGTTTCTT
CAGCGGCAATCGTCAGCGCATTGATTTGAATGCCTTGCGCGTTGAGCGTTCTCATTAAAGATTCGGTGAGAAATTGCCCTAAACCAACGGCATGCTTTTTAATATCTTCCGCGGGCACTTTTTGTGCGCGCAAAATATCAACCACGGCACGCGCAATAAAATCATTAATCGCGTCCATACCGGATAAATAAGATAAATCACGGAAAAATTCGCAAAACGCCGCCGGCGCGGTAATCGTGAATTCATAATTTCCTTCAAGCGCCAAAGTCAACGCGGCTTCGGGCAAGGTATAACGCGGATAATGCCAAACGTTGCGCGATGGCGTGTGGCAACGAAAATTAATCAAATAAAGCGGCTGTTGATGATGAAAAAGCTGCAAATCCGTTTTATGAATCACATGTTCACCGCCGATAAATAAATGGCTGGATTGTTTTTTTTGCAATAAAAAAGCCATTTGATCTTTGGGCACAAGCAAACGATCGCCTGCCGCGATTTTATGAAGTTCCGTCACCGTTAAACATTGTTGCTCGGGCAATTGCGGCGCCGCAATTTGGTTTGACGAATGAGCAAAAAATTTATTTAAAAAAGACATCTTATTGATTTTTATGAAGCAGTACTAATGGTGGATTTTAATGCGTTTTTAAGCTCTTGCTCGGCAGCGATTAAACGATCCTCGGTTTGTTTGCGTTGAATACGGGCTTGTTCATGAATTTGAATGCCTTCTTCAATGGTTGCAATTAACAGCCGATTGGCTTGCTCGACGGCTTCAATATCAAAAATACCGCGTTCAATTTGTTGCCGACTTTCACGATTGGATACTTGCAAAGTTTCGGCGTTTTTCATCAATAATTCGTTCGTCAGATCGCTACTTTCTTTCAGCGATTGCGCCGCTTGATGAGAGCGATAAATCGTAATCGTTTGCGCGAGTTGCTGACGCCATAAAGGAATCGTATTCAATAAAGTGCTGTTGATTTTATTAATCATGCCTTTATCGTTTTCTTGCACTAAGCGAATGCTGGGCAAGGCTTGCATGGAGACTTGTTGCGTTAACCGCAAATCCGCCAAACGGCGCTCGATTTCATCGCGCATTGCTTGAAAATCGCGCAAACGTTGGGCGCTTTCCATGGTTTGACTTTCTTCGGTGCGCGCTTTTAAAGCGGGCAAAGAATGTTGATCGGCGTGTTCAATGACTTGTTCGGCAGCGGCAACGTAATGACGCAAATCGCGGTAATAACCCAGCGTTGCTTCATAAAGTCGATCTAATGAAGTGATATCCAGCAATAATTGTTGTTTGTGCTGCTCTAATTCATTAGCGATGCCGTCGATTTGATCGTTGACGGATTCAAATTGTTGAATAAACGATTGCACGCCGCTGGCTTTGCCCAATAATTTATTCCAAAACCGCGGTTTTTCGTACAATTGTTTTTCTTTTCCTTGAAAACCACGCAACGCGCTGACCATATTATTCAATAAAGCGCCGGCGCCGCCGAGATCTTTATTGCGCACATCTTTCAGCATATTATTGGCAACGCTATTGATTTGCCGCTGTGCATCTGATCCAAAACTCAATATTGATTGACTGCTTTTAATATCAAGCTGCGCCACCAATTGCCGCACGCGCTGCGGATCTAATTGATCGGAATCCTCTTGTTCGGTTATCGTCAAAGCATTTTGCTGCTCAATTTGGATATCATTTGTCAT
Protein sequences of DBSCAN-SWA_3 >NC_009446|674078:683705|682607_683705_-|WP_012030970.1|DBSCAN-SWA MTNDIQIEQQNALTITEQEDSDQLDPQRVRQLVAQLDIKSSQSILSFGSDAQRQINSVANNMLKDVRNKDLGGAGALLNNMVSALRGFQGKEKQLYEKPRFWNKLLGKASGVQSFIQQFESVNDQIDGIANELEQHKQQLLLDITSLDRLYEATLGYYRDLRHYVAAAEQVIEHADQHSLPALKARTEESQTMESAQRLRDFQAMRDEIERRLADLRLTQQVSMQALPSIRLVQENDKGMINKINSTLLNTIPLWRQQLAQTITIYRSHQAAQSLKESSDLTNELLMKNAETLQVSNRESRQQIERGIFDIEAVEQANRLLIATIEEGIQIHEQARIQRKQTEDRLIAAEQELKNALKSTISTAS >NC_009446|674078:683705|676519_677206_-|WP_041729866.1|DBSCAN-SWA MEVCPENVRIEESWKKVLYEQFSQDYFVRIKETLLAAKAQGIVTYPPNKLIFNAFDQTPFDAVKAVIIGQDPYHGRGQAMGLSFSVPKGVRPPPSLLNIYKELKRSYPDFQVPDHGDLSAWAKQGVLLLNASLTVEAGKAGSHRAIGWQEFTHAAIEALSNEREHIVFMLWGNFAKAKAQFIDAEKHCILTAVHPSPLAGGAFIGCDHFRRANDYLIAHGKTPINWQV >NC_009446|674078:683705|677211_677610_-|WP_012030965.1|DBSCAN-SWA MATEKHTNVVIMLNKISLLTILAVILCACQSSLEKKQTIIDEQEPEQVHLPLGRHRWQCDRGEKIETQLRDHQEKKLTLTYQGKKYHLQKQESTRPVIYQNATLAFFSDGRHAVIGAPYSDNVLVTGCRLQE >NC_009446|674078:683705|681705_682596_-|WP_012030969.1|DBSCAN-SWA MSFLNKFFAHSSNQIAAPQLPEQQCLTVTELHKIAAGDRLLVPKDQMAFLLQKKQSSHLFIGGEHVIHKTDLQLFHHQQPLYLINFRCHTPSRNVWHYPRYTLPEAALTLALEGNYEFTITAPAAFCEFFRDLSYLSGMDAINDFIARAVVDILRAQKVPAEDIKKHAVGLGQFLTESLMRTLNAQGIQINALTIAAEETAPLPLSAEPSAHSPEKNAVKKNEPAPVHHASNSTLPTRREFFRVQNGEQIGPFSMAELEAGLNNGSIDERDLIWKKGMRQWLPVKVCIDVMKSRKS >NC_009446|674078:683705|674078_676463_+|WP_012030963.1|DBSCAN-SWA 
MVDEYVIWFENLRMSDVERVGGKNASLGEMISQLTDRGVRVPGGFATTAAAYRAFLAHDGLDNRIAAALKDLNVDDVVALAEVGQKIRQWILETPFPQAFDEALAKAWKKMVADAGSDEISVAVRSSATAEDLPDASFAGQQETYLNIKGLDNVKEAIHHVFASLYNDRAISYRVHKGFAHDMVALSAGVQRMVRSDTGASGVMFSIDTESGFNQVVFITAAYGLGETVVQGAVNPDEFYVHKPTLAAGRPAILRKTLGSKRIKMIFSDTAQAGKSVQTIDVSLAERQRFAISDKEITELAQFAVLIEQHYGCPMDIEWGRDGFDGKLYILQARPETVKSQEQHQNTLRRYQITGEKQALCRGRAIGQKVGQGRVRKVSDASEMDKVQAGDILVTDMTDPDWEPVMKRAAAIVTNRGGRTCHAAIIARELGIPAVVGCHNASEVLQEGQEVTVSCAEGDTGFVYRGKLNVDIHDFALNDMPEPPVKIMMNVGNPELAFSFAHLPNEGIGLARMEFIINRQIGMHPKLLLAFDEQDEEVKEEIEDRIAGYDSPVDFYVRKIAEGVATIAASVYPRKIIVRLSDFKSNEYAGLIGGKQYEPHEDNPMLGYRGAARYIAPDFADCFALECRALKYVRDEMGLINVELMIPFVRTLKEAAAVVETLKNNGLERGKNGLRLIMMCEVPSNAILAEQFLEYFDGFSIGSNDMTQLTLGVDRDSGGVIAATFDERDPAVKAMLHLAITACRNQGKYIGICGQGPSDHPDFARWLVEEGVSSISLNPDTVISTWLYLANAKQ >NC_009446|674078:683705|677579_679445_-|WP_012030966.1|DBSCAN-SWA MFKNILLAFFLIITSARANRLLEAEAALRAGAPLSDYQQFFGHPLYPYLQYEAYVQHGETTASAEIVTFLQQHARSPFAAQLAVKIFPIWLKNNQTDAILAAYSPHFASERCECIWRQALLATGKTAAAQKHIEELWLKDGNIDSACDPVFAHPNIKIPPALIAERFKRTLRANNLALAQYLTRLMTGSDQQAAQLWLSIRRRNEPIEKAFLLPQTWKSAILADGITRLASKDLTAATQLALEALRQNVSLEEDEALAESFNRLAAKLAQQDAPELLFIYHAIPRDRQQMNTVFDVIAHHLRTHKLSSLAPLLLDTLDAETLKKPEILYWIAKSFERSGHQEKADNYYRRAALERDFFGFLAAEKMKQSYRFNNKSLVKTIDYHRIMRRPETYRLKTFWQLGEKRRALQEFYSLQKQLTPEQLEQLALFADELGWSVQAVSTLAKTKKWDYLQQRFALHHQDLVRQMAKQLAISPAKIFAIMRKESIFQPEIKSAAGAIGLMQIMPATAHHTAKKHNIPYSGSASLIDPAVNIRLGSWYLYDCLNQFGHLAYAAAAYNAGPARVNKWLQERPNLPLDEWIAQIPFYETRDYVKQVLEYEKVYEYRLGLPVQPWQQKNIRMW >NC_009446|674078:683705|680155_681106_-|WP_081423615.1|DBSCAN-SWA MDIYKCFDQDGRASYVAENQQTKFRRCSFFSRSFASAKARFKQQSTQPRSLGDLAVQGVQNLNEPEPTATAAIRCTGAGKVHFNGVKREYGCATRSYDFTLGSSGGVVVYGEQQATVAPYQNYLNVEGSCGGTVTSADGKIFHLEPTKDCPAALIVEARKIKAAYIKTLDIPVSGQFLERQRGLSAEINRIARTIGVDPFLVHAVISAESAYKPKAVSRAGAQGLMQLMPATARRFGVTDAFHTGENVRGGTTYLKWLLNEFNGNLELAVAGYNAGEGNVRKYGYKIPPFIETRAYVPKVMQYYRRYKSNPSLIGL >NC_009446|674078:683705|679460_680153_-|WP_123962257.1|DBSCAN-SWA 
MLKKFIIFLLLNAFATHCFAARSCGSYVKNGKTHYIPCSLGRPLTTKTSVSSACKAELKINGVLIRIRHQCIVKPSGKTGAQTVSKPSGEFEKRQNEMKNLINELAKKHQVDASLVHAVITVESAYRSEVVSNKGAIGLMQLMPTTASALNVNDPFDPALNIDGGVRYLRQMLTYFNGNEALALAAYNAGPEAVKKYHYTIPPYPETQNYVNRVLAYRGHYQNEWKMHIQ |
8 | Hokovirus(16.67%) | NA | NA |
Homologous phage analysis in the prophage region. Bacterial proteins shown in color match specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
DBSCAN-SWA_4 |
790933 : 839041
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NC_009446|790933:839041|DBSCAN-SWA GATGGGACGCATACTCAAGATTAAAACGCCGCGCTGGGCATTACCGCTATTAAAACCAGCACGTTATAAAGGCGCTTACGGCGGACGCGGTGGCGGTAAATCGCATTTTTTCGCAGAACGCATCGTTGAAGAGCATATTATCAACCCCAACCGGAAAACGGTTTGCATTCGTGAAATCCAAAAATCATTGCGCCACAGCGTTAAAGCCTTAGTGGAATCAAAAATTGAAGCGCTCGGCGTAAGCAGCGCCTTTGATATTCAACGTGATTTAATTTTAAACCGCGGCGGCAATGGTTTAATGATTTTTCAAGGTATGCAAGACCACACCGCCGACAGTATCAAATCCTTAGAAGATTTTGATTGTGCGTGGATTGAAGAAGCTCAAAGCATTTCCAAGCGTTCGCTTTCCTTACTGCGCCCGACCATTCGCAAACAAAACAGCGAAATTTGGGCGAGCTGGAATCCGCAACTCAAAACCGACCCCATCGACCAATTTTTACGCGTTGACTGCCCCGAAAACAGCATCACCGTTGCCGTCAACTTACGCGATAACCCGTTCGCCTCTGAGGAAATTTGGCGTGAATATCGTGACGACCGCGAGCGTGCCAAAAGAAAAGCCGCCGCCGGTGATAAAAATGCGTGGGGCGAATTCGAGCATATTTGGCACGGCGCCTATCGCAGCCACAGCGCCGCGCAAGTATTAGCCGGCTGTTACCGCGTTGCGGATTTTGCAATTGAGCCGCACTGGTCAATTTATCACGGCGTGGACTGGGGTTTTGCTTCCGACCCGACCGTACTCGTGCGGTGTTATTTAGATGAAGCGGCGCGCGTTTTATACATTGCCGAAGAAGCCTACGGTGAGCACGTGGAAACGGTTGACGTGCCCCAATTATTGAGCACCATCACCAACAGCGCGCAATACGTGATTCGCGCTGATGATGCTCGTCCGGAATTAATTAGCCATTGCCGCAACCACGGCTTCCCATTAATGCGTGCCGCCGGCAAATGGCAAGGCAGCATTGAAGACGGCGTCAGCTTTTTGCGCGGACTTGATGATATTGTTATCGCGCCGCGTTGTGCGCATACCCTCGAAGAAGCGCAATTATGGAGCTACAAAACCGACCGTTTAACCGGCGACCCGTTGCCTGAATTAGACGATGCGCACGACCACTGCTGGGACGCCATTCGGTATGCGTTATCTGATGTAATTCGCGGCGGCTATCAAGGCAACAGCATTATTGCCGGCGCCGCACGTGCTTTTAGGAGAGGACGATGAACGATAAAGTAGACAAACAAGCCAGCGCTGCCACCAGCGCCCAAGCGTTATATACCGACCCCGTCTTTACCCTGACTAATGAAGACGCTGATAAAGTATTAAAAAATGCCGGACTTTCGCGTTCGGATTTGGGCAAACTGCTTTATGATGATGAAATATTTGCGTGCTGTGACCGCCGCGAAAAAGCAGTCGTGGGCACGCGCTGGCGCATCGAAGGCGATAATACCGATTGGTTGCACGCTGAAATCAGTCGCTGGCATGAAACCCTTGTCCGGCGCACGATGGACGCGCAATGGATTGGCAGCAGCATCAGCGAGTTAATCTGGCGCCGCCCCGAAGAAGACCATAACGGCATTCGTTTAGCCGCGGTTGAGCCGCGTAAAATTGAGCGCTTTATCAATCAAGACGGCGTATTGCGTTATCAAACCCAAAGCGGCTCCTATATTGATGTTGAACCGTTGAAAGTGCTTGAAGTGCGCATGAATGCGAGCGCGGCAAATCCCTACGGCGACGCATTATTGAGCCGCGTTTATTGGGCATGGTTTAATAAAAATTACGGGGAGCAATTCTGGAGCAAATATGCCGAACGCCACGCCAGCCCGTTGACCGTGGGCAAATTCAATCCCCGAACCAATAACCAAGCCGAAGCGC
AACGGCATTTAAACGATTTAGCCATTACCTTAGCGCAAGCCATCAGCGATGGCGTTATCGTCATCACGCAAGACGACGAAATCTCATTCGTTAATGCTACATCGGACGGATCCGCTCATCAGCTTTTTACCCGTCACCATATTCAACGCATTCAAAAAACCATCATCGGGCGCGTGCTCACCAGCGAACTTGCCGGTGGCAGCCGCGCTGCCCAAGAAACCGATGATAATTTCTCGTAAATTCTCTTTGATTACGATTTAACCTTATGCGAGCGCGTGATAAACGAATTCATCGCAAAAGTGCTGCGGTTAAACGGCACCGCGCGCGGCGACATTCTTTTTGCTTACGACCGCACCGAAAGCATCGATAAAGAACGCTGGGAACGGGATACCGCACTGATGGATAGAGGTATTCGGTTTACAGAACAATATTTTATCGACCAATATCATTTGGAACCAATTTACTTTAGCCTCGAGCAAATAGAGCGCGCGGCACGTTCTGAACGTGCGGCAAATGCTGCTCAAAAAGCCGGTTTATCGTTATCGAAAAAACAAGAATTAACCCCTGCTGCGCAAGCGTTAGAAGACCGCGTGCAAGCCGGTATGGCAGAAGCGCCCGAACCCATTACCCGTGAAATGATTGAAGACGTGGTCAAAAACGCGCCCAATGATTACCAATTGTTGGAAGATTTGGTCAAACTCTACGGCGACCGTGACCCCGAAGGGTTTAATGATTGGTTCGGGGAAGCGCTCGAAATCGCCTGCGCGCACGGTTATCACGACGCTGACCAAGGAACTTTATGAGCGAAGCCGCCAGTTTTTTACGCCGGTTAAAAGTGGTACCGGCGCCGGAGTTTTATGCGCTGCAAGCAAAGCTCAAACAAGAAGCATGGACAATGTCTAAAATTGCGCAACTCAAACAAATCCAAGGCGTTTTAGATGATTTAGCGCAAAATATCCAGCGCGGCGGTACTTTTGAAGAATGGCAAAAAAATTACACCGCCCAAGGTTTGCCGCGGCATTATTTGGAAACCGTTTACCGCACCGCCTGCCTGCGCGCTTATAACGCCGGTAAATGGGTGCAATTCCGCGCGCAAAAAGAAAACCGCCCCATTCTGCGCTACAGCGCCATCAATGATAGCCGCACGCGTCCGCATCATAAGGCATTGCATGGTTTTATGGCACCCGTTGATAATGTTGCTTGGAAAACACTTGCACCGCCTAATGGCTTTAACTGCCGCTGTACACTGATGAGTTTGAGCGCACGGCAAGCGGCGAAATTGGGCTATAAAGGCACGGTGACGGTGCCGGAATATGTGGACGAATACGGCATCAGAAGACCGATTACACCGGATAAAGGTTGGGAAAGTTCACCGGAAAATAGCAGCATTCTTGCGTTATTACGGGAAAAGGAAAAGAAAGCAGGGTTTCCGCCGGCGCATAAAGTCTTCCCCGAACCGCTGCCGTCGCCGGAACAATGGCAGGAAGTCGCCAAAATCGGCGAGCAAGTATGGACAAAACATCTGCCGGAATTGGAACGTTCTTTTGTTAGCGGCGATTATCCGCGCACCATTCGCAAGATATTAAAAGCTGAAGGCGTGGATTTGGGCGCACCGCCACGGGTGAACGGTGAAGCGACAGAACGCTTCCAAAACATCATCAAATCCAGTTATCCGCGAAAATGGGTTGAGCGCGCTAATGAAGCCGGGCGCGTTTTGATTCGCAACTTTGATAATGAACGCGGCTTTCAATTGTTTATTAAAGATGAGGCAACCGCCGCCTATCTTAGAAACAACGGCGCGCGCTTCAAAGACTATAAACCTTTTGCGCCGAAGAAAAATCAGGTGAAAGCCGGCGATAGCTTTTTACGTTTGCGTGAGATGACGCGCAAAGAAGCAGTAGCTGACGCTTTAGAAATCAGCATTCATGAATATGCCCACCGCCTGCAATTCATCATGCCGGAATTAGATGCTTATTTTATGAACATGTGGCTTGACCGCA
CGCGCGGCGAAAAAACGCGACCGCTCAATGACATCTTAAAAGAGCAAGGCGAAAAACCTGTATATAACGCGAGAGAAAAAGGACGCGCTGACCACTTTTTTAACCTATATTTTGGGAAAAATTACGGCAGCGATGACGAACCCAAACCACTCGAAATGATGACCATGGCTTACCAATATTTATTAGCCGGTTCTGCTTACAAAAGCCATAGCGAAAAATGGAGAGTTTATCATTTGCATCACAAAGACCCTGAAGTGTTATACTTGGCATTAGGCTTATTATTGAGGTATCAGCCATGAAATATGAGTTTACTTTGAACGAGTTACCAGCTTCTCTGGATACCGATAAGGGTATTTTTACCTACGAAGATGAAGAGATAAAAAAAGTTATTGATGAGACAATCGCCGATGCGAAGTCTTGGGGGCGTTGGGGCGGTGGCACCAGTATTTACGTTGGTATTGATATCACTGACCCCTACCGCGATATTTATCAGTTTACTGCCTGTTTATATACCGCCATGGCTGGGAACCACCTGCCCAAGATTAGAGGCTCTGATACAGACAAATATTTCCCCAAAGAGCTTTATCCTTATGCCCCTTGTCTTGCCGGCTTATGCGAAGATATACCTCCAATTAATGTCTTCTTAGGCACCAAAGAAGAGTTTGAAGCCTACGGCGAAAAATTAGAAGAAATGGAAAAATTCGGCGTTGTCTTCTAACCATTAACCCATTTATTCCCCCCAGCAGCCAGCTTTAAGCTGGCTTTTTTATTGCCCTCGTTTTGAGGGCTTTTTATTTTAAGGAGCCAAAATGACTGAAACGATTCCTTATGAACGTCCCCAGTTCCTCCATTGGGAAGCCAATCCCCACGTCTCGCGCGAAATCGGCGTCGCTAAAGCTGCCATCAAAGCCGGCGACGTGGTGCTTGCTACTGATGGCGTCAATTTTGAAGTGCTTAAAGACGGCACTCCACCACCACCGTCCACCGGCGATGCGCCCAAAATCGGCTTTGCTTTGGAAGATGCGGCAGCCGGTGCGAAGTTCGCGCTCGTCGTCCGCTTTGCCGTAATTCTCATTGATAAATTAAATCATGTGACGGCGGCATCATTTGCTAAAGGTGGCGCGTTTGAATTCTACGGTAATTTTTTCCCACCCTTAAATATCATTTTGAAGAAATCCGTAGACGTGCAACGCGGCGTCAAACCCTAAGAGGTACCTCATGAATGAAATCATTTTATCGTTGCGCCTCTCGCTGACGGGGCGCGAATTTACCGGCGTCGCCTACAGCGGCACCACGTTATCTTACGGCGATAGCACCATCGGCATTGATATCAGCAGCATTCAAAAGCTGGATAAACAAGTGCCCATCTTGTTAGAGCACGACCCACAAACGCCCATCGGTTTCGGCAAACTGCGCGCGCAAGAAGGCAAGCTCATCATCGACGGCACTTTGCTGGATAACAAAAGCATTGCGCAAGAAATCATTGCCGATGCTGAAACGGGCAAAGAATGGCAGTTATCCGTATTCGTTGAAAGCGACCGCATCAGCGAGCGCAAAAGCGGCGACATGCTCAACGGGCAAGCCATTGAACAAGATAACGTGTTGGTCTTTGAAAACGCGTTGATTCGTGAAGTCAGCTTTTGCACCCTCGGCGTTGATGGCAACACGTACGTCAGCCTGTTATCGCTGAACCACAAACCCGACGACACCTTAAAAGTCGCCGAACAAAATGCCGATGAGCAAGATGAACAAAATGCTGAATATGAACGCGTTTGCGCCGAACTTTCCGCAGCGCGTGCCGAAGCTGAACAGCTCAAAAAGCAACTTAACGACCAAGCGCAATCGGCGCGCCTATCACGCTTAAAAGCGCTTGGCATCGCCGATGTAGCCGCCGCGCAACTTGCGGTTATTGAAGGTGAAACTTTTGAGGTGATGCTTGAGCAATTTGTGCTGCAACAAAAACAAACCGCGGCATTAAGCATTAATGCCTATCCCACCCC
AACCCCCGAACGGGAAAACCCCATAAAATGGAGTAAATAATCATGGCAGACGTTTTAAGTAAACTCGGGCTAACTCAAGCTGAACTTGATGAAGCCGTCAACAGCAAACCCAATGTACCGTATCGTTTATTAAACAGCGGTTTATTCCAAGATAAAAATTTAACCACTACCAGCGTGATGGTTGAATTTACCGATAAAGAGCTCAAATTGATTCCGGCTAATAGCCGCGTCGGTGCGGCTAATGTTAGAGCCTACGGCAAAGGAAGCACCGTCCGGACTTTTACGCCGCCGCTGTTAAATTTAGAAACCACTATCCGCAGCGAGCAAATCCAAGATGTGCGCAAAGTGGGCACACGTGATGCGCTGCTTTCGAATGCAGAAGCGATGCAGGAAGAAATTGCAACACACCGCGAAATGCACGACCTCACCATCGAGCACCTCATGCTCGGCGCCATCAAAGGCAAAATCATTGATGCCGATGGTGAAACAGTTTTATTTGATTTGTTTAAAGAATTGGGCATCAGCGAACCGTTAACCACCATTGACGCCGCCGCTGCTGATATCGGTCAACAATTCGCGAAAACCTTGCGCATCATGAAAGACGGTTTAGCCGGCGACACCTGCACCCAAGCGCGCGTGTTATGTTCGCGTGGTTTTTTCGATAGCGTTATCGCGCATAAATCCGCGCATGAAGCCTTTGAGCGTTATCAAGAAAATGCTTTCGCGCGTAATTTGCCCGTTGATACCTTTCAATGGAACGGCTTTATTTTCGAAGTTTATCACTACGAAATCGACGGCAAACCCGTTATTGCCGACGGTGAAGCGCATGCTTATTTAGAAGGTATGCAACGCGGTTTTACCCGTTATAACGCTACCGGTACCTTAATGAGCGCTGTCAACCAAATAGCACGCCCGTTTTATATCGATATCGAAGATTTAGCGCATAAGCGCGGCATTAGTGTTTATACCGAATCCGCGCCGTTGCCCTTGTGTTTGCGTCCTAAAACCTTAATGCATTTCAAAATCTAAACCGCGCTTTAGGAGATTGCTATGTATATCACCGGCGCCGACATTATCACGCGTTTTGGTCGTGATGAATTAGCGCAAATTTTAGCCGTGCCTATCGATGATTTAAGCCCTGACCATGAGCGTTTATTAGCCGCCTGCCGCGATGCTGATGCCATCATCGATAGCTATTTAAGCCGTGCGCTGGATTTGCCGTTAAAAACGCCGCCGGCGGTGCTTATCGCATACGCCGCTGATATCGCTCGTTATCGTTTGCATGATGACCAAGTCGAAGACGGCACCAGCGTGCAGCGGAAACGTTATCAAGATGCCATTTTATGGCTGGAACGCGTTGCCCGTGGTGAAGTCTCACTCATCGCGCACAACGAACAACAGCAAAAAAACATTAAACCATCGCCAATGGCAGTTAGCAGCGTCAGCGGTTTAGCGGTGGTTTCGAGCCCGCCCGTTTTCACCGATGACTTGCTGGCTAAAGGGTTGGTGAAGTCGTGATTTTTGTAAGCGTTTCTGGTTTTGATGAAGCACACGAACGTTTGCGCGCACTCGTTTCACGCGGTGCCGATTTAACGCCGCTGTTAACTGAAATCGGCGAAAACGAAGTCAGCAATACTCTCTTACGTTTTGAACAAGCCCGCGCACCCGACGGCAGCAGCTGGCAAGCATTAAAACGCCCGCGTGCCCATCGCCGCGGCGGTGATGGCGGCAATCTTCCGCTTAACGACACCGGCGCCTTAAAAACCAGTATTAAAAGCCAATTGATGGGCGACAGCAGCGTGCTCATCGGCAGCGATTTAGTGTATGCGCTCACGCATCAAAGCGGACGCGATGCTATTCCTGCCCGCCCATTTTTAGGTTTAGAAGCCGATGCAGAAGCAGAAATTCTGGAGATAATTCATGCCTACTTTGCCGATTGACCCTTTAGATTTAAATAGCGCTTGGGACGCCATCGTGGCGCGCATTGCTGCTGTTGATGG
AATTCGCGCCGTGCGCGGCGCGGTGGATTTAGAAAAAATCATCAACGGGCAAACCACCGGCAGCAATCATTATGCCTTTGTGACTTTTGACGGCATCGAGCTTTTGCAACCTGCCGGCAATTCAACACGCCATCAAACCGTTGATATTGCTTATCAAGTCGTTATTGCCGAACTCGATTACAGCAATGATGGCAAACCGCAACAAGCCGGCGTATTGCTCGGACGCGTACTTTCGGCGCTGATGAGCTTTGCACCTTTTACTGATGATAAACGCAGCAGTCAACGCCTGCGCGCCGTGACGCCACCGCGCGCCGTCCACCGTAATAAATACAGTCTTTATCCGCTTAAATTTATCTTAAGCCTACATATTAAAGGAGAACAATTATGAGACACCCCACCGCGCATATTCGCGAAGATGCCTTCATCGGTGAAGGCACCGTCAAATTAAAAGTGAAAGGACGCGAAGAGTTGGGCATTTTTGAACTCGGTAATGCGACCGAATTTTCGGTTGCGATGAGTTCTGAAGTGCTCGAGCGCATCTCTAAAAAACGCGGAACCTACGGACAAGTATTAAATACCGTCACATTGCCCAAAGCCGGCGAGCTTTCCATCACGATTGACACCATCAATAAAGAAACGTTTGCAATGGCACTTATGGGTTCTTTAGGCGTTGATGAGATGAAAGCTGAAACCGTCGCCGATGAAGAAATCGCTTTCAAACCAGATGTTTGGCTCAAACTGCCGCATCGTTATATCGATGGCGAGGTCGTCGTTAAAGATGCCACCGCCGCCGGCACGGTTGACCCAAAAGAAATTGAAGTAGAACCGCGGCTCGGGTTGTTCCGCGTGACGTCCGCCGGCGCTGCTGCCGCTAATATCGACGGCACCGTTAAAGTCAGCTACAAAACGGCAACGTGGACGCGCTGGGTGATTCAAGCGTTCAAATCAACGGAATTGCGCGGCGAATTGTGGCTCGATGGGAGGAACCGGATTACCGGTGAAGATGTTTTGCTGCACATGCCGCAGTTTACTTTAGCCGTTGACGGCGAATTCAATTTCTTCACCGATGAATTTAACACCATCACATTTAAAGGTCGTCCTGAAACCGCTGAGGGTTATGAAACCGCCTTTACCGTTGAGATGAAGGAATAACGATGAAGATTGAATTAAAACAAGTGGTTATTGTTGGCGGCGTCGCGCGCGCTGCCGGCAGTGTTATTGATATTGCTGAAAGCACGGCGCGCTGGTTAATCGAACAAGGCGCTGCGGTATCTAACGAAGATTTAACTGTCGCCGAACTCAAAGCGCAATTAGACGCGCGCGGCATTTCATACCGTAAAAATGCCAGCAAAACTGAATTATCCGCATTATTAACTAAATAGGAGGACGATGAACAATAAACCTTTTAAACAAGCACCTTTGCCTTTCATTGGGCAAAAACGCATGTTTTTAAACGCATTTCAAAAAGTACTCAAAGAACATATCCCGAGCGACGCCGAAGGCTGCAACCGCATGAGTGATTATCAACGCATCAATATCCAAACACGGCTCAATTACTCTTCTGTTTATGAAGACAATTTGACTTATAAATTCTCATAACCGCCGCTGGGCGGTGTTTTTATCAGGGAAAACACAATGGCAAATGATTTAAAAGTTGGCATTAAAATTGATGCCAGCGTTCGCAATTTTAAAAAAATCGAAGCAATATCAGAAACGTTAACCGGCGTAGCGCAGCAAACTGCAGCGCTACAAAACCAAACCGAAAAACTGCGCAGCGAATGGGAAAAACTCACGCCAGAACAACATGCCCAACAAATCACCGCATTAGAAAGCGCTATTGATGGCTTGGGCAAACAAACCGCCGGCGCCGTGTTGCAAAGTGATGAAATGGCGCACAGCTTTGAGCACGTAAGCGAAAAAGCGCAACAGTTACAAGCCATCGCCAAAACAAAAACGCAGCTCGGTATCGATACCGACGAACGCGCCGTTAGTAAAATTGA
ACAAGCTGTTGATGCCTACCGTGATTTAAAAAAGGAAGGCGGCACGGCACAAAAAGAATTAGCGCACGCTGCTGACGCGCACCGGCAAAAAATCGATGAACTGGAAAGCGCGCTGCGTAATACCAAACCCAGTTTGGGCGATATCGCCGGCGGACTTGCCAAAATTATCAGCAGCGCCGGCGGACTTTCCGTGGTCGCGCAATCCGCGATGAGCTTTGAAACAGCGATGGCAGGCGTTAAAAAAGTAGTTGATGCTACGCCCGAACAAATGCAGCAGTTATCCGGCACCATTCGTCAACTTGCCTACGAGCTCGGCATGACTGCCGAAGAGACAGCTAATATTACGGCGATGGGCGGACAATTAGGCGTGGCGTTTCAAGACTTGCCAGAATTTACCCGTCTCGCCGGCGAAATGGCAGTCGCGTTTAATATGACGGCGGAGCAAGCCGGAGACGCCGCCGCCAAATTAGCAAACGTTTATCAAATACCGCTGGAAAATGTGCGCGCGCTCGGAGATGCCATTAATACCCTCGGCAATAATACCGCTGCCACCGAAGCCGAAATCATTAATGCCACTTTGCGCATTGGCGGCACTGCCAAACAATTCGGGCTCGCTGCCGAATCTGCCGCCGCGCTTGCCGATAGTTTTATCGCCTTAGGCAAAACACCGGAAGTTGCTGGTACTGCCATCAATGCTTTGTTGAACAAACTACAAACAGCGCCGGCTGCCGGTAAAGAATTTAAAGATGCGCTCAAAAGCATCGGCTTAGAAGCTGATGCACTGGCGCAATCGATTCGCGCTAATCCCGAACAGGCGTTAATGTCGTTCATGGAAAAATTAAGCGCGCTTGACCAGCAGCAGCGCGCCATCACCGTTACCAAATTATTTGGCTTGGAATATGCCGATGATATCGCTTTAGCCGCCGGCAGCTTAGACAGCTTTCGCCACGCACTCTCACTAGTTGCAGATCAGCAAGCAACCGCAGGCGCCATGCAAGAAGAAACAAACGCGGCAATGGATACCGCAGCGAAAAAAATCGAACAAGCAAAAACTGCCATTAACAACATTGCCATCGAGCTGGGCAGCCTATTATTACCCATCATCGCCGACGTAGCATCTGCTTTTGGCGGTACCGTCAAAGAGATTTTGGCGCTTGCACAGGCACACCCGCATATTACGGAACTCGTCTCTGCGGTTTCCGGAATAAAGGTCGTGGCTTATGCCGCTGCAGAAAGTTTTAAATTATTAGGCGGCGTCGCACAAAACAGTTTTTCATTATTAGGGGCAGGGAGCGAACAAGCAAGCGCTGGATTGCACCGTTTACGCGGCGAAACAGAACAAGGCGTTGAAGGGTTTAGCAAAATGGATAAAGCAGCAAACGCTTTAAGCAAAACGCTAAACGTACTTTCTGCTGCAACGGCTGGCTTCACAACTGGCTTTTCATTCGGCAGTTGGCTTTATGAGCAATCAGAACACGTACGTGCATTCGGTGATAGTCTGGGCAAACTTGCGGCTTACACTGTAGCTATCTTTAGCGACAGCACATTTGAAGATGTCGCCACCTATTATAAAACCAGCGCGCAAGTTGCCACCGAAGAAACACAACGCCTCGCCCAAGCCAAAGAACAATTGGCGCAAAAAGCCGCAGAAGCTGCCGAAGCAGAACGCCAACATGCTGAAATGGTTGCCCAAAATCAAGCGCAAATTAATCAATTAGTTGCTGAAATTGAACAACATCAATCATCTTTACAAGTACTGCGCGAAGCTGGTGAAGCAGGCGGTGCAACTTATGCCTACATTTCATCGCAAATTGAAACTGCTAAACAAAAAATACACGAACTCACCGAGGCAATCGAAGGGAAGCCCATCGGTTTAATAATGAAAACCGAATTCGAAGCCGCCGATAAAGCCTTTAAAACTCTGGGATTAAGTTTATCCGAATTGGAAACTGGCATCAGTGATAAAGCCAAACAATCATTAGAAGCGTTTGGCACCGTTG
CAGTAGTTGCAGAGGGTAATGTGACGCAATTAGCGCGTGCCTACGAAGCAGCACGCCAAAACATGGGTAGCAGTAGCGCTGCACAGGAGCAGTTGAACCAAAAATTGCTCGATGCGGTTGGCGGCAATCAAGAACTCTATCAAGCCGTTATTCAAACCGCGCAAGCCCAAAACATCGCAAAACGCGCAGCTGATGAACAAAGCGCGGCGTTGCAGGCTCTGGGGTTAAATATGGAAGATATTGCCAATCGCACATCGTCAGGCGTGCAAAAAATGCTTGCCCATTGGAAAACGGGCATGGAAACATTGAAAGCATCAGGCAAACAAAGCGCCGAAGCCGTACGCCTTGCGTTTGATAACATGCTTAAAAGCTTGCACAGTACCGAAGATTTTAAAGCCTTTTCCGATGCGCTCCAAGAAACGGGTACCGCAAGCGCGCTTACAGCCGAACAACTGGCACAACTGCGCGCCGGCGTCAATGGCGGTGCGAACGCAGCACAAGCAGCAGCGCAAGCTAACGCGGCACATACGCAATCGCTATCTGCCAATACCAGCGAAGTGCTCGCCAATAAAGCCGCTATCGAACAAAAAACACAAGCGCTTAAAGAAAGCGCCAACGCCGCTAAAGAAGCCAGCACTGCTGAAGCAAAAGTCGAATCTCAAAGCGAAAAAACGCGTAAAACTATTACCCTTGCCGCGCCAGCCTATTACACCGAAGCGCGCAAACGTATCGAAGCGATGAGAGAACTGGGAGCGACCAGTGAAGAAGTTGAAAATGCTCTGCAAAGCTTTTGGCAAAAAACGCGCTTTAATTTTGGTGTAAGCAACGTTAACGAAGTTGCCACGTCCCTTGTCAGCGCGATGAAAGCCGCAGCCTCGGCGCGGGCGCGAATCGATGAAATGACGGAATCACTACGCAACGGTTCTTTTAGTGCTCAAGATATTTCCGAAGCAATGGGCAAATTACATATTTCCTCTTTGGATTCGGTCGATAGCGTGCAAAATTTGGGTGATAGCCGTTTAAATAATTTACGCGATGCGCTCAAAGAAGCACGCAACCACATGCGCGCGTTATCCGAAGAAGCGCGCGACACTGCCGACAGTTTGGAAGCTGATTTGGCGCGACTGCGCGGTGATGATTCGCTTGCCAAACAAATTGAAGAAACTAAAAAATTAAAAGAATTAGAAGCAAAGCGCGCTGCGGCACAAAAAGCCGGCAATAAAGAAGCTGCTAAAGAATTTGAGCGTGCCTTGATTTTACAAAAACAGATTTTTGCCGAAGAACAACGCCAAGCTGAAGTGCGTGCTGCTGAAGCGCAAAAGAAAGCGGCTGAACGTCAAGCCGCCGATGCCCAACGTAAACAAAATAAAGCCGAACACATCGCTCAAAATCGTACGCCGGATAATCGCATGGCTGACGATGCGCGCACGCCACAAGTTGATATGAATAAACCGGCGGTAACATTGGTTGGTTCTGCCCCCACGGCTGAAGCGCTTGCCGATATTTGGAACGCTAAAATCGCCGCCGCGGAAGAGCGCGGCGCGCAGCGCGGCAAAGAAGAATTCGCCAAAGAGCTTTATAACGCAGCTAAAAGAAGACCATTATGAATGATTACTGGCAACTCACCCGTAAAGACACGCAGGCATGGTTACAGTTTGACCAAGACATGCGCTGGATAGATGAATTTGACTGGTCAAACATCGCGCAATCTAATCCCGTGCGCACATTATCCGGCGCGCAAGTAATTCAACAAGGCACTAAATACAGCGGACGCCCGATTACGCTCGCCGGCGACTGGGTATGGATTCGGCGCGCGCAACTGCAAACAATGCAGGATTGGACGACGACGCCGGAGCTGGAAATGATGCTGACGCATTACGATGGACGCGTTTTTAACGTCACCTTTCGCTTGCATGAAAACGCTTTTGAGGCAACGCCCGTCGTCTACCGCACACCAGAAGAAGACGGCGATTTTTACACCATCAAAATTAATCTTATGA
CCATTTGAGGCACGATATGGAACGAAAAACCGCGCTGACGCGTCAAGATTTGCAAATTTATTTAACCGAACGCTTAACCGATGCTGACGACGGCGGCGGTTTGATGACGAAAACCGCGCTTACCGATGAAGAAAATCAGCTTTTTAATCCCATTTCGGACGTCGCGAGAACCATGGGCGCTTTTCATGCGCGTTCTGTGCACGCTGCCGTGCGACGCCCTGATGATACGCCGCTGGGCGGTGCTTATGTGATTTTAAGCGAGCCGCCGAAAACGCAAAACGTCAGTTATTTATTGTTTCGCGGCATTAAATATGGCGAAGAGCGCCGCGATATTGTCCCAAGAATTGAAGCTTATTCAGTTGGCACTATTGAATCCCGTATGACTTTATTATCCGTACAATCACGCGGCAGCCGCGTTATACAAGCTTATCAACGCCAAGGTGAGCCGTTGCCGCTGATTGGCGATGTTTATTGTTTGCGGCAGGACAAAAAAGGTTATGCGCAATATGAACAATATATTCAAGTGATTAAAGTATCCAGCGAAGACCGCACCTTCACCGACCCGACCGGTAAAGATTTTGTGCGCACTGTGGTAAAAATGGAAACATCAACAGCGCTTGAGCAAGACATGCTCGGCATTGATTATCCCGTGCTCGGCTACGGCGATGCACCGTGCAAAATCCGCGAAACCCACGTCGCCGATTCAGCGCAGTATTACGGCGTTAAAGCATTGAGCGCTGATGCAGTTTCCGGCGCGATGAAAATCCGCGTGCCGTCGTTGATGGAAAAACTAGTGCCGACGTCGCAAGTTGAAACGTCATTAGTTGACTTAACCGCGGCAGGGCAACGGCAAGTATTAGTGGATAATGCGATTAAAGGCGACGATGGATTTATCACGCAAACCTTGCGCCTGCGAACCTTAAATATCGATGAGGTTATTCATTTAGGGCGCGCCATTGTGCCGGATTCTTTGACGATTAAAGGCAATATCACTGCAAATGATGTTGGCGGCACATTAGTCAACGCATCAAGCCAAGAATCCATTGGCACCGTTGATTATGCGCGCGGTGAAATTCGGTTTTCGGTTTATACGTCCGGCATCAGCACGGTTTCATTTCGTCCTGCGGTTTCCGAGTTGAAAGTATCTGATACCACCAAAATTGATATCAGTATTAATAACCGTAGTTATAACTATGTGTTAGCGATTAATCCGATTCCAGCGCCGGCGAGTTTACTGGTTTCATATCGCGCGCAAGGGCGTTGGTATGATTTATATGATGACGGTTCTGGCGCGTTGCGCGGCTTTAGCGCGGCACACGGCAGCGGCGCGTTAAATTATGCTAGCGGTACTGTTACGCTGACCTGCGGCGAATTGCCCGACGTTGGTTCATCTATTTTATTTGCTTGGTGCACGCCGGCGCAGTATAAAAACAGAAGCAGCGAAACGCCGAAAGTAAGCATCGTGCTGGTATTAAATCAAACCGCAGATCCGGCAACGCTGAAATTGACGTGGAACACATATCAAGCAAGTTGTGACGCCGCCGGCAAAATCACCGGTAACGCAACAGGTTATTATGACGCGCGCACAAAAACGATAAAACTCGATAGCGCAAGTTATATGCTGGGGCAAAAAGTCACGCTCACTTATTCAAAATTCGAAGAGGCGGATAAAGTGCAGCAAGAGCATAAAGCGCCGCTACGCAATGGCGCCGGCGAAATTGTGCTCGACTTAGGCGATAAACCAATAGCACCCAATACTGTGCGCCTGAAATACAATTTATTGATTGAAGATTTTCAACAGCAAACATACGGCGAAATTTATTTAAAACGCATTGACCCGTATAAAGTATTGCGCGATAACGGCGCCGGCGTACTCATCGATGAAAATAATGTGCCATTCGGTACGATTGATTACGACGCGCGCAAACTCACTTTTAAACCGGAGACCACCGTTAAAATTCCCAAAGCGCAATATATGACTTATAAATCGGGTACTGAAA
TCGTTCGCAAATCGCAAAGCGATCCAGATAAGCATGAAGTGCGCGATGTGTTTCGTAATTTCTTCACCGGTTTTGAGTATCAAAGCGTTGGCGCTTTTATGCCATTCGGCGATGATGGCGTGGTTGAAGTGTGGTTTACGCCCAAAACCGTCACCAATACGTTTGAAGAAATCAATACTGAGCGCTTTTTAGAAATTGAACTCGCGCCCAATTTAGCAGAACGCATCGTCACCGGCAGCGTTCATCTTAAAGTCGGCAATAAATTTCATTTTGACCGCGCCGGCGCGATGTATACCGATTTAGATACTTCAACCGGCTATGCGCGCAAAGTCGGCACCATTGATTACCAAAACGGCGTGATTCGTTTATCAGAATTTACCGATATCAACGCGCGCATCGTTGCGCTTTCAACGACAATTGACAGTAATCCGGTTGATGCCGCTACTTTCCGCACCCCATCTTCGCCCATTCGCGCCGGCAGTTTGCAAATCCGCGCGACCACCGCAACCGGCGAACAATTAAGCGCCATTGCACAACTCGACGGCAGCATCAATGATAATAAAATTACCGGCACGGTTGACGTTGAAAGCGGTGTCGCCGCGGTTAAATTCGGTGAATTAGTCAATGCCGCCGGCAATGAAAATGAACCGTGGTATCAGGCAGAAGCCGTTGTTGATGGCAAAATATTTAAACCGGCGCACGTGCTCGCCGAAACCATTACTTATAACGCCGTGGCGTATACGTATTTACCGCTTGATACTGCTGTAATCGGCATTGATGCGGTGCGTTTGCCGCAAGACGGACGCGTGCCGATTTTCCGCCGCGGCGATATGATTGTTATCGGTAACCGTATTATTGAAGATATCGGCAGCGCGCATACGGCAAAAGGCGTTGTGTCATTATCGCGCGGTGATTTAGACGGGCTTTGTGTTTTCGATAACGCCGGCAAATCCGTTGATGCTCATTTGTATGATTATGACTTAACCGCGGGCACGCTGACGTGGTCGGAACCGCTTGATTTATCAGCTTATCAAATGCCGCTCAAAGTCAAACATGGGCAAGAAGAAGAAAACCGTATTATCAGCGTGGATATTGATGGCACGTTAACGCTGCAATTTCCGCTGCGCCGCGCTTATCCTGCAAACAGCACCTTTGTTTCTTCCGCGCTCATCGGCGGCGATTTGCAAGTGCGCGTGACGGCGCCATTCGGGCAAAAAGCATTTGATAATATTTGGTCAGATGAACGCCGCGGCGATGATATCAGAAGCCGATTAAACGTTACGGATTTCCCCTTTGTTTTAACAGACGACGGCACAACAACTGACCGCTGGGCGATTGTTTGGCGCGATGCCATGCAATTTGATTTATACAGTGAGGCACTCGGTTTCGTCGGTCGTTATGACACGTTAACGGATTTAGCACCGATTAATCCTGCCACCAATAAACCGTATTTCACGTTACCGCTCGGCGCCTTTGGTATCCGAAGCGGCGTTTCCGGTTGGGCTGCCGGCGAAACCGTCCGCTTTAATACGTTTGGCACGCACATCGGCGTCTGGATTTTGCGCGCAGTGCAACCGTCATCTGAAAAACAAAGCGCCAGCGATGGCTTTACGATTTGCCTGCGTGGCAACACCACGGAGCTTTAAGATGAATCAATACTATCGAACAGAACCGGTGAACGTTTATCGTTTTGATGATGAAGATGCACCGCCGCTGAATGGTAATCCCGATTCGCTTTTAACGATTTTAAAAGCCTGTTTAGTCACTGGTTACGGCAAGCAAAAACCGCTTGGGTTTTCACTTGCTTTTGAAGATGAGCACGTGAAAGTATTCTGCCCAAAACCGCGCGGTTTAGAACCGCAGTGGTTTTTACGCGCCAGCGATGACAACGGCGCCTCAGCCAGATTACAAATTTATCTTGACATGACGGGGATTAATGACGGGCGCATCATGTGCGCGCCGCAAACACCTTATAAATATTCAAATAAAAACCGCACGGG
CGAGTGGTTATTGGTTGGCAGCAGCCGTGGTTTTGTTTTTTTTGCGGTTTGCGGTTATACCGACGCCATCAACAAAGGCAGCTTTTGCGTCTGCGGCGACAGCAGCAAAAATGCAAAAGGCGAGCGCGCCGTTTATTTACACCACGCCGGCGGCAGCTGGTCTGATTCAGATATTTACGGTATACGTCCGCCCACTTTTGATACGAGTACGCATATTGCCGGCTGTTTAGCGCGCGTTGACCAAAACAATAATTTTGTTGTGCGCAACGTTTATGCTTCGTGTGCGCTGGGCGCTGGTTTAGAGACCGTAGAAACGCATCTCGCACCGCTTTATGTTGCTGATAGCGAAAGCGTTTATTTAATCGCCGGCGTTTATTTATCGTCAAATACGCATGCAGCAAACAGTTATGACACCGTAAAACACGATGACGCGCAATTCATTGTGCACGCAAGTTCCGGCAGAGAGCGATGTCACGAACAGCTTTATGTACGTACTGATTTTTGGTATTACTAACATGATTTTATCGGATAAACATTATATTCCGCACGACACTGGGTTCATCGGCGGCACGCTCGGCGGTATTGTGACTGTTGCGCATATGCCGGCTGTGCAACCTGTTTTTTTGTTAGACAGCAAAACGCTGCAGATTGTCGCGCAAACGTACTCAAACCAAAACGGGCATTATTGTTTTACTTGTTTGCCGGCTGATAAATGTTATACGCTCTTTGCGCGCGACCGCTTCAAGCGAGCGAGTTTGCGTCCGCCGGTGTGGGATTACGTCACGCCGGTTAATGATATGAGCTTAAGTGAGCAATATGATTTTTTAAAAGCATACGATGAGCATGTATGATAATAAGCAATTGCTTCACAGTTAATAACAGCGCAGCGGTCAATGTGACGCATTGTGCAAACCACCAAGCAAAACCTGTGCCGGCTATGCACTGCGTGCAAGTGCGTCATCATTCGTCAGCGACCATAGCGCGCTGTCATCATTATCATGTTACATCATTTTACCGTCTTTCATCATGCGTTCATGATAAAGCGCAGTATGCCGCACCGGTTAAACGCTGCCAGCCGGTTTTATTATCGTTTTATCGTCTGCTTAGTAACTGCGCCACCGATACAACGCCTTCCAGCGTCTCTTTAAAAAACTGTATTGCGGCGCTGATAATGTCGCCGTTGAGTTTTAAAAACTGCACTGCAGCGCGCACAGGTACGATGATATTTTTGCACCACTGTACCGCGCCGGCGGTTATCAGTTCTCATATCAAAAATTGTCAGATGGCACGCTATCAAAAAGCTGTGCGCCCGCCATGCTTATTTTTCCCCGTCCCGCTGCCGCCGCCCCCGCCGCCGGGAGAGGCACGCGATTTTTGCCGCCTGCCACCGCCATCGTCACAATTGCCGCTGCATTTCTGGCAACAACCGTTAAGTAGCGATGCCACGGCTTTACCGCTGCCGTTTGCATGTTTTGACAAACCGATATATTCATTCATCGTTCCAGACCTAAAGACATATATTATGCACAACACGATTACCGCTACTTTTGATAACGAACCGCTGCACTTACTATCGTTGCGCCTACAAACCGATATAAACGCGTACTGCTGGCAAGCCAATTTTGATATCAGCGTTGACGATTTCGCACGTTTGAATATAGACAAACGCAAAAAAGGCGATGAAGCCGTTGTGTCTATCTTTATTAATGGTGAGCGTTTTGATGTGCTTGCAGAAGATTACAGCGATAACCGTCGCTTTATCGGTAACACTTACACGATTACAGGACGTAGTATTACCGCAAAACTCGGCGCAGATTATGCCGCCGGCAAGCATCAGATTTATCAAGAAGCACGTTATGCACGCCAAATCGCCGATGAGCAATTGTATTTATTACCCTACAAGATTGCGGCATGGGAATGTATCGATTGGTTAATACCAAGCAATCATTATGCTGTCAACGGTCAAACACCCATCGCCGTGATTGCCGATATCGCAGCTG
CCGCAGGTGCTTTTGTCAACAGCCATACTTATCTGCCTGAATTGAGCATTAAGCCTGTTTGGCAGAAAGCTGCGTGGGAAACAGTATCCCCAAAACATACCGTTCCTGCTTCATTAATTTACAGCATCAGCGGTCGCAGAACGATTAAAGTACGCGCCAATGCCGTGCGCGTTGTCGGCAGCGGCATCAGTGCGCGCGGTTTTTTAGTCTATCGTGAAGCCAGTAATCAAATTCCGGAAGCCGCGGTTTTAAATCATGTGTTATATACTGAAGAGGCGGTCGCGCGTGCCGCCGGCATTCATGCTTTATCCGAAACCGGTATTCATAAAACCGAAACGGTGACCTTGCCGGTTGCCGATAAATATCAATTGCCGCGCGCCGAACTTGGCGATGTGTGGGCATTCAATGAAAACGGCGAGCAATTTCAAGGCGTGGTAAAAAGCGTTACGCTTACCGTTTCTCTTGAAAATGACGCGCCGGTGGTGACGCAAACTTTAGATGTTGACCGTTATCTCGATTTTTAGGTGTTTCCGTGGATATTTTAAAGCAGTTCAATGATTTAATTGCGCCGAAAAAACAACAAGCAGCGCGAGTTATCGCGCAAAAAGGCGCTGATGCGTGGGTTGCAGAAACGCCGGCGGGGTTAATTGTTGTTATCACCGGCACGACGCAAGTCGGTCAGCATGTTTATTATGATGATTATACGAAGCGCATCATCGGACAAGCGCCGGCGGTTGCTTGGACGGGTATTCCCGTGTAATTATTTAATGTATTTTATTGCATTTTTTTTCGCTGCATTGCTTCGTAAAAGCCCCACGCGAGACAGCGTTCAAAATCGGCGAGGTCAATATCGACAAGCAGCGGATTAATTATTAAAACTGCGATAAACGGGTTCTCCGGCGGAGCACCTTCGCCACCAACTATCGACCAACATTCTTTCGCAGTTCGGCTATGTAAGCAAACTGTAAAACGAATAACGTTATCAGAATATACTGCACCTCTTTTCCGCATACTTATCATAAAGTCAGCCATTTTGTTATAGCTTTTTATCAGATAGCACGAGTAATCATCATAAATATTAATAGTTTCACCGTTTTGCATGCGCTCAAACCAATCAATCATTGTTTCGATGACGCTATCTCTGACTTCATCGCGCGTGCTGATGTAAGTATGACCGGTGTTTACCGTAACGTGCTTAAAGTACTTCATATTTTACCTGCAAAAAATCAAGTTGCTGCTTCATAATATTAAATGGTTCTGTTTCTGCTTCGATTAACGTGCAGTCTTTTTCATGCTTAACGCATGCTAAGCCCGTTGTTCCGCTGCCGGCAAACATATCGCAAACTGTTGATACATTGAACGCGCCGATGATGTCATCAATCATTTGCTGATTTTTCTGATATCTGTAAACAATATCGCGTTTCGGCGCATCAAAAAACGTCGGATAATAATGCGTTTTAGTGTCGCTATATTGATCCGCGCGCGCAACTAGTCGTCTATCAAACGCTGATTTAACGCCGTGTTTTTTAAAATAAAAAATGTTGTTATGTAGATAATGCGGCATCGCATAGCTTCTTGATTCTTTCGGTTTACTTCTATTTGCGACTAAATCAAAACAAAAATCCATATCTAATTTTGGATAAAGTTCTAACACTTGACGCATTGATGCTATTAACACTAGATGCTCAAACTGATAGTTGCTTAAAATTTTATGCAACTTGCCTGCGCCCATATCAAACGGCGGGTCAGTGAAAATCATCTCAAAGCTGCCGCCGCGGTTTTCTGATTTATCATTGATTAATGTTATCATTTATGTGCCCCGTATATTTTCGCGGCAACCGCAGCCGGATTATTTTCAAAATTGCGCGGCATGCTTTTTATATAAAAGCTCTCGCTTTGCAAAACATAATCATTATTTTTATAAAAGTTATTATATTGTTCCGGTGTTAGTTGCTTGCGTAGTTCTGTTTGCAACAAACAATACAATTGCGCGCAGCGCC
TGCCTACAAGCGAGTCATCAAATTTTGGTGAAAAATTAAAGTATTCTTTACGCGCGCGTTCTGCTTCTTTGCGCTCTTGTTCTTCTATTAATTTCTGTTGCTCTCGCGCTGCGATTTTTTTGCGCTCGAGCTCAACCTCTTTCGCAGTGTAAAAAAATAGTTCGATACCCTCGCCGCGCCCCTCGTGCCGCATTGATAAATCATTGAATTCACCACAATACGGGCAATGCGTCAACGTTGCTTTGAAAAATGTGTCGCAATAGCAGCAAATAGTTAAATCGCGGCGGCGGATTTCGCCCTGTTTTTTATTCCAATCAACATTGTCAGATGGCAAGCCGTGCTGAGCGACGGCGTGACCGCAAACGTCAATAATCAGCGCTTGCTTATTTTCGCGCGGTCTAAGCACACGCCCACACAGCTGACGATATAGCCCGAAACTATTTACTTTTCTGTTAATAATCAGTACGTCAGCATCTGGCACGTCGAAACCTTCATTAATCATATCAACCGCGACTAAAATTTGAACGTTCTTATTATCAAATTCATCTAATATTTTTTCGATATCGTACTGCGGCATTTTGCTATGAATAACCGCGGAAGAATATCCTGCTTTTGCGAGCTCATCTACTGAATAAATTGCGTTATCAATTCTCGGCTCAATAAGTATTGTTTGTTTTTTCTTGCAATATTTCCGCGCGACGTCGATTATTTCGGCGCCGCGGTCTCCTGCTTCTATGTCAGCGTCGCCGACCCATATTTCTATGTCAACGCCATTATCATAAATCACTTCACGGTCTTCTATTTTGTAATAGATGCATCGATATTGCGCTAAATATCCCTCGGCAATTAATCGTTCAGTACCATTTTCCTCATACCCTTCTGCCTGTATGATTTTGTCAAAAAAACCGCCGTACTGTTTGAGCATCGGCTGCCCGTCGCCGCGGCACGGCGTCGCGGTAAAACCAACGCAACGCACATTTAAACTGTCAGCGATATAAAACCATTTGTTATCCTCAGCGACGTGGTGCGCTTCATCAATTAATATTATCTTTGTGTTGTCATCTAACCCTTTGCCGCCGCGCTTTATTCTGCTGTTAATCGTATCAATCGATACAAGATAAATGCTCGCCGCCGGATTAATAAAATGCCTGCCTACTTTTGCAACGTTGTTTCGTGCGCATATTTTTTTAGTACTATTCGCGCCGATGATGCGGTGCTCTAATCCGCACATTGCGAGCTTTTCGCTCGCTTGTTTTATTAAGATATTGCGATGGCAAATAATCGCGGCGTGCTCATAATATTCTGCGAGCTTCGCAATTATTGGCGTTTTGCCGGCGCCAGTGTCTAATTGAACTAAATCGTCATCGCTTGAATTAATTACTTTGTCAAAGATTGACTGCTGATAGCTTCGTAAATTCATTCCGTCACCATTGTTATAAAGCCGCCGCACAGGCGGCTTTAGTCATTCTTAAATCTCTACGCCTAATTTATCAATCGCTGCCTGGCGAATTTTTACTTGCTCGTCGTAGCCTTCGTTACCCTCACCGATTTCATTTGCTTTTCGTGTCGCAACAATGAGCGTTTTTATTGCTTGCTCTTTTACTGCAGTTGATATGTCTCGCGTTTCAATCCAAAATTTTGAATGTTGCGCACATGATGCAGATTTCAAAAATAAATAGACGTCCTCATCAGCGCTTGCGTTTTCTAAAAATTTCCGACGAATATCTTCCGCCCATTTTTTTTGCTTCAATGTGCCTTTAAGCGCTGTTAAACCCAGCAGTTTCGCTGTTTGACGCCGCTCCGCCGTTTCTTTAATCACTTTTTTTCTATACGCTTCACGACGTTCTTCATACGTCAATGAATAAAAATATGAGTTATAACGGCTACGCAAATTATAATCATCTTCACAGTCTCGTATTGCACATAGCACGTCTCTTATAACTTTAAGTAATACGTCATCATCATCTAAATCATAGAGTTCTAAGCCAAGCTCGCCTGCT
TTCACTCTAATGAATTCTCGTACATTTTCCAAATCAATTTTTCCGCTATCAACCATTTCTTCAACGTCGCTATCGTAGAAACTTCGCAAAGATTCATCAGATTGCATAAAATCGTAAACTTCTGTTACGAGCGGCATCGCGGCGATGAGCAATTTAATCGCTGTTACGTATGTTGTCTCATCGAAGCCTTCTATAATTCGGACTAATGCATGTGAAGCGTGGTAGCTAAGTTCTTCATTTAATATGTCGTTTGATGTGTCAATGGCGAGCTGTTTAACCATCGCGATTAATTCTTCAGCTTCCGCGGCGGTGCTCATTTTATGCCTGATTACGTTCTCATCTTCCTGCCGTTTAATAAAAACACTGTCGCCGTCGTACTCAACGATTACTGTTAATTCTCTGTCGCCTGCAAAATAAACGCTTATTAATTGCACAAAGCTCGAAATTTTCGGCGCGTCGTTTGAGCGCGTATCAATTTTGATATAGCCGCGTCTATAGTACTGTTTAGCTTCTAATTTGATGTCTTCAATTTTGTATGTTTTTTTAACTGTTTCTGTTGTCATTTTGCTTTTCCTTTTTTTGCGTTTTTGTTTTGATGTGCGTATTTTAAACATTATGTTTAAAATGTCAACGCTTTTTAAAAGTTTTTATTACTAAAATTCACGGCGCAAAAAAGCCAGCGCGTGGCTGGCTCGTGTTCTTTGCTTGCGATGGTTATGTATTGCGTTCCATGTAATTTGCTAGTATTTTATGGAATTGCGAAAGCATAGCGACGTAGTCATAATTTAAATACCATGAATTGATACGTCTTTCTTTATCTATTACGTCTTCAATAACATTAAATGCCCTCTTCGCCGCCTCGCCGGCTATGTCGCTGAACAGCTTGCCGAAAACTTCCGGCGACAAAGACACGGCTCTAACCCGTTTGCTGTATTCCTCGTGACGCGCGTCTGGTCTATCATTTGTCGCTGATTCGATAATGCAGCGATATAATTCTTTTGCAGTCGCCCGACCTAATCTGAAGACCTCGACTAATACGAACTTCTCAAAAGAGTCCCGTATAGGAAATGTGCTTTCGTCGTGTTTGTAGTCGTCGCCGAAAAATTTGCGCATAAAATCATCTTCAAATGTTATATCGTAATATGTCCCCTCGCGGCAGTTACTGAAAGACCAGTAAGTGACTTCACGTGTCGTGACATCGAACGGCGAGCGTATAATTTCACCGTTATCTTCGTGCTCAATCGCATAGCTAATATTGCCCGCGCATACTTTCCGATTATCTTCCTCGCCAGACATGTAAAGATAAGCGCCGTGGTCGTCAGTTATAACTACTTTAAATTTAGAAAAATCTTTCTCAGTTTTGATAATAGTTTCTGTTGTCATGTTACTTTCCCTTTTTAGTTTGTTTATATGCCGCGCGTCTCGCTTTCAACGATTTTGCACCAGCTCAATAAGTCATTTGTCGCGTCGTAAATTTCTTCGTAAGAGTACATAGCTATTCCCTTAAATTTTTTCGTTATGCAGCTGCATCACCGCTTCACGCACTTCATGCTCTAATTCTGCTGACAATTCTGGAATCGGCATCGGATAACCGTAAGCAAGAATGATAAACGCATCGACAACCGGTTCTAGCCATACCAAATCACGCCCGATGCGCCGCGGCGGCGGTAAACAACCGCCTGCCAATAAATCATGAAATTTCGTTCTTTTAACGCCAAGTCGCCGTGCGAATTCTTGGCGTGTGATATTTCTAATCATTGTGTTCACCGCCGGCGTTAAAACGGAACATTGTCGTCGACGAAATCGTCGCCGTCGCTTTCGCGCGCGTATTGATTCGCATAATTGCGCGCATTTTGTTTGCGCTGCGGTTTTTGTTGCGGTGCTCTATTGTTGCTTTGCACAGTACCCAGCATTTGCATTTCGTTGACGATGATTTCAGTGGTGTAGCGCTCAACGCCGCTTTGGTCTTGCCATTTGCGCGTTTCTAATCGCCCTTCAAT
ATATAATTTGCTGCCTTTTCTGACGTATTGCCCGATGATTTCCGCGATGCGCCGATAAGCAACACAACGATGCCATTCTGTTTTTTCGCGTTTTTCGCCGGATTGTCGGTCGTTCCAACTCATGCTGGTTGCCAATGAAAAATTTGCCACTTGCTCGCCGTTGGTCATCACGCGCATATCAGGGTCTTTCCCCACATTGCCAATCAAAATTACTTTATTGATGCCTGCCATCATTGACTCCGATTAACCAATTTTCCAAAAAATGCTTTCTTTTTTCCGATACTTATCTAATTCAACGCCGGCGAGTTCTGGTATCTTGCTATAATCAACGGCACCTTTTCGCGTTACTTTTGTGCACGTCACGCCGCCGCCGGCGATTTTCTCGCGTCCAGTTTGTTCCGCTAACGCTTCAAGCTGCTTTTTCAAAAGCGTTTCGCGCTCTGATAATTCCGCAATTTGCTCACGAATGAGCAGCAACTCTTGCGCGTAATCATTCCAGCCTTCCGGCGCGTCTTCACTGTACGCTGCCAAATCACGCTCAAAAGCTGCCCAGCCATCGCGAATGCGCGCGAACCAAACGTTATCAGGCAATACTTCTACCGTTGCCATCACCGCCGCCGTGCCGTCGGAAACGGTGAACAAACATTTTTGAGCACCGGAGACTAAAAGCTGCTGTTGAATTTGCGCCAAATCATAATCTGCAACCGCACCGGCACGCGCTAACTCCAAGCGATTTTGCGCGCGTTTGCTGCCCGTGAAATGCTTATGCTCCCAAAGCGTTTCGCCGTCTAACGTTATGCCGTCATAACTTGCCGCGAGTTTTGTGCCGTCCAAAACTTCAACAACCGGCGCCAGTGCAATACCATATTGCTCTTCGATAATAGCGCGGGCGCTGTCTTCTGCTGCGTGCCCGTCGGCGAATATTTGACGTGTAAATGCATTTAACAATTCTTTGTAACCGTATTTCTTCTCTGATAAAACTGATTCTTTGCTGCGGTATGCGCCCTCGATTTCTAAAATTGCAGCAGCATCTGAGGCGGTAAAATGCTGATTCCGAAACGCTTTCCATTGTTCGCTACCTTGCTGCACGTCAATAATCATTGCAAACACTCCGCGCGCATTATTTGAATTTGCTCATTCGTTAACACCCAATTTTTCGCCGCCGCGCGTTTTGTTACCAGCGCGATGACGGCATCAAAATCAACGCCTTTTGCCACTTGATTTTTTAGATAAACTTTCAATTCTTGGAATTCTTTTTCCGGCACCGCTATTTTTGCCGCTTGTTTTTTGCTTGCCGCTTTTGTTTGCGGCGCGTCGTCGGCGTCTGGGTCACTTTCGCCTTCGATGCCGAACAAACCGCAAAGCGCATATTTACGCGCATACGATGAAGCCGCGCCGGTAATCTGCGCTGGGTCACTGCCTTTTTTCTCTTGCACTTCACGCGCGCAGGCGGTCACAATAATTTGCTCTTCGGGCTTTTCCGCATTTACTAACGTCGCTGTTGCTTCAACGTAATGACGACCGTCAATAAATAAAATCTTGTCGCTCATGAATAAATAGATGCCCAAGCTTTCAAGAATTGGTTTAATTGCCGCGAGAATGGTTTCTAAAGACCGGTATTTAAAGCCGGCGAATTTATTATCTAAATCCTTTTTTACCACTAATTTTGTTTGTGCTTCTGCCAGTTTTTGATAAATATTCATGATTCTGTCCTCAAATTATGATTAATTAATCAGCAAGATAAAAATAAATTTCAAGGTCGCCGTGTTCGCTAATGTGCACTTTCAGCTGAACTTGTTCGCTGATGCTCACAATTTCGAAATTCGGCGTTCCAATTTCATAACCAACGGTTGCAGCGAGTGTTCTTTCGCCTAAGCTGCCGCCGACGAGAACATAAATATCAATGAGTTGGCAGCGGTTTTCCGGTTTGTATTGCGCGCAAACAAACTGCAAGACATCTGTTTCGTTTTGCGTTTGCGGAAAGAATTCGTTGAATTTGA
TTTGAAAATAAGATGCGATTTCTATTTCATCTTTACGGTTATTAATAGAATTTGCGTACATTATTTATCTCCTCTGCGAGCGAATAACGCGCGGCAGGCGGCTTCTTCACCGTCAGCGCAGTAATCGGCAAGTTGTTGATAGGTCGGACGCGGCGCGCTATTTACTTCATAAAGCGCGGCATAGATAGCACTACGTGTGCAAGCGAATATATTTAAAAAGAGAAAAACACACAGAAAAATATTTGTGAGTAAACTATTGTTTTTTGTGAAGAATGTTGCCATTTTTTGCTCCGTATTCGTCGTTTGAATAGCACTCATGTTAAACGTTATGTTTAATAAATGCAACATTTTTAATTATTATTTTGTGTAAATTACGTTATGCTATGTTTTTATTACATAAAAAAACCGCCGAAATGGCGGTATTCACTCAAAATTTAAACAGTATGGGCAGAATATCAATTCATTTTTTCTTTTGTAATTTGCAGGCTTTGGCGATATTCAACGGCTTCTGCGGCTAATGTCCATTGCGCTTGTATTTTTTCCTCTTCTAAATAACGTTTAATTTCGTTTAGCGGCAGGCGGAAAAACTCTTTACGTTTGTTTACTTTATTCACAGCATAATCAGCAAAACGCTCATGCAAAGAGCTTTCAAGCGCTGGCGCATCAGCGCTATAAATCATGGCATGGACGTCGAAAGAAAATGGAACAGAAGCATCGCCAAGCTCTCTAACGCGGTCGAGCGGTTCAAGCCGTCGTGTCATACCAATTTTATAAACATCTTTCCCGAAACTTCCAATATTGCTGATGATATAAACGTGTCCGCTGCGTGTTTGTTGCGCCATCGATAGCGCTCTCTTTCCTTTAGTTTCGGCTTCTGCGAGTTTTGCTTGTAGGTTTTTTAATTGCGCTTCGTATTGCGCTTTTTGCTCCTCATTTGCCGCAAGCATTTCTTTTTGTAAGCGTTCAACCGCTTGTTTGGCAATAAGTTCTTCTCTTTCCGCTTCGCGTAGTGCGCGTTCAATTTCTTTTTGCGCTTGTGCGTCTTCACGCATTTTTTCTTTGATTGCGCGTTGCTCTTCAGCTTCCAATTCTTTAGCAGCTTGATATTGATAATATAATTTGCATTCTTTCATTTTTAACATGACGTATTGCGGCGTTAAACCAATTTCTAAAGAAACCAGCATTTTTTCTAAACTTTCAGCAAGTTGTTCAATCCGCGTCATTGTTGTTTCGTAGTTTCTGCTGTTTAGTTTTGCAAGAAGATAATCACATTCAGAGTTAAAACTGCGTACTAAAAGTTTTCCTTGCGCAGTCGCAACTTTTTTGTCGTAGGATCCGTCTCCGGTTAAAATAAAGTTTTCCGGAACGCGATAAGCGCTACCATCTTTAAGCATATTCTTTTGTTGCTGCCGATTGTCTTTTATCTGCTCTAAATAACCGCCGGCGGTGAATTCGGAATAACTCGGTAGTGGATAAAGCCCGTAATCGATGAAAAATTCAGCACCGCTATAAAGCGCTATCTTCGCCAAAATTGCTTTCCGCTGTTGCTCAATTCGTTCTATCGCCGCTTTTTCTTCCGCAAGTCGCGCATATTTCGCTTCAATTTCTTCCGAATGCTGTTTAATTTCTTGTAAACGTTGCGTTTCATGATTAATAGCAGATAGCTCATCACGCGCATCTTTCTCTTCTTGCCTTGCTTCTTTTATTTTCTGGTGTAGCGATGATAGATATTGGTTTAAGTGGTCTTCTTGCTCTTCTAATGCTTCGATATCTTCCGTGAGTTTTTTATAAACTCTGCTTTTTTTGAAAATAAAAAAACCGATTTTCGGCGCCGAAATAATAAAAAGCACAAGAAAAGCGAAAACAATAAAGGCGAAAATTTCGGGCACTATTTAATCCTCATTTTTCAATAATGCTTGTTTTATTTTATTTCGTTGTGCGCATCGCCGCGCCAGCGCTGCCTTGCCACGTTCCACCCACTCTTTAAACAGCGCTTTATC
TTTTTTGGTCACCACAATCACAACATCACCTGATAAGTAAAAACGCGCCCGATAATTTCAATATTTTCAGGGCGCACGAGCTCGTCTTTATATTCTTCTTGGTTGAAACTACTGATTCTAATCATGCCGTTGGGCGCGTGATATAAGCGTTTGACGCGGAATAAATCATCTTGCCGGATTGCGTAAATATCGCCTTCTTTAAGCACGGTATCGGCTTTATTGATGCCAAGCGTCGAGCCTTTGGGGATAACGGGCTCCATGCTGTTGCCGGTGAGCGTAACGCAAAATGCTTGGTCAAGCGGAACGCCGTAACGGTGTAACGTTGATTTCGCAAACGGCAGGCGGAAACCGTTGTAATCTTGCATTTCGCAGCAACCTGTGCCGCCTTGAAATTCAACATCTTTGAAAAACGGCAAATAAGCATATTCATCTTCAGGCAACGGGTCGTTACTACTCCAAAGCCGGAATTTGCCCATTGTGCCGATATCACTTTCAATAGCTTGTTTCTTTAATTCTTTTTCGAAATCTTTATATTCCAAATCAATACTTGAAACGCCTAACGCGTTAGCCAAATCTTGCAAAAAACGTGGTCGCAGCGTCAAACCTGCTTCAATTTTTTGAATCGCAGCTTGTGATTTGCCAATTTTTTCGGCGAGTTGGTCTTGTGACAGATTATTCAATTCGCGCAATATTTTTACGTTGCTAGCTAGACTGACTTCAGATTTATTATTTGTGCGTCTTTGTACGCCGTCGATGATGTAGCCCATATCGATACCAAATTCTCTTTCAGCTTTTAACACGCCAGATTTTGATACGCCGCGCGTCCCCCAATTGGTAACAACTTGTGGCGTGGTATCTAGCGCTTTTGCTAATTCGCTTGGAGATAGTTGTGTCGCTCTTAATAAGCGCTCCAGCGATGGGTGTAGCGTCTTCATTTCACTCCTCATTTTGTTGCTCACATTAATCAATTTCAGTCTTATTTTACACAAAAAGTGGACAATGTGTTACACGGCTCGCGTATTTTGTTTGTGTATCATACACATTTGTTTTATAATAAATTACACATTTATGGGAGATTGGTATGACTGACAGTGAATTAATAACGAAATTGGGCGGCGCAACCGCATTAGCACGCAAATTAGGCACGACGCCGCAGGCAGTGTGTAACTGGAGAACGCGGGGCATACCCGCAAGAATAAAACTGGAATATCAAGAGCTCTTTGAGAAGGCAGCAAAAAACGAAAAGAGCAAATAATTCGGGATAAAAAAAAGCCCAGTTGAAAAACTGGGCTCTACAAAACACCAAAAGGTCAAACAAAATAAATAGGTATTTATTATGGCACATGTTGAAACAAAAAAGCAAGCACGCGTTACGCAAAAGCGATTAATTGCAGCGCATCTCAAAAAGCACGGCTCAATTTCGTCGTGGGAAGCAATTGAGCTTTATCACTGCACGCGTCTTGGCGCGTACATTTACGAGCTTCGAGAAAGCGGCTGGGATATTTCAACGCTGCGAAAAACATTTACGAGCAGCGTTACTGGCAACAGCGGCGTATATGCGCTTTATCTTTTAAACGAATCAAACGAATTGGGGGAATAAATGAGTCTGAAATATATGGTTGATGCGCTCGCCATTAAGGTTGGTAACCCATTGCGCAAATTAGTGTTAGTCAAGCTCGCTGACAACGCAAACGATGACGGCGAATGCTGGCCATCATATCAAAAAATCGCAGATACATGCGAAATCAGTCGTCGCTCTGCGATAAATCACATTAAATGGCTGGAAGAGCACGGCTTTTTAATATCGTGCGCTAGAAAAGATGCTGACGGCATGAATCGTACAAATATTTACAAACTCACCATCGCGGAAGGCAAAAATGCTGACAAAAACGATGGTGGAAACGATGGTGGAAACGATGGTGGTCAAGATTCACTGCATAGCGAACAGGATTCATTGTGTAGTGAATCTATCGCATCGGGTAGTGAATCTATCGCATCGG
GTAGTGAATCTAATGACGCGAAGGTGGTGCAGCAGCTGCACCCAGAACCAGTCAATAAGAACCTATCAAAGAACCAGTCAAGGAACCATGAGTCGCGCGCGAACGCGCGCATGCGCGAAGCCCAAAACTTCGACCCAATTTCTGCTTTGAAAGCCGAGGGTGTGAGCGAAGAAAAAATCCGTGATTGGTTAGCTATTCGTGAAACAAAAGGCGCTAAAAGTCTCACAACGCGTAGCTATCAAGCAATCATGAGCGAAATCGAAAAAGCAGGTTTATCGGCGATTGAATTCGTTGATTTGGCGCTGTTCAAAGGCTGGCGCGGTTTCGGCGCTGATTGGAATTGGCAGGCATCGTTTGAGGAAATGAAGCGGCAACAGCGCGGCGATGCGCAAACGCAGCGCAGTGATTGGCAATCGCAATTCCCGTCAATAATAGAACTCCATAAGGGCGGCACGCCGGACGAGCTAATTCCTTGGATTAAAAGGTGAAAACATGGAAACAGCACAAGATATTGTTGCGCGTATTATCGAAGAGCTAGATAGCGTAGACGAAATTGATTGCCCGAAGCACGGCAAACAAAAAAGCAGTAACGGGCAATGCTGGGCGTGTATTCGCGAAAGTATCGAGAAAGAAAAAGCAGAGGCAGCAAAACAAGCAGCATTGACGGCAAGAGCAGCGCTGTTGAGCAAAAGCGGTATCCCGCGGCGGTTTATGCATGCTAGTTTCGATAATTACTGCGTTGATAAAACGATTAAGCAACAGCAGCGCGCTTACGCCATTGCACGCCATTATGCTGAAAATTACGATGACAGCATGGATATCGGTCGCAGTATCATCATGACCGGCGCCGTTGGCACGGGGAAAACGCATTTAGCAGTCGCAATCGCACAAAGTGCCTTGGCACGCGGTAAAAGCGTTTGTTTTACCAGCGTACAAAAGTTGATTAGAAGTATTACAGATACGTACAGCAGAGACGCAGAGCAGCGCGAAAAAGACGTTTTTTCAGCTTATCGCGCAGTTTATTTGCTTATTTTGGACGAAGCAGGTTTGCAGCGTGGCACAGAAAACGAACGCAATATCATAACTGAAGTGATAAGCGACAGGTACAACGACCAAAAGCCGACAATTATCACAAGCAATTTACCGCTTGAGCGCTTAAAAGATTACTTAACAGAACGCGCTGTTGACCGGCTTTTACACGGTGGCGTATTGCTTGAGATGACGTGGGACAGTTACCGCCGGCGGGGGGATTTATGATGGCACTTGACAACGGGAAACGGGTTAGTAATACTGGAGTCCTCTCAAAAACCAACAGCGGACTTCCGCACCCGTCAAACGCGGTTTTTTTATGCCTATTTCTAGGAAAGCCTAATAATGGCGGGACGATAGCGAAGGAATATAACACCTTCGGGGAATACTTTGCAGCGGAAACTGTTGGTTCCGAGAGTGAGTCCCGCCGCCCTCAAAAAAAGCGGCTCCTCATAATCCAACAGGAGACTGCACATGTCTAATCTTCAGGCTGAGCCTTTATCTTTTAATCACGTTATTTTCAAACCTGTCGAACTAAACGATGACCAGATTTGGTTAACCTCACTGCAATTAGCGCAAGCTTTGGGGTATAAGCGGATAGATTCTGTCTCACAGATTTATCGCCGCAACGCGGATGAATTTAGCGAAAATATGACGCAAGTCATTGATTTTCTCGAGAACGTCAAGTTGAGGGTCTCGAAGAAAAACTTTGCGTTCTTGACTTTTCAACGCATGCTCAGCGATAATCGAGTTGCCGCAGCAATATCTGCGGTCGAGATTAGCCTCTCGTTGATACGAAAGCGTGTGCAGCGCTTCTACCGCGCTATTTTTATGCCTAAAATTTGTTATGGCGGCATACGTAGGAGAACCGCAAGGTTCGCCGTTTCTTTCGTAGCGGCAAGGCTAATCCTGCGTATGCTGTCGCCCTCTTATTTAGCCTTAACGGCGGCAGTCAATCTTCAACTACGAAA
GGAGACTGCAAATGTCTAATCTTCAGGCTGCGCCTTTAACCATTCTCAACGAAACCATTAATGTTTTTGAGAACCTCTACTCTTTAAACGATTTACACAAAGCCAGCGGTGGCGCAAACAAAGACCGCCCAACTTTCTTTTTACAAAATAAAGAAACACAAGCGTTAATCGCTGAAATTGAAAGAGAAAATTATAATGCTTGTAATCCAACATTTAAAAATGAAAGCGGAAATCATAATGTTGGTGTTCCAACATTTAAAATCGAAGCGGGAAAACCTGCTTTGGCGATTAAAACAGTTCGAGGACGAGTTAAAAACCGCGGTACATACGTTTGCAAAGAAATCGTTTATCGCTATGCGATGTGGGTAAGTCCCAAATTCGCTTTGGCGGTTATTCGCGCCTTTGATGCCGCTGTTACCAATAACCTCCTGCCTGCGACCATCACCATTGAACAGCAGCGCATTATCCAAGAAGCGGTGAACGCTAAAGCGCAGCGCGACGGCATCAGCCATCAGACGATTTATCACGATTTAAAAACGCGTTACCGCATTCCGCGTTATAACGAATTACCGGCGGAACTATTCCAAGATTGTTTAGCGTGGCTCGGTGGTTATCACATCGAATATAAAAACAGCAATTTTGAATATCGGAAAGAACGTGCGTTTGCTGTTGGGATCATGGAGCACGTTGCTCTTGCTTTCGAAGAGCAAGACGAAGAGTTGCATGACTTGTGGACTAAAACCACATGTTTATGCCTTGCCTTGCGGCAAATTGAGAAGCAAGCGGAAGAATTGCGGCGCGGCTTGCAAAAAACAATTCAAGGAAACTGCGCCGAAGCTCATGTCGGTTTGCGTGCAGCGCAATCGTATTTGAAGTTCCCGACAGAGGTAATCAATGAAGGGCGTGAAGCCGCGCGTTCGTGTCTTCGCCGCAAAAAAATGGAAGGTGAAAAATGAGCGTCGTCATGCTAAAGGCGTGGTACCGCGCATTATTCATGCAATTATTAGCGCTGCAAGGCATTGAGATTGCGCAATATGCAATACAGCGGCTCGAATATATGATTGCGCGCGAGCCGTATTACGTGCAAATCGCTTATTACCAAGCCGTTTGTGCTATCGCTACTACAATGAAACAAATAAAAAATGCGGATAAGCTAAATATTTGTATCGGTATTTATGAGATTGACGCCGATAAAGTTAAGGTGCAGCTGCGTTACGGCGTGCTTCATGCTTTTGTTGCGCGCGACAATGATTTTTATCGTTCAGCAAAATACGAGCAGCTGTTATTAGGGCGAACGCGACGCACAAAAAAACAAGTTGATATGGGGTTGGTCGTCCAACCCCTGCCGGCTGGCTGGCGCGTGAGGTGGTAACGATGAGCGTACATACTTTTAAAGCGTTGTTGCCGATGCCGCCGAGCGTCAATCATTATTACCGCCGGCGCGGTAGTCATGTTTATTTAAGCGATGAGGCGCGCATTTTCCGCCGCGACGTGGTTTTGAAAACGATTGGCAAAATTCCGAATAAAAAGCGGTGGTTCGACAAAGAGCAGCGCTTAGCGATGACGGTCACTTTATATCCGACAAACCGCCGTTCGTTTGATATCGATAATCGTTTAAAAGCATTGCTAGACGTGTTGCAAGAAATCGGCGTTTATCACAACGATAATCAAATTGATGATTTGCGCGTGGTGCGCGGTGCGGTGGTTAAACGTGAGCTTGGATTTTGCGTCGTTGAGGTGAAGCCATGTTAATTGAAGAGCTTTTAGTGGCGTGGGGCGAATGGAGCCGCAAAGATGTTTTGCCGGCGCGCCCTCGGTGTTTTATCGCCGCGTTTTGCCGCCGTTCGGCACGCGATGCTGATGATTCGGTTGAATTAGATGATGAAGCATGCGAGCAAATTAATGAAGGGCTAATGGTATTACGCGGCAATCATCAGCGCATTCATGACGTGCTGATAAGCCGTTATCTTTACAATGAATATAACGATAAAGCCGTGGCG
AAAAGGCACGGGATATCAGAAAGCACGGTGAAAGATTACCGCCGGCGGGGGCATTTATTTTTAGAAGGTTGGTTATGCAGCTGTTCGCAGCGGAGTGAATAATGTTATATCCAAGTAAACACGACCACCCAGATAGAACCGTATTTTATGCGAGCTTTATCATGCTGCGCGCATTAAAAAATGAGGGCGCTATTTCTTATAACGATTTATTCCGGTTTGTCCGAAAAGAAATTAAAAACGGGCATTATTTATTTTTCGATGCATTACATTTCTTATTTTTGATGGGATTTATTTCTTACCAAAAGCAAACCGGTTTACTGATATATCAGGTGGAAAATGACCGAATTTATTGAACGCTTAGTTGAAATTGACCGTGAATTGACACGGCTGCACAAAAAAGAATATGCACTCAGAAAAAGCATTTCAGATATCGAACAAACAATTGACGAAACAGAAAACGCAGTCTTGGATTCGACAGGGACGAGAGAATCTAACGGTATCAAATGCGACCCTGCCGAAATGAGAGCATTTTTTGAAAATCATAAACTCTTGACGATTGAGCATGTTATTAAAAGACATGAAGAGCTCATCGAGTTTTACGAAGAAACGAGCAAAGAACGCAACGATGCGCTCAAAGAACAGCTCACTGAAGATAAAGCAGAACTTGAGCAGATAAATGAGCGTTTGCAGCTCGCGAATAAAGAACGCGACGCGCTTTTGGCGCAGCTAAACGATGGTACATAAAGCTATTATTGCAAATTTTTAATTGAAAAGATAACTTAATTTGTGATATTATAATCAATGTAGAATTTACTCTCTACAAAATGTTTAAAACAGTCATAAACGAACAACTAACGGTACGAAACGCGAGAGCGTAATTTGACCGTTGGTTATGATTGTTTTAAACATAAAAAACTCCTTGCCATACGGCATGAGGCGTACAAAAGGACTACCCGCCCTTTTGTACTGATTTTTGGGGTCATTTACTGACTGTTGGTAGCAATCAGTAAATTGTTCTATGGAAATACTGTCCGTATTCCTAAAGCGGTTACTTTTGTGACCGCTTTTTTATTTTCTGTTTTTCATTTCAAGCTTATTTCTCCTTTACTTTGAATTTGACCGTTAAACGGCGTTTTAAGCGTTCGGTTACGTCAAAAACTTGCAAAAATGCCAATTCGCGCAATTCCGCAAGTTTTAAGTGGTGAAATTTGAAATATTTTAATAATCAGCTTGACGCTTATTTAATCTTCAATACCAAGACGCTTCCGTGTTTTTGCTTTTAATTCGTCACAATAATCTGCCCACGCCTGCATCATTTCGCGGCGCTGCGTCAAATATTCAGCGTGATTATACGCTGATGAAACACTGCTTCGCTGCCAATGCGCTAATTGGAAATCAACAATATGACTATCCCAGCCTAATTCGTTTAATAACGTTGCTGCTGTACCGCGGAAGCCGTGCCCGACCATTTCGTCGTGTCCAAAGCCCAAAGTGCGCAATGCGGAGCCAATCGTATTTACACTAATCGGTTGCCGTGGATTGCGCACGCCGCAAAATACTAAATCACTACAACGTAACTGTTCGGCTTGGCGTAATATTGCGATGGCTTGCGTTGAAAGCGGCACCAAATGGGCGCGTCTCATCTTGATTTTTTCGGCGGGAATGCACCACAGCGCGCGCTCAAAATCGACTTCTTGCCAAACCATTCGCCGCAATTCAATTTGTCGGCAAAAAACCAGCGGCAGAAATTTCAGTGCCAAATAAACTTGCCAGCGTCCATTGAAATAATCGATTGCGTGCAATAACTCTCCAAAACGGGCGGGGTTCGTGATACGCGGATAATTTTGTGCTGGTTTCTTTTTTACTGTTCCCCGTATTTCCAGAATCGGGTTGGCAATTTTCAACCCAGTTTTTAAGCGCACCGTTTTGTAAAGTTCGCCGGCGAGGTCTTGCACGCGGATTGCCGTGTAATTTTTACCGCCATTATCTAACCAATTAAT
CAAAGCCCGCCATTCGTCTTCACTGACTTCCGCTAAAGCACGCGCGCCAAACCGCGATAACACATGATTGCTTAAGCGACTGCGGCAGCTTTCCAAGTAACTTGCGCTACAATATTTTGCTTTCTCAGATAGCAACAATTCTACCGCTTCGGCAACCGTGACGCCGTCTTGTTTTTTATGACGTTCGACATGTTGCGCATGTGGGTCAACGCCGCGCTCAATTTGCTCACGGTACAGCAAAACGATTTGACGCGCTTCTGATAATGAAACCGCGGGATATTTGCCCAGCGCTAATTCACGCCGCTTGCCTGCTAAGACGTAACGCATACGCCACAAAATACTACCGTTTGGCATCACCACGGCAAACAAACCATGGCTATCGGCGATTTTGTACTTCTTTTCTGTCGGTTTTAACTTTCTTAATTTTGCATCAGATAACATGTTGGGGCTCAAAAATAATTTAATAGGTGAACCCCTATATTAAACCCCTACATTGTGCGCGGATTCAAGCGGTAGCTAACGAACACTAATGAACAAACATTACCAAAAAAACTCTTATAAAAAAACAGGTTGCGACACTTAATGAACACAAACGGTTGTGATATTGATGCCCACATCATGCACACGGGTAAACTCACTTGTCAGCAACGCGCGAACCTTAATCTTAACGATAAATCGCGCGCTTTTAAACGGCATTTCAAATCACTTCAGGCTTTGTGGCAAATCATCGCGCACCCCGCGCGTTTTGCGCCGCCGCAGCCGTTACCCGTTGCACTTCCTGATTGGAATGATTTTTTTAATGCCCAACACGATATGCGTTTTATTTGGTTGGGACATTCTTCGCTACTATGTAGCAGCGGCAAAACCACTGTGCTCATCGATCCGATTTTTGCGCGTTATGCCGCACCAATTCCGTTTATCATCAAACGCTTCCAAAAACCGCCGCTCGCGATTACGGAACTGCCGCCCATCGATTGGATTTTGATCACGCACAATCATTATGACCATTTGGATAAAAAAACTATTCGCTACTTCAGTAAACACACCACCAAATTTGTCATACCGCGAGGATTAGAAAACTTTTTTATCCGTTTAGGCATTCAACGGCAACGCATTTTCGCGTTAAATCATTGGGAATCTTTAACTTTAAATGATATTACGCTGCACAGCGTACCGGCGCGACATTATTCGGGGCGCCACCTTTTTGATCACAATCGATCATTATGCGCCGCGTATGTATTGCAAACGCCTTTTGAAAAGTTCTTCTTTTCAGGCGACACCGCTTTTGGCAACGGTGATCATTTTCGCGCGATTGCCGCGCGTTTTGGTGCCTTTGATTTCGTTTTTATGGAAAACGGGCAATACCATTGCGCGTGGCATAATCATCATCTAATGCCTGATGAAACCATTGCCGCGGTCAAAATATTACAAGCACGCTGTTGGACGCCCATTCATTGGGGCGCTTACGCCTTATCAACGCATCTTTGGCATGAATCCGTTTCCCAAACGTGGGCGTTAACACAAAAAACCTGCTTAAAAATGTGCACGCCGCGGCTCGGCGCCGTATTCAGCCGCACTCACGAAAGCGATAAATGGTGGTAATCTATATCTTCTTTATTCGTAAAACTTTGAGGACTTTTTCATGAAAATACAATCAAAACAAATTGATAATCTCTGTTTTATCGCAACTAATGACAGCGGTCACGGCGTGGTGATGGAAGGACAACCGCCAGAAGGTCACGCTAAACGCGGCGCTAATCCGATGGAATTGGTGTTAATGGCGGCGGCAGGTTGTTCCAGCATTGATGTTGTCAGTATTTCGCAAAAACAACAACAAAAAGTCCGCGATTGCCAAACCACTGTAACGGCAGAACGCGCGGAAACCGAACCTGCAGTTTTTACTAAAATTCATATTCATTTCACCGTATACGGTCAAGATTTAAAAGAAAGCGCTATTGAACAAGCGATTAATTTATCTGCAGAAAAATATTGTTCTGTAG
CAATTATGTTGGGAAAAACGGCAAAAGTGACCCATAGTTTTGAAGTGATCAATGAAGCGTAATTGTCTTTTAAAATACATTAAACATCTCTGATATTTTCGACCCTTCTCCTTTTGGAGAAGGTTTTTTCTAAGGTTTTTTATGGTGCAAGCATTTAAAAATGAACTTCGTCCCACATTGTCATTAGCATTGCCGATGATTTGCACGCAATTGCTCAATTACGGTCAGCAAATTATTGATACCGTGATGGCAGGACGCCACACGCCGCTCACGTTGGCGGGCGTTTCTTTAGCAAATCAATTATTCGCCATTGTTTATTTATTTATGATTGGCATCGGCGTTGGTTTTTCGGCGCTCATTTCACGCCGGCACGGAAATGATAGCCACGTCACCATTCGCCGCGAATTTCAACAAGGATTTTGGCTGTTTAGTCTTTTGGGTATTTTGATGATTTTTGCCGTGATTGGCTGCGCTTATTTGCCCCAAATTATCGGTAGCGAAAAAAGTATTGCCGAAGAAAGTAAACGTTATTTATTGGTTTTAGCGCTGCCGGCAGGTATTTTTTTGTTGGGACAACTCGCCCGCTTTTTTTTAGAAGGTATGGCAAATCCCCGACCTATTAATTACGTTCAAGCCGCATTGTTGCCGGTTAATATCGTTGGTAACTGGTTTTTCTTAACGTTTACCGATTTAGGCGCTGCCGGCATGGCAATTTCTACGGGATTTTGTTATGTCCTTTATACCGGCGCTTTATTATGGATACTGGCAACGCGTCCGCGCTGGCGCCGTTACCGTTTATTCCACAAAATGAGCGCATTTAATTTGCCGATGATTAAAGAATTATTGTGGGTTGGATTGCCCATCGGCGCCGCAATGGTCATGGAAGCGGCAATGTTTTCTTATATCGGTATTATGGCAAGCCGCGAAAATGCCATCATCACCAGCGCCAATCAAATTGCCAGCAATTATTTGAGTATTATTTTTATGGTGCCTTTGGGCATTGCATCGGCGTTAACCATTCGCTGCGCGCACGCTTTGGGACGTAGAGATTGGACGGCAATTCGTTATCGCGCGTATACCGGCATGATTTTTTCCGGTGGATTTATGTTGCTATCGAGCATCGTTTTGATTCTCGCGCGCTATTACATTCCACTTTTTTATACCGATCACCCAGAAATCATTGCCATTGCCGCGAAAATTTTATTTGTGGTGGCATTTTTTGAATTTATTGACGGCATTCAAGTTTCCTGCGCGGGGATTTTGCGCGGCTTGGGCGATACGCGAATTTGTTTGTTGTACGCATTTATCGGTTATTGGATTATCGGCATTCCCACCGGCACGATCATGGCATACGGCTTTGATTGGGGCGTTTATGGACTTTGGGGTGGTTGCGCGTTTGGATTAGGAACTTTTGCTGTGCTCGGCGCGCGCCGCGTTTTTTATCATACGCATCGCAAAAGGACGTTACATGCCGAAGCCTAAAATACATTTAATTGCCGCGCAAACTTTAAACCGCGTCATCGGAAAAGATGGCGGTATGCCGTGGCATTTGCCGCGCGATTTGCAGCATTTCAAACAACGCACCAGCGGACATACCATCATCATGGGGCGCAAAACGTTTCAAAGTTTAAAAAAGCCATTGCCCAATCGCAAAAATATTGTCATCAGCCACAGCGGGCAACCGTTGCATGCTGATGTCGCCGTTTTTCCTGCTATTTCAGAAGCATTGGCTTTTTGTTGCAATGATGAAAAAGTATTTATTATCGGAGGCGGTAGTTTGTATGCGGCAACGATTGATGAAGCGGATTTTTTAACGATAACGTGGATTCATACCGAACTGTTGGGCGATACTTTTTTCCCCCCGATTCGACCTCAAGATTGGCAAGAAACGCACCGCAGCCATTTTGCCGCGGACGCGAAAAACGCATTTGCGCTCGATTTTGTCGATTATCAACGGCGTTAGCGGTTGACATTAGACTTCATCAGCCAATACAAGTTTT
TAATAGTAGAAATGGCATTGGTTAAATCCTGTTCGGAAGTTGACGTGCAGTGCTGGGCGATTGCTTGATGATAATTAAGGAAATATTCCTGATGGCATTGCTCGCCCAACGGCGTTAAATGAAGCCAAGTGACGCGTTTATCCATATCGTCAGCACTTTTTTGAATCATGCCCAGCATTGTCAACTCATGAATCAATTTGGTAATTCCAGGTTTGGTCACGCGCAAATTATCGCTGACGTCCGATATTTTCACCGCCCTATTTTTTTGTGATAACTCATGAATTTTATCAAGAATGTAAATATGACGCGGCGTTAAACCAGCAGGTAATTGCGGCATTATTTGGGTCATTTCTTTTGCCAAAAAACACGCATCAAGAAATTCTTTAATTAATGCTGATGAAATCATATAAAACCTTTAAAAAATGATGTATTGATATAATTTCTGAAATAAGAAAACCGATTATCAATTAATTAATTGAGCAGAATCTCATTCCTTTTAACATCAATAAAAAGATGCGCCAACATTTCAATAATTTCTATTATTTTTCTAATAAAAATAATATGCGTTAATTGCTTTTCTGCCGCTAAAATTATTGAATGTTTTTTTAAATAAGAAAAACCGCCCGTTTTATTAAACGTCATACATAATTTCATTGAAATTTATTCCTCTTTTGTTTGGTGAAAAGAAAGATAATTACTCGATTCATCATGCTAAAAACAATAGCGCTTTTATTTGATAATAATCAACAAAATTTTAATCTCCGTAAATAATGAATGAGTAGAATTATCAGAAAATCATTTATTAATCAGTATATTATTTTTATGTATTAAATATTGCGATGATAAAACGAAGAAAAATCTTAAATATTATTTCGCGCTGCCGTCAGATGCATACCATAATAAAATATGAAGAAAAACTCAAATAAATTTAACGATCAAAAATAGCGCCGCGATTTAATTACTCATCTAAATTTTATTACTTATATAATTTTTATTTTGAATATAGGTATAATTAGTTTGTATAAATAACAAAGAATTTAATAATAAATAAAAAATAATACTTATATGGTGACTCAAAAAGGCATTTGTTAAAGAAACCACCATAAAACAACAATTAATCGCAATGCCCTGCCAAGCAATAGATGGAAATTGTGCCAACATCACGAAATATAAAAATAATGCGCCACAATAAATAAATAAAAAATTGAAACAGCCCACAATGCCGTAACGAACGCCGATATCAATATAATCATTATGCAAATGATTAAAAGGCGCAATTGAAAAATGACAATCATTGCGCGCAATTAAACGCGCTTTTTCCGCTTGCATGCCCACGCTGCCCCACCCCATCAACGGCTTTTCCTGCCACATCAATCGGGCGCAGCGCCACATTTCCAAACGCATTCCTTGAGACGTGGCAACGGTTCCTTGGCGATAATGATCGAGATCCGAAAAAACGCCCCGCACGCGCGCGGCAACGCCCGTTTGAGGAATGCAATATAAAGCAGAAAAAAACGACAAGAGTAGAAAAAATCCCAAAGCGGCGTTTTTCAGCGTCACCCGTTTACGCATCATTATGAAAAATAAAACCAATATCGGCGGCACGATTAACCAACTGCCGCGTCCGCCACCCAAAGCTGCCGCGCAAACACCGCAAAACAGCGCTACGATTAAAATAAAACGTTGCCAACGTGTTTTGGCGTACGGCAACCGAAATCCCGAAAATACGGCAAAAATCATCGAAATACCGGAAAATTGAATGTGATGCTGATAGCCGATCGCGCTGTCGCCGCGATATCGCGCCACTAAAACCATCAATAAAGCCGTTAAAGTCCCCGCTAATAACGCCCAACTATAATATTTTTCCGCCGGCGGATAACGGCTTAAACCGTATAAAAGAATAATCGATAAACCATAACGGCTTAAATTATCGTAGGTAGATAAAGGTTCGCCGTGTAACAATCGCAACGTCAGCGAAATTGCGCTAAAAATCAATAAAA
ACGCAATCCAGCGGCGCTCAAAATACGATAATCGATGCGGCGATTTTTCGGCAATCGCAATGATGCCGGTGATGAGTAACGCCAGCATGCCGTAGTTATAACCGCCGGGAATCAAAAAGGTGCATAAAAAAATGGCGACTATAATGCCGCCTTGTATCGTATTTAATTTCACTTTTTCACCTTGATTTTTAAATCGCCATAACCTGCGGCGTCCATTTTTGACCACGGAATAAAGCGCAAACTGGTTCCATTGATGCAATAACGTTTGCCGCTTGGAGACGGTTCATCAAAAAAAACGTGTCCCAAATGATTATCGGAAACGGCACTTCTCACTTCAATGCGCTTTTGTCCGCCTTTAAAATCAGCATGATAAGTAAGCGCTTCTTCCGTTAACGGTTGCGAAAAACTCGGCCAGCCACAACCGGCATCATATTTATCATTAGAACAAAACAGCGGTTCTCCGCTTAATAAATCAACGTAAATCCCCGCGGCGAATAATTGATCATAACAATGCGAAAAAGGCGCTTCGGTTGATTGTTCTTGCGTGATAGCAAACTCTTCTTCCGAAAGGATTTTTTTCAATTCAGCAAGTTTTGGTTTTTGCCAAAAAGCGTAAAGTAATTGACGTTCTTCAGCGGAAAAAGGTTCATTTGCCTGCCGCGGTGGTTCCAGCGTTGAAAGCGGCGCATTTTCTGCGGCAAAAAAATTATCAAGCGCGGCAATTTCAATTGTGATCGGATTTAAATGGCGTTTTTGCAGCGCCGTAACTGCCGCGCGCGCTAAATCTCGTTCTTCCGAAAAGACGCTATAAATACCGTGGCGTTGGTGATGCGGTTGATATTGAAAAAGAATGGGTTCGAGCAGACGAAAGAAATGCAAAAAGACTGCCGAAAGCGAAAGAATATGCGGTTGATACGTCAACGCAACGGCTTTAACTGCGCCCGTTTTACCGCTACAAACTTGCTGATAACTCGGCGCCGGCAACGCGCTATTAGCGTAACCACAAACCACGCTAACCGCCCCCGTGAGCGCATGAAAATAAGCTGCTACGCTACGCGAATCTCCGCCTGCCAGATATATTTTTCTCATTGTGAAGTGACTGTATTGTGATAAAACTGCCCACAATTATGGCACAAACCCTCAGCGCCACGAGAAACAAATTTAAGATGCGCCCGAGACCAGAATCGAACTGGTAACCTTTCCCTTAGGAGGGGACTGCTCTATCCATTGAGCTACAAGGGCAAAAAGATGAGTTCACCGATAAGCCGGGTTCTGTCGTGGACAATCATTCCTCTGCGACTAACATTACTGCCAGCCTCTAGCGACCTACCCAGAATCACGACGGGCCGCCGCATTGATTCTCTATTTGGTCTTGCTTTAAGCGGGGTTTTCCCTGCCATTTACGTTGCCGCAAATGCGGTGCGCTCTTACCGCACCATTTCACCCTTACCTCAATCGAGGCGGTATCTTTTCTGTGGCACTTTCCGTCAGCTCACGCTGCCTGGGCGTTACCCAGCGCTTTGCCCTATAAAGCCCGGACTTTCCTCCAACTTTCGTCAGCGATTGTCTGGCAAACTCAAGCGGCAAATTGTAGAGAAATTAAGGGGAAATGCAACTATTCCCCGTCGGTAAATAAAATATTTTGCAAGACCAGAAATAAAAACTGATAAAATTCTTTCATTTTAATTTTAGGAGTTTACGATGATATTTATCGCTGTTTTAGCCATCATTGGAATCGCCGCATTATTTTTAATTCGGATTTATAACCAACTCGTTCGCGCACGAAATGAAATCCATAACGCTTTTTCTCAAATTGATGTGCAGCTGCATCGCCGCCACGATTTAATTCCTAATTTAGTTGCCGTTGCTCAAAAATATTTGCAACATGAAGAACAAACCTTAACGCACGTTATTTCCGCCCGAACCACCGCCGTCAATGCCTTAAAATCCACACAAATTGAACAAATTGCCCCAGCCGAAAAAGCGCTCGAGCAAGCCAT
GAAAGGATTTTATGCCTGCATTGAATCCTATCCCGAATTAAAAGCCAATGAACAAATGCAAACGTTGCAAGAAGAAATCAGCAGCACAGAAAACCGCATCGCGTTTGCGCGGCAATATTTCAATGAAATGGTGACGATTTATAACGCCAGCGTTGAACAATTTCCCCAAAATATCATCAGTCAATTATTTGGATTCCGTACACAATCTTGGTACGAAATTGAAGAATCCGTGCGCGCTTGCCCAACCGTCAGCTTTGATAAAACGCTTTAATCAATCATGAGCGCTACCCATTTTCGTTTTCATCATCAGGAAGCGCAAATTGCCAGCCGGCGTTTGAAAACACAATTTTATACCTGCATTTTATTGAGCAGCGTTATTTATGCGGTCAGCGTTTATTTATTGCTTGTTCTCATTTATGCCTTTTTTAATGGATTTAATTCAGATGCGTATTCGGTCGGATTTTGGTACCACATTTTCAGCGAACACCACGGCGAAATGAGCAAAATCAGCCTAATCACCGCGGCGCTGGTGAGTTTTATTGTTTTAACGGAATACGTTATTGCGTTAAAACATTACCGCCAACACAGCGCCCGTCTTTTAGCCCAAAAAATGGGTGCAAAACCGTTTCCGATGAACGGGTTACCCAGAGAAATGGCAAAATTTAAACAAATGCGCAACGTCGTTGAAGAATTATCCGTAGCCGCGCATATCAAACCGCCGGAATTATTTTATCTACCCAAAGATGACACGATTAACGCGTTTGTTGTTGGCGGACAAGCGCGCAGCACGGTTTTAGTCGTTTCCCAAGGCATGATGAATATTTTGCGCCGCGATGAGCAACAAGCCGTAATCGCGCATGAAATCGCGCATATTGTTGATGAAGATGTTTTTTTATATGCGCAATTAACCGCGATTCTGGAAGGTTTTTGGGTAATGAGTCAATGGCGCGAAGAACCGATTATTAAAGCCGCACCGGAAGGTTTTTTATATCATTCGCTTTCTTATCAATTGTATTTACAACGCTTTTTAGGCGTAATCGGCTGGATTTTGTATTTATTCGGGCGCTATATTCAAAGCGCATTTTCGCGGCAACGCGAATTGATGGCTGACGCCAAAGCGGTGGAATATACGCGTTATCCCGAAGCGCTGGTGAGTGCTTTAAAAAAAGCGCTGGCGCTGCACTATTTGAAAAAACAGTCTTATAAACCGCGTCCGCAAAATGCGCATATTTTATTTATCAATTATTTTAATTCCGTTCATTTTGCCACACACCCGTCATTGGAAGAACGCATTGCGCGCTACGGCGGCAAAATTGATAAACGCGAACTCGATGCTTTAGCTTATGAACTCGATCACAGTCAATATACGCCCAATATTGAAGAGCAACATTTAACCTATCGGCATGCTTTTGAGAAATATACGTTTTATCCCATTAATTTCATTAAAAATAAACAAAATGCAGCAATTACGCTTCCAGATTTAACGCCGGAAACTTTATGCGCGGCAATCAACAGTTTTTTTATTTATCACAGCGGTTTGAGCATTTTTGAACTGCAACAAAGTTATCCCAATTTAGCGGTGATGCCGTCCGCGCCTTTTTTGGAACAATTAGAGCAAACGCACCCGCTGCTTCAACCGTTATGGTTTAACCGTTATACAAATCGCGCGCGCGAACTATTAAACGCCGAACAACAAAAATCGTTAAAACAAACAATTATGCGTATGATTCGCTTGGATAATATGGTTTCTTTTTATGAATGGTGTTATTCGGTGTTACTCGATGATGCATTCGGCGCGCCGCGCGGCGATGCGCCCGATGAAGCAACCGAACGCGCGATATACGTGCGCATCGTGACGTTTATCGCTCAACACGCTTTTGGCGAAATCGGAGAAGATAACGAGCAAATCGCGCTTCAAAAAACGTTATATCGCCAGCTTTTGCGGCAAATTTCGCCCTATTTTTTAAAAAACGATCCACCGCCGTTTGCGCCGCATCATT
ATCAGCCGTCCACATTTAAGATTTTGTTTCAAGATATGAAAATGCTGCGTTTTCTTTCCTCGGGACGCCGGCAAAATATGCAAACAGCGCTCGATCAAGAATGGGAAAAACGCATGCAAATGAGTTTAAATGAAGTTTATTTGCGCTACACGTTGACGCAGATTTTGCACACCGCGTCGTAA
Protein sequences of DBSCAN-SWA_4 >NC_009446|790933:839041|825811_826231_+|WP_041729542.1|DBSCAN-SWA MSVVMLKAWYRALFMQLLALQGIEIAQYAIQRLEYMIAREPYYVQIAYYQAVCAIATTMKQIKNADKLNICIGIYEIDADKVKVQLRYGVLHAFVARDNDFYRSAKYEQLLLGRTRRTKKQVDMGLVVQPLPAGWRVRW >NC_009446|790933:839041|817102_817960_-|WP_012031096.1|DBSCAN-SWA MIIDVQQGSEQWKAFRNQHFTASDAAAILEIEGAYRSKESVLSEKKYGYKELLNAFTRQIFADGHAAEDSARAIIEEQYGIALAPVVEVLDGTKLAASYDGITLDGETLWEHKHFTGSKRAQNRLELARAGAVADYDLAQIQQQLLVSGAQKCLFTVSDGTAAVMATVEVLPDNVWFARIRDGWAAFERDLAAYSEDAPEGWNDYAQELLLIREQIAELSERETLLKKQLEALAEQTGREKIAGGGVTCTKVTRKGAVDYSKIPELAGVELDKYRKKESIFWKIG >NC_009446|790933:839041|832327_832822_+|WP_012031113.1|DBSCAN-SWA MPKPKIHLIAAQTLNRVIGKDGGMPWHLPRDLQHFKQRTSGHTIIMGRKTFQSLKKPLPNRKNIVISHSGQPLHADVAVFPAISEALAFCCNDEKVFIIGGGSLYAATIDEADFLTITWIHTELLGDTFFPPIRPQDWQETHRSHFAADAKNAFALDFVDYQRR >NC_009446|790933:839041|832818_833265_-|WP_012031114.1|DBSCAN-SWA MISSALIKEFLDACFLAKEMTQIMPQLPAGLTPRHIYILDKIHELSQKNRAVKISDVSDNLRVTKPGITKLIHELTMLGMIQKSADDMDKRVTWLHLTPLGEQCHQEYFLNYHQAIAQHCTSTSEQDLTNAISTIKNLYWLMKSNVNR >NC_009446|790933:839041|836560_837130_+|WP_012031118.1|DBSCAN-SWA MIFIAVLAIIGIAALFLIRIYNQLVRARNEIHNAFSQIDVQLHRRHDLIPNLVAVAQKYLQHEEQTLTHVISARTTAVNALKSTQIEQIAPAEKALEQAMKGFYACIESYPELKANEQMQTLQEEISSTENRIAFARQYFNEMVTIYNASVEQFPQNIISQLFGFRTQSWYEIEESVRACPTVSFDKTL >NC_009446|790933:839041|824870_825815_+|WP_049752494.1|DBSCAN-SWA MSNLQAAPLTILNETINVFENLYSLNDLHKASGGANKDRPTFFLQNKETQALIAEIERENYNACNPTFKNESGNHNVGVPTFKIEAGKPALAIKTVRGRVKNRGTYVCKEIVYRYAMWVSPKFALAVIRAFDAAVTNNLLPATITIEQQRIIQEAVNAKAQRDGISHQTIYHDLKTRYRIPRYNELPAELFQDCLAWLGGYHIEYKNSNFEYRKERAFAVGIMEHVALAFEEQDEELHDLWTKTTCLCLALRQIEKQAEELRRGLQKTIQGNCAEAHVGLRAAQSYLKFPTEVINEGREAARSCLRRKKMEGEK >NC_009446|790933:839041|814327_815422_-|WP_012031092.1|DBSCAN-SWA 
MTTETVKKTYKIEDIKLEAKQYYRRGYIKIDTRSNDAPKISSFVQLISVYFAGDRELTVIVEYDGDSVFIKRQEDENVIRHKMSTAAEAEELIAMVKQLAIDTSNDILNEELSYHASHALVRIIEGFDETTYVTAIKLLIAAMPLVTEVYDFMQSDESLRSFYDSDVEEMVDSGKIDLENVREFIRVKAGELGLELYDLDDDDVLLKVIRDVLCAIRDCEDDYNLRSRYNSYFYSLTYEERREAYRKKVIKETAERRQTAKLLGLTALKGTLKQKKWAEDIRRKFLENASADEDVYLFLKSASCAQHSKFWIETRDISTAVKEQAIKTLIVATRKANEIGEGNEGYDEQVKIRQAAIDKLGVEI >NC_009446|790933:839041|837136_839041_+|WP_012031119.1|protease|DBSCAN-SWA MSATHFRFHHQEAQIASRRLKTQFYTCILLSSVIYAVSVYLLLVLIYAFFNGFNSDAYSVGFWYHIFSEHHGEMSKISLITAALVSFIVLTEYVIALKHYRQHSARLLAQKMGAKPFPMNGLPREMAKFKQMRNVVEELSVAAHIKPPELFYLPKDDTINAFVVGGQARSTVLVVSQGMMNILRRDEQQAVIAHEIAHIVDEDVFLYAQLTAILEGFWVMSQWREEPIIKAAPEGFLYHSLSYQLYLQRFLGVIGWILYLFGRYIQSAFSRQRELMADAKAVEYTRYPEALVSALKKALALHYLKKQSYKPRPQNAHILFINYFNSVHFATHPSLEERIARYGGKIDKRELDALAYELDHSQYTPNIEEQHLTYRHAFEKYTFYPINFIKNKQNAAITLPDLTPETLCAAINSFFIYHSGLSIFELQQSYPNLAVMPSAPFLEQLEQTHPLLQPLWFNRYTNRARELLNAEQQKSLKQTIMRMIRLDNMVSFYEWCYSVLLDDAFGAPRGDAPDEATERAIYVRIVTFIAQHAFGEIGEDNEQIALQKTLYRQLLRQISPYFLKNDPPPFAPHHYQPSTFKILFQDMKMLRFLSSGRRQNMQTALDQEWEKRMQMSLNEVYLRYTLTQILHTAS >NC_009446|790933:839041|790933_792208_+|WP_012031072.1|terminase|DBSCAN-SWA MGRILKIKTPRWALPLLKPARYKGAYGGRGGGKSHFFAERIVEEHIINPNRKTVCIREIQKSLRHSVKALVESKIEALGVSSAFDIQRDLILNRGGNGLMIFQGMQDHTADSIKSLEDFDCAWIEEAQSISKRSLSLLRPTIRKQNSEIWASWNPQLKTDPIDQFLRVDCPENSITVAVNLRDNPFASEEIWREYRDDRERAKRKAAAGDKNAWGEFEHIWHGAYRSHSAAQVLAGCYRVADFAIEPHWSIYHGVDWGFASDPTVLVRCYLDEAARVLYIAEEAYGEHVETVDVPQLLSTITNSAQYVIRADDARPELISHCRNHGFPLMRAAGKWQGSIEDGVSFLRGLDDIVIAPRCAHTLEEAQLWSYKTDRLTGDPLPELDDAHDHCWDAIRYALSDVIRGGYQGNSIIAGAARAFRRGR >NC_009446|790933:839041|812665_814279_-|WP_012031091.1|DBSCAN-SWA 
MNLRSYQQSIFDKVINSSDDDLVQLDTGAGKTPIIAKLAEYYEHAAIICHRNILIKQASEKLAMCGLEHRIIGANSTKKICARNNVAKVGRHFINPAASIYLVSIDTINSRIKRGGKGLDDNTKIILIDEAHHVAEDNKWFYIADSLNVRCVGFTATPCRGDGQPMLKQYGGFFDKIIQAEGYEENGTERLIAEGYLAQYRCIYYKIEDREVIYDNGVDIEIWVGDADIEAGDRGAEIIDVARKYCKKKQTILIEPRIDNAIYSVDELAKAGYSSAVIHSKMPQYDIEKILDEFDNKNVQILVAVDMINEGFDVPDADVLIINRKVNSFGLYRQLCGRVLRPRENKQALIIDVCGHAVAQHGLPSDNVDWNKKQGEIRRRDLTICCYCDTFFKATLTHCPYCGEFNDLSMRHEGRGEGIELFFYTAKEVELERKKIAAREQQKLIEEQERKEAERARKEYFNFSPKFDDSLVGRRCAQLYCLLQTELRKQLTPEQYNNFYKNNDYVLQSESFYIKSMPRNFENNPAAVAAKIYGAHK >NC_009446|790933:839041|793658_795158_+|WP_012031073.1|capsid|DBSCAN-SWA MSEAASFLRRLKVVPAPEFYALQAKLKQEAWTMSKIAQLKQIQGVLDDLAQNIQRGGTFEEWQKNYTAQGLPRHYLETVYRTACLRAYNAGKWVQFRAQKENRPILRYSAINDSRTRPHHKALHGFMAPVDNVAWKTLAPPNGFNCRCTLMSLSARQAAKLGYKGTVTVPEYVDEYGIRRPITPDKGWESSPENSSILALLREKEKKAGFPPAHKVFPEPLPSPEQWQEVAKIGEQVWTKHLPELERSFVSGDYPRTIRKILKAEGVDLGAPPRVNGEATERFQNIIKSSYPRKWVERANEAGRVLIRNFDNERGFQLFIKDEATAAYLRNNGARFKDYKPFAPKKNQVKAGDSFLRLREMTRKEAVADALEISIHEYAHRLQFIMPELDAYFMNMWLDRTRGEKTRPLNDILKEQGEKPVYNAREKGRADHFFNLYFGKNYGSDDEPKPLEMMTMAYQYLLAGSAYKSHSEKWRVYHLHHKDPEVLYLALGLLLRYQP >NC_009446|790933:839041|830501_830921_+|WP_012031111.1|DBSCAN-SWA MKIQSKQIDNLCFIATNDSGHGVVMEGQPPEGHAKRGANPMELVLMAAAGCSSIDVVSISQKQQQKVRDCQTTVTAERAETEPAVFTKIHIHFTVYGQDLKESAIEQAINLSAEKYCSVAIMLGKTAKVTHSFEVINEA >NC_009446|790933:839041|823351_824116_+|WP_012031104.1|DBSCAN-SWA METAQDIVARIIEELDSVDEIDCPKHGKQKSSNGQCWACIRESIEKEKAEAAKQAALTARAALLSKSGIPRRFMHASFDNYCVDKTIKQQQRAYAIARHYAENYDDSMDIGRSIIMTGAVGTGKTHLAVAIAQSALARGKSVCFTSVQKLIRSITDTYSRDAEQREKDVFSAYRAVYLLILDEAGLQRGTENERNIITEVISDRYNDQKPTIITSNLPLERLKDYLTERAVDRLLHGGVLLEMTWDSYRRRGDL >NC_009446|790933:839041|792204_793098_+|WP_041729522.1|DBSCAN-SWA 
MNDKVDKQASAATSAQALYTDPVFTLTNEDADKVLKNAGLSRSDLGKLLYDDEIFACCDRREKAVVGTRWRIEGDNTDWLHAEISRWHETLVRRTMDAQWIGSSISELIWRRPEEDHNGIRLAAVEPRKIERFINQDGVLRYQTQSGSYIDVEPLKVLEVRMNASAANPYGDALLSRVYWAWFNKNYGEQFWSKYAERHASPLTVGKFNPRTNNQAEAQRHLNDLAITLAQAISDGVIVITQDDEISFVNATSDGSAHQLFTRHHIQRIQKTIIGRVLTSELAGGSRAAQETDDNFS >NC_009446|790933:839041|804875_808505_+|WP_012031085.1|DBSCAN-SWA MERKTALTRQDLQIYLTERLTDADDGGGLMTKTALTDEENQLFNPISDVARTMGAFHARSVHAAVRRPDDTPLGGAYVILSEPPKTQNVSYLLFRGIKYGEERRDIVPRIEAYSVGTIESRMTLLSVQSRGSRVIQAYQRQGEPLPLIGDVYCLRQDKKGYAQYEQYIQVIKVSSEDRTFTDPTGKDFVRTVVKMETSTALEQDMLGIDYPVLGYGDAPCKIRETHVADSAQYYGVKALSADAVSGAMKIRVPSLMEKLVPTSQVETSLVDLTAAGQRQVLVDNAIKGDDGFITQTLRLRTLNIDEVIHLGRAIVPDSLTIKGNITANDVGGTLVNASSQESIGTVDYARGEIRFSVYTSGISTVSFRPAVSELKVSDTTKIDISINNRSYNYVLAINPIPAPASLLVSYRAQGRWYDLYDDGSGALRGFSAAHGSGALNYASGTVTLTCGELPDVGSSILFAWCTPAQYKNRSSETPKVSIVLVLNQTADPATLKLTWNTYQASCDAAGKITGNATGYYDARTKTIKLDSASYMLGQKVTLTYSKFEEADKVQQEHKAPLRNGAGEIVLDLGDKPIAPNTVRLKYNLLIEDFQQQTYGEIYLKRIDPYKVLRDNGAGVLIDENNVPFGTIDYDARKLTFKPETTVKIPKAQYMTYKSGTEIVRKSQSDPDKHEVRDVFRNFFTGFEYQSVGAFMPFGDDGVVEVWFTPKTVTNTFEEINTERFLEIELAPNLAERIVTGSVHLKVGNKFHFDRAGAMYTDLDTSTGYARKVGTIDYQNGVIRLSEFTDINARIVALSTTIDSNPVDAATFRTPSSPIRAGSLQIRATTATGEQLSAIAQLDGSINDNKITGTVDVESGVAAVKFGELVNAAGNENEPWYQAEAVVDGKIFKPAHVLAETITYNAVAYTYLPLDTAVIGIDAVRLPQDGRVPIFRRGDMIVIGNRIIEDIGSAHTAKGVVSLSRGDLDGLCVFDNAGKSVDAHLYDYDLTAGTLTWSEPLDLSAYQMPLKVKHGQEEENRIISVDIDGTLTLQFPLRRAYPANSTFVSSALIGGDLQVRVTAPFGQKAFDNIWSDERRGDDIRSRLNVTDFPFVLTDDGTTTDRWAIVWRDAMQFDLYSEALGFVGRYDTLTDLAPINPATNKPYFTLPLGAFGIRSGVSGWAAGETVRFNTFGTHIGVWILRAVQPSSEKQSASDGFTICLRGNTTEL >NC_009446|790933:839041|800248_800458_+|WP_012031082.1|DBSCAN-SWA MNNKPFKQAPLPFIGQKRMFLNAFQKVLKEHIPSDAEGCNRMSDYQRINIQTRLNYSSVYEDNLTYKFS >NC_009446|790933:839041|799243_800011_+|WP_012031080.1|DBSCAN-SWA 
MRHPTAHIREDAFIGEGTVKLKVKGREELGIFELGNATEFSVAMSSEVLERISKKRGTYGQVLNTVTLPKAGELSITIDTINKETFAMALMGSLGVDEMKAETVADEEIAFKPDVWLKLPHRYIDGEVVVKDATAAGTVDPKEIEVEPRLGLFRVTSAGAAAANIDGTVKVSYKTATWTRWVIQAFKSTELRGELWLDGRNRITGEDVLLHMPQFTLAVDGEFNFFTDEFNTITFKGRPETAEGYETAFTVEMKE >NC_009446|790933:839041|811391_811619_+|WP_012031089.1|DBSCAN-SWA MDILKQFNDLIAPKKQQAARVIAQKGADAWVAETPAGLIVVITGTTQVGQHVYYDDYTKRIIGQAPAVAWTGIPV >NC_009446|790933:839041|831000_832341_+|WP_012031112.1|DBSCAN-SWA MVQAFKNELRPTLSLALPMICTQLLNYGQQIIDTVMAGRHTPLTLAGVSLANQLFAIVYLFMIGIGVGFSALISRRHGNDSHVTIRREFQQGFWLFSLLGILMIFAVIGCAYLPQIIGSEKSIAEESKRYLLVLALPAGIFLLGQLARFFLEGMANPRPINYVQAALLPVNIVGNWFFLTFTDLGAAGMAISTGFCYVLYTGALLWILATRPRWRRYRLFHKMSAFNLPMIKELLWVGLPIGAAMVMEAAMFSYIGIMASRENAIITSANQIASNYLSIIFMVPLGIASALTIRCAHALGRRDWTAIRYRAYTGMIFSGGFMLLSSIVLILARYYIPLFYTDHPEIIAIAAKILFVVAFFEFIDGIQVSCAGILRGLGDTRICLLYAFIGYWIIGIPTGTIMAYGFDWGVYGLWGGCAFGLGTFAVLGARRVFYHTHRKRTLHAEA >NC_009446|790933:839041|824362_824878_+|WP_041729540.1|DBSCAN-SWA MSNLQAEPLSFNHVIFKPVELNDDQIWLTSLQLAQALGYKRIDSVSQIYRRNADEFSENMTQVIDFLENVKLRVSKKNFAFLTFQRMLSDNRVAAAISAVEISLSLIRKRVQRFYRAIFMPKICYGGIRRRTARFAVSFVAARLILRMLSPSYLALTAAVNLQLRKETANV >NC_009446|790933:839041|809344_809710_+|WP_148188635.1|DBSCAN-SWA MYVLIFGITNMILSDKHYIPHDTGFIGGTLGGIVTVAHMPAVQPVFLLDSKTLQIVAQTYSNQNGHYCFTCLPADKCYTLFARDRFKRASLRPPVWDYVTPVNDMSLSEQYDFLKAYDEHV >NC_009446|790933:839041|816634_817093_-|WP_187145766.1|DBSCAN-SWA MMAGINKVILIGNVGKDPDMRVMTNGEQVANFSLATSMSWNDRQSGEKREKTEWHRCVAYRRIAEIIGQYVRKGSKLYIEGRLETRKWQDQSGVERYTTEIIVNEMQMLGTVQSNNRAPQQKPQRKQNARNYANQYARESDGDDFVDDNVPF >NC_009446|790933:839041|815573_816242_-|WP_012031093.1|DBSCAN-SWA MTTETIIKTEKDFSKFKVVITDDHGAYLYMSGEEDNRKVCAGNISYAIEHEDNGEIIRSPFDVTTREVTYWSFSNCREGTYYDITFEDDFMRKFFGDDYKHDESTFPIRDSFEKFVLVEVFRLGRATAKELYRCIIESATNDRPDARHEEYSKRVRAVSLSPEVFGKLFSDIAGEAAKRAFNVIEDVIDKERRINSWYLNYDYVAMLSQFHKILANYMERNT >NC_009446|790933:839041|796901_797888_+|WP_012031076.1|capsid|DBSCAN-SWA 
MADVLSKLGLTQAELDEAVNSKPNVPYRLLNSGLFQDKNLTTTSVMVEFTDKELKLIPANSRVGAANVRAYGKGSTVRTFTPPLLNLETTIRSEQIQDVRKVGTRDALLSNAEAMQEEIATHREMHDLTIEHLMLGAIKGKIIDADGETVLFDLFKELGISEPLTTIDAAAADIGQQFAKTLRIMKDGLAGDTCTQARVLCSRGFFDSVIAHKSAHEAFERYQENAFARNLPVDTFQWNGFIFEVYHYEIDGKPVIADGEAHAYLEGMQRGFTRYNATGTLMSAVNQIARPFYIDIEDLAHKRGISVYTESAPLPLCLRPKTLMHFKI >NC_009446|790933:839041|828100_829300_-|WP_012031109.1|integrase|DBSCAN-SWA MLSDAKLRKLKPTEKKYKIADSHGLFAVVMPNGSILWRMRYVLAGKRRELALGKYPAVSLSEARQIVLLYREQIERGVDPHAQHVERHKKQDGVTVAEAVELLLSEKAKYCSASYLESCRSRLSNHVLSRFGARALAEVSEDEWRALINWLDNGGKNYTAIRVQDLAGELYKTVRLKTGLKIANPILEIRGTVKKKPAQNYPRITNPARFGELLHAIDYFNGRWQVYLALKFLPLVFCRQIELRRMVWQEVDFERALWCIPAEKIKMRRAHLVPLSTQAIAILRQAEQLRCSDLVFCGVRNPRQPISVNTIGSALRTLGFGHDEMVGHGFRGTAATLLNELGWDSHIVDFQLAHWQRSSVSSAYNHAEYLTQRREMMQAWADYCDELKAKTRKRLGIED >NC_009446|790933:839041|797909_798377_+|WP_012031077.1|DBSCAN-SWA MYITGADIITRFGRDELAQILAVPIDDLSPDHERLLAACRDADAIIDSYLSRALDLPLKTPPAVLIAYAADIARYRLHDDQVEDGTSVQRKRYQDAILWLERVARGEVSLIAHNEQQQKNIKPSPMAVSSVSGLAVVSSPPVFTDDLLAKGLVKS >NC_009446|790933:839041|796077_796899_+|WP_012031075.1|DBSCAN-SWA MNEIILSLRLSLTGREFTGVAYSGTTLSYGDSTIGIDISSIQKLDKQVPILLEHDPQTPIGFGKLRAQEGKLIIDGTLLDNKSIAQEIIADAETGKEWQLSVFVESDRISERKSGDMLNGQAIEQDNVLVFENALIREVSFCTLGVDGNTYVSLLSLNHKPDDTLKVAEQNADEQDEQNAEYERVCAELSAARAEAEQLKKQLNDQAQSARLSRLKALGIADVAAAQLAVIEGETFEVMLEQFVLQQKQTAALSINAYPTPTPERENPIKWSK >NC_009446|790933:839041|800494_804466_+|WP_012031083.1|tail|DBSCAN-SWA 
MANDLKVGIKIDASVRNFKKIEAISETLTGVAQQTAALQNQTEKLRSEWEKLTPEQHAQQITALESAIDGLGKQTAGAVLQSDEMAHSFEHVSEKAQQLQAIAKTKTQLGIDTDERAVSKIEQAVDAYRDLKKEGGTAQKELAHAADAHRQKIDELESALRNTKPSLGDIAGGLAKIISSAGGLSVVAQSAMSFETAMAGVKKVVDATPEQMQQLSGTIRQLAYELGMTAEETANITAMGGQLGVAFQDLPEFTRLAGEMAVAFNMTAEQAGDAAAKLANVYQIPLENVRALGDAINTLGNNTAATEAEIINATLRIGGTAKQFGLAAESAAALADSFIALGKTPEVAGTAINALLNKLQTAPAAGKEFKDALKSIGLEADALAQSIRANPEQALMSFMEKLSALDQQQRAITVTKLFGLEYADDIALAAGSLDSFRHALSLVADQQATAGAMQEETNAAMDTAAKKIEQAKTAINNIAIELGSLLLPIIADVASAFGGTVKEILALAQAHPHITELVSAVSGIKVVAYAAAESFKLLGGVAQNSFSLLGAGSEQASAGLHRLRGETEQGVEGFSKMDKAANALSKTLNVLSAATAGFTTGFSFGSWLYEQSEHVRAFGDSLGKLAAYTVAIFSDSTFEDVATYYKTSAQVATEETQRLAQAKEQLAQKAAEAAEAERQHAEMVAQNQAQINQLVAEIEQHQSSLQVLREAGEAGGATYAYISSQIETAKQKIHELTEAIEGKPIGLIMKTEFEAADKAFKTLGLSLSELETGISDKAKQSLEAFGTVAVVAEGNVTQLARAYEAARQNMGSSSAAQEQLNQKLLDAVGGNQELYQAVIQTAQAQNIAKRAADEQSAALQALGLNMEDIANRTSSGVQKMLAHWKTGMETLKASGKQSAEAVRLAFDNMLKSLHSTEDFKAFSDALQETGTASALTAEQLAQLRAGVNGGANAAQAAAQANAAHTQSLSANTSEVLANKAAIEQKTQALKESANAAKEASTAEAKVESQSEKTRKTITLAAPAYYTEARKRIEAMRELGATSEEVENALQSFWQKTRFNFGVSNVNEVATSLVSAMKAAASARARIDEMTESLRNGSFSAQDISEAMGKLHISSLDSVDSVQNLGDSRLNNLRDALKEARNHMRALSEEARDTADSLEADLARLRGDDSLAKQIEETKKLKELEAKRAAAQKAGNKEAAKEFERALILQKQIFAEEQRQAEVRAAEAQKKAAERQAADAQRKQNKAEHIAQNRTPDNRMADDARTPQVDMNKPAVTLVGSAPTAEALADIWNAKIAAAEERGAQRGKEEFAKELYNAAKRRPL >NC_009446|790933:839041|808506_809373_+|WP_012031086.1|DBSCAN-SWA MNQYYRTEPVNVYRFDDEDAPPLNGNPDSLLTILKACLVTGYGKQKPLGFSLAFEDEHVKVFCPKPRGLEPQWFLRASDDNGASARLQIYLDMTGINDGRIMCAPQTPYKYSNKNRTGEWLLVGSSRGFVFFAVCGYTDAINKGSFCVCGDSSKNAKGERAVYLHHAGGSWSDSDIYGIRPPTFDTSTHIAGCLARVDQNNNFVVRNVYASCALGAGLETVETHLAPLYVADSESVYLIAGVYLSSNTHAANSYDTVKHDDAQFIVHASSGRERCHEQLYVRTDFWYY >NC_009446|790933:839041|810267_811383_+|WP_041729892.1|DBSCAN-SWA 
MHFWQQPLSSDATALPLPFACFDKPIYSFIVPDLKTYIMHNTITATFDNEPLHLLSLRLQTDINAYCWQANFDISVDDFARLNIDKRKKGDEAVVSIFINGERFDVLAEDYSDNRRFIGNTYTITGRSITAKLGADYAAGKHQIYQEARYARQIADEQLYLLPYKIAAWECIDWLIPSNHYAVNGQTPIAVIADIAAAAGAFVNSHTYLPELSIKPVWQKAAWETVSPKHTVPASLIYSISGRRTIKVRANAVRVVGSGISARGFLVYREASNQIPEAAVLNHVLYTEEAVARAAGIHALSETGIHKTETVTLPVADKYQLPRAELGDVWAFNENGEQFQGVVKSVTLTVSLENDAPVVTQTLDVDRYLDF >NC_009446|790933:839041|817956_818562_-|WP_012031097.1|DBSCAN-SWA MNIYQKLAEAQTKLVVKKDLDNKFAGFKYRSLETILAAIKPILESLGIYLFMSDKILFIDGRHYVEATATLVNAEKPEEQIIVTACAREVQEKKGSDPAQITGAASSYARKYALCGLFGIEGESDPDADDAPQTKAASKKQAAKIAVPEKEFQELKVYLKNQVAKGVDFDAVIALVTKRAAAKNWVLTNEQIQIMRAECLQ >NC_009446|790933:839041|821940_822114_+|WP_081423590.1|DBSCAN-SWA MTDSELITKLGGATALARKLGTTPQAVCNWRTRGIPARIKLEYQELFEKAAKNEKSK >NC_009446|790933:839041|816362_816617_-|WP_012031094.1|DBSCAN-SWA MIRNITRQEFARRLGVKRTKFHDLLAGGCLPPPRRIGRDLVWLEPVVDAFIILAYGYPMPIPELSAELEHEVREAVMQLHNEKI >NC_009446|790933:839041|833825_835031_-|WP_012031116.1|DBSCAN-SWA MKLNTIQGGIIVAIFLCTFLIPGGYNYGMLALLITGIIAIAEKSPHRLSYFERRWIAFLLIFSAISLTLRLLHGEPLSTYDNLSRYGLSIILLYGLSRYPPAEKYYSWALLAGTLTALLMVLVARYRGDSAIGYQHHIQFSGISMIFAVFSGFRLPYAKTRWQRFILIVALFCGVCAAALGGGRGSWLIVPPILVLFFIMMRKRVTLKNAALGFFLLLSFFSALYCIPQTGVAARVRGVFSDLDHYRQGTVATSQGMRLEMWRCARLMWQEKPLMGWGSVGMQAEKARLIARNDCHFSIAPFNHLHNDYIDIGVRYGIVGCFNFLFIYCGALFLYFVMLAQFPSIAWQGIAINCCFMVVSLTNAFLSHHISIIFYLLLNSLLFIQTNYTYIQNKNYISNKI >NC_009446|790933:839041|827195_827603_+|WP_012031108.1|DBSCAN-SWA MTEFIERLVEIDRELTRLHKKEYALRKSISDIEQTIDETENAVLDSTGTRESNGIKCDPAEMRAFFENHKLLTIEHVIKRHEELIEFYEETSKERNDALKEQLTEDKAELEQINERLQLANKERDALLAQLNDGT >NC_009446|790933:839041|804462_804867_+|WP_012031084.1|DBSCAN-SWA MNDYWQLTRKDTQAWLQFDQDMRWIDEFDWSNIAQSNPVRTLSGAQVIQQGTKYSGRPITLAGDWVWIRRAQLQTMQDWTTTPELEMMLTHYDGRVFNVTFRLHENAFEATPVVYRTPEEDGDFYTIKINLMTI >NC_009446|790933:839041|822195_822459_+|WP_012031102.1|DBSCAN-SWA MAHVETKKQARVTQKRLIAAHLKKHGSISSWEAIELYHCTRLGAYIYELRESGWDISTLRKTFTSSVTGNSGVYALYLLNESNELGE 
>NC_009446|790933:839041|793134_793662_+|WP_041729524.1|DBSCAN-SWA MINEFIAKVLRLNGTARGDILFAYDRTESIDKERWERDTALMDRGIRFTEQYFIDQYHLEPIYFSLEQIERAARSERAANAAQKAGLSLSKKQELTPAAQALEDRVQAGMAEAPEPITREMIEDVVKNAPNDYQLLEDLVKLYGDRDPEGFNDWFGEALEIACAHGYHDADQGTL >NC_009446|790933:839041|822459_823347_+|WP_012031103.1|DBSCAN-SWA MSLKYMVDALAIKVGNPLRKLVLVKLADNANDDGECWPSYQKIADTCEISRRSAINHIKWLEEHGFLISCARKDADGMNRTNIYKLTIAEGKNADKNDGGNDGGNDGGQDSLHSEQDSLCSESIASGSESIASGSESNDAKVVQQLHPEPVNKNLSKNQSRNHESRANARMREAQNFDPISALKAEGVSEEKIRDWLAIRETKGAKSLTTRSYQAIMSEIEKAGLSAIEFVDLALFKGWRGFGADWNWQASFEEMKRQQRGDAQTQRSDWQSQFPSIIELHKGGTPDELIPWIKR >NC_009446|790933:839041|833330_833501_-|WP_161802473.1|DBSCAN-SWA MTFNKTGGFSYLKKHSIILAAEKQLTHIIFIRKIIEIIEMLAHLFIDVKRNEILLN >NC_009446|790933:839041|811633_812068_-|WP_049752493.1|DBSCAN-SWA MKYFKHVTVNTGHTYISTRDEVRDSVIETMIDWFERMQNGETINIYDDYSCYLIKSYNKMADFMISMRKRGAVYSDNVIRFTVCLHSRTAKECWSIVGGEGAPPENPFIAVLIINPLLVDIDLADFERCLAWGFYEAMQRKKMQ >NC_009446|790933:839041|829441_830461_+|WP_081423591.1|DBSCAN-SWA MNTNGCDIDAHIMHTGKLTCQQRANLNLNDKSRAFKRHFKSLQALWQIIAHPARFAPPQPLPVALPDWNDFFNAQHDMRFIWLGHSSLLCSSGKTTVLIDPIFARYAAPIPFIIKRFQKPPLAITELPPIDWILITHNHYDHLDKKTIRYFSKHTTKFVIPRGLENFFIRLGIQRQRIFALNHWESLTLNDITLHSVPARHYSGRHLFDHNRSLCAAYVLQTPFEKFFFSGDTAFGNGDHFRAIAARFGAFDFVFMENGQYHCAWHNHHLMPDETIAAVKILQARCWTPIHWGAYALSTHLWHESVSQTWALTQKTCLKMCTPRLGAVFSRTHESDKWW >NC_009446|790933:839041|820876_821626_-|WP_148188642.1|DBSCAN-SWA MGYIIDGVQRRTNNKSEVSLASNVKILRELNNLSQDQLAEKIGKSQAAIQKIEAGLTLRPRFLQDLANALGVSSIDLEYKDFEKELKKQAIESDIGTMGKFRLWSSNDPLPEDEYAYLPFFKDVEFQGGTGCCEMQDYNGFRLPFAKSTLHRYGVPLDQAFCVTLTGNSMEPVIPKGSTLGINKADTVLKEGDIYAIRQDDLFRVKRLYHAPNGMIRISSFNQEEYKDELVRPENIEIIGRVFTYQVML >NC_009446|790933:839041|812054_812669_-|WP_041729532.1|DBSCAN-SWA MITLINDKSENRGGSFEMIFTDPPFDMGAGKLHKILSNYQFEHLVLIASMRQVLELYPKLDMDFCFDLVANRSKPKESRSYAMPHYLHNNIFYFKKHGVKSAFDRRLVARADQYSDTKTHYYPTFFDAPKRDIVYRYQKNQQMIDDIIGAFNVSTVCDMFAGSGTTGLACVKHEKDCTLIEAETEPFNIMKQQLDFLQVKYEVL 
>NC_009446|790933:839041|800013_800241_+|WP_012031081.1|DBSCAN-SWA MKIELKQVVIVGGVARAAGSVIDIAESTARWLIEQGAAVSNEDLTVAELKAQLDARGISYRKNASKTELSALLTK >NC_009446|790933:839041|818919_819177_-|WP_148188636.1|DBSCAN-SWA MSAIQTTNTEQKMATFFTKNNSLLTNIFLCVFLFLNIFACTRSAIYAALYEVNSAPRPTYQQLADYCADGEEAACRALFARRGDK >NC_009446|790933:839041|795668_796067_+|WP_012031074.1|DBSCAN-SWA MTETIPYERPQFLHWEANPHVSREIGVAKAAIKAGDVVLATDGVNFEVLKDGTPPPPSTGDAPKIGFALEDAAAGAKFALVVRFAVILIDKLNHVTAASFAKGGAFEFYGNFFPPLNIILKKSVDVQRGVKP >NC_009446|790933:839041|819314_820748_-|WP_012031100.1|DBSCAN-SWA MPEIFAFIVFAFLVLFIISAPKIGFFIFKKSRVYKKLTEDIEALEEQEDHLNQYLSSLHQKIKEARQEEKDARDELSAINHETQRLQEIKQHSEEIEAKYARLAEEKAAIERIEQQRKAILAKIALYSGAEFFIDYGLYPLPSYSEFTAGGYLEQIKDNRQQQKNMLKDGSAYRVPENFILTGDGSYDKKVATAQGKLLVRSFNSECDYLLAKLNSRNYETTMTRIEQLAESLEKMLVSLEIGLTPQYVMLKMKECKLYYQYQAAKELEAEEQRAIKEKMREDAQAQKEIERALREAEREELIAKQAVERLQKEMLAANEEQKAQYEAQLKNLQAKLAEAETKGKRALSMAQQTRSGHVYIISNIGSFGKDVYKIGMTRRLEPLDRVRELGDASVPFSFDVHAMIYSADAPALESSLHERFADYAVNKVNKRKEFFRLPLNEIKRYLEEEKIQAQWTLAAEAVEYRQSLQITKEKMN >NC_009446|790933:839041|798779_799247_+|WP_012031079.1|DBSCAN-SWA MPTLPIDPLDLNSAWDAIVARIAAVDGIRAVRGAVDLEKIINGQTTGSNHYAFVTFDGIELLQPAGNSTRHQTVDIAYQVVIAELDYSNDGKPQQAGVLLGRVLSALMSFAPFTDDKRSSQRLRAVTPPRAVHRNKYSLYPLKFILSLHIKGEQL >NC_009446|790933:839041|826233_826611_+|WP_012031105.1|DBSCAN-SWA MSVHTFKALLPMPPSVNHYYRRRGSHVYLSDEARIFRRDVVLKTIGKIPNKKRWFDKEQRLAMTVTLYPTNRRSFDIDNRLKALLDVLQEIGVYHNDNQIDDLRVVRGAVVKRELGFCVVEVKPC >NC_009446|790933:839041|798373_798799_+|WP_012031078.1|DBSCAN-SWA MIFVSVSGFDEAHERLRALVSRGADLTPLLTEIGENEVSNTLLRFEQARAPDGSSWQALKRPRAHRRGGDGGNLPLNDTGALKTSIKSQLMGDSSVLIGSDLVYALTHQSGRDAIPARPFLGLEADAEAEILEIIHAYFAD >NC_009446|790933:839041|835027_835948_-|WP_012031117.1|DBSCAN-SWA 
MRKIYLAGGDSRSVAAYFHALTGAVSVVCGYANSALPAPSYQQVCSGKTGAVKAVALTYQPHILSLSAVFLHFFRLLEPILFQYQPHHQRHGIYSVFSEERDLARAAVTALQKRHLNPITIEIAALDNFFAAENAPLSTLEPPRQANEPFSAEERQLLYAFWQKPKLAELKKILSEEEFAITQEQSTEAPFSHCYDQLFAAGIYVDLLSGEPLFCSNDKYDAGCGWPSFSQPLTEEALTYHADFKGGQKRIEVRSAVSDNHLGHVFFDEPSPSGKRYCINGTSLRFIPWSKMDAAGYGDLKIKVKK >NC_009446|790933:839041|818587_818920_-|WP_012031098.1|DBSCAN-SWA MYANSINNRKDEIEIASYFQIKFNEFFPQTQNETDVLQFVCAQYKPENRCQLIDIYVLVGGSLGERTLAATVGYEIGTPNFEIVSISEQVQLKVHISEHGDLEIYFYLAD >NC_009446|790933:839041|795154_795577_+|WP_041729526.1|DBSCAN-SWA MKYEFTLNELPASLDTDKGIFTYEDEEIKKVIDETIADAKSWGRWGGGTSIYVGIDITDPYRDIYQFTACLYTAMAGNHLPKIRGSDTDKYFPKELYPYAPCLAGLCEDIPPINVFLGTKEEFEAYGEKLEEMEKFGVVF >NC_009446|790933:839041|826604_826961_+|WP_012031106.1|DBSCAN-SWA MLIEELLVAWGEWSRKDVLPARPRCFIAAFCRRSARDADDSVELDDEACEQINEGLMVLRGNHQRIHDVLISRYLYNEYNDKAVAKRHGISESTVKDYRRRGHLFLEGWLCSCSQRSE |
55 | Vibrio_phage(12.0%) | integrase,terminase,protease,capsid,tail | attL 782235:782251|attR 838181:838197 |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
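The keyword labels mentioned in the note above also appear as an optional field in the protein headers of this page (for example `|protease|` or `|terminase|` before the trailing `DBSCAN-SWA` tag). The following is a minimal Python sketch of how such headers could be split into fields and checked against the keyword list. The function names `parse_header` and `is_phage_related` are hypothetical, and the matching rule (exact, case-insensitive match on the optional field) is an assumption for illustration, not the database's documented method.

```python
# Keyword list copied from the note on this page; the matching rule is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def parse_header(header):
    """Split a protein header from this page into named fields.

    Expected layouts (as seen on this page):
      >contig|region_start:region_end|start_end_strand|protein_id|DBSCAN-SWA
      >contig|region_start:region_end|start_end_strand|protein_id|keyword|DBSCAN-SWA
    """
    fields = header.lstrip(">").strip().split("|")
    contig, region, location, protein_id = fields[:4]
    region_start, region_end = (int(x) for x in region.split(":"))
    start, end, strand = location.split("_")
    # The keyword label is present only when the header has six fields.
    label = fields[4] if len(fields) == 6 else None
    return {
        "contig": contig,
        "region": (region_start, region_end),
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "protein_id": protein_id,
        "label": label,
    }


def is_phage_related(record):
    """Return True when the record carries one of the phage-related keywords."""
    label = record["label"]
    if label is None:
        return False
    return label.lower() in {k.lower() for k in PHAGE_KEYWORDS}
```

For instance, calling `parse_header` on the header of WP_012031119.1 and passing the result to `is_phage_related` would flag that protein through its `protease` field, while a header without the optional field would not be flagged.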
DBSCAN-SWA_5 |
856223 : 870326
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NC_009446|856223:870326|DBSCAN-SWA TTTACGCATGAGTTTTCTCCTGTTCTTTAATTGTAGCGATCAATTCATTGACCACATCATTAAGCAATGTTTCATCATCGCCTTCTGCCATCACCCGAATTTTAGGTTCCGTACCCGATGGACGCAATAACAACCGCCCGCGGTTACGCAATTTTTCTTCGGCATTTTGAATCGCGGTTTGCACCGCTGCACTATTGATCACCGCATCACGTTCAACAACGGTCACACTTTTTAAAACTTGCGTCGTCAACGCCAAATCAGCAATTAAATCCTCGAGCGTAGTTTCCATTTCTAAAATGGCTGCCAAAACTTGCAACGCAGCAATAATGCCGTCACCGGTGGAATTACGATCAAGACAGATAATGTGCCCCGAATTTTCGCCGCCTACGCGCGCTTGCAATAACTGCAATTGTTCCAAAACGTAGCGGTCGCCAACTTGCGCGCGGTGCAAAGTAATTCCCATTTTGCGCAACGATTGCTCTAATCCCAAATTCGTCATCAGCGTGCCAACAATGGTGTTGAGTTGTTCACCCTTAAATGCACGATATTTAGCGATAATGTACAAAATCGCATCGCCATCGATAATACGCCCGCGCTGATCCACCATAATGACGCGGTCGCCATCGCCATCAAAAGCAATACCGATATCGGCTTGTTCTTGACAAACGCGTTGTTGCAACGCTTCTGGGTGCGTAGAACCGCAACCATCATTAATGTTGCAACCATTAGGCGCGGCATTGGTCACGATCACTTGCGCGCCGAGTTCCCGAAAAACATCGGGCGCCACGGCATAAGTTGCGCCGTGGGCACAATCTAAAACCAATTTTAAACGGTGAAAAGGCAAATTAAGCGGAATCGAATTTTTACAAAATTCAATATAACGCCCTCTCGCTTCGGCAACGCGATAAACTTTTCCCATTTTCGCCGGCGCAACCAGCGTCATCGGTTTATCCAAATAGGCTTCGATTTGTTGCTCGGCGCTGTCGGGCAGTTTATAACCGCCCCGCGCAAAAACTTTTACGCCATTATCGCTAAAAACGTTATGCGACGCGCTCACCACCACGCCTGCCGCCGCGTTAAAAGTGCGCGTAAAATAAGCAATGCCCGGCGTGGGCATCACGCCTAACATGTGCACTTCCATGCCCGCGGCTGACAATCCAGCGATGATCGCGTTTTCAAACAAATAACCCGATAAACGCGTATCTTTCCCTACCAGCACCTTTTCGCCGGCAAAACCGTTATCGCATAAAACGCTACCGACCGCCCAAGCAAGATGCATGACCCATTGTGGTGTCATCGGTTCTTTTCCGACTTCACCACGAACGCCGTCTGTACCAAAATATTTTCGAGAAATCATAACCATTATTGAGATACTGATGTTCCGGTTTCAGCTTTTGCCGCACCGGTAATAAAACGTAACCAACAATAACCTAAAGTCGCCGCGGTTAATGACGCCGCTAAAATTGTAATTTTAGCTTCAGCTAAATGTTGAGCTTGTCCGGAAAAAGCAAGTTCAGATACGAAGATCGACATCGTAAACCCAATTCCCGCCAACATACCAATCCCCAAAACTTGTTTAAAGGTAGCATCAGCTGGCAACGCTGCAATTTTCAGTTTGACACAAATCATAATCGCAGAGACAACGCCAATGAGTTTTCCAAAGACTAATCCAAAGAAAACGCCTTTTAAAACGGGACTATGTAATAGAGCATTCAAATTATTTAATTCAACGTGTACGCCAGCATTAAATAAAACGAACAGCGGAACAATTAAGAAATAAACCGGCGTATTTAATATGTGTTCCAAACGTTGCAACGGTGTTTGGGTTTTTTGAATGCCGGTTGAGAGCTGTTGCAAAAGATCGTTGAGTTCACGCGAATGTGCCACTTGAGTTTTGCTGTCGGGGTGCGCATCAAATTCATCTAAT
AATTGCCGCGCTTGAGTGCTAAATTCTGTTGGCGAATAAACTGCATTCATCGGTGTTGCCAGTGCGGTTAATACGCCAGCAACCGTTGCGTGAACGCCGGAAAATAACATAAACACCCACATCAAACTGCCGATGGCAATATATGCCCATAAAGCTCTAATACCGCCGCGGTTTAAAAGTAGTAATACCGCAAAGCATAAAAACGCCGCAATCAGCGGTGACCACGCCAAATTATCCGTATAAAAAATGGCGATAACAATCACGGCGCCCAAATCGTCAACAATCGCTAACGCAACCAGCACCGCGGTTAAAGATGGAGACACTTTTCCTTTTAACAATAATAAGATGGCAATGGCAAAAGCGATATCCGTTGCCATGGGAATTCCCCAACCGCTAACGGTTTCACCTTGCGAGTTGAAACTAAAATAAATTAATGCCGGAACTATCATGCCGCCGAGCGCCGCAAAAATCGGTAAAACCGCCCGTTTTAAAGAAGCGAGCCGTCCGACTTTCATTTCATATTTAATTTCCAAACCGACCATTAAAAAGAAAAGAGTCATCAAAGCATCGTTAATCCAATAATGGATCGTCATATCAATTTTAAAAGAGCCCATTTGAACGAGTAATTGTTGATTTAAAACGCGTTCATACCATTCACGGCAAGACGTATTGGCAAATACCAATGCAACAATGGTTAAAAAAACGAGCAAAATGCCACTGCCGTTTGCGCTGTTCATGAAACGGTTAAAAGGCGACAACATTTTTGCAACCGTCTTCTCCCACGGTGTTGCTGCCGTTAATTCGTGTTTTTCCCGAGTAAAATATGACATAAAATAATATCCCTTTTTCAATGATGAATAATTTTTAACTACTATACCATAATCACTTTTCTGTACGACAACCGTCATGGTATATAATCATTTTTTATTGATCATTACACTGGAAAACATATGGATTTACAAAACACTATTGAAACAGCGTTTGAAAAGCGTAGTGAGATTTCACCCAAAACGGCTGATAGCGCCTTAGTGACCGCTGTCAATGAAACGCTGGCTTTATTAGAAGAAGGAATGATACGCGTTGCCGAGCCGACGCCCGAAGGTTGGAAAGTGAACGAATGGATAAAAAAAGCCGTGATTTTATCGTTTCGTTTATATGATAATCACGTCATTCCGCACGGCTATACCCATTATTTTGATAAAGTTGCCAGTCGTTATGCAGATTATGATGAAGTGCGTTTTAACGCTGACGGCGTGCGCGTGGTGCCGCCGGCTGTAGCCCGCCGCGGAACTTTTTTAGGAAAAGGCGTGGTGCTCATGCCGTCTTATATTAATATCGGCGCTTTTGTTGACGAAGGAACGATGGTTGATACTTGGGCAACGGTAGGATCTTGCGCGCAAATCGGAAAAAACGTGCATTTATCAGGCGGCGTCGGCATCGGCGGCGTTTTAGAACCGCTGCAAGCATCGCCCACCATTATTGAAGATAGCTGTTTTATCGGCGCCCGTTCTGAAATTGTTGAAGGCGTCATCGTTGAAAAAGGCGCCGTGGTTTCGATGGGCGTTTATATTGGACAATCAACCAAAATCTATAATCGTATGACCGGAGAAATCACTTACGGTCGCGTTCCAACAGGTTCCGTTGTTGTTTCAGGCAGCCTGCCGGCAGAAGATGGCAGCCACAGTTTGTATTGTGCTGTTATCATTAAACAAGTTGACGAAAAAACACGCAGTAAAACCAGTATTAACGAACTACTCCGCTGTTAAAAGTCACTTTCTTGCGGTGCTTTTAAGCACCGCTTGTGGTTCGAAAACTTCCCACACTAAAAAACTAGGAAAAATCATGTTGACATTATTTTTATTCACGTTTCGTGGCGATAATTTATTGTGCTTTTTTAAAGGGGATAAACGATGAGAACCCCGCAAGAATGCGCTCAAATCACCTTATTGGGCAAACAAAAATCAGATTATCCCACGCATTACGATCCTTCAATTTTAGAA
GCGTTCGCTAATAAACATCTCGAAAACGATTATTTCGTACATTTTATTTGTCCGGAATTTACCAGCCTTTGCCCGATCACGGGGCAACCGGATTTTGCCACGATTCATCTTGCTTATTTGCCGGATCAATTGCTGGTTGAAAGTAAATCCCTCAAATTTTATTTATTCAGTTTTCGCAACCACGGCGATTTTCACGAAGATTGCGTCAATATCATCATGAAAGATCTCATTACCTTAATGGCGCCGAAATATATTGAAGTTTTGGGGTGTTTTACGCCGCGCGGCGGCATCGCTATTCACCCGTACGCCAATTATGGTCGCCCGAACACAATTTATGCCGAAATGGCGCAACAACGTTTACAAAATCACCGCATCGCTTCTTAAAATTAAAGGATTTTTATGACAACTCATCTTTTATTAACCGCCGAACAGCATCAAAACGCTTTACGCTGGCTGGTTTTTTGGCATATCGTCGTTATTGCTTCCAGCAATTATTTAGTGCAAATTCCCTTTTCGGTATTTGATTTTCATACCACTTGGGGCGCTTTTACGTTTCCGTTCATTTTTTTAACCACGGATCTCACCGTGCGCATATTTGGCGCTCTTCCCGCGCGCCGCATTATTTTTTGGGCAATGTTTCCAGCATTAATAATTTCTTATTTCATCGGCGTTTTATTTTCTGACGGGCAATTTAACGGCTTTGCGCAATTGATGATTTTTAATACATTTGTTGGACGCATCGCGCTCGCCAGTTTTTTAAGTTATATCGTCGGTCAACTGCTGGATATTACGGTATTTAATCGTCTGCGGCAGTTAGAACAATGGTGGATTGCGCCGCTGGCTTCAACCATTGTAGGTAATGCTATTGATTCTTTAGTGTTTTTTACCACGGCATTTTATCGATGCAGCGATGAATTTATGGCAACGCACTGGGTTGAAATTGCCGCACTCGATTACGGCTGGAAAATGTTTATCAGCGTTACCTTTTTCCTACCCGCTTACGGTTGGCTATTGAAAAAATTGACGGCAATGTTATTGACCGTGCGCACTAAAACCACAAGCGTTACTTATTAATTTTGTGGCTCCATTTCGCGCTTTATTACGCGCGAAATGGGCAATTCAATGGTAAATAACGATCCCAAATCCACTTTTGATTTAATCACCAAACGTCCGCCGTGAATTTCCGCACCGTGTTTTACAATCGATAAACCCAATCCCGTTCCCGTCACGCGCAATCGTCCGCTATCAACGCGATAAAAACGTTCGGTAACGCGTGAAATATGTTCCGGCGCAATGCCGATGCCGTCATCGTTTACTGCAATTAAACACCATTCTTCCGTACACGCCGCATTGATTTCAATCGGATTTTTACTGTGCGCGTACACCAAAGCATTTACCAATAAATTACTGATCATGGATTGCAAAATTTCGGCGTTTACTTGCAGCGTAATGTGCTCAACATAACCAATTTTAATTCCTGTACTTTCAGGATATTGCGGCAATAAACTGGCTACAATTTCTTCAATAAACGGCAGTAATAATACTTCTTCCGTTTTCACCGCTGATTCCACTTCTTCCAGCGAAGCCAGTTTCAACATATCTTGAACCAACCCGTGCATTCTTTCGATTTGTTTCACCATTTCCCGCATCGGCACTTGCATATATTCCGGCAAATTTTGCGCCGATAATAATTCCAAAAAACCGCGAATTACGGTAATTGGCGTTTTTAATTCATGTGAAGCATTATCAATAAACGCTTTACGTTTTAAATCTAAATCAACAATCCGCGTCATATCGCGCGCGATGACCAATATTTTTCCCTGCGATAACAAGATGATATTAAATTCTAACCAATGACTGTCATCAACAATGAGTCGCGTCATAAATAAATGATTTTGTTGAATATGTTGCCAAAATGAACCAATACTTTCCGTGCGGATAAAATGATCGATGGGACGATTAATATCAGTACGATGATTTAAATGAAATAATAATTGCGC
GTGATGATTAAAATCCGCCAATCGCCCTTGTTTATCAATCACAAATGCCGCGTCAGGCATCGCATCAATGAGTTCACGAAATAACGAAATATAGTTATTGAGGCGCTTTTTGTGTTTTTTGGTACGCTCAAAACGGCGATAAGAAAGTCGCGAAATTTCATCTAACGGCGCGGGTAAATCAGGAGGCAATTCTTGCGGGCGATCGCGCAACCAGTCCAATAAATTTTGTTCAACACGCAAACGATACATTATGACGACCAATAAAATGGCGGCTAATCCCCAGCCGCCAAAAATCAAAAAATTCATACGTACATTTGCTTTTATATAAAATGAATTGCCAACGCTGCTAACGCCATCACAACGCCCAAACCGCCGCCGCCCAACGCTATTTTGAGCTGCCGTTCCATATCTGCCAGTTTGCGTTGTGCAAAAACTAAATCCAAACGTTGACGTTCCAAAGCATCTTCTAAATGCGCATTATTATTGCGTTCTTGAGAAGATTGACGATTTTCTAAATCATCAATTCTCCGTCTCAACGTTAAATCAACCGCTGGTTCTGGACGATAATTTTCCAAAGCTTCAATTCGGCGTTTGATTTCTTTTAATTCAGCGTTGGAAATGACAGGCGCTGCGGGTGTTGCGGTATTAAATGAAATTTCTGCAGCAACGGGGTGCGCGGGATCGAAAGAAATGGTTTCGCGCGCCGCATGCGCGGTTGTCGTTTTTGTCGGTTCTGGAATTTCTTCGGCAAGCGATTCAACAGTCGCATCATCGGTTTCTTGACTGCCCAAACGTTCATTGAGCGCATTAATTAAATGCGCCAAACGATTGGAGTCCGCCGGTTTTGGCAAATAACCAGTGGCACCGTTATCGCGCGCTTTTTTGCGCGCTTCTTCAGAAATATCCCCCGAATACATAATTACGGGAAGATCTCGGGTTTCAGGATTTTTACGCAGACGAGCAAGTCCTTCAAATCCATCTAAATCCGGCATCATCACGTCCATAAAAACGAGGTCTGGCAAAAGATGGTTGACAATCCAACGTTCAGCATCAATAACCCCGTCCGCTTCGTTTACTTCGATATGATGTGCTTCGAGTAATTTCTTTAACGTCAGTCGCGCTAAACGAGAGTCGTCTACAATTAAAGCAGTTTTAAGCATTTAAAAAACCTCAAATTTTACCAGTCTCCAGTTTCGCGCAGAGAAATCACCTGATTGCCTGCCACAACGAAATGATCCAATAAACGCACGCCCAAAGGTTTCAAGAAGTTTTCAATAAATTTTGTAGTTTGCAGGTCAGATTCAGATAGTTCCGTAGATTGCGCGGGGTGATTATGCACCAAAACCACCGCGGCTGCATCTGAATCAAGAACGGATTTCACCAATTTCCGGCACGATACATTCACTTGGCTTACTTCCGAACCAAAAGCCTGCCCGATTCTTTCGTACGCCAAATATTGATTTTTTTGATTGAGCAACAAAATGGCAACTTCTTCAAAACCCAGCCCTTTATAATGCATCAATAAAAAGTTTCTCACGTCATCAGCCGATGAAAACTGCCATTCTGGTAATTGCAAATCGGCGGCGATAAAACGTTTACATAATTCTGTTACCGCTAAAATTTCGGCAGTTTTTGCTTGACCAATGCCTTTGATGGGTAATAAATGTTCTTTACGGCTCGTTAATAAACCAACCAAACCGCCGCGAGAAGCAATTAATTCTTGTGCAAACGCTAAAACGGGTTTACCAATAACGCCCGTGCGCAACAAAATTGCCAATAATTCCGCATCAGATAAAGCACTCGCGCCGAGTCGCAAAATGCGTTCGCGCGGCATATCCAATGATGGATCGCTGGTGATATTTTCTTCAACCAGCGCTTCTTGAGAAGATTCGTGAACTGTTTCTGCTTGATGAACCGTTTCTTGAGCTGCCGGCATTTCGGGTTTTTCCACCGCAACAGCGGGTTCTTCGTGCGCGGGCTGCGGCGCGGGCGCTTGCGCT
TCGTTGTGATGAACATGTTTTGCCAGCGCGGTAACTAAAATATTCAATCGCTCCGGTTCAACTGGTTTTGGCAAATAACCCGTTGCGCCGTTTTCTTTGGCTTTGCGTCTGGCTTCTTCGGAAACATCGCCGGAATACATAATCACGGGTAGAGCGTGTGTTAATGGATTTTTGCGCAGACGAGCAAGCCCTTCAAAACCATCTAATTCTGGCATCATGACGTCCATAAAAACGACATCGGGAATTGCCTGATTAAAAATAATCTTCTCACCTTCCAAAACACCTTGCGCTTGACTGACTGTAATACCATGTTTTTCGAGCAAGAGCCTTAGCGTATGTCTTGCCAAACGGGAGTCATCAACAATCAAAGCAGTTTTAATCATCTAAATAACCTCAAATTTTTTGTTTTTTACCAGCATCGGCTCCACCCAAAAAGGCTGATCGATTTCCGTTTTTCTGGCATAAGACCGTAGAATACAATCACTGTTTATAATTGAGCGGTTTTCACCTGCGTGATCATACAGGGATTATTGTTATCGCGCTAATCTGGGAAAAACCGGCCTAATTCTACCTTTGAGCAGGGGACAGATTCAACATTAAGGATATTGAAACATGAGCCAAATCATACTCGGCATTTGTGGCGGCATTGCTGCTTATAAATCAATTTTACTCGCTCGAGAACTGAATAAACAAAACCACCGCGTTCAGTGCGTTTTAACCGAATCCGCGCGCACATTCGTTACCGAAGAAACTCTACAAGCGATAACGGGACTTGCACCGCGCCACGATTTATTCGATGCCAATGCAGAAGCGGCGATGAGCCATATCGAACTCGCTCGCTGGGCAGATATTTTGTTGATTGCGCCGGCAACGGCAAATACCATCGCCAAACTCGCGCACGGCATCGCTGATGATTTATTAACAACGCTTTATCTTGCAACCGATGCAGAAATCGTCATCGCGCCGGCAATGAATCATATGATGTGGCATCACCCCGCAACTCAGGAAAATATTGCCATTTTGAGTCGTCATCCCAAACATCACGTATTACCCGTTGCTTACGGCGAACAAGCCTGTGGCGAAAGCGGTTTGGGCAGAATGTTAGAACCCGAAGAAATTATTGCCGCTTTACCTCATTTAACCGCGCAAGATTGGCAAAACATTCGTTTGACGATTACCGCCGGCGCCACGCGCGAACCCATCGATCCCGTGCGTTATATTTCTAATCATAGTTCGGGGAAAATGGGCTACGAATTAGCCAAAAACGCCCTCGCCCGCGGTGCAAAAGTAACGTTAATCAGCGGCATCAGCAATGTAGCGCCGCCCAAACAATGCCGATTGATTACCGTAAACACCGCGCTGGAAATGTATGACGCGGTGCATTCTGTTTTAGCGGATACGGATATTTTTATCGGCGCGGCAGCCGTTGCCGATTATCGTGTTGCGCAGCCGGCAGCGGAAAAAATCAAGAAAAGTGTACACGGTCTTGCGCCGCTGACTTTAATTGAAAATCCGGACATTATTGCCAGCGTTGCCGCTGCCGCAAATCGTCCTTTTACCGTTGGTTTTGCCGCAGAAACGGAGCATTTACTCGATTATGCGCGCGCCAAAAAACAACGTAAAAATTTAGATATGATTATTGCCAATGATGTTCGTCAACACGTTTTTGGAAGCGATACCAACAGCGTAACGATGATTGGCGCCGGCGGCGAAATAACGCTGCCCACTCAATCTAAAGCATTAATTGCGCAACAAATTTTAGATTATATTTTAACGTGTTACCAAAAAAATGATTGCAGTTCCCTATAAAATTTTAGATCCGCGTTTAGGCGGTGAAATTCCGTTACCTACTTACGCGACAAGCGGCAGCGCGGCGCTGGATATTCGCGCCGTTTTTGCAGAAGAATCCATTGTGCTGGCGGCTGATGAATGCCGTTTAATCGGCAGCGGTTTGGCGTTTCACATTGCCGATCCGAATTATTGCGGCATCGTTTTACCGC
GTTCGGGTTTGGGTTATAAACACGGCATTGTTTTGGGAAATTTGGTAGGACTGATCGATAGCGATTATCAAGGAGAATTAAAAATTCCGTTGTGGAATCGCAGCCAAACGCCTTATACCGTCACTTTGGGCGAACGTATCGCACAATTATTATTTCTGCCGATTGCTCAAGCGCAACTTTTTCCGGTGGAATCGTTTGATCAAAAACAATCAACGCGCGGCAGCGGTGGATTTGGTCATACGGGGCGTTTTTAATACGACGTTCAATCTTTTTTCATTCGTGCGCGCTTTTAGCAAAAGTTCGCTGCGGCGACGCGCGCCTTGATTTTTTAATTTTTATCCTCAAACATGAGGGGCAGTTATTTCATATTTATTTAAATCACGCATAAGGAATTGATCATGCACGACCCAAAAGAATCGCTTGAAACAAATATTCAAGAAACAGAATCACAAGAAAAACTTCCAGAAACGCCGATTATCGAAGAAGAACCTATTTTAACTTTACCCGATGATCAAATTAATCAATTACAACAGGAAGTTGCAGAACTCAAAGACCAATTAATCTGGCAAAAAGCAGAAAATGAAAATCTGCGCAAACGGCAAGCGCGGGAACTGGAAAATGCTTATAAATTTGCCAGCGAACGTCTATTAAAAGATTTATTGCCCGTTATTGATTCCTTGAATTTGGGTTTGCAAGCCGCACTTGATACGGAAAATGAAGCCGTTAAACAATTTATTACCGGATCGGAAATGACGCTGACGATGTTCCAAGAAACATTAGCGCGCCACGGCATTGAAGAAATTAATCCCGTCGGCGAAAAATTTAATCCGGAATTACACGAAGCCGTCACCATGACGCCGTCAGAAGCACACGAACCTAATACCGTCATTCAAGTAACGCAAAAAGGTTATCTTTTAAACGGTCGGACGGTACGCGCCGCGCAAGTGATTGTCAGCAAATAATATTTTTTAATTTATGAACTTCCGGTCATGATGACTTGTTTTTTTTAAAAGCCGTACCCATTAAGGCGAATAATCAGTGATTTTACGTTTAAACGAGGATTAAATATTATGGGAAAAATTATTGGAATTGATTTAGGAACTACGAACTCTTGCGTTGCGGTGATGGACGGCGATTCTGCCAAAGTGATTGAAAACTCTGAAGGAACGCGTACCACCCCTTCCATTATTGCTTTTAGCGATGGTGAAGTTTTGGTGGGACAACCGGCAAAACGTCAAGCTGTTACCAATCCTAAAAATACGCTTTACGCCATTAAACGCTTGATTGGTCGGCGTTTTGATGAAAAAGAAGTGCAAAAAGATATCAATTTGGTTCCGTATAACATCGTCAAATCAGATAACGGCGATGCTTGGGTTGAAATTGATGGCAAAAAAATGGCGCCGCCGGAAATTTCTGCGCGCATTTTGCAAAAAATGAAAAAAACCGTTGAAGATTATTTGGGCGAAACCATCACCGAAGCCGTGATTACGGTTCCCGCTTATTTTAATGACAGTCAACGTCAAGCAACTAAAGATGCCGGACGCATCGCCGGTTTGGAAGTTAAACGCATCATCAACGAACCAACCGCGGCAGCGTTGGCATACGGCATTGATCGCGGCGCGAAAGATGCAAAAATCGCGGTTTATGACTTGGGCGGCGGTACTTTTGATATTTCTATTATTGAAACCATTGATTTAGATGAAGAAGGTCAACAATTTGAAGTATTAGCCACCAACGGCGATACTTTTTTAGGCGGGGAAGATTTTGACCGCAGAATTATTGATTATCTCGTCAATGAATTCAAAAAAGAACAAGGAATTGATTTAACAAGCGATTCTTTGGCTTTGCAACGTTTAAAAGAAGCGGCAGAAAAAGCCAAAATTGAATTATCATCAAGTCAACAAACCGATATCAATTTGCCGTATATCACCGCTGATGCCAGTGGCCCTAAACATATGAATTTGAAATTAACGCGGGCAAAATTGGAATCTTTAGTTGCT
GATTTAATCGAACGCTCTTTGGAACCCTGCCGCATCGCGATGAAAGACGCAGGTTTATCCAATAGCGACATCACTGATGTGATTTTGGTCGGCGGACAAACCCGCATGCCCAAAGTTCAAGAAGCGGTGAAAAATTTCTTTGGTAAAGAACCGCGTAAAGACGTAAACCCTGATGAAGCGGTCGCGATGGGTGCGGCGATTCAAGGCGGCGTTTTGGGCGGTCAAGTAAAAGATGTTTTATTGTTAGACGTTACGCCGTTGTCTTTAGGTATTGAAACTTTAGGCGGCGTGATGACGAAATTAATTGAAAAAAATACGACGATTCCAACCAAAGCATCTCAAATTTTTTCAACGGCTGAAGATAACCAATCAGCGGTCACCATTCATATTTTGCAAGGTGAACGTCAACAAGCGTCCGCTAATAAATCTTTGGGACGTTTTGATTTATCGGATATTCCGCCGGCACCGCGCGGCATGCCACAAATCGAAGTTTCTTTTGATATTGATGCCAACGGTATTTTGAACGTTTCGGCAAAAGATAAACAAACGGGCAAAGAACAATCCATCATTATCAAAGCCAGCTCTGGTTTATCCGATGAGGAAGTTGCGCGCATGGTGAAAGATGCCGAAGCGCACGCCGAAGAAGATCGGAAATTCCAAGAACGGATTGAAACTAAAAATAGCGCTGAATCGATGATTAACGGCGTTGAAAAAGCCATCAGCGAATTGGGCGATGAAGTAACCAGCGACGAAAAAGAAAAAACAGAAGCCGCAATTAAAGCGTTGCGCGAGGTGATGAAAGGCGAAGATAGCGACGCCATTAAAGAAAAAACCAATGCGTTAATGGAAGCTGCGAGTTCAATCATGCAAAAAGCGTATGCCAAAATGACGGAAAAACAACAATCCGATGACGGCGCAGGCACACAAAACGCAGATCATAAAGAAGATGATGTTGTCGATGCTGACTTTGAAGAAGTTAAAAGCGACAAAAAAGATTAATTTACTTCTTTTCGGGAGACTTATAGGGGCGGTAGCCGCCCCTTTTTAAATCTATGGCAGATAAAGATCTATACGCAATACTCGGTGTATGCCGCACGGCAAACCAAGATGAAATTAAAAAAGCCTATCGCAAATTATCGATGAAATGGCACCCTGATCGTAATCCGAATAATAAGGAAGAAGCCGAAGAAAAGTTCAAAGAAATCAATAAGGCATATGAAATTCTTTCCGATAGCCAAAAAAGAGCTTCTTATGACCGTTTTGGTTTCGATGCTGCCAATCAAGGCGCTGCCGGTGGCGGATTTAGCGGCGGTAATTTCTCTGATATTTTTGGTGACGTTTTTGGCGATATTTTTGGCAATGTGCGGCAAACAAACGGCGCGCGCCAAACGCGTGGTCATGATTTAGCTTATAAAATTGAATTATCCTTGGAAGAGGCAATTCACGGCGTAGAAAAACAAATTCGCATTGCCACTCAAGTGCGTTGCGGCGAATGTCACGGCTCGGGCATGAATGCAAAATCCAAGAAAAAAACCTGCCCCACCTGTAACGGCGCCGGACAAGTCAGAATGCAACAAGGATTTTTCTCTATCGCCCAACCTTGCCCTACGTGCCACGGTCGCGGTGAAATTATTGAAAATCCGTGCAATAAATGCCAAGGAACGGGACGCGTTAAAGATACGCGCGTTTTAACGGTCAATATTCCCGCCGGCGTTGATAACGGCGATCGCATTCGTTTAAGTGGCGAAGGCGAAGCGGGAGAATTAGGCGCACCGGCTGGCGATTTATACATTGAAATTTTCGTCCGCGCTCACCCCATTTTCGAAAGGCAAGGTAATGATTTATATTGCAAAATGCCTATTAGTTTTACTACGGCTTGTTTGGGCGGCGATTTGGAAGTGCCTACGTTAAACGGACGCGTAAAATTAAGCATTCCCGAAGAAACGCAAACCGGCAAAACCTTTCGTTTGAAAGGAAAAGGCGTGCAATCGGTGCGCAGT
AACAGCGTTGGCGATTTATATTGCACGGTTACGATTGAAACGCCGATCAATTTATCCAAAGCGCAAAAAGAGTTATTGATGAATTTTGAGCAAGCGCTCAATGAAGGCGGAAAAACGCATACGCCTCAAGCTAAAGGCTTTTTTGATAATATCAAGCAATTTTTTGATAATTTGTAA
Protein sequences of DBSCAN-SWA_5 >NC_009446|856223:870326|859099_859915_+|WP_012031140.1|DBSCAN-SWA MDLQNTIETAFEKRSEISPKTADSALVTAVNETLALLEEGMIRVAEPTPEGWKVNEWIKKAVILSFRLYDNHVIPHGYTHYFDKVASRYADYDEVRFNADGVRVVPPAVARRGTFLGKGVVLMPSYINIGAFVDEGTMVDTWATVGSCAQIGKNVHLSGGVGIGGVLEPLQASPTIIEDSCFIGARSEIVEGVIVEKGAVVSMGVYIGQSTKIYNRMTGEITYGRVPTGSVVVSGSLPAEDGSHSLYCAVIIKQVDEKTRSKTSINELLRC >NC_009446|856223:870326|860059_860533_+|WP_012031141.1|DBSCAN-SWA MRTPQECAQITLLGKQKSDYPTHYDPSILEAFANKHLENDYFVHFICPEFTSLCPITGQPDFATIHLAYLPDQLLVESKSLKFYLFSFRNHGDFHEDCVNIIMKDLITLMAPKYIEVLGCFTPRGGIAIHPYANYGRPNTIYAEMAQQRLQNHRIAS >NC_009446|856223:870326|860548_861223_+|WP_012031142.1|DBSCAN-SWA MTTHLLLTAEQHQNALRWLVFWHIVVIASSNYLVQIPFSVFDFHTTWGAFTFPFIFLTTDLTVRIFGALPARRIIFWAMFPALIISYFIGVLFSDGQFNGFAQLMIFNTFVGRIALASFLSYIVGQLLDITVFNRLRQLEQWWIAPLASTIVGNAIDSLVFFTTAFYRCSDEFMATHWVEIAALDYGWKMFISVTFFLPAYGWLLKKLTAMLLTVRTKTTSVTY >NC_009446|856223:870326|862469_863309_-|WP_012031144.1|DBSCAN-SWA MLKTALIVDDSRLARLTLKKLLEAHHIEVNEADGVIDAERWIVNHLLPDLVFMDVMMPDLDGFEGLARLRKNPETRDLPVIMYSGDISEEARKKARDNGATGYLPKPADSNRLAHLINALNERLGSQETDDATVESLAEEIPEPTKTTTAHAARETISFDPAHPVAAEISFNTATPAAPVISNAELKEIKRRIEALENYRPEPAVDLTLRRRIDDLENRQSSQERNNNAHLEDALERQRLDLVFAQRKLADMERQLKIALGGGGLGVVMALAALAIHFI >NC_009446|856223:870326|866547_867111_+|WP_012031148.1|DBSCAN-SWA MHDPKESLETNIQETESQEKLPETPIIEEEPILTLPDDQINQLQQEVAELKDQLIWQKAENENLRKRQARELENAYKFASERLLKDLLPVIDSLNLGLQAALDTENEAVKQFITGSEMTLTMFQETLARHGIEEINPVGEKFNPELHEAVTMTPSEAHEPNTVIQVTQKGYLLNGRTVRAAQVIVSK >NC_009446|856223:870326|869201_870326_+|WP_012031150.1|DBSCAN-SWA MADKDLYAILGVCRTANQDEIKKAYRKLSMKWHPDRNPNNKEEAEEKFKEINKAYEILSDSQKRASYDRFGFDAANQGAAGGGFSGGNFSDIFGDVFGDIFGNVRQTNGARQTRGHDLAYKIELSLEEAIHGVEKQIRIATQVRCGECHGSGMNAKSKKKTCPTCNGAGQVRMQQGFFSIAQPCPTCHGRGEIIENPCNKCQGTGRVKDTRVLTVNIPAGVDNGDRIRLSGEGEAGELGAPAGDLYIEIFVRAHPIFERQGNDLYCKMPISFTTACLGGDLEVPTLNGRVKLSIPEETQTGKTFRLKGKGVQSVRSNSVGDLYCTVTIETPINLSKAQKELLMNFEQALNEGGKTHTPQAKGFFDNIKQFFDNL 
>NC_009446|856223:870326|856223_857576_-|WP_041729907.1|DBSCAN-SWA MSRKYFGTDGVRGEVGKEPMTPQWVMHLAWAVGSVLCDNGFAGEKVLVGKDTRLSGYLFENAIIAGLSAAGMEVHMLGVMPTPGIAYFTRTFNAAAGVVVSASHNVFSDNGVKVFARGGYKLPDSAEQQIEAYLDKPMTLVAPAKMGKVYRVAEARGRYIEFCKNSIPLNLPFHRLKLVLDCAHGATYAVAPDVFRELGAQVIVTNAAPNGCNINDGCGSTHPEALQQRVCQEQADIGIAFDGDGDRVIMVDQRGRIIDGDAILYIIAKYRAFKGEQLNTIVGTLMTNLGLEQSLRKMGITLHRAQVGDRYVLEQLQLLQARVGGENSGHIICLDRNSTGDGIIAALQVLAAILEMETTLEDLIADLALTTQVLKSVTVVERDAVINSAAVQTAIQNAEEKLRNRGRLLLRPSGTEPKIRVMAEGDDETLLNDVVNELIATIKEQEKTHA >NC_009446|856223:870326|864761_865958_+|WP_012031146.1|DBSCAN-SWA MSQIILGICGGIAAYKSILLARELNKQNHRVQCVLTESARTFVTEETLQAITGLAPRHDLFDANAEAAMSHIELARWADILLIAPATANTIAKLAHGIADDLLTTLYLATDAEIVIAPAMNHMMWHHPATQENIAILSRHPKHHVLPVAYGEQACGESGLGRMLEPEEIIAALPHLTAQDWQNIRLTITAGATREPIDPVRYISNHSSGKMGYELAKNALARGAKVTLISGISNVAPPKQCRLITVNTALEMYDAVHSVLADTDIFIGAAAVADYRVAQPAAEKIKKSVHGLAPLTLIENPDIIASVAAAANRPFTVGFAAETEHLLDYARAKKQRKNLDMIIANDVRQHVFGSDTNSVTMIGAGGEITLPTQSKALIAQQILDYILTCYQKNDCSSL >NC_009446|856223:870326|857584_858979_-|WP_012031139.1|DBSCAN-SWA MSYFTREKHELTAATPWEKTVAKMLSPFNRFMNSANGSGILLVFLTIVALVFANTSCREWYERVLNQQLLVQMGSFKIDMTIHYWINDALMTLFFLMVGLEIKYEMKVGRLASLKRAVLPIFAALGGMIVPALIYFSFNSQGETVSGWGIPMATDIAFAIAILLLLKGKVSPSLTAVLVALAIVDDLGAVIVIAIFYTDNLAWSPLIAAFLCFAVLLLLNRGGIRALWAYIAIGSLMWVFMLFSGVHATVAGVLTALATPMNAVYSPTEFSTQARQLLDEFDAHPDSKTQVAHSRELNDLLQQLSTGIQKTQTPLQRLEHILNTPVYFLIVPLFVLFNAGVHVELNNLNALLHSPVLKGVFFGLVFGKLIGVVSAIMICVKLKIAALPADATFKQVLGIGMLAGIGFTMSIFVSELAFSGQAQHLAEAKITILAASLTAATLGYCWLRFITGAAKAETGTSVSQ >NC_009446|856223:870326|867219_869148_+|WP_012031149.1|DBSCAN-SWA 
MGKIIGIDLGTTNSCVAVMDGDSAKVIENSEGTRTTPSIIAFSDGEVLVGQPAKRQAVTNPKNTLYAIKRLIGRRFDEKEVQKDINLVPYNIVKSDNGDAWVEIDGKKMAPPEISARILQKMKKTVEDYLGETITEAVITVPAYFNDSQRQATKDAGRIAGLEVKRIINEPTAAALAYGIDRGAKDAKIAVYDLGGGTFDISIIETIDLDEEGQQFEVLATNGDTFLGGEDFDRRIIDYLVNEFKKEQGIDLTSDSLALQRLKEAAEKAKIELSSSQQTDINLPYITADASGPKHMNLKLTRAKLESLVADLIERSLEPCRIAMKDAGLSNSDITDVILVGGQTRMPKVQEAVKNFFGKEPRKDVNPDEAVAMGAAIQGGVLGGQVKDVLLLDVTPLSLGIETLGGVMTKLIEKNTTIPTKASQIFSTAEDNQSAVTIHILQGERQQASANKSLGRFDLSDIPPAPRGMPQIEVSFDIDANGILNVSAKDKQTGKEQSIIIKASSGLSDEEVARMVKDAEAHAEEDRKFQERIETKNSAESMINGVEKAISELGDEVTSDEKEKTEAAIKALREVMKGEDSDAIKEKTNALMEAASSIMQKAYAKMTEKQQSDDGAGTQNADHKEDDVVDADFEEVKSDKKD >NC_009446|856223:870326|865938_866403_+|WP_012031147.1|DBSCAN-SWA MIAVPYKILDPRLGGEIPLPTYATSGSAALDIRAVFAEESIVLAADECRLIGSGLAFHIADPNYCGIVLPRSGLGYKHGIVLGNLVGLIDSDYQGELKIPLWNRSQTPYTVTLGERIAQLLFLPIAQAQLFPVESFDQKQSTRGSGGFGHTGRF >NC_009446|856223:870326|861219_862455_-|WP_012031143.1|DBSCAN-SWA MNFLIFGGWGLAAILLVVIMYRLRVEQNLLDWLRDRPQELPPDLPAPLDEISRLSYRRFERTKKHKKRLNNYISLFRELIDAMPDAAFVIDKQGRLADFNHHAQLLFHLNHRTDINRPIDHFIRTESIGSFWQHIQQNHLFMTRLIVDDSHWLEFNIILLSQGKILVIARDMTRIVDLDLKRKAFIDNASHELKTPITVIRGFLELLSAQNLPEYMQVPMREMVKQIERMHGLVQDMLKLASLEEVESAVKTEEVLLLPFIEEIVASLLPQYPESTGIKIGYVEHITLQVNAEILQSMISNLLVNALVYAHSKNPIEINAACTEEWCLIAVNDDGIGIAPEHISRVTERFYRVDSGRLRVTGTGLGLSIVKHGAEIHGGRLVIKSKVDLGSLFTIELPISRVIKREMEPQN >NC_009446|856223:870326|863326_864085_-|WP_081423617.1|DBSCAN-SWA MPAAQETVHQAETVHESSQEALVEENITSDPSLDMPRERILRLGASALSDAELLAILLRTGVIGKPVLAFAQELIASRGGLVGLLTSRKEHLLPIKGIGQAKTAEILAVTELCKRFIAADLQLPEWQFSSADDVRNFLLMHYKGLGFEEVAILLLNQKNQYLAYERIGQAFGSEVSQVNVSCRKLVKSVLDSDAAAVVLVHNHPAQSTELSESDLQTTKFIENFLKPLGVRLLDHFVVAGNQVISLRETGDW |
13 | Hokovirus(18.18%) | NA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_6 |
939847 : 953204
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NC_009446|939847:953204|DBSCAN-SWA CTTATTTTTGCCGCCATAATGTGAAGCTATCTTTTAACCGTTTAAAAAATCCTGCGGCAGCAATATCTTTACCGGCAAGCGCGGGAACGGTGGCATAACTTTTGCTGCCATCGGTAACGGTAATCGTGCCCACTTTTTGCCCTTTTGTAATCGGCGCGATCAGCGTATCCAAAGCAACGTGCGCACTGATCGCATTTTGTGCGGTAACGGGCAACAATAAATGAACGGTGCGCTCGGCAACGATCGGCACTTCCGCTTCCAATCCTTTATACACCTTGCCGCGCAATAATACTTGCTCGGCGCGCAACGCTGTTACCTGCGTATAATTTGCAAAACCGAAATTGAGCAAATTTTCTGCCGCATCATAACGTTGCGCCTCTTTTTCTGCCCCTAAAACCACGCCAATTAACCGCCAATCGCCGCGTTTTTCACTCGCTGCTAAACAATAACCCGCGGCATCGGTATAACCCGTTTTCAAGCCGTCAACATTTTGATTGCGCCACAATAAACGATTGCGGTTGGGCTGTTTGATATTGTTCCACGTAAATTCTTTTTGCTGATAAAAATGATAAAACTCGGGAAAATCGCGAATTAACGCCTGAGCGAGCCGCGCAATATCATGAGCGGTGGAAAAATGTTGTTCATCGGGCATGCCGGTTGGATTTTGATAATGCGTTTTTTGTAAATCTAATTGTTTGGCGGTGGCATTCATCATATGAACAAATCCTGCCACTGTTCCGCCAATATATTCCGCCAGCGCCACGCTGGCATCGTTGCCCGATTGCACGATCATGCCTTGCAATAAATCGTTCACCGAAACTTGAGAATTAACTTCCACAAACATACGCGAACCATTTTGCGCTCGCGCATTTGCGCTGATCGTCACCAAATCGGTTAAATGAATTTTGCCGGTTTTTAACGCGTTAAACGTCACGTATGCCGTCATCAATTTTGTAATACTCGCCGGCGGCAAACGCTCATCGGCAGCGCGTTCTGCCAATATTTGTCCACTTTGCGCGTCCATTAAAAGATAACTTTTTACCGGCAAATCCGGCGGCGGTACTTCTGCGCGCACGCCGAAAAAAAGCAAAAAAACCGTCACAAAACAACATTTTTTCCAAAAAAGCATTTTCTTCCTTTATTTTTGAGAATGATTCACGCGTTACGGTTAAGCGGCGTTGTGGTAGCGGCTTTTAAACGCTTCTGGATCGGGAAACGATAATTCAATCCGCAAACCCGTTGGTTTATTTTCCAATAATTGAATCGTGCCGCCGTGTAAACTCATGATTGCGGCGACTAATGATAATCCTAATCCGTTTCCTTGCGTACTTCGGGCGGCATCTAAACGCACAAAACGCTTTAATACTTTTTCCCGATCTTGCTCGGCAATGCCTTTTCCATCATCACACACGGCAACTACAACGGTATGTTCCCGCCGTAACGCTTGTAAAAAAATATGACCGCCTTCCGGCGTATATTTCACGGCATTATCCAATAAATTTGTAATCGCTTGAGCGGTTAATTGTCGATTGCCCATAATCATCAAATCCGCGTCAATTTGTGCACAAAAATGATGCATGCCTTCTTCCTCTTCGTTGAGCACTTCATACATTTCTGCCATATCTTCAGATATTTGCGACAAACTGATCAATTCAAAATCATCACGAGCGCGCGATTCAACTTGCGCAATATTGAGCAAGGAATTGAACGTTTGCAACAATCCTTGGGCATCAACAATCGATTGTGAAATCACCTCGCGCAATTCGGCATTACTGCGATGCTCGTCTAATAACGCAACTTCCATGCGGCTGCGCAAACGATTCAGCGGACTGCGCAAATCGTGAGCAATATTATTAGTAACTTGCCGTTGACTGGTAATTAAGCGCTCGATTCTGTCGAGCATGTGATTTAAATCTTCGGCGAGTAA
TTGCAATTCAATGGCATCATTGCCTTTAATTAAAATGCGCTCGGAAAAATCGCCGTCAACAATGACTCTTGCCGTGCGGCTGATACGGGCAATAGATTTTGTAATCTTTTGTCCAATCATAAAAGAGCCAAAAAACGATGCCAGAATTAAAACCGCCGTCATCAGCATTGCCATCGTATTCATTTTATGCAAAATCTGCCGTTCGCTGCGGGTGTCATACCCCACGATAACCGCATATTGATTGCGCAGCGGCATGACGATCACGCGCATGATATTTTTGCGATTGACGAGCGGCTCGTGTTCTTCATTATTCAAATAATTTTGACTTTGCATGTGGCGCTTGAGATTACACAAATCATCAAAAGAACTGTTTGTCATAATCATCACCGGCATTTCTTCATCAAAGCTCGGATATTCCTGCAAACAATAAGACATCGCCGGCTCTGTTTGCACGCGTTCTGTAATCCCCGTTGCCAGCGGATTAATATCAAAGCTGCTGTGATATTCATAGCGCCTTTGTAAACGCATACTTTCCGACAAAAGTCGCGCGTCAATTTGATCAAAAATTTGCTGGCGCGCGGTTTGATAAATCATTACCATTGCCGCGCCCGTAATAATGCTTGCGACCAAAGTAAACAACAATGAGAACCGGAATGCCGAAATTCTCATTGTTTGTTTGGCGTCGCTGATAATTTTAGCGATAAGTGGATGGATCATACATGCTGTAACCGGCGCCGCGTTCGGTATGAATTAAAGGAGGATTAAAATCTTTGTCGATTTTACTGCGCAAACGTGAAATATGAACATCAATCACGTTGGTTTGCGGGTCAAAATTATATTCCCAAACTTTTTCTAAAAGCATGGTGCGCGTCACCACTTGCCCAGAATTGCTCATTAAATATTCCAAAAGACGAAATTCGCGCGGTTGTAAATTAATCTTTTTTCCCGCACGGGTTACTTTGCGCCGCAATCGATCGAGATGCAAATCCGCAATTTCTAGTACCATTTGATCGTGAACGTCGGTATTACGCCGCCGCGCCAGCGATTGCACGCGTGCTAAAAGTTCCGAAAACGCATAAGGTTTAACCAAATAATCATCACCGCCGGCAGCAAGTCCTTCAACGCGATCGTCAACTTCTCCCAAAGCCGAAAGAATTAATACTGGAGTACTGGCTCCTTGAGCGCGCAATTGTTGAATTAAAGAAAGCCCATCTAATTCCGGCAACATGCGATCAACGATTAAAACGTCATATTGACCTTTTTGCGCTATCATTAATCCTTTGCTGCCGGTATGCGCGGTATCAACAACATAACCGCTTTCAGTCAATCCTTTTGCAATATAGGCTGCGGCATCTTGATCGTCTTCAATTACCAATATGTGCATATTTTCCTCCCAATTTGTTATCAGCATTTAATTGACCGCTCCATTGATGTCACTGCGGCAATTTAGTAAAAATATTGTACTCATCTTATCGGAAATAGGAGTAATGATGAATAAAACCTTAGTAACCCTTATTTTTGCAATAAATTTGCTCTTCTCCAGTTTAATTCATGCCGATCTGCCACCGAATTTTACCGAATTAGTTGAAAAAGCTTCTCCGGCAGTTGTCAGTATTGAAGTGCAAAGTCTTGTTACGCAACCAAAATTTTCAAGACTTACGCCTCCTTTTCCTGATTTGTTTGAACATTTTTTCGGCGAACAAGGCTCACCTTTTAGCGAACCATTCCCAGAAGAACAACCCGAAAAAGAATTGCGCAAAGGCAACGGCTCCGGATTTATTATTGACGCAGAAGGTTATGTTTTAACTAATGCGCACGTTATTGATGGCGCTGATAGCGTTTCCGTATTATTAACCGATCAACGCGAATACAGCGCTGAAATTGTTGGCGTTGATAAACGTACCGATATCGCATTGTTAAAAATTGCCGCGCAAAAATTGCCCACCGTGCAATTAGGCGATTCTGACGCGGTAAAAGTTGGCGATTGGGTACTCGCAATT
GGTTCCCCTTTTGGTTTTGATACCACTGCCACCAAAGGCATCGTTTCTGCTTTGGGACGCAGCCTCCCCAGCGGAACCTATACACCTTTTATTCAAACCGATGCCGCCATCAATCCAGGCAATTCCGGCGGACCTTTATTTAACGGCAAAGGCGAAGTAATCGGCATTACTTCACAAATTTATACCCGCAGCGGCGCCTTCAATGGCGTTGGTTTTGCCATTCCCATTAATTTAGCCAAAACTATTGCCGAACAATTAAAAACCACTGGCAGCGTCAACCGCGGTTGGTTGGGCGTTTCCATTCAAGCCGTTGACCAAAAATTGGCAGAATCTTTTGGTATGGAAAAACCAGAAGGCGCATTAATTGCGCAAATTGTTAAAGATGCGCCGGCGGAAAAAGCACAATTAAAAGTCGGCGATATTTTGCTTTCTTTTAATGGTCATACCATCAATAAAGCCAGCGATTTACCGCCGCTCGTTGCGATGGCGCCGTTGGGTAAAGATGTAGAAATTGAATATTTGCGCGACGGCAAAAAGCAAACCACTACCGTCAAAATCGAAAATTTAGAAACAGCTGACACTTCATCTGCGGCAACATCGCGTGAAATGCGCAATTGGGGCATTGAACTCAAAGCGCTTGATGATGATACGCGTAACGCTTTGGAATACGAAGATAAAGAAGGCGTCTTGATTGCGCGCGTTGAACCGAATTCGGCAGCAGCAAAATCAGGATTGCGCGCCGGCGATATTTTAATTGCCGTTGGCGACAGCATCATCAACACGCCTAAAGAAGCTTCTAAATTATTAGCGAAAACGGATCGCGCGCTGCCGGTACTCATTTACCGCCGCGGCTCAACAATTTTCTTGCCGCTGATGCCCGAGAAAAAGAAATAAATCCGTTTTTCCATAAAAAAATCTGTACCCTTTGCGGTACAGATTTTTTTATCACTTAATTTTTTTGCCCCAAACGGTAACCAATTCCACGAATCGACATCACCGGCAATTGTTCGTCAATATTTTCAAAAGCTTTGCGCAAACGTCTAATGGCAACGTCCACCGCGCGATCGCCAATATCTTCTTCGCCCTGCCAAGCAAAATCTAATAATTGCTCCCGACTTAATACTTTATTGGGGTGGCGCAAAAAACATTCTAATAATTGAAATTCTCGCCGATTGAGTTTTAATAATTTATCTTGAAAAAAAACTTCATACGTCAATAAATTCATCGTAATGTCTTCCCAAGATAAACAAGATTGTTCTTCTTCTGATTCTGGCATGCGGCGAAATAATGCTTTTATGCGTGATTGCAGTTCTGCCATCGAGAATGGTTTGGTTAAATAATCATCAGCGCCGTAATCCAATCCTTTGATAATATCGCGTTCATCGGCGCGCGCGGTCAACATGATCACCGGCAAATAACGCAACGTTTCTATTTTGCGAAGATCTTCTAAAATTTTAATACCGCTGCCGTCGGGCAACATCCAATCCAATAAAACCAAATCAGGAATTTGCTGCTGCAAAATCGTTTTTGCCTGCGCTAAAGTTCCGGCTTCAATGCATTGATACTGGTCTTGCAAACCGTATAACAATAATTGTTGAATAGCCGGCTCGTCTTCAACTAATAAAATACGTTTTTTCACTATATTTTCCTCAATTGATAACCAATAAGCGCTTCTATCTTGCCTCGCCTACGATGGATTAACAAAAATACGTTGCCGATAATGCCGCAATTCATCGATAGAATCGCGAATATCTTGCAATGCTTGATGCGTCGTATTTTTATGAAAATAATGCTCGGGATACCAACGCGCCACCAATTCTTTGATCGTTGAAACGTCCAAATTGCGGTAATGAAAAAAAGCTTCTAAACGCGGCATCAAACGCGATAAAAACCGCCGATCCTGACAAATACTATTGCCACAAATCGGTGAACGCCCTTTTTCGCTGTATTGCTCTAAAAACGATAACGTTTGCGCTTCGGCTTCAAGCGTATTAAATGAA
CTTTGCCGCACGCGCTCAATCAAACCCGAAGCGCCGTGATGCTTTTGATTCCACGCGTCCATTTGCGCCAAAATCGGCTCCGGCTGATAAACAGCAATCACCGGACCTTCTGCCAAAATATTTAATTGTGCATCGGTAACAATCGTTGCAATTTCAATGATTTCATCGTGTTGCGGATCCAATCCGGTCATTTCCAAATCGATCCAAACCAAATTATTTTTTTTATCCACGAGGTTATCCTTTTGAACTTTTTAAGTACGCAAACCATTATTCCGCGATTATGGCGCGAGGGTCAACGTTTGCCCCAATAAAGAAAGCGGCGCATTACTATAAATTGCCTGAAAAACGGGCGGATTTTCGAAAAAAAATCGGACGCAAACCGGATCGCGGACGCATTCTTGCCAAAGCGCCGAAAAAATCCAATCGCGATTAAAAACCGCCGTCAACCAAAAATAATGACGATTTAACGCGCTTTTTAAACGCCAAATTGGCGCAAATTCTGCCACGCCAACGGCTTTCATTTGCGTAGGAAAACGTAAATCTTCCGCCTTAATTAGCGCCTCTTTTAACGTCCACAAACGATAAAAATCCACCATTAATTGCGGTGATGATTGCAAAAAATCCTGTTCTTTTTCATTGCAAACCTGCTTCATCAAAGCATGAAAATCGCGCGGCTGCATAAACTCCAAATCCGCACCAACTGCTCCTTTTTGTCCCAAAGCAATAAAAGCATGGTCATATTTATGACTCAAACACCACGGCGCCTCGGGATTTTTCTGCGTCACAAATGCTTTTAAAGCGCGCGAATTTTGCCAAGAAAGCGTGCGTTCACGCTGCGGCGCGTGTTGTAAACGTAATCGATCCGACGACGACAACTGCGCACGTTGATAAAAACGCGCGCAATCAGGCGTTGCAAAAAAAACCGAATAATCGAGCATCGTTTGCGGTTTCAAAACGCTTTATTCTTTCAGTTGTTTTAAAATTTTTTCATGTAACGCGCGTTCCGCTTCTGGAACGTCAACGGGCAACCACGCGCACGGCGTTTTTGCAAACGTTGCGGCGGTTTTTTGTTCGGCGCGGCGGGCGGACGCCATCATGGGGTGCGCAAACGATAAATCCGATTGTCCGCCGGTTAATTTTAAATACACTTCCGCTAATAATTGCGCATCTAATAAAGCGCCGTGCAATTCGCGTCCCGTATTATCCACGCCGAAACGTTTACATAACGCGTCCAAATTATTGCGCGCGCCGGCAAATTTTTCTCGGGCAATTTTCAAACTGTCTACCAAAGTAAATTTATCGCCCAAACGATAATCGCAACCCGCCAGCTGCAATTCCCGATCAATAAATTGTCGATCAAAGGCGATATTATGCGCCACCAGTTCATCGGCAGAAGCCAAATAAGCTTCAAATTCCGGCAAAACGTCTTTAAAAAACGGCTTATCTGCCAAAAATTCATCAGAAATGCCGTGCACTTTAAAAGCTTGAGGATCAACTGGTTGCTCGGGATTAAGATAACAATGGAAATAATGCTCGGTTGGAACGCGGTCAATTAATTCCACGCAGCCAATTTCAATAATGCGATGCCCTTCGGCTTTATTATCGCTTTCAAAATTCATGCCCGTGGTTTCGGTATCAAAAATAACGCTGCGAAAACTCATATCAATTCGTCATTTTTAAAAATTGTTCGATATTCATCGCTAATTGCGCGGCGATTTCGGCAGCTGTTTTTTGATTATCGGCAAAAGTATCTAAATAAATCTTTAATTTGGGTTCGGTGCCCGATGGACGCACGATCACGCGATGACCGTCCACCAATTCAAATACCAACATATTACTTGCCATTGCCGTTTTTAAATGATCGCAATAATGCATCACGCGATAATCGCCCACTTTTTCAAAAGGTTTTTCACGCAATACATTGATAATACGATTAATTTTTTCATTATCTTCAATACGTATACTGATTTGTTGGCTGTGAAAAGCGCCAAATTGCGCTT
GAAATGCTTGCATATAATCCAAAAACGTTTTGCCTTGCGCTTTTAAGGTTGCCATTAAATCCAAAAAAGCAATCGCGGCAGAAATACCATCTTTATCCGCAACTTTATCGGCATCAACGAGATAACCCAAAGCTTCTTCAAAACCAAAAATCAGCCCCTCAATTCTGCCAATCCATTTAAAACCGGTCGGCGTTTCTTGATACGTCATATCGTACGCGGCGGCGATTTTTTTCAATAACGGACTGGAAACCAATGAACAGGCAAAAACACCTTTTTTTCCTTGCGCCTGCGCGCGTTGCGCTAAATGCCACGCCAAATACAATCCCACCATATTGCCGTGCAAACGTTCCCAAACGCCTTCAGAACTCGGCAATGCTACCGCCAAACGATCAGCATCGGGATCGTTGGCAATAATAAATTCCGCGTTGACTTTTTTTGCCAACTCAATTGCCAAATCCAAAGCGCCTTTTTCTTCAGGATTGGGAAATGCTACCGTTGGGAAATTCGCATCAGGTTGCGCCTGCGTTTCCACCAATTGCGGTTGCGTTAATCCCACCGCGTGCAAAGTATCGAGCAAAGTTTCCGTGCCCACGCCGTGCATCGCGGTATAAACGTAATTCATTAAAACCGGTTTACTTTGAAACACCGCTGCCGTGCGTTCAATATATTGATCAATCACCGACATCGGAATCAGATGATAATGAATATCCCGCGGCAATTCGTTAACGCGTAAATGTTGCGCCACATAATCAATTTCCGCGGCAATATCGGCATCAGCTGGCGGCACAATTTGCGCCCCACGATGTTCACCGCCCAAATAAACTTTATAACCGTTATCTTCGGCAGGATTATGGCTCGCCGTCACCATCACACCAGCATCAGCTTGATAAACGCGCAAAGCAAAAGCCAAAACTGGCGTCGGACGTAAATCAGGCATTAAATAAGCTTCAATTCCTGATGCTTGCAAAATTTCTGCCGTATCTTTGGCAAAACGGCGCGAATGTTTGCGCCCGTCATAACCGATAACCACTTTTTTTCCGGCGCCCAAATAACGCGCCAAACCCGCTGCCGCCTGCGCCACTAATACACGATTCATGCCATTGGGTCCCGCCTGCAAACGTCCGCGCAAACCCGCCGTGCCAAATTGCAAGCGCCCTTCAAAACGGCGTTTTAATTCCGCTTCATCGTTGGCATCAATAAACGCTTGTAATTCGGCGCGCGTTTCCGAATCGGGATCTTGGGCAAGCCATGCTTGCGCCTGTTCAATTAAAGTAGACATAAATCCTCCAAAAAACGATTAAATTCATGATGATATATTTTCGGCGCGCACAATTTCGTACACACAATTTAACGGCGCCACAACGGCATCATCAATCGTCACTGCGGCAAGATAAGCGTTTGCCGCGCGTTGCCACGAATACTCATCGCGCGCGTGAATCATCACCAGCGGCGTATGTTCATCAACCGCGCTGCCTATCGGTAAAATTTGAGAAAAACCCACGCAGTAATCAATGCGATCATCGGTGCGCACTCTTCCGCCGCCCAGTTGAATAACAGCTAAACCCATTTCGCGACTGTTCATCGCCGTAATCGTTCCTGCATGCGGCGCGAATAATGGTTTTTGCACCGCTGCCTGCGGCAATAACCGCGCATAATGCTGACAAAAATCCTGCACACCACTTTGAGCGGCAATCATACGATCAAAACGATCTGCGGCGGCGCCGCTGTCTAATGCTTTTTGCACGATTTTTAGAGCGGCACCCTGATCGGGCGCTAATTGCGCGTGCCGTAACGCTTCCGCTGCTAACGTAAAAGTTACGTTTTTCAAACGCGGATTTTGATCGTCACCGCGCAGAAATTGTACCGCTTCTGCCACTTCAACCGCATTGCCAATACTTGCCGCCAACGGTTGATTCATATCGGTTAATAAAGCGCTGGTTTGGCACCCCGCCGCATTGGCAACGCGCACGATATTATGCGCGAGCGCGCGGGCTTGAGTG
ATATCGGTCATAAAAGCGCCGTTTCCCACTTTAATATCCATCACCAAACCATCGAGTCCTTCCGCTAATTTTTTGGACAATATCGAAGCGGTAATTAACGCGATCGATTCTACCGTGGCGCTGATATCGCGAACGGCATAAATGCGTTTATCCGCCGGCGCCAATTCATCAGTTTGCCCAACAATGGCGCAGCCGATATGTTGCACGATTTTTTGCATACGCGCCGCGCTCGGAAAAGCCTCAAAATGAGGAATGGATTCGAGTTTATCGATGGTGCCGCCGGTATGCCCTAATCCGCGACCGGCAATCATCGGCACATAAACACCGCAGGCGGCTAAAAGCGGCGCCAAAATCAGCGAAACGTTATCGCCGATGCCGCCGGTTGAATGTTTATCAACGATGGGCGTGGTTGTTTGCCAACGCAAACATTGCCCCGAATCGCGCATGCTTAAAGTTAACGCGGTTTGCTCGTCAATAGTCAATCCGTTGAAAAAAACTGCCATACAAAATGCAGCAATTTGCGCTTCAGAAATGCGGTTTTGACTGATTTCGTTGACAAAAAAACCGATTTCTTCTGAAGAAAGCGGTTGTTTATCCCTTTTTTTGCGGATAATTTCTTGGGGAAGATAAGTTGTCATCACAAAGTGGTCATTTACGCGGATAATTCCGTGTAAATATCAGATAATAAAGAAGATGATCCCAAACGCACGCGGTCAGGTTTAATCCATTCTTCACCGAAATGACGGCGAATGATCTCTAAATAAACCGCGGCATCGTTCATCGTGCGAATGCCGCCGGAAACTTTTATGCCCACGCGATCTTTGGCGTCTAGACGAATAATGGTGGCGCAAATGATTTCCACGGCTTCTGGCGTCGCGCCGACGGCAATTTTTCCGGTGGACGTTTTGACAAAATCACCACCGCCCAAAATGGCGATTTCAGTGGCTTTGGCAATCATGGCATCGTTTTCCAGCGCGCCGCTTTCAATAATAACTTTCAAATGCCCGCCGCCGCCGTGCGTTGCTTCGGCGACTTGACTCACCACGGTTAACGCGGTGGTATCTTCTTTTTGAAACAAGCTGTTATAGGGCAAAACCAGATCAATTTCGGTGGCACCATCGGCGAGCGCTTGTTGCGTATCTTGCAAAATGGTAGTTAATTCATGTTCGCCGGAAGGAAAGTTAACAACGGTCGCAACGGCAACGGAACTGCCATCGCAATCGTCTAACGTTTGCCGCGCGGTTTTAATATATTGCGGATAAACACAAACCGCGGCGACTTTGCCGTGAATGCTGACGGCTTGACGACAAAGACGCTCAATGTGTTCCGCCGTATCATCTTCATTGAGACTAGTTAAATCGATTAAAGAAAGTTGAAATTTTGCAGATTGAATCATACGTTTCACTATGACCATAAAACAGTAGCCGGTAAGTGTATCATGACGCACCGGAAGCGAATAGGGAGCGCTTGATGAGAAATTAGCGCGGGCGTTTAGTGGCGTCAATCAACCGATAAAACCACCGCATCAGCAGTTATCCATAAAAAAATCACGCCGTTTTCCGGCGCGATTTTATTTTACTAACACGTTTTTTAAACAGTTATGACGCCTCAACTGTCAACGCGGTTTGATGGGCAGTTTTCTGATTAAAATTGCCGGATTGGCGATGATAATGGCGCAATGCGTGTTCTTTACGGCGACCGCCGCTAAATGCTGATACGCGTTTTAAATACCCAATAACGCGTGTGCCGTAATCAATATCATGTGAACCGCAAGCGCTGCACGCCGCTAAAGTGCGTTTATCGATGGTGCCGCAGTGATTGCAAATGGTAATGCGAACGTTGATACAAAAATAATTGCAACCGGTTTTGGCAGCGATATTAAGCATTTTGCGGTAGCCTTCCGTGCTCAATGCTTCTTCCAAATTTAAATGCAACGCCGAACCGCCATCTAAATAATCGACCAATTCGCGTCCGTGCAATAAAAATTTATCCAAAGCATT
AATTTCTTCATTTTCAACGGGATAAAAATAAGAATTGTAGCAATCGCGTTTAACTTGATAACCGTCTGCTTTATCCCATTTTGCATTTTTAACGCCCAAATTTTCCGCCGGCACGAATTCCGTATTAAATTTAACGCGATAATGTTTAGAAGCTTCTTGATTTAATTGATAAATGGTTTTGAGGCGATTTTGAACAAATTCAATATATTCCGGATTATATCCCACCTCAATGCCGTGATATTCAGCCGCTTCCACCATGCCGTTAATGCCAATCGTTAAAAACTGTTTATCCAAAGTAATAAAACCAGCATCGTAAACCGGCAACATACCGGCAGCTTGATATTCTTCCATCAATTTGCGGTAAGCGTATTGATATTGATGGATTTTTTTAACTTCGGTTGCCAAATCGCGTCCGTCTTGTTCCAAACGGTTCATATTTAACGTAATGACATTAATTGAACCGGTCGCCACGCCGCCCGCGCCCAAAGTATAAGAAAAGGTGCGATCTTCAATGGCGTTACGCAAACGGCAACAAGACGCAAGCGAATCGGGGTTATCTGATAAATACATAAAAAATGAGTTGCCGTGCGCGAGCTCATCTGCCATTTGTTGCGCAAATGCTTCATCTTTACATTGACCATTTTGCGTTAACATCGCCGCGGTCACCACGGGAAAAGTTAAAACGGATTTGCAGCGCTCTTGATTAAACCAATCCATAAAGAAATATTGCAATTGGCTCACAGTCTGCCAATTAGGGCGCGAAAAATCCGGAAAAACAAAGTCGCCAAACATCGCCTCAAAATAATAGCGATCGTAAATAGAAATATTCCAAAACGCGCTTTGATAACCGCGCGCGGCAGCGGGCTGGTTGATGCTGTAGACGACTTGTTGCAGATGATTGGCGATTTCGTGGCGATGCGTATTTAAATAATCGTCACCAAAATCTTTGCGCGCGAAATAGTCGAAATAAGTTAAAAATTCGACGGTAGCAACGGCGCCGGCAAATTGCGCGCTGATGGCAAAAACTAAATTAATAAAAGAACCGCAAAAAGAAGCCAAATGATGCGGCGCTTTCGATTCACCGCCTAAAGATGTTAAACCGTCGCGCAAAAAGGGATATAAAGTTGCCGATACGCAATAAGGTTTTAAGCTGGTTTCATCGTGAACGTAGATTTCATGCGATTCAATTTGGCGCAAATATTCTTCGGCGGTTTCTTGTCCAAATAATTCGGCGATTTTTTGGCTAACTTGTCCGCGGTTAATTTGCACCAAAAAATCTTTTAATAATTCGGTTTCCATGGTGGCAATATTTTTTTGCGTGACGTTAGCGTTAGCGTCTACTTTTGAACCGTCTGCCGCATTTTGAGCGGCAATATAATCGCGTATAAATTGAATTTTGCCGTCAATTTGTTGAGACGATAACCGAATCAT
Protein sequences of DBSCAN-SWA_6 >NC_009446|939847:953204|941014_942490_-|WP_049752499.1|DBSCAN-SWA MIHPLIAKIISDAKQTMRISAFRFSLLFTLVASIITGAAMVMIYQTARQQIFDQIDARLLSESMRLQRRYEYHSSFDINPLATGITERVQTEPAMSYCLQEYPSFDEEMPVMIMTNSSFDDLCNLKRHMQSQNYLNNEEHEPLVNRKNIMRVIVMPLRNQYAVIVGYDTRSERQILHKMNTMAMLMTAVLILASFFGSFMIGQKITKSIARISRTARVIVDGDFSERILIKGNDAIELQLLAEDLNHMLDRIERLITSQRQVTNNIAHDLRSPLNRLRSRMEVALLDEHRSNAELREVISQSIVDAQGLLQTFNSLLNIAQVESRARDDFELISLSQISEDMAEMYEVLNEEEEGMHHFCAQIDADLMIMGNRQLTAQAITNLLDNAVKYTPEGGHIFLQALRREHTVVVAVCDDGKGIAEQDREKVLKRFVRLDAARSTQGNGLGLSLVAAIMSLHGGTIQLLENKPTGLRIELSFPDPEAFKSRYHNAA >NC_009446|939847:953204|946051_946711_-|WP_012031221.1|DBSCAN-SWA MLDYSVFFATPDCARFYQRAQLSSSDRLRLQHAPQRERTLSWQNSRALKAFVTQKNPEAPWCLSHKYDHAFIALGQKGAVGADLEFMQPRDFHALMKQVCNEKEQDFLQSSPQLMVDFYRLWTLKEALIKAEDLRFPTQMKAVGVAEFAPIWRLKSALNRHYFWLTAVFNRDWIFSALWQECVRDPVCVRFFFENPPVFQAIYSNAPLSLLGQTLTLAP >NC_009446|939847:953204|939847_940975_-|WP_012031215.1|DBSCAN-SWA MLFWKKCCFVTVFLLFFGVRAEVPPPDLPVKSYLLMDAQSGQILAERAADERLPPASITKLMTAYVTFNALKTGKIHLTDLVTISANARAQNGSRMFVEVNSQVSVNDLLQGMIVQSGNDASVALAEYIGGTVAGFVHMMNATAKQLDLQKTHYQNPTGMPDEQHFSTAHDIARLAQALIRDFPEFYHFYQQKEFTWNNIKQPNRNRLLWRNQNVDGLKTGYTDAAGYCLAASEKRGDWRLIGVVLGAEKEAQRYDAAENLLNFGFANYTQVTALRAEQVLLRGKVYKGLEAEVPIVAERTVHLLLPVTAQNAISAHVALDTLIAPITKGQKVGTITVTDGSKSYATVPALAGKDIAAAGFFKRLKDSFTLWRQK >NC_009446|939847:953204|943230_944667_+|WP_119185628.1|protease|DBSCAN-SWA MVLILSEIGVMMNKTLVTLIFAINLLFSSLIHADLPPNFTELVEKASPAVVSIEVQSLVTQPKFSRLTPPFPDLFEHFFGEQGSPFSEPFPEEQPEKELRKGNGSGFIIDAEGYVLTNAHVIDGADSVSVLLTDQREYSAEIVGVDKRTDIALLKIAAQKLPTVQLGDSDAVKVGDWVLAIGSPFGFDTTATKGIVSALGRSLPSGTYTPFIQTDAAINPGNSGGPLFNGKGEVIGITSQIYTRSGAFNGVGFAIPINLAKTIAEQLKTTGSVNRGWLGVSIQAVDQKLAESFGMEKPEGALIAQIVKDAPAEKAQLKVGDILLSFNGHTINKASDLPPLVAMAPLGKDVEIEYLRDGKKQTTTVKIENLETADTSSAATSREMRNWGIELKALDDDTRNALEYEDKEGVLIARVEPNSAAAKSGLRAGDILIAVGDSIINTPKEASKLLAKTDRALPVLIYRRGSTIFLPLMPEKKK >NC_009446|939847:953204|942467_943157_-|WP_012031217.1|DBSCAN-SWA 
MHILVIEDDQDAAAYIAKGLTESGYVVDTAHTGSKGLMIAQKGQYDVLIVDRMLPELDGLSLIQQLRAQGASTPVLILSALGEVDDRVEGLAAGGDDYLVKPYAFSELLARVQSLARRRNTDVHDQMVLEIADLHLDRLRRKVTRAGKKINLQPREFRLLEYLMSNSGQVVTRTMLLEKVWEYNFDPQTNVIDVHISRLRSKIDKDFNPPLIHTERGAGYSMYDPSTYR >NC_009446|939847:953204|950425_951169_-|WP_012031225.1|DBSCAN-SWA MIQSAKFQLSLIDLTSLNEDDTAEHIERLCRQAVSIHGKVAAVCVYPQYIKTARQTLDDCDGSSVAVATVVNFPSGEHELTTILQDTQQALADGATEIDLVLPYNSLFQKEDTTALTVVSQVAEATHGGGGHLKVIIESGALENDAMIAKATEIAILGGGDFVKTSTGKIAVGATPEAVEIICATIIRLDAKDRVGIKVSGGIRTMNDAAVYLEIIRRHFGEEWIKPDRVRLGSSSLLSDIYTELSA >NC_009446|939847:953204|951371_953204_-|WP_012031226.1|DBSCAN-SWA MIRLSSQQIDGKIQFIRDYIAAQNAADGSKVDANANVTQKNIATMETELLKDFLVQINRGQVSQKIAELFGQETAEEYLRQIESHEIYVHDETSLKPYCVSATLYPFLRDGLTSLGGESKAPHHLASFCGSFINLVFAISAQFAGAVATVEFLTYFDYFARKDFGDDYLNTHRHEIANHLQQVVYSINQPAAARGYQSAFWNISIYDRYYFEAMFGDFVFPDFSRPNWQTVSQLQYFFMDWFNQERCKSVLTFPVVTAAMLTQNGQCKDEAFAQQMADELAHGNSFFMYLSDNPDSLASCCRLRNAIEDRTFSYTLGAGGVATGSINVITLNMNRLEQDGRDLATEVKKIHQYQYAYRKLMEEYQAAGMLPVYDAGFITLDKQFLTIGINGMVEAAEYHGIEVGYNPEYIEFVQNRLKTIYQLNQEASKHYRVKFNTEFVPAENLGVKNAKWDKADGYQVKRDCYNSYFYPVENEEINALDKFLLHGRELVDYLDGGSALHLNLEEALSTEGYRKMLNIAAKTGCNYFCINVRITICNHCGTIDKRTLAACSACGSHDIDYGTRVIGYLKRVSAFSGGRRKEHALRHYHRQSGNFNQKTAHQTALTVEAS >NC_009446|939847:953204|944722_945412_-|WP_012031219.1|DBSCAN-SWA MKKRILLVEDEPAIQQLLLYGLQDQYQCIEAGTLAQAKTILQQQIPDLVLLDWMLPDGSGIKILEDLRKIETLRYLPVIMLTARADERDIIKGLDYGADDYLTKPFSMAELQSRIKALFRRMPESEEEQSCLSWEDITMNLLTYEVFFQDKLLKLNRREFQLLECFLRHPNKVLSREQLLDFAWQGEEDIGDRAVDVAIRRLRKAFENIDEQLPVMSIRGIGYRLGQKN >NC_009446|939847:953204|947435_949052_-|WP_012031223.1|DBSCAN-SWA 
MSTLIEQAQAWLAQDPDSETRAELQAFIDANDEAELKRRFEGRLQFGTAGLRGRLQAGPNGMNRVLVAQAAAGLARYLGAGKKVVIGYDGRKHSRRFAKDTAEILQASGIEAYLMPDLRPTPVLAFALRVYQADAGVMVTASHNPAEDNGYKVYLGGEHRGAQIVPPADADIAAEIDYVAQHLRVNELPRDIHYHLIPMSVIDQYIERTAAVFQSKPVLMNYVYTAMHGVGTETLLDTLHAVGLTQPQLVETQAQPDANFPTVAFPNPEEKGALDLAIELAKKVNAEFIIANDPDADRLAVALPSSEGVWERLHGNMVGLYLAWHLAQRAQAQGKKGVFACSLVSSPLLKKIAAAYDMTYQETPTGFKWIGRIEGLIFGFEEALGYLVDADKVADKDGISAAIAFLDLMATLKAQGKTFLDYMQAFQAQFGAFHSQQISIRIEDNEKINRIINVLREKPFEKVGDYRVMHYCDHLKTAMASNMLVFELVDGHRVIVRPSGTEPKLKIYLDTFADNQKTAAEIAAQLAMNIEQFLKMTN >NC_009446|939847:953204|945460_946003_-|WP_012031220.1|DBSCAN-SWA MDKKNNLVWIDLEMTGLDPQHDEIIEIATIVTDAQLNILAEGPVIAVYQPEPILAQMDAWNQKHHGASGLIERVRQSSFNTLEAEAQTLSFLEQYSEKGRSPICGNSICQDRRFLSRLMPRLEAFFHYRNLDVSTIKELVARWYPEHYFHKNTTHQALQDIRDSIDELRHYRQRIFVNPS >NC_009446|939847:953204|949076_950411_-|WP_012031224.1|DBSCAN-SWA MTTYLPQEIIRKKRDKQPLSSEEIGFFVNEISQNRISEAQIAAFCMAVFFNGLTIDEQTALTLSMRDSGQCLRWQTTTPIVDKHSTGGIGDNVSLILAPLLAACGVYVPMIAGRGLGHTGGTIDKLESIPHFEAFPSAARMQKIVQHIGCAIVGQTDELAPADKRIYAVRDISATVESIALITASILSKKLAEGLDGLVMDIKVGNGAFMTDITQARALAHNIVRVANAAGCQTSALLTDMNQPLAASIGNAVEVAEAVQFLRGDDQNPRLKNVTFTLAAEALRHAQLAPDQGAALKIVQKALDSGAAADRFDRMIAAQSGVQDFCQHYARLLPQAAVQKPLFAPHAGTITAMNSREMGLAVIQLGGGRVRTDDRIDYCVGFSQILPIGSAVDEHTPLVMIHARDEYSWQRAANAYLAAVTIDDAVVAPLNCVYEIVRAENISS >NC_009446|939847:953204|946732_947434_-|WP_012031222.1|DBSCAN-SWA MSFRSVIFDTETTGMNFESDNKAEGHRIIEIGCVELIDRVPTEHYFHCYLNPEQPVDPQAFKVHGISDEFLADKPFFKDVLPEFEAYLASADELVAHNIAFDRQFIDRELQLAGCDYRLGDKFTLVDSLKIAREKFAGARNNLDALCKRFGVDNTGRELHGALLDAQLLAEVYLKLTGGQSDLSFAHPMMASARRAEQKTAATFAKTPCAWLPVDVPEAERALHEKILKQLKE |
12 | Bacillus_phage(20.0%) | protease | NA |
Homologous phage analysis in the prophage region
Colored proteins are bacterial proteins whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
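The keyword-based highlighting described above can be sketched roughly as follows. This is a minimal illustration, not the database's actual implementation: the keyword list is copied from the description, but the matching rule (case-insensitive substring search) and the function names `phage_keywords_in` and `is_highlighted` are assumptions, and the example annotations in the usage note are made up.

```python
from typing import List

# Keywords copied from the page's description. How the site really matches
# them is not stated; a case-insensitive substring test is assumed here.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keywords_in(annotation: str) -> List[str]:
    """Return the phage-related keywords found in a protein annotation.

    Hypothetical helper: keywords are reported in the order they appear
    in PHAGE_KEYWORDS, not in the order they occur in the annotation.
    """
    lowered = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in lowered]


def is_highlighted(annotation: str) -> bool:
    """True if the annotation would be colored under the assumed rule."""
    return bool(phage_keywords_in(annotation))
```

For example, `phage_keywords_in("Phage tail fiber protein")` yields `['tail', 'fiber']`, while an annotation with no keyword yields an empty list. A plain substring test also matches longer words such as "lysine", so a stricter rule may be needed in practice.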
DBSCAN-SWA_7 |
993315 : 1000410
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NC_009446|993315:1000410|DBSCAN-SWA ATCATATCGATTCCTTTTTATCTAAAACACAAATACTGTCGCGCATAATTTTACCGTGAAAATCGGTCATCAAAGTACGCACGACGGGATTTTGAAGCAATTTTTGCTTTGCCATTTGATGCATCTGATCGGATTCTTGCTGACGTAACATCGCGGACGTTGCGATCGTATCGTCAAACGTAATGCTCACCGCGATTTTTCTGGCACACGATGCGCTTAAAATTCGCGTTAAATCCGCCAAACGTTGCTCATTTACCCACGATGCCGCAGCGGATAAAACTGCCAACTGCAAGCTATCATTTTGAAAAGAAACAGGAATACTTTGTTCCACAAGCGACTGCAAAAAAACATCTTCCACGGGCAAAGTATTTAAAAACTGCGACCACAATTGCAATTGATTAACCGCTTGCTCCAATTGAATAACGGATTTATTTTCTGACGATTTCAGCGGCACGGTATTCATTTCTAATTCATCGTTATGCCACGGCGGTAAATCCACATCATTATTATCCGGCACATCATTTTCTGACGATTCCAGCGGCGCGGTATTTATCATTTCTAATTCATCATCACGCCACGGCGGCATATCCGCATCATTGTCATTCGGCACATCATTTTCTGACGCTTCCGCCGGCGCGGTATTCATCATTTCTAATTCATCATCACGCCACGGCGGCAAATCGACACCGTTATTATCCGGCGCATCATTTTCTAACGGTTCTGCCGGCGCGGTGTTTATCATTTCTGATTCATCATCATGCCACGGCGGTAAATCCACATCATTATTATCCGGTACATCATTTTCTGACGATTCCAGCGGCGCGGTATTCATTTCTGATTCATCATCACGCCACGGCGGTAAATCCACATCGTTATCCGGTTCTGGAATGATGTGTTTTTCGGATATCGTCGTTTCTGGAAGCAACGGCTGCAATGCAATCATGCGTAAAAACGTCATTTCCAACGCCACGCGCATATCGGGCTGATAAGGCATACTTTGCCAAGCATCGCTGATAATTTGATACCACAATTGCACCAATTCCACAGGCAATTGTTGCGCTAAACGCACCAAGGTTTGATCAATTTCATTTGGGTTTTGCTGCGCTTGCAATTGCGCGACGGTAATTTGTTGTAATCCTTGAAAAACATGCCGTAAAACGTCCATATAATCGGGCGAAAAAGCATCTAATTCTGTCAAAATCGCCCGCACCGCCCGCGCATCGCTTTCGGCAATTTTCGTTAACAAACGATAAATCTGCGCCGTGGGCACCGCGCCCAATAAATAAGCCACATCATCGGTTTTCACTACGCCTTGCCCGTAAGCGATGGCTTGATCCAAAAAGCTCAAGCCGTCCCGCATACTGCCGCGCGCTGCCTGCGCTAAAAGCACCAATGCTTCTGCTTCATAATCAATGCGTTCTACTTGCAAAATATGCGCTAAATGCTGCCGCAATTGTTCAGGATTAATATTTTTGAGATGAAATTGTAAACACCGCGATAAAACGGTTGTCGGCACTTTTTGAATATCTGTGGTAGCAAGAATAAATTTAACGTGCGGCGGCGGTTCTTCCAAAGTTTTTAACAATGCGTTAAAACTTTCCCGCGTAAACATATGCACTTCATCAATCAAATAAACCTTATAACGTCCTTTAACGGGCGCATAAGGCAGGTTATCGAGCAACTCTTTAGTTTCATCGATTCCGCGCCGAGAAGCGGCATCTACTTCAATTAAATCAGGAAAACGACCTTGATCAATTTCTAAACAATGTTCACATTGTCCGCAAGGTTGAGCGCTGGTGCCGTTAATTAAACAATTAAGCGATTTGGCATAAATGCGCGCGAGAGAGGTTTTACCGACGCCGCGCGTACCGGTAAAAAGATAGGCGTGATGAAGGCGATCATGATCTAAGGCATAACTTAGCGCTT
GCGTAATATGCTGCTGCCCTACGAGTTCACGAAATGTTTTGGGTCGCCATTTTCGTGCTAAAACCTGATACATAAATCGTTACCCTCAAAAACAAAGAGGTCTGTACCAGCACCCTTCGGCACCCAAATCACCGCTACCGCTGCTTCCTTCCGGACCTGACGGGGTTGACGGCTTATTGTCGCGAAGAACCAATACAGACCAGTGCGCGATTATAAAGTAAAAAAAGCAAAAAGCAATCATCAATTAAAAAGAATAGATAATATTTTGAAACCAAAAAACTAATGGCACACCAAACTCACCATAATGCCCGTATTACTCAAAATCGCATCGCACGGCTTATCGGCTCCTGATGGATAATGCGTTTCGCACCAACTGCCATCTAAAGCCTGCCCGTTGATTTGGCACACCCGCGGACTATCCCCAAATTTCAATATTCCCGACAATTTATTAAAACTATATTTTTTCGCCGGCACGTAAGTTGATAATTCCGTTTTTTGCGCTTGTTCTGCGGCTTCTTTTTCCGCTTGCGCCGTACGAACGGCTTGCAACTGCGCCAAACGCATTTGCTCCTGTTGCATTCTTTCCAATTCCATTTGCTTTTGTTGCGCTTGTAATAACGCTTGCTGGCGCCGTTTTTGTTCTTCTTGCTGTGCTTTTTGCTTTTCGGCGATCAATTGCGCGGCTTGCGCGCGCTCCGTTCTTTCGGCGGCGGCTGCTTTTGCTGCCACCAATTGCGCTTGTTGCAATGCGGCTTGTGCTTTTTGTTCTGCCAACAAATTGCGCGACGTTTGCCGCAATGGAATTTTTTGTGACGCGGGCGCGTCTAACATATCGCGCAACGTATTTTTAACATCTTCATGATGAATCGTATTTTTGGACTCAACAAAATTCATCTGTGCAACGTGTTTTTGCAATTGCGCAGCAACAAGACGCTCTCGTTCTAAAGCTTTTTGTTGCTCGCGTTCTTCAAACTGTTTGCGGGCGATTTCTTGTTGCCGCAAACGTTCCGCCGCTGCTTTTTTTTGTTGTTCTCGTTGTTTTTTTGCAATTAACTGATTTTTTTCAATTTGTTTTTTATTATTTTCGTTTTCCGTCAACATAATTTGATAAATGAGCCGGTCATCGATAAAGCTATAAGTTAACCACGGCAATTGATATTGTTTTCGCAGCGCGATTAACGCCTCTTGCGTTTTTTTACCAATAATACCGTCCGCGCCGCCGGCAGCATAACCAAAGTGATTTAAACGCTGCTGAACTTTAGCCGTTAAACGGCGGTTAAAAATCCGCAGCCAATCCGCCGTGATTTTGCCGGTAGTTTTTTGTTTTTGTGATTTCTGAAACGCTTTGATGCCCGATTGCGTATCTTGATTATAAACGCCATCAATAACACCGGCGTCAAAATCCAAAAAGCGCAATCCCGTTTGAATTAAACGAATTAAAGCGCTGGAATATTCGCCTTCTTTGGCTACTTTTTGTTTGACTAACCATTGCTGCAATTCGCTCGCACCGCCCATTGCCGCTAAATCATAAAGCGTTAAACCATATTCATCGCTCACATCAATCGCGCATTGATTTGCCAGATAACGGGCTACAACATTTTGCTCGCCACGATAAACTGCCGCAAATAACGGCTCTTTATTTTGGCAATCGAACGCAAAAGCTAAAGTATGAATGGAAAATAATCCGAAAAATAACACTCTTGAAATGGTGCTCATGAATTGGCTCACTTAAATAAAGAAACGCGATCGCGCAAGAAAAACGCCTGTAAAGCTAATAATAACGTTGCATTACGAATTACCCCTTCTTGTAACCAACGTTGCGCTTGTTGCCGAGAAACCCGCATCACACGAATATCTTCACCTTCTTCATGCAATCCGGTTAGCGGCTGAATTTTATTCACATCGACGCGTGCAAAATATAAATGAGTCACGGCGCCACTTCCGCCGGGACTCGCAAAATAACGCATTACCGGCAGCAACTCTTCTGCCGGCGTGCCAAT
TTCTTCTTGTAATTCGCGCTGAATACACTGTTCTGGCGATTCATTTGCCTTATCCATAAAACCTGCCACAATCTCCACGCTCCAAGGCTTTTCACCAGCTGCCAGCAAACCAATTCTAAATTGTTCAATAAAAACAAACTCATCCGTATTAGGATCATAAGGAAGCGCAGCAACAACGCCGCCGGTGCCGAGACATTCGCGCGCCAATGCACCGCTCCACCCACCGCTAAAATATTGAAAATCAACCAAATAACGCGTTATTTTAAAAAAACCGTGATAAACTTCTTCACAATTCAAAATACGGTATTTATATTCCACATCAATACCTTGGTGAAGTTCCGATATAAATATGTGGATTAACGGGTTTACCTTGTTTTCTGGTTTCAAAATGCAACATTGGCGTACTATTTGCCGCAATCCCCATAGTGCCAATTGCTTGACCGGCGGCAATTGTTTGTCCTTCGCGCACTAAAATTTCATCTAAAAAACCATAAGCTGACAAAATTCTTCCGGGGTGTTGAATAATCACCATGCGCCCAAAACCAGATAATCCTGTTCCGGTATACGCCACTTCTCCAGACGCCGCAGCGACGATTTTTTGTCCGCGCTGACCACTGATGCGGATACCTTGTTTACCCGGCGCGTTAGAAGAGAAATTTTGCGCCAAATCTCCTCGTGTCGGCCAAATCCAACCGCTCATACCGGTAGCGGTAACTTCTGGAACGATTATGCGCTCGCGCATTACGCCGATCGGCGGAATCGTTTGCAAAGCTTCTCCAGCTAATATACGGTTTTTATTAGTAATATTATTCCAACGCGCTAATTCATCAATATCCAAACCATAGCGCCACGCGATGCCGAATAAAGTATCGCCTTTTTGAATCATATAGGGCGCACCGGGTTTTAAGGCGCCGCGTTGTCCGTGTAATTCATACGGCGAGGGGTAAGAATGGCAACCAGCAATAATTACTGCTGCTGTCAGTAAAAAACCAATCCATAACTGCTTTACATTTTTTTTCATCTCATAGGCTCCAACCAATGTTTCACCGCTTCCAGTTGATCATCATGCGTCACATCAAACTGCAATGGCGTAATCGACACAAATCCCTGTTCAATGGCAGCAAAATCGCTTCCCTCATCAGCTAAAAACCCCCCTTTATTCGCCCCAATCCAATAACATTCCTCTTGCCGCGGATTGATCATTTTTTCCAGCGGACGCTCCTGACGACATTGTCCCAATCGCGTCACGCGTATCCCCCGAATTTCTGCCCGCGGCAAATCCGGAATATTGATATTCAACAGCGTAGCGCCGGTTAATGGATTTTTTTTAAAAAAGGAGAATAAATCTAACACAATCTGCGCAGTATCCGCGAGATGTTTGGGACGATGGGCAACATTGGAAATTGCCAACGCTGGAAATTTTAAATACCGCCCTTCAAAAGCCGCCGCAACCGTGCCCGAATAAAGCACATCATCGCCCAAATTTGCTCCACAATTAATGCCCGAAATCACCATATCTGGCACTTCATCAAAATAACCACCCACCGCCACGCGCACACAATCTGCCGGCGTACCATTTACGCTATAAATCGCGTTGCCGTGAGTTTGCACGGTTAAAGGTCGTGTTAATGTTAGAGCATGACTAACGCCGCTGCAGTTGCTTTCCGGCGCCATCACGATCAAGCGCTCAACGGCGCCCTCCATCACTTCCACCAATGCGCGCATTCCCGCCGATGCATAGCCATCATCGTTAGATAGCAGCAAAAACATAATCGCTCCTTTGATCAAAATTGAGGTTAGAGTGTACCGAAAGGTTGCTAACTTTCAACAAATTTATCATCATAAACCGCCCCGCGATTGCTCCCATAACGAAATTCGTTTTTTTGCTGACGGTTTGAGTGTAAAGGAAGATAATGTCTCGTCAACATATCGTATTGTTTATATTATTCTTTTTAATTTTCATCAGCGCCATTATCATCATAAAAGTGCATTCTGCGCCC
CAATATCGCACCCGTTCGGAGCAAAAAGGCGCGCTCGGCGAAGCAAAAGTTTCCGCACTCAATCAACAACTTAACCGCCATTATCAAGCGCTTGATGACATATTAATGCCGTTAAAAAATGGTAAAACCACCCAAATCGACCACATCATTTTTTCCCCTTTTGGGATTTTCGTTATTGAAACCAAAAACATGAGCGGTTGGATTTTTGGTGACGAAAACGCAGCTTATTGGACGCAAGTTTTGCCCAACGGACGCCAATTTTCGTTTCAAAATCCACTACATCAAAACCGCAAACATTGCCGTGTTATCTGCGAAAGTCTACATTTACCGGCTATCAATATTATTTCGGTCATCGTTTTTATTGGCGATTGCAGTTTTAAAACGCCGATGCCAAGCAACGTATGTTTGGGGGGGAAAGCTTATTTGCGCTATATAAAACGCCGAAAGAAAATCCGTATTCCCCAAAAATATATCATGCCGATGATGCGGCATATTAACGCGGTACGTTTAAAAAATACGGAAAAAAACCGGCGCCGTCACATTCAAAACCTGCAATCATCTTGACCATTATTTTACGGTCACAATACTGACATTGGTCACGCCTGAATGCAGCAATCCAATTTTTTGTGCGGCGGCTTGAGAAAGATCGAGCACACGATTGCTATGAAAAGGACCGCGGTCGTTGATTTTTACCACTACACTTTTACCATTTTTTAAATTTGTCACTCTGATTTTAGTGCCCAAAGGTAAACGTTTGTGGGCAGCAGTCAGTTCATTTTGATTAAAAATTTCACCACTTGCCGTTTTTTTACCGTGAAAACCCGGACCATACCAAGAAGCGGTTCCTTCTTGTTTAAAATCATTAGCATGCGCTAAAGTTTGATATGTTTTTCCGCGGACGGTGTATTCATAAGATGTAGCTTCTGCTATCGCAGCGCTTTTTTCCACCGCGTTTTTTTGATTATCATCAAGCAAACTAAGGCGCGAACGCTCATATTCAGATTTTGTAAAATCCAAATATTTCACGATTTTGATGGGTTGCTCACGATCCTTTGCTTGTTTATCAGATTCAGATTCAGAAAAATCTTTCTCTTGAGGCGCTGATATTGTTTGGCTAAAAGAAGGCAGACTCGCCAAAAGCACGATTAGCGCGATATTTCGTGTAATCAT
Protein sequences of DBSCAN-SWA_7 >NC_009446|993315:1000410|997559_998261_-|WP_012031271.1|DBSCAN-SWA MKKNVKQLWIGFLLTAAVIIAGCHSYPSPYELHGQRGALKPGAPYMIQKGDTLFGIAWRYGLDIDELARWNNITNKNRILAGEALQTIPPIGVMRERIIVPEVTATGMSGWIWPTRGDLAQNFSSNAPGKQGIRISGQRGQKIVAAASGEVAYTGTGLSGFGRMVIIQHPGRILSAYGFLDEILVREGQTIAAGQAIGTMGIAANSTPMLHFETRKQGKPVNPHIYIGTSPRY >NC_009446|993315:1000410|999807_1000410_-|WP_012031274.1|DBSCAN-SWA MITRNIALIVLLASLPSFSQTISAPQEKDFSESESDKQAKDREQPIKIVKYLDFTKSEYERSRLSLLDDNQKNAVEKSAAIAEATSYEYTVRGKTYQTLAHANDFKQEGTASWYGPGFHGKKTASGEIFNQNELTAAHKRLPLGTKIRVTNLKNGKSVVVKINDRGPFHSNRVLDLSQAAAQKIGLLHSGVTNVSIVTVK >NC_009446|993315:1000410|998257_999010_-|WP_012031272.1|DBSCAN-SWA MFLLLSNDDGYASAGMRALVEVMEGAVERLIVMAPESNCSGVSHALTLTRPLTVQTHGNAIYSVNGTPADCVRVAVGGYFDEVPDMVISGINCGANLGDDVLYSGTVAAAFEGRYLKFPALAISNVAHRPKHLADTAQIVLDLFSFFKKNPLTGATLLNINIPDLPRAEIRGIRVTRLGQCRQERPLEKMINPRQEECYWIGANKGGFLADEGSDFAAIEQGFVSITPLQFDVTHDDQLEAVKHWLEPMR >NC_009446|993315:1000410|993315_995313_-|WP_012031268.1|DBSCAN-SWA MYQVLARKWRPKTFRELVGQQHITQALSYALDHDRLHHAYLFTGTRGVGKTSLARIYAKSLNCLINGTSAQPCGQCEHCLEIDQGRFPDLIEVDAASRRGIDETKELLDNLPYAPVKGRYKVYLIDEVHMFTRESFNALLKTLEEPPPHVKFILATTDIQKVPTTVLSRCLQFHLKNINPEQLRQHLAHILQVERIDYEAEALVLLAQAARGSMRDGLSFLDQAIAYGQGVVKTDDVAYLLGAVPTAQIYRLLTKIAESDARAVRAILTELDAFSPDYMDVLRHVFQGLQQITVAQLQAQQNPNEIDQTLVRLAQQLPVELVQLWYQIISDAWQSMPYQPDMRVALEMTFLRMIALQPLLPETTISEKHIIPEPDNDVDLPPWRDDESEMNTAPLESSENDVPDNNDVDLPPWHDDESEMINTAPAEPLENDAPDNNGVDLPPWRDDELEMMNTAPAEASENDVPNDNDADMPPWRDDELEMINTAPLESSENDVPDNNDVDLPPWHNDELEMNTVPLKSSENKSVIQLEQAVNQLQLWSQFLNTLPVEDVFLQSLVEQSIPVSFQNDSLQLAVLSAAASWVNEQRLADLTRILSASCARKIAVSITFDDTIATSAMLRQQESDQMHQMAKQKLLQNPVVRTLMTDFHGKIMRDSICVLDKKESI >NC_009446|993315:1000410|996964_997558_-|WP_012031270.1|DBSCAN-SWA MEYKYRILNCEEVYHGFFKITRYLVDFQYFSGGWSGALARECLGTGGVVAALPYDPNTDEFVFIEQFRIGLLAAGEKPWSVEIVAGFMDKANESPEQCIQRELQEEIGTPAEELLPVMRYFASPGGSGAVTHLYFARVDVNKIQPLTGLHEEGEDIRVMRVSRQQAQRWLQEGVIRNATLLLALQAFFLRDRVSLFK >NC_009446|993315:1000410|999153_999804_+|WP_012031273.1|DBSCAN-SWA 
MSRQHIVLFILFFLIFISAIIIIKVHSAPQYRTRSEQKGALGEAKVSALNQQLNRHYQALDDILMPLKNGKTTQIDHIIFSPFGIFVIETKNMSGWIFGDENAAYWTQVLPNGRQFSFQNPLHQNRKHCRVICESLHLPAINIISVIVFIGDCSFKTPMPSNVCLGGKAYLRYIKRRKKIRIPQKYIMPMMRHINAVRLKNTEKNRRRHIQNLQSS >NC_009446|993315:1000410|995519_996956_-|WP_012031269.1|DBSCAN-SWA MSTISRVLFFGLFSIHTLAFAFDCQNKEPLFAAVYRGEQNVVARYLANQCAIDVSDEYGLTLYDLAAMGGASELQQWLVKQKVAKEGEYSSALIRLIQTGLRFLDFDAGVIDGVYNQDTQSGIKAFQKSQKQKTTGKITADWLRIFNRRLTAKVQQRLNHFGYAAGGADGIIGKKTQEALIALRKQYQLPWLTYSFIDDRLIYQIMLTENENNKKQIEKNQLIAKKQREQQKKAAAERLRQQEIARKQFEEREQQKALERERLVAAQLQKHVAQMNFVESKNTIHHEDVKNTLRDMLDAPASQKIPLRQTSRNLLAEQKAQAALQQAQLVAAKAAAAERTERAQAAQLIAEKQKAQQEEQKRRQQALLQAQQKQMELERMQQEQMRLAQLQAVRTAQAEKEAAEQAQKTELSTYVPAKKYSFNKLSGILKFGDSPRVCQINGQALDGSWCETHYPSGADKPCDAILSNTGIMVSLVCH |
7 | Bacteriophage(16.67%) | NA | NA |
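The protein records above use pipe-delimited FASTA headers such as `>NC_009446|993315:1000410|997559_998261_-|WP_012031271.1|DBSCAN-SWA`. A minimal sketch for splitting such a header into fields is shown below. The field meanings (contig, prophage region start:end, gene start_end_strand, protein accession, prediction tool) are inferred from the records on this page and are an assumption, as are the names `ProteinHeader` and `parse_protein_header`.

```python
from typing import NamedTuple


class ProteinHeader(NamedTuple):
    """Fields of a protein FASTA header (meanings inferred, not documented)."""
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    accession: str
    tool: str


def parse_protein_header(header: str) -> ProteinHeader:
    """Parse '>contig|start:end|gstart_gend_strand|accession|tool'."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) != 5:
        raise ValueError(f"expected 5 pipe-separated fields, got {len(fields)}")
    contig, region, gene, accession, tool = fields
    # Region coordinates are separated by ':'.
    region_start, region_end = (int(x) for x in region.split(":"))
    # Gene coordinates and strand are separated by '_', e.g. '997559_998261_-'.
    gene_start, gene_end, strand = gene.split("_")
    return ProteinHeader(
        contig, region_start, region_end,
        int(gene_start), int(gene_end), strand, accession, tool,
    )
```

Calling `parse_protein_header` on the header quoted above gives a record with `contig == "NC_009446"`, `strand == "-"` and `accession == "WP_012031271.1"`. Splitting on `|` first keeps the underscores inside the contig and accession identifiers intact.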
Homologous phage analysis in the prophage region
Colored proteins are bacterial proteins whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_8 |
1117909 : 1125753
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NC_009446|1117909:1125753|DBSCAN-SWA ATCAATCGCTTAAAAAGCTTTTCAATAATTCTGAACGATTGGGGTGACGCAATTTACGTAACGCTTTGGCTTCAATTTGCCGAATGCGTTCGCGCGTCACATCAAACTGCCGTCCCACTTCTTCCAAAGTGTGATCCGTATTCATATCAATACCAAAACGCATGCGTAAAACCCGTTCTTCACGCGGCGTCAACGATGCCAAAACTTCATTCGTCGCTTCCGATAAACCGCGCGTTGTCGCTGCTTCCAGAGGAGAAATCACATTGCTATCTTCAAGGAAATCACCAAGATGCGAATCTTCATCATCTCCAATCGGTGTTTCCATAGAAATCGGTTCTTTGGCAATTTTCCACACTTTCCGAATTTTATCTTCAGGCATATCCATGGCAACTGATAACTCTTCTGGCGTTGGATCGCGTCCATTTTTTTGTTGCAACATACGCGAAATTCGGTTCAATTTATTAATTGTCTCAATCATATGCACAGGAATCCGAATAGTTCTCGCCTGATCGGCGATAGAACGCGTAATCGCCTGCCGAATCCACCAAGTGGCATACGTTGAAAACTTAAAACCGCGGCGATATTCAAATTTATCCACCGCTTTCATCAAACCAATATTACCTTCTTGAATTAAATCCAAAAATTGCAAGCCGCGGTTGGTGTATTTTTTCGCAATTGAAATCACCAAGCGCAAATTCGCCTCAACCATTTCTTCTTTTGCCCGTTTGGCGCGTTCTTTACCCACTGCCAATTGGCGATAAATATCCTTGATTTGTTGAATGGACATGCCGTAAGTTTTTTCTACTTCTAAAAATTTTTGCTGCTCGGCTTGCACGTCTTCAATCATTAATTTCAATTTTTCGGCATATTTACGCCGCAATGTTGATTGTTTAACCACCCATTCCAAATCCGTTTCATGCCCTGGAAAATCGCTGATAAAACTCTTGCGCGGCATGCCGGATTTTTCAATCACCAAACGCATTAATCGCCGCTCAATGGCATGAATCATTTCCATCGGCTCTTCTAATTGCGCTTCCATCAAATCCATTGTTTTTTGCGGAAAAGAAAAAGCCACAAAATATTGTTTTACTTCGTCATGCAACTTCTGATTTTTCTTTGTCGCTAAACTTTGCCCTTTATCCAAAGCCGTTTTTAATTTTTTAAAAACTTGCGAAAGTTCTAATAATTTCTCCCGCGCTTCTTCCAAACTATCGTTGACGGTAATTTCTGCTTCCGTGCTTTCGTCGCCGTCTTCGTCGTCATCATCGCCTGCTTGCGTTAAAAGTTCATCGTCTTCATCTTCTGATGAAACTGCGCTTTCTGTTTCTTCGTCATCGGTTGCTTCTTCCGTTTTTTGTGCAAACGGATCGCCGTAACCGTTGACCAAATCACTTAGACGACCTTCCCCCGCTTCAACGCGTTCAAATTCCGCAACCAAAGCGGCAATTGCTAAAGGAAATTCTGCTAAAGTTTGTTGCACTAAATCCAAACCGCTTTCAATGCGTTTTGCGATGGCGATTTCTCCTTCCCGCGTTAATAAATCCACCGCTCCCATTTCGCGCATATACATTCTTACCGGATCCGTTGTTCTACCGACTTCTGCATCAGCGCTCGACAAAACATTATTAGCAAGCGATTCTTCATCATCGGTAACCGCTTCCTCTTCTTCCGGTGGCGTTTCAACAACATCAATACCGTTTTCGTTTAAAAAAGCGATTAATTCATCAAATTTTTCGCCTTCTACCGCACTTTCTGGCAATGTTTCGTTGATTTCTTCGTAGGTTAAAAAACCCTGCTCACGTCCTTGTTCAAATAATTCTTTGTATTCTTCATGAACATCATCATCTTCCGGATTGCGCTCAAAATCAGTCATAAAATATCCTTTCAAAAACAATAACCAAATCACGCATTTGCGCTACCAAAAC
GCCACTACCCTCATTTTTAACGGCATCAAGTTTCTTACCGAAATTGGGTTAATTGCCCAATTTTTCAAGTCTTGCTTGTTTTTTAAGATGTTGCCGCAATAATTCCTGCATCATGCCCTCAAATTCAACCGTCAATAATTTTTCATCTAAAAGCGAAAACAACTGCCGCGATCGCATCAAATCCACCTCCAAATGCATGGCACGGATTTCTGCCTCTAAAACCGCTTTATCAGCACCACAACGCAATAAATATAACAATCGATAAAAAAACGGCACTTCTTTTGCCAATAAAGAAGCGTATTGATTCATATCAATTTTTAAATACCAACTTGGTTGTTGCAGCAAAACGGCAGCCATACGCGCTTCCAATTTTGTGGGCGCCGCTGCCAAACTGCCGCCAGCGGCATGATATTCAATTGATTTTTCAGAAGAATCAATAATTGCTACCGATGTATTTAATTGTTTTTGTAATTGTTTTTGCATCATTTGCCGATAATGTCCTTGCGGCAACAAACCCAACCACTGCGCAGCTTTTTCCACGACGGCAGCATATTCTTCCGCCGTCCACGTTGCCCGCTGATCAATATTTAACAAACGTTGCAAAAACAATGATGGCGGTTCGCTGGCGTCCAACAATTGCAAAAACGCGGCGCTGCCGTGTTGCGTTAAAAAAGAATCAGGATCCTCACCCGCCGGCAAAAAAACAAAACGCCAATCGTAGCCTTCTTCATAATGCGTAAAAATAATATGCAACGCTTTTTCCGCCGCATGTTTGCCCGCTGCATCACCATCAAAACAAAAATAGACTTTTTTACTGCGCTTTTTTAATTGCTGAAGATGCGTATCACCAAATGCCGTGCCCAAAGTCGCCACCGCATTATCCAGTCCAAATTGCGCCATTTTAATAACATCTACATAACCTTCGGTAACAATTAACGAGGTTTGTTTGCTGCGCGCTGCCGTATCAAAACCATAAAGTTCATTGCGTTTATTAAACCAAGGCGATTCCGCAGAATTGAGATATTTGGGCTGTTCATTGGCAATGGTGCGCGCACCAAAAGCAATAATTTGTCCGCGGATATTGCGAATCGGAAACATCAAACGCTGACGAAACCAATCGTAATAACGACCATCTTTTTCACCAATTAAACCAACCGCTTGCAATAAAGGAAGTTCATATTTATCACGCAAAAACGCCAATAATTGATTACCCGCCGGCGCATATCCCAACAAAAAATCATCAACAAGATGTTTCTTAATGCCGCGCGCGCGCAAATATTCTCGCGCGGAAACCGCCTCTTCCGTATAAAAACACCGCTGAAAAAATTGCGCGGCATCAGCCAAACACAACAATCCCAATTCGATTCGTGATTTTTCTTCAGGATTATAGGATTCGGTTTCTTCATAAGGAATCGTCAAACCGTGAAAATGCGCCAAAGATTCAACCGCTTCCACGAATGATAAATGTTCATAATCCATTAAAAAGCGCAGCGCATCGCCCCCTGCCTGACAACCAAAACAATAATAAAACTGTTTGACTTCGCTAACGGTAAATGAAGGCGTTTTTTCCTGATGAAACGGACAACAGGCGACATAATTGGCGCCCATTTTTTTCGCATAAGAAAGACGCGACCCGATCAAACTCACGATATTCGTGCGGTTGATCAGGTCATCAATAAAAGATTTTGGGATCCGTTTACCCACGTTGAGCGATATTAAGAAGCTTGCAATTTTTGCCGCAAATAGGCGCTTACCGCTGCCATATCCGCTTTACCTTGCAAATCTTTTTTCAAAATTCCCATCAACGCGCCCATTGAAGAAATAGCCAAAGGCATTTGAGAATCTGCAATCACTTTATCTACATGCGCTTGCATTTCTTCAACACTCAATTGGGCAGGTAAAAAACGCTGAATAATGGCAATTTCCGCCTCTTCTTGTTGCGCAAGATCCTCGCGATCAGCTTGGCGGTATTGCTGCATTGCATCTTGACGTTGCTTAAT
TTGCCGCACTAATTCGTTAATCGCCACTTCATCAGTAATTTCAATTTTCTGATCAATTTCGATTTGCTTAAAAGCAGCACTCATCATGCGCAATGTTTGCAAAACGGCTTTATCCTTAGCAAGCATCGCCTCTTTAATTGCACTTTTCAGCTGTTCTTGAATAAGCATGATTAATACAAGCGTTCAAACCGACTACGTTCTTTTTTCAAACGCCGAGAATTACGTTTTACCGCCGCGGCTAACTTACGTTTACGCTCTTGGGTTGGTTTTTCATAAAATTCACGACTACGAACTTCCGCCAAAACGCCTGCTTTTTCGCAAGAACGCTTAAAACGACGCATGGCAACATCAAACGGCTCGTTATCCCGTACTTTCACACTTGGCATTCAATATCCTTTTGTCCAGAAAATAAAGAAGCGCGTATTGTACGCGCCGCCCAAACGAAAATCAAACATCATTGCAGAAAGAAACACTTAAACCCCATTTTCGATAATCAAAGCGCAAAAATCGTGCGGTAAACCGCGATTTATCCGCAATAATTTTGCCTGCAATTTTTTCTCAATTTACTGATATTTTCTTACTTCCGGTTTTTGGTAATTTTTTATAAAACGTGCGATTTCTTCAAGATTATCCGAAAACAACGTTTTTTCTCTATCGGCGCGCGATAAAAATCCTTCTTCCATCATATGATCGAAAAACTTTGCTAAATAATCATAATAACCGTTAACGTTTATCAAAATGCACGGTTTATCATTTTGCCCCACTCTCGCCCATGAAATCACTTGAGCAATTTCTTCCAGCGTTCCCAGTCCGCCCGATAAAGCCATAAAAACGTCTCCGCGCTCAATCATGCGCGCCTTTCTGTCCGACAAATTATCAACCACGATTAACTCATTTAATTTTGTATGACTGATTTCTCTTTCCACTAAAAAGCGCGGCATCACGCCAATCACGCGACCGCCATTTTCCAAAACCGTATCAGCAATTAACCCCATCAATCCCACTTTTCCGCCGCCATAAATCAATTGATGATGATTTTGCGCAATCCATTTTCCCAACTCGATTGTTTTTTCTTGATATAGTTTATTCATTCCCAAACTTGCGCCGCAAAATACCGTGATATTCATCGTTTAAACCGCCCTTCTATAATTCTTGTTTCAACTCATTGATTTTAAATTGTTCTTGTCATCAAAATGACAGACTCAATATGCGCACTGTGCGCAAACATATTGACAACTTGACCGCGTTCTAAATGAAAACCTTTACTTTGCAAAATCGCCACATCACGCGCCAAAGTTGCTGGATTACAAGAAACATAAACAATTTTTTCTGGAAATTTTTTAGGCAACGCTTGAACAACGGCGTGCGCACCCGCACGCGGCGGATCCAATAACCACGATTTTGCCGACTGCCACGATTTCATTTGCGCCGCGGAAACAGAAAATAAATCCATACATTGCATTTCCAGCCGCGATGATAATTGTTGTTCTGCTGCCATTTTTGCACCGCGCTGCACCATCGCCCGCACACCTTCAACCGCCGTCACCCGCGCGCCCTTATAAGCCAGTGGTAAAGAAAAATTTCCTAAACCAGAAAATAAATCAAGCACTTCTGATTTTTCCAAAGGCGCAAGCCACGCGAGCACCGTTTGAATCAACGCCTCATTAACTGACGCATTAACTTGAATAAAATCATCGGGCGTAAAATGAAGAGTTACCCCAGAAATTGGCTGATAATACAAATCATTAGGTTCACCGAAAAGACAACGATCATTTTCCCATAATTGCCAATGAGCGCCGGCTAAATTTGCCCATTTTTCGCCCCACGCCGGCGACAATGCCCGTTTTTTTGTCCGCAACGTTAACGCCGCGACTTTCTCGCCCGCTGTCAATAAAATCTCATCAACTTTCACTGGTAATAATGCGCGCAAAAAATCGGGTAATAACGGCAATGCGGCGGCTAAATGTTCCTGCAAAATCAAACAATCGTTAACAGCCACC
ACATCATGCGAACGTTTTGCTTTAAAACCAACCGCAATAACATCATTTTCATAATGAACTGCCAACCGCGCGCGGCGACGATAGCGCCATTCTTTTCCTGCTAGCGGCAGCAAAACGTGTTCTGGTTGCGCGCCGCCTAAACGTTGCAATTGCGTTAACCATAATTTTTGTTTGCCCAATAATTGCTCCTGACTGTGCCAATGTTGCAACGCACAACCGCCGCAGCGGTCATAAAAAGGACAACGCGGCGCCACGCGTTGCGGCGACGGCTCAATAATTTCATCAACCACCGCCTCAATAAACTGCTTTTTCGCGCTCGTTTTGCAAAAAGTAACCGTTTCATCGGGCAAAGCACCTTCAATAAAAACCACTTGCCCATCGATTTTCGCCACGCCGCGTCCCTGATAATCCAAGTCAAAAACCGTTACCGCCGGAAATTTTTCCACCCTATTTCTCCATAAAACAACATAAAAAAACCGCCCTAAAGGCGGTTTACAACGAAAACAGTTTAACTAACGTTGACGAGCTTTGCAGCGCGGATTTGTTTTGCAAATAACATACAATTTGCCGCGACGTTTTACGATTTGGCAGTTGCGGTCGCGCGTTTTAGCACTCTTCAGTGAAGATATGACTTTCATAAATACTCCCATCGAAATAGTTCACCACTCGAAACAAGCTTAATTGTTTCGAAATTGGTCGCGCATTATCAACGCTTATGACTTTGTTTTCAAGTATTCTTTTAGAAATTTTGCCGAACATGAATCGTTTTTGAAAACACCAACATGAATGCGTTATCCCAAAAATTACCGTTTAAAAACGCGCAAATGATTCATTTTCAGATCGACTCGCCGCAATCACTGCAAAAATTTCCGTTTCGGTAACATCATCAACCACGTGCGCTTGCCCCAAACCCGATAACAGCACAAAACGCAATTGCCCCGCGCGCACTTTTTTATCCAAAAATAATGTTTGATAAATATCCTCACAATTCAATTGGATATCGATTGCCGTTGGCAAATGAGCGCGCTGCAATAAACTAATCACGCGCGCTTTTTCATCAGCCGAAATCAAACCGCGCATTTCTGATAATTCCGCTGCCATACGCATGCCGATTGCCACCGCTTCGCCGTGCAACCAATGGCGATAAGCGGTGATCGTTTCCAACGCATGTCCAAACGTATGCCCAAAATTCAGCAAAGCGCGCATTCCTTGTTCGCGTTCATCAGCGCAAACGATTTCAGATTTATAACGGCAACAATTCATAATCATGTTTTCTAAAGATATTTTTTCTTTTCGTAAAACGGCGTCCATATGTTGTTCGAGCCACGACAAAAAAGAAATATCATAAATGAGCGCGTATTTAATCACTTCCGCCAGTCCTGCCGAAAATTCGCGCGGCGGAAGCGTAGATAATGTTGCCGTATCCATCAAAACCGCTTGCGGGGGATAAAAAGCACCGATCATATTTTTTCCGCAAGCATGATTAACGCCGGTTTTACCGCCAACAGCAGCATCAACTTGCGCCAATAATGTTGTAGGAAACGTTATCCAACAAATTCCACGCTGATAAGTCGCTGCCACAAATCCTGTAATATCGTTGATTACGCCACCGCCTAAAGCGATTAATAAACCATCGCGCCCCAAATGCGCTTCCAATAAAGTGGTCAAAATATGCTGATAAGTAGCTAAATTTTTATGACATTCACCATCTAATAAAATGATATCTATCACTTTTTTATCCGTTAACGCCGCTTTTAATGACGATAAATAAAGCGGTGCGACAGTTTCATTAGTGATAATGGCAATGTGCGGCGCTGAACAATAAAGATGCCATAATGACGCGTTTTTAATGAGATCGGGCGCGATCAAAATGGGATATTGCTGCGCCTTTTCCGCAGTATGAACCGTTAGCCGTTTCAT
Protein sequences of DBSCAN-SWA_8 >NC_009446|1117909:1125753|1122003_1122219_-|WP_012031388.1|DBSCAN-SWA MPSVKVRDNEPFDVAMRRFKRSCEKAGVLAEVRSREFYEKPTQERKRKLAAAVKRNSRRLKKERSRFERLY >NC_009446|1117909:1125753|1124643_1125753_-|WP_012031391.1|DBSCAN-SWA MKRLTVHTAEKAQQYPILIAPDLIKNASLWHLYCSAPHIAIITNETVAPLYLSSLKAALTDKKVIDIILLDGECHKNLATYQHILTTLLEAHLGRDGLLIALGGGVINDITGFVAATYQRGICWITFPTTLLAQVDAAVGGKTGVNHACGKNMIGAFYPPQAVLMDTATLSTLPPREFSAGLAEVIKYALIYDISFLSWLEQHMDAVLRKEKISLENMIMNCCRYKSEIVCADEREQGMRALLNFGHTFGHALETITAYRHWLHGEAVAIGMRMAAELSEMRGLISADEKARVISLLQRAHLPTAIDIQLNCEDIYQTLFLDKKVRAGQLRFVLLSGLGQAHVVDDVTETEIFAVIAASRSENESFARF >NC_009446|1117909:1125753|1122396_1122960_-|WP_012031389.1|DBSCAN-SWA MNITVFCGASLGMNKLYQEKTIELGKWIAQNHHQLIYGGGKVGLMGLIADTVLENGGRVIGVMPRFLVEREISHTKLNELIVVDNLSDRKARMIERGDVFMALSGGLGTLEEIAQVISWARVGQNDKPCILINVNGYYDYLAKFFDHMMEEGFLSRADREKTLFSDNLEEIARFIKNYQKPEVRKYQ >NC_009446|1117909:1125753|1119881_1121537_-|WP_012031386.1|DBSCAN-SWA MGKRIPKSFIDDLINRTNIVSLIGSRLSYAKKMGANYVACCPFHQEKTPSFTVSEVKQFYYCFGCQAGGDALRFLMDYEHLSFVEAVESLAHFHGLTIPYEETESYNPEEKSRIELGLLCLADAAQFFQRCFYTEEAVSAREYLRARGIKKHLVDDFLLGYAPAGNQLLAFLRDKYELPLLQAVGLIGEKDGRYYDWFRQRLMFPIRNIRGQIIAFGARTIANEQPKYLNSAESPWFNKRNELYGFDTAARSKQTSLIVTEGYVDVIKMAQFGLDNAVATLGTAFGDTHLQQLKKRSKKVYFCFDGDAAGKHAAEKALHIIFTHYEEGYDWRFVFLPAGEDPDSFLTQHGSAAFLQLLDASEPPSLFLQRLLNIDQRATWTAEEYAAVVEKAAQWLGLLPQGHYRQMMQKQLQKQLNTSVAIIDSSEKSIEYHAAGGSLAAAPTKLEARMAAVLLQQPSWYLKIDMNQYASLLAKEVPFFYRLLYLLRCGADKAVLEAEIRAMHLEVDLMRSRQLFSLLDEKLLTVEFEGMMQELLRQHLKKQARLEKLGN >NC_009446|1117909:1125753|1124345_1124471_-|WP_041729592.1|DBSCAN-SWA MKVISSLKSAKTRDRNCQIVKRRGKLYVICKTNPRCKARQR >NC_009446|1117909:1125753|1117909_1119781_-|WP_012031385.1|DBSCAN-SWA 
MTDFERNPEDDDVHEEYKELFEQGREQGFLTYEEINETLPESAVEGEKFDELIAFLNENGIDVVETPPEEEEAVTDDEESLANNVLSSADAEVGRTTDPVRMYMREMGAVDLLTREGEIAIAKRIESGLDLVQQTLAEFPLAIAALVAEFERVEAGEGRLSDLVNGYGDPFAQKTEEATDDEETESAVSSEDEDDELLTQAGDDDDEDGDESTEAEITVNDSLEEAREKLLELSQVFKKLKTALDKGQSLATKKNQKLHDEVKQYFVAFSFPQKTMDLMEAQLEEPMEMIHAIERRLMRLVIEKSGMPRKSFISDFPGHETDLEWVVKQSTLRRKYAEKLKLMIEDVQAEQQKFLEVEKTYGMSIQQIKDIYRQLAVGKERAKRAKEEMVEANLRLVISIAKKYTNRGLQFLDLIQEGNIGLMKAVDKFEYRRGFKFSTYATWWIRQAITRSIADQARTIRIPVHMIETINKLNRISRMLQQKNGRDPTPEELSVAMDMPEDKIRKVWKIAKEPISMETPIGDDEDSHLGDFLEDSNVISPLEAATTRGLSEATNEVLASLTPREERVLRMRFGIDMNTDHTLEEVGRQFDVTRERIRQIEAKALRKLRHPNRSELLKSFLSD >NC_009446|1117909:1125753|1121548_1122001_-|WP_012031387.1|DBSCAN-SWA MLIQEQLKSAIKEAMLAKDKAVLQTLRMMSAAFKQIEIDQKIEITDEVAINELVRQIKQRQDAMQQYRQADREDLAQQEEAEIAIIQRFLPAQLSVEEMQAHVDKVIADSQMPLAISSMGALMGILKKDLQGKADMAAVSAYLRQKLQAS >NC_009446|1117909:1125753|1123004_1124279_-|WP_012031390.1|DBSCAN-SWA MEKFPAVTVFDLDYQGRGVAKIDGQVVFIEGALPDETVTFCKTSAKKQFIEAVVDEIIEPSPQRVAPRCPFYDRCGGCALQHWHSQEQLLGKQKLWLTQLQRLGGAQPEHVLLPLAGKEWRYRRRARLAVHYENDVIAVGFKAKRSHDVVAVNDCLILQEHLAAALPLLPDFLRALLPVKVDEILLTAGEKVAALTLRTKKRALSPAWGEKWANLAGAHWQLWENDRCLFGEPNDLYYQPISGVTLHFTPDDFIQVNASVNEALIQTVLAWLAPLEKSEVLDLFSGLGNFSLPLAYKGARVTAVEGVRAMVQRGAKMAAEQQLSSRLEMQCMDLFSVSAAQMKSWQSAKSWLLDPPRAGAHAVVQALPKKFPEKIVYVSCNPATLARDVAILQSKGFHLERGQVVNMFAHSAHIESVILMTRTI |
8 | Vibrio_phage(16.67%) | NA | NA |
Homologous phage analysis in the prophage region
Colored proteins are bacterial proteins whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_9 |
1130701 : 1145709
Sequences of DBSCAN-SWA_9
Nucleotide sequences of DBSCAN-SWA_9 >NC_009446|1130701:1145709|DBSCAN-SWA ATCAGCCATGAGATTTGCCTCCAATCAAAGATTCTATAGTTTCAATCAAAACATCTTCACGATAAGGTTTACCCAAATATCCTTGCACGCCAATCTTATCAGCCCGTTCACGGTGTTTATCACCCGTGCGCGAAGTAACCATAATGATGGGAACATCGTTATACATTGCATTTTGGCGCACATGAGCGGCAAATTCGAAGCCGTCCATTCGTGGCATTTCAATATCAAGCAAAATAACATCAGGCATAAATTCGTTTAAAACTTCAATCGCATCTAAACCATCTTTTGCCGTTTGAACAACATAATCATTGCGTTCCAATAAACGCGTAGAAACTTTACGCATTGTGACCGAGTCGTCAACCACCAATACTTGTCGGGCGCGAACTTCTTCAACCACATGAACACTTTCTAATTGAGACAATCCCGCAATTCGTGTTGATAAATCTAATAAATCCAACACTGGAACAACGCGACCGTCCCCTAAAATCGTTGCACCAGAAATACCCGGTAAATTCAGCACCTGACGATTCACGTTTTTCACAATAATTTCAAGGCGATTTGCAATGTGTTCAACGTGGAAAGCAACCGGCTCACCAACCCCGTTAATAAATAAAACGGGCGCCGTATCCGTATCCATATCAAATTCATAATTGGCTGATTGGAAATAAGATGCCAACACGAATAAACGATAATCCATACCATCATATTCGTGATAAACCACCTCGCCGTTAAAGCTGCGTTGCAATTCCTCTTTGCTCACATGGCTGATTGCTGCAACTGCATTCATAGGCGCCGCATAACGTTGATCGCCGATATCAACCAACAATACTTCAACAATACTCATGGAGAACGGTAAAACGATAACAAATTCCGTTCCCTTTCCTGGAATAGAATCCACGCTCAAACGTCCGCGCCGTTGCTTGATCATTTCGTTGACTACGTCCAAACCAACGCCGCGTCCCGAAATTTGCGTTAACGATTCAGCAGTTGAAAAGCCCGAACGCAATAACATGGTATTGAGATATTGGGTGTCATTTTCTCGCTCTGGATCCAACCAACCTTTGCTGCGCGCTTTTTCACGTATCCGAACATAGTCAAAACCGCGACCATCGTCGATAACTTGCACTTGTACGTTAGAAGCTTGCGCCGTCACTTTAATTTTAATCGTACCGCTTGCTGGTTTTCCTGCGGCGACACGTTGTTGTGGCAATTCGATTCCGTGCCCCACGGCATTACGAATCATATGCTCCAGCGGTGGCAAAATATCTTCAACTAAACGCCGTTCCAATTCGACGCCGCCGCCTTCCAGCACCAAATCAACCTCTTTACCCAACGATTTCGCGGTTTGCCGCACTAAACGCCGCAAGCGAGATTCATTCACATCAAAACGAACGAGTTGGGTTGCCATCAAACGATCTTGAATCGCGCGCTGCAAAACGCCTTGTTGCGTTGATAAATTGCGCATCAACGCATTTTCAGCGGTCAACGTTTCTTGTACGTTTTTCAAGTCTTCAATAGCTTCAGACAATTGCCGCGAAAGTTGTTGAACTTCGGTGAAACGGTCCATTTCCAACGGGTCAAAATGCTCATCGTCTTCCGTATCACTTTCCCGACGGAATAACATTTGCGCTTCTGTTTCATTATCCAAGCGCCGCATTTGCTCATCGATCCGAGTGGCAATGCGGGTTAATTCGCTCAAGTTAAATTCAGATGCCAACGTTAAGTTTTCCATGCGAGAACGAACAATCGCATATTCCCCAGTCATAGAAATCATATCGTCTAACAAAGCCGAATCGACGCGAACAAATCGCTGTACATCATTGCCGATTTTATTATTATTTTCGGGCACTTCTGCAGCAGCAGGTTTTTCCGCATCTGGAGTACTGCTGCTTTTTTCTGCCGGTAATTCTGCTTTTTTAGCAACA
TCGCTGGTTTTTTCTGCAACTTCCGGCGCTTTTTCTTTAACCGGTGCTTTTTCAGCTTTAGGCTCTTTTTTAGCAACTTCCTCAACAGGTTTTTCTTTTTTCGCCGCGGCTTTCGGTTCTGCTCCGGGCATTTCAAAACGCCCTTGATTAGCAAACAGCTCTAAAGAACAGTTAACTTGCGTCGCCGGTTCAGGCATTTCATGTCGTAACACGCTATCAAGCATGTTATAAAGCGATTCATGACCTAAACCGATTAACGTTTTCGCTTGTTCATGTTTTACATCAGGATAATCCGCTAATTTTTCAATTACAGATTCCATCAAATGCGAAGTATCGCCGATGACGGTAAAACCAACCATACGCGCGCCACCTTTTAAGGTGTGCATATTTCTTTTCAGTTCTTCGATAACGGTATGATCTTTTAAATCTTGACCTAATAATTGTTCGGCTTGAGCAATTAATTCAGAAGCTTCATCAGTGAAAATCTCAACCAAAACAGGGTCAATGCTTTCATCAAACGGCACGCCGCCTTGGATAACTTCCTCTTCGGCATCTGCTTCATCAACAACTTCCGATGAAACATCGGTTTCTGCTTCAACTGCACTTTCTACTGACGCAGCATCTGATTGCGCATTCTCACCAGCAGCGTCATCATCACCAAATTGCGCCGCCAACAAATCTTCAGCTTGTTTTCTTTCTCTTTCTTTAATGCGTTTTTCGTTTTCTGCCACAGCAGGATACGGCAACGGCATTCCCAAAAAGCTATTGATATGATTGAGAATATAAGGATTTGGTGTCGGTAAATATCCATCGCGAATAGAATCCAACATTTCCATACTTTGCCAAATCGCACCCGCAAGAATAGAGCTGGCGCGCGCGTTGCCCTGCAATTGTCCATCTAAAATCTGGCTGGTAATTAATTCAACGCCTTGCGCTAAATCCGCAATCGCAGAATAACCCGCACTGGTTGCACTGGTTTTTAAAGCATGCATAGAACGACGAACAGCGTCTAAATGATTTAATTCTTCGCGATTTTCGTCCCATTGTTCCGCTTCTAATTGACTATCGCCCAATAAATGCAAGGCTTCGGAATGGAACAATTTGGCAATTTCTTGGTTGACATTCGGATCAGGAATCAACGGCGTTTCAATAGAACCATCATCATTGATATCTAAAGGAATGTCAGCTGCCGAACGCGCAATCATCGTTGGCTTCGACATTTCATAATCGCTGTGTGGAATTTCCGAACTCAACCATTGACGCGTTGACATCAATGAATTCATCGTCCGTTTTGCATCTTCTTCGGAACTATCGTCAGAATAATGTTCAATTAAATCCAAGCTTTGCTTCAATAACGGCGTAGATTGACGGATACCTTTACCGCCGACTTTCCGGTACTGTAACAACAATTGTTCCATCGCATCTAACAAGCGCCAAATCCACGCCGGAACATCGTTTAAATCGAGAGAATCTTCAATTTCGTAGACAGAATCTAATAATTCCTCAATTTTATCTTCGGAAGCATCTTCTTCGAGTGATTGAAAACGTCCAGAAAATTCGTCCAAAGCCCTTAAAAATAAGCTGGCATCTCGAACCGGCGGTCTTGCGCCATCAAAAGAAAATGAATACACACTGCCTTCTGATGCGGATTCGGAAGGAACTTTAGTAATATTAGATAATTCCTCGCTTAACCGTGAAGAAGGTGCCGGCATAATGTACGGTGAAAAAGATGAAAACCGAACATTTGATGCCTCTTTTTCAATATTTCTATTCGGCTTTTTATGAGTAGACGGTTTCTTTTCGGGCGCTGGTTTTGTCAACTCTGGAACCGAAATCGCATCGATGATTTGTTGACTGGTTTTTTTCGTTAAATCATCTTCTGTTTTTAAATCGTCATAAATATCTTGCAATACGTCGTTAACGTCTTGCACTTCTGGCGTTTCATCTGCAAAATTTTGCGGTGCATCGGCATCTGGAGAAACTAAATCCGTATCCAAAAA
TACGGAATCACCACCGATTTCCCCCGAATTATCCACGTCAGAAACCGCATTTTCTACGACCACAACATCGGGCTCATCAGCTGCAACATTGTTTTGAACTGGTAAATCATCAATGCCTAAATCAAATAAACCTGTCGCATCTGACGGCGCCTCTACAATCACCGCCTCATTTTCATTTACATCTACCGCAGGCGTCTCAACAGCATCATCAATACCAAAGCCCAAATCATCTGTCACAGATACCAAACTGGAATCAAGATCAGATGGCGCTAACGCATCAACTGCCGGCGAATCTATTGCTAATTTCTGCTCAATTCGTTGCACGATGCGGTCAACAAAACCTTCCGCCCCATTTTCAGAAGCAAGCAATTGATGCAATTGATTACCTAAATTAGGCATTAACTCATTAAAATTGAGCTCTTTATCATTTTGTAAATTTTTAGCTAAATGATCGGCAACTAAAGTCGCATCGGTATCATCAACATCGCCTGCGCGCAACATTTGCCGCAAAGTATCGGCAATTTTATTGGTTGCTTTTTGCTCGGCGGTATGTGTAACGATATGACTTAAAAGGTGATCAATCACATTTTGATCAGCGCCAGCCAACTGATCAGAAACCATTTGTTTTAGTTTTGATTCATCAATCGTATTGGCATCAATATTTGCCGATTGAAATTGGGATTTAACCACATTTGCTAAAACTTCAGTTTCTTCAGTATCCAGCTGCCCCATTGCCAAACCTTGCATCAAGGTTTTTGCCGCTTGTTCGTAGGCGGATTGATGTTGTGGCGCCGGTTCTGCTGGTTTTTCAGAAGAAGAAATACGCGCGGAAACAAAGCGTTGTCTCATCGTTTGACTGACAATGTTTAACGCTTTTGCCTGTGAAAGTAACGCTTGCGGATTCTCGATAAATTCAGTCTCCCCGCGCATCGTTTTCAAAAGATTAACGGAATCTTCCACGGAATCTTGAATAATCGGGTTCGCATCGTAAAGATTATCCAAAATATCGTTTAACAAATTCTCATTTTGCCAAGCAAGTTCTTGTAAAGATTCATAACCTAAATTTTGGCTCGAGCTACCAATTGCGCTAAATGCTTGCTTAATATTCCCTGGATAATCTAGATTATCCGGTGCATTTCTCCACCGCGGCAAATGGTCGTCCATTTTCCGAATATTTTCATTTAATTCTTCTAAAAATTTTCTTTTTTGCTCGGCGGCAATTTTATCCGCGTTAGAAACTACGGCATCAGGAATATCTGCTTTGCGAACGTCAACCGCTTCCACTTCTGGCATCGTCGCCGGAGTTGTATCAACTTCCGCGGCAGCAGCAAAGGGCAGACATTCTGCATAAACTTTTTGTAAAAACGCATCGACTTCTGCGTCGCTGGGTTTTGAATTTTCATTTTGCAAATAAGAAGTTGACTTTTCAGCGATCAACGCTTGCGTTAATAATGCTGGGTGATGCTCAACAAAAGTTTCTTCATTACGCAATTTTTTCAGCAAACAAATAGCTTCTTTAACATTGTGCAATACGATGCGGTTAACCGAATAATGCTTATCTAACACACGGTTAAGCATTTGTTCGTGTTGCCACGCGCATTCGCCTAATGCTTCATAGCCCACAGTCCGTCCCGAACCTTTCAAAGTATGAAAAGCCCGTCGAATCACTCCAGTGCTTTCATCTGCTTGAGCGGGATCCGTCCAAATATCATAAGAATCCGATAATTCAACGATTTTTTCATCAAATTCTTCCAAAAAGAATTCTCTAATTTCATCATCAATCACCGCTGATTCATGGCTTTCGGTAAATGATACTTTAGATTCCGGTACGGATAATTGTTCCCAATCTGTCAATCCCGTATGTTCAAAGAAATCAACATCAACAGAACCACTGGCTTGAGAAACGGGTTCTTGGATTTCTGTTTCTACCGCTGAATCCAATTCCAATACATCATCAAAGTTCACCGCTTCAACTGCTTCTGCGGCAGCTTTAGGTGCTG
CGATTTCTTTTTCTGCAACTGGTGCAATATCATCAAAACCAAATCCTAAATCTTCAACTGCTTCTGTGGCAGCTTTAGGCACTGCGGTTTCTTTTTCTGCAACTGGCGCAATGTCATCGAAGCCAAATCCTAAGTCTTCAACTGCTTCTGTGGCAGCTTTAGGCGCTGCGGTTTCTTTTTCTGCAACTGGCGTAATGTCATCGAAGCCAAATCCTAAGTCTTCAACTGCTTCTGTGGAAGCTTTAGGCGCTGCGGTTTCTTTTTCTGCAACTGGCGCAATGTCATCGAAGCCCAAACCACTTTCAACCGCTGCCGATGAAGTATTTGTTGAAGCAGAAGCGGGCGTGGGTGCAGGCGTGGCAGAATCAAAATCAACCGTTGCCGTTATTTTCACCGCCGCTACCGTAGTTTCTCCAACATCGATAAAGCTATCCAACGCATCTAAATCGCCCAACAAATCATCAAAGCTTTGTTGTTCTTGTGCTGGTTTTTCTTTCTTCGGCGCTACCGTTTGCATTGAAGAGGCTAAAAGACGTTGTGCATCTTGTGTAATCGGGTAGATAGCAGCTAATTGATCAACGGCATTTTTTAAGAAATCCACATCGCTTTCAACGAGTGTTGCCGCGTCTTCTAAATCGGTGAAAATATATTCGGTAGCAACTAATGCGTGAACTACTGGCGGGTAGACGGTTTCTTTAAATTCTTTTTGCGCCGAAAAAATGGCAAAAATGTGCGCTAAGCGGCGCATTTCAACCGATAAACTTTTTTGTCCAAAAAATTGGCAGACAGTGCCGTAATACGCGGTGCGCCGCGCGGCTTCTGTCCAGTCAGCTTCGCTTGAGACGCGTTCTTCACCAACAACAATTTTTCTCAATGATTCAAAATATTGCCGAGCAACTGCGCCCGTATTCGTAATTAAGCGATAAAAACCAATGTATTGGGAATGGCTTCTTTGTTGTAAAAACGCGGTTTTAATGTCGGTAACTTTGGCATTATCACAGGAATGTTTACTAATATTTTGTAAAGCTTCTTCAACAAGCAACATGGATTGAGCAATTTCAACCCAATTCACTTTCGCCAAATCTCCTTTCAACATCTGATTGGAGTACTTCAGCAAAAATTGCGGCGCTTTTAAGTTCAATAAAGTAAATAAATTAATCAGCTCTTGGTTATAGGTGATCAGTTGATTAACGTCTAACGATTGACCGCCACTTGCAGCAAATTGCTCAATTTGGTTGCGGTTCACTTCGATAAGTTCATTTACGCGTTCAGCCAAAACTTGGTAAGTTTCCAGATCAATTTGAGGATGAGTACTGGCAGGCGTTTTCTTCCAACCATCGGTAAATACTGACGATAACGATTGGGAAATAGCGTTTATTTCCTGCGTATTATCAGCATGAGGCGTATTTAATCGATTGATATAGTCGCTTAATAACAAAATCCCAGCGGCAACCTCGCTCCCGACATCTGAAAAAGATTTATTATCAACCGCATTTTCTTTAAATTTGTTATCCAGAAGTTTGATAACTTTTTGAACTTTATCAGATAATTCCGCTGCTGGTTTTTTATTGAGTAAAAATAGACCACCGCTGACTTCGGTAAGCTGATTACAAAAAGCTTCCACCCCGCCGGACTCAGGACGATCAAGCCATTCAACCAAATGTTTTTTCACGTCAGCAATATTTTTTGCCATTTYTGCTTGTACGCGAAGCAATAACTGACTGGCTAATTCACTACTCATGTATTCCCCTTACACAAATTATCCCGATATCCATTCCGTTTAATTACCATTCGCTTAAATTAGTATCCTTCCGGTAATTTAAATCCGGAAACAGAGTCATTCAATTCCGATGACAATTGACTCAAGTTTGCAATTGAGTCAGAAGTTTTAATAACGTTCTCAGAAGTCGTTGTCGCCATATCGTTAATTCGGGACATATCTTGAGAAATATTAATTGCCATAGAAGAAACTTTTCGCGTTGAATCCGAAACTTTTTCAA
TCAATAACGCCAGAGAAGTTGATACTTCTTGAATCCGCGTCAACGACATTCCCGCTTCTTCCGCAATATTGGCGCCGGTAACCACTTCACGCGTGGATTTTTCCATTGAAGCAATGGCTTCGTTTGTATCCGTTTGAATTGTTTTTACCAAAGATTCAATTCGTCGGGTCGCGTTTGAAGAGCGTTCCGCCAAACGTTGAATTTCATCGGCGACCACCGCAAAACCGCGTCCCGCTTCACCAGCAGAAGTTGCTTGAATAGCGGCGTTTAACGCCAAAATGTTGGTTTGATCGGTAATACCTTTAATGATTTCAACAATGTTACCGATTTCTTGAGAACTTTCCCCTAATCGTTTGATTCGTTTAGAAGTTTCTTGAATATTTTCACGAATATTATTCATGCCGCCGATGGTATCATTTACCCGTTTGGCACCTTCTTGCGCGATGGTTACCGAATTGCGGGCAATTTCTGCGGAATCAGTCGTACTTGCAGCAACGCGGTCCAAAGAGCCCACCATGTTGCCGATTGTTTGAGTAATCGAGTGAATTTTTTCCGCTTGCTCAATAGAAGATTTTTGCATATCCGATGCAATAAATCGGGTGATATTAGCGGCATCAGAAACGCGGCTAGAAGTTCTATTAATCGTTCCGACCAATTCCCTCATTGATTCAATCGCGTAGTTAACGGAGTCAGCAATTGCGCCGGTAAAATCTTCGGTAACTTGCGCTTCAACCGTCAAATCTCCATTTGACAAGTCTGTCAATTCGTCCATTAACCGCAAAATGGCGTCTTGATTTTGCCGCTCTTTCATTTGTTGACGAATCGCAATATCTTGAATTTCTGAAGACTGGCTAGAAATAAATAAGTACCAAAAAATAGCTGCCAACACTACAAACAGTAATAAAATCGGTAGTAGAAAACGCAATCCTGCATGATTGACATCTGCTATTCCTAGTGACAGCGGTTGTTCCTCTGTTCCCGTAAACAACCCAATAATAATTAACACAATTAAAATGAGCGACAATGCCCCAAACCCTATTGCCACCCACCCAAAAACGTTTTGCTTGTCTTGATGATTATCGTCTTGTTGAGATCTTCTACTACCTTTCATGTGTCACCTGCGCTTCATTGCATTTGTTGGCTAAATACAGCATCGTTAATCAGATGCCCGATATTAATTCTAAACCAGTCCTTATCTTCCATAAAAATATGACCATCAACCCAGATTCTTTTCGAATCTAAAGCGTTAACTTCTACTTGTCGGATACCGCTGATACTGTCTACTTTTAAAATATATCCCTGACCCGCATCGCGTAACACAATATAATGAGATTTCTTTTGCGTGGCATATTTTGCCTTTGGTTCAACAAAATACTTAAAATCTGTTACAGAACAAACATCTCCCCGATGGCTCGTTAAACCTAATAACCAAAAAGGACAAAATGGCAAAGGAGTAATGGTTGGTAAATCTTGTGTTACTTCTAGTACAGAACCCATACCAATGAGATAGTTTTCAGCCCCCGCAGAACACGCCAAATAGCTGGCATGCTGCTGCACCGCAGCAACGCTAATCGTGCCCCTTTGCTGAAATTGCCGCACGTATCCTACTAAAATCTCAAAAGGTGAAGAATTTGCTAAAGTACTCATGCCAGTAACTCGCGGATTTCACGTAACAACGCTTCTTTTTCGGGCGGTTTTACCATATATTTTTTAGCGCCATTGCGCATACCCCATACTCTATCTGAATCTTGGTCTTTTGTTGTTAACATCAGAATAGGAATATCTTTGGTTTCTGGTGTGCGGCTAAGCTTTCTCGTTGCTTGGAAACCACTCATTCCCGGCATAACAACATCCATCAGAATCAAATCAGGATGCCTATTCATTGCCACCGCAATGCCTTTATCGGCAGCTTCTGCCCATAAAACTTCATACCCAGCTTCAACCAACATCGATTTTACAACATTGGCTTCAGTGGGTGAATCATCAACAATAAGAATGGT
CGTCATTTCCTGTACCTATACCTTTCTATACCTTCTATACCTCTAGTGTGTAGCATGAACCTGAATTGCTTTTAATAATTCGTCGCGAGAAAAAGGTTTTGTCATGAAATGTTCCGAACCTGCGATTTTTCCACGCGCTTTATCGAACACACTATCCTTACTTGATAACATAATCACCGGAATATTTTTATACTCTGGATTACTTTTAATCACGGAGCACACTTGGTACCCATCAAGTCGCGGCATCATAATATCGAGGAAAATGATGCTCGGAGAAAACTCAATAATTTTTGAAAACGCCGCAAACCCGTCTTCTGCCGTTGCAACTTGAAAACCCTCTTTCCCCAAGATTGCTTCGGCGGTTTTCCTGATCGTACCACTATCATCTACGATGAGTATCTTTACTGATTGACTTGCTTCAGCCACAACAACTCCTTAATAATTCAAATCAAAACCGAACGAACTATCCTCTTAAACCAGCGATAGGATAGCGTATTTATTTCATCGAGGCGACAATAATCCATTTTTTAATGTGAGGACAAAAAATCCTCAACAATGACCGCCGCCGCCGCGCTATCCTTATCAATTGTTTTTGGAAAACGCATTTGTGCAGCATGAGAAGTTAACGTCTCATCAATAAAATGTACGGGAACGCCGTACAGTTTAATACATTCATCTGCCAAACGATGAATTTCATCTGACAACGGATGCTTCTTTCCATCTGCCAAGGAAGGTAATCCTATCACGATACAATTGACTTTCCACTGCTGCACCAACGCCGAAAAAGTATCAGCGTGAACCTGTTTCAGCGGGAGGTTAAATCTTTGCAATGGTTTTGCCAAACCCGTCACCGACTGCCCAATTGCAACCCCAGTATGATATGTTCCTACATCAATCCCTAATACCCACTGCACCATCGGCTACCTCCTCAACGAGATGTATGGTTATAATGCCGCGTAGCATTGATTCAATCAAGTACAATAAGAATCCATTTTCAAACAGGAAAACTCATGATACATCACCGCCTTATTATCCTCGGTGCCGGGCCTGCTGGTTATACTGCTGCCATTTACGCCGCACGAGCAGGACTCGAACCCGCGTTAATTACGGGGCTCGAACCCGGTGGGCAGCTCATGACTACCACTCATGTTGACAATTGGCCAAGCGCATTTGAAGGAATACTTGGGTCGGAACTCATGACGAATATGTGCCAACACGCGCAACGTTTCCAAACCCAAATCATTTACGATCACGTTACCGAAGTTAATTTACAGCACCGCCCTTTTACCTTAAAAACAGAAAAAGAAACCTACAGCTGCGATGCTCTCATCATTGCAACGGGCGCCCGCGCAAAATATTTAGGGCTCCCCAGCGAAACCCAATATTTGGGATACGGCGTGTCTGCCTGCGCCACCTGCGATGGTTTTTTTTACCGCAATAAACCCGTCATGGTCATTGGCGGCGGCAATACGGCGCTGGAAGAAGCGCTTTATTTATCCAATATTGCCAGTTCTGTAACACTTGTGCACCGCCGCGATGCATTCCGCGCGGAAAAAATCATGGTTGATCGTCTAATGGAAAAAGTGAACGCGGGAAAAATTGTTGTGAAATACAGTGCACAACTGCAAGAAGTTTTAGGCGATGATAACGGCGTCACCGGCGCGATCATTCGTTTTAATAACGGACAAATCGAACAACTCGCCGTCGATGGTATTTTTATTGCCATCGGTCATCAACCCAATACTGAACTATTTAAAAATCAATTAGCAACTGACGCACATCATTATTTATCCGTACACAGCGGCAGCAACGGTAACGCCACTCAAACCAGTATTGACGGCGTTTTTGCCGCTGGAGATGTTGCCGATCCTGTATACCGCCAAGCAATCACCTCGGCAGCTTCTGGCTGTATGGCAGCTTTAGATGCCGAGCGCTATTTAGCAACACTCAAAAATTAACTCAATGAAATTAGGAGTTCTCTACCATGGCAAATAAACAAG
CCCTTACCATTCCCGATATTGGCGATTTTGCTGATGTTGATGTTATTGAAGTTCTCGTCAAAGTTGGCGACAAAATCGCCGTTGACCAATCTTTGGTGGTTTTAGAATCAGATAAAGCATCAATGGAAGTTCCCGCTAGCATTGCTGGAACCATCACTTCTTTAACAGTAAAAGTCGGCGATAAAGTCTCTGAAGGCAGTGTTATCGGCGAAATTGAAGTGGCAAACGGCGCAAGTGCTGCAAAAACTGAAGAAAAACCTGCAGAAAAAACCGCGGAAAAACCAGCAGCGGCGGCAACACCCGCAGAAAAAACCGCGGTACCCGATGATGCCGTTGATTTAGTTGTAATCGGCGCTGGTCCTGGCGGTTATTCGGCGGCGTTCCGTGCGGCGGATTTGGGATTAAAAGTAACTTTGATCGAACGTTACGCTACTTTAGGCGGCGTTTGCTTAAACGTTGGCTGTATTCCTTCCAAAGCGTTGTTACACGTTGCAGAAATTATGGAAGAAGCCGAATGGGCAAAAAAAGCCGGCGTCACATTTGCCAAACCCAGCGTAGATTTAGACGCTTTGCGCACACATAAAGAAGGTGTGATTAAAAAATTAACCACCGGTCTTGCTGGCATGGCAAAAGCAAGAAAAGTAACCGTTATTCAAGGTGTTGCGCAATTTACCGGCAGCCACAGTATTCATATTAAAACCGTTGACGGCGAACAAAATCTCAATTTCAAAAACTGCATCATCGCCGCCGGTTCTGAAAGCGTGAAATTACCTTTCATGCCAACCGATCCGCGCGTTATCGATTCCACCGGCGCTTTGCAATTGCAAGATATTCCAGAACGATTACTCGTTATCGGCGGCGGCATCATCGGCTTGGAAATGGCAACCGTTTATCACGCATTAGGCAGCAAAATTGACATCGTCGAAATGATGGACGGTTTAATGGCAGGCGCGGATAAAGATTTAGTAAAAGTATGGCAAAGACGCAATCCCGATCTTTTTGAACACATTTATTTAAATACCAAAACCGTTGCCGCCGAAGCCAAAGATGATGGTATTCACGTTACTTTCGCCGGCGACAAAGCGCCAAAAGAAGCGCAACGTTACGACCGTGTTTTAATGGCAGTAGGACGCCGTCCAAACGGCAAAACTTTAAATGTGGAAGCTTGCGGCGTCACTGTTGACGAACGCGGATTTATCCCTGTTGATAAACAAATGCGCACCAATCAAGCCCATATTTTTGCCATCGGCGATATTGTGGGACAACCCATGTTAGCGCATAAAGCCGTACACGAAGCGCACGTTGCCGCCGAAAATGCCGCAGGACATCAAGCCTTTTTCGATGCGCGCGTAATTCCAGGCGTTGCTTATACTTCTCCTGAAGTTGCTTGGGTGGGCGTTACAGAATCACAAGCCGCTAAAGAAAATATTGCCGTTGAAAAAGCCGTTTTCCCGTGGGCTGCTTCTGGTCGCGCTATTGCAAACGGTTGCGATGAAGGTTTTGTGAAACTGATTGTTGATAAAGCGTCTCAACGCGTCATCGGCGGCGCCATTGTTGGTCCTAATGCCGGCGATATGATCGGCGAAATCGCTTTAGCAATTGAAATGAATGCGGTGCCGGCTGATATCGCTTTAACCATTCACCCGCACCCAACATTAGGCGAAACCATTGGTTTAGCTGCTGAAGTATTTGAAGGTTCTTGCACGGATTTACCACCGCAAAAGAAACGTTAATCAACCTCACTCAACGCTGCTAGGCACGAGCTTAGCAGCGCTTTTTTATGCGCTGATAATTCGGCGTTTTTTTAAGGATTTGAGATGTTTTATTATTACTACCGTCTTTTGCCGCGCAAATTATTAAGCCGATTATTTTATTGGCTCGCGCGCATTAAAATCGTATGGATCAAAAATGTACTGATCCGCGGTTTTTGTTTTGTCACCAAAGCAAATACTGATTTTGCTGCCGAAAAAGATCCTTTTGCTTATCCCACGTTAAACGCG
TTTTTCACCCGCACGTTAGCCGCTGATGCACGTCCCATCGATGCCGCGCCAGAATCCATCATCAGTCCTGTTGACGGTCGCTGTGCGTATTACCACACCATCGAAAATGGTTTAATGATTCAAGCAAAAAGCCAACGTTATTCTCTTGCCGCTTTGCTTAATAGTTATGAATTGGCGCAAGCATATGAATCGGGAACGGCAATTACGCTTTATCTTGCTCCAGATGATTATCATCGCGTGCATATGCCGTGTGACGGTCATTTGGTCAGCATGACTTTTTGCCCTGGCGATAAACATAGCGTTGCTTTGGATTTATTAGAAAAAATTCCGTTGCTTTTTGCTGGTAATGAACGTTTGGTTTGCCATTTTGAAACCGAATTGGGAAAAATGAGCGTGATTTTTGTCGGCGCGTTAAACGTAAGCAGTATTTCAACGGTTTGGCATGGCATCGTCAGCGATAACGGCGCCGATAATCATTATTTTTATCCAGAAAAACCATTTTTTGCGAAAGGCGCGGAACTGGGTCAATTTAATTTGGGTTCTACGGTGATTTTATGTTTTCAGTCGCAACAGATTGATTGGCAAAATGAAAAACTCAATAATCGCGACAAAATCTTGATGGGAGAAAAAATTGCCTGCACCTATTCTTGATATTTTTGTTGACGGCGCTTGCAAAGGAAACCCCGGCATCGGCGGCTGGGGCGTTTTAATGCGTTACGGTCAACATGAAAAAGTATTGATGGGTGCACAATGGCATACAACGAATAACCGCATGGAATTAACCGCGGCAATTGAAGCTTTAAAAGCGATTAAACGACCGTGCCCTATTTTAATCAGCACGGATTCTGTTTACGTTAAAAACGGGATCACCCACTGGCTACCCGTTTGGAAAAAAAATAATTGGCGGAACGCTTCCAAAAAACCAATTAAAAACATTGAATTATGGCAAGCGCTCGACCAACTTAATCAACGTTATGAAATTGAATGGCGCTGGGTTAAAGGACACGCTGGCAATCCAGGTAATGAAATTGCTGATGAACTTGCCAATCGTGCCATTGAATCGTTGCGGCAAAAAACGTCCTAA
Protein sequences of DBSCAN-SWA_9 >NC_009446|1130701:1145709|1142609_1144358_+|WP_012031404.1|DBSCAN-SWA MANKQALTIPDIGDFADVDVIEVLVKVGDKIAVDQSLVVLESDKASMEVPASIAGTITSLTVKVGDKVSEGSVIGEIEVANGASAAKTEEKPAEKTAEKPAAAATPAEKTAVPDDAVDLVVIGAGPGGYSAAFRAADLGLKVTLIERYATLGGVCLNVGCIPSKALLHVAEIMEEAEWAKKAGVTFAKPSVDLDALRTHKEGVIKKLTTGLAGMAKARKVTVIQGVAQFTGSHSIHIKTVDGEQNLNFKNCIIAAGSESVKLPFMPTDPRVIDSTGALQLQDIPERLLVIGGGIIGLEMATVYHALGSKIDIVEMMDGLMAGADKDLVKVWQRRNPDLFEHIYLNTKTVAAEAKDDGIHVTFAGDKAPKEAQRYDRVLMAVGRRPNGKTLNVEACGVTVDERGFIPVDKQMRTNQAHIFAIGDIVGQPMLAHKAVHEAHVAAENAAGHQAFFDARVIPGVAYTSPEVAWVGVTESQAAKENIAVEKAVFPWAASGRAIANGCDEGFVKLIVDKASQRVIGGAIVGPNAGDMIGEIALAIEMNAVPADIALTIHPHPTLGETIGLAAEVFEGSCTDLPPQKKR >NC_009446|1130701:1145709|1140667_1141051_-|WP_012031401.1|DBSCAN-SWA MAEASQSVKILIVDDSGTIRKTAEAILGKEGFQVATAEDGFAAFSKIIEFSPSIIFLDIMMPRLDGYQVCSVIKSNPEYKNIPVIMLSSKDSVFDKARGKIAGSEHFMTKPFSRDELLKAIQVHATH >NC_009446|1130701:1145709|1145256_1145709_+|WP_012031406.1|DBSCAN-SWA MPAPILDIFVDGACKGNPGIGGWGVLMRYGQHEKVLMGAQWHTTNNRMELTAAIEALKAIKRPCPILISTDSVYVKNGITHWLPVWKKNNWRNASKKPIKNIELWQALDQLNQRYEIEWRWVKGHAGNPGNEIADELANRAIESLRQKTS >NC_009446|1130701:1145709|1139750_1140272_-|WP_012031399.1|DBSCAN-SWA MSTLANSSPFEILVGYVRQFQQRGTISVAAVQQHASYLACSAGAENYLIGMGSVLEVTQDLPTITPLPFCPFWLLGLTSHRGDVCSVTDFKYFVEPKAKYATQKKSHYIVLRDAGQGYILKVDSISGIRQVEVNALDSKRIWVDGHIFMEDKDWFRINIGHLINDAVFSQQMQ >NC_009446|1130701:1145709|1130701_1138366_-|WP_012031397.1|DBSCAN-SWA 
MSSELASQLLLRVQAXMAKNIADVKKHLVEWLDRPESGGVEAFCNQLTEVSGGLFLLNKKPAAELSDKVQKVIKLLDNKFKENAVDNKSFSDVGSEVAAGILLLSDYINRLNTPHADNTQEINAISQSLSSVFTDGWKKTPASTHPQIDLETYQVLAERVNELIEVNRNQIEQFAASGGQSLDVNQLITYNQELINLFTLLNLKAPQFLLKYSNQMLKGDLAKVNWVEIAQSMLLVEEALQNISKHSCDNAKVTDIKTAFLQQRSHSQYIGFYRLITNTGAVARQYFESLRKIVVGEERVSSEADWTEAARRTAYYGTVCQFFGQKSLSVEMRRLAHIFAIFSAQKEFKETVYPPVVHALVATEYIFTDLEDAATLVESDVDFLKNAVDQLAAIYPITQDAQRLLASSMQTVAPKKEKPAQEQQSFDDLLGDLDALDSFIDVGETTVAAVKITATVDFDSATPAPTPASASTNTSSAAVESGLGFDDIAPVAEKETAAPKASTEAVEDLGFGFDDITPVAEKETAAPKAATEAVEDLGFGFDDIAPVAEKETAVPKAATEAVEDLGFGFDDIAPVAEKEIAAPKAAAEAVEAVNFDDVLELDSAVETEIQEPVSQASGSVDVDFFEHTGLTDWEQLSVPESKVSFTESHESAVIDDEIREFFLEEFDEKIVELSDSYDIWTDPAQADESTGVIRRAFHTLKGSGRTVGYEALGECAWQHEQMLNRVLDKHYSVNRIVLHNVKEAICLLKKLRNEETFVEHHPALLTQALIAEKSTSYLQNENSKPSDAEVDAFLQKVYAECLPFAAAAEVDTTPATMPEVEAVDVRKADIPDAVVSNADKIAAEQKRKFLEELNENIRKMDDHLPRWRNAPDNLDYPGNIKQAFSAIGSSSQNLGYESLQELAWQNENLLNDILDNLYDANPIIQDSVEDSVNLLKTMRGETEFIENPQALLSQAKALNIVSQTMRQRFVSARISSSEKPAEPAPQHQSAYEQAAKTLMQGLAMGQLDTEETEVLANVVKSQFQSANIDANTIDESKLKQMVSDQLAGADQNVIDHLLSHIVTHTAEQKATNKIADTLRQMLRAGDVDDTDATLVADHLAKNLQNDKELNFNELMPNLGNQLHQLLASENGAEGFVDRIVQRIEQKLAIDSPAVDALAPSDLDSSLVSVTDDLGFGIDDAVETPAVDVNENEAVIVEAPSDATGLFDLGIDDLPVQNNVAADEPDVVVVENAVSDVDNSGEIGGDSVFLDTDLVSPDADAPQNFADETPEVQDVNDVLQDIYDDLKTEDDLTKKTSQQIIDAISVPELTKPAPEKKPSTHKKPNRNIEKEASNVRFSSFSPYIMPAPSSRLSEELSNITKVPSESASEGSVYSFSFDGARPPVRDASLFLRALDEFSGRFQSLEEDASEDKIEELLDSVYEIEDSLDLNDVPAWIWRLLDAMEQLLLQYRKVGGKGIRQSTPLLKQSLDLIEHYSDDSSEEDAKRTMNSLMSTRQWLSSEIPHSDYEMSKPTMIARSAADIPLDINDDGSIETPLIPDPNVNQEIAKLFHSEALHLLGDSQLEAEQWDENREELNHLDAVRRSMHALKTSATSAGYSAIADLAQGVELITSQILDGQLQGNARASSILAGAIWQSMEMLDSIRDGYLPTPNPYILNHINSFLGMPLPYPAVAENEKRIKERERKQAEDLLAAQFGDDDAAGENAQSDAASVESAVEAETDVSSEVVDEADAEEEVIQGGVPFDESIDPVLVEIFTDEASELIAQAEQLLGQDLKDHTVIEELKRNMHTLKGGARMVGFTVIGDTSHLMESVIEKLADYPDVKHEQAKTLIGLGHESLYNMLDSVLRHEMPEPATQVNCSLELFANQGRFEMPGAEPKAAAKKEKPVEEVAKKEPKAEKAPVKEKAPEVAEKTSDVAKKAELPAEKSSSTPDAEKPAAAEVPENNNKIGNDVQRFVRVDSALLDDMISMTGEYAIVRSRMENLTLASEFNLSELTRIATRI
DEQMRRLDNETEAQMLFRRESDTEDDEHFDPLEMDRFTEVQQLSRQLSEAIEDLKNVQETLTAENALMRNLSTQQGVLQRAIQDRLMATQLVRFDVNESRLRRLVRQTAKSLGKEVDLVLEGGGVELERRLVEDILPPLEHMIRNAVGHGIELPQQRVAAGKPASGTIKIKVTAQASNVQVQVIDDGRGFDYVRIREKARSKGWLDPERENDTQYLNTMLLRSGFSTAESLTQISGRGVGLDVVNEMIKQRRGRLSVDSIPGKGTEFVIVLPFSMSIVEVLLVDIGDQRYAAPMNAVAAISHVSKEELQRSFNGEVVYHEYDGMDYRLFVLASYFQSANYEFDMDTDTAPVLFINGVGEPVAFHVEHIANRLEIIVKNVNRQVLNLPGISGATILGDGRVVPVLDLLDLSTRIAGLSQLESVHVVEEVRARQVLVVDDSVTMRKVSTRLLERNDYVVQTAKDGLDAIEVLNEFMPDVILLDIEMPRMDGFEFAAHVRQNAMYNDVPIIMVTSRTGDKHRERADKIGVQGYLGKPYREDVLIETIESLIGGKSHG >NC_009446|1130701:1145709|1138425_1139736_-|WP_012031398.1|DBSCAN-SWA MKGSRRSQQDDNHQDKQNVFGWVAIGFGALSLILIVLIIIGLFTGTEEQPLSLGIADVNHAGLRFLLPILLLFVVLAAIFWYLFISSQSSEIQDIAIRQQMKERQNQDAILRLMDELTDLSNGDLTVEAQVTEDFTGAIADSVNYAIESMRELVGTINRTSSRVSDAANITRFIASDMQKSSIEQAEKIHSITQTIGNMVGSLDRVAASTTDSAEIARNSVTIAQEGAKRVNDTIGGMNNIRENIQETSKRIKRLGESSQEIGNIVEIIKGITDQTNILALNAAIQATSAGEAGRGFAVVADEIQRLAERSSNATRRIESLVKTIQTDTNEAIASMEKSTREVVTGANIAEEAGMSLTRIQEVSTSLALLIEKVSDSTRKVSSMAINISQDMSRINDMATTTSENVIKTSDSIANLSQLSSELNDSVSGFKLPEGY >NC_009446|1130701:1145709|1141152_1141542_-|WP_012031402.1|DBSCAN-SWA MVQWVLGIDVGTYHTGVAIGQSVTGLAKPLQRFNLPLKQVHADTFSALVQQWKVNCIVIGLPSLADGKKHPLSDEIHRLADECIKLYGVPVHFIDETLTSHAAQMRFPKTIDKDSAAAAVIVEDFLSSH >NC_009446|1130701:1145709|1141635_1142583_+|WP_012031403.1|DBSCAN-SWA MIHHRLIILGAGPAGYTAAIYAARAGLEPALITGLEPGGQLMTTTHVDNWPSAFEGILGSELMTNMCQHAQRFQTQIIYDHVTEVNLQHRPFTLKTEKETYSCDALIIATGARAKYLGLPSETQYLGYGVSACATCDGFFYRNKPVMVIGGGNTALEEALYLSNIASSVTLVHRRDAFRAEKIMVDRLMEKVNAGKIVVKYSAQLQEVLGDDNGVTGAIIRFNNGQIEQLAVDGIFIAIGHQPNTELFKNQLATDAHHYLSVHSGSNGNATQTSIDGVFAAGDVADPVYRQAITSAASGCMAALDAERYLATLKN >NC_009446|1130701:1145709|1140268_1140631_-|WP_012031400.1|DBSCAN-SWA MTTILIVDDSPTEANVVKSMLVEAGYEVLWAEAADKGIAVAMNRHPDLILMDVVMPGMSGFQATRKLSRTPETKDIPILMLTTKDQDSDRVWGMRNGAKKYMVKPPEKEALLREIRELLA >NC_009446|1130701:1145709|1144442_1145276_+|WP_012031405.1|DBSCAN-SWA 
MFYYYYRLLPRKLLSRLFYWLARIKIVWIKNVLIRGFCFVTKANTDFAAEKDPFAYPTLNAFFTRTLAADARPIDAAPESIISPVDGRCAYYHTIENGLMIQAKSQRYSLAALLNSYELAQAYESGTAITLYLAPDDYHRVHMPCDGHLVSMTFCPGDKHSVALDLLEKIPLLFAGNERLVCHFETELGKMSVIFVGALNVSSISTVWHGIVSDNGADNHYFYPEKPFFAKGAELGQFNLGSTVILCFQSQQIDWQNEKLNNRDKILMGEKIACTYS |
10 | Bacillus_phage(28.57%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation contains a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin', or 'tRNA').
|
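The keyword rule described above can be sketched in a few lines of Python. This is only an illustration: the keyword list is copied from the page text, but the function name `phage_keyword_hits`, the case-insensitive substring matching, and the sample annotation strings are assumptions, not the database's actual implementation.

```python
from typing import List

# Keywords quoted from the page text; the matching strategy below is an assumption.
PHAGE_KEYWORDS = (
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
)


def phage_keyword_hits(annotation: str) -> List[str]:
    """Return the phage-related keywords found in a protein annotation.

    Uses a case-insensitive substring match, so 'lysin' also matches
    'endolysin'. Hits are returned in the order of PHAGE_KEYWORDS.
    """
    text = annotation.lower()
    return [kw for kw in PHAGE_KEYWORDS if kw.lower() in text]


# Hypothetical annotation strings, for illustration only.
for annotation in ("phage tail fiber protein",
                   "helix-turn-helix domain-containing protein"):
    hits = phage_keyword_hits(annotation)
    print(annotation, "->", hits if hits else "no phage keyword")
```

A protein would be flagged (colored) whenever the returned list is non-empty. Substring matching is deliberately loose and can over-match (for example, 'plate' inside 'template'); a word-boundary match would be stricter.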
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|
NC_009446.1|WP_012031102.1|822195_822459_+|helix-turn-helix-domain-containing-protein |
822195_822459_+
Protein sequences of NC_009446.1|WP_012031102.1|822195_822459_+|helix-turn-helix-domain-containing-protein
>NC_009446.1|WP_012031102.1|822195_822459_+|helix-turn-helix-domain-containing-protein
MAHVETKKQARVTQKRLIAAHLKKHGSISSWEAIELYHCTRLGAYIYELRESGWDISTLRKTFTSSVTGNSGVYALYLLNESNELGE |
87 aa | NA |
HTH_3,HTH_36
|
NA | 790933-839041 |
yes
Self-targetings in the prophage
1. spacer 1.1|179037|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NC_009446 position: 822923-822954, mismatch: 0
tatcaaagaaccagtcaaggaaccatgagtcg CRISPR spacer
tatcaaagaaccagtcaaggaaccatgagtcg Protospacer
********************************
2. spacer 1.2|179097|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NC_009446 position: 791357-791388, mismatch: 0
attcgcaaacaaaacagcgaaatttgggcgag CRISPR spacer
attcgcaaacaaaacagcgaaatttgggcgag Protospacer
********************************
3. spacer 1.3|179157|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NC_009446 position: 824416-824447, mismatch: 0
tgtcgaactaaacgatgaccagatttggttaa CRISPR spacer
tgtcgaactaaacgatgaccagatttggttaa Protospacer
********************************
4. spacer 1.4|179217|32|NC_009446|PILER-CR,CRISPRCasFinder,CRT matches to NC_009446 position: 791606-791637, mismatch: 1
tatcgcagccacagcgtcgcgcaagtattagc CRISPR spacer
tatcgcagccacagcgccgcgcaagtattagc Protospacer
****************.***************
|
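The match lines in the self-targeting listing (`*` for an identical base, `.` for a mismatch) and the reported mismatch counts can be reproduced with a short sketch. It assumes a gap-free, position-by-position comparison of equal-length sequences; the page does not state how its alignments are computed, and the function names `match_line` and `count_mismatches` are illustrative.

```python
def match_line(spacer: str, protospacer: str) -> str:
    """Build a '*'/'.' line marking identical and mismatched positions.

    Assumes an ungapped comparison of two sequences of equal length.
    """
    if len(spacer) != len(protospacer):
        raise ValueError("spacer and protospacer must have the same length")
    return "".join(
        "*" if a == b else "."
        for a, b in zip(spacer.lower(), protospacer.lower())
    )


def count_mismatches(spacer: str, protospacer: str) -> int:
    """Count positions at which the two sequences differ."""
    return match_line(spacer, protospacer).count(".")


# Spacer 1.4 and its protospacer, copied from the listing above.
spacer = "tatcgcagccacagcgtcgcgcaagtattagc"
protospacer = "tatcgcagccacagcgccgcgcaagtattagc"

print(spacer, "CRISPR spacer")
print(protospacer, "Protospacer")
print(match_line(spacer, protospacer))
print("mismatch:", count_mismatches(spacer, protospacer))
```

For spacer 1.4 this gives a single `.` at position 17 (t in the spacer, c in the protospacer), in agreement with the `mismatch: 1` reported in the listing.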