Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP011395 | Salmonella enterica subsp. enterica serovar Enteritidis str. 18569 plasmid pCFSAN000006, complete sequence | 0 crisprs | cas14j | 0 | 0 | 0 | 0 |
NZ_CP011394 | Salmonella enterica subsp. enterica serovar Enteritidis str. 18569 chromosome, complete genome | 2 crisprs | PD-DExK,WYL,cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2,csa3,DEDDh,DinG | 0 | 12 | 8 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP011394_1 | 940996-941635 | TypeI-E |
I-E
Consensus repeat of NZ_CP011394_1
|
10 spacers
spacers of NZ_CP011394_1
>1.1|941023|34|NZ_CP011394|PILER-CR,CRT TGAGGATTTCGCCGCGCTGTTGGCCTCCAGATTC >1.2|941084|34|NZ_CP011394|PILER-CR,CRT TGCCGTGCCTGTCCAGGACAAATTGCCGATTATT >1.3|941145|34|NZ_CP011394|PILER-CR,CRT TGCTGCCGGTCTGTGCTGTTGTCGTCAATAATCA >1.4|941206|34|NZ_CP011394|PILER-CR,CRT CGGCATGTGACAGTCTGATTTTTATAGCGCATGA >1.5|941267|34|NZ_CP011394|PILER-CR,CRT CCAATTTAGGGGCCGGAACTCCGGGAAAGGCAGC >1.6|941328|35|NZ_CP011394|PILER-CR,CRT CGCAAACACCAAGCTCTTCCGCCGCGCGTTCCTGA >1.7|941390|34|NZ_CP011394|PILER-CR,CRT CGTTGCTCTCATTAAAGGGGTTTCCATGTTTGAT >1.8|941451|34|NZ_CP011394|PILER-CR,CRT CGCGGTCGCAGCCTGGCCTGTTGCCGTAGAATCG >1.9|941512|34|NZ_CP011394|PILER-CR,CRT CGAGTTACTGATGCAGACTGCGGATCTTAATCGG >1.10|941025|32|NZ_CP011394|CRISPRCasFinder AGGATTTCGCCGCGCTGTTGGCCTCCAGATTC >1.11|941086|32|NZ_CP011394|CRISPRCasFinder CCGTGCCTGTCCAGGACAAATTGCCGATTATT >1.12|941147|32|NZ_CP011394|CRISPRCasFinder CTGCCGGTCTGTGCTGTTGTCGTCAATAATCA >1.13|941208|32|NZ_CP011394|CRISPRCasFinder GCATGTGACAGTCTGATTTTTATAGCGCATGA >1.14|941269|32|NZ_CP011394|CRISPRCasFinder AATTTAGGGGCCGGAACTCCGGGAAAGGCAGC >1.15|941330|33|NZ_CP011394|CRISPRCasFinder CAAACACCAAGCTCTTCCGCCGCGCGTTCCTGA >1.16|941392|32|NZ_CP011394|CRISPRCasFinder TTGCTCTCATTAAAGGGGTTTCCATGTTTGAT >1.17|941453|32|NZ_CP011394|CRISPRCasFinder CGGTCGCAGCCTGGCCTGTTGCCGTAGAATCG >1.18|941514|32|NZ_CP011394|CRISPRCasFinder AGTTACTGATGCAGACTGCGGATCTTAATCGG >1.19|941575|32|NZ_CP011394|CRISPRCasFinder TGCGCCAACGACTGGAATTTTTGCGTGTAGCC >1.20|941573|34|NZ_CP011394|CRT CGTGCGCCAACGACTGGAATTTTTGCGTGTAGCC |
cas3,cas8e,cse2gr11,cas7 |
CRISPR arrays and Neighbor proteins around NZ_CP011394_1
The CRISPR arrays of NZ_CP011394_1 >merge|NZ_CP011394|1|940996-941635|PILER-CR,CRISPRCasFinder,CRT GTGTTCCCCGCGCCAGCGGGGATAAACTGAGGATTTCGCCGCGCTGTTGGCCTCCAGATTCGTGTTCCCCGCGCCAGCGGGGATAAACTGCCGTGCCTGTCCAGGACAAATTGCCGATTATTGTGTTCCCCGCGCCAGCGGGGATAAACTGCTGCCGGTCTGTGCTGTTGTCGTCAATAATCAGTGTTCCCCGCGCCAGCGGGGATAAACCGGCATGTGACAGTCTGATTTTTATAGCGCATGAGTGTTCCCCGCGCCAGCGGGGATAAACCCAATTTAGGGGCCGGAACTCCGGGAAAGGCAGCGTGTTCCCCGCGCCAGCGGGGATAAACCGCAAACACCAAGCTCTTCCGCCGCGCGTTCCTGAGTGTTCCCCGCGCCAGCGGGGATAAACCGTTGCTCTCATTAAAGGGGTTTCCATGTTTGATGTGTTCCCCGCGCCAGCGGGGATAAACCGCGGTCGCAGCCTGGCCTGTTGCCGTAGAATCGGTGTTCCCCTCGCCAGCGGGGATAAACCGAGTTACTGATGCAGACTGCGGATCTTAATCGGGTGTTCCCCGCGCCAGCGGGGATAAACCGTGCGCCAACGACTGGAATTTTTGCGTGTAGCCGTGTTCCCCGCGCCAACAAGGATAGCCGT >NZ_CP011394|1|1|940996-941572|PILER-CR GTGTTCCCCGCGCCAGCGGGGATAAAC TGAGGATTTCGCCGCGCTGTTGGCCTCCAGATTC GTGTTCCCCGCGCCAGCGGGGATAAAC TGCCGTGCCTGTCCAGGACAAATTGCCGATTATT GTGTTCCCCGCGCCAGCGGGGATAAAC TGCTGCCGGTCTGTGCTGTTGTCGTCAATAATCA GTGTTCCCCGCGCCAGCGGGGATAAAC CGGCATGTGACAGTCTGATTTTTATAGCGCATGA GTGTTCCCCGCGCCAGCGGGGATAAAC CCAATTTAGGGGCCGGAACTCCGGGAAAGGCAGC GTGTTCCCCGCGCCAGCGGGGATAAAC CGCAAACACCAAGCTCTTCCGCCGCGCGTTCCTGA GTGTTCCCCGCGCCAGCGGGGATAAAC CGTTGCTCTCATTAAAGGGGTTTCCATGTTTGAT GTGTTCCCCGCGCCAGCGGGGATAAAC CGCGGTCGCAGCCTGGCCTGTTGCCGTAGAATCG GTGTTCCCCTCGCCAGCGGGGATAAAC CGAGTTACTGATGCAGACTGCGGATCTTAATCGG GTGTTCCCCGCGCCAGCGGGGATAAAC >NZ_CP011394|1|1|940996-941635|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACTG AGGATTTCGCCGCGCTGTTGGCCTCCAGATTC GTGTTCCCCGCGCCAGCGGGGATAAACTG CCGTGCCTGTCCAGGACAAATTGCCGATTATT GTGTTCCCCGCGCCAGCGGGGATAAACTG CTGCCGGTCTGTGCTGTTGTCGTCAATAATCA GTGTTCCCCGCGCCAGCGGGGATAAACCG GCATGTGACAGTCTGATTTTTATAGCGCATGA GTGTTCCCCGCGCCAGCGGGGATAAACCC AATTTAGGGGCCGGAACTCCGGGAAAGGCAGC GTGTTCCCCGCGCCAGCGGGGATAAACCG CAAACACCAAGCTCTTCCGCCGCGCGTTCCTGA GTGTTCCCCGCGCCAGCGGGGATAAACCG TTGCTCTCATTAAAGGGGTTTCCATGTTTGAT GTGTTCCCCGCGCCAGCGGGGATAAACCG CGGTCGCAGCCTGGCCTGTTGCCGTAGAATCG GTGTTCCCCTCGCCAGCGGGGATAAACCG AGTTACTGATGCAGACTGCGGATCTTAATCGG GTGTTCCCCGCGCCAGCGGGGATAAACCG TGCGCCAACGACTGGAATTTTTGCGTGTAGCC GTGTTCCCCGCGCCAACAAGGATAGCCGT >NZ_CP011394|1|1|940996-941633|CRT GTGTTCCCCGCGCCAGCGGGGATAAAC TGAGGATTTCGCCGCGCTGTTGGCCTCCAGATTC GTGTTCCCCGCGCCAGCGGGGATAAAC TGCCGTGCCTGTCCAGGACAAATTGCCGATTATT GTGTTCCCCGCGCCAGCGGGGATAAAC TGCTGCCGGTCTGTGCTGTTGTCGTCAATAATCA GTGTTCCCCGCGCCAGCGGGGATAAAC CGGCATGTGACAGTCTGATTTTTATAGCGCATGA GTGTTCCCCGCGCCAGCGGGGATAAAC CCAATTTAGGGGCCGGAACTCCGGGAAAGGCAGC GTGTTCCCCGCGCCAGCGGGGATAAAC CGCAAACACCAAGCTCTTCCGCCGCGCGTTCCTGA GTGTTCCCCGCGCCAGCGGGGATAAAC CGTTGCTCTCATTAAAGGGGTTTCCATGTTTGAT GTGTTCCCCGCGCCAGCGGGGATAAAC CGCGGTCGCAGCCTGGCCTGTTGCCGTAGAATCG GTGTTCCCCTCGCCAGCGGGGATAAAC CGAGTTACTGATGCAGACTGCGGATCTTAATCGG GTGTTCCCCGCGCCAGCGGGGATAAAC CGTGCGCCAACGACTGGAATTTTTGCGTGTAGCC GTGTTCCCCGCGCCAACAAGGATAGCC
>NZ_CP011394.1|WP_001199961.1|940027_940699_+|7-carboxy-7-deazaguanine-synthase-QueE MQYPINEMFQTLQGEGYFTGVPAIFIRLQGCPVGCAWCDTKHTWDKLSDREVSLFSILAKTKESDKWGAASSEDLLAVINRQGYTARHVVITGGEPCIHDLMPLTDLLEKSGFSCQIETSGTHEVRCTPNTWVTVSPKVNMRGGYDVLSQALERANEIKHPVGRVRDIEALDELLATLSDDKPRVIALQPISQKEDATRLCIETCIARNWRLSMQTHKYLNIA >NZ_CP011394.1|WP_000036734.1|938593_939892_+|phosphopyruvate-hydratase MSKIVKVIGREIIDSRGNPTVEAEVHLEGGFVGMAAAPSGASTGSREALELRDGDKSRFLGKGVTKAVGAVNGPIAQAILGKDAKDQAGIDKIMIDLDGTENKSNFGANAILAVSLANAKAAAAAKGMPLYEHIAELNGTPGKYSMPVPMMNIINGGEHADNNVDIQEFMIQPVGAKTVKEAIRMGSEVFHHLAKVLKGKGMNTAVGDEGGYAPNLGSNAEALAVIAEAVKAAGYELGKDITLAMDCAASEFYKDGKYVLAGEGNKAFTSEEFTHFLEELTKQYPIVSIEDGLDESDWDGFAYQTKVLGDKIQLVGDDLFVTNTKILKEGIEKGIANSILIKFNQIGSLTETLAAIKMAKDAGYTAVISHRSGETEDATIADLAVGTAAGQIKTGSMSRSDRVAKYNQLIRIEEALGEKAPYNGRKEIKGQA >NZ_CP011394.1|WP_000210863.1|936873_938511_+|CTP-synthase-(glutamine-hydrolyzing) MTTNYIFVTGGVVSSLGKGIAAASLAAILEARGLNVTIMKLDPYINVDPGTMSPIQHGEVFVTEDGAETDLDLGHYERFIRTKMSRRNNFTTGRIYSDVLRKERRGDYLGATVQVIPHITNAIKERVLEGGEGHDVVLVEIGGTVGDIESLPFLEAIRQLAVDIGREHALFMHLTLVPYLAAAGEVKTKPTQHSVKELLSIGIQPDILICRSDRAVPANERAKIALFCNVPEKAVISMKDVDSIYKIPGLLKSQGLDDYICKRFSLNCPEANLSEWEQVIYEEANPAGEVTIGMVGKYIELPDAYKSVIEALKHGGLKNRVTVNIKLIDSQDVETRGVEILKDLDAILIPGGFGYRGVEGKIATARYARENNIPYLGICLGMQVALIEFARNVAGMDNANSTEFVPDCKYPVVALITEWRDEDGNVEVRSEKSDLGGTMRLGAQQCQLSDDSLVRQLYGASTIVERHRHRYEVNNMLLKQIEAAGLRVAGRSGDDQLVEIIEVPNHPWFVACQFHPEFTSTPRDGHPLFAGFVKAANEHQKRQAK >NZ_CP011394.1|WP_000210451.1|935845_936646_+|nucleoside-triphosphate-pyrophosphohydrolase MTTNHQIDRLLTLMQRLRDPENGCPWDKEQTFASIAPYTLEETYEVLDAIAREDFDDLRGELGDLLFQVVFYAQMAQEEGRFDFNDICAAISDKLERRHPHVFGELSADNSEEALVRWEQIKTEERAQKAQHSALDDIPRSLPALMRAQKIQKRCSNVGFDWTTLGPVVDKVYEEIDEVMFEARQAVVDQAKLEEEMGDLLFATVNMARHLGTKAELALQKANDKFERRFREVERIVAARGLEMTGVDLETMEEVWQEVKRQEIDL >NZ_CP011394.1|WP_000842512.1|934650_935238_-|fimbrial-protein MKSSHFCKLAVTASLVMGIVSGAQAAGSNTAKVTFLGNIVDSPCSVTLDTEDQTVNMGSSIGNGTLSNGKTTINNARTFHIDLEGCTWATEKNMNVVFTTGSGTTAATGATDNLALMKTDGTGAISNVSLAIGDAGKNNIKLGDTYTQAIADLDGDTILDEKQSLNFTAWLVGAATGTVGTGEFSSAANVTISYL >NZ_CP011394.1|WP_000981797.1|931871_934571_-|fimbrial-biogenesis-outer-membrane-usher-protein MMNNTWKSVLCPIACGVGMLLSLSPYSASGKDIEFNTDFLDVKNRDNVNIAQFSRKGFILPGVYLLQIKINGQTLPQEFPVNWVIPEHDPQGSEVCAEPELVTQLGIKPELAEKLVWITHGERQCLAPDSLKGMDFQADLGHSTLLVNLPQAYMEYSDVDWDPPARWDNGIPGIILDYNINNQLRHDQESGSEEQSISGNGTLGANLGAWRLRADWQASYDHRDDDENTSTLHDQSWSRYYAYRALPTLGAKLTLGESYLQSDVFDSFNYIGASVVSDDQMLPPKLRGYAPEIVGIARSNAKVKVSWQGRVLYETQVPAGPFRIQDLNQSVSGTLHVTVEEQNGQTQEFDVNTASVPFLTRPGMVRYKMALGRPQDWDHHPITGTFASAEASWGVTNGWSLYGGAIGESNYQAVALGSGKDLGVVGAVAVDITHSIAHMPQDDGFDGETLQGNSYRISYSRDFDEIDSRLTFAGYRFSEKNFMSMSDYLDAKTYHHLNAGHEKERYTVTYNQNFREQGMSAYFSYSRSTFWDSPDQSNYNLSLSWYFDLGSIKNLSASLNGYRSEYNGDKDDGVYISLSVPWGNDSISYNGTFNGSQHRNQLGYSGHSQNGDNWQLHVGQDEQGAQADGYYSHQGALTDIDLSADYEEGSYRSLGMSLRGGMTLTTQGGALHRGSLAGSTRLLVDTDGIADVPVSGNGSPTSTNIFGKAVIADVGSYSRSLARIDLNKLPEKAEATKSVVQITLTEGAIGYRHFDVVSGEKMMAVFRLADGDFPPFGAEVKNERQQQLGLVADDGNAWLAGVKAGETLKVFWDGAAQCEASLPSTFTPELLANALLLPCKMLEGQPPTAPQKSSPLPAQPLIQEHTQTDGQPAAPVATTTQTPPIPLADNHAVNRKDME >NZ_CP011394.1|WP_001044459.1|931085_931859_-|fimbria/pilus-periplasmic-chaperone MNKTNHFKRQALIASVLLAAPLVSHSAIVPDRTRVIFNGNENSITVTLKNGNATLPYLAQAWLEDDKFAKDTRYFTALPPLQRIEPKSDGQVKVQPLPAAASLPQDRESLFYFNVREIPPKSDKPNTLQLALQTRIKFFYRPVAVARQVDKTHPWQTKLTLTYQGDGVIFDNPTPFYLVISNAGSKENETASGFKNLLIAPREKVTSPIKGASLGSSPVVGYVDDYGGHRLLVFTCSGNTCKVNEEKTRDAEKKANK >NZ_CP011394.1|WP_000178270.1|930559_931066_-|fimbrial-protein MTMLTRWKMLVLLCGGFVTGTEAAGTKTVQLELHLVVTQPPPCTVGGASVEFGDVLTTKVGDASQTKPVGYSLNCDGRASDYLKLQIQGTTTTISGEQVLQTSVQGLGIRIQQAGNKQLVPVGITDWLNFTLSGSNGPELEAVPVKEPTTQLAGGDFNASATLVVDYQ >NZ_CP011394.1|WP_000832393.1|930074_930545_-|fimbrial-protein MKRVLILTLLITQFACADNLTFHGKLINPPACTINNGEMLEVSFGSVIIDNIDGVNYLTEIPWTLTCDSSFRDDALTFTLSYLGTATPYSAKALTTSVPELGIELQQNGTVFPPGTSLTINESSLPTLKAVPVKQPGKEPAEGDFEAFATLQVDYQ >NZ_CP011394.1|WP_001079646.1|929541_930078_-|fimbrial-protein MNRIFQTAGHLIGGVMLWAVCNTLPAATPNVHYSGKLVAGACNLVVDNDTMATVDFHTIGSDNFDASGQTTPVPFTLSLQDCKTALANGVLVTFQGVEDSTLPGLLALEPSSEASGFAIGVETAAQQPVSINATVGTAFVLKEGITTINLQARLQKYAGEEVMPGEFSGSATVSFEYQ >NZ_CP011394.1|WP_001207998.1|941733_942531_+|MBL-fold-metallo-hydrolase MALRIRVLLENHKGAGADKSLKARPGLSLLVEDESTSILFDTGPDGSFMQNALAMGIDLSDVSAVVLSHGHYDHCGGVPWLPDNSRIICHPDIARERYAAMTFLGITRKIKKLSCEVDYSRYRMMYTRDPLPIGKNFIWSGEIPVVAPEAYGIFGGHDAEPDSILDEGVLIYQSTKGLVIITGCGHRGIANIVRHCQNITGIKRIYALVGGFHLRCASPFTLWRVRRFLQEQKPEKLCGCHCTGAWGRLWLPEITAPATGDVLRF >NZ_CP011394.1|WP_000108313.1|942618_942981_-|6-carboxytetrahydropterin-synthase-QueD MSTTLYKDFTFEAAHRLPHVPEGHKCGRLHGHSFMVRLEITGEVDPHTGWIMDFADLKAAFKPTYDRLDHYYLNDIPGLSNPTSEVLAKWIWDQVKPVVPLLSAVMVKETCTAGCVYRGE >NZ_CP011394.1|WP_000210932.1|943404_945204_+|NADPH-dependent-assimilatory-sulfite-reductase-flavoprotein-subunit MTTPAPLTGLLPLNPEQLARLQAATTDLTPEQLAWVSGYFWGVLNPRSGVVAVTPVPERKMPGVTLISASQTGNARRVAEALRDDLLAANLNVTLVNAGDYKFKQIASEKLLVIVTSTQGEGEPPEEAVALHKFLFSKKAPKLENTAFAVFSLGDTSYEFFCQSGKDFDSKLAELGGERLLDRVDADVEYQAAASEWRARVVDVLKSRAPVAAPSQSVATGAVNDIHTSPYTKDAPLIATLSVNQKITGRNSEKDVRHIEIDLGDSGLRYQPGDALGVWYQNDPALVKELVELLWLKGDEPVTVDGKTLPLAEALEWHFELTVNTANIVENYATLTRSESLLPLVGDKAQLQHYAATTPIVDMVRFSPAQLDAEALIGLLRPLTPRLYSIASAQAEVESEVHVTVGVVRYDIEGRARAGGASSFLADRVEEEGEVRVFIEHNDNFRLPANPQTPVIMIGPGTGIAPFRAFMQQRAADGAEGKNWLFFGNPHFTEDFLYQVEWQRYVKEGVLSRIDLAWSRDQKEKIYVQDKLREQGAELWCWINDGAHIYVCGDARRMAADVEKALLEVIAEFGGMDLESADEYLSELRVERRYQRDVY >NZ_CP011394.1|WP_001290670.1|945203_946916_+|assimilatory-sulfite-reductase-(NADPH)-hemoprotein-subunit MSEKHPGPLVVEGKLSDAERMKLESNYLRGTIAEDLNDGLTGGFKGDNFLLIRFHGMYQQDDRDIRAERAEQKLEPRHAMLLRCRLPGGVITTTQWQAIDKFAADNTIYGSIRLTNRQTFQFHGILKKNVKPVHQMLHSVGLDALATANDMNRNVLCTSNPYESQLHAEAYEWAKKISEHLLPRTRAYAEIWLDQEKVAITDEEPILGQTYLPRKFKTTVVIPPQNDIDLHANDMNFVAIAENGKLVGFNLLVGGGLSIEHGNKKTYARTASEFGYLPLEHTLAVAEAVVTTQRDWGNRTDRKNAKTKYTLERVGLETFKAEVERRAGIKFEPIRPYEFTGRGDRIGWVKGIDNNWHLTLFIENGRILDYPGRPLKTGLLEIAKIHQGEFRITANQNLIIASVPESQKAKIETLARDHGLMNAVSAQRENSMACVSFPTCPLAMAEAERFLPSFTDKVEAILEKHGIPDEHIVMRVTGCPNGCGRAMLAEIGLVGKAPGRYNLHLGGNRIGTRIPRMYQENITEPDILASLDELIGRWAKEREAGEGFGDFTVRAGIIRPVLDPARDFWE >NZ_CP011394.1|WP_000039870.1|947017_947752_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRVLALAETNAQLETLTAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDELTDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >NZ_CP011394.1|WP_001145541.1|947839_948793_-|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDSLTEDEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHTYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT >NZ_CP011394.1|WP_000029737.1|949236_951900_+|CRISPR-associated-helicase/endonuclease-Cas3 MSIYHYWGKSRRGETDGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYMLWNSWLSECPELFPPSSLSVRKSKRVMALWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPYSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERCVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDSVVTPYASGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >NZ_CP011394.1|WP_000368579.1|951911_953468_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPIWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIRGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFTNAKDARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRCYFDDHVFTNPYESSDLERIMKARKKYFTSSAEKQSAKAAKAKKQEAAE >NZ_CP011394.1|WP_000117945.1|953464_954019_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MSVVTKDDKATLRQWHDELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDTDNEQD >NZ_CP011394.1|WP_000206417.1|954032_955091_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPATEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIIEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICIDKDLLVKNLNGNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNVAVKRITALHENMNEVYAQETAFKNFNVMNQQGSMKDVLDFICA |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP011394_2 | 957787-958303 | TypeI-E |
I-E
Consensus repeat of NZ_CP011394_2
|
8 spacers
spacers of NZ_CP011394_2
>2.1|957816|32|NZ_CP011394|CRISPRCasFinder,CRT ACGTTGGCTGAAAACGGTTTTTCGGTCCGCCT >2.2|957877|32|NZ_CP011394|CRISPRCasFinder,CRT AATATATGGCGCTCACGCGCATGAGCATTCTC >2.3|957938|32|NZ_CP011394|CRISPRCasFinder,CRT CAGGGCAAATTCATCCGCCGCTGACCACTGGT >2.4|957999|32|NZ_CP011394|CRISPRCasFinder,CRT GACGCTTACATCTCACCGAGAGATTTTGAGGC >2.5|958060|32|NZ_CP011394|CRISPRCasFinder,CRT GGAACTGGTTTAGCTATCGCTGCCGGGGCTAT >2.6|958121|32|NZ_CP011394|CRISPRCasFinder,CRT GATTGCTCAGATTGGGAATTTGACCAGCGGCC >2.7|958182|32|NZ_CP011394|CRISPRCasFinder,CRT TCACGAGGGCCCCCTTATTGGGTCGGGCAGGT >2.8|958243|32|NZ_CP011394|CRISPRCasFinder,CRT GTTGGGTTGCATAGATGACACGCTTATAAATA |
cas2,cas1,cas6e,cas5,cas7,cse2gr11,cas8e,cas3 |
CRISPR arrays and Neighbor proteins around NZ_CP011394_2
The CRISPR arrays of NZ_CP011394_2 >merge|NZ_CP011394|2|957787-958303|CRISPRCasFinder,CRT GTGTTCCCCGCGCCAGCGGGGATAAACCGACGTTGGCTGAAAACGGTTTTTCGGTCCGCCTGTGTTCCCCGCGCCAGCGGGGATAAACCGAATATATGGCGCTCACGCGCATGAGCATTCTCGTGTTCCCCGCGCCAGCGGGGATAAACCGCAGGGCAAATTCATCCGCCGCTGACCACTGGTGTGTTCCCCGCGCCAGCGGGGATAAACCGGACGCTTACATCTCACCGAGAGATTTTGAGGCGTGTTCCCCGCGCCAGCGGGGATAAACCGGGAACTGGTTTAGCTATCGCTGCCGGGGCTATGTGTTCCCCGCGCCAGCGGGGATAAACCGGATTGCTCAGATTGGGAATTTGACCAGCGGCCGTGTTCCCCGCGCCAGCGGGGATAAACCGTCACGAGGGCCCCCTTATTGGGTCGGGCAGGTGTGTTCCCCGCGCCAGCGGGGATAAACCGGTTGGGTTGCATAGATGACACGCTTATAAATAGTGTTCCCCGCGTCAGCGGGGATAAACAC >NZ_CP011394|2|2|957787-958303|CRISPRCasFinder GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGTTGGCTGAAAACGGTTTTTCGGTCCGCCT GTGTTCCCCGCGCCAGCGGGGATAAACCG AATATATGGCGCTCACGCGCATGAGCATTCTC GTGTTCCCCGCGCCAGCGGGGATAAACCG CAGGGCAAATTCATCCGCCGCTGACCACTGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GACGCTTACATCTCACCGAGAGATTTTGAGGC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGAACTGGTTTAGCTATCGCTGCCGGGGCTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GATTGCTCAGATTGGGAATTTGACCAGCGGCC GTGTTCCCCGCGCCAGCGGGGATAAACCG TCACGAGGGCCCCCTTATTGGGTCGGGCAGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTGGGTTGCATAGATGACACGCTTATAAATA GTGTTCCCCGCGTCAGCGGGGATAAACAC >NZ_CP011394|2|2|957787-958303|CRT GTGTTCCCCGCGCCAGCGGGGATAAACCG ACGTTGGCTGAAAACGGTTTTTCGGTCCGCCT GTGTTCCCCGCGCCAGCGGGGATAAACCG AATATATGGCGCTCACGCGCATGAGCATTCTC GTGTTCCCCGCGCCAGCGGGGATAAACCG CAGGGCAAATTCATCCGCCGCTGACCACTGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GACGCTTACATCTCACCGAGAGATTTTGAGGC GTGTTCCCCGCGCCAGCGGGGATAAACCG GGAACTGGTTTAGCTATCGCTGCCGGGGCTAT GTGTTCCCCGCGCCAGCGGGGATAAACCG GATTGCTCAGATTGGGAATTTGACCAGCGGCC GTGTTCCCCGCGCCAGCGGGGATAAACCG TCACGAGGGCCCCCTTATTGGGTCGGGCAGGT GTGTTCCCCGCGCCAGCGGGGATAAACCG GTTGGGTTGCATAGATGACACGCTTATAAATA GTGTTCCCCGCGTCAGCGGGGATAAACAC
>NZ_CP011394.1|WP_001518648.1|957396_957690_+|type-I-E-CRISPR-associated-endoribonuclease-Cas2 MSMVVVVTENVPPRLRGRLAVWLLEVRAGVYVGDTSKRIREMIWQQITQLGGVGNVVMAWATNTESGFEFQTWGENRRIPVDLDGLRLVSFLPVENQ >NZ_CP011394.1|WP_000144830.1|956476_957397_+|type-I-E-CRISPR-associated-endonuclease-Cas1 MTFVPLNPIPLKDRTSMIFLQYGQIDVLDGAFVLIDKTGIRTHIPVGSVACIMLEPGTRVSHAAVHLASTVGTLLVWVGEAGVRVYSSGQPGGARADKLLYQAKLALDDDLRLKVVRKMYELRFREPPPARRSVEQLRGIEGSRVRATYALLAKQYGVKWHGRNYDPKDWEKGDVVNRCISAATSCLYGISEAAILAAGYAPAIGFIHSGKPLSFVYDIADIIKFESVVPKAFEIAARHPAEPDKEVRLACRDIFRSSKLTGKLIPLIEEVLAAGEIEPPQPAPDMLPPAIPEPESLGDSGHRGHG >NZ_CP011394.1|WP_000281483.1|955829_956480_+|type-I-E-CRISPR-associated-protein-Cas6/Cse3/CasE MYLSRITLHTSELSPAQLLHLVERGEYVMHQWLWDLFPGGKERQFLYRREELQGAFRFFVLSQEQPAASTIFDVQTRPFAPMLSAGQTLRFNLRANPTICKNGKRHDLLMEAKRQRKTQGDSQDIWSYQQQAALEWLARQGEQNGFTLREASVDAYRQQQIRREKSRQMIQFSSVDYTGVLVINEPALFLQRLAQGYGKSRAFGCGMMMIKPGDDA >NZ_CP011394.1|WP_000085115.1|955101_955848_+|type-I-E-CRISPR-associated-protein-Cas5/CasD MSQYLVFQLHGPMASWGVDAPGEVRHSHELPSRSALLGLLAAALGIRRDEEERLNTFNRHYQFLLCASGNPRWARDYHTVQMPKEVRKARYFSRREELQDPELLSALISRRDYYTDAWWMIAVSATPDAPYTLAQLQAALQHPVFPLYLGRKSHPLALPLAPQLLEGNAADVLREAYRWYQDQFNALKLTLPGLQNECWWEGEHDGLTANKILRRRDMPLSRQQWLFGERSVNQGPWLRKEDACISQE >NZ_CP011394.1|WP_000206417.1|954032_955091_+|type-I-E-CRISPR-associated-protein-Cas7/Cse4/CasC MTTFIQLHLLTAYPAANLNRDDTGAPKTVVLGGATRLRISSQSLKRAWRTSELFEQALAGHIGIRTGRIAREAAQILVDSGIDAKKAVEYVKNIANCFGKVKEDKKPKDELTNAETEQLVHISPAEFEAVKALARRLAEEKRPATEEEAELLRHDRMAVDIAMFGRMLAKKTDFNVEAACQVAHAFGVSETIIEDDFFTAVDDLRQASAEDAGAGHLGETGFGSALFYTYICIDKDLLVKNLNGNEELANKTLRAFTEAALKVSPTGKQNSFASRAYASWALAEKGTDQPRSLAAAFYEPINGTDQLNVAVKRITALHENMNEVYAQETAFKNFNVMNQQGSMKDVLDFICA >NZ_CP011394.1|WP_000117945.1|953464_954019_+|type-I-E-CRISPR-associated-protein-Cse2/CasB MSVVTKDDKATLRQWHDELQEKRGLRASLRRSKTVNDACLAEGLHSLLMQTHSLWKNKAPWNVTALAITAALAAHIKFIDEQKSFAAQLGQKKGGDTPVMSKLRFSHLLAVKTPDELLRQLRRAVKLLDGSVNLFSLADDIFCWCQEQNDLLNHHRRQQRPTEFLRIRWALEYYQAGDTDNEQD >NZ_CP011394.1|WP_000368579.1|951911_953468_+|type-I-E-CRISPR-associated-protein-Cse1/CasA MDNFSLLTTPWLPVRFKDGSTGKLAPVDLADENVVDIAATRADLQGAAWQFLLGLLQCSIAPKRYKNWEDIWFDGLHADVLHKALAPLEHAFQFGAESPSFMQDFEPLSGEKVSIASLLPEIPGAQTTKFNKDHFVKRGVTERFCPHCAALALFSLQLNAPAGGKGYRTGLRGGGPLTTLVELQEYQGERQTPIWRKLWLNVMPQDTADLPLPDQCDATVFPWLAATRTSEQANAVTTPEQVNKLQAYWGMPRRIRLDFATLQSGCCDICGAESDELLGFMTVKNYGVNYDGWRHPLTPYRAPVKDQNAFFSVKPQPGGLIWRDWLGLSQNNQTEANYESPAQVVKVFNARSLTDVKAGIRGFGADFDNMKIRCWYEHHFPLLMTEGLIPDLRKAVQTAARLLSLLRSALKEAWFTNAKDARGDFSFIDIDFWNLTQGRFLNLIHDLENGHKPDERLNKWQRELWLFTRCYFDDHVFTNPYESSDLERIMKARKKYFTSSAEKQSAKAAKAKKQEAAE >NZ_CP011394.1|WP_000029737.1|949236_951900_+|CRISPR-associated-helicase/endonuclease-Cas3 MSIYHYWGKSRRGETDGGDDYHLLCWHSLDVAAVGYWMVINNIYFIDHYLKKLGIQDKEQAAQFFAWILCWHDIGKFAHSFQQLYRHEALNIFNEPTRHYEKIAHTTLGYMLWNSWLSECPELFPPSSLSVRKSKRVMALWMPVTTGHHGRPPEAIQELDHFRQQDKDAARDFLLRIKALFPLITLPEAWDEDEGIDQFQQLSWFISAAVVLADWTGSASRYFPRTAEKMPVDTYWQQALAKAQTAITLFPSAANVSAFTGIETLFPFIQHPTPLQQKALELDINVDGAQLFILEDVTGAGKTEAALILAHRLMAAGKAQGLYFGLPTMATANAMFERMANTWLALYQPDSRPSLILAHSARRLMDRFNQSIWSVTLSGTEEPDEAQPYSQGCAAWFADSNKKALLAEVGVGTLDQAMMAVMPFKHNNLRLLGLSNKILLADEIHACDAWMSRILEGLIERQASNGNATILLSATLSQQQRDKLVAAFSRGVRRSVQAPLLGHDDYPWLTQVTQTELISQRVDTRKEVERCVDIGWLHSEEACLERIGEAVEKGNCIAWIRNSVDDAIRIYRQLQLSKVVATENLLLFHSRFAFHDRQRIESQTLNLFGKQSGAQRAGKVIIATQVIEQSLDIDCDEMISDLAPVDLLIQRAGRLQRHIRDRNGLVKKSGQDERETPVLRILAPEWDDAPRENWLSSAMRNSAYVYPDHGRMWLTQRILREQGTIRMPQSARLLIESVYGEDVNMPVGFAKTEQLQEGKFYCDRAFAGQMLLNFAPGYCAEISDSLPEKMSTRLAEESVTLWLAKIVDSVVTPYASGEHAWEMSVLRVRQSWWNKHKDEFEKLDGEPLRKWCAQQHQDKDFATVIVVTDFAACGYSANEGLIGMMGE >NZ_CP011394.1|WP_001145541.1|947839_948793_-|SPI-1-type-III-secretion-system-effector-SopD MPVTLSFGNHQNYTLNESRLAHLLSADKEKAIHMGGWDKVQDHFRAEKKDHALEVLHSIIHGQGRGEPGEMEVNVEDINKIYAFKRLQHLACPAHQDLFTIKMDASQTQFLLMVGDTVISQSNIKDILNISDDAVIESMSREERQLFLQICEVIGAKMTWHPELLQESISTLRKEVTGNAQIKAAVYEMMRPAEAPDHPLVEWQDSLTEDEKSMLACINAGNFEPTTQFCKIGYQEVQGEVAFSMMHPCISYLLHTYSPFAEFKPTNSGFLKKLNQDYNDYHAKKMFIDVILEKLYLTHERSLHIGKDGCSRNILLT >NZ_CP011394.1|WP_000039870.1|947017_947752_+|phosphoadenosine-phosphosulfate-reductase MSKLDLNALNELPKVDRVLALAETNAQLETLTAEERVAWALENLPGEYVLSSSFGIQAAVSLHLVNQIRPDIPVILTDTGYLFPETYQFIDELTDKLKLNLKVYRAGESPAWQEARYGKLWEQGVEGIEKYNEINKVEPMNRALKELKAQTWFAGLRREQSGSRAHLPVLAIQRGVFKVLPIIDWDNRTVYQYLQKHGLKYHPLWDQGYLSVGDTHTTRKWEPGMAEEETRFFGLKRECGLHEG >NZ_CP011394.1|WP_000490481.1|958317_959364_-|aminopeptidase MFSATRRFAVILALGVGFILPAQAASPGPGEIANTQARHIATFFPGRMTGSPAEMLSADYLRQQFTQMGYQSDIRTFNSRFIYTTKDNRKNWHNVTGSTVIAAHEGRVPQQIIIMAHLDTYAPQSDADVDANLGGLTLQGMDDNAAGLGVMLELAARLKDIPTHYGIRFIATSGEEEGKLGAENLLKRMSDAEKKNTLLVINLDNLIVGDKLYFNSGKNTPEAVRTLTRDRALAIARRYGIAANTNPGRNPSYPKGTGCCNDAEVFDKAGISVLSVEATNWNLGKKDGYQQRVKNASFPNGNSWHDVRLDNQQHIDKALPGRIERRSRDVVRIMLPLVKELAKAEKTS >NZ_CP011394.1|WP_000372384.1|959614_960523_+|sulfate-adenylyltransferase-subunit-CysD MDQKRLTHLRQLEAESIHIIREVAAEFANPVMLYSIGKDSSVMLHLARKAFYPGTLPFPLLHVDTGWKFREMYAFRDRTANAYGCELLVHKNPEGVAMGINPFVHGSAKHTDIMKTEGLKQALNKYGFDAAFGGARRDEEKSRAKERIYSFRDRFHRWDPKNQRPELWRNYNGQINKGESIRVFPLSNWTEQDIWQYIWLENIDIVPLYLAAERPVLERDGMLMMVDDDRIDLQPGEVIKKRMVRFRTLGCWPLTGAVESHAQTLPEIIEEMLVSTTSERQGRMIDRDQAGSMELKKRQGYF >NZ_CP011394.1|WP_001092251.1|960532_961972_+|sulfate-adenylyltransferase-subunit-CysN MNTILAQQIANEGGVEAWMIAQQHKSLLRFLTCGSVDDGKSTLIGRLLHDTLQIYEDQLSSLHNDSKRHGTQGEKLDLALLVDGLQAEREQGITIDVAYRYFSTEKRKFIIADTPGHEQYTRNMATGASTCDLAILLIDARKGVLDQTRRHSFISTLLGIKHLVVAINKMDLVDYCEETFARIREDYLTFAEQLPGDLDIRFVPLSALEGDNVAAQSANMRWYSGPTLLEVLETVDIQRAVDRQPMRFPVQYVNRPNLDFRGYAGTLASGSVKVGERIKVLPSGVESSVARIVTFDGDKEEACAGEAITLVLNDDIDISRGDLLLAANETLAPARHAAIDVVWMAEQPLAPGQSYDVKLAGKKTRARIEAIRYQIDINNLTQRDVESLPLNGIGLVEMTFDEPLALDIYQQNPVTGGLIFIDRLSNVTVGAGMVRELDERGATPPVEYSAFELELNALVRRHFPHWDARDLLGDKHGAA >NZ_CP011394.1|WP_001173663.1|961958_962564_+|adenylyl-sulfate-kinase MALHDENVVWHSHPVTVAAREQLHGHRGVVLWFTGLSGSGKSTVAGALEEALHQRGVSTYLLDGDNVRHGLCRDLGFSDADRQENIRRVGEVASLMADAGLIVLTAFISPHRAERQLVKERVGHDRFIEIYVNTPLAICEQRDPKGLYKKARAGELRNFTGIDAIYEAPDSPQVHLNGEQLVTNLVSQLLDLLRRRDIIRS >NZ_CP011394.1|WP_001537530.1|962614_962938_+|DUF3561-family-protein MRNSHNITFTRSDAFMVDDDATSAFPGAVVGFVSWLLALGIPFLLYGPNTLFFFLYTWPFFLALMPVSVIIGIALHLLVKGKILFSIMFTLLAVGALFGALFIWLLG >NZ_CP011394.1|WP_000517480.1|963128_963440_+|cell-division-protein-FtsB MGKLTLLLLALLVWLQYSLWFGKNGIHDYSRVNDDVVAQQATNAKLKARNDQLFAEIDDLNGGQEAIEERARNELSMTKPGETFYRLVPDASKRAATAGQTHR >NZ_CP011394.1|WP_000741653.1|963458_964169_+|2-C-methyl-D-erythritol-4-phosphate-cytidylyltransferase MAATLLDVCAVVPAAGFGRRMQTECPKQYLSIGNKTILEHSVHALLAHPRVTRVVIAISPGDHRFAQLPLANHPQITVVDGGNERADSVLAGLQAVAKAQWVLVHDAARPCLHQDDLARLLTISENSRVGGILASPVRDTMKRGEPGKNAIAHTVERADLWHALTPQFFPRELLHDCLTRALNEGATITDEASALEYCGFHPALVEGRADNIKVTRPEDLALAEFYLTRTIHQEKA >NZ_CP011394.1|WP_001219253.1|964168_964648_+|2-C-methyl-D-erythritol-2,4-cyclodiphosphate-synthase MRIGHGFDVHAFGGEGPIIIGGVRISYEKGLLAHSDGDVALHALTDALLGAAALGDIGKLFPDTDPAFKGADSRELLREAWRRIQAKGYTLGNVDVTIIAQAPKMLPHIPQMRVFIAEDLGCHMDDVNVKATTTEKLGFTGRGEGIACEAVALLMKAAK >NZ_CP011394.1|WP_000134246.1|964644_965694_+|tRNA-pseudouridine(13)-synthase-TruD MTEFDNLTWLHGKPQGSGLLKANPEDFVVVEDLGFTPDGEGEHILLRILKNGCNTRFVADALAKFLKIHAREVSFAGQKDKHAVTEQWLCARVPGKEMPDFSAFQLEGCKVLEYARHKRKLRLGALKGNAFTLVLREISDRRDVETRLQAIRDGGVPNYFGAQRFGIGGSNLQGALRWAQSNAPVRDRNKRSFWLSAARSALFNQIVHQRLKKPDFNQVVDGDALQLAGRGSWFVATSEELPELQRRVDEKELMITASLPGSGEWGTQRAALAFEQDAIAQETVLQSLLLREKVEASRRAMLLYPQQLSWNWWDDVTVELRFWLPAGSFATSVVRELINTMGDYAHIAE >NZ_CP011394.1|WP_001221538.1|965674_966436_+|5'/3'-nucleotidase-SurE MRILLSNDDGVHAPGIQTLAKALREFADVQVVAPDRNRSGASNSLTLESSLRTFTFDNGDIAVQMGTPTDCVYLGVNALMRPRPDIVVSGINAGPNLGDDVIYSGTVAAAMEGRHLGFPALAVSLNGYQHYDTAAAVTCALLRGLSREPLRTGRILNVNVPDLPLAQVKGIRVTRCGSRHPADKVIPQEDPRGNTLYWIGPPGDKYDAGPDTDFAAVDEGYVSVTPLHVDLTAHSAHDVVSDWLDSVGVGTQW |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP011394_1 | 1.15|941330|33|NZ_CP011394|CRISPRCasFinder | 941330-941362 | 33 | NZ_CP032236 | Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence | 63403-63435 | 4 | 0.879 |
NZ_CP011394_1 | 1.15|941330|33|NZ_CP011394|CRISPRCasFinder | 941330-941362 | 33 | NZ_LN681230 | Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence | 91355-91387 | 4 | 0.879 |
NZ_CP011394_1 | 1.5|941267|34|NZ_CP011394|PILER-CR,CRT | 941267-941300 | 34 | NZ_CP044178 | Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1 | 18967-19000 | 5 | 0.853 |
NZ_CP011394_1 | 1.5|941267|34|NZ_CP011394|PILER-CR,CRT | 941267-941300 | 34 | CP053324 | Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence | 25136-25169 | 5 | 0.853 |
NZ_CP011394_1 | 1.6|941328|35|NZ_CP011394|PILER-CR,CRT | 941328-941362 | 35 | NZ_CP032236 | Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence | 63403-63437 | 5 | 0.857 |
NZ_CP011394_1 | 1.6|941328|35|NZ_CP011394|PILER-CR,CRT | 941328-941362 | 35 | NZ_LN681230 | Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence | 91353-91387 | 5 | 0.857 |
NZ_CP011394_1 | 1.5|941267|34|NZ_CP011394|PILER-CR,CRT | 941267-941300 | 34 | NZ_LN890526 | Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence | 31716-31749 | 6 | 0.824 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP044178 | Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1 | 18969-19000 | 6 | 0.812 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | CP053324 | Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence | 25138-25169 | 6 | 0.812 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_LN890526 | Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence | 31718-31749 | 7 | 0.781 |
NZ_CP011394_1 | 1.8|941451|34|NZ_CP011394|PILER-CR,CRT | 941451-941484 | 34 | MN694003 | Marine virus AFVG_250M677, complete genome | 17627-17660 | 8 | 0.765 |
NZ_CP011394_1 | 1.12|941147|32|NZ_CP011394|CRISPRCasFinder | 941147-941178 | 32 | MG592432 | Vibrio phage 1.050.O._10N.286.48.A6, partial genome | 21687-21718 | 8 | 0.75 |
NZ_CP011394_1 | 1.12|941147|32|NZ_CP011394|CRISPRCasFinder | 941147-941178 | 32 | MG592431 | Vibrio phage 1.049.O._10N.286.54.B5, partial genome | 21426-21457 | 8 | 0.75 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP053022 | Sphingobium yanoikuyae strain YC-XJ2 plasmid p-A-Sy, complete sequence | 329022-329053 | 8 | 0.75 |
NZ_CP011394_1 | 1.17|941453|32|NZ_CP011394|CRISPRCasFinder | 941453-941484 | 32 | MN694003 | Marine virus AFVG_250M677, complete genome | 17629-17660 | 8 | 0.75 |
NZ_CP011394_1 | 1.10|941025|32|NZ_CP011394|CRISPRCasFinder | 941025-941056 | 32 | CP006879 | Rhizobium gallicum bv. gallicum R602 plasmid pRgalR602b, complete sequence | 405613-405644 | 9 | 0.719 |
NZ_CP011394_1 | 1.12|941147|32|NZ_CP011394|CRISPRCasFinder | 941147-941178 | 32 | NC_047790 | Pseudoalteromonas phage C5a, complete genome | 34441-34472 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP048340 | Escherichia coli strain 142 plasmid p142_C, complete sequence | 2410-2441 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_LR130559 | Escherichia coli strain MS14385 isolate MS14385 plasmid 5 | 41882-41913 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP020518 | Escherichia coli strain 222 plasmid unnamed2, complete sequence | 13450-13481 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP020497 | Escherichia coli strain 103 plasmid unnamed2, complete sequence | 37140-37171 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP040921 | Escherichia coli strain FC853_EC plasmid p853EC2, complete sequence | 32060-32091 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | CP053252 | Escherichia coli strain SCU-204 plasmid pSCU-204-5, complete sequence | 19381-19412 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP042622 | Escherichia coli strain NCYU-26-73 plasmid pNCYU-26-73-7, complete sequence | 2614-2645 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_LT985302 | Escherichia coli strain ECOR 39 genome assembly, plasmid: RCS82_pI | 11943-11974 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP028194 | Escherichia coli strain CFSAN018748 plasmid pGMI14-004_3, complete sequence | 15383-15414 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP024865 | Escherichia coli strain AR_0015 plasmid unitig_3_pilon, complete sequence | 22646-22677 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | AP019710 | Escherichia coli O145:H28 122715 plasmid pO145_122715_2 DNA, complete genome | 4361-4392 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP024829 | Escherichia coli strain CREC-544 plasmid pCREC-544_3, complete sequence | 2221-2252 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP009861 | Escherichia coli strain ECONIH1 plasmid pECO-b75, complete sequence | 2868-2899 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | CP025877 | Escherichia coli strain 503458 plasmid p503458_49, complete sequence | 18343-18374 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP023368 | Escherichia coli strain 1428 plasmid p48, complete sequence | 4914-4945 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP032259 | Escherichia coli strain AR_0067 plasmid unnamed2, complete sequence | 23402-23433 | 9 | 0.719 |
NZ_CP011394_1 | 1.14|941269|32|NZ_CP011394|CRISPRCasFinder | 941269-941300 | 32 | NZ_CP037450 | Escherichia coli strain ATCC 25922 plasmid unnamed, complete sequence | 15851-15882 | 9 | 0.719 |
NZ_CP011394_1 | 1.15|941330|33|NZ_CP011394|CRISPRCasFinder | 941330-941362 | 33 | NZ_CP031947 | Ruegeria sp. AD91A plasmid unnamed1, complete sequence | 143751-143783 | 9 | 0.727 |
NZ_CP011394_2 | 2.2|957877|32|NZ_CP011394|CRISPRCasFinder,CRT | 957877-957908 | 32 | KY006853 | Erythrobacter phage vB_EliS_R6L, complete genome | 41418-41449 | 9 | 0.719 |
NZ_CP011394_2 | 2.6|958121|32|NZ_CP011394|CRISPRCasFinder,CRT | 958121-958152 | 32 | MK449011 | Streptococcus phage Javan92, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP011394_2 | 2.6|958121|32|NZ_CP011394|CRISPRCasFinder,CRT | 958121-958152 | 32 | MK448835 | Streptococcus phage Javan93, complete genome | 36157-36188 | 9 | 0.719 |
NZ_CP011394_2 | 2.6|958121|32|NZ_CP011394|CRISPRCasFinder,CRT | 958121-958152 | 32 | MK448836 | Streptococcus phage Javan95, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP011394_2 | 2.6|958121|32|NZ_CP011394|CRISPRCasFinder,CRT | 958121-958152 | 32 | MK448825 | Streptococcus phage Javan639, complete genome | 37400-37431 | 9 | 0.719 |
NZ_CP011394_2 | 2.8|958243|32|NZ_CP011394|CRISPRCasFinder,CRT | 958243-958274 | 32 | NZ_MG266000 | Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence | 5501-5532 | 9 | 0.719 |
NZ_CP011394_1 | 1.10|941025|32|NZ_CP011394|CRISPRCasFinder | 941025-941056 | 32 | NZ_CP049244 | Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence | 699963-699994 | 10 | 0.688 |
NZ_CP011394_1 | 1.16|941392|32|NZ_CP011394|CRISPRCasFinder | 941392-941423 | 32 | NZ_LR134399 | Listeria monocytogenes strain NCTC7974 plasmid 2, complete sequence | 103231-103262 | 10 | 0.688 |
NZ_CP011394_1 | 1.6|941328|35|NZ_CP011394|PILER-CR,CRT | 941328-941362 | 35 | NZ_CP031947 | Ruegeria sp. AD91A plasmid unnamed1, complete sequence | 143751-143785 | 11 | 0.686 |
1. spacer 1.15|941330|33|NZ_CP011394|CRISPRCasFinder matches to NZ_CP032236 (Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence) position: , mismatch: 4, identity: 0.879
caaacaccaagctcttccgccgcgcgttcctga CRISPR spacer catttaccaagctcttccgctgcgcgttcctga Protospacer ** .***************.************
2. spacer 1.15|941330|33|NZ_CP011394|CRISPRCasFinder matches to NZ_LN681230 (Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence) position: , mismatch: 4, identity: 0.879
caaacaccaagctcttccgccgcgcgttcctga CRISPR spacer catttaccaagctcttccgctgcgcgttcctga Protospacer ** .***************.************
3. spacer 1.5|941267|34|NZ_CP011394|PILER-CR,CRT matches to NZ_CP044178 (Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1) position: , mismatch: 5, identity: 0.853
ccaattt----aggggccggaactccgggaaaggcagc CRISPR spacer ----tttgagaaggggccggaactccgggaaaggcacc Protospacer *** ************************* *
4. spacer 1.5|941267|34|NZ_CP011394|PILER-CR,CRT matches to CP053324 (Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence) position: , mismatch: 5, identity: 0.853
ccaattt----aggggccggaactccgggaaaggcagc CRISPR spacer ----tttgagaaggggccggaactccgggaaaggcacc Protospacer *** ************************* *
5. spacer 1.6|941328|35|NZ_CP011394|PILER-CR,CRT matches to NZ_CP032236 (Yersinia ruckeri strain NHV_3758 plasmid pYR4, complete sequence) position: , mismatch: 5, identity: 0.857
cgcaaacaccaagctcttccgccgcgcgttcctga CRISPR spacer ggcatttaccaagctcttccgctgcgcgttcctga Protospacer *** .***************.************
6. spacer 1.6|941328|35|NZ_CP011394|PILER-CR,CRT matches to NZ_LN681230 (Yersinia ruckeri strain CSF007-82 plasmid pYR3, complete sequence) position: , mismatch: 5, identity: 0.857
cgcaaacaccaagctcttccgccgcgcgttcctga CRISPR spacer ggcatttaccaagctcttccgctgcgcgttcctga Protospacer *** .***************.************
7. spacer 1.5|941267|34|NZ_CP011394|PILER-CR,CRT matches to NZ_LN890526 (Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence) position: , mismatch: 6, identity: 0.824
ccaattt----aggggccggaactccgggaaaggcagc CRISPR spacer ----tttgagaaggggccggaactccggaaaaggcacc Protospacer *** *****************.******* *
8. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP044178 (Salmonella enterica subsp. enterica serovar Concord strain AR-0407 plasmid pAR-0407-1) position: , mismatch: 6, identity: 0.812
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccgggaaaggcacc Protospacer . ************************* *
9. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to CP053324 (Salmonella enterica subsp. salamae serovar 40:c:e,n,z15 strain 2013K-0524 plasmid unnamed, complete sequence) position: , mismatch: 6, identity: 0.812
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccgggaaaggcacc Protospacer . ************************* *
10. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_LN890526 (Salmonella enterica subsp. enterica serovar Weltevreden strain 2511STDY5712385 plasmid 3, complete sequence) position: , mismatch: 7, identity: 0.781
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggaaaaggcacc Protospacer . *****************.******* *
11. spacer 1.8|941451|34|NZ_CP011394|PILER-CR,CRT matches to MN694003 (Marine virus AFVG_250M677, complete genome) position: , mismatch: 8, identity: 0.765
cgcggtcgcagcctggcctgttgccgtagaatcg CRISPR spacer cgcgatcgcagcctgggctgttgccgtgctcgcc Protospacer ****.*********** **********. *
12. spacer 1.12|941147|32|NZ_CP011394|CRISPRCasFinder matches to MG592432 (Vibrio phage 1.050.O._10N.286.48.A6, partial genome) position: , mismatch: 8, identity: 0.75
ctgccggtctgtgctgttgtcgtcaataatca CRISPR spacer aatctgctctgtgctgttgtagtcaattataa Protospacer *.* ************* ****** ** *
13. spacer 1.12|941147|32|NZ_CP011394|CRISPRCasFinder matches to MG592431 (Vibrio phage 1.049.O._10N.286.54.B5, partial genome) position: , mismatch: 8, identity: 0.75
ctgccggtctgtgctgttgtcgtcaataatca CRISPR spacer aatctgctctgtgctgttgtagtcaattataa Protospacer *.* ************* ****** ** *
14. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP053022 (Sphingobium yanoikuyae strain YC-XJ2 plasmid p-A-Sy, complete sequence) position: , mismatch: 8, identity: 0.75
---aatttaggggccggaactccgggaaaggcagc CRISPR spacer gtcagtt---gggccggaaagccgggaaaggcata Protospacer *.** ********* ************
15. spacer 1.17|941453|32|NZ_CP011394|CRISPRCasFinder matches to MN694003 (Marine virus AFVG_250M677, complete genome) position: , mismatch: 8, identity: 0.75
cggtcgcagcctggcctgttgccgtagaatcg CRISPR spacer cgatcgcagcctgggctgttgccgtgctcgcc Protospacer **.*********** **********. *
16. spacer 1.10|941025|32|NZ_CP011394|CRISPRCasFinder matches to CP006879 (Rhizobium gallicum bv. gallicum R602 plasmid pRgalR602b, complete sequence) position: , mismatch: 9, identity: 0.719
aggatttcgccgcgctgttggcctccagattc CRISPR spacer cagggtcgaccgcgctgtcgccctccagattc Protospacer .*. *. .*********.* ***********
17. spacer 1.12|941147|32|NZ_CP011394|CRISPRCasFinder matches to NC_047790 (Pseudoalteromonas phage C5a, complete genome) position: , mismatch: 9, identity: 0.719
ctgccggtctgtgctgttgtcgtcaataatca CRISPR spacer tttggtgtctgtgccgttttcgtcaataagct Protospacer .* ********.*** ********** *
18. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP048340 (Escherichia coli strain 142 plasmid p142_C, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
19. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_LR130559 (Escherichia coli strain MS14385 isolate MS14385 plasmid 5) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
20. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP020518 (Escherichia coli strain 222 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
21. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP020497 (Escherichia coli strain 103 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
22. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP040921 (Escherichia coli strain FC853_EC plasmid p853EC2, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
23. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to CP053252 (Escherichia coli strain SCU-204 plasmid pSCU-204-5, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
24. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP042622 (Escherichia coli strain NCYU-26-73 plasmid pNCYU-26-73-7, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
25. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_LT985302 (Escherichia coli strain ECOR 39 genome assembly, plasmid: RCS82_pI) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
26. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP028194 (Escherichia coli strain CFSAN018748 plasmid pGMI14-004_3, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
27. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP024865 (Escherichia coli strain AR_0015 plasmid unitig_3_pilon, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
28. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to AP019710 (Escherichia coli O145:H28 122715 plasmid pO145_122715_2 DNA, complete genome) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
29. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP024829 (Escherichia coli strain CREC-544 plasmid pCREC-544_3, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
30. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP009861 (Escherichia coli strain ECONIH1 plasmid pECO-b75, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
31. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to CP025877 (Escherichia coli strain 503458 plasmid p503458_49, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
32. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP023368 (Escherichia coli strain 1428 plasmid p48, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
33. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP032259 (Escherichia coli strain AR_0067 plasmid unnamed2, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
34. spacer 1.14|941269|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP037450 (Escherichia coli strain ATCC 25922 plasmid unnamed, complete sequence) position: , mismatch: 9, identity: 0.719
aatttaggggccggaactccgggaaaggcagc CRISPR spacer tgagaaggggccggaactccggtaaagggcac Protospacer . ***************** ***** .*
35. spacer 1.15|941330|33|NZ_CP011394|CRISPRCasFinder matches to NZ_CP031947 (Ruegeria sp. AD91A plasmid unnamed1, complete sequence) position: , mismatch: 9, identity: 0.727
caaacaccaagctcttccgccgcgcgttcctga CRISPR spacer gaaacaccaaggtcttccgccgcacgggcaaag Protospacer ********** ***********.** * ..
36. spacer 2.2|957877|32|NZ_CP011394|CRISPRCasFinder,CRT matches to KY006853 (Erythrobacter phage vB_EliS_R6L, complete genome) position: , mismatch: 9, identity: 0.719
aatatatggcgctcacgcgcatgagcattctc CRISPR spacer acgcaatggcgctgacgcgcatgatcatttcg Protospacer * ******** ********** ****..
37. spacer 2.6|958121|32|NZ_CP011394|CRISPRCasFinder,CRT matches to MK449011 (Streptococcus phage Javan92, complete genome) position: , mismatch: 9, identity: 0.719
gattgctcagattgggaatttgaccagcggcc CRISPR spacer agtcgctcagattgggaatttgtcaagatgta Protospacer ..*.****************** * ** *.
38. spacer 2.6|958121|32|NZ_CP011394|CRISPRCasFinder,CRT matches to MK448835 (Streptococcus phage Javan93, complete genome) position: , mismatch: 9, identity: 0.719
gattgctcagattgggaatttgaccagcggcc CRISPR spacer agtcgctcagattgggaatttgtcaagatgta Protospacer ..*.****************** * ** *.
39. spacer 2.6|958121|32|NZ_CP011394|CRISPRCasFinder,CRT matches to MK448836 (Streptococcus phage Javan95, complete genome) position: , mismatch: 9, identity: 0.719
gattgctcagattgggaatttgaccagcggcc CRISPR spacer agtcgctcagattgggaatttgtcaagatgta Protospacer ..*.****************** * ** *.
40. spacer 2.6|958121|32|NZ_CP011394|CRISPRCasFinder,CRT matches to MK448825 (Streptococcus phage Javan639, complete genome) position: , mismatch: 9, identity: 0.719
gattgctcagattgggaatttgaccagcggcc CRISPR spacer agtcgctcagattgggaatttgtcaagatgta Protospacer ..*.****************** * ** *.
41. spacer 2.8|958243|32|NZ_CP011394|CRISPRCasFinder,CRT matches to NZ_MG266000 (Clostridioides difficile strain 7032985 plasmid pCD-ISS1, complete sequence) position: , mismatch: 9, identity: 0.719
gttgggttgcatagatgacacgcttataaata CRISPR spacer gaattatggcatagatgacatgattataaatt Protospacer * .* ************.* ********
42. spacer 1.10|941025|32|NZ_CP011394|CRISPRCasFinder matches to NZ_CP049244 (Rhizobium pseudoryzae strain DSM 19479 plasmid unnamed3, complete sequence) position: , mismatch: 10, identity: 0.688
aggatttcgccgcgctgttggcctccagattc CRISPR spacer gggatatcgccgcgctgatggcctatgaccac Protospacer .**** *********** ****** ... . *
43. spacer 1.16|941392|32|NZ_CP011394|CRISPRCasFinder matches to NZ_LR134399 (Listeria monocytogenes strain NCTC7974 plasmid 2, complete sequence) position: , mismatch: 10, identity: 0.688
ttgctctcattaaaggggtttccatgtttgat CRISPR spacer gattcgtcattacagtggtttccatgttttag Protospacer .. ****** ** ************* *
44. spacer 1.6|941328|35|NZ_CP011394|PILER-CR,CRT matches to NZ_CP031947 (Ruegeria sp. AD91A plasmid unnamed1, complete sequence) position: , mismatch: 11, identity: 0.686
cgcaaacaccaagctcttccgccgcgcgttcctga CRISPR spacer gagaaacaccaaggtcttccgccgcacgggcaaag Protospacer . ********** ***********.** * ..
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
1414891 : 1420944
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP011394|1414891:1420944|DBSCAN-SWA CATATACTTTTTGTGGGACTGCTTCAACCTTTGGGCAGATATCGGAAATGAAAAAGATAGACCAGGCAACTATTCGCTATCAGAATATCCGGTACACCAACTACCAACTACCAACTACCAACTACCAACAAATCATTTAGTCGATGGTCTCGTTGCCATTGGTTCATAGAGTGTTGGTCTTGGTATGGATGGCTGGGGAAGTTATGTATCGAACATTCTTATGCAAGATTGCGCAGGGTCTGGTGATCTATGGTACACATATGGGAAGGCATTCACATATATTTCTGTAATCGATACTAAAACTTTAACACTAACTAATTGTTTGTAGAAAGTGGTTGCATTATTAATGGTTTGAGACTCATTGACATAAAACAAACACCATCTGGTAATCTGTCAGACCCCGCATCCTTAATAGTTAACTATAAAGATTGTATGGTAGTTGAGATGTCGTTAGTTCTAATATCGTGTCAGATAAATTTATAAAAGATTACTCATGTTTTGTGTCACATGCAAAAATAATTCGGCTTCGTTAAGGTCTTTAGGGGAAATACCTAATGGATAATTAGTTAGATTAACGTTAACAACACTTTGAACGTGTAATGAATATGGGGGTAAAATATAAGTATTGGGAGATTGTAATTAAAAATTATGTAATTGTCTGATTATTATATATTCACTCCAGCAAAGGAGAAAGGCAATTATGGACGAAAAGAAACTCACAGCTCTTGCGGCTGAACTGGCTAAAGGTCTTAAAACTGAAGCCGACCTCAGTCAGTTTTCCCATATGCTGACGACATTAACCGTCGAAACGGCGCTCAATGCTGAGCTGGCTGACCATCTCGGGCATGAGAAAAATGCTCCTAAAACGAGCTCAAACTCCCACAATGGCTACTCATTAAAAACGGTGCTGTATGATGGCGGCGAGATAGTGCTGAACATGCTGCGTGACCGTGAAAATACCTTTGAACTGCAGCTGATTAAGGAGCATCAGACGCGTATTACGAGATGGACAGTCAGATTTATCCCTTTATGCCAAAGGCATGACATGACTACCCGCGAAATCATCGCCACCTTCAAAGAGATGTACAACGCTGATGTGTCGCCCATGCTGATACTTAAGATCACCGCCGCAGTAAAAGAGTTGGTCAAAGGGGCGTCAAAAAATGGAGTATGCCGATCCAGGACTGGTCGCTGGCAATGAGTCGCTTTATTATCGAGTTCGGTGACCGCCTGAGCGATCACCGTTAATATAGTGGCAGTTACACAGTATGACTGGCAGGTTCTATATTGGATTTATATGTAATAGTAACAGCCTATCTATTTGATTATTTTATTTCCGATTTTATTAAAAAGGAAATCAGAACCAGGCTTTGTTAAATGTCCCCAATCAACGGCAGTGATAAAATCAGGACCATTACCAACTCTTGTAAGACATCCACTTTCGTTACATAATGCTTTGTATGCTGATATATATTCAATTCCCATTTTTGGAACATTGTTACTAAAGTAAGAGTCCCACTCGCTTATTTCACTATTTAATCCATATGTCATATACAATGGTGGGGTTTTTTTAAACTCACTCAGGTAGTTAGATATTATTTTAACTAAATTTGCATTCCATTCCGGGACTGGTCCAATGAAAATAATCCTTGAGTCAGGGGATGCCTCTTTAATTTTTTTTATGGTTAATGATAACGTATCAATTGCTAACTTTTTATCATGTACTCCATTTGTTCCTCGAACTGACCATGTCAGCAGAACCACCTCAGGCTGAACACGTTTAATTTCATTAATTCTATTATTGTTTAGAGTGATGACACTTCTCTGTAAATCATCTTTACCGTCAACAAATAGAGGAGGAGCGTTACCATCTGTCATTTGGCTTATTATATAATCAGAACCTTTATTATCTATATAATGAGAAAGTCCATTGAAAAGAGCCGCCGCATAAGAATCACCAATGATAAATATATTATGCTTGCCATTTTTTATACATCCATTGGATATGGCAGCAGTAAGTTGTACTGAGTGGCATATCCCTCCACGGAGTAGTTCTCCATATTTATAATAATTGTACACGTCAGTAACAGAAGCATATTCACCTGCTGATTTATTGATTTCCCTGTCTTTAACTCCATTTATATGAAAAATAAATGCGCCAATTAAACCTGTCCCAAATACTGATAATGCTAATAATATTGCTGTGATATATTTATTTCTGGCATTTCTCAGTGGTTTTTCAATTAAATAATAAGTTAATATCGCCAAAAAGAACGATGATAATAAAAGAAGAATTAATTCATGGTAGTCTGGTGAACCAGCAAATATTGAACGATAGAATGAATAAATAGGCCAATGCCACAAATAAAGAGGATAGCTAATAAGACCAAAGAAAACAACAGGCCTAACACTAAGCAATTTCGACACAACTAAATCATTACCATTAGATGCTATTATAAGAGAGGCGCCAAGTACTGGGATTATTGCTATATATCCAGGAAATGACATCTTTTCATCTATCATGGTTATTGATAATGCGATTAGTATAATTCCTAACAGGGACATTAATTTTGATAACGAAGTGTTTATTCCTATAAATCTCAATGTGGATATAATCGCTCCAGCCATTAACTCCCAAAATCTTGATGCGGGAGAGTAGTAATTAGCTCCGCCATCAGATGCCATTGTAAAAATGCTAATCGCATAGCTAATTATAAATATAGTTGCGCATGATAATACTATGTTTCTGTTATGGTTTTTGCTTCTAAAGCATAGCAATATAACTACTGGCCATATTATATAAAATTGCTCTTCAATTCCCAGTGACCACAAATGTAGTAAAGGTTTAAGGTATGACTTTGAATCAAAATAGCCAGACTCACTCCAAAGAGTAAAGTTTGATATAAAGAATGAGCCGCTAAAAACATGCTTACCAAGTAATTTGTAATCATCCTGGAATAAATAAACCCAGCCAACAATAAGACATGATACAAGAACTATGGATAATGCTGGAAATATTCTAAGCACTCTCCTTTTATAGAAATCAAGGTATGAAAATGATTTGTTTGATGCAGATTTTAATATTATTGATGTTATAAGGTATCCAGATATCACAAAGAATATATCTACTCCAACAAACCCACCCGGCAATAATGATGGGAAATAATGGAATATTACCACAGATAAAACCGCTATTGCGCGTAATCCATCTATATCAGGTCTGTATTTTAAGTGTTCCAACTTAAATTACCTCAATTTTAAAAAAAGATTAATAAAATGGTTGTGCATCTTGCATCATTCCCGAAGTTTCGTGTAAACGAAAACGGAATGACGAGTGGATCAGATACGGCTATTATTTTAATTATTGACTCTGTCACATCTTTACTTCCGTCATTAATAAAAATAATCTCAACTTTATATTTTTCAAGTTCATTAAACTCATGTACCGTTCTATAGAAAATCGGTATCGTGTCTTCTTCGTTAAAAACTAGGACGACAAGAGAGATTTTCATTTTTATCCCTGAAGATAAAGAATCTGGAATAGATAAAGCCGCATACCAGGCTAATTGCCGAGAAAGTGATGAGGGTAACCAATGGTGGCAAAGAACATTGGTCAGCCATCCATCCAACAACAGCGCTCAGTGCTCCCATGAATCCCATATACATCATGTAGCGAATTGCAGTAGTGCTGGCATTAAAGGTGAAGCGCGCATTAGCATAGAAGCTGAACGATACAGCGATAACAAAATCGGAAAAGTTCGTCAGCGCCTGATGCGTATGCATCCCATACATACAGAAAGCAAATACTCTCCAATGAATGAGCATGTTAAGAACACCGATCGATGTGTACTTAGCGAATAACTTCAACATTATGAAAATTATCAGATTCAGAAAGGTCTGGAGTGTAGCACTACAAATTGGTTTGATCGATATAAGCGATCAATAATTGTATTTTTAATAGTTTTAAACTATTGAGTTTTAATATATTGATCGATGTTATCGATCAATTGGTATTGCTGATTGCCAAGCGTCTTGGAATAAAAACGGGACATGTAAAGCTTTGCATCGTCTTACAAGGCTTTGCATTTTTTTTCAGGGAGAGGTACTTGAAAGGGTGGAAGTGCTGGGGGGAGGGGGAGCGTTAAAAATTCTGTATAATTTTAGTAACATAAAATAAAAAAGAATGGCACATGTCCCATCCCTTCGATTTCGACAAAGCACTTAAAGCCCTTCAGGTCCGGCCAGGCATTAACGGGCAAAGATGGCATCTTAACGCCATTAATCAAGTATTTAACCGAGTCTACCCTGTCTGCTGAACTTGATTCCCATCTGGCTCAGGATGTTGAGGCAAACCGTAAAAATGGTTCCGGCAAAAAGCCATTAAAGCCCCAACAGGCAGTTTTGAACTGGCAACTCCGCGCAATCGTAACGGCACTTTTGAGCCATAACTGGTGAAGAAGCTTCAGACCCCCTGTACGACGAGATCGAGCGCAATATCATTCGACTGTTTGCGCTGGAGATGAGTTATCAGGACATCGGCCGGGAGAGTGAAGATCTTTATGCCTTCAGCGTTTCAACCGCCACCGTCAGTGCAGTACCGATAAAGTTATCCCTGAACTAAAACAGTGGCAACAGCGCCCGCTGGAGAAGGTTTATCCCGTCGTCTGGCTGGACGCTATTCATTATAAAAACCGTGAGGATGGCCGTTATCAGAGCAAGGCGGTTTATACCGTTCTGGCACTGAATCTGGAAGGCAAAAAAGAAGTTCTGGGCCTATATCTGTCGGAAAGTGAAGGTGCTAACTTTTGGTTAAGTTCTAACAGCGAGAGGGTACTTTAAAGGGATGCTTTTCGTTATGTTTATAGGCACTATTCGCTGGAAATCATAAGACATCAAAAACGCTGCAACGCCTTGTGTGGTGTGGGGTTGCTGAGATTTGTGAGAGGTGGGTAAAAGAGGTCATGGTGTCCCCTGCAGGAATCGAACCTGCAACTAGCCCTTAGGAGGGGCTCGTTATATCCATTTAACTAAGGGGACAACGCGGCGCCAGTATAGCGTTTTTTATTCGCCGGAGTAAGTGTAGCGCCGCCTGACTGGTTAAACCGTCGCCACTCAGCGCTGTTTTTCCGCTTTTTTCCGCTCCCGTTCCAGGCGCTCGCCGCGTAGCCTCGCTTCTTCCTTACGCTTATTGCTCATATCGTTGCGGATCTGCGCGTGGCTCATCAATGCGAAAATAAAAGTGCCGCCGCAGATATTTCCGGCAAGTGTGGGAAGGGCGAAGGGCCAGAGAAAGTCGCTCCAGGGCAGCGTGCCGTTGAAAACCAAATACAAAATTTCAACGGAACCGACGACAATATGGGTGGTATCGCCCAGCGCGATAAGCCAGGTCATCAAAATAATGACCACAATCTTTGCCCCGCCTGCTGCAGGAAACATCCATACCATTGTGGCGATGATCCAGCCAGAGATAATCGCGTTGGCAAACATCTCCGTTGGGCTATTTTTCATGACCTCCATACCAATTTTGACAAAGGCGTCGCGGGTCTCTTCATCAAATATAGGCATATATTCAAATGCCCACGCCGCAACCCCGGTGCCAATAAGGTTGCCCAATAAGACTACGCCCCACAAGCGCATCAGCAGGCCAACGTTACTCAGAGTGGGATTTTGCATTACCGGCAACACGGCGGTAACGGTATTTTCAGTAAATAATTGCTGGCGGGCCATGATGACAATGATAAAACCAAAGGTATAGCCGAGATTTTCCAGTAAAAAGCCGCCGGGAACGCCTTCAAGCTGCACGTGGAAAATCCCTTTCGCCAGGAGTGATGCCCCCATAGAAAGTCCTGCGGCAATGGCTGACCAGAGCAAAGCCATCGCATCGCGTTCCATCTCTTTTTCACCATCCTGGCGAATATGTTCATGAATCGCCATGGCGCGGGAAGGAAGACGATCTTCATCCACTTCAATCTCTTTACCGCTTTGTTTTTCTTCGCTTTCAACTTCCAGGTCACTGCTTTGCCGGTTAATTTTATCGTCGTTAAGGCTATCCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP011394|1414891:1420944|1418147_1418402_-|WP_000703599.1|DBSCAN-SWA MKISLVVLVFNEEDTIPIFYRTVHEFNELEKYKVEIIFINDGSKDVTESIIKIIAVSDPLVIPFSFTRNFGNDARCTTILLIFF >NZ_CP011394|1414891:1420944|1414891_1415059_+|WP_105789229.1|DBSCAN-SWA MYFLWDCFNLWADIGNEKDRPGNYSLSEYPVHQLPTTNYQLPTNHLVDGLVAIGS >NZ_CP011394|1414891:1420944|1415074_1415218_+|WP_105789228.1|DBSCAN-SWA MDGWGSYVSNILMQDCAGSGDLWYTYGKAFTYISVIDTKTLTLTNCL >NZ_CP011394|1414891:1420944|1416207_1418130_-|WP_000400616.1|DBSCAN-SWA MEHLKYRPDIDGLRAIAVLSVVIFHYFPSLLPGGFVGVDIFFVISGYLITSIILKSASNKSFSYLDFYKRRVLRIFPALSIVLVSCLIVGWVYLFQDDYKLLGKHVFSGSFFISNFTLWSESGYFDSKSYLKPLLHLWSLGIEEQFYIIWPVVILLCFRSKNHNRNIVLSCATIFIISYAISIFTMASDGGANYYSPASRFWELMAGAIISTLRFIGINTSLSKLMSLLGIILIALSITMIDEKMSFPGYIAIIPVLGASLIIASNGNDLVVSKLLSVRPVVFFGLISYPLYLWHWPIYSFYRSIFAGSPDYHELILLLLSSFFLAILTYYLIEKPLRNARNKYITAILLALSVFGTGLIGAFIFHINGVKDREINKSAGEYASVTDVYNYYKYGELLRGGICHSVQLTAAISNGCIKNGKHNIFIIGDSYAAALFNGLSHYIDNKGSDYIISQMTDGNAPPLFVDGKDDLQRSVITLNNNRINEIKRVQPEVVLLTWSVRGTNGVHDKKLAIDTLSLTIKKIKEASPDSRIIFIGPVPEWNANLVKIISNYLSEFKKTPPLYMTYGLNSEISEWDSYFSNNVPKMGIEYISAYKALCNESGCLTRVGNGPDFITAVDWGHLTKPGSDFLFNKIGNKIIK >NZ_CP011394|1414891:1420944|1420002_1420944_-|WP_000377779.1|DBSCAN-SWA MDSLNDDKINRQSSDLEVESEEKQSGKEIEVDEDRLPSRAMAIHEHIRQDGEKEMERDAMALLWSAIAAGLSMGASLLAKGIFHVQLEGVPGGFLLENLGYTFGFIIVIMARQQLFTENTVTAVLPVMQNPTLSNVGLLMRLWGVVLLGNLIGTGVAAWAFEYMPIFDEETRDAFVKIGMEVMKNSPTEMFANAIISGWIIATMVWMFPAAGGAKIVVIILMTWLIALGDTTHIVVGSVEILYLVFNGTLPWSDFLWPFALPTLAGNICGGTFIFALMSHAQIRNDMSNKRKEEARLRGERLERERKKAEKQR >NZ_CP011394|1414891:1420944|1418370_1418760_-|WP_001576268.1|DBSCAN-SWA MLKLFAKYTSIGVLNMLIHWRVFAFCMYGMHTHQALTNFSDFVIAVSFSFYANARFTFNASTTAIRYMMYMGFMGALSAVVGWMADQCSLPPLVTLITFSAISLVCGFIYSRFFIFRDKNENLSCRPSF |
6 | Salmonella_virus(50.0%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1657386 : 1666557
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP011394|1657386:1666557|DBSCAN-SWA GATGATTGAATTTAACCATGTCAGTAAAACCTTCGGCGATCAACAGGCTGTTAGCGACCTCAATTTGCACTTTAGCGAAGGCAGCTTTTCGGTGTTAATTGGCACCTCCGGTTCGGGAAAATCGACCACTCTGAAGATGATTAACCGGCTGGTAGAGCATGATAGCGGAACGATCCGTTTTGCCGGGGAAGAGATCCGCAGCCTGCCGGTGCTTGAACTGCGCCGTCGCATGGGCTATGCCATTCAGTCTATCGGTCTTTTTCCCCACTGGACGGTGGCGCAAAATATCGCCACCGTACCGCAACTACAAAAGTGGTCGCGTGCGCGGATTAACGATCGTATTGATGAACTGATGGCATTATTGGGTCTGGAAAGCGCGCTGCGCGATCGTTATCCGCATCAGCTTTCCGGCGGGCAACAGCAGCGGGTCGGCGTTGCGCGGGCGCTGGCTGCCGATCCGCAGGTATTGCTGATGGACGAGCCTTTCGGCGCGCTTGATCCGGTAACGCGCGGCGCATTGCAGCAGGAGATGACCCGCATTCATCAGCTACTGGGGCGCACCATCGTACTGGTGACGCACGACATCGACGAGGCGCTACGCCTCGCCGACCATCTGGTGCTGATGGACGGGGGCAATGTTATCCAACAGGGATCGCCACTTTCTATGCTGACCTCGCCGGAAAATGATTTCGTGCAGGCGTTTTTTGGCCGCAGCGAGCTGGGCGTAAGGCTGCTTTCGTTACGTAGCGTAGGCGATTATGTACGTCGGCATGAACAGCTCAGCGGCGACGCGCTGGTGGAAGAGATGACGCTACGCGATGCGCTATCGATGTTTGTCGCCCGTCGGTGCGACGTCCTGCCGGTGGCGAATCAGCAGGGCGAGCCCTGCGGTACGCTCCATTTCCGCGATCTACTTTCGGAGACGTCCCCCCGTGAAACGACTGTGTGATCCGCTTCTCTGGCTTATTGTTCTGTTCTTGCTTCTGCTGTTTGGATTGCCTTATAGCCAGCCGTTCTTCGCCGCGCTGTTTCCCGATTTACCGCGCCCGGTCTACCAACAGGAGAGTTTTGCCGCCCTCGCGCTCGCCCATTTCTGGTTGGTGGGCATCTCAAGTCTGTTTGCCGTCGTGGTGGGCGTCGGCGCAGGGATTGCGGTCACGCGAGAAAGTGGGAAGGAGTTTCGTCCCCTGGTGGAGACTATCGCCGCCGTCGGGCAGACCTTTCCCCCGGTCGCGGTACTGGCGATCGCGGTACCCGTCATGGGTTTTGGTCAGCAACCAGCCATTATCGCCTTGATCCTGTATGGAGTGTTGCCCATCCTGCAGGCGACCCTGGCCGGGCTGGGCGCGGTGCCTGCCAGCGTGATGAGCGTTGCCAGCGGTATGGGAATGAGCCGTCGCCAACAGTTGTATCAGGTTGAACTGCCGCTGGCCGCGCCGGTGATTCTGGCGGGCATCCGAACCTCGGTGATTATCAATATTGGTACGGCGACCATCGCTTCAACAGTGGGGGCCAGTACGTTAGGCACGCCGATCATTATCGGGCTTAGCGGCTTTAATACGGCCTATGTTATCCAGGGGGCGCTGCTGGTGGCGCTGGCGGCGATCATTATCGATCGCCTGTTTGAAAGGCTGACGCGCGCGCTTACCCGGCACGCAAAATAAAACTGTAACCTGCCAGCATCACGCCGCCGATACCGCCAATAGCCATCAGCAGGAAAAGGGCGATCACCCCGATTTTCGCTACGCGCATTATGTACTCCTTATGTTAATAAAAGGAGTATACATTAAAGCGAATTTGTTAGCTGCTGTTTAAACGCCAAGGGGATGAATGTCGCGTCCCTGGGCGCGCCATGCCAGGAGTTGCTGCTGCTGCGCCAGCGTCTGGTTTTCTCCGCACCATACCAGTAACGTCTTGCCGTCAAACAGTTCCGGGCGGAACTGGCTAAGCGAATGCGCCAGCACGTCGATTCGCCATCCCTGTTGGCTGGCGACCCAACCTTCCAGCCACAGGCGGGTGGTATCATGGATATTCCAGCCGATCACCAACGCATCTTTTCCCTGTTTCTTACGCGCAGACGCCAGGCAGAGCGCAATATAGTTGATCAGGATACCGTCAAGAATGCCGAGCAGCGCCTGAAGGGCAGGTTGTTGGCACTGTAATCGTCGCCGCAGCGGGACGAACAGGTTAGTGGTCAATGTTTGGGCTGGATAATCCTGACAGCGTTCTTTGACCCATAACCGTAAACTGTGCAGATTACTGCTTTGCAGGTAGTGCAGCAGGATCTCCTGCTGTTCGCGCCAGCCGTTAGGTTGTTCGCTACTGTCGCTACTGAGCAGCACTTTGACTTTGCTGACCTGGACGCCGTTATTTATCCAGCGCTTGATTTCGCGGATTCTGTCGATATCGGCATCGTTAAACAGACGATGACCGCCATCCGTTCGCTGTGGTTTTAAAAGTCCATAACGTCTCTGCCACGCGCGCAACGTGACAGGATTGATATCACAAAGCAAAGCCACTTCACCAATTGTGTAAAGCGCCATCGTTTCACCCTTGCTCGCGAGGTCCCGGTTTAACTTTAGACGCTCTTTTACGAACCAGGAAGTTTTGCCTGTTTTTTATGCATTAAAACGCGAAGTAGCGGGTTGCGGCGGGGTGTTTAAGTGATCGTATTCACGAATTCATATTTTTATGCAACAGTTCAAAGAAAGTTAATCGTACTCAATGTATGTTACGCGCTTTTAATTGAAGTGTGGTTTGCGGGTATGTACGAGTTTAATCTGGTGTTGCTGCTGCTTCAGCAGATGTGCGTGTTTCTGGTCATTGCGTGGCTAATGAGTAAAACGCGTCTGTTCATCCCGCTTATGCAGGTCACGGTTCGTCTGCCGCACAAGCTTCTGTGTTACGTCACGTTTTCTATCTTCTGCATCATGGGCACTTATTTTGGGCTGCATATCGAAGATTCGATTGCCAATACCCGCGCAATTGGCGCGGTGATGGGCGGCCTACTCGGCGGGCCGGTCGTCGGCGGGCTGGTCGGTCTGACCGGTGGGTTACATCGGTATTCTATGGGCGGCATGACGGCGCTGAGCTGTATGATTTCCACCATCGTCGAAGGGCTGCTGGGCGGGTTGGTACACAGCGTTCTCATACGTCGCGGACGCCCGGACAAAGTGTTTAGCCCGCTGACGGCGGGAGCAATTACGTGTATTGCCGAACTGGTGCAGATGCTGATCATTTTACTGATAGCCAGGCCGTTTGACGATGCCTTGCATCTGGTCAGTAATATTGCCGCGCCGATGATGGTGACGAATACCGTTGGCGCCGCGCTGTTCATGCGTATTTTGCTCGATAAGCGCGCCATGTTCGAAAAATATACTTCGGCATTTTCTGCTACCGCGCTGAAGGTCGCCGCGTCAACGGAGGGGATTCTGCGTCAGGGATTTAACGAAGTGAACAGTATGAAGGTGGCGCAGGTGTTATATCAGGAGCTGGATATTGGCGCCGTCGCCATCACCGATCGCGAAAAACTGCTGGCTTTTACTGGTATTGGCGACGATCACCATCTACCGGGCAAACCCATTTCATCAGGTTATACGCTGAAAGCAATTGAAACCGGAGAGGTGGTTTATGCCGATGGCAACGAAGTGCCGTATCGCTGTTCGCTACACCCGCAGTGTAAACTCGGCTCGACGCTGGTGATCCCGCTGCGTGGCGAAAATCAGCGAGTCATGGGCACCATTAAATTGTACGAAGCGAAAAACCGGCTGTTTAGCTCAATTAACCGCACCCTGGGAGAGGGTATTGCGCAGCTTTTATCCGCGCAGATCCTGGCCGGGCAGTATGAACGGCAGAAGGCGTTGCTGACGCAGTCAGAGATCAAGCTGTTGCACGCGCAGGTGAACCCGCATTTTCTGTTTAACGCGCTCAATACCATTAAAGCGGTGATTCGCCGCGACAGCGAACAGGCCAGCCAACTGGTGCAGTACTTGTCGACCTTTTTTCGCAAAAATTTAAAACGCCCGTCGGAAATCGTCACGCTGGCGGATGAAATTGAACACGTAAACGCTTATCTGCAAATTGAAAAAGCGCGTTTTCAGTCGCGTCTGCAGGTACAGCTTGATGTTCCATCGACGCTTTCACGTCAGAAATTGCCTGCGTTTACATTACAGCCGATTGTTGAGAACGCCATTAAACATGGCACGTCGCAACTGCTTGATACCGGCAACGTCGCTATTCGCGCCCGGCGCGAAGGGCAGCATTTGATGTTAGATATTGAGGATAATGCGGGACTGTATCAGCCTTCCGCCGGCAGTAGCGGGCTGGGGATGAGTCTGGTTGATAAACGTCTGCGCGAACACTTTGGCGATGATTATGGTATTAGCGTGGCCTGCGAGCCGGACTGTTTTACCCGAATTACATTACGACTTCCACTGGAGGAGGACGCATGATTAAAGTGCTGATTGTGGATGATGAGCCGTTAGCGCGGGAAAATCTGCGGATTTTGCTCCAGGGGCAGGATGACATTGAGATTGTGGGAGAGTGCGCGAACGCGGTAGAAGCGATTGGCGCGGTACATAAGTTGCGACCTGATGTGCTGTTTCTGGATATTCAGATGCCGCGTATCAGTGGACTGGAGATGGTAGGAATGCTTGATCCGGAACACCGCCCGTATATCGTTTTTTTAACCGCGTTTGACGAATACGCCATCAAAGCCTTTGAAGAACACGCTTTTGATTATCTGCTCAAGCCGATAGAGGAGAAACGGCTGGAAAAAACGTTACATCGTCTGCGTCAGGAGCGCAGTAAACAGGATGTTTCGTTGTTGCCGGAAAACCAGCAGGCGCTTAAATTCATTCCCTGTACCGGACACAGCCGGATCTATTTGTTGCAAATGGATGATGTCGCCTTTGTCAGTAGCCGTATGAGCGGCGTTTATGTGACCAGCAGTGAAGGGAAAGAGGGGTTTACCGAGCTGACGCTGCGCACGCTGGAAAGCCGGACGCCGCTACTGCGTTGTCATCGTCAGTTTCTGGTGAATATGGCCCATTTGCAGGAAATTCGGCTGGAGGATAATGGGCAGGCAGAGCTGATTTTACGCAACGGCCTGACGGTGCCGGTAAGCCGTCGCTATCTGAAAAGTTTAAAAGAGGCGATTGGCCTGTAAAAGACTGTTAGAATATCGTTTTGCCATAGAAACGACCGAAGGCCTCATGCTGAGTAACGATATTCTGCGTAGCGTGCGCTACATTTTAAAAGCTAATAATACCGATCTGGCGCGTATCCTGGCGCTGGGTAACGTTGATGCTACGCCGGAGCAGATAGCAATCTGGTTGCGCAAAGAAGAGGAAGAGGGGTTTCAGCGTTGCCCGGATATCGTGTTGTCCTCATTTCTCAATGGCCTCATTTATGAAAAACGCGGCAAAGATGAGGCGGCGCCTGCATTGACGGCGGAACGTCGTATCAACAACAATATTGTGCTGAAAAAGCTGCGTATTGCCTTTTCGCTAAAAACAGATGATATCCTGGCGATACTTACCGGTCAGTTGTTTCGTGTCTCAATGTCAGAGATCACCGCGATGATGCGCGCGCCGGACCATAAGAACTTCCGCGAATGCGGCGATCAGTTTATGCGTTATTTTCTGCGCGGTCTGGCGGCCCGTGAACACGCGGCGAAGTAATTCTGCGGTATTGTTCCCGGCAGCGTCCTGTCTGACCGGGAAAACGCATTATTATACTAATTGATTCTATGATACCCGCTCTCTTCCAACAGTTTCTGCGAGCGAATCATTGACAGATAGTACGCGGAACAGTTGTCAATTGATGATCCTGGCAATTTACAGAGGTCGCTTATTTTTGCCTGGGTAAAATCAATATCCACATATTCCGTAGCATAGCTATCATAATAGTCGATTCGTTCAGTCAAACCCGGCATACCCTGATAAGCTTCGCCGACTTGACTCAGCATTTTTTGTGCTTCTTCTTTATTATTGGCTTTCAGGGTCTTATAAAGTAGTTTATGTTCAGATATTTGACGTAAAACGATATCCCCTTTGTAGTAATAGGTTAACTTGATTTCAATACCGTTGAGATTACCTACATAGCGTTGTGTTTCCTCTGACTCTTTGCTAGCGGCTATCTTCTTGATAAACGCTGTCATGTTATTTTGCTTTCCCTGGAGAGTATCGTTTTTCTGATCGCAGCCAGTTATGCTAACCGATAGAGAGAGCGCGAATAGTGGCAGTGCCATAAGACGTAGAACCTGCATAACAATTCCTTGTCGTTAAGTATTGGTGTGGCCAGGAATTCAGGGATTATAGGCTTTGGCGAGGGGACTTACAGCGAGGCTGTCTTTTTTCGGAATTCATAAAGAAAAGACGCTGCCGAAGCAGCGCCCTGAGCGACTTTACCAGTCGATGCAATACATTATGCCTGCCAGTTATTTCGCTTCTTTAAAACCAGCAGCTTCCAGCAGCGTCTGGGTTTGTTTCATGCTGATACCTTTGCTGGTATCGCCGGACACCATCGTCCCTGAGATTTGCTGTAACGCTTTAAAGTCCACTTTTTCCATATCCACAGAGACGTTTTCCTGGGCATAGGTATCTTCATAGGTTAATTTTTCTTCCACTCCGGCGATATTTTTATATTTCGCGCTCAGCGGATCGAGAATTTTGGCGGCATCTTCTTTCGTTTTAGCGCCTACAGTGGCATAGCTGATTTTACTTTCAGACGTCTGCTTAATGATTTTGTCACCTTTATAGGTGTAAGTAATTGAAATTTCTGTCCCCGCCAGGTTTGCGTTAAAGGTCTTTGATTCTTCTTTATCGCCACAGCCAGCAAGAGAGAACACCAGTACGGAAGCCAAAGCCGTGGACAATAACTTGCCAGAAATTTTCATCTAAAACTCCATTTTATATAATAATTGGGCTTTTAAAATAATTTCAATGAATTAATTTAACCCAGTAATAGCAATGTATCAGGGAGAGATAGAATATGACTTTTAGCCGTTATTTAGCAGTCCGGATATGGAGTCTTAGCGCTATTGCTTATTAAGGAAAAAGTTAAAACGTGCGGAGGAGGCGATATGCCAGTCAGGATTAAGCGGTTAAAAAAGCCGGAGCATGCTCCGGCTTGTTGCTTATTTCACCTGTTGGCCAGGCTTCGCGCCGTCATCAGGGCTTAACAGGAAGATATCTTTCCCGCCAGGGCCTGCGGCCATCACCATTCCCTCGGAGACGCCAAAGCGCATTTTGCGCGGCGCGAGGTTGGCGACCATTACCGTCTGGCGGCCGATCAGCGCCTGCGGGTCCGGGTAGGCGGAACGAATGCCGGAGAAGACGTTACGCTTCTCGCCGCCCAGATCCAGCGTCAGACGCAGCAATTTGTCGGAGCCTTCCACGAACTCAGCGTTTTCAATCAATGCTACGCGCAGGTCAATTTTGGCGAAATCGTCAAAGGTGATGGTTTCCTGAATCGGGAAGTCGGCTAACGGGCCGGTAACCGGTGCGGCTGCGGCTTTCACCTCTTCTTTAGACGCTTCAACCAGCGCTTCAACTTGCTTCATGTCGATGCGATTGTAGAGCGCCTTAAAGGTGTTGACCTTGTGACTGAGCAGCGGCTGTTCGATGGCATCCCAGTTCAGTTCGCTGTTCAGGAAGGCTTCAACGCGTTCAGAAAGCGTCGGCAGTACCGGTTTCAGATACGTCATCAGCACGCGGAACAGGTTGATGCCCATCGAGCAAATGGCCTGCAGGTCAGCGTCGCGGCCTTCCTGTTTAGCCACCACCCACGGCGCTTGCTCGTCAACATAACGGTTAGCGACGTCGGCCAGCGCCATAATCTCACGGATAGCTTTGCCGAATTCACGGCTTTCCCATGCTTCGCCAATCACCGCAGCGGCGTCAGTAAAGGTTTTGTACAATTGCGGATCGGCCAGTTCAGCCGCCAGCACGCCGTCGAAACGCTTATTGATAAAACCGGCGTTACGGGATGCCAGGTTGACTACTTTATTGACGATATCGGCATTGACGCGCTGGACAAAGTCTTCCAGGTTCAGGTCGATGTCATCAATGCGTGAAGAAAGCTTCGCGGTGTAGTAGTAGCGCAGGCTGTCGGCGTCAAAGTGTTTCAGCCAGGTGCTGGCCTTAATAAAGGTGCCGCGAGACTTAGACATCTTCGCGCCGTTCACCGTCACGTAACCGTGAACGAACAGGTTGGTCGGCTTACGGAAGTGGCTGCCTTCCAGCATGGCAGGCCAGAACAGGCTGTGGAAATAGACGATGTCTTTGCCGATAAAGTGATACAGCTCGGCGTCGGAGTCTTTTTTCCAGTACTCATCAAAACTGGTCGTGTCACCGCGCTTATCGCACAGATTTTTGAAGGAGCCCATATAGCCAATCGGCGCGTCCAGCCAGACGTAGAAATATTTGCCCGGCGCGTTCGGGATTTCGAAACCAAAATACGGCGCGTCGCGGGAAATGTCCCACTGTTGCAGGCCGGATTCAAACCACTCCTGCATTTTGTTCGCCACCTGCTCCTGCAGCGCGCCGCTGCGGGTCCACGCCTGCAGCATTTCGCTGAATGACGGCAGATCAAAGAAAAAGTGCTCGGAGTCACGCATTACCGGCGTCGCGCCGGACACCACGGATTTCGGCTCGATAAGTTCGGTCGGGCTGTAAGTTGCGCCGCAGACTTCACAGTTATCGCCGTACTGGTCCGCGGATTTACATTTCGGGCAGGTGCCTTTCACAAATCGGTCCGGCAGGAACATGCCTTTTTCCGGATCGTAGAGTTGAGAGATAGTGCGGTTCTTAATAAAACCGTTCTCTTTCAGGCGCGTATAAATCAGCTCGGACAGCTCGCGATTCTCGTCGCTGTGCGTTGAGTGGTAGTTGTCGTAGCTAATATTAAAACCGGCGAAATCGGTCTGGTGCTCCTGGCTCATTTCACCGATCATTTGCTCCGGCGTAATACCAAGCTGCTGCGCTTTCAGCATGATCGGCGTGCCATGAGCGTCATCGGCACAGATGAAGTTAACCTCATGGCCGCGCATTCGCTGGTAACGGACCCAGACATCAGCCTGGATGTGCTCCAGCATATGGCCGAGGTGGATAGAGCCGTTGGCGTACGGCAGCGCGCACGTTACCAGAATTTTCTTCGCGACTTGAGTCAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP011394|1657386:1666557|1658317_1659049_+|WP_000824854.1|DBSCAN-SWA MKRLCDPLLWLIVLFLLLLFGLPYSQPFFAALFPDLPRPVYQQESFAALALAHFWLVGISSLFAVVVGVGAGIAVTRESGKEFRPLVETIAAVGQTFPPVAVLAIAVPVMGFGQQPAIIALILYGVLPILQATLAGLGAVPASVMSVASGMGMSRRQQLYQVELPLAAPVILAGIRTSVIINIGTATIASTVGASTLGTPIIIGLSGFNTAYVIQGALLVALAAIIIDRLFERLTRALTRHAK >NZ_CP011394|1657386:1666557|1659196_1659928_-|WP_001240420.1|DBSCAN-SWA MALYTIGEVALLCDINPVTLRAWQRRYGLLKPQRTDGGHRLFNDADIDRIREIKRWINNGVQVSKVKVLLSSDSSEQPNGWREQQEILLHYLQSSNLHSLRLWVKERCQDYPAQTLTTNLFVPLRRRLQCQQPALQALLGILDGILINYIALCLASARKKQGKDALVIGWNIHDTTRLWLEGWVASQQGWRIDVLAHSLSQFRPELFDGKTLLVWCGENQTLAQQQQLLAWRAQGRDIHPLGV >NZ_CP011394|1657386:1666557|1663122_1663653_-|WP_001197951.1|DBSCAN-SWA MQVLRLMALPLFALSLSVSITGCDQKNDTLQGKQNNMTAFIKKIAASKESEETQRYVGNLNGIEIKLTYYYKGDIVLRQISEHKLLYKTLKANNKEEAQKMLSQVGEAYQGMPGLTERIDYYDSYATEYVDIDFTQAKISDLCKLPGSSIDNCSAYYLSMIRSQKLLEESGYHRIN >NZ_CP011394|1657386:1666557|1661832_1662552_+|WP_000598637.1|DBSCAN-SWA MIKVLIVDDEPLARENLRILLQGQDDIEIVGECANAVEAIGAVHKLRPDVLFLDIQMPRISGLEMVGMLDPEHRPYIVFLTAFDEYAIKAFEEHAFDYLLKPIEEKRLEKTLHRLRQERSKQDVSLLPENQQALKFIPCTGHSRIYLLQMDDVAFVSSRMSGVYVTSSEGKEGFTELTLRTLESRTPLLRCHRQFLVNMAHLQEIRLEDNGQAELILRNGLTVPVSRRYLKSLKEAIGL >NZ_CP011394|1657386:1666557|1657386_1658334_+|WP_000569168.1|DBSCAN-SWA MIEFNHVSKTFGDQQAVSDLNLHFSEGSFSVLIGTSGSGKSTTLKMINRLVEHDSGTIRFAGEEIRSLPVLELRRRMGYAIQSIGLFPHWTVAQNIATVPQLQKWSRARINDRIDELMALLGLESALRDRYPHQLSGGQQQRVGVARALAADPQVLLMDEPFGALDPVTRGALQQEMTRIHQLLGRTIVLVTHDIDEALRLADHLVLMDGGNVIQQGSPLSMLTSPENDFVQAFFGRSELGVRLLSLRSVGDYVRRHEQLSGDALVEEMTLRDALSMFVARRCDVLPVANQQGEPCGTLHFRDLLSETSPRETTV >NZ_CP011394|1657386:1666557|1663824_1664283_-|WP_000703145.1|DBSCAN-SWA MKISGKLLSTALASVLVFSLAGCGDKEESKTFNANLAGTEISITYTYKGDKIIKQTSESKISYATVGAKTKEDAAKILDPLSAKYKNIAGVEEKLTYEDTYAQENVSVDMEKVDFKALQQISGTMVSGDTSKGISMKQTQTLLEAAGFKEAK >NZ_CP011394|1657386:1666557|1659029_1659137_-|WP_001261696.1|DBSCAN-SWA MRVAKIGVIALFLLMAIGGIGGVMLAGYSFILRAG >NZ_CP011394|1657386:1666557|1662598_1663066_+|WP_000950414.1|DBSCAN-SWA MLSNDILRSVRYILKANNTDLARILALGNVDATPEQIAIWLRKEEEEGFQRCPDIVLSSFLNGLIYEKRGKDEAAPALTAERRINNNIVLKKLRIAFSLKTDDILAILTGQLFRVSMSEITAMMRAPDHKNFRECGDQFMRYFLRGLAAREHAAK >NZ_CP011394|1657386:1666557|1664523_1666557_-|WP_000195340.1|tRNA|DBSCAN-SWA MTQVAKKILVTCALPYANGSIHLGHMLEHIQADVWVRYQRMRGHEVNFICADDAHGTPIMLKAQQLGITPEQMIGEMSQEHQTDFAGFNISYDNYHSTHSDENRELSELIYTRLKENGFIKNRTISQLYDPEKGMFLPDRFVKGTCPKCKSADQYGDNCEVCGATYSPTELIEPKSVVSGATPVMRDSEHFFFDLPSFSEMLQAWTRSGALQEQVANKMQEWFESGLQQWDISRDAPYFGFEIPNAPGKYFYVWLDAPIGYMGSFKNLCDKRGDTTSFDEYWKKDSDAELYHFIGKDIVYFHSLFWPAMLEGSHFRKPTNLFVHGYVTVNGAKMSKSRGTFIKASTWLKHFDADSLRYYYTAKLSSRIDDIDLNLEDFVQRVNADIVNKVVNLASRNAGFINKRFDGVLAAELADPQLYKTFTDAAAVIGEAWESREFGKAIREIMALADVANRYVDEQAPWVVAKQEGRDADLQAICSMGINLFRVLMTYLKPVLPTLSERVEAFLNSELNWDAIEQPLLSHKVNTFKALYNRIDMKQVEALVEASKEEVKAAAAPVTGPLADFPIQETITFDDFAKIDLRVALIENAEFVEGSDKLLRLTLDLGGEKRNVFSGIRSAYPDPQALIGRQTVMVANLAPRKMRFGVSEGMVMAAGPGGKDIFLLSPDDGAKPGQQVK >NZ_CP011394|1657386:1666557|1660150_1661836_+|WP_000272845.1|DBSCAN-SWA MYEFNLVLLLLQQMCVFLVIAWLMSKTRLFIPLMQVTVRLPHKLLCYVTFSIFCIMGTYFGLHIEDSIANTRAIGAVMGGLLGGPVVGGLVGLTGGLHRYSMGGMTALSCMISTIVEGLLGGLVHSVLIRRGRPDKVFSPLTAGAITCIAELVQMLIILLIARPFDDALHLVSNIAAPMMVTNTVGAALFMRILLDKRAMFEKYTSAFSATALKVAASTEGILRQGFNEVNSMKVAQVLYQELDIGAVAITDREKLLAFTGIGDDHHLPGKPISSGYTLKAIETGEVVYADGNEVPYRCSLHPQCKLGSTLVIPLRGENQRVMGTIKLYEAKNRLFSSINRTLGEGIAQLLSAQILAGQYERQKALLTQSEIKLLHAQVNPHFLFNALNTIKAVIRRDSEQASQLVQYLSTFFRKNLKRPSEIVTLADEIEHVNAYLQIEKARFQSRLQVQLDVPSTLSRQKLPAFTLQPIVENAIKHGTSQLLDTGNVAIRARREGQHLMLDIEDNAGLYQPSAGSSGLGMSLVDKRLREHFGDDYGISVACEPDCFTRITLRLPLEEDA |
10 | Enterobacteria_phage(66.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
1733755 : 1744262
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP011394|1733755:1744262|DBSCAN-SWA TATGCCCGTGAATAAGTTCTCCCGACGTACCCTCCTGACGGCAGGTTCCGCGCTTGCTGTTCTTCCTTTTCTGCGCGCCTTGCCGGTACAGGCGCGTGAACCTCGCGAGACCGTCGATATTAAGGATTATCCGGCGGATGACGGTATCGCCTCGTTCAAACAGGCCTTCGCCGACGGACAGACCGTGGTCTTACCGCCAGGATGGGTGTGTGAAAATATCAATGCGGCGATAACGATTCCGGCGGGAAAAACGCTGCGGGTACAGGGCGCGGTGCGTGGGAATGGCCGGGGACGGTTTATTTTGCAGGACGGGTGTCAGGTGGTGGGGGAGCAGGGCGGCAGTCTGCACAATGTGACGCTGGATGTTCGCGGGTCGGACTGTGTGATTAAAGGCGTGACGATGAGCGGCTTTGGCCCCGTCGCGCAAATTTTCATCGGCGGTAAGGAACCGCAGGTGATGCGTAATCTCATTATCGATGACATCACCGTTACCCACGCCAACTACGCCATTCTCCGCCAGGGATTTCATAACCAAATGGACGGCGCGCGGATTACGCATAGCCGCTTTAGCGATTTGCAGGGGGACGCCATTGAGTGGAATGTCGCGATTCACGACCGCGACATCCTGATTTCCGATCATGTCATCGAACGCATTGATTGTACCAATGGCAAAATCAACTGGGGGATCGGCATCGGGCTGGCGGGTAGCACCTATGACAACAGTTATCCTGAAGATCAGGCAGTAAAAAACTTTGTGGTGGCCAATATTACCGGATCTGATTGCCGACAGCTGGTGCACGTAGAAAATGGCAAACATTTCGTCATTCGCAATGTCAAAGCCAAAAACATCACGCCCGATTTCAGTAAAAATGCGGGTATTGATAACGCAACGATCGCCATTTATGGCTGTGATAATTTCGTCATTGATAATATTGATATGACGAATAGTGCTGGGATGCTCATCGGCTATGGCGTCGTTAAAGGAAAATACCTGTCAATTCCGCAAAACTTTAAATTAAACGCTATTCGGTTGGATAATCGCCAGGTTGCTTATAAATTACGCGGCATTCAAATTTCCTCCGGCAACATCCCCTCTTTTGTCGCCATCACCAATGTACGGATGACGCGTGCTACGCTGGAACTGCATAATCAACCGCAGCACCTCTTTCTGCGTAATATCAACGTGATGCAAACTTCAGCGATTGGCCCGGCGTTAAAAATGCATTTCGATTTGCGTAAAGATGTCCGTGGTCAATTTATGGCCCGCCAGGACACGCTGCTTTCCCTCGCTAATGTTCATGCCATCAATGAAAACGGGCAGAGTTCCGTGGATATCGACAGGATTAATCACCAAACCGTGAATGTCGAAGCAGTGAATTTTTCGCTGCCGAAGCGGGGAGGGTAAGTACCGCTATTTTTACGAAAATTCCTGGGAAAAAGTTGTTCATACTTAATGTTATGGTGCCGACTAAGACGTAATGTAAAGCGTGCCATCATTATCCCTGGCAGCAGAGTAATTCATGCTGGCGAAAACAAGCTAAAGAGCTATAATTCAGCAACCATTTTACAGGTGGAAGAAACAATGATGAATTTGAAAGCAGTTATACCGGTAGCGGGTTTGGGTATGCATATGTTGCCTGCCACCAAGGCAATCCCAAAAGAGATGCTACCGATCGTCGACAAGCCAATGATTCAGTACATTGTCGATGAGATTGTGGCTGCAGGGATCAAAGAAATCGTACTGGTGACTCACGCGTCTAAAAACGCCGTTGAGAACCACTTCGACACCTCTTATGAACTTGAATCACTTCTTGAGCAGCGCGTTAAGCGCCAGCTTTTGGCGGAAGTGCAATCTATCTGCCCACCGGGCGTGACGATTATGAACGTTCGCCAGGCGCAGCCGTTAGGGCTGGGGCACTCTATTCTGTGCGCGCGTCCGGTCGTGGGCGATAACCCTTTCATTGTGGTACTCCCGGATATTATTATCGATGATGCTACCGCCGATCCGCTGCGCTATAACCTTGCGGCGATGGTGGCGCGTTTCAATGAAACGGGTCGCAGCCAGGTGCTGGCGAAGCGCATGAAAGGTGATTTATCGGAGTATTCCGTTATCCAGACGAAAGAACCTCTGGATAATGAAGGCAAAGTCAGCCGGATTGTGGAGTTTATCGAAAAACCGGATCAGCCGCAGACGCTGGATTCCGATTTGATGGCGGTAGGCCGTTATGTGCTTTCAGCCGACATCTGGGCGGAACTGGAAAGAACCGAACCGGGCGCCTGGGGCCGTATCCAGCTCACCGATGCCATTGCAGAACTGGCGAAAAAACAGTCGGTTGACGCGATGCTAATGACGGGTGACAGCTATGACTGCGGTAAAAAAATGGGCTACATGCAGGCATTTGTGAAGTACGGGCTGCGCAACCTGAAAGAAGGAGCGAAGTTCCGTAAGAGCATAGAGCAGCTTTTGCATGAATAAGTATTAACAACCGTGATAAATGGTTGGTGATAAACATAATAACGGCAGTGAACATTCGAAGCGGCAAGTTGGCTGAAGCGAGTGTTGACTGCCGTTTTAGTTTTGTATAAAGGGCTTAAGTAACAAGGGGTTATCTGGAGCATTTTAATGCTGATTTTATAAGATTAATCCTTGTTTCCGGATGCAATTAATAAGACAATTAGCGTTTAAGTTTTAGTGAGCTTTGCCCTGCTGGGCGAGGTTTGTAACAAGTCGATATGTACGCAGTGCACTGGTAGCTGATGAGCCAGGGGCGGTAGCGTGTGTAACGACTTGAGCAATTAATTTTTATTGGCAAATTAAATACCACATTAAATACGCCTTATGGAATAGAAAAGTGAAGATACTTATTACTGGCGGGGCAGGTTTTATTGGATCAGCTGTTGTCCGCCATATTATTAAGAATACACAGGACACTGTAGTTAATATTGATAAATTAACCTACGCCGGTAATCTTGAATCCCTTTCTGATATTTCTGAGAGTAATCGCTACAATTTTGAACACGCGGATATTTGTGATTCCGCTGAAATAACGCGTATTTTTGAGCAGTACCAGCCGGACGCGGTGATGCATTTGGCTGCGGAAAGTCATGTGGACCGTTCGATTACCGGGCCAGCAGCATTTATTGAAACCAATATCGTCGGCACCTATGTACTTCTTGAAGTTGCGCGTAAATACTGGTCTGCGCTTGGCGAAGATAAAAAAAATAATTTTCGTTTTCATCATATTTCCACTGATGAAGTTTACGGCGATTTACCGCATCCTGATGAAGTTGAAAACAGCGTTACGCTGCCGTTATTTACTGAAACGACGGCATATGCGCCAAGTAGCCCCTATTCTGCGTCAAAAGCATCCAGCGATCATTTAGTCCGTGCCTGGCGGCGTACCTATGGTCTACCAACGATCGTTACCAATTGTTCTAATAACTATGGCCCTTATCACTTCCCTGAAAAACTGATTCCGTTGGTCATTTTGAACGCACTGGAAGGAAAGCCTTTGCCAATTTATGGCAAAGGGGATCAGATTCGCGATTGGCTATATGTAGAAGATCATGCTCGCGCGCTTCATATGGTAGTGACTGAAGGCAAGGCGGGGGAGACTTATAACATTGGTGGCCACAATGAGAAGAAAAATCTCGATGTGGTATTTACCATCTGTGATCTGCTGGACGAGATTGTACCCAAAGCGACTTCTTATCGTGAACAAATCACTTATGTCGCGGATCGTCCGGGCCATGATCGTCGTTATGCCATTGATGCAGGTAAAATTAGCCGCGAATTAGGCTGGAAACCGCTGGAGACCTTTGAAAGCGGTATTCGTAAAACAGTGGAATGGTACCTTGCAAATACTCAATGGGTAAACAATGTTAAAAGTGGGGCGTATCAGAGTTGGATAGAACAGAACTATGAAGGACGCCAGTAATGAATATCTTACTTTTTGGTAAGACAGGGCAAGTAGGCTGGGAGTTGCAACGTTCTCTGGCACCAGTAGGGAATCTGATTGCCCTGGATGTCCATTCAAAAGAGTTTTGCGGTGATTTTAGTAATCCGAAAGGCGTTGCCGAAACCGTTCGTAAGCTTCGTCCCGATGTGATTGTTAACGCAGCAGCACATACTGCAGTAGATAAAGCAGAGTCTGAACCAGAACTGGCGCAGTTACTTAACGCCACCAGTGTGGAAGCCATCGCTAAAGCAGCCAACGAAACTGGCGCATGGGTAGTGCATTATTCAACCGATTATGTATTTCCTGGTACCGGCGATATCCCATGGCAGGAAACGGACGCTACGTCGCCGCTGAATGTCTATGGCAAGACCAAACTGGCGGGAGAAAAGGCCCTGCAGGATAACTGCCCTAAGCATCTTATCTTCCGCACCAGTTGGGTTTATGCAGGTAAGGGCAATAATTTCGCAAAGACAATGCTTCGTCTGGCGAAAGAGCGTCAGACACTTTCAGTCATCAACGATCAGTACGGTGCGCCAACCGGTGCAGAATTACTGGCTGACTGCACGGCTCATGCGATCCGTGTGGCGTTAAAGAAACCAGAAGTCGCAGGTCTTTACCATCTGGTTGCCGGGGGAACCACAACCTGGCATGACTACGCGGCCTTAGTCTTTGACGAGGCGCGCAAAGCAGGGATAACGCTTGCGCTGACTGAGCTTAATGCTGTGCCGACCAGCGCCTACCCGACGCCGGCGAGCAGACCAGGCAATTCGCGTCTCAATACTGAAAAGTTTCAGCGTAATTTTGACCTTATTCTGCCGCAATGGGAATTAGGAGTTAAGCGCATGCTGACTGAAATGTTTACGACGACAACCATCTGATAAATTTAAATGCCCATCAGGGCATTTTCTATGAATGAGAAATGGAAATGAAAACGCGTAAGGGCATTATTTTAGCGGGGGGCTCCGGCACCCGTCTTTATCCGGTGACCATGGCGGTAAGTAAGCAATTGCTACCAATTTATGATAAACCGATGATTTACTATCCCCTTTCCACGCTTATGCTGGCAGGCATTCGGGATATCCTGATCATCAGTACGCCACAGGACACGCCGCGTTTTCAACAACTGCTGGGAGACGGCAGCCAGTGGGGGCTGAATCTTCAATATAAAGTACAGCCAAGCCCGGATGGCTTAGCACAGGCGTTTATTATTGGTGAAGAGTTCATTGGTAATGATGATTGTGCATTAGTACTGGGTGACAATATCTTCTATGGTCATGATTTACCAAAGTTAATGGAAGCTGCCGTTAATAAAGAAAGTGGTGCTACCGTCTTTGCTTATCATGTAAACGATCCGGAGCGCTACGGTGTGGTTGAGTTTGACCAAAGTGGCACAGCCGTTAGTCTGGAGGAAAAACCGTTACAACCGAAGAGTAATTACGCGGTAACGGGGCTGTATTTTTATGATAATAGCGTGGTGGAGATGGCGAAAAATCTTAAGCCTTCCGCTCGCGGTGAGTTAGAAATCACGGATATTAACCGTATCTATATGGATCAGGGAAGATTGTCTGTCGCCATGATGGGGCGCGGTTATGCCTGGCTGGATACAGGGACGCATCAGAGTTTGATAGAGGCCAGTAATTTTATTGCAACCATCGAAGAACGCCAGGGGCTAAAAGTGTCCTGCCCGGAAGAGATCGCATTTCGTAAAAATTTTATAAATGCACAACAGGTTATAGAACTGGCCGGGCCATTATCAAAAAATGATTATGGCAAATATTTGCTGAAGATGGTGAAAGGTTTATAAGTGATGATTGTGATTAAAACAGCAATACCAGATGTCTTGATCTTAGAGCCTAAAGTTTTTGGCGATGAGAGGGGATTCTTTTTTGAAAGTTATAACCAGCAGACCTTTGAAGAGTTGATTGGACGTAAAGTTACATTTGTTCAAGATAATCATTCAAAATCCAAAAAGAACGTACTCAGAGGGCTACATTTTCAGAGAGGAGAAAATGCACAGGGGAAGTTAGTTCGTTGTGCTGTCGGTGAGGTTTTTGATGTTGCGGTCGATATCCGAAAAGAATCGCCTACTTTTGGTCAATGGGTTGGCGTAAATCTATCTGCTGAGAATAAGCGACAGCTTTGGATTCCAGAAGGTTTTGCTCATGGTTTTGTTACTCTTAGTGAGTATGCAGAGTTTCTGTACAAAGCAACTAATTATTACTCACCTTCATCGGAAGGTAGCATTTTATGGAATGATGAGACAATAGGTATTGAATGGCCTTTTTCTCAGCTGCCTGAGCTTTCAGCAAAAGATGCTGCAGCACCTTTACTGCATCAAGCCTTGTTAACAGAGTAAGCATCGTGTCTCATATTATTAAGATTTTTCCATCAAATATTGAATTTTCCGGTAGAGAGGATGAATCAATCCTCGATGCTGCGCTATCGGCTGGCATCCATCTTGAACATAGCTGCAAAGCGGGTGATTGTGGTATCTGTGAGTCCGATTTGTTGGCGGGAGAAGTTGTTGACTCCAAAGGTAATATTTTTGGACAGGGTGATAAAATACTAACCTGCTGCTGTAAACCTAAAACCGCCCTTGAGCTAAATGCGCATTTTTTTCCTGAACTAGCTGGACAGACAAAAAAAATTGTCCCATGCAAGGTAAATAGTGCTGTACTGGTTTCAGGCGATGTTATGACTTTGAAGTTACGCACACCACCAACAGCAAAAATTGGCTTCCTTCCAGGGCAGTATATCAATTTACATTATAAAGGTGTAACTCGCAGTTATTCTATCGCTAATAGTGATGAGTCGAATGGTATTGAGTTGCATGTAAGGAATGTTCCCAATGGTCAGATGAGCTCTCTCATTTTTGGGGAGTTACAAGAAAATACTCTTATGCGCATTGAAGGACCTTGCGGAACATTTTTTATTCGTGAAAGTGACAGACCTATAATCTTCCTTGCAGGCGGTACTGGATTCGCTCCAGTTAAATCAATGGTTGAGCATCTCATTCAGGGAAAATGTCGTCGTGAGATCTACATCTACTGGGGAATGCAAGATAGTAAAGATTTTTACTCTGCATTACCGCAGCAGTGGAGTGAACAGCACGACAACGTTCATTATATCCCTGTTGTTTCTGGTGATGACGCCGAATGGGGGGGAAGAAAGGGATTTGTCCATCATGCTGTGATGGATGATTTTGATTCTCTAGAGTTCTTCGATATATATGCATGTGGTTCACCTGTGATGATCGATGCCAGTAAAAAGGACTTTATGATGAAAAATCTCTCTGTAGAACATTTCTATTCTGATGCATTTACCGCATCTAAATAATATTGAGGATAATTTATGAAAGCGGTCATCCTGGCTGGTGGACTTGGTACCAGACTAAGTGAAGAAACAATTGTAAAACCAAAACCGATGGTAGAAATTGGTGGCAAGCCTATTCTTTGGCACATTATGAAAATGTATTCTGTGCATGGTATCAAGGATTTTATTATCTGCTGTGGTTATAAAGGATATGTGATTAAAGAATATTTTGCGAACTACTTCCTTCACATGTCAGATGTAACATTCCATATGGCTGAAAATCGTATGGAAGTTCACCATAAACGTGTTGAACCATGGAATGTCACATTGGTTGATACGGGTGATTCTTCAATGACTGGTGGTCGTCTGAAACGTGTTGCTGAATACGTAAAAGATGACGAGGCTTTCCTGTTTACTTATGGTGATGGCGTTGCCGACCTTGATATCAAAGCGACTATCGATTTCCATAAGGCTCACGGTAAGAAAGCGACTTTAACAGCTACTTTTCCACCAGGACGTTTTGGCGCATTAGATATCCAAGCTGGTCAGGTCCGGTCATTCCAGGAAAAACCGAAAGGCGATGGGGCAATGATCAATGGTGGTTTCTTTGTGTTGAATCCATCGGTTATCGATCTCATCGATAACGATGCAACAACCTGGGAACAAGAGCCATTAATGACATTGGCACAACAGGGGGAGTTAATGGCTTTTGAACACCCAGGTTTCTGGCAGCCGATGGATACCCTACGTGATAAAGTTTACCTTGAAGGGCTGTGGGAAAAAGGTAAAGCTCCGTGGAAAACCTGGGAGTAAGTAGATGATTGATAAAAATTTTTGGCAAGGTAAACGTGTATTCGTTACCGGCCATACTGGCTTTAAAGGAAGCTGGCTTTCGCTATGGCTGACTGAAATGGGTGCAATTGTAAAAGGCTATGCACTTGATGCGCCAACTGTTCCAAGTTTATTTGAGATAGTGCGTCTTAGTGATCTTATGGAATCTCATATTGGCGACATTCGTGATTTTGAAAAGCTGCGCAATTCTATTGCAGAATTTAAGCCAGAAATTGTTTTCCATATGGCAGCCCAGCCTTTAGTGCGCCTATCTTATGAACAGCCAATCGAAACATACTCAACAAATGTTATGGGTACTGTCCATTTGCTTGAAGCAGTTAAGCAAGTAGGTAACATAAAGGCAGTCGTAAATATCACCAGTGATAAGTGCTACGACAATCGTGAGTGGGTGTGGGGCTATCGTGAGAACGAACCCATGGGAGGGTACGATCCATACTCTAATAGTAAAGGTTGTGCAGAATTAGTCGCGTCTGCATTCCGGAACTCATTCTTCAATCCTGCAAATTATGAGCAACATGGCGTTGGTTTGGCGTCTGTGAGGGCTGGTAATGTCATAGGCGGAGGCGATTGGGCTAAAGACCGTTTAATTCCCGATATTCTGCGCTCATTTGAAAATAACCAGCAGGTTATTATTCGAAACCCATATTCTATCCGTCCCTGGCAGCATGTACTGGAGCCTCTTTCTGGTTACATTGTGGTGGCGCAACGCTTATATACAGAAGGTGCTAAGTTTTCTGAAGGATGGAATTTCGGCCCGCGTGATGAAGATGCGAAGACGGTCGAATTTATTGTTGACAAGATGGTCACGCTTTGGGGTGATGATGCAAGCTGGTTACTGGATGGTGAGAATCATCCTCATGAGGCACATTACCTGAAACTGGATTGCTCTAAAGCAAATATGCAATTAGGATGGCATCCGCGTTGGGGATTGACTGAAACACTTGGTCGCATCGTAAAATGGCATAAAGCATGGATTCGCGGCGAAGATATGTTGATTTGTTCAAAGCGTGAAATCAGCGACTATATGTCTGCAACTACTCGTTAAGAAAATAAGTTTAAGGAATCAAAGTAATGACAGCAAATAACCTGCGTGAGCAAATCTCTCAGCTTGTCGCTCAGTATGCGAATGAGGCATTGAGCCCGAAACCTTTTGTTGCAGGTACAAGCGTTGTGCCTCCTTCCGGGAAGGTTATTGGTGCCAAAGAGTTACAATTGATGGTTGAGGCGTCTCTTGATGGATGGCTAACTACTGGTCGTTTCAATGATGCCTTTGAGAAAAAACTTGGGGAATTTATTGGGGTTCCTCATGTTTTAACGACTACATCTGGCTCTTCGGCAAACTTGCTGGCACTGACTGCGCTGACTTCCCCAAAATTAGGCGAGCGTGCTCTCAAACCTGGTGATGAGGTTATTACTGTCGCTGCTGGCTTCCCGACTACAGTTAACCCGGCGATCCAGAATGGTTTAATACCGGTATTCGTGGATGTTGATATCCCGACATATAATATCGATGCCTCTCTCATTGAAGCTGCAGTTACTGAGAAATCAAAAGCGATAATGATCGCTCATACACTCGGTAATGCATTTAACCTGAGTGAAGTTCGTCGGATTGCCGATAAATATAACTTATGGTTGATTGAAGACTGCTGTGATGCCCTTGGGACGACTTATGAAGGCCAGATGGTAGGTACCTTTGGTGACATCGGAACCGTTAGTTTTTATCCGGCTCACCATATCACAATGGGTGAAGGCGGTGCTGTATTCACCAAGTCAGGTGAACTGAAGAAAATTATTGAGTCGTTCCGTGACTGGGGCCGGGATTGTTATTGTGCGCCAGGATGCGATAACACCTGCGGTAAACGTTTTGGTCAGCAATTGGGATCACTTCCTCAAGGCTATGATCACAAATATACTTATTCCCACCTCGGATATAATCTCAAAATCACGGACATGCAGGCAGCATGTGGTCTGGCTCAGTTGGAGCGCGTAGAAGAGTTTGTAGAGCAGCGTAAAGCTAACTTTTCCTATCTGAAACAGGGCTTGCAATCTTGCACTGAATTCCTCGAATTACCAGAAGCAACAGAGAAATCAGACCCATCCTGGTTTGGCTTCCCTATCACCCTGAAAGAAACTAGCGGTGTTAACCGTGTCGAACTGGTGAAATTCCTTGATGAAGCAAAAATCGGTACACGTTTACTGTTTGCTGGAAATCTGATTCGCCAACCGTATTTTGCTAATGTGAAATATCGTGTAGTGGGTGAGTTGACAAATACCGACCGTATAATGAATCAAACGTTCTGGATTGGTATTTATCCTGGCTTGACTACAGAGCATTTAGATTATGTAGTTAGCAAATTTGAAGAGTTTTTTGGTTTAAATTTCTAA
Protein sequences of DBSCAN-SWA_3 >NZ_CP011394|1733755:1744262|1740074_1741049_+|WP_000018223.1|DBSCAN-SWA MSHIIKIFPSNIEFSGREDESILDAALSAGIHLEHSCKAGDCGICESDLLAGEVVDSKGNIFGQGDKILTCCCKPKTALELNAHFFPELAGQTKKIVPCKVNSAVLVSGDVMTLKLRTPPTAKIGFLPGQYINLHYKGVTRSYSIANSDESNGIELHVRNVPNGQMSSLIFGELQENTLMRIEGPCGTFFIRESDRPIIFLAGGTGFAPVKSMVEHLIQGKCRREIYIYWGMQDSKDFYSALPQQWSEQHDNVHYIPVVSGDDAEWGGRKGFVHHAVMDDFDSLEFFDIYACGSPVMIDASKKDFMMKNLSVEHFYSDAFTASK >NZ_CP011394|1733755:1744262|1741842_1742922_+|WP_000565913.1|DBSCAN-SWA MIDKNFWQGKRVFVTGHTGFKGSWLSLWLTEMGAIVKGYALDAPTVPSLFEIVRLSDLMESHIGDIRDFEKLRNSIAEFKPEIVFHMAAQPLVRLSYEQPIETYSTNVMGTVHLLEAVKQVGNIKAVVNITSDKCYDNREWVWGYRENEPMGGYDPYSNSKGCAELVASAFRNSFFNPANYEQHGVGLASVRAGNVIGGGDWAKDRLIPDILRSFENNQQVIIRNPYSIRPWQHVLEPLSGYIVVAQRLYTEGAKFSEGWNFGPRDEDAKTVEFIVDKMVTLWGDDASWLLDGENHPHEAHYLKLDCSKANMQLGWHPRWGLTETLGRIVKWHKAWIRGEDMLICSKREISDYMSATTR >NZ_CP011394|1733755:1744262|1735336_1736230_+|WP_000981469.1|DBSCAN-SWA MMNLKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAVENHFDTSYELESLLEQRVKRQLLAEVQSICPPGVTIMNVRQAQPLGLGHSILCARPVVGDNPFIVVLPDIIIDDATADPLRYNLAAMVARFNETGRSQVLAKRMKGDLSEYSVIQTKEPLDNEGKVSRIVEFIEKPDQPQTLDSDLMAVGRYVLSADIWAELERTEPGAWGRIQLTDAIAELAKKQSVDAMLMTGDSYDCGKKMGYMQAFVKYGLRNLKEGAKFRKSIEQLLHE >NZ_CP011394|1733755:1744262|1737691_1738591_+|WP_001023658.1|DBSCAN-SWA MNILLFGKTGQVGWELQRSLAPVGNLIALDVHSKEFCGDFSNPKGVAETVRKLRPDVIVNAAAHTAVDKAESEPELAQLLNATSVEAIAKAANETGAWVVHYSTDYVFPGTGDIPWQETDATSPLNVYGKTKLAGEKALQDNCPKHLIFRTSWVYAGKGNNFAKTMLRLAKERQTLSVINDQYGAPTGAELLADCTAHAIRVALKKPEVAGLYHLVAGGTTTWHDYAALVFDEARKAGITLALTELNAVPTSAYPTPASRPGNSRLNTEKFQRNFDLILPQWELGVKRMLTEMFTTTTI >NZ_CP011394|1733755:1744262|1741064_1741838_+|WP_000648783.1|DBSCAN-SWA MKAVILAGGLGTRLSEETIVKPKPMVEIGGKPILWHIMKMYSVHGIKDFIICCGYKGYVIKEYFANYFLHMSDVTFHMAENRMEVHHKRVEPWNVTLVDTGDSSMTGGRLKRVAEYVKDDEAFLFTYGDGVADLDIKATIDFHKAHGKKATLTATFPPGRFGALDIQAGQVRSFQEKPKGDGAMINGGFFVLNPSVIDLIDNDATTWEQEPLMTLAQQGELMAFEHPGFWQPMDTLRDKVYLEGLWEKGKAPWKTWE >NZ_CP011394|1733755:1744262|1738638_1739517_+|WP_000857535.1|DBSCAN-SWA MKTRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFQQLLGDGSQWGLNLQYKVQPSPDGLAQAFIIGEEFIGNDDCALVLGDNIFYGHDLPKLMEAAVNKESGATVFAYHVNDPERYGVVEFDQSGTAVSLEEKPLQPKSNYAVTGLYFYDNSVVEMAKNLKPSARGELEITDINRIYMDQGRLSVAMMGRGYAWLDTGTHQSLIEASNFIATIEERQGLKVSCPEEIAFRKNFINAQQVIELAGPLSKNDYGKYLLKMVKGL >NZ_CP011394|1733755:1744262|1736606_1737692_+|WP_000697846.1|DBSCAN-SWA MKILITGGAGFIGSAVVRHIIKNTQDTVVNIDKLTYAGNLESLSDISESNRYNFEHADICDSAEITRIFEQYQPDAVMHLAAESHVDRSITGPAAFIETNIVGTYVLLEVARKYWSALGEDKKNNFRFHHISTDEVYGDLPHPDEVENSVTLPLFTETTAYAPSSPYSASKASSDHLVRAWRRTYGLPTIVTNCSNNYGPYHFPEKLIPLVILNALEGKPLPIYGKGDQIRDWLYVEDHARALHMVVTEGKAGETYNIGGHNEKKNLDVVFTICDLLDEIVPKATSYREQITYVADRPGHDRRYAIDAGKISRELGWKPLETFESGIRKTVEWYLANTQWVNNVKSGAYQSWIEQNYEGRQ >NZ_CP011394|1733755:1744262|1739517_1740069_+|WP_000973709.1|DBSCAN-SWA MMIVIKTAIPDVLILEPKVFGDERGFFFESYNQQTFEELIGRKVTFVQDNHSKSKKNVLRGLHFQRGENAQGKLVRCAVGEVFDVAVDIRKESPTFGQWVGVNLSAENKRQLWIPEGFAHGFVTLSEYAEFLYKATNYYSPSSEGSILWNDETIGIEWPFSQLPELSAKDAAAPLLHQALLTE >NZ_CP011394|1733755:1744262|1733755_1735159_+|WP_001144948.1|DBSCAN-SWA MPVNKFSRRTLLTAGSALAVLPFLRALPVQAREPRETVDIKDYPADDGIASFKQAFADGQTVVLPPGWVCENINAAITIPAGKTLRVQGAVRGNGRGRFILQDGCQVVGEQGGSLHNVTLDVRGSDCVIKGVTMSGFGPVAQIFIGGKEPQVMRNLIIDDITVTHANYAILRQGFHNQMDGARITHSRFSDLQGDAIEWNVAIHDRDILISDHVIERIDCTNGKINWGIGIGLAGSTYDNSYPEDQAVKNFVVANITGSDCRQLVHVENGKHFVIRNVKAKNITPDFSKNAGIDNATIAIYGCDNFVIDNIDMTNSAGMLIGYGVVKGKYLSIPQNFKLNAIRLDNRQVAYKLRGIQISSGNIPSFVAITNVRMTRATLELHNQPQHLFLRNINVMQTSAIGPALKMHFDLRKDVRGQFMARQDTLLSLANVHAINENGQSSVDIDRINHQTVNVEAVNFSLPKRGG >NZ_CP011394|1733755:1744262|1742948_1744262_+|WP_000126349.1|DBSCAN-SWA MTANNLREQISQLVAQYANEALSPKPFVAGTSVVPPSGKVIGAKELQLMVEASLDGWLTTGRFNDAFEKKLGEFIGVPHVLTTTSGSSANLLALTALTSPKLGERALKPGDEVITVAAGFPTTVNPAIQNGLIPVFVDVDIPTYNIDASLIEAAVTEKSKAIMIAHTLGNAFNLSEVRRIADKYNLWLIEDCCDALGTTYEGQMVGTFGDIGTVSFYPAHHITMGEGGAVFTKSGELKKIIESFRDWGRDCYCAPGCDNTCGKRFGQQLGSLPQGYDHKYTYSHLGYNLKITDMQAACGLAQLERVEEFVEQRKANFSYLKQGLQSCTEFLELPEATEKSDPSWFGFPITLKETSGVNRVELVKFLDEAKIGTRLLFAGNLIRQPYFANVKYRVVGELTNTDRIMNQTFWIGIYPGLTTEHLDYVVSKFEEFFGLNF |
10 | Enterobacteria_phage(37.5%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_4 |
1851257 : 1904681
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP011394|1851257:1904681|DBSCAN-SWA TATGCCCCTTTTTGGATCATATTTTCAGATGGGAGATATAATTATGATAAAGAAAAGAGGATTTACTCTATTAGAGATCACCATCGTTTTAGGTATAGGTTCTCTTATTGGTTTTATGAAGTTTCAGGACATGAGGAAAGAACAGGAAGCAGTAATGGCACAGGCGGTAGGCTTTCAAATGAAACAGGTTGGCGAAGCAGTTAACCGTTATATTAGCATTCGCTATAATAAGCTTTCCACACTGTCATCTTCACGCAGTCAAAGTAGCGATCCGGGACCAAGAACCTGTACGGCTAATGGCTGTGAAATCACTTACCAGACGCTGGTGAATGAAAGTCTCTTGCCCTCTACATATGCGGGAGTTAATGCCCAAAAGTCTTCTTATAAAATCCTGTTAAAACGATCAGGCGTATCACCAAACTATGTTATCAATGGTTTGATTGCTACGACGATACCTTGGGTCGAAGGTGGAAAAACGCGATACGACCTGTTAGGAAAATCGATGCAATCAGCAGGTATTGATAGCGGTATGACGAAATCGCCTGCTCAGGCATCAGGATATAATGGAGTGTGGACAGAAAAGTCTACTGATTACCCGGCAATTAATAAAGCAGGATTACTGGTTTACCGTGTGGGCTATGATAGTTCTATGTATTCTGTTTACTTGCGTCGCGATGGAACATTGCCTATGACTGGAAATCTCAATATGGGCACTAACGATATTAATAATGCCAAGAACATTACCGCATCGGGAACGGGGATCTTTGGTGGGGATGTTTCTTCCGCAGGGAAAATAATCGCAGCACAAGAAATAATAGCGCATAATGGTTATGGTGATGCCATACATTTTGGCGGCGATGCTGTTAATAACGACTATGAAATAACCATGTCTAAGGATAAGACCTTATCTATTCATATGGCAAGCAATCGAACAGATCTCACTACGCTGAAAATTAGTGGAGGGCTGTCTACGATTGGTAATGGAACGATTAGTGGAACATTAGATACAGGTAAAAGTATCACATCCGGTGGACAATTTAATGGTCATAACGGTGGTGGTGATAGTTTCAGTATTGGTGGTGGAGATGCTAACGATTATGAGTTCAGATTAGATACTGTTAAGCCGCTAACCATCTGGCGAAATGGTGGCACCAGTACTGAAACAAGGCTTCAGGTTTTTGGGAAACAAACCAACCAAGGTGATTTTGCTATTACTCCGGGGACAGGAAGCACTGGTAGTATTGCAGCATCTGGAAATATTCAGGGACATACTCTACAACCTACATCGACGAATACAACTGGTGGGACATGTCCAAGTATAGGCCTGATATCTAAAGACAACATGGGAAATATATTATCCTGCGTTAATGGAAAATGGTCTGTTGTCAGTAACCTGCCAGTTGGTTCACCTGTTCCTTGGCCTTCAAATACAGCGCCTGTAGGTTGGTTAATATGCAGGGGACAATCTTTTAATACTGCCCTATATCCTCAGTTAGCAAAAGCCTATCCAAGAGGAAGACTTCCTGATTTAAGAGGAGTATTTATTCGAGGGCTGGATTCGGGAAGAGGACTTGACTCAGGAAGAGTTATTAATTCTTATCAGGATGATCAGATTCAAAATATTACAGGTCATATGGCGGCTGATGTATCACAGAGTGGAAACATCGGTAAATATGTGAGTGGCGCTTTCGCGGATAGTGGTGCGTTAGGTGAAGGTGATGAAGGTCATAAAAGCAATGAAGTAAGGAAATATACGTTTGATGCTTCGAGAGTCGTTCGGGCAGGTAATGAAACAAGACCTAAGAATGTTGCCATGAATTATATCGTTCAGGCACAATGACTATTTTAATGATCAGGCGACTTTAAATGCGAAATGTACGCCTAATGGTCTGGTGGGCCGAGACAGTATGGGTAAGGTTTTATCCTGCGTCAGTGGTAAGTGGCAAACAGTAAGTGGTGATAGATTAAAAGGTATTTATATCACGATCACAGACCAAGTATCGGGCTATAAGTGTGTAATACCAAACAGCGATACAAGGGGATGTTCTTGTCCGGGATCGGCATATAGCTCAAAATACAGCTATTTATCAGGAACGTTAATTGCTCAATATGATGATCAACGTTGTGGTGGTGGAAAGAATGATCACTTTTATAGTAATTACAGAAGATTATATGCATGCCAATAATGTTAACTGTTGAGGAATTAATGTTTGTGTTTAAACATTATAATAAAATTTAACTTTTAATAAATAACATATTATTTAATAAGTGTCTTTTAAAGATATTCACTTTTATAATATATCTGATTGTCTAATATATAGGGAGTAGTTTACAGCTCAATGAGTTGATTCAGTTTTCCCAAGTGATATGTATTCTCTCCAATGGAGAGTTCCCTTTCGCTTAAATATTTATTTCATTTCAATAATGATTCAAGCTGTTCAGCAATTTCATCAATGGTATTTATCGATGTATTTAATGCCATTTTATCTACTAGTGTAGGGCTGAATTTTAAAATATCAGATTTAGATACTTCGTGCCAGATAGGCAGTATTACTTGGTTTTCACCGCTCATCTCTCTAGCAGTCAAACCATTTAATTCATACTCTGTCCAGTCCTTTTTGATAAAGGATTTAGATAAAACAACAACTCCAAACCGAGAATTTGCTAATCCATAATCGATTGTCTTACGTAGACTTTTACCCCAGCCTAAAGAGAATTCGTCATACCATACATTTATCCCTTTTGCTCTTAACAACTCAGCAAGGGGACGGACAAAACTATCTTTATCTTCTGTTGCATGAGAGATAAAAACATCATAACTCTCTGTAATATCTTCTGTCTCATCAGATTTGTTGATGACTGGACGATTGAATTTATGGCTTGGAGATATTGCCCTTTTCTGCGATTTCAATTCTCGTGTGATTTTCTTCTGGTAATCTAATTGTTCTTTTTCAAGTTTCTTCTGTGCAGCAATTCTTTTTTTCTGGTCATTCTCTTGCTCTTTAATGAGTTGTAATTGATAACGGTGTAAGTCTCCAGTTTTTGCTGTTATTTTTTTATTAATATCTGCTTTTTTAGAGTTACATCTTGAAATATCATTTTTATGACGAGAGATCTCTGACATTTTGGAATTCAATGTGCTTAAAGACGTTGACTTAGTGACGCTACGCTTTATTTGATTGATTTTTCCTGAAAGTTGGGCCTCTTTACGCTGCTCATCAGAAAGTTGTTTTTGTAGACTAGCAATGTCTTTTTGTATTCTGTTGATATTTGACATTGTGGTGCTTATAGACATCAGATTAACTCCCTGAGATATTTACATATGGTGTATACTACACTATCTATGTACTTGTAGGCGATTGATTTTACCTAATAATATATCGATAAAAGAGGACTTTCTTATCAGGCCGTTCTAACTCTATACTTCAACACCTGAACATGATCTCCATCATTGAAGTATTTAGCTCCGCTGGTAAAGGTGCTTAGTCTTTTGGCGTGCCCGGAATGACCACTGTTGCTTGGCGCGTGGACAATAAATTTACGCGTTTAGTTGGTCATTTTTGCTCCAATATTCTTAATGGATACATTCCCCTTCATTATGAAATTTGCATTAATATCATTAAAATGAATGTCATTTGATATTTATTATTAGGTATAGAAATGTGTATAATCGATTATTGTTTTATGTATACGATATGGTGGGAAATGAGTAATACTACAGATTGGATTAATACTATATCTAATGTGATGATCGCCGGTTCTGCGATTTATGGCGCTTTTAAGGCTAAGAGCTATTTCAAAAGTAAAAATGATGAGGAAGCATATAATGATGCAAAAAAATTGATATTTGAACTGTATCCTCAATATAGAATTGATTTGCATAAATTGGTTATCGTTTTGTTATCAATGACATCCAATAGAATTCCAAATGATATATTGGAAACTCGGTTGGAGAAAATTACCAATAGACTGTCTAATACACAAATTGAAATTGTAACTATCAGTTCCAATATTATTGAACGCCATAGATGGAAGATCAGAGATGATTTTAAAGAGTCTTTTGAAATAATTAAAAATATCTATAGGAATAAATTAAGTTTTCCATTGATTAATGTAGTGGATGAAGTAAGTCAAAGCACAGAAACTGAAAGGCAAAATATACTTAGAGAATTAATAGGGAAATGTCATGAAGGTATTGATGCCATTCATACATTCACAGATAATAAATTTAGAGTTGACGAGTATTTCGTTCGTCCCTTTTTAAACTAATTTTTCTATAAGAAGATTGATATGAGCAATGAGAAAACATCAGAGGAATCATTTTTCAATAACATAAAAAGTTCCTTGATCGAGCGTTTCACGACACCATTATATGTATATGTTATTTCAGCCTTCTGTATTGATAATTGGGATAAGATATTATTTATTATGTTTGGAAAAGGGAACATTGAGTATAGGACGTCGATAGTTCAAATGCAGGGCATTAACTTTTGGCAACCTATTGTTTACGGTATAATCATAACCATAATAATGCCTTTTTTATCAAGAGCGATAGAATTTTTCCATCTTAAGTCGGACAGATATTATCTGTACTCTTTCCTCCAAAAGGGACTCTCGTAGGGTAAAAAGTATGAACTCTATTAGAGTGATTGAACGTCAACATGAAATAGATATGATTGAAGGTGAAGCAAAACATAAAGCAGAACTTGAAAGAATAAAATCTGAATCTCAATACAATACGAGCGAGTTAAAAGAAAAACTTGAACGGCTTGATATAATTGTCAAAAACCAAGAAGCGCGAAATAATGAACTATTAGAAGAGCGAAAAAAATTAGATGATGATCTCAATAATATATTCATTTTTGTAGATGATCTTAGGGAAACTGTATTGAGAGTAAATAATGCTTATAATCGGTCTATGAAAGAAGATGATGTTGTAAGGAAAAACATGATATTATCCCCTATATATGAGATAATAAAGGAATCAAAATTTAAAGACATAGAGAAATATCTAGAAAAAGTCGGTGTTCTTGAAAATAGTTTAAAACGTCTCCGAGACTCCGAAAGAAAAACATTTTCAATGGATGAACTGGAAAATAACGACTATAATGCAATAAATGCCGATTAAGACACGTTATTTAAAATAATTACCTAAGCATGATAATCATGTTGTAAATATATAATACTGTATTGATAAACAGATTGGATACCACTTGTGCGATACCCCGTCCAATCATCCATCAATGCCACATGCTTAGCCCATAGCGTCCCATGTTGATACTCTGCTTTGGCCTTATCTTCTAACTGGTGAGCTAACGCATGTTCAATGATATTCCTCGCGTATTATGAGCACGGCCTCCCCACTGATTTTCAGAAGGCAACAATTTCAGGATGACAATGAGCTTTTAGCGTATCTTGTTCCTGTTAGACAAAATATTTTTATAAATAAAATCAATTAACTACACGTATACTGTGATTTTATACAGCAATGATGTTGTCTTTAGTGGAATTAAAGCTTATAAAGAATATGGTTATAACTGCATATAGTGATTTTTTTGATGAAGACACTACATAAGTGATGTTCATGACTTTTGGGGATGCGTTTAATGAGTAAAAATGTAAAACAGACTTCTGAAAATGTGGCTTCTACTGCTGCAAAAACGCTGACTGATCCTAATGCCTCAGCCATCCAGAAAAGCCTTGCGGGTTCCGCGCTTTCTCAACGTGGGACGTCAAACCAGACCAGTGGCAAAATGGAACATAAGGCATCATCGGCCCTTGATAACCCACGCTCTAGCGAATTAACGAAGCAGCTTGCTGCTTCGGTATTGGCACAATCAAACAAAGGGCGTAAGTGATAAGTAACAGGGAGAGTAAAACCTCCCTGTATTTTTTGATTTAATGCGATTGTTTGCTTGTAGGAAGGTGATTATTGTTTCTGAGCAAACTCAAACGGCCTGACCATCTCATACCGATTCTCATCAAGAAAATCAGCCCACCACTGGAGCATCAGGCGGCGTTGGTCAAGATGCTTGGCCTTATGGGTATAGGCTGCGCGAACGCTGTTACTTTCCTTATGGCTCATTTGAAGCTCTACAGCATCTTCTGACCATAGACCAGACTCAATTAAGGCACTACACGCCAGTGTGCGGAAACCATGACCGCAGATGTCCTGTGTGGTGTCATAGCCCATCTTACGTAGCGCCTTGTTAATAGTGTTTTCACTCATGGGTTTGAATGAATCATAACAGCCAGTGAAAATTAATTCTGCTTCGTTACCTTCTTCATAGGTAAGCTGGCGGATCTCTTTCAGTATCTTGAGAGCCTGCCTGCAAAGGGGAACGAAGTGCTGACGTTTCATTTTAGCCCCACGAGTCGAGTGCTTGACGTTTTCAATCGCTTCCCGCTGTTCGGGGATCACCCATAACTTACTTTTGAAGTCGATTTCCGACCACCGGGCGAAACGAAGTTCGCTGGAACGAATGAAGATCAGCAGATTGAGTTTAATCGCTAGCGTAGTCAGTCCACGACCTTTGTAGGCATCAATACGTTCAAGCAGTAGCGGGATCTCTTCCAGCTCCAGTGCAGGGCGGTGTTCCGTCTCTGGTTTCTGGACAGCGCCTTCCATATCATAGGCCGGATTATGACGCATAAGCTTTTGCTGGACGGCATGACGTAGGATCGCGGTGATGTATTGCTTAATCCGCATGGCAATTTCAAGGTAGCCGAGTGTTTCCGCTTTTTTGACCGGGACAAGCAGATCACCCGTATCCAGTTCTGAAACGTTTCTGTCGCCTATATCCGGGAAGACATAGGTTTCAAGGCGCTTCCAGACAGTATCGGCGTAATCTTCTGACCATTTTGTTTTGGTGGCAAACCAGCTTTTGGCGACGACACGGAACGAGCGGGTTTTATCCCGCTTCTCCTGAAGGACTTTTTCATCAGCCTGTTTTTTAGCGTTCGGGTCAATCCCCTGAGTCAGCAGCCTTTTGGCCTCGTCCCGGCGTTGTCTGGCATCGGCAAGTGAAACCGCAGGGTAAACCCCAATGGAAAACACCTTCTGTTTGCCATCAAAGCGATAGCCTAACTGCCAGTATTTTGAACCGTTAGGATGCACCAGCAGATAGAGGCCAAACCCGTCAGTGAGCTTGACGGCCTTTTCCGATGGTCTGGTATTTTTTACTTTGGTATCAGTAAGTGACATGACGGTTCCCTCCGCGTGCTGGTAAAACACAAATCGAACCAGCTTTACCAGCATTTTTACCAGCAAAAGGGTATGGCTTCGAGTGGTTTTTAGTGAACGAGAGTGAACCTGAAGAAGGGAATAACCAGTTGATATAAATGCAGAAAGCAGACGTCAGTGAACGTCTGCTTTCCTTAATTTGGCTCCTCTGACTGGACTCGAACCAGTGACATACGGATTAACAGTCCGCCGTTCTACCGACTGAACTACAGAGGAATCGTGTGAACGGGGCGCATAGTAACGATGTGCGATCGGCTTGTCAAAGGGGGAAATAAGGTTGCGCGTTTGTTTGCTGACAAAAACAACAAAGCGTTGAAGTTTTGATCTAATTTCTACTTTGCCCCGGCATGGCGCAACTTTGTCTGTAATTGCACAAGTCAAATGCTGTGACCTTACCGCAATGGCTATGTGCCGGCGTCTGATGAAACGTGAAAAACTGGCAGGCACTTGGCAAATAATTCTGAGACATAACGCCGTAGAGATTAAGGGCAGGGGGCAGAATGAACTTTAGACGTGAAATATTTTGTAAAAATGGTTGATACAGGCAGTCTGACGCCGGTAGCGGAAATGGCAGATAAATTTCTGGTGCAGGCGAAAAGATTTCCGTCAATATCATAGGCAGAATTATGGCGCATCAGCTTTTGGCGACGACACGGGACGAGCGGGTTTTATCGCGCTTTTCCTGAAGGATTTTTTCATCAGCCTGTTTTTTGCGTTCGGGTCAATCCCCTGAGCCAGCAGCCTTTTGACCTCGTCACGGCGTTGTCTGGCATCAGTAAGAGAAACCGCAAGGTAAACCCCAATAGAAAACACCTTCTGTTTGCCATTGAAGCGATAGCCTGTCTGCCAGTATTTTGAATCGATAGAGGCCAAACCCGTCAGTGAGCTTGACGGCCTTTTCCGCTGGTCTGGCATTTTTTACTTTAGTATCAGTAAGTGACATGACGGTTCCCCCCGCGTGCTGGTAAAACGCAAATCGAACCAGCTTTACCAGCAAAAGGGTATGGCTTCGAGTGGTTTTTAGTGAACGAGGGTGAACCTGAAGAAGGGCATAACCAGTTGATATAAATGCAGAAAGCAGACGTCACTGAACGTCTGCTTCCCTAAATTTGGCTCCTCTGACTGGACTCGAACCAGTGACATACGGATTAACAGTCCGCCGTTCTACCGACTGAACTACAGAGGAATTGTTTCAACGAGGCGCATAATACTGGGCCGGCTATAACGTGTCAACAGTAAAATTAACGCGCCAATTCAATTGGTTAATTAACCCACAAAGTGAGGAATTAATGTTCGGATTCCTCGCCGTAGTCGCCTTCGGCAGCGCTTACCCGTAAGCGCTGAGAAGGATCCTGGCGATAGAACTGGCAAAAACGCTGCCATAGTGCCGGGAAACGTGGAGCAAACAGTTCTGGCGCGCTGAAAAAATACTCTGACAACACGGCAAAACATTCTGCAGGGTCGGTGGCGGCATAGGCATCTATACTGGCAGCGCTTTCGCCAACAAGATCGATTTCATCCTGAATATTATTCATTGCCGCGTGGAGATCGTGTTCCCAGCCAGCCACATCGCGTAACGGGATGAAAGGGATGCCGCTGGCGCGATCGCCATTACGCATATCCAGTTTGTGCGCGACTTCATGAATAATGAGGTTGAAACCCGAAGCATCGAACGAGTCCTGGATATCCAGCCAGTTCAGAATGATGGGCCCTTGTTGCCAGCTTTGCCCCGACTGTACGACACGCTGGCTATGCACCAGACCTATGTCATCTTCCCATTCATCATCTACCACAAAGGGCGCGGGATAAATGAGCACTTCATGAAAACCATCAAGCCACTCAATACCGAGCTCCAGGATCGGTAAGCAAAAAATTAACGCAATACGTGCACTTTTTAACGAGTCGAGCTCAAATCCCTGTAGCGCTACCAGTCTTTTCTGCTGCAAAAAACGTTCGGCTAGCGCAATAAGCCGAGCCTGTTCTTGCGCGGTGAGGTTTACCAGAAGAGGTATAGCCAGCGCATCATCCCACGGCCAGTCTTCGTTCTGGGTTATTTCTTGTGCTTTCCAGGGCCACTTAATCATCGTTTTGCTCGTAAACTCGTCACTTGAACAAAATTACCCGAATAGGGTCTGTTAAAATGCCAAATTACCTGGCATCATTGCAATATACGGAGAGATGCCGGAGCGGCTGAACGGACCGGTCTCGAAAACCGTTGCGGGGGTAACTCCGCCGAGGGTTCGAATCCCTCTCTCTCCGCCACTATTCAAGCACTTACGTGATTTTCTTATAGTGATGAAAATCACGTTGAGAAAAAATGAGAAAATTCGGTGAGAAAAAAACGCCAGAATTTTAACTGGCGCACATCGAAAAGCTCAACGCTTCCTGTCCAGGGTTGGGCTCATTTTCACCTTGCGATCGTAAACAAGAACCTGGGATTCGGTTTTGTGGCCACTGTACTTCTGCTTGTCTTTTGCCGTTCCCTCATAGTCTGAAATCCCCTTTGCCTTTAGATCGTGGAAAGTGCAGTCAAGAGGACGTCCCAGATCATCCCCCGCAGCCTTTCGCGCCTTTCTCCACGCCTCGTTAAATCCTTTATAAGAATAACGCTCGCCATACATAGTCCTGATAACAGGGCCTTCCTCTCCCCATTCACGACATATTTCAACGGCATCACGTAAGCGATCTGTCCAGGATTTAATTTGTTTAACTCCGGTTTTTCCTTGCTGAATAAAAATTCCTTTCTCCAGTATTTGATTCCAGTTCATTTTCAATACATCAGAAACTCTGGCAGCACATAAATAAGCTATTTCCATTGCAGCCCTGACGGCTGGCGTTGCGTTATTATATATCGCTCTGTACTCTTCATCGGTAATATATCGATCGCGTTGAGGCTTAGGAAACTTATCCACACCAACGCAAGGATTACCAGGAACATAACCACGTTGATAACTCCAACGAAATACGCGCGACATAGAGCTGTGTTCATGATTCGCCTGGACACGGCTTTTTTGCCCACGGGCATCCATATAACGCCGGATATGTTCTGGCTTTATTGCTTTAGCTTCGGCATCACCAAATACGGCAAGTATATATTTCTCATGTGCCAGATAATCTTTCTGCGTTCTTGGGGCCAGATCAGCATAATCGGCACTGGCAAGAAATTTTCGCCATAATTGTGTGAATGTAATTCTGTTTTTTCTACCCTCAACTTTTTTTTCGTAAGCCACCCAGACCTCAGCTTTAGTTGCATCAGCTGGAGCTATATTTTCTGTTGATCCTCCCGGTTTCCAGTAGTAACCAGAAGGGCGAAAGAATACACCCTTTGGCATCCACTCATTACCGGGCGCACGTTTGCGACCCATGTTAACTCTCTATAGCGTCAAAATTCATGCCTGGTACAGGCTGATACCCTGCTGGTGGAAGAAGGCGCGAAACTGGGTGATTTATATGATACCAGGTTGTTCTGATTGAACCGTCCCGGCGTTCTATAAAATAAATACCGTTCAGCGTTAATACTTCTTTCTGGAGTGACTTCTGGCTTGCTCCTGTAGCATCTTCCAGTTCCTCCTCAGTCAGGAAGCGATCGCTCATGAGTTGCTTCTCCATCTAACCGGCTGCACCCGGTTTAAATCAGATATTGTTGCTGGTGGGCGGGATCAGTTTCTGCCAGATAGCTGAAACGTATTTCGCCTGATGCCTGGCATCAGCCAGCGCGTTGTGCATATCGCCTTCGAACGGCATGTCACGTTTGGGATCGAAGCCGATTGCACGACCCAAAGTAACCATCGTGCGGACGTCATGATCGTTCCGGTAATTCCACAGGCAGGGGAGGCTGGCACGTTCGAAAGCGCCACGCAGGATAACGTTATCGAAATTGGCACCGTTACCCCATACCTTCAGGTATTTCAAATCGTCGGCGTGACAGGTGATAAAGTCATTGAACTCAATAAGCGCGGTCGTAACAGATACTGCATCAGCGCAAATAGCCGCTCGCGCTTCCGGGCTTTGTCTTAACCACCATAGAATGGTATCGCCATCAGGAACGGCACCTTGTTCCATTGCGCTTTCAAGGCTAACGGCGGTATAGAACTCAGGTCCAATTTCACCACTTTGCGGATCGAAGAACACAGCACCGATGGAGACAACAGGCGCGTTAGGTTTTTTACCCATCGTTTCAAGGTCGACCATTAAGTTGTTCATTCGTTATTTATCTCCGGTTTTGGTGCTGCTGGCACTGGCATCCAGTGTGTAACAAAGATGTGCTCAATACAGTTCATCTGATTGCCGCCTCGCATGTCAAAAAATAGTCCAGAATGCTCATCAAAATATGAAACATAACGGTATCCCAACTTGTTATGAACAATTACTTCTTGCTCGTCTTCTGGCATCCGCTCACTACAGCTTATCCAACCATCCGGAATTACCGGAGAGTTGCCAGCATTTGGATGCAGAGCGCGAATGCCTTCGGCCAATTCTTCCAGGGTGGATGAATATCCGCGTTGGGCATCATTGCCGAACTCGAACGCTCCGGTATCAGGATCAGACCATCCATGCTCGCTGTCGTATGCCTCACGCTGCTGATCTATCCATTTGGCGGCGGCTTCAACGCCATCGCGATAGAAAGTGACTACCGGCGCTGACTGTTCTTTGCCAACCAGACGCGTTATTTCTGATTCCAGAAGTGAACCTAAAACTGTACTCGTGCAGTGCTCAGCCCATTCGTTGTTTTCCAACAGGCTGATGATATTTAGCACATCATTGTAAACGCCAGTTTCCTGTACTGGCTGGGCGGTGTAAAAATACGGTCTGATAGTCCACTTTTTATTCCAGAAATCTCGCGTTTTCTCGGCTTCCTCAAGCGTTGCAACACTACAGCCAACCTTTCCGCACTCTTTGATTACGTGATATCCGGCTGGCTCAGCTTCCAGCGATGCCAGCGCACGCTTCAGCACAATAAGAACTTTGGCGTCGTCATCGCTCAGGCCAAACGGAATATCGTCGCGAGTGTTTTCAAATTCAGCGATAGTTTGCTGTAGCCATTCTTTGGTAATAGTGGTCATGGTGTACTCCAGTTATCTTCGATAGCCACACTAAGTCGGTGCAGCCAGTCAGCTAATTTCAGCATTGCTTCGCGTTCGCTAAGCCCTTCCGGAAAATCTTCAAGTTCAACGAAAGGCTTAAACCGTCCGAAGGCGTCGTTCTTAACTGTCAGCTTCTGTTCAAGCGTGGTCTTCTTAACCTTGCTGTGATGCCGTAGAAGGTAAACAGACTGAGATTTTTCGGTTTCCGGATCGTATTCGTAGGCAGTTAGAATCATCTGGCTGCCGCCGCGATTTAATCCTCTCCACATAGTCACTTCCCCTTGCCGATGCCAGCGGCGTTTAATTTCTGCCGTAATTCTACTAATTTGGTTTGTACCGCTCTTTGCCTGTGCCATTGCAGGACGAGCATTTCGGACTACCGTTGTGGTCGTAATAACCACTGCCGTTACACGCCGTGCAGGGACGCAGCTTCCAGCCAAAAACGAAACGTTGGTAATATTCAGTTCGACGAACTTTGCGTTCGTGGAAGTTACACATCTCACCCCCTCTCAATACGCAGCGCCTGTTTATCGATGTTGCTCATTGGGCTGTCTCCTGTGGATAACAAATATCGTCGAAATATTTTTCTGCTACGCGCATGTTGAAGTGATCGAGATTCATCTCCTTCACCTGGAGTTTTGCCCCAACAATGCCTGTGCATTGATTGACGTAATCCCGGTTTTCTGGGGATTCCGTTACCCACTCCATAAGGTTTTCGGTGACACTTTTTAAGCAACGTAAAGCGCAGTCCAAATCAGTAAAATGCTGAGCATCAGTTATGCAGGAGACGACATAATACGTGGTGACTTTTGGCCCATCAGCGCGTCGTTTAAGCTCTCTTTCGATAGCGTTTTTCAGATCAACCAGTTCATGGTCATTGAGTTTGTCGATGTTGCTCATTGGGCAGCCTCCATTAGTTGCATTACGTGTCGCTTGTGCTCTTCGCTTTGCGGAACACCTGTAAAGTTCACCGCCATGAAATAAGCCAGGCGATCCTTGTACGTCACCTCATCCAGCACAACTGCTGGGAGTGGACGACGCCCAAATGCCAATTGCTCCGCACGAGTCATATCTCGCCAGTAAAGCGGACCATCAGCCAAGATGATTGGAATCTCATTGGTGATGAGTTTTTTCAGGGTTGTGAGACGCTGCTTGCCGTCAACAACTTCTATGTAAGGAAGTTCACGCGAACACCAGTCAGGTGCCTTTGCCAGCGCCACTGAGCCGATAGGAAAACCAGAAATAACTGCGTTTAAGAATGCCTGCTGCTCTTCATGCCCCCAGACATACCCGCGCTGATAATTGGCATCAAAATCAAGTTCACCACCAATGATCCAGCGAATGTACATATCAACCGGGTACTCACCGGTGCGCGCGTCGAATACCTGAGCATTGCGAATTCGGTTGCTCATTGAGCTGCTCCTTCAGCTTTCTTTTCATCAACGCTCCACGCCGTAGCCAGCGCACTTGTTACCTGAAGAAACGAGTGTTTAACCCTCACGGTGAAAGTTTCTCCGGTGGCTGAAATAGTCTCGATGACTGTTAACTCGCCGCCGCTTTTGTAGTCTGGATAGAACTGTGTTACCAGATTGCTATCAACAATCACCGATCCGTCCAGGGTGTACATTTCCAGTTTCATGCTGCACCGCCTTCAACGCGCTCCCACAAAAGTCTTGATCTGATTGCCTTCACTACAGACTCTTTATCTTTTATGCAGCACATTGGCGTAGCTCCATCAGTTCATTAAAGCGGGCCATAAACAGGCCGAAAGCCTGACCGGGGCGAAGTGGGTAGATTTCGAATAAATCTGTCGGGGGGATACCTTCCAGTATTACCCAGGGAATACTGTCATCAATATCCAGATCACGGCGTTCAGTTGCCAGCATGGTCAGATCTGCATACTTCACTACGCTGGCTTCTTCCAGTGGCAAGCCAAACTTAAAGCGGATCAGTTGATCGGTACGTTTCTCAATCTCGCGATAATCAGGCAGTAACGCTTTTAATGGGGCAGGGATATCCTGGCAATACGCTTCGGCTGCGTCGTGCATCAGGGCTTCAAAGGCAAACTCCGGTGATACAAGCTGGCTGCACAGTACGGAATGCTGCGCCACGCTATAAAATTCAGGAAGATGTCCGGAGAAGCGGCAAATATTGGAAAGCGCCACGGCGATATCTTCAATATCAATGTCGTCAATAGTTGCGCTGAGATAATCAAATTGTTTACCTGAAAGTGTTTGAATAAAACTCATCGTTGGTTCTCCTTATAATTTATTTCGCGCTGCACCGCGTGAATTTTGAGTACAGCAACCCAACCCACGATGTGGGGTTAATTGCCGCTATGAGTTATCGCTTGGCTTCGCCGCCGAGGGCAGCCGTTAAATCAGAAATAAGATTACTGAGTTCGCCTGTCATCAGCGTAATGTCAGCATCAAATCGCTGCACGACATCTTCGCTGTCGATATCGTCGTTCTGGCTGATTAACTGATCTGCAAATTTTATGCGTTTCAATATACCGTCACAAGAAAGGGTGAAACTGATACGCTGCTGCCATTCCATTGAAATCTGGGTAACTACCTTCCCAGCTTCGATATGGGTAAGAATTTCATCGCAGGCAAGATCCTGCTTTTTAAACCGGCCTGTGCCACCATCTTCGAGAATAGCTTTAAGGACCGCTTCATCGCCGATGGAGAACCCGGAAGGAGCCGCTTCGCTACGAACCCACTCAGTTAGCGTGAGCTCGATAGGGTTTTCCATCGTCAGCGGCACGACTGGCAAGGAACCTAGGGTTTTACGAAGCAGGGCGAGAGAATCTTCTGCGCGCTTGATGCTGGATGTATCAACAACGATAAACCCGGCTGCAGTGTTTATCCAAATACGAACCAGACTGTTTTTAGTAAACGCCCTGGGTAACAGGGAATGAAGAACCTCATCACGAATAGAGTCTTTCTCAGTTTTCTTAAGACGGCGGCCTTGCTCTCGCTCAAGCGTGGAAACCTTCTTATTAATCTCATCGGCGATCGTTTGTTTAGGTATGATTTTTTCTTCACGACGAATAACCAAAAGTAACTGGTTATTGACTGCATGATATAGCACATCTGAATACTGGACTAATGGTGAAAACCATCCGCTTTTTGCCATATCCTGGCTTCCGCATGGTGAGAAGCGAAACAGCTCAAGTTTCTTATCAAGAGAGTCTATGTCGATGTTAAAGTCGCGGCTGAAGCGATATATCAGCATATTTTTAAAAAATGGGTTATGCATTTTGTTTCCTTAACGCCTCTGCACTGGCGTTTTACGTTGGTTTCTCCACAAAACAGAAAAGAGCACCTGCTGTAACAGCTTTCCGGGTGGATTGGGTAATGAGCCCGTCGCGCGGAGATGCTCTTTTCTGTTGTGTAAAAAGGTCGGCGTCACGGCAGAACACTGTCGCCTTCCTCCTGTTGTTGGAAGAGCCGGACGCCGACAAGACTTCACACAGCAATAACGTTGTGGTGGGGCTGTCACTCAGGCGCATGGTCAACCTGACAACCCGGTGTCCTACTGGGTACAAATGGAGAAAAACCCGCCATACTTACCGCCGCGCCATTTCGCGGATTACCACAACGAAGAGAGCACTGCCGGTGTCCGAATTGAACGGACCTTTTCTCTGCCCAACCCTCCTGACTAAACAGGACTGTCTGGAATCGAACCAGCACTTATGCCTTGCTCGTCAATGCTCTCATCGTTGTGTGCCTGTCTTTTCACCACATCAGGCTCGGTGGACCTTGCTATTCCCCAACAGTAAGGATTCGGGTAATCTTTTTAATTCCCCAACAACATAAGGGCTTAACATGTCTCAGAAGGATGATATTCCTGTCTTTCCCGTAACCGGCTGGCAGGCTGGACCGCTTCCTGGTTACGACGCTCTGGTAGTGAAATTCCAGTTTCTCTCATCACCGATGCAACCAATTGAGTCTGCTCAGGAAACGCAATTTTTAGTACTTACTCCTGAGATGGCTGAGAGCCTGGCTTCAGACTTGCAAAGGCATATTCAGGATTTGCGAAATTCCGACGTTCACAGCCCACAAGAAGGCAAGCACTAATAAGGAACACCTGAACTACTTCATTTCCCTTAAAGCGCCGTTGGGTGATGGCGCTTTTCTTTGCATTAACCAGCATCATTCCCCCTTCGTGACGTTCAGTTTTACTGGCTTTATCACGGCTGCGTAGTTGATAAGAATGTTTACGCATGAAACAACGCACTCGGAACAGATAGCTGGCTCGTCCTTACTTCCTTTAGCGATGAGCTTTTTTGCATCCAGCTCACTGACCCCGCAGAAGGAGCATGTGTAGTAGTTATTCATCTGAACTCCTGTGTAATGCATCATTGCGAATCATCCGGTCATTCGTATGCCACCGGCGGCTACTTCGTGGGCGTCCTGCCTGTTCGCTGCTCTATGAGTGCAAATTACATTTAAATTGCACATTGCGCAAGTATAAAATTGCGATATATGCAATTTTGAGTCAAAAAAAAAGCCACTATAATGGTGGCCTTGTCGACGCTTTCTATTAATTGTGTCGTTTGAGTGACTGCGTCTGGCTTATCAGAACCTTGCCAAAAACACCGAACCTGCACTCGTTGTCTTTGGTAATACTCCATTCCCTGTAGTTAGTGTTATCAGATATCACCAGTAATTTATCGGGGATCATCTGTAGCCTTTTTACGTATATTTTATCATCAAAGCCAAAGACATAGATGCCATCACCATCGAACTGGTTGATGCTTATATCGACAAAAATAAGATCTCCGGGTTCAATTGTTGGTGCCATGCTGTCACCACGCACGTTAATCACTTTAAGCTCAGCGGCAGGACGCCCGCCGAACATAGCTAGTGCTTTGTCCTTGTTATATTCGATAGCATGGATTACATCGATAACATCACCGCCCTGAATGAGTCCATTACCGGCGCTTGCACTGACATCCAGTATCTCGATACGGAACAAATCCTTCACGTTAGCTGAATCCTTCCTCATATCACTGTGTTTACATACAGTATTACCTTTTGGGTCTGAGGTAAAGAGTTCTGCTATATCAACACTTAAGCAGTCAGCCAGCCTAGAAAGTGTTTGTTCGGTAAATTGCTTTTGCTTGCCAGTCTCCAGACGAGAGATGTTTGCGGCATCCACGCCGATCGCTTCTGCTAGCTCAGCAATTTTCATGTTCTTCGCGCGGCGAAGTTGTCTGACACGGTTTCCTATATTCATGCGTTCATTACATTAATTTTTTGCGCATTGTGCAAATCAACTTGCGCAAGTTTGCTGTATGAAATAACATGCGACATACGCAAAAGAAGGAGGTTTTATGCAATCACCATTGAGAAAATTGCGGAAATCGCATGGTTATACGTTACAGCACGTCGCTAAAGGGGTTCAGGTTGATCCTGCAACATTAAGCCGGGTTGAAAGATGCGAGCAGGCTCCTTCAACAGAGCTTGCTGAGCGCCTGGCTCAATTTTACGCCGGAGAAATTAGCGAGATGCAAATTTTGTATCCAAACAGATATCAGCTTAGTGATTCGGCGATTTGACCGCCACCACAGCAGAAGGAGTAGATCCGTGGGACATGAACCTGAATGGAAAGTTGAAAAGCAGCCCCGCTGGCTGGTGGCTGCGATTAAAAAGACGATTTCCAGTCTGCATGGCGGTTATGAAGAAGCTGCGGAATGGCTGGATGTCACCAAAGATGCTCTGTTTAACCGCCTGCGTACTGGTGGTGATCAGATCTTCCCGATTGGGTGGGCGCTGGTACTGCAACGTGCCGGAGGAACCTATCACCTGGCACATTCAGTAGCCAGGGCATCAGGTGGCGTTTTTGTTCCGCTGGCAGATATGGAAGAAGTGGATAACGCAGATATTAATCATCGCCTGCTGGAAGCGATTGAGCAGATCACCAGTTATTCCCAGCAAATCAGGGTGGCTATCGAAGATGGCGTTATTGAGCCACATGAAAAAGCCGTGATTGATGAGGAGTTGTATCAGGCGATCGCAAAGCTGCAACAGCATTCGACACTGGTATACAGAGTTTTTTGCGTGCCAGAAAAGGGTGACGCCCGCGAGTGTGCAGCTCCGGGCGCCGTGGCGTCAAATTTTATGGAGAAAACCAACGCATGAACAGTTTAACGGTAAATAACCGTTTGTCGCAACAACCGGGGATGTATGAGTACCGGCCGTTGCGTCATGAATGCAGATTATCAAATAGCCTGGTCGTGCGTAACCACAGGGAACACAGCCTGACCGTGGGGGATGAATCGTGCAGGAACTTAACCGCTGGTTTCGGGATGGAAGGGGACTTTATGTCCATGTCATTCGCTGGGAACCAGAAACTGAGCGCGTTATCTATCTGCGCAAGGGCTATCCGCATGAGTGTTTTAGCCCTTTGTGGAAATTCAGGCGTGATTTTGTTGAGTGTGAAGCGCCAGGAACACATTGATTCTGCAATTCCGGGACGTTACACTGTTCAGGCACCTCATAAAGCGGGTGCCGGGCGTGGAAACCCGGAATTCAATATAGAGCACAACCGCGCTCATGCGGTTTTTTCTTGTCATGAGCATTGTTACGCCCAAATTATGGTGGGGCGTGCAGGGCCAGTTTCGGCTGGGCCGGGTTCTATGTTGACCGGTATTTCCACCCCTGTACGTCTCACCACCTATAAGGTCGTGGAAAGCCTTGGTGGTGAGTTCATTGAATTCAACATAGAGGCTGCCACTATGGCTACTGTCCCAACCCTCGCTCAACCTGAAATTAGAATTATTAACGGCCAAGCCGTTACTTCCTCCCTGGCTGTTGCCGACTACTTCATCAAGCGTCACGCTGATGTTATCCGTAAAATAGAATCTCTCGAATGTTCCACTCTATTTCGTAAACGCAATTTTGCGTTTACATCGATTTCAATAAATCAGCCCAACGGCGGTACTCGCAAACTCCCATGCTATCAAATCACACGCGATGGTTTTGCGTTTTTGGCAATGGGTTTCACTGGTAAACGTGCTGCTCAGTTTAAAGAGGCATACATCGATGCCTTTAACCAGATGGAGAAACAGCTTTCAACTCCATCGGTGCTGAGCGATGCAGCACATAATGCCAGCGTTCTTTATTCCTACATTTCATCCATTCATCAGGTTTGGTTACAGCAGCTTTATCCCATGCTGGAAAAAGCGGAATCTCCGCTGGCTGTAAGCCTGCACGATCGCATCAATGACGCTGCGGCGCTTGCGAGCCTTATCAATATGACACTGAACCGTTCAGAGGTAAGGGGGCGCAAATGATCCGGAATATTTTTAAACGGTTCACCAGCCAACGTTTTCATTGCCCTCGTCCAGGACAGTGGTACAGCACACCAGAAGGGTACGTTCTGCGTATTAGCCTGGTCGATCGCGAATGTCAGAAGGTTGTCTGTGAGCCTCTTGGGCGTAATTACCGCGTCAACATGCCGCTTATTGCCTTTCGTTCCGGCAAAAACATGAAGCATCTCGGAGGTGCTGCATGAGTTCCCTTATTCAATTACTCGATCGCCCCATCGCCTACAACCCTGCTTTTGCAAAACTGAAAGCCGGGAAGGTAAAAGCTGGCCCGGTTGCGGCAGTATTCCTGTCCCAGCTTGTTTACTGGCATAACCGGATGGATGGCGGCTGGATGTACAAAACACAGGCTGATATTGCCAGTGAAACGGCGCTAACCCGCGACGAACAGGAAACAGCACGTAAACGTCTGGTAGCACTTGGTGTACTGGAAGAAGCCCGTCGCGGTGTACCTGCCACCATGCACTACCGCATCAACACCGCACGGCTTGAAGCGCTGTTGCTGGAAACGGCGAAGCCAGTGAAAAAGGGCGCTCAGGAGAAAACCAGATTGCGGGACTTCCAGAATGTGGAAACCCCGCAATCTGGATTGGTGCAACCCCGCAAACCAGATTGCGGTGATGCCGCAAACAAGAATGTGGAAACCCCGCAAACAAGTACGGGGCAACCCAACGAACAAGCATGTGGCGATCCCACAATCTTTCCTACAGGAGATTACACAGAGACTACTCAGGAGATTACACAGGAGAGTAAAACCCCTTTTTGTCCGGTTGCTGAGCAACCCGACCCCGAAGTGACGCTCACCGATCAGGCGATTGAGGTTTTAACCCACCTGAACCAGGTAAGTGGCTCCCGGTATCAGAAGTCAAAAACCTCCCTGGAAAACATCCGTGCCCGACTGCGTGAGGGGTACAGCGTTGCTGATCTGCAACTGGTTATCGACCTGAAGCATGAGCACTGGCACGAGAACGACGAGCAGTACCAGTACATGCGCCCGGAAACGCTGTTCGGTCCGAAGAAATTCGAGAGCTATCTGCAAAGCGCTACCCGCTGGGATCAGAAGGGGCGGCCTAAACGCGCTGACTGGGGTGCGAAGAAGCGCGATGTGATGGCTTTTGGTCCGGTTGATACAACGATTCCGGAGGGATTCAGAGGATGAGTCTGTTAGCAAAAGTGCAGGCGTTTATCGAGCTTAATCCGGGGCTGACATCAAATGAGATTGCCGATGCTTTTCCTGAATACGCACGCTTTGATGTGCAGCGTTCAGCGAGCAAGTTGTATCGGTGTAAGCGTGTTAACCGCCGCCTGGATGGAGATGTATTTCGCTATTACGCGGGTAAAGACGAGGCAGTGATTTTGACGTTACGACAGAAAAGGTCAGGTCATACAGGTTCGGGTGATCCGATGGTGATTGCAAAGCTGGTAAGCCGCGCTGAAGAACTGGAATCCAGAGGGTTATTTAATCGTGCATCGATAGTGTGGCTGGAGGCATTTAGCGAAAGCCAGTTTATCTACGAACGCGAGGAATTTTTACGCCGCCGTCAGAAGTGTCTGAACCGCATCAAAAAGAGAATCAGACCCGTAGAGCAGGTTTATCTGGCAGGGCGATTTGTGGGGAACGTGGAATGACCAGTGAATCCGTTTGTATTGAAAGCAGTGATGTAACGATATCTGTTGATGAATCCGCTTCGCGCACCTGGCGTCGCCCGTTCCTGAAATGGGCAGGCGGTAAATATTCCATGTTACCCGATCTTTACCAGGTCATTCCGGCAGGTATGCGCCTGATTGAACCGTTTGTCGGCGGTGGTTCGGTGTTTCTCAACTCAGACAAACACGCCTGCTTCCTGCTGGCCGATGTGAATACCGACCTTATCAATCTGTATCAGATGCTGGCTGTTGTACCTGGTGCGGTGATAAGACATGCTAGGGTAATGTTTGACCGTCTCAATGACGCTGAAAGCTATATGGCGCTACGGGAAGAGTTCAATGCTCAGGTGATGGACGCTCCGGAACGCGCCGCCGCTTTCCTTTTCCTTAATCGTCACTGCTTCAATGGCCTGATCCGGTACAACCGCAACAACCAGTTTAACGTTGGCTGGGGCAAATACCCGTCGCCTTATTTCCCGGAAGAAGAAATCAGGGCATTTACCGAAATGGCGCACAACTGCGTATTCATGGCGGCAGGATTTCGCCGGACGCTGGCACTTGCGGGAGAGGGTGACGTTGTGTACTGCGATCCACCCTACGAACCGATGCCCGGCAAGGATGGTTTTACTCACTACGCCGCTGGTGGCTTTACCTGGGATGATCATATCGCGCTGGCGGAATGTTGTGTTGCTGCTCATCAGCGAGGTGCCAGAGTCGTGATCGGCAATTCCACATCTCCGCGTGTTATCGACCTGTACTCGCAGCACGGCTTTGAAATCCGCTATATCAGCGCCCGCCGCTCAATATCAAGTAAGGGCAGTACCCGCGAGAAAGCGAAAGATCTCGTGGCGATTCTGTAGGGGGCGGCATGAAACTGACATTGCCATTTCCACCCAGCGTTAACACCTACTGGCGGGCTCCGAATAAGGGACCGCTTAAAGGTCGTCACATGGTCAGCGCCAGCGGCCGGAAGTATCAGAGCGAGGCGTGCGCGGCAGTGATTGAGCAGTTACGCCGTCTGCCAAAACCTTCAACAGCCCCGGCAGCGGTGGAAATCACCCTGTATCCGCCAGACAAGCGGATCAGGGATCTGGACAACTACAACAAGGCGCTGTTTGACGCCCTGACCCACGCGGGTGTGTGGGAAGACGACAGCCAGGTGAAAAGAATGCTGGTGGAGTGGGGACCAGTTTTCCCGAAGGGGAAGGTAGAAATCACGATCACGAAATTTGAAACAGGGGCGGGTGCAGCTGCCTGAACATGGAGAAAGAAGCATGAATAATTTAATGGTCATTGATGGTATCGAAGTTCGCCGCGACGTTCATGGGCGCTATTGTCTTAACGATTTGCACCGGGCTGCGGGTGGAGAGCAGAAATACCGTCCGAAGTACTGGCTTGATAATAAGCAAACCCGTGAGCTGATTGAGCAACTTTTCACCGAGGGCGGAATTCCACCCTCGGAACAAAATCAATCTGTTAGCTTTTTTCAGGGCGGTAGTGATACCCGAAGTTTGGCACGTGCTCCAGTAAATACTGTTCGCGGTGGTGCTGAACAAGGTACATACGTATGCAAAGAACTGGTATTTGCTTATGCAATGTGGATCAGTCCGTCTTTCCATCTCAAGGTGATCCGCACGTTCGATCGGATTACCAGTGCGCCACAAATATCTTCTGGTATGGCTGCCGATAAGATGCAGGCGGGGGTGATTCTGCTGGGTTTTATGCGCAAAGAGTTAAACCTGTCCAATTCATCGGTACTGGGCGCGTGCCAGAAACTCCAGGAGGCAGTGGGACTACCTAACCTGGCGCCACAATATGCCATTGATGCTCCGGCTGGCGCGCCGGATGGTTCAAGCCGCCCGACGCTTGCACTGAGCGCGCTGTTAAAACAGCATGGTATCCGGATGACGGCTAATCAGGCGTATCAGCAGTTAGCAAAGCTGGGTGTTGTTGAACATCGTGAGCGTTACAGTCGCTCCGCGATTAACGGCATTAAAAAATTTTGGTCGCTGACGGCGAAAGGCTGCATGTTCGGCAAAAACATCACCAGCCCGGCAAACCCTCGCGAGACGCAGCCGCATTTCTTCGAATCCAAATTCCCTGAGCTGCTGAAGCTGCTCGATACCGTTCATTGAGGTGATCGTGAGAGCGTTACTGACCCCTGAAATTGCTCCTCGTATGGGCGTTGTATTGTTCAGGCCGGGATCGGAACTGATGCCCCTGTTTATGCAGGGGCGTGTTCTGCTTGAACCAGAGCCGGAACAATATTCATCTTTCGCCTGCGGCGCGGTCCCGGCGGTATCACAGCCGCTGGCGGATGATCCTGCTGTTCGTGATGTGTTCCGTAATGAGTCGGTTATCTATCGTGCTGGTGGTCTCGATAGTCTGGAAAGCTGGCTACTCCGGGGGAATGGCTGTCAGTGGCCGCATTCAGTCTGGCACAGCGAACAGATGACAACCATGCGCCACGCACCGGGGGCAATCCGACTGTGCTGGCACTGCGATAACCTGCTGCGCGAACAGTTTACGGAACGGCTGGAATCAATAGCTGTGGAGAACACGACAAAATGGGTTTTATCGGTTGTTTGTCGTGATCTGGGTTTTGACGATATGCACGCAGTCACGCTCCCGGAACTGTGCTGGTGGATGGTACGCAATGACCTGGCAGAAGTCTTACCGGAGAGCGCTGCGAGAAAAGCATTAAGGATGCCGAAGGCAATTGTCCAGTCAGCTACCCGTGAAAGTGAAATTGTCCCCTCGGTGCCGGCCACCAGCATTGTACAGGATAAGGCGAAAAAGGTACTGGCACTCAGGGTTGATCCGGAATCGCCGGAAAGCTTCATGTTACGTCCGAAACGCCGTCGATGGATCAATGAAAGATATACCCGCTGGGTTAAATCCCAGCCGTGCGCGTGCTGCGGGAAGCAGGCGGATGATCCGCACCACCTGACAGGCCACGGTCAGGGAGGGATGGGAACAAAGGCGCATGACCTCTTTGTGCTGCCGTTGTGCAGAACGCATCACAATGAGTTACATGCGGACACCGTGGCATTCGAAGAGAAATACGGCTCTCAACTGGAGTTGATATTTCGTTTTATCGATCGCGCGCTGGCGATCGGCGTGCTGGCGTAAATGGAGAACACGCATGAACCTTGAAGCCTTACCAAAATATTACTCACCAAAATCTCCAAAATTGAGCGATGACGCTCCGGCGACAACCTCCGAATCTTTGACGATTACGGATGTAATGGCGGCGCAGGGGATGGTGCAATCGAAAGCACCACTGGGGTTTGCTTTATTCCTGGCAAAAGTTGGTATTCAGAATCCTGACTTCGCGATTGAAGGGCTGATTCATTACGCGGTGGCACTGGATAACCCGACACTGAATAAATTGAGTGAAGAAACTCGGTTACAGATTGTTCCTTACCTCGTGAATTTTGCATTTGCTGATTATTCCAGATCTGCTGCAAGCAAGGCTCGCTGTGAGCATTGTGCTGGTACGGGATTTCATCATGTATTACGTGAAGTGGTGAAACACTCCAGAAATGGTGAACCCGTCATCAAAGAGGAGTGGGAGAAGGAACTATGTCAGCATTGTCATGGTAAGGGAGAAGTCAGCACGGTGTGCAGAGGGTGTAAGGGTAAAGGTATTGTCTTGGATGAAAAAAGAACTCGGTTTCATGGCGCGCCTGTTTATAAGATTTGTGGGCGTTGCAATGGAAACCGGTTTAGTCGTTTACCAACCACACTGGCGCGGCACCATGTCCAGAAACTGGTACCGGATCTGACGGATTATCAGTGGTACAAAGGATATGCAGACGTCATTGATAAACTGGTTACAAAATGCTGGCAGGAAGAAGCATATGCAGAAGCGCAATTAAGAAAGGTGACAAGATGAAAGATTTTCAACGAAGATAGCGACATGATGCTTGCATATTTCAAAAAATATGGATAAGATTCTCCCAACGATGGGCTTTGTATGTCTATCGTTGATAAGTCTCAAGAACCCGCCTCCGAGTGGGTTTTTTATTTGTGATCACTTTATTTTTTGTCTTGCTAAGTTATTGTATGGACAAGAACTAAAATTAAGTGGTGACATTGTGCTCTCTAATAACGAACGTTGGGTTTCCTTTTTTGACTTTGCTTTTACGCCTACACACGCAGCGGCGCCGAGTATTCCCATTGAAGACATACTCAAGAAATTGAAGGTACTGGTGAGCTCAGGGAGTGCTGTAAAGTTATACAATCATAGGTCTAGAGCGCTTAGGATTTCGGAGATGAAATATTCTATTGGGGATAGCCAGGCGACTCTACTTATCCAGCTTTGTGATAAAAATGGTTCTGACCCTGTTTTTGGTGAGTTAACAACAGGCAACCTTAGAGTAGAACCTAAGCTTGCCGGTGAAGGTATCGCAGTTTCTTGTCACATTGTAATATCCACAGATGTTGTCAAAAACACTGCCGATCACCACAAAACTCTCGTTGAATCTGTCCCCGGTATCAGTAAGTCAGTTCTTGAGCCATTTTTAAATGCTATGCTCAGAGAAGCCTTCGCTGGATGTGAGTTTAAAAATCCTGCAACTAAAGGTATGTGCCAGCACCGCCCAAAGCTGGAAATCTATTCTCATGGTTCACAAACGCTGATGGATGCATTAAAAGGTGCAAAGATTCATAACGTTAAACTTGTGAGTACAAGAAGGAAAGGTGGATTGGACCAAACGGCGTACACTGAGCTCTCAGAAAGGTCCGTAAAGTATAAAATCATTAGACAGCCGCCATTGAAAGATAAAGAAAGGTTGTTAGAGATTTTAAGAAAGAAAGGGCAGCAGTCTGGATATACCAAGGTTTCAATTAGTTACTCAAAAGATGGCAAGCAAGCCAGTTTGGATCTTGACCGTAACGAAGATGCTGCCACAAAACTGTTCACTAAAAGTGAGAGGGTAATATTAGGTAACCTCATCAACCAATGTGAAAGCACAGTACATCTGCAGCTTGAAACAAAAATGATAGGGTTGCTCTAACGGGAGTTTCATATGAAACTTTTTTCACCGCTGAGTTATCTCCGCATCAAGCATGAGGAAAAGGACTGGTATGATTACAAAATACCAGCTGCAGTGTCTCTAATCGTCACTATTGTTTATTATTTTCACGCTAGCAAAATTTCTTTAATCGAGACTAACGGACTCCTGCTTCAGGTTAATGGGTTACTTCAAGTCTTGATTGGTTTTTATATCGCAGCACTGGCTGCGGTTTCTACTTTTTCTAGCTCTTCGATCGACGAAGTAATGGCGGGCGTACCTCCGACTCTAGTAGAGAAATTCCGAGGGCAGAAGCTTACTGTAGAACTGACGCGCAGGCGCTTTGTTTGTTACCTTTTTGGTTATCTAGCTCTTGTGAGCTTTATGTTATTTTGCTTAGGGATGATTTCTATTCTGATTGGGAAGCCTTTCCATTTGTGGCTGCTCACATTCTGTTCTCCTGATGCAATCTTGTGGCTTAAAACGGTATTTGTTGGCGTTTATATATTCATCTTAATGAATATCATAACAACAACTTTGCTGGGACTTTACTTCCTTGCAGTTCGGTTCCACCAATCATCGCTGTAAAAAATCTAAATACTTTTAGGCTGCCTTCGGGCGGCCTTTTTTATTTCCCCTCATAACTGAGAGGACCCACATAACCAGAGGGGGATGAATGTCCGAACCTGTATCCAGTGCGACAGTGTTGGCTGGTGGATTAATGGGGGCCAGTGTATTCGGTCTGGCAACCGGAACTGATTATGGTGTGGTATTCGGCGCTTTTGCCGGCGCGGTGTTTTATGTCGCCACGGCAACCAACATCGGACGCATCAGGCTGGTCGCTTATTTTATTACATCATTTATTGTGGGAGTGCTTGGTGCCGGGCTGATAGGTACTAAGCTTGCGGCAATAACGCATTATGAAAAACCACTGGATGCACTTGGCGCAGTGATTATTTCTGCAATGTGTATAAAGTTTCTCACTTTTCTCAACAGTCAGGATCTGAACACCCTGTTCAGTATTCTCTCTCGTATCAGGGGAGGGGGATCAGATGGTAGCAAATGACCCTTCTGCAGCTCTGAATGCCGTAATTTGTGGGGTGATAGTCATCGTTCTGATGTTTTACCGACGCGGTGATGCGACACACCGCCCCCTGATTTCGTTACTGGCCTATGTCATGGTGCTGGTATATGCCAGCGTCCCTTTCCGGTTTGTTTTTGGTTTATATGAATCATCCCACTGGCTGGTGGTGATGGTGAATATCCTTATCTGCGCCGCTGTGCTGTGGGCTCGCGGTAATGTGGCGCGTCTGGTCGATGCACTGAGGCACTGATGAATCAACAACAATTTCAGCAGGCGGCTGGTATTAGCGCCGGGCTTTCTGCACGCTGGTTTCCGCACATTGATGCGGCAATGAAAGAGTTTGGTATTACAGCAGTTAATGATCAGGCCATGTTTATTGCACAAACGGGACATGAATCAGCAGGATTTACTGTTCTGAAGGAAAGCTTCAATTATTCGGTGGAGGCGCTGAAAAAGACGTTTGGTAAACGCCTGACGCCGTATCAGTGTGAAATGCTGGGGCGTATTGATGGTCGCCAGGTTGCCCACCAGCCGCAAATAGCCAATCTGGTTTACGGTGGCCGCATGGGTAACAAAGACGCCGGAGATGGCTGGAAGTATCGCGGGCGTGGTCTGCTTCAAATCACCGGCCGCGAGAACTACGTCAAATGCGGAGCTGCGCTGAAGCTTGATCTGATCAGCACACCAGAGTTGCTGGCACAGGAGAAGCATGCAGCCCGTTCTGCTGCATGGTTTTTCACATTACGTGGTTGCCTGATGTATTCAGGTGATGTTGTCCGTGTAACGCAGATCATCAACGGTGGCCAGAATGGACTGGCTGACAGAAATAGTCGTTATAACAAAGCGCGGGCGGCGTTGCTGGTATGACAGCGGTCTTTGCTTTCGTTAAGGCGCGGTGGAAAACAATCATTGTTTTGCTGATGTTGGCTGGTGCATTTCTTGCCGGGATCATCTGGAGTGATCGGGGCTGGCAAAAGAAGTGGGCTGACCGCAATAGCATGGAATCTTCACAGGAAGCGAACGCGCAGACTGCCGCACGCTGGATTGAACAAGGGCGCATAATTGCCCGTGATGAGGCTGTAAAAGATGCACAAGCACAAGCCGCTAAATCTGCTGCCACTGCTGCTGGCCTGTCTGCCACTGTTAGCCAGCTGCGTACCGAAGCAACAAAGCTTGCCGCCCGCCTGGACGCCGCAAAGCACACCTCAGATCTTGCCGCTGCCGTCAGAAGCAAAACAGCCGGAGCCGACGCCGCAGTGCTCGCCGACATGCTCGGACGCCTTGCAGAAGAAGCTCGATATTATGCTGAGCGATCTGACGAAAGCTACCGCGCAGGAATGACGTGTGAGCGTATTTACAACTCGGTGAGAGAGTCAACCAACAATCCCATAGCCCCGCACTAGCGGTGCTTTTTACCGGAGTTTATATGCCACCAAGAACACCTAAATCCTGTCGTGTTCGCGGCTGTCGCAGTACAACAACAGATCCATCCGGATATTGTGAAAGTCACAGAAGCGAGGGCTGGAAACAATACAAGCCAGGACAATCCCGTCATCAACGCGGTTATGGTTCGAAGTGGGACGTTATCCGTGAGCGCATACTGAAGCGTGATAAAGGTTTATGCCAGTTATGTCTGCGTGCCGGTGTGGTGCGTGAGGCGAAAACTGTTGACCACATTATCCCTAAAGCGCATGGCGGAACTGATGCCGACAGTAATCTGCAGAGTCTGTGCTGGCCCTGCCATAAGGCGAAGACGGCTCGTGAACGGCTGAAATAAAAACCAGTTTCCACAGCCAGAGGGGAGGGGCGGGGTAAATCCCTGTGGCCTGACGTCTTCCGGACTGCCCGCCTCATCAAATTTTTACGCGCCAAAAATAAGAAACTTTTTTCCGGAAGGTTCAACCTATTGAACTGGAGGTTTTGATGGGTGCTGTTGTGAGATCTTCCGGTGGTGGCCGTAAGCGCAATTTGCCTTCGGGCCAGAAAAGCAAGCTGACCAGGATCGCACCGCCGGAAGAGTTAATGAGTGATATCGCGATCCGCATCTGGAAAACGCAGAGCAAAATTTTAATTGAGCGGGGCGTTTTTGATCTTGAAGACGCGCCGCTACTCCTGGCGTACTGCAATGCGTTTCACTTGATGATTGAGGCCGAAAAAGTCATCGCGGAAGAAGGCCTGACCGTATCAAGTGAAATGGGTGGTGAGAAAAAACACCCTGCAGTCAATGTCCGTAATGACTCCGTTTCGCAGCTCGCCCGTCTGGGTTCACTTCTCGGGTTAGACCCGCTCAGCCGCATAAGAATGACCAGCGGAAAAAATGATCCGGACGATGAAGGGAATGAATTTGATGAGTTTGACTGATGGCTACATATCCGAACGTCAATGCGGCGAACCAGTATGCGCGGGACGTCGTGAACGGGAAGATACTGGCCTGCCGGTTAACCATGCTTGCCTGTCAGCGACATCTTGACGACCTGGAACGTGCCAAAGATCCGCATTGGCCTTACCGCTTCGATAAAAATAAAGCAGAACGTTTCCTTCGCTTTTCCCAGAAAATGCCGCACACCTCCGGAGAGTGGGCTCGCCGGAAGTTGCGGATAGAATTTGAACCCTGGCAAAAATTTGCGCTGGGCGTGCCGTTTGGCTGGGTGCGCAAGGATACCGGTTTTCGCCGCTTCACTGAGATTTACATCGAGGTACCGCGTAAAAATGGGAAATCGGCGATTGCGGCCGCCGTCGGTAACTATATGTTCTGTGCAGATGGCGAGTACGCAGCGGAAGTTTACTGTGGTGCCACAACGGAAAAACAAGCCTGGAAAGTTTTTGCGCCTGCACTGGCGATGGTGAAAAAGCTGCCGGCGTTGCGTCAGAAGTTCTGTATCAAACCCTGGGCAAAGAAAATGACTCGCCCGGATGGTTCCCTGTTCGCGCCAATTATCGGTGACCCTGGAGATGGCGACTCACCATCATGTGCGATCATCGATGAGTACCACGAGCATGATACTGACGCGCTATACACCACAATGACTACCGGGATGGGGGCGAGGGAGCAGCCCATCACGCTGATCATCACCACGGCAGGCTTTGATATTGCCTCGCCTTGCTATGAAAAACGTACTCAGGTGGTCGAGATACTGGAGCGCATCCGGGAGGGTGGTGAAAACGAGGCAATTTTCGGGATCATCTATACCCTGGATGATGACGATGACTGGACACAGCCGGAAGCTCTGATCAAAGCCAACCCGAATTACAACATTTCGGTGAAAGAGGGATTCCTCAAGGCTAAACAGTTGCTGGCGATGTCCACGCCAGGCCAGACCAATAAAATACTCACCAAGCATTTCAACAAATGGGTGAGTTCTAAAGCAGCTTACTACAACCTGCAGAAGTGGATGACCGCAGCAGACAAAACGCTCAGACTGTCCGATTTTGCAGGTGAGGAGTGTTATCCCGGCATCGACCTGGCATCAAAACTTGACCTTAATGCAGTGGTGCCGGTATTCCGCCGTGAAATAGACGGCCTGAGTCATTATTACTGCGTTTCGCCTATGTTCTGGGTACCGGAAGACACCGTCTACGCCACGGACCCGGCGTTGAAAACTATTGCAGACCGTTACCAGTCTTTTGTTAATCAGGGCGTGCTGGTTCCGTCAGACGGTGCAGAAGTGGATTACCGCCTTATCCTGGAAGCGATCCTGAAATTACGGGAAACGGTGAAGATAGCCGCGAGTCCGATTGACCCCTACGGTGCAACAGGCTTATCTCATATGCTGCAGGATGAAGGGCTTGAACCTGTCACCATTACCCAGAACTACACGAACATGAGCGACCCGATGCGTGAGATTGAGGCTGCGATCGCTGCTGGCCGATTCCATCATGACGGTAATCCCTTGATGACCTGGTGTATTTCGAACGTGGTTGGCAAGTACCTGCCTGGTAGCGACGATGTTGTTCGCCCGGTGAAAGAAGGCGCAGGCAACAAAATTGATGGTGCAGTTGGCCTGATGATGGGGGTTGGCCGCGCAATGCTGAACGAGCCGAAAGACTTTCTTTCTAACCTCGATCCTGATGAGGAACTGTTATTCCTGTGAAATCACTAATTATCGATGTGGCCGGGGTGGCAGGCTTCGGCGCGCTGGTGGGAGGTATTTACCTCAAATTTGGCGCGGCGGTTGCTCTTATGGCTGGTGGTAGTGGCCTGCTGCTGTGGGCACTGCTGGCGGCCAGGAGAATAAAAACATGCTGATTGATGCCATTTTCAGAAGCAACTCGCTGGAAAACCCAGCTGTTCCGGTCACCGTTGAAGCGGTCGAAAACGACGGGATCTTTAATGGTGATGTGATTGTTAACCCCCGGACGGCAATGAAACTGGCGGCGGTGTATGCATGTATCTACGTTATTTCATCCAACGTTGCGCAGATGCCCCTGCACGTCATGCGGCGAACCGGGAAGAAGGTTGAAACTGCCCGCGACCATCCTGCCTTTTACCTGGTTCATGACGAACCCAATTCCTGGCAGACCAGCTATAAATGGCGCGAGCTCAAACAACGTCACATTCTGGGCTGGGGTAACGGATATACCAGAGTTCTCCGTCACCGCCGAACCGGTGAAGTCACTGGCCTTGAAGCCTGTATGCCGTGGGAAACAACGCTGCTGAACACCGGCGGGCGCTATACCTACGGCGTGTATAACGAAGAAGGTTCCTTTGCCATTAATCCTGATGACATGATCCACGTCAGGGCGTTGGGTAACGATCAGAAAATGGGGCTCAGTCCGGTTCTTCAGCACGCCGAAACCATCGGTATGGGTATGAGCGGGCAGAAATACACGGAAAGTTTTTTCAGCGGTAACGCCAGACCAGCGGGCATAGTTTCAGTAAAAGGAGAATTGAATGACGGCTCCTGGAAAAGGCTGAAAGAGATGTGGCAAAAAGCCACGGCGATGCTGCGCAGCCAGGAAAACAGGACAATGTTGCTCCCGGCTGAACTGGATTATAAAGCGCTGACGGTTTCCCCAGTCGATGCCCAGCTCATCGACATGATGAAGCTCAACCGTTCCATGATTGCCGGGATTTTCAACGTGCCGGCACACATGATCAACGACCTCGAAAAAGCCACCTTCTCCAATATTTCCGAACAGGCGATTCAGTTTGTTCGCTACACAATGATGCCGTGGGTGACGAACTGGGAGCAGGAGCTTAACCGTCGGTTGTTCACCCGCGCCGAACGGGAAGCCGGGTATTACGTGCGCTTTAACCTGGCGGGTTTATTGCGCGGTACTGCCAAAGAGCGCGCGGAGTTCTATCACTTCGCTATCACCGATGGCTGGATGAGCCGCAACGAAGCACGCGCGTTTGAGGATATGAATCCGAAAGACGGCCTTGATGAAATGCTGGTCAGCGTTAACGCCTCCCGGCCAGCCAAATCCACAACCCAGGAGAACACTCAAGATGAGTGAACGTGAAATTCGCTGTTACAGCGGCGAGGTGCGCGCAGAAACGCACGACAGCGAGCCCAGCCGGATCATCGGGTATGGTTCGGTCTTTGACAGCCGTTCTGAACTGATTTTCGGTTCGTTTCGCGAAATCATCCGGCCCGGTGCGTTTGATGAAGTGCTGAATGACGATGTACGGGCGTTATTCAACCATGACCCCAATTTTATCCTGGGTCGCAGAAGTGCGGGCACGCTGGCACTGACGGTTGATGAGCGGGGTCTGCGTTATGACATCACCGCGCCAGAAACTCAGACAATCCGTGATCTGGTGCTGGCACCAATGCAGCGCGGGGATATCAACCAGTCCTCTTTTGCATTTCGCGTCGCCCGCGACGGAGAGGAATGGTACCAGGACGAGGATGGTGTGGTGATTCGTGAGATTACCCGTTTTTCCCGTCTGCTGGATGTCAGCCCTGTGACATATCCGGCGTATCAGGAGGCAGATTCCGCCGTCCGCTCTATGAAAGCCTGGCAGGAGGCGCGCGATAGTAGCGCACTGCAGAAAGCCATTAACCAACGAATGGCGCGTGAGCGCGTCCTGACCCTTCTTAACGCGTAAGGAAAAACCATGAAATTGCATGAACTGAAACAAAAACGTAACACCATCGCGACCGACATGCGCGCGCTGAACGAAAAAATCGGCGATAACCCATGGACGGATGAGCAGCGTACCGAATGGAACAAGGCAAAATCTGAACTGGAAGCACTCGACGAGCGCATCGCCCGCGAAGAAGAGCTGCGCCGCCAGGACCAGACCTACGTTGATGAAAACGAGGAAGAGCAGCGCAATAATCAGGATCCTGATAAAGACCCGCAGCAGGACGAAAAACGCGGCCAGATTTTTGATAAATGGATGCGTCACGGCGCCAGCGAACTGAGTTCCGAAGAGCGCAAAGCCTTACGCGAACTGCGTGCGCAGGGTGTGGCGCCGGATGAAAAGGGCGGCTATACCGTGCCTGATACCTTCCTGGCGAAAGTGGTCGAACAGATGAAATCCTACGGTGGTATTGCCAGCGTGGCGCAGATCCTCGCTACATCCGATGGGCGCACTATGGAATGGGCCACTGCTGATGGTACCGCTGAAGTGGGTGTGCTGCTGGGTGAAAACGAAGAAGCGGGTGAAGAAGATACCGAATTCGGTATGGATAGTCTGGGCGCGCTGAAAATGACATCCAAAATTATCCGCGTATCCAACGAGCTGCTACAGGACAGTGCGATCGACATGGAAGCCTATCTCGCCCGCCGTATTGCGGAGCGCATTGGCCGCGGTGAAGCGCGTTACCTTATTCAGGGGACCGGCACCGGTACGCCAAAACAGCCTAAGGGTCTGAAAGCATCCGTAACCGGCACTACGCAGACGGCCGCTGCCGGAGCTGTTAAATGGCAAGAGATTCTGGCGCTGAAACACAGTATTGATCCGGCGTACCGCCGCGGGCCGAAGTTCCGCCTGGCGTTCAATGACAATACGCTGAAACTCATCAGCGAGATGGAAGACGGTCAGGGCCGTCCACTCTGGCTGCCTGATATCGTCGGCGTGGCGCCAGCATCAGTGCTGAATGTTCCGTACGTTATTGATCAGGAGATCGATGATATTGGCGCGGGCAAAAAATTCATGTTCTGTGGCGACTTCGACCGCTTCATTATCCGCCGTGTTCGCTACATGATCCTGAAGCGCCTGGTGGAGCGTTACGCGGAATTCGACCAGACCGGCTTCCTGGCGTTCCATCGCTTTGACTGTATTCTCGAAGATACCTCTGCGATTAAAGCGCTGGTGGGCAAAGGCTCGGCAAGCAGCTAATCCCTCTCACCTCTGAACAAACCATGCCGCGTTAAGCGGTTTTTTTGTGCCCGCCACCCGGCGGGCGCAGGAGGATCCTATGTTGCTTTCTCCTGAGGAGATCAAGTTGCAGCTCAGGCTGGATGAGGATTACGCCGATGAAGATAAATTTCTTGAGCTGTTGGGGCGGGCGGTTCAGGCCAGGACAGAAAATTTTCTGAACCGGAGACTTTATACGGCGGAGGCGGGGGTGCCAGCCGACGATCCGGAGGGGCTTATTCTCTCGGATGACATCAGGATGGGGATGCTGCTTCTGGTGACGCACTTCTACGAGAATCGTTCTACCGTCACCGAAGTGGAGAAAGTCGAACTGCCGATGAGCTTTAACTGGCTCGTCGGTCCATACAGGTACATCCCGCTATGAAACTCAGGCAGGCGCAGGCCAGCGCCACATACCTTTTGCCCGACCCAGGCGAACTTGACCAGCGCATTGTTATCCGGCGGCGTGTCGATGTTCCGGCTGATGACTTTGGCGTAACGCCGACGTACCCGGAGCAGATCCGGGCGTGGGCCAAAAAAGCGCAACCCGGCGCGGCAGCTTATCAGGGGGCTGTGCAGATAGAAAACAGGGTGACGCACTATTTCACCATCCGTTTTCGCCGCGGTATCACCGCCGATCATGAAGTGCTCCACGACGATATTTCTTATCGGGTTAAACGGGTCCGTGATCTGAACAGTAAACGCCGCTTTCTGTTGCTCGAGTGCGAAGAGCTGGGTACCGATAACGGGAGTGACTATGCCGCAGAAAGCATATTTACACGTTGATTTTGAACAGCCGGAAACGCTTGTTTTTAACCGGGCGCGTATGCGCCGGGCGTTTGTCAGTATCGGGCAGGTACATATGCGCGATGCCCGCCGCCTGGTCATGAAGCGGGGGCGTTCCGGACCCGGCGATAATCCTTCATACAGAACGGGAAAACTGGCACGCTCCATCGGGTATTACGTTCCGCGGGCATCCAGTCGCCGTCCTGGATTGATGGTGAAAATTGCCCCTAATCAGAAGAACGGGGAAGGGAACCGCCCGATCTCAGGCGCATTTTACCCTGCCTTTCTGTTCTACGGTGTTCGCCGTGGGGCGAAGCGTAAGAAAGGCCATCATCGAGGCGCATCAGGCGGCAGCGGCTGGCGTGTGGCACCACGTAACAACTACATGACTGAGGTTCTGGATAAACGCCGCAGCTGGACACGTTATGTGCTCTCCCGCGAATTGCGAAAATCACTCCGTCCTCAGCGAAGGAAGAAAAAATGAAATTAACCCCGATTATTGCGGCACTTCGCAGCCGTTGCCCTCGGTTTGAAAACCGTGTGGGTGGCGCAGCGCAGTTTAAAGCGATACCGGAGGCCGGAAAGCTCAGGCTACCAGCCGCGTATGTTGTGCCAGCCGAAGACGTCACGGGTGAGCAGAAATCGCAGACCGACTACTGGCAGGATTTGACGGAGGGTTTTTCCGTCATCGTGGTACTCAGCAACGAACGGGATGAAAAAGGGCAGTGGGCTTCTTACGACGCAGTTCACGACGTCAGGCAGGAAATCTGGAAGGCGCTGCTGGGGTGGGAACCGGACCCGCAGGCGCATGAAATTCAGTATGCGGGTGGGATGCTTCTCGATCTGAACCGCCACGAACTGTATTACCAGTTCGACTTCACGGTGAAGTATGAAATTACCGAAACAGACACCCGCCAGCAGGATGATCTGGACGGCCTGCCCGACCTTAAAACGCTCAGTATTGATGTTGATTTTATCGAACCCGGTACCGGGCCAGATGGCGACATCGAGCACCACACCGAAATTACATTTCAGGAATAAACCATGTTTGTGAAACCCGCAAAAGGGCGATCGGTTCCCGATCCGGCCCGTGGCGACCTTTTACCTGAAGGAGGTCGAAATGTTGATGAGAATAACTACTGGCTGCGCCGCGAGGCCGCTGGTGATGTCCGGCGCACGAATAAAAAGGTGAAAACAAATGGCGATTAGTTTTAATTCCATCCCGTCAGATACGCGGGTTCCGCTGTTTTATGCCGAGATGGATAACTCGGCGGCAAATACCGCCCGGGACAGCGGGGCATCACTGCTGATTGGTCACGCCAGCAATGATGCGTCAATTGCCGTCAACAGTCTTGTTCTGGTGTCATCGGTTGATTATGCCCGTCAGATTTGCGGTGCAGGAAGCCAGCTGGCCCGTATGGTCGGGGCGTACCGTAAGACCGATCCATTTGGCGAACTGTATGTCATTGCCGTACCTGAATCCACAGGCGCGGCAGCAACCGTCGCTTTGACGGTAACTGGCGAAGCGACGGAAACCGGAACGGTGAATGTCTATACCGGCCGAACCCGCGTTCAGGCTCCCGTGACCAGCGGTGATGACGCTGCGGCGGTGGCTGTGAGCATTAAGGATGCGGTCAATGCAAACCCTGATCTTCCCTTTACGGCAACATCAGAAGCGGGGGTGGTGACACTGACTGCGCGCCACAAGGGGTTATATGGAAATGAAATTCCGGTCACTCTCAATTATTACGGCTTTGGCGGTGGGGAGGTGTTACCGGCAGGTGTGAATATTACGGTTGCCAGCGGCGTGAAGGGGGCTGGTGCGCCAGCTCTTAACGACGCGGTGGCAGCGATGGGAGATGAGCCGTTCGATTATATCGGCCTTCCGTTTAACGACACGGCATCGGTGAACACGATGGCAACTGAAATGAATGATTCCAGCGGTCGCTGGAGTTATATCCGGCAGTTGTATGGTCACGTTTATACGGCGAAGACGGGGACGCTGTCGGAGCTTGTGGCCGCGGGTGACCAGTTTAACCTGCAGCACATCACCCTGGCGGGCTATGAGAAAGACACCCAGACGCCTGCTGATGAACTGGCTGCAAGCCGTACTGCCCGTGCTGCGGTTTTTATCCGTAACGATCCGGCGCGCCCGACCCAGACCGGGGAACTGGTGGACATGCTGCCGGCACCGAAAGGCAAACGCTTCACGACGACTGAACAGCAGACGTTACTTTCCCACGGTGTGGCAACGGCGTATGTGGAAAGCGGCGTGCTGCGTATTCAGCGGGATATCACGACGTACAGGAAAAATGCGTATGGTGTGGCGGATAACAGCTACCTTGACAGCGAGACGCTGCATACCAGTGCTTATGTGTTGCGCCGTCTGAAATCTGTTATTACCAGTAAATACGGGCGCCATAAACTTGCTAATGATGGTACGCGTTTCGGGCCTGGTCAGGCCATTGTCACGCCTGCCGTTATCCGTGGTGAGCTGGGATCAACATATCGCCAGCTGGAGCGGGAAGGCATCGTGGAAAACTTCGATCTGTTCCAGCAACATCTGATAGTTGAGCGTAACGCGAACGATTCGAACCGCCTTGATGTGCTGTTTCCGCCTGATTATGTCAATCAGTTACGTGTGTTTGCGGTGCTTAACCAGTTCCGTCTGCAGTACAGCGAGGAGGCTGCATAATGGGAAAAATTGCGGGAACAACATATTTCAAAATCGACGGACAGCAACTGTCGGTAACCGGAGGGATTGAAGTCCCCATGAACACCAAAGTTCGTGACGACGTGATTGGCCTGGATGGTTCCGTTGACTACAAGGAAACCAGCCGGGCACCGTATACGAAGGTGACCGCCAAAGTGCCGAAAAACTTCCCGGTCGATAAAATTACGTCTTCTGATGTCATGACCATCACATCAGAGCTGGCAAATGGTCAGGTGTATGTTCTCTCAAACGCCTGGCTGCACGGCGAAGCCAACCATAACCCGGAAGAGGGCACCGTGGATCTTGAGTTCCACGGTGAGGAGGGATTTTACCAGTGATAAAAGAACTTGTGCTCAAAAAGCCGATTATGGCGCATAACGAAAAGCTTCATGTGCTGGAGCTGCGCGAACCGTCCTACGATGAAATCGAAGCCATTGGTTTTCCGTTCACCGTTTCCGGTGATGGCGGCGTCCGGCTGGACAGTTCGGTTGCTCTGAAATATATCCCTGTGCTGGCAGGTATTCCACGCTCCTCGGCAGCGCAACTGGCAAAACTGGATATTTTCAAAGCCTGTATGTTGATCCTCAATTTTTTTACCCGGTCGGAGACGGAGGAGGACTCAGAAAGCGGGTCTACAACACCGCATACTTCTGGCGAATAAATCCCCTGGAGCTCCGGCGGGCGGCGATATCCGATTTTCTGGAGCTGGAGTCGGAGGCTGTCCGTATCAATGAGGAAATGAAGCATGGCTGACAGTTTCCAGTTAAAGGCCATTATCACTGCCGTTGACCAGTTATCGGGTCCGCTGAAAGGGATGCAGCGGGAACTGAAGGGATTTCAGAAAGAAATGGCCGGGCTGGCGATCGGTGCTGCCGCTGCCGGGACCGCTGTTCTTGGGGCGCTGGCGCTGCCCGTGAATGCTGCGATCGGCTTTGAGTCAAAAATGGCTGACATCCGGAAGGTGGTTGACGGCCTGGATGATAAAAAAGCATTCGCGCAGATGAGTGACGATATCCTGACGCTGTCCACACAGTTACCGATGGCGGCGGAGGGAATTGCAGAGATCGTGGCGGCGGGCGGGCAGGCAGGCATTGCCCGCGGCGATTTGATGCAGTTTGCGAACGACGCAGTGAAAATGGGTGTGGCGTTTGATACCACTGCCGAAGAGTCCGGTCAGATGATGGCGCAGTGGCGGACAGCGTTCAGACTGACGCAGGAAGACGTGGTTGTCCTGGCCGATAAAATCAACTATCTGGGGAATACCGGCCCGGCAAATGCGAAGAAAATTTCTGATATCGTGACGCGGATTGGTCCGCTGGGCGGTGTTGCCGGAGTGGCATCCGGCGAAATTGCCGCGATGGGCGCCACCATTGCCGGGATGGGGGTTGAATCAGAAATTGCCTCCACCGGCATCAAAAACTTCATGCTGTCGTTAACCGCAGGTAATTCGGCAACCAAAGCCCAGAAACAGGCTATGGCTTTCCTGAAGCTGAATCCCCGGAAACTCGCTGAGGATATGCAAAAGGATTCGCGCGGGGCCATGCTGAAGGTGCTGGACTCGCTCGCGAAAGTGCCAAAAGCTAAACAGGCCGCCGTCATGAATGCGCTGTTTGGCAAGGAGTCACTTAGCGCGATTGCCCCGCTGCTGACCAACCTGGATTTGTTACGCACCAATTTTGATCGTGTGGCTGATGCCCAGGAATATGGCGGCTCGATGCAGAAGGAATACGCATCCCGCGCGTCCACAACAGAAAACCAGCTGGTTCTGCTGAAAAACAGCGTCAATGCGATTTCGGTAACGCTGGGCGATACCTTCCTGCCCGCCATTAACGAAGCTGCAGAAGCGGTCATGCCTTACCTGGAGCAGCTCCGGACATTCGTTCGCGCGAATCCTGAACTGGTTCAGTCTGCGGCGAAGTTCGGCGCGGCGCTGCTGGCTGTTGGCGTATCCATTGGCAGCCTGTCCCGGGCTGTCAAAATCCTGAACAGTGTCATTAACCTCTCTCCGGCGAAAGTCGCCATTGCGGCGCTGGTGGCCGGCGCTATGCTGATCATTGAGAACTGGGACGATGTTGCTCCGGTGATTAAGGCGGTATGGCAGGAGGTCGATAACGTTGCGCAGGAGATGGGCGGATGGGAGACGGTGATTGAAGGGGTTGGTCTGGTTATGGCTGGTTCTTTTACCGTCAGGACCATTGGTGCCCTGCAGCAGTCCGTCCTGCTGGCCGGACGGCTTTCCGGTCTGCTGGGTAAAATTGGCCGGATGGGGGCCATGACGCTGACAATTGGCGTGGCGGTGTCACTCTTTAAAGAGCTTAAGGATCTGGAGCAGGGGGCAAAGGATGCGGGTATGGATGCTGGCGCATTCGCTGTACAGAAGCTGCAAACGAAGGAGCGTGAACGCGGGTATAACGGTTTTATTCCCAGACTCAAAGAGCTTCTTGGTATGGACACCCCGATTCCGCAGGGGCGTTATCAACCTTATGTGCCACTGACCCGGCGTTCTGGCGTACTCGGGCGAGCTGTCCCGCCATCAACGCAGCGCAGCGAACTCAAAGTGACATTTGAGAATGCACCACAAGGTATGCGTGTGACTGATATACCGAAATCCGGTAATCCATTGATGAACATCAGCCATGATGTGGGTTACTCACCCTTTCGTACATCACGATAAACCTGCTCCGGCAGGTTTTCTTATGGGGTAAATATGGCTTTTTTCTCCTCAACTGGCTGGCGCGGGCGCCTGCGTGATGCATCATTTCGTGGAGTGCCTTTCTCCGTTGAAGATGATGAAAGCACGTTTGGACGCCGCGTACAGGTACATGAATATCCGAACAGGGATAAGCCCTGGACGGAGGATTTAGGTCGCGCCACGCGCCGCCTGACGATAAATGCTTATCTTGTCGGTGATGATTACGCAGACAGGCGGGATCGTCTTATTGGTGCCATTGAAACCGCAGGCCCTGGTACGCTGGTCCATCCGCAGTATGGCGAAATGCAGGGCAGCATTGACGGACAGGTCAGGATCACTCACAGCAGTACAGAAGGGCGCATGTGTCGTGTCTCCTTTCAGTTTGTGGAAAGTGGTGAACTTTCTTTTCCTGTGGCGGGAATGGCAACGGCGAAGCGCCTGGAAACATCAGGCGGGCTTTTCGACGATGCGATTGACAGTATGTTTTCCACATTCTCGTTGTCAGGTATTTCTGATTTTATCCAGAACGATGTCATTGCCGATGCTGCCTCCATGCTGGGCGATGTTGCCGATGCTTTCAGGATGGTTGACTCCGGCGTGTCTGCCGCAATGCGGCTGTTACAGGGGGATTTGTCTGTCATTCTGATGCCACCGGGCGCCGCAAGTGATTTCGTTAACGCACTGCAAAAAGCCTGGCGCTCAGGTGACAGGCTCAGAGGCAGTACATCGGATCTGGTCACGATGATAAAAACGATGTCAGGTATCACCCTTGATCCCGGTCTTTCCCCCCGTGGCACCTGGCCCACTGACTCCGGATCTGCTGCGAAACAGAAAATGCAACGCAATATGATCGCAGCCGCCATCAGGACAACAGCCATCAGCACAGCCGTCCACGCCGTGACAACACTGAAGCAGCCGCGTGATGTACCTGATGTCCGGGGCGTAAATCAGCCTGCAGGAACAGGCCGTGACTCAGACATTATCACTGTCATGCACCCGGCGCTGGATGGTGTACAGACAGTCAGTAATGGCAGCTTTCCACCGAATTATGAAGATCTGAAAGCTATCCGGACCGCGCTCAATGCTGCGATTGACCAGGAGCAGTTGCGTATCCGGGATGATGTGCTTTTCCAGCAAATTTCCGTTATGCGGACGGATCTCAATCGCGATATTTCTGCACGACTGGCACAGGTTGAACGTACTGCATTGCGAACGCCTGATGATGTTCTGCCTGCACTGGTACTGGCTGCAGCCTGGTATGACGACGCCGGGCGGGAATCTGATATCCTCACTCGTAATCCCGTTCCCCATCCGGGATTTATCCCGGTTGAGCCGCTGAGGGTTCCGGTACGATGAATAATACGGTTTTTTTACGCGTCAACGGGCGTGACTGGGGAGGATGGACGTCAGTACGGATAAGTGCGGGCATTGACCGTATTGCCCGGGACTTTAATGTCTCGATCACCCGGCAGTGGCCTGGTGGAGAAGACGTACCGCCAGTAAAAAATGGTGACGCTGTAGAGGTACTCATTGGCGATGATTTAGTTATTACCGGCTGGGTTGAGGCGTTACCGCTACGTTATGATGCGCAGACCATTATGACGGGCATTGTCGGGCGCAGCAAAACGGCAGATCTTATCGACTGTTCTGCATCGCCTGCACAGCATAACGGGAAAAATTTATTCCTGATCGCCAGCGCACTTGCCCGGCCATTCGGTGTGGACGTTGTTGATGCAGGCGCGCCGGCAGCCGCCGTTATTGAGGCTCAGCCGGAACATGGTGAAACGGTTGTGGACTGTCTGAACAGGCTGCTTGGACAGGCTCAGGCGCTGGCATATGACGACGAACGGGGACGGCTGGTTCTCGGCAGGCCGGGCAGTATGAAAGCAGCCACGGCACTGGTACTTGGCGAAAATATTCTTTCCTGTGATACCGAGCGTAGTGTTCGTGAGCGTTTCTCCAGTTATCTGGTTACGGGGCAGCGTCCTGGTACGGATGACGATTTCGGCGAGGCAACCATTGCTGCTATCCGGCAGAGTACTGGTGATGCAGGCGTCACGCGGTATCGTCCCCACACCATTCAGCAGTCAGGAACTGCCACAACTGACAGCTGCAAATCCCGCTGTGAATTTGAAGCCCGTCAGCGTGCGGCGAAAACGCTGGAAACCACCTATACCGTACAGGGATGGAGACAGGGGAATGGCGAATTGTGGAAACCGAATCAGGCCGTGGTGGTGTATGACCCGCTGAACGGTTTTGACAATGAAACGCTGGTGATCGCCGAAGTGACGTACAGCCAGGACAATAACGGCACCCTGACCGAAATCCGGGTGGGGCCTGCGGATGCCTATCTTCCTGAACCATTCAGGCCGAAAGCGAAGAAAAAAGTCAGTGAGGAGGCAGATTTCTGATGGCTAACCATCCTCTTCAGAACATGATAACGCGCGCAGTCATTACCGCGATTGATACCGTCAGAAAATGCCAGACTGCCGGACTGAAACTTATTGCCGGTGAAAAAAAAGAGAATGTGGAGCATCTTGAACCTTACGGTTTCACCTCTGCAGCACAGAATGGCGCAGAAGCGGTGGTATTGTTTCCCGGCGGTGGCCGTTCGCACGGAGTGGCTGTGGTTGTGGCTGACCGCCGCTTCAGACTGAAAGGGCTGGCGCGCGGGGAAGTCGCGCTATATGACGATCAGGGGCAGTCGGTCACATTAACCCGAGCCGGAATAGTGGTAAATGGCGGCGGAAAGCCAGTTATTTTCACGAATGCCACTAAAGCACGTTTTGAAATGCCGATCGAATCCACTGGCGATATCAGGGACAACTGTGACAGCAGTGGAAAAACGATGGCTGAAATGCGCACGACCTATAACGGTCATACCCATAAAGAAAATGGCGATGGCGGCGGTATAACCGATAAGCCTGGCCAACCCATGAGCTGACACCATGATCCTTTATGTTAATGGAATCCGTAAGGATGCCACGGCTTCGCTCGACTTTCTGACGCGGGCAGTGGTGATTTCTCTTTTTACCTGGCGCCGGGCGGAGCGGGATGACAGGACCCCACAGCCATACGGCTGGTGGGGGGACACCTGGCCTGCTGTTCAGAATGACCGCATCGGTTCCCGCCTCTACCTGCTGAAACGCCGCAAACTCACCAATAAAACGCCGCAGGATGCCCGCGAATACATGCAGCAGGCGCTGGCGTGGATGACAGACGATGGCGTGGCGGCACGTATTGATGTGACATCTGAACGCACAGGAACAGATACCCTGGCAGCTGGCGTGACGATATATCAGCGGGACGGGGTAATTCACAATATTACATTCGATGATATATGGAGCAAACTTAATGGCTGACAGTCAATTTGCACGTCCTGAACTTCCTCAGTTGATTGCTACCATTCGCAGCGATTTACTGACCCGTTTTCAGCAGGATGTTGTGTTACGTCGCATGGATGCCGAGGTTTACAGCCGGGTACAGGCTGCTGCCGTACATACGCTGTATGGTTATATCGATTATCTGGCCCGGAATATGCTGCCTGATATGTGTGATGAGGACTGGCTTTACCGTCACGCGAGGATTAAGCGTTGTCCCAGGAAAAATGCCGTATCTGCGAAGGGATTTGCACGCTGGGATGGTATTGCCGGAACGCCGGAGATCCCCGCGGGTACACAGATTCAGCGGGATGATCAGGTTACATTCACGACCCTGCAGACGGTGAAAGCTTCCGGCGGCCTGTTACGTGTGCCGGTTATTGCTGATGTGGCGGGAACTGCCGGTAATACTGACGATGGTACGGCGTTACGCCTTGGCACGCCGATTACTGGTATTCCTTCTACAGGTTACGCTGACACTCTGACCGGGGGGGCTGATACAGAGGAGCCTGAAACGTGGCGCGCGCGCGTCATGGAACGCTATTACTGGATACCACAGGGGGGCGCTGATCCTGATTACGTCATCTGGGCAAAGGAAATCGCGGGAATAACCCGTGCGTGGACATTCCGCCATTATAAGGGGACCGGCACCGTTGGTGTGATGGTGGCTACCAGTAACCCGGTGAATCCGGCTCCTGGCGACGATCTCGTTAAGGCTGTACGTGACCATATTTTGCCGCTGGCACCTGTTGCTGGCGGCGGACTCTTTGTTTTCGCTGCCACTGAAAAAAGCATTCCGGTAACAGTCGCACTGGCCAAAGATACCCCGGAAATTCGTACTGCCATTATTGCGGAGCTAAATGCGCTGATGCTGCGTGATGGCGCGCCGTCCGGAAAAATTTATGTTTCGCGAATCAGCGAGGCGATAAGTCTGGCGACCGGGGAAGTGGCACATCAGCTGCGTGTGCCGGCGGCAGATGTGGTGCTGGGAAAAACTGAACTTCCTGTCCTGGGGAATATAACCTGGGCCACCTATACCGGGGAGAACGGATAACTATGGCATTACAGGACGAATATACGCAGTTACTTTATCACCTTCTGCCGGAAGGGCCTGCCTGGGACGGAGAAAACCCACTGATTGAAGGGCTGGCGCCGTCGCTGAACCGGGTACATCAGAGAGCGGATGAACTGATGGCTGAAATTGATCCGGCCAGAACCACAGAACTGATAGACCGTTATGAACAGCTGTATGGCCTGCCTGATTCCTGTGCACCGGAAGGCGTTCAGACATTACAGCAGCGCCAGCAACGGCTGGATGCAAAGGCAAATGTTGCTGGCGGTATAAACGAGAGGTTTTATCGGGAACAGCTTGATGCGTTGGGGTATACCGCTGCCACCATTGAGCAGTTTCAGAATCTCGACAGCACACCCGATCCTGAATGGGGGGAATTCTGGCGTTACTACTGGCGTGTGAATATTCCGGCTGATGCGAACATCAGCTGGCAGACCTGTACAAGCACCTGCGACTCTGCGATCAGAACGTGGGGCGATACTGTTGCTGAATGTGTGATTGATAAGCTTTGTCCGTCACATACGGTTGTTGTTTTTGCTTATCCGGAAGGAAAAGAGAATGCACAGAATTGATACGCCCACCGCGCAAAAAGATAAATTTGGTCAGGGAAAAAACGGATTTACGAATGGTGATCCCGCCACGGGCCGCCGCGCAACGGATCTCAACAGTGATATGTGGGATGCAGTCCAGGAAGAGGTCTGTACTGTTATTGAAGCCGCCGGCATACCACTCAGTAAAGGCGAACATACGCAGCTTCACGCCGCCATTGGCAGGCTGATCGATGAACAGGTTAAAACCCGTCTTGAAAAAAATCAGAATGGCGCGGACATCCCGAATAAGCCGCTGTTTCTCCAGAACGTCGGTTTAGGAGAAACGATAAATCTCGCTGCAGGGGCCCTGCAAAAATCGCAGAACGGCGGCGATATTCCTGACAAAAAACAATTTGCGAGAACCATCGGTGCGGTAACGTCAACCACCATTACACTTGGCGAATCAGGCTGGTTCAAAATCGCCACGGTTGTAATGCCGCAGGCTACATCAACTGCGGTGATTAAACTGTACGGTGGGGCGGGGTTTAACGCTGGTTCACCTGAACAGGCGGCAATCAGCGAACTGGTATTGCGTGCCGGTAATGGTTCACCTGTTGGAATAACCGCCACATTATGGAGGCGTTCACCTTCTGCTGCTAACGAGGTCGCATGGGTTAATACATCAGGCGACACCTACGATATTTATATTAATATCGGCCAGTATGCGTACTGGTTAATTGCGCAATATGATTACACCGGTAATGCAAATGTCACGCTGCACAGTACGCCTGAATATTCATCAGTTCAGCCGGGAAACTCAACCAGCGGTCAGACATATACACTGTTTAATAGTCTGATGAAACCCACAGCCGGTGACGTTGAGGCACTGTCAGTTAATGGAGGGAGGCTAAACGGTCCGTTAGGCATTGGTACTGATAATGCGCTGGGTGGTAATTCGATTGTATTCGGAGATAACGATACAGGGTTTAAGTGGCACAGTGACGGCGTTCTGGGGATTTATGCCAATAATGCTCTGGTTGGTTATATCGACAATTCCGGGCTGCACATGTCAGTAGATGTTCTCACTAATGGTGCCGTACGCGCAGGCAACGCAAAAAAACTGTCACTGACGAGCAATAATAATTCGACAATGACAGCCACGTTTAATTTATGGGGCGACGCAAACAGGCCAACAGTTATTGAACTGGACGACGATCAGGGATGGCATCTGTACAGCCAGCGAAATCCTGACGGTTCGATTGTCTTTACGGTCAATGGAGATATCACCGCTAACACGCTTCGTGCAAGCGGGGCTATCTATCAGAATAACGGCGACATCTTTGGTTCGCTATGGGGAAATGGCTGGTTAAGTACCTGGATTAATAATAATCTCGTCTTAGATGTTCAGTTAGGGGCTGGCACATCAGTGACTACCTGGAACAATGCAGGTTCCTGGCCTAACACTCCCGGATATGTAGTTACCTCCGTCTGGAAAGATTATCAGGGCGAAAATATTGATGGTATTAATTATGCGCCTTTGCAAAAACGAGTCGGGAGTCAGTGGTATACCGTACAAGGGGGAACGGTATAATGAAAAAATATCAGAATATCAAAAATTTCAGACTTATTGACGCGCCCGTAAACAGGGATAAAACTCAGGCTGAAATAAATATAGGTGCATATTTTCTGGAGTCGGACGATGGACAGGACTGGTATGAGTGTCAGTCATTATTTTCTGATGATACTGCAAAAATAATGTACGACCATGAGGGGGTTATCTGGGGTGTTGTTAATAAGCCAGTCCCGCAACGAGGAAACACATATTCTGTATCAATGTTGTGGCCGGTTAATATGTCTGTTGCGGAAATAGACGCTGCTGACTGCCCTGATGATTGCCGTGGTGATGGTACGTGGTTATATCAGGACGGTAAAGTCGTTCAACGGGGTTATTCTCCGGAAGAGCTGCGTAAAAAGGCGGAGGCTGAAAAAGTTCGCCGCCTTGCTGAGGCTGAATCAGCCATCGCACCACTGGCACGGGCAGTAAAACTAAAAATTGCCACAGATGAAGAGATTAAACGGCTGGAAGCATGGGAACTTTATAGCGTAATGGTAAACAGGGTGGATACATCTGCGCCTGACTGGCCGGATATACCACGCTAAATATTCAGGCGGGTTTATTACCCGCCTTTTCTTTTTCCTGTCGTTGTGCCATCAACCTGACAGCCGGTACAAATAGCCCCCTCTTGTGTACTGACCTGAAAATATACTCACCCCTTAACCACGGAGTTAACCGGATGAGTGATTTTCACCACGGCACGCAGGTCATCCAATTTAATGGAGGTACGCGCGTCACATCCACGATATCGATCGAAATCGTCAGTATGGTCTTTACGTTTAGCTTTGCCGAAACATTGCTCGCGTTTATATCATACGTTTGCAAATAATTATCTCCAGGAGCTGATAGTCAAACTGCTGGTATCCATAAAAATATGTTCATGCTTAAATTCTATAAAATTACAAAACTCTTCATAATTGTGCCGTATAGAACAATTAAAATAATGCCTTAGCCCACCGCAGACACCATCAATGTATGGATCGCCATAAGGTTTACTGTGCATAATTTCAAGTCCTTTTTTCAATGCCGGATGCTCGCTGCGGTTAACGGCGATAATCCCATTTTCAAGACTCATGCTATTACCTTTACGACTTACATGAACAGCAATACCATCAGGTAAATACAAAGTGCCGAGTTTACCTGTAAGTAACATATCAGCATCAAGATATATGCAGCCACCTCCAGGTTGCAGGTGATGGCAGCCATGTTTGCCCGCCTCCAGAAAAGCATTACTTCCTTTTAATAAAAAAAGATTTCTGTAAAAATCAAACCGTACATGTCCCAAGCGTTTATCATGGGACGAAACAAGGGACTCCTCTGGATTGTTCTTTAAAACTTCATTTAAACTCTTTTTTATCTCACCAAGCAGATATTCATCTCTGACATTTGCTGGTTGAGCTTCAATTTTAGCGATGTTTTCTAAATAAATATCTGATAGTTTCTTGTCATACATGCTATAATCCAGGTCGGAATTATAGATAACCTTTATATTTTCATATTGTTTTTCCAGCTTTGCTAATGCTTTCTTTTGTCCAGCACTAAAATTCCCATCAACTAAAACCCCGATAGTTCTCTCTTTTTCTATTATAGCGGCGTTGATAATATTATTGAGATAGGGGTTTTGTTGAGTATTAATAATTGGGATCTGGTTTTCCCCAAACCTGCTTGGGTTTCGTTCAAACCATTGAAAAAGTAGGGGGGTGTGCTGATCTAATGGCAATAAAGGATATTCAACTCCGGCAAAGTTTGCACTACCTGATGAAGGCAGAGTAATAGCTGGAGTTGCAGTATGAGAATAGTTCTGGCATGAAAGAAAACCTCTGACTCGAGAAAACATTTTTCAACCCTTACGCTATTAATATAACATACCATGATTTGAGATTGATAAGATATCGGGTTTTAACCTTAGTTTAATAAATTGGAAGATTTTTGTATGAAGATTTGCTACCGTATTTTCGTGCCATAGCTATCTGAAACGATAGTTTTTACTTGGTTAGGGGGGGCTTAAAATTACATTTTTGAATAAGTATTTTATACTTCTTATACGATATGTTTTTATTGTATTGCAGGGGGACAGGGGAGATTGGCGGGCGAAGACCAAAGTTATCACCTGAGCAATGGGCGCAGGCCGGGCGTCTGATTGGGGCAGGAATACCGCGACAGCAGGTAGCGATTATTTATGATGTGGGGCTGTCGATACACGGGCTGTACTGGATTCTGTTATCGGCTAACCGGAAGGGAGTTAAAAAACGGGATTACCGCGCTAACTAGAAGCATGAGGGAAGGTTGTGGCATCCGGTGAACCTGTAGGAATCCCATAACTGATTAAGGACGGAAACCACCAGATGCCACGAAGATGTTATGGCGCATGTTGTTGTGGGAAGTCAATAAAGGAGATGGTTTTGTTTAAAAAATAGTCTATCATGGTGAGAAAATGATTTTGATAAGTAGGCTAACTTTCTGAAAATACTGCGTACAAAAATGCTACTTTTTTCTCATGTTATTAGATAAGTTAATGTTAAATAAGAATATTGGGACGGTCTCGAAAACCGGAGTAGGGGCAACTCTACCGGGGGTTCAAATCCCCCTCTCTCCGCCAATCATTCAACAAAATCAATCACTGACAAAGCGTTTTTGATTTCGTCATACATAAATCCCTGCATTAAAATTTCTTTCACTCGCTCGATTTTTCTTACTACTGATGCTGTTTTTTACCATTTTGCTGCGCGTCGGGCATCCACTTTTATCTTCTTTAGCTCACAAGCTCACACTCTTTTGCAACGCCCATCTCACATCTCTGGTGACAAAACAGTTTAACTGCCGTTGCTTCACACGGATCCTGTGAGGGTATCTTAAAGGAAATGGATTGCTGGAGGATGGAGGAAAAGAGGCGAAATCTTTTTCAGTGGTGATACCGCCTCCCCTGGACATGAGCTTGCCAGAACCTACGCGGTAAATCCTACGACACGAAGCCACGCGCAATTTTATGCGAAAACGCTGGCAATAGCCTTGCTGACTGATGCAAGCTATCTAAAATCGCTTGAAGATGACATCATGCAAAACAATCGCTTAAAAGACTCACCTGTTGATAAGCATGTTGAGGACGAAAAGAAGAAGAAAAATGCAGCCAGGTAAAAAGGGTTACGTTCGCCGTAAGCGAACGTGATCGCATCCAGCCTTAACGAACCTTTTTCCGTTCATCTCCCTTCATGTTTTCAAAAAACTCTTTTATGAATGCCCCTAAAACGTCATACGCGAATTCGCTATTATTGTTCCCGTCGCGACAACATAGGGATTACGTAAAAAAACAGAGGGCAGACATTGAAAATATTGGAAATAACGTAAAACGATTTGTCCGTGTTTTTATCTGTCATTTAAACTTACAGAAATGGCTGTACTGCTATCAGTATAAAGATAATTTCAGTTGTCCAGATAAATCATTTTTAAAACTATTTGTATTAACAGGGTTTATTCATTTATTGAATGGATGCGTGCTCGTTTGGTCTTGTCATCTTGACGCTTTGAGATCACATATTCTATTCTATCATAGACAGCAGGGAAAATTAGCTGCTGGGATCGATTTTTGATTAATAAAATATTGGATGGAACCACGATGCAATTACCAGAACAGGATGAGTTTTCTGATTTTTTTGCTGCCAATGATGATGAACAAGCCTCTTTAAGGCGTAAGTTTTTTTTGGAGAAACATAAAGAACCGTGTCTGTCTGAGTCTGCATTAGAGGACTACCAGGCGCTGTTTATGAGTATCTACGGAATTAATATTGACTGGAAAGAGGGGACTTTTAGCCTGCTTGAGGCACTTTCAGATAATCAGGGAGGGAAGCCTGTCACGGTCAAATTCGATTATGACAGTGAGATTGAAACAGCAACGATAAATTTGGTTGATACGCAGTATGTGTTTCATCACTACCCAATGGGAAGTGATGGTTTTGATACAGAACTGGTGCGCATTGAGCATATATTGGCTAATAGTGGATATAGTTTGCGGGTATATCAGAACAGCACTTTTAGTGATACATTATCGTTCTTACTCATTCCGTCAGATGAGTGGAAACGTGTTGAACAGCATTATAGCCCAGAGCATATTTCTGAATACTTCGTTCCGTATGGAAAACAACTTGTTATTCCTGAGGTTACTGCTCCAGTCGTAAATTATGTGCCATCCGTTAAACAAGAAGCATCAAATGTCCCAGCGTTGTTTAATGCTCGGGGTATCCGTATTTGTTTTTTAAGTATAATGCTAATCGCATTTGCAATTTATATTTTGTGGAATATCCTGACAAAAATAGAGCCTTTATCATCAGGCCAACCTGCTGGCTGTGAAAATCTACAAAATTTATACTCAAAATTACGTCCAGAAGTAGCCGAGCCATTAAAAGAAAAAATGCGTAAGAGTTTGGGCTGTAAATGATAAATTTTCAGTTTAAGAAAATTACACTAATGGATAGGTAATAGAGATATATTTAATTTGATATTGTGTTATCGAAATAGTAATAACAGATAATATACATATTCTTGCGGTGGTGGATGAAGCGATTTTTATCTTATTTGTACAGGAAAAGAAAAAGAGTTTTTTTAGCTACCATGACTCTTATTAATTTTGTTGCCGCAAATCAAAAATACTACGCTTTTAATTTGTTGTCAGTGTTTATCATGGTGTTCTGGTCAGTTCAGTACCTCCTGCAGCATCTCCTTGTCCAGACTCAGGTCAGCCACCAGCTTCTTCAGCCGCTGATTCTCATCCTCCAGTTGCCGCAGACGCCGCAGTTCCGTCACGCCCGGCCCGGCAAATTTTTTCTTCCAGTTAATGGGATGGACTACTTCCTCCCGCTGCGGTTAACTGTACGAAATGTGCTCACCAGGAGAATCACCATGAATATCGTATTTCTGGGTATTGATCTGGCTAAAAATGTTTTTCAGCTCTGCGGGTTAAACCAGGCCGGCAAACCGGTTTATACGAAACGCACTGGCCGAAAAGAATTGCTCCAGACGCTGGCAAATATTCCTGCATGTCTGATTGGGATCGAAGCGTCCACCGGGGCATTTTACTGGCAGCGTGAGTTTGAGAAACTGGGGCACAAAGTAAAGGTCATCAGTCCTCAGTATGTAAGACCCTTTGTCCGCGGGCAAAAAAATGATGGTAATGATGCACAGGCCATCGCAGTGGCTCTGATGCAACCGACAATGCAGTTCGTGCCGCCAAAAAGCCCCGAACAGCAGGATATCCAGGCTTTACACCGGGCAAGGCAGCGTATTGTCAATCACCGCACTGCTACAGTCTGTCAAATAAGGGGGCTGTTACTTGACCGGGGGATCCCCATTGGCAGTGCTGTCTCCAGAGCTCGCCGTGCTATTCCTCTTATCCTTGAAGATGCAGAAAACGGTCTAAGTTCCCGTATGCGCAGAACAATTGCCGAACTCTATGATCTCTTTAACGATCTCGGGCGTCGGATCCATTTTTTTGATAAGGAAATTGAAACAGTATTCAGGCAATCAGAAGCCTGTCCGCGTATCGCCAAAGTTAAAGGCATTGGTCCTAAAACGGCCACGGCCGTTGTTGCTGCTATTGGCAAAGGAACTGAATTTAAGAATGGTCGCCACTTTGCTGCATGGCTGGGTCTGGTTCCACGCCAGCATTCGAGTGGCGACAGGCAGGTGCTGATGAATATGACGAAAAAAGGCGACAAGCATCTGCGGACACTTTTTATTCATGGTGCCCGCGCTGTCGTCAGGGTTGCCACGAATAACAATGATGGTCATATGAATCAGTGGGTTAACCAGTTAAAGGAACGGCGCGGATTTAATAAAACGACCGTGGCGGTCGCTAACAAAAACGCGAGAATAATCTGGTCGATGCTGAGAAATGATACCGGGTATCAGGTAGTGTGTAATTAA
Protein sequences of DBSCAN-SWA_4 >NZ_CP011394|1851257:1904681|1879531_1880071_+|WP_000127618.1|DBSCAN-SWA MTAVFAFVKARWKTIIVLLMLAGAFLAGIIWSDRGWQKKWADRNSMESSQEANAQTAARWIEQGRIIARDEAVKDAQAQAAKSAATAAGLSATVSQLRTEATKLAARLDAAKHTSDLAAAVRSKTAGADAAVLADMLGRLAEEARYYAERSDESYRAGMTCERIYNSVRESTNNPIAPH >NZ_CP011394|1851257:1904681|1869177_1869732_+|WP_000509728.1|DBSCAN-SWA MGHEPEWKVEKQPRWLVAAIKKTISSLHGGYEEAAEWLDVTKDALFNRLRTGGDQIFPIGWALVLQRAGGTYHLAHSVARASGGVFVPLADMEEVDNADINHRLLEAIEQITSYSQQIRVAIEDGVIEPHEKAVIDEELYQAIAKLQQHSTLVYRVFCVPEKGDARECAAPGAVASNFMEKTNA >NZ_CP011394|1851257:1904681|1871103_1872078_+|WP_000096529.1|DBSCAN-SWA MSSLIQLLDRPIAYNPAFAKLKAGKVKAGPVAAVFLSQLVYWHNRMDGGWMYKTQADIASETALTRDEQETARKRLVALGVLEEARRGVPATMHYRINTARLEALLLETAKPVKKGAQEKTRLRDFQNVETPQSGLVQPRKPDCGDAANKNVETPQTSTGQPNEQACGDPTIFPTGDYTETTQEITQESKTPFCPVAEQPDPEVTLTDQAIEVLTHLNQVSGSRYQKSKTSLENIRARLREGYSVADLQLVIDLKHEHWHENDEQYQYMRPETLFGPKKFESYLQSATRWDQKGRPKRADWGAKKRDVMAFGPVDTTIPEGFRG >NZ_CP011394|1851257:1904681|1878917_1879535_+|WP_001075993.1|DBSCAN-SWA MNQQQFQQAAGISAGLSARWFPHIDAAMKEFGITAVNDQAMFIAQTGHESAGFTVLKESFNYSVEALKKTFGKRLTPYQCEMLGRIDGRQVAHQPQIANLVYGGRMGNKDAGDGWKYRGRGLLQITGRENYVKCGAALKLDLISTPELLAQEKHAARSAAWFFTLRGCLMYSGDVVRVTQIINGGQNGLADRNSRYNKARAALLV >NZ_CP011394|1851257:1904681|1859785_1860583_-|WP_000598920.1|DBSCAN-SWA MIKWPWKAQEITQNEDWPWDDALAIPLLVNLTAQEQARLIALAERFLQQKRLVALQGFELDSLKSARIALIFCLPILELGIEWLDGFHEVLIYPAPFVVDDEWEDDIGLVHSQRVVQSGQSWQQGPIILNWLDIQDSFDASGFNLIIHEVAHKLDMRNGDRASGIPFIPLRDVAGWEHDLHAAMNNIQDEIDLVGESAASIDAYAATDPAECFAVLSEYFFSAPELFAPRFPALWQRFCQFYRQDPSQRLRVSAAEGDYGEESEH >NZ_CP011394|1851257:1904681|1869728_1870886_+|WP_001087406.1|DBSCAN-SWA MNSLTVNNRLSQQPGMYEYRPLRHECRLSNSLVVRNHREHSLTVGDESCRNLTAGFGMEGDFMSMSFAGNQKLSALSICARAIRMSVLALCGNSGVILLSVKRQEHIDSAIPGRYTVQAPHKAGAGRGNPEFNIEHNRAHAVFSCHEHCYAQIMVGRAGPVSAGPGSMLTGISTPVRLTTYKVVESLGGEFIEFNIEAATMATVPTLAQPEIRIINGQAVTSSLAVADYFIKRHADVIRKIESLECSTLFRKRNFAFTSISINQPNGGTRKLPCYQITRDGFAFLAMGFTGKRAAQFKEAYIDAFNQMEKQLSTPSVLSDAAHNASVLYSYISSIHQVWLQQLYPMLEKAESPLAVSLHDRINDAAALASLINMTLNRSEVRGRK >NZ_CP011394|1851257:1904681|1862698_1863562_-|WP_000208076.1|DBSCAN-SWA MTTITKEWLQQTIAEFENTRDDIPFGLSDDDAKVLIVLKRALASLEAEPAGYHVIKECGKVGCSVATLEEAEKTRDFWNKKWTIRPYFYTAQPVQETGVYNDVLNIISLLENNEWAEHCTSTVLGSLLESEITRLVGKEQSAPVVTFYRDGVEAAAKWIDQQREAYDSEHGWSDPDTGAFEFGNDAQRGYSSTLEELAEGIRALHPNAGNSPVIPDGWISCSERMPEDEQEVIVHNKLGYRYVSYFDEHSGLFFDMRGGNQMNCIEHIFVTHWMPVPAAPKPEINNE >NZ_CP011394|1851257:1904681|1893520_1894579_+|WP_001066630.1|plate|DBSCAN-SWA MNNTVFLRVNGRDWGGWTSVRISAGIDRIARDFNVSITRQWPGGEDVPPVKNGDAVEVLIGDDLVITGWVEALPLRYDAQTIMTGIVGRSKTADLIDCSASPAQHNGKNLFLIASALARPFGVDVVDAGAPAAAVIEAQPEHGETVVDCLNRLLGQAQALAYDDERGRLVLGRPGSMKAATALVLGENILSCDTERSVRERFSSYLVTGQRPGTDDDFGEATIAAIRQSTGDAGVTRYRPHTIQQSGTATTDSCKSRCEFEARQRAAKTLETTYTVQGWRQGNGELWKPNQAVVVYDPLNGFDNETLVIAEVTYSQDNNGTLTEIRVGPADAYLPEPFRPKAKKKVSEEADF >NZ_CP011394|1851257:1904681|1855886_1856384_+|WP_001084817.1|DBSCAN-SWA MNSIRVIERQHEIDMIEGEAKHKAELERIKSESQYNTSELKEKLERLDIIVKNQEARNNELLEERKKLDDDLNNIFIFVDDLRETVLRVNNAYNRSMKEDDVVRKNMILSPIYEIIKESKFKDIEKYLEKVGVLENSLKRLRDSERKTFSMDELENNDYNAINAD >NZ_CP011394|1851257:1904681|1903658_1904681_+|WP_001028172.1|transposase|DBSCAN-SWA MNIVFLGIDLAKNVFQLCGLNQAGKPVYTKRTGRKELLQTLANIPACLIGIEASTGAFYWQREFEKLGHKVKVISPQYVRPFVRGQKNDGNDAQAIAVALMQPTMQFVPPKSPEQQDIQALHRARQRIVNHRTATVCQIRGLLLDRGIPIGSAVSRARRAIPLILEDAENGLSSRMRRTIAELYDLFNDLGRRIHFFDKEIETVFRQSEACPRIAKVKGIGPKTATAVVAAIGKGTEFKNGRHFAAWLGLVPRQHSSGDRQVLMNMTKKGDKHLRTLFIHGARAVVRVATNNNDGHMNQWVNQLKERRGFNKTTVAVANKNARIIWSMLRNDTGYQVVCN >NZ_CP011394|1851257:1904681|1890221_1892150_+|WP_000785387.1|tail|DBSCAN-SWA MADSFQLKAIITAVDQLSGPLKGMQRELKGFQKEMAGLAIGAAAAGTAVLGALALPVNAAIGFESKMADIRKVVDGLDDKKAFAQMSDDILTLSTQLPMAAEGIAEIVAAGGQAGIARGDLMQFANDAVKMGVAFDTTAEESGQMMAQWRTAFRLTQEDVVVLADKINYLGNTGPANAKKISDIVTRIGPLGGVAGVASGEIAAMGATIAGMGVESEIASTGIKNFMLSLTAGNSATKAQKQAMAFLKLNPRKLAEDMQKDSRGAMLKVLDSLAKVPKAKQAAVMNALFGKESLSAIAPLLTNLDLLRTNFDRVADAQEYGGSMQKEYASRASTTENQLVLLKNSVNAISVTLGDTFLPAINEAAEAVMPYLEQLRTFVRANPELVQSAAKFGAALLAVGVSIGSLSRAVKILNSVINLSPAKVAIAALVAGAMLIIENWDDVAPVIKAVWQEVDNVAQEMGGWETVIEGVGLVMAGSFTVRTIGALQQSVLLAGRLSGLLGKIGRMGAMTLTIGVAVSLFKELKDLEQGAKDAGMDAGAFAVQKLQTKERERGYNGFIPRLKELLGMDTPIPQGRYQPYVPLTRRSGVLGRAVPPSTQRSELKVTFENAPQGMRVTDIPKSGNPLMNISHDVGYSPFRTSR >NZ_CP011394|1851257:1904681|1873840_1874701_+|WP_001061459.1|DBSCAN-SWA MNNLMVIDGIEVRRDVHGRYCLNDLHRAAGGEQKYRPKYWLDNKQTRELIEQLFTEGGIPPSEQNQSVSFFQGGSDTRSLARAPVNTVRGGAEQGTYVCKELVFAYAMWISPSFHLKVIRTFDRITSAPQISSGMAADKMQAGVILLGFMRKELNLSNSSVLGACQKLQEAVGLPNLAPQYAIDAPAGAPDGSSRPTLALSALLKQHGIRMTANQAYQQLAKLGVVEHRERYSRSAINGIKKFWSLTAKGCMFGKNITSPANPRETQPHFFESKFPELLKLLDTVH >NZ_CP011394|1851257:1904681|1886038_1886362_+|WP_000927251.1|head,tail|DBSCAN-SWA MLLSPEEIKLQLRLDEDYADEDKFLELLGRAVQARTENFLNRRLYTAEAGVPADDPEGLILSDDIRMGMLLLVTHFYENRSTVTEVEKVELPMSFNWLVGPYRYIPL >NZ_CP011394|1851257:1904681|1874708_1875698_+|WP_012543375.1|DBSCAN-SWA MRALLTPEIAPRMGVVLFRPGSELMPLFMQGRVLLEPEPEQYSSFACGAVPAVSQPLADDPAVRDVFRNESVIYRAGGLDSLESWLLRGNGCQWPHSVWHSEQMTTMRHAPGAIRLCWHCDNLLREQFTERLESIAVENTTKWVLSVVCRDLGFDDMHAVTLPELCWWMVRNDLAEVLPESAARKALRMPKAIVQSATRESEIVPSVPATSIVQDKAKKVLALRVDPESPESFMLRPKRRRWINERYTRWVKSQPCACCGKQADDPHHLTGHGQGGMGTKAHDLFVLPLCRTHHNELHADTVAFEEKYGSQLELIFRFIDRALAIGVLA >NZ_CP011394|1851257:1904681|1868131_1868827_-|WP_001020644.1|DBSCAN-SWA MNIGNRVRQLRRAKNMKIAELAEAIGVDAANISRLETGKQKQFTEQTLSRLADCLSVDIAELFTSDPKGNTVCKHSDMRKDSANVKDLFRIEILDVSASAGNGLIQGGDVIDVIHAIEYNKDKALAMFGGRPAAELKVINVRGDSMAPTIEPGDLIFVDISINQFDGDGIYVFGFDDKIYVKRLQMIPDKLLVISDNTNYREWSITKDNECRFGVFGKVLISQTQSLKRHN >NZ_CP011394|1851257:1904681|1892183_1893524_+|WP_000863817.1|DBSCAN-SWA MAFFSSTGWRGRLRDASFRGVPFSVEDDESTFGRRVQVHEYPNRDKPWTEDLGRATRRLTINAYLVGDDYADRRDRLIGAIETAGPGTLVHPQYGEMQGSIDGQVRITHSSTEGRMCRVSFQFVESGELSFPVAGMATAKRLETSGGLFDDAIDSMFSTFSLSGISDFIQNDVIADAASMLGDVADAFRMVDSGVSAAMRLLQGDLSVILMPPGAASDFVNALQKAWRSGDRLRGSTSDLVTMIKTMSGITLDPGLSPRGTWPTDSGSAAKQKMQRNMIAAAIRTTAISTAVHAVTTLKQPRDVPDVRGVNQPAGTGRDSDIITVMHPALDGVQTVSNGSFPPNYEDLKAIRTALNAAIDQEQLRIRDDVLFQQISVMRTDLNRDISARLAQVERTALRTPDDVLPALVLAAAWYDDAGRESDILTRNPVPHPGFIPVEPLRVPVR >NZ_CP011394|1851257:1904681|1880591_1881029_+|WP_000501481.1|terminase|DBSCAN-SWA MGAVVRSSGGGRKRNLPSGQKSKLTRIAPPEELMSDIAIRIWKTQSKILIERGVFDLEDAPLLLAYCNAFHLMIEAEKVIAEEGLTVSSEMGGEKKHPAVNVRNDSVSQLARLGSLLGLDPLSRIRMTSGKNDPDDEGNEFDEFD >NZ_CP011394|1851257:1904681|1867413_1867665_+|WP_000078504.1|DBSCAN-SWA MSQKDDIPVFPVTGWQAGPLPGYDALVVKFQFLSSPMQPIESAQETQFLVLTPEMAESLASDLQRHIQDLRNSDVHSPQEGKH >NZ_CP011394|1851257:1904681|1894578_1895112_+|WP_001273650.1|plate|DBSCAN-SWA MANHPLQNMITRAVITAIDTVRKCQTAGLKLIAGEKKENVEHLEPYGFTSAAQNGAEAVVLFPGGGRSHGVAVVVADRRFRLKGLARGEVALYDDQGQSVTLTRAGIVVNGGGKPVIFTNATKARFEMPIESTGDIRDNCDSSGKTMAEMRTTYNGHTHKENGDGGGITDKPGQPMS >NZ_CP011394|1851257:1904681|1851257_1853096_+|WP_001127942.1|tail|DBSCAN-SWA MPLFGSYFQMGDIIMIKKRGFTLLEITIVLGIGSLIGFMKFQDMRKEQEAVMAQAVGFQMKQVGEAVNRYISIRYNKLSTLSSSRSQSSDPGPRTCTANGCEITYQTLVNESLLPSTYAGVNAQKSSYKILLKRSGVSPNYVINGLIATTIPWVEGGKTRYDLLGKSMQSAGIDSGMTKSPAQASGYNGVWTEKSTDYPAINKAGLLVYRVGYDSSMYSVYLRRDGTLPMTGNLNMGTNDINNAKNITASGTGIFGGDVSSAGKIIAAQEIIAHNGYGDAIHFGGDAVNNDYEITMSKDKTLSIHMASNRTDLTTLKISGGLSTIGNGTISGTLDTGKSITSGGQFNGHNGGGDSFSIGGGDANDYEFRLDTVKPLTIWRNGGTSTETRLQVFGKQTNQGDFAITPGTGSTGSIAASGNIQGHTLQPTSTNTTGGTCPSIGLISKDNMGNILSCVNGKWSVVSNLPVGSPVPWPSNTAPVGWLICRGQSFNTALYPQLAKAYPRGRLPDLRGVFIRGLDSGRGLDSGRVINSYQDDQIQNITGHMAADVSQSGNIGKYVSGAFADSGALGEGDEGHKSNEVRKYTFDASRVVRAGNETRPKNVAMNYIVQAQ >NZ_CP011394|1851257:1904681|1853669_1854551_-|WP_000028416.1|DBSCAN-SWA MSISTTMSNINRIQKDIASLQKQLSDEQRKEAQLSGKINQIKRSVTKSTSLSTLNSKMSEISRHKNDISRCNSKKADINKKITAKTGDLHRYQLQLIKEQENDQKKRIAAQKKLEKEQLDYQKKITRELKSQKRAISPSHKFNRPVINKSDETEDITESYDVFISHATEDKDSFVRPLAELLRAKGINVWYDEFSLGWGKSLRKTIDYGLANSRFGVVVLSKSFIKKDWTEYELNGLTAREMSGENQVILPIWHEVSKSDILKFSPTLVDKMALNTSINTIDEIAEQLESLLK >NZ_CP011394|1851257:1904681|1898710_1899310_+|WP_015701331.1|tail|DBSCAN-SWA MVYRTRGNGIMKKYQNIKNFRLIDAPVNRDKTQAEINIGAYFLESDDGQDWYECQSLFSDDTAKIMYDHEGVIWGVVNKPVPQRGNTYSVSMLWPVNMSVAEIDAADCPDDCRGDGTWLYQDGKVVQRGYSPEELRKKAEAEKVRRLAEAESAIAPLARAVKLKIATDEEIKRLEAWELYSVMVNRVDTSAPDWPDIPR >NZ_CP011394|1851257:1904681|1860874_1861864_-|WP_000532847.1|integrase|DBSCAN-SWA MGRKRAPGNEWMPKGVFFRPSGYYWKPGGSTENIAPADATKAEVWVAYEKKVEGRKNRITFTQLWRKFLASADYADLAPRTQKDYLAHEKYILAVFGDAEAKAIKPEHIRRYMDARGQKSRVQANHEHSSMSRVFRWSYQRGYVPGNPCVGVDKFPKPQRDRYITDEEYRAIYNNATPAVRAAMEIAYLCAARVSDVLKMNWNQILEKGIFIQQGKTGVKQIKSWTDRLRDAVEICREWGEEGPVIRTMYGERYSYKGFNEAWRKARKAAGDDLGRPLDCTFHDLKAKGISDYEGTAKDKQKYSGHKTESQVLVYDRKVKMSPTLDRKR >NZ_CP011394|1851257:1904681|1900814_1901036_+|WP_001526483.1|DBSCAN-SWA MFLLYCRGTGEIGGRRPKLSPEQWAQAGRLIGAGIPRQQVAIIYDVGLSIHGLYWILLSANRKGVKKRDYRAN >NZ_CP011394|1851257:1904681|1875711_1876464_+|WP_001047141.1|DBSCAN-SWA MNLEALPKYYSPKSPKLSDDAPATTSESLTITDVMAAQGMVQSKAPLGFALFLAKVGIQNPDFAIEGLIHYAVALDNPTLNKLSEETRLQIVPYLVNFAFADYSRSAASKARCEHCAGTGFHHVLREVVKHSRNGEPVIKEEWEKELCQHCHGKGEVSTVCRGCKGKGIVLDEKRTRFHGAPVYKICGRCNGNRFSRLPTTLARHHVQKLVPDLTDYQWYKGYADVIDKLVTKCWQEEAYAEAQLRKVTR >NZ_CP011394|1851257:1904681|1864479_1864995_-|WP_000071068.1|DBSCAN-SWA MSNRIRNAQVFDARTGEYPVDMYIRWIIGGELDFDANYQRGYVWGHEEQQAFLNAVISGFPIGSVALAKAPDWCSRELPYIEVVDGKQRLTTLKKLITNEIPIILADGPLYWRDMTRAEQLAFGRRPLPAVVLDEVTYKDRLAYFMAVNFTGVPQSEEHKRHVMQLMEAAQ >NZ_CP011394|1851257:1904681|1887807_1887972_+|WP_000497739.1|DBSCAN-SWA MFVKPAKGRSVPDPARGDLLPEGGRNVDENNYWLRREAAGDVRRTNKKVKTNGD >NZ_CP011394|1851257:1904681|1899594_1900602_-|WP_000492926.1|DBSCAN-SWA MFSRVRGFLSCQNYSHTATPAITLPSSGSANFAGVEYPLLPLDQHTPLLFQWFERNPSRFGENQIPIINTQQNPYLNNIINAAIIEKERTIGVLVDGNFSAGQKKALAKLEKQYENIKVIYNSDLDYSMYDKKLSDIYLENIAKIEAQPANVRDEYLLGEIKKSLNEVLKNNPEESLVSSHDKRLGHVRFDFYRNLFLLKGSNAFLEAGKHGCHHLQPGGGCIYLDADMLLTGKLGTLYLPDGIAVHVSRKGNSMSLENGIIAVNRSEHPALKKGLEIMHSKPYGDPYIDGVCGGLRHYFNCSIRHNYEEFCNFIEFKHEHIFMDTSSLTISSWR >NZ_CP011394|1851257:1904681|1854961_1855525_+|WP_000072670.1|DBSCAN-SWA MSNTTDWINTISNVMIAGSAIYGAFKAKSYFKSKNDEEAYNDAKKLIFELYPQYRIDLHKLVIVLLSMTSNRIPNDILETRLEKITNRLSNTQIEIVTISSNIIERHRWKIRDDFKESFEIIKNIYRNKLSFPLINVVDEVSQSTETERQNILRELIGKCHEGIDAIHTFTDNKFRVDEYFVRPFLN >NZ_CP011394|1851257:1904681|1872074_1872548_+|WP_000054227.1|DBSCAN-SWA MSLLAKVQAFIELNPGLTSNEIADAFPEYARFDVQRSASKLYRCKRVNRRLDGDVFRYYAGKDEAVILTLRQKRSGHTGSGDPMVIAKLVSRAEELESRGLFNRASIVWLEAFSESQFIYEREEFLRRRQKCLNRIKKRIRPVEQVYLAGRFVGNVE >NZ_CP011394|1851257:1904681|1868924_1869149_+|WP_001191666.1|DBSCAN-SWA MQSPLRKLRKSHGYTLQHVAKGVQVDPATLSRVERCEQAPSTELAERLAQFYAGEISEMQILYPNRYQLSDSAI >NZ_CP011394|1851257:1904681|1884729_1885959_+|WP_000766103.1|capsid|DBSCAN-SWA MKLHELKQKRNTIATDMRALNEKIGDNPWTDEQRTEWNKAKSELEALDERIAREEELRRQDQTYVDENEEEQRNNQDPDKDPQQDEKRGQIFDKWMRHGASELSSEERKALRELRAQGVAPDEKGGYTVPDTFLAKVVEQMKSYGGIASVAQILATSDGRTMEWATADGTAEVGVLLGENEEAGEEDTEFGMDSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGTGTPKQPKGLKASVTGTTQTAAAGAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERYAEFDQTGFLAFHRFDCILEDTSAIKALVGKGSASS >NZ_CP011394|1851257:1904681|1872544_1873426_+|WP_000200166.1|DBSCAN-SWA MTSESVCIESSDVTISVDESASRTWRRPFLKWAGGKYSMLPDLYQVIPAGMRLIEPFVGGGSVFLNSDKHACFLLADVNTDLINLYQMLAVVPGAVIRHARVMFDRLNDAESYMALREEFNAQVMDAPERAAAFLFLNRHCFNGLIRYNRNNQFNVGWGKYPSPYFPEEEIRAFTEMAHNCVFMAAGFRRTLALAGEGDVVYCDPPYEPMPGKDGFTHYAAGGFTWDDHIALAECCVAAHQRGARVVIGNSTSPRVIDLYSQHGFEIRYISARRSISSKGSTREKAKDLVAIL >NZ_CP011394|1851257:1904681|1896604_1897192_+|WP_001207832.1|DBSCAN-SWA MALQDEYTQLLYHLLPEGPAWDGENPLIEGLAPSLNRVHQRADELMAEIDPARTTELIDRYEQLYGLPDSCAPEGVQTLQQRQQRLDAKANVAGGINERFYREQLDALGYTAATIEQFQNLDSTPDPEWGEFWRYYWRVNIPADANISWQTCTSTCDSAIRTWGDTVAECVIDKLCPSHTVVVFAYPEGKENAQN >NZ_CP011394|1851257:1904681|1878260_1878650_+|WP_001294874.1|holin|DBSCAN-SWA MSEPVSSATVLAGGLMGASVFGLATGTDYGVVFGAFAGAVFYVATATNIGRIRLVAYFITSFIVGVLGAGLIGTKLAAITHYEKPLDALGAVIISAMCIKFLTFLNSQDLNTLFSILSRIRGGGSDGSK >NZ_CP011394|1851257:1904681|1897178_1898741_+|WP_000554738.1|DBSCAN-SWA MHRIDTPTAQKDKFGQGKNGFTNGDPATGRRATDLNSDMWDAVQEEVCTVIEAAGIPLSKGEHTQLHAAIGRLIDEQVKTRLEKNQNGADIPNKPLFLQNVGLGETINLAAGALQKSQNGGDIPDKKQFARTIGAVTSTTITLGESGWFKIATVVMPQATSTAVIKLYGGAGFNAGSPEQAAISELVLRAGNGSPVGITATLWRRSPSAANEVAWVNTSGDTYDIYINIGQYAYWLIAQYDYTGNANVTLHSTPEYSSVQPGNSTSGQTYTLFNSLMKPTAGDVEALSVNGGRLNGPLGIGTDNALGGNSIVFGDNDTGFKWHSDGVLGIYANNALVGYIDNSGLHMSVDVLTNGAVRAGNAKKLSLTSNNNSTMTATFNLWGDANRPTVIELDDDQGWHLYSQRNPDGSIVFTVNGDITANTLRASGAIYQNNGDIFGSLWGNGWLSTWINNNLVLDVQLGAGTSVTTWNNAGSWPNTPGYVVTSVWKDYQGENIDGINYAPLQKRVGSQWYTVQGGTV >NZ_CP011394|1851257:1904681|1861865_1862093_-|WP_001527041.1|DBSCAN-SWA MSDRFLTEEELEDATGASQKSLQKEVLTLNGIYFIERRDGSIRTTWYHINHPVSRLLPPAGYQPVPGMNFDAIES >NZ_CP011394|1851257:1904681|1876513_1877587_+|WP_000357930.1|DBSCAN-SWA MDKILPTMGFVCLSLISLKNPPPSGFFICDHFIFCLAKLLYGQELKLSGDIVLSNNERWVSFFDFAFTPTHAAAPSIPIEDILKKLKVLVSSGSAVKLYNHRSRALRISEMKYSIGDSQATLLIQLCDKNGSDPVFGELTTGNLRVEPKLAGEGIAVSCHIVISTDVVKNTADHHKTLVESVPGISKSVLEPFLNAMLREAFAGCEFKNPATKGMCQHRPKLEIYSHGSQTLMDALKGAKIHNVKLVSTRRKGGLDQTAYTELSERSVKYKIIRQPPLKDKERLLEILRKKGQQSGYTKVSISYSKDGKQASLDLDRNEDAATKLFTKSERVILGNLINQCESTVHLQLETKMIGLL >NZ_CP011394|1851257:1904681|1878636_1878918_+|WP_000226304.1|holin|DBSCAN-SWA MVANDPSAALNAVICGVIVIVLMFYRRGDATHRPLISLLAYVMVLVYASVPFRFVFGLYESSHWLVVMVNILICAAVLWARGNVARLVDALRH >NZ_CP011394|1851257:1904681|1863558_1863852_-|WP_000267991.1|DBSCAN-SWA MWRGLNRGGSQMILTAYEYDPETEKSQSVYLLRHHSKVKKTTLEQKLTVKNDAFGRFKPFVELEDFPEGLSEREAMLKLADWLHRLSVAIEDNWSTP >NZ_CP011394|1851257:1904681|1864991_1865222_-|WP_000764235.1|DBSCAN-SWA MKLEMYTLDGSVIVDSNLVTQFYPDYKSGGELTVIETISATGETFTVRVKHSFLQVTSALATAWSVDEKKAEGAAQ >NZ_CP011394|1851257:1904681|1886358_1886763_+|WP_000776844.1|head|DBSCAN-SWA MKLRQAQASATYLLPDPGELDQRIVIRRRVDVPADDFGVTPTYPEQIRAWAKKAQPGAAAYQGAVQIENRVTHYFTIRFRRGITADHEVLHDDISYRVKRVRDLNSKRRFLLLECEELGTDNGSDYAAESIFTR >NZ_CP011394|1851257:1904681|1889457_1889814_+|WP_000515952.1|tail|DBSCAN-SWA MGKIAGTTYFKIDGQQLSVTGGIEVPMNTKVRDDVIGLDGSVDYKETSRAPYTKVTAKVPKNFPVDKITSSDVMTITSELANGQVYVLSNAWLHGEANHNPEEGTVDLEFHGEEGFYQ >NZ_CP011394|1851257:1904681|1867740_1867926_-|WP_001067433.1|DBSCAN-SWA MNNYYTCSFCGVSELDAKKLIAKGSKDEPAICSECVVSCVNILINYAAVIKPVKLNVTKGE >NZ_CP011394|1851257:1904681|1895522_1896602_+|WP_000785580.1|plate|DBSCAN-SWA MADSQFARPELPQLIATIRSDLLTRFQQDVVLRRMDAEVYSRVQAAAVHTLYGYIDYLARNMLPDMCDEDWLYRHARIKRCPRKNAVSAKGFARWDGIAGTPEIPAGTQIQRDDQVTFTTLQTVKASGGLLRVPVIADVAGTAGNTDDGTALRLGTPITGIPSTGYADTLTGGADTEEPETWRARVMERYYWIPQGGADPDYVIWAKEIAGITRAWTFRHYKGTGTVGVMVATSNPVNPAPGDDLVKAVRDHILPLAPVAGGGLFVFAATEKSIPVTVALAKDTPEIRTAIIAELNALMLRDGAPSGKIYVSRISEAISLATGEVAHQLRVPAADVVLGKTELPVLGNITWATYTGENG >NZ_CP011394|1851257:1904681|1865926_1866844_-|WP_000551790.1|DBSCAN-SWA MHNPFFKNMLIYRFSRDFNIDIDSLDKKLELFRFSPCGSQDMAKSGWFSPLVQYSDVLYHAVNNQLLLVIRREEKIIPKQTIADEINKKVSTLEREQGRRLKKTEKDSIRDEVLHSLLPRAFTKNSLVRIWINTAAGFIVVDTSSIKRAEDSLALLRKTLGSLPVVPLTMENPIELTLTEWVRSEAAPSGFSIGDEAVLKAILEDGGTGRFKKQDLACDEILTHIEAGKVVTQISMEWQQRISFTLSCDGILKRIKFADQLISQNDDIDSEDVVQRFDADITLMTGELSNLISDLTAALGGEAKR >NZ_CP011394|1851257:1904681|1887243_1887804_+|WP_000779215.1|DBSCAN-SWA MKLTPIIAALRSRCPRFENRVGGAAQFKAIPEAGKLRLPAAYVVPAEDVTGEQKSQTDYWQDLTEGFSVIVVLSNERDEKGQWASYDAVHDVRQEIWKALLGWEPDPQAHEIQYAGGMLLDLNRHELYYQFDFTVKYEITETDTRQQDDLDGLPDLKTLSIDVDFIEPGTGPDGDIEHHTEITFQE >NZ_CP011394|1851257:1904681|1887961_1889458_+|WP_001007993.1|tail|DBSCAN-SWA MAISFNSIPSDTRVPLFYAEMDNSAANTARDSGASLLIGHASNDASIAVNSLVLVSSVDYARQICGAGSQLARMVGAYRKTDPFGELYVIAVPESTGAAATVALTVTGEATETGTVNVYTGRTRVQAPVTSGDDAAAVAVSIKDAVNANPDLPFTATSEAGVVTLTARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNITVASGVKGAGAPALNDAVAAMGDEPFDYIGLPFNDTASVNTMATEMNDSSGRWSYIRQLYGHVYTAKTGTLSELVAAGDQFNLQHITLAGYEKDTQTPADELAASRTARAAVFIRNDPARPTQTGELVDMLPAPKGKRFTTTEQQTLLSHGVATAYVESGVLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIRGELGSTYRQLEREGIVENFDLFQQHLIVERNANDSNRLDVLFPPDYVNQLRVFAVLNQFRLQYSEEAA >NZ_CP011394|1851257:1904681|1862132_1862702_-|WP_001061370.1|DBSCAN-SWA MNNLMVDLETMGKKPNAPVVSIGAVFFDPQSGEIGPEFYTAVSLESAMEQGAVPDGDTILWWLRQSPEARAAICADAVSVTTALIEFNDFITCHADDLKYLKVWGNGANFDNVILRGAFERASLPCLWNYRNDHDVRTMVTLGRAIGFDPKRDMPFEGDMHNALADARHQAKYVSAIWQKLIPPTSNNI >NZ_CP011394|1851257:1904681|1889810_1890137_+|WP_000588852.1|tail|DBSCAN-SWA MIKELVLKKPIMAHNEKLHVLELREPSYDEIEAIGFPFTVSGDGGVRLDSSVALKYIPVLAGIPRSSAAQLAKLDIFKACMLILNFFTRSETEEDSESGSTTPHTSGE >NZ_CP011394|1851257:1904681|1880094_1880445_+|WP_001135228.1|DBSCAN-SWA MPPRTPKSCRVRGCRSTTTDPSGYCESHRSEGWKQYKPGQSRHQRGYGSKWDVIRERILKRDKGLCQLCLRAGVVREAKTVDHIIPKAHGGTDADSNLQSLCWPCHKAKTARERLK >NZ_CP011394|1851257:1904681|1881028_1882759_+|WP_000257219.1|terminase|DBSCAN-SWA MATYPNVNAANQYARDVVNGKILACRLTMLACQRHLDDLERAKDPHWPYRFDKNKAERFLRFSQKMPHTSGEWARRKLRIEFEPWQKFALGVPFGWVRKDTGFRRFTEIYIEVPRKNGKSAIAAAVGNYMFCADGEYAAEVYCGATTEKQAWKVFAPALAMVKKLPALRQKFCIKPWAKKMTRPDGSLFAPIIGDPGDGDSPSCAIIDEYHEHDTDALYTTMTTGMGAREQPITLIITTAGFDIASPCYEKRTQVVEILERIREGGENEAIFGIIYTLDDDDDWTQPEALIKANPNYNISVKEGFLKAKQLLAMSTPGQTNKILTKHFNKWVSSKAAYYNLQKWMTAADKTLRLSDFAGEECYPGIDLASKLDLNAVVPVFRREIDGLSHYYCVSPMFWVPEDTVYATDPALKTIADRYQSFVNQGVLVPSDGAEVDYRLILEAILKLRETVKIAASPIDPYGATGLSHMLQDEGLEPVTITQNYTNMSDPMREIEAAIAAGRFHHDGNPLMTWCISNVVGKYLPGSDDVVRPVKEGAGNKIDGAVGLMMGVGRAMLNEPKDFLSNLDPDEELLFL >NZ_CP011394|1851257:1904681|1864123_1864483_-|WP_000065085.1|DBSCAN-SWA MSNIDKLNDHELVDLKNAIERELKRRADGPKVTTYYVVSCITDAQHFTDLDCALRCLKSVTENLMEWVTESPENRDYVNQCTGIVGAKLQVKEMNLDHFNMRVAEKYFDDICYPQETAQ >NZ_CP011394|1851257:1904681|1884117_1884720_+|WP_000003793.1|head,protease|DBSCAN-SWA MSEREIRCYSGEVRAETHDSEPSRIIGYGSVFDSRSELIFGSFREIIRPGAFDEVLNDDVRALFNHDPNFILGRRSAGTLALTVDERGLRYDITAPETQTIRDLVLAPMQRGDINQSSFAFRVARDGEEWYQDEDGVVIREITRFSRLLDVSPVTYPAYQEADSAVRSMKAWQEARDSSALQKAINQRMARERVLTLLNA >NZ_CP011394|1851257:1904681|1877599_1878172_+|WP_000765639.1|DBSCAN-SWA MKLFSPLSYLRIKHEEKDWYDYKIPAAVSLIVTIVYYFHASKISLIETNGLLLQVNGLLQVLIGFYIAALAAVSTFSSSSIDEVMAGVPPTLVEKFRGQKLTVELTRRRFVCYLFGYLALVSFMLFCLGMISILIGKPFHLWLLTFCSPDAILWLKTVFVGVYIFILMNIITTTLLGLYFLAVRFHQSSL >NZ_CP011394|1851257:1904681|1856862_1857114_+|WP_000042271.1|DBSCAN-SWA MSKNVKQTSENVASTAAKTLTDPNASAIQKSLAGSALSQRGTSNQTSGKMEHKASSALDNPRSSELTKQLAASVLAQSNKGRK >NZ_CP011394|1851257:1904681|1873434_1873824_+|WP_000779149.1|DBSCAN-SWA MKLTLPFPPSVNTYWRAPNKGPLKGRHMVSASGRKYQSEACAAVIEQLRRLPKPSTAPAAVEITLYPPDKRIRDLDNYNKALFDALTHAGVWEDDSQVKRMLVEWGPVFPKGKVEITITKFETGAGAAA >NZ_CP011394|1851257:1904681|1886734_1887247_+|WP_001135695.1|DBSCAN-SWA MPQKAYLHVDFEQPETLVFNRARMRRAFVSIGQVHMRDARRLVMKRGRSGPGDNPSYRTGKLARSIGYYVPRASSRRPGLMVKIAPNQKNGEGNRPISGAFYPAFLFYGVRRGAKRKKGHHRGASGGSGWRVAPRNNYMTEVLDKRRSWTRYVLSRELRKSLRPQRRKKK >NZ_CP011394|1851257:1904681|1895116_1895530_+|WP_000605050.1|DBSCAN-SWA MILYVNGIRKDATASLDFLTRAVVISLFTWRRAERDDRTPQPYGWWGDTWPAVQNDRIGSRLYLLKRRKLTNKTPQDAREYMQQALAWMTDDGVAARIDVTSERTGTDTLAAGVTIYQRDGVIHNITFDDIWSKLNG >NZ_CP011394|1851257:1904681|1857185_1858460_-|WP_001680077.1|integrase|DBSCAN-SWA MSLTDTKVKNTRPSEKAVKLTDGFGLYLLVHPNGSKYWQLGYRFDGKQKVFSIGVYPAVSLADARQRRDEAKRLLTQGIDPNAKKQADEKVLQEKRDKTRSFRVVAKSWFATKTKWSEDYADTVWKRLETYVFPDIGDRNVSELDTGDLLVPVKKAETLGYLEIAMRIKQYITAILRHAVQQKLMRHNPAYDMEGAVQKPETEHRPALELEEIPLLLERIDAYKGRGLTTLAIKLNLLIFIRSSELRFARWSEIDFKSKLWVIPEQREAIENVKHSTRGAKMKRQHFVPLCRQALKILKEIRQLTYEEGNEAELIFTGCYDSFKPMSENTINKALRKMGYDTTQDICGHGFRTLACSALIESGLWSEDAVELQMSHKESNSVRAAYTHKAKHLDQRRLMLQWWADFLDENRYEMVRPFEFAQKQ >NZ_CP011394|1851257:1904681|1870882_1871107_+|WP_000620702.1|DBSCAN-SWA MIRNIFKRFTSQRFHCPRPGQWYSTPEGYVLRISLVDRECQKVVCEPLGRNYRVNMPLIAFRSGKNMKHLGGAA >NZ_CP011394|1851257:1904681|1883030_1884125_+|WP_077905357.1|portal|DBSCAN-SWA MKLAAVYACIYVISSNVAQMPLHVMRRTGKKVETARDHPAFYLVHDEPNSWQTSYKWRELKQRHILGWGNGYTRVLRHRRTGEVTGLEACMPWETTLLNTGGRYTYGVYNEEGSFAINPDDMIHVRALGNDQKMGLSPVLQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKGELNDGSWKRLKEMWQKATAMLRSQENRTMLLPAELDYKALTVSPVDAQLIDMMKLNRSMIAGIFNVPAHMINDLEKATFSNISEQAIQFVRYTMMPWVTNWEQELNRRLFTRAEREAGYYVRFNLAGLLRGTAKERAEFYHFAITDGWMSRNEARAFEDMNPKDGLDEMLVSVNASRPAKSTTQENTQDE >NZ_CP011394|1851257:1904681|1865292_1865832_-|WP_000008351.1|DBSCAN-SWA MSFIQTLSGKQFDYLSATIDDIDIEDIAVALSNICRFSGHLPEFYSVAQHSVLCSQLVSPEFAFEALMHDAAEAYCQDIPAPLKALLPDYREIEKRTDQLIRFKFGLPLEEASVVKYADLTMLATERRDLDIDDSIPWVILEGIPPTDLFEIYPLRPGQAFGLFMARFNELMELRQCAA >NZ_CP011394|1851257:1904681|1902378_1903197_+|WP_001176778.1|DBSCAN-SWA MQLPEQDEFSDFFAANDDEQASLRRKFFLEKHKEPCLSESALEDYQALFMSIYGINIDWKEGTFSLLEALSDNQGGKPVTVKFDYDSEIETATINLVDTQYVFHHYPMGSDGFDTELVRIEHILANSGYSLRVYQNSTFSDTLSFLLIPSDEWKRVEQHYSPEHISEYFVPYGKQLVIPEVTAPVVNYVPSVKQEASNVPALFNARGIRICFLSIMLIAFAIYILWNILTKIEPLSSGQPAGCENLQNLYSKLRPEVAEPLKEKMRKSLGCK >NZ_CP011394|1851257:1904681|1882755_1882914_+|WP_000838395.1|DBSCAN-SWA MKSLIIDVAGVAGFGALVGGIYLKFGAAVALMAGGSGLLLWALLAARRIKTC |
65 | Salmonella_phage(76.36%) | integrase,protease,holin,head,terminase,capsid,plate,portal,tail,transposase | attL 1860086:1860100|attR 1908276:1908290 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_5 |
2438536 : 2452812
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP011394|2438536:2452812|DBSCAN-SWA TTCATATCAATACGCGCTCTTTCATCCAGCCATAGACAAACGACTCGTCAGCCTCCCGCTTTTCCGCCAGCTCCAGATAGCGCTCCCCCTGCGTGCAGTTCAGCGCCACCAGCAGTACACGCTCGCCATCCTTACCGCGCTTTTCCAGATAAACACGTAACGCGTTAAGGGTTCGCGGCCCGATGCGCCCGTCCGTATCCATATCCGGATACAGCCTCCCGCCCTGGTTGAACACGTTCAGCCAGCGCTGCAACATTTTCGCTGCCACCGACGGCCCCATGTTCACGCCCGTGTCACACAGTTCGGCAGCAACATCCGGCGAGGCCTTCGCCACCCGGTCAAAGCGGGGACCGTACCAGTAGTCGGTCTCCAGAATTTCCAGCGCCTGTCCACGGGTTAAATTGCGCATATCACCACGGTATCCGTGGGCGCGGGCAACTTTTTCCGTAATACCCCATTTTGTCGGCCCGCCTTTATCGTCCGGATGGTTGACGTAGCCGCCTTCCTTACCCAGAATTTCATCAAAAATTTCGTCCTTCGGTTTCATCGCAGCCTCAGCAGTGAAAGTATTTTCGACACATTACCCCGCGCCCACAGCACCAGCCCGCAGAACAGCGCGTTCATCAGCACCACAGGCCAGGATGAAGACGGGTAATGTCCGGCCAGAAAACGGAACGGAACCAGCGCATAGCCCAGCATCAGCAGGTACGACAGCCAGGCTATTCCCGGTTTGTATCTGGCGCCGTTCCGGCGGTAGAGAAAAAGGGCCAGCACTGTCACCAGGCACAGGCCCGCATTCAGTGTGCCGGTCAGATTACTTTCCATGACCACTCCCTCCTCCCCGCATCCGTGAAAAAACCCCGGTCAGCGATGAAATGTCCTGGTCATGAATGAAGGTCAGAAATTTCACTGACAGCACCGACATCACCACTGCACACAGCGCATCCAGCGGTTTACCGTTGTAGTCAGCAAGGCGGGACAGGAACGAGGCCAGTACAGGCACAGCACATCAACCGCTTCTGCTGTTCCTGCCACGTCAGCCAGCTCACGCAGTGCCTGAGCATTCGCGCTACGCGGTGACGCCTCATAGCGGGCGCGGATATTGGCACTCATCTGGGCAATCTCGCCGGAGAACGGCGCAAAGGCAGGAAGTGCCGCGATGGTTTTCTCCAGCACTGCAATTTCTTTCGCGTCACAGGTGCCATCGGCGTATGCAATGGAATATGCGCCCCAGACGGTCGCCTCCACCGCATCCCGGTTCTCCATCTTCTTCACTTCAACAATAGCCTTGCGGGTTTTCTTTCTGAAAATACCTAACATCGTGACTTTTCCTTTTAGTGGGTGAGCCTGCGCCCGGGGGTGACCAGCCCACAGAGAAAGTCACACTGACCATCCCGTAAGCTCACCCCTGAAAGGCTCCGTGGTTAGTTATGAATGTGCGCCGGGCGTGGCGCGGGGAATGAAATAAGCCTTACCGGAAAATAAGGTTGTTTCCGGGGTTCCGGTCTTATTTTGTTATGAGTAAAATAATGGCGACCTCCCGAAAGGGATAGCCTGAAATTTTTCAACTATCTATCACGGACATTGTCCCGCGGCTTAAATCCGACAGCCGCGCTTCTTTTTTAATTCATTATTTCCCGCCCCGGATACTTCCCGCACCCTGGTCTGAAAACGTTAACTTTATTATATGGCGGGCAGAACTTTTTCCCGGAATAAAAAAAACCGCCTCACGGGAGGCGGTGCTCAATATCTGAAGCATGTTTTATAGTTATCGACTGGAAAAACAGTAGAGGTGTCGGGTGCCTCCCGAAAACACATATTAGCCCGGTATGTGTCTGTGACCATCGGCAGAACATTTTTCCCGGTCACCCCCGCACTGGGGAACACCTCAATAAATAATGCAGTGCCGGAATTATCTCCGGAGGGCCGTTGGTCAGTCACCATGACCCGCAGACGAAAATAAACGCACCTCATATCCTGTTTATGCCCCACTTTATGTGGGTTTCACTGCATTCAGATGGCGTACCACAAATCCCGTATGTGTTACAAGAGCATCAGACAGTGGCGCAAGCAATGACCGTTTTTTTGCAGTACGCCATCTGAATGCAGTAAAAAAATTGCGGTGCCTGATTCACTATCAGGAGGTTAAATTAAAGGAACATTAATGTAACCACCCCATATCCATATGCCTGCACCACAAAGAGTAAGAACGATACCTATATCAATGGAATCTTAATTTCACCTTAATGCCAAATCGAGTGATGATTCTTTAATCAGACGACTCAGGAGGTATAAAGAATCAACTCTCTTATTCGACGGAGTGGAGAAATCTGCCAGTCACCTCCGCCAAACTGTCGACAATAATAAATCATAAAAATTTCATTTCAACAAGCAGTCGCGTCAAGAAATGTAAATTTATCATTAGATGTTTATTTATTCCAAATGTTTTGCATTAATTCAAAAAAGACAAAATACAATCAGGGCATATTCTGATGACTCCATCTTATTTCTGCAATTTGCGGGAATAAAAAAACCACCTCATGAGAGAGAGGTGGTAATCATAACATGGAGTTATTATTATAGTTTTATTGATGGCGATAAAAAACCTGAGGCGCCGGGGACTCCCGAAAAATATTGCTGTGTCAGGTGTTGTATTGACAAGAATTTTCCGTATGTTCTGCCTCTGTGATATGAAGATTTTTAGTAATAAATGCAGTGCCGGGATGCCCCGGAGAGTCTTTAGTCAGCCACTATGACCCGCTACAAATTGATCAAACGCAGACAATGCTGTTTAGCTCTGCTCAGAGCAGATTTCACTGCATTCAGATAGCGCACTGAGAACCCCGCATTACCAACACACTGACATTCTCATTTATCCGATTCACAGTACGCTATCTGAATGCAATAAAAAATTGTGGCACTGAGTCCATCGGGGCTTCTCCGCCCCCACAAACCGGATGCTGTATCAGGATATACATCATGCACTCATCATAAACGCACTATCCACACTTTACGTGCCAGCACCACCAACGGAGTAATTACGATAGTTGTATCGAAAAAAACGCCTCTTCTTCTCTGGTTGAGAGCCGGTTCCTTCACGTGAATTTTCCAGGATGCACAAAAAAACCGGCTCTCTGATCCTCGGGGGCAGAAGAAGGTTATCCATCATCCCCTTCGGACCGATGAAAATAATACGAGTCAGAACAGGCGTTTCAACAATTCATTGTGTCAATAAACGTAAAGTTATTATTACATTTTTATTTTATCAAAACATTTCACATTAATTTAATGTACATAAAACATATTTAAAACGCACCATGTTTAACAACCCGGATTACACTGGCGAATATTTTTCTGTATTGCAGAAACGACAAAACCCGCACGATGGCGGGTTTCAAAATGCGTTCATGTCTGTCATTAGCCTCGCGATACAGCTTTGCGAAGCGTACTGGAATTGAAGCAGTTTGTGGCTCATTTTGCAAATGATTTTTTAAGCATAATCGAACGCTTTTCTCATAGGTGAATACAAAATGAACTCAGCGATACTCAACCACTGCTCAACACGGCGTCGGCATGTAATCAACGCCCACTCTGGGTACTGTTCGTTTAGTAACTCGGCCATCCTTCTCTTACTCATCCCCCGCCCCACGTAACGCTGGTTCAGGATACCAAGCAGCCCCGGATGATCCGTCAGTACCTCGCCTACAACCCGATCGATAATCAGCGCTTCCGTATCCGTACAGTGTGCCAGCCAGCTTTTTTGCTTTCCGTTGATCATCTCTCTCAGGAACACTTCCAGTTCAGGTTTCTCCAGCCCCGCTTTTTTCATACGGCGCAAAGCATCGTTAATGGCCGTCTTCGTCAGTTTTTTGGACGCCAGCAACTGGTTAAACATATTCCCTGTTTTGCCGCCACCAATGTATGACCAGCGCCCCCACATACGCAATTTCCCCTGAATCCACACACTTTCCAGCGTGCTGAGACGAAGGTGTTCTCCGCTTTTTCCGGTGTTTGTTGGGTAAATCATAAAATGCCTTTCTCTCTCCAGATTTCCTGCGTGCGGAAAACGCCCTCTGCGTGCATCAGGCGAATTTCTGTTTTCGTGAAGTCGTCGATTTTTACCCGCCCATCGACAATATCGTGGCACGAGCTACAGGCAATTGCTGCCTGCATATCGTTTGGTTTTGTCGCCGTTCCGCACGTACCCGCCAGCCGGTAATGTGCCAGTACGGACGTTTCAGGATTATGGTTGCAATGGCCGGGAATTCTTACCGTACACATCAGGCCCCGCGCCGCTTTACGTAAATCCGCCATTACGCAAACTCCAGCAGTTGCGCTGCGACATTTTCGACCTCTTCCGGAGAGGAGAATTTACGAAACAGAATCCAGTTCCACAGGACGTTCAGTACGGCCTTATAAACCTGCTGAAACTCGGTTTCGTCCATATTCGCAAAAGCGATGGATTTCGCCCGGCGCCCACGGCTGCCATCCGGATAATAATGCTCAGTATAAAACCCGGCCTGAATGGTTACCCACTCGCGGAAAGCGTCGAAGGACTTGAGCAGCGCAACATCGAGCGTTCGGTTGACAGCAACCTTATGCAGGTACTGTTCCGCCGCGTCACTGAGTGCGGGACTGTGGCCCTGCGCTGCTGACTCACACAGGAAATCGACAAAGCCGGTAATCAGCTCCTGTTCCTGAGGCAGGATCGCCCCGCCGATCGGAGTCCAGTAATCGAAACCAAGCTGAAGGAGCTTGAAAAAACGTTTGTGGAACGCGTAGTTACGTACTCGCTTAAAATCGGCGTGTATCCACGCACCGATTTTTACTGAGCGCAGGAAGTCCTCACTCTCCGGCGTTGCCGGGAGCAGAAGCCCTGATGAGGTTTGTTTAACAAGTTGTAGATGCGCCATCGTTCTCTCCGGTGGCGCTGTAGGTTGCTGATTGTTCAGGTCAGCCGTAACATATTAAAACATTAATAACTGACAGTGAAACCCAGTCTTATCAGATAATCAATAAACGCTTCAACAGACAGAATCAGATGGTCGTCAGGAATTAGCGTACAGAATGAGATTTCACCATTTTTTACACGTACTGCATAAAGCCCGTCTTCATCTAACTCTAATAAATCCTTGAGTTTTTTCACGTTACCTCCAGACAACTAAGGAAAAATGAAAAGGTGCGATTTCAACGCGATTTCTGTTGAGGCGGGAAATATAAACACTGCGACTATTTATTTCATTATATAAATTTGCTTATTTTATGTTCACCAGCAAGGACATTTTTCACTTGTTGCGCAACCAATCTGAAAGTTGATCATTTTTATGAATTTTTATTTTACGGGTAACAAAAAACCCGCCGAAGCGGGTTAAGTGTGGGTGCGTTGAGGATGCCTGACACGTCAGAGGTGGCGGGGATTTCTCCCCGCCAGGTCTCTTACTCCTCAGGTTCGTAAGCTGTGAAGACAGCGACCTCCGTCTGGCCGGTTCGGATTCGTACCTCGCAGAGGTCTTTCCTCGTTACCAGTGCCGTCACTATGACGGTTAAACAGATGACGATCAGGGCGATTAACATCGCCTTTTGCTGCTTCATAGCCTGCTTCTCCTTGCCTTTCGGCACGTAAGAGGCTAACCTACATGTGCAAAGCATGAAATTGGCCTCAGATTAATGTTAAGCGTCTTGCCGGACGCGTAATGTTAACTGGGGCTTTTCTCTATCTGCCTTTTGGTGTTCATGCCTGAGGCAGATAGCCTCAAGCACCCGCAACAATTCTACTTAACTCTCCTTTTCCCGCAAACCGTTTTTATCCCCAACGCAAATTTTACCAATACCCCTTAATACATCTCCCTTGCCCTGACGATACATCCCTCTTTACACAGACCAAAATTTATGTATTATCGCTTGAAAACAATCATTTAAGAGCTATCGGTGGGTGAATTCGCCCTGCGGTAGCTTTTCCTTTATGCATTGCATACATTTATGTTCTAGTATATTCCTGTATACTCAAAAGGATTTTTCATGCACAGCGTTAATTTCTATTCATTCCGCGTATTGACCCATAAAGGCAGTCGAGCCAGCAAAAAACTTAATGACTTAGGTTTAAGTAATAAAAAAACGGCATATGAACTTTTTGTTGATTATTTTACTCTTTATAAAAACACCCCCATCGAGTTCGGCGTTTCAAAAACTAAAATATCTCTGGAACAACATACTAAACTTCACTTTGATAACACAAAAAAAATCATATATGGTTATATAAAAGTTGGAAAATATGGAGAAAGCAGTGAAATAAAAGATGTAAAACTCAAAAAAGTCCATTACAGAACAACTGCTTATGATGTAACACTCAAAGAGCGTTATATTTTAATATATCTACCAGATAATCTTGAAGAAGGAATTATTGCATTCCATTCATGCGATAATATTTCTGCTCGAGGTGTTCTTTCTGATTCTATCACTGAATATCTAAAAAAACAATTTCAACTAGAAGCAAGAATCAATCCATTACATCATAAGAAAATCCCTCAATACATTCTCAATTCTGAATTGAAACAAATTAAAGCTCAAGGATATAAAGCACCAGAAGATATTGCTGATTCCTTTGGTAAAAACAAAACAAACATCAAGACAGACTTAATAATAAAAGCAAACGATGGCATATTCGGAAGTTTCAGGGATTTAAGAAACAAGAATATAGGAAACATCATTGAGATTATTGAAGATAAATGTGATGCAATAAAAGTAAGCTTACAGCTCGGCAGTCGGACTGTCGTTTTCAATTATGATACCATACTAAAAAAAGGAATTTCAGCAGAGTTAGATGATAATGATCTAAAAATCAACCCATTAACAGGTATACCTGATCTAACAGCACTTCATGACACGATAAAAAACCTTTCCAATGATATATTGGAAGAACTGCACTGTGGAAATAAAGGGGTGATTATATGAATAAAATAAATGTGCTGGGTGTAATAATAAAACACTACAAAACAATGTCAGATCAGCGTGGAACAATGTTGATGAGCGACATTACCGTACATTTTATAGTTCCTCTATCTCTTTCTTTCGTTCTGTGCTGGACATACGGAATAATGAAACCGGCAATTGCTTCCGTCTTCGTTAACTTCGGGGCTATTACAACAGCACTATTAATGAGTGCAGTAATAATGATTTATGAACAAAAACAAAAAACCATTACTAAGATATCAGATATAATTGAAGGAAACAAATCAAGAGACAAATTGATATCATTAAACACTAACAAAACCATATATGAGCAGTTATGCCACAACGTCGCTTATGCAATATTAACTTCAATAGTATTGGTTATATTTTCAGTAATAATATATTTCCTGCCTGACAATGCAGTGGATTTAATGAAATGGTATTTTCGCGCACCCGCATATATCGTTAGCTTTTTAGCCTATACATCCTTTTTTATCACTGTCATAACTTTCTTAATGGTAATAAAAAGATTTAGCACGATTTTAGACAATTAAGCAATGGAGCTACCGCCCTTTCGGGCGGTCTCCTGGTGTTCTGAGCGTGCAGGAATCCATCCGGTTAAGGATTAAAGTTTATTTACATCACTAAATTTAATTATTCATATTTGGATTATGCTTTCTCTTTCACTTCACGCAGTTCCGATTGTTAATTTGGCTCACAACAGCACCTCCTGAAAGTTTCCCCGATAAAACGCCAGTACCCGCTGCATGTACTCGCTTTTACGACACTCACGACAAATTACGTTGTGGTGCCTGTCGTAACGACGTACCTCACCGTCAGGCAGCTTCCAAATCAGGTCAGCGTCCACTTTCGCCTGTTTCTTCCAGGCACGATAAGCCTCTTCTGACGGGAAAATCCCCCCTCTTCTGCCCGCCTGGTACACATCCCCACAACGTTCTGCCTTGTCCAGGTAGTGGCGGGTCGAATAAATGGTTAACCCCGTCATCCTCCTCAGCTCCGTAAGTGTTATCAGGCCGGAGTTCATGAGGTGTAACCTCGAAATTCGTAGCCTCTGCGATACGCAATACCTTTTCGGGGCTTAACTGACTACGGCCAGTAGTAACAAGGCTAATCATCGATTGCGAACAACCAGCCAGCGTGGCCAAACAAGACTGTCGTACACGATTTTTTTTCAAATATTCATCTAACGTCATAAAGGTCACCTTAGTAATTCTCGCTAAATATTAACCATACTAATTTAAATGATCAATACTTATATCAGTTTGAGTTTATGAGTCACATTCATAAGATGAGCGCATGAGAAAAAAACGTGAAGAAATAGCTCCACCAGAAGCTACCCAGCGCTTACGCGCCATCTGGGACGCCAAAAAGCGAGACCTCAAACTTACTCAGGAGATCGCCGCTGATCTTATGGGCTTTGAGACACAATCTACCGTCAGTCACTATTTGAACGGTAAGGCACCTCTCAACACTGATGCGGCCTTAAAATTTTCTGTTCTATTGAGAGTTAAACCCGAAGAGTTACGACCGGATTTGGCCGATCTAATGAACTACGTTCGTTCTTCTGGTACTTACGATGACAACTTCGAAGGTGGTGGCTGGCGAATGGTGAGCAGACAACAGGCTGATTTACTAAACCTTTTTGATATACTCCCTGAATCTGAGAAAGAAAAACTCATTGACCGGCTTAAAGGTCAGAATGAGCTATATAAAGAGGCTTTCCAGAACATGCTCGCTGCACAAAAGCGCCTAAAAAATCAGTGACGAACCAACCACATCAAACCGCCTCTCCCGGCGGTTTTCTTTTGGCTAAAGCTCCTCCCTCCCCCTCTTGCATCAGATAAAACACATTTATTTTCATTAGGATAGATCATATTTATGATATATATGTATATTTATCTTGATCAACTATATGAATATCGCTAATATAATCTCAGAAACAGCACGGCGCTGTAGGTTTTAGTTCCGCCACCCGGCGTTAAGGGGAGAGGGAAAGATGGGAAGGAATGAAGTGATTCAGTATTTGATGGATAGTTGCAACGTCAGCTTTAGCGCAGCTCTCCAAGCATTGCGCGACAATGGATGGGATATGTTTTTGGCTCAATGCGAGCTACAGGAACAGTATTATCCGGGGTGATAATGGACAAGCTGCAAAAAATCCACCTCGGCAATAACGAATCCCTGGTGTGTGGCGTGTTCCCCAACCAGGATGGAACGTTCACCGCCATGACGTACACCAGAAGCAGGACGTTTAAAACTGAAACAGGCGCACGTCGCTGGTTAGAAAGAAACTCAGGTGAGTGATATGGATTTCGACACAATCATGGAAAAGGCTTACGAAGAATACTTTGAAAGCCTTGACGAAGGAGAAGAAGCACTCAGTTTCAGTGAGTTTCTGCTGGCGCTTTCAGCTAACGGCTAATATCGAACCGTTTTGTGCAGGGATTCCAGGTGGAAAGCATTAACGACTCATACTCGGGTTCGATTTTGAACATCACAGAAACACCGTCGATACACGAACCAGCCGGACGAACGATACCTGCCTTAACCAGAGCATCTGCATCAGGGTTATCCCAGGACGCCCGGAAAGTGGGAGAATGAGTTTTAAGAAAGGGTTCAAGTAAAAACAGCTGTTCAGTACTCAATGAATTAAGCGTTTTTAACATGCGTTTTGCCCTGCCCCTACGCTGGCACAACGGCAGCGCCGACACGAAAAAATAACCAGTGACTTTGAGGATCAGCACTATAAGGTAAGCCACAGCAAAACTAAAAATCTGAACGGCATATGGTAAATCGCTACGCGCCTCAATAAGCTCCGTAATATCTCTAGGAATGAAGATGATAATTAAGAAAAATAAAGCCACTGTGAGCATAAATTGTCCGACGGATTTACCAATTAGGAATTTTACGATAACCAGAGCGGTTTCTGACATATCAATAACTCTCAACTGTAAGGGTATTGAAATGTTAACACAGGTTCTCGCTGTAGGGGTATAGCCGAGACCACCGAAGCCCGGAGGTGGTTAAATAAAGCCGGGCACAACACGAAGGCGCATTTCTGATGTTTTCTGAGTCGGTCTTGTCTGTAAATCCAAATAGTGGAAGTGCGCCTCCGGTTGTAGTTGCCACTGCGACAATAATGCTGTGTGTAGTACTTGGCGGCATCAGTTTTTCTTAGTCCTTTCTGATGTCCGCCCTTTTTAAAGTGAATTTTGTGATGCGGTGAATGCGGCTAAGCGCACGCGGCACAGTTAAAACTCCCTGAATCAGTATGGGTGGTTTAAGTCGGCATTAATTGTTAACTGGTTAATGTCACCTGGAGGCACCAGGCACCGCACCACAAAATTTATTTACCAGAAATGGAGGGGCTATGATTGCTCATCACTTCGGAACGGATGAAATACCGCGTCAGTGTATTACGCCGGGAGATTATGTTATCCATGATGGTCGTACTTATATCGCCTCAGCGAATAACATTAAAAAACGCCGTTTATATATCCGTGATTTAACAACGCAAAGATGTATTACCGATTGCATGGTAAAAGTCTGGCTGAACAGAAATGGTCTGCCTGCCAAAGCTGAATCATGGTAACAGAACAGTAATCGTTTAAACCATCCTGTTTTTAAATATGCCTGCAATGGCAGGAATTCACTCAACCTGAAAAAAGGAATCTATATGAAAAATGTACCTGAATCAGTAATTGCAGAGCTCCGCCAGCTTTCAGGAAAAATTCGTACGTTATGTATCGAAAACAATATGCCCTGTGTTGTTTCTTATGCCCGGGACTGTGACGATGAAAGTGTTTCCAGAACTCTTGTTGCATATACAGACAGTGAAACAGGCGCATATGACAGGTCAATAACAGCCGCAATAATGCTGTTAAAAATGAACGAAGCTCCTCCCGAAAATTTTATTTCATTGTTGAAACTGATGGAGTGTAAAGAGCTCGTCACGAACGCATTCTGCTCAATGAAAAATGAAAGTCTTCATTAAGTATGATTTGTAATAAGGGTAATGATAATGAGCGACAATAAAACAGAATACTCATATTATATTAAGGTTAAAAATGAAAGCGCCCGGAAACGTCTCGGCTTCCCTTTTGCTTTCTGGTGGAAAACTGAAAGCAGCGAAGCCGCTGCCACAGCACGCCTTGCCGTATCAATGCTTGACGCCGGATTCGAACCGACAGATTTTGCAAAACCGGTTCGCGTTAATTCCCCCGCTGTTAACGAACTTCCGCCGGAGGGAAGTTTTGATACCACCTTCTGTCAGAAATATGAGCTGGGCGGCGAAGATGGCAAAACATTTATGCTCATCCCCGGCACGCCCGCTACTGACGCCCACGACGAAAAAACGGAGGAATGCGCCGACGACGCTGGCACCGAAGAAAGCGGGACAGACACCAGTGACAACGACGAATGTCAGGACTGCGAAGTTTCCGTCGCCACCCTGCCGTTCCCCCAGCGCGTGTTGCACATTTTTACTTACGCTGCCACAGACAAAAAATATTTGCATCACGCCACCCGCGCTCAACGCAGGCATATTACCGTTCTCGAAATGGAACAGGAAAACAGCTATATCCAGAACCTGTTAATGGTATTGCGGAAGTCTGAACAGGTTCATGCCCAGGATGAGTAAAGGAAAACAGAAACCTTATTACTCGGTAAGCAGTTTAGGGGCAAGGTGGAATGCGGCAGTAAAACGTGCTGGTATTCGCCGCCGTAATCCGTACCATACGCGACATACTTTTGCCTGCTGGCTGTTGACGGCAGGAGCGAACCCGGCATTTATCGCCAGCCAAATGGGGCATGAAACTGCGCAGATGGTGTATGAAATTTACGGTATGTGGATTGATGACATGAACGACGAACAAGTAGCGATGTTGAATGCACGGTTATCGTAGTTGCAAAGTTTGCCCCCAATTTGCCCCATTTAGTACCAGAGAACTGAAATAATGCAAGAAATTCAAAAGAATACAAAGAAAGAACAATACAACCTCAACAAGTTGCAAAAGCGCCTGCGCCGTAACGTTGGCGAAGCGATTGCCGATTTTAATATGATTGAAGAAGGCGATCGCATTATGGTTTGCCTTTCTGGCGGCAAAGATAGCTATACGATGCTGGAAATTTTACGTAATTTGCAGCAAAGCGCCCCGATCAATTTTTCACTGGTCGCCGTCAACCTCGATCAAAAGCAGCCAGGTTTTCCGGAACATATCCTGCCAGCCTACCTTGAGCAGCTGGGCGTAGAATATAAAATCGTCGAAGAAAACACCTACGGCATTGTGAAAGAAAAGATTCCGGAAGGAAAAACCACCTGCTCGCTGTGCTCGCGTTTGCGTCGGGGTATCCTGTATCGTACGGCGACTGAACTGGGCGCGACCAAAATCGCCCTGGGCCACCATCGCGACGATATTCTGCAAACCCTGTTTCTGAATATGTTCTATGGCGGAAAAATGAAAGGGATGCCGCCGAAACTGATGAGCGATGACGGCAAACATATCGTGATCCGCCCGCTGGCTTACTGCCGCGAGAAAGATATTGTCCGTTTTGCTGAGGCCAAAGCCTTCCCTATCATTCCTTGTAATCTGTGCGGTTCGCAACCAAACCTGCAACGCCAGGTGATTGCCGACATGCTACGCGACTGGGATAAGCGCTATCCTGGACGGATCGAGACGATGTTTAGCGCCATGCAGAATGTCGTGCCGTCTCACCTTTGTGACACTAACCTGTTCGATTTCAAAGGAATCACTCACGGTTCCGAGGTCGTCGACGGCGGCGATTTAGCGTTCGATCGTGAAGAGATTCCCTTGCAGCCCGCTGGCTGGCAGCCGGAAGAAGATGACACCGCCTTAGAGGCGTTGCGGCTTGATGTTATCGAAGTGAAATAATCTGCAGGCGTCTCAGCACTCCGCTGAGACGCCATGCTATCGATCATTTTAATAGCCGTACCCGGCATGACTTGCCTTTGATCTTCCCGTTTTGCAACTGCTTCCAGGCTTTTTGCGCTACTGCTTGACGTACGGCGACGTAAACGTGCATTGGATGCACGTTAATTTTGCCAATATCCGCCCCGTCTAATCCAATATCGCCGGTCAGCGCGCCCAAAATATCTCCCGGACGCATTTTCGCTTTTTTGCCGCCGTCAATGCATAGGGTAGCCATCTCTGCGGCCAGAGGGAGTGACGGCTGCCGGGCGGGCGCATTCAGCCAGTTCAGCTTGAGTTGCAGCATTTCTGAAAGAATATTCGCCCGCTGCGCCTCTTCCGGCGCGCAGAAACTGATCGCCAGGCCGCTGCTTCCCGCGCGCGCCGTACGGCCAATACGATGGACATGCACCTCCGGGTCCCAGGCCAGTTCATAGTTAACCACCAGTTCGAGCGATTTAATGTCTAATCCTCGCGCGGCAACGTCGGTGGCAACCAGAATGCGCGCGCTACCGTTTGCAAAACGCACCAACGTCTGGTCGCGGTCGCGTTGTTCCAGATCGCCGTGGAGCGCCAACGCGCTTTGTCCTACCGCATTAAGCGCATCACAAACGGCCTGACAATCTTTTTTGGTATTGCAAAATACCACGCAGGACGCTGGCTGATGCTGGCTAAGCAACGTTTGTAGCAGCGAAATTTTTTCATGCGCAGACGTTTCGAAGAACTGTTGTTCGATAGCCGGTAGCGCATCTACCGTATCGATTTCAATACGTATTGGCTGCTGCTGTACACGACCGCTAATCGCCGCGATGGCCTCAGGCCAGGTTGCTGAAAACAATAACGTCTGGCGCGTCGCAGGCGCAAAGCGGATCACCTCATCAATGGCGTCACTGAATCCCATGTCCAGCATTCGGTCTGCTTCATCCATTACCAGAATATGCAGCGCATCCAGCGATACGGTTTCTTTTTGTAAATGATCCAGCAGGCGCCCCGGCGTCGCGACAATGATATGCGGAGCGTGCTGAAGCGAGTCGCGCTGTGCGCCAAAGGGTTGCCCGCCACACAAGGTCAGAATTTTGGTATTTGGCAGAAAACGGGCCAGGCGACGTAACTCTCCGGCAACCTGATCCGCCAGCTCCCGCGTCGGGCACAGCACTAATGCCTGTGTCTGGAACAGAGTGACGTCAATTCGATGCAAGAGCCCAAGACCAAACGCCGCCGTTTTGCCGCTACCGGTCCTGGCCTGCACACGCACATCATTACCCGCCAGAATGACGGGTAATGCTGCGGCCTGAACAGGCGTCATCTCAAGATAGCCCAGCTCAGTAAGGTTATTGAGCTGGGCGGCGGGCAAAACATTCAGGGTTGAAAAAGCGGTCAC
Protein sequences of DBSCAN-SWA_5 >NZ_CP011394|2438536:2452812|2442848_2443448_-|WP_000940751.1|DBSCAN-SWA MAHLQLVKQTSSGLLLPATPESEDFLRSVKIGAWIHADFKRVRNYAFHKRFFKLLQLGFDYWTPIGGAILPQEQELITGFVDFLCESAAQGHSPALSDAAEQYLHKVAVNRTLDVALLKSFDAFREWVTIQAGFYTEHYYPDGSRGRRAKSIAFANMDETEFQQVYKAVLNVLWNWILFRKFSSPEEVENVAAQLLEFA >NZ_CP011394|2438536:2452812|2448874_2449096_+|WP_000560208.1|DBSCAN-SWA MIAHHFGTDEIPRQCITPGDYVIHDGRTYIASANNIKKRRLYIRDLTTQRCITDCMVKVWLNRNGLPAKAESW >NZ_CP011394|2438536:2452812|2446800_2447268_+|WP_001227859.1|DBSCAN-SWA MRKKREEIAPPEATQRLRAIWDAKKRDLKLTQEIAADLMGFETQSTVSHYLNGKAPLNTDAALKFSVLLRVKPEELRPDLADLMNYVRSSGTYDDNFEGGGWRMVSRQQADLLNLFDILPESEKEKLIDRLKGQNELYKEAFQNMLAAQKRLKNQ >NZ_CP011394|2438536:2452812|2442025_2442562_-|WP_000640113.1|DBSCAN-SWA MIYPTNTGKSGEHLRLSTLESVWIQGKLRMWGRWSYIGGGKTGNMFNQLLASKKLTKTAINDALRRMKKAGLEKPELEVFLREMINGKQKSWLAHCTDTEALIIDRVVGEVLTDHPGLLGILNQRYVGRGMSKRRMAELLNEQYPEWALITCRRRVEQWLSIAEFILYSPMRKAFDYA >NZ_CP011394|2438536:2452812|2449180_2449498_+|WP_000800272.1|DBSCAN-SWA MKNVPESVIAELRQLSGKIRTLCIENNMPCVVSYARDCDDESVSRTLVAYTDSETGAYDRSITAAIMLLKMNEAPPENFISLLKLMECKELVTNAFCSMKNESLH >NZ_CP011394|2438536:2452812|2449525_2450143_+|WP_001676915.1|DBSCAN-SWA MSDNKTEYSYYIKVKNESARKRLGFPFAFWWKTESSEAAATARLAVSMLDAGFEPTDFAKPVRVNSPAVNELPPEGSFDTTFCQKYELGGEDGKTFMLIPGTPATDAHDEKTEECADDAGTEESGTDTSDNDECQDCEVSVATLPFPQRVLHIFTYAATDKKYLHHATRAQRRHITVLEMEQENSYIQNLLMVLRKSEQVHAQDE >NZ_CP011394|2438536:2452812|2439349_2439538_-|WP_001688615.1|DBSCAN-SWA MPVLASFLSRLADYNGKPLDALCAVVMSVLSVKFLTFIHDQDISSLTGVFSRMRGGGSGHGK >NZ_CP011394|2438536:2452812|2443510_2443681_-|WP_000734094.1|DBSCAN-SWA MKKLKDLLELDEDGLYAVRVKNGEISFCTLIPDDHLILSVEAFIDYLIRLGFTVSY >NZ_CP011394|2438536:2452812|2450459_2451395_+|WP_001156217.1|tRNA|DBSCAN-SWA MQEIQKNTKKEQYNLNKLQKRLRRNVGEAIADFNMIEEGDRIMVCLSGGKDSYTMLEILRNLQQSAPINFSLVAVNLDQKQPGFPEHILPAYLEQLGVEYKIVEENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMKGMPPKLMSDDGKHIVIRPLAYCREKDIVRFAEAKAFPIIPCNLCGSQPNLQRQVIADMLRDWDKRYPGRIETMFSAMQNVVPSHLCDTNLFDFKGITHGSEVVDGGDLAFDREEIPLQPAGWQPEEDDTALEALRLDVIEVK >NZ_CP011394|2438536:2452812|2444553_2445486_+|WP_000556390.1|DBSCAN-SWA MHSVNFYSFRVLTHKGSRASKKLNDLGLSNKKTAYELFVDYFTLYKNTPIEFGVSKTKISLEQHTKLHFDNTKKIIYGYIKVGKYGESSEIKDVKLKKVHYRTTAYDVTLKERYILIYLPDNLEEGIIAFHSCDNISARGVLSDSITEYLKKQFQLEARINPLHHKKIPQYILNSELKQIKAQGYKAPEDIADSFGKNKTNIKTDLIIKANDGIFGSFRDLRNKNIGNIIEIIEDKCDAIKVSLQLGSRTVVFNYDTILKKGISAELDDNDLKINPLTGIPDLTALHDTIKNLSNDILEELHCGNKGVII >NZ_CP011394|2438536:2452812|2443971_2444184_-|WP_000882662.1|DBSCAN-SWA MLCTCRLASYVPKGKEKQAMKQQKAMLIALIVICLTVIVTALVTRKDLCEVRIRTGQTEVAVFTAYEPEE >NZ_CP011394|2438536:2452812|2439078_2439360_-|WP_000445513.1|holin|DBSCAN-SWA MESNLTGTLNAGLCLVTVLALFLYRRNGARYKPGIAWLSYLLMLGYALVPFRFLAGHYPSSSWPVVLMNALFCGLVLWARGNVSKILSLLRLR >NZ_CP011394|2438536:2452812|2445482_2446037_+|WP_001033796.1|DBSCAN-SWA MNKINVLGVIIKHYKTMSDQRGTMLMSDITVHFIVPLSLSFVLCWTYGIMKPAIASVFVNFGAITTALLMSAVIMIYEQKQKTITKISDIIEGNKSRDKLISLNTNKTIYEQLCHNVAYAILTSIVLVIFSVIIYFLPDNAVDLMKWYFRAPAYIVSFLAYTSFFITVITFLMVIKRFSTILDN >NZ_CP011394|2438536:2452812|2442558_2442849_-|WP_000774470.1|DBSCAN-SWA MADLRKAARGLMCTVRIPGHCNHNPETSVLAHYRLAGTCGTATKPNDMQAAIACSSCHDIVDGRVKIDDFTKTEIRLMHAEGVFRTQEIWREKGIL >NZ_CP011394|2438536:2452812|2447652_2447808_+|WP_085981757.1|DBSCAN-SWA MQKIHLGNNESLVCGVFPNQDGTFTAMTYTRSRTFKTETGARRWLERNSGE >NZ_CP011394|2438536:2452812|2438536_2439082_-|WP_000802786.1|DBSCAN-SWA MKPKDEIFDEILGKEGGYVNHPDDKGGPTKWGITEKVARAHGYRGDMRNLTRGQALEILETDYWYGPRFDRVAKASPDVAAELCDTGVNMGPSVAAKMLQRWLNVFNQGGRLYPDMDTDGRIGPRTLNALRVYLEKRGKDGERVLLVALNCTQGERYLELAEKREADESFVYGWMKERVLI >NZ_CP011394|2438536:2452812|2447915_2448437_-|WP_000004762.1|DBSCAN-SWA MSETALVIVKFLIGKSVGQFMLTVALFFLIIIFIPRDITELIEARSDLPYAVQIFSFAVAYLIVLILKVTGYFFVSALPLCQRRGRAKRMLKTLNSLSTEQLFLLEPFLKTHSPTFRASWDNPDADALVKAGIVRPAGSCIDGVSVMFKIEPEYESLMLSTWNPCTKRFDISR >NZ_CP011394|2438536:2452812|2446198_2446528_-|WP_001676916.1|DBSCAN-SWA MNSGLITLTELRRMTGLTIYSTRHYLDKAERCGDVYQAGRRGGIFPSEEAYRAWKKQAKVDADLIWKLPDGEVRRYDRHHNVICRECRKSEYMQRVLAFYRGNFQEVLL >NZ_CP011394|2438536:2452812|2451438_2452812_-|WP_000123686.1|DBSCAN-SWA MTAFSTLNVLPAAQLNNLTELGYLEMTPVQAAALPVILAGNDVRVQARTGSGKTAAFGLGLLHRIDVTLFQTQALVLCPTRELADQVAGELRRLARFLPNTKILTLCGGQPFGAQRDSLQHAPHIIVATPGRLLDHLQKETVSLDALHILVMDEADRMLDMGFSDAIDEVIRFAPATRQTLLFSATWPEAIAAISGRVQQQPIRIEIDTVDALPAIEQQFFETSAHEKISLLQTLLSQHQPASCVVFCNTKKDCQAVCDALNAVGQSALALHGDLEQRDRDQTLVRFANGSARILVATDVAARGLDIKSLELVVNYELAWDPEVHVHRIGRTARAGSSGLAISFCAPEEAQRANILSEMLQLKLNWLNAPARQPSLPLAAEMATLCIDGGKKAKMRPGDILGALTGDIGLDGADIGKINVHPMHVYVAVRQAVAQKAWKQLQNGKIKGKSCRVRLLK |
19 | Escherichia_phage(66.67%) | tRNA,holin | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_6 |
2677237 : 2693352
Sequences of DBSCAN-SWA_6
Nucleotide sequences of DBSCAN-SWA_6 >NZ_CP011394|2677237:2693352|DBSCAN-SWA GTCAGTTTGTGAAGTTGCTCCCCGACCGGGAAACCATCACCAGCGGCCAGACGGAAGCAGACGTGGTGTACTGCCCACGGACCCTCAGAGAGACGCTGATATCCACGACAGGTGAAGTGGTGTAGACCGAAAAGACGACGGTCTGATACATGGCCGGAATCCCTGCGGTATACGGCATAACCTCCGCCGTTTTCACCTGGCCGTTAATATTTATCGTGACGGTGATGGCACCGGTGCCACCGTTACGCTCACAGTTAGCCATCACCGTGATGGTTTTCCCTATCTGATAGGTGGCGCTGTCGGTATACCGTGTTGAGGTGCTGCGTTCGTCGTTCGTCGCCCTGATGCTCACGCCCTGCATGACTTTTGAGCCGCAGATATCACCGACAAACTCTCTTGCTTCTATCACGCCAGAAAACTTACCGGAGGTGGCATTGATTTCTCCCGTAAACGAGCCAGATACAGCGTTGATATGGCCGCTGATATCCGCATTTTTCGCAGTCAGCTTTCCATCCGGCGTCAGGGAAAATGCCGGAGGATTCCCGCCACTGGTAATGGTCGGCGCGCTCAGGTATTTCAGGAACGCCTCATTCATGATTATCTGGTCGCCCTGCATGACGAATCCGGGCGTCTCGTTTCCGTTTGCCGGGTTAATATAAGCAATGCGATCCGCCGCCACCAGGAACTGGCTTATCTTCCCGTCAGGCGTGTCTTCCATGCTCAGTCCAAGTCCGGCCACATAATATTTGCCGTCTTTGGTCTGCTCTATTTTGACGCCCCACGTGGCGCTCCATTTATCGTTAGCGTCCTGCCACTCCTTCGAAAATTGTTGCAGTTTGCTGGCGTTATCCTCCGTCAGGTCAATTTTTTTTCAGCAACTCCGTACCAAGATACGTCTCCGTAATCAGTCCCTTAAAAAAATCCAGATACCCTTTCGCGTCATCCCCCGGACGTCCGGATGCTTCCGCAAATACTGATTTTCCAGCCAGATTTACACTGCGCACGTAAAACCAGGCATCATGCAGTGGTTTCAGTCCATCCTTTATCCAGAATGACCCGACGCCCAGATACTGTGCTTTTGACTGAATATCGGCGGCAGTCGCCAGTTGCGTGGCGGAGTACCAGAACTCATACTGCACACTGGCATCGTAGACAGTCTGGTGCGGCGTCACCGTTATCTGAAAATAACCCGGCGTCATCTCAATCGTGGATGGCGCTTCCGGTGCCTGAATACTGAATGCCACGGACGCCGGTTCACCCTGCTGCCCGTAACCGTTTATTGCCCTGACTGTCAGCGTGTAGTCACCCAGTGGCAGTTCGTGGAAGGCGTACTCCGTTTCGCTGGTCGTCGCCGTTGTCACCAGACGAACCGGATCGCCCTCGTTCCCACTGCCTGTGGTCAGCCTCACCACAAAACGCACATCTTTTACCACCCGCGGCGTGCCCCACTTCGCTTTGGCCTGATACAGGGTACTGTCGTTATCCGTGCTGACTGTCAGATGCTGCACAGCGGGCGGAATAATGCTGTTGGTGGTCCCCGGTAACGGGTCAAAGTGCGCCCCGTTGTCCACGATGGACTCTTTTTCCGGAACGTGCTGCAAGGCAGTGATGGCGTATGTGCCGTCGTCATTCTCCTTAATACGCACGCAACGGAAAAGGCGGCGCTTCAGGGAGGGCAGTTTCAGCCCCCAGATACTGTATGGCTGCACGGTTTCCGGCAGGACTTTCGTTACCACCCGATCCGGTGCGGGCTGCGACTGAATCTCCGTACTGAACGGCTTACCGTCAGGCCCGACAATATTCAGCGTGGTGGCGCCGCTTTCCGGTAGTGTTATTTCCCGGTCAAGCGTCAGCGTGCGGGTGGAAATATCCAGGTCAGTGATACGCCCACCGACCGACGCCCCGGCGTAATCGTTGTCGCAGACCTCAATAATATCGCCCGGTGTATGACGCAGACCTTCCGCACCGACAGAAAAATCCACGGTCTGCGTTTCCAGCAGCTCCGTCATCATCACCCACAACCCCGTCCGGTGCGCCTGTCCACGTGAGGTACAGCCGAACGCGTCCATTTTCAGCAGATTGCGTCCATAACGGGCCTGTGAGGCATGGTCTTCCACCAGCTCCGTGGAGGTTTGCCAGCCATTCAGCGGATCGGTGTATCTCACTTCTATCGCGTTATGGCGGTCTTTCAGGGCACTGAAGCTGTATTTAAAGCGCCCGCCCACCACGTTACCGTTGGTGTAGGTCCATGCTTTATCGGAGGGGCGGTCCTGGATGAAGGTCATTTTGCGGCCATTCCATACCGGCATACAACGCATCACCGAGCAGAAATCCGCCAGAACGTCATACGCCTTACGCTGGGTGGTAATATACGCATTAAGCGTCATGCGGGGTTCCGTGCCGCCAAATCCGTCCGGCACCGGTTGATCGCAGTACTGCGCGATGGCGTACAGCGCCCATTTATCCACATCCGCCCCCCCGATACGCCTGCCCAGCCCGTAACGGGGGTGGGTCAGTTTATCCATCGTGCACCACGCCGGGTTATTCGTGTACGCCGGTTTAAACGCCCCGTCCCACAGGCCGGTATATGTGCGGGTATCCGGGTCATAGTTTGAGGGGACCTGAAAAATACGTCCGCGCAGGTGGTAGTTACGCGTGACCTGCTGGCTGCCGAACTGTTCCGCATCCACCAGCAGACCGGCAACCGCTGTGCCAGGATAACCCTGCCGGATATCGATGATTTCCGTATACGACGACCACAGCGTTTTGTTCTGAAGCCTGTCGGTGGTGCTGTCCGGTGTCACCCTGACCATGCGGACACTGAACGGGCGCGGCGGTAAATTATCAGCCACTACCGATGCCAGATATTGTGTTGTGATCTTGCCGTTAATAGTGATATCAAATTCTGTGTTCCAGATCCCGCTACGCTGAAACTGTATCAGCAGATTCACGGAGGACGGGTTACGGTCCCCCTTGTCCATGGTCTCCTGCAGCATCTGTACACCAAAGGTGAAGCGTAGCCGGTCGACATTCTCTGAGACAACAGTACGGGTAACGGGATTATCGTGTTTCACTTCCACACCCAGCACCGTTTCCGCGCCGGAAGCCTCAAAACCTTCCAGCGGTGCCTGTGGTGTCTCCCCCACCTGATATACCACGGTCACGCCGTGAATATTACTGTTACCGTCCGCGTCCACCACCGGCGTGTTATTAATCAGCACGCTCTGCAGACCGTTCCGCTTTGTATCTGGAGGGTACGATCCGCACTGCTGAGACAGGGCTTACCGTCAGGATGGCTGTGTACCAGCGCCACGATGTCGCCGCGGTTCCGGGCATTCAGGTAATCCTCCGGGGATATACGAAAATACATCGTGGGTTCAGCAGACAGATTTTCACACGGAAAATACCGCTCTCCCTGTGCTGTTCTGACCACATAACCGCACGATTCCGCAGGCGCACACTGTCGGGCATGTGCCAGAATGTCATCGTTAATCATGGGAACCTGTTAAGACAGTTTGTTGATGGAAGCGAAAAATCCGGCATTCACCAGATTGTTACGCATTTCACAGCCTTTCATGCAGTGGCTGCATTTATCCTTTTTCGGGTCTGAGGTGGGCTTATCGAACTCATCGGCCACGGGCGGGCCGTCGTATCCGCAGTTTTCATCCCGGTAATCCCACGGACAGGAGTCCGCCAGCATGGTACGCCCCGGCACCACAGAACCGTCGGTTTCTGCCGGTGATGCCAGAATAATGGTAGCAGTTGATGAATCCAGTTCTGACAACTGCTCCACGTTATAGCGCGCTACCGCCTCCTGCTCCGGGTCAGCGCCCGGATTGCCGTTACTGAAATTCACCGCATCAAGAAACTTGCTGTAAACCTGATGCCTTACCACTGACGCGCCGACGAGACTTTGCAAATCCTCCGCCATCCCCGTGACCAGACCAAAGAGATTGGCAACAACGAGGTTCGGGCGGGGAGATGCGCCTTTCCCGTTCATCTCAAAATCCTGTACCTGTATCGGGTACGGTTCGTACTGCCTCCCCTGCCAGGTTAACGGCTCGCCTTTTTCGTTCGGTTCGTTACAGAAGAAAAAGCGCTCACCGCCAATCGCGGTTAAATCAAATTCCCACAAATCCACCTTCGCGGACTGCTCCGCTTTGGTGGTCTCGCTCAAGGTTTCCTGTGGTATATCCTGCATATATGAGAGATCCTTTATTATTTATCTTGCAAAAATATACCTGCTTTTATTAATGGTATTTACGATACAACCAAAAAACGAGGTAACTAATGAAATACACAATATTGTCGCTGGTAGCTGGTGCGCTCATCAGTTGTTCAGCAATGGCAGAGAATACCCTGACTGTAAAGATGAACGATGCCCTGTCCAGCGGAACAGGAGAAAACATAGGTGAAATCACAGTTTCAGAGACACCTTACGGTCTGCTTTTCACTCCTCACCTAAATGGTCTTACGCCAGGAATTCACGGCTTCCATGTCCACACAAACCCAAGTTGTATGCCGGGAATGAAAGACGGTAAAGAGGTTCCGGCGCTCATGGCCGGAGGACATCTTGACCCCGAAAAAACCGGGAAACATCTTGGCCCATATAATGACAAAGGGCATTTGGGGGATCTGCCTGGACTGGTTGTCAATGCAGATGGTACAGCCACGTATCCGTTACTGGCACCACGCCTTAAATCACTGTCAGAACTGAAAGGTCACTCATTGATGATCCATAAAGGCGGTGACAATTACTCCGATAAACCTGCTCCACTGGGTGGTGGCGGTGCACGTTTTGCCTGTGGTGTCATTGAGAAATAACAGCAACATAGCCATATCGTCATAATTTCGTTTTACCCATAAAAAAGCCCTCTCACTGGAGGGCATTAAATCTGTATCGATGTTAAAGGTCAGAAGCTGTAACCTACGCCAAGCACCTAGGTTCCAGCTTTGACGTCACTGTCAGCATCAGTGGAAAAACTTGTATGCTCATAAGACGCATTAACGGCAATATTTTCAACCGGGTTAAGCTGAATACCTGCCCCATAAGCAAAGGCGGTTTTATTGTCAGAATTTCCCCAGTTATCCTTAATATGTCCGTTTGCTGCACCAATCATCACGTAAGCATTCAGATAGTCGTTAAAACGGTATGAAGGACCAACAAGAAGGGAGGTATAATCAGCATCACCTACCTTAAACGCTCAGCACCTCGCTGGCCGGGATCAGAGTTCTCATCTCCTTTTCCAGCTCAATACGCGTCATTTCCGACTGAAACCATGCCCGTCGATCTGAGGGTTTCATCTTATTGGGGTCATTCTCCCCGGTAACAGCCGGCATCGTCATCATGGCGGTCAGAATATCCACCAGCCGGTAGATTTTCAGGTTACTACCGTTCCCACCTGAGGTTTTTACGCCCTTCAGCCTGCTGGCGATGGTCTGTCGGTGTGCACCAGTGATGGCCGAAAGCTGTGTGATGTTTAATTCGAGGCTTTTAATTTCCTGATCCACAGTCGTGCTCTTTTCCTGTATACGGTGAAAATGGCGTTCAGTGTCGAACAAAAAACGTACCACCTCGACACTGAAAACAGTAAATGTATTGATTTTTAAGGTTATTTTTCAGTGCTGACAGAGACTAAAAAATCAAAAATCAGCCGATTCCCGCGAGCCCGAAGCCACCCGTGGCGCCCCCTGCCCGGGAGTACCTTTTTAATACAGTCACCATTGGTTACTAGTTTTTCCTGCATTACCGTGGCATTGGGTGCGAATGTACGCCTGCGCCCCTTCCAGTTGCTTTTGCATCGTCTTCACTCGCTCTTTGAGGGCGAAATAATCCCGTTGAGCGGAGTCTGCCAGTCTGGGGCTGGCTGCATTATCCATGCGGGCGGTGGAGGTGGATTTACCTGTCGGCACTGCGGGGCATGTTGCGTTGACGTACAGGCGACGGCGGCCAGCGGCAACATCATCGCGCAAAGCATCATTCTCAGCTTTCGCATCGGCTAATTCCTTCGTGTATTTTTCATCGAGGGCGGCAACGTCACGCTGGCGCTTAGTCATGTCGGTAATTGTCGCGTTCGCCAGCGTCAGCCTATGAGTAACGGTATCGCGCTGCTCTTTGTAGGTGATGGCGTTATTTCGGTAGTGATTTGCCAGCCGACCGGCAACAATTAGCGAGACAAGCAACAGGCCAACAAACATCGTTTTCCAGTTGAACATCATGACAGGAACAGAGCACGCTCCGCCTCACGCCGACGGGTAAGCCCGTTCAGTACTTTGCCACCAGCCTTATTCCAGCGCAGGAACTCATCAGCGGCGCCAGCGTAATCACCAGCGTTTAGCTTCCGCAGCAGAGTTGATGAGGATAATGTCCGGGCGCCGAGGTTGTACGCGAACGACACCAGCGCATCAAACTGGCCTTGCGTCAACTTGACCTTAACCAGTCTGGACACATCATTTTCATAACCGACTAAACCAGTGTTAAGCAAGCGCTCGGCAGTAGCCTCGTCAATCATCATTCCGGGCTTAACTGGCTTACCGTCAACAGAGTGGGTCCAGCCATAACCAATCGTCCAGGGATCTCCCCCCGTTCCCGGGTCCGGATAAGCTGTCAGGCTACAACCTTCAAACTCTTTGATTAGGGTAATGCCTTTTTCACTGATTCTCATCATTAACCCCTGCACGTTTTTTGAGTGCGCTAATTGCGATTTCGCGCAGCTTGTCCACACCGACAAAGCCAATAATTCCGCCAACGAAAGGCGAAATGGAAACCGGCAGGCCTACCACATCAAGCGCACTGGTGACACATAAGGAAAGAGCGCCACACAGGACGCCCTCAAGCCATTTATTTTTACGGGTGGCGCCGTCGTATATCAGTCGGCCGTAGGCAATGAGTCCGGCCATTAACGCCCCAAGTATCTGGGGCCACGCATTTTTGAGTCCGGTCAAAACCGCAGCCCAGAATTCAGGAGTCTTGTCATTCATTTTCATAAGCCTCACCTCCGATGATTTCGGATGGTAACTAGAGTGAGTGAAATGGTTGGGTTGCAGGGTTTAATATCTTGTAAAACAGGATTGCCTGTGGTTGCAGAATCTGAAAGTAAAATCACGCAGAGTACAATTTTAATGGAGGTGAGGCACAAATACTGCAAATTTAGCTTTTAGTTTAATTGATTGCGTGCTGAGTGAATTCTGTTTGACAAAAACATGCTATTTATAGAATGTTAATTCCATGTAATAAAAAGGATGTGTAACTCATCATGCCAACGGGAATTAAACCAATATTTATCAATAATATGATGTCAACATATGGATTATCCCATCCTCATGACAGCAAGGTATTTCCAGACCTTCCAGAACACCAAGATAATCCTTCGCAATTACGCCTCCAACATGATGGTCTTGCTACCGATGATAAAGCCAGGCTGGAACCAATGTGTCTTGCTGAATACCTTATCTCTGGACCAGGAGGAATGGATCCTGATATCGAAATTGATGATGATACCTATGATGAATGCCGTGAGGTGCTATCACGCATACTTGAAGATGCATACACTCAAAGCGGGACATTCCGCAGACTGATGAATTATGCCTACGATCAGGAATTGCGTGATGTAGAACAACGCTGGTTGCTGGGAGCCGGAGAAAACTTTGGTACTACCGTAACTGATGAGGACCTGGAGAGTTCAGAAGGCAGAAAAGTGATTGCCCTCAACCTGGATGATACAGACGATGATTCAATACCAGAGTACTATGAAAGTAATGATGGCCCACAACAATTTGATACAACACGCTCATTTATTCACGAAGTTGTACACGCGTTGACTCACCTTCAGGACAAAGAAGACAGTAATCCAAGAGGCCCGGTAGTCGAGTATACCAATATCATTTTAAAAGAGATGGGTCACACATCACCACCAAGAATCGCCTACGAATTTAGTAATTGACACTCATCAAAAAATGCAAAATCCCACGATGCTACAACACAGTAACCAGTTCAGGTCTGAGCTAATACAGGTCAGCAGTCCATAGACACTGGCTCCTGTCAGGATGCCACCTGCTAACCCAGTACCAGAAATCGATTCGGACATTCATCCCCCTCTGGTTGTGTGGGGCCTCTCAGTTATGAGGGGAAATAATAAATATCCTCCGGCATAGCCGGAGGATATTTATTCATAAAGAACACAATTAAGAATAATACCGATTTAATTAAAATAACTTGATCTCACAGTTGAAGAATGAATAATAGCGAGCCCTGCCAAGGCAGGGCATAGAAATAACCAACGAGAAGAAATAGGTAGGAACTAATGAAAAACACCGCTCTGGGTAAGTTCATTTTTATCGTCGGCACCGCGTTACTGCTCGGTGGCTGTAGTGGCATGGTCATGCCTCCCTATGCCACCCACGGTACATCGGTCGGAATCATTGCGCCAGCGGGAGGCTATAGCGAGTGGCACACGGATAGCCGCAACCACACCACAGGAGACAGTCACAGCCAGTCACAGGGAAACTGCACCCAAAGTGAAGATAGCCAGCTCAGCGAAAATAGTCTCACACGGACACACCAAAGCAACTGTAACACCCGTAGTCAAACCCACAGCAGTAGCACCAGCAAAACCCGCTCCAGCAGCGTCGGTTTCAGCGTCGGGGGGCCTGTTGGTGCTAGCATAGGGTTGATCAAGCAGATGGAGTCGATGAACCGTGCGCCAGCCAACGATATGAGTAGTAATGAGATGTTCAAGAATTTCGGTTTCTAGCACATAACGCCACCTGGTACCGTTGTGGTGTCTGGCCCGGCGGCTATCTGTAACGACTCACAATCGAAAAAAGTCAGACTCGCAATCAGCGCAAAAATAGATTGTGAGTCCATTGAATGGGGATCGTTGTGCATTTTCATAAGCCTCGCCCCCGATAGCTTGGATGGCGCTGTATTTGTAAGGGTGAGAGGCCCTCGGGCGGGGTTTTAACAACGAAGCGTGTAGATGATGATTTCCGAGGGCTGAATAAAAAAACCGGCGAAAAGCCAGGAAGATGTAAATAAGGCCATTTCGACTCTGTGGACGAAGATACCCTAACATTAGTTTGATGTGTGGCAATTCTTACCGAGGGTGTTGAGCAATCCTCTCTAAACTATTTCTGCGAGGCTATATAAAGTTCATGAGTATCAGCTAACTGCACAAATTTGTACTTAATAGCCCCCAATAAAGTACCTAATCTTGTATGACCCTCCATAAGGTGTAAGCCGCTCTCTCCAGGAATAATAAGCGAACGCTCAATAAACATCGGTGGTTCAGCCCATGTACCGAATTTAAGCCAATGGTTTGCAACCTCTTCACGGGCATCAATGCAAAACTTGCTGCCGCAGGCATTAAAGTCTTCTGAAATCTCGAGCATGTAATCAGGATATGTGGCATTTCTGCCAAACTTTGTAAACTCTGCTGTTTTCAATCTGACCAAATCCCACTTCAGTGATTTAAGATTTAGATGCCCATACAAGGTTTGAAATTCAGAATTATTAGATAACCCACAATAAATTTGCTTAAAAATTTGTTCTGGAGCTTCGATCCCATATTGCTCACGAAGGATGGCAATTCCTTCTTCTTCCTTATACAACGGGTCGGGACCAAAAACTTGAAATAAATCACGATAAAACATCATCAATTCCTATTGCTGATTAATACAAAAATCCCGTTCCTCAGCAGGCTTGCATATTTTAGGCATGATATCAAATTTACATGAAATATATGTATTTCAGTTCGGTTTTGCAAGACTTATATCCAAATTTGTCGCCTTTTGTTGTGAACGCGATCGTGTTACGGAGATAAGCGCATCACTATCAAGCCGTTTAAAGCTGTTACGCATTACTAACCAATGAGACATGTACGTCTCTGTCCAGGTAGATTTTGCCACGCCCATCAGTTCCGCCAACGTTTGGTACTCATACGTCTCACGTCTGGCCAGTTCCGCCTTCACATCCTGCGCAGCAAGCCAGATGAGCTGGCGCAACCGTGCTAAAGTCTTCTTCGCAACTCGCTTCCCTTCCAACTGCTGGCTGAATTTTTCCCAAGCCCATCGAGTTACAGTGACCTGATATTCCCAGCAGGTGTTTTCACTGTAATTCCACAATAGCCAGGCCTTATTATGTTCTTCGAGTGACAAAAGCGCACGGCGCCATGAGGAAGTGGAGTATTCTACGGGCTGCACAAGCGGAATTGCGCTTCCTTTCGCCAGTGATTGCTTACCAGCAATCGGTGGATTATTCAGCGTTATCATTTTTCCGGTCACTTCATCCCTGATACGCTGTTTTTTTCGGGGATAGTTTTTCGTATAGAGCTGGGCGTTCTCCAGCCATGCCTGCAACTGACCTTTCGTGGCGCCACTCAGATCTGCAGTCGCCACGATGAGCTGCTGTCGCACATATTCCAGATATTGTGTATTCATACGGTACCGCCCGTTATCTTCACGTAGTTTTTCAAAATCCGGTAATCGATCAGGATGGAACCCGGAAATGGTATAAGCACAACTGCTACCAGCGAGCACGGAGATGATCGGCAAAGTAGGATTCGAATGTCATGCCGCCTCCAGCTTTTTTAGCGCACGCAGATCCGCCAGTGCAGCGATCCTGATTTCTTTCAGCTCCTCAACCGTCCAACGTTGCGGAGTATTATTGCTCTCAAGTTCCAGCACCTCCGCCTCACCGTAACGCTCAACCTTCACGCCACGGCGCCGGTATCCCGCCACCAGCTCATCCGCCTCTTCGGTGGTACACACCGGATGCTGAAACCATGTCATTTTCATGCGAACTCCAGCAGATGCGCGGCCACATTTTCAACTTCTTCCAGAGAGGAAAATTTACGAAACAGAATCCAGTTCCACAGGACGTTCAGCACAGCTTTATAGACCTGTTGAAACTCGGTTTCGTCCATACTGGCGAATGCTATGGATTTCGCCCGGCGCCCGCGACTGCCATCCGGATAAAAATGCTCGGTATAAAACCCGGCCTGAACGGTTACCCATTCCCGGAAGGCATCGAAAGACTTAAGAAGGGCGACGTCCCCGGTTCGCAGGGTAGCTACGTTATGGAGGTACTGTTCCGCCGCCTCGTTAAGGGCCGGGGTATATTCCTGGCCTGCGGAGTCGCAAAGAAAATTAACGAACCCGGAGATAAGTTTCTGTTCCCGCGATGTGACCGTGCCGCCCGTTGGCATCCAGTAGTCGAAACCAAGCTGAAGGAGTTTAAAAAATCGTTTATGAAAGGCGTAGTTGCGGACACGTTTAAAATCGGCGTGTATCCACTCACCGATTTTTACTGAGCGCAGGAAATCCCCACTCTCCGGCGTCGCCGGGAGCAGAAGCCCTGATGAGGTTTGCTTGACCAGTTGTAAATGCGCCATCGTTCTCTCCGTTGGCGCAGTAGATTGGGAGTTCAGCCCGCAGACGAGTATAACAAAGGATGATTATTCATGATAACCGGCCCTGATAGTCAGCTCATTAATCAGGGTATCGCTCCCCATGATGTCATTTTGCAACAACGGCAGAAACCGGACATAGCGGCCATCCCGATACATCAATGACCTGTTGCAGTCAGGAAAAAAATCCATTTCAGCAATTACTGTCATGTCATCACGGCGAATAACAGCATATTTACAAGTGAATGTTTTATTTAAATTTTTCACGGTGTCTCCATAGATAACGAACTTGAGCATTTTTAAAGCATCTTCATTCTCAACATGAATATATAGGAGACTATTAATTATCATCATCAATAAATATATCTATTTTTTGACCATATGCAATGACATTTTCTCTGTGTTCTATTTATAATCTTATAACTGGTTATTTTTTGACATGCTCATTTCCCGGACATTAAAAAAACCGCCGGCGCAGGTATTAAGTGCGGGTACATTGAGGTTGTCTGACACATCACAGGTGATGGAGATTCATCCCCCAAGGTCTCTTACTTAGCAATGAAGACAACTACCTCCTCTCTGTCTGGCCGGTTCGATCGCAGTCTCTCCTCGTTACTGGTGCAGTCACTGTGACAGTGATGCAGATGATAATCAGGACGATTAACATCGCTGCGGTTGACTTATCCGGCAAAATTATGCTGCCATGATGCCAGTTAACCATACTGGCATCATGGCCAACCGGCATCGAAAAGCATGTTGGCCAGACTCGCAGGCCATTGAATCACGCAACAACCAGTTACTGTCATCTGATGAAAAAGGCTGTGCATAACAGAATCAGAACTGACTGGTATCAGGGCCATGTTCTTCAGCAGCAAATACATAAGATGAAGCAAGATATAAAGAATGAAGGGAAAATAGAGTATAAAAAACGTACAGAATTGTCTGAAGTAACTTCCCTGCAGCATTGACGCCGCAGGGAATCTATTTATGGTGTAACTATATTGAACCAGAACTCAAACTTGTCCATATAGCCCAGCATCTCATCCAGTTTCGCAGCATTACCGGTAACGTTGACTTCTCCTTTATCTTGAGCCTGCTTCAGAGTTTCTTCCTTCAGGATAATTTTATTCAGCGTGTCACGGTTCAGAGTAATCGTGGCATCAGCATCTTTCGCTTCAGCATTAGCCGTGTGGTTCAGCACGCCATTTTCCAGCTCAAGCTTGTACTTTCCGCCGTCGCTGCCAAGGTCAATATTAAATACCGCCCGGGCATTACCCGCTTTTTCACCGTTGATATGTACAGCCAGAAAGTCGAAGAACATTTCAGGGGTCATCGCCCGAACGGTATCCGGACTTGCTGTATTTGGCGTCGGACCTTTAACCACACCGTTACGCAGCTCCTGCGCACCGGTCAGGTAGAAGTTACGCCATGGACCAGATTCAGCCTGATACCCCAATTGCTCCAGCGCATCGGCTTCAAGGTTACGTGCATTCTGGTTATTTGGATCGGCAAACACGACCTTACTCACCACCTGAGCAACCCAACGGTAGTTCCCCTGGTCAAAGTCTGCTTTAGCTTTCTGAAGAATCGCATCGGCACCGCCCATGTATTCAACAAATTTCTTGGCCGCTTCTTCGGGTGGCAGCTCATCAAGGGTTGCCGGATTGCCATCGAACCAACCGAGATACAGCACATACGTTGCTTTTACGTCATGGCTGATGGAGCCGTAATAGCCGCGGTTGGCCCAGGTTTTTGCCAGGCTATCCGGTAGTTTAAAGTTGGCCGCTATTTCGTCGCGAGTCAGACCTTCATTGGCCATGCGCAGAGTCTGGTCATTGATATAACGATACAGGTCTCGCTGGCTTTTCAGCAGACCAACAACATTCTCGTTACCCCAGGTCGGCCAGTGGTGCTGGGCCATAATAATTTCAGCTTTGTCACCCCAACGCACTATAGCTTCGTTGATATATTTCGACCACGGCAACGGCTCACGAATTTTTGCGCCACGTAGCGAGTAAGTGTTATGCAGGGTGTGAGTGACGTCCTCTGCGGCTTCGATGAGTTTCTTCTCTTCGATGAACCACAGCATTTCCGAAGGGGCTTCCGAACCAGGGGCCAGCATAAAGTCGTAAGTCAGGCCATCAATCACTTCTTTCTGGCCGTCTTTATCGATGATATTAGTGGGCGCAATCAGTGTCACCGTCCCCGCAGAGGTGGTCGTCCCCAGTCCGGCGCCAACCTGGCCGGAGGCATCTGGTTTCAGGAGGTTGCCATACATATAGCTGGCACGGCGGCTCATCACGTTGCCGGCCATAATATTCTCGGCTACTGCTGCCTCCATAAAGCCAGCAGGCGCATACACTTTCACCTTGCCGGATTTCACGTCCGCTTCATCGACAACGCCACGCACACCGCCATAGTGGTCAACATGGCTATGAGTATAAATGATGGCGACAACAGGCTTATTGCCACGGTTTTTGAAATACAAATCCATACCGGCTTTGGCTGTTTCCGCAGAAACCAGCGGATCGACAACCGTAATCCCCTCTTTACCTTCGATAATCGTCATGTTGGATAAATCAAGGTTACGAATCTGGTAGACGCCGTCTGTGACTTCAAACAAGCCACTGATATTGATTAGCTGGGACTGACGCCACAGACTAGGGTTAACAGTGTCAGGAGATTTTTCCCCTTCTTTTATGAAAGCGTACTGCTGTGGATTCCAGATGACATTCCCTTGCTCTCCCTTAATCACCTCTTCAGGTAAACCAGCGATAAAGCCTTTATGGGCATTCGTGAAATCGGTGTTATCAGAGAAAGGAAGTTGGTTATAAAGCGCATCGTTAGCTTGCTTGGTTGAAGCAGTGGCACCTTTTGGGGCTTCCTGTGCAAATAAAGGTGTCAGCGCAGTGGAAGAGAGTAGCCCCGCCAGCGCAAAACTTTTAACGATCAACTTAAGTCTCATTTGTACCCCTCATGTAAAAATATTCTGTATCACTCAGTCTGGTAGATTAATTATCTGTTAATTCAAACAATTAAAGTTATTGCTGACCATTTTCTCTCTTTTAAATATAACCAAAACGTTACATTTCGCTATTTATGGATACAAATAAATCGTGTTTTACGTCAGCCAGTTCCATCCTCTTTTAGTAAGTGGGGTAAGCTCGCTTCCCGTTTCCGGGACCCTGCCTGACTGAAGAGCAGGCTGACAGGATACGCGCCGCGCATTGGGCAGGATTTACAGGACGGCGCGCCGGAGATGTAAAAGTTTTACCGGGCGGCGGTCAGCAGTCCTTTAAATTTTACCCTGATCATTGATGTTCAACCCTGACCGACCGCCACACCGTATAGTTGGCGGCGGTCATGAAGTAAAGAGACATGACTATGAGCTTTGTGAGACTTGAAACCTGGGGTGAATTAAATTATCCCGATGATCCACCACCTCTCACAACACTAAGACGATGGGCGCGAAACGGAAATATTTACCCGACTCCAGTATTACATGGCAGGACGTATCGGGTTGATCCGGACGCGTTTTATATCAAGCCGAATAAAGTGGGACTTGTGCTTGAACAGCACCACCCAAACGGGCGCACCGGAAAACCGAGTGCATTGCTGGAGAAGTTGATCAGTGAGTCGAAAAAAGTACGATGCTAACCTTCCGAGGAACCTCACCTACCGTAAGGCCAGTAAATCTTTTTTCTGGCGTAACCCGCTAACTGACAAGGAATTTCCGCTCGGTCAGATCGCCCGCAGGGACGCTATCACACAGGCCATAGAGGCAAACAACTTCATAGCGCAAAACCACACACCAGTGGCGCTTATTGAAAAGCTAAAAGGAACTGACTCATTCACTGTGTCCGCATGGATTGATCGCTATGAGGTTTTATTACAGCGCCGGAGTCTGTCGGTTAATACCTACAAGATTCGCGGTAATCAATTAGCGACCGTACGCGAAAAAATGGGGGAAATAATACTGGCAGAAGTAACAACCAGGCACATTGCCAAGTTTCTTGAGTCGTGGATAACCGAGGGAAAAAACACTATGGCGGGAGCAATGAGATCAGTTCTATCTGACATGTTCAGAGAGGCTATTGTCGAAGGGCATATTGTGAAAAACCCGGTGGAAGCAACCCGGATACCAGAGATTAAGGTGGCCAGGGAACGCCTGCAACTGGAAACGTATAACGCCACACGAGCGGCAGCAGAGCATATGCCTGCATGGTTCCCTCTCGCGATGGATTTAGCGCTCGTTACTGGTCAACGTAGGGAGGATATCGTAAATATGAAATTTAGTGATGTTTTTGACAACCGCTTATACGTAACTCAGATTAAAACCGGAATGAAAATAGCCATTCCCCTCTCCCTGACACTTCGGGCGACGGGGTTACGTCTGGGAACGGTAATCGATCGCTGCCGACTGGTAAGCCGCACTGATTTCATGATCAGTGCCGGAATCAGGAAAAATAGCCCAACCGGGAATATTCATCCGGATGGATTGACAAAGACATTTGTAAAAGCAAGAAAAGCCTCCGGTGTTAACTTCAGCAATAATCCACCGACATTTCACGAGATCCGAAGTCTGGCCGGGCGGCTGTACAAAAACGAGCACGGCGAGGTGTTCGCCCAAAAACTCCTGGGCCACACATCAGCGAACACCACGAAACTCTATCTCGATGAGCGTGATGATAAAGCTTATATGATGCTCTAA
Protein sequences of DBSCAN-SWA_6 >NZ_CP011394|2677237:2693352|2685539_2685989_+|WP_000798708.1|DBSCAN-SWA MKNTALGKFIFIVGTALLLGGCSGMVMPPYATHGTSVGIIAPAGGYSEWHTDSRNHTTGDSHSQSQGNCTQSEDSQLSENSLTRTHQSNCNTRSQTHSSSTSKTRSSSVGFSVGGPVGASIGLIKQMESMNRAPANDMSSNEMFKNFGF >NZ_CP011394|2677237:2693352|2687802_2688030_-|WP_000784710.1|DBSCAN-SWA MKMTWFQHPVCTTEEADELVAGYRRRGVKVERYGEAEVLELESNNTPQRWTVEELKEIRIAALADLRALKKLEAA >NZ_CP011394|2677237:2693352|2682954_2683434_-|WP_001541990.1|lysis|DBSCAN-SWA MFVGLLLVSLIVAGRLANHYRNNAITYKEQRDTVTHRLTLANATITDMTKRQRDVAALDEKYTKELADAKAENDALRDDVAAGRRRLYVNATCPAVPTGKSTSTARMDNAASPRLADSAQRDYFALKERVKTMQKQLEGAQAYIRTQCHGNAGKTSNQW >NZ_CP011394|2677237:2693352|2688026_2688626_-|WP_000940753.1|DBSCAN-SWA MAHLQLVKQTSSGLLLPATPESGDFLRSVKIGEWIHADFKRVRNYAFHKRFFKLLQLGFDYWMPTGGTVTSREQKLISGFVNFLCDSAGQEYTPALNEAAEQYLHNVATLRTGDVALLKSFDAFREWVTVQAGFYTEHFYPDGSRGRRAKSIAFASMDETEFQQVYKAVLNVLWNWILFRKFSSLEEVENVAAHLLEFA >NZ_CP011394|2677237:2693352|2677237_2678101_-|WP_072100756.1|DBSCAN-SWA MDLTEDNASKLQQFSKEWQDANDKWSATWGVKIEQTKDGKYYVAGLGLSMEDTPDGKISQFLVAADRIAYINPANGNETPGFVMQGDQIIMNEAFLKYLSAPTITSGGNPPAFSLTPDGKLTAKNADISGHINAVSGSFTGEINATSGKFSGVIEAREFVGDICGSKVMQGVSIRATNDERSTSTRYTDSATYQIGKTITVMANCERNGGTGAITVTININGQVKTAEVMPYTAGIPAMYQTVVFSVYTTSPVVDISVSLRVRGQYTTSASVWPLVMVSRSGSNFTN >NZ_CP011394|2677237:2693352|2692272_2693352_+|WP_000087636.1|integrase|DBSCAN-SWA MSRKKYDANLPRNLTYRKASKSFFWRNPLTDKEFPLGQIARRDAITQAIEANNFIAQNHTPVALIEKLKGTDSFTVSAWIDRYEVLLQRRSLSVNTYKIRGNQLATVREKMGEIILAEVTTRHIAKFLESWITEGKNTMAGAMRSVLSDMFREAIVEGHIVKNPVEATRIPEIKVARERLQLETYNATRAAAEHMPAWFPLAMDLALVTGQRREDIVNMKFSDVFDNRLYVTQIKTGMKIAIPLSLTLRATGLRLGTVIDRCRLVSRTDFMISAGIRKNSPTGNIHPDGLTKTFVKARKASGVNFSNNPPTFHEIRSLAGRLYKNEHGEVFAQKLLGHTSANTTKLYLDERDDKAYMML >NZ_CP011394|2677237:2693352|2684492_2685179_+|WP_001574215.1|DBSCAN-SWA MPTGIKPIFINNMMSTYGLSHPHDSKVFPDLPEHQDNPSQLRLQHDGLATDDKARLEPMCLAEYLISGPGGMDPDIEIDDDTYDECREVLSRILEDAYTQSGTFRRLMNYAYDQELRDVEQRWLLGAGENFGTTVTDEDLESSEGRKVIALNLDDTDDDSIPEYYESNDGPQQFDTTRSFIHEVVHALTHLQDKEDSNPRGPVVEYTNIILKEMGHTSPPRIAYEFSN >NZ_CP011394|2677237:2693352|2683451_2683904_-|WP_000984586.1|DBSCAN-SWA MMRISEKGITLIKEFEGCSLTAYPDPGTGGDPWTIGYGWTHSVDGKPVKPGMMIDEATAERLLNTGLVGYENDVSRLVKVKLTQGQFDALVSFAYNLGARTLSSSTLLRKLNAGDYAGAADEFLRWNKAGGKVLNGLTRRREAERALFLS >NZ_CP011394|2677237:2693352|2680741_2681437_-|WP_001152416.1|tail|DBSCAN-SWA MQDIPQETLSETTKAEQSAKVDLWEFDLTAIGGERFFFCNEPNEKGEPLTWQGRQYEPYPIQVQDFEMNGKGASPRPNLVVANLFGLVTGMAEDLQSLVGASVVRHQVYSKFLDAVNFSNGNPGADPEQEAVARYNVEQLSELDSSTATIILASPAETDGSVVPGRTMLADSCPWDYRDENCGYDGPPVADEFDKPTSDPKKDKCSHCMKGCEMRNNLVNAGFFASINKLS >NZ_CP011394|2677237:2693352|2689626_2691606_-|WP_001237395.1|DBSCAN-SWA MRLKLIVKSFALAGLLSSTALTPLFAQEAPKGATASTKQANDALYNQLPFSDNTDFTNAHKGFIAGLPEEVIKGEQGNVIWNPQQYAFIKEGEKSPDTVNPSLWRQSQLINISGLFEVTDGVYQIRNLDLSNMTIIEGKEGITVVDPLVSAETAKAGMDLYFKNRGNKPVVAIIYTHSHVDHYGGVRGVVDEADVKSGKVKVYAPAGFMEAAVAENIMAGNVMSRRASYMYGNLLKPDASGQVGAGLGTTTSAGTVTLIAPTNIIDKDGQKEVIDGLTYDFMLAPGSEAPSEMLWFIEEKKLIEAAEDVTHTLHNTYSLRGAKIREPLPWSKYINEAIVRWGDKAEIIMAQHHWPTWGNENVVGLLKSQRDLYRYINDQTLRMANEGLTRDEIAANFKLPDSLAKTWANRGYYGSISHDVKATYVLYLGWFDGNPATLDELPPEEAAKKFVEYMGGADAILQKAKADFDQGNYRWVAQVVSKVVFADPNNQNARNLEADALEQLGYQAESGPWRNFYLTGAQELRNGVVKGPTPNTASPDTVRAMTPEMFFDFLAVHINGEKAGNARAVFNIDLGSDGGKYKLELENGVLNHTANAEAKDADATITLNRDTLNKIILKEETLKQAQDKGEVNVTGNAAKLDEMLGYMDKFEFWFNIVTP >NZ_CP011394|2677237:2693352|2686983_2687673_-|WP_001097218.1|DBSCAN-SWA MNTQYLEYVRQQLIVATADLSGATKGQLQAWLENAQLYTKNYPRKKQRIRDEVTGKMITLNNPPIAGKQSLAKGSAIPLVQPVEYSTSSWRRALLSLEEHNKAWLLWNYSENTCWEYQVTVTRWAWEKFSQQLEGKRVAKKTLARLRQLIWLAAQDVKAELARRETYEYQTLAELMGVAKSTWTETYMSHWLVMRNSFKRLDSDALISVTRSRSQQKATNLDISLAKPN >NZ_CP011394|2677237:2693352|2692019_2692298_+|WP_001575998.1|DBSCAN-SWA MTMSFVRLETWGELNYPDDPPPLTTLRRWARNGNIYPTPVLHGRTYRVDPDAFYIKPNKVGLVLEQHHPNGRTGKPSALLEKLISESKKVRC >NZ_CP011394|2677237:2693352|2688689_2688938_-|WP_000911593.1|DBSCAN-SWA MLKFVIYGDTVKNLNKTFTCKYAVIRRDDMTVIAEMDFFPDCNRSLMYRDGRYVRFLPLLQNDIMGSDTLINELTIRAGYHE >NZ_CP011394|2677237:2693352|2681526_2682060_+|WP_000877926.1|DBSCAN-SWA MKYTILSLVAGALISCSAMAENTLTVKMNDALSSGTGENIGEITVSETPYGLLFTPHLNGLTPGIHGFHVHTNPSCMPGMKDGKEVPALMAGGHLDPEKTGKHLGPYNDKGHLGDLPGLVVNADGTATYPLLAPRLKSLSELKGHSLMIHKGGDNYSDKPAPLGGGGARFACGVIEK >NZ_CP011394|2677237:2693352|2686362_2686887_-|WP_001574213.1|DBSCAN-SWA MFYRDLFQVFGPDPLYKEEEGIAILREQYGIEAPEQIFKQIYCGLSNNSEFQTLYGHLNLKSLKWDLVRLKTAEFTKFGRNATYPDYMLEISEDFNACGSKFCIDAREEVANHWLKFGTWAEPPMFIERSLIIPGESGLHLMEGHTRLGTLLGAIKYKFVQLADTHELYIASQK >NZ_CP011394|2677237:2693352|2683887_2684217_-|WP_001574216.1|holin|DBSCAN-SWA MNDKTPEFWAAVLTGLKNAWPQILGALMAGLIAYGRLIYDGATRKNKWLEGVLCGALSLCVTSALDVVGLPVSISPFVGGIIGFVGVDKLREIAISALKKRAGVNDENQ |
16 | Salmonella_phage(30.77%) | integrase,holin,lysis,tail | attL 2673867:2673896|attR 2693488:2693517 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_7 |
2895217 : 2906011
Sequences of DBSCAN-SWA_7
Nucleotide sequences of DBSCAN-SWA_7 >NZ_CP011394|2895217:2906011|DBSCAN-SWA GTCAGGCTAACGCTTTAGTTATCTTCTCGTACAGATCGCCGGAAAGATTCTCCAGTCCTTTTAACTGCTCCAGCGCCGCACGCATTTTCTCCTGACGCTTTTCATCGTAACGTTTCAGACGAATCAGCGGTTCAATGAGGCGAGATGCTACCTGCGGGTTACGGCTATTCAGATCGGTCAGCATCTCGACCAGGAACTGGTATCCGCTACCGTCTTGCGCATGGAACGCCGCCGGGTTGCTGCCAGCAAACGCGCCAATTAATGAACGGACGCGGTTCGGGTTGCTCATACTGAAAGAACGGTGTTTGAGCAGGCCGCGTACGGTTTCCAGTACATTTTCCGCCGGGCTTGTGGATTGCAGGATAAACCATTTATCCATCACCAGGCCGTCCTGATGCCACTTATCGTCATACTCCTGCATCAGCGTATCGCGGCACGGCAACTGCGCCGCCACCGCAGCAGACAGGGCCGCCAGCGCATCGGTCATATTATTGGCGTCGCGATACTGTTTGCTGACCAGCGTATTAGCCAGCTCCGTCTCGCCGAACGCCAGGAAGCGCAGGCAAGCATTGCGCAGCGTGCGCTTACCGATATCGCCGTGATCAACACGATACTCATCCAGATGATTGGCGTTATAGATAGCCAGGAACTCATCCGCCAGTTCTGCCGCCAGCGTACGCGTTAGCGCTTCACGAACTTGCGCAATGGCGATCGGGTCAATGACCTCAAACAGCTCCGCAATTTCATTGGCCGAAGGCAGCGTTAAAATTTCTGCGGCCAACGCCGGATCGATTTTCTCATCCAACAGTACTGCACGGAACGCATCAGCGACATGCACCGGAAGCGATAGCGGTTGCCCCTGCTGATGACGCGCCACATTCAGTTTAATGTATGTGGCCAGCAGGCTTTGCGCCGCATCCCAACGGGAGAAATCATTGCGCGCATGGCGCATCAGGAACGTCAACTGCTGATCGCTCCATTTATATTCCAGTTTCACCGGCGCTGAAAACTCGCACAGCAAGGCCGGAACAGGCTGGAAGTAAACATTATCGAAGGTAAATGTCTGCTCCGCCTGCGTGACGTTCAGCACGGCGTTGACCGGGTGACCGCCTTTTTGCAACGGAATGACGTTGCCTTCGTTATCGTACAGTTCGATGGCGAATGGAATATGCAGCGGCTGCTTCTCCGCCTGATCCGCCGTCGCCGGAGTGCGCTGGCTGATGGTCAACGTGTACTGCTCGGTTTCCGGATTATAATCATCTTTTACCGTTACAATCGGCGTGCCGGACTGACTGTACCAGCGGCGGAAATGGGACAAATCGACATTAGAAGCATCTTCCATCGCCTGTACGAAGTCATCACACGTCGCGGCGCTGCCGTCATGGCGCTCAAAATAAAGCTGCATCCCCTTCTGGAAATTTTCCTCACCCAGCAACGTGTGGATCATGCGAATGACTTCCGCGCCCTTTTCATAAACGGTGAGGGTGTAGAAGTTATTCATTTCGATTACTTTATCCGGGCGGATAGGATGCGCCATCGGGCTGGCGTCTTCCGCGAATTGTAAACCGCGCATGGTACGCACGTTACTGATGCGGTTCACCGCGCGTGACCCCAAATCAGAGCTAAACTCCTGATCGCGGAACACGGTTAGCCCCTCTTTAAGGCTCAACTGGAACCAGTCGCGGCAGGTGACGCGGTTGCCGGTCCAGTTGTGGAAATACTCATGGCCTATCACGCGCTCAATATCGAGATAATCTTTATCCGTCGCGGTATCGGTTCGCGCCAGCACGTATTTGGAGTTAAAGATATTGAGACCTTTATTCTCCATCGCGCCCATATTAAAGAAATCCACCGCGACAATCATATAGATGTCGAGGTCATATTCGAGCCCAAAACGCGCTTCATCCCATTTCATGGAATTTTTCAGCGAGGTCATTGCCCACGGCGCGCGATCCAGATTGCCACGGTCAACGTACAGTTCTAATGCGACGTCACGCCCGGAGCGGGTGGTAAAGGTATCGCGCAGCACGTCAAAATCACCGGCCACCAGCGCAAACAGATAACACGGTTTCGGGAACGGATCTTGCCACTGAACCCAGTGACGGCCATTCTCCAGCTCGCCCTGTGCAACACGGTTGCCATTGGAGAGCAGGAACGGATATTTGCTTTTATCGGCAATAATTTTGGTGGTAAATCGCGCCAGTACGTCCGGGCGGTCAAGATACCAGGTAATATGGCGGAAGCCCTCCGCTTCACACTGGGTACAGAGCGCATCGCCGGACTGGTACAATCCTTCCAGCGCCGTATTCGCCGCCGGACTTATCTCGTTGACAATGCGTAACGTAAAACGCTCTGGCAGGTCGCTGATGATAAGCGCGCCCTCTTCTTCCTTATATGCTGTCCACGGCGCATCGTTGACGTGGATAGATACCAGCGTTAAATCTTCCCCATCAAGGCGAAGAGGCGCATCAGGCGCGCTATGACGAACAGCCTGGCTTATTGCGGTGACCACGGTTTTTTCGGCATCGAGGTCAAAGGTCAAGTCAATATCAGTAATCTGGTAATCCGGCGCGCGATAGTCATGGCGGTATTTGGCTTGTGGCTGTTGTGTCATAAAAAACCTTTCGCATCTTTGTGTAGAGTGTCGACTCCAGTCTATTCCTGTTGCGCAAATCGCGCTACGCAGAATGTTCATCTTTTCAGGCACAAACGGCCTATTTGCTACATTTTTATAATATGTACTCATAGTTTTTAAAATCGATAAAGATCGTCCAGGAGCGCTTTAAACAAGGGATGAGCGAGATCGAGATGAAGCTTGATTAACGAACTTTAACGAACTTCACAGACCACTTTTGCCCCATCCATGCCCCACACATAATTTTAGTCAACGCCAGACCCCATCAGCCCTGCAGGGAAAATCGATAACAAACACGCCCGCAGTATGTTCCCCAAAATTTAAAAACAAGTGGTTCATGCCTTGACAGACAAAACCGCCAAAGCAAAAATACTGTATATACAAACAGTTATTAGCGAAACACGTATGTTCGTAGAACTGGTTTATGACAAGCGTAATGTTGAAGGACTCGAAGGGGCCAGCGAGATCATTCTGGCCGAACTGACGAAGCAGGTGCACCAGATTTTCCCTGATGCCGAAGTGAGGGTGAAGCCGATGCAGGCAAACTGCTTGAATAGTGATACCAACAAAAGCGATCGCGAAAATTTGAACAGATAGCTTATTAAAAATAGATTTATCTTAAACCACGTCATTTACATTTAGCCACCTCCCCAAAATCCGGATTCAGCTTAAGAAAAATGCGACAATACAATAAAAACATATCATATAAGCCCCCTCAACAAATGTAATTTTAAGGCCAACAAACACCTCTAACTTATTCACTTTCAATTAATTTCATAAATAATAATTAACAACAAAAGAATTGTATTAATATCCACACTGTAGTATATAATTACATTAACAAAATTACTATTCGGCGAGTATATTATGTTAAGACACATTCAAAATAGTTTAGGCAGCGTTTACAGAAGTAATACAGCAACTCCTCAGGGTCAGATTATTCACCATCGTAACTTTCAAAGCCAGTTTGATACCACAGGCAACACCCTCTACAATAATTGCTGGGTTTGCTCATTAAATGTTATCAAATCCAGAGATGGCAATAATTATAGTGCATTAGAGGACATCACTTCTGATAATCAAGCGTTTAATAATATATTAGAGGGTATTGATATAATAGAATGTGAGAATTTATTAAAAGAAATGAATGTGCAAAAAATACCTGAATCCTCTCTTTTTACAAACATTAAAGAAGCTTTACAGGCAGAAGTTTTCAATAGTACTGTAGAAGATGACTTTGAGAGTTTTATTTCTTACGAATTACAAAACCATGGACCACTGATGTTGATCAGGCCTTCACTTGGCTCGGAATGTCTACATGCAGAGTGCATTGTAGGCTATGATAGTGAAGTGAAAAAAGTATTAATTTATGATTCAATGAATACCTCACCTGAATGGCAATCAAATATTGATGTCTATGACAAGCTTACCTTAGCATTCAATGATAAATATAAAAATGAAGATTGCAGTATTTGTGGTCTTTACTATGACGGTGTTTATGAGCCAAAACCTTTACACTCCTCCTCCTGGAAAGACTGGTGTACCATTTTATGATAGTTAACCTTTACCAAGATAATTATTCAGGCTACCGCCAACATGGGGGGTCGGGGGTCGGAGGTTCAAATCCTCTCGTGCCGACCAAAATTCCCCTTAAAAACCAGCCTGTCAGGGCTGTTTTTTTTATGGCTCAATTTCCTACGGGGAAGCTATGGGGTGAAACTGGGGAATAAAGCCGTCGAGATTCGACGCAATTTGCGATTGATTCATCAGTTTGCTCACTGCTCAGTTTCGGAATTCATCAATACACAAATTTTCATTTCGAATTACTGTATAAATTCCCTGTAAATCATTACCGGAGTGCGCCACATTTTCCCCCTGCCCTATACTTTCAGTCTGACGACTGGAGGTTTCATATGTGTGGACGCTTTGCACAAGCACAGACCCGTGAAGAATATCTGGCATATCTGGCCGACGAAGCCGATCGTAATATTGCTTATGACCCTCAGCCTATAGGCCGGTATAACGTGGCGCCCGGGACTAAAGTCCTGCTATTGAGCGAACGCGACGAGCAATTACATCTCGACCCGGTGATTTGGGGTTACGCTCCCGGATGGTGGGATAAAGCTCCACTTATTAACGCCCGTGTCGCGACAGCGGCCTCCAGCAGAATGTTTAAGCCACTATGGCAGCATGGCCGGGCTATCTGTTTTGCCGATAGATGGTTCGAGTGGAAGAAGGAAGGCGACAAAAAACAGCCGTATTTCATTCACAGAAAGGACGGGAAGCCGATATTCATGGCTGCCATTGGCAGTACGCCGTTTGAGCGCGGTGATGAAGCAGAGGGATTCCTGATTGTTACCTCCGCAGCCGATAAAGGTCTGGTAGACATTCACGATCGTCGCCCGCTGGCACTGACACCGGAAACTGCTCGGGTATGGATGCGCCAGTTCCTGGAACCACATTCTAAGTCAATAACATACCGCGTCATACCTGCGCTCACACGTCCCATGATGCGAAAAGATACCAATCCATGCCAATAGTTAAAAACGGATGACTGTCCCGAATCCGTCCCACCTGCCCTATCCCACAAACCGGCGTTCAGATGCTCATTAAGAAAACCACCTCACCCTCATAACTCAGTAAGCGTCCCGTTTAGGACGTAGCGTAAGGATTATTTTACGGTTTCGAGGTTCCAGGGCAGCAGTTCGTGCACCCGGTTCGATGACCAGTCGCTGATTTTCCACAGCACGTCGCGTAACCATGCCTCGGACTCTACGCCGTTTAGTTTGCACGTACCCAGCAGGCTGTAGATGATCGCCGCTGCCTCGCCGCTCCTGTCTGAGCCGAAGAACAGATAGTTACGTCGGCCCAGCGCCACGCACCGTAAGGCGTTTTCACAGATGTTGTTGTCGATCTCCACCCGACCGTCGCTGCAGAGGACACGCTCAACGCATCACACTGCTTCAGCATGTAACCGAACTCCTTCGCCATCTCCGCATGCACCGACAACGTTTTCAACTGCGCCTGTATCCAGTCGTACAGCGACTGGCTTTTCTCTTTCCTGACCGACAGCCGTGTTTGCGCCGGGCTGTGGACTCTGTCATCTGTGATAGTGTCCATCAAAATTTAAGTGGACACTATCATCGCCGGATTGACAGGGTTCTGACAGACGTCCTCCACGGTGCGCTTACATTTTACCTATTAAGGAATATTTTTGCTTTTTAAAGGTATTAAACCATCTCGGTGATGTAACAAAAACTTTCCCTGCCATAGATTCTGATTCTAATTCTCGTGGTAATGCATCATAGGCATTAGCTGCTATACTTGAATTACTAAAATCAGTATAATAAATAAGTTTCCTTCTTGTTGCCATTCTATATTTACACACCCATTCATCTGCTGAAAGAATTAATGGGCCATTCAGATTACTCATACCTCTATTTTCAAACTGATGGGCTGAAACATCAAACACATAGTCTTTCCCTTCTTTATTTCCAACCACTGCAAAATGATTTGTTGGTATTTCCTCTGTTGGTTTATCCCAGATAAATATACCTCGATAACGAATATTATCGAACCCTTTTTCATTCATAAAATTGCTTACAGGAGTCATTAATGACTCACACTGCCCTACCGGATTCATTATTTTATTATTTATAATTGGATTCTGTTTCAATTCCTCCAGATAGGCCGCAGCATCAATATCACTGGTTAGGTTGTAAGTTATATCTGTGCGTTCCACTCCCGGTTCTGTTGCCATGGTTAAGTGATGTGTTTCACTGTACCCCTGGCAATTTACGGTATAGTTCCCGACGTCATCCAGGGTGACTGATAATATCTCCTGACTGTCTTCATCCAGAATACAGAAGTAGTTTTCCCCGTGCAGGCCGGAATGAATGTTTTCCTCCCATCCGTCATACGCGAGCGTCCTGAGCAGTTCAAATCTGCTGACCACATCCTCCCGCGTCGTTCCGGCCGGCGGGTGACAAATCGTCCAGATGCACTCCAGCGCTTCAGTCTGGTGCGTTGAGCAAAAAAATTCCTTCATTTTTTCCCAGGAACTCATTTCAGGGGGGGTATCAGACCAGGCAATACGATAAATGCGGCGGTTACTGATGATGGCGGGAAGACATCCGCTTCCAATATGAAAGGGCATAACAAAAAACCTTTATAAATTTACATATAGTATCTGTCCGACAGACATCATCTCTTCCTGGTCTTAATTTCACAATAAGGTTATCGGCGGATTCATGGTCGTCCTGCCATGGCGGGCTTCAGAAGGTGCAGAAGAAAAATCCGTTATGATGACCGGATGGCGGGACTGTCATTTTACAGCTAAAGTGTCGATTTTTTCAGGGTCGCTTTCCACGATGACCAGATCATCCGGCATCAGCGCCCGGGCGTTCAATTTTGGCGGGCAGAAGTCACCCGGCAGAAATTACTTAACGATGCAGATAATGCCATTAAGGACTGGCGCACAGAATTAACGTTGGGAATTATCAGTGATGAAAATAAAGCAGCTTTGATTCTGCCGATGAATTATATCAATGTTCTTAAATCGCTGGACTTAACAGGTGTTTCAGATGAGGCCACCTTCACAGCAATCAGGTGGCCTGCATTACCACAGTAACGCCTACTGGCTGGCTGGTCTTTCCGGCCAGTCAGGGGCGGCTGTATCCATCCGGTTTACCAGCACCCTATATTTTTTCCATTCGTCGAGCCGCGCTTTCTCATCATCTGTTGCGATTCCAAGATCAACTGCATCCTGCAATGGCGCGATTTTTTCAGATGCCATTTGCAAAAGACGGCTTTTGGTTTCTTCCGCCTGACGAAGCTGCGCTGCTTTTTCAGCCGCTTCGTCTTTTACCCAGACCTTAGCCTTACCATCCCATTTCTGGTATTCACCACCTGGTGAAACTGATGTGACATTTTCGGGCAACGGACCAGGAGCGGAGATATAAACCTGATTGCCGGTTGTTGTGTCGTAAACCATCTCGCCGCGGTGATCCTCATGCAAACTCCATGTCTGGGTTTCAGCGTCAAATATAGCAATATGACTGGCGGGAATATCAGGAGGGGCGATATCAGTACAGTTTGCCGGTAGTCCTGTGTGCGGCGGAATATACGCATCACCTGCACCAATAAATTCGTTAGTATCTGAACGCAGATTGAAAATTTTAATTGTCTGCGCCTGTTCGCTCATTTTAAAAGTCATTATGCCAGCCTCACTATGTAGTTAAATGCAATGTTTTTAACCGTGGTTTCCGCATTACCGTCTGCGTCCACAATAACGACGTGTCCGTGTGGACCTATATACATGGTGTGCTCATGTCCTCCGATATAAACTGTATGTGCATGGTCGCCAGCGGCCTGTGTCCATGCACCACCTCCTGGCTGAAATGAGGTGTGATTGGAATCTCCCCAGTATGAATTGATATAACCGCCGAACTGGTGAGTATGATTGCCCGTGGTATTGGTCGATTTCGTGCCGTAATCAAAGGATGAGGTAGATTTTGTCCCTAAGTCAGTAACCTGCGCCCGCGCGGTGTGCGAGTGCGATTTATTGCCGTCCATTTCTTGCGACAATACGGCACGTCCACTGATGGGCTTACCCTTTATTGTCCAGCCTCTCATGTCAGGGATAACGCCGGACGGATACGCTATAGCCAGTAACGGGTAAGCAGATTTATCGAAGGACTGCCCCTGCATCAGAGCGTAACCTGCCGGAGTAGCATCAGATGGCCATGCAATCGCCGCCCCTACTGGATGCGAATCCGGAGGTGGGTTTAGTGTGGTGTAAAGCATTGCCCATTCGGACCACTCAGCATCGGCGGTATCTCGATGGCTGCGAATATATGCGGGCGCTGGCGCACCATTTGTCCCGCTCCAGCCAATGAGGATTTCCCCATCACCGGTTCCGGTCAGACGTAAAATATTTCCGTATTGCGTCGGATAGCCATTGTTGTAAACCTCGCCCATTATCAGGCCACTATCGCTGCCTCTTGTCGTACCAGTCAGTGCCGGAAGCGCGCCGCGTGATGCCAGTCTGTTCGCTGCAACAGCCGTACCTGATGCAGGGAGCGCTCCGATATTTTGTACAAACAGCGGCTTATTCGGGATATCCGCACCACACTGGCTTTTAGCCATGTAGTTTTCATCACTTTCGGTTTTGCTGTAGACCTCAAGACTGGAGCGACCTTTGGCCTTATCCGGTACGTCGGACAGGTTCTGGTCCTTCTGCAAATACCGTGATCCCAAATAAATCTCCAGGGGATTAAGCACAGAAAAATAGGTTTTTGTATTATCCAGAACGCATAAGACAGGAATATCTTTAATAATATCATTGGCCGATAACTCTGCTTTATTCCCCTTGTATAGTGGGAATATGCCAAGCACACGTCCTCCCATCGTCAGTTGCAGAGTGCTGGCTCCGGTATTGTTTAGCGCCGGAATAACCACAAGTGGAGTGCGCAATGTCCAGTCAACTCCACCATTGACGAAATAAGTTGCTGGTAACTCCAGCGTCAGATTATTTTCTGTACCTCCGGCCACACCAGCGACATAATGCCCACTCTGGAGCTCTTCAATTTGTACAAACTGATTTTCAGATCCTCGCGTCGCAAAATTCGCTATAACGTCATTCAGTGACCATCCCTTCGCTGTTGTACCTTCCTGACCGCGAATAACCGTCAGCATGTCATTATTAACTGCTGTCAGATGGCATACCTCAAAAACTGTTTCTTTTGCGTCTGTCAGTGTAATTTTGGCGTAAGTTTTAAGAGGGTTTGAGCTGTTTGCATAATCGCTGGTCAGCAAATTAGCAAACATCGCTCCCACACCAGGCATCACCTGAATGGTCGTCTGGCTGGCGGTAATATCAGCCGCCAGTGAGGAGACGACATTATTTCCGAATCCAATAATCATTGCTCAACCACCGTTACCGAATAGGTATAAATAAAAGGGAGTTTCACCAGCGACTGGTCAATTGCATCTTTAAGAAAGTGTCCGACACCATCGCCATAGTCAGGAATGGAGACAAAAAAAATGCCCTTATCGGGCATTACACTAATATCAAAAGTGGACTGTACAGGTGGGTCTATTCCGTTAGCTCCATGTATAAAGCGTGCAAGCCGTCGTTTGAACCAGTTGATACAGAAGTGCGAACCATCGCCTTTATAAAAATTCCATGTCAGTATCCGTTTAAAATAGTCGTCCGGAACATATGACGCTGAGCCGGGAACATAATTTCTCAGTTTTGCATACGCGACATTATTGTACTCAATAGTGTTATACGCCCCACGAGCAATGGCATCCTCGGAGATTTGAAGCAAGGGGCGTGATTCCCCATAAATACCCGCCGCAATCCAGTCCAGCAACTCACCGGTAATCGCCGGGGAGGTCCAGCAAGGTAAATTCAGGTTGTTAAAGTAATCAAGATACCCCTGTGCCAGTTTGTTATAAGCATCAAAAAAGGCAACTATATCCGGATCGTCATTATATTGCGTATAGGGGTAGGCCGGAATAATGCTTTCAAGAAGAGCTGCCATATTGCTTAACCTGAATTTGTGAAGATGAAGTGGAAAAATAGGCGTAAGTATCACCATAAACCAGGCTGGAGTCGGTTGCAGGTGGGACAATTTTTCCGTTTATTCCAACCTGAATATCAATCATTGATACAAGGTTTGAAGATACAAGCCCCTTAACCTGATTAAGAAAAATATCCCGAATCAGGAAAATGTTTATTGGTTCACCCGTTGCAATTCCGTTAATGTAATCAGCAATGCTTTGCTGCACTGCTTTTTCAATCCCGGTTGGATCGATATAGCTGGTTGAGGCTGTATTCCAGGTGATTAAAAGCGTAACGTTTTGTGATGATGGCACTACAAACGGCACGTGATACGTATCCGGATACACAATGATCGGTATCGTTTTTTTATCCACCGCAGCGCCTGATGGATTCACTACATCATTCGTCAGTACGGAGATATCTGGCACGGCTTTATAGATAGCGTAAGCCACTTCATAAGGATCGCCGCCACCAGCAATCGCTACCCATGCCCCCAGCGATGCCTGTCGGTATGAGATCAGATTCTCCTGTACACCATAAACATTTTTCAGTTCAATCCGGTAACAGTCAGGCGTTCCCTGTACACCGTACAT
Protein sequences of DBSCAN-SWA_7 >NZ_CP011394|2895217:2906011|2903065_2904775_-|WP_000583382.1|tail|DBSCAN-SWA MIIGFGNNVVSSLAADITASQTTIQVMPGVGAMFANLLTSDYANSSNPLKTYAKITLTDAKETVFEVCHLTAVNNDMLTVIRGQEGTTAKGWSLNDVIANFATRGSENQFVQIEELQSGHYVAGVAGGTENNLTLELPATYFVNGGVDWTLRTPLVVIPALNNTGASTLQLTMGGRVLGIFPLYKGNKAELSANDIIKDIPVLCVLDNTKTYFSVLNPLEIYLGSRYLQKDQNLSDVPDKAKGRSSLEVYSKTESDENYMAKSQCGADIPNKPLFVQNIGALPASGTAVAANRLASRGALPALTGTTRGSDSGLIMGEVYNNGYPTQYGNILRLTGTGDGEILIGWSGTNGAPAPAYIRSHRDTADAEWSEWAMLYTTLNPPPDSHPVGAAIAWPSDATPAGYALMQGQSFDKSAYPLLAIAYPSGVIPDMRGWTIKGKPISGRAVLSQEMDGNKSHSHTARAQVTDLGTKSTSSFDYGTKSTNTTGNHTHQFGGYINSYWGDSNHTSFQPGGGAWTQAAGDHAHTVYIGGHEHTMYIGPHGHVVIVDADGNAETTVKNIAFNYIVRLA >NZ_CP011394|2895217:2906011|2898256_2898448_+|WP_000497441.1|DBSCAN-SWA MFVELVYDKRNVEGLEGASEIILAELTKQVHQIFPDAEVRVKPMQANCLNSDTNKSDRENLNR >NZ_CP011394|2895217:2906011|2899764_2900391_+|WP_000334547.1|DBSCAN-SWA MCGRFAQAQTREEYLAYLADEADRNIAYDPQPIGRYNVAPGTKVLLLSERDEQLHLDPVIWGYAPGWWDKAPLINARVATAASSRMFKPLWQHGRAICFADRWFEWKKEGDKKQPYFIHRKDGKPIFMAAIGSTPFERGDEAEGFLIVTSAADKGLVDIHDRRPLALTPETARVWMRQFLEPHSKSITYRVIPALTRPMMRKDTNPCQ >NZ_CP011394|2895217:2906011|2898718_2899405_+|WP_001525490.1|DBSCAN-SWA MLRHIQNSLGSVYRSNTATPQGQIIHHRNFQSQFDTTGNTLYNNCWVCSLNVIKSRDGNNYSALEDITSDNQAFNNILEGIDIIECENLLKEMNVQKIPESSLFTNIKEALQAEVFNSTVEDDFESFISYELQNHGPLMLIRPSLGSECLHAECIVGYDSEVKKVLIYDSMNTSPEWQSNIDVYDKLTLAFNDKYKNEDCSICGLYYDGVYEPKPLHSSSWKDWCTIL >NZ_CP011394|2895217:2906011|2902484_2903066_-|WP_000143167.1|tail|DBSCAN-SWA MTFKMSEQAQTIKIFNLRSDTNEFIGAGDAYIPPHTGLPANCTDIAPPDIPASHIAIFDAETQTWSLHEDHRGEMVYDTTTGNQVYISAPGPLPENVTSVSPGGEYQKWDGKAKVWVKDEAAEKAAQLRQAEETKSRLLQMASEKIAPLQDAVDLGIATDDEKARLDEWKKYRVLVNRMDTAAPDWPERPASQ >NZ_CP011394|2895217:2906011|2901038_2902007_-|WP_001674638.1|DBSCAN-SWA MPFHIGSGCLPAIISNRRIYRIAWSDTPPEMSSWEKMKEFFCSTHQTEALECIWTICHPPAGTTREDVVSRFELLRTLAYDGWEENIHSGLHGENYFCILDEDSQEILSVTLDDVGNYTVNCQGYSETHHLTMATEPGVERTDITYNLTSDIDAAAYLEELKQNPIINNKIMNPVGQCESLMTPVSNFMNEKGFDNIRYRGIFIWDKPTEEIPTNHFAVVGNKEGKDYVFDVSAHQFENRGMSNLNGPLILSADEWVCKYRMATRRKLIYYTDFSNSSIAANAYDALPRELESESMAGKVFVTSPRWFNTFKKQKYSLIGKM >NZ_CP011394|2895217:2906011|2895217_2897830_-|WP_000193790.1|DBSCAN-SWA MTQQPQAKYRHDYRAPDYQITDIDLTFDLDAEKTVVTAISQAVRHSAPDAPLRLDGEDLTLVSIHVNDAPWTAYKEEEGALIISDLPERFTLRIVNEISPAANTALEGLYQSGDALCTQCEAEGFRHITWYLDRPDVLARFTTKIIADKSKYPFLLSNGNRVAQGELENGRHWVQWQDPFPKPCYLFALVAGDFDVLRDTFTTRSGRDVALELYVDRGNLDRAPWAMTSLKNSMKWDEARFGLEYDLDIYMIVAVDFFNMGAMENKGLNIFNSKYVLARTDTATDKDYLDIERVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDLGSRAVNRISNVRTMRGLQFAEDASPMAHPIRPDKVIEMNNFYTLTVYEKGAEVIRMIHTLLGEENFQKGMQLYFERHDGSAATCDDFVQAMEDASNVDLSHFRRWYSQSGTPIVTVKDDYNPETEQYTLTISQRTPATADQAEKQPLHIPFAIELYDNEGNVIPLQKGGHPVNAVLNVTQAEQTFTFDNVYFQPVPALLCEFSAPVKLEYKWSDQQLTFLMRHARNDFSRWDAAQSLLATYIKLNVARHQQGQPLSLPVHVADAFRAVLLDEKIDPALAAEILTLPSANEIAELFEVIDPIAIAQVREALTRTLAAELADEFLAIYNANHLDEYRVDHGDIGKRTLRNACLRFLAFGETELANTLVSKQYRDANNMTDALAALSAAVAAQLPCRDTLMQEYDDKWHQDGLVMDKWFILQSTSPAENVLETVRGLLKHRSFSMSNPNRVRSLIGAFAGSNPAAFHAQDGSGYQFLVEMLTDLNSRNPQVASRLIEPLIRLKRYDEKRQEKMRAALEQLKGLENLSGDLYEKITKALA >NZ_CP011394|2895217:2906011|2904771_2905398_-|WP_000729406.1|DBSCAN-SWA MAALLESIIPAYPYTQYNDDPDIVAFFDAYNKLAQGYLDYFNNLNLPCWTSPAITGELLDWIAAGIYGESRPLLQISEDAIARGAYNTIEYNNVAYAKLRNYVPGSASYVPDDYFKRILTWNFYKGDGSHFCINWFKRRLARFIHGANGIDPPVQSTFDISVMPDKGIFFVSIPDYGDGVGHFLKDAIDQSLVKLPFIYTYSVTVVEQ >NZ_CP011394|2895217:2906011|2905381_2906011_-|WP_000274547.1|DBSCAN-SWA MYGVQGTPDCYRIELKNVYGVQENLISYRQASLGAWVAIAGGGDPYEVAYAIYKAVPDISVLTNDVVNPSGAAVDKKTIPIIVYPDTYHVPFVVPSSQNVTLLITWNTASTSYIDPTGIEKAVQQSIADYINGIATGEPINIFLIRDIFLNQVKGLVSSNLVSMIDIQVGINGKIVPPATDSSLVYGDTYAYFSTSSSQIQVKQYGSSS |
9 | Escherichia_phage(37.5%) | tail | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_8 |
2977721 : 2985034
Sequences of DBSCAN-SWA_8
Nucleotide sequences of DBSCAN-SWA_8 >NZ_CP011394|2977721:2985034|DBSCAN-SWA TTTGCATAAGTATCTCTCGGTAGTAAAAAAGCACCGAGTTCCTCTGTCTGATGCTGCTGTTGATTTGTTAAAAGATTTACCACGATTAAAAGATAACAATCATGTATTCCCTGCCCCTCGCGCTGAAACACTTTCTGATATGTCGTTATTGGCTGTATTGAAACGAATGGGATATATCGACTTAACGCAACATGGCTTCCGTTCTACTTTCCGTGAGTGGGCTGGTGAAGCAACGGATTATCAACGTGAGGTTATTGAACATGCGTTGGCGCACCAGTTGGCAGATAAGGCTGAAGCAGCGTATCAGCGTGGGACGTTATGGCCTAAACGGGTGGCGTTGATGGATGATTGGACGGGGTATAGCACTGCCAACAGCTAAGCTACCTGTACGAAAGCATTATCGTTGATAACAACGTAGAAAGTGTGATGCTAATAGCATTCGCTTTCGAAAATGTGATAAGTAATAATTTCATACTGAACTATTTCTTATATAATTATTATCATAATTTGCAAATTACATAACCCACTCAAGGAGAGGTTATGCCCGGACTGATAGGCTACTGGAAGCAACTTCCCACCAAAGATGAATATATTAAAAAACACAATATGAGCAAAATATCCTGCTACAGTTGTGGTCACGAGAAATTCAGCGATGTTGGTTTGATACAGGTATGGGATAATCACAGAAGAATTCTTTGTGCTAAGTGTAAGACTACTCTTTTCAGAGAAGAGGATTAGTTTTTTTGGCATTGGTAACAGCGGCTTCAGCATCCCTTTTCACGCAGCGGGTCGGGCTTTTTTTTCGCATTTGACCCGTCGATTACCGGATGATGACGCAATTTACAAGCGCCTTGTCCGCCTACCGCGAGCACAACGCCATCAGGCTAACTATTAGCCGGCGTAAAAAAACCGGGCGCTAAGGCCCGGTTTGTACGGCAGTGAAACGAAGATTAATGCGCGGCTTCCGGCTTGTGCTTTTGCGCACTCTGGAAGCCATACGTCAACGCATTTTTCTCTTTATCCAGCGCGACGGTGACCTGTCCGCCATCAACCAGCGATCCAAACAGCAACTCATTGGCCAGCGGTTTTTTCAGGTTATCCTGAATCACACGCGCCATCGGTCGTGCGCCCATCGCCCGGTCATAGCCCTTTTCCGCCAGCCAGTCGCGCGCTTCCTGACTGACTTCCAGAGAGACGCCTTTCTGATCCAACTGAGCCTGCAACTCGACGATAAACTTATCGACAACCTGATGAATCACCTCGCCAGACAGATGATCGAACCAAATAATGTTGTCGAGACGGTTACGGAACTCCGGCGTAAACACTTTCTTGATCTCGCCCATCGCATCGGTACTGTTGTCCTGATGAATAAGACCAATAGATTTACGTTCGGTTTCTCGCACGCCGGCGTTGGTGGTCATCACCAGCACCACGTTGCGGAAATCCGCCTTACGGCCATTGTTATCGGTCAGCGTACCGTTATCCATCACCTGCAGCAGCAGGTTAAAGACATCCGGGTGCGCTTTTTCGATCTCATCCAGCAACAGCACCGCATGAGGATGCTTAATCACCGCATCCGTCAGCAGCCCGCCCTGGTCGAAACCGACGTATCCCGGAGGCGCGCCGATCAAACGGCTCACCGTATGACGCTCCATATATTCGGACATATCGAAGCGCAACAGCTCAATACCCAGCGCTTTTGAAAGCTGTACCGTAACTTCAGTTTTCCCTACGCCAGTTGGCCCGGCGAACAAGAATGAGCCGACAGGTTTATGCTCATGGCCCAGACCGGCACGACTCATCTTAATAGCTTCGGTCAGCGCCTCAATCGCGTTATCCTGGCCGAAGACCAGCATTTTCAGACGATCGCCCAGGTTCTTCAGCGTATCGCGATCGCTCTGCGAGACGCTCTTTTCAGGAATTCGCGCAATTCGCGCCACTACGGACTCAATATCCGCCACGTTGACCGTTTTCTTACGTTTGCTCACCGGCATCAGACGCGCCCGAGCGCCCGCTTCGTCAATCACGTCAATGGCTTTATCCGGCAGATGGCGGTCATTGATATATTTTACCGCCAACTCGACCGCCGCACGCACCGCTTTCGCGGTATAACGCACGTCGTGGTGCGCTTCGTACTTAGGTTTCAAGCCGTTGATAATTTGCACCGTCTCTTCCACCGAAGGCTCGGTAATATCAATTTTCTGGAAACGGCGCGCTAATGCACGGTCTTTCTCAAAAATATTGCTGAATTCCTGATAGGTCGTTGAGCCGATCACCCGGATCTTGCCGCTGGAAAGCAGCGGTTTAATCAGATTTGCCGCATCCACCTGTCCGCCCGACGCCGCGCCAGCGCCGATAATGGTATGGATTTCATCGATAAACAGGATGCTGTTGGTATCCTGCTCAAGCTGTTTCAGCAACGCCTTAAACCGTTTTTCAAAATCGCCGCGGTATTTGGTGCCCGCCAGCAGCGAACCGATATCCAGAGAGTAAATGGTGCAATCGGCCATCACTTCCGGCACATCGCCCTGCACGATACGCCAGGCCAGCCCTTCGGCAATCGCCGTTTTGCCGACGCCGGATTCCCCTACCAGCAACGGGTTATTTTTACGGCGACGACACAAGACCTGGATCGCGCGTTCAAGTTCTTTTTCACGACCAATCAGCGGATCGATGCCGCCCACGCGAGCAAGTTGGTTAAGATTCGTCGTGAAGTTTTCCATACGTTCCTCCCCGCCAGCTTGTTCGTCGCCAGTTGGCTGATTGCCGAGATCGGAAGATTGGCTCGGTTCGTCTTTTCGCGTCCCGTGAGAAATAAAGTTCACGATATCCAGACGGCTCACTTCATGCTTGCGCAGCAGATAAGCCGCCTGTGATTCCTGTTCGCTAAAGATAGCCACCAGCACATTCGCGCCAGTCACTTCACTACGCCCGGAAGACTGAACATGGAAGACGGCACGCTGCAGGACACGCTGGAAACTTAACGTCGGCTGCGTATCACGCTCTTCTTCACTGGCAGGCAGTACGGGTGTGGTTTGTTCAATGAAGGCTTCGAGTTCCTGACGGAGCGCCACCAGATCCACGGAGCATGCTTCCAGCGCTTCGCGAGCCGATGGGTTGCTGAGCAGCGCCAGCAACAGATGCTCGACGGTCATAAACTCATGACGGTGCTCGCGCGCTCTGGCGAAAGCCATGTTTAAACTGAGTTCCAGTTCTTGATTGAGCATAGGCACCTCCCCCAATTTTTATACCTGCATTCAGGCTTTTTCCAGCGTACACAGCAACGGATGCTCGTTCTCCCTTGCATACTTGTTCACCATCGCCACTTTGGTTTCCGCCACCTCGGCGGTGAACACGCCGCAGATGGCTTTGCCTTGATAGTGAACTGCAAGCATCAATTGCGTTGCACGTTCTACATCATAAGAAAAGAATTTTTGTAACACGTCAATAACAAACTCCATCGGAGTGTAATCATCATTGACTAATATCACTTTATACATAGATGGCGGTTTTAGCGCGTCGCGCACGCTATCTTCCACCAACTGGTCAAAATCCAGCCAATCGTTCGTCTTACCCATTGTCAGTCGTCATTATCGGTTACGGTTGTCGGCAGGAAAATCTGCCGCTGACCAGAGTCTATGCACACAATCAATCTACCTCAATTGATAGATAACTAACATCTATCAGTACCATCCGCGACATCTGTCACATTCCCGGCAATAGCGTTAACTGCTTCAAATTTTTGATTCATTTTTACCCGATCCCCCCTGCCTGATGCTTGACGCCTCGCCTGATTTCTCTAAATTGTAATGTCGAGAGTTGGTGAGGTTTTGAACAGCCCCCACTCCGTCACCGGTTCATTCCATCTTACTTATATAAGATTTACGAAGGATGTCGAAGCATGGAAACGGGTACTGTAAAGTGGTTCAACAATGCCAAAGGGTTTGGTTTCATCTGCCCTGAAGGCGGCGGCGAGGATATTTTCGCCCATTATTCCACCATTCAAATGGATGGTTACAGAACGCTTAAAGCCGGACAGTCTGTCCGGTTTGATGTCCACCAGGGGCCAAAAGGCAATCACGCCAGCGTCATCGTGCCCATCGAAGCAGAGGCCGTTGCATAGCTCCTCTGTCTCATTGTGTACATCCAGGAGGCAAAATGCCAGCCCGATCGGCTGGCATTTTTATTTAACGCCAGTGCCTGATAGCGACACTGTTGCATCTTATCAGGCCGACAAATGACGTCAGCGAGATTACTCCCTTGCCAGCGCATCCACCGGGTCCAGTCGCGCCGCGTTTCTCGCCGGTAGCCAGCCAAACAGTATCCCGGTAAACGTCGAACATAAAAACGCGCTCGCCAGCGCAGTCAGTGAAAAACCGATCTCCCAGCCGGGCAGGAAAAGCTGTAGCATAAATGCGATGAACATCGACAAGCTAATCCCCAGCGCTCCCCCAACCAGGCAAACCAGCACCGCTTCAATAAGAAACTGCTGTAGCACATCGCTGGCGCGCGCGCCTACCGCCATACGGATGCCGATTTCACGCGTTCGCTCGGTGACGGAAACCAGCATAATATTCATAACGCCAATGCCGCCGACAACCAGCGAAATGACGGCCACCAGCGTCAGAAATAACTGAAGAGTATAGGTGGTTTTTTCAGCCGTTTTCAGAACGCTGTCCATATTCCAGGTGAAGAAGTCTTTTTTACCGTGGCGTAAGGTGAGCAGGCGGGTAAGCTGCTGTTCAGCCTGATCGCTATCAACGCCATCTTTCACACGAACGGTGATCGAGTTAAGCCATGACTGACCCATTATGCGATCTGACATCGTGCTATAGGGCAACCAAACTTGCAACAGATTGCTATTGCCGTACATGGACGGTTTCTCTTCCGCCACGCCAATAACAATAACCGGCATATTACCCACCAGCACCACTTCCCCTACGACATTCGCTTTATTTGGAAATAGCTGGCGTCGCGTGTTGGCATCCAGCACCACCACCTGCGCGCGATCCTGTTGCTGTACAGAATTGAAGGTGTTCCCCTCCCTAAAGGACATGCCGTAAACGTTAAAATAATCGCCACTGACGCCATTAGCATTTACGGCAATATCAATATTGCCATAGCGAAGACGTAAGCTCTTTGAAACGCTGGGCGTCGCAGAGTTAACCCACGGCTGTTTCTGAATAGCGACCAGATCGTCATATTTCAGCGCCTGTCGATACTGCGGATTGTCGTCGCCAAAATCTTTGCCTGGATGAATATCAATCGTGTTAGTGCCCATAGCGCGGATATCCGCCAGTACCATCTGTTTTGCGGCGTCGCCGACCACCACAATCGACACCACCGACGCAATACCGATAATAATTCCCAGCATGGTCAGTAAAGTACGCATTTTGTTAGCGGCCATCGCTAACCACGCCATTGACAGCGCTTCGCGAAAGCTGCTGGCAAATTGCCGCCAGCCGGGAGCCGTATTAACTACGGCAGCGTCAACGCCCTGTTCGCGTTTCTTTTCCTCCGCGGGCGGATTATGGACAATCTTGCCATCGTGAATTTCAATAATCCGCTCCGCCTGGGCGGCAATCAGCGGATCGTGCGTCACAATGATCACCGTATGTCCGCGATCGCGCAGTTGGCGCAAAATCGCCATCACCTCTTCGCCGGAATGGCTATCCAGCGCGCCGGTCGGCTCATCTGCCAGAATCACCTGTCCACCGTTCATCAGAGCGCGGGCAATACTGACACGCTGCTGCTGTCCGCCAGAAAGCTGTGAAGGTGGGTAATCGACGCGATCGCTTAATCCCAGCCGCAAAAGCAACTCTCTGGCGCGCGCCTGGCGTTTTTTGCGTTCAATGCCGGCGTAGACGGCGGGGATTTCAACATTTTGCGCTGCCGTTAAATGCGACAACAGATGGTAGCGCTGAAAGATAAAGCCAAAATGCTCACGCCGCAGCTGCGCCAGCGCGTCCGGGTCCAGCGTCGAGACGTCCCGCCCCGCCACCCGATAAGTGCCGCTGGTCGGTTTATCCAGGCACCCGAGGATATTCATCAGCGTTGATTTTCCAGAACCGGAAACGCCGACGATCGCCACCATCTCCCCGGCGTGGATTTGCAGGGAGATATCTTTCAACACCGCCACCTGCTCTTCTCCGGAGGGGTAGCTACGACTCACATTGCACAGTTCAAGCAATGCCGTCATGGCGTCGCTCCTGGCCTGCTTTCGCCGATGATCACCTCATCGCCCGCTTCCAGACCTTTAACCACTTCCACGTCTGTATCGTTACGCTCGCCAATGACCACTTCGCGCTCACGTTTTTCGCCGTTACGCAACAGCGCCACTTTATAACGATTGCCGCCCACCGGTTCGCCAAGCGCGGCGAGAGGAATAATCAGCACATTTTTGACATCCATGAGTTGAATATAAACCTGTGCGGTCATATCAAGACGCAAGATTCTTTTGGGATTCGGCACTTCAAACCGGGCGTAATAAAAAATAGCGTCGTTGATCTTTTCCGGCGTCGGCAGAATATCTTTTAAAACGCCTTCATAGCGCGTTTGCGGATCGCCTGCAATGGTGAACCATGCTTTCTGCCCCGCCCGAAGATGGATCACGTCCGCTTCCGAGACCTGCGCTTTTACCAGCATGGTGCTCATATCCGCCAGCGTCAGAATATTGGGCGCCTGCTGAGCTGCAATCACCGTTTGTCCTTGCAGGGTAGTGATTTGCGTCACTTCCCCCGCCATGGGGGCGACAATACGGGTATATTCCAGGTTGGTTTTCGCGGTGTCCAACGAGGCCCGATTACGTTTGATCTGGGCATCTATGGTGCCAATACGCGCCTGTTTAACCGCCATCTCCGTCGCCGCGGTATCCAGATCCTGTTGCGATACCGCCTGAGTCTTAGCTAACTGCTGCTGGCGCGCCAGCGTAACCCGCGCCAGCTTTAACTCAGCGGCTGCCTGCTGACGCTCCGCGTTCAGCTCCATCAAGGTGGCCTCGACCTCTTTTATCTGGTTCTCCGCCTGATCTGGGTCAATCACGCCGAGTAGCTGATCTTTTTTAACGTTATCGCCAATGGAGACCAGCAGCGTTTTCAACTGGCCGCTCACCTGCGCGCCGACATCCACTTTACGCAACGCGTCCAGTTTTCCAGTCGCCAGTACACTCTGTTCAAGATCGCCTGGCCGCACGATTAATGTCTGATAAGTTGGCAGCGGGGCATTTATCATTCGCCAGCCAGCCATCCCCCCCACTAAAAGAATTAAAATAATGACCAGATAACGCTTTTTAAATTTCTTTCCCTTAGCACGCAT
Protein sequences of DBSCAN-SWA_8 >NZ_CP011394|2977721:2985034|2977721_2978099_+|WP_001539594.1|integrase|DBSCAN-SWA MHKYLSVVKKHRVPLSDAAVDLLKDLPRLKDNNHVFPAPRAETLSDMSLLAVLKRMGYIDLTQHGFRSTFREWAGEATDYQREVIEHALAHQLADKAEAAYQRGTLWPKRVALMDDWTGYSTANS >NZ_CP011394|2977721:2985034|2978670_2980947_-|WP_000934064.1|protease|DBSCAN-SWA MLNQELELSLNMAFARAREHRHEFMTVEHLLLALLSNPSAREALEACSVDLVALRQELEAFIEQTTPVLPASEEERDTQPTLSFQRVLQRAVFHVQSSGRSEVTGANVLVAIFSEQESQAAYLLRKHEVSRLDIVNFISHGTRKDEPSQSSDLGNQPTGDEQAGGEERMENFTTNLNQLARVGGIDPLIGREKELERAIQVLCRRRKNNPLLVGESGVGKTAIAEGLAWRIVQGDVPEVMADCTIYSLDIGSLLAGTKYRGDFEKRFKALLKQLEQDTNSILFIDEIHTIIGAGAASGGQVDAANLIKPLLSSGKIRVIGSTTYQEFSNIFEKDRALARRFQKIDITEPSVEETVQIINGLKPKYEAHHDVRYTAKAVRAAVELAVKYINDRHLPDKAIDVIDEAGARARLMPVSKRKKTVNVADIESVVARIARIPEKSVSQSDRDTLKNLGDRLKMLVFGQDNAIEALTEAIKMSRAGLGHEHKPVGSFLFAGPTGVGKTEVTVQLSKALGIELLRFDMSEYMERHTVSRLIGAPPGYVGFDQGGLLTDAVIKHPHAVLLLDEIEKAHPDVFNLLLQVMDNGTLTDNNGRKADFRNVVLVMTTNAGVRETERKSIGLIHQDNSTDAMGEIKKVFTPEFRNRLDNIIWFDHLSGEVIHQVVDKFIVELQAQLDQKGVSLEVSQEARDWLAEKGYDRAMGARPMARVIQDNLKKPLANELLFGSLVDGGQVTVALDKEKNALTYGFQSAQKHKPEAAH >NZ_CP011394|2977721:2985034|2981621_2981843_+|WP_000447499.1|DBSCAN-SWA METGTVKWFNNAKGFGFICPEGGGEDIFAHYSTIQMDGYRTLKAGQSVRFDVHQGPKGNHASVIVPIEAEAVA >NZ_CP011394|2977721:2985034|2983915_2985034_-|WP_001201751.1|DBSCAN-SWA MRAKGKKFKKRYLVIILILLVGGMAGWRMINAPLPTYQTLIVRPGDLEQSVLATGKLDALRKVDVGAQVSGQLKTLLVSIGDNVKKDQLLGVIDPDQAENQIKEVEATLMELNAERQQAAAELKLARVTLARQQQLAKTQAVSQQDLDTAATEMAVKQARIGTIDAQIKRNRASLDTAKTNLEYTRIVAPMAGEVTQITTLQGQTVIAAQQAPNILTLADMSTMLVKAQVSEADVIHLRAGQKAWFTIAGDPQTRYEGVLKDILPTPEKINDAIFYYARFEVPNPKRILRLDMTAQVYIQLMDVKNVLIIPLAALGEPVGGNRYKVALLRNGEKREREVVIGERNDTDVEVVKGLEAGDEVIIGESRPGATP >NZ_CP011394|2977721:2985034|2981972_2983919_-|WP_000125875.1|DBSCAN-SWA MTALLELCNVSRSYPSGEEQVAVLKDISLQIHAGEMVAIVGVSGSGKSTLMNILGCLDKPTSGTYRVAGRDVSTLDPDALAQLRREHFGFIFQRYHLLSHLTAAQNVEIPAVYAGIERKKRQARARELLLRLGLSDRVDYPPSQLSGGQQQRVSIARALMNGGQVILADEPTGALDSHSGEEVMAILRQLRDRGHTVIIVTHDPLIAAQAERIIEIHDGKIVHNPPAEEKKREQGVDAAVVNTAPGWRQFASSFREALSMAWLAMAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQMVLADIRAMGTNTIDIHPGKDFGDDNPQYRQALKYDDLVAIQKQPWVNSATPSVSKSLRLRYGNIDIAVNANGVSGDYFNVYGMSFREGNTFNSVQQQDRAQVVVLDANTRRQLFPNKANVVGEVVLVGNMPVIVIGVAEEKPSMYGNSNLLQVWLPYSTMSDRIMGQSWLNSITVRVKDGVDSDQAEQQLTRLLTLRHGKKDFFTWNMDSVLKTAEKTTYTLQLFLTLVAVISLVVGGIGVMNIMLVSVTERTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGALGISLSMFIAFMLQLFLPGWEIGFSLTALASAFLCSTFTGILFGWLPARNAARLDPVDALARE >NZ_CP011394|2977721:2985034|2978260_2978458_+|WP_001117984.1|DBSCAN-SWA MPGLIGYWKQLPTKDEYIKKHNMSKISCYSCGHEKFSDVGLIQVWDNHRRILCAKCKTTLFREED >NZ_CP011394|2977721:2985034|2980977_2981298_-|WP_000520789.1|protease|DBSCAN-SWA MGKTNDWLDFDQLVEDSVRDALKPPSMYKVILVNDDYTPMEFVIDVLQKFFSYDVERATQLMLAVHYQGKAICGVFTAEVAETKVAMVNKYARENEHPLLCTLEKA |
7 | Ralstonia_phage(16.67%) | integrase,protease | attL 2972518:2972532|attR 2983770:2983784 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|