Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP016592 | Ketogulonicigenium vulgare strain SKV chromosome, complete genome | 2 crisprs | csa3,WYL,DEDDh,cas5,cas8c,cas7,cas4,cas1,cas2 | 0 | 5 | 5 | 0 |
NZ_CP016593 | Ketogulonicigenium vulgare strain SKV plasmid pKvSKV1, complete sequence | 0 crisprs | NA | 0 | 0 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP016592_1 | 1851317-1852010 | TypeI |
I-C
Consensus repeat of NZ_CP016592_1
|
10 spacers
spacers of NZ_CP016592_1
>1.1|1851349|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT CCCGATCCAACCGCGAAGTGGGCCAGCCGCGATG >1.2|1851415|35|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT ACTGTCACTCCGACTTGGAGAGCATCTACAATGAC >1.3|1851482|33|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT AGTTGGCGTGCGGGCGTCCACATCTGAATGATG >1.4|1851547|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT TCATTGTAGATGGTGGTATCGAAACGCGACACCT >1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT TCCCATAAAAAAACCCGCCTCTAAGGGCGGGCTG >1.6|1851679|33|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT TGCTTCCGGTAAAACTCCACCGCAGCGTGGAAC >1.7|1851744|41|NZ_CP016592|PILER-CR GATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGCGGTCGCT >1.8|1851817|36|NZ_CP016592|PILER-CR AGCGACCAATGGCGCGTAGCGACACCCTATCAGGAC >1.9|1851885|35|NZ_CP016592|PILER-CR TACGCTATGACCCAGATCCTCCGGGGGCGCGCAAC >1.10|1851952|34|NZ_CP016592|PILER-CR CCCAGCAGGCGCGGCGCAGACGACAGGCGGGACG >1.11|1851744|34|NZ_CP016592|CRISPRCasFinder GATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGC >1.12|1851810|36|NZ_CP016592|CRISPRCasFinder AGCGACCAATGGCGCGTAGCGACACCCTATCAGGAC >1.13|1851878|35|NZ_CP016592|CRISPRCasFinder,CRT TACGCTATGACCCAGATCCTCCGGGGGCGCGCAAC >1.14|1851945|34|NZ_CP016592|CRISPRCasFinder,CRT CCCAGCAGGCGCGGCGCAGACGACAGGCGGGACG >1.15|1851744|35|NZ_CP016592|CRT GATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGCG >1.16|1851811|35|NZ_CP016592|CRT GCGACCAATGGCGCGTAGCGACACCCTATCAGGAC |
cas2,cas1,cas4,cas7,cas8c,cas5 |
CRISPR arrays and Neighbor proteins around NZ_CP016592_1
The CRISPR arrays of NZ_CP016592_1 >merge|NZ_CP016592|1|1851317-1852010|PILER-CR,CRISPRCasFinder,CRT GTCGCTCCCCCCACGGGAGCGTGGATAGAAACCCCGATCCAACCGCGAAGTGGGCCAGCCGCGATGGTCGCTCCCCCCACGGGAGCGTGGATAGAAACACTGTCACTCCGACTTGGAGAGCATCTACAATGACGTCGCTCCCCCCACGGGAGCGTGGATAGAACCAGTTGGCGTGCGGGCGTCCACATCTGAATGATGGTCGCTCCCCCCACGGGAGCGTGGATAGAAACTCATTGTAGATGGTGGTATCGAAACGCGACACCTGTCGCTCCCCCCACGGGAGCGTGGATAGAAACTCCCATAAAAAAACCCGCCTCTAAGGGCGGGCTGGTCGCTCCCCCCACGGGAGCGTGGATAGAAACTGCTTCCGGTAAAACTCCACCGCAGCGTGGAACGTCGCTCCCCCCACGGGAGCGTGGATAGAAACGATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGCGGTCGCTCCCCCACGGGAGCGTGGATAGAAACAGCGACCAATGGCGCGTAGCGACACCCTATCAGGACGTCGCTCCCCCCACGGGAGCGTGGATAGAAACTACGCTATGACCCAGATCCTCCGGGGGCGCGCAACGTCGCTCCCCCCACGGGAGCGTGGATAGAAACCCCAGCAGGCGCGGCGCAGACGACAGGCGGGACGGTCGCTCCCCCCACGGGAGCGGGGATAGAAAG >NZ_CP016592|1|1|1851317-1852010|PILER-CR GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC CCCGATCCAACCGCGAAGTGGGCCAGCCGCGATG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC ACTGTCACTCCGACTTGGAGAGCATCTACAATGAC GTCGCTCCCCCCACGGGAGCGTGGATAGAACC AGTTGGCGTGCGGGCGTCCACATCTGAATGATG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TCATTGTAGATGGTGGTATCGAAACGCGACACCT GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TCCCATAAAAAAACCCGCCTCTAAGGGCGGGCTG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TGCTTCCGGTAAAACTCCACCGCAGCGTGGAAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC GATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGCGGTCGCT CCCCCACGGGAGCGTGGATAGAAACAGCGACC AATGGCGCGTAGCGACACCCTATCAGGACGTCGCTC CCCCCACGGGAGCGTGGATAGAAACTACGCTA TGACCCAGATCCTCCGGGGGCGCGCAACGTCGCTC CCCCCACGGGAGCGTGGATAGAAACCCCAGCA GGCGCGGCGCAGACGACAGGCGGGACGGTCGCTC CCCCCACGGGAGCGGGGATAGAAAG >NZ_CP016592|1|1|1851317-1852010|CRISPRCasFinder GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC CCCGATCCAACCGCGAAGTGGGCCAGCCGCGATG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC ACTGTCACTCCGACTTGGAGAGCATCTACAATGAC GTCGCTCCCCCCACGGGAGCGTGGATAGAACC AGTTGGCGTGCGGGCGTCCACATCTGAATGATG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TCATTGTAGATGGTGGTATCGAAACGCGACACCT GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TCCCATAAAAAAACCCGCCTCTAAGGGCGGGCTG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC 
TGCTTCCGGTAAAACTCCACCGCAGCGTGGAAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC GATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGC GGTCGCTCCCCCACGGGAGCGTGGATAGAAAC AGCGACCAATGGCGCGTAGCGACACCCTATCAGGAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TACGCTATGACCCAGATCCTCCGGGGGCGCGCAAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC CCCAGCAGGCGCGGCGCAGACGACAGGCGGGACG GTCGCTCCCCCCACGGGAGCGGGGATAGAAAG >NZ_CP016592|1|1|1851317-1852010|CRT GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC CCCGATCCAACCGCGAAGTGGGCCAGCCGCGATG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC ACTGTCACTCCGACTTGGAGAGCATCTACAATGAC GTCGCTCCCCCCACGGGAGCGTGGATAGAACC AGTTGGCGTGCGGGCGTCCACATCTGAATGATG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TCATTGTAGATGGTGGTATCGAAACGCGACACCT GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TCCCATAAAAAAACCCGCCTCTAAGGGCGGGCTG GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TGCTTCCGGTAAAACTCCACCGCAGCGTGGAAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC GATGCCGTCCTCGCGTCCCTCCAATCCAGCGCGCG GTCGCTCCCCCACGGGAGCGTGGATAGAAACA GCGACCAATGGCGCGTAGCGACACCCTATCAGGAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC TACGCTATGACCCAGATCCTCCGGGGGCGCGCAAC GTCGCTCCCCCCACGGGAGCGTGGATAGAAAC CCCAGCAGGCGCGGCGCAGACGACAGGCGGGACG GTCGCTCCCCCCACGGGAGCGGGGATAGAAAG
>NZ_CP016592.1|WP_013384683.1|1850849_1851140_+|CRISPR-associated-endonuclease-Cas2 MLVLVTYDVNTLSDGGKKRLRQVARACEDWGQRVQFSVFEIELDPAQWTKLRARLESIIDAKTDSLRYYFLGTNWERRIEHVGAKPAKDLNGPLII >NZ_CP016592.1|WP_013384682.1|1849810_1850845_+|type-I-C-CRISPR-associated-endonuclease-Cas1 MKKLLNTVYVTTEGAALKKDGENLVAEIEGSEKARVPLHMVASVVTFGPIFVSPALIGTCAERGITIALMDRIGRFQARIEGPVSGNVLLRRAQYRTADDAVDVVRSIVLGKLANQRAVIRRGLRDYGDEMAAPVRDALERASDRIEMILRRVQVKDDSIDLLRGAEGEAATLYFGVFNHLIRSPDATLHWTGRSRRPPLDPMNALLSFLYTLLTHDCRSACEAVGLDPAVGFLHRDRPGRPSLALDLMEELRAPLADRLALSLVNRRQLRAGDFRQMDNGAVLLTDEARKTVLTAWQERKKEERLHPFLNEKAPFGLVPYLQAQMLARHLRGDIEAYPPWFWS >NZ_CP016592.1|WP_013384681.1|1849169_1849814_+|CRISPR-associated-protein-Cas4 MGAEEDAIPLSALQHAVYCLRQAALIHLERLWVANRFTAEGDVLHAVADKGGARRARGVRRVMSLPLASARLNLIGTADLVEFIPGPAGEVAFPVEYKRGKPKLHRADEVQLCAQALCLEEMTGQPVPEGALFYAHTKRRVTVPFDTELRALTQNAAQSLADILASRITPAPTAHKSRCRACSLHEACRPETYARPVLAWRDQMLARSLKDISE >NZ_CP016592.1|WP_013384680.1|1848216_1849167_+|type-I-C-CRISPR-associated-protein-Cas7/Csd2 MTTLSRRHDFVLIFDVTNGNPNGDPDAGNQPRLDPETGHGLVSDVSLKRKIRNYVELAHEGKDGHHIYVQEGAILNEKHRAAYIAKRPGDEKAKTDKKLNPKDDAEAKELRDWMCANFFDVRTFGAVMSTGINCGQVKGPVQMTFATSVERILPAEITITRMAATNEAEKKKAEEGSDGDQRTENRTMGRKHIVPYGLYVAHGFVSAKFAERTGFSEADLDLLLEALKNAFEHDRSAARGEMATRKLIVFKHENALGNAPAHELFDRVRIGRNLAGEFRPIGDSRLDNQPPARKFGDYLIEIDREALPAGVDIIEL >NZ_CP016592.1|WP_013384679.1|1846450_1848220_+|type-I-C-CRISPR-associated-protein-Cas8c/Csd1 
MSVLSSLSRAYERLPDAPPYGFSSEKIGFCIVLNADGSVHDVIDKRQDDKKRSPAMLLVPQAVKRTAGIAPNFLWDKTAYVLGVTAGEGKRTADEHAKFREMHVQWLAGTQDEGLLALLRFLDAWTADRFTAPVWPEDMRDQNIVFALASEYRERYIHERSGAKEIWQRLGTEGASDPQICLVSGDPAPIARLHPSIKGVWGAQTAGASLVSFNLDAFTSYGHEQGDNAPVSEVATFQYTTALNRFLEKDSGHRLQIGDASTVFWADASGLISEMAESLFAGMFDTPEAAHDDDSIETKKIAAKLERIRRGERLDEVEPQLTQGVRFHVLSLAPNAARLSVRFYWENDFAQLTRNYQAYLEDTKIEPPPRDGWPPLWRYLVELAVLNKRENIPPNLAGEWTRAILTRTPYPMTLLATALMRIRSDGEVNALRAAVLKSVLVRNFNKEAPVALNPTFSDSKGYLLGRLFAAYEEVQREALGRNVNATIKDKFYGAASASPQKVFATLDSGAQNHFTKLRKINAGRAVNLDQLIMSIMDQMSPDKDPIPAFLNAPEQGLFGLGYYHQRSDFFRKRDTKTDSVTETQPETTA >NZ_CP016592.1|WP_013384678.1|1845689_1846454_+|type-I-C-CRISPR-associated-protein-Cas5 MTYGIRLHIWGTHGLFTRPELKVERASYDVITPSAARGILEAIHWKPAIRWIVDRIHVLEPIRFQSIRRNEVGHKAPAGKIRAAMNRGDLADLQILVDQDRQQRASNVLVAPAYVIEAHFDLTAKAGPDDTVGKHLDIFNRRAAKGQCFNQPSLGTREFVAHFALVPPDAVIPSPDGHSWGAMLHAGQTNGAPRPSRAPEAETSDLGFGTPRDLGFMLWDIDHMAPGRPSMFFRATLRDGIVEIPAPGSPDIKR >NZ_CP016592.1|WP_014537824.1|1840031_1840883_+|LysR-family-transcriptional-regulator MAAAARQLNIAQPALSGHIAQIEEHYALQLFQRHARGVTLTPAGEALLRHARRILENLSEAEAELRHLSPVTTRAPVRLGLLPSWGASLAPAIIQATQLALPDIALRIVEMRHDESLDAIRQQNIDLAVVLEDTAPAQTQLLGSEALLYVSHVAVADRMSFRDVAALPLILPSAANLLRHQLDKAAKAAHVVLGPVMEIDGQDTIKSAVKAGVAGSIMSWNSIRNECLDHSLSACLIDDPEITRNVYLRRGEHVPANLAEAFFVVLRQVAEDNSYSRLRHART >NZ_CP016592.1|WP_013384674.1|1839237_1839906_+|4-carboxy-4-hydroxy-2-oxoadipate-aldolase/oxaloacetate-decarboxylase MMAHVRRSFTRPSPEAIDAIRPYSPATVHEAQGKLGALDSRIKPLRHGWQICGSAITAQCHIGDNLMIFEAINLAKPGDVLVLSAGNNPEQGGFGDVLAAACRGKGIAGLIIDAGVRDGRGLRAGDFPVFSLGLCVKGTSKDTLGTVNMPVMVGNQLITPGDIIVADDDGVVVVRQEPFALAKACEAREASEAKLIEMHLSGRMEIEDRYDMMRAKGCVWED >NZ_CP016592.1|WP_013384673.1|1837543_1839217_+|iron-ABC-transporter-permease 
MNALQKPKISAYSVVAFFCAALVVVLIAYPLVRMLWNTVLGGNLQQGLSVVMQPWFGQVFLNTAIVVGISTAFAVAIGGSLAWINQRTDAGMGAVGGLMPIIPLLIPNVAMSIGWVFIAAPRVGFLNGWLATLPDWMSFQVNIYSWAGMIFIYTINGVPYVYLIAAAAFRNMDPALEEASRINGAGIWRTFWKVSLPSVKPALVSSALLLTITSIGIYSIPAVIGTTAKIDVLTTRIVYLLNREFPPRMPEAQMIGVIILILVGVLWWVQGRWAGKGQFATVGGRATGGGKLEMGRWKWPARTLTLIYLACTSILPLAALAVVALQPFWSPRINPAIFSLNNFNQALFVNGMAVSSIRNSLFYSSIGALIAMGIAVVIAIYTAERKNLFGKGLDTAIKSSAAIPNLILGVGFLIAFAGAPFYLSGTAMILILAFVVMYISPGSIAATSAITQIGRDLREAARINGASEGRMVRRVILPLAMPGFIGGWAIVFVHMMGDLSAAALLAGITNPVVGFAILSIWEAGTFGVLAAFSMMLCLINIFVIAAMFGLGRLIAKR >NZ_CP016592.1|WP_014537823.1|1836407_1837547_+|ABC-transporter-ATP-binding-protein MTSHCAPTPPRIEVVDLVKSYRRENGTVITPVDHIDLIVAQNELVVLLGPSGCGKTTLLRCVAGLERPDSGEIIVDGKVVFSSRKGIYEPPDRRALAMVFQSYALWPHMTVAQNVAFPLQSQKVATPEINERVSKALSMVGVGGLERQFPGRISGGQQQRVALARALVTNSSVVLFDEPLSNVDAQVRAQLRFELQSMQRRLQFSGLYVTHDQAEAMELGQRVAVLDNGKISAIGAPWDVYDRPTNEYVARFIGVANMWRGTVTCTGTDVCVETPLGPIRVHGGGASHAVGTALTVVARPEKLTLTAERPTGDTNAIPVMVEAVMFSGAHTEVVCRAGDGSAVTVWTGAHEATSQLARMGTAWLCALPADLRLVPTGAA >NZ_CP016592.1|WP_013384684.1|1852136_1853588_-|aldehyde-dehydrogenase MDLLGLLINNETLPASGGKTFTRKNPISGEVATEAAAATTDDAQRAADAAAAAFPEWSRSSPKTRRTVLLKAADMLEANGPQFVAAMGAEIGATAGWAMFNVTLAADMLREAASLVTQIKGEIIPSNRPGSTAMAVRQPAGVVLAMAPWNAPVILGVRALATPLACGNTVVMKTSELCPRTHHLIVSSLLQAGLPAGVLNAVSNAPEDAAEIVEALIAHPAVRRVNFTGSTRVGRIIAEKAGRYLKPALLELGGKAPFVVLDDADLDEAVAAAAFGAYMNQGQICMSTERIVVMESVADAFVEKLAVKARTLIAGDPREGKTPLGSVVDVSAAQRIEQLIKDATSKGAVLAAGGRIDGTLMDAALLDHVSPAMRIYGEESFGPVVTVVRVGSIDEAVRVANDTEYGLSSAVFGGDVNRALAVARRIESGICHVNGPTVHDEAQMPFGGTKASGYGRFGGNWGIHEFTELRWITVQDGHIHYPI >NZ_CP016592.1|WP_014537825.1|1853615_1854338_-|ABC-transporter-ATP-binding-protein MLMLKIANLSLRYGRHLALQCVNVAVARGETVVILGANGAGKSSALKAVGGIVRPDAGSVVTLDDVPLLGAPAHQIVDRGLALVPEGRGVFADLTVAENLLLGANPKRARAGEGARRDFVYTLFPRLAERRRQTVRTMSGGEQQMVAIGRALMSNPDYLLLDEPSLGLAPIVVAELFAALRRVKETGVAILLVEQNVALSLSLADRGYLMEAGRIVGEGTADTLRNDPAVQNAFLGGSAA >NZ_CP016592.1|WP_013384686.1|1854324_1855053_-|ABC-transporter-ATP-binding-protein 
MTVLLQVDGLKKQFGGLMAVNDLSFTVAEGEILALLGPNGSGKTTVMNLISGALPATAGRIQLDGVQISGLPAHRIARLGVARTFQLVRILPSLTVAENVIAALAFRAQPLWGDDAARAAEALLAEVGLAGRGGEYAADLTYIDQKRMELARALGAAPKLLLLDEWLSGLNPTELRVGIALILSLKARGMTIMLVEHIMEAVRALCPRTVVMNAGRKIADGPTNDVLADPAVVAAYLGGVDA >NZ_CP016592.1|WP_013384687.1|1855049_1855976_-|branched-chain-amino-acid-ABC-transporter-permease MSARTLTGLGLAALALVMLAWLPSQLDAYGVGLLLGMTGYVTLATAWALFSGPTRYISLATVAFFGIGAYTVAVLSEAMPYPMVLITAALVGGAVALVVGLATLRLAGVYFVIFSFGLAELVRQLVTWYEVNVTGTLGRYIFLPITAQQIYWQLLALCALTFLIGWLIARSRLGLALRVIGDDEAVAAHTGINIAGAKLALFVISATLITLVGAIQAPRWTYVEPAMVFNPTTSFLTVIMALLGGAHRLWGPILGAIPLFLLFEWLSANFPNHYAIILGLLFITIVFLVPKGVLALVESAFARRRRLQ >NZ_CP016592.1|WP_013384689.1|1856945_1858169_-|ABC-transporter-substrate-binding-protein MIMFSRRLALRSFGATIAVAAAMTGFSASAQQTSIKIGYAVSLTGGNAGGAGITTLPNYRLWVSEVNAAGGLELPDGTRLPIEVVEYDDRSSTEEVVRAIERLATQDQVDFILSPWGTGFNLAVAPLLDRFGYPQLASASVTDRADEFAQRWPRSFWLLGGGADYAGGLADVLATATASGVMNGDVAIISVADGFGIDLINAARPAFAAAGLNIVMDRTYPPGTTDFSPMLNEAKSSSATAFVAFSYPPDTFALTQQAQVADYNPAVFYLGVGAPFPTYLGANGANAEGVMSLGGIDTSNAAMMDYRARHEAFAGQPPDSWASMITYAGLEILQEAIKRAGLDRDAVSAEIASGSFTTILGETQLQNNQLRDLWLTGQWQDGTFVAIEPTDRPGASAAIVPKQPWAN >NZ_CP016592.1|WP_013384690.1|1858335_1859214_+|helix-turn-helix-domain-containing-protein MTLRPTHSHLDMLQEVAAQVIATTLDTAAWRMLGRRNRIFVLSAGTGSITYKGESHLLAAPGLVWVPAGAPAQLSLDAGSKGAWLAISDRAILQVDLAGNIAEDMRRFAQRPQFGRKISREMAARLIGLMALMAEELQRSEAGMQEMIRHHLSILAILLWRGSDLRPIAARPAPRVIFSEFLRLVDQHMRAHWRVSDYARYLGVSIDRLTSTVQRDTGQPPLAIIHTRLHAEACQMLETSAMQIAEISASLGFPDPAYFSRFFKRISGYSPRDYRNGLYREVTSGQAWAAWP >NZ_CP016592.1|WP_014537826.1|1859371_1860166_-|MBL-fold-metallo-hydrolase MANTTMTKKPVQITQVRNATLVVEYANTKFLIDPLFAAQGAFPGFAGSASSQLANPLVPLPIAQEQLIAVDAVIVTHLHEDHWDAAASAALPKDMPLFAQNEEDADKIRNEGFTDVRVLTQQSQFNGIGLCKTGGQHGTDATLDVIPLGEVCGVVFSHPQYATLYIAGDTIWNDHVQTAIDSHQPDAIVLNIGNAVFMGYDPIIMGLEDAVAVHRAAPNAILIASHMEAINHCILSRQTLYDYAKANGFDTRLLIPADGETVSV >NZ_CP016592.1|WP_014537827.1|1860226_1860412_+|hypothetical-protein 
MSSRQSGGFVTIHVGHNLFAASLFEARHDLIDKNRRGRWRNRQTTGRSRHNGDHHVRKRLR >NZ_CP016592.1|WP_014537828.1|1860387_1861350_-|GlxA-family-transcriptional-regulator MSQNEVLEVGLLIYPGVQMASVLGMTDLFEMANHVNGKDAAKTIRISHWKTTEDDGAPARVFDSFPLAESEPAVLVVPPAFGVPITPEVAQVYAPWLRQRHGGGTALGSVCTGSFLLAETGLLDGRRITTHWTVDAYLRARFPKVALDADQVIVEDGDIMTGGGAMSWIDLGLRIVDRLLGPAIMAETARSLLVDPPHREQRYSSTFAPRLNHGDAAILKVQHWLQATSAKEGDVERLANVAGLEGRTFLRRFKKATGLTTTEYFQRIRVGRAQELLQAGSQSIDQIAWDVGYSDPGAFRKVFTRIIGLSPGDYRKRLRT >NZ_CP016592.1|WP_014537829.1|1861656_1862598_+|polysaccharide-deacetylase-family-protein MPQSTNTSSPMRSDTGQYRFAPMRGRPRLHWPGNARMAFWVAPNIEHYELDPPVNPNRSPYARVQPDVLNYGWRDYGNRVGFERMARVMADRGIRGSVSLSVAVIEHFPDIIAQCNDLGWEMFSHGVYNTRYFYGMTEDQQRQVIRDSRESLARVGQTLDGWLTPAITPSLATQDLLAEEGVRYTLDYFHDDQPMPVKVRNGRLISVPYSIEMNDVPMVNWQNASPTAMLDSLKAQFDRLYAEGAGNPNVMCFATHPFLLGQPHRISVLTEFLDYVKSHDQVWYPTAREIADYYYTHHYDRVNTWLASLEESA |
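The spacer records in the Spacer_info cell use a pipe-delimited header followed by the sequence. The sketch below parses one such record. The field names (`spacer_id`, `start`, `length`, `contig`, `tools`) are assumptions inferred from the values shown on this page, not a documented schema, and `parse_spacer` is a hypothetical helper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Spacer:
    # Field meanings are inferred from the records on this page (assumption).
    spacer_id: str
    start: int
    length: int
    contig: str
    tools: tuple[str, ...]
    sequence: str

    @property
    def end(self) -> int:
        # Inclusive end coordinate, assuming 1-based inclusive positions.
        return self.start + self.length - 1


def parse_spacer(record: str) -> Spacer:
    """Parse a record such as '>1.5|1851613|34|NZ_CP016592|PILER-CR,CRT SEQ'."""
    header, sequence = record.lstrip(">").split(maxsplit=1)
    spacer_id, start, length, contig, tools = header.split("|")
    spacer = Spacer(
        spacer_id=spacer_id,
        start=int(start),
        length=int(length),
        contig=contig,
        tools=tuple(tools.split(",")),
        sequence=sequence.strip(),
    )
    # Sanity check: the declared length should equal the sequence length.
    if len(spacer.sequence) != spacer.length:
        raise ValueError(f"length mismatch for spacer {spacer.spacer_id}")
    return spacer


record = (
    ">1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT "
    "TCCCATAAAAAAACCCGCCTCTAAGGGCGGGCTG"
)
spacer = parse_spacer(record)
print(spacer.spacer_id, spacer.start, spacer.end, spacer.tools)
```

For spacer 1.5 the derived end coordinate agrees with the Spacer_region reported for that spacer in the hit table (1851613-1851646), which supports the assumption that the third header field is the spacer length.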
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info |
---|---|---|---|---|---|---|
NZ_CP016592_2 | 2138731-2138830 | Orphan |
NA
Consensus repeat of NZ_CP016592_2
|
1 spacer
spacers of NZ_CP016592_2
>2.1|2138756|50|NZ_CP016592|CRISPRCasFinder TGGTATACGGAACTGAACACCGCGCAGCATGTGGGCGCGCCCGGCTTTAT |
CRISPR arrays and Neighbor proteins around NZ_CP016592_2
The CRISPR arrays of NZ_CP016592_2 >merge|NZ_CP016592|2|2138731-2138830|CRISPRCasFinder GGACGAGGGCAAAGCCGCCTTCCGTTGGTATACGGAACTGAACACCGCGCAGCATGTGGGCGCGCCCGGCTTTATGGACGAGGGCCAAGCCGCCTTCCGT >NZ_CP016592|2|2|2138731-2138830|CRISPRCasFinder GGACGAGGGCAAAGCCGCCTTCCGT TGGTATACGGAACTGAACACCGCGCAGCATGTGGGCGCGCCCGGCTTTAT GGACGAGGGCCAAGCCGCCTTCCGT
>NZ_CP016592.1|WP_013384834.1|2136595_2137804_-|ROK-family-protein MVSAMTTAEGATEKRKRAIGANPERNRAHNRSLVLNLLREHGQIGRAAMARHTRLTQQAVGNIIDELLLEGMVIETGRLRVGRGQPARQFALNPCGPVSLGVEIAAGHLAIVFQALTGAIRARSIVPLADTAPAPVIAALVAQIEKLKSEAGAPEIIGMGVVMPGPFEIEGISAVGPATLKGWAGLDPAALIADATGIGTVVYENDATAAAVFESLHGVGRGLRDFCHVYFGVGLGLGLIHDGRPLRGAFGNAGEIGQIAVPPRGGGAAAALEDRASVFALRDFLRETRGAPDDLDLLASLDPAEDPALQDWIARAADQLSPVLAILENIFDPETITLGGLMPRPIIEAMIDCLQPLPVTVSSRSARGLPRLMLAQTGPYTAALGAAAMPFMDQNTTATLRQ >NZ_CP016592.1|WP_013384833.1|2134222_2136577_-|PIG-L-family-deacetylase MLTDRNRLFRRIADPRMVRLARALGRLGSTVTMMNTGAHPDDEQTELLAWFSFGRNMRVVIACSTRGEGGQNALGPERGAALGLVRSRELEESARVIDADIAWLGHGPVDPVHDFGFSKDGKDTLERWGRARVIDRLVRAYREYRPDIVLPTFLDVPGQHGHHRAMTEAAEAALALAADPTYEVDGLAPWRVAKYYLPAWSGGGSTYDDELPPPPATVDVRVEGFDPVSGMRYDQIGEASRGYHASQGMGTWRATPRRHWALHGTAPEGDILDGLPATLGALADVAGAPAELALAASEIAKARAAFPDDQAMITALVAAHKAFSAPMGADFDALHGHRIRQKIAEVEAALALAAGVDVAAWLQDPLVPGRSATLAVWVNAGHATLRGIAAAASDGIAQKGGSEERDGLHLLTLAVPADLSPASRYLPGWARLGGNGILAASVTVEVGGITFALPVDLEEEPLLQPAASVTPSADAVLVNLNDPQPVHFAMTGTASAASLGLGSIDGLTVENADGHVTLTASGLAPGKTRLPISVAGAAGWQAKPINYGHIGRLAQVVPAGVDILALDLKIPDGRVGYIGGGADRVGLWLERMGVDVVDLDAEAFDAARANGFAGFDTLVVGIFTFGLRPDLAAATADLRAWVHAGGNLVTLYHRPWDNWKPDETTPAHMVVGSPSLRWRVTRPGAPVTILEPDHDLLAGPNTITHADFDGWDKERGLYFLSSWDQVYQPLLAMSDPDEQPLLGSLVTGRIGKGRHTHTALVLHHQMDRLVPGAFRLMANLIQPA >NZ_CP016592.1|WP_013384832.1|2133334_2134222_-|DMT-family-transporter MKPATALDDQTSAAEMRQGMTWVLLDMALVSAMTVMVKKGGVDFPAVQMVFFRSLVGLVAVLPLVLRHWRVIRQTRNVKRNVFRVTCNAVALSCNWGALTILPLATANAIGFLRPLIVMVMAIFLLSERVTGWRWAGAALGLMGVGVMLLPSLTGMGEAQDHLLGYAFAGGAILFGAMATIQTRALKGENTTVMMVFYTVGLTLFTAIPAFFVWQPVALHHLPHLLGIGIIAQVAQYCYLRGYQLAPASKLAPLGYLSLIFATVMGYVFFDEVPTVYTAGGAIVIIIGLIVARRA >NZ_CP016592.1|WP_014537902.1|2132713_2133328_+|nuclear-transport-factor-2-family-protein MKKLIASFAIAASIAAPVFADTEVRPGVFFAGTVETAGSDRAVMQDLIFDLATAWAVCDRDAMANAITDDVSFSYPTSAVNGREAIMADLEAFCGAATDTSLYFPADAFYIDVDTGRIAAEVQFRTFQRGNRQVVNDVWIATVTDGKVSVIKEYLDGRVKDLQAQGVLQLEESPDFLTPWPPRTEAWASCFPIVRAAPTNDCVQ 
>NZ_CP016592.1|WP_013384829.1|2131869_2132709_+|bifunctional-5,10-methylenetetrahydrofolate-dehydrogenase/5,10-methenyltetrahydrofolate-cyclohydrolase MTTIFTGFDLAADILQGVRADIATLGRAPVCVTLFDDSSAPARAYLNRQITLARGAGIDLRPMGYADAQLAQLAADARVDAIATLYPLPSGLTPMGAAQAIGGGKDIDGQHPNHAGPLLLGDGTLRPAATAQASLICARAILGDLAGAEIVLIGASRLIGRPLAMLLLDAGATVTTCHIQTRDLARHTRAADLVISAAGVPALLTADNIAKGGRILDLAIIPKDGSLVGDADLPSLMGHAALVSAVPDGVGPVTTACLFANIAAAAKSRAMNLPLLQQD >NZ_CP016592.1|WP_083205735.1|2131081_2131873_+|helix-turn-helix-domain-containing-protein MPATDENRALFEDDKEISLTLARGLDLIEAFAGDERRLSIPELAARTGMNRTVVRRLVRTLEKKGYASADRGQYELTPHILRLIRGFIEGRSLPQIVHPLLRAAAEDIGESVSFAMLDDTEAVYVAHAFLPARFTLNMVTVGSRAPLLPTAVGRVIVAFLPDIERSAILSRLSPQAHTPQTETDAARLDAIFADCRRLGYCMADGEYVEGVASLAVPVFDGMRRVTGALSIIFPTHGHDATEIAEKLAPRMQATASALGSALQ >NZ_CP016592.1|WP_044008068.1|2130237_2131095_+|bifunctional-5,10-methylene-tetrahydrofolate-dehydrogenase/5,10-methylene-tetrahydrofolate-cyclohydrolase MMALLLDGDALAAKLRQQMTERVAASGIRPVMATVLVGDNPASESYVARKHKDCREIGIEALRIRLPAGASPEQVLAEVARLNDDPSVDGFFVQFPLPEGHDEQAIAAAIRPDKDIDGLHPENLGRLITGKGGIPPCTPMAVLSLLRGYNVPLAGKHVVIIGRGLLVGRPLAMVLSAPGVDASVTLLHSQTPDIAAFTRNADVVIAAAGHPELIRADMIRAGATVVGVGITYGDDGAMVSDIAADVSAIAGAVTPAHGSVGSLTRAMLLQNLINLALEKHSHARN >NZ_CP016592.1|WP_013384825.1|2129090_2130236_+|hypothetical-protein MNRILTALTASVAAFAGTTASAGEFATAHYTTPLADVCPSPFYIQKDWLAQAEHGGLYQMIGAGGTMESGAYRGPLGATGIELAILEGGGGIGLGDGETAYSALFNGNSKAGVIPHLGFQELDNAYIFSNLFPVVGVFVPLDIAPSGLIWDTGTYPDGFHSVDDLKAFGESGAGMIYVSTITRTFGLWLVEQGVSRDAFVEGYRGDLENFVANNGTWLNQGFVTTEVFNLSNGMNWAKPVDAVTVNELGYPTITGMVSVAQPRLEELAPCLELLVPIMQQAAVDYINDPAEVNQLIADFTAGGFSASWWRATPELNAYSAAAQRDRGIVGNGNNATIGDFDLDRAAAMLELVKPMLDDRANPDVTVDDVVTNRFINPEIGL >NZ_CP016592.1|WP_014537901.1|2128174_2129035_+|ABC-transporter-permease-subunit 
MTEIAPKIRTEAYAPTPVAPPRTPAQKFAATFLPPLIMGVLVVLLYWVVRESLPAHRQFLMPSASGMWDKALSQPAVWAELGSRSLTTLTIALTGLAFSIPIGMALGIIMFRFFVMERAVYPFLVALQSIPIMAIIPLIQSALGFGFMPKVLIVILFTFFAIPTTLLLGLKSLDQGVLNLFRLQGASWWTMLRKAGLPSSAPALFAGFRISTSMAVIAAVTSELFFMAGRGGLGQMLVNAKTDFKYEQMYAALIASATLSISIFVVFTLVGNRIFASWYETAERKS >NZ_CP016592.1|WP_014537900.1|2127464_2128169_+|ABC-transporter-ATP-binding-protein MQYPDGTIALEGIDLTIRKGEFVSVVGPSGCGKSTLLKLASGLEAHTGGQIRVDRSNLGYTFQDATLLPWRTVLPNVELLMELRGIPPEERRRVALEQIELVGLKGFENHYPKRLSGGMRMRASLARSLALNPAVFMFDEPFGALDEITRERLNDELIALYLRNGFTGMFITHSIPEAVYMSSRVIVMSRRPGRIIADFPIPFAYPRQPELRYDPEFSRIAGEVSVALRHAIEE >NZ_CP016592.1|WP_013384836.1|2139319_2140255_+|sugar-ABC-transporter-permease MSVTDPNGAAEGRPGLWNRLGIRTKHVLWAWAFLAIPVLFYVVIRFYPTFDAFWLSLTDGNIRRGPSFIGLENYARMYADPVFWKVFGNTFLYLLIGTPVSLVISFTIAYYLDRVRFMHGLIRALYFLPYLTTAAAMGWVWRFLYQPVPIGMINSFLTSIGLEQQPFLRSTDQALMAATIPAIWAGLGFQIIIFMAGLRAIPSSFYEAARIDGLGEWAILRKITLPLLKPTTIFLVVLSSIGFLRIFDQVQSLTANDPGGPLNATKPLVMLIYQTAFSSFRMGYASAQTVILFLVLLLISLLQLWLLRDKK >NZ_CP016592.1|WP_013384837.1|2140251_2141100_+|carbohydrate-ABC-transporter-permease MSASTELAANRRNIRPGRVIAWTLLILGGFLMALPILYMFSTSLKPASDTFDLRLIPAAPTLANYIDILQDGRFIRWFYNSMIIAVAVTASNVFFDSLVGYTLAKFDFRGKNIVFIAILSTLMIPTEMLVIPWYMMSAKLGWLDSHWGIMFPGMMTAFGTFLMKQFFEGVPNDFLEAARVDGLNEFTIWWKIAMPMVLPAISALAIFTFLGNWTAFLWPLISTTSPDLYTLPVGLNSFAVGEAVRWERIMTGAALATIPTLLVFLALQRFIVRGVMLAGLKG >NZ_CP016592.1|WP_013384838.1|2141103_2142621_+|argininosuccinate-lyase MSNPNDPRLTDGSVFPDPVYKETVLRPLFDGAKTHHVAAFGAIDRAHLVMLAETGILPAADAGKIAVARAALDTEIDPATLTYTGEVEDYFFLIEKELKARVGAELGGRLHTARSRNDIDHTLFKLGLRARLNLLIEQAIALHGAIVAKAEAESATLIVAYTHGQPAQPSTLGHYLSAMAEILARDIQRLFEAYRIVNLSPMGAAAITTSGFPINRERVAELLGFAAPLQNSYSCIASVDYITSTYSAMELMFLHLGRPIQDLQFWTSFEVGQIYVPNALVQISSIMPQKRNPVPIEHLRHLASQTVGRAHSMLTIMHNTPFTDMNDSEGETQETGYQAFEVAGRVLTLLAALVAQIKVDPARVASNIRRSCITITELADSTVRREGLSFREGHEIAAAVARAVVAAEGDLTTDGYAPFVTAFKHATGRDPQIDAAAFAQITSPEYFVAVRDRTGGPAPEALAQAISGYKTQNAGFAAQLATLIATQSAADADLATAFNLLKESA 
>NZ_CP016592.1|WP_013384839.1|2142620_2143682_+|sn-glycerol-3-phosphate-ABC-transporter-ATP-binding-protein-UgpC MAKIELEGLVKDYGKVRAVHGIDLQIEDGEFVVFVGPSGCGKSTTLRMIAGLEDISGGALKIGGKVVNQLEPKQRNIAMVFQNYAIYPHMTVGQNIAFGLYTSKLPKAEKDRLVREAGETLGLTPYLDRRPAALSGGQRQRVAIGRAMVRSPSAFLFDEPLSNLDAQLRGQMRIEIKRLHQRLGTTIVYVTHDQVEAMTMADKIVVMRDGRILQVGSPLDLYENPVDVFTARFIGSPSMNVIEGESDGVNLRLGNSTLPGFGANLPAGKVMVGLRPHDLKVGVPGDATLEAVVTAIEPLGAETLVHMEVAGQPLVGSAPGRVLPVVGSTVTASVTRGVLYVFDAQTEKALGRA >NZ_CP016592.1|WP_013384840.1|2143678_2144584_+|ribokinase MTSKNNGEGVLSLGRIYADLAFAELDAPPTPGREVYAQSFSLTPGGGAVITAAHLVAAGRPAHLLARLGTDPIAVAIASELTALDLDLTYVERAADAGPQLTVAIVTPEDRAFITRRSPRGMPSQAAAALHGAGLRHLHIAEYATLAENPALIVTAKMAGLTISLDPSWDESLIHGPALLAASSGVDVFFPNMDEATALTGKTAPAAALDILAQHFPVVALKCGSAGAMLAVGSTRFSVTAPKTVVVDTIGAGDSFNAGFLDAWLSGLAPEEVLRRAVQRGSQSVMAAGGTGCLSQMKSAS >NZ_CP016592.1|WP_013384841.1|2144596_2145940_-|Ktr-system-potassium-transporter-B MKTAVKGWAARFLSLPPPLVVAGIYIATITMGASLMMLPMAQAMPMRWSDAFFMATSAVTVTGLAVVDVGSHLSLLGQAVLVTLVQLGGLGLMTFAVLILEIVGRPVGLMGEAYLREDLKQNALWRVGRLVRRIAVVVFAIEAVGIAILCLSFIPDLGFWPGLWAAIFHGIGAFNNAGFSIFRTGLMEYVADPIVNLVIPALFITGGIGYFVLHDLIYKRRWRYWSLNTRIMLAGTAVLIPWSVLMFAALEWTNPATLGGLDGIWPRIAASWFQGVTPRTAGFNTLDISGIHDSTAMLFISLMLIGGGATSTAGGIKVTTFVVMILATIAFFRRQTQLHIFGRGIGPDEVLKVMAIVAVSLVLVFCGVFLLSLSHDGHFLDIAFEVASAFSTTGLSRNYTPELNDFGRCVIMVIMFIGRLGPLTLGFFLATQLSPRVRYPQERIHIG >NZ_CP016592.1|WP_013384842.1|2145943_2146606_-|TrkA-family-potassium-uptake-protein MARTEQSFVVIGLGAFGAAVASELARFGNRVMGIDLDERRVAQMVSVLPTALILDATDEIALREAGVDRYDVALVAIGQNIEASILATMNLRLLGLETVWVKAASRVHHRILVKIGADRVILPEQEMGRHIAQMLNNPVVQDYVSLGNGFNVVSIELPKALDGATPKSLGLVGREEPRLMAAMRGTQQLDIANPDLRFAPNDKLILLGRRVVLQAFSDGL >NZ_CP016592.1|WP_013384843.1|2146720_2147068_-|hypothetical-protein MIATSAFAQAPAMPDMDEAVAAANNQLGVLEYCAAEGHIESTAVEVQERLLQVLPPASDPTAVEAAYAAGKEGTIAVSGTEMSLADAATGQGTDVAALCQQLGSMVEQAGASLPN >NZ_CP016592.1|WP_013384844.1|2147692_2147845_+|hypothetical-protein MSLINQTVGLGWWSNIAMVRYVEGSMFLRATMDVPHGRRIYQSGMIRLPS 
>NZ_CP016592.1|WP_013384845.1|2148319_2148526_-|hypothetical-protein MNTAPFVRIALRYIAGGLVSYGILTPEGATAFASDPQVIAQASIVLGAATAALTEGFYALAKRWGWRT |
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NC_017903 | Escherichia coli Xuzhou21 plasmid pO157_Sal, complete sequence | 24723-24756 | 6 | 0.824 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NC_011148 | Salmonella enterica subsp. enterica serovar Agona str. SL483 plasmid, complete sequence | 37052-37085 | 7 | 0.794 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NZ_KU963390 | Escherichia coli strain ECO37 plasmid ECO37P2, complete sequence | 30385-30418 | 7 | 0.794 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NZ_CP010193 | Escherichia coli strain M8 plasmid B, complete genome | 33332-33365 | 7 | 0.794 |
NZ_CP016592_1 | 1.6|1851679|33|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851679-1851711 | 33 | NZ_AP022566 | Mycolicibacterium alvei strain JCM 12272 plasmid pJCM12272, complete sequence | 45464-45496 | 7 | 0.788 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NZ_CP049247 | Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed1, complete sequence | 18758-18791 | 8 | 0.765 |
NZ_CP016592_1 | 1.10|1851952|34|NZ_CP016592|PILER-CR | 1851952-1851985 | 34 | NZ_CP007130 | Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence | 539756-539789 | 8 | 0.765 |
NZ_CP016592_1 | 1.14|1851945|34|NZ_CP016592|CRISPRCasFinder,CRT | 1851945-1851978 | 34 | NZ_CP007130 | Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence | 539756-539789 | 8 | 0.765 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NZ_CP019707 | Pantoea alhagi strain LTYR-11Z plasmid pPALTYR11Z, complete sequence | 62089-62122 | 9 | 0.735 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NZ_CP007740 | Bacillus methanolicus MGA3 plasmid pBM69, complete sequence | 16414-16447 | 9 | 0.735 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NZ_LT222315 | Pseudomonas cerasi isolate Sour cherry (Prunus cerasus) symptomatic leaf plasmid p58T3 | 108477-108510 | 10 | 0.706 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NZ_LT963398 | Pseudomonas cerasi isolate PL963 plasmid PP3, complete sequence | 75521-75554 | 10 | 0.706 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | CP034538 | Pseudomonas poae strain CAP-2018 plasmid unnamed | 107853-107886 | 10 | 0.706 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NC_008738 | Marinobacter hydrocarbonoclasticus VT8 plasmid pMAQU01, complete sequence | 62733-62766 | 10 | 0.706 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NC_007678 | Salinibacter ruber DSM 13855 plasmid pSR35, complete sequence | 14814-14847 | 10 | 0.706 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NZ_AP022559 | Geobacillus subterraneus strain E55-1 plasmid pGspE55-2, complete sequence | 821-854 | 10 | 0.706 |
NZ_CP016592_1 | 1.16|1851811|35|NZ_CP016592|CRT | 1851811-1851845 | 35 | NC_021289 | Burkholderia insecticola plasmid p1, complete sequence | 984065-984099 | 10 | 0.714 |
NZ_CP016592_1 | 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT | 1851613-1851646 | 34 | NZ_CP029542 | Streptomyces sp. NEAU-S7GS2 plasmid unnamed1, complete sequence | 9497-9530 | 11 | 0.676 |
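The Identity column in the table above is consistent with (spacer length - mismatches) / spacer length, rounded to three decimals. The sketch below reproduces that arithmetic for a few rows. The `identity` function is a hypothetical helper, and the formula is inferred from the tabulated values rather than taken from the tool's documentation.

```python
def identity(spacer_length: int, mismatches: int) -> float:
    """Fraction of matching positions, rounded to three decimals.

    Assumption: identity = (length - mismatches) / length, as inferred
    from the values in the hit table.
    """
    if spacer_length <= 0:
        raise ValueError("spacer_length must be positive")
    if not 0 <= mismatches <= spacer_length:
        raise ValueError("mismatches must be between 0 and spacer_length")
    return round((spacer_length - mismatches) / spacer_length, 3)


# (hit, spacer length, mismatches) taken from rows of the table above.
rows = [
    ("NC_017903", 34, 6),
    ("NZ_AP022566", 33, 7),
    ("NC_021289", 35, 10),
    ("NZ_CP029542", 34, 11),
]
for hit_id, length, mismatches in rows:
    print(hit_id, identity(length, mismatches))
```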
1. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NC_017903 (Escherichia coli Xuzhou21 plasmid pO157_Sal, complete sequence) position: , mismatch: 6, identity: 0.824
tcccata-aaaaaacccgcctctaagggcgggctg CRISPR spacer -gcgacacaaaaaacccgcctctaagggcgggtta Protospacer * *.* ************************.*.
2. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NC_011148 (Salmonella enterica subsp. enterica serovar Agona str. SL483 plasmid, complete sequence) position: , mismatch: 7, identity: 0.794
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer cgacacaaaaaaacccgcctctaaggacgggtta Protospacer . **.********************.****.*.
3. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NZ_KU963390 (Escherichia coli strain ECO37 plasmid ECO37P2, complete sequence) position: , mismatch: 7, identity: 0.794
-tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer gcttcat-aaaaaacccgcctctaaaggcgggtta Protospacer ...*** *****************.******.*.
4. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP010193 (Escherichia coli strain M8 plasmid B, complete genome) position: , mismatch: 7, identity: 0.794
-tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer gcttcat-aaaaaacccgcctctaaaggcgggtta Protospacer ...*** *****************.******.*.
5. spacer 1.6|1851679|33|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NZ_AP022566 (Mycolicibacterium alvei strain JCM 12272 plasmid pJCM12272, complete sequence) position: , mismatch: 7, identity: 0.788
tgcttccg--gtaaaactccaccgcagcgtggaac CRISPR spacer --ccgccgacgtgaaacaccaccgcagcgtggaag Protospacer *. *** **.**** ****************
6. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP049247 (Rhizobium rhizoryzae strain DSM 29514 plasmid unnamed1, complete sequence) position: , mismatch: 8, identity: 0.765
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer gggcaacaaaaaacccgcctcgaggggcgggctt Protospacer ** ************** *.*********
7. spacer 1.10|1851952|34|NZ_CP016592|PILER-CR matches to NZ_CP007130 (Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence) position: , mismatch: 8, identity: 0.765
cccagcaggcgcggcgcagacgacaggcgggacg CRISPR spacer atcgacgggcgcggcgcagacgacaggcaagacc Protospacer .*..*.*********************..***
8. spacer 1.14|1851945|34|NZ_CP016592|CRISPRCasFinder,CRT matches to NZ_CP007130 (Gemmatirosa kalamazoonesis strain KBS708 plasmid 2, complete sequence) position: , mismatch: 8, identity: 0.765
cccagcaggcgcggcgcagacgacaggcgggacg CRISPR spacer atcgacgggcgcggcgcagacgacaggcaagacc Protospacer .*..*.*********************..***
9. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP019707 (Pantoea alhagi strain LTYR-11Z plasmid pPALTYR11Z, complete sequence) position: , mismatch: 9, identity: 0.735
tcccataa------aaaaacccgcctctaagggcgggctg CRISPR spacer ------aaaggcttaaaaaaccgcctctaagggcggtctt Protospacer ** ***** **************** **
10. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP007740 (Bacillus methanolicus MGA3 plasmid pBM69, complete sequence) position: , mismatch: 9, identity: 0.735
tcccataaaaaaacccgcctctaagg-gcgggctg CRISPR spacer aaccataaaaatacccccctctaaggtgtttgtt- Protospacer ********* **** ********* *. *.*
11. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LT222315 (Pseudomonas cerasi isolate Sour cherry (Prunus cerasus) symptomatic leaf plasmid p58T3) position: , mismatch: 10, identity: 0.706
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer caaaacgaaaaaacccgcctattagggcgggttt Protospacer . *..************* * ********.*
12. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NZ_LT963398 (Pseudomonas cerasi isolate PL963 plasmid PP3, complete sequence) position: , mismatch: 10, identity: 0.706
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer caaaacgaaaaaacccgcctattagggcgggttt Protospacer . *..************* * ********.*
13. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to CP034538 (Pseudomonas poae strain CAP-2018 plasmid unnamed) position: , mismatch: 10, identity: 0.706
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer acttcacaaaaaacccgcctttaatggcgggttt Protospacer *.. *************.*** ******.*
14. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NC_008738 (Marinobacter hydrocarbonoclasticus VT8 plasmid pMAQU01, complete sequence) position: , mismatch: 10, identity: 0.706
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer agaaaacaaaaaacccgcctcaatgggcgggttt Protospacer * ************** * *******.*
15. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NC_007678 (Salinibacter ruber DSM 13855 plasmid pSR35, complete sequence) position: , mismatch: 10, identity: 0.706
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer agtcagtaaaaaacccgcctttacgggcggggca Protospacer .** *************.** ******* ..
16. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NZ_AP022559 (Geobacillus subterraneus strain E55-1 plasmid pGspE55-2, complete sequence) position: , mismatch: 10, identity: 0.706
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer gaaaaataaaaaacccgcctcaaaaggcggggtc Protospacer * ************** **.****** *
17. spacer 1.16|1851811|35|NZ_CP016592|CRT matches to NC_021289 (Burkholderia insecticola plasmid p1, complete sequence) position: , mismatch: 10, identity: 0.714
gcgaccaatggcgcgtagcgacaccctatcaggac CRISPR spacer acgacgaatggcgcgtcgcgacacccatctgcaac Protospacer .**** ********** ********* ... .**
18. spacer 1.5|1851613|34|NZ_CP016592|PILER-CR,CRISPRCasFinder,CRT matches to NZ_CP029542 (Streptomyces sp. NEAU-S7GS2 plasmid unnamed1, complete sequence) position: , mismatch: 11, identity: 0.676
tcccataaaaaaacccgcctctaagggcgggctg CRISPR spacer cgtaacgaaaaaacccgcctcggagggcgggact Protospacer . . *..************** .******** .
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
566367 : 574800
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP016592|566367:574800|DBSCAN-SWA TTTAGTGGCCCGTTGCCCCAGCCGCAGGCAGCGCGGCCGACCAGCTGAGATTGCGATGCGCATCGCGCAGGCGCAGCAATGACGAGACATGGCTATCCGCGTCCAGCGCCGTCATCACACGGTGCAGATGTTCGGCATCGCGCACATCGACATCAATGCGCAGGCTGTAGAAATCCAGCTTGCGATCGAGGAAATCGAGGTCCGAGATATTGGCGTCCTGCTCGCCAATCAGCGTGCAGACGCGGCCCAGAACGCCCGCATCATTGGCGACAGACAGTTCCAGCGAGACGGTGTTGATCGCGCGGTGCTGCCCCTCTTGCCAGCGCAGATCGACCCAGCGATCGGGCTGATCCTCGTATTCCGACAGGGCCGGGCAGTCGATGGCATGGACGATAACACCCTGACCGCGGAATGTGATGCCGACGATCCGCTCGCCCGGCACCGGCTGGCAGCAGGGCGCACGGCGGAAACTTTGGTCGGGGCTAAGGCCGACGACCGCTTTTTCCGCGTCGATTTCATTCGCATCGCGCAGCTTCAGATCGGGATAGATGGCGCGCACAACCTCGCGCGCGGTAATTTCGGCGGCTCCGACGCGCAGCAAAAGCTGCTCGGCATTCTCGAAAGCCAGCGCGCGCGCGGCGGTGGCCAGCGCTTTGTCAGTGGATTTTTTCCCTGCGTTTTCAAAGGCCACGCGGGTCAGCTCGGTGCCAAGCTTGATGAACCGGTCACGGTCCTTTTCGCGCAGCCAACGGCGGATCGCGGATTTCGCGCGGCCGGTGACGGCAATGTCGATCCATGTCGCCTGCGGGGTCTGGCCGTCGGCGATGATAATCTCGACCGATTGACCGTTCTTCAGCCGTGTCCACAGCGGCACGCGCAGCCCGTCGACCTTGGCGCCGACGCAGGCATGGCCGATGCGGGTGTGGATCGCATAGGCGAAATCAATCGGGGTCGCGCCTTGCGGCAGCTTGATTACTTCGCCCTTGGGGGTAAAGCAAAAGACCTGATCTTGATACATCTCAAGCTTGAACGTCTCGAGGAATTCGTCGTGGTCCTGATCCTCTTCGAACCGCTCGGAGAGTTGCGCGATCCAGCGGGCGGGATCGACGACAAAGCGGTTCTGCACCGGCTCGCCGTCGCGATAGGACCAGTGGGCCGCGACGCCTGCCTCGGCCACTTCGTGCATTTCGCGGGTGCGGATCTGCACCTCGACCCGCTTGCCGCCGCGCGCGGACACAGCCGTATGGATCGAGCGGTAGCCGTTCGATTTGGGCTGGCTGATGTAATCTTTGAACCGGCCGGGGACCGAGCGCCAGCGTTGATGGATCGCCCCCAGCGCGCGATAGCAATCGACATCGGTTTTCGTGATCACGCGAAAGCCATAGATGTCCGACAATTGCGAGAAGCTTTGATCCTTCTCCTGCATTTTGCGCCAGATCGAATAGGGCTTTTTCGCGCGGCCATGCACCTCGGCGGGAATGCCGGATTTCTCGAACTCGGCCAGCAGGTCGGTTTTGATCTGATCGACCAGATCGCCCGATTCCTGTTGGATCAGGCTGAACCGCTGGATGATGGAATCGCGCGCCTCGGGGTTGAGGACGCGAAAGGCCATATCCTCCAGCTCTTCACGCATCCATTGCATCCCCATGCGGCCGGCAAGCGGCGCATAGATATCCATGGTTTCGCGGGCTTTTTTCACCTGCTTTTCCGGCCGCATCGAGGCGATGGTGCGCATATTGTGCAAGCGGTCGGCCAGCTTGACCAATGTCACCCGCAAATCGCGCGAGGTTGCCATGATCAACTTGCGGAAATTCTCGGCCTGTTTGGTTTCAGCCGAATGCAGTTGCAGATTGGTCAGTTTGGTGACGCCATCGACCAGATCGGCAATGACCGCGCCAAAGCGGGCCTCGACCTCGGCATAGGTG
GCGCGGGTGTCCTCGACCGTATCATGCAGCAGGGCGGTGATGATCGTGGCATCATCCATCTGTTGCTGCGCCAGGATCATCGCGACGGCTACGGGATGGGTGAAATAGGGCTCGCCCGAATGGCGGTACTGGCCGTCATGCATCAGACGGCCGAATTCCCAGGCATCGCGCAGCAGGGATTCATTGGTGGCGGGATTATAGCTGCGCACGCGCGCGATCAGATCGTCTTGCGTGATCAGCGGTTCGGGCACGACTTGCGCCGCGCCCGCAGGCAGGGTGGGGGCGTTGTTATCCACCCCCGTCGACAAAGCCACATCAGACCGCGTTACATCAGGCGGGCCCTGATCATTCGCCGCCGCGCTGGCTTTGTCACTGCCTTCCATGGGTGTGGCCTCAGCGGCCCTCTTGCGCTTGCAGCAGCTCGCGCAGCAGGCGCTCTTCGGTCATGTCGTCCTGGGCGGGACGATCCAGCTCGGCGCTCATCAGCAGAGCCATCGAATCGTCTTCGGGCTCGTCGACTTCGATCTCGTGCTGGTTGGCTTCGATCATCCGCTCGCGCAGGTCATCGGCCAGCTGGGTTTCTTCGGCGATCTCGCGCAGCGATACAACGGGGTTCTTGTCGTTGTCGCGCGGCACAGTCAGGGCCGACCCTGCAGCGATCTCGCGTGCACGATGTGCGGCCAGCATGACCAGTTCGAAACGGTTCGGAATCTTGTCTACACAATCTTCGACGGTGACACGTGCCATGCGACTACTCCCCTTCGGTGGGTTTGACGGGTCTTGCCGGTAGCTACCCGCACGAATCCGATATATCGTGCAGCGACCGGGGGAAAACATTCATTTAGAGGGGGCAGACGCGGAACGCAAGGGCCGCAGGCAGGATAGCGAGGATATCCGTGTCAAATTCGCCGTTGGGGAGGGGTATCCGTTAAAACCGCGCGGTCTTATTTTAGAGTTGCTCTAAATTTGGTTAACGCAACTTTATTATAGACGAGGATTTTCCGCGTGTTTTCGGTAATTTGGGGGAACTTTAGACTTTATGCAGCATTAGATTCGCGTTTTACTTGCACCGCGAGGTGAAGTTGTTAACTAATTTGTTATCCGCTGCACATCGCACTGATGTTTCATTTCCTGTTTCCGTAGCAGACCGCGATTGGAAGGATAATATATGTTTTACCGGGACGAGCGGATTGCGCTCTTTATCGATGGGGCGAATCTGTATGCCGCATCGAAATCCTTGGGGTTTGATATCGACTACAAACTGTTGCGTAGCGAGTTCATGCGACGCGGACGGCTGATCCGTGCTTTCTACTATACAGCGCTGCTTGAGAACGAGGAATATTCCCCGATCCGACCATTGGTTGACTGGCTGCATTACAACGGCTTTTCAATGCGTACAAAGCCCGCGAAAGAGTTTCAGGACGCCCAAGGCCGCCGCAAGATCAAAGGCAATATGGATATCGAGCTGACCGTCGATGCGATGGAATTGGCGCCCCATGTCGATCACATTGTTCTGTTCTCGGGCGATGGCGATTTCCGTCCATTGATCGAGGCGCTGCAGCGGCGCGGCGTGCGCGTATCGGTTGTATCCACCGTGCGTAGCCAGCCGCCGATGATTGCCGATGAATTGCGCCGTCAGGCTGATAATTTCATCGAACTGGACGAGCTGCGCGATGTGCTGGGTCGCCCGCCGCGCCCCGATGCCCGCCCCGGTATGCCGCGCGACGAGACGGTCGAGACGCAAAGCCTGCTGGATTAAACCACGCCCGGCGCCCTTTACGGGGGCGCCGCATCGGCTTATCTGAAAGGGCAACGCTGTCTCGGAGCCTCGCCCATGTCCCTGCCTCCTCTTACGGTCTATCTGGCCGCACCGCGCGGCTTTTGTGCCGGTGTGGATCGCGCGATCCGTATTGTGGAAATGGCGCTGGAAAAATGGGGCGCGCCGGTTTTTGTGCGCCACGAGATTGTTCATAACAAATATGTCGTCGACGCGCTGCGCG
CCAAAGGTGCAGTCTTTGTCGAGGAATTGGATGAATGTCCCGAAGATCGCCCTGTGATCTTTTCGGCGCATGGCGTGCCGAAATCTGTCCCGGCCGAGGCGGTGCGCCGCAATATGATCCATGTCGATGCCACCTGTCCGCTGGTGACAAAGGTGCATAACGAGGCCGCCCGTCATCATACCAACGGTTTGCAGATGATCATGGTCGGTCACAAGGGCCACCCCGAGGTCATCGGCACCATGGGCCAACTGCCCGATGGCGAGGTGATGCTGGTCGAGACGCTCGCCGATGTCGCGACGGTTCAGGTGCGTGACCCCGCGCGCCTGGCGATGATCACACAGACCACATTGTCGGTCGATGATACTGCCGAGATTGCCGCCGCGCTGAAAGCCCGCTTTCCGGCGATCAATGTCCCCGCGAAAGAGGACATTTGCTATGCCACCACCAATCGGCAAGAGGCGGTCAAAGTGATGGCCCCCAAATGCGATGCGATCCTTGTGGTCGGCGCGCCCAATTCCTCGAACTCGAAACGTCTGGTCGAGGTCGGCAGCCGCGCCGGTTGCGATTACTCGCAGCTTGTCCAGCGCGCGGATGAGATTGATTGGCGGGCCTTGCAGGGCATCCGCACATTGGGTGTAACCGCCGGCGCCTCGGCCCCCGAAATTTTGATCGAAGAAGTGATCGATGCGTTTCGCGCCCATTATGACGTAACGGTTGAACTGGTCGTGACCGCCGAAGAACGGGTAGAGTTCAAAGTTCCCAAAGTCCTGCGCGAGCCTGCCTGATATGCCTGAATTCATCTGCTTTACCGACGGCGCCTGTTCGGGCAACCCGGGGCCCGGCGGTTGGGGCGTTTTGATGCAGGCGCGCGAGGGTGGGGCCGTGGTCAAAGAGCGACCGCTCTGCGGCGGCGAGGCGATGACCACCAATAACCGCATGGAATTATTGGCCGCGATCAATGCTTTGGAAAACTTTACGCGTTCCAGCACCATCACCATCGTGACCGACAGCGTCTATGTGAAAGACGGCATTGGCGCGTGGCTGTTCAACTGGAAGCGCAACGGCTGGCGTACCTCGCAGGGCAAGCCGGTCAAGAATGATGATCTGTGGCGGCGTCTGGATGCCGAGGTGCAGCGCCATCAGGTGACGTGGAAATGGGTCAAGGGTCACGCGGGCCATCCCGAGAATGAACGCGCGGACGAACTCGCCCGCCAAGGCATGGCCCCGTTCAAGGCCGCCCGCGCGCTTTAACGCTGGAAAGCGCAGGGCTTGCCTGCTAGCCAGTCGGGCGATGCATCTAAAGGCCCGTCCATGTCCAATTATATCCTGACCGTTACCTGCGCTACGACCCGTGGCATTGTTGCTGCTGTCTCGGGGTTCTTGGCGGAAAACGGTTGTAATATCACCGATTCCGCGCAGTTCGACGATGTGCTGACGGGCAAGTTTTTCATGCGTATCAGCGTCACCAGCCAAGAGGGCGCGACGCTTGCCGATCTGCAAAGCCGCTTTGCAACTGTGGGTGCGCGCTTTGGCATGGAATTTGCCTTTTTTGACGCCAGCGAACGGGTCAAGGCGGTGATCATGGTCAGCCGTTTTGGCCATTGTCTGAATGATCTGCTGTATCGCCAGCGCATCGGCGCGCTGCCCATTGATATCGTGGGGGTGATCTCGAACCACTTCGAATATCAAAAGCTGGTGGTGAACCACGATATCCCCTTCCACCACATCCGCGTCACGCCCCAGAACAAGCCCGAGGCCGAGGCCGCCCAGATGCAGATCCTGCGCGAGACCGGTGCCGAGCTGGTGGTGCTGGCCCGTTATATGCAGATCCTGTCGGACGAGATGTGCCGCGAGATGTCGGGGCGGATCATTAATATCCACCACTCGTTCCTGCCCAGTTTCAAAGGGGCAAACCCCTATAAACAGGCGTACGAGCGGGGCGTAAAGTTAATTGGCGCGACGTCACATTATGTAACGGCAGATCTGGACGAAGGCCCGAT
CATCGAACAAGATACGGTCCGCGTGACTCATGCGCAGTCGCCCGAGGATTACGTCAGCCTTGGTCGCGATGTTGAAAGTCAGGTTCTGGCGCGTGCGATCCACGCGCATATTCACCGCCGTGTCTTTATCAACGGCAATAAAACCGTCGTCTTCCCGGCCTCGCCCGGATCTTATGCATCGGAACGGATGGGATGAAATATCTATTAGCCGCATTATTGCTGGTGCCTTTGCCCGCCTTTGCGCAGGGCGAGGATAAGGGCGATTGCCCTGATGCCTCCTCGACATCGGAAATCGTTGTTTGCCTGAACGAGCTGTATTCAACGGCGCAGCTGGAAATGCAGGTGCGGCTGGACGGGTTGGTCGCGGGCATGGCATCCAGCAATCGCGTTGCGGCGCTGAATGCGGCGCAAGCTGTCTGGAAAGCCTTTCGTCAGTTGGAATGCGAATCGCAGGCGCTGATCGCCGAAGGTGGCACCCTTGCCAATGTGCTGGGGGCCAGTTGCTATCTGCATATGACGCGCGACCGGATTGTGGCGTTGAACGCCTACGATCAGACCAATTAAAGCCGCGCGGGCAGTGTCAGGCCCAGCAGCGCGTCGGCCACCGGCATGGCGCGTTTGCCCTCGCGCTGGATGTCCAGCACGCGCAGGGCGCCGCTGCCGCAGGCCACGGTAAAGCCGTCTAGCACGGTGCCAGCCGCTGCCGTGCCGGGCACCGCCTCGGCGCGCAGCAGTTTCACCCGCTCATCCCCGATCATGCACCACGCACCCGGAAAAGGGGACAGCCCGTTGATCTGGCGGGCAACCGTGGGGGCGGGGCGCGTCCAGTCGACCAGCGCTTCGGCCTTGTCGATCTTGGCGGCATAGGTCACGCCATCCTCGGGCTGTACCTGCGGGATCAGGCCGCCCAGCCGTTCCAGCGTATCGATAATCATCCTTGCGCCCATCTGGGAAAGGCGCTGGTGCAGATCGCCGCTGGTATCCGTCGCACCGATCGGGGTTGCCGCGCGCAGCAGCACCGGGCCGGTATCAAGGCCCGCCTCCATCTGCATGATGCAGACGCCGGTTTCCGCATCGCCCGCCATGATTGCGCGCTGGATCGGGGCGGCACCGCGCCAACGCGGCAAAAGGCTCGCGTGGATGTTCAGGCAGCCATGTTTGGGCGCATCCAGCACCACCTGCGGCAAGATCAGGCCATAGGCGACCACGACCGCGACATCGGCATTCAGCGCCGCAAATTCGGCTTGCTCATCCGCGCCGCGCAGCGATTTGGGGTGGCGCACCATCAGCCCCAGGCTTTCGGCGCGGGCATGGACCGGCGTGGGGCGGTCTTTTTTGCCGCGACCGGCAGGGCGGGGCGGCTGGCAATAGACGCAGGCGATCTCGTGCCCGGCCGCGACCAGCGCCTCCAAAACCCCAACGGAAAAATCAGGGGTGCCCATAAAGACGACGCGCATCAGCCGAACTTCCTTGCCTTGCGCAAAAACATGTCGCGCTTGATTTTGCTAAGGCGGTCAAAATACATCTTGCCCGCCAGATGGTCGATCTGATGCTGCATCGAGGTGGCCCAAAGCCCGACCAGATCGCGTTCCTCGACCTCGCCCCATGCGTTCAGGAACCGCACCGTGACGGCACGGGGTCGGCTGATGACAGCGCTGATCCCCGGCAGATTTGGCGACGCTTCCTCGTGCTCGCGCAGTTGCACCGAGGCGTGCAAAATCTCGGGGTTGGCCATGCGGATCGCCTGACCGCGCGCATCCGATGCATCGACCACGGCCAGCGCCAGCGGCACGCCCAGTTGCACCGCGGCAAGGCCGACGCCCGGCATCGCATCCATCGCCTCGACCATCTCGTCCCATAGGGCGGTGATCTCGGGGGTGATCGCCTCGACCTGGGCGGCGGGTTTGCGCAGCACGGGGGCGGGCCACATCACAAAGGGACGGTGCATCTATTCGCCCCGCGCGCGTTCACGCTTTAGCTTTTCCATCTTGCGGGTGATC
AACTGCCGCTTCATCGGACCCAGATAGTCGATGAACAGCTTGCCGTCCAAATGGTCGATCTCGTGCTGGACGCAGGTGGCCCATAGGCCCTCCATCTCGCGGTCCTGTTCATTGCCGTTCAGATCCAGCCAGCGCACTTTGACCGAGGCGGGACGCTCGACCTCGGCATATTGGTCGGGGATCGACAGACAGCCTTCCTCATAGACCGAGCGGTCGTCCGAAGACCAGACGATCTGCGGGTTCACCATGACCAAGGGCTGCGGCGCGTCCGGGTCTTTGGCACAATCCAGCACGATGATGCGCTGCAATTGACCGACCTGCGGCCCCGCAAGGCCAATGCCGGGCGCGTCATACATGGTTTCCAGCATGTCATCGGCCAGCCGACGGATCTCGTCCGAGATGTCGGGCAGCGGCTTTGCGATGGCGCGCAGGCGCGGATCGGGGTGGATAAGAATAGGGCGCGTTGTCATGGCACCGCATGTAAGCCAGCGCGCCCCGGCTTTCAACAC
Protein sequences of DBSCAN-SWA_1 >NZ_CP016592|566367:574800|574239_574800_-|WP_014537459.1|DBSCAN-SWA MLKAGARWLTCGAMTTRPILIHPDPRLRAIAKPLPDISDEIRRLADDMLETMYDAPGIGLAGPQVGQLQRIIVLDCAKDPDAPQPLVMVNPQIVWSSDDRSVYEEGCLSIPDQYAEVERPASVKVRWLDLNGNEQDREMEGLWATCVQHEIDHLDGKLFIDYLGPMKRQLITRKMEKLKRERARGE >NZ_CP016592|566367:574800|566367_568674_-|WP_013383420.1|DBSCAN-SWA MEGSDKASAAANDQGPPDVTRSDVALSTGVDNNAPTLPAGAAQVVPEPLITQDDLIARVRSYNPATNESLLRDAWEFGRLMHDGQYRHSGEPYFTHPVAVAMILAQQQMDDATIITALLHDTVEDTRATYAEVEARFGAVIADLVDGVTKLTNLQLHSAETKQAENFRKLIMATSRDLRVTLVKLADRLHNMRTIASMRPEKQVKKARETMDIYAPLAGRMGMQWMREELEDMAFRVLNPEARDSIIQRFSLIQQESGDLVDQIKTDLLAEFEKSGIPAEVHGRAKKPYSIWRKMQEKDQSFSQLSDIYGFRVITKTDVDCYRALGAIHQRWRSVPGRFKDYISQPKSNGYRSIHTAVSARGGKRVEVQIRTREMHEVAEAGVAAHWSYRDGEPVQNRFVVDPARWIAQLSERFEEDQDHDEFLETFKLEMYQDQVFCFTPKGEVIKLPQGATPIDFAYAIHTRIGHACVGAKVDGLRVPLWTRLKNGQSVEIIIADGQTPQATWIDIAVTGRAKSAIRRWLREKDRDRFIKLGTELTRVAFENAGKKSTDKALATAARALAFENAEQLLLRVGAAEITAREVVRAIYPDLKLRDANEIDAEKAVVGLSPDQSFRRAPCCQPVPGERIVGITFRGQGVIVHAIDCPALSEYEDQPDRWVDLRWQEGQHRAINTVSLELSVANDAGVLGRVCTLIGEQDANISDLDFLDRKLDFYSLRIDVDVRDAEHLHRVMTALDADSHVSSLLRLRDAHRNLSWSAALPAAGATGH >NZ_CP016592|566367:574800|571077_571542_+|WP_013383423.1|DBSCAN-SWA MPEFICFTDGACSGNPGPGGWGVLMQAREGGAVVKERPLCGGEAMTTNNRMELLAAINALENFTRSSTITIVTDSVYVKDGIGAWLFNWKRNGWRTSQGKPVKNDDLWRRLDAEVQRHQVTWKWVKGHAGHPENERADELARQGMAPFKAARAL >NZ_CP016592|566367:574800|572851_573748_-|WP_013383426.1|tRNA|DBSCAN-SWA MRVVFMGTPDFSVGVLEALVAAGHEIACVYCQPPRPAGRGKKDRPTPVHARAESLGLMVRHPKSLRGADEQAEFAALNADVAVVVAYGLILPQVVLDAPKHGCLNIHASLLPRWRGAAPIQRAIMAGDAETGVCIMQMEAGLDTGPVLLRAATPIGATDTSGDLHQRLSQMGARMIIDTLERLGGLIPQVQPEDGVTYAAKIDKAEALVDWTRPAPTVARQINGLSPFPGAWCMIGDERVKLLRAEAVPGTAAAGTVLDGFTVACGSGALRVLDIQREGKRAMPVADALLGLTLPARL >NZ_CP016592|566367:574800|570125_571076_+|WP_014537458.1|DBSCAN-SWA 
MSLPPLTVYLAAPRGFCAGVDRAIRIVEMALEKWGAPVFVRHEIVHNKYVVDALRAKGAVFVEELDECPEDRPVIFSAHGVPKSVPAEAVRRNMIHVDATCPLVTKVHNEAARHHTNGLQMIMVGHKGHPEVIGTMGQLPDGEVMLVETLADVATVQVRDPARLAMITQTTLSVDDTAEIAAALKARFPAINVPAKEDICYATTNRQEAVKVMAPKCDAILVVGAPNSSNSKRLVEVGSRAGCDYSQLVQRADEIDWRALQGIRTLGVTAGASAPEILIEEVIDAFRAHYDVTVELVVTAEERVEFKVPKVLREPA >NZ_CP016592|566367:574800|569459_570050_+|WP_013383422.1|DBSCAN-SWA MFYRDERIALFIDGANLYAASKSLGFDIDYKLLRSEFMRRGRLIRAFYYTALLENEEYSPIRPLVDWLHYNGFSMRTKPAKEFQDAQGRRKIKGNMDIELTVDAMELAPHVDHIVLFSGDGDFRPLIEALQRRGVRVSVVSTVRSQPPMIADELRRQADNFIELDELRDVLGRPPRPDARPGMPRDETVETQSLLD >NZ_CP016592|566367:574800|572483_572855_+|WP_013383425.1|DBSCAN-SWA MKYLLAALLLVPLPAFAQGEDKGDCPDASSTSEIVVCLNELYSTAQLEMQVRLDGLVAGMASSNRVAALNAAQAVWKAFRQLECESQALIAEGGTLANVLGASCYLHMTRDRIVALNAYDQTN >NZ_CP016592|566367:574800|573747_574239_-|WP_013383427.1|DBSCAN-SWA MHRPFVMWPAPVLRKPAAQVEAITPEITALWDEMVEAMDAMPGVGLAAVQLGVPLALAVVDASDARGQAIRMANPEILHASVQLREHEEASPNLPGISAVISRPRAVTVRFLNAWGEVEERDLVGLWATSMQHQIDHLAGKMYFDRLSKIKRDMFLRKARKFG >NZ_CP016592|566367:574800|568684_569038_-|WP_013383421.1|DBSCAN-SWA MARVTVEDCVDKIPNRFELVMLAAHRAREIAAGSALTVPRDNDKNPVVSLREIAEETQLADDLRERMIEANQHEIEVDEPEDDSMALLMSAELDRPAQDDMTEERLLRELLQAQEGR >NZ_CP016592|566367:574800|571602_572487_+|WP_013383424.1|DBSCAN-SWA MSNYILTVTCATTRGIVAAVSGFLAENGCNITDSAQFDDVLTGKFFMRISVTSQEGATLADLQSRFATVGARFGMEFAFFDASERVKAVIMVSRFGHCLNDLLYRQRIGALPIDIVGVISNHFEYQKLVVNHDIPFHHIRVTPQNKPEAEAAQMQILRETGAELVVLARYMQILSDEMCREMSGRIINIHHSFLPSFKGANPYKQAYERGVKLIGATSHYVTADLDEGPIIEQDTVRVTHAQSPEDYVSLGRDVESQVLARAIHAHIHRRVFINGNKTVVFPASPGSYASERMG |
10 | Synechococcus_phage(50.0%) | tRNA | NA |
Homologous phage analysis in the prophage region. Colored bacterial proteins are those whose annotation contains a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_2 |
1085621 : 1098652
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP016592|1085621:1098652|DBSCAN-SWA GATGAAATCGGCTGCCGCCTGGCTCGCCTCAACCCCTGCTGCGACGCGGGCTGAGTTCTTGGGCGCGCTTGCCGATCACCAGATCGCGGCGCTGCCCTGGCTGTTCGAGTTTTGGGCGCTGCCGCATCAACTGCCGCCCTCGGGCGATTGGCGCACATGGGTCATCATGGGCGGGCGCGGCGCGGGCAAGACGCGGGCAGGGGCCGAATGGGTGCGCAGCATGGTCGAAGGGCCGCGCCCCGATACCCCCGGCCGCGCCAAACGGGTCGGCCTTATCGCCCAGACGATGGATCAGGCGCGCGAGGTGATGGTCTTTGGCGACAGCGGGCTGATGGCCTGCTGCCCGCCCGCGCGCCGCCCCGAATGGATCGCGGGCCGCGCCATGCTGCGCTGGCCGAACGGGGCCGAGGCGCGCCTGTTTTCCGCCCACGACCCCGAGGCTTTGCGCGGCCCCCAGTTCGACGCCATCTGGGCGGACGAGGTCGCCAAATGGCGCCTTGCGCAAGAGGCATGGGACATGCTGGTGATGGGCCTGCGGTTGGGGGATGACCCGCGCGCCTGCCTGACCACCACCCCGCGCGGCGGGCCGTTCCTGCGCAAGCTGCTGGCCCAAAGCGGCACCGTGATGACCCATGCCCCCACCCGCGCCAATCGCGCCAACCTCGCCCCCGGCTTTGTCGCCGCGGTCGAGGCGATGTTCGAGGGCTCGCATTTGGGTCGTCAGGAACTCGATGGCCTGCTGGTTGACGAGGCCGAGGGCACGCTGTGGCCGCAGCACCTGCTGGATGCGGCCTTGCAGCGGCAGGCGCCCCCGCTGGATCGCATCGTGGTCGCGGTCGATCCGCCCGTGACGGGTCATGCGGGTTCGGACGCCTGCGGCATCATTGTCGCGGGGGTCGAACAGCGCGGGGCCCCCACCGACTGGCGGCTGTGGGTGATCGAGGATGCAACCGTGCAGGGCGCGTCCCCCCATACCTGGGCCAGCGCCGCCATCGCGGCCTTTCACCGCCACGGCGCGGATCGCCTCGTGGCCGAGGTGAACCAAGGCGGCGCGCTGGTGGAAAGCGTGCTGCGCCAGCTTGACCCGCATATCCCCTATCGCGCCGTGCGGGCCAGCAAATCCAAGGGCGCGCGCGCTGAACCCGTCTCGACCATTTACGAGCGGGGCCGCGCCTGCCACCTGCCGGGGCTAGCGCTGCTTGAGGCGCAAATGTCCCTGATGACGCTGCAGGGCTTTACCGGCAAAGGCTCGCCCGACCGGGTCGACGCGCTGGTCTGGGCCGCACACGAGCTGATGCTGGGGGCAGGCGGCGCATCGCCCCAGATCCGCGCTATCGGCTAGCGCAACGGGCAACGCGCGGTGCCACCGCCGCGCGTTAACCCCCATCCCCGATAACCCCCCTTAAGCGCAAGACGGCAGCAGCGAGGGGCACATGTTCGGATTTGGCGAGAAAAAACAACCGGCGCCGCTGGTCGAGGTCAAAGCCTCGGCGGCGGGAAAGGTGGTGGGTTTCGGCACAGCCGGTCGCACCGGATTCCAGCCGCGCGAGGGATCATCGCTGGTGCGCGCGGGCTTTGCCGCCAACCCCATCGGCTTTCGTGCCGTGCGCCTGATCTCTGAGGCGGCCTCGGCCCTGCCGCTGATCTTGCAAGACGCGACGCAGCGCTATGACACCCACCCCGTGCTGGATCTGCTGGCCCGCCCCAATCCCGCCCAAGGCCAGCTTGAACTGTTCGAGGCGATCTACGCCCAGCTTTTGTTGACCGGAAATGCCTATGTCGAGGCCGTGTCGGATGGCGCGCTGCCCACGGAACTGCATGTCCTGCGCAGCGATCGTATGTCGGTCGTGCCGGGGCCCGATGGCTGGCCCACCGGTTATGACTATGCGGTGGCCGGGCGCAAACACCGCTTTGACGCCGCCG
CGATCTGCCATATCCGTGCCTTTCACCCCCATGACGACCATTACGGCCTGTCGCCCCTGACGCCCGCGGCGGCGGCGGTCGAGGTGCATAATGCCGCCTCGCGCTGGTCGCGGGGGCTTTTGGAGAACGCGGCGCGGCCTTCGGGCGCGATTGTCTTTCGCGGCGCCGATGGCAACGGCACCTTGTCCAACGGCCAATTCGACCGGCTGGTGGCCGAGATGGAAAGCCAGCATCAGGGCGCGAGGAATGCCGGACGCCCGATGCTGCTTGAGGGCGGCCTCGATTGGAAGCCGATGGGCTTTTCGCCCTCGGACATGGAATTCCTGCAAACCAAAGAGGCCGCCGCGCGCGAGATTGCCACCGCCTTTGGCGTGCCGCCCATGTTGCTGGGCATCCCCGGGGACGCGACCTATGCCAATTACCAAGAGGCGAACCGCGCCTTTTACCGCCTAACCGTGCTGCCTTTGGCGTCGCGGGTCACGGGTGCGCTGGTGAATTGGCTGGACGATTTTGCGGGCACCTGGCTGGATCTGCGGCCCGATCCCGACCAGATCGCCGCCTTGCAAACCGAACGCGACGCCCTGTGGGCACGCGTGGGCGCCGCAAGCTTCCTCTCGACCGCCGAAAAACGCGCCTTGCTCGGCCTTCCCGGAGAGCCAGATGGAGCCGCGTGAGCGCCCCTTCCTCTGCGCCCCCGGCCTGAAGATCGAGGCGCAGGAGCGGCTGGTCGCGCTGCAATTCCAGCAATTGCAGCAACAGCTTGCCCGGGTAGAGGCGCTGATCGAGCGGTTAGAAAAACGTCTGTGGCTGACGGTTTACGGCGTTTTGGGCGCGATTCTGGCGCAGGCGTTTCAATCATTCCTGTCGGTTGCTCCCGGTTAAATGGAGGGGATTTTGGATCTCGAATATAAATACGCAACGCTATCCGCGCCCGATCCTGCGGGCGTGAGCGTCGCAGGCTATGCCTCGGTCTTTGGGCTGCGCGATCAAGGCGGCGATATCGTGCAAAAGGGGGCCTTTGCCGCTTCGCTCGCGCGTCTTGCCGCTGCTGGCACCAAGGTGCGGATGCTGTGGCAGCACGACCCCAGCCTTCCTATCGGCGTCTGGGACGAGGTCACCGAGGATGCCACCGGCCTGCGCGTCAGCGGCCGCCTGCTGCCCGAGGTCGCCAAGGCCGCCGAAGTTTCCGCGCTGCTGGCGGCCGGTGCGATCGACGGCCTTTCCATCGGCTATCGCACCCTGCGCTCTACCAAATCGGACACAGGCACCCGCCTCTTGCACGAGGTCGAGCTGTGGGAGGTGTCGCTGGTGACATTCCCCATGCTGCCCAACGCCCGCGTACACACCAAAACCGACGCCGCGCTGATCGCTGCTTTGCGCAGCGCCCGCGCCACCATCCGCAACCTCTAGGAGCCGCCATGGATACGCCCTCCGTAACCGACGAGATGAACGGTTTCATCTCTGATTTCAATGTCTTTGCAGGCGAAGTGAAACAACGTCTTGAACAGCAGGAGACCCGCATGACCCGTCTTGACCGCAAATCCGCCTACCGTCCCGCCTTGTCCGCTGCCGTCGACACCGACGCGCCGCATCAAAAGGCCTTTGACGCCTATTTGCGCTCGGGCGACGATGACGGCCTGCGCCATATCGAGATCGAGGGCAAGGCGATGTCCACCGCCGTCGCCGCCGATGGCGGCTATCTGGTCTCGCCGCAAACCGCGCAAACGATCCAGTCGGTGCTCAATGCCACCGCCTCGATCCGCGCGATCTCGAGCGTCGTGAATGTCGATGCCAGCAGCTATGACGTGCTGGTCGACCGGACCGAGCCGGGCGCGGGCTGGGCGAGCGAGACCGGCACCGTTGCAGAAAGCACGACCCCGGTCATCGACCGCATTTCGATCCGCCTGCATGAACTCTCGGCGCTGCCCAAAGTCTCGCAGCGCCTGCTCGATGACAGCGCCTTTGATCTTGAAGACTGGCTCGCCACCCGCATCGCGCAG
CGCTTTGCCCGCGCCGAGGCGGCCGCTTTCGTCAATGGCGATGGCGTCGATAAGCCGAACGGCTTCCTGACGGTGACGAAAGTCGCGAATGCCAGCTTTAGCTGGGGCAACCTCGGCTATGTCGCAACCGGCTCGACCGCAGCGCTGCCTGCGGATTCGATCGTCGATCTGGTCTATGCGCTGGGCGCCGAATACCGCGCCGGCGCCAGCTTTGTGATGAACTCCAAAACCACCGGCGTTTTGCGCAAGCTGAAAGACAGCGACGGCCGCTTTTTGTGGTCTGACGGCCTTGCCGCGGGCGAGCCTGCGCGCCTGATGGGCTATCCCGTTCTGATTGCCGAGGACATGCCCGATATCGCCGCCAATGCCTTCCCCGTCGCCTTTGGCAATTTCACCGCTGGCTACACGATTGCCGAACGCGCCGACCTGCGCGTCTTGCGCGACCCGTTCTCGGCCAAGCCGCATGTGCTGTTCTATGCGACCAAACGCGTCGGCGGTGCGGTCACCGATTATGCCGCGATCAAGCTGCTGCGCTACGCGACCGCGTAAATCCAACGGGCGGGGCCATCAGCCCCGCCATCCCCCCAATGATCACACGGAAGGGCTGGGCGCGATGATGCTGGTCGAAGAAACAACGGTGGATGATGCCGCCTTGCCGGTCGCGGCGCTGGGTGCCTTCCTGCGCCTCGGCTCTGGCTTTGGCACGGATGGGTTGCAAGATGACCTGCTGCGCGCCTTCCTGCGCGCCGCCCTTGCCGCGATCGAGGGGCGTATCAACAAAATCCTGATCGCTCGCAGCTTTGCGCAGCAGATGACATCCGGTCAGGCGATGGCGGTCGGCCCTTTGCGCGCGGTGCTGTCGGTGACGGTGGATGGCACGGCGCAGCCCTATGCGCTGGCGGGCACAACAATCACCGCGCCGACAACGGCCAAGCTGACCGTCCGCTATGACGCCGGGCTGGCGGATGATTTCGCCGCGCTGCCCGCCGATTTGCAACAGGCGGTGCTGATGCTGGCGGCGCATTATTACGAATACCGCCAGGACCCGGCGCTGGACGGCGCCTGTATGCCCTTTGGCGTCTCGGCCCTGACCGAACGCTATCGCACGCTGCGTCTGTCGATGGGGGCCCGCACATGAGGCCGCCGCGCATGAACCGCGCCCTAATCCTGCAGGCCCCGACCCGCACCCCCGATGGCGCGGGCGGCTATACGCAAGGCTGGCAGACCCTCGGCACGCTTTGGGCTGCGGTGACACCTGCCACGGGGCGCGAGGCGGCGGCGCTGGGCGCGGCACTGGCGCGCGTGCCGGTGCGCATCACCCTGCGCGCCGCGCCCGCCGGCGATCCCCGCCGCCCCATCGCGGGCCAGCGGCTCACCGAAGGGCCGCGCAGCTTTCTGATCCTAGCGGTGCAAGAGACCAGCGCCCGCCTGCTGACCTGCATTGCCGAGGAGGAGCTGGTGCGATGACCCAATCCCTTGCCCTGCAACAGGCGCTGTATACACGGCTGACGGCCGCGCTCGATGGCGTGGACATCTATGACGCGCTGCCCAGCGGCCCCGTGCCCGCGCTTTACGTCGCCCTCGGCCCCGAGGAGGTCGAAGACCTTTCCACGCATGAGGGCGCGCTGACCATCCACGAGGTGAAAATCTCGGTCATCGCGACGGGTGGCGGCTTTGGCAGCGCCAAGACCATTACCACCGCCATCACCGAGGCGCTGGCCGCGCCGCTCACCCTGCCGTCCTTTACCGCCAGCCCCGCGCAATTCCTGCGCGCCTCGGCCAAGGGCACATCCGCCTCGGGGGCCGAGCGCCGCATCGACCTGTTCTTCCGCATCCGTATCGAACCCTAAAGGAGCCTCATATGAGCGCGCAAAACGGCAAGGATCTCCTCATCAAGATCGACATGACGGGCGATGGCCTGTTCGAGACCGTCGCGGGGCTGCGCGCCTCGCGGATCAGTTTCAACGCCGAAACGGTGGATGTCACCACGATGGAAAGCC
AGGGCGGCTGGCGCGAATTGCTGGCCGGGGCGGGGATGCGTTCGGCCTCCGTGTCCGGCGCGGGCGTGTTCCGCGACCAATCCACGGACGAGCGGATGCGCGCCCTGTTCTTTTCCGGCGAGGTGCCCGCCTTTCGCATCATCATCCCGCATTTCGGCGCGATCGAGGGCCGCTTTCAGATCACCGCGCTGGAATATGCCGGCACCTATAATGGCGAGGCCACCTATGACGTGACCCTCGCCTCGGCAGGCGCACTGACATTCGAGGCCGAGGTATGAGCGCCAATCCTCATGCGGGCGAGGTCGAAATCCCGCTCGACGGCGTCATCCACATCGGCCGTCTGACCCTTGGCGCACTGGCGCGGCTGGAGGCGGATCTGCAATCGGGCAATCTGCCCGATCTGGTCGCGCGGTTTGAAAGTGGCGATATCCGCACGGCGGATGTGCTGGCGCTGATCGTTGCGGGCCTGCGCGGCGGCGGCTGGCAGGGCACGGCGGCGGATCTCGAGCAGATCGACGTGGGCGGCGGGCCGCTGGCAGCGGCGCGTATTGCAGGCCAGCTTTTGGCCCGCGCCTTTGCAAGCGCGGGATAAGCCCATGGATTTCCCCGGCCTTTTGCGCCTTGGCCTGCAGCATCTGCGCCTGAAACCCGCCGAATTCTGGGCGCTGACGCCGATTGAACTGATGCTGATGCTGGGCCTTGCAGCAGGCAGCCAACCCATGGCGCGCGCGCGCCTTGATGCGCTCGTCCGTGCCTATCCCGATCACGCCCCACTCCAGGAGGCCACCGATGGCTGACACCACCACCGCCGAGCTTCAGCAAACCCAATCCGTCACCGCCGCCTTCAACGCGGGCCTGCGCGAGATGCGCGGCACGCTATCGGCGACCTCGCGCGATGTGGCGGGGCTGGAACGCGGCCTCTCGCAGGGGCTGCGCCGCGCCTTTGACGGGCTGGTGTTCGACGGCGACCGCCTCAGCACGGCGGTCAGCAGCATCGCGCAAAGCGTTCAAAACGCCGCCTATAACGCCGCCATGCGGCCCATCACCGACAAGATCGGCGGCTGGTTGGCCAGTGGCATCGAAAGCCTGATGCCTTTCGCTGAGGGTGGCACCTTCACCCAAGGCCGCGTCATGCCCTTTGCCAAAGGCGGTGTTGTCACCGCCCCCACCACCTTTCCGATGCGCGGCGGCACCGGGCTGATGGGCGAGGCGGGGCCCGAGGCGATCATGCCGCTGACGCGCGGCGCCGATGGCCGTTTGGGCGTCGCGGCGCAAGGCGGCGGCGGCGTCAATCTGGTGATGAACATCCAGACGCCCGATGCCGCCAGTTTCCACCGCTCGCAAAGCCAGATCGGCGCGCAGGTCTCGCGCCTTGTGGCGCGCGGCAACCGCAACCGCTGACAGGGGGGCATCATGGCCTTTCACGACATCCGCTTTCCCGCCGCCATCAGTTTTGACTCGCTCGGCGGCCCGACGCGGCGCACGGAAATCGTCACGCTGACGAGCGGCTATGAACAGCGCAACACCGCTTGGGCCCATTCCCGCCGCCGCTATGACGCAGGCGTCGGCCTGCGCTCGTTGGATGATGTCGCGCAGCTCATCGCCTTTTTCGAGGCGCGCGGCGGGCAATTGCATGCCTTTCGCTGGAAAGACTGGTCGGATTACAAATCCTGCGCGCCGTCCGCCGCGATTTCCGAAATGGATCAGACGCTTGGATATGGCGATGGCGCAACCGCCGACTGGCCGCTGGTGAAAAACTATGTCTCGGGCGAGGGCGCTTATGCCCGCCCGATCACCAAACCTGTCGCCAATACCGTCCAGATCGCCGTCGCCGGTCAAAAGCTGGACGAGGGGACGGATTACACGCTGAACCTTGGCCTTGGCCGCGTGATCTTTGCCAGCCCGCCTGCGCCGGGGGCCGAAATCAGCGCGGGCTTTGAATTCGACGTCCCCGTGCGGTTTGAAACCGACACGATCCAGATCTCGGTCTCGTCCTTTCG
GGCGGGCCAAATCCCCTCCGTCCCCCTGATCGAGGTGCGCCCATGAGCGACCATTCCACCACCCGCTGCACCGCCTGGGCCATCACCCGCACGGACGGGCTGCAACTGGGCTTTACCGATCACGATGGCGATCTGACCTTTGCGGGCCTGACGTTCCGCGCGGGCGCGGGCATGAGCGGCGCGGCACTGGTGCAAGGCGCAGGCCTTGCTGTCGACAATACCGAAGGCTTTGGCATGATCACCGATGATGCCGTGGGCGAGGGCGACCTGCGGGCAGGCCGTTTCGACGGGGCCGATATCCGCATCTATCAGGTCAACTGGCGCGCCCCTGCCGACCGCAGCCTGATCTTTCACGGCACTTTGGGCGAGATCACCCTCGAGGATGGCGCATGGCGGGCCGAGCTGCGCGGCGCGGCCGAGGCGCTGTCCCGTCCCATCGGGCGCAGCTATCAGCGCGGCTGCGCGGCGGTGCTGGGCGATGCCGCCTGCGGCTTTGACCTCGATACCCCCGGCTTTGCGATGGATGCCGCGCTGATCGCGGTGGATGACACCACGCTGACCATCGCCGCGCCGGACCTTGATCCGCGCTGGTTCGAACGCGGCGTTGTGAAAATCACCACGGGGGCTGCGGCTGGCCTCTCGGGCATGATCAAATCCGACGCCAGCTTGGGCGCACAGCGGCTGATCTCGCTCTGGTCGCCCTTGGGGGTGCAACCGCAGGCAGGCGATCAGATCCGCCTGCTGCCCGGCTGCGACAAACGCCTCGCCACCTGCCGCGCGAAATTCGGCAATCTGCACAACTTTCGCGGCTTCCCCCATATCCCGGGCGAGGATTGGCTGATCGCCGCCCCTAAAACAAACGGCAGCGGCGAGAGCCTGTTCCGATGACACCTGACACCCTTGCCCGCGCATGGATCGGCACGCCTTTCGTGCACGGCGCTAGCCTGCAAGGGGTCGGGGCCGATTGCCTTGGCCTCATCACCGGCCTTTGGCGGCAGATTTACGGCCCCGCGCCGTGGCCGCTGGACTACAGCCCCGACTGGTCCGTAACGCTGGGGCCAGATGCGATGGCGCGCGCCGCCGACCGCTATCTGCCGCGCGCAGCGCAGCTTTATCCCGGCGCGCTGATGCTGTTGCGCCTGCGCCCGCATTTGCCGCCCGCGCATCTGGCCATCTGCGCGGGGCCGACGTTCATCCATGCCTTCCACGCGGGCGGCGTCGTTGAAAGCCCCCTCAGCCTGCCGTGGCGCCGCCGCATCGCAGGCCTTTACCATTTCGCCCCTAAACAGGAGTCCTAACATGGCAACGCTGGTTCTTTCTGCCGTTGGCGCCTCGGTCGGTGCCTCGATCGGGGGCGGGATTTTGGGCCTCTCCTCGGCCGTCATCGGCCGCGCTGTCGGCGCTGTTGCAGGCAGCCTGATCGACCAGCGCATCTTGGGCGGCGGCGCGCAGCCCGTTGAAACCGGCCGCATCGACCGCTTTCGCGTCACAGGTGCGTCCGAAGGCGCCGCGATGGCGCGCCTTTATGGCCGGATGCGTGTCGGCGGGCAGGTGATCTGGGCGACCAAATTCATGGAAACCAGCACCCAGACCCGCGCTGGCAAAGGTCAGCCAAAGACCACCACTTTCAGCTATACCACCTCGCTGGCCATTGCGCTGTGCGAGGGGCCGATCAATGGCATCGGTCGCATCTGGGCCGACGGGACCGAGATTGCGCCCACCGACCTAAGCCTGCGCCTTTACCACGGCCATATGGATCAACTGCCCGACCCCCGTATCAGCGCGGTCGAGGGGGCAGACAACACCCCCGCCTATCGCGGCACCGCCTATGTGGTGATCGAGGACCTCGACCTTGGCCCCTATGGCAATCGCGTGCCGCAGTTTTCATTCGAGGTGATCCGCAACGATCCCGCCCGCGATGATACATTCGCAGGCGCGGTGCAGGCTGTCGCCATGATCCCCGGCACCGGCGAATATGCCCTGTCGGATACACCCGTCGC
CCTGCGCTATTCCTATGCCGATGAAGGCACACAGAACGAAAACACCCCCAGCGGCCAAAGCGACTTTCTGACGGCGCTTGACCAGTTGAATACCGAACTGCCGCGCGTGACATCCGTCTCGCTGGTGGTCTCTTGGTTCGGCGATGACCTGCGCGCGGGCCAGTGCAAGGTGCAGCCAAAGGTCGAACAGACCGCCTTTGATGCCCCCGACCAGCCGTGGCGGGCAGGCGGCATCACGCGCAGCGCCGCGGCAACCGTGCCCCGCGTCGGCGGCTCGCCCATCTATGGCGGCACGCCGTCCGATGCCGCAGTCATCAGCGCCATCCGCACCATCCGCGCGCGCGGGCAGGAAGTGATGTTCTATCCCTTCATCCTGATGGATCAACTGGCGGATAACACCCTGCCGAACCCCTGGACGGGGCAGGCTGGTCAGCCGCCCTTGCCATGGCGCGGCCGCATCACGACCAGCCTTGCCCCGGGTCAACCCGGCACAACCGATGGCACAGCCGCCGCACGGGCCGAGGTTGCGGCCTTTTTCGGCACCGCCACCCCCGCGCATTTCACCCGCACCGGCGAGCGGGTGCATTATACCGGCCCCAATGAGTGGTCGCTGCGCCGCTTCATCCTGCATTACGCGCATCTATGCGCGGCGGCGGGCGGCGTCGACAGTTTCTGCATCAGCTCGGAAATGGTGGCGCTGACGCAAGTGCGCGACGATATCGGCTTTCCCGCCGTCAGCGCACTCATGGCGCTGGCCGCCGATGTGCGCAGCATCCTCGGCCCCGATACCCTGATCACCTATGCCGCGGATTGGAGCGAATATCACGGCTACCAACCGCTTGGGACGGGCGACAAGCTGTTCCACCTCGACCCGCTTTGGGCGCATGAGGATATCGACTTCATCGGTATCGACAATTACATGCCGCTGTCGGATTGGCGCGACGGCGATAGCCACTTGGACGCGCAGGCGGGCGCGATCTATAACCTCGATTACCTGACCGCCAATGTCGCAGGCGGCGAGATGTACGATTGGTTCTACCACTCGCCCGAGGCACGAGATGCGCAAATTCGCACTGCAATCACAGATGGTTACGATCAGCCTTGGATGTGGCGCGTGAAGGATATCTTAGGGTGGTGGAGCCATGCGCATTTCGACCGCGTGGACGGCGCGCAGGGCCCGCAAAGCCCTTGGCTGCCGCGTTCCAAACCGATCCGCTTTACCGAAATCGGCTGCGCCGCCATCGACAAAGGCACCAACCAGCCGAACAAATTCCTCGATCCGAAATCCTCGGAATCGGCGCTGCCGTACTATTCCAACGGCCTACGCGACGACTTTATCCAGCTTCAATATCTGCGCGCCCTAAACCGCCATTTCGCCGATCCCTCGCAGAACCCGACCTCTGAAATCTACGACGGCCCCATGGTCGAAATGGACTACGCGCATGTCTGGGCGTGGGACGCGCGGCCCTATCCGTGGCTCCCCGCGCGCGGCGATCTGTGGTCGGATGGCGCGAATTACGACCGTGGCCATTGGCTGAACGGCCGCGCCGGCGGGCAGGCTTTGGCCGCCGTCGCGGATCAGATTTGCACGGATGCGGGTCTGTCCGCGAACACCGATGCGCTTTGGGGCATGGTGCAGGGCTATGCGATGGACCGGATCGAGACGGGGCGCGCCGCGCTGCAACCGTTGATGCTGGCGCATGGGTTCGATGCGGTCGACCGCGACGGTGCGTTGCACCTGATCACCCGCCACGGCCGCCCCATCGCCACGCGCGAGATGGATGATCTGGTGGCCCACGACGCCCCCGCGCTGGTGCGGACCCGTCTGCCCGAGGCCGAGCTCGCCGGTCAGGTCCGCGTGGCTTTTGTCGCAGCGGGCGGCGATTTCAGCATCGGCGGGGCCGAGGCGACGCTTGCCGATACGCCGCGCGATACGGTCTCGACCTCGGACCTGCCGCTGCTGATGTCGCGCGCCGACGCCACCCGCGCCGCCGAGCGCT
GGCTGCTGGAATCCCGTCTCGCGCGCGAGGTGGCGACATTCACGCTGCCGCCCTCGGACGCATGGCTGCGCGTGGGTGATGTGCTGACGCTTGCGGGCGATGACTACCGCATCGACCAGCTAGAGCGGGCCGAGGCGCTGGGCATCACCGCCACCCGCACCAGCCGCAGCCTGTTCCTGCCCCATGATGCGGTGGAGGATATCCCCCAGCCCGCCGCCTTTGCGCCGCCGATGCCGGTCGCGGCAACCTTCCTTGATCTGCCGTCCGAGACCGGCCCCAGTTTCGCTGTCGCCCTCACTTCGGCCACATGGCCGGGCGAGGTCGCGATCCACGCCGGGCCGCCGTTGGTGGAACTCGCCCGCAGCGCCGCCCCGGCGGTGGTGGGCGAGACGCTGAACGATCTATCCGCCGCCCGCGCAGGGATCTGGGATCGCGGCCCCGCGCTGCGGGTGCGGCTGGTGTCCGGCACGCTCGCCTCGCACCTGCCCGAGGCATTGCTATCGGGCGCGAACCTTGCCGCTATCGGCGATGGCACCAGTGATATCTGGGAGGTCTTTCAATTTGCCGAGGCCGCGCTGGTGGCCCCGAACGAATACGCCCTCAGCCTGCGCCTGCGCGGTCAGGGCGGCAGTGATGGGGTGATGCCGCCCGTCTGGCCCGCAGGGTCGCGGTTCGTGCTGTTGGACAATCGCCTGACGCCGCTTGATGTGCCGCGCGGTGTGTCGCGCGACTGGCATTGGGGGCCGGTGCAACGCCCGATGAGCGACCGCACTTGGCGGCAGGCCAACCGCGCCTTTACCGGCGTGGGCTTGCGTCCCTATGCGCCCTGCCATCTGCGCGTGAGTGATACCGCCGTGACATGGCAACGCCGCACCCGCAGCGGCGGCGACAGTTGGGACGGCATCGACGTGCCGCTGGGTGAGGAGCGCGAGCTGTACCGCCTGCGCATGTATCAATCCGGCGCGCTGCTGCGCGAGGTGATGCTGGACACGCCCGCCTTCGCCTATCCCGCCGCCATGCGCGCGGCAGATGGGGCGGGTGTGACGGTCGAAGTGGCGCAGATGTCCCAAGTCTTCGGTGCGGGGCCCGCGCTGGTTGGGGCGATCTGA
Protein sequences of DBSCAN-SWA_2 >NZ_CP016592|1085621:1098652|1091006_1091393_+|WP_014537611.1|DBSCAN-SWA MTQSLALQQALYTRLTAALDGVDIYDALPSGPVPALYVALGPEEVEDLSTHEGALTIHEVKISVIATGGGFGSAKTITTAITEALAAPLTLPSFTASPAQFLRASAKGTSASGAERRIDLFFRIRIEP >NZ_CP016592|1085621:1098652|1088961_1090092_+|WP_013383948.1|capsid|DBSCAN-SWA MDTPSVTDEMNGFISDFNVFAGEVKQRLEQQETRMTRLDRKSAYRPALSAAVDTDAPHQKAFDAYLRSGDDDGLRHIEIEGKAMSTAVAADGGYLVSPQTAQTIQSVLNATASIRAISSVVNVDASSYDVLVDRTEPGAGWASETGTVAESTTPVIDRISIRLHELSALPKVSQRLLDDSAFDLEDWLATRIAQRFARAEAAAFVNGDGVDKPNGFLTVTKVANASFSWGNLGYVATGSTAALPADSIVDLVYALGAEYRAGASFVMNSKTTGVLRKLKDSDGRFLWSDGLAAGEPARLMGYPVLIAEDMPDIAANAFPVAFGNFTAGYTIAERADLRVLRDPFSAKPHVLFYATKRVGGAVTDYAAIKLLRYATA >NZ_CP016592|1085621:1098652|1088425_1088953_+|WP_013383947.1|head,protease|DBSCAN-SWA MEGILDLEYKYATLSAPDPAGVSVAGYASVFGLRDQGGDIVQKGAFAASLARLAAAGTKVRMLWQHDPSLPIGVWDEVTEDATGLRVSGRLLPEVAKAAEVSALLAAGAIDGLSIGYRTLRSTKSDTGTRLLHEVELWEVSLVTFPMLPNARVHTKTDAALIAALRSARATIRNL >NZ_CP016592|1085621:1098652|1091817_1092135_+|WP_013383954.1|DBSCAN-SWA MSANPHAGEVEIPLDGVIHIGRLTLGALARLEADLQSGNLPDLVARFESGDIRTADVLALIVAGLRGGGWQGTAADLEQIDVGGGPLAAARIAGQLLARAFASAG >NZ_CP016592|1085621:1098652|1088203_1088425_+|WP_013383946.1|DBSCAN-SWA MEPRERPFLCAPGLKIEAQERLVALQFQQLQQQLARVEALIERLEKRLWLTVYGVLGAILAQAFQSFLSVAPG >NZ_CP016592|1085621:1098652|1092139_1092340_+|WP_013383955.1|tail|DBSCAN-SWA MDFPGLLRLGLQHLRLKPAEFWALTPIELMLMLGLAAGSQPMARARLDALVRAYPDHAPLQEATDG >NZ_CP016592|1085621:1098652|1090677_1091010_+|WP_013383951.1|head,tail|DBSCAN-SWA MRPPRMNRALILQAPTRTPDGAGGYTQGWQTLGTLWAAVTPATGREAAALGAALARVPVRITLRAAPAGDPRRPIAGQRLTEGPRSFLILAVQETSARLLTCIAEEELVR >NZ_CP016592|1085621:1098652|1087053_1088217_+|WP_013383945.1|portal|DBSCAN-SWA 
MFGFGEKKQPAPLVEVKASAAGKVVGFGTAGRTGFQPREGSSLVRAGFAANPIGFRAVRLISEAASALPLILQDATQRYDTHPVLDLLARPNPAQGQLELFEAIYAQLLLTGNAYVEAVSDGALPTELHVLRSDRMSVVPGPDGWPTGYDYAVAGRKHRFDAAAICHIRAFHPHDDHYGLSPLTPAAAAVEVHNAASRWSRGLLENAARPSGAIVFRGADGNGTLSNGQFDRLVAEMESQHQGARNAGRPMLLEGGLDWKPMGFSPSDMEFLQTKEAAAREIATAFGVPPMLLGIPGDATYANYQEANRAFYRLTVLPLASRVTGALVNWLDDFAGTWLDLRPDPDQIAALQTERDALWARVGAASFLSTAEKRALLGLPGEPDGAA >NZ_CP016592|1085621:1098652|1093585_1094431_+|WP_013383958.1|DBSCAN-SWA MSDHSTTRCTAWAITRTDGLQLGFTDHDGDLTFAGLTFRAGAGMSGAALVQGAGLAVDNTEGFGMITDDAVGEGDLRAGRFDGADIRIYQVNWRAPADRSLIFHGTLGEITLEDGAWRAELRGAAEALSRPIGRSYQRGCAAVLGDAACGFDLDTPGFAMDAALIAVDDTTLTIAAPDLDPRWFERGVVKITTGAAAGLSGMIKSDASLGAQRLISLWSPLGVQPQAGDQIRLLPGCDKRLATCRAKFGNLHNFRGFPHIPGEDWLIAAPKTNGSGESLFR >NZ_CP016592|1085621:1098652|1090156_1090681_+|WP_014537610.1|DBSCAN-SWA MMLVEETTVDDAALPVAALGAFLRLGSGFGTDGLQDDLLRAFLRAALAAIEGRINKILIARSFAQQMTSGQAMAVGPLRAVLSVTVDGTAQPYALAGTTITAPTTAKLTVRYDAGLADDFAALPADLQQAVLMLAAHYYEYRQDPALDGACMPFGVSALTERYRTLRLSMGART >NZ_CP016592|1085621:1098652|1094842_1098652_+|WP_013383960.1|DBSCAN-SWA 
MATLVLSAVGASVGASIGGGILGLSSAVIGRAVGAVAGSLIDQRILGGGAQPVETGRIDRFRVTGASEGAAMARLYGRMRVGGQVIWATKFMETSTQTRAGKGQPKTTTFSYTTSLAIALCEGPINGIGRIWADGTEIAPTDLSLRLYHGHMDQLPDPRISAVEGADNTPAYRGTAYVVIEDLDLGPYGNRVPQFSFEVIRNDPARDDTFAGAVQAVAMIPGTGEYALSDTPVALRYSYADEGTQNENTPSGQSDFLTALDQLNTELPRVTSVSLVVSWFGDDLRAGQCKVQPKVEQTAFDAPDQPWRAGGITRSAAATVPRVGGSPIYGGTPSDAAVISAIRTIRARGQEVMFYPFILMDQLADNTLPNPWTGQAGQPPLPWRGRITTSLAPGQPGTTDGTAAARAEVAAFFGTATPAHFTRTGERVHYTGPNEWSLRRFILHYAHLCAAAGGVDSFCISSEMVALTQVRDDIGFPAVSALMALAADVRSILGPDTLITYAADWSEYHGYQPLGTGDKLFHLDPLWAHEDIDFIGIDNYMPLSDWRDGDSHLDAQAGAIYNLDYLTANVAGGEMYDWFYHSPEARDAQIRTAITDGYDQPWMWRVKDILGWWSHAHFDRVDGAQGPQSPWLPRSKPIRFTEIGCAAIDKGTNQPNKFLDPKSSESALPYYSNGLRDDFIQLQYLRALNRHFADPSQNPTSEIYDGPMVEMDYAHVWAWDARPYPWLPARGDLWSDGANYDRGHWLNGRAGGQALAAVADQICTDAGLSANTDALWGMVQGYAMDRIETGRAALQPLMLAHGFDAVDRDGALHLITRHGRPIATREMDDLVAHDAPALVRTRLPEAELAGQVRVAFVAAGGDFSIGGAEATLADTPRDTVSTSDLPLLMSRADATRAAERWLLESRLAREVATFTLPPSDAWLRVGDVLTLAGDDYRIDQLERAEALGITATRTSRSLFLPHDAVEDIPQPAAFAPPMPVAATFLDLPSETGPSFAVALTSATWPGEVAIHAGPPLVELARSAAPAVVGETLNDLSAARAGIWDRGPALRVRLVSGTLASHLPEALLSGANLAAIGDGTSDIWEVFQFAEAALVAPNEYALSLRLRGQGGSDGVMPPVWPAGSRFVLLDNRLTPLDVPRGVSRDWHWGPVQRPMSDRTWRQANRAFTGVGLRPYAPCHLRVSDTAVTWQRRTRSGGDSWDGIDVPLGEERELYRLRMYQSGALLREVMLDTPAFAYPAAMRAADGAGVTVEVAQMSQVFGAGPALVGAI >NZ_CP016592|1085621:1098652|1085621_1086962_+|WP_049776461.1|DBSCAN-SWA MKSAAAWLASTPAATRAEFLGALADHQIAALPWLFEFWALPHQLPPSGDWRTWVIMGGRGAGKTRAGAEWVRSMVEGPRPDTPGRAKRVGLIAQTMDQAREVMVFGDSGLMACCPPARRPEWIAGRAMLRWPNGAEARLFSAHDPEALRGPQFDAIWADEVAKWRLAQEAWDMLVMGLRLGDDPRACLTTTPRGGPFLRKLLAQSGTVMTHAPTRANRANLAPGFVAAVEAMFEGSHLGRQELDGLLVDEAEGTLWPQHLLDAALQRQAPPLDRIVVAVDPPVTGHAGSDACGIIVAGVEQRGAPTDWRLWVIEDATVQGASPHTWASAAIAAFHRHGADRLVAEVNQGGALVESVLRQLDPHIPYRAVRASKSKGARAEPVSTIYERGRACHLPGLALLEAQMSLMTLQGFTGKGSPDRVDALVWAAHELMLGAGGASPQIRAIG >NZ_CP016592|1085621:1098652|1094427_1094841_+|WP_013383959.1|DBSCAN-SWA 
MTPDTLARAWIGTPFVHGASLQGVGADCLGLITGLWRQIYGPAPWPLDYSPDWSVTLGPDAMARAADRYLPRAAQLYPGALMLLRLRPHLPPAHLAICAGPTFIHAFHAGGVVESPLSLPWRRRIAGLYHFAPKQES >NZ_CP016592|1085621:1098652|1092956_1093589_+|WP_013383957.1|DBSCAN-SWA MAFHDIRFPAAISFDSLGGPTRRTEIVTLTSGYEQRNTAWAHSRRRYDAGVGLRSLDDVAQLIAFFEARGGQLHAFRWKDWSDYKSCAPSAAISEMDQTLGYGDGATADWPLVKNYVSGEGAYARPITKPVANTVQIAVAGQKLDEGTDYTLNLGLGRVIFASPPAPGAEISAGFEFDVPVRFETDTIQISVSSFRAGQIPSVPLIEVRP >NZ_CP016592|1085621:1098652|1092332_1092944_+|WP_013383956.1|tail|DBSCAN-SWA MADTTTAELQQTQSVTAAFNAGLREMRGTLSATSRDVAGLERGLSQGLRRAFDGLVFDGDRLSTAVSSIAQSVQNAAYNAAMRPITDKIGGWLASGIESLMPFAEGGTFTQGRVMPFAKGGVVTAPTTFPMRGGTGLMGEAGPEAIMPLTRGADGRLGVAAQGGGGVNLVMNIQTPDAASFHRSQSQIGAQVSRLVARGNRNR >NZ_CP016592|1085621:1098652|1091404_1091821_+|WP_013383953.1|tail|DBSCAN-SWA MSAQNGKDLLIKIDMTGDGLFETVAGLRASRISFNAETVDVTTMESQGGWRELLAGAGMRSASVSGAGVFRDQSTDERMRALFFSGEVPAFRIIIPHFGAIEGRFQITALEYAGTYNGEATYDVTLASAGALTFEAEV |
16 | Paracoccus_phage(33.33%) | capsid,head,tail,portal,protease | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
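The keyword labels can be read straight from the protein headers in the FASTA dumps. The sketch below is a minimal illustration, not the database's own code. It assumes, from the headers shown here, that the fields are contig, region `start:end`, protein `start_end_strand`, accession, an optional comma-separated keyword field, and the tool name. The function name `parse_header` and the dictionary keys are hypothetical.

```python
# Keywords quoted in the note on colored proteins.
PHAGE_KEYWORDS = {
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
}

def parse_header(header: str) -> dict:
    """Split one pipe-delimited protein header into named fields.

    Assumed layout (inferred from the headers in this page):
    contig|region_start:region_end|start_end_strand|accession[|keywords]|tool
    """
    fields = header.lstrip(">").strip().split("|")
    contig, region, location, accession = fields[:4]
    # The keyword field is present only when the header has six fields.
    keywords = fields[4].split(",") if len(fields) == 6 else []
    start, end, strand = location.rsplit("_", 2)
    region_start, region_end = region.split(":")
    return {
        "contig": contig,
        "region": (int(region_start), int(region_end)),
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "accession": accession,
        # Keep only labels that appear in the quoted keyword list.
        "keywords": [k for k in keywords if k in PHAGE_KEYWORDS],
        "tool": fields[-1],
    }

example = parse_header(
    ">NZ_CP016592|1085621:1098652|1088425_1088953_+|WP_013383947.1|head,protease|DBSCAN-SWA"
)
print(example["accession"], example["keywords"])
```

Collecting the `keywords` lists over all proteins of a region gives the set of labels that region's summary row reports, for example capsid, head, tail, portal and protease for DBSCAN-SWA_2.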
DBSCAN-SWA_3 |
1288500 : 1295722
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP016592|1288500:1295722|DBSCAN-SWA AATGACCGACATGACCACGCCGCGCCAAGCCGCCCCGCGTCAGGGGCTTTTGATCATCTTGTCCTCGCCTTCGGGCGCGGGAAAATCGACGCTGTCGCGCCGTCTGATGGCCTGGGATGAGACGCTGCGATTCTCGGTCTCGGCCACGACGCGTGCGCCGCGGGCGGGCGAGGTGGATGGCGAACATTACCATTTCATGACGCGCGACGGCTTTGGCCAGTTGATCGCCACCGATCAGATGCTGGAGCATGCCGAGGTCTTTGGCAATTATTACGGCAGCCCGCGCGGTCCGGTGGAAATGGCGATGGCGCAGGGCCGCGATACGCTGTTCGACATCGACTGGCAGGGCGGGCAACAGATCCGCAACTCGCCGCTGGGGGCGGCTGTTGTGTCGATTTTCATCCTGCCTCCCTCGATTGCCGAGCTGGAAAGCCGTCTGCGCGCGCGCGCGCAAGACAGCGAAGAGGTGATCGCGAAACGTATGCGCGAAAGCATGAACGAAATCAGCCATTGGGCGGAATACGATTACGTGCTGGTCAACGAGGATCTGGATCAGGCCGAGGCGCAGCTGATCACCATCATTCAGGCCGAACGCGCCCGTCGCAGCCGCCAACCTTGGCTGAATGGCTTTACGCGCGGTCTTCAGCAGGAATTTACATTGCGTGGCACGTCGTCATAATTCCGTTTCGTTTCGCCCGGATAAGCGCGAGACGAAGGAGACCGACATGCCCGACAATCTGAACGCCATCATTGGCGCGCTGGATTTGCCCGCTTTGATTATCTCGCCCGAGGGTCAGATCATCGCGCATAACGCGGCGGCGCGGGATTTGATCGGGATGGATATGGTCGGGCTGCCCCATGCGGCGGTGCTGCGCCAGCCTGCGGTCAGTGCTGCGGTTGATCTGGTGCTGTCGGGCGCGCCTGAATCGCGGGCGCGACTGACCCAGCGCGGTGCCTCGCGCGATTCGATCTGGCAGATGCGCGCCGCCGCATTCGAGGGGCAGCGCCGCGCCATTTTGGTGACCTTTACCGATCTGACCGCAGTCGAGGAGGCGAACCAGATCCGCCGCGATTTCGTCGCCAATGTCAGCCACGAGCTGCGCACCCCGCTGACCGCGATCATGGGCTTTATCGAGACATTGCGCGGCCCCGCCCGCGATGACCCCGGCGTGCGCGGCCGTTTTCTGGATATTATGGAGCGCGAGGCCAACCGTATGGTGCAGCTGGTCGATGGCCTGCTTTCGCTGTCGCGGGTCGAGGTGGATGAACGCGTCCGCCCGACGACACCCGTGGATCTCAAGGCGCTGGCCGAAGAAACCATTGCCGCGCTAGAGCCTTTGGCCGCGCAGGGCAATAATACAGTCACCCTGCACGCCGAACCCGGCAATTGGATTGTCCCCGGCGATATCGGGCAGTTGCATCAGGTGCTGCGCAATCTGGTGCAGAATGCGCTGAAATACGGCGGGCCGGACAAGAATGTCGTGATCGCGTTGCATCCGGCGCAGTTCGATGTGGCTTTGCGCGCGACAGCCGTACGGATCGACGTGCAGGACGAAGGCCCGGGGATCGAGGCGCACCACATTCCCCGCCTGACCGAGCGTTTCTATCGCGTCGATGCGCATCGCGCCCGCACCGTGGGCGGCAGCGGCCTTGGCCTTGCGATCGTCAAGCATATCGTGAACCGCCACCGCGGCCGGTTGGCGATTTCCAGCACACCCGAAAAGGGCAGCACGTTCAGCGTGCTATTGCCGCAAGAGTGATCCGTTATACGTCTTGCGCGGAATTTCCCCCTCGCCATCCCTTGACCTTTGAAGGCGATGGGTCACATATCAGCTTACCAAGGCATTGAAAGGATTAGTCCCATGGCTTTCACCCTTCCGGAACTTCCCTACGCCCACGACGCACTTGCCGCCAAAGGCATGT
CGCGTGAGACGCTGGAATACCACCACGACCTGCACCACAAGGCCTATGTCGACAACGGCAACAAGCTGATCGCCGGCACCGAGTGGGAGGGCAAGACCCTCGAAGAGATCATCACCGGCACCTATAATGCCACTGCTGTTGCGCAAAACGGCATCTTCAACAACATCAGCCAGCTTTGGAACCACAACCAGTTCTGGGAATGGCTGTCGCCCGAAACCGTCGCCATCCCGGGCGAGCTGGAAAAGGCCCTGACCGAGTCCTTTGGTTCGGTTGCCAAGTTCAAAGAAGAGTTCTCGGCCGCTGGTGCCGCCCAATTCGGTTCGGGCTGGGCATGGCTGGTCAAAGACAAAGACGGCAGCCTGAAAGTCACCAAGACCGAAAACGGTGTGAACCCGCTGGTGTTTGGTCAAACCGCATTGCTGGGCGTCGACGTGTGGGAACACTCGTATTACATCGACTTCCGCAACAAGCGTCCGGCCTATCTGACCAACTTCCTCGACAATCTGGTCAACTGGGAAAAGGTTGCGTCGGCGCTTTAAGCCGCCTACAGATTTCGAACCAGACCCGGCGCTGTGGCGCCGGGTTTTTTCTTGGGTGAGACATGCGCGACTTGCAGCAATCCTCGGGCGTTCTGGCCCTGATCGGCACCTATCTGCTCTGGGGCTTTATCGCGATCTATTTCGGGGCGGTCTCGCATGTGCCGCCGATGGAAGTGCTGGCCTATCGCGTGTTCTGGGCGGCAGTGTTTTATGGGTTGATCCTGCTGGTGCAGGGGCGGTTCAGCGAAGTGCCCACGGCCATGCGCGATCCGCGCAAGCTGCGGCTGATGCTGCTGGCCGGGCTGATGATCGCCGCGAACTGGCTGCTGTTCATCTTTGCCGTCAGCAATGGCCACGCGACCGAGGCCTCGATCGGTTATTACATCCTGCCGCTGATCGCCGCTGTCACCGGCTTTGCCGTCTTTCGCGAAAAGCTGGGGCGCTGGCAGATCGTCGCACTGCTGATCGCGGCCAGCGGCGTGCTGGTGCTGACGCTGGGCCTTGGGCGCGCGCCATGGGTCAGCCTGCTGCTGGCGGGCACGTTCGTCATCTATAATGTGATCAAAAGAACGCTAAAGGATGTGCCCTCGCTGGTCTCGGTGATGGCCGAGGTGATCTTGCTGGTCATTCCCGCCGCGCTTTATCTGGTGTTTTTCGGCGAGACGCTGTGGCAAGCGCCGCTGTCGCCCGCCTGGTGGCAGGATCAACTGCTGCTGATGTTGGCGGGGCCGATTACGGCCATTCCGCTGATCTTGTTCGGTTATGGCGCGCAACGGGTCAGTATGGCCACAACGGGGATCATCTCTTACATGAACCCCACGATGCAACTCCTTGTTGCAACGCTTTATTTCCACGAGGCGCTGACGATCTGGCACGGTGTCGCACTGGCGCTGATCTGGCTGGCCCTTGCGGTCTATACCGGGGCCAGCCTGCGGGCGCATCACGCCGCGAAGTAAGGCTTCAGCGCGGCTTCGATCTCTTCGACCGTGTTCACCACCGTGGTGTAGGACAGCAGGCTGCTTTCGGCAAAGCCCTGCGCCACGATGTTTTCCAAAAGCTGCACCAGCGGGTCCCAGAACCCGTCGACATTCAGCAAAAAGATCGGCTTTTGGTGCAGGCCGATCTGGCGCCAGGTCAGCACCTCGAAATACTCGTCCAGCGTGCCTGCGCCGCCGGGCAGGACGACGATGGCATCCGCGTTCATGAACATGACCTTTTTGCGTTCATGCATCGTCTCGGTGATGATCAGCTGATCCAGTTGGCGGCGACCAACCTCGCGCTTCATCAGATGGGTGGGGATGACGCCCACGGCCGCGCCGCCGGCCTCTTGTGTGGCGCTGGCGACAAGGCCCATCAGGCCGACATCGCCCGCGCCGTAAATCAGGCCCCAGTCATTGCGCGCGATCATCGCACCCGTGGCACTGGCCAGTTCTGCATAATGCGCCAGCGTGCCATAGCGAG
AGCCGCAGTATACGCAGATTGATTTCTTGGTCAGCGCCATGAATTTACCCCTAAACTGTTTCCTTCAAGTCTCAGGTCCCAGATTAGTTGGGGCGGGCGGGTAACTCAACGCATGTCGAACTCGTCACGATGACAGGATTGTGTCACATCCTGTTCAAATTCTTGATGGGATTCGGATAGGCAGTGTCAAAGACGGTGAAAATTATCCTTGGGTTGCTGGCGGCTGCGGTTGCCTTTGTGGTGCTGATCCTGTTCGGCACAAGCGCGCGCATGACGCCCGGCACCCAAGTTCCCGCAGCAGCAGCAGAGGCACCCGCCGTCGCACCGCCAGAAACAGCAACCGCTGAGGCCGATACGGCCGAGCAACAGTCCGCCGCCGCAACGCCTGATCCGGCTGCGGATGCTGCGCCCGAACCTGAGCCCGTCGCGGATGAACCCGCCGCCCAGCCCGCACAGGACGCGCCGCGCCTTGATGTCGTGCGTCTGGCCGGAGATGGTCTGGCTGTTGTCGCCGGAAATGCCGCACCGGGCAGTGATGTGACGATTGTGGTGGATGGCGCGCCATCGACCACGGTGACCGCCAGCGCGGATGGCAGTTTTGCTGGCGTTGTCGATATCGGCGCCAGCGATGCGCCGCGCGTCATCGGCTTGCAAACAGAAACGGCCGACGGTCCCGTTGCCAGCTTGTCCGAGGCGATCCTTGCGCCGAACCCGCCCGCAGCAGCGCCTGAACCTGTCGAAGTGGCTGATGGTGAGGCGGCGGGTCAAGTGCCGACCGAAGGCGAACCCGCAGATGAACCCGCAGAACAATTGCCGGTGCAACAGGCCGCGCCGACGATCCTGATCACCGATGCCGATGGTGCGCGCGTGATCGCCGCGCCTGCGCCGCTCTCTGCCGATGCGCCGGCGCAGCTGCTGCAGACAATCTCCTATAATGCCGCCGGGAATGTGCTGCTGTCCGGCCGCGCTGCCGGTGTCGGTGGGCGCGTTGCGATCTATGTCGATAACGTGCTGCTGGGCTTTGCCGCGGTCGCCGATGACGGCAGTTGGAGCCTGCAACACGACGGCATCGACCCCGGTCGCCACACGTTGCGCGTCGATTGGCTGGACACGGAGGGCCGTGTGCGCAACCGTGTCGAGACGCCTTTCCTGCGCGAGGACGAAGGCGCGCTGGCGCAGGCCGTTGCAACCGAGGCCACTGCCGCCACCACCGGCATCGCCAGCCGCACCGTGCAACCCGGCAATACGCTCTGGGCTATTGCGCGCGAACGTTACGGCAGCGGCATTCTTTATGTGCAGGTGTTCGAGGCGAACCGCGACCGCATCCGCGATCCCGATCTGATCTATCCGGGCCAGATTTTCGACCTGCCGGATCTGCCGGAATCGCCCGATCGCCCTGCGACGCCGTAACCGCGCAGACGCACTGGCGCATCCCGGTCCAAGGGACTAGGGTGCGCCTCCTTGGGAAAAGGGGGCAACATGCACAGGCCGCGTATTTCAATCACTGACGCTTCGGCAAAGCCGGTGCAGGGCCTTTCCATCCTGCGGCGCGTGCTGCCCTATTTGTGGCCCGATGGCGCGAATTGGGTGAAATACCGGGTGGTTGCGGCCCTTGTCCTGCTGATGATCGCGAAACTGATCACGGTTGCGACGCCTTTGTTTTACAAATGGGCGGTCGATAGCCTGTCGGGCGTGGTCAGCGGGCCCGCGGGCATGATGGCGCTGGGGGCGGTGGGCCTGACCGTCGCATATGGCGGCGCGCGTCTGCTGACGGTCGGCTTTCAGCAGCTGCGCGATGCGGTCTTTGTGCGCGTCGGCCAGCGGTCGCTGCGGATCATCGCGGGACAGGCCTTTGCGCATATGCATCAACTTTCGATGCGCTATCACATCACGCGCAAGACCGGCGGCCTTAGCCGCATTATGGAGCGCGGGATCAAAGGCGTCGACTTCCTGCTGCGGATGTTCGTCTTTTCGCTGGGGCCCTTGGTGCTGGAGCTGGT
GCTGGTCTGCGCCACCTTATTCTTTCTGTTCGACGTGCGTTTCCTGCTGGTCGTTGCGGGCACGATTGCCGCCTATATCGTGTATACATTGCTGGCGACCGAATGGCGCGTCCGCATCCGCCGCAAGATGAACGAGCAGGACAATGACGCCAATCAAAAGGCCATCGACAGTCTGCTGAACTTTGAAACGGTGAAATATTTCGACGCCGAGACGCGCGAAGTGAACCGCTATGATTCGGCGATGGAGAAATATGAGGATGCCGCCGTCAAAACTGGTGTCTCGCTGGCCGCGCTGAATTTTGGCCAATCGCTGATCATCACCCTTGGCCTGACCGCCGTCATGGTGCTGGCCGCGATGGGGGTGCAGGATGGCACGATGACGGTTGGCGATTTCGTCATGGTCAACGCCTATATGATCCAGGTCACCCAGCCGCTGAACTTCCTTGGCACGATCTATCGCGAAATCCGGCAGGCTCTGGTCGATATGGGTCAGTTGTTCGATCTGATGGGCGAGGCGGCCGAGGTCAAGGACAAGCCGGGCGCGCCCGCGCTGCGGATCACCGGCGGCGAAATCCGCTTTGAGGACGTCCGCTTTCACTATGCCCCGGATCGCGAGATTTTGAAGGGCGTCAGTTTCACCGTGCCCGCGGGCAAAACGCTGGCGCTGGTCGGCGCGACGGGATCGGGGAAATCCACCATCGGGCGGCTGTTGTTCCGGTTCTATGACGTGACGGGCGGACGCATCTTGATCGACGGCCAAGATATTCGTGACGTGACCCAGGAATCGCTGCACCGCGCCATCGGTGTGGTGCCGCAAGACACCGTGCTGTTCAACGACACCATCGGCTATAACATCGGCTATGGTCGCGCAGGCGCCACGCAGGCGCAGATCGAGGATGCTGCCCGCGCCGCGCAAGTCCATGATTTCATTGCCAGCCTGCCCGAAGGCTATGAGACGCAGGTGGGCGAGCGGGGCTTGAAACTCTCGGGCGGGGAAAAGCAGCGCGTGGGTATTGCCCGCACGCTTTTGAAAGATGCGCCGCTGTTGCTGCTGGACGAGGCGACATCTGCACTCGACACCAATACCGAGATGGGCGTGCAAGAGGTGCTGGCGCGGGCCGAGGCGGGGCGCACCACGATTTCCATCGCGCACCGCCTGTCCACCATTGCCGACGCCGACGAGATCATCGTTCTGGATCACGGCGCGGTGGTCGAACGCGGCAGTCATGGCGGGCTTTTGGCCCAGAACGGCCGCTATGCCCAGCTATGGGCGCATCAGCAAGCTTCCGACCAAGACTGA
Protein sequences of DBSCAN-SWA_3 >NZ_CP016592|1288500:1295722|1291901_1292456_-|WP_044008154.1|DBSCAN-SWA MTKKSICVYCGSRYGTLAHYAELASATGAMIARNDWGLIYGAGDVGLMGLVASATQEAGGAAVGVIPTHLMKREVGRRQLDQLIITETMHERKKVMFMNADAIVVLPGGAGTLDEYFEVLTWRQIGLHQKPIFLLNVDGFWDPLVQLLENIVAQGFAESSLLSYTTVVNTVEEIEAALKPYFAA >NZ_CP016592|1288500:1295722|1288500_1289178_+|WP_014537685.1|DBSCAN-SWA MTDMTTPRQAAPRQGLLIILSSPSGAGKSTLSRRLMAWDETLRFSVSATTRAPRAGEVDGEHYHFMTRDGFGQLIATDQMLEHAEVFGNYYGSPRGPVEMAMAQGRDTLFDIDWQGGQQIRNSPLGAAVVSIFILPPSIAELESRLRARAQDSEEVIAKRMRESMNEISHWAEYDYVLVNEDLDQAEAQLITIIQAERARRSRQPWLNGFTRGLQQEFTLRGTSS >NZ_CP016592|1288500:1295722|1293901_1295722_+|WP_013384148.1|DBSCAN-SWA MHRPRISITDASAKPVQGLSILRRVLPYLWPDGANWVKYRVVAALVLLMIAKLITVATPLFYKWAVDSLSGVVSGPAGMMALGAVGLTVAYGGARLLTVGFQQLRDAVFVRVGQRSLRIIAGQAFAHMHQLSMRYHITRKTGGLSRIMERGIKGVDFLLRMFVFSLGPLVLELVLVCATLFFLFDVRFLLVVAGTIAAYIVYTLLATEWRVRIRRKMNEQDNDANQKAIDSLLNFETVKYFDAETREVNRYDSAMEKYEDAAVKTGVSLAALNFGQSLIITLGLTAVMVLAAMGVQDGTMTVGDFVMVNAYMIQVTQPLNFLGTIYREIRQALVDMGQLFDLMGEAAEVKDKPGAPALRITGGEIRFEDVRFHYAPDREILKGVSFTVPAGKTLALVGATGSGKSTIGRLLFRFYDVTGGRILIDGQDIRDVTQESLHRAIGVVPQDTVLFNDTIGYNIGYGRAGATQAQIEDAARAAQVHDFIASLPEGYETQVGERGLKLSGGEKQRVGIARTLLKDAPLLLLDEATSALDTNTEMGVQEVLARAEAGRTTISIAHRLSTIADADEIIVLDHGAVVERGSHGGLLAQNGRYAQLWAHQQASDQD >NZ_CP016592|1288500:1295722|1289224_1290259_+|WP_013384143.1|DBSCAN-SWA MPDNLNAIIGALDLPALIISPEGQIIAHNAAARDLIGMDMVGLPHAAVLRQPAVSAAVDLVLSGAPESRARLTQRGASRDSIWQMRAAAFEGQRRAILVTFTDLTAVEEANQIRRDFVANVSHELRTPLTAIMGFIETLRGPARDDPGVRGRFLDIMEREANRMVQLVDGLLSLSRVEVDERVRPTTPVDLKALAEETIAALEPLAAQGNNTVTLHAEPGNWIVPGDIGQLHQVLRNLVQNALKYGGPDKNVVIALHPAQFDVALRATAVRIDVQDEGPGIEAHHIPRLTERFYRVDAHRARTVGGSGLGLAIVKHIVNRHRGRLAISSTPEKGSTFSVLLPQE >NZ_CP016592|1288500:1295722|1292605_1293832_+|WP_014537688.1|DBSCAN-SWA 
MSKTVKIILGLLAAAVAFVVLILFGTSARMTPGTQVPAAAAEAPAVAPPETATAEADTAEQQSAAATPDPAADAAPEPEPVADEPAAQPAQDAPRLDVVRLAGDGLAVVAGNAAPGSDVTIVVDGAPSTTVTASADGSFAGVVDIGASDAPRVIGLQTETADGPVASLSEAILAPNPPAAAPEPVEVADGEAAGQVPTEGEPADEPAEQLPVQQAAPTILITDADGARVIAAPAPLSADAPAQLLQTISYNAAGNVLLSGRAAGVGGRVAIYVDNVLLGFAAVADDGSWSLQHDGIDPGRHTLRVDWLDTEGRVRNRVETPFLREDEGALAQAVATEATAATTGIASRTVQPGNTLWAIARERYGSGILYVQVFEANRDRIRDPDLIYPGQIFDLPDLPESPDRPATP >NZ_CP016592|1288500:1295722|1290361_1290961_+|WP_013384144.1|DBSCAN-SWA MAFTLPELPYAHDALAAKGMSRETLEYHHDLHHKAYVDNGNKLIAGTEWEGKTLEEIITGTYNATAVAQNGIFNNISQLWNHNQFWEWLSPETVAIPGELEKALTESFGSVAKFKEEFSAAGAAQFGSGWAWLVKDKDGSLKVTKTENGVNPLVFGQTALLGVDVWEHSYYIDFRNKRPAYLTNFLDNLVNWEKVASAL >NZ_CP016592|1288500:1295722|1291023_1291917_+|WP_014537686.1|DBSCAN-SWA MRDLQQSSGVLALIGTYLLWGFIAIYFGAVSHVPPMEVLAYRVFWAAVFYGLILLVQGRFSEVPTAMRDPRKLRLMLLAGLMIAANWLLFIFAVSNGHATEASIGYYILPLIAAVTGFAVFREKLGRWQIVALLIAASGVLVLTLGLGRAPWVSLLLAGTFVIYNVIKRTLKDVPSLVSVMAEVILLVIPAALYLVFFGETLWQAPLSPAWWQDQLLLMLAGPITAIPLILFGYGAQRVSMATTGIISYMNPTMQLLVATLYFHEALTIWHGVALALIWLALAVYTGASLRAHHAAK |
7 | Bacillus_phage(33.33%) | NA | NA |
Homologous phage analysis in the prophage region
Colored bacterial proteins are those whose annotation matches a phage-related keyword ('capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
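The protein FASTA for each region is flattened here, with headers and sequences run together on a few long lines. To recover individual records, for instance to check that a region's dump holds as many proteins as its summary row reports (7 for DBSCAN-SWA_3), a sketch like the one below may help. It assumes headers contain no whitespace and that anything other than letters in a sequence (line breaks, a trailing `|`) is layout residue. The function name `split_flat_fasta` is hypothetical, and the sequences in the example are shortened for illustration.

```python
import re

def split_flat_fasta(text: str) -> list[tuple[str, str]]:
    """Return (header, sequence) pairs from FASTA text flattened onto long lines."""
    records = []
    # Everything before the first '>' is a caption, not a record.
    for chunk in text.split(">")[1:]:
        parts = chunk.split(None, 1)  # header ends at the first whitespace
        header = parts[0]
        # Drop whitespace and stray table characters from the sequence.
        sequence = re.sub(r"[^A-Za-z]", "", parts[1]) if len(parts) > 1 else ""
        records.append((header, sequence))
    return records

# Shortened example built from two headers of DBSCAN-SWA_3.
flat = (
    "Protein sequences of DBSCAN-SWA_3 "
    ">NZ_CP016592|1288500:1295722|1291901_1292456_-|WP_044008154.1|DBSCAN-SWA MTKKS "
    ">NZ_CP016592|1288500:1295722|1288500_1289178_+|WP_014537685.1|DBSCAN-SWA\nMTDMT |"
)
records = split_flat_fasta(flat)
print(len(records), [seq for _, seq in records])
```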
DBSCAN-SWA_4 |
2030689 : 2084819
Sequences of DBSCAN-SWA_4
Nucleotide sequences of DBSCAN-SWA_4 >NZ_CP016592|2030689:2084819|DBSCAN-SWA ATTAGACGCGGTCTTTGGGCGCTTCTTCGGCAACCGGGGTTGCGACAGGCGCAGCCGGGGTCACATCGCGCAGTTCGGTTGCGGATACTTCCAGCTCTTTTTTGCCGTCATCCACGCCGCGTTTGAACGCAGTGATGCCCTTGCCGACTTCACCCATCAGGTTCGAGATTTTGCCCCGGCCAAACAGCACCAGAACAACGACAGCGATCAGCAGAATGCCCATCGGGCCGATATTGTTAAACATGTCACTCTCCGCAGCAACCGATGCGCCCATGGCACCGGTGAAATAAATCCATGCCCTGTGATGCGCCTGCGGATGGGCGGAAGTCAAAGCGTGCCACGGGGTGGGTGGCGCAATAAGGTTGCATTTTTCGAAATCCGCGACAAAAACATGTCAGTAGCCAAGAACGAGAGGCTAAAATGCCATCACCGCGCAATCAACGCCTGTTCCAGATCATGCAAATTCTGCGCAGCGCGCCTGGCGGTGCCCTGCTGTCTGCCCATGATATTGCGACGCAAACCGGCGTCAGCGACCGCACAATCTATCGCGATATGGCAACGCTGATCGATTCGGGCCTGCCGGTCGCGGGCACGCCCGGACAGGGCTATCACATTACCGCCGCCATTACGCTGCCCCCCTGAACCTTAGCCTAGACGAGATCGAGGCCCTCCATATCGGCCTCTCCATTTTGGGCGAGGCCGATGACATCGGCCTACGCGCCGCCGCGCAAAGCCTGTCGAATAAGGTCGATGCGGCGCTGTCTGCGGATCTCAGCGATGCAGATCGGCGCTGGGCCCTCAGCGGCCCCGGCGTCAACGCGGCTGTCGCCGAAGCCGCGCGCGGCTTTCATTTTTTGCCCGTTTTGCGCAGCGCGATTGCCCGCCGCCAAAAGCTGCGCCTTAGCCTCAGCGCCAGCGCTTCCCCGCCGTCAGAGCGGATCGTCAGACCCCTGCGGATTGACTATTGGGGACGCATCTGGTCATTGCAGTGCTGGTGTGAGACCACCGGCGGGATGGAGCAACTGCGCACCGATCACATTGATAGTGTTAACGTTCTGCCGCAGCTTTTCACGCCGCCGCCGCAGTAGAAACCGCTCGGGACGTTATTTATAGGCAATATACCGCGCACCCCATCACATTAATTAACAAACCATTAACGAATACTTGACTGAATAACCTGTCACAGAACATGCTTGCATTAGATTTAGGGAATTCCCCGGAATGATTGGGGTGACGGGCGGTATGGAAGCAACGTCGGGCGGCCTTTTTTCGCAGTCTTTGATGGTTTATCGCGGCGTGGATCTGACCGTGACGGATGGCGTTGCGCTGGGCGACAGCCTGACCTTTGCAGCAGGTCTATTGCCGGACGACTATTACACCCTTGACCGCGAGGCAGCGCCCGCGCGTCTCAGCTTTGCCGTTGCAGAAACCGGCCGCTTGATGATCGCGCCCGGCTCGAGCATCGGGACAGCGGGCCATAGCCTGCATGTCGATAGTGTGTTGCTGCTGCTGGATCGAAATGGCCGCGTGGCAGAAGTGCTACTGCTGGTCGAGGAAAAGGGCGGCATCGCGATTGCGGTCTATGCCCTGCCCCTATCGCCTTTGAATCGTGGCTTTGAATATCGCTTGGCCGCAGTCACCCAGCGAAACGCACGCCATCGCCTGTTTGGCACGCCGATGCTGCAATTCACGCATGATACGCTGCTGACGCTGGCAGATGGATCACGCATTGCGGCAAGCGACGTGCAGGTCGGACAACATCTGATGTCGGGCGATGGCAATATGATCACCGTCAACTGGATCGGCCGCTCGCAGCTGATGGCCGCGCGGGATATGGCGCCGGTGCTGGTGCCTGCGGGCAGTGCGGGCAATCAATCCGATCTGATCCTTGGGCCGCGTCACCGCATCCGTGGGCGACTGATC
CGTGATATGATCGGCCGCAGTAATATCCGCCAGATCGAGCCGATGGTGCAGCACTATGTCCAGATCCTGCCGCATCGCCATCGCAGCCTTGCCGTGGCCGGGGTCGCCGTCGACTGCCTGATGATCGAGGCCCCGCCGCCCCACACAGCCAATAGCGCCCCCGTGGCCCAAGGCCTGCCGTTGCCTGCCGTCGCCCAGAACGACTGGCGCTAGCATTTCTTGCCGTAGACGCGCGGATCGCCTATAGCCGCGCCATGCAAAACCAGACCTCGCCCCTCCCCGCTGAAATCGCCCGCCGTCGCACCTTCGCGATCATCGCGCACCCTGACGCGGGCAAAACGACATTGACGGAAAAATTCCTGCTGTTCGGGGGTGCGATCCAGATGGCGGGTCAGGTGCGCGCCAAGGGCGAAGCGCGCCGCACGCGGTCGGACTTTATGAAAATGGAACAAGATCGCGGGATTTCGGTCTCGGCCTCGGCCATGTCGTTCGACTTTGGTCAGTTCCGTTTCAACCTTGTCGACACCCCCGGCCACAGCGACTTTTCTGAGGATACCTATCGCACGCTGACCGCGGTGGATGCGGCTGTGATGGTGATCGACGGCGCCAAAGGCGTGGAAAGCCAGACCCGCAAGCTGTTCGAGGTCTGCCGCCTGCGCGATCTGCCGATTCTGACCTTTTGTAACAAGATGGACCGCGAGGCCCGCGATACGTTCGAGATCATCGACGAGATCCAGGAAAACCTGGCCATCGATGTCTCGCCCGCCAGCTGGCCCATCGGGTCGGGCCGCGACTTTCTGGGATGTTACGATCTGCTGCATGACCGGCTAGAGCTGATGGACCGCGCCGACCGTAACCGCGTCGCGGAAACCGTCTCGATCAGCGGGTTGGACGATCCGAAACTCGCGGAACATATCCCCACAGATATGCTAAAAAAACTGCGCGAAGAGATCGAAATGGCGCGCGAATTGATGCCCGCCTTTGACCGCGAGCGCTTCCTCGAGGGCTCGATGACGCCGATCTGGTTCGGCTCGGCGATCAATTCGTTCGGCGTGAAAGAATTGATGACCGGCATCGGCGAATTCGGCCCCGAGCCGCAGCCGCAAAAGGCGGCCGAGCGGATGGTGCCTGCAGGCGAAGGCAAAGTCGCCGGTTTCGTATTCAAAGTGCAGGCGAATATGGACGCCCGCCACCGCGACCGCGTGGCCTTTATCCGCCTTGCCTCGGGCCATTTTGAACGCGGCATGAAGCTGATCCATGTACGCTCAAAAAAGCCGATGGCCGTGACGAACCCCGTGCTGTTTCTGGCGGCCGACCGCGAATTGGCCGAAGAGGCTTGGGCCGGTGACATCATCGGCATCCCGAACCACGGCCAGTTGCGCATCGGCGATGCGCTGACCGAGGGCGAGATGCTGCATTTCACCGGCATCCCCAGCTTTGCCCCCGAACTGCTGCAAAACATTCGTGCGGGCGATCCGATGAAAGCCAAACATCTGGAAAAGGCGCTGATGCAATTCGCCGAGGAAGGCGCGGCGAAAGTGTTCAAACCGTCCATTGGCTCGGGCTTTATCGTCGGTGTCGTCGGCGCGTTGCAATTCGAGGTGCTGGCCAGCCGGATCGAGGTTGAATACGGCCTGCCTGTGCGGCTCGAGAGCTCGCAATTCACCTCGGCGCGCTGGGTGTCGGGTGACAAGGATGCGGTCGAGGCCTTTGTCAGCGCCAACAAACAACATATCGCCACCGATAACGATGGCGATCTGGTCTTCCTGACCCGTCTGCAATGGGACATCGACCGCGTGGCGCGTGATTATCCAAAGGTATCGCTGACCGCGACCAAGGAAATGATGGTCTCGTAAAATACAAAACGGCCGCTGTCCCTGCAGCGGCCGTTTTCAGGTCAATGGCTGCGCTTATTGAAGCAGGCCGATTGCCGTGCGGGACGTCACATTGATCGCGGCGGTGCAGGCGCCGGCCTCAGTGCCCTCAACGGTGCAGCTTTG
CCAATCGTTCACCAGCACATTGCCCAAACCGTCACAGGCACTGCCCGTCACTTCATAGCGGCGCACGCGGGTCTGGCCTGCGGCGACATCGCCGAGTTCAAACACCGTCAAACGCGACAGCCCGCCGTTCGTGTCGAACAGCGCGGTTTCGGCCATCATGCCGCCCAGGGGCTGTGATAACCCGTTGCGGGTGACAAAGGTCAGACGACAGCTGCCCTCGAACGATTCCAGACGGTTCAGCTCGACACTGATCTGTCCTTCGGTCACATCGGGGGTGCTGATGCGGCTGGCGACATCAAGATTTTGTGCGCAGGCCTGCGGGTCAAGGCCGACCGCCTCGCAGGTCAGGGTGCGGTTCACGATCACTTGGCCAACGTCATCACAGGCGCGCCCACGCAGATCGAACTGCTTCACGCGCACCGCGCCTTGGGGCACTTCGGTGAACTCGAGCAAGGTGAATTGCGCCGCCTCGCCTGCCGTATCCATCACGACGGCCTCAAAACTCAGCTTATAAAGGTTTTCTTCCAACCCGTTGCCGACGACGAACGACACGCGACAGGCGCCGTTTACATCCTCTTTGGCGTTCGCCTCGACCGCTAGGCGGCCGGGGGTGACCTGCTGGGCGGCGAGCGGCGCAGCCGCTGCGAAAAGCGCAAGGCCTACGGCCAGCGAAGAAATCATGCGCGTTTGGATCATGTGCATTCCAGTTTACAATCTGTTTCGCAGCCTGTTCATAGGCGGTGGAGACCAAAGCAATCGGCGCGCGCAGCGCGCAGCCGCCTTGCCTATACCTGCATCTCCGAGTAAATTAATCAACAATGCCGTTCAAGAGGAACGGTGATCACCAGCCACGTTACCGCATTTCCCCTCGTGCGGGCCGCAGCCAGATACGCCAGCTTCAAACCACAGGTGCATCATGGCTGCAGCCGATACCCACGCCCAAATTCCGACCCCGGCCGAAATCCGTCAAGCCCGCGCTGACAATCCCAAGGCGCGCGACCGCGATCTGGCCGAAAGCCTTGGCGTCTCCGAGGCCGCGCTGGTCGCCGCCTATGTGGGTCACGGCGTCACGCGCATTGCCGCAAACCCTGATCAGCTGATGCCCCTGATCCCCGCTTTGGGCGAGGTTATGGCCCTGACCCGGAACGAGGCTTGCGTCCATGAAAAGGTCGGCACCTATTCGGAATATCACGCGAACCCCCATGCGGGTTCCGTGCTGAACCCGAACATTGATCTGCGCACCTTCCCCAAACATTGGGTGCATGGCTTTGTGCTGGAAAAAGAGACCGAAACCGGCACCCGCCGCAGTATTCAGGTCTTTGACAGCGCAGGCGATGCGGTTCACAAAATCTTCCTGCGCGAGGGCTCGAACGTAGAGGCGCTCGAGGCCGTGAAAGACGCGCTGCGCCTGCCCGAGCAGTCGGATGTGGTCGAGACCACCGCCCGTCCCGCCGTCCAAGGCCCCAAAGCCGACCCGAGCAAAGCCGACGCCCTGCGCGCCGATTGGCAAGCGATGACCGACACGCACCAATTCCTGCGTATGGTCAGCGGCCTTGGCATGAACCGTCTGGGTGCTTACCACACGGTCGGTGCGCCCTATGCCCGCCTGCTGGACAAGACCGCTTTCCAAGCCATGCTGGACGGTGTTGTCGCACAAGAGATCGGCATCATGATCTTTGTCGGCAACCGCGGCATGATCCAGATCCACACCGGCCCGATCTATAAGCTGATGCCGATGGGCCCGTGGCAGAACATCATGGACCCCGGTTTCAACCTGCACCTGCGCGCCGATAAAATCGCCGAAGTTTGGGCTGTGACCAAACCGACCAGCCGCGGCGATGCGATTTCGATCGAGGCATTTGACGCCGAAGGCGACATCATCTTGCAGGTCTTTGGCGTGCAAAAGCCGGGTATGGAACATCGCCCGATGTGGAACGCCTTGGTCGAGGCTCTGCCTTCGGCCCAAGTCGAAGAGGTCGCATAATGCGGCGCCTTTTGCT
GACCTCGGCCATCGCAATTGCGGTGGCCACGGGCGCTCAGGCGCAGGATCGCATCCTGTCGCTGGGCTCGTCCGTCACCGAAATCCTGTTCGCCATCGGTGCCGAGGATAAGGTCATTGCGCGCGATCTGACCTCGACCTATCCCGCCGCTGCCGAGGCGCTGCCCGATGTCGGTTATGTCCGCGCCCTGTCGCCCGAAGGCGTGCTGTCGGTGAATGCTGATATGATCATCGCCGAACCTGACGCGGGGCCGGTCGAGACGATCGACGTTCTGAAAGCCGCCTCGATCCCGTGGGTGACGGTGCCTGCGGGCTGGGATGCCGCCCAGATCGTCGAGAAAATCAACCTGATCGGCGAAGCCACCGGCCATGCGGCCGAGGCCGCCGCGCTGGCGGCCACTGTGACAGCGGAGCTGGAGACCGCCGCCACCGCAGCGGCAGAGATCCCCGAGGATCAGCGCAAGCGCGTGCTGTTCATCATCTCGACCAATGGTGGCCGCGTGATGGCCGCAGGCAGCGAGACCGGCGGCAATGCGATCATTGAACTGGCGGGCGCGGTCAATGCCGTGCAGGGCGTCGAGGGTTACAAGCCCCTGACCGACGAGGCGATCACCGCCGCTGCCCCCGATTTCATCTTGATGATGGACCGCGGTCATAATCTGGATGCGGCGAATGACGAGCTATGGGCGATGCCCGTGCTGGCCTCGACCCCCGCAGGTCAAAATCAGGCCGTGATCCGCATGGATGGCATCTATCTGCTGGGCTTTGGCCCCCGCACCGGCGCCGCCGCGCTGGAGTTGCATAACGCGCTTTACGCCGGAAACTAAGCCCATGACTTTCGCCCTATCACATGCCGCCAGCGTGGATAACGCCGACCCGCGCGAGGTTCGCGCCCGCAGGTTAACCCTGCTGCTGATCGCCGCGCTGGTTGTGACCTGCGGCGCATCGGTGATGTTCGGCGCATCCGGCACGTCGGTGACCAAGGTTCTAGGCCAATTGTGGCGCGGCGAAGAGATCGCGCTGATCGACCAGATCGTTCTGCTGCAAGTGCGCATCCCGCGCATGGTGCTGGGCGTGCTGGTGGGCGCAAGCCTTGCCGTCTCGGGCGCGGTCATGCAGGGTCTGTTCCGCAACCCGCTGGCCGATCCTGGTCTGGTCGGGGTTAGCGCAGGCGCCAGCCTTGGGGCGATTACGGCGATTGTGCTGGGCGGATTCCTTCCGGCGGCGGCGCTGGCTTTTGTCGGCGGCTGGCTGGTGCCGGCGGCGGCCTTTGTCGGCGGCTGGGGTGCGACGATGGCGCTTTATGCGGTTGCAACGCGCTCGGGGCGCACCTCGATTGCGACGATGCTGCTGGCGGGAATTGCGCTGGGGGCGCTGACGGGCGCGATCTCGGGCATCCTTGTGTATCGGGCCAATGACAATCAGCTGCGCGATCTGACGTTCTGGGGCATGGGATCGCTGGCCGGTGCGAACTGGCCAAAGGTCCTCAGCGCCGCGCCCTTGATCGTCATCGCACTGGCAGTTGCGCCGTTTCTGGCGCGGTCGCTGAATGCGCTGGCTTTGGGCGAGGCGGCGGCGGCCCATATGGGCATTCCGGTGCAAAAGATGAAATCCGTCGCGATCCTGACGGTTGCGGGCGCAACCGGCGCGGCGGTGGCGGTGTCGGGCGGCATCGGCTTTATCGGCATCGTCGTCCCGCACCTCCTGCGTCTGGCCGCAGGGCCGGATCACCGACATTTGCTGGTGAACGCGGGTCTTTTGGGCGCGATTGTGCTGCTGCTTGCGGATATGATCAGCCGCACCATCGTCGCACCTGCCGAACTTCCTCTTGGTATTGTGACAGCCGTTTTGGGTGGCCCCGTCTTTTTGTGGGTGCTGCTGCGGCAACGCGGAGTGGTCGACCTATGACTGTGACAGCACGACAGATCAGCGTCAAACTGGGGCGCAAACAGATCCTCGAGGGTGTGGATTTCACGGCCGCAGGGGGCCGCCTGACCGCCA
TCGTCGGGCCGAACGGGTCGGGCAAGACCACCTTGCTGAAAGCGCTGACAGCCGAGATCGGTAATGGTGACGGCGTTGAAATCAACGGTCGCGCGATCAATGCGTTAAAGCCGTGGCAACTGGCGGCGATGCGTGCGGTGATGCCGCAGGCCACCAGCCTCGCCTTTCCCTTCACCGCGATCGAGGTCGTGCGCCTTGGCCTGCAGGCCGGCGTCCATGCCGCCGACCGCACGCTGGCGCGCCGCGCGCTGGAACGTGTGGGGCTGCAAGATAAGGCCGAGCAGCATTACCAACAGATGTCAGGGGGCGAGCAGTCGCGCGTCCATCTGGCCCGCACCCTGTGCCAGGTCTGGGAGCCGATGGCCCATGGCAAACCGTCATGGCTGTTTCTGGACGAGCCGGTCTCGGCCCTCGATATCGGCCATCAGCTGCTGGTCATGGATATCACGCGCGACTTTGCACGCGCGGGCGGCGGCGTGGTTGCGGTCATGCACGACCTGAACCTGACTGCGCTTTATGCCGATCATGTTGTGCTGATGCGCGATGGCGCGATTCTGGCGGCGGGCGCGGTGCAAGATGTGATGACCAGCGAAAACCTGTCCCGCGCCTATGGCTGCGCGTTGCGGGTGAACCATGCCCCCACCGCAGATCACACGTTCCTGCTGCCCCATGCTGCCAGTTCACACGCGGCGTGAAGGAACTTTCTGCACTGTCATAATTCTGTTACGTTCATGTCACATTCCCTACGCACGGATCCCTTAGATCCCCGGGCGACGGTGCTGCCACAGCGCTACCACCGCACCCACGACACAGGAGTATGTCATGTCGTTGAAGACCGTTTGCGTTTCGGCAATCGCCATCGCCGCTGTTGCTGGCGCCGCTCAGGCCCGCGACAACATCCAGATCGCCGGCTCGTCCACCGTTCTGCCCTATGCCTCGATCGTTGCCGAATCGTTCGGCGAGAACTTCCCTGAATTCCCGGTTCCCGTTGTTGAATCGGGTGGCTCGTCGGGCGGCCTGCAGCGTTTCTGCGCTGGCATCGGCGAAAACCAAACCGATATCGCCAACTCCTCGCGCCCGATCCGCGCTGGCGAAATCGAGACCTGCGCCGCAAACGGCGTGACCGACATTATCGAGGTCCGCGTCGGCTATGATGGCATCGTGTTTGCCTCGGCGCTGAACGGCCCGGAATTCGCTTTCACCCCCGCTGACTGGTACAAGGCGCTGGCCGCTGAAGTCGTCGTCGACGGCGAAATCGTCCCGAACCCCTACACCACTTGGGACCAAGTAAACCCCGCCCTGCCCGCACAGCAGATCCTGGCCTTCATCCCCGGCACCCGTCACGGCACCCGCGAAGTGTTTGACGAAAAAGTTCTGGTTGCTGGTTGCGAAGAATCGGGCGCGGCTGAAGTGCTGAGCGCTGCACGCGGCGACGAGGCTGCCTGCGTTGCCCTGCGCACCGACGGCGTCTCGGTCGACATCGACGGCGACTACACCGAAACGCTGGCCCGCATCGCGGCAAACCCGCAAGCGCTGGGCGTCTTCGGCCTGTCGTTCTACGAAAACAACACCGACACCCTGCGCGTTGCAACCATGTCGGACATCGAGCCGACTGTCGAAGCGATCGCAACCGGCACCTACCCCGTGTCGCGCCCGCTGTACTTCTACATCAAGAAAGCCCACATCGGCGTCATCCCCGGCCTGAAAGAATACGCTGAATTCTTCATGTCGGACGACATGGCTGGCCCGGCTGGCCCGCTGGCCCAATACGGTCTGGTGTCCGATCCGGAACTGGCCGAGACCCAGGCCCTGATCGCCAACGAAACCGTGATGGCTTCGAACTGATGATCAAGCGGGGGCGGCCAACAGGCCGCCCCCGGCAACTTTCGCGGCAGGACTCCCATGCCTTTACTTTGGACGCTTATCGTCATTCTCGCCATCGCGGCGATTGGTTACTGGCTAGGCCGTAGCCGCGCTATGCAATCCGCTGGACACAGCAC
CCGCGCGCTGCACTCGCTTCCCGGATATTACGGCTGGAACGTTGCCATCTGGGCGATGGCTCCGGCTTTTTTGCTGATTTTGGTCTGGCTGGTCATCCAGCCCGCCTATGTGAACCATATCGCCCTGCAAGCGTTGGGTGATGCCGCGCAAAACAGCGGCTCTGCCAGCCTGCTGCTGGCCGACGTCCACCGTATGGCCGACCAACTGGCCGCAGGCAGTGCCATCAGTGCCGAGGGTCCGACCGCCGCAGCCGCGCAACTATCGTTTGCCGCGAATACGGCGGGCCGCATGTGGATGAGCATCGCGGCCATTGCACTGGCCCTTGCCGGCACCGCTTATGCCTGGTCGCGCACCAATGCTCAGTTCCGCGCCCGTCCCCGTGTCGAGGCTGCCGTCCGCACCCTGCTGATCGCATCCGCATCCATCGCGATCCTGACCACCGTCGGCATCGTGGTCGCGCTGATTTTCAATACGATCGCCTTTTTCCAAGCCTATCCTGCGCTGGACTTTTTCTTTGGCCTGACCTGGAGCCCCTCTGCAGGCGGCGTCAATTCGCGCCTTGGAATTTTGCCGCTGCTGTGGGGCACGCTGTATATCTCGTTCATCGCGCTGCTGGTTGCGGTGCCGCTGGGCCTGTTTTCGGCGATCTACCTGTCGGAATATGCAAGCCCCCGCATCCGCGCCATCGGCAAGCCGATGCTCGAGGTGCTGGCCGGTATCCCCAGCATCGTTTACGGCCTGTTCGCCCTGATCGTCGTCGGCCCGCTGCTGATGAGCTGGTTCAGCCCGACCGGTATGCTCGGCCTTGGCTGGATGCGCGGTGGCACGGCTGTTATCACCGCAGGCGTCGTGATGGGGATCATGATCATCCCCTTTGTCTCGTCGCTGTCCGATGACATCATCAACGCGGTACCGCAAAGCCTGCGCGATGGCTCGCTGGGCTTGGGTGCCACGCGGTCGGAAACGATCCGTCAGGTTGTGCTGCCCGCCGCGCTGCCCGGTATCGCCGGTGCGATCCTGCTGGCCGCCAGCCGTGCCATCGGCGAGACGATGATCGTCGTGATGGGCGCAGGTGCCGCCGGTGTGCTGTCGCTGAACCCGTTCGATGCGATGACAACCGTCACCGCAAAGATCGTCAGCCAGCTGACCGGAGATGCGGATTTCTCGTCGCCAGAAGCGCTTGTCGCCTTCTCTCTTGGGATGACGCTGTTTGTCATTACGCTGGGGCTGAACGTCCTCGCGATGGCCATCGTGCGCAAATATCGCGAGCAATACGAATGACCGATATGAACACAACCCCGAAACAATCGCTGCTTGCCCCCGATGCGCGCACCAAAAAGCGCAACGCAGCCGAAGCGCGGTTCCGCGCCTATGGGATCGGCGCGCTGCTGATCGCCATGGGCTTTCTGATCGCGCTGATGTGGTCGATCTTTGGTAATGGCATCGGCGCCTTCACGCAGACATTCGTGAAAATCCAGGTGCCGCTGGAAGAATCGGTGCTGGACCCGAACGGCAACCGCGAACCGGCTGATCTGGCACGTGCCACAACGCTGCGCTACGGCTCGCTGCTGCAAGGCGCGATGGCCTCGACCCTAGAGCGCGAGGGGATCACCACCGATATGGAGCCTGCCGCGCTGGCCCAGATCCTGTCGTCCTCCGCCGCCGCCGAAGTACGTAACCGCGTGCTGGCCGAGCCGAGCCTGATCGGCCAAACCGTCGAACTGGAGGTTCTCGCCTCCTCGCGCGTCGATGGTTACATGAAAGGCCGTGTGTCACGCGAAAGCCTGGCCCGCGACCGTAATCTGTCGCCCGCCGCGCTGAACGTCGTGGACCAGATGCGCGATGCCGGCGTGATTGAACGCCGCTTTAACTGGAACTTCATCTTCGGCTCGGATGCATCCGATCAGCGCCCCGAACAGGCCGGTATCGGTGTGTCGATGCTGGGTTCGCTTGCGATGATGCTGGTGGTTCTGGCGCTGTCGCTGCCAATCGGCGTCGC
CGCCTCGATCTATCTCGAGGAATTCGCCCCCCAGAACCGCTTTACCGACCTGATCGAGGTGAATATCTCGAACCTTGCAGCGGTGCCGTCCATCGTGTTCGGTATCCTTGGCCTTGCGGTCTTCATCCAGATCATGCACCTGCCGCAATCCGCGCCGGTGGTGGGGGCCTTGTGCTCACGCTGATGACGCTGCCGACGATCATCATCTCGACCCGCGCGGCGCTCAAAGCGGTACCGCCCTCGATCCGCGACGCGGCTTTGGGGATCGGGGCGTCGCGGATGCAGGCGGTGTTCCACCATGTGCTGCCGCTGGCAATGCCGGGCATTTTGACCGGCACGATCCTGGGCCTTGCACAGGCCTTGGGGGAAACCGCGCCCTTGCTGCTGATCGGGATGGTGGGGTTCATCGCCTCGAACTACCCCGGGAGTGTCGACGCTGCATTCAACGCACCGAACTCGGCCATGCCTGCACAAATCTATGAATGGGCCAAACGCGCTGACCCGGCCTTCTATGAACGCGCTTGGGGCGGGATCATCCTGCTGCTGGTGTTCCTCTTGCTGATGAACTTCCTCGCCGTTCTGCTGCGCCGCCGGTTCGAGCGTAAATGGTAAGCCTGCTTAAAGGTCATAAAAAATGAACAACCCCAAGATCATGGCCCGCAATGTGCAGGTCTATTACGGCGACACCCACGCCATCAAAGACGTGAATGTCGACATCGACGACCGCACGGTCACCGCCTTTATCGGCCCGTCGGGCTGCGGGAAATCGACCTTCCTGCGCACGCTGAACCGGATGAACGACACGATCGCAAGCGCGCGTGTCGAGGGCGAGATCCTGCTGGACGCCGAGAACATCTACGACCCAAAGGTCGATCCGGTGCAATTGCGCGCCAAAGTGGGCATGGTGTTCCAAAAGCCGAACCCCTTCCCCAAATCGATCTATGACAACGTGGCCTACGGCCCGCGCATTCACGGTCTGGCCAAGAACAAGGCCGATCTGGACGACATCGTCGAGCGCGCCCTGCGGCGCGGCGCGATCTGGAACGAAGTGAAAGACCGCCTGCATTCGCCCGGTACCGGCCTGTCCGGCGGCCAGCAACAGCGTTTGTGTATCGCCCGCGCTGTCGCAACCGAACCCGAAGTGCTGCTGATGGACGAGCCCTGCTCGGCGCTTGACCCGATTGCAACCGCTCAGGTCGAGGAGCTGATCGACGAGCTGCGCGCCACCTATTCGGTGGTGATCGTGACCCACTCGATGCAGCAGGCCGCGCGCGTCAGCCAAAAGACCGCCTTCTTCCACCTCGGCAATCTGGTGGAATTCGACGACACCACCAAAATCTTCACCAACCCCGAAGACCCCCGCACCGAGAGTTACATCTCGGGCCGTATCGGCTAAGGAGAGCGTGATATGAGCGAACAGCACATTGTCTCGGCCTATGACCGCGACCTCGAGACGATCCAAGCCCTGATCTTTAAGATGAGCGGTCTGGTCGAGGACGCCATCGGCCGCAGTATCGAAGCCCTTTCGACCCGCGATGTCGAACTGGCCGAACAGATCCGCGCCGCCGACAAGCAAATCGACGCGCTGGAAGAAAAGATCAATGACGAGGCCGCCCGCACCATCGCGCTGCGCGCACCGGTGTCGAAAGATCTGCGCATCATCCTGTCGGTGCTGCGGATTTCGTCCAGCCTCGAGCGGATCGGCGATTACGCCAAGAATATCGCCAAACGCGTGACGGTGCTGGCAGAACAGCGCGCCATCACCGAATCGGACGCCACGCTGCGCCGCATGGCCCGCGAGGTCGAGCGGATGCTGAAAGACACGCTGGACGCCTTTGTGCAGCGCGACGCTACGCTGGCGCAGGAAATCATCGGGCGCGATACTGAAATCGACCAGATGTATAATGCGCTGTTCCGCGAATTCTTCACGCATATGCTGGAAGATCCGCGCAACATCACCGCCTGTATGCATCTGCACTTTGTGGCCAAGAACCTGGAAC
GGATGGGCGACATCGTCACCAATATCGCCGAGCAGGTCATCTATGTGACCACCGGCAACCGTCCCGAAGAGCCGCGCACCAAAGAAGATGAAACCCCCTTCATCGGTAAGGTAGACTAAGCAATGGCAAGCCAACTGCCCCATATTCTTGTCATCGAAGATGAACCCGCCCAGCGCGAAGTTCTGGCCTATAACTTCGAAGCCGAGGGTTACCGGGTATCAACAGCGCCGAACGGTGATTCGGCGCTGTTGCAACTGGCCGAAGAGCCGCCAGATCTGATCGTCTTGGATTGGATGCTGCCCGGCGTTTCGGGGATCGAGATCTGCCGCCAGATCAAAGCGCGGGCTGAAACCCGTGCGATCCCGGTGATCATGTTATCGGCCCGTTCCGAAGAGGGCGACAAGGTGCGCGGTTTGGAAACCGGCGCCGATGATTATGTGACCAAGCCCTATTCCATCACGGAACTGCTGGCGCGCGCCCGCGCCCAGCTGCGCCGCACCCGCCCCGCCACCATCGGCGGCGTGCTGCGGTTCGAGGATATCACCCTTGACGGCGAGACCCACCGCGTCACCCGCGACGGCAACGAGCTGCGCCTGGGCCCGACCGAGTTTCGCCTGCTGACCACGCTGATGGAGCGTCCCGGCCGCGTCTGGTCGCGCGAGCAGCTGCTGGATCGCGTCTGGGGCCGCGATATCTACGTGGATAGCCGCACCGTCGATGTCCATGTCGGACGTCTACGCAAGGCCTTGATGATCCACGGCGGCACCGACCCTCTGCGCACGGTGCGCGGGGCCGGATATGCGCTGGGTTGACCTTTGACAGTTGACGTTTGGTCGTCGCAAGGGGAGCGATTTTTCCCCTTGCGCGCCAAGGTCGAAACCCATAATTAGCCCATACCAACACGGAAGCGGGCGTAGCTCAGGGGTAGAGCATAACCTTGCCAAGGTTAGGGTCGGGCGTTCGAATCGCCTCGCCCGCTCCAAGTTGGTTACCCGGCCGGATAGATCAGCCTGTGCTGACATCTCCGTCCACGGAAGCGGGCGTAGCTCAGGGGTAGAGCATAACCTTGCCAAGGTTAGGGTCGGGCGTTCGAATCGCCTCGCCCGCTCCATATTAAACACAAAGGCCCGGCCGTCAGGCACGGGCCTTTTGTTTTGCCTAGAGCCTGCGACATATGCGTCAAGTTTAGGCGACAGACGCACCAAGCCTTGCCCCTGCCGTCTTGTGTGACAAGTTAGGGGCAGACGCCTGATCCCGAAAGCCTGAAGATGACTGACGATCCCGCAGGACAGCCGCCGCGCGCGCCTGCCGAAAACCCGATTTTGGTTGTGGGGCTGACCACCGCTATTCAGGCAATGACATCCTATGGTTTGCTGTCACTGCCGGTGGCCAGCGTGTTCTATGCCGCCGATTTTGGCCTGCCCGCATGGATCGTCGGGGTGCAGATCTCGGGCATTTATTGCGTTGCGCTTTTCAGCTCGCTGATCGCGTCGAATATGGTGCGGCGACTGGGCGGCGGGCGTACATCACAGATCGCGCTGCTGGCGATGGCACTGGGCGTGGCCTGTATCGCATCGGGTCTGGGCGCGCTGCTGCTGCCCGGACTGGTGCTGATGGGACTGTCCTATGGGCTGCCGAACCCCGCCGCCAGCCATCTGCTGCGCCGCTTTACGCCCCCTGCAAGGCGCAACCTGCTGTTTTCCATTAAACAAGCGGGCGTGCCGATCGGCGGCGCGATGGGCGGCATCGTCACCGCATGGATTGCGCATCACGTCAATTGGCAGGCGGCGCTGTGCCTGCCTGCGCTCCTCAGTCTGACGCTGGGCGTGGTTTTGCAACTGGTGCACAAAGGCTGGGACGATGACCGCCAACGCGACCAGCGCGTGCTGCAAGCGCCGCTACGCGATCTGATCCGAGTCTTTGGCTTTCCGCGTTTCAAGGCGATCTTTGCCTCGGGCATGTTGCTGGCGGCGGCACAGCTTTGCGTGTCGACCTTTATC
GTCCTGCTGCTGGTGGTGGATCTGCAGATCGACCCGATCACCGCCGGCGCAGGCCTGTCACTGCTGCAAATCGCCGGCATCTTGGGGCGGATCAGCACCGGCGCGCTGGCGGACTTTTTCCGCAGCGGTCTGCGCGTGCTGATCTGGCTGGCCTTGGCACTGGCCGCGACCACGCTTGTTCTGGTGCTGACGCCTGCCCCTTCGGCGCTGCTGCTTACCGCGCTGCTGATCGTGATCGGCCTGCTGTCCAGCGGCTGGTCCGGCGTCCTGATTGCCGAGGCCGACCGCTGCGCCCCGCCCGCCTATGCCAGCGCTGCAACCGCTGCGCTGATGTGCGGAACCTTCTTTGGCGTCATGATATCGACCACCGCCTTTGCCGGGATCGTGCAATTATTCGGCACTTATCGCACGCCATTCGCCATGATCGCGATGGGCTGTATTGTGGCGGCGGGCCTGCTGCGCATCGCCTATCGTGCTGATATAAATGATAAAGAGGTCTAACCCTGACATATATGTCAGTCAGATCGGCGGCTATTTGCCGATCAGGTGATAGGTTTAACGGCCGCGCAGACCCGCGTGCCTTGATGCGCCCCCTTTGCACCCCCACTTGCAATGTGAATGCGCCAATCGAGGAGACGCAAGTGAAGTGCCCGGTAGATAACGAGACCCTGGTGATGACATCCCGCAACGGGGTCGAGATCGACTATTGCCCGACCTGTCGCGGCGTTTGGCTGGATCGCGGCGAGTTGGACAAGATTGTCGAGCGTTCCGAGCAGGTCGTTGCGGCCGCGCCTGCGCCCGCAGCGGCCCCCCAGCCCGATCGTCACCGCGATGATGACCGTGGCGGCCGCTATCGCGATGACCGTGGGCGCGGGGATCGCTATGACGACGATGACGACGACCGCCGCGACGGACGGCGTGGACGGCGCCGTGAAAGCTTCCTTGGCGATCTTTTCGATTTCTAAACATGACCCCCGGGCCGCATACGCCGCCCGGGGTTTTTCTTTAAGACGTGCGCGCGTCCCAATAGGCGCGTGCCGCCGCCAGACAATCGCGAAACGCCTGAAGGTCACCTAAAAGATCAGCGTCGGCCGCGACAGCACCCACGACAAAGGCGCCGCCGCTTTCCTCGGCTGCTGTATCCAACATCAGGTTGTCGACCATGTCGTGGTTCTCGATCATCAGCCCATCGGCCCCGCGCGCCACGCCGTCGATCACCTCGGGCTGGGCGCCCGCGTAATAATCGGCACTCAGGCGCGTGAAATAGTCGCGATCCGTGTTCGTATAGGCAAACACAGGTTTACCCTGTGCACGCATATAGCCCACCTCGAACGCGGTACCGACATCAGCCGATATCCCGCGAAACGGGGTCAGATTTGCGATGATAAAATCCGCCTCGTCCATCTTGCGCTCGTTCGCAAGGCCGATGCTGATGCCGAACTGTCGCGCGGTTTTCCCGACAGGGTCCAGATCATCCTCAGCCAGGGTCGAGGGATCGAAACCGTATTCCAGCGCGATCTCGCCCTTTTTGCGCATGACCTCGGAGGCATTGCGAAAGAAGACATCCGGGCCCGCAATATAGACCTTTTTACGGATGTGGCTCACGTCAGCTTGCCTTCGTCCAGCGCCTTGATGATCCAGTTTTTCCAAGTCTGATCCTTGCCTTCCATCTCGGGGATCACAACGGACGCTTCCTCCATGAAAAACCGGTAATCGCGGCCCATTTCTTGGCCCCGACCGGTGTGCATATCCAGCGCGATATCCAGGATCTCGGGCATACGCTCGCCCAGCTCAACCGAGCGGCGCGCCCAGTTCACCAGATCATCCGATGTGCGTTCCTTGACCGAGCCTGCGATCATGCGGATGGCATGGGCGGCGAACAGGAAGCGGTCGCCAGCCGGGCGCGGATAGCGCATATGCTGCTGATACAGCGTCTCGATAATCGAGGGCAGGCCGGGATTGCCCAGCCCCACATCCTCGACCGCGATCACGCACAGGCG
CGACCACAGCATTTCTTCCATTTCGGGGCTGGTCAGGAACATTTCCCAGCCCAGCAGGATTGCATTTTCCAGCAAGCCGCGACGCAGGCTTTTTTGGATACACGAGATAACCTCATCCGCGGCAAAGCCGTGCTGGGTCGTCGTGCGTTGCCACGGATCGGGCGGTTGCTGCACCTTGATTGCCATAATAGTCTCCTTCGGGGTTGGTGCGGCGCTTACGACCAGCGGGCTTTAACGGCCCGCTCGGCCCGATGCGCGAGGAAAAAACAAAGCACGGCCGTCAGCGTCACCACCAGCGAGACGCCAAAGGCGCGCTCCATGGCGAAACGGCTGGTTGCCTGTGCGAAAACATAGCCCAGCCCGCTGCGGCCCATCAGATACTCGGCCAAAATGGCAGCAAGAACCGATTCCGGCACGATCAGGCGCAGGGCGATCATCATATCCGGCACGGCAGAGGGCAGGACGAGGCGGCTGAAGCGCTTGAGACGCCCGGCCCCCAACACGCGCATCAGGTCGTCCGCCCCGCGCGGCAACTGTTGCAGGCCTTTGCCGACAAAAACAAAGGTCGGAAAAAAGGCGGCAATGGCGACGATGGCGATGACCGTTGTGCTGCCATAGCCAAGTATACGCGCAATGATCGGGATCAGCGCCACCACCGGCACGGATGAGAATATCAGACCAAACGGCGTCAGCACCCCGGCCAAAAACCGCGAGGCATAGGCGAGACAGGCCACCAGCACACCAAGGGCGGAACCGATCAACAACCCGGCAAAGGCCGTCCACAAGGTTTGCAACGTATTGACAGCATAGAGCCCCGGATTGCCCAGCATATCCATCAGCACTGGCAGCGGCGCGGGCAAGACGATGGAATTCAGCCCCGACAGCGAGACCCCCAGTTGCCAGATCGCCAGCAGCAGCAGGATCGGCCAGTTGCGCGACAAAAAGGTCATGCGAACCTCCGCGCGACAAGGCGTTCTGCTTGGGTCATCAGCAGATAGAGCGCAAGTGATGGCACCATGACCAGCAGCACCGCCGACCACAGCAGCGGGATCTGGAAATTCTGCATGGCGGAAATCATCAACAACCCCAGTCCGCGCGAGGCGCCGAACCATTCGCCCACGATCCCCCCATAAAGGCGACCGGTACAGCAAGCTTCAGGCCGGACACGATCGCGGGCAGCGCCGCAGGTATTTCCAGACGAAAAAAGCGGGTTTGCGGCTTTGCGCCCAGCACGCTGAACAGATCATGATGGGCGCGATGCGCGGCCTCGAGGCCCGAAGTCGTCGCGACATAGACAAGGTAAAAGACCGCCAGCGCGGCAATCGCGACCGGGATCTGGTCGCGCTGCAACAAAACCATGAACACAGGCGCAAGGGCGATGCCCGGCGTTGCATGGATCAAGGCGACAAAGCGGTCGATCCCCGGCCGTGTCACCCGCCAAACCCGCACCAGCGCAGCCAGCGCAAAACCGATCCCCACGCCAAAGACATAGCCCAGCCCGACCGCCACAAAGCTGGCTGTAGCCGCGCGTTGCAACAGCGCATGGTTCGCAGGATTACCCAAAAATCCCAACACGGTGCTCAGAGGTGGCCAGGTCAGTCCGGCAAGACGATACTGGCCGATCAGCTCCCACCCCGCTGCGAAAATGACGATACCTAAAATGCCAGGCAGCAGCTTTGTCAGCCGTGTCATTGCGCACCCGATGGCTCGAGCGCCTCGGTCAACTCATCGACAAGACGGTGGAATTCAGGATCGCGCATCACCGAGGGATCACGTGGGCGGCCAAAGGGCACGCGCAGATCGCGGATGACACGGCCCGGTCGGCCCGACATGACCAAGATACGGTCTGCAAGGAACAAAGCCTCGTCCACCGCATGGGTGACCAGCAGCGTCGTCAGCGTGCGCGTCGACCAGATACGCTGCAGTTCCACATTCATGTGGCGGCGTGTGACCGCATCCAATGCGCCAAAGGGCTCGTCCAGCAACAAAACATCCGGCTG
CAATACAAGCGCCCGCGCAATGGATGCGCGCTGGCGCATTCCGCCTGACAACTGGCTGGGTCGCGCGTGTTCAAAACCCGTCAGCCCCACCAACGCGATCAGTTCGGCCACGCGGGCATGATCGACGCTTTGCCCGGCAACTTGGAATGGCAGCGCGATATTCTGCGCGATCGAAAGCCATGGCAGCAAGGCGTGATCTTGAAATGCAACGCCCAGCCTATGCGCCTTTGACAGCGCGGCGGGGCTGCGTCCTTCGATACTGACACTGCCGGTGGTTGCCGTGTCCAACCCTGCCACCAACCGCAAGATCGTGCTTTTGCCACAACCCGAGGGCCCCAGCAGCGCGACGAATTCACCCGGTGCAAGGCTTAGATCGATGCCCTCTAGCGCGGTCAGCTGGCGGCCACCCTCGAGTTTAAAGCTTTTTGAGACACCGCAAAGGCTGATGGCAGGATCTTTGGTCACGCGCGTATTCCGTCAAAGGGATTCCCCGGGCTGCACAGCCCGGGGAGATAGGGGATTAAAGTGAGGCAATCGCCTCTTCCAGCGGGCCAAGATCGATCAGATCGCTGGCGGCGGGCAGCCCTGTCAGGCCTGCGGCGGCGGCAACAGGCAGAATGTTCGTCTCGTACAGCGCGGGATCGAACCAGAATGGGCCGGGCGCGCCTGCAACGGCCGTCAGCGGCTGGCCCAGCTCGCTCTGGCGGATCTGTTGGTCCAGATCGAGGCCGAAATCGACACCATATTTCGTCGCACCCAGCTCGGCCCCATAGGCCGGGTTGGCGCCGTTCTCGATCCAGCCACGCAGCAGCGCGCGCAGATAGCCAACGACCAGCGGACGATTTTGCGCCACATAATCGCGCTTTGCGACAATGGGGCCTGCGGGCACATCATAGCCCAAATCGCGCAGCAGCGTGACGTGGAAATCCTGACCCGCGACCATGCCCATCTTTTCAAAGGTGATCGGCTGGTTGGTGGCAAAGGCCATATAGACATCGCCATCGCCCGCCAGCAGCGGCTCGGGCGAAAAGCCGGTCGGCACCATCTCATAGTCCAGGGGCAGACCGGCTTTGGTCAGAATGAAATCGATGGTGTTCTTATCCGCCGGCATTTGGCTGAGGATACGCTTGCCCACCAGATCCGCCGGGGTCAGTACGGGGTTGGCAGAAAGCGACATCAGCGCCGCAGGGTTGGCAGGGAAAGCGGCGCCCAGAACAACGAAATCATTGTCACGGTTCAGCGTCTCGAACAGCGGGATCCAGTCGCCACCTGCAAAATCCGCCTGCCCCGCCGCCAGTCGCACCAAAACGCCCGGTGCGTTCGGGCCGCCCGGCGTATAGGCGATCTCGATCCCCTCTTCGGCGAAATACCCCTTTTCGATCGCGACCCACAGGCCGGCATATTCAGCGTTGGGAATCCAGCCCAGCGCGGTCGAGACGACCGGCATGGATTGCGCAATCGCCATACGCGGCAGGCTGAAACCTGCAGCAGCCGCACCGCCAAGGGCCAAAAGGCTACGGCGCGAAAGAGGAAGAGCAGTTGTCATCGTCTTGAGCATCCCTGTAAAGATTTTAGGCATCTGTTGCAGGCATGTAGTTTTTCTTACATCGTCACAATCAAGATGTAATATGATTTACACGTAATTTTTTTCCGGAGACCCGCGATGAACCACGGCAACCGCCTATTTTCTGGGCTTCCGGCTGAAAGTACAGTCGCGCGGGCCATTCAAAACGCCCTGCCCGCCATGAGCGAGGCACAGCGCCGCTTTGCCGCCTTGGTTCAGGCCGAGCCGCTGCGCGTCGCACGTCTCAGTATCAATGATGCCGTTTCCGGGGCGGATGTGTCGGTCGCCACCGCAAACCGGTTCGCCACAGCGCTGGGTTACGCAGGCTATCCCGAATTTCGCGCCGATCTGATCCGTGCGTTCGAGGATTTCTTTGTCCCTGTTGAACGGCTGAAACGCCGTCAGGCCGAAAAGCGCAGTGCTATGGATATCGCGC
AAGCCGCCTTTGCCGAGGATCTGGAAAGCATTGGCGCGACCGCCTCCAGCTTGGACAGCGCCTCGCTCGAGGCCGCTGTGCAACAAATCATCGCGGCGCGCCGTGTCTTTGTCGCGGGCTTTGACCTTTCTGCGCATCTGGGCGGCATGTTGGCCATCGGCCTCGTGATGACCGGCTGTGATGCGCAGACCGTCCCCTCGGGCGGTGGCGCGGTAGGGGCGGTTCGCACGCTGACCCGGATGGGGCCGCAGGATCTGGTCATCACCATCGCCTTTCCGCATTACTATCGCGACACGATAGATATGGCGGGTTTCGCCAAAGGCGCGGGCATCCCGGTGCTGGCAATCACGGATAGCCCGCGCTCGCCGCTGGTGCCGCTGGCGCAGGTCGCGCTTTATGTGACCGCAAAGCAGGAGTTGAACGCGCCCTCCCCGTCCTCTGCCGCGATCCTGAGCCTTATCGAGGCATTGGTCGCCACCGTTGCCAGCCAGCGCCCCGAAGCCGCCGAAGCCAGCGAGCGCTTTGCAAGCTCGGCCTATCCATGGATGACCAATCGCTAATCTGATTAACCGCTTCCTTACCCTTGCGGTGCTAAAACCTACGCAGCTATCTGGGGGATCGGTAAGGGGCGATGAAGAAATGCTTGGTCGTCGGGGCAGGCCTTTCTGGCGCGGTGATTGCCCGTCAGCTTGCTGACGCAGGACAGTTCATCACCGTGGCCGACAGCCGCGCCCATATCGCCGGCAATTGTCATACCGCCCGCGATGCCGATACAGGCATCATGGTTCACACCTATGGCCCACATATTTTCCACACCGATGATCGCGAAGTCTGGAATTACGTGAACGCCTTTGCAACTTTCATGCCCTACCAGAATCGTGTCAAAACCACGACCCGGGGCGCTGTTTACGCGCTGCCGGTCAATCTGCACACGATCAATCAGCTGTTCAATACCGCCCTGCGACCCGACGAGGCGCGCGCCTTTATTGCTGCCAAAGCCGACATCACAATCACAGACCCGCAAAGCTTTGAAGAACAGGCGCTGCAAATGGTCGGCCGCGAGATCTACGAGGCGTTCTTCAAGGGCTACACAAACAAGCAATGGGGTTGCCCGCCCAGCGCCCTGCCTGCGGCGATCCTCAAACGCCTACCGCTGCGCTTTTCCTATGACGACAATTACTTTTGCCAACAATTCCAAGGCATTACAAAGGACGGCTATACAGCGATGGTGGCGCGCATCCTGGATCATCCGCGCATTGCAGTGCGGTTGAACACCCATGTCAGGCGCGACGAGATCAAAGGCTATGACCATATCTTCTATTCTGGGCCCATCGATGCGTGGTTTGGCTATACGCTGGGCCGACTGACCTATCGCACGCTGGAGTTTGAGCGTTTTTATCACGACGGCGACTATCAAGGCTGCGCCGTCATGAATTATGCCGATCTTGAGATTCCCTATACCCGCATAACCGAGCATAAACATTTCGCACCATGGGAGCAGCATGCCCGTTCGGTACTGTACCGCGAATTCTCACGCGCCTGCGGGCCGGGCGATATCCCCTTTTACCCCACGCGCCAATCACGCGACAAAGACCTGCTCCACAGCTATGCCGCGCTGGCGGCACAGGAAACGCATGTGACGTTTATCGGCCGCCTGGGCACCTATCGCTACCTCGATATGGATCAGACCATCCGCGAGGCATTGGACTGTAGCCGCGCATGGCTGAAAGGCGCGGGCCAACGGCCGACGGCCTTCCACCTTGATCCGGCTTAAATCGCGCCAAAATTGCATATGACATTTTCATACGCGGTGGCAGTTGCGTCAGCGTGCATCAGCGGACTTTGATGGTCAAAAGCCCCTGCAAAGCCCCCATCTTGGTCCAGCCGCCTGATCATAAGCGCCGCCAGATGCACCAGACATATCAATAAATTGACGCATCACGTCGGTATTACATAAAAGGTTAAGAAACGCCGATATGAAAGAATCACATGCCAACCTT
TACCCTTCAGATCCGCACTCCCGCGGCTGGTTACGACGTTGTGGACAACGATCAAGGTAAGCTAAGCACCTATTCCGACGCCATAATTGTCAATACCGATAGCAACGTCTTCTCTTCGGCCGCTTATAGATCGGCGTTGGACGTTTATCGGGAACAGACGGCGGGGACCAATCCCAGCTATGACCAATACCTGACATATCTTATTGATCTACTGCCCCAATTCGGCGTGAATACAACAGCCGAAGATTTCACACAACAACATGGCCCCACCTTTCCTGCGGGAAGCTTTGTCTTTTTTGGTCATACTTGGAGCAGTTCACCAGGCGGAAAAGACTTTGATGTTATGGAGGTTTTGCTGCTCGATCCTGAAAAATTGACGTCCCAAATCAATTATGTCGCACCGGTATATGGATTTATTGGCGATATCCCAGACGCGGGCGAGCCATTTCAAGTCATGCATAATACGCATCAAGCGGGATATCTTCCCGACAAGCCTCCAATTTCATTTGATATTTCTCGCTACATGCTTGCGCCTTGCTTTACGGCGGGTACATTCATCGAAACAGATCGCGGGGATATCGCGATCGAGGCGTTACGCATAGGCGATCTTGTCAAAACGATCGATAATGGACTGCAACCCATTCGCTGGATCGGCTCAAGCAGGATCTGCAGCAATGCACTTTCGTCAAATACCAAACTGCGGCCGATCCAAATCAGCGCCGACGCCCTAGGTGCGGGTGTCCCGGCGCATGATCTTGTTGTTTCGCCGCAGCATCGCGTGCTGATCCGCTCTAAAATATCCGATCGGATGTTCGGAGCCGCTGAAGTTCTTGTTCCAGCGGTAAAATTGACCGCCCTGCCCGGCATTTTCACCGATAACTCCTGTGATCCGGTGGAGTATTTCCATATTTTATTCGATCAACATGAGATGGTGAGGTCAAACGGCGCGATCACCGAAACCCTGCACACCGGCCCAATCGCATTGCGCAGCCTGTCGAGCGCAGCGCGCGCCGAGATTTTCGCAATCTTCCCAGAGCTCGCCGCGCTTGGCGTGCCAAGGCCGCTTGCCCGTTCAACGCCTGCGGGCAAAGATGCGGCAGCACTGATTGCGCGGCATTTGAAAAACCAAAAGCCTGTCCAGATCGGGCTATAATTGCGGCCATTACACAGGCAGGCCCATCGGCAGACTTGCATGGCCGCCTGCGCTATTTGGGTTTGCGCAGCCATCAGCCGATTTCAACACGCGACAGGGGCCTAAACGCCAGCCGATCTGAACACAAATGTGAGGTCGCAAATCCCCGCAGCCTTGGATATGACAGCGGGGCCCGCCATGCGCCCACAGCCATTGGCGCATGTGCTTTCGATAGCTAAGAGAGGCGCAATATGAATGCGTTGCGTGTAGATTTCGTAACTGGAGCGGTCGCTTATAGGCTAAGGTCCGGTGGCGGCAAATCACAACCGATCGCGCGCGCGCTGGGCTTTCGCGCCGGGCAAGACATGAATGTCGTGGATGCGACCGCTGGCTTGGGGCGCGACTCATTCTTGTTCGCTTCGCTGGGCGCGAACGTGACGATGATCGAGCGTTCTGCCCAGATGTACGCCTTATTGCGCGCCGGCATGGATGAAGCGCGCGCCGCAGGGCCTGAATTTGCAGACATTATCAACCGAATGACATTGCTTCATGGCGATGCAATGCAGCTGCTGCCCGGCCTGTCGCCCGATGTCATATTCGTCGACCCGATGCATCCCCCGCGCCGCAGCTCGGCATTGGTTAAGCTAGAGCTGCGCCAAGTGCGCGAGATCGTCGGCTTTGACGAGGATGCCGCCGATCTCATGCGCGTCGCCCTTGCGCATGCGAAAAAGCGAGTCGTCCTTAAATGGCCGCGCAAAGGTGACGCCATGGCGGGCATCCCCGCCCCTTCGCATCAGATCTTGGGGAAATCGACACGGTATGACGTGTTTATCACAAAACGCGGTCCTGGCGTTTAGCACAAGGCAAC
CACCGCATCGCGGCGCAAGACTGCGGTTGAAGTGAGGGGACATTGCCCGCGAAAACATGCCGATTATGATCCGCATGGTTGCGGAAGGCTGTGATCTATAGGGGGATAAGGCTGGTGCTGTGAGAGAGGATTGAACTCTCGACCTCTCCCTTACCAAGAATGTGATCTGTGCCCAAGTATTTGATATTTGCACAATGAAGCGCTCTTTTTACAGCCTACACAGGCCGATGTGCAGCCTTTGTTCCCATTTGCAGGTTCAACGAACCAACGAGCACTCCCCAGATAGGCTATCCAGTCCCAAGAGGCCGAATTGGCATGACAGCAAACCCATCACCTGAGCCTGAAATCCGCGGATTGTGTTGCGCGCCGCTCGAACAAGTAACGGCGAGAATCACGACCCGAGGCACCAAGCAAAATCGGCGTACCGGAAACTTCCACTAGATCTTACACGCTCAGGCATCTTCATCCCGCTAGATCCACTTGGTCACCCGCCCCACTGGCCGCCGGATCTTGCCGATCCGCTGCATCAGTTCAGGATAGGGAAACATCCATCGCATCACCATCCACTCTCCTGTCCGGCACGCCGCACATGGCCAGGGCGGGACATGGGCCTCGTGGAACGGCTCCAATACCTTGACCAGATCAGAGGCCCAGTAATACCGCTGCTGCTTGCAGCCGGTGCACTGCATTGCGATCAGCAAACTGTCGCGCGCACAATCACTGACGCTGCGATTGCGGAATTTGTTGGTGGTCGGCACCGCTGGCATGTGACGCTCTCCTAACGGACAGGAACATTACCAGAACACTTTAGCGCTTCCCATACGGAGAAATCCATGTGCAATCTCTACAGCCACACTTCCACTCAAGAGGCGATGGCGCGGCTCTTCAAACCCAATGAGGTGGTGGATCGCCTAGGAAATTATGAGCCGCAGCCAGAGATTTATCCCGACCAGTTGGCGCCGATTGTGCGGGCCGAGGGCGATCAGATCATCTTGCAAACTGCCCGCTGCGGCCTGCCGACGCCAGAGACCTATCTGGAGGGGCACGCGGTCGATCGAGGCGTGACCAACATCCGCAACACCAGCTCACCCCACTGGCGGCGCTGGCTGGGCACTGCGCACCGCTGCCTTGTGCCTCTGACCTCTTTTGCCGAGCCTGCAGGCAAGGGAAAAGGCAACGTATGGTTTCACCTTGCCGACTACCGGCCCGCCATGTTCGCGGGCCTCTATGTGCCAGACTGACCGAGCGTGCGCAAAAAGGCGGACGGCGAGACGACAGACAACCTGTTCGCCGCCCTTACTTGCGATCCGAATGCGACGATCGTACGGATCCACCCCAAGGCCATGCCTGTCATTCTGACCCAACGCGCCGCGCTGCGCACTTGGCTGCGCGCGGGTTTGAATGAGGCGCGTGCACTACAAAACCCCATAAAAGATGAGTTTTGATCGTCGAGCGGGATTAGGCTTGAAGTTAGGTTTCAATGCCTTGCAAGAGGCCGCACACAAGAATCGCTTGCTAGTATACTTTTCTATTTACTTAAATTAACGCAGGGTTAAGGTAACTTATGCCGTAAAGGCATAACGTCCGTAGACTATAATTCGCTCAAAAACGCCTGCGAAAACGAGAGTTTTTCTCGTTTAAAGTTTGTAACTTAATCGGTGATATCAGGAGCTTAACATGCCTTACTTAACTGCAGTAAATGGTGCGGTGGTTAACCTTAGTCTGCCCACACCCATCTTAACTCTTCCCTCAACCAACATTCTTAATGCCTTCGGCACATCGACGCTGTTTAGTCAGTATAACGTCGATGGGGTCGGCGATGGCTCTACGCCTGAAACGGTCCAGGGCGGCGACTATCTCGCCCCGATCATTGGCGGTTCGCCAGTGCCTGGAACTTACGCGGGTTCTGGCACTTTCCAGACTGCCGGCCTAACAGTCGGCAATGCCTTTTTGGGGGCAACGGTTCGGCTGAACCCGGTGGACGTCGACTACTTTGTTGA
CGAGAATGACCAACTTTACATCATCAGCGACGCACCTCTAGACGCAGCAAACCTGACTGTGACTATCACGGTTAACGCGCTAGGAACGTCAACGCCGCTCACACTGCCACTTACAGATCTTCTGACCAACCCAATCGTTGCCCCGGTGCTGGGACTTCTGGGTGGCCCTAACGCGGTCAACAATATCCTGAATCAGGTCATCAACTCACAGACGTTTGACCCTAACGGCACGATGACCATTCCACCTGGCGAGATTAACGACATCGTTTGCTTTGTTGCAGGCACGATGATCCTTACTCCCGATGGTTACCGTATGGTCGAGACGTTGCAGGTCGGCGACCTTGTGATGACCAAAGATAATGGTGCCAAACCCGTTAAATGGGTCGGTGTTCGCAAGCTTTCGGCAGCTGAAATTATCGTAAATCAACACCTGCGTCCCATTCGCATCAAAGCTGGTGCGCTGGGTGTAAACATCCCATCGCAAGATTTGATGGTCTCGCCGCAGCACCGTGTACTAGTTCGCTCTAAAATCGCGCAGAAGATGATACAGAGCGACGAAGTGCTCGTTGCAGCCAAGCAATTATTGCAGCTTGGAGGTATTGATATCGCCACTGATCTTACGGAAGTTGAATATCATCACTTCCTTTTCGACCAGCACGAAATCGTGTTCTCTAACGGCGCTGAAACTGAATCGCTCTATACCGGCGCTCAGGCTTTAAAAGGTGTCGGAGCCGAGGCACGTCGCGAGATCTTCGCGCTGTTTCCAAATCTTCTTGATCAGGAGAAAGCGCCTATCGAAGCACGCCCCATGCTTACCGGCCGCAAGGGGCGCAGGCTCGCAATGCGGCATATGCAAGCAAATCGGCCGCTCGTGGTATAAAAAAGTAGGAGAGCCTTCGGGCTCTCTCATCTTACCAACACCCGGACGCCACCCCGAAATCCTGATGGCCAATCAGTGCCCGCAGGGCCTGCGGATCCTGATCGGCCAGATAGTCCACGGTCGCGCCAGCCACTTGGATCTGCCGCCAGCCCGCGCAACCCGCCTCAATTCTCTCGCCGCACGAGGCCAGCGCGCTTAGCGCGATTAGCCAAGTCAACATCATCAGCGATCTGCGCATCTATCTCGTCCCGTTTTGCGTCTGCATTCGCAGCAGCCTCTGCCTGCTGCTGGCGCACCTCGGCGCGACCATTGCGCCGTCCCTGCCACCACGCGACGGCTATCGCCCCCGCGCCGATGATCATCCCAATCAGCCAGTCCATCAGGTGGCCCAGCCAAAGCGCTTGGCGAGCCGATAGGCCCACTCCGCGATCACGGCCAACACAGCGCCAAGCGCCATGACCAGCACCCCATGCAGCTCTGCATCCGTGGCGAGCTGCGCGCCAAAATCGGCAGAGATAAAGCCAGCGCTTACCAGAGCACCGGACAGATAGCGCATGATGATGCGCGCCCAGACAGCGGTCATTGTTTCTTTCCCGTAATGATGCCCCAGAAGGCTTTGAAAATCCGCAGCCAGATGCTGCCCTGATCCTCCGGCAGCGCTGCAAGATAGGCTGTTGCGATGGCCGCAATCTCGGCGGCCTTATCCTTGCCATTGACGATGGCCCGAGCGCCGACGAAATCGACCCGCGCGCCATCGATGTAATCGGCCAACTTCTTGCCGGTGAACCAACCGCCGATCATACCGTGGACAAGGATCTGCGCGGCAGCGTCCGGCTCCATCGCGCGGGACGGATCACCAATCAGATCGACGCCGATCTTGGCCGAGGCTTTTTGATAATTGGCTTTCCACGTCAGCTGCACGAACCCGCGCCCATGCCAGGGGTAATAGCGCAGGTTCTTTTCGCGCCACGCGTCGGAGAGCCAGTAAGCTTCTTCCACCGGCAACATGGTGCGGTTGGTTTCCCAATATGCGGTTCCCAGCACGTAGGCCGTCTGTGCGCGCAAAAGCCCTGCCCTGCGGCACGCCGCTAGGATCCGCCGGGTTTCCCCCAGCGAAAGA
TCGATCTGCATCGATACCTCCTGAAAATGAAAAAACCCGCACGAGGCGGGGACTAGCGAACCAATTTCAAGCAGCTGAGTGGACGTTGGATGCGCCTGCTCAGCGACTTTTCGTTTACAAAGTGCCGCGATTCAGTCTTCATAAATATCGAATATACAAAGGTGAATCTCGAAAATGGCCACATTCCAATTCAGCAGCGGTTTAGTAGTCACCAAAAATAGTTCCGGCATCTACGAACTCGACCCACCGGGATCATGGTTTGGGACTTCGGGTACAACCACTAACGGTTCATCGGGAGCGCTATTTTCGCCTGACGAAACGCTCGTTATCTCCGGTCTTTTCACCGTAAAATACCTCGGAGCAGACGACGACATAATCCTTACCGAGGTGGTAAATTTCGGCGCATTTGGGGAAACTTATCAGCCGGGCATGCTGATGGTGTTTTCACAGAATCCTTTGAATGCCAATCAGATTGCAGCACTTTCATTCCAAGCTGTCGCTTTCGATACATTGACCGGACAGGCCGTCGTCGCCCCTTGCTTTACGCGCGGCACGCTGATCATGGCGATGTCTGGCATGGTCCCTATCGAGGACCTGCGCGCCGGAGATCTAGTGGATACCATCGACAACGGGCTGCAGCCGATCCGCTGGATTGGATCGTCAACGGTTTCAGGCGCAGCCCTCAAGGCAACACCAAAGCTGCGCCCCATCCATATTTCTGCGGGGGCTTTAGGAGAAGGGCTTCCCTCCCAAGATCTGCGGGTTTCACCGCAGCATCGCATCTTGGTACGGTCAAAGATCGCAGTTCGGATGTTCGGCGCGGCTGAGATCCTGGTTCCCGCTGTGAAGCTCGTGGCCCTACCCGGCATTTACAGCGTCGATGAGTGTGACAGCGTTGAATACTTCCACATGCTATTCGATCGCCATGAGATCGTCCTATCGAATGGCGCAGAAAGCGAGAGCATGCATACGGGTCCCGTGGCGCTTCGCAGCCTCTCTTCGCAAGCGCGCGCCGAAATATTATCGATCTTCCCGGAGCTAGAGCAAATCGGCGCTGCACGCGAGCTAGCGCGCCCGGTACCAAGCGGCAAAGACCTTGCCGCCATGTTTGACCGCCACGTAAAAAATACCCAACCCATCCAGAGACCGCTGGTGTAACCCGATGGGCTGGGCCGAAGGCTTAGGTTAAAGCTATTTCCGCCAGGCTTTCCAAAGCTCGATCGCTTGCTTGGGATCATTTACCGCGATCAAGATCCAACGCATGATGCCCTCGGCCGTCAGCGTGAGCATCGCGGCCGCCACAGCCTTGGGGGTATTGGTCATGGCTGACACCCAGTCCGTGGCCAGCCACGCTGCACCGACAGCGACGATCAAGGTACTGATCACCTGCCATGCGCCAAGCTGCTGTGTGGTGCGCACCTTGACGATAAGCGCGACCGCGACGCCGCCCCAGAATTCGGGGCTGCGCAAAGGTGATTGGTCTGACATATTGGCTCCAGCGCATGTAAGTTAACTGTCCGTTAACTATGTTTATTGTTTGATGGTCGCGGTTGGATGAGGGGGATCATCAAGATGACAACCGCCAGAGAGCAGAAGCTTTATGACGCATTACTCGACTACATCGCGTGCTACGGCCTTTCGAAGGCCGCGCGCGAAGCACTTCGGGAATACGAGGCAACTTCTCCAGCAAAACTGCAGGCCCGCGAATTGCGGCCAGTACTCCTGGAGTAATTACATCAGCCGCGATCCGATCACTGCACGCGGGGCGGCGCGTTGCGATAGGGATGGTTCGCGGGCAAGCGCTCCTCGAGGCCGTGACGCCAGTGCATGTAAGCCTCGATCAGATGGCGCGTGGCAAGATCCGGCACCACCCCCAGGGCGATGGCCCCAAGGATCGGCCCGACCCACTGGCGGTTGTTGTTTGCATCAAAGCCCCGACCGATACACCAAGAAGCGCTGGCACTAACCCCGAACGACATCGACCCCATCGGCATCGGGAA
AATGCCCGACGACACCTCAGTATCAGCCGCGTTCAGCCGGATCGCACTCGTCCCGGATTGCACGGCATCCGGCTGCTGGAACGACACACGCGCAAGCGCAGCCTGCCCGTTACCGAGGATCTGCGTATAGATCGCACTGCCCGTTGACGTGCCACTGATGATCTCAACGCCCGTGGCAAATTCCGCGAGGATGAGCCACCACATCGGTGCAAAAGCCGCAGGCGGTTGCAGATATTGGTAGGCCGGGGACTTGTCCCAGATGACGGCAGGCATACCCCGAAATGTGCCAGCACGGGGCTGATTGGCGGCTGTGGGCTGGGTCATGTCCCGCACGCCAAACTTGTCGCGCCACGCGGTGACGTAATCCGCGCCGTTGCGCTCGACCGTGGTCCGATCCTCGCCCAGCCATAGTCCACCATTGGCCTTTTGCGGCGCGGTCAGCGCGCGCTCTGGCACCCACATCTGGATCGTATTGCTTTGCATGATTGTCACTTCCGCATCCTGAGAAATCGCCGCGCCCTCCAGCGCGGGGCTGATGATAAGGTCCCGGCCAATCGAGCCGGGGATCGCCACGCCGTCCGCAAACCACTGGCCGGGCAGGTTCGAGCGCAGCCGCGATCCGGCATAGCCCGCGCCCTGCACGATCCGAATGATCGGGCGATAATCGAGGCTGATGGCCGCGCCCTCATAAGCGGGCGTCATCTGCCAGCGCAGCCCCACCGCGCCGGGGATGTCGACGCCATCGGCCTGCCACTGGCCAAAGACGCTGGATTGATAGCTGCTGCCTGCATAGCCCGGATCGCCATCGCTGACCCAGATATGCGGATCCACCCGCAGGAATATCGCCGCACCCTCGAGGTCGGGCGTGATCATCAGCGCACCGCCCCAAGCGCCGGGGATGGCGACGCCGTCCGCATACCACTGGCCCTCGACGCTCGAGACGAGTTCTGCGCCCGCGTATCCGCTGCTGGACGCTAGCGAGATCACCGGCACCGCCCCCGCCCGCGCGCCTGCCAGCGCGGCAAGGGCGGTCAGGCCAAAGCCAAGGGCCAGCATTACCAGATTCCGACGAGGCCGGTGGCCGTGGTGCCCGAGGCCCAGACGCGCGAGACGCGCACCGGCAGCGGCACCCCCACGAGGATCGGCAGCACCACGGGATCGCCACCGCCGCGCATGGTCACCCGCACGTTACCCTCCCCCAGCGCATAGAGCGCGCGCGGGACATGCGGCAGATCATCGGTGTTCGAGGGCGTGATGCTGGCGGCATTGGATGCCGGGCTGTCCATGCCTACTGCATGGTGTTGAAAGGCGTCTGCCATGGTGTTCCTCACATGAGCATGGGGCAAAAGAAAAACCCGCCACAATGGGCGGGTTGGGGTTGGTTGACGTATACTTGTCGAACGCGGTGATCGATGTTCAATCCTTCATAAGGCACAATCGTCGAGAGTTAGACCCTTCAAACCCAGCTCATGTACCCAAGTTGGACGCCTACCACGCCCTGACCAAGTTTGAGCGGGATTGCTTGGATTGCGATAACGCGCAACACCACGAGGGGTTACCTCTTTCATTGACCTGATATCATCCGCAAGTTGGTTCAATGACAGTCCAAATCTCGAGACCGCATGCTCAGCCGCCCTAAGGGCGCACTTTCTATTCCTGTCATCCACGCTGACTAGCGCCTGATCAATTTCCACTAAGAGTTGAACCAGCTCGCGGCGATCGAGGCTATCTAAATCCATTTTTGTCCCCCTAACTGAACAGAAAGAAACCACCTATGGTCATATACAGCTCGTTCAGAGCACATCTGATGAATGTGTTTTCCATTCACTAATAAATGCTTAACGCCTACCGAAAGTTCCTTAGTTAGCTTGTCACTTAAGCAGCCCGTCGCTCAGCCCAACTCATCCTCCATCACCTTGGGCAAACTCACCCCATCCGCCTCAGCGCACCCTTCGGGCACATCCATGCCCGGATAGGCAGGAATCGCGAAGGTGCCCGGC
GTGGCGGTCTCGCGCGGTTCTGCCCAGCGGGCGGTGACGCGGGCCTCGGGGGCGACCTCCTGCCCGTCGCGGTATTCCGGCAGGCCAGTCACCAGCGCGGTGGCGGCGGCTTCGGCAGCGATGATGGCGGCTTCGGCGCCCGTCCAATATTTCTGGGTCATGTCAGTCAATCCTTGGGCCAAGTGATTTGTAAGGGTGCGCGGCGGGCAGGCTATCCGCCTGCCCCCAGCTGTGGGCGAGGTATCCAGAAAGCCGGTCGTGGTCGTCAGCGCTCGGCTCATTGCTCAGGGCAATGACCTCCGCAATCAGACCATCCCAGCCAAAGCTGGCAGACTGGCCAGCACCGATGCCCCATGCGCGCAGCGCCTCACTGTTGCGGTGCGGGGCAGTGCCCAGAACGGCCCCCGCCATCGGCAGGACAGCAGCCGTCGGTGCGTGGCCATTCATGCGGACCAGCGGGGTTTGCTGCAATCCGTTTGTGTCGCGATTGCCGCGCACACGACCGGCACCGCTGCCGATCATGCCCCAAAGCGTCGCATAGCTGGCGAAGATGGTTTCGACGCCGGTTTTGTATTGCGCGACGGCGTAGACATACCGCCCGTAATGCGACGACGCCGCCAGCAGGTAGGTCTGTTCCTCCTGCGCGAAGGTCATCACCCTGCGCCCCATCTGCACGACCGTGCGCGGCTGAAGGCTTGCACTGGTCTGCAACATATCCCGCAAGCCGAAGCTGTCAGCGATGGCCGCCACACGATCATCAGCGGCAAGCTGGACACCGCGCCGCATCGACCACCAGCCCTCCTTAAGCGCGGCCGCAAGCACAGTCGGCGTCCACATCTGGATGCGGTTACTGCGCGGCGCGGTGAATGTGATGTAGCTGATCGCCGCGCCCTCGAGTGCGATGGTCATTGTCCATGTCTGACCGCGCGCGCCGGGGATGGGCAGGCCGTCCGCATACCACTGCCCGCCCCCGCGCGACGCGCTGTAGACAGAGCCAGCGAAGCCCGCGCCCGAGGTGATCGAGATCTCGACCGACTGGGGCTGAATCGCGATATCATACTGGATCACCGCGCCCTCATAGGCATCGGTCATGACCCATGTCTGGTCTGTCGCGCTATGGATTGGCACGCCATCGGCAAACCACTGACCACCGGGCTGCGTGGCCGCATAGACCGACCCAGCGTAACCGCTGCCAGATGCGAGCGAGATCTCGGGCGGCTTGGCCCACATCAGCACGGCCCCGCGCCGCAGCTCAGTCACATCGCGGCCCGCGATGCGCAGGGCGCGGATCTTTGCAAAGTCCAATTCGATCATTTGACCCTCGCATACCAATGTAGCGGATTGGGATCGGCCGGCGGCGGCCAGCTGCCCCCCTCGACCAGCGTCACCGTGACGCCGAAGGGCACCGGCACGCCAAGCAGATCAGCCAAAGCGAAGGGGCCGGGGCCGTCGATAAAAATGCTGCCAATAGTGGTGATCCGTTCGCGCGGCAGGACATAGGCTACGTCATAGACAACACCCTGCCCCAAGTCCTGCCGGAACAGCGAGGCACTGAACGCGCCGCCCGTCACCGTTACATCGATCGGCGCGCCGGTGATGATCGCAGCGCCGCTCTTGTCCCAGCCGCGCGGGGTAAAGCGGATCAGGCTCAAATTCTGCGGCGCACCCGCGATATCGCGCAGCTCGCCCGTAACCGTGACGGTCGTGATCGCCATGACTGGCCTTTCTATTGGATAGGTTTGGGCAAAGAAAAACCCGCCTCAAAGGGCGGGTTCGGGCTAACAATGCTAGCCACTGAAGGAAGCTTAATAGGGGCAAAGCCGCCACTAAACGGCAAGTCGACAGTCAGGCTTTTGCCTGTTCTCCTATCTCATTGTAAGCACGACCTTACACGACCACTGCCAATCTCTCGGCACACCGGAGACCAATATGAACGAGTTGCAGAAGGCCGCAGCCGGACTGCTATATGATGCAAATTACGACCCTGCCTTGCTCGCGAA
ACGTCGCGCCGCTAAGCGCATCCTATTCGAAATTAACAATCTGCATCCCGACGAGGACGAGAAACGGACACAGCTGCTGAAGGGTCTGCTCGGCAAAACCGGCCAGAACATCACGTTCGACGGCCAATTCCATTGCGATTACGGATTTAACATCGAGGTCGGCGAAAACTTCTATGCCAACGTGAACCTTGTGATCCTTGACGGAGCGAAGGTTACCATCGGCAACAACTGCTTCATCGCACCGAATGTTGGCATCTACACCGCTGGTCATCCGCTAGATGCCGAAAGGCGTAACAAGGGGTTAGAATACGCGCATCCCATCACTATTGGAGATGATGTCTGGATTGGCGCTGGCGTGACGGTGTTGCCAGGCGCGTCGATTGGTTCGGGCAGTGTAATCGCCGCCGGTTCGGTTGTCCGTGGAGAGGTGCCGCCAAATGTCATCTGCGGCGGAAATCCCGGGAATGTCATTCGCGAAATAAATGAACGCGACAGTCAGAAGTATCGCTGACGTCCCCTTTACCCCGGCAGATCCGGCCACCTCACCTGCGCCGGATCTGTAAGGTTGCCGGGCAGATCTCGCAGCGCCTGCCGATAGGCGAGCCACGCGGCGCGCGCCTCATCGCTTTGCGGGTAGTCTGGCTGGCTGCGATAGTCGCAGGCCGCGAGCAGCCGCGTGCGCTCCATCCGCAGCTGCGCCCATTGGTAGGTATCGAGATCAGGATTGATGACCCAATCCCATGCGACAGGGTCCCAGACATGGCGATCATCCGGGCGCGGCGGCAGCTCGACCGGCATCCCCCGCACCCAATGCGTCGCAGCCGACCAGTATCCCTGCAGAAGTCCAAAGCCCGCAGGGACACTGATTGTCGGATCGGCGACCACAAACGCCGCGATCTCTCCGGTATCCAGCCGATAAAGGGTAAACGGCACCTCTTGCTGCATGGCTATCTCCGGAATTGGTGCGCCGACAGATAGCGCTGGTAAACATTCGAGGCGTGCACCACCGTCCGCGCCATCAAAGTGTAGGTCGTCGGACCTTGGCCTGTATCCCAGTCGACAAAGTTAATGACCGATGACTGCCGCCATGCGCCACCATAACCGTAGGACCCACCGACCTGCTGACCATTGCGCAGGAGCCACACAACGATCCGCATGTCCGTCGAGCTGCCATCAAGCTGTGCATTCGCGGTGATCATCGTGGCGAGGCCGCGCCGATCGACAGTCAGCTCGAGCAGCGGGTAGTCCGCCGAGTTGGTGACAAAGGTCGGACTGCTTGGCTCCCAATACGCATAAGCCGGGACGGTCACCGCATTGCCCGCGATCTGCAATGTGTTGACCGCAAGATCCGCGATCGCGCCGTTTGCCGCGGTGACGGAATTGGCCGCCATCTTGTCCGCAGTGACGATTTGGGACGTTAGGTGTTGGGTTTGGATGCCGCCCTCGACGATGAGCAGGACGGCGTTGCGCTCCAGCAGTTGCGCTGACCCGATGTTGAGGACAGGCCGACCATAACTGGGGTCGATGTAGACCTCGAGCTTTGCCGTTTTCGCAGTGGCGGGAGAGGTGCCACTGACACTGAACCGGTTGATCCACGCAGCGCCATGCAAAGCGATGTGGCCGAGCTCTCCGACGTAGTTTCCATCCCTGTCCAGATAGGTAATCCGGAACAAAGTCCTGCTCCCACCGCTCACCCAGTAGCCCAATTCGAACGAATAGGACGTGGCGCTTTTGATCGGAAACACTTCGCCACTTAGAACCCAGGCCCAAGGATTCCCTGTGACCGTGCTAGGCGCGGTATTCAGCTGGATCCCTACCCCTTGCGGCCCGTAATTATTGCCATTTTGCGTATCGACAAAGCGCTGCTGGAATACGACGCCTTGCGCGGCGTTCCAGAACCACGAGCGCCCGCCCTGCCCGACCGGCGTGCCGAGATCCTGATAGAGATAATCCGGCACCAAGGAGCCGCCAGTGATCAGCACATGATGCGCGCGAAGTGC
ACCTGTGGCGATCTCTCGCGCCCCAACGGCGCCCGCCATGAACGCAGCACCCGTCAGGGTGTTGACGACGATATCGCCGCCATCGGTCTTATTAGACCATGCGCCAAGCTCGCTATCCCAACGATAGATCTTTTGATCTGTCGTCAGCACCAGCATGTCGCCGACCGAGCCTGTGGCCGGCAGCACATCGACCACCTCTGGCACCGTGAGGCCCGCCGCAAACTTTGTTTTATCAAGCGACGCCGCAGAGACCCCAGCAAAGACATTCTGCGTCCAAACCCCCGCCGCTGCATCCCAGCGCCAGATCTCGCCCGTGGTGCGGATCATCACCAGCTGATCGGCCCGCGCCCCCGCATCGGGCAGCGCATCCACAGGCTGAATGCCCTGCGCCGCCGCCATCTCCTCAATCCGGGCGACAAAGCCGTCCGGCAGATCGCCCTCCTGCATCTGGATCGCGGGCGTGGTGACCGTGATCCAGCCCGACCAGACATAGCCGCCGGTCAGCTCGGACATCAGCGCACCGCGCACCTCGTAGACAGACGCCGGTGCAACGCCGGTGATCCGCCACTGATAGGGCCGATCATAGGGGCGCATGACGTCAAAGGTCGGCTCGCTTGCTCCCAGACGCCTTGCCTGAATGCGCGCTTGCGCGATGCCTATGGCATCGCCCGCGCAGCCGACGCGGATCGCAGCGACCCGCCCCACGCCTGCATCATCGGCTACCTCGTCGCCGACCGCCGTCCAGCCGTCGATGGCCTGCACCCACGGGATCTCCGGCACCGGGGCCACGCTATCATAGGGCAGCTCGAAGCTCGGCGACCAATCGTAATCGGCGGGATCGACCTCGCGCAGGCGCACGGTGACATTCATGCCGGGCAGCTTGGTCACCTGCTCAACGACGAACTGCTTTGCGTCATAGCCGTTGCGGGCGCTGGTCCAGCTGATCGTGTCCACCAGCGGCTCAAGCGCATAGGCGCCGGGATGCAGGCTGAACTCCACGATAGTATCGCGGCGATAGTCCTCGAGCTGCGCCCGCATCAGCCGTTGCACCTGGTGCGGATAGGGCACCGCCGGATAGCTGAGCGAGGTCGGCAGATGCCGCCCGCCGTCAGCCGCCACGCCCGCTTCCGAGATATACTCGGGCGCATCGCGCGAGGCCCACTTGGCGGTGGGGTCCGGGAAGGTCGCCGTGATCGCATTGAACGTCTGGCTTGCCGGCGCAAAGGGCGTCAGGCTCTGCCCCTCGCTGATGACGATATCCGCATCGGTGATGGCAAACACCGGCGCGGCCGGCAGGCCGACCACGGGCTTTAGCATCCCCCCGACCTCGGCATAGCGCATGTTTGCGCCCTTGCCGATTTCCTCCATCACCCCCAGCGGCTCATCGCTGACCGTGATTTGCAGACCGGCGCGGTAGGCCGGTTCCGTGCCGCCCTCGGCCAGCGCGACCGGCATGTCGCAGGCGTTCATTGCCGCCATCCATTCGGCCAGCGGCAGACGCCACTGCGCCATGTTCTTGCCGCCGAACAGCCATTCATCGCCCCAGTAGATCCCGCGTGCGATATTGTAGCTGATCACCGCCGCATTGCGGCTGGGCTGATAGGTCGCGCGCTGGCCCCAGCGCTGCGCGCCCTGCCCGCCCGCCGTGCTGTCATAGCGCGGGTCATAGAGCGGCAGCGGCTCCGGCTCCCATAGATAGGTCGGATAGCTGGTCAGCGTCTCGCTATCAAAGCGCGTGGTGACGATGACATAGGTCTTGCCGGTGCCGATATGGTTGGCCGTCCACGGGTAATCTGCATCCTCGCCAAATGCCCACAGCAGGAACGGATCAGCGGCGGTTTGCGTGCCGTCGATCCATTTGACCCAGATCCGCGCGCCGAGGTCGCCCTCGCCCTCGTCATCCTTCATATTCGACAGCGGCCAGCCGACAACGGTATGGGTGGCAGGTACTGCGCCAATATCATGCGCGGGCAGCGAACGGATACTGGACGGGCCAAAGA
GCGAAAGACCCGTGACGACTGTTCCGACTTTGCCCGCGACCAGGTCGGCGCGCTCATCATTGACCCATGCGCCCGCAAAGCCCTGTGGCAGGCTCGAGATCTCGATCACCTCGGTGATGAACCGCGTCTCGCGGCCCCATTTGGCGATAAACCGGCGCTTGCCCGCCGTCGCGAAGTCGCCGGCGGTGAAGGTCAGCGCGGTATCGTCGCCGAATTCGACATCGAACTTCACATCGTAATTTTGGCCCTGCTGCGCCGCCACGCCGGTCAGCGCCGCGCCAAGGATGCCGGTGCCGAGGCTGACAACCGCGCTAGCCAAAAACGCGCCTATGGTGGCACCCGCCCAGCCAAAGACATTGACGGCAAGCCAGCCCGTCACCGGGTCAGCCGCAGCCGGGGCGGCCACGCTCATCAGCGCGACCACCGTCAGGACCAGCGTCCAGATGCGTGTCATATTAACCTACCTTGAAAGCCCGCTTGATCTGGTCGCGCGGAACCCGCGCGTGACCTGTCTTGTCCAGCACGATCACGCTGGAGATATCGATGACCCCGAGCGCGCCGATCTGCCCCTCGCCGTCCAGCAGCGCCAGATCACCGATCTGGGCGAAGGCCGGATGCACTTCTGGCAGCAGCGCCGCCATCGCATCGCCCAGGCTCTCATAGCCCGCCTTGCGCAGCACACGCAGCGCGCCAGTGGGTGTTTTGTAATTGGCCCAGGCGGGGCGCAGATCTTCGCCCGTGATCGCCTCGACCGCGCCGGCCGCCAAGCCCAGCCCGCAATCATGGCGGCCCCACTCAAACGGATGATCCCGCTGGATATCGAGCGCATGGCTGAGACGCGCCCGCCAGTCCGGCAGGCGGGTCAGCGATCCTTTTTGTACCATTGGACAGACCTCGATCCGATGACCGCGGCGAATTCGCAAAAGCGGTCGATAAGGTTACGACGCTTCTGATGGGCGTCGGATGATTTGGCGGGGTTGCGCGCGGTCAGCTGCCACATGATCTCGGAGCGGATCTGATAGGTGATGCCGCCCTCGCCGTTTTCGCCCGGCGTATTGATCGGCCCCGCGTCGATGATGCCGACCCATTGCAGCTGGGGCGCGGCCGTAAACGCGCCGCCTGTCATGGTGGTGGCGTGGATCTCGCAATAGGCCAGCCGCAGATCATAGCCCCGCGCCAGCAGCTGCGCGGCATCCACCCCTTGGCTGGTCGAGACCGCCACCGGGTTATCGGTCAGATCGCCGACATAGGTCAGATCCTCGACCTTGAGGTTTACCCCGCCGAAGTAGTGCCGCGAGACGTTACTGCCATCGGGCTGCGTTAGCGTCAGCGTGATATCCTCATCGCCAGACCACAGCCCGATCGGCACTTCCGCGCCGCCAGCACGCGGCCGCGCCAGCACCCAGAAAAAGTAAACGGGGGCAATGCCCCCGTCCCGCGCCGCCGTAAGGCTGGCCGCAAATTGTGCGTCGTATGTGCGCATTATCATTTTCCTAGAAAGGCTTAGGCAAGGGTAGACATCTGACAAACCCTTGAGTCATTATCTGGCCACGCAACCTATAAGGGAGTGACACGCCATGATTGACGGTTTTTATAGTGTTGAGTTCGAGACGGTGTTGGGTTCCGGCGGAGGCGTCGTCGTCCTCGAGGACGGCAGCCTGCGCGGCGGCGATTCCAAGCGATATTTCCTTGGAAGCTATCGCATAGAAGATCAGAAGCTGTTGGCCGATGTGCACGTCGGAACACATATGGACAAACTCGACATTCCGCCGGTCTTCGGGGTTAATGAACTCGACCTGAAGATCACCGGAAAATTGACAGCTAGTGCAGCAATTGAAGGCACGGCGCGTAGTCCCCAACGCCCGGACAGCGTCATGGTGTTCAACATGAAGCGCATATCGGGCTGATCATCGCTTTTCCAACATGGTGACGCTCGCGCCGGTCGCCGCTGAACCGCGCCGATAGCTATAGGGCGTGTATCCGTCGTCCGGCACGAACAT
GCGGCAGACAGGGCGCAACAGTTCGATCTGCGCCCCGACTTGCAGCGACAACGGCGGATAGGGATAGACCGCCACCTGCACACTGCCACCCGCCACCGCGCCGCCGTCCGCGATCTCGCCCAGGTAATAGCGCCCCGTGCCCCAAGGCGCACTGAACCGATCGCCGGCCGCGAACTCAAACCCCGCAGGCAAGCCCGAGAATGTGATCCGTGTGCGATCAGCCGAGAAGCCCGCGACCCGCACGTCGGCATTCACCAGATGCGCCGGCGCGCCAATCGCTGGCCCGCTGTAGGTCGGATCGGCCCACAGAAAAGCCCGATTGGTGCCGAGCGCCCGGAACCGCGCATCGGTGCTCCGCGCGCGGCCGATGCAGTCGCGCGAGAGGTTATTGAGGTCAAAGCTGGCTGTCCACAGCGGCGGCGCGAGCTCAGCCGCCCACCGCCGCCCGTCGCCACCGCCGGACATCTCGTCATAGCGTTGCAGCTTCAGACTGACCTCTGCGCGCGTGGCGATCAGATCGCCGAGGAAAGCCAGCGGGTAGAGATCCGGCAGGGTCATCTGGTCTTCCTTCTTTCTACCATTGCATTGCGGACTTTGACATTGAACCGCTTGTCATAATCGCTGAGCACTTGTGTCATAGCGGCTCTGGCGCCGACGGCGGCGCGCTCTTCGATCGCGGCATCCCCTTGCGCTCCGCTGACATCGACCGTGATCGCCACCGACATGGACTGCGGGCCGGCATGGTCAGCAGTGGTCAAGCTGGACATCGCCGGAAGTCGCATGCCGACCAGCCCCCCTTCCGAGTAGCCAGGTATGCCCGCAGTCTGACGTAGCATTTCTGCGGCCGCCGGCCCGCCCACGCGACTTACATCATCCTGCGACCATACGACCTCACCTTTATGCACAATGCCCGCAGGCTGATACCGCCCGCCGGGTCCAGTGTATCCGCCGGTGTCATAGCTAAGGCCGCTGCCGATCGCATTAATCAGGCCGCCGCCCCATGAGGTCGAGCCAATCAGCCCCATCATCCCCTTGACCATCTGCACCTTAGCAATCTGCATCAGCAAATCCGCAACAGCCTCGCGCGCCGATTTCGACCCATCGATGATCGAGCCAAAGAAATCTTCCAATGTCGCGCGCCCGGTATCGGATGCCTCCTTGATCTTGCCGAGCTTTTCGGCGGCTTCCTCAGCCCCGAGGCCTGCCGTCACATAGGCCTGTGCAAGTGCATCGATCTCCGCCATCAGTTCCGGCGTCATCGCGCGACCTTGCTGTTGCGCCGCGTGCAACAGCTCAGCCCGCTTACGTGCGAACTCCAGCGCATCACCATAGTCTCGCGTGCCATCAGCCACAGAAAGAATTGCAGCAGCCTCGATTTGGAAAGCCTGGGTCTCCTCCCGGATCGATTTGACTGCGTCCGTATAAGGCGTGTCATCGCGCCTGCGTCCCGATCGGCCTTGCTCGCGCCTTGCGTCCTGCGCCGCGAGATTACCAGCAGCGATATGACGGATCTGGGCCTCGGTCAGCTGCACCTCATCACTGAGCGATTGCCGACGAACGCTTGCGATCTCATTTTCCAGCGCAAGTGCTTCACGTGTCAGACCGTTGCGACGATTGGCCTCTGTGACGTAATCCGCAGCCTCGCGCTCGATCCGCTGCCCTTCGCTACGTGACGCCCCATACTCCGTCGCAGCGGCGGCGCGTGCCGCGAGCGACGGATCAGATGCCAGTGCGGCGGCCGCTGACGCGCGATCAGCTGTCCTAATCAGATCAGCAAGCGCTCCCGTCATCTGTGCAATACGACCCAGAAATGGTGCAACGTTTGGATCGGCTTCACCGATGTCTTGCAAAGCCTGCTGCGCCTCGTACGCGGACATAGTTCCTTCTTGCAATGCGACCCTGACAGCCATCAATTGCGCGATGGTTTCGCTCGCGATGTCATCATATTGAAGACCCTTCAAATAGCGATCCAACTCGCGCACGGCCTCCAACTGATCAGA
AAGAATGTCGCTCATACCAGACTGCGCAATTGCATCGCGCATCGCACTGGTGGCACGCACATGATTTGCAAGAAGGCTGATGATCTCCCGCCCATCCTCTGTGAAGCTCGACGTATCAATGCTCTCTAGAGAACGGATCACATCATCAGCGCTGATGCGAAGGGCGCCGAACTCTACTGCCAAGTCCCGAACTGCGCTGAGGGCATCCTTTTCTTCACCACCCCGACCCCAACCACCGAACAAGCCGCCCGCGCTCCCGGCGCGATCGACGATCGATTGCAAGTCAACCGTCGAGACTTTCTCCAATTCCTCACGCAAGTCGCGCAAGTTTGCCAGTCGTCCGTCCAATCCATCAAGAGCGGAGCCCACCGCATCAATCGCACTCGCGGCCGGCGGCGCAACAAGTCCTAGGCGCTCCAGCTCGTCACTTACACGCTGCGTCCGGCCTTCGGCTTTCAGTGCAGCATCCGAATACATCACCAGCCCGCCGGCAACGACACCACCGAGCAAAATGCCCAGAGGCCCTGCCGCAGCACTCATGCCGCCAAGCGCCGCAGCAAGACCGCCCACGCTCGAGGCTGCGCGCATGGCTGTAACGTATTTCATGATCTCAACGGCACCGGTGCCAAAACGCACCGCAAGAGCAGCGACAGAACGACCGAGTACCGCCCCAGCCAAAACCGCCGCGACTTTCAGACCTTTATCTGCAACCGTGTCGAAGTTATCCGCAATGATCGTCAACGCATCGGCGATGCGCGCCGAGATCCCCACCGCATCATCACCGCGACCGATATATTCCAGCAGCGCGTTGTTCAGCAGCATGAAGCCATCCTGGATCGTCGCAGGCATGGCCGCAGCCTCATCGCGCAGCTTTGCCATCTGCGAGGTGATCCCGAGCAGCTCCGCCCGTCCGATCTTGCCATCCCGGCCCATCTTGCGCAGCTCGAGTGTGGTGACGCCCATCGATGCTGCGAGCGCCTGTGCCACCCGCCCCCGGACGCAATGACGGTGTTGAGGTTGTCGCCCTGCAACGTCCCGAGCGCCATCGCGTTCGACAGTGCATTCATCACACGCGAGGCGACATCGCCTTTCGCACCGGAGACAACCAGAGCATTGTTTAGGCTCTCGACGAAGTCGAGCTGGGTATTTGTCGCGACTCCGAGATCGCCAAGCACAGACGCGAAGCTAATATAGCTGTCCGCCGTCATGCGCAGATCGGAGTAAGTCCGGCGCGCCATGTCGCTGACACGCTCCATGACATCGGTGCCGCGATCCATCGACCCAGCCGCGTTATTCACGCGACTGGTAATATCCGTCCAGGCGCTGGTCATCGCTATCAGTTGGCGGCCGCCAAGCGCCGCCGCCAAAGGCGCAAGGATCGAGCCTGCCCGCGCTTTGATATTGTTGAGGCTGCGCCCGATCTGTGTCTCCATATCCTTGAAGGATCGGGTTAAAGTCGCATTCTGGCGCTGGAACTTTGTTGTCGCGTCGCGTTCCATTTTCGCCAGACGGCCCTCGAGCCGCACAATGGATTGGGTGAACTGTTTCTCGGACAACCCAACTGGGAGCTCAAGCGGATCTTCTGAAGCCATCATTTCACCCCAAGTGCCTTCAGGCGATCGAGAGTCACCTCGCCGCCACCTTCCGTTTTGATGCCGTTAAATCGCTTCCAGGCGGCGAAGGCGCTGTGGAACTCCCACAGGCTCATGCGCTGGACCTGTTCGGGGGCGAACCCCATCACGGCCCCGCTGCCATAGAGGAGCGAGAACCTTAACCGTTCTCGTCCGTCATCTCCGCCGCCACTGGCTTTTCCGGCTGATCGTGATCCTTGCCGGAGAAAAAGCCTGTTACGATGCCGTAGCAGATCTTTACGAGCTCCGCGAAATCGGCCTCCTCCATCGCGCGCAAAGCGAGCGCGCGCGCAGCATCCCCTTCCATACCGCCGCCGATCAGTCCCAGGCGAATGCAATCAATGACCTCGCGCACCCGCACCGGCGAGA
AGCCGAGCGAGCCACGCTCGATCCCCTGCCGGCAGCGGAAGCGGAAATCCGCGATGCCTTGCGGCGTTAGATCGTCAAGCGCTTCGGCCTCGCCAATACGAAGCCGGAACGCATGCTCGCCGCAGGTCCAATTGATGACCTGTGCCTCCGCCATTAGGACCCGACCTTCAGGCTGCGGGTTGGCGTTTTTGCGAAGCGTGCCTCAACCGAGGCGGAAACCACCGCGCCCTTGACGCGTGACTGGCCAAGGCTGGTGATGATGATCGGGCCAGTCTCGTATTCGATCTCGCCGGCGCCTGCGTTCAGATTGCCGATCCGCGCCAGCATTTCAGTGCTGTCGCCCGCATAGAACTTTTTCAGGAAGGTATCATACCCACCCTGCGTCCAGTTGCCGGTGCCCGAAAAGCTGACCGCAAGGCTCTGCACCCGCACGCTTACGTTGTTCGGTTTGCTCTCGTCGTCGCAATCATCGACCGTTTCGGTCTCACTGGTCGATGCCGTCCGTGTCACATCGGCGCCCATGATCACGCAATTACGTGCCCAGGTAGTGCCGTTGTCTTCAGAGAACTCGAACACGAGCTCGTGGTATTCTTCATGAAAAGGCGCTGCCATTTTGGCGATCTCCTCGTGTTGCAAACAAAAATGCAGCCGGTCAGGGCTGCGCAGGGTCGTCAGGGGTTTTGTGCGTGCTGCGGCGCGGAGGTGGCACTGTGGCACAACCCGCACTGACCGCCGCATCTATAAACTCTTGCGGGAAGTGCTGAGGATCTGGACCCGCCTTGGCTGACCATCCCGCATTGCGCAGACGGCTGGTGTAGTGGAAGTCACGGTGGAAGATCGCTTTAGCCATCATAGCGCTCCATCTGGAATTCGTAGCGAAGCACACCGTGCAGCGTCAGTCCGTTGGGATCGCGCAGGACCTGGCTAAATGGATCGCCCCGTGCCACGATTGGATGTCGATCTGTCTGGACTTGACCCATGATCCGCCGCAGCCGTTGCATGATCTCTTCGCAATGCGCGCGGCCCGTCCGGTTCGACCAGGCATCGAGCTGCAAAGATACATCCTCGATCGCTTGGCACCCGCCTTCTTGTGAGACGCTGTCCCAAGGCCCGAAGCTGATGTAGCCATTAGCAGCTCCCCAAGGGCGATCCGGCACCCGGTCATAGATGCCGGCAATGATTGCCATAAGTGCAGCATCCGCCCGCACCGCTGCCTCAATGGCGTCTTGCAGTTCCGTCGCTGGTGATGACATCAGCCACCTTCCTTCCATGCTTTGCGCACCGCCCGCCGTATTGCTGCCATGGCCTGCTTGCGGCGCAAACGCTTTGCGGGGTTGAAGAAAGGCTTTGCGGGCATTGATTGCGTGCCGACCTCTTGGAGCTTCGCGTTTTGGAAGCGCTCGCCTTGCTTATTGGTAACGATCGTCGCGTCGTTACCCGCCCTTACAACGACCCCAATGAAGCCTGATCTGCCCTGGCTGGTCATGACTGCATCAGCCTCCTCAACGCGGATCGACGCTGCCAGATCCCCATCCGATGCACCCGCGAATGCACGGGCAAGGCTCGCGATCTCTTCGCCCTCTTTGCGGGCTTGGGACCGCCCCGCCTCGATCGCGGCCTTGGCATGGGTGCGCAATTTACGGCGCACAGCGTCGAGTCCCTTAACCATATGCCACCCCCGCCTCCACCAGCAGATAGATCCAGGCGCGATCACTGATCGGATCGACCTCGCGGATATTGTAAACGCCTGTCTGCGGCACATCGCCCTCGTAGGTGGCGCGCCGCATGTCGATCATCCGCCATGATTGCAGGATCTCGCGCGCAGCGGCGCATTGGCGGATCTTGATCTTGAACACAGACCGCCCTTCCAGACGCGCCGCATAAACCGCCTCGCCGCCGCGCGCATAGATGAATTCGGCCCGGCATTCGTATTCATGGGTCCAAATGCCAGTGCGCATATCCGGCGACTCGAACTGAACCCGCTCGATCAGTTTGCCCGC
AGTGCTCATCCGAATGCGAAGCTCCTGTAGGTTGCGACCATGTCGCGCACGCCAAGCGGCACCTGGGCGAGACCTGCACCAACGGCTTCGCGATTTAGATACCAATGCGCGACCAGCATGCGCATCGCCTGCACGAGGCGCGCCGGCACATTACCGGGCCCGAGGCCCGCCACGAACACCACCCTCGCCGACGCGACATTCTCCGCGCCGTATACGATCAGGCGGCCATCGCGCAGAGCGTAGCGGCTGGCGTCGAGCGTCACTGCGGCGCCCAGATCATCGACATAGCCTACGCTGGTGATCTCAGGATCGGGATAACCCGCGAAGCCGAGATCAAAGCGCGATCCCGTTGCGGTGAACTCCGCGCGCTCGAGGCGCGTCTGACAGACCGATTGCACCCAATCCACCGCCGCGTCGATGTAGAGGCTGATCAGCGCATCATCATCGCCGAAGTCATCGGCGCGGCAATGCTTCTTGGCATCCTCGAGGGTGATAATCACGCCCGTTGCCTCGCTCGTTTGCTGAATATCCATGCTCACCCTCCAGCCAGCAGAGCGGGGCCCAGCGGCCCCGCCCCAGTTGGTTACTCGCCGCCGGCGGCCGTCAGATCGCCATAGACGATGCCTTCGGGACGCAGCGTTTCCAGCTGCAGCCGTTCTTCCAGAAGGATGGTAACCAGGTTCTTGACGAAGTTGTCGCGATCCTCGGTCGAACGGCGCACTTCGATCCCCTTGCGCTGCCAAATCAGCGTGTTGCCGATAAAGCCGCCGACGATGAACTTGCCCTGCGGCAGGCCCTTGGTGCGGACGACCGGCAGACCCCATGCGGTATTGCCGGCAAAAGCCGGGTGCAGATAGCGACCATCCGCATCTTTCGCCAGATCCAGCGCGGCAGCGTCCAGATGGTTCATCACGATGGCGGAGGCCAGCAGATCCGCCTCGGCCACCTGCGCAATAGCGATGCGGATATCATCCATGGCGTTGACCGGCGTAATGCCGGGCACCAGCCCGCTGTCATAGGTGGTGGAGTTCGCCAGCAGGCCATCGATGCGGCCAGTGGTGCCGGGGCCATTCAGCAGCTCGCCCTCTTCCTTCAGCTGCAAGCCGTAGAGGCCGCGCTGATTGATATAGGCTTCCATGCCGTCAACATCGTCCAGTGTCTCTTCTGTGACCCGGAAGAAGTGCGCCATCTTCACCATCGGCGCTGCCTTCGGCTCGAAGTCCAGCTCGGACTGCGGCTTCTGCGCGCCTTCGGCAACCACGCCGGCATTGTTGGTGTAGCCGCTCTCTTGCAGATATTCGATCACGGCGGCCGTGGTCGCGGCCGTGGGGATGATATCGCGCAGGAACAGAGCCTGATTGACCGGCTGGATCAGCCCGCGCGAGCCGCGGCGTACGCCGCCCGGCAAGTCGATGGTGCCAAAGCTGCCGGTGGTCACATCCTTGACCTCGAAACGCGCCTTGCGGTCATCCTTCAGGGTCTTGAACTCGTCATGCTCGGCGATGATCCGGCCCAGCGACTTACCTTCCAGATCAGCGCCGCGATGCGACAGCTTTTTGGTCAGGTCGGCGACCTGATCGCCCAGTTCGGTCAGGCCCTTGCGGCTGTCCTCGACGTGACCCTTGATATCGGTGATATCCTCGCCGGAGGCTTTCTTCTGGTCCAGCACCTTCAGTTGGTCGCTGAGATCAACCTGCGCCTTCTTCACCTCGGCCAGCGTTTTGCTCGCGTTATCGAGGGCCTCTTTGATTTCCAAGTCCATGGTTTATCCTTTCGATGGACGGTTAGAACTTGAAGGCGTCCTTGATCGCCTTCGCGGTCTCCGAAGCGGACGCGTCGCGCTGTCCATCGCCCAGAGCTTCTGGCGCGCGGGCGGCAATCGCCTTGCGCAGCCAAACCGGGAGGCCCGCGTCACGCAGGGCCACCTCCACGGATTTCTTCAGGGGCGCGAAGTCACCATCCTTCGCCGCCCGCATGATCTGTACCGAGGATTTCACGGC
ATCGACGCCAGAGCCGTCTTCCATCGGAAAGGTCACGACCGAGATCTCCCACAGGTCGATCTCTTTGAGGTGCCGGATACCGTCGATGATATCGGCGTCTTTGGTGCGATAGCCGATCGACAATCCCTCGATGGCGCCCATCTTCATCAGCGCATGGGTCTCGCGGCCCTTGACCGTTTCCAGCGCCAGACGTCCTTCGACGCGCAGCCCCTTCTCGTCCTCACTGATCGACGTCCAAACCCCGATCGGCTGCGCTGCATCATGCTGCCACAGCAGCTTCACCGACTTTTGGCCCGCGATCGATTTGGCATAGGCCCCCGGGCGCACCATATCGCCGCCCTGATCGACCACATTGAAGCGCGAGGCATAGCCGACGAACACGCCTTCTTCGGTCGCGGCCTTTGCATCGAGCCGCGCCAGCTTAAATTCCATTGCCATGATCTGCCCCTATTCGGTGATTGGCGCGCCGCCGGCCGCGGGCAATGCGTCACCGCCCTCGACCGGGTTCTTGCCCAGCCATTCCAGAATGCTGTTCGGCGTTTCCCAGGCGGCATTGTTGCCCAGGGACTTGGCCGCGTAATCAGCGCGCGCGGCCAGATCCATGCGGTAGAACTGGGTCTCGTCAAAATCGACGTATTCATCCGCACCCAGCAGCGAGAACCGGATCGCTTCCTCCCAGCGCTTAACCCAAGGCCCCAGCGTGATCGTCACGTGGTAATCCATCGCATCCGAGATCCGCGTCAGCGACTGCCCCGCCGCGTCATGCGCCAAGAAGATCGGGTGGATGCCATAGGCACGCGCCACCTCCTCGATAATGAATCGCCGCGTCTCGAGCAGCTGCAGCTCGGCCTGCGTCGGGATGATGCTGTTGTATTTGGTGCCGCTATCAAACACCGGCGTGTTCGGCAGCTTGTCCTTCAGTGCTTCCTTCACCATCTTGGCAGAGTCCTTGGAAAGCTGCTGATCGGTGGTGATGTAGCCCGGCACGGCCTTCTTTCGGCCATCCTCCATCTGGCGATCCTCGAGGGTCAGCGCGAGGCGTAGCACCTTGCCGATCTCGGCCGTGATGTTCAGCCCCTCGATCTCGTTCCACCTCGGGCAGGTCACTTCAATGAAGTCGCGGCGCGTTAGGTTCGACAAGAAGCCAAGGCCGGGGATATTGGCATCGTACCAGACCTTGCCGGTGTCATAATCGCGGCGGATCGTGACCCCGCCGTCATTGATCGGGATCAGCGCGCGGATGCGCCCGCGATAGCCGCGATGGATATAGGCCCGCCCTACGCCCCGGAACACCGCCCACATTGTCAGGGTCTCGACAAACTCAGTCGGCGTCATCCAGTCGTTGGGCGCGAGCGTCAGGCGCTCGGTCAGCTCGCCCGCGCGGATCGGCGTGCGGATCGCGCGGCCCAGCCGGTCATGGCCCAGTTTGCCAACCGTGATCGGCAGGCTTGCCACCCCTTCCGCGATCCGCATGCCGGCCGCCAACGAGGCAGTGACCCGCAGCTGCTTCTGGTGCTCGGACACAGCGCGCTCGACGATGTTCTGTTGGTAAAACCGATTGCCCGTGCCGGGATCCGTGTTCTTTTGTCCGAAGGGCCACAGCTTCATGTGAACAATACCCCGCCTTCCATGTAGCTGCGCCGCCCCTTGCGGTCGGCCTTGGTGGCGCCCACGCCCATTGCAAGCGCGACCATGCCGTCGATACGGCCGCGCTGACGCTGTTTGTCGAATGCCTGATTGCCCTGTCCGTCCGAGATCAGCACCGTGTTCGCGGCGCAGACATGGGTCATCTTGTTCTCGGCGATCAGGATCTCGCGCTTGAGGATCTTGTCCGTCATGTGCGTGATCGAATGTGGCATGCACAGCTGCCGCCCCTCGAAAGCAATGCGTGTGCCCTGGGCGTGTGTCACGATCTTCAGGCCCGTGCCCTCAGGCTCATCGGGCCCCATGTACCGCCAGACCTCGAACGAGATCTGCTCACAGGCCGCGATAAAGTCGGA
GACATAGGCGGGGTCAACCACCAGCGCCTCAACCTCTTGCGCGACGCACAGCTCCTTCACCTGCTGCGCGACGAAGGTGTAATCGATCACCTCGCCCCGCACCGCGATCAGGTCACCGCCCGCGATATAGGATTTATAGGGCGTGCGATCCTCGGCCTCGCGCCGCTCTAACCCGCCCTCAGTGGTCCAATACCAAAGCTTTGCCGTCAGCTGGTCATCATCATCGCGCCAGACCGCAGAAAGCGCAGTCAGGTCGTTCTTCTTGGACAAATCCAGCGCCAGCACACAGGGCGTGCCGCGCATATCGTCCTCCGCCACCGGCCCCAGCACCGCGCGCCAGGCAGCTTCGTCAGCCAGCCAAAAGCCAGACGATCCGACAGGGCGCCCGAAGTAGAGCCGCTCGGTCGCCAGCCGCTCGGACGGCATGTGCTTCGCAGTCTCGACCCGCCGGCGCACGTTGTCGATCGGATAGGTGACGCCAAGCGCAGGCAGCGCCTTCACCCAGCAGCTCTCATCGCTGAACGGGTCATCGTCGACGTCGACCCGGGCGATATAGCTAAAGGCGCTATCGTCCTCGATCATCCCCAGCGCGACGCGCTGATAAAACTCGCTATAATCCGTGCCCACCGCCTGATCGGCCGCTGGCGTATTGGTGCCAAGGATCATCAGTGGATCGCCTGGCATTTTGTCGATCGCCGCCTTCCAAAGCTGGATCGCCTTGTCAGTGCGCATCTCGTGCACCTCGTCGGCAAAGACCGCGATCGGCTTCGGCCCCGAGATGCTGTCGGCCGAGGCGACCGGCAAGAACTTAGCGCCGCTCTCTGGCACCTCGATCTTCCACGCGTTGTCGCCGACGCCGCGGATCTTCACCTTGGCGACGCTTTCGAGCGTCGCGCCATCCTTGCCCGGGATCGGCGCGCGGCACAGCGCAACCGCATCCGAGAAAAGGACCTTAGCCTGGTCCTTGTCGTTGGCGATGGCATAGGCCTCGGCGCGCGGCACGCCGTTGAAGCCGATCATGTAAAGCCCGATCGCACCCATCAGCGGTGATTTGGCCTGGCCCTTCCCTGTCTCGATCCAAGCCTGGCGAAAGCGCCGTGTGCCATCGGCATTGCGCCAACCGAACAAGCTGCCCACCACAAACTGCATCCATTCCAGCAGATGGAACGGTTCGCCCGCCTTCGCGCCGGCCGTGACCGTGAACATGGCCGGAAAGAACCGGAAGGCCCGCGCCGCCTCATCTATATCGAAATTCAGCCCGCGCGCCGCGCCCTCCCGCAGATCGTTCAAGTGCCGCTGGCACGCAGCCCGCACGAACTTGCCTGCGATGATATCGCCGGACAAAACCCTATTGGCGTAATCCGTTGTAAGATCCGAGGAACTCATCTGCCGGGGCGCTCCCCTTCTTCGGCTTCTCGGCCGGCGGCCGCGCCTGTGCCTCCCCGAACATCGCCCGCTCCAGCTTCAGCATCCGGTCATTCAACTTCTCGACGGCCGACCAGGTAAAACTGAACACCTCGCCCCCGTTCGGCCCGCGCGTCACCGGGCCCGCCTCTGCCGCCTCCGGATAGAGCGCCTCGAACTCCACCTTCGCGCGCACATAGCGGTCCGCACGCGCCAGATTCACCCGGCTGAGTGCGGAAACCTCTTCAAGTGCGGCAATCACCTCTTTCCAGAGCAGTTTTGCGAGTGAAACACGCGCCTCATCACCCCTAAAAACAGCGTCATAGCGCGGCATTTGCGGCTTTTTTGCGGCCATCGGTCACCTCAAGAATGCCCCCCGACCCCTACCCCCTTTTTGAACCGCTTCTCAGTGCAAACGAAGGGGGGACGCGGGTGTGGCGCCAGACGGGGGGTCTTTAAAATCTAGGGGGGTGGGCCGGCCCTCCCCGACTACGATGCGCCCAATGCCCCTGAGCGCGCCCAGGGATGCGCGGGATCGATCGGCCTGCCATCCTCGTCGTGCCCAGCGATGAAGCCCCTGTGCGTCGCCTCCTGAA
TGGAGCGGTCGTGACACTGCTTGCACACGGCCATCAAATTGTCCGGGTCGAGGAACAGATCAATGTCGCCCTTGTGGTCTCGCTTGTGGTGCGCCGTCGCCGCGTTCGGTGGCAGCGGCTCATCCCTCTTCGGCTGCGCCAGCATCACCCCGCAACCAGGCCACTGACAGGTCCACATATCCCGCACAAAGATCTGCCGCCGCAGCAT
Protein sequences of DBSCAN-SWA_4 >NZ_CP016592|2030689:2084819|2057858_2058056_+|WP_013382979.1|DBSCAN-SWA MRKKADGETTDNLFAALTCDPNATIVRIHPKAMPVILTQRAALRTWLRAGLNEARALQNPIKDEF >NZ_CP016592|2030689:2084819|2039433_2040456_+|WP_013383001.1|DBSCAN-SWA MSLKTVCVSAIAIAAVAGAAQARDNIQIAGSSTVLPYASIVAESFGENFPEFPVPVVESGGSSGGLQRFCAGIGENQTDIANSSRPIRAGEIETCAANGVTDIIEVRVGYDGIVFASALNGPEFAFTPADWYKALAAEVVVDGEIVPNPYTTWDQVNPALPAQQILAFIPGTRHGTREVFDEKVLVAGCEESGAAEVLSAARGDEAACVALRTDGVSVDIDGDYTETLARIAANPQALGVFGLSFYENNTDTLRVATMSDIEPTVEAIATGTYPVSRPLYFYIKKAHIGVIPGLKEYAEFFMSDDMAGPAGPLAQYGLVSDPELAETQALIANETVMASN >NZ_CP016592|2030689:2084819|2063666_2063930_-|WP_014537874.1|DBSCAN-SWA MADAFQHHAVGMDSPASNAASITPSNTDDLPHVPRALYALGEGNVRVTMRGGGDPVVLPILVGVPLPVRVSRVWASGTTATGLVGIW >NZ_CP016592|2030689:2084819|2067558_2071065_-|WP_013384759.1|DBSCAN-SWA MTRIWTLVLTVVALMSVAAPAAADPVTGWLAVNVFGWAGATIGAFLASAVVSLGTGILGAALTGVAAQQGQNYDVKFDVEFGDDTALTFTAGDFATAGKRRFIAKWGRETRFITEVIEISSLPQGFAGAWVNDERADLVAGKVGTVVTGLSLFGPSSIRSLPAHDIGAVPATHTVVGWPLSNMKDDEGEGDLGARIWVKWIDGTQTAADPFLLWAFGEDADYPWTANHIGTGKTYVIVTTRFDSETLTSYPTYLWEPEPLPLYDPRYDSTAGGQGAQRWGQRATYQPSRNAAVISYNIARGIYWGDEWLFGGKNMAQWRLPLAEWMAAMNACDMPVALAEGGTEPAYRAGLQITVSDEPLGVMEEIGKGANMRYAEVGGMLKPVVGLPAAPVFAITDADIVISEGQSLTPFAPASQTFNAITATFPDPTAKWASRDAPEYISEAGVAADGGRHLPTSLSYPAVPYPHQVQRLMRAQLEDYRRDTIVEFSLHPGAYALEPLVDTISWTSARNGYDAKQFVVEQVTKLPGMNVTVRLREVDPADYDWSPSFELPYDSVAPVPEIPWVQAIDGWTAVGDEVADDAGVGRVAAIRVGCAGDAIGIAQARIQARRLGASEPTFDVMRPYDRPYQWRITGVAPASVYEVRGALMSELTGGYVWSGWITVTTPAIQMQEGDLPDGFVARIEEMAAAQGIQPVDALPDAGARADQLVMIRTTGEIWRWDAAAGVWTQNVFAGVSAASLDKTKFAAGLTVPEVVDVLPATGSVGDMLVLTTDQKIYRWDSELGAWSNKTDGGDIVVNTLTGAAFMAGAVGAREIATGALRAHHVLITGGSLVPDYLYQDLGTPVGQGGRSWFWNAAQGVVFQQRFVDTQNGNNYGPQGVGIQLNTAPSTVTGNPWAWVLSGEVFPIKSATSYSFELGYWVSGGSRTLFRITYLDRDGNYVGELGHIALHGAAWINRFSVSGTSPATAKTAKLEVYIDPSYGRPVLNIGSAQLLERNAVLLIVEGGIQTQHLTSQIVTADKMAANSVTAANGAIADLAVNTLQIAGNAVTVPAYAYWEPSSPTFVTNSADYPLLELTVDRRGLATMITANAQLDGSSTDMRIVVWLLRNGQQVGGSYGYGGAWRQSSVINFVDWDTGQGPTTYTLMARTVVHASNVYQRYLSAHQFRR 
>NZ_CP016592|2030689:2084819|2067130_2067556_-|WP_013384758.1|DBSCAN-SWA MQQEVPFTLYRLDTGEIAAFVVADPTISVPAGFGLLQGYWSAATHWVRGMPVELPPRPDDRHVWDPVAWDWVINPDLDTYQWAQLRMERTRLLAACDYRSQPDYPQSDEARAAWLAYRQALRDLPGNLTDPAQVRWPDLPG >NZ_CP016592|2030689:2084819|2057081_2057378_-|WP_013382981.1|DBSCAN-SWA MPAVPTTNKFRNRSVSDCARDSLLIAMQCTGCKQQRYYWASDLVKVLEPFHEAHVPPWPCAACRTGEWMVMRWMFPYPELMQRIGKIRRPVGRVTKWI >NZ_CP016592|2030689:2084819|2059522_2059729_-|WP_014537871.1|DBSCAN-SWA MRRSLMMLTWLIALSALASCGERIEAGCAGWRQIQVAGATVDYLADQDPQALRALIGHQDFGVASGCW >NZ_CP016592|2030689:2084819|2066540_2067122_+|WP_013384757.1|DBSCAN-SWA MNELQKAAAGLLYDANYDPALLAKRRAAKRILFEINNLHPDEDEKRTQLLKGLLGKTGQNITFDGQFHCDYGFNIEVGENFYANVNLVILDGAKVTIGNNCFIAPNVGIYTAGHPLDAERRNKGLEYAHPITIGDDVWIGAGVTVLPGASIGSGSVIAAGSVVRGEVPPNVICGGNPGNVIREINERDSQKYR >NZ_CP016592|2030689:2084819|2071473_2072100_-|WP_014537877.1|DBSCAN-SWA MIMRTYDAQFAASLTAARDGGIAPVYFFWVLARPRAGGAEVPIGLWSGDEDITLTLTQPDGSNVSRHYFGGVNLKVEDLTYVGDLTDNPVAVSTSQGVDAAQLLARGYDLRLAYCEIHATTMTGGAFTAAPQLQWVGIIDAGPINTPGENGEGGITYQIRSEIMWQLTARNPAKSSDAHQKRRNLIDRFCEFAAVIGSRSVQWYKKDR >NZ_CP016592|2030689:2084819|2072188_2072518_+|WP_013384762.1|DBSCAN-SWA MIDGFYSVEFETVLGSGGGVVVLEDGSLRGGDSKRYFLGSYRIEDQKLLADVHVGTHMDKLDIPPVFGVNELDLKITGKLTASAAIEGTARSPQRPDSVMVFNMKRISG >NZ_CP016592|2030689:2084819|2040513_2041893_+|WP_013383000.1|DBSCAN-SWA MPLLWTLIVILAIAAIGYWLGRSRAMQSAGHSTRALHSLPGYYGWNVAIWAMAPAFLLILVWLVIQPAYVNHIALQALGDAAQNSGSASLLLADVHRMADQLAAGSAISAEGPTAAAAQLSFAANTAGRMWMSIAAIALALAGTAYAWSRTNAQFRARPRVEAAVRTLLIASASIAILTTVGIVVALIFNTIAFFQAYPALDFFFGLTWSPSAGGVNSRLGILPLLWGTLYISFIALLVAVPLGLFSAIYLSEYASPRIRAIGKPMLEVLAGIPSIVYGLFALIVVGPLLMSWFSPTGMLGLGWMRGGTAVITAGVVMGIMIIPFVSSLSDDIINAVPQSLRDGSLGLGATRSETIRQVVLPAALPGIAGAILLAASRAIGETMIVVMGAGAAGVLSLNPFDAMTTVTAKIVSQLTGDADFSSPEALVAFSLGMTLFVITLGLNVLAMAIVRKYREQYE >NZ_CP016592|2030689:2084819|2084504_2084819_-|WP_014537881.1|DBSCAN-SWA MLRRQIFVRDMWTCQWPGCGVMLAQPKRDEPLPPNAATAHHKRDHKGDIDLFLDPDNLMAVCKQCHDRSIQEATHRGFIAGHDEDGRPIDPAHPWARSGALGAS 
>NZ_CP016592|2030689:2084819|2083959_2084370_-|WP_014537880.1|DBSCAN-SWA MAAKKPQMPRYDAVFRGDEARVSLAKLLWKEVIAALEEVSALSRVNLARADRYVRAKVEFEALYPEAAEAGPVTRGPNGGEVFSFTWSAVEKLNDRMLKLERAMFGEAQARPPAEKPKKGSAPADEFLGSYNGLRQ >NZ_CP016592|2030689:2084819|2078616_2079144_-|WP_013384771.1|head,tail|DBSCAN-SWA MDIQQTSEATGVIITLEDAKKHCRADDFGDDDALISLYIDAAVDWVQSVCQTRLERAEFTATGSRFDLGFAGYPDPEITSVGYVDDLGAAVTLDASRYALRDGRLIVYGAENVASARVVFVAGLGPGNVPARLVQAMRMLVAHWYLNREAVGAGLAQVPLGVRDMVATYRSFAFG >NZ_CP016592|2030689:2084819|2044019_2044730_+|WP_013382997.1|DBSCAN-SWA MSEQHIVSAYDRDLETIQALIFKMSGLVEDAIGRSIEALSTRDVELAEQIRAADKQIDALEEKINDEAARTIALRAPVSKDLRIILSVLRISSSLERIGDYAKNIAKRVTVLAEQRAITESDATLRRMAREVERMLKDTLDAFVQRDATLAQEIIGRDTEIDQMYNALFREFFTHMLEDPRNITACMHLHFVAKNLERMGDIVTNIAEQVIYVTTGNRPEEPRTKEDETPFIGKVD >NZ_CP016592|2030689:2084819|2081058_2082219_-|WP_014537879.1|portal|DBSCAN-SWA MKLWPFGQKNTDPGTGNRFYQQNIVERAVSEHQKQLRVTASLAAGMRIAEGVASLPITVGKLGHDRLGRAIRTPIRAGELTERLTLAPNDWMTPTEFVETLTMWAVFRGVGRAYIHRGYRGRIRALIPINDGGVTIRRDYDTGKVWYDANIPGLGFLSNLTRRDFIEVTCPRWNEIEGLNITAEIGKVLRLALTLEDRQMEDGRKKAVPGYITTDQQLSKDSAKMVKEALKDKLPNTPVFDSGTKYNSIIPTQAELQLLETRRFIIEEVARAYGIHPIFLAHDAAGQSLTRISDAMDYHVTITLGPWVKRWEEAIRFSLLGADEYVDFDETQFYRMDLAARADYAAKSLGNNAAWETPNSILEWLGKNPVEGGDALPAAGGAPITE >NZ_CP016592|2030689:2084819|2072518_2073169_-|WP_013384763.1|DBSCAN-SWA MTLPDLYPLAFLGDLIATRAEVSLKLQRYDEMSGGGDGRRWAAELAPPLWTASFDLNNLSRDCIGRARSTDARFRALGTNRAFLWADPTYSGPAIGAPAHLVNADVRVAGFSADRTRITFSGLPAGFEFAAGDRFSAPWGTGRYYLGEIADGGAVAGGSVQVAVYPYPPLSLQVGAQIELLRPVCRMFVPDDGYTPYSYRRGSAATGASVTMLEKR >NZ_CP016592|2030689:2084819|2044733_2045423_+|WP_013382996.1|DBSCAN-SWA MASQLPHILVIEDEPAQREVLAYNFEAEGYRVSTAPNGDSALLQLAEEPPDLIVLDWMLPGVSGIEICRQIKARAETRAIPVIMLSARSEEGDKVRGLETGADDYVTKPYSITELLARARAQLRRTRPATIGGVLRFEDITLDGETHRVTRDGNELRLGPTEFRLLTTLMERPGRVWSREQLLDRVWGRDIYVDSRTVDVHVGRLRKALMIHGGTDPLRTVRGAGYALG >NZ_CP016592|2030689:2084819|2036594_2037437_+|WP_060486268.1|DBSCAN-SWA 
MRRLLLTSAIAIAVATGAQAQDRILSLGSSVTEILFAIGAEDKVIARDLTSTYPAAAEALPDVGYVRALSPEGVLSVNADMIIAEPDAGPVETIDVLKAASIPWVTVPAGWDAAQIVEKINLIGEATGHAAEAAALAATVTAELETAATAAAEIPEDQRKRVLFIISTNGGRVMAAGSETGGNAIIELAGAVNAVQGVEGYKPLTDEAITAAAPDFILMMDRGHNLDAANDELWAMPVLASTPAGQNQAVIRMDGIYLLGFGPRTGAAALELHNALYAGN >NZ_CP016592|2030689:2084819|2032865_2034467_+|WP_013383008.1|DBSCAN-SWA MQNQTSPLPAEIARRRTFAIIAHPDAGKTTLTEKFLLFGGAIQMAGQVRAKGEARRTRSDFMKMEQDRGISVSASAMSFDFGQFRFNLVDTPGHSDFSEDTYRTLTAVDAAVMVIDGAKGVESQTRKLFEVCRLRDLPILTFCNKMDREARDTFEIIDEIQENLAIDVSPASWPIGSGRDFLGCYDLLHDRLELMDRADRNRVAETVSISGLDDPKLAEHIPTDMLKKLREEIEMARELMPAFDRERFLEGSMTPIWFGSAINSFGVKELMTGIGEFGPEPQPQKAAERMVPAGEGKVAGFVFKVQANMDARHRDRVAFIRLASGHFERGMKLIHVRSKKPMAVTNPVLFLAADRELAEEAWAGDIIGIPNHGQLRIGDALTEGEMLHFTGIPSFAPELLQNIRAGDPMKAKHLEKALMQFAEEGAAKVFKPSIGSGFIVGVVGALQFEVLASRIEVEYGLPVRLESSQFTSARWVSGDKDAVEAFVSANKQHIATDNDGDLVFLTRLQWDIDRVARDYPKVSLTATKEMMVS >NZ_CP016592|2030689:2084819|2054381_2054507_-|WP_013382984.1|DBSCAN-SWA MIRRLDQDGGFAGAFDHQSPLMHADATATAYENVICNFGAI >NZ_CP016592|2030689:2084819|2077453_2077864_-|WP_013384768.1|DBSCAN-SWA MSSPATELQDAIEAAVRADAALMAIIAGIYDRVPDRPWGAANGYISFGPWDSVSQEGGCQAIEDVSLQLDAWSNRTGRAHCEEIMQRLRRIMGQVQTDRHPIVARGDPFSQVLRDPNGLTLHGVLRYEFQMERYDG >NZ_CP016592|2030689:2084819|2050299_2051076_-|WP_013382989.1|DBSCAN-SWA MTKDPAISLCGVSKSFKLEGGRQLTALEGIDLSLAPGEFVALLGPSGCGKSTILRLVAGLDTATTGSVSIEGRSPAALSKAHRLGVAFQDHALLPWLSIAQNIALPFQVAGQSVDHARVAELIALVGLTGFEHARPSQLSGGMRQRASIARALVLQPDVLLLDEPFGALDAVTRRHMNVELQRIWSTRTLTTLLVTHAVDEALFLADRILVMSGRPGRVIRDLRVPFGRPRDPSVMRDPEFHRLVDELTEALEPSGAQ >NZ_CP016592|2030689:2084819|2043245_2044007_+|WP_013382998.1|DBSCAN-SWA MNNPKIMARNVQVYYGDTHAIKDVNVDIDDRTVTAFIGPSGCGKSTFLRTLNRMNDTIASARVEGEILLDAENIYDPKVDPVQLRAKVGMVFQKPNPFPKSIYDNVAYGPRIHGLAKNKADLDDIVERALRRGAIWNEVKDRLHSPGTGLSGGQQQRLCIARAVATEPEVLLMDEPCSALDPIATAQVEELIDELRATYSVVIVTHSMQQAARVSQKTAFFHLGNLVEFDDTTKIFTNPEDPRTESYISGRIG >NZ_CP016592|2030689:2084819|2035527_2036595_+|WP_013383006.1|DBSCAN-SWA 
MAAADTHAQIPTPAEIRQARADNPKARDRDLAESLGVSEAALVAAYVGHGVTRIAANPDQLMPLIPALGEVMALTRNEACVHEKVGTYSEYHANPHAGSVLNPNIDLRTFPKHWVHGFVLEKETETGTRRSIQVFDSAGDAVHKIFLREGSNVEALEAVKDALRLPEQSDVVETTARPAVQGPKADPSKADALRADWQAMTDTHQFLRMVSGLGMNRLGAYHTVGAPYARLLDKTAFQAMLDGVVAQEIGIMIFVGNRGMIQIHTGPIYKLMPMGPWQNIMDPGFNLHLRADKIAEVWAVTKPTSRGDAISIEAFDAEGDIILQVFGVQKPGMEHRPMWNALVEALPSAQVEEVA >NZ_CP016592|2030689:2084819|2061802_2062099_-|WP_013382973.1|DBSCAN-SWA MSDQSPLRSPEFWGGVAVALIVKVRTTQQLGAWQVISTLIVAVGAAWLATDWVSAMTNTPKAVAAAMLTLTAEGIMRWILIAVNDPKQAIELWKAWRK >NZ_CP016592|2030689:2084819|2048212_2048797_-|WP_013382992.1|DBSCAN-SWA MAIKVQQPPDPWQRTTTQHGFAADEVISCIQKSLRRGLLENAILLGWEMFLTSPEMEEMLWSRLCVIAVEDVGLGNPGLPSIIETLYQQHMRYPRPAGDRFLFAAHAIRMIAGSVKERTSDDLVNWARRSVELGERMPEILDIALDMHTGRGQEMGRDYRFFMEEASVVIPEMEGKDQTWKNWIIKALDEGKLT >NZ_CP016592|2030689:2084819|2082215_2083997_-|WP_013384775.1|terminase|DBSCAN-SWA MSSSDLTTDYANRVLSGDIIAGKFVRAACQRHLNDLREGAARGLNFDIDEAARAFRFFPAMFTVTAGAKAGEPFHLLEWMQFVVGSLFGWRNADGTRRFRQAWIETGKGQAKSPLMGAIGLYMIGFNGVPRAEAYAIANDKDQAKVLFSDAVALCRAPIPGKDGATLESVAKVKIRGVGDNAWKIEVPESGAKFLPVASADSISGPKPIAVFADEVHEMRTDKAIQLWKAAIDKMPGDPLMILGTNTPAADQAVGTDYSEFYQRVALGMIEDDSAFSYIARVDVDDDPFSDESCWVKALPALGVTYPIDNVRRRVETAKHMPSERLATERLYFGRPVGSSGFWLADEAAWRAVLGPVAEDDMRGTPCVLALDLSKKNDLTALSAVWRDDDDQLTAKLWYWTTEGGLERREAEDRTPYKSYIAGGDLIAVRGEVIDYTFVAQQVKELCVAQEVEALVVDPAYVSDFIAACEQISFEVWRYMGPDEPEGTGLKIVTHAQGTRIAFEGRQLCMPHSITHMTDKILKREILIAENKMTHVCAANTVLISDGQGNQAFDKQRQRGRIDGMVALAMGVGATKADRKGRRSYMEGGVLFT >NZ_CP016592|2030689:2084819|2078272_2078620_-|WP_013384770.1|head,tail|DBSCAN-SWA MSTAGKLIERVQFESPDMRTGIWTHEYECRAEFIYARGGEAVYAARLEGRSVFKIKIRQCAAAREILQSWRMIDMRRATYEGDVPQTGVYNIREVDPISDRAWIYLLVEAGVAYG >NZ_CP016592|2030689:2084819|2038514_2039306_+|WP_013383002.1|DBSCAN-SWA 
MTVTARQISVKLGRKQILEGVDFTAAGGRLTAIVGPNGSGKTTLLKALTAEIGNGDGVEINGRAINALKPWQLAAMRAVMPQATSLAFPFTAIEVVRLGLQAGVHAADRTLARRALERVGLQDKAEQHYQQMSGGEQSRVHLARTLCQVWEPMAHGKPSWLFLDEPVSALDIGHQLLVMDITRDFARAGGGVVAVMHDLNLTALYADHVVLMRDGAILAAGAVQDVMTSENLSRAYGCALRVNHAPTADHTFLLPHAASSHAA >NZ_CP016592|2030689:2084819|2031909_2032824_+|WP_013383009.1|DBSCAN-SWA MIGVTGGMEATSGGLFSQSLMVYRGVDLTVTDGVALGDSLTFAAGLLPDDYYTLDREAAPARLSFAVAETGRLMIAPGSSIGTAGHSLHVDSVLLLLDRNGRVAEVLLLVEEKGGIAIAVYALPLSPLNRGFEYRLAAVTQRNARHRLFGTPMLQFTHDTLLTLADGSRIAASDVQVGQHLMSGDGNMITVNWIGRSQLMAARDMAPVLVPAGSAGNQSDLILGPRHRIRGRLIRDMIGRSNIRQIEPMVQHYVQILPHRHRSLAVAGVAVDCLMIEAPPPHTANSAPVAQGLPLPAVAQNDWR >NZ_CP016592|2030689:2084819|2055799_2055982_+|WP_014537870.1|DBSCAN-SWA MHGRLRYLGLRSHQPISTRDRGLNASRSEHKCEVANPRSLGYDSGARHAPTAIGACAFDS >NZ_CP016592|2030689:2084819|2076764_2077223_-|WP_013384767.1|DBSCAN-SWA MAAPFHEEYHELVFEFSEDNGTTWARNCVIMGADVTRTASTSETETVDDCDDESKPNNVSVRVQSLAVSFSGTGNWTQGGYDTFLKKFYAGDSTEMLARIGNLNAGAGEIEYETGPIIITSLGQSRVKGAVVSASVEARFAKTPTRSLKVGS >NZ_CP016592|2030689:2084819|2055994_2056600_+|WP_013382982.1|DBSCAN-SWA MNALRVDFVTGAVAYRLRSGGGKSQPIARALGFRAGQDMNVVDATAGLGRDSFLFASLGANVTMIERSAQMYALLRAGMDEARAAGPEFADIINRMTLLHGDAMQLLPGLSPDVIFVDPMHPPRRSSALVKLELRQVREIVGFDEDAADLMRVALAHAKKRVVLKWPRKGDAMAGIPAPSHQILGKSTRYDVFITKRGPGV >NZ_CP016592|2030689:2084819|2030689_2030932_-|WP_013383011.1|head,protease|DBSCAN-SWA MFNNIGPMGILLIAVVVLVLFGRGKISNLMGEVGKGITAFKRGVDDGKKELEVSATELRDVTPAAPVATPVAEEAPKDRV >NZ_CP016592|2030689:2084819|2077863_2078259_-|WP_013384769.1|DBSCAN-SWA MRRKLRTHAKAAIEAGRSQARKEGEEIASLARAFAGASDGDLAASIRVEEADAVMTSQGRSGFIGVVVRAGNDATIVTNKQGERFQNAKLQEVGTQSMPAKPFFNPAKRLRRKQAMAAIRRAVRKAWKEGG >NZ_CP016592|2030689:2084819|2054600_2055764_+|WP_013382983.1|DBSCAN-SWA 
MPTFTLQIRTPAAGYDVVDNDQGKLSTYSDAIIVNTDSNVFSSAAYRSALDVYREQTAGTNPSYDQYLTYLIDLLPQFGVNTTAEDFTQQHGPTFPAGSFVFFGHTWSSSPGGKDFDVMEVLLLDPEKLTSQINYVAPVYGFIGDIPDAGEPFQVMHNTHQAGYLPDKPPISFDISRYMLAPCFTAGTFIETDRGDIAIEALRIGDLVKTIDNGLQPIRWIGSSRICSNALSSNTKLRPIQISADALGAGVPAHDLVVSPQHRVLIRSKISDRMFGAAEVLVPAVKLTALPGIFTDNSCDPVEYFHILFDQHEMVRSNGAITETLHTGPIALRSLSSAARAEIFAIFPELAALGVPRPLARSTPAGKDAAALIARHLKNQKPVQIGL >NZ_CP016592|2030689:2084819|2060070_2060499_-|WP_014537872.1|DBSCAN-SWA MLPVEEAYWLSDAWREKNLRYYPWHGRGFVQLTWKANYQKASAKIGVDLIGDPSRAMEPDAAAQILVHGMIGGWFTGKKLADYIDGARVDFVGARAIVNGKDKAAEIAAIATAYLAALPEDQGSIWLRIFKAFWGIITGKKQ >NZ_CP016592|2030689:2084819|2048826_2049561_-|WP_013382991.1|DBSCAN-SWA MTFLSRNWPILLLLAIWQLGVSLSGLNSIVLPAPLPVLMDMLGNPGLYAVNTLQTLWTAFAGLLIGSALGVLVACLAYASRFLAGVLTPFGLIFSSVPVVALIPIIARILGYGSTTVIAIVAIAAFFPTFVFVGKGLQQLPRGADDLMRVLGAGRLKRFSRLVLPSAVPDMMIALRLIVPESVLAAILAEYLMGRSGLGYVFAQATSRFAMERAFGVSLVVTLTAVLCFFLAHRAERAVKARWS >NZ_CP016592|2030689:2084819|2071066_2071495_-|WP_013384760.1|DBSCAN-SWA MVQKGSLTRLPDWRARLSHALDIQRDHPFEWGRHDCGLGLAAGAVEAITGEDLRPAWANYKTPTGALRVLRKAGYESLGDAMAALLPEVHPAFAQIGDLALLDGEGQIGALGVIDISSVIVLDKTGHARVPRDQIKRAFKVG >NZ_CP016592|2030689:2084819|2062362_2063667_-|WP_014537873.1|DBSCAN-SWA MLALGFGLTALAALAGARAGAVPVISLASSSGYAGAELVSSVEGQWYADGVAIPGAWGGALMITPDLEGAAIFLRVDPHIWVSDGDPGYAGSSYQSSVFGQWQADGVDIPGAVGLRWQMTPAYEGAAISLDYRPIIRIVQGAGYAGSRLRSNLPGQWFADGVAIPGSIGRDLIISPALEGAAISQDAEVTIMQSNTIQMWVPERALTAPQKANGGLWLGEDRTTVERNGADYVTAWRDKFGVRDMTQPTAANQPRAGTFRGMPAVIWDKSPAYQYLQPPAAFAPMWWLILAEFATGVEIISGTSTGSAIYTQILGNGQAALARVSFQQPDAVQSGTSAIRLNAADTEVSSGIFPMPMGSMSFGVSASASWCIGRGFDANNNRQWVGPILGAIALGVVPDLATRHLIEAYMHWRHGLEERLPANHPYRNAPPRVQ >NZ_CP016592|2030689:2084819|2045879_2047112_+|WP_013382995.1|DBSCAN-SWA 
MTDDPAGQPPRAPAENPILVVGLTTAIQAMTSYGLLSLPVASVFYAADFGLPAWIVGVQISGIYCVALFSSLIASNMVRRLGGGRTSQIALLAMALGVACIASGLGALLLPGLVLMGLSYGLPNPAASHLLRRFTPPARRNLLFSIKQAGVPIGGAMGGIVTAWIAHHVNWQAALCLPALLSLTLGVVLQLVHKGWDDDRQRDQRVLQAPLRDLIRVFGFPRFKAIFASGMLLAAAQLCVSTFIVLLLVVDLQIDPITAGAGLSLLQIAGILGRISTGALADFFRSGLRVLIWLALALAATTLVLVLTPAPSALLLTALLIVIGLLSSGWSGVLIAEADRCAPPAYASAATAALMCGTFFGVMISTTAFAGIVQLFGTYRTPFAMIAMGCIVAAGLLRIAYRADINDKEV >NZ_CP016592|2030689:2084819|2064773_2065925_-|WP_014537876.1|DBSCAN-SWA MIELDFAKIRALRIAGRDVTELRRGAVLMWAKPPEISLASGSGYAGSVYAATQPGGQWFADGVPIHSATDQTWVMTDAYEGAVIQYDIAIQPQSVEISITSGAGFAGSVYSASRGGGQWYADGLPIPGARGQTWTMTIALEGAAISYITFTAPRSNRIQMWTPTVLAAALKEGWWSMRRGVQLAADDRVAAIADSFGLRDMLQTSASLQPRTVVQMGRRVMTFAQEEQTYLLAASSHYGRYVYAVAQYKTGVETIFASYATLWGMIGSGAGRVRGNRDTNGLQQTPLVRMNGHAPTAAVLPMAGAVLGTAPHRNSEALRAWGIGAGQSASFGWDGLIAEVIALSNEPSADDHDRLSGYLAHSWGQADSLPAAHPYKSLGPRID >NZ_CP016592|2030689:2084819|2058288_2059491_+|WP_013382978.1|DBSCAN-SWA MPYLTAVNGAVVNLSLPTPILTLPSTNILNAFGTSTLFSQYNVDGVGDGSTPETVQGGDYLAPIIGGSPVPGTYAGSGTFQTAGLTVGNAFLGATVRLNPVDVDYFVDENDQLYIISDAPLDAANLTVTITVNALGTSTPLTLPLTDLLTNPIVAPVLGLLGGPNAVNNILNQVINSQTFDPNGTMTIPPGEINDIVCFVAGTMILTPDGYRMVETLQVGDLVMTKDNGAKPVKWVGVRKLSAAEIIVNQHLRPIRIKAGALGVNIPSQDLMVSPQHRVLVRSKIAQKMIQSDEVLVAAKQLLQLGGIDIATDLTEVEYHHFLFDQHEIVFSNGAETESLYTGAQALKGVGAEARREIFALFPNLLDQEKAPIEARPMLTGRKGRRLAMRHMQANRPLVV >NZ_CP016592|2030689:2084819|2047252_2047576_+|WP_081447077.1|DBSCAN-SWA MKCPVDNETLVMTSRNGVEIDYCPTCRGVWLDRGELDKIVERSEQVVAAAPAPAAAPQPDRHRDDDRGGRYRDDRGRGDRYDDDDDDRRDGRRGRRRESFLGDLFDF >NZ_CP016592|2030689:2084819|2047616_2048189_-|WP_081447076.1|DBSCAN-SWA MAGPDVFFRNASEVMRKKGEIALEYGFDPSTLAEDDLDPVGKTARQFGISIGLANERKMDEADFIIANLTPFRGISADVGTAFEVGYMRAQGKPVFAYTNTDRDYFTRLSADYYAGAQPEVIDGVARGADGLMIENHDMVDNLMLDTAAEESGGAFVVGAVAADADLLGDLQAFRDCLAAARAYWDARTS >NZ_CP016592|2030689:2084819|2053242_2054385_+|WP_065739322.1|DBSCAN-SWA 
MKKCLVVGAGLSGAVIARQLADAGQFITVADSRAHIAGNCHTARDADTGIMVHTYGPHIFHTDDREVWNYVNAFATFMPYQNRVKTTTRGAVYALPVNLHTINQLFNTALRPDEARAFIAAKADITITDPQSFEEQALQMVGREIYEAFFKGYTNKQWGCPPSALPAAILKRLPLRFSYDDNYFCQQFQGITKDGYTAMVARILDHPRIAVRLNTHVRRDEIKGYDHIFYSGPIDAWFGYTLGRLTYRTLEFERFYHDGDYQGCAVMNYADLEIPYTRITEHKHFAPWEQHARSVLYREFSRACGPGDIPFYPTRQSRDKDLLHSYAALAAQETHVTFIGRLGTYRYLDMDQTIREALDCSRAWLKGAGQRPTAFHLDPA >NZ_CP016592|2030689:2084819|2065921_2066326_-|WP_013384756.1|DBSCAN-SWA MAITTVTVTGELRDIAGAPQNLSLIRFTPRGWDKSGAAIITGAPIDVTVTGGAFSASLFRQDLGQGVVYDVAYVLPRERITTIGSIFIDGPGPFALADLLGVPVPFGVTVTLVEGGSWPPPADPNPLHWYARVK >NZ_CP016592|2030689:2084819|2076381_2076765_-|WP_013384766.1|DBSCAN-SWA MAEAQVINWTCGEHAFRLRIGEAEALDDLTPQGIADFRFRCRQGIERGSLGFSPVRVREVIDCIRLGLIGGGMEGDAARALALRAMEEADFAELVKICYGIVTGFFSGKDHDQPEKPVAAEMTDENG >NZ_CP016592|2030689:2084819|2037441_2038518_+|WP_060486269.1|DBSCAN-SWA MTFALSHAASVDNADPREVRARRLTLLLIAALVVTCGASVMFGASGTSVTKVLGQLWRGEEIALIDQIVLLQVRIPRMVLGVLVGASLAVSGAVMQGLFRNPLADPGLVGVSAGASLGAITAIVLGGFLPAAALAFVGGWLVPAAAFVGGWGATMALYAVATRSGRTSIATMLLAGIALGALTGAISGILVYRANDNQLRDLTFWGMGSLAGANWPKVLSAAPLIVIALAVAPFLARSLNALALGEAAAAHMGIPVQKMKSVAILTVAGATGAAVAVSGGIGFIGIVVPHLLRLAAGPDHRHLLVNAGLLGAIVLLLADMISRTIVAPAELPLGIVTAVLGGPVFLWVLLRQRGVVDL >NZ_CP016592|2030689:2084819|2059870_2060074_-|WP_013382976.1|DBSCAN-SWA MTAVWARIIMRYLSGALVSAGFISADFGAQLATDAELHGVLVMALGAVLAVIAEWAYRLAKRFGWAT >NZ_CP016592|2030689:2084819|2061040_2061769_+|WP_013382974.1|DBSCAN-SWA MLMVFSQNPLNANQIAALSFQAVAFDTLTGQAVVAPCFTRGTLIMAMSGMVPIEDLRAGDLVDTIDNGLQPIRWIGSSTVSGAALKATPKLRPIHISAGALGEGLPSQDLRVSPQHRILVRSKIAVRMFGAAEILVPAVKLVALPGIYSVDECDSVEYFHMLFDRHEIVLSNGAESESMHTGPVALRSLSSQARAEILSIFPELEQIGAARELARPVPSGKDLAAMFDRHVKNTQPIQRPLV >NZ_CP016592|2030689:2084819|2052274_2053171_+|WP_014537869.1|DBSCAN-SWA 
MNHGNRLFSGLPAESTVARAIQNALPAMSEAQRRFAALVQAEPLRVARLSINDAVSGADVSVATANRFATALGYAGYPEFRADLIRAFEDFFVPVERLKRRQAEKRSAMDIAQAAFAEDLESIGATASSLDSASLEAAVQQIIAARRVFVAGFDLSAHLGGMLAIGLVMTGCDAQTVPSGGGAVGAVRTLTRMGPQDLVITIAFPHYYRDTIDMAGFAKGAGIPVLAITDSPRSPLVPLAQVALYVTAKQELNAPSPSSAAILSLIEALVATVASQRPEAAEASERFASSAYPWMTNR >NZ_CP016592|2030689:2084819|2080395_2081049_-|WP_013384773.1|head,protease|DBSCAN-SWA MAMEFKLARLDAKAATEEGVFVGYASRFNVVDQGGDMVRPGAYAKSIAGQKSVKLLWQHDAAQPIGVWTSISEDEKGLRVEGRLALETVKGRETHALMKMGAIEGLSIGYRTKDADIIDGIRHLKEIDLWEISVVTFPMEDGSGVDAVKSSVQIMRAAKDGDFAPLKKSVEVALRDAGLPVWLRKAIAARAPEALGDGQRDASASETAKAIKDAFKF >NZ_CP016592|2030689:2084819|2034521_2035292_-|WP_014537866.1|DBSCAN-SWA MISSLAVGLALFAAAAPLAAQQVTPGRLAVEANAKEDVNGACRVSFVVGNGLEENLYKLSFEAVVMDTAGEAAQFTLLEFTEVPQGAVRVKQFDLRGRACDDVGQVIVNRTLTCEAVGLDPQACAQNLDVASRISTPDVTEGQISVELNRLESFEGSCRLTFVTRNGLSQPLGGMMAETALFDTNGGLSRLTVFELGDVAAGQTRVRRYEVTGSACDGLGNVLVNDWQSCTVEGTEAGACTAAINVTSRTAIGLLQ >NZ_CP016592|2030689:2084819|2062183_2062342_+|WP_156771220.1|DBSCAN-SWA MTTAREQKLYDALLDYIACYGLSKAAREALREYEATSPAKLQARELRPVLLE >NZ_CP016592|2030689:2084819|2064035_2064350_-|WP_014537875.1|DBSCAN-SWA MDLDSLDRRELVQLLVEIDQALVSVDDRNRKCALRAAEHAVSRFGLSLNQLADDIRSMKEVTPRGVARYRNPSNPAQTWSGRGRRPTWVHELGLKGLTLDDCAL >NZ_CP016592|2030689:2084819|2079194_2080373_-|WP_013384772.1|capsid|DBSCAN-SWA MDLEIKEALDNASKTLAEVKKAQVDLSDQLKVLDQKKASGEDITDIKGHVEDSRKGLTELGDQVADLTKKLSHRGADLEGKSLGRIIAEHDEFKTLKDDRKARFEVKDVTTGSFGTIDLPGGVRRGSRGLIQPVNQALFLRDIIPTAATTAAVIEYLQESGYTNNAGVVAEGAQKPQSELDFEPKAAPMVKMAHFFRVTEETLDDVDGMEAYINQRGLYGLQLKEEGELLNGPGTTGRIDGLLANSTTYDSGLVPGITPVNAMDDIRIAIAQVAEADLLASAIVMNHLDAAALDLAKDADGRYLHPAFAGNTAWGLPVVRTKGLPQGKFIVGGFIGNTLIWQRKGIEVRRSTEDRDNFVKNLVTILLEERLQLETLRPEGIVYGDLTAAGGE >NZ_CP016592|2030689:2084819|2077263_2077461_-|WP_014537878.1|DBSCAN-SWA MAKAIFHRDFHYTSRLRNAGWSAKAGPDPQHFPQEFIDAAVSAGCATVPPPRRSTHKTPDDPAQP >NZ_CP016592|2030689:2084819|2076202_2076349_-|WP_013384765.1|DBSCAN-SWA MGFAPEQVQRMSLWEFHSAFAAWKRFNGIKTEGGGEVTLDRLKALGVK 
>NZ_CP016592|2030689:2084819|2064502_2064772_-|WP_013384754.1|DBSCAN-SWA MTQKYWTGAEAAIIAAEAAATALVTGLPEYRDGQEVAPEARVTARWAEPRETATPGTFAIPAYPGMDVPEGCAEADGVSLPKVMEDELG >NZ_CP016592|2030689:2084819|2051131_2052190_-|WP_013382988.1|DBSCAN-SWA MPKIFTGMLKTMTTALPLSRRSLLALGGAAAAGFSLPRMAIAQSMPVVSTALGWIPNAEYAGLWVAIEKGYFAEEGIEIAYTPGGPNAPGVLVRLAAGQADFAGGDWIPLFETLNRDNDFVVLGAAFPANPAALMSLSANPVLTPADLVGKRILSQMPADKNTIDFILTKAGLPLDYEMVPTGFSPEPLLAGDGDVYMAFATNQPITFEKMGMVAGQDFHVTLLRDLGYDVPAGPIVAKRDYVAQNRPLVVGYLRALLRGWIENGANPAYGAELGATKYGVDFGLDLDQQIRQSELGQPLTAVAGAPGPFWFDPALYETNILPVAAAAGLTGLPAASDLIDLGPLEEAIASL >NZ_CP016592|2030689:2084819|2057444_2057852_+|WP_013382980.1|DBSCAN-SWA MCNLYSHTSTQEAMARLFKPNEVVDRLGNYEPQPEIYPDQLAPIVRAEGDQIILQTARCGLPTPETYLEGHAVDRGVTNIRNTSSPHWRRWLGTAHRCLVPLTSFAEPAGKGKGNVWFHLADYRPAMFAGLYVPD |
63 | Paracoccus_phage(14.29%) | capsid,terminase,head,tail,portal,protease | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
DBSCAN-SWA_5 |
2092027 : 2098098
Sequences of DBSCAN-SWA_5
Nucleotide sequences of DBSCAN-SWA_5 >NZ_CP016592|2092027:2098098|DBSCAN-SWA ATTACATGCCCATCGCTTGCTTATAGAGGTCGAAGATCGCTTCTTCCTCGGCCACGTCATTCTGGTCGCGCTTTCGCAGCGCGATCACCTTGCGCAGGACCTTGGTGTCGTAGCCGCGCCCCTTGGCTTCGGCCATGACCTCCTTCATCTGCTCAGCGACGTACTTCTTCTCAGCCTCGAGCGCTTCATAGCGCGCCACGATCGAGCGGATCTCGTCGGCCGCCACCCCCGCGACCTTCTCGGCCACATTGATGTCCGCCTCGGTGTTCTTCATCGGGATGCTCAATTGTCCGCCTCCCGCTGAATCTGGCTGAGGCGAACAAATGCCTGGTCGATCGTCAGATCAGAGTTTTTGGTTTGCCAGGCTTTTGTGGCACCTATGAGCCAAAGGATGAGAAAGCAAAAACCCACCCACTGCATAGCCACGCTTTGCGTCACAATGCCGATGCCAATGACACCGACCTGCGCACCCGCGATGTAAATAGCGGATCGAATTTTGGGCTTGTTAGTCACATTGATCGTGATCGTAAAATCTTTGGTCATGCTACCCCCTCAGGTTGGGACATCGCCGCAGGCGCAGGGAAGCGGTCAGTCTGGTCGCCCCAGACATCCCAGCCCGCGCGCTCCTGGCGGCTGAACAATTCGATGCGGCGCGCGCTGGGCATTAGCCGCTCGGCCGCGGTGAAGGCTTCCTCGGGCTTGCGGGAATGCTGGCGTCGGGCGCCCTCGATGACGGAGCGCACACTGCGGGTCGTCTTGGGGTTGCCCCGCGTCCCGATCAAAAACGGCTCGCCCGCGCAGCGCAGGATATAGCCCGTGCCAAACGCGAGCTTGCCCGTGACCAGGTTGCGCTTTGACCAGTGGCCCGCGGTTTTGAATGTAAAGCCCCAGTACCGCAGCATCGTCAGCGCCTGGGGAAGCATGGGGTTGGTCGCCCACAGCCAAAGCAAGCAGTCAGGCGCCGCTATCGAGGCGACGGGCAGCTCGGAGATCCAACGCAGATCACGCGTCGCGTAATGCGCATCTGCGGACTTGCCCTGCCCCTTTGCCGAATAGGTCGCAAAGCTCCAAGGCGGGTCAGCCATAATCAGATCGTATTGCTCGATCGGGAACGCCGTCATCGCCGCGCCCCTTTATGTTTGGGCCAGCGATAACCGCGATCGACAACAGCGGCGCGCTCACCGCTTTCGCGCAGATCGGCTTCGAATACCGCCTGCGTGGCCATACCGATCGTGCTTTTTCCAACACCGAGAATGTCGCCAACTTCGGACAGTGAAATGCCGCGCACGCGCATCGCTAGCGCGCGCAGGATCAGCTCATCATCCTGGCGGGTAGGAAGGGCACTCATGGCCGACCTCCCCTTCGCTTTTTCCACTGGTAGCCAAGCTCAACGACCGCTGCAGGCTCCCCGCTTTCGCGAAGGTCAGCGTCTCGCACCGCGATGGTGAAGAGCCCCACATTGCTGCGCGGCATGCCCAGCGCGCCGCCGATCACATCGAGAGGCTTGCCAGACTGGCGCATGGCCAACGCCTGTAGCAGCAGTTCATCATGTTGCCGGGAGAGGGCAATGGTCATTTTGATGCCCCCTCACCTTGATCGCAAAGCGGCTGAGCTCCATGTACTGTGACATGAGATTTCATTCCAAAAAGCTCGGGTGGACAGGTCAAGCCTGCCTCATCAGCGATCGCTTTTACGGCGATGAACCAGGATGATGGGAAACGGCCTCGAACGATCGCATTGCTTACAGCCGTGGGCAAAACACCAACTGTATCTGCCATTTTTCTACGCCCAAGGGCGTTTGCTAGATTGGATGCTGCTGTCATGACCTCACTATGTCCACACAATGTGGATTATAGCAAGTCCACATTTAGTGGTTTGCCTAAGATTTCACAATTTGTGAATTATTCAAGCATGGGCAACAAG
CGTTCTGAACCTGGACAATATGCTGAAATTGGTGATCGCCTCGAGGCGATCCGTAAGGTTTTCTCAGATCTCAACCAAAAGGACTGGGCTGAAAAACACCACTTTGGCCATTCCACATACAACAACTGGGTTACCGGAATCAGGCGTATACCCGTCGAAAGTGCCGAGGCGCTCTGCGACCTCTACGGCGTTGACTTGGACTTCATATACCGCGGAAGAAGAGACGGTCTGCCGGACAATCTCAAAAAGGTACTCTGATCACATCTCCCCATGTGCTTGACCACATGGTCTAGGGGCACGTCCAGCTCATCTGCAATTTCGATCATCCGATCCAGCCGCCTGTCTATTTCTTTTTCCGCCGTGCACTCGCCCCTTCTCAAGCCATACTAAACTTGTTCCCGTTTTGTTCAATCCCTCTTTGTCAGGATGGCAAATTATGTTGTGGACAGTGAGGCTTATAGTTTCCGAGGATGCCTGGAGCAGAATCGGGAGAACAACGTATGTCGATCGACGCAACACTTAAGGTATTGACTGAACGCGTCCAACAGCACGCGAACACGATGCTCACGGAGGAGGCTGTCAAAACTGCGGTTGTCCTCCCGTTCCTCCAGGCACTCGGCTACGACGTATTTAACCCGGGTGAAGTCATTCCCGAGTTTACGGCAGATGCTGTAGGTAAGAAGGGCGAAAAGGTAGACTACGCGATAAAGCTCGATGGCGAGATTCGCATATTGATCGAATGCAAGCCGATCAGCACCAACTTGGACAAAGTGCACCTCGCACAACTTTACAGATACTTCTCCGTGACTAGCGCAAAATTTGCGGTTCTCACGAACGGGAGATTTTTCCACTTCCACAGCGACTTAGAAGAGCCAAATAAGCTTGATACAAGGCCATTTTTCACCTTCGACATCACCGAGCCGAATAGCCAGGTGCTTGGTGAACTCAAGAAGTTCGAGAAATCTGGCTTCGATGTAGACGGCATTCTCGCGAATGCTGAGCGCCTGAAGTACACATCCGCGTTGAAAACCGAGATCAACAAGCACATGGACGCACCATCAGATGAATTCGTGCGGGTCTTAGCCGCGCCTGTTCACGAGGGCCGCTTCACTGCAGCCGTGCTCGATCAATACCGAGGGCTAGTAAAGTCGGCATTCCGTGAAATAATCCGCGACTCTGTCCAAGAACGTTTGTCCTCCGCATTAGCAACAACTGACACTGCTGCTGAAATTGCCGAAGAACCGGTCGTTCCAGATGCTGAGATTGTGACGACTCAGGATGAAGTTGAAGGATTCATGATCGTCAAAGCGATTGTGTCTTCGGTGATAAAGCCCGGCAGGGTCAGCATGCGTGACCAAAAGAGCTACTGCGGAATACTCGCTGACGACAACAACCGTCGTCCGATAGCTAGGCTCCACTTCAACCGCTCTACCAAATACCTAGGCCTCTTCGATGGAGAAGCAGAAGAAAGAGTAGTCGTTGAAAGTTTAGATGACATCTACTCTCACGCCGAGCGTCTACGGCAGACAGCCATCAAATACCCGACGCCCTGATAGCGCCCCACCCGAACAGATTTCAAGCAGAGGCCCGCATTATCAGTGCGGGCCTTTTCGCATCCAACATCCGATTCGGCGGATTTCACGCAACTCTCTTGCCTCATCATGACAACTAAAATCCACCATGTGTGAATTTTAAATTGCATATCCACATTTAGTGGATATTTTAACTTCATATCCCGCAGCGACTCACCCTCGCAGACGGTCACAGATGGAGAACTTGATGAGCATTCTTTACGCCCACCCTGACGGCCTGATGTCGGCTCAGGACCTGCGTGAAACGCTGGTTCGCGACCACATCCGCGCAAGCCGCGTCACTCTGGCCCGGCCCCGCGGCTTCTTTGACATTTCCCTGCACCGCGACTGCGAGGTCGACGGCGCCGCGCTGGCCGACTGGGCTTTGCGTCGCCCCGCAAACGGCCCTTCGGCGCCCGCCGCCTCGCT
GGCGTCCTGAGGCTACCGCGATGAACGGCTTCACGAACACAGAGGCGCGGCTGATCTTCCTGATCGCCAAGGATTTTCACATCGCGGCCGAGATCCTGCCCAGCACGAGCCTGCGCGGTGACCTGCGCATGGACAGCCTCGATCTGGTCGACCTCTGCGTAAAAGCCGAGGATCTGTTCGACATCGAGATCAACGATGCCGAGGCGACCGAGGCCACGAGCATTGCCGACCTCGCGCGGCTGATCGACCACGTTTTAGCCGAGGAAGCTGCAGCATGAGCCGCGTCTCCTTCGCCCTCTCGATTGCCCCGATGATACTGGGGATCGCGCTGCTTGCGTGGCATGCGTGGCCGACCATCCAGTGGGCGCGTGCACTGTCGCCAGCGGCTGGCCCCCATATGACCTGCGCCACTGATCCGGTCATCAACTGCCGCCTGCCCAGCGCGGAGACCGGGCAATGAGCGCGGCAATGATCCACGAGCTTGAAAGCCTCATCGCGCGCAGCAAGCGGCTCGGAACCTGGTCGCACACCGCGTCATCGGTCTATGGCAGCGAAATGACCAAGCCACACGCCCTAATACTGGCCGCCCCTGTTGAGCTGCCGCATTACGATCAGGCGCCCCCTTTTGCTGTGGCGATCGCAGGTAGCCTGGTGGCGCAGCCCGATCAGCTGCTGAGCGAGATCGCCGATCAGCTCAATGCCGTGCCGGTCATCGCGCAATATTGCTTGGACCTGCTGCGGGAAAAGGAGTCCGGCCGATGACCGTGTCCGACGCCCAGCGCGGCCTCGCCGACATCCTCGAAGAGCTGTTCGGCCTGCTGCCGGAACGCACCGCGCCCACCGCGCACCTGCGCCACGACCTGGGCCTAGACAGCCTCCACCTGATCGAGCTGTTCATGGAAATCGAGGTCCGCTTCGGCATCGAAATCTCCGACGCCGCAATTGAATCCGTCCAGACCGTCGCGGGCCTGTCCTGCGTGATCGAGCATCTGGTTACCCAGAAAAGGAATGCAGCATGACCGATACTGTTTCAGCAATCATCGACCGCCATGCGCTGCGCAAAGCCATGACATCGCTACAGCGCACCGTGCAGCAATGGGCAGCAATTCCCGCCCTCAAATATATCTCAATTTCTAGCGGGCTGGGCGAGGTCACACTTCGCGCCACCGATCTGGACAACTTGCTGACAATCAAGCTCGAGGCAGAGACAACGGGCACCGTTCCGTTTCTAGTCTCTGCGGAGGTGCTGCAAAAGTTTGCCTCTTTGGCGGCAGGCCCAGTGACGGTCACGCGCACACCCGATATCGAGGGCAAGGATGCTCTGATCACGATCACAGATCAGGAAACCACCCTGCGGCTGCGCGAGCGCATCCAGCACGATGACTTCCCCACCGTTCCAGCTTGGGACATGAAAGACGCTGCCCGCTTCACCGGGAGCGGCGCGGAGCTATCGCGCATTCTCGATCTATCTCGCCATTGCGTCAGCACCGAAGAGACCCGGTATTACCTCAATGGGATCTATCTGATCACAGCACCCGAGCGCAGCACGCTGCGCGCTGTCGCCACAGACGGCCATCGGATGGCAGTAATCGACAGCAATATCGAAGCGCCCAACCTCGCAGGCGTCATTTTTCCAAGATTTGCGCTCGATGTGTTTCGCGGGCTGCTAGATCCCAAGTCGAACACCCCAATCAAAATGCAATTTGAGGAAAACAGGGGCATCATCGAGGGTGACGACTGGATCCTTCACTCCAAGATGATCGACGGCACATTCCCTGATTACACGCGGGTCATCCCCAAACATGAAACGAACTGCGAGGCGCACTTCACCCGCGCAATCGTGACAAAGGCGCACCGCCTCTCACAAGCAGTCAGGGGCCACATCGCGGCGGCCGCAACAATTACCTCGGATGGCAAGATGCACCTGATGCATGAAGGTGAGGACGAATCTGTATCCGTGCCTGTTGGGGCATCACCCAACTTCGATCTTGGGCGCCATG
GCTTCAATGTCAAATATCTGAACAAACAGGCACAGGTCACGCCGGAATTCACCCTTCGCGCGTCCACTGAACGGCCTAACGATCCAGCGACCATAGTTTCCGATGATCCCGACGCCTGCTGGATACTGATGCCGATGAGGGCGGCATGA
Protein sequences of DBSCAN-SWA_5 >NZ_CP016592|2092027:2098098|2093370_2093601_-|WP_044008065.1|DBSCAN-SWA MTIALSRQHDELLLQALAMRQSGKPLDVIGGALGMPRSNVGLFTIAVRDADLRESGEPAAVVELGYQWKKRRGGRP >NZ_CP016592|2092027:2098098|2096405_2096711_+|WP_013384789.1|DBSCAN-SWA MSAAMIHELESLIARSKRLGTWSHTASSVYGSEMTKPHALILAAPVELPHYDQAPPFAVAIAGSLVAQPDQLLSEIADQLNAVPVIAQYCLDLLREKESGR >NZ_CP016592|2092027:2098098|2096964_2098098_+|WP_013384791.1|DBSCAN-SWA MTDTVSAIIDRHALRKAMTSLQRTVQQWAAIPALKYISISSGLGEVTLRATDLDNLLTIKLEAETTGTVPFLVSAEVLQKFASLAAGPVTVTRTPDIEGKDALITITDQETTLRLRERIQHDDFPTVPAWDMKDAARFTGSGAELSRILDLSRHCVSTEETRYYLNGIYLITAPERSTLRAVATDGHRMAVIDSNIEAPNLAGVIFPRFALDVFRGLLDPKSNTPIKMQFEENRGIIEGDDWILHSKMIDGTFPDYTRVIPKHETNCEAHFTRAIVTKAHRLSQAVRGHIAAAATITSDGKMHLMHEGEDESVSVPVGASPNFDLGRHGFNVKYLNKQAQVTPEFTLRASTERPNDPATIVSDDPDACWILMPMRAA >NZ_CP016592|2092027:2098098|2092565_2093147_-|WP_013384784.1|DBSCAN-SWA MTAFPIEQYDLIMADPPWSFATYSAKGQGKSADAHYATRDLRWISELPVASIAAPDCLLWLWATNPMLPQALTMLRYWGFTFKTAGHWSKRNLVTGKLAFGTGYILRCAGEPFLIGTRGNPKTTRSVRSVIEGARRQHSRKPEEAFTAAERLMPSARRIELFSRQERAGWDVWGDQTDRFPAPAAMSQPEGVA >NZ_CP016592|2092027:2098098|2092308_2092569_-|WP_014537887.1|DBSCAN-SWA MTKDFTITINVTNKPKIRSAIYIAGAQVGVIGIGIVTQSVAMQWVGFCFLILWLIGATKAWQTKNSDLTIDQAFVRLSQIQREADN >NZ_CP016592|2092027:2098098|2092027_2092312_-|WP_013384783.1|DBSCAN-SWA MSIPMKNTEADINVAEKVAGVAADEIRSIVARYEALEAEKKYVAEQMKEVMAEAKGRGYDTKVLRKVIALRKRDQNDVAEEEAIFDLYKQAMGM >NZ_CP016592|2092027:2098098|2096223_2096409_+|WP_013384788.1|DBSCAN-SWA MSRVSFALSIAPMILGIALLAWHAWPTIQWARALSPAAGPHMTCATDPVINCRLPSAETGQ >NZ_CP016592|2092027:2098098|2094449_2095502_+|WP_014537890.1|DBSCAN-SWA MSIDATLKVLTERVQQHANTMLTEEAVKTAVVLPFLQALGYDVFNPGEVIPEFTADAVGKKGEKVDYAIKLDGEIRILIECKPISTNLDKVHLAQLYRYFSVTSAKFAVLTNGRFFHFHSDLEEPNKLDTRPFFTFDITEPNSQVLGELKKFEKSGFDVDGILANAERLKYTSALKTEINKHMDAPSDEFVRVLAAPVHEGRFTAAVLDQYRGLVKSAFREIIRDSVQERLSSALATTDTAAEIAEEPVVPDAEIVTTQDEVEGFMIVKAIVSSVIKPGRVSMRDQKSYCGILADDNNRRPIARLHFNRSTKYLGLFDGEAEERVVVESLDDIYSHAERLRQTAIKYPTP 
>NZ_CP016592|2092027:2098098|2096707_2096968_+|WP_013384790.1|DBSCAN-SWA MTVSDAQRGLADILEELFGLLPERTAPTAHLRHDLGLDSLHLIELFMEIEVRFGIEISDAAIESVQTVAGLSCVIEHLVTQKRNAA >NZ_CP016592|2092027:2098098|2093597_2093849_-|WP_014537888.1|DBSCAN-SWA MTAASNLANALGRRKMADTVGVLPTAVSNAIVRGRFPSSWFIAVKAIADEAGLTCPPELFGMKSHVTVHGAQPLCDQGEGASK >NZ_CP016592|2092027:2098098|2093847_2094207_+|WP_014537889.1|DBSCAN-SWA MTSLCPHNVDYSKSTFSGLPKISQFVNYSSMGNKRSEPGQYAEIGDRLEAIRKVFSDLNQKDWAEKHHFGHSTYNNWVTGIRRIPVESAEALCDLYGVDLDFIYRGRRDGLPDNLKKVL >NZ_CP016592|2092027:2098098|2095969_2096227_+|WP_013384787.1|DBSCAN-SWA MNGFTNTEARLIFLIAKDFHIAAEILPSTSLRGDLRMDSLDLVDLCVKAEDLFDIEINDAEATEATSIADLARLIDHVLAEEAAA >NZ_CP016592|2092027:2098098|2093143_2093374_-|WP_013384785.1|DBSCAN-SWA MSALPTRQDDELILRALAMRVRGISLSEVGDILGVGKSTIGMATQAVFEADLRESGERAAVVDRGYRWPKHKGARR >NZ_CP016592|2092027:2098098|2095728_2095959_+|WP_013384786.1|DBSCAN-SWA MSILYAHPDGLMSAQDLRETLVRDHIRASRVTLARPRGFFDISLHRDCEVDGAALADWALRRPANGPSAPAASLAS |
14 | EBPR_podovirus(33.33%) | NA | NA |
Homologous phage analysis in the prophage region: colored bacterial proteins are those whose annotation matches a phage-related keyword (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease', 'lysin' or 'tRNA').
|
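The keyword-based highlighting described in the caption above can be reproduced from the protein headers alone. The following is a minimal sketch, not the tool's own code: it assumes the header layout seen in the records on this page (`contig|region_start:region_end|gene_start_gene_end_strand|protein_id|[keywords|]tool`), which is inferred from the data rather than taken from a documented format. The names `ProphageProtein`, `parse_header` and `is_phage_related` are hypothetical.

```python
from dataclasses import dataclass

# Keyword list copied from the page caption.
PHAGE_KEYWORDS = {
    "capsid", "head", "integrase", "plate", "tail", "fiber", "coat",
    "transposase", "portal", "terminase", "protease", "lysin", "tRNA",
}


@dataclass
class ProphageProtein:
    contig: str
    region_start: int
    region_end: int
    gene_start: int
    gene_end: int
    strand: str
    protein_id: str
    keywords: tuple
    tool: str


def parse_header(header: str) -> ProphageProtein:
    """Parse one '>'-prefixed protein header (layout is an assumption)."""
    fields = header.strip().lstrip(">").split("|")
    if len(fields) not in (5, 6):
        raise ValueError(f"unexpected header layout: {header!r}")
    contig, region, gene, protein_id = fields[:4]
    # The optional fifth field holds comma-separated keywords.
    keywords = tuple(fields[4].split(",")) if len(fields) == 6 else ()
    region_start, region_end = (int(x) for x in region.split(":"))
    gene_start, gene_end, strand = gene.split("_")
    return ProphageProtein(
        contig, region_start, region_end,
        int(gene_start), int(gene_end), strand,
        protein_id, keywords, fields[-1],
    )


def is_phage_related(protein: ProphageProtein) -> bool:
    """True if any header keyword is in the caption's keyword list."""
    return any(k in PHAGE_KEYWORDS for k in protein.keywords)
```

Under these assumptions, a header such as `>NZ_CP016592|2030689:2084819|2078616_2079144_-|WP_013384771.1|head,tail|DBSCAN-SWA` would be flagged, while headers with no keyword field would not.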
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|---|---|---|---|---|---|---|