Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP044107 | Enterobacter hormaechei strain FDAARGOS_642 chromosome, complete genome | 2 crisprs | RT,WYL,DEDDh,csa3,cas3,DinG | 0 | 6 | 3 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP044107_1 | 3601266-3601427 | Orphan |
NA
Consensus repeat of NZ_CP044107_1
|
1 spacers
spacers of NZ_CP044107_1
>1.1|3601318|58|NZ_CP044107|CRISPRCasFinder GAAACGATAAAAAGCCGGGTGGCGGCTACGCCTTACCCGGCCTACATGTTCTACATAT |
CRISPR arrays and Neighbor proteins around NZ_CP044107_1
The CRISPR arrays of NZ_CP044107_1 >merge|NZ_CP044107|1|3601266-3601427|CRISPRCasFinder TCTCCCGTAGGCCCGGTAAGCGCAGCGCCACCGGGCAACGGTTTTAACCGGAGAAACGATAAAAAGCCGGGTGGCGGCTACGCCTTACCCGGCCTACATGTTCTACATATTCTCCCGTAGGCCCGGTAAGCGCAGCGCCACCGGGCAACGGTTTTAGCCGGA >NZ_CP044107|1|1|3601266-3601427|CRISPRCasFinder TCTCCCGTAGGCCCGGTAAGCGCAGCGCCACCGGGCAACGGTTTTAACCGGA GAAACGATAAAAAGCCGGGTGGCGGCTACGCCTTACCCGGCCTACATGTTCTACATAT TCTCCCGTAGGCCCGGTAAGCGCAGCGCCACCGGGCAACGGTTTTAGCCGGA
>NZ_CP044107.1|WP_033486784.1|3599834_3601214_+|efflux-transporter-outer-membrane-subunit MTLRPIAGLLIMTILAGCQSVDVEPAKSSLHIPAQWRATSGPASPTEQLWWRNFHDSNLNRYVDQALKNNSDVLIARERINEYQARVYAADGSLFPSLDAGVTGTRARSQSAATGLPVYGTLYKGSLTASYDVDIWGVNRSTSRAAEASLEAQKAAAAAADLTVASSVASGYVTLLSLDEQLRVTRSTLKSREEAFNLAKRQFETGYSSRLELMQSDSELRATRAQVPLLQHQIAQQENALSLLLGSNPGDVARGESFDALTPLKLPSQLPSTLLNRRPDIVQAERQLIAADATLAASRASLLPSINLTATGSVQDRTLSGLLDNPLQLWSVGGSILAPLLNRQALNAQVDISQSQRNQALYSYEKTVRNAFAEVNDSLDAITRYQEQLTELLAQQTVAQETLRIAQNRYRNGYSSYLDELDAQRTLFSVQTSVVQVKNNLLLAQIDLYKALGGGWVSA >NZ_CP044107.1|WP_017693458.1|3598755_3599838_+|HlyD-family-secretion-protein MSQQDAAKEQANTRKNVRVVSIFTAAAIGIVGVLVILYAWQLPPFTRHAQFTDNAYVRGQTTFISPQVNGYITEVHVQDFAQVKKGELLLQIDDRIYRQRVHQAEAQLAMKIAALNNNLQQRKSAEAVIAKNEAALKNARAQSLKTQADLKRVKELTADGSLSIRERDSALASAAQGSADIDQAKATLEMSRQDLQTVIVNRGSLEADVENAKAALELAQIDLQNTRIVAPRDGQLGQIAVRLGAYVTAGTHLTTLVPPQHWVIANIKETQLANLRVGQPVKFTVDALNDKAYQGRVESISPATGVEFSAITPDNATGNFVKIAQRIPVRIEVLGEPEAYRLLRPGMSVQVTIDTREAKQ >NZ_CP044107.1|WP_115503847.1|3597086_3598733_+|MFS-transporter MPQRDPYAPREWQPHEKPALLGSPSTPEHPTSKRIAYGVVGLLVCLTGALGNAVVTANLQNLQGTFGAWSTEIAWLPAVYVMTNVSINLLLVKFRQQYGLRAFTEGFLVLYVLVTFFHLFVNDLSSALMVRAAHGMVAAALSSLGIYYQIQAWPAKHRLKALTIGITGSSLAIPLARLFSTELLQLDEWRGLYFFELGLALISLACVMALKLPPGDRRKVFEKKDFITFFLLAPGMALLCAVLSLGRLDWWFEAPWIGWALALSLVLIVSAIVFEHNRNNPLLNTRWLSSGSIVRLGLIMLLIRIVLAEQNTGVIGWLQYVGLQNEQMTHLAWAIFAGIVCGIVTSCLTIKPTKLAWPIITSLALMIIASLLDSQSNNLTRPDQLIFSQFLLGFGSAFFLAPAMLAAIGGVIADPRNLVSFSVMFGMSQNLGGLLGSAILGTFQTWREKYHSSLLADQLTTLNPLVNERIQLYTQMYKSLVGDSSLLGTQAITQLQSVTTLEANILAYNDTYLLTASIATATLVWILWRLLRLRITARMALKNATGNK >NZ_CP044107.1|WP_017384495.1|3596070_3596865_-|YdcF-family-protein MTMALFPCLPGTTLDAVNTVGAWLAQDDYQDNQPVDLVILAGNAVIPAIDAACKIAAEQGVPLIISGGIGHSTTFLYAAIAKHPRYNRIPTTGRAEAAILADIAREFWNIPAEHLHVEDQSTNCGENARFSRALMKQSGLNAARVLVVQDPTMQRRTMATFARVCRDEAASPAWVSHPGLTPVLQNSDDGLVFSGPVEGLWPVERYLSLVLGEFPRLRDDINGYGPAGRDFIAHVDIPADVDAAWQILRNDVILTDALVSRSLL >NZ_CP044107.1|WP_047637364.1|3594432_3595872_-|aldehyde-dehydrogenase MTVPVQHPMYIDGQFVAWQGDAWIEVINPATEEVISRIPDGSAEDARKAIDAADRAQAGWEALPAIERASWLRKISAGIRERVSEISALIVAEGGKIQQLAEVEVNFTADYIDYMAEWARRYEGEIIQSDRPGENILVFKRALGVTIGILPWNFPFFLIARKLAPALLTGNTIVIKPSEFTPNNAIAFAKIVDEIGLPKGVFNLVLGRGETVGQELAGNPKVAMVSMTGSVGAGEKIMAAAAKNITKVGLELGGKAPAIVMDDADLELAVKAIVDSRVINTGQVCNCAERVYVQKGIYDRFVNRLGEAMKAVQFGNPAERTDIAMGPLINAAALERVEQKVARAVQEGAKVVLGGKAAEGKGYFYPPTLLLDVRQDMAIMHEETFGPVLPVVAFDTLEEALNMANDSDYGLTSSVYTQDLNVAMKAIKGLKFGETYINRENFEAMQGFHAGWRKSGIGGADGKHGLNEYLQTQVVYLQS >NZ_CP044107.1|WP_047637363.1|3593814_3594345_-|cytochrome-b561 MRTKYTGLQISIHWLVFLLVIMAYCAMEFRGWFPRTDRPLINMIHVSCGISILVLMVARLLIRLKFPAPPIQPKPKAMITGLSHLGHLVIYLLFITLPLIGMVMMYNRGNDWFAFGLTMPHAAEGNFDLVDTLKEWHVTLANLGYFVIGLHAFAALMHHYFWKDNTLLRMMPKKRQ >NZ_CP044107.1|WP_032669925.1|3592705_3593773_+|oxidoreductase MNKVKTMNIALIGYGFVGKTFHAPLIQSVEGLKLAVVSSRDEEKVKRDLPDVLVVATPEEAIQHPDIDLVVIASPNATHAPLATLALNAGKHVVVDKPFTLDMQEARDLIALAQEKQRLLSVFHNRRWDSDFLGIKQVIEQGRIGKVKHFESHIDRFRPEVRVRWREQNVPGSGLWFDLGPHMIDQTLQLFGLPQSVQGNIATLRDGAEINDWAHVVLNYPEHKVVLHCSMLVAGGVARFTVHGDKASVVKAHIDQQEAQLLAGVVPGSESWGEDSDAMVLFNAQGEASAIPAPKGDQRQYYINVRDALNGKIDNPVHPVEALAVMAVLEAAVKSSETGSTQELDLTAQERAQLQ >NZ_CP044107.1|WP_032669924.1|3591948_3592692_+|DeoR/GlpR-transcriptional-regulator MHKTARQKYVLDIITEQGQASITELAERLQVSADTIRRDLTDLEKQGLAQKNHGGAIALNLSTMTRVSRNSLLPEIKQRLGKQVAQCVPAGSTLFLDAGSTLLAVASFLKGPLTIITPSLDIAQQVSDREDIDLILLGGKWDQKQRLFAGSATLSLLSRYRADIAILGACAIHAELGLSASQEADAEVKRAMLAASQAHWVVADHLKLNQCEPYLVSGLSEIHQLFLDRPWAELGDHSALQVTVGAH >NZ_CP044107.1|WP_047733386.1|3589878_3591216_-|TolC-family-outer-membrane-protein MKMKCNNRLLRLSVSLTLISLVVTAANANNGQAGISPVAAMTMKESILFALDRDPSVSQQAAQLGIGQAQIDEARSGWMPQIALNGSTGHSQTTDSSGSLRNSAAWGLSLTQLVYDFGKTNNSIRQSSAQRDSYRYQLMSTMSAVAEKTALSYVEVKRYSDLLQAAKENVQALKNVEQLAKLRADAGVSSTSDELQTRTRIAGMQATVEQYNASLNSARARLAVLTGIQAERYSPVPGGLAVEPDSLNRIDYSLIPTVMAAQNMERSAQYGVETARSQHWPTLSLKGGRTRYESDNRAYWDDQIQLNIDAPLYQGGAVSARVRQAEGARAMASSQVDQARFDVLQKASVAQADWTGARGRMEAGKRQLENALRARDVYKNEYTLSKRSINDLLSVEQDVWSATSAKIMAEYDGWSAAINYASAVDNLMPLIGIEKNAAAKLPDLS >NZ_CP044107.1|WP_150391149.1|3571750_3589756_-|BapA-prefix-like-domain-containing-protein MSTAKVVDVIIRKTAEKTKLTGEGNLSVSISSPSVIEIQGSAQDVVRYVRQGNDLLIYMKYGSVIRCNNYFVEDTETHNHSELVFNDNQELTHISFADAGEASGVAATELTAQAAPISSIEPFLEQGSVLSDAPWGWIAGAALGGGAIGALLAHGGDGETKTRVIDNTKEVESATPTFLLTDNAGDKQGVLSAKEVTDDNTPTFSGTGQPGATIQVKDGNGSTIASTMVAKDGTWTVTLPTQADGEHTWSVVQIDGSKTTSAGSITVTVSTADTSVTLATTAGDNVINASEQAAGFTLSGTSKNLAQGTALTVTLNGKTYTAEVGANGAWSVKVPAADAQALGDGTWTVNVSGKDAAGNTVSGSQTIGVDTASPVISVDTIAQDNIINAAEHNQPLTLTGKTDAEAGQIVTVTLNGKNHTATVGSDGSWSVTLPASEVQALANGEHTLTVNVSDKAGNGSSTTADFTVDTAAPVVTINTVAGDDILNTSEQGQAQIISGQANGAAEGDIVTVTVGGKTFTGAVQADGSWSVGVPASVIGALGEGSHSISVSVTDAAGNTGSATHGITLSGNPPEFTLDPISQDNVLNAQEAMQPLSLSGTSNLPNGSAVTVTLNNVNYQATVENGRWSVQVPVSDVLDLANTLYTVSVSGTDSVGNSGSAEANLLVDTVLPQVIVNTFAGDNLVNNAEAAVDQTLSGRVTGAAAGDTVSVTVGGKSYTATVGSDLKWSVTIPSADLQAFGDGDLTFSASVTNAHGNTGTGERDININAELPGLRVNTISGDDVINAIEQQQDLAVTGSSTHLAEGTQITVTINNVEYVTTVNASGNWQIGVPAADLQAWTAGGMTVSVSAEDAWGNTVAAEHPIELDLNAVAVTIDTVTTDDMLNAAEKGADVTLSGQTQGVEAGQTVVVKFADQTFTAQVQQDGSWRLTVPASAMETLIDGRAQVSVSVTNVNGNSADASRVVIVDTQPPAITLDNLTDDNIINAAEAQQDLVLSGSTTAEAGQTVTVTLNGKSYQTTVQADGRWQLNVPAADVGALTDGNVTVTATVSDVAGNSSSADRVGLVDATVPQVIINDFVTDTNTVNQLAHAQAQILSGSVTGAAAGDLVTITINNVDYTTVVDAAGNWSLGLPASVVQGLTDGTWTINVSVTDQSGNTGSSSVDVVVNTVTPIIGINTLAADDVINAAEKGEDLLLSGTSNQPEGTTITVNLNGINYTATTDASGNWSVTVPASAVSALGEANYTVTASVTDNVGNSAAATHDVLVDSSLPVVTINTLAGDNIVNAAEVAAGQTLTGKVSNAASGDTVTIILGGQTYTATVQDDLTWSLPLTQSQLTALGNGDLTVSASVTNAHGNTGSFSLDVTIDAQLPGLRIDTVAGDDVINVIEHAQNLVISGTSTDLAAGSTVTVTINGKSYSASVLADGTWQAAVPAADVSRWADGSLTISASAQDTSGNPVNIGTVVDVDLAPVAISINSVTDDNVLNAAEKGQDLVLSGSSSNVEAGQTVTIIFAGKTWTTTVDANGDWTCTVPAADLSGLKDGDASVQVSVTNVNGNAASSSQAFSVDTAAPAVTINTISGDNMLNAAEAAQDLTLSGTSTAEAGQTVTVTFNGNQYTAQVQANGSWTLDVPAADLAGIADGSAAVTVTVSDKAGNPASAGASVLVDTTVPQITFNIVAGDDIVNIAEHGQALIVTGKVTGAQAGDVITLSLNGKDYTAMLDASGNWSVGIPATDVGALANGDQTISATLTDKAGNSTSATHAFDVSLTAPVIAINTLAVDDVINATEKGQDLLISGTSNQPDGTRISVTLNGISYAATTDASGNWSVTVPAANVSVLGEASYSVTASVTDTAGNSANTSHSVLVDSALPQVTINAVATDDVINAAEVASGQTMSGKVSGAASGDTVTIGIGGNTYTATVQDDLSWSVNVASDVLTAIGNGDLTVTASVTNGHGNTGTGERDITIDASLPGLRVDTVAGDDVINSIEHGQNLIITGSSDGLASGSALTVTVNGKTYAATVLADGTWTAAIPAADVGALSAGTITVTVDGQSAAGNPVSISHDVKVDLAAVAISINPIASDDVINAAEKGADLVLSGSTTNVEENQTVTITFGGKLYTATVDASGNWTATVPSADLGGLKDGDASVQVSVTNVNGNSASAGREYSVDATAPTVSIEIVSDNNIINAAEAQQDLVINGVSNAEAGQTVTVTLNGVDYTTTVQANGSWSVTVPSADIGAITDGDYTITAAVADKAGNPASADRDVLVDTTVPQLTINTVSDDDVINSAEHAQALIVTGSVTGAAAGDVVTVTINNKDYTATLDTSGRWSVGVPAADVSALAAGDYTITAALTDKAGNSNSTTHEVEVNLTAPVLTIDTVSGDDVINSSEKTQDLTITGTASGLAAGAVVTVMLNGKAYSATVDTNGQWTTTVPASEVGQLGEALYTVSASATDSVGNSSSTSHTVNVESVLPGVIINTVAGDDVINAAELATGQTISGTVVNAEAGNTVTVSVGGHSYTATVQDNLTWSVSVPESVLAALGNGDLTVTASVTNGVGNSGSGERDITIDANLPGLRVDTVAGDDVINSIEHGQNLIITGSSDGLTAGTALTVTVNGKTYPATVLADGTWSAAIPSADVSALAAGTVTVNVEGQSSAGNPVTINHDVTVDLANVAISIDAIASDDVINAAERGADLVLTGTTSNVEENQIVTITFGGKNYTATVDAEGKWTATVPSADLTGLKDGDASVQVSVTNVNGNSASAGREYSVDATAPSVTINTIATDDILNASEAQSDLAISGTSTAEAGQTVTVSLNGKDYTTTVSANGSWTLNVPAADLAGLTDGSVTMTAAVSDKAGNPASVDHTLTVDVTVPAVTIHTVAGDDVINVAEHNQAQIISGSATGAAAGDKVTVTIGGQTYTTVLDAAGNWSVGVPASVISGLSDGSVTVTASVTDAAGNTGSGTHNVTVDTGLPSVSFNAISDDNVLNAVEKGQDLSVSGTSANLAEGTVVTVTLNGKNYTATTAADGTWSLTVPAADLAGLGQASYTLNATATNGVGNSVSSSANLLVDTALPTVTINTVAGDNVINAAEVAAGQTLSGTVANAEAGNTVTVTIGGHSYTATVQNNLSWSVNVPSDVLTALGNGSLSVTATVTNGHGNTGTGEREIAIDANLPGLRVNTVAGDDVVNTIEHAQNLVVSGSSDGLTAGTALTVTVNGKDYAATVLADGTWSAAIPSADVSAWPEGTVKISVTGDSAAGNPITISHDVTVDLATVAISINALATDDVINAAEKGADLVLSGVTTNVEAGQTVTISLNGRIYTTTVDDSGNWTYTVPSADLAGLKDGDASVQVSVTNVNGNSASAGREYSVDATAPSVTINTIATDDILNATEAQSDLAISGTSTAEAGQTLTVSLNGKDYTTTVSANGSWTLNVPAVDLAGLTDGSVTVTASVSDKAGNPASVDHTLTVDVTVPAVTIHTVAGDDVINVAEHNQAQIVSGSATGAAAGDTVTVTIGGQSYTTVLDAAGNWSVGVPANVISGLSDGSVTVTASVTDAAGNTGSGTHNVTVDTGLPSVSFNAISDDNVLNAVEKGQDLSVNGTSANLAEGTVVTVTLNGKNYTATTAADGTWSLTVPAADLSGLGEASYTLSATATNGVGNSISTTANLLVDTALPTVTINTVAGDNVINAAEVAAGQTISGKVANAEAGNTVTVTIGGNSYTATVQSDLTWSVNVPETVLTALGNGELTVSATVTNGHGNTGAGEREIVIDASLPGLRVDTVAGDDVINSIEHGQNLIVTGSSDGLAAGTTLTVTVNGKTYAASVLAEGTWSAAIPAADVGALAAGTVTVTVAGQSAAGNPVTISHDVTVDLAAVAISIDAIATDDVINAAEKGADLVLSGSTSNVEENQTVTITFGGKSYTVKVDADGNWTATVPSSDLAGLKDGDASVQVSVTNVNGNSASAGREYSVDATAPTVTIDTVAGDNVINGSEAAAGVAISGTTTAEVGQTVTVNLGGNSYTAQVQQGGVWSINVPAADLSTLADNGYTVQVSVSDAAGNPGSAGKAITLDTTPPTVSFNVVAGDDVINSVEHGQAQVVSGTATGASVGDKLVITIGSNQYTTTVDASGKWSVGVPASDISALTDGTVTLSATITDSAGNSSTQTHDVVVNTASVALTVNTLSGDDVINAAEAGASLVINGSSAQFASGTQVTITLNGKSYTATIQSDGSWTTTVPAADVGTLADGASYQVSVSAQDSAGNSASATHTISVDTTAPVISVNTLSGDDVLNAAEAQQPLTVHGSSSAEAGQTVTVTLGGKTYTALVANDGTWTLDVPAADLANLSEGALTVTASVNDKAGNNGQTTHTLTVDTVAPAVTISTVADDDIVNDAEQLAGQTISGTTTAEQGQTVTVSFNGHSYQATVAANGSWSVFVPGRDFLGLSDGDYTITATVSDKAGNPGSATHDVTLNGDVPTIAINTFAQDDIVNAAEHGTPLVISGTTDAPTGQTVTITLNGKTYTATVQNDGTWSYTVGSADVTALADGGSYVINAQVSNAIGNSASDNHTVIVDLTAPSMGISIDSLHNDTGLSANDFITNDSQVVVNGSLTAQLGNNEKAQISLDGGTTWIDLTVTGTTWRYTDGRTLTDGTYQYQVRVIDNAGNVGATDSQDVVIDLTKPAAATITVDSVSQDTGLSDSDFITSDNQISLKGTLGAALGSGDHAQISLDGGATWTDVSVSGLSWTYVDGRTLADGDYNYQLRVIDDAGNISATTSQVVTIDTVAPDASKTIAIDSISDDTGLSSSDFITNDTSLTLHGSLGATLADGEYAQISIDGGVTWQDVIVTGNSWYYVDGRTLGNQTYDYYVRVVDAAGNVGASAHQQVTVDTVAPDAAITVTVDNITVDTGFDNNDFLTSSTSYTLNGTLGAELGAGEYVQVSMDGGTTWVYATVSGTRWSYNDTRTLADGDYRYQVRVVDQAGNVGATTTQDVTVDTQAPQYGITIDSISEDTGQSGSDFITMDTSLTINGSLGSALASDERVQISLDGGNTWIDTTVTNQRWSYTDSRDLADGDYTYQVRIIDQAGNVGSTSSQVVTVDTTPPDTVGTVVSYTDGEGERTGTYGASVATDDTSPLINGTLNRAPEDGEIVQLYRDGILLGQVTMNGSASWSYQDNGLLDGNHTYILRVTDKAGNYTESDGFVLNVDTSIPTTTAAITAQTTSDTTPIVSGTVSADLVNGEYLVVTVNGKTYTSQTGGAVVVDPDHNTWYLQIPDSDALSVASYDVTAQVKSSAGNGNTTGTATGSLVIDTTSVNTDWATTAGNSNNSTMTLGMNSSGLWNIIANGQSYSSSDDSTYAGNTLTNTRSYYVVSQTAADFDRNGTQDIFATENTYAGSTQVMWTYDGSSYTASQLAMGTTIWYGGVIAYDKTGDGYLDLAYGDAGMDSLTYLVNTNGVPSPDGTGGEGGFYGQFDSGREISGVDLNNDGTVDIVQHTNRSGAYSLTVINNNGNGTLSIGQNLTNVFVANASNTTTAASMTWADFNGDGYMDLYLGSSYNNNGGVIYYNDGTGQLSTTKSAVEASNATAGYLSVAVDWNGDGQMDIIKLSTYGSSQTATLFTNNGYGSTWTSSQLASGLANVTGVAAVDYNWDGAQDLLVSQQNGKVVLVQNNAEIADGTAMHLHIVDSEGINAYYGNTVNLYNAAGVLVASQIINAQSGIGSNDTSALVSFYGLDPNETYSAEIVKITNGVSDNVTWTGLDAGNGKEGYVLTAEAATGGHSGTITGTGYNDTFIAEDGTYTYNGSGGWNTHSDYDTWSNTGGMDVVDYRNATSGITVDLRLSTAQDTGFGTTRLLNIEGINGSDYDDVITGNSGDNQFEGRGGNDTFNIGSGGHDTLLYKLINASDATGGNGSDVVNGFTVGTWEGTADTDRIDLRDLLSDSGYTGTGSASYVNGVATLDSSAGNIADYIRVVQNGSNTEIQVDLDGTGGQFTPTTLVTLNGVQTDLATLLANHQLLIA >NZ_CP044107.1|WP_015570577.1|3605524_3606130_+|FMN-dependent-NADH-azoreductase MSKVLVLKSSILAGYSQSGQLSDYFVEQWREQHSADEITVRDLAANPIPVLDGELVGALRPSDAPLTPRQQEALALSDELIAELQAHDVIVINAPMYNFNIPTQLKNYFDLVARAGVTFRYTENGPEGLVKGKRAVVLTSRGGIHKDTPTDLVAPYLTLFLGFIGITDVNFVFAEGIAYGPEVATKAQTDAKAAIDSLVAA >NZ_CP044107.1|WP_063135140.1|3606220_3606907_+|RluA-family-pseudouridine-synthase MSVIIDTFIAPPCHDDIEILWQDEHLLLINKPSGLLSLSGKNPQNLDSVHHRLVQTFPGCTLVHRLDFGTSGLMVIARNKAINAALCHQFSQRAVNKVYTALLCGHVEQDEGTVDAPIAKDPALFPLMTICARTGKPARSRYRVVERIYQDTTMPLTRVELTPETGRTHQLRIHCQRLGHPILGCDLYGGLEWPGAEETPRLMLHASALNFIHPLSGETINARHAAPF >NZ_CP044107.1|WP_017384500.1|3606914_3607454_-|DUF2058-domain-containing-protein MTKLTLQEQMLKAGLVSSKKMAKVQRTAKKSRVQAREAREAVEENKKAQLERDKQLSEQQKQAVLAKEFRAQVKQLIEMNRITVAKGNITFNFTDGNLIKKIEVDKQTQTQLINGRLAIARLVINANGDCDYAIIPAVVADKIAQRDADSIVLNSALSQEEQDEDDPYADFKIPDDLMW >NZ_CP044107.1|WP_023296571.1|3607622_3608597_+|DUF1852-domain-containing-protein MSQAFTFTLKRSCFDENYNPSENTRTTTNFANLARGEKRQENLRNTLVMINNRFNALASWDNPKADRYAVELEIISVDMNIGGDFTFPAIEILQTTIVDKKTHERIEGIVGNNFSSYVRDYDFSVLLLEHNKDRARFSLPENFGELHGNIFKSFVHSAEYQANFKKAPVICLSVSSKDTYRRTGNHHPVLGYEYQPDGESLTEQYFAKMGLKVRYFMPENSVAPFAFFFTGDLLRDYTNLELIGTISTMETFQKIYRPEIYNANSAAGQCYQPDLNQQDHSLTKIVYDREERSRLAIEQGKYTEERFIKPYKTLLEQWSQHFTL >NZ_CP044107.1|WP_150326207.1|3608622_3609651_+|methionine-synthase MKTLLPTSTAGSLPKPTWLAQPETLWSPWKLQDEELLAGKQDALRLSLDEQIRAGIDIVSDGEQTRQHFVTTFIEHLRGVDFENRQTVRIRNRYDASVPTVVDAVARQKPVFVDDAKYLRQLTDKPIKWALPGPMTMIDTLYDAHYKSREKLAWEFAKILNQEARELEAAGVDIIQFDEPAFNVFFDEVNDWGIAALERAIEGLKCETAVHICYGYGIKANTDWKKTLGSEWRQYEEAFPKLQTSKIDIISLECHNSRVPMDLLELIRGKKVMVGAIDVATQTIETPEEVADTLRKALQFVDADKLYPSTNCGMAPLSRQVANGKLKALSAGADIIRRELAR >NZ_CP044107.1|WP_033486781.1|3609654_3610587_-|LysR-family-transcriptional-regulator MMGAGHISIRALLIFIDVYETQNFSVVARREGISASQVSRVIHQLEDALGQQLFYRNTRAIMPTESGHLFVRYARAMAGNMEDARRELDERAREPSGTLRINGPVFFGQRHIAPGLPGLLARYPRLSIELTLTDDFIDPHRDAADVIFRIGALTDSSFHARVFGQQFYHLAASPDYLQKHGAPEGPDDLSRHHCLVYRGSSGPNRWLIRRPGEAWVHYPIVPLMTSNNAETLLIAALGGMGVVLFPDWMVSERLKSGELVALLPEMECSINTEPLTIAAIYPNARHPPLNVRAVIDYYIERFGTPLYWQT >NZ_CP044107.1|WP_003857304.1|3610685_3611120_+|DMT-family-transporter MHIILILLVIAGGMGLSVEAGLLGPLGAEVGDLWAAFSIFSVGTGLTFLLMLFFSPRNSPSFFAQPSWHLLGGVLGPVYVIILTIATPAIGIAMTMIGILAGQVFKSLIIDHYGLLGTPHRRIDTKRIIALGFIIAALILVAQG >NZ_CP044107.1|WP_003857305.1|3611124_3611583_+|DMT-family-transporter MTVIMIILAVIGGATLSIQAAINGQLGSSVGVFKSAFLTFSVGALVTALLIFFFEPKQAVSLMDVPKWQLLGALCGVPYIVIMVLAVQRIGTAVATVAVILGQLAMSMLIDNFGWLNNEAIPFSVSRFGAVVCLSIALFFIYSSSKPQPEED >NZ_CP044107.1|WP_111962054.1|3611579_3612113_-|DUF3833-domain-containing-protein MKSFLLMALALTMLVAGCSTEVTEYRQQQPRLDIFTYFQGKTEAWGMVQDRSGKQIRRFHVEIAGDVIGDTLTLNEHFVYDDGEKQQRVWHIRRVGQNRYEGTAGDIEGVATGQAAGNALNWRYSMNVKADGKTWLLHFDDWMYLQDSTRLFNKTEMKKFGVTVATVTLFFTRKEGG >NZ_CP044107.1|WP_045141532.1|3612591_3613812_-|class-I-SAM-dependent-methyltransferase MTNPVFALEPDIPRNVRVARWLLFRLLNGLHGGSLTLREGAQTFQFGDASAALHAEVQVLAPGVYWRILTGGSLAAAQAWMDGDWETPHLTPLLELIARNSQILGKLEKGFRLLGKPVERLRHWMRRNSRAQARENIAAHYDLGNAFYAHFLDEHLLYSSALFSGDEQDLTAAQQAKMARLCDQLALTANDHLLEIGTGWGAMAEYAARHYGCRVTTTTLSQEQYHWATARIARAGLQDRVEVLLCDYRDLTGVYDKLVSVEMIEAVGQRYLPTFFRTCQARLRPGGRMAIQAITIQDQRYRDYSKSVDFIQRYIFPGGFLPSITAMNELMTRHTDFVVRNLFDMGPDYARTLAHWRQRFVHAWQEIEKLGFDDRFRRMWLYYLGYCEAGFNARTISVVQLTAERV |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP044107_2 | 4182764-4182972 | Orphan |
I-F
Consensus repeat of NZ_CP044107_2
|
3 spacers
spacers of NZ_CP044107_2
>2.1|4182793|31|NZ_CP044107|CRT CGTGACTAAAGGCATGAGCAAATCAGGCAAG >2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder AAAACACGCTGGCGCGTGTCGGTGTCGCCGT >2.3|4182913|31|NZ_CP044107|CRT,CRISPRCasFinder GAGGGAGAAAGCCAGGAAGTCTGGCAGTTAG >2.4|4182853|29|NZ_CP044107|PILER-CR AAAACACGCTGGCGCGTGTCGGTGTCGCC >2.5|4182913|29|NZ_CP044107|PILER-CR GAGGGAGAAAGCCAGGAAGTCTGGCAGTT |
CRISPR arrays and Neighbor proteins around NZ_CP044107_2
The CRISPR arrays of NZ_CP044107_2 >merge|NZ_CP044107|2|4182764-4182972|CRT,PILER-CR,CRISPRCasFinder TTGTGCCTACGGCCTGTACGGCAGTGAACCGTGACTAAAGGCATGAGCAAATCAGGCAAGATTTCTAAGCTGCCTGCCCGGCAGTGAACAAAACACGCTGGCGCGTGTCGGTGTCGCCGTATTTCTAAGCTGCCTGCCCGGCAGTGAACGAGGGAGAAAGCCAGGAAGTCTGGCAGTTAGATTTCTAAGCTGCCTGCCCGGCAGTGAAC >NZ_CP044107|2|1|4182764-4182972|CRT TTGTGCCTACGGCCTGTACGGCAGTGAAC CGTGACTAAAGGCATGAGCAAATCAGGCAAG ATTTCTAAGCTGCCTGCCCGGCAGTGAAC AAAACACGCTGGCGCGTGTCGGTGTCGCCGT ATTTCTAAGCTGCCTGCCCGGCAGTGAAC GAGGGAGAAAGCCAGGAAGTCTGGCAGTTAG ATTTCTAAGCTGCCTGCCCGGCAGTGAAC >NZ_CP044107|2|1|4182822-4182972|PILER-CR AGATTTCTAAGCTGCCTGCCCGGCAGTGAAC AAAACACGCTGGCGCGTGTCGGTGTCGCC GTATTTCTAAGCTGCCTGCCCGGCAGTGAAC GAGGGAGAAAGCCAGGAAGTCTGGCAGTT AGATTTCTAAGCTGCCTGCCCGGCAGTGAAC >NZ_CP044107|2|2|4182824-4182972|CRISPRCasFinder ATTTCTAAGCTGCCTGCCCGGCAGTGAAC AAAACACGCTGGCGCGTGTCGGTGTCGCCGT ATTTCTAAGCTGCCTGCCCGGCAGTGAAC GAGGGAGAAAGCCAGGAAGTCTGGCAGTTAG ATTTCTAAGCTGCCTGCCCGGCAGTGAAC
>NZ_CP044107.1|WP_150391217.1|4181346_4182315_+|NADH-oxidoreductase MTMPTSQCPWRMQVHHIHQETPDVWTLSLLCHDYYPYRAGQYALVSVRNSADTLRAYTISSTPGVSEYITLTVRRIDDGAGSEWLTRDVKRGDYIWLSDAQGEFTCDDKTEDKFLLLAAGCGVTPIMSMRRWLAKYRPQADVQVIFSVRSPEDVIFAEEWRNYPVTLVAEHNATHGFVAGRLSRELLQSVPDIANRIVMTCGPAPYMEIVEKEVKALGVTRFFKEQFFTPVAEAATSGMKFTKLQPAQTFFGRVGTTLLEALESNNVPVAAACRAGVCGYCKTKVVSGEYTVTSTMTLTDAEIAEGYVLACSCHPQGDLVLA >NZ_CP044107.1|WP_017384847.1|4179683_4181336_+|hydroxylamine-reductase MFCVQCEQTIRTPAGNGCSYAQGMCGKTAETSDLQDLLIAALQGLSAWAFKAREYGIVDHYVDSFAPRAFFSTLTNVNFDSPRIVGYAREAIALREALKAQCLNADASARVDNPMAELQLVSDDLGELQRQAAEFTPNKDKAAIGENILGLRLLCLYGLKGAAAYMEHAHVLGQYDNAIYAQYHKIMAWLGTWPSDMNALLECSMEIGQMNFKVMSILDAGETSTYGHPTPTQVNVKATEGKCILISGHDLKDLYNLLKQTEGTGVNVYTHGEMLPAHGYPELRKFKHLIGNYGSGWQNQQVEFARFPGPIVMTSNCIIDPTVGAYDDRIWTRSIVGWPGVSHLEGDDFGPVIAQAQQMAGFPYSEIPHLITVGFGRETLLGAADSLIDLVSREKLRHIFLIGGCDGARGERNYFTDFATRVPEDCLILTLACGKYRFNKLDFGNIEGLPRLVDAGQCNDAYSAIILAVTLAEKLGCGVNDLPLSLVLSWFEQKAIVILLTLLSLGVTNIVTGPTAPGFLTPDLLAILNEKFGLRSVTNVEDDMKQLLSA >NZ_CP044107.1|WP_015570988.1|4178639_4179539_+|lysine-exporter-LysO-family-protein MFSGLLIILLPLIVGYLIPLRHESALKLINRFLSWIVYVILFFMGISLAFLDNLATNLLSILHYSAVTVVVILLCNIAALFWLERTIPWKNHHHQEKLPSRIAMALESLKLCGVVVLGFLLGLTGWAFLQHATEASEYTLIFLLFLIGIQLRNNGMTLKQIVLNRRGMMVAVIVVASSLAGGVINAFILDLPLKTSLAMASGFGWYSLSGILLTESFGPVIGSAAFFNDLARELIAIMLIPGLVRRSRSTALGLCGATSMDFTLPVLQRSGGLEMVPAAIVHGFILSLLVPILMAFFSA >NZ_CP044107.1|WP_026080729.1|4177767_4178463_+|aquaporin-Z MFRKLAAECFGTFWLVFGGCGSAVLAAAFPELGIGFVGVALAFGLTVLTMAFAVGHISGGHFNPAVTLGLWAGGRFPAKDIIGYIIAQVIGGIIAAAVLYVIASGKAGFDAAASGFASNGFGEHSPGGYSMLSAIVIEIVLTAGFLLVIHGATDKYAPAGFAPIAIGLALTLIHLISIPVTNTSVNPARSTAVAIFQGGWALEQLWLFWVMPIIGGILGGVLYRTLLEKRD >NZ_CP044107.1|WP_017693708.1|4175817_4177476_-|ATP-dependent-endonuclease MLLERVEIVGFRGINRLSLQLEQNNVLIGENAWGKSSLLDALTLLLSPEENLYHFVHDDFWFPPGDVNGREKHLHIILTFRESEPGRHRVRRFRPMSPCWVPCEDGFQRIFYRLEGEMAQNDGVLTLREFLDEKGNPIPLDNIDELARHLIRLSPVLRLRDARFMRRIRNGTVPNMPEVEVTARELDFLARELVSRPQNLTDGQIRQGLSAMVQLLEHYFSEQGTGQARHRLMRRRSHDEQRSWRYLDIINRMIDRPGGRTHRVILLGLFSTLLQAKGTVRLDRDARPLLLVEDPETRLHPIMLSVAWHLLNLLPLQRVTTTNSGELLSLTPVEYVCRLVRESSRVTAYRLGPGGLNAEDGRRIAFHIRFNRASSLFARCWLLVEGETETWVINELARQCGHHFDAEGIKVIEFAQSGLKPLIKFARRMGIEWHVLVDGDEAGKKYAATVRSLLNNDREEEREHLTALPAMDMEHFMYRQGFDDVFHRVAMIPVDVPMNMRRVIAKAIHRSSKPDLAIEVATEAGRRGVESVPTLLRKMFSRVLWLARGRAD >NZ_CP044107.1|WP_017693707.1|4174864_4175821_+|DUF535-domain-containing-protein MSSIVDTPYSTLPQPKSGWQLFKSLASGSLTPGLAWQNPAYRRKFMLRSLATPFTTARLLGNLAKQPRLMQILRVQPGLPCRLHRPWLTVNMGRQTTLDALNDHYEMMSRHLPASLLNGYLSSQGITLVTLTGKEEQQFSVRLSADAFLDKEGEATLTFCDHQNTVLAELTFTLCTYQGKPTLFIGGMQGAKAHVPHEHIQLATKACHGLFPKRLLVEAVMTLAGAFPVEQILAVSNATHIYRSWRYRKKKEGKLLADYDSFWRSLGGQQQDNGNFALPLTMPRKPMEEIASKKRSEYRRRYALLDSLIQQVSQATAR >NZ_CP044107.1|WP_017384842.1|4173589_4174705_-|macrolide-transporter-subunit-MacA MNLKGKRRKLFLLLAVVVLAGGFWLWKVLNAPVPQYQTLIVRPGELQQNVLATGKLDALRKVDVGAQVSGQLKTLSVEIGDKVKKGQLLGVIDPEQAQNQIREVEATLMELRAQRAQAQAERNLAQVTLTRQQALAKTQAISKQDLDTAATELAVKQAQIGTIDAQIKRNQASLDTAKTNLDYTQIVAPMAGEVTQITTLQGQTVIAAQQAPNILTLADMSTMLVKAQVSEADVIHLKPGQKAWFTVLGDPQTRYEGVLKDILPTPEKVNDAIFYYARFEVPNPQGVLRLDMTAQVHIQLTGVKNVLTVPLSALGESAGDNRYKVKVLRNGETREREVVIGARNDTDVVVVKGLEEGEEVVTSETLPGAAQ >NZ_CP044107.1|WP_150391216.1|4171652_4173593_-|macrolide-ABC-transporter-ATP-binding-protein/permease-MacB MTALLELNDIRRNYPSGDGPVEVLKGISLRVEAGEMVAIVGASGSGKSTLMNILGCLDKPTSGTYHVAGTDVSTLDGDALAKLRREHFGFIFQRYHLLSHLSAAQNVEVPAVYAGVERKKRLERAKALLTRLGLAERVDYQPSQLSGGQQQRVSIARALMNGGQVILADEPTGALDSRSGEEVMAILHQLRDQGHTVIIVTHDPQVAAQAERIIEIHDGELVSNPPPRQSRAAAPKEALPASTGWGQFSSGFREALTMAWLAMAANKMRTLLTMLGIIIGIASVVSIVVVGDAAKQLVLADIRAIGTNTIDVYPGKDFGDDEPQYQQALKYDDLAAIQKQPWVNSATPAVSQNLRLRYGNIDVAASANGVSGDYFNVYGMTFSEGATFNAEQLAGRAQVVVLDANSRRQLFPNKTRVVGEVILVGNMPATVIGVAEEKQSMFGSSKILRVWLPYSTISGRIMGQSWLNSITVRVKEGYDSALAEQQLERLLTLRHGKKDFFTWNMDGLLKTAEKTTRTLQLFLTLVAVISLVVGGIGVMNIMLVSVTERTREIGIRMAVGARASDVLQQFLIEAVLVCLVGGAMGIALSMMIAFALQLFLPGWEIGFSPVAILTAFLCSTFTGILFGWLPARNAARLDPVDALARE >NZ_CP044107.1|WP_006809408.1|4171360_4171582_+|cold-shock-like-protein-CspD MEMGTVKWFNNAKGFGFICPEGGGEDIFAHYSTIQMDGYRTLKAGQSVRFDVHQGPKGNHASLIVPVEAETVA >NZ_CP044107.1|WP_006174393.1|4170772_4171093_-|ATP-dependent-Clp-protease-adapter-ClpS MGKTNDWLDFDQLAEDKVRDALKPPSMYKVMLMNDDYTPMEFVIDVLQKFFSYDVERATQLMLTVHYRGKAICGIFTAEVAETKVAMVNDYARENEHPLLCTLEKA >NZ_CP044107.1|WP_032653119.1|4183251_4184970_+|ubiquinone-dependent-pyruvate-dehydrogenase MKQTVAAYIAKTLEQAGVKRIWGVTGDSLNGLSDSLNKMKTIEWMPTRHEEVAAFAAGAEAQLTGELAVCAGSCGPGNLHLINGLFDCHRNHVPVLAIAAHIPSSEIGSGYFQETHPQELFRECSHYCELVSSPEQIPQVLAIAMRKAILNRGVSVVVLPGDVALKAAPETATTHWYSAPQPTITPADEELKKLAQLLRYSSNIALMCGSGCAGAHKELVEFAGKLKAPVVHALRGKEHVEYDNPYDVGMTGLIGFSSGFHTMMNADTLILLGTQFPYRAFYPTDAKIIQIDINPGSIGAHSKVDMALIGDIKSTLAALLPLLEEKTDRKFLDKALSDYRDARKGLDDLAKPSEKAIHPQYLAQQISHFADDDAIFTCDVGTPTVWAARYLKMNGKRRLLGSFNHGSMANAMPQALGAKATAPERQVVAMCGDGGFSMLMGDFLSVAQMKLPLKIVVFNNSVLGFVAMEMKAGGYLTDGTELHDTNFARIAEACGITGIRVEKASEVDDALQRAFAIDGPVLVDVVVAKEELAIPPQIKLEQAKGFSLYMLRAIISGRGDEVIELAKTNWLR >NZ_CP044107.1|WP_032670227.1|4185008_4186010_+|low-specificity-L-threonine-aldolase MIDLRSDTVTRPSRAMLEEMMAAPVGDDVYGDDPTVNELQRYAAELSGKEAALFLPTGTQANLVALLSHCERGEEYIVGQGAHNYLYEAGGAAVLGSIQPQPIDAAPDGTLPLDKVAAKIKADDIHFARTKLLSLENTHNGKVLPREYLKAAWDFTRERKLGLHVDGARIFNAVVEYGCELKAITQYCDSFTICLSKGLGTPVGSLLVGSADYIRRANRWRKMTGGGMRQAGILAAAGLYALKNNVSRLKNDHDNAAWMAAQLREIGADVMRHDTNMLFVRVGDEHAAALGDFMKARGVLINASPVVRLVMHLDVNREQLTEVVKHWQAFLQR >NZ_CP044107.1|WP_150391218.1|4186020_4187457_+|DUF2867-domain-containing-protein MPQRILVLGASGYIGQHLTTALSQQGHQVLAAARNTERLQKLHLPGVTCHNVDLNWPKALPALLEGVDTLYYLVHSMGEGGDFIAHERQVAMNVRDALRQTPVKQVIFLSSLQAPEHEQSDHLRARQLTAETLRSARIPVTELRAGIIVGAGSAAFEVMRDMVYNLPVLTPPRWVRSRTTPIALENLLHYLVALLDHPAEQHRVLEAAGPEVLSYQAQFEHFMRVSGRHRWLIPIPFPTRWISVWFLNVITSVPPTTAKALIQGLKHDLLADDLALRALIPQELIRFDDAVRNTLKEEEKLVNSSDWGYDAQAFARWRPEYGYYPKQAGCTVKTTASLAALWEVVNQIGGKERYFFGNILWQTRGALDLLVGHRLAKGRPAHPWLKVGDTVDSWKVIIVEPEKQLALLFGMKAPGLGRLCFTLKDNGDHRELDVRAWWHPHGMPGLFYWLLMIPAHLFIFRGMAKRIAQLAEEKRENN >NZ_CP044107.1|WP_017384852.1|4187549_4188563_+|NAD(P)-dependent-oxidoreductase MKVLVTGATSGLGRNAVEFLRNKGISVRATGRNEAMGKLLQKMGAEFVHADLTELVSSQAKVMLAGIDTLWHCSSFTSPWGTQEAFDLANVRATRRLGEWAVAWGVRNFIHISSPSLYFDYHHHRDIQEDFRPARFACEFARSKAAGEEVIDLLAQSNPHTRFTVLRPQSLFGPHDKVFIPRLAQMMHHYGSVLLPRGGDALVDMTYYENAVHAMWLASQPECDKLVSGRAYNITNGEPCTLRSIVQRLIDELKIDCRIRSVPYPMLDMIARSMERFGSKSAKEPALTHYGVSKLNFDFTLDISRAENELGYKPIVSLDEGIVRTAAWLRDHGKLHR >NZ_CP044107.1|WP_150391219.1|4188701_4190513_-|PKD-domain-containing-protein MNKRTLLSVLIAGACVAPLMAQAANLKETSSEPYTIKDSDLAKKEKELTDFPLMASVKETIQTLDNAQVELIEPGRAANPDNVKRVEGIVKASDWEYLFPLRAQAYTYSNFLKAVGKFPALCKTYNDGRDSDAICRKELATMFAHFAQETGGHESWRPEAEWRQALVHVREMGWSEGQKGGYNGECNPDVWQGQTWPCGKDKDGDFLSYFGRGAKQLSYNYNYGPFSEAMYGDVRTLLDKPELVADTWLNLASAIFFFAYPQPPKPSMLQVIDGTWQPNDHDKANGLVPGFGVTTQIINGGVECGGPTEIAQSQNRIKYYKEFANYLKVPVPANEVLGCANMKQFDEGGAGALKIYWEQDWGWSADTPDGKTYSCQLVGYQTPFSAFKDGDYSKCVQHFFNVKIVNDDGSSVTPDETPVTPTPTPSGDETPAPTPTPDETPVVVNHAPVAQIAGPIGAVEAGAQVSLSAEGSTDPDGNTLTYTWRSQDGQTVTGQDKAVVTFTAPESATAQQYEVSLTVSDGELSSTTSYLLNVKAKAATPSGEDTSYPAWSANSKYNAGDIVNNHGKLFQCKPFPYSGWCNNAPTYYEPGAGLAWAEAWTAL >NZ_CP044107.1|WP_045339555.1|4190647_4191088_-|type-III-secretion-system-invasion-protein-IagB MKKLILLLLIISQSALANCWNKAAHYYHVDPYLLYAIANVESGMNPYAIGQNRDGTRDVGLMQINSSHFTALESRGIDEYRLITEPCTSIMVGASILAGMIRVYGYNWEAVGAYNAGLKKENYPQRMKYAHKVWAKYQQLKLAARY >NZ_CP044107.1|WP_150391220.1|4191190_4193893_-|PKD-domain-containing-protein MKFMKPKYLALFIAAATSSAFAAAPGAPTIGYGNDKFALVEVDQAAQDYNNLVKVHNDGVDVKVEWNVWSGDAPTSAKVLLDGQTVWTGAAGATGSATFKVKKGGRYQEQVEVCNASGCAKSASKLIIVADTDGSHLLPLNTSLKENNKAFAKHTDKVVAAYFPEWGVYDRNFPVDKIPAANLNHILYGFIPICGGDGINDGLKTIEGGNSFRVLQNDCKGRPDYTVAIHDPWAALQKPQAGVSGWDDPYKGNFGQLMALKKAHPDLKVLPSIGGWTLSDPFFHMGDPAIRARFVSSVKEFLQTWKFFDGVDIDWEFPGGGGVSENLGNPQQDKATYTALMHDLRTMLNELSAQTGRTYELTSAIGAGRDKIEDVDYTAAQQYLDHIFLMSYDFYGGWSNTVLGHQAALRAPAWRPDTDYTTENGVNALLSQGVQPGKIVVGAGMYGRGWTGVHGYTGNNPFTGTATGMVKGTWEPGVVDYRQIVNEYKGKPGWEYGYDADAEAPYVFNKTTGDLITYEDARSTTAKGKYVLANKLGGLFAWSIDSDTGDILNAMNESLLGGDATPVDPEVTNHAPIASSADQDVSGPVTVTLDGSASSDPDGDAITYKWTQVSGPSVTITNSTKAKATFNVAAATSDQTMVFRLTVTDAKGLSNAIDIQVVNKAPKANQAPVLNPMEAITLESGETYALHAQAADPDGDALTYAWSVPADMHATGTDSANVNITAPEVSSTSTYTLSVVVSDGKTSVQSNVQVTVNPKAAPAPVPDDEDTNPADDVTPPADDVTPPSDKGSCDAPVDANASKYAAWESSKIYNGGDTVSFDHLVWKAKYWTQGNQPGFGVDAWELVSNVKMNWRSDLVYNGGDTTTYEGNVYRAKWWTRGDNPANSDVWVKEGASTDCK >NZ_CP044107.1|WP_015570997.1|4194147_4194945_-|prepilin-peptidase MNTFSLMRDACPVGFPIMSAILGGIVGSFLGVVAERVPGMVMDEEGSGNLLFPASHCPVCQHALAAWENIPLLSWLLLRGRCHQCGSAIPLRLFLVELISALFFGITAWCMPDVQALFSLWLLAAFLLPLAMIDWQHQLLPDCLTQPLLWAGLVLHAFDHTLPLRDALFGAVAGYLSLWLLYWAFRLITGREGLGYGDFKLLAALGAWCGWQALPSIELAAALSGIVGYFAVNNLNKNNLTISFGPYLAFAGIGVFMSQQFAFTF >NZ_CP044107.1|WP_150391221.1|4194941_4195427_-|type-II-secretion-system-protein-M MKERIAQLKSRYQNYSTREKIILKICAVAIVGAVVYYTGVIPLDNMIQNSKSTIKRQKETLNWMRSEIDKNHLQVQIVKTNNPRTVVENSAHEINLSLTDMRQEGQTLSFVLNRVNVYELRSWLREINQTSGVRLQKINLTPVDHLSDVKAQVQLTWSKNA >NZ_CP044107.1|WP_032647854.1|4195423_4196557_-|general-secretion-pathway-protein-GspL MKQVLFVRPDSREGGKIMWCESGSERVEVVDSLEMLAEHPLATRVCLLLPASDMIFRHFTLPKKVASQAMAFSWMAEETLIGDVDNLHWTVLHKKGADVDAVAIDADRLRAALTRCQEAGLNVIQALPDAWLLPVTTGGSTLVAQDDSYWLRLSPHVAGEMEATLLPLLMQKAGVGEVWCYGDAPAKVHVDVQHAWQHPLALIQPQWQTCRVNLLHGEFSLKAGHGRAAKSMKAAMVAVGVLSVALLLGPRIAMAWMLVQQENRVQEEIVQVYQHHFPSMRQQTNIKYHFGQSLKKQSKGFFLQLDELENARQSVPAMEIELLEYDAQQNTLTLSVSAQNQPALQAFVNQTSENFDFTLQPVSTTEPYTAMIAGKHK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP044107_2 | 2.1|4182793|31|NZ_CP044107|CRT | 4182793-4182823 | 31 | MK972693 | Salmonella phage SI23, complete genome | 6397-6427 | 2 | 0.935 |
NZ_CP044107_2 | 2.1|4182793|31|NZ_CP044107|CRT | 4182793-4182823 | 31 | MK770415 | Salmonella phage SF11, complete genome | 15202-15232 | 2 | 0.935 |
NZ_CP044107_2 | 2.1|4182793|31|NZ_CP044107|CRT | 4182793-4182823 | 31 | MK972694 | Salmonella phage SE22, complete genome | 27485-27515 | 2 | 0.935 |
NZ_CP044107_2 | 2.1|4182793|31|NZ_CP044107|CRT | 4182793-4182823 | 31 | MK770414 | Salmonella phage SE16, complete genome | 19098-19128 | 2 | 0.935 |
NZ_CP044107_2 | 2.1|4182793|31|NZ_CP044107|CRT | 4182793-4182823 | 31 | NC_013059 | Salmonella phage c341, complete genome | 37506-37536 | 2 | 0.935 |
NZ_CP044107_2 | 2.1|4182793|31|NZ_CP044107|CRT | 4182793-4182823 | 31 | MK972686 | Salmonella phage SF3, complete genome | 38552-38582 | 2 | 0.935 |
NZ_CP044107_2 | 2.1|4182793|31|NZ_CP044107|CRT | 4182793-4182823 | 31 | MK972685 | Salmonella phage SE10, complete genome | 39771-39801 | 2 | 0.935 |
NZ_CP044107_2 | 2.1|4182793|31|NZ_CP044107|CRT | 4182793-4182823 | 31 | MK972687 | Salmonella phage SE1 (in:P22virus), complete genome | 30045-30075 | 2 | 0.935 |
NZ_CP044107_2 | 2.1|4182793|31|NZ_CP044107|CRT | 4182793-4182823 | 31 | FJ000341 | Salmonella phage g341c, complete genome | 37506-37536 | 2 | 0.935 |
NZ_CP044107_2 | 2.1|4182793|31|NZ_CP044107|CRT | 4182793-4182823 | 31 | EU570103 | Salmonella phage epsilon34, complete genome | 39547-39577 | 2 | 0.935 |
NZ_CP044107_2 | 2.1|4182793|31|NZ_CP044107|CRT | 4182793-4182823 | 31 | MK972692 | Salmonella phage SE21, complete genome | 26621-26651 | 2 | 0.935 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | JF974302 | Vibrio phage VBpm10, *** SEQUENCING IN PROGRESS ***, 8 unordered pieces | 21037-21067 | 6 | 0.806 |
NZ_CP044107_2 | 2.4|4182853|29|NZ_CP044107|PILER-CR | 4182853-4182881 | 29 | NZ_AP023151 | Klebsiella pneumoniae strain SMKP03 plasmid pSMKP03S, complete sequence | 22220-22248 | 6 | 0.793 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | MG592513 | Vibrio phage 1.142.O._10N.261.49.E11, partial genome | 26631-26661 | 7 | 0.774 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | MG592412 | Vibrio phage 1.028.O._10N.286.45.B6, partial genome | 26631-26661 | 7 | 0.774 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | MG592527 | Vibrio phage 1.159.O._10N.261.46.F12, partial genome | 26631-26661 | 7 | 0.774 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | MG592588 | Vibrio phage 1.217.O._10N.261.45.A1, partial genome | 26631-26661 | 7 | 0.774 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | MG592589 | Vibrio phage 1.219.O._10N.261.45.E2, partial genome | 26631-26661 | 7 | 0.774 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | MG592524 | Vibrio phage 1.156.O._10N.261.45.A6, partial genome | 26631-26661 | 7 | 0.774 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | MG592669 | Vibrio phage 2.159.A._10N.261.46.F12, partial genome | 26631-26661 | 7 | 0.774 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | MG592670 | Vibrio phage 2.159.B._10N.261.46.F12, partial genome | 26631-26661 | 7 | 0.774 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | MG592507 | Vibrio phage 1.136.O._10N.261.45.E11, partial genome | 26631-26661 | 7 | 0.774 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | NZ_CP043441 | Cupriavidus campinensis strain MJ1 plasmid unnamed1, complete sequence | 1534665-1534695 | 7 | 0.774 |
NZ_CP044107_2 | 2.4|4182853|29|NZ_CP044107|PILER-CR | 4182853-4182881 | 29 | NC_015184 | Agrobacterium sp. H13-3 plasmid pAspH13-3a, complete sequence | 62536-62564 | 7 | 0.759 |
NZ_CP044107_2 | 2.5|4182913|29|NZ_CP044107|PILER-CR | 4182913-4182941 | 29 | NC_023006 | Pseudomonas phage PPpW-3 DNA, complete sequence | 36335-36363 | 7 | 0.759 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | NC_015184 | Agrobacterium sp. H13-3 plasmid pAspH13-3a, complete sequence | 62534-62564 | 8 | 0.742 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | NZ_CP035511 | Haematobacter massiliensis strain OT1 plasmid pOT1-1, complete sequence | 283146-283176 | 8 | 0.742 |
NZ_CP044107_2 | 2.3|4182913|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182913-4182943 | 31 | NC_023006 | Pseudomonas phage PPpW-3 DNA, complete sequence | 36335-36365 | 8 | 0.742 |
NZ_CP044107_2 | 2.4|4182853|29|NZ_CP044107|PILER-CR | 4182853-4182881 | 29 | NZ_CP054621 | Azospirillum oryzae strain KACC 14407 plasmid unnamed6, complete sequence | 714186-714214 | 8 | 0.724 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | NZ_CP054620 | Azospirillum oryzae strain KACC 14407 plasmid unnamed5, complete sequence | 22328-22358 | 9 | 0.71 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | NC_013859 | Azospirillum sp. B510 plasmid pAB510e, complete sequence | 475099-475129 | 9 | 0.71 |
NZ_CP044107_2 | 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder | 4182853-4182883 | 31 | NZ_CP054621 | Azospirillum oryzae strain KACC 14407 plasmid unnamed6, complete sequence | 714186-714216 | 10 | 0.677 |
NZ_CP044107_1 | 1.1|3601318|58|NZ_CP044107|CRISPRCasFinder | 3601318-3601375 | 58 | NZ_LN868946 | Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 4, complete sequence | 15954-16011 | 12 | 0.793 |
1. spacer 2.1|4182793|31|NZ_CP044107|CRT matches to MK972693 (Salmonella phage SI23, complete genome) position: , mismatch: 2, identity: 0.935
cgtgactaaaggcatgagcaaatcaggcaag CRISPR spacer cgtgactaatggcatgagcaaatcaggcagg Protospacer ********* *******************.*
2. spacer 2.1|4182793|31|NZ_CP044107|CRT matches to MK770415 (Salmonella phage SF11, complete genome) position: , mismatch: 2, identity: 0.935
cgtgactaaaggcatgagcaaatcaggcaag CRISPR spacer cgtgactaatggcatgagcaaatcaggcagg Protospacer ********* *******************.*
3. spacer 2.1|4182793|31|NZ_CP044107|CRT matches to MK972694 (Salmonella phage SE22, complete genome) position: , mismatch: 2, identity: 0.935
cgtgactaaaggcatgagcaaatcaggcaag CRISPR spacer cgtgactaatggcatgagcaaatcaggcagg Protospacer ********* *******************.*
4. spacer 2.1|4182793|31|NZ_CP044107|CRT matches to MK770414 (Salmonella phage SE16, complete genome) position: , mismatch: 2, identity: 0.935
cgtgactaaaggcatgagcaaatcaggcaag CRISPR spacer cgtgactaatggcatgagcaaatcaggcagg Protospacer ********* *******************.*
5. spacer 2.1|4182793|31|NZ_CP044107|CRT matches to NC_013059 (Salmonella phage c341, complete genome) position: , mismatch: 2, identity: 0.935
cgtgactaaaggcatgagcaaatcaggcaag CRISPR spacer cgtgactaatggcatgagcaaatcaggcagg Protospacer ********* *******************.*
6. spacer 2.1|4182793|31|NZ_CP044107|CRT matches to MK972686 (Salmonella phage SF3, complete genome) position: , mismatch: 2, identity: 0.935
cgtgactaaaggcatgagcaaatcaggcaag CRISPR spacer cgtgactaatggcatgagcaaatcaggcagg Protospacer ********* *******************.*
7. spacer 2.1|4182793|31|NZ_CP044107|CRT matches to MK972685 (Salmonella phage SE10, complete genome) position: , mismatch: 2, identity: 0.935
cgtgactaaaggcatgagcaaatcaggcaag CRISPR spacer cgtgactaatggcatgagcaaatcaggcagg Protospacer ********* *******************.*
8. spacer 2.1|4182793|31|NZ_CP044107|CRT matches to MK972687 (Salmonella phage SE1 (in:P22virus), complete genome) position: , mismatch: 2, identity: 0.935
cgtgactaaaggcatgagcaaatcaggcaag CRISPR spacer cgtgactaatggcatgagcaaatcaggcagg Protospacer ********* *******************.*
9. spacer 2.1|4182793|31|NZ_CP044107|CRT matches to FJ000341 (Salmonella phage g341c, complete genome) position: , mismatch: 2, identity: 0.935
cgtgactaaaggcatgagcaaatcaggcaag CRISPR spacer cgtgactaatggcatgagcaaatcaggcagg Protospacer ********* *******************.*
10. spacer 2.1|4182793|31|NZ_CP044107|CRT matches to EU570103 (Salmonella phage epsilon34, complete genome) position: , mismatch: 2, identity: 0.935
cgtgactaaaggcatgagcaaatcaggcaag CRISPR spacer cgtgactaatggcatgagcaaatcaggcagg Protospacer ********* *******************.*
11. spacer 2.1|4182793|31|NZ_CP044107|CRT matches to MK972692 (Salmonella phage SE21, complete genome) position: , mismatch: 2, identity: 0.935
cgtgactaaaggcatgagcaaatcaggcaag CRISPR spacer cgtgactaatggcatgagcaaatcaggcagg Protospacer ********* *******************.*
12. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to JF974302 (Vibrio phage VBpm10, *** SEQUENCING IN PROGRESS ***, 8 unordered pieces) position: , mismatch: 6, identity: 0.806
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer acaatgctctggcgcgtttcagtgtcgccgt Protospacer * **..* ********* **.**********
13. spacer 2.4|4182853|29|NZ_CP044107|PILER-CR matches to NZ_AP023151 (Klebsiella pneumoniae strain SMKP03 plasmid pSMKP03S, complete sequence) position: , mismatch: 6, identity: 0.793
aaaacacgctggcgcgtgtcggtgtcgcc CRISPR spacer aaaacacgctggcgcgtgtcgtgcgcttc Protospacer ********************* * .*
14. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to MG592513 (Vibrio phage 1.142.O._10N.261.49.E11, partial genome) position: , mismatch: 7, identity: 0.774
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer actatgctctggcgcgtttcagtgtcgccgt Protospacer * *..* ********* **.**********
15. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to MG592412 (Vibrio phage 1.028.O._10N.286.45.B6, partial genome) position: , mismatch: 7, identity: 0.774
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer actatgctctggcgcgtttcagtgtcgccgt Protospacer * *..* ********* **.**********
16. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to MG592527 (Vibrio phage 1.159.O._10N.261.46.F12, partial genome) position: , mismatch: 7, identity: 0.774
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer actatgctctggcgcgtttcagtgtcgccgt Protospacer * *..* ********* **.**********
17. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to MG592588 (Vibrio phage 1.217.O._10N.261.45.A1, partial genome) position: , mismatch: 7, identity: 0.774
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer actatgctctggcgcgtttcagtgtcgccgt Protospacer * *..* ********* **.**********
18. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to MG592589 (Vibrio phage 1.219.O._10N.261.45.E2, partial genome) position: , mismatch: 7, identity: 0.774
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer actatgctctggcgcgtttcagtgtcgccgt Protospacer * *..* ********* **.**********
19. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to MG592524 (Vibrio phage 1.156.O._10N.261.45.A6, partial genome) position: , mismatch: 7, identity: 0.774
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer actatgctctggcgcgtttcagtgtcgccgt Protospacer * *..* ********* **.**********
20. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to MG592669 (Vibrio phage 2.159.A._10N.261.46.F12, partial genome) position: , mismatch: 7, identity: 0.774
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer actatgctctggcgcgtttcagtgtcgccgt Protospacer * *..* ********* **.**********
21. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to MG592670 (Vibrio phage 2.159.B._10N.261.46.F12, partial genome) position: , mismatch: 7, identity: 0.774
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer actatgctctggcgcgtttcagtgtcgccgt Protospacer * *..* ********* **.**********
22. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to MG592507 (Vibrio phage 1.136.O._10N.261.45.E11, partial genome) position: , mismatch: 7, identity: 0.774
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer actatgctctggcgcgtttcagtgtcgccgt Protospacer * *..* ********* **.**********
23. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to NZ_CP043441 (Cupriavidus campinensis strain MJ1 plasmid unnamed1, complete sequence) position: , mismatch: 7, identity: 0.774
aaaacacg----ctggcgcgtgtcggtgtcgccgt CRISPR spacer ----cccggcacctggcgcgtgacggtctcgccgt Protospacer * ** ********** **** *******
24. spacer 2.4|4182853|29|NZ_CP044107|PILER-CR matches to NC_015184 (Agrobacterium sp. H13-3 plasmid pAspH13-3a, complete sequence) position: , mismatch: 7, identity: 0.759
aaaacacgctggcgcgtgtcggtgtcgcc CRISPR spacer tgagatcgcgggcgagtgtcggtgtcgcc Protospacer .*. *** **** **************
25. spacer 2.5|4182913|29|NZ_CP044107|PILER-CR matches to NC_023006 (Pseudomonas phage PPpW-3 DNA, complete sequence) position: , mismatch: 7, identity: 0.759
gagggagaaagccaggaagtctggcagtt CRISPR spacer cggcgagaaggccaggaagtcgggcagca Protospacer .* *****.*********** *****.
26. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to NC_015184 (Agrobacterium sp. H13-3 plasmid pAspH13-3a, complete sequence) position: , mismatch: 8, identity: 0.742
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer tgagatcgcgggcgagtgtcggtgtcgccgc Protospacer .*. *** **** ***************.
27. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to NZ_CP035511 (Haematobacter massiliensis strain OT1 plasmid pOT1-1, complete sequence) position: , mismatch: 8, identity: 0.742
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer tagaactgctggcgcgtgtcggcatcgccga Protospacer *.* .***************..******
28. spacer 2.3|4182913|31|NZ_CP044107|CRT,CRISPRCasFinder matches to NC_023006 (Pseudomonas phage PPpW-3 DNA, complete sequence) position: , mismatch: 8, identity: 0.742
gagggagaaagccaggaagtctggcagttag CRISPR spacer cggcgagaaggccaggaagtcgggcagcaaa Protospacer .* *****.*********** *****. *.
29. spacer 2.4|4182853|29|NZ_CP044107|PILER-CR matches to NZ_CP054621 (Azospirillum oryzae strain KACC 14407 plasmid unnamed6, complete sequence) position: , mismatch: 8, identity: 0.724
aaaacacgctggcgcgtgtcggtgtcgcc CRISPR spacer ctgcggcgctggcgcgggtcggagtcgcc Protospacer . .********** ***** ******
30. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to NZ_CP054620 (Azospirillum oryzae strain KACC 14407 plasmid unnamed5, complete sequence) position: , mismatch: 9, identity: 0.71
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer atcctgggctgccgcgtgtcgatgtcgccgg Protospacer * .. **** *********.********
31. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to NC_013859 (Azospirillum sp. B510 plasmid pAB510e, complete sequence) position: , mismatch: 9, identity: 0.71
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer atcctgggctgccgggtgtcggtgtcgccgg Protospacer * .. **** ** ***************
32. spacer 2.2|4182853|31|NZ_CP044107|CRT,CRISPRCasFinder matches to NZ_CP054621 (Azospirillum oryzae strain KACC 14407 plasmid unnamed6, complete sequence) position: , mismatch: 10, identity: 0.677
aaaacacgctggcgcgtgtcggtgtcgccgt CRISPR spacer ctgcggcgctggcgcgggtcggagtcgcccc Protospacer . .********** ***** ****** .
33. spacer 1.1|3601318|58|NZ_CP044107|CRISPRCasFinder matches to NZ_LN868946 (Salmonella enterica subsp. enterica serovar Senftenberg strain NCTC10384 plasmid 4, complete sequence) position: , mismatch: 12, identity: 0.793
gaaacgataaaaagccgggtggcggctacgccttacccggcctacatgttctacatat CRISPR spacer tcgcaaataaaaagccgggtggcggctacgccttacccggcctacatcgtctgcttga Protospacer . .***************************************** ***.* *.
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
2656359 : 2709677
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP044107|2656359:2709677|DBSCAN-SWA ATTAACGAACTACCGCTTGTCCTACCGTCTCATCCGCCTTCCATACGCGATATTTCACATCCACGTTTTCCGGCGTGTATACGACGATCGGCAGCTTGCTGTTATAGCGCAGCAGACTGTTATCACCCAGATACGCGGTAATAAATTTCTTCTCTTTCTTGCCGTCCGGGCACGCCATCATGGTGGAGACTGGCGAGGTTACTTTGTCGAAGACATAGTAGTCATAGCCCCAGCCTTCCAGCGTTTTGCTTTCCAGCTGGCCGCCCAGGCGATGCTGGTTGCAGTCCACTTCCAGCGTCTGGCCGATTAACAGCTCAACTTTGAAATTTGCTTCATCCTGCTGAGCCGGGAGCTGAATAACCTGGCGCTTCATCCCTTTGTCCGCTTGCGGATACGGAGCAACTTTTTCCAGCGGCTGCTCTTTCCGGGTTTCGCTGGCGAAAGCACTGCTGCTGGCGCACGCAGCGGCGATGAGGGCGATGGCAAATTTAGGTGCGTTTTTCACTGTCATTCCTTTTTTGTCAAAACGATCCGGTAAAGACTAACATAACGTACGTGAATGGTTGGCGCAATTTACTCATGTTTTCAATTTAAATGGAGTAATACACTGCTTTGGTGTTCTTTTTGTGGTGTTTAATGCTGCGTTGGAGTCGGGGCGGCAATCGTGTTATTTTACGCGTTAATTATTGTTATTCATCTGTAAAGGGAGGGGAGATGGTACGGGAAAAACTGAAGACACCTGAAGGCCGCAAGTTCCTGCTGGCGTTACTGGTTGTATTTATGATTGCTGCCGCATGCGTGGGGCGAGCAACCATTGTCGGTGTGATTGAGCAGTACAACATCCCGCTCTCTGCCTGGACAACCAGCATGTTTGTCCTGCAATCGGCGATGATCTTTGTTTACAGCCTGGTGTTTACCGTCCTGCTGGCGATTCCGCTGGGGATTTTCTTTCTCGGTGGGCGTGAAAAACACTAAGCATTGCGCATGCGGAGACAGGCAATAAAAAACCAGCCATAAGCGGCTGGTTTTTTTGTGTAGTTTTGGTCGGCACGAGAGGATTTGAACCTCCGACCCCCGACACCCCATGACACCGCTCATAAAACGCTGAGAACCGCGGCATTGCTGGGGGTAAGCGTAATACGCATGTATAAACAAACAGCGCTTTTTTTGCAAAATCTGCTCTATACACATCAATGGGTTAGCGGGGAGTTTCTCCCCATACCTTAATAGTTCTAAAAGCTTGTATGGCATAATACGGGAACAATCCAGTAGAGACTGCAATATGAAAAAAAGAACTAAAAAACATACTAATATTCGTTACCGAGACCACACAGAAATCGAAAAAACTTACATTAGTCTTTATAAGCAGTTTTGCCGAGCTTGTGAAATTTCTATTGTAATCTCGTTGAATCAGGGCGGAATCGACACTGATGGTCGGGGAAGACGTGCCGCAGACATTTTCACACGACAAGTGTTGACGGTGCACAGCCTCAAGAAACTACTCCCTGTATTGCGCAACGGAAATGATCCTGAAGATACGGTGTGGGATTTAACCACCATTGCACTTGTATCTAGGAGTGTAATGGAAAATTTTCAGGCACTGTTTTTTTACGGTACGGAAACCATATCGGAGTCTGAGGCTGACTTACGGTTTCGTATTTTCCAAAAAGACAGAAATGTTAAATGGCGTGATATTCGGATGAAGGCTGGTGAATCGGCAGAAGAACTAGAGGAGTTTTTCACCGGATTGGCTGAACAGCAGAAAATTATAGTCAATCATGAGTTCTATCATTCCCTATCAAAAGAGCAAAAGAATAGTCTCAAAAATAGGGCTGAGATGTATTATTCTAAAGCAGAGTTTGAAGCTCACTGTCCGCGGTTAGCCAACATTGCGCTACATCATCAATTACTTTCAAATCTGGCTCATCCCTTACCTCTGGCTATCCATCGAATCGACGAAGTTAGCGGTCATGGCAGCCCAAGCGAAGCTGATATTAGATTAGTTGTCATTTCTCTGAATGTGGCTACACATTGTCTAATCGCCTCAATTGAGGAGATGGGGAAAAAGTTTTCTGATAGCATCGGTAAGCAGTATCGATCTATGATTAATGAACTCTCAGATTACCCAGCTTTAAATTAACATGCGTCTCTCCTACGGCATAGAACAACCCAAAAGATTAGGTTTGCTTTATGCCGTAGCAATGTTCAGTGCATGCTTAATCAAATTTAATCTTAACTTGCTTTTATTTCCCCATGTGGCACAACAACCCAATTGATATGATTTTGAGTATAAATTTTTGTCGACTTTGCATCGCTATGCGCCATACGTCCTTGGGGGTCGATGCCCTGTCTATCAAACAGGTGAGCAGCTAAAGCACGTATTTCATGGAAAGTCGGTCTTTCATCCATCGTGAGGTGACTGCATATTCCGAGTCTGTCTCGTACTGCTGAGAATGATCGGCTTAAATAGTCTGGTGCTATCTGTGTGGGATGTGAAACCTCTTTACTTCGTTTTACCTGCCTTTCTGGTATCCGGTGCACCACATACGGACTAGCAACATTATCTCTACTGTCATCAATTATGCGTTTTAATTCATCACCGATAGGGATCGCAACATGCGAGGCTTCTTTCTTTTGTACCTTTTGGCGATGAATATAAAGCGTTCCATATATGCCATCCTCTGGCTGTTCTAGCCACACACAACCACAAACGCCGTTCTTTGGTTCACGTATTGAATACCTAATCCTAGACACTTCAAGGCGTGCATGTGTCGTCTGTAACGCCAAATCCATCGCAGTTCTTAGCCACGGTGCAGCAGCACGACGTATAGCCATGAAATGCTCTAGCGACAGTCGTTGCCTTTTCTTCTCTTCGGTCCTGCGCATTTTCTTCCTGGTTGCTGGATTGTCGAGCATCAGGGATTCATCGACCGCATACGAAAAAAGTTTTTTGAGAAAACTGACTTTTCTGTTTTGCACGTTGGCGGACGCGCCGGCATGGTATTTGTTGATATAGGCGTTAACGTGCTCCAGTTCGATATCACAAGCTGGAATATTAATGAAGAACTCTTTGACGCGTAACGCGTCATTATTCCAATCATCTAATGTACTCGGAGAGGGGCGTTCGTCTTCAACAGCCCGAGCCATGATGTGATCCATGTGATCTGCAAATGGTTTGGCTTCTCCAGTAACCCCGCCGGATTCTCTAATGAGATTATCAACGGATGGGGAGAGTTCAGGTCTCATTCTAAGATTGTACTCTCGGGCAATAGCTATGGCCATGGCCCGATCCTTACCTATATTCTTCTTCTTTCCTGTAACAAGCGTGAATTTATAAACACCGCGATCTTTATCAAAAAATAAATAATCTGGTAGGTGTCTGTATTCTTTTTTTCTTGGCCTTGCTGCCATGGTTAGCCCTCATTAATCAACTGGCGAACTGCTTGACTAACCATTGAGTCGACGCCCCATTTTTCTGTTTCGCAGACAAAAACAGATCCGTCCACGATACGCCCCATGAGTAAACCATTCTCGACCCAACGTTTAATCGTTCGGTTATCAGGAACAGAGTCATTGGTGAACTCTCGGCGTCCCCATTGACTCGCTTTCATCAGCTTTGCCATGGTTTCCTCCTCCATAAAGCCCGGCTGCACCCGGGCTGAGTGGTTTACTCGTTAGTACTGGTGGCAGGGATAAGTTTCTGCCAAATTGCTGACACATATTTTGCCTGATGGCGCGCATCAGCTAGTGCATTGTGCACATCGCCAATGAAAAGCATGTCACGCTTTGGATCGAAACCGACATTGCGTCCGAGGGTAACAATCGTGCGTACATCATGATCGTTCCAGAATTCCCATGGGCAAATGCGTCCGGCGCGTTCGTAAGCTCCACGCAGAATCACATTGTCAAAGGTGGCCCCGTTACCCCAGACCTTCATGTATTTGAGATTATATGCATGCCGGTGAATGAAATGGCTGAGTTCCGACAGCGCATCAGTGATAGACATCGCGTCATCAACACAAATAGCTGAGCGTGCTTCCGGGCTTTGTTTTAGCCACCAAAGAATGGTATCTCCATCCGGTACCGCGCCTTGAGCCATTGCGCTTTCAACCGAGACGGCCGTGTAGAATTCTTGGCCAAGCTCACCTGTTTGAGGGTTAAAAAAGACGGCACCAATTGAGACGATCGGCGCGTTTGGCTTTTTCCCCATTGATTCGAGGTCAATCATTAAATCGTTCAACTGCTTTCTCCATTGTTTATTCTGGGCTTTTGATTCACTGGGCTGCTGTAGCGCGAGTTGGCGTTGATGTGGCAATAATGAGCCCCTTCAGGACGAGTGCAAATTGTCCCGCACCTATCACATGCCAATACTGCTTCCAGCTCCGCGATCCGCTCTTCTGCTTTTGCCAGCTGATTGAGAAGTTCTTCAGCTAACGTCACACGCATCACTACCTTCTGACATTCATGGCGTTTGGCTCTGGCGATTGTGGTGCGCAGAGTCGCGTATTTGTTGGGTGTCAATGGACGGACTCCTGACGAAGATGGTTTATTTCGGCGTCAAGGCTCATTCGCTGGTCCATCGATTCCGTCAAGGCGGCAAATGTAACGTCCAGGCGAGTGGCTACCTCACGCATCAGAGAGGCTTCTGCTGGTGGCAGTTTCCCCGCCGCAGCATGGGCTGCGGCTACCAGTTCTTTTATCTTCATGCGAGGCATGCGCGTGATTCCGTAAGCTCATTGAAACGGTTAATCAACAAGCCATATGCCTGGCCTGGTCGAAGAGGAACGATCTGGATAATGTCGCTGGCCGGAATACCTTCGAGGCAAGGCCAGAGTGAGCCGTCGTCGATATCCAGATCGCGGCGTTCCGTGGCAAGCATCACCAGATCGGCGTATTTCACTACCTCTGACATATCAGGGGTGATGCTGAATTTGGCCCGGATCACCTGTTCTACCCGCTCTTCAATGCGACGGTAATCTGGAAGCAATGCTTTCAGGCGGGCAGGGATGTCCTGGCAATAGGCTTCAGCTGCGTCATGCATCAGGGCTTCAAAGGCAAACTCTGGCGGCACAATTTGGCTGCACAGTACCGAGTGCTGGGCCACGCTGTAAAATTCCGGCAGATGACCACTGAAGCGGCAGATGTGGGAAAGTGCTGTGGCAATATCCTCGATCTCTACGTCGTCAGTGGTTGAATTGAGGTAATCGAATTTCTTACCTGAAAGTGTCTGGATATAACTCATCGTATTTTCTTCTCCATATTTGGCAGCTGCACCTGCGCCAGTTTTTGGTTGTACGAATCCCTCGCCATTGGCGATTAATAAAGGGGAATTACGCTTCAATAAATCCCCGCGGCGCCGGGGATTTAATGCAGAGAAATTACGCTTTAAAGTTACCGATAAAGGTTTCAACCGGCTTGTCGGTGAACTTCTCGATTAGCAGGTCACGGAACTCGTTGGCGATAGCTTCTTCCTGGGCTTCCAGTTGAACGATGCGGAGTACAAACACCGGTTCCCCGCTTTTAAGCAGGCTGTTACGCAGGTTAAAGCGGCGTTCGCCCAGGCCTTCATATGGAACGCATTTGAACTCAAAGGCCACTGGCATGACGTCTTTACTGCTGGCTTCAACGCTCTGCATCAGGGACTTTTTGCCGCCAAAATCTTCGTCTTCATGAGCTGATTCTGAGACTTGTTTAATGTTGACGCGGCGAACAGCACCAGCTGCCTGCGCGATGGACAACATATTCCCGTCGGCATCAAATGCGGTCAGGAAGTCGGCCCAGTCCTCCAGCCATTCAGCAATTTCTTTCTGGCCCAGTCGATCGCCATTTACCTGAAGCAGGGCTCGGAATGGCGCTGTCTTTTTGAGGGTGATCGAGGCGATGTTATCAGCATGGCCCGGGTTAGCCAGCGTACCGATATTGAACATGGAGCGTGCGGTCATGTTGTCTGCATCGATAAAGCATCGTGCTGGTTCGTCGGCTTTGGCGTAGCCTGCGGCGTAACGCACAAAATCAGGAATACTGGTTGTGGTCATGGCGCCACGGAAGCGAAAACGCTCAAACTCGAAACGCTCGAGGCTTTCTACGTGAACCCCTTCAGGAAGCAGGGCAGTCGGGCATGCCGTAACTTTTGCTGCATCCAAGTGGTAACCGGAAAGAACCAGGTCTTTCACCTGCTGCAGGGCATTGCCGTCTAAAATCTGGGACATAAAATTTCCTTAATATGTGGTCAATGGGATGTCAGTGATTTGTCTGCTGCGGATCACTGTGCCGCTTTAAGCTTTCCGTCAACGCCGCCGTTGATCCCGAACAGCTGCCCCTGATCTTCCTGCAGGATGGTCAGCTTGCCGCCTTTGTTAACCCACATTGGTGTTTCGGTGGTGTCTTCTTCGGAGGCTTTACCGCGCGGGGTTGGGGTGACGTAGTTCAGCTTGTGCTTGATCTTGACGCGCTTCTCTTCGACGGAGTTACCCATACGCTCAATATCAAAGGTGAGGACTACTTTGCCTTTGGTCCCGTTGTTCAGAACGCCAAGCGCGGTAGTGTTTAAAGCTGCCGCGATCTTGTTCATGAACACGCCGGCATCCAGTTCGACCAGGAAATCGGGCACTACGGTCATGCGGTCATTACTCATGGTTTAACCCTCTGTGAGGCGGCTGCCACCGCCAGTGGAACTTCTCCATACACAACAGAAAAGGGCACCTGTACTTCGGCTATGGGTAGGAGGTCCATTTCCATAGCACCCGGATGGATTGGGGAATGAGCCCGTCGCCCGGTGATGCCCTTGTCTCTTGTGTAAAAAAGGTGCCCACCGATGTGATGGGCAAAGACTACACACAGCAATGATTTTGTTGTGGCGGTGGTGCCTCCACTTGCCGGACCGGCCAGAACCGGCGACGCTACACCTCAAGAAACGTATTCATTTCAAAAGTTGAAATAAAAACTTGTTGGCCTCGTCACGTGCGCAGAGCCGCATTACCACAACTGGAAGCGCACTCTTTCGGTAACAAACCTGCCCCCGCAGTGAAAGAGAGAAGGAGTGCGCTTTCACGTTGTGCCCTTAAAAAGCTGGCTGTCACCCTCAAGGGGAAAGTGAGCAGCCAGAACAGGGATCACTTCTTATTGCTTTGGCCTGCTTTTAACCACATCAGGCGCGGTGGTAAGCCATTTGCGAATCATCCGGTCATTCATACGCCACCGGCGGCTACTTCGTGGGCTTCCTCCCTGTTCGCTGTTGATGAAAATGAAGATAAAAGATATTTGCGAAATGCGCAAGAAATAAAATGCGTAAAACGCAATTGAAAGGGCGTAAAAAAACCGCCGTGTAGGCGGTTATTTATTCTTAGGGGATGTTATCCGTGCCGTTTAACTGACTGCGACTGGCTGATCATTACCTTGCCGAAAACGTAAAATCTGTCTTCATTGGACTTATCCACTGACCACTCCCGATATTTAGGGTTATCGGAAATGACAAGAATTTTATCTGGGATCATCTGTAATCTTTTGACGTAAATTTTATCATCAAAACCAAATACATAAATCCCATCACCATCAAACTCATTGATCGAAATATCAACGAATATCAAATCACCTGGTTCAATGGTATCAGCCATGCTGTCACCACGAACGTTAATCACTTTGACCGTATCTGGTGTTCTCCCACCAAATAATACGGCGGCTCTTTCGTTGTTGTACTCAATGGACCTGATGACATCGACGACATCACTTCCTTGTATATGGCCCATGCCAGCACTGGCGCTGACATCGAGCAACTCGACTCTAAACACAGGGTCACCCTCTCCATAAGCTGGATTTTTACCACTGGATTTACATACAGTAGTTTCATTTGGAGACGGTGTAAAGAGTTCTGCCACGCTAACACCAAGTGCCGTAGCATATTTGCTAAGCGATTGTTCAGTAAATGACTTTTGTTTTCCAGTCTCAACCCGTGAGACGTTCGCACCGTCAATACCTACAGCTTCAGCAAGATCTGAAATTTTCATGCCCTTTTCGAGGCGTAATTCTCTAATGCGGTTTCCTATATTCATGCGTCCATTACAGGTTGTTTTTGCGTGATATGCAAAGCAACTTGCGCAAGTCGTAACTACACACTAATATGCGTAATGCGCAATTACAGGGGGCATTATGCAATCACCGTTAAGAAATTTGCGAAAATCGCAAGGTTTAACTCTCTCTCATGTGGCAAACGTGGTGGACATTGATCCAGCCAACCTAAGCCGAATTGAAAGGGGTCAACAAATCGCATCACTCGATGTAGCTGAAAGGCTTGTGAAGTTTTATTCGGGCCAAATTGATGAGCTCCAAATCTTGTACCCGCACCGTTATACGCAGGCTACAGAAAGTGGCGCAGCATCGGTACCACAGGAAAAAGGGGAAAGCCGTGGGTAACGAACCGGAATGGAAAGTTGATAAGCAGCCAGCCTGGCTGGTGGCCGCAATCAAAAAAACGATCACCGAGCTGCCTGGCGGATATTCAGAAGCTGCTGAGTGGTTGGGTGTGACCGAGAACGCGCTGTTTAACCGGCTACGAACCGACGGTGATCAGATCTTCCCGCTCGGTTGGGGGATGGTGCTTCAGCGTGCTGGTGGTTCAAACCACATAGCGAACGCTATTGCACGTCACTCGAACGGTGTTTTTGTGCCATTGACTGATGTTGAAGAGATTGAGAACGGCGATATCAACCAGCGTCTCATGGAGTCAGTTGAGTGGATCGGCAGGCATTCGCAATACGTTCGTAAAGCCACCGCTGACGGCGTTATTGATGCTCAGGAACGCGCCCAGATCGAAGAGAACAGCTATCAGGTGATGGCTAAGTGGCAGGAACATTTGACGCTGCTTTTCCGTGTGTTTTGCGCGCCGGAAAAGAATGACGCCCGCGAGTGTGCAGCTCCGGGCGCCGTGGCAGACAAATCTTGTATGGAGAAGTAATCCGCATGACCAGTTTAACGGCTTTTAACCGTTTACCGCAACTCAGGATGATCCCGGTACCGGGCGCTCCGTTGTTTCGGTATGAACGCAGAATAGCAAACCGCTGGGTGCCATGTAACCACAGTCGGGCGGTCGCAATTGTGGGGGTTTACTACAGGAAGGCGAAACGCTTATGCGCGAAGTTAACCGAAGGTTCAAAGACCACAGAGGGATCCCCGTTCGGGTTATCAGGTGGGAGCCAGAGACTCAACGAGTTATCTACCTGCGGGACGGTTATGACCACGAATGTTTCAGCCCGCTCGAACAATTCAAGCGCAAGTTTACAGAGTTAAAGGACGACCATGAGCACTAAATTAACGGGTTACGTTTGGGACGCTTGTGCCGCTTCTGGCATGAAGCTGTCCAGCGTTGCCATCATGGCGCGTCTGGCTGACTTCAGCAGTGATGAAGGGGTTAGCTGGCCTTCCATTGCTACCATCGCGCGCCAGATTGGTGCTGGTGAGAGCACGGTTCGCACAGCCATATCTCAGCTGGAAAAAGACGGTTGGTTAACCCGCCAGCAGCGCCGTAAAGGCAACCGAAATGCATCGAACGTTTACCAGCTCAATGTTGCGAAATTGCAGGCTGCTGCCTTTTCTCACCTGTCAGATTCTGACGCATCAAAATCTGAGGCATCAAAAAACGATGAGAAAGGTGGTTTTCACCCGTCAGAATCTGGGGGGGATCCGTCAGTAAATACAACTACTGATCCATCAGTTAAAAAACCTTCTTGTCCGGTTGTGAAGCAACCAGACCCTGAAGTGACGATCACCGATAACGCCATCCTGGTTCTGAATCATTTGAACCTGGTTAGCGGCTCACGATACCAAAAATCAAAAACTTCTCTGGAAAACATCCGTGCTCGTTTGCGTGAAGGTTACACCGTTGGCGACTTACAGCTGGTGATTGACCTTAAGCATGAGCACTGGAATGGCAATGACGTGCAGTACCAGTACATGCGCCCTGAAACGCTATTTGGCCCGAAAAAGTTTGAGGGTTATCTGCAAAGCGGGATCCGTTGGGACAAGAAGGGGCGTCCACCGCGTGAAAGCTGGGGTGAAAAGAAACACGATCCGATGAAGTTCGGTCCGGTTGATACCAAGATTCCAGAGGGGTTCAGAGGATGAATGAAAATAAATACTGCCGCGCGCTGGCTGAACTGCGTTCAAGACCAGCCCACGAGTTGAAAGAGGTCGGCGATCAATGGTGCACTCCGGATCTGTTGTTTTGGGGTATCAATGCGATGTTCGGCCCTCTGGTGTTGGACCTTTTTGCCGACGACAGCAACGCTAAGTGCCCAGCATGGTACACGGCTGAAGATAATGCCCTGACGCAGGATTGGTCAGAGCGTCTGGCAGAACTCGGTGGTGCAGGGTTTGGCAACCCGCCTTACAGCCGCTCTCAGTACCACGACAAGCAGGCCGTTACCGGAATGACCCACATCATTAACCACGCTATGGCCATGCGAGAAAAGGGGGGGCGGTACGTTTTTCTCATTAAGTCTGCGACGAGTGAAACGTGGTGGCCTGAAGAGGCAGATCACGTCACATTCATCCGTGGCCGAATTGGTTTCGATCTTCCTACATGGTTCATGCCGAAAGACGAAAAGCAGCAGCCCACCAGCGCGTTTTTTGCTGGCGCTATCGTGGTCTTCGACAAAACATGGCGGGGAGAGCGTTTCAGTTACATCAACCGCACCGATTTGGAGGCCAAAGGCCGTGCTTCCATGTTGCTGGCCCAGTTTGCTGTGGGAAGAACGCAAACTGATGCGGCGCCGGAGCTGGACGCTGAGGTAGTGCCGGAGAAATCAGAGGCAGAACTGCCATTAACCCAAAAAGCCATTCTGGAAACCAGTGGTGTAGAGGCTTGGGCCTGTGTTGTCGCGGCGTTCGGCGAGAAAGATGAGTACACCTTCAGCGAGTCAAAGTTTGGTCATACCTGGGCTGCCGACTCTTTGGAAAACCCTGAATTTACCAATGTTTCACCGCTGACGATCGACATAGCGAAAAAGCTGATCAGCGAGAGCATCCTGGTGGGTGTTAATGCATGGCTGGAAACATTGCCCTTTGATAGCGATGACATGAAACAAGACATGTCAGAGCGGTTACGCACGGTTGCTGTTGAGTCTGCGAAAGAATACGGCATTAACCACAGTGAATTCATCGCGACCATGGTAAGCCTGGATAAAGCCAAATGGTCAAATATTCGGAGGATCCGCGCCCATGTCCGTGAGACGCAGGAATCAAAGGACAAGGCGTTAAACGAATCGCGCGTTTGGCCTCTTGAGGTTGGACTGGTGTTTAACCAGATTGAAGGGGCTGACGCTCTACCTGTTTCACAGCAGAACAAGCTGAAAGCCAATATCAACCAGCTGTGGCTCGAACGTATGCCGACGAGTGAAATTATCACGACCGCTGGTGGTCTCTTCAACAGCATGCAGGGGGCCGTCAATGCGTGAAATTATCGTTGATAACTTTGCTGGTGGTGGCGGCGCGAGTACCGGCATTGAGCTGGCGATCGGGCGTAGCGTAGATATCGCTATCAACCACGACGAAAACGCTATTGCGATGCATAAGACGAATCACCCGGACACGCTGCATTATTGCGAGTCGGTGTTTGACGTTGACCCAATCGCAGCCACCAGCGGTAAACCTGTCGGCCTGGCCTGGTTTAGCCCTGACTGCCGCCACTTTTCCAAAGCGAAGGGCGCTAAGCCAGTTAAGAAAGAGATTCGCGGGCTGGCGTGGATTGTCCTGCGCTGGGCGCTGGCAGTACGTCCCCGCGTCATGATGCTGGAGAACGTCGAAGAATTTAAGACATGGGGCCCGCTGCTGGAAGAAGAGTTACGCCCGGATCCTGCGCGCGCGGGTGAAACATTCGAGGCATTTGTCGGCATGCTGTCGACGGGAATCGCGGCGAATCCCCCTGCACTGGCTGAGGTTTGTGAATTCCTCGCCATTGAGCCGCACGGCCAGCAGGCGCAACAGTTGATCGCCGGGCTTGGTTATGAGGTTGATTATCGCGAGCTGCGCGCGTGTGACTACGGCACGCCGACGATCAGAAAGCGTTTCTTCATGGTCATGCGCTGTGACGGCCGCCAGATTCATTGGCCTGAAGCGACCCATGGGGATCCAAAATCACTGGAAGTACAGAGTGGCAAGCTAGCGCCATGGCGTACCGCGGCGGAATGCATTGACTGGAGTATCCCGGCCCGGTCCATATTCGACCGCAAAAATCCGCTTGCGGAAAATACGCTCAAACGTATCGCGCGGGGCATCCAGCGTTTCGTTATCGAGAGTTCTTCGCCGTTTATCGTTAAGTGCAATCACACCACATCACACGGCAGGTATGACTGTTTCCGTGGGCAGGGGCTCGAGGCTCCTTTACAGACTATCACTAAAACACACGGCTACGCGCTGGCGGTACCGCATCTGACTAAATTCCGGACCGGGGCCACTGGGCAGCCAGTAACCGAGCCGGTACCAACCGTCACCGCAGGTACGTCGGCGCGCCCGGGCGGGAATGGGCATGCGCTCGGCGTAGTTGAGGCTGCCCTGACGCCGTTCCTGGCTGGCAACGGTGGCAGTGAGTACCAGGCAAAGCCGCGCCCGCTGGATAAACCCGCTCATACAATCCTCAAGCAATCCCGCGCGTGCGTGGTTGCGCCGGTCATCGCCCGCCAGTTTGGCGCCAGTGTTGGACACAGGGCTGACGAACCGAGCGCGACGATTACTGCAGGTGGTGGCGGTAAGTCGCAGTTGGTAACTCCAACACTGATCCAGATGGGGTACGGCGAACGCCCAGGGCAAGAACCGCGTGTTCTTCAACTGAATAACCCGCTCGGCACGGTCACTGCTGGTGGTAATAAGTTTGCAACGGTGAGCGCGTTCCTGGCGAAGCACTATGGTGGGAATTACACGGGGCCGGGTGTTGGTATGGATGAGCCTGCCCACTCAGTCACTACTGTTGATCATCACGCGGTAGTTGCGTCTCACCTGGTGAAGCTGCGCGGAACCTGCCGCGACGGTCAGACCATGGATACACCTATGCCGACGATTACCGCTGGTGGCCAGCATGTTGGCGAGGTCCGGACATTCCTCGAAACATACTGCGGTGATAGCGAGGATGAATGGCTGGTGACGATCGAGGGGGTTAAGTACCAGATCGTCGATATCGGAATGCGCATGCTGCAACCGCATGAGCTTTATAAGGCGCAGGGCTTCCCTGACGGCTACGTTATCGATCAGGACTATCGCGGCAATCGTTACGCCAAAGACAAGCAGGTAGCGCGCTGCGGTAACGCAGTACCGCCGCCGTTCGCTCGTGCGCTGGTAGAAGCAAATCTTCCTGAATTATGTGCAAATCAAAAGGCGGGTGCAGCCGCCTGATATGGAGAAATAGCATGAATCAGTTAACCGCAAAGGGTGTTGTGACAATGTCCAGCCGTGAAATTGCCAGGCTGGTGCAGAGCAAACATGGTGATGTGAAGCGCTCAGCTGAGCGCCTTGCATCTGCTGGTATTTTAACCGCGCCGTTGGCGCACACCCCCTACACACACCCGCAAAACGGGCAAACCTACGAAGAGTATTGGTTCAACAAACGTGATTCTCTGGTGATCGTCGCTAGGCTGTCGCCAGAATTTACCGCCGCTGTTGTCGATCGCTGGCAAGAGCTGGAGAACAGCCAGGCCGTAAGTGTCCCGCAAACATTGCCGGAGGCATTACGTCTCGCCGCAGATCTGGCCGAGCAGAAAGAACAACTCAGCCAGCAGTTAGCCGCTGCCGCGCCGAAAGTTGAGTTTGTCGATCGGTATTGTACTGCCAAAGGCTCAATGTCTTTCCGCCAGGTGGCAAAGCTGTTGCAGGCCAAGGAAACAGATTTCCGCTTGTTCCTCATTGAGAGCGGCATTTTGTACCGGCTCGGCGGTGTGCTGACACCGCGGCACCAGCACATTGCTGCCGGGCGGTTTGAAGTTAAAACTGGCACTTCGAGCGAAACTAACTACGCCTTTAGCCAGGCACGCTTTACACCCAAAGGCATCGAGTGGATCGGCGGTCTGTGGACGGCACACATCGCTAAGGAGCATGCCGCGTGAGAGGACTGTTTACAGCCGAGACTGTACCGCGCCTGGGGCTTGTGGTTTTAAAGCCGGGTAGCGAACTGATGTCTTTGTTTCAACAGGGGCGTGTGCTGGTGGAGCCTCAGCCAAAAAGTATGGCTGGGCTTCCGTCGGGGCTCGTCCCTTATGCCAGGCAGCCGCTGGCAGAAGATAAGTCCCTCGAGGAATTCTTCACCGACGAGAGAGTTATCCGTGCAGCAGGCGGTTTGACCGCGTTGGAATCCTGGTTAGAACGTAACGTGAAGGAATGCCAGTACCCGCACACTGATTATCACCATCATGAGCTGGTAACGATGCGACATCCCCCTGGATCAATGTTGCTCTGTTGGCATTGCGATAACCAACTGCGCGATCAAACCACCGCGGCGCTGGCAGAACTGGCCCGGCGTAATCTCATTAACTGGCTGATCAGTTCCATCCAGTCATCGCTTGGCTATAACAACGAGCGTGAATTATCCCTCGGTGAATTGTGCTGGTGGGCCGTTTATTCAGGCATTGCTGATGCAATCACGGAAAGGATGGCCCAGCGTGCGCTTCGCTTACCGGACGAGCCGTTTTTATCCGTATATCGAGAAAGTGACATTGTGCCGATGCTCCCCGCAAAAAACATTTTGCAGAAGAAGGTCACCCCTGCGCTCACGGCTGCGAAATTAAAGGATGGAGCAAATCAGGAAGTGGCCTATGACCAGCCAAAGGTTTTGGCTCTGCATGCGGATCCTGAATCCCCTGAATCATTCATGTTGCGCCCAAAACACCGCAGGTGGGTGAATGAGGACTATACCCGGTGGGTTAAAACCCAGCCCTGTGAAGGTTGCCGGCGGCCAGCGGATGATCCACACCATGTCATTGGTCACGGCATGGGCGGTACCGCCACTAAAGCCCACGATTTGTTCGTGATCCCTCTGTGCAGAGAGTGTCACGACAAATTACATGCTGATGTTGCAGCGTTCGAGAAAAAAAACGGTACTCAGCTGGAGCTGCTATTCCGGTTTATGAATCGAGCGCTGGCGATCGGCGTAATAACAAAAGCGTAATTGTATGGAGCGCTGAGCATAATGAATTTACAAGAACTGGAATTTACGCGGATTGAACTGCGCCGCGCGCTGGCGGATTTATCAGGATCGACAAAAGGCCAGCTGCAGGCGTTTAGTGAGCATCCACCAGCAGATAAGAACAAATACCCCCGGCACCATCCTGAAATCGTCATGGAGGGTGGGGAAGGTTGTGGGTCAAAGGTTGTAAAAACGCTGGCCACTCCACTATATGTTCTTGAGACAAGGAGTCGTCGTCGACCTTTACCGCCTATTAAGGATACGGAGTTCGCCTGTTCAGCATGGCGTCGGTCGGTGAATGGTCTGGGGGAGCATTTGCAGGCATGGGTGCGGTATTGCTATGGGTATGACCTGACCTTCCGGTACCAGACGTTAATGTGCCAGTACGTGTGGGAACAGTTTCAGCGTCAGCATAGCGGCAAACAAATCCAGGGCCGTGTAACTAAAAAACTGATAGGGCTTGTCTGGCTGGCGGCGCAAGAAGTTGCTGCCTCACGTAATAACGATACCTATCAGGAGTATGCTGGTGCAGCTCTGGCCCGCATGGTCAGCGTAGAGCGTTCAACCTGGCTCAGAGTGTATTCAGGGCACTGGGCGGCCTTTAAAGCGTTGTTTGCTGAAATGGACAGTCAGGCACTAAGCGAAATTTTGTCACGGTACGAAGAGTTCCAAGAACTGAAAGTGGCGGAAATGTGAGGTAACTTTCACTAACTACCTCAATTAGGCTTGCAAAATGCAACAAAATGAGCGATATTTGAAGCTAATTTAATAAGTTGCCAAAAGTATATAAACCCGCCAGTCTGCGGGTTTTTTTCGGATTTTCTTCATATGGCATTGAAATACTAGTTGCCGAATAGGTAACATTTAGGTTTTCGTCTCGTCTGGATCGACATGATGTTCCAAGCCTTCCAAACAGCAGTATTTAAAAATGTTGCAGAACACTTGAAAGGACGCTTTCCGCATAAGTCGCTGGAACTCGTATATGCAGATGAGCTGATCCTGAAAGATATGGAATTCCTCAAAACCAACAACAAATTGCGTTGGGATCCGGGGCTAAAATCGCGAGTGTTTATCGATATGATGGAAGAACACCCGATTAAGCTTGTTGTGTACTATCGCGGTGAGCCAATTGGATTTGCATTTGGATGCTACTACAAACCGAAAAACGCAGTACATGTTTGCTGGATGGAAAAGCGTAACGATGCGCATGAAGATTTAGATCATCAAATGCTTGGTATCGTTTTGGATTGTTTTGCTGCTTACGCACAATTCCTCAATCATCAAGGCGAAACTATTGATACCATAGCTCTAGTCAGTCCAGTTGATGGTGCAATGAGGTACTATACTGAAAGTGGTTTTGAGTACATTGCAGATTATGAACGAGGTGGGTGTGCGATGGTTCTTAGGAACACTTTACAGAGTAAGTAATGTGAGTAGTGTTCTTAAATTGACTCGGTTTATCATAAAAGTATTGAAAACCACATCTAGTTGATGCAATCTGAGCCAACGCAAAGCACATCAAAATGTACTACGCTTTACTTTACACGTTTTAAAGTAACTAACTTAGTTAGCTACATGCTTCAAAAGAAGCAATCACAGCCTAGTGCTGCATTAAGGGGACTTCAATGAAACATCATGAGCAGATCGAGATCGAAGCAGCTAAAGTCGTTGCCGAACTCTTTGCTGGTAACGCCTCTCCAATGGAATCTTTTGGTATTACTTGGAGCCAAACTCAGATGTTAGAACGTAAGAATCCTGGAGTTGTTATCAAACTGACACCAGATGATGGCAGAAAGCTTGCATATTGCTAATATCTGTTGCTAGCGTTTAACGTTAACAAGTAGTTAACTCCATACAAATTTTGAAAACCTCGCCACGGCGGGGTTTTTTCGTTTCTACAGGGTGCTTACAAGCGGCCTTCCGGATCTCGCGCCCCAGTGATTGGGGTTTTCTGCTGTGAAAATGGGCGGCTGGTGGGTGTTGTAGCACCCAACCAGCCATTAGCTCATGCTTCAGGTCACAAGCTAACCAAGGCCCATTGCTTTAGCGCAAAAGCATAGTGAGCCTATCAGAGTTACGCTTACGGATCTATGAAAAATACTGTGAATATAAACAGTGTTGAGCTTATCAACGCTGACTGCCTGCATTACCTCGCAACCCTCCCAGATAACACCATTGACCTTATTGTTACGGATCCGCCTTACTTTAAGGTGAAGCCGAACGGCTGGGATAATCAATGGAACGGTGACGCCGATTATCTTCTCTGGCTTGATATGTGTCTTGCACAGTTCTGGCGAGTGCTTAAGCCTACCGGGAGCCTGTATTTGTTTTCTGGTCACCGCCTTGCATCCGATATTGAGATCATGATGCGTGAACGCTTCAACGTCATGAACCACATTATCTGGGCGAAGCCATCAGGGCGCTGGAACGGATGCAATAAAGAAAGCCTGCGCTCTTATTTCCCCGCGACGGAACGCATACTTTTCGCAGAGCATTATCAGGGGCCATATAGGCCGAAAAGCGACGGGTTTGCTGAGAAAAGCAACGAGGTCAAACAGCACGTCATGGCCCCGTTAATCTCCTATTTCCGGGATGCAAGAGCTGAATTGGGGGTCACGTCCAGGCAAATAGCTGACGCCACCGGAAAGAAAAACATGGTGTCCCACTGGTTCGGGGCCAGTCAGTGGCAACTACCGAACGAGCAGGACTACGAAAAGCTGCAGGAATTGTTCACTCAGATCGCCATTGAGAAGCACGGCGCCTCTGAACTCAAAGCACCGCATCACCAGCTGGTAGCCACATGGCATTCGTTGAACCGGAAATACCTTGATCTGCTGGAAGAGTACAAATCTCTTCGGCGGCATTTCTCTGTGACAGTAGCCGTGCCCTATACAGACGTCTGGACACATAAACCCGTCCAGTTCTATCCAGGCAAACACCCGTGCGAAAAGCCCGCTGATATGTTGCGGCAAATCATCAACGCCAGCAGCAGGCCCGGCGATGTGGTAGCTGATTTCTTTATGGGCTCAGGATCAACTGTTAAAGCAGCCATTGAACTGGGCCGCCAGGCTATCGGCGTAGATCTGGAAGAGGAACGTTTCAACCAGACGGTAAGTGAGGTAAGGCAGCTGGCAGGGGAATAAAAGCTTGGGTCGCTATCGCGGCCCTTTTTATTACCTCAACTGGACACCCGCAACGTAGCGAGGTGAGAGCATGTATCGAATGGAAAAAATCACGACGGGTATTGCATACGGCGCATCGGGAGGGGGAACCGGATACTGGTTGCTTCAGCTCCTCGATAAAGTCTCCCCATCTCAATGGGCGGCCATTGGTGTGCTCGGTAGCCTCATGTTTGGTTTGCTGACGTGGTTAACGAGTCTGTACTTCCAAATCAAAGCGGATCGCCGCAAAGCTGCGCGGGGTGAATGATGTCGAACAAAGCAAAGCTCAGCGCAGCAGTGCTGGCGCTAATCGCATCAGGGGCATCTGCTCCACTCATTTTCGACCAATTCATCAGCGAAAAAGAAGGCAATGCGCTGGTGGCCGTTGTTGATCCGGGTGGGGTCTGGTCTTTATGTCACGGCGTGACCGTTATCGATGGCAGGCGTGTTGTTAAAGGCATGACGGCCACTGAGGAACAATGCCGGAAGGTTAACGCTATTGAACGCGATAAGGCATTAGCCTGGGTTGATCGCAATATCAAAGTGCCTCTGACAGAGCCACAGAAGGTGGGCATCGCATCCTTTTGCCCGTATAACATCGGCCCCGGTAAATGCTTCCCATCGACCTTCTATAAGCGTATCAACGCAGGTGACCGCATCGGTGCATGCGAGGCCATCCGCTGGTGGATTAAGGACGGTGGACGTGATTGCCGTCTGACTAAAGGCCAGAAGAATGGCTGTTATGGGCAGGTCGAGCGGCGCGATCAGGAAAGTGCACTGACGTGCTGGGGGCTGGACCAATGAAAATTAACCAGGGTCTTATAGGCGTTGTCGTCATTGCTGTCCTTTCGGTCGCTCTCGTTAAGAGCTGCTCCGATGCCAGTAGCCTTCAGAGCGATAACGACGTTCTGCGAAGTGACAACTCTTTGCAGGGGCAGGTGATCGCCACCCAGGCATTCAACTTCAATCGATTTAATCAGGTTGCAGAACATGCCAACAGGCTTAATTCCCTGATTGATACCAGGACCGAAGAAACCGTAATCGAATATCGGGAGATTCTCCGCCGTGAAAAAACCTGTGATCTGCCTGTTCCTGCTGACATTGCTGGTGGGCTGCTCGAATACGCGTACCGTTTACGTTCCAGCGCAATGCACGCCGATACCGACGGAACTGACGCAGCCGATGATAGTACCGCTGCCGCCCGCTCAATAACGTACTGCCAGGCTGTGCTCTGGATTAAGCCGTTGCTGGCCGTAATAGAGAAGGGCAACAATAACTTTGCTGGCATTCGTCAAGTAGAACAGGAGCGACAATAGGTATGGTCAAATCCTAACGTGAAAATCTGCAGCAGCATTTCCATAATATTGGGCTGTAAATTACCTATCACTATAGGTTTCTTGCGTAATCAGGTAATAGACCGACCTCATAAAAAGAGGTAAATTATTACTTTACGTTGCGCTCAGGTAGACGAGATGGAAGAACGTCATCACAGCTATGTTCTTAAAACTATTGAAGAAATAGGAAGGGGTGGGTTCGGTTACGTTGAAAAAATTGAACTTTTCAATGTTAACGGTCACAAATGCGGTGACTACGCCAAAAAGATTTTGGCGCAGGATCATGGGCTTAGCAAAGAGGACTTCAAAAGAAGATTCAAACGGGAAGTGGATTATCAAGCTAGGTGTACGCACTCCAATATAGCGCCTATTTATCTACATAATCTTCAGGTCGATAGCCCTTGGTTTGTTATGGATCTCGCTGAAAGTGATTTAAGTACCGATTTAGCATCAGGCACGCTCGATAATGCAAGTAAAATGCATATTGCAGAAATGATCCTTTCAGGGGTACGTTTCATGCATACAGAGAAAACTGATGATCCAGGACGTAAGCCAGTATATTTACACCGTGATTTAAAGCCATCTAATATTCTTCGTTTTAAAGATGGAGTTTACAAAATTTCCGATTTTGGTTTAGTAAAAAATGCAGGAAAAGAAAGACCAGAATCTGAGCTTTTAACAAGAGTCGCTACAGCCATGGGAACGCTCAAATACATGGCTCCTGAAATAACAACAGCTGGGCATTACTCAGAGCAAACTGACATTTTTGCTCTCGGAGTAGTTATTGATGATATGGGGTTTGATAACGTTAATGGTATTAGACAGTTAATCGATAAATGCACCGCCTGGAGAGCTGCAAGCCGATACAAATCGGTTGATGATATGATGCAGGAACTGGCTGATATAAAAATAAGGAATGGGCTATGATTACCCTACTTTCAAGTGGTCTTTTCTCTTACGCTAAAAGTTCTTCGCGTACTAATCAGGATTCTATTCTTGCTCCACAGTGCATCGATAATGGCTATTTGATTGCTGTCGCTGATGGCGTTGGCTCGTACTTGGGCGCAGAACACGCTTCACAAACAGCAATTAAATATTTAGCGAACCTAATCAATGCTTCTTCTATCCAAGATCTTGATAGCCTCTTCGCCACAATAAAAGAAAAGATTTCTGCGTTATCTGACGCTGATGAATCCTATCTAGAAGCTGCAACGACGTTGACGTTTGCATATGTAAACAATCAAGGTTTGTACATTGGTCATGTTGGTGATTGTCGTTTGTATATTAAAAAAGATAATAAACTCAAACAACTAACGAAAGATCATACCCAACATCAAAAACTGTTGGATCAGAAGATTTTCAATAAAAAAGAACTTAAGGATATGGGGGGGAAAAACACCCTAACAACTGCGATTTCTAAAGTTATCCCATTGGAATTTCAACAAACGTTTATTCCTGCGTCTGAAGTTTTCGAAGACAGCGAAGAGGCCACTCTCTACATTCTTTCAGATGGAGCACACCACTTCTGGGATAAGAGACCAAGGTTCTCAATCACAACCTTGAGCAATCCAAACAGTTTTGCAGCTAGCTTGCATAAACGAATTATTCGTCATGGACCTATAGATGATTTTTCTTTGGTTGTAGCCAAGTTCAAGCGCAGCAGTGTTAACTAATGGTGTGGTTGAACTTGTAAGACTTACGGGTCAATAAATGTTCGGGCATACATAGTTCGTGTTGAACACAAGGGCTGAAGTTCTGATTACATAGTTGAACAAACTCCATACTAGGGCCACCAGCAATCACTGGTGGTTTTTTTTCGCATCGCACGCGCACATCAAGGAAAGTCTTTCAGATGTGAGCCCGGGCAGACCGTTAACTTTCGGCGGTTTCGCCTTGCGATAGGCGCAATATGGGATCAACAAAGAGCGCGTTGATGCTGTCCCGCAGGGCAGGGGCCGCGGGTCCTTTCCGGCTATCTGACATGTTACGGGGCGGCGACCTCCTAAGCTCTCGCTGTTCATGAGGTTCTGACAGCAACCTTTGCTTCCTTTCTACTTGTGATTATGTCTTACCCCTTACTACTGATAACTCAGCAATTATCCCCACCAACGAGAAAAGCGAGTCTCTCTGGAATCGCTATATATGAACAAACTCAAACATAAAGTTGGTCGTTAGGTATGCCCCCACGTATCAAAAGGCCATGCAGGCACAAAGGTTGTGCTGCTCTGACAAATGATCCAAGCGGCTATTGCGATGAACATCGGCAGCAGCATGCCGGTGAAGGGTGGCGCAACTATCAGAACGGTAAAAGCCGACACGAACGTGGATACGGGCGTCCCTGGGAAGTCAGGCGTGCACGTATTCTCCAGCGAGATAAACACATCTGCCAGGCGTGCCGACGCGTTGGCATAGCAAGACGTGCGAGTACCGTCGACCATATCCTCGCTAAGGCTCATGGCGGAACGGATGATGATTTCAATCTGGAAGCATTGTGCTGGCCATGCCACAGAGCCAAGACCGCAAGAGAACGTCTCAGGTGAAACTCGGTCGCTCAGCGCATGGGGAGGGGGGGATAAAATCCCAAACCCCTTTCGCTTTTAAGGACTGCCGCTCCCGGTAGTTTTTTGCGCGTGAGAAATAAGAATTTTTTTTTTGATGATTTTTGAGGTGTTTCGCTATGAGTAACGGAGTGAGATCGCCAGGGGGAGGTCGTAAGCCGAAGAAGACCGGAACGCAGGTAAGTTCTCTGACTCGAGCAGTTTCACCGCCAGATGAACTGCTGGGTGAGATGGCGATCGATGCCTGGAAACGAACCTGCAAAATTCTGATTAACCGTGGTTCGTTCGAAATGGAGGACTGCTATCTGCTGATGGAATATTGCAACACGGTGCAGCTCCTTTACGACGCGAACCAGGAAATAAAAGCTGATGGGATTGGGGATGAAACTGCTGCTGGTGGGCAGAAAATGGGAGCCGCAGTAAAGGCGCGGGATAAGTATATCTCACAGCTTATCCGTCTTAGCGTGGTTTTGAAGCTTGATCCCAACAGCAGAGCCAGAAAACGCACGCCGGGCGAAGACAGTAAATCCGGCAATGAATTTGACGAATTTTGATTGGGGCGATGTTCCCAATTTTTAGGGACTTATTATGGCCGCGTACCCGAGCGTCAATATGGCGAACCAGTATGCGCGGGATGTGCTGAACGGGAAAATACTTGCCTGCAAGAGCATCCAGCTGGCATGTCAGCGCCATTTTAATGATCTGAAAATTTCTCTCGATAAGGATTATCCCTACCGGTTCGACCGTGAACTGGCGGAACGCGCCTGCCGTTTCGTTCAGCTTTTACCGCATTCCAGCGGTGATTTAGCCGGTCAAAAACTGAAGCTGGAACCCTGGCAGGCATTTGCATTCAGCTCGATTTTCGGCTGGGTTACGAAAAAGACCAAAAAACGCCGATTTCGCGAAGCGTATATCCGGGTGGCCAGGAAAAACGGGAAGTCGTTTTTCGCGGCAGGTATTGGCACGTACATGTTCTGCGCTGACGGTGAAAACAGCGCGGAAGTGTACTGCGGGGCCACCACGATGGCGCAGGCGAAAAAGGTCTTCACCCCAGCCAGGCAGATGGCAGACCGCCTTCCGTCGCTCCGCTCAAAATTCAATATCTCGGTATGGGTGGACAGCCTGACCCGTCCTGACGGTTCGCTGTTTGCACCCATCGCCGGAAAGCCTGGCGACGGCGACAGCCCTCATTGCGCGATTATTGATGAATACCATGAGCACGATACGGATCACATGTACGAGGCCATGACGCTGGGTATGGGAGCACGTTCGCAGCCGCTGACGCTCATCATTACCACAGCAGGTACGTCGCTGGAATCGCCATGCTACGACAAGGATAAGCAGGTCAAGGAGATGCTCAACGGGCATGTGCCTAACGAGCGCCTTTTTGGTCTGATTTACGAGCTCGATGAAGGGGACGACTGGACTGACCCGACCAACTTCATTAAAGCGAATCCGAACCTCGATGTGTCGATATCGTTTGACGATCTGCTGGCGGAGATGGAGGTCGCAAAACAGGTTCCCCGTAAGGTGAATGCCTTTAAAACGAAGCGCCTCAATATCTGGGTATCGGGCAAAGCGGCGTTCTACAACATGACGCAATGGCATGCTGCCGCCGATAAATCCCTGCGTTACGAGGACTTTGCCGGCGAGGATTATTACCTCGGTCTGGACCTTGCCCAGCGTCTTGATCTTAACGCTGGTGTTGGCGTTTTCGTTCGCGAAATAGAGGGGAAGAAACACTACTACTGCATCAGTCCGAAATTTTGGGTACCGGAGGACACGGTCCGGAGCACTGATCCGAAAATTGCCAAAACTGCCGACCGGTATGTGAAGTTCGTCGAAATGGGAGCGCTTGAAGCGACAGATGGAGCAGAAGCGGACTATCGCGAAATCCTGGCCAGCATCATCGACCTTCAGGAGATTAATAAGGTCCGCATCAGCGAGATCCCAATCGACCCCAGCGGTGCCACGGCACTAAGTCATGAGCTGCAGGACCACGGGTTTGAGCCAATTTCTATCCGGCAGGACTACACCAACATGTCGCCGCCTATGAAGGAGCTGGAAGCGGCGCTCGCTGGCGGTCGTTTCCATCATGACGGGAACCCGGTCCTGTCATGGTGTATTAGCAACGTTATCGGAAAAAATGTCCCCGGAAGCGACGATATTGTCCGACCGACGAAGGGCGACAAGCAGTCAAAAATCGACGGCGCGACAGCGCTGTTTATGGCTATAGGCCGCGCAATGCTGAACGGTCGGGCCAGCAATCAATCCGTTTATGATGAGGAAGACGTCGCATGTTAACGGCAATTATTACCTTTATGATCGGCCTGTTCGGCGCGGCGCTTATCTCGTTTGGCGCGTGGATGGTGTTTCCGCCTGCAGGCGTTATTGCTGCAGGCTTGTTTTGCCTTCTGGCATCCTATTTTGCTGCCAGAGCCGCTGCGCCTGCGAATGATTCTCCAGGGGGTAACTGATGTTCATTCCTCAGTTCTTCCGGGGCAGGTCGCGTCCGGGAGGGAGTAACTGGACAACGGTTCTCGGGAGCGTCAGCGCCAGCAAGAGCTCATCGGGCATGCTGGTTACGCCGGAAACGGCAATGGGTATCGGGGCCATACGCGCCTGCGTGACGCTCCTTGCTGAATCCATCGCCCAGCTGCCCGTCGAGCTTTATCAGCGCGACGAAAAAGGCGGTCGGCGCAGGGCAACGGATCATCCCCTGTACGATGTGATCCATTCGCAGCCAAACAGAAAGGACACCAGCTTTGAGTATTACGAACAGCAGCAGGGCGTGCTGGGGCTTGAGGGGAACAGCTATTCCCTGATTGACCGGCACGGCAACGGCGATATCGCTGAACTGATACCGATAAATCCCAAAAAGGTCATCGTCCTGAAAGGGCCGGACGGGATGCCGTATTACGAACTGCCTGAGCTGGGTGAAACGGTGCCGATGCGCATGATGCATCACATCAAGTATTTCTCGCTCGACGGGTACATCGGCACCTCACCGATTCAGACGAACGCGGACGTTCTCGGGCTGGGCATGGCGGTTGAGCAGCATGCCGCGCAGGTGTTCGCCCGTGGCACCACGATGTCCGGCGTGATTGAGCGCCCCAAAGAGGCGGGAGCCATCAAGAGCCAGGCGTCAATTGACAAGCTTCTGGCCAAATGGACGGACCGCTATTCCGGAGTGCGAAACGCCTTCAGCGTGGCGTTGCTGCAGGAGGGCATGAGCTATAAGCAGCTGTCGCAGGACAACGAAAAAGCGCAGCTGCTGCAGTCGCGCCAGTGGACGGTAAACGAGGTGTGTCGGCTTTACAAAATCCCGCCGCACATGATTCAGCTTCTCGACAAATCGACCAACAACAACATCGAGCACCAGGGGCTTCAGTACGTGATGTATACGCTGCTGGCCTGGCTGAAGCGCCATGAAGCGGCGATGATGCGCGATTTGTTGTTACCCAGCGAGCGTCGCGACTTTTACATCGAGTTCAACGTCTCGTCGCTGCTGCGCGGCGATCAGAAATCGCGTTACGAGTCCTACGCGCTGGGCCGCCAGTGGGGCTGGCTGTCGGTAAACGATATCCGGCGCATGGAGAACATGGCCCCGGTAGATGGCGGCGACAAGTATCTGACGCCGCTGAACATGGTCGATACCAGCACCGTTCACGGGCTGGATAAAGCCACCCCCGCGCAGATAAGCGAAATCAGCGCAATCCTGCAGCGAACTGCATAAAACCTGATTATCAGGCTCTCACAGGTATAAAAATGTCGAAATTAATCAACCTGCCGCACCTGGCTGACCAGGTGTTCGGGGTACCTCACTACGCCACGCGGCAAATCATGGACTCGGTGAAGTCGATCCTGGTTCCTCGTCTGCAGGGCATGAATGTGGCCCCGCTGGAAATGGCCCTGGGACCGGATGAGTCACAGGAGGCGAACGAACCGCAGCAAAGTGGCGGCGGTGTGGGCGTTATTCCCGTTCACGGTATCTTGGTACCCCGGCGTGGCCAGATCGTGAATATGTGTACGGAGCTGAACAGCTACGAGCGCATTCGAGGCCAACTGGCCGCCCTGCTGAACGATCCGGGCATTAAAGAAATCGTGCTCGATATTAACTCTGGCGGCGGCGCGGTATCGGGTTGCAAAGAGCTGGCGGACTATATCTATCAATCGCGCAGCGTGAAGCCCATCACGGCCATCGTGAACTTCAGCGCGTTCTCTGCGGCGTACTTTATCGCGTCGGCCTGCAGCAAAATTATCGTCAGCGAAACTAGTGGCGTGGGCTCTATCGGTGTCATTCTGGAGCACATGGAGGCGTCGAAATGGGAAGAAAGCGTGGGGCTGAAATTTACCACGTTCTCACGCGGCGATAACAAGAACAACGGCTCCCCGCATGAACCGCTGACGGAGCTGGCCACGGCACAGATACAGGCGATGATCGACGGCGCGTACCAGACGTTCACGTCCTCCGTCGCGCAGTATCGCGGCATTGATATTGACGCCGTTATTGGCACTCAGGCTGCGCTGTATTTTGGTCAGAACGCCGTCGCGGCAGGACTGGCAGATGAGATGTCCGATCCTCAGTCAGCCATCAACGCGATTGTTGCGAAATACAAGCCCTCACCCCAGCAATCCAGTATCCAGTTACGTGCCGCTGTAATGGATCAGCAGGCCCGTATGTAACCCGACGCAAAGCGTCACCGTAAGCAGCCAGATGGCTGCTTTTTTTATGCGTAAAAGAGAGAAAAACGATGAACAAAATCGAAGAACTGCGTCGCCAGCGTGCGGGTATTAACACTCAGGTTCAGGCCTTGGCACAGATTGAAATTAACGGCGGCACGCTGAGCGCGGAGCAACTGGAGCAATTCACTGGCCTGCAGGCTCAGTTTGATGAGATTTCAGCGTCTATTGAGCGTCTGGAAGCGGCAGAACGCCTTGCCGCCACCACTGCGGTTCCGGTGAAGGTTGCGCAGAACGGTCGCAATGCACCGGCTGTGCAGGTGAAAGCTGAACCGGATCAGTACAAAGGCGCAGGCATGACCCGCATGGTGATGGCCATCGCGGCGGGTAAGGGCGATCTGCAGCAGGCCGCTTCGTTCGCTGCGGAAGACCTGAACGATCAGGGGCTGTCGATGGCTATCACGACCGCAGCCAATTCAGGCGGCGCGCTCGTTCCGCAGAACATGCAGAACGAGGTGATTGAGCTCCTGCGCGACCGCACCATCGTGCGTAAGCTCGGGGCGCGAACTGTTCCGCTGCCGAACGGTAACCTGGCGATCCCGCGACTGGCCAGCGGCTCAACGGCAAGCTATGTCGGTGAAGGCAAGGATGTGAAGGCGAGCGGTGCGACCTTCGATGACGTCAAACTGAACGCCAAAACGCTGATCACCATGGTGCCGATTTCCAACCAGCTGATTGGTCGCGGCGGCTTCAACGTCGAACAGCTGATTTTAGGCGACATCATCAGCGGCATTTCCACCCGAGAAGATAAGGCGTTCCTCCGTGATGACGGCACCAACGACACCCCGAAAGGGATGAAAGCGGTAGCTACAGCTGGTAGCCGCACGCTCCCATGGGTGGCGGACGAAGAAGTGAACCTGCAGACCATCGATACCTACCTTGATGCGCTGATCCTCATGGCGATGGACGGTAACAGCAACATGCTGAAGTGCGGCTGGGGTATGTCCAACCGCACCTACATGAAGCTGTTTGGCCTGCGCGACGGGAACGGCAACAAGGTGTATCCGGAAATGACAGTGGGTAACCTGAAAGGCTATCCGATTGAGCGCACCTCGGCTATTCCGGCGAACCTGGGTACAGGCGGCAAGGAGTCGGAGATTTACTTTGCGGACTTCAATGATGTCCTGATTGCTGAAGACGGCGCAATGGTTGTCGATTTCTCCCGCGAGGCGACCTACATCGATGCAGACGGGAACACCGTTTCCGCGTTCGCGCGTAACCAGTCCCTGATCCGCGTCATCATGGAGCACGATATCGGTTTCCGCCATATCGAAGGCCTGGCGCTGGGTACCGGCGTTACCTGGTAATACTCCGACAATCGTGATTAACAGCCCGCCTCGCGCGGGCTTTTTTACAGGTGAACATTATGGCTACGAAAACCAAAAACACTCAGAAAGACGATACCGCCACCGACGCCAACGCCGAGCCAGCGGTAACGACCGCAGCGGCGGCGGATACTTCGGCACCGGTACCAGACGTTAACGCCGGTTCTGCAGGCGATGCCGGTGGTGATGGTGATGGTACCGAACCCGGTCCGGACGGCGACGATACGGATTCAGGTGGTGATGCGAAACAGGACGAAACCCCAGAGGAACGTATGTCAAAACTGACTGGAAAAGTCGCTTCGGTACAAAACGGACGTGTTGCGGTGACGTTCCTTGGCCCATTCAGCCGCTACAGCCGTGGCGATGTGGCCTGCTTTGACCGCCCCGTCGCTCAGGACATGGTGGACCGAAATATCGCCGTCTGGGTAAAAGACGCAGAACGCGCCCTTCAACCGAATAAGGACGATGACGCGCATGATACTGACATTGGCTGAGGCCAAAACCCAGTTGCGCCTCGAGCTGGATTTTGATGAGCACGACAGCCTGCTGACCAGCCTGATTGATGCGGCTCAGCGCAGCATCGAGCGCAGCTACTACTGCAAGCTGGTAGAGAACCAGGCGCAGCTTGACGCACTGCCTGACGGTGAGACGGGTTACATCATTGATGAAGATATCAAGCTGGCCGCGAAGATGATGGTCTCGCAGTGGTATCTGAATCCCACCGGCACGGCAGAAGGTTCGCCGTCCGATTTGGGCGTTGAATACCTGCTGTTCCCGCTAATGGAGCATACCGTATGAGTGACCCCCTGCGCCCCGGCGAGCTGAACTGCCGGATAACACTCAGCTACGTGGAAACAGAACGCGGCGAGCTCGGCGAGACGCTTCCGGCCAGAGAGGTGATCGCCGGAAATGCCTGGTCCAAAAAGGAGCTGGTCTCCGGTCGGAAGGTCCGGACGCTGGACCAGCAGCAGGTCGTCGAAACGTGCCTCTTCACGCTGTACCCGCGCGAGGTTGACGTGGACTGGAAGGTATCGACAGCGGACCGGGTATATACCGTTCGCAACGTCGAGCGCCTGACGGATCGGATAATCATCACCGGAGAGGCGGATTCACGCCATGATCGAGTCAGCAATTAAAACCGCTGTCGAGCGGATCACCGGGCTGGATACGTACCCGCTGCTGCTTCCGGATACGGTGCAGGAAGGCGCGACGTTCCAGCGTATTTCCGACCCGCAGGTCGGTGACGGACTGAGGCGGACCGGGCTGTCCGCGGTCCGGATACAGCTTTCGCTTTATGTCGTCGACCGGTACACGTCACTGCTTCAGTTCGACGGGGCGCTCTGGGCCGAATGGAAGGGAATTGTTCATGGCCTGCTGGAAGGTCAGCCCGTTCAGTACGTTGAGCGCGGAGGCATACAGCAGGGGAAAACCACGCTTCCCAACAACCGCATCCAGTTCCGGCTGGTTCGCGACTTCATCTTCACCGTTCCGGAGTAAACACCATGCAGATGGACATTAAATTCCCCACCGGGAAGGAGTTCGATCAGCTTCTGGAAAGCATCGAGAAAAAAGTCGGGGTGAAACTTCTGCGCGATGCCGGACGGGCTGCGCTTGCGGTCGTTGAGCAGGATATGCGGCAGCATGCCGGTTTTGATGAAGAAAGTATCGGGCCACACATGCGCGACTCCATCAAAATCCGCAGTACCAACGTGGCAGAGACCTCGCGCTACAACACCATCGTTACGCTGCGCGTCGGTCCCAGCAAAATTCACCACATGAAAGCGCTGGCTCAGGAGTTCGGTACCGTCAAACAGGTGGCCGCCCCCTTCATTCGTCCGGCGCTGGACTACAACGTTCAAAAAGTTCTTAAAGTGCTTGCCGCAGAAATCCGGCTGGGGCTCGAAGGGCGTTAGCAATCAGGAGAGAGTAAATGGCAGATCAAGAAATTAAATCCCCGTCAGAGTACGCGACACTCCCTGCGGGGACAGAGGTTCGCTACGGTCAGAAAGGCGCAACCATCGCCACCGCCGCGCTTCTGCAGAGCGCGATGGCAATTGGTGCCACGGGTAAAAAAGGCGTCTTTATGGAGGTGACGCGGCTCATCGACAGAGAGCCGAAATACATGGCCGACATGGGCGAGGGTGAGGATAAAACGCTTGTCTTCATTGACGATCCTTCCGATACCGTTCAGGAAGCGCTGCTGAGTGATGCAGACGCGAAAAAAACGGTGGTCTTCTTCATGAAGTTCCCTAACAAGCGCATTTCGGAAGTTGAACTGGTGCTGGCAGGCTGGAGCCTGCAGTCCGTTGACACGCCGAAAGGTAAGGTGCTGCAGGTCGAAGTCTATGGCAAGCAGAACAGCGTTAAATGGTCCGTAGAGCAGCCTGCCGGTGGCAGTGACTAACGGTATTTATCCCCGCTCAGGAAGCGGGGTTTTTCTAATGACACAGGACAACCTTCATGAACTACAAATCCCTCATCAATCCGCTGAACACCACCGTTGAACAAACGCTCCTGGGCCAGAAGGTGTATCTTCGTCGCCTGACCAGTGCCGAGCTGGATGACTATAACGACAAAGTTGAAGCCGGACGTCAGGCCAGGCTTCCGTCGCGAGAGCTGTCCGCGATGGGGGTAAACCTGTTTCTCGCGGCGCTGGTCAATGAAGACGGCAGCAAGCCAAAAGCCAGCGAACTGCCCACCGCCGACCAGCTGATGGCTGCACACTCAAACGCCGATCTTCTCGATGCGGTCACGCTCGTTCAGCGCCATTCTTACGGCACGCTGGAGGAAGCCACAAAAAACTAACCGACTCGTCCCATCTCAGGCTGCTGTTCACGCTGGCGGACCGATGGGGCGAGAAGGACCCCCGCAAAATAGCCGAGCTTCCGGCGAATATACTGACCCACTGGCAGGCCTATTTCGAACTCCTGAAAACGGAGGCCGAAACGCCAGCGCCGGTTAACTCTCCCCCGGTGACTGCTGCGCAATCTGAAAGCGATCAGCAGTTCGCTGACTGCTTCAGGATATTAGGACATGGCTGCTGACGTTGCGTCGTTAGCTGTCGCGCTGCATCTCAATTCCGCCAGCTTTAAATCACAGTTTGCTGATGCTATGCGAACGGCGGACAGCAGCGCCCAGCAATTTAACAGGAAAGTCCAGACGGACAATCAGAAAACCCGGCAGTCGTTTGAAGGGCTGGGCAAGGGGATTACCGGGCTGGATGCCGACTTTAACAAGCTTGGTAAAACGGTCGACAAACGGCTGACCGGGCTGGATGAAATGCGTGGTCTTCTGGCCAACATTTCTGCAGGCAGTACGGTTGCCGGAAGTTCTATCACCACGGCGCTGGTCTCGGCCCTCAGCGAGGGGATGAGCACCGCGCTGGATAACAGCATTACGGGCCTGAAATCCCAGCGACAGGCCCAGATTGAGTTTACCCAGGCGCAGATAAGCGCAGCGCAGGGCTCGATAGAGAACGCCAGGCAGCTGCGTGCTGAAGCTATTGAGAAACAGAACATCGCGGTAAAAACCATCGAAGCCGCCCGTGCCGACCGCGAACGCGCATTTGCGCTCGATGAGCACTTTGCCAAACAGGCCGAGGTGAACAAGCAGTACGGGCTGGCCGTCAGCTATGAGGCCGAGCACGTTAAAAACGCCAGAACCATTCAGGAGGCGAATCTTGCTGAAGCGAAGGCGAAGGGCAGTCTTGCAGAAGCGACGAAAACGATGCTGGCGGCTGATATCGCCGAGTCTGCCGGGAAGCAGCAGCTGGCCACCTCAACGCGCCAGCTAGCCGTGGCCAGCCAGGATCTATCGTTAGGACAGAGAGCGGCAGCAGCCAGTGCAGGATTAATGCGCGGCGCGATGGCGATGGTTGGCGGTCCTGTCGGACTGGCTGTGATTGCCGTCGCCAGTGCGGTGACTGCGATTTACTCGGCCTACTCCAACAGCGAAGCGGTCATTAAAGGGTATACGCAGGCGTTACAGAAATCCGGGCAGCAGTCCGTTATGTCGGTGATGTATCTACAGAACCTAACTTCAAGCCTCGGTGATTCAGATCGGGCAGTTAAAGCGGTTACGGCATCCGTATCGGCGGGGTTCGGCGGCAATATGCTGGAGCAGGTCGCCAGCCTCGGCACGCGAATGGAGGAAATTGGGCAGAGTTCTGACGATCTTGTGTCGCTGCTTTCAAGCCTGAAAGGCGATCCGCTGCAGGCGCTTCAAAAGCTGACCGACCAGGGGATTTTGCTCAACGGCAGCATGATAGACCAGATAGTCACGCTCGAGCGCCAGGGGAAAACCTCTGAAGCAACGGCGCTGCTGCAGCAGGCGGCGATGAATGACCTTGATTCCAAACTCAAGGAACAGGAATCGAATGTAGGTGGGCTGAAAAGCGCGTGGAAATCGCTGAAAGACTTTGTATCCGATGCGTTCAAAACGATGGGTGATGCGCATATAGCCACCGCGCAGGCGATGGCTGCAGGTGCAGGTGTAGACCTCGATACCACTCCTGACCCGGCGATTAAGCAGCGTGAAGAGGCGGAAAAGCAGTATCAGGCGCAGAAAAAGCAGCGTGAAGAAATTTCGAAACGTCTGAAGGATGAAAACACGCTTTCAGGTCTGCTAAAAGCCGGTACATCGCGCGAAAAAGAGCGTGCAGATGCCGTTGCGCTTGTGAATGCCAATTTCACGAAAGGAACGTCTGAATATACGCAGGCAATGCGCGGTATAGACAAAATGTATGCCGAGCAGAAAAAAGTGCGTGCTAAGGCGTACAGCGACGATGCAGCGACTACGCGGCTGAATCAACTTCGCCAGGAAGAGGCGGCGCTGCGTTCCCAGAATGAACAGACCGAGACGCTGACTCAATCAGAAAAGAAACTGGCGCAGTTCAACCAGGAAATCGCGGACCTCAAAGAGAAGCGCATCCTGACCGCTGGCCAGCGTAGCATTCTGGCGCAGGAGACGGAGCTGCGTCACCAGCTGGAGATTAACGCCAGCCTGGATAAAGCCAACCAGCAGCGCAAACTCGGCCTCCAGATTCAGGAGCGGAACCAGGAGCTTTATCGCTCAACGCTGCAGCTGCAGCAGGAATATGCAAACCGGGTCGCCCAGATGACCATGAGCTCCGATGCTTATGACCAGATGGTTGCTGAGCAGCAGGTCCGGGAGCGTTTTGCAAAGCTCCGGGAAGAGCAGGATAAAACGATTGCCGATCACAGTTCCGAACTGTACCGAAAACAAACTGAGGTGCTAAGGGACGAAGAGCAGAAACAGTTAGAGATTGTCCGTAGCGGTGCGGAACGGAAAAAACAGGTAGACGGGTCCTGGTTTGACGGCATGAAGAAAGGGCTGACAGACTGGCGCGTTGACGCTGAAAACCAGTTCACTCAGGCTCGGGATATTGCCATAAACGCGATGGAAGGCATGGGTACAGCCCTCTGGAATGTCGCCTCGAAGGGAAAGGGAGATTTCAAATCGCTGGCCGTGTCCGTTATTGATGATATTGGCAAAATGATTACTCAGATGGCGATGTTTAATGCCATTAAATCCGGCTCGAAAGCCCTGGGAATCGAAAGCTGGTTCGGTTGGGCTGACGGGGGTTATACCGGCGACGGCGGCAAGCATGACGTCGCCGGTGTGGTTCACCGTGGTGAATGGGTGGTTCCGCAATCTGTGGTCAAGAAGCCCGGCATGCTCGGATTTCTGAATCAGCTTACATACGGCAACGGCTATGCAGAAGGCGGGCTGGTCGGTGGTGGCGTGGCAAAACCATCCGGCGATTCGTATTCGCAGCCACCTGCTGGTCAGGGCAGCATCCATTTTTCTTTAACCATTCCGCTGCAGGTTATACACCAGGGCGGTGCGAACCAGGAACCCACCTCAAAAAGCCAGGAATTTCTGTCCAGCGAAACCAAAGCCCTATTCAAGCAGTTTGTTCTGGAAACGCTTGACCGCGAACTGGCCAACGGAGGCATGATCGACACCAAAATGAGGACGGCCTGATGGCATTGCAGACGTTTACCTGGTCTCCGCGTAATGGCCCTGTAGGAGACATTAAGTACCGAACCAGCAGCGTTCAATACGGTGATAGCTATGAGGCAATAACCGGAGAGGGGATTAACCCGGAGACGCAGTCGTGGCCGCTAACGTTTACCGGTATGAATGAGGACATGAAGCCTGTTCTCAAGTTTTTGCGCGAGCATGGCGAAGTCAAATCATTCAAATGGACCAATCCATTGGGGGAGCTAGGGCTTTACCGGGCATCCCAGCTGAAAGTTAGCGCTCTTGATTTTGCGCGTATGACAATTACAGTCACGTTTGCTACAGCATATCGTGCAGAACCAATATAAACTTGAGGGAGATGAATATCATTTTGCTATGATACTTTCTTTGAAAAAAGGGAATGTTGTCATGCTTAAAATATGTGGATTTGCAGTTCTTGCGCTTGGTGTTATATGCATCATTATGGGCCTTGATATGGATGTCACCGTGAGCTCAGGTGCTCAGATGAACGTATATAACACGGGCCTCATTGCTTCCCGACAGATGACTATCTCAATCGGGTGTTCACTTATGGTCACCGGCGCAATTCTTTTGTCAGGTGGTATTTTGAAAGAAGCGATTATCAAGAGTGTTTTACCACAGACAAAAGCTGGCACTGAATCGCCTGTTCAGGAATCGCAATTTGTAGAGAAGACCACTGACGGTAGCTATATTCTGAGTGAAAATGCAGTTCGTCATTATGCAGAAAAGTTGCATAACGAAATGCCAGATAATACCGCCCTGTCCGTAATGGTCACTAATGCACCACATATTGAAAGAATAAAATCGGCTATGTCTCCTGAATTAGCTAAAAAGTTTGAAAGGCTACTGGAAACACACTTACAAGCCATCAAATAACGTTTAATAGGCACCATAAAAACCCCGCATTGCGGGGTTTTTTATTATGGGGGCCACGCTTCTGTATGAGAGGTTTCTATGGGCATTGCTGCTGACGATCAAAAACTCCAACCCGGCAATCAGATCACCCTGTTTGAAGTTGACGGTACCGCGTTCGGGGCCGACGTTCTATATTTCCACAACCACGCGGTACCGTACACAGAAGCGGAAATTATCGATGCCGGTGATGATGAATCAAAGCTCCCCGGGAAGCCTATTTACTGGCAGGGAATACGATACGAACTCTGGCCATGCCAGATAGAGGATATTGAAGCTAACGGCGACGGAACGCCAGTATCGCCTAAGTTATCTGTTGGGAATCTGGACGGCTCAATATCCGCGCTGTGTCACCTGTTCCAAGATATGAAGCAGGCGAAGGTTACTATTCACCGGACGTATGCCCACTACCTCGATGCCAGCAATTTTCCTGATGGAAACCCGCAAGCCGATCCGACTGCCGAGCAGCTGGAGGTGTTTTACATCGACAGCAAAACTGCGGATAACGAAACGGACGTCCAGTTCAAACTGAGCTCGCCTGTTGACGTGACCGGGCAGAAGGTTCCGGGCAGGCAAATGACCAGCCGGTGCGCCTGGTGCCTGCAGGGCCAGTATCGCGGTGCGGACTGCGGTTACACCGGCACGAAGTATTTCGACAAATTCGGCAATCCGGTTGATAACCCTGCAGATGATGTCTGTTCCGGAACGGTCGCTGGCTGCAAGCTGCGCTGGGGGGAGGATGAGCAGCTGCCGTTTGGCGGTTTTCCGGCAATTGCGATCACGAGGATTTAATCATGCTGAGCCAGCGACTTATTACCGCTATTGAAAAACACGCTGCAGCAGCTTACCCCAATGAATGCTGTGGCCTGATTATTCGCGCAACTAGACAGCGCCGGTACATCCCCTGCAGTAATGCACACGAAAACCCCACGGAGCACTTCATGATTTCTGCGCAGGCCTGGGCTGATGCTGAGGATATGGGGGAAGTGCTGGCCATCGTTCATTCACATCCGGATGCGGGACCGCATGCTTCTTCCGACGATCTGAAGTCGTGTCATGACTCCGGATTGCCTTGGGTGATCATGTCATGGCCAGGCGGTGAGTACACGGTGGCCACACCGGCAGATACGCCACCGATTCTCAAGCGGCCCTTTATACACGGCAGCTGGGATTGCTACGGGCTCATCCGGGACTGGTATCAGCAGGAGCGGGGCATCGAATTGCCTGATTTTCACCGTGACGACAACTGGTGGACGCGCGGCGAAAACCTTTACGTAAAACACTATGCCGACGCGGGATTTTATTCACACGCTGACGAGCTGCAGGTGGGGGATGTGATCCTGATGCAGTACAAAGCGGAAGAAATCAACCACGCAGGCATCTATCTCGGTGACGGGAAAATGTTGCATCACATGTACGGAAAACTGAGTGAAGTTGTTCCCTACGGCGGCATGTGGCGCGAGAGAACAATGCTTACCTTAAGGTATCAGGATGGCACAGAACACAGTTGAGAAAATTGTGCTTGTGCGGCTCTACGGCAAGCTAGGCACATTATTTGGGCGAGAGCATCGCCTTTCTGTTTCATCAGTGCGAGAGGCTATCAGGGCGCTTTGTATCATGATCCCCGGACTTGAACGCTGGCTTGAAACAAGTGAAGAAAGAGGGGTGACATACGCGGTGTTTAACGGAACACGAAATATCAGCACTCAGGATCTTCACCTGAATGGCGTGCATGATGTCATCAAAATTGCGCCGGTCATTATTGGCAGTAAAAAATCTGGTGTTTTCCAGACGATATTTGGTGCAGTGCTGGTAGTTATTGGCGCGGTACTGAGTTTTACGCCAGCTGCAGCTGCATCACCGTTCCTCTACAAAATGGGGGCGGCGATGATGCTGGGTGGTGTTGTCCAGATGCTCACGCCCAGCGGTACGCAGGGCATGACGATGGACTCCGGTGATACCCGCAAAAGCTATTCATTCGGAGCCCCAATAAATCAGTCTGCAGCCGGAAACGGCGTCAATCTTCTCTACGGTGAGCGTCTGATTGCCGGTGTTCTTATCAGCGGCGGTATCTACGCAGAAGAACAGCAATAACGCTTATCTCGCAACATGTTTAATTCTCCCGCTCAGGCGGGATTTTTTTTGCCCGGAGTTTGCATATGGCAGTAATCAGAGGTTCGAAAGGGGGCGGTGGCGGCGGTGATAAAGGCGGCAATCGCGGTACCGAGATCGCCTCCGTAGCGTACATGAAAATTCTGCTGGCGCTGACGGAAGGGGAAGCTGCAGGAGACTTTACCGGCAAAGATATTTATCTCGATGGAACTCCACTGCTTGATGATGCTGGCAATGAAAACTTTCCTGGCGTGACGTGGGAATGGCGCAGCGGCACAGTGGACCAGGATTATATTGCTGGATTCCCGGCCGTAGAGAATGAAATCACCGTTGGCACGGAACTGAAATACGGGACGCCGTGGGTTAAATCCATTAACAACACCCAGCTATCGGCAGTGCGTTTGCGACTTAAATTCCCGAACGGCGTTTACAAACTGCGCGACAGCGGCGGGAAGGATGGCTACCGGATTGAGTTCGCTATCGATATTTCAACCGATGGCGGTCCCTATGTTGAATACGGTACCGATGAAGCGGATGGCATTGCCGATGCCGGGTATGAGCGGAGCTATCGAATTGATCTGCCGGCAGCAACATCCGGCTGGCAAATCCGCGTCAGACGCCTGACGGAAAATACCACTAATGGGCGGCATGCGGATATCTCGCGTATTGAGTCGATGACCGATATTGTCGATGCCAAGCTGCGCTATCCACACACGTCCCTTCTGTTCATCCAGTTCGACTCGAAGCTGTTTGATGGCAGAACGCCAAACGTCACCGTGAAAATGAAGGGGATAATCGTTCGCGTACCGGCGAACTACGATCCGATATCCCGCACCTACAGCGGTATCTGGGATGGAACCTTTAAATGGGCATGGACGAACAACCCCGCCTGGATTTTTTACGATCTCGTGCTGAATAAGCGGTACGGCCTGGGAAAACGAATCACCGCGGATTTAGTTGATAAATGGACCCTGTACCAGATTGCACAGTACTGCGATGCGCCGGTTTCGGATGGCGCAGGCGGGAAAGAAGCGCGGTATCTCTGTGATTTGTACATCTCCCAGCGCACCGATGCATGGACCGTGCTGATGGATTTGGCGAACATCTTCAGAGGAATGATTAGCTGGTCCAACAATCTTCTGTCAGTTGACGCCGATATGCCCCGCGAGCTGGAGCCTGATTTTGTGTTCAATAAGTCGAATATCGTGGGTGCGTTTAATTTCTCCAGCACATCTGAGAAGACGAACTATTCATCAGCAATCGTCACCTACAGCAATCCGGCCAACGGCTATCAGGACGATCAGGCCAGCGCCTGGGTACCGGAAATCTCAAGCCGATTCGGGTTTAACACTATAGAGCTGACACGCATCGGGTGTACGAGGGAATCAGAAGCGCAGCGGCACGGGCTTTACGCTATAGAAACCAACCGCGATGACAATGCGGTGGAGTTTAAAACCGGGCTGGAAGGGCGCATCCCGCGTATAGGTAAGGTGATTGGCATCAATAACGCCCCGCAGGCTGGTCGTGACAACGGCGGTCGGGTGTCGGCAGCTTCCGGAACGAAAGTGACACTGGACCGGATTACAACAGCAAAAGCGGGGGACACACTTATCGTGAATCTGCCTACCGGTAAATCCGAAGGCAGACAGGTGGTAAGCGTCTCCGGACGCGCTGTTACCGTTGAAAAGGCATACAGCGTTACCCCAAATGCCGAATCCGCGTGGGTTCTGGACCAGCCAGATTTAGCAATTCAGCTGTTCCGCGTTAAGCGGCTTGCGGTTAATTCGGATAACACGGTCACCATTAATGGCCTGCCTTACAACCCGAACAAGTTTCCGCGCGTTGATGATGGCGCGGTGATTGAAGACAGGCCTGTCAGCGTCGTTCCGCCGCGCGGACAGGGCATGCCGGAAAATATTGCTATCTCAAGCGTGTACCGAGTTGAACAGGGGATAGGCATCACCACGATGGTTGTTACGTGGGATACCGTCAAAAATGCCGTTGCCTATGAGGCGCAGTGGCGTCAGAACAACGGCGACTGGATTAATGTTCCGCGCACCGGCAACACGCGCTTTGAGGTAGACGGGATTTACGCTGGTCGCTACGTGGTCCGAATCCGCGCGGTTAACGCGCTCGATATCGCATCCCTCTGGGCAACGTCAGCAGAGACTGAACTTACGGGTAAGGTGGGAAAACCACCTATGCCCGTGAATCTCTCCACGCAGTCTTTAGTGTTTGGGATCGGCATTTCCTGGGGATTTCCATCCGGGGCGCAGGACACGCAGAAGACAGAGATCCACTACAGCACCACGGCGAACGGTGATTCTCCGTTACTGCTGGCAGACGTGCCTTATCCCTCATCGACCTACCAGCAAATGGGGCTGCTCGCTGGCAAATCGTTCTGGTACCGGGCAAGGCTCGTTGATCGCCTGGGCAATCAGAGCGACTGGACCGAGTGGGTATTTGGTCAGTCGAGCACGGACGTATCTGATATCACCGATTCCATTCTCAAGGAGATGGAGGAAACAGGTCTACTGAAGGATGTGGTTGAGAATGCCGTCGACAGCAATGAAAAAATTGCTGCCATGGTTGATGACATCAAACAGGCTAATGACGAGCTGGAGCTGCAGGCGAAGGATATCGCCCAAAACGCCCAGAACATTGGGAAGGTACAAACCAGCGTTAATGAGCTTTCCAGCACGGTCGGGGATGTTTCGTCTTCACTCAGTCAGCTTGAGCAAACGGTGGCAACAGAAGATGCCGCCCTGGGCCAGCGAATCGACAGCATCAGCGTATCCATGGACGGCATGACGGGTGGAGTGAAGAACTCAGCCATCGCCATTATCCAGAACGGGCTGGCGCAGGTGGCCACACGCAAAAGGTTATCCGCAACGGTCGCCGGTAATAGCGCGCAGCTGGACCGTATTGACGAGGTTATCGTTAACGAGAAGGAAGCAACGGCGCGCTCGCTGCTGAGTCTTCAGACGGACGTGAACGGCAACAAAGCGTCAATCAATAGTCTGAACCAGACGTTTTCGGACTACCAGCAGGCTATGGCCACGCAGGTAAACAGCATCACGGCGACCGTTAATGGTCACACTTCTGCAATTACCACCAATGCCGAGGCAATTGCCAACGTGAATGGCGACCTGAAGGCGATGTACAGCATCAAGGTCGGGTTATCCAGCAACGGCCAGTATTACGCGGCGGGGATGGGAATTGGTGTTGAGAATACGCCGTCAGGCATGCAGTCGCAGGTTATCTTCCTGGCAGACCGCTTCGCAGTAACGCACCAGGCCGGAGCGCAGGTCACGCTTCCGTTCGTTATTCAGAACGGGCAAACCATCATCCGGGACACGGTCATTGGTGACGGGACAATCGGCAACCTCAAAATCGGCAGCTACATCCAGTCGACAACCTGGGATGGCACAGGGAACGTTGGCTGGCACATCAACAAGTCAGGCTACGCGACGTTCAACAACGTGACCGTTCGCGGCTCGATTTACGCCACAAACGGTAATTTTTCTTTCAATGGCTCCGGCAACACAACAGTGATCAATGGCAACGGTTTAACCGTCAACATTCCTGGTGGTGGCCGGATTGTACTGGGGACATGGACATAAGATGCCGACAGGATTATTGATAGAACTTAATGACGGCGGAAAGCGTATGGAGATAACGGCGGGCCTGAGATGCCCGTCTTTTGGTGGCAGCTTTGACACTGGCTACCAGAAAGCAAAGTATGTGGATATCGCTGGTTATGTTTCAGGATCTCAGGTTCTGTTTATACCTCATGCGACTGCTTATGTTGACTCAGGGCTGTGGCATAAAATGAATTCCATCACAATCTCTGGTGGGAGGGTTACGCAAAATTCGAGAATGCAGGCTCTGGGTATAAGTGAGAGGGATAGTACCTATACCTTTCCCGGTAGTGTCTGGCAGATATTCCCGACAGGTCAGCGAAATGGGGTTGGCTTGCTTATTGGTGACAGCACCGACTTCCTGGCGATCACCAACGCCACACAGTCAGGCCAGTGTATCTGGAAGGGTACCGTTAATGTTCCGACCGCGGGATGGGCGGTTCCCGCGATAGCAGGATACGACAAGTCGAAGTATATCGTTTTCGGGCGCTGTAATAGCGGTAACACGATTGACTTCGACGGTAACACGGTCAGGTTCTTCAGCCCTCCGTCCACGAACGATGACGCTCCCGCAACCGGCACGATAGACATCGTTATTTTCGCCAGTGGCATAGCGCCGCAGCCTGGTACCGGCCTCAATATTTTTAATGCTGCAGGAGTATGCACGTTTTCAACAACAAGACGACCTTTCGTATACCTCAACCAACTCTGGACCCCTTCGACAAGTGCCGTGAGCATCGGTAACGGATATGTTCCGCTGGGTAGGTTTGGGCTTATGGTGCATACGGTAAACGGCATGTATGTATATCGAATGTTCGGAATAAAAATACAGAACGGCAGCGCTTCTGTTCAGGGCGGGAAATATCTTGGCCGCGAACAATATGCCATTTTCGGTAATAACACGGTTACGTCGCTCAGCCTTCCTGTTTTGCCCGATATGTACGTCTGAATTAACTGTCTATTCAAATCAACCTCGCTTCGGCGGGTTTTTTTTATGTCTGGAGAAAATATGCTTTATAACACTGGCACTATCGCTATTAACGGAAATACTGCAACCGGCACAGGTACAAACTGGACGGCACCGGCCAGCCAGGTTCGTGCTGGCCAGACGATTATTGTCATGTCTAACCCGGTCCAGATGTTCCAGATTTCTTCCGTGAACAGCGCCACGTCAATGACGGTTACGCCAGCGGCTTCCCCGGCGCTGAGCGGCCAGAAGTACGGCATTCTGGTATCAGACAATATCTCGGTCGATGGCCTGGCGCAGGCGATGTCTCAGCTCATTAACGAGTATGACGAGAATATCGGTGCGTGGGAGACGTTCGCCACCACATCAGCAAATCAGAACATCACCGTAACCATCAACGGCACGCCAGTAACCATCCCCGGCATCGGGAAACTGGCGCAGAAAGGGAGCAATGGTGCGCTTGCAATCGCTGACGGCGGGACCGGGGCAACGAAGCCAGAAGACGTTCGCACAAACCTCGGTTTGGGAGATGACATCACTGCCAACTTCGGAAGCCTCGAAATTGGAGCGAAAAAAGCCTCTTCTGCAAGCTTCATTGATTTTCATTTTCTTGGCACTAATGACTATGACGCGCGCATCCTTTGTGGTGGCAATTCGAATGGAGGGATGGGGAAAGGGGATTTCACTTTCTATGCTGGAAAATACACTTTTATCGGTGACAGTTTTGAGTTTCGAAATCCTATCACCTGCCAGAACAGCATAACTGCTTCAGGGAGCATTAACGCAGGCGGCTCACTAAGAGCCGTAACGTCATCAAACGTATGGGCCTCCAGCGATACACAGAACGCTCACGTGTGGTTTTACGGTGCGGGAGGTATTGAATCACGAGGGGTAATCTATGCGCCCAAGGAAGGTACCATCCGATTAAGGCCTGATAATAATGATAATGGTGGAGCAAATGGCTACAGCTTCTCCTTCGGAGCTGATGGCAGGTTTACCTGTATTTCTGTGAACCAGACTTCGGATGAGCGAGTTAAATTCGACAAAGAGCCCGTCGGTAACGCTCTGGAGAAGATTTGTTCCCTGACGGGTTATACGTTTGGCATTCAACTCACTGAATCAGAATCGGTGCGCAGCGCAGGTATTATTGCTCAGGATTTGGAAAAAGTGCTGCCCGTCGCTGTAAGTTCTGGCGCAACCTGCACCACAAGGACCGGAGAAGAGATTAACGACCTCAAAACCGTTGACTACAGCGCAATGAGCGCACTGTATGTTGAAGCAATTAAAGAGCTAGCCGAACGGCTACAGATCCTTGAAAATAAACTGGCTGAACTACGGTTTTTTAGCACAAATTTAAGCTGACAGATTTCCGTGGAAACACATAACCAAATCTGACGCCCTGATTTGAACCGTAAGCGCATCAAACCATATCTTGGCTTATTAATTTTATTCAGATAATATGCATTGTGTAAAATTCATAACTAAAAATGTTGATTGGAAAGTAAAATCATGCCAAAAACGCACCACTACAAATGGTTTCTTAGTACATTTTTTGCTATGCATAACTTTGGTTTTACCTTAGCAACAATTAAAAGCTTTAACTACCAAACTGAAAAAGCTATTCGAGCAATGATCGATGACTATCAGAAAAATGGCCCTGCAGTTTTCGACTATATACAAGACAAACAAGGCATAATAGATTCTATAGACTCCATACCAGAATCACTCGTCGAATCCACAGACAAAATTCATACAGTTTATCATCCTAGTTTAATGCGCAGATCTGCTTTTCTAACCATTTTTGGCATGATTGAGCATGAAGTGGATAACGTATGTGACAACTTTTCCAAAAAGCACAAAGTGAGTGTTAAAGTAAATGATCTCAAAGGGAATGGTTTCGAAAGAAGTAACTTATTTATCTCGCGAATTATTGGTTTAAAAGACTCGCAACACTACGCACAAATAAAAAGGATCATAAAGCTAAGGAATAGTTGTGCTCATAATGATGCAAAGTTTATAACCCCTGACGGTAATGAAATTAAGGAAATCACTAGATTAATGAATGATTTTCCAAAATATTTCTTCAAAGATGGAAGCTCAGTGGGGTTCCATCCTAATGTTTTAGACTTTTTTACAGAATCTTTGGAAGCATATCTTCTTGAAATTGAAGTGGCGCTAAACAAGCATCAAAAGTAATTAGAACCTGTACCATAGCGTTTAAGCTCCTCATAAGCTTAACCTTCGCTTCTAGAAGCCTTTGTGTTGCTCTGTTCCTCTAATACCATAATACGTAATGCAAGCGTTTTTATTGCTGCCAGCGCATCGAGCAACATTGGCGTCTGATCAAGATGCAATATGCCGCCTATTTCTTTCACGTATTCGGGATCAATCGTTTCAATCTGCTGAGATATCACCCCGCGTCGTGGGGTCTGCGTTTCATCATCCTTAAAAGTGAAGTGCTTGAATTCCATCCTGCAGATATTAAGCAGCGCCTCTTCTAGATCGAGGTCATCACCGACGTTTTTCATAGTTCTATCCGACACCGCTGAAGTCATGATCTCCTTCCACGGGCTCCAGGCATTGGTGTTGTAACCCCTGAAAAAAAATCGTCCAGCATCAGTTTGGCTTGGATACGGCAAACAAAACTGCGTTAGCGCTACATCGGCAATACGAACATAATTTTGAACATACCCATACCAACTTGAGATAGGTCCTGAGGTTGATTTCGACATATCCAAAAGTAAACGATAAGTTCCTGGCTCTGTAAGGCTGTTGAAGTTTGTCCCATCAGGAGCAACTGCCGAGTCTGTTTTAAATACCCTGGCATCACCAGTAGGTAATCCAAATGCTCCCACTTGCATGACGTTCCCGGTTGCCGTTCCGACGTCCTTCGTAGCGCTACTTCCCAAACCGACGTTTTATAGATTGCCCTTTGGTGGCCTGGCCGATAACTTCATCTGATTTTTTTGTGAAAATTATTGGGTGAAAAGTATGCAAATTGGCTACGTAAGGGTGTCAACAAATGACCAAAACACGGATCATCAGCGACAGGCACTCGAACGCTCAGGATGTGAACAGATTTTCGAAGAAAAAATGAGCGGAACTGTGGCGAACCGACCGGCGCTTAAAAAGCTTCTGAGAACGCTGAACGAGGGCGATACGCTTGTGGTCTGGAAGCTGGATCGTCTTGGTCGTAGCATGCGTAATTTGGTGCTGCTGGTGGATGAACTACGTCAGCGTGGCATCCACTTCAAGAGCCTCACGGACAGCATTGATACATCCAGCCCAATGGGGCGTTTTATATTTCACATCATGTCAGCACTGGCGGAAATGGAAAGAGAGTTGATCGTTGAACGCACCCGGGCGGGATTAGCGGCGGCGAGAGAAAAAGGGCGAATAGGTGGGCGACGCCCAAAGTTGACCGAAGAGCAATGGGCTCAGGCTGGCAGATTGATCACAAATGGAGTGGATCGGAAGCAGGTGGCAATTATTTATGATGTAGCCGTATGCACTCTTTATAAAAAATTTCCTGCATCTAAGCCGGTTTAATTGTGAGCATACGTGGTCATACCGGGAAAATTTACAAAAAGCATAATTTGAAGCGAGATAGAAACTTACAAACCAAACGGCGAAGCTTTGCACAGTCGCTGGAACCGTGGTGTCTTGCGCGCAAACCCAAATGAAACTACTGTATATAAAAACAGTGTTCGAGGTATGCGTAATGGAATTCTTCAGACCTACAGAACTGAGAGAAATTATCTCAATCCCACTTTTCAGCGACTTAGTTCAGTGTGGCTTCCCGAGCCCGGCAGCTGACTACGTTGAGCAGCGCATTGATCTCAATGAGCTTTTAGTTGCACATCCCAGCTCGACGTATTTCGTAAAAGCCGCGGGCGACTCGATGATTGAGGCCGGGATTAGCGACGGTGATCTGCTGGTGGTGGATAGCTCTCGTACTGCTGAACACGGAGATATCGTAATCGCCGCGGTAGAAGGGGAGTTCACAGTCAAACGTCTGCAGCTCCGCCCGACAGTTCAACTCAATCCAATGAACAGCGCTTATTCGCCAATTATCGTTGGCAGCGAGGACACGCTCGATGTGTTTGGCGTTGTAACCTTCATCGTAAAATCTGCGGGCTAAATATGTTTGCGCTCTGTGATGTGAATTCGTTCTACGCATCATGCGAGACGGTCTTCAGGCCCGATTTAAGAGGCCGGCCAGTTGTCGTTCTCTCGAATAACGATGGGTGTGTGATAGCTCGCAGCGCTGAGGCCAAGGCGGCTGGAATTACTATGGGGGAGCCTTTCTTCAAGCAAAAGGATTTGTTCCGCCGTGCCGGCGTTGTTTGCTTCAGCAGTAACTACGAACTCTATGCAGACATGTCGAACCGGGTAATGACGACACTGGAGGAAATGAGCCCCCGTGTCGAAATTTACAGTATTGACGAAGCTTTTTGCGACCTGACTGGAGTGCGCAATTGCCGGGACCTGACTGAGTTTGGCAAAGAAATTCGCGCTACACTCCTCAAGCGTACACACCTGACAGTGGGCGTAGGTATTGCCCAGACAAAGACACTGGCGAAGTTGGCAAATCACGCAGCCAAGAAATGGCAGAGGCAGACTGATGGGGTGGTTGATTTGTCCAATATCGATCGCCAGCGTCGGCTATTGGCTATCGTGCCTGTGGAGGATGTCTGGGGCGTCGGCAGGCGCATAAGTAAGAAGTTGAACGCCATGGGCATCAAAACGGCTCTGGACCTCTCTGAGCAGAGTACGTGGATTATTCGAAAGCACTTCAATGTCGTCCTGGAGCGAACTGTCCGGGAGCTGCGCGGCGAACCATGTCTGGATCTGGAGGAGTTTGCACCAGCTAAGCAGGAAATTGTCTGTAGCAGGTCGTTCGGTGAACGCGTTACCGAATACGAACAGATGCGGCAGGCTATTTGCAGCTATGCGGCGCGTGGTGCTGAAAAGCTTCGCGGCGAACACCAGTATTGCCGTTTTATCTCTGCGTTCGTGAAGACGTCACCTTTTGCCCTGAACGAGCCGTATTACGGCAACAGCGCGTCAGTGAAGCTTCTCACCCCCACTCAGGATTCCCGCGACATCATCAACGCCGCGGTAAAGTGCTTGGACAAAATTTGGAAGGATGGGCACCGCTATCAAAAGGCTGGAATTATGCTGGGAGATTTTTTCAGTCAAGGGGTGGCACAGCTGAACCTTTTCGATGAGCACGCGCCGCGCGCTGGAAGCGATAAGTTGATGGATGTGCTCGATCAATTGAACGCGAAAGAAGGGAAAGGAACACTCTTCTTTGCCGGGCAGGGCATTCAACAGCAGTGGCAGATGAAGCGGCAGTTGCTTTCACCCCGATATACTACTCGCATATCTGATTTATTAATTGTGAAGTAAAAAGGCTGCTTTACGCAGCCTTTTTAGTTAATTAAGCAGCTTTTTTCTCGCTGCATTTGCAATGAGGGATAACAAAAACTTTCGCCTTTTTAGGGCGGATGATTTTACCATCCTTCATTACATAGGGCGTGAAGGTTACTTCACACGAATTCCCACATTTTGGACAATAGCCAGTTTTCATGTGAGATCTCCTATGTAAATGTACAGCCTGTAAATCTAAACAGATTGCACCTTGTTAGGAGAACCACTATACTTTTAGCCGTCAAGCATTTTGTGTAGCGGTAGATTTCTCCATACACACCGAGTTATGCGTCAACATAGCTCGCCCCCTAAGAAGCCCTGCGTCAACAGGGCTTTTTTCAATAAAATTCAAAGCAAAGCTGTGCTGTTTGATAGCCCAATTTATCGAGCATCGTTTCTGCAAAAAGATCCGCCTGCCATTCAGAATCTTCATTTTGTTCCGGCATGGCATTCGCAAAGTGTAAGGCTGGCTTATGTTGCAGTAGCAAGTGCCCGAGCTCATGGAATATAACAAATAATGCTGATCGTTCACCGCCACATGCCATTTCATAGATATGGTTTGGAACCCTGATAGTCAGTTTGTCAGGTTCACAGTGTCCAATCGTTAGACCCAGCGTTTCCTGAAACCATTCGTCATCATCCATTGGATCGAGGACAATATTCCATTGAGATAATGTTTCCAGCGCTACGTCAAATCTGCGCGGTCTACGCCGGTATTTATACTTACTAGTAAATCCGAGTGCAAAGCAAGCATTTACAGCTATCGCTTTAATTTCCATTTCACTCAAAGGACGAACACGTGTACCGCGCATTAAATGCATAACAGTTTCCTAAGCAGTTATTTTTTGTTGATTTCTGCAAGTAATTCTGCAAATTTTTTTAATTCTTCATGGGTGAATTCCGATTTTGCAAAACCAGCAACAAGCATTTGTTGTTGCTTTGGTAAACCTTCAATAGGTACCGATTCATTCGCAATAGAAGCTAAAACATCGAGATCATCAATGTAATAACCTTGCGCTGCAAAAAGTTCATTGATTGCCGTTACCCATTTCTCTGGTATTTTTTTAGTCCCTGTTTCGAGCCCGCTTAGAAAAGCAGGTGTTGTTCCTAAAGCCTTTGCCATTGTAAGCAAAGTATAACCAGTATCGATCCGAGCTTTCCTTACGGCCTTTCCGAAATCAGTAAGAGCCATAGTCATATCCTCAGTGCTTTCATTGTGGCTTATGCCAGATGTTCTAAATATACTCTAACCATAAAAAAAGTAAACCCATCAAGAATAATAAATTAACCAAAAAGGTTAATTTATTATTTCTATTAGATTTTTTCCTTGATTTTTGGCGTTTCCAACCGCGCGTGTAACAGCGTGCCAGATAAACTTGTTGGCAGGCACTGCGCCGTCGGCTGCTATCTCTTCAGCTTCTTTCCCGCCTATATCCTGACGTATCCACTCGCGAGCCGCTTCCGGTGACAGAACCAGTGGCCGGCGATCGTGAATATCGACCAGACCTTTATCAGCTGCAGATGTCACGATGAGAAAACCTTCAGCTTCGTCTCCACGCTCAAATGGTGTGCTGCCGATAGCCGCCATAAATATTGGCTGGCCGTCGGCCCGGTGAATGAAGTAGGGCTGTTTCTTGTCTCCTTCCTTCTTCCATTCGAACCAGCCATCCGCAAAGCAGATCGCCCGGCCATGTTGCCAGAGAGGTTTGAACATACGGCTGGTGGCCGCCGTCTCGACGCGTGCGTTAATCAGTGGCGGTTTATCCCACCACTCGGGCGCGTAAGACCACAGGACTGGATCAAGATGCAGCTGCTCGTTGCGTTCGCTCAGGAGCAGCACCTTGGTACCGGGCGAGACGTTGTACCGGCCTATAGGTTCCGGGTCATACGCAATTTCACGATCGGCTTCATTGGCCAGGTATGCCAGATATTCTTCACGGGTTTGGGCTTGTGCAAAACGTCCACACATAGGAACCTCCAGTCAGTCAGACTGAAAGTATAGGGCAGGGAGAAAAAGTAGTGCGCGCTGGTTAAGTCCTACAAACGGAATCGCGCTGATTATGCTAATGAGGATGAAATGCGTAAAGCGATGCGTTGTGAAACTGGAAGGAGCTACGCAAAGTTGCGACAATTCCAAAAGAGAAAGCTAAACCCGATAGGGGATCCGGGTTCGCTGTGATGACAATTTGGAAATTCATTCATTAGTAAGCCACATATCAGACTCTTCAAACATATCCTCCAGCATACGGTTCAACTTTTCACGGTCACTTTTGCTGGCATCACTATTCAATCCGTTAGCCTGCATAGGTTTCACCTTCACTTCAGCATCAGGGAATATGTTATGAACTCTCCTCGTCAGTTCAGCCAAAATAATTTCTCTGGCGCCATCCAAGCCCTGGACATTTCGCTTATCGTAAACCAACTCAACAAACATAACAGTCTCCTTAATCTCCGTGAGAGGAAGCAATATTTTTACTGTAAATATATACAGTGTCAAGCGAGTGATACGATCTACACGATGGGATTTTTTCGTCCGATTGAACAGAAATTACCGGTAGTTATTCCCCAGTTATTCCCCAATGTTTCCCCATAAGAAAACTACCAATAAAAAAACCAGCCATAAGAGGCTGGTTTTCAATGTGTTTTTGGTCGGCACGAGAGGATTTGAACCTCCGACCCCCGACACCCCATGACGGTGCGCTACCAGGCTGCGCTACGTGCCGACGCATAAAAGGAATACTACTCGATTCCTTTTTGAATGCAAGGGGCAAGCAACGCTAACTGATTCATAAATAATCAGTTAGCGATAAAGCGCTTCTCGTCTGTCAGCACCTGCAACAGCAAACTGAGCTGCGGCTTCTGGTCTTTCAGCTTCTCACCTTCGAGGTTATACGTCTGGTAATTGCCATTGCTGTTCAGCACCAGGGACAACGTTGGCGTGGTCACCACCAGCGTGTTTCCGCCAGCAGCCGTGACCCAGTTATGACGCCGCGTGGCGCTAAAGAGGTCTTGTCCCTGCGAGTACTCAATTGCCGGTGTGCTGACGTGCAGCAGCCGTTGCATTAACGTGGTCATGACGTCTTTATGGTCGGTGAGCATGTTGATGCGCTGCGCGGGAGTGCCCGGCCAGTGGATCACCAGGGGAACATGCAGGTTCGGACGCGACCACTCCATGCCCTTCGCCTCGTCGCCGAGCGGCACGCCATGCCCCGCCGTGATAATCACGACCGTATTGTCCAGCTTGCCTGCGTCGCGCAGGGCGGTCAGCACGCGGCCGATCTGCGCATCGACATCACTCGCCGCACGGCTATAGCGGCGCGCGAAGCCTTTCTGATTGCTGTCATCAAGCGTTGTGCCGTTAAACGCAACCCAGGAGAACCAGCGGTTATCTTCCTGCGCATAGCGCTTCAGCCAGTCTATCCACTGGTCAGCGGTCCGATCGTCAGACTGGCTCTGCGCGGCGGGCAGTGAGAAATCAGACAGCAGCGCCTGGCGGTACAGCGAGCTGTTGAAACCATCAGAGGCGAACAGCCCCAGCTGGTAGCCCTGCTGATTAAGCCCGGTAATCAATGCCGCCGGAATACGGGCCGACAGCACGCCGTCCATATAGCTCGGTGAAATGCCGTAGAACAGGCCGAAAATACCCGCGTCGGTCGAATTACCGGAGCTCATATGCTGAGTGAATACGATGTTGTTCTCAGCGAATTCAGCCAGCGCTGGCATCTGCTTCTCATAGCGGGAATAGTTCAGGCCATCGACGGTAATCAGCAGCACGTTCTGACCGCGGCCCATGTCGCGATATTTCAGATCGCTCAGGGGATACTGCACGCTGACCGCTTCCGGATTACCCTGTTCGACCAGACGACGCTGATACTCCTGCGCGTCCAGCAGGCCGTGTTTTTCGAGGAAACGACGCGCCGTCATCGGATAGGAGAGCGGCAGGTTCGCGCGCTGCATGGTAATCGGGCGGTAGAAGTTCGCATCTGCCCAGATATACATCAGGTGCGAACCGATAAATGAGGCGAAAAAGAGCGCCGCGACGGGCTTCGCATAATGGCGGCGTCGGGTCAGACTGCGGAGTTTTTGCCAGCTCCACGTGGCGAACAGCATTTCGATCAGCAGGATGATCGGCACGCTGATAAACATCAGCTGCCAGTCGCGCGCGGTCTCGTTCTGATCCGGGTTAATTACCAGCTCCCAGACGACAGGATTCAGGTGCAGGTGGAAGCGGGTAAAGACTTCGCTGTCGATCAGCAACAGCGTCATGCCCGCCGTCGCAAGAATGGCGGACAGGAAGCGCATCAGCCGCTGCGACATGACGATAAACGTCAGTGGGAACAGGATCAGCAGATAGGTGGCGAAGACCAGAAAGCTAAAGTGACCAACAAGGCTCATCCACGAGTAGATGCGCCCGGTTAACGTGGTTGGCCAGTCCGCCACAAACAGATAACGGCAGCCGAGCACCATCGCCAACAGAATGTTGAACAGGGCAAACCAGTGCCCCCAGCTGACCATCTGGGAGACTTTTTCACGGTAGCGCTGACGATTCGTCACCATAAACTGTTGTTCGTTTCCCTTAGTGTGCCTGGTCGTCGCTAACGGAGGACTGCAAAGCCTGGGCGAAAGATTTGGCGATCGCCTGACGCTGAGCCGGAGCAACGCTGGTATTGATCAGGTTGGTAACCATATTTCCCAAAACCATCAGGGAGAGATCGGTTGGCGTTTTATGTTTTTCCAGTACGTTGACCAGCTCACTGAGCAATTGTTCAACGTGTTCGTCACTGTAGCGGGAATGTTGTGGCATAAATCAAAATCAGTTTGTTGAGAGAAGGGCGACATATTACCGTAGCAACAGCTTTTTTTCCCCTTTTTTATCTTCTGTTGCACACATGCGTTGGTGGTGGTTGAATACCGCCCGGTCTAAAAGGAGAGTTTATCATGAGTCTGGAAATCAACCAGATTGCCTTGCACCAGCTTATCAAGCGTGATGAGCAAACCCTTGAAGTGGTGCTGCGCGATTCGTTACTGGAACCAACGCCTACCGTTGTCGAGATGATGGCGGAGTTGCATCGCGTCTACAGCGCGAAAAATAAAGCCTATGGCTTGTTCAGCGAAGAGAGCGAACTGGCGGATAGCCTGCGCCTGCAACGTCAGGGCGAAGAGGATTTCCTGGCATTCAGCCGCGCTGCTACTGGCCGTCTGCGTGACGAGCTGGCGAAATATCCCTTCGCCGATGGCGGTATCGTCCTGTTCTGCCACTACCGCTATCTGGCGGTAGAGTATCTGCTGGTCACCGTGCTGAATAACTTAAGCAGTATGCGCGTGAACGAGCAGCTGGATATCAGCTCAACGCACTACCTGGATATCAACCATGCGGATATTGTGGCGCGTATCGATTTAACCGAATGGGAAACCAACCCGGAATCAACGCGCTATCTGACCTTCCTTAAAGGCCGCGTGGGCCGCAAAGTGGCGGATTTCTTTATGGACTTCCTCGGCGCCAGCGAAGGGCTGAATGCCAAAGCGCAGAACAAAGGTCTCCTCCAGGCGGTGGATGACTTTACGGCTGAGGCGCAGCTTGATAAATCTGAGCGTCAGACCGTGCGTCAGCAGGTTTACAGCTACTGCAACGAGCAGTTACAGGCTGGAGAAGAGATTGAGCTGGAGTCGCTGTCCAAAGAGCTGGCAGGCGTCAGCGAAGTCAGCTTCCAGGAATTTACTGTGGAAAAAGGCTATGAGCTGGAAGAGAGCTTCCCGGCGGATCGCAGTACGCTGCGTCAGCTGACGAAATTTGCCGGTAGCGGCGGTGGGTTAACCATTAACTTTGATGCCATGCTGCTCGGGGAACGTATCTTCTGGGACCCGGCCACCGACACGCTGACCATTAAAGGCACGCCGCCGAATCTGCGCGACCAGCTTCAGCGTCGCACCTCAGGCGGTAAGTAA
Protein sequences of DBSCAN-SWA_1 >NZ_CP044107|2656359:2709677|2665649_2666474_+|WP_150391009.1|DBSCAN-SWA MSTKLTGYVWDACAASGMKLSSVAIMARLADFSSDEGVSWPSIATIARQIGAGESTVRTAISQLEKDGWLTRQQRRKGNRNASNVYQLNVAKLQAAAFSHLSDSDASKSEASKNDEKGGFHPSESGGDPSVNTTTDPSVKKPSCPVVKQPDPEVTITDNAILVLNHLNLVSGSRYQKSKTSLENIRARLREGYTVGDLQLVIDLKHEHWNGNDVQYQYMRPETLFGPKKFEGYLQSGIRWDKKGRPPRESWGEKKHDPMKFGPVDTKIPEGFRG >NZ_CP044107|2656359:2709677|2693069_2693672_+|WP_150391034.1|tail|DBSCAN-SWA MAQNTVEKIVLVRLYGKLGTLFGREHRLSVSSVREAIRALCIMIPGLERWLETSEERGVTYAVFNGTRNISTQDLHLNGVHDVIKIAPVIIGSKKSGVFQTIFGAVLVVIGAVLSFTPAAAASPFLYKMGAAMMLGGVVQMLTPSGTQGMTMDSGDTRKSYSFGAPINQSAAGNGVNLLYGERLIAGVLISGGIYAEEQQ >NZ_CP044107|2656359:2709677|2691620_2692370_+|WP_150391032.1|tail|DBSCAN-SWA MGIAADDQKLQPGNQITLFEVDGTAFGADVLYFHNHAVPYTEAEIIDAGDDESKLPGKPIYWQGIRYELWPCQIEDIEANGDGTPVSPKLSVGNLDGSISALCHLFQDMKQAKVTIHRTYAHYLDASNFPDGNPQADPTAEQLEVFYIDSKTADNETDVQFKLSSPVDVTGQKVPGRQMTSRCAWCLQGQYRGADCGYTGTKYFDKFGNPVDNPADDVCSGTVAGCKLRWGEDEQLPFGGFPAIAITRI >NZ_CP044107|2656359:2709677|2698340_2699660_+|WP_150391289.1|tail|DBSCAN-SWA MLYNTGTIAINGNTATGTGTNWTAPASQVRAGQTIIVMSNPVQMFQISSVNSATSMTVTPAASPALSGQKYGILVSDNISVDGLAQAMSQLINEYDENIGAWETFATTSANQNITVTINGTPVTIPGIGKLAQKGSNGALAIADGGTGATKPEDVRTNLGLGDDITANFGSLEIGAKKASSASFIDFHFLGTNDYDARILCGGNSNGGMGKGDFTFYAGKYTFIGDSFEFRNPITCQNSITASGSINAGGSLRAVTSSNVWASSDTQNAHVWFYGAGGIESRGVIYAPKEGTIRLRPDNNDNGGANGYSFSFGADGRFTCISVNQTSDERVKFDKEPVGNALEKICSLTGYTFGIQLTESESVRSAGIIAQDLEKVLPVAVSSGATCTTRTGEEINDLKTVDYSAMSALYVEAIKELAERLQILENKLAELRFFSTNLS >NZ_CP044107|2656359:2709677|2704569_2704923_-|WP_040242227.1|DBSCAN-SWA MALTDFGKAVRKARIDTGYTLLTMAKALGTTPAFLSGLETGTKKIPEKWVTAINELFAAQGYYIDDLDVLASIANESVPIEGLPKQQQMLVAGFAKSEFTHEELKKFAELLAEINKK >NZ_CP044107|2656359:2709677|2701289_2701847_+|WP_150391039.1|DBSCAN-SWA MQIGYVRVSTNDQNTDHQRQALERSGCEQIFEEKMSGTVANRPALKKLLRTLNEGDTLVVWKLDRLGRSMRNLVLLVDELRQRGIHFKSLTDSIDTSSPMGRFIFHIMSALAEMERELIVERTRAGLAAAREKGRIGGRRPKLTEEQWAQAGRLITNGVDRKQVAIIYDVAVCTLYKKFPASKPV >NZ_CP044107|2656359:2709677|2673608_2674661_+|WP_150391015.1|DBSCAN-SWA MKNTVNINSVELINADCLHYLATLPDNTIDLIVTDPPYFKVKPNGWDNQWNGDADYLLWLDMCLAQFWRVLKPTGSLYLFSGHRLASDIEIMMRERFNVMNHIIWAKPSGRWNGCNKESLRSYFPATERILFAEHYQGPYRPKSDGFAEKSNEVKQHVMAPLISYFRDARAELGVTSRQIADATGKKNMVSHWFGASQWQLPNEQDYEKLQELFTQIAIEKHGASELKAPHHQLVATWHSLNRKYLDLLEEYKSLRRHFSVTVAVPYTDVWTHKPVQFYPGKHPCEKPADMLRQIINASSRPGDVVADFFMGSGSTVKAAIELGRQAIGVDLEEERFNQTVSEVRQLAGE >NZ_CP044107|2656359:2709677|2675479_2675995_+|WP_150391016.1|DBSCAN-SWA MKINQGLIGVVVIAVLSVALVKSCSDASSLQSDNDVLRSDNSLQGQVIATQAFNFNRFNQVAEHANRLNSLIDTRTEETVIEYREILRREKTCDLPVPADIAGGLLEYAYRLRSSAMHADTDGTDAADDSTAAARSITYCQAVLWIKPLLAVIEKGNNNFAGIRQVEQERQ >NZ_CP044107|2656359:2709677|2661060_2661600_-|WP_150391285.1|DBSCAN-SWA MSYIQTLSGKKFDYLNSTTDDVEIEDIATALSHICRFSGHLPEFYSVAQHSVLCSQIVPPEFAFEALMHDAAEAYCQDIPARLKALLPDYRRIEERVEQVIRAKFSITPDMSEVVKYADLVMLATERRDLDIDDGSLWPCLEGIPASDIIQIVPLRPGQAYGLLINRFNELTESRACLA >NZ_CP044107|2656359:2709677|2680868_2681048_+|WP_058609097.1|DBSCAN-SWA MLTAIITFMIGLFGAALISFGAWMVFPPAGVIAAGLFCLLASYFAARAAAPANDSPGGN >NZ_CP044107|2656359:2709677|2674946_2675483_+|WP_045347923.1|DBSCAN-SWA MSNKAKLSAAVLALIASGASAPLIFDQFISEKEGNALVAVVDPGGVWSLCHGVTVIDGRRVVKGMTATEEQCRKVNAIERDKALAWVDRNIKVPLTEPQKVGIASFCPYNIGPGKCFPSTFYKRINAGDRIGACEAIRWWIKDGGRDCRLTKGQKNGCYGQVERRDQESALTCWGLDQ >NZ_CP044107|2656359:2709677|2708307_2708535_-|WP_001135586.1|DBSCAN-SWA MPQHSRYSDEHVEQLLSELVNVLEKHKTPTDLSLMVLGNMVTNLINTSVAPAQRQAIAKSFAQALQSSVSDDQAH >NZ_CP044107|2656359:2709677|2708669_2709677_+|WP_015572297.1|DBSCAN-SWA MSLEINQIALHQLIKRDEQTLEVVLRDSLLEPTPTVVEMMAELHRVYSAKNKAYGLFSEESELADSLRLQRQGEEDFLAFSRAATGRLRDELAKYPFADGGIVLFCHYRYLAVEYLLVTVLNNLSSMRVNEQLDISSTHYLDINHADIVARIDLTEWETNPESTRYLTFLKGRVGRKVADFFMDFLGASEGLNAKAQNKGLLQAVDDFTAEAQLDKSERQTVRQQVYSYCNEQLQAGEEIELESLSKELAGVSEVSFQEFTVEKGYELEESFPADRSTLRQLTKFAGSGGGLTINFDAMLLGERIFWDPATDTLTIKGTPPNLRDQLQRRTSGGK >NZ_CP044107|2656359:2709677|2660025_2660598_-|WP_150391284.1|DBSCAN-SWA MNDLMIDLESMGKKPNAPIVSIGAVFFNPQTGELGQEFYTAVSVESAMAQGAVPDGDTILWWLKQSPEARSAICVDDAMSITDALSELSHFIHRHAYNLKYMKVWGNGATFDNVILRGAYERAGRICPWEFWNDHDVRTIVTLGRNVGFDPKRDMLFIGDVHNALADARHQAKYVSAIWQKLIPATSTNE >NZ_CP044107|2656359:2709677|2667797_2669735_+|WP_150391286.1|DBSCAN-SWA MREIIVDNFAGGGGASTGIELAIGRSVDIAINHDENAIAMHKTNHPDTLHYCESVFDVDPIAATSGKPVGLAWFSPDCRHFSKAKGAKPVKKEIRGLAWIVLRWALAVRPRVMMLENVEEFKTWGPLLEEELRPDPARAGETFEAFVGMLSTGIAANPPALAEVCEFLAIEPHGQQAQQLIAGLGYEVDYRELRACDYGTPTIRKRFFMVMRCDGRQIHWPEATHGDPKSLEVQSGKLAPWRTAAECIDWSIPARSIFDRKNPLAENTLKRIARGIQRFVIESSSPFIVKCNHTTSHGRYDCFRGQGLEAPLQTITKTHGYALAVPHLTKFRTGATGQPVTEPVPTVTAGTSARPGGNGHALGVVEAALTPFLAGNGGSEYQAKPRPLDKPAHTILKQSRACVVAPVIARQFGASVGHRADEPSATITAGGGGKSQLVTPTLIQMGYGERPGQEPRVLQLNNPLGTVTAGGNKFATVSAFLAKHYGGNYTGPGVGMDEPAHSVTTVDHHAVVASHLVKLRGTCRDGQTMDTPMPTITAGGQHVGEVRTFLETYCGDSEDEWLVTIEGVKYQIVDIGMRMLQPHELYKAQGFPDGYVIDQDYRGNRYAKDKQVARCGNAVPPPFARALVEANLPELCANQKAGAAA >NZ_CP044107|2656359:2709677|2705028_2705700_-|WP_150391041.1|DBSCAN-SWA MCGRFAQAQTREEYLAYLANEADREIAYDPEPIGRYNVSPGTKVLLLSERNEQLHLDPVLWSYAPEWWDKPPLINARVETAATSRMFKPLWQHGRAICFADGWFEWKKEGDKKQPYFIHRADGQPIFMAAIGSTPFERGDEAEGFLIVTSAADKGLVDIHDRRPLVLSPEAAREWIRQDIGGKEAEEIAADGAVPANKFIWHAVTRAVGNAKNQGKNLIEIIN >NZ_CP044107|2656359:2709677|2687464_2687683_+|WP_150391288.1|DBSCAN-SWA MLFTLADRWGEKDPRKIAELPANILTHWQAYFELLKTEAETPAPVNSPPVTAAQSESDQQFADCFRILGHGC >NZ_CP044107|2656359:2709677|2678801_2679125_+|WP_080975379.1|terminase|DBSCAN-SWA MAIDAWKRTCKILINRGSFEMEDCYLLMEYCNTVQLLYDANQEIKADGIGDETAAGGQKMGAAVKARDKYISQLIRLSVVLKLDPNSRARKRTPGEDSKSGNEFDEF >NZ_CP044107|2656359:2709677|2672412_2672946_+|WP_150391287.1|DBSCAN-SWA MFQAFQTAVFKNVAEHLKGRFPHKSLELVYADELILKDMEFLKTNNKLRWDPGLKSRVFIDMMEEHPIKLVVYYRGEPIGFAFGCYYKPKNAVHVCWMEKRNDAHEDLDHQMLGIVLDCFAAYAQFLNHQGETIDTIALVSPVDGAMRYYTESGFEYIADYERGGCAMVLRNTLQSK >NZ_CP044107|2656359:2709677|2662620_2662992_-|WP_150391006.1|DBSCAN-SWA MSNDRMTVVPDFLVELDAGVFMNKIAAALNTTALGVLNNGTKGKVVLTFDIERMGNSVEEKRVKIKHKLNYVTPTPRGKASEEDTTETPMWVNKGGKLTILQEDQGQLFGINGGVDGKLKAAQ >NZ_CP044107|2656359:2709677|2664503_2664767_+|WP_150391007.1|DBSCAN-SWA MQSPLRNLRKSQGLTLSHVANVVDIDPANLSRIERGQQIASLDVAERLVKFYSGQIDELQILYPHRYTQATESGAASVPQEKGESRG >NZ_CP044107|2656359:2709677|2657072_2657333_+|WP_006811332.1|DBSCAN-SWA MVREKLKTPEGRKFLLALLVVFMIAAACVGRATIVGVIEQYNIPLSAWTTSMFVLQSAMIFVYSLVFTVLLAIPLGIFFLGGREKH >NZ_CP044107|2656359:2709677|2676936_2677686_+|WP_150391018.1|DBSCAN-SWA MITLLSSGLFSYAKSSSRTNQDSILAPQCIDNGYLIAVADGVGSYLGAEHASQTAIKYLANLINASSIQDLDSLFATIKEKISALSDADESYLEAATTLTFAYVNNQGLYIGHVGDCRLYIKKDNKLKQLTKDHTQHQKLLDQKIFNKKELKDMGGKNTLTTAISKVIPLEFQQTFIPASEVFEDSEEATLYILSDGAHHFWDKRPRFSITTLSNPNSFAASLHKRIIRHGPIDDFSLVVAKFKRSSVN >NZ_CP044107|2656359:2709677|2656359_2656869_-|WP_032103230.1|protease|DBSCAN-SWA MTVKNAPKFAIALIAAACASSSAFASETRKEQPLEKVAPYPQADKGMKRQVIQLPAQQDEANFKVELLIGQTLEVDCNQHRLGGQLESKTLEGWGYDYYVFDKVTSPVSTMMACPDGKKEKKFITAYLGDNSLLRYNSKLPIVVYTPENVDVKYRVWKADETVGQAVVR >NZ_CP044107|2656359:2709677|2673143_2673329_+|WP_150391014.1|DBSCAN-SWA MKHHEQIEIEAAKVVAELFAGNASPMESFGITWSQTQMLERKNPGVVIKLTPDDGRKLAYC >NZ_CP044107|2656359:2709677|2657640_2658498_+|WP_150391001.1|DBSCAN-SWA MKKRTKKHTNIRYRDHTEIEKTYISLYKQFCRACEISIVISLNQGGIDTDGRGRRAADIFTRQVLTVHSLKKLLPVLRNGNDPEDTVWDLTTIALVSRSVMENFQALFFYGTETISESEADLRFRIFQKDRNVKWRDIRMKAGESAEELEEFFTGLAEQQKIIVNHEFYHSLSKEQKNSLKNRAEMYYSKAEFEAHCPRLANIALHHQLLSNLAHPLPLAIHRIDEVSGHGSPSEADIRLVVISLNVATHCLIASIEEMGKKFSDSIGKQYRSMINELSDYPALN >NZ_CP044107|2656359:2709677|2704069_2704552_-|WP_063157296.1|DBSCAN-SWA MHLMRGTRVRPLSEMEIKAIAVNACFALGFTSKYKYRRRPRRFDVALETLSQWNIVLDPMDDDEWFQETLGLTIGHCEPDKLTIRVPNHIYEMACGGERSALFVIFHELGHLLLQHKPALHFANAMPEQNEDSEWQADLFAETMLDKLGYQTAQLCFEFY >NZ_CP044107|2656359:2709677|2685113_2685437_+|WP_042889858.1|head,tail|DBSCAN-SWA MILTLAEAKTQLRLELDFDEHDSLLTSLIDAAQRSIERSYYCKLVENQAQLDALPDGETGYIIDEDIKLAAKMMVSQWYLNPTGTAEGSPSDLGVEYLLFPLMEHTV >NZ_CP044107|2656359:2709677|2686569_2687043_+|WP_150391027.1|tail|DBSCAN-SWA MADQEIKSPSEYATLPAGTEVRYGQKGATIATAALLQSAMAIGATGKKGVFMEVTRLIDREPKYMADMGEGEDKTLVFIDDPSDTVQEALLSDADAKKTVVFFMKFPNKRISEVELVLAGWSLQSVDTPKGKVLQVEVYGKQNSVKWSVEQPAGGSD >NZ_CP044107|2656359:2709677|2684680_2685133_+|WP_150391023.1|DBSCAN-SWA MATKTKNTQKDDTATDANAEPAVTTAAAADTSAPVPDVNAGSAGDAGGDGDGTEPGPDGDDTDSGGDAKQDETPEERMSKLTGKVASVQNGRVAVTFLGPFSRYSRGDVACFDRPVAQDMVDRNIAVWVKDAERALQPNKDDDAHDTDIG >NZ_CP044107|2656359:2709677|2692372_2693089_+|WP_150391033.1|DBSCAN-SWA MLSQRLITAIEKHAAAAYPNECCGLIIRATRQRRYIPCSNAHENPTEHFMISAQAWADAEDMGEVLAIVHSHPDAGPHASSDDLKSCHDSGLPWVIMSWPGGEYTVATPADTPPILKRPFIHGSWDCYGLIRDWYQQERGIELPDFHRDDNWWTRGENLYVKHYADAGFYSHADELQVGDVILMQYKAEEINHAGIYLGDGKMLHHMYGKLSEVVPYGGMWRERTMLTLRYQDGTEHS >NZ_CP044107|2656359:2709677|2658590_2659769_-|WP_150391002.1|DBSCAN-SWA MAARPRKKEYRHLPDYLFFDKDRGVYKFTLVTGKKKNIGKDRAMAIAIAREYNLRMRPELSPSVDNLIRESGGVTGEAKPFADHMDHIMARAVEDERPSPSTLDDWNNDALRVKEFFINIPACDIELEHVNAYINKYHAGASANVQNRKVSFLKKLFSYAVDESLMLDNPATRKKMRRTEEKKRQRLSLEHFMAIRRAAAPWLRTAMDLALQTTHARLEVSRIRYSIREPKNGVCGCVWLEQPEDGIYGTLYIHRQKVQKKEASHVAIPIGDELKRIIDDSRDNVASPYVVHRIPERQVKRSKEVSHPTQIAPDYLSRSFSAVRDRLGICSHLTMDERPTFHEIRALAAHLFDRQGIDPQGRMAHSDAKSTKIYTQNHINWVVVPHGEIKAS >NZ_CP044107|2656359:2709677|2659771_2659981_-|WP_150391003.1|DBSCAN-SWA MAKLMKASQWGRREFTNDSVPDNRTIKRWVENGLLMGRIVDGSVFVCETEKWGVDSMVSQAVRQLINEG >NZ_CP044107|2656359:2709677|2697314_2698280_+|WP_150391036.1|DBSCAN-SWA MPTGLLIELNDGGKRMEITAGLRCPSFGGSFDTGYQKAKYVDIAGYVSGSQVLFIPHATAYVDSGLWHKMNSITISGGRVTQNSRMQALGISERDSTYTFPGSVWQIFPTGQRNGVGLLIGDSTDFLAITNATQSGQCIWKGTVNVPTAGWAVPAIAGYDKSKYIVFGRCNSGNTIDFDGNTVRFFSPPSTNDDAPATGTIDIVIFASGIAPQPGTGLNIFNAAGVCTFSTTRRPFVYLNQLWTPSTSAVSIGNGYVPLGRFGLMVHTVNGMYVYRMFGIKIQNGSASVQGGKYLGREQYAIFGNNTVTSLSLPVLPDMYV >NZ_CP044107|2656359:2709677|2687672_2690645_+|WP_150391029.1|tail|DBSCAN-SWA MAADVASLAVALHLNSASFKSQFADAMRTADSSAQQFNRKVQTDNQKTRQSFEGLGKGITGLDADFNKLGKTVDKRLTGLDEMRGLLANISAGSTVAGSSITTALVSALSEGMSTALDNSITGLKSQRQAQIEFTQAQISAAQGSIENARQLRAEAIEKQNIAVKTIEAARADRERAFALDEHFAKQAEVNKQYGLAVSYEAEHVKNARTIQEANLAEAKAKGSLAEATKTMLAADIAESAGKQQLATSTRQLAVASQDLSLGQRAAAASAGLMRGAMAMVGGPVGLAVIAVASAVTAIYSAYSNSEAVIKGYTQALQKSGQQSVMSVMYLQNLTSSLGDSDRAVKAVTASVSAGFGGNMLEQVASLGTRMEEIGQSSDDLVSLLSSLKGDPLQALQKLTDQGILLNGSMIDQIVTLERQGKTSEATALLQQAAMNDLDSKLKEQESNVGGLKSAWKSLKDFVSDAFKTMGDAHIATAQAMAAGAGVDLDTTPDPAIKQREEAEKQYQAQKKQREEISKRLKDENTLSGLLKAGTSREKERADAVALVNANFTKGTSEYTQAMRGIDKMYAEQKKVRAKAYSDDAATTRLNQLRQEEAALRSQNEQTETLTQSEKKLAQFNQEIADLKEKRILTAGQRSILAQETELRHQLEINASLDKANQQRKLGLQIQERNQELYRSTLQLQQEYANRVAQMTMSSDAYDQMVAEQQVRERFAKLREEQDKTIADHSSELYRKQTEVLRDEEQKQLEIVRSGAERKKQVDGSWFDGMKKGLTDWRVDAENQFTQARDIAINAMEGMGTALWNVASKGKGDFKSLAVSVIDDIGKMITQMAMFNAIKSGSKALGIESWFGWADGGYTGDGGKHDVAGVVHRGEWVVPQSVVKKPGMLGFLNQLTYGNGYAEGGLVGGGVAKPSGDSYSQPPAGQGSIHFSLTIPLQVIHQGGANQEPTSKSQEFLSSETKALFKQFVLETLDRELANGGMIDTKMRTA >NZ_CP044107|2656359:2709677|2664759_2665308_+|WP_150391008.1|DBSCAN-SWA MGNEPEWKVDKQPAWLVAAIKKTITELPGGYSEAAEWLGVTENALFNRLRTDGDQIFPLGWGMVLQRAGGSNHIANAIARHSNGVFVPLTDVEEIENGDINQRLMESVEWIGRHSQYVRKATADGVIDAQERAQIEENSYQVMAKWQEHLTLLFRVFCAPEKNDARECAAPGAVADKSCMEK >NZ_CP044107|2656359:2709677|2702441_2703710_+|WP_150391040.1|DBSCAN-SWA MFALCDVNSFYASCETVFRPDLRGRPVVVLSNNDGCVIARSAEAKAAGITMGEPFFKQKDLFRRAGVVCFSSNYELYADMSNRVMTTLEEMSPRVEIYSIDEAFCDLTGVRNCRDLTEFGKEIRATLLKRTHLTVGVGIAQTKTLAKLANHAAKKWQRQTDGVVDLSNIDRQRRLLAIVPVEDVWGVGRRISKKLNAMGIKTALDLSEQSTWIIRKHFNVVLERTVRELRGEPCLDLEEFAPAKQEIVCSRSFGERVTEYEQMRQAICSYAARGAEKLRGEHQYCRFISAFVKTSPFALNEPYYGNSASVKLLTPTQDSRDIINAAVKCLDKIWKDGHRYQKAGIMLGDFFSQGVAQLNLFDEHAPRAGSDKLMDVLDQLNAKEGKGTLFFAGQGIQQQWQMKRQLLSPRYTTRISDLLIVK >NZ_CP044107|2656359:2709677|2705925_2706165_-|WP_063667174.1|DBSCAN-SWA MFVELVYDKRNVQGLDGAREIILAELTRRVHNIFPDAEVKVKPMQANGLNSDASKSDREKLNRMLEDMFEESDMWLTNE >NZ_CP044107|2656359:2709677|2700532_2701159_-|WP_150391038.1|tail|DBSCAN-SWA MQVGAFGLPTGDARVFKTDSAVAPDGTNFNSLTEPGTYRLLLDMSKSTSGPISSWYGYVQNYVRIADVALTQFCLPYPSQTDAGRFFFRGYNTNAWSPWKEIMTSAVSDRTMKNVGDDLDLEEALLNICRMEFKHFTFKDDETQTPRRGVISQQIETIDPEYVKEIGGILHLDQTPMLLDALAAIKTLALRIMVLEEQSNTKASRSEG >NZ_CP044107|2656359:2709677|2702019_2702439_+|WP_142503495.1|DBSCAN-SWA MEFFRPTELREIISIPLFSDLVQCGFPSPAADYVEQRIDLNELLVAHPSSTYFVKAAGDSMIEAGISDGDLLVVDSSRTAEHGDIVIAAVEGEFTVKRLQLRPTVQLNPMNSAYSPIIVGSEDTLDVFGVVTFIVKSAG >NZ_CP044107|2656359:2709677|2669749_2670442_+|WP_150391011.1|DBSCAN-SWA MNQLTAKGVVTMSSREIARLVQSKHGDVKRSAERLASAGILTAPLAHTPYTHPQNGQTYEEYWFNKRDSLVIVARLSPEFTAAVVDRWQELENSQAVSVPQTLPEALRLAADLAEQKEQLSQQLAAAAPKVEFVDRYCTAKGSMSFRQVAKLLQAKETDFRLFLIESGILYRLGGVLTPRHQHIAAGRFEVKTGTSSETNYAFSQARFTPKGIEWIGGLWTAHIAKEHAA >NZ_CP044107|2656359:2709677|2657271_2657547_-|WP_150391000.1|DBSCAN-SWA MYRADFAKKALFVYTCVLRLPPAMPRFSAFYERCHGVSGVGGSNPLVPTKTTQKNQPLMAGFLLPVSACAMLSVFHAHRERKSPAESPAGR >NZ_CP044107|2656359:2709677|2661736_2662567_-|WP_150391005.1|DBSCAN-SWA MSQILDGNALQQVKDLVLSGYHLDAAKVTACPTALLPEGVHVESLERFEFERFRFRGAMTTTSIPDFVRYAAGYAKADEPARCFIDADNMTARSMFNIGTLANPGHADNIASITLKKTAPFRALLQVNGDRLGQKEIAEWLEDWADFLTAFDADGNMLSIAQAAGAVRRVNIKQVSESAHEDEDFGGKKSLMQSVEASSKDVMPVAFEFKCVPYEGLGERRFNLRNSLLKSGEPVFVLRIVQLEAQEEAIANEFRDLLIEKFTDKPVETFIGNFKA >NZ_CP044107|2656359:2709677|2706527_2708288_-|WP_150391042.1|DBSCAN-SWA MVTNRQRYREKVSQMVSWGHWFALFNILLAMVLGCRYLFVADWPTTLTGRIYSWMSLVGHFSFLVFATYLLILFPLTFIVMSQRLMRFLSAILATAGMTLLLIDSEVFTRFHLHLNPVVWELVINPDQNETARDWQLMFISVPIILLIEMLFATWSWQKLRSLTRRRHYAKPVAALFFASFIGSHLMYIWADANFYRPITMQRANLPLSYPMTARRFLEKHGLLDAQEYQRRLVEQGNPEAVSVQYPLSDLKYRDMGRGQNVLLITVDGLNYSRYEKQMPALAEFAENNIVFTQHMSSGNSTDAGIFGLFYGISPSYMDGVLSARIPAALITGLNQQGYQLGLFASDGFNSSLYRQALLSDFSLPAAQSQSDDRTADQWIDWLKRYAQEDNRWFSWVAFNGTTLDDSNQKGFARRYSRAASDVDAQIGRVLTALRDAGKLDNTVVIITAGHGVPLGDEAKGMEWSRPNLHVPLVIHWPGTPAQRINMLTDHKDVMTTLMQRLLHVSTPAIEYSQGQDLFSATRRHNWVTAAGGNTLVVTTPTLSLVLNSNGNYQTYNLEGEKLKDQKPQLSLLLQVLTDEKRFIAN >NZ_CP044107|2656359:2709677|2678190_2678553_+|WP_150391019.1|DBSCAN-SWA MPPRIKRPCRHKGCAALTNDPSGYCDEHRQQHAGEGWRNYQNGKSRHERGYGRPWEVRRARILQRDKHICQACRRVGIARRASTVDHILAKAHGGTDDDFNLEALCWPCHRAKTARERLR >NZ_CP044107|2656359:2709677|2671521_2672214_+|WP_150391013.1|DBSCAN-SWA MNLQELEFTRIELRRALADLSGSTKGQLQAFSEHPPADKNKYPRHHPEIVMEGGEGCGSKVVKTLATPLYVLETRSRRRPLPPIKDTEFACSAWRRSVNGLGEHLQAWVRYCYGYDLTFRYQTLMCQYVWEQFQRQHSGKQIQGRVTKKLIGLVWLAAQEVAASRNNDTYQEYAGAALARMVSVERSTWLRVYSGHWAAFKALFAEMDSQALSEILSRYEEFQELKVAEM >NZ_CP044107|2656359:2709677|2665480_2665660_+|WP_032676944.1|DBSCAN-SWA MREVNRRFKDHRGIPVRVIRWEPETQRVIYLRDGYDHECFSPLEQFKRKFTELKDDHEH >NZ_CP044107|2656359:2709677|2682339_2683257_+|WP_150391021.1|DBSCAN-SWA MSKLINLPHLADQVFGVPHYATRQIMDSVKSILVPRLQGMNVAPLEMALGPDESQEANEPQQSGGGVGVIPVHGILVPRRGQIVNMCTELNSYERIRGQLAALLNDPGIKEIVLDINSGGGAVSGCKELADYIYQSRSVKPITAIVNFSAFSAAYFIASACSKIIVSETSGVGSIGVILEHMEASKWEESVGLKFTTFSRGDNKNNGSPHEPLTELATAQIQAMIDGAYQTFTSSVAQYRGIDIDAVIGTQAALYFGQNAVAAGLADEMSDPQSAINAIVAKYKPSPQQSSIQLRAAVMDQQARM >NZ_CP044107|2656359:2709677|2676151_2676940_+|WP_150391017.1|DBSCAN-SWA MEERHHSYVLKTIEEIGRGGFGYVEKIELFNVNGHKCGDYAKKILAQDHGLSKEDFKRRFKREVDYQARCTHSNIAPIYLHNLQVDSPWFVMDLAESDLSTDLASGTLDNASKMHIAEMILSGVRFMHTEKTDDPGRKPVYLHRDLKPSNILRFKDGVYKISDFGLVKNAGKERPESELLTRVATAMGTLKYMAPEITTAGHYSEQTDIFALGVVIDDMGFDNVNGIRQLIDKCTAWRAASRYKSVDDMMQELADIKIRNGL >NZ_CP044107|2656359:2709677|2670438_2671500_+|WP_150391012.1|DBSCAN-SWA MRGLFTAETVPRLGLVVLKPGSELMSLFQQGRVLVEPQPKSMAGLPSGLVPYARQPLAEDKSLEEFFTDERVIRAAGGLTALESWLERNVKECQYPHTDYHHHELVTMRHPPGSMLLCWHCDNQLRDQTTAALAELARRNLINWLISSIQSSLGYNNERELSLGELCWWAVYSGIADAITERMAQRALRLPDEPFLSVYRESDIVPMLPAKNILQKKVTPALTAAKLKDGANQEVAYDQPKVLALHADPESPESFMLRPKHRRWVNEDYTRWVKTQPCEGCRRPADDPHHVIGHGMGGTATKAHDLFVIPLCRECHDKLHADVAAFEKKNGTQLELLFRFMNRALAIGVITKA >NZ_CP044107|2656359:2709677|2685433_2685775_+|WP_150391024.1|head,tail|DBSCAN-SWA MSDPLRPGELNCRITLSYVETERGELGETLPAREVIAGNAWSKKELVSGRKVRTLDQQQVVETCLFTLYPREVDVDWKVSTADRVYTVRNVERLTDRIIITGEADSRHDRVSN >NZ_CP044107|2656359:2709677|2690975_2691542_+|WP_150391031.1|DBSCAN-SWA MQNQYKLEGDEYHFAMILSLKKGNVVMLKICGFAVLALGVICIIMGLDMDVTVSSGAQMNVYNTGLIASRQMTISIGCSLMVTGAILLSGGILKEAIIKSVLPQTKAGTESPVQESQFVEKTTDGSYILSENAVRHYAEKLHNEMPDNTALSVMVTNAPHIERIKSAMSPELAKKFERLLETHLQAIK >NZ_CP044107|2656359:2709677|2679159_2680875_+|WP_150391020.1|terminase|DBSCAN-SWA MAAYPSVNMANQYARDVLNGKILACKSIQLACQRHFNDLKISLDKDYPYRFDRELAERACRFVQLLPHSSGDLAGQKLKLEPWQAFAFSSIFGWVTKKTKKRRFREAYIRVARKNGKSFFAAGIGTYMFCADGENSAEVYCGATTMAQAKKVFTPARQMADRLPSLRSKFNISVWVDSLTRPDGSLFAPIAGKPGDGDSPHCAIIDEYHEHDTDHMYEAMTLGMGARSQPLTLIITTAGTSLESPCYDKDKQVKEMLNGHVPNERLFGLIYELDEGDDWTDPTNFIKANPNLDVSISFDDLLAEMEVAKQVPRKVNAFKTKRLNIWVSGKAAFYNMTQWHAAADKSLRYEDFAGEDYYLGLDLAQRLDLNAGVGVFVREIEGKKHYYCISPKFWVPEDTVRSTDPKIAKTADRYVKFVEMGALEATDGAEADYREILASIIDLQEINKVRISEIPIDPSGATALSHELQDHGFEPISIRQDYTNMSPPMKELEAALAGGRFHHDGNPVLSWCISNVIGKNVPGSDDIVRPTKGDKQSKIDGATALFMAIGRAMLNGRASNQSVYDEEDVAC >NZ_CP044107|2656359:2709677|2660875_2661073_-|WP_150391004.1|DBSCAN-SWA MPRMKIKELVAAAHAAAGKLPPAEASLMREVATRLDVTFAALTESMDQRMSLDAEINHLRQESVH >NZ_CP044107|2656359:2709677|2690644_2690992_+|WP_150391030.1|tail|DBSCAN-SWA MALQTFTWSPRNGPVGDIKYRTSSVQYGDSYEAITGEGINPETQSWPLTFTGMNEDMKPVLKFLREHGEVKSFKWTNPLGELGLYRASQLKVSALDFARMTITVTFATAYRAEPI >NZ_CP044107|2656359:2709677|2686141_2686552_+|WP_150391026.1|DBSCAN-SWA MQMDIKFPTGKEFDQLLESIEKKVGVKLLRDAGRAALAVVEQDMRQHAGFDEESIGPHMRDSIKIRSTNVAETSRYNTIVTLRVGPSKIHHMKALAQEFGTVKQVAAPFIRPALDYNVQKVLKVLAAEIRLGLEGR >NZ_CP044107|2656359:2709677|2685755_2686136_+|WP_150391025.1|DBSCAN-SWA MIESAIKTAVERITGLDTYPLLLPDTVQEGATFQRISDPQVGDGLRRTGLSAVRIQLSLYVVDRYTSLLQFDGALWAEWKGIVHGLLEGQPVQYVERGGIQQGKTTLPNNRIQFRLVRDFIFTVPE >NZ_CP044107|2656359:2709677|2674731_2674947_+|WP_000286102.1|holin|DBSCAN-SWA MYRMEKITTGIAYGASGGGTGYWLLQLLDKVSPSQWAAIGVLGSLMFGLLTWLTSLYFQIKADRRKAARGE >NZ_CP044107|2656359:2709677|2666470_2667805_+|WP_150391010.1|DBSCAN-SWA MNENKYCRALAELRSRPAHELKEVGDQWCTPDLLFWGINAMFGPLVLDLFADDSNAKCPAWYTAEDNALTQDWSERLAELGGAGFGNPPYSRSQYHDKQAVTGMTHIINHAMAMREKGGRYVFLIKSATSETWWPEEADHVTFIRGRIGFDLPTWFMPKDEKQQPTSAFFAGAIVVFDKTWRGERFSYINRTDLEAKGRASMLLAQFAVGRTQTDAAPELDAEVVPEKSEAELPLTQKAILETSGVEAWACVVAAFGEKDEYTFSESKFGHTWAADSLENPEFTNVSPLTIDIAKKLISESILVGVNAWLETLPFDSDDMKQDMSERLRTVAVESAKEYGINHSEFIATMVSLDKAKWSNIRRIRAHVRETQESKDKALNESRVWPLEVGLVFNQIEGADALPVSQQNKLKANINQLWLERMPTSEIITTAGGLFNSMQGAVNA >NZ_CP044107|2656359:2709677|2663710_2664406_-|WP_069598597.1|DBSCAN-SWA MNIGNRIRELRLEKGMKISDLAEAVGIDGANVSRVETGKQKSFTEQSLSKYATALGVSVAELFTPSPNETTVCKSSGKNPAYGEGDPVFRVELLDVSASAGMGHIQGSDVVDVIRSIEYNNERAAVLFGGRTPDTVKVINVRGDSMADTIEPGDLIFVDISINEFDGDGIYVFGFDDKIYVKRLQMIPDKILVISDNPKYREWSVDKSNEDRFYVFGKVMISQSQSVKRHG >NZ_CP044107|2656359:2709677|2699807_2700494_+|WP_150391037.1|DBSCAN-SWA MPKTHHYKWFLSTFFAMHNFGFTLATIKSFNYQTEKAIRAMIDDYQKNGPAVFDYIQDKQGIIDSIDSIPESLVESTDKIHTVYHPSLMRRSAFLTIFGMIEHEVDNVCDNFSKKHKVSVKVNDLKGNGFERSNLFISRIIGLKDSQHYAQIKRIIKLRNSCAHNDAKFITPDGNEIKEITRLMNDFPKYFFKDGSSVGFHPNVLDFFTESLEAYLLEIEVALNKHQK >NZ_CP044107|2656359:2709677|2693737_2697313_+|WP_150391035.1|DBSCAN-SWA MAVIRGSKGGGGGGDKGGNRGTEIASVAYMKILLALTEGEAAGDFTGKDIYLDGTPLLDDAGNENFPGVTWEWRSGTVDQDYIAGFPAVENEITVGTELKYGTPWVKSINNTQLSAVRLRLKFPNGVYKLRDSGGKDGYRIEFAIDISTDGGPYVEYGTDEADGIADAGYERSYRIDLPAATSGWQIRVRRLTENTTNGRHADISRIESMTDIVDAKLRYPHTSLLFIQFDSKLFDGRTPNVTVKMKGIIVRVPANYDPISRTYSGIWDGTFKWAWTNNPAWIFYDLVLNKRYGLGKRITADLVDKWTLYQIAQYCDAPVSDGAGGKEARYLCDLYISQRTDAWTVLMDLANIFRGMISWSNNLLSVDADMPRELEPDFVFNKSNIVGAFNFSSTSEKTNYSSAIVTYSNPANGYQDDQASAWVPEISSRFGFNTIELTRIGCTRESEAQRHGLYAIETNRDDNAVEFKTGLEGRIPRIGKVIGINNAPQAGRDNGGRVSAASGTKVTLDRITTAKAGDTLIVNLPTGKSEGRQVVSVSGRAVTVEKAYSVTPNAESAWVLDQPDLAIQLFRVKRLAVNSDNTVTINGLPYNPNKFPRVDDGAVIEDRPVSVVPPRGQGMPENIAISSVYRVEQGIGITTMVVTWDTVKNAVAYEAQWRQNNGDWINVPRTGNTRFEVDGIYAGRYVVRIRAVNALDIASLWATSAETELTGKVGKPPMPVNLSTQSLVFGIGISWGFPSGAQDTQKTEIHYSTTANGDSPLLLADVPYPSSTYQQMGLLAGKSFWYRARLVDRLGNQSDWTEWVFGQSSTDVSDITDSILKEMEETGLLKDVVENAVDSNEKIAAMVDDIKQANDELELQAKDIAQNAQNIGKVQTSVNELSSTVGDVSSSLSQLEQTVATEDAALGQRIDSISVSMDGMTGGVKNSAIAIIQNGLAQVATRKRLSATVAGNSAQLDRIDEVIVNEKEATARSLLSLQTDVNGNKASINSLNQTFSDYQQAMATQVNSITATVNGHTSAITTNAEAIANVNGDLKAMYSIKVGLSSNGQYYAAGMGIGVENTPSGMQSQVIFLADRFAVTHQAGAQVTLPFVIQNGQTIIRDTVIGDGTIGNLKIGSYIQSTTWDGTGNVGWHINKSGYATFNNVTVRGSIYATNGNFSFNGSGNTTVINGNGLTVNIPGGGRIVLGTWT >NZ_CP044107|2656359:2709677|2683325_2684621_+|WP_150391022.1|capsid|DBSCAN-SWA MNKIEELRRQRAGINTQVQALAQIEINGGTLSAEQLEQFTGLQAQFDEISASIERLEAAERLAATTAVPVKVAQNGRNAPAVQVKAEPDQYKGAGMTRMVMAIAAGKGDLQQAASFAAEDLNDQGLSMAITTAANSGGALVPQNMQNEVIELLRDRTIVRKLGARTVPLPNGNLAIPRLASGSTASYVGEGKDVKASGATFDDVKLNAKTLITMVPISNQLIGRGGFNVEQLILGDIISGISTREDKAFLRDDGTNDTPKGMKAVATAGSRTLPWVADEEVNLQTIDTYLDALILMAMDGNSNMLKCGWGMSNRTYMKLFGLRDGNGNKVYPEMTVGNLKGYPIERTSAIPANLGTGGKESEIYFADFNDVLIAEDGAMVVDFSREATYIDADGNTVSAFARNQSLIRVIMEHDIGFRHIEGLALGTGVTW >NZ_CP044107|2656359:2709677|2681047_2682307_+|WP_058609096.1|portal|DBSCAN-SWA MFIPQFFRGRSRPGGSNWTTVLGSVSASKSSSGMLVTPETAMGIGAIRACVTLLAESIAQLPVELYQRDEKGGRRRATDHPLYDVIHSQPNRKDTSFEYYEQQQGVLGLEGNSYSLIDRHGNGDIAELIPINPKKVIVLKGPDGMPYYELPELGETVPMRMMHHIKYFSLDGYIGTSPIQTNADVLGLGMAVEQHAAQVFARGTTMSGVIERPKEAGAIKSQASIDKLLAKWTDRYSGVRNAFSVALLQEGMSYKQLSQDNEKAQLLQSRQWTVNEVCRLYKIPPHMIQLLDKSTNNNIEHQGLQYVMYTLLAWLKRHEAAMMRDLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENMAPVDGGDKYLTPLNMVDTSTVHGLDKATPAQISEISAILQRTA >NZ_CP044107|2656359:2709677|2687099_2687444_+|WP_150391028.1|tail|DBSCAN-SWA MNYKSLINPLNTTVEQTLLGQKVYLRRLTSAELDDYNDKVEAGRQARLPSRELSAMGVNLFLAALVNEDGSKPKASELPTADQLMAAHSNADLLDAVTLVQRHSYGTLEEATKN |
65 | Klebsiella_phage(24.49%) | head,tail,portal,protease,capsid,terminase,holin | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
2888368 : 2897073
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP044107|2888368:2897073|DBSCAN-SWA AATGAAATTCTCACTACTGCTTTCATTGTATAACAAGGAAAATCCTTATTTTCTTTCTGAGTGTCTTAAAAGTATCGAAACTAACTCTCATTTGCCTGAGCAAATAGTTATTGTGTTCGATGGGCCTATTGGCAAAGAATTGGAAGAAATAGTTAATAATTATGCCGTTCGATTGCCTATTGATATCGTTCCAATCAATAAAAACGTTGGGTTGGGTGCTGCTTTAAATTATGGGTTAAAGTTTTGCAAAAATGAAATGGTTCTGAGAATGGATACTGATGACATATGCATGCCAGATCGTTTTGCAAAACAAATAGATTTTCTAAAGTTACATCCCAGTATTGGATTGATGGGCGGTGCGATCCAAGAGTATGACGAAAGTATGAATGAATCGAAAGGGATTCGGTTTTCAAAATGCCAGCATAAAGAGATAGTGGAATATGCAAAAAAGAGAAATCCCTTTAATCATATGACTGTTGCGTTTAGAAAAAGTTTAGTTGAGCAAGTTGGTGGGTATCAACACCATCATTTGATGGAGGACTATAACCTTTGGCTGCGTATGATTGCAGCAGGTGTTGAATGCTATAATATGGTTGATGTACTTGTTAAAGTTAGAGCAGGTAACAACATGCTCGCAAGACGTAAAGGAATATTTTATGCTAAAAGTGAAATCAAGATTGCTAGACTTAAATACAATTTAAAAATTGATCATCTTTGCGGATGTATTTCTTGCGCAGTCATTAGGATGATACCTAGGTTACTTCCTGTTTTTGCTTTAAATTATCTCTATAGAATATTGAGGAAGTAATTAATTATTTAACTGATTTTATCTAATTTGTTTTTTTCAGCGGATAAGTCAGCTTTAATCTGCTATTCATTTCATTCATCATGAGTTAGTATATTTGACACACTTCAAGCCGCGCAACAGTCGCGGCGACCACACCTGACAGGAGTATGTAATGTCCAAGCAACAGATCGGCGTTGTCGGTATGGCAGTGATGGGGCGCAACCTGGCGCTCAACATCGAAAGCCGTGGTTATACCGTCTCCGTTTTCAACCGCTCCCGTGATAAGACCGAAGAAGTCATCGCTGAGAATCCAGGCAAAAAACTGGTTCCTTTCTATACGGTTAAAGAGTTCGTTGAGTCTCTGGAAACACCTCGTCGTATCCTGTTAATGGTGAAAGCGGGCGCAGGTACCGATGCAGCTATCGATTCACTGAAACCGTATCTGGATAAAGGCGACATCATTATCGATGGCGGCAACACCTTCTTCCAGGACACTATTCGTCGTAACCGTGAACTGTCTGCTGAAGGCTTCAACTTCATTGGTACCGGCGTATCCGGCGGTGAAGAGGGCGCTCTGAAAGGTCCATCCATCATGCCTGGCGGTCAGAAAGAAGCGTACGAACTGGTCGCGCCTATCCTGACAAAAATCGCAGCCGTAGCTGAAGACGGCGAGCCGTGCGTGACCTATATCGGTCCTGATGGCGCTGGTCATTACGTGAAAATGGTTCACAACGGCATTGAATATGGCGATATGCAGCTGATTGCTGAAGCCTACTCTCTGCTGAAGGGTGGCCTGAACCTCTCCAACGAGGAGCTCGCAGAGACCTTCACCGAGTGGAACAAAGGCGAGCTGAACAGCTACCTGATCGACATCACCAAAGATATCTTCACCAAGAAAGATGAAGAGGGTAAATACCTGGTTGATGTGATTCTGGATGAAGCTGCGAATAAAGGTACCGGTAAATGGACCAGCCAGAGCTCTCTGGATCTGGGTGAGCCATTATCCCTGATCACCGAATCCGTATTTGCACGTTATATCTCTTCTCTGAAAGAGCAACGCGTCGCGGCATCTAAAGTGCTGTCTGGTCCACAGGCTAAACCAGCCGGCGACAAAGCTGAGTTCGTTGAGAAAGTCCGTCGCGCGCTGTATCTGGGTAAAATCGTCTCTTACGCGCAGGGCTTCTCTCAGCTGCGCGCGGCATCTGACGAGAACAACTGGGATCTGAACTACGGTGAAATTGCGAAGATCTTCCGTGCCGGCTGTATCATTCGCGCTCAGTTCCTGCAAAAAATCACCGATGCCTATGCTGAAAACGCGGGTATCGCAAACCTTCTGCTGGCACCTTACTTCAAACAAATTGCTGACGAATATCAGCAGGCGCTGCGTGATGTTGTTGCGTACGCGGTGCAGAATGGTATCCCGGTTCCGACCTTCTCCGCTGCGGTGGCATATTACGACAGCTATCGTGCTGCCGTTCTGCCAGCGAACCTGATTCAGGCACAGCGTGACTACTTCGGTGCGCATACATATAAACGTACGGATAAAGAAGGTGTGTTCCACACCGAATGGCTGGATTAATATACGGTTAGCCATGCCATCAAGCCCGGTAGCGATACCGGGCTTTTTTATCTTGCCCGATCCACCTACTGTTTAATGGAATAGATTGCGTGAAAATTCTAGTTACTGGCGGTGCCGGTTTTATCGGTTCCGCGGTTATCCGACATATTATTAGCAATACTCGGGATAGCGTTGTTAACGTTGATAAATTGACCTATGCCGGTAATCTGGAATCCTTGCGTGAAGTAAGCGATAGCGAACGCTATGTTTTTGAGCATGCCGATATATGTGATAAAGAGGCAATGGCGCGTATTTTTGCTACCCATCAGCCTGATGCGATAATGCACCTGGCTGCCGAGAGCCATGTCGACCGTTCTATTACCGGTCCCGCTGCGTTTATTGAAACCAATATTGTTGGCACCTATATCCTGTTAGAGACAAGTCGTGCTTACTGGTCTTCACTGGACGAGGCAGCAAAATCCGCCTTCCGTTTCCATCACATTTCAACTGATGAGGTTTACGGCGATCTGCCTCATCCCGATGAACATTCTGACTCCACCCTGTTACCGCTGTTTACCGAGAAAACAGCGTACCAGCCAAGCAGCCCTTACTCAGCGTCAAAAGCGTCCAGCGACCATTTAGTTCGAGCCTGGATACGCACCTATGGTTTACCCGGGATTGTGACAAATTGTTCGAATAATTATGGCCCATACCACTTTCCTGAAAAGCTGATTCCTCTTGTTATCCTTAATGCGCTGGATAATAAACCATTGCCGATTTACGGTAAGGGCGATCAAATCCGTGACTGGTTATACGTTGAAGACCATGCTCGTGCACTGTATACGGTTCTGACCACGGGTAAGCCGGGAGAAACCTATAATATTGGCGGTCATAACGAGAAGAAAAATATTGAAGTTGTTCAGACCATTTGTGATCTGCTCGATGACATGGTACCGAAAGAAACATCCTATCGCGCACAAATTACTTATGTCGCGGACCGACCAGGTCACGACAGACGTTATGCAATTGATGCTCATAAAATTAGCGACGAGCTTGGATGGACTCCTGTGGAAACCTTTGAAAGTGGCATCCGTAAAACGGTTGAATGGTATTTGAATAACCAGGAGTGGGTATCGAATGTAAAAAGCGGCGCCTATAAGAGCTGGATTGAGCAAAATTATGGGGAGCGTGAGTAAATGACGAAACGTAAAGGGATTATATTAGCTGGCGGATCGGGTACACGTCTTTATCCCGTGACTATGGCCGTAAGTAAGCAATTGTTGCCAATTTACGATAAGCCAATGATCTATTATCCGCTCTCTACGCTTATGCTGGCAGGTATACGGGATATTCTGATTATCAGCACGCCTCAGGACACTCCGCGTTTTGAACAACTGCTCGGGAACGGTAGTCAGTGGGGATTGCATATCCAGTATAAAGTGCAACCAAGTCCTGATGGACTGGCACAAGCTTTTATTCTGGGTGAAGAGTTTATCGGTGAGGATAATTGTGCGCTGGTATTAGGCGATAATATTTTCTACGGACACGATCTTCCCAGACTGCTTGAAGGCGCAGCAAGCCAGCAAGAGGGTGCGACCGTATTCGCCTATCATGTCAGCGACCCGGAACGCTATGGCGTCGTTGAGTTTGATAAAGACGGTACTGCAATTGGCCTTGAGGAGAAGCCTCAGCAACCCAAGAGTAATTACGCAATAACCGGTCTTTATTTTTACGACAACGATGTGATTGAGATGGCCAAAAGTTTAACTCCGTCCGAGCGAGGTGAACTCGAAATTACCGACATCAACCGCATCTATATGCAGCAGGGACGATTGTCTGTCGCGATGATGAGACGCGGTTATGCCTGGCTGGATACCGGAACGCATCAGAGTATGATCGAGGCAAGCAATTTTATCGCCACAATTGAAGAGCGACAGGGGCTAAAAGTTTCATGTCCTGAAGAGATTGCGTTCCGACGAGGTTTTATTGATGCCGAACAACTTCGGGTACTCGCTGAACCATTGAAAAAGACAGGGTATGGTCAGTATCTGCTGAATCTGACCAAGGGATTAGTCTGATTTGGATCCCTCTGTCATTTATCATTTTTAAGTGTCGTTAATTATTTCAGGCTGCATTAGTGCTGGTCAGAAGGCACTTTACATTGCATTTTGCAGTTTTCGGATTCTAGATTTAGTCACTGGTCAGACAAGGCATGAATCTTGACAGCGAATTGGTATTAGCGATAAAAAACCACCAGTTAACTATTGTGGTGGGCTTCTGGATGCCACCCGAAACTCATACTGAAGAAGTCAAGTAAATGAAAATAACTATCTCCGGAACAGGCTATGTTGGTCTCTCAAACGGTATTCTGATTGCACAAAACCATGAAGTGGTTGCGCTGGATATCGTGCAGGCAAAAGTGGATATGCTCAATCAGAAGAAGTCGCCAATTGTTGATAAAGAGATTCAGGATTACCTATCTAATAAGTCGCTGAACTTCCGCGCGACCACGGACAAAGAAGACGCTTACCGCGATGCGGATTTTGTCATCATCGCCACCCCAACCGACTACGATCCTAAAACCAACTACTTCAATACCTCAACCGTTGAAGCGGTGATCAAAGACGTTACGGCAATCAACCCTAACGCGGTGATGATCATCAAGTCGACAATTCCAGTGGGTTTCACCAAATCGATTAAAGAAGAGTTGGGTATTGATAATGTCTTCTTCTCGCCGGAATTTCTCCGCGAGGGCAGGGCGTTATACGATAACCTGCACCCATCCCGTATCGTGATTGGTGAGCGTTCCGAGCGTGCGGAGCGTTTTGCTGCGTTGCTTCAGGAAGGCGCAATCAAAAAAGATATTCCGGTGCTGTTCACCGATTCCACCGAGGCAGAGGCCATTAAGCTTTTCGCCAATACCTATCTGGCGATGCGTGTGGCTTACTTCAACGAACTTGACAGCTATGCCGAGAGCTTAGGTTTGAACACCCGTCAGATCATCGAGGGCGTGTGTCTTGACCCGCGTATCGGTAACCATTACAACAACCCGTCCTTCGGTTACGGTGGCTACTGTCTGCCAAAAGATACCAAGCAGCTGCTGGCTAACTATCAGGCGGTGCCGAACAACCTGATTTCCGCCATTGTGGATGCCAACCGCACGCGTAAAGACTTCATCTCGGATTCCATTCTGGCGCGTCAGCCGAAAGTGGTGGGCGTGTATCGCCTGATCATGAAGAGCGGTTCCGATAACTTCCGCGCTTCTTCCATTCAGGGGATCATGAAGCGTATTAAGGCGAAGGGCGTGCAGGTCATTATTTATGAACCGGCGATGCAGGAAGATGAGTTCTTCCACTCTCGCGTCATTCGCGATCTGGATGCATTTAAGAAAGAAGCGGATGTAATTATCTCCAACCGTATGGCGGAAGAGCTGGCGGACGTGAAAGATAAAGTCTATACCCGCGATTTGTTCGGCAGCGACTGATTAAAATCGTGATATAAAAAACCCGGCAATCTGCCGGGTTTTTTGTTTTTAAACGTTATAGAAGTTGCGGTACCAGTCGACGAAGTTTTTCACGCCTTCTTTGACTGAGGTTTGTGGTTTAAACCCAATAACGTCATACAGCGCTTTGGTGTCCGCACTGGTCTCCAGCACGTCACCCGGCTGGATCGGCATCATATTTTTTACCGCTTCCTTGCCCAGCGCCTCTTCAAGTGCGGTAATGTAATCCATCAGTTCGACAGGTGAACTGTTACCGATGTTATACACGCGATACGGGGCAGAACTGGTTGCTGGCGAACCGGTTTCGACTGTCCAGTCGGTATCGGCCTGAGGAATGACGTCCTGTAGGCGAATAATCGCCTCCGCAATATCATCGATATAGGTGAAGTCGCGCTTCATCTTGCCGTAGTTGTATACGTCGATGCTATTGCCTTCAATCATGGCTTTGGTGAATTTAAACAGCGCCATGTCCGGGCGTCCCCACGGGCCATAGACGGTAAAGAAGCGCAAGCCAGTGGTCGGCAGGTTATACAGATGCGAATAGGTATGCGACATCAGCTCATTGGCTTTTTTGGTTGCCGCATACAGGGATACCGGATGGTCTACGGAGTCGTCAGTAGAAAACGGCATCTTACGGTTAAGGCCGTAAACAGAACTGGATGACGCATACAGAAGATGCTGAACCTTGTTGTGGCGACAGCCTTCCAGCACGTTCAGGTGACCTACCAGATTAGCATCTGCGTAGGCATGCGGGTTTTCCAGCGAGTAACGCACGCCCGCCTGTGCGGCAAGATGGATTACGCGGTCAAATTTCTCATTGGCGAAGAGCGCAGTCATGCCCTCGCGGTCCGCTAAGTCCAGCTTGTAGAAGGTGAAACTCTCGGATTTGAGCAGTTCGAGGCGAGCGAATTTGAGGTTAGGGTCATAGTAATCATTCAGATTATCAAGCCCAACAACTTCATGCCCTGCATCAAGCAGGCGCTTGCTGACATGAGAACCGATAAAGCCCGCAGCGCCAGTTACCAGAAATTTCATAGTTTCCCTCATTCACAGTGTTCTTGTTGATGTTCCAGGACATCAGAATCCGACTATGATAGCGCGTCGCACTTCACTTACATAACCCATCTTTCCGATATAACTCATCGGATTCTTATAGAAAAATCGCCAGCAACAGATACACTAAGGAAACAAGATTATTTTTGCCCTTTTTCATCTGTTAGGGATTGTATGACGCAAAACAACAATAACCTGGTCACACGCAACAATGACCCGGAGCAAATTGATTTACTGGATTTAATGCTACAGCTGTGGCGTGGTAAATGGGTGATTGGAGCATTTGTTGCTGCTTTTATCGTTCTTGCCATTGTTTACATCACTGTAGCCAAAGAGAAATGGACGTCATCTGCCATCATAGCCCAGCCTGATGCGGCACAAATCGCCACCTATTCCAATGCGTTGAATATCCTCTATGGCGGGGCAGCACCTTCCATGTTGGACATTCAGAATCGTGCTATTGGTCGTTTTAATTCCTCCTTTTCGGCATTAGCTCAGGCGCTGGAGAATCAGGAATATCCCGAGAAATTGTCCATTGAGCCTACTGTAAAGGGGCAGAGTGTACCCTTAACTGTCAGTTACCAGAGCGATTCGGCTGAAGCAGCACAAAAGCAGTTGGCCCAATACATACAGCAGGTCGACGAACAGACCGCAAAAGAATTAACGCTTGACCTCAGAGACAACCTCAAGCAACAGATCACGACCCTGAACGACTCTCTTCTGAACCAGGAGAAAGTGGCTCAGGAGCAAAAAGAACTGCGTATCAAGCAAATTTCTGAGGCGCTAAAGAACGCGGAAGCCGCGAAGATCAGTACACCGCAGCTCCAGCAAACGCAGGACGTAACGCAGGAAACCATGTTCCTGCTGGGCACCGTAGCGCTCAAATCAATGGTTGATAACGAGGCGTCCCGTCCGCTGGTCTTCCCTGGCTCTTACTATCAGACCAAAAAGAACTTACTGGATATTGAAAACCTCAATGTTAACCCAGACACAGTACACGTGTATCGTTACGTGATGAAACCAGATCTGCCAATCAAGCGTGACAGCCCTAAAAAGGCCATCACTCTCATCCTTGCCGTTCTCCTCGGCGGGATCATCGGTTCTGCTGTCGTCCTCGGCCGTAACGCGCTGAGAAACTACAAACCAAGAGCCTGATTTACACACAAAAAAAGACCGGGTCTCCCCGGTCTTTTTTTATTTATGTCGTTTCCGCAAATTCTCAATCACCGTCGTTAAATCCAGCTCCTGATCCTGAAGCAGCACCAGCAGGTGATACATCAAATCCGACGCTTCGTTCGTCAGCTCCTCCCGGTCATGCACGGTGGCCGCCAGCGCCGTTTCAACGCCTTCCTCACCCACCTTCTGCGCAATACGCTTGGTGCCGCTGGCGTACAGCTTCGCCGTGTACGAGCTCTCGGGATCCGCTGATTTACGCTCTGCCAGCAGCTGTTCCAGCTGATAGAGGAACAGCCACTGGTGGCTCGCCTCGCCAAAGCAGCTGCTGGTACCTTTGTGGCAGGTCGGCCCAATCGGGTTGACCAGCACCAGCAGGGTGTCGTTGTCGCAGTCTGGCGTAATGCTGACCACATTCAGGAAGTGACCCGAGGTTTCCCCTTTCGTCCACAGGCGCTGTTTGGTGCGCGAGAAAAACGTCACCTTGCCGCTGTCGATCGTTTTTGCCAGTGCCTCCTGGTTCATGTATCCCAGCATCAGCACTTCGCCTGAAACGGCATGCTGTACAACCACCGGCAGTAATCCGTCAGTTTTTTCCCAGTCCAGCTGCGCCTGTTGTTGCTCTGTTAACAT
Protein sequences of DBSCAN-SWA_2 >NZ_CP044107|2888368:2897073|2896461_2897073_-|WP_017693103.1|DBSCAN-SWA MLTEQQQAQLDWEKTDGLLPVVVQHAVSGEVLMLGYMNQEALAKTIDSGKVTFFSRTKQRLWTKGETSGHFLNVVSITPDCDNDTLLVLVNPIGPTCHKGTSSCFGEASHQWLFLYQLEQLLAERKSADPESSYTAKLYASGTKRIAQKVGEEGVETALAATVHDREELTNEASDLMYHLLVLLQDQELDLTTVIENLRKRHK >NZ_CP044107|2888368:2897073|2894244_2895249_-|WP_150391068.1|DBSCAN-SWA MKFLVTGAAGFIGSHVSKRLLDAGHEVVGLDNLNDYYDPNLKFARLELLKSESFTFYKLDLADREGMTALFANEKFDRVIHLAAQAGVRYSLENPHAYADANLVGHLNVLEGCRHNKVQHLLYASSSSVYGLNRKMPFSTDDSVDHPVSLYAATKKANELMSHTYSHLYNLPTTGLRFFTVYGPWGRPDMALFKFTKAMIEGNSIDVYNYGKMKRDFTYIDDIAEAIIRLQDVIPQADTDWTVETGSPATSSAPYRVYNIGNSSPVELMDYITALEEALGKEAVKNMMPIQPGDVLETSADTKALYDVIGFKPQTSVKEGVKNFVDWYRNFYNV >NZ_CP044107|2888368:2897073|2890822_2891908_+|WP_047056191.1|DBSCAN-SWA MKILVTGGAGFIGSAVIRHIISNTRDSVVNVDKLTYAGNLESLREVSDSERYVFEHADICDKEAMARIFATHQPDAIMHLAAESHVDRSITGPAAFIETNIVGTYILLETSRAYWSSLDEAAKSAFRFHHISTDEVYGDLPHPDEHSDSTLLPLFTEKTAYQPSSPYSASKASSDHLVRAWIRTYGLPGIVTNCSNNYGPYHFPEKLIPLVILNALDNKPLPIYGKGDQIRDWLYVEDHARALYTVLTTGKPGETYNIGGHNEKKNIEVVQTICDLLDDMVPKETSYRAQITYVADRPGHDRRYAIDAHKISDELGWTPVETFESGIRKTVEWYLNNQEWVSNVKSGAYKSWIEQNYGERE >NZ_CP044107|2888368:2897073|2895441_2896422_+|WP_150391069.1|DBSCAN-SWA MTQNNNNLVTRNNDPEQIDLLDLMLQLWRGKWVIGAFVAAFIVLAIVYITVAKEKWTSSAIIAQPDAAQIATYSNALNILYGGAAPSMLDIQNRAIGRFNSSFSALAQALENQEYPEKLSIEPTVKGQSVPLTVSYQSDSAEAAQKQLAQYIQQVDEQTAKELTLDLRDNLKQQITTLNDSLLNQEKVAQEQKELRIKQISEALKNAEAAKISTPQLQQTQDVTQETMFLLGTVALKSMVDNEASRPLVFPGSYYQTKKNLLDIENLNVNPDTVHVYRYVMKPDLPIKRDSPKKAITLILAVLLGGIIGSAVVLGRNALRNYKPRA >NZ_CP044107|2888368:2897073|2888368_2889175_+|WP_059295499.1|DBSCAN-SWA MKFSLLLSLYNKENPYFLSECLKSIETNSHLPEQIVIVFDGPIGKELEEIVNNYAVRLPIDIVPINKNVGLGAALNYGLKFCKNEMVLRMDTDDICMPDRFAKQIDFLKLHPSIGLMGGAIQEYDESMNESKGIRFSKCQHKEIVEYAKKRNPFNHMTVAFRKSLVEQVGGYQHHHLMEDYNLWLRMIAAGVECYNMVDVLVKVRAGNNMLARRKGIFYAKSEIKIARLKYNLKIDHLCGCISCAVIRMIPRLLPVFALNYLYRILRK >NZ_CP044107|2888368:2897073|2889326_2890733_+|WP_023303973.1|DBSCAN-SWA MSKQQIGVVGMAVMGRNLALNIESRGYTVSVFNRSRDKTEEVIAENPGKKLVPFYTVKEFVESLETPRRILLMVKAGAGTDAAIDSLKPYLDKGDIIIDGGNTFFQDTIRRNRELSAEGFNFIGTGVSGGEEGALKGPSIMPGGQKEAYELVAPILTKIAAVAEDGEPCVTYIGPDGAGHYVKMVHNGIEYGDMQLIAEAYSLLKGGLNLSNEELAETFTEWNKGELNSYLIDITKDIFTKKDEEGKYLVDVILDEAANKGTGKWTSQSSLDLGEPLSLITESVFARYISSLKEQRVAASKVLSGPQAKPAGDKAEFVEKVRRALYLGKIVSYAQGFSQLRAASDENNWDLNYGEIAKIFRAGCIIRAQFLQKITDAYAENAGIANLLLAPYFKQIADEYQQALRDVVAYAVQNGIPVPTFSAAVAYYDSYRAAVLPANLIQAQRDYFGAHTYKRTDKEGVFHTEWLD >NZ_CP044107|2888368:2897073|2893029_2894196_+|WP_045343191.1|DBSCAN-SWA MKITISGTGYVGLSNGILIAQNHEVVALDIVQAKVDMLNQKKSPIVDKEIQDYLSNKSLNFRATTDKEDAYRDADFVIIATPTDYDPKTNYFNTSTVEAVIKDVTAINPNAVMIIKSTIPVGFTKSIKEELGIDNVFFSPEFLREGRALYDNLHPSRIVIGERSERAERFAALLQEGAIKKDIPVLFTDSTEAEAIKLFANTYLAMRVAYFNELDSYAESLGLNTRQIIEGVCLDPRIGNHYNNPSFGYGGYCLPKDTKQLLANYQAVPNNLISAIVDANRTRKDFISDSILARQPKVVGVYRLIMKSGSDNFRASSIQGIMKRIKAKGVQVIIYEPAMQEDEFFHSRVIRDLDAFKKEADVIISNRMAEELADVKDKVYTRDLFGSD >NZ_CP044107|2888368:2897073|2891908_2892790_+|WP_017693099.1|DBSCAN-SWA MTKRKGIILAGGSGTRLYPVTMAVSKQLLPIYDKPMIYYPLSTLMLAGIRDILIISTPQDTPRFEQLLGNGSQWGLHIQYKVQPSPDGLAQAFILGEEFIGEDNCALVLGDNIFYGHDLPRLLEGAASQQEGATVFAYHVSDPERYGVVEFDKDGTAIGLEEKPQQPKSNYAITGLYFYDNDVIEMAKSLTPSERGELEITDINRIYMQQGRLSVAMMRRGYAWLDTGTHQSMIEASNFIATIEERQGLKVSCPEEIAFRRGFIDAEQLRVLAEPLKKTGYGQYLLNLTKGLV |
8 | Organic_Lake_phycodnavirus(14.29%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_3 |
3677351 : 3684893
Sequences of DBSCAN-SWA_3
Nucleotide sequences of DBSCAN-SWA_3 >NZ_CP044107|3677351:3684893|DBSCAN-SWA CATGATTTACGACTGTTTTTTATACTATGACGAAGATATGTTGCTCGACATCAGATTACATACTCTCGCTGATGTCGTTGACCGTTTTGTCATTGTAGAAGCAACACACTCTTTTACGGGCATACCGAGAGAATTGCATTTCGATATTACAAAGTTTGCCAAATTCAAAGACAAAATCATTTACGTGCCTTTTGACGCGCAGCCTATTTTAAACCGTGCGGATAATAACCAGGTTGATGCCTGGGCAAATGAAGCAGCGCTTCGCAACTCCATTATGAACGGGTTAAAAGACGCGGCAGACGACGATCTGATTCTGGTGTCAGACGTTGACGAAATCTTCTCTCCCGACACGGTCAGGGCCATTAATCCGCGCGCGCTCTGCACGACTATTCATCAAAACGTGTTCAACTATCAGTTTAATCTCCAGGTTCACAACACGGATGGTACGCCGAGAAAATGTACCTTGCCCCGCGCGACGTCCTATTACAACCTTAAGCATTTCTTCCACGGTGAGCCTGAATCTTTCAGAAACTGGAAGCGTGCGCGCAAAGATAAAAACTGGTCATGGTTTAAATGGAACTGGCTAAAAATCAATAATAAAATTGTGAAAGATGGCGGCTGGCATTTCTCATGGGTAATGACTCCAGAAAGAATTTCCGAAAAAATGTCTACCATTTCTCATACCGAATACGATCTGCCGGAATTTAATAACCCGGAACATATTATGAAGGTTATTACCAACGCCGAAGATATCTGGGGACGAGACCGAAAACTGGTCAGGCAAGAGGTATCAAAACGCACCCTGCCTTCTTATCTGGTAGACAATCAGCACCATTACTCGCAATTTATTTTATGACGCAGGCAGGGCTCCAGGACAATTCAGAAAGACAGGTCCAAAAACAAAAGCTTACCCTAACCCTGCTATGCTTACAGTCCACACAGTGAAAAGGAGCTACAGATGAAACGGTTACCCTGGATTACCGCCCTGCTGTTAATGAGTGCTTCACCCGCTCTTCTTGCGGCCCCGGACTCCTGCGAGCGCGTGAAAAGCGACATTCAGCAGAAGATTATCAACAACGGCGTACCGGAGTCTGGCTTTACCCTGAACATCGTCCCGAACGATCAGGCCGATCAGCCGGATGCGCAGGTCGTTGGGCATTGTGCCAACGATACTTTCAAAATTTTGTACACCCGCACCAGTAGCGGCAACTACCCGGTGAGCGGCGCAGGTACGCAAGAGAATGCGCCCGCTGAGCCGCAATGATTTTCCCACTCAATTCCCCTGATTTGACGAGGATTAATATTTAATACCTCTAAATAAGTAACACTCCCCCCATCATTAGCCCGGTTGTAGTCACGGGTGAAAATAATTATTAACAATGATGGGGTGAGTTATGTCCGAAGTTGAACATCACGGCGGTATAAGCCGTCGAACTCTGGTTAAATCTACTGCCATAGGATCTCTGGCGCTTGCCGCCGGTGGGATCGCATTACCTTTTGGTCTGAAAAGCGCCGCCGCCGCTGTGCAGTCCGCTATTCAGCCCGCAGAAGACAAAGTTGTCTGGGGCGCCTGCTCGGTAAACTGCGGTAGCCGCTGCGCGCTACGTCTGCACGTTCGCGATGACGAAGTCTACTGGGTTGAAACGGATAATACCGGCGAGGATATTTACGGCAACCATCAGGTTCGCGCCTGCCTGCGAGGCCGTTCAATTCGCCGTCGCATTAATCACCCAGACCGTCTGAACTATCCGATGAAACGCGTGGGCAAACGCGGAGAAGGCAAGTTTGAGCGTATCACCTGGGAAGAAGCGCTGGACACCATCGCCGCGAGCCTGAAAAGCGTGGTCGAAAAATACGGCAACGAAGCGGTCTACATTAACTACTCCTCCGGAATTGTAGGCGGCAACATCACCCGCTCCTCCCCTTATGCCTCGCTGGTCGCGCGCCTGATGAACTGCTACGGCGGCTTCCTCAGCCACTACGGCACCTACAGCACCGCGCAGATCGCCTGCGCAATGCCCTACACCTACGGCAGCAACGACGGCAACAGCACATCGGATATCGAAAACACCAAACTGGTCGTGATGTTCGGCAATAATCCGGCGGAAACGCGCATGAGCGGCGGCGGGATCACTTACTACCTTGAGCAGGCGCGCGAACGGTCAAACGCGCGGATGATCGTTATCGATCCGCGCTACACCGACACTGCCGCAGGGCGTGAAGACGAGTGGATCCCGATTCGTCCGGGCACCGATGCCGCGCTGGTGGCAGGTATTGCGTGGGTGCTGATTAATGAAAATCTGGTCGATCAACCTTTCCTCGATAAATACTGCGTGGGTTATGACGAAAAAACCCTGCCGGAAGGCGCACCGGCTAATGGTCATTACAAAGCCTATATTCTCGGCCAGGGTGATGACAAAACCGCGAAAACCCCTGAGTGGGCGTCTCGCATAACGGGCATCCCTGCCGATCGCATCATTAAGCTGGCCCGTGAAATCGGTTCGGCGAAACCGGCCTACATTTGCCAGGGCTGGGGCCCGCAGCGTCAGGCTAACGGGGAGCAAACGTCCCGCGCCATCGCCATGCTGCCGATCCTGACCGGCAACGTCGGCATTAACGGCGGCAACAGCGGCGCGCGCGAATCGACCTACACCATCACCATCGAACGCATGCCGCTGCCGGAAAATCCGGTGAAAACGCAAATTTCCTGCTTCAGCTGGACGGATGCCATCGTGCGTGGACCGGAGATGACCGCCCTGCGCGACGGCGTACGCGGCAAAGATAAGCTCGATGTGCCGATCAAGTTCATCTGGAACTACGCGGGTAATACCATCATCAACCAGCACTCCGATATCAACAAAACTCACGACATTTTGCAGGATGAGAACAAGTGCGAAACGATCGTCGTCATCGACAACTTTATGACCTCCTCCGCGAAGTATGCCGATATTGTTTTGCCGGATCTGATGACCGTCGAGCAGGAAGATATCATCCCCAACGATTACGCCGGCAACATGGGATACCTGATTTTTCTCCAGCCGGTTACCGCCCCGAAGTTCGAGCGCAAGCCCATCTACTGGATCATGAGTGAAGTGGCGAAACGCCTCGGACCGGATATCCATCAGAAATTCACCGAAGGCCGTACGCAGGAGCAGTGGCTGCGCTACCTGTACGCCAAAATGGTCGCCAAAGATCCGCTGCTGCCATCCTACGATGCGCTGAAAAAAATGGGTATTTATAAGCGCAAAGATCCTAACGGACATTTTGTGGCCTATAAAAAATTCCGTGACGATCCGGATGCCAATCCGCTGAAAACCCCGTCGGGCAAAATCGAGATCTACTCCATCAAGCTGGCGGATATTGCGGCAACCTGGGAACTGCAAAAAGACGAAACCATCACCCCGCTGCCGGTCTATACCTCAACCTTTGAAGGCTGGGACGCGCCCGAGCGCAGCAAATTCCCGCTGCAACTGTTCGGTTTCCACTTTAAAGCCCGTACCCACTCCAGCTACGGCAACGTGGATGTGCTCCAGGCCGCCTGTCGCCAGGAGGTGTGGCTTAACCCTGTGGATGCGGAGCAACGTGGAATCAAAAACGGGGATATGGTGCGCGTCTTCAACGACCGCGGCGAAGTGCGTATTGCCGCGAAAGTCACCCCGCGCATCATGCCCGGCGTAAGCGCGATGGGCCAGGGCGCCTGGCATGACGCCAACATGAACGGCGATCGTGTCGATCACGGCTCCTGCATCAATACCCTGACCACACACCGCCCGTCACCGCTGGCGAAAGGCAACCCGCAGCACACGAACCTGGTGCAGATCGAGAAGGCATAAGGATTAACCGATGACTACCCAGTATGGATTTTTTATTGATTCCAGCCGCTGCACCGGGTGTAAAACCTGCGAGCTGGCCTGCAAGGATTACAAAGACCTGACCCCGGACGTCAGCTTCCGGCGTATTTATGAATATGCGGGCGGCGACTGGCAGGAGGACAACGGCGTCTGGCATCAGAATGTCTTCGCCTATTACCTGTCGATTGCCTGCAACCACTGCGAAGATCCGGCCTGCACCAAGGTCTGCCCGAGCGGGGCAATGCACAAGCGCGACGACGGTTTTGTGGTGGTGGACGAGGATGTCTGCATCGGCTGTCGCTACTGCCACATGGCCTGCCCGTACGGCGCGCCGCAGTACAATGCCGCCAAAGGCCACATGACCAAGTGCGACGGCTGCCACAGCCGCGTGGCGGACGGCAAAAAGCCCATCTGCGTCGAATCCTGCCCGCTGCGCGCGCTGGACTTTGGCCCGATTGAAGAGCTGCGCAAAAAACACGGTCAGCTTGCTGCCGTCGCGCCGCTGCCGTCTGCGCACTTCACGAAGCCGAGTATTGTGATTAAACCTAACGCCAACAGCCGTCCGACGGGTGACACCACCGGCTACCTGGCAAACCCGAAGGAGGTGTGAAATGGGAAGTGGATGGCATGAATGGCCGCTGGTGATCTTCACCGTTTTCGGGCAGTGCGTGGCCGGCGCGTTAATCGTGATGGGCTTCGTCTGGCTAAAGGAAAATGATGACAAGGCCAGAATGCGTATCGTGCGCAGCCTGTTTTGTCTCTGGCTGGTAATGGGTATTGGGTTTATGGCCTCGGTACTGCATCTTGGCTCCCCGCTGCGTGCCTTCAACTCGCTTAACCGCGTGGGCGCGTCAGCGCTGAGTAATGAGATCGCGGCGGGTTCGATCTTCTTTGCCGTGGGCGGTTTCTGGTGGCTGGTGTCGGTTATCGGTAAAATGCCCCCAGCTTTGGGCAAAATCTGGCTGGTTGTCAGCCAGATCCTGGGCATCGTTTTTGTCTGGGCGATGACCCGTGTCTATCAGATTGAGACTGTCCCGACCTGGTATACGGGTTACACCACGCTGAGCTTCTTCCTGACGATGGTTCTGGCTGGTCCTCTGCTGGCCGCGCTGCTGCTGCGCGTCGCAAACGTGACATTCAAAGGTACTCTTGCCGCGTCAGTCAGCGTACTGGCACTTATCGTCTGCGTGGCGGTCGTCGTGTTGCAAAGCAATACGCTGGGGACAATACAGAGTTCCATCCAGCAGGCTAACGCCCTGCTTCCGGATTATGGTTCGCTTCAGGTATGGCGCATCGCGTTGCTGGCGGCGGGTTTAGGCTGCTGGATCTGCCCGCTGATTCGTCGTCAGGAGCCAAAAACGCTCGGCCTGTTTGCAGGTGTCGTGCTGGTGGCGCTGGGTGAACTGATTGGCCGTGGGCTATTTTATGGTCTGCATATGACCGCAGGGTTAGCAATTGCAGGTTAACACAGGTGCGCGGGGCTACCCGCGCGCAAGTAAGGAAAGTTGTAATGAATGACGTCTCACACCGCGAATCGTTCGCGTTCAGCGCCCGGGTACTGGGCGCGCTGTTTTATTTCGCTCCAGACAGCGAGCAGACCGCGCCGCTGGTGAGTGCCCTGACCGCAGGTGACTGGGTTCAGGACTGGCCGCTGGCGGAGGAAAATCTGCTGCCTGTCGCCAGTATGTTTAAGACCCCATCGGATGAAGCGTTGAAAGACGCCTGGCTGCGTCTGTTTATTGGCCCGTATGCCCTGCCCGCCCCCCCGTGGGGCTCGGTCTGGCTTGATCGCGAGTCCGTGCTGTTTGGCGATTCGACCCTCGCGTTGCGTCAGTGGATGCGTGAAAACCATATCGCCTTTGAGATGCAGCAGAATGAGCCCGAAGATCATTTCGGAACGTTGCTGATGCTGGCGGCATGGCTTGCCGAGAACGGTCGCGAAACAGAACGCGACCAGCTTCTTGCCTGGCATCTGCTGCCATGGAGCACGCGTTTCCTTAGCGTATTCGTTGAAAACGCGGCCCATCCGTTCTACACCGCGCTGGGTAAACTTGCCCAGCTGACGCTGGCGGAATGGCAGTCCACTTTGCTGATCCCGATTGTCGAAAAAACGCTGTACCGATAACAGCACCTGGCTTGTCCACCCGTTGTGCAGGCCAGGCTCGCACATTTTTGCTGCAACACAACATCACAACTGCTATACTGTATATAATTACAGTATACAGGGGGTGCACAATGGCTGTTGAAACAAAATTTGTTGTCGTAAGAAAAGGTGAAGAAAAAATGACATTTGCCAGTAAGAAAGAGGCTGACGCTCACGACAAACTGCTCGATATGGCAGAAGCGTTTACCGACTGGCTGTTGCAAAGCGGAATGCAGATGGATGAAACGCAGGCGGAGAACCTAGGTCTTTATCTCGCCGAGCAGAAAGAGTCCGTGCAGCATATCCTGCGTACCAGCAAGCTTCCCGAGCTCAATGCTGAAACGGATAAAACAGCATCAGATGCGGACAGCAGCAAAAAAATCCGGGCCGTCAAAGCCGCCTGATTGTCAGAACGTCTCATCACCTGTCCTGCCGCCGCGGGCCGGAAACGGCTCGCGGCTTTATTAAGGTCGGCTGAACCTTATATTCATCAACCGTAACCCTGAAAATACGTCCTGTTATCTTGATTTGTCCGCCTCAACGTTTACGCTGATATTCAGCCTTTTTGCTTCCATGGTTATCATTGTTTTGGCCAGGTCCTTATCGCCCCCGTGCACTTTATCTATCAGATCCTGTACAAGGGTCATCATCAGGAAGTTTTTATTTGCAGGATAGAAATCTGTCGTAGTGATCCCGTTATCCTTCAGGCGCTGATGCGCCACCGTCTCTGTATCTGCGATTATTTCAGCTACTTTCTCATGCATGATCTCGATCTGCTCGAAAACAATGCGGTTAATCATTTCAATCTGTTCTGGCGTAAAATTCGGTTCCATTATTTCCCCTCGTCAGGCGTATGTTAAAAACACGTGAATCACACGTTGAGCGCCTCTGAATGGTGCGGTAAATTATCACACAACTCTTGTAACTCGTTGATCAGCCTGGCCACATGGAAACCGTCACAGACAGAATGATGGACCTGAACGGCGAAAGGCAACAAGACCTTTCCATCCTGGTTATAGTATTTTCCAAACGTGAACATGGGCGCAAAAAAATTCCGCATGTTAGCGACGTTGACGTTAAAACTGCTAAAACTGACCCACGGAATATCCGATATGAAGAATATATTCTCCCGGGACTCTTCTTTAGGCCAGTAAGCAAGGATATTGCCATAGCGGGCAACATCTTCTGCATAAACGCGCTGAAAATGGTGAATATTTCCATCGTAATGACTCCATAACGATGAAAATGTCTCCGTTTCTTTATGGAAAAGGGTATAGCTTGGATGAACCTCATTCCATATTACAAGCTCATCATCCTTCATGGCCATACGGAATTCCGGATGCCGGTTTACGACGTCAGAAATAAGGGAAATAATCGCAGGATAAAATTTCCAGCCAACCTCTTTGATATGCTTTAGCAGCACGGTAATGTCCAGCTGAACCGTTTGGTTAAATGTAGATTGAGCAAAGCCCTGAAATACTTCAAAGTGTTCCTTTCTTGCCCAACGTGATAAGTCAACTACCGTATATTCTGGCATTATTTTTGTCAT
Protein sequences of DBSCAN-SWA_3 >NZ_CP044107|3677351:3684893|3684215_3684893_-|WP_015570622.1|DBSCAN-SWA MTKIMPEYTVVDLSRWARKEHFEVFQGFAQSTFNQTVQLDITVLLKHIKEVGWKFYPAIISLISDVVNRHPEFRMAMKDDELVIWNEVHPSYTLFHKETETFSSLWSHYDGNIHHFQRVYAEDVARYGNILAYWPKEESRENIFFISDIPWVSFSSFNVNVANMRNFFAPMFTFGKYYNQDGKVLLPFAVQVHHSVCDGFHVARLINELQELCDNLPHHSEALNV >NZ_CP044107|3677351:3684893|3678744_3681183_+|WP_150391155.1|DBSCAN-SWA MSEVEHHGGISRRTLVKSTAIGSLALAAGGIALPFGLKSAAAAVQSAIQPAEDKVVWGACSVNCGSRCALRLHVRDDEVYWVETDNTGEDIYGNHQVRACLRGRSIRRRINHPDRLNYPMKRVGKRGEGKFERITWEEALDTIAASLKSVVEKYGNEAVYINYSSGIVGGNITRSSPYASLVARLMNCYGGFLSHYGTYSTAQIACAMPYTYGSNDGNSTSDIENTKLVVMFGNNPAETRMSGGGITYYLEQARERSNARMIVIDPRYTDTAAGREDEWIPIRPGTDAALVAGIAWVLINENLVDQPFLDKYCVGYDEKTLPEGAPANGHYKAYILGQGDDKTAKTPEWASRITGIPADRIIKLAREIGSAKPAYICQGWGPQRQANGEQTSRAIAMLPILTGNVGINGGNSGARESTYTITIERMPLPENPVKTQISCFSWTDAIVRGPEMTALRDGVRGKDKLDVPIKFIWNYAGNTIINQHSDINKTHDILQDENKCETIVVIDNFMTSSAKYADIVLPDLMTVEQEDIIPNDYAGNMGYLIFLQPVTAPKFERKPIYWIMSEVAKRLGPDIHQKFTEGRTQEQWLRYLYAKMVAKDPLLPSYDALKKMGIYKRKDPNGHFVAYKKFRDDPDANPLKTPSGKIEIYSIKLADIAATWELQKDETITPLPVYTSTFEGWDAPERSKFPLQLFGFHFKARTHSSYGNVDVLQAACRQEVWLNPVDAEQRGIKNGDMVRVFNDRGEVRIAAKVTPRIMPGVSAMGQGAWHDANMNGDRVDHGSCINTLTTHRPSPLAKGNPQHTNLVQIEKA >NZ_CP044107|3677351:3684893|3681812_3682667_+|WP_032653420.1|DBSCAN-SWA MGSGWHEWPLVIFTVFGQCVAGALIVMGFVWLKENDDKARMRIVRSLFCLWLVMGIGFMASVLHLGSPLRAFNSLNRVGASALSNEIAAGSIFFAVGGFWWLVSVIGKMPPALGKIWLVVSQILGIVFVWAMTRVYQIETVPTWYTGYTTLSFFLTMVLAGPLLAALLLRVANVTFKGTLAASVSVLALIVCVAVVVLQSNTLGTIQSSIQQANALLPDYGSLQVWRIALLAAGLGCWICPLIRRQEPKTLGLFAGVVLVALGELIGRGLFYGLHMTAGLAIAG >NZ_CP044107|3677351:3684893|3677351_3678206_+|WP_003857421.1|DBSCAN-SWA MIYDCFLYYDEDMLLDIRLHTLADVVDRFVIVEATHSFTGIPRELHFDITKFAKFKDKIIYVPFDAQPILNRADNNQVDAWANEAALRNSIMNGLKDAADDDLILVSDVDEIFSPDTVRAINPRALCTTIHQNVFNYQFNLQVHNTDGTPRKCTLPRATSYYNLKHFFHGEPESFRNWKRARKDKNWSWFKWNWLKINNKIVKDGGWHFSWVMTPERISEKMSTISHTEYDLPEFNNPEHIMKVITNAEDIWGRDRKLVRQEVSKRTLPSYLVDNQHHYSQFIL >NZ_CP044107|3677351:3684893|3683436_3683748_+|WP_017384548.1|DBSCAN-SWA MAVETKFVVVRKGEEKMTFASKKEADAHDKLLDMAEAFTDWLLQSGMQMDETQAENLGLYLAEQKESVQHILRTSKLPELNAETDKTASDADSSKKIRAVKAA >NZ_CP044107|3677351:3684893|3682711_3683326_+|WP_150391156.1|DBSCAN-SWA MNDVSHRESFAFSARVLGALFYFAPDSEQTAPLVSALTAGDWVQDWPLAEENLLPVASMFKTPSDEALKDAWLRLFIGPYALPAPPWGSVWLDRESVLFGDSTLALRQWMRENHIAFEMQQNEPEDHFGTLLMLAAWLAENGRETERDQLLAWHLLPWSTRFLSVFVENAAHPFYTALGKLAQLTLAEWQSTLLIPIVEKTLYR >NZ_CP044107|3677351:3684893|3683862_3684177_-|WP_017694081.1|DBSCAN-SWA MEPNFTPEQIEMINRIVFEQIEIMHEKVAEIIADTETVAHQRLKDNGITTTDFYPANKNFLMMTLVQDLIDKVHGGDKDLAKTMITMEAKRLNISVNVEADKSR >NZ_CP044107|3677351:3684893|3681193_3681811_+|WP_017384545.1|DBSCAN-SWA MTTQYGFFIDSSRCTGCKTCELACKDYKDLTPDVSFRRIYEYAGGDWQEDNGVWHQNVFAYYLSIACNHCEDPACTKVCPSGAMHKRDDGFVVVDEDVCIGCRYCHMACPYGAPQYNAAKGHMTKCDGCHSRVADGKKPICVESCPLRALDFGPIEELRKKHGQLAAVAPLPSAHFTKPSIVIKPNANSRPTGDTTGYLANPKEV >NZ_CP044107|3677351:3684893|3678308_3678614_+|WP_003857424.1|DBSCAN-SWA MKRLPWITALLLMSASPALLAAPDSCERVKSDIQQKIINNGVPESGFTLNIVPNDQADQPDAQVVGHCANDTFKILYTRTSSGNYPVSGAGTQENAPAEPQ |
9 | Escherichia_phage(83.33%) | NA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|