Contig_ID | Contig_def | CRISPR array number | Contig Signature genes | Self targeting spacer number | Target MGE spacer number | Prophage number | Anti-CRISPR protein number |
---|---|---|---|---|---|---|---|
NZ_CP044399 | Moritella marina ATCC 15381 strain MP-1 chromosome, complete genome | 3 crisprs | cas3,TnsE_C,DEDDh,DinG,csa3,RT | 0 | 2 | 2 | 0 |
NZ_CP044398 | Moritella marina ATCC 15381 strain MP-1 plasmid unnamed1, complete sequence | 1 crisprs | NA | 0 | 1 | 0 | 0 |
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP044399_1 | 74968-75080 | Orphan |
NA
Consensus repeat of NZ_CP044399_1
|
1 spacers
spacers of NZ_CP044399_1
>1.1|74995|59|NZ_CP044399|CRISPRCasFinder GCAGTTAAAGGGGAGGAGCAGACCCCCTCCCAACCTCCCCCTTGAAGTTAAAGGGGAGG |
CRISPR arrays and Neighbor proteins around NZ_CP044399_1
The CRISPR arrays of NZ_CP044399_1 >merge|NZ_CP044399|1|74968-75080|CRISPRCasFinder AGCAGACCCCCTCCCAGCCTCCCCCTTGCAGTTAAAGGGGAGGAGCAGACCCCCTCCCAACCTCCCCCTTGAAGTTAAAGGGGAGGAGCAGACCCCCTCCCAACCTCCCCCTT >NZ_CP044399|1|1|74968-75080|CRISPRCasFinder AGCAGACCCCCTCCCAGCCTCCCCCTT GCAGTTAAAGGGGAGGAGCAGACCCCCTCCCAACCTCCCCCTTGAAGTTAAAGGGGAGG AGCAGACCCCCTCCCAACCTCCCCCTT
>NZ_CP044399.1|WP_019440394.1|74075_74963_+|lipoprotein-NlpI MKRCLSLLIVTALLSGCSSLSGSFSTDDRTPSELILAEPLQVNYQTEIMLMRYSQLILDAKDDRSRQARYFYERGLLADSMGLRSLAHADFQRSLTLQPDFVPAYNFIGLYMTQTEQFDEAYDAYDSIAQLDPENNYVLLNRGIALYYGERYRLATDDLISAYNESPNDPFRTLWLYYPEYEVSPQDALAAVKTRYSQHIDNNWSWNIVALYTQELSETQLLAKLLDGLDKTDPAYNKILAHRLTETYFYLGKYKLLSNDNRAAESYFKLALSNNVYEFIEHGYARLELSRLAAK >NZ_CP044399.1|WP_019440395.1|71877_73995_+|polyribonucleotide-nucleotidyltransferase MNPIVKSFKYGQHTVTLETGVIARQATAAVMASIGDTSVLVSVVGKKQAEAGRDFFPLTVNYQERTYAAGKIPGGFFKREGRPSEGETLTCRLIDRPIRPLFPAGFKNEVQVVATVVSVNPEIQPDLVALIGVSAALSISGMPFNGPIGAARIGFQNDEYILNPSTSELAESKLDLVVAGTENAVLMVESEAEILSEEQMLGAVVYGHEQMQVVIEAVKEFAAEVNTPKWDWSAPVVNTELKAKIAELASGELAEAYQIQEKTERYAKVGGIKSAAIAKLQEENEELNTREAGELLGSLEKNIVRNRILDGEPRIDGRDPEMIRALSVMTGVLPRTHGSSLFTRGETQALVTATLGTERDAQRIDSLTGETVDRFLMHYNFPPYCVGETGMVGSPKRREIGHGRLAKRGVLAVMPNADEFPYTVRVVSEITESNGSSSMASVCGTSLALMDAGVPIKASVAGIAMGLVLDGDRSVVLSDILGDEDHLGDMDFKVAGSTGGITALQMDIKIEGITKEIMQKALVQAKAARLHILSVMDQAIATNRDDVSEFAPRIHTIKINTDKIKDIIGKGGATIRALTEETGTTIEIEDDGTVKIAATSGEQAQAAIERIHQLTAEVEVGQIYEGKVVRLADFGAFVNILPGKDGLVHISQITQERVNKVADHLSVEQVVKVKVLEVDRQGRIRLSIKEAMDAPAAPAEQPVSE >NZ_CP044399.1|WP_019440396.1|71396_71666_+|30S-ribosomal-protein-S15 MSLSAEAKAVIVADFARQEGDTGSTEVQVALLTAQINHLQGHFKKHIHDHHSRRGLLRMVASRRKLLDYLKRTENVRYADLIARLGLRR >NZ_CP044399.1|WP_019440397.1|70224_71190_+|tRNA-pseudouridine(55)-synthase-TruB MSGRRRRWNGRDVHGVFLLDKPTGISSNDALQRVKKIFFAAKAGHTGALDPLATGMLPLCFGEATKFSQFLLDSDKRYIVTAKLGERTDTSDSHGEIVQTRTVNVSDAELLIALDTFRGDTKQVPSMFSALKHEGKPLYWYARQGIFIDRPARPISVFELKLLSFENDEVNLEIHVSKGTYIRTIVDDLGELLGCGAHVSMLRRIGVSAYPAERMMTFEQLEEMVEQAKAAGVEPKDVLDPLLMPLDSAVSHLPEANMSEETGGFVLHGQPVVVPNTPESGLVRMTVGDERAFIGVGAIDDQGRVAPKRIVNYETQAREAK >NZ_CP044399.1|WP_019440398.1|69796_70222_+|30S-ribosome-binding-factor-RbfA MAKEFSRSRRVAQQLQQEIARILQREVKDPRVGMVTVSSIDLSRDLSYAKVYVTFFNIDNDEERIKDGIAALDTASGYIRSLVGSSMKLRIVPELRFIYDNTLVEGMRLSSLVTEVRAKDKKLQDDYGTTADEKDASEGES >NZ_CP044399.1|WP_019440399.1|67022_69719_+|translation-initiation-factor-IF-2 MADVSITKLAEDIGTTVDRLVQQFSDAGIAKANDSTVNEGEKQTLLVHLSEQHGSDTAEPSRLTLQRKTKSTLSVASGGGKQKSVAVEVRKKRTYVKRTAAEDEAQLAEEKAAAEAAELKAQAEAKAQAKAQADAQAKEKADAEAKAKRDAADKAKRDTKQKTTKSKEADDMAKREAEALKQKQEQEATRKAELEAQQKAEEARKLAEENSGRWAAEEAERAKTEKSADYHVTTSTHAQAAEDDADAQAQKGERKKKPVAPVTEAAKPAPKGKGKARKAKGRKPDNRYNRHQGKSVNAPEGMQQGFNKPVAKVERDVRIGETISVSELAQKMAIKATEIIKYMMKQGSMVTINQVLDQETAQLVAEDMGHKVILVKENALEEAVLADAQDVTQGVKVTRAPVVTIMGHVDHGKTSLLDYIRRAKVADGEAGGITQHIGAYHVETENGMVSFLDTPGHAAFTSMRARGAQATDIVVLVVAADDGVMPQTIEAIHHAKAAGVPLIIAVNKMDKEGADTDRVKSELAQHNVMPEDWGGENMFVYVSAKAGTGVDELLEAILLQADVLELEAVATGPAAGVVIESRLDKGRGPVATVLVQQGELKQGDIVLCGLEYGRVRAMRDENGKAIESAGPSIPVEILGLSGVPQSGDEATVVRDEKKAREVALYRQGKFRDVKLARQQKSKLDNMFANMEAGEVSELNIVLKADVQGSLEALCDSLVKLSTDEVKVAIITRGVGGITETDVTLAAASNAIVLGFNVRADAKAREVVSNESVDLRYYSVIYDVIDEVRQAMSGLLAPEFRQEIIGLAEVRDVFKSPKIGSIAGCMVTEGIIKRSAPIRVLRDNIVIYEGELESLRRFKDDVQEVRMGIECGIGVKNYNDVRVGDQIEVFETVEIKRTL >NZ_CP044399.1|WP_019440400.1|65496_66996_+|transcription-termination/antitermination-protein-NusA MNKEILLVVDAVSNEKALPREKIFEAMEIALATATKKRYEGEIEVRVEIDRKTGNFETFRRWLVIDDKGEALENPFSEITLDAAKFDDETIEVGGYIEDTIESVVFDRVTTQTAKQVIIQKVREAERDLIVQQYAKHEGELITGLVKRANRETVVLDLGNNAEAVMFKDEMLPRESFRTGDRIKGLLKEVKPEARGTQLFISRACNEMLIELFRVEVPEFNEEMLELKAAARDPGSRAKIAVKSNDKRIDPVGACVGMRGARVQAVSSELNGERVDIILWDDNPAQFVINAMAPAEVASIIVDEDQHSMDIAVEQDNLAQAIGRNGQNVRLASQLTGWELNVMTVAEANEKHQKENDRLMNIFTDKLDIDEDMAELLIGEGFSSLEEIAYVPVNEFLQIDGFDEDLVDELRSRAKNALTTSALAAEESLEGAEPSADLLALEGLEKHLAYVLASIGVTTLEELAEQGIDDLSEIEELTDERAGELIMAARNICWFSDSE >NZ_CP044399.1|WP_019440401.1|65023_65482_+|ribosome-maturation-factor-RimP MASLEQTLTELLEPTVEMLGFDLIGIEFTRAGKHSTLLVYIDHENGIFVDDCSKVSHQISAIMDVEDPITTEYFLEVSSPGMERPLFKVAHYAEYCGSEIKALLRMAVNGRRKFKGVIKSVDGEMITVTIDGKDEVLAHANIQKANIVPKFD >NZ_CP044399.1|WP_019440402.1|64004_64325_+|preprotein-translocase-subunit-SecG MYEVIIVIYLIVALAIIGFVLMQQGKGADMGASFGSGGSNTVFGSGGSGNFLTRVTAILAVVFFALSLVLGNLSTQSETDVILDAEKPVITSDVPVSPVDNSDVPQ >NZ_CP044399.1|WP_019440403.1|62409_63744_+|phosphoglucosamine-mutase MSRKYFGTDGIRGLVGKAPITPEFVLKLGWAAGKVLAQQGTKKVLIGKDTRISGYMLESALEAGLSAAGLDAAFMGPMPTPAVAYLTRTFRAEAGIVISASHNPYHDNGIKFFSANGTKLPDEVELAIEAQLEKELTCVESALLGKAVRIDDAAGRYIEYCKSTFPSRASLKGLKIVLDCAHGATYHIAPSVFKELGAEIIPIGVSPNGLNINDGCGATEPAALAARVLAEKADLGVAYDGDGDRLMMVDHTGYVIDGDEILYIMAREALRNGELKGGVVGTLMANMGLEVALKSLGIPFARSAVGDRYVVEMLLEKGWRIGGENSGHIISLDHTTTGDGIVSSLLVLAAMINSGLTLQELRSGMSKFPQVLVNVRFSGDSDPLLAESVLAAVKDVEQELADRGRVLLRKSGTEPLIRVMVEGEDETHVLALANKIADAVKATF >NZ_CP044399.1|WP_019440393.1|75166_76048_-|U32-family-peptidase MKYALGPILYYWPKQQVEDFYTAAVNSDADIIYLGETVCSKRRELKPKDWLGLAKEIANSGKQVVISTMALLEAPSEVNILRKYCENGDFIVEANDFGAINLLAEAKTPFVCGHALNVYNAQVLQLLVNKGMQRWVMPVELSRDWLVQLQEDSRLLNIRDQFEIEVFAHGHLPLAYSARCFTARSENRAKDDCELCCINHANGKPVYSQDDKELFTINGIQTMSGYKYNLLNDVASMQDLVDVVRVSPLGDSAFETLGQFKQAAEDNIKFDLKLDRECNGYWHQIAGFDTVTT >NZ_CP044399.1|WP_019440392.1|76149_77145_-|U32-family-peptidase MELLCPAGNLPALKTAVDNGADAVYIGFKDDTNARHFAGLNFTDKKLDKAVDYIRSNNRHLHVAINTFAHPGKLERWERAVDRCADMGVDAAIISDVAVLDYATKKYPDLELHLSVQASATNVEAINFYTNNFNVSRVVLPRVLSIHQVKQLARNTDVELEVFAFGSLCIMAEGRCYLSSYLTGESPNTVGACSPAKFVRWEETEQGLESRLNDVLIDRYQPEEKTGYPTLCKGRFNVDGKVFHALEEPTSLNTLALIPELAQANIAAVKIEGRQRSPAYTEQVTKVWRAALDRYRQDPAQYQVETAWNKQLDQLSEGTSTTLGAYHRDWQ >NZ_CP044399.1|WP_019440391.1|77366_77909_+|SCP2-domain-containing-protein MLHSLHRKLVHTVPTLLAIPAKVLPFSLQEKVLSQVFNKVFAEALADDEFEFLEQKWLQVEITDLGINWFISCVDNKLVIAPCAATVDVSFKGNLNELVLITARKEDPDTLFFQRRLKIEGDTELGLEVKNMLDSFDLDELPTAVTTLLAYVAEFIQQGLADPVLSNELSSSTVKNKTMA >NZ_CP044399.1|WP_019440390.1|78127_78706_+|hypothetical-protein MRLVVISGSTRNRSTTIKVAQSVLQLAEQSQLFSKINLLDFVKVSLPIWDKAIQNEFDDWQDEWQVTAQLIRSADAIIIVSPEWEEESLDNFYAFCQHDDFPLLPCAVLRVGSNCRGAYSSTELAMANFCKNHTCLILEHFIVATIESVNNCKETYYPFETPLVERILANLGLMKQLVDNGGLLSPTRLHLA >NZ_CP044399.1|WP_019440389.1|78748_79333_+|ribosomal-protein-S5-alanine-N-acetyltransferase MSSKFPQFATERLIIRVAVASDAEKLCQYYIRNQVHLAPWEPIRSEVYYTLRWWQLRVEQIHIEFNAASAINFIAIDRDTSEIVAVANFSNIIQGVFKSCYLGYSISKAYEGRGLMVEFLQSCLAFMFENVGLNRVMANYIPVNERSGALLQRLGFEREGYARQYLKIAGVWQDHVLTALLHADWSARNRNSDN >NZ_CP044399.1|WP_019440388.1|79467_79731_-|YfhL-family-4Fe-4S-dicluster-ferredoxin MALLINDKCINCDMCDPECPNGAITFGAKIYEIDPLLCTECVGHYDKPTCKTVCPINCIITDPDNVEKEETLWEKFVMIQEATKATR >NZ_CP044399.1|WP_019440387.1|79757_81128_-|tRNA-5-hydroxyuridine-modification-protein-YegQ MFTPELLSPAGSLKNMRYAFAYGADAVYAGQPRYSLRVRNNEFSLENLAVGINEAHALGKQFYVVCNIQPHNSKLKTFIRDLTPIIAMKPDAIIMSDPGLIMMVREAFPDMVIHLSVQANAVNWATVKFWYTQGIKRVILSRELSLDEIEDIRFHCPDMEIEVFVHGALCMAYSGRCLLSGYINKRDPNQGSCTNSCRWKYDAHDATENETGDIVATKPEIYMPETDSPEPTLGEGKPTDQIFLLQEQGRPNEYMPAFEDEHGTYIMNSKDLRAIQHVERLTKMGVHSLKIEGRTKSFYYCARTAQVYKQAINDAVAGRDFDPSLLGTLEHLAHRGYTEGFLSRHTHDAYQNYDYGYSISETQQFVGELNGRNDKGFAEVIVKNKFLVGDSLELMTPQGNMTFKLEELENRKGESMEYAPGSGHIVYLPVPEEVELDHALLMRNFANSEDTRNPHK >NZ_CP044399.1|WP_019440386.1|81338_82493_-|Na/Pi-symporter MKNNNEAIELNPSTMQKVFSWVSVAALVYFVLVAVSTVSGGFKMFSGGSAGAEQIFAFATNPFVALLLGILVTALVQSSSTVTSVIVGLVAGGLPLSIAIPMVMGANMGTTITNTFVSMGHIRDKKEFERAFSAATVHDFFNLLAVAIFLPLEIAFGILEKMATFLADFFVSDSSLSIKEFNFIKPLTKPAVNQIKELAGSLPVESNTVGLVMVFIGIFMIGFSVTFLGKVLKSVMVGRAKAVLHGAIGRGPVSGILSGTAVTVMVQSSSTTTSLMIPLAGSGVFTTRQIFPFTLGANIGTTITALLAATSISGEFAQVAMTIALVHVMFNVFAVALIYGIPFLREIPIKCSEALARQGTENKFIAFGYVVGAFFALPGLMIIF >NZ_CP044399.1|WP_019440385.1|82995_83910_-|recombination-associated-protein-RdgC MWFKNLLIYRFTRPFELDIEQLETKLADFPFTPCGSQDLSKFGWIKPLGKSGQALTHGISDNILICAKKEDRVLPASVVKDMLQEKVDSIEAEQGRGLKKKEKDALKEDIVHQLLPRAFPRSSQTFAWICPSQDLLVVDASSAKKAEDLIALLRKCVGSLPVVPVALTTPADITMTEWLNKGNAAPGFELGDEAELRSALEHGGIIRCKEQDLTSEEIQHHLNADKLVTKLALDWSESLSFLLGDDMSVKRLKFSDLIKEQNDDVATDDYAAKFDADFALMTGELMRFIPELITALGGEESTAK >NZ_CP044399.1|WP_019440384.1|84307_84994_+|phosphate-regulon-transcriptional-regulator-PhoB MSKRILVVEDELAIREMLCFALEQKGFDVVEAGDYPEAVERLVEPYPDLILLDWMLPGGSGIKYIKHLKSQPHSSAIPVVMLTARGEEEDKVKGLEVGADDYITKPFSPKELIARLNAVMRRVAPMTQDSVIDISGLQLDPVAHRVSAGDEVIDIGPTEFKLLHFFMTHTERVYSREQLLDNVWGMNVYVEDRTVDVHIRRLRKALTPSEHDKYVQTVRGAGYRFSVR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP044399_2 | 1013664-1013774 | Orphan |
NA
Consensus repeat of NZ_CP044399_2
|
1 spacers
spacers of NZ_CP044399_2
>2.1|1013691|57|NZ_CP044399|CRISPRCasFinder CTCATCTTTAGGGGCATCTTCAAGTGTTGTACTGGTGACATTCTCATCTTTAGGGAC |
CRISPR arrays and Neighbor proteins around NZ_CP044399_2
The CRISPR arrays of NZ_CP044399_2 >merge|NZ_CP044399|2|1013664-1013774|CRISPRCasFinder ATCTTCAAGTGTCGTACTGGTGACATTCTCATCTTTAGGGGCATCTTCAAGTGTTGTACTGGTGACATTCTCATCTTTAGGGACATCTTCAAGTGTCGTATTGGTGACATT >NZ_CP044399|2|2|1013664-1013774|CRISPRCasFinder ATCTTCAAGTGTCGTACTGGTGACATT CTCATCTTTAGGGGCATCTTCAAGTGTTGTACTGGTGACATTCTCATCTTTAGGGAC ATCTTCAAGTGTCGTATTGGTGACATT
>NZ_CP044399.1|WP_019442871.1|1012433_1013231_-|response-regulator MSLEVRDLSILLIEPSTTQNKFIKTQLQDAGVDNIECVVSIAQAKQSLTGFIPDLIISAMYFEDGSGAELAEFVKSNRLTENIAFMLISSEQRFSVLDKVKQAGAVAILPKPFKFVDLQRALNATLSYIEPEEMELDLYDVTALSILVVDDSLTARKHICRVLNSMGIVGVTSAENGVEALEYLAQNTFDLIVTDYNMPEMDGKELVEKIRMNPELSYLPIMMVTSEEGNAQLSAVKQAGVSALCDKPFDIDTVRMLIKQLLDEK >NZ_CP044399.1|WP_019442872.1|1011381_1012299_+|LysR-family-transcriptional-regulator MLEVKHLKTIIALEKTGSLVEASESLYMTQSALSHQIKDLEERLNTPLFIRKTRPLRFTVAGERVLKLAKSVMPMFTNTERDISRLLSGNAGRLHMAIECHSCFQWLMPAIDVFRDQWPEVELDLASGFSFAPLPALKRGDVDLVVTSDPQVLSGIHYEPLFSYQPMLAVSRHHMLASKSYIDPEDLATETLITYPVEQERLDIFNLFLDPAGVSPHAIRHAELTIMMLQLVASGRGVAALPNWALTEYLEKDYILAKPLGEESCWTTLYAAVRVEQLEMPYMSEFLKDAKASSFQQLKGIKIAN >NZ_CP044399.1|WP_019442873.1|1008910_1011199_-|5-methyltetrahydropteroyltriglutamate---homocysteine-S-methyltransferase MAKSHILGFPRIGADRELKKAIESYWKGDITKAELETVGKTLRAKHWQQQIDAGLDFITVGDFAWYDQVLALSATLGVIPARHQEETITLDTLFNMARGSSPCCGQQAAACEMTKWFDTNYHYLVPELNEDQQFTLSYNQLIEEIQEAKTLGFPIKATLLGPVSYLSLSKTNSDFDKLALLPQLVTTYKQLLANIAAEGIEWLQIEEPILVQDLTNDWQQAFTTSYAALAQGTNKLLLTSYFGALGDNAELAFSLPVDGFHIDLSRAPKQLETALTLLPANAILSAGIVNGRNIWRNDLATSVSSLQQAKAQLGDRLWVASSCSLQHSPVDLDNETKLDSELKSWLAYATQKLTEISSINAVLNGVNNETLTQQLRESTAVVSSRATSTRIHNTAVKARVAAITDQDAQRHSEFTQRIASQQQELNLPLFPTTTIGSFPQTSDIRQTRNQFKQNVISQDQYITKMQAEIKDVVTRQEALGLDVLVHGEPERNDMVEYFGELLDGFAFSKNGWVQSYGTRCVKPPIIFGDISRPAPMTVAWSQYAQAQTNKLMKGMLTGPVTILCWSFTRDDISREEQTNQIALAIRDEVVDLEQAGIKVIQIDEPALREGLPLRSCEQQAYLDWSTKAFRISASGVRDNTQIHTHMCYCEFNEIMPSIAALDADVITIETSRSNMELLSAFTDFSYPNDIGPGVYDIHSPNVPSVEWMTQLITNASEYIDVARLWVNPDCGLKTRGWPETEAALKNMVTAAHNLRVTFSHKA >NZ_CP044399.1|WP_019442874.1|1008527_1008827_+|Dabb-family-protein MAFKHIVMWTLLDTANGNDKSTNAKLAKEALEALNGQIPGLQHLEVGIDSLQGAGSYDLVLIADLDSRATLDVYQDHPAHQAVLPMMKSITSQRAAVDY >NZ_CP044399.1|WP_019442876.1|1007616_1008030_-|hypothetical-protein MANLLNQVETIYNVAKETAKTSVTAGFGVYGTIVDEASKSSDKATQLFESLVERGTQVEPQVKEQVSALLGKKISLETIETKAQSITSRFTGVQGQKLNEVESKIDLLAQMISELKTEPAKVQKVVKAATAKVTAEA >NZ_CP044399.1|WP_019442877.1|1006869_1007364_-|hypothetical-protein MSQVSLTYNPVEDRMLLIVSNNINHPQWWLTRHMCKKLLEMLNAELTLQYELDKIQSCYKENKANQEASFADKHQQALHDAAGRTEIQKKSTPAQPDALLTTRISLDKKPDNLVALYIYSRENHGICLDLDNNGLHIFLDMMLKVAIKGEWGLKQVKSIENKLI >NZ_CP044399.1|WP_019442878.1|1005724_1006759_+|nucleoid-associated-protein-YejK MQLKLNNIILHSLAFNTEGELKCYPRNEELANSQPVEELASELHRIYNAKPAKGFGYFKCAEEDNSRLPFEIELRKFIDEESNFVDFSSAASSLLVGELLKYDFVTQGILAFVHYNWMASDYLIVALLENKDSVMVTEQLDLNSSHYLELSKVQLAAKIDLTEWRQNSDSKRYLSFIKGRAGRKVSDFFLDFLGCTEGMDAKIQNAGLMRAVDEFCHVAELDADEAIQAREQVAQYCNEQIKEGSEIEVKDLSDHLADVSSRDFYQYASEAYELEDSFPADRGAVRKLTKYVGQGGGLSVSFDQKLMGERISYNAQTDTLTIVGIPPNLREQLTRRSNSEDDSE >NZ_CP044399.1|WP_019442879.1|1005413_1005644_-|DUF1414-domain-containing-protein MPIVSKYKSDKVEKVIDEVIDVLEKHDAPLDLGLMVLGNAAANIINASLSPKQRQAVAEKFAKALVASVKSKDTSH >NZ_CP044399.1|WP_019442880.1|1003442_1005395_-|DUF3413-domain-containing-protein MPCHLFLPITLLKAKLETVVQLMLETGHHYRDQVSKIISWGHWFSLANILLAILLASRYLFIAEWPETMLGQAYSLISLLGHFSFIIFIMYLVVIFPISFVIPFPRALRFLTVIFATVGLSLLIIDTEIFKLYNLHINPIIFEILLGESEQTLNSDWQTLFAFVPFLFLLELLISSLLWHRLRPLSRFKLGPIIAIFFFCCFLTGHLLHMWADAAVYRPITAQKANFPLAYPMTARTFLAKYGWLDKDAFNKRVSDTKKQSDSRLDYPKNPLDVNDEKQDFNVLLINISALRADMLNDSVMPEMTKLALEGQRFNNHFSISNSDLLGNFGIMYGLAPQYWDDIEISAKSPFMLDYFAQADYNLGIFNTEALSRHKQKQTTFINLDSPQTTIVEDTENDKETVTKTREWIRDQDATTPWFAYVSLESVQNMDTPAGFPALFYPNIQDLNSQANNRQIALFNSYRNSVSYVDKAIAKIVYQLKQSQQYANTVIIFTANHGNEFNESEDHSWGYGSNYSIYQTQVPLFIVWPGKKPSVITQDTNSTDLVPTILTNLNAVNNPISDYSSGIDLFAGEFKSWQLLGDKNNFVILQQDTITQFSYQGLFTNQGNHNVRNRDNYKPMPRGAMLDTQFNQILAELNYFYKATPAQEQK >NZ_CP044399.1|WP_019442881.1|1002729_1003407_+|protein-phosphatase MLIHKDMPMNHKGQDFFVGDIHGEYDLLLTTLTQCQFNFECDRLFSVGDLVDRGSNSIACLALLHEPWFFAVRGNHEEMLLADEDSELARIHRSAGGEWFFQCSLLEQHRLRMLVEEYCPFAFTIESKFGSIGVCHANAPHHWSALQNATVDDIALLQDCAWSTKQYQQVKQGKLFNISGVKFVVHGHVNCARVTTNLNQLWIDTLMRTRRLTVLSAQQAYMVTA >NZ_CP044399.1|WP_019442869.1|1015013_1016423_-|HAMP-domain-containing-histidine-kinase MATGLFVIYCAITDTNSARVEQTLHKNLAQQIIHYSDDLQQGDISRSALKPAFHSLMLLGPRYEIYITDNRGQLLVYAAEPSKIKRNNINTAPLERFIKGADYPIYADNPRSPDQQKLFSSAPIFKNSQQIGYVFVILGGDKYDSIVKNLAFDSDMYKILAALIIFFALAFALLVFIFARLVRPISQLDKDMANFVNSDFSTVSNSIPDQYAANEIINLHNNFGSLESKINKQLTQIKSTEQLRREMLSHISHDLKTPLASLKGYLETWLLQYPDAAGTDFIQVAQKNANQLQRLVEQIIELAQLDSNTVSLYQEPVAVAELAQDVLSKFQLQAQQKNITLSVEPKDPSLQAIADIAKLERVLTNLVDNALRHCQSGDSIKIQLKPKDNQLIISIADSGVGIPKEDVDHIFDAHFRAKNTVNGQQGNSGLGLAIVAKLLSLHHAHISVSSVLSQGTTFSFSLPTTSVNI >NZ_CP044399.1|WP_019442868.1|1016472_1017177_-|response-regulator-transcription-factor MDTHVLVVEDQQDIANLIRINLEMIGNKVICCHNAKDAFQQLSAHTFQLILLDLNLPDMDGLDICKKIRSTDAIVPIMMLTARTEELDRVQGLEAGADDYLAKPFSVLELQARVKALLRRSNVQAVKNEEPEKIKIADLIIDQATHSVRRNDTLITLTSTEFSLLLFLAKSPGRVYSREQLLAEVWDYHNDCYEHTVNSHMNRLRNKIEPNPAQPTYIKTVWGVGYKLEVNDVT >NZ_CP044399.1|WP_019442867.1|1017399_1017855_+|hypothetical-protein MLGQITESDKLILLYQFETQGELAPESIIDISDEEARFIRTSGEYILWESSKRDFDYPEVANSHWLETTYLGQAAKLDCLQSRDAILCPLFMSNQFRGEWHIHNGFLRMNIESSHHQMELFSVANDDCNIHSLLLFKDKQLKGAANITLMV >NZ_CP044399.1|WP_019442866.1|1018497_1020036_+|Re/Si-specific-NAD(P)(+)-transhydrogenase-subunit-alpha MQIGIPRESLKGETRAAATPATVEQLQKLGFTVLVESNAGQLASFSDATFEAAGATISTDTKQVWASDIVLKVNAPANDKEIKLLQKGTSLISFIWPAQNEELLEKLAKREINVLAMDSVPRISRSQSLDALSSMANIAGYRAVIESANEFGRFFTGQITAAGKVPPAKVLIIGAGVAGLAAVGAAGSLGAIVRAYDTRPEVKEQITSMGAEFLEVDFEESAGSGDGYAKVMSDDYKVHEQKMLADQVADADIVITTALIPGRPAPRLISQEMVDAMKAGSVIVDLAAVNGGNVEPSVVDKVITTDGGVKIIGYNEMARRLPAQASQLYGTNLVNLLKLLTPEKDGEMSINFDDVVQRGVTVIKDGEITWPAPPIQVSAAPAAKKEEVTAAPAKPEKKKTGIYKALLAGGGIWAYSALASYVPAEFLNHLMVFALACVIGYYLIWDVASSLHTPLMSVTNAISGIVILGAFFQMGAESGLVTFLAFLGTFIATINIAGGFAVTERMLKMFRK >NZ_CP044399.1|WP_019442865.1|1020047_1021583_+|Re/Si-specific-NAD(P)(+)-transhydrogenase-subunit-beta MSQESIDAAQTAINAAQAAVDAATQAAQVAQTAVAEQAPVVIEAVQEVAVATSGKGILEAAYIAAAVLFVLALAALSKQETAQKGIFVGILGMVVAVVATLFSSDVTNIGYIIAAMLAGGAFGVRWANKVAMTEMPEMVAILNSFGGLAAVFIGYNSYIEHSITEPVMLSIHLTLIFLGVFIGIVTFVGSLVAWGKLNGRVKSSALMLPHRHKMNLAALLVIVFLMFSFVGAGLEGDTAALVIMSLIAIVFGAHLVLSIGGADMPVVVSMLNSYSGWAGAATGLILGNDLLIVTGALVGSSGAILSYVMCKAMNRSFISVIAGGFGNDVAAPTGDEEQGVHVETSAAEVAEMLMGSKRVIITPGYGMAVAQAQYPVFAITKKLRDAGVDVRFGIHPVAGRLPGHMNVLLAEAKVPYDIVLEMDEINEDFNTTDTVLVIGANDTVNPAAKEENSPISGMPVLEVWNATNVVVFKRSMATGYAGVQNPLFFKENTTMLFGDAKESCEQIISAM >NZ_CP044399.1|WP_019442864.1|1021809_1022220_+|hypothetical-protein MADVKWEAAPSGLTSGDLNGWKRRMALACPSPKLGQMVAVETDDSDGKVIHIGYKCIRADGENSSWRRMKGTEIVITPETIIIKGAVTESGQQATLSDVFLGNASTSSGIPVTMSDDGPGVYLGDGVYVSAEDCWF >NZ_CP044399.1|WP_019442863.1|1022306_1023533_-|hypothetical-protein MGSYAEIKINGNGLIDWKNTYDEWYFTKADRVRYIANKEDEYDPENIIGYRTNVATLRRRLQLAGNDLKSVECDFNDTRSIWVQNMKDMLLLYQEDKESKYDQFNSNMVDRITSQLEIVQNTSFNDWKKAVPIALEMSDNYTEQAIMNRDVYIPDEPLLSLMLSPLAGVYDHSLGFMGSTFPCTYVESYAIILFDMCNDDDICELNISDLVYGGWVDDFEDIAQIQAGRTVFHEHFKQSLDELSTLNNSSENKILQRMIFATAISTVEAYLSDTMKKQVLNRHAIKRRFVKHFKSFNKNVKESGVFEFLDTLDERLNEEIDKISFHNLDTVTGLYKNVLLCEFPKDKISKLDAAIDIRHDIVHRNGKKTDGSLVMVSQQDVVNLIDLVQHIIKEIDYQIIDGLLDNVE >NZ_CP044399.1|WP_019442862.1|1023878_1025024_+|hypothetical-protein MLRILVLFLVSFNLLASTIEQEYPQEIKQLKIEINETNSTLKILNNRITSSLDSNKEVVNQSRDLIKAVNESLRVKTVQNNQILDAFEKAKDMTAQEVDLSLSLEPDEYGIWMNFASALLIAIGSIAVTFLILKRTLAKETDVQLKGFELNLQEETKLSNSQISTQLAVATEQNKANHLLKMAEFRQAWINTFRDYISVYIKTVITLIDFHTVESSLFSSWDNLKRAERARDRFLLEKSVEAKQIREEEYKAEPVTNRQKLTNSVRDKLLIQASSEYQDFVENTANAKNGFERYKTELRDFKDLQSSITQQKTRIILMYGPDRTKIEDLIIERLNNIEEYLMFNSDRTFLPQGNVNKVHEYINELQHLVQIMLKIEWTKIK >NZ_CP044399.1|WP_019442861.1|1025072_1026533_-|hypothetical-protein MSIEAIDIEGTETIEIPLFENPSMTDFEISMNKFNTSINSQFMNVSTGSQRINFNGILSVEGIDSKAKKIITNSNYEINALDGELVILFTDPLYVSMLTCRGKSFRDSKSCMIRLISGEFSEKKFISRHSVTSHGREAEYKINDIIRGVILSPSTEFSHASAIDFPLFFSDDIKMNSISDAHKGLSLEAETLRQTIFNQVNDVTSYLDTKTSDYKALVKDMNTATKEKERLEKSNAQNSLILNQIQHDIAKTSEEAAKYQLLKMNAAAEIDGLKASINEHKDIVKSESREFDVLLKDVADKKIELLGLKDEIRTAKKDINLTSLDMKGHSTESQKQLTYYYRLSLAVIVFLAAVFFFIYSNAETFKVLIDANSKVSPWNILLSRLPLITATTLIIGTLSALLFYLVNHIILVNADKMNMLKASILAEQITGTLSSKGMTDEEIRDCKRNTKIELVMNVFTTKPEKVSESKQQDVLKQILEAVKITK >NZ_CP044399.1|WP_019442860.1|1027254_1028100_+|hypothetical-protein MSFKKVLLASIIASTLVGCDSSSPDSVIAPAPVTLTIPTEMQAQDVSGCDNEYMVTHSIEMRDDLELPVNVAEVRSVCNPELFVEYDVAEVDKPMVHINWAKEVFSEDGDLIMYQGGHAKPSSSAAHDKLAAERGYRGYYLTLDELGGYINANEVTDITYFSSGNYHHTFTSGLDKQSYTIIDNTIDTVVVSAAMFVDPTKVDEHDWNEFWYECTGTVTSIGSNETMGCLVHKAGAPKYQFQTITVDMNNFPKPTKFKEDYAHATQMYVEKAFTDNGIMIE |
You can click texts colored in the table to view more detailed information
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP044399_3 | 2267637-2268024 | Orphan |
NA
Consensus repeat of NZ_CP044399_3
|
5 spacers
spacers of NZ_CP044399_3
>3.1|2267680|26|NZ_CP044399|CRISPRCasFinder CAGTGTAGCTGGTTTCAATGATGTCG >3.2|2267749|26|NZ_CP044399|CRISPRCasFinder TCACTATGGTGCGTATAATACCGTCG >3.3|2267818|26|NZ_CP044399|CRISPRCasFinder TTCAGGTGGTCTTGGTAATAATGTGA >3.4|2267887|26|NZ_CP044399|CRISPRCasFinder CTCTCTCGGCCTTGATAACAACGTAA >3.5|2267956|26|NZ_CP044399|CRISPRCasFinder TACTCACGGTGCTCATAATACGGTTG |
CRISPR arrays and Neighbor proteins around NZ_CP044399_3
The CRISPR arrays of NZ_CP044399_3 >merge|NZ_CP044399|3|2267637-2268024|CRISPRCasFinder TTATCGCACAGGATGGTGACGAAAACAAAGCTAAAGTATATCTCAGTGTAGCTGGTTTCAATGATGTCGGTATAGCACAAGATGGTGATTATAATACAGCGAAAGTTGATATTCACTATGGTGCGTATAATACCGTCGATATCGCTCAAGATGGTGATGACAATTTAGCGAAAGTTGATATTTCAGGTGGTCTTGGTAATAATGTGAATATCGCTCAAGATGGTGATGACAATTTAGCGAAAGTTGATATCTCTCTCGGCCTTGATAACAACGTAAATATCGGCCAAGAGGGTCATGATAATACAGCCAAAGTTGATATTACTCACGGTGCTCATAATACGGTTGATATCGAGCAAGATGGTTATGATAATTTAGCGAAAGTTGATGT >NZ_CP044399|3|3|2267637-2268024|CRISPRCasFinder TTATCGCACAGGATGGTGACGAAAACAAAGCTAAAGTATATCT CAGTGTAGCTGGTTTCAATGATGTCG GTATAGCACAAGATGGTGATTATAATACAGCGAAAGTTGATAT TCACTATGGTGCGTATAATACCGTCG ATATCGCTCAAGATGGTGATGACAATTTAGCGAAAGTTGATAT TTCAGGTGGTCTTGGTAATAATGTGA ATATCGCTCAAGATGGTGATGACAATTTAGCGAAAGTTGATAT CTCTCTCGGCCTTGATAACAACGTAA ATATCGGCCAAGAGGGTCATGATAATACAGCCAAAGTTGATAT TACTCACGGTGCTCATAATACGGTTG ATATCGAGCAAGATGGTTATGATAATTTAGCGAAAGTTGATGT
>NZ_CP044399.1|WP_019440514.1|2266429_2266972_+|hypothetical-protein MARLLKVKNISICCLLTLSSYVFAGGSPYEPPPELAALSELAPSGNYSNIEFDGAYLSKVKVLQSRDGGLGNKAHIQLSGIKNKAKIVQDGSNNSAYIDQSGRFNIAKTVQRGQGHESAIIQHGHRNVAVHIQGGNAQHKGTINQSGNNNLAFIKDTTNNSRDFSVNQTGRGRIIINNTF >NZ_CP044399.1|WP_019440513.1|2266150_2266417_-|hypothetical-protein MTLFLLKQHVVTWIKYWIVRGFNHQLYDEIINVYVNFKGLIRLIIGHLFGLETVKKLMLKFIFWLGLNRCWDICLCFYNKKATAVYIA >NZ_CP044399.1|WP_019440512.1|2265000_2265975_-|paraslipin MNNAIDILLTPWLWITIVVIFTIQRSVLFIPQNRGYVIYTFGRYSGTLQAGLNFIVPFIQRVAADRNLKEQSLDISSQLAITKDNITLELDGILFMKVVDAAAATNNITDYKLAVVQLATTTMRNAIGSMELDQCFQNRDNINASILASMTEATQPWGVQVTRYEIKDITPPISIKEDMEKQMAAEREKRSVILTAEGVKTAAITQAEGLKQARVLDAEAAKAEQVLAAEASKESQILEATGKAEAIRLVADADSSALHVVGAVAITGEGQQAVRLKLAQDAIAAHKAIAAEGSVILTDGKTSENIGNTVAQAIAVSSALKLSE >NZ_CP044399.1|WP_019440511.1|2264504_2264969_-|NfeD-family-protein MDTLMDYLQNNHDQLLYVIGALALIIELSVTGLSGPLLFFGLSCLLTGLLVSIGVIQGWEFEILSVGLLSAVVALLLWKPLKQFQGNRVVQDTSSDMIGQTVPVSEVITINGGKVRHSGINWNARLSESATVSSIAVDLRVKIVAVDGNVLIVE >NZ_CP044399.1|WP_019440510.1|2263511_2264405_+|LysR-family-transcriptional-regulator MDIKIQQLRHFVLVVEDGGFRAAASRANRSQAALSTSIKELERTLEQPLFEPGNKSTLTPFGQICLPKITQFLQIYKTLDNDLRAAAAGQQGRVRIASVPSVAAKLIPNVLGRFCKEYPNVEVSLIDDNAAGVEARLVSGEVDLALGNCANLDADTVDFTPLISDPIGVVCLKDNPIANNMNGIEWQALLDQPFIHNGTCTLLEPTPARVLIDKALYSVENITSLFSVLKLGIGITTLPKLAFPSNETDLVWLPLLDPPLERKMGIFRLAEHTISPQAQAFYELCVEHLICSDELGS >NZ_CP044399.1|WP_026032076.1|2262921_2263314_-|endoribonuclease MNMTTQSYPVKTELFASKAPLEWAIVNNGTLYTAQIPIDQTGAVVAGGIEAQTRQTFDNLVHTLECAGESLNSVLQVLIYVTDREYLATVNKVYAEYFDAPYPNRAAMIVAGLAREEMLVEFVVYAAVSE >NZ_CP044399.1|WP_019440508.1|2261321_2262785_-|aldehyde-dehydrogenase-family-protein MTTQTQRIQAENSLYIGGEWQTGVSTVANINPSDISQNLGNFAQADTAQVHQAISAAKHAQPTWEKTPLEQKQAVLQGIGDELIARCDELGRLLSSEEGKPFLEGRGEIYRAGQFFQYFAAEVLRQIGDSAASVRPGVSVEVTREAVGVVAIISPWNFPTATAAWKIAPALAFGNSVIWKPANLTPASAVALTEIIHRQGLPAGTFNLVLGSGSEVGNVLINSTEVNGVSFTGSVDTGRKIAAATAPNFVRCQLEMGSKNALIVADDADINIAVEATIAGSFSGAGQKCTASSRLVVMDGIHDAYVEALIKRMSELKVGHALKDGVFMGPVVDGKQLDANFDWIDTARQSGGELAFGGERLNLEHEGFYMSPTLFINTKNDWSVNQEEVFAPMASVIRVADLEEAIATTNDTRFGLTSGIITQSLRTSTLFKQQAQTGCVMVNLPTAGTDYHVPFGGRKESSFGPREQGQYAKEFYTVVKTAYQRAY >NZ_CP044399.1|WP_019440507.1|2260240_2261230_-|peptidase-M19 MYSQRIVIDGLQYCNWDREYFQTLKNSGITAVHATIVYHETARETLSRFAEWNLRFEQNADLIMPIHSVADIEKAKALGKVGIFFGAQNCSSIDDEIGLIEVMRQQGLLIMQLTYNNQSLLATGCYEKNDNGITRFGKQAIAEMNRVGMIIDMSHSAERSTLEAIDLSSRPICISHANPTFAFEALRNKSDTVIKSLAARGGLLGFSLYPFHLPNGSQCSLDDFCQMVAKTADMVGVEHLGIGSDLCLNQPQAVLEWMRNGRWSKAMDYGEGSANNSGWPDSLPWFCGSAGMENIYNGLIRYGFSESEAGQVLGENWFNFLKQGLEPIS >NZ_CP044399.1|WP_019440506.1|2258575_2260141_-|BCCT-family-transporter MSDLTKSAKAASLDNDNNSTADKLGFSNPAFWYSGSFLALFVLLALYDEVLLSSLVNTGFSWAVTVFGPYWQVLLLLTFLIGIALAAGRTGKVVLGALPKPEMDGFRWMAIIFCTLLAGGGVFWAAAEPIAHFVNPPPLYGAQENIQQTAVNALSQSFMHWGFLAWAIVGSLTSIVVMHLHYDKGLPLKPRILLYPVFGKRVLTGHTGALIDACCIVAVAAGTIGPIGFLGLQVSYALNVLFEIPDGFTTQLIIVLFAIALYTISALSGLNRGMQMLSRYNVVLACLLMAYILIFGPTNFIFNGYIQGVGSMVDNFIPMATYRGDEGWLSWWTVFFWGWFLGYGPMMAIFIARISRGRTIRQLVSTISIIAPLTTCFWFTIVGGSGLAFEIANPGSVSSAFEGFNLPGALLAVTSQLPFPMITSVLFLILTTIFIVTTGDSMTYTISVVISGEEEPNAFIRTFWGVVMGITALVLISLGSGGISALQSFIVITAVPVSLILLPSLWNAPQIAIQMAKDQGL >NZ_CP044399.1|WP_019440505.1|2256732_2258463_-|hypothetical-protein MDARPSTCGGTYMRDPVTVMAPERLGAMHQNRISFVRSLIRKMAQQKWQVTKHDWQLSAEGFGHVIYKLTTLNHIYHLVIFCDEIADEDRNDRVIAEKWDVTFALVYGDVDVDLLSRLRTNVPLQEAGRNPNKVLVLARANKSVRVFEHLVSHLAKGQQPNAKELAEVGYILRTTAVYGNGKFGIADFGWLETTEDFNQSFSAQMCAVYILREFSLDWVHYLAQQQGGDKAVNLDLGLQRYLGIGNATGLGMAPYLINHPCVVDQWLSSRESALTAVLNAAVEVHKLAPLQHLLQKGLCHLEQIITINEHQDDLNNTAITELHDLLSNLDSLLIQSHSLLPQMKTWSELIDYASKYSLETQEILLSCLMELYPALVDNYETKMNCDESLNLPSGKRIEDLLAVLQSRYRWAIETDFKQPENNYWFWYRSQDKEEPRLGVRGEEPGEDRELPLDIGRQANRLYHALLICKPNMQLAEFLVLHPQYRAISRRVWTLGNKQMGEIQMNVLHQKSLPMHLLRCKLAVLGATKFDPRSERWVRVTFFQGAPLLNELHDGEWLFPLLPSNALNQAVLEGEVS >NZ_CP044399.1|WP_019440516.1|2268208_2268592_+|hypothetical-protein MTYLVNRKKSAAFSAAIFLFALQLPQTLHSKEQLRIKPNDIKREAKEMNILCQNVGNSIHLVIINQDDNYHTINLTSKTSNQFTTSIDKNSQVNLSLSKEQFPIKIITSSSNKTTVFMIDKDCKISS >NZ_CP044399.1|WP_019440517.1|2269446_2270394_-|hypothetical-protein MKFLWLFFIVIMTGCSNSLSIPDTSDTPSLMQRGNTYNDLVALPKPKGKIYVAVYDFRDQTGQYKPQPNSNFSTAVPQGATALLTMALLDSQWFYPLERQGLQNLLTERKIIRAAQSKDKVVSNHGTDLPSLQSANVMIEGGIVAYDTNIKTGGMGAKYLGIGGSGKYRTDQITVNIRAVDIRSGKILSSITTTKTILSYELAAGAFRFVDYKELLEVEMGYTNNEPVNIALMSAIDAAVIHLIVNGVEQGLWSPSSLDSLDSPVFKKYASQSSTLNTNAQQASTNDVFTKDASELMTKNTSVTNSRPKDYRATY >NZ_CP044399.1|WP_019440518.1|2270416_2270824_-|hypothetical-protein MKITTQLFALSFILSCSNVIASELVYTPVSPSFGGNPLNSSHLFNTANAINDYSGPEIDSGFEEKSALERLASSLESRLISQILSDASEGKTGQLITEDFTVNVVEGDSGALLIHLVDNLTGESSTIQVGGITSN >NZ_CP044399.1|WP_019440519.1|2270834_2271293_-|hypothetical-protein MMKSKQGKALFYVMCISGLCISANVQAIDDKTPLEESQSNDDSLVEIQGLLIDRTLTRLGKDFYFTFAMKMNSEYGDLEVNLTISEVPTALSGSIITIHHFNRVIYKTALSPGRYQAEQRAEEAMYVTRNYIVKWKAEKQFQDTFDLERSEL >NZ_CP044399.1|WP_019440520.1|2271292_2271955_-|helix-turn-helix-transcriptional-regulator MINEKYTCYLVSKSSLQSSLLKQSLEKSLDIVILDVSFTELLQSLSSKKSNKNLNYVIIDLNHLQDDYLSKYLILVDEKNLNTKEILINSESVIIIDDLMRLPNLTGLFYESDTMELMSKGMQKMLDGEFWISRDLATSIITVHRKDKYFTSSVIAELTRREEEIMKLLTLGASNSQIAEQLFVSENTVKTHLHNVFKKIKVKSRLQAVMWAKGQQFQRV >NZ_CP044399.1|WP_019440521.1|2272496_2273402_+|acyltransferase MSSLRGCLAFVLWLVNLLFWVIPIMILSPIKLLPIKIIQRICSSLLVFFASSWIRVNGVIEHFIHPVKIHVHNADIELSEKEWYMVIANHQSWVDILILQRVLNKKIPFLKFFLKKELIFVPFLGMAWWALDFPFMRRYSTAQLKKNPKLRGKDIEVTRKACAKFKSSPVSVMNFVEGTRLTTEKHSKQKSPFKHLLKPKAGGLAFALSALGEHIQKIVDVAIYYPGQTPSFWQYLCGEVKDVHVHIRVADIDDKMRGDYQKDRAFKIGFQQHLNDVWVEKDAILKTMAQSHKADTDKTAL >NZ_CP044399.1|WP_019440522.1|2273684_2274665_+|tripartite-tricarboxylate-transporter-substrate-binding-protein MLKQFRKCVTSSLIVASIATLSSAAWAADLEKIHFIVPGGAGGGWDMTARGTGDVLMKADLVEKVSYQNLSGGGGGKAIAHMIETAARQQDTLMVNSTPIVIRSLTGIFPQSFRDLTPVATTIADYGAIVVAADSKFENWSQVVTAFKENPRKVKIAGGSARGSMDHLIAAAAFKGEGFDARKVRYIAYDAGGKAMAAVLSGETPLLSTGLGEVLEMSKSGQVRILAITAPERLAAAPNVPTLTEMGNETVFANWRGFFASPGISDAKVAEWNKVLAEMYKTDQWATVRDRNGWIDNYKADKDFYAFLEQQEEQMGALMRELGFLK >NZ_CP044399.1|WP_019440523.1|2274733_2275231_+|tripartite-tricarboxylate-transporter-TctB-family-protein MQTTPLPRLNRDRVSGLIFLLVCLIYGYQATQIQLFPGDEYEAFTARTLPYLLTAGGIIMSLLLIVMSPTNACSTSACENNNESSLDWRLLSAFVALMTAYGVGLTWLGFVLATSLFLLVGFWLLGERRKAVLFGASFPFVTVFWLLLTKVLDIYLEPGYLFLSF >NZ_CP044399.1|WP_019440524.1|2275241_2276777_+|tripartite-tricarboxylate-transporter-permease MLDGILAGLSTAIMPTNLMMVMVGCFVGTFIGMLPGLGPISAIALMIPITYGLDPSSGMILMAGVYYGAIFGGSTSSILINAPGCSSTVVTAFDGYPLAKKGQAGKALALAAYASFTGGTLSAIMLLIAAPALAKVSLSFQSSDYFALMLVGLSAVAAFAGKGQVLKAWMMTIFGLMLSTVGIDKGIGVERFTFGLTDLMDGFSFLLLAMATFALGEILFSILKPEPDTSAEENSALSEIGSMKVTKEEFKEVAPVAARSSILGFFVGVLPGAGATIAAFLSYGLERNLAPKDKRDEFGKGSIRGLVAPEAANNAASSGSFVPLLTLGIPGSGTTAIMLGAMISYGIQPGPRLFVDNPEIFWSVIISMYFGNLVLMVLNLPLIPYIAKLLAVPRTVLLPMIIFFSITGVYLVSFNTVDVFIMILVAVIAIFLRLASFPLAPLLLGFILGGLMEENLRRSLMISDGELSFLWERPITLTFTVISALVLVTPILLTAFNRRRAKKAVFVDECD >NZ_CP044399.1|WP_019440525.1|2276919_2277609_-|response-regulator MTSTNTIKVLIIEDDVGIAEIHRRNLMKIDGLDIIGIATTKAEAEVLLDVLTPDLILLDVYLPDGNGLDILRDLRQQQHACDVILITADRDSDTLQAAMRGGVVDYILKPVIFARLEESLNKYLKQKNQFVNLDDVDQHMVDAMISVSVKSPATSRLPKGIDSVTLDKVRGLFAEHADITADNAGVLIGASRTTARRYLEHLISTGELVADLNYGTVGRPERTYKKQVR |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP044399_3 | 3.1|2267680|26|NZ_CP044399|CRISPRCasFinder | 2267680-2267705 | 26 | KJ019094 | Synechococcus phage ACG-2014e isolate Syn7803US33, complete genome | 22081-22106 | 4 | 0.846 |
NZ_CP044399_3 | 3.1|2267680|26|NZ_CP044399|CRISPRCasFinder | 2267680-2267705 | 26 | KJ019156 | Synechococcus phage ACG-2014e isolate Syn7803C2, complete genome | 22081-22106 | 4 | 0.846 |
NZ_CP044399_3 | 3.1|2267680|26|NZ_CP044399|CRISPRCasFinder | 2267680-2267705 | 26 | KJ019054 | Synechococcus phage ACG-2014e isolate Syn7803C85, complete genome | 22081-22106 | 4 | 0.846 |
NZ_CP044399_3 | 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder | 2267818-2267843 | 26 | MN694284 | Marine virus AFVG_250M474, complete genome | 20112-20137 | 4 | 0.846 |
NZ_CP044399_3 | 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder | 2267818-2267843 | 26 | NZ_CP011829 | Enterococcus faecium strain UW8175 plasmid unnamed1, complete sequence | 44935-44960 | 5 | 0.808 |
NZ_CP044399_3 | 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder | 2267818-2267843 | 26 | NZ_CP032307 | Enterococcus faecium strain HY07 plasmid unnamed2, complete sequence | 49947-49972 | 5 | 0.808 |
NZ_CP044399_3 | 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder | 2267818-2267843 | 26 | NZ_LR135289 | Enterococcus faecium isolate E7199 plasmid 3 | 21750-21775 | 5 | 0.808 |
NZ_CP044399_3 | 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder | 2267818-2267843 | 26 | NZ_CP040704 | Enterococcus faecium strain HOU503 plasmid p1, complete sequence | 83821-83846 | 5 | 0.808 |
NZ_CP044399_3 | 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder | 2267818-2267843 | 26 | NZ_CP035137 | Enterococcus faecium strain SRCM103341 plasmid unnamed1, complete sequence | 194324-194349 | 5 | 0.808 |
NZ_CP044399_3 | 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder | 2267818-2267843 | 26 | NZ_CP035221 | Enterococcus faecium strain SRCM103470 plasmid unnamed1 | 101565-101590 | 5 | 0.808 |
1. spacer 3.1|2267680|26|NZ_CP044399|CRISPRCasFinder matches to KJ019094 (Synechococcus phage ACG-2014e isolate Syn7803US33, complete genome) position: , mismatch: 4, identity: 0.846
cagtgtagctggtttcaatgatgtcg CRISPR spacer aagtgttgctggtttcaatgctgtca Protospacer ***** ************* ****.
2. spacer 3.1|2267680|26|NZ_CP044399|CRISPRCasFinder matches to KJ019156 (Synechococcus phage ACG-2014e isolate Syn7803C2, complete genome) position: , mismatch: 4, identity: 0.846
cagtgtagctggtttcaatgatgtcg CRISPR spacer aagtgttgctggtttcaatgctgtca Protospacer ***** ************* ****.
3. spacer 3.1|2267680|26|NZ_CP044399|CRISPRCasFinder matches to KJ019054 (Synechococcus phage ACG-2014e isolate Syn7803C85, complete genome) position: , mismatch: 4, identity: 0.846
cagtgtagctggtttcaatgatgtcg CRISPR spacer aagtgttgctggtttcaatgctgtca Protospacer ***** ************* ****.
4. spacer 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder matches to MN694284 (Marine virus AFVG_250M474, complete genome) position: , mismatch: 4, identity: 0.846
ttcaggtggtcttggtaataatgtga CRISPR spacer gacaggtggtcttggtaaaaatgtaa Protospacer **************** *****.*
5. spacer 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder matches to NZ_CP011829 (Enterococcus faecium strain UW8175 plasmid unnamed1, complete sequence) position: , mismatch: 5, identity: 0.808
ttcaggtggtcttggtaataatgtga CRISPR spacer agtaggtggtcttgttaataatttga Protospacer .*********** ******* ***
6. spacer 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder matches to NZ_CP032307 (Enterococcus faecium strain HY07 plasmid unnamed2, complete sequence) position: , mismatch: 5, identity: 0.808
ttcaggtggtcttggtaataatgtga CRISPR spacer agtaggtggtcttgttaataatttga Protospacer .*********** ******* ***
7. spacer 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder matches to NZ_LR135289 (Enterococcus faecium isolate E7199 plasmid 3) position: , mismatch: 5, identity: 0.808
ttcaggtggtcttggtaataatgtga CRISPR spacer agtaggtggtcttgttaataatttga Protospacer .*********** ******* ***
8. spacer 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder matches to NZ_CP040704 (Enterococcus faecium strain HOU503 plasmid p1, complete sequence) position: , mismatch: 5, identity: 0.808
ttcaggtggtcttggtaataatgtga CRISPR spacer agtaggtggtcttgttaataatttga Protospacer .*********** ******* ***
9. spacer 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder matches to NZ_CP035137 (Enterococcus faecium strain SRCM103341 plasmid unnamed1, complete sequence) position: , mismatch: 5, identity: 0.808
ttcaggtggtcttggtaataatgtga CRISPR spacer agtaggtggtcttgttaataatttga Protospacer .*********** ******* ***
10. spacer 3.3|2267818|26|NZ_CP044399|CRISPRCasFinder matches to NZ_CP035221 (Enterococcus faecium strain SRCM103470 plasmid unnamed1) position: , mismatch: 5, identity: 0.808
ttcaggtggtcttggtaataatgtga CRISPR spacer agtaggtggtcttgttaataatttga Protospacer .*********** ******* ***
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
DBSCAN-SWA_1 |
360529 : 369251
Sequences of DBSCAN-SWA_1
Nucleotide sequences of DBSCAN-SWA_1 >NZ_CP044399|360529:369251|DBSCAN-SWA GATGAACAAATTTAATTTATCCATTCCATCTAAATATTTCGCACCATTGTCTGGTATTCAGCGGGTTTTCGAAGTCGGATTCATTCTTTCTACTTTTATTGCCTGTTATGCGATGATTGCACTAGTGAGCTTTGATCCTGCTGATCCATCATGGTCACAAACCAGTTGGCAAGGTCCAGTTAAAAATGCTGCAGGTTCGTTAGGTGCATGGATGGGGGATGTCTTATTCTTCACGTTTGGCCTTTATGCTTACGCAATCCCACTCGCATTTGTCTCATTAGCTTGGTTCATATTTTGGCGTCCAAGACAACTCGATGAAATTGATTTTTTCACCGTAGGTTTGCGCATGATCGGCGCATTATTACTCCTCATTGGCGTGTGTGGCTTAGCGTCTGTTAACTTTGATGACCTTTACTATTTCTCATCGGGTGGTTTAATTGGTGACGTTGTGGAACAGGCGATCAGTGAATTATTTGGCATACTCGGTTCAACACTTATTTTACTCAGTTTTGTTGCTATTGGTTTTACCCTGCTAACGGGCATTTCTTGGTTATCAATCGTTGATATGCTGGGCGCTGGCGTGATTAATAGCTGCCAATACTGCGTGGATAAAGTCACGGAATTAAAAAATAGATCAGCGAGTGAAGCTGAAGATGATACGCCTCATGATGATGCTCTGAGTGAATCGACTGTTCCAGTACCTGTTAATATGCAGCAAATGCATAATCATTTAACCAGTGAGTTCGATAACCAGAAACAAGAAGATGGTCCTGTAGCAGGACGCGAAGTTTATGAGCAGGATGACGTCATGAATATGTCATTTACCGCTGTAGACCATGTGCAATCACGATCTGAACCGTCAATATCTACATTCGATGTGCCAGAGAGTGGTAATCTGCACGACGATTATCTTGCCGACTATGACGCTCAACAAACCCATAGCCAAGTCGACAAGCCTGTCGGAAATGAAGCGACAATCGACGCAGAAATCGATCCTGTACTTGCCGCTGTAGCTGCACCAGAACCAGCTCCTGAATTCAAGATTAATAATAACGGTACGGACAGTGGTTTTGAAATTGTTGGTGACCAAGTGGTATCAACAGATCCGCTGCAATTTAAAGAAAAACCAGTGACTTTGTTGCCAGGGTTAGAGCTATTAGATAAACCAAATAAAAAAGCCAATCCGATCTCGCAAGCAGAGCTCGACCATGTCGCTCATCTGGTGGAAGAGAAACTGCTTGAATTTAATATTAAAGCGAAAGTGGTCGATGTCCACCCAGGGCCGGTGATCACGCGTTTTGAATTAGATTTAGCGCCGGGTATTAAAGTCAGTAAGATAACGGCATTATCGAAGGATTTAGCCCGTTCATTATCCGCCATGAGTGTCCGTGTTGTTGAAGTGATCCCGGGTAAATCGGTGATTGGTTTGGAATTACCAAATAAGTACAGAGAAACAGTATTTCTTTCTGATGTTATGTCGAGCCCGTCATTTACGGATGCTAAGTCCAAAACGTCAATGGTGCTAGGCCATGATATTGCGGGTGAATCTGTGGTTGTTGATTTAGCTAAGATGCCGCATCTATTGGTCGCAGGTACGACAGGCTCGGGTAAGTCGGTTGGTGTGAACGTGATGATCATGAGTTTGCTTTATAAAGCATCGCCTGATGAAGTACGTATGATCATGATCGATCCAAAAATGTTGGAACTTTCGGTGTATGAAGGTATCCCACACTTATTAACCGAAGTTGTTACTGACATGAAAGATGCAGCTAATTCGTTGCGTTGGTGTGTGGGTGAAATGGAACGTCGTTATAAATTATTATCTGCAGTCGGTGTGCGTAATCTCGCTGGTTTTAATAGTAAAATACAGCAAGCGATAGATGCTGGTCAGCCTATCCTAGATCCATTGTGGAAACCGGGTGATAGCATGGATGAGACTGCTCCAGAGCTAACTAAACTGCCCGCCATTGTGGTGATTGTCGATGAGTTTGCTGACATGATGATGATTGTAGGTAAAAAGGTTGAAGAGTTAATTGCCCGCATTGCGCAGAAAGCGCGTGCAGCTGGTATTCACTTAATTTTAGCGACTCAGCGTCCTTCTGTAGATGTGATCACGGGTCTCATTAAAGCCAACATACCAACACGTATCGCCTTCCAAGTATCGAGTAAAATTGACTCGCGTACTATTCTTGATCAAGGCGGCGCAGAAACACTGTTAGGTATGGGTGATATGTTGTACCAACCAGCTGGTACTAGCGTCCCAATTCGTGTGCACGGTGCATTTGTTGATGATCATGAAGTACATCGTGTGGTCGCAGATTGGAAACTACGTGGTAAACCAAATTATATTGATGAAATTTTACATGGTGAAGCGACTGCAGATAGCTTATTGCCAGGTGAAGTTGCAGAAGGTTCATCGGATGTTGATGAGCTATTTGATCAAGCTGTATATCATGTAACAGAAACGCGTCGAGGTTCTGTATCCGGCGTACAACGTAAGTTTAAAATCGGTTATAACCGCGCTGCACGTATCGTTGAAGAGATGGAAGTGCAGGGTATTGTGAGTTCACCAGGACATAACGGTAACCGAGAAGTATTGGCACCACCGCCCCCAAAGGATTAATTTATGAGAAATAAAATGTCACTTAAAAAAATCGTCGCAACAGTTGTGTTAATGACAGCGTCTATGTTGTCGAATGCACAAAGCGTCAAAGAAGAATTACAAAGCCAACTTAGCGCATTAAAACCATTCAGTGCTGACTTTACCCAAACCGTTACGTCAGCGGAAGGTGATAATTTAGCGACCGCTCAAGGACTGATGCAACTACAGCGTCCCAATCAATTTCGTTGGGAAACGACGTCTCCGGATGAGCAACTGATTGTATCGAATGGTGAAAACTTGTGGTTTTATAATCCATTTGTAGAGCAAGTTAGTATTTACTCACTTAAAGATGCGATTGCGAATACACCTTTCATGTTAATTGCGGGGGCGCAACAAACTGCGTGGGAAACGTATCGAGTAAGCAAGAAAGCGGGAGTCTATACGGTGATTACGCCAAATGATCCTGCTGCAGCAGTGTTTACCTTGCAGTTTAAACAAGGTGACATTGCTCAGTTCACAGTACAAGAGCAGCAAGGTCAATATAGCCAGTTTGCTTTAACAAATCATAAGACAATGAACAAGATGGATCCTGCGTTGTTTAATTTTATTATCCCGGAAGATACTGATATAGATGATCAACGCTAATGAATAATACGATGTCTCTTGATTTTAGTCCTGATTTTCAACCGCTAGCCGCCAGAATGCGACCAGAAAAAATGTCGCAGTATATCGGTCAAACGCACATTCTTGGTGAGGGGAAACCCTTACATCGTGCATTATTAGCTGGCCATGCTCATTCGATGATTTTATGGGGACCACCTGGCACAGGTAAAACCACGATTGCCGAGATGATTGCCAACTATTGCGATGCTAAAGTAGAGCGTGTACATGCCGTCACATCGGGTATTAAAGATATTCGCTTGGCGATAGAAAAAGCCAAAGATAATGCTATTCAAGGTTATCGCACCATCTTGTTTGTAGATGAAGTGCACCGTTTTAATAAAAGTCAGCAAGATGCGTTTTTACCGCACATTGAAGATGGCACCATTATTTTTATCGGCGCAACAACTGAAAACCCGTCATTCGAATTAAATAACGCCTTGTTATCACGTGCACGTGTGTATGTGCTAAAAAAATTAACCGTAGATGAAATTGAGCAGGTTGTTGATCAAGCGCTAACGGATGAACGTGGACTGGTGAATGTCAAATTCGACTTTGCACCGGGTGTAAAAGAGAAACTGTGCGATCTTGTTGATGGTGATGCCAGAAAAAGCTTAAATTATCTTGAGCTATTAAGCGACATGCTTGAAGAGGGGCCGCAAGGCAAGTTAGTGACATTGGAAAGACTGATGTCTGTGGTTGGGCGTAAAGTGAATAGCTTTGATAACAAAGGTGAACTTTACTATGACTTAATGTCTGCATTACATAAGTCTATCCGTGGTTCATCACCTGATGCGGCACTATATTGGTTTGCACGTATGCTGGCTGGTGGCTGTGATCCATTGCATGTGGCGCGCCGTTTATTGGCGATTGCCTCTGAAGATATTGGTAATGCCGATCCAAGAGCAATGCAAGTTGCAGTCTCGGCATGGGATTGCTTCAGTCGTATTGGTGCATATGAAGGTGAGCGCGCGATTGCACAAGCGATTGTATACCTGGCATCGGCAGCCAAGAGTAACTCGGTTTATACTGCATTTACTCAAGCTAAGATGATAGTACAAGAACAACCCGATTTTGATGTGCCGATGCACTTGCGTAATGCGCCGACTAAATTGATGTCCGACCTTGGCTATGGTGATGACTATCGTTATGCGCATGATGAAGTAGGGGCTTATGCACCTGGTGAATGTTATATGCCCGAAGCATTACAAGGTACCAAGTTATACCATCCGGTTGATCGCGGTGTTGAAAAGCAAATTAAGCAAAAGCTCGATTACTTACATCAGCTCGATATTAACAGCCCAAGAAAGCGTTATAAATAATACCCGCTCTTACAAATAACAGGTTTGGTCTTATCATAATTGAGACCATCAATACATAATCAGAGTATCAGTCATGAATAGTTTAGCTTTATACGGTTTTGTTGCACTTGGTGGTGCAAGTGGTGCCGTATTACGTACTTTTATCGCCCAGACAGCGACGGCTGTATTAGGCAAAGGTTTTCCGTACGGTACCTTGATTGTTAATATTTTAGGGTCGTTATTAATGGGGGTATTATTTGCCTTACTTGATGACAACTTGATTGCAGAGACACCGTGGCGACAACTTATTGGCTTAGGTTTCTTAGGCGCACTGACGACATTTTCGACATTTTCAATGGATTCATTACTGTTGTTACAAGATGGTCAGTGGTTAAAAGCAGGGCTAAATGTGTCTTTAAACGTATTCGTGTGTATTTTTGTTGCATATTTAGGTATGCGCTTAGTATTTCGAGCATAAAGAGCCGATTTTTAACTATAAGTGCGTGATGAAATTGTAAAACCACGTTAAAATAATAAGCATAAACAATACCAACAGACATTTCTATCTAGAAGCCTGTAATGTGCTTTTAGGTTTAAATTAAACAAGCAGAGTAGGTAAAGAATGTTAGATCCTAAATTTCTTCGCGCAGACATAGAAGAAGCTAAACAGCGTTTAGCAAGCCGTGGTTTTGAACTAGACGTTGAGACGATTACATCGTTAGAAGAGCAGCGTAAAGCATTACAGATCCGTACAGAGCAATTACAGAATGATCGTAACGTACGTTCTAAATCCATTGGTAAAGCTAAAGCAAGTGGTGAAGATATTCAGCCACTATTAGCTGAAGTTGGTAAATTAGGCGAAGAACTTGACGCTGCAAAATTAGAGCTTGCTGAGTTATTAGCTCAAATCGAAGCAATTGCATTATCGTTACCAAACCTACCGCATCCATCAGCACCTATCGGTAAAGATGAAGATGACAACGTAGAGTTAAAGCGTTGGGGTCAACCAAAAGAATTCGATTTTGAAATTAAAGATCACGTTGATGTAGGCGAAGCGCTTAACGGTCTTGATTTTAAAAATGCAGTTAAAATCACGGGTTCACGTTTCATTGTTATGCGTGGCCAAATTGCTCGTATGCACCGTGCTATCGCGCAATTTATGCTGGATACACACACTGAAGAACATGGTTATACTGAGCTGTATGTGCCTTATCTTGTTAACGCTGATTCGTTATACGGTACTGGTCAATTACCTAAATTTGGTGACGATCTATTCCACACTAAACCAGCTACAGAAGAAGGCCAAGGTTTAAGCCTGATCCCGACTGCTGAAGTACCAGTTACAAATACAGCACGTGATACCATCACAGATGAAGCTGATTTACCTGTACGTTTAACAGCGCATACACCTTGTTTCCGTAGTGAAGCGGGCTCATATGGCCGTGATACACGTGGTCTTATCCGTCAACATCAGTTTGATAAAGTAGAGATGGTGCAATTAGCACATCCTGAAAAATCATACGAAGCACTTGAAGAGATGCGTCAGCACGCTGAAAACATCTTAATCAAACTTAACCTTCCGTACCGTGTTGTGACACTATGTACTGGTGATATGGGCTTTGGCGCAACGAAAACATACGATCTAGAAGTATGGGTTCCAGCACAGAAAACATATCGTGAAATCTCTTCAGTATCTAACTGCGAAGATTTCCAAGCGCGTCGTCTACAAGCGCGTACGCGTCTTAAAGACAACAAGAAACCAGTTCTATTGCATACACTAAATGGTTCTGGTTTAGCGGTAGGTCGTACACTAGTGGCAATCCTAGAGAACTACCAGTTAGAAGACGGTCGTGTTGAAGTACCAGCTGTACTACAAGGCTACATGGGCGGTCTAACGCACATTGGCTAACATTAAATAGCCTAATTAACTTAATGGCGTGTCGTCTGATTATTCGCCATTAATGATGAGCTAAATAAAAAGCCTCACTTAGTTATGACTAAGTGAGGCTTTTTTGTTTCTGCTGTCATGAACCTTGTCGTGACGCTTAATCAGGTGATGATTAATAAGTCCGTTTAATTAGCTGGGCAAGATTGACAGCTCAAAGATCTACAGGCATTTAGCTGGCTTGGGTAAGCCTGCAATTTTAGTTGCTTGTTTCGCAGGGCCTTTAGGGAACAGTTTATACAGATAGATACTATTGCCTTTGTCTGCGCCTAGTTTTTTCGCCATTGCTTTTACTAATGCACGAATAGCAGGGGAGGTGTTGTATTCAAGGTAGAAGTTACGTACAAACAATACCACTTCCCAATGCGCAGGACTTAGCGCGATACTTTCTTGTTCAGCAATAATAGGGGCTAGTGCTTCGCTCCAATCAGCTGGGTCCAGTAAGTAACCTTGGGTGTCAGTTGCGATTTCTTGTTCATTGATAATTAACATGGGCATAAACTAATCGTAGAGGGGAAAACAAAAAAGCCTCGTCAAACGAGGCTTTTCACTATTTATAATTACAGTTTAACTGTAATTAGTCGTCGCTACTCATAATACCTAATATTTGCAGTAAGCTAACAAAGATATTATACAGTGATACGAATATTGTTACCGTAGCTGAGATATAGCTTGTTTCGCCACCGTGAATGATCGAGCTTGTTTGCATCAAGATAGCACCTGATGAGAACATGATGAACATTGCACTGATTGCTAAACTAAGTGCTGGGATTTGTAAGAAAATGTTACCGATCATCGCGACGATAAGCACAACAAAACCAGCCATCATCATGCCATTTAGGAAAGACATATCTTTCTTCGTTGTAAGCACATATGCAGACAAACCAAAGAAAGTTAAGGCAGTACCAGCAAAAGCTAATACAACAATGTCACCCATACCTGCACCTATGTACATGTTAATTAATGGGCCAACGGTATAACCTAAGAAACCGGTGAATAGGAATACAAATAAGATACCCATTGAGTTATTACGGTTCTTTTCTGTTAAGTACATTAAACCGTATACACCAACAAGCGTAATGATAATGCCTGGAGCAGGTAAGTTTAGTGTCACTACTGCAGCAGCAATTATCGCTGAAAACGCGAGTGTCATCGCTAATAACATATAGGTATTGCGAAGAACCTTATTAGTCGCGAGTACGCTTTCTTGCGACTTAGTCTGAATCGTATGTTGTTGCATTGTGTCTTTTCCTTAATAATTAACACTGCTAGTTAGAGTATCAGTATACCCTAAAGTTCCGACATAATATGACGATGTAGTTAATTTATAGCCAAATAAGTTAAAAACAAGCCTTTATTATCAAAAATGTGATTAATTTTGTAAAAGGCTTTATCTTTTTTTTAAACTCTGTATTATACGCTCCATTGCTGCGGAGGGGTGGCAGAGTGGCCGAATGCACTGGTCTTGAAAACCAGCAGGGGTTAATAGCTCCTCGAGAGTTCAAATCTCTCCTCCTCCGCCATCTATATAGAAAAAGGCGCTATCTGAAAAGATAGCGCCTTTTTTGCATTTTAGCCTTTTTAATACTACCCATCACGCGTATTACTTCTCAGATAATGAGCATTGCCCACGTTCGGTATTCGCCTTAACCTCATGTTTAACGATACGACTTAACTCATCTGCGCCACCCACATAATGCCCGTCTATCCATATTTGCGGTACCGTTACAGGTTGTTTTGGATCTATCAGTGGCTTTACTCGCGAGAGCATTTCATAGAGGGCGCGGGGACTGGTGATCACGTCGTGATAAGTATAGTCAATGTTGGCTTGTTGTAAGTAACGTTTAGCGCGCTGGCAATGCGGGCAGTTTTGCTTACCAAAAATATGCTGGTCTGTGATAGGTACTTGCTGAGAGTGTGCTTCGATAACGGCTTGGGTTAACATGCCTCGGTTTAATGCGCGACCTTGACTGATAACTTTACCAGCAACCATGATAATCGGCGCATGCCAACCACCTTTTAGTAGCGGCTGCCACCATTCATTTAGCCAATCTTTAATTTCTAGTTCGACGGCAATACCGTCTAATTCGTTTTGCAGGGTATCTAAGACTATGTCTTTAGTGAGCGCACATTCGCCACAAGGAATATTGATATGAAAAGGCCCCCAATCGCCAGCCCAGCGATAGATAGTAATGTTGACTGCTTGTTTATCCAT
Protein sequences of DBSCAN-SWA_1 >NZ_CP044399|360529:369251|365713_367000_+|WP_019440953.1|tRNA|DBSCAN-SWA MLDPKFLRADIEEAKQRLASRGFELDVETITSLEEQRKALQIRTEQLQNDRNVRSKSIGKAKASGEDIQPLLAEVGKLGEELDAAKLELAELLAQIEAIALSLPNLPHPSAPIGKDEDDNVELKRWGQPKEFDFEIKDHVDVGEALNGLDFKNAVKITGSRFIVMRGQIARMHRAIAQFMLDTHTEEHGYTELYVPYLVNADSLYGTGQLPKFGDDLFHTKPATEEGQGLSLIPTAEVPVTNTARDTITDEADLPVRLTAHTPCFRSEAGSYGRDTRGLIRQHQFDKVEMVQLAHPEKSYEALEEMRQHAENILIKLNLPYRVVTLCTGDMGFGATKTYDLEVWVPAQKTYREISSVSNCEDFQARRLQARTRLKDNKKPVLLHTLNGSGLAVGRTLVAILENYQLEDGRVEVPAVLQGYMGGLTHIG >NZ_CP044399|360529:369251|367198_367528_-|WP_019440954.1|DBSCAN-SWA MLIINEQEIATDTQGYLLDPADWSEALAPIIAEQESIALSPAHWEVVLFVRNFYLEYNTSPAIRALVKAMAKKLGADKGNSIYLYKLFPKGPAKQATKIAGLPKPAKCL >NZ_CP044399|360529:369251|368639_369251_-|WP_019440956.1|DBSCAN-SWA MDKQAVNITIYRWAGDWGPFHINIPCGECALTKDIVLDTLQNELDGIAVELEIKDWLNEWWQPLLKGGWHAPIIMVAGKVISQGRALNRGMLTQAVIEAHSQQVPITDQHIFGKQNCPHCQRAKRYLQQANIDYTYHDVITSPRALYEMLSRVKPLIDPKQPVTVPQIWIDGHYVGGADELSRIVKHEVKANTERGQCSLSEK >NZ_CP044399|360529:369251|365185_365569_+|WP_019440952.1|DBSCAN-SWA MNSLALYGFVALGGASGAVLRTFIAQTATAVLGKGFPYGTLIVNILGSLLMGVLFALLDDNLIAETPWRQLIGLGFLGALTTFSTFSMDSLLLLQDGQWLKAGLNVSLNVFVCIFVAYLGMRLVFRA >NZ_CP044399|360529:369251|360529_363151_+|WP_019628835.1|DBSCAN-SWA MNKFNLSIPSKYFAPLSGIQRVFEVGFILSTFIACYAMIALVSFDPADPSWSQTSWQGPVKNAAGSLGAWMGDVLFFTFGLYAYAIPLAFVSLAWFIFWRPRQLDEIDFFTVGLRMIGALLLLIGVCGLASVNFDDLYYFSSGGLIGDVVEQAISELFGILGSTLILLSFVAIGFTLLTGISWLSIVDMLGAGVINSCQYCVDKVTELKNRSASEAEDDTPHDDALSESTVPVPVNMQQMHNHLTSEFDNQKQEDGPVAGREVYEQDDVMNMSFTAVDHVQSRSEPSISTFDVPESGNLHDDYLADYDAQQTHSQVDKPVGNEATIDAEIDPVLAAVAAPEPAPEFKINNNGTDSGFEIVGDQVVSTDPLQFKEKPVTLLPGLELLDKPNKKANPISQAELDHVAHLVEEKLLEFNIKAKVVDVHPGPVITRFELDLAPGIKVSKITALSKDLARSLSAMSVRVVEVIPGKSVIGLELPNKYRETVFLSDVMSSPSFTDAKSKTSMVLGHDIAGESVVVDLAKMPHLLVAGTTGSGKSVGVNVMIMSLLYKASPDEVRMIMIDPKMLELSVYEGIPHLLTEVVTDMKDAANSLRWCVGEMERRYKLLSAVGVRNLAGFNSKIQQAIDAGQPILDPLWKPGDSMDETAPELTKLPAIVVIVDEFADMMMIVGKKVEELIARIAQKARAAGIHLILATQRPSVDVITGLIKANIPTRIAFQVSSKIDSRTILDQGGAETLLGMGDMLYQPAGTSVPIRVHGAFVDDHEVHRVVADWKLRGKPNYIDEILHGEATADSLLPGEVAEGSSDVDELFDQAVYHVTETRRGSVSGVQRKFKIGYNRAARIVEEMEVQGIVSSPGHNGNREVLAPPPPKD >NZ_CP044399|360529:369251|363154_363775_+|WP_019440950.1|DBSCAN-SWA MRNKMSLKKIVATVVLMTASMLSNAQSVKEELQSQLSALKPFSADFTQTVTSAEGDNLATAQGLMQLQRPNQFRWETTSPDEQLIVSNGENLWFYNPFVEQVSIYSLKDAIANTPFMLIAGAQQTAWETYRVSKKAGVYTVITPNDPAAAVFTLQFKQGDIAQFTVQEQQGQYSQFALTNHKTMNKMDPALFNFIIPEDTDIDDQR >NZ_CP044399|360529:369251|367613_368276_-|WP_019440955.1|DBSCAN-SWA MQQHTIQTKSQESVLATNKVLRNTYMLLAMTLAFSAIIAAAVVTLNLPAPGIIITLVGVYGLMYLTEKNRNNSMGILFVFLFTGFLGYTVGPLINMYIGAGMGDIVVLAFAGTALTFFGLSAYVLTTKKDMSFLNGMMMAGFVVLIVAMIGNIFLQIPALSLAISAMFIMFSSGAILMQTSSIIHGGETSYISATVTIFVSLYNIFVSLLQILGIMSSDD >NZ_CP044399|360529:369251|363774_365112_+|WP_019440951.1|DBSCAN-SWA MNNTMSLDFSPDFQPLAARMRPEKMSQYIGQTHILGEGKPLHRALLAGHAHSMILWGPPGTGKTTIAEMIANYCDAKVERVHAVTSGIKDIRLAIEKAKDNAIQGYRTILFVDEVHRFNKSQQDAFLPHIEDGTIIFIGATTENPSFELNNALLSRARVYVLKKLTVDEIEQVVDQALTDERGLVNVKFDFAPGVKEKLCDLVDGDARKSLNYLELLSDMLEEGPQGKLVTLERLMSVVGRKVNSFDNKGELYYDLMSALHKSIRGSSPDAALYWFARMLAGGCDPLHVARRLLAIASEDIGNADPRAMQVAVSAWDCFSRIGAYEGERAIAQAIVYLASAAKSNSVYTAFTQAKMIVQEQPDFDVPMHLRNAPTKLMSDLGYGDDYRYAHDEVGAYAPGECYMPEALQGTKLYHPVDRGVEKQIKQKLDYLHQLDINSPRKRYK |
8 | Mycobacterium_phage(16.67%) | tRNA | NA |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DBSCAN-SWA_2 |
1524700 : 1560905
Sequences of DBSCAN-SWA_2
Nucleotide sequences of DBSCAN-SWA_2 >NZ_CP044399|1524700:1560905|DBSCAN-SWA GTCATAATGTGGACACTGGGTTGTACTTCATGGCCTCATTTAAATAGTCTGGTGCAAGGTGTGAATAGGTCATTGTTTGCATTATTGTTGAGTGTCCTAATATCTTCTGTAGCGTTAATATATTGCCACCGTTCATCATAAAGTGACTCGCAAACGAATGTCGTAACACATGTGAAGCCTGACCTTTAGGTATATCGAAGTTCTGTTTCTTTAATCTGGCTAGGAATACAGTGTAAGACCCATTAAATAATCGACCACTTTTACCATGTGCAATTTCCTTTAAAAGCTCATCAGTGATAGGGACTGTTCTGTTTTTTCCGTTCTTAGTATCACCAAACGTCACTCGCCCGTGCACAATATCACTCGTTTTCAGTTCTGCAGCTTCACTCCAACGCGCACCGGTGGATAAACATAACTTAGCGATATGTAATTCGTCACCACTTAGGTTTGTTAATAATTGCTTTATTTCGGCTTTAGTTAGGAAACCCATTTCAGGCGCACGTACTTTTAATGGTCCCATGCCCTTGAATGGGTGTTCATTATGGAACTCTTCCGCTTTAATCAATGCGGTAAAAACACTACTCAACCGTGTTTGATTGCGGTTGATAGATGCGGGCTTCATACCTCCTGCCATTTTATCAGCACGATATTGCATGAAGCGTTGCCGTGTTAGCTGGTCAGCCCTTAGATTACCTAAGTCAGCATTAATCGTGATTAGGTGTTTTAGTTCTCTAGGACCTGTCTTGAGCTGCTGGCCATGGTAATGATACCAAAGGTGGATTAAATCGATGAGAGGGCGCTTATCAGCGGTTTTCTCAATCCAATCCTTTTGATTTTGAGTTGAAAGCAGCCAGCGCTCGTATTTCTGAGCCTCACCTTTGGTTGTAAACTTCTTTCTGTAGCGTTTACCGTCACGGCCTTGTGGACGGCAATCGACTTCGTAGCCGTTCGTGATCTGTTTAATTGACATTAAATTAGCCTTTTAATACTGGTGTTACATACAGTACTATAGGGGGTGAGTGAGGAGTATTCAATGTTTGTTTGTCGCTGATTAATACACAACAACTATTGTATATGTGTCATGTACAGGTTGTTTTTTCTTCAAAATATGTTGTAAATCAAATATAAATATCTATATTTCTTTAGAACGCCATCATTAGACTGGACTTTCACCTTGTAAATAGTACTTAAATGACTTTCAGGTTATTTGATCGACAAACGGCAAAAAATAAATTTAGTGTTCAGCACTATTAAGTTATTATAGAGCTCTGTAAAGTAGGCCTTAAAATGCCTTAATTGACAAGGCATAGTAATAAATGAACCCTCATTCACTTAATAAAAAAAGAGAACTAGGTGCGTATTATACACCTCCTGAATTGAGCCAAGTTCTTGTCGACTGGGCTATTATTTCACGATTAGAAAATATATTAGAGCCTAGCTTTGGTGGTTGTGGTTTTTTTGAAAGCTGCATTAAAAAACTTGAAAATTTGGGATGTAAAATACCAAGTGAGCAATTGTATGGGGTAGACATAGATCCTCATGCATTTGATATACTTAGCCAAAAATTTGGCGAAAAGGTCTCTTTAGACAAACGGTTTTTACATAGTGATTTTATTTCAGTTTCTCCTGAACAATTTCTGGTGCCTGACTTTGATGTTGTTTTGGGTAATCCTCCCTATGTATCGATGCACAATATGTCTTTAGAGCAAAGGCGTAGATGTGAGAAAACATTTAAAGAATCTCCATTTATAAAAAAAACGATGGGAAGGAATGTAAGCTTATGGGCATTCTTCCTATTACATAGTCTTTCATTTCTGAAAGACGGAGGTAAAGTTGCTTGGGTTCTCCCATCTAGCTTACTGCACACGGATTATTCTAAAAAACTAATTGAAGTGCATAAACAGCATTTTAAAACAGTAAAAATTGCCAAACTAGCGGAGCGTTTTTTTGTTAGTGAAGGTGCTCAAGAAACTTCTATTGTATTAATAGCCGAAGGGTTCAGCAAAACTGCGAGTAATACAGGTTGTTTAGAAATCTCTTCAGTTCATAATATTGAAGAGCTCCATGAATTTATACATGCACCTATTGACCAAAAAAAGCAAAATCATTTTGATAATTATAAATTTTCCTTATTATCAAGTGCTATTAGGGATTCTTATTTTCAGGTTGAACAAAGTCAGGTTTCAAAAAAGTTAATTGATTACCTGAATATAAAGATCGGGATGGTAACTGGTGCAAATAAATTCTTTATTATAAACAGAGATACTATCGAGAAGAATAATCTTGATAGTAGCTATTTAAGACCTATTGTCAGCCGGTTTTCTTGTCTAGTTGGTGTTCGACATAACAAATTAAGACAGAGAGAAAATGAAGATAATAATCTCCGTTCATATCTTTTAAACCCTTGTAATGAGGATATGAAAGAAAAAAATACACCTATACGTAATTACTTAGCTCAAGTTTCAGCTAAGGAACGCAGACGCAATAAAACATTTCCTAAAAGAGCTAATTGGTACACACCTGATGACAATATCTATCCAGATGCATTCTTCTCTTATATGAGTCATTTAGGCCCGCGAATTGTTCTTAACCAAGGGAAAGTAAACTGTACTAACTCAATTCATAAAGTATTTTTTAATGAACGTTTATCACATAGTCGTAAGCTAGCTATATCTATCAGTTTATTATCATCATATTCTCAATTATCTGCTGAAATCGAAGCAAGATCATATAGTTCTGGTGTATTGAAGATTGAGCCTACCGCAGGTAAGAACATCAGAATAATAATGTCAGACGAATGCATTCAAGATTTGGCTAGTAATGTGACTCATATTGAAAATTTATTAATAAAAGATAAACAACCTGAAATGACCATATTTATTGATGACATATTTATTAAGCATGACATTCTGACGAAAGATCAATGTAACCTATTTCGTGAAGGTGTAAGGATACTCAGAAAAGAACGTTATAAAGGGGTTAAGACTTACGATGAGTAATGAAACTTTATTAAAGAAGCAATTATCAAAGCTACTAGAAAAAGATAATGTTGATTATGGAAAAGTACTTTCACTTGCATCTGAAATATCAGAGCATGATCAACATAATGTGAGATTTTCAGTTGATGCCCAAGTTGTCCAAAGACTAGGGGAGCAATTAGTAGCTAAAAAAACAACCGCATTAAGTGAGCTAATCAAGAATTCTTACGATGCTGATGCTACAAAAGTTACTGTGTTATTTGAAGCAACTGAAGACGCAGGCGGTGACATTACTATTCAAGATAATGGTAATGGGATGTCTAAAGCTGACTTAATTAATGGCTTTATGAAAATCAGTACCTCAGATAAAGCTGAGAATCCTAAAAGCCCTGTTTATGATAGACAGCGGGCTGGTAAAAAAGGGATAGGTCGATTTTCAGCTCAAAAGATTGGAAGTAAATTAACCATCATAACTCGATGCTCTTCAAATGAACCTTATTTGGTCGTTAACATTGATTGGGATGATTACAGTGCCAAGAAGTCATTATCTTCAATTAGTAATAATATAATTACTTCTAATGAAAATTTTGGTTTTGAAAAAGGAACTCAGTTAGTTATCTCAGATACGCGTGATATCTGGTCTCAGGAGAATATTCAAACAGCGTATAGATATACATCCTCAGTAGTTAAGAATAAACCTTCAATTAAGAAGTGTGGAATAAAAGATCCTGGTTTTAAAGTTAAGATTAAATCCTTTTTTTCATCCGAGAACACCCCGTCAAATATCGTCGATGACGATACTGAATATTTGCAATATGCAGATTCAATTATCACATCTTGGATTAATGATGACGGTTGTTGTGTCGTTAAAATTAAAGGGCAAAATGGAGTCGATTTATCCGATAAGTTTATTCTTGACTTAAAGGTTTCGGACCTATTAAAAAAAGTGAATTTTAACTTTAGCGCACATTATTTTGCTATTGCAAAGAGTGAACCAGGAAAACATTTTCTGAAAACATATTTACGAAACAATGGTGGGATAAAGCTATTTCGTAATGGATTCTATGTAGCTCCTTACGGCGAGTTACTTAATGATTGGCTTGGCCTCGATGATTCATCTAGACGTAGGGTTATTCTACCTCCACATTCTAATACTAACTTTATAGCGAGTATTAATATTCTAGATAATGAATTATCGATGTTTGAAGAAACATCAGCAAGAGAAGGTTTAATCGAAAGTGATGCTTATAAAGAGCTTTGCAGCTTATCGTATGCGCTAATTGTCGAAGCTGTAAAGCGGATTTCGGCAGTTAGGGGAGTAAAAATTACGGCAAATCAAAAAGATTTTGTACCTAATAAACCGATAGAGACTCAAGTAAAGGATCAGGTTGAGGTTGTTGTCGAGACTCTTAATAATTGGGTAGAGGCTTCAAGCAAGGTTAATGAAGTTGTTACCGATAATGAAGTTGTTACCGATAATGAAGGTGTTACCGATAAGCCTGTAAATGAAAAACCAGCGTTACCAGAAACTGTAATTCCTAAGCTTATTCAAGAGCAGGTTGAAAAATTATCTATTCTTACGAATGAATTAATAGATGAAAAGCATATGTATAAAGTGCTTACTTCAACGGGTCTTGCTATTGCAGAGTTTACTCATGAAATTCAGTTGTATCTTGATGCTTTAACTTTAACAAGCACCCAACTGAAAACTATACCAGATGAATATCCAGAACTTAAAAGAACAGCTGATATTTTAGATGGCAACCTAAGTATGTTAATTGCTTATACTGATTTTTTTGATGATACAATTCGAAGTAACTCAAATCGCGAAAAGTCTTATTATGAAATAAGAGATATTGTAGATAAATTCTTTACCGCAATGGAACCGACATTAGATCGTAGAGGGTATGAGTTAATTAAAACTTACGATGACTGGGGGATTTGGACAAAACAAATCCATATATCAGAGCTGTCATCAGTACTTATGAATTTGTTCACCAACGCTTGTAAAGCAATTAAACGTTCTGGAAATGCAAATGGCCAAATAGGCGTTTTTATATCTTCTACAATGGATGACATCGTAATTAAGTTCGAAGATACAGGTGATGGAATCCCAAGTAAAAATAGGAAGCGTATTTTTAATCCTTTATTTACTACATCGGTCACTGCTGGTGCATTTAGTTCTGACAACCAGAATTTAAGGGGGATGGGACTTGGACTTACAATTAGCAAAGATATTATAAAAGGGTTAGATGGTGAAATTGCCGTCGTAGAGCCTTCTTCAGGTTTTAATACATGTATAGAAGTTATAATTCCAAGAGCACAAGAAACAGAGATCCCAAATAATGCCTATTAATTACTTATATATCGATGATGATGAAACATCTTCTGTAGCACACTATATTAGCGCAATAGAGTCTCATTCTGGTGGTGAACTGAAAATTACTCATGTACGTGTAAGTTCTATGAGTAGTATTAAAAATCAATTTATTGAAGGTGCGTTTGACGGTTTTATTATTGATCAAAAGTTAGATGCCTCGAATGATGAAGGTGAGACCGTTGATTTTTATGGAACAGCTTTAGCACAAAATTTAAGAACTGAAATGATTGTTGCGGATGTTCCTCCGTCACCTATTGTTTTACTCTCAAATGAAGAACCTTTTGTGAAATATTATGATGCTGATGAGTCATCGCATAACCTGTTTGATTATACAGTTAAGAAAAAAGATATTAGAAATAAAAGCATTGCAAAAAATATGTGTTTGACACTTATTGCACTTGTGGAGGCATATAAAACTGTAATTGATTTACCTTCAAACGGAACTACAGAAGATACATTTGCTATTTTAAAACCTGTTTTAGGTTGGACGGAACAAGAAGCTGAGTATGTCGACACCCGCTTTAAAAGTGAATTACTTGATACACGAAAGGATGTACATACATTCGTTGCAATATTATTTTCTTCTTTTATTCGAAGTGCAGGAACATTAGTTACTGAAGAGATGTTAGCAACTAAACTTGGTATCGATATATCTTCGTCTTCTGATTGGAACGCCTTGAAAGACTTGTTTGAAGAATATAAATATAAAGGTTTATTTTCCGAGTTAAAAGATCGTTGGTGGTTTAGTGATATAGAAGATTGGTGGTATGACAACAATGAAGACGGACGTGTATTGCAGGCATTAACTTGTAATGAAAGAGTTAGTGTTATTAAAAATGCAACGACCTTTGAGCATTTGTCTTCTTTGATTGGCAGCTACGAAAATCAGAGCCTTAATTTTTGGGTTAATTGTATTGTTACTGGCAAACCTTTAGACCCTTATGATGCATTAAGAGCAAATGATTCGAAATTAAAAAACTGGGAAGCTCCAATATATTTAGATTTAGTAGCTGTTCTTTCACGTCAAGCGTCAAAAGTTGGGTATAAAGTTCATCCAGAGGATCAAGCTAAAATTAAGCTTTTAACTGCGAGATTAAATCCGGATGTTAAGTAAAGATAATTTATTAACTTACCTCGATGAGTTCGCTGGAGTTTTAGAGCTTGAAGGGATCGTTATAAGTGGTGCTCCTATTAGGATGCTAAAGGCTGGGCTGCAGCGTTCTAAACCACATAAGGGCTTAACTTATAATTTAACGGGATTAGAGTTTAAAAATATATCACCTGAATTATTTAGCCGAACAGGGAAGTGGGATGATGATATTTCTGTACGGCTAGGCATGGACATTCGATTAAATCAGAATGTTGAACAGTTTAAATTTGGCAATACTCATTCTCTAACTGTTAACGTGGAATATTCAGCATTATCCCAGTTGGAAGAGGATAAGTTTTTTGAGTGTAAAGGGGCATGGCATCTTGATTTCCATAACCAGCCAGAGAAAGATGGGGATCCTGAATATATTCACCCTAACTATCACTTCCATCATGGCGGGAAAAAGTTAAGTGAGTTGAATGATTGTGGTCAAATTATCTTATTAGATGCCCCTAGAGTGATGCATCACCCTTTAGATCTATTTTTGTCCGTCGATTTTGTAGTGTCTAACTTTTGTAAGAAAGAAGTATGGAAAAAATTAAGAGCTAATACAACGTATACTAATTATATCAAAATGGCCCAAGATCAATGGTGGAAGCTATATTATCAAGATTTAGCGAACTATTGGAATCATTTTGGTAAAAGTGGTGTGGATGATAGAGCTATCTGTAAATCAGCTAAAGAGGCTAACCCTCATTATGTGTAATATAGATTAGCTCCGAGTAAAGGCTTTTTGTGATATAAAATAAAACCTCCCAAATGGAGGTTTTTTATATCTAAAAGTAAATCACATTTTCTGCCCAGTCCAAACAACTCGCCCAATAAGCGCAAGTTCTTCACCGTCCAATTGTGACTTCGTGATCTCCCAAGGGTCATACATCGTATTATCAGATTTAACGCGGATAATGCCGCCAGGTAACATCTGCAGTCGCTTCACTAATAAGTTATTGTCATGACGCATGACATAAACACCATCAGATAAATTATCCACATCACGATTAACCATGATCATAGAGCCGTTCTTTAATGTCGGCTGCATACTATCGCCGTCAACTAACATCAAAAATACATTATTAGGGCAAACGCCAATTTCATTACGTAACCAGTAAGGTTCAAAACTGATCGTTTTTGGACGCTCTTCAACTTCAGCTAATAAACCAACACCAGCAGATGCAGAGATTTCGTAGAAGGGAAGTTGAATGTGCGCACTGTCTTTTTTATGTGTTTGCACATACACGTTTAAGTCGCTAGGGGATATTTGGCCAACCTCTGGCTCTGTCTTTTCACTGACTAACCACAGTGCATACTTGTGAAAACGGTCATGGTTTACGATTTTGTCTAGCTTTTCTAAGCCAATCGTTTTAATTCTTCCAGTTTCGTATTGAGCAAGTGTATCAGTGCTAATTCCAATTAAATCAGATAGTTGGCTTCTAGTTAGGCCTTCGGCTTCCCGAATGCTCTTTATTTTTAGTCCGACAGTAGTTGACATGTTTATACATAGACCTATATTATGTGTTTACACAGCCTGTGGCAGCAGGTGGTGTTATTAAGTTGTAGTAACCCTAAATAATCCCATAAGGAAGTAATTATGTCGACAACAATTGTGATTCAAATGAACGATGTACCTACCATGGCGGTCGAGTTGTTCGCTGAGAAAACTGGTCAAACGGTCTGCGCTGTTACAAGCCAAATGGACAGCGGTGCATTACCGTTCACGCAACATAAACCCCGTGCAACTCGCCATGTCAATATCGCGAAGTTAACTGTTATGTGTTTGGAATCAAACTCAGATAAACCATGGCTTGCTTAAGAGGTATTTAACATGCTTAAGCGAATCAAATTAAGCAGCAAAGGTAAATGCACAAGTGGTTGTAGTCAGTGTGAAATCAAACACGAATTCATATGCCCAGTAATCATCGTATGTGAAGTTGCCTTTGTAGTGTTCATATTCACCTTCGCTGCCTTGTCAGCGTAGTTTCATCATCGCAAATAGGGATATTTATTCAATGTATGAAATTAATAATAGCAAACAAACAGTAATTGACGCGGCTTGTATCCGTTTTGCAGATATTGAAAACGTAGAAAACATTGCAAGCGCATGTGGTATGCGTGGGCAGATGCTCCGTAATAAGTTAAACCCAAACCAACCACACCAACTAACTGTTAGTGAACTAATCAAGATCACCAAGGAAACCGGTAATCACGACATCATTAACAGCGCGATCTTGGATATTGGCCTAGTTGCTGTTCGTCTACCAAAGCAGGGTAGCGCTAAACCATTAGCATTTAGTGCCATGAGTGTTACTGCTCATACGGGTGAAATAAACCGCCACATCTTAGAGTCTGAAGCAGACAACCGACTGACGCGACATAAAAAAGACGCAATTGTTACCAAGGCTCATTCAACAATCCGTGAATTGGTCTTTCTTATATCAGACGTTGAAAACCGTTGTGGTGGTGCAGGGCCGTTTGTGTCCATGTTCACCGAGGCTGTTTTAAATGGAATGCCAATACCAGGTATGTAACTTAAAAGGATAAATATTATGAATACAGTTGAAAGCGTTATTAATGAAATTATGTTCATCGCTATATCAAGACCAGATGCGATTGATATCACGGTTGAATACAGTGGTGTTAGCGATTCGCTTTCTGTGAAAGTTATGCCACGCGGTTTTGATTACATTAATACCACGACAGAAAGTTATGAAGCCGCAATCCTTTACCGTACTGACGTTTGGTTAAATGAGCCAGGACCAATGCAAACAGCACTTGAAGCTAAAAGCAAGATACTCGAATTAATGGCTACGTCAGTAAATGTTGGGGTGGCAGCATGAAGTTTGTCGCAATAAAAACTTGCCCTAATGGCGGCACAATTCGAGAGCCTAAAACCAACGAGCCTCAAACTGTTGAAATTGGTGATCACGACAGCAAAGCCGATGCAATTGAAAACGCATGTTATTTATTGGATTGCCGCCAACTTTTTAGAGGTGTGCTACGTAGCCTTAAAAATGCGGGTGGTTACATCGTATTAGATATGCAGGAGTACGCCGAGGTATGAACAAATTAACGATTGAAGAAAGACGTCGTGGTTTGGTTAACGTCAAGCGCTTACGTATGGAATTACAATCAGTAATGGATAGTAAGAAAGTTGAGCGCGGCGTGGGCAAGTGTGATTTGAATAGTCGTATCTCATTTGTAAAACTTAAATGATGAATACATTAACCCCCGAGCAAAGGCGTCGTGGCCTCGCTAATGTGAAGAAACTACGTGCTGACCTAGAAGTGATCAGTGAGCGCCAAAACTTTACTAAGCGTAACTTCATTCAACGCATAAAAATAAAAGCAGGTCAGTTACGAACAATGACTACGACTGAATATACAGCAAAATACGGCAAGGATTTTTATGTTCACCCCAATCACTAAAACAAGTAATCCACTTAAAAAACTGCCGTATAACCTGCGTTGTGAACTAGCTAGTTTTGTTGATAACAACGCAAACAAATACAACGTAGAACCAGCCCGCGATCGTGCCTTTGCTTTCGCGCGTTCGGTCGTGTCTAAACTTAACGTTCCTTACGATATTAAAAACTACCTCGATAAAGCCGGTGCGGCACGACTTAAAAAACACGGTCTAAAACGTGCGGTTAAATTTATTGATGATCGTAGTGCACACATTGTTGCTAGTTTAGGTGTATTACCTGAGCCTTGGTACCGTGTAGATACTGAGTATAAACGTGTTCGTCTTGCTGATGAGTTAACAGGACGCGCATGTTTACATTTAGAACTGGCACTAAAAGCAGGGAAGAGGCCACTTGAGGCTCTCGAAGAAATTAACGAGTTCACAGGTTTAGCTTTGTGGATGCCACATTTTGAACCTAAAAAGCGTGATCGTGATGATGACGTTTATTTAGGTCTTATCGCCCGCGCTTGTGACGATGCGGTTTGGCTACGTGCAATCAATCAAAAGGTAACAATAGCTTTTGAATCCGCTCGTCGTGCTGCTGGTATGGTTTCACCACATGTAAGTCCTTACGCCTCATTTACTACTTGTCAGTGGTTAAAAGACCGCAAGAAAAAGCAATTAGATTGGCTTGATAGTATGGCCATTGAAAGTGAATGCGGTGAAACGCTTGAACTTAAAGATGTGCATGACGCATCGGTTTCTAATCCTGCTAATCGCCGTTATGAGTTAATGACTCAACTTAGTGGTTGTCAGGCATACGCAGACTCTCAAGGTCACGTTGGTTTGTTCGTAACAATGACAGCTGCAGGTCGCTATCACCGATTAAAGAAACACGGTAAATACTTTGTCGAAAACCCTAATTGGAATGGAGCCGATCCAATAGCCGCACATGATTGGTTGAAAACATCGTGGCAGCGCTTCCGTGCAGCTGCCGATAGAGCAGGGCTAACTTATTACGGTATGCGAGTTGTTGAGCCACACGTAGACGGCACACCACATTGGCATGGTGCTTTCTTTGTACCTGAAAACCAAGTGGAAGAATTCACTCAGCTATTAACTCAATATCAGCATCAACGTGATAACGATGAGCTCTATACGCTTGATGGCGATCCAAAACTAAAAGCAATGGAAGCCCGCGTTAAAATTGATAAAGTCGATCGCAGTAAAGGTGATGCCGTTGCTTATATTGCTAAGTACATTTCTAAAAACATTGATGCGCATAAGCTTGAAGGTAAAAAAGACTTAGATTCAGAGCTTGTTGATTTGGTTGAAACCGTGACTAACGTTACCGCCTGGTCCCGCGCATTTTGTTTCCGACAATTCCAATTCCAAAAAACACCATCGGTCACCGTATGGCGCGAACTTCGACGCATTGAAAAAGAGCAAGAGTTCTGCCTATTCGAAAAAATACGCCTTGCTGCCGACTGTGGTTGCTTTGCGTCTTACTTCAACTGGATGGGTGGGCACCGCCTTAAACAACGTAACCGACCAATAAAGGTATTGTATGAACGTTCAGAAAATCACTATCAAGAATTAGTTAAAAAGACTGTAGGTCTTACGGGTGTTGGTATCACTGTTTTAACACGCGAAAAGCAGTGGACACTCGTTAAGAAGCCAACACAACCAAAAGACTTAGGCGCACTTGCACTTAGTCTCTTAGGTACTGCTGAAAGCGGCGGGAGCCGTTTTCCTTGGACTAGTGTCAATAACTGTACGCAAGGTGCTATTCCACAGGAAAGCACCTGTGTTCCAGATGTTGAAACCAGTAATCAGACAATAACGATGAATATTGAGTCGGTGATCCCAGTCCAGGGACAACAAATTGAGAAGCCACCTTTGATAGAGAAAGCAAAGTGCGGCTCAAACCAGAATATTATTAAGGAAATATAGATGCGAGTTACTTGTAAGTTTTGTAAGGGTAAAGCACGTATTTCAAGTACCGATAAAGTATCAGTCGAATTCACACGATTGTATTGTCAGTGCTTAGATGCAAAGTGCGGGCACACGTTCGTGATGGATTTAGGGTTTAGCCATACGTTGAATCCGCCATCGAATATTGTTGATCAGTTATTGGTTGACCGAGTTAGGCAAATGCCAGTAGAAATACAGCGAGAACTGTTTGGGTTTACTAGTGGAATATCGAATATAATTTGAGAGTATTATTTATTTTTAAATCAAATAGTTGGACTCAATGATATTAAAAGCTACTATATAGCATTCATATATAAATCAGAGTGTTAAATGGGAAATATCTTTTTCAAAGAAATAACAAAAAAGAAAGGTTTAAAAACACCTACGGATAGACTATTTGATATAGATAAATGGCATGAAGTTTGGACTCGAGTTGAAGGGCTTGCAACCGAACAAGATAAGATGATTTATAATGCCGGAATTATAATTGATGAATTACTCACTGAAATAAGAAGTAATTTGAAATTTATGTATGAAAATGAAATTCCAAAAGTAAGCTATAAAGAACTTTTAATGGCGTATATATCTCTAACTAATAGAGACCGTTCAGTCATTTCAAATTCAATGACTGAATTAAGTGGGGTTCATGCTGATGTGCACAGCATAACAATGAATAATAATATGATGGGTAATGACATCACTATTCAAGAAATGTCAGATGCTGCTGTTGATGGCATGGAGTCGGCGATAAAAACAAGCTTGATGAATAAAAATAAGAAAATAACTTCAAGTGAAAACCCAATTGATATAGTTGATTTCATAATAAAAGAATCTAGTTTGTCTCAACTATATAGTATTTACAGCGGCTATTGGAATGCATTTTTATGGAATGGATATGAGTTAACAGAACTCGATAAAAAAAATAAAATATTCAGTTTTAAACAACCAAAATGCAAGTATGAAATTGGGTTTATGGCTAGTCAGATTAGAAAATCAAAGTTAGTTGCTCACTCTTTTTCTATTATTGGTAATCCTAACGTAATTAATAAATTTGATGATGATAAATATATAATTATCGAAAAAAGAGGGAGAATAAAATCTGTAAAAGTATCTAGCATATCAAACGCACCTGAACAAGTTCGGTACACAAATGCTGACTGGAGGGTTCAGACCCTCGATCTAGAAGATAATCTCCCTAAAGAATTGCTAAATCAAGAATTTGAAAATGGTTTTACTATTTTTGAAGTTTTGGATGTTTTTAGGTGTTTGATACTTTTATCTTCATATTATCATGATAAATACCCTAAAAAAGACGGCTTCGATAATTTGAAGAAATTACTCGAGTTTTGCCCTAAAATTCAAAAATCTAAACTGTCTCTAGCTCTCCAAAATGCGACTGGATACCCGATTGAAAAAATAAAGATAATATTGCAATTCATTGAGTTTAAAGGGGCGCGTGGTGAAGACCTATGGTGCCACCCTATAGTGTCCATATCGGAGTCTGAATACGCCTTAATAACGAGTGCATTATTAACCCCTGTGATGATAAGAGTCGTTGAGCATTGGTTCGTTAAGTTGAATATTACTCTTCAAGGTAAAGGTAAGACATACGAAAAAAATATTATTCATCCATTAAATCATAATATACTTCTAAATTGTTATATAGATGATTACGATAAAGCGATATCGGGAGATGTAAAGAGCGATTCTGGAGTGGAAGATATCGATTTATTGGTAAGGGTGGGTGAAACAATTTTGATCGGTGAAGCTAAATCAATAGTCACTACTGATTCACCAATATCTCAGTATAGAACAATAAACATATTAAAACATGCAGCGAAACAAGCGAAAAGAAAAGCGAAATTCGTTGAAGATAACATTGAATCAGTGTTTAAAAATTTAGGTTGGTTTTTTGATAAAAGTAAAAAGTATACATTTACCAAATGCATACTAAATAGTGGGCGGATGTATGTTGGTTCTGAAATTGATGATGTGCCTGTTTGTGATGAAAAAATTATAAATAAATATTTTGAAGAAAATATTATTCCCCTAATCAGTAGTTATAATGATAAGAATAAGGTAGAACATTTAGCATGGTTTAAACTTTATGATGACTTTCATGAGTTACAAACGAATCTGAATACTTACTTATGTAATCCCCCTCAAATATCTGAACGAAATTTTATATATAACGATCAAAAACTTCCATGTTTGAGTAATGCGTCATATAAAATAATTACTAAAAGATTAATTCCTGATGACTTACGTGCACAAGATCTTCTGAATGCAAAATATGTATTTCCATTAGAAACCGTTGATGATGTCGATGAAAAAATCAAGATGTTCGATTTATCTGTTTAAATTAAAATTTCATTGGAAGTGGAGATAACGAACTTAAGATTAAACTACTTCAAAGCTCTATTAAATTGTAAGAGGTAGTTCATGAGAATATTGAGTGGGGTGAGCATTCATCTGTGATGATGCTAAAAATTCACTCCTCTTCGCCTACCGCGTTTTCGCAATTTTTTTACGTTTTTGACACAACGATGGACACGCTGATATTCCCAAGCCTTATTTATAAAGGATCTTAACGATCATTTATGGATCGCTAATGTCAAAGTTGTGACATTGTTTGCCACTAAATGACAGACCATTCAGGTTTATCCAAGCGATAAACATCAATCCGCCAGAATCAGTTAAGTGATGTTAATTTTATGAAATTTTTAACAAAAGGTTTAGAGCATGAAGAACGAATCAATCTGCTTTTACAGTTAACCAAAATTGGTAGCGAGAATATTAAAAGTGCTTTGGTTGATCATTTAACTAAAGGGTTAACAGAAAATGATGCCGCCATGTTGAATGATGTACCACAGCAGAATTTTAACCGGGCATTGAAGAGATTGAATGATGTGGCTGGTGTTGTAGAAAAAGTTAAAGAGTTAGATTGGAACTGAATGCTTAATATCTGATTTATAATAAGTGCTTAATGAATAGAGCTTGAGAATAGTTTAAGATATTGATTACATAAATGTAATTTTGTATATTTGACTTTCTAATCAACGTTACTTATTACGCACCTCGCTGTTAAAAGGATATTTCATTGTTAAACGAATTTTTAAGTGGCTACCCTTTATTAGCCGGAGTGCTTGCTTTTGTATTAGCATGTATACCGCTATTTAAGTTTGCTCACAGCATAATAATGTCAAAGAGTGTACTTAGAGCTCAAGCCCTAGACTTGGTGTTTAAATCTTACGGTGATACATCGAATTTGAGTCATCGCTTGGTTATCGAGCAAATGTTTCAGAGTAACTTTAATTTAAAATTAGATTACGAAACAATAACCCTGCTTCTCGCACTCCCTAATCCAACGGAAACACTTGATCTTTACAGTAAATCTAAACGTTACTTGGTTTCTAATAATCAAATATTAGAATTTGAAACAAAGTTTAAATGTAGTAAGCGTAGAAAATTAGAACGAGTCTTAAGGCCTTCGAGAAACATTATTTTGTACGGGATATTCGGAACAATTGCAGGCACTGCTGGTTTGTATGTGTTGAAAAATTTTAATATGGATGAATTTTTTACTATTAATGATTCTATTTTTAATGGTGCGATTTGGTCTTTCTTGTTGATTATTTCCATTTTATCAGCAAAATTTGCGTTGATGAGTTTAACGGACAAAATAAATATTCGGGATGCGGAAAAACTAGAAGCTAAGTTTCAACCTAAAACAGTAAAAAAAATATTTGGTACTACTGAAATAGTTAATAATTTACGGAGATCTAATCTTTAACTTAACTTTAATTTATCGAACTCTGCGTAGAGTTCGATAAATGTTAATACCTCCGTTGCTTGATATGGCCGCAAGGGGCTAACTAACCCCTTTAGTATTCTTACTTCCCCAGAACGCCCGAAACAATCGCATTAACCCTAACGTTGAAACCGCAATTCCCACAATCACAAACTCAAAATACCAGGGCGCACCTTTATAACCCATGGCTTGCCAACCATTTGCCATGTACGGTTGCAGCTGTGGTATGAAGTGGCAAATAAACAAACCTAAAAATACCGTAATAATAATTTCATCCATTATCGATTCGCGCCGGTTCTTCAACACCTGAAGATCATAATCAGCATCGTTACCTTCTTGGTTTGCCAGACGTTTAGATTCCGCGTCTAACTTGGCCAATTTAAGGTTGCCTTCTGCTGTCGCAATCGATGCGGCCATCTCCGCAGCAATACGTTTACGCTCGCGATAGCTACCCGATAAATCGGCAATGGGTGCCGAAATAAAACTAAATAACGAAGTAAGTAAACTCATGATTTATTCCTCATAATAATATCTAAAAAGTGTCTCGGGTCTTTTGATACGGCTTTCGCCAGTGCGTTTAGTCCGGTTAAAAAATGCGGCGCCACATACGCCGCAATGCCGATAATCCCTGTTTTCAAACCATCATTCAGGCCAAGCCAAATACAAAAGCTTTCCGCGATGTACGCAGATAGAATGGCCATCAGCACAGACATGAAATAATGAAAAAAAGTAATACGGGTACCAGACATATACATTTGCGTTGCCGCGGCCAATAGCGACAACAAACACAGCTGTCCCCATTGGCGAATAAATACAATCAACTCTTCCATCAATCTTCACTCCCAGGGTTTAAATCTGAATAAGCCGGCACTGAAAATTCAATGTGCAGCGCGGCAGGTAAATATTCGTTAATCTCCATCATGTCTTGCTGCATCGGCACGACTTCATTGTTGTAATATGCGCGGGTGATTTTATCCAGATCACCAAAGCCAGGACTATCACCCGATGTTTGTCCGCTTAACGCTTCCTGGGCGCGGTGCATACTCAACATATCGTTTAACGTCATCTTCTTGATGCGCTCAAATTCATCCTTGGTCGAGATATCTCCCACCGGAATAATCTTGATTGCCTTCTCTGCGTCAGCCTTACCACTGCGATTATTAATAAACAGACTGCGAAAGTTACCCACGCCGCGAGAATCTTTGATCGCTGTTTTCAAATCAGCTTCATCATCGGTCGATAAATTGGGATCTGCCATCGAAAAGATAAACCCCATGTGGGCGCCGTTCTTGTAATACTTACGGCGAAAGAGGGTGGCATCTTCATTCAATAACGCCGATTGAATACCGCCATAATATTGCGGGATACCGTAGATACCTTGATTCGGGTCGTATTCTTTTAAGTGAATTACTTCGCCTTTTTTAAATCGCAGCACTTTGCCATTACTTAAGCGCTGGGCATAAACACCTGGCGTGGAGGTATAGCGCATCGATAACGCTGGTAAATGGCGTAGTTTAATCACATTACCGAAGGCATTTTTAATCACCTGCAGGTAAGCATTCGCCGCCCAGCAATAATCAAAGGCAAACTTTTTAAAGGTGCGCTGCGTTAATGCCATATTAGGTTGAAACCATTTCATGATCATATTGCGTTTAAAATATAAAATGGGTCCGTGCTGCGCATTCACGCGCAGTAATTTAACTAGCCCAGGCAAACTTACGGGCGGTGAATATAACCCGTCCATGTCAGCGTAAAGGCCGACATATTCAGTCATGTGGTTATCTAAACATGGCTCCGGGTCGCCAAAGCTAAAGGTGTCGATAGATTTACTTTTCACCGGTTCAGCGGTCGTACTCTTAGTTAAATTCATTATGCTGCATCTAATCCTATTGATGTTCTGGTACTTGAGCAGTCACCAGATAATGGTTCGTAAATCATGGCGTGCATGATTGCCCAGGCTATATCAGCATGGCCTGTGGCGGCGGTGCGGTTGGTTGCATAACTGATTTGGTCACCAACGACCTTTTTGCGAATATTAATAAAGCTACTGGCAACCATCACTGAGCCCTCATCAAATTCAAAACGGCCCTTACCAATCACGTTTAAGGCTTTAATGACCATCTTGTTTTTGTTGTGCGGGTTGTAATGAATTGGCATCGCCAACGGGAAGAACTTTTGTATTAACTCAAACACACCTAAGCCCATGCCTGTGGTATCAACACCGATATGCACAACATGGTATTTAAGCGTGAGCTCTTTGATTTCACTGGCCATGGTTTCAAAGTCGTTACCACTTAAATTAAGTGATTCTAGTAAGCGAAACTTATCATCAGGACCAAGCGGTAAACTCAAGACCACAACGGATGCAATATCGCGCGTTCGTGCTGGGTCAAAACCAATAACCACCGGCTTCATGGCATAAGGGCGTGACCAAGTGGGATCGAAGTCAGGCCATTTTTTGCTATTACCGACGCATGCCATTAGCTGTTTAAGGCTAAATGCACTGTGGGCATCATCAATGAACTTGCACATAAAGAGGTTGTTGAACTCTTCTGTCGAATATTCATTTTCCAAAATGCTAATATCAATGCGGTCAAAGCCTTGTTTCACTACATCGTAAACATTGAGCTTTTGACGCCAAATACCGTCTTCACAAAGTCGACCGTCCTTTAAGGTCTTGTGGCTAACATCGATAGCAAACTCGGGATCATTACAGGCTTTAGTTTTTCGGTACCAGCGACCATTCCACAAATCATAGGCTTCATGACTGGTCACCGATGGCGTACTGAAATAGGTAATACGAAAATCTTTATGGGTGGCCATTGCCTGGGCAAGGCCGCGTAACTCTTTAAACTTCGGGATCCAAAACACTTCATCAATATACAAATCACCGGATGCCGATTGTGCAGTCCGGGCATTGGTTGATTTGAAATATAAGGTTGTTGTCTTGCCTTTGTTACGCATCGTCAGTGGTGAGCCGCTGAGTTCAATACCAAACTGTTCACGACATAAAGCAATAATATTGGCTTTGAATATCTCCGCCTGGTCCCGCGATGCTGAAATAAAGATCTTGTTACGACCATTCACAATGGCATCGTAAAAGGCTTCAAAAGCAAAGTAGAAAGTCGCGCCAATCTGACGCGGCTTTAATATGAACCGGCTACGGTAATCTTGATGTTCAAACCAATGTAATTGGTGCGGGTAGAGCAGGTTGTCTTTGAGTGTATCGAGCATTTCTTTGGTGATACTGGACACATCATTTTTAATCTTTTTCTGACGTTTCTTATTTTTACTGGTGCGCTCGCTTGTTTGTTCATTTTGAGCCGTCGCAGCTGCAGGCGTATTGCCATATTTTTTCGTGATGCCGGCACTGGGTAAGCGAGATTGATTTAACGCGCATTGCTGTTTGGTCAGAAAATCCAGTTCTTTATAATCTGCAGCGCTTTTATTTTCACGGTCTGCCAATAACACAATGCGTCGTGCAATCGCTGTCTCGGCATTTAATGACGGGCACAGTTCATTCCAGCTGCCATCATCCGCCCAACGTCGTAACGATCGTGCGCTTGGCATACTGTCCATTTCTGCAATTTCATCAAACGTCAGCCCACCAAACACATAATGGTCACGCGCTGTTTTAATGATTTCGGGTGTATATCGGGGAGTCCTCGGTTTCATATCTGGCCTGTTGATAAGAACAAGCGCCAAGTTTATAACCCTAAAACCGGTAATTCTTTAAGAAGAATTCCGCGTTATTCCGCTTTCGTCAAAATCGGAATAACGCGGATTTAATGTGATGGATTAAGGCCTGTTCAACGGGCAAAATTGGTTTCACTTAGTTCCACTCAGACATTAAACAGGAAACAACATGGCTCAATTACGCACTATTCCACTTGCCATTGCCGCCATGGGGTTAACGGTAGATGGTCGCGAAATATCAGAAAAAGATATCGACGATATTGTTGCTACTTACAACTATAAAAAGTACGGCGCACGCATCAACTTAGACCACGAATTTAATTGGTCAGGCTGGGCAGCGAAGAATCTTTTAAACGTTGAAATTAACGGTGGCATGCTAGGTGATGTAATTGAGCTGAGCACGGCTAAAAATGAAGACGGCATCAAAGTGTTATACGCGGTGTTATCGCCTAATGCGTCATTTGTACAGTTGAACCAAGCCGACCAGGCTGTGTATTTCAGTATTGAAATTAACCGTGATTTTATGAAGTCAGGGCAAACCTATTTAACGGGGCTGGCTGTAACGGATTATCCGGCGAGTACCTACACCGACCGTATTCACTTTAGTCAAAATGACAACGCTGATAATACGCATGCAGCGGACACTAATGTAAATAGCACACCCTCAGATACTGACCTATTAAAGGTGTCATTAGCATTAGAAGAAGCGGCCAAACCAACCAAGAGCTTGTTTAAAAAACTCTTTAATTTTAATAAGGATGATGACGACATGAAACGCGAAGATTTTGCTGCTGCATTGACTGATTCGCTCGGCGGCCCGTTGTTGCAATTTAGCCAAGCGCTGGAAGCCAATACCCAAGCAACGCAAGCCTTACTGGCTAAACAGGGCACGACCACTGCACCGGTTGACGATGTGAATACAGATACTCAGGAAAATACCGGTGCTGATAAACCAACAGCTGAGTTCTCGGCGATTGACGGCAAGGTTAATGGTTTAGTTGAGCAAATTGAAGCGCTGACTAAAACCGTGGCTGATGCAATCAAAGACCCTGCACCAACGACCACTGACGGTGAAGAAGAGCACCTTGGTGAAAATGCCAAATACCACAACCTTCTTTAGTAACAGGAACCAAATTAATGAAGCTTAAAACAACACAGATTTTTGCCGCGGTTGTGTCTGCATTGGCTGTTAATTACGGGGTTGCCTCCGTATCAGAACAGTTCAGTGTCGAGCCGTCGATTGAACAGACACTGTATGACCAGGTATATCAAAGTGCGGAATTCTTACAGCGCATTGATACGCAAATGGTCGATGATTTAGTCGGCAGTTCAATTACAGCTGGTATCAGTGGCGGTGTGACTGGTCGTGCTGGTGTAGAAGATGACGATAGTAAAAGTCGTAGCACCAAAGACCCACTGGGTTTAACCGACCGTGAATACCGTTGTTATCCGGTTGAATGTGATACTCATATTACCTGGCAACGCATGGATATGTGGGCGAAGTTTCCCGATTTCCACAGCCGTTTTCGCGCCCATGTTCGCCAAGCCATTGCACTGGATATCATCAAAATTGGTTGGAACGGCACAAGCGCTGCGAAAGTAACGGATATCGCGACATACCCAATGATGCAAGATGTGAACATCGGCTGGCTTGCGTTAGTTCGCCGTGACAATGCCGCCAATGTATTTGCTGACGGTGAACAAAAAGACGGTGAAATTCGTATCGGTTCGGGCGGTGATTATGAAAACCTTGATCAAGCCGTGCATGATTTACTACAAGCGATCCCAGCGCATAAACGCATTGGTCTAGTCGCCATCATTGGTGATGAGCTGTTATCTAAAGAAAAGAACAAGCTCTATGCCAAGCAAGCGCATACCCCCAGTGAAAAAGACAAAATCGAGCTAGAGCAGATCATTGAAACATTCGGTGGTTTGAAGGCGTATAAGGTCCCGTTCTTTCCTGAGCGGGGCATCTTAATCACCTCGTTCGATAACCTTTGTCATTACGTGCAATCAGGTTCAACGCGCACCTCGATTGAAAATAATGCCAAGAAGAAACGTGTCGAAGATTATCAATCACGTAACGATTGTTACTACATCAACGACATGGAAAAGATCGCCTTCTTTGAAGCGGATAGCGTGAAGCTGGATAAAGTGAAAGACCCTGCAGTCGCGGTCGGTGACTTTGATGCGGATAACCCTGACCACTGGGTTTGGTCTTAGCCGATTTCAAATAAATAGATCAAATGAAGCACTGGCTTATCACGGTAAGTCAGTTATTTCCCAAGCAGAGAGTTTTTTGAAATGAGTATAGTCAAACGAAATCAACGTAAAACCCATGCATCTATTGTCGGTGTTGATCTGGCAAGCGGTCCTGATCAATCTGTCACAGTTACAGCAAAGCATGGCAAGGTTGTCACGGCCGAACCAAACCGTGCCCAAGACATTATGGATGAGTTTGATTTTTTCAAAGCGGCGATGGATTCCGACCTTGCTCAACTTAAAAAGTTTTCTCATATCGAAGACAAAATTGCATATAAAGCGCAGGCAATTGAGAACCATCAATACCTGGATTATTTACGTCGCTATCAAGCACAAGGAACCAACCATCAGAACATGGTACTGGCTTGGGTGGTGATTTGGCTTGTTGACCTTGGCCACTGGAAAACAGCCTTTGAATTCTTGCCTTTGTTAGTGACGCAAAACCAACGTTTACCAGGGCGCTTTAGTACCCAAGATTGGCCGACATTTTTAATCGACCAGCTTTATGACGAAGGGGCTAAACACCTAAGCAAAGGCCGTGATGCCGTAGAGCGTAGCCAAGTGATTAACTTATTCACCCAATTTATCCAGCTGTTGGAAACTCATCAATGGCGGTTAAGTGAATTGATTGGCGGCAAGCTATATGCCATGGCTGCAAAGTTAGAACAAAGCGTATTTAACTTAGGCAATGCCTATACCTACGGCACCAAAGCAACCGAGTTAAATGACAAAGCGGGCGTTAAAAAGATGGTCAGAGAAATCGCCAAAACCATTGGTAAAGACAACGAATAGTAATGACTCTCGCGCCACCGGCTCGGCTGCGTTGGACTTATTCACAGTAGTGAAATGGGATCCTTATCGCAGTGGCCAGAGCCGACCTATTAGAAGTGAGTAACGTGATGAATTTAACCGGTATGCCACTGGCACAAGTAAGCAATGAAAATGTAGTTAACAACGGCTTTTATCCAGACCTTGGCACGGCTGAATTTATCACTGATTACTCGATCGCTACGGAATACGCCAACAACAGTGAACAGGTTAAACGAACATTAGTGCTCGCTATGCTCGATGTTAACAAGGCATTAGCAAAATACCGCTTACGCCATTGGCAACAGGTCGAACAGTTACAAGATGTGAACCTTGATGAAATTAATGGTGTGAATACATTAATCCTTACCTATCAACGAGCGGTGTATTGTCGCGCTAAAGCAGCGTTGTTAATTGGCCGTTTAGGTGAGACGCACCGTGACCAACGGGCAGCACAGCAAGTGATGGCCAGTGATAATCAAGAATACTGGTTAGCAGAAAGTGACATGGCCTTACGTCAAATCATGCAGCTAACGCGTTCAGGTGTTGAATTGTTATGAGCCAAAGCAAATTACAACGTTTAGTCCAGTACCTGGTATCGGCCACTTACAAAGGCCGCAATTTAGCCAGGGCGGGTGAGTTTGATAGTTGGATTGAAGGCGGGCGCATTGAACACGCCAGCAAACGAATAAACGGAACAGGGCTACTCGCTGCACGTTTTTTTTATAGTGGCGTGATCAGCATTAATCCATGCAATGCCCCTGTTGAATTGATCGCAACCTATGTGAGTTTTTGGTTAATGACAAATGCAGAAAAAGACGACAGCCATGATGTGGAATTTAGCCTGGATATTAATGATGACAACAGTGCTGAAATCGAGTTGACGATCGAGCGGTTCGCAGAAGATGTGATGTTGGTTGAAGACATGAATGGACCCTTTGAATTGGAATTTCAAGGTGAAATGAAACGCTTTGATTTTGGTGAGCAGAGTCTTTGGATAGCGGCGTCATTCGATTTAGACGCAGAATTTATGGCCCAGTAGTCGAGCATGCTGAATCTTGATATCCCTACGGATAGCGCGTTAAAGCAATTAGATATGTTAACGCTCGATGCCAATAAGCGCCGGCGTATTTTACGCGGTGCAGGTCGGCAAGTCAGGCGGGATACTAAAACGCGATTAAAAGGCCAGAAAGGCTTATCGGGTACCAACTGGCAAGGTCGCTCTGATGGTCGTAAAAAACGGATGTTAAAGAAGTTGGGTAAAGGTATTCAGGTACACACCACACCCAATAACGCGACAGTTACCTTTGGTAATAAGCGCTTAGGGCAAATAGCCAGAGCGCATCAAGAGGGGATCACCTCTACGCAGACCGCGCAACAGGCAGCAAAAACCAATGGCACCCCTGAATACAGGGCGAAGGCGTCACGCAGACAAGCGAAATCACTGCGAGACAATGGTTATAAAGTCAGAAAGAAACGCGGTAAAGGTTGGAAATCACCGTCGCTTAAATGGATCACTGAGAATATATCCGTCGGACAAGCGGGGCTGATACTGCGTATTTTACGCGGTAATAAAAAAGCTAAATCCAGTTGGGACGTTAAGTTACCAGCCCGTTCGTTCTTAGGGCAAAACAGTAATGAGCAAACAGAATTAAAAAACTACATGTTGGACGAGGCATTTCGCCTCCGCTAAAAAGGTAAATATATGGCACAGGGCAAGGTTTTAGTTACCGCATTAAATACGGGTAGCGGAGCAACAAAAGAAGTGGAACGTTCAGCATTATTTATCGGTGTTGGTGCGTTGAACATAAACAAGATAGTGCCGCTTAATGCGCAATCAGACCTTGATGCATTGATCTCTGCAGCTGATTCAGCACTGAAAACGCAATTAACGGCTTGGATGCGTAATGGTGATGCACTCGTTTCTGGTTGGGCGATCCCTATCAATGCCGGTGATGACGAATTTGCGCTTATCGATAAAGCCATGGACCAAAACATCAGCCCTGAAATTATCGTGATCACCACTCCGGTGACGGGTAAGGCACAAGTTGAAGCGTACCAAGCCAAAGCGCTGGATATCTTATCCAAGTATGCACGTCGTGTGCGTTTTCTGGTATCAGCACCTGGCTTGAGTGATAACCAAACATGGACTGAACATCTAGCCGCGATCACCCCATTAACGGATGGCGTTGTTGCTGCGCGAGTTGCGGTGATTCCTGCGCTTTATGGTGATGAACTTGGCGCTGTCACAGGGCGACTTTGTAAACGTTCTGTCACGGTTGCCGATTCACCCATGCGCGTGCAAACAGGGTCGATGGCATTGCAACCGACACCGCACGACAGTTCAGGGCAACCGATTACCAATGCCGTTACAGCGGCGCTTGATGCGATCCGTTTTAGTTGTGTGCAGTTTTATCCTGACTTTGATGGCATTTATTTTGGTGACGTCAATATGCTCGATGCCGAAGGTGGTGATTACCAACAAATTGAAGCCGGTCGTATCGTCGATAAAGCCGCGCGACAAATTCGCATTATTGCCATTTATCAAATTAAAAACCGCCGACTGAATAACAGTTCAACAGGCGTTGCATTTGGTAAGCGTGTACTGGGTAAACCACTGCGTGATATGTCAAAAAGTATCAATATCGGCGCGGATAAATTTCCAGGTGAAATACGTGAGCCGAAAGATGATTCAATCACCTTAACCTTCATGAACGCCCGACAATTGCGCGTCACCGTAAAAATACAGCCGATTGACTCACCAAGTGAAATCCTTGTCGGCATCATGTTAGACAAAGACGAATAAGGAACAACACATGTCAGTAAAAGCATTAGGCGGTAAAGACTTTGATATTTTCATTGGTGACAAAATGGTGCATGTCATTGAAGCCAGCGTCAAGATCACCGATGGCCGTAAAGCCAAAAAAGTACGCGGCATTACCAAGGGTTACATTGATGGACCAGTTGACGCCGAAGTGACCATTAAATTAGACCATGAAAACTTTCTTATTGTGCAGGATGTGGCGAAAACAGCTGGCAGCTGGAAAGGCATCGAACCATTTGATGTGTCGTTCTTAGCTGAAGTTGCCGCTGGTACCAAGAACATTGAAGCTTTTGGTGTGTTGCCTCAATTAGACGAGATCTTGAATATCAAAGCGGAAGGTGGCGAAGAAGATACCACCACAATCAAAGGTCCGGTGACGTCGACTGATTTTATCAAAATCAATGGCATTCCTTATCTGACTGCAGAAGAAGTGAGAGATCTGTAATGGCCGCAATCAAAAACCTCACGGCCGCTAGTTTACTGGTGGCGATGAAAGCTAAGCAACACATCGTGTTTGAAGGTGAATTAAACCTGAACATCATTGGTATTCGCAACACAGATACCAAGGCCAACAGTTTTAACGACCTGCTTTGTGTCCTGTATCAGCAAGACGAAAAGTGGCAGCTGGAAACCTTTAAATGTACCACGGACGCAGGCACGTATTACCGTGAAACGCCTTGTAATGTCGATGGCACCGCCGTATTAGCAGCAATGCAGCACCGCAGTTTATGGACCTTTGGTTATCACAAAGGTCAATATCCTGCGCTAGTGCAGCATAAACCGGTTGCGGTGTTCAGAGATAATAACAACGACAATCAAGTAGATTGTGATAGCGCGCTACAACGTGGTTATTTCGGTATTAATTGCCATCGCGCCAGTGCAAATCATGAGTCAAAGCAAGTCGACAAGTGGTCTGCAGGTTGCCAGGTATTAGCCAACCCGAATGATTTTAATAAGCTGATGGCGCTTTGCCATCAAAGCAGTCAGCAATGGGGCAAGACGTTTACGTATTCCTTATTAAACCAAGCTGATTTAAACCCAACGAAAGAGCGAGCATAACCATGGCATTAGAACAAAAGATTGTCTTAGAAGTGAATGACATTGAACTTAGTTTCAATGTGAATGTAACGGCGTACAACAAGTTTCTAAACCAAAGTAATCAGGTGAATAAAATTCAACCGGCGACGAACTTCTTAATGACCGTGGTTGAACCTGAATGTAAATCCGCATTAAAAGAAATGCTCGCGATGCCAGGCGCAGCACTGCATTTAGTCGGCAGTGTGGTTGAAGAGTATCAACCGGAATTTAATATCACCGTAAAAAAATAGAACAGCGAGCAAAAGAAATAAGCAAGAACCGCTTAGATCAATTGCTCGCCTATCAGCAAAAATGGTTGCCGCATAGCGCCGCCACCGAAGACAGTTTAGCCCAGGCATTGTTTTTGGAAAATGATAATCAAGAAAAGCAGCAAATAGCCATCAACAATGGTATTTGCATGGCGTTGGACAGCGGTTAAGTTGCAGGTTAGGGGGTACGGATGAGCGCATTAAGTAAGTTGGAAAAACTCATGTACACCATTGGTGTTGTTGATAAAGCAACGGGACCAGTGAATAAGATCATGGATAAGATAAACCAACTTAGCAGCCAAACCGCCAGTGCTCAAAACCAAATGATGAGTGGTTTTATGGGTACTGCAGGCGGTGCCATTGCACTTGTTAGTAGTTTGTCACCGGCCATTGATCATGTTGCTGCACTTGGTGAAGTGCAAACACTGGGTGTGGCGAATGAAGACTTAACCAAACTAACCAAAACCGCCTTTGAGTTTACCACTCAGTTTGGCGGTAACTCGGCAGAGTTCGTGCGCAGTGCTTACGATATTCAATCTGCGATATCAGGGCTCACCGGTGATGAACTTGCATCCTTTACCAAAGCATCGAATGTATTAGCGGTTGCGACGAAGGCAGATGCCAGCACGATCACCAGTTACATGGGCACCATGTACGGGATATTTGAAAAAAATGCTAACAAGATGGGCAAGTCTGATTGGGTTAACCAGATTGCAGGGCAAACCGCGAAAGCTGTGCAGATGTATAAAACCACCGGCGCAGAAATGCAATCCGCCTTTAGTGCGTTAGGGGCCAAAGCGGCTAATCGTGGCATTGATGCTGCAGAGCAATTTGCTGTGCTGGGCGAACTGCAGCTGGTCCTAAAGTCAGGTTCGGTAGCGGGTACTCAATACGCAGCCTTCATTGACGGTATTGGTAAAGCGCAAAAAGAATTAGGTATCGAACTGACTAACAGCCAGGGCGATATGCTCGGTATTGATGTGGTCATGTCGCGCATTAACCAAAAGCTGGCTGGCGTTGGCAGTGTTGCAAGAGGCGATATTTTAAATAAAGCCTTTGGGTCTAAAAATGCCGCTTCTGTGGTTGATATTTTAAGTTCAAAAACGGCCAAGTTAAAACAGGGCATTAATGAGCTGACCAATGTCACCGATGCATCCAAAGCAAGTGAAATGGCCAATATCATTGCCAGTCCCTGGGACAGATTTAGCGGCTCATTAAATGGCGCTGCAACGGCGATGGGTCAAGCGGTATTACCGATTATTGAACCGGTTGTCGATATGTTGGTTGCTATGTTGGGTGGTGTGATCTGGCTAACGCAGGAGTTCCCAACCTTAACGGGTGTACTTGGTGCGGTAGTTGTTGGCGTGGTCGCCTTAATGATGGCGTTTAGTGCCATGAATATGATCATCGGTATTTATCGTTTTGCATTGATTGGACTGAGTTTGGTAAGCAATGCCGCTGCAGTCTCAACAAAGCTATGGCAGATAGGACTAGTTGCACTGCGCGTGATGGGGTTCTTAGGCAATATTGCTGCGATTGGCGCTTATCTTACAGCTATCGCGCTTTATCGTGGTGCGATGTTAGCGGCGCAGGGCGTAACTTGGTTATTTAATACCGCCTTACTTGCCAATCCGATTGGGTTAGTTATTGCTGGGGTTGTGGCGTTAGTTGCTGCAGTGGCGGGGCTTATTTTTTATTGGGATAGCATTGTTACTGCCTTTAAAGATACTTCTTGGGGCCAAGCGCTGATTGGCATATTTGATAGTGTTATGTCGGTGTTTAGTGGTCTGATTGACAACGTGAAGTGGGTACTCGAAGCACTGGGGCTGATGGACGGTAAAGAAATGACCGTTAATTCAAAAGTGGAAGAGGTGACCAAAACAGCGGCGCCAATTTCGGCTGAGAACACATTATCAACGCCGCTTAACCCGACAGTATCAAACGGGCAGCAAGTCGCGAGCACTGAGGTCTTTGCAGCGAACAATGTCGCTAACATGAATATGCAGCACAATCAGGTCGGTGGTTTAACGCCAGTTGTTGCAAACAACCTCGCTAATATGAATACGCGTATTAGTCAGGTTAATGGTTCAGCGTCGGTTAATACAAACCAAGTGTTAAACACTGAGGCATTCGCAGCAAACAACTTATCGAATGCAAATACAGGGATTAATCAAGTTAGTAGTTTTGCGCCAGTTGCAGCAAATCAGGCATTCAATACTGAGGCGTTAGCAACGAGCAATCTTGCTAATACAAATACGCAGCTTAATCAGGTGAGTAGTTTCACGCCGGCTGCTGCAAATCAGGTATTTAATAGTGAGACGTTAGCAACGAACCACCTAGCTAATACCAATACGCAGTTCAATCAGGTTAGCGGTTTATCGCCCGTTGCAACTTCGCTTGTTGAAAGCCAAACGTTTAATGAAACAAGTGCGCACAGCGCGCGTATTCAACAATATCAAGAAAATAACCAGCAGCAGGCGAAGGTAAGTCGACCACGTATACAGCGAACGCAGTACTTTCAGCAAAGCAAGCAGAGCGGAAATAATCAAAGCAGTAGCAGCGCCGATAATAGTAAGCGCGTGTATATCGATAACGTGGTGATGAAAAGTGACAACCTCGCACATGATTTTGAACAGTTAATGGAGTTAGCCGGCTAATGATAAATACGCATGTTGATTTAAATATTGTCGATGGTGACTTTGTGTTTAACCCGTCTTTAAGCGTTGAAAAACTCTCTGCAGCCAAGGTGATTGGGCAGGACGTTAAACACCGCATTCTTGAAAGTGGCTTACTGGTTAAGCTAGTGAAGCAGCGCAACAGAAACGGTATTGCGCCGGTGTTAACGGACTTAGAACTGGAAGTTGAACAAGATGACAGACTTAAGCCTGGCACGATTTTAATTACTTATAACAGTGATAACACCTTGTCGATTGAAGCTGAAACAAAGCAATACGGCTTAATGAAGTGGGCTGGGGCGTAACAGGTGAATAGTATGGGCAATGAAAATAGAACACCGGATATCGTTCCTGACTTTAAAAAAATGATGGCTGATGCGGGTCTACCAGTGAATGAAACGGTTGCTAAGCAGCAGTGGGACCAGGTACTCAGCGATCAGCAAATCACTATTGAAAATGGCAGTCCGTTTAGTCCCTTTTGGCGCACGGTTAAAGCGCTTATCACGCAGCCGGTTGTCAGTTTACTTGATTGGATTTCCCGTATATTGATGCCCGATTTATTCATCATGACGGCGAGTCGCAGCGCATTAATTGGTTTACATGGCCCGAGTCGTAATGTCTTTGTGATGGATGCCATTAAAGCCAAAGGCATGCTGCGATTAACGCGGGTTAATCCGGATGGCGTGTTGAGTATTACGGCCGGTGCCTTGGTCGAAAGTGACAGCATTGGCGGCGCGGTATATCAGTTGCGCACACTCAGCGCAGCAGTATTCCAAGAAGGTGAATCCGTTATTGAAGTGCTGATTGAAGCTACTTCACCAGGGCAAGCCTATAACTTACCGGTAGGCAGTTATTACCGCCTAGTTAACCCTATTGAAGGTGTGACAGTTCGTAATGAAAAAGATTGGTTGCTTATCCCTGGTGCGAATGAAGAAAGCACAGAGGCATACCGAAACCGGATAAGAAACGTATTCGGCACGGCCGCTAAATGGCACATCAATACTGTGTATAAATCGATAATCAGTGACTTTGCTATACCGGTCGAGAATATTGAAATTGTAACCCAAGCGCCGCGCGGCCCTGGTACGGCGAATGCGTTTATTTACCTTAATGTAGGTCAGGTATCAACGGGGTTATTAAGGGCAATTAACCAACATATTCGTGATGACGGCCATCATGGCCATGGTGATGACTTTCTGGTGTATGCCATGCCGACGCAAGACAAAATGATAACGGCCACGTATTCACTACATGCTAATAGCGCTGATATTAAAGCGGATATAAAGACATTTATCCAGGCGGCGTTTCGGTTAAACGATGCGTATCAGCCAATGCGAACGCGCCCGAATTCGTTGTTTAGCATGAGCCTGTTACAAGCGCAGTTACATAATCAATTTCCTGCACTACGCAGTATCGATTTTGATTGTGGTGATATTAAAACGGGCTTGTGGTTACCCAAACTCACGCAGTTGGTTGTGCAACATGGATAAACAGCCACTACAACAAGCACAGCCCGAAATCGCGACTTGGTTAAACAAAGGCCATGCCGAGCAGTTAATGAAAGCCGCGCAACAATATTGGACCAATACCAAAGACTGGGTGATGTGGGCAGTTGCGCAAAAAGATGAACAGCAAAGTGAAGAACCCTTTTTAGGTCTATTAGCCTGGGAACGTTTGACTGAAAGGCTGCCTTTTGAGCCCGCTGACTTTTTTAGAAAACGGGTCCAGCATGCGCTTGTTAATACCATTGATGCCGGCGAAATTGCGACGATTGAAGCTATCTTTAATCGCCTTGGTATCGATGTAATCAAAGTCAGTGAGCGTATTGATAACCGCGATTGGGACATTATTGCGATTGATTTTAGCAGTCATACGGTATCGAAATATGGCGAGCTGATGCCTGAGTTAATTCAGCTTTATGGACGCACCTGCAGACGTTACGAATTCACGGTGCACAGCGTGGTCGATGTTGGATTAGCACCGGGCTTTCTTGATGTGCAATGGGACAGTGTCCATGTGCCTTTAGGCTTAACACTGACGCAAGGCAGTGATATTGAATTTAACCCCATGAGTGGTTTTTTAGGAGGGCAATACAATGTACAAAACGTGCCATTGTTGCCACTGAGTATGTCGGTTGAACATCCACTGACATTGCAACCACTGTATGGATTTTTGAGTAAAGAAAGCAGCATTAGTACTGCAACGGAGTAAGTCTTTATGTCTGAACAACAGGTAACTGGCATTCTTACTAATGCTGGTAAACAGCACATTACCAGTTGTGCACTTGAAAATACGGGGCTGAATGTCTCAACACTGGTGTTAGCGAATGTCCCTAATTTAAGTGATAACGCAGCGCGTGACCCTAATATGGCGATCCCTGCGCAAGCCCAGATTGTTTATCAAACGGATGAATTAATCACCGGCTTCATTGATGAACATACCGTTGCTTGGGCGACAGTGATGGACCAAGACATTGGCGACTTTGATTACAACTGGATTGGCTTAGTCACCAGTACCGGCATTTTGTTAGCGCTGGATTATTTACCACTGCAACGTAAACGCCAGGGCGTGAATAACGTACACAACCGCAGTTTTGTATTGAAGTTTGCCGCAGCCAAAGCGCTTACCCGCATTGAGATAAAAGCCAGTAGTTGGATGTTTGATTACAGCCCAAGGCTTGACAGCATGACGCTGGCCATTGTCGCCAATGCAAGTGCTCAAATTGACAACATGACACGTTATTTAGGGTTGAAAGATGTTGTCACTGGCTTACGAAATACTATCGAACTGCAGCAAGTGCACATTGGAAAATTAGAGCAAAAAGGGCAAGTGCAGCAGGAAAAACAAGCGGTAATGATAAAGCAGCGTCAACAAAAGGATAGCGAGGTGCAAACCGCTATCGTCAACATGACAACCGCGCAGGTCAGTACGATGTACCGACAAGTAAAACAAATTACATCAACTTCATAATCAGTAAAAGGAACACAACATGGCAATAGAACAAAGCATTGCAGAACTCGTGCAAGCCAGCAATAAACTCACTGGGGTTGTTGATGGCAAAGTAAAAGAAATTGACAATAAAGTTTTAGAATCAAAAACGAAGGTGGATGATTTTATTGGTTCAGCGCGTGGTGAGTTATCCCACGTTCTTTTGAGCCGAAATCAAATCATGGAGCCAACCACTAATGGGGATGGCATTAAAGGCTTTTCAACCATTGGTCTTGATTCATTTGAAGTGATTAAAGAAGCGACCATCCATGCGAGTTCAACTAGTGATGTTGACCATACGGGAAATGGTTACGCTGCTGAATTTAGAAAAAATGTTCATGGTGGTTATGTGAATCGTGCATTCCATATACTGCGCCTCAAGTGGACGCGTGGTACAGCATCACATCCTGCGCGAATAGATAATAATTGGAATAATGGTTATCAGCAAGGTGCGTTGACGAGTGGTTGCTACTTGAAAATATTAAGTGGTGATATCAGTGGTGATATGACATCGATTCATGAATTTTCTAATGATTGGCATTTCTACGGTATGCGCCAAGGCGTTAATAGTCTCAATGAAGCCTTCCATGGTGGGCATTCTAAGTTAGCGCTGTCTTCAGCACCTGGTGGCAGTGGCGAAATGCTGATTTGTTTGTTCGGTACGGTGAGTGGCGTTGTTAATTACGAGAAAAAGACATGGGGCTTATACCCTGAGTTTGCTCGCATTTCAGATGTTTAATAAGGAATTGTTAATATGAAAATAGTACACAATGGCGGATTAATCGCAACTATCGGTGATGCTTTAGATATTCACCAGGTAGAACGTTTCTTGTTTACGAATGGTTTTGCTTTACCGTTTAATGAGCTTGAACTTATCTATACGAAAGATGAAGCGTTAGATGTGCGTAAAAAAGCATATAAAACGCAATCAGACCCACTTTATATGGAATGGCAATTTGATAAGACCGCTGAAAAAGAGCAAGTATGGCGTGATAAAGTCGCGGAAATTAAAGCGCGTTATCCATTACCAGTTGATAGCTAATGCACAGTATTAGTCTTTGCTTCCACCAAACCGCGACCCTTTGCCCAGCTGCAGATGGAGCGGCCTTATTAGCAAATGCTATTAAGGATGAAAGCCGCCAGTTTAGACCAGCGGATTTCAATGCTCTGGTCTTATGGGTGAGCAGTGATAGCTCAACCGATTTAGCCAAGCGTTTAGCGCCGGTTAATGAATATTGCCCATTAGCCGGCTTTGTTGAATGTGGCCGTTATGCAGCGAGTCTAGCAACATTAGATTCAGATAAACTCACTCGTCCTAAAGGGACCGAGCAATCACTTGAGTGGACGGTAATGAATGACAAGCGGTCGATACCAGTGATTAAGCGGCAATATCAGGCTGCTTCATTAGCGCAGGTTAATGACCAAGGACAAGCGCTTATTAGTGATGTTGATAGCGCACTAGCAGAATGTAAGCAGTTAAAGGGGCTACGTGATGCGCGTTTAACAGCAACTGAATTTAACTCCACCAATACGGGCTTAAATTGTATGAGCATCAGCGCCAATACAGCCAAAGGCTTAGCCGATAAGCTGCAAGGTCTTGGTGATGATAAAGCGTATTGGGCTTTTTGTGCGTTCGTGGGCAATAGCAGTGAACTTGAACCGATTAAGGAACTGTTTTTATGATAGCGCTCGATGGTTGGCAAGTACCAGGTTATGAAACCAAAATAAAATGCAGCTTCAAATTAGCTGGTGAAGACTTAAGTGGCTACGGCTCCTTGACCCTATCATCGGATAACGGTGTTAAACCTGCCGTATTATCGGTTACTACAAAGATCCCTTTTAAAGATAAAGCAGAGTTAGCCAAGTTAATATCTAAGGCTAAAGAACTCGATGAACACGGCGCAAGAATGATCCGCACCGTGAATTGTGATGTAGCAGAATCATTCAAAGTCCGTAAAACCAAATTTGATGGTGAAATGAGTGCGACGGAAGATGATGAAATGAAGGCTTGGATTATCAGTTTTAATTTACTTGAAGTAATGAGTAAGTCAGAACGCGAGCAACTGCAGCTCGATGATGAAGCCAGTAACAATACAATTCCGCAAAGCAGTGATGGTCATAATTCACTACAGCAGCAATTTGAAAGTGTAGAGGGTCCGTAACTTATGACTAAGGCTATGGATAAACCCGATTCAGCACGACTAACCACCGTATTAACCATTGGTGGTAAACCCGTGGTTAATATTATCACCAACAGTGTTCAACTTGATTTGTTTAGCCCTGGTCGCGCTAGTTTTGTTGTTACTTGCGGACAAGAGCCTAAAGGTATCGTGGAATTACACCTAGGCTATCGTGTGGATAAACTGCAGCCTTATTTCATGGGCGTCATTGAATCTAAGTATCAATCATCCGGTCGTTGGTATCTGACTTGCCGTGAATTACTTGGCGCGTTGTCATTGCCGGCTAACATGGCCATTCTTTTGCCACGATGAAAGATGTATTAGACCAGTTATCACTGACCGGTCTTCAATTTAGCTATCCCGATGTTGATTATCTCAATCAACCGGTACCGTGTTTTTATCATCAAGGCGATGGCATATCGGTATTACGTCAGCTGGGGAAAATATACCAGGTACCGGATTACATTTTTCAGCAACGCCCAGATGGCAAAATCTATGTGGGCAGCTGGCATGATTCAGGCTGGGCCAAGTCGGTCATTACTGATTTTTCAGAACACCCAATCAAACCGATTAGTGCGACGACCGCTGAATTAATTGCGATCCCTAAATTACGTCCAGGACTAAAACTCAATGGCCGTTACCTTACTGAAGTGACATTAACAGGAAATAAGCAGGTTATCAGATGGTCAAAAACGCTATACGCCGCTTAGTATTACGCTACTTTCCCGAATTAGGACAGCGTAAACACTTACCGCAATTGGCTCGAATAGAGAAGATTTATGATATGCCGGTGAATGGCGCCGGTGTCAGCACTGCATTTCGCGCTTATAAAGCGGCTGATATTCAGTTGTTAGATGCGGTGACAGCTAAACCCTTAGCGGTACCTGTCTTTGAGCAAGTGAGTATTGCGTCAGGGCAGGGACATGAACATGGTTTGTTCGTAGAGCCCACACCTGGCATGCAATGCTTGATTCAATACATTGATGGCCTAGATTCATTGCCGGTTATTACGTCGCTGTTACCCTGGCATACTTTGGTACCCGATCATCGTTCTACGGATGTCAGTCTGCAGCAATCTCATCGCAGTAAGTTAGTTGGTAGTAATGGCGATTGGTACCTGCAGACAGACGGCGAAATAAAACAGACTAGCCAGAAATCAATTATTGAAGCCCAAACCAGCGAGCAGACTTACCATGAACGAAGCACCAAGGTCGCTACGCACGACATTAATAAAATAGACGGTAACCAGGTAAATGAAATCATGGGTGCGTTGAAAATACTCGTTGGTGAAAAGGCGATCATTACCTCGTTGGATAACCTGCTTTTAGGCAGTAACAAAGAAGTTAAAATACAAAGCGCAGAAGATATGCATCTTGACAGTGCCAAATCACTGATCATCAAAGCCAAGTATATAACTGAAGATGCAGACACTATTAAATTAAATGGCGGTACCGGCGTGATCACCTGTGCAAGCATTTGTCCTTTTACTGGTAAACCGCATGTTGATGGCTCCACTACCGTTTTTGCAGGGAAATAATATGGCATTAAGTAAATCAGCACTAAAAAGCAAGATTGAAGCTGAAATGGTGAAGGGCGGTATTGTAATTAAGGGAGAGCATGCCCAGGCTTCGGTATTAGCGCAAGCAATTGCGAATGCAGTCGTTGATGAGATCACTGCGAATGCCGAAGTCGGCGTAACAGGTGGTAGTTCGGCAGGGAAATATAAGGTTGGATAA
Protein sequences of DBSCAN-SWA_2 >NZ_CP044399|1524700:1560905|1531160_1531913_+|WP_019440198.1|DBSCAN-SWA MLSKDNLLTYLDEFAGVLELEGIVISGAPIRMLKAGLQRSKPHKGLTYNLTGLEFKNISPELFSRTGKWDDDISVRLGMDIRLNQNVEQFKFGNTHSLTVNVEYSALSQLEEDKFFECKGAWHLDFHNQPEKDGDPEYIHPNYHFHHGGKKLSELNDCGQIILLDAPRVMHHPLDLFLSVDFVVSNFCKKEVWKKLRANTTYTNYIKMAQDQWWKLYYQDLANYWNHFGKSGVDDRAICKSAKEANPHYV >NZ_CP044399|1524700:1560905|1557743_1558031_+|WP_019440164.1|DBSCAN-SWA MKIVHNGGLIATIGDALDIHQVERFLFTNGFALPFNELELIYTKDEALDVRKKAYKTQSDPLYMEWQFDKTAEKEQVWRDKVAEIKARYPLPVDS >NZ_CP044399|1524700:1560905|1556990_1557728_+|WP_019440165.1|DBSCAN-SWA MAIEQSIAELVQASNKLTGVVDGKVKEIDNKVLESKTKVDDFIGSARGELSHVLLSRNQIMEPTTNGDGIKGFSTIGLDSFEVIKEATIHASSTSDVDHTGNGYAAEFRKNVHGGYVNRAFHILRLKWTRGTASHPARIDNNWNNGYQQGALTSGCYLKILSGDISGDMTSIHEFSNDWHFYGMRQGVNSLNEAFHGGHSKLALSSAPGGSGEMLICLFGTVSGVVNYEKKTWGLYPEFARISDV >NZ_CP044399|1524700:1560905|1534261_1534417_+|WP_019440192.1|DBSCAN-SWA MNKLTIEERRRGLVNVKRLRMELQSVMDSKKVERGVGKCDLNSRISFVKLK >NZ_CP044399|1524700:1560905|1534413_1534629_+|WP_019440191.1|DBSCAN-SWA MMNTLTPEQRRRGLANVKKLRADLEVISERQNFTKRNFIQRIKIKAGQLRTMTTTEYTAKYGKDFYVHPNH >NZ_CP044399|1524700:1560905|1532795_1533017_+|WP_019440196.1|DBSCAN-SWA MSTTIVIQMNDVPTMAVELFAEKTGQTVCAVTSQMDSGALPFTQHKPRATRHVNIAKLTVMCLESNSDKPWLA >NZ_CP044399|1524700:1560905|1556218_1556971_+|WP_019440166.1|tail|DBSCAN-SWA MSEQQVTGILTNAGKQHITSCALENTGLNVSTLVLANVPNLSDNAARDPNMAIPAQAQIVYQTDELITGFIDEHTVAWATVMDQDIGDFDYNWIGLVTSTGILLALDYLPLQRKRQGVNNVHNRSFVLKFAAAKALTRIEIKASSWMFDYSPRLDSMTLAIVANASAQIDNMTRYLGLKDVVTGLRNTIELQQVHIGKLEQKGQVQQEKQAVMIKQRQQKDSEVQTAIVNMTTAQVSTMYRQVKQITSTS >NZ_CP044399|1524700:1560905|1555483_1556212_+|WP_019440167.1|DBSCAN-SWA MDKQPLQQAQPEIATWLNKGHAEQLMKAAQQYWTNTKDWVMWAVAQKDEQQSEEPFLGLLAWERLTERLPFEPADFFRKRVQHALVNTIDAGEIATIEAIFNRLGIDVIKVSERIDNRDWDIIAIDFSSHTVSKYGELMPELIQLYGRTCRRYEFTVHSVVDVGLAPGFLDVQWDSVHVPLGLTLTQGSDIEFNPMSGFLGGQYNVQNVPLLPLSMSVEHPLTLQPLYGFLSKESSISTATE >NZ_CP044399|1524700:1560905|1547712_1548198_+|WP_019440177.1|DBSCAN-SWA MSQSKLQRLVQYLVSATYKGRNLARAGEFDSWIEGGRIEHASKRINGTGLLAARFFYSGVISINPCNAPVELIATYVSFWLMTNAEKDDSHDVEFSLDINDDNSAEIELTIERFAEDVMLVEDMNGPFELEFQGEMKRFDFGEQSLWIAASFDLDAEFMAQ >NZ_CP044399|1524700:1560905|1551041_1551308_+|WP_019440172.1|DBSCAN-SWA MALEQKIVLEVNDIELSFNVNVTAYNKFLNQSNQVNKIQPATNFLMTVVEPECKSALKEMLAMPGAALHLVGSVVEEYQPEFNITVKK >NZ_CP044399|1524700:1560905|1531994_1532696_-|WP_019440197.1|DBSCAN-SWA MSTTVGLKIKSIREAEGLTRSQLSDLIGISTDTLAQYETGRIKTIGLEKLDKIVNHDRFHKYALWLVSEKTEPEVGQISPSDLNVYVQTHKKDSAHIQLPFYEISASAGVGLLAEVEERPKTISFEPYWLRNEIGVCPNNVFLMLVDGDSMQPTLKNGSMIMVNRDVDNLSDGVYVMRHDNNLLVKRLQMLPGGIIRVKSDNTMYDPWEITKSQLDGEELALIGRVVWTGQKM >NZ_CP044399|1524700:1560905|1545224_1546310_+|WP_019440180.1|capsid|DBSCAN-SWA MKLKTTQIFAAVVSALAVNYGVASVSEQFSVEPSIEQTLYDQVYQSAEFLQRIDTQMVDDLVGSSITAGISGGVTGRAGVEDDDSKSRSTKDPLGLTDREYRCYPVECDTHITWQRMDMWAKFPDFHSRFRAHVRQAIALDIIKIGWNGTSAAKVTDIATYPMMQDVNIGWLALVRRDNAANVFADGEQKDGEIRIGSGGDYENLDQAVHDLLQAIPAHKRIGLVAIIGDELLSKEKNKLYAKQAHTPSEKDKIELEQIIETFGGLKAYKVPFFPERGILITSFDNLCHYVQSGSTRTSIENNAKKKRVEDYQSRNDCYYINDMEKIAFFEADSVKLDKVKDPAVAVGDFDADNPDHWVWS >NZ_CP044399|1524700:1560905|1527690_1530033_+|WP_019440200.1|DBSCAN-SWA MSNETLLKKQLSKLLEKDNVDYGKVLSLASEISEHDQHNVRFSVDAQVVQRLGEQLVAKKTTALSELIKNSYDADATKVTVLFEATEDAGGDITIQDNGNGMSKADLINGFMKISTSDKAENPKSPVYDRQRAGKKGIGRFSAQKIGSKLTIITRCSSNEPYLVVNIDWDDYSAKKSLSSISNNIITSNENFGFEKGTQLVISDTRDIWSQENIQTAYRYTSSVVKNKPSIKKCGIKDPGFKVKIKSFFSSENTPSNIVDDDTEYLQYADSIITSWINDDGCCVVKIKGQNGVDLSDKFILDLKVSDLLKKVNFNFSAHYFAIAKSEPGKHFLKTYLRNNGGIKLFRNGFYVAPYGELLNDWLGLDDSSRRRVILPPHSNTNFIASINILDNELSMFEETSAREGLIESDAYKELCSLSYALIVEAVKRISAVRGVKITANQKDFVPNKPIETQVKDQVEVVVETLNNWVEASSKVNEVVTDNEVVTDNEGVTDKPVNEKPALPETVIPKLIQEQVEKLSILTNELIDEKHMYKVLTSTGLAIAEFTHEIQLYLDALTLTSTQLKTIPDEYPELKRTADILDGNLSMLIAYTDFFDDTIRSNSNREKSYYEIRDIVDKFFTAMEPTLDRRGYELIKTYDDWGIWTKQIHISELSSVLMNLFTNACKAIKRSGNANGQIGVFISSTMDDIVIKFEDTGDGIPSKNRKRIFNPLFTTSVTAGAFSSDNQNLRGMGLGLTISKDIIKGLDGEIAVVEPSSGFNTCIEVIIPRAQETEIPNNAY >NZ_CP044399|1524700:1560905|1526015_1527698_+|WP_019440201.1|DBSCAN-SWA MNPHSLNKKRELGAYYTPPELSQVLVDWAIISRLENILEPSFGGCGFFESCIKKLENLGCKIPSEQLYGVDIDPHAFDILSQKFGEKVSLDKRFLHSDFISVSPEQFLVPDFDVVLGNPPYVSMHNMSLEQRRRCEKTFKESPFIKKTMGRNVSLWAFFLLHSLSFLKDGGKVAWVLPSSLLHTDYSKKLIEVHKQHFKTVKIAKLAERFFVSEGAQETSIVLIAEGFSKTASNTGCLEISSVHNIEELHEFIHAPIDQKKQNHFDNYKFSLLSSAIRDSYFQVEQSQVSKKLIDYLNIKIGMVTGANKFFIINRDTIEKNNLDSSYLRPIVSRFSCLVGVRHNKLRQRENEDNNLRSYLLNPCNEDMKEKNTPIRNYLAQVSAKERRRNKTFPKRANWYTPDDNIYPDAFFSYMSHLGPRIVLNQGKVNCTNSIHKVFFNERLSHSRKLAISISLLSSYSQLSAEIEARSYSSGVLKIEPTAGKNIRIIMSDECIQDLASNVTHIENLLIKDKQPEMTIFIDDIFIKHDILTKDQCNLFREGVRILRKERYKGVKTYDE >NZ_CP044399|1524700:1560905|1534609_1536559_+|WP_019440190.1|DBSCAN-SWA MFTPITKTSNPLKKLPYNLRCELASFVDNNANKYNVEPARDRAFAFARSVVSKLNVPYDIKNYLDKAGAARLKKHGLKRAVKFIDDRSAHIVASLGVLPEPWYRVDTEYKRVRLADELTGRACLHLELALKAGKRPLEALEEINEFTGLALWMPHFEPKKRDRDDDVYLGLIARACDDAVWLRAINQKVTIAFESARRAAGMVSPHVSPYASFTTCQWLKDRKKKQLDWLDSMAIESECGETLELKDVHDASVSNPANRRYELMTQLSGCQAYADSQGHVGLFVTMTAAGRYHRLKKHGKYFVENPNWNGADPIAAHDWLKTSWQRFRAAADRAGLTYYGMRVVEPHVDGTPHWHGAFFVPENQVEEFTQLLTQYQHQRDNDELYTLDGDPKLKAMEARVKIDKVDRSKGDAVAYIAKYISKNIDAHKLEGKKDLDSELVDLVETVTNVTAWSRAFCFRQFQFQKTPSVTVWRELRRIEKEQEFCLFEKIRLAADCGCFASYFNWMGGHRLKQRNRPIKVLYERSENHYQELVKKTVGLTGVGITVLTREKQWTLVKKPTQPKDLGALALSLLGTAESGGSRFPWTSVNNCTQGAIPQESTCVPDVETSNQTITMNIESVIPVQGQQIEKPPLIEKAKCGSNQNIIKEI >NZ_CP044399|1524700:1560905|1551517_1553983_+|WP_019440170.1|tail|DBSCAN-SWA MSALSKLEKLMYTIGVVDKATGPVNKIMDKINQLSSQTASAQNQMMSGFMGTAGGAIALVSSLSPAIDHVAALGEVQTLGVANEDLTKLTKTAFEFTTQFGGNSAEFVRSAYDIQSAISGLTGDELASFTKASNVLAVATKADASTITSYMGTMYGIFEKNANKMGKSDWVNQIAGQTAKAVQMYKTTGAEMQSAFSALGAKAANRGIDAAEQFAVLGELQLVLKSGSVAGTQYAAFIDGIGKAQKELGIELTNSQGDMLGIDVVMSRINQKLAGVGSVARGDILNKAFGSKNAASVVDILSSKTAKLKQGINELTNVTDASKASEMANIIASPWDRFSGSLNGAATAMGQAVLPIIEPVVDMLVAMLGGVIWLTQEFPTLTGVLGAVVVGVVALMMAFSAMNMIIGIYRFALIGLSLVSNAAAVSTKLWQIGLVALRVMGFLGNIAAIGAYLTAIALYRGAMLAAQGVTWLFNTALLANPIGLVIAGVVALVAAVAGLIFYWDSIVTAFKDTSWGQALIGIFDSVMSVFSGLIDNVKWVLEALGLMDGKEMTVNSKVEEVTKTAAPISAENTLSTPLNPTVSNGQQVASTEVFAANNVANMNMQHNQVGGLTPVVANNLANMNTRISQVNGSASVNTNQVLNTEAFAANNLSNANTGINQVSSFAPVAANQAFNTEALATSNLANTNTQLNQVSSFTPAAANQVFNSETLATNHLANTNTQFNQVSGLSPVATSLVESQTFNETSAHSARIQQYQENNQQQAKVSRPRIQRTQYFQQSKQSGNNQSSSSADNSKRVYIDNVVMKSDNLAHDFEQLMELAG >NZ_CP044399|1524700:1560905|1533750_1534041_+|WP_019440194.1|DBSCAN-SWA MNTVESVINEIMFIAISRPDAIDITVEYSGVSDSLSVKVMPRGFDYINTTTESYEAAILYRTDVWLNEPGPMQTALEAKSKILELMATSVNVGVAA >NZ_CP044399|1524700:1560905|1536559_1536823_+|WP_019440189.1|DBSCAN-SWA MRVTCKFCKGKARISSTDKVSVEFTRLYCQCLDAKCGHTFVMDLGFSHTLNPPSNIVDQLLVDRVRQMPVEIQRELFGFTSGISNII >NZ_CP044399|1524700:1560905|1524700_1525669_-|WP_019440202.1|integrase|DBSCAN-SWA MSIKQITNGYEVDCRPQGRDGKRYRKKFTTKGEAQKYERWLLSTQNQKDWIEKTADKRPLIDLIHLWYHYHGQQLKTGPRELKHLITINADLGNLRADQLTRQRFMQYRADKMAGGMKPASINRNQTRLSSVFTALIKAEEFHNEHPFKGMGPLKVRAPEMGFLTKAEIKQLLTNLSGDELHIAKLCLSTGARWSEAAELKTSDIVHGRVTFGDTKNGKNRTVPITDELLKEIAHGKSGRLFNGSYTVFLARLKKQNFDIPKGQASHVLRHSFASHFMMNGGNILTLQKILGHSTIMQTMTYSHLAPDYLNEAMKYNPVSTL >NZ_CP044399|1524700:1560905|1539294_1539534_+|WP_019440187.1|DBSCAN-SWA MKFLTKGLEHEERINLLLQLTKIGSENIKSALVDHLTKGLTENDAAMLNDVPQQNFNRALKRLNDVAGVVEKVKELDWN >NZ_CP044399|1524700:1560905|1547212_1547716_+|WP_019440178.1|head|DBSCAN-SWA MARADLLEVSNVMNLTGMPLAQVSNENVVNNGFYPDLGTAEFITDYSIATEYANNSEQVKRTLVLAMLDVNKALAKYRLRHWQQVEQLQDVNLDEINGVNTLILTYQRAVYCRAKAALLIGRLGETHRDQRAAQQVMASDNQEYWLAESDMALRQIMQLTRSGVELL >NZ_CP044399|1524700:1560905|1534037_1534265_+|WP_019440193.1|DBSCAN-SWA MKFVAIKTCPNGGTIREPKTNEPQTVEIGDHDSKADAIENACYLLDCRQLFRGVLRSLKNAGGYIVLDMQEYAEV >NZ_CP044399|1524700:1560905|1542260_1544066_-|WP_019440182.1|terminase|DBSCAN-SWA MKPRTPRYTPEIIKTARDHYVFGGLTFDEIAEMDSMPSARSLRRWADDGSWNELCPSLNAETAIARRIVLLADRENKSAADYKELDFLTKQQCALNQSRLPSAGITKKYGNTPAAATAQNEQTSERTSKNKKRQKKIKNDVSSITKEMLDTLKDNLLYPHQLHWFEHQDYRSRFILKPRQIGATFYFAFEAFYDAIVNGRNKIFISASRDQAEIFKANIIALCREQFGIELSGSPLTMRNKGKTTTLYFKSTNARTAQSASGDLYIDEVFWIPKFKELRGLAQAMATHKDFRITYFSTPSVTSHEAYDLWNGRWYRKTKACNDPEFAIDVSHKTLKDGRLCEDGIWRQKLNVYDVVKQGFDRIDISILENEYSTEEFNNLFMCKFIDDAHSAFSLKQLMACVGNSKKWPDFDPTWSRPYAMKPVVIGFDPARTRDIASVVVLSLPLGPDDKFRLLESLNLSGNDFETMASEIKELTLKYHVVHIGVDTTGMGLGVFELIQKFFPLAMPIHYNPHNKNKMVIKALNVIGKGRFEFDEGSVMVASSFINIRKKVVGDQISYATNRTAATGHADIAWAIMHAMIYEPLSGDCSSTRTSIGLDAA >NZ_CP044399|1524700:1560905|1560707_1560905_+|WP_019440160.1|DBSCAN-SWA MALSKSALKSKIEAEMVKGGIVIKGEHAQASVLAQAIANAVVDEITANAEVGVTGGSSAGKYKVG >NZ_CP044399|1524700:1560905|1546391_1547141_+|WP_019440179.1|DBSCAN-SWA MSIVKRNQRKTHASIVGVDLASGPDQSVTVTAKHGKVVTAEPNRAQDIMDEFDFFKAAMDSDLAQLKKFSHIEDKIAYKAQAIENHQYLDYLRRYQAQGTNHQNMVLAWVVIWLVDLGHWKTAFEFLPLLVTQNQRLPGRFSTQDWPTFLIDQLYDEGAKHLSKGRDAVERSQVINLFTQFIQLLETHQWRLSELIGGKLYAMAAKLEQSVFNLGNAYTYGTKATELNDKAGVKKMVREIAKTIGKDNE >NZ_CP044399|1524700:1560905|1549972_1550425_+|WP_019440174.1|DBSCAN-SWA MSVKALGGKDFDIFIGDKMVHVIEASVKITDGRKAKKVRGITKGYIDGPVDAEVTIKLDHENFLIVQDVAKTAGSWKGIEPFDVSFLAEVAAGTKNIEAFGVLPQLDEILNIKAEGGEEDTTTIKGPVTSTDFIKINGIPYLTAEEVRDL >NZ_CP044399|1524700:1560905|1540451_1540901_-|WP_019440185.1|DBSCAN-SWA MSLLTSLFSFISAPIADLSGSYRERKRIAAEMAASIATAEGNLKLAKLDAESKRLANQEGNDADYDLQVLKNRRESIMDEIIITVFLGLFICHFIPQLQPYMANGWQAMGYKGAPWYFEFVIVGIAVSTLGLMRLFRAFWGSKNTKGVS >NZ_CP044399|1524700:1560905|1541220_1542261_-|WP_019440183.1|portal|DBSCAN-SWA MNLTKSTTAEPVKSKSIDTFSFGDPEPCLDNHMTEYVGLYADMDGLYSPPVSLPGLVKLLRVNAQHGPILYFKRNMIMKWFQPNMALTQRTFKKFAFDYCWAANAYLQVIKNAFGNVIKLRHLPALSMRYTSTPGVYAQRLSNGKVLRFKKGEVIHLKEYDPNQGIYGIPQYYGGIQSALLNEDATLFRRKYYKNGAHMGFIFSMADPNLSTDDEADLKTAIKDSRGVGNFRSLFINNRSGKADAEKAIKIIPVGDISTKDEFERIKKMTLNDMLSMHRAQEALSGQTSGDSPGFGDLDKITRAYYNNEVVPMQQDMMEINEYLPAALHIEFSVPAYSDLNPGSED >NZ_CP044399|1524700:1560905|1558030_1558672_+|WP_019440163.1|DBSCAN-SWA MHSISLCFHQTATLCPAADGAALLANAIKDESRQFRPADFNALVLWVSSDSSTDLAKRLAPVNEYCPLAGFVECGRYAASLATLDSDKLTRPKGTEQSLEWTVMNDKRSIPVIKRQYQAASLAQVNDQGQALISDVDSALAECKQLKGLRDARLTATEFNSTNTGLNCMSISANTAKGLADKLQGLGDDKAYWAFCAFVGNSSELEPIKELFL >NZ_CP044399|1524700:1560905|1533213_1533732_+|WP_019440195.1|DBSCAN-SWA MYEINNSKQTVIDAACIRFADIENVENIASACGMRGQMLRNKLNPNQPHQLTVSELIKITKETGNHDIINSAILDIGLVAVRLPKQGSAKPLAFSAMSVTAHTGEINRHILESEADNRLTRHKKDAIVTKAHSTIRELVFLISDVENRCGGAGPFVSMFTEAVLNGMPIPGM >NZ_CP044399|1524700:1560905|1550424_1551039_+|WP_019440173.1|DBSCAN-SWA MAAIKNLTAASLLVAMKAKQHIVFEGELNLNIIGIRNTDTKANSFNDLLCVLYQQDEKWQLETFKCTTDAGTYYRETPCNVDGTAVLAAMQHRSLWTFGYHKGQYPALVQHKPVAVFRDNNNDNQVDCDSALQRGYFGINCHRASANHESKQVDKWSAGCQVLANPNDFNKLMALCHQSSQQWGKTFTYSLLNQADLNPTKERA >NZ_CP044399|1524700:1560905|1554318_1555491_+|WP_019440168.1|DBSCAN-SWA MGNENRTPDIVPDFKKMMADAGLPVNETVAKQQWDQVLSDQQITIENGSPFSPFWRTVKALITQPVVSLLDWISRILMPDLFIMTASRSALIGLHGPSRNVFVMDAIKAKGMLRLTRVNPDGVLSITAGALVESDSIGGAVYQLRTLSAAVFQEGESVIEVLIEATSPGQAYNLPVGSYYRLVNPIEGVTVRNEKDWLLIPGANEESTEAYRNRIRNVFGTAAKWHINTVYKSIISDFAIPVENIEIVTQAPRGPGTANAFIYLNVGQVSTGLLRAINQHIRDDGHHGHGDDFLVYAMPTQDKMITATYSLHANSADIKADIKTFIQAAFRLNDAYQPMRTRPNSLFSMSLLQAQLHNQFPALRSIDFDCGDIKTGLWLPKLTQLVVQHG >NZ_CP044399|1524700:1560905|1544256_1545207_+|WP_019440181.1|capsid|DBSCAN-SWA MAQLRTIPLAIAAMGLTVDGREISEKDIDDIVATYNYKKYGARINLDHEFNWSGWAAKNLLNVEINGGMLGDVIELSTAKNEDGIKVLYAVLSPNASFVQLNQADQAVYFSIEINRDFMKSGQTYLTGLAVTDYPASTYTDRIHFSQNDNADNTHAADTNVNSTPSDTDLLKVSLALEEAAKPTKSLFKKLFNFNKDDDDMKREDFAAALTDSLGGPLLQFSQALEANTQATQALLAKQGTTTAPVDDVNTDTQENTGADKPTAEFSAIDGKVNGLVEQIEALTKTVADAIKDPAPTTTDGEEEHLGENAKYHNLL >NZ_CP044399|1524700:1560905|1539680_1540373_+|WP_019440186.1|DBSCAN-SWA MLNEFLSGYPLLAGVLAFVLACIPLFKFAHSIIMSKSVLRAQALDLVFKSYGDTSNLSHRLVIEQMFQSNFNLKLDYETITLLLALPNPTETLDLYSKSKRYLVSNNQILEFETKFKCSKRRKLERVLRPSRNIILYGIFGTIAGTAGLYVLKNFNMDEFFTINDSIFNGAIWSFLLIISILSAKFALMSLTDKINIRDAEKLEAKFQPKTVKKIFGTTEIVNNLRRSNL >NZ_CP044399|1524700:1560905|1540897_1541221_-|WP_019440184.1|DBSCAN-SWA MEELIVFIRQWGQLCLLSLLAAATQMYMSGTRITFFHYFMSVLMAILSAYIAESFCIWLGLNDGLKTGIIGIAAYVAPHFLTGLNALAKAVSKDPRHFLDIIMRNKS >NZ_CP044399|1524700:1560905|1553982_1554306_+|WP_019440169.1|DBSCAN-SWA MINTHVDLNIVDGDFVFNPSLSVEKLSAAKVIGQDVKHRILESGLLVKLVKQRNRNGIAPVLTDLELEVEQDDRLKPGTILITYNSDNTLSIEAETKQYGLMKWAGA >NZ_CP044399|1524700:1560905|1536910_1538941_+|WP_019440188.1|DBSCAN-SWA MGNIFFKEITKKKGLKTPTDRLFDIDKWHEVWTRVEGLATEQDKMIYNAGIIIDELLTEIRSNLKFMYENEIPKVSYKELLMAYISLTNRDRSVISNSMTELSGVHADVHSITMNNNMMGNDITIQEMSDAAVDGMESAIKTSLMNKNKKITSSENPIDIVDFIIKESSLSQLYSIYSGYWNAFLWNGYELTELDKKNKIFSFKQPKCKYEIGFMASQIRKSKLVAHSFSIIGNPNVINKFDDDKYIIIEKRGRIKSVKVSSISNAPEQVRYTNADWRVQTLDLEDNLPKELLNQEFENGFTIFEVLDVFRCLILLSSYYHDKYPKKDGFDNLKKLLEFCPKIQKSKLSLALQNATGYPIEKIKIILQFIEFKGARGEDLWCHPIVSISESEYALITSALLTPVMIRVVEHWFVKLNITLQGKGKTYEKNIIHPLNHNILLNCYIDDYDKAISGDVKSDSGVEDIDLLVRVGETILIGEAKSIVTTDSPISQYRTINILKHAAKQAKRKAKFVEDNIESVFKNLGWFFDKSKKYTFTKCILNSGRMYVGSEIDDVPVCDEKIINKYFEENIIPLISSYNDKNKVEHLAWFKLYDDFHELQTNLNTYLCNPPQISERNFIYNDQKLPCLSNASYKIITKRLIPDDLRAQDLLNAKYVFPLETVDDVDEKIKMFDLSV >NZ_CP044399|1524700:1560905|1548861_1549962_+|WP_019440175.1|DBSCAN-SWA MAQGKVLVTALNTGSGATKEVERSALFIGVGALNINKIVPLNAQSDLDALISAADSALKTQLTAWMRNGDALVSGWAIPINAGDDEFALIDKAMDQNISPEIIVITTPVTGKAQVEAYQAKALDILSKYARRVRFLVSAPGLSDNQTWTEHLAAITPLTDGVVAARVAVIPALYGDELGAVTGRLCKRSVTVADSPMRVQTGSMALQPTPHDSSGQPITNAVTAALDAIRFSCVQFYPDFDGIYFGDVNMLDAEGGDYQQIEAGRIVDKAARQIRIIAIYQIKNRRLNNSSTGVAFGKRVLGKPLRDMSKSINIGADKFPGEIREPKDDSITLTFMNARQLRVTVKIQPIDSPSEILVGIMLDKDE >NZ_CP044399|1524700:1560905|1558668_1559151_+|WP_019440162.1|DBSCAN-SWA MIALDGWQVPGYETKIKCSFKLAGEDLSGYGSLTLSSDNGVKPAVLSVTTKIPFKDKAELAKLISKAKELDEHGARMIRTVNCDVAESFKVRKTKFDGEMSATEDDEMKAWIISFNLLEVMSKSEREQLQLDDEASNNTIPQSSDGHNSLQQQFESVEGP >NZ_CP044399|1524700:1560905|1548204_1548849_+|WP_019440176.1|DBSCAN-SWA MLNLDIPTDSALKQLDMLTLDANKRRRILRGAGRQVRRDTKTRLKGQKGLSGTNWQGRSDGRKKRMLKKLGKGIQVHTTPNNATVTFGNKRLGQIARAHQEGITSTQTAQQAAKTNGTPEYRAKASRRQAKSLRDNGYKVRKKRGKGWKSPSLKWITENISVGQAGLILRILRGNKKAKSSWDVKLPARSFLGQNSNEQTELKNYMLDEAFRLR >NZ_CP044399|1524700:1560905|1559851_1560706_+|WP_019440161.1|DBSCAN-SWA MVKNAIRRLVLRYFPELGQRKHLPQLARIEKIYDMPVNGAGVSTAFRAYKAADIQLLDAVTAKPLAVPVFEQVSIASGQGHEHGLFVEPTPGMQCLIQYIDGLDSLPVITSLLPWHTLVPDHRSTDVSLQQSHRSKLVGSNGDWYLQTDGEIKQTSQKSIIEAQTSEQTYHERSTKVATHDINKIDGNQVNEIMGALKILVGEKAIITSLDNLLLGSNKEVKIQSAEDMHLDSAKSLIIKAKYITEDADTIKLNGGTGVITCASICPFTGKPHVDGSTTVFAGK >NZ_CP044399|1524700:1560905|1530022_1531171_+|WP_019440199.1|DBSCAN-SWA MPINYLYIDDDETSSVAHYISAIESHSGGELKITHVRVSSMSSIKNQFIEGAFDGFIIDQKLDASNDEGETVDFYGTALAQNLRTEMIVADVPPSPIVLLSNEEPFVKYYDADESSHNLFDYTVKKKDIRNKSIAKNMCLTLIALVEAYKTVIDLPSNGTTEDTFAILKPVLGWTEQEAEYVDTRFKSELLDTRKDVHTFVAILFSSFIRSAGTLVTEEMLATKLGIDISSSSDWNALKDLFEEYKYKGLFSELKDRWWFSDIEDWWYDNNEDGRVLQALTCNERVSVIKNATTFEHLSSLIGSYENQSLNFWVNCIVTGKPLDPYDALRANDSKLKNWEAPIYLDLVAVLSRQASKVGYKVHPEDQAKIKLLTARLNPDVK |
42 | Vibrio_phage(28.57%) | head,terminase,portal,capsid,integrase,tail | attL 1524586:1524606|attR 1561344:1561364 |
Homologous phage analysis in the prophage regionThe bacterium proteins that are colored denote the protein is present at specific phage-related keywords (such as 'capsid', 'head', 'integrase', 'plate', 'tail', 'fiber', 'coat', 'transposase', 'portal', 'terminase', 'protease' or 'lysin' and 'tRNA')
|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|
CRISPR_ID | CRISPR_location | CRISPR_type | Repeat_type | Spacer_info | Cas_protein_info | CRISPR-Cas_info | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NZ_CP044398_1 | 20525-20625 | Orphan |
NA
Consensus repeat of NZ_CP044398_1
|
1 spacers
spacers of NZ_CP044398_1
>1.1|20548|55|NZ_CP044398|CRISPRCasFinder AATATCTGAACCCTAGGTGGAAATTAATTAATAATCGCAAGAACTAACGCCTTAC |
CRISPR arrays and Neighbor proteins around NZ_CP044398_1
The CRISPR arrays of NZ_CP044398_1 >merge|NZ_CP044398|1|20525-20625|CRISPRCasFinder ATAACTGAGTAAATCACCGTGGGAATATCTGAACCCTAGGTGGAAATTAATTAATAATCGCAAGAACTAACGCCTTACATAACCGAGTAAATCACCGTGGG >NZ_CP044398|1|1|20525-20625|CRISPRCasFinder ATAACTGAGTAAATCACCGTGGG AATATCTGAACCCTAGGTGGAAATTAATTAATAATCGCAAGAACTAACGCCTTAC ATAACCGAGTAAATCACCGTGGG
>NZ_CP044398.1|WP_019442232.1|20361_20511_-|hypothetical-protein MRTFKGRIQFPSGVSQDVVVQADNQYKATQLAKSMYQGARISRSFTEVR >NZ_CP044398.1|WP_019442233.1|19511_19904_+|hypothetical-protein MDPILSGLLGVLVGAILGHRLSLGRDRRKEFNQATELLRKNSIIQLDSMEDDYIGTKRVTEDEIQTLRSIIGDKRSKKIAYAFKLYTQSHKNYSQSQPPSNPINPQPINISKIPECKLALKKLIKSLEPL >NZ_CP044398.1|WP_019629019.1|19152_19443_+|hypothetical-protein MRYYLLPILLLLPLYAQAEIPSLDSMAFAAKHQINSNAFKNAKYVQGNTFPLGKHKTAYELFYSGKYKGRSAVMQINCKAVNKTGDIEYCKPVDIK >NZ_CP044398.1|WP_019443267.1|17855_18299_+|winged-helix-turn-helix-transcriptional-regulator MFKVDNPTHSIGLQFWNLYTKWNAEITISLKPLGITHTQFVILAAILWREKTHNISSQSEISALTSIDKMTLSKALIKLVEKKFIVKNKAENDSRIFILTLTEIGAGLTKQAISIVEDIDEKIFGSLGAEKKAIFLSLILELKSFSS >NZ_CP044398.1|WP_019443266.1|17441_17852_+|hypothetical-protein MIIEESIKVNSTPEHIFSLYKDVSNWKEWDKEVKASSLIGAFKNGSFGSVTPSKGPKSKIYLSEVEENKTFTAESKLPFCVMYFEHNLTAVDNAVLVTHRVKFKGPLRFIFGYLIGKPIKVGLPVTLKGLKYSAEK >NZ_CP044398.1|WP_019443153.1|16621_17137_+|metal-dependent-hydrolase MKWINHILIAGSIAAVISPSLVAPAIAGATAPDWMESLLKAIGRPVKHRTTTHVFTHWLIAGIATSFLWDFHGIFAAFCWGGFSHILTDGMTVSGVPFSPYSDRRFHLFGGRFRTGEPVEYAISAVIVMISITLNAMIGDSFAPYFYDWSGLYAEGMIDASEWKANRFRLM >NZ_CP044398.1|WP_019443152.1|15890_16625_+|hypothetical-protein MQKWNLCTLIIATQLASISLAKAIELPWQQTEEKPTVLPFEVPRLHGQSESFKNPVNPLGSAPLIDTDSIYYAALNCYPEQSTFKISVNLVAGYKANTDQFEEDDWPDITDHYIGIVAKMPLYDTTDRSRSRDREYNRRVKTASHVAGFAQALANRNYAYREVGIYLAMEARAQARVSQGIVGVDEQIKYLEKTAGAQRNIIKTTAEATEHRLALVAMCDSEKSDRFNDYLVNVANLPKTVQTQ >NZ_CP044398.1|WP_151676813.1|15745_15964_+|hypothetical-protein MKKRIKSILRITTASALLTLLTFAAIPTPAGPSYLTGYIELFETHRETNAEMEFMHANYCDTASKHFACQSD >NZ_CP044398.1|WP_019629005.1|15000_15741_+|hypothetical-protein MRFLKIKRVSALRADGSEMKTPSIIDTPRRFAKSHERKETCIKRTKCQLITGAHDSGKTRWLERLYDDWEPIWSAKIKSQPVYISALDPVSDWVDAAHVAKWFEVQERESAEQGGGEPRNWRKLSQKQRISETARYLHETGTLLFLDDAHKLTGRKLQFVRQIMMSTRIWLMTANAENRLSPSLRTLVERASPQRTELDSDASYDATRIMLWLMIAGFTVSGVWEAALILGGLQMLGAGRNAAKPD >NZ_CP044398.1|WP_019442210.1|13387_14416_-|hypothetical-protein MLNSLGLSYNDVSVKVTSDVYNALPNKPYCSDSKTYAIIRSKYYAQDKPYIQVNNPNLKRYLIVDIDEQDAYSTLLDSRLPQPTYISINRVNGHLQCAWKLRDAVSTSYNSRVAPMRFLAAIDAAYNYRLGGDASFGDCLAKNPLHDRWHNEYYDTEYTLHELADYVDLREKDASNVAANDDVSGLGRNNTVFDVARKQAYKMVRKAVSKPQFQSWKADILACCESLNKQFSKPMQHNEVKNIARSIARYTFKMWALFVHSMDNFRAIQAVRGAIGGKLGSNKALSGAKGGAKSKRSGSVKKDGLLSKVLAMKSQHYNHRAIAEDLNISASTVSLWLKGARS >NZ_CP044398.1|WP_019442231.1|20969_21455_+|VOC-family-protein MKMNHVGIMVGDMDKAVEFYTKALGLKTVMGNTKVEEERETAIGKMCIAVFGEGFKGFNIAHLVTSDGIGVEMFEMKERQERHEVDFSRIGIFHFCLQTDDFDGVIARTEEYGGKVRMDIHRYHPEDDSKQAQMVYLEDPFGNLFELYSHTYEETYASDYE >NZ_CP044398.1|WP_019442230.1|21832_22000_+|hypothetical-protein MSKLYNLGYALGSMINKEKPAEPVQEQSVKRTAQSFTTTRRVQVKAHTRNYPQSK >NZ_CP044398.1|WP_019442229.1|22214_22787_+|hypothetical-protein MDIIFVDAENIGLKELEKLETSIIDKVFVFSKSNCIKLVCEKKLYLFLSDYPCGSNQADFYIIAYLSRVLSSLNHTELTSINFKLITNDESLISAFGFQCSQLGGISKIIKTNEKIKTDVNTVVQLTPVLAPKSVEEKIIFHLKSPETLNPEFRKKLGISQQDFSRATGELIRQNKIKRSKGSKKKWVTR >NZ_CP044398.1|WP_019442228.1|22997_23984_-|hypothetical-protein MKNKLLVMLISLIPLIALYAPVTFAAQNIDTCEQLLDIPNRATETYMLTQDVDCNGYVQNKAIDFKGKLDGHGYRVIGLEVQYDDNYMGLFSRIIGGSVIRLGLDSMVITGNKGNTAVGLLAGNVAYDSLITDIEINASSISITESVSNGLGLLVGYVSDQSQLEGIRSYNSQIDTTDKAKHVGGLVGVLKESSLSLASVDENKISISYDLGGNVSVGGIIGTLEKSVTSDVTIESSHILADEIDRGYGAQFIGKMNKSRLVNALSINNYIEYLSAGTHWNPAIAAGYINGDFGLIPTLENIRVSSSNDFPWYNSASDVLTKDLQIMK |
You can click texts colored in the table to view more detailed information
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_ID | Protospacer_location | Mismatch | Identity |
---|
CRISPR_ID | Spacer_Info | Spacer_region | Spacer_length | Hit_phage_ID | Hit_phage_def | Protospacer_location | Mismatch | Identity |
---|---|---|---|---|---|---|---|---|
NZ_CP044398_1 | 1.1|20548|55|NZ_CP044398|CRISPRCasFinder | 20548-20602 | 55 | NZ_CP044398 | Moritella marina ATCC 15381 strain MP-1 plasmid unnamed1, complete sequence | 20548-20602 | 0 | 1.0 |
1. spacer 1.1|20548|55|NZ_CP044398|CRISPRCasFinder matches to NZ_CP044398 (Moritella marina ATCC 15381 strain MP-1 plasmid unnamed1, complete sequence) position: , mismatch: 0, identity: 1.0
aatatctgaaccctaggtggaaattaattaataatcgcaagaactaacgccttac CRISPR spacer aatatctgaaccctaggtggaaattaattaataatcgcaagaactaacgccttac Protospacer *******************************************************
Region | Region Position | Protein_number | Hit_taxonomy | Key_proteins | Att_site | Prophage annotation |
---|
Acr ID | Acr position | Acr size | Homology with known anti | Neighbor HTH/AcRanker | Neighbor Aca | In prophage | Protospacer in prophage |
---|